11/06/2021 21:16:12 - INFO - __main__ - Distributed environment: MULTI_GPU Backend: nccl Num processes: 16 Process index: 0 Local process index: 0 Device: cuda:0 Use FP16 precision: True 11/06/2021 21:16:13 - WARNING - huggingface_hub.repository - /home/leandro/codeparrot-small/./ is already a clone of https://huggingface.co/lvwerra/codeparrot-small. Make sure you pull the latest changes with `repo.git_pull()`. 11/06/2021 21:16:13 - WARNING - huggingface_hub.repository - Revision `proud-haze-135` does not exist. Created and checked out branch `proud-haze-135`. 11/06/2021 21:16:13 - WARNING - huggingface_hub.repository - 11/06/2021 21:16:15 - INFO - datasets.data_files - Some files matched the pattern '*' at /home/leandro/codeparrot-clean-train but don't have valid data file extensions: [PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/0b/f3/0bf3cd1320065c163f47a112458dc107650e3e862094b703b76073bd0b68663d'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/pre-rebase.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/fsmonitor-watchman.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/pre-applypatch.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/37/26/3726a0239b5cb7d0ef3ea36886c533d0becc7404217763015559edb546d53c94'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/e7/a9/e7a9ccbfe6bd92476f83eba205c47ed23732ace4c1bd7458d76d666ebbba3b1c'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/73/73/737327c2b47693e00050aa3410c5eb402c66211a79740ab57f1c763a1e557563'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/commit-msg.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/2a/7e/2a7e50bbdb90d6c4cec534c3f1dc7ec0e6a0dada15c07cfd94615940c632ce02'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/5a/5f/5a5fbc19e0e76787f668ada7235203c10b0cbcdea0ecf8f873f8ec281cfe3494'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/74/31/7431977a8e3a6eb0348b821009495f85d9373c1f730f4a74b0db43326568f77d'), PosixPath('/home/leandro/codeparrot-clean-train/.git/logs/refs/heads/main'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/50/38/503872def2ac44733fbefc2602ab16224caca0896aa1eba045025ef2d60efcdc'), PosixPath('/home/leandro/codeparrot-clean-train/.git/refs/heads/main'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/update.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/b6/ce/b6ce495492aedfc91b66efdfd214b2dfe44867c719d51590e1868e42f4e9b6dd'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/df/08/df0840d1657530c8fa9f82864be5999c515f54341d926c430a82528a6bb83740'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/2f/62/2f628d890bceee216f87edb3c45d2e384ee2501ce41a4c4169efaa3363bef1d2'), PosixPath('/home/leandro/codeparrot-clean-train/.git/HEAD'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/0f/7a/0f7a67cd83c1c069995f0f2510ebf818dcc71d9658f189de1231d2b7aac8883c'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/05/39/053944e1daead0b6de8e46ea2e0bc68b9247604c63a55d444ac3b9adb12e2cd2'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/dc/ac/dcacb03d8f43f7879c5eab4422644d7b3797b47dbb0c9c84d88cbc85822d8306'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/ac/e3/ace3ac440b380d604ab198cf8e838a2a375e7b0a6b5699ec74a8c79648f4bab8'), PosixPath('/home/leandro/codeparrot-clean-train/.git/description'), PosixPath('/home/leandro/codeparrot-clean-train/.git/refs/remotes/origin/HEAD'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/d4/9f/d49f1929644619c39cff677367ff2e18223a8046ec8f61e224954a10aa2ccf8f'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/2e/aa/2eaa21b832ed1496fb7f0b259666dbfc36ed483d81494d1e8705f9d601509c12'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/f1/a7/f1a7a250e1f6164a7fb602131ff54b69deb305258792f2358075403769d58fe5'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/d0/02/d0024828eece6d4d1c25cb4e539328be97fa28ce66a3b8d2374a117711cfd520'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/90/a5/90a573501de640c3e0e6f1b3508306febc96faf6061bb33c67894c168a1879c6'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/post-checkout'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/post-commit'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/5d/42/5d42ba9f195510757a3699005a7c43ddede4b598caf8a5f2f8c84d1125fa6324'), PosixPath('/home/leandro/codeparrot-clean-train/.git/logs/HEAD'), PosixPath('/home/leandro/codeparrot-clean-train/.git/packed-refs'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/5f/d1/5fd1bb56db810b65d1fd3866dc43d9c7b690c8f52b9ca8119b2a5f4c49d13eec'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/7c/0e/7c0ef87edb0e556939282c859c7c893a91b5b0f931394ca4cca4f4ec98a61951'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/ee/c1/eec1a9546aac0444a706c09f6aab67cd64403940657417e30212b7ff1e16665c'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/55/b6/55b6989a41ae296337356153e6081c61484d0b6734b6905683823e7317d01c42'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/pre-commit.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/cc/58/cc58b22515c4fd7d891287ee717c2054290b20c17b1c34693fd8964ab730687b'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/b6/8a/b68a74f9784402dcb311f4db72a873035e47b98b185a1813ab2c1645cb7255a2'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/fb/84/fb84ca8000808f62718994e4b44e79d88a05b345e9638d9f6cf6c8a5472da01f'), PosixPath('/home/leandro/codeparrot-clean-train/.git/objects/pack/pack-12438cb8112d3b4104fefcb88d751872b5e0fd6e.pack'), PosixPath('/home/leandro/codeparrot-clean-train/.git/info/exclude'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/12/8d/128d56e09d9d741b2778d733e595838a50a5e82fdc9adbb0aa8645457716b97e'), PosixPath('/home/leandro/codeparrot-clean-train/.git/objects/pack/pack-12438cb8112d3b4104fefcb88d751872b5e0fd6e.idx'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/b4/83/b4836655e350f0796acd2b1a206e657c2808d9f136afae095e0b94a790c704e1'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/3e/f2/3ef240d0b394384803ae1bbe3b30974e11eb9b1b6ad4f49afc2ed0f7c9eae0d6'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/14/08/14089cad26037080ee900bede2fd42d5cac70738b2e77402b36681e1d2a521f6'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/86/0e/860eda34e90456533e9dd41a5c0fdb74c54dc8d9cf43d6c60b887b2c858be831'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/post-update.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/pre-push'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/15/ac/15ac016e4cd702bb184457cbf5674d71b632fc34c29611ba4de549b85c67acfb'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/a4/6b/a46b5c08d39691524b46fadf78eab5efefa29978edfee799ec3587d928dc1302'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/fa/e6/fae6b44a24c1c35f15053a19a6b2b2af5cc9fb8bdaf0da409068a2a1f333f28e'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/17/5e/175e7375d6f65993071aa653bdd4e8b117cc02d1d2353cd7bcdbaaf7fe8b3c9c'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/ac/36/ac36d12d37c1dc8ee8d3b8f0eae93966ae73482ef725615bb1a715802ddd4dd4'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/17/96/1796f12729d0407cc57500c9c87959e0e7becd729f37374702868ed8765015f4'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/67/f1/67f1ff0d590fbf4aa9afa161c290fe9be17538d4b723278bb21fd6408b0e6a3e'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/55/c9/55c9c0b2f26de96e0311ee43e8eaa78ad1af387d0c59a26f22c5ebd507dda321'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/pre-push.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/config'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/9f/7e/9f7e18a3980d4b3d5ed9469ab7a2d67b608e8aa6fff38d876f86719c8f2a7a82'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/e6/48/e6484a578778beccab26c8549608ec13970e6bcdb9541cdccad20f4d984e8181'), PosixPath('/home/leandro/codeparrot-clean-train/.git/index'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/54/60/5460223b92bb118814a7777a939f4005b7426a7e4a068c193c10d1b86eeb862b'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/prepare-commit-msg.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/logs/refs/remotes/origin/HEAD'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/ae/45/ae45741df674456bc63bad91374d2ba5ef988d33d6e2a322ef0a5ac8af040371'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/60/41/604177fe5560efd99d93091fadab6293afe7cd7d12f81638c301de1c937c1583'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/ef/e1/efe1759837b74b5b5ed3df1a09d4c880f9ad20413d958f79d35bf1cb6a2a09d4'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/pre-receive.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/32/be/32beb30e381ff02fb71854b5534306f395ef00f51f02b62da1f027c8c7fab26f'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/9b/1b/9b1b8e52b9262f03f1719d3950dc8dfa2b9719dc2e273603023f6f329c1b2068'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/56/80/56803c607a19ccb576c90bdb10a02cfa7b3affc67dd150fa41b00cc22213b174'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/4e/39/4e392fcaae564652d234d07b4f71eeed90efe51b1b714831e39d77f3e537d3df'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/applypatch-msg.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/cd/33/cd339656799518495d23aedf1503459be6d3086e22672e80edab8403d12ded1c'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/f1/62/f162b06b5dca01aa85ef9a675d396c0fbab1d009b5bee1c5b7ea6b415c6f12a4'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/post-merge')] 11/06/2021 21:16:15 - WARNING - datasets.builder - Using custom data configuration codeparrot-clean-train-e839c6c1585da466 11/06/2021 21:16:15 - INFO - datasets.data_files - Some files matched the pattern '*' at /home/leandro/codeparrot-clean-valid but don't have valid data file extensions: [PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/60/0dc2964cf471fa4aac706659009777cf176497'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/commit-msg.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/refs/remotes/origin/HEAD'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/95/7b2579c6ef20995a09efd9a17f8fd90606f5ed'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/pre-push.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/refs/heads/main'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/config'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/info/exclude'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/applypatch-msg.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/pre-commit.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/update.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/logs/refs/heads/main'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/HEAD'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/index'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/25/747fcf966f2b7b3a2f4149130bff69ebe83718'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/15/4f5f07c68026fb069c4bdfe3966893737035f4'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/6d/d1188965fcd7feab0efc3506668a615805e13f'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/description'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/post-merge'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/prepare-commit-msg.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/d9/cd7ad451bcd8a388471b341a961d0e6e6ff558'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/pre-rebase.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/55/36bbd68dd8f283092b22eb77a051175c1b727a'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/lfs/objects/7f/8c/7f8c20a737c9084779bcdb853325ad4774d0db52c74aa2a63fd658d6787eb35b'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/fsmonitor-watchman.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/pre-applypatch.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/post-update.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/09/e6a70d1aadc53ed29b9890332f184f89d0a39b'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/logs/refs/remotes/origin/HEAD'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/5e/d5325308cb9a07b2c5807dad51120c9a75b6db'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/pre-push'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/post-checkout'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/post-commit'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/packed-refs'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/logs/HEAD'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/pre-receive.sample')] 11/06/2021 21:16:15 - WARNING - datasets.builder - Using custom data configuration codeparrot-clean-valid-ced470bd23403144 11/06/2021 21:16:43 - INFO - __main__ - Step 1: {'lr': 0.0, 'samples': 192, 'steps': 0, 'loss/train': 10.55798625946045} 11/06/2021 21:16:43 - INFO - root - Reducer buckets have been rebuilt in this iteration. 11/06/2021 21:16:43 - INFO - __main__ - Step 2: {'lr': 2.5e-07, 'samples': 384, 'steps': 1, 'loss/train': 10.535750389099121} 11/06/2021 21:16:43 - INFO - __main__ - Step 3: {'lr': 5e-07, 'samples': 576, 'steps': 2, 'loss/train': 10.530282974243164} 11/06/2021 21:16:44 - INFO - __main__ - Step 4: {'lr': 7.5e-07, 'samples': 768, 'steps': 3, 'loss/train': 10.527787208557129} 11/06/2021 21:16:45 - INFO - __main__ - Step 5: {'lr': 1e-06, 'samples': 960, 'steps': 4, 'loss/train': 10.491048812866211} 11/06/2021 21:16:46 - INFO - __main__ - Step 6: {'lr': 1.25e-06, 'samples': 1152, 'steps': 5, 'loss/train': 10.409588813781738} 11/06/2021 21:16:46 - INFO - __main__ - Step 7: {'lr': 1.5e-06, 'samples': 1344, 'steps': 6, 'loss/train': 10.350728034973145} 11/06/2021 21:16:46 - INFO - __main__ - Step 8: {'lr': 1.75e-06, 'samples': 1536, 'steps': 7, 'loss/train': 10.252238273620605} 11/06/2021 21:16:47 - INFO - __main__ - Step 9: {'lr': 2e-06, 'samples': 1728, 'steps': 8, 'loss/train': 10.193534851074219} 11/06/2021 21:16:47 - INFO - __main__ - Step 10: {'lr': 2.25e-06, 'samples': 1920, 'steps': 9, 'loss/train': 9.953790664672852} 11/06/2021 21:16:48 - INFO - __main__ - Step 11: {'lr': 2.5e-06, 'samples': 2112, 'steps': 10, 'loss/train': 10.194929122924805} 11/06/2021 21:16:49 - INFO - __main__ - Step 12: {'lr': 2.75e-06, 'samples': 2304, 'steps': 11, 'loss/train': 9.89802074432373} 11/06/2021 21:16:49 - INFO - __main__ - Step 13: {'lr': 3e-06, 'samples': 2496, 'steps': 12, 'loss/train': 9.843729972839355} 11/06/2021 21:16:49 - INFO - __main__ - Step 14: {'lr': 3.25e-06, 'samples': 2688, 'steps': 13, 'loss/train': 9.845044136047363} 11/06/2021 21:16:50 - INFO - __main__ - Step 15: {'lr': 3.5e-06, 'samples': 2880, 'steps': 14, 'loss/train': 9.869210243225098} 11/06/2021 21:16:51 - INFO - __main__ - Step 16: {'lr': 3.75e-06, 'samples': 3072, 'steps': 15, 'loss/train': 9.587459564208984} 11/06/2021 21:16:51 - INFO - __main__ - Step 17: {'lr': 4e-06, 'samples': 3264, 'steps': 16, 'loss/train': 9.667202949523926} 11/06/2021 21:16:51 - INFO - __main__ - Step 18: {'lr': 4.250000000000001e-06, 'samples': 3456, 'steps': 17, 'loss/train': 9.495230674743652} 11/06/2021 21:16:52 - INFO - __main__ - Step 19: {'lr': 4.5e-06, 'samples': 3648, 'steps': 18, 'loss/train': 9.640376091003418} 11/06/2021 21:16:52 - INFO - __main__ - Step 20: {'lr': 4.75e-06, 'samples': 3840, 'steps': 19, 'loss/train': 9.428448677062988} 11/06/2021 21:16:52 - INFO - __main__ - Step 21: {'lr': 5e-06, 'samples': 4032, 'steps': 20, 'loss/train': 9.341026306152344} 11/06/2021 21:16:54 - INFO - __main__ - Step 22: {'lr': 5.2500000000000006e-06, 'samples': 4224, 'steps': 21, 'loss/train': 9.372577667236328} 11/06/2021 21:16:54 - INFO - __main__ - Step 23: {'lr': 5.5e-06, 'samples': 4416, 'steps': 22, 'loss/train': 8.967851638793945} 11/06/2021 21:16:54 - INFO - __main__ - Step 24: {'lr': 5.75e-06, 'samples': 4608, 'steps': 23, 'loss/train': 8.74506950378418} 11/06/2021 21:16:55 - INFO - __main__ - Step 25: {'lr': 6e-06, 'samples': 4800, 'steps': 24, 'loss/train': 9.786674499511719} 11/06/2021 21:16:55 - INFO - __main__ - Step 26: {'lr': 6.25e-06, 'samples': 4992, 'steps': 25, 'loss/train': 9.504456520080566} 11/06/2021 21:16:56 - INFO - __main__ - Step 27: {'lr': 6.5e-06, 'samples': 5184, 'steps': 26, 'loss/train': 9.166744232177734} 11/06/2021 21:16:56 - INFO - __main__ - Step 28: {'lr': 6.75e-06, 'samples': 5376, 'steps': 27, 'loss/train': 8.682860374450684} 11/06/2021 21:16:57 - INFO - __main__ - Step 29: {'lr': 7e-06, 'samples': 5568, 'steps': 28, 'loss/train': 8.596318244934082} 11/06/2021 21:16:57 - INFO - __main__ - Step 30: {'lr': 7.250000000000001e-06, 'samples': 5760, 'steps': 29, 'loss/train': 9.048979759216309} 11/06/2021 21:16:57 - INFO - __main__ - Step 31: {'lr': 7.5e-06, 'samples': 5952, 'steps': 30, 'loss/train': 9.320890426635742} 11/06/2021 21:16:58 - INFO - __main__ - Step 32: {'lr': 7.75e-06, 'samples': 6144, 'steps': 31, 'loss/train': 8.952228546142578} 11/06/2021 21:16:59 - INFO - __main__ - Step 33: {'lr': 8e-06, 'samples': 6336, 'steps': 32, 'loss/train': 8.751225471496582} 11/06/2021 21:16:59 - INFO - __main__ - Step 34: {'lr': 8.25e-06, 'samples': 6528, 'steps': 33, 'loss/train': 9.156981468200684} 11/06/2021 21:17:00 - INFO - __main__ - Step 35: {'lr': 8.500000000000002e-06, 'samples': 6720, 'steps': 34, 'loss/train': 8.837956428527832} 11/06/2021 21:17:00 - INFO - __main__ - Step 36: {'lr': 8.750000000000001e-06, 'samples': 6912, 'steps': 35, 'loss/train': 8.935142517089844} 11/06/2021 21:17:01 - INFO - __main__ - Step 37: {'lr': 9e-06, 'samples': 7104, 'steps': 36, 'loss/train': 9.019933700561523} 11/06/2021 21:17:02 - INFO - __main__ - Step 38: {'lr': 9.25e-06, 'samples': 7296, 'steps': 37, 'loss/train': 8.594483375549316} 11/06/2021 21:17:02 - INFO - __main__ - Step 39: {'lr': 9.5e-06, 'samples': 7488, 'steps': 38, 'loss/train': 9.565625190734863} 11/06/2021 21:17:02 - INFO - __main__ - Step 40: {'lr': 9.75e-06, 'samples': 7680, 'steps': 39, 'loss/train': 9.195219039916992} 11/06/2021 21:17:03 - INFO - __main__ - Step 41: {'lr': 1e-05, 'samples': 7872, 'steps': 40, 'loss/train': 9.008049011230469} 11/06/2021 21:17:04 - INFO - __main__ - Step 42: {'lr': 1.025e-05, 'samples': 8064, 'steps': 41, 'loss/train': 9.54212760925293} 11/06/2021 21:17:04 - INFO - __main__ - Step 43: {'lr': 1.0500000000000001e-05, 'samples': 8256, 'steps': 42, 'loss/train': 9.074606895446777} 11/06/2021 21:17:04 - INFO - __main__ - Step 44: {'lr': 1.0749999999999999e-05, 'samples': 8448, 'steps': 43, 'loss/train': 9.575305938720703} 11/06/2021 21:17:05 - INFO - __main__ - Step 45: {'lr': 1.1e-05, 'samples': 8640, 'steps': 44, 'loss/train': 9.862631797790527} 11/06/2021 21:17:05 - INFO - __main__ - Step 46: {'lr': 1.1249999999999999e-05, 'samples': 8832, 'steps': 45, 'loss/train': 8.833338737487793} 11/06/2021 21:17:06 - INFO - __main__ - Step 47: {'lr': 1.15e-05, 'samples': 9024, 'steps': 46, 'loss/train': 8.830769538879395} 11/06/2021 21:17:07 - INFO - __main__ - Step 48: {'lr': 1.1750000000000001e-05, 'samples': 9216, 'steps': 47, 'loss/train': 8.828520774841309} 11/06/2021 21:17:07 - INFO - __main__ - Step 49: {'lr': 1.2e-05, 'samples': 9408, 'steps': 48, 'loss/train': 8.692312240600586} 11/06/2021 21:17:07 - INFO - __main__ - Step 50: {'lr': 1.2250000000000001e-05, 'samples': 9600, 'steps': 49, 'loss/train': 8.698874473571777} 11/06/2021 21:17:08 - INFO - __main__ - Step 51: {'lr': 1.25e-05, 'samples': 9792, 'steps': 50, 'loss/train': 8.904641151428223} 11/06/2021 21:17:09 - INFO - __main__ - Step 52: {'lr': 1.275e-05, 'samples': 9984, 'steps': 51, 'loss/train': 8.66476821899414} 11/06/2021 21:17:09 - INFO - __main__ - Step 53: {'lr': 1.3e-05, 'samples': 10176, 'steps': 52, 'loss/train': 8.561541557312012} 11/06/2021 21:17:10 - INFO - __main__ - Step 54: {'lr': 1.325e-05, 'samples': 10368, 'steps': 53, 'loss/train': 8.71354866027832} 11/06/2021 21:17:10 - INFO - __main__ - Step 55: {'lr': 1.35e-05, 'samples': 10560, 'steps': 54, 'loss/train': 8.084650993347168} 11/06/2021 21:17:10 - INFO - __main__ - Step 56: {'lr': 1.375e-05, 'samples': 10752, 'steps': 55, 'loss/train': 8.701323509216309} 11/06/2021 21:17:11 - INFO - __main__ - Step 57: {'lr': 1.4e-05, 'samples': 10944, 'steps': 56, 'loss/train': 8.886054039001465} 11/06/2021 21:17:12 - INFO - __main__ - Step 58: {'lr': 1.425e-05, 'samples': 11136, 'steps': 57, 'loss/train': 8.962408065795898} 11/06/2021 21:17:12 - INFO - __main__ - Step 59: {'lr': 1.4500000000000002e-05, 'samples': 11328, 'steps': 58, 'loss/train': 8.731340408325195} 11/06/2021 21:17:13 - INFO - __main__ - Step 60: {'lr': 1.475e-05, 'samples': 11520, 'steps': 59, 'loss/train': 8.48225212097168} 11/06/2021 21:17:13 - INFO - __main__ - Step 61: {'lr': 1.5e-05, 'samples': 11712, 'steps': 60, 'loss/train': 8.860502243041992} 11/06/2021 21:17:14 - INFO - __main__ - Step 62: {'lr': 1.525e-05, 'samples': 11904, 'steps': 61, 'loss/train': 8.848859786987305} 11/06/2021 21:17:15 - INFO - __main__ - Step 63: {'lr': 1.55e-05, 'samples': 12096, 'steps': 62, 'loss/train': 8.20711612701416} 11/06/2021 21:17:15 - INFO - __main__ - Step 64: {'lr': 1.575e-05, 'samples': 12288, 'steps': 63, 'loss/train': 10.296394348144531} 11/06/2021 21:17:15 - INFO - __main__ - Step 65: {'lr': 1.6e-05, 'samples': 12480, 'steps': 64, 'loss/train': 7.71311092376709} 11/06/2021 21:17:16 - INFO - __main__ - Step 66: {'lr': 1.6250000000000002e-05, 'samples': 12672, 'steps': 65, 'loss/train': 8.466562271118164} 11/06/2021 21:17:16 - INFO - __main__ - Step 67: {'lr': 1.65e-05, 'samples': 12864, 'steps': 66, 'loss/train': 8.35257339477539} 11/06/2021 21:17:17 - INFO - __main__ - Step 68: {'lr': 1.675e-05, 'samples': 13056, 'steps': 67, 'loss/train': 8.386396408081055} 11/06/2021 21:17:18 - INFO - __main__ - Step 69: {'lr': 1.7000000000000003e-05, 'samples': 13248, 'steps': 68, 'loss/train': 8.12002944946289} 11/06/2021 21:17:18 - INFO - __main__ - Step 70: {'lr': 1.7250000000000003e-05, 'samples': 13440, 'steps': 69, 'loss/train': 8.70462417602539} 11/06/2021 21:17:18 - INFO - __main__ - Step 71: {'lr': 1.7500000000000002e-05, 'samples': 13632, 'steps': 70, 'loss/train': 8.239697456359863} 11/06/2021 21:17:19 - INFO - __main__ - Step 72: {'lr': 1.7749999999999998e-05, 'samples': 13824, 'steps': 71, 'loss/train': 7.610179424285889} 11/06/2021 21:17:20 - INFO - __main__ - Step 73: {'lr': 1.8e-05, 'samples': 14016, 'steps': 72, 'loss/train': 7.8869452476501465} 11/06/2021 21:17:20 - INFO - __main__ - Step 74: {'lr': 1.825e-05, 'samples': 14208, 'steps': 73, 'loss/train': 7.692283630371094} 11/06/2021 21:17:21 - INFO - __main__ - Step 75: {'lr': 1.85e-05, 'samples': 14400, 'steps': 74, 'loss/train': 8.208292007446289} 11/06/2021 21:17:21 - INFO - __main__ - Step 76: {'lr': 1.875e-05, 'samples': 14592, 'steps': 75, 'loss/train': 7.97852897644043} 11/06/2021 21:17:21 - INFO - __main__ - Step 77: {'lr': 1.9e-05, 'samples': 14784, 'steps': 76, 'loss/train': 8.777739524841309} 11/06/2021 21:17:22 - INFO - __main__ - Step 78: {'lr': 1.925e-05, 'samples': 14976, 'steps': 77, 'loss/train': 7.68981409072876} 11/06/2021 21:17:23 - INFO - __main__ - Step 79: {'lr': 1.95e-05, 'samples': 15168, 'steps': 78, 'loss/train': 7.656458854675293} 11/06/2021 21:17:23 - INFO - __main__ - Step 80: {'lr': 1.975e-05, 'samples': 15360, 'steps': 79, 'loss/train': 8.30695915222168} 11/06/2021 21:17:23 - INFO - __main__ - Step 81: {'lr': 2e-05, 'samples': 15552, 'steps': 80, 'loss/train': 7.897383689880371} 11/06/2021 21:17:24 - INFO - __main__ - Step 82: {'lr': 2.025e-05, 'samples': 15744, 'steps': 81, 'loss/train': 8.267080307006836} 11/06/2021 21:17:25 - INFO - __main__ - Step 83: {'lr': 2.05e-05, 'samples': 15936, 'steps': 82, 'loss/train': 8.247127532958984} 11/06/2021 21:17:25 - INFO - __main__ - Step 84: {'lr': 2.0750000000000003e-05, 'samples': 16128, 'steps': 83, 'loss/train': 8.18776798248291} 11/06/2021 21:17:25 - INFO - __main__ - Step 85: {'lr': 2.1000000000000002e-05, 'samples': 16320, 'steps': 84, 'loss/train': 7.7213358879089355} 11/06/2021 21:17:26 - INFO - __main__ - Step 86: {'lr': 2.125e-05, 'samples': 16512, 'steps': 85, 'loss/train': 7.880347728729248} 11/06/2021 21:17:26 - INFO - __main__ - Step 87: {'lr': 2.1499999999999997e-05, 'samples': 16704, 'steps': 86, 'loss/train': 8.094650268554688} 11/06/2021 21:17:27 - INFO - __main__ - Step 88: {'lr': 2.175e-05, 'samples': 16896, 'steps': 87, 'loss/train': 7.1267242431640625} 11/06/2021 21:17:27 - INFO - __main__ - Step 89: {'lr': 2.2e-05, 'samples': 17088, 'steps': 88, 'loss/train': 7.992337703704834} 11/06/2021 21:17:28 - INFO - __main__ - Step 90: {'lr': 2.225e-05, 'samples': 17280, 'steps': 89, 'loss/train': 7.918111801147461} 11/06/2021 21:17:28 - INFO - __main__ - Step 91: {'lr': 2.2499999999999998e-05, 'samples': 17472, 'steps': 90, 'loss/train': 7.4610795974731445} 11/06/2021 21:17:29 - INFO - __main__ - Step 92: {'lr': 2.275e-05, 'samples': 17664, 'steps': 91, 'loss/train': 7.861474990844727} 11/06/2021 21:17:29 - INFO - __main__ - Step 93: {'lr': 2.3e-05, 'samples': 17856, 'steps': 92, 'loss/train': 7.493942737579346} 11/06/2021 21:17:30 - INFO - __main__ - Step 94: {'lr': 2.325e-05, 'samples': 18048, 'steps': 93, 'loss/train': 8.051450729370117} 11/06/2021 21:17:30 - INFO - __main__ - Step 95: {'lr': 2.3500000000000002e-05, 'samples': 18240, 'steps': 94, 'loss/train': 7.800515651702881} 11/06/2021 21:17:31 - INFO - __main__ - Step 96: {'lr': 2.375e-05, 'samples': 18432, 'steps': 95, 'loss/train': 7.40056037902832} 11/06/2021 21:17:31 - INFO - __main__ - Step 97: {'lr': 2.4e-05, 'samples': 18624, 'steps': 96, 'loss/train': 7.518774032592773} 11/06/2021 21:17:31 - INFO - __main__ - Step 98: {'lr': 2.425e-05, 'samples': 18816, 'steps': 97, 'loss/train': 7.735995292663574} 11/06/2021 21:17:32 - INFO - __main__ - Step 99: {'lr': 2.4500000000000003e-05, 'samples': 19008, 'steps': 98, 'loss/train': 7.60399866104126} 11/06/2021 21:17:33 - INFO - __main__ - Step 100: {'lr': 2.4750000000000002e-05, 'samples': 19200, 'steps': 99, 'loss/train': 7.875792980194092} 11/06/2021 21:17:33 - INFO - __main__ - Step 101: {'lr': 2.5e-05, 'samples': 19392, 'steps': 100, 'loss/train': 7.704280853271484} 11/06/2021 21:17:34 - INFO - __main__ - Step 102: {'lr': 2.525e-05, 'samples': 19584, 'steps': 101, 'loss/train': 7.629642009735107} 11/06/2021 21:17:34 - INFO - __main__ - Step 103: {'lr': 2.55e-05, 'samples': 19776, 'steps': 102, 'loss/train': 7.427465915679932} 11/06/2021 21:17:35 - INFO - __main__ - Step 104: {'lr': 2.575e-05, 'samples': 19968, 'steps': 103, 'loss/train': 7.325517654418945} 11/06/2021 21:17:35 - INFO - __main__ - Step 105: {'lr': 2.6e-05, 'samples': 20160, 'steps': 104, 'loss/train': 7.825165271759033} 11/06/2021 21:17:36 - INFO - __main__ - Step 106: {'lr': 2.625e-05, 'samples': 20352, 'steps': 105, 'loss/train': 7.557342529296875} 11/06/2021 21:17:36 - INFO - __main__ - Step 107: {'lr': 2.65e-05, 'samples': 20544, 'steps': 106, 'loss/train': 7.5471510887146} 11/06/2021 21:17:36 - INFO - __main__ - Step 108: {'lr': 2.675e-05, 'samples': 20736, 'steps': 107, 'loss/train': 7.51965856552124} 11/06/2021 21:17:37 - INFO - __main__ - Step 109: {'lr': 2.7e-05, 'samples': 20928, 'steps': 108, 'loss/train': 7.727786540985107} 11/06/2021 21:17:38 - INFO - __main__ - Step 110: {'lr': 2.725e-05, 'samples': 21120, 'steps': 109, 'loss/train': 8.008770942687988} 11/06/2021 21:17:38 - INFO - __main__ - Step 111: {'lr': 2.75e-05, 'samples': 21312, 'steps': 110, 'loss/train': 7.874051094055176} 11/06/2021 21:17:38 - INFO - __main__ - Step 112: {'lr': 2.775e-05, 'samples': 21504, 'steps': 111, 'loss/train': 7.637380599975586} 11/06/2021 21:17:39 - INFO - __main__ - Step 113: {'lr': 2.8e-05, 'samples': 21696, 'steps': 112, 'loss/train': 7.677240371704102} 11/06/2021 21:17:40 - INFO - __main__ - Step 114: {'lr': 2.8250000000000002e-05, 'samples': 21888, 'steps': 113, 'loss/train': 7.03343391418457} 11/06/2021 21:17:40 - INFO - __main__ - Step 115: {'lr': 2.85e-05, 'samples': 22080, 'steps': 114, 'loss/train': 7.615724563598633} 11/06/2021 21:17:41 - INFO - __main__ - Step 116: {'lr': 2.875e-05, 'samples': 22272, 'steps': 115, 'loss/train': 7.741561412811279} 11/06/2021 21:17:41 - INFO - __main__ - Step 117: {'lr': 2.9000000000000004e-05, 'samples': 22464, 'steps': 116, 'loss/train': 8.360411643981934} 11/06/2021 21:17:41 - INFO - __main__ - Step 118: {'lr': 2.9250000000000003e-05, 'samples': 22656, 'steps': 117, 'loss/train': 7.302130222320557} 11/06/2021 21:17:42 - INFO - __main__ - Step 119: {'lr': 2.95e-05, 'samples': 22848, 'steps': 118, 'loss/train': 7.071781158447266} 11/06/2021 21:17:43 - INFO - __main__ - Step 120: {'lr': 2.9749999999999998e-05, 'samples': 23040, 'steps': 119, 'loss/train': 7.1986308097839355} 11/06/2021 21:17:43 - INFO - __main__ - Step 121: {'lr': 3e-05, 'samples': 23232, 'steps': 120, 'loss/train': 6.897414684295654} 11/06/2021 21:17:43 - INFO - __main__ - Step 122: {'lr': 3.025e-05, 'samples': 23424, 'steps': 121, 'loss/train': 6.9648213386535645} 11/06/2021 21:17:44 - INFO - __main__ - Step 123: {'lr': 3.05e-05, 'samples': 23616, 'steps': 122, 'loss/train': 7.184485912322998} 11/06/2021 21:17:45 - INFO - __main__ - Step 124: {'lr': 3.075e-05, 'samples': 23808, 'steps': 123, 'loss/train': 6.945891857147217} 11/06/2021 21:17:45 - INFO - __main__ - Step 125: {'lr': 3.1e-05, 'samples': 24000, 'steps': 124, 'loss/train': 7.381505489349365} 11/06/2021 21:17:46 - INFO - __main__ - Step 126: {'lr': 3.125e-05, 'samples': 24192, 'steps': 125, 'loss/train': 7.457914352416992} 11/06/2021 21:17:46 - INFO - __main__ - Step 127: {'lr': 3.15e-05, 'samples': 24384, 'steps': 126, 'loss/train': 7.405432224273682} 11/06/2021 21:17:47 - INFO - __main__ - Step 128: {'lr': 3.175e-05, 'samples': 24576, 'steps': 127, 'loss/train': 7.172966003417969} 11/06/2021 21:17:48 - INFO - __main__ - Step 129: {'lr': 3.2e-05, 'samples': 24768, 'steps': 128, 'loss/train': 7.367163181304932} 11/06/2021 21:17:48 - INFO - __main__ - Step 130: {'lr': 3.2250000000000005e-05, 'samples': 24960, 'steps': 129, 'loss/train': 6.170543670654297} 11/06/2021 21:17:48 - INFO - __main__ - Step 131: {'lr': 3.2500000000000004e-05, 'samples': 25152, 'steps': 130, 'loss/train': 6.940507411956787} 11/06/2021 21:17:49 - INFO - __main__ - Step 132: {'lr': 3.275e-05, 'samples': 25344, 'steps': 131, 'loss/train': 7.2779364585876465} 11/06/2021 21:17:49 - INFO - __main__ - Step 133: {'lr': 3.3e-05, 'samples': 25536, 'steps': 132, 'loss/train': 6.9434428215026855} 11/06/2021 21:17:49 - INFO - __main__ - Step 134: {'lr': 3.325e-05, 'samples': 25728, 'steps': 133, 'loss/train': 7.035802841186523} 11/06/2021 21:17:50 - INFO - __main__ - Step 135: {'lr': 3.35e-05, 'samples': 25920, 'steps': 134, 'loss/train': 8.246500015258789} 11/06/2021 21:17:51 - INFO - __main__ - Step 136: {'lr': 3.375e-05, 'samples': 26112, 'steps': 135, 'loss/train': 6.835116386413574} 11/06/2021 21:17:51 - INFO - __main__ - Step 137: {'lr': 3.4000000000000007e-05, 'samples': 26304, 'steps': 136, 'loss/train': 6.883285999298096} 11/06/2021 21:17:51 - INFO - __main__ - Step 138: {'lr': 3.4250000000000006e-05, 'samples': 26496, 'steps': 137, 'loss/train': 7.106326103210449} 11/06/2021 21:17:52 - INFO - __main__ - Step 139: {'lr': 3.4500000000000005e-05, 'samples': 26688, 'steps': 138, 'loss/train': 7.33680534362793} 11/06/2021 21:17:53 - INFO - __main__ - Step 140: {'lr': 3.4750000000000004e-05, 'samples': 26880, 'steps': 139, 'loss/train': 6.972182273864746} 11/06/2021 21:17:53 - INFO - __main__ - Step 141: {'lr': 3.5000000000000004e-05, 'samples': 27072, 'steps': 140, 'loss/train': 6.676812171936035} 11/06/2021 21:17:54 - INFO - __main__ - Step 142: {'lr': 3.5249999999999996e-05, 'samples': 27264, 'steps': 141, 'loss/train': 7.476287364959717} 11/06/2021 21:17:54 - INFO - __main__ - Step 143: {'lr': 3.5499999999999996e-05, 'samples': 27456, 'steps': 142, 'loss/train': 6.697681903839111} 11/06/2021 21:17:54 - INFO - __main__ - Step 144: {'lr': 3.5749999999999995e-05, 'samples': 27648, 'steps': 143, 'loss/train': 6.98452091217041} 11/06/2021 21:17:55 - INFO - __main__ - Step 145: {'lr': 3.6e-05, 'samples': 27840, 'steps': 144, 'loss/train': 6.702653408050537} 11/06/2021 21:17:56 - INFO - __main__ - Step 146: {'lr': 3.625e-05, 'samples': 28032, 'steps': 145, 'loss/train': 7.03615140914917} 11/06/2021 21:17:56 - INFO - __main__ - Step 147: {'lr': 3.65e-05, 'samples': 28224, 'steps': 146, 'loss/train': 6.963271617889404} 11/06/2021 21:17:56 - INFO - __main__ - Step 148: {'lr': 3.675e-05, 'samples': 28416, 'steps': 147, 'loss/train': 6.389257907867432} 11/06/2021 21:17:57 - INFO - __main__ - Step 149: {'lr': 3.7e-05, 'samples': 28608, 'steps': 148, 'loss/train': 6.685713768005371} 11/06/2021 21:17:58 - INFO - __main__ - Step 150: {'lr': 3.725e-05, 'samples': 28800, 'steps': 149, 'loss/train': 6.8497772216796875} 11/06/2021 21:17:58 - INFO - __main__ - Step 151: {'lr': 3.75e-05, 'samples': 28992, 'steps': 150, 'loss/train': 6.36531925201416} 11/06/2021 21:17:59 - INFO - __main__ - Step 152: {'lr': 3.775e-05, 'samples': 29184, 'steps': 151, 'loss/train': 6.523715019226074} 11/06/2021 21:17:59 - INFO - __main__ - Step 153: {'lr': 3.8e-05, 'samples': 29376, 'steps': 152, 'loss/train': 7.334853172302246} 11/06/2021 21:17:59 - INFO - __main__ - Step 154: {'lr': 3.825e-05, 'samples': 29568, 'steps': 153, 'loss/train': 6.311091423034668} 11/06/2021 21:18:00 - INFO - __main__ - Step 155: {'lr': 3.85e-05, 'samples': 29760, 'steps': 154, 'loss/train': 6.930693626403809} 11/06/2021 21:18:01 - INFO - __main__ - Step 156: {'lr': 3.875e-05, 'samples': 29952, 'steps': 155, 'loss/train': 7.10976505279541} 11/06/2021 21:18:01 - INFO - __main__ - Step 157: {'lr': 3.9e-05, 'samples': 30144, 'steps': 156, 'loss/train': 6.257956504821777} 11/06/2021 21:18:01 - INFO - __main__ - Step 158: {'lr': 3.925e-05, 'samples': 30336, 'steps': 157, 'loss/train': 6.675495624542236} 11/06/2021 21:18:02 - INFO - __main__ - Step 159: {'lr': 3.95e-05, 'samples': 30528, 'steps': 158, 'loss/train': 6.756046295166016} 11/06/2021 21:18:03 - INFO - __main__ - Step 160: {'lr': 3.9750000000000004e-05, 'samples': 30720, 'steps': 159, 'loss/train': 7.092185020446777} 11/06/2021 21:18:03 - INFO - __main__ - Step 161: {'lr': 4e-05, 'samples': 30912, 'steps': 160, 'loss/train': 6.609521389007568} 11/06/2021 21:18:04 - INFO - __main__ - Step 162: {'lr': 4.025e-05, 'samples': 31104, 'steps': 161, 'loss/train': 6.6135945320129395} 11/06/2021 21:18:04 - INFO - __main__ - Step 163: {'lr': 4.05e-05, 'samples': 31296, 'steps': 162, 'loss/train': 6.731355667114258} 11/06/2021 21:18:04 - INFO - __main__ - Step 164: {'lr': 4.075e-05, 'samples': 31488, 'steps': 163, 'loss/train': 6.836953163146973} 11/06/2021 21:18:05 - INFO - __main__ - Step 165: {'lr': 4.1e-05, 'samples': 31680, 'steps': 164, 'loss/train': 6.64618444442749} 11/06/2021 21:18:06 - INFO - __main__ - Step 166: {'lr': 4.125e-05, 'samples': 31872, 'steps': 165, 'loss/train': 6.482452869415283} 11/06/2021 21:18:06 - INFO - __main__ - Step 167: {'lr': 4.1500000000000006e-05, 'samples': 32064, 'steps': 166, 'loss/train': 6.185523509979248} 11/06/2021 21:18:06 - INFO - __main__ - Step 168: {'lr': 4.1750000000000005e-05, 'samples': 32256, 'steps': 167, 'loss/train': 6.377339839935303} 11/06/2021 21:18:07 - INFO - __main__ - Step 169: {'lr': 4.2000000000000004e-05, 'samples': 32448, 'steps': 168, 'loss/train': 6.502751350402832} 11/06/2021 21:18:08 - INFO - __main__ - Step 170: {'lr': 4.2250000000000004e-05, 'samples': 32640, 'steps': 169, 'loss/train': 7.073266506195068} 11/06/2021 21:18:08 - INFO - __main__ - Step 171: {'lr': 4.25e-05, 'samples': 32832, 'steps': 170, 'loss/train': 6.543142318725586} 11/06/2021 21:18:08 - INFO - __main__ - Step 172: {'lr': 4.275e-05, 'samples': 33024, 'steps': 171, 'loss/train': 6.489956378936768} 11/06/2021 21:18:09 - INFO - __main__ - Step 173: {'lr': 4.2999999999999995e-05, 'samples': 33216, 'steps': 172, 'loss/train': 6.199307918548584} 11/06/2021 21:18:09 - INFO - __main__ - Step 174: {'lr': 4.325e-05, 'samples': 33408, 'steps': 173, 'loss/train': 6.356565475463867} 11/06/2021 21:18:10 - INFO - __main__ - Step 175: {'lr': 4.35e-05, 'samples': 33600, 'steps': 174, 'loss/train': 5.826794147491455} 11/06/2021 21:18:11 - INFO - __main__ - Step 176: {'lr': 4.375e-05, 'samples': 33792, 'steps': 175, 'loss/train': 7.043330192565918} 11/06/2021 21:18:11 - INFO - __main__ - Step 177: {'lr': 4.4e-05, 'samples': 33984, 'steps': 176, 'loss/train': 6.09688663482666} 11/06/2021 21:18:11 - INFO - __main__ - Step 178: {'lr': 4.425e-05, 'samples': 34176, 'steps': 177, 'loss/train': 6.213244915008545} 11/06/2021 21:18:12 - INFO - __main__ - Step 179: {'lr': 4.45e-05, 'samples': 34368, 'steps': 178, 'loss/train': 6.990470886230469} 11/06/2021 21:18:13 - INFO - __main__ - Step 180: {'lr': 4.475e-05, 'samples': 34560, 'steps': 179, 'loss/train': 6.256852149963379} 11/06/2021 21:18:13 - INFO - __main__ - Step 181: {'lr': 4.4999999999999996e-05, 'samples': 34752, 'steps': 180, 'loss/train': 7.177120685577393} 11/06/2021 21:18:13 - INFO - __main__ - Step 182: {'lr': 4.525e-05, 'samples': 34944, 'steps': 181, 'loss/train': 6.373216152191162} 11/06/2021 21:18:14 - INFO - __main__ - Step 183: {'lr': 4.55e-05, 'samples': 35136, 'steps': 182, 'loss/train': 7.0793328285217285} 11/06/2021 21:18:14 - INFO - __main__ - Step 184: {'lr': 4.575e-05, 'samples': 35328, 'steps': 183, 'loss/train': 6.392989158630371} 11/06/2021 21:18:15 - INFO - __main__ - Step 185: {'lr': 4.6e-05, 'samples': 35520, 'steps': 184, 'loss/train': 8.500471115112305} 11/06/2021 21:18:16 - INFO - __main__ - Step 186: {'lr': 4.625e-05, 'samples': 35712, 'steps': 185, 'loss/train': 6.313254356384277} 11/06/2021 21:18:16 - INFO - __main__ - Step 187: {'lr': 4.65e-05, 'samples': 35904, 'steps': 186, 'loss/train': 6.720587253570557} 11/06/2021 21:18:16 - INFO - __main__ - Step 188: {'lr': 4.675e-05, 'samples': 36096, 'steps': 187, 'loss/train': 5.526083469390869} 11/06/2021 21:18:17 - INFO - __main__ - Step 189: {'lr': 4.7000000000000004e-05, 'samples': 36288, 'steps': 188, 'loss/train': 5.93545389175415} 11/06/2021 21:18:17 - INFO - __main__ - Step 190: {'lr': 4.725e-05, 'samples': 36480, 'steps': 189, 'loss/train': 7.147458553314209} 11/06/2021 21:18:18 - INFO - __main__ - Step 191: {'lr': 4.75e-05, 'samples': 36672, 'steps': 190, 'loss/train': 6.479673385620117} 11/06/2021 21:18:18 - INFO - __main__ - Step 192: {'lr': 4.775e-05, 'samples': 36864, 'steps': 191, 'loss/train': 6.075639247894287} 11/06/2021 21:18:19 - INFO - __main__ - Step 193: {'lr': 4.8e-05, 'samples': 37056, 'steps': 192, 'loss/train': 6.69281005859375} 11/06/2021 21:18:19 - INFO - __main__ - Step 194: {'lr': 4.825e-05, 'samples': 37248, 'steps': 193, 'loss/train': 6.355683326721191} 11/06/2021 21:18:20 - INFO - __main__ - Step 195: {'lr': 4.85e-05, 'samples': 37440, 'steps': 194, 'loss/train': 6.180767059326172} 11/06/2021 21:18:20 - INFO - __main__ - Step 196: {'lr': 4.8750000000000006e-05, 'samples': 37632, 'steps': 195, 'loss/train': 5.40876579284668} 11/06/2021 21:18:21 - INFO - __main__ - Step 197: {'lr': 4.9000000000000005e-05, 'samples': 37824, 'steps': 196, 'loss/train': 7.284364700317383} 11/06/2021 21:18:21 - INFO - __main__ - Step 198: {'lr': 4.9250000000000004e-05, 'samples': 38016, 'steps': 197, 'loss/train': 6.532893180847168} 11/06/2021 21:18:22 - INFO - __main__ - Step 199: {'lr': 4.9500000000000004e-05, 'samples': 38208, 'steps': 198, 'loss/train': 6.542242527008057} 11/06/2021 21:18:22 - INFO - __main__ - Step 200: {'lr': 4.975e-05, 'samples': 38400, 'steps': 199, 'loss/train': 6.113000392913818} 11/06/2021 21:18:22 - INFO - __main__ - Step 201: {'lr': 5e-05, 'samples': 38592, 'steps': 200, 'loss/train': 6.284862995147705} 11/06/2021 21:18:23 - INFO - __main__ - Step 202: {'lr': 5.025e-05, 'samples': 38784, 'steps': 201, 'loss/train': 6.143826007843018} 11/06/2021 21:18:24 - INFO - __main__ - Step 203: {'lr': 5.05e-05, 'samples': 38976, 'steps': 202, 'loss/train': 6.595575332641602} 11/06/2021 21:18:24 - INFO - __main__ - Step 204: {'lr': 5.075000000000001e-05, 'samples': 39168, 'steps': 203, 'loss/train': 6.1036787033081055} 11/06/2021 21:18:24 - INFO - __main__ - Step 205: {'lr': 5.1e-05, 'samples': 39360, 'steps': 204, 'loss/train': 6.222804546356201} 11/06/2021 21:18:25 - INFO - __main__ - Step 206: {'lr': 5.125e-05, 'samples': 39552, 'steps': 205, 'loss/train': 5.817543029785156} 11/06/2021 21:18:26 - INFO - __main__ - Step 207: {'lr': 5.15e-05, 'samples': 39744, 'steps': 206, 'loss/train': 6.056823253631592} 11/06/2021 21:18:26 - INFO - __main__ - Step 208: {'lr': 5.175e-05, 'samples': 39936, 'steps': 207, 'loss/train': 6.261317729949951} 11/06/2021 21:18:27 - INFO - __main__ - Step 209: {'lr': 5.2e-05, 'samples': 40128, 'steps': 208, 'loss/train': 6.321830749511719} 11/06/2021 21:18:27 - INFO - __main__ - Step 210: {'lr': 5.2249999999999996e-05, 'samples': 40320, 'steps': 209, 'loss/train': 5.724458694458008} 11/06/2021 21:18:27 - INFO - __main__ - Step 211: {'lr': 5.25e-05, 'samples': 40512, 'steps': 210, 'loss/train': 6.434157371520996} 11/06/2021 21:18:28 - INFO - __main__ - Step 212: {'lr': 5.275e-05, 'samples': 40704, 'steps': 211, 'loss/train': 5.677988529205322} 11/06/2021 21:18:29 - INFO - __main__ - Step 213: {'lr': 5.3e-05, 'samples': 40896, 'steps': 212, 'loss/train': 5.744022846221924} 11/06/2021 21:18:29 - INFO - __main__ - Step 214: {'lr': 5.325e-05, 'samples': 41088, 'steps': 213, 'loss/train': 5.744656562805176} 11/06/2021 21:18:29 - INFO - __main__ - Step 215: {'lr': 5.35e-05, 'samples': 41280, 'steps': 214, 'loss/train': 5.984218597412109} 11/06/2021 21:18:30 - INFO - __main__ - Step 216: {'lr': 5.375e-05, 'samples': 41472, 'steps': 215, 'loss/train': 6.307746887207031} 11/06/2021 21:18:31 - INFO - __main__ - Step 217: {'lr': 5.4e-05, 'samples': 41664, 'steps': 216, 'loss/train': 6.040472030639648} 11/06/2021 21:18:31 - INFO - __main__ - Step 218: {'lr': 5.4250000000000004e-05, 'samples': 41856, 'steps': 217, 'loss/train': 6.029814720153809} 11/06/2021 21:18:31 - INFO - __main__ - Step 219: {'lr': 5.45e-05, 'samples': 42048, 'steps': 218, 'loss/train': 6.033048629760742} 11/06/2021 21:18:32 - INFO - __main__ - Step 220: {'lr': 5.475e-05, 'samples': 42240, 'steps': 219, 'loss/train': 5.9049153327941895} 11/06/2021 21:18:32 - INFO - __main__ - Step 221: {'lr': 5.5e-05, 'samples': 42432, 'steps': 220, 'loss/train': 6.097071170806885} 11/06/2021 21:18:33 - INFO - __main__ - Step 222: {'lr': 5.525e-05, 'samples': 42624, 'steps': 221, 'loss/train': 6.372702598571777} 11/06/2021 21:18:33 - INFO - __main__ - Step 223: {'lr': 5.55e-05, 'samples': 42816, 'steps': 222, 'loss/train': 7.301460266113281} 11/06/2021 21:18:34 - INFO - __main__ - Step 224: {'lr': 5.575e-05, 'samples': 43008, 'steps': 223, 'loss/train': 6.467023849487305} 11/06/2021 21:18:34 - INFO - __main__ - Step 225: {'lr': 5.6e-05, 'samples': 43200, 'steps': 224, 'loss/train': 6.188560962677002} 11/06/2021 21:18:34 - INFO - __main__ - Step 226: {'lr': 5.6250000000000005e-05, 'samples': 43392, 'steps': 225, 'loss/train': 5.94260311126709} 11/06/2021 21:18:36 - INFO - __main__ - Step 227: {'lr': 5.6500000000000005e-05, 'samples': 43584, 'steps': 226, 'loss/train': 5.925175189971924} 11/06/2021 21:18:36 - INFO - __main__ - Step 228: {'lr': 5.6750000000000004e-05, 'samples': 43776, 'steps': 227, 'loss/train': 5.587348937988281} 11/06/2021 21:18:36 - INFO - __main__ - Step 229: {'lr': 5.7e-05, 'samples': 43968, 'steps': 228, 'loss/train': 5.978050231933594} 11/06/2021 21:18:37 - INFO - __main__ - Step 230: {'lr': 5.725e-05, 'samples': 44160, 'steps': 229, 'loss/train': 5.728048324584961} 11/06/2021 21:18:37 - INFO - __main__ - Step 231: {'lr': 5.75e-05, 'samples': 44352, 'steps': 230, 'loss/train': 6.196109771728516} 11/06/2021 21:18:37 - INFO - __main__ - Step 232: {'lr': 5.775e-05, 'samples': 44544, 'steps': 231, 'loss/train': 6.567146301269531} 11/06/2021 21:18:38 - INFO - __main__ - Step 233: {'lr': 5.800000000000001e-05, 'samples': 44736, 'steps': 232, 'loss/train': 5.301417827606201} 11/06/2021 21:18:39 - INFO - __main__ - Step 234: {'lr': 5.8250000000000006e-05, 'samples': 44928, 'steps': 233, 'loss/train': 5.6201348304748535} 11/06/2021 21:18:39 - INFO - __main__ - Step 235: {'lr': 5.8500000000000006e-05, 'samples': 45120, 'steps': 234, 'loss/train': 5.957462787628174} 11/06/2021 21:18:39 - INFO - __main__ - Step 236: {'lr': 5.875e-05, 'samples': 45312, 'steps': 235, 'loss/train': 6.2582688331604} 11/06/2021 21:18:40 - INFO - __main__ - Step 237: {'lr': 5.9e-05, 'samples': 45504, 'steps': 236, 'loss/train': 5.807918071746826} 11/06/2021 21:18:41 - INFO - __main__ - Step 238: {'lr': 5.925e-05, 'samples': 45696, 'steps': 237, 'loss/train': 5.977503299713135} 11/06/2021 21:18:41 - INFO - __main__ - Step 239: {'lr': 5.9499999999999996e-05, 'samples': 45888, 'steps': 238, 'loss/train': 5.574709892272949} 11/06/2021 21:18:42 - INFO - __main__ - Step 240: {'lr': 5.9749999999999995e-05, 'samples': 46080, 'steps': 239, 'loss/train': 5.93025016784668} 11/06/2021 21:18:42 - INFO - __main__ - Step 241: {'lr': 6e-05, 'samples': 46272, 'steps': 240, 'loss/train': 5.585808277130127} 11/06/2021 21:18:42 - INFO - __main__ - Step 242: {'lr': 6.025e-05, 'samples': 46464, 'steps': 241, 'loss/train': 6.203149318695068} 11/06/2021 21:18:43 - INFO - __main__ - Step 243: {'lr': 6.05e-05, 'samples': 46656, 'steps': 242, 'loss/train': 4.716400623321533} 11/06/2021 21:18:44 - INFO - __main__ - Step 244: {'lr': 6.075e-05, 'samples': 46848, 'steps': 243, 'loss/train': 5.850978851318359} 11/06/2021 21:18:44 - INFO - __main__ - Step 245: {'lr': 6.1e-05, 'samples': 47040, 'steps': 244, 'loss/train': 5.889804840087891} 11/06/2021 21:18:44 - INFO - __main__ - Step 246: {'lr': 6.125e-05, 'samples': 47232, 'steps': 245, 'loss/train': 5.8663225173950195} 11/06/2021 21:18:45 - INFO - __main__ - Step 247: {'lr': 6.15e-05, 'samples': 47424, 'steps': 246, 'loss/train': 5.8062334060668945} 11/06/2021 21:18:45 - INFO - __main__ - Step 248: {'lr': 6.175e-05, 'samples': 47616, 'steps': 247, 'loss/train': 5.754123210906982} 11/06/2021 21:18:46 - INFO - __main__ - Step 249: {'lr': 6.2e-05, 'samples': 47808, 'steps': 248, 'loss/train': 5.87885856628418} 11/06/2021 21:18:47 - INFO - __main__ - Step 250: {'lr': 6.225e-05, 'samples': 48000, 'steps': 249, 'loss/train': 5.715115070343018} 11/06/2021 21:18:47 - INFO - __main__ - Step 251: {'lr': 6.25e-05, 'samples': 48192, 'steps': 250, 'loss/train': 5.726807594299316} 11/06/2021 21:18:47 - INFO - __main__ - Step 252: {'lr': 6.275000000000001e-05, 'samples': 48384, 'steps': 251, 'loss/train': 5.94719934463501} 11/06/2021 21:18:48 - INFO - __main__ - Step 253: {'lr': 6.3e-05, 'samples': 48576, 'steps': 252, 'loss/train': 5.911177158355713} 11/06/2021 21:18:49 - INFO - __main__ - Step 254: {'lr': 6.325e-05, 'samples': 48768, 'steps': 253, 'loss/train': 5.7925004959106445} 11/06/2021 21:18:49 - INFO - __main__ - Step 255: {'lr': 6.35e-05, 'samples': 48960, 'steps': 254, 'loss/train': 5.541946887969971} 11/06/2021 21:18:50 - INFO - __main__ - Step 256: {'lr': 6.375e-05, 'samples': 49152, 'steps': 255, 'loss/train': 5.5716233253479} 11/06/2021 21:18:50 - INFO - __main__ - Step 257: {'lr': 6.4e-05, 'samples': 49344, 'steps': 256, 'loss/train': 6.297529697418213} 11/06/2021 21:18:50 - INFO - __main__ - Step 258: {'lr': 6.425e-05, 'samples': 49536, 'steps': 257, 'loss/train': 4.350067138671875} 11/06/2021 21:18:51 - INFO - __main__ - Step 259: {'lr': 6.450000000000001e-05, 'samples': 49728, 'steps': 258, 'loss/train': 5.1816887855529785} 11/06/2021 21:18:52 - INFO - __main__ - Step 260: {'lr': 6.475e-05, 'samples': 49920, 'steps': 259, 'loss/train': 3.9729881286621094} 11/06/2021 21:18:52 - INFO - __main__ - Step 261: {'lr': 6.500000000000001e-05, 'samples': 50112, 'steps': 260, 'loss/train': 5.403891563415527} 11/06/2021 21:18:52 - INFO - __main__ - Step 262: {'lr': 6.525e-05, 'samples': 50304, 'steps': 261, 'loss/train': 6.115725517272949} 11/06/2021 21:18:53 - INFO - __main__ - Step 263: {'lr': 6.55e-05, 'samples': 50496, 'steps': 262, 'loss/train': 6.0679426193237305} 11/06/2021 21:18:53 - INFO - __main__ - Step 264: {'lr': 6.575e-05, 'samples': 50688, 'steps': 263, 'loss/train': 5.964505195617676} 11/06/2021 21:18:54 - INFO - __main__ - Step 265: {'lr': 6.6e-05, 'samples': 50880, 'steps': 264, 'loss/train': 5.48927640914917} 11/06/2021 21:18:55 - INFO - __main__ - Step 266: {'lr': 6.625000000000001e-05, 'samples': 51072, 'steps': 265, 'loss/train': 6.112322807312012} 11/06/2021 21:18:55 - INFO - __main__ - Step 267: {'lr': 6.65e-05, 'samples': 51264, 'steps': 266, 'loss/train': 6.019900798797607} 11/06/2021 21:18:55 - INFO - __main__ - Step 268: {'lr': 6.675000000000001e-05, 'samples': 51456, 'steps': 267, 'loss/train': 6.1478657722473145} 11/06/2021 21:18:56 - INFO - __main__ - Step 269: {'lr': 6.7e-05, 'samples': 51648, 'steps': 268, 'loss/train': 5.52264404296875} 11/06/2021 21:18:57 - INFO - __main__ - Step 270: {'lr': 6.725000000000001e-05, 'samples': 51840, 'steps': 269, 'loss/train': 5.913082599639893} 11/06/2021 21:18:57 - INFO - __main__ - Step 271: {'lr': 6.75e-05, 'samples': 52032, 'steps': 270, 'loss/train': 5.843954086303711} 11/06/2021 21:18:57 - INFO - __main__ - Step 272: {'lr': 6.775000000000001e-05, 'samples': 52224, 'steps': 271, 'loss/train': 5.43068790435791} 11/06/2021 21:18:58 - INFO - __main__ - Step 273: {'lr': 6.800000000000001e-05, 'samples': 52416, 'steps': 272, 'loss/train': 5.731388568878174} 11/06/2021 21:18:58 - INFO - __main__ - Step 274: {'lr': 6.825e-05, 'samples': 52608, 'steps': 273, 'loss/train': 6.002760887145996} 11/06/2021 21:18:59 - INFO - __main__ - Step 275: {'lr': 6.850000000000001e-05, 'samples': 52800, 'steps': 274, 'loss/train': 5.909034729003906} 11/06/2021 21:18:59 - INFO - __main__ - Step 276: {'lr': 6.875e-05, 'samples': 52992, 'steps': 275, 'loss/train': 5.980253219604492} 11/06/2021 21:19:00 - INFO - __main__ - Step 277: {'lr': 6.900000000000001e-05, 'samples': 53184, 'steps': 276, 'loss/train': 5.36932373046875} 11/06/2021 21:19:00 - INFO - __main__ - Step 278: {'lr': 6.925e-05, 'samples': 53376, 'steps': 277, 'loss/train': 5.7302045822143555} 11/06/2021 21:19:00 - INFO - __main__ - Step 279: {'lr': 6.950000000000001e-05, 'samples': 53568, 'steps': 278, 'loss/train': 5.460746765136719} 11/06/2021 21:19:01 - INFO - __main__ - Step 280: {'lr': 6.975e-05, 'samples': 53760, 'steps': 279, 'loss/train': 5.948640823364258} 11/06/2021 21:19:02 - INFO - __main__ - Step 281: {'lr': 7.000000000000001e-05, 'samples': 53952, 'steps': 280, 'loss/train': 5.746571063995361} 11/06/2021 21:19:02 - INFO - __main__ - Step 282: {'lr': 7.025000000000001e-05, 'samples': 54144, 'steps': 281, 'loss/train': 6.098116874694824} 11/06/2021 21:19:03 - INFO - __main__ - Step 283: {'lr': 7.049999999999999e-05, 'samples': 54336, 'steps': 282, 'loss/train': 5.805056571960449} 11/06/2021 21:19:03 - INFO - __main__ - Step 284: {'lr': 7.075e-05, 'samples': 54528, 'steps': 283, 'loss/train': 5.200248718261719} 11/06/2021 21:19:04 - INFO - __main__ - Step 285: {'lr': 7.099999999999999e-05, 'samples': 54720, 'steps': 284, 'loss/train': 6.579744815826416} 11/06/2021 21:19:04 - INFO - __main__ - Step 286: {'lr': 7.125e-05, 'samples': 54912, 'steps': 285, 'loss/train': 5.594106197357178} 11/06/2021 21:19:05 - INFO - __main__ - Step 287: {'lr': 7.149999999999999e-05, 'samples': 55104, 'steps': 286, 'loss/train': 6.083644390106201} 11/06/2021 21:19:05 - INFO - __main__ - Step 288: {'lr': 7.175e-05, 'samples': 55296, 'steps': 287, 'loss/train': 5.514430999755859} 11/06/2021 21:19:05 - INFO - __main__ - Step 289: {'lr': 7.2e-05, 'samples': 55488, 'steps': 288, 'loss/train': 5.3459954261779785} 11/06/2021 21:19:06 - INFO - __main__ - Step 290: {'lr': 7.225e-05, 'samples': 55680, 'steps': 289, 'loss/train': 6.18521785736084} 11/06/2021 21:19:07 - INFO - __main__ - Step 291: {'lr': 7.25e-05, 'samples': 55872, 'steps': 290, 'loss/train': 6.124849319458008} 11/06/2021 21:19:07 - INFO - __main__ - Step 292: {'lr': 7.274999999999999e-05, 'samples': 56064, 'steps': 291, 'loss/train': 5.46663761138916} 11/06/2021 21:19:07 - INFO - __main__ - Step 293: {'lr': 7.3e-05, 'samples': 56256, 'steps': 292, 'loss/train': 5.0395188331604} 11/06/2021 21:19:08 - INFO - __main__ - Step 294: {'lr': 7.324999999999999e-05, 'samples': 56448, 'steps': 293, 'loss/train': 5.422224998474121} 11/06/2021 21:19:08 - INFO - __main__ - Step 295: {'lr': 7.35e-05, 'samples': 56640, 'steps': 294, 'loss/train': 5.153810024261475} 11/06/2021 21:19:09 - INFO - __main__ - Step 296: {'lr': 7.375e-05, 'samples': 56832, 'steps': 295, 'loss/train': 5.397673606872559} 11/06/2021 21:19:10 - INFO - __main__ - Step 297: {'lr': 7.4e-05, 'samples': 57024, 'steps': 296, 'loss/train': 5.323544979095459} 11/06/2021 21:19:10 - INFO - __main__ - Step 298: {'lr': 7.425e-05, 'samples': 57216, 'steps': 297, 'loss/train': 5.2873454093933105} 11/06/2021 21:19:10 - INFO - __main__ - Step 299: {'lr': 7.45e-05, 'samples': 57408, 'steps': 298, 'loss/train': 5.626234531402588} 11/06/2021 21:19:11 - INFO - __main__ - Step 300: {'lr': 7.475e-05, 'samples': 57600, 'steps': 299, 'loss/train': 5.648954391479492} 11/06/2021 21:19:12 - INFO - __main__ - Step 301: {'lr': 7.5e-05, 'samples': 57792, 'steps': 300, 'loss/train': 6.47790002822876} 11/06/2021 21:19:12 - INFO - __main__ - Step 302: {'lr': 7.525e-05, 'samples': 57984, 'steps': 301, 'loss/train': 5.377347469329834} 11/06/2021 21:19:12 - INFO - __main__ - Step 303: {'lr': 7.55e-05, 'samples': 58176, 'steps': 302, 'loss/train': 5.396330833435059} 11/06/2021 21:19:13 - INFO - __main__ - Step 304: {'lr': 7.575e-05, 'samples': 58368, 'steps': 303, 'loss/train': 6.444887161254883} 11/06/2021 21:19:13 - INFO - __main__ - Step 305: {'lr': 7.6e-05, 'samples': 58560, 'steps': 304, 'loss/train': 5.51333475112915} 11/06/2021 21:19:14 - INFO - __main__ - Step 306: {'lr': 7.625e-05, 'samples': 58752, 'steps': 305, 'loss/train': 5.4495744705200195} 11/06/2021 21:19:15 - INFO - __main__ - Step 307: {'lr': 7.65e-05, 'samples': 58944, 'steps': 306, 'loss/train': 5.420361518859863} 11/06/2021 21:19:15 - INFO - __main__ - Step 308: {'lr': 7.675e-05, 'samples': 59136, 'steps': 307, 'loss/train': 5.876494407653809} 11/06/2021 21:19:15 - INFO - __main__ - Step 309: {'lr': 7.7e-05, 'samples': 59328, 'steps': 308, 'loss/train': 5.0450520515441895} 11/06/2021 21:19:16 - INFO - __main__ - Step 310: {'lr': 7.725000000000001e-05, 'samples': 59520, 'steps': 309, 'loss/train': 5.425472736358643} 11/06/2021 21:19:17 - INFO - __main__ - Step 311: {'lr': 7.75e-05, 'samples': 59712, 'steps': 310, 'loss/train': 5.596251010894775} 11/06/2021 21:19:17 - INFO - __main__ - Step 312: {'lr': 7.775e-05, 'samples': 59904, 'steps': 311, 'loss/train': 5.2670392990112305} 11/06/2021 21:19:17 - INFO - __main__ - Step 313: {'lr': 7.8e-05, 'samples': 60096, 'steps': 312, 'loss/train': 7.779051780700684} 11/06/2021 21:19:18 - INFO - __main__ - Step 314: {'lr': 7.825e-05, 'samples': 60288, 'steps': 313, 'loss/train': 5.498117923736572} 11/06/2021 21:19:18 - INFO - __main__ - Step 315: {'lr': 7.85e-05, 'samples': 60480, 'steps': 314, 'loss/train': 5.533538341522217} 11/06/2021 21:19:19 - INFO - __main__ - Step 316: {'lr': 7.875e-05, 'samples': 60672, 'steps': 315, 'loss/train': 5.794958591461182} 11/06/2021 21:19:19 - INFO - __main__ - Step 317: {'lr': 7.9e-05, 'samples': 60864, 'steps': 316, 'loss/train': 5.267005443572998} 11/06/2021 21:19:20 - INFO - __main__ - Step 318: {'lr': 7.925e-05, 'samples': 61056, 'steps': 317, 'loss/train': 5.625141143798828} 11/06/2021 21:19:20 - INFO - __main__ - Step 319: {'lr': 7.950000000000001e-05, 'samples': 61248, 'steps': 318, 'loss/train': 5.229969024658203} 11/06/2021 21:19:21 - INFO - __main__ - Step 320: {'lr': 7.975e-05, 'samples': 61440, 'steps': 319, 'loss/train': 5.5839314460754395} 11/06/2021 21:19:21 - INFO - __main__ - Step 321: {'lr': 8e-05, 'samples': 61632, 'steps': 320, 'loss/train': 5.807913303375244} 11/06/2021 21:19:22 - INFO - __main__ - Step 322: {'lr': 8.025e-05, 'samples': 61824, 'steps': 321, 'loss/train': 5.966512203216553} 11/06/2021 21:19:22 - INFO - __main__ - Step 323: {'lr': 8.05e-05, 'samples': 62016, 'steps': 322, 'loss/train': 5.348308563232422} 11/06/2021 21:19:23 - INFO - __main__ - Step 324: {'lr': 8.075e-05, 'samples': 62208, 'steps': 323, 'loss/train': 5.878225326538086} 11/06/2021 21:19:23 - INFO - __main__ - Step 325: {'lr': 8.1e-05, 'samples': 62400, 'steps': 324, 'loss/train': 5.854434013366699} 11/06/2021 21:19:23 - INFO - __main__ - Step 326: {'lr': 8.125000000000001e-05, 'samples': 62592, 'steps': 325, 'loss/train': 5.428937911987305} 11/06/2021 21:19:24 - INFO - __main__ - Step 327: {'lr': 8.15e-05, 'samples': 62784, 'steps': 326, 'loss/train': 5.665132522583008} 11/06/2021 21:19:25 - INFO - __main__ - Step 328: {'lr': 8.175000000000001e-05, 'samples': 62976, 'steps': 327, 'loss/train': 5.691938400268555} 11/06/2021 21:19:25 - INFO - __main__ - Step 329: {'lr': 8.2e-05, 'samples': 63168, 'steps': 328, 'loss/train': 6.869819641113281} 11/06/2021 21:19:26 - INFO - __main__ - Step 330: {'lr': 8.225000000000001e-05, 'samples': 63360, 'steps': 329, 'loss/train': 4.917605876922607} 11/06/2021 21:19:26 - INFO - __main__ - Step 331: {'lr': 8.25e-05, 'samples': 63552, 'steps': 330, 'loss/train': 5.495372295379639} 11/06/2021 21:19:27 - INFO - __main__ - Step 332: {'lr': 8.275e-05, 'samples': 63744, 'steps': 331, 'loss/train': 5.180616855621338} 11/06/2021 21:19:27 - INFO - __main__ - Step 333: {'lr': 8.300000000000001e-05, 'samples': 63936, 'steps': 332, 'loss/train': 5.5182318687438965} 11/06/2021 21:19:28 - INFO - __main__ - Step 334: {'lr': 8.325e-05, 'samples': 64128, 'steps': 333, 'loss/train': 5.079967975616455} 11/06/2021 21:19:28 - INFO - __main__ - Step 335: {'lr': 8.350000000000001e-05, 'samples': 64320, 'steps': 334, 'loss/train': 5.6893181800842285} 11/06/2021 21:19:28 - INFO - __main__ - Step 336: {'lr': 8.375e-05, 'samples': 64512, 'steps': 335, 'loss/train': 5.519853115081787} 11/06/2021 21:19:29 - INFO - __main__ - Step 337: {'lr': 8.400000000000001e-05, 'samples': 64704, 'steps': 336, 'loss/train': 5.515233516693115} 11/06/2021 21:19:30 - INFO - __main__ - Step 338: {'lr': 8.425e-05, 'samples': 64896, 'steps': 337, 'loss/train': 4.627192974090576} 11/06/2021 21:19:30 - INFO - __main__ - Step 339: {'lr': 8.450000000000001e-05, 'samples': 65088, 'steps': 338, 'loss/train': 5.647910118103027} 11/06/2021 21:19:30 - INFO - __main__ - Step 340: {'lr': 8.475000000000001e-05, 'samples': 65280, 'steps': 339, 'loss/train': 5.477786540985107} 11/06/2021 21:19:31 - INFO - __main__ - Step 341: {'lr': 8.5e-05, 'samples': 65472, 'steps': 340, 'loss/train': 5.654776573181152} 11/06/2021 21:19:32 - INFO - __main__ - Step 342: {'lr': 8.525000000000001e-05, 'samples': 65664, 'steps': 341, 'loss/train': 5.568868637084961} 11/06/2021 21:19:32 - INFO - __main__ - Step 343: {'lr': 8.55e-05, 'samples': 65856, 'steps': 342, 'loss/train': 3.5874297618865967} 11/06/2021 21:19:33 - INFO - __main__ - Step 344: {'lr': 8.575000000000001e-05, 'samples': 66048, 'steps': 343, 'loss/train': 5.646244525909424} 11/06/2021 21:19:33 - INFO - __main__ - Step 345: {'lr': 8.599999999999999e-05, 'samples': 66240, 'steps': 344, 'loss/train': 5.599254131317139} 11/06/2021 21:19:33 - INFO - __main__ - Step 346: {'lr': 8.625e-05, 'samples': 66432, 'steps': 345, 'loss/train': 5.349778652191162} 11/06/2021 21:19:34 - INFO - __main__ - Step 347: {'lr': 8.65e-05, 'samples': 66624, 'steps': 346, 'loss/train': 5.612128257751465} 11/06/2021 21:19:35 - INFO - __main__ - Step 348: {'lr': 8.675e-05, 'samples': 66816, 'steps': 347, 'loss/train': 5.6107988357543945} 11/06/2021 21:19:35 - INFO - __main__ - Step 349: {'lr': 8.7e-05, 'samples': 67008, 'steps': 348, 'loss/train': 5.735456943511963} 11/06/2021 21:19:35 - INFO - __main__ - Step 350: {'lr': 8.724999999999999e-05, 'samples': 67200, 'steps': 349, 'loss/train': 5.3645172119140625} 11/06/2021 21:19:36 - INFO - __main__ - Step 351: {'lr': 8.75e-05, 'samples': 67392, 'steps': 350, 'loss/train': 5.467144966125488} 11/06/2021 21:19:36 - INFO - __main__ - Step 352: {'lr': 8.774999999999999e-05, 'samples': 67584, 'steps': 351, 'loss/train': 6.211921215057373} 11/06/2021 21:19:37 - INFO - __main__ - Step 353: {'lr': 8.8e-05, 'samples': 67776, 'steps': 352, 'loss/train': 5.452548980712891} 11/06/2021 21:19:37 - INFO - __main__ - Step 354: {'lr': 8.824999999999999e-05, 'samples': 67968, 'steps': 353, 'loss/train': 5.824997901916504} 11/06/2021 21:19:38 - INFO - __main__ - Step 355: {'lr': 8.85e-05, 'samples': 68160, 'steps': 354, 'loss/train': 5.646512985229492} 11/06/2021 21:19:38 - INFO - __main__ - Step 356: {'lr': 8.875e-05, 'samples': 68352, 'steps': 355, 'loss/train': 5.443009853363037} 11/06/2021 21:19:39 - INFO - __main__ - Step 357: {'lr': 8.9e-05, 'samples': 68544, 'steps': 356, 'loss/train': 5.597517490386963} 11/06/2021 21:19:40 - INFO - __main__ - Step 358: {'lr': 8.925e-05, 'samples': 68736, 'steps': 357, 'loss/train': 5.510553359985352} 11/06/2021 21:19:40 - INFO - __main__ - Step 359: {'lr': 8.95e-05, 'samples': 68928, 'steps': 358, 'loss/train': 5.271029949188232} 11/06/2021 21:19:40 - INFO - __main__ - Step 360: {'lr': 8.975e-05, 'samples': 69120, 'steps': 359, 'loss/train': 5.6077141761779785} 11/06/2021 21:19:41 - INFO - __main__ - Step 361: {'lr': 8.999999999999999e-05, 'samples': 69312, 'steps': 360, 'loss/train': 7.289037227630615} 11/06/2021 21:19:41 - INFO - __main__ - Step 362: {'lr': 9.025e-05, 'samples': 69504, 'steps': 361, 'loss/train': 5.1300740242004395} 11/06/2021 21:19:42 - INFO - __main__ - Step 363: {'lr': 9.05e-05, 'samples': 69696, 'steps': 362, 'loss/train': 5.349740982055664} 11/06/2021 21:19:42 - INFO - __main__ - Step 364: {'lr': 9.075e-05, 'samples': 69888, 'steps': 363, 'loss/train': 5.435561656951904} 11/06/2021 21:19:43 - INFO - __main__ - Step 365: {'lr': 9.1e-05, 'samples': 70080, 'steps': 364, 'loss/train': 5.4572978019714355} 11/06/2021 21:19:43 - INFO - __main__ - Step 366: {'lr': 9.125e-05, 'samples': 70272, 'steps': 365, 'loss/train': 5.351955890655518} 11/06/2021 21:19:43 - INFO - __main__ - Step 367: {'lr': 9.15e-05, 'samples': 70464, 'steps': 366, 'loss/train': 5.565761089324951} 11/06/2021 21:19:44 - INFO - __main__ - Step 368: {'lr': 9.175e-05, 'samples': 70656, 'steps': 367, 'loss/train': 5.959819316864014} 11/06/2021 21:19:45 - INFO - __main__ - Step 369: {'lr': 9.2e-05, 'samples': 70848, 'steps': 368, 'loss/train': 5.474288463592529} 11/06/2021 21:19:45 - INFO - __main__ - Step 370: {'lr': 9.225e-05, 'samples': 71040, 'steps': 369, 'loss/train': 5.607541561126709} 11/06/2021 21:19:46 - INFO - __main__ - Step 371: {'lr': 9.25e-05, 'samples': 71232, 'steps': 370, 'loss/train': 5.084637641906738} 11/06/2021 21:19:46 - INFO - __main__ - Step 372: {'lr': 9.275e-05, 'samples': 71424, 'steps': 371, 'loss/train': 4.969635009765625} 11/06/2021 21:19:46 - INFO - __main__ - Step 373: {'lr': 9.3e-05, 'samples': 71616, 'steps': 372, 'loss/train': 5.273648262023926} 11/06/2021 21:19:47 - INFO - __main__ - Step 374: {'lr': 9.325e-05, 'samples': 71808, 'steps': 373, 'loss/train': 5.385776042938232} 11/06/2021 21:19:48 - INFO - __main__ - Step 375: {'lr': 9.35e-05, 'samples': 72000, 'steps': 374, 'loss/train': 4.57074499130249} 11/06/2021 21:19:48 - INFO - __main__ - Step 376: {'lr': 9.375e-05, 'samples': 72192, 'steps': 375, 'loss/train': 5.399175643920898} 11/06/2021 21:19:48 - INFO - __main__ - Step 377: {'lr': 9.400000000000001e-05, 'samples': 72384, 'steps': 376, 'loss/train': 5.5180864334106445} 11/06/2021 21:19:49 - INFO - __main__ - Step 378: {'lr': 9.425e-05, 'samples': 72576, 'steps': 377, 'loss/train': 5.192264556884766} 11/06/2021 21:19:50 - INFO - __main__ - Step 379: {'lr': 9.45e-05, 'samples': 72768, 'steps': 378, 'loss/train': 5.126402854919434} 11/06/2021 21:19:50 - INFO - __main__ - Step 380: {'lr': 9.475e-05, 'samples': 72960, 'steps': 379, 'loss/train': 5.3973708152771} 11/06/2021 21:19:50 - INFO - __main__ - Step 381: {'lr': 9.5e-05, 'samples': 73152, 'steps': 380, 'loss/train': 5.583156585693359} 11/06/2021 21:19:51 - INFO - __main__ - Step 382: {'lr': 9.525e-05, 'samples': 73344, 'steps': 381, 'loss/train': 5.458894729614258} 11/06/2021 21:19:51 - INFO - __main__ - Step 383: {'lr': 9.55e-05, 'samples': 73536, 'steps': 382, 'loss/train': 5.020319938659668} 11/06/2021 21:19:52 - INFO - __main__ - Step 384: {'lr': 9.575000000000001e-05, 'samples': 73728, 'steps': 383, 'loss/train': 5.400721073150635} 11/06/2021 21:19:53 - INFO - __main__ - Step 385: {'lr': 9.6e-05, 'samples': 73920, 'steps': 384, 'loss/train': 5.1949381828308105} 11/06/2021 21:19:53 - INFO - __main__ - Step 386: {'lr': 9.625000000000001e-05, 'samples': 74112, 'steps': 385, 'loss/train': 5.291417121887207} 11/06/2021 21:19:53 - INFO - __main__ - Step 387: {'lr': 9.65e-05, 'samples': 74304, 'steps': 386, 'loss/train': 6.089571475982666} 11/06/2021 21:19:54 - INFO - __main__ - Step 388: {'lr': 9.675000000000001e-05, 'samples': 74496, 'steps': 387, 'loss/train': 5.51926326751709} 11/06/2021 21:19:55 - INFO - __main__ - Step 389: {'lr': 9.7e-05, 'samples': 74688, 'steps': 388, 'loss/train': 5.781015396118164} 11/06/2021 21:19:55 - INFO - __main__ - Step 390: {'lr': 9.725e-05, 'samples': 74880, 'steps': 389, 'loss/train': 4.908177852630615} 11/06/2021 21:19:55 - INFO - __main__ - Step 391: {'lr': 9.750000000000001e-05, 'samples': 75072, 'steps': 390, 'loss/train': 5.012897968292236} 11/06/2021 21:19:56 - INFO - __main__ - Step 392: {'lr': 9.775e-05, 'samples': 75264, 'steps': 391, 'loss/train': 5.399777889251709} 11/06/2021 21:19:56 - INFO - __main__ - Step 393: {'lr': 9.800000000000001e-05, 'samples': 75456, 'steps': 392, 'loss/train': 5.350080966949463} 11/06/2021 21:19:57 - INFO - __main__ - Step 394: {'lr': 9.825e-05, 'samples': 75648, 'steps': 393, 'loss/train': 5.737140655517578} 11/06/2021 21:19:57 - INFO - __main__ - Step 395: {'lr': 9.850000000000001e-05, 'samples': 75840, 'steps': 394, 'loss/train': 5.23768949508667} 11/06/2021 21:19:58 - INFO - __main__ - Step 396: {'lr': 9.875e-05, 'samples': 76032, 'steps': 395, 'loss/train': 5.202757358551025} 11/06/2021 21:19:58 - INFO - __main__ - Step 397: {'lr': 9.900000000000001e-05, 'samples': 76224, 'steps': 396, 'loss/train': 5.724830150604248} 11/06/2021 21:19:59 - INFO - __main__ - Step 398: {'lr': 9.925000000000001e-05, 'samples': 76416, 'steps': 397, 'loss/train': 6.981353282928467} 11/06/2021 21:19:59 - INFO - __main__ - Step 399: {'lr': 9.95e-05, 'samples': 76608, 'steps': 398, 'loss/train': 7.47460412979126} 11/06/2021 21:20:00 - INFO - __main__ - Step 400: {'lr': 9.975000000000001e-05, 'samples': 76800, 'steps': 399, 'loss/train': 5.741088390350342} 11/06/2021 21:20:00 - INFO - __main__ - Step 401: {'lr': 0.0001, 'samples': 76992, 'steps': 400, 'loss/train': 5.029991626739502} 11/06/2021 21:20:01 - INFO - __main__ - Step 402: {'lr': 0.00010025000000000001, 'samples': 77184, 'steps': 401, 'loss/train': 5.318662166595459} 11/06/2021 21:20:01 - INFO - __main__ - Step 403: {'lr': 0.0001005, 'samples': 77376, 'steps': 402, 'loss/train': 5.997198104858398} 11/06/2021 21:20:01 - INFO - __main__ - Step 404: {'lr': 0.00010075000000000001, 'samples': 77568, 'steps': 403, 'loss/train': 6.158583641052246} 11/06/2021 21:20:02 - INFO - __main__ - Step 405: {'lr': 0.000101, 'samples': 77760, 'steps': 404, 'loss/train': 5.336831569671631} 11/06/2021 21:20:03 - INFO - __main__ - Step 406: {'lr': 0.00010125000000000001, 'samples': 77952, 'steps': 405, 'loss/train': 5.227279186248779} 11/06/2021 21:20:03 - INFO - __main__ - Step 407: {'lr': 0.00010150000000000001, 'samples': 78144, 'steps': 406, 'loss/train': 5.527342319488525} 11/06/2021 21:20:03 - INFO - __main__ - Step 408: {'lr': 0.00010174999999999999, 'samples': 78336, 'steps': 407, 'loss/train': 5.73346471786499} 11/06/2021 21:20:04 - INFO - __main__ - Step 409: {'lr': 0.000102, 'samples': 78528, 'steps': 408, 'loss/train': 6.215020179748535} 11/06/2021 21:20:05 - INFO - __main__ - Step 410: {'lr': 0.00010224999999999999, 'samples': 78720, 'steps': 409, 'loss/train': 5.491655349731445} 11/06/2021 21:20:05 - INFO - __main__ - Step 411: {'lr': 0.0001025, 'samples': 78912, 'steps': 410, 'loss/train': 4.847282886505127} 11/06/2021 21:20:05 - INFO - __main__ - Step 412: {'lr': 0.00010274999999999999, 'samples': 79104, 'steps': 411, 'loss/train': 5.18861198425293} 11/06/2021 21:20:06 - INFO - __main__ - Step 413: {'lr': 0.000103, 'samples': 79296, 'steps': 412, 'loss/train': 5.54094934463501} 11/06/2021 21:20:06 - INFO - __main__ - Step 414: {'lr': 0.00010325, 'samples': 79488, 'steps': 413, 'loss/train': 5.201007843017578} 11/06/2021 21:20:07 - INFO - __main__ - Step 415: {'lr': 0.0001035, 'samples': 79680, 'steps': 414, 'loss/train': 5.937044143676758} 11/06/2021 21:20:08 - INFO - __main__ - Step 416: {'lr': 0.00010375, 'samples': 79872, 'steps': 415, 'loss/train': 5.265571117401123} 11/06/2021 21:20:08 - INFO - __main__ - Step 417: {'lr': 0.000104, 'samples': 80064, 'steps': 416, 'loss/train': 4.91490364074707} 11/06/2021 21:20:08 - INFO - __main__ - Step 418: {'lr': 0.00010425, 'samples': 80256, 'steps': 417, 'loss/train': 5.490600109100342} 11/06/2021 21:20:09 - INFO - __main__ - Step 419: {'lr': 0.00010449999999999999, 'samples': 80448, 'steps': 418, 'loss/train': 4.848526954650879} 11/06/2021 21:20:10 - INFO - __main__ - Step 420: {'lr': 0.00010475, 'samples': 80640, 'steps': 419, 'loss/train': 5.088794231414795} 11/06/2021 21:20:10 - INFO - __main__ - Step 421: {'lr': 0.000105, 'samples': 80832, 'steps': 420, 'loss/train': 5.438802719116211} 11/06/2021 21:20:10 - INFO - __main__ - Step 422: {'lr': 0.00010525, 'samples': 81024, 'steps': 421, 'loss/train': 5.321382522583008} 11/06/2021 21:20:11 - INFO - __main__ - Step 423: {'lr': 0.0001055, 'samples': 81216, 'steps': 422, 'loss/train': 5.291647434234619} 11/06/2021 21:20:11 - INFO - __main__ - Step 424: {'lr': 0.00010575, 'samples': 81408, 'steps': 423, 'loss/train': 4.753524303436279} 11/06/2021 21:20:11 - INFO - __main__ - Step 425: {'lr': 0.000106, 'samples': 81600, 'steps': 424, 'loss/train': 5.298032760620117} 11/06/2021 21:20:13 - INFO - __main__ - Step 426: {'lr': 0.00010625, 'samples': 81792, 'steps': 425, 'loss/train': 6.007701873779297} 11/06/2021 21:20:13 - INFO - __main__ - Step 427: {'lr': 0.0001065, 'samples': 81984, 'steps': 426, 'loss/train': 5.437173366546631} 11/06/2021 21:20:13 - INFO - __main__ - Step 428: {'lr': 0.00010675, 'samples': 82176, 'steps': 427, 'loss/train': 5.345945358276367} 11/06/2021 21:20:14 - INFO - __main__ - Step 429: {'lr': 0.000107, 'samples': 82368, 'steps': 428, 'loss/train': 5.066455841064453} 11/06/2021 21:20:14 - INFO - __main__ - Step 430: {'lr': 0.00010725, 'samples': 82560, 'steps': 429, 'loss/train': 3.3696258068084717} 11/06/2021 21:20:15 - INFO - __main__ - Step 431: {'lr': 0.0001075, 'samples': 82752, 'steps': 430, 'loss/train': 4.334190845489502} 11/06/2021 21:20:15 - INFO - __main__ - Step 432: {'lr': 0.00010775, 'samples': 82944, 'steps': 431, 'loss/train': 5.169923305511475} 11/06/2021 21:20:16 - INFO - __main__ - Step 433: {'lr': 0.000108, 'samples': 83136, 'steps': 432, 'loss/train': 5.3268537521362305} 11/06/2021 21:20:16 - INFO - __main__ - Step 434: {'lr': 0.00010825, 'samples': 83328, 'steps': 433, 'loss/train': 5.3919854164123535} 11/06/2021 21:20:16 - INFO - __main__ - Step 435: {'lr': 0.00010850000000000001, 'samples': 83520, 'steps': 434, 'loss/train': 5.3725810050964355} 11/06/2021 21:20:17 - INFO - __main__ - Step 436: {'lr': 0.00010875, 'samples': 83712, 'steps': 435, 'loss/train': 5.403895854949951} 11/06/2021 21:20:18 - INFO - __main__ - Step 437: {'lr': 0.000109, 'samples': 83904, 'steps': 436, 'loss/train': 5.205580711364746} 11/06/2021 21:20:18 - INFO - __main__ - Step 438: {'lr': 0.00010925, 'samples': 84096, 'steps': 437, 'loss/train': 5.33302640914917} 11/06/2021 21:20:18 - INFO - __main__ - Step 439: {'lr': 0.0001095, 'samples': 84288, 'steps': 438, 'loss/train': 5.41857385635376} 11/06/2021 21:20:19 - INFO - __main__ - Step 440: {'lr': 0.00010975, 'samples': 84480, 'steps': 439, 'loss/train': 5.0854315757751465} 11/06/2021 21:20:20 - INFO - __main__ - Step 441: {'lr': 0.00011, 'samples': 84672, 'steps': 440, 'loss/train': 4.953428745269775} 11/06/2021 21:20:21 - INFO - __main__ - Step 442: {'lr': 0.00011025, 'samples': 84864, 'steps': 441, 'loss/train': 5.355175495147705} 11/06/2021 21:20:21 - INFO - __main__ - Step 443: {'lr': 0.0001105, 'samples': 85056, 'steps': 442, 'loss/train': 5.322221279144287} 11/06/2021 21:20:21 - INFO - __main__ - Step 444: {'lr': 0.00011075000000000001, 'samples': 85248, 'steps': 443, 'loss/train': 5.694289207458496} 11/06/2021 21:20:22 - INFO - __main__ - Step 445: {'lr': 0.000111, 'samples': 85440, 'steps': 444, 'loss/train': 5.615618705749512} 11/06/2021 21:20:23 - INFO - __main__ - Step 446: {'lr': 0.00011125000000000001, 'samples': 85632, 'steps': 445, 'loss/train': 4.579447269439697} 11/06/2021 21:20:23 - INFO - __main__ - Step 447: {'lr': 0.0001115, 'samples': 85824, 'steps': 446, 'loss/train': 5.4635491371154785} 11/06/2021 21:20:23 - INFO - __main__ - Step 448: {'lr': 0.00011175, 'samples': 86016, 'steps': 447, 'loss/train': 5.6371636390686035} 11/06/2021 21:20:24 - INFO - __main__ - Step 449: {'lr': 0.000112, 'samples': 86208, 'steps': 448, 'loss/train': 5.192485332489014} 11/06/2021 21:20:24 - INFO - __main__ - Step 450: {'lr': 0.00011225, 'samples': 86400, 'steps': 449, 'loss/train': 5.126009464263916} 11/06/2021 21:20:25 - INFO - __main__ - Step 451: {'lr': 0.00011250000000000001, 'samples': 86592, 'steps': 450, 'loss/train': 4.955085754394531} 11/06/2021 21:20:25 - INFO - __main__ - Step 452: {'lr': 0.00011275, 'samples': 86784, 'steps': 451, 'loss/train': 4.919888973236084} 11/06/2021 21:20:26 - INFO - __main__ - Step 453: {'lr': 0.00011300000000000001, 'samples': 86976, 'steps': 452, 'loss/train': 5.37556266784668} 11/06/2021 21:20:26 - INFO - __main__ - Step 454: {'lr': 0.00011325, 'samples': 87168, 'steps': 453, 'loss/train': 5.237241744995117} 11/06/2021 21:20:26 - INFO - __main__ - Step 455: {'lr': 0.00011350000000000001, 'samples': 87360, 'steps': 454, 'loss/train': 4.927133083343506} 11/06/2021 21:20:27 - INFO - __main__ - Step 456: {'lr': 0.00011375, 'samples': 87552, 'steps': 455, 'loss/train': 5.2103400230407715} 11/06/2021 21:20:28 - INFO - __main__ - Step 457: {'lr': 0.000114, 'samples': 87744, 'steps': 456, 'loss/train': 5.820415496826172} 11/06/2021 21:20:28 - INFO - __main__ - Step 458: {'lr': 0.00011425000000000001, 'samples': 87936, 'steps': 457, 'loss/train': 3.8978383541107178} 11/06/2021 21:20:29 - INFO - __main__ - Step 459: {'lr': 0.0001145, 'samples': 88128, 'steps': 458, 'loss/train': 6.301289081573486} 11/06/2021 21:20:29 - INFO - __main__ - Step 460: {'lr': 0.00011475000000000001, 'samples': 88320, 'steps': 459, 'loss/train': 5.069262504577637} 11/06/2021 21:20:30 - INFO - __main__ - Step 461: {'lr': 0.000115, 'samples': 88512, 'steps': 460, 'loss/train': 5.043389320373535} 11/06/2021 21:20:30 - INFO - __main__ - Step 462: {'lr': 0.00011525000000000001, 'samples': 88704, 'steps': 461, 'loss/train': 5.780721664428711} 11/06/2021 21:20:31 - INFO - __main__ - Step 463: {'lr': 0.0001155, 'samples': 88896, 'steps': 462, 'loss/train': 4.882776737213135} 11/06/2021 21:20:31 - INFO - __main__ - Step 464: {'lr': 0.00011575000000000001, 'samples': 89088, 'steps': 463, 'loss/train': 4.844412326812744} 11/06/2021 21:20:31 - INFO - __main__ - Step 465: {'lr': 0.00011600000000000001, 'samples': 89280, 'steps': 464, 'loss/train': 5.012292385101318} 11/06/2021 21:20:32 - INFO - __main__ - Step 466: {'lr': 0.00011625, 'samples': 89472, 'steps': 465, 'loss/train': 5.141354084014893} 11/06/2021 21:20:33 - INFO - __main__ - Step 467: {'lr': 0.00011650000000000001, 'samples': 89664, 'steps': 466, 'loss/train': 5.3636956214904785} 11/06/2021 21:20:33 - INFO - __main__ - Step 468: {'lr': 0.00011675, 'samples': 89856, 'steps': 467, 'loss/train': 5.613701820373535} 11/06/2021 21:20:33 - INFO - __main__ - Step 469: {'lr': 0.00011700000000000001, 'samples': 90048, 'steps': 468, 'loss/train': 4.824275493621826} 11/06/2021 21:20:34 - INFO - __main__ - Step 470: {'lr': 0.00011724999999999999, 'samples': 90240, 'steps': 469, 'loss/train': 4.927989482879639} 11/06/2021 21:20:34 - INFO - __main__ - Step 471: {'lr': 0.0001175, 'samples': 90432, 'steps': 470, 'loss/train': 5.5396223068237305} 11/06/2021 21:20:35 - INFO - __main__ - Step 472: {'lr': 0.00011775, 'samples': 90624, 'steps': 471, 'loss/train': 5.157430648803711} 11/06/2021 21:20:36 - INFO - __main__ - Step 473: {'lr': 0.000118, 'samples': 90816, 'steps': 472, 'loss/train': 4.571201324462891} 11/06/2021 21:20:36 - INFO - __main__ - Step 474: {'lr': 0.00011825, 'samples': 91008, 'steps': 473, 'loss/train': 5.168905735015869} 11/06/2021 21:20:36 - INFO - __main__ - Step 475: {'lr': 0.0001185, 'samples': 91200, 'steps': 474, 'loss/train': 5.4914021492004395} 11/06/2021 21:20:37 - INFO - __main__ - Step 476: {'lr': 0.00011875, 'samples': 91392, 'steps': 475, 'loss/train': 5.1100382804870605} 11/06/2021 21:20:38 - INFO - __main__ - Step 477: {'lr': 0.00011899999999999999, 'samples': 91584, 'steps': 476, 'loss/train': 5.027097225189209} 11/06/2021 21:20:38 - INFO - __main__ - Step 478: {'lr': 0.00011925, 'samples': 91776, 'steps': 477, 'loss/train': 3.3699212074279785} 11/06/2021 21:20:38 - INFO - __main__ - Step 479: {'lr': 0.00011949999999999999, 'samples': 91968, 'steps': 478, 'loss/train': 5.552117824554443} 11/06/2021 21:20:39 - INFO - __main__ - Step 480: {'lr': 0.00011975, 'samples': 92160, 'steps': 479, 'loss/train': 5.236664295196533} 11/06/2021 21:20:39 - INFO - __main__ - Step 481: {'lr': 0.00012, 'samples': 92352, 'steps': 480, 'loss/train': 5.218561172485352} 11/06/2021 21:20:40 - INFO - __main__ - Step 482: {'lr': 0.00012025, 'samples': 92544, 'steps': 481, 'loss/train': 4.788586139678955} 11/06/2021 21:20:41 - INFO - __main__ - Step 483: {'lr': 0.0001205, 'samples': 92736, 'steps': 482, 'loss/train': 4.982636451721191} 11/06/2021 21:20:41 - INFO - __main__ - Step 484: {'lr': 0.00012075, 'samples': 92928, 'steps': 483, 'loss/train': 4.840455055236816} 11/06/2021 21:20:41 - INFO - __main__ - Step 485: {'lr': 0.000121, 'samples': 93120, 'steps': 484, 'loss/train': 5.3554840087890625} 11/06/2021 21:20:42 - INFO - __main__ - Step 486: {'lr': 0.00012124999999999999, 'samples': 93312, 'steps': 485, 'loss/train': 5.182967185974121} 11/06/2021 21:20:43 - INFO - __main__ - Step 487: {'lr': 0.0001215, 'samples': 93504, 'steps': 486, 'loss/train': 5.115420818328857} 11/06/2021 21:20:43 - INFO - __main__ - Step 488: {'lr': 0.00012175, 'samples': 93696, 'steps': 487, 'loss/train': 5.1655683517456055} 11/06/2021 21:20:43 - INFO - __main__ - Step 489: {'lr': 0.000122, 'samples': 93888, 'steps': 488, 'loss/train': 5.405882835388184} 11/06/2021 21:20:44 - INFO - __main__ - Step 490: {'lr': 0.00012225, 'samples': 94080, 'steps': 489, 'loss/train': 5.127898693084717} 11/06/2021 21:20:44 - INFO - __main__ - Step 491: {'lr': 0.0001225, 'samples': 94272, 'steps': 490, 'loss/train': 5.024046421051025} 11/06/2021 21:20:45 - INFO - __main__ - Step 492: {'lr': 0.00012275, 'samples': 94464, 'steps': 491, 'loss/train': 5.145169734954834} 11/06/2021 21:20:45 - INFO - __main__ - Step 493: {'lr': 0.000123, 'samples': 94656, 'steps': 492, 'loss/train': 4.954655170440674} 11/06/2021 21:20:46 - INFO - __main__ - Step 494: {'lr': 0.00012325000000000001, 'samples': 94848, 'steps': 493, 'loss/train': 5.174392223358154} 11/06/2021 21:20:46 - INFO - __main__ - Step 495: {'lr': 0.0001235, 'samples': 95040, 'steps': 494, 'loss/train': 4.580348014831543} 11/06/2021 21:20:47 - INFO - __main__ - Step 496: {'lr': 0.00012375, 'samples': 95232, 'steps': 495, 'loss/train': 4.821667194366455} 11/06/2021 21:20:47 - INFO - __main__ - Step 497: {'lr': 0.000124, 'samples': 95424, 'steps': 496, 'loss/train': 5.254716873168945} 11/06/2021 21:20:48 - INFO - __main__ - Step 498: {'lr': 0.00012425, 'samples': 95616, 'steps': 497, 'loss/train': 4.732685565948486} 11/06/2021 21:20:48 - INFO - __main__ - Step 499: {'lr': 0.0001245, 'samples': 95808, 'steps': 498, 'loss/train': 5.048670291900635} 11/06/2021 21:20:49 - INFO - __main__ - Step 500: {'lr': 0.00012475, 'samples': 96000, 'steps': 499, 'loss/train': 4.834158420562744} 11/06/2021 21:20:49 - INFO - __main__ - Step 501: {'lr': 0.000125, 'samples': 96192, 'steps': 500, 'loss/train': 5.50537109375} 11/06/2021 21:20:49 - INFO - __main__ - Step 502: {'lr': 0.00012525, 'samples': 96384, 'steps': 501, 'loss/train': 4.96640157699585} 11/06/2021 21:20:50 - INFO - __main__ - Step 503: {'lr': 0.00012550000000000001, 'samples': 96576, 'steps': 502, 'loss/train': 4.255295753479004} 11/06/2021 21:20:51 - INFO - __main__ - Step 504: {'lr': 0.00012575, 'samples': 96768, 'steps': 503, 'loss/train': 3.4102871417999268} 11/06/2021 21:20:51 - INFO - __main__ - Step 505: {'lr': 0.000126, 'samples': 96960, 'steps': 504, 'loss/train': 5.220231056213379} 11/06/2021 21:20:51 - INFO - __main__ - Step 506: {'lr': 0.00012625, 'samples': 97152, 'steps': 505, 'loss/train': 4.843728542327881} 11/06/2021 21:20:52 - INFO - __main__ - Step 507: {'lr': 0.0001265, 'samples': 97344, 'steps': 506, 'loss/train': 4.380231857299805} 11/06/2021 21:20:53 - INFO - __main__ - Step 508: {'lr': 0.00012675, 'samples': 97536, 'steps': 507, 'loss/train': 4.899729251861572} 11/06/2021 21:20:54 - INFO - __main__ - Step 509: {'lr': 0.000127, 'samples': 97728, 'steps': 508, 'loss/train': 4.977561950683594} 11/06/2021 21:20:54 - INFO - __main__ - Step 510: {'lr': 0.00012725, 'samples': 97920, 'steps': 509, 'loss/train': 5.346168518066406} 11/06/2021 21:20:54 - INFO - __main__ - Step 511: {'lr': 0.0001275, 'samples': 98112, 'steps': 510, 'loss/train': 4.873198509216309} 11/06/2021 21:20:55 - INFO - __main__ - Step 512: {'lr': 0.00012775000000000002, 'samples': 98304, 'steps': 511, 'loss/train': 5.125716686248779} 11/06/2021 21:20:56 - INFO - __main__ - Step 513: {'lr': 0.000128, 'samples': 98496, 'steps': 512, 'loss/train': 5.299740314483643} 11/06/2021 21:20:56 - INFO - __main__ - Step 514: {'lr': 0.00012825, 'samples': 98688, 'steps': 513, 'loss/train': 5.17880916595459} 11/06/2021 21:20:57 - INFO - __main__ - Step 515: {'lr': 0.0001285, 'samples': 98880, 'steps': 514, 'loss/train': 5.38531494140625} 11/06/2021 21:20:57 - INFO - __main__ - Step 516: {'lr': 0.00012875, 'samples': 99072, 'steps': 515, 'loss/train': 5.111538410186768} 11/06/2021 21:20:58 - INFO - __main__ - Step 517: {'lr': 0.00012900000000000002, 'samples': 99264, 'steps': 516, 'loss/train': 4.971146583557129} 11/06/2021 21:20:59 - INFO - __main__ - Step 518: {'lr': 0.00012925, 'samples': 99456, 'steps': 517, 'loss/train': 4.8264899253845215} 11/06/2021 21:20:59 - INFO - __main__ - Step 519: {'lr': 0.0001295, 'samples': 99648, 'steps': 518, 'loss/train': 4.728095054626465} 11/06/2021 21:20:59 - INFO - __main__ - Step 520: {'lr': 0.00012975, 'samples': 99840, 'steps': 519, 'loss/train': 4.82402229309082} 11/06/2021 21:21:00 - INFO - __main__ - Step 521: {'lr': 0.00013000000000000002, 'samples': 100032, 'steps': 520, 'loss/train': 4.82277774810791} 11/06/2021 21:21:00 - INFO - __main__ - Step 522: {'lr': 0.00013025, 'samples': 100224, 'steps': 521, 'loss/train': 4.736476898193359} 11/06/2021 21:21:01 - INFO - __main__ - Step 523: {'lr': 0.0001305, 'samples': 100416, 'steps': 522, 'loss/train': 5.3957390785217285} 11/06/2021 21:21:02 - INFO - __main__ - Step 524: {'lr': 0.00013075, 'samples': 100608, 'steps': 523, 'loss/train': 4.804849624633789} 11/06/2021 21:21:02 - INFO - __main__ - Step 525: {'lr': 0.000131, 'samples': 100800, 'steps': 524, 'loss/train': 5.213521957397461} 11/06/2021 21:21:02 - INFO - __main__ - Step 526: {'lr': 0.00013125000000000002, 'samples': 100992, 'steps': 525, 'loss/train': 4.800134181976318} 11/06/2021 21:21:03 - INFO - __main__ - Step 527: {'lr': 0.0001315, 'samples': 101184, 'steps': 526, 'loss/train': 5.979576587677002} 11/06/2021 21:21:04 - INFO - __main__ - Step 528: {'lr': 0.00013175, 'samples': 101376, 'steps': 527, 'loss/train': 4.6841630935668945} 11/06/2021 21:21:04 - INFO - __main__ - Step 529: {'lr': 0.000132, 'samples': 101568, 'steps': 528, 'loss/train': 5.14123010635376} 11/06/2021 21:21:04 - INFO - __main__ - Step 530: {'lr': 0.00013225000000000002, 'samples': 101760, 'steps': 529, 'loss/train': 4.850307941436768} 11/06/2021 21:21:05 - INFO - __main__ - Step 531: {'lr': 0.00013250000000000002, 'samples': 101952, 'steps': 530, 'loss/train': 4.945707321166992} 11/06/2021 21:21:05 - INFO - __main__ - Step 532: {'lr': 0.00013275, 'samples': 102144, 'steps': 531, 'loss/train': 5.073093891143799} 11/06/2021 21:21:06 - INFO - __main__ - Step 533: {'lr': 0.000133, 'samples': 102336, 'steps': 532, 'loss/train': 4.880060195922852} 11/06/2021 21:21:07 - INFO - __main__ - Step 534: {'lr': 0.00013325, 'samples': 102528, 'steps': 533, 'loss/train': 5.960422039031982} 11/06/2021 21:21:07 - INFO - __main__ - Step 535: {'lr': 0.00013350000000000002, 'samples': 102720, 'steps': 534, 'loss/train': 4.578385829925537} 11/06/2021 21:21:07 - INFO - __main__ - Step 536: {'lr': 0.00013375, 'samples': 102912, 'steps': 535, 'loss/train': 5.22959566116333} 11/06/2021 21:21:08 - INFO - __main__ - Step 537: {'lr': 0.000134, 'samples': 103104, 'steps': 536, 'loss/train': 5.195213317871094} 11/06/2021 21:21:08 - INFO - __main__ - Step 538: {'lr': 0.00013425, 'samples': 103296, 'steps': 537, 'loss/train': 5.1764912605285645} 11/06/2021 21:21:09 - INFO - __main__ - Step 539: {'lr': 0.00013450000000000002, 'samples': 103488, 'steps': 538, 'loss/train': 4.329580783843994} 11/06/2021 21:21:09 - INFO - __main__ - Step 540: {'lr': 0.00013475000000000002, 'samples': 103680, 'steps': 539, 'loss/train': 5.327682971954346} 11/06/2021 21:21:10 - INFO - __main__ - Step 541: {'lr': 0.000135, 'samples': 103872, 'steps': 540, 'loss/train': 5.050226211547852} 11/06/2021 21:21:10 - INFO - __main__ - Step 542: {'lr': 0.00013525, 'samples': 104064, 'steps': 541, 'loss/train': 6.563281059265137} 11/06/2021 21:21:10 - INFO - __main__ - Step 543: {'lr': 0.00013550000000000001, 'samples': 104256, 'steps': 542, 'loss/train': 5.260067462921143} 11/06/2021 21:21:12 - INFO - __main__ - Step 544: {'lr': 0.00013575000000000002, 'samples': 104448, 'steps': 543, 'loss/train': 5.054920196533203} 11/06/2021 21:21:12 - INFO - __main__ - Step 545: {'lr': 0.00013600000000000003, 'samples': 104640, 'steps': 544, 'loss/train': 4.304242134094238} 11/06/2021 21:21:12 - INFO - __main__ - Step 546: {'lr': 0.00013625, 'samples': 104832, 'steps': 545, 'loss/train': 5.143182277679443} 11/06/2021 21:21:13 - INFO - __main__ - Step 547: {'lr': 0.0001365, 'samples': 105024, 'steps': 546, 'loss/train': 5.063644886016846} 11/06/2021 21:21:13 - INFO - __main__ - Step 548: {'lr': 0.00013675000000000002, 'samples': 105216, 'steps': 547, 'loss/train': 4.523825168609619} 11/06/2021 21:21:14 - INFO - __main__ - Step 549: {'lr': 0.00013700000000000002, 'samples': 105408, 'steps': 548, 'loss/train': 5.415563583374023} 11/06/2021 21:21:14 - INFO - __main__ - Step 550: {'lr': 0.00013725, 'samples': 105600, 'steps': 549, 'loss/train': 5.242093086242676} 11/06/2021 21:21:15 - INFO - __main__ - Step 551: {'lr': 0.0001375, 'samples': 105792, 'steps': 550, 'loss/train': 5.3217949867248535} 11/06/2021 21:21:15 - INFO - __main__ - Step 552: {'lr': 0.00013775000000000001, 'samples': 105984, 'steps': 551, 'loss/train': 4.965686798095703} 11/06/2021 21:21:15 - INFO - __main__ - Step 553: {'lr': 0.00013800000000000002, 'samples': 106176, 'steps': 552, 'loss/train': 4.69637393951416} 11/06/2021 21:21:16 - INFO - __main__ - Step 554: {'lr': 0.00013825000000000003, 'samples': 106368, 'steps': 553, 'loss/train': 5.67413854598999} 11/06/2021 21:21:17 - INFO - __main__ - Step 555: {'lr': 0.0001385, 'samples': 106560, 'steps': 554, 'loss/train': 1.4391024112701416} 11/06/2021 21:21:17 - INFO - __main__ - Step 556: {'lr': 0.00013875, 'samples': 106752, 'steps': 555, 'loss/train': 4.693598747253418} 11/06/2021 21:21:18 - INFO - __main__ - Step 557: {'lr': 0.00013900000000000002, 'samples': 106944, 'steps': 556, 'loss/train': 4.811655521392822} 11/06/2021 21:21:18 - INFO - __main__ - Step 558: {'lr': 0.00013925000000000002, 'samples': 107136, 'steps': 557, 'loss/train': 5.708633899688721} 11/06/2021 21:21:18 - INFO - __main__ - Step 559: {'lr': 0.0001395, 'samples': 107328, 'steps': 558, 'loss/train': 5.34543514251709} 11/06/2021 21:21:19 - INFO - __main__ - Step 560: {'lr': 0.00013975, 'samples': 107520, 'steps': 559, 'loss/train': 4.994585990905762} 11/06/2021 21:21:20 - INFO - __main__ - Step 561: {'lr': 0.00014000000000000001, 'samples': 107712, 'steps': 560, 'loss/train': 4.977428913116455} 11/06/2021 21:21:20 - INFO - __main__ - Step 562: {'lr': 0.00014025000000000002, 'samples': 107904, 'steps': 561, 'loss/train': 5.134328365325928} 11/06/2021 21:21:20 - INFO - __main__ - Step 563: {'lr': 0.00014050000000000003, 'samples': 108096, 'steps': 562, 'loss/train': 5.075902462005615} 11/06/2021 21:21:21 - INFO - __main__ - Step 564: {'lr': 0.00014074999999999998, 'samples': 108288, 'steps': 563, 'loss/train': 4.605697154998779} 11/06/2021 21:21:22 - INFO - __main__ - Step 565: {'lr': 0.00014099999999999998, 'samples': 108480, 'steps': 564, 'loss/train': 4.873136520385742} 11/06/2021 21:21:22 - INFO - __main__ - Step 566: {'lr': 0.00014125, 'samples': 108672, 'steps': 565, 'loss/train': 4.94666862487793} 11/06/2021 21:21:22 - INFO - __main__ - Step 567: {'lr': 0.0001415, 'samples': 108864, 'steps': 566, 'loss/train': 4.661070823669434} 11/06/2021 21:21:23 - INFO - __main__ - Step 568: {'lr': 0.00014175, 'samples': 109056, 'steps': 567, 'loss/train': 5.004296779632568} 11/06/2021 21:21:23 - INFO - __main__ - Step 569: {'lr': 0.00014199999999999998, 'samples': 109248, 'steps': 568, 'loss/train': 4.945058345794678} 11/06/2021 21:21:24 - INFO - __main__ - Step 570: {'lr': 0.00014225, 'samples': 109440, 'steps': 569, 'loss/train': 4.626468658447266} 11/06/2021 21:21:25 - INFO - __main__ - Step 571: {'lr': 0.0001425, 'samples': 109632, 'steps': 570, 'loss/train': 4.994494915008545} 11/06/2021 21:21:25 - INFO - __main__ - Step 572: {'lr': 0.00014275, 'samples': 109824, 'steps': 571, 'loss/train': 5.284569263458252} 11/06/2021 21:21:26 - INFO - __main__ - Step 573: {'lr': 0.00014299999999999998, 'samples': 110016, 'steps': 572, 'loss/train': 5.315004348754883} 11/06/2021 21:21:26 - INFO - __main__ - Step 574: {'lr': 0.00014324999999999999, 'samples': 110208, 'steps': 573, 'loss/train': 4.810165882110596} 11/06/2021 21:21:27 - INFO - __main__ - Step 575: {'lr': 0.0001435, 'samples': 110400, 'steps': 574, 'loss/train': 5.227445125579834} 11/06/2021 21:21:27 - INFO - __main__ - Step 576: {'lr': 0.00014375, 'samples': 110592, 'steps': 575, 'loss/train': 5.34274435043335} 11/06/2021 21:21:28 - INFO - __main__ - Step 577: {'lr': 0.000144, 'samples': 110784, 'steps': 576, 'loss/train': 5.176715850830078} 11/06/2021 21:21:28 - INFO - __main__ - Step 578: {'lr': 0.00014424999999999998, 'samples': 110976, 'steps': 577, 'loss/train': 4.281363487243652} 11/06/2021 21:21:28 - INFO - __main__ - Step 579: {'lr': 0.0001445, 'samples': 111168, 'steps': 578, 'loss/train': 5.460202217102051} 11/06/2021 21:21:29 - INFO - __main__ - Step 580: {'lr': 0.00014475, 'samples': 111360, 'steps': 579, 'loss/train': 5.397484302520752} 11/06/2021 21:21:30 - INFO - __main__ - Step 581: {'lr': 0.000145, 'samples': 111552, 'steps': 580, 'loss/train': 4.4818291664123535} 11/06/2021 21:21:30 - INFO - __main__ - Step 582: {'lr': 0.00014524999999999998, 'samples': 111744, 'steps': 581, 'loss/train': 4.609002590179443} 11/06/2021 21:21:31 - INFO - __main__ - Step 583: {'lr': 0.00014549999999999999, 'samples': 111936, 'steps': 582, 'loss/train': 5.447309970855713} 11/06/2021 21:21:31 - INFO - __main__ - Step 584: {'lr': 0.00014575, 'samples': 112128, 'steps': 583, 'loss/train': 5.413631439208984} 11/06/2021 21:21:32 - INFO - __main__ - Step 585: {'lr': 0.000146, 'samples': 112320, 'steps': 584, 'loss/train': 6.304897308349609} 11/06/2021 21:21:32 - INFO - __main__ - Step 586: {'lr': 0.00014625, 'samples': 112512, 'steps': 585, 'loss/train': 3.9225189685821533} 11/06/2021 21:21:33 - INFO - __main__ - Step 587: {'lr': 0.00014649999999999998, 'samples': 112704, 'steps': 586, 'loss/train': 4.575343132019043} 11/06/2021 21:21:33 - INFO - __main__ - Step 588: {'lr': 0.00014675, 'samples': 112896, 'steps': 587, 'loss/train': 4.97996711730957} 11/06/2021 21:21:34 - INFO - __main__ - Step 589: {'lr': 0.000147, 'samples': 113088, 'steps': 588, 'loss/train': 5.520531177520752} 11/06/2021 21:21:35 - INFO - __main__ - Step 590: {'lr': 0.00014725, 'samples': 113280, 'steps': 589, 'loss/train': 4.435773849487305} 11/06/2021 21:21:35 - INFO - __main__ - Step 591: {'lr': 0.0001475, 'samples': 113472, 'steps': 590, 'loss/train': 5.016930103302002} 11/06/2021 21:21:35 - INFO - __main__ - Step 592: {'lr': 0.00014774999999999999, 'samples': 113664, 'steps': 591, 'loss/train': 4.882648468017578} 11/06/2021 21:21:36 - INFO - __main__ - Step 593: {'lr': 0.000148, 'samples': 113856, 'steps': 592, 'loss/train': 6.573506832122803} 11/06/2021 21:21:36 - INFO - __main__ - Step 594: {'lr': 0.00014825, 'samples': 114048, 'steps': 593, 'loss/train': 4.22829532623291} 11/06/2021 21:21:36 - INFO - __main__ - Step 595: {'lr': 0.0001485, 'samples': 114240, 'steps': 594, 'loss/train': 4.915690898895264} 11/06/2021 21:21:37 - INFO - __main__ - Step 596: {'lr': 0.00014874999999999998, 'samples': 114432, 'steps': 595, 'loss/train': 4.756763458251953} 11/06/2021 21:21:38 - INFO - __main__ - Step 597: {'lr': 0.000149, 'samples': 114624, 'steps': 596, 'loss/train': 4.910470485687256} 11/06/2021 21:21:38 - INFO - __main__ - Step 598: {'lr': 0.00014925, 'samples': 114816, 'steps': 597, 'loss/train': 4.912417411804199} 11/06/2021 21:21:38 - INFO - __main__ - Step 599: {'lr': 0.0001495, 'samples': 115008, 'steps': 598, 'loss/train': 4.568467140197754} 11/06/2021 21:21:39 - INFO - __main__ - Step 600: {'lr': 0.00014975, 'samples': 115200, 'steps': 599, 'loss/train': 4.727468967437744} 11/06/2021 21:21:40 - INFO - __main__ - Step 601: {'lr': 0.00015, 'samples': 115392, 'steps': 600, 'loss/train': 4.731727123260498} 11/06/2021 21:21:40 - INFO - __main__ - Step 602: {'lr': 0.00015025, 'samples': 115584, 'steps': 601, 'loss/train': 5.941672325134277} 11/06/2021 21:21:41 - INFO - __main__ - Step 603: {'lr': 0.0001505, 'samples': 115776, 'steps': 602, 'loss/train': 4.752991676330566} 11/06/2021 21:21:41 - INFO - __main__ - Step 604: {'lr': 0.00015075, 'samples': 115968, 'steps': 603, 'loss/train': 4.749768257141113} 11/06/2021 21:21:41 - INFO - __main__ - Step 605: {'lr': 0.000151, 'samples': 116160, 'steps': 604, 'loss/train': 4.868316650390625} 11/06/2021 21:21:43 - INFO - __main__ - Step 606: {'lr': 0.00015125, 'samples': 116352, 'steps': 605, 'loss/train': 4.696784973144531} 11/06/2021 21:21:43 - INFO - __main__ - Step 607: {'lr': 0.0001515, 'samples': 116544, 'steps': 606, 'loss/train': 5.38928747177124} 11/06/2021 21:21:43 - INFO - __main__ - Step 608: {'lr': 0.00015175, 'samples': 116736, 'steps': 607, 'loss/train': 4.822604179382324} 11/06/2021 21:21:44 - INFO - __main__ - Step 609: {'lr': 0.000152, 'samples': 116928, 'steps': 608, 'loss/train': 4.739031791687012} 11/06/2021 21:21:44 - INFO - __main__ - Step 610: {'lr': 0.00015225, 'samples': 117120, 'steps': 609, 'loss/train': 4.78218412399292} 11/06/2021 21:21:44 - INFO - __main__ - Step 611: {'lr': 0.0001525, 'samples': 117312, 'steps': 610, 'loss/train': 5.469936370849609} 11/06/2021 21:21:46 - INFO - __main__ - Step 612: {'lr': 0.00015275, 'samples': 117504, 'steps': 611, 'loss/train': 5.018012523651123} 11/06/2021 21:21:46 - INFO - __main__ - Step 613: {'lr': 0.000153, 'samples': 117696, 'steps': 612, 'loss/train': 4.723076343536377} 11/06/2021 21:21:46 - INFO - __main__ - Step 614: {'lr': 0.00015325, 'samples': 117888, 'steps': 613, 'loss/train': 4.444612503051758} 11/06/2021 21:21:47 - INFO - __main__ - Step 615: {'lr': 0.0001535, 'samples': 118080, 'steps': 614, 'loss/train': 5.002647876739502} 11/06/2021 21:21:47 - INFO - __main__ - Step 616: {'lr': 0.00015375, 'samples': 118272, 'steps': 615, 'loss/train': 5.234433174133301} 11/06/2021 21:21:48 - INFO - __main__ - Step 617: {'lr': 0.000154, 'samples': 118464, 'steps': 616, 'loss/train': 5.180642604827881} 11/06/2021 21:21:48 - INFO - __main__ - Step 618: {'lr': 0.00015425, 'samples': 118656, 'steps': 617, 'loss/train': 4.186741828918457} 11/06/2021 21:21:49 - INFO - __main__ - Step 619: {'lr': 0.00015450000000000001, 'samples': 118848, 'steps': 618, 'loss/train': 4.601840019226074} 11/06/2021 21:21:49 - INFO - __main__ - Step 620: {'lr': 0.00015475, 'samples': 119040, 'steps': 619, 'loss/train': 5.070303440093994} 11/06/2021 21:21:50 - INFO - __main__ - Step 621: {'lr': 0.000155, 'samples': 119232, 'steps': 620, 'loss/train': 4.981260776519775} 11/06/2021 21:21:50 - INFO - __main__ - Step 622: {'lr': 0.00015525, 'samples': 119424, 'steps': 621, 'loss/train': 4.912421703338623} 11/06/2021 21:21:51 - INFO - __main__ - Step 623: {'lr': 0.0001555, 'samples': 119616, 'steps': 622, 'loss/train': 4.604945182800293} 11/06/2021 21:21:51 - INFO - __main__ - Step 624: {'lr': 0.00015575, 'samples': 119808, 'steps': 623, 'loss/train': 4.606297969818115} 11/06/2021 21:21:52 - INFO - __main__ - Step 625: {'lr': 0.000156, 'samples': 120000, 'steps': 624, 'loss/train': 4.950876235961914} 11/06/2021 21:21:52 - INFO - __main__ - Step 626: {'lr': 0.00015625, 'samples': 120192, 'steps': 625, 'loss/train': 4.56068229675293} 11/06/2021 21:21:52 - INFO - __main__ - Step 627: {'lr': 0.0001565, 'samples': 120384, 'steps': 626, 'loss/train': 4.22480583190918} 11/06/2021 21:21:53 - INFO - __main__ - Step 628: {'lr': 0.00015675000000000002, 'samples': 120576, 'steps': 627, 'loss/train': 5.061352729797363} 11/06/2021 21:21:54 - INFO - __main__ - Step 629: {'lr': 0.000157, 'samples': 120768, 'steps': 628, 'loss/train': 4.828984260559082} 11/06/2021 21:21:54 - INFO - __main__ - Step 630: {'lr': 0.00015725, 'samples': 120960, 'steps': 629, 'loss/train': 4.675282955169678} 11/06/2021 21:21:55 - INFO - __main__ - Step 631: {'lr': 0.0001575, 'samples': 121152, 'steps': 630, 'loss/train': 4.3496575355529785} 11/06/2021 21:21:55 - INFO - __main__ - Step 632: {'lr': 0.00015775, 'samples': 121344, 'steps': 631, 'loss/train': 4.719287872314453} 11/06/2021 21:21:56 - INFO - __main__ - Step 633: {'lr': 0.000158, 'samples': 121536, 'steps': 632, 'loss/train': 4.833708763122559} 11/06/2021 21:21:56 - INFO - __main__ - Step 634: {'lr': 0.00015825, 'samples': 121728, 'steps': 633, 'loss/train': 4.082372665405273} 11/06/2021 21:21:57 - INFO - __main__ - Step 635: {'lr': 0.0001585, 'samples': 121920, 'steps': 634, 'loss/train': 4.89099645614624} 11/06/2021 21:21:57 - INFO - __main__ - Step 636: {'lr': 0.00015875, 'samples': 122112, 'steps': 635, 'loss/train': 5.596816062927246} 11/06/2021 21:21:57 - INFO - __main__ - Step 637: {'lr': 0.00015900000000000002, 'samples': 122304, 'steps': 636, 'loss/train': 4.681394100189209} 11/06/2021 21:21:58 - INFO - __main__ - Step 638: {'lr': 0.00015925, 'samples': 122496, 'steps': 637, 'loss/train': 4.834127426147461} 11/06/2021 21:21:59 - INFO - __main__ - Step 639: {'lr': 0.0001595, 'samples': 122688, 'steps': 638, 'loss/train': 5.277633190155029} 11/06/2021 21:21:59 - INFO - __main__ - Step 640: {'lr': 0.00015975, 'samples': 122880, 'steps': 639, 'loss/train': 5.002859115600586} 11/06/2021 21:21:59 - INFO - __main__ - Step 641: {'lr': 0.00016, 'samples': 123072, 'steps': 640, 'loss/train': 5.2396440505981445} 11/06/2021 21:22:00 - INFO - __main__ - Step 642: {'lr': 0.00016025000000000002, 'samples': 123264, 'steps': 641, 'loss/train': 5.183033466339111} 11/06/2021 21:22:01 - INFO - __main__ - Step 643: {'lr': 0.0001605, 'samples': 123456, 'steps': 642, 'loss/train': 4.6580305099487305} 11/06/2021 21:22:01 - INFO - __main__ - Step 644: {'lr': 0.00016075, 'samples': 123648, 'steps': 643, 'loss/train': 4.906527519226074} 11/06/2021 21:22:02 - INFO - __main__ - Step 645: {'lr': 0.000161, 'samples': 123840, 'steps': 644, 'loss/train': 4.350686073303223} 11/06/2021 21:22:02 - INFO - __main__ - Step 646: {'lr': 0.00016125000000000002, 'samples': 124032, 'steps': 645, 'loss/train': 4.576035499572754} 11/06/2021 21:22:02 - INFO - __main__ - Step 647: {'lr': 0.0001615, 'samples': 124224, 'steps': 646, 'loss/train': 4.445610046386719} 11/06/2021 21:22:03 - INFO - __main__ - Step 648: {'lr': 0.00016175, 'samples': 124416, 'steps': 647, 'loss/train': 4.549352645874023} 11/06/2021 21:22:04 - INFO - __main__ - Step 649: {'lr': 0.000162, 'samples': 124608, 'steps': 648, 'loss/train': 4.443591117858887} 11/06/2021 21:22:04 - INFO - __main__ - Step 650: {'lr': 0.00016225000000000001, 'samples': 124800, 'steps': 649, 'loss/train': 4.2263288497924805} 11/06/2021 21:22:04 - INFO - __main__ - Step 651: {'lr': 0.00016250000000000002, 'samples': 124992, 'steps': 650, 'loss/train': 4.449684143066406} 11/06/2021 21:22:05 - INFO - __main__ - Step 652: {'lr': 0.00016275, 'samples': 125184, 'steps': 651, 'loss/train': 5.221400737762451} 11/06/2021 21:22:06 - INFO - __main__ - Step 653: {'lr': 0.000163, 'samples': 125376, 'steps': 652, 'loss/train': 5.1638994216918945} 11/06/2021 21:22:06 - INFO - __main__ - Step 654: {'lr': 0.00016325, 'samples': 125568, 'steps': 653, 'loss/train': 4.728190898895264} 11/06/2021 21:22:06 - INFO - __main__ - Step 655: {'lr': 0.00016350000000000002, 'samples': 125760, 'steps': 654, 'loss/train': 4.6709394454956055} 11/06/2021 21:22:07 - INFO - __main__ - Step 656: {'lr': 0.00016375000000000002, 'samples': 125952, 'steps': 655, 'loss/train': 4.627468109130859} 11/06/2021 21:22:07 - INFO - __main__ - Step 657: {'lr': 0.000164, 'samples': 126144, 'steps': 656, 'loss/train': 4.362966060638428} 11/06/2021 21:22:08 - INFO - __main__ - Step 658: {'lr': 0.00016425, 'samples': 126336, 'steps': 657, 'loss/train': 5.120889663696289} 11/06/2021 21:22:08 - INFO - __main__ - Step 659: {'lr': 0.00016450000000000001, 'samples': 126528, 'steps': 658, 'loss/train': 4.917827129364014} 11/06/2021 21:22:09 - INFO - __main__ - Step 660: {'lr': 0.00016475000000000002, 'samples': 126720, 'steps': 659, 'loss/train': 4.704123020172119} 11/06/2021 21:22:09 - INFO - __main__ - Step 661: {'lr': 0.000165, 'samples': 126912, 'steps': 660, 'loss/train': 4.710036277770996} 11/06/2021 21:22:10 - INFO - __main__ - Step 662: {'lr': 0.00016525, 'samples': 127104, 'steps': 661, 'loss/train': 4.979597091674805} 11/06/2021 21:22:10 - INFO - __main__ - Step 663: {'lr': 0.0001655, 'samples': 127296, 'steps': 662, 'loss/train': 4.606518268585205} 11/06/2021 21:22:11 - INFO - __main__ - Step 664: {'lr': 0.00016575000000000002, 'samples': 127488, 'steps': 663, 'loss/train': 4.490879058837891} 11/06/2021 21:22:11 - INFO - __main__ - Step 665: {'lr': 0.00016600000000000002, 'samples': 127680, 'steps': 664, 'loss/train': 4.3809075355529785} 11/06/2021 21:22:12 - INFO - __main__ - Step 666: {'lr': 0.00016625, 'samples': 127872, 'steps': 665, 'loss/train': 4.735256195068359} 11/06/2021 21:22:12 - INFO - __main__ - Step 667: {'lr': 0.0001665, 'samples': 128064, 'steps': 666, 'loss/train': 4.7257890701293945} 11/06/2021 21:22:12 - INFO - __main__ - Step 668: {'lr': 0.00016675000000000001, 'samples': 128256, 'steps': 667, 'loss/train': 4.959212303161621} 11/06/2021 21:22:13 - INFO - __main__ - Step 669: {'lr': 0.00016700000000000002, 'samples': 128448, 'steps': 668, 'loss/train': 4.34332275390625} 11/06/2021 21:22:14 - INFO - __main__ - Step 670: {'lr': 0.00016725000000000003, 'samples': 128640, 'steps': 669, 'loss/train': 4.633439064025879} 11/06/2021 21:22:14 - INFO - __main__ - Step 671: {'lr': 0.0001675, 'samples': 128832, 'steps': 670, 'loss/train': 4.631658554077148} 11/06/2021 21:22:15 - INFO - __main__ - Step 672: {'lr': 0.00016775, 'samples': 129024, 'steps': 671, 'loss/train': 4.435437202453613} 11/06/2021 21:22:15 - INFO - __main__ - Step 673: {'lr': 0.00016800000000000002, 'samples': 129216, 'steps': 672, 'loss/train': 3.3415842056274414} 11/06/2021 21:22:16 - INFO - __main__ - Step 674: {'lr': 0.00016825000000000002, 'samples': 129408, 'steps': 673, 'loss/train': 4.65171480178833} 11/06/2021 21:22:16 - INFO - __main__ - Step 675: {'lr': 0.0001685, 'samples': 129600, 'steps': 674, 'loss/train': 4.98203182220459} 11/06/2021 21:22:17 - INFO - __main__ - Step 676: {'lr': 0.00016875, 'samples': 129792, 'steps': 675, 'loss/train': 5.456784248352051} 11/06/2021 21:22:17 - INFO - __main__ - Step 677: {'lr': 0.00016900000000000002, 'samples': 129984, 'steps': 676, 'loss/train': 4.679195880889893} 11/06/2021 21:22:17 - INFO - __main__ - Step 678: {'lr': 0.00016925000000000002, 'samples': 130176, 'steps': 677, 'loss/train': 4.434994697570801} 11/06/2021 21:22:18 - INFO - __main__ - Step 679: {'lr': 0.00016950000000000003, 'samples': 130368, 'steps': 678, 'loss/train': 4.650574684143066} 11/06/2021 21:22:19 - INFO - __main__ - Step 680: {'lr': 0.00016975, 'samples': 130560, 'steps': 679, 'loss/train': 4.444644451141357} 11/06/2021 21:22:19 - INFO - __main__ - Step 681: {'lr': 0.00017, 'samples': 130752, 'steps': 680, 'loss/train': 4.423855781555176} 11/06/2021 21:22:20 - INFO - __main__ - Step 682: {'lr': 0.00017025000000000002, 'samples': 130944, 'steps': 681, 'loss/train': 4.7202301025390625} 11/06/2021 21:22:20 - INFO - __main__ - Step 683: {'lr': 0.00017050000000000002, 'samples': 131136, 'steps': 682, 'loss/train': 4.4818854331970215} 11/06/2021 21:22:21 - INFO - __main__ - Step 684: {'lr': 0.00017075, 'samples': 131328, 'steps': 683, 'loss/train': 5.359105587005615} 11/06/2021 21:22:21 - INFO - __main__ - Step 685: {'lr': 0.000171, 'samples': 131520, 'steps': 684, 'loss/train': 4.730793476104736} 11/06/2021 21:22:22 - INFO - __main__ - Step 686: {'lr': 0.00017125000000000002, 'samples': 131712, 'steps': 685, 'loss/train': 4.558211803436279} 11/06/2021 21:22:22 - INFO - __main__ - Step 687: {'lr': 0.00017150000000000002, 'samples': 131904, 'steps': 686, 'loss/train': 4.429317951202393} 11/06/2021 21:22:22 - INFO - __main__ - Step 688: {'lr': 0.00017175000000000003, 'samples': 132096, 'steps': 687, 'loss/train': 4.6293816566467285} 11/06/2021 21:22:24 - INFO - __main__ - Step 689: {'lr': 0.00017199999999999998, 'samples': 132288, 'steps': 688, 'loss/train': 5.065909385681152} 11/06/2021 21:22:24 - INFO - __main__ - Step 690: {'lr': 0.00017224999999999999, 'samples': 132480, 'steps': 689, 'loss/train': 4.581766605377197} 11/06/2021 21:22:24 - INFO - __main__ - Step 691: {'lr': 0.0001725, 'samples': 132672, 'steps': 690, 'loss/train': 4.165865421295166} 11/06/2021 21:22:25 - INFO - __main__ - Step 692: {'lr': 0.00017275, 'samples': 132864, 'steps': 691, 'loss/train': 4.4996209144592285} 11/06/2021 21:22:25 - INFO - __main__ - Step 693: {'lr': 0.000173, 'samples': 133056, 'steps': 692, 'loss/train': 4.5448150634765625} 11/06/2021 21:22:25 - INFO - __main__ - Step 694: {'lr': 0.00017324999999999998, 'samples': 133248, 'steps': 693, 'loss/train': 3.898272752761841} 11/06/2021 21:22:26 - INFO - __main__ - Step 695: {'lr': 0.0001735, 'samples': 133440, 'steps': 694, 'loss/train': 4.2713165283203125} 11/06/2021 21:22:27 - INFO - __main__ - Step 696: {'lr': 0.00017375, 'samples': 133632, 'steps': 695, 'loss/train': 5.151446342468262} 11/06/2021 21:22:27 - INFO - __main__ - Step 697: {'lr': 0.000174, 'samples': 133824, 'steps': 696, 'loss/train': 4.418706893920898} 11/06/2021 21:22:28 - INFO - __main__ - Step 698: {'lr': 0.00017424999999999998, 'samples': 134016, 'steps': 697, 'loss/train': 4.960476875305176} 11/06/2021 21:22:28 - INFO - __main__ - Step 699: {'lr': 0.00017449999999999999, 'samples': 134208, 'steps': 698, 'loss/train': 4.722470760345459} 11/06/2021 21:22:30 - INFO - __main__ - Step 700: {'lr': 0.00017475, 'samples': 134400, 'steps': 699, 'loss/train': 4.800037384033203} 11/06/2021 21:22:30 - INFO - __main__ - Step 701: {'lr': 0.000175, 'samples': 134592, 'steps': 700, 'loss/train': 4.3655500411987305} 11/06/2021 21:22:30 - INFO - __main__ - Step 702: {'lr': 0.00017525, 'samples': 134784, 'steps': 701, 'loss/train': 4.725539684295654} 11/06/2021 21:22:31 - INFO - __main__ - Step 703: {'lr': 0.00017549999999999998, 'samples': 134976, 'steps': 702, 'loss/train': 4.290457248687744} 11/06/2021 21:22:31 - INFO - __main__ - Step 704: {'lr': 0.00017575, 'samples': 135168, 'steps': 703, 'loss/train': 4.849248886108398} 11/06/2021 21:22:31 - INFO - __main__ - Step 705: {'lr': 0.000176, 'samples': 135360, 'steps': 704, 'loss/train': 2.431931734085083} 11/06/2021 21:22:32 - INFO - __main__ - Step 706: {'lr': 0.00017625, 'samples': 135552, 'steps': 705, 'loss/train': 2.3060343265533447} 11/06/2021 21:22:33 - INFO - __main__ - Step 707: {'lr': 0.00017649999999999998, 'samples': 135744, 'steps': 706, 'loss/train': 2.4550411701202393} 11/06/2021 21:22:33 - INFO - __main__ - Step 708: {'lr': 0.00017675, 'samples': 135936, 'steps': 707, 'loss/train': 4.702536106109619} 11/06/2021 21:22:33 - INFO - __main__ - Step 709: {'lr': 0.000177, 'samples': 136128, 'steps': 708, 'loss/train': 4.986819267272949} 11/06/2021 21:22:34 - INFO - __main__ - Step 710: {'lr': 0.00017725, 'samples': 136320, 'steps': 709, 'loss/train': 4.857694149017334} 11/06/2021 21:22:34 - INFO - __main__ - Step 711: {'lr': 0.0001775, 'samples': 136512, 'steps': 710, 'loss/train': 4.735157012939453} 11/06/2021 21:22:35 - INFO - __main__ - Step 712: {'lr': 0.00017774999999999998, 'samples': 136704, 'steps': 711, 'loss/train': 4.753922462463379} 11/06/2021 21:22:36 - INFO - __main__ - Step 713: {'lr': 0.000178, 'samples': 136896, 'steps': 712, 'loss/train': 4.611956596374512} 11/06/2021 21:22:36 - INFO - __main__ - Step 714: {'lr': 0.00017825, 'samples': 137088, 'steps': 713, 'loss/train': 4.634572982788086} 11/06/2021 21:22:36 - INFO - __main__ - Step 715: {'lr': 0.0001785, 'samples': 137280, 'steps': 714, 'loss/train': 4.737353801727295} 11/06/2021 21:22:37 - INFO - __main__ - Step 716: {'lr': 0.00017875, 'samples': 137472, 'steps': 715, 'loss/train': 4.557127475738525} 11/06/2021 21:22:38 - INFO - __main__ - Step 717: {'lr': 0.000179, 'samples': 137664, 'steps': 716, 'loss/train': 5.090901851654053} 11/06/2021 21:22:38 - INFO - __main__ - Step 718: {'lr': 0.00017925, 'samples': 137856, 'steps': 717, 'loss/train': 4.926265716552734} 11/06/2021 21:22:39 - INFO - __main__ - Step 719: {'lr': 0.0001795, 'samples': 138048, 'steps': 718, 'loss/train': 4.575839519500732} 11/06/2021 21:22:39 - INFO - __main__ - Step 720: {'lr': 0.00017975, 'samples': 138240, 'steps': 719, 'loss/train': 4.539857864379883} 11/06/2021 21:22:39 - INFO - __main__ - Step 721: {'lr': 0.00017999999999999998, 'samples': 138432, 'steps': 720, 'loss/train': 5.017696380615234} 11/06/2021 21:22:41 - INFO - __main__ - Step 722: {'lr': 0.00018025, 'samples': 138624, 'steps': 721, 'loss/train': 4.860723495483398} 11/06/2021 21:22:41 - INFO - __main__ - Step 723: {'lr': 0.0001805, 'samples': 138816, 'steps': 722, 'loss/train': 4.767000675201416} 11/06/2021 21:22:41 - INFO - __main__ - Step 724: {'lr': 0.00018075, 'samples': 139008, 'steps': 723, 'loss/train': 4.792560577392578} 11/06/2021 21:22:42 - INFO - __main__ - Step 725: {'lr': 0.000181, 'samples': 139200, 'steps': 724, 'loss/train': 4.775363922119141} 11/06/2021 21:22:42 - INFO - __main__ - Step 726: {'lr': 0.00018125, 'samples': 139392, 'steps': 725, 'loss/train': 4.8343377113342285} 11/06/2021 21:22:42 - INFO - __main__ - Step 727: {'lr': 0.0001815, 'samples': 139584, 'steps': 726, 'loss/train': 4.501916885375977} 11/06/2021 21:22:43 - INFO - __main__ - Step 728: {'lr': 0.00018175, 'samples': 139776, 'steps': 727, 'loss/train': 4.543506145477295} 11/06/2021 21:22:44 - INFO - __main__ - Step 729: {'lr': 0.000182, 'samples': 139968, 'steps': 728, 'loss/train': 4.346724033355713} 11/06/2021 21:22:44 - INFO - __main__ - Step 730: {'lr': 0.00018225, 'samples': 140160, 'steps': 729, 'loss/train': 4.893301486968994} 11/06/2021 21:22:44 - INFO - __main__ - Step 731: {'lr': 0.0001825, 'samples': 140352, 'steps': 730, 'loss/train': 4.508211612701416} 11/06/2021 21:22:45 - INFO - __main__ - Step 732: {'lr': 0.00018275, 'samples': 140544, 'steps': 731, 'loss/train': 4.329798221588135} 11/06/2021 21:22:46 - INFO - __main__ - Step 733: {'lr': 0.000183, 'samples': 140736, 'steps': 732, 'loss/train': 4.946159362792969} 11/06/2021 21:22:46 - INFO - __main__ - Step 734: {'lr': 0.00018325, 'samples': 140928, 'steps': 733, 'loss/train': 4.753419399261475} 11/06/2021 21:22:47 - INFO - __main__ - Step 735: {'lr': 0.0001835, 'samples': 141120, 'steps': 734, 'loss/train': 5.069578170776367} 11/06/2021 21:22:47 - INFO - __main__ - Step 736: {'lr': 0.00018375, 'samples': 141312, 'steps': 735, 'loss/train': 5.126845359802246} 11/06/2021 21:22:47 - INFO - __main__ - Step 737: {'lr': 0.000184, 'samples': 141504, 'steps': 736, 'loss/train': 4.666621685028076} 11/06/2021 21:22:48 - INFO - __main__ - Step 738: {'lr': 0.00018425, 'samples': 141696, 'steps': 737, 'loss/train': 4.2837958335876465} 11/06/2021 21:22:49 - INFO - __main__ - Step 739: {'lr': 0.0001845, 'samples': 141888, 'steps': 738, 'loss/train': 4.565089702606201} 11/06/2021 21:22:49 - INFO - __main__ - Step 740: {'lr': 0.00018475, 'samples': 142080, 'steps': 739, 'loss/train': 4.330541133880615} 11/06/2021 21:22:49 - INFO - __main__ - Step 741: {'lr': 0.000185, 'samples': 142272, 'steps': 740, 'loss/train': 4.524784564971924} 11/06/2021 21:22:50 - INFO - __main__ - Step 742: {'lr': 0.00018525, 'samples': 142464, 'steps': 741, 'loss/train': 4.280323505401611} 11/06/2021 21:22:51 - INFO - __main__ - Step 743: {'lr': 0.0001855, 'samples': 142656, 'steps': 742, 'loss/train': 4.447424411773682} 11/06/2021 21:22:51 - INFO - __main__ - Step 744: {'lr': 0.00018575000000000002, 'samples': 142848, 'steps': 743, 'loss/train': 4.917816638946533} 11/06/2021 21:22:51 - INFO - __main__ - Step 745: {'lr': 0.000186, 'samples': 143040, 'steps': 744, 'loss/train': 4.4243059158325195} 11/06/2021 21:22:52 - INFO - __main__ - Step 746: {'lr': 0.00018625, 'samples': 143232, 'steps': 745, 'loss/train': 5.763511657714844} 11/06/2021 21:22:52 - INFO - __main__ - Step 747: {'lr': 0.0001865, 'samples': 143424, 'steps': 746, 'loss/train': 4.306034564971924} 11/06/2021 21:22:53 - INFO - __main__ - Step 748: {'lr': 0.00018675, 'samples': 143616, 'steps': 747, 'loss/train': 5.227732181549072} 11/06/2021 21:22:54 - INFO - __main__ - Step 749: {'lr': 0.000187, 'samples': 143808, 'steps': 748, 'loss/train': 5.433281421661377} 11/06/2021 21:22:54 - INFO - __main__ - Step 750: {'lr': 0.00018725, 'samples': 144000, 'steps': 749, 'loss/train': 4.236339092254639} 11/06/2021 21:22:54 - INFO - __main__ - Step 751: {'lr': 0.0001875, 'samples': 144192, 'steps': 750, 'loss/train': 4.570340633392334} 11/06/2021 21:22:55 - INFO - __main__ - Step 752: {'lr': 0.00018775, 'samples': 144384, 'steps': 751, 'loss/train': 4.103482723236084} 11/06/2021 21:22:55 - INFO - __main__ - Step 753: {'lr': 0.00018800000000000002, 'samples': 144576, 'steps': 752, 'loss/train': 4.504870891571045} 11/06/2021 21:22:56 - INFO - __main__ - Step 754: {'lr': 0.00018825, 'samples': 144768, 'steps': 753, 'loss/train': 4.6825480461120605} 11/06/2021 21:22:57 - INFO - __main__ - Step 755: {'lr': 0.0001885, 'samples': 144960, 'steps': 754, 'loss/train': 4.360548973083496} 11/06/2021 21:22:57 - INFO - __main__ - Step 756: {'lr': 0.00018875, 'samples': 145152, 'steps': 755, 'loss/train': 4.626955032348633} 11/06/2021 21:22:57 - INFO - __main__ - Step 757: {'lr': 0.000189, 'samples': 145344, 'steps': 756, 'loss/train': 4.462242126464844} 11/06/2021 21:22:58 - INFO - __main__ - Step 758: {'lr': 0.00018925, 'samples': 145536, 'steps': 757, 'loss/train': 4.262608528137207} 11/06/2021 21:22:59 - INFO - __main__ - Step 759: {'lr': 0.0001895, 'samples': 145728, 'steps': 758, 'loss/train': 4.549057960510254} 11/06/2021 21:22:59 - INFO - __main__ - Step 760: {'lr': 0.00018975, 'samples': 145920, 'steps': 759, 'loss/train': 4.4868574142456055} 11/06/2021 21:22:59 - INFO - __main__ - Step 761: {'lr': 0.00019, 'samples': 146112, 'steps': 760, 'loss/train': 4.394392013549805} 11/06/2021 21:23:00 - INFO - __main__ - Step 762: {'lr': 0.00019025000000000002, 'samples': 146304, 'steps': 761, 'loss/train': 4.9143500328063965} 11/06/2021 21:23:00 - INFO - __main__ - Step 763: {'lr': 0.0001905, 'samples': 146496, 'steps': 762, 'loss/train': 4.456435680389404} 11/06/2021 21:23:01 - INFO - __main__ - Step 764: {'lr': 0.00019075, 'samples': 146688, 'steps': 763, 'loss/train': 5.696299076080322} 11/06/2021 21:23:02 - INFO - __main__ - Step 765: {'lr': 0.000191, 'samples': 146880, 'steps': 764, 'loss/train': 4.935421466827393} 11/06/2021 21:23:02 - INFO - __main__ - Step 766: {'lr': 0.00019125000000000001, 'samples': 147072, 'steps': 765, 'loss/train': 4.943374156951904} 11/06/2021 21:23:02 - INFO - __main__ - Step 767: {'lr': 0.00019150000000000002, 'samples': 147264, 'steps': 766, 'loss/train': 5.322519302368164} 11/06/2021 21:23:03 - INFO - __main__ - Step 768: {'lr': 0.00019175, 'samples': 147456, 'steps': 767, 'loss/train': 4.189104080200195} 11/06/2021 21:23:04 - INFO - __main__ - Step 769: {'lr': 0.000192, 'samples': 147648, 'steps': 768, 'loss/train': 5.1701979637146} 11/06/2021 21:23:04 - INFO - __main__ - Step 770: {'lr': 0.00019225, 'samples': 147840, 'steps': 769, 'loss/train': 4.428781032562256} 11/06/2021 21:23:04 - INFO - __main__ - Step 771: {'lr': 0.00019250000000000002, 'samples': 148032, 'steps': 770, 'loss/train': 4.415109157562256} 11/06/2021 21:23:05 - INFO - __main__ - Step 772: {'lr': 0.00019275, 'samples': 148224, 'steps': 771, 'loss/train': 4.811757564544678} 11/06/2021 21:23:05 - INFO - __main__ - Step 773: {'lr': 0.000193, 'samples': 148416, 'steps': 772, 'loss/train': 4.7581634521484375} 11/06/2021 21:23:06 - INFO - __main__ - Step 774: {'lr': 0.00019325, 'samples': 148608, 'steps': 773, 'loss/train': 4.4999098777771} 11/06/2021 21:23:06 - INFO - __main__ - Step 775: {'lr': 0.00019350000000000001, 'samples': 148800, 'steps': 774, 'loss/train': 4.690352439880371} 11/06/2021 21:23:07 - INFO - __main__ - Step 776: {'lr': 0.00019375000000000002, 'samples': 148992, 'steps': 775, 'loss/train': 4.419637203216553} 11/06/2021 21:23:07 - INFO - __main__ - Step 777: {'lr': 0.000194, 'samples': 149184, 'steps': 776, 'loss/train': 4.947287559509277} 11/06/2021 21:23:08 - INFO - __main__ - Step 778: {'lr': 0.00019425, 'samples': 149376, 'steps': 777, 'loss/train': 5.457443714141846} 11/06/2021 21:23:09 - INFO - __main__ - Step 779: {'lr': 0.0001945, 'samples': 149568, 'steps': 778, 'loss/train': 4.38627815246582} 11/06/2021 21:23:09 - INFO - __main__ - Step 780: {'lr': 0.00019475000000000002, 'samples': 149760, 'steps': 779, 'loss/train': 5.3790178298950195} 11/06/2021 21:23:09 - INFO - __main__ - Step 781: {'lr': 0.00019500000000000002, 'samples': 149952, 'steps': 780, 'loss/train': 4.04803466796875} 11/06/2021 21:23:10 - INFO - __main__ - Step 782: {'lr': 0.00019525, 'samples': 150144, 'steps': 781, 'loss/train': 4.558263301849365} 11/06/2021 21:23:10 - INFO - __main__ - Step 783: {'lr': 0.0001955, 'samples': 150336, 'steps': 782, 'loss/train': 4.308804988861084} 11/06/2021 21:23:11 - INFO - __main__ - Step 784: {'lr': 0.00019575000000000001, 'samples': 150528, 'steps': 783, 'loss/train': 5.033785820007324} 11/06/2021 21:23:11 - INFO - __main__ - Step 785: {'lr': 0.00019600000000000002, 'samples': 150720, 'steps': 784, 'loss/train': 4.059752464294434} 11/06/2021 21:23:12 - INFO - __main__ - Step 786: {'lr': 0.00019625, 'samples': 150912, 'steps': 785, 'loss/train': 4.140824317932129} 11/06/2021 21:23:12 - INFO - __main__ - Step 787: {'lr': 0.0001965, 'samples': 151104, 'steps': 786, 'loss/train': 4.625868797302246} 11/06/2021 21:23:12 - INFO - __main__ - Step 788: {'lr': 0.00019675, 'samples': 151296, 'steps': 787, 'loss/train': 4.85765266418457} 11/06/2021 21:23:14 - INFO - __main__ - Step 789: {'lr': 0.00019700000000000002, 'samples': 151488, 'steps': 788, 'loss/train': 4.904588222503662} 11/06/2021 21:23:14 - INFO - __main__ - Step 790: {'lr': 0.00019725000000000002, 'samples': 151680, 'steps': 789, 'loss/train': 4.804408550262451} 11/06/2021 21:23:14 - INFO - __main__ - Step 791: {'lr': 0.0001975, 'samples': 151872, 'steps': 790, 'loss/train': 4.8462114334106445} 11/06/2021 21:23:15 - INFO - __main__ - Step 792: {'lr': 0.00019775, 'samples': 152064, 'steps': 791, 'loss/train': 4.475534439086914} 11/06/2021 21:23:15 - INFO - __main__ - Step 793: {'lr': 0.00019800000000000002, 'samples': 152256, 'steps': 792, 'loss/train': 4.764225482940674} 11/06/2021 21:23:16 - INFO - __main__ - Step 794: {'lr': 0.00019825000000000002, 'samples': 152448, 'steps': 793, 'loss/train': 4.824385643005371} 11/06/2021 21:23:16 - INFO - __main__ - Step 795: {'lr': 0.00019850000000000003, 'samples': 152640, 'steps': 794, 'loss/train': 4.384396076202393} 11/06/2021 21:23:17 - INFO - __main__ - Step 796: {'lr': 0.00019875, 'samples': 152832, 'steps': 795, 'loss/train': 4.154908657073975} 11/06/2021 21:23:17 - INFO - __main__ - Step 797: {'lr': 0.000199, 'samples': 153024, 'steps': 796, 'loss/train': 4.558513641357422} 11/06/2021 21:23:18 - INFO - __main__ - Step 798: {'lr': 0.00019925000000000002, 'samples': 153216, 'steps': 797, 'loss/train': 4.913057327270508} 11/06/2021 21:23:18 - INFO - __main__ - Step 799: {'lr': 0.00019950000000000002, 'samples': 153408, 'steps': 798, 'loss/train': 4.051199913024902} 11/06/2021 21:23:20 - INFO - __main__ - Step 800: {'lr': 0.00019975, 'samples': 153600, 'steps': 799, 'loss/train': 3.938993215560913} 11/06/2021 21:23:20 - INFO - __main__ - Step 801: {'lr': 0.0002, 'samples': 153792, 'steps': 800, 'loss/train': 4.724459648132324} 11/06/2021 21:23:20 - INFO - __main__ - Step 802: {'lr': 0.00020025000000000002, 'samples': 153984, 'steps': 801, 'loss/train': 6.338046550750732} 11/06/2021 21:23:21 - INFO - __main__ - Step 803: {'lr': 0.00020050000000000002, 'samples': 154176, 'steps': 802, 'loss/train': 2.221121072769165} 11/06/2021 21:23:21 - INFO - __main__ - Step 804: {'lr': 0.00020075000000000003, 'samples': 154368, 'steps': 803, 'loss/train': 2.2776834964752197} 11/06/2021 21:23:21 - INFO - __main__ - Step 805: {'lr': 0.000201, 'samples': 154560, 'steps': 804, 'loss/train': 2.1242752075195312} 11/06/2021 21:23:22 - INFO - __main__ - Step 806: {'lr': 0.00020125, 'samples': 154752, 'steps': 805, 'loss/train': 3.73689603805542} 11/06/2021 21:23:23 - INFO - __main__ - Step 807: {'lr': 0.00020150000000000002, 'samples': 154944, 'steps': 806, 'loss/train': 4.354226112365723} 11/06/2021 21:23:23 - INFO - __main__ - Step 808: {'lr': 0.00020175000000000003, 'samples': 155136, 'steps': 807, 'loss/train': 4.5094475746154785} 11/06/2021 21:23:24 - INFO - __main__ - Step 809: {'lr': 0.000202, 'samples': 155328, 'steps': 808, 'loss/train': 4.677899360656738} 11/06/2021 21:23:24 - INFO - __main__ - Step 810: {'lr': 0.00020225, 'samples': 155520, 'steps': 809, 'loss/train': 4.373321533203125} 11/06/2021 21:23:25 - INFO - __main__ - Step 811: {'lr': 0.00020250000000000002, 'samples': 155712, 'steps': 810, 'loss/train': 5.149594306945801} 11/06/2021 21:23:26 - INFO - __main__ - Step 812: {'lr': 0.00020275000000000002, 'samples': 155904, 'steps': 811, 'loss/train': 4.3080902099609375} 11/06/2021 21:23:26 - INFO - __main__ - Step 813: {'lr': 0.00020300000000000003, 'samples': 156096, 'steps': 812, 'loss/train': 4.42739725112915} 11/06/2021 21:23:26 - INFO - __main__ - Step 814: {'lr': 0.00020324999999999998, 'samples': 156288, 'steps': 813, 'loss/train': 4.6587677001953125} 11/06/2021 21:23:27 - INFO - __main__ - Step 815: {'lr': 0.00020349999999999999, 'samples': 156480, 'steps': 814, 'loss/train': 4.584706783294678} 11/06/2021 21:23:27 - INFO - __main__ - Step 816: {'lr': 0.00020375, 'samples': 156672, 'steps': 815, 'loss/train': 4.675464153289795} 11/06/2021 21:23:28 - INFO - __main__ - Step 817: {'lr': 0.000204, 'samples': 156864, 'steps': 816, 'loss/train': 4.275284767150879} 11/06/2021 21:23:28 - INFO - __main__ - Step 818: {'lr': 0.00020425, 'samples': 157056, 'steps': 817, 'loss/train': 4.487129211425781} 11/06/2021 21:23:29 - INFO - __main__ - Step 819: {'lr': 0.00020449999999999998, 'samples': 157248, 'steps': 818, 'loss/train': 3.8839242458343506} 11/06/2021 21:23:29 - INFO - __main__ - Step 820: {'lr': 0.00020475, 'samples': 157440, 'steps': 819, 'loss/train': 5.175144672393799} 11/06/2021 21:23:30 - INFO - __main__ - Step 821: {'lr': 0.000205, 'samples': 157632, 'steps': 820, 'loss/train': 4.6907877922058105} 11/06/2021 21:23:31 - INFO - __main__ - Step 822: {'lr': 0.00020525, 'samples': 157824, 'steps': 821, 'loss/train': 5.107820987701416} 11/06/2021 21:23:31 - INFO - __main__ - Step 823: {'lr': 0.00020549999999999998, 'samples': 158016, 'steps': 822, 'loss/train': 4.4801106452941895} 11/06/2021 21:23:31 - INFO - __main__ - Step 824: {'lr': 0.00020575, 'samples': 158208, 'steps': 823, 'loss/train': 4.510085105895996} 11/06/2021 21:23:32 - INFO - __main__ - Step 825: {'lr': 0.000206, 'samples': 158400, 'steps': 824, 'loss/train': 4.396790981292725} 11/06/2021 21:23:32 - INFO - __main__ - Step 826: {'lr': 0.00020625, 'samples': 158592, 'steps': 825, 'loss/train': 4.010310649871826} 11/06/2021 21:23:32 - INFO - __main__ - Step 827: {'lr': 0.0002065, 'samples': 158784, 'steps': 826, 'loss/train': 4.797766208648682} 11/06/2021 21:23:33 - INFO - __main__ - Step 828: {'lr': 0.00020674999999999998, 'samples': 158976, 'steps': 827, 'loss/train': 4.619658470153809} 11/06/2021 21:23:34 - INFO - __main__ - Step 829: {'lr': 0.000207, 'samples': 159168, 'steps': 828, 'loss/train': 4.163397312164307} 11/06/2021 21:23:34 - INFO - __main__ - Step 830: {'lr': 0.00020725, 'samples': 159360, 'steps': 829, 'loss/train': 4.228115081787109} 11/06/2021 21:23:34 - INFO - __main__ - Step 831: {'lr': 0.0002075, 'samples': 159552, 'steps': 830, 'loss/train': 4.451252460479736} 11/06/2021 21:23:35 - INFO - __main__ - Step 832: {'lr': 0.00020774999999999998, 'samples': 159744, 'steps': 831, 'loss/train': 4.284130096435547} 11/06/2021 21:23:36 - INFO - __main__ - Step 833: {'lr': 0.000208, 'samples': 159936, 'steps': 832, 'loss/train': 4.7883687019348145} 11/06/2021 21:23:36 - INFO - __main__ - Step 834: {'lr': 0.00020825, 'samples': 160128, 'steps': 833, 'loss/train': 4.043850898742676} 11/06/2021 21:23:37 - INFO - __main__ - Step 835: {'lr': 0.0002085, 'samples': 160320, 'steps': 834, 'loss/train': 4.566141128540039} 11/06/2021 21:23:37 - INFO - __main__ - Step 836: {'lr': 0.00020875, 'samples': 160512, 'steps': 835, 'loss/train': 5.373464584350586} 11/06/2021 21:23:37 - INFO - __main__ - Step 837: {'lr': 0.00020899999999999998, 'samples': 160704, 'steps': 836, 'loss/train': 4.873067378997803} 11/06/2021 21:23:39 - INFO - __main__ - Step 838: {'lr': 0.00020925, 'samples': 160896, 'steps': 837, 'loss/train': 4.9676995277404785} 11/06/2021 21:23:39 - INFO - __main__ - Step 839: {'lr': 0.0002095, 'samples': 161088, 'steps': 838, 'loss/train': 4.646881103515625} 11/06/2021 21:23:39 - INFO - __main__ - Step 840: {'lr': 0.00020975, 'samples': 161280, 'steps': 839, 'loss/train': 4.013716220855713} 11/06/2021 21:23:40 - INFO - __main__ - Step 841: {'lr': 0.00021, 'samples': 161472, 'steps': 840, 'loss/train': 6.019371509552002} 11/06/2021 21:23:40 - INFO - __main__ - Step 842: {'lr': 0.00021025, 'samples': 161664, 'steps': 841, 'loss/train': 3.9948506355285645} 11/06/2021 21:23:41 - INFO - __main__ - Step 843: {'lr': 0.0002105, 'samples': 161856, 'steps': 842, 'loss/train': 4.164978504180908} 11/06/2021 21:23:42 - INFO - __main__ - Step 844: {'lr': 0.00021075, 'samples': 162048, 'steps': 843, 'loss/train': 4.208803176879883} 11/06/2021 21:23:42 - INFO - __main__ - Step 845: {'lr': 0.000211, 'samples': 162240, 'steps': 844, 'loss/train': 5.149277210235596} 11/06/2021 21:23:42 - INFO - __main__ - Step 846: {'lr': 0.00021124999999999998, 'samples': 162432, 'steps': 845, 'loss/train': 4.742812156677246} 11/06/2021 21:23:43 - INFO - __main__ - Step 847: {'lr': 0.0002115, 'samples': 162624, 'steps': 846, 'loss/train': 4.207589149475098} 11/06/2021 21:23:43 - INFO - __main__ - Step 848: {'lr': 0.00021175, 'samples': 162816, 'steps': 847, 'loss/train': 4.035452842712402} 11/06/2021 21:23:44 - INFO - __main__ - Step 849: {'lr': 0.000212, 'samples': 163008, 'steps': 848, 'loss/train': 4.860066890716553} 11/06/2021 21:23:45 - INFO - __main__ - Step 850: {'lr': 0.00021225, 'samples': 163200, 'steps': 849, 'loss/train': 3.371035575866699} 11/06/2021 21:23:45 - INFO - __main__ - Step 851: {'lr': 0.0002125, 'samples': 163392, 'steps': 850, 'loss/train': 4.5741753578186035} 11/06/2021 21:23:45 - INFO - __main__ - Step 852: {'lr': 0.00021275, 'samples': 163584, 'steps': 851, 'loss/train': 4.537678241729736} 11/06/2021 21:23:46 - INFO - __main__ - Step 853: {'lr': 0.000213, 'samples': 163776, 'steps': 852, 'loss/train': 4.552616119384766} 11/06/2021 21:23:47 - INFO - __main__ - Step 854: {'lr': 0.00021325, 'samples': 163968, 'steps': 853, 'loss/train': 4.6721577644348145} 11/06/2021 21:23:47 - INFO - __main__ - Step 855: {'lr': 0.0002135, 'samples': 164160, 'steps': 854, 'loss/train': 4.655636310577393} 11/06/2021 21:23:47 - INFO - __main__ - Step 856: {'lr': 0.00021375, 'samples': 164352, 'steps': 855, 'loss/train': 4.305333614349365} 11/06/2021 21:23:48 - INFO - __main__ - Step 857: {'lr': 0.000214, 'samples': 164544, 'steps': 856, 'loss/train': 4.127890110015869} 11/06/2021 21:23:48 - INFO - __main__ - Step 858: {'lr': 0.00021425, 'samples': 164736, 'steps': 857, 'loss/train': 4.549274921417236} 11/06/2021 21:23:49 - INFO - __main__ - Step 859: {'lr': 0.0002145, 'samples': 164928, 'steps': 858, 'loss/train': 3.803431510925293} 11/06/2021 21:23:50 - INFO - __main__ - Step 860: {'lr': 0.00021475, 'samples': 165120, 'steps': 859, 'loss/train': 4.316712379455566} 11/06/2021 21:23:50 - INFO - __main__ - Step 861: {'lr': 0.000215, 'samples': 165312, 'steps': 860, 'loss/train': 4.917266845703125} 11/06/2021 21:23:50 - INFO - __main__ - Step 862: {'lr': 0.00021525, 'samples': 165504, 'steps': 861, 'loss/train': 4.41719913482666} 11/06/2021 21:23:51 - INFO - __main__ - Step 863: {'lr': 0.0002155, 'samples': 165696, 'steps': 862, 'loss/train': 4.436905384063721} 11/06/2021 21:23:52 - INFO - __main__ - Step 864: {'lr': 0.00021575, 'samples': 165888, 'steps': 863, 'loss/train': 4.34383487701416} 11/06/2021 21:23:52 - INFO - __main__ - Step 865: {'lr': 0.000216, 'samples': 166080, 'steps': 864, 'loss/train': 4.4350080490112305} 11/06/2021 21:23:52 - INFO - __main__ - Step 866: {'lr': 0.00021625, 'samples': 166272, 'steps': 865, 'loss/train': 4.548778057098389} 11/06/2021 21:23:53 - INFO - __main__ - Step 867: {'lr': 0.0002165, 'samples': 166464, 'steps': 866, 'loss/train': 4.653875827789307} 11/06/2021 21:23:53 - INFO - __main__ - Step 868: {'lr': 0.00021675, 'samples': 166656, 'steps': 867, 'loss/train': 4.1312150955200195} 11/06/2021 21:23:54 - INFO - __main__ - Step 869: {'lr': 0.00021700000000000002, 'samples': 166848, 'steps': 868, 'loss/train': 4.243196964263916} 11/06/2021 21:23:54 - INFO - __main__ - Step 870: {'lr': 0.00021725, 'samples': 167040, 'steps': 869, 'loss/train': 4.393092155456543} 11/06/2021 21:23:55 - INFO - __main__ - Step 871: {'lr': 0.0002175, 'samples': 167232, 'steps': 870, 'loss/train': 4.559785842895508} 11/06/2021 21:23:55 - INFO - __main__ - Step 872: {'lr': 0.00021775, 'samples': 167424, 'steps': 871, 'loss/train': 1.9217588901519775} 11/06/2021 21:23:55 - INFO - __main__ - Step 873: {'lr': 0.000218, 'samples': 167616, 'steps': 872, 'loss/train': 4.757959365844727} 11/06/2021 21:23:57 - INFO - __main__ - Step 874: {'lr': 0.00021825, 'samples': 167808, 'steps': 873, 'loss/train': 4.589957237243652} 11/06/2021 21:23:57 - INFO - __main__ - Step 875: {'lr': 0.0002185, 'samples': 168000, 'steps': 874, 'loss/train': 4.484537124633789} 11/06/2021 21:23:57 - INFO - __main__ - Step 876: {'lr': 0.00021875, 'samples': 168192, 'steps': 875, 'loss/train': 4.340537071228027} 11/06/2021 21:23:58 - INFO - __main__ - Step 877: {'lr': 0.000219, 'samples': 168384, 'steps': 876, 'loss/train': 4.769442558288574} 11/06/2021 21:23:58 - INFO - __main__ - Step 878: {'lr': 0.00021925000000000002, 'samples': 168576, 'steps': 877, 'loss/train': 4.576560974121094} 11/06/2021 21:23:59 - INFO - __main__ - Step 879: {'lr': 0.0002195, 'samples': 168768, 'steps': 878, 'loss/train': 4.7822418212890625} 11/06/2021 21:23:59 - INFO - __main__ - Step 880: {'lr': 0.00021975, 'samples': 168960, 'steps': 879, 'loss/train': 4.239711284637451} 11/06/2021 21:24:00 - INFO - __main__ - Step 881: {'lr': 0.00022, 'samples': 169152, 'steps': 880, 'loss/train': 4.899833679199219} 11/06/2021 21:24:00 - INFO - __main__ - Step 882: {'lr': 0.00022025000000000001, 'samples': 169344, 'steps': 881, 'loss/train': 4.500170707702637} 11/06/2021 21:24:00 - INFO - __main__ - Step 883: {'lr': 0.0002205, 'samples': 169536, 'steps': 882, 'loss/train': 4.454834938049316} 11/06/2021 21:24:02 - INFO - __main__ - Step 884: {'lr': 0.00022075, 'samples': 169728, 'steps': 883, 'loss/train': 4.347689151763916} 11/06/2021 21:24:02 - INFO - __main__ - Step 885: {'lr': 0.000221, 'samples': 169920, 'steps': 884, 'loss/train': 4.470192909240723} 11/06/2021 21:24:02 - INFO - __main__ - Step 886: {'lr': 0.00022125, 'samples': 170112, 'steps': 885, 'loss/train': 3.8449060916900635} 11/06/2021 21:24:03 - INFO - __main__ - Step 887: {'lr': 0.00022150000000000002, 'samples': 170304, 'steps': 886, 'loss/train': 3.2878403663635254} 11/06/2021 21:24:03 - INFO - __main__ - Step 888: {'lr': 0.00022175, 'samples': 170496, 'steps': 887, 'loss/train': 4.312256813049316} 11/06/2021 21:24:04 - INFO - __main__ - Step 889: {'lr': 0.000222, 'samples': 170688, 'steps': 888, 'loss/train': 4.553658962249756} 11/06/2021 21:24:04 - INFO - __main__ - Step 890: {'lr': 0.00022225, 'samples': 170880, 'steps': 889, 'loss/train': 4.031077861785889} 11/06/2021 21:24:05 - INFO - __main__ - Step 891: {'lr': 0.00022250000000000001, 'samples': 171072, 'steps': 890, 'loss/train': 4.104956150054932} 11/06/2021 21:24:05 - INFO - __main__ - Step 892: {'lr': 0.00022275000000000002, 'samples': 171264, 'steps': 891, 'loss/train': 4.234619617462158} 11/06/2021 21:24:05 - INFO - __main__ - Step 893: {'lr': 0.000223, 'samples': 171456, 'steps': 892, 'loss/train': 3.9380950927734375} 11/06/2021 21:24:06 - INFO - __main__ - Step 894: {'lr': 0.00022325, 'samples': 171648, 'steps': 893, 'loss/train': 3.4913272857666016} 11/06/2021 21:24:07 - INFO - __main__ - Step 895: {'lr': 0.0002235, 'samples': 171840, 'steps': 894, 'loss/train': 2.99515700340271} 11/06/2021 21:24:07 - INFO - __main__ - Step 896: {'lr': 0.00022375000000000002, 'samples': 172032, 'steps': 895, 'loss/train': 4.716649055480957} 11/06/2021 21:24:07 - INFO - __main__ - Step 897: {'lr': 0.000224, 'samples': 172224, 'steps': 896, 'loss/train': 3.291337251663208} 11/06/2021 21:24:08 - INFO - __main__ - Step 898: {'lr': 0.00022425, 'samples': 172416, 'steps': 897, 'loss/train': 4.090959548950195} 11/06/2021 21:24:08 - INFO - __main__ - Step 899: {'lr': 0.0002245, 'samples': 172608, 'steps': 898, 'loss/train': 4.231109142303467} 11/06/2021 21:24:09 - INFO - __main__ - Step 900: {'lr': 0.00022475000000000001, 'samples': 172800, 'steps': 899, 'loss/train': 3.5604536533355713} 11/06/2021 21:24:09 - INFO - __main__ - Step 901: {'lr': 0.00022500000000000002, 'samples': 172992, 'steps': 900, 'loss/train': 4.001966953277588} 11/06/2021 21:24:10 - INFO - __main__ - Step 902: {'lr': 0.00022525, 'samples': 173184, 'steps': 901, 'loss/train': 4.180810928344727} 11/06/2021 21:24:10 - INFO - __main__ - Step 903: {'lr': 0.0002255, 'samples': 173376, 'steps': 902, 'loss/train': 4.559982776641846} 11/06/2021 21:24:11 - INFO - __main__ - Step 904: {'lr': 0.00022575, 'samples': 173568, 'steps': 903, 'loss/train': 4.321451663970947} 11/06/2021 21:24:12 - INFO - __main__ - Step 905: {'lr': 0.00022600000000000002, 'samples': 173760, 'steps': 904, 'loss/train': 4.82120943069458} 11/06/2021 21:24:12 - INFO - __main__ - Step 906: {'lr': 0.00022625000000000002, 'samples': 173952, 'steps': 905, 'loss/train': 4.165206432342529} 11/06/2021 21:24:12 - INFO - __main__ - Step 907: {'lr': 0.0002265, 'samples': 174144, 'steps': 906, 'loss/train': 2.5543439388275146} 11/06/2021 21:24:13 - INFO - __main__ - Step 908: {'lr': 0.00022675, 'samples': 174336, 'steps': 907, 'loss/train': 4.742263317108154} 11/06/2021 21:24:13 - INFO - __main__ - Step 909: {'lr': 0.00022700000000000002, 'samples': 174528, 'steps': 908, 'loss/train': 4.112782001495361} 11/06/2021 21:24:14 - INFO - __main__ - Step 910: {'lr': 0.00022725000000000002, 'samples': 174720, 'steps': 909, 'loss/train': 4.566103458404541} 11/06/2021 21:24:15 - INFO - __main__ - Step 911: {'lr': 0.0002275, 'samples': 174912, 'steps': 910, 'loss/train': 4.344858169555664} 11/06/2021 21:24:15 - INFO - __main__ - Step 912: {'lr': 0.00022775, 'samples': 175104, 'steps': 911, 'loss/train': 3.5020289421081543} 11/06/2021 21:24:15 - INFO - __main__ - Step 913: {'lr': 0.000228, 'samples': 175296, 'steps': 912, 'loss/train': 4.5875163078308105} 11/06/2021 21:24:16 - INFO - __main__ - Step 914: {'lr': 0.00022825000000000002, 'samples': 175488, 'steps': 913, 'loss/train': 4.796538352966309} 11/06/2021 21:24:17 - INFO - __main__ - Step 915: {'lr': 0.00022850000000000002, 'samples': 175680, 'steps': 914, 'loss/train': 4.283998966217041} 11/06/2021 21:24:17 - INFO - __main__ - Step 916: {'lr': 0.00022875, 'samples': 175872, 'steps': 915, 'loss/train': 4.674391746520996} 11/06/2021 21:24:17 - INFO - __main__ - Step 917: {'lr': 0.000229, 'samples': 176064, 'steps': 916, 'loss/train': 5.0381999015808105} 11/06/2021 21:24:18 - INFO - __main__ - Step 918: {'lr': 0.00022925000000000002, 'samples': 176256, 'steps': 917, 'loss/train': 4.393980503082275} 11/06/2021 21:24:18 - INFO - __main__ - Step 919: {'lr': 0.00022950000000000002, 'samples': 176448, 'steps': 918, 'loss/train': 3.857478141784668} 11/06/2021 21:24:19 - INFO - __main__ - Step 920: {'lr': 0.00022975000000000003, 'samples': 176640, 'steps': 919, 'loss/train': 4.180598258972168} 11/06/2021 21:24:19 - INFO - __main__ - Step 921: {'lr': 0.00023, 'samples': 176832, 'steps': 920, 'loss/train': 4.508198261260986} 11/06/2021 21:24:20 - INFO - __main__ - Step 922: {'lr': 0.00023025, 'samples': 177024, 'steps': 921, 'loss/train': 4.902273654937744} 11/06/2021 21:24:20 - INFO - __main__ - Step 923: {'lr': 0.00023050000000000002, 'samples': 177216, 'steps': 922, 'loss/train': 3.9419641494750977} 11/06/2021 21:24:21 - INFO - __main__ - Step 924: {'lr': 0.00023075000000000003, 'samples': 177408, 'steps': 923, 'loss/train': 4.182397365570068} 11/06/2021 21:24:21 - INFO - __main__ - Step 925: {'lr': 0.000231, 'samples': 177600, 'steps': 924, 'loss/train': 5.673306941986084} 11/06/2021 21:24:22 - INFO - __main__ - Step 926: {'lr': 0.00023125, 'samples': 177792, 'steps': 925, 'loss/train': 4.469267845153809} 11/06/2021 21:24:22 - INFO - __main__ - Step 927: {'lr': 0.00023150000000000002, 'samples': 177984, 'steps': 926, 'loss/train': 4.016385078430176} 11/06/2021 21:24:23 - INFO - __main__ - Step 928: {'lr': 0.00023175000000000002, 'samples': 178176, 'steps': 927, 'loss/train': 4.510035037994385} 11/06/2021 21:24:23 - INFO - __main__ - Step 929: {'lr': 0.00023200000000000003, 'samples': 178368, 'steps': 928, 'loss/train': 3.650172233581543} 11/06/2021 21:24:23 - INFO - __main__ - Step 930: {'lr': 0.00023225, 'samples': 178560, 'steps': 929, 'loss/train': 3.8825185298919678} 11/06/2021 21:24:24 - INFO - __main__ - Step 931: {'lr': 0.0002325, 'samples': 178752, 'steps': 930, 'loss/train': 4.376020908355713} 11/06/2021 21:24:25 - INFO - __main__ - Step 932: {'lr': 0.00023275000000000002, 'samples': 178944, 'steps': 931, 'loss/train': 4.174633979797363} 11/06/2021 21:24:25 - INFO - __main__ - Step 933: {'lr': 0.00023300000000000003, 'samples': 179136, 'steps': 932, 'loss/train': 4.296014785766602} 11/06/2021 21:24:25 - INFO - __main__ - Step 934: {'lr': 0.00023325, 'samples': 179328, 'steps': 933, 'loss/train': 3.821877956390381} 11/06/2021 21:24:26 - INFO - __main__ - Step 935: {'lr': 0.0002335, 'samples': 179520, 'steps': 934, 'loss/train': 4.20390510559082} 11/06/2021 21:24:27 - INFO - __main__ - Step 936: {'lr': 0.00023375000000000002, 'samples': 179712, 'steps': 935, 'loss/train': 4.282657623291016} 11/06/2021 21:24:27 - INFO - __main__ - Step 937: {'lr': 0.00023400000000000002, 'samples': 179904, 'steps': 936, 'loss/train': 3.8457953929901123} 11/06/2021 21:24:28 - INFO - __main__ - Step 938: {'lr': 0.00023425000000000003, 'samples': 180096, 'steps': 937, 'loss/train': 4.099065780639648} 11/06/2021 21:24:28 - INFO - __main__ - Step 939: {'lr': 0.00023449999999999998, 'samples': 180288, 'steps': 938, 'loss/train': 4.02554988861084} 11/06/2021 21:24:28 - INFO - __main__ - Step 940: {'lr': 0.00023475, 'samples': 180480, 'steps': 939, 'loss/train': 4.181853294372559} 11/06/2021 21:24:29 - INFO - __main__ - Step 941: {'lr': 0.000235, 'samples': 180672, 'steps': 940, 'loss/train': 4.0363969802856445} 11/06/2021 21:24:30 - INFO - __main__ - Step 942: {'lr': 0.00023525, 'samples': 180864, 'steps': 941, 'loss/train': 4.118391036987305} 11/06/2021 21:24:30 - INFO - __main__ - Step 943: {'lr': 0.0002355, 'samples': 181056, 'steps': 942, 'loss/train': 4.28350830078125} 11/06/2021 21:24:30 - INFO - __main__ - Step 944: {'lr': 0.00023574999999999998, 'samples': 181248, 'steps': 943, 'loss/train': 3.8540680408477783} 11/06/2021 21:24:31 - INFO - __main__ - Step 945: {'lr': 0.000236, 'samples': 181440, 'steps': 944, 'loss/train': 4.089186191558838} 11/06/2021 21:24:32 - INFO - __main__ - Step 946: {'lr': 0.00023625, 'samples': 181632, 'steps': 945, 'loss/train': 4.698647975921631} 11/06/2021 21:24:32 - INFO - __main__ - Step 947: {'lr': 0.0002365, 'samples': 181824, 'steps': 946, 'loss/train': 4.4743123054504395} 11/06/2021 21:24:32 - INFO - __main__ - Step 948: {'lr': 0.00023674999999999998, 'samples': 182016, 'steps': 947, 'loss/train': 4.072103500366211} 11/06/2021 21:24:33 - INFO - __main__ - Step 949: {'lr': 0.000237, 'samples': 182208, 'steps': 948, 'loss/train': 4.225581169128418} 11/06/2021 21:24:33 - INFO - __main__ - Step 950: {'lr': 0.00023725, 'samples': 182400, 'steps': 949, 'loss/train': 4.038322448730469} 11/06/2021 21:24:34 - INFO - __main__ - Step 951: {'lr': 0.0002375, 'samples': 182592, 'steps': 950, 'loss/train': 4.349215984344482} 11/06/2021 21:24:35 - INFO - __main__ - Step 952: {'lr': 0.00023775, 'samples': 182784, 'steps': 951, 'loss/train': 4.141671180725098} 11/06/2021 21:24:35 - INFO - __main__ - Step 953: {'lr': 0.00023799999999999998, 'samples': 182976, 'steps': 952, 'loss/train': 3.437227725982666} 11/06/2021 21:24:35 - INFO - __main__ - Step 954: {'lr': 0.00023825, 'samples': 183168, 'steps': 953, 'loss/train': 4.3065505027771} 11/06/2021 21:24:36 - INFO - __main__ - Step 955: {'lr': 0.0002385, 'samples': 183360, 'steps': 954, 'loss/train': 4.3600687980651855} 11/06/2021 21:24:36 - INFO - __main__ - Step 956: {'lr': 0.00023875, 'samples': 183552, 'steps': 955, 'loss/train': 3.731297731399536} 11/06/2021 21:24:37 - INFO - __main__ - Step 957: {'lr': 0.00023899999999999998, 'samples': 183744, 'steps': 956, 'loss/train': 4.126706600189209} 11/06/2021 21:24:37 - INFO - __main__ - Step 958: {'lr': 0.00023925, 'samples': 183936, 'steps': 957, 'loss/train': 4.03998327255249} 11/06/2021 21:24:38 - INFO - __main__ - Step 959: {'lr': 0.0002395, 'samples': 184128, 'steps': 958, 'loss/train': 4.376034259796143} 11/06/2021 21:24:38 - INFO - __main__ - Step 960: {'lr': 0.00023975, 'samples': 184320, 'steps': 959, 'loss/train': 4.163696765899658} 11/06/2021 21:24:39 - INFO - __main__ - Step 961: {'lr': 0.00024, 'samples': 184512, 'steps': 960, 'loss/train': 3.9525156021118164} 11/06/2021 21:24:40 - INFO - __main__ - Step 962: {'lr': 0.00024024999999999999, 'samples': 184704, 'steps': 961, 'loss/train': 4.245794296264648} 11/06/2021 21:24:40 - INFO - __main__ - Step 963: {'lr': 0.0002405, 'samples': 184896, 'steps': 962, 'loss/train': 4.1179914474487305} 11/06/2021 21:24:41 - INFO - __main__ - Step 964: {'lr': 0.00024075, 'samples': 185088, 'steps': 963, 'loss/train': 3.9547386169433594} 11/06/2021 21:24:41 - INFO - __main__ - Step 965: {'lr': 0.000241, 'samples': 185280, 'steps': 964, 'loss/train': 5.651031494140625} 11/06/2021 21:24:41 - INFO - __main__ - Step 966: {'lr': 0.00024125, 'samples': 185472, 'steps': 965, 'loss/train': 4.45121431350708} 11/06/2021 21:24:42 - INFO - __main__ - Step 967: {'lr': 0.0002415, 'samples': 185664, 'steps': 966, 'loss/train': 4.361720561981201} 11/06/2021 21:24:43 - INFO - __main__ - Step 968: {'lr': 0.00024175, 'samples': 185856, 'steps': 967, 'loss/train': 4.614392280578613} 11/06/2021 21:24:43 - INFO - __main__ - Step 969: {'lr': 0.000242, 'samples': 186048, 'steps': 968, 'loss/train': 4.159306526184082} 11/06/2021 21:24:43 - INFO - __main__ - Step 970: {'lr': 0.00024225, 'samples': 186240, 'steps': 969, 'loss/train': 4.633758544921875} 11/06/2021 21:24:44 - INFO - __main__ - Step 971: {'lr': 0.00024249999999999999, 'samples': 186432, 'steps': 970, 'loss/train': 3.539381742477417} 11/06/2021 21:24:45 - INFO - __main__ - Step 972: {'lr': 0.00024275, 'samples': 186624, 'steps': 971, 'loss/train': 4.320156097412109} 11/06/2021 21:24:45 - INFO - __main__ - Step 973: {'lr': 0.000243, 'samples': 186816, 'steps': 972, 'loss/train': 4.2555952072143555} 11/06/2021 21:24:45 - INFO - __main__ - Step 974: {'lr': 0.00024325, 'samples': 187008, 'steps': 973, 'loss/train': 4.501039505004883} 11/06/2021 21:24:46 - INFO - __main__ - Step 975: {'lr': 0.0002435, 'samples': 187200, 'steps': 974, 'loss/train': 4.149434566497803} 11/06/2021 21:24:46 - INFO - __main__ - Step 976: {'lr': 0.00024375, 'samples': 187392, 'steps': 975, 'loss/train': 4.1013922691345215} 11/06/2021 21:24:47 - INFO - __main__ - Step 977: {'lr': 0.000244, 'samples': 187584, 'steps': 976, 'loss/train': 4.386606216430664} 11/06/2021 21:24:47 - INFO - __main__ - Step 978: {'lr': 0.00024425, 'samples': 187776, 'steps': 977, 'loss/train': 5.693066596984863} 11/06/2021 21:24:48 - INFO - __main__ - Step 979: {'lr': 0.0002445, 'samples': 187968, 'steps': 978, 'loss/train': 4.069120407104492} 11/06/2021 21:24:48 - INFO - __main__ - Step 980: {'lr': 0.00024475, 'samples': 188160, 'steps': 979, 'loss/train': 3.942934513092041} 11/06/2021 21:24:49 - INFO - __main__ - Step 981: {'lr': 0.000245, 'samples': 188352, 'steps': 980, 'loss/train': 4.399833679199219} 11/06/2021 21:24:49 - INFO - __main__ - Step 982: {'lr': 0.00024525, 'samples': 188544, 'steps': 981, 'loss/train': 4.462058067321777} 11/06/2021 21:24:50 - INFO - __main__ - Step 983: {'lr': 0.0002455, 'samples': 188736, 'steps': 982, 'loss/train': 4.437088966369629} 11/06/2021 21:24:50 - INFO - __main__ - Step 984: {'lr': 0.00024575, 'samples': 188928, 'steps': 983, 'loss/train': 3.954718828201294} 11/06/2021 21:24:51 - INFO - __main__ - Step 985: {'lr': 0.000246, 'samples': 189120, 'steps': 984, 'loss/train': 3.88590669631958} 11/06/2021 21:24:51 - INFO - __main__ - Step 986: {'lr': 0.00024625, 'samples': 189312, 'steps': 985, 'loss/train': 4.265262603759766} 11/06/2021 21:24:51 - INFO - __main__ - Step 987: {'lr': 0.00024650000000000003, 'samples': 189504, 'steps': 986, 'loss/train': 3.8113763332366943} 11/06/2021 21:24:52 - INFO - __main__ - Step 988: {'lr': 0.00024675, 'samples': 189696, 'steps': 987, 'loss/train': 4.336015701293945} 11/06/2021 21:24:53 - INFO - __main__ - Step 989: {'lr': 0.000247, 'samples': 189888, 'steps': 988, 'loss/train': 3.8807554244995117} 11/06/2021 21:24:53 - INFO - __main__ - Step 990: {'lr': 0.00024725, 'samples': 190080, 'steps': 989, 'loss/train': 3.6231210231781006} 11/06/2021 21:24:53 - INFO - __main__ - Step 991: {'lr': 0.0002475, 'samples': 190272, 'steps': 990, 'loss/train': 4.3600897789001465} 11/06/2021 21:24:54 - INFO - __main__ - Step 992: {'lr': 0.00024775, 'samples': 190464, 'steps': 991, 'loss/train': 4.003350734710693} 11/06/2021 21:24:55 - INFO - __main__ - Step 993: {'lr': 0.000248, 'samples': 190656, 'steps': 992, 'loss/train': 4.158915996551514} 11/06/2021 21:24:55 - INFO - __main__ - Step 994: {'lr': 0.00024825, 'samples': 190848, 'steps': 993, 'loss/train': 4.062311172485352} 11/06/2021 21:24:55 - INFO - __main__ - Step 995: {'lr': 0.0002485, 'samples': 191040, 'steps': 994, 'loss/train': 4.686992645263672} 11/06/2021 21:24:56 - INFO - __main__ - Step 996: {'lr': 0.00024875, 'samples': 191232, 'steps': 995, 'loss/train': 4.652307510375977} 11/06/2021 21:24:57 - INFO - __main__ - Step 997: {'lr': 0.000249, 'samples': 191424, 'steps': 996, 'loss/train': 4.7089762687683105} 11/06/2021 21:24:57 - INFO - __main__ - Step 998: {'lr': 0.00024925, 'samples': 191616, 'steps': 997, 'loss/train': 4.194471836090088} 11/06/2021 21:24:57 - INFO - __main__ - Step 999: {'lr': 0.0002495, 'samples': 191808, 'steps': 998, 'loss/train': 4.319421768188477} 11/06/2021 21:24:58 - INFO - __main__ - Step 1000: {'lr': 0.00024975, 'samples': 192000, 'steps': 999, 'loss/train': 4.688298225402832} 11/06/2021 21:24:58 - INFO - __main__ - Step 1001: {'lr': 0.00025, 'samples': 192192, 'steps': 1000, 'loss/train': 5.178300857543945} 11/06/2021 21:24:58 - INFO - __main__ - Step 1002: {'lr': 0.00025025, 'samples': 192384, 'steps': 1001, 'loss/train': 4.494810104370117} 11/06/2021 21:25:00 - INFO - __main__ - Step 1003: {'lr': 0.0002505, 'samples': 192576, 'steps': 1002, 'loss/train': 4.467245101928711} 11/06/2021 21:25:00 - INFO - __main__ - Step 1004: {'lr': 0.00025075, 'samples': 192768, 'steps': 1003, 'loss/train': 3.843966484069824} 11/06/2021 21:25:00 - INFO - __main__ - Step 1005: {'lr': 0.00025100000000000003, 'samples': 192960, 'steps': 1004, 'loss/train': 3.936264753341675} 11/06/2021 21:25:01 - INFO - __main__ - Step 1006: {'lr': 0.00025124999999999995, 'samples': 193152, 'steps': 1005, 'loss/train': 2.070441961288452} 11/06/2021 21:25:01 - INFO - __main__ - Step 1007: {'lr': 0.0002515, 'samples': 193344, 'steps': 1006, 'loss/train': 2.1154818534851074} 11/06/2021 21:25:01 - INFO - __main__ - Step 1008: {'lr': 0.00025174999999999997, 'samples': 193536, 'steps': 1007, 'loss/train': 4.228350639343262} 11/06/2021 21:25:02 - INFO - __main__ - Step 1009: {'lr': 0.000252, 'samples': 193728, 'steps': 1008, 'loss/train': 3.9860525131225586} 11/06/2021 21:25:03 - INFO - __main__ - Step 1010: {'lr': 0.00025225, 'samples': 193920, 'steps': 1009, 'loss/train': 4.507659912109375} 11/06/2021 21:25:03 - INFO - __main__ - Step 1011: {'lr': 0.0002525, 'samples': 194112, 'steps': 1010, 'loss/train': 4.104220867156982} 11/06/2021 21:25:04 - INFO - __main__ - Step 1012: {'lr': 0.00025275, 'samples': 194304, 'steps': 1011, 'loss/train': 3.661561965942383} 11/06/2021 21:25:04 - INFO - __main__ - Step 1013: {'lr': 0.000253, 'samples': 194496, 'steps': 1012, 'loss/train': 4.302820682525635} 11/06/2021 21:25:05 - INFO - __main__ - Step 1014: {'lr': 0.00025325, 'samples': 194688, 'steps': 1013, 'loss/train': 4.471502780914307} 11/06/2021 21:25:05 - INFO - __main__ - Step 1015: {'lr': 0.0002535, 'samples': 194880, 'steps': 1014, 'loss/train': 4.165027141571045} 11/06/2021 21:25:06 - INFO - __main__ - Step 1016: {'lr': 0.00025374999999999996, 'samples': 195072, 'steps': 1015, 'loss/train': 3.9462151527404785} 11/06/2021 21:25:06 - INFO - __main__ - Step 1017: {'lr': 0.000254, 'samples': 195264, 'steps': 1016, 'loss/train': 6.699620723724365} 11/06/2021 21:25:06 - INFO - __main__ - Step 1018: {'lr': 0.00025425, 'samples': 195456, 'steps': 1017, 'loss/train': 3.6332051753997803} 11/06/2021 21:25:07 - INFO - __main__ - Step 1019: {'lr': 0.0002545, 'samples': 195648, 'steps': 1018, 'loss/train': 4.286611080169678} 11/06/2021 21:25:08 - INFO - __main__ - Step 1020: {'lr': 0.00025475, 'samples': 195840, 'steps': 1019, 'loss/train': 3.8143672943115234} 11/06/2021 21:25:08 - INFO - __main__ - Step 1021: {'lr': 0.000255, 'samples': 196032, 'steps': 1020, 'loss/train': 3.97232723236084} 11/06/2021 21:25:08 - INFO - __main__ - Step 1022: {'lr': 0.00025525, 'samples': 196224, 'steps': 1021, 'loss/train': 4.28098726272583} 11/06/2021 21:25:09 - INFO - __main__ - Step 1023: {'lr': 0.00025550000000000003, 'samples': 196416, 'steps': 1022, 'loss/train': 4.043383598327637} 11/06/2021 21:25:10 - INFO - __main__ - Step 1024: {'lr': 0.00025575, 'samples': 196608, 'steps': 1023, 'loss/train': 4.659992218017578} 11/06/2021 21:25:10 - INFO - __main__ - Step 1025: {'lr': 0.000256, 'samples': 196800, 'steps': 1024, 'loss/train': 4.093658924102783} 11/06/2021 21:25:10 - INFO - __main__ - Step 1026: {'lr': 0.00025624999999999997, 'samples': 196992, 'steps': 1025, 'loss/train': 3.535585880279541} 11/06/2021 21:25:11 - INFO - __main__ - Step 1027: {'lr': 0.0002565, 'samples': 197184, 'steps': 1026, 'loss/train': 4.1824493408203125} 11/06/2021 21:25:11 - INFO - __main__ - Step 1028: {'lr': 0.00025675, 'samples': 197376, 'steps': 1027, 'loss/train': 4.0930609703063965} 11/06/2021 21:25:11 - INFO - __main__ - Step 1029: {'lr': 0.000257, 'samples': 197568, 'steps': 1028, 'loss/train': 3.6540799140930176} 11/06/2021 21:25:13 - INFO - __main__ - Step 1030: {'lr': 0.00025725, 'samples': 197760, 'steps': 1029, 'loss/train': 4.0925822257995605} 11/06/2021 21:25:13 - INFO - __main__ - Step 1031: {'lr': 0.0002575, 'samples': 197952, 'steps': 1030, 'loss/train': 3.85776424407959} 11/06/2021 21:25:13 - INFO - __main__ - Step 1032: {'lr': 0.00025775, 'samples': 198144, 'steps': 1031, 'loss/train': 4.097292423248291} 11/06/2021 21:25:14 - INFO - __main__ - Step 1033: {'lr': 0.00025800000000000004, 'samples': 198336, 'steps': 1032, 'loss/train': 1.9851371049880981} 11/06/2021 21:25:14 - INFO - __main__ - Step 1034: {'lr': 0.00025824999999999996, 'samples': 198528, 'steps': 1033, 'loss/train': 4.119291305541992} 11/06/2021 21:25:15 - INFO - __main__ - Step 1035: {'lr': 0.0002585, 'samples': 198720, 'steps': 1034, 'loss/train': 4.1994733810424805} 11/06/2021 21:25:15 - INFO - __main__ - Step 1036: {'lr': 0.00025875, 'samples': 198912, 'steps': 1035, 'loss/train': 4.295950412750244} 11/06/2021 21:25:16 - INFO - __main__ - Step 1037: {'lr': 0.000259, 'samples': 199104, 'steps': 1036, 'loss/train': 3.5869834423065186} 11/06/2021 21:25:16 - INFO - __main__ - Step 1038: {'lr': 0.00025925, 'samples': 199296, 'steps': 1037, 'loss/train': 5.716175556182861} 11/06/2021 21:25:16 - INFO - __main__ - Step 1039: {'lr': 0.0002595, 'samples': 199488, 'steps': 1038, 'loss/train': 4.104729175567627} 11/06/2021 21:25:18 - INFO - __main__ - Step 1040: {'lr': 0.00025975, 'samples': 199680, 'steps': 1039, 'loss/train': 4.121339321136475} 11/06/2021 21:25:18 - INFO - __main__ - Step 1041: {'lr': 0.00026000000000000003, 'samples': 199872, 'steps': 1040, 'loss/train': 4.044469356536865} 11/06/2021 21:25:18 - INFO - __main__ - Step 1042: {'lr': 0.00026025, 'samples': 200064, 'steps': 1041, 'loss/train': 4.489924430847168} 11/06/2021 21:25:19 - INFO - __main__ - Step 1043: {'lr': 0.0002605, 'samples': 200256, 'steps': 1042, 'loss/train': 4.626270771026611} 11/06/2021 21:25:19 - INFO - __main__ - Step 1044: {'lr': 0.00026074999999999997, 'samples': 200448, 'steps': 1043, 'loss/train': 3.788095712661743} 11/06/2021 21:25:20 - INFO - __main__ - Step 1045: {'lr': 0.000261, 'samples': 200640, 'steps': 1044, 'loss/train': 4.697029113769531} 11/06/2021 21:25:20 - INFO - __main__ - Step 1046: {'lr': 0.00026125, 'samples': 200832, 'steps': 1045, 'loss/train': 3.7859976291656494} 11/06/2021 21:25:21 - INFO - __main__ - Step 1047: {'lr': 0.0002615, 'samples': 201024, 'steps': 1046, 'loss/train': 4.636324405670166} 11/06/2021 21:25:21 - INFO - __main__ - Step 1048: {'lr': 0.00026175, 'samples': 201216, 'steps': 1047, 'loss/train': 4.295483112335205} 11/06/2021 21:25:21 - INFO - __main__ - Step 1049: {'lr': 0.000262, 'samples': 201408, 'steps': 1048, 'loss/train': 3.938405990600586} 11/06/2021 21:25:22 - INFO - __main__ - Step 1050: {'lr': 0.00026225, 'samples': 201600, 'steps': 1049, 'loss/train': 5.09901762008667} 11/06/2021 21:25:23 - INFO - __main__ - Step 1051: {'lr': 0.00026250000000000004, 'samples': 201792, 'steps': 1050, 'loss/train': 4.2966108322143555} 11/06/2021 21:25:23 - INFO - __main__ - Step 1052: {'lr': 0.00026274999999999996, 'samples': 201984, 'steps': 1051, 'loss/train': 3.858107805252075} 11/06/2021 21:25:23 - INFO - __main__ - Step 1053: {'lr': 0.000263, 'samples': 202176, 'steps': 1052, 'loss/train': 4.3338303565979} 11/06/2021 21:25:24 - INFO - __main__ - Step 1054: {'lr': 0.00026325, 'samples': 202368, 'steps': 1053, 'loss/train': 4.086007118225098} 11/06/2021 21:25:25 - INFO - __main__ - Step 1055: {'lr': 0.0002635, 'samples': 202560, 'steps': 1054, 'loss/train': 3.4907381534576416} 11/06/2021 21:25:25 - INFO - __main__ - Step 1056: {'lr': 0.00026375, 'samples': 202752, 'steps': 1055, 'loss/train': 6.328280925750732} 11/06/2021 21:25:26 - INFO - __main__ - Step 1057: {'lr': 0.000264, 'samples': 202944, 'steps': 1056, 'loss/train': 4.75831413269043} 11/06/2021 21:25:26 - INFO - __main__ - Step 1058: {'lr': 0.00026425, 'samples': 203136, 'steps': 1057, 'loss/train': 4.140725135803223} 11/06/2021 21:25:26 - INFO - __main__ - Step 1059: {'lr': 0.00026450000000000003, 'samples': 203328, 'steps': 1058, 'loss/train': 4.232659339904785} 11/06/2021 21:25:27 - INFO - __main__ - Step 1060: {'lr': 0.00026475, 'samples': 203520, 'steps': 1059, 'loss/train': 3.662130117416382} 11/06/2021 21:25:28 - INFO - __main__ - Step 1061: {'lr': 0.00026500000000000004, 'samples': 203712, 'steps': 1060, 'loss/train': 4.175182819366455} 11/06/2021 21:25:28 - INFO - __main__ - Step 1062: {'lr': 0.00026524999999999997, 'samples': 203904, 'steps': 1061, 'loss/train': 4.066892623901367} 11/06/2021 21:25:28 - INFO - __main__ - Step 1063: {'lr': 0.0002655, 'samples': 204096, 'steps': 1062, 'loss/train': 3.988063335418701} 11/06/2021 21:25:29 - INFO - __main__ - Step 1064: {'lr': 0.00026575, 'samples': 204288, 'steps': 1063, 'loss/train': 5.1581830978393555} 11/06/2021 21:25:29 - INFO - __main__ - Step 1065: {'lr': 0.000266, 'samples': 204480, 'steps': 1064, 'loss/train': 3.77549409866333} 11/06/2021 21:25:30 - INFO - __main__ - Step 1066: {'lr': 0.00026625, 'samples': 204672, 'steps': 1065, 'loss/train': 3.962754011154175} 11/06/2021 21:25:30 - INFO - __main__ - Step 1067: {'lr': 0.0002665, 'samples': 204864, 'steps': 1066, 'loss/train': 4.140915393829346} 11/06/2021 21:25:31 - INFO - __main__ - Step 1068: {'lr': 0.00026675, 'samples': 205056, 'steps': 1067, 'loss/train': 3.9314892292022705} 11/06/2021 21:25:31 - INFO - __main__ - Step 1069: {'lr': 0.00026700000000000004, 'samples': 205248, 'steps': 1068, 'loss/train': 4.5690155029296875} 11/06/2021 21:25:31 - INFO - __main__ - Step 1070: {'lr': 0.00026725, 'samples': 205440, 'steps': 1069, 'loss/train': 2.362624168395996} 11/06/2021 21:25:33 - INFO - __main__ - Step 1071: {'lr': 0.0002675, 'samples': 205632, 'steps': 1070, 'loss/train': 3.6357979774475098} 11/06/2021 21:25:33 - INFO - __main__ - Step 1072: {'lr': 0.00026775, 'samples': 205824, 'steps': 1071, 'loss/train': 4.618821144104004} 11/06/2021 21:25:33 - INFO - __main__ - Step 1073: {'lr': 0.000268, 'samples': 206016, 'steps': 1072, 'loss/train': 3.773068904876709} 11/06/2021 21:25:34 - INFO - __main__ - Step 1074: {'lr': 0.00026825, 'samples': 206208, 'steps': 1073, 'loss/train': 3.240086078643799} 11/06/2021 21:25:34 - INFO - __main__ - Step 1075: {'lr': 0.0002685, 'samples': 206400, 'steps': 1074, 'loss/train': 5.478127479553223} 11/06/2021 21:25:35 - INFO - __main__ - Step 1076: {'lr': 0.00026875, 'samples': 206592, 'steps': 1075, 'loss/train': 4.112662315368652} 11/06/2021 21:25:35 - INFO - __main__ - Step 1077: {'lr': 0.00026900000000000003, 'samples': 206784, 'steps': 1076, 'loss/train': 4.273911476135254} 11/06/2021 21:25:36 - INFO - __main__ - Step 1078: {'lr': 0.00026925, 'samples': 206976, 'steps': 1077, 'loss/train': 4.561737060546875} 11/06/2021 21:25:36 - INFO - __main__ - Step 1079: {'lr': 0.00026950000000000005, 'samples': 207168, 'steps': 1078, 'loss/train': 4.370131015777588} 11/06/2021 21:25:36 - INFO - __main__ - Step 1080: {'lr': 0.00026974999999999997, 'samples': 207360, 'steps': 1079, 'loss/train': 4.151457786560059} 11/06/2021 21:25:37 - INFO - __main__ - Step 1081: {'lr': 0.00027, 'samples': 207552, 'steps': 1080, 'loss/train': 4.187550067901611} 11/06/2021 21:25:38 - INFO - __main__ - Step 1082: {'lr': 0.00027025, 'samples': 207744, 'steps': 1081, 'loss/train': 3.7700130939483643} 11/06/2021 21:25:38 - INFO - __main__ - Step 1083: {'lr': 0.0002705, 'samples': 207936, 'steps': 1082, 'loss/train': 3.5349581241607666} 11/06/2021 21:25:38 - INFO - __main__ - Step 1084: {'lr': 0.00027075, 'samples': 208128, 'steps': 1083, 'loss/train': 3.959185838699341} 11/06/2021 21:25:39 - INFO - __main__ - Step 1085: {'lr': 0.00027100000000000003, 'samples': 208320, 'steps': 1084, 'loss/train': 4.234473705291748} 11/06/2021 21:25:40 - INFO - __main__ - Step 1086: {'lr': 0.00027125, 'samples': 208512, 'steps': 1085, 'loss/train': 4.3777313232421875} 11/06/2021 21:25:40 - INFO - __main__ - Step 1087: {'lr': 0.00027150000000000004, 'samples': 208704, 'steps': 1086, 'loss/train': 4.0575103759765625} 11/06/2021 21:25:41 - INFO - __main__ - Step 1088: {'lr': 0.00027175, 'samples': 208896, 'steps': 1087, 'loss/train': 3.6606838703155518} 11/06/2021 21:25:41 - INFO - __main__ - Step 1089: {'lr': 0.00027200000000000005, 'samples': 209088, 'steps': 1088, 'loss/train': 4.134492874145508} 11/06/2021 21:25:41 - INFO - __main__ - Step 1090: {'lr': 0.00027225, 'samples': 209280, 'steps': 1089, 'loss/train': 3.7823238372802734} 11/06/2021 21:25:42 - INFO - __main__ - Step 1091: {'lr': 0.0002725, 'samples': 209472, 'steps': 1090, 'loss/train': 3.5656397342681885} 11/06/2021 21:25:43 - INFO - __main__ - Step 1092: {'lr': 0.00027275, 'samples': 209664, 'steps': 1091, 'loss/train': 4.612241744995117} 11/06/2021 21:25:43 - INFO - __main__ - Step 1093: {'lr': 0.000273, 'samples': 209856, 'steps': 1092, 'loss/train': 4.211349010467529} 11/06/2021 21:25:43 - INFO - __main__ - Step 1094: {'lr': 0.00027325, 'samples': 210048, 'steps': 1093, 'loss/train': 3.8772196769714355} 11/06/2021 21:25:44 - INFO - __main__ - Step 1095: {'lr': 0.00027350000000000003, 'samples': 210240, 'steps': 1094, 'loss/train': 4.242520332336426} 11/06/2021 21:25:45 - INFO - __main__ - Step 1096: {'lr': 0.00027375, 'samples': 210432, 'steps': 1095, 'loss/train': 4.067473411560059} 11/06/2021 21:25:45 - INFO - __main__ - Step 1097: {'lr': 0.00027400000000000005, 'samples': 210624, 'steps': 1096, 'loss/train': 4.022539138793945} 11/06/2021 21:25:46 - INFO - __main__ - Step 1098: {'lr': 0.00027425, 'samples': 210816, 'steps': 1097, 'loss/train': 3.810528516769409} 11/06/2021 21:25:46 - INFO - __main__ - Step 1099: {'lr': 0.0002745, 'samples': 211008, 'steps': 1098, 'loss/train': 4.089423179626465} 11/06/2021 21:25:47 - INFO - __main__ - Step 1100: {'lr': 0.00027475, 'samples': 211200, 'steps': 1099, 'loss/train': 4.257665157318115} 11/06/2021 21:25:48 - INFO - __main__ - Step 1101: {'lr': 0.000275, 'samples': 211392, 'steps': 1100, 'loss/train': 4.2323384284973145} 11/06/2021 21:25:48 - INFO - __main__ - Step 1102: {'lr': 0.00027525, 'samples': 211584, 'steps': 1101, 'loss/train': 4.789181709289551} 11/06/2021 21:25:48 - INFO - __main__ - Step 1103: {'lr': 0.00027550000000000003, 'samples': 211776, 'steps': 1102, 'loss/train': 6.588270664215088} 11/06/2021 21:25:49 - INFO - __main__ - Step 1104: {'lr': 0.00027575, 'samples': 211968, 'steps': 1103, 'loss/train': 3.7694168090820312} 11/06/2021 21:25:49 - INFO - __main__ - Step 1105: {'lr': 0.00027600000000000004, 'samples': 212160, 'steps': 1104, 'loss/train': 3.7395424842834473} 11/06/2021 21:25:50 - INFO - __main__ - Step 1106: {'lr': 0.00027625, 'samples': 212352, 'steps': 1105, 'loss/train': 4.408657073974609} 11/06/2021 21:25:51 - INFO - __main__ - Step 1107: {'lr': 0.00027650000000000005, 'samples': 212544, 'steps': 1106, 'loss/train': 4.209283828735352} 11/06/2021 21:25:51 - INFO - __main__ - Step 1108: {'lr': 0.00027675, 'samples': 212736, 'steps': 1107, 'loss/train': 4.449584007263184} 11/06/2021 21:25:51 - INFO - __main__ - Step 1109: {'lr': 0.000277, 'samples': 212928, 'steps': 1108, 'loss/train': 3.7549164295196533} 11/06/2021 21:25:52 - INFO - __main__ - Step 1110: {'lr': 0.00027725, 'samples': 213120, 'steps': 1109, 'loss/train': 4.272808074951172} 11/06/2021 21:25:52 - INFO - __main__ - Step 1111: {'lr': 0.0002775, 'samples': 213312, 'steps': 1110, 'loss/train': 4.251165390014648} 11/06/2021 21:25:53 - INFO - __main__ - Step 1112: {'lr': 0.00027775, 'samples': 213504, 'steps': 1111, 'loss/train': 4.609951972961426} 11/06/2021 21:25:54 - INFO - __main__ - Step 1113: {'lr': 0.00027800000000000004, 'samples': 213696, 'steps': 1112, 'loss/train': 4.141632556915283} 11/06/2021 21:25:54 - INFO - __main__ - Step 1114: {'lr': 0.00027825, 'samples': 213888, 'steps': 1113, 'loss/train': 4.153262615203857} 11/06/2021 21:25:54 - INFO - __main__ - Step 1115: {'lr': 0.00027850000000000005, 'samples': 214080, 'steps': 1114, 'loss/train': 3.717008113861084} 11/06/2021 21:25:55 - INFO - __main__ - Step 1116: {'lr': 0.00027875, 'samples': 214272, 'steps': 1115, 'loss/train': 3.0349667072296143} 11/06/2021 21:25:56 - INFO - __main__ - Step 1117: {'lr': 0.000279, 'samples': 214464, 'steps': 1116, 'loss/train': 5.148609161376953} 11/06/2021 21:25:56 - INFO - __main__ - Step 1118: {'lr': 0.00027925, 'samples': 214656, 'steps': 1117, 'loss/train': 4.0414958000183105} 11/06/2021 21:25:56 - INFO - __main__ - Step 1119: {'lr': 0.0002795, 'samples': 214848, 'steps': 1118, 'loss/train': 3.6977531909942627} 11/06/2021 21:25:57 - INFO - __main__ - Step 1120: {'lr': 0.00027975, 'samples': 215040, 'steps': 1119, 'loss/train': 3.3791584968566895} 11/06/2021 21:25:57 - INFO - __main__ - Step 1121: {'lr': 0.00028000000000000003, 'samples': 215232, 'steps': 1120, 'loss/train': 3.486509084701538} 11/06/2021 21:25:57 - INFO - __main__ - Step 1122: {'lr': 0.00028025, 'samples': 215424, 'steps': 1121, 'loss/train': 4.281680583953857} 11/06/2021 21:25:59 - INFO - __main__ - Step 1123: {'lr': 0.00028050000000000004, 'samples': 215616, 'steps': 1122, 'loss/train': 5.040244102478027} 11/06/2021 21:25:59 - INFO - __main__ - Step 1124: {'lr': 0.00028075, 'samples': 215808, 'steps': 1123, 'loss/train': 3.963624954223633} 11/06/2021 21:25:59 - INFO - __main__ - Step 1125: {'lr': 0.00028100000000000005, 'samples': 216000, 'steps': 1124, 'loss/train': 3.8375720977783203} 11/06/2021 21:26:00 - INFO - __main__ - Step 1126: {'lr': 0.00028125000000000003, 'samples': 216192, 'steps': 1125, 'loss/train': 4.240171432495117} 11/06/2021 21:26:00 - INFO - __main__ - Step 1127: {'lr': 0.00028149999999999996, 'samples': 216384, 'steps': 1126, 'loss/train': 3.637873888015747} 11/06/2021 21:26:01 - INFO - __main__ - Step 1128: {'lr': 0.00028175, 'samples': 216576, 'steps': 1127, 'loss/train': 3.8652379512786865} 11/06/2021 21:26:01 - INFO - __main__ - Step 1129: {'lr': 0.00028199999999999997, 'samples': 216768, 'steps': 1128, 'loss/train': 4.120966911315918} 11/06/2021 21:26:02 - INFO - __main__ - Step 1130: {'lr': 0.00028225, 'samples': 216960, 'steps': 1129, 'loss/train': 4.294138431549072} 11/06/2021 21:26:02 - INFO - __main__ - Step 1131: {'lr': 0.0002825, 'samples': 217152, 'steps': 1130, 'loss/train': 3.7788121700286865} 11/06/2021 21:26:02 - INFO - __main__ - Step 1132: {'lr': 0.00028275, 'samples': 217344, 'steps': 1131, 'loss/train': 3.901336908340454} 11/06/2021 21:26:03 - INFO - __main__ - Step 1133: {'lr': 0.000283, 'samples': 217536, 'steps': 1132, 'loss/train': 4.027707099914551} 11/06/2021 21:26:04 - INFO - __main__ - Step 1134: {'lr': 0.00028325000000000003, 'samples': 217728, 'steps': 1133, 'loss/train': 4.218957424163818} 11/06/2021 21:26:04 - INFO - __main__ - Step 1135: {'lr': 0.0002835, 'samples': 217920, 'steps': 1134, 'loss/train': 4.311142444610596} 11/06/2021 21:26:05 - INFO - __main__ - Step 1136: {'lr': 0.00028375, 'samples': 218112, 'steps': 1135, 'loss/train': 4.181615352630615} 11/06/2021 21:26:05 - INFO - __main__ - Step 1137: {'lr': 0.00028399999999999996, 'samples': 218304, 'steps': 1136, 'loss/train': 3.916645050048828} 11/06/2021 21:26:06 - INFO - __main__ - Step 1138: {'lr': 0.00028425, 'samples': 218496, 'steps': 1137, 'loss/train': 4.085984706878662} 11/06/2021 21:26:06 - INFO - __main__ - Step 1139: {'lr': 0.0002845, 'samples': 218688, 'steps': 1138, 'loss/train': 4.673676013946533} 11/06/2021 21:26:07 - INFO - __main__ - Step 1140: {'lr': 0.00028475, 'samples': 218880, 'steps': 1139, 'loss/train': 4.280994415283203} 11/06/2021 21:26:07 - INFO - __main__ - Step 1141: {'lr': 0.000285, 'samples': 219072, 'steps': 1140, 'loss/train': 4.047427177429199} 11/06/2021 21:26:07 - INFO - __main__ - Step 1142: {'lr': 0.00028525, 'samples': 219264, 'steps': 1141, 'loss/train': 3.885024070739746} 11/06/2021 21:26:08 - INFO - __main__ - Step 1143: {'lr': 0.0002855, 'samples': 219456, 'steps': 1142, 'loss/train': 4.40393590927124} 11/06/2021 21:26:09 - INFO - __main__ - Step 1144: {'lr': 0.00028575000000000003, 'samples': 219648, 'steps': 1143, 'loss/train': 4.992481708526611} 11/06/2021 21:26:09 - INFO - __main__ - Step 1145: {'lr': 0.00028599999999999996, 'samples': 219840, 'steps': 1144, 'loss/train': 4.026183128356934} 11/06/2021 21:26:10 - INFO - __main__ - Step 1146: {'lr': 0.00028625, 'samples': 220032, 'steps': 1145, 'loss/train': 3.5987935066223145} 11/06/2021 21:26:10 - INFO - __main__ - Step 1147: {'lr': 0.00028649999999999997, 'samples': 220224, 'steps': 1146, 'loss/train': 4.403866767883301} 11/06/2021 21:26:11 - INFO - __main__ - Step 1148: {'lr': 0.00028675, 'samples': 220416, 'steps': 1147, 'loss/train': 3.9268059730529785} 11/06/2021 21:26:11 - INFO - __main__ - Step 1149: {'lr': 0.000287, 'samples': 220608, 'steps': 1148, 'loss/train': 3.6902759075164795} 11/06/2021 21:26:12 - INFO - __main__ - Step 1150: {'lr': 0.00028725, 'samples': 220800, 'steps': 1149, 'loss/train': 4.134997367858887} 11/06/2021 21:26:12 - INFO - __main__ - Step 1151: {'lr': 0.0002875, 'samples': 220992, 'steps': 1150, 'loss/train': 3.9367594718933105} 11/06/2021 21:26:12 - INFO - __main__ - Step 1152: {'lr': 0.00028775000000000003, 'samples': 221184, 'steps': 1151, 'loss/train': 3.4053988456726074} 11/06/2021 21:26:13 - INFO - __main__ - Step 1153: {'lr': 0.000288, 'samples': 221376, 'steps': 1152, 'loss/train': 4.11300802230835} 11/06/2021 21:26:14 - INFO - __main__ - Step 1154: {'lr': 0.00028825, 'samples': 221568, 'steps': 1153, 'loss/train': 3.790684700012207} 11/06/2021 21:26:14 - INFO - __main__ - Step 1155: {'lr': 0.00028849999999999997, 'samples': 221760, 'steps': 1154, 'loss/train': 4.096200942993164} 11/06/2021 21:26:14 - INFO - __main__ - Step 1156: {'lr': 0.00028875, 'samples': 221952, 'steps': 1155, 'loss/train': 4.3799262046813965} 11/06/2021 21:26:15 - INFO - __main__ - Step 1157: {'lr': 0.000289, 'samples': 222144, 'steps': 1156, 'loss/train': 4.1134748458862305} 11/06/2021 21:26:15 - INFO - __main__ - Step 1158: {'lr': 0.00028925, 'samples': 222336, 'steps': 1157, 'loss/train': 4.462249755859375} 11/06/2021 21:26:16 - INFO - __main__ - Step 1159: {'lr': 0.0002895, 'samples': 222528, 'steps': 1158, 'loss/train': 2.7520174980163574} 11/06/2021 21:26:16 - INFO - __main__ - Step 1160: {'lr': 0.00028975, 'samples': 222720, 'steps': 1159, 'loss/train': 4.172967910766602} 11/06/2021 21:26:17 - INFO - __main__ - Step 1161: {'lr': 0.00029, 'samples': 222912, 'steps': 1160, 'loss/train': 3.451906204223633} 11/06/2021 21:26:17 - INFO - __main__ - Step 1162: {'lr': 0.00029025000000000003, 'samples': 223104, 'steps': 1161, 'loss/train': 4.355815887451172} 11/06/2021 21:26:18 - INFO - __main__ - Step 1163: {'lr': 0.00029049999999999996, 'samples': 223296, 'steps': 1162, 'loss/train': 4.1517109870910645} 11/06/2021 21:26:19 - INFO - __main__ - Step 1164: {'lr': 0.00029075, 'samples': 223488, 'steps': 1163, 'loss/train': 4.004212856292725} 11/06/2021 21:26:19 - INFO - __main__ - Step 1165: {'lr': 0.00029099999999999997, 'samples': 223680, 'steps': 1164, 'loss/train': 4.073708534240723} 11/06/2021 21:26:19 - INFO - __main__ - Step 1166: {'lr': 0.00029125, 'samples': 223872, 'steps': 1165, 'loss/train': 4.045283317565918} 11/06/2021 21:26:20 - INFO - __main__ - Step 1167: {'lr': 0.0002915, 'samples': 224064, 'steps': 1166, 'loss/train': 3.959521770477295} 11/06/2021 21:26:20 - INFO - __main__ - Step 1168: {'lr': 0.00029175, 'samples': 224256, 'steps': 1167, 'loss/train': 3.8172402381896973} 11/06/2021 21:26:21 - INFO - __main__ - Step 1169: {'lr': 0.000292, 'samples': 224448, 'steps': 1168, 'loss/train': 3.809446096420288} 11/06/2021 21:26:21 - INFO - __main__ - Step 1170: {'lr': 0.00029225000000000003, 'samples': 224640, 'steps': 1169, 'loss/train': 4.352285385131836} 11/06/2021 21:26:22 - INFO - __main__ - Step 1171: {'lr': 0.0002925, 'samples': 224832, 'steps': 1170, 'loss/train': 3.0493404865264893} 11/06/2021 21:26:22 - INFO - __main__ - Step 1172: {'lr': 0.00029275000000000004, 'samples': 225024, 'steps': 1171, 'loss/train': 4.092573642730713} 11/06/2021 21:26:22 - INFO - __main__ - Step 1173: {'lr': 0.00029299999999999997, 'samples': 225216, 'steps': 1172, 'loss/train': 4.186103820800781} 11/06/2021 21:26:23 - INFO - __main__ - Step 1174: {'lr': 0.00029325, 'samples': 225408, 'steps': 1173, 'loss/train': 3.786396026611328} 11/06/2021 21:26:24 - INFO - __main__ - Step 1175: {'lr': 0.0002935, 'samples': 225600, 'steps': 1174, 'loss/train': 4.53329610824585} 11/06/2021 21:26:24 - INFO - __main__ - Step 1176: {'lr': 0.00029375, 'samples': 225792, 'steps': 1175, 'loss/train': 3.6532394886016846} 11/06/2021 21:26:24 - INFO - __main__ - Step 1177: {'lr': 0.000294, 'samples': 225984, 'steps': 1176, 'loss/train': 3.6320600509643555} 11/06/2021 21:26:25 - INFO - __main__ - Step 1178: {'lr': 0.00029425, 'samples': 226176, 'steps': 1177, 'loss/train': 3.9214487075805664} 11/06/2021 21:26:26 - INFO - __main__ - Step 1179: {'lr': 0.0002945, 'samples': 226368, 'steps': 1178, 'loss/train': 3.6811251640319824} 11/06/2021 21:26:26 - INFO - __main__ - Step 1180: {'lr': 0.00029475000000000004, 'samples': 226560, 'steps': 1179, 'loss/train': 3.892512559890747} 11/06/2021 21:26:27 - INFO - __main__ - Step 1181: {'lr': 0.000295, 'samples': 226752, 'steps': 1180, 'loss/train': 4.335596561431885} 11/06/2021 21:26:27 - INFO - __main__ - Step 1182: {'lr': 0.00029525, 'samples': 226944, 'steps': 1181, 'loss/train': 3.9504594802856445} 11/06/2021 21:26:27 - INFO - __main__ - Step 1183: {'lr': 0.00029549999999999997, 'samples': 227136, 'steps': 1182, 'loss/train': 4.172230243682861} 11/06/2021 21:26:28 - INFO - __main__ - Step 1184: {'lr': 0.00029575, 'samples': 227328, 'steps': 1183, 'loss/train': 4.093757152557373} 11/06/2021 21:26:29 - INFO - __main__ - Step 1185: {'lr': 0.000296, 'samples': 227520, 'steps': 1184, 'loss/train': 3.9272379875183105} 11/06/2021 21:26:29 - INFO - __main__ - Step 1186: {'lr': 0.00029625, 'samples': 227712, 'steps': 1185, 'loss/train': 3.678920269012451} 11/06/2021 21:26:29 - INFO - __main__ - Step 1187: {'lr': 0.0002965, 'samples': 227904, 'steps': 1186, 'loss/train': 3.9552578926086426} 11/06/2021 21:26:30 - INFO - __main__ - Step 1188: {'lr': 0.00029675000000000003, 'samples': 228096, 'steps': 1187, 'loss/train': 3.7164466381073} 11/06/2021 21:26:30 - INFO - __main__ - Step 1189: {'lr': 0.000297, 'samples': 228288, 'steps': 1188, 'loss/train': 4.196506977081299} 11/06/2021 21:26:31 - INFO - __main__ - Step 1190: {'lr': 0.00029725000000000004, 'samples': 228480, 'steps': 1189, 'loss/train': 3.887439489364624} 11/06/2021 21:26:31 - INFO - __main__ - Step 1191: {'lr': 0.00029749999999999997, 'samples': 228672, 'steps': 1190, 'loss/train': 3.7323334217071533} 11/06/2021 21:26:32 - INFO - __main__ - Step 1192: {'lr': 0.00029775, 'samples': 228864, 'steps': 1191, 'loss/train': 4.008239269256592} 11/06/2021 21:26:32 - INFO - __main__ - Step 1193: {'lr': 0.000298, 'samples': 229056, 'steps': 1192, 'loss/train': 5.458200454711914} 11/06/2021 21:26:33 - INFO - __main__ - Step 1194: {'lr': 0.00029825, 'samples': 229248, 'steps': 1193, 'loss/train': 3.7591049671173096} 11/06/2021 21:26:34 - INFO - __main__ - Step 1195: {'lr': 0.0002985, 'samples': 229440, 'steps': 1194, 'loss/train': 4.328658103942871} 11/06/2021 21:26:34 - INFO - __main__ - Step 1196: {'lr': 0.00029875, 'samples': 229632, 'steps': 1195, 'loss/train': 3.8931264877319336} 11/06/2021 21:26:34 - INFO - __main__ - Step 1197: {'lr': 0.000299, 'samples': 229824, 'steps': 1196, 'loss/train': 4.966779708862305} 11/06/2021 21:26:35 - INFO - __main__ - Step 1198: {'lr': 0.00029925000000000004, 'samples': 230016, 'steps': 1197, 'loss/train': 3.1090831756591797} 11/06/2021 21:26:35 - INFO - __main__ - Step 1199: {'lr': 0.0002995, 'samples': 230208, 'steps': 1198, 'loss/train': 3.9816088676452637} 11/06/2021 21:26:36 - INFO - __main__ - Step 1200: {'lr': 0.00029975000000000005, 'samples': 230400, 'steps': 1199, 'loss/train': 4.041784286499023} 11/06/2021 21:26:36 - INFO - __main__ - Step 1201: {'lr': 0.0003, 'samples': 230592, 'steps': 1200, 'loss/train': 3.73962664604187} 11/06/2021 21:26:37 - INFO - __main__ - Step 1202: {'lr': 0.00030025, 'samples': 230784, 'steps': 1201, 'loss/train': 3.259591579437256} 11/06/2021 21:26:37 - INFO - __main__ - Step 1203: {'lr': 0.0003005, 'samples': 230976, 'steps': 1202, 'loss/train': 4.60701322555542} 11/06/2021 21:26:37 - INFO - __main__ - Step 1204: {'lr': 0.00030075, 'samples': 231168, 'steps': 1203, 'loss/train': 4.123647689819336} 11/06/2021 21:26:39 - INFO - __main__ - Step 1205: {'lr': 0.000301, 'samples': 231360, 'steps': 1204, 'loss/train': 4.385739326477051} 11/06/2021 21:26:39 - INFO - __main__ - Step 1206: {'lr': 0.00030125000000000003, 'samples': 231552, 'steps': 1205, 'loss/train': 3.796212911605835} 11/06/2021 21:26:39 - INFO - __main__ - Step 1207: {'lr': 0.0003015, 'samples': 231744, 'steps': 1206, 'loss/train': 4.638833999633789} 11/06/2021 21:26:40 - INFO - __main__ - Step 1208: {'lr': 0.00030175000000000004, 'samples': 231936, 'steps': 1207, 'loss/train': 4.022327423095703} 11/06/2021 21:26:40 - INFO - __main__ - Step 1209: {'lr': 0.000302, 'samples': 232128, 'steps': 1208, 'loss/train': 4.19215202331543} 11/06/2021 21:26:41 - INFO - __main__ - Step 1210: {'lr': 0.00030225, 'samples': 232320, 'steps': 1209, 'loss/train': 4.021753787994385} 11/06/2021 21:26:41 - INFO - __main__ - Step 1211: {'lr': 0.0003025, 'samples': 232512, 'steps': 1210, 'loss/train': 3.7708232402801514} 11/06/2021 21:26:42 - INFO - __main__ - Step 1212: {'lr': 0.00030275, 'samples': 232704, 'steps': 1211, 'loss/train': 3.500831365585327} 11/06/2021 21:26:42 - INFO - __main__ - Step 1213: {'lr': 0.000303, 'samples': 232896, 'steps': 1212, 'loss/train': 3.7347517013549805} 11/06/2021 21:26:42 - INFO - __main__ - Step 1214: {'lr': 0.00030325, 'samples': 233088, 'steps': 1213, 'loss/train': 3.712738275527954} 11/06/2021 21:26:43 - INFO - __main__ - Step 1215: {'lr': 0.0003035, 'samples': 233280, 'steps': 1214, 'loss/train': 4.544304370880127} 11/06/2021 21:26:44 - INFO - __main__ - Step 1216: {'lr': 0.00030375000000000004, 'samples': 233472, 'steps': 1215, 'loss/train': 3.8694875240325928} 11/06/2021 21:26:44 - INFO - __main__ - Step 1217: {'lr': 0.000304, 'samples': 233664, 'steps': 1216, 'loss/train': 3.3171164989471436} 11/06/2021 21:26:44 - INFO - __main__ - Step 1218: {'lr': 0.00030425000000000005, 'samples': 233856, 'steps': 1217, 'loss/train': 3.7530202865600586} 11/06/2021 21:26:45 - INFO - __main__ - Step 1219: {'lr': 0.0003045, 'samples': 234048, 'steps': 1218, 'loss/train': 3.967923879623413} 11/06/2021 21:26:45 - INFO - __main__ - Step 1220: {'lr': 0.00030475, 'samples': 234240, 'steps': 1219, 'loss/train': 3.715089797973633} 11/06/2021 21:26:46 - INFO - __main__ - Step 1221: {'lr': 0.000305, 'samples': 234432, 'steps': 1220, 'loss/train': 3.693310260772705} 11/06/2021 21:26:46 - INFO - __main__ - Step 1222: {'lr': 0.00030525, 'samples': 234624, 'steps': 1221, 'loss/train': 3.5755531787872314} 11/06/2021 21:26:47 - INFO - __main__ - Step 1223: {'lr': 0.0003055, 'samples': 234816, 'steps': 1222, 'loss/train': 4.144630432128906} 11/06/2021 21:26:47 - INFO - __main__ - Step 1224: {'lr': 0.00030575000000000003, 'samples': 235008, 'steps': 1223, 'loss/train': 3.8937008380889893} 11/06/2021 21:26:48 - INFO - __main__ - Step 1225: {'lr': 0.000306, 'samples': 235200, 'steps': 1224, 'loss/train': 3.7765586376190186} 11/06/2021 21:26:48 - INFO - __main__ - Step 1226: {'lr': 0.00030625000000000004, 'samples': 235392, 'steps': 1225, 'loss/train': 4.3001017570495605} 11/06/2021 21:26:49 - INFO - __main__ - Step 1227: {'lr': 0.0003065, 'samples': 235584, 'steps': 1226, 'loss/train': 3.861295461654663} 11/06/2021 21:26:49 - INFO - __main__ - Step 1228: {'lr': 0.00030675, 'samples': 235776, 'steps': 1227, 'loss/train': 4.244755268096924} 11/06/2021 21:26:50 - INFO - __main__ - Step 1229: {'lr': 0.000307, 'samples': 235968, 'steps': 1228, 'loss/train': 4.392697811126709} 11/06/2021 21:26:50 - INFO - __main__ - Step 1230: {'lr': 0.00030725, 'samples': 236160, 'steps': 1229, 'loss/train': 4.003592014312744} 11/06/2021 21:26:51 - INFO - __main__ - Step 1231: {'lr': 0.0003075, 'samples': 236352, 'steps': 1230, 'loss/train': 3.7558765411376953} 11/06/2021 21:26:51 - INFO - __main__ - Step 1232: {'lr': 0.00030775, 'samples': 236544, 'steps': 1231, 'loss/train': 3.535252571105957} 11/06/2021 21:26:52 - INFO - __main__ - Step 1233: {'lr': 0.000308, 'samples': 236736, 'steps': 1232, 'loss/train': 3.4438316822052} 11/06/2021 21:26:52 - INFO - __main__ - Step 1234: {'lr': 0.00030825000000000004, 'samples': 236928, 'steps': 1233, 'loss/train': 3.6062018871307373} 11/06/2021 21:26:52 - INFO - __main__ - Step 1235: {'lr': 0.0003085, 'samples': 237120, 'steps': 1234, 'loss/train': 3.9174911975860596} 11/06/2021 21:26:53 - INFO - __main__ - Step 1236: {'lr': 0.00030875000000000005, 'samples': 237312, 'steps': 1235, 'loss/train': 3.6126482486724854} 11/06/2021 21:26:54 - INFO - __main__ - Step 1237: {'lr': 0.00030900000000000003, 'samples': 237504, 'steps': 1236, 'loss/train': 3.4603700637817383} 11/06/2021 21:26:54 - INFO - __main__ - Step 1238: {'lr': 0.00030925, 'samples': 237696, 'steps': 1237, 'loss/train': 5.033356666564941} 11/06/2021 21:26:54 - INFO - __main__ - Step 1239: {'lr': 0.0003095, 'samples': 237888, 'steps': 1238, 'loss/train': 3.9721944332122803} 11/06/2021 21:26:55 - INFO - __main__ - Step 1240: {'lr': 0.00030975, 'samples': 238080, 'steps': 1239, 'loss/train': 3.999577045440674} 11/06/2021 21:26:56 - INFO - __main__ - Step 1241: {'lr': 0.00031, 'samples': 238272, 'steps': 1240, 'loss/train': 3.221419334411621} 11/06/2021 21:26:56 - INFO - __main__ - Step 1242: {'lr': 0.00031025000000000003, 'samples': 238464, 'steps': 1241, 'loss/train': 3.553842067718506} 11/06/2021 21:26:57 - INFO - __main__ - Step 1243: {'lr': 0.0003105, 'samples': 238656, 'steps': 1242, 'loss/train': 3.4750494956970215} 11/06/2021 21:26:57 - INFO - __main__ - Step 1244: {'lr': 0.00031075000000000005, 'samples': 238848, 'steps': 1243, 'loss/train': 3.7925238609313965} 11/06/2021 21:26:57 - INFO - __main__ - Step 1245: {'lr': 0.000311, 'samples': 239040, 'steps': 1244, 'loss/train': 4.293489456176758} 11/06/2021 21:26:58 - INFO - __main__ - Step 1246: {'lr': 0.00031125000000000006, 'samples': 239232, 'steps': 1245, 'loss/train': 3.7371480464935303} 11/06/2021 21:26:59 - INFO - __main__ - Step 1247: {'lr': 0.0003115, 'samples': 239424, 'steps': 1246, 'loss/train': 4.00302791595459} 11/06/2021 21:26:59 - INFO - __main__ - Step 1248: {'lr': 0.00031175, 'samples': 239616, 'steps': 1247, 'loss/train': 3.7047793865203857} 11/06/2021 21:27:00 - INFO - __main__ - Step 1249: {'lr': 0.000312, 'samples': 239808, 'steps': 1248, 'loss/train': 4.603279113769531} 11/06/2021 21:27:00 - INFO - __main__ - Step 1250: {'lr': 0.00031225000000000003, 'samples': 240000, 'steps': 1249, 'loss/train': 3.6074068546295166} 11/06/2021 21:27:00 - INFO - __main__ - Step 1251: {'lr': 0.0003125, 'samples': 240192, 'steps': 1250, 'loss/train': 4.296557903289795} 11/06/2021 21:27:01 - INFO - __main__ - Step 1252: {'lr': 0.00031275, 'samples': 240384, 'steps': 1251, 'loss/train': 3.7983367443084717} 11/06/2021 21:27:02 - INFO - __main__ - Step 1253: {'lr': 0.000313, 'samples': 240576, 'steps': 1252, 'loss/train': 3.8672046661376953} 11/06/2021 21:27:02 - INFO - __main__ - Step 1254: {'lr': 0.00031325, 'samples': 240768, 'steps': 1253, 'loss/train': 3.8182613849639893} 11/06/2021 21:27:02 - INFO - __main__ - Step 1255: {'lr': 0.00031350000000000003, 'samples': 240960, 'steps': 1254, 'loss/train': 3.425907850265503} 11/06/2021 21:27:03 - INFO - __main__ - Step 1256: {'lr': 0.00031374999999999996, 'samples': 241152, 'steps': 1255, 'loss/train': 3.5726277828216553} 11/06/2021 21:27:04 - INFO - __main__ - Step 1257: {'lr': 0.000314, 'samples': 241344, 'steps': 1256, 'loss/train': 3.928713083267212} 11/06/2021 21:27:04 - INFO - __main__ - Step 1258: {'lr': 0.00031424999999999997, 'samples': 241536, 'steps': 1257, 'loss/train': 3.986309289932251} 11/06/2021 21:27:04 - INFO - __main__ - Step 1259: {'lr': 0.0003145, 'samples': 241728, 'steps': 1258, 'loss/train': 4.367678642272949} 11/06/2021 21:27:05 - INFO - __main__ - Step 1260: {'lr': 0.00031475, 'samples': 241920, 'steps': 1259, 'loss/train': 3.995647430419922} 11/06/2021 21:27:05 - INFO - __main__ - Step 1261: {'lr': 0.000315, 'samples': 242112, 'steps': 1260, 'loss/train': 3.605346441268921} 11/06/2021 21:27:06 - INFO - __main__ - Step 1262: {'lr': 0.00031525, 'samples': 242304, 'steps': 1261, 'loss/train': 3.96653413772583} 11/06/2021 21:27:06 - INFO - __main__ - Step 1263: {'lr': 0.0003155, 'samples': 242496, 'steps': 1262, 'loss/train': 3.9042797088623047} 11/06/2021 21:27:07 - INFO - __main__ - Step 1264: {'lr': 0.00031575, 'samples': 242688, 'steps': 1263, 'loss/train': 3.5482776165008545} 11/06/2021 21:27:07 - INFO - __main__ - Step 1265: {'lr': 0.000316, 'samples': 242880, 'steps': 1264, 'loss/train': 3.473890781402588} 11/06/2021 21:27:07 - INFO - __main__ - Step 1266: {'lr': 0.00031624999999999996, 'samples': 243072, 'steps': 1265, 'loss/train': 3.6147239208221436} 11/06/2021 21:27:08 - INFO - __main__ - Step 1267: {'lr': 0.0003165, 'samples': 243264, 'steps': 1266, 'loss/train': 3.760866403579712} 11/06/2021 21:27:09 - INFO - __main__ - Step 1268: {'lr': 0.00031675, 'samples': 243456, 'steps': 1267, 'loss/train': 3.5120933055877686} 11/06/2021 21:27:10 - INFO - __main__ - Step 1269: {'lr': 0.000317, 'samples': 243648, 'steps': 1268, 'loss/train': 4.119743824005127} 11/06/2021 21:27:10 - INFO - __main__ - Step 1270: {'lr': 0.00031725, 'samples': 243840, 'steps': 1269, 'loss/train': 4.16604471206665} 11/06/2021 21:27:10 - INFO - __main__ - Step 1271: {'lr': 0.0003175, 'samples': 244032, 'steps': 1270, 'loss/train': 3.718264102935791} 11/06/2021 21:27:11 - INFO - __main__ - Step 1272: {'lr': 0.00031775, 'samples': 244224, 'steps': 1271, 'loss/train': 3.804915428161621} 11/06/2021 21:27:12 - INFO - __main__ - Step 1273: {'lr': 0.00031800000000000003, 'samples': 244416, 'steps': 1272, 'loss/train': 3.632591724395752} 11/06/2021 21:27:12 - INFO - __main__ - Step 1274: {'lr': 0.00031825, 'samples': 244608, 'steps': 1273, 'loss/train': 3.5990989208221436} 11/06/2021 21:27:12 - INFO - __main__ - Step 1275: {'lr': 0.0003185, 'samples': 244800, 'steps': 1274, 'loss/train': 3.5454981327056885} 11/06/2021 21:27:13 - INFO - __main__ - Step 1276: {'lr': 0.00031874999999999997, 'samples': 244992, 'steps': 1275, 'loss/train': 2.8243398666381836} 11/06/2021 21:27:13 - INFO - __main__ - Step 1277: {'lr': 0.000319, 'samples': 245184, 'steps': 1276, 'loss/train': 4.374372959136963} 11/06/2021 21:27:14 - INFO - __main__ - Step 1278: {'lr': 0.00031925, 'samples': 245376, 'steps': 1277, 'loss/train': 2.880706548690796} 11/06/2021 21:27:15 - INFO - __main__ - Step 1279: {'lr': 0.0003195, 'samples': 245568, 'steps': 1278, 'loss/train': 3.749918222427368} 11/06/2021 21:27:15 - INFO - __main__ - Step 1280: {'lr': 0.00031975, 'samples': 245760, 'steps': 1279, 'loss/train': 3.91780686378479} 11/06/2021 21:27:15 - INFO - __main__ - Step 1281: {'lr': 0.00032, 'samples': 245952, 'steps': 1280, 'loss/train': 3.8567497730255127} 11/06/2021 21:27:16 - INFO - __main__ - Step 1282: {'lr': 0.00032025, 'samples': 246144, 'steps': 1281, 'loss/train': 2.492274761199951} 11/06/2021 21:27:17 - INFO - __main__ - Step 1283: {'lr': 0.00032050000000000004, 'samples': 246336, 'steps': 1282, 'loss/train': 4.012140274047852} 11/06/2021 21:27:17 - INFO - __main__ - Step 1284: {'lr': 0.00032074999999999996, 'samples': 246528, 'steps': 1283, 'loss/train': 3.700061559677124} 11/06/2021 21:27:17 - INFO - __main__ - Step 1285: {'lr': 0.000321, 'samples': 246720, 'steps': 1284, 'loss/train': 4.031924724578857} 11/06/2021 21:27:18 - INFO - __main__ - Step 1286: {'lr': 0.00032125, 'samples': 246912, 'steps': 1285, 'loss/train': 4.252817153930664} 11/06/2021 21:27:18 - INFO - __main__ - Step 1287: {'lr': 0.0003215, 'samples': 247104, 'steps': 1286, 'loss/train': 3.6342227458953857} 11/06/2021 21:27:19 - INFO - __main__ - Step 1288: {'lr': 0.00032175, 'samples': 247296, 'steps': 1287, 'loss/train': 3.6176106929779053} 11/06/2021 21:27:20 - INFO - __main__ - Step 1289: {'lr': 0.000322, 'samples': 247488, 'steps': 1288, 'loss/train': 5.5985236167907715} 11/06/2021 21:27:20 - INFO - __main__ - Step 1290: {'lr': 0.00032225, 'samples': 247680, 'steps': 1289, 'loss/train': 3.5907435417175293} 11/06/2021 21:27:20 - INFO - __main__ - Step 1291: {'lr': 0.00032250000000000003, 'samples': 247872, 'steps': 1290, 'loss/train': 3.7204911708831787} 11/06/2021 21:27:21 - INFO - __main__ - Step 1292: {'lr': 0.00032275, 'samples': 248064, 'steps': 1291, 'loss/train': 3.9456405639648438} 11/06/2021 21:27:22 - INFO - __main__ - Step 1293: {'lr': 0.000323, 'samples': 248256, 'steps': 1292, 'loss/train': 3.7843210697174072} 11/06/2021 21:27:22 - INFO - __main__ - Step 1294: {'lr': 0.00032324999999999997, 'samples': 248448, 'steps': 1293, 'loss/train': 3.915496349334717} 11/06/2021 21:27:22 - INFO - __main__ - Step 1295: {'lr': 0.0003235, 'samples': 248640, 'steps': 1294, 'loss/train': 4.2259440422058105} 11/06/2021 21:27:23 - INFO - __main__ - Step 1296: {'lr': 0.00032375, 'samples': 248832, 'steps': 1295, 'loss/train': 3.61068058013916} 11/06/2021 21:27:23 - INFO - __main__ - Step 1297: {'lr': 0.000324, 'samples': 249024, 'steps': 1296, 'loss/train': 3.682305097579956} 11/06/2021 21:27:24 - INFO - __main__ - Step 1298: {'lr': 0.00032425, 'samples': 249216, 'steps': 1297, 'loss/train': 3.750075340270996} 11/06/2021 21:27:25 - INFO - __main__ - Step 1299: {'lr': 0.00032450000000000003, 'samples': 249408, 'steps': 1298, 'loss/train': 3.9268178939819336} 11/06/2021 21:27:25 - INFO - __main__ - Step 1300: {'lr': 0.00032475, 'samples': 249600, 'steps': 1299, 'loss/train': 3.8040266036987305} 11/06/2021 21:27:25 - INFO - __main__ - Step 1301: {'lr': 0.00032500000000000004, 'samples': 249792, 'steps': 1300, 'loss/train': 3.4303956031799316} 11/06/2021 21:27:26 - INFO - __main__ - Step 1302: {'lr': 0.00032524999999999996, 'samples': 249984, 'steps': 1301, 'loss/train': 4.067983150482178} 11/06/2021 21:27:27 - INFO - __main__ - Step 1303: {'lr': 0.0003255, 'samples': 250176, 'steps': 1302, 'loss/train': 3.6090753078460693} 11/06/2021 21:27:27 - INFO - __main__ - Step 1304: {'lr': 0.00032575, 'samples': 250368, 'steps': 1303, 'loss/train': 3.846041440963745} 11/06/2021 21:27:27 - INFO - __main__ - Step 1305: {'lr': 0.000326, 'samples': 250560, 'steps': 1304, 'loss/train': 4.3484272956848145} 11/06/2021 21:27:28 - INFO - __main__ - Step 1306: {'lr': 0.00032625, 'samples': 250752, 'steps': 1305, 'loss/train': 3.714219331741333} 11/06/2021 21:27:28 - INFO - __main__ - Step 1307: {'lr': 0.0003265, 'samples': 250944, 'steps': 1306, 'loss/train': 3.670234441757202} 11/06/2021 21:27:29 - INFO - __main__ - Step 1308: {'lr': 0.00032675, 'samples': 251136, 'steps': 1307, 'loss/train': 4.2095441818237305} 11/06/2021 21:27:29 - INFO - __main__ - Step 1309: {'lr': 0.00032700000000000003, 'samples': 251328, 'steps': 1308, 'loss/train': 3.6904191970825195} 11/06/2021 21:27:30 - INFO - __main__ - Step 1310: {'lr': 0.00032725, 'samples': 251520, 'steps': 1309, 'loss/train': 3.5426836013793945} 11/06/2021 21:27:30 - INFO - __main__ - Step 1311: {'lr': 0.00032750000000000005, 'samples': 251712, 'steps': 1310, 'loss/train': 3.8077216148376465} 11/06/2021 21:27:31 - INFO - __main__ - Step 1312: {'lr': 0.00032774999999999997, 'samples': 251904, 'steps': 1311, 'loss/train': 3.6772751808166504} 11/06/2021 21:27:31 - INFO - __main__ - Step 1313: {'lr': 0.000328, 'samples': 252096, 'steps': 1312, 'loss/train': 4.283506393432617} 11/06/2021 21:27:32 - INFO - __main__ - Step 1314: {'lr': 0.00032825, 'samples': 252288, 'steps': 1313, 'loss/train': 3.820636510848999} 11/06/2021 21:27:32 - INFO - __main__ - Step 1315: {'lr': 0.0003285, 'samples': 252480, 'steps': 1314, 'loss/train': 3.645063877105713} 11/06/2021 21:27:33 - INFO - __main__ - Step 1316: {'lr': 0.00032875, 'samples': 252672, 'steps': 1315, 'loss/train': 3.96279239654541} 11/06/2021 21:27:33 - INFO - __main__ - Step 1317: {'lr': 0.00032900000000000003, 'samples': 252864, 'steps': 1316, 'loss/train': 3.848276138305664} 11/06/2021 21:27:33 - INFO - __main__ - Step 1318: {'lr': 0.00032925, 'samples': 253056, 'steps': 1317, 'loss/train': 3.3637754917144775} 11/06/2021 21:27:34 - INFO - __main__ - Step 1319: {'lr': 0.00032950000000000004, 'samples': 253248, 'steps': 1318, 'loss/train': 3.951634168624878} 11/06/2021 21:27:35 - INFO - __main__ - Step 1320: {'lr': 0.00032975, 'samples': 253440, 'steps': 1319, 'loss/train': 4.035619735717773} 11/06/2021 21:27:35 - INFO - __main__ - Step 1321: {'lr': 0.00033, 'samples': 253632, 'steps': 1320, 'loss/train': 3.623622179031372} 11/06/2021 21:27:35 - INFO - __main__ - Step 1322: {'lr': 0.00033025, 'samples': 253824, 'steps': 1321, 'loss/train': 4.259491920471191} 11/06/2021 21:27:36 - INFO - __main__ - Step 1323: {'lr': 0.0003305, 'samples': 254016, 'steps': 1322, 'loss/train': 3.655144214630127} 11/06/2021 21:27:37 - INFO - __main__ - Step 1324: {'lr': 0.00033075, 'samples': 254208, 'steps': 1323, 'loss/train': 3.6705875396728516} 11/06/2021 21:27:37 - INFO - __main__ - Step 1325: {'lr': 0.000331, 'samples': 254400, 'steps': 1324, 'loss/train': 3.493577480316162} 11/06/2021 21:27:38 - INFO - __main__ - Step 1326: {'lr': 0.00033125, 'samples': 254592, 'steps': 1325, 'loss/train': 3.0190889835357666} 11/06/2021 21:27:38 - INFO - __main__ - Step 1327: {'lr': 0.00033150000000000003, 'samples': 254784, 'steps': 1326, 'loss/train': 4.079080104827881} 11/06/2021 21:27:38 - INFO - __main__ - Step 1328: {'lr': 0.00033175, 'samples': 254976, 'steps': 1327, 'loss/train': 3.815504550933838} 11/06/2021 21:27:39 - INFO - __main__ - Step 1329: {'lr': 0.00033200000000000005, 'samples': 255168, 'steps': 1328, 'loss/train': 3.4887773990631104} 11/06/2021 21:27:40 - INFO - __main__ - Step 1330: {'lr': 0.00033224999999999997, 'samples': 255360, 'steps': 1329, 'loss/train': 3.8788294792175293} 11/06/2021 21:27:40 - INFO - __main__ - Step 1331: {'lr': 0.0003325, 'samples': 255552, 'steps': 1330, 'loss/train': 3.5784215927124023} 11/06/2021 21:27:40 - INFO - __main__ - Step 1332: {'lr': 0.00033275, 'samples': 255744, 'steps': 1331, 'loss/train': 3.404329299926758} 11/06/2021 21:27:41 - INFO - __main__ - Step 1333: {'lr': 0.000333, 'samples': 255936, 'steps': 1332, 'loss/train': 3.8690433502197266} 11/06/2021 21:27:42 - INFO - __main__ - Step 1334: {'lr': 0.00033325, 'samples': 256128, 'steps': 1333, 'loss/train': 3.4080007076263428} 11/06/2021 21:27:42 - INFO - __main__ - Step 1335: {'lr': 0.00033350000000000003, 'samples': 256320, 'steps': 1334, 'loss/train': 3.692984104156494} 11/06/2021 21:27:42 - INFO - __main__ - Step 1336: {'lr': 0.00033375, 'samples': 256512, 'steps': 1335, 'loss/train': 3.570852756500244} 11/06/2021 21:27:43 - INFO - __main__ - Step 1337: {'lr': 0.00033400000000000004, 'samples': 256704, 'steps': 1336, 'loss/train': 3.4716129302978516} 11/06/2021 21:27:43 - INFO - __main__ - Step 1338: {'lr': 0.00033425, 'samples': 256896, 'steps': 1337, 'loss/train': 3.4797582626342773} 11/06/2021 21:27:44 - INFO - __main__ - Step 1339: {'lr': 0.00033450000000000005, 'samples': 257088, 'steps': 1338, 'loss/train': 3.386270046234131} 11/06/2021 21:27:45 - INFO - __main__ - Step 1340: {'lr': 0.00033475, 'samples': 257280, 'steps': 1339, 'loss/train': 4.1568217277526855} 11/06/2021 21:27:45 - INFO - __main__ - Step 1341: {'lr': 0.000335, 'samples': 257472, 'steps': 1340, 'loss/train': 3.2953333854675293} 11/06/2021 21:27:45 - INFO - __main__ - Step 1342: {'lr': 0.00033525, 'samples': 257664, 'steps': 1341, 'loss/train': 3.355146646499634} 11/06/2021 21:27:46 - INFO - __main__ - Step 1343: {'lr': 0.0003355, 'samples': 257856, 'steps': 1342, 'loss/train': 3.234477996826172} 11/06/2021 21:27:47 - INFO - __main__ - Step 1344: {'lr': 0.00033575, 'samples': 258048, 'steps': 1343, 'loss/train': 3.457892656326294} 11/06/2021 21:27:47 - INFO - __main__ - Step 1345: {'lr': 0.00033600000000000004, 'samples': 258240, 'steps': 1344, 'loss/train': 3.948150634765625} 11/06/2021 21:27:47 - INFO - __main__ - Step 1346: {'lr': 0.00033625, 'samples': 258432, 'steps': 1345, 'loss/train': 3.7880961894989014} 11/06/2021 21:27:48 - INFO - __main__ - Step 1347: {'lr': 0.00033650000000000005, 'samples': 258624, 'steps': 1346, 'loss/train': 3.310671329498291} 11/06/2021 21:27:48 - INFO - __main__ - Step 1348: {'lr': 0.00033675, 'samples': 258816, 'steps': 1347, 'loss/train': 3.0229480266571045} 11/06/2021 21:27:49 - INFO - __main__ - Step 1349: {'lr': 0.000337, 'samples': 259008, 'steps': 1348, 'loss/train': 3.778203010559082} 11/06/2021 21:27:50 - INFO - __main__ - Step 1350: {'lr': 0.00033725, 'samples': 259200, 'steps': 1349, 'loss/train': 3.488584280014038} 11/06/2021 21:27:50 - INFO - __main__ - Step 1351: {'lr': 0.0003375, 'samples': 259392, 'steps': 1350, 'loss/train': 2.763584613800049} 11/06/2021 21:27:50 - INFO - __main__ - Step 1352: {'lr': 0.00033775, 'samples': 259584, 'steps': 1351, 'loss/train': 3.3598763942718506} 11/06/2021 21:27:51 - INFO - __main__ - Step 1353: {'lr': 0.00033800000000000003, 'samples': 259776, 'steps': 1352, 'loss/train': 3.7228636741638184} 11/06/2021 21:27:51 - INFO - __main__ - Step 1354: {'lr': 0.00033825, 'samples': 259968, 'steps': 1353, 'loss/train': 4.054864883422852} 11/06/2021 21:27:52 - INFO - __main__ - Step 1355: {'lr': 0.00033850000000000004, 'samples': 260160, 'steps': 1354, 'loss/train': 3.6283679008483887} 11/06/2021 21:27:53 - INFO - __main__ - Step 1356: {'lr': 0.00033875, 'samples': 260352, 'steps': 1355, 'loss/train': 3.3796229362487793} 11/06/2021 21:27:53 - INFO - __main__ - Step 1357: {'lr': 0.00033900000000000005, 'samples': 260544, 'steps': 1356, 'loss/train': 3.939152717590332} 11/06/2021 21:27:53 - INFO - __main__ - Step 1358: {'lr': 0.00033925, 'samples': 260736, 'steps': 1357, 'loss/train': 4.031463623046875} 11/06/2021 21:27:54 - INFO - __main__ - Step 1359: {'lr': 0.0003395, 'samples': 260928, 'steps': 1358, 'loss/train': 4.39647912979126} 11/06/2021 21:27:55 - INFO - __main__ - Step 1360: {'lr': 0.00033975, 'samples': 261120, 'steps': 1359, 'loss/train': 3.659208059310913} 11/06/2021 21:27:55 - INFO - __main__ - Step 1361: {'lr': 0.00034, 'samples': 261312, 'steps': 1360, 'loss/train': 4.465490341186523} 11/06/2021 21:27:55 - INFO - __main__ - Step 1362: {'lr': 0.00034025, 'samples': 261504, 'steps': 1361, 'loss/train': 4.4638543128967285} 11/06/2021 21:27:56 - INFO - __main__ - Step 1363: {'lr': 0.00034050000000000004, 'samples': 261696, 'steps': 1362, 'loss/train': 4.332038402557373} 11/06/2021 21:27:56 - INFO - __main__ - Step 1364: {'lr': 0.00034075, 'samples': 261888, 'steps': 1363, 'loss/train': 4.257739543914795} 11/06/2021 21:27:57 - INFO - __main__ - Step 1365: {'lr': 0.00034100000000000005, 'samples': 262080, 'steps': 1364, 'loss/train': 3.7311654090881348} 11/06/2021 21:27:57 - INFO - __main__ - Step 1366: {'lr': 0.00034125000000000003, 'samples': 262272, 'steps': 1365, 'loss/train': 3.6005234718322754} 11/06/2021 21:27:58 - INFO - __main__ - Step 1367: {'lr': 0.0003415, 'samples': 262464, 'steps': 1366, 'loss/train': 3.586486339569092} 11/06/2021 21:27:58 - INFO - __main__ - Step 1368: {'lr': 0.00034175, 'samples': 262656, 'steps': 1367, 'loss/train': 2.7847213745117188} 11/06/2021 21:27:59 - INFO - __main__ - Step 1369: {'lr': 0.000342, 'samples': 262848, 'steps': 1368, 'loss/train': 3.146930456161499} 11/06/2021 21:28:00 - INFO - __main__ - Step 1370: {'lr': 0.00034225, 'samples': 263040, 'steps': 1369, 'loss/train': 3.4413974285125732} 11/06/2021 21:28:00 - INFO - __main__ - Step 1371: {'lr': 0.00034250000000000003, 'samples': 263232, 'steps': 1370, 'loss/train': 3.6233880519866943} 11/06/2021 21:28:00 - INFO - __main__ - Step 1372: {'lr': 0.00034275, 'samples': 263424, 'steps': 1371, 'loss/train': 3.721611261367798} 11/06/2021 21:28:01 - INFO - __main__ - Step 1373: {'lr': 0.00034300000000000004, 'samples': 263616, 'steps': 1372, 'loss/train': 4.222573757171631} 11/06/2021 21:28:01 - INFO - __main__ - Step 1374: {'lr': 0.00034325, 'samples': 263808, 'steps': 1373, 'loss/train': 3.599992513656616} 11/06/2021 21:28:02 - INFO - __main__ - Step 1375: {'lr': 0.00034350000000000006, 'samples': 264000, 'steps': 1374, 'loss/train': 3.8981473445892334} 11/06/2021 21:28:03 - INFO - __main__ - Step 1376: {'lr': 0.00034375, 'samples': 264192, 'steps': 1375, 'loss/train': 3.6449296474456787} 11/06/2021 21:28:03 - INFO - __main__ - Step 1377: {'lr': 0.00034399999999999996, 'samples': 264384, 'steps': 1376, 'loss/train': 3.2894580364227295} 11/06/2021 21:28:03 - INFO - __main__ - Step 1378: {'lr': 0.00034425, 'samples': 264576, 'steps': 1377, 'loss/train': 3.513908624649048} 11/06/2021 21:28:04 - INFO - __main__ - Step 1379: {'lr': 0.00034449999999999997, 'samples': 264768, 'steps': 1378, 'loss/train': 4.0787034034729} 11/06/2021 21:28:05 - INFO - __main__ - Step 1380: {'lr': 0.00034475, 'samples': 264960, 'steps': 1379, 'loss/train': 3.7418301105499268} 11/06/2021 21:28:05 - INFO - __main__ - Step 1381: {'lr': 0.000345, 'samples': 265152, 'steps': 1380, 'loss/train': 3.700737237930298} 11/06/2021 21:28:05 - INFO - __main__ - Step 1382: {'lr': 0.00034525, 'samples': 265344, 'steps': 1381, 'loss/train': 3.3428897857666016} 11/06/2021 21:28:06 - INFO - __main__ - Step 1383: {'lr': 0.0003455, 'samples': 265536, 'steps': 1382, 'loss/train': 3.495915651321411} 11/06/2021 21:28:06 - INFO - __main__ - Step 1384: {'lr': 0.00034575000000000003, 'samples': 265728, 'steps': 1383, 'loss/train': 3.5448079109191895} 11/06/2021 21:28:07 - INFO - __main__ - Step 1385: {'lr': 0.000346, 'samples': 265920, 'steps': 1384, 'loss/train': 3.691251277923584} 11/06/2021 21:28:07 - INFO - __main__ - Step 1386: {'lr': 0.00034625, 'samples': 266112, 'steps': 1385, 'loss/train': 2.9062933921813965} 11/06/2021 21:28:08 - INFO - __main__ - Step 1387: {'lr': 0.00034649999999999997, 'samples': 266304, 'steps': 1386, 'loss/train': 3.5462396144866943} 11/06/2021 21:28:08 - INFO - __main__ - Step 1388: {'lr': 0.00034675, 'samples': 266496, 'steps': 1387, 'loss/train': 5.252135276794434} 11/06/2021 21:28:08 - INFO - __main__ - Step 1389: {'lr': 0.000347, 'samples': 266688, 'steps': 1388, 'loss/train': 3.7084007263183594} 11/06/2021 21:28:09 - INFO - __main__ - Step 1390: {'lr': 0.00034725, 'samples': 266880, 'steps': 1389, 'loss/train': 4.000763893127441} 11/06/2021 21:28:10 - INFO - __main__ - Step 1391: {'lr': 0.0003475, 'samples': 267072, 'steps': 1390, 'loss/train': 3.437042474746704} 11/06/2021 21:28:10 - INFO - __main__ - Step 1392: {'lr': 0.00034775, 'samples': 267264, 'steps': 1391, 'loss/train': 3.971050977706909} 11/06/2021 21:28:11 - INFO - __main__ - Step 1393: {'lr': 0.000348, 'samples': 267456, 'steps': 1392, 'loss/train': 3.698169231414795} 11/06/2021 21:28:11 - INFO - __main__ - Step 1394: {'lr': 0.00034825000000000004, 'samples': 267648, 'steps': 1393, 'loss/train': 3.2756340503692627} 11/06/2021 21:28:11 - INFO - __main__ - Step 1395: {'lr': 0.00034849999999999996, 'samples': 267840, 'steps': 1394, 'loss/train': 3.3742105960845947} 11/06/2021 21:28:12 - INFO - __main__ - Step 1396: {'lr': 0.00034875, 'samples': 268032, 'steps': 1395, 'loss/train': 3.8026742935180664} 11/06/2021 21:28:13 - INFO - __main__ - Step 1397: {'lr': 0.00034899999999999997, 'samples': 268224, 'steps': 1396, 'loss/train': 3.521596908569336} 11/06/2021 21:28:13 - INFO - __main__ - Step 1398: {'lr': 0.00034925, 'samples': 268416, 'steps': 1397, 'loss/train': 3.4449994564056396} 11/06/2021 21:28:13 - INFO - __main__ - Step 1399: {'lr': 0.0003495, 'samples': 268608, 'steps': 1398, 'loss/train': 3.146643877029419} 11/06/2021 21:28:14 - INFO - __main__ - Step 1400: {'lr': 0.00034975, 'samples': 268800, 'steps': 1399, 'loss/train': 3.3726940155029297} 11/06/2021 21:28:15 - INFO - __main__ - Step 1401: {'lr': 0.00035, 'samples': 268992, 'steps': 1400, 'loss/train': 3.4211158752441406} 11/06/2021 21:28:15 - INFO - __main__ - Step 1402: {'lr': 0.00035025000000000003, 'samples': 269184, 'steps': 1401, 'loss/train': 3.7196311950683594} 11/06/2021 21:28:15 - INFO - __main__ - Step 1403: {'lr': 0.0003505, 'samples': 269376, 'steps': 1402, 'loss/train': 3.6825990676879883} 11/06/2021 21:28:16 - INFO - __main__ - Step 1404: {'lr': 0.00035075, 'samples': 269568, 'steps': 1403, 'loss/train': 3.5246872901916504} 11/06/2021 21:28:16 - INFO - __main__ - Step 1405: {'lr': 0.00035099999999999997, 'samples': 269760, 'steps': 1404, 'loss/train': 3.948554277420044} 11/06/2021 21:28:17 - INFO - __main__ - Step 1406: {'lr': 0.00035125, 'samples': 269952, 'steps': 1405, 'loss/train': 3.4808645248413086} 11/06/2021 21:28:18 - INFO - __main__ - Step 1407: {'lr': 0.0003515, 'samples': 270144, 'steps': 1406, 'loss/train': 3.6043059825897217} 11/06/2021 21:28:18 - INFO - __main__ - Step 1408: {'lr': 0.00035175, 'samples': 270336, 'steps': 1407, 'loss/train': 3.91089129447937} 11/06/2021 21:28:18 - INFO - __main__ - Step 1409: {'lr': 0.000352, 'samples': 270528, 'steps': 1408, 'loss/train': 3.544276237487793} 11/06/2021 21:28:19 - INFO - __main__ - Step 1410: {'lr': 0.00035225, 'samples': 270720, 'steps': 1409, 'loss/train': 3.8686203956604004} 11/06/2021 21:28:20 - INFO - __main__ - Step 1411: {'lr': 0.0003525, 'samples': 270912, 'steps': 1410, 'loss/train': 3.5301620960235596} 11/06/2021 21:28:20 - INFO - __main__ - Step 1412: {'lr': 0.00035275000000000004, 'samples': 271104, 'steps': 1411, 'loss/train': 3.6756720542907715} 11/06/2021 21:28:21 - INFO - __main__ - Step 1413: {'lr': 0.00035299999999999996, 'samples': 271296, 'steps': 1412, 'loss/train': 3.3864593505859375} 11/06/2021 21:28:21 - INFO - __main__ - Step 1414: {'lr': 0.00035325, 'samples': 271488, 'steps': 1413, 'loss/train': 3.0657331943511963} 11/06/2021 21:28:21 - INFO - __main__ - Step 1415: {'lr': 0.0003535, 'samples': 271680, 'steps': 1414, 'loss/train': 3.454667091369629} 11/06/2021 21:28:22 - INFO - __main__ - Step 1416: {'lr': 0.00035375, 'samples': 271872, 'steps': 1415, 'loss/train': 3.242163896560669} 11/06/2021 21:28:23 - INFO - __main__ - Step 1417: {'lr': 0.000354, 'samples': 272064, 'steps': 1416, 'loss/train': 3.9992828369140625} 11/06/2021 21:28:23 - INFO - __main__ - Step 1418: {'lr': 0.00035425, 'samples': 272256, 'steps': 1417, 'loss/train': 3.688415050506592} 11/06/2021 21:28:23 - INFO - __main__ - Step 1419: {'lr': 0.0003545, 'samples': 272448, 'steps': 1418, 'loss/train': 3.7804338932037354} 11/06/2021 21:28:24 - INFO - __main__ - Step 1420: {'lr': 0.00035475000000000003, 'samples': 272640, 'steps': 1419, 'loss/train': 3.729741096496582} 11/06/2021 21:28:25 - INFO - __main__ - Step 1421: {'lr': 0.000355, 'samples': 272832, 'steps': 1420, 'loss/train': 4.303407192230225} 11/06/2021 21:28:25 - INFO - __main__ - Step 1422: {'lr': 0.00035525000000000004, 'samples': 273024, 'steps': 1421, 'loss/train': 3.4556620121002197} 11/06/2021 21:28:26 - INFO - __main__ - Step 1423: {'lr': 0.00035549999999999997, 'samples': 273216, 'steps': 1422, 'loss/train': 2.991054058074951} 11/06/2021 21:28:26 - INFO - __main__ - Step 1424: {'lr': 0.00035575, 'samples': 273408, 'steps': 1423, 'loss/train': 4.185265064239502} 11/06/2021 21:28:26 - INFO - __main__ - Step 1425: {'lr': 0.000356, 'samples': 273600, 'steps': 1424, 'loss/train': 3.529508590698242} 11/06/2021 21:28:27 - INFO - __main__ - Step 1426: {'lr': 0.00035625, 'samples': 273792, 'steps': 1425, 'loss/train': 3.394829034805298} 11/06/2021 21:28:28 - INFO - __main__ - Step 1427: {'lr': 0.0003565, 'samples': 273984, 'steps': 1426, 'loss/train': 4.052621364593506} 11/06/2021 21:28:28 - INFO - __main__ - Step 1428: {'lr': 0.00035675, 'samples': 274176, 'steps': 1427, 'loss/train': 3.0512208938598633} 11/06/2021 21:28:28 - INFO - __main__ - Step 1429: {'lr': 0.000357, 'samples': 274368, 'steps': 1428, 'loss/train': 3.3234691619873047} 11/06/2021 21:28:29 - INFO - __main__ - Step 1430: {'lr': 0.00035725000000000004, 'samples': 274560, 'steps': 1429, 'loss/train': 3.588322162628174} 11/06/2021 21:28:29 - INFO - __main__ - Step 1431: {'lr': 0.0003575, 'samples': 274752, 'steps': 1430, 'loss/train': 3.39493727684021} 11/06/2021 21:28:30 - INFO - __main__ - Step 1432: {'lr': 0.00035775, 'samples': 274944, 'steps': 1431, 'loss/train': 2.880502939224243} 11/06/2021 21:28:30 - INFO - __main__ - Step 1433: {'lr': 0.000358, 'samples': 275136, 'steps': 1432, 'loss/train': 3.7747838497161865} 11/06/2021 21:28:31 - INFO - __main__ - Step 1434: {'lr': 0.00035825, 'samples': 275328, 'steps': 1433, 'loss/train': 3.3060061931610107} 11/06/2021 21:28:31 - INFO - __main__ - Step 1435: {'lr': 0.0003585, 'samples': 275520, 'steps': 1434, 'loss/train': 3.7018392086029053} 11/06/2021 21:28:32 - INFO - __main__ - Step 1436: {'lr': 0.00035875, 'samples': 275712, 'steps': 1435, 'loss/train': 3.0778818130493164} 11/06/2021 21:28:33 - INFO - __main__ - Step 1437: {'lr': 0.000359, 'samples': 275904, 'steps': 1436, 'loss/train': 3.736198902130127} 11/06/2021 21:28:33 - INFO - __main__ - Step 1438: {'lr': 0.00035925000000000003, 'samples': 276096, 'steps': 1437, 'loss/train': 3.045379877090454} 11/06/2021 21:28:33 - INFO - __main__ - Step 1439: {'lr': 0.0003595, 'samples': 276288, 'steps': 1438, 'loss/train': 3.695364475250244} 11/06/2021 21:28:34 - INFO - __main__ - Step 1440: {'lr': 0.00035975000000000004, 'samples': 276480, 'steps': 1439, 'loss/train': 3.711172342300415} 11/06/2021 21:28:34 - INFO - __main__ - Step 1441: {'lr': 0.00035999999999999997, 'samples': 276672, 'steps': 1440, 'loss/train': 2.9761195182800293} 11/06/2021 21:28:35 - INFO - __main__ - Step 1442: {'lr': 0.00036025, 'samples': 276864, 'steps': 1441, 'loss/train': 3.2470874786376953} 11/06/2021 21:28:35 - INFO - __main__ - Step 1443: {'lr': 0.0003605, 'samples': 277056, 'steps': 1442, 'loss/train': 3.0125174522399902} 11/06/2021 21:28:36 - INFO - __main__ - Step 1444: {'lr': 0.00036075, 'samples': 277248, 'steps': 1443, 'loss/train': 3.5066580772399902} 11/06/2021 21:28:36 - INFO - __main__ - Step 1445: {'lr': 0.000361, 'samples': 277440, 'steps': 1444, 'loss/train': 3.245635747909546} 11/06/2021 21:28:36 - INFO - __main__ - Step 1446: {'lr': 0.00036125, 'samples': 277632, 'steps': 1445, 'loss/train': 4.3313374519348145} 11/06/2021 21:28:38 - INFO - __main__ - Step 1447: {'lr': 0.0003615, 'samples': 277824, 'steps': 1446, 'loss/train': 3.90049409866333} 11/06/2021 21:28:38 - INFO - __main__ - Step 1448: {'lr': 0.00036175000000000004, 'samples': 278016, 'steps': 1447, 'loss/train': 2.849242687225342} 11/06/2021 21:28:38 - INFO - __main__ - Step 1449: {'lr': 0.000362, 'samples': 278208, 'steps': 1448, 'loss/train': 3.8985612392425537} 11/06/2021 21:28:39 - INFO - __main__ - Step 1450: {'lr': 0.00036225000000000005, 'samples': 278400, 'steps': 1449, 'loss/train': 3.1428816318511963} 11/06/2021 21:28:39 - INFO - __main__ - Step 1451: {'lr': 0.0003625, 'samples': 278592, 'steps': 1450, 'loss/train': 3.8840200901031494} 11/06/2021 21:28:39 - INFO - __main__ - Step 1452: {'lr': 0.00036275, 'samples': 278784, 'steps': 1451, 'loss/train': 3.16904354095459} 11/06/2021 21:28:40 - INFO - __main__ - Step 1453: {'lr': 0.000363, 'samples': 278976, 'steps': 1452, 'loss/train': 2.8679397106170654} 11/06/2021 21:28:41 - INFO - __main__ - Step 1454: {'lr': 0.00036325, 'samples': 279168, 'steps': 1453, 'loss/train': 3.2666327953338623} 11/06/2021 21:28:41 - INFO - __main__ - Step 1455: {'lr': 0.0003635, 'samples': 279360, 'steps': 1454, 'loss/train': 3.257840871810913} 11/06/2021 21:28:41 - INFO - __main__ - Step 1456: {'lr': 0.00036375000000000003, 'samples': 279552, 'steps': 1455, 'loss/train': 3.4981942176818848} 11/06/2021 21:28:42 - INFO - __main__ - Step 1457: {'lr': 0.000364, 'samples': 279744, 'steps': 1456, 'loss/train': 3.474823236465454} 11/06/2021 21:28:43 - INFO - __main__ - Step 1458: {'lr': 0.00036425000000000004, 'samples': 279936, 'steps': 1457, 'loss/train': 3.6593263149261475} 11/06/2021 21:28:43 - INFO - __main__ - Step 1459: {'lr': 0.0003645, 'samples': 280128, 'steps': 1458, 'loss/train': 3.4789998531341553} 11/06/2021 21:28:43 - INFO - __main__ - Step 1460: {'lr': 0.00036475, 'samples': 280320, 'steps': 1459, 'loss/train': 3.415144920349121} 11/06/2021 21:28:44 - INFO - __main__ - Step 1461: {'lr': 0.000365, 'samples': 280512, 'steps': 1460, 'loss/train': 3.892705202102661} 11/06/2021 21:28:44 - INFO - __main__ - Step 1462: {'lr': 0.00036525, 'samples': 280704, 'steps': 1461, 'loss/train': 3.5692012310028076} 11/06/2021 21:28:45 - INFO - __main__ - Step 1463: {'lr': 0.0003655, 'samples': 280896, 'steps': 1462, 'loss/train': 3.964817762374878} 11/06/2021 21:28:46 - INFO - __main__ - Step 1464: {'lr': 0.00036575, 'samples': 281088, 'steps': 1463, 'loss/train': 3.4829154014587402} 11/06/2021 21:28:46 - INFO - __main__ - Step 1465: {'lr': 0.000366, 'samples': 281280, 'steps': 1464, 'loss/train': 3.2892184257507324} 11/06/2021 21:28:46 - INFO - __main__ - Step 1466: {'lr': 0.00036625000000000004, 'samples': 281472, 'steps': 1465, 'loss/train': 3.158153772354126} 11/06/2021 21:28:47 - INFO - __main__ - Step 1467: {'lr': 0.0003665, 'samples': 281664, 'steps': 1466, 'loss/train': 3.4137485027313232} 11/06/2021 21:28:48 - INFO - __main__ - Step 1468: {'lr': 0.00036675000000000005, 'samples': 281856, 'steps': 1467, 'loss/train': 3.1349167823791504} 11/06/2021 21:28:48 - INFO - __main__ - Step 1469: {'lr': 0.000367, 'samples': 282048, 'steps': 1468, 'loss/train': 2.7978010177612305} 11/06/2021 21:28:48 - INFO - __main__ - Step 1470: {'lr': 0.00036725, 'samples': 282240, 'steps': 1469, 'loss/train': 3.418675184249878} 11/06/2021 21:28:49 - INFO - __main__ - Step 1471: {'lr': 0.0003675, 'samples': 282432, 'steps': 1470, 'loss/train': 3.3369407653808594} 11/06/2021 21:28:49 - INFO - __main__ - Step 1472: {'lr': 0.00036775, 'samples': 282624, 'steps': 1471, 'loss/train': 3.0756335258483887} 11/06/2021 21:28:50 - INFO - __main__ - Step 1473: {'lr': 0.000368, 'samples': 282816, 'steps': 1472, 'loss/train': 3.3924009799957275} 11/06/2021 21:28:51 - INFO - __main__ - Step 1474: {'lr': 0.00036825000000000003, 'samples': 283008, 'steps': 1473, 'loss/train': 3.3130877017974854} 11/06/2021 21:28:51 - INFO - __main__ - Step 1475: {'lr': 0.0003685, 'samples': 283200, 'steps': 1474, 'loss/train': 2.7034976482391357} 11/06/2021 21:28:51 - INFO - __main__ - Step 1476: {'lr': 0.00036875000000000005, 'samples': 283392, 'steps': 1475, 'loss/train': 2.9324753284454346} 11/06/2021 21:28:52 - INFO - __main__ - Step 1477: {'lr': 0.000369, 'samples': 283584, 'steps': 1476, 'loss/train': 3.632042407989502} 11/06/2021 21:28:53 - INFO - __main__ - Step 1478: {'lr': 0.00036925, 'samples': 283776, 'steps': 1477, 'loss/train': 3.7734642028808594} 11/06/2021 21:28:54 - INFO - __main__ - Step 1479: {'lr': 0.0003695, 'samples': 283968, 'steps': 1478, 'loss/train': 3.285372018814087} 11/06/2021 21:28:54 - INFO - __main__ - Step 1480: {'lr': 0.00036975, 'samples': 284160, 'steps': 1479, 'loss/train': 3.5449750423431396} 11/06/2021 21:28:54 - INFO - __main__ - Step 1481: {'lr': 0.00037, 'samples': 284352, 'steps': 1480, 'loss/train': 3.1665492057800293} 11/06/2021 21:28:55 - INFO - __main__ - Step 1482: {'lr': 0.00037025000000000003, 'samples': 284544, 'steps': 1481, 'loss/train': 2.68510103225708} 11/06/2021 21:28:55 - INFO - __main__ - Step 1483: {'lr': 0.0003705, 'samples': 284736, 'steps': 1482, 'loss/train': 3.09177303314209} 11/06/2021 21:28:56 - INFO - __main__ - Step 1484: {'lr': 0.00037075000000000004, 'samples': 284928, 'steps': 1483, 'loss/train': 0.934282124042511} 11/06/2021 21:28:57 - INFO - __main__ - Step 1485: {'lr': 0.000371, 'samples': 285120, 'steps': 1484, 'loss/train': 0.7881891131401062} 11/06/2021 21:28:57 - INFO - __main__ - Step 1486: {'lr': 0.00037125000000000005, 'samples': 285312, 'steps': 1485, 'loss/train': 3.3031082153320312} 11/06/2021 21:28:58 - INFO - __main__ - Step 1487: {'lr': 0.00037150000000000003, 'samples': 285504, 'steps': 1486, 'loss/train': 3.4887430667877197} 11/06/2021 21:28:58 - INFO - __main__ - Step 1488: {'lr': 0.00037175, 'samples': 285696, 'steps': 1487, 'loss/train': 3.392094850540161} 11/06/2021 21:28:58 - INFO - __main__ - Step 1489: {'lr': 0.000372, 'samples': 285888, 'steps': 1488, 'loss/train': 3.5077946186065674} 11/06/2021 21:28:59 - INFO - __main__ - Step 1490: {'lr': 0.00037225, 'samples': 286080, 'steps': 1489, 'loss/train': 3.9329774379730225} 11/06/2021 21:29:00 - INFO - __main__ - Step 1491: {'lr': 0.0003725, 'samples': 286272, 'steps': 1490, 'loss/train': 3.5324740409851074} 11/06/2021 21:29:00 - INFO - __main__ - Step 1492: {'lr': 0.00037275000000000003, 'samples': 286464, 'steps': 1491, 'loss/train': 2.4463398456573486} 11/06/2021 21:29:00 - INFO - __main__ - Step 1493: {'lr': 0.000373, 'samples': 286656, 'steps': 1492, 'loss/train': 3.127577066421509} 11/06/2021 21:29:01 - INFO - __main__ - Step 1494: {'lr': 0.00037325000000000005, 'samples': 286848, 'steps': 1493, 'loss/train': 1.6627174615859985} 11/06/2021 21:29:03 - INFO - __main__ - Step 1495: {'lr': 0.0003735, 'samples': 287040, 'steps': 1494, 'loss/train': 3.5371575355529785} 11/06/2021 21:29:03 - INFO - __main__ - Step 1496: {'lr': 0.00037375000000000006, 'samples': 287232, 'steps': 1495, 'loss/train': 3.178628921508789} 11/06/2021 21:29:03 - INFO - __main__ - Step 1497: {'lr': 0.000374, 'samples': 287424, 'steps': 1496, 'loss/train': 2.7111098766326904} 11/06/2021 21:29:04 - INFO - __main__ - Step 1498: {'lr': 0.00037425, 'samples': 287616, 'steps': 1497, 'loss/train': 4.202212810516357} 11/06/2021 21:29:04 - INFO - __main__ - Step 1499: {'lr': 0.0003745, 'samples': 287808, 'steps': 1498, 'loss/train': 3.379865884780884} 11/06/2021 21:29:04 - INFO - __main__ - Step 1500: {'lr': 0.00037475000000000003, 'samples': 288000, 'steps': 1499, 'loss/train': 3.4952762126922607} 11/06/2021 21:29:05 - INFO - __main__ - Step 1501: {'lr': 0.000375, 'samples': 288192, 'steps': 1500, 'loss/train': 4.174196720123291} 11/06/2021 21:29:06 - INFO - __main__ - Step 1502: {'lr': 0.00037525, 'samples': 288384, 'steps': 1501, 'loss/train': 3.272667169570923} 11/06/2021 21:29:06 - INFO - __main__ - Step 1503: {'lr': 0.0003755, 'samples': 288576, 'steps': 1502, 'loss/train': 2.8478939533233643} 11/06/2021 21:29:07 - INFO - __main__ - Step 1504: {'lr': 0.00037575, 'samples': 288768, 'steps': 1503, 'loss/train': 4.3909077644348145} 11/06/2021 21:29:07 - INFO - __main__ - Step 1505: {'lr': 0.00037600000000000003, 'samples': 288960, 'steps': 1504, 'loss/train': 3.602169990539551} 11/06/2021 21:29:08 - INFO - __main__ - Step 1506: {'lr': 0.00037624999999999996, 'samples': 289152, 'steps': 1505, 'loss/train': 3.1311495304107666} 11/06/2021 21:29:08 - INFO - __main__ - Step 1507: {'lr': 0.0003765, 'samples': 289344, 'steps': 1506, 'loss/train': 2.88991379737854} 11/06/2021 21:29:09 - INFO - __main__ - Step 1508: {'lr': 0.00037674999999999997, 'samples': 289536, 'steps': 1507, 'loss/train': 2.8797428607940674} 11/06/2021 21:29:09 - INFO - __main__ - Step 1509: {'lr': 0.000377, 'samples': 289728, 'steps': 1508, 'loss/train': 3.3940863609313965} 11/06/2021 21:29:09 - INFO - __main__ - Step 1510: {'lr': 0.00037725, 'samples': 289920, 'steps': 1509, 'loss/train': 3.036484479904175} 11/06/2021 21:29:10 - INFO - __main__ - Step 1511: {'lr': 0.0003775, 'samples': 290112, 'steps': 1510, 'loss/train': 4.253868579864502} 11/06/2021 21:29:11 - INFO - __main__ - Step 1512: {'lr': 0.00037775, 'samples': 290304, 'steps': 1511, 'loss/train': 3.5372188091278076} 11/06/2021 21:29:11 - INFO - __main__ - Step 1513: {'lr': 0.000378, 'samples': 290496, 'steps': 1512, 'loss/train': 3.0633351802825928} 11/06/2021 21:29:12 - INFO - __main__ - Step 1514: {'lr': 0.00037825, 'samples': 290688, 'steps': 1513, 'loss/train': 3.378868818283081} 11/06/2021 21:29:12 - INFO - __main__ - Step 1515: {'lr': 0.0003785, 'samples': 290880, 'steps': 1514, 'loss/train': 4.058987140655518} 11/06/2021 21:29:12 - INFO - __main__ - Step 1516: {'lr': 0.00037874999999999996, 'samples': 291072, 'steps': 1515, 'loss/train': 3.8939054012298584} 11/06/2021 21:29:14 - INFO - __main__ - Step 1517: {'lr': 0.000379, 'samples': 291264, 'steps': 1516, 'loss/train': 3.405064344406128} 11/06/2021 21:29:15 - INFO - __main__ - Step 1518: {'lr': 0.00037925, 'samples': 291456, 'steps': 1517, 'loss/train': 3.7268097400665283} 11/06/2021 21:29:15 - INFO - __main__ - Step 1519: {'lr': 0.0003795, 'samples': 291648, 'steps': 1518, 'loss/train': 3.949538469314575} 11/06/2021 21:29:16 - INFO - __main__ - Step 1520: {'lr': 0.00037975, 'samples': 291840, 'steps': 1519, 'loss/train': 2.9000167846679688} 11/06/2021 21:29:16 - INFO - __main__ - Step 1521: {'lr': 0.00038, 'samples': 292032, 'steps': 1520, 'loss/train': 1.4307708740234375} 11/06/2021 21:29:16 - INFO - __main__ - Step 1522: {'lr': 0.00038025, 'samples': 292224, 'steps': 1521, 'loss/train': 1.3860046863555908} 11/06/2021 21:29:17 - INFO - __main__ - Step 1523: {'lr': 0.00038050000000000003, 'samples': 292416, 'steps': 1522, 'loss/train': 1.1130292415618896} 11/06/2021 21:29:18 - INFO - __main__ - Step 1524: {'lr': 0.00038075, 'samples': 292608, 'steps': 1523, 'loss/train': 3.4155771732330322} 11/06/2021 21:29:18 - INFO - __main__ - Step 1525: {'lr': 0.000381, 'samples': 292800, 'steps': 1524, 'loss/train': 3.0278680324554443} 11/06/2021 21:29:19 - INFO - __main__ - Step 1526: {'lr': 0.00038124999999999997, 'samples': 292992, 'steps': 1525, 'loss/train': 3.4204211235046387} 11/06/2021 21:29:19 - INFO - __main__ - Step 1527: {'lr': 0.0003815, 'samples': 293184, 'steps': 1526, 'loss/train': 2.968132972717285} 11/06/2021 21:29:19 - INFO - __main__ - Step 1528: {'lr': 0.00038175, 'samples': 293376, 'steps': 1527, 'loss/train': 3.8879196643829346} 11/06/2021 21:29:20 - INFO - __main__ - Step 1529: {'lr': 0.000382, 'samples': 293568, 'steps': 1528, 'loss/train': 3.570026159286499} 11/06/2021 21:29:21 - INFO - __main__ - Step 1530: {'lr': 0.00038225, 'samples': 293760, 'steps': 1529, 'loss/train': 3.1837944984436035} 11/06/2021 21:29:21 - INFO - __main__ - Step 1531: {'lr': 0.00038250000000000003, 'samples': 293952, 'steps': 1530, 'loss/train': 2.6540393829345703} 11/06/2021 21:29:22 - INFO - __main__ - Step 1532: {'lr': 0.00038275, 'samples': 294144, 'steps': 1531, 'loss/train': 2.824615478515625} 11/06/2021 21:29:22 - INFO - __main__ - Step 1533: {'lr': 0.00038300000000000004, 'samples': 294336, 'steps': 1532, 'loss/train': 2.8562939167022705} 11/06/2021 21:29:23 - INFO - __main__ - Step 1534: {'lr': 0.00038324999999999996, 'samples': 294528, 'steps': 1533, 'loss/train': 3.2858169078826904} 11/06/2021 21:29:23 - INFO - __main__ - Step 1535: {'lr': 0.0003835, 'samples': 294720, 'steps': 1534, 'loss/train': 3.5091545581817627} 11/06/2021 21:29:24 - INFO - __main__ - Step 1536: {'lr': 0.00038375, 'samples': 294912, 'steps': 1535, 'loss/train': 3.2510337829589844} 11/06/2021 21:29:24 - INFO - __main__ - Step 1537: {'lr': 0.000384, 'samples': 295104, 'steps': 1536, 'loss/train': 3.5380523204803467} 11/06/2021 21:29:24 - INFO - __main__ - Step 1538: {'lr': 0.00038425, 'samples': 295296, 'steps': 1537, 'loss/train': 2.9871280193328857} 11/06/2021 21:29:25 - INFO - __main__ - Step 1539: {'lr': 0.0003845, 'samples': 295488, 'steps': 1538, 'loss/train': 3.5664308071136475} 11/06/2021 21:29:26 - INFO - __main__ - Step 1540: {'lr': 0.00038475, 'samples': 295680, 'steps': 1539, 'loss/train': 3.1113128662109375} 11/06/2021 21:29:26 - INFO - __main__ - Step 1541: {'lr': 0.00038500000000000003, 'samples': 295872, 'steps': 1540, 'loss/train': 4.263197422027588} 11/06/2021 21:29:26 - INFO - __main__ - Step 1542: {'lr': 0.00038525, 'samples': 296064, 'steps': 1541, 'loss/train': 3.313586711883545} 11/06/2021 21:29:27 - INFO - __main__ - Step 1543: {'lr': 0.0003855, 'samples': 296256, 'steps': 1542, 'loss/train': 3.427462100982666} 11/06/2021 21:29:27 - INFO - __main__ - Step 1544: {'lr': 0.00038574999999999997, 'samples': 296448, 'steps': 1543, 'loss/train': 4.165167808532715} 11/06/2021 21:29:28 - INFO - __main__ - Step 1545: {'lr': 0.000386, 'samples': 296640, 'steps': 1544, 'loss/train': 3.6539011001586914} 11/06/2021 21:29:29 - INFO - __main__ - Step 1546: {'lr': 0.00038625, 'samples': 296832, 'steps': 1545, 'loss/train': 3.510122060775757} 11/06/2021 21:29:29 - INFO - __main__ - Step 1547: {'lr': 0.0003865, 'samples': 297024, 'steps': 1546, 'loss/train': 3.2956809997558594} 11/06/2021 21:29:29 - INFO - __main__ - Step 1548: {'lr': 0.00038675, 'samples': 297216, 'steps': 1547, 'loss/train': 3.2712979316711426} 11/06/2021 21:29:30 - INFO - __main__ - Step 1549: {'lr': 0.00038700000000000003, 'samples': 297408, 'steps': 1548, 'loss/train': 2.8422179222106934} 11/06/2021 21:29:31 - INFO - __main__ - Step 1550: {'lr': 0.00038725, 'samples': 297600, 'steps': 1549, 'loss/train': 3.2148244380950928} 11/06/2021 21:29:31 - INFO - __main__ - Step 1551: {'lr': 0.00038750000000000004, 'samples': 297792, 'steps': 1550, 'loss/train': 3.082228422164917} 11/06/2021 21:29:31 - INFO - __main__ - Step 1552: {'lr': 0.00038774999999999997, 'samples': 297984, 'steps': 1551, 'loss/train': 3.0857651233673096} 11/06/2021 21:29:32 - INFO - __main__ - Step 1553: {'lr': 0.000388, 'samples': 298176, 'steps': 1552, 'loss/train': 3.537327289581299} 11/06/2021 21:29:32 - INFO - __main__ - Step 1554: {'lr': 0.00038825, 'samples': 298368, 'steps': 1553, 'loss/train': 3.118391275405884} 11/06/2021 21:29:33 - INFO - __main__ - Step 1555: {'lr': 0.0003885, 'samples': 298560, 'steps': 1554, 'loss/train': 3.2976584434509277} 11/06/2021 21:29:34 - INFO - __main__ - Step 1556: {'lr': 0.00038875, 'samples': 298752, 'steps': 1555, 'loss/train': 3.208622694015503} 11/06/2021 21:29:34 - INFO - __main__ - Step 1557: {'lr': 0.000389, 'samples': 298944, 'steps': 1556, 'loss/train': 2.393932580947876} 11/06/2021 21:29:35 - INFO - __main__ - Step 1558: {'lr': 0.00038925, 'samples': 299136, 'steps': 1557, 'loss/train': 3.1180825233459473} 11/06/2021 21:29:35 - INFO - __main__ - Step 1559: {'lr': 0.00038950000000000003, 'samples': 299328, 'steps': 1558, 'loss/train': 3.363607406616211} 11/06/2021 21:29:35 - INFO - __main__ - Step 1560: {'lr': 0.00038975, 'samples': 299520, 'steps': 1559, 'loss/train': 3.7669551372528076} 11/06/2021 21:29:36 - INFO - __main__ - Step 1561: {'lr': 0.00039000000000000005, 'samples': 299712, 'steps': 1560, 'loss/train': 2.959444284439087} 11/06/2021 21:29:37 - INFO - __main__ - Step 1562: {'lr': 0.00039024999999999997, 'samples': 299904, 'steps': 1561, 'loss/train': 3.453453779220581} 11/06/2021 21:29:37 - INFO - __main__ - Step 1563: {'lr': 0.0003905, 'samples': 300096, 'steps': 1562, 'loss/train': 3.2474799156188965} 11/06/2021 21:29:37 - INFO - __main__ - Step 1564: {'lr': 0.00039075, 'samples': 300288, 'steps': 1563, 'loss/train': 3.6809487342834473} 11/06/2021 21:29:38 - INFO - __main__ - Step 1565: {'lr': 0.000391, 'samples': 300480, 'steps': 1564, 'loss/train': 3.5540361404418945} 11/06/2021 21:29:39 - INFO - __main__ - Step 1566: {'lr': 0.00039125, 'samples': 300672, 'steps': 1565, 'loss/train': 3.8347136974334717} 11/06/2021 21:29:39 - INFO - __main__ - Step 1567: {'lr': 0.00039150000000000003, 'samples': 300864, 'steps': 1566, 'loss/train': 3.0421507358551025} 11/06/2021 21:29:40 - INFO - __main__ - Step 1568: {'lr': 0.00039175, 'samples': 301056, 'steps': 1567, 'loss/train': 2.8751606941223145} 11/06/2021 21:29:40 - INFO - __main__ - Step 1569: {'lr': 0.00039200000000000004, 'samples': 301248, 'steps': 1568, 'loss/train': 2.795197010040283} 11/06/2021 21:29:40 - INFO - __main__ - Step 1570: {'lr': 0.00039225, 'samples': 301440, 'steps': 1569, 'loss/train': 2.370975971221924} 11/06/2021 21:29:41 - INFO - __main__ - Step 1571: {'lr': 0.0003925, 'samples': 301632, 'steps': 1570, 'loss/train': 2.498239040374756} 11/06/2021 21:29:42 - INFO - __main__ - Step 1572: {'lr': 0.00039275, 'samples': 301824, 'steps': 1571, 'loss/train': 1.6988508701324463} 11/06/2021 21:29:42 - INFO - __main__ - Step 1573: {'lr': 0.000393, 'samples': 302016, 'steps': 1572, 'loss/train': 4.021992206573486} 11/06/2021 21:29:42 - INFO - __main__ - Step 1574: {'lr': 0.00039325, 'samples': 302208, 'steps': 1573, 'loss/train': 2.898738145828247} 11/06/2021 21:29:43 - INFO - __main__ - Step 1575: {'lr': 0.0003935, 'samples': 302400, 'steps': 1574, 'loss/train': 2.607938289642334} 11/06/2021 21:29:43 - INFO - __main__ - Step 1576: {'lr': 0.00039375, 'samples': 302592, 'steps': 1575, 'loss/train': 3.263978958129883} 11/06/2021 21:29:44 - INFO - __main__ - Step 1577: {'lr': 0.00039400000000000004, 'samples': 302784, 'steps': 1576, 'loss/train': 2.8440113067626953} 11/06/2021 21:29:44 - INFO - __main__ - Step 1578: {'lr': 0.00039425, 'samples': 302976, 'steps': 1577, 'loss/train': 3.6558189392089844} 11/06/2021 21:29:45 - INFO - __main__ - Step 1579: {'lr': 0.00039450000000000005, 'samples': 303168, 'steps': 1578, 'loss/train': 3.0387959480285645} 11/06/2021 21:29:45 - INFO - __main__ - Step 1580: {'lr': 0.00039474999999999997, 'samples': 303360, 'steps': 1579, 'loss/train': 3.356081008911133} 11/06/2021 21:29:46 - INFO - __main__ - Step 1581: {'lr': 0.000395, 'samples': 303552, 'steps': 1580, 'loss/train': 2.985703706741333} 11/06/2021 21:29:47 - INFO - __main__ - Step 1582: {'lr': 0.00039525, 'samples': 303744, 'steps': 1581, 'loss/train': 3.263591766357422} 11/06/2021 21:29:47 - INFO - __main__ - Step 1583: {'lr': 0.0003955, 'samples': 303936, 'steps': 1582, 'loss/train': 3.0408103466033936} 11/06/2021 21:29:47 - INFO - __main__ - Step 1584: {'lr': 0.00039575, 'samples': 304128, 'steps': 1583, 'loss/train': 3.310321807861328} 11/06/2021 21:29:48 - INFO - __main__ - Step 1585: {'lr': 0.00039600000000000003, 'samples': 304320, 'steps': 1584, 'loss/train': 2.9202144145965576} 11/06/2021 21:29:48 - INFO - __main__ - Step 1586: {'lr': 0.00039625, 'samples': 304512, 'steps': 1585, 'loss/train': 3.2440686225891113} 11/06/2021 21:29:49 - INFO - __main__ - Step 1587: {'lr': 0.00039650000000000004, 'samples': 304704, 'steps': 1586, 'loss/train': 3.232225179672241} 11/06/2021 21:29:49 - INFO - __main__ - Step 1588: {'lr': 0.00039675, 'samples': 304896, 'steps': 1587, 'loss/train': 3.2099974155426025} 11/06/2021 21:29:50 - INFO - __main__ - Step 1589: {'lr': 0.00039700000000000005, 'samples': 305088, 'steps': 1588, 'loss/train': 2.729403018951416} 11/06/2021 21:29:50 - INFO - __main__ - Step 1590: {'lr': 0.00039725, 'samples': 305280, 'steps': 1589, 'loss/train': 2.7173776626586914} 11/06/2021 21:29:50 - INFO - __main__ - Step 1591: {'lr': 0.0003975, 'samples': 305472, 'steps': 1590, 'loss/train': 3.206320285797119} 11/06/2021 21:29:51 - INFO - __main__ - Step 1592: {'lr': 0.00039775, 'samples': 305664, 'steps': 1591, 'loss/train': 2.313783884048462} 11/06/2021 21:29:52 - INFO - __main__ - Step 1593: {'lr': 0.000398, 'samples': 305856, 'steps': 1592, 'loss/train': 3.254606008529663} 11/06/2021 21:29:52 - INFO - __main__ - Step 1594: {'lr': 0.00039825, 'samples': 306048, 'steps': 1593, 'loss/train': 3.374396324157715} 11/06/2021 21:29:52 - INFO - __main__ - Step 1595: {'lr': 0.00039850000000000004, 'samples': 306240, 'steps': 1594, 'loss/train': 3.533506155014038} 11/06/2021 21:29:53 - INFO - __main__ - Step 1596: {'lr': 0.00039875, 'samples': 306432, 'steps': 1595, 'loss/train': 2.543515205383301} 11/06/2021 21:29:54 - INFO - __main__ - Step 1597: {'lr': 0.00039900000000000005, 'samples': 306624, 'steps': 1596, 'loss/train': 2.980104684829712} 11/06/2021 21:29:54 - INFO - __main__ - Step 1598: {'lr': 0.00039925000000000003, 'samples': 306816, 'steps': 1597, 'loss/train': 2.649993896484375} 11/06/2021 21:29:55 - INFO - __main__ - Step 1599: {'lr': 0.0003995, 'samples': 307008, 'steps': 1598, 'loss/train': 2.8795385360717773} 11/06/2021 21:29:55 - INFO - __main__ - Step 1600: {'lr': 0.00039975, 'samples': 307200, 'steps': 1599, 'loss/train': 3.1416678428649902} 11/06/2021 21:29:55 - INFO - __main__ - Step 1601: {'lr': 0.0004, 'samples': 307392, 'steps': 1600, 'loss/train': 2.955704689025879} 11/06/2021 21:29:56 - INFO - __main__ - Step 1602: {'lr': 0.00040025, 'samples': 307584, 'steps': 1601, 'loss/train': 3.7033560276031494} 11/06/2021 21:29:57 - INFO - __main__ - Step 1603: {'lr': 0.00040050000000000003, 'samples': 307776, 'steps': 1602, 'loss/train': 2.7389466762542725} 11/06/2021 21:29:57 - INFO - __main__ - Step 1604: {'lr': 0.00040075, 'samples': 307968, 'steps': 1603, 'loss/train': 3.5183396339416504} 11/06/2021 21:29:58 - INFO - __main__ - Step 1605: {'lr': 0.00040100000000000004, 'samples': 308160, 'steps': 1604, 'loss/train': 3.134627103805542} 11/06/2021 21:29:58 - INFO - __main__ - Step 1606: {'lr': 0.00040125, 'samples': 308352, 'steps': 1605, 'loss/train': 2.952859878540039} 11/06/2021 21:29:58 - INFO - __main__ - Step 1607: {'lr': 0.00040150000000000006, 'samples': 308544, 'steps': 1606, 'loss/train': 3.224421977996826} 11/06/2021 21:29:59 - INFO - __main__ - Step 1608: {'lr': 0.00040175, 'samples': 308736, 'steps': 1607, 'loss/train': 2.4461114406585693} 11/06/2021 21:30:00 - INFO - __main__ - Step 1609: {'lr': 0.000402, 'samples': 308928, 'steps': 1608, 'loss/train': 3.342259645462036} 11/06/2021 21:30:00 - INFO - __main__ - Step 1610: {'lr': 0.00040225, 'samples': 309120, 'steps': 1609, 'loss/train': 3.2026968002319336} 11/06/2021 21:30:00 - INFO - __main__ - Step 1611: {'lr': 0.0004025, 'samples': 309312, 'steps': 1610, 'loss/train': 3.4691827297210693} 11/06/2021 21:30:01 - INFO - __main__ - Step 1612: {'lr': 0.00040275, 'samples': 309504, 'steps': 1611, 'loss/train': 3.279484987258911} 11/06/2021 21:30:02 - INFO - __main__ - Step 1613: {'lr': 0.00040300000000000004, 'samples': 309696, 'steps': 1612, 'loss/train': 3.016733407974243} 11/06/2021 21:30:02 - INFO - __main__ - Step 1614: {'lr': 0.00040325, 'samples': 309888, 'steps': 1613, 'loss/train': 3.192936897277832} 11/06/2021 21:30:02 - INFO - __main__ - Step 1615: {'lr': 0.00040350000000000005, 'samples': 310080, 'steps': 1614, 'loss/train': 3.0827951431274414} 11/06/2021 21:30:03 - INFO - __main__ - Step 1616: {'lr': 0.00040375000000000003, 'samples': 310272, 'steps': 1615, 'loss/train': 2.6593739986419678} 11/06/2021 21:30:03 - INFO - __main__ - Step 1617: {'lr': 0.000404, 'samples': 310464, 'steps': 1616, 'loss/train': 2.6806726455688477} 11/06/2021 21:30:04 - INFO - __main__ - Step 1618: {'lr': 0.00040425, 'samples': 310656, 'steps': 1617, 'loss/train': 2.476854085922241} 11/06/2021 21:30:04 - INFO - __main__ - Step 1619: {'lr': 0.0004045, 'samples': 310848, 'steps': 1618, 'loss/train': 3.110135793685913} 11/06/2021 21:30:05 - INFO - __main__ - Step 1620: {'lr': 0.00040475, 'samples': 311040, 'steps': 1619, 'loss/train': 2.908262014389038} 11/06/2021 21:30:05 - INFO - __main__ - Step 1621: {'lr': 0.00040500000000000003, 'samples': 311232, 'steps': 1620, 'loss/train': 3.027345657348633} 11/06/2021 21:30:05 - INFO - __main__ - Step 1622: {'lr': 0.00040525, 'samples': 311424, 'steps': 1621, 'loss/train': 3.2688331604003906} 11/06/2021 21:30:07 - INFO - __main__ - Step 1623: {'lr': 0.00040550000000000004, 'samples': 311616, 'steps': 1622, 'loss/train': 2.6070456504821777} 11/06/2021 21:30:07 - INFO - __main__ - Step 1624: {'lr': 0.00040575, 'samples': 311808, 'steps': 1623, 'loss/train': 3.40415620803833} 11/06/2021 21:30:07 - INFO - __main__ - Step 1625: {'lr': 0.00040600000000000006, 'samples': 312000, 'steps': 1624, 'loss/train': 3.013209819793701} 11/06/2021 21:30:08 - INFO - __main__ - Step 1626: {'lr': 0.00040625000000000004, 'samples': 312192, 'steps': 1625, 'loss/train': 2.7747411727905273} 11/06/2021 21:30:08 - INFO - __main__ - Step 1627: {'lr': 0.00040649999999999996, 'samples': 312384, 'steps': 1626, 'loss/train': 2.614873170852661} 11/06/2021 21:30:09 - INFO - __main__ - Step 1628: {'lr': 0.00040675, 'samples': 312576, 'steps': 1627, 'loss/train': 2.9833312034606934} 11/06/2021 21:30:09 - INFO - __main__ - Step 1629: {'lr': 0.00040699999999999997, 'samples': 312768, 'steps': 1628, 'loss/train': 3.6013989448547363} 11/06/2021 21:30:10 - INFO - __main__ - Step 1630: {'lr': 0.00040725, 'samples': 312960, 'steps': 1629, 'loss/train': 3.32792067527771} 11/06/2021 21:30:10 - INFO - __main__ - Step 1631: {'lr': 0.0004075, 'samples': 313152, 'steps': 1630, 'loss/train': 2.2372703552246094} 11/06/2021 21:30:10 - INFO - __main__ - Step 1632: {'lr': 0.00040775, 'samples': 313344, 'steps': 1631, 'loss/train': 3.2227723598480225} 11/06/2021 21:30:11 - INFO - __main__ - Step 1633: {'lr': 0.000408, 'samples': 313536, 'steps': 1632, 'loss/train': 2.9736287593841553} 11/06/2021 21:30:12 - INFO - __main__ - Step 1634: {'lr': 0.00040825000000000003, 'samples': 313728, 'steps': 1633, 'loss/train': 3.227055072784424} 11/06/2021 21:30:12 - INFO - __main__ - Step 1635: {'lr': 0.0004085, 'samples': 313920, 'steps': 1634, 'loss/train': 4.195490837097168} 11/06/2021 21:30:13 - INFO - __main__ - Step 1636: {'lr': 0.00040875, 'samples': 314112, 'steps': 1635, 'loss/train': 2.5357518196105957} 11/06/2021 21:30:13 - INFO - __main__ - Step 1637: {'lr': 0.00040899999999999997, 'samples': 314304, 'steps': 1636, 'loss/train': 3.0355074405670166} 11/06/2021 21:30:13 - INFO - __main__ - Step 1638: {'lr': 0.00040925, 'samples': 314496, 'steps': 1637, 'loss/train': 3.301243305206299} 11/06/2021 21:30:14 - INFO - __main__ - Step 1639: {'lr': 0.0004095, 'samples': 314688, 'steps': 1638, 'loss/train': 3.4538841247558594} 11/06/2021 21:30:15 - INFO - __main__ - Step 1640: {'lr': 0.00040975, 'samples': 314880, 'steps': 1639, 'loss/train': 3.0793862342834473} 11/06/2021 21:30:15 - INFO - __main__ - Step 1641: {'lr': 0.00041, 'samples': 315072, 'steps': 1640, 'loss/train': 3.042006492614746} 11/06/2021 21:30:15 - INFO - __main__ - Step 1642: {'lr': 0.00041025, 'samples': 315264, 'steps': 1641, 'loss/train': 2.085822105407715} 11/06/2021 21:30:16 - INFO - __main__ - Step 1643: {'lr': 0.0004105, 'samples': 315456, 'steps': 1642, 'loss/train': 3.430644989013672} 11/06/2021 21:30:17 - INFO - __main__ - Step 1644: {'lr': 0.00041075000000000004, 'samples': 315648, 'steps': 1643, 'loss/train': 2.801948308944702} 11/06/2021 21:30:17 - INFO - __main__ - Step 1645: {'lr': 0.00041099999999999996, 'samples': 315840, 'steps': 1644, 'loss/train': 2.5270915031433105} 11/06/2021 21:30:18 - INFO - __main__ - Step 1646: {'lr': 0.00041125, 'samples': 316032, 'steps': 1645, 'loss/train': 2.423734188079834} 11/06/2021 21:30:18 - INFO - __main__ - Step 1647: {'lr': 0.0004115, 'samples': 316224, 'steps': 1646, 'loss/train': 3.113656520843506} 11/06/2021 21:30:18 - INFO - __main__ - Step 1648: {'lr': 0.00041175, 'samples': 316416, 'steps': 1647, 'loss/train': 3.738720655441284} 11/06/2021 21:30:19 - INFO - __main__ - Step 1649: {'lr': 0.000412, 'samples': 316608, 'steps': 1648, 'loss/train': 3.3871734142303467} 11/06/2021 21:30:20 - INFO - __main__ - Step 1650: {'lr': 0.00041225, 'samples': 316800, 'steps': 1649, 'loss/train': 3.1259071826934814} 11/06/2021 21:30:20 - INFO - __main__ - Step 1651: {'lr': 0.0004125, 'samples': 316992, 'steps': 1650, 'loss/train': 2.9349560737609863} 11/06/2021 21:30:21 - INFO - __main__ - Step 1652: {'lr': 0.00041275000000000003, 'samples': 317184, 'steps': 1651, 'loss/train': 2.337951183319092} 11/06/2021 21:30:21 - INFO - __main__ - Step 1653: {'lr': 0.000413, 'samples': 317376, 'steps': 1652, 'loss/train': 3.269364595413208} 11/06/2021 21:30:21 - INFO - __main__ - Step 1654: {'lr': 0.00041325, 'samples': 317568, 'steps': 1653, 'loss/train': 2.937201499938965} 11/06/2021 21:30:22 - INFO - __main__ - Step 1655: {'lr': 0.00041349999999999997, 'samples': 317760, 'steps': 1654, 'loss/train': 3.1462650299072266} 11/06/2021 21:30:23 - INFO - __main__ - Step 1656: {'lr': 0.00041375, 'samples': 317952, 'steps': 1655, 'loss/train': 3.218020439147949} 11/06/2021 21:30:23 - INFO - __main__ - Step 1657: {'lr': 0.000414, 'samples': 318144, 'steps': 1656, 'loss/train': 2.919787645339966} 11/06/2021 21:30:23 - INFO - __main__ - Step 1658: {'lr': 0.00041425, 'samples': 318336, 'steps': 1657, 'loss/train': 2.882232666015625} 11/06/2021 21:30:24 - INFO - __main__ - Step 1659: {'lr': 0.0004145, 'samples': 318528, 'steps': 1658, 'loss/train': 3.018845558166504} 11/06/2021 21:30:24 - INFO - __main__ - Step 1660: {'lr': 0.00041475, 'samples': 318720, 'steps': 1659, 'loss/train': 3.1834557056427} 11/06/2021 21:30:25 - INFO - __main__ - Step 1661: {'lr': 0.000415, 'samples': 318912, 'steps': 1660, 'loss/train': 2.9305548667907715} 11/06/2021 21:30:26 - INFO - __main__ - Step 1662: {'lr': 0.00041525000000000004, 'samples': 319104, 'steps': 1661, 'loss/train': 2.6207032203674316} 11/06/2021 21:30:26 - INFO - __main__ - Step 1663: {'lr': 0.00041549999999999996, 'samples': 319296, 'steps': 1662, 'loss/train': 2.4517478942871094} 11/06/2021 21:30:26 - INFO - __main__ - Step 1664: {'lr': 0.00041575, 'samples': 319488, 'steps': 1663, 'loss/train': 3.7537591457366943} 11/06/2021 21:30:27 - INFO - __main__ - Step 1665: {'lr': 0.000416, 'samples': 319680, 'steps': 1664, 'loss/train': 2.756814956665039} 11/06/2021 21:30:28 - INFO - __main__ - Step 1666: {'lr': 0.00041625, 'samples': 319872, 'steps': 1665, 'loss/train': 1.4056485891342163} 11/06/2021 21:30:28 - INFO - __main__ - Step 1667: {'lr': 0.0004165, 'samples': 320064, 'steps': 1666, 'loss/train': 3.1112024784088135} 11/06/2021 21:30:29 - INFO - __main__ - Step 1668: {'lr': 0.00041675, 'samples': 320256, 'steps': 1667, 'loss/train': 2.9976048469543457} 11/06/2021 21:30:29 - INFO - __main__ - Step 1669: {'lr': 0.000417, 'samples': 320448, 'steps': 1668, 'loss/train': 2.675729274749756} 11/06/2021 21:30:29 - INFO - __main__ - Step 1670: {'lr': 0.00041725000000000003, 'samples': 320640, 'steps': 1669, 'loss/train': 3.3706750869750977} 11/06/2021 21:30:30 - INFO - __main__ - Step 1671: {'lr': 0.0004175, 'samples': 320832, 'steps': 1670, 'loss/train': 2.178697109222412} 11/06/2021 21:30:31 - INFO - __main__ - Step 1672: {'lr': 0.00041775000000000004, 'samples': 321024, 'steps': 1671, 'loss/train': 3.380084753036499} 11/06/2021 21:30:31 - INFO - __main__ - Step 1673: {'lr': 0.00041799999999999997, 'samples': 321216, 'steps': 1672, 'loss/train': 3.366750717163086} 11/06/2021 21:30:31 - INFO - __main__ - Step 1674: {'lr': 0.00041825, 'samples': 321408, 'steps': 1673, 'loss/train': 3.209273338317871} 11/06/2021 21:30:32 - INFO - __main__ - Step 1675: {'lr': 0.0004185, 'samples': 321600, 'steps': 1674, 'loss/train': 2.97452449798584} 11/06/2021 21:30:32 - INFO - __main__ - Step 1676: {'lr': 0.00041875, 'samples': 321792, 'steps': 1675, 'loss/train': 2.767279863357544} 11/06/2021 21:30:33 - INFO - __main__ - Step 1677: {'lr': 0.000419, 'samples': 321984, 'steps': 1676, 'loss/train': 2.001840114593506} 11/06/2021 21:30:34 - INFO - __main__ - Step 1678: {'lr': 0.00041925, 'samples': 322176, 'steps': 1677, 'loss/train': 2.869393825531006} 11/06/2021 21:30:34 - INFO - __main__ - Step 1679: {'lr': 0.0004195, 'samples': 322368, 'steps': 1678, 'loss/train': 3.0054664611816406} 11/06/2021 21:30:34 - INFO - __main__ - Step 1680: {'lr': 0.00041975000000000004, 'samples': 322560, 'steps': 1679, 'loss/train': 3.0030410289764404} 11/06/2021 21:30:35 - INFO - __main__ - Step 1681: {'lr': 0.00042, 'samples': 322752, 'steps': 1680, 'loss/train': 3.1293513774871826} 11/06/2021 21:30:36 - INFO - __main__ - Step 1682: {'lr': 0.00042025, 'samples': 322944, 'steps': 1681, 'loss/train': 2.867621421813965} 11/06/2021 21:30:36 - INFO - __main__ - Step 1683: {'lr': 0.0004205, 'samples': 323136, 'steps': 1682, 'loss/train': 3.148564100265503} 11/06/2021 21:30:36 - INFO - __main__ - Step 1684: {'lr': 0.00042075, 'samples': 323328, 'steps': 1683, 'loss/train': 3.7352867126464844} 11/06/2021 21:30:37 - INFO - __main__ - Step 1685: {'lr': 0.000421, 'samples': 323520, 'steps': 1684, 'loss/train': 2.9373221397399902} 11/06/2021 21:30:37 - INFO - __main__ - Step 1686: {'lr': 0.00042125, 'samples': 323712, 'steps': 1685, 'loss/train': 3.0616962909698486} 11/06/2021 21:30:38 - INFO - __main__ - Step 1687: {'lr': 0.0004215, 'samples': 323904, 'steps': 1686, 'loss/train': 3.012714147567749} 11/06/2021 21:30:38 - INFO - __main__ - Step 1688: {'lr': 0.00042175000000000003, 'samples': 324096, 'steps': 1687, 'loss/train': 2.1148290634155273} 11/06/2021 21:30:39 - INFO - __main__ - Step 1689: {'lr': 0.000422, 'samples': 324288, 'steps': 1688, 'loss/train': 2.7919063568115234} 11/06/2021 21:30:39 - INFO - __main__ - Step 1690: {'lr': 0.00042225000000000005, 'samples': 324480, 'steps': 1689, 'loss/train': 2.9707889556884766} 11/06/2021 21:30:39 - INFO - __main__ - Step 1691: {'lr': 0.00042249999999999997, 'samples': 324672, 'steps': 1690, 'loss/train': 2.998028039932251} 11/06/2021 21:30:40 - INFO - __main__ - Step 1692: {'lr': 0.00042275, 'samples': 324864, 'steps': 1691, 'loss/train': 2.591597557067871} 11/06/2021 21:30:41 - INFO - __main__ - Step 1693: {'lr': 0.000423, 'samples': 325056, 'steps': 1692, 'loss/train': 1.9363735914230347} 11/06/2021 21:30:41 - INFO - __main__ - Step 1694: {'lr': 0.00042325, 'samples': 325248, 'steps': 1693, 'loss/train': 4.028586387634277} 11/06/2021 21:30:42 - INFO - __main__ - Step 1695: {'lr': 0.0004235, 'samples': 325440, 'steps': 1694, 'loss/train': 2.9330668449401855} 11/06/2021 21:30:42 - INFO - __main__ - Step 1696: {'lr': 0.00042375000000000003, 'samples': 325632, 'steps': 1695, 'loss/train': 2.7509193420410156} 11/06/2021 21:30:42 - INFO - __main__ - Step 1697: {'lr': 0.000424, 'samples': 325824, 'steps': 1696, 'loss/train': 2.7954797744750977} 11/06/2021 21:30:44 - INFO - __main__ - Step 1698: {'lr': 0.00042425000000000004, 'samples': 326016, 'steps': 1697, 'loss/train': 2.988008499145508} 11/06/2021 21:30:44 - INFO - __main__ - Step 1699: {'lr': 0.0004245, 'samples': 326208, 'steps': 1698, 'loss/train': 1.511644959449768} 11/06/2021 21:30:44 - INFO - __main__ - Step 1700: {'lr': 0.00042475000000000005, 'samples': 326400, 'steps': 1699, 'loss/train': 2.778535842895508} 11/06/2021 21:30:45 - INFO - __main__ - Step 1701: {'lr': 0.000425, 'samples': 326592, 'steps': 1700, 'loss/train': 3.4467451572418213} 11/06/2021 21:30:45 - INFO - __main__ - Step 1702: {'lr': 0.00042525, 'samples': 326784, 'steps': 1701, 'loss/train': 2.888021469116211} 11/06/2021 21:30:46 - INFO - __main__ - Step 1703: {'lr': 0.0004255, 'samples': 326976, 'steps': 1702, 'loss/train': 3.509974718093872} 11/06/2021 21:30:46 - INFO - __main__ - Step 1704: {'lr': 0.00042575, 'samples': 327168, 'steps': 1703, 'loss/train': 3.2961580753326416} 11/06/2021 21:30:47 - INFO - __main__ - Step 1705: {'lr': 0.000426, 'samples': 327360, 'steps': 1704, 'loss/train': 2.881072521209717} 11/06/2021 21:30:47 - INFO - __main__ - Step 1706: {'lr': 0.00042625000000000003, 'samples': 327552, 'steps': 1705, 'loss/train': 2.8724474906921387} 11/06/2021 21:30:48 - INFO - __main__ - Step 1707: {'lr': 0.0004265, 'samples': 327744, 'steps': 1706, 'loss/train': 2.812833547592163} 11/06/2021 21:30:49 - INFO - __main__ - Step 1708: {'lr': 0.00042675000000000005, 'samples': 327936, 'steps': 1707, 'loss/train': 2.946378231048584} 11/06/2021 21:30:49 - INFO - __main__ - Step 1709: {'lr': 0.000427, 'samples': 328128, 'steps': 1708, 'loss/train': 3.0991225242614746} 11/06/2021 21:30:49 - INFO - __main__ - Step 1710: {'lr': 0.00042725, 'samples': 328320, 'steps': 1709, 'loss/train': 2.76066517829895} 11/06/2021 21:30:50 - INFO - __main__ - Step 1711: {'lr': 0.0004275, 'samples': 328512, 'steps': 1710, 'loss/train': 3.4125115871429443} 11/06/2021 21:30:50 - INFO - __main__ - Step 1712: {'lr': 0.00042775, 'samples': 328704, 'steps': 1711, 'loss/train': 3.148629903793335} 11/06/2021 21:30:51 - INFO - __main__ - Step 1713: {'lr': 0.000428, 'samples': 328896, 'steps': 1712, 'loss/train': 2.9542479515075684} 11/06/2021 21:30:51 - INFO - __main__ - Step 1714: {'lr': 0.00042825000000000003, 'samples': 329088, 'steps': 1713, 'loss/train': 2.862802743911743} 11/06/2021 21:30:52 - INFO - __main__ - Step 1715: {'lr': 0.0004285, 'samples': 329280, 'steps': 1714, 'loss/train': 3.073770046234131} 11/06/2021 21:30:52 - INFO - __main__ - Step 1716: {'lr': 0.00042875000000000004, 'samples': 329472, 'steps': 1715, 'loss/train': 3.261838436126709} 11/06/2021 21:30:52 - INFO - __main__ - Step 1717: {'lr': 0.000429, 'samples': 329664, 'steps': 1716, 'loss/train': 2.599968194961548} 11/06/2021 21:30:53 - INFO - __main__ - Step 1718: {'lr': 0.00042925000000000005, 'samples': 329856, 'steps': 1717, 'loss/train': 2.516638994216919} 11/06/2021 21:30:54 - INFO - __main__ - Step 1719: {'lr': 0.0004295, 'samples': 330048, 'steps': 1718, 'loss/train': 3.0501821041107178} 11/06/2021 21:30:54 - INFO - __main__ - Step 1720: {'lr': 0.00042975, 'samples': 330240, 'steps': 1719, 'loss/train': 2.847111225128174} 11/06/2021 21:30:55 - INFO - __main__ - Step 1721: {'lr': 0.00043, 'samples': 330432, 'steps': 1720, 'loss/train': 3.3006772994995117} 11/06/2021 21:30:55 - INFO - __main__ - Step 1722: {'lr': 0.00043025, 'samples': 330624, 'steps': 1721, 'loss/train': 2.7652220726013184} 11/06/2021 21:30:55 - INFO - __main__ - Step 1723: {'lr': 0.0004305, 'samples': 330816, 'steps': 1722, 'loss/train': 2.9833667278289795} 11/06/2021 21:30:56 - INFO - __main__ - Step 1724: {'lr': 0.00043075000000000003, 'samples': 331008, 'steps': 1723, 'loss/train': 2.814836263656616} 11/06/2021 21:30:57 - INFO - __main__ - Step 1725: {'lr': 0.000431, 'samples': 331200, 'steps': 1724, 'loss/train': 2.774881601333618} 11/06/2021 21:30:57 - INFO - __main__ - Step 1726: {'lr': 0.00043125000000000005, 'samples': 331392, 'steps': 1725, 'loss/train': 2.4785728454589844} 11/06/2021 21:30:57 - INFO - __main__ - Step 1727: {'lr': 0.0004315, 'samples': 331584, 'steps': 1726, 'loss/train': 2.6767044067382812} 11/06/2021 21:30:58 - INFO - __main__ - Step 1728: {'lr': 0.00043175, 'samples': 331776, 'steps': 1727, 'loss/train': 2.493360757827759} 11/06/2021 21:30:59 - INFO - __main__ - Step 1729: {'lr': 0.000432, 'samples': 331968, 'steps': 1728, 'loss/train': 2.8869781494140625} 11/06/2021 21:30:59 - INFO - __main__ - Step 1730: {'lr': 0.00043225, 'samples': 332160, 'steps': 1729, 'loss/train': 3.2163848876953125} 11/06/2021 21:30:59 - INFO - __main__ - Step 1731: {'lr': 0.0004325, 'samples': 332352, 'steps': 1730, 'loss/train': 2.487175703048706} 11/06/2021 21:31:00 - INFO - __main__ - Step 1732: {'lr': 0.00043275000000000003, 'samples': 332544, 'steps': 1731, 'loss/train': 3.1521098613739014} 11/06/2021 21:31:00 - INFO - __main__ - Step 1733: {'lr': 0.000433, 'samples': 332736, 'steps': 1732, 'loss/train': 2.8074662685394287} 11/06/2021 21:31:01 - INFO - __main__ - Step 1734: {'lr': 0.00043325000000000004, 'samples': 332928, 'steps': 1733, 'loss/train': 3.05580735206604} 11/06/2021 21:31:01 - INFO - __main__ - Step 1735: {'lr': 0.0004335, 'samples': 333120, 'steps': 1734, 'loss/train': 3.6109087467193604} 11/06/2021 21:31:02 - INFO - __main__ - Step 1736: {'lr': 0.00043375000000000005, 'samples': 333312, 'steps': 1735, 'loss/train': 2.645113945007324} 11/06/2021 21:31:02 - INFO - __main__ - Step 1737: {'lr': 0.00043400000000000003, 'samples': 333504, 'steps': 1736, 'loss/train': 3.2458112239837646} 11/06/2021 21:31:02 - INFO - __main__ - Step 1738: {'lr': 0.00043425, 'samples': 333696, 'steps': 1737, 'loss/train': 2.0575199127197266} 11/06/2021 21:31:03 - INFO - __main__ - Step 1739: {'lr': 0.0004345, 'samples': 333888, 'steps': 1738, 'loss/train': 2.66829514503479} 11/06/2021 21:31:04 - INFO - __main__ - Step 1740: {'lr': 0.00043475, 'samples': 334080, 'steps': 1739, 'loss/train': 2.9333367347717285} 11/06/2021 21:31:04 - INFO - __main__ - Step 1741: {'lr': 0.000435, 'samples': 334272, 'steps': 1740, 'loss/train': 3.1755714416503906} 11/06/2021 21:31:04 - INFO - __main__ - Step 1742: {'lr': 0.00043525000000000004, 'samples': 334464, 'steps': 1741, 'loss/train': 3.1603219509124756} 11/06/2021 21:31:05 - INFO - __main__ - Step 1743: {'lr': 0.0004355, 'samples': 334656, 'steps': 1742, 'loss/train': 2.6941590309143066} 11/06/2021 21:31:05 - INFO - __main__ - Step 1744: {'lr': 0.00043575000000000005, 'samples': 334848, 'steps': 1743, 'loss/train': 2.652301788330078} 11/06/2021 21:31:06 - INFO - __main__ - Step 1745: {'lr': 0.000436, 'samples': 335040, 'steps': 1744, 'loss/train': 2.9104788303375244} 11/06/2021 21:31:06 - INFO - __main__ - Step 1746: {'lr': 0.00043625000000000006, 'samples': 335232, 'steps': 1745, 'loss/train': 2.98048996925354} 11/06/2021 21:31:07 - INFO - __main__ - Step 1747: {'lr': 0.0004365, 'samples': 335424, 'steps': 1746, 'loss/train': 2.9957258701324463} 11/06/2021 21:31:07 - INFO - __main__ - Step 1748: {'lr': 0.00043675, 'samples': 335616, 'steps': 1747, 'loss/train': 2.979066848754883} 11/06/2021 21:31:08 - INFO - __main__ - Step 1749: {'lr': 0.000437, 'samples': 335808, 'steps': 1748, 'loss/train': 2.8784215450286865} 11/06/2021 21:31:09 - INFO - __main__ - Step 1750: {'lr': 0.00043725000000000003, 'samples': 336000, 'steps': 1749, 'loss/train': 2.7070152759552} 11/06/2021 21:31:09 - INFO - __main__ - Step 1751: {'lr': 0.0004375, 'samples': 336192, 'steps': 1750, 'loss/train': 3.0066816806793213} 11/06/2021 21:31:09 - INFO - __main__ - Step 1752: {'lr': 0.00043775, 'samples': 336384, 'steps': 1751, 'loss/train': 2.9297680854797363} 11/06/2021 21:31:10 - INFO - __main__ - Step 1753: {'lr': 0.000438, 'samples': 336576, 'steps': 1752, 'loss/train': 2.3090498447418213} 11/06/2021 21:31:10 - INFO - __main__ - Step 1754: {'lr': 0.00043825, 'samples': 336768, 'steps': 1753, 'loss/train': 2.7136483192443848} 11/06/2021 21:31:11 - INFO - __main__ - Step 1755: {'lr': 0.00043850000000000003, 'samples': 336960, 'steps': 1754, 'loss/train': 2.4475595951080322} 11/06/2021 21:31:11 - INFO - __main__ - Step 1756: {'lr': 0.00043874999999999996, 'samples': 337152, 'steps': 1755, 'loss/train': 2.8239781856536865} 11/06/2021 21:31:12 - INFO - __main__ - Step 1757: {'lr': 0.000439, 'samples': 337344, 'steps': 1756, 'loss/train': 3.583254337310791} 11/06/2021 21:31:12 - INFO - __main__ - Step 1758: {'lr': 0.00043924999999999997, 'samples': 337536, 'steps': 1757, 'loss/train': 2.8755791187286377} 11/06/2021 21:31:12 - INFO - __main__ - Step 1759: {'lr': 0.0004395, 'samples': 337728, 'steps': 1758, 'loss/train': 2.466693878173828} 11/06/2021 21:31:14 - INFO - __main__ - Step 1760: {'lr': 0.00043975, 'samples': 337920, 'steps': 1759, 'loss/train': 1.9713040590286255} 11/06/2021 21:31:14 - INFO - __main__ - Step 1761: {'lr': 0.00044, 'samples': 338112, 'steps': 1760, 'loss/train': 2.958988904953003} 11/06/2021 21:31:14 - INFO - __main__ - Step 1762: {'lr': 0.00044025, 'samples': 338304, 'steps': 1761, 'loss/train': 2.85742449760437} 11/06/2021 21:31:15 - INFO - __main__ - Step 1763: {'lr': 0.00044050000000000003, 'samples': 338496, 'steps': 1762, 'loss/train': 3.1893415451049805} 11/06/2021 21:31:15 - INFO - __main__ - Step 1764: {'lr': 0.00044075, 'samples': 338688, 'steps': 1763, 'loss/train': 2.489380121231079} 11/06/2021 21:31:16 - INFO - __main__ - Step 1765: {'lr': 0.000441, 'samples': 338880, 'steps': 1764, 'loss/train': 2.26016902923584} 11/06/2021 21:31:16 - INFO - __main__ - Step 1766: {'lr': 0.00044124999999999996, 'samples': 339072, 'steps': 1765, 'loss/train': 2.8823275566101074} 11/06/2021 21:31:17 - INFO - __main__ - Step 1767: {'lr': 0.0004415, 'samples': 339264, 'steps': 1766, 'loss/train': 3.0336639881134033} 11/06/2021 21:31:17 - INFO - __main__ - Step 1768: {'lr': 0.00044175, 'samples': 339456, 'steps': 1767, 'loss/train': 2.4429008960723877} 11/06/2021 21:31:17 - INFO - __main__ - Step 1769: {'lr': 0.000442, 'samples': 339648, 'steps': 1768, 'loss/train': 3.196798086166382} 11/06/2021 21:31:18 - INFO - __main__ - Step 1770: {'lr': 0.00044225, 'samples': 339840, 'steps': 1769, 'loss/train': 3.4543304443359375} 11/06/2021 21:31:19 - INFO - __main__ - Step 1771: {'lr': 0.0004425, 'samples': 340032, 'steps': 1770, 'loss/train': 2.851696252822876} 11/06/2021 21:31:19 - INFO - __main__ - Step 1772: {'lr': 0.00044275, 'samples': 340224, 'steps': 1771, 'loss/train': 3.060702085494995} 11/06/2021 21:31:19 - INFO - __main__ - Step 1773: {'lr': 0.00044300000000000003, 'samples': 340416, 'steps': 1772, 'loss/train': 2.7474842071533203} 11/06/2021 21:31:20 - INFO - __main__ - Step 1774: {'lr': 0.00044325, 'samples': 340608, 'steps': 1773, 'loss/train': 2.9190454483032227} 11/06/2021 21:31:21 - INFO - __main__ - Step 1775: {'lr': 0.0004435, 'samples': 340800, 'steps': 1774, 'loss/train': 2.802793025970459} 11/06/2021 21:31:21 - INFO - __main__ - Step 1776: {'lr': 0.00044374999999999997, 'samples': 340992, 'steps': 1775, 'loss/train': 2.8785560131073} 11/06/2021 21:31:21 - INFO - __main__ - Step 1777: {'lr': 0.000444, 'samples': 341184, 'steps': 1776, 'loss/train': 2.995643138885498} 11/06/2021 21:31:22 - INFO - __main__ - Step 1778: {'lr': 0.00044425, 'samples': 341376, 'steps': 1777, 'loss/train': 1.8993228673934937} 11/06/2021 21:31:22 - INFO - __main__ - Step 1779: {'lr': 0.0004445, 'samples': 341568, 'steps': 1778, 'loss/train': 2.85463547706604} 11/06/2021 21:31:23 - INFO - __main__ - Step 1780: {'lr': 0.00044475, 'samples': 341760, 'steps': 1779, 'loss/train': 2.185767650604248} 11/06/2021 21:31:24 - INFO - __main__ - Step 1781: {'lr': 0.00044500000000000003, 'samples': 341952, 'steps': 1780, 'loss/train': 2.92201828956604} 11/06/2021 21:31:24 - INFO - __main__ - Step 1782: {'lr': 0.00044525, 'samples': 342144, 'steps': 1781, 'loss/train': 3.134248971939087} 11/06/2021 21:31:24 - INFO - __main__ - Step 1783: {'lr': 0.00044550000000000004, 'samples': 342336, 'steps': 1782, 'loss/train': 2.170048952102661} 11/06/2021 21:31:25 - INFO - __main__ - Step 1784: {'lr': 0.00044574999999999997, 'samples': 342528, 'steps': 1783, 'loss/train': 3.8918814659118652} 11/06/2021 21:31:26 - INFO - __main__ - Step 1785: {'lr': 0.000446, 'samples': 342720, 'steps': 1784, 'loss/train': 2.8046724796295166} 11/06/2021 21:31:26 - INFO - __main__ - Step 1786: {'lr': 0.00044625, 'samples': 342912, 'steps': 1785, 'loss/train': 1.7093374729156494} 11/06/2021 21:31:26 - INFO - __main__ - Step 1787: {'lr': 0.0004465, 'samples': 343104, 'steps': 1786, 'loss/train': 2.761946678161621} 11/06/2021 21:31:27 - INFO - __main__ - Step 1788: {'lr': 0.00044675, 'samples': 343296, 'steps': 1787, 'loss/train': 3.1174871921539307} 11/06/2021 21:31:27 - INFO - __main__ - Step 1789: {'lr': 0.000447, 'samples': 343488, 'steps': 1788, 'loss/train': 0.9450645446777344} 11/06/2021 21:31:27 - INFO - __main__ - Step 1790: {'lr': 0.00044725, 'samples': 343680, 'steps': 1789, 'loss/train': 2.685868263244629} 11/06/2021 21:31:28 - INFO - __main__ - Step 1791: {'lr': 0.00044750000000000004, 'samples': 343872, 'steps': 1790, 'loss/train': 3.149627685546875} 11/06/2021 21:31:29 - INFO - __main__ - Step 1792: {'lr': 0.00044775, 'samples': 344064, 'steps': 1791, 'loss/train': 2.8557493686676025} 11/06/2021 21:31:29 - INFO - __main__ - Step 1793: {'lr': 0.000448, 'samples': 344256, 'steps': 1792, 'loss/train': 2.3917579650878906} 11/06/2021 21:31:29 - INFO - __main__ - Step 1794: {'lr': 0.00044824999999999997, 'samples': 344448, 'steps': 1793, 'loss/train': 2.5539019107818604} 11/06/2021 21:31:30 - INFO - __main__ - Step 1795: {'lr': 0.0004485, 'samples': 344640, 'steps': 1794, 'loss/train': 2.866556167602539} 11/06/2021 21:31:31 - INFO - __main__ - Step 1796: {'lr': 0.00044875, 'samples': 344832, 'steps': 1795, 'loss/train': 2.69342303276062} 11/06/2021 21:31:31 - INFO - __main__ - Step 1797: {'lr': 0.000449, 'samples': 345024, 'steps': 1796, 'loss/train': 3.4725310802459717} 11/06/2021 21:31:32 - INFO - __main__ - Step 1798: {'lr': 0.00044925, 'samples': 345216, 'steps': 1797, 'loss/train': 3.1553845405578613} 11/06/2021 21:31:32 - INFO - __main__ - Step 1799: {'lr': 0.00044950000000000003, 'samples': 345408, 'steps': 1798, 'loss/train': 2.5512850284576416} 11/06/2021 21:31:32 - INFO - __main__ - Step 1800: {'lr': 0.00044975, 'samples': 345600, 'steps': 1799, 'loss/train': 2.6328558921813965} 11/06/2021 21:31:33 - INFO - __main__ - Step 1801: {'lr': 0.00045000000000000004, 'samples': 345792, 'steps': 1800, 'loss/train': 2.415855646133423} 11/06/2021 21:31:34 - INFO - __main__ - Step 1802: {'lr': 0.00045024999999999997, 'samples': 345984, 'steps': 1801, 'loss/train': 2.065478801727295} 11/06/2021 21:31:34 - INFO - __main__ - Step 1803: {'lr': 0.0004505, 'samples': 346176, 'steps': 1802, 'loss/train': 3.1000237464904785} 11/06/2021 21:31:34 - INFO - __main__ - Step 1804: {'lr': 0.00045075, 'samples': 346368, 'steps': 1803, 'loss/train': 3.079942464828491} 11/06/2021 21:31:35 - INFO - __main__ - Step 1805: {'lr': 0.000451, 'samples': 346560, 'steps': 1804, 'loss/train': 2.9212794303894043} 11/06/2021 21:31:36 - INFO - __main__ - Step 1806: {'lr': 0.00045125, 'samples': 346752, 'steps': 1805, 'loss/train': 2.327099323272705} 11/06/2021 21:31:36 - INFO - __main__ - Step 1807: {'lr': 0.0004515, 'samples': 346944, 'steps': 1806, 'loss/train': 2.2962563037872314} 11/06/2021 21:31:36 - INFO - __main__ - Step 1808: {'lr': 0.00045175, 'samples': 347136, 'steps': 1807, 'loss/train': 2.611140012741089} 11/06/2021 21:31:37 - INFO - __main__ - Step 1809: {'lr': 0.00045200000000000004, 'samples': 347328, 'steps': 1808, 'loss/train': 2.6826744079589844} 11/06/2021 21:31:37 - INFO - __main__ - Step 1810: {'lr': 0.00045225, 'samples': 347520, 'steps': 1809, 'loss/train': 1.936403512954712} 11/06/2021 21:31:38 - INFO - __main__ - Step 1811: {'lr': 0.00045250000000000005, 'samples': 347712, 'steps': 1810, 'loss/train': 2.373436450958252} 11/06/2021 21:31:39 - INFO - __main__ - Step 1812: {'lr': 0.00045275, 'samples': 347904, 'steps': 1811, 'loss/train': 2.730616807937622} 11/06/2021 21:31:39 - INFO - __main__ - Step 1813: {'lr': 0.000453, 'samples': 348096, 'steps': 1812, 'loss/train': 2.415011405944824} 11/06/2021 21:31:39 - INFO - __main__ - Step 1814: {'lr': 0.00045325, 'samples': 348288, 'steps': 1813, 'loss/train': 3.2586300373077393} 11/06/2021 21:31:40 - INFO - __main__ - Step 1815: {'lr': 0.0004535, 'samples': 348480, 'steps': 1814, 'loss/train': 2.568127155303955} 11/06/2021 21:31:40 - INFO - __main__ - Step 1816: {'lr': 0.00045375, 'samples': 348672, 'steps': 1815, 'loss/train': 2.9116480350494385} 11/06/2021 21:31:41 - INFO - __main__ - Step 1817: {'lr': 0.00045400000000000003, 'samples': 348864, 'steps': 1816, 'loss/train': 2.36753249168396} 11/06/2021 21:31:41 - INFO - __main__ - Step 1818: {'lr': 0.00045425, 'samples': 349056, 'steps': 1817, 'loss/train': 1.9278064966201782} 11/06/2021 21:31:42 - INFO - __main__ - Step 1819: {'lr': 0.00045450000000000004, 'samples': 349248, 'steps': 1818, 'loss/train': 3.1639034748077393} 11/06/2021 21:31:42 - INFO - __main__ - Step 1820: {'lr': 0.00045475, 'samples': 349440, 'steps': 1819, 'loss/train': 2.775869846343994} 11/06/2021 21:31:42 - INFO - __main__ - Step 1821: {'lr': 0.000455, 'samples': 349632, 'steps': 1820, 'loss/train': 1.739648699760437} 11/06/2021 21:31:44 - INFO - __main__ - Step 1822: {'lr': 0.00045525, 'samples': 349824, 'steps': 1821, 'loss/train': 2.914762258529663} 11/06/2021 21:31:44 - INFO - __main__ - Step 1823: {'lr': 0.0004555, 'samples': 350016, 'steps': 1822, 'loss/train': 2.881727457046509} 11/06/2021 21:31:44 - INFO - __main__ - Step 1824: {'lr': 0.00045575, 'samples': 350208, 'steps': 1823, 'loss/train': 2.6797189712524414} 11/06/2021 21:31:45 - INFO - __main__ - Step 1825: {'lr': 0.000456, 'samples': 350400, 'steps': 1824, 'loss/train': 3.1673336029052734} 11/06/2021 21:31:45 - INFO - __main__ - Step 1826: {'lr': 0.00045625, 'samples': 350592, 'steps': 1825, 'loss/train': 2.9814321994781494} 11/06/2021 21:31:45 - INFO - __main__ - Step 1827: {'lr': 0.00045650000000000004, 'samples': 350784, 'steps': 1826, 'loss/train': 2.706651210784912} 11/06/2021 21:31:47 - INFO - __main__ - Step 1828: {'lr': 0.00045675, 'samples': 350976, 'steps': 1827, 'loss/train': 2.262235403060913} 11/06/2021 21:31:47 - INFO - __main__ - Step 1829: {'lr': 0.00045700000000000005, 'samples': 351168, 'steps': 1828, 'loss/train': 3.5420644283294678} 11/06/2021 21:31:47 - INFO - __main__ - Step 1830: {'lr': 0.00045725, 'samples': 351360, 'steps': 1829, 'loss/train': 2.597929000854492} 11/06/2021 21:31:48 - INFO - __main__ - Step 1831: {'lr': 0.0004575, 'samples': 351552, 'steps': 1830, 'loss/train': 3.1444313526153564} 11/06/2021 21:31:48 - INFO - __main__ - Step 1832: {'lr': 0.00045775, 'samples': 351744, 'steps': 1831, 'loss/train': 2.755781650543213} 11/06/2021 21:31:49 - INFO - __main__ - Step 1833: {'lr': 0.000458, 'samples': 351936, 'steps': 1832, 'loss/train': 2.7278811931610107} 11/06/2021 21:31:50 - INFO - __main__ - Step 1834: {'lr': 0.00045825, 'samples': 352128, 'steps': 1833, 'loss/train': 3.7509689331054688} 11/06/2021 21:31:50 - INFO - __main__ - Step 1835: {'lr': 0.00045850000000000003, 'samples': 352320, 'steps': 1834, 'loss/train': 2.9152324199676514} 11/06/2021 21:31:50 - INFO - __main__ - Step 1836: {'lr': 0.00045875, 'samples': 352512, 'steps': 1835, 'loss/train': 3.254568338394165} 11/06/2021 21:31:51 - INFO - __main__ - Step 1837: {'lr': 0.00045900000000000004, 'samples': 352704, 'steps': 1836, 'loss/train': 3.5360164642333984} 11/06/2021 21:31:51 - INFO - __main__ - Step 1838: {'lr': 0.00045925, 'samples': 352896, 'steps': 1837, 'loss/train': 2.5410776138305664} 11/06/2021 21:31:52 - INFO - __main__ - Step 1839: {'lr': 0.00045950000000000006, 'samples': 353088, 'steps': 1838, 'loss/train': 3.515568971633911} 11/06/2021 21:31:52 - INFO - __main__ - Step 1840: {'lr': 0.00045975, 'samples': 353280, 'steps': 1839, 'loss/train': 2.9194540977478027} 11/06/2021 21:31:53 - INFO - __main__ - Step 1841: {'lr': 0.00046, 'samples': 353472, 'steps': 1840, 'loss/train': 3.339538335800171} 11/06/2021 21:31:53 - INFO - __main__ - Step 1842: {'lr': 0.00046025, 'samples': 353664, 'steps': 1841, 'loss/train': 3.2247955799102783} 11/06/2021 21:31:53 - INFO - __main__ - Step 1843: {'lr': 0.0004605, 'samples': 353856, 'steps': 1842, 'loss/train': 2.9142487049102783} 11/06/2021 21:31:54 - INFO - __main__ - Step 1844: {'lr': 0.00046075, 'samples': 354048, 'steps': 1843, 'loss/train': 3.003730058670044} 11/06/2021 21:31:55 - INFO - __main__ - Step 1845: {'lr': 0.00046100000000000004, 'samples': 354240, 'steps': 1844, 'loss/train': 2.9719865322113037} 11/06/2021 21:31:55 - INFO - __main__ - Step 1846: {'lr': 0.00046125, 'samples': 354432, 'steps': 1845, 'loss/train': 2.9568333625793457} 11/06/2021 21:31:55 - INFO - __main__ - Step 1847: {'lr': 0.00046150000000000005, 'samples': 354624, 'steps': 1846, 'loss/train': 3.0182647705078125} 11/06/2021 21:31:56 - INFO - __main__ - Step 1848: {'lr': 0.00046175000000000003, 'samples': 354816, 'steps': 1847, 'loss/train': 2.928616762161255} 11/06/2021 21:31:57 - INFO - __main__ - Step 1849: {'lr': 0.000462, 'samples': 355008, 'steps': 1848, 'loss/train': 2.610539674758911} 11/06/2021 21:31:57 - INFO - __main__ - Step 1850: {'lr': 0.00046225, 'samples': 355200, 'steps': 1849, 'loss/train': 2.6568455696105957} 11/06/2021 21:31:57 - INFO - __main__ - Step 1851: {'lr': 0.0004625, 'samples': 355392, 'steps': 1850, 'loss/train': 3.0052649974823} 11/06/2021 21:31:58 - INFO - __main__ - Step 1852: {'lr': 0.00046275, 'samples': 355584, 'steps': 1851, 'loss/train': 2.291944980621338} 11/06/2021 21:31:58 - INFO - __main__ - Step 1853: {'lr': 0.00046300000000000003, 'samples': 355776, 'steps': 1852, 'loss/train': 2.464561700820923} 11/06/2021 21:31:59 - INFO - __main__ - Step 1854: {'lr': 0.00046325, 'samples': 355968, 'steps': 1853, 'loss/train': 2.724757671356201} 11/06/2021 21:32:00 - INFO - __main__ - Step 1855: {'lr': 0.00046350000000000004, 'samples': 356160, 'steps': 1854, 'loss/train': 2.5522210597991943} 11/06/2021 21:32:00 - INFO - __main__ - Step 1856: {'lr': 0.00046375, 'samples': 356352, 'steps': 1855, 'loss/train': 2.9748101234436035} 11/06/2021 21:32:00 - INFO - __main__ - Step 1857: {'lr': 0.00046400000000000006, 'samples': 356544, 'steps': 1856, 'loss/train': 2.8747081756591797} 11/06/2021 21:32:01 - INFO - __main__ - Step 1858: {'lr': 0.00046425, 'samples': 356736, 'steps': 1857, 'loss/train': 3.183314085006714} 11/06/2021 21:32:01 - INFO - __main__ - Step 1859: {'lr': 0.0004645, 'samples': 356928, 'steps': 1858, 'loss/train': 2.9345762729644775} 11/06/2021 21:32:02 - INFO - __main__ - Step 1860: {'lr': 0.00046475, 'samples': 357120, 'steps': 1859, 'loss/train': 2.89750337600708} 11/06/2021 21:32:02 - INFO - __main__ - Step 1861: {'lr': 0.000465, 'samples': 357312, 'steps': 1860, 'loss/train': 2.8957128524780273} 11/06/2021 21:32:03 - INFO - __main__ - Step 1862: {'lr': 0.00046525, 'samples': 357504, 'steps': 1861, 'loss/train': 3.1396281719207764} 11/06/2021 21:32:03 - INFO - __main__ - Step 1863: {'lr': 0.00046550000000000004, 'samples': 357696, 'steps': 1862, 'loss/train': 2.8213651180267334} 11/06/2021 21:32:03 - INFO - __main__ - Step 1864: {'lr': 0.00046575, 'samples': 357888, 'steps': 1863, 'loss/train': 3.4859542846679688} 11/06/2021 21:32:04 - INFO - __main__ - Step 1865: {'lr': 0.00046600000000000005, 'samples': 358080, 'steps': 1864, 'loss/train': 2.8421342372894287} 11/06/2021 21:32:05 - INFO - __main__ - Step 1866: {'lr': 0.00046625000000000003, 'samples': 358272, 'steps': 1865, 'loss/train': 2.0108532905578613} 11/06/2021 21:32:05 - INFO - __main__ - Step 1867: {'lr': 0.0004665, 'samples': 358464, 'steps': 1866, 'loss/train': 3.0010299682617188} 11/06/2021 21:32:05 - INFO - __main__ - Step 1868: {'lr': 0.00046675, 'samples': 358656, 'steps': 1867, 'loss/train': 2.391618013381958} 11/06/2021 21:32:06 - INFO - __main__ - Step 1869: {'lr': 0.000467, 'samples': 358848, 'steps': 1868, 'loss/train': 1.2518956661224365} 11/06/2021 21:32:07 - INFO - __main__ - Step 1870: {'lr': 0.00046725, 'samples': 359040, 'steps': 1869, 'loss/train': 2.0170106887817383} 11/06/2021 21:32:07 - INFO - __main__ - Step 1871: {'lr': 0.00046750000000000003, 'samples': 359232, 'steps': 1870, 'loss/train': 1.179073691368103} 11/06/2021 21:32:08 - INFO - __main__ - Step 1872: {'lr': 0.00046775, 'samples': 359424, 'steps': 1871, 'loss/train': 2.682760715484619} 11/06/2021 21:32:08 - INFO - __main__ - Step 1873: {'lr': 0.00046800000000000005, 'samples': 359616, 'steps': 1872, 'loss/train': 2.571380138397217} 11/06/2021 21:32:08 - INFO - __main__ - Step 1874: {'lr': 0.00046825, 'samples': 359808, 'steps': 1873, 'loss/train': 3.113672971725464} 11/06/2021 21:32:10 - INFO - __main__ - Step 1875: {'lr': 0.00046850000000000006, 'samples': 360000, 'steps': 1874, 'loss/train': 3.40779447555542} 11/06/2021 21:32:10 - INFO - __main__ - Step 1876: {'lr': 0.00046875, 'samples': 360192, 'steps': 1875, 'loss/train': 2.6321020126342773} 11/06/2021 21:32:10 - INFO - __main__ - Step 1877: {'lr': 0.00046899999999999996, 'samples': 360384, 'steps': 1876, 'loss/train': 2.4527957439422607} 11/06/2021 21:32:11 - INFO - __main__ - Step 1878: {'lr': 0.00046925, 'samples': 360576, 'steps': 1877, 'loss/train': 2.8265886306762695} 11/06/2021 21:32:11 - INFO - __main__ - Step 1879: {'lr': 0.0004695, 'samples': 360768, 'steps': 1878, 'loss/train': 2.863210916519165} 11/06/2021 21:32:11 - INFO - __main__ - Step 1880: {'lr': 0.00046975, 'samples': 360960, 'steps': 1879, 'loss/train': 2.7804150581359863} 11/06/2021 21:32:12 - INFO - __main__ - Step 1881: {'lr': 0.00047, 'samples': 361152, 'steps': 1880, 'loss/train': 2.7934844493865967} 11/06/2021 21:32:13 - INFO - __main__ - Step 1882: {'lr': 0.00047025, 'samples': 361344, 'steps': 1881, 'loss/train': 2.5030977725982666} 11/06/2021 21:32:13 - INFO - __main__ - Step 1883: {'lr': 0.0004705, 'samples': 361536, 'steps': 1882, 'loss/train': 2.1091949939727783} 11/06/2021 21:32:13 - INFO - __main__ - Step 1884: {'lr': 0.00047075000000000003, 'samples': 361728, 'steps': 1883, 'loss/train': 2.5549018383026123} 11/06/2021 21:32:14 - INFO - __main__ - Step 1885: {'lr': 0.000471, 'samples': 361920, 'steps': 1884, 'loss/train': 2.7697577476501465} 11/06/2021 21:32:15 - INFO - __main__ - Step 1886: {'lr': 0.00047125, 'samples': 362112, 'steps': 1885, 'loss/train': 2.8604817390441895} 11/06/2021 21:32:15 - INFO - __main__ - Step 1887: {'lr': 0.00047149999999999997, 'samples': 362304, 'steps': 1886, 'loss/train': 2.9714930057525635} 11/06/2021 21:32:16 - INFO - __main__ - Step 1888: {'lr': 0.00047175, 'samples': 362496, 'steps': 1887, 'loss/train': 3.0450122356414795} 11/06/2021 21:32:16 - INFO - __main__ - Step 1889: {'lr': 0.000472, 'samples': 362688, 'steps': 1888, 'loss/train': 2.69807505607605} 11/06/2021 21:32:16 - INFO - __main__ - Step 1890: {'lr': 0.00047225, 'samples': 362880, 'steps': 1889, 'loss/train': 2.992997884750366} 11/06/2021 21:32:17 - INFO - __main__ - Step 1891: {'lr': 0.0004725, 'samples': 363072, 'steps': 1890, 'loss/train': 2.015784740447998} 11/06/2021 21:32:18 - INFO - __main__ - Step 1892: {'lr': 0.00047275, 'samples': 363264, 'steps': 1891, 'loss/train': 2.8816325664520264} 11/06/2021 21:32:18 - INFO - __main__ - Step 1893: {'lr': 0.000473, 'samples': 363456, 'steps': 1892, 'loss/train': 3.2066597938537598} 11/06/2021 21:32:18 - INFO - __main__ - Step 1894: {'lr': 0.00047325000000000004, 'samples': 363648, 'steps': 1893, 'loss/train': 2.5090718269348145} 11/06/2021 21:32:19 - INFO - __main__ - Step 1895: {'lr': 0.00047349999999999996, 'samples': 363840, 'steps': 1894, 'loss/train': 2.715529680252075} 11/06/2021 21:32:19 - INFO - __main__ - Step 1896: {'lr': 0.00047375, 'samples': 364032, 'steps': 1895, 'loss/train': 2.8144490718841553} 11/06/2021 21:32:20 - INFO - __main__ - Step 1897: {'lr': 0.000474, 'samples': 364224, 'steps': 1896, 'loss/train': 2.670848846435547} 11/06/2021 21:32:21 - INFO - __main__ - Step 1898: {'lr': 0.00047425, 'samples': 364416, 'steps': 1897, 'loss/train': 2.614722728729248} 11/06/2021 21:32:21 - INFO - __main__ - Step 1899: {'lr': 0.0004745, 'samples': 364608, 'steps': 1898, 'loss/train': 2.741778612136841} 11/06/2021 21:32:21 - INFO - __main__ - Step 1900: {'lr': 0.00047475, 'samples': 364800, 'steps': 1899, 'loss/train': 3.1232104301452637} 11/06/2021 21:32:22 - INFO - __main__ - Step 1901: {'lr': 0.000475, 'samples': 364992, 'steps': 1900, 'loss/train': 2.564000368118286} 11/06/2021 21:32:23 - INFO - __main__ - Step 1902: {'lr': 0.00047525000000000003, 'samples': 365184, 'steps': 1901, 'loss/train': 2.7664237022399902} 11/06/2021 21:32:23 - INFO - __main__ - Step 1903: {'lr': 0.0004755, 'samples': 365376, 'steps': 1902, 'loss/train': 2.554410934448242} 11/06/2021 21:32:23 - INFO - __main__ - Step 1904: {'lr': 0.00047575, 'samples': 365568, 'steps': 1903, 'loss/train': 2.5446524620056152} 11/06/2021 21:32:24 - INFO - __main__ - Step 1905: {'lr': 0.00047599999999999997, 'samples': 365760, 'steps': 1904, 'loss/train': 2.754312515258789} 11/06/2021 21:32:24 - INFO - __main__ - Step 1906: {'lr': 0.00047625, 'samples': 365952, 'steps': 1905, 'loss/train': 2.7138187885284424} 11/06/2021 21:32:25 - INFO - __main__ - Step 1907: {'lr': 0.0004765, 'samples': 366144, 'steps': 1906, 'loss/train': 2.9011712074279785} 11/06/2021 21:32:25 - INFO - __main__ - Step 1908: {'lr': 0.00047675, 'samples': 366336, 'steps': 1907, 'loss/train': 2.7939672470092773} 11/06/2021 21:32:26 - INFO - __main__ - Step 1909: {'lr': 0.000477, 'samples': 366528, 'steps': 1908, 'loss/train': 3.0879056453704834} 11/06/2021 21:32:26 - INFO - __main__ - Step 1910: {'lr': 0.00047725, 'samples': 366720, 'steps': 1909, 'loss/train': 2.4980082511901855} 11/06/2021 21:32:26 - INFO - __main__ - Step 1911: {'lr': 0.0004775, 'samples': 366912, 'steps': 1910, 'loss/train': 2.949843645095825} 11/06/2021 21:32:27 - INFO - __main__ - Step 1912: {'lr': 0.00047775000000000004, 'samples': 367104, 'steps': 1911, 'loss/train': 2.661370277404785} 11/06/2021 21:32:28 - INFO - __main__ - Step 1913: {'lr': 0.00047799999999999996, 'samples': 367296, 'steps': 1912, 'loss/train': 2.855457305908203} 11/06/2021 21:32:28 - INFO - __main__ - Step 1914: {'lr': 0.00047825, 'samples': 367488, 'steps': 1913, 'loss/train': 2.790748119354248} 11/06/2021 21:32:28 - INFO - __main__ - Step 1915: {'lr': 0.0004785, 'samples': 367680, 'steps': 1914, 'loss/train': 2.1920478343963623} 11/06/2021 21:32:29 - INFO - __main__ - Step 1916: {'lr': 0.00047875, 'samples': 367872, 'steps': 1915, 'loss/train': 3.010140895843506} 11/06/2021 21:32:30 - INFO - __main__ - Step 1917: {'lr': 0.000479, 'samples': 368064, 'steps': 1916, 'loss/train': 2.7934629917144775} 11/06/2021 21:32:30 - INFO - __main__ - Step 1918: {'lr': 0.00047925, 'samples': 368256, 'steps': 1917, 'loss/train': 2.759575128555298} 11/06/2021 21:32:31 - INFO - __main__ - Step 1919: {'lr': 0.0004795, 'samples': 368448, 'steps': 1918, 'loss/train': 2.1986517906188965} 11/06/2021 21:32:31 - INFO - __main__ - Step 1920: {'lr': 0.00047975000000000003, 'samples': 368640, 'steps': 1919, 'loss/train': 2.7708301544189453} 11/06/2021 21:32:31 - INFO - __main__ - Step 1921: {'lr': 0.00048, 'samples': 368832, 'steps': 1920, 'loss/train': 4.627929210662842} 11/06/2021 21:32:32 - INFO - __main__ - Step 1922: {'lr': 0.00048025000000000005, 'samples': 369024, 'steps': 1921, 'loss/train': 2.932852029800415} 11/06/2021 21:32:33 - INFO - __main__ - Step 1923: {'lr': 0.00048049999999999997, 'samples': 369216, 'steps': 1922, 'loss/train': 2.970147132873535} 11/06/2021 21:32:33 - INFO - __main__ - Step 1924: {'lr': 0.00048075, 'samples': 369408, 'steps': 1923, 'loss/train': 3.2570888996124268} 11/06/2021 21:32:33 - INFO - __main__ - Step 1925: {'lr': 0.000481, 'samples': 369600, 'steps': 1924, 'loss/train': 2.7547266483306885} 11/06/2021 21:32:34 - INFO - __main__ - Step 1926: {'lr': 0.00048125, 'samples': 369792, 'steps': 1925, 'loss/train': 2.5552988052368164} 11/06/2021 21:32:34 - INFO - __main__ - Step 1927: {'lr': 0.0004815, 'samples': 369984, 'steps': 1926, 'loss/train': 2.8720643520355225} 11/06/2021 21:32:35 - INFO - __main__ - Step 1928: {'lr': 0.00048175000000000003, 'samples': 370176, 'steps': 1927, 'loss/train': 3.4235241413116455} 11/06/2021 21:32:35 - INFO - __main__ - Step 1929: {'lr': 0.000482, 'samples': 370368, 'steps': 1928, 'loss/train': 2.794508457183838} 11/06/2021 21:32:36 - INFO - __main__ - Step 1930: {'lr': 0.00048225000000000004, 'samples': 370560, 'steps': 1929, 'loss/train': 3.3702750205993652} 11/06/2021 21:32:36 - INFO - __main__ - Step 1931: {'lr': 0.0004825, 'samples': 370752, 'steps': 1930, 'loss/train': 3.3786303997039795} 11/06/2021 21:32:36 - INFO - __main__ - Step 1932: {'lr': 0.00048275, 'samples': 370944, 'steps': 1931, 'loss/train': 2.7347774505615234} 11/06/2021 21:32:37 - INFO - __main__ - Step 1933: {'lr': 0.000483, 'samples': 371136, 'steps': 1932, 'loss/train': 2.8218722343444824} 11/06/2021 21:32:38 - INFO - __main__ - Step 1934: {'lr': 0.00048325, 'samples': 371328, 'steps': 1933, 'loss/train': 1.854628324508667} 11/06/2021 21:32:38 - INFO - __main__ - Step 1935: {'lr': 0.0004835, 'samples': 371520, 'steps': 1934, 'loss/train': 2.730201244354248} 11/06/2021 21:32:38 - INFO - __main__ - Step 1936: {'lr': 0.00048375, 'samples': 371712, 'steps': 1935, 'loss/train': 2.558645725250244} 11/06/2021 21:32:39 - INFO - __main__ - Step 1937: {'lr': 0.000484, 'samples': 371904, 'steps': 1936, 'loss/train': 2.328354597091675} 11/06/2021 21:32:40 - INFO - __main__ - Step 1938: {'lr': 0.00048425000000000003, 'samples': 372096, 'steps': 1937, 'loss/train': 2.217445135116577} 11/06/2021 21:32:40 - INFO - __main__ - Step 1939: {'lr': 0.0004845, 'samples': 372288, 'steps': 1938, 'loss/train': 2.765631675720215} 11/06/2021 21:32:41 - INFO - __main__ - Step 1940: {'lr': 0.00048475000000000005, 'samples': 372480, 'steps': 1939, 'loss/train': 2.3212947845458984} 11/06/2021 21:32:41 - INFO - __main__ - Step 1941: {'lr': 0.00048499999999999997, 'samples': 372672, 'steps': 1940, 'loss/train': 2.563457727432251} 11/06/2021 21:32:41 - INFO - __main__ - Step 1942: {'lr': 0.00048525, 'samples': 372864, 'steps': 1941, 'loss/train': 2.725562334060669} 11/06/2021 21:32:42 - INFO - __main__ - Step 1943: {'lr': 0.0004855, 'samples': 373056, 'steps': 1942, 'loss/train': 2.3403518199920654} 11/06/2021 21:32:43 - INFO - __main__ - Step 1944: {'lr': 0.00048575, 'samples': 373248, 'steps': 1943, 'loss/train': 2.5952484607696533} 11/06/2021 21:32:43 - INFO - __main__ - Step 1945: {'lr': 0.000486, 'samples': 373440, 'steps': 1944, 'loss/train': 3.019125461578369} 11/06/2021 21:32:43 - INFO - __main__ - Step 1946: {'lr': 0.00048625000000000003, 'samples': 373632, 'steps': 1945, 'loss/train': 2.831516981124878} 11/06/2021 21:32:44 - INFO - __main__ - Step 1947: {'lr': 0.0004865, 'samples': 373824, 'steps': 1946, 'loss/train': 2.3622794151306152} 11/06/2021 21:32:45 - INFO - __main__ - Step 1948: {'lr': 0.00048675000000000004, 'samples': 374016, 'steps': 1947, 'loss/train': 2.709298849105835} 11/06/2021 21:32:45 - INFO - __main__ - Step 1949: {'lr': 0.000487, 'samples': 374208, 'steps': 1948, 'loss/train': 2.702164888381958} 11/06/2021 21:32:45 - INFO - __main__ - Step 1950: {'lr': 0.00048725000000000005, 'samples': 374400, 'steps': 1949, 'loss/train': 2.7699882984161377} 11/06/2021 21:32:46 - INFO - __main__ - Step 1951: {'lr': 0.0004875, 'samples': 374592, 'steps': 1950, 'loss/train': 3.081878900527954} 11/06/2021 21:32:46 - INFO - __main__ - Step 1952: {'lr': 0.00048775, 'samples': 374784, 'steps': 1951, 'loss/train': 2.322448492050171} 11/06/2021 21:32:47 - INFO - __main__ - Step 1953: {'lr': 0.000488, 'samples': 374976, 'steps': 1952, 'loss/train': 3.033129930496216} 11/06/2021 21:32:48 - INFO - __main__ - Step 1954: {'lr': 0.00048825, 'samples': 375168, 'steps': 1953, 'loss/train': 3.0832743644714355} 11/06/2021 21:32:48 - INFO - __main__ - Step 1955: {'lr': 0.0004885, 'samples': 375360, 'steps': 1954, 'loss/train': 2.644732713699341} 11/06/2021 21:32:48 - INFO - __main__ - Step 1956: {'lr': 0.00048875, 'samples': 375552, 'steps': 1955, 'loss/train': 2.585529327392578} 11/06/2021 21:32:49 - INFO - __main__ - Step 1957: {'lr': 0.000489, 'samples': 375744, 'steps': 1956, 'loss/train': 2.618997812271118} 11/06/2021 21:32:49 - INFO - __main__ - Step 1958: {'lr': 0.00048925, 'samples': 375936, 'steps': 1957, 'loss/train': 2.3187615871429443} 11/06/2021 21:32:50 - INFO - __main__ - Step 1959: {'lr': 0.0004895, 'samples': 376128, 'steps': 1958, 'loss/train': 0.9907362461090088} 11/06/2021 21:32:50 - INFO - __main__ - Step 1960: {'lr': 0.0004897500000000001, 'samples': 376320, 'steps': 1959, 'loss/train': 2.583451986312866} 11/06/2021 21:32:51 - INFO - __main__ - Step 1961: {'lr': 0.00049, 'samples': 376512, 'steps': 1960, 'loss/train': 2.766582489013672} 11/06/2021 21:32:51 - INFO - __main__ - Step 1962: {'lr': 0.00049025, 'samples': 376704, 'steps': 1961, 'loss/train': 2.8893637657165527} 11/06/2021 21:32:51 - INFO - __main__ - Step 1963: {'lr': 0.0004905, 'samples': 376896, 'steps': 1962, 'loss/train': 2.1860101222991943} 11/06/2021 21:32:53 - INFO - __main__ - Step 1964: {'lr': 0.0004907500000000001, 'samples': 377088, 'steps': 1963, 'loss/train': 2.513594150543213} 11/06/2021 21:32:53 - INFO - __main__ - Step 1965: {'lr': 0.000491, 'samples': 377280, 'steps': 1964, 'loss/train': 1.6197631359100342} 11/06/2021 21:32:53 - INFO - __main__ - Step 1966: {'lr': 0.00049125, 'samples': 377472, 'steps': 1965, 'loss/train': 2.5828354358673096} 11/06/2021 21:32:54 - INFO - __main__ - Step 1967: {'lr': 0.0004915, 'samples': 377664, 'steps': 1966, 'loss/train': 3.0170748233795166} 11/06/2021 21:32:54 - INFO - __main__ - Step 1968: {'lr': 0.00049175, 'samples': 377856, 'steps': 1967, 'loss/train': 1.9359134435653687} 11/06/2021 21:32:55 - INFO - __main__ - Step 1969: {'lr': 0.000492, 'samples': 378048, 'steps': 1968, 'loss/train': 2.57096266746521} 11/06/2021 21:32:55 - INFO - __main__ - Step 1970: {'lr': 0.0004922500000000001, 'samples': 378240, 'steps': 1969, 'loss/train': 2.656981945037842} 11/06/2021 21:32:56 - INFO - __main__ - Step 1971: {'lr': 0.0004925, 'samples': 378432, 'steps': 1970, 'loss/train': 2.5854079723358154} 11/06/2021 21:32:56 - INFO - __main__ - Step 1972: {'lr': 0.00049275, 'samples': 378624, 'steps': 1971, 'loss/train': 2.834716796875} 11/06/2021 21:32:56 - INFO - __main__ - Step 1973: {'lr': 0.0004930000000000001, 'samples': 378816, 'steps': 1972, 'loss/train': 2.5970420837402344} 11/06/2021 21:32:57 - INFO - __main__ - Step 1974: {'lr': 0.00049325, 'samples': 379008, 'steps': 1973, 'loss/train': 2.524550676345825} 11/06/2021 21:32:58 - INFO - __main__ - Step 1975: {'lr': 0.0004935, 'samples': 379200, 'steps': 1974, 'loss/train': 2.7941884994506836} 11/06/2021 21:32:58 - INFO - __main__ - Step 1976: {'lr': 0.00049375, 'samples': 379392, 'steps': 1975, 'loss/train': 2.233511447906494} 11/06/2021 21:32:58 - INFO - __main__ - Step 1977: {'lr': 0.000494, 'samples': 379584, 'steps': 1976, 'loss/train': 2.7673490047454834} 11/06/2021 21:32:59 - INFO - __main__ - Step 1978: {'lr': 0.00049425, 'samples': 379776, 'steps': 1977, 'loss/train': 2.7276995182037354} 11/06/2021 21:32:59 - INFO - __main__ - Step 1979: {'lr': 0.0004945, 'samples': 379968, 'steps': 1978, 'loss/train': 3.163912534713745} 11/06/2021 21:33:00 - INFO - __main__ - Step 1980: {'lr': 0.0004947500000000001, 'samples': 380160, 'steps': 1979, 'loss/train': 2.734490394592285} 11/06/2021 21:33:00 - INFO - __main__ - Step 1981: {'lr': 0.000495, 'samples': 380352, 'steps': 1980, 'loss/train': 2.695145845413208} 11/06/2021 21:33:01 - INFO - __main__ - Step 1982: {'lr': 0.00049525, 'samples': 380544, 'steps': 1981, 'loss/train': 2.7770211696624756} 11/06/2021 21:33:01 - INFO - __main__ - Step 1983: {'lr': 0.0004955, 'samples': 380736, 'steps': 1982, 'loss/train': 2.7686867713928223} 11/06/2021 21:33:02 - INFO - __main__ - Step 1984: {'lr': 0.00049575, 'samples': 380928, 'steps': 1983, 'loss/train': 2.8708817958831787} 11/06/2021 21:33:03 - INFO - __main__ - Step 1985: {'lr': 0.000496, 'samples': 381120, 'steps': 1984, 'loss/train': 2.6449930667877197} 11/06/2021 21:33:03 - INFO - __main__ - Step 1986: {'lr': 0.0004962500000000001, 'samples': 381312, 'steps': 1985, 'loss/train': 2.249175786972046} 11/06/2021 21:33:03 - INFO - __main__ - Step 1987: {'lr': 0.0004965, 'samples': 381504, 'steps': 1986, 'loss/train': 2.399980068206787} 11/06/2021 21:33:04 - INFO - __main__ - Step 1988: {'lr': 0.00049675, 'samples': 381696, 'steps': 1987, 'loss/train': 2.932748556137085} 11/06/2021 21:33:04 - INFO - __main__ - Step 1989: {'lr': 0.000497, 'samples': 381888, 'steps': 1988, 'loss/train': 2.3569090366363525} 11/06/2021 21:33:05 - INFO - __main__ - Step 1990: {'lr': 0.0004972500000000001, 'samples': 382080, 'steps': 1989, 'loss/train': 2.4476397037506104} 11/06/2021 21:33:05 - INFO - __main__ - Step 1991: {'lr': 0.0004975, 'samples': 382272, 'steps': 1990, 'loss/train': 3.7033238410949707} 11/06/2021 21:33:06 - INFO - __main__ - Step 1992: {'lr': 0.00049775, 'samples': 382464, 'steps': 1991, 'loss/train': 2.872666835784912} 11/06/2021 21:33:06 - INFO - __main__ - Step 1993: {'lr': 0.000498, 'samples': 382656, 'steps': 1992, 'loss/train': 2.463730573654175} 11/06/2021 21:33:06 - INFO - __main__ - Step 1994: {'lr': 0.00049825, 'samples': 382848, 'steps': 1993, 'loss/train': 2.2862772941589355} 11/06/2021 21:33:07 - INFO - __main__ - Step 1995: {'lr': 0.0004985, 'samples': 383040, 'steps': 1994, 'loss/train': 2.9158213138580322} 11/06/2021 21:33:08 - INFO - __main__ - Step 1996: {'lr': 0.0004987500000000001, 'samples': 383232, 'steps': 1995, 'loss/train': 2.440356731414795} 11/06/2021 21:33:08 - INFO - __main__ - Step 1997: {'lr': 0.000499, 'samples': 383424, 'steps': 1996, 'loss/train': 2.8165547847747803} 11/06/2021 21:33:08 - INFO - __main__ - Step 1998: {'lr': 0.00049925, 'samples': 383616, 'steps': 1997, 'loss/train': 2.460160970687866} 11/06/2021 21:33:09 - INFO - __main__ - Step 1999: {'lr': 0.0004995, 'samples': 383808, 'steps': 1998, 'loss/train': 2.8963072299957275} 11/06/2021 21:33:10 - INFO - __main__ - Step 2000: {'lr': 0.0004997500000000001, 'samples': 384000, 'steps': 1999, 'loss/train': 2.667146921157837} 11/06/2021 21:33:10 - INFO - __main__ - Step 2001: {'lr': 0.0005, 'samples': 384192, 'steps': 2000, 'loss/train': 1.9915542602539062} 11/06/2021 21:33:10 - INFO - __main__ - Step 2002: {'lr': 0.0004999999999436769, 'samples': 384384, 'steps': 2001, 'loss/train': 2.369316816329956} 11/06/2021 21:33:11 - INFO - __main__ - Step 2003: {'lr': 0.0004999999997747077, 'samples': 384576, 'steps': 2002, 'loss/train': 2.117219924926758} 11/06/2021 21:33:11 - INFO - __main__ - Step 2004: {'lr': 0.0004999999994930923, 'samples': 384768, 'steps': 2003, 'loss/train': 3.217236280441284} 11/06/2021 21:33:12 - INFO - __main__ - Step 2005: {'lr': 0.0004999999990988309, 'samples': 384960, 'steps': 2004, 'loss/train': 2.4494330883026123} 11/06/2021 21:33:13 - INFO - __main__ - Step 2006: {'lr': 0.0004999999985919232, 'samples': 385152, 'steps': 2005, 'loss/train': 2.7397634983062744} 11/06/2021 21:33:13 - INFO - __main__ - Step 2007: {'lr': 0.0004999999979723695, 'samples': 385344, 'steps': 2006, 'loss/train': 1.72874915599823} 11/06/2021 21:33:13 - INFO - __main__ - Step 2008: {'lr': 0.0004999999972401696, 'samples': 385536, 'steps': 2007, 'loss/train': 2.854842185974121} 11/06/2021 21:33:14 - INFO - __main__ - Step 2009: {'lr': 0.0004999999963953234, 'samples': 385728, 'steps': 2008, 'loss/train': 2.8656506538391113} 11/06/2021 21:33:14 - INFO - __main__ - Step 2010: {'lr': 0.0004999999954378312, 'samples': 385920, 'steps': 2009, 'loss/train': 2.108156681060791} 11/06/2021 21:33:15 - INFO - __main__ - Step 2011: {'lr': 0.000499999994367693, 'samples': 386112, 'steps': 2010, 'loss/train': 2.626335382461548} 11/06/2021 21:33:15 - INFO - __main__ - Step 2012: {'lr': 0.0004999999931849084, 'samples': 386304, 'steps': 2011, 'loss/train': 2.684321403503418} 11/06/2021 21:33:16 - INFO - __main__ - Step 2013: {'lr': 0.0004999999918894778, 'samples': 386496, 'steps': 2012, 'loss/train': 2.6967201232910156} 11/06/2021 21:33:16 - INFO - __main__ - Step 2014: {'lr': 0.000499999990481401, 'samples': 386688, 'steps': 2013, 'loss/train': 2.6608567237854004} 11/06/2021 21:33:16 - INFO - __main__ - Step 2015: {'lr': 0.0004999999889606781, 'samples': 386880, 'steps': 2014, 'loss/train': 2.773587942123413} 11/06/2021 21:33:17 - INFO - __main__ - Step 2016: {'lr': 0.0004999999873273091, 'samples': 387072, 'steps': 2015, 'loss/train': 2.5396175384521484} 11/06/2021 21:33:18 - INFO - __main__ - Step 2017: {'lr': 0.000499999985581294, 'samples': 387264, 'steps': 2016, 'loss/train': 2.9286653995513916} 11/06/2021 21:33:18 - INFO - __main__ - Step 2018: {'lr': 0.0004999999837226326, 'samples': 387456, 'steps': 2017, 'loss/train': 2.4174294471740723} 11/06/2021 21:33:18 - INFO - __main__ - Step 2019: {'lr': 0.0004999999817513252, 'samples': 387648, 'steps': 2018, 'loss/train': 2.4650332927703857} 11/06/2021 21:33:19 - INFO - __main__ - Step 2020: {'lr': 0.0004999999796673716, 'samples': 387840, 'steps': 2019, 'loss/train': 2.799029588699341} 11/06/2021 21:33:20 - INFO - __main__ - Step 2021: {'lr': 0.0004999999774707719, 'samples': 388032, 'steps': 2020, 'loss/train': 2.8545684814453125} 11/06/2021 21:33:20 - INFO - __main__ - Step 2022: {'lr': 0.0004999999751615261, 'samples': 388224, 'steps': 2021, 'loss/train': 2.4763200283050537} 11/06/2021 21:33:20 - INFO - __main__ - Step 2023: {'lr': 0.0004999999727396341, 'samples': 388416, 'steps': 2022, 'loss/train': 2.094083309173584} 11/06/2021 21:33:21 - INFO - __main__ - Step 2024: {'lr': 0.0004999999702050959, 'samples': 388608, 'steps': 2023, 'loss/train': 2.4490573406219482} 11/06/2021 21:33:21 - INFO - __main__ - Step 2025: {'lr': 0.0004999999675579118, 'samples': 388800, 'steps': 2024, 'loss/train': 3.2666213512420654} 11/06/2021 21:33:22 - INFO - __main__ - Step 2026: {'lr': 0.0004999999647980814, 'samples': 388992, 'steps': 2025, 'loss/train': 2.2224512100219727} 11/06/2021 21:33:23 - INFO - __main__ - Step 2027: {'lr': 0.0004999999619256049, 'samples': 389184, 'steps': 2026, 'loss/train': 3.0559303760528564} 11/06/2021 21:33:23 - INFO - __main__ - Step 2028: {'lr': 0.0004999999589404822, 'samples': 389376, 'steps': 2027, 'loss/train': 2.4753761291503906} 11/06/2021 21:33:23 - INFO - __main__ - Step 2029: {'lr': 0.0004999999558427136, 'samples': 389568, 'steps': 2028, 'loss/train': 2.350665807723999} 11/06/2021 21:33:24 - INFO - __main__ - Step 2030: {'lr': 0.0004999999526322987, 'samples': 389760, 'steps': 2029, 'loss/train': 2.9428741931915283} 11/06/2021 21:33:24 - INFO - __main__ - Step 2031: {'lr': 0.0004999999493092377, 'samples': 389952, 'steps': 2030, 'loss/train': 2.561460018157959} 11/06/2021 21:33:25 - INFO - __main__ - Step 2032: {'lr': 0.0004999999458735306, 'samples': 390144, 'steps': 2031, 'loss/train': 2.2386529445648193} 11/06/2021 21:33:26 - INFO - __main__ - Step 2033: {'lr': 0.0004999999423251774, 'samples': 390336, 'steps': 2032, 'loss/train': 2.3020198345184326} 11/06/2021 21:33:26 - INFO - __main__ - Step 2034: {'lr': 0.0004999999386641781, 'samples': 390528, 'steps': 2033, 'loss/train': 2.7840397357940674} 11/06/2021 21:33:26 - INFO - __main__ - Step 2035: {'lr': 0.0004999999348905326, 'samples': 390720, 'steps': 2034, 'loss/train': 2.2460341453552246} 11/06/2021 21:33:27 - INFO - __main__ - Step 2036: {'lr': 0.000499999931004241, 'samples': 390912, 'steps': 2035, 'loss/train': 2.3942503929138184} 11/06/2021 21:33:28 - INFO - __main__ - Step 2037: {'lr': 0.0004999999270053034, 'samples': 391104, 'steps': 2036, 'loss/train': 3.297150135040283} 11/06/2021 21:33:28 - INFO - __main__ - Step 2038: {'lr': 0.0004999999228937196, 'samples': 391296, 'steps': 2037, 'loss/train': 2.8241195678710938} 11/06/2021 21:33:28 - INFO - __main__ - Step 2039: {'lr': 0.0004999999186694897, 'samples': 391488, 'steps': 2038, 'loss/train': 2.7766733169555664} 11/06/2021 21:33:29 - INFO - __main__ - Step 2040: {'lr': 0.0004999999143326137, 'samples': 391680, 'steps': 2039, 'loss/train': 2.7513749599456787} 11/06/2021 21:33:29 - INFO - __main__ - Step 2041: {'lr': 0.0004999999098830916, 'samples': 391872, 'steps': 2040, 'loss/train': 2.4639899730682373} 11/06/2021 21:33:30 - INFO - __main__ - Step 2042: {'lr': 0.0004999999053209235, 'samples': 392064, 'steps': 2041, 'loss/train': 2.83908748626709} 11/06/2021 21:33:30 - INFO - __main__ - Step 2043: {'lr': 0.0004999999006461091, 'samples': 392256, 'steps': 2042, 'loss/train': 2.7250239849090576} 11/06/2021 21:33:31 - INFO - __main__ - Step 2044: {'lr': 0.0004999998958586487, 'samples': 392448, 'steps': 2043, 'loss/train': 2.2710461616516113} 11/06/2021 21:33:31 - INFO - __main__ - Step 2045: {'lr': 0.0004999998909585423, 'samples': 392640, 'steps': 2044, 'loss/train': 2.9703567028045654} 11/06/2021 21:33:31 - INFO - __main__ - Step 2046: {'lr': 0.0004999998859457896, 'samples': 392832, 'steps': 2045, 'loss/train': 1.9827837944030762} 11/06/2021 21:33:33 - INFO - __main__ - Step 2047: {'lr': 0.0004999998808203909, 'samples': 393024, 'steps': 2046, 'loss/train': 2.4545347690582275} 11/06/2021 21:33:33 - INFO - __main__ - Step 2048: {'lr': 0.0004999998755823462, 'samples': 393216, 'steps': 2047, 'loss/train': 2.7062439918518066} 11/06/2021 21:33:33 - INFO - __main__ - Step 2049: {'lr': 0.0004999998702316553, 'samples': 393408, 'steps': 2048, 'loss/train': 2.6722590923309326} 11/06/2021 21:33:34 - INFO - __main__ - Step 2050: {'lr': 0.0004999998647683184, 'samples': 393600, 'steps': 2049, 'loss/train': 2.4513065814971924} 11/06/2021 21:33:34 - INFO - __main__ - Step 2051: {'lr': 0.0004999998591923353, 'samples': 393792, 'steps': 2050, 'loss/train': 2.691094398498535} 11/06/2021 21:33:34 - INFO - __main__ - Step 2052: {'lr': 0.0004999998535037063, 'samples': 393984, 'steps': 2051, 'loss/train': 2.209742307662964} 11/06/2021 21:33:35 - INFO - __main__ - Step 2053: {'lr': 0.0004999998477024311, 'samples': 394176, 'steps': 2052, 'loss/train': 2.50882887840271} 11/06/2021 21:33:36 - INFO - __main__ - Step 2054: {'lr': 0.0004999998417885099, 'samples': 394368, 'steps': 2053, 'loss/train': 2.435020685195923} 11/06/2021 21:33:36 - INFO - __main__ - Step 2055: {'lr': 0.0004999998357619425, 'samples': 394560, 'steps': 2054, 'loss/train': 2.0775961875915527} 11/06/2021 21:33:36 - INFO - __main__ - Step 2056: {'lr': 0.0004999998296227291, 'samples': 394752, 'steps': 2055, 'loss/train': 2.863607168197632} 11/06/2021 21:33:37 - INFO - __main__ - Step 2057: {'lr': 0.0004999998233708697, 'samples': 394944, 'steps': 2056, 'loss/train': 2.371857166290283} 11/06/2021 21:33:38 - INFO - __main__ - Step 2058: {'lr': 0.0004999998170063642, 'samples': 395136, 'steps': 2057, 'loss/train': 2.9623489379882812} 11/06/2021 21:33:38 - INFO - __main__ - Step 2059: {'lr': 0.0004999998105292126, 'samples': 395328, 'steps': 2058, 'loss/train': 2.637700319290161} 11/06/2021 21:33:39 - INFO - __main__ - Step 2060: {'lr': 0.000499999803939415, 'samples': 395520, 'steps': 2059, 'loss/train': 2.8465237617492676} 11/06/2021 21:33:39 - INFO - __main__ - Step 2061: {'lr': 0.0004999997972369713, 'samples': 395712, 'steps': 2060, 'loss/train': 2.0144639015197754} 11/06/2021 21:33:39 - INFO - __main__ - Step 2062: {'lr': 0.0004999997904218816, 'samples': 395904, 'steps': 2061, 'loss/train': 2.385756254196167} 11/06/2021 21:33:40 - INFO - __main__ - Step 2063: {'lr': 0.0004999997834941459, 'samples': 396096, 'steps': 2062, 'loss/train': 3.102501153945923} 11/06/2021 21:33:41 - INFO - __main__ - Step 2064: {'lr': 0.000499999776453764, 'samples': 396288, 'steps': 2063, 'loss/train': 1.95815908908844} 11/06/2021 21:33:41 - INFO - __main__ - Step 2065: {'lr': 0.0004999997693007361, 'samples': 396480, 'steps': 2064, 'loss/train': 2.9032347202301025} 11/06/2021 21:33:41 - INFO - __main__ - Step 2066: {'lr': 0.0004999997620350622, 'samples': 396672, 'steps': 2065, 'loss/train': 2.321173906326294} 11/06/2021 21:33:42 - INFO - __main__ - Step 2067: {'lr': 0.0004999997546567423, 'samples': 396864, 'steps': 2066, 'loss/train': 2.925659418106079} 11/06/2021 21:33:42 - INFO - __main__ - Step 2068: {'lr': 0.0004999997471657763, 'samples': 397056, 'steps': 2067, 'loss/train': 2.795663356781006} 11/06/2021 21:33:43 - INFO - __main__ - Step 2069: {'lr': 0.0004999997395621642, 'samples': 397248, 'steps': 2068, 'loss/train': 2.198864698410034} 11/06/2021 21:33:43 - INFO - __main__ - Step 2070: {'lr': 0.0004999997318459064, 'samples': 397440, 'steps': 2069, 'loss/train': 2.6380443572998047} 11/06/2021 21:33:44 - INFO - __main__ - Step 2071: {'lr': 0.0004999997240170023, 'samples': 397632, 'steps': 2070, 'loss/train': 1.3615984916687012} 11/06/2021 21:33:44 - INFO - __main__ - Step 2072: {'lr': 0.0004999997160754522, 'samples': 397824, 'steps': 2071, 'loss/train': 2.0112996101379395} 11/06/2021 21:33:45 - INFO - __main__ - Step 2073: {'lr': 0.0004999997080212561, 'samples': 398016, 'steps': 2072, 'loss/train': 2.477386236190796} 11/06/2021 21:33:46 - INFO - __main__ - Step 2074: {'lr': 0.000499999699854414, 'samples': 398208, 'steps': 2073, 'loss/train': 3.2133283615112305} 11/06/2021 21:33:46 - INFO - __main__ - Step 2075: {'lr': 0.0004999996915749259, 'samples': 398400, 'steps': 2074, 'loss/train': 2.5600574016571045} 11/06/2021 21:33:46 - INFO - __main__ - Step 2076: {'lr': 0.0004999996831827918, 'samples': 398592, 'steps': 2075, 'loss/train': 2.0490965843200684} 11/06/2021 21:33:47 - INFO - __main__ - Step 2077: {'lr': 0.0004999996746780117, 'samples': 398784, 'steps': 2076, 'loss/train': 2.4900460243225098} 11/06/2021 21:33:47 - INFO - __main__ - Step 2078: {'lr': 0.0004999996660605856, 'samples': 398976, 'steps': 2077, 'loss/train': 2.7026772499084473} 11/06/2021 21:33:47 - INFO - __main__ - Step 2079: {'lr': 0.0004999996573305135, 'samples': 399168, 'steps': 2078, 'loss/train': 1.7970755100250244} 11/06/2021 21:33:48 - INFO - __main__ - Step 2080: {'lr': 0.0004999996484877955, 'samples': 399360, 'steps': 2079, 'loss/train': 2.834831953048706} 11/06/2021 21:33:49 - INFO - __main__ - Step 2081: {'lr': 0.0004999996395324313, 'samples': 399552, 'steps': 2080, 'loss/train': 2.1271467208862305} 11/06/2021 21:33:49 - INFO - __main__ - Step 2082: {'lr': 0.0004999996304644213, 'samples': 399744, 'steps': 2081, 'loss/train': 2.7261979579925537} 11/06/2021 21:33:49 - INFO - __main__ - Step 2083: {'lr': 0.0004999996212837653, 'samples': 399936, 'steps': 2082, 'loss/train': 2.50911021232605} 11/06/2021 21:33:50 - INFO - __main__ - Step 2084: {'lr': 0.0004999996119904633, 'samples': 400128, 'steps': 2083, 'loss/train': 2.5453062057495117} 11/06/2021 21:33:51 - INFO - __main__ - Step 2085: {'lr': 0.0004999996025845154, 'samples': 400320, 'steps': 2084, 'loss/train': 2.562901496887207} 11/06/2021 21:33:51 - INFO - __main__ - Step 2086: {'lr': 0.0004999995930659215, 'samples': 400512, 'steps': 2085, 'loss/train': 2.8472900390625} 11/06/2021 21:33:51 - INFO - __main__ - Step 2087: {'lr': 0.0004999995834346815, 'samples': 400704, 'steps': 2086, 'loss/train': 2.197766065597534} 11/06/2021 21:33:52 - INFO - __main__ - Step 2088: {'lr': 0.0004999995736907957, 'samples': 400896, 'steps': 2087, 'loss/train': 2.4093174934387207} 11/06/2021 21:33:52 - INFO - __main__ - Step 2089: {'lr': 0.000499999563834264, 'samples': 401088, 'steps': 2088, 'loss/train': 2.8441734313964844} 11/06/2021 21:33:53 - INFO - __main__ - Step 2090: {'lr': 0.0004999995538650862, 'samples': 401280, 'steps': 2089, 'loss/train': 2.863389492034912} 11/06/2021 21:33:53 - INFO - __main__ - Step 2091: {'lr': 0.0004999995437832626, 'samples': 401472, 'steps': 2090, 'loss/train': 2.4036967754364014} 11/06/2021 21:33:54 - INFO - __main__ - Step 2092: {'lr': 0.0004999995335887929, 'samples': 401664, 'steps': 2091, 'loss/train': 2.5592567920684814} 11/06/2021 21:33:54 - INFO - __main__ - Step 2093: {'lr': 0.0004999995232816774, 'samples': 401856, 'steps': 2092, 'loss/train': 2.0404558181762695} 11/06/2021 21:33:54 - INFO - __main__ - Step 2094: {'lr': 0.000499999512861916, 'samples': 402048, 'steps': 2093, 'loss/train': 2.452481985092163} 11/06/2021 21:33:56 - INFO - __main__ - Step 2095: {'lr': 0.0004999995023295086, 'samples': 402240, 'steps': 2094, 'loss/train': 2.8676137924194336} 11/06/2021 21:33:56 - INFO - __main__ - Step 2096: {'lr': 0.0004999994916844552, 'samples': 402432, 'steps': 2095, 'loss/train': 2.586151599884033} 11/06/2021 21:33:56 - INFO - __main__ - Step 2097: {'lr': 0.0004999994809267561, 'samples': 402624, 'steps': 2096, 'loss/train': 2.180725574493408} 11/06/2021 21:33:57 - INFO - __main__ - Step 2098: {'lr': 0.0004999994700564109, 'samples': 402816, 'steps': 2097, 'loss/train': 2.6748390197753906} 11/06/2021 21:33:57 - INFO - __main__ - Step 2099: {'lr': 0.0004999994590734199, 'samples': 403008, 'steps': 2098, 'loss/train': 2.2900843620300293} 11/06/2021 21:33:58 - INFO - __main__ - Step 2100: {'lr': 0.000499999447977783, 'samples': 403200, 'steps': 2099, 'loss/train': 2.7498953342437744} 11/06/2021 21:33:58 - INFO - __main__ - Step 2101: {'lr': 0.0004999994367695001, 'samples': 403392, 'steps': 2100, 'loss/train': 2.9146597385406494} 11/06/2021 21:33:59 - INFO - __main__ - Step 2102: {'lr': 0.0004999994254485714, 'samples': 403584, 'steps': 2101, 'loss/train': 2.650158166885376} 11/06/2021 21:33:59 - INFO - __main__ - Step 2103: {'lr': 0.0004999994140149969, 'samples': 403776, 'steps': 2102, 'loss/train': 2.6396777629852295} 11/06/2021 21:33:59 - INFO - __main__ - Step 2104: {'lr': 0.0004999994024687764, 'samples': 403968, 'steps': 2103, 'loss/train': 2.664414644241333} 11/06/2021 21:34:00 - INFO - __main__ - Step 2105: {'lr': 0.00049999939080991, 'samples': 404160, 'steps': 2104, 'loss/train': 1.7636381387710571} 11/06/2021 21:34:01 - INFO - __main__ - Step 2106: {'lr': 0.0004999993790383978, 'samples': 404352, 'steps': 2105, 'loss/train': 2.3601715564727783} 11/06/2021 21:34:01 - INFO - __main__ - Step 2107: {'lr': 0.0004999993671542397, 'samples': 404544, 'steps': 2106, 'loss/train': 1.9566963911056519} 11/06/2021 21:34:01 - INFO - __main__ - Step 2108: {'lr': 0.0004999993551574358, 'samples': 404736, 'steps': 2107, 'loss/train': 1.9892882108688354} 11/06/2021 21:34:02 - INFO - __main__ - Step 2109: {'lr': 0.000499999343047986, 'samples': 404928, 'steps': 2108, 'loss/train': 2.693075180053711} 11/06/2021 21:34:02 - INFO - __main__ - Step 2110: {'lr': 0.0004999993308258904, 'samples': 405120, 'steps': 2109, 'loss/train': 2.1097216606140137} 11/06/2021 21:34:03 - INFO - __main__ - Step 2111: {'lr': 0.0004999993184911489, 'samples': 405312, 'steps': 2110, 'loss/train': 2.149388074874878} 11/06/2021 21:34:04 - INFO - __main__ - Step 2112: {'lr': 0.0004999993060437616, 'samples': 405504, 'steps': 2111, 'loss/train': 1.7287077903747559} 11/06/2021 21:34:04 - INFO - __main__ - Step 2113: {'lr': 0.0004999992934837284, 'samples': 405696, 'steps': 2112, 'loss/train': 2.66508150100708} 11/06/2021 21:34:04 - INFO - __main__ - Step 2114: {'lr': 0.0004999992808110495, 'samples': 405888, 'steps': 2113, 'loss/train': 2.4178295135498047} 11/06/2021 21:34:05 - INFO - __main__ - Step 2115: {'lr': 0.0004999992680257247, 'samples': 406080, 'steps': 2114, 'loss/train': 2.6689789295196533} 11/06/2021 21:34:06 - INFO - __main__ - Step 2116: {'lr': 0.0004999992551277541, 'samples': 406272, 'steps': 2115, 'loss/train': 2.902282953262329} 11/06/2021 21:34:06 - INFO - __main__ - Step 2117: {'lr': 0.0004999992421171377, 'samples': 406464, 'steps': 2116, 'loss/train': 2.81160306930542} 11/06/2021 21:34:06 - INFO - __main__ - Step 2118: {'lr': 0.0004999992289938755, 'samples': 406656, 'steps': 2117, 'loss/train': 2.694124698638916} 11/06/2021 21:34:07 - INFO - __main__ - Step 2119: {'lr': 0.0004999992157579676, 'samples': 406848, 'steps': 2118, 'loss/train': 3.20439076423645} 11/06/2021 21:34:07 - INFO - __main__ - Step 2120: {'lr': 0.0004999992024094138, 'samples': 407040, 'steps': 2119, 'loss/train': 2.9919791221618652} 11/06/2021 21:34:08 - INFO - __main__ - Step 2121: {'lr': 0.0004999991889482142, 'samples': 407232, 'steps': 2120, 'loss/train': 2.432868003845215} 11/06/2021 21:34:08 - INFO - __main__ - Step 2122: {'lr': 0.0004999991753743689, 'samples': 407424, 'steps': 2121, 'loss/train': 2.5452983379364014} 11/06/2021 21:34:09 - INFO - __main__ - Step 2123: {'lr': 0.0004999991616878777, 'samples': 407616, 'steps': 2122, 'loss/train': 2.8512609004974365} 11/06/2021 21:34:09 - INFO - __main__ - Step 2124: {'lr': 0.0004999991478887409, 'samples': 407808, 'steps': 2123, 'loss/train': 2.363013505935669} 11/06/2021 21:34:09 - INFO - __main__ - Step 2125: {'lr': 0.0004999991339769582, 'samples': 408000, 'steps': 2124, 'loss/train': 2.3409323692321777} 11/06/2021 21:34:10 - INFO - __main__ - Step 2126: {'lr': 0.0004999991199525299, 'samples': 408192, 'steps': 2125, 'loss/train': 2.357633113861084} 11/06/2021 21:34:11 - INFO - __main__ - Step 2127: {'lr': 0.0004999991058154557, 'samples': 408384, 'steps': 2126, 'loss/train': 2.454667091369629} 11/06/2021 21:34:11 - INFO - __main__ - Step 2128: {'lr': 0.0004999990915657359, 'samples': 408576, 'steps': 2127, 'loss/train': 1.9439713954925537} 11/06/2021 21:34:11 - INFO - __main__ - Step 2129: {'lr': 0.0004999990772033702, 'samples': 408768, 'steps': 2128, 'loss/train': 2.639913320541382} 11/06/2021 21:34:12 - INFO - __main__ - Step 2130: {'lr': 0.000499999062728359, 'samples': 408960, 'steps': 2129, 'loss/train': 2.727692127227783} 11/06/2021 21:34:12 - INFO - __main__ - Step 2131: {'lr': 0.0004999990481407018, 'samples': 409152, 'steps': 2130, 'loss/train': 2.4127743244171143} 11/06/2021 21:34:13 - INFO - __main__ - Step 2132: {'lr': 0.0004999990334403991, 'samples': 409344, 'steps': 2131, 'loss/train': 2.539153575897217} 11/06/2021 21:34:14 - INFO - __main__ - Step 2133: {'lr': 0.0004999990186274506, 'samples': 409536, 'steps': 2132, 'loss/train': 2.580580949783325} 11/06/2021 21:34:14 - INFO - __main__ - Step 2134: {'lr': 0.0004999990037018564, 'samples': 409728, 'steps': 2133, 'loss/train': 2.830368757247925} 11/06/2021 21:34:14 - INFO - __main__ - Step 2135: {'lr': 0.0004999989886636166, 'samples': 409920, 'steps': 2134, 'loss/train': 2.996830463409424} 11/06/2021 21:34:15 - INFO - __main__ - Step 2136: {'lr': 0.000499998973512731, 'samples': 410112, 'steps': 2135, 'loss/train': 2.449648141860962} 11/06/2021 21:34:16 - INFO - __main__ - Step 2137: {'lr': 0.0004999989582491998, 'samples': 410304, 'steps': 2136, 'loss/train': 2.1310882568359375} 11/06/2021 21:34:16 - INFO - __main__ - Step 2138: {'lr': 0.0004999989428730229, 'samples': 410496, 'steps': 2137, 'loss/train': 2.7264044284820557} 11/06/2021 21:34:17 - INFO - __main__ - Step 2139: {'lr': 0.0004999989273842003, 'samples': 410688, 'steps': 2138, 'loss/train': 2.348263740539551} 11/06/2021 21:34:17 - INFO - __main__ - Step 2140: {'lr': 0.0004999989117827321, 'samples': 410880, 'steps': 2139, 'loss/train': 2.6819005012512207} 11/06/2021 21:34:17 - INFO - __main__ - Step 2141: {'lr': 0.0004999988960686182, 'samples': 411072, 'steps': 2140, 'loss/train': 1.7351843118667603} 11/06/2021 21:34:18 - INFO - __main__ - Step 2142: {'lr': 0.0004999988802418587, 'samples': 411264, 'steps': 2141, 'loss/train': 2.5951883792877197} 11/06/2021 21:34:19 - INFO - __main__ - Step 2143: {'lr': 0.0004999988643024536, 'samples': 411456, 'steps': 2142, 'loss/train': 2.5091636180877686} 11/06/2021 21:34:19 - INFO - __main__ - Step 2144: {'lr': 0.0004999988482504027, 'samples': 411648, 'steps': 2143, 'loss/train': 2.721625804901123} 11/06/2021 21:34:19 - INFO - __main__ - Step 2145: {'lr': 0.0004999988320857063, 'samples': 411840, 'steps': 2144, 'loss/train': 2.778151750564575} 11/06/2021 21:34:20 - INFO - __main__ - Step 2146: {'lr': 0.0004999988158083643, 'samples': 412032, 'steps': 2145, 'loss/train': 2.607226848602295} 11/06/2021 21:34:20 - INFO - __main__ - Step 2147: {'lr': 0.0004999987994183766, 'samples': 412224, 'steps': 2146, 'loss/train': 2.325345277786255} 11/06/2021 21:34:21 - INFO - __main__ - Step 2148: {'lr': 0.0004999987829157434, 'samples': 412416, 'steps': 2147, 'loss/train': 2.363879442214966} 11/06/2021 21:34:21 - INFO - __main__ - Step 2149: {'lr': 0.0004999987663004646, 'samples': 412608, 'steps': 2148, 'loss/train': 2.3791654109954834} 11/06/2021 21:34:22 - INFO - __main__ - Step 2150: {'lr': 0.0004999987495725401, 'samples': 412800, 'steps': 2149, 'loss/train': 1.6959710121154785} 11/06/2021 21:34:22 - INFO - __main__ - Step 2151: {'lr': 0.0004999987327319701, 'samples': 412992, 'steps': 2150, 'loss/train': 2.9837472438812256} 11/06/2021 21:34:22 - INFO - __main__ - Step 2152: {'lr': 0.0004999987157787546, 'samples': 413184, 'steps': 2151, 'loss/train': 2.8918848037719727} 11/06/2021 21:34:24 - INFO - __main__ - Step 2153: {'lr': 0.0004999986987128934, 'samples': 413376, 'steps': 2152, 'loss/train': 2.0959744453430176} 11/06/2021 21:34:24 - INFO - __main__ - Step 2154: {'lr': 0.0004999986815343867, 'samples': 413568, 'steps': 2153, 'loss/train': 2.464357614517212} 11/06/2021 21:34:24 - INFO - __main__ - Step 2155: {'lr': 0.0004999986642432345, 'samples': 413760, 'steps': 2154, 'loss/train': 2.6230199337005615} 11/06/2021 21:34:25 - INFO - __main__ - Step 2156: {'lr': 0.0004999986468394367, 'samples': 413952, 'steps': 2155, 'loss/train': 2.50052547454834} 11/06/2021 21:34:25 - INFO - __main__ - Step 2157: {'lr': 0.0004999986293229934, 'samples': 414144, 'steps': 2156, 'loss/train': 2.743523359298706} 11/06/2021 21:34:26 - INFO - __main__ - Step 2158: {'lr': 0.0004999986116939045, 'samples': 414336, 'steps': 2157, 'loss/train': 2.9330990314483643} 11/06/2021 21:34:26 - INFO - __main__ - Step 2159: {'lr': 0.0004999985939521702, 'samples': 414528, 'steps': 2158, 'loss/train': 3.325244426727295} 11/06/2021 21:34:27 - INFO - __main__ - Step 2160: {'lr': 0.0004999985760977903, 'samples': 414720, 'steps': 2159, 'loss/train': 2.2128641605377197} 11/06/2021 21:34:27 - INFO - __main__ - Step 2161: {'lr': 0.000499998558130765, 'samples': 414912, 'steps': 2160, 'loss/train': 2.9361557960510254} 11/06/2021 21:34:27 - INFO - __main__ - Step 2162: {'lr': 0.0004999985400510941, 'samples': 415104, 'steps': 2161, 'loss/train': 2.5086746215820312} 11/06/2021 21:34:28 - INFO - __main__ - Step 2163: {'lr': 0.0004999985218587777, 'samples': 415296, 'steps': 2162, 'loss/train': 1.6962592601776123} 11/06/2021 21:34:29 - INFO - __main__ - Step 2164: {'lr': 0.0004999985035538159, 'samples': 415488, 'steps': 2163, 'loss/train': 2.615097761154175} 11/06/2021 21:34:29 - INFO - __main__ - Step 2165: {'lr': 0.0004999984851362086, 'samples': 415680, 'steps': 2164, 'loss/train': 2.722036600112915} 11/06/2021 21:34:29 - INFO - __main__ - Step 2166: {'lr': 0.0004999984666059559, 'samples': 415872, 'steps': 2165, 'loss/train': 2.560600996017456} 11/06/2021 21:34:30 - INFO - __main__ - Step 2167: {'lr': 0.0004999984479630577, 'samples': 416064, 'steps': 2166, 'loss/train': 2.6970345973968506} 11/06/2021 21:34:30 - INFO - __main__ - Step 2168: {'lr': 0.000499998429207514, 'samples': 416256, 'steps': 2167, 'loss/train': 2.9107210636138916} 11/06/2021 21:34:31 - INFO - __main__ - Step 2169: {'lr': 0.000499998410339325, 'samples': 416448, 'steps': 2168, 'loss/train': 2.580718994140625} 11/06/2021 21:34:32 - INFO - __main__ - Step 2170: {'lr': 0.0004999983913584904, 'samples': 416640, 'steps': 2169, 'loss/train': 2.8567304611206055} 11/06/2021 21:34:32 - INFO - __main__ - Step 2171: {'lr': 0.0004999983722650106, 'samples': 416832, 'steps': 2170, 'loss/train': 2.371682643890381} 11/06/2021 21:34:32 - INFO - __main__ - Step 2172: {'lr': 0.0004999983530588853, 'samples': 417024, 'steps': 2171, 'loss/train': 2.061142921447754} 11/06/2021 21:34:33 - INFO - __main__ - Step 2173: {'lr': 0.0004999983337401145, 'samples': 417216, 'steps': 2172, 'loss/train': 2.451244354248047} 11/06/2021 21:34:34 - INFO - __main__ - Step 2174: {'lr': 0.0004999983143086984, 'samples': 417408, 'steps': 2173, 'loss/train': 1.99120032787323} 11/06/2021 21:34:34 - INFO - __main__ - Step 2175: {'lr': 0.0004999982947646368, 'samples': 417600, 'steps': 2174, 'loss/train': 2.0224850177764893} 11/06/2021 21:34:34 - INFO - __main__ - Step 2176: {'lr': 0.00049999827510793, 'samples': 417792, 'steps': 2175, 'loss/train': 2.3628859519958496} 11/06/2021 21:34:35 - INFO - __main__ - Step 2177: {'lr': 0.0004999982553385778, 'samples': 417984, 'steps': 2176, 'loss/train': 2.5779266357421875} 11/06/2021 21:34:35 - INFO - __main__ - Step 2178: {'lr': 0.0004999982354565802, 'samples': 418176, 'steps': 2177, 'loss/train': 2.631287097930908} 11/06/2021 21:34:36 - INFO - __main__ - Step 2179: {'lr': 0.0004999982154619372, 'samples': 418368, 'steps': 2178, 'loss/train': 2.691127300262451} 11/06/2021 21:34:36 - INFO - __main__ - Step 2180: {'lr': 0.000499998195354649, 'samples': 418560, 'steps': 2179, 'loss/train': 2.6451637744903564} 11/06/2021 21:34:37 - INFO - __main__ - Step 2181: {'lr': 0.0004999981751347153, 'samples': 418752, 'steps': 2180, 'loss/train': 3.3670623302459717} 11/06/2021 21:34:37 - INFO - __main__ - Step 2182: {'lr': 0.0004999981548021364, 'samples': 418944, 'steps': 2181, 'loss/train': 2.2093632221221924} 11/06/2021 21:34:37 - INFO - __main__ - Step 2183: {'lr': 0.0004999981343569122, 'samples': 419136, 'steps': 2182, 'loss/train': 2.704352855682373} 11/06/2021 21:34:38 - INFO - __main__ - Step 2184: {'lr': 0.0004999981137990425, 'samples': 419328, 'steps': 2183, 'loss/train': 3.1171112060546875} 11/06/2021 21:34:39 - INFO - __main__ - Step 2185: {'lr': 0.0004999980931285278, 'samples': 419520, 'steps': 2184, 'loss/train': 2.5423855781555176} 11/06/2021 21:34:39 - INFO - __main__ - Step 2186: {'lr': 0.0004999980723453676, 'samples': 419712, 'steps': 2185, 'loss/train': 1.9052900075912476} 11/06/2021 21:34:40 - INFO - __main__ - Step 2187: {'lr': 0.0004999980514495623, 'samples': 419904, 'steps': 2186, 'loss/train': 2.7308359146118164} 11/06/2021 21:34:40 - INFO - __main__ - Step 2188: {'lr': 0.0004999980304411116, 'samples': 420096, 'steps': 2187, 'loss/train': 2.344801902770996} 11/06/2021 21:34:40 - INFO - __main__ - Step 2189: {'lr': 0.0004999980093200157, 'samples': 420288, 'steps': 2188, 'loss/train': 2.7158095836639404} 11/06/2021 21:34:41 - INFO - __main__ - Step 2190: {'lr': 0.0004999979880862745, 'samples': 420480, 'steps': 2189, 'loss/train': 1.4334872961044312} 11/06/2021 21:34:42 - INFO - __main__ - Step 2191: {'lr': 0.0004999979667398882, 'samples': 420672, 'steps': 2190, 'loss/train': 2.5495493412017822} 11/06/2021 21:34:42 - INFO - __main__ - Step 2192: {'lr': 0.0004999979452808565, 'samples': 420864, 'steps': 2191, 'loss/train': 2.356931447982788} 11/06/2021 21:34:42 - INFO - __main__ - Step 2193: {'lr': 0.0004999979237091796, 'samples': 421056, 'steps': 2192, 'loss/train': 2.551086902618408} 11/06/2021 21:34:43 - INFO - __main__ - Step 2194: {'lr': 0.0004999979020248577, 'samples': 421248, 'steps': 2193, 'loss/train': 2.4983091354370117} 11/06/2021 21:34:43 - INFO - __main__ - Step 2195: {'lr': 0.0004999978802278904, 'samples': 421440, 'steps': 2194, 'loss/train': 2.6009490489959717} 11/06/2021 21:34:44 - INFO - __main__ - Step 2196: {'lr': 0.000499997858318278, 'samples': 421632, 'steps': 2195, 'loss/train': 2.3032186031341553} 11/06/2021 21:34:45 - INFO - __main__ - Step 2197: {'lr': 0.0004999978362960204, 'samples': 421824, 'steps': 2196, 'loss/train': 2.835674285888672} 11/06/2021 21:34:45 - INFO - __main__ - Step 2198: {'lr': 0.0004999978141611176, 'samples': 422016, 'steps': 2197, 'loss/train': 3.615586042404175} 11/06/2021 21:34:45 - INFO - __main__ - Step 2199: {'lr': 0.0004999977919135696, 'samples': 422208, 'steps': 2198, 'loss/train': 1.8978424072265625} 11/06/2021 21:34:46 - INFO - __main__ - Step 2200: {'lr': 0.0004999977695533766, 'samples': 422400, 'steps': 2199, 'loss/train': 2.526134967803955} 11/06/2021 21:34:47 - INFO - __main__ - Step 2201: {'lr': 0.0004999977470805383, 'samples': 422592, 'steps': 2200, 'loss/train': 2.3798577785491943} 11/06/2021 21:34:47 - INFO - __main__ - Step 2202: {'lr': 0.0004999977244950551, 'samples': 422784, 'steps': 2201, 'loss/train': 2.6443750858306885} 11/06/2021 21:34:47 - INFO - __main__ - Step 2203: {'lr': 0.0004999977017969266, 'samples': 422976, 'steps': 2202, 'loss/train': 1.9695329666137695} 11/06/2021 21:34:48 - INFO - __main__ - Step 2204: {'lr': 0.000499997678986153, 'samples': 423168, 'steps': 2203, 'loss/train': 2.704178810119629} 11/06/2021 21:34:48 - INFO - __main__ - Step 2205: {'lr': 0.0004999976560627344, 'samples': 423360, 'steps': 2204, 'loss/train': 2.8710994720458984} 11/06/2021 21:34:49 - INFO - __main__ - Step 2206: {'lr': 0.0004999976330266707, 'samples': 423552, 'steps': 2205, 'loss/train': 2.6105599403381348} 11/06/2021 21:34:49 - INFO - __main__ - Step 2207: {'lr': 0.0004999976098779618, 'samples': 423744, 'steps': 2206, 'loss/train': 2.541774272918701} 11/06/2021 21:34:50 - INFO - __main__ - Step 2208: {'lr': 0.0004999975866166079, 'samples': 423936, 'steps': 2207, 'loss/train': 2.3094847202301025} 11/06/2021 21:34:50 - INFO - __main__ - Step 2209: {'lr': 0.000499997563242609, 'samples': 424128, 'steps': 2208, 'loss/train': 2.5492539405822754} 11/06/2021 21:34:50 - INFO - __main__ - Step 2210: {'lr': 0.0004999975397559649, 'samples': 424320, 'steps': 2209, 'loss/train': 2.095379114151001} 11/06/2021 21:34:52 - INFO - __main__ - Step 2211: {'lr': 0.000499997516156676, 'samples': 424512, 'steps': 2210, 'loss/train': 2.605085849761963} 11/06/2021 21:34:52 - INFO - __main__ - Step 2212: {'lr': 0.000499997492444742, 'samples': 424704, 'steps': 2211, 'loss/train': 2.455345630645752} 11/06/2021 21:34:52 - INFO - __main__ - Step 2213: {'lr': 0.0004999974686201629, 'samples': 424896, 'steps': 2212, 'loss/train': 2.637754201889038} 11/06/2021 21:34:53 - INFO - __main__ - Step 2214: {'lr': 0.0004999974446829389, 'samples': 425088, 'steps': 2213, 'loss/train': 2.4633007049560547} 11/06/2021 21:34:53 - INFO - __main__ - Step 2215: {'lr': 0.0004999974206330698, 'samples': 425280, 'steps': 2214, 'loss/train': 2.5374715328216553} 11/06/2021 21:34:53 - INFO - __main__ - Step 2216: {'lr': 0.0004999973964705558, 'samples': 425472, 'steps': 2215, 'loss/train': 2.7447330951690674} 11/06/2021 21:34:54 - INFO - __main__ - Step 2217: {'lr': 0.0004999973721953968, 'samples': 425664, 'steps': 2216, 'loss/train': 2.3357722759246826} 11/06/2021 21:34:55 - INFO - __main__ - Step 2218: {'lr': 0.0004999973478075928, 'samples': 425856, 'steps': 2217, 'loss/train': 2.6798555850982666} 11/06/2021 21:34:55 - INFO - __main__ - Step 2219: {'lr': 0.0004999973233071438, 'samples': 426048, 'steps': 2218, 'loss/train': 2.2830727100372314} 11/06/2021 21:34:55 - INFO - __main__ - Step 2220: {'lr': 0.00049999729869405, 'samples': 426240, 'steps': 2219, 'loss/train': 2.466994524002075} 11/06/2021 21:34:56 - INFO - __main__ - Step 2221: {'lr': 0.0004999972739683113, 'samples': 426432, 'steps': 2220, 'loss/train': 2.3057074546813965} 11/06/2021 21:34:57 - INFO - __main__ - Step 2222: {'lr': 0.0004999972491299276, 'samples': 426624, 'steps': 2221, 'loss/train': 2.5645909309387207} 11/06/2021 21:34:57 - INFO - __main__ - Step 2223: {'lr': 0.000499997224178899, 'samples': 426816, 'steps': 2222, 'loss/train': 2.493025064468384} 11/06/2021 21:34:57 - INFO - __main__ - Step 2224: {'lr': 0.0004999971991152256, 'samples': 427008, 'steps': 2223, 'loss/train': 3.2061784267425537} 11/06/2021 21:34:58 - INFO - __main__ - Step 2225: {'lr': 0.0004999971739389072, 'samples': 427200, 'steps': 2224, 'loss/train': 2.114089250564575} 11/06/2021 21:34:58 - INFO - __main__ - Step 2226: {'lr': 0.000499997148649944, 'samples': 427392, 'steps': 2225, 'loss/train': 2.6324121952056885} 11/06/2021 21:34:59 - INFO - __main__ - Step 2227: {'lr': 0.0004999971232483359, 'samples': 427584, 'steps': 2226, 'loss/train': 2.3817732334136963} 11/06/2021 21:34:59 - INFO - __main__ - Step 2228: {'lr': 0.0004999970977340829, 'samples': 427776, 'steps': 2227, 'loss/train': 2.323068380355835} 11/06/2021 21:35:00 - INFO - __main__ - Step 2229: {'lr': 0.0004999970721071852, 'samples': 427968, 'steps': 2228, 'loss/train': 2.610438585281372} 11/06/2021 21:35:00 - INFO - __main__ - Step 2230: {'lr': 0.0004999970463676427, 'samples': 428160, 'steps': 2229, 'loss/train': 2.492368221282959} 11/06/2021 21:35:00 - INFO - __main__ - Step 2231: {'lr': 0.0004999970205154553, 'samples': 428352, 'steps': 2230, 'loss/train': 2.369539499282837} 11/06/2021 21:35:02 - INFO - __main__ - Step 2232: {'lr': 0.000499996994550623, 'samples': 428544, 'steps': 2231, 'loss/train': 2.712465524673462} 11/06/2021 21:35:02 - INFO - __main__ - Step 2233: {'lr': 0.000499996968473146, 'samples': 428736, 'steps': 2232, 'loss/train': 3.136364221572876} 11/06/2021 21:35:02 - INFO - __main__ - Step 2234: {'lr': 0.0004999969422830242, 'samples': 428928, 'steps': 2233, 'loss/train': 1.8193436861038208} 11/06/2021 21:35:03 - INFO - __main__ - Step 2235: {'lr': 0.0004999969159802577, 'samples': 429120, 'steps': 2234, 'loss/train': 2.6638810634613037} 11/06/2021 21:35:03 - INFO - __main__ - Step 2236: {'lr': 0.0004999968895648464, 'samples': 429312, 'steps': 2235, 'loss/train': 2.946650505065918} 11/06/2021 21:35:03 - INFO - __main__ - Step 2237: {'lr': 0.0004999968630367905, 'samples': 429504, 'steps': 2236, 'loss/train': 1.9133739471435547} 11/06/2021 21:35:04 - INFO - __main__ - Step 2238: {'lr': 0.0004999968363960897, 'samples': 429696, 'steps': 2237, 'loss/train': 2.461745023727417} 11/06/2021 21:35:05 - INFO - __main__ - Step 2239: {'lr': 0.0004999968096427443, 'samples': 429888, 'steps': 2238, 'loss/train': 1.956204891204834} 11/06/2021 21:35:05 - INFO - __main__ - Step 2240: {'lr': 0.0004999967827767541, 'samples': 430080, 'steps': 2239, 'loss/train': 2.4149010181427} 11/06/2021 21:35:05 - INFO - __main__ - Step 2241: {'lr': 0.0004999967557981192, 'samples': 430272, 'steps': 2240, 'loss/train': 2.339151382446289} 11/06/2021 21:35:06 - INFO - __main__ - Step 2242: {'lr': 0.0004999967287068396, 'samples': 430464, 'steps': 2241, 'loss/train': 1.5045028924942017} 11/06/2021 21:35:07 - INFO - __main__ - Step 2243: {'lr': 0.0004999967015029155, 'samples': 430656, 'steps': 2242, 'loss/train': 2.4464919567108154} 11/06/2021 21:35:07 - INFO - __main__ - Step 2244: {'lr': 0.0004999966741863467, 'samples': 430848, 'steps': 2243, 'loss/train': 2.7989583015441895} 11/06/2021 21:35:07 - INFO - __main__ - Step 2245: {'lr': 0.000499996646757133, 'samples': 431040, 'steps': 2244, 'loss/train': 2.849099636077881} 11/06/2021 21:35:08 - INFO - __main__ - Step 2246: {'lr': 0.0004999966192152749, 'samples': 431232, 'steps': 2245, 'loss/train': 2.3336470127105713} 11/06/2021 21:35:08 - INFO - __main__ - Step 2247: {'lr': 0.0004999965915607722, 'samples': 431424, 'steps': 2246, 'loss/train': 1.3028373718261719} 11/06/2021 21:35:09 - INFO - __main__ - Step 2248: {'lr': 0.0004999965637936248, 'samples': 431616, 'steps': 2247, 'loss/train': 2.4899795055389404} 11/06/2021 21:35:10 - INFO - __main__ - Step 2249: {'lr': 0.0004999965359138329, 'samples': 431808, 'steps': 2248, 'loss/train': 3.0825843811035156} 11/06/2021 21:35:10 - INFO - __main__ - Step 2250: {'lr': 0.0004999965079213964, 'samples': 432000, 'steps': 2249, 'loss/train': 2.510723352432251} 11/06/2021 21:35:10 - INFO - __main__ - Step 2251: {'lr': 0.0004999964798163152, 'samples': 432192, 'steps': 2250, 'loss/train': 2.384847640991211} 11/06/2021 21:35:11 - INFO - __main__ - Step 2252: {'lr': 0.0004999964515985896, 'samples': 432384, 'steps': 2251, 'loss/train': 2.7811596393585205} 11/06/2021 21:35:12 - INFO - __main__ - Step 2253: {'lr': 0.0004999964232682194, 'samples': 432576, 'steps': 2252, 'loss/train': 3.195866823196411} 11/06/2021 21:35:12 - INFO - __main__ - Step 2254: {'lr': 0.0004999963948252046, 'samples': 432768, 'steps': 2253, 'loss/train': 1.684893012046814} 11/06/2021 21:35:12 - INFO - __main__ - Step 2255: {'lr': 0.0004999963662695453, 'samples': 432960, 'steps': 2254, 'loss/train': 2.470139265060425} 11/06/2021 21:35:13 - INFO - __main__ - Step 2256: {'lr': 0.0004999963376012416, 'samples': 433152, 'steps': 2255, 'loss/train': 2.2551157474517822} 11/06/2021 21:35:13 - INFO - __main__ - Step 2257: {'lr': 0.0004999963088202934, 'samples': 433344, 'steps': 2256, 'loss/train': 1.778255581855774} 11/06/2021 21:35:13 - INFO - __main__ - Step 2258: {'lr': 0.0004999962799267006, 'samples': 433536, 'steps': 2257, 'loss/train': 2.4314968585968018} 11/06/2021 21:35:14 - INFO - __main__ - Step 2259: {'lr': 0.0004999962509204634, 'samples': 433728, 'steps': 2258, 'loss/train': 2.8687584400177} 11/06/2021 21:35:15 - INFO - __main__ - Step 2260: {'lr': 0.0004999962218015818, 'samples': 433920, 'steps': 2259, 'loss/train': 2.5391645431518555} 11/06/2021 21:35:15 - INFO - __main__ - Step 2261: {'lr': 0.0004999961925700557, 'samples': 434112, 'steps': 2260, 'loss/train': 2.284990072250366} 11/06/2021 21:35:15 - INFO - __main__ - Step 2262: {'lr': 0.0004999961632258851, 'samples': 434304, 'steps': 2261, 'loss/train': 2.2912981510162354} 11/06/2021 21:35:16 - INFO - __main__ - Step 2263: {'lr': 0.0004999961337690703, 'samples': 434496, 'steps': 2262, 'loss/train': 2.5647494792938232} 11/06/2021 21:35:17 - INFO - __main__ - Step 2264: {'lr': 0.0004999961041996109, 'samples': 434688, 'steps': 2263, 'loss/train': 2.3833937644958496} 11/06/2021 21:35:17 - INFO - __main__ - Step 2265: {'lr': 0.0004999960745175071, 'samples': 434880, 'steps': 2264, 'loss/train': 2.4869725704193115} 11/06/2021 21:35:18 - INFO - __main__ - Step 2266: {'lr': 0.0004999960447227591, 'samples': 435072, 'steps': 2265, 'loss/train': 2.0234827995300293} 11/06/2021 21:35:18 - INFO - __main__ - Step 2267: {'lr': 0.0004999960148153667, 'samples': 435264, 'steps': 2266, 'loss/train': 2.4819018840789795} 11/06/2021 21:35:18 - INFO - __main__ - Step 2268: {'lr': 0.0004999959847953299, 'samples': 435456, 'steps': 2267, 'loss/train': 1.9739242792129517} 11/06/2021 21:35:19 - INFO - __main__ - Step 2269: {'lr': 0.0004999959546626487, 'samples': 435648, 'steps': 2268, 'loss/train': 2.5445456504821777} 11/06/2021 21:35:20 - INFO - __main__ - Step 2270: {'lr': 0.0004999959244173232, 'samples': 435840, 'steps': 2269, 'loss/train': 1.6040853261947632} 11/06/2021 21:35:20 - INFO - __main__ - Step 2271: {'lr': 0.0004999958940593535, 'samples': 436032, 'steps': 2270, 'loss/train': 2.391418218612671} 11/06/2021 21:35:20 - INFO - __main__ - Step 2272: {'lr': 0.0004999958635887394, 'samples': 436224, 'steps': 2271, 'loss/train': 2.7278921604156494} 11/06/2021 21:35:21 - INFO - __main__ - Step 2273: {'lr': 0.0004999958330054811, 'samples': 436416, 'steps': 2272, 'loss/train': 2.206878900527954} 11/06/2021 21:35:21 - INFO - __main__ - Step 2274: {'lr': 0.0004999958023095785, 'samples': 436608, 'steps': 2273, 'loss/train': 2.4823598861694336} 11/06/2021 21:35:22 - INFO - __main__ - Step 2275: {'lr': 0.0004999957715010317, 'samples': 436800, 'steps': 2274, 'loss/train': 2.0036473274230957} 11/06/2021 21:35:22 - INFO - __main__ - Step 2276: {'lr': 0.0004999957405798405, 'samples': 436992, 'steps': 2275, 'loss/train': 2.6134819984436035} 11/06/2021 21:35:23 - INFO - __main__ - Step 2277: {'lr': 0.0004999957095460052, 'samples': 437184, 'steps': 2276, 'loss/train': 2.5736441612243652} 11/06/2021 21:35:23 - INFO - __main__ - Step 2278: {'lr': 0.0004999956783995257, 'samples': 437376, 'steps': 2277, 'loss/train': 2.585498809814453} 11/06/2021 21:35:23 - INFO - __main__ - Step 2279: {'lr': 0.0004999956471404021, 'samples': 437568, 'steps': 2278, 'loss/train': 2.7420217990875244} 11/06/2021 21:35:24 - INFO - __main__ - Step 2280: {'lr': 0.0004999956157686341, 'samples': 437760, 'steps': 2279, 'loss/train': 2.3299975395202637} 11/06/2021 21:35:25 - INFO - __main__ - Step 2281: {'lr': 0.0004999955842842222, 'samples': 437952, 'steps': 2280, 'loss/train': 2.463308334350586} 11/06/2021 21:35:25 - INFO - __main__ - Step 2282: {'lr': 0.0004999955526871659, 'samples': 438144, 'steps': 2281, 'loss/train': 1.7193819284439087} 11/06/2021 21:35:26 - INFO - __main__ - Step 2283: {'lr': 0.0004999955209774656, 'samples': 438336, 'steps': 2282, 'loss/train': 2.3171699047088623} 11/06/2021 21:35:26 - INFO - __main__ - Step 2284: {'lr': 0.0004999954891551211, 'samples': 438528, 'steps': 2283, 'loss/train': 3.057791233062744} 11/06/2021 21:35:27 - INFO - __main__ - Step 2285: {'lr': 0.0004999954572201326, 'samples': 438720, 'steps': 2284, 'loss/train': 2.662093162536621} 11/06/2021 21:35:27 - INFO - __main__ - Step 2286: {'lr': 0.0004999954251724999, 'samples': 438912, 'steps': 2285, 'loss/train': 2.635653495788574} 11/06/2021 21:35:28 - INFO - __main__ - Step 2287: {'lr': 0.0004999953930122231, 'samples': 439104, 'steps': 2286, 'loss/train': 2.4084901809692383} 11/06/2021 21:35:28 - INFO - __main__ - Step 2288: {'lr': 0.0004999953607393023, 'samples': 439296, 'steps': 2287, 'loss/train': 2.4325740337371826} 11/06/2021 21:35:28 - INFO - __main__ - Step 2289: {'lr': 0.0004999953283537374, 'samples': 439488, 'steps': 2288, 'loss/train': 2.2441446781158447} 11/06/2021 21:35:29 - INFO - __main__ - Step 2290: {'lr': 0.0004999952958555285, 'samples': 439680, 'steps': 2289, 'loss/train': 3.243093729019165} 11/06/2021 21:35:30 - INFO - __main__ - Step 2291: {'lr': 0.0004999952632446756, 'samples': 439872, 'steps': 2290, 'loss/train': 2.7363367080688477} 11/06/2021 21:35:30 - INFO - __main__ - Step 2292: {'lr': 0.0004999952305211786, 'samples': 440064, 'steps': 2291, 'loss/train': 1.7157084941864014} 11/06/2021 21:35:31 - INFO - __main__ - Step 2293: {'lr': 0.0004999951976850377, 'samples': 440256, 'steps': 2292, 'loss/train': 1.871019721031189} 11/06/2021 21:35:31 - INFO - __main__ - Step 2294: {'lr': 0.0004999951647362527, 'samples': 440448, 'steps': 2293, 'loss/train': 2.4851877689361572} 11/06/2021 21:35:32 - INFO - __main__ - Step 2295: {'lr': 0.0004999951316748239, 'samples': 440640, 'steps': 2294, 'loss/train': 2.7389845848083496} 11/06/2021 21:35:32 - INFO - __main__ - Step 2296: {'lr': 0.0004999950985007511, 'samples': 440832, 'steps': 2295, 'loss/train': 1.447107195854187} 11/06/2021 21:35:33 - INFO - __main__ - Step 2297: {'lr': 0.0004999950652140343, 'samples': 441024, 'steps': 2296, 'loss/train': 2.8587417602539062} 11/06/2021 21:35:33 - INFO - __main__ - Step 2298: {'lr': 0.0004999950318146737, 'samples': 441216, 'steps': 2297, 'loss/train': 2.5910117626190186} 11/06/2021 21:35:33 - INFO - __main__ - Step 2299: {'lr': 0.0004999949983026691, 'samples': 441408, 'steps': 2298, 'loss/train': 2.3904645442962646} 11/06/2021 21:35:34 - INFO - __main__ - Step 2300: {'lr': 0.0004999949646780205, 'samples': 441600, 'steps': 2299, 'loss/train': 2.857335329055786} 11/06/2021 21:35:35 - INFO - __main__ - Step 2301: {'lr': 0.0004999949309407283, 'samples': 441792, 'steps': 2300, 'loss/train': 2.917921543121338} 11/06/2021 21:35:35 - INFO - __main__ - Step 2302: {'lr': 0.0004999948970907921, 'samples': 441984, 'steps': 2301, 'loss/train': 2.345691442489624} 11/06/2021 21:35:36 - INFO - __main__ - Step 2303: {'lr': 0.0004999948631282119, 'samples': 442176, 'steps': 2302, 'loss/train': 1.4263627529144287} 11/06/2021 21:35:36 - INFO - __main__ - Step 2304: {'lr': 0.0004999948290529881, 'samples': 442368, 'steps': 2303, 'loss/train': 2.424299955368042} 11/06/2021 21:35:36 - INFO - __main__ - Step 2305: {'lr': 0.0004999947948651204, 'samples': 442560, 'steps': 2304, 'loss/train': 1.874759316444397} 11/06/2021 21:35:37 - INFO - __main__ - Step 2306: {'lr': 0.0004999947605646089, 'samples': 442752, 'steps': 2305, 'loss/train': 2.55169677734375} 11/06/2021 21:35:38 - INFO - __main__ - Step 2307: {'lr': 0.0004999947261514537, 'samples': 442944, 'steps': 2306, 'loss/train': 2.5056583881378174} 11/06/2021 21:35:38 - INFO - __main__ - Step 2308: {'lr': 0.0004999946916256547, 'samples': 443136, 'steps': 2307, 'loss/train': 2.708646535873413} 11/06/2021 21:35:38 - INFO - __main__ - Step 2309: {'lr': 0.0004999946569872118, 'samples': 443328, 'steps': 2308, 'loss/train': 2.174234628677368} 11/06/2021 21:35:39 - INFO - __main__ - Step 2310: {'lr': 0.0004999946222361254, 'samples': 443520, 'steps': 2309, 'loss/train': 2.365739107131958} 11/06/2021 21:35:40 - INFO - __main__ - Step 2311: {'lr': 0.0004999945873723951, 'samples': 443712, 'steps': 2310, 'loss/train': 2.3640260696411133} 11/06/2021 21:35:40 - INFO - __main__ - Step 2312: {'lr': 0.0004999945523960212, 'samples': 443904, 'steps': 2311, 'loss/train': 1.8592215776443481} 11/06/2021 21:35:40 - INFO - __main__ - Step 2313: {'lr': 0.0004999945173070035, 'samples': 444096, 'steps': 2312, 'loss/train': 2.9647116661071777} 11/06/2021 21:35:41 - INFO - __main__ - Step 2314: {'lr': 0.0004999944821053422, 'samples': 444288, 'steps': 2313, 'loss/train': 2.653313636779785} 11/06/2021 21:35:41 - INFO - __main__ - Step 2315: {'lr': 0.0004999944467910372, 'samples': 444480, 'steps': 2314, 'loss/train': 2.4893155097961426} 11/06/2021 21:35:42 - INFO - __main__ - Step 2316: {'lr': 0.0004999944113640887, 'samples': 444672, 'steps': 2315, 'loss/train': 2.724855422973633} 11/06/2021 21:35:43 - INFO - __main__ - Step 2317: {'lr': 0.0004999943758244964, 'samples': 444864, 'steps': 2316, 'loss/train': 2.472731590270996} 11/06/2021 21:35:43 - INFO - __main__ - Step 2318: {'lr': 0.0004999943401722606, 'samples': 445056, 'steps': 2317, 'loss/train': 2.70874285697937} 11/06/2021 21:35:43 - INFO - __main__ - Step 2319: {'lr': 0.0004999943044073813, 'samples': 445248, 'steps': 2318, 'loss/train': 2.925724983215332} 11/06/2021 21:35:44 - INFO - __main__ - Step 2320: {'lr': 0.0004999942685298582, 'samples': 445440, 'steps': 2319, 'loss/train': 2.7367475032806396} 11/06/2021 21:35:44 - INFO - __main__ - Step 2321: {'lr': 0.0004999942325396916, 'samples': 445632, 'steps': 2320, 'loss/train': 2.439120054244995} 11/06/2021 21:35:45 - INFO - __main__ - Step 2322: {'lr': 0.0004999941964368817, 'samples': 445824, 'steps': 2321, 'loss/train': 2.9538588523864746} 11/06/2021 21:35:45 - INFO - __main__ - Step 2323: {'lr': 0.000499994160221428, 'samples': 446016, 'steps': 2322, 'loss/train': 2.5466785430908203} 11/06/2021 21:35:46 - INFO - __main__ - Step 2324: {'lr': 0.0004999941238933308, 'samples': 446208, 'steps': 2323, 'loss/train': 2.833611488342285} 11/06/2021 21:35:46 - INFO - __main__ - Step 2325: {'lr': 0.0004999940874525902, 'samples': 446400, 'steps': 2324, 'loss/train': 2.2067928314208984} 11/06/2021 21:35:46 - INFO - __main__ - Step 2326: {'lr': 0.0004999940508992061, 'samples': 446592, 'steps': 2325, 'loss/train': 2.688612222671509} 11/06/2021 21:35:48 - INFO - __main__ - Step 2327: {'lr': 0.0004999940142331785, 'samples': 446784, 'steps': 2326, 'loss/train': 2.2611327171325684} 11/06/2021 21:35:48 - INFO - __main__ - Step 2328: {'lr': 0.0004999939774545074, 'samples': 446976, 'steps': 2327, 'loss/train': 2.940781593322754} 11/06/2021 21:35:48 - INFO - __main__ - Step 2329: {'lr': 0.000499993940563193, 'samples': 447168, 'steps': 2328, 'loss/train': 1.4350242614746094} 11/06/2021 21:35:49 - INFO - __main__ - Step 2330: {'lr': 0.0004999939035592351, 'samples': 447360, 'steps': 2329, 'loss/train': 1.2856340408325195} 11/06/2021 21:35:49 - INFO - __main__ - Step 2331: {'lr': 0.0004999938664426339, 'samples': 447552, 'steps': 2330, 'loss/train': 2.2000558376312256} 11/06/2021 21:35:50 - INFO - __main__ - Step 2332: {'lr': 0.0004999938292133894, 'samples': 447744, 'steps': 2331, 'loss/train': 2.552149534225464} 11/06/2021 21:35:51 - INFO - __main__ - Step 2333: {'lr': 0.0004999937918715013, 'samples': 447936, 'steps': 2332, 'loss/train': 2.23763370513916} 11/06/2021 21:35:51 - INFO - __main__ - Step 2334: {'lr': 0.00049999375441697, 'samples': 448128, 'steps': 2333, 'loss/train': 2.5193448066711426} 11/06/2021 21:35:51 - INFO - __main__ - Step 2335: {'lr': 0.0004999937168497954, 'samples': 448320, 'steps': 2334, 'loss/train': 2.982269763946533} 11/06/2021 21:35:52 - INFO - __main__ - Step 2336: {'lr': 0.0004999936791699773, 'samples': 448512, 'steps': 2335, 'loss/train': 2.2376298904418945} 11/06/2021 21:35:53 - INFO - __main__ - Step 2337: {'lr': 0.0004999936413775161, 'samples': 448704, 'steps': 2336, 'loss/train': 2.544750928878784} 11/06/2021 21:35:53 - INFO - __main__ - Step 2338: {'lr': 0.0004999936034724115, 'samples': 448896, 'steps': 2337, 'loss/train': 2.562856912612915} 11/06/2021 21:35:53 - INFO - __main__ - Step 2339: {'lr': 0.0004999935654546638, 'samples': 449088, 'steps': 2338, 'loss/train': 2.4741744995117188} 11/06/2021 21:35:54 - INFO - __main__ - Step 2340: {'lr': 0.0004999935273242727, 'samples': 449280, 'steps': 2339, 'loss/train': 2.637646198272705} 11/06/2021 21:35:54 - INFO - __main__ - Step 2341: {'lr': 0.0004999934890812384, 'samples': 449472, 'steps': 2340, 'loss/train': 2.313438892364502} 11/06/2021 21:35:55 - INFO - __main__ - Step 2342: {'lr': 0.0004999934507255609, 'samples': 449664, 'steps': 2341, 'loss/train': 1.6301710605621338} 11/06/2021 21:35:55 - INFO - __main__ - Step 2343: {'lr': 0.0004999934122572403, 'samples': 449856, 'steps': 2342, 'loss/train': 2.490684747695923} 11/06/2021 21:35:56 - INFO - __main__ - Step 2344: {'lr': 0.0004999933736762763, 'samples': 450048, 'steps': 2343, 'loss/train': 2.5411455631256104} 11/06/2021 21:35:56 - INFO - __main__ - Step 2345: {'lr': 0.0004999933349826694, 'samples': 450240, 'steps': 2344, 'loss/train': 2.634213924407959} 11/06/2021 21:35:57 - INFO - __main__ - Step 2346: {'lr': 0.0004999932961764192, 'samples': 450432, 'steps': 2345, 'loss/train': 2.2040252685546875} 11/06/2021 21:35:58 - INFO - __main__ - Step 2347: {'lr': 0.000499993257257526, 'samples': 450624, 'steps': 2346, 'loss/train': 3.1868784427642822} 11/06/2021 21:35:58 - INFO - __main__ - Step 2348: {'lr': 0.0004999932182259897, 'samples': 450816, 'steps': 2347, 'loss/train': 2.624990940093994} 11/06/2021 21:35:58 - INFO - __main__ - Step 2349: {'lr': 0.0004999931790818102, 'samples': 451008, 'steps': 2348, 'loss/train': 2.1832830905914307} 11/06/2021 21:35:59 - INFO - __main__ - Step 2350: {'lr': 0.0004999931398249876, 'samples': 451200, 'steps': 2349, 'loss/train': 1.3503772020339966} 11/06/2021 21:35:59 - INFO - __main__ - Step 2351: {'lr': 0.0004999931004555221, 'samples': 451392, 'steps': 2350, 'loss/train': 2.434903383255005} 11/06/2021 21:36:00 - INFO - __main__ - Step 2352: {'lr': 0.0004999930609734135, 'samples': 451584, 'steps': 2351, 'loss/train': 1.9292891025543213} 11/06/2021 21:36:01 - INFO - __main__ - Step 2353: {'lr': 0.0004999930213786619, 'samples': 451776, 'steps': 2352, 'loss/train': 2.576533555984497} 11/06/2021 21:36:01 - INFO - __main__ - Step 2354: {'lr': 0.0004999929816712672, 'samples': 451968, 'steps': 2353, 'loss/train': 2.6374588012695312} 11/06/2021 21:36:01 - INFO - __main__ - Step 2355: {'lr': 0.0004999929418512296, 'samples': 452160, 'steps': 2354, 'loss/train': 2.645233392715454} 11/06/2021 21:36:02 - INFO - __main__ - Step 2356: {'lr': 0.0004999929019185491, 'samples': 452352, 'steps': 2355, 'loss/train': 2.3898088932037354} 11/06/2021 21:36:02 - INFO - __main__ - Step 2357: {'lr': 0.0004999928618732256, 'samples': 452544, 'steps': 2356, 'loss/train': 2.8378491401672363} 11/06/2021 21:36:03 - INFO - __main__ - Step 2358: {'lr': 0.0004999928217152591, 'samples': 452736, 'steps': 2357, 'loss/train': 2.8093080520629883} 11/06/2021 21:36:03 - INFO - __main__ - Step 2359: {'lr': 0.0004999927814446498, 'samples': 452928, 'steps': 2358, 'loss/train': 2.382434844970703} 11/06/2021 21:36:04 - INFO - __main__ - Step 2360: {'lr': 0.0004999927410613975, 'samples': 453120, 'steps': 2359, 'loss/train': 2.509754180908203} 11/06/2021 21:36:04 - INFO - __main__ - Step 2361: {'lr': 0.0004999927005655024, 'samples': 453312, 'steps': 2360, 'loss/train': 2.2279305458068848} 11/06/2021 21:36:04 - INFO - __main__ - Step 2362: {'lr': 0.0004999926599569644, 'samples': 453504, 'steps': 2361, 'loss/train': 2.2018115520477295} 11/06/2021 21:36:05 - INFO - __main__ - Step 2363: {'lr': 0.0004999926192357836, 'samples': 453696, 'steps': 2362, 'loss/train': 2.667649269104004} 11/06/2021 21:36:06 - INFO - __main__ - Step 2364: {'lr': 0.00049999257840196, 'samples': 453888, 'steps': 2363, 'loss/train': 1.9997769594192505} 11/06/2021 21:36:06 - INFO - __main__ - Step 2365: {'lr': 0.0004999925374554936, 'samples': 454080, 'steps': 2364, 'loss/train': 2.3278563022613525} 11/06/2021 21:36:06 - INFO - __main__ - Step 2366: {'lr': 0.0004999924963963845, 'samples': 454272, 'steps': 2365, 'loss/train': 2.7401719093322754} 11/06/2021 21:36:07 - INFO - __main__ - Step 2367: {'lr': 0.0004999924552246324, 'samples': 454464, 'steps': 2366, 'loss/train': 2.5451650619506836} 11/06/2021 21:36:08 - INFO - __main__ - Step 2368: {'lr': 0.0004999924139402378, 'samples': 454656, 'steps': 2367, 'loss/train': 2.171853542327881} 11/06/2021 21:36:08 - INFO - __main__ - Step 2369: {'lr': 0.0004999923725432004, 'samples': 454848, 'steps': 2368, 'loss/train': 2.3794937133789062} 11/06/2021 21:36:09 - INFO - __main__ - Step 2370: {'lr': 0.0004999923310335202, 'samples': 455040, 'steps': 2369, 'loss/train': 2.2186713218688965} 11/06/2021 21:36:09 - INFO - __main__ - Step 2371: {'lr': 0.0004999922894111975, 'samples': 455232, 'steps': 2370, 'loss/train': 2.5309088230133057} 11/06/2021 21:36:09 - INFO - __main__ - Step 2372: {'lr': 0.000499992247676232, 'samples': 455424, 'steps': 2371, 'loss/train': 2.399653196334839} 11/06/2021 21:36:10 - INFO - __main__ - Step 2373: {'lr': 0.0004999922058286238, 'samples': 455616, 'steps': 2372, 'loss/train': 2.432457208633423} 11/06/2021 21:36:11 - INFO - __main__ - Step 2374: {'lr': 0.0004999921638683731, 'samples': 455808, 'steps': 2373, 'loss/train': 2.936699390411377} 11/06/2021 21:36:11 - INFO - __main__ - Step 2375: {'lr': 0.0004999921217954797, 'samples': 456000, 'steps': 2374, 'loss/train': 2.8602395057678223} 11/06/2021 21:36:11 - INFO - __main__ - Step 2376: {'lr': 0.0004999920796099437, 'samples': 456192, 'steps': 2375, 'loss/train': 2.113866090774536} 11/06/2021 21:36:12 - INFO - __main__ - Step 2377: {'lr': 0.0004999920373117652, 'samples': 456384, 'steps': 2376, 'loss/train': 2.382596492767334} 11/06/2021 21:36:13 - INFO - __main__ - Step 2378: {'lr': 0.0004999919949009442, 'samples': 456576, 'steps': 2377, 'loss/train': 1.9662163257598877} 11/06/2021 21:36:13 - INFO - __main__ - Step 2379: {'lr': 0.0004999919523774806, 'samples': 456768, 'steps': 2378, 'loss/train': 2.765549659729004} 11/06/2021 21:36:13 - INFO - __main__ - Step 2380: {'lr': 0.0004999919097413743, 'samples': 456960, 'steps': 2379, 'loss/train': 2.0128252506256104} 11/06/2021 21:36:14 - INFO - __main__ - Step 2381: {'lr': 0.0004999918669926258, 'samples': 457152, 'steps': 2380, 'loss/train': 2.4427826404571533} 11/06/2021 21:36:14 - INFO - __main__ - Step 2382: {'lr': 0.0004999918241312346, 'samples': 457344, 'steps': 2381, 'loss/train': 3.2769882678985596} 11/06/2021 21:36:15 - INFO - __main__ - Step 2383: {'lr': 0.0004999917811572011, 'samples': 457536, 'steps': 2382, 'loss/train': 2.264770030975342} 11/06/2021 21:36:16 - INFO - __main__ - Step 2384: {'lr': 0.000499991738070525, 'samples': 457728, 'steps': 2383, 'loss/train': 2.820688247680664} 11/06/2021 21:36:16 - INFO - __main__ - Step 2385: {'lr': 0.0004999916948712066, 'samples': 457920, 'steps': 2384, 'loss/train': 2.739671230316162} 11/06/2021 21:36:16 - INFO - __main__ - Step 2386: {'lr': 0.0004999916515592458, 'samples': 458112, 'steps': 2385, 'loss/train': 2.355753183364868} 11/06/2021 21:36:17 - INFO - __main__ - Step 2387: {'lr': 0.0004999916081346426, 'samples': 458304, 'steps': 2386, 'loss/train': 2.6739003658294678} 11/06/2021 21:36:17 - INFO - __main__ - Step 2388: {'lr': 0.000499991564597397, 'samples': 458496, 'steps': 2387, 'loss/train': 2.985884189605713} 11/06/2021 21:36:18 - INFO - __main__ - Step 2389: {'lr': 0.0004999915209475091, 'samples': 458688, 'steps': 2388, 'loss/train': 2.5510177612304688} 11/06/2021 21:36:18 - INFO - __main__ - Step 2390: {'lr': 0.0004999914771849788, 'samples': 458880, 'steps': 2389, 'loss/train': 2.628279447555542} 11/06/2021 21:36:19 - INFO - __main__ - Step 2391: {'lr': 0.0004999914333098063, 'samples': 459072, 'steps': 2390, 'loss/train': 2.3192155361175537} 11/06/2021 21:36:19 - INFO - __main__ - Step 2392: {'lr': 0.0004999913893219915, 'samples': 459264, 'steps': 2391, 'loss/train': 2.4803824424743652} 11/06/2021 21:36:19 - INFO - __main__ - Step 2393: {'lr': 0.0004999913452215345, 'samples': 459456, 'steps': 2392, 'loss/train': 2.0477797985076904} 11/06/2021 21:36:20 - INFO - __main__ - Step 2394: {'lr': 0.0004999913010084351, 'samples': 459648, 'steps': 2393, 'loss/train': 2.529292583465576} 11/06/2021 21:36:21 - INFO - __main__ - Step 2395: {'lr': 0.0004999912566826935, 'samples': 459840, 'steps': 2394, 'loss/train': 2.484360456466675} 11/06/2021 21:36:21 - INFO - __main__ - Step 2396: {'lr': 0.0004999912122443098, 'samples': 460032, 'steps': 2395, 'loss/train': 2.425950288772583} 11/06/2021 21:36:22 - INFO - __main__ - Step 2397: {'lr': 0.0004999911676932838, 'samples': 460224, 'steps': 2396, 'loss/train': 2.7710964679718018} 11/06/2021 21:36:22 - INFO - __main__ - Step 2398: {'lr': 0.0004999911230296158, 'samples': 460416, 'steps': 2397, 'loss/train': 2.69278883934021} 11/06/2021 21:36:23 - INFO - __main__ - Step 2399: {'lr': 0.0004999910782533055, 'samples': 460608, 'steps': 2398, 'loss/train': 2.1563870906829834} 11/06/2021 21:36:23 - INFO - __main__ - Step 2400: {'lr': 0.0004999910333643531, 'samples': 460800, 'steps': 2399, 'loss/train': 2.21958327293396} 11/06/2021 21:36:24 - INFO - __main__ - Step 2401: {'lr': 0.0004999909883627587, 'samples': 460992, 'steps': 2400, 'loss/train': 2.3043596744537354} 11/06/2021 21:36:24 - INFO - __main__ - Step 2402: {'lr': 0.0004999909432485221, 'samples': 461184, 'steps': 2401, 'loss/train': 2.142707586288452} 11/06/2021 21:36:24 - INFO - __main__ - Step 2403: {'lr': 0.0004999908980216436, 'samples': 461376, 'steps': 2402, 'loss/train': 2.6707916259765625} 11/06/2021 21:36:25 - INFO - __main__ - Step 2404: {'lr': 0.0004999908526821229, 'samples': 461568, 'steps': 2403, 'loss/train': 2.7008554935455322} 11/06/2021 21:36:26 - INFO - __main__ - Step 2405: {'lr': 0.0004999908072299602, 'samples': 461760, 'steps': 2404, 'loss/train': 2.4561195373535156} 11/06/2021 21:36:26 - INFO - __main__ - Step 2406: {'lr': 0.0004999907616651556, 'samples': 461952, 'steps': 2405, 'loss/train': 2.6300549507141113} 11/06/2021 21:36:26 - INFO - __main__ - Step 2407: {'lr': 0.000499990715987709, 'samples': 462144, 'steps': 2406, 'loss/train': 2.5278778076171875} 11/06/2021 21:36:27 - INFO - __main__ - Step 2408: {'lr': 0.0004999906701976203, 'samples': 462336, 'steps': 2407, 'loss/train': 2.5817818641662598} 11/06/2021 21:36:28 - INFO - __main__ - Step 2409: {'lr': 0.0004999906242948898, 'samples': 462528, 'steps': 2408, 'loss/train': 2.7424280643463135} 11/06/2021 21:36:28 - INFO - __main__ - Step 2410: {'lr': 0.0004999905782795173, 'samples': 462720, 'steps': 2409, 'loss/train': 2.3306918144226074} 11/06/2021 21:36:29 - INFO - __main__ - Step 2411: {'lr': 0.000499990532151503, 'samples': 462912, 'steps': 2410, 'loss/train': 2.3240315914154053} 11/06/2021 21:36:29 - INFO - __main__ - Step 2412: {'lr': 0.0004999904859108467, 'samples': 463104, 'steps': 2411, 'loss/train': 2.3780782222747803} 11/06/2021 21:36:29 - INFO - __main__ - Step 2413: {'lr': 0.0004999904395575486, 'samples': 463296, 'steps': 2412, 'loss/train': 2.3109028339385986} 11/06/2021 21:36:30 - INFO - __main__ - Step 2414: {'lr': 0.0004999903930916087, 'samples': 463488, 'steps': 2413, 'loss/train': 2.878998279571533} 11/06/2021 21:36:31 - INFO - __main__ - Step 2415: {'lr': 0.000499990346513027, 'samples': 463680, 'steps': 2414, 'loss/train': 2.0418524742126465} 11/06/2021 21:36:31 - INFO - __main__ - Step 2416: {'lr': 0.0004999902998218034, 'samples': 463872, 'steps': 2415, 'loss/train': 2.2002334594726562} 11/06/2021 21:36:31 - INFO - __main__ - Step 2417: {'lr': 0.000499990253017938, 'samples': 464064, 'steps': 2416, 'loss/train': 2.537015438079834} 11/06/2021 21:36:32 - INFO - __main__ - Step 2418: {'lr': 0.0004999902061014311, 'samples': 464256, 'steps': 2417, 'loss/train': 2.3986918926239014} 11/06/2021 21:36:32 - INFO - __main__ - Step 2419: {'lr': 0.0004999901590722823, 'samples': 464448, 'steps': 2418, 'loss/train': 2.3698606491088867} 11/06/2021 21:36:33 - INFO - __main__ - Step 2420: {'lr': 0.0004999901119304919, 'samples': 464640, 'steps': 2419, 'loss/train': 2.4910922050476074} 11/06/2021 21:36:33 - INFO - __main__ - Step 2421: {'lr': 0.0004999900646760597, 'samples': 464832, 'steps': 2420, 'loss/train': 2.2482521533966064} 11/06/2021 21:36:34 - INFO - __main__ - Step 2422: {'lr': 0.0004999900173089858, 'samples': 465024, 'steps': 2421, 'loss/train': 3.0635385513305664} 11/06/2021 21:36:34 - INFO - __main__ - Step 2423: {'lr': 0.0004999899698292703, 'samples': 465216, 'steps': 2422, 'loss/train': 2.6223621368408203} 11/06/2021 21:36:35 - INFO - __main__ - Step 2424: {'lr': 0.0004999899222369132, 'samples': 465408, 'steps': 2423, 'loss/train': 2.0457282066345215} 11/06/2021 21:36:35 - INFO - __main__ - Step 2425: {'lr': 0.0004999898745319145, 'samples': 465600, 'steps': 2424, 'loss/train': 2.419081449508667} 11/06/2021 21:36:36 - INFO - __main__ - Step 2426: {'lr': 0.0004999898267142741, 'samples': 465792, 'steps': 2425, 'loss/train': 2.6468558311462402} 11/06/2021 21:36:36 - INFO - __main__ - Step 2427: {'lr': 0.0004999897787839923, 'samples': 465984, 'steps': 2426, 'loss/train': 2.8656837940216064} 11/06/2021 21:36:37 - INFO - __main__ - Step 2428: {'lr': 0.000499989730741069, 'samples': 466176, 'steps': 2427, 'loss/train': 2.517319679260254} 11/06/2021 21:36:37 - INFO - __main__ - Step 2429: {'lr': 0.000499989682585504, 'samples': 466368, 'steps': 2428, 'loss/train': 1.5299700498580933} 11/06/2021 21:36:37 - INFO - __main__ - Step 2430: {'lr': 0.0004999896343172976, 'samples': 466560, 'steps': 2429, 'loss/train': 1.4747153520584106} 11/06/2021 21:36:38 - INFO - __main__ - Step 2431: {'lr': 0.0004999895859364498, 'samples': 466752, 'steps': 2430, 'loss/train': 1.975550651550293} 11/06/2021 21:36:39 - INFO - __main__ - Step 2432: {'lr': 0.0004999895374429605, 'samples': 466944, 'steps': 2431, 'loss/train': 2.7396199703216553} 11/06/2021 21:36:39 - INFO - __main__ - Step 2433: {'lr': 0.0004999894888368297, 'samples': 467136, 'steps': 2432, 'loss/train': 2.1605048179626465} 11/06/2021 21:36:39 - INFO - __main__ - Step 2434: {'lr': 0.0004999894401180576, 'samples': 467328, 'steps': 2433, 'loss/train': 2.631399631500244} 11/06/2021 21:36:40 - INFO - __main__ - Step 2435: {'lr': 0.0004999893912866441, 'samples': 467520, 'steps': 2434, 'loss/train': 2.6873080730438232} 11/06/2021 21:36:41 - INFO - __main__ - Step 2436: {'lr': 0.0004999893423425892, 'samples': 467712, 'steps': 2435, 'loss/train': 2.2480852603912354} 11/06/2021 21:36:41 - INFO - __main__ - Step 2437: {'lr': 0.0004999892932858929, 'samples': 467904, 'steps': 2436, 'loss/train': 2.1071290969848633} 11/06/2021 21:36:42 - INFO - __main__ - Step 2438: {'lr': 0.0004999892441165554, 'samples': 468096, 'steps': 2437, 'loss/train': 2.8640754222869873} 11/06/2021 21:36:42 - INFO - __main__ - Step 2439: {'lr': 0.0004999891948345765, 'samples': 468288, 'steps': 2438, 'loss/train': 2.4221558570861816} 11/06/2021 21:36:42 - INFO - __main__ - Step 2440: {'lr': 0.0004999891454399565, 'samples': 468480, 'steps': 2439, 'loss/train': 2.7163968086242676} 11/06/2021 21:36:43 - INFO - __main__ - Step 2441: {'lr': 0.000499989095932695, 'samples': 468672, 'steps': 2440, 'loss/train': 2.371148109436035} 11/06/2021 21:36:44 - INFO - __main__ - Step 2442: {'lr': 0.0004999890463127924, 'samples': 468864, 'steps': 2441, 'loss/train': 2.5758795738220215} 11/06/2021 21:36:44 - INFO - __main__ - Step 2443: {'lr': 0.0004999889965802486, 'samples': 469056, 'steps': 2442, 'loss/train': 2.778921127319336} 11/06/2021 21:36:44 - INFO - __main__ - Step 2444: {'lr': 0.0004999889467350636, 'samples': 469248, 'steps': 2443, 'loss/train': 1.8492902517318726} 11/06/2021 21:36:45 - INFO - __main__ - Step 2445: {'lr': 0.0004999888967772375, 'samples': 469440, 'steps': 2444, 'loss/train': 2.191366672515869} 11/06/2021 21:36:45 - INFO - __main__ - Step 2446: {'lr': 0.0004999888467067702, 'samples': 469632, 'steps': 2445, 'loss/train': 2.458528757095337} 11/06/2021 21:36:46 - INFO - __main__ - Step 2447: {'lr': 0.0004999887965236617, 'samples': 469824, 'steps': 2446, 'loss/train': 2.605138063430786} 11/06/2021 21:36:46 - INFO - __main__ - Step 2448: {'lr': 0.0004999887462279123, 'samples': 470016, 'steps': 2447, 'loss/train': 2.1305642127990723} 11/06/2021 21:36:47 - INFO - __main__ - Step 2449: {'lr': 0.0004999886958195216, 'samples': 470208, 'steps': 2448, 'loss/train': 2.6707427501678467} 11/06/2021 21:36:47 - INFO - __main__ - Step 2450: {'lr': 0.00049998864529849, 'samples': 470400, 'steps': 2449, 'loss/train': 1.814773440361023} 11/06/2021 21:36:47 - INFO - __main__ - Step 2451: {'lr': 0.0004999885946648174, 'samples': 470592, 'steps': 2450, 'loss/train': 2.4460277557373047} 11/06/2021 21:36:48 - INFO - __main__ - Step 2452: {'lr': 0.0004999885439185037, 'samples': 470784, 'steps': 2451, 'loss/train': 2.429396390914917} 11/06/2021 21:36:49 - INFO - __main__ - Step 2453: {'lr': 0.0004999884930595491, 'samples': 470976, 'steps': 2452, 'loss/train': 2.094714879989624} 11/06/2021 21:36:49 - INFO - __main__ - Step 2454: {'lr': 0.0004999884420879534, 'samples': 471168, 'steps': 2453, 'loss/train': 2.2643680572509766} 11/06/2021 21:36:49 - INFO - __main__ - Step 2455: {'lr': 0.000499988391003717, 'samples': 471360, 'steps': 2454, 'loss/train': 2.5920233726501465} 11/06/2021 21:36:50 - INFO - __main__ - Step 2456: {'lr': 0.0004999883398068396, 'samples': 471552, 'steps': 2455, 'loss/train': 3.2174901962280273} 11/06/2021 21:36:51 - INFO - __main__ - Step 2457: {'lr': 0.0004999882884973212, 'samples': 471744, 'steps': 2456, 'loss/train': 2.311598062515259} 11/06/2021 21:36:52 - INFO - __main__ - Step 2458: {'lr': 0.000499988237075162, 'samples': 471936, 'steps': 2457, 'loss/train': 2.8705482482910156} 11/06/2021 21:36:52 - INFO - __main__ - Step 2459: {'lr': 0.000499988185540362, 'samples': 472128, 'steps': 2458, 'loss/train': 3.6789543628692627} 11/06/2021 21:36:52 - INFO - __main__ - Step 2460: {'lr': 0.0004999881338929211, 'samples': 472320, 'steps': 2459, 'loss/train': 2.2293074131011963} 11/06/2021 21:36:53 - INFO - __main__ - Step 2461: {'lr': 0.0004999880821328395, 'samples': 472512, 'steps': 2460, 'loss/train': 2.13460373878479} 11/06/2021 21:36:53 - INFO - __main__ - Step 2462: {'lr': 0.000499988030260117, 'samples': 472704, 'steps': 2461, 'loss/train': 2.2145402431488037} 11/06/2021 21:36:54 - INFO - __main__ - Step 2463: {'lr': 0.0004999879782747539, 'samples': 472896, 'steps': 2462, 'loss/train': 2.412808418273926} 11/06/2021 21:36:55 - INFO - __main__ - Step 2464: {'lr': 0.00049998792617675, 'samples': 473088, 'steps': 2463, 'loss/train': 1.355026125907898} 11/06/2021 21:36:55 - INFO - __main__ - Step 2465: {'lr': 0.0004999878739661053, 'samples': 473280, 'steps': 2464, 'loss/train': 2.797194242477417} 11/06/2021 21:36:55 - INFO - __main__ - Step 2466: {'lr': 0.0004999878216428201, 'samples': 473472, 'steps': 2465, 'loss/train': 3.099053144454956} 11/06/2021 21:36:56 - INFO - __main__ - Step 2467: {'lr': 0.0004999877692068942, 'samples': 473664, 'steps': 2466, 'loss/train': 2.666736602783203} 11/06/2021 21:36:57 - INFO - __main__ - Step 2468: {'lr': 0.0004999877166583276, 'samples': 473856, 'steps': 2467, 'loss/train': 2.2007715702056885} 11/06/2021 21:36:57 - INFO - __main__ - Step 2469: {'lr': 0.0004999876639971204, 'samples': 474048, 'steps': 2468, 'loss/train': 2.5760223865509033} 11/06/2021 21:36:57 - INFO - __main__ - Step 2470: {'lr': 0.0004999876112232726, 'samples': 474240, 'steps': 2469, 'loss/train': 2.2806897163391113} 11/06/2021 21:36:58 - INFO - __main__ - Step 2471: {'lr': 0.0004999875583367844, 'samples': 474432, 'steps': 2470, 'loss/train': 2.3899688720703125} 11/06/2021 21:36:58 - INFO - __main__ - Step 2472: {'lr': 0.0004999875053376555, 'samples': 474624, 'steps': 2471, 'loss/train': 2.720411539077759} 11/06/2021 21:36:59 - INFO - __main__ - Step 2473: {'lr': 0.0004999874522258861, 'samples': 474816, 'steps': 2472, 'loss/train': 2.3538568019866943} 11/06/2021 21:36:59 - INFO - __main__ - Step 2474: {'lr': 0.0004999873990014763, 'samples': 475008, 'steps': 2473, 'loss/train': 2.296961784362793} 11/06/2021 21:37:00 - INFO - __main__ - Step 2475: {'lr': 0.0004999873456644259, 'samples': 475200, 'steps': 2474, 'loss/train': 3.126654624938965} 11/06/2021 21:37:00 - INFO - __main__ - Step 2476: {'lr': 0.0004999872922147352, 'samples': 475392, 'steps': 2475, 'loss/train': 2.8180599212646484} 11/06/2021 21:37:01 - INFO - __main__ - Step 2477: {'lr': 0.0004999872386524041, 'samples': 475584, 'steps': 2476, 'loss/train': 2.209730625152588} 11/06/2021 21:37:02 - INFO - __main__ - Step 2478: {'lr': 0.0004999871849774325, 'samples': 475776, 'steps': 2477, 'loss/train': 2.773592472076416} 11/06/2021 21:37:02 - INFO - __main__ - Step 2479: {'lr': 0.0004999871311898205, 'samples': 475968, 'steps': 2478, 'loss/train': 2.503931760787964} 11/06/2021 21:37:02 - INFO - __main__ - Step 2480: {'lr': 0.0004999870772895683, 'samples': 476160, 'steps': 2479, 'loss/train': 2.848806858062744} 11/06/2021 21:37:03 - INFO - __main__ - Step 2481: {'lr': 0.0004999870232766756, 'samples': 476352, 'steps': 2480, 'loss/train': 2.6014020442962646} 11/06/2021 21:37:03 - INFO - __main__ - Step 2482: {'lr': 0.0004999869691511428, 'samples': 476544, 'steps': 2481, 'loss/train': 2.4127416610717773} 11/06/2021 21:37:04 - INFO - __main__ - Step 2483: {'lr': 0.0004999869149129696, 'samples': 476736, 'steps': 2482, 'loss/train': 1.9022128582000732} 11/06/2021 21:37:04 - INFO - __main__ - Step 2484: {'lr': 0.0004999868605621563, 'samples': 476928, 'steps': 2483, 'loss/train': 2.502490282058716} 11/06/2021 21:37:05 - INFO - __main__ - Step 2485: {'lr': 0.0004999868060987027, 'samples': 477120, 'steps': 2484, 'loss/train': 2.5162899494171143} 11/06/2021 21:37:05 - INFO - __main__ - Step 2486: {'lr': 0.0004999867515226088, 'samples': 477312, 'steps': 2485, 'loss/train': 2.565920829772949} 11/06/2021 21:37:05 - INFO - __main__ - Step 2487: {'lr': 0.0004999866968338748, 'samples': 477504, 'steps': 2486, 'loss/train': 2.152040481567383} 11/06/2021 21:37:07 - INFO - __main__ - Step 2488: {'lr': 0.0004999866420325006, 'samples': 477696, 'steps': 2487, 'loss/train': 2.5506532192230225} 11/06/2021 21:37:07 - INFO - __main__ - Step 2489: {'lr': 0.0004999865871184863, 'samples': 477888, 'steps': 2488, 'loss/train': 2.6349217891693115} 11/06/2021 21:37:07 - INFO - __main__ - Step 2490: {'lr': 0.000499986532091832, 'samples': 478080, 'steps': 2489, 'loss/train': 1.8564928770065308} 11/06/2021 21:37:08 - INFO - __main__ - Step 2491: {'lr': 0.0004999864769525375, 'samples': 478272, 'steps': 2490, 'loss/train': 2.035461664199829} 11/06/2021 21:37:08 - INFO - __main__ - Step 2492: {'lr': 0.000499986421700603, 'samples': 478464, 'steps': 2491, 'loss/train': 2.712242364883423} 11/06/2021 21:37:09 - INFO - __main__ - Step 2493: {'lr': 0.0004999863663360285, 'samples': 478656, 'steps': 2492, 'loss/train': 1.668128490447998} 11/06/2021 21:37:09 - INFO - __main__ - Step 2494: {'lr': 0.000499986310858814, 'samples': 478848, 'steps': 2493, 'loss/train': 2.30643892288208} 11/06/2021 21:37:10 - INFO - __main__ - Step 2495: {'lr': 0.0004999862552689595, 'samples': 479040, 'steps': 2494, 'loss/train': 2.329277992248535} 11/06/2021 21:37:10 - INFO - __main__ - Step 2496: {'lr': 0.000499986199566465, 'samples': 479232, 'steps': 2495, 'loss/train': 2.888967275619507} 11/06/2021 21:37:10 - INFO - __main__ - Step 2497: {'lr': 0.0004999861437513306, 'samples': 479424, 'steps': 2496, 'loss/train': 2.5283238887786865} 11/06/2021 21:37:11 - INFO - __main__ - Step 2498: {'lr': 0.0004999860878235564, 'samples': 479616, 'steps': 2497, 'loss/train': 2.4118523597717285} 11/06/2021 21:37:12 - INFO - __main__ - Step 2499: {'lr': 0.0004999860317831423, 'samples': 479808, 'steps': 2498, 'loss/train': 1.975379228591919} 11/06/2021 21:37:12 - INFO - __main__ - Step 2500: {'lr': 0.0004999859756300883, 'samples': 480000, 'steps': 2499, 'loss/train': 2.776916027069092} 11/06/2021 21:37:13 - INFO - __main__ - Step 2501: {'lr': 0.0004999859193643945, 'samples': 480192, 'steps': 2500, 'loss/train': 1.9438010454177856} 11/06/2021 21:37:13 - INFO - __main__ - Step 2502: {'lr': 0.0004999858629860609, 'samples': 480384, 'steps': 2501, 'loss/train': 2.3400754928588867} 11/06/2021 21:37:14 - INFO - __main__ - Step 2503: {'lr': 0.0004999858064950875, 'samples': 480576, 'steps': 2502, 'loss/train': 2.453505516052246} 11/06/2021 21:37:14 - INFO - __main__ - Step 2504: {'lr': 0.0004999857498914744, 'samples': 480768, 'steps': 2503, 'loss/train': 1.737013339996338} 11/06/2021 21:37:15 - INFO - __main__ - Step 2505: {'lr': 0.0004999856931752215, 'samples': 480960, 'steps': 2504, 'loss/train': 2.44512677192688} 11/06/2021 21:37:15 - INFO - __main__ - Step 2506: {'lr': 0.000499985636346329, 'samples': 481152, 'steps': 2505, 'loss/train': 2.2163236141204834} 11/06/2021 21:37:15 - INFO - __main__ - Step 2507: {'lr': 0.0004999855794047968, 'samples': 481344, 'steps': 2506, 'loss/train': 2.2586050033569336} 11/06/2021 21:37:16 - INFO - __main__ - Step 2508: {'lr': 0.000499985522350625, 'samples': 481536, 'steps': 2507, 'loss/train': 2.6702263355255127} 11/06/2021 21:37:17 - INFO - __main__ - Step 2509: {'lr': 0.0004999854651838134, 'samples': 481728, 'steps': 2508, 'loss/train': 1.7556337118148804} 11/06/2021 21:37:17 - INFO - __main__ - Step 2510: {'lr': 0.0004999854079043624, 'samples': 481920, 'steps': 2509, 'loss/train': 2.5938799381256104} 11/06/2021 21:37:17 - INFO - __main__ - Step 2511: {'lr': 0.0004999853505122718, 'samples': 482112, 'steps': 2510, 'loss/train': 2.314603567123413} 11/06/2021 21:37:18 - INFO - __main__ - Step 2512: {'lr': 0.0004999852930075416, 'samples': 482304, 'steps': 2511, 'loss/train': 3.3958892822265625} 11/06/2021 21:37:18 - INFO - __main__ - Step 2513: {'lr': 0.0004999852353901719, 'samples': 482496, 'steps': 2512, 'loss/train': 2.0846526622772217} 11/06/2021 21:37:19 - INFO - __main__ - Step 2514: {'lr': 0.0004999851776601627, 'samples': 482688, 'steps': 2513, 'loss/train': 2.1145737171173096} 11/06/2021 21:37:19 - INFO - __main__ - Step 2515: {'lr': 0.0004999851198175141, 'samples': 482880, 'steps': 2514, 'loss/train': 2.3666229248046875} 11/06/2021 21:37:20 - INFO - __main__ - Step 2516: {'lr': 0.0004999850618622259, 'samples': 483072, 'steps': 2515, 'loss/train': 1.9783778190612793} 11/06/2021 21:37:20 - INFO - __main__ - Step 2517: {'lr': 0.0004999850037942984, 'samples': 483264, 'steps': 2516, 'loss/train': 1.6819298267364502} 11/06/2021 21:37:21 - INFO - __main__ - Step 2518: {'lr': 0.0004999849456137316, 'samples': 483456, 'steps': 2517, 'loss/train': 2.5427420139312744} 11/06/2021 21:37:22 - INFO - __main__ - Step 2519: {'lr': 0.0004999848873205254, 'samples': 483648, 'steps': 2518, 'loss/train': 1.944232702255249} 11/06/2021 21:37:22 - INFO - __main__ - Step 2520: {'lr': 0.0004999848289146798, 'samples': 483840, 'steps': 2519, 'loss/train': 2.538642644882202} 11/06/2021 21:37:22 - INFO - __main__ - Step 2521: {'lr': 0.0004999847703961948, 'samples': 484032, 'steps': 2520, 'loss/train': 2.035733699798584} 11/06/2021 21:37:23 - INFO - __main__ - Step 2522: {'lr': 0.0004999847117650708, 'samples': 484224, 'steps': 2521, 'loss/train': 2.2497646808624268} 11/06/2021 21:37:23 - INFO - __main__ - Step 2523: {'lr': 0.0004999846530213074, 'samples': 484416, 'steps': 2522, 'loss/train': 2.4687421321868896} 11/06/2021 21:37:24 - INFO - __main__ - Step 2524: {'lr': 0.0004999845941649048, 'samples': 484608, 'steps': 2523, 'loss/train': 2.6273131370544434} 11/06/2021 21:37:25 - INFO - __main__ - Step 2525: {'lr': 0.0004999845351958629, 'samples': 484800, 'steps': 2524, 'loss/train': 2.2162749767303467} 11/06/2021 21:37:25 - INFO - __main__ - Step 2526: {'lr': 0.0004999844761141818, 'samples': 484992, 'steps': 2525, 'loss/train': 2.4674794673919678} 11/06/2021 21:37:25 - INFO - __main__ - Step 2527: {'lr': 0.0004999844169198617, 'samples': 485184, 'steps': 2526, 'loss/train': 2.444234609603882} 11/06/2021 21:37:26 - INFO - __main__ - Step 2528: {'lr': 0.0004999843576129024, 'samples': 485376, 'steps': 2527, 'loss/train': 2.046820640563965} 11/06/2021 21:37:27 - INFO - __main__ - Step 2529: {'lr': 0.000499984298193304, 'samples': 485568, 'steps': 2528, 'loss/train': 3.3430094718933105} 11/06/2021 21:37:27 - INFO - __main__ - Step 2530: {'lr': 0.0004999842386610666, 'samples': 485760, 'steps': 2529, 'loss/train': 2.4798810482025146} 11/06/2021 21:37:27 - INFO - __main__ - Step 2531: {'lr': 0.0004999841790161901, 'samples': 485952, 'steps': 2530, 'loss/train': 2.1105799674987793} 11/06/2021 21:37:28 - INFO - __main__ - Step 2532: {'lr': 0.0004999841192586746, 'samples': 486144, 'steps': 2531, 'loss/train': 2.247392177581787} 11/06/2021 21:37:28 - INFO - __main__ - Step 2533: {'lr': 0.0004999840593885201, 'samples': 486336, 'steps': 2532, 'loss/train': 2.632009983062744} 11/06/2021 21:37:29 - INFO - __main__ - Step 2534: {'lr': 0.0004999839994057266, 'samples': 486528, 'steps': 2533, 'loss/train': 2.2957653999328613} 11/06/2021 21:37:29 - INFO - __main__ - Step 2535: {'lr': 0.0004999839393102943, 'samples': 486720, 'steps': 2534, 'loss/train': 2.15610671043396} 11/06/2021 21:37:30 - INFO - __main__ - Step 2536: {'lr': 0.0004999838791022229, 'samples': 486912, 'steps': 2535, 'loss/train': 1.8324062824249268} 11/06/2021 21:37:30 - INFO - __main__ - Step 2537: {'lr': 0.0004999838187815128, 'samples': 487104, 'steps': 2536, 'loss/train': 2.1403756141662598} 11/06/2021 21:37:30 - INFO - __main__ - Step 2538: {'lr': 0.0004999837583481638, 'samples': 487296, 'steps': 2537, 'loss/train': 2.4630844593048096} 11/06/2021 21:37:31 - INFO - __main__ - Step 2539: {'lr': 0.000499983697802176, 'samples': 487488, 'steps': 2538, 'loss/train': 2.763502836227417} 11/06/2021 21:37:32 - INFO - __main__ - Step 2540: {'lr': 0.0004999836371435494, 'samples': 487680, 'steps': 2539, 'loss/train': 1.6404800415039062} 11/06/2021 21:37:32 - INFO - __main__ - Step 2541: {'lr': 0.000499983576372284, 'samples': 487872, 'steps': 2540, 'loss/train': 0.7703345417976379} 11/06/2021 21:37:32 - INFO - __main__ - Step 2542: {'lr': 0.0004999835154883798, 'samples': 488064, 'steps': 2541, 'loss/train': 2.3808321952819824} 11/06/2021 21:37:33 - INFO - __main__ - Step 2543: {'lr': 0.0004999834544918369, 'samples': 488256, 'steps': 2542, 'loss/train': 1.3541216850280762} 11/06/2021 21:37:33 - INFO - __main__ - Step 2544: {'lr': 0.0004999833933826554, 'samples': 488448, 'steps': 2543, 'loss/train': 2.6432573795318604} 11/06/2021 21:37:34 - INFO - __main__ - Step 2545: {'lr': 0.0004999833321608351, 'samples': 488640, 'steps': 2544, 'loss/train': 2.655363082885742} 11/06/2021 21:37:35 - INFO - __main__ - Step 2546: {'lr': 0.0004999832708263764, 'samples': 488832, 'steps': 2545, 'loss/train': 1.829182744026184} 11/06/2021 21:37:35 - INFO - __main__ - Step 2547: {'lr': 0.000499983209379279, 'samples': 489024, 'steps': 2546, 'loss/train': 2.714189291000366} 11/06/2021 21:37:35 - INFO - __main__ - Step 2548: {'lr': 0.0004999831478195429, 'samples': 489216, 'steps': 2547, 'loss/train': 2.1510307788848877} 11/06/2021 21:37:36 - INFO - __main__ - Step 2549: {'lr': 0.0004999830861471684, 'samples': 489408, 'steps': 2548, 'loss/train': 2.5913808345794678} 11/06/2021 21:37:37 - INFO - __main__ - Step 2550: {'lr': 0.0004999830243621553, 'samples': 489600, 'steps': 2549, 'loss/train': 2.1029839515686035} 11/06/2021 21:37:37 - INFO - __main__ - Step 2551: {'lr': 0.0004999829624645037, 'samples': 489792, 'steps': 2550, 'loss/train': 2.4384331703186035} 11/06/2021 21:37:37 - INFO - __main__ - Step 2552: {'lr': 0.0004999829004542136, 'samples': 489984, 'steps': 2551, 'loss/train': 2.5452609062194824} 11/06/2021 21:37:38 - INFO - __main__ - Step 2553: {'lr': 0.0004999828383312851, 'samples': 490176, 'steps': 2552, 'loss/train': 2.293485403060913} 11/06/2021 21:37:38 - INFO - __main__ - Step 2554: {'lr': 0.0004999827760957182, 'samples': 490368, 'steps': 2553, 'loss/train': 2.1594653129577637} 11/06/2021 21:37:39 - INFO - __main__ - Step 2555: {'lr': 0.000499982713747513, 'samples': 490560, 'steps': 2554, 'loss/train': 2.2348339557647705} 11/06/2021 21:37:39 - INFO - __main__ - Step 2556: {'lr': 0.0004999826512866693, 'samples': 490752, 'steps': 2555, 'loss/train': 2.039029598236084} 11/06/2021 21:37:40 - INFO - __main__ - Step 2557: {'lr': 0.0004999825887131874, 'samples': 490944, 'steps': 2556, 'loss/train': 2.3221940994262695} 11/06/2021 21:37:40 - INFO - __main__ - Step 2558: {'lr': 0.0004999825260270671, 'samples': 491136, 'steps': 2557, 'loss/train': 1.9088735580444336} 11/06/2021 21:37:40 - INFO - __main__ - Step 2559: {'lr': 0.0004999824632283086, 'samples': 491328, 'steps': 2558, 'loss/train': 2.34741473197937} 11/06/2021 21:37:41 - INFO - __main__ - Step 2560: {'lr': 0.0004999824003169119, 'samples': 491520, 'steps': 2559, 'loss/train': 2.832210063934326} 11/06/2021 21:37:42 - INFO - __main__ - Step 2561: {'lr': 0.000499982337292877, 'samples': 491712, 'steps': 2560, 'loss/train': 2.518115758895874} 11/06/2021 21:37:42 - INFO - __main__ - Step 2562: {'lr': 0.0004999822741562038, 'samples': 491904, 'steps': 2561, 'loss/train': 1.5568736791610718} 11/06/2021 21:37:42 - INFO - __main__ - Step 2563: {'lr': 0.0004999822109068925, 'samples': 492096, 'steps': 2562, 'loss/train': 2.128868818283081} 11/06/2021 21:37:43 - INFO - __main__ - Step 2564: {'lr': 0.000499982147544943, 'samples': 492288, 'steps': 2563, 'loss/train': 2.274667739868164} 11/06/2021 21:37:44 - INFO - __main__ - Step 2565: {'lr': 0.0004999820840703554, 'samples': 492480, 'steps': 2564, 'loss/train': 2.438570022583008} 11/06/2021 21:37:44 - INFO - __main__ - Step 2566: {'lr': 0.0004999820204831298, 'samples': 492672, 'steps': 2565, 'loss/train': 1.8974850177764893} 11/06/2021 21:37:44 - INFO - __main__ - Step 2567: {'lr': 0.0004999819567832661, 'samples': 492864, 'steps': 2566, 'loss/train': 1.991250991821289} 11/06/2021 21:37:45 - INFO - __main__ - Step 2568: {'lr': 0.0004999818929707645, 'samples': 493056, 'steps': 2567, 'loss/train': 2.29544734954834} 11/06/2021 21:37:45 - INFO - __main__ - Step 2569: {'lr': 0.0004999818290456249, 'samples': 493248, 'steps': 2568, 'loss/train': 2.5977227687835693} 11/06/2021 21:37:45 - INFO - __main__ - Step 2570: {'lr': 0.0004999817650078474, 'samples': 493440, 'steps': 2569, 'loss/train': 2.2066221237182617} 11/06/2021 21:37:47 - INFO - __main__ - Step 2571: {'lr': 0.0004999817008574318, 'samples': 493632, 'steps': 2570, 'loss/train': 2.1649460792541504} 11/06/2021 21:37:47 - INFO - __main__ - Step 2572: {'lr': 0.0004999816365943784, 'samples': 493824, 'steps': 2571, 'loss/train': 2.5640785694122314} 11/06/2021 21:37:47 - INFO - __main__ - Step 2573: {'lr': 0.000499981572218687, 'samples': 494016, 'steps': 2572, 'loss/train': 2.131150007247925} 11/06/2021 21:37:48 - INFO - __main__ - Step 2574: {'lr': 0.0004999815077303579, 'samples': 494208, 'steps': 2573, 'loss/train': 2.548187017440796} 11/06/2021 21:37:48 - INFO - __main__ - Step 2575: {'lr': 0.000499981443129391, 'samples': 494400, 'steps': 2574, 'loss/train': 1.7672605514526367} 11/06/2021 21:37:49 - INFO - __main__ - Step 2576: {'lr': 0.0004999813784157863, 'samples': 494592, 'steps': 2575, 'loss/train': 1.979129433631897} 11/06/2021 21:37:49 - INFO - __main__ - Step 2577: {'lr': 0.0004999813135895438, 'samples': 494784, 'steps': 2576, 'loss/train': 2.5306124687194824} 11/06/2021 21:37:50 - INFO - __main__ - Step 2578: {'lr': 0.0004999812486506637, 'samples': 494976, 'steps': 2577, 'loss/train': 1.985874056816101} 11/06/2021 21:37:50 - INFO - __main__ - Step 2579: {'lr': 0.0004999811835991457, 'samples': 495168, 'steps': 2578, 'loss/train': 2.2465949058532715} 11/06/2021 21:37:50 - INFO - __main__ - Step 2580: {'lr': 0.0004999811184349902, 'samples': 495360, 'steps': 2579, 'loss/train': 1.718479871749878} 11/06/2021 21:37:51 - INFO - __main__ - Step 2581: {'lr': 0.000499981053158197, 'samples': 495552, 'steps': 2580, 'loss/train': 2.3030970096588135} 11/06/2021 21:37:52 - INFO - __main__ - Step 2582: {'lr': 0.0004999809877687662, 'samples': 495744, 'steps': 2581, 'loss/train': 2.4952125549316406} 11/06/2021 21:37:52 - INFO - __main__ - Step 2583: {'lr': 0.0004999809222666978, 'samples': 495936, 'steps': 2582, 'loss/train': 2.9251458644866943} 11/06/2021 21:37:52 - INFO - __main__ - Step 2584: {'lr': 0.0004999808566519919, 'samples': 496128, 'steps': 2583, 'loss/train': 2.0826971530914307} 11/06/2021 21:37:53 - INFO - __main__ - Step 2585: {'lr': 0.0004999807909246485, 'samples': 496320, 'steps': 2584, 'loss/train': 2.6530747413635254} 11/06/2021 21:37:54 - INFO - __main__ - Step 2586: {'lr': 0.0004999807250846676, 'samples': 496512, 'steps': 2585, 'loss/train': 2.023077964782715} 11/06/2021 21:37:54 - INFO - __main__ - Step 2587: {'lr': 0.0004999806591320492, 'samples': 496704, 'steps': 2586, 'loss/train': 2.128833532333374} 11/06/2021 21:37:54 - INFO - __main__ - Step 2588: {'lr': 0.0004999805930667934, 'samples': 496896, 'steps': 2587, 'loss/train': 2.2073006629943848} 11/06/2021 21:37:55 - INFO - __main__ - Step 2589: {'lr': 0.0004999805268889003, 'samples': 497088, 'steps': 2588, 'loss/train': 2.542658567428589} 11/06/2021 21:37:55 - INFO - __main__ - Step 2590: {'lr': 0.0004999804605983697, 'samples': 497280, 'steps': 2589, 'loss/train': 1.7954328060150146} 11/06/2021 21:37:56 - INFO - __main__ - Step 2591: {'lr': 0.0004999803941952018, 'samples': 497472, 'steps': 2590, 'loss/train': 1.970054030418396} 11/06/2021 21:37:57 - INFO - __main__ - Step 2592: {'lr': 0.0004999803276793965, 'samples': 497664, 'steps': 2591, 'loss/train': 2.5273969173431396} 11/06/2021 21:37:57 - INFO - __main__ - Step 2593: {'lr': 0.0004999802610509541, 'samples': 497856, 'steps': 2592, 'loss/train': 2.8750476837158203} 11/06/2021 21:37:57 - INFO - __main__ - Step 2594: {'lr': 0.0004999801943098743, 'samples': 498048, 'steps': 2593, 'loss/train': 2.145956039428711} 11/06/2021 21:37:58 - INFO - __main__ - Step 2595: {'lr': 0.0004999801274561573, 'samples': 498240, 'steps': 2594, 'loss/train': 1.3880176544189453} 11/06/2021 21:37:58 - INFO - __main__ - Step 2596: {'lr': 0.0004999800604898032, 'samples': 498432, 'steps': 2595, 'loss/train': 2.8618476390838623} 11/06/2021 21:37:59 - INFO - __main__ - Step 2597: {'lr': 0.000499979993410812, 'samples': 498624, 'steps': 2596, 'loss/train': 1.0421839952468872} 11/06/2021 21:37:59 - INFO - __main__ - Step 2598: {'lr': 0.0004999799262191835, 'samples': 498816, 'steps': 2597, 'loss/train': 2.2791144847869873} 11/06/2021 21:38:00 - INFO - __main__ - Step 2599: {'lr': 0.0004999798589149179, 'samples': 499008, 'steps': 2598, 'loss/train': 1.718825101852417} 11/06/2021 21:38:00 - INFO - __main__ - Step 2600: {'lr': 0.0004999797914980154, 'samples': 499200, 'steps': 2599, 'loss/train': 2.3873207569122314} 11/06/2021 21:38:01 - INFO - __main__ - Step 2601: {'lr': 0.0004999797239684757, 'samples': 499392, 'steps': 2600, 'loss/train': 1.5167217254638672} 11/06/2021 21:38:02 - INFO - __main__ - Step 2602: {'lr': 0.0004999796563262991, 'samples': 499584, 'steps': 2601, 'loss/train': 2.33709454536438} 11/06/2021 21:38:02 - INFO - __main__ - Step 2603: {'lr': 0.0004999795885714855, 'samples': 499776, 'steps': 2602, 'loss/train': 1.5589454174041748} 11/06/2021 21:38:02 - INFO - __main__ - Step 2604: {'lr': 0.0004999795207040349, 'samples': 499968, 'steps': 2603, 'loss/train': 3.2038750648498535} 11/06/2021 21:38:03 - INFO - __main__ - Step 2605: {'lr': 0.0004999794527239474, 'samples': 500160, 'steps': 2604, 'loss/train': 2.761178731918335} 11/06/2021 21:38:03 - INFO - __main__ - Step 2606: {'lr': 0.000499979384631223, 'samples': 500352, 'steps': 2605, 'loss/train': 2.426281690597534} 11/06/2021 21:38:03 - INFO - __main__ - Step 2607: {'lr': 0.000499979316425862, 'samples': 500544, 'steps': 2606, 'loss/train': 2.246225118637085} 11/06/2021 21:38:04 - INFO - __main__ - Step 2608: {'lr': 0.0004999792481078639, 'samples': 500736, 'steps': 2607, 'loss/train': 1.7112780809402466} 11/06/2021 21:38:05 - INFO - __main__ - Step 2609: {'lr': 0.000499979179677229, 'samples': 500928, 'steps': 2608, 'loss/train': 2.6037216186523438} 11/06/2021 21:38:05 - INFO - __main__ - Step 2610: {'lr': 0.0004999791111339574, 'samples': 501120, 'steps': 2609, 'loss/train': 2.3880434036254883} 11/06/2021 21:38:05 - INFO - __main__ - Step 2611: {'lr': 0.0004999790424780492, 'samples': 501312, 'steps': 2610, 'loss/train': 2.595461368560791} 11/06/2021 21:38:06 - INFO - __main__ - Step 2612: {'lr': 0.0004999789737095041, 'samples': 501504, 'steps': 2611, 'loss/train': 1.1569690704345703} 11/06/2021 21:38:07 - INFO - __main__ - Step 2613: {'lr': 0.0004999789048283224, 'samples': 501696, 'steps': 2612, 'loss/train': 3.0552828311920166} 11/06/2021 21:38:07 - INFO - __main__ - Step 2614: {'lr': 0.0004999788358345041, 'samples': 501888, 'steps': 2613, 'loss/train': 2.318122386932373} 11/06/2021 21:38:08 - INFO - __main__ - Step 2615: {'lr': 0.0004999787667280492, 'samples': 502080, 'steps': 2614, 'loss/train': 2.3980915546417236} 11/06/2021 21:38:08 - INFO - __main__ - Step 2616: {'lr': 0.0004999786975089577, 'samples': 502272, 'steps': 2615, 'loss/train': 4.580399513244629} 11/06/2021 21:38:08 - INFO - __main__ - Step 2617: {'lr': 0.0004999786281772296, 'samples': 502464, 'steps': 2616, 'loss/train': 2.6063055992126465} 11/06/2021 21:38:09 - INFO - __main__ - Step 2618: {'lr': 0.0004999785587328651, 'samples': 502656, 'steps': 2617, 'loss/train': 2.637915849685669} 11/06/2021 21:38:10 - INFO - __main__ - Step 2619: {'lr': 0.0004999784891758641, 'samples': 502848, 'steps': 2618, 'loss/train': 2.712459087371826} 11/06/2021 21:38:10 - INFO - __main__ - Step 2620: {'lr': 0.0004999784195062266, 'samples': 503040, 'steps': 2619, 'loss/train': 1.5384448766708374} 11/06/2021 21:38:10 - INFO - __main__ - Step 2621: {'lr': 0.0004999783497239526, 'samples': 503232, 'steps': 2620, 'loss/train': 2.1913907527923584} 11/06/2021 21:38:11 - INFO - __main__ - Step 2622: {'lr': 0.0004999782798290424, 'samples': 503424, 'steps': 2621, 'loss/train': 2.0295615196228027} 11/06/2021 21:38:11 - INFO - __main__ - Step 2623: {'lr': 0.0004999782098214957, 'samples': 503616, 'steps': 2622, 'loss/train': 2.4918506145477295} 11/06/2021 21:38:12 - INFO - __main__ - Step 2624: {'lr': 0.0004999781397013127, 'samples': 503808, 'steps': 2623, 'loss/train': 2.2522714138031006} 11/06/2021 21:38:13 - INFO - __main__ - Step 2625: {'lr': 0.0004999780694684934, 'samples': 504000, 'steps': 2624, 'loss/train': 2.477266550064087} 11/06/2021 21:38:13 - INFO - __main__ - Step 2626: {'lr': 0.000499977999123038, 'samples': 504192, 'steps': 2625, 'loss/train': 2.451108455657959} 11/06/2021 21:38:13 - INFO - __main__ - Step 2627: {'lr': 0.0004999779286649461, 'samples': 504384, 'steps': 2626, 'loss/train': 2.620913028717041} 11/06/2021 21:38:14 - INFO - __main__ - Step 2628: {'lr': 0.0004999778580942183, 'samples': 504576, 'steps': 2627, 'loss/train': 1.3361049890518188} 11/06/2021 21:38:15 - INFO - __main__ - Step 2629: {'lr': 0.000499977787410854, 'samples': 504768, 'steps': 2628, 'loss/train': 2.3474013805389404} 11/06/2021 21:38:15 - INFO - __main__ - Step 2630: {'lr': 0.0004999777166148539, 'samples': 504960, 'steps': 2629, 'loss/train': 2.328670024871826} 11/06/2021 21:38:15 - INFO - __main__ - Step 2631: {'lr': 0.0004999776457062175, 'samples': 505152, 'steps': 2630, 'loss/train': 2.4533984661102295} 11/06/2021 21:38:16 - INFO - __main__ - Step 2632: {'lr': 0.0004999775746849451, 'samples': 505344, 'steps': 2631, 'loss/train': 1.9219590425491333} 11/06/2021 21:38:16 - INFO - __main__ - Step 2633: {'lr': 0.0004999775035510367, 'samples': 505536, 'steps': 2632, 'loss/train': 1.3246666193008423} 11/06/2021 21:38:17 - INFO - __main__ - Step 2634: {'lr': 0.0004999774323044922, 'samples': 505728, 'steps': 2633, 'loss/train': 2.5250437259674072} 11/06/2021 21:38:17 - INFO - __main__ - Step 2635: {'lr': 0.0004999773609453118, 'samples': 505920, 'steps': 2634, 'loss/train': 2.8457674980163574} 11/06/2021 21:38:18 - INFO - __main__ - Step 2636: {'lr': 0.0004999772894734954, 'samples': 506112, 'steps': 2635, 'loss/train': 1.6440210342407227} 11/06/2021 21:38:18 - INFO - __main__ - Step 2637: {'lr': 0.000499977217889043, 'samples': 506304, 'steps': 2636, 'loss/train': 2.209573984146118} 11/06/2021 21:38:18 - INFO - __main__ - Step 2638: {'lr': 0.0004999771461919549, 'samples': 506496, 'steps': 2637, 'loss/train': 2.6149797439575195} 11/06/2021 21:38:20 - INFO - __main__ - Step 2639: {'lr': 0.0004999770743822309, 'samples': 506688, 'steps': 2638, 'loss/train': 1.8034636974334717} 11/06/2021 21:38:20 - INFO - __main__ - Step 2640: {'lr': 0.0004999770024598711, 'samples': 506880, 'steps': 2639, 'loss/train': 2.848144054412842} 11/06/2021 21:38:20 - INFO - __main__ - Step 2641: {'lr': 0.0004999769304248754, 'samples': 507072, 'steps': 2640, 'loss/train': 2.00382661819458} 11/06/2021 21:38:21 - INFO - __main__ - Step 2642: {'lr': 0.0004999768582772442, 'samples': 507264, 'steps': 2641, 'loss/train': 2.284688711166382} 11/06/2021 21:38:21 - INFO - __main__ - Step 2643: {'lr': 0.000499976786016977, 'samples': 507456, 'steps': 2642, 'loss/train': 2.6324987411499023} 11/06/2021 21:38:22 - INFO - __main__ - Step 2644: {'lr': 0.0004999767136440742, 'samples': 507648, 'steps': 2643, 'loss/train': 2.4299545288085938} 11/06/2021 21:38:22 - INFO - __main__ - Step 2645: {'lr': 0.0004999766411585359, 'samples': 507840, 'steps': 2644, 'loss/train': 2.3650753498077393} 11/06/2021 21:38:23 - INFO - __main__ - Step 2646: {'lr': 0.0004999765685603618, 'samples': 508032, 'steps': 2645, 'loss/train': 1.2427858114242554} 11/06/2021 21:38:23 - INFO - __main__ - Step 2647: {'lr': 0.0004999764958495522, 'samples': 508224, 'steps': 2646, 'loss/train': 1.7125329971313477} 11/06/2021 21:38:23 - INFO - __main__ - Step 2648: {'lr': 0.0004999764230261072, 'samples': 508416, 'steps': 2647, 'loss/train': 2.69739031791687} 11/06/2021 21:38:24 - INFO - __main__ - Step 2649: {'lr': 0.0004999763500900265, 'samples': 508608, 'steps': 2648, 'loss/train': 2.207709550857544} 11/06/2021 21:38:25 - INFO - __main__ - Step 2650: {'lr': 0.0004999762770413103, 'samples': 508800, 'steps': 2649, 'loss/train': 2.108356237411499} 11/06/2021 21:38:25 - INFO - __main__ - Step 2651: {'lr': 0.0004999762038799587, 'samples': 508992, 'steps': 2650, 'loss/train': 2.2225661277770996} 11/06/2021 21:38:25 - INFO - __main__ - Step 2652: {'lr': 0.0004999761306059717, 'samples': 509184, 'steps': 2651, 'loss/train': 2.4659764766693115} 11/06/2021 21:38:26 - INFO - __main__ - Step 2653: {'lr': 0.0004999760572193492, 'samples': 509376, 'steps': 2652, 'loss/train': 2.0966432094573975} 11/06/2021 21:38:26 - INFO - __main__ - Step 2654: {'lr': 0.0004999759837200914, 'samples': 509568, 'steps': 2653, 'loss/train': 2.6494579315185547} 11/06/2021 21:38:27 - INFO - __main__ - Step 2655: {'lr': 0.0004999759101081984, 'samples': 509760, 'steps': 2654, 'loss/train': 2.213627815246582} 11/06/2021 21:38:28 - INFO - __main__ - Step 2656: {'lr': 0.0004999758363836701, 'samples': 509952, 'steps': 2655, 'loss/train': 2.514630079269409} 11/06/2021 21:38:28 - INFO - __main__ - Step 2657: {'lr': 0.0004999757625465063, 'samples': 510144, 'steps': 2656, 'loss/train': 1.6061630249023438} 11/06/2021 21:38:28 - INFO - __main__ - Step 2658: {'lr': 0.0004999756885967075, 'samples': 510336, 'steps': 2657, 'loss/train': 1.924759030342102} 11/06/2021 21:38:29 - INFO - __main__ - Step 2659: {'lr': 0.0004999756145342735, 'samples': 510528, 'steps': 2658, 'loss/train': 1.7294937372207642} 11/06/2021 21:38:30 - INFO - __main__ - Step 2660: {'lr': 0.0004999755403592043, 'samples': 510720, 'steps': 2659, 'loss/train': 2.623060941696167} 11/06/2021 21:38:30 - INFO - __main__ - Step 2661: {'lr': 0.0004999754660714999, 'samples': 510912, 'steps': 2660, 'loss/train': 2.1438426971435547} 11/06/2021 21:38:30 - INFO - __main__ - Step 2662: {'lr': 0.0004999753916711606, 'samples': 511104, 'steps': 2661, 'loss/train': 1.9994590282440186} 11/06/2021 21:38:31 - INFO - __main__ - Step 2663: {'lr': 0.0004999753171581862, 'samples': 511296, 'steps': 2662, 'loss/train': 2.382080554962158} 11/06/2021 21:38:31 - INFO - __main__ - Step 2664: {'lr': 0.0004999752425325766, 'samples': 511488, 'steps': 2663, 'loss/train': 2.0769588947296143} 11/06/2021 21:38:32 - INFO - __main__ - Step 2665: {'lr': 0.0004999751677943322, 'samples': 511680, 'steps': 2664, 'loss/train': 1.7713021039962769} 11/06/2021 21:38:32 - INFO - __main__ - Step 2666: {'lr': 0.0004999750929434527, 'samples': 511872, 'steps': 2665, 'loss/train': 2.075993537902832} 11/06/2021 21:38:33 - INFO - __main__ - Step 2667: {'lr': 0.0004999750179799383, 'samples': 512064, 'steps': 2666, 'loss/train': 2.5356898307800293} 11/06/2021 21:38:33 - INFO - __main__ - Step 2668: {'lr': 0.0004999749429037892, 'samples': 512256, 'steps': 2667, 'loss/train': 2.999053716659546} 11/06/2021 21:38:33 - INFO - __main__ - Step 2669: {'lr': 0.0004999748677150051, 'samples': 512448, 'steps': 2668, 'loss/train': 2.6738812923431396} 11/06/2021 21:38:34 - INFO - __main__ - Step 2670: {'lr': 0.0004999747924135862, 'samples': 512640, 'steps': 2669, 'loss/train': 2.2657175064086914} 11/06/2021 21:38:35 - INFO - __main__ - Step 2671: {'lr': 0.0004999747169995325, 'samples': 512832, 'steps': 2670, 'loss/train': 2.049834966659546} 11/06/2021 21:38:35 - INFO - __main__ - Step 2672: {'lr': 0.0004999746414728441, 'samples': 513024, 'steps': 2671, 'loss/train': 2.0094645023345947} 11/06/2021 21:38:35 - INFO - __main__ - Step 2673: {'lr': 0.0004999745658335209, 'samples': 513216, 'steps': 2672, 'loss/train': 2.6007511615753174} 11/06/2021 21:38:36 - INFO - __main__ - Step 2674: {'lr': 0.000499974490081563, 'samples': 513408, 'steps': 2673, 'loss/train': 2.5795583724975586} 11/06/2021 21:38:37 - INFO - __main__ - Step 2675: {'lr': 0.0004999744142169707, 'samples': 513600, 'steps': 2674, 'loss/train': 1.848941445350647} 11/06/2021 21:38:37 - INFO - __main__ - Step 2676: {'lr': 0.0004999743382397435, 'samples': 513792, 'steps': 2675, 'loss/train': 2.2464025020599365} 11/06/2021 21:38:38 - INFO - __main__ - Step 2677: {'lr': 0.0004999742621498818, 'samples': 513984, 'steps': 2676, 'loss/train': 2.2649552822113037} 11/06/2021 21:38:38 - INFO - __main__ - Step 2678: {'lr': 0.0004999741859473857, 'samples': 514176, 'steps': 2677, 'loss/train': 2.3065757751464844} 11/06/2021 21:38:38 - INFO - __main__ - Step 2679: {'lr': 0.0004999741096322549, 'samples': 514368, 'steps': 2678, 'loss/train': 2.34328031539917} 11/06/2021 21:38:39 - INFO - __main__ - Step 2680: {'lr': 0.0004999740332044898, 'samples': 514560, 'steps': 2679, 'loss/train': 2.229511022567749} 11/06/2021 21:38:40 - INFO - __main__ - Step 2681: {'lr': 0.0004999739566640901, 'samples': 514752, 'steps': 2680, 'loss/train': 2.1822242736816406} 11/06/2021 21:38:40 - INFO - __main__ - Step 2682: {'lr': 0.000499973880011056, 'samples': 514944, 'steps': 2681, 'loss/train': 2.409409999847412} 11/06/2021 21:38:41 - INFO - __main__ - Step 2683: {'lr': 0.0004999738032453876, 'samples': 515136, 'steps': 2682, 'loss/train': 2.3872311115264893} 11/06/2021 21:38:41 - INFO - __main__ - Step 2684: {'lr': 0.0004999737263670848, 'samples': 515328, 'steps': 2683, 'loss/train': 2.3661561012268066} 11/06/2021 21:38:41 - INFO - __main__ - Step 2685: {'lr': 0.0004999736493761477, 'samples': 515520, 'steps': 2684, 'loss/train': 2.4622879028320312} 11/06/2021 21:38:42 - INFO - __main__ - Step 2686: {'lr': 0.0004999735722725765, 'samples': 515712, 'steps': 2685, 'loss/train': 1.6226389408111572} 11/06/2021 21:38:43 - INFO - __main__ - Step 2687: {'lr': 0.0004999734950563709, 'samples': 515904, 'steps': 2686, 'loss/train': 2.3979785442352295} 11/06/2021 21:38:43 - INFO - __main__ - Step 2688: {'lr': 0.0004999734177275311, 'samples': 516096, 'steps': 2687, 'loss/train': 2.707308769226074} 11/06/2021 21:38:43 - INFO - __main__ - Step 2689: {'lr': 0.0004999733402860572, 'samples': 516288, 'steps': 2688, 'loss/train': 2.6095635890960693} 11/06/2021 21:38:44 - INFO - __main__ - Step 2690: {'lr': 0.0004999732627319491, 'samples': 516480, 'steps': 2689, 'loss/train': 2.480802297592163} 11/06/2021 21:38:45 - INFO - __main__ - Step 2691: {'lr': 0.000499973185065207, 'samples': 516672, 'steps': 2690, 'loss/train': 2.4791250228881836} 11/06/2021 21:38:45 - INFO - __main__ - Step 2692: {'lr': 0.0004999731072858307, 'samples': 516864, 'steps': 2691, 'loss/train': 2.2488434314727783} 11/06/2021 21:38:45 - INFO - __main__ - Step 2693: {'lr': 0.0004999730293938205, 'samples': 517056, 'steps': 2692, 'loss/train': 2.4395229816436768} 11/06/2021 21:38:46 - INFO - __main__ - Step 2694: {'lr': 0.0004999729513891762, 'samples': 517248, 'steps': 2693, 'loss/train': 2.7909016609191895} 11/06/2021 21:38:46 - INFO - __main__ - Step 2695: {'lr': 0.000499972873271898, 'samples': 517440, 'steps': 2694, 'loss/train': 2.6461949348449707} 11/06/2021 21:38:47 - INFO - __main__ - Step 2696: {'lr': 0.0004999727950419859, 'samples': 517632, 'steps': 2695, 'loss/train': 2.5715339183807373} 11/06/2021 21:38:48 - INFO - __main__ - Step 2697: {'lr': 0.0004999727166994399, 'samples': 517824, 'steps': 2696, 'loss/train': 2.065563917160034} 11/06/2021 21:38:48 - INFO - __main__ - Step 2698: {'lr': 0.0004999726382442601, 'samples': 518016, 'steps': 2697, 'loss/train': 1.9356613159179688} 11/06/2021 21:38:48 - INFO - __main__ - Step 2699: {'lr': 0.0004999725596764465, 'samples': 518208, 'steps': 2698, 'loss/train': 1.576759934425354} 11/06/2021 21:38:49 - INFO - __main__ - Step 2700: {'lr': 0.000499972480995999, 'samples': 518400, 'steps': 2699, 'loss/train': 2.4644877910614014} 11/06/2021 21:38:49 - INFO - __main__ - Step 2701: {'lr': 0.0004999724022029179, 'samples': 518592, 'steps': 2700, 'loss/train': 2.8699357509613037} 11/06/2021 21:38:51 - INFO - __main__ - Step 2702: {'lr': 0.000499972323297203, 'samples': 518784, 'steps': 2701, 'loss/train': 2.5225210189819336} 11/06/2021 21:38:51 - INFO - __main__ - Step 2703: {'lr': 0.0004999722442788544, 'samples': 518976, 'steps': 2702, 'loss/train': 2.4827229976654053} 11/06/2021 21:38:51 - INFO - __main__ - Step 2704: {'lr': 0.0004999721651478723, 'samples': 519168, 'steps': 2703, 'loss/train': 1.9105079174041748} 11/06/2021 21:38:52 - INFO - __main__ - Step 2705: {'lr': 0.0004999720859042565, 'samples': 519360, 'steps': 2704, 'loss/train': 2.2500193119049072} 11/06/2021 21:38:52 - INFO - __main__ - Step 2706: {'lr': 0.0004999720065480071, 'samples': 519552, 'steps': 2705, 'loss/train': 2.6277735233306885} 11/06/2021 21:38:52 - INFO - __main__ - Step 2707: {'lr': 0.0004999719270791242, 'samples': 519744, 'steps': 2706, 'loss/train': 1.4648323059082031} 11/06/2021 21:38:53 - INFO - __main__ - Step 2708: {'lr': 0.0004999718474976078, 'samples': 519936, 'steps': 2707, 'loss/train': 2.773413896560669} 11/06/2021 21:38:54 - INFO - __main__ - Step 2709: {'lr': 0.000499971767803458, 'samples': 520128, 'steps': 2708, 'loss/train': 2.3420262336730957} 11/06/2021 21:38:54 - INFO - __main__ - Step 2710: {'lr': 0.0004999716879966747, 'samples': 520320, 'steps': 2709, 'loss/train': 2.6745805740356445} 11/06/2021 21:38:54 - INFO - __main__ - Step 2711: {'lr': 0.000499971608077258, 'samples': 520512, 'steps': 2710, 'loss/train': 1.5498085021972656} 11/06/2021 21:38:55 - INFO - __main__ - Step 2712: {'lr': 0.000499971528045208, 'samples': 520704, 'steps': 2711, 'loss/train': 2.7040622234344482} 11/06/2021 21:38:55 - INFO - __main__ - Step 2713: {'lr': 0.0004999714479005248, 'samples': 520896, 'steps': 2712, 'loss/train': 3.5146377086639404} 11/06/2021 21:38:56 - INFO - __main__ - Step 2714: {'lr': 0.0004999713676432082, 'samples': 521088, 'steps': 2713, 'loss/train': 2.1871511936187744} 11/06/2021 21:38:56 - INFO - __main__ - Step 2715: {'lr': 0.0004999712872732584, 'samples': 521280, 'steps': 2714, 'loss/train': 2.5216920375823975} 11/06/2021 21:38:57 - INFO - __main__ - Step 2716: {'lr': 0.0004999712067906754, 'samples': 521472, 'steps': 2715, 'loss/train': 2.882300615310669} 11/06/2021 21:38:57 - INFO - __main__ - Step 2717: {'lr': 0.0004999711261954591, 'samples': 521664, 'steps': 2716, 'loss/train': 2.44183349609375} 11/06/2021 21:38:57 - INFO - __main__ - Step 2718: {'lr': 0.0004999710454876099, 'samples': 521856, 'steps': 2717, 'loss/train': 2.10378360748291} 11/06/2021 21:38:59 - INFO - __main__ - Step 2719: {'lr': 0.0004999709646671274, 'samples': 522048, 'steps': 2718, 'loss/train': 2.456674098968506} 11/06/2021 21:38:59 - INFO - __main__ - Step 2720: {'lr': 0.0004999708837340119, 'samples': 522240, 'steps': 2719, 'loss/train': 2.4138710498809814} 11/06/2021 21:38:59 - INFO - __main__ - Step 2721: {'lr': 0.0004999708026882635, 'samples': 522432, 'steps': 2720, 'loss/train': 2.910074234008789} 11/06/2021 21:39:00 - INFO - __main__ - Step 2722: {'lr': 0.000499970721529882, 'samples': 522624, 'steps': 2721, 'loss/train': 3.079136848449707} 11/06/2021 21:39:00 - INFO - __main__ - Step 2723: {'lr': 0.0004999706402588675, 'samples': 522816, 'steps': 2722, 'loss/train': 2.3095333576202393} 11/06/2021 21:39:01 - INFO - __main__ - Step 2724: {'lr': 0.0004999705588752202, 'samples': 523008, 'steps': 2723, 'loss/train': 1.8380732536315918} 11/06/2021 21:39:01 - INFO - __main__ - Step 2725: {'lr': 0.00049997047737894, 'samples': 523200, 'steps': 2724, 'loss/train': 2.072640895843506} 11/06/2021 21:39:02 - INFO - __main__ - Step 2726: {'lr': 0.0004999703957700269, 'samples': 523392, 'steps': 2725, 'loss/train': 2.321617603302002} 11/06/2021 21:39:02 - INFO - __main__ - Step 2727: {'lr': 0.000499970314048481, 'samples': 523584, 'steps': 2726, 'loss/train': 2.2870664596557617} 11/06/2021 21:39:02 - INFO - __main__ - Step 2728: {'lr': 0.0004999702322143023, 'samples': 523776, 'steps': 2727, 'loss/train': 2.508100748062134} 11/06/2021 21:39:03 - INFO - __main__ - Step 2729: {'lr': 0.000499970150267491, 'samples': 523968, 'steps': 2728, 'loss/train': 2.8131275177001953} 11/06/2021 21:39:04 - INFO - __main__ - Step 2730: {'lr': 0.0004999700682080469, 'samples': 524160, 'steps': 2729, 'loss/train': 2.412299871444702} 11/06/2021 21:39:04 - INFO - __main__ - Step 2731: {'lr': 0.0004999699860359702, 'samples': 524352, 'steps': 2730, 'loss/train': 2.656719446182251} 11/06/2021 21:39:04 - INFO - __main__ - Step 2732: {'lr': 0.0004999699037512608, 'samples': 524544, 'steps': 2731, 'loss/train': 2.0453786849975586} 11/06/2021 21:39:05 - INFO - __main__ - Step 2733: {'lr': 0.000499969821353919, 'samples': 524736, 'steps': 2732, 'loss/train': 1.5553648471832275} 11/06/2021 21:39:06 - INFO - __main__ - Step 2734: {'lr': 0.0004999697388439444, 'samples': 524928, 'steps': 2733, 'loss/train': 2.4622724056243896} 11/06/2021 21:39:06 - INFO - __main__ - Step 2735: {'lr': 0.0004999696562213375, 'samples': 525120, 'steps': 2734, 'loss/train': 2.221712827682495} 11/06/2021 21:39:06 - INFO - __main__ - Step 2736: {'lr': 0.0004999695734860981, 'samples': 525312, 'steps': 2735, 'loss/train': 2.3038675785064697} 11/06/2021 21:39:07 - INFO - __main__ - Step 2737: {'lr': 0.0004999694906382262, 'samples': 525504, 'steps': 2736, 'loss/train': 2.3742024898529053} 11/06/2021 21:39:07 - INFO - __main__ - Step 2738: {'lr': 0.0004999694076777219, 'samples': 525696, 'steps': 2737, 'loss/train': 2.387545347213745} 11/06/2021 21:39:08 - INFO - __main__ - Step 2739: {'lr': 0.0004999693246045854, 'samples': 525888, 'steps': 2738, 'loss/train': 2.2682254314422607} 11/06/2021 21:39:08 - INFO - __main__ - Step 2740: {'lr': 0.0004999692414188164, 'samples': 526080, 'steps': 2739, 'loss/train': 1.0920865535736084} 11/06/2021 21:39:09 - INFO - __main__ - Step 2741: {'lr': 0.0004999691581204152, 'samples': 526272, 'steps': 2740, 'loss/train': 2.4095451831817627} 11/06/2021 21:39:09 - INFO - __main__ - Step 2742: {'lr': 0.0004999690747093816, 'samples': 526464, 'steps': 2741, 'loss/train': 2.6026110649108887} 11/06/2021 21:39:09 - INFO - __main__ - Step 2743: {'lr': 0.000499968991185716, 'samples': 526656, 'steps': 2742, 'loss/train': 2.7001304626464844} 11/06/2021 21:39:10 - INFO - __main__ - Step 2744: {'lr': 0.0004999689075494182, 'samples': 526848, 'steps': 2743, 'loss/train': 2.8792505264282227} 11/06/2021 21:39:11 - INFO - __main__ - Step 2745: {'lr': 0.0004999688238004882, 'samples': 527040, 'steps': 2744, 'loss/train': 1.739349603652954} 11/06/2021 21:39:11 - INFO - __main__ - Step 2746: {'lr': 0.0004999687399389262, 'samples': 527232, 'steps': 2745, 'loss/train': 1.9313324689865112} 11/06/2021 21:39:11 - INFO - __main__ - Step 2747: {'lr': 0.0004999686559647319, 'samples': 527424, 'steps': 2746, 'loss/train': 2.5279910564422607} 11/06/2021 21:39:12 - INFO - __main__ - Step 2748: {'lr': 0.0004999685718779058, 'samples': 527616, 'steps': 2747, 'loss/train': 2.2004690170288086} 11/06/2021 21:39:12 - INFO - __main__ - Step 2749: {'lr': 0.0004999684876784477, 'samples': 527808, 'steps': 2748, 'loss/train': 2.4345414638519287} 11/06/2021 21:39:13 - INFO - __main__ - Step 2750: {'lr': 0.0004999684033663576, 'samples': 528000, 'steps': 2749, 'loss/train': 1.688835620880127} 11/06/2021 21:39:14 - INFO - __main__ - Step 2751: {'lr': 0.0004999683189416356, 'samples': 528192, 'steps': 2750, 'loss/train': 1.597861647605896} 11/06/2021 21:39:14 - INFO - __main__ - Step 2752: {'lr': 0.0004999682344042817, 'samples': 528384, 'steps': 2751, 'loss/train': 2.844277858734131} 11/06/2021 21:39:14 - INFO - __main__ - Step 2753: {'lr': 0.000499968149754296, 'samples': 528576, 'steps': 2752, 'loss/train': 2.3030660152435303} 11/06/2021 21:39:15 - INFO - __main__ - Step 2754: {'lr': 0.0004999680649916786, 'samples': 528768, 'steps': 2753, 'loss/train': 2.6878316402435303} 11/06/2021 21:39:16 - INFO - __main__ - Step 2755: {'lr': 0.0004999679801164295, 'samples': 528960, 'steps': 2754, 'loss/train': 0.8238305449485779} 11/06/2021 21:39:17 - INFO - __main__ - Step 2756: {'lr': 0.0004999678951285485, 'samples': 529152, 'steps': 2755, 'loss/train': 2.0880188941955566} 11/06/2021 21:39:17 - INFO - __main__ - Step 2757: {'lr': 0.0004999678100280358, 'samples': 529344, 'steps': 2756, 'loss/train': 2.4757795333862305} 11/06/2021 21:39:17 - INFO - __main__ - Step 2758: {'lr': 0.0004999677248148916, 'samples': 529536, 'steps': 2757, 'loss/train': 2.4732248783111572} 11/06/2021 21:39:18 - INFO - __main__ - Step 2759: {'lr': 0.0004999676394891158, 'samples': 529728, 'steps': 2758, 'loss/train': 2.1484673023223877} 11/06/2021 21:39:18 - INFO - __main__ - Step 2760: {'lr': 0.0004999675540507083, 'samples': 529920, 'steps': 2759, 'loss/train': 3.0172383785247803} 11/06/2021 21:39:19 - INFO - __main__ - Step 2761: {'lr': 0.0004999674684996694, 'samples': 530112, 'steps': 2760, 'loss/train': 2.785417079925537} 11/06/2021 21:39:19 - INFO - __main__ - Step 2762: {'lr': 0.0004999673828359989, 'samples': 530304, 'steps': 2761, 'loss/train': 2.5744974613189697} 11/06/2021 21:39:20 - INFO - __main__ - Step 2763: {'lr': 0.0004999672970596971, 'samples': 530496, 'steps': 2762, 'loss/train': 2.5258822441101074} 11/06/2021 21:39:20 - INFO - __main__ - Step 2764: {'lr': 0.0004999672111707639, 'samples': 530688, 'steps': 2763, 'loss/train': 3.058380603790283} 11/06/2021 21:39:21 - INFO - __main__ - Step 2765: {'lr': 0.0004999671251691991, 'samples': 530880, 'steps': 2764, 'loss/train': 2.8081789016723633} 11/06/2021 21:39:21 - INFO - __main__ - Step 2766: {'lr': 0.0004999670390550032, 'samples': 531072, 'steps': 2765, 'loss/train': 2.2795279026031494} 11/06/2021 21:39:22 - INFO - __main__ - Step 2767: {'lr': 0.000499966952828176, 'samples': 531264, 'steps': 2766, 'loss/train': 1.8465057611465454} 11/06/2021 21:39:22 - INFO - __main__ - Step 2768: {'lr': 0.0004999668664887175, 'samples': 531456, 'steps': 2767, 'loss/train': 2.2657551765441895} 11/06/2021 21:39:23 - INFO - __main__ - Step 2769: {'lr': 0.0004999667800366278, 'samples': 531648, 'steps': 2768, 'loss/train': 2.2362067699432373} 11/06/2021 21:39:23 - INFO - __main__ - Step 2770: {'lr': 0.0004999666934719069, 'samples': 531840, 'steps': 2769, 'loss/train': 1.8398356437683105} 11/06/2021 21:39:23 - INFO - __main__ - Step 2771: {'lr': 0.0004999666067945548, 'samples': 532032, 'steps': 2770, 'loss/train': 2.37894868850708} 11/06/2021 21:39:24 - INFO - __main__ - Step 2772: {'lr': 0.0004999665200045716, 'samples': 532224, 'steps': 2771, 'loss/train': 1.0660734176635742} 11/06/2021 21:39:25 - INFO - __main__ - Step 2773: {'lr': 0.0004999664331019574, 'samples': 532416, 'steps': 2772, 'loss/train': 2.489011287689209} 11/06/2021 21:39:25 - INFO - __main__ - Step 2774: {'lr': 0.0004999663460867123, 'samples': 532608, 'steps': 2773, 'loss/train': 2.3319153785705566} 11/06/2021 21:39:25 - INFO - __main__ - Step 2775: {'lr': 0.000499966258958836, 'samples': 532800, 'steps': 2774, 'loss/train': 2.5055582523345947} 11/06/2021 21:39:26 - INFO - __main__ - Step 2776: {'lr': 0.000499966171718329, 'samples': 532992, 'steps': 2775, 'loss/train': 2.4398868083953857} 11/06/2021 21:39:27 - INFO - __main__ - Step 2777: {'lr': 0.000499966084365191, 'samples': 533184, 'steps': 2776, 'loss/train': 2.4442226886749268} 11/06/2021 21:39:27 - INFO - __main__ - Step 2778: {'lr': 0.0004999659968994221, 'samples': 533376, 'steps': 2777, 'loss/train': 2.8139424324035645} 11/06/2021 21:39:28 - INFO - __main__ - Step 2779: {'lr': 0.0004999659093210223, 'samples': 533568, 'steps': 2778, 'loss/train': 2.4639973640441895} 11/06/2021 21:39:28 - INFO - __main__ - Step 2780: {'lr': 0.0004999658216299919, 'samples': 533760, 'steps': 2779, 'loss/train': 2.217278242111206} 11/06/2021 21:39:28 - INFO - __main__ - Step 2781: {'lr': 0.0004999657338263308, 'samples': 533952, 'steps': 2780, 'loss/train': 2.8546433448791504} 11/06/2021 21:39:29 - INFO - __main__ - Step 2782: {'lr': 0.0004999656459100388, 'samples': 534144, 'steps': 2781, 'loss/train': 2.2053894996643066} 11/06/2021 21:39:30 - INFO - __main__ - Step 2783: {'lr': 0.0004999655578811161, 'samples': 534336, 'steps': 2782, 'loss/train': 2.2209572792053223} 11/06/2021 21:39:30 - INFO - __main__ - Step 2784: {'lr': 0.0004999654697395629, 'samples': 534528, 'steps': 2783, 'loss/train': 2.64697527885437} 11/06/2021 21:39:30 - INFO - __main__ - Step 2785: {'lr': 0.0004999653814853791, 'samples': 534720, 'steps': 2784, 'loss/train': 2.1000912189483643} 11/06/2021 21:39:31 - INFO - __main__ - Step 2786: {'lr': 0.0004999652931185648, 'samples': 534912, 'steps': 2785, 'loss/train': 2.200310468673706} 11/06/2021 21:39:32 - INFO - __main__ - Step 2787: {'lr': 0.00049996520463912, 'samples': 535104, 'steps': 2786, 'loss/train': 2.8074491024017334} 11/06/2021 21:39:32 - INFO - __main__ - Step 2788: {'lr': 0.0004999651160470447, 'samples': 535296, 'steps': 2787, 'loss/train': 2.2512643337249756} 11/06/2021 21:39:32 - INFO - __main__ - Step 2789: {'lr': 0.0004999650273423389, 'samples': 535488, 'steps': 2788, 'loss/train': 2.207897663116455} 11/06/2021 21:39:33 - INFO - __main__ - Step 2790: {'lr': 0.0004999649385250028, 'samples': 535680, 'steps': 2789, 'loss/train': 2.473784923553467} 11/06/2021 21:39:33 - INFO - __main__ - Step 2791: {'lr': 0.0004999648495950363, 'samples': 535872, 'steps': 2790, 'loss/train': 2.487807273864746} 11/06/2021 21:39:34 - INFO - __main__ - Step 2792: {'lr': 0.0004999647605524396, 'samples': 536064, 'steps': 2791, 'loss/train': 2.234999895095825} 11/06/2021 21:39:35 - INFO - __main__ - Step 2793: {'lr': 0.0004999646713972126, 'samples': 536256, 'steps': 2792, 'loss/train': 2.1799986362457275} 11/06/2021 21:39:35 - INFO - __main__ - Step 2794: {'lr': 0.0004999645821293552, 'samples': 536448, 'steps': 2793, 'loss/train': 1.1634377241134644} 11/06/2021 21:39:35 - INFO - __main__ - Step 2795: {'lr': 0.0004999644927488678, 'samples': 536640, 'steps': 2794, 'loss/train': 2.290945529937744} 11/06/2021 21:39:36 - INFO - __main__ - Step 2796: {'lr': 0.0004999644032557503, 'samples': 536832, 'steps': 2795, 'loss/train': 2.273956537246704} 11/06/2021 21:39:37 - INFO - __main__ - Step 2797: {'lr': 0.0004999643136500027, 'samples': 537024, 'steps': 2796, 'loss/train': 1.5732803344726562} 11/06/2021 21:39:37 - INFO - __main__ - Step 2798: {'lr': 0.0004999642239316249, 'samples': 537216, 'steps': 2797, 'loss/train': 2.646408796310425} 11/06/2021 21:39:38 - INFO - __main__ - Step 2799: {'lr': 0.000499964134100617, 'samples': 537408, 'steps': 2798, 'loss/train': 2.802149772644043} 11/06/2021 21:39:38 - INFO - __main__ - Step 2800: {'lr': 0.0004999640441569793, 'samples': 537600, 'steps': 2799, 'loss/train': 1.9303408861160278} 11/06/2021 21:39:38 - INFO - __main__ - Step 2801: {'lr': 0.0004999639541007116, 'samples': 537792, 'steps': 2800, 'loss/train': 2.0123252868652344} 11/06/2021 21:39:39 - INFO - __main__ - Step 2802: {'lr': 0.0004999638639318141, 'samples': 537984, 'steps': 2801, 'loss/train': 2.5675606727600098} 11/06/2021 21:39:40 - INFO - __main__ - Step 2803: {'lr': 0.0004999637736502866, 'samples': 538176, 'steps': 2802, 'loss/train': 2.3583943843841553} 11/06/2021 21:39:40 - INFO - __main__ - Step 2804: {'lr': 0.0004999636832561293, 'samples': 538368, 'steps': 2803, 'loss/train': 2.4021947383880615} 11/06/2021 21:39:40 - INFO - __main__ - Step 2805: {'lr': 0.0004999635927493423, 'samples': 538560, 'steps': 2804, 'loss/train': 2.4244282245635986} 11/06/2021 21:39:41 - INFO - __main__ - Step 2806: {'lr': 0.0004999635021299255, 'samples': 538752, 'steps': 2805, 'loss/train': 2.661571741104126} 11/06/2021 21:39:41 - INFO - __main__ - Step 2807: {'lr': 0.0004999634113978791, 'samples': 538944, 'steps': 2806, 'loss/train': 2.562166929244995} 11/06/2021 21:39:42 - INFO - __main__ - Step 2808: {'lr': 0.0004999633205532029, 'samples': 539136, 'steps': 2807, 'loss/train': 2.0239551067352295} 11/06/2021 21:39:43 - INFO - __main__ - Step 2809: {'lr': 0.0004999632295958972, 'samples': 539328, 'steps': 2808, 'loss/train': 2.355177164077759} 11/06/2021 21:39:43 - INFO - __main__ - Step 2810: {'lr': 0.0004999631385259617, 'samples': 539520, 'steps': 2809, 'loss/train': 2.7225186824798584} 11/06/2021 21:39:43 - INFO - __main__ - Step 2811: {'lr': 0.000499963047343397, 'samples': 539712, 'steps': 2810, 'loss/train': 1.7809796333312988} 11/06/2021 21:39:44 - INFO - __main__ - Step 2812: {'lr': 0.0004999629560482026, 'samples': 539904, 'steps': 2811, 'loss/train': 1.2682995796203613} 11/06/2021 21:39:45 - INFO - __main__ - Step 2813: {'lr': 0.0004999628646403788, 'samples': 540096, 'steps': 2812, 'loss/train': 2.061248302459717} 11/06/2021 21:39:45 - INFO - __main__ - Step 2814: {'lr': 0.0004999627731199256, 'samples': 540288, 'steps': 2813, 'loss/train': 2.505889654159546} 11/06/2021 21:39:45 - INFO - __main__ - Step 2815: {'lr': 0.0004999626814868429, 'samples': 540480, 'steps': 2814, 'loss/train': 1.8169455528259277} 11/06/2021 21:39:46 - INFO - __main__ - Step 2816: {'lr': 0.0004999625897411311, 'samples': 540672, 'steps': 2815, 'loss/train': 2.5656561851501465} 11/06/2021 21:39:46 - INFO - __main__ - Step 2817: {'lr': 0.0004999624978827899, 'samples': 540864, 'steps': 2816, 'loss/train': 2.479475736618042} 11/06/2021 21:39:47 - INFO - __main__ - Step 2818: {'lr': 0.0004999624059118194, 'samples': 541056, 'steps': 2817, 'loss/train': 1.6999139785766602} 11/06/2021 21:39:47 - INFO - __main__ - Step 2819: {'lr': 0.0004999623138282198, 'samples': 541248, 'steps': 2818, 'loss/train': 2.207723617553711} 11/06/2021 21:39:48 - INFO - __main__ - Step 2820: {'lr': 0.000499962221631991, 'samples': 541440, 'steps': 2819, 'loss/train': 2.569610357284546} 11/06/2021 21:39:48 - INFO - __main__ - Step 2821: {'lr': 0.0004999621293231331, 'samples': 541632, 'steps': 2820, 'loss/train': 2.8650639057159424} 11/06/2021 21:39:48 - INFO - __main__ - Step 2822: {'lr': 0.0004999620369016461, 'samples': 541824, 'steps': 2821, 'loss/train': 2.0223443508148193} 11/06/2021 21:39:49 - INFO - __main__ - Step 2823: {'lr': 0.00049996194436753, 'samples': 542016, 'steps': 2822, 'loss/train': 2.039158344268799} 11/06/2021 21:39:50 - INFO - __main__ - Step 2824: {'lr': 0.000499961851720785, 'samples': 542208, 'steps': 2823, 'loss/train': 2.3247768878936768} 11/06/2021 21:39:50 - INFO - __main__ - Step 2825: {'lr': 0.000499961758961411, 'samples': 542400, 'steps': 2824, 'loss/train': 2.4017333984375} 11/06/2021 21:39:50 - INFO - __main__ - Step 2826: {'lr': 0.0004999616660894081, 'samples': 542592, 'steps': 2825, 'loss/train': 2.326307535171509} 11/06/2021 21:39:51 - INFO - __main__ - Step 2827: {'lr': 0.0004999615731047762, 'samples': 542784, 'steps': 2826, 'loss/train': 2.21771240234375} 11/06/2021 21:39:52 - INFO - __main__ - Step 2828: {'lr': 0.0004999614800075158, 'samples': 542976, 'steps': 2827, 'loss/train': 1.7368288040161133} 11/06/2021 21:39:52 - INFO - __main__ - Step 2829: {'lr': 0.0004999613867976264, 'samples': 543168, 'steps': 2828, 'loss/train': 2.7821502685546875} 11/06/2021 21:39:53 - INFO - __main__ - Step 2830: {'lr': 0.0004999612934751082, 'samples': 543360, 'steps': 2829, 'loss/train': 2.315138101577759} 11/06/2021 21:39:53 - INFO - __main__ - Step 2831: {'lr': 0.0004999612000399614, 'samples': 543552, 'steps': 2830, 'loss/train': 2.502418041229248} 11/06/2021 21:39:53 - INFO - __main__ - Step 2832: {'lr': 0.0004999611064921859, 'samples': 543744, 'steps': 2831, 'loss/train': 2.381127119064331} 11/06/2021 21:39:54 - INFO - __main__ - Step 2833: {'lr': 0.0004999610128317818, 'samples': 543936, 'steps': 2832, 'loss/train': 1.7340575456619263} 11/06/2021 21:39:55 - INFO - __main__ - Step 2834: {'lr': 0.0004999609190587492, 'samples': 544128, 'steps': 2833, 'loss/train': 2.435819387435913} 11/06/2021 21:39:55 - INFO - __main__ - Step 2835: {'lr': 0.000499960825173088, 'samples': 544320, 'steps': 2834, 'loss/train': 2.3425838947296143} 11/06/2021 21:39:55 - INFO - __main__ - Step 2836: {'lr': 0.0004999607311747983, 'samples': 544512, 'steps': 2835, 'loss/train': 1.7845638990402222} 11/06/2021 21:39:56 - INFO - __main__ - Step 2837: {'lr': 0.0004999606370638801, 'samples': 544704, 'steps': 2836, 'loss/train': 1.946204423904419} 11/06/2021 21:39:56 - INFO - __main__ - Step 2838: {'lr': 0.0004999605428403336, 'samples': 544896, 'steps': 2837, 'loss/train': 2.6006252765655518} 11/06/2021 21:39:57 - INFO - __main__ - Step 2839: {'lr': 0.0004999604485041585, 'samples': 545088, 'steps': 2838, 'loss/train': 2.409191131591797} 11/06/2021 21:39:57 - INFO - __main__ - Step 2840: {'lr': 0.0004999603540553554, 'samples': 545280, 'steps': 2839, 'loss/train': 2.562089204788208} 11/06/2021 21:39:58 - INFO - __main__ - Step 2841: {'lr': 0.0004999602594939238, 'samples': 545472, 'steps': 2840, 'loss/train': 2.2880022525787354} 11/06/2021 21:39:58 - INFO - __main__ - Step 2842: {'lr': 0.0004999601648198641, 'samples': 545664, 'steps': 2841, 'loss/train': 2.791858434677124} 11/06/2021 21:39:58 - INFO - __main__ - Step 2843: {'lr': 0.0004999600700331761, 'samples': 545856, 'steps': 2842, 'loss/train': 2.366997003555298} 11/06/2021 21:40:00 - INFO - __main__ - Step 2844: {'lr': 0.0004999599751338601, 'samples': 546048, 'steps': 2843, 'loss/train': 2.355536937713623} 11/06/2021 21:40:00 - INFO - __main__ - Step 2845: {'lr': 0.0004999598801219158, 'samples': 546240, 'steps': 2844, 'loss/train': 1.3548896312713623} 11/06/2021 21:40:00 - INFO - __main__ - Step 2846: {'lr': 0.0004999597849973435, 'samples': 546432, 'steps': 2845, 'loss/train': 2.1477577686309814} 11/06/2021 21:40:01 - INFO - __main__ - Step 2847: {'lr': 0.0004999596897601432, 'samples': 546624, 'steps': 2846, 'loss/train': 1.9438403844833374} 11/06/2021 21:40:01 - INFO - __main__ - Step 2848: {'lr': 0.0004999595944103149, 'samples': 546816, 'steps': 2847, 'loss/train': 2.6734774112701416} 11/06/2021 21:40:02 - INFO - __main__ - Step 2849: {'lr': 0.0004999594989478587, 'samples': 547008, 'steps': 2848, 'loss/train': 2.6153666973114014} 11/06/2021 21:40:02 - INFO - __main__ - Step 2850: {'lr': 0.0004999594033727747, 'samples': 547200, 'steps': 2849, 'loss/train': 2.0793142318725586} 11/06/2021 21:40:03 - INFO - __main__ - Step 2851: {'lr': 0.0004999593076850627, 'samples': 547392, 'steps': 2850, 'loss/train': 2.2441251277923584} 11/06/2021 21:40:03 - INFO - __main__ - Step 2852: {'lr': 0.0004999592118847229, 'samples': 547584, 'steps': 2851, 'loss/train': 2.7743895053863525} 11/06/2021 21:40:03 - INFO - __main__ - Step 2853: {'lr': 0.0004999591159717554, 'samples': 547776, 'steps': 2852, 'loss/train': 2.089233160018921} 11/06/2021 21:40:04 - INFO - __main__ - Step 2854: {'lr': 0.0004999590199461602, 'samples': 547968, 'steps': 2853, 'loss/train': 2.220766067504883} 11/06/2021 21:40:05 - INFO - __main__ - Step 2855: {'lr': 0.0004999589238079373, 'samples': 548160, 'steps': 2854, 'loss/train': 2.098376989364624} 11/06/2021 21:40:05 - INFO - __main__ - Step 2856: {'lr': 0.0004999588275570868, 'samples': 548352, 'steps': 2855, 'loss/train': 2.4644670486450195} 11/06/2021 21:40:05 - INFO - __main__ - Step 2857: {'lr': 0.0004999587311936086, 'samples': 548544, 'steps': 2856, 'loss/train': 2.239152193069458} 11/06/2021 21:40:06 - INFO - __main__ - Step 2858: {'lr': 0.000499958634717503, 'samples': 548736, 'steps': 2857, 'loss/train': 1.4249017238616943} 11/06/2021 21:40:07 - INFO - __main__ - Step 2859: {'lr': 0.0004999585381287696, 'samples': 548928, 'steps': 2858, 'loss/train': 2.2999162673950195} 11/06/2021 21:40:07 - INFO - __main__ - Step 2860: {'lr': 0.000499958441427409, 'samples': 549120, 'steps': 2859, 'loss/train': 2.8387157917022705} 11/06/2021 21:40:08 - INFO - __main__ - Step 2861: {'lr': 0.0004999583446134209, 'samples': 549312, 'steps': 2860, 'loss/train': 2.874333620071411} 11/06/2021 21:40:08 - INFO - __main__ - Step 2862: {'lr': 0.0004999582476868055, 'samples': 549504, 'steps': 2861, 'loss/train': 2.6905484199523926} 11/06/2021 21:40:08 - INFO - __main__ - Step 2863: {'lr': 0.0004999581506475627, 'samples': 549696, 'steps': 2862, 'loss/train': 2.1297647953033447} 11/06/2021 21:40:09 - INFO - __main__ - Step 2864: {'lr': 0.0004999580534956927, 'samples': 549888, 'steps': 2863, 'loss/train': 2.871276378631592} 11/06/2021 21:40:10 - INFO - __main__ - Step 2865: {'lr': 0.0004999579562311953, 'samples': 550080, 'steps': 2864, 'loss/train': 1.109349012374878} 11/06/2021 21:40:10 - INFO - __main__ - Step 2866: {'lr': 0.0004999578588540709, 'samples': 550272, 'steps': 2865, 'loss/train': 2.0491299629211426} 11/06/2021 21:40:10 - INFO - __main__ - Step 2867: {'lr': 0.0004999577613643192, 'samples': 550464, 'steps': 2866, 'loss/train': 2.0363290309906006} 11/06/2021 21:40:11 - INFO - __main__ - Step 2868: {'lr': 0.0004999576637619404, 'samples': 550656, 'steps': 2867, 'loss/train': 2.5006585121154785} 11/06/2021 21:40:11 - INFO - __main__ - Step 2869: {'lr': 0.0004999575660469347, 'samples': 550848, 'steps': 2868, 'loss/train': 2.5980803966522217} 11/06/2021 21:40:12 - INFO - __main__ - Step 2870: {'lr': 0.0004999574682193017, 'samples': 551040, 'steps': 2869, 'loss/train': 1.786831259727478} 11/06/2021 21:40:13 - INFO - __main__ - Step 2871: {'lr': 0.0004999573702790419, 'samples': 551232, 'steps': 2870, 'loss/train': 1.8787875175476074} 11/06/2021 21:40:13 - INFO - __main__ - Step 2872: {'lr': 0.0004999572722261551, 'samples': 551424, 'steps': 2871, 'loss/train': 1.8773044347763062} 11/06/2021 21:40:13 - INFO - __main__ - Step 2873: {'lr': 0.0004999571740606415, 'samples': 551616, 'steps': 2872, 'loss/train': 2.097402572631836} 11/06/2021 21:40:14 - INFO - __main__ - Step 2874: {'lr': 0.000499957075782501, 'samples': 551808, 'steps': 2873, 'loss/train': 1.8168832063674927} 11/06/2021 21:40:15 - INFO - __main__ - Step 2875: {'lr': 0.0004999569773917337, 'samples': 552000, 'steps': 2874, 'loss/train': 2.0323092937469482} 11/06/2021 21:40:15 - INFO - __main__ - Step 2876: {'lr': 0.0004999568788883397, 'samples': 552192, 'steps': 2875, 'loss/train': 1.8158880472183228} 11/06/2021 21:40:15 - INFO - __main__ - Step 2877: {'lr': 0.0004999567802723188, 'samples': 552384, 'steps': 2876, 'loss/train': 2.2598671913146973} 11/06/2021 21:40:16 - INFO - __main__ - Step 2878: {'lr': 0.0004999566815436715, 'samples': 552576, 'steps': 2877, 'loss/train': 2.445081949234009} 11/06/2021 21:40:16 - INFO - __main__ - Step 2879: {'lr': 0.0004999565827023974, 'samples': 552768, 'steps': 2878, 'loss/train': 2.130836009979248} 11/06/2021 21:40:17 - INFO - __main__ - Step 2880: {'lr': 0.0004999564837484967, 'samples': 552960, 'steps': 2879, 'loss/train': 2.3038902282714844} 11/06/2021 21:40:17 - INFO - __main__ - Step 2881: {'lr': 0.0004999563846819696, 'samples': 553152, 'steps': 2880, 'loss/train': 2.3476128578186035} 11/06/2021 21:40:18 - INFO - __main__ - Step 2882: {'lr': 0.0004999562855028159, 'samples': 553344, 'steps': 2881, 'loss/train': 2.6095919609069824} 11/06/2021 21:40:18 - INFO - __main__ - Step 2883: {'lr': 0.0004999561862110358, 'samples': 553536, 'steps': 2882, 'loss/train': 1.861746072769165} 11/06/2021 21:40:18 - INFO - __main__ - Step 2884: {'lr': 0.0004999560868066293, 'samples': 553728, 'steps': 2883, 'loss/train': 2.09774112701416} 11/06/2021 21:40:20 - INFO - __main__ - Step 2885: {'lr': 0.0004999559872895964, 'samples': 553920, 'steps': 2884, 'loss/train': 1.9192487001419067} 11/06/2021 21:40:20 - INFO - __main__ - Step 2886: {'lr': 0.0004999558876599373, 'samples': 554112, 'steps': 2885, 'loss/train': 2.2191450595855713} 11/06/2021 21:40:20 - INFO - __main__ - Step 2887: {'lr': 0.0004999557879176518, 'samples': 554304, 'steps': 2886, 'loss/train': 2.411027193069458} 11/06/2021 21:40:21 - INFO - __main__ - Step 2888: {'lr': 0.0004999556880627401, 'samples': 554496, 'steps': 2887, 'loss/train': 2.7569634914398193} 11/06/2021 21:40:21 - INFO - __main__ - Step 2889: {'lr': 0.0004999555880952023, 'samples': 554688, 'steps': 2888, 'loss/train': 2.49636173248291} 11/06/2021 21:40:22 - INFO - __main__ - Step 2890: {'lr': 0.0004999554880150383, 'samples': 554880, 'steps': 2889, 'loss/train': 2.0643162727355957} 11/06/2021 21:40:22 - INFO - __main__ - Step 2891: {'lr': 0.0004999553878222482, 'samples': 555072, 'steps': 2890, 'loss/train': 1.2182731628417969} 11/06/2021 21:40:23 - INFO - __main__ - Step 2892: {'lr': 0.0004999552875168321, 'samples': 555264, 'steps': 2891, 'loss/train': 2.110027551651001} 11/06/2021 21:40:23 - INFO - __main__ - Step 2893: {'lr': 0.0004999551870987901, 'samples': 555456, 'steps': 2892, 'loss/train': 2.0621578693389893} 11/06/2021 21:40:23 - INFO - __main__ - Step 2894: {'lr': 0.000499955086568122, 'samples': 555648, 'steps': 2893, 'loss/train': 2.4326114654541016} 11/06/2021 21:40:24 - INFO - __main__ - Step 2895: {'lr': 0.000499954985924828, 'samples': 555840, 'steps': 2894, 'loss/train': 1.992488980293274} 11/06/2021 21:40:25 - INFO - __main__ - Step 2896: {'lr': 0.0004999548851689082, 'samples': 556032, 'steps': 2895, 'loss/train': 1.6926281452178955} 11/06/2021 21:40:25 - INFO - __main__ - Step 2897: {'lr': 0.0004999547843003627, 'samples': 556224, 'steps': 2896, 'loss/train': 2.2497506141662598} 11/06/2021 21:40:25 - INFO - __main__ - Step 2898: {'lr': 0.0004999546833191912, 'samples': 556416, 'steps': 2897, 'loss/train': 2.4135076999664307} 11/06/2021 21:40:26 - INFO - __main__ - Step 2899: {'lr': 0.0004999545822253941, 'samples': 556608, 'steps': 2898, 'loss/train': 1.9909908771514893} 11/06/2021 21:40:27 - INFO - __main__ - Step 2900: {'lr': 0.0004999544810189713, 'samples': 556800, 'steps': 2899, 'loss/train': 1.9784951210021973} 11/06/2021 21:40:27 - INFO - __main__ - Step 2901: {'lr': 0.0004999543796999228, 'samples': 556992, 'steps': 2900, 'loss/train': 2.4324984550476074} 11/06/2021 21:40:27 - INFO - __main__ - Step 2902: {'lr': 0.0004999542782682489, 'samples': 557184, 'steps': 2901, 'loss/train': 1.6676363945007324} 11/06/2021 21:40:28 - INFO - __main__ - Step 2903: {'lr': 0.0004999541767239493, 'samples': 557376, 'steps': 2902, 'loss/train': 2.2086076736450195} 11/06/2021 21:40:28 - INFO - __main__ - Step 2904: {'lr': 0.0004999540750670243, 'samples': 557568, 'steps': 2903, 'loss/train': 2.221266746520996} 11/06/2021 21:40:28 - INFO - __main__ - Step 2905: {'lr': 0.0004999539732974738, 'samples': 557760, 'steps': 2904, 'loss/train': 1.9565421342849731} 11/06/2021 21:40:29 - INFO - __main__ - Step 2906: {'lr': 0.0004999538714152978, 'samples': 557952, 'steps': 2905, 'loss/train': 2.1874446868896484} 11/06/2021 21:40:30 - INFO - __main__ - Step 2907: {'lr': 0.0004999537694204966, 'samples': 558144, 'steps': 2906, 'loss/train': 1.4559849500656128} 11/06/2021 21:40:30 - INFO - __main__ - Step 2908: {'lr': 0.0004999536673130701, 'samples': 558336, 'steps': 2907, 'loss/train': 2.1022891998291016} 11/06/2021 21:40:30 - INFO - __main__ - Step 2909: {'lr': 0.0004999535650930182, 'samples': 558528, 'steps': 2908, 'loss/train': 1.6833531856536865} 11/06/2021 21:40:31 - INFO - __main__ - Step 2910: {'lr': 0.0004999534627603411, 'samples': 558720, 'steps': 2909, 'loss/train': 1.6895768642425537} 11/06/2021 21:40:33 - INFO - __main__ - Step 2911: {'lr': 0.0004999533603150389, 'samples': 558912, 'steps': 2910, 'loss/train': 2.199605703353882} 11/06/2021 21:40:33 - INFO - __main__ - Step 2912: {'lr': 0.0004999532577571116, 'samples': 559104, 'steps': 2911, 'loss/train': 2.348113536834717} 11/06/2021 21:40:33 - INFO - __main__ - Step 2913: {'lr': 0.0004999531550865592, 'samples': 559296, 'steps': 2912, 'loss/train': 1.9907268285751343} 11/06/2021 21:40:34 - INFO - __main__ - Step 2914: {'lr': 0.0004999530523033817, 'samples': 559488, 'steps': 2913, 'loss/train': 1.3060599565505981} 11/06/2021 21:40:34 - INFO - __main__ - Step 2915: {'lr': 0.0004999529494075792, 'samples': 559680, 'steps': 2914, 'loss/train': 2.2429051399230957} 11/06/2021 21:40:34 - INFO - __main__ - Step 2916: {'lr': 0.0004999528463991518, 'samples': 559872, 'steps': 2915, 'loss/train': 3.520594596862793} 11/06/2021 21:40:35 - INFO - __main__ - Step 2917: {'lr': 0.0004999527432780995, 'samples': 560064, 'steps': 2916, 'loss/train': 2.8024425506591797} 11/06/2021 21:40:36 - INFO - __main__ - Step 2918: {'lr': 0.0004999526400444223, 'samples': 560256, 'steps': 2917, 'loss/train': 2.3068015575408936} 11/06/2021 21:40:36 - INFO - __main__ - Step 2919: {'lr': 0.0004999525366981204, 'samples': 560448, 'steps': 2918, 'loss/train': 2.4727816581726074} 11/06/2021 21:40:36 - INFO - __main__ - Step 2920: {'lr': 0.0004999524332391937, 'samples': 560640, 'steps': 2919, 'loss/train': 1.4589003324508667} 11/06/2021 21:40:37 - INFO - __main__ - Step 2921: {'lr': 0.0004999523296676423, 'samples': 560832, 'steps': 2920, 'loss/train': 2.8546531200408936} 11/06/2021 21:40:38 - INFO - __main__ - Step 2922: {'lr': 0.0004999522259834662, 'samples': 561024, 'steps': 2921, 'loss/train': 2.01436185836792} 11/06/2021 21:40:38 - INFO - __main__ - Step 2923: {'lr': 0.0004999521221866655, 'samples': 561216, 'steps': 2922, 'loss/train': 2.9022979736328125} 11/06/2021 21:40:38 - INFO - __main__ - Step 2924: {'lr': 0.0004999520182772402, 'samples': 561408, 'steps': 2923, 'loss/train': 2.112090826034546} 11/06/2021 21:40:39 - INFO - __main__ - Step 2925: {'lr': 0.0004999519142551905, 'samples': 561600, 'steps': 2924, 'loss/train': 2.724461317062378} 11/06/2021 21:40:39 - INFO - __main__ - Step 2926: {'lr': 0.0004999518101205162, 'samples': 561792, 'steps': 2925, 'loss/train': 3.0705063343048096} 11/06/2021 21:40:40 - INFO - __main__ - Step 2927: {'lr': 0.0004999517058732175, 'samples': 561984, 'steps': 2926, 'loss/train': 2.3945233821868896} 11/06/2021 21:40:41 - INFO - __main__ - Step 2928: {'lr': 0.0004999516015132945, 'samples': 562176, 'steps': 2927, 'loss/train': 1.2997283935546875} 11/06/2021 21:40:41 - INFO - __main__ - Step 2929: {'lr': 0.0004999514970407471, 'samples': 562368, 'steps': 2928, 'loss/train': 1.4309011697769165} 11/06/2021 21:40:41 - INFO - __main__ - Step 2930: {'lr': 0.0004999513924555754, 'samples': 562560, 'steps': 2929, 'loss/train': 1.5472474098205566} 11/06/2021 21:40:42 - INFO - __main__ - Step 2931: {'lr': 0.0004999512877577794, 'samples': 562752, 'steps': 2930, 'loss/train': 2.173020601272583} 11/06/2021 21:40:43 - INFO - __main__ - Step 2932: {'lr': 0.0004999511829473593, 'samples': 562944, 'steps': 2931, 'loss/train': 1.8353915214538574} 11/06/2021 21:40:43 - INFO - __main__ - Step 2933: {'lr': 0.0004999510780243151, 'samples': 563136, 'steps': 2932, 'loss/train': 1.7260181903839111} 11/06/2021 21:40:43 - INFO - __main__ - Step 2934: {'lr': 0.0004999509729886467, 'samples': 563328, 'steps': 2933, 'loss/train': 2.6757678985595703} 11/06/2021 21:40:44 - INFO - __main__ - Step 2935: {'lr': 0.0004999508678403542, 'samples': 563520, 'steps': 2934, 'loss/train': 2.1792075634002686} 11/06/2021 21:40:44 - INFO - __main__ - Step 2936: {'lr': 0.0004999507625794378, 'samples': 563712, 'steps': 2935, 'loss/train': 2.2181787490844727} 11/06/2021 21:40:45 - INFO - __main__ - Step 2937: {'lr': 0.0004999506572058974, 'samples': 563904, 'steps': 2936, 'loss/train': 2.060746192932129} 11/06/2021 21:40:45 - INFO - __main__ - Step 2938: {'lr': 0.0004999505517197331, 'samples': 564096, 'steps': 2937, 'loss/train': 2.2606637477874756} 11/06/2021 21:40:46 - INFO - __main__ - Step 2939: {'lr': 0.000499950446120945, 'samples': 564288, 'steps': 2938, 'loss/train': 1.9064645767211914} 11/06/2021 21:40:46 - INFO - __main__ - Step 2940: {'lr': 0.000499950340409533, 'samples': 564480, 'steps': 2939, 'loss/train': 2.1608986854553223} 11/06/2021 21:40:46 - INFO - __main__ - Step 2941: {'lr': 0.0004999502345854973, 'samples': 564672, 'steps': 2940, 'loss/train': 2.2429375648498535} 11/06/2021 21:40:47 - INFO - __main__ - Step 2942: {'lr': 0.0004999501286488378, 'samples': 564864, 'steps': 2941, 'loss/train': 2.142625093460083} 11/06/2021 21:40:48 - INFO - __main__ - Step 2943: {'lr': 0.0004999500225995547, 'samples': 565056, 'steps': 2942, 'loss/train': 2.0483415126800537} 11/06/2021 21:40:48 - INFO - __main__ - Step 2944: {'lr': 0.000499949916437648, 'samples': 565248, 'steps': 2943, 'loss/train': 2.363560199737549} 11/06/2021 21:40:49 - INFO - __main__ - Step 2945: {'lr': 0.0004999498101631177, 'samples': 565440, 'steps': 2944, 'loss/train': 2.750171422958374} 11/06/2021 21:40:49 - INFO - __main__ - Step 2946: {'lr': 0.0004999497037759638, 'samples': 565632, 'steps': 2945, 'loss/train': 2.285203218460083} 11/06/2021 21:40:49 - INFO - __main__ - Step 2947: {'lr': 0.0004999495972761865, 'samples': 565824, 'steps': 2946, 'loss/train': 2.084728479385376} 11/06/2021 21:40:50 - INFO - __main__ - Step 2948: {'lr': 0.0004999494906637857, 'samples': 566016, 'steps': 2947, 'loss/train': 2.138784646987915} 11/06/2021 21:40:51 - INFO - __main__ - Step 2949: {'lr': 0.0004999493839387615, 'samples': 566208, 'steps': 2948, 'loss/train': 0.8493632674217224} 11/06/2021 21:40:51 - INFO - __main__ - Step 2950: {'lr': 0.000499949277101114, 'samples': 566400, 'steps': 2949, 'loss/train': 2.2582223415374756} 11/06/2021 21:40:51 - INFO - __main__ - Step 2951: {'lr': 0.0004999491701508433, 'samples': 566592, 'steps': 2950, 'loss/train': 2.1516506671905518} 11/06/2021 21:40:52 - INFO - __main__ - Step 2952: {'lr': 0.0004999490630879493, 'samples': 566784, 'steps': 2951, 'loss/train': 1.5355583429336548} 11/06/2021 21:40:53 - INFO - __main__ - Step 2953: {'lr': 0.0004999489559124321, 'samples': 566976, 'steps': 2952, 'loss/train': 1.9333853721618652} 11/06/2021 21:40:53 - INFO - __main__ - Step 2954: {'lr': 0.0004999488486242918, 'samples': 567168, 'steps': 2953, 'loss/train': 1.470977783203125} 11/06/2021 21:40:53 - INFO - __main__ - Step 2955: {'lr': 0.0004999487412235284, 'samples': 567360, 'steps': 2954, 'loss/train': 2.435324192047119} 11/06/2021 21:40:54 - INFO - __main__ - Step 2956: {'lr': 0.0004999486337101419, 'samples': 567552, 'steps': 2955, 'loss/train': 2.1768510341644287} 11/06/2021 21:40:54 - INFO - __main__ - Step 2957: {'lr': 0.0004999485260841324, 'samples': 567744, 'steps': 2956, 'loss/train': 2.5534868240356445} 11/06/2021 21:40:55 - INFO - __main__ - Step 2958: {'lr': 0.0004999484183455, 'samples': 567936, 'steps': 2957, 'loss/train': 2.551486015319824} 11/06/2021 21:40:55 - INFO - __main__ - Step 2959: {'lr': 0.0004999483104942446, 'samples': 568128, 'steps': 2958, 'loss/train': 2.710608720779419} 11/06/2021 21:40:56 - INFO - __main__ - Step 2960: {'lr': 0.0004999482025303665, 'samples': 568320, 'steps': 2959, 'loss/train': 2.0834438800811768} 11/06/2021 21:40:56 - INFO - __main__ - Step 2961: {'lr': 0.0004999480944538655, 'samples': 568512, 'steps': 2960, 'loss/train': 2.1335995197296143} 11/06/2021 21:40:57 - INFO - __main__ - Step 2962: {'lr': 0.0004999479862647417, 'samples': 568704, 'steps': 2961, 'loss/train': 1.7323172092437744} 11/06/2021 21:40:57 - INFO - __main__ - Step 2963: {'lr': 0.0004999478779629953, 'samples': 568896, 'steps': 2962, 'loss/train': 1.3542144298553467} 11/06/2021 21:40:58 - INFO - __main__ - Step 2964: {'lr': 0.0004999477695486261, 'samples': 569088, 'steps': 2963, 'loss/train': 2.248361349105835} 11/06/2021 21:40:58 - INFO - __main__ - Step 2965: {'lr': 0.0004999476610216345, 'samples': 569280, 'steps': 2964, 'loss/train': 1.651444673538208} 11/06/2021 21:40:59 - INFO - __main__ - Step 2966: {'lr': 0.0004999475523820203, 'samples': 569472, 'steps': 2965, 'loss/train': 1.6673521995544434} 11/06/2021 21:40:59 - INFO - __main__ - Step 2967: {'lr': 0.0004999474436297835, 'samples': 569664, 'steps': 2966, 'loss/train': 2.576014995574951} 11/06/2021 21:40:59 - INFO - __main__ - Step 2968: {'lr': 0.0004999473347649242, 'samples': 569856, 'steps': 2967, 'loss/train': 2.7090260982513428} 11/06/2021 21:41:00 - INFO - __main__ - Step 2969: {'lr': 0.0004999472257874426, 'samples': 570048, 'steps': 2968, 'loss/train': 2.4172799587249756} 11/06/2021 21:41:01 - INFO - __main__ - Step 2970: {'lr': 0.0004999471166973385, 'samples': 570240, 'steps': 2969, 'loss/train': 2.754880905151367} 11/06/2021 21:41:01 - INFO - __main__ - Step 2971: {'lr': 0.0004999470074946122, 'samples': 570432, 'steps': 2970, 'loss/train': 2.2452704906463623} 11/06/2021 21:41:01 - INFO - __main__ - Step 2972: {'lr': 0.0004999468981792636, 'samples': 570624, 'steps': 2971, 'loss/train': 1.8505585193634033} 11/06/2021 21:41:02 - INFO - __main__ - Step 2973: {'lr': 0.0004999467887512928, 'samples': 570816, 'steps': 2972, 'loss/train': 2.324302911758423} 11/06/2021 21:41:03 - INFO - __main__ - Step 2974: {'lr': 0.0004999466792106998, 'samples': 571008, 'steps': 2973, 'loss/train': 2.198997974395752} 11/06/2021 21:41:03 - INFO - __main__ - Step 2975: {'lr': 0.0004999465695574848, 'samples': 571200, 'steps': 2974, 'loss/train': 2.4964334964752197} 11/06/2021 21:41:03 - INFO - __main__ - Step 2976: {'lr': 0.0004999464597916476, 'samples': 571392, 'steps': 2975, 'loss/train': 2.4963150024414062} 11/06/2021 21:41:04 - INFO - __main__ - Step 2977: {'lr': 0.0004999463499131884, 'samples': 571584, 'steps': 2976, 'loss/train': 2.5181667804718018} 11/06/2021 21:41:04 - INFO - __main__ - Step 2978: {'lr': 0.0004999462399221073, 'samples': 571776, 'steps': 2977, 'loss/train': 3.5236194133758545} 11/06/2021 21:41:05 - INFO - __main__ - Step 2979: {'lr': 0.0004999461298184042, 'samples': 571968, 'steps': 2978, 'loss/train': 2.367652416229248} 11/06/2021 21:41:06 - INFO - __main__ - Step 2980: {'lr': 0.0004999460196020793, 'samples': 572160, 'steps': 2979, 'loss/train': 2.657733201980591} 11/06/2021 21:41:06 - INFO - __main__ - Step 2981: {'lr': 0.0004999459092731326, 'samples': 572352, 'steps': 2980, 'loss/train': 2.142723321914673} 11/06/2021 21:41:06 - INFO - __main__ - Step 2982: {'lr': 0.000499945798831564, 'samples': 572544, 'steps': 2981, 'loss/train': 2.780032157897949} 11/06/2021 21:41:07 - INFO - __main__ - Step 2983: {'lr': 0.0004999456882773737, 'samples': 572736, 'steps': 2982, 'loss/train': 2.4689583778381348} 11/06/2021 21:41:08 - INFO - __main__ - Step 2984: {'lr': 0.0004999455776105618, 'samples': 572928, 'steps': 2983, 'loss/train': 1.9315149784088135} 11/06/2021 21:41:08 - INFO - __main__ - Step 2985: {'lr': 0.0004999454668311283, 'samples': 573120, 'steps': 2984, 'loss/train': 2.1817145347595215} 11/06/2021 21:41:08 - INFO - __main__ - Step 2986: {'lr': 0.0004999453559390731, 'samples': 573312, 'steps': 2985, 'loss/train': 2.152888774871826} 11/06/2021 21:41:09 - INFO - __main__ - Step 2987: {'lr': 0.0004999452449343967, 'samples': 573504, 'steps': 2986, 'loss/train': 2.3532779216766357} 11/06/2021 21:41:09 - INFO - __main__ - Step 2988: {'lr': 0.0004999451338170985, 'samples': 573696, 'steps': 2987, 'loss/train': 2.4001834392547607} 11/06/2021 21:41:10 - INFO - __main__ - Step 2989: {'lr': 0.000499945022587179, 'samples': 573888, 'steps': 2988, 'loss/train': 2.3787949085235596} 11/06/2021 21:41:10 - INFO - __main__ - Step 2990: {'lr': 0.0004999449112446381, 'samples': 574080, 'steps': 2989, 'loss/train': 2.610903263092041} 11/06/2021 21:41:11 - INFO - __main__ - Step 2991: {'lr': 0.000499944799789476, 'samples': 574272, 'steps': 2990, 'loss/train': 2.203000545501709} 11/06/2021 21:41:11 - INFO - __main__ - Step 2992: {'lr': 0.0004999446882216925, 'samples': 574464, 'steps': 2991, 'loss/train': 1.9404032230377197} 11/06/2021 21:41:11 - INFO - __main__ - Step 2993: {'lr': 0.0004999445765412878, 'samples': 574656, 'steps': 2992, 'loss/train': 2.0062878131866455} 11/06/2021 21:41:12 - INFO - __main__ - Step 2994: {'lr': 0.0004999444647482619, 'samples': 574848, 'steps': 2993, 'loss/train': 2.512753963470459} 11/06/2021 21:41:13 - INFO - __main__ - Step 2995: {'lr': 0.0004999443528426149, 'samples': 575040, 'steps': 2994, 'loss/train': 2.4999752044677734} 11/06/2021 21:41:13 - INFO - __main__ - Step 2996: {'lr': 0.0004999442408243469, 'samples': 575232, 'steps': 2995, 'loss/train': 2.140004873275757} 11/06/2021 21:41:13 - INFO - __main__ - Step 2997: {'lr': 0.0004999441286934578, 'samples': 575424, 'steps': 2996, 'loss/train': 2.258283853530884} 11/06/2021 21:41:14 - INFO - __main__ - Step 2998: {'lr': 0.0004999440164499478, 'samples': 575616, 'steps': 2997, 'loss/train': 2.103290557861328} 11/06/2021 21:41:14 - INFO - __main__ - Step 2999: {'lr': 0.0004999439040938168, 'samples': 575808, 'steps': 2998, 'loss/train': 2.217271327972412} 11/06/2021 21:41:15 - INFO - __main__ - Step 3000: {'lr': 0.000499943791625065, 'samples': 576000, 'steps': 2999, 'loss/train': 2.1993706226348877} 11/06/2021 21:41:15 - INFO - __main__ - Step 3001: {'lr': 0.0004999436790436923, 'samples': 576192, 'steps': 3000, 'loss/train': 1.9046951532363892} 11/06/2021 21:41:16 - INFO - __main__ - Step 3002: {'lr': 0.000499943566349699, 'samples': 576384, 'steps': 3001, 'loss/train': 2.5663747787475586} 11/06/2021 21:41:16 - INFO - __main__ - Step 3003: {'lr': 0.0004999434535430848, 'samples': 576576, 'steps': 3002, 'loss/train': 2.3395111560821533} 11/06/2021 21:41:17 - INFO - __main__ - Step 3004: {'lr': 0.0004999433406238501, 'samples': 576768, 'steps': 3003, 'loss/train': 2.3144681453704834} 11/06/2021 21:41:18 - INFO - __main__ - Step 3005: {'lr': 0.0004999432275919947, 'samples': 576960, 'steps': 3004, 'loss/train': 2.3449320793151855} 11/06/2021 21:41:18 - INFO - __main__ - Step 3006: {'lr': 0.0004999431144475187, 'samples': 577152, 'steps': 3005, 'loss/train': 2.1510040760040283} 11/06/2021 21:41:18 - INFO - __main__ - Step 3007: {'lr': 0.0004999430011904222, 'samples': 577344, 'steps': 3006, 'loss/train': 0.7324512004852295} 11/06/2021 21:41:19 - INFO - __main__ - Step 3008: {'lr': 0.0004999428878207054, 'samples': 577536, 'steps': 3007, 'loss/train': 2.3040289878845215} 11/06/2021 21:41:19 - INFO - __main__ - Step 3009: {'lr': 0.000499942774338368, 'samples': 577728, 'steps': 3008, 'loss/train': 2.3875339031219482} 11/06/2021 21:41:20 - INFO - __main__ - Step 3010: {'lr': 0.0004999426607434104, 'samples': 577920, 'steps': 3009, 'loss/train': 2.562206268310547} 11/06/2021 21:41:20 - INFO - __main__ - Step 3011: {'lr': 0.0004999425470358324, 'samples': 578112, 'steps': 3010, 'loss/train': 2.1042847633361816} 11/06/2021 21:41:21 - INFO - __main__ - Step 3012: {'lr': 0.0004999424332156341, 'samples': 578304, 'steps': 3011, 'loss/train': 1.8335940837860107} 11/06/2021 21:41:21 - INFO - __main__ - Step 3013: {'lr': 0.0004999423192828156, 'samples': 578496, 'steps': 3012, 'loss/train': 2.3943774700164795} 11/06/2021 21:41:21 - INFO - __main__ - Step 3014: {'lr': 0.0004999422052373771, 'samples': 578688, 'steps': 3013, 'loss/train': 2.3687760829925537} 11/06/2021 21:41:23 - INFO - __main__ - Step 3015: {'lr': 0.0004999420910793183, 'samples': 578880, 'steps': 3014, 'loss/train': 2.2933881282806396} 11/06/2021 21:41:23 - INFO - __main__ - Step 3016: {'lr': 0.0004999419768086397, 'samples': 579072, 'steps': 3015, 'loss/train': 2.4723987579345703} 11/06/2021 21:41:23 - INFO - __main__ - Step 3017: {'lr': 0.0004999418624253408, 'samples': 579264, 'steps': 3016, 'loss/train': 2.3024117946624756} 11/06/2021 21:41:24 - INFO - __main__ - Step 3018: {'lr': 0.0004999417479294221, 'samples': 579456, 'steps': 3017, 'loss/train': 2.373169183731079} 11/06/2021 21:41:24 - INFO - __main__ - Step 3019: {'lr': 0.0004999416333208835, 'samples': 579648, 'steps': 3018, 'loss/train': 1.6759024858474731} 11/06/2021 21:41:25 - INFO - __main__ - Step 3020: {'lr': 0.0004999415185997252, 'samples': 579840, 'steps': 3019, 'loss/train': 2.323343515396118} 11/06/2021 21:41:25 - INFO - __main__ - Step 3021: {'lr': 0.0004999414037659468, 'samples': 580032, 'steps': 3020, 'loss/train': 1.5138059854507446} 11/06/2021 21:41:26 - INFO - __main__ - Step 3022: {'lr': 0.000499941288819549, 'samples': 580224, 'steps': 3021, 'loss/train': 1.5361846685409546} 11/06/2021 21:41:26 - INFO - __main__ - Step 3023: {'lr': 0.0004999411737605313, 'samples': 580416, 'steps': 3022, 'loss/train': 1.946350336074829} 11/06/2021 21:41:27 - INFO - __main__ - Step 3024: {'lr': 0.000499941058588894, 'samples': 580608, 'steps': 3023, 'loss/train': 2.60408091545105} 11/06/2021 21:41:28 - INFO - __main__ - Step 3025: {'lr': 0.0004999409433046371, 'samples': 580800, 'steps': 3024, 'loss/train': 2.4534120559692383} 11/06/2021 21:41:28 - INFO - __main__ - Step 3026: {'lr': 0.0004999408279077607, 'samples': 580992, 'steps': 3025, 'loss/train': 2.68884015083313} 11/06/2021 21:41:29 - INFO - __main__ - Step 3027: {'lr': 0.0004999407123982649, 'samples': 581184, 'steps': 3026, 'loss/train': 2.0190529823303223} 11/06/2021 21:41:29 - INFO - __main__ - Step 3028: {'lr': 0.0004999405967761495, 'samples': 581376, 'steps': 3027, 'loss/train': 1.0339617729187012} 11/06/2021 21:41:29 - INFO - __main__ - Step 3029: {'lr': 0.0004999404810414149, 'samples': 581568, 'steps': 3028, 'loss/train': 0.895268440246582} 11/06/2021 21:41:31 - INFO - __main__ - Step 3030: {'lr': 0.0004999403651940608, 'samples': 581760, 'steps': 3029, 'loss/train': 2.5212795734405518} 11/06/2021 21:41:31 - INFO - __main__ - Step 3031: {'lr': 0.0004999402492340875, 'samples': 581952, 'steps': 3030, 'loss/train': 2.4372713565826416} 11/06/2021 21:41:31 - INFO - __main__ - Step 3032: {'lr': 0.000499940133161495, 'samples': 582144, 'steps': 3031, 'loss/train': 2.357042074203491} 11/06/2021 21:41:32 - INFO - __main__ - Step 3033: {'lr': 0.0004999400169762834, 'samples': 582336, 'steps': 3032, 'loss/train': 1.6500097513198853} 11/06/2021 21:41:32 - INFO - __main__ - Step 3034: {'lr': 0.0004999399006784525, 'samples': 582528, 'steps': 3033, 'loss/train': 2.2816481590270996} 11/06/2021 21:41:33 - INFO - __main__ - Step 3035: {'lr': 0.0004999397842680027, 'samples': 582720, 'steps': 3034, 'loss/train': 2.364650249481201} 11/06/2021 21:41:33 - INFO - __main__ - Step 3036: {'lr': 0.0004999396677449338, 'samples': 582912, 'steps': 3035, 'loss/train': 0.6538060307502747} 11/06/2021 21:41:34 - INFO - __main__ - Step 3037: {'lr': 0.000499939551109246, 'samples': 583104, 'steps': 3036, 'loss/train': 2.646855354309082} 11/06/2021 21:41:34 - INFO - __main__ - Step 3038: {'lr': 0.0004999394343609393, 'samples': 583296, 'steps': 3037, 'loss/train': 2.262348175048828} 11/06/2021 21:41:34 - INFO - __main__ - Step 3039: {'lr': 0.0004999393175000137, 'samples': 583488, 'steps': 3038, 'loss/train': 2.1946680545806885} 11/06/2021 21:41:35 - INFO - __main__ - Step 3040: {'lr': 0.0004999392005264694, 'samples': 583680, 'steps': 3039, 'loss/train': 2.3588778972625732} 11/06/2021 21:41:36 - INFO - __main__ - Step 3041: {'lr': 0.0004999390834403062, 'samples': 583872, 'steps': 3040, 'loss/train': 1.918365716934204} 11/06/2021 21:41:36 - INFO - __main__ - Step 3042: {'lr': 0.0004999389662415244, 'samples': 584064, 'steps': 3041, 'loss/train': 2.238910436630249} 11/06/2021 21:41:37 - INFO - __main__ - Step 3043: {'lr': 0.000499938848930124, 'samples': 584256, 'steps': 3042, 'loss/train': 1.9267772436141968} 11/06/2021 21:41:37 - INFO - __main__ - Step 3044: {'lr': 0.0004999387315061049, 'samples': 584448, 'steps': 3043, 'loss/train': 2.763803005218506} 11/06/2021 21:41:37 - INFO - __main__ - Step 3045: {'lr': 0.0004999386139694673, 'samples': 584640, 'steps': 3044, 'loss/train': 5.723489761352539} 11/06/2021 21:41:38 - INFO - __main__ - Step 3046: {'lr': 0.0004999384963202113, 'samples': 584832, 'steps': 3045, 'loss/train': 2.657350778579712} 11/06/2021 21:41:39 - INFO - __main__ - Step 3047: {'lr': 0.0004999383785583368, 'samples': 585024, 'steps': 3046, 'loss/train': 2.1708598136901855} 11/06/2021 21:41:39 - INFO - __main__ - Step 3048: {'lr': 0.0004999382606838439, 'samples': 585216, 'steps': 3047, 'loss/train': 2.4225125312805176} 11/06/2021 21:41:39 - INFO - __main__ - Step 3049: {'lr': 0.0004999381426967327, 'samples': 585408, 'steps': 3048, 'loss/train': 2.1795241832733154} 11/06/2021 21:41:40 - INFO - __main__ - Step 3050: {'lr': 0.0004999380245970033, 'samples': 585600, 'steps': 3049, 'loss/train': 2.3345532417297363} 11/06/2021 21:41:40 - INFO - __main__ - Step 3051: {'lr': 0.0004999379063846555, 'samples': 585792, 'steps': 3050, 'loss/train': 2.1658904552459717} 11/06/2021 21:41:41 - INFO - __main__ - Step 3052: {'lr': 0.0004999377880596897, 'samples': 585984, 'steps': 3051, 'loss/train': 2.3104407787323} 11/06/2021 21:41:41 - INFO - __main__ - Step 3053: {'lr': 0.0004999376696221057, 'samples': 586176, 'steps': 3052, 'loss/train': 2.1022863388061523} 11/06/2021 21:41:42 - INFO - __main__ - Step 3054: {'lr': 0.0004999375510719037, 'samples': 586368, 'steps': 3053, 'loss/train': 2.1373109817504883} 11/06/2021 21:41:42 - INFO - __main__ - Step 3055: {'lr': 0.0004999374324090837, 'samples': 586560, 'steps': 3054, 'loss/train': 2.6458959579467773} 11/06/2021 21:41:43 - INFO - __main__ - Step 3056: {'lr': 0.0004999373136336457, 'samples': 586752, 'steps': 3055, 'loss/train': 2.1818501949310303} 11/06/2021 21:41:44 - INFO - __main__ - Step 3057: {'lr': 0.0004999371947455899, 'samples': 586944, 'steps': 3056, 'loss/train': 3.2870852947235107} 11/06/2021 21:41:44 - INFO - __main__ - Step 3058: {'lr': 0.0004999370757449162, 'samples': 587136, 'steps': 3057, 'loss/train': 1.6503088474273682} 11/06/2021 21:41:45 - INFO - __main__ - Step 3059: {'lr': 0.0004999369566316247, 'samples': 587328, 'steps': 3058, 'loss/train': 3.2985236644744873} 11/06/2021 21:41:45 - INFO - __main__ - Step 3060: {'lr': 0.0004999368374057155, 'samples': 587520, 'steps': 3059, 'loss/train': 2.637629985809326} 11/06/2021 21:41:45 - INFO - __main__ - Step 3061: {'lr': 0.0004999367180671886, 'samples': 587712, 'steps': 3060, 'loss/train': 2.058300733566284} 11/06/2021 21:41:46 - INFO - __main__ - Step 3062: {'lr': 0.000499936598616044, 'samples': 587904, 'steps': 3061, 'loss/train': 2.2234086990356445} 11/06/2021 21:41:46 - INFO - __main__ - Step 3063: {'lr': 0.0004999364790522819, 'samples': 588096, 'steps': 3062, 'loss/train': 1.9552711248397827} 11/06/2021 21:41:47 - INFO - __main__ - Step 3064: {'lr': 0.0004999363593759022, 'samples': 588288, 'steps': 3063, 'loss/train': 2.5826447010040283} 11/06/2021 21:41:47 - INFO - __main__ - Step 3065: {'lr': 0.0004999362395869052, 'samples': 588480, 'steps': 3064, 'loss/train': 2.5143256187438965} 11/06/2021 21:41:48 - INFO - __main__ - Step 3066: {'lr': 0.0004999361196852906, 'samples': 588672, 'steps': 3065, 'loss/train': 2.092707633972168} 11/06/2021 21:41:48 - INFO - __main__ - Step 3067: {'lr': 0.0004999359996710588, 'samples': 588864, 'steps': 3066, 'loss/train': 2.1652286052703857} 11/06/2021 21:41:49 - INFO - __main__ - Step 3068: {'lr': 0.0004999358795442096, 'samples': 589056, 'steps': 3067, 'loss/train': 2.3341615200042725} 11/06/2021 21:41:50 - INFO - __main__ - Step 3069: {'lr': 0.0004999357593047431, 'samples': 589248, 'steps': 3068, 'loss/train': 1.7487068176269531} 11/06/2021 21:41:50 - INFO - __main__ - Step 3070: {'lr': 0.0004999356389526595, 'samples': 589440, 'steps': 3069, 'loss/train': 2.2745633125305176} 11/06/2021 21:41:50 - INFO - __main__ - Step 3071: {'lr': 0.0004999355184879587, 'samples': 589632, 'steps': 3070, 'loss/train': 1.911078691482544} 11/06/2021 21:41:51 - INFO - __main__ - Step 3072: {'lr': 0.0004999353979106409, 'samples': 589824, 'steps': 3071, 'loss/train': 2.032170057296753} 11/06/2021 21:41:52 - INFO - __main__ - Step 3073: {'lr': 0.000499935277220706, 'samples': 590016, 'steps': 3072, 'loss/train': 0.5505169034004211} 11/06/2021 21:41:52 - INFO - __main__ - Step 3074: {'lr': 0.0004999351564181541, 'samples': 590208, 'steps': 3073, 'loss/train': 2.0557284355163574} 11/06/2021 21:41:53 - INFO - __main__ - Step 3075: {'lr': 0.0004999350355029854, 'samples': 590400, 'steps': 3074, 'loss/train': 2.871734857559204} 11/06/2021 21:41:53 - INFO - __main__ - Step 3076: {'lr': 0.0004999349144751997, 'samples': 590592, 'steps': 3075, 'loss/train': 2.093614339828491} 11/06/2021 21:41:53 - INFO - __main__ - Step 3077: {'lr': 0.0004999347933347972, 'samples': 590784, 'steps': 3076, 'loss/train': 2.2639260292053223} 11/06/2021 21:41:55 - INFO - __main__ - Step 3078: {'lr': 0.0004999346720817779, 'samples': 590976, 'steps': 3077, 'loss/train': 1.5730407238006592} 11/06/2021 21:41:55 - INFO - __main__ - Step 3079: {'lr': 0.000499934550716142, 'samples': 591168, 'steps': 3078, 'loss/train': 2.4267160892486572} 11/06/2021 21:41:55 - INFO - __main__ - Step 3080: {'lr': 0.0004999344292378893, 'samples': 591360, 'steps': 3079, 'loss/train': 3.6418893337249756} 11/06/2021 21:41:56 - INFO - __main__ - Step 3081: {'lr': 0.0004999343076470202, 'samples': 591552, 'steps': 3080, 'loss/train': 1.6918516159057617} 11/06/2021 21:41:56 - INFO - __main__ - Step 3082: {'lr': 0.0004999341859435345, 'samples': 591744, 'steps': 3081, 'loss/train': 2.610215902328491} 11/06/2021 21:41:57 - INFO - __main__ - Step 3083: {'lr': 0.0004999340641274322, 'samples': 591936, 'steps': 3082, 'loss/train': 2.330768346786499} 11/06/2021 21:41:57 - INFO - __main__ - Step 3084: {'lr': 0.0004999339421987136, 'samples': 592128, 'steps': 3083, 'loss/train': 1.9727678298950195} 11/06/2021 21:41:58 - INFO - __main__ - Step 3085: {'lr': 0.0004999338201573786, 'samples': 592320, 'steps': 3084, 'loss/train': 2.6285154819488525} 11/06/2021 21:41:58 - INFO - __main__ - Step 3086: {'lr': 0.0004999336980034271, 'samples': 592512, 'steps': 3085, 'loss/train': 2.5483882427215576} 11/06/2021 21:41:59 - INFO - __main__ - Step 3087: {'lr': 0.0004999335757368595, 'samples': 592704, 'steps': 3086, 'loss/train': 2.4830641746520996} 11/06/2021 21:41:59 - INFO - __main__ - Step 3088: {'lr': 0.0004999334533576757, 'samples': 592896, 'steps': 3087, 'loss/train': 2.06929087638855} 11/06/2021 21:41:59 - INFO - __main__ - Step 3089: {'lr': 0.0004999333308658756, 'samples': 593088, 'steps': 3088, 'loss/train': 2.439222812652588} 11/06/2021 21:42:01 - INFO - __main__ - Step 3090: {'lr': 0.0004999332082614597, 'samples': 593280, 'steps': 3089, 'loss/train': 1.6735726594924927} 11/06/2021 21:42:01 - INFO - __main__ - Step 3091: {'lr': 0.0004999330855444274, 'samples': 593472, 'steps': 3090, 'loss/train': 2.4830446243286133} 11/06/2021 21:42:01 - INFO - __main__ - Step 3092: {'lr': 0.0004999329627147792, 'samples': 593664, 'steps': 3091, 'loss/train': 1.9454426765441895} 11/06/2021 21:42:02 - INFO - __main__ - Step 3093: {'lr': 0.0004999328397725152, 'samples': 593856, 'steps': 3092, 'loss/train': 2.445350408554077} 11/06/2021 21:42:02 - INFO - __main__ - Step 3094: {'lr': 0.0004999327167176352, 'samples': 594048, 'steps': 3093, 'loss/train': 2.219423532485962} 11/06/2021 21:42:02 - INFO - __main__ - Step 3095: {'lr': 0.0004999325935501395, 'samples': 594240, 'steps': 3094, 'loss/train': 2.2615649700164795} 11/06/2021 21:42:03 - INFO - __main__ - Step 3096: {'lr': 0.0004999324702700279, 'samples': 594432, 'steps': 3095, 'loss/train': 2.060243844985962} 11/06/2021 21:42:04 - INFO - __main__ - Step 3097: {'lr': 0.0004999323468773007, 'samples': 594624, 'steps': 3096, 'loss/train': 1.7364767789840698} 11/06/2021 21:42:04 - INFO - __main__ - Step 3098: {'lr': 0.0004999322233719578, 'samples': 594816, 'steps': 3097, 'loss/train': 2.3466086387634277} 11/06/2021 21:42:04 - INFO - __main__ - Step 3099: {'lr': 0.0004999320997539992, 'samples': 595008, 'steps': 3098, 'loss/train': 2.1478734016418457} 11/06/2021 21:42:05 - INFO - __main__ - Step 3100: {'lr': 0.0004999319760234251, 'samples': 595200, 'steps': 3099, 'loss/train': 1.8214367628097534} 11/06/2021 21:42:06 - INFO - __main__ - Step 3101: {'lr': 0.0004999318521802356, 'samples': 595392, 'steps': 3100, 'loss/train': 2.0618221759796143} 11/06/2021 21:42:06 - INFO - __main__ - Step 3102: {'lr': 0.0004999317282244305, 'samples': 595584, 'steps': 3101, 'loss/train': 2.218130350112915} 11/06/2021 21:42:07 - INFO - __main__ - Step 3103: {'lr': 0.0004999316041560102, 'samples': 595776, 'steps': 3102, 'loss/train': 1.9381036758422852} 11/06/2021 21:42:07 - INFO - __main__ - Step 3104: {'lr': 0.0004999314799749745, 'samples': 595968, 'steps': 3103, 'loss/train': 3.2617650032043457} 11/06/2021 21:42:07 - INFO - __main__ - Step 3105: {'lr': 0.0004999313556813235, 'samples': 596160, 'steps': 3104, 'loss/train': 2.255150079727173} 11/06/2021 21:42:08 - INFO - __main__ - Step 3106: {'lr': 0.0004999312312750573, 'samples': 596352, 'steps': 3105, 'loss/train': 1.6376298666000366} 11/06/2021 21:42:09 - INFO - __main__ - Step 3107: {'lr': 0.000499931106756176, 'samples': 596544, 'steps': 3106, 'loss/train': 2.3858001232147217} 11/06/2021 21:42:09 - INFO - __main__ - Step 3108: {'lr': 0.0004999309821246795, 'samples': 596736, 'steps': 3107, 'loss/train': 2.6677393913269043} 11/06/2021 21:42:10 - INFO - __main__ - Step 3109: {'lr': 0.000499930857380568, 'samples': 596928, 'steps': 3108, 'loss/train': 1.4987972974777222} 11/06/2021 21:42:10 - INFO - __main__ - Step 3110: {'lr': 0.0004999307325238416, 'samples': 597120, 'steps': 3109, 'loss/train': 1.9903048276901245} 11/06/2021 21:42:10 - INFO - __main__ - Step 3111: {'lr': 0.0004999306075545002, 'samples': 597312, 'steps': 3110, 'loss/train': 2.7295010089874268} 11/06/2021 21:42:11 - INFO - __main__ - Step 3112: {'lr': 0.0004999304824725439, 'samples': 597504, 'steps': 3111, 'loss/train': 2.1662516593933105} 11/06/2021 21:42:12 - INFO - __main__ - Step 3113: {'lr': 0.0004999303572779727, 'samples': 597696, 'steps': 3112, 'loss/train': 1.8486442565917969} 11/06/2021 21:42:12 - INFO - __main__ - Step 3114: {'lr': 0.0004999302319707869, 'samples': 597888, 'steps': 3113, 'loss/train': 2.575617551803589} 11/06/2021 21:42:12 - INFO - __main__ - Step 3115: {'lr': 0.0004999301065509863, 'samples': 598080, 'steps': 3114, 'loss/train': 2.4557383060455322} 11/06/2021 21:42:13 - INFO - __main__ - Step 3116: {'lr': 0.0004999299810185712, 'samples': 598272, 'steps': 3115, 'loss/train': 1.887520670890808} 11/06/2021 21:42:14 - INFO - __main__ - Step 3117: {'lr': 0.0004999298553735413, 'samples': 598464, 'steps': 3116, 'loss/train': 2.2960591316223145} 11/06/2021 21:42:14 - INFO - __main__ - Step 3118: {'lr': 0.000499929729615897, 'samples': 598656, 'steps': 3117, 'loss/train': 2.5776569843292236} 11/06/2021 21:42:14 - INFO - __main__ - Step 3119: {'lr': 0.0004999296037456381, 'samples': 598848, 'steps': 3118, 'loss/train': 2.3189525604248047} 11/06/2021 21:42:15 - INFO - __main__ - Step 3120: {'lr': 0.0004999294777627649, 'samples': 599040, 'steps': 3119, 'loss/train': 1.3478337526321411} 11/06/2021 21:42:15 - INFO - __main__ - Step 3121: {'lr': 0.0004999293516672773, 'samples': 599232, 'steps': 3120, 'loss/train': 2.864377021789551} 11/06/2021 21:42:16 - INFO - __main__ - Step 3122: {'lr': 0.0004999292254591754, 'samples': 599424, 'steps': 3121, 'loss/train': 2.7079334259033203} 11/06/2021 21:42:16 - INFO - __main__ - Step 3123: {'lr': 0.0004999290991384591, 'samples': 599616, 'steps': 3122, 'loss/train': 2.133533477783203} 11/06/2021 21:42:17 - INFO - __main__ - Step 3124: {'lr': 0.0004999289727051289, 'samples': 599808, 'steps': 3123, 'loss/train': 2.1969847679138184} 11/06/2021 21:42:17 - INFO - __main__ - Step 3125: {'lr': 0.0004999288461591842, 'samples': 600000, 'steps': 3124, 'loss/train': 1.7113127708435059} 11/06/2021 21:42:17 - INFO - __main__ - Step 3126: {'lr': 0.0004999287195006257, 'samples': 600192, 'steps': 3125, 'loss/train': 2.617366313934326} 11/06/2021 21:42:19 - INFO - __main__ - Step 3127: {'lr': 0.000499928592729453, 'samples': 600384, 'steps': 3126, 'loss/train': 2.459381103515625} 11/06/2021 21:42:19 - INFO - __main__ - Step 3128: {'lr': 0.0004999284658456665, 'samples': 600576, 'steps': 3127, 'loss/train': 2.3718807697296143} 11/06/2021 21:42:19 - INFO - __main__ - Step 3129: {'lr': 0.000499928338849266, 'samples': 600768, 'steps': 3128, 'loss/train': 2.543180465698242} 11/06/2021 21:42:20 - INFO - __main__ - Step 3130: {'lr': 0.0004999282117402516, 'samples': 600960, 'steps': 3129, 'loss/train': 1.7423886060714722} 11/06/2021 21:42:20 - INFO - __main__ - Step 3131: {'lr': 0.0004999280845186235, 'samples': 601152, 'steps': 3130, 'loss/train': 2.2052173614501953} 11/06/2021 21:42:21 - INFO - __main__ - Step 3132: {'lr': 0.0004999279571843816, 'samples': 601344, 'steps': 3131, 'loss/train': 1.9484204053878784} 11/06/2021 21:42:22 - INFO - __main__ - Step 3133: {'lr': 0.000499927829737526, 'samples': 601536, 'steps': 3132, 'loss/train': 2.428269147872925} 11/06/2021 21:42:22 - INFO - __main__ - Step 3134: {'lr': 0.0004999277021780569, 'samples': 601728, 'steps': 3133, 'loss/train': 2.089491128921509} 11/06/2021 21:42:22 - INFO - __main__ - Step 3135: {'lr': 0.0004999275745059741, 'samples': 601920, 'steps': 3134, 'loss/train': 2.9237546920776367} 11/06/2021 21:42:23 - INFO - __main__ - Step 3136: {'lr': 0.0004999274467212779, 'samples': 602112, 'steps': 3135, 'loss/train': 2.297293186187744} 11/06/2021 21:42:23 - INFO - __main__ - Step 3137: {'lr': 0.0004999273188239681, 'samples': 602304, 'steps': 3136, 'loss/train': 2.1000170707702637} 11/06/2021 21:42:24 - INFO - __main__ - Step 3138: {'lr': 0.0004999271908140451, 'samples': 602496, 'steps': 3137, 'loss/train': 2.391845703125} 11/06/2021 21:42:24 - INFO - __main__ - Step 3139: {'lr': 0.0004999270626915086, 'samples': 602688, 'steps': 3138, 'loss/train': 2.167910099029541} 11/06/2021 21:42:25 - INFO - __main__ - Step 3140: {'lr': 0.0004999269344563589, 'samples': 602880, 'steps': 3139, 'loss/train': 2.10351300239563} 11/06/2021 21:42:25 - INFO - __main__ - Step 3141: {'lr': 0.0004999268061085959, 'samples': 603072, 'steps': 3140, 'loss/train': 2.418550968170166} 11/06/2021 21:42:25 - INFO - __main__ - Step 3142: {'lr': 0.0004999266776482199, 'samples': 603264, 'steps': 3141, 'loss/train': 2.0939297676086426} 11/06/2021 21:42:27 - INFO - __main__ - Step 3143: {'lr': 0.0004999265490752306, 'samples': 603456, 'steps': 3142, 'loss/train': 2.4321014881134033} 11/06/2021 21:42:27 - INFO - __main__ - Step 3144: {'lr': 0.0004999264203896284, 'samples': 603648, 'steps': 3143, 'loss/train': 2.1129066944122314} 11/06/2021 21:42:27 - INFO - __main__ - Step 3145: {'lr': 0.0004999262915914132, 'samples': 603840, 'steps': 3144, 'loss/train': 1.9223228693008423} 11/06/2021 21:42:28 - INFO - __main__ - Step 3146: {'lr': 0.000499926162680585, 'samples': 604032, 'steps': 3145, 'loss/train': 1.9708632230758667} 11/06/2021 21:42:28 - INFO - __main__ - Step 3147: {'lr': 0.000499926033657144, 'samples': 604224, 'steps': 3146, 'loss/train': 2.535964250564575} 11/06/2021 21:42:29 - INFO - __main__ - Step 3148: {'lr': 0.0004999259045210901, 'samples': 604416, 'steps': 3147, 'loss/train': 1.4385805130004883} 11/06/2021 21:42:29 - INFO - __main__ - Step 3149: {'lr': 0.0004999257752724234, 'samples': 604608, 'steps': 3148, 'loss/train': 2.0960731506347656} 11/06/2021 21:42:30 - INFO - __main__ - Step 3150: {'lr': 0.0004999256459111443, 'samples': 604800, 'steps': 3149, 'loss/train': 1.577193021774292} 11/06/2021 21:42:30 - INFO - __main__ - Step 3151: {'lr': 0.0004999255164372523, 'samples': 604992, 'steps': 3150, 'loss/train': 1.8809454441070557} 11/06/2021 21:42:30 - INFO - __main__ - Step 3152: {'lr': 0.0004999253868507476, 'samples': 605184, 'steps': 3151, 'loss/train': 2.5812270641326904} 11/06/2021 21:42:31 - INFO - __main__ - Step 3153: {'lr': 0.0004999252571516306, 'samples': 605376, 'steps': 3152, 'loss/train': 2.324453353881836} 11/06/2021 21:42:32 - INFO - __main__ - Step 3154: {'lr': 0.0004999251273399011, 'samples': 605568, 'steps': 3153, 'loss/train': 1.5840191841125488} 11/06/2021 21:42:32 - INFO - __main__ - Step 3155: {'lr': 0.0004999249974155592, 'samples': 605760, 'steps': 3154, 'loss/train': 2.419705390930176} 11/06/2021 21:42:33 - INFO - __main__ - Step 3156: {'lr': 0.0004999248673786049, 'samples': 605952, 'steps': 3155, 'loss/train': 1.7660547494888306} 11/06/2021 21:42:33 - INFO - __main__ - Step 3157: {'lr': 0.0004999247372290383, 'samples': 606144, 'steps': 3156, 'loss/train': 3.5126683712005615} 11/06/2021 21:42:33 - INFO - __main__ - Step 3158: {'lr': 0.0004999246069668596, 'samples': 606336, 'steps': 3157, 'loss/train': 2.0734901428222656} 11/06/2021 21:42:34 - INFO - __main__ - Step 3159: {'lr': 0.0004999244765920687, 'samples': 606528, 'steps': 3158, 'loss/train': 2.5572397708892822} 11/06/2021 21:42:35 - INFO - __main__ - Step 3160: {'lr': 0.0004999243461046656, 'samples': 606720, 'steps': 3159, 'loss/train': 1.9452892541885376} 11/06/2021 21:42:35 - INFO - __main__ - Step 3161: {'lr': 0.0004999242155046504, 'samples': 606912, 'steps': 3160, 'loss/train': 0.912672221660614} 11/06/2021 21:42:35 - INFO - __main__ - Step 3162: {'lr': 0.0004999240847920233, 'samples': 607104, 'steps': 3161, 'loss/train': 1.7738767862319946} 11/06/2021 21:42:36 - INFO - __main__ - Step 3163: {'lr': 0.0004999239539667842, 'samples': 607296, 'steps': 3162, 'loss/train': 1.6333916187286377} 11/06/2021 21:42:37 - INFO - __main__ - Step 3164: {'lr': 0.0004999238230289333, 'samples': 607488, 'steps': 3163, 'loss/train': 2.6016409397125244} 11/06/2021 21:42:37 - INFO - __main__ - Step 3165: {'lr': 0.0004999236919784705, 'samples': 607680, 'steps': 3164, 'loss/train': 2.146538734436035} 11/06/2021 21:42:37 - INFO - __main__ - Step 3166: {'lr': 0.0004999235608153961, 'samples': 607872, 'steps': 3165, 'loss/train': 1.9972015619277954} 11/06/2021 21:42:38 - INFO - __main__ - Step 3167: {'lr': 0.0004999234295397098, 'samples': 608064, 'steps': 3166, 'loss/train': 2.117999315261841} 11/06/2021 21:42:38 - INFO - __main__ - Step 3168: {'lr': 0.000499923298151412, 'samples': 608256, 'steps': 3167, 'loss/train': 2.080382823944092} 11/06/2021 21:42:39 - INFO - __main__ - Step 3169: {'lr': 0.0004999231666505025, 'samples': 608448, 'steps': 3168, 'loss/train': 2.333287239074707} 11/06/2021 21:42:39 - INFO - __main__ - Step 3170: {'lr': 0.0004999230350369816, 'samples': 608640, 'steps': 3169, 'loss/train': 2.8902382850646973} 11/06/2021 21:42:40 - INFO - __main__ - Step 3171: {'lr': 0.0004999229033108492, 'samples': 608832, 'steps': 3170, 'loss/train': 1.6683903932571411} 11/06/2021 21:42:40 - INFO - __main__ - Step 3172: {'lr': 0.0004999227714721054, 'samples': 609024, 'steps': 3171, 'loss/train': 1.9496757984161377} 11/06/2021 21:42:40 - INFO - __main__ - Step 3173: {'lr': 0.0004999226395207501, 'samples': 609216, 'steps': 3172, 'loss/train': 2.0327744483947754} 11/06/2021 21:42:41 - INFO - __main__ - Step 3174: {'lr': 0.0004999225074567837, 'samples': 609408, 'steps': 3173, 'loss/train': 2.1947336196899414} 11/06/2021 21:42:42 - INFO - __main__ - Step 3175: {'lr': 0.000499922375280206, 'samples': 609600, 'steps': 3174, 'loss/train': 2.3255155086517334} 11/06/2021 21:42:42 - INFO - __main__ - Step 3176: {'lr': 0.0004999222429910171, 'samples': 609792, 'steps': 3175, 'loss/train': 2.275817394256592} 11/06/2021 21:42:42 - INFO - __main__ - Step 3177: {'lr': 0.0004999221105892172, 'samples': 609984, 'steps': 3176, 'loss/train': 2.14980411529541} 11/06/2021 21:42:43 - INFO - __main__ - Step 3178: {'lr': 0.0004999219780748062, 'samples': 610176, 'steps': 3177, 'loss/train': 3.107052803039551} 11/06/2021 21:42:44 - INFO - __main__ - Step 3179: {'lr': 0.0004999218454477843, 'samples': 610368, 'steps': 3178, 'loss/train': 1.9000006914138794} 11/06/2021 21:42:44 - INFO - __main__ - Step 3180: {'lr': 0.0004999217127081514, 'samples': 610560, 'steps': 3179, 'loss/train': 2.2246599197387695} 11/06/2021 21:42:44 - INFO - __main__ - Step 3181: {'lr': 0.0004999215798559076, 'samples': 610752, 'steps': 3180, 'loss/train': 2.3011488914489746} 11/06/2021 21:42:45 - INFO - __main__ - Step 3182: {'lr': 0.000499921446891053, 'samples': 610944, 'steps': 3181, 'loss/train': 2.965468168258667} 11/06/2021 21:42:45 - INFO - __main__ - Step 3183: {'lr': 0.0004999213138135877, 'samples': 611136, 'steps': 3182, 'loss/train': 2.451305389404297} 11/06/2021 21:42:46 - INFO - __main__ - Step 3184: {'lr': 0.0004999211806235117, 'samples': 611328, 'steps': 3183, 'loss/train': 2.616995334625244} 11/06/2021 21:42:47 - INFO - __main__ - Step 3185: {'lr': 0.000499921047320825, 'samples': 611520, 'steps': 3184, 'loss/train': 1.994917869567871} 11/06/2021 21:42:47 - INFO - __main__ - Step 3186: {'lr': 0.0004999209139055278, 'samples': 611712, 'steps': 3185, 'loss/train': 2.5971717834472656} 11/06/2021 21:42:47 - INFO - __main__ - Step 3187: {'lr': 0.0004999207803776201, 'samples': 611904, 'steps': 3186, 'loss/train': 2.031585693359375} 11/06/2021 21:42:48 - INFO - __main__ - Step 3188: {'lr': 0.000499920646737102, 'samples': 612096, 'steps': 3187, 'loss/train': 2.4705827236175537} 11/06/2021 21:42:48 - INFO - __main__ - Step 3189: {'lr': 0.0004999205129839734, 'samples': 612288, 'steps': 3188, 'loss/train': 2.301380157470703} 11/06/2021 21:42:49 - INFO - __main__ - Step 3190: {'lr': 0.0004999203791182345, 'samples': 612480, 'steps': 3189, 'loss/train': 2.5135273933410645} 11/06/2021 21:42:49 - INFO - __main__ - Step 3191: {'lr': 0.0004999202451398853, 'samples': 612672, 'steps': 3190, 'loss/train': 2.0862905979156494} 11/06/2021 21:42:50 - INFO - __main__ - Step 3192: {'lr': 0.000499920111048926, 'samples': 612864, 'steps': 3191, 'loss/train': 2.3521876335144043} 11/06/2021 21:42:50 - INFO - __main__ - Step 3193: {'lr': 0.0004999199768453565, 'samples': 613056, 'steps': 3192, 'loss/train': 5.900242328643799} 11/06/2021 21:42:50 - INFO - __main__ - Step 3194: {'lr': 0.0004999198425291769, 'samples': 613248, 'steps': 3193, 'loss/train': 2.1287336349487305} 11/06/2021 21:42:52 - INFO - __main__ - Step 3195: {'lr': 0.0004999197081003873, 'samples': 613440, 'steps': 3194, 'loss/train': 1.8804785013198853} 11/06/2021 21:42:52 - INFO - __main__ - Step 3196: {'lr': 0.0004999195735589877, 'samples': 613632, 'steps': 3195, 'loss/train': 2.1079533100128174} 11/06/2021 21:42:52 - INFO - __main__ - Step 3197: {'lr': 0.0004999194389049783, 'samples': 613824, 'steps': 3196, 'loss/train': 2.4482831954956055} 11/06/2021 21:42:53 - INFO - __main__ - Step 3198: {'lr': 0.0004999193041383588, 'samples': 614016, 'steps': 3197, 'loss/train': 2.2492079734802246} 11/06/2021 21:42:53 - INFO - __main__ - Step 3199: {'lr': 0.0004999191692591299, 'samples': 614208, 'steps': 3198, 'loss/train': 1.8088970184326172} 11/06/2021 21:42:54 - INFO - __main__ - Step 3200: {'lr': 0.000499919034267291, 'samples': 614400, 'steps': 3199, 'loss/train': 2.175657272338867} 11/06/2021 21:42:54 - INFO - __main__ - Step 3201: {'lr': 0.0004999188991628425, 'samples': 614592, 'steps': 3200, 'loss/train': 2.242400646209717} 11/06/2021 21:42:55 - INFO - __main__ - Step 3202: {'lr': 0.0004999187639457844, 'samples': 614784, 'steps': 3201, 'loss/train': 2.0254077911376953} 11/06/2021 21:42:55 - INFO - __main__ - Step 3203: {'lr': 0.0004999186286161169, 'samples': 614976, 'steps': 3202, 'loss/train': 2.909585952758789} 11/06/2021 21:42:55 - INFO - __main__ - Step 3204: {'lr': 0.0004999184931738397, 'samples': 615168, 'steps': 3203, 'loss/train': 1.7180776596069336} 11/06/2021 21:42:56 - INFO - __main__ - Step 3205: {'lr': 0.0004999183576189532, 'samples': 615360, 'steps': 3204, 'loss/train': 2.3504021167755127} 11/06/2021 21:42:57 - INFO - __main__ - Step 3206: {'lr': 0.0004999182219514573, 'samples': 615552, 'steps': 3205, 'loss/train': 2.0548884868621826} 11/06/2021 21:42:57 - INFO - __main__ - Step 3207: {'lr': 0.0004999180861713522, 'samples': 615744, 'steps': 3206, 'loss/train': 2.1941933631896973} 11/06/2021 21:42:58 - INFO - __main__ - Step 3208: {'lr': 0.0004999179502786377, 'samples': 615936, 'steps': 3207, 'loss/train': 6.997474193572998} 11/06/2021 21:42:58 - INFO - __main__ - Step 3209: {'lr': 0.0004999178142733141, 'samples': 616128, 'steps': 3208, 'loss/train': 2.0363223552703857} 11/06/2021 21:42:59 - INFO - __main__ - Step 3210: {'lr': 0.0004999176781553815, 'samples': 616320, 'steps': 3209, 'loss/train': 2.108640432357788} 11/06/2021 21:42:59 - INFO - __main__ - Step 3211: {'lr': 0.0004999175419248398, 'samples': 616512, 'steps': 3210, 'loss/train': 2.0920512676239014} 11/06/2021 21:43:00 - INFO - __main__ - Step 3212: {'lr': 0.0004999174055816891, 'samples': 616704, 'steps': 3211, 'loss/train': 2.339956760406494} 11/06/2021 21:43:00 - INFO - __main__ - Step 3213: {'lr': 0.0004999172691259293, 'samples': 616896, 'steps': 3212, 'loss/train': 1.8541127443313599} 11/06/2021 21:43:01 - INFO - __main__ - Step 3214: {'lr': 0.0004999171325575609, 'samples': 617088, 'steps': 3213, 'loss/train': 1.9735697507858276} 11/06/2021 21:43:01 - INFO - __main__ - Step 3215: {'lr': 0.0004999169958765836, 'samples': 617280, 'steps': 3214, 'loss/train': 2.325428009033203} 11/06/2021 21:43:02 - INFO - __main__ - Step 3216: {'lr': 0.0004999168590829975, 'samples': 617472, 'steps': 3215, 'loss/train': 2.0824670791625977} 11/06/2021 21:43:02 - INFO - __main__ - Step 3217: {'lr': 0.0004999167221768028, 'samples': 617664, 'steps': 3216, 'loss/train': 2.0834946632385254} 11/06/2021 21:43:03 - INFO - __main__ - Step 3218: {'lr': 0.0004999165851579994, 'samples': 617856, 'steps': 3217, 'loss/train': 1.993496060371399} 11/06/2021 21:43:03 - INFO - __main__ - Step 3219: {'lr': 0.0004999164480265875, 'samples': 618048, 'steps': 3218, 'loss/train': 2.200866460800171} 11/06/2021 21:43:03 - INFO - __main__ - Step 3220: {'lr': 0.0004999163107825671, 'samples': 618240, 'steps': 3219, 'loss/train': 2.439110040664673} 11/06/2021 21:43:04 - INFO - __main__ - Step 3221: {'lr': 0.0004999161734259383, 'samples': 618432, 'steps': 3220, 'loss/train': 1.0018621683120728} 11/06/2021 21:43:05 - INFO - __main__ - Step 3222: {'lr': 0.0004999160359567011, 'samples': 618624, 'steps': 3221, 'loss/train': 1.9945106506347656} 11/06/2021 21:43:05 - INFO - __main__ - Step 3223: {'lr': 0.0004999158983748555, 'samples': 618816, 'steps': 3222, 'loss/train': 2.565582036972046} 11/06/2021 21:43:06 - INFO - __main__ - Step 3224: {'lr': 0.0004999157606804018, 'samples': 619008, 'steps': 3223, 'loss/train': 2.3982090950012207} 11/06/2021 21:43:06 - INFO - __main__ - Step 3225: {'lr': 0.0004999156228733398, 'samples': 619200, 'steps': 3224, 'loss/train': 2.0050203800201416} 11/06/2021 21:43:07 - INFO - __main__ - Step 3226: {'lr': 0.0004999154849536698, 'samples': 619392, 'steps': 3225, 'loss/train': 2.3978424072265625} 11/06/2021 21:43:07 - INFO - __main__ - Step 3227: {'lr': 0.0004999153469213917, 'samples': 619584, 'steps': 3226, 'loss/train': 1.5546318292617798} 11/06/2021 21:43:08 - INFO - __main__ - Step 3228: {'lr': 0.0004999152087765055, 'samples': 619776, 'steps': 3227, 'loss/train': 1.9464197158813477} 11/06/2021 21:43:08 - INFO - __main__ - Step 3229: {'lr': 0.0004999150705190114, 'samples': 619968, 'steps': 3228, 'loss/train': 2.4329493045806885} 11/06/2021 21:43:08 - INFO - __main__ - Step 3230: {'lr': 0.0004999149321489095, 'samples': 620160, 'steps': 3229, 'loss/train': 3.806713581085205} 11/06/2021 21:43:09 - INFO - __main__ - Step 3231: {'lr': 0.0004999147936661997, 'samples': 620352, 'steps': 3230, 'loss/train': 1.9667352437973022} 11/06/2021 21:43:10 - INFO - __main__ - Step 3232: {'lr': 0.0004999146550708822, 'samples': 620544, 'steps': 3231, 'loss/train': 2.300558567047119} 11/06/2021 21:43:10 - INFO - __main__ - Step 3233: {'lr': 0.000499914516362957, 'samples': 620736, 'steps': 3232, 'loss/train': 1.624573826789856} 11/06/2021 21:43:10 - INFO - __main__ - Step 3234: {'lr': 0.0004999143775424241, 'samples': 620928, 'steps': 3233, 'loss/train': 2.1690218448638916} 11/06/2021 21:43:11 - INFO - __main__ - Step 3235: {'lr': 0.0004999142386092838, 'samples': 621120, 'steps': 3234, 'loss/train': 1.306965708732605} 11/06/2021 21:43:12 - INFO - __main__ - Step 3236: {'lr': 0.000499914099563536, 'samples': 621312, 'steps': 3235, 'loss/train': 2.066814422607422} 11/06/2021 21:43:12 - INFO - __main__ - Step 3237: {'lr': 0.0004999139604051806, 'samples': 621504, 'steps': 3236, 'loss/train': 2.289586305618286} 11/06/2021 21:43:12 - INFO - __main__ - Step 3238: {'lr': 0.0004999138211342179, 'samples': 621696, 'steps': 3237, 'loss/train': 2.0394601821899414} 11/06/2021 21:43:13 - INFO - __main__ - Step 3239: {'lr': 0.0004999136817506478, 'samples': 621888, 'steps': 3238, 'loss/train': 2.375570297241211} 11/06/2021 21:43:13 - INFO - __main__ - Step 3240: {'lr': 0.0004999135422544707, 'samples': 622080, 'steps': 3239, 'loss/train': 1.8367230892181396} 11/06/2021 21:43:14 - INFO - __main__ - Step 3241: {'lr': 0.0004999134026456862, 'samples': 622272, 'steps': 3240, 'loss/train': 2.2936620712280273} 11/06/2021 21:43:14 - INFO - __main__ - Step 3242: {'lr': 0.0004999132629242946, 'samples': 622464, 'steps': 3241, 'loss/train': 2.675550937652588} 11/06/2021 21:43:15 - INFO - __main__ - Step 3243: {'lr': 0.000499913123090296, 'samples': 622656, 'steps': 3242, 'loss/train': 2.318620204925537} 11/06/2021 21:43:15 - INFO - __main__ - Step 3244: {'lr': 0.0004999129831436904, 'samples': 622848, 'steps': 3243, 'loss/train': 2.7929880619049072} 11/06/2021 21:43:15 - INFO - __main__ - Step 3245: {'lr': 0.0004999128430844778, 'samples': 623040, 'steps': 3244, 'loss/train': 1.8520056009292603} 11/06/2021 21:43:16 - INFO - __main__ - Step 3246: {'lr': 0.0004999127029126585, 'samples': 623232, 'steps': 3245, 'loss/train': 1.9427516460418701} 11/06/2021 21:43:17 - INFO - __main__ - Step 3247: {'lr': 0.0004999125626282322, 'samples': 623424, 'steps': 3246, 'loss/train': 2.054605484008789} 11/06/2021 21:43:17 - INFO - __main__ - Step 3248: {'lr': 0.0004999124222311993, 'samples': 623616, 'steps': 3247, 'loss/train': 1.6573278903961182} 11/06/2021 21:43:17 - INFO - __main__ - Step 3249: {'lr': 0.0004999122817215595, 'samples': 623808, 'steps': 3248, 'loss/train': 1.8662605285644531} 11/06/2021 21:43:18 - INFO - __main__ - Step 3250: {'lr': 0.0004999121410993133, 'samples': 624000, 'steps': 3249, 'loss/train': 2.0213828086853027} 11/06/2021 21:43:19 - INFO - __main__ - Step 3251: {'lr': 0.0004999120003644604, 'samples': 624192, 'steps': 3250, 'loss/train': 2.3014163970947266} 11/06/2021 21:43:19 - INFO - __main__ - Step 3252: {'lr': 0.0004999118595170011, 'samples': 624384, 'steps': 3251, 'loss/train': 2.2614598274230957} 11/06/2021 21:43:20 - INFO - __main__ - Step 3253: {'lr': 0.0004999117185569354, 'samples': 624576, 'steps': 3252, 'loss/train': 2.0688071250915527} 11/06/2021 21:43:20 - INFO - __main__ - Step 3254: {'lr': 0.0004999115774842633, 'samples': 624768, 'steps': 3253, 'loss/train': 1.5158661603927612} 11/06/2021 21:43:20 - INFO - __main__ - Step 3255: {'lr': 0.0004999114362989849, 'samples': 624960, 'steps': 3254, 'loss/train': 2.2318007946014404} 11/06/2021 21:43:21 - INFO - __main__ - Step 3256: {'lr': 0.0004999112950011002, 'samples': 625152, 'steps': 3255, 'loss/train': 1.8678102493286133} 11/06/2021 21:43:22 - INFO - __main__ - Step 3257: {'lr': 0.0004999111535906094, 'samples': 625344, 'steps': 3256, 'loss/train': 2.0674848556518555} 11/06/2021 21:43:22 - INFO - __main__ - Step 3258: {'lr': 0.0004999110120675125, 'samples': 625536, 'steps': 3257, 'loss/train': 1.557898998260498} 11/06/2021 21:43:23 - INFO - __main__ - Step 3259: {'lr': 0.0004999108704318095, 'samples': 625728, 'steps': 3258, 'loss/train': 1.9527461528778076} 11/06/2021 21:43:23 - INFO - __main__ - Step 3260: {'lr': 0.0004999107286835006, 'samples': 625920, 'steps': 3259, 'loss/train': 0.5624179244041443} 11/06/2021 21:43:23 - INFO - __main__ - Step 3261: {'lr': 0.0004999105868225858, 'samples': 626112, 'steps': 3260, 'loss/train': 2.3505382537841797} 11/06/2021 21:43:24 - INFO - __main__ - Step 3262: {'lr': 0.0004999104448490649, 'samples': 626304, 'steps': 3261, 'loss/train': 1.716923475265503} 11/06/2021 21:43:25 - INFO - __main__ - Step 3263: {'lr': 0.0004999103027629384, 'samples': 626496, 'steps': 3262, 'loss/train': 2.0216355323791504} 11/06/2021 21:43:25 - INFO - __main__ - Step 3264: {'lr': 0.0004999101605642061, 'samples': 626688, 'steps': 3263, 'loss/train': 2.3034634590148926} 11/06/2021 21:43:25 - INFO - __main__ - Step 3265: {'lr': 0.0004999100182528683, 'samples': 626880, 'steps': 3264, 'loss/train': 2.1556177139282227} 11/06/2021 21:43:26 - INFO - __main__ - Step 3266: {'lr': 0.0004999098758289248, 'samples': 627072, 'steps': 3265, 'loss/train': 2.250798463821411} 11/06/2021 21:43:27 - INFO - __main__ - Step 3267: {'lr': 0.0004999097332923758, 'samples': 627264, 'steps': 3266, 'loss/train': 2.280691623687744} 11/06/2021 21:43:27 - INFO - __main__ - Step 3268: {'lr': 0.0004999095906432213, 'samples': 627456, 'steps': 3267, 'loss/train': 2.3557052612304688} 11/06/2021 21:43:28 - INFO - __main__ - Step 3269: {'lr': 0.0004999094478814613, 'samples': 627648, 'steps': 3268, 'loss/train': 2.4633193016052246} 11/06/2021 21:43:28 - INFO - __main__ - Step 3270: {'lr': 0.0004999093050070961, 'samples': 627840, 'steps': 3269, 'loss/train': 5.240302562713623} 11/06/2021 21:43:28 - INFO - __main__ - Step 3271: {'lr': 0.0004999091620201255, 'samples': 628032, 'steps': 3270, 'loss/train': 4.105412006378174} 11/06/2021 21:43:29 - INFO - __main__ - Step 3272: {'lr': 0.0004999090189205498, 'samples': 628224, 'steps': 3271, 'loss/train': 2.502105951309204} 11/06/2021 21:43:29 - INFO - __main__ - Step 3273: {'lr': 0.0004999088757083689, 'samples': 628416, 'steps': 3272, 'loss/train': 2.263211250305176} 11/06/2021 21:43:30 - INFO - __main__ - Step 3274: {'lr': 0.0004999087323835829, 'samples': 628608, 'steps': 3273, 'loss/train': 1.1719061136245728} 11/06/2021 21:43:31 - INFO - __main__ - Step 3275: {'lr': 0.0004999085889461919, 'samples': 628800, 'steps': 3274, 'loss/train': 2.298391103744507} 11/06/2021 21:43:31 - INFO - __main__ - Step 3276: {'lr': 0.0004999084453961959, 'samples': 628992, 'steps': 3275, 'loss/train': 1.848705530166626} 11/06/2021 21:43:31 - INFO - __main__ - Step 3277: {'lr': 0.0004999083017335951, 'samples': 629184, 'steps': 3276, 'loss/train': 2.3697509765625} 11/06/2021 21:43:32 - INFO - __main__ - Step 3278: {'lr': 0.0004999081579583895, 'samples': 629376, 'steps': 3277, 'loss/train': 1.937208652496338} 11/06/2021 21:43:33 - INFO - __main__ - Step 3279: {'lr': 0.0004999080140705791, 'samples': 629568, 'steps': 3278, 'loss/train': 2.3870885372161865} 11/06/2021 21:43:33 - INFO - __main__ - Step 3280: {'lr': 0.0004999078700701639, 'samples': 629760, 'steps': 3279, 'loss/train': 2.4358227252960205} 11/06/2021 21:43:33 - INFO - __main__ - Step 3281: {'lr': 0.0004999077259571442, 'samples': 629952, 'steps': 3280, 'loss/train': 2.139662504196167} 11/06/2021 21:43:34 - INFO - __main__ - Step 3282: {'lr': 0.0004999075817315199, 'samples': 630144, 'steps': 3281, 'loss/train': 2.3960695266723633} 11/06/2021 21:43:34 - INFO - __main__ - Step 3283: {'lr': 0.0004999074373932911, 'samples': 630336, 'steps': 3282, 'loss/train': 1.9690511226654053} 11/06/2021 21:43:35 - INFO - __main__ - Step 3284: {'lr': 0.0004999072929424579, 'samples': 630528, 'steps': 3283, 'loss/train': 1.8000233173370361} 11/06/2021 21:43:35 - INFO - __main__ - Step 3285: {'lr': 0.0004999071483790203, 'samples': 630720, 'steps': 3284, 'loss/train': 2.917104721069336} 11/06/2021 21:43:36 - INFO - __main__ - Step 3286: {'lr': 0.0004999070037029783, 'samples': 630912, 'steps': 3285, 'loss/train': 2.3763699531555176} 11/06/2021 21:43:36 - INFO - __main__ - Step 3287: {'lr': 0.0004999068589143322, 'samples': 631104, 'steps': 3286, 'loss/train': 2.728950023651123} 11/06/2021 21:43:36 - INFO - __main__ - Step 3288: {'lr': 0.0004999067140130819, 'samples': 631296, 'steps': 3287, 'loss/train': 2.0896875858306885} 11/06/2021 21:43:37 - INFO - __main__ - Step 3289: {'lr': 0.0004999065689992273, 'samples': 631488, 'steps': 3288, 'loss/train': 2.303443431854248} 11/06/2021 21:43:38 - INFO - __main__ - Step 3290: {'lr': 0.0004999064238727689, 'samples': 631680, 'steps': 3289, 'loss/train': 2.483797073364258} 11/06/2021 21:43:38 - INFO - __main__ - Step 3291: {'lr': 0.0004999062786337064, 'samples': 631872, 'steps': 3290, 'loss/train': 1.8650814294815063} 11/06/2021 21:43:39 - INFO - __main__ - Step 3292: {'lr': 0.0004999061332820401, 'samples': 632064, 'steps': 3291, 'loss/train': 1.7333624362945557} 11/06/2021 21:43:39 - INFO - __main__ - Step 3293: {'lr': 0.0004999059878177699, 'samples': 632256, 'steps': 3292, 'loss/train': 2.357539176940918} 11/06/2021 21:43:40 - INFO - __main__ - Step 3294: {'lr': 0.0004999058422408959, 'samples': 632448, 'steps': 3293, 'loss/train': 2.1987416744232178} 11/06/2021 21:43:40 - INFO - __main__ - Step 3295: {'lr': 0.0004999056965514181, 'samples': 632640, 'steps': 3294, 'loss/train': 1.9118432998657227} 11/06/2021 21:43:41 - INFO - __main__ - Step 3296: {'lr': 0.0004999055507493368, 'samples': 632832, 'steps': 3295, 'loss/train': 2.26961612701416} 11/06/2021 21:43:41 - INFO - __main__ - Step 3297: {'lr': 0.0004999054048346517, 'samples': 633024, 'steps': 3296, 'loss/train': 2.7712512016296387} 11/06/2021 21:43:41 - INFO - __main__ - Step 3298: {'lr': 0.0004999052588073633, 'samples': 633216, 'steps': 3297, 'loss/train': 2.1366281509399414} 11/06/2021 21:43:42 - INFO - __main__ - Step 3299: {'lr': 0.0004999051126674714, 'samples': 633408, 'steps': 3298, 'loss/train': 1.3426307439804077} 11/06/2021 21:43:43 - INFO - __main__ - Step 3300: {'lr': 0.0004999049664149761, 'samples': 633600, 'steps': 3299, 'loss/train': 3.066981315612793} 11/06/2021 21:43:43 - INFO - __main__ - Step 3301: {'lr': 0.0004999048200498774, 'samples': 633792, 'steps': 3300, 'loss/train': 2.1740355491638184} 11/06/2021 21:43:43 - INFO - __main__ - Step 3302: {'lr': 0.0004999046735721755, 'samples': 633984, 'steps': 3301, 'loss/train': 2.4104175567626953} 11/06/2021 21:43:44 - INFO - __main__ - Step 3303: {'lr': 0.0004999045269818704, 'samples': 634176, 'steps': 3302, 'loss/train': 2.4756033420562744} 11/06/2021 21:43:45 - INFO - __main__ - Step 3304: {'lr': 0.0004999043802789622, 'samples': 634368, 'steps': 3303, 'loss/train': 1.4023329019546509} 11/06/2021 21:43:45 - INFO - __main__ - Step 3305: {'lr': 0.000499904233463451, 'samples': 634560, 'steps': 3304, 'loss/train': 1.6189985275268555} 11/06/2021 21:43:45 - INFO - __main__ - Step 3306: {'lr': 0.0004999040865353367, 'samples': 634752, 'steps': 3305, 'loss/train': 1.5078233480453491} 11/06/2021 21:43:46 - INFO - __main__ - Step 3307: {'lr': 0.0004999039394946196, 'samples': 634944, 'steps': 3306, 'loss/train': 1.8554377555847168} 11/06/2021 21:43:46 - INFO - __main__ - Step 3308: {'lr': 0.0004999037923412995, 'samples': 635136, 'steps': 3307, 'loss/train': 2.242231845855713} 11/06/2021 21:43:47 - INFO - __main__ - Step 3309: {'lr': 0.0004999036450753767, 'samples': 635328, 'steps': 3308, 'loss/train': 2.1639621257781982} 11/06/2021 21:43:47 - INFO - __main__ - Step 3310: {'lr': 0.0004999034976968511, 'samples': 635520, 'steps': 3309, 'loss/train': 2.1223878860473633} 11/06/2021 21:43:48 - INFO - __main__ - Step 3311: {'lr': 0.0004999033502057228, 'samples': 635712, 'steps': 3310, 'loss/train': 2.3255131244659424} 11/06/2021 21:43:48 - INFO - __main__ - Step 3312: {'lr': 0.000499903202601992, 'samples': 635904, 'steps': 3311, 'loss/train': 2.050856113433838} 11/06/2021 21:43:48 - INFO - __main__ - Step 3313: {'lr': 0.0004999030548856586, 'samples': 636096, 'steps': 3312, 'loss/train': 1.3736777305603027} 11/06/2021 21:43:49 - INFO - __main__ - Step 3314: {'lr': 0.0004999029070567229, 'samples': 636288, 'steps': 3313, 'loss/train': 2.3344199657440186} 11/06/2021 21:43:50 - INFO - __main__ - Step 3315: {'lr': 0.0004999027591151847, 'samples': 636480, 'steps': 3314, 'loss/train': 2.361743927001953} 11/06/2021 21:43:50 - INFO - __main__ - Step 3316: {'lr': 0.0004999026110610442, 'samples': 636672, 'steps': 3315, 'loss/train': 1.3730331659317017} 11/06/2021 21:43:51 - INFO - __main__ - Step 3317: {'lr': 0.0004999024628943014, 'samples': 636864, 'steps': 3316, 'loss/train': 2.4670941829681396} 11/06/2021 21:43:51 - INFO - __main__ - Step 3318: {'lr': 0.0004999023146149565, 'samples': 637056, 'steps': 3317, 'loss/train': 1.93985116481781} 11/06/2021 21:43:51 - INFO - __main__ - Step 3319: {'lr': 0.0004999021662230093, 'samples': 637248, 'steps': 3318, 'loss/train': 1.4250794649124146} 11/06/2021 21:43:52 - INFO - __main__ - Step 3320: {'lr': 0.0004999020177184601, 'samples': 637440, 'steps': 3319, 'loss/train': 2.3671536445617676} 11/06/2021 21:43:53 - INFO - __main__ - Step 3321: {'lr': 0.000499901869101309, 'samples': 637632, 'steps': 3320, 'loss/train': 2.1804182529449463} 11/06/2021 21:43:53 - INFO - __main__ - Step 3322: {'lr': 0.0004999017203715559, 'samples': 637824, 'steps': 3321, 'loss/train': 2.0329232215881348} 11/06/2021 21:43:53 - INFO - __main__ - Step 3323: {'lr': 0.000499901571529201, 'samples': 638016, 'steps': 3322, 'loss/train': 1.9539903402328491} 11/06/2021 21:43:54 - INFO - __main__ - Step 3324: {'lr': 0.0004999014225742442, 'samples': 638208, 'steps': 3323, 'loss/train': 1.762226939201355} 11/06/2021 21:43:55 - INFO - __main__ - Step 3325: {'lr': 0.0004999012735066858, 'samples': 638400, 'steps': 3324, 'loss/train': 2.084671974182129} 11/06/2021 21:43:55 - INFO - __main__ - Step 3326: {'lr': 0.0004999011243265257, 'samples': 638592, 'steps': 3325, 'loss/train': 2.541097402572632} 11/06/2021 21:43:55 - INFO - __main__ - Step 3327: {'lr': 0.000499900975033764, 'samples': 638784, 'steps': 3326, 'loss/train': 2.444103240966797} 11/06/2021 21:43:56 - INFO - __main__ - Step 3328: {'lr': 0.0004999008256284008, 'samples': 638976, 'steps': 3327, 'loss/train': 2.3885836601257324} 11/06/2021 21:43:56 - INFO - __main__ - Step 3329: {'lr': 0.0004999006761104361, 'samples': 639168, 'steps': 3328, 'loss/train': 1.9459644556045532} 11/06/2021 21:43:57 - INFO - __main__ - Step 3330: {'lr': 0.0004999005264798701, 'samples': 639360, 'steps': 3329, 'loss/train': 2.031470775604248} 11/06/2021 21:43:58 - INFO - __main__ - Step 3331: {'lr': 0.0004999003767367027, 'samples': 639552, 'steps': 3330, 'loss/train': 2.199577808380127} 11/06/2021 21:43:58 - INFO - __main__ - Step 3332: {'lr': 0.0004999002268809339, 'samples': 639744, 'steps': 3331, 'loss/train': 2.1211884021759033} 11/06/2021 21:43:59 - INFO - __main__ - Step 3333: {'lr': 0.0004999000769125642, 'samples': 639936, 'steps': 3332, 'loss/train': 3.076802968978882} 11/06/2021 21:43:59 - INFO - __main__ - Step 3334: {'lr': 0.0004998999268315932, 'samples': 640128, 'steps': 3333, 'loss/train': 2.300365447998047} 11/06/2021 21:43:59 - INFO - __main__ - Step 3335: {'lr': 0.0004998997766380212, 'samples': 640320, 'steps': 3334, 'loss/train': 2.047560214996338} 11/06/2021 21:44:00 - INFO - __main__ - Step 3336: {'lr': 0.0004998996263318482, 'samples': 640512, 'steps': 3335, 'loss/train': 2.0564115047454834} 11/06/2021 21:44:00 - INFO - __main__ - Step 3337: {'lr': 0.0004998994759130743, 'samples': 640704, 'steps': 3336, 'loss/train': 1.807005763053894} 11/06/2021 21:44:01 - INFO - __main__ - Step 3338: {'lr': 0.0004998993253816996, 'samples': 640896, 'steps': 3337, 'loss/train': 2.106369733810425} 11/06/2021 21:44:01 - INFO - __main__ - Step 3339: {'lr': 0.000499899174737724, 'samples': 641088, 'steps': 3338, 'loss/train': 2.363450765609741} 11/06/2021 21:44:02 - INFO - __main__ - Step 3340: {'lr': 0.0004998990239811477, 'samples': 641280, 'steps': 3339, 'loss/train': 1.9341990947723389} 11/06/2021 21:44:03 - INFO - __main__ - Step 3341: {'lr': 0.0004998988731119709, 'samples': 641472, 'steps': 3340, 'loss/train': 2.0908918380737305} 11/06/2021 21:44:03 - INFO - __main__ - Step 3342: {'lr': 0.0004998987221301935, 'samples': 641664, 'steps': 3341, 'loss/train': 2.078077793121338} 11/06/2021 21:44:03 - INFO - __main__ - Step 3343: {'lr': 0.0004998985710358155, 'samples': 641856, 'steps': 3342, 'loss/train': 1.9851315021514893} 11/06/2021 21:44:04 - INFO - __main__ - Step 3344: {'lr': 0.0004998984198288371, 'samples': 642048, 'steps': 3343, 'loss/train': 2.1168434619903564} 11/06/2021 21:44:04 - INFO - __main__ - Step 3345: {'lr': 0.0004998982685092583, 'samples': 642240, 'steps': 3344, 'loss/train': 2.281632900238037} 11/06/2021 21:44:05 - INFO - __main__ - Step 3346: {'lr': 0.0004998981170770792, 'samples': 642432, 'steps': 3345, 'loss/train': 2.356718063354492} 11/06/2021 21:44:05 - INFO - __main__ - Step 3347: {'lr': 0.0004998979655323, 'samples': 642624, 'steps': 3346, 'loss/train': 1.8560844659805298} 11/06/2021 21:44:06 - INFO - __main__ - Step 3348: {'lr': 0.0004998978138749204, 'samples': 642816, 'steps': 3347, 'loss/train': 1.9993972778320312} 11/06/2021 21:44:06 - INFO - __main__ - Step 3349: {'lr': 0.0004998976621049408, 'samples': 643008, 'steps': 3348, 'loss/train': 2.5754454135894775} 11/06/2021 21:44:06 - INFO - __main__ - Step 3350: {'lr': 0.0004998975102223612, 'samples': 643200, 'steps': 3349, 'loss/train': 2.1721551418304443} 11/06/2021 21:44:08 - INFO - __main__ - Step 3351: {'lr': 0.0004998973582271817, 'samples': 643392, 'steps': 3350, 'loss/train': 1.9218946695327759} 11/06/2021 21:44:08 - INFO - __main__ - Step 3352: {'lr': 0.0004998972061194022, 'samples': 643584, 'steps': 3351, 'loss/train': 2.063539743423462} 11/06/2021 21:44:08 - INFO - __main__ - Step 3353: {'lr': 0.0004998970538990228, 'samples': 643776, 'steps': 3352, 'loss/train': 2.91345477104187} 11/06/2021 21:44:09 - INFO - __main__ - Step 3354: {'lr': 0.0004998969015660438, 'samples': 643968, 'steps': 3353, 'loss/train': 1.8647414445877075} 11/06/2021 21:44:09 - INFO - __main__ - Step 3355: {'lr': 0.0004998967491204651, 'samples': 644160, 'steps': 3354, 'loss/train': 2.418487787246704} 11/06/2021 21:44:09 - INFO - __main__ - Step 3356: {'lr': 0.0004998965965622867, 'samples': 644352, 'steps': 3355, 'loss/train': 1.3407362699508667} 11/06/2021 21:44:11 - INFO - __main__ - Step 3357: {'lr': 0.0004998964438915088, 'samples': 644544, 'steps': 3356, 'loss/train': 2.0935134887695312} 11/06/2021 21:44:11 - INFO - __main__ - Step 3358: {'lr': 0.0004998962911081314, 'samples': 644736, 'steps': 3357, 'loss/train': 2.340810775756836} 11/06/2021 21:44:11 - INFO - __main__ - Step 3359: {'lr': 0.0004998961382121546, 'samples': 644928, 'steps': 3358, 'loss/train': 2.3082637786865234} 11/06/2021 21:44:12 - INFO - __main__ - Step 3360: {'lr': 0.0004998959852035785, 'samples': 645120, 'steps': 3359, 'loss/train': 1.8081005811691284} 11/06/2021 21:44:12 - INFO - __main__ - Step 3361: {'lr': 0.0004998958320824031, 'samples': 645312, 'steps': 3360, 'loss/train': 2.52008318901062} 11/06/2021 21:44:13 - INFO - __main__ - Step 3362: {'lr': 0.0004998956788486284, 'samples': 645504, 'steps': 3361, 'loss/train': 2.574852228164673} 11/06/2021 21:44:13 - INFO - __main__ - Step 3363: {'lr': 0.0004998955255022547, 'samples': 645696, 'steps': 3362, 'loss/train': 2.9360525608062744} 11/06/2021 21:44:14 - INFO - __main__ - Step 3364: {'lr': 0.0004998953720432818, 'samples': 645888, 'steps': 3363, 'loss/train': 2.0350911617279053} 11/06/2021 21:44:14 - INFO - __main__ - Step 3365: {'lr': 0.00049989521847171, 'samples': 646080, 'steps': 3364, 'loss/train': 1.7591485977172852} 11/06/2021 21:44:14 - INFO - __main__ - Step 3366: {'lr': 0.0004998950647875392, 'samples': 646272, 'steps': 3365, 'loss/train': 1.764539122581482} 11/06/2021 21:44:15 - INFO - __main__ - Step 3367: {'lr': 0.0004998949109907697, 'samples': 646464, 'steps': 3366, 'loss/train': 2.29533052444458} 11/06/2021 21:44:16 - INFO - __main__ - Step 3368: {'lr': 0.0004998947570814012, 'samples': 646656, 'steps': 3367, 'loss/train': 2.4703001976013184} 11/06/2021 21:44:16 - INFO - __main__ - Step 3369: {'lr': 0.0004998946030594341, 'samples': 646848, 'steps': 3368, 'loss/train': 2.0543324947357178} 11/06/2021 21:44:16 - INFO - __main__ - Step 3370: {'lr': 0.0004998944489248683, 'samples': 647040, 'steps': 3369, 'loss/train': 2.8122878074645996} 11/06/2021 21:44:17 - INFO - __main__ - Step 3371: {'lr': 0.000499894294677704, 'samples': 647232, 'steps': 3370, 'loss/train': 2.024826765060425} 11/06/2021 21:44:18 - INFO - __main__ - Step 3372: {'lr': 0.000499894140317941, 'samples': 647424, 'steps': 3371, 'loss/train': 2.372493267059326} 11/06/2021 21:44:18 - INFO - __main__ - Step 3373: {'lr': 0.0004998939858455798, 'samples': 647616, 'steps': 3372, 'loss/train': 1.8553732633590698} 11/06/2021 21:44:19 - INFO - __main__ - Step 3374: {'lr': 0.0004998938312606201, 'samples': 647808, 'steps': 3373, 'loss/train': 6.488889694213867} 11/06/2021 21:44:19 - INFO - __main__ - Step 3375: {'lr': 0.000499893676563062, 'samples': 648000, 'steps': 3374, 'loss/train': 2.1932318210601807} 11/06/2021 21:44:19 - INFO - __main__ - Step 3376: {'lr': 0.0004998935217529058, 'samples': 648192, 'steps': 3375, 'loss/train': 2.0442917346954346} 11/06/2021 21:44:20 - INFO - __main__ - Step 3377: {'lr': 0.0004998933668301514, 'samples': 648384, 'steps': 3376, 'loss/train': 1.9358372688293457} 11/06/2021 21:44:21 - INFO - __main__ - Step 3378: {'lr': 0.0004998932117947989, 'samples': 648576, 'steps': 3377, 'loss/train': 1.9061901569366455} 11/06/2021 21:44:21 - INFO - __main__ - Step 3379: {'lr': 0.0004998930566468484, 'samples': 648768, 'steps': 3378, 'loss/train': 2.5020077228546143} 11/06/2021 21:44:21 - INFO - __main__ - Step 3380: {'lr': 0.0004998929013863, 'samples': 648960, 'steps': 3379, 'loss/train': 1.080916404724121} 11/06/2021 21:44:22 - INFO - __main__ - Step 3381: {'lr': 0.0004998927460131535, 'samples': 649152, 'steps': 3380, 'loss/train': 2.488973379135132} 11/06/2021 21:44:22 - INFO - __main__ - Step 3382: {'lr': 0.0004998925905274094, 'samples': 649344, 'steps': 3381, 'loss/train': 1.9152915477752686} 11/06/2021 21:44:23 - INFO - __main__ - Step 3383: {'lr': 0.0004998924349290674, 'samples': 649536, 'steps': 3382, 'loss/train': 0.7557737827301025} 11/06/2021 21:44:24 - INFO - __main__ - Step 3384: {'lr': 0.0004998922792181278, 'samples': 649728, 'steps': 3383, 'loss/train': 1.4994975328445435} 11/06/2021 21:44:24 - INFO - __main__ - Step 3385: {'lr': 0.0004998921233945907, 'samples': 649920, 'steps': 3384, 'loss/train': 1.6330103874206543} 11/06/2021 21:44:24 - INFO - __main__ - Step 3386: {'lr': 0.0004998919674584559, 'samples': 650112, 'steps': 3385, 'loss/train': 1.484663486480713} 11/06/2021 21:44:25 - INFO - __main__ - Step 3387: {'lr': 0.0004998918114097237, 'samples': 650304, 'steps': 3386, 'loss/train': 2.346414566040039} 11/06/2021 21:44:26 - INFO - __main__ - Step 3388: {'lr': 0.0004998916552483941, 'samples': 650496, 'steps': 3387, 'loss/train': 1.351393699645996} 11/06/2021 21:44:26 - INFO - __main__ - Step 3389: {'lr': 0.0004998914989744671, 'samples': 650688, 'steps': 3388, 'loss/train': 2.10522198677063} 11/06/2021 21:44:26 - INFO - __main__ - Step 3390: {'lr': 0.000499891342587943, 'samples': 650880, 'steps': 3389, 'loss/train': 1.9814081192016602} 11/06/2021 21:44:27 - INFO - __main__ - Step 3391: {'lr': 0.0004998911860888217, 'samples': 651072, 'steps': 3390, 'loss/train': 1.96201491355896} 11/06/2021 21:44:27 - INFO - __main__ - Step 3392: {'lr': 0.0004998910294771032, 'samples': 651264, 'steps': 3391, 'loss/train': 2.6143369674682617} 11/06/2021 21:44:28 - INFO - __main__ - Step 3393: {'lr': 0.0004998908727527877, 'samples': 651456, 'steps': 3392, 'loss/train': 2.523289442062378} 11/06/2021 21:44:28 - INFO - __main__ - Step 3394: {'lr': 0.0004998907159158752, 'samples': 651648, 'steps': 3393, 'loss/train': 1.4275950193405151} 11/06/2021 21:44:29 - INFO - __main__ - Step 3395: {'lr': 0.0004998905589663658, 'samples': 651840, 'steps': 3394, 'loss/train': 2.1565451622009277} 11/06/2021 21:44:29 - INFO - __main__ - Step 3396: {'lr': 0.0004998904019042596, 'samples': 652032, 'steps': 3395, 'loss/train': 2.2461233139038086} 11/06/2021 21:44:30 - INFO - __main__ - Step 3397: {'lr': 0.0004998902447295567, 'samples': 652224, 'steps': 3396, 'loss/train': 2.1691172122955322} 11/06/2021 21:44:31 - INFO - __main__ - Step 3398: {'lr': 0.000499890087442257, 'samples': 652416, 'steps': 3397, 'loss/train': 2.0089993476867676} 11/06/2021 21:44:31 - INFO - __main__ - Step 3399: {'lr': 0.0004998899300423607, 'samples': 652608, 'steps': 3398, 'loss/train': 2.1266562938690186} 11/06/2021 21:44:31 - INFO - __main__ - Step 3400: {'lr': 0.0004998897725298679, 'samples': 652800, 'steps': 3399, 'loss/train': 1.9188565015792847} 11/06/2021 21:44:32 - INFO - __main__ - Step 3401: {'lr': 0.0004998896149047786, 'samples': 652992, 'steps': 3400, 'loss/train': 2.4505343437194824} 11/06/2021 21:44:32 - INFO - __main__ - Step 3402: {'lr': 0.0004998894571670929, 'samples': 653184, 'steps': 3401, 'loss/train': 2.3925869464874268} 11/06/2021 21:44:32 - INFO - __main__ - Step 3403: {'lr': 0.0004998892993168109, 'samples': 653376, 'steps': 3402, 'loss/train': 2.170724391937256} 11/06/2021 21:44:33 - INFO - __main__ - Step 3404: {'lr': 0.0004998891413539326, 'samples': 653568, 'steps': 3403, 'loss/train': 1.9499857425689697} 11/06/2021 21:44:34 - INFO - __main__ - Step 3405: {'lr': 0.0004998889832784581, 'samples': 653760, 'steps': 3404, 'loss/train': 2.608039379119873} 11/06/2021 21:44:34 - INFO - __main__ - Step 3406: {'lr': 0.0004998888250903875, 'samples': 653952, 'steps': 3405, 'loss/train': 2.1653521060943604} 11/06/2021 21:44:34 - INFO - __main__ - Step 3407: {'lr': 0.0004998886667897209, 'samples': 654144, 'steps': 3406, 'loss/train': 2.1557538509368896} 11/06/2021 21:44:35 - INFO - __main__ - Step 3408: {'lr': 0.0004998885083764582, 'samples': 654336, 'steps': 3407, 'loss/train': 2.54028582572937} 11/06/2021 21:44:36 - INFO - __main__ - Step 3409: {'lr': 0.0004998883498505996, 'samples': 654528, 'steps': 3408, 'loss/train': 2.0685014724731445} 11/06/2021 21:44:36 - INFO - __main__ - Step 3410: {'lr': 0.0004998881912121453, 'samples': 654720, 'steps': 3409, 'loss/train': 1.7899914979934692} 11/06/2021 21:44:37 - INFO - __main__ - Step 3411: {'lr': 0.0004998880324610952, 'samples': 654912, 'steps': 3410, 'loss/train': 2.2215492725372314} 11/06/2021 21:44:37 - INFO - __main__ - Step 3412: {'lr': 0.0004998878735974493, 'samples': 655104, 'steps': 3411, 'loss/train': 2.2348644733428955} 11/06/2021 21:44:37 - INFO - __main__ - Step 3413: {'lr': 0.0004998877146212079, 'samples': 655296, 'steps': 3412, 'loss/train': 2.385568618774414} 11/06/2021 21:44:38 - INFO - __main__ - Step 3414: {'lr': 0.0004998875555323708, 'samples': 655488, 'steps': 3413, 'loss/train': 2.1982250213623047} 11/06/2021 21:44:38 - INFO - __main__ - Step 3415: {'lr': 0.0004998873963309384, 'samples': 655680, 'steps': 3414, 'loss/train': 2.085184097290039} 11/06/2021 21:44:39 - INFO - __main__ - Step 3416: {'lr': 0.0004998872370169105, 'samples': 655872, 'steps': 3415, 'loss/train': 2.228264808654785} 11/06/2021 21:44:39 - INFO - __main__ - Step 3417: {'lr': 0.0004998870775902872, 'samples': 656064, 'steps': 3416, 'loss/train': 2.356597900390625} 11/06/2021 21:44:40 - INFO - __main__ - Step 3418: {'lr': 0.0004998869180510688, 'samples': 656256, 'steps': 3417, 'loss/train': 1.9543739557266235} 11/06/2021 21:44:41 - INFO - __main__ - Step 3419: {'lr': 0.0004998867583992551, 'samples': 656448, 'steps': 3418, 'loss/train': 1.8418172597885132} 11/06/2021 21:44:41 - INFO - __main__ - Step 3420: {'lr': 0.0004998865986348464, 'samples': 656640, 'steps': 3419, 'loss/train': 1.725966453552246} 11/06/2021 21:44:42 - INFO - __main__ - Step 3421: {'lr': 0.0004998864387578426, 'samples': 656832, 'steps': 3420, 'loss/train': 1.9191683530807495} 11/06/2021 21:44:42 - INFO - __main__ - Step 3422: {'lr': 0.0004998862787682438, 'samples': 657024, 'steps': 3421, 'loss/train': 2.177302598953247} 11/06/2021 21:44:42 - INFO - __main__ - Step 3423: {'lr': 0.00049988611866605, 'samples': 657216, 'steps': 3422, 'loss/train': 1.9818519353866577} 11/06/2021 21:44:43 - INFO - __main__ - Step 3424: {'lr': 0.0004998859584512615, 'samples': 657408, 'steps': 3423, 'loss/train': 2.42037296295166} 11/06/2021 21:44:44 - INFO - __main__ - Step 3425: {'lr': 0.0004998857981238782, 'samples': 657600, 'steps': 3424, 'loss/train': 0.6604869961738586} 11/06/2021 21:44:44 - INFO - __main__ - Step 3426: {'lr': 0.0004998856376839003, 'samples': 657792, 'steps': 3425, 'loss/train': 0.3991982340812683} 11/06/2021 21:44:45 - INFO - __main__ - Step 3427: {'lr': 0.0004998854771313277, 'samples': 657984, 'steps': 3426, 'loss/train': 2.094467878341675} 11/06/2021 21:44:45 - INFO - __main__ - Step 3428: {'lr': 0.0004998853164661606, 'samples': 658176, 'steps': 3427, 'loss/train': 1.7819459438323975} 11/06/2021 21:44:45 - INFO - __main__ - Step 3429: {'lr': 0.000499885155688399, 'samples': 658368, 'steps': 3428, 'loss/train': 2.345151901245117} 11/06/2021 21:44:46 - INFO - __main__ - Step 3430: {'lr': 0.000499884994798043, 'samples': 658560, 'steps': 3429, 'loss/train': 1.721934199333191} 11/06/2021 21:44:47 - INFO - __main__ - Step 3431: {'lr': 0.0004998848337950927, 'samples': 658752, 'steps': 3430, 'loss/train': 1.6747421026229858} 11/06/2021 21:44:47 - INFO - __main__ - Step 3432: {'lr': 0.0004998846726795482, 'samples': 658944, 'steps': 3431, 'loss/train': 2.4393625259399414} 11/06/2021 21:44:47 - INFO - __main__ - Step 3433: {'lr': 0.0004998845114514095, 'samples': 659136, 'steps': 3432, 'loss/train': 1.8727471828460693} 11/06/2021 21:44:48 - INFO - __main__ - Step 3434: {'lr': 0.0004998843501106766, 'samples': 659328, 'steps': 3433, 'loss/train': 2.1409528255462646} 11/06/2021 21:44:49 - INFO - __main__ - Step 3435: {'lr': 0.0004998841886573496, 'samples': 659520, 'steps': 3434, 'loss/train': 2.510873317718506} 11/06/2021 21:44:49 - INFO - __main__ - Step 3436: {'lr': 0.0004998840270914288, 'samples': 659712, 'steps': 3435, 'loss/train': 1.8569953441619873} 11/06/2021 21:44:50 - INFO - __main__ - Step 3437: {'lr': 0.0004998838654129142, 'samples': 659904, 'steps': 3436, 'loss/train': 1.0409950017929077} 11/06/2021 21:44:50 - INFO - __main__ - Step 3438: {'lr': 0.0004998837036218056, 'samples': 660096, 'steps': 3437, 'loss/train': 2.066178798675537} 11/06/2021 21:44:51 - INFO - __main__ - Step 3439: {'lr': 0.0004998835417181033, 'samples': 660288, 'steps': 3438, 'loss/train': 1.7412632703781128} 11/06/2021 21:44:52 - INFO - __main__ - Step 3440: {'lr': 0.0004998833797018074, 'samples': 660480, 'steps': 3439, 'loss/train': 1.823896050453186} 11/06/2021 21:44:52 - INFO - __main__ - Step 3441: {'lr': 0.0004998832175729179, 'samples': 660672, 'steps': 3440, 'loss/train': 2.1356678009033203} 11/06/2021 21:44:52 - INFO - __main__ - Step 3442: {'lr': 0.0004998830553314349, 'samples': 660864, 'steps': 3441, 'loss/train': 1.7477060556411743} 11/06/2021 21:44:53 - INFO - __main__ - Step 3443: {'lr': 0.0004998828929773583, 'samples': 661056, 'steps': 3442, 'loss/train': 2.1472830772399902} 11/06/2021 21:44:53 - INFO - __main__ - Step 3444: {'lr': 0.0004998827305106884, 'samples': 661248, 'steps': 3443, 'loss/train': 1.7395120859146118} 11/06/2021 21:44:54 - INFO - __main__ - Step 3445: {'lr': 0.0004998825679314253, 'samples': 661440, 'steps': 3444, 'loss/train': 2.5220179557800293} 11/06/2021 21:44:54 - INFO - __main__ - Step 3446: {'lr': 0.0004998824052395689, 'samples': 661632, 'steps': 3445, 'loss/train': 2.2687270641326904} 11/06/2021 21:44:55 - INFO - __main__ - Step 3447: {'lr': 0.0004998822424351193, 'samples': 661824, 'steps': 3446, 'loss/train': 2.2349984645843506} 11/06/2021 21:44:55 - INFO - __main__ - Step 3448: {'lr': 0.0004998820795180766, 'samples': 662016, 'steps': 3447, 'loss/train': 2.5539913177490234} 11/06/2021 21:44:55 - INFO - __main__ - Step 3449: {'lr': 0.000499881916488441, 'samples': 662208, 'steps': 3448, 'loss/train': 1.9206302165985107} 11/06/2021 21:44:57 - INFO - __main__ - Step 3450: {'lr': 0.0004998817533462123, 'samples': 662400, 'steps': 3449, 'loss/train': 2.2440977096557617} 11/06/2021 21:44:57 - INFO - __main__ - Step 3451: {'lr': 0.0004998815900913909, 'samples': 662592, 'steps': 3450, 'loss/train': 1.2015467882156372} 11/06/2021 21:44:57 - INFO - __main__ - Step 3452: {'lr': 0.0004998814267239767, 'samples': 662784, 'steps': 3451, 'loss/train': 2.6361520290374756} 11/06/2021 21:44:58 - INFO - __main__ - Step 3453: {'lr': 0.0004998812632439697, 'samples': 662976, 'steps': 3452, 'loss/train': 1.7184646129608154} 11/06/2021 21:44:58 - INFO - __main__ - Step 3454: {'lr': 0.00049988109965137, 'samples': 663168, 'steps': 3453, 'loss/train': 2.2237768173217773} 11/06/2021 21:44:58 - INFO - __main__ - Step 3455: {'lr': 0.000499880935946178, 'samples': 663360, 'steps': 3454, 'loss/train': 1.60335111618042} 11/06/2021 21:44:59 - INFO - __main__ - Step 3456: {'lr': 0.0004998807721283932, 'samples': 663552, 'steps': 3455, 'loss/train': 4.451015472412109} 11/06/2021 21:45:00 - INFO - __main__ - Step 3457: {'lr': 0.0004998806081980162, 'samples': 663744, 'steps': 3456, 'loss/train': 2.589240074157715} 11/06/2021 21:45:00 - INFO - __main__ - Step 3458: {'lr': 0.0004998804441550467, 'samples': 663936, 'steps': 3457, 'loss/train': 1.5202341079711914} 11/06/2021 21:45:00 - INFO - __main__ - Step 3459: {'lr': 0.000499880279999485, 'samples': 664128, 'steps': 3458, 'loss/train': 2.2377476692199707} 11/06/2021 21:45:01 - INFO - __main__ - Step 3460: {'lr': 0.0004998801157313311, 'samples': 664320, 'steps': 3459, 'loss/train': 2.0967650413513184} 11/06/2021 21:45:02 - INFO - __main__ - Step 3461: {'lr': 0.0004998799513505851, 'samples': 664512, 'steps': 3460, 'loss/train': 2.196791648864746} 11/06/2021 21:45:02 - INFO - __main__ - Step 3462: {'lr': 0.000499879786857247, 'samples': 664704, 'steps': 3461, 'loss/train': 2.2895452976226807} 11/06/2021 21:45:02 - INFO - __main__ - Step 3463: {'lr': 0.0004998796222513169, 'samples': 664896, 'steps': 3462, 'loss/train': 2.4116644859313965} 11/06/2021 21:45:03 - INFO - __main__ - Step 3464: {'lr': 0.000499879457532795, 'samples': 665088, 'steps': 3463, 'loss/train': 2.3247618675231934} 11/06/2021 21:45:03 - INFO - __main__ - Step 3465: {'lr': 0.0004998792927016812, 'samples': 665280, 'steps': 3464, 'loss/train': 2.557910203933716} 11/06/2021 21:45:04 - INFO - __main__ - Step 3466: {'lr': 0.0004998791277579757, 'samples': 665472, 'steps': 3465, 'loss/train': 2.2992501258850098} 11/06/2021 21:45:04 - INFO - __main__ - Step 3467: {'lr': 0.0004998789627016784, 'samples': 665664, 'steps': 3466, 'loss/train': 2.114197254180908} 11/06/2021 21:45:05 - INFO - __main__ - Step 3468: {'lr': 0.0004998787975327896, 'samples': 665856, 'steps': 3467, 'loss/train': 2.2296416759490967} 11/06/2021 21:45:05 - INFO - __main__ - Step 3469: {'lr': 0.0004998786322513093, 'samples': 666048, 'steps': 3468, 'loss/train': 2.309483528137207} 11/06/2021 21:45:05 - INFO - __main__ - Step 3470: {'lr': 0.0004998784668572375, 'samples': 666240, 'steps': 3469, 'loss/train': 1.9443069696426392} 11/06/2021 21:45:06 - INFO - __main__ - Step 3471: {'lr': 0.0004998783013505743, 'samples': 666432, 'steps': 3470, 'loss/train': 2.0072755813598633} 11/06/2021 21:45:07 - INFO - __main__ - Step 3472: {'lr': 0.0004998781357313198, 'samples': 666624, 'steps': 3471, 'loss/train': 2.455528974533081} 11/06/2021 21:45:07 - INFO - __main__ - Step 3473: {'lr': 0.0004998779699994741, 'samples': 666816, 'steps': 3472, 'loss/train': 2.9603943824768066} 11/06/2021 21:45:07 - INFO - __main__ - Step 3474: {'lr': 0.0004998778041550372, 'samples': 667008, 'steps': 3473, 'loss/train': 2.1143574714660645} 11/06/2021 21:45:08 - INFO - __main__ - Step 3475: {'lr': 0.0004998776381980092, 'samples': 667200, 'steps': 3474, 'loss/train': 2.124244213104248} 11/06/2021 21:45:08 - INFO - __main__ - Step 3476: {'lr': 0.0004998774721283903, 'samples': 667392, 'steps': 3475, 'loss/train': 2.0879619121551514} 11/06/2021 21:45:09 - INFO - __main__ - Step 3477: {'lr': 0.0004998773059461803, 'samples': 667584, 'steps': 3476, 'loss/train': 2.3110809326171875} 11/06/2021 21:45:10 - INFO - __main__ - Step 3478: {'lr': 0.0004998771396513796, 'samples': 667776, 'steps': 3477, 'loss/train': 2.4501490592956543} 11/06/2021 21:45:10 - INFO - __main__ - Step 3479: {'lr': 0.000499876973243988, 'samples': 667968, 'steps': 3478, 'loss/train': 1.411661148071289} 11/06/2021 21:45:10 - INFO - __main__ - Step 3480: {'lr': 0.0004998768067240059, 'samples': 668160, 'steps': 3479, 'loss/train': 2.6114776134490967} 11/06/2021 21:45:11 - INFO - __main__ - Step 3481: {'lr': 0.0004998766400914329, 'samples': 668352, 'steps': 3480, 'loss/train': 2.1029934883117676} 11/06/2021 21:45:12 - INFO - __main__ - Step 3482: {'lr': 0.0004998764733462694, 'samples': 668544, 'steps': 3481, 'loss/train': 1.7680230140686035} 11/06/2021 21:45:12 - INFO - __main__ - Step 3483: {'lr': 0.0004998763064885155, 'samples': 668736, 'steps': 3482, 'loss/train': 2.153639554977417} 11/06/2021 21:45:12 - INFO - __main__ - Step 3484: {'lr': 0.0004998761395181712, 'samples': 668928, 'steps': 3483, 'loss/train': 2.03013014793396} 11/06/2021 21:45:13 - INFO - __main__ - Step 3485: {'lr': 0.0004998759724352365, 'samples': 669120, 'steps': 3484, 'loss/train': 0.6211015582084656} 11/06/2021 21:45:13 - INFO - __main__ - Step 3486: {'lr': 0.0004998758052397115, 'samples': 669312, 'steps': 3485, 'loss/train': 1.9367340803146362} 11/06/2021 21:45:14 - INFO - __main__ - Step 3487: {'lr': 0.0004998756379315964, 'samples': 669504, 'steps': 3486, 'loss/train': 1.8777623176574707} 11/06/2021 21:45:14 - INFO - __main__ - Step 3488: {'lr': 0.0004998754705108912, 'samples': 669696, 'steps': 3487, 'loss/train': 2.432539939880371} 11/06/2021 21:45:15 - INFO - __main__ - Step 3489: {'lr': 0.000499875302977596, 'samples': 669888, 'steps': 3488, 'loss/train': 2.30979585647583} 11/06/2021 21:45:15 - INFO - __main__ - Step 3490: {'lr': 0.0004998751353317108, 'samples': 670080, 'steps': 3489, 'loss/train': 2.282562017440796} 11/06/2021 21:45:15 - INFO - __main__ - Step 3491: {'lr': 0.0004998749675732357, 'samples': 670272, 'steps': 3490, 'loss/train': 2.008915662765503} 11/06/2021 21:45:17 - INFO - __main__ - Step 3492: {'lr': 0.0004998747997021708, 'samples': 670464, 'steps': 3491, 'loss/train': 2.295339584350586} 11/06/2021 21:45:17 - INFO - __main__ - Step 3493: {'lr': 0.0004998746317185162, 'samples': 670656, 'steps': 3492, 'loss/train': 2.1820249557495117} 11/06/2021 21:45:18 - INFO - __main__ - Step 3494: {'lr': 0.000499874463622272, 'samples': 670848, 'steps': 3493, 'loss/train': 2.051396131515503} 11/06/2021 21:45:18 - INFO - __main__ - Step 3495: {'lr': 0.000499874295413438, 'samples': 671040, 'steps': 3494, 'loss/train': 1.9268534183502197} 11/06/2021 21:45:18 - INFO - __main__ - Step 3496: {'lr': 0.0004998741270920147, 'samples': 671232, 'steps': 3495, 'loss/train': 2.2112491130828857} 11/06/2021 21:45:19 - INFO - __main__ - Step 3497: {'lr': 0.0004998739586580019, 'samples': 671424, 'steps': 3496, 'loss/train': 2.0771069526672363} 11/06/2021 21:45:19 - INFO - __main__ - Step 3498: {'lr': 0.0004998737901113999, 'samples': 671616, 'steps': 3497, 'loss/train': 5.947204113006592} 11/06/2021 21:45:20 - INFO - __main__ - Step 3499: {'lr': 0.0004998736214522084, 'samples': 671808, 'steps': 3498, 'loss/train': 3.6572864055633545} 11/06/2021 21:45:20 - INFO - __main__ - Step 3500: {'lr': 0.0004998734526804278, 'samples': 672000, 'steps': 3499, 'loss/train': 2.2772557735443115} 11/06/2021 21:45:21 - INFO - __main__ - Step 3501: {'lr': 0.0004998732837960581, 'samples': 672192, 'steps': 3500, 'loss/train': 1.6389656066894531} 11/06/2021 21:45:21 - INFO - __main__ - Step 3502: {'lr': 0.0004998731147990993, 'samples': 672384, 'steps': 3501, 'loss/train': 2.0018649101257324} 11/06/2021 21:45:21 - INFO - __main__ - Step 3503: {'lr': 0.0004998729456895516, 'samples': 672576, 'steps': 3502, 'loss/train': 2.645084857940674} 11/06/2021 21:45:23 - INFO - __main__ - Step 3504: {'lr': 0.0004998727764674149, 'samples': 672768, 'steps': 3503, 'loss/train': 2.0491695404052734} 11/06/2021 21:45:23 - INFO - __main__ - Step 3505: {'lr': 0.0004998726071326896, 'samples': 672960, 'steps': 3504, 'loss/train': 1.6065165996551514} 11/06/2021 21:45:23 - INFO - __main__ - Step 3506: {'lr': 0.0004998724376853754, 'samples': 673152, 'steps': 3505, 'loss/train': 1.9586995840072632} 11/06/2021 21:45:24 - INFO - __main__ - Step 3507: {'lr': 0.0004998722681254725, 'samples': 673344, 'steps': 3506, 'loss/train': 1.835058569908142} 11/06/2021 21:45:24 - INFO - __main__ - Step 3508: {'lr': 0.0004998720984529811, 'samples': 673536, 'steps': 3507, 'loss/train': 2.592658519744873} 11/06/2021 21:45:25 - INFO - __main__ - Step 3509: {'lr': 0.0004998719286679011, 'samples': 673728, 'steps': 3508, 'loss/train': 1.8576128482818604} 11/06/2021 21:45:25 - INFO - __main__ - Step 3510: {'lr': 0.0004998717587702328, 'samples': 673920, 'steps': 3509, 'loss/train': 1.9809731245040894} 11/06/2021 21:45:26 - INFO - __main__ - Step 3511: {'lr': 0.0004998715887599759, 'samples': 674112, 'steps': 3510, 'loss/train': 1.734330415725708} 11/06/2021 21:45:26 - INFO - __main__ - Step 3512: {'lr': 0.000499871418637131, 'samples': 674304, 'steps': 3511, 'loss/train': 1.7968868017196655} 11/06/2021 21:45:26 - INFO - __main__ - Step 3513: {'lr': 0.0004998712484016977, 'samples': 674496, 'steps': 3512, 'loss/train': 2.3915631771087646} 11/06/2021 21:45:27 - INFO - __main__ - Step 3514: {'lr': 0.0004998710780536763, 'samples': 674688, 'steps': 3513, 'loss/train': 2.1563174724578857} 11/06/2021 21:45:28 - INFO - __main__ - Step 3515: {'lr': 0.0004998709075930669, 'samples': 674880, 'steps': 3514, 'loss/train': 1.6511495113372803} 11/06/2021 21:45:28 - INFO - __main__ - Step 3516: {'lr': 0.0004998707370198695, 'samples': 675072, 'steps': 3515, 'loss/train': 2.0000085830688477} 11/06/2021 21:45:28 - INFO - __main__ - Step 3517: {'lr': 0.0004998705663340843, 'samples': 675264, 'steps': 3516, 'loss/train': 1.445250153541565} 11/06/2021 21:45:29 - INFO - __main__ - Step 3518: {'lr': 0.0004998703955357111, 'samples': 675456, 'steps': 3517, 'loss/train': 1.4034719467163086} 11/06/2021 21:45:30 - INFO - __main__ - Step 3519: {'lr': 0.0004998702246247502, 'samples': 675648, 'steps': 3518, 'loss/train': 2.529625415802002} 11/06/2021 21:45:30 - INFO - __main__ - Step 3520: {'lr': 0.0004998700536012017, 'samples': 675840, 'steps': 3519, 'loss/train': 2.3188982009887695} 11/06/2021 21:45:30 - INFO - __main__ - Step 3521: {'lr': 0.0004998698824650655, 'samples': 676032, 'steps': 3520, 'loss/train': 0.6927474737167358} 11/06/2021 21:45:31 - INFO - __main__ - Step 3522: {'lr': 0.000499869711216342, 'samples': 676224, 'steps': 3521, 'loss/train': 1.4308483600616455} 11/06/2021 21:45:31 - INFO - __main__ - Step 3523: {'lr': 0.0004998695398550309, 'samples': 676416, 'steps': 3522, 'loss/train': 2.8342642784118652} 11/06/2021 21:45:32 - INFO - __main__ - Step 3524: {'lr': 0.0004998693683811325, 'samples': 676608, 'steps': 3523, 'loss/train': 2.05680513381958} 11/06/2021 21:45:32 - INFO - __main__ - Step 3525: {'lr': 0.0004998691967946468, 'samples': 676800, 'steps': 3524, 'loss/train': 2.1919660568237305} 11/06/2021 21:45:33 - INFO - __main__ - Step 3526: {'lr': 0.000499869025095574, 'samples': 676992, 'steps': 3525, 'loss/train': 1.6675937175750732} 11/06/2021 21:45:33 - INFO - __main__ - Step 3527: {'lr': 0.0004998688532839139, 'samples': 677184, 'steps': 3526, 'loss/train': 2.5993382930755615} 11/06/2021 21:45:34 - INFO - __main__ - Step 3528: {'lr': 0.0004998686813596668, 'samples': 677376, 'steps': 3527, 'loss/train': 2.326568126678467} 11/06/2021 21:45:34 - INFO - __main__ - Step 3529: {'lr': 0.0004998685093228327, 'samples': 677568, 'steps': 3528, 'loss/train': 1.6023788452148438} 11/06/2021 21:45:35 - INFO - __main__ - Step 3530: {'lr': 0.0004998683371734118, 'samples': 677760, 'steps': 3529, 'loss/train': 1.248134732246399} 11/06/2021 21:45:35 - INFO - __main__ - Step 3531: {'lr': 0.000499868164911404, 'samples': 677952, 'steps': 3530, 'loss/train': 1.8184454441070557} 11/06/2021 21:45:36 - INFO - __main__ - Step 3532: {'lr': 0.0004998679925368094, 'samples': 678144, 'steps': 3531, 'loss/train': 1.9451133012771606} 11/06/2021 21:45:36 - INFO - __main__ - Step 3533: {'lr': 0.0004998678200496283, 'samples': 678336, 'steps': 3532, 'loss/train': 2.026451587677002} 11/06/2021 21:45:36 - INFO - __main__ - Step 3534: {'lr': 0.0004998676474498606, 'samples': 678528, 'steps': 3533, 'loss/train': 1.7797132730484009} 11/06/2021 21:45:37 - INFO - __main__ - Step 3535: {'lr': 0.0004998674747375063, 'samples': 678720, 'steps': 3534, 'loss/train': 2.1139495372772217} 11/06/2021 21:45:38 - INFO - __main__ - Step 3536: {'lr': 0.0004998673019125657, 'samples': 678912, 'steps': 3535, 'loss/train': 2.0227255821228027} 11/06/2021 21:45:38 - INFO - __main__ - Step 3537: {'lr': 0.0004998671289750386, 'samples': 679104, 'steps': 3536, 'loss/train': 2.357372283935547} 11/06/2021 21:45:38 - INFO - __main__ - Step 3538: {'lr': 0.0004998669559249252, 'samples': 679296, 'steps': 3537, 'loss/train': 2.0382673740386963} 11/06/2021 21:45:39 - INFO - __main__ - Step 3539: {'lr': 0.0004998667827622258, 'samples': 679488, 'steps': 3538, 'loss/train': 1.8378405570983887} 11/06/2021 21:45:40 - INFO - __main__ - Step 3540: {'lr': 0.0004998666094869402, 'samples': 679680, 'steps': 3539, 'loss/train': 1.9578657150268555} 11/06/2021 21:45:40 - INFO - __main__ - Step 3541: {'lr': 0.0004998664360990685, 'samples': 679872, 'steps': 3540, 'loss/train': 2.1692068576812744} 11/06/2021 21:45:41 - INFO - __main__ - Step 3542: {'lr': 0.0004998662625986109, 'samples': 680064, 'steps': 3541, 'loss/train': 1.96601140499115} 11/06/2021 21:45:41 - INFO - __main__ - Step 3543: {'lr': 0.0004998660889855674, 'samples': 680256, 'steps': 3542, 'loss/train': 2.4479618072509766} 11/06/2021 21:45:41 - INFO - __main__ - Step 3544: {'lr': 0.0004998659152599381, 'samples': 680448, 'steps': 3543, 'loss/train': 2.9186902046203613} 11/06/2021 21:45:43 - INFO - __main__ - Step 3545: {'lr': 0.000499865741421723, 'samples': 680640, 'steps': 3544, 'loss/train': 1.9552292823791504} 11/06/2021 21:45:43 - INFO - __main__ - Step 3546: {'lr': 0.0004998655674709224, 'samples': 680832, 'steps': 3545, 'loss/train': 2.1400067806243896} 11/06/2021 21:45:43 - INFO - __main__ - Step 3547: {'lr': 0.0004998653934075361, 'samples': 681024, 'steps': 3546, 'loss/train': 2.2904787063598633} 11/06/2021 21:45:44 - INFO - __main__ - Step 3548: {'lr': 0.0004998652192315644, 'samples': 681216, 'steps': 3547, 'loss/train': 1.8039276599884033} 11/06/2021 21:45:44 - INFO - __main__ - Step 3549: {'lr': 0.0004998650449430073, 'samples': 681408, 'steps': 3548, 'loss/train': 1.7798856496810913} 11/06/2021 21:45:45 - INFO - __main__ - Step 3550: {'lr': 0.0004998648705418648, 'samples': 681600, 'steps': 3549, 'loss/train': 1.5824065208435059} 11/06/2021 21:45:45 - INFO - __main__ - Step 3551: {'lr': 0.000499864696028137, 'samples': 681792, 'steps': 3550, 'loss/train': 1.8516236543655396} 11/06/2021 21:45:46 - INFO - __main__ - Step 3552: {'lr': 0.000499864521401824, 'samples': 681984, 'steps': 3551, 'loss/train': 2.100595474243164} 11/06/2021 21:45:46 - INFO - __main__ - Step 3553: {'lr': 0.000499864346662926, 'samples': 682176, 'steps': 3552, 'loss/train': 2.4325428009033203} 11/06/2021 21:45:46 - INFO - __main__ - Step 3554: {'lr': 0.000499864171811443, 'samples': 682368, 'steps': 3553, 'loss/train': 1.2289140224456787} 11/06/2021 21:45:47 - INFO - __main__ - Step 3555: {'lr': 0.0004998639968473751, 'samples': 682560, 'steps': 3554, 'loss/train': 2.2943685054779053} 11/06/2021 21:45:48 - INFO - __main__ - Step 3556: {'lr': 0.0004998638217707222, 'samples': 682752, 'steps': 3555, 'loss/train': 2.264774799346924} 11/06/2021 21:45:48 - INFO - __main__ - Step 3557: {'lr': 0.0004998636465814846, 'samples': 682944, 'steps': 3556, 'loss/train': 1.637046217918396} 11/06/2021 21:45:48 - INFO - __main__ - Step 3558: {'lr': 0.0004998634712796622, 'samples': 683136, 'steps': 3557, 'loss/train': 2.0852251052856445} 11/06/2021 21:45:49 - INFO - __main__ - Step 3559: {'lr': 0.0004998632958652554, 'samples': 683328, 'steps': 3558, 'loss/train': 1.820117473602295} 11/06/2021 21:45:50 - INFO - __main__ - Step 3560: {'lr': 0.0004998631203382639, 'samples': 683520, 'steps': 3559, 'loss/train': 2.074711561203003} 11/06/2021 21:45:50 - INFO - __main__ - Step 3561: {'lr': 0.0004998629446986879, 'samples': 683712, 'steps': 3560, 'loss/train': 2.1228792667388916} 11/06/2021 21:45:51 - INFO - __main__ - Step 3562: {'lr': 0.0004998627689465276, 'samples': 683904, 'steps': 3561, 'loss/train': 2.0950121879577637} 11/06/2021 21:45:51 - INFO - __main__ - Step 3563: {'lr': 0.0004998625930817829, 'samples': 684096, 'steps': 3562, 'loss/train': 2.041194200515747} 11/06/2021 21:45:51 - INFO - __main__ - Step 3564: {'lr': 0.0004998624171044541, 'samples': 684288, 'steps': 3563, 'loss/train': 1.9695250988006592} 11/06/2021 21:45:52 - INFO - __main__ - Step 3565: {'lr': 0.000499862241014541, 'samples': 684480, 'steps': 3564, 'loss/train': 2.691624164581299} 11/06/2021 21:45:53 - INFO - __main__ - Step 3566: {'lr': 0.0004998620648120439, 'samples': 684672, 'steps': 3565, 'loss/train': 1.932613730430603} 11/06/2021 21:45:53 - INFO - __main__ - Step 3567: {'lr': 0.0004998618884969628, 'samples': 684864, 'steps': 3566, 'loss/train': 2.155407190322876} 11/06/2021 21:45:53 - INFO - __main__ - Step 3568: {'lr': 0.0004998617120692977, 'samples': 685056, 'steps': 3567, 'loss/train': 1.8647109270095825} 11/06/2021 21:45:54 - INFO - __main__ - Step 3569: {'lr': 0.0004998615355290489, 'samples': 685248, 'steps': 3568, 'loss/train': 0.48862022161483765} 11/06/2021 21:45:55 - INFO - __main__ - Step 3570: {'lr': 0.0004998613588762163, 'samples': 685440, 'steps': 3569, 'loss/train': 2.087411642074585} 11/06/2021 21:45:55 - INFO - __main__ - Step 3571: {'lr': 0.0004998611821108001, 'samples': 685632, 'steps': 3570, 'loss/train': 2.365281820297241} 11/06/2021 21:45:55 - INFO - __main__ - Step 3572: {'lr': 0.0004998610052328002, 'samples': 685824, 'steps': 3571, 'loss/train': 1.9633747339248657} 11/06/2021 21:45:56 - INFO - __main__ - Step 3573: {'lr': 0.0004998608282422169, 'samples': 686016, 'steps': 3572, 'loss/train': 2.195950746536255} 11/06/2021 21:45:56 - INFO - __main__ - Step 3574: {'lr': 0.0004998606511390501, 'samples': 686208, 'steps': 3573, 'loss/train': 1.7376683950424194} 11/06/2021 21:45:56 - INFO - __main__ - Step 3575: {'lr': 0.0004998604739232999, 'samples': 686400, 'steps': 3574, 'loss/train': 2.2785563468933105} 11/06/2021 21:45:58 - INFO - __main__ - Step 3576: {'lr': 0.0004998602965949664, 'samples': 686592, 'steps': 3575, 'loss/train': 1.848177433013916} 11/06/2021 21:45:58 - INFO - __main__ - Step 3577: {'lr': 0.0004998601191540499, 'samples': 686784, 'steps': 3576, 'loss/train': 2.7372477054595947} 11/06/2021 21:45:58 - INFO - __main__ - Step 3578: {'lr': 0.0004998599416005502, 'samples': 686976, 'steps': 3577, 'loss/train': 2.2560975551605225} 11/06/2021 21:45:59 - INFO - __main__ - Step 3579: {'lr': 0.0004998597639344674, 'samples': 687168, 'steps': 3578, 'loss/train': 2.355379819869995} 11/06/2021 21:45:59 - INFO - __main__ - Step 3580: {'lr': 0.0004998595861558016, 'samples': 687360, 'steps': 3579, 'loss/train': 2.119218111038208} 11/06/2021 21:46:00 - INFO - __main__ - Step 3581: {'lr': 0.000499859408264553, 'samples': 687552, 'steps': 3580, 'loss/train': 1.7983760833740234} 11/06/2021 21:46:00 - INFO - __main__ - Step 3582: {'lr': 0.0004998592302607217, 'samples': 687744, 'steps': 3581, 'loss/train': 2.229571580886841} 11/06/2021 21:46:01 - INFO - __main__ - Step 3583: {'lr': 0.0004998590521443075, 'samples': 687936, 'steps': 3582, 'loss/train': 1.9575389623641968} 11/06/2021 21:46:01 - INFO - __main__ - Step 3584: {'lr': 0.0004998588739153108, 'samples': 688128, 'steps': 3583, 'loss/train': 1.7447315454483032} 11/06/2021 21:46:01 - INFO - __main__ - Step 3585: {'lr': 0.0004998586955737316, 'samples': 688320, 'steps': 3584, 'loss/train': 2.1006550788879395} 11/06/2021 21:46:02 - INFO - __main__ - Step 3586: {'lr': 0.0004998585171195698, 'samples': 688512, 'steps': 3585, 'loss/train': 3.3587646484375} 11/06/2021 21:46:03 - INFO - __main__ - Step 3587: {'lr': 0.0004998583385528256, 'samples': 688704, 'steps': 3586, 'loss/train': 2.2408502101898193} 11/06/2021 21:46:03 - INFO - __main__ - Step 3588: {'lr': 0.0004998581598734991, 'samples': 688896, 'steps': 3587, 'loss/train': 1.8953609466552734} 11/06/2021 21:46:04 - INFO - __main__ - Step 3589: {'lr': 0.0004998579810815905, 'samples': 689088, 'steps': 3588, 'loss/train': 2.701934576034546} 11/06/2021 21:46:04 - INFO - __main__ - Step 3590: {'lr': 0.0004998578021770995, 'samples': 689280, 'steps': 3589, 'loss/train': 1.9593024253845215} 11/06/2021 21:46:05 - INFO - __main__ - Step 3591: {'lr': 0.0004998576231600267, 'samples': 689472, 'steps': 3590, 'loss/train': 1.8404669761657715} 11/06/2021 21:46:05 - INFO - __main__ - Step 3592: {'lr': 0.0004998574440303718, 'samples': 689664, 'steps': 3591, 'loss/train': 2.3145079612731934} 11/06/2021 21:46:06 - INFO - __main__ - Step 3593: {'lr': 0.0004998572647881349, 'samples': 689856, 'steps': 3592, 'loss/train': 1.889504075050354} 11/06/2021 21:46:06 - INFO - __main__ - Step 3594: {'lr': 0.0004998570854333163, 'samples': 690048, 'steps': 3593, 'loss/train': 1.687559962272644} 11/06/2021 21:46:06 - INFO - __main__ - Step 3595: {'lr': 0.0004998569059659158, 'samples': 690240, 'steps': 3594, 'loss/train': 1.928205966949463} 11/06/2021 21:46:07 - INFO - __main__ - Step 3596: {'lr': 0.0004998567263859338, 'samples': 690432, 'steps': 3595, 'loss/train': 2.1471500396728516} 11/06/2021 21:46:08 - INFO - __main__ - Step 3597: {'lr': 0.0004998565466933702, 'samples': 690624, 'steps': 3596, 'loss/train': 2.6100289821624756} 11/06/2021 21:46:08 - INFO - __main__ - Step 3598: {'lr': 0.000499856366888225, 'samples': 690816, 'steps': 3597, 'loss/train': 1.6505409479141235} 11/06/2021 21:46:08 - INFO - __main__ - Step 3599: {'lr': 0.0004998561869704983, 'samples': 691008, 'steps': 3598, 'loss/train': 1.5047534704208374} 11/06/2021 21:46:09 - INFO - __main__ - Step 3600: {'lr': 0.0004998560069401905, 'samples': 691200, 'steps': 3599, 'loss/train': 2.1573095321655273} 11/06/2021 21:46:10 - INFO - __main__ - Step 3601: {'lr': 0.0004998558267973013, 'samples': 691392, 'steps': 3600, 'loss/train': 2.1335763931274414} 11/06/2021 21:46:10 - INFO - __main__ - Step 3602: {'lr': 0.0004998556465418309, 'samples': 691584, 'steps': 3601, 'loss/train': 2.135831832885742} 11/06/2021 21:46:11 - INFO - __main__ - Step 3603: {'lr': 0.0004998554661737795, 'samples': 691776, 'steps': 3602, 'loss/train': 2.5998644828796387} 11/06/2021 21:46:11 - INFO - __main__ - Step 3604: {'lr': 0.000499855285693147, 'samples': 691968, 'steps': 3603, 'loss/train': 2.1776552200317383} 11/06/2021 21:46:11 - INFO - __main__ - Step 3605: {'lr': 0.0004998551050999336, 'samples': 692160, 'steps': 3604, 'loss/train': 1.8231139183044434} 11/06/2021 21:46:12 - INFO - __main__ - Step 3606: {'lr': 0.0004998549243941393, 'samples': 692352, 'steps': 3605, 'loss/train': 1.756971001625061} 11/06/2021 21:46:13 - INFO - __main__ - Step 3607: {'lr': 0.0004998547435757643, 'samples': 692544, 'steps': 3606, 'loss/train': 1.6793618202209473} 11/06/2021 21:46:13 - INFO - __main__ - Step 3608: {'lr': 0.0004998545626448087, 'samples': 692736, 'steps': 3607, 'loss/train': 2.033261299133301} 11/06/2021 21:46:13 - INFO - __main__ - Step 3609: {'lr': 0.0004998543816012723, 'samples': 692928, 'steps': 3608, 'loss/train': 2.1296401023864746} 11/06/2021 21:46:14 - INFO - __main__ - Step 3610: {'lr': 0.0004998542004451554, 'samples': 693120, 'steps': 3609, 'loss/train': 2.213966131210327} 11/06/2021 21:46:14 - INFO - __main__ - Step 3611: {'lr': 0.000499854019176458, 'samples': 693312, 'steps': 3610, 'loss/train': 2.0920517444610596} 11/06/2021 21:46:15 - INFO - __main__ - Step 3612: {'lr': 0.0004998538377951803, 'samples': 693504, 'steps': 3611, 'loss/train': 0.8657620549201965} 11/06/2021 21:46:15 - INFO - __main__ - Step 3613: {'lr': 0.0004998536563013224, 'samples': 693696, 'steps': 3612, 'loss/train': 2.072887659072876} 11/06/2021 21:46:16 - INFO - __main__ - Step 3614: {'lr': 0.0004998534746948843, 'samples': 693888, 'steps': 3613, 'loss/train': 1.9264954328536987} 11/06/2021 21:46:16 - INFO - __main__ - Step 3615: {'lr': 0.000499853292975866, 'samples': 694080, 'steps': 3614, 'loss/train': 0.808038592338562} 11/06/2021 21:46:16 - INFO - __main__ - Step 3616: {'lr': 0.0004998531111442676, 'samples': 694272, 'steps': 3615, 'loss/train': 1.6517269611358643} 11/06/2021 21:46:18 - INFO - __main__ - Step 3617: {'lr': 0.0004998529292000893, 'samples': 694464, 'steps': 3616, 'loss/train': 2.1595749855041504} 11/06/2021 21:46:18 - INFO - __main__ - Step 3618: {'lr': 0.0004998527471433312, 'samples': 694656, 'steps': 3617, 'loss/train': 1.3440548181533813} 11/06/2021 21:46:18 - INFO - __main__ - Step 3619: {'lr': 0.0004998525649739932, 'samples': 694848, 'steps': 3618, 'loss/train': 1.8935426473617554} 11/06/2021 21:46:19 - INFO - __main__ - Step 3620: {'lr': 0.0004998523826920756, 'samples': 695040, 'steps': 3619, 'loss/train': 2.0238497257232666} 11/06/2021 21:46:19 - INFO - __main__ - Step 3621: {'lr': 0.0004998522002975783, 'samples': 695232, 'steps': 3620, 'loss/train': 2.4836816787719727} 11/06/2021 21:46:20 - INFO - __main__ - Step 3622: {'lr': 0.0004998520177905015, 'samples': 695424, 'steps': 3621, 'loss/train': 1.8629294633865356} 11/06/2021 21:46:20 - INFO - __main__ - Step 3623: {'lr': 0.0004998518351708452, 'samples': 695616, 'steps': 3622, 'loss/train': 2.230546712875366} 11/06/2021 21:46:21 - INFO - __main__ - Step 3624: {'lr': 0.0004998516524386095, 'samples': 695808, 'steps': 3623, 'loss/train': 2.3679111003875732} 11/06/2021 21:46:21 - INFO - __main__ - Step 3625: {'lr': 0.0004998514695937945, 'samples': 696000, 'steps': 3624, 'loss/train': 2.3775951862335205} 11/06/2021 21:46:21 - INFO - __main__ - Step 3626: {'lr': 0.0004998512866364003, 'samples': 696192, 'steps': 3625, 'loss/train': 1.8260321617126465} 11/06/2021 21:46:23 - INFO - __main__ - Step 3627: {'lr': 0.000499851103566427, 'samples': 696384, 'steps': 3626, 'loss/train': 2.3718161582946777} 11/06/2021 21:46:24 - INFO - __main__ - Step 3628: {'lr': 0.0004998509203838746, 'samples': 696576, 'steps': 3627, 'loss/train': 2.2593507766723633} 11/06/2021 21:46:24 - INFO - __main__ - Step 3629: {'lr': 0.0004998507370887433, 'samples': 696768, 'steps': 3628, 'loss/train': 2.683199882507324} 11/06/2021 21:46:24 - INFO - __main__ - Step 3630: {'lr': 0.000499850553681033, 'samples': 696960, 'steps': 3629, 'loss/train': 3.9143762588500977} 11/06/2021 21:46:25 - INFO - __main__ - Step 3631: {'lr': 0.000499850370160744, 'samples': 697152, 'steps': 3630, 'loss/train': 5.931422233581543} 11/06/2021 21:46:25 - INFO - __main__ - Step 3632: {'lr': 0.0004998501865278762, 'samples': 697344, 'steps': 3631, 'loss/train': 1.1052885055541992} 11/06/2021 21:46:25 - INFO - __main__ - Step 3633: {'lr': 0.0004998500027824298, 'samples': 697536, 'steps': 3632, 'loss/train': 1.352131724357605} 11/06/2021 21:46:26 - INFO - __main__ - Step 3634: {'lr': 0.0004998498189244049, 'samples': 697728, 'steps': 3633, 'loss/train': 2.097627639770508} 11/06/2021 21:46:27 - INFO - __main__ - Step 3635: {'lr': 0.0004998496349538015, 'samples': 697920, 'steps': 3634, 'loss/train': 2.872110605239868} 11/06/2021 21:46:27 - INFO - __main__ - Step 3636: {'lr': 0.0004998494508706196, 'samples': 698112, 'steps': 3635, 'loss/train': 1.6961387395858765} 11/06/2021 21:46:27 - INFO - __main__ - Step 3637: {'lr': 0.0004998492666748594, 'samples': 698304, 'steps': 3636, 'loss/train': 2.1306369304656982} 11/06/2021 21:46:28 - INFO - __main__ - Step 3638: {'lr': 0.0004998490823665211, 'samples': 698496, 'steps': 3637, 'loss/train': 2.1220192909240723} 11/06/2021 21:46:29 - INFO - __main__ - Step 3639: {'lr': 0.0004998488979456046, 'samples': 698688, 'steps': 3638, 'loss/train': 2.4169700145721436} 11/06/2021 21:46:29 - INFO - __main__ - Step 3640: {'lr': 0.00049984871341211, 'samples': 698880, 'steps': 3639, 'loss/train': 2.1508548259735107} 11/06/2021 21:46:30 - INFO - __main__ - Step 3641: {'lr': 0.0004998485287660375, 'samples': 699072, 'steps': 3640, 'loss/train': 2.348625898361206} 11/06/2021 21:46:30 - INFO - __main__ - Step 3642: {'lr': 0.0004998483440073871, 'samples': 699264, 'steps': 3641, 'loss/train': 2.3207335472106934} 11/06/2021 21:46:30 - INFO - __main__ - Step 3643: {'lr': 0.0004998481591361589, 'samples': 699456, 'steps': 3642, 'loss/train': 2.584690570831299} 11/06/2021 21:46:31 - INFO - __main__ - Step 3644: {'lr': 0.000499847974152353, 'samples': 699648, 'steps': 3643, 'loss/train': 1.8976331949234009} 11/06/2021 21:46:32 - INFO - __main__ - Step 3645: {'lr': 0.0004998477890559693, 'samples': 699840, 'steps': 3644, 'loss/train': 1.7949036359786987} 11/06/2021 21:46:32 - INFO - __main__ - Step 3646: {'lr': 0.0004998476038470082, 'samples': 700032, 'steps': 3645, 'loss/train': 2.1402628421783447} 11/06/2021 21:46:32 - INFO - __main__ - Step 3647: {'lr': 0.0004998474185254696, 'samples': 700224, 'steps': 3646, 'loss/train': 2.140929937362671} 11/06/2021 21:46:33 - INFO - __main__ - Step 3648: {'lr': 0.0004998472330913535, 'samples': 700416, 'steps': 3647, 'loss/train': 1.1526223421096802} 11/06/2021 21:46:34 - INFO - __main__ - Step 3649: {'lr': 0.0004998470475446603, 'samples': 700608, 'steps': 3648, 'loss/train': 2.2785234451293945} 11/06/2021 21:46:34 - INFO - __main__ - Step 3650: {'lr': 0.0004998468618853896, 'samples': 700800, 'steps': 3649, 'loss/train': 2.1500542163848877} 11/06/2021 21:46:35 - INFO - __main__ - Step 3651: {'lr': 0.000499846676113542, 'samples': 700992, 'steps': 3650, 'loss/train': 2.041510820388794} 11/06/2021 21:46:35 - INFO - __main__ - Step 3652: {'lr': 0.0004998464902291173, 'samples': 701184, 'steps': 3651, 'loss/train': 2.313647985458374} 11/06/2021 21:46:35 - INFO - __main__ - Step 3653: {'lr': 0.0004998463042321155, 'samples': 701376, 'steps': 3652, 'loss/train': 1.6131926774978638} 11/06/2021 21:46:36 - INFO - __main__ - Step 3654: {'lr': 0.0004998461181225369, 'samples': 701568, 'steps': 3653, 'loss/train': 2.0128300189971924} 11/06/2021 21:46:37 - INFO - __main__ - Step 3655: {'lr': 0.0004998459319003815, 'samples': 701760, 'steps': 3654, 'loss/train': 2.388474464416504} 11/06/2021 21:46:37 - INFO - __main__ - Step 3656: {'lr': 0.0004998457455656493, 'samples': 701952, 'steps': 3655, 'loss/train': 1.8429226875305176} 11/06/2021 21:46:37 - INFO - __main__ - Step 3657: {'lr': 0.0004998455591183406, 'samples': 702144, 'steps': 3656, 'loss/train': 2.267188310623169} 11/06/2021 21:46:38 - INFO - __main__ - Step 3658: {'lr': 0.0004998453725584552, 'samples': 702336, 'steps': 3657, 'loss/train': 2.0592057704925537} 11/06/2021 21:46:39 - INFO - __main__ - Step 3659: {'lr': 0.0004998451858859934, 'samples': 702528, 'steps': 3658, 'loss/train': 1.9567612409591675} 11/06/2021 21:46:39 - INFO - __main__ - Step 3660: {'lr': 0.0004998449991009552, 'samples': 702720, 'steps': 3659, 'loss/train': 2.084226608276367} 11/06/2021 21:46:39 - INFO - __main__ - Step 3661: {'lr': 0.0004998448122033408, 'samples': 702912, 'steps': 3660, 'loss/train': 2.5636651515960693} 11/06/2021 21:46:40 - INFO - __main__ - Step 3662: {'lr': 0.00049984462519315, 'samples': 703104, 'steps': 3661, 'loss/train': 2.506985664367676} 11/06/2021 21:46:40 - INFO - __main__ - Step 3663: {'lr': 0.0004998444380703832, 'samples': 703296, 'steps': 3662, 'loss/train': 2.208871603012085} 11/06/2021 21:46:41 - INFO - __main__ - Step 3664: {'lr': 0.0004998442508350404, 'samples': 703488, 'steps': 3663, 'loss/train': 2.199995994567871} 11/06/2021 21:46:41 - INFO - __main__ - Step 3665: {'lr': 0.0004998440634871215, 'samples': 703680, 'steps': 3664, 'loss/train': 1.8999465703964233} 11/06/2021 21:46:42 - INFO - __main__ - Step 3666: {'lr': 0.0004998438760266267, 'samples': 703872, 'steps': 3665, 'loss/train': 2.107455253601074} 11/06/2021 21:46:42 - INFO - __main__ - Step 3667: {'lr': 0.0004998436884535562, 'samples': 704064, 'steps': 3666, 'loss/train': 1.854463815689087} 11/06/2021 21:46:42 - INFO - __main__ - Step 3668: {'lr': 0.00049984350076791, 'samples': 704256, 'steps': 3667, 'loss/train': 2.7667548656463623} 11/06/2021 21:46:44 - INFO - __main__ - Step 3669: {'lr': 0.0004998433129696882, 'samples': 704448, 'steps': 3668, 'loss/train': 1.5981824398040771} 11/06/2021 21:46:44 - INFO - __main__ - Step 3670: {'lr': 0.0004998431250588907, 'samples': 704640, 'steps': 3669, 'loss/train': 2.160209894180298} 11/06/2021 21:46:44 - INFO - __main__ - Step 3671: {'lr': 0.0004998429370355179, 'samples': 704832, 'steps': 3670, 'loss/train': 1.8680161237716675} 11/06/2021 21:46:45 - INFO - __main__ - Step 3672: {'lr': 0.0004998427488995697, 'samples': 705024, 'steps': 3671, 'loss/train': 2.518526315689087} 11/06/2021 21:46:45 - INFO - __main__ - Step 3673: {'lr': 0.0004998425606510461, 'samples': 705216, 'steps': 3672, 'loss/train': 2.714362621307373} 11/06/2021 21:46:45 - INFO - __main__ - Step 3674: {'lr': 0.0004998423722899475, 'samples': 705408, 'steps': 3673, 'loss/train': 1.8246042728424072} 11/06/2021 21:46:46 - INFO - __main__ - Step 3675: {'lr': 0.0004998421838162735, 'samples': 705600, 'steps': 3674, 'loss/train': 0.9275121688842773} 11/06/2021 21:46:47 - INFO - __main__ - Step 3676: {'lr': 0.0004998419952300247, 'samples': 705792, 'steps': 3675, 'loss/train': 2.0111794471740723} 11/06/2021 21:46:47 - INFO - __main__ - Step 3677: {'lr': 0.0004998418065312009, 'samples': 705984, 'steps': 3676, 'loss/train': 1.89971923828125} 11/06/2021 21:46:47 - INFO - __main__ - Step 3678: {'lr': 0.0004998416177198022, 'samples': 706176, 'steps': 3677, 'loss/train': 1.930174708366394} 11/06/2021 21:46:48 - INFO - __main__ - Step 3679: {'lr': 0.0004998414287958288, 'samples': 706368, 'steps': 3678, 'loss/train': 2.1186258792877197} 11/06/2021 21:46:49 - INFO - __main__ - Step 3680: {'lr': 0.0004998412397592807, 'samples': 706560, 'steps': 3679, 'loss/train': 2.1540133953094482} 11/06/2021 21:46:49 - INFO - __main__ - Step 3681: {'lr': 0.0004998410506101579, 'samples': 706752, 'steps': 3680, 'loss/train': 1.5001546144485474} 11/06/2021 21:46:50 - INFO - __main__ - Step 3682: {'lr': 0.0004998408613484605, 'samples': 706944, 'steps': 3681, 'loss/train': 1.7224041223526} 11/06/2021 21:46:50 - INFO - __main__ - Step 3683: {'lr': 0.0004998406719741888, 'samples': 707136, 'steps': 3682, 'loss/train': 1.118072509765625} 11/06/2021 21:46:51 - INFO - __main__ - Step 3684: {'lr': 0.0004998404824873428, 'samples': 707328, 'steps': 3683, 'loss/train': 1.8773263692855835} 11/06/2021 21:46:51 - INFO - __main__ - Step 3685: {'lr': 0.0004998402928879225, 'samples': 707520, 'steps': 3684, 'loss/train': 1.9647787809371948} 11/06/2021 21:46:52 - INFO - __main__ - Step 3686: {'lr': 0.000499840103175928, 'samples': 707712, 'steps': 3685, 'loss/train': 2.2132978439331055} 11/06/2021 21:46:52 - INFO - __main__ - Step 3687: {'lr': 0.0004998399133513594, 'samples': 707904, 'steps': 3686, 'loss/train': 2.232158899307251} 11/06/2021 21:46:53 - INFO - __main__ - Step 3688: {'lr': 0.0004998397234142167, 'samples': 708096, 'steps': 3687, 'loss/train': 2.528634786605835} 11/06/2021 21:46:53 - INFO - __main__ - Step 3689: {'lr': 0.0004998395333645002, 'samples': 708288, 'steps': 3688, 'loss/train': 2.5124094486236572} 11/06/2021 21:46:54 - INFO - __main__ - Step 3690: {'lr': 0.0004998393432022098, 'samples': 708480, 'steps': 3689, 'loss/train': 2.4985034465789795} 11/06/2021 21:46:54 - INFO - __main__ - Step 3691: {'lr': 0.0004998391529273457, 'samples': 708672, 'steps': 3690, 'loss/train': 2.0140085220336914} 11/06/2021 21:46:55 - INFO - __main__ - Step 3692: {'lr': 0.0004998389625399079, 'samples': 708864, 'steps': 3691, 'loss/train': 1.7472585439682007} 11/06/2021 21:46:55 - INFO - __main__ - Step 3693: {'lr': 0.0004998387720398965, 'samples': 709056, 'steps': 3692, 'loss/train': 2.138169527053833} 11/06/2021 21:46:55 - INFO - __main__ - Step 3694: {'lr': 0.0004998385814273116, 'samples': 709248, 'steps': 3693, 'loss/train': 1.8398798704147339} 11/06/2021 21:46:56 - INFO - __main__ - Step 3695: {'lr': 0.0004998383907021533, 'samples': 709440, 'steps': 3694, 'loss/train': 2.2453384399414062} 11/06/2021 21:46:57 - INFO - __main__ - Step 3696: {'lr': 0.0004998381998644217, 'samples': 709632, 'steps': 3695, 'loss/train': 2.210822105407715} 11/06/2021 21:46:57 - INFO - __main__ - Step 3697: {'lr': 0.0004998380089141169, 'samples': 709824, 'steps': 3696, 'loss/train': 1.83198082447052} 11/06/2021 21:46:57 - INFO - __main__ - Step 3698: {'lr': 0.0004998378178512388, 'samples': 710016, 'steps': 3697, 'loss/train': 2.363736391067505} 11/06/2021 21:46:58 - INFO - __main__ - Step 3699: {'lr': 0.0004998376266757878, 'samples': 710208, 'steps': 3698, 'loss/train': 1.7178457975387573} 11/06/2021 21:46:58 - INFO - __main__ - Step 3700: {'lr': 0.0004998374353877638, 'samples': 710400, 'steps': 3699, 'loss/train': 1.9114118814468384} 11/06/2021 21:46:59 - INFO - __main__ - Step 3701: {'lr': 0.0004998372439871668, 'samples': 710592, 'steps': 3700, 'loss/train': 2.2929277420043945} 11/06/2021 21:46:59 - INFO - __main__ - Step 3702: {'lr': 0.000499837052473997, 'samples': 710784, 'steps': 3701, 'loss/train': 2.206447124481201} 11/06/2021 21:47:00 - INFO - __main__ - Step 3703: {'lr': 0.0004998368608482546, 'samples': 710976, 'steps': 3702, 'loss/train': 2.2719476222991943} 11/06/2021 21:47:00 - INFO - __main__ - Step 3704: {'lr': 0.0004998366691099395, 'samples': 711168, 'steps': 3703, 'loss/train': 1.9696950912475586} 11/06/2021 21:47:01 - INFO - __main__ - Step 3705: {'lr': 0.0004998364772590518, 'samples': 711360, 'steps': 3704, 'loss/train': 1.9279074668884277} 11/06/2021 21:47:02 - INFO - __main__ - Step 3706: {'lr': 0.0004998362852955918, 'samples': 711552, 'steps': 3705, 'loss/train': 2.3700718879699707} 11/06/2021 21:47:02 - INFO - __main__ - Step 3707: {'lr': 0.0004998360932195593, 'samples': 711744, 'steps': 3706, 'loss/train': 2.2039358615875244} 11/06/2021 21:47:02 - INFO - __main__ - Step 3708: {'lr': 0.0004998359010309544, 'samples': 711936, 'steps': 3707, 'loss/train': 2.483283758163452} 11/06/2021 21:47:03 - INFO - __main__ - Step 3709: {'lr': 0.0004998357087297775, 'samples': 712128, 'steps': 3708, 'loss/train': 1.5728569030761719} 11/06/2021 21:47:03 - INFO - __main__ - Step 3710: {'lr': 0.0004998355163160285, 'samples': 712320, 'steps': 3709, 'loss/train': 1.6009992361068726} 11/06/2021 21:47:04 - INFO - __main__ - Step 3711: {'lr': 0.0004998353237897073, 'samples': 712512, 'steps': 3710, 'loss/train': 2.459540605545044} 11/06/2021 21:47:04 - INFO - __main__ - Step 3712: {'lr': 0.0004998351311508143, 'samples': 712704, 'steps': 3711, 'loss/train': 1.7308366298675537} 11/06/2021 21:47:05 - INFO - __main__ - Step 3713: {'lr': 0.0004998349383993493, 'samples': 712896, 'steps': 3712, 'loss/train': 2.191849708557129} 11/06/2021 21:47:05 - INFO - __main__ - Step 3714: {'lr': 0.0004998347455353126, 'samples': 713088, 'steps': 3713, 'loss/train': 2.918881893157959} 11/06/2021 21:47:05 - INFO - __main__ - Step 3715: {'lr': 0.0004998345525587042, 'samples': 713280, 'steps': 3714, 'loss/train': 2.113539695739746} 11/06/2021 21:47:06 - INFO - __main__ - Step 3716: {'lr': 0.0004998343594695242, 'samples': 713472, 'steps': 3715, 'loss/train': 1.7858660221099854} 11/06/2021 21:47:07 - INFO - __main__ - Step 3717: {'lr': 0.0004998341662677728, 'samples': 713664, 'steps': 3716, 'loss/train': 1.8981730937957764} 11/06/2021 21:47:07 - INFO - __main__ - Step 3718: {'lr': 0.0004998339729534499, 'samples': 713856, 'steps': 3717, 'loss/train': 2.5536508560180664} 11/06/2021 21:47:07 - INFO - __main__ - Step 3719: {'lr': 0.0004998337795265557, 'samples': 714048, 'steps': 3718, 'loss/train': 2.2563905715942383} 11/06/2021 21:47:08 - INFO - __main__ - Step 3720: {'lr': 0.0004998335859870903, 'samples': 714240, 'steps': 3719, 'loss/train': 1.7998706102371216} 11/06/2021 21:47:09 - INFO - __main__ - Step 3721: {'lr': 0.0004998333923350536, 'samples': 714432, 'steps': 3720, 'loss/train': 2.0226428508758545} 11/06/2021 21:47:09 - INFO - __main__ - Step 3722: {'lr': 0.000499833198570446, 'samples': 714624, 'steps': 3721, 'loss/train': 1.3011538982391357} 11/06/2021 21:47:10 - INFO - __main__ - Step 3723: {'lr': 0.0004998330046932672, 'samples': 714816, 'steps': 3722, 'loss/train': 2.5336453914642334} 11/06/2021 21:47:10 - INFO - __main__ - Step 3724: {'lr': 0.0004998328107035176, 'samples': 715008, 'steps': 3723, 'loss/train': 2.392589807510376} 11/06/2021 21:47:10 - INFO - __main__ - Step 3725: {'lr': 0.0004998326166011973, 'samples': 715200, 'steps': 3724, 'loss/train': 2.1815335750579834} 11/06/2021 21:47:11 - INFO - __main__ - Step 3726: {'lr': 0.0004998324223863061, 'samples': 715392, 'steps': 3725, 'loss/train': 6.898614406585693} 11/06/2021 21:47:12 - INFO - __main__ - Step 3727: {'lr': 0.0004998322280588445, 'samples': 715584, 'steps': 3726, 'loss/train': 1.5503253936767578} 11/06/2021 21:47:12 - INFO - __main__ - Step 3728: {'lr': 0.0004998320336188121, 'samples': 715776, 'steps': 3727, 'loss/train': 2.2275030612945557} 11/06/2021 21:47:12 - INFO - __main__ - Step 3729: {'lr': 0.0004998318390662095, 'samples': 715968, 'steps': 3728, 'loss/train': 1.9741984605789185} 11/06/2021 21:47:13 - INFO - __main__ - Step 3730: {'lr': 0.0004998316444010363, 'samples': 716160, 'steps': 3729, 'loss/train': 1.3006471395492554} 11/06/2021 21:47:13 - INFO - __main__ - Step 3731: {'lr': 0.0004998314496232929, 'samples': 716352, 'steps': 3730, 'loss/train': 2.1504995822906494} 11/06/2021 21:47:14 - INFO - __main__ - Step 3732: {'lr': 0.0004998312547329793, 'samples': 716544, 'steps': 3731, 'loss/train': 2.3918561935424805} 11/06/2021 21:47:14 - INFO - __main__ - Step 3733: {'lr': 0.0004998310597300956, 'samples': 716736, 'steps': 3732, 'loss/train': 2.505441665649414} 11/06/2021 21:47:15 - INFO - __main__ - Step 3734: {'lr': 0.0004998308646146419, 'samples': 716928, 'steps': 3733, 'loss/train': 1.6491429805755615} 11/06/2021 21:47:15 - INFO - __main__ - Step 3735: {'lr': 0.0004998306693866181, 'samples': 717120, 'steps': 3734, 'loss/train': 2.3682384490966797} 11/06/2021 21:47:15 - INFO - __main__ - Step 3736: {'lr': 0.0004998304740460247, 'samples': 717312, 'steps': 3735, 'loss/train': 1.748855471611023} 11/06/2021 21:47:16 - INFO - __main__ - Step 3737: {'lr': 0.0004998302785928614, 'samples': 717504, 'steps': 3736, 'loss/train': 3.4032115936279297} 11/06/2021 21:47:17 - INFO - __main__ - Step 3738: {'lr': 0.0004998300830271285, 'samples': 717696, 'steps': 3737, 'loss/train': 2.2181355953216553} 11/06/2021 21:47:17 - INFO - __main__ - Step 3739: {'lr': 0.000499829887348826, 'samples': 717888, 'steps': 3738, 'loss/train': 2.0063092708587646} 11/06/2021 21:47:17 - INFO - __main__ - Step 3740: {'lr': 0.0004998296915579539, 'samples': 718080, 'steps': 3739, 'loss/train': 2.2116055488586426} 11/06/2021 21:47:18 - INFO - __main__ - Step 3741: {'lr': 0.0004998294956545125, 'samples': 718272, 'steps': 3740, 'loss/train': 2.458434820175171} 11/06/2021 21:47:19 - INFO - __main__ - Step 3742: {'lr': 0.0004998292996385019, 'samples': 718464, 'steps': 3741, 'loss/train': 2.001145839691162} 11/06/2021 21:47:19 - INFO - __main__ - Step 3743: {'lr': 0.0004998291035099219, 'samples': 718656, 'steps': 3742, 'loss/train': 1.889823079109192} 11/06/2021 21:47:20 - INFO - __main__ - Step 3744: {'lr': 0.0004998289072687728, 'samples': 718848, 'steps': 3743, 'loss/train': 2.2206709384918213} 11/06/2021 21:47:20 - INFO - __main__ - Step 3745: {'lr': 0.0004998287109150547, 'samples': 719040, 'steps': 3744, 'loss/train': 1.8798199892044067} 11/06/2021 21:47:20 - INFO - __main__ - Step 3746: {'lr': 0.0004998285144487676, 'samples': 719232, 'steps': 3745, 'loss/train': 2.057377576828003} 11/06/2021 21:47:21 - INFO - __main__ - Step 3747: {'lr': 0.0004998283178699116, 'samples': 719424, 'steps': 3746, 'loss/train': 1.8531581163406372} 11/06/2021 21:47:22 - INFO - __main__ - Step 3748: {'lr': 0.0004998281211784869, 'samples': 719616, 'steps': 3747, 'loss/train': 2.3435657024383545} 11/06/2021 21:47:22 - INFO - __main__ - Step 3749: {'lr': 0.0004998279243744934, 'samples': 719808, 'steps': 3748, 'loss/train': 2.2125914096832275} 11/06/2021 21:47:22 - INFO - __main__ - Step 3750: {'lr': 0.0004998277274579313, 'samples': 720000, 'steps': 3749, 'loss/train': 2.172743558883667} 11/06/2021 21:47:23 - INFO - __main__ - Step 3751: {'lr': 0.0004998275304288007, 'samples': 720192, 'steps': 3750, 'loss/train': 1.698510766029358} 11/06/2021 21:47:24 - INFO - __main__ - Step 3752: {'lr': 0.0004998273332871017, 'samples': 720384, 'steps': 3751, 'loss/train': 2.1584572792053223} 11/06/2021 21:47:24 - INFO - __main__ - Step 3753: {'lr': 0.0004998271360328344, 'samples': 720576, 'steps': 3752, 'loss/train': 1.8731579780578613} 11/06/2021 21:47:24 - INFO - __main__ - Step 3754: {'lr': 0.0004998269386659988, 'samples': 720768, 'steps': 3753, 'loss/train': 2.281588077545166} 11/06/2021 21:47:25 - INFO - __main__ - Step 3755: {'lr': 0.000499826741186595, 'samples': 720960, 'steps': 3754, 'loss/train': 2.172957420349121} 11/06/2021 21:47:25 - INFO - __main__ - Step 3756: {'lr': 0.0004998265435946232, 'samples': 721152, 'steps': 3755, 'loss/train': 2.0597519874572754} 11/06/2021 21:47:26 - INFO - __main__ - Step 3757: {'lr': 0.0004998263458900833, 'samples': 721344, 'steps': 3756, 'loss/train': 1.874323844909668} 11/06/2021 21:47:26 - INFO - __main__ - Step 3758: {'lr': 0.0004998261480729755, 'samples': 721536, 'steps': 3757, 'loss/train': 2.4779069423675537} 11/06/2021 21:47:27 - INFO - __main__ - Step 3759: {'lr': 0.0004998259501433, 'samples': 721728, 'steps': 3758, 'loss/train': 2.595695734024048} 11/06/2021 21:47:27 - INFO - __main__ - Step 3760: {'lr': 0.0004998257521010567, 'samples': 721920, 'steps': 3759, 'loss/train': 2.313114643096924} 11/06/2021 21:47:27 - INFO - __main__ - Step 3761: {'lr': 0.0004998255539462459, 'samples': 722112, 'steps': 3760, 'loss/train': 1.7417014837265015} 11/06/2021 21:47:28 - INFO - __main__ - Step 3762: {'lr': 0.0004998253556788675, 'samples': 722304, 'steps': 3761, 'loss/train': 2.088002920150757} 11/06/2021 21:47:29 - INFO - __main__ - Step 3763: {'lr': 0.0004998251572989217, 'samples': 722496, 'steps': 3762, 'loss/train': 2.2535784244537354} 11/06/2021 21:47:29 - INFO - __main__ - Step 3764: {'lr': 0.0004998249588064085, 'samples': 722688, 'steps': 3763, 'loss/train': 1.551037073135376} 11/06/2021 21:47:30 - INFO - __main__ - Step 3765: {'lr': 0.0004998247602013278, 'samples': 722880, 'steps': 3764, 'loss/train': 2.5815794467926025} 11/06/2021 21:47:30 - INFO - __main__ - Step 3766: {'lr': 0.0004998245614836802, 'samples': 723072, 'steps': 3765, 'loss/train': 2.278775930404663} 11/06/2021 21:47:30 - INFO - __main__ - Step 3767: {'lr': 0.0004998243626534655, 'samples': 723264, 'steps': 3766, 'loss/train': 2.4035115242004395} 11/06/2021 21:47:31 - INFO - __main__ - Step 3768: {'lr': 0.0004998241637106836, 'samples': 723456, 'steps': 3767, 'loss/train': 2.4984121322631836} 11/06/2021 21:47:32 - INFO - __main__ - Step 3769: {'lr': 0.0004998239646553349, 'samples': 723648, 'steps': 3768, 'loss/train': 1.8377968072891235} 11/06/2021 21:47:32 - INFO - __main__ - Step 3770: {'lr': 0.0004998237654874195, 'samples': 723840, 'steps': 3769, 'loss/train': 2.3453054428100586} 11/06/2021 21:47:32 - INFO - __main__ - Step 3771: {'lr': 0.0004998235662069372, 'samples': 724032, 'steps': 3770, 'loss/train': 1.7649283409118652} 11/06/2021 21:47:33 - INFO - __main__ - Step 3772: {'lr': 0.0004998233668138883, 'samples': 724224, 'steps': 3771, 'loss/train': 1.8481817245483398} 11/06/2021 21:47:34 - INFO - __main__ - Step 3773: {'lr': 0.0004998231673082729, 'samples': 724416, 'steps': 3772, 'loss/train': 2.0594215393066406} 11/06/2021 21:47:34 - INFO - __main__ - Step 3774: {'lr': 0.000499822967690091, 'samples': 724608, 'steps': 3773, 'loss/train': 2.3945000171661377} 11/06/2021 21:47:35 - INFO - __main__ - Step 3775: {'lr': 0.0004998227679593426, 'samples': 724800, 'steps': 3774, 'loss/train': 1.9772299528121948} 11/06/2021 21:47:35 - INFO - __main__ - Step 3776: {'lr': 0.0004998225681160281, 'samples': 724992, 'steps': 3775, 'loss/train': 1.82159423828125} 11/06/2021 21:47:35 - INFO - __main__ - Step 3777: {'lr': 0.0004998223681601474, 'samples': 725184, 'steps': 3776, 'loss/train': 2.3857104778289795} 11/06/2021 21:47:36 - INFO - __main__ - Step 3778: {'lr': 0.0004998221680917004, 'samples': 725376, 'steps': 3777, 'loss/train': 2.202800989151001} 11/06/2021 21:47:37 - INFO - __main__ - Step 3779: {'lr': 0.0004998219679106876, 'samples': 725568, 'steps': 3778, 'loss/train': 2.383378028869629} 11/06/2021 21:47:37 - INFO - __main__ - Step 3780: {'lr': 0.0004998217676171088, 'samples': 725760, 'steps': 3779, 'loss/train': 2.38877010345459} 11/06/2021 21:47:37 - INFO - __main__ - Step 3781: {'lr': 0.0004998215672109641, 'samples': 725952, 'steps': 3780, 'loss/train': 2.0370190143585205} 11/06/2021 21:47:38 - INFO - __main__ - Step 3782: {'lr': 0.0004998213666922537, 'samples': 726144, 'steps': 3781, 'loss/train': 1.9329248666763306} 11/06/2021 21:47:39 - INFO - __main__ - Step 3783: {'lr': 0.0004998211660609777, 'samples': 726336, 'steps': 3782, 'loss/train': 3.1192450523376465} 11/06/2021 21:47:39 - INFO - __main__ - Step 3784: {'lr': 0.0004998209653171361, 'samples': 726528, 'steps': 3783, 'loss/train': 0.4665497839450836} 11/06/2021 21:47:39 - INFO - __main__ - Step 3785: {'lr': 0.0004998207644607291, 'samples': 726720, 'steps': 3784, 'loss/train': 2.076326370239258} 11/06/2021 21:47:40 - INFO - __main__ - Step 3786: {'lr': 0.0004998205634917566, 'samples': 726912, 'steps': 3785, 'loss/train': 2.1383769512176514} 11/06/2021 21:47:40 - INFO - __main__ - Step 3787: {'lr': 0.0004998203624102188, 'samples': 727104, 'steps': 3786, 'loss/train': 2.3829987049102783} 11/06/2021 21:47:41 - INFO - __main__ - Step 3788: {'lr': 0.0004998201612161159, 'samples': 727296, 'steps': 3787, 'loss/train': 1.7519925832748413} 11/06/2021 21:47:42 - INFO - __main__ - Step 3789: {'lr': 0.0004998199599094478, 'samples': 727488, 'steps': 3788, 'loss/train': 1.9867311716079712} 11/06/2021 21:47:42 - INFO - __main__ - Step 3790: {'lr': 0.0004998197584902147, 'samples': 727680, 'steps': 3789, 'loss/train': 1.7998645305633545} 11/06/2021 21:47:42 - INFO - __main__ - Step 3791: {'lr': 0.0004998195569584168, 'samples': 727872, 'steps': 3790, 'loss/train': 2.279456377029419} 11/06/2021 21:47:43 - INFO - __main__ - Step 3792: {'lr': 0.0004998193553140539, 'samples': 728064, 'steps': 3791, 'loss/train': 1.8781626224517822} 11/06/2021 21:47:44 - INFO - __main__ - Step 3793: {'lr': 0.0004998191535571264, 'samples': 728256, 'steps': 3792, 'loss/train': 2.181241750717163} 11/06/2021 21:47:44 - INFO - __main__ - Step 3794: {'lr': 0.0004998189516876342, 'samples': 728448, 'steps': 3793, 'loss/train': 2.345247983932495} 11/06/2021 21:47:44 - INFO - __main__ - Step 3795: {'lr': 0.0004998187497055773, 'samples': 728640, 'steps': 3794, 'loss/train': 0.6545143723487854} 11/06/2021 21:47:45 - INFO - __main__ - Step 3796: {'lr': 0.000499818547610956, 'samples': 728832, 'steps': 3795, 'loss/train': 1.8099079132080078} 11/06/2021 21:47:45 - INFO - __main__ - Step 3797: {'lr': 0.0004998183454037703, 'samples': 729024, 'steps': 3796, 'loss/train': 1.515761137008667} 11/06/2021 21:47:45 - INFO - __main__ - Step 3798: {'lr': 0.0004998181430840204, 'samples': 729216, 'steps': 3797, 'loss/train': 1.9123553037643433} 11/06/2021 21:47:46 - INFO - __main__ - Step 3799: {'lr': 0.0004998179406517063, 'samples': 729408, 'steps': 3798, 'loss/train': 1.983812689781189} 11/06/2021 21:47:47 - INFO - __main__ - Step 3800: {'lr': 0.000499817738106828, 'samples': 729600, 'steps': 3799, 'loss/train': 5.782630920410156} 11/06/2021 21:47:47 - INFO - __main__ - Step 3801: {'lr': 0.0004998175354493857, 'samples': 729792, 'steps': 3800, 'loss/train': 1.7894034385681152} 11/06/2021 21:47:47 - INFO - __main__ - Step 3802: {'lr': 0.0004998173326793795, 'samples': 729984, 'steps': 3801, 'loss/train': 1.8084430694580078} 11/06/2021 21:47:48 - INFO - __main__ - Step 3803: {'lr': 0.0004998171297968095, 'samples': 730176, 'steps': 3802, 'loss/train': 2.3628175258636475} 11/06/2021 21:47:49 - INFO - __main__ - Step 3804: {'lr': 0.0004998169268016757, 'samples': 730368, 'steps': 3803, 'loss/train': 2.064181089401245} 11/06/2021 21:47:49 - INFO - __main__ - Step 3805: {'lr': 0.0004998167236939783, 'samples': 730560, 'steps': 3804, 'loss/train': 2.378023147583008} 11/06/2021 21:47:50 - INFO - __main__ - Step 3806: {'lr': 0.0004998165204737173, 'samples': 730752, 'steps': 3805, 'loss/train': 1.9755092859268188} 11/06/2021 21:47:50 - INFO - __main__ - Step 3807: {'lr': 0.0004998163171408928, 'samples': 730944, 'steps': 3806, 'loss/train': 2.0892038345336914} 11/06/2021 21:47:50 - INFO - __main__ - Step 3808: {'lr': 0.000499816113695505, 'samples': 731136, 'steps': 3807, 'loss/train': 2.038058042526245} 11/06/2021 21:47:52 - INFO - __main__ - Step 3809: {'lr': 0.0004998159101375538, 'samples': 731328, 'steps': 3808, 'loss/train': 2.4071855545043945} 11/06/2021 21:47:52 - INFO - __main__ - Step 3810: {'lr': 0.0004998157064670395, 'samples': 731520, 'steps': 3809, 'loss/train': 1.957269549369812} 11/06/2021 21:47:52 - INFO - __main__ - Step 3811: {'lr': 0.0004998155026839621, 'samples': 731712, 'steps': 3810, 'loss/train': 1.6153275966644287} 11/06/2021 21:47:53 - INFO - __main__ - Step 3812: {'lr': 0.0004998152987883217, 'samples': 731904, 'steps': 3811, 'loss/train': 1.7951699495315552} 11/06/2021 21:47:53 - INFO - __main__ - Step 3813: {'lr': 0.0004998150947801182, 'samples': 732096, 'steps': 3812, 'loss/train': 1.9503369331359863} 11/06/2021 21:47:54 - INFO - __main__ - Step 3814: {'lr': 0.000499814890659352, 'samples': 732288, 'steps': 3813, 'loss/train': 1.8731478452682495} 11/06/2021 21:47:55 - INFO - __main__ - Step 3815: {'lr': 0.0004998146864260231, 'samples': 732480, 'steps': 3814, 'loss/train': 2.178349733352661} 11/06/2021 21:47:55 - INFO - __main__ - Step 3816: {'lr': 0.0004998144820801316, 'samples': 732672, 'steps': 3815, 'loss/train': 2.257465124130249} 11/06/2021 21:47:55 - INFO - __main__ - Step 3817: {'lr': 0.0004998142776216775, 'samples': 732864, 'steps': 3816, 'loss/train': 2.292968988418579} 11/06/2021 21:47:56 - INFO - __main__ - Step 3818: {'lr': 0.0004998140730506609, 'samples': 733056, 'steps': 3817, 'loss/train': 1.7161427736282349} 11/06/2021 21:47:56 - INFO - __main__ - Step 3819: {'lr': 0.000499813868367082, 'samples': 733248, 'steps': 3818, 'loss/train': 1.899258017539978} 11/06/2021 21:47:57 - INFO - __main__ - Step 3820: {'lr': 0.0004998136635709408, 'samples': 733440, 'steps': 3819, 'loss/train': 2.3737716674804688} 11/06/2021 21:47:57 - INFO - __main__ - Step 3821: {'lr': 0.0004998134586622374, 'samples': 733632, 'steps': 3820, 'loss/train': 2.9889981746673584} 11/06/2021 21:47:58 - INFO - __main__ - Step 3822: {'lr': 0.0004998132536409718, 'samples': 733824, 'steps': 3821, 'loss/train': 1.6760238409042358} 11/06/2021 21:47:58 - INFO - __main__ - Step 3823: {'lr': 0.0004998130485071444, 'samples': 734016, 'steps': 3822, 'loss/train': 2.2223479747772217} 11/06/2021 21:47:58 - INFO - __main__ - Step 3824: {'lr': 0.000499812843260755, 'samples': 734208, 'steps': 3823, 'loss/train': 2.103163480758667} 11/06/2021 21:47:59 - INFO - __main__ - Step 3825: {'lr': 0.0004998126379018038, 'samples': 734400, 'steps': 3824, 'loss/train': 1.4351319074630737} 11/06/2021 21:48:00 - INFO - __main__ - Step 3826: {'lr': 0.000499812432430291, 'samples': 734592, 'steps': 3825, 'loss/train': 2.600419282913208} 11/06/2021 21:48:00 - INFO - __main__ - Step 3827: {'lr': 0.0004998122268462164, 'samples': 734784, 'steps': 3826, 'loss/train': 1.890721082687378} 11/06/2021 21:48:00 - INFO - __main__ - Step 3828: {'lr': 0.0004998120211495803, 'samples': 734976, 'steps': 3827, 'loss/train': 1.8643083572387695} 11/06/2021 21:48:01 - INFO - __main__ - Step 3829: {'lr': 0.0004998118153403827, 'samples': 735168, 'steps': 3828, 'loss/train': 2.375256299972534} 11/06/2021 21:48:02 - INFO - __main__ - Step 3830: {'lr': 0.0004998116094186239, 'samples': 735360, 'steps': 3829, 'loss/train': 1.937308430671692} 11/06/2021 21:48:02 - INFO - __main__ - Step 3831: {'lr': 0.0004998114033843038, 'samples': 735552, 'steps': 3830, 'loss/train': 2.2312798500061035} 11/06/2021 21:48:03 - INFO - __main__ - Step 3832: {'lr': 0.0004998111972374225, 'samples': 735744, 'steps': 3831, 'loss/train': 2.345470905303955} 11/06/2021 21:48:03 - INFO - __main__ - Step 3833: {'lr': 0.0004998109909779801, 'samples': 735936, 'steps': 3832, 'loss/train': 2.126286745071411} 11/06/2021 21:48:03 - INFO - __main__ - Step 3834: {'lr': 0.0004998107846059768, 'samples': 736128, 'steps': 3833, 'loss/train': 2.274366855621338} 11/06/2021 21:48:04 - INFO - __main__ - Step 3835: {'lr': 0.0004998105781214126, 'samples': 736320, 'steps': 3834, 'loss/train': 2.0822598934173584} 11/06/2021 21:48:05 - INFO - __main__ - Step 3836: {'lr': 0.0004998103715242875, 'samples': 736512, 'steps': 3835, 'loss/train': 2.376988649368286} 11/06/2021 21:48:05 - INFO - __main__ - Step 3837: {'lr': 0.0004998101648146018, 'samples': 736704, 'steps': 3836, 'loss/train': 2.1825759410858154} 11/06/2021 21:48:05 - INFO - __main__ - Step 3838: {'lr': 0.0004998099579923555, 'samples': 736896, 'steps': 3837, 'loss/train': 2.2881062030792236} 11/06/2021 21:48:06 - INFO - __main__ - Step 3839: {'lr': 0.0004998097510575487, 'samples': 737088, 'steps': 3838, 'loss/train': 2.04844069480896} 11/06/2021 21:48:06 - INFO - __main__ - Step 3840: {'lr': 0.0004998095440101815, 'samples': 737280, 'steps': 3839, 'loss/train': 2.203437089920044} 11/06/2021 21:48:07 - INFO - __main__ - Step 3841: {'lr': 0.0004998093368502539, 'samples': 737472, 'steps': 3840, 'loss/train': 2.4474103450775146} 11/06/2021 21:48:07 - INFO - __main__ - Step 3842: {'lr': 0.000499809129577766, 'samples': 737664, 'steps': 3841, 'loss/train': 1.3475080728530884} 11/06/2021 21:48:08 - INFO - __main__ - Step 3843: {'lr': 0.0004998089221927182, 'samples': 737856, 'steps': 3842, 'loss/train': 6.6313982009887695} 11/06/2021 21:48:08 - INFO - __main__ - Step 3844: {'lr': 0.0004998087146951101, 'samples': 738048, 'steps': 3843, 'loss/train': 0.9520333409309387} 11/06/2021 21:48:09 - INFO - __main__ - Step 3845: {'lr': 0.0004998085070849422, 'samples': 738240, 'steps': 3844, 'loss/train': 2.355161428451538} 11/06/2021 21:48:09 - INFO - __main__ - Step 3846: {'lr': 0.0004998082993622144, 'samples': 738432, 'steps': 3845, 'loss/train': 1.7078429460525513} 11/06/2021 21:48:10 - INFO - __main__ - Step 3847: {'lr': 0.0004998080915269268, 'samples': 738624, 'steps': 3846, 'loss/train': 1.6196483373641968} 11/06/2021 21:48:10 - INFO - __main__ - Step 3848: {'lr': 0.0004998078835790796, 'samples': 738816, 'steps': 3847, 'loss/train': 1.4258253574371338} 11/06/2021 21:48:11 - INFO - __main__ - Step 3849: {'lr': 0.0004998076755186727, 'samples': 739008, 'steps': 3848, 'loss/train': 1.7136802673339844} 11/06/2021 21:48:11 - INFO - __main__ - Step 3850: {'lr': 0.0004998074673457064, 'samples': 739200, 'steps': 3849, 'loss/train': 2.093299627304077} 11/06/2021 21:48:12 - INFO - __main__ - Step 3851: {'lr': 0.0004998072590601808, 'samples': 739392, 'steps': 3850, 'loss/train': 1.7674520015716553} 11/06/2021 21:48:12 - INFO - __main__ - Step 3852: {'lr': 0.0004998070506620957, 'samples': 739584, 'steps': 3851, 'loss/train': 2.6275033950805664} 11/06/2021 21:48:13 - INFO - __main__ - Step 3853: {'lr': 0.0004998068421514515, 'samples': 739776, 'steps': 3852, 'loss/train': 1.4663965702056885} 11/06/2021 21:48:13 - INFO - __main__ - Step 3854: {'lr': 0.0004998066335282483, 'samples': 739968, 'steps': 3853, 'loss/train': 1.9457613229751587} 11/06/2021 21:48:13 - INFO - __main__ - Step 3855: {'lr': 0.0004998064247924859, 'samples': 740160, 'steps': 3854, 'loss/train': 2.18340802192688} 11/06/2021 21:48:14 - INFO - __main__ - Step 3856: {'lr': 0.0004998062159441648, 'samples': 740352, 'steps': 3855, 'loss/train': 2.0008771419525146} 11/06/2021 21:48:15 - INFO - __main__ - Step 3857: {'lr': 0.0004998060069832846, 'samples': 740544, 'steps': 3856, 'loss/train': 1.8760361671447754} 11/06/2021 21:48:15 - INFO - __main__ - Step 3858: {'lr': 0.0004998057979098459, 'samples': 740736, 'steps': 3857, 'loss/train': 2.1014292240142822} 11/06/2021 21:48:15 - INFO - __main__ - Step 3859: {'lr': 0.0004998055887238485, 'samples': 740928, 'steps': 3858, 'loss/train': 1.7220332622528076} 11/06/2021 21:48:16 - INFO - __main__ - Step 3860: {'lr': 0.0004998053794252925, 'samples': 741120, 'steps': 3859, 'loss/train': 1.1343226432800293} 11/06/2021 21:48:16 - INFO - __main__ - Step 3861: {'lr': 0.0004998051700141781, 'samples': 741312, 'steps': 3860, 'loss/train': 1.9412497282028198} 11/06/2021 21:48:17 - INFO - __main__ - Step 3862: {'lr': 0.0004998049604905052, 'samples': 741504, 'steps': 3861, 'loss/train': 2.0013692378997803} 11/06/2021 21:48:17 - INFO - __main__ - Step 3863: {'lr': 0.0004998047508542742, 'samples': 741696, 'steps': 3862, 'loss/train': 2.296785593032837} 11/06/2021 21:48:18 - INFO - __main__ - Step 3864: {'lr': 0.000499804541105485, 'samples': 741888, 'steps': 3863, 'loss/train': 2.1500766277313232} 11/06/2021 21:48:18 - INFO - __main__ - Step 3865: {'lr': 0.0004998043312441378, 'samples': 742080, 'steps': 3864, 'loss/train': 2.002265214920044} 11/06/2021 21:48:18 - INFO - __main__ - Step 3866: {'lr': 0.0004998041212702325, 'samples': 742272, 'steps': 3865, 'loss/train': 1.9484808444976807} 11/06/2021 21:48:20 - INFO - __main__ - Step 3867: {'lr': 0.0004998039111837694, 'samples': 742464, 'steps': 3866, 'loss/train': 2.214564323425293} 11/06/2021 21:48:20 - INFO - __main__ - Step 3868: {'lr': 0.0004998037009847485, 'samples': 742656, 'steps': 3867, 'loss/train': 2.071249485015869} 11/06/2021 21:48:20 - INFO - __main__ - Step 3869: {'lr': 0.0004998034906731699, 'samples': 742848, 'steps': 3868, 'loss/train': 1.7390302419662476} 11/06/2021 21:48:21 - INFO - __main__ - Step 3870: {'lr': 0.0004998032802490337, 'samples': 743040, 'steps': 3869, 'loss/train': 1.92386794090271} 11/06/2021 21:48:21 - INFO - __main__ - Step 3871: {'lr': 0.0004998030697123399, 'samples': 743232, 'steps': 3870, 'loss/train': 2.0642926692962646} 11/06/2021 21:48:22 - INFO - __main__ - Step 3872: {'lr': 0.0004998028590630887, 'samples': 743424, 'steps': 3871, 'loss/train': 2.2832822799682617} 11/06/2021 21:48:22 - INFO - __main__ - Step 3873: {'lr': 0.0004998026483012803, 'samples': 743616, 'steps': 3872, 'loss/train': 0.3874906599521637} 11/06/2021 21:48:23 - INFO - __main__ - Step 3874: {'lr': 0.0004998024374269147, 'samples': 743808, 'steps': 3873, 'loss/train': 1.9504804611206055} 11/06/2021 21:48:23 - INFO - __main__ - Step 3875: {'lr': 0.000499802226439992, 'samples': 744000, 'steps': 3874, 'loss/train': 1.7076032161712646} 11/06/2021 21:48:23 - INFO - __main__ - Step 3876: {'lr': 0.0004998020153405121, 'samples': 744192, 'steps': 3875, 'loss/train': 2.5436949729919434} 11/06/2021 21:48:25 - INFO - __main__ - Step 3877: {'lr': 0.0004998018041284754, 'samples': 744384, 'steps': 3876, 'loss/train': 1.7179841995239258} 11/06/2021 21:48:25 - INFO - __main__ - Step 3878: {'lr': 0.0004998015928038819, 'samples': 744576, 'steps': 3877, 'loss/train': 1.9025819301605225} 11/06/2021 21:48:25 - INFO - __main__ - Step 3879: {'lr': 0.0004998013813667315, 'samples': 744768, 'steps': 3878, 'loss/train': 6.284189701080322} 11/06/2021 21:48:26 - INFO - __main__ - Step 3880: {'lr': 0.0004998011698170245, 'samples': 744960, 'steps': 3879, 'loss/train': 2.270320415496826} 11/06/2021 21:48:26 - INFO - __main__ - Step 3881: {'lr': 0.000499800958154761, 'samples': 745152, 'steps': 3880, 'loss/train': 1.9726059436798096} 11/06/2021 21:48:27 - INFO - __main__ - Step 3882: {'lr': 0.000499800746379941, 'samples': 745344, 'steps': 3881, 'loss/train': 2.072402000427246} 11/06/2021 21:48:28 - INFO - __main__ - Step 3883: {'lr': 0.0004998005344925647, 'samples': 745536, 'steps': 3882, 'loss/train': 1.0873523950576782} 11/06/2021 21:48:28 - INFO - __main__ - Step 3884: {'lr': 0.0004998003224926321, 'samples': 745728, 'steps': 3883, 'loss/train': 1.970503807067871} 11/06/2021 21:48:29 - INFO - __main__ - Step 3885: {'lr': 0.0004998001103801433, 'samples': 745920, 'steps': 3884, 'loss/train': 2.3565833568573} 11/06/2021 21:48:29 - INFO - __main__ - Step 3886: {'lr': 0.0004997998981550985, 'samples': 746112, 'steps': 3885, 'loss/train': 1.7249259948730469} 11/06/2021 21:48:29 - INFO - __main__ - Step 3887: {'lr': 0.0004997996858174976, 'samples': 746304, 'steps': 3886, 'loss/train': 2.158630132675171} 11/06/2021 21:48:30 - INFO - __main__ - Step 3888: {'lr': 0.0004997994733673409, 'samples': 746496, 'steps': 3887, 'loss/train': 2.0216381549835205} 11/06/2021 21:48:31 - INFO - __main__ - Step 3889: {'lr': 0.0004997992608046283, 'samples': 746688, 'steps': 3888, 'loss/train': 2.1885178089141846} 11/06/2021 21:48:31 - INFO - __main__ - Step 3890: {'lr': 0.0004997990481293602, 'samples': 746880, 'steps': 3889, 'loss/train': 2.567704916000366} 11/06/2021 21:48:31 - INFO - __main__ - Step 3891: {'lr': 0.0004997988353415364, 'samples': 747072, 'steps': 3890, 'loss/train': 2.117457389831543} 11/06/2021 21:48:32 - INFO - __main__ - Step 3892: {'lr': 0.0004997986224411571, 'samples': 747264, 'steps': 3891, 'loss/train': 1.900480031967163} 11/06/2021 21:48:32 - INFO - __main__ - Step 3893: {'lr': 0.0004997984094282224, 'samples': 747456, 'steps': 3892, 'loss/train': 1.8909573554992676} 11/06/2021 21:48:33 - INFO - __main__ - Step 3894: {'lr': 0.0004997981963027324, 'samples': 747648, 'steps': 3893, 'loss/train': 2.2381651401519775} 11/06/2021 21:48:33 - INFO - __main__ - Step 3895: {'lr': 0.0004997979830646871, 'samples': 747840, 'steps': 3894, 'loss/train': 1.9018830060958862} 11/06/2021 21:48:34 - INFO - __main__ - Step 3896: {'lr': 0.0004997977697140868, 'samples': 748032, 'steps': 3895, 'loss/train': 4.09332799911499} 11/06/2021 21:48:34 - INFO - __main__ - Step 3897: {'lr': 0.0004997975562509315, 'samples': 748224, 'steps': 3896, 'loss/train': 1.9882245063781738} 11/06/2021 21:48:34 - INFO - __main__ - Step 3898: {'lr': 0.0004997973426752212, 'samples': 748416, 'steps': 3897, 'loss/train': 2.2893571853637695} 11/06/2021 21:48:35 - INFO - __main__ - Step 3899: {'lr': 0.0004997971289869561, 'samples': 748608, 'steps': 3898, 'loss/train': 2.5305356979370117} 11/06/2021 21:48:36 - INFO - __main__ - Step 3900: {'lr': 0.0004997969151861362, 'samples': 748800, 'steps': 3899, 'loss/train': 1.9628640413284302} 11/06/2021 21:48:36 - INFO - __main__ - Step 3901: {'lr': 0.0004997967012727618, 'samples': 748992, 'steps': 3900, 'loss/train': 1.784785270690918} 11/06/2021 21:48:37 - INFO - __main__ - Step 3902: {'lr': 0.0004997964872468327, 'samples': 749184, 'steps': 3901, 'loss/train': 2.4843428134918213} 11/06/2021 21:48:37 - INFO - __main__ - Step 3903: {'lr': 0.0004997962731083492, 'samples': 749376, 'steps': 3902, 'loss/train': 2.3795173168182373} 11/06/2021 21:48:37 - INFO - __main__ - Step 3904: {'lr': 0.0004997960588573115, 'samples': 749568, 'steps': 3903, 'loss/train': 1.7460960149765015} 11/06/2021 21:48:38 - INFO - __main__ - Step 3905: {'lr': 0.0004997958444937193, 'samples': 749760, 'steps': 3904, 'loss/train': 2.3648343086242676} 11/06/2021 21:48:39 - INFO - __main__ - Step 3906: {'lr': 0.0004997956300175732, 'samples': 749952, 'steps': 3905, 'loss/train': 2.6430604457855225} 11/06/2021 21:48:39 - INFO - __main__ - Step 3907: {'lr': 0.000499795415428873, 'samples': 750144, 'steps': 3906, 'loss/train': 1.1248779296875} 11/06/2021 21:48:39 - INFO - __main__ - Step 3908: {'lr': 0.0004997952007276187, 'samples': 750336, 'steps': 3907, 'loss/train': 1.4855318069458008} 11/06/2021 21:48:40 - INFO - __main__ - Step 3909: {'lr': 0.0004997949859138106, 'samples': 750528, 'steps': 3908, 'loss/train': 2.541107177734375} 11/06/2021 21:48:41 - INFO - __main__ - Step 3910: {'lr': 0.0004997947709874487, 'samples': 750720, 'steps': 3909, 'loss/train': 2.4403843879699707} 11/06/2021 21:48:41 - INFO - __main__ - Step 3911: {'lr': 0.0004997945559485333, 'samples': 750912, 'steps': 3910, 'loss/train': 2.18257212638855} 11/06/2021 21:48:41 - INFO - __main__ - Step 3912: {'lr': 0.0004997943407970642, 'samples': 751104, 'steps': 3911, 'loss/train': 2.1708099842071533} 11/06/2021 21:48:42 - INFO - __main__ - Step 3913: {'lr': 0.0004997941255330416, 'samples': 751296, 'steps': 3912, 'loss/train': 1.9108836650848389} 11/06/2021 21:48:42 - INFO - __main__ - Step 3914: {'lr': 0.0004997939101564656, 'samples': 751488, 'steps': 3913, 'loss/train': 9.35435676574707} 11/06/2021 21:48:43 - INFO - __main__ - Step 3915: {'lr': 0.0004997936946673365, 'samples': 751680, 'steps': 3914, 'loss/train': 1.6310161352157593} 11/06/2021 21:48:44 - INFO - __main__ - Step 3916: {'lr': 0.000499793479065654, 'samples': 751872, 'steps': 3915, 'loss/train': 2.2418010234832764} 11/06/2021 21:48:44 - INFO - __main__ - Step 3917: {'lr': 0.0004997932633514185, 'samples': 752064, 'steps': 3916, 'loss/train': 2.000413417816162} 11/06/2021 21:48:44 - INFO - __main__ - Step 3918: {'lr': 0.00049979304752463, 'samples': 752256, 'steps': 3917, 'loss/train': 1.5085346698760986} 11/06/2021 21:48:45 - INFO - __main__ - Step 3919: {'lr': 0.0004997928315852887, 'samples': 752448, 'steps': 3918, 'loss/train': 2.0506739616394043} 11/06/2021 21:48:46 - INFO - __main__ - Step 3920: {'lr': 0.0004997926155333944, 'samples': 752640, 'steps': 3919, 'loss/train': 1.832281231880188} 11/06/2021 21:48:46 - INFO - __main__ - Step 3921: {'lr': 0.0004997923993689476, 'samples': 752832, 'steps': 3920, 'loss/train': 1.8612034320831299} 11/06/2021 21:48:46 - INFO - __main__ - Step 3922: {'lr': 0.0004997921830919481, 'samples': 753024, 'steps': 3921, 'loss/train': 0.8128635287284851} 11/06/2021 21:48:47 - INFO - __main__ - Step 3923: {'lr': 0.0004997919667023962, 'samples': 753216, 'steps': 3922, 'loss/train': 6.834549903869629} 11/06/2021 21:48:47 - INFO - __main__ - Step 3924: {'lr': 0.0004997917502002917, 'samples': 753408, 'steps': 3923, 'loss/train': 2.564049243927002} 11/06/2021 21:48:47 - INFO - __main__ - Step 3925: {'lr': 0.000499791533585635, 'samples': 753600, 'steps': 3924, 'loss/train': 1.4833868741989136} 11/06/2021 21:48:49 - INFO - __main__ - Step 3926: {'lr': 0.0004997913168584262, 'samples': 753792, 'steps': 3925, 'loss/train': 1.9561244249343872} 11/06/2021 21:48:49 - INFO - __main__ - Step 3927: {'lr': 0.0004997911000186651, 'samples': 753984, 'steps': 3926, 'loss/train': 2.2300021648406982} 11/06/2021 21:48:49 - INFO - __main__ - Step 3928: {'lr': 0.0004997908830663521, 'samples': 754176, 'steps': 3927, 'loss/train': 2.1345834732055664} 11/06/2021 21:48:50 - INFO - __main__ - Step 3929: {'lr': 0.0004997906660014871, 'samples': 754368, 'steps': 3928, 'loss/train': 1.8014755249023438} 11/06/2021 21:48:50 - INFO - __main__ - Step 3930: {'lr': 0.0004997904488240704, 'samples': 754560, 'steps': 3929, 'loss/train': 1.9056246280670166} 11/06/2021 21:48:51 - INFO - __main__ - Step 3931: {'lr': 0.0004997902315341019, 'samples': 754752, 'steps': 3930, 'loss/train': 2.0737431049346924} 11/06/2021 21:48:51 - INFO - __main__ - Step 3932: {'lr': 0.0004997900141315817, 'samples': 754944, 'steps': 3931, 'loss/train': 2.1492855548858643} 11/06/2021 21:48:52 - INFO - __main__ - Step 3933: {'lr': 0.0004997897966165101, 'samples': 755136, 'steps': 3932, 'loss/train': 1.6962233781814575} 11/06/2021 21:48:52 - INFO - __main__ - Step 3934: {'lr': 0.000499789578988887, 'samples': 755328, 'steps': 3933, 'loss/train': 2.532392740249634} 11/06/2021 21:48:52 - INFO - __main__ - Step 3935: {'lr': 0.0004997893612487126, 'samples': 755520, 'steps': 3934, 'loss/train': 2.003080368041992} 11/06/2021 21:48:53 - INFO - __main__ - Step 3936: {'lr': 0.000499789143395987, 'samples': 755712, 'steps': 3935, 'loss/train': 2.3303253650665283} 11/06/2021 21:48:54 - INFO - __main__ - Step 3937: {'lr': 0.0004997889254307103, 'samples': 755904, 'steps': 3936, 'loss/train': 2.4294395446777344} 11/06/2021 21:48:54 - INFO - __main__ - Step 3938: {'lr': 0.0004997887073528825, 'samples': 756096, 'steps': 3937, 'loss/train': 2.0944905281066895} 11/06/2021 21:48:54 - INFO - __main__ - Step 3939: {'lr': 0.0004997884891625037, 'samples': 756288, 'steps': 3938, 'loss/train': 2.200429916381836} 11/06/2021 21:48:55 - INFO - __main__ - Step 3940: {'lr': 0.0004997882708595742, 'samples': 756480, 'steps': 3939, 'loss/train': 1.588007926940918} 11/06/2021 21:48:55 - INFO - __main__ - Step 3941: {'lr': 0.0004997880524440939, 'samples': 756672, 'steps': 3940, 'loss/train': 2.470881700515747} 11/06/2021 21:48:56 - INFO - __main__ - Step 3942: {'lr': 0.0004997878339160628, 'samples': 756864, 'steps': 3941, 'loss/train': 2.125309705734253} 11/06/2021 21:48:56 - INFO - __main__ - Step 3943: {'lr': 0.0004997876152754814, 'samples': 757056, 'steps': 3942, 'loss/train': 1.4756630659103394} 11/06/2021 21:48:57 - INFO - __main__ - Step 3944: {'lr': 0.0004997873965223495, 'samples': 757248, 'steps': 3943, 'loss/train': 2.6807804107666016} 11/06/2021 21:48:57 - INFO - __main__ - Step 3945: {'lr': 0.0004997871776566672, 'samples': 757440, 'steps': 3944, 'loss/train': 2.2836551666259766} 11/06/2021 21:48:58 - INFO - __main__ - Step 3946: {'lr': 0.0004997869586784346, 'samples': 757632, 'steps': 3945, 'loss/train': 2.121295690536499} 11/06/2021 21:48:59 - INFO - __main__ - Step 3947: {'lr': 0.0004997867395876519, 'samples': 757824, 'steps': 3946, 'loss/train': 2.3428549766540527} 11/06/2021 21:48:59 - INFO - __main__ - Step 3948: {'lr': 0.0004997865203843192, 'samples': 758016, 'steps': 3947, 'loss/train': 1.777374267578125} 11/06/2021 21:48:59 - INFO - __main__ - Step 3949: {'lr': 0.0004997863010684365, 'samples': 758208, 'steps': 3948, 'loss/train': 0.35795682668685913} 11/06/2021 21:49:00 - INFO - __main__ - Step 3950: {'lr': 0.0004997860816400039, 'samples': 758400, 'steps': 3949, 'loss/train': 2.5950117111206055} 11/06/2021 21:49:00 - INFO - __main__ - Step 3951: {'lr': 0.0004997858620990217, 'samples': 758592, 'steps': 3950, 'loss/train': 1.9574391841888428} 11/06/2021 21:49:01 - INFO - __main__ - Step 3952: {'lr': 0.0004997856424454897, 'samples': 758784, 'steps': 3951, 'loss/train': 1.4650882482528687} 11/06/2021 21:49:02 - INFO - __main__ - Step 3953: {'lr': 0.0004997854226794082, 'samples': 758976, 'steps': 3952, 'loss/train': 2.324097156524658} 11/06/2021 21:49:02 - INFO - __main__ - Step 3954: {'lr': 0.0004997852028007772, 'samples': 759168, 'steps': 3953, 'loss/train': 1.7094135284423828} 11/06/2021 21:49:02 - INFO - __main__ - Step 3955: {'lr': 0.0004997849828095969, 'samples': 759360, 'steps': 3954, 'loss/train': 2.0461413860321045} 11/06/2021 21:49:03 - INFO - __main__ - Step 3956: {'lr': 0.0004997847627058673, 'samples': 759552, 'steps': 3955, 'loss/train': 1.8901828527450562} 11/06/2021 21:49:04 - INFO - __main__ - Step 3957: {'lr': 0.0004997845424895886, 'samples': 759744, 'steps': 3956, 'loss/train': 2.1595001220703125} 11/06/2021 21:49:04 - INFO - __main__ - Step 3958: {'lr': 0.0004997843221607607, 'samples': 759936, 'steps': 3957, 'loss/train': 2.0419363975524902} 11/06/2021 21:49:04 - INFO - __main__ - Step 3959: {'lr': 0.0004997841017193841, 'samples': 760128, 'steps': 3958, 'loss/train': 2.1688108444213867} 11/06/2021 21:49:05 - INFO - __main__ - Step 3960: {'lr': 0.0004997838811654584, 'samples': 760320, 'steps': 3959, 'loss/train': 2.6631009578704834} 11/06/2021 21:49:05 - INFO - __main__ - Step 3961: {'lr': 0.000499783660498984, 'samples': 760512, 'steps': 3960, 'loss/train': 2.2320704460144043} 11/06/2021 21:49:06 - INFO - __main__ - Step 3962: {'lr': 0.0004997834397199609, 'samples': 760704, 'steps': 3961, 'loss/train': 2.1878163814544678} 11/06/2021 21:49:06 - INFO - __main__ - Step 3963: {'lr': 0.0004997832188283893, 'samples': 760896, 'steps': 3962, 'loss/train': 2.8972058296203613} 11/06/2021 21:49:07 - INFO - __main__ - Step 3964: {'lr': 0.0004997829978242693, 'samples': 761088, 'steps': 3963, 'loss/train': 1.9565014839172363} 11/06/2021 21:49:07 - INFO - __main__ - Step 3965: {'lr': 0.0004997827767076008, 'samples': 761280, 'steps': 3964, 'loss/train': 1.7469931840896606} 11/06/2021 21:49:07 - INFO - __main__ - Step 3966: {'lr': 0.0004997825554783841, 'samples': 761472, 'steps': 3965, 'loss/train': 1.9404348134994507} 11/06/2021 21:49:09 - INFO - __main__ - Step 3967: {'lr': 0.0004997823341366192, 'samples': 761664, 'steps': 3966, 'loss/train': 2.1852457523345947} 11/06/2021 21:49:09 - INFO - __main__ - Step 3968: {'lr': 0.0004997821126823062, 'samples': 761856, 'steps': 3967, 'loss/train': 2.048008680343628} 11/06/2021 21:49:09 - INFO - __main__ - Step 3969: {'lr': 0.0004997818911154454, 'samples': 762048, 'steps': 3968, 'loss/train': 2.4784631729125977} 11/06/2021 21:49:10 - INFO - __main__ - Step 3970: {'lr': 0.0004997816694360367, 'samples': 762240, 'steps': 3969, 'loss/train': 2.501253128051758} 11/06/2021 21:49:10 - INFO - __main__ - Step 3971: {'lr': 0.00049978144764408, 'samples': 762432, 'steps': 3970, 'loss/train': 2.0882301330566406} 11/06/2021 21:49:10 - INFO - __main__ - Step 3972: {'lr': 0.0004997812257395758, 'samples': 762624, 'steps': 3971, 'loss/train': 0.8743030428886414} 11/06/2021 21:49:11 - INFO - __main__ - Step 3973: {'lr': 0.0004997810037225241, 'samples': 762816, 'steps': 3972, 'loss/train': 1.5809043645858765} 11/06/2021 21:49:12 - INFO - __main__ - Step 3974: {'lr': 0.0004997807815929248, 'samples': 763008, 'steps': 3973, 'loss/train': 2.1613574028015137} 11/06/2021 21:49:12 - INFO - __main__ - Step 3975: {'lr': 0.0004997805593507783, 'samples': 763200, 'steps': 3974, 'loss/train': 1.512646198272705} 11/06/2021 21:49:12 - INFO - __main__ - Step 3976: {'lr': 0.0004997803369960844, 'samples': 763392, 'steps': 3975, 'loss/train': 1.4740993976593018} 11/06/2021 21:49:13 - INFO - __main__ - Step 3977: {'lr': 0.0004997801145288433, 'samples': 763584, 'steps': 3976, 'loss/train': 1.5394988059997559} 11/06/2021 21:49:14 - INFO - __main__ - Step 3978: {'lr': 0.0004997798919490553, 'samples': 763776, 'steps': 3977, 'loss/train': 1.7356780767440796} 11/06/2021 21:49:14 - INFO - __main__ - Step 3979: {'lr': 0.0004997796692567202, 'samples': 763968, 'steps': 3978, 'loss/train': 2.2947258949279785} 11/06/2021 21:49:14 - INFO - __main__ - Step 3980: {'lr': 0.0004997794464518383, 'samples': 764160, 'steps': 3979, 'loss/train': 2.025333881378174} 11/06/2021 21:49:15 - INFO - __main__ - Step 3981: {'lr': 0.0004997792235344096, 'samples': 764352, 'steps': 3980, 'loss/train': 2.1326520442962646} 11/06/2021 21:49:15 - INFO - __main__ - Step 3982: {'lr': 0.0004997790005044343, 'samples': 764544, 'steps': 3981, 'loss/train': 1.8855067491531372} 11/06/2021 21:49:16 - INFO - __main__ - Step 3983: {'lr': 0.0004997787773619123, 'samples': 764736, 'steps': 3982, 'loss/train': 2.2183456420898438} 11/06/2021 21:49:17 - INFO - __main__ - Step 3984: {'lr': 0.0004997785541068439, 'samples': 764928, 'steps': 3983, 'loss/train': 2.175524950027466} 11/06/2021 21:49:17 - INFO - __main__ - Step 3985: {'lr': 0.0004997783307392292, 'samples': 765120, 'steps': 3984, 'loss/train': 1.8312244415283203} 11/06/2021 21:49:17 - INFO - __main__ - Step 3986: {'lr': 0.0004997781072590683, 'samples': 765312, 'steps': 3985, 'loss/train': 5.831401348114014} 11/06/2021 21:49:18 - INFO - __main__ - Step 3987: {'lr': 0.000499777883666361, 'samples': 765504, 'steps': 3986, 'loss/train': 2.1856584548950195} 11/06/2021 21:49:19 - INFO - __main__ - Step 3988: {'lr': 0.0004997776599611078, 'samples': 765696, 'steps': 3987, 'loss/train': 1.9828382730484009} 11/06/2021 21:49:19 - INFO - __main__ - Step 3989: {'lr': 0.0004997774361433086, 'samples': 765888, 'steps': 3988, 'loss/train': 2.3569905757904053} 11/06/2021 21:49:19 - INFO - __main__ - Step 3990: {'lr': 0.0004997772122129635, 'samples': 766080, 'steps': 3989, 'loss/train': 1.4840973615646362} 11/06/2021 21:49:20 - INFO - __main__ - Step 3991: {'lr': 0.0004997769881700727, 'samples': 766272, 'steps': 3990, 'loss/train': 2.4721338748931885} 11/06/2021 21:49:20 - INFO - __main__ - Step 3992: {'lr': 0.0004997767640146363, 'samples': 766464, 'steps': 3991, 'loss/train': 1.890811800956726} 11/06/2021 21:49:21 - INFO - __main__ - Step 3993: {'lr': 0.0004997765397466543, 'samples': 766656, 'steps': 3992, 'loss/train': 1.7789920568466187} 11/06/2021 21:49:22 - INFO - __main__ - Step 3994: {'lr': 0.0004997763153661269, 'samples': 766848, 'steps': 3993, 'loss/train': 0.7541496157646179} 11/06/2021 21:49:22 - INFO - __main__ - Step 3995: {'lr': 0.000499776090873054, 'samples': 767040, 'steps': 3994, 'loss/train': 1.7700446844100952} 11/06/2021 21:49:22 - INFO - __main__ - Step 3996: {'lr': 0.000499775866267436, 'samples': 767232, 'steps': 3995, 'loss/train': 2.0774638652801514} 11/06/2021 21:49:23 - INFO - __main__ - Step 3997: {'lr': 0.0004997756415492727, 'samples': 767424, 'steps': 3996, 'loss/train': 2.1475741863250732} 11/06/2021 21:49:23 - INFO - __main__ - Step 3998: {'lr': 0.0004997754167185644, 'samples': 767616, 'steps': 3997, 'loss/train': 2.2139883041381836} 11/06/2021 21:49:24 - INFO - __main__ - Step 3999: {'lr': 0.0004997751917753113, 'samples': 767808, 'steps': 3998, 'loss/train': 1.3149014711380005} 11/06/2021 21:49:24 - INFO - __main__ - Step 4000: {'lr': 0.0004997749667195132, 'samples': 768000, 'steps': 3999, 'loss/train': 2.0716893672943115} 11/06/2021 21:49:25 - INFO - __main__ - Step 4001: {'lr': 0.0004997747415511704, 'samples': 768192, 'steps': 4000, 'loss/train': 2.3865301609039307} 11/06/2021 21:49:25 - INFO - __main__ - Step 4002: {'lr': 0.000499774516270283, 'samples': 768384, 'steps': 4001, 'loss/train': 2.4549617767333984} 11/06/2021 21:49:25 - INFO - __main__ - Step 4003: {'lr': 0.0004997742908768508, 'samples': 768576, 'steps': 4002, 'loss/train': 2.512511968612671} 11/06/2021 21:49:27 - INFO - __main__ - Step 4004: {'lr': 0.0004997740653708744, 'samples': 768768, 'steps': 4003, 'loss/train': 1.9932130575180054} 11/06/2021 21:49:27 - INFO - __main__ - Step 4005: {'lr': 0.0004997738397523537, 'samples': 768960, 'steps': 4004, 'loss/train': 2.0718185901641846} 11/06/2021 21:49:27 - INFO - __main__ - Step 4006: {'lr': 0.0004997736140212887, 'samples': 769152, 'steps': 4005, 'loss/train': 2.311023712158203} 11/06/2021 21:49:28 - INFO - __main__ - Step 4007: {'lr': 0.0004997733881776796, 'samples': 769344, 'steps': 4006, 'loss/train': 2.1917130947113037} 11/06/2021 21:49:28 - INFO - __main__ - Step 4008: {'lr': 0.0004997731622215264, 'samples': 769536, 'steps': 4007, 'loss/train': 2.4064438343048096} 11/06/2021 21:49:29 - INFO - __main__ - Step 4009: {'lr': 0.0004997729361528292, 'samples': 769728, 'steps': 4008, 'loss/train': 2.3731486797332764} 11/06/2021 21:49:29 - INFO - __main__ - Step 4010: {'lr': 0.0004997727099715882, 'samples': 769920, 'steps': 4009, 'loss/train': 1.5370467901229858} 11/06/2021 21:49:30 - INFO - __main__ - Step 4011: {'lr': 0.0004997724836778036, 'samples': 770112, 'steps': 4010, 'loss/train': 1.7078628540039062} 11/06/2021 21:49:30 - INFO - __main__ - Step 4012: {'lr': 0.0004997722572714753, 'samples': 770304, 'steps': 4011, 'loss/train': 1.5252915620803833} 11/06/2021 21:49:30 - INFO - __main__ - Step 4013: {'lr': 0.0004997720307526034, 'samples': 770496, 'steps': 4012, 'loss/train': 2.1763927936553955} 11/06/2021 21:49:31 - INFO - __main__ - Step 4014: {'lr': 0.0004997718041211881, 'samples': 770688, 'steps': 4013, 'loss/train': 2.195366382598877} 11/06/2021 21:49:32 - INFO - __main__ - Step 4015: {'lr': 0.0004997715773772296, 'samples': 770880, 'steps': 4014, 'loss/train': 1.1582077741622925} 11/06/2021 21:49:32 - INFO - __main__ - Step 4016: {'lr': 0.0004997713505207278, 'samples': 771072, 'steps': 4015, 'loss/train': 1.8005876541137695} 11/06/2021 21:49:32 - INFO - __main__ - Step 4017: {'lr': 0.0004997711235516829, 'samples': 771264, 'steps': 4016, 'loss/train': 1.86122727394104} 11/06/2021 21:49:33 - INFO - __main__ - Step 4018: {'lr': 0.000499770896470095, 'samples': 771456, 'steps': 4017, 'loss/train': 1.915101170539856} 11/06/2021 21:49:34 - INFO - __main__ - Step 4019: {'lr': 0.0004997706692759642, 'samples': 771648, 'steps': 4018, 'loss/train': 2.1200358867645264} 11/06/2021 21:49:34 - INFO - __main__ - Step 4020: {'lr': 0.0004997704419692905, 'samples': 771840, 'steps': 4019, 'loss/train': 2.1703789234161377} 11/06/2021 21:49:34 - INFO - __main__ - Step 4021: {'lr': 0.0004997702145500741, 'samples': 772032, 'steps': 4020, 'loss/train': 2.4524030685424805} 11/06/2021 21:49:35 - INFO - __main__ - Step 4022: {'lr': 0.0004997699870183151, 'samples': 772224, 'steps': 4021, 'loss/train': 2.2087340354919434} 11/06/2021 21:49:35 - INFO - __main__ - Step 4023: {'lr': 0.0004997697593740137, 'samples': 772416, 'steps': 4022, 'loss/train': 1.8903212547302246} 11/06/2021 21:49:36 - INFO - __main__ - Step 4024: {'lr': 0.0004997695316171698, 'samples': 772608, 'steps': 4023, 'loss/train': 2.31099534034729} 11/06/2021 21:49:37 - INFO - __main__ - Step 4025: {'lr': 0.0004997693037477837, 'samples': 772800, 'steps': 4024, 'loss/train': 2.07493257522583} 11/06/2021 21:49:37 - INFO - __main__ - Step 4026: {'lr': 0.0004997690757658552, 'samples': 772992, 'steps': 4025, 'loss/train': 1.9826215505599976} 11/06/2021 21:49:37 - INFO - __main__ - Step 4027: {'lr': 0.0004997688476713848, 'samples': 773184, 'steps': 4026, 'loss/train': 2.102391004562378} 11/06/2021 21:49:38 - INFO - __main__ - Step 4028: {'lr': 0.0004997686194643724, 'samples': 773376, 'steps': 4027, 'loss/train': 1.9948838949203491} 11/06/2021 21:49:38 - INFO - __main__ - Step 4029: {'lr': 0.0004997683911448181, 'samples': 773568, 'steps': 4028, 'loss/train': 1.7805095911026} 11/06/2021 21:49:39 - INFO - __main__ - Step 4030: {'lr': 0.000499768162712722, 'samples': 773760, 'steps': 4029, 'loss/train': 2.1849615573883057} 11/06/2021 21:49:39 - INFO - __main__ - Step 4031: {'lr': 0.0004997679341680843, 'samples': 773952, 'steps': 4030, 'loss/train': 2.313040256500244} 11/06/2021 21:49:40 - INFO - __main__ - Step 4032: {'lr': 0.0004997677055109049, 'samples': 774144, 'steps': 4031, 'loss/train': 1.893852949142456} 11/06/2021 21:49:40 - INFO - __main__ - Step 4033: {'lr': 0.0004997674767411841, 'samples': 774336, 'steps': 4032, 'loss/train': 2.130466938018799} 11/06/2021 21:49:40 - INFO - __main__ - Step 4034: {'lr': 0.0004997672478589219, 'samples': 774528, 'steps': 4033, 'loss/train': 1.615881085395813} 11/06/2021 21:49:41 - INFO - __main__ - Step 4035: {'lr': 0.0004997670188641183, 'samples': 774720, 'steps': 4034, 'loss/train': 2.1269845962524414} 11/06/2021 21:49:42 - INFO - __main__ - Step 4036: {'lr': 0.0004997667897567738, 'samples': 774912, 'steps': 4035, 'loss/train': 2.2043027877807617} 11/06/2021 21:49:42 - INFO - __main__ - Step 4037: {'lr': 0.0004997665605368881, 'samples': 775104, 'steps': 4036, 'loss/train': 1.7043681144714355} 11/06/2021 21:49:42 - INFO - __main__ - Step 4038: {'lr': 0.0004997663312044614, 'samples': 775296, 'steps': 4037, 'loss/train': 1.8073010444641113} 11/06/2021 21:49:43 - INFO - __main__ - Step 4039: {'lr': 0.0004997661017594939, 'samples': 775488, 'steps': 4038, 'loss/train': 2.2743730545043945} 11/06/2021 21:49:44 - INFO - __main__ - Step 4040: {'lr': 0.0004997658722019857, 'samples': 775680, 'steps': 4039, 'loss/train': 1.5168383121490479} 11/06/2021 21:49:44 - INFO - __main__ - Step 4041: {'lr': 0.0004997656425319367, 'samples': 775872, 'steps': 4040, 'loss/train': 2.3586344718933105} 11/06/2021 21:49:44 - INFO - __main__ - Step 4042: {'lr': 0.0004997654127493473, 'samples': 776064, 'steps': 4041, 'loss/train': 2.290358066558838} 11/06/2021 21:49:45 - INFO - __main__ - Step 4043: {'lr': 0.0004997651828542173, 'samples': 776256, 'steps': 4042, 'loss/train': 2.349440336227417} 11/06/2021 21:49:45 - INFO - __main__ - Step 4044: {'lr': 0.0004997649528465471, 'samples': 776448, 'steps': 4043, 'loss/train': 2.3541646003723145} 11/06/2021 21:49:46 - INFO - __main__ - Step 4045: {'lr': 0.0004997647227263367, 'samples': 776640, 'steps': 4044, 'loss/train': 2.01802396774292} 11/06/2021 21:49:47 - INFO - __main__ - Step 4046: {'lr': 0.000499764492493586, 'samples': 776832, 'steps': 4045, 'loss/train': 1.9747871160507202} 11/06/2021 21:49:47 - INFO - __main__ - Step 4047: {'lr': 0.0004997642621482955, 'samples': 777024, 'steps': 4046, 'loss/train': 3.536803960800171} 11/06/2021 21:49:47 - INFO - __main__ - Step 4048: {'lr': 0.0004997640316904649, 'samples': 777216, 'steps': 4047, 'loss/train': 2.313307523727417} 11/06/2021 21:49:48 - INFO - __main__ - Step 4049: {'lr': 0.0004997638011200946, 'samples': 777408, 'steps': 4048, 'loss/train': 1.8340681791305542} 11/06/2021 21:49:49 - INFO - __main__ - Step 4050: {'lr': 0.0004997635704371844, 'samples': 777600, 'steps': 4049, 'loss/train': 2.015902519226074} 11/06/2021 21:49:49 - INFO - __main__ - Step 4051: {'lr': 0.0004997633396417348, 'samples': 777792, 'steps': 4050, 'loss/train': 2.1511425971984863} 11/06/2021 21:49:49 - INFO - __main__ - Step 4052: {'lr': 0.0004997631087337456, 'samples': 777984, 'steps': 4051, 'loss/train': 1.9303054809570312} 11/06/2021 21:49:50 - INFO - __main__ - Step 4053: {'lr': 0.000499762877713217, 'samples': 778176, 'steps': 4052, 'loss/train': 2.202286720275879} 11/06/2021 21:49:50 - INFO - __main__ - Step 4054: {'lr': 0.0004997626465801492, 'samples': 778368, 'steps': 4053, 'loss/train': 2.0874695777893066} 11/06/2021 21:49:51 - INFO - __main__ - Step 4055: {'lr': 0.000499762415334542, 'samples': 778560, 'steps': 4054, 'loss/train': 1.871009111404419} 11/06/2021 21:49:51 - INFO - __main__ - Step 4056: {'lr': 0.0004997621839763958, 'samples': 778752, 'steps': 4055, 'loss/train': 1.8566710948944092} 11/06/2021 21:49:52 - INFO - __main__ - Step 4057: {'lr': 0.0004997619525057106, 'samples': 778944, 'steps': 4056, 'loss/train': 1.2701704502105713} 11/06/2021 21:49:52 - INFO - __main__ - Step 4058: {'lr': 0.0004997617209224866, 'samples': 779136, 'steps': 4057, 'loss/train': 2.2687175273895264} 11/06/2021 21:49:52 - INFO - __main__ - Step 4059: {'lr': 0.0004997614892267238, 'samples': 779328, 'steps': 4058, 'loss/train': 1.7731740474700928} 11/06/2021 21:49:53 - INFO - __main__ - Step 4060: {'lr': 0.0004997612574184223, 'samples': 779520, 'steps': 4059, 'loss/train': 2.616718292236328} 11/06/2021 21:49:54 - INFO - __main__ - Step 4061: {'lr': 0.0004997610254975823, 'samples': 779712, 'steps': 4060, 'loss/train': 1.441390872001648} 11/06/2021 21:49:54 - INFO - __main__ - Step 4062: {'lr': 0.0004997607934642038, 'samples': 779904, 'steps': 4061, 'loss/train': 2.379161834716797} 11/06/2021 21:49:55 - INFO - __main__ - Step 4063: {'lr': 0.0004997605613182868, 'samples': 780096, 'steps': 4062, 'loss/train': 2.2010395526885986} 11/06/2021 21:49:55 - INFO - __main__ - Step 4064: {'lr': 0.0004997603290598317, 'samples': 780288, 'steps': 4063, 'loss/train': 1.8180835247039795} 11/06/2021 21:49:55 - INFO - __main__ - Step 4065: {'lr': 0.0004997600966888384, 'samples': 780480, 'steps': 4064, 'loss/train': 1.5754492282867432} 11/06/2021 21:49:56 - INFO - __main__ - Step 4066: {'lr': 0.000499759864205307, 'samples': 780672, 'steps': 4065, 'loss/train': 1.9471435546875} 11/06/2021 21:49:57 - INFO - __main__ - Step 4067: {'lr': 0.0004997596316092378, 'samples': 780864, 'steps': 4066, 'loss/train': 0.3540392220020294} 11/06/2021 21:49:57 - INFO - __main__ - Step 4068: {'lr': 0.0004997593989006306, 'samples': 781056, 'steps': 4067, 'loss/train': 1.7563910484313965} 11/06/2021 21:49:58 - INFO - __main__ - Step 4069: {'lr': 0.0004997591660794858, 'samples': 781248, 'steps': 4068, 'loss/train': 2.3376967906951904} 11/06/2021 21:49:58 - INFO - __main__ - Step 4070: {'lr': 0.0004997589331458034, 'samples': 781440, 'steps': 4069, 'loss/train': 2.6819803714752197} 11/06/2021 21:49:59 - INFO - __main__ - Step 4071: {'lr': 0.0004997587000995833, 'samples': 781632, 'steps': 4070, 'loss/train': 2.0289735794067383} 11/06/2021 21:49:59 - INFO - __main__ - Step 4072: {'lr': 0.000499758466940826, 'samples': 781824, 'steps': 4071, 'loss/train': 1.8696802854537964} 11/06/2021 21:50:00 - INFO - __main__ - Step 4073: {'lr': 0.0004997582336695312, 'samples': 782016, 'steps': 4072, 'loss/train': 2.1343019008636475} 11/06/2021 21:50:00 - INFO - __main__ - Step 4074: {'lr': 0.0004997580002856993, 'samples': 782208, 'steps': 4073, 'loss/train': 2.6347973346710205} 11/06/2021 21:50:00 - INFO - __main__ - Step 4075: {'lr': 0.0004997577667893303, 'samples': 782400, 'steps': 4074, 'loss/train': 2.238234519958496} 11/06/2021 21:50:01 - INFO - __main__ - Step 4076: {'lr': 0.0004997575331804243, 'samples': 782592, 'steps': 4075, 'loss/train': 1.6434662342071533} 11/06/2021 21:50:02 - INFO - __main__ - Step 4077: {'lr': 0.0004997572994589812, 'samples': 782784, 'steps': 4076, 'loss/train': 2.2874550819396973} 11/06/2021 21:50:02 - INFO - __main__ - Step 4078: {'lr': 0.0004997570656250016, 'samples': 782976, 'steps': 4077, 'loss/train': 2.18599009513855} 11/06/2021 21:50:02 - INFO - __main__ - Step 4079: {'lr': 0.0004997568316784852, 'samples': 783168, 'steps': 4078, 'loss/train': 2.6030142307281494} 11/06/2021 21:50:03 - INFO - __main__ - Step 4080: {'lr': 0.0004997565976194323, 'samples': 783360, 'steps': 4079, 'loss/train': 2.873201608657837} 11/06/2021 21:50:03 - INFO - __main__ - Step 4081: {'lr': 0.0004997563634478429, 'samples': 783552, 'steps': 4080, 'loss/train': 1.6083195209503174} 11/06/2021 21:50:04 - INFO - __main__ - Step 4082: {'lr': 0.000499756129163717, 'samples': 783744, 'steps': 4081, 'loss/train': 2.6470818519592285} 11/06/2021 21:50:05 - INFO - __main__ - Step 4083: {'lr': 0.000499755894767055, 'samples': 783936, 'steps': 4082, 'loss/train': 2.0863211154937744} 11/06/2021 21:50:05 - INFO - __main__ - Step 4084: {'lr': 0.0004997556602578568, 'samples': 784128, 'steps': 4083, 'loss/train': 2.274592161178589} 11/06/2021 21:50:05 - INFO - __main__ - Step 4085: {'lr': 0.0004997554256361225, 'samples': 784320, 'steps': 4084, 'loss/train': 1.7604327201843262} 11/06/2021 21:50:06 - INFO - __main__ - Step 4086: {'lr': 0.0004997551909018524, 'samples': 784512, 'steps': 4085, 'loss/train': 1.6476736068725586} 11/06/2021 21:50:07 - INFO - __main__ - Step 4087: {'lr': 0.0004997549560550464, 'samples': 784704, 'steps': 4086, 'loss/train': 2.3296737670898438} 11/06/2021 21:50:07 - INFO - __main__ - Step 4088: {'lr': 0.0004997547210957047, 'samples': 784896, 'steps': 4087, 'loss/train': 1.9330281019210815} 11/06/2021 21:50:07 - INFO - __main__ - Step 4089: {'lr': 0.0004997544860238272, 'samples': 785088, 'steps': 4088, 'loss/train': 2.002901315689087} 11/06/2021 21:50:08 - INFO - __main__ - Step 4090: {'lr': 0.0004997542508394144, 'samples': 785280, 'steps': 4089, 'loss/train': 2.227198362350464} 11/06/2021 21:50:08 - INFO - __main__ - Step 4091: {'lr': 0.000499754015542466, 'samples': 785472, 'steps': 4090, 'loss/train': 2.175802707672119} 11/06/2021 21:50:09 - INFO - __main__ - Step 4092: {'lr': 0.0004997537801329824, 'samples': 785664, 'steps': 4091, 'loss/train': 1.697835087776184} 11/06/2021 21:50:10 - INFO - __main__ - Step 4093: {'lr': 0.0004997535446109637, 'samples': 785856, 'steps': 4092, 'loss/train': 1.8053069114685059} 11/06/2021 21:50:10 - INFO - __main__ - Step 4094: {'lr': 0.0004997533089764097, 'samples': 786048, 'steps': 4093, 'loss/train': 1.4036602973937988} 11/06/2021 21:50:10 - INFO - __main__ - Step 4095: {'lr': 0.0004997530732293209, 'samples': 786240, 'steps': 4094, 'loss/train': 1.8951164484024048} 11/06/2021 21:50:11 - INFO - __main__ - Step 4096: {'lr': 0.000499752837369697, 'samples': 786432, 'steps': 4095, 'loss/train': 2.195209264755249} 11/06/2021 21:50:12 - INFO - __main__ - Step 4097: {'lr': 0.0004997526013975385, 'samples': 786624, 'steps': 4096, 'loss/train': 1.7116293907165527} 11/06/2021 21:50:12 - INFO - __main__ - Step 4098: {'lr': 0.0004997523653128453, 'samples': 786816, 'steps': 4097, 'loss/train': 1.4756605625152588} 11/06/2021 21:50:12 - INFO - __main__ - Step 4099: {'lr': 0.0004997521291156175, 'samples': 787008, 'steps': 4098, 'loss/train': 2.56708025932312} 11/06/2021 21:50:13 - INFO - __main__ - Step 4100: {'lr': 0.0004997518928058553, 'samples': 787200, 'steps': 4099, 'loss/train': 1.9039506912231445} 11/06/2021 21:50:13 - INFO - __main__ - Step 4101: {'lr': 0.0004997516563835587, 'samples': 787392, 'steps': 4100, 'loss/train': 2.242305040359497} 11/06/2021 21:50:14 - INFO - __main__ - Step 4102: {'lr': 0.0004997514198487279, 'samples': 787584, 'steps': 4101, 'loss/train': 1.7869306802749634} 11/06/2021 21:50:15 - INFO - __main__ - Step 4103: {'lr': 0.0004997511832013629, 'samples': 787776, 'steps': 4102, 'loss/train': 1.4681452512741089} 11/06/2021 21:50:15 - INFO - __main__ - Step 4104: {'lr': 0.0004997509464414639, 'samples': 787968, 'steps': 4103, 'loss/train': 1.8288829326629639} 11/06/2021 21:50:15 - INFO - __main__ - Step 4105: {'lr': 0.000499750709569031, 'samples': 788160, 'steps': 4104, 'loss/train': 2.145322322845459} 11/06/2021 21:50:16 - INFO - __main__ - Step 4106: {'lr': 0.0004997504725840644, 'samples': 788352, 'steps': 4105, 'loss/train': 1.8525432348251343} 11/06/2021 21:50:17 - INFO - __main__ - Step 4107: {'lr': 0.0004997502354865639, 'samples': 788544, 'steps': 4106, 'loss/train': 2.4128339290618896} 11/06/2021 21:50:17 - INFO - __main__ - Step 4108: {'lr': 0.0004997499982765299, 'samples': 788736, 'steps': 4107, 'loss/train': 2.3028995990753174} 11/06/2021 21:50:17 - INFO - __main__ - Step 4109: {'lr': 0.0004997497609539623, 'samples': 788928, 'steps': 4108, 'loss/train': 1.7529590129852295} 11/06/2021 21:50:18 - INFO - __main__ - Step 4110: {'lr': 0.0004997495235188614, 'samples': 789120, 'steps': 4109, 'loss/train': 2.2879607677459717} 11/06/2021 21:50:18 - INFO - __main__ - Step 4111: {'lr': 0.0004997492859712272, 'samples': 789312, 'steps': 4110, 'loss/train': 2.290275812149048} 11/06/2021 21:50:19 - INFO - __main__ - Step 4112: {'lr': 0.0004997490483110599, 'samples': 789504, 'steps': 4111, 'loss/train': 2.2251839637756348} 11/06/2021 21:50:19 - INFO - __main__ - Step 4113: {'lr': 0.0004997488105383594, 'samples': 789696, 'steps': 4112, 'loss/train': 1.8223003149032593} 11/06/2021 21:50:20 - INFO - __main__ - Step 4114: {'lr': 0.000499748572653126, 'samples': 789888, 'steps': 4113, 'loss/train': 2.5275418758392334} 11/06/2021 21:50:20 - INFO - __main__ - Step 4115: {'lr': 0.0004997483346553597, 'samples': 790080, 'steps': 4114, 'loss/train': 2.204193592071533} 11/06/2021 21:50:21 - INFO - __main__ - Step 4116: {'lr': 0.0004997480965450607, 'samples': 790272, 'steps': 4115, 'loss/train': 2.2369582653045654} 11/06/2021 21:50:21 - INFO - __main__ - Step 4117: {'lr': 0.0004997478583222291, 'samples': 790464, 'steps': 4116, 'loss/train': 1.5711380243301392} 11/06/2021 21:50:22 - INFO - __main__ - Step 4118: {'lr': 0.0004997476199868649, 'samples': 790656, 'steps': 4117, 'loss/train': 2.431384563446045} 11/06/2021 21:50:22 - INFO - __main__ - Step 4119: {'lr': 0.0004997473815389683, 'samples': 790848, 'steps': 4118, 'loss/train': 2.284108877182007} 11/06/2021 21:50:23 - INFO - __main__ - Step 4120: {'lr': 0.0004997471429785394, 'samples': 791040, 'steps': 4119, 'loss/train': 2.340329170227051} 11/06/2021 21:50:23 - INFO - __main__ - Step 4121: {'lr': 0.0004997469043055784, 'samples': 791232, 'steps': 4120, 'loss/train': 1.350162386894226} 11/06/2021 21:50:24 - INFO - __main__ - Step 4122: {'lr': 0.000499746665520085, 'samples': 791424, 'steps': 4121, 'loss/train': 2.2945504188537598} 11/06/2021 21:50:24 - INFO - __main__ - Step 4123: {'lr': 0.0004997464266220599, 'samples': 791616, 'steps': 4122, 'loss/train': 1.9403982162475586} 11/06/2021 21:50:25 - INFO - __main__ - Step 4124: {'lr': 0.0004997461876115029, 'samples': 791808, 'steps': 4123, 'loss/train': 1.7669719457626343} 11/06/2021 21:50:25 - INFO - __main__ - Step 4125: {'lr': 0.0004997459484884139, 'samples': 792000, 'steps': 4124, 'loss/train': 2.5181665420532227} 11/06/2021 21:50:25 - INFO - __main__ - Step 4126: {'lr': 0.0004997457092527934, 'samples': 792192, 'steps': 4125, 'loss/train': 1.4069466590881348} 11/06/2021 21:50:26 - INFO - __main__ - Step 4127: {'lr': 0.0004997454699046412, 'samples': 792384, 'steps': 4126, 'loss/train': 1.1352291107177734} 11/06/2021 21:50:27 - INFO - __main__ - Step 4128: {'lr': 0.0004997452304439577, 'samples': 792576, 'steps': 4127, 'loss/train': 2.2297260761260986} 11/06/2021 21:50:27 - INFO - __main__ - Step 4129: {'lr': 0.0004997449908707428, 'samples': 792768, 'steps': 4128, 'loss/train': 1.6142773628234863} 11/06/2021 21:50:27 - INFO - __main__ - Step 4130: {'lr': 0.0004997447511849966, 'samples': 792960, 'steps': 4129, 'loss/train': 0.43251675367355347} 11/06/2021 21:50:28 - INFO - __main__ - Step 4131: {'lr': 0.0004997445113867193, 'samples': 793152, 'steps': 4130, 'loss/train': 2.1551005840301514} 11/06/2021 21:50:28 - INFO - __main__ - Step 4132: {'lr': 0.000499744271475911, 'samples': 793344, 'steps': 4131, 'loss/train': 1.9457584619522095} 11/06/2021 21:50:29 - INFO - __main__ - Step 4133: {'lr': 0.0004997440314525718, 'samples': 793536, 'steps': 4132, 'loss/train': 1.7447444200515747} 11/06/2021 21:50:29 - INFO - __main__ - Step 4134: {'lr': 0.0004997437913167018, 'samples': 793728, 'steps': 4133, 'loss/train': 2.1197428703308105} 11/06/2021 21:50:30 - INFO - __main__ - Step 4135: {'lr': 0.0004997435510683011, 'samples': 793920, 'steps': 4134, 'loss/train': 1.950053095817566} 11/06/2021 21:50:30 - INFO - __main__ - Step 4136: {'lr': 0.0004997433107073697, 'samples': 794112, 'steps': 4135, 'loss/train': 2.236959457397461} 11/06/2021 21:50:30 - INFO - __main__ - Step 4137: {'lr': 0.000499743070233908, 'samples': 794304, 'steps': 4136, 'loss/train': 2.2369892597198486} 11/06/2021 21:50:31 - INFO - __main__ - Step 4138: {'lr': 0.0004997428296479158, 'samples': 794496, 'steps': 4137, 'loss/train': 1.6959571838378906} 11/06/2021 21:50:32 - INFO - __main__ - Step 4139: {'lr': 0.0004997425889493933, 'samples': 794688, 'steps': 4138, 'loss/train': 2.1793487071990967} 11/06/2021 21:50:32 - INFO - __main__ - Step 4140: {'lr': 0.0004997423481383407, 'samples': 794880, 'steps': 4139, 'loss/train': 1.875229001045227} 11/06/2021 21:50:33 - INFO - __main__ - Step 4141: {'lr': 0.0004997421072147581, 'samples': 795072, 'steps': 4140, 'loss/train': 2.0105512142181396} 11/06/2021 21:50:33 - INFO - __main__ - Step 4142: {'lr': 0.0004997418661786455, 'samples': 795264, 'steps': 4141, 'loss/train': 2.1727404594421387} 11/06/2021 21:50:34 - INFO - __main__ - Step 4143: {'lr': 0.0004997416250300031, 'samples': 795456, 'steps': 4142, 'loss/train': 2.191661834716797} 11/06/2021 21:50:34 - INFO - __main__ - Step 4144: {'lr': 0.0004997413837688309, 'samples': 795648, 'steps': 4143, 'loss/train': 1.4901235103607178} 11/06/2021 21:50:35 - INFO - __main__ - Step 4145: {'lr': 0.0004997411423951292, 'samples': 795840, 'steps': 4144, 'loss/train': 2.327420234680176} 11/06/2021 21:50:35 - INFO - __main__ - Step 4146: {'lr': 0.0004997409009088979, 'samples': 796032, 'steps': 4145, 'loss/train': 2.225456476211548} 11/06/2021 21:50:35 - INFO - __main__ - Step 4147: {'lr': 0.0004997406593101373, 'samples': 796224, 'steps': 4146, 'loss/train': 1.6128720045089722} 11/06/2021 21:50:36 - INFO - __main__ - Step 4148: {'lr': 0.0004997404175988474, 'samples': 796416, 'steps': 4147, 'loss/train': 1.3797961473464966} 11/06/2021 21:50:37 - INFO - __main__ - Step 4149: {'lr': 0.0004997401757750282, 'samples': 796608, 'steps': 4148, 'loss/train': 2.3454394340515137} 11/06/2021 21:50:37 - INFO - __main__ - Step 4150: {'lr': 0.00049973993383868, 'samples': 796800, 'steps': 4149, 'loss/train': 1.4798073768615723} 11/06/2021 21:50:37 - INFO - __main__ - Step 4151: {'lr': 0.0004997396917898029, 'samples': 796992, 'steps': 4150, 'loss/train': 1.8693385124206543} 11/06/2021 21:50:38 - INFO - __main__ - Step 4152: {'lr': 0.0004997394496283969, 'samples': 797184, 'steps': 4151, 'loss/train': 1.895919919013977} 11/06/2021 21:50:39 - INFO - __main__ - Step 4153: {'lr': 0.0004997392073544622, 'samples': 797376, 'steps': 4152, 'loss/train': 2.0135254859924316} 11/06/2021 21:50:39 - INFO - __main__ - Step 4154: {'lr': 0.0004997389649679987, 'samples': 797568, 'steps': 4153, 'loss/train': 1.9823795557022095} 11/06/2021 21:50:39 - INFO - __main__ - Step 4155: {'lr': 0.0004997387224690068, 'samples': 797760, 'steps': 4154, 'loss/train': 2.079752206802368} 11/06/2021 21:50:40 - INFO - __main__ - Step 4156: {'lr': 0.0004997384798574865, 'samples': 797952, 'steps': 4155, 'loss/train': 1.7831960916519165} 11/06/2021 21:50:40 - INFO - __main__ - Step 4157: {'lr': 0.0004997382371334379, 'samples': 798144, 'steps': 4156, 'loss/train': 1.613570213317871} 11/06/2021 21:50:41 - INFO - __main__ - Step 4158: {'lr': 0.0004997379942968611, 'samples': 798336, 'steps': 4157, 'loss/train': 2.2602744102478027} 11/06/2021 21:50:41 - INFO - __main__ - Step 4159: {'lr': 0.0004997377513477562, 'samples': 798528, 'steps': 4158, 'loss/train': 1.372780203819275} 11/06/2021 21:50:42 - INFO - __main__ - Step 4160: {'lr': 0.0004997375082861234, 'samples': 798720, 'steps': 4159, 'loss/train': 0.5014187693595886} 11/06/2021 21:50:42 - INFO - __main__ - Step 4161: {'lr': 0.0004997372651119626, 'samples': 798912, 'steps': 4160, 'loss/train': 2.1681416034698486} 11/06/2021 21:50:42 - INFO - __main__ - Step 4162: {'lr': 0.0004997370218252741, 'samples': 799104, 'steps': 4161, 'loss/train': 1.6607433557510376} 11/06/2021 21:50:44 - INFO - __main__ - Step 4163: {'lr': 0.000499736778426058, 'samples': 799296, 'steps': 4162, 'loss/train': 1.9761664867401123} 11/06/2021 21:50:44 - INFO - __main__ - Step 4164: {'lr': 0.0004997365349143142, 'samples': 799488, 'steps': 4163, 'loss/train': 2.0288639068603516} 11/06/2021 21:50:44 - INFO - __main__ - Step 4165: {'lr': 0.0004997362912900432, 'samples': 799680, 'steps': 4164, 'loss/train': 2.2376766204833984} 11/06/2021 21:50:45 - INFO - __main__ - Step 4166: {'lr': 0.0004997360475532447, 'samples': 799872, 'steps': 4165, 'loss/train': 2.014327049255371} 11/06/2021 21:50:45 - INFO - __main__ - Step 4167: {'lr': 0.000499735803703919, 'samples': 800064, 'steps': 4166, 'loss/train': 2.371185302734375} 11/06/2021 21:50:46 - INFO - __main__ - Step 4168: {'lr': 0.0004997355597420663, 'samples': 800256, 'steps': 4167, 'loss/train': 1.7514441013336182} 11/06/2021 21:50:46 - INFO - __main__ - Step 4169: {'lr': 0.0004997353156676866, 'samples': 800448, 'steps': 4168, 'loss/train': 2.4108896255493164} 11/06/2021 21:50:47 - INFO - __main__ - Step 4170: {'lr': 0.0004997350714807799, 'samples': 800640, 'steps': 4169, 'loss/train': 1.9724844694137573} 11/06/2021 21:50:47 - INFO - __main__ - Step 4171: {'lr': 0.0004997348271813466, 'samples': 800832, 'steps': 4170, 'loss/train': 2.2488949298858643} 11/06/2021 21:50:47 - INFO - __main__ - Step 4172: {'lr': 0.0004997345827693865, 'samples': 801024, 'steps': 4171, 'loss/train': 2.0820884704589844} 11/06/2021 21:50:48 - INFO - __main__ - Step 4173: {'lr': 0.0004997343382448999, 'samples': 801216, 'steps': 4172, 'loss/train': 1.9005787372589111} 11/06/2021 21:50:49 - INFO - __main__ - Step 4174: {'lr': 0.0004997340936078869, 'samples': 801408, 'steps': 4173, 'loss/train': 2.1329050064086914} 11/06/2021 21:50:49 - INFO - __main__ - Step 4175: {'lr': 0.0004997338488583475, 'samples': 801600, 'steps': 4174, 'loss/train': 2.015869617462158} 11/06/2021 21:50:50 - INFO - __main__ - Step 4176: {'lr': 0.000499733603996282, 'samples': 801792, 'steps': 4175, 'loss/train': 1.6287065744400024} 11/06/2021 21:50:50 - INFO - __main__ - Step 4177: {'lr': 0.0004997333590216902, 'samples': 801984, 'steps': 4176, 'loss/train': 2.286756753921509} 11/06/2021 21:50:50 - INFO - __main__ - Step 4178: {'lr': 0.0004997331139345725, 'samples': 802176, 'steps': 4177, 'loss/train': 2.0928821563720703} 11/06/2021 21:50:51 - INFO - __main__ - Step 4179: {'lr': 0.000499732868734929, 'samples': 802368, 'steps': 4178, 'loss/train': 1.4197605848312378} 11/06/2021 21:50:52 - INFO - __main__ - Step 4180: {'lr': 0.0004997326234227596, 'samples': 802560, 'steps': 4179, 'loss/train': 1.9068725109100342} 11/06/2021 21:50:52 - INFO - __main__ - Step 4181: {'lr': 0.0004997323779980646, 'samples': 802752, 'steps': 4180, 'loss/train': 0.9633380174636841} 11/06/2021 21:50:52 - INFO - __main__ - Step 4182: {'lr': 0.0004997321324608441, 'samples': 802944, 'steps': 4181, 'loss/train': 2.2076642513275146} 11/06/2021 21:50:53 - INFO - __main__ - Step 4183: {'lr': 0.0004997318868110981, 'samples': 803136, 'steps': 4182, 'loss/train': 2.6250104904174805} 11/06/2021 21:50:54 - INFO - __main__ - Step 4184: {'lr': 0.0004997316410488267, 'samples': 803328, 'steps': 4183, 'loss/train': 2.3980045318603516} 11/06/2021 21:50:54 - INFO - __main__ - Step 4185: {'lr': 0.0004997313951740301, 'samples': 803520, 'steps': 4184, 'loss/train': 1.4753458499908447} 11/06/2021 21:50:54 - INFO - __main__ - Step 4186: {'lr': 0.0004997311491867083, 'samples': 803712, 'steps': 4185, 'loss/train': 1.5125452280044556} 11/06/2021 21:50:55 - INFO - __main__ - Step 4187: {'lr': 0.0004997309030868617, 'samples': 803904, 'steps': 4186, 'loss/train': 1.2969268560409546} 11/06/2021 21:50:55 - INFO - __main__ - Step 4188: {'lr': 0.0004997306568744901, 'samples': 804096, 'steps': 4187, 'loss/train': 2.0338566303253174} 11/06/2021 21:50:56 - INFO - __main__ - Step 4189: {'lr': 0.0004997304105495938, 'samples': 804288, 'steps': 4188, 'loss/train': 1.7441582679748535} 11/06/2021 21:50:57 - INFO - __main__ - Step 4190: {'lr': 0.0004997301641121727, 'samples': 804480, 'steps': 4189, 'loss/train': 2.200920581817627} 11/06/2021 21:50:57 - INFO - __main__ - Step 4191: {'lr': 0.0004997299175622271, 'samples': 804672, 'steps': 4190, 'loss/train': 2.2595231533050537} 11/06/2021 21:50:57 - INFO - __main__ - Step 4192: {'lr': 0.000499729670899757, 'samples': 804864, 'steps': 4191, 'loss/train': 2.5529255867004395} 11/06/2021 21:50:58 - INFO - __main__ - Step 4193: {'lr': 0.0004997294241247627, 'samples': 805056, 'steps': 4192, 'loss/train': 1.6093616485595703} 11/06/2021 21:50:59 - INFO - __main__ - Step 4194: {'lr': 0.0004997291772372441, 'samples': 805248, 'steps': 4193, 'loss/train': 1.4874943494796753} 11/06/2021 21:50:59 - INFO - __main__ - Step 4195: {'lr': 0.0004997289302372014, 'samples': 805440, 'steps': 4194, 'loss/train': 2.0579819679260254} 11/06/2021 21:51:00 - INFO - __main__ - Step 4196: {'lr': 0.0004997286831246347, 'samples': 805632, 'steps': 4195, 'loss/train': 2.4122848510742188} 11/06/2021 21:51:00 - INFO - __main__ - Step 4197: {'lr': 0.0004997284358995441, 'samples': 805824, 'steps': 4196, 'loss/train': 2.2387192249298096} 11/06/2021 21:51:00 - INFO - __main__ - Step 4198: {'lr': 0.0004997281885619297, 'samples': 806016, 'steps': 4197, 'loss/train': 1.6527743339538574} 11/06/2021 21:51:01 - INFO - __main__ - Step 4199: {'lr': 0.0004997279411117916, 'samples': 806208, 'steps': 4198, 'loss/train': 0.3961840569972992} 11/06/2021 21:51:02 - INFO - __main__ - Step 4200: {'lr': 0.00049972769354913, 'samples': 806400, 'steps': 4199, 'loss/train': 1.9782782793045044} 11/06/2021 21:51:02 - INFO - __main__ - Step 4201: {'lr': 0.0004997274458739449, 'samples': 806592, 'steps': 4200, 'loss/train': 2.000523328781128} 11/06/2021 21:51:02 - INFO - __main__ - Step 4202: {'lr': 0.0004997271980862366, 'samples': 806784, 'steps': 4201, 'loss/train': 2.5717737674713135} 11/06/2021 21:51:03 - INFO - __main__ - Step 4203: {'lr': 0.000499726950186005, 'samples': 806976, 'steps': 4202, 'loss/train': 2.567746639251709} 11/06/2021 21:51:03 - INFO - __main__ - Step 4204: {'lr': 0.0004997267021732502, 'samples': 807168, 'steps': 4203, 'loss/train': 2.4571382999420166} 11/06/2021 21:51:04 - INFO - __main__ - Step 4205: {'lr': 0.0004997264540479724, 'samples': 807360, 'steps': 4204, 'loss/train': 1.8436121940612793} 11/06/2021 21:51:05 - INFO - __main__ - Step 4206: {'lr': 0.0004997262058101719, 'samples': 807552, 'steps': 4205, 'loss/train': 2.192099094390869} 11/06/2021 21:51:05 - INFO - __main__ - Step 4207: {'lr': 0.0004997259574598485, 'samples': 807744, 'steps': 4206, 'loss/train': 2.6766111850738525} 11/06/2021 21:51:05 - INFO - __main__ - Step 4208: {'lr': 0.0004997257089970024, 'samples': 807936, 'steps': 4207, 'loss/train': 2.0918617248535156} 11/06/2021 21:51:06 - INFO - __main__ - Step 4209: {'lr': 0.0004997254604216338, 'samples': 808128, 'steps': 4208, 'loss/train': 2.2900002002716064} 11/06/2021 21:51:07 - INFO - __main__ - Step 4210: {'lr': 0.0004997252117337428, 'samples': 808320, 'steps': 4209, 'loss/train': 1.7111936807632446} 11/06/2021 21:51:07 - INFO - __main__ - Step 4211: {'lr': 0.0004997249629333294, 'samples': 808512, 'steps': 4210, 'loss/train': 2.1886565685272217} 11/06/2021 21:51:07 - INFO - __main__ - Step 4212: {'lr': 0.0004997247140203939, 'samples': 808704, 'steps': 4211, 'loss/train': 2.0272629261016846} 11/06/2021 21:51:08 - INFO - __main__ - Step 4213: {'lr': 0.0004997244649949362, 'samples': 808896, 'steps': 4212, 'loss/train': 1.792849063873291} 11/06/2021 21:51:08 - INFO - __main__ - Step 4214: {'lr': 0.0004997242158569564, 'samples': 809088, 'steps': 4213, 'loss/train': 1.8342564105987549} 11/06/2021 21:51:09 - INFO - __main__ - Step 4215: {'lr': 0.0004997239666064549, 'samples': 809280, 'steps': 4214, 'loss/train': 2.516932249069214} 11/06/2021 21:51:09 - INFO - __main__ - Step 4216: {'lr': 0.0004997237172434316, 'samples': 809472, 'steps': 4215, 'loss/train': 1.893647313117981} 11/06/2021 21:51:10 - INFO - __main__ - Step 4217: {'lr': 0.0004997234677678867, 'samples': 809664, 'steps': 4216, 'loss/train': 2.4590060710906982} 11/06/2021 21:51:10 - INFO - __main__ - Step 4218: {'lr': 0.0004997232181798201, 'samples': 809856, 'steps': 4217, 'loss/train': 1.6287667751312256} 11/06/2021 21:51:10 - INFO - __main__ - Step 4219: {'lr': 0.0004997229684792322, 'samples': 810048, 'steps': 4218, 'loss/train': 2.320136070251465} 11/06/2021 21:51:12 - INFO - __main__ - Step 4220: {'lr': 0.000499722718666123, 'samples': 810240, 'steps': 4219, 'loss/train': 2.140002965927124} 11/06/2021 21:51:12 - INFO - __main__ - Step 4221: {'lr': 0.0004997224687404926, 'samples': 810432, 'steps': 4220, 'loss/train': 2.0114235877990723} 11/06/2021 21:51:12 - INFO - __main__ - Step 4222: {'lr': 0.0004997222187023409, 'samples': 810624, 'steps': 4221, 'loss/train': 1.9719492197036743} 11/06/2021 21:51:13 - INFO - __main__ - Step 4223: {'lr': 0.0004997219685516684, 'samples': 810816, 'steps': 4222, 'loss/train': 2.0050837993621826} 11/06/2021 21:51:13 - INFO - __main__ - Step 4224: {'lr': 0.000499721718288475, 'samples': 811008, 'steps': 4223, 'loss/train': 1.0554298162460327} 11/06/2021 21:51:14 - INFO - __main__ - Step 4225: {'lr': 0.0004997214679127609, 'samples': 811200, 'steps': 4224, 'loss/train': 2.309093713760376} 11/06/2021 21:51:14 - INFO - __main__ - Step 4226: {'lr': 0.000499721217424526, 'samples': 811392, 'steps': 4225, 'loss/train': 1.7864500284194946} 11/06/2021 21:51:15 - INFO - __main__ - Step 4227: {'lr': 0.0004997209668237707, 'samples': 811584, 'steps': 4226, 'loss/train': 2.3149712085723877} 11/06/2021 21:51:15 - INFO - __main__ - Step 4228: {'lr': 0.0004997207161104951, 'samples': 811776, 'steps': 4227, 'loss/train': 2.2613799571990967} 11/06/2021 21:51:15 - INFO - __main__ - Step 4229: {'lr': 0.0004997204652846991, 'samples': 811968, 'steps': 4228, 'loss/train': 1.9725654125213623} 11/06/2021 21:51:16 - INFO - __main__ - Step 4230: {'lr': 0.0004997202143463828, 'samples': 812160, 'steps': 4229, 'loss/train': 2.0676584243774414} 11/06/2021 21:51:17 - INFO - __main__ - Step 4231: {'lr': 0.0004997199632955464, 'samples': 812352, 'steps': 4230, 'loss/train': 2.0852909088134766} 11/06/2021 21:51:17 - INFO - __main__ - Step 4232: {'lr': 0.0004997197121321903, 'samples': 812544, 'steps': 4231, 'loss/train': 2.0031850337982178} 11/06/2021 21:51:18 - INFO - __main__ - Step 4233: {'lr': 0.0004997194608563142, 'samples': 812736, 'steps': 4232, 'loss/train': 1.9812066555023193} 11/06/2021 21:51:18 - INFO - __main__ - Step 4234: {'lr': 0.0004997192094679183, 'samples': 812928, 'steps': 4233, 'loss/train': 2.6864309310913086} 11/06/2021 21:51:18 - INFO - __main__ - Step 4235: {'lr': 0.0004997189579670028, 'samples': 813120, 'steps': 4234, 'loss/train': 2.032951831817627} 11/06/2021 21:51:19 - INFO - __main__ - Step 4236: {'lr': 0.0004997187063535679, 'samples': 813312, 'steps': 4235, 'loss/train': 2.289562463760376} 11/06/2021 21:51:20 - INFO - __main__ - Step 4237: {'lr': 0.0004997184546276135, 'samples': 813504, 'steps': 4236, 'loss/train': 1.5230915546417236} 11/06/2021 21:51:20 - INFO - __main__ - Step 4238: {'lr': 0.0004997182027891399, 'samples': 813696, 'steps': 4237, 'loss/train': 2.07342529296875} 11/06/2021 21:51:21 - INFO - __main__ - Step 4239: {'lr': 0.000499717950838147, 'samples': 813888, 'steps': 4238, 'loss/train': 1.0521665811538696} 11/06/2021 21:51:21 - INFO - __main__ - Step 4240: {'lr': 0.0004997176987746352, 'samples': 814080, 'steps': 4239, 'loss/train': 2.1802866458892822} 11/06/2021 21:51:21 - INFO - __main__ - Step 4241: {'lr': 0.0004997174465986043, 'samples': 814272, 'steps': 4240, 'loss/train': 1.9899753332138062} 11/06/2021 21:51:23 - INFO - __main__ - Step 4242: {'lr': 0.0004997171943100547, 'samples': 814464, 'steps': 4241, 'loss/train': 1.9145240783691406} 11/06/2021 21:51:23 - INFO - __main__ - Step 4243: {'lr': 0.0004997169419089863, 'samples': 814656, 'steps': 4242, 'loss/train': 2.2730929851531982} 11/06/2021 21:51:23 - INFO - __main__ - Step 4244: {'lr': 0.0004997166893953994, 'samples': 814848, 'steps': 4243, 'loss/train': 2.227478504180908} 11/06/2021 21:51:24 - INFO - __main__ - Step 4245: {'lr': 0.000499716436769294, 'samples': 815040, 'steps': 4244, 'loss/train': 1.7130650281906128} 11/06/2021 21:51:24 - INFO - __main__ - Step 4246: {'lr': 0.0004997161840306701, 'samples': 815232, 'steps': 4245, 'loss/train': 2.1107656955718994} 11/06/2021 21:51:25 - INFO - __main__ - Step 4247: {'lr': 0.0004997159311795281, 'samples': 815424, 'steps': 4246, 'loss/train': 2.0193605422973633} 11/06/2021 21:51:25 - INFO - __main__ - Step 4248: {'lr': 0.0004997156782158679, 'samples': 815616, 'steps': 4247, 'loss/train': 2.169619083404541} 11/06/2021 21:51:26 - INFO - __main__ - Step 4249: {'lr': 0.0004997154251396896, 'samples': 815808, 'steps': 4248, 'loss/train': 1.5952482223510742} 11/06/2021 21:51:26 - INFO - __main__ - Step 4250: {'lr': 0.0004997151719509935, 'samples': 816000, 'steps': 4249, 'loss/train': 2.200782060623169} 11/06/2021 21:51:26 - INFO - __main__ - Step 4251: {'lr': 0.0004997149186497795, 'samples': 816192, 'steps': 4250, 'loss/train': 2.353931188583374} 11/06/2021 21:51:27 - INFO - __main__ - Step 4252: {'lr': 0.0004997146652360478, 'samples': 816384, 'steps': 4251, 'loss/train': 1.8407737016677856} 11/06/2021 21:51:28 - INFO - __main__ - Step 4253: {'lr': 0.0004997144117097986, 'samples': 816576, 'steps': 4252, 'loss/train': 1.8652644157409668} 11/06/2021 21:51:28 - INFO - __main__ - Step 4254: {'lr': 0.0004997141580710318, 'samples': 816768, 'steps': 4253, 'loss/train': 2.0572428703308105} 11/06/2021 21:51:28 - INFO - __main__ - Step 4255: {'lr': 0.0004997139043197478, 'samples': 816960, 'steps': 4254, 'loss/train': 1.7917412519454956} 11/06/2021 21:51:29 - INFO - __main__ - Step 4256: {'lr': 0.0004997136504559465, 'samples': 817152, 'steps': 4255, 'loss/train': 1.9230844974517822} 11/06/2021 21:51:30 - INFO - __main__ - Step 4257: {'lr': 0.0004997133964796281, 'samples': 817344, 'steps': 4256, 'loss/train': 2.0481159687042236} 11/06/2021 21:51:30 - INFO - __main__ - Step 4258: {'lr': 0.0004997131423907927, 'samples': 817536, 'steps': 4257, 'loss/train': 2.003450632095337} 11/06/2021 21:51:31 - INFO - __main__ - Step 4259: {'lr': 0.0004997128881894404, 'samples': 817728, 'steps': 4258, 'loss/train': 2.5174543857574463} 11/06/2021 21:51:31 - INFO - __main__ - Step 4260: {'lr': 0.0004997126338755714, 'samples': 817920, 'steps': 4259, 'loss/train': 2.247880697250366} 11/06/2021 21:51:31 - INFO - __main__ - Step 4261: {'lr': 0.0004997123794491856, 'samples': 818112, 'steps': 4260, 'loss/train': 2.505103349685669} 11/06/2021 21:51:32 - INFO - __main__ - Step 4262: {'lr': 0.0004997121249102834, 'samples': 818304, 'steps': 4261, 'loss/train': 1.9285527467727661} 11/06/2021 21:51:33 - INFO - __main__ - Step 4263: {'lr': 0.0004997118702588647, 'samples': 818496, 'steps': 4262, 'loss/train': 2.225656509399414} 11/06/2021 21:51:33 - INFO - __main__ - Step 4264: {'lr': 0.0004997116154949297, 'samples': 818688, 'steps': 4263, 'loss/train': 1.5641311407089233} 11/06/2021 21:51:34 - INFO - __main__ - Step 4265: {'lr': 0.0004997113606184785, 'samples': 818880, 'steps': 4264, 'loss/train': 1.9400560855865479} 11/06/2021 21:51:34 - INFO - __main__ - Step 4266: {'lr': 0.0004997111056295111, 'samples': 819072, 'steps': 4265, 'loss/train': 2.0672645568847656} 11/06/2021 21:51:35 - INFO - __main__ - Step 4267: {'lr': 0.0004997108505280279, 'samples': 819264, 'steps': 4266, 'loss/train': 2.7679026126861572} 11/06/2021 21:51:35 - INFO - __main__ - Step 4268: {'lr': 0.0004997105953140288, 'samples': 819456, 'steps': 4267, 'loss/train': 1.6954803466796875} 11/06/2021 21:51:36 - INFO - __main__ - Step 4269: {'lr': 0.0004997103399875139, 'samples': 819648, 'steps': 4268, 'loss/train': 1.9309132099151611} 11/06/2021 21:51:36 - INFO - __main__ - Step 4270: {'lr': 0.0004997100845484834, 'samples': 819840, 'steps': 4269, 'loss/train': 1.524571418762207} 11/06/2021 21:51:37 - INFO - __main__ - Step 4271: {'lr': 0.0004997098289969374, 'samples': 820032, 'steps': 4270, 'loss/train': 1.9430183172225952} 11/06/2021 21:51:37 - INFO - __main__ - Step 4272: {'lr': 0.0004997095733328761, 'samples': 820224, 'steps': 4271, 'loss/train': 2.232881784439087} 11/06/2021 21:51:38 - INFO - __main__ - Step 4273: {'lr': 0.0004997093175562994, 'samples': 820416, 'steps': 4272, 'loss/train': 2.1205177307128906} 11/06/2021 21:51:38 - INFO - __main__ - Step 4274: {'lr': 0.0004997090616672076, 'samples': 820608, 'steps': 4273, 'loss/train': 2.2687668800354004} 11/06/2021 21:51:39 - INFO - __main__ - Step 4275: {'lr': 0.0004997088056656006, 'samples': 820800, 'steps': 4274, 'loss/train': 2.34332275390625} 11/06/2021 21:51:39 - INFO - __main__ - Step 4276: {'lr': 0.0004997085495514788, 'samples': 820992, 'steps': 4275, 'loss/train': 2.0807552337646484} 11/06/2021 21:51:39 - INFO - __main__ - Step 4277: {'lr': 0.0004997082933248421, 'samples': 821184, 'steps': 4276, 'loss/train': 2.0649728775024414} 11/06/2021 21:51:40 - INFO - __main__ - Step 4278: {'lr': 0.0004997080369856907, 'samples': 821376, 'steps': 4277, 'loss/train': 2.0820047855377197} 11/06/2021 21:51:41 - INFO - __main__ - Step 4279: {'lr': 0.0004997077805340248, 'samples': 821568, 'steps': 4278, 'loss/train': 1.6447721719741821} 11/06/2021 21:51:41 - INFO - __main__ - Step 4280: {'lr': 0.0004997075239698445, 'samples': 821760, 'steps': 4279, 'loss/train': 1.9901349544525146} 11/06/2021 21:51:41 - INFO - __main__ - Step 4281: {'lr': 0.0004997072672931497, 'samples': 821952, 'steps': 4280, 'loss/train': 2.4467787742614746} 11/06/2021 21:51:42 - INFO - __main__ - Step 4282: {'lr': 0.0004997070105039407, 'samples': 822144, 'steps': 4281, 'loss/train': 1.6936836242675781} 11/06/2021 21:51:43 - INFO - __main__ - Step 4283: {'lr': 0.0004997067536022176, 'samples': 822336, 'steps': 4282, 'loss/train': 2.122401475906372} 11/06/2021 21:51:43 - INFO - __main__ - Step 4284: {'lr': 0.0004997064965879804, 'samples': 822528, 'steps': 4283, 'loss/train': 2.0260608196258545} 11/06/2021 21:51:44 - INFO - __main__ - Step 4285: {'lr': 0.0004997062394612293, 'samples': 822720, 'steps': 4284, 'loss/train': 2.1867868900299072} 11/06/2021 21:51:44 - INFO - __main__ - Step 4286: {'lr': 0.0004997059822219645, 'samples': 822912, 'steps': 4285, 'loss/train': 2.057260274887085} 11/06/2021 21:51:44 - INFO - __main__ - Step 4287: {'lr': 0.000499705724870186, 'samples': 823104, 'steps': 4286, 'loss/train': 2.2277965545654297} 11/06/2021 21:51:45 - INFO - __main__ - Step 4288: {'lr': 0.0004997054674058941, 'samples': 823296, 'steps': 4287, 'loss/train': 2.590022325515747} 11/06/2021 21:51:46 - INFO - __main__ - Step 4289: {'lr': 0.0004997052098290886, 'samples': 823488, 'steps': 4288, 'loss/train': 2.015610694885254} 11/06/2021 21:51:46 - INFO - __main__ - Step 4290: {'lr': 0.0004997049521397698, 'samples': 823680, 'steps': 4289, 'loss/train': 2.311326503753662} 11/06/2021 21:51:46 - INFO - __main__ - Step 4291: {'lr': 0.0004997046943379379, 'samples': 823872, 'steps': 4290, 'loss/train': 1.8215492963790894} 11/06/2021 21:51:47 - INFO - __main__ - Step 4292: {'lr': 0.0004997044364235928, 'samples': 824064, 'steps': 4291, 'loss/train': 1.7921706438064575} 11/06/2021 21:51:47 - INFO - __main__ - Step 4293: {'lr': 0.0004997041783967348, 'samples': 824256, 'steps': 4292, 'loss/train': 2.276956558227539} 11/06/2021 21:51:48 - INFO - __main__ - Step 4294: {'lr': 0.0004997039202573639, 'samples': 824448, 'steps': 4293, 'loss/train': 1.8650037050247192} 11/06/2021 21:51:48 - INFO - __main__ - Step 4295: {'lr': 0.0004997036620054803, 'samples': 824640, 'steps': 4294, 'loss/train': 2.3925178050994873} 11/06/2021 21:51:49 - INFO - __main__ - Step 4296: {'lr': 0.0004997034036410841, 'samples': 824832, 'steps': 4295, 'loss/train': 1.9569441080093384} 11/06/2021 21:51:49 - INFO - __main__ - Step 4297: {'lr': 0.0004997031451641754, 'samples': 825024, 'steps': 4296, 'loss/train': 1.8525234460830688} 11/06/2021 21:51:49 - INFO - __main__ - Step 4298: {'lr': 0.0004997028865747542, 'samples': 825216, 'steps': 4297, 'loss/train': 2.4801597595214844} 11/06/2021 21:51:51 - INFO - __main__ - Step 4299: {'lr': 0.0004997026278728209, 'samples': 825408, 'steps': 4298, 'loss/train': 2.3479583263397217} 11/06/2021 21:51:51 - INFO - __main__ - Step 4300: {'lr': 0.0004997023690583753, 'samples': 825600, 'steps': 4299, 'loss/train': 1.8593815565109253} 11/06/2021 21:51:51 - INFO - __main__ - Step 4301: {'lr': 0.0004997021101314179, 'samples': 825792, 'steps': 4300, 'loss/train': 2.492985248565674} 11/06/2021 21:51:52 - INFO - __main__ - Step 4302: {'lr': 0.0004997018510919483, 'samples': 825984, 'steps': 4301, 'loss/train': 1.1239334344863892} 11/06/2021 21:51:52 - INFO - __main__ - Step 4303: {'lr': 0.0004997015919399671, 'samples': 826176, 'steps': 4302, 'loss/train': 2.068119525909424} 11/06/2021 21:51:53 - INFO - __main__ - Step 4304: {'lr': 0.0004997013326754742, 'samples': 826368, 'steps': 4303, 'loss/train': 2.1923468112945557} 11/06/2021 21:51:53 - INFO - __main__ - Step 4305: {'lr': 0.0004997010732984696, 'samples': 826560, 'steps': 4304, 'loss/train': 2.267517566680908} 11/06/2021 21:51:54 - INFO - __main__ - Step 4306: {'lr': 0.0004997008138089536, 'samples': 826752, 'steps': 4305, 'loss/train': 2.289822816848755} 11/06/2021 21:51:54 - INFO - __main__ - Step 4307: {'lr': 0.0004997005542069263, 'samples': 826944, 'steps': 4306, 'loss/train': 2.516078233718872} 11/06/2021 21:51:54 - INFO - __main__ - Step 4308: {'lr': 0.0004997002944923878, 'samples': 827136, 'steps': 4307, 'loss/train': 2.071676731109619} 11/06/2021 21:51:55 - INFO - __main__ - Step 4309: {'lr': 0.0004997000346653381, 'samples': 827328, 'steps': 4308, 'loss/train': 2.0077381134033203} 11/06/2021 21:51:56 - INFO - __main__ - Step 4310: {'lr': 0.0004996997747257775, 'samples': 827520, 'steps': 4309, 'loss/train': 2.347663640975952} 11/06/2021 21:51:56 - INFO - __main__ - Step 4311: {'lr': 0.000499699514673706, 'samples': 827712, 'steps': 4310, 'loss/train': 1.8493750095367432} 11/06/2021 21:51:56 - INFO - __main__ - Step 4312: {'lr': 0.0004996992545091239, 'samples': 827904, 'steps': 4311, 'loss/train': 1.9490984678268433} 11/06/2021 21:51:57 - INFO - __main__ - Step 4313: {'lr': 0.000499698994232031, 'samples': 828096, 'steps': 4312, 'loss/train': 2.2209248542785645} 11/06/2021 21:51:57 - INFO - __main__ - Step 4314: {'lr': 0.0004996987338424276, 'samples': 828288, 'steps': 4313, 'loss/train': 2.120884418487549} 11/06/2021 21:51:58 - INFO - __main__ - Step 4315: {'lr': 0.0004996984733403138, 'samples': 828480, 'steps': 4314, 'loss/train': 2.1321933269500732} 11/06/2021 21:51:59 - INFO - __main__ - Step 4316: {'lr': 0.0004996982127256898, 'samples': 828672, 'steps': 4315, 'loss/train': 2.0887527465820312} 11/06/2021 21:51:59 - INFO - __main__ - Step 4317: {'lr': 0.0004996979519985556, 'samples': 828864, 'steps': 4316, 'loss/train': 2.1263225078582764} 11/06/2021 21:51:59 - INFO - __main__ - Step 4318: {'lr': 0.0004996976911589114, 'samples': 829056, 'steps': 4317, 'loss/train': 1.8161555528640747} 11/06/2021 21:52:00 - INFO - __main__ - Step 4319: {'lr': 0.0004996974302067572, 'samples': 829248, 'steps': 4318, 'loss/train': 1.67631995677948} 11/06/2021 21:52:01 - INFO - __main__ - Step 4320: {'lr': 0.0004996971691420931, 'samples': 829440, 'steps': 4319, 'loss/train': 1.9163763523101807} 11/06/2021 21:52:01 - INFO - __main__ - Step 4321: {'lr': 0.0004996969079649195, 'samples': 829632, 'steps': 4320, 'loss/train': 1.8467737436294556} 11/06/2021 21:52:01 - INFO - __main__ - Step 4322: {'lr': 0.0004996966466752362, 'samples': 829824, 'steps': 4321, 'loss/train': 2.3159868717193604} 11/06/2021 21:52:02 - INFO - __main__ - Step 4323: {'lr': 0.0004996963852730436, 'samples': 830016, 'steps': 4322, 'loss/train': 1.9152145385742188} 11/06/2021 21:52:02 - INFO - __main__ - Step 4324: {'lr': 0.0004996961237583415, 'samples': 830208, 'steps': 4323, 'loss/train': 2.11564302444458} 11/06/2021 21:52:03 - INFO - __main__ - Step 4325: {'lr': 0.0004996958621311302, 'samples': 830400, 'steps': 4324, 'loss/train': 2.3892369270324707} 11/06/2021 21:52:03 - INFO - __main__ - Step 4326: {'lr': 0.00049969560039141, 'samples': 830592, 'steps': 4325, 'loss/train': 1.4963157176971436} 11/06/2021 21:52:04 - INFO - __main__ - Step 4327: {'lr': 0.0004996953385391806, 'samples': 830784, 'steps': 4326, 'loss/train': 2.2413148880004883} 11/06/2021 21:52:04 - INFO - __main__ - Step 4328: {'lr': 0.0004996950765744424, 'samples': 830976, 'steps': 4327, 'loss/train': 1.8104511499404907} 11/06/2021 21:52:04 - INFO - __main__ - Step 4329: {'lr': 0.0004996948144971953, 'samples': 831168, 'steps': 4328, 'loss/train': 1.7227517366409302} 11/06/2021 21:52:05 - INFO - __main__ - Step 4330: {'lr': 0.0004996945523074398, 'samples': 831360, 'steps': 4329, 'loss/train': 1.6873306035995483} 11/06/2021 21:52:06 - INFO - __main__ - Step 4331: {'lr': 0.0004996942900051757, 'samples': 831552, 'steps': 4330, 'loss/train': 2.224438190460205} 11/06/2021 21:52:06 - INFO - __main__ - Step 4332: {'lr': 0.0004996940275904031, 'samples': 831744, 'steps': 4331, 'loss/train': 2.1473422050476074} 11/06/2021 21:52:07 - INFO - __main__ - Step 4333: {'lr': 0.0004996937650631224, 'samples': 831936, 'steps': 4332, 'loss/train': 2.5668904781341553} 11/06/2021 21:52:07 - INFO - __main__ - Step 4334: {'lr': 0.0004996935024233335, 'samples': 832128, 'steps': 4333, 'loss/train': 2.192603349685669} 11/06/2021 21:52:08 - INFO - __main__ - Step 4335: {'lr': 0.0004996932396710365, 'samples': 832320, 'steps': 4334, 'loss/train': 1.7712533473968506} 11/06/2021 21:52:08 - INFO - __main__ - Step 4336: {'lr': 0.0004996929768062316, 'samples': 832512, 'steps': 4335, 'loss/train': 1.558101773262024} 11/06/2021 21:52:09 - INFO - __main__ - Step 4337: {'lr': 0.0004996927138289189, 'samples': 832704, 'steps': 4336, 'loss/train': 2.441239833831787} 11/06/2021 21:52:09 - INFO - __main__ - Step 4338: {'lr': 0.0004996924507390985, 'samples': 832896, 'steps': 4337, 'loss/train': 2.24872088432312} 11/06/2021 21:52:09 - INFO - __main__ - Step 4339: {'lr': 0.0004996921875367705, 'samples': 833088, 'steps': 4338, 'loss/train': 2.0886828899383545} 11/06/2021 21:52:10 - INFO - __main__ - Step 4340: {'lr': 0.0004996919242219352, 'samples': 833280, 'steps': 4339, 'loss/train': 2.2574408054351807} 11/06/2021 21:52:11 - INFO - __main__ - Step 4341: {'lr': 0.0004996916607945925, 'samples': 833472, 'steps': 4340, 'loss/train': 1.8286736011505127} 11/06/2021 21:52:11 - INFO - __main__ - Step 4342: {'lr': 0.0004996913972547426, 'samples': 833664, 'steps': 4341, 'loss/train': 2.018918037414551} 11/06/2021 21:52:11 - INFO - __main__ - Step 4343: {'lr': 0.0004996911336023855, 'samples': 833856, 'steps': 4342, 'loss/train': 2.208406925201416} 11/06/2021 21:52:12 - INFO - __main__ - Step 4344: {'lr': 0.0004996908698375216, 'samples': 834048, 'steps': 4343, 'loss/train': 1.9386653900146484} 11/06/2021 21:52:12 - INFO - __main__ - Step 4345: {'lr': 0.0004996906059601507, 'samples': 834240, 'steps': 4344, 'loss/train': 1.8494505882263184} 11/06/2021 21:52:13 - INFO - __main__ - Step 4346: {'lr': 0.0004996903419702731, 'samples': 834432, 'steps': 4345, 'loss/train': 2.3450777530670166} 11/06/2021 21:52:13 - INFO - __main__ - Step 4347: {'lr': 0.0004996900778678889, 'samples': 834624, 'steps': 4346, 'loss/train': 2.0560898780822754} 11/06/2021 21:52:14 - INFO - __main__ - Step 4348: {'lr': 0.0004996898136529982, 'samples': 834816, 'steps': 4347, 'loss/train': 2.482868194580078} 11/06/2021 21:52:14 - INFO - __main__ - Step 4349: {'lr': 0.0004996895493256012, 'samples': 835008, 'steps': 4348, 'loss/train': 1.759129524230957} 11/06/2021 21:52:14 - INFO - __main__ - Step 4350: {'lr': 0.0004996892848856978, 'samples': 835200, 'steps': 4349, 'loss/train': 1.8137215375900269} 11/06/2021 21:52:16 - INFO - __main__ - Step 4351: {'lr': 0.0004996890203332883, 'samples': 835392, 'steps': 4350, 'loss/train': 1.9098490476608276} 11/06/2021 21:52:16 - INFO - __main__ - Step 4352: {'lr': 0.0004996887556683729, 'samples': 835584, 'steps': 4351, 'loss/train': 2.027623414993286} 11/06/2021 21:52:16 - INFO - __main__ - Step 4353: {'lr': 0.0004996884908909515, 'samples': 835776, 'steps': 4352, 'loss/train': 2.083651304244995} 11/06/2021 21:52:17 - INFO - __main__ - Step 4354: {'lr': 0.0004996882260010243, 'samples': 835968, 'steps': 4353, 'loss/train': 1.6562211513519287} 11/06/2021 21:52:17 - INFO - __main__ - Step 4355: {'lr': 0.0004996879609985915, 'samples': 836160, 'steps': 4354, 'loss/train': 1.846291422843933} 11/06/2021 21:52:18 - INFO - __main__ - Step 4356: {'lr': 0.0004996876958836532, 'samples': 836352, 'steps': 4355, 'loss/train': 2.066518783569336} 11/06/2021 21:52:19 - INFO - __main__ - Step 4357: {'lr': 0.0004996874306562093, 'samples': 836544, 'steps': 4356, 'loss/train': 2.2601191997528076} 11/06/2021 21:52:19 - INFO - __main__ - Step 4358: {'lr': 0.0004996871653162602, 'samples': 836736, 'steps': 4357, 'loss/train': 2.134087562561035} 11/06/2021 21:52:19 - INFO - __main__ - Step 4359: {'lr': 0.0004996868998638059, 'samples': 836928, 'steps': 4358, 'loss/train': 2.3234333992004395} 11/06/2021 21:52:20 - INFO - __main__ - Step 4360: {'lr': 0.0004996866342988467, 'samples': 837120, 'steps': 4359, 'loss/train': 2.3369786739349365} 11/06/2021 21:52:20 - INFO - __main__ - Step 4361: {'lr': 0.0004996863686213823, 'samples': 837312, 'steps': 4360, 'loss/train': 2.203856945037842} 11/06/2021 21:52:21 - INFO - __main__ - Step 4362: {'lr': 0.0004996861028314133, 'samples': 837504, 'steps': 4361, 'loss/train': 2.04062557220459} 11/06/2021 21:52:21 - INFO - __main__ - Step 4363: {'lr': 0.0004996858369289394, 'samples': 837696, 'steps': 4362, 'loss/train': 2.0130574703216553} 11/06/2021 21:52:22 - INFO - __main__ - Step 4364: {'lr': 0.000499685570913961, 'samples': 837888, 'steps': 4363, 'loss/train': 1.978306770324707} 11/06/2021 21:52:22 - INFO - __main__ - Step 4365: {'lr': 0.0004996853047864781, 'samples': 838080, 'steps': 4364, 'loss/train': 2.3143999576568604} 11/06/2021 21:52:22 - INFO - __main__ - Step 4366: {'lr': 0.0004996850385464909, 'samples': 838272, 'steps': 4365, 'loss/train': 2.2573633193969727} 11/06/2021 21:52:23 - INFO - __main__ - Step 4367: {'lr': 0.0004996847721939994, 'samples': 838464, 'steps': 4366, 'loss/train': 1.7779031991958618} 11/06/2021 21:52:24 - INFO - __main__ - Step 4368: {'lr': 0.0004996845057290039, 'samples': 838656, 'steps': 4367, 'loss/train': 2.1156513690948486} 11/06/2021 21:52:24 - INFO - __main__ - Step 4369: {'lr': 0.0004996842391515044, 'samples': 838848, 'steps': 4368, 'loss/train': 1.8346737623214722} 11/06/2021 21:52:25 - INFO - __main__ - Step 4370: {'lr': 0.000499683972461501, 'samples': 839040, 'steps': 4369, 'loss/train': 1.7656928300857544} 11/06/2021 21:52:25 - INFO - __main__ - Step 4371: {'lr': 0.0004996837056589938, 'samples': 839232, 'steps': 4370, 'loss/train': 1.4817529916763306} 11/06/2021 21:52:25 - INFO - __main__ - Step 4372: {'lr': 0.0004996834387439831, 'samples': 839424, 'steps': 4371, 'loss/train': 2.4186854362487793} 11/06/2021 21:52:26 - INFO - __main__ - Step 4373: {'lr': 0.0004996831717164689, 'samples': 839616, 'steps': 4372, 'loss/train': 2.3830738067626953} 11/06/2021 21:52:27 - INFO - __main__ - Step 4374: {'lr': 0.0004996829045764512, 'samples': 839808, 'steps': 4373, 'loss/train': 2.039133310317993} 11/06/2021 21:52:27 - INFO - __main__ - Step 4375: {'lr': 0.0004996826373239303, 'samples': 840000, 'steps': 4374, 'loss/train': 1.8050706386566162} 11/06/2021 21:52:27 - INFO - __main__ - Step 4376: {'lr': 0.0004996823699589062, 'samples': 840192, 'steps': 4375, 'loss/train': 2.005566120147705} 11/06/2021 21:52:28 - INFO - __main__ - Step 4377: {'lr': 0.0004996821024813791, 'samples': 840384, 'steps': 4376, 'loss/train': 2.3006296157836914} 11/06/2021 21:52:29 - INFO - __main__ - Step 4378: {'lr': 0.0004996818348913491, 'samples': 840576, 'steps': 4377, 'loss/train': 1.9584013223648071} 11/06/2021 21:52:29 - INFO - __main__ - Step 4379: {'lr': 0.0004996815671888163, 'samples': 840768, 'steps': 4378, 'loss/train': 2.180757522583008} 11/06/2021 21:52:29 - INFO - __main__ - Step 4380: {'lr': 0.000499681299373781, 'samples': 840960, 'steps': 4379, 'loss/train': 2.559288740158081} 11/06/2021 21:52:30 - INFO - __main__ - Step 4381: {'lr': 0.0004996810314462429, 'samples': 841152, 'steps': 4380, 'loss/train': 1.7387698888778687} 11/06/2021 21:52:30 - INFO - __main__ - Step 4382: {'lr': 0.0004996807634062025, 'samples': 841344, 'steps': 4381, 'loss/train': 2.247429847717285} 11/06/2021 21:52:31 - INFO - __main__ - Step 4383: {'lr': 0.0004996804952536599, 'samples': 841536, 'steps': 4382, 'loss/train': 2.5863077640533447} 11/06/2021 21:52:31 - INFO - __main__ - Step 4384: {'lr': 0.0004996802269886149, 'samples': 841728, 'steps': 4383, 'loss/train': 1.9979889392852783} 11/06/2021 21:52:32 - INFO - __main__ - Step 4385: {'lr': 0.0004996799586110681, 'samples': 841920, 'steps': 4384, 'loss/train': 2.161149501800537} 11/06/2021 21:52:32 - INFO - __main__ - Step 4386: {'lr': 0.0004996796901210192, 'samples': 842112, 'steps': 4385, 'loss/train': 2.0866355895996094} 11/06/2021 21:52:32 - INFO - __main__ - Step 4387: {'lr': 0.0004996794215184685, 'samples': 842304, 'steps': 4386, 'loss/train': 2.5882389545440674} 11/06/2021 21:52:33 - INFO - __main__ - Step 4388: {'lr': 0.0004996791528034161, 'samples': 842496, 'steps': 4387, 'loss/train': 1.5546870231628418} 11/06/2021 21:52:34 - INFO - __main__ - Step 4389: {'lr': 0.0004996788839758622, 'samples': 842688, 'steps': 4388, 'loss/train': 2.4196763038635254} 11/06/2021 21:52:34 - INFO - __main__ - Step 4390: {'lr': 0.0004996786150358068, 'samples': 842880, 'steps': 4389, 'loss/train': 1.6892547607421875} 11/06/2021 21:52:35 - INFO - __main__ - Step 4391: {'lr': 0.00049967834598325, 'samples': 843072, 'steps': 4390, 'loss/train': 2.1776304244995117} 11/06/2021 21:52:35 - INFO - __main__ - Step 4392: {'lr': 0.0004996780768181921, 'samples': 843264, 'steps': 4391, 'loss/train': 2.4177329540252686} 11/06/2021 21:52:35 - INFO - __main__ - Step 4393: {'lr': 0.0004996778075406331, 'samples': 843456, 'steps': 4392, 'loss/train': 1.8474143743515015} 11/06/2021 21:52:36 - INFO - __main__ - Step 4394: {'lr': 0.0004996775381505731, 'samples': 843648, 'steps': 4393, 'loss/train': 1.766103744506836} 11/06/2021 21:52:37 - INFO - __main__ - Step 4395: {'lr': 0.0004996772686480122, 'samples': 843840, 'steps': 4394, 'loss/train': 2.221249580383301} 11/06/2021 21:52:37 - INFO - __main__ - Step 4396: {'lr': 0.0004996769990329507, 'samples': 844032, 'steps': 4395, 'loss/train': 1.950240135192871} 11/06/2021 21:52:37 - INFO - __main__ - Step 4397: {'lr': 0.0004996767293053885, 'samples': 844224, 'steps': 4396, 'loss/train': 1.8675569295883179} 11/06/2021 21:52:38 - INFO - __main__ - Step 4398: {'lr': 0.0004996764594653258, 'samples': 844416, 'steps': 4397, 'loss/train': 2.6407816410064697} 11/06/2021 21:52:39 - INFO - __main__ - Step 4399: {'lr': 0.0004996761895127628, 'samples': 844608, 'steps': 4398, 'loss/train': 1.6900591850280762} 11/06/2021 21:52:39 - INFO - __main__ - Step 4400: {'lr': 0.0004996759194476996, 'samples': 844800, 'steps': 4399, 'loss/train': 0.33388879895210266} 11/06/2021 21:52:39 - INFO - __main__ - Step 4401: {'lr': 0.0004996756492701362, 'samples': 844992, 'steps': 4400, 'loss/train': 1.999221682548523} 11/06/2021 21:52:40 - INFO - __main__ - Step 4402: {'lr': 0.0004996753789800729, 'samples': 845184, 'steps': 4401, 'loss/train': 2.3262462615966797} 11/06/2021 21:52:40 - INFO - __main__ - Step 4403: {'lr': 0.0004996751085775096, 'samples': 845376, 'steps': 4402, 'loss/train': 2.0857207775115967} 11/06/2021 21:52:41 - INFO - __main__ - Step 4404: {'lr': 0.0004996748380624467, 'samples': 845568, 'steps': 4403, 'loss/train': 1.382709264755249} 11/06/2021 21:52:41 - INFO - __main__ - Step 4405: {'lr': 0.000499674567434884, 'samples': 845760, 'steps': 4404, 'loss/train': 2.260540008544922} 11/06/2021 21:52:42 - INFO - __main__ - Step 4406: {'lr': 0.0004996742966948219, 'samples': 845952, 'steps': 4405, 'loss/train': 1.8191969394683838} 11/06/2021 21:52:42 - INFO - __main__ - Step 4407: {'lr': 0.0004996740258422604, 'samples': 846144, 'steps': 4406, 'loss/train': 2.2261736392974854} 11/06/2021 21:52:43 - INFO - __main__ - Step 4408: {'lr': 0.0004996737548771997, 'samples': 846336, 'steps': 4407, 'loss/train': 1.8432106971740723} 11/06/2021 21:52:44 - INFO - __main__ - Step 4409: {'lr': 0.0004996734837996397, 'samples': 846528, 'steps': 4408, 'loss/train': 2.118229627609253} 11/06/2021 21:52:44 - INFO - __main__ - Step 4410: {'lr': 0.0004996732126095807, 'samples': 846720, 'steps': 4409, 'loss/train': 1.7854758501052856} 11/06/2021 21:52:44 - INFO - __main__ - Step 4411: {'lr': 0.0004996729413070229, 'samples': 846912, 'steps': 4410, 'loss/train': 2.3659751415252686} 11/06/2021 21:52:45 - INFO - __main__ - Step 4412: {'lr': 0.0004996726698919664, 'samples': 847104, 'steps': 4411, 'loss/train': 1.817520260810852} 11/06/2021 21:52:45 - INFO - __main__ - Step 4413: {'lr': 0.0004996723983644112, 'samples': 847296, 'steps': 4412, 'loss/train': 1.4988713264465332} 11/06/2021 21:52:46 - INFO - __main__ - Step 4414: {'lr': 0.0004996721267243573, 'samples': 847488, 'steps': 4413, 'loss/train': 1.2573717832565308} 11/06/2021 21:52:47 - INFO - __main__ - Step 4415: {'lr': 0.0004996718549718051, 'samples': 847680, 'steps': 4414, 'loss/train': 1.9494376182556152} 11/06/2021 21:52:47 - INFO - __main__ - Step 4416: {'lr': 0.0004996715831067546, 'samples': 847872, 'steps': 4415, 'loss/train': 0.341790109872818} 11/06/2021 21:52:47 - INFO - __main__ - Step 4417: {'lr': 0.000499671311129206, 'samples': 848064, 'steps': 4416, 'loss/train': 1.7873398065567017} 11/06/2021 21:52:48 - INFO - __main__ - Step 4418: {'lr': 0.0004996710390391593, 'samples': 848256, 'steps': 4417, 'loss/train': 2.2925448417663574} 11/06/2021 21:52:49 - INFO - __main__ - Step 4419: {'lr': 0.0004996707668366147, 'samples': 848448, 'steps': 4418, 'loss/train': 2.3310353755950928} 11/06/2021 21:52:49 - INFO - __main__ - Step 4420: {'lr': 0.0004996704945215724, 'samples': 848640, 'steps': 4419, 'loss/train': 1.234442949295044} 11/06/2021 21:52:49 - INFO - __main__ - Step 4421: {'lr': 0.0004996702220940322, 'samples': 848832, 'steps': 4420, 'loss/train': 2.263622522354126} 11/06/2021 21:52:50 - INFO - __main__ - Step 4422: {'lr': 0.0004996699495539947, 'samples': 849024, 'steps': 4421, 'loss/train': 2.1597673892974854} 11/06/2021 21:52:50 - INFO - __main__ - Step 4423: {'lr': 0.0004996696769014596, 'samples': 849216, 'steps': 4422, 'loss/train': 2.2731714248657227} 11/06/2021 21:52:50 - INFO - __main__ - Step 4424: {'lr': 0.0004996694041364272, 'samples': 849408, 'steps': 4423, 'loss/train': 2.049422264099121} 11/06/2021 21:52:51 - INFO - __main__ - Step 4425: {'lr': 0.0004996691312588977, 'samples': 849600, 'steps': 4424, 'loss/train': 1.7566492557525635} 11/06/2021 21:52:52 - INFO - __main__ - Step 4426: {'lr': 0.0004996688582688711, 'samples': 849792, 'steps': 4425, 'loss/train': 2.2029178142547607} 11/06/2021 21:52:52 - INFO - __main__ - Step 4427: {'lr': 0.0004996685851663477, 'samples': 849984, 'steps': 4426, 'loss/train': 2.1259970664978027} 11/06/2021 21:52:52 - INFO - __main__ - Step 4428: {'lr': 0.0004996683119513274, 'samples': 850176, 'steps': 4427, 'loss/train': 1.6493570804595947} 11/06/2021 21:52:53 - INFO - __main__ - Step 4429: {'lr': 0.0004996680386238103, 'samples': 850368, 'steps': 4428, 'loss/train': 1.9351447820663452} 11/06/2021 21:52:54 - INFO - __main__ - Step 4430: {'lr': 0.0004996677651837967, 'samples': 850560, 'steps': 4429, 'loss/train': 2.663370132446289} 11/06/2021 21:52:54 - INFO - __main__ - Step 4431: {'lr': 0.0004996674916312867, 'samples': 850752, 'steps': 4430, 'loss/train': 2.205284595489502} 11/06/2021 21:52:55 - INFO - __main__ - Step 4432: {'lr': 0.0004996672179662803, 'samples': 850944, 'steps': 4431, 'loss/train': 0.6293364763259888} 11/06/2021 21:52:55 - INFO - __main__ - Step 4433: {'lr': 0.0004996669441887778, 'samples': 851136, 'steps': 4432, 'loss/train': 1.934459924697876} 11/06/2021 21:52:55 - INFO - __main__ - Step 4434: {'lr': 0.0004996666702987791, 'samples': 851328, 'steps': 4433, 'loss/train': 1.715638518333435} 11/06/2021 21:52:56 - INFO - __main__ - Step 4435: {'lr': 0.0004996663962962846, 'samples': 851520, 'steps': 4434, 'loss/train': 2.070112705230713} 11/06/2021 21:52:57 - INFO - __main__ - Step 4436: {'lr': 0.0004996661221812942, 'samples': 851712, 'steps': 4435, 'loss/train': 2.642328977584839} 11/06/2021 21:52:57 - INFO - __main__ - Step 4437: {'lr': 0.0004996658479538081, 'samples': 851904, 'steps': 4436, 'loss/train': 2.1696090698242188} 11/06/2021 21:52:57 - INFO - __main__ - Step 4438: {'lr': 0.0004996655736138265, 'samples': 852096, 'steps': 4437, 'loss/train': 2.149383783340454} 11/06/2021 21:52:58 - INFO - __main__ - Step 4439: {'lr': 0.0004996652991613494, 'samples': 852288, 'steps': 4438, 'loss/train': 1.425952434539795} 11/06/2021 21:52:59 - INFO - __main__ - Step 4440: {'lr': 0.0004996650245963768, 'samples': 852480, 'steps': 4439, 'loss/train': 1.9291316270828247} 11/06/2021 21:53:00 - INFO - __main__ - Step 4441: {'lr': 0.0004996647499189092, 'samples': 852672, 'steps': 4440, 'loss/train': 1.8874024152755737} 11/06/2021 21:53:00 - INFO - __main__ - Step 4442: {'lr': 0.0004996644751289464, 'samples': 852864, 'steps': 4441, 'loss/train': 1.797756552696228} 11/06/2021 21:53:00 - INFO - __main__ - Step 4443: {'lr': 0.0004996642002264887, 'samples': 853056, 'steps': 4442, 'loss/train': 2.348628282546997} 11/06/2021 21:53:01 - INFO - __main__ - Step 4444: {'lr': 0.0004996639252115362, 'samples': 853248, 'steps': 4443, 'loss/train': 1.8861411809921265} 11/06/2021 21:53:01 - INFO - __main__ - Step 4445: {'lr': 0.000499663650084089, 'samples': 853440, 'steps': 4444, 'loss/train': 2.2785558700561523} 11/06/2021 21:53:01 - INFO - __main__ - Step 4446: {'lr': 0.0004996633748441472, 'samples': 853632, 'steps': 4445, 'loss/train': 2.116128444671631} 11/06/2021 21:53:03 - INFO - __main__ - Step 4447: {'lr': 0.0004996630994917108, 'samples': 853824, 'steps': 4446, 'loss/train': 1.9847468137741089} 11/06/2021 21:53:03 - INFO - __main__ - Step 4448: {'lr': 0.0004996628240267802, 'samples': 854016, 'steps': 4447, 'loss/train': 2.634124755859375} 11/06/2021 21:53:03 - INFO - __main__ - Step 4449: {'lr': 0.0004996625484493554, 'samples': 854208, 'steps': 4448, 'loss/train': 1.749208927154541} 11/06/2021 21:53:04 - INFO - __main__ - Step 4450: {'lr': 0.0004996622727594363, 'samples': 854400, 'steps': 4449, 'loss/train': 1.287934422492981} 11/06/2021 21:53:04 - INFO - __main__ - Step 4451: {'lr': 0.0004996619969570234, 'samples': 854592, 'steps': 4450, 'loss/train': 2.1173369884490967} 11/06/2021 21:53:05 - INFO - __main__ - Step 4452: {'lr': 0.0004996617210421166, 'samples': 854784, 'steps': 4451, 'loss/train': 1.4185445308685303} 11/06/2021 21:53:05 - INFO - __main__ - Step 4453: {'lr': 0.0004996614450147161, 'samples': 854976, 'steps': 4452, 'loss/train': 2.3131630420684814} 11/06/2021 21:53:06 - INFO - __main__ - Step 4454: {'lr': 0.0004996611688748221, 'samples': 855168, 'steps': 4453, 'loss/train': 0.5444425940513611} 11/06/2021 21:53:06 - INFO - __main__ - Step 4455: {'lr': 0.0004996608926224345, 'samples': 855360, 'steps': 4454, 'loss/train': 2.0416908264160156} 11/06/2021 21:53:06 - INFO - __main__ - Step 4456: {'lr': 0.0004996606162575536, 'samples': 855552, 'steps': 4455, 'loss/train': 1.4575010538101196} 11/06/2021 21:53:07 - INFO - __main__ - Step 4457: {'lr': 0.0004996603397801795, 'samples': 855744, 'steps': 4456, 'loss/train': 2.1322479248046875} 11/06/2021 21:53:08 - INFO - __main__ - Step 4458: {'lr': 0.0004996600631903123, 'samples': 855936, 'steps': 4457, 'loss/train': 2.1813089847564697} 11/06/2021 21:53:08 - INFO - __main__ - Step 4459: {'lr': 0.0004996597864879521, 'samples': 856128, 'steps': 4458, 'loss/train': 2.2517218589782715} 11/06/2021 21:53:08 - INFO - __main__ - Step 4460: {'lr': 0.000499659509673099, 'samples': 856320, 'steps': 4459, 'loss/train': 2.0033187866210938} 11/06/2021 21:53:09 - INFO - __main__ - Step 4461: {'lr': 0.0004996592327457533, 'samples': 856512, 'steps': 4460, 'loss/train': 1.793095350265503} 11/06/2021 21:53:10 - INFO - __main__ - Step 4462: {'lr': 0.000499658955705915, 'samples': 856704, 'steps': 4461, 'loss/train': 2.264535665512085} 11/06/2021 21:53:10 - INFO - __main__ - Step 4463: {'lr': 0.0004996586785535841, 'samples': 856896, 'steps': 4462, 'loss/train': 2.0258359909057617} 11/06/2021 21:53:11 - INFO - __main__ - Step 4464: {'lr': 0.000499658401288761, 'samples': 857088, 'steps': 4463, 'loss/train': 1.9607642889022827} 11/06/2021 21:53:11 - INFO - __main__ - Step 4465: {'lr': 0.0004996581239114456, 'samples': 857280, 'steps': 4464, 'loss/train': 1.6711504459381104} 11/06/2021 21:53:11 - INFO - __main__ - Step 4466: {'lr': 0.0004996578464216381, 'samples': 857472, 'steps': 4465, 'loss/train': 2.1529288291931152} 11/06/2021 21:53:12 - INFO - __main__ - Step 4467: {'lr': 0.0004996575688193386, 'samples': 857664, 'steps': 4466, 'loss/train': 1.737178087234497} 11/06/2021 21:53:13 - INFO - __main__ - Step 4468: {'lr': 0.0004996572911045473, 'samples': 857856, 'steps': 4467, 'loss/train': 2.233004570007324} 11/06/2021 21:53:13 - INFO - __main__ - Step 4469: {'lr': 0.0004996570132772642, 'samples': 858048, 'steps': 4468, 'loss/train': 1.742701530456543} 11/06/2021 21:53:13 - INFO - __main__ - Step 4470: {'lr': 0.0004996567353374896, 'samples': 858240, 'steps': 4469, 'loss/train': 2.380733013153076} 11/06/2021 21:53:14 - INFO - __main__ - Step 4471: {'lr': 0.0004996564572852235, 'samples': 858432, 'steps': 4470, 'loss/train': 2.0802853107452393} 11/06/2021 21:53:14 - INFO - __main__ - Step 4472: {'lr': 0.000499656179120466, 'samples': 858624, 'steps': 4471, 'loss/train': 1.6092973947525024} 11/06/2021 21:53:15 - INFO - __main__ - Step 4473: {'lr': 0.0004996559008432173, 'samples': 858816, 'steps': 4472, 'loss/train': 3.0951523780822754} 11/06/2021 21:53:16 - INFO - __main__ - Step 4474: {'lr': 0.0004996556224534776, 'samples': 859008, 'steps': 4473, 'loss/train': 2.4669737815856934} 11/06/2021 21:53:16 - INFO - __main__ - Step 4475: {'lr': 0.0004996553439512468, 'samples': 859200, 'steps': 4474, 'loss/train': 2.577219247817993} 11/06/2021 21:53:16 - INFO - __main__ - Step 4476: {'lr': 0.0004996550653365253, 'samples': 859392, 'steps': 4475, 'loss/train': 1.7655971050262451} 11/06/2021 21:53:17 - INFO - __main__ - Step 4477: {'lr': 0.0004996547866093129, 'samples': 859584, 'steps': 4476, 'loss/train': 2.026312828063965} 11/06/2021 21:53:17 - INFO - __main__ - Step 4478: {'lr': 0.00049965450776961, 'samples': 859776, 'steps': 4477, 'loss/train': 2.0732200145721436} 11/06/2021 21:53:18 - INFO - __main__ - Step 4479: {'lr': 0.0004996542288174166, 'samples': 859968, 'steps': 4478, 'loss/train': 2.4320411682128906} 11/06/2021 21:53:18 - INFO - __main__ - Step 4480: {'lr': 0.0004996539497527329, 'samples': 860160, 'steps': 4479, 'loss/train': 1.8877087831497192} 11/06/2021 21:53:19 - INFO - __main__ - Step 4481: {'lr': 0.000499653670575559, 'samples': 860352, 'steps': 4480, 'loss/train': 1.376404047012329} 11/06/2021 21:53:19 - INFO - __main__ - Step 4482: {'lr': 0.0004996533912858949, 'samples': 860544, 'steps': 4481, 'loss/train': 1.817132592201233} 11/06/2021 21:53:19 - INFO - __main__ - Step 4483: {'lr': 0.000499653111883741, 'samples': 860736, 'steps': 4482, 'loss/train': 2.1700544357299805} 11/06/2021 21:53:20 - INFO - __main__ - Step 4484: {'lr': 0.0004996528323690971, 'samples': 860928, 'steps': 4483, 'loss/train': 1.9818646907806396} 11/06/2021 21:53:21 - INFO - __main__ - Step 4485: {'lr': 0.0004996525527419636, 'samples': 861120, 'steps': 4484, 'loss/train': 1.9006667137145996} 11/06/2021 21:53:21 - INFO - __main__ - Step 4486: {'lr': 0.0004996522730023404, 'samples': 861312, 'steps': 4485, 'loss/train': 1.9432722330093384} 11/06/2021 21:53:21 - INFO - __main__ - Step 4487: {'lr': 0.0004996519931502279, 'samples': 861504, 'steps': 4486, 'loss/train': 2.3324050903320312} 11/06/2021 21:53:22 - INFO - __main__ - Step 4488: {'lr': 0.0004996517131856259, 'samples': 861696, 'steps': 4487, 'loss/train': 1.8573492765426636} 11/06/2021 21:53:23 - INFO - __main__ - Step 4489: {'lr': 0.0004996514331085348, 'samples': 861888, 'steps': 4488, 'loss/train': 1.8287177085876465} 11/06/2021 21:53:23 - INFO - __main__ - Step 4490: {'lr': 0.0004996511529189546, 'samples': 862080, 'steps': 4489, 'loss/train': 1.9560521841049194} 11/06/2021 21:53:24 - INFO - __main__ - Step 4491: {'lr': 0.0004996508726168854, 'samples': 862272, 'steps': 4490, 'loss/train': 2.2228381633758545} 11/06/2021 21:53:24 - INFO - __main__ - Step 4492: {'lr': 0.0004996505922023274, 'samples': 862464, 'steps': 4491, 'loss/train': 2.6008143424987793} 11/06/2021 21:53:24 - INFO - __main__ - Step 4493: {'lr': 0.0004996503116752807, 'samples': 862656, 'steps': 4492, 'loss/train': 2.061739206314087} 11/06/2021 21:53:25 - INFO - __main__ - Step 4494: {'lr': 0.0004996500310357454, 'samples': 862848, 'steps': 4493, 'loss/train': 1.7886160612106323} 11/06/2021 21:53:26 - INFO - __main__ - Step 4495: {'lr': 0.0004996497502837217, 'samples': 863040, 'steps': 4494, 'loss/train': 1.68747878074646} 11/06/2021 21:53:26 - INFO - __main__ - Step 4496: {'lr': 0.0004996494694192096, 'samples': 863232, 'steps': 4495, 'loss/train': 1.7942966222763062} 11/06/2021 21:53:26 - INFO - __main__ - Step 4497: {'lr': 0.0004996491884422092, 'samples': 863424, 'steps': 4496, 'loss/train': 2.276155948638916} 11/06/2021 21:53:27 - INFO - __main__ - Step 4498: {'lr': 0.0004996489073527208, 'samples': 863616, 'steps': 4497, 'loss/train': 2.0408365726470947} 11/06/2021 21:53:28 - INFO - __main__ - Step 4499: {'lr': 0.0004996486261507445, 'samples': 863808, 'steps': 4498, 'loss/train': 2.200601816177368} 11/06/2021 21:53:28 - INFO - __main__ - Step 4500: {'lr': 0.0004996483448362805, 'samples': 864000, 'steps': 4499, 'loss/train': 2.0834293365478516} 11/06/2021 21:53:28 - INFO - __main__ - Step 4501: {'lr': 0.0004996480634093287, 'samples': 864192, 'steps': 4500, 'loss/train': 1.751892328262329} 11/06/2021 21:53:29 - INFO - __main__ - Step 4502: {'lr': 0.0004996477818698893, 'samples': 864384, 'steps': 4501, 'loss/train': 2.13613224029541} 11/06/2021 21:53:29 - INFO - __main__ - Step 4503: {'lr': 0.0004996475002179625, 'samples': 864576, 'steps': 4502, 'loss/train': 2.1853764057159424} 11/06/2021 21:53:30 - INFO - __main__ - Step 4504: {'lr': 0.0004996472184535484, 'samples': 864768, 'steps': 4503, 'loss/train': 1.418428659439087} 11/06/2021 21:53:30 - INFO - __main__ - Step 4505: {'lr': 0.0004996469365766471, 'samples': 864960, 'steps': 4504, 'loss/train': 1.7238401174545288} 11/06/2021 21:53:31 - INFO - __main__ - Step 4506: {'lr': 0.0004996466545872588, 'samples': 865152, 'steps': 4505, 'loss/train': 1.999024510383606} 11/06/2021 21:53:31 - INFO - __main__ - Step 4507: {'lr': 0.0004996463724853834, 'samples': 865344, 'steps': 4506, 'loss/train': 2.0768072605133057} 11/06/2021 21:53:31 - INFO - __main__ - Step 4508: {'lr': 0.0004996460902710214, 'samples': 865536, 'steps': 4507, 'loss/train': 1.8393359184265137} 11/06/2021 21:53:32 - INFO - __main__ - Step 4509: {'lr': 0.0004996458079441727, 'samples': 865728, 'steps': 4508, 'loss/train': 1.8585339784622192} 11/06/2021 21:53:33 - INFO - __main__ - Step 4510: {'lr': 0.0004996455255048373, 'samples': 865920, 'steps': 4509, 'loss/train': 1.3014329671859741} 11/06/2021 21:53:33 - INFO - __main__ - Step 4511: {'lr': 0.0004996452429530156, 'samples': 866112, 'steps': 4510, 'loss/train': 2.1434712409973145} 11/06/2021 21:53:33 - INFO - __main__ - Step 4512: {'lr': 0.0004996449602887075, 'samples': 866304, 'steps': 4511, 'loss/train': 1.680091142654419} 11/06/2021 21:53:34 - INFO - __main__ - Step 4513: {'lr': 0.0004996446775119134, 'samples': 866496, 'steps': 4512, 'loss/train': 1.8141558170318604} 11/06/2021 21:53:35 - INFO - __main__ - Step 4514: {'lr': 0.0004996443946226331, 'samples': 866688, 'steps': 4513, 'loss/train': 1.9032708406448364} 11/06/2021 21:53:35 - INFO - __main__ - Step 4515: {'lr': 0.000499644111620867, 'samples': 866880, 'steps': 4514, 'loss/train': 1.6730246543884277} 11/06/2021 21:53:36 - INFO - __main__ - Step 4516: {'lr': 0.000499643828506615, 'samples': 867072, 'steps': 4515, 'loss/train': 2.1784026622772217} 11/06/2021 21:53:36 - INFO - __main__ - Step 4517: {'lr': 0.0004996435452798775, 'samples': 867264, 'steps': 4516, 'loss/train': 1.8776859045028687} 11/06/2021 21:53:36 - INFO - __main__ - Step 4518: {'lr': 0.0004996432619406543, 'samples': 867456, 'steps': 4517, 'loss/train': 2.0420169830322266} 11/06/2021 21:53:37 - INFO - __main__ - Step 4519: {'lr': 0.0004996429784889458, 'samples': 867648, 'steps': 4518, 'loss/train': 2.412881851196289} 11/06/2021 21:53:38 - INFO - __main__ - Step 4520: {'lr': 0.000499642694924752, 'samples': 867840, 'steps': 4519, 'loss/train': 1.9957324266433716} 11/06/2021 21:53:38 - INFO - __main__ - Step 4521: {'lr': 0.000499642411248073, 'samples': 868032, 'steps': 4520, 'loss/train': 1.6232712268829346} 11/06/2021 21:53:38 - INFO - __main__ - Step 4522: {'lr': 0.0004996421274589091, 'samples': 868224, 'steps': 4521, 'loss/train': 2.311401128768921} 11/06/2021 21:53:39 - INFO - __main__ - Step 4523: {'lr': 0.0004996418435572603, 'samples': 868416, 'steps': 4522, 'loss/train': 2.561228036880493} 11/06/2021 21:53:39 - INFO - __main__ - Step 4524: {'lr': 0.0004996415595431267, 'samples': 868608, 'steps': 4523, 'loss/train': 2.3829054832458496} 11/06/2021 21:53:40 - INFO - __main__ - Step 4525: {'lr': 0.0004996412754165084, 'samples': 868800, 'steps': 4524, 'loss/train': 1.9954290390014648} 11/06/2021 21:53:40 - INFO - __main__ - Step 4526: {'lr': 0.0004996409911774056, 'samples': 868992, 'steps': 4525, 'loss/train': 1.9752427339553833} 11/06/2021 21:53:41 - INFO - __main__ - Step 4527: {'lr': 0.0004996407068258186, 'samples': 869184, 'steps': 4526, 'loss/train': 2.2408523559570312} 11/06/2021 21:53:41 - INFO - __main__ - Step 4528: {'lr': 0.0004996404223617471, 'samples': 869376, 'steps': 4527, 'loss/train': 1.6397135257720947} 11/06/2021 21:53:41 - INFO - __main__ - Step 4529: {'lr': 0.0004996401377851917, 'samples': 869568, 'steps': 4528, 'loss/train': 2.1448097229003906} 11/06/2021 21:53:42 - INFO - __main__ - Step 4530: {'lr': 0.0004996398530961522, 'samples': 869760, 'steps': 4529, 'loss/train': 1.8006598949432373} 11/06/2021 21:53:43 - INFO - __main__ - Step 4531: {'lr': 0.0004996395682946288, 'samples': 869952, 'steps': 4530, 'loss/train': 1.2451746463775635} 11/06/2021 21:53:43 - INFO - __main__ - Step 4532: {'lr': 0.0004996392833806217, 'samples': 870144, 'steps': 4531, 'loss/train': 1.7764043807983398} 11/06/2021 21:53:44 - INFO - __main__ - Step 4533: {'lr': 0.000499638998354131, 'samples': 870336, 'steps': 4532, 'loss/train': 2.118048667907715} 11/06/2021 21:53:44 - INFO - __main__ - Step 4534: {'lr': 0.0004996387132151567, 'samples': 870528, 'steps': 4533, 'loss/train': 2.315908908843994} 11/06/2021 21:53:45 - INFO - __main__ - Step 4535: {'lr': 0.0004996384279636993, 'samples': 870720, 'steps': 4534, 'loss/train': 2.341172218322754} 11/06/2021 21:53:45 - INFO - __main__ - Step 4536: {'lr': 0.0004996381425997584, 'samples': 870912, 'steps': 4535, 'loss/train': 2.5195581912994385} 11/06/2021 21:53:46 - INFO - __main__ - Step 4537: {'lr': 0.0004996378571233347, 'samples': 871104, 'steps': 4536, 'loss/train': 2.251058578491211} 11/06/2021 21:53:46 - INFO - __main__ - Step 4538: {'lr': 0.0004996375715344278, 'samples': 871296, 'steps': 4537, 'loss/train': 2.1683661937713623} 11/06/2021 21:53:46 - INFO - __main__ - Step 4539: {'lr': 0.0004996372858330382, 'samples': 871488, 'steps': 4538, 'loss/train': 2.623131275177002} 11/06/2021 21:53:47 - INFO - __main__ - Step 4540: {'lr': 0.0004996370000191657, 'samples': 871680, 'steps': 4539, 'loss/train': 2.218451738357544} 11/06/2021 21:53:48 - INFO - __main__ - Step 4541: {'lr': 0.0004996367140928107, 'samples': 871872, 'steps': 4540, 'loss/train': 2.884366989135742} 11/06/2021 21:53:48 - INFO - __main__ - Step 4542: {'lr': 0.0004996364280539734, 'samples': 872064, 'steps': 4541, 'loss/train': 2.3906033039093018} 11/06/2021 21:53:48 - INFO - __main__ - Step 4543: {'lr': 0.0004996361419026537, 'samples': 872256, 'steps': 4542, 'loss/train': 2.0911548137664795} 11/06/2021 21:53:49 - INFO - __main__ - Step 4544: {'lr': 0.0004996358556388518, 'samples': 872448, 'steps': 4543, 'loss/train': 2.255887269973755} 11/06/2021 21:53:49 - INFO - __main__ - Step 4545: {'lr': 0.0004996355692625678, 'samples': 872640, 'steps': 4544, 'loss/train': 2.3875391483306885} 11/06/2021 21:53:50 - INFO - __main__ - Step 4546: {'lr': 0.0004996352827738018, 'samples': 872832, 'steps': 4545, 'loss/train': 1.9046090841293335} 11/06/2021 21:53:50 - INFO - __main__ - Step 4547: {'lr': 0.0004996349961725542, 'samples': 873024, 'steps': 4546, 'loss/train': 1.6704214811325073} 11/06/2021 21:53:51 - INFO - __main__ - Step 4548: {'lr': 0.0004996347094588247, 'samples': 873216, 'steps': 4547, 'loss/train': 1.5265803337097168} 11/06/2021 21:53:51 - INFO - __main__ - Step 4549: {'lr': 0.0004996344226326137, 'samples': 873408, 'steps': 4548, 'loss/train': 2.1202685832977295} 11/06/2021 21:53:52 - INFO - __main__ - Step 4550: {'lr': 0.0004996341356939214, 'samples': 873600, 'steps': 4549, 'loss/train': 2.068074941635132} 11/06/2021 21:53:53 - INFO - __main__ - Step 4551: {'lr': 0.0004996338486427477, 'samples': 873792, 'steps': 4550, 'loss/train': 2.596259593963623} 11/06/2021 21:53:53 - INFO - __main__ - Step 4552: {'lr': 0.0004996335614790929, 'samples': 873984, 'steps': 4551, 'loss/train': 1.9318180084228516} 11/06/2021 21:53:53 - INFO - __main__ - Step 4553: {'lr': 0.0004996332742029571, 'samples': 874176, 'steps': 4552, 'loss/train': 1.7639127969741821} 11/06/2021 21:53:54 - INFO - __main__ - Step 4554: {'lr': 0.0004996329868143404, 'samples': 874368, 'steps': 4553, 'loss/train': 2.062023639678955} 11/06/2021 21:53:54 - INFO - __main__ - Step 4555: {'lr': 0.0004996326993132428, 'samples': 874560, 'steps': 4554, 'loss/train': 2.341677665710449} 11/06/2021 21:53:55 - INFO - __main__ - Step 4556: {'lr': 0.0004996324116996647, 'samples': 874752, 'steps': 4555, 'loss/train': 1.2803831100463867} 11/06/2021 21:53:55 - INFO - __main__ - Step 4557: {'lr': 0.0004996321239736059, 'samples': 874944, 'steps': 4556, 'loss/train': 1.2265723943710327} 11/06/2021 21:53:56 - INFO - __main__ - Step 4558: {'lr': 0.000499631836135067, 'samples': 875136, 'steps': 4557, 'loss/train': 1.5673511028289795} 11/06/2021 21:53:56 - INFO - __main__ - Step 4559: {'lr': 0.0004996315481840476, 'samples': 875328, 'steps': 4558, 'loss/train': 2.2120308876037598} 11/06/2021 21:53:56 - INFO - __main__ - Step 4560: {'lr': 0.0004996312601205482, 'samples': 875520, 'steps': 4559, 'loss/train': 1.771849513053894} 11/06/2021 21:53:57 - INFO - __main__ - Step 4561: {'lr': 0.0004996309719445687, 'samples': 875712, 'steps': 4560, 'loss/train': 1.6283718347549438} 11/06/2021 21:53:58 - INFO - __main__ - Step 4562: {'lr': 0.0004996306836561094, 'samples': 875904, 'steps': 4561, 'loss/train': 2.1488685607910156} 11/06/2021 21:53:58 - INFO - __main__ - Step 4563: {'lr': 0.0004996303952551704, 'samples': 876096, 'steps': 4562, 'loss/train': 3.6791834831237793} 11/06/2021 21:53:58 - INFO - __main__ - Step 4564: {'lr': 0.0004996301067417517, 'samples': 876288, 'steps': 4563, 'loss/train': 1.914914846420288} 11/06/2021 21:53:59 - INFO - __main__ - Step 4565: {'lr': 0.0004996298181158536, 'samples': 876480, 'steps': 4564, 'loss/train': 2.1919569969177246} 11/06/2021 21:53:59 - INFO - __main__ - Step 4566: {'lr': 0.0004996295293774762, 'samples': 876672, 'steps': 4565, 'loss/train': 2.0341947078704834} 11/06/2021 21:54:00 - INFO - __main__ - Step 4567: {'lr': 0.0004996292405266195, 'samples': 876864, 'steps': 4566, 'loss/train': 3.0565133094787598} 11/06/2021 21:54:01 - INFO - __main__ - Step 4568: {'lr': 0.0004996289515632838, 'samples': 877056, 'steps': 4567, 'loss/train': 1.6455254554748535} 11/06/2021 21:54:01 - INFO - __main__ - Step 4569: {'lr': 0.0004996286624874691, 'samples': 877248, 'steps': 4568, 'loss/train': 2.0319526195526123} 11/06/2021 21:54:01 - INFO - __main__ - Step 4570: {'lr': 0.0004996283732991755, 'samples': 877440, 'steps': 4569, 'loss/train': 2.2277791500091553} 11/06/2021 21:54:02 - INFO - __main__ - Step 4571: {'lr': 0.0004996280839984033, 'samples': 877632, 'steps': 4570, 'loss/train': 1.485971450805664} 11/06/2021 21:54:03 - INFO - __main__ - Step 4572: {'lr': 0.0004996277945851525, 'samples': 877824, 'steps': 4571, 'loss/train': 2.180328369140625} 11/06/2021 21:54:03 - INFO - __main__ - Step 4573: {'lr': 0.0004996275050594233, 'samples': 878016, 'steps': 4572, 'loss/train': 1.8692883253097534} 11/06/2021 21:54:03 - INFO - __main__ - Step 4574: {'lr': 0.0004996272154212158, 'samples': 878208, 'steps': 4573, 'loss/train': 0.9339056611061096} 11/06/2021 21:54:04 - INFO - __main__ - Step 4575: {'lr': 0.0004996269256705301, 'samples': 878400, 'steps': 4574, 'loss/train': 2.103940486907959} 11/06/2021 21:54:04 - INFO - __main__ - Step 4576: {'lr': 0.0004996266358073664, 'samples': 878592, 'steps': 4575, 'loss/train': 1.7924268245697021} 11/06/2021 21:54:05 - INFO - __main__ - Step 4577: {'lr': 0.0004996263458317248, 'samples': 878784, 'steps': 4576, 'loss/train': 1.6914095878601074} 11/06/2021 21:54:05 - INFO - __main__ - Step 4578: {'lr': 0.0004996260557436053, 'samples': 878976, 'steps': 4577, 'loss/train': 1.528101921081543} 11/06/2021 21:54:06 - INFO - __main__ - Step 4579: {'lr': 0.0004996257655430083, 'samples': 879168, 'steps': 4578, 'loss/train': 1.8863238096237183} 11/06/2021 21:54:06 - INFO - __main__ - Step 4580: {'lr': 0.0004996254752299337, 'samples': 879360, 'steps': 4579, 'loss/train': 1.8763165473937988} 11/06/2021 21:54:06 - INFO - __main__ - Step 4581: {'lr': 0.0004996251848043817, 'samples': 879552, 'steps': 4580, 'loss/train': 1.885318636894226} 11/06/2021 21:54:07 - INFO - __main__ - Step 4582: {'lr': 0.0004996248942663525, 'samples': 879744, 'steps': 4581, 'loss/train': 1.8357065916061401} 11/06/2021 21:54:08 - INFO - __main__ - Step 4583: {'lr': 0.000499624603615846, 'samples': 879936, 'steps': 4582, 'loss/train': 1.8866019248962402} 11/06/2021 21:54:08 - INFO - __main__ - Step 4584: {'lr': 0.0004996243128528628, 'samples': 880128, 'steps': 4583, 'loss/train': 2.051255464553833} 11/06/2021 21:54:08 - INFO - __main__ - Step 4585: {'lr': 0.0004996240219774025, 'samples': 880320, 'steps': 4584, 'loss/train': 1.9675004482269287} 11/06/2021 21:54:09 - INFO - __main__ - Step 4586: {'lr': 0.0004996237309894656, 'samples': 880512, 'steps': 4585, 'loss/train': 2.283939838409424} 11/06/2021 21:54:10 - INFO - __main__ - Step 4587: {'lr': 0.0004996234398890521, 'samples': 880704, 'steps': 4586, 'loss/train': 1.3432775735855103} 11/06/2021 21:54:10 - INFO - __main__ - Step 4588: {'lr': 0.000499623148676162, 'samples': 880896, 'steps': 4587, 'loss/train': 2.1022074222564697} 11/06/2021 21:54:10 - INFO - __main__ - Step 4589: {'lr': 0.0004996228573507957, 'samples': 881088, 'steps': 4588, 'loss/train': 2.6343271732330322} 11/06/2021 21:54:11 - INFO - __main__ - Step 4590: {'lr': 0.0004996225659129531, 'samples': 881280, 'steps': 4589, 'loss/train': 1.8193244934082031} 11/06/2021 21:54:11 - INFO - __main__ - Step 4591: {'lr': 0.0004996222743626345, 'samples': 881472, 'steps': 4590, 'loss/train': 2.7049386501312256} 11/06/2021 21:54:12 - INFO - __main__ - Step 4592: {'lr': 0.0004996219826998399, 'samples': 881664, 'steps': 4591, 'loss/train': 2.1974246501922607} 11/06/2021 21:54:13 - INFO - __main__ - Step 4593: {'lr': 0.0004996216909245695, 'samples': 881856, 'steps': 4592, 'loss/train': 2.143535852432251} 11/06/2021 21:54:13 - INFO - __main__ - Step 4594: {'lr': 0.0004996213990368234, 'samples': 882048, 'steps': 4593, 'loss/train': 2.2130115032196045} 11/06/2021 21:54:13 - INFO - __main__ - Step 4595: {'lr': 0.0004996211070366018, 'samples': 882240, 'steps': 4594, 'loss/train': 2.191429376602173} 11/06/2021 21:54:14 - INFO - __main__ - Step 4596: {'lr': 0.0004996208149239047, 'samples': 882432, 'steps': 4595, 'loss/train': 2.114386558532715} 11/06/2021 21:54:14 - INFO - __main__ - Step 4597: {'lr': 0.0004996205226987324, 'samples': 882624, 'steps': 4596, 'loss/train': 1.8462443351745605} 11/06/2021 21:54:15 - INFO - __main__ - Step 4598: {'lr': 0.0004996202303610849, 'samples': 882816, 'steps': 4597, 'loss/train': 1.8497729301452637} 11/06/2021 21:54:15 - INFO - __main__ - Step 4599: {'lr': 0.0004996199379109624, 'samples': 883008, 'steps': 4598, 'loss/train': 1.8294633626937866} 11/06/2021 21:54:16 - INFO - __main__ - Step 4600: {'lr': 0.000499619645348365, 'samples': 883200, 'steps': 4599, 'loss/train': 2.2398879528045654} 11/06/2021 21:54:16 - INFO - __main__ - Step 4601: {'lr': 0.0004996193526732929, 'samples': 883392, 'steps': 4600, 'loss/train': 2.352756977081299} 11/06/2021 21:54:16 - INFO - __main__ - Step 4602: {'lr': 0.0004996190598857461, 'samples': 883584, 'steps': 4601, 'loss/train': 1.9614698886871338} 11/06/2021 21:54:17 - INFO - __main__ - Step 4603: {'lr': 0.0004996187669857247, 'samples': 883776, 'steps': 4602, 'loss/train': 1.7905255556106567} 11/06/2021 21:54:18 - INFO - __main__ - Step 4604: {'lr': 0.0004996184739732291, 'samples': 883968, 'steps': 4603, 'loss/train': 2.1833322048187256} 11/06/2021 21:54:18 - INFO - __main__ - Step 4605: {'lr': 0.0004996181808482592, 'samples': 884160, 'steps': 4604, 'loss/train': 1.840651035308838} 11/06/2021 21:54:18 - INFO - __main__ - Step 4606: {'lr': 0.0004996178876108152, 'samples': 884352, 'steps': 4605, 'loss/train': 1.8321231603622437} 11/06/2021 21:54:19 - INFO - __main__ - Step 4607: {'lr': 0.0004996175942608973, 'samples': 884544, 'steps': 4606, 'loss/train': 2.244662284851074} 11/06/2021 21:54:20 - INFO - __main__ - Step 4608: {'lr': 0.0004996173007985055, 'samples': 884736, 'steps': 4607, 'loss/train': 1.933355689048767} 11/06/2021 21:54:20 - INFO - __main__ - Step 4609: {'lr': 0.00049961700722364, 'samples': 884928, 'steps': 4608, 'loss/train': 1.835882306098938} 11/06/2021 21:54:21 - INFO - __main__ - Step 4610: {'lr': 0.0004996167135363009, 'samples': 885120, 'steps': 4609, 'loss/train': 0.5920076966285706} 11/06/2021 21:54:21 - INFO - __main__ - Step 4611: {'lr': 0.0004996164197364884, 'samples': 885312, 'steps': 4610, 'loss/train': 2.2849209308624268} 11/06/2021 21:54:21 - INFO - __main__ - Step 4612: {'lr': 0.0004996161258242025, 'samples': 885504, 'steps': 4611, 'loss/train': 2.7224349975585938} 11/06/2021 21:54:22 - INFO - __main__ - Step 4613: {'lr': 0.0004996158317994436, 'samples': 885696, 'steps': 4612, 'loss/train': 2.293109893798828} 11/06/2021 21:54:23 - INFO - __main__ - Step 4614: {'lr': 0.0004996155376622115, 'samples': 885888, 'steps': 4613, 'loss/train': 2.077742099761963} 11/06/2021 21:54:23 - INFO - __main__ - Step 4615: {'lr': 0.0004996152434125066, 'samples': 886080, 'steps': 4614, 'loss/train': 2.129293441772461} 11/06/2021 21:54:23 - INFO - __main__ - Step 4616: {'lr': 0.0004996149490503289, 'samples': 886272, 'steps': 4615, 'loss/train': 1.7415257692337036} 11/06/2021 21:54:24 - INFO - __main__ - Step 4617: {'lr': 0.0004996146545756786, 'samples': 886464, 'steps': 4616, 'loss/train': 1.9531595706939697} 11/06/2021 21:54:25 - INFO - __main__ - Step 4618: {'lr': 0.0004996143599885557, 'samples': 886656, 'steps': 4617, 'loss/train': 2.034966468811035} 11/06/2021 21:54:25 - INFO - __main__ - Step 4619: {'lr': 0.0004996140652889603, 'samples': 886848, 'steps': 4618, 'loss/train': 1.855339765548706} 11/06/2021 21:54:25 - INFO - __main__ - Step 4620: {'lr': 0.0004996137704768929, 'samples': 887040, 'steps': 4619, 'loss/train': 2.010745048522949} 11/06/2021 21:54:26 - INFO - __main__ - Step 4621: {'lr': 0.0004996134755523532, 'samples': 887232, 'steps': 4620, 'loss/train': 1.572229027748108} 11/06/2021 21:54:26 - INFO - __main__ - Step 4622: {'lr': 0.0004996131805153417, 'samples': 887424, 'steps': 4621, 'loss/train': 2.434321880340576} 11/06/2021 21:54:27 - INFO - __main__ - Step 4623: {'lr': 0.0004996128853658583, 'samples': 887616, 'steps': 4622, 'loss/train': 1.803308129310608} 11/06/2021 21:54:27 - INFO - __main__ - Step 4624: {'lr': 0.0004996125901039031, 'samples': 887808, 'steps': 4623, 'loss/train': 2.078205108642578} 11/06/2021 21:54:28 - INFO - __main__ - Step 4625: {'lr': 0.0004996122947294764, 'samples': 888000, 'steps': 4624, 'loss/train': 1.3897693157196045} 11/06/2021 21:54:28 - INFO - __main__ - Step 4626: {'lr': 0.0004996119992425782, 'samples': 888192, 'steps': 4625, 'loss/train': 2.1612305641174316} 11/06/2021 21:54:28 - INFO - __main__ - Step 4627: {'lr': 0.0004996117036432087, 'samples': 888384, 'steps': 4626, 'loss/train': 1.204795479774475} 11/06/2021 21:54:29 - INFO - __main__ - Step 4628: {'lr': 0.000499611407931368, 'samples': 888576, 'steps': 4627, 'loss/train': 1.87538480758667} 11/06/2021 21:54:30 - INFO - __main__ - Step 4629: {'lr': 0.0004996111121070562, 'samples': 888768, 'steps': 4628, 'loss/train': 1.9798368215560913} 11/06/2021 21:54:30 - INFO - __main__ - Step 4630: {'lr': 0.0004996108161702736, 'samples': 888960, 'steps': 4629, 'loss/train': 1.8879029750823975} 11/06/2021 21:54:30 - INFO - __main__ - Step 4631: {'lr': 0.0004996105201210202, 'samples': 889152, 'steps': 4630, 'loss/train': 1.498540997505188} 11/06/2021 21:54:31 - INFO - __main__ - Step 4632: {'lr': 0.0004996102239592961, 'samples': 889344, 'steps': 4631, 'loss/train': 2.450267791748047} 11/06/2021 21:54:31 - INFO - __main__ - Step 4633: {'lr': 0.0004996099276851015, 'samples': 889536, 'steps': 4632, 'loss/train': 1.960253357887268} 11/06/2021 21:54:32 - INFO - __main__ - Step 4634: {'lr': 0.0004996096312984365, 'samples': 889728, 'steps': 4633, 'loss/train': 2.13409423828125} 11/06/2021 21:54:33 - INFO - __main__ - Step 4635: {'lr': 0.0004996093347993013, 'samples': 889920, 'steps': 4634, 'loss/train': 2.0862467288970947} 11/06/2021 21:54:33 - INFO - __main__ - Step 4636: {'lr': 0.000499609038187696, 'samples': 890112, 'steps': 4635, 'loss/train': 1.7066518068313599} 11/06/2021 21:54:33 - INFO - __main__ - Step 4637: {'lr': 0.0004996087414636207, 'samples': 890304, 'steps': 4636, 'loss/train': 1.917240858078003} 11/06/2021 21:54:34 - INFO - __main__ - Step 4638: {'lr': 0.0004996084446270755, 'samples': 890496, 'steps': 4637, 'loss/train': 1.5188076496124268} 11/06/2021 21:54:35 - INFO - __main__ - Step 4639: {'lr': 0.0004996081476780607, 'samples': 890688, 'steps': 4638, 'loss/train': 2.3080358505249023} 11/06/2021 21:54:35 - INFO - __main__ - Step 4640: {'lr': 0.0004996078506165762, 'samples': 890880, 'steps': 4639, 'loss/train': 1.5977153778076172} 11/06/2021 21:54:35 - INFO - __main__ - Step 4641: {'lr': 0.0004996075534426222, 'samples': 891072, 'steps': 4640, 'loss/train': 1.9571729898452759} 11/06/2021 21:54:36 - INFO - __main__ - Step 4642: {'lr': 0.000499607256156199, 'samples': 891264, 'steps': 4641, 'loss/train': 1.8053956031799316} 11/06/2021 21:54:36 - INFO - __main__ - Step 4643: {'lr': 0.0004996069587573067, 'samples': 891456, 'steps': 4642, 'loss/train': 1.5705664157867432} 11/06/2021 21:54:37 - INFO - __main__ - Step 4644: {'lr': 0.0004996066612459452, 'samples': 891648, 'steps': 4643, 'loss/train': 1.735826849937439} 11/06/2021 21:54:37 - INFO - __main__ - Step 4645: {'lr': 0.0004996063636221148, 'samples': 891840, 'steps': 4644, 'loss/train': 2.2214009761810303} 11/06/2021 21:54:38 - INFO - __main__ - Step 4646: {'lr': 0.0004996060658858158, 'samples': 892032, 'steps': 4645, 'loss/train': 2.121854305267334} 11/06/2021 21:54:38 - INFO - __main__ - Step 4647: {'lr': 0.000499605768037048, 'samples': 892224, 'steps': 4646, 'loss/train': 2.353457450866699} 11/06/2021 21:54:39 - INFO - __main__ - Step 4648: {'lr': 0.0004996054700758117, 'samples': 892416, 'steps': 4647, 'loss/train': 2.0667355060577393} 11/06/2021 21:54:40 - INFO - __main__ - Step 4649: {'lr': 0.0004996051720021071, 'samples': 892608, 'steps': 4648, 'loss/train': 2.009962558746338} 11/06/2021 21:54:40 - INFO - __main__ - Step 4650: {'lr': 0.0004996048738159342, 'samples': 892800, 'steps': 4649, 'loss/train': 0.5192378163337708} 11/06/2021 21:54:40 - INFO - __main__ - Step 4651: {'lr': 0.0004996045755172932, 'samples': 892992, 'steps': 4650, 'loss/train': 1.84172785282135} 11/06/2021 21:54:41 - INFO - __main__ - Step 4652: {'lr': 0.0004996042771061843, 'samples': 893184, 'steps': 4651, 'loss/train': 2.217998504638672} 11/06/2021 21:54:41 - INFO - __main__ - Step 4653: {'lr': 0.0004996039785826075, 'samples': 893376, 'steps': 4652, 'loss/train': 1.648202657699585} 11/06/2021 21:54:42 - INFO - __main__ - Step 4654: {'lr': 0.000499603679946563, 'samples': 893568, 'steps': 4653, 'loss/train': 2.03910756111145} 11/06/2021 21:54:43 - INFO - __main__ - Step 4655: {'lr': 0.0004996033811980509, 'samples': 893760, 'steps': 4654, 'loss/train': 1.7975298166275024} 11/06/2021 21:54:43 - INFO - __main__ - Step 4656: {'lr': 0.0004996030823370715, 'samples': 893952, 'steps': 4655, 'loss/train': 1.8534296751022339} 11/06/2021 21:54:43 - INFO - __main__ - Step 4657: {'lr': 0.0004996027833636247, 'samples': 894144, 'steps': 4656, 'loss/train': 2.475618600845337} 11/06/2021 21:54:44 - INFO - __main__ - Step 4658: {'lr': 0.0004996024842777106, 'samples': 894336, 'steps': 4657, 'loss/train': 1.9979157447814941} 11/06/2021 21:54:44 - INFO - __main__ - Step 4659: {'lr': 0.0004996021850793297, 'samples': 894528, 'steps': 4658, 'loss/train': 2.030845880508423} 11/06/2021 21:54:45 - INFO - __main__ - Step 4660: {'lr': 0.0004996018857684818, 'samples': 894720, 'steps': 4659, 'loss/train': 2.158336877822876} 11/06/2021 21:54:45 - INFO - __main__ - Step 4661: {'lr': 0.0004996015863451672, 'samples': 894912, 'steps': 4660, 'loss/train': 1.8965908288955688} 11/06/2021 21:54:46 - INFO - __main__ - Step 4662: {'lr': 0.0004996012868093859, 'samples': 895104, 'steps': 4661, 'loss/train': 2.2160189151763916} 11/06/2021 21:54:46 - INFO - __main__ - Step 4663: {'lr': 0.0004996009871611382, 'samples': 895296, 'steps': 4662, 'loss/train': 2.1121556758880615} 11/06/2021 21:54:46 - INFO - __main__ - Step 4664: {'lr': 0.0004996006874004241, 'samples': 895488, 'steps': 4663, 'loss/train': 1.954535961151123} 11/06/2021 21:54:49 - INFO - __main__ - Step 4665: {'lr': 0.0004996003875272438, 'samples': 895680, 'steps': 4664, 'loss/train': 1.9362295866012573} 11/06/2021 21:54:49 - INFO - __main__ - Step 4666: {'lr': 0.0004996000875415973, 'samples': 895872, 'steps': 4665, 'loss/train': 1.841673731803894} 11/06/2021 21:54:50 - INFO - __main__ - Step 4667: {'lr': 0.000499599787443485, 'samples': 896064, 'steps': 4666, 'loss/train': 1.7296624183654785} 11/06/2021 21:54:50 - INFO - __main__ - Step 4668: {'lr': 0.0004995994872329069, 'samples': 896256, 'steps': 4667, 'loss/train': 2.1744236946105957} 11/06/2021 21:54:50 - INFO - __main__ - Step 4669: {'lr': 0.000499599186909863, 'samples': 896448, 'steps': 4668, 'loss/train': 2.424499273300171} 11/06/2021 21:54:51 - INFO - __main__ - Step 4670: {'lr': 0.0004995988864743536, 'samples': 896640, 'steps': 4669, 'loss/train': 2.350759744644165} 11/06/2021 21:54:51 - INFO - __main__ - Step 4671: {'lr': 0.0004995985859263789, 'samples': 896832, 'steps': 4670, 'loss/train': 1.975557804107666} 11/06/2021 21:54:51 - INFO - __main__ - Step 4672: {'lr': 0.0004995982852659388, 'samples': 897024, 'steps': 4671, 'loss/train': 2.1741600036621094} 11/06/2021 21:54:52 - INFO - __main__ - Step 4673: {'lr': 0.0004995979844930336, 'samples': 897216, 'steps': 4672, 'loss/train': 2.133139133453369} 11/06/2021 21:54:53 - INFO - __main__ - Step 4674: {'lr': 0.0004995976836076635, 'samples': 897408, 'steps': 4673, 'loss/train': 2.012343168258667} 11/06/2021 21:54:53 - INFO - __main__ - Step 4675: {'lr': 0.0004995973826098283, 'samples': 897600, 'steps': 4674, 'loss/train': 1.9310002326965332} 11/06/2021 21:54:53 - INFO - __main__ - Step 4676: {'lr': 0.0004995970814995285, 'samples': 897792, 'steps': 4675, 'loss/train': 1.969705581665039} 11/06/2021 21:54:54 - INFO - __main__ - Step 4677: {'lr': 0.0004995967802767641, 'samples': 897984, 'steps': 4676, 'loss/train': 0.5457145571708679} 11/06/2021 21:54:55 - INFO - __main__ - Step 4678: {'lr': 0.0004995964789415353, 'samples': 898176, 'steps': 4677, 'loss/train': 2.1169257164001465} 11/06/2021 21:54:55 - INFO - __main__ - Step 4679: {'lr': 0.0004995961774938423, 'samples': 898368, 'steps': 4678, 'loss/train': 2.047304630279541} 11/06/2021 21:54:56 - INFO - __main__ - Step 4680: {'lr': 0.0004995958759336849, 'samples': 898560, 'steps': 4679, 'loss/train': 2.2334115505218506} 11/06/2021 21:54:56 - INFO - __main__ - Step 4681: {'lr': 0.0004995955742610635, 'samples': 898752, 'steps': 4680, 'loss/train': 1.9930462837219238} 11/06/2021 21:54:56 - INFO - __main__ - Step 4682: {'lr': 0.0004995952724759781, 'samples': 898944, 'steps': 4681, 'loss/train': 2.6837527751922607} 11/06/2021 21:54:57 - INFO - __main__ - Step 4683: {'lr': 0.0004995949705784291, 'samples': 899136, 'steps': 4682, 'loss/train': 2.2613110542297363} 11/06/2021 21:54:58 - INFO - __main__ - Step 4684: {'lr': 0.0004995946685684164, 'samples': 899328, 'steps': 4683, 'loss/train': 1.6057881116867065} 11/06/2021 21:54:58 - INFO - __main__ - Step 4685: {'lr': 0.0004995943664459401, 'samples': 899520, 'steps': 4684, 'loss/train': 4.826809406280518} 11/06/2021 21:54:58 - INFO - __main__ - Step 4686: {'lr': 0.0004995940642110005, 'samples': 899712, 'steps': 4685, 'loss/train': 2.056427478790283} 11/06/2021 21:54:59 - INFO - __main__ - Step 4687: {'lr': 0.0004995937618635977, 'samples': 899904, 'steps': 4686, 'loss/train': 2.5549728870391846} 11/06/2021 21:55:00 - INFO - __main__ - Step 4688: {'lr': 0.0004995934594037316, 'samples': 900096, 'steps': 4687, 'loss/train': 1.6349835395812988} 11/06/2021 21:55:00 - INFO - __main__ - Step 4689: {'lr': 0.0004995931568314028, 'samples': 900288, 'steps': 4688, 'loss/train': 2.1520705223083496} 11/06/2021 21:55:00 - INFO - __main__ - Step 4690: {'lr': 0.0004995928541466111, 'samples': 900480, 'steps': 4689, 'loss/train': 2.086479902267456} 11/06/2021 21:55:01 - INFO - __main__ - Step 4691: {'lr': 0.0004995925513493567, 'samples': 900672, 'steps': 4690, 'loss/train': 2.070526123046875} 11/06/2021 21:55:01 - INFO - __main__ - Step 4692: {'lr': 0.0004995922484396397, 'samples': 900864, 'steps': 4691, 'loss/train': 1.8946828842163086} 11/06/2021 21:55:01 - INFO - __main__ - Step 4693: {'lr': 0.0004995919454174603, 'samples': 901056, 'steps': 4692, 'loss/train': 2.0533816814422607} 11/06/2021 21:55:02 - INFO - __main__ - Step 4694: {'lr': 0.0004995916422828187, 'samples': 901248, 'steps': 4693, 'loss/train': 1.6909483671188354} 11/06/2021 21:55:03 - INFO - __main__ - Step 4695: {'lr': 0.0004995913390357148, 'samples': 901440, 'steps': 4694, 'loss/train': 2.1296300888061523} 11/06/2021 21:55:03 - INFO - __main__ - Step 4696: {'lr': 0.0004995910356761491, 'samples': 901632, 'steps': 4695, 'loss/train': 1.5140820741653442} 11/06/2021 21:55:03 - INFO - __main__ - Step 4697: {'lr': 0.0004995907322041214, 'samples': 901824, 'steps': 4696, 'loss/train': 2.322920560836792} 11/06/2021 21:55:04 - INFO - __main__ - Step 4698: {'lr': 0.000499590428619632, 'samples': 902016, 'steps': 4697, 'loss/train': 1.875250220298767} 11/06/2021 21:55:05 - INFO - __main__ - Step 4699: {'lr': 0.000499590124922681, 'samples': 902208, 'steps': 4698, 'loss/train': 0.6247835159301758} 11/06/2021 21:55:05 - INFO - __main__ - Step 4700: {'lr': 0.0004995898211132685, 'samples': 902400, 'steps': 4699, 'loss/train': 2.233553171157837} 11/06/2021 21:55:06 - INFO - __main__ - Step 4701: {'lr': 0.0004995895171913947, 'samples': 902592, 'steps': 4700, 'loss/train': 2.2635233402252197} 11/06/2021 21:55:06 - INFO - __main__ - Step 4702: {'lr': 0.0004995892131570598, 'samples': 902784, 'steps': 4701, 'loss/train': 1.2235736846923828} 11/06/2021 21:55:06 - INFO - __main__ - Step 4703: {'lr': 0.0004995889090102638, 'samples': 902976, 'steps': 4702, 'loss/train': 1.9816572666168213} 11/06/2021 21:55:07 - INFO - __main__ - Step 4704: {'lr': 0.0004995886047510068, 'samples': 903168, 'steps': 4703, 'loss/train': 2.1657280921936035} 11/06/2021 21:55:08 - INFO - __main__ - Step 4705: {'lr': 0.0004995883003792891, 'samples': 903360, 'steps': 4704, 'loss/train': 2.1573469638824463} 11/06/2021 21:55:08 - INFO - __main__ - Step 4706: {'lr': 0.0004995879958951107, 'samples': 903552, 'steps': 4705, 'loss/train': 1.8357532024383545} 11/06/2021 21:55:08 - INFO - __main__ - Step 4707: {'lr': 0.0004995876912984719, 'samples': 903744, 'steps': 4706, 'loss/train': 2.194171667098999} 11/06/2021 21:55:09 - INFO - __main__ - Step 4708: {'lr': 0.0004995873865893727, 'samples': 903936, 'steps': 4707, 'loss/train': 2.122776985168457} 11/06/2021 21:55:10 - INFO - __main__ - Step 4709: {'lr': 0.0004995870817678133, 'samples': 904128, 'steps': 4708, 'loss/train': 1.9645999670028687} 11/06/2021 21:55:10 - INFO - __main__ - Step 4710: {'lr': 0.0004995867768337938, 'samples': 904320, 'steps': 4709, 'loss/train': 2.073693037033081} 11/06/2021 21:55:10 - INFO - __main__ - Step 4711: {'lr': 0.0004995864717873143, 'samples': 904512, 'steps': 4710, 'loss/train': 1.3138636350631714} 11/06/2021 21:55:11 - INFO - __main__ - Step 4712: {'lr': 0.000499586166628375, 'samples': 904704, 'steps': 4711, 'loss/train': 2.1905508041381836} 11/06/2021 21:55:11 - INFO - __main__ - Step 4713: {'lr': 0.0004995858613569761, 'samples': 904896, 'steps': 4712, 'loss/train': 2.189453363418579} 11/06/2021 21:55:12 - INFO - __main__ - Step 4714: {'lr': 0.0004995855559731176, 'samples': 905088, 'steps': 4713, 'loss/train': 2.266838550567627} 11/06/2021 21:55:12 - INFO - __main__ - Step 4715: {'lr': 0.0004995852504767997, 'samples': 905280, 'steps': 4714, 'loss/train': 2.4359450340270996} 11/06/2021 21:55:13 - INFO - __main__ - Step 4716: {'lr': 0.0004995849448680225, 'samples': 905472, 'steps': 4715, 'loss/train': 2.0849661827087402} 11/06/2021 21:55:13 - INFO - __main__ - Step 4717: {'lr': 0.0004995846391467862, 'samples': 905664, 'steps': 4716, 'loss/train': 2.252028465270996} 11/06/2021 21:55:13 - INFO - __main__ - Step 4718: {'lr': 0.000499584333313091, 'samples': 905856, 'steps': 4717, 'loss/train': 1.7350718975067139} 11/06/2021 21:55:14 - INFO - __main__ - Step 4719: {'lr': 0.0004995840273669369, 'samples': 906048, 'steps': 4718, 'loss/train': 2.1980574131011963} 11/06/2021 21:55:15 - INFO - __main__ - Step 4720: {'lr': 0.0004995837213083241, 'samples': 906240, 'steps': 4719, 'loss/train': 2.655808925628662} 11/06/2021 21:55:15 - INFO - __main__ - Step 4721: {'lr': 0.0004995834151372526, 'samples': 906432, 'steps': 4720, 'loss/train': 1.8203097581863403} 11/06/2021 21:55:16 - INFO - __main__ - Step 4722: {'lr': 0.0004995831088537229, 'samples': 906624, 'steps': 4721, 'loss/train': 1.9288336038589478} 11/06/2021 21:55:16 - INFO - __main__ - Step 4723: {'lr': 0.0004995828024577346, 'samples': 906816, 'steps': 4722, 'loss/train': 2.221000909805298} 11/06/2021 21:55:16 - INFO - __main__ - Step 4724: {'lr': 0.0004995824959492884, 'samples': 907008, 'steps': 4723, 'loss/train': 2.1988742351531982} 11/06/2021 21:55:17 - INFO - __main__ - Step 4725: {'lr': 0.0004995821893283841, 'samples': 907200, 'steps': 4724, 'loss/train': 1.5321357250213623} 11/06/2021 21:55:18 - INFO - __main__ - Step 4726: {'lr': 0.0004995818825950218, 'samples': 907392, 'steps': 4725, 'loss/train': 2.327254295349121} 11/06/2021 21:55:18 - INFO - __main__ - Step 4727: {'lr': 0.0004995815757492019, 'samples': 907584, 'steps': 4726, 'loss/train': 1.697227954864502} 11/06/2021 21:55:18 - INFO - __main__ - Step 4728: {'lr': 0.0004995812687909243, 'samples': 907776, 'steps': 4727, 'loss/train': 1.925517201423645} 11/06/2021 21:55:19 - INFO - __main__ - Step 4729: {'lr': 0.0004995809617201894, 'samples': 907968, 'steps': 4728, 'loss/train': 2.1235063076019287} 11/06/2021 21:55:20 - INFO - __main__ - Step 4730: {'lr': 0.000499580654536997, 'samples': 908160, 'steps': 4729, 'loss/train': 1.6067290306091309} 11/06/2021 21:55:20 - INFO - __main__ - Step 4731: {'lr': 0.0004995803472413474, 'samples': 908352, 'steps': 4730, 'loss/train': 2.062716245651245} 11/06/2021 21:55:21 - INFO - __main__ - Step 4732: {'lr': 0.0004995800398332409, 'samples': 908544, 'steps': 4731, 'loss/train': 2.0088040828704834} 11/06/2021 21:55:21 - INFO - __main__ - Step 4733: {'lr': 0.0004995797323126774, 'samples': 908736, 'steps': 4732, 'loss/train': 2.1273770332336426} 11/06/2021 21:55:21 - INFO - __main__ - Step 4734: {'lr': 0.0004995794246796571, 'samples': 908928, 'steps': 4733, 'loss/train': 1.9056154489517212} 11/06/2021 21:55:22 - INFO - __main__ - Step 4735: {'lr': 0.0004995791169341801, 'samples': 909120, 'steps': 4734, 'loss/train': 1.6100257635116577} 11/06/2021 21:55:23 - INFO - __main__ - Step 4736: {'lr': 0.0004995788090762467, 'samples': 909312, 'steps': 4735, 'loss/train': 1.8864790201187134} 11/06/2021 21:55:23 - INFO - __main__ - Step 4737: {'lr': 0.000499578501105857, 'samples': 909504, 'steps': 4736, 'loss/train': 1.9987335205078125} 11/06/2021 21:55:23 - INFO - __main__ - Step 4738: {'lr': 0.000499578193023011, 'samples': 909696, 'steps': 4737, 'loss/train': 2.262216567993164} 11/06/2021 21:55:24 - INFO - __main__ - Step 4739: {'lr': 0.0004995778848277088, 'samples': 909888, 'steps': 4738, 'loss/train': 1.9893282651901245} 11/06/2021 21:55:24 - INFO - __main__ - Step 4740: {'lr': 0.0004995775765199509, 'samples': 910080, 'steps': 4739, 'loss/train': 2.1088781356811523} 11/06/2021 21:55:25 - INFO - __main__ - Step 4741: {'lr': 0.000499577268099737, 'samples': 910272, 'steps': 4740, 'loss/train': 1.7496652603149414} 11/06/2021 21:55:25 - INFO - __main__ - Step 4742: {'lr': 0.0004995769595670675, 'samples': 910464, 'steps': 4741, 'loss/train': 1.8243907690048218} 11/06/2021 21:55:26 - INFO - __main__ - Step 4743: {'lr': 0.0004995766509219425, 'samples': 910656, 'steps': 4742, 'loss/train': 1.8536678552627563} 11/06/2021 21:55:26 - INFO - __main__ - Step 4744: {'lr': 0.0004995763421643621, 'samples': 910848, 'steps': 4743, 'loss/train': 2.054396152496338} 11/06/2021 21:55:26 - INFO - __main__ - Step 4745: {'lr': 0.0004995760332943264, 'samples': 911040, 'steps': 4744, 'loss/train': 2.0926995277404785} 11/06/2021 21:55:28 - INFO - __main__ - Step 4746: {'lr': 0.0004995757243118356, 'samples': 911232, 'steps': 4745, 'loss/train': 2.342751979827881} 11/06/2021 21:55:28 - INFO - __main__ - Step 4747: {'lr': 0.0004995754152168899, 'samples': 911424, 'steps': 4746, 'loss/train': 1.4246360063552856} 11/06/2021 21:55:28 - INFO - __main__ - Step 4748: {'lr': 0.0004995751060094893, 'samples': 911616, 'steps': 4747, 'loss/train': 1.6921579837799072} 11/06/2021 21:55:29 - INFO - __main__ - Step 4749: {'lr': 0.000499574796689634, 'samples': 911808, 'steps': 4748, 'loss/train': 1.5545167922973633} 11/06/2021 21:55:29 - INFO - __main__ - Step 4750: {'lr': 0.0004995744872573242, 'samples': 912000, 'steps': 4749, 'loss/train': 1.6792792081832886} 11/06/2021 21:55:30 - INFO - __main__ - Step 4751: {'lr': 0.00049957417771256, 'samples': 912192, 'steps': 4750, 'loss/train': 2.301424741744995} 11/06/2021 21:55:31 - INFO - __main__ - Step 4752: {'lr': 0.0004995738680553415, 'samples': 912384, 'steps': 4751, 'loss/train': 2.0780701637268066} 11/06/2021 21:55:31 - INFO - __main__ - Step 4753: {'lr': 0.0004995735582856689, 'samples': 912576, 'steps': 4752, 'loss/train': 2.5025558471679688} 11/06/2021 21:55:31 - INFO - __main__ - Step 4754: {'lr': 0.0004995732484035422, 'samples': 912768, 'steps': 4753, 'loss/train': 1.6821069717407227} 11/06/2021 21:55:32 - INFO - __main__ - Step 4755: {'lr': 0.0004995729384089618, 'samples': 912960, 'steps': 4754, 'loss/train': 1.8580917119979858} 11/06/2021 21:55:32 - INFO - __main__ - Step 4756: {'lr': 0.0004995726283019275, 'samples': 913152, 'steps': 4755, 'loss/train': 5.469394207000732} 11/06/2021 21:55:33 - INFO - __main__ - Step 4757: {'lr': 0.0004995723180824397, 'samples': 913344, 'steps': 4756, 'loss/train': 2.1818153858184814} 11/06/2021 21:55:33 - INFO - __main__ - Step 4758: {'lr': 0.0004995720077504986, 'samples': 913536, 'steps': 4757, 'loss/train': 1.8625547885894775} 11/06/2021 21:55:34 - INFO - __main__ - Step 4759: {'lr': 0.0004995716973061041, 'samples': 913728, 'steps': 4758, 'loss/train': 2.1026546955108643} 11/06/2021 21:55:34 - INFO - __main__ - Step 4760: {'lr': 0.0004995713867492564, 'samples': 913920, 'steps': 4759, 'loss/train': 2.178823471069336} 11/06/2021 21:55:34 - INFO - __main__ - Step 4761: {'lr': 0.0004995710760799557, 'samples': 914112, 'steps': 4760, 'loss/train': 1.872559905052185} 11/06/2021 21:55:36 - INFO - __main__ - Step 4762: {'lr': 0.0004995707652982022, 'samples': 914304, 'steps': 4761, 'loss/train': 1.871140718460083} 11/06/2021 21:55:36 - INFO - __main__ - Step 4763: {'lr': 0.0004995704544039958, 'samples': 914496, 'steps': 4762, 'loss/train': 2.3007659912109375} 11/06/2021 21:55:36 - INFO - __main__ - Step 4764: {'lr': 0.0004995701433973369, 'samples': 914688, 'steps': 4763, 'loss/train': 0.6473186016082764} 11/06/2021 21:55:37 - INFO - __main__ - Step 4765: {'lr': 0.0004995698322782257, 'samples': 914880, 'steps': 4764, 'loss/train': 1.983769416809082} 11/06/2021 21:55:37 - INFO - __main__ - Step 4766: {'lr': 0.0004995695210466619, 'samples': 915072, 'steps': 4765, 'loss/train': 1.983061671257019} 11/06/2021 21:55:37 - INFO - __main__ - Step 4767: {'lr': 0.0004995692097026461, 'samples': 915264, 'steps': 4766, 'loss/train': 1.7104268074035645} 11/06/2021 21:55:38 - INFO - __main__ - Step 4768: {'lr': 0.0004995688982461783, 'samples': 915456, 'steps': 4767, 'loss/train': 2.606029748916626} 11/06/2021 21:55:39 - INFO - __main__ - Step 4769: {'lr': 0.0004995685866772586, 'samples': 915648, 'steps': 4768, 'loss/train': 1.642903447151184} 11/06/2021 21:55:39 - INFO - __main__ - Step 4770: {'lr': 0.000499568274995887, 'samples': 915840, 'steps': 4769, 'loss/train': 2.383634328842163} 11/06/2021 21:55:39 - INFO - __main__ - Step 4771: {'lr': 0.0004995679632020639, 'samples': 916032, 'steps': 4770, 'loss/train': 2.0660829544067383} 11/06/2021 21:55:40 - INFO - __main__ - Step 4772: {'lr': 0.0004995676512957892, 'samples': 916224, 'steps': 4771, 'loss/train': 2.0167953968048096} 11/06/2021 21:55:41 - INFO - __main__ - Step 4773: {'lr': 0.0004995673392770634, 'samples': 916416, 'steps': 4772, 'loss/train': 2.2633042335510254} 11/06/2021 21:55:41 - INFO - __main__ - Step 4774: {'lr': 0.0004995670271458863, 'samples': 916608, 'steps': 4773, 'loss/train': 2.0929172039031982} 11/06/2021 21:55:42 - INFO - __main__ - Step 4775: {'lr': 0.0004995667149022581, 'samples': 916800, 'steps': 4774, 'loss/train': 2.1483800411224365} 11/06/2021 21:55:42 - INFO - __main__ - Step 4776: {'lr': 0.000499566402546179, 'samples': 916992, 'steps': 4775, 'loss/train': 1.7766259908676147} 11/06/2021 21:55:42 - INFO - __main__ - Step 4777: {'lr': 0.0004995660900776491, 'samples': 917184, 'steps': 4776, 'loss/train': 1.8469535112380981} 11/06/2021 21:55:43 - INFO - __main__ - Step 4778: {'lr': 0.0004995657774966686, 'samples': 917376, 'steps': 4777, 'loss/train': 2.0229592323303223} 11/06/2021 21:55:44 - INFO - __main__ - Step 4779: {'lr': 0.0004995654648032377, 'samples': 917568, 'steps': 4778, 'loss/train': 1.3622205257415771} 11/06/2021 21:55:44 - INFO - __main__ - Step 4780: {'lr': 0.0004995651519973563, 'samples': 917760, 'steps': 4779, 'loss/train': 2.1144587993621826} 11/06/2021 21:55:44 - INFO - __main__ - Step 4781: {'lr': 0.0004995648390790249, 'samples': 917952, 'steps': 4780, 'loss/train': 1.954245686531067} 11/06/2021 21:55:45 - INFO - __main__ - Step 4782: {'lr': 0.0004995645260482432, 'samples': 918144, 'steps': 4781, 'loss/train': 2.7569739818573} 11/06/2021 21:55:46 - INFO - __main__ - Step 4783: {'lr': 0.0004995642129050117, 'samples': 918336, 'steps': 4782, 'loss/train': 1.3092371225357056} 11/06/2021 21:55:46 - INFO - __main__ - Step 4784: {'lr': 0.0004995638996493304, 'samples': 918528, 'steps': 4783, 'loss/train': 1.6529567241668701} 11/06/2021 21:55:46 - INFO - __main__ - Step 4785: {'lr': 0.0004995635862811994, 'samples': 918720, 'steps': 4784, 'loss/train': 2.129288911819458} 11/06/2021 21:55:47 - INFO - __main__ - Step 4786: {'lr': 0.000499563272800619, 'samples': 918912, 'steps': 4785, 'loss/train': 2.2793755531311035} 11/06/2021 21:55:47 - INFO - __main__ - Step 4787: {'lr': 0.0004995629592075892, 'samples': 919104, 'steps': 4786, 'loss/train': 2.265683174133301} 11/06/2021 21:55:48 - INFO - __main__ - Step 4788: {'lr': 0.0004995626455021101, 'samples': 919296, 'steps': 4787, 'loss/train': 2.4997894763946533} 11/06/2021 21:55:49 - INFO - __main__ - Step 4789: {'lr': 0.0004995623316841821, 'samples': 919488, 'steps': 4788, 'loss/train': 2.061391592025757} 11/06/2021 21:55:49 - INFO - __main__ - Step 4790: {'lr': 0.0004995620177538051, 'samples': 919680, 'steps': 4789, 'loss/train': 2.428020715713501} 11/06/2021 21:55:50 - INFO - __main__ - Step 4791: {'lr': 0.0004995617037109792, 'samples': 919872, 'steps': 4790, 'loss/train': 1.7995200157165527} 11/06/2021 21:55:50 - INFO - __main__ - Step 4792: {'lr': 0.0004995613895557048, 'samples': 920064, 'steps': 4791, 'loss/train': 1.883157730102539} 11/06/2021 21:55:51 - INFO - __main__ - Step 4793: {'lr': 0.0004995610752879818, 'samples': 920256, 'steps': 4792, 'loss/train': 2.25808048248291} 11/06/2021 21:55:51 - INFO - __main__ - Step 4794: {'lr': 0.0004995607609078104, 'samples': 920448, 'steps': 4793, 'loss/train': 2.118656873703003} 11/06/2021 21:55:52 - INFO - __main__ - Step 4795: {'lr': 0.0004995604464151908, 'samples': 920640, 'steps': 4794, 'loss/train': 2.104627847671509} 11/06/2021 21:55:52 - INFO - __main__ - Step 4796: {'lr': 0.0004995601318101231, 'samples': 920832, 'steps': 4795, 'loss/train': 2.0214784145355225} 11/06/2021 21:55:52 - INFO - __main__ - Step 4797: {'lr': 0.0004995598170926074, 'samples': 921024, 'steps': 4796, 'loss/train': 1.9268819093704224} 11/06/2021 21:55:53 - INFO - __main__ - Step 4798: {'lr': 0.000499559502262644, 'samples': 921216, 'steps': 4797, 'loss/train': 1.5341185331344604} 11/06/2021 21:55:54 - INFO - __main__ - Step 4799: {'lr': 0.000499559187320233, 'samples': 921408, 'steps': 4798, 'loss/train': 2.1940500736236572} 11/06/2021 21:55:54 - INFO - __main__ - Step 4800: {'lr': 0.0004995588722653743, 'samples': 921600, 'steps': 4799, 'loss/train': 1.9055429697036743} 11/06/2021 21:55:54 - INFO - __main__ - Step 4801: {'lr': 0.0004995585570980684, 'samples': 921792, 'steps': 4800, 'loss/train': 2.3950815200805664} 11/06/2021 21:55:55 - INFO - __main__ - Step 4802: {'lr': 0.0004995582418183151, 'samples': 921984, 'steps': 4801, 'loss/train': 1.969565510749817} 11/06/2021 21:55:55 - INFO - __main__ - Step 4803: {'lr': 0.0004995579264261148, 'samples': 922176, 'steps': 4802, 'loss/train': 1.7654180526733398} 11/06/2021 21:55:56 - INFO - __main__ - Step 4804: {'lr': 0.0004995576109214676, 'samples': 922368, 'steps': 4803, 'loss/train': 2.6347360610961914} 11/06/2021 21:55:57 - INFO - __main__ - Step 4805: {'lr': 0.0004995572953043736, 'samples': 922560, 'steps': 4804, 'loss/train': 1.7408865690231323} 11/06/2021 21:55:57 - INFO - __main__ - Step 4806: {'lr': 0.0004995569795748328, 'samples': 922752, 'steps': 4805, 'loss/train': 1.1435315608978271} 11/06/2021 21:55:57 - INFO - __main__ - Step 4807: {'lr': 0.0004995566637328456, 'samples': 922944, 'steps': 4806, 'loss/train': 1.0032495260238647} 11/06/2021 21:55:58 - INFO - __main__ - Step 4808: {'lr': 0.0004995563477784119, 'samples': 923136, 'steps': 4807, 'loss/train': 2.07177472114563} 11/06/2021 21:55:59 - INFO - __main__ - Step 4809: {'lr': 0.000499556031711532, 'samples': 923328, 'steps': 4808, 'loss/train': 2.4952619075775146} 11/06/2021 21:55:59 - INFO - __main__ - Step 4810: {'lr': 0.000499555715532206, 'samples': 923520, 'steps': 4809, 'loss/train': 1.7936309576034546} 11/06/2021 21:55:59 - INFO - __main__ - Step 4811: {'lr': 0.0004995553992404342, 'samples': 923712, 'steps': 4810, 'loss/train': 1.8712501525878906} 11/06/2021 21:56:00 - INFO - __main__ - Step 4812: {'lr': 0.0004995550828362163, 'samples': 923904, 'steps': 4811, 'loss/train': 2.1187033653259277} 11/06/2021 21:56:00 - INFO - __main__ - Step 4813: {'lr': 0.000499554766319553, 'samples': 924096, 'steps': 4812, 'loss/train': 1.8378645181655884} 11/06/2021 21:56:01 - INFO - __main__ - Step 4814: {'lr': 0.0004995544496904441, 'samples': 924288, 'steps': 4813, 'loss/train': 2.4952051639556885} 11/06/2021 21:56:01 - INFO - __main__ - Step 4815: {'lr': 0.0004995541329488897, 'samples': 924480, 'steps': 4814, 'loss/train': 1.766731858253479} 11/06/2021 21:56:02 - INFO - __main__ - Step 4816: {'lr': 0.0004995538160948901, 'samples': 924672, 'steps': 4815, 'loss/train': 1.933300495147705} 11/06/2021 21:56:02 - INFO - __main__ - Step 4817: {'lr': 0.0004995534991284455, 'samples': 924864, 'steps': 4816, 'loss/train': 1.8735243082046509} 11/06/2021 21:56:02 - INFO - __main__ - Step 4818: {'lr': 0.0004995531820495559, 'samples': 925056, 'steps': 4817, 'loss/train': 2.582186460494995} 11/06/2021 21:56:03 - INFO - __main__ - Step 4819: {'lr': 0.0004995528648582214, 'samples': 925248, 'steps': 4818, 'loss/train': 2.391286611557007} 11/06/2021 21:56:04 - INFO - __main__ - Step 4820: {'lr': 0.0004995525475544423, 'samples': 925440, 'steps': 4819, 'loss/train': 2.931082248687744} 11/06/2021 21:56:04 - INFO - __main__ - Step 4821: {'lr': 0.0004995522301382187, 'samples': 925632, 'steps': 4820, 'loss/train': 1.65629243850708} 11/06/2021 21:56:05 - INFO - __main__ - Step 4822: {'lr': 0.0004995519126095506, 'samples': 925824, 'steps': 4821, 'loss/train': 2.3707022666931152} 11/06/2021 21:56:05 - INFO - __main__ - Step 4823: {'lr': 0.0004995515949684384, 'samples': 926016, 'steps': 4822, 'loss/train': 1.6298257112503052} 11/06/2021 21:56:05 - INFO - __main__ - Step 4824: {'lr': 0.000499551277214882, 'samples': 926208, 'steps': 4823, 'loss/train': 1.6904128789901733} 11/06/2021 21:56:06 - INFO - __main__ - Step 4825: {'lr': 0.0004995509593488818, 'samples': 926400, 'steps': 4824, 'loss/train': 2.1001079082489014} 11/06/2021 21:56:07 - INFO - __main__ - Step 4826: {'lr': 0.0004995506413704376, 'samples': 926592, 'steps': 4825, 'loss/train': 1.8763246536254883} 11/06/2021 21:56:07 - INFO - __main__ - Step 4827: {'lr': 0.0004995503232795498, 'samples': 926784, 'steps': 4826, 'loss/train': 1.9993937015533447} 11/06/2021 21:56:07 - INFO - __main__ - Step 4828: {'lr': 0.0004995500050762185, 'samples': 926976, 'steps': 4827, 'loss/train': 2.1457080841064453} 11/06/2021 21:56:08 - INFO - __main__ - Step 4829: {'lr': 0.0004995496867604438, 'samples': 927168, 'steps': 4828, 'loss/train': 1.929814338684082} 11/06/2021 21:56:09 - INFO - __main__ - Step 4830: {'lr': 0.0004995493683322259, 'samples': 927360, 'steps': 4829, 'loss/train': 1.904067873954773} 11/06/2021 21:56:09 - INFO - __main__ - Step 4831: {'lr': 0.0004995490497915649, 'samples': 927552, 'steps': 4830, 'loss/train': 2.4040565490722656} 11/06/2021 21:56:09 - INFO - __main__ - Step 4832: {'lr': 0.0004995487311384609, 'samples': 927744, 'steps': 4831, 'loss/train': 2.00188946723938} 11/06/2021 21:56:10 - INFO - __main__ - Step 4833: {'lr': 0.0004995484123729141, 'samples': 927936, 'steps': 4832, 'loss/train': 2.0301706790924072} 11/06/2021 21:56:10 - INFO - __main__ - Step 4834: {'lr': 0.0004995480934949247, 'samples': 928128, 'steps': 4833, 'loss/train': 1.7859026193618774} 11/06/2021 21:56:11 - INFO - __main__ - Step 4835: {'lr': 0.0004995477745044927, 'samples': 928320, 'steps': 4834, 'loss/train': 1.3403795957565308} 11/06/2021 21:56:12 - INFO - __main__ - Step 4836: {'lr': 0.0004995474554016184, 'samples': 928512, 'steps': 4835, 'loss/train': 2.4826717376708984} 11/06/2021 21:56:12 - INFO - __main__ - Step 4837: {'lr': 0.0004995471361863017, 'samples': 928704, 'steps': 4836, 'loss/train': 2.1667168140411377} 11/06/2021 21:56:12 - INFO - __main__ - Step 4838: {'lr': 0.0004995468168585431, 'samples': 928896, 'steps': 4837, 'loss/train': 1.3020102977752686} 11/06/2021 21:56:13 - INFO - __main__ - Step 4839: {'lr': 0.0004995464974183424, 'samples': 929088, 'steps': 4838, 'loss/train': 2.1668708324432373} 11/06/2021 21:56:14 - INFO - __main__ - Step 4840: {'lr': 0.0004995461778657002, 'samples': 929280, 'steps': 4839, 'loss/train': 1.8353111743927002} 11/06/2021 21:56:14 - INFO - __main__ - Step 4841: {'lr': 0.000499545858200616, 'samples': 929472, 'steps': 4840, 'loss/train': 1.82950758934021} 11/06/2021 21:56:14 - INFO - __main__ - Step 4842: {'lr': 0.0004995455384230904, 'samples': 929664, 'steps': 4841, 'loss/train': 1.8687033653259277} 11/06/2021 21:56:15 - INFO - __main__ - Step 4843: {'lr': 0.0004995452185331235, 'samples': 929856, 'steps': 4842, 'loss/train': 0.33405929803848267} 11/06/2021 21:56:15 - INFO - __main__ - Step 4844: {'lr': 0.0004995448985307153, 'samples': 930048, 'steps': 4843, 'loss/train': 1.0525949001312256} 11/06/2021 21:56:16 - INFO - __main__ - Step 4845: {'lr': 0.0004995445784158661, 'samples': 930240, 'steps': 4844, 'loss/train': 2.037909507751465} 11/06/2021 21:56:16 - INFO - __main__ - Step 4846: {'lr': 0.0004995442581885759, 'samples': 930432, 'steps': 4845, 'loss/train': 1.789475679397583} 11/06/2021 21:56:17 - INFO - __main__ - Step 4847: {'lr': 0.0004995439378488449, 'samples': 930624, 'steps': 4846, 'loss/train': 1.7952702045440674} 11/06/2021 21:56:17 - INFO - __main__ - Step 4848: {'lr': 0.0004995436173966733, 'samples': 930816, 'steps': 4847, 'loss/train': 1.640943169593811} 11/06/2021 21:56:17 - INFO - __main__ - Step 4849: {'lr': 0.0004995432968320611, 'samples': 931008, 'steps': 4848, 'loss/train': 2.3322298526763916} 11/06/2021 21:56:19 - INFO - __main__ - Step 4850: {'lr': 0.0004995429761550086, 'samples': 931200, 'steps': 4849, 'loss/train': 2.3583528995513916} 11/06/2021 21:56:19 - INFO - __main__ - Step 4851: {'lr': 0.0004995426553655159, 'samples': 931392, 'steps': 4850, 'loss/train': 1.6499428749084473} 11/06/2021 21:56:19 - INFO - __main__ - Step 4852: {'lr': 0.0004995423344635831, 'samples': 931584, 'steps': 4851, 'loss/train': 1.9984158277511597} 11/06/2021 21:56:20 - INFO - __main__ - Step 4853: {'lr': 0.0004995420134492105, 'samples': 931776, 'steps': 4852, 'loss/train': 2.299149513244629} 11/06/2021 21:56:20 - INFO - __main__ - Step 4854: {'lr': 0.0004995416923223979, 'samples': 931968, 'steps': 4853, 'loss/train': 2.4122209548950195} 11/06/2021 21:56:20 - INFO - __main__ - Step 4855: {'lr': 0.0004995413710831458, 'samples': 932160, 'steps': 4854, 'loss/train': 2.0572216510772705} 11/06/2021 21:56:21 - INFO - __main__ - Step 4856: {'lr': 0.0004995410497314542, 'samples': 932352, 'steps': 4855, 'loss/train': 1.9187612533569336} 11/06/2021 21:56:22 - INFO - __main__ - Step 4857: {'lr': 0.0004995407282673232, 'samples': 932544, 'steps': 4856, 'loss/train': 1.4883348941802979} 11/06/2021 21:56:22 - INFO - __main__ - Step 4858: {'lr': 0.000499540406690753, 'samples': 932736, 'steps': 4857, 'loss/train': 1.9691627025604248} 11/06/2021 21:56:22 - INFO - __main__ - Step 4859: {'lr': 0.0004995400850017438, 'samples': 932928, 'steps': 4858, 'loss/train': 1.930557131767273} 11/06/2021 21:56:23 - INFO - __main__ - Step 4860: {'lr': 0.0004995397632002957, 'samples': 933120, 'steps': 4859, 'loss/train': 1.965431809425354} 11/06/2021 21:56:24 - INFO - __main__ - Step 4861: {'lr': 0.0004995394412864088, 'samples': 933312, 'steps': 4860, 'loss/train': 1.9139984846115112} 11/06/2021 21:56:24 - INFO - __main__ - Step 4862: {'lr': 0.0004995391192600834, 'samples': 933504, 'steps': 4861, 'loss/train': 2.268007516860962} 11/06/2021 21:56:24 - INFO - __main__ - Step 4863: {'lr': 0.0004995387971213194, 'samples': 933696, 'steps': 4862, 'loss/train': 1.8037782907485962} 11/06/2021 21:56:25 - INFO - __main__ - Step 4864: {'lr': 0.000499538474870117, 'samples': 933888, 'steps': 4863, 'loss/train': 1.7647819519042969} 11/06/2021 21:56:25 - INFO - __main__ - Step 4865: {'lr': 0.0004995381525064765, 'samples': 934080, 'steps': 4864, 'loss/train': 2.0817017555236816} 11/06/2021 21:56:26 - INFO - __main__ - Step 4866: {'lr': 0.0004995378300303979, 'samples': 934272, 'steps': 4865, 'loss/train': 2.1606099605560303} 11/06/2021 21:56:27 - INFO - __main__ - Step 4867: {'lr': 0.0004995375074418815, 'samples': 934464, 'steps': 4866, 'loss/train': 2.0382091999053955} 11/06/2021 21:56:27 - INFO - __main__ - Step 4868: {'lr': 0.0004995371847409273, 'samples': 934656, 'steps': 4867, 'loss/train': 0.9957026243209839} 11/06/2021 21:56:27 - INFO - __main__ - Step 4869: {'lr': 0.0004995368619275355, 'samples': 934848, 'steps': 4868, 'loss/train': 1.8586081266403198} 11/06/2021 21:56:28 - INFO - __main__ - Step 4870: {'lr': 0.0004995365390017062, 'samples': 935040, 'steps': 4869, 'loss/train': 1.3379945755004883} 11/06/2021 21:56:28 - INFO - __main__ - Step 4871: {'lr': 0.0004995362159634396, 'samples': 935232, 'steps': 4870, 'loss/train': 1.8667609691619873} 11/06/2021 21:56:29 - INFO - __main__ - Step 4872: {'lr': 0.0004995358928127359, 'samples': 935424, 'steps': 4871, 'loss/train': 1.3676517009735107} 11/06/2021 21:56:29 - INFO - __main__ - Step 4873: {'lr': 0.0004995355695495952, 'samples': 935616, 'steps': 4872, 'loss/train': 1.948101282119751} 11/06/2021 21:56:30 - INFO - __main__ - Step 4874: {'lr': 0.0004995352461740174, 'samples': 935808, 'steps': 4873, 'loss/train': 1.8517183065414429} 11/06/2021 21:56:30 - INFO - __main__ - Step 4875: {'lr': 0.0004995349226860031, 'samples': 936000, 'steps': 4874, 'loss/train': 1.5299654006958008} 11/06/2021 21:56:30 - INFO - __main__ - Step 4876: {'lr': 0.0004995345990855522, 'samples': 936192, 'steps': 4875, 'loss/train': 2.013723850250244} 11/06/2021 21:56:31 - INFO - __main__ - Step 4877: {'lr': 0.0004995342753726647, 'samples': 936384, 'steps': 4876, 'loss/train': 2.0262885093688965} 11/06/2021 21:56:32 - INFO - __main__ - Step 4878: {'lr': 0.0004995339515473411, 'samples': 936576, 'steps': 4877, 'loss/train': 1.6470019817352295} 11/06/2021 21:56:32 - INFO - __main__ - Step 4879: {'lr': 0.0004995336276095812, 'samples': 936768, 'steps': 4878, 'loss/train': 1.9853967428207397} 11/06/2021 21:56:32 - INFO - __main__ - Step 4880: {'lr': 0.0004995333035593853, 'samples': 936960, 'steps': 4879, 'loss/train': 1.7942310571670532} 11/06/2021 21:56:33 - INFO - __main__ - Step 4881: {'lr': 0.0004995329793967537, 'samples': 937152, 'steps': 4880, 'loss/train': 2.1231250762939453} 11/06/2021 21:56:34 - INFO - __main__ - Step 4882: {'lr': 0.0004995326551216862, 'samples': 937344, 'steps': 4881, 'loss/train': 1.599252700805664} 11/06/2021 21:56:34 - INFO - __main__ - Step 4883: {'lr': 0.0004995323307341832, 'samples': 937536, 'steps': 4882, 'loss/train': 1.833884596824646} 11/06/2021 21:56:34 - INFO - __main__ - Step 4884: {'lr': 0.0004995320062342449, 'samples': 937728, 'steps': 4883, 'loss/train': 1.6169829368591309} 11/06/2021 21:56:35 - INFO - __main__ - Step 4885: {'lr': 0.0004995316816218712, 'samples': 937920, 'steps': 4884, 'loss/train': 2.102433681488037} 11/06/2021 21:56:35 - INFO - __main__ - Step 4886: {'lr': 0.0004995313568970625, 'samples': 938112, 'steps': 4885, 'loss/train': 2.166229724884033} 11/06/2021 21:56:36 - INFO - __main__ - Step 4887: {'lr': 0.0004995310320598187, 'samples': 938304, 'steps': 4886, 'loss/train': 2.130415916442871} 11/06/2021 21:56:36 - INFO - __main__ - Step 4888: {'lr': 0.0004995307071101401, 'samples': 938496, 'steps': 4887, 'loss/train': 2.281303882598877} 11/06/2021 21:56:37 - INFO - __main__ - Step 4889: {'lr': 0.0004995303820480268, 'samples': 938688, 'steps': 4888, 'loss/train': 1.7925283908843994} 11/06/2021 21:56:37 - INFO - __main__ - Step 4890: {'lr': 0.000499530056873479, 'samples': 938880, 'steps': 4889, 'loss/train': 2.1150357723236084} 11/06/2021 21:56:37 - INFO - __main__ - Step 4891: {'lr': 0.0004995297315864968, 'samples': 939072, 'steps': 4890, 'loss/train': 1.8841792345046997} 11/06/2021 21:56:39 - INFO - __main__ - Step 4892: {'lr': 0.0004995294061870802, 'samples': 939264, 'steps': 4891, 'loss/train': 1.9895226955413818} 11/06/2021 21:56:39 - INFO - __main__ - Step 4893: {'lr': 0.0004995290806752297, 'samples': 939456, 'steps': 4892, 'loss/train': 2.05294132232666} 11/06/2021 21:56:39 - INFO - __main__ - Step 4894: {'lr': 0.0004995287550509452, 'samples': 939648, 'steps': 4893, 'loss/train': 1.6693605184555054} 11/06/2021 21:56:40 - INFO - __main__ - Step 4895: {'lr': 0.0004995284293142268, 'samples': 939840, 'steps': 4894, 'loss/train': 2.2964420318603516} 11/06/2021 21:56:40 - INFO - __main__ - Step 4896: {'lr': 0.0004995281034650748, 'samples': 940032, 'steps': 4895, 'loss/train': 2.0679450035095215} 11/06/2021 21:56:41 - INFO - __main__ - Step 4897: {'lr': 0.0004995277775034894, 'samples': 940224, 'steps': 4896, 'loss/train': 1.8550939559936523} 11/06/2021 21:56:41 - INFO - __main__ - Step 4898: {'lr': 0.0004995274514294706, 'samples': 940416, 'steps': 4897, 'loss/train': 1.983759880065918} 11/06/2021 21:56:42 - INFO - __main__ - Step 4899: {'lr': 0.0004995271252430184, 'samples': 940608, 'steps': 4898, 'loss/train': 2.2435123920440674} 11/06/2021 21:56:42 - INFO - __main__ - Step 4900: {'lr': 0.0004995267989441332, 'samples': 940800, 'steps': 4899, 'loss/train': 1.9692909717559814} 11/06/2021 21:56:42 - INFO - __main__ - Step 4901: {'lr': 0.0004995264725328151, 'samples': 940992, 'steps': 4900, 'loss/train': 1.1252635717391968} 11/06/2021 21:56:43 - INFO - __main__ - Step 4902: {'lr': 0.0004995261460090644, 'samples': 941184, 'steps': 4901, 'loss/train': 2.1014087200164795} 11/06/2021 21:56:44 - INFO - __main__ - Step 4903: {'lr': 0.0004995258193728809, 'samples': 941376, 'steps': 4902, 'loss/train': 2.087357521057129} 11/06/2021 21:56:44 - INFO - __main__ - Step 4904: {'lr': 0.0004995254926242649, 'samples': 941568, 'steps': 4903, 'loss/train': 1.3872030973434448} 11/06/2021 21:56:44 - INFO - __main__ - Step 4905: {'lr': 0.0004995251657632165, 'samples': 941760, 'steps': 4904, 'loss/train': 1.6950825452804565} 11/06/2021 21:56:45 - INFO - __main__ - Step 4906: {'lr': 0.000499524838789736, 'samples': 941952, 'steps': 4905, 'loss/train': 2.146413803100586} 11/06/2021 21:56:46 - INFO - __main__ - Step 4907: {'lr': 0.0004995245117038235, 'samples': 942144, 'steps': 4906, 'loss/train': 2.0605762004852295} 11/06/2021 21:56:46 - INFO - __main__ - Step 4908: {'lr': 0.0004995241845054791, 'samples': 942336, 'steps': 4907, 'loss/train': 2.1872684955596924} 11/06/2021 21:56:46 - INFO - __main__ - Step 4909: {'lr': 0.0004995238571947029, 'samples': 942528, 'steps': 4908, 'loss/train': 1.7782576084136963} 11/06/2021 21:56:47 - INFO - __main__ - Step 4910: {'lr': 0.0004995235297714951, 'samples': 942720, 'steps': 4909, 'loss/train': 1.769343614578247} 11/06/2021 21:56:47 - INFO - __main__ - Step 4911: {'lr': 0.0004995232022358559, 'samples': 942912, 'steps': 4910, 'loss/train': 2.6425294876098633} 11/06/2021 21:56:48 - INFO - __main__ - Step 4912: {'lr': 0.0004995228745877853, 'samples': 943104, 'steps': 4911, 'loss/train': 1.9566208124160767} 11/06/2021 21:56:49 - INFO - __main__ - Step 4913: {'lr': 0.0004995225468272836, 'samples': 943296, 'steps': 4912, 'loss/train': 1.9008264541625977} 11/06/2021 21:56:49 - INFO - __main__ - Step 4914: {'lr': 0.0004995222189543509, 'samples': 943488, 'steps': 4913, 'loss/train': 6.304197311401367} 11/06/2021 21:56:49 - INFO - __main__ - Step 4915: {'lr': 0.0004995218909689873, 'samples': 943680, 'steps': 4914, 'loss/train': 1.216681718826294} 11/06/2021 21:56:50 - INFO - __main__ - Step 4916: {'lr': 0.0004995215628711931, 'samples': 943872, 'steps': 4915, 'loss/train': 2.1877267360687256} 11/06/2021 21:56:50 - INFO - __main__ - Step 4917: {'lr': 0.0004995212346609682, 'samples': 944064, 'steps': 4916, 'loss/train': 2.259002923965454} 11/06/2021 21:56:51 - INFO - __main__ - Step 4918: {'lr': 0.0004995209063383129, 'samples': 944256, 'steps': 4917, 'loss/train': 1.607182264328003} 11/06/2021 21:56:51 - INFO - __main__ - Step 4919: {'lr': 0.0004995205779032274, 'samples': 944448, 'steps': 4918, 'loss/train': 2.302341938018799} 11/06/2021 21:56:52 - INFO - __main__ - Step 4920: {'lr': 0.0004995202493557118, 'samples': 944640, 'steps': 4919, 'loss/train': 1.7484567165374756} 11/06/2021 21:56:52 - INFO - __main__ - Step 4921: {'lr': 0.0004995199206957662, 'samples': 944832, 'steps': 4920, 'loss/train': 2.0463926792144775} 11/06/2021 21:56:53 - INFO - __main__ - Step 4922: {'lr': 0.0004995195919233906, 'samples': 945024, 'steps': 4921, 'loss/train': 2.1027209758758545} 11/06/2021 21:56:54 - INFO - __main__ - Step 4923: {'lr': 0.0004995192630385855, 'samples': 945216, 'steps': 4922, 'loss/train': 1.897849678993225} 11/06/2021 21:56:54 - INFO - __main__ - Step 4924: {'lr': 0.0004995189340413509, 'samples': 945408, 'steps': 4923, 'loss/train': 1.5796188116073608} 11/06/2021 21:56:55 - INFO - __main__ - Step 4925: {'lr': 0.0004995186049316868, 'samples': 945600, 'steps': 4924, 'loss/train': 2.1608316898345947} 11/06/2021 21:56:55 - INFO - __main__ - Step 4926: {'lr': 0.0004995182757095935, 'samples': 945792, 'steps': 4925, 'loss/train': 2.3778295516967773} 11/06/2021 21:56:55 - INFO - __main__ - Step 4927: {'lr': 0.0004995179463750712, 'samples': 945984, 'steps': 4926, 'loss/train': 2.6206376552581787} 11/06/2021 21:56:56 - INFO - __main__ - Step 4928: {'lr': 0.0004995176169281199, 'samples': 946176, 'steps': 4927, 'loss/train': 2.371328115463257} 11/06/2021 21:56:57 - INFO - __main__ - Step 4929: {'lr': 0.0004995172873687398, 'samples': 946368, 'steps': 4928, 'loss/train': 2.762335777282715} 11/06/2021 21:56:57 - INFO - __main__ - Step 4930: {'lr': 0.0004995169576969311, 'samples': 946560, 'steps': 4929, 'loss/train': 1.838969349861145} 11/06/2021 21:56:58 - INFO - __main__ - Step 4931: {'lr': 0.0004995166279126938, 'samples': 946752, 'steps': 4930, 'loss/train': 1.8372935056686401} 11/06/2021 21:56:58 - INFO - __main__ - Step 4932: {'lr': 0.0004995162980160283, 'samples': 946944, 'steps': 4931, 'loss/train': 1.7743967771530151} 11/06/2021 21:56:58 - INFO - __main__ - Step 4933: {'lr': 0.0004995159680069346, 'samples': 947136, 'steps': 4932, 'loss/train': 1.8914155960083008} 11/06/2021 21:56:59 - INFO - __main__ - Step 4934: {'lr': 0.0004995156378854127, 'samples': 947328, 'steps': 4933, 'loss/train': 1.7607487440109253} 11/06/2021 21:57:00 - INFO - __main__ - Step 4935: {'lr': 0.000499515307651463, 'samples': 947520, 'steps': 4934, 'loss/train': 2.1030728816986084} 11/06/2021 21:57:00 - INFO - __main__ - Step 4936: {'lr': 0.0004995149773050857, 'samples': 947712, 'steps': 4935, 'loss/train': 2.1479721069335938} 11/06/2021 21:57:01 - INFO - __main__ - Step 4937: {'lr': 0.0004995146468462806, 'samples': 947904, 'steps': 4936, 'loss/train': 1.576684832572937} 11/06/2021 21:57:01 - INFO - __main__ - Step 4938: {'lr': 0.0004995143162750481, 'samples': 948096, 'steps': 4937, 'loss/train': 2.200626850128174} 11/06/2021 21:57:02 - INFO - __main__ - Step 4939: {'lr': 0.0004995139855913883, 'samples': 948288, 'steps': 4938, 'loss/train': 2.6972954273223877} 11/06/2021 21:57:02 - INFO - __main__ - Step 4940: {'lr': 0.0004995136547953014, 'samples': 948480, 'steps': 4939, 'loss/train': 2.025506019592285} 11/06/2021 21:57:02 - INFO - __main__ - Step 4941: {'lr': 0.0004995133238867874, 'samples': 948672, 'steps': 4940, 'loss/train': 2.79832124710083} 11/06/2021 21:57:03 - INFO - __main__ - Step 4942: {'lr': 0.0004995129928658466, 'samples': 948864, 'steps': 4941, 'loss/train': 1.9636731147766113} 11/06/2021 21:57:03 - INFO - __main__ - Step 4943: {'lr': 0.0004995126617324791, 'samples': 949056, 'steps': 4942, 'loss/train': 2.302797317504883} 11/06/2021 21:57:04 - INFO - __main__ - Step 4944: {'lr': 0.000499512330486685, 'samples': 949248, 'steps': 4943, 'loss/train': 1.3910671472549438} 11/06/2021 21:57:05 - INFO - __main__ - Step 4945: {'lr': 0.0004995119991284645, 'samples': 949440, 'steps': 4944, 'loss/train': 1.6283067464828491} 11/06/2021 21:57:05 - INFO - __main__ - Step 4946: {'lr': 0.0004995116676578178, 'samples': 949632, 'steps': 4945, 'loss/train': 2.0636019706726074} 11/06/2021 21:57:05 - INFO - __main__ - Step 4947: {'lr': 0.000499511336074745, 'samples': 949824, 'steps': 4946, 'loss/train': 1.653847575187683} 11/06/2021 21:57:06 - INFO - __main__ - Step 4948: {'lr': 0.0004995110043792462, 'samples': 950016, 'steps': 4947, 'loss/train': 1.9270820617675781} 11/06/2021 21:57:06 - INFO - __main__ - Step 4949: {'lr': 0.0004995106725713217, 'samples': 950208, 'steps': 4948, 'loss/train': 2.0320072174072266} 11/06/2021 21:57:07 - INFO - __main__ - Step 4950: {'lr': 0.0004995103406509713, 'samples': 950400, 'steps': 4949, 'loss/train': 1.9087032079696655} 11/06/2021 21:57:07 - INFO - __main__ - Step 4951: {'lr': 0.0004995100086181957, 'samples': 950592, 'steps': 4950, 'loss/train': 2.294539451599121} 11/06/2021 21:57:08 - INFO - __main__ - Step 4952: {'lr': 0.0004995096764729945, 'samples': 950784, 'steps': 4951, 'loss/train': 1.427341341972351} 11/06/2021 21:57:08 - INFO - __main__ - Step 4953: {'lr': 0.0004995093442153681, 'samples': 950976, 'steps': 4952, 'loss/train': 1.223702073097229} 11/06/2021 21:57:09 - INFO - __main__ - Step 4954: {'lr': 0.0004995090118453167, 'samples': 951168, 'steps': 4953, 'loss/train': 1.9926605224609375} 11/06/2021 21:57:09 - INFO - __main__ - Step 4955: {'lr': 0.0004995086793628405, 'samples': 951360, 'steps': 4954, 'loss/train': 2.119666337966919} 11/06/2021 21:57:10 - INFO - __main__ - Step 4956: {'lr': 0.0004995083467679394, 'samples': 951552, 'steps': 4955, 'loss/train': 1.4242171049118042} 11/06/2021 21:57:10 - INFO - __main__ - Step 4957: {'lr': 0.0004995080140606137, 'samples': 951744, 'steps': 4956, 'loss/train': 2.439540147781372} 11/06/2021 21:57:11 - INFO - __main__ - Step 4958: {'lr': 0.0004995076812408636, 'samples': 951936, 'steps': 4957, 'loss/train': 1.9032562971115112} 11/06/2021 21:57:11 - INFO - __main__ - Step 4959: {'lr': 0.0004995073483086891, 'samples': 952128, 'steps': 4958, 'loss/train': 1.8582649230957031} 11/06/2021 21:57:12 - INFO - __main__ - Step 4960: {'lr': 0.0004995070152640905, 'samples': 952320, 'steps': 4959, 'loss/train': 2.372265100479126} 11/06/2021 21:57:12 - INFO - __main__ - Step 4961: {'lr': 0.0004995066821070679, 'samples': 952512, 'steps': 4960, 'loss/train': 1.438249945640564} 11/06/2021 21:57:13 - INFO - __main__ - Step 4962: {'lr': 0.0004995063488376214, 'samples': 952704, 'steps': 4961, 'loss/train': 2.022662401199341} 11/06/2021 21:57:13 - INFO - __main__ - Step 4963: {'lr': 0.0004995060154557513, 'samples': 952896, 'steps': 4962, 'loss/train': 2.1550307273864746} 11/06/2021 21:57:13 - INFO - __main__ - Step 4964: {'lr': 0.0004995056819614575, 'samples': 953088, 'steps': 4963, 'loss/train': 1.8373814821243286} 11/06/2021 21:57:14 - INFO - __main__ - Step 4965: {'lr': 0.0004995053483547404, 'samples': 953280, 'steps': 4964, 'loss/train': 2.0746841430664062} 11/06/2021 21:57:15 - INFO - __main__ - Step 4966: {'lr': 0.0004995050146355999, 'samples': 953472, 'steps': 4965, 'loss/train': 1.7231754064559937} 11/06/2021 21:57:15 - INFO - __main__ - Step 4967: {'lr': 0.0004995046808040363, 'samples': 953664, 'steps': 4966, 'loss/train': 2.3439464569091797} 11/06/2021 21:57:15 - INFO - __main__ - Step 4968: {'lr': 0.0004995043468600499, 'samples': 953856, 'steps': 4967, 'loss/train': 1.1684303283691406} 11/06/2021 21:57:16 - INFO - __main__ - Step 4969: {'lr': 0.0004995040128036405, 'samples': 954048, 'steps': 4968, 'loss/train': 1.818418264389038} 11/06/2021 21:57:16 - INFO - __main__ - Step 4970: {'lr': 0.0004995036786348086, 'samples': 954240, 'steps': 4969, 'loss/train': 2.059702157974243} 11/06/2021 21:57:17 - INFO - __main__ - Step 4971: {'lr': 0.0004995033443535541, 'samples': 954432, 'steps': 4970, 'loss/train': 1.7052550315856934} 11/06/2021 21:57:17 - INFO - __main__ - Step 4972: {'lr': 0.0004995030099598773, 'samples': 954624, 'steps': 4971, 'loss/train': 2.043684720993042} 11/06/2021 21:57:18 - INFO - __main__ - Step 4973: {'lr': 0.0004995026754537783, 'samples': 954816, 'steps': 4972, 'loss/train': 1.259334683418274} 11/06/2021 21:57:18 - INFO - __main__ - Step 4974: {'lr': 0.0004995023408352572, 'samples': 955008, 'steps': 4973, 'loss/train': 2.089118242263794} 11/06/2021 21:57:18 - INFO - __main__ - Step 4975: {'lr': 0.0004995020061043142, 'samples': 955200, 'steps': 4974, 'loss/train': 2.190615177154541} 11/06/2021 21:57:20 - INFO - __main__ - Step 4976: {'lr': 0.0004995016712609495, 'samples': 955392, 'steps': 4975, 'loss/train': 2.123932361602783} 11/06/2021 21:57:20 - INFO - __main__ - Step 4977: {'lr': 0.0004995013363051631, 'samples': 955584, 'steps': 4976, 'loss/train': 2.357672929763794} 11/06/2021 21:57:20 - INFO - __main__ - Step 4978: {'lr': 0.0004995010012369554, 'samples': 955776, 'steps': 4977, 'loss/train': 2.4011850357055664} 11/06/2021 21:57:21 - INFO - __main__ - Step 4979: {'lr': 0.0004995006660563262, 'samples': 955968, 'steps': 4978, 'loss/train': 2.2604053020477295} 11/06/2021 21:57:21 - INFO - __main__ - Step 4980: {'lr': 0.000499500330763276, 'samples': 956160, 'steps': 4979, 'loss/train': 1.697622537612915} 11/06/2021 21:57:22 - INFO - __main__ - Step 4981: {'lr': 0.0004994999953578048, 'samples': 956352, 'steps': 4980, 'loss/train': 2.418104887008667} 11/06/2021 21:57:22 - INFO - __main__ - Step 4982: {'lr': 0.0004994996598399127, 'samples': 956544, 'steps': 4981, 'loss/train': 1.7995312213897705} 11/06/2021 21:57:23 - INFO - __main__ - Step 4983: {'lr': 0.0004994993242095999, 'samples': 956736, 'steps': 4982, 'loss/train': 1.0818099975585938} 11/06/2021 21:57:23 - INFO - __main__ - Step 4984: {'lr': 0.0004994989884668665, 'samples': 956928, 'steps': 4983, 'loss/train': 1.7262250185012817} 11/06/2021 21:57:23 - INFO - __main__ - Step 4985: {'lr': 0.0004994986526117127, 'samples': 957120, 'steps': 4984, 'loss/train': 1.8972023725509644} 11/06/2021 21:57:24 - INFO - __main__ - Step 4986: {'lr': 0.0004994983166441388, 'samples': 957312, 'steps': 4985, 'loss/train': 1.5889555215835571} 11/06/2021 21:57:25 - INFO - __main__ - Step 4987: {'lr': 0.0004994979805641448, 'samples': 957504, 'steps': 4986, 'loss/train': 2.0608432292938232} 11/06/2021 21:57:25 - INFO - __main__ - Step 4988: {'lr': 0.0004994976443717308, 'samples': 957696, 'steps': 4987, 'loss/train': 2.1303555965423584} 11/06/2021 21:57:25 - INFO - __main__ - Step 4989: {'lr': 0.000499497308066897, 'samples': 957888, 'steps': 4988, 'loss/train': 1.3450038433074951} 11/06/2021 21:57:26 - INFO - __main__ - Step 4990: {'lr': 0.0004994969716496435, 'samples': 958080, 'steps': 4989, 'loss/train': 1.9656953811645508} 11/06/2021 21:57:27 - INFO - __main__ - Step 4991: {'lr': 0.0004994966351199706, 'samples': 958272, 'steps': 4990, 'loss/train': 1.8768229484558105} 11/06/2021 21:57:27 - INFO - __main__ - Step 4992: {'lr': 0.0004994962984778784, 'samples': 958464, 'steps': 4991, 'loss/train': 0.9767778515815735} 11/06/2021 21:57:27 - INFO - __main__ - Step 4993: {'lr': 0.0004994959617233669, 'samples': 958656, 'steps': 4992, 'loss/train': 1.952646255493164} 11/06/2021 21:57:28 - INFO - __main__ - Step 4994: {'lr': 0.0004994956248564364, 'samples': 958848, 'steps': 4993, 'loss/train': 2.012577533721924} 11/06/2021 21:57:28 - INFO - __main__ - Step 4995: {'lr': 0.000499495287877087, 'samples': 959040, 'steps': 4994, 'loss/train': 2.6770846843719482} 11/06/2021 21:57:29 - INFO - __main__ - Step 4996: {'lr': 0.000499494950785319, 'samples': 959232, 'steps': 4995, 'loss/train': 0.9503514766693115} 11/06/2021 21:57:30 - INFO - __main__ - Step 4997: {'lr': 0.0004994946135811324, 'samples': 959424, 'steps': 4996, 'loss/train': 1.3578064441680908} 11/06/2021 21:57:30 - INFO - __main__ - Step 4998: {'lr': 0.0004994942762645274, 'samples': 959616, 'steps': 4997, 'loss/train': 1.6439087390899658} 11/06/2021 21:57:30 - INFO - __main__ - Step 4999: {'lr': 0.000499493938835504, 'samples': 959808, 'steps': 4998, 'loss/train': 1.955640196800232} 11/06/2021 21:57:31 - INFO - __main__ - Step 5000: {'lr': 0.0004994936012940626, 'samples': 960000, 'steps': 4999, 'loss/train': 2.0416653156280518} 11/06/2021 21:57:32 - INFO - __main__ - Step 5001: {'lr': 0.0004994932636402031, 'samples': 960192, 'steps': 5000, 'loss/train': 2.211860418319702} 11/06/2021 21:57:32 - INFO - __main__ - Step 5002: {'lr': 0.000499492925873926, 'samples': 960384, 'steps': 5001, 'loss/train': 2.095062017440796} 11/06/2021 21:57:33 - INFO - __main__ - Step 5003: {'lr': 0.000499492587995231, 'samples': 960576, 'steps': 5002, 'loss/train': 2.049713611602783} 11/06/2021 21:57:33 - INFO - __main__ - Step 5004: {'lr': 0.0004994922500041186, 'samples': 960768, 'steps': 5003, 'loss/train': 1.5910391807556152} 11/06/2021 21:57:33 - INFO - __main__ - Step 5005: {'lr': 0.0004994919119005888, 'samples': 960960, 'steps': 5004, 'loss/train': 2.3668973445892334} 11/06/2021 21:57:34 - INFO - __main__ - Step 5006: {'lr': 0.0004994915736846418, 'samples': 961152, 'steps': 5005, 'loss/train': 2.0674057006835938} 11/06/2021 21:57:35 - INFO - __main__ - Step 5007: {'lr': 0.0004994912353562778, 'samples': 961344, 'steps': 5006, 'loss/train': 2.0364692211151123} 11/06/2021 21:57:35 - INFO - __main__ - Step 5008: {'lr': 0.0004994908969154968, 'samples': 961536, 'steps': 5007, 'loss/train': 1.763395071029663} 11/06/2021 21:57:36 - INFO - __main__ - Step 5009: {'lr': 0.0004994905583622992, 'samples': 961728, 'steps': 5008, 'loss/train': 1.6759827136993408} 11/06/2021 21:57:36 - INFO - __main__ - Step 5010: {'lr': 0.000499490219696685, 'samples': 961920, 'steps': 5009, 'loss/train': 1.3407360315322876} 11/06/2021 21:57:36 - INFO - __main__ - Step 5011: {'lr': 0.0004994898809186542, 'samples': 962112, 'steps': 5010, 'loss/train': 2.153372049331665} 11/06/2021 21:57:37 - INFO - __main__ - Step 5012: {'lr': 0.0004994895420282072, 'samples': 962304, 'steps': 5011, 'loss/train': 2.228909969329834} 11/06/2021 21:57:38 - INFO - __main__ - Step 5013: {'lr': 0.000499489203025344, 'samples': 962496, 'steps': 5012, 'loss/train': 1.8992091417312622} 11/06/2021 21:57:38 - INFO - __main__ - Step 5014: {'lr': 0.000499488863910065, 'samples': 962688, 'steps': 5013, 'loss/train': 1.6911919116973877} 11/06/2021 21:57:38 - INFO - __main__ - Step 5015: {'lr': 0.00049948852468237, 'samples': 962880, 'steps': 5014, 'loss/train': 1.5895403623580933} 11/06/2021 21:57:39 - INFO - __main__ - Step 5016: {'lr': 0.0004994881853422594, 'samples': 963072, 'steps': 5015, 'loss/train': 1.8377931118011475} 11/06/2021 21:57:40 - INFO - __main__ - Step 5017: {'lr': 0.0004994878458897332, 'samples': 963264, 'steps': 5016, 'loss/train': 2.5478765964508057} 11/06/2021 21:57:40 - INFO - __main__ - Step 5018: {'lr': 0.0004994875063247916, 'samples': 963456, 'steps': 5017, 'loss/train': 1.8936893939971924} 11/06/2021 21:57:41 - INFO - __main__ - Step 5019: {'lr': 0.0004994871666474348, 'samples': 963648, 'steps': 5018, 'loss/train': 2.324462890625} 11/06/2021 21:57:41 - INFO - __main__ - Step 5020: {'lr': 0.000499486826857663, 'samples': 963840, 'steps': 5019, 'loss/train': 1.8107075691223145} 11/06/2021 21:57:41 - INFO - __main__ - Step 5021: {'lr': 0.0004994864869554763, 'samples': 964032, 'steps': 5020, 'loss/train': 1.989519476890564} 11/06/2021 21:57:42 - INFO - __main__ - Step 5022: {'lr': 0.0004994861469408748, 'samples': 964224, 'steps': 5021, 'loss/train': 1.991037368774414} 11/06/2021 21:57:43 - INFO - __main__ - Step 5023: {'lr': 0.0004994858068138587, 'samples': 964416, 'steps': 5022, 'loss/train': 1.7982574701309204} 11/06/2021 21:57:43 - INFO - __main__ - Step 5024: {'lr': 0.0004994854665744282, 'samples': 964608, 'steps': 5023, 'loss/train': 2.380490303039551} 11/06/2021 21:57:43 - INFO - __main__ - Step 5025: {'lr': 0.0004994851262225832, 'samples': 964800, 'steps': 5024, 'loss/train': 1.3919230699539185} 11/06/2021 21:57:44 - INFO - __main__ - Step 5026: {'lr': 0.0004994847857583242, 'samples': 964992, 'steps': 5025, 'loss/train': 2.2827370166778564} 11/06/2021 21:57:45 - INFO - __main__ - Step 5027: {'lr': 0.0004994844451816512, 'samples': 965184, 'steps': 5026, 'loss/train': 2.1347203254699707} 11/06/2021 21:57:45 - INFO - __main__ - Step 5028: {'lr': 0.0004994841044925644, 'samples': 965376, 'steps': 5027, 'loss/train': 1.6065678596496582} 11/06/2021 21:57:45 - INFO - __main__ - Step 5029: {'lr': 0.0004994837636910638, 'samples': 965568, 'steps': 5028, 'loss/train': 1.7136719226837158} 11/06/2021 21:57:46 - INFO - __main__ - Step 5030: {'lr': 0.0004994834227771498, 'samples': 965760, 'steps': 5029, 'loss/train': 2.129387617111206} 11/06/2021 21:57:46 - INFO - __main__ - Step 5031: {'lr': 0.0004994830817508224, 'samples': 965952, 'steps': 5030, 'loss/train': 2.3650991916656494} 11/06/2021 21:57:47 - INFO - __main__ - Step 5032: {'lr': 0.0004994827406120816, 'samples': 966144, 'steps': 5031, 'loss/train': 1.8486300706863403} 11/06/2021 21:57:47 - INFO - __main__ - Step 5033: {'lr': 0.0004994823993609279, 'samples': 966336, 'steps': 5032, 'loss/train': 0.8590723872184753} 11/06/2021 21:57:48 - INFO - __main__ - Step 5034: {'lr': 0.0004994820579973612, 'samples': 966528, 'steps': 5033, 'loss/train': 1.7398724555969238} 11/06/2021 21:57:48 - INFO - __main__ - Step 5035: {'lr': 0.0004994817165213817, 'samples': 966720, 'steps': 5034, 'loss/train': 2.0410008430480957} 11/06/2021 21:57:48 - INFO - __main__ - Step 5036: {'lr': 0.0004994813749329897, 'samples': 966912, 'steps': 5035, 'loss/train': 1.631545066833496} 11/06/2021 21:57:50 - INFO - __main__ - Step 5037: {'lr': 0.0004994810332321852, 'samples': 967104, 'steps': 5036, 'loss/train': 1.492642879486084} 11/06/2021 21:57:50 - INFO - __main__ - Step 5038: {'lr': 0.0004994806914189684, 'samples': 967296, 'steps': 5037, 'loss/train': 2.159714460372925} 11/06/2021 21:57:50 - INFO - __main__ - Step 5039: {'lr': 0.0004994803494933394, 'samples': 967488, 'steps': 5038, 'loss/train': 2.5530550479888916} 11/06/2021 21:57:51 - INFO - __main__ - Step 5040: {'lr': 0.0004994800074552985, 'samples': 967680, 'steps': 5039, 'loss/train': 1.7678231000900269} 11/06/2021 21:57:51 - INFO - __main__ - Step 5041: {'lr': 0.0004994796653048457, 'samples': 967872, 'steps': 5040, 'loss/train': 2.178983211517334} 11/06/2021 21:57:51 - INFO - __main__ - Step 5042: {'lr': 0.0004994793230419812, 'samples': 968064, 'steps': 5041, 'loss/train': 2.832493305206299} 11/06/2021 21:57:52 - INFO - __main__ - Step 5043: {'lr': 0.0004994789806667052, 'samples': 968256, 'steps': 5042, 'loss/train': 1.9693965911865234} 11/06/2021 21:57:53 - INFO - __main__ - Step 5044: {'lr': 0.0004994786381790178, 'samples': 968448, 'steps': 5043, 'loss/train': 1.6492239236831665} 11/06/2021 21:57:53 - INFO - __main__ - Step 5045: {'lr': 0.0004994782955789191, 'samples': 968640, 'steps': 5044, 'loss/train': 1.885124683380127} 11/06/2021 21:57:53 - INFO - __main__ - Step 5046: {'lr': 0.0004994779528664095, 'samples': 968832, 'steps': 5045, 'loss/train': 2.0043511390686035} 11/06/2021 21:57:54 - INFO - __main__ - Step 5047: {'lr': 0.0004994776100414888, 'samples': 969024, 'steps': 5046, 'loss/train': 2.129291534423828} 11/06/2021 21:57:55 - INFO - __main__ - Step 5048: {'lr': 0.0004994772671041575, 'samples': 969216, 'steps': 5047, 'loss/train': 2.2023603916168213} 11/06/2021 21:57:55 - INFO - __main__ - Step 5049: {'lr': 0.0004994769240544155, 'samples': 969408, 'steps': 5048, 'loss/train': 2.209613084793091} 11/06/2021 21:57:55 - INFO - __main__ - Step 5050: {'lr': 0.000499476580892263, 'samples': 969600, 'steps': 5049, 'loss/train': 1.4724314212799072} 11/06/2021 21:57:56 - INFO - __main__ - Step 5051: {'lr': 0.0004994762376177004, 'samples': 969792, 'steps': 5050, 'loss/train': 2.012944459915161} 11/06/2021 21:57:56 - INFO - __main__ - Step 5052: {'lr': 0.0004994758942307274, 'samples': 969984, 'steps': 5051, 'loss/train': 1.9832876920700073} 11/06/2021 21:57:57 - INFO - __main__ - Step 5053: {'lr': 0.0004994755507313446, 'samples': 970176, 'steps': 5052, 'loss/train': 2.164370536804199} 11/06/2021 21:57:58 - INFO - __main__ - Step 5054: {'lr': 0.000499475207119552, 'samples': 970368, 'steps': 5053, 'loss/train': 2.118231773376465} 11/06/2021 21:57:58 - INFO - __main__ - Step 5055: {'lr': 0.0004994748633953495, 'samples': 970560, 'steps': 5054, 'loss/train': 1.800482153892517} 11/06/2021 21:57:58 - INFO - __main__ - Step 5056: {'lr': 0.0004994745195587376, 'samples': 970752, 'steps': 5055, 'loss/train': 2.163815975189209} 11/06/2021 21:57:59 - INFO - __main__ - Step 5057: {'lr': 0.0004994741756097164, 'samples': 970944, 'steps': 5056, 'loss/train': 2.414111852645874} 11/06/2021 21:58:00 - INFO - __main__ - Step 5058: {'lr': 0.0004994738315482859, 'samples': 971136, 'steps': 5057, 'loss/train': 1.5080969333648682} 11/06/2021 21:58:00 - INFO - __main__ - Step 5059: {'lr': 0.0004994734873744464, 'samples': 971328, 'steps': 5058, 'loss/train': 2.688166379928589} 11/06/2021 21:58:00 - INFO - __main__ - Step 5060: {'lr': 0.0004994731430881979, 'samples': 971520, 'steps': 5059, 'loss/train': 2.305680990219116} 11/06/2021 21:58:01 - INFO - __main__ - Step 5061: {'lr': 0.0004994727986895408, 'samples': 971712, 'steps': 5060, 'loss/train': 1.9738892316818237} 11/06/2021 21:58:01 - INFO - __main__ - Step 5062: {'lr': 0.0004994724541784749, 'samples': 971904, 'steps': 5061, 'loss/train': 2.3541839122772217} 11/06/2021 21:58:02 - INFO - __main__ - Step 5063: {'lr': 0.0004994721095550008, 'samples': 972096, 'steps': 5062, 'loss/train': 1.869810938835144} 11/06/2021 21:58:02 - INFO - __main__ - Step 5064: {'lr': 0.0004994717648191182, 'samples': 972288, 'steps': 5063, 'loss/train': 1.6675723791122437} 11/06/2021 21:58:03 - INFO - __main__ - Step 5065: {'lr': 0.0004994714199708276, 'samples': 972480, 'steps': 5064, 'loss/train': 0.3723197877407074} 11/06/2021 21:58:03 - INFO - __main__ - Step 5066: {'lr': 0.000499471075010129, 'samples': 972672, 'steps': 5065, 'loss/train': 6.243468761444092} 11/06/2021 21:58:03 - INFO - __main__ - Step 5067: {'lr': 0.0004994707299370226, 'samples': 972864, 'steps': 5066, 'loss/train': 2.330059051513672} 11/06/2021 21:58:04 - INFO - __main__ - Step 5068: {'lr': 0.0004994703847515084, 'samples': 973056, 'steps': 5067, 'loss/train': 1.0603464841842651} 11/06/2021 21:58:05 - INFO - __main__ - Step 5069: {'lr': 0.0004994700394535869, 'samples': 973248, 'steps': 5068, 'loss/train': 2.0306475162506104} 11/06/2021 21:58:05 - INFO - __main__ - Step 5070: {'lr': 0.000499469694043258, 'samples': 973440, 'steps': 5069, 'loss/train': 2.2023260593414307} 11/06/2021 21:58:06 - INFO - __main__ - Step 5071: {'lr': 0.0004994693485205218, 'samples': 973632, 'steps': 5070, 'loss/train': 1.8960644006729126} 11/06/2021 21:58:06 - INFO - __main__ - Step 5072: {'lr': 0.0004994690028853787, 'samples': 973824, 'steps': 5071, 'loss/train': 1.8335449695587158} 11/06/2021 21:58:07 - INFO - __main__ - Step 5073: {'lr': 0.0004994686571378286, 'samples': 974016, 'steps': 5072, 'loss/train': 0.4239940643310547} 11/06/2021 21:58:08 - INFO - __main__ - Step 5074: {'lr': 0.0004994683112778718, 'samples': 974208, 'steps': 5073, 'loss/train': 1.9211030006408691} 11/06/2021 21:58:08 - INFO - __main__ - Step 5075: {'lr': 0.0004994679653055085, 'samples': 974400, 'steps': 5074, 'loss/train': 2.271078109741211} 11/06/2021 21:58:08 - INFO - __main__ - Step 5076: {'lr': 0.0004994676192207387, 'samples': 974592, 'steps': 5075, 'loss/train': 1.9957528114318848} 11/06/2021 21:58:09 - INFO - __main__ - Step 5077: {'lr': 0.0004994672730235626, 'samples': 974784, 'steps': 5076, 'loss/train': 2.396184206008911} 11/06/2021 21:58:09 - INFO - __main__ - Step 5078: {'lr': 0.0004994669267139806, 'samples': 974976, 'steps': 5077, 'loss/train': 2.0437135696411133} 11/06/2021 21:58:10 - INFO - __main__ - Step 5079: {'lr': 0.0004994665802919925, 'samples': 975168, 'steps': 5078, 'loss/train': 2.0462563037872314} 11/06/2021 21:58:11 - INFO - __main__ - Step 5080: {'lr': 0.0004994662337575986, 'samples': 975360, 'steps': 5079, 'loss/train': 2.834306478500366} 11/06/2021 21:58:11 - INFO - __main__ - Step 5081: {'lr': 0.000499465887110799, 'samples': 975552, 'steps': 5080, 'loss/train': 2.6196563243865967} 11/06/2021 21:58:11 - INFO - __main__ - Step 5082: {'lr': 0.0004994655403515941, 'samples': 975744, 'steps': 5081, 'loss/train': 1.8559695482254028} 11/06/2021 21:58:12 - INFO - __main__ - Step 5083: {'lr': 0.0004994651934799837, 'samples': 975936, 'steps': 5082, 'loss/train': 2.3033602237701416} 11/06/2021 21:58:12 - INFO - __main__ - Step 5084: {'lr': 0.0004994648464959683, 'samples': 976128, 'steps': 5083, 'loss/train': 1.8257298469543457} 11/06/2021 21:58:13 - INFO - __main__ - Step 5085: {'lr': 0.0004994644993995478, 'samples': 976320, 'steps': 5084, 'loss/train': 2.538905143737793} 11/06/2021 21:58:13 - INFO - __main__ - Step 5086: {'lr': 0.0004994641521907224, 'samples': 976512, 'steps': 5085, 'loss/train': 2.443490743637085} 11/06/2021 21:58:14 - INFO - __main__ - Step 5087: {'lr': 0.0004994638048694924, 'samples': 976704, 'steps': 5086, 'loss/train': 2.3946332931518555} 11/06/2021 21:58:14 - INFO - __main__ - Step 5088: {'lr': 0.0004994634574358579, 'samples': 976896, 'steps': 5087, 'loss/train': 1.450162649154663} 11/06/2021 21:58:14 - INFO - __main__ - Step 5089: {'lr': 0.0004994631098898188, 'samples': 977088, 'steps': 5088, 'loss/train': 1.5360685586929321} 11/06/2021 21:58:15 - INFO - __main__ - Step 5090: {'lr': 0.0004994627622313757, 'samples': 977280, 'steps': 5089, 'loss/train': 1.8420456647872925} 11/06/2021 21:58:16 - INFO - __main__ - Step 5091: {'lr': 0.0004994624144605284, 'samples': 977472, 'steps': 5090, 'loss/train': 2.4474048614501953} 11/06/2021 21:58:16 - INFO - __main__ - Step 5092: {'lr': 0.0004994620665772772, 'samples': 977664, 'steps': 5091, 'loss/train': 1.8615306615829468} 11/06/2021 21:58:16 - INFO - __main__ - Step 5093: {'lr': 0.0004994617185816222, 'samples': 977856, 'steps': 5092, 'loss/train': 2.000317335128784} 11/06/2021 21:58:17 - INFO - __main__ - Step 5094: {'lr': 0.0004994613704735638, 'samples': 978048, 'steps': 5093, 'loss/train': 2.3359320163726807} 11/06/2021 21:58:18 - INFO - __main__ - Step 5095: {'lr': 0.0004994610222531018, 'samples': 978240, 'steps': 5094, 'loss/train': 2.213413953781128} 11/06/2021 21:58:18 - INFO - __main__ - Step 5096: {'lr': 0.0004994606739202365, 'samples': 978432, 'steps': 5095, 'loss/train': 2.049318552017212} 11/06/2021 21:58:19 - INFO - __main__ - Step 5097: {'lr': 0.0004994603254749681, 'samples': 978624, 'steps': 5096, 'loss/train': 2.276933431625366} 11/06/2021 21:58:19 - INFO - __main__ - Step 5098: {'lr': 0.0004994599769172967, 'samples': 978816, 'steps': 5097, 'loss/train': 2.2054691314697266} 11/06/2021 21:58:19 - INFO - __main__ - Step 5099: {'lr': 0.0004994596282472225, 'samples': 979008, 'steps': 5098, 'loss/train': 1.958910346031189} 11/06/2021 21:58:20 - INFO - __main__ - Step 5100: {'lr': 0.0004994592794647457, 'samples': 979200, 'steps': 5099, 'loss/train': 2.0862319469451904} 11/06/2021 21:58:21 - INFO - __main__ - Step 5101: {'lr': 0.0004994589305698663, 'samples': 979392, 'steps': 5100, 'loss/train': 1.7089622020721436} 11/06/2021 21:58:21 - INFO - __main__ - Step 5102: {'lr': 0.0004994585815625847, 'samples': 979584, 'steps': 5101, 'loss/train': 1.952847957611084} 11/06/2021 21:58:21 - INFO - __main__ - Step 5103: {'lr': 0.0004994582324429008, 'samples': 979776, 'steps': 5102, 'loss/train': 1.894245982170105} 11/06/2021 21:58:22 - INFO - __main__ - Step 5104: {'lr': 0.0004994578832108148, 'samples': 979968, 'steps': 5103, 'loss/train': 1.3645782470703125} 11/06/2021 21:58:22 - INFO - __main__ - Step 5105: {'lr': 0.000499457533866327, 'samples': 980160, 'steps': 5104, 'loss/train': 2.4554972648620605} 11/06/2021 21:58:23 - INFO - __main__ - Step 5106: {'lr': 0.0004994571844094375, 'samples': 980352, 'steps': 5105, 'loss/train': 2.340075731277466} 11/06/2021 21:58:23 - INFO - __main__ - Step 5107: {'lr': 0.0004994568348401466, 'samples': 980544, 'steps': 5106, 'loss/train': 1.1222628355026245} 11/06/2021 21:58:24 - INFO - __main__ - Step 5108: {'lr': 0.0004994564851584541, 'samples': 980736, 'steps': 5107, 'loss/train': 1.5331889390945435} 11/06/2021 21:58:24 - INFO - __main__ - Step 5109: {'lr': 0.0004994561353643604, 'samples': 980928, 'steps': 5108, 'loss/train': 1.8134424686431885} 11/06/2021 21:58:25 - INFO - __main__ - Step 5110: {'lr': 0.0004994557854578656, 'samples': 981120, 'steps': 5109, 'loss/train': 1.9003137350082397} 11/06/2021 21:58:26 - INFO - __main__ - Step 5111: {'lr': 0.0004994554354389699, 'samples': 981312, 'steps': 5110, 'loss/train': 1.9330469369888306} 11/06/2021 21:58:26 - INFO - __main__ - Step 5112: {'lr': 0.0004994550853076734, 'samples': 981504, 'steps': 5111, 'loss/train': 1.9353581666946411} 11/06/2021 21:58:27 - INFO - __main__ - Step 5113: {'lr': 0.0004994547350639764, 'samples': 981696, 'steps': 5112, 'loss/train': 2.078845500946045} 11/06/2021 21:58:27 - INFO - __main__ - Step 5114: {'lr': 0.0004994543847078787, 'samples': 981888, 'steps': 5113, 'loss/train': 1.3129998445510864} 11/06/2021 21:58:27 - INFO - __main__ - Step 5115: {'lr': 0.000499454034239381, 'samples': 982080, 'steps': 5114, 'loss/train': 1.7932987213134766} 11/06/2021 21:58:28 - INFO - __main__ - Step 5116: {'lr': 0.000499453683658483, 'samples': 982272, 'steps': 5115, 'loss/train': 2.2063400745391846} 11/06/2021 21:58:28 - INFO - __main__ - Step 5117: {'lr': 0.0004994533329651849, 'samples': 982464, 'steps': 5116, 'loss/train': 1.820799469947815} 11/06/2021 21:58:29 - INFO - __main__ - Step 5118: {'lr': 0.0004994529821594872, 'samples': 982656, 'steps': 5117, 'loss/train': 2.5855801105499268} 11/06/2021 21:58:29 - INFO - __main__ - Step 5119: {'lr': 0.0004994526312413897, 'samples': 982848, 'steps': 5118, 'loss/train': 1.6690380573272705} 11/06/2021 21:58:30 - INFO - __main__ - Step 5120: {'lr': 0.0004994522802108927, 'samples': 983040, 'steps': 5119, 'loss/train': 1.896449327468872} 11/06/2021 21:58:30 - INFO - __main__ - Step 5121: {'lr': 0.0004994519290679964, 'samples': 983232, 'steps': 5120, 'loss/train': 1.6968704462051392} 11/06/2021 21:58:31 - INFO - __main__ - Step 5122: {'lr': 0.0004994515778127009, 'samples': 983424, 'steps': 5121, 'loss/train': 1.7569773197174072} 11/06/2021 21:58:31 - INFO - __main__ - Step 5123: {'lr': 0.0004994512264450064, 'samples': 983616, 'steps': 5122, 'loss/train': 2.063852310180664} 11/06/2021 21:58:32 - INFO - __main__ - Step 5124: {'lr': 0.000499450874964913, 'samples': 983808, 'steps': 5123, 'loss/train': 2.1291959285736084} 11/06/2021 21:58:32 - INFO - __main__ - Step 5125: {'lr': 0.000499450523372421, 'samples': 984000, 'steps': 5124, 'loss/train': 2.469874143600464} 11/06/2021 21:58:32 - INFO - __main__ - Step 5126: {'lr': 0.0004994501716675303, 'samples': 984192, 'steps': 5125, 'loss/train': 1.7951209545135498} 11/06/2021 21:58:33 - INFO - __main__ - Step 5127: {'lr': 0.0004994498198502412, 'samples': 984384, 'steps': 5126, 'loss/train': 2.0104482173919678} 11/06/2021 21:58:34 - INFO - __main__ - Step 5128: {'lr': 0.0004994494679205539, 'samples': 984576, 'steps': 5127, 'loss/train': 2.0760858058929443} 11/06/2021 21:58:34 - INFO - __main__ - Step 5129: {'lr': 0.0004994491158784684, 'samples': 984768, 'steps': 5128, 'loss/train': 2.1686079502105713} 11/06/2021 21:58:34 - INFO - __main__ - Step 5130: {'lr': 0.0004994487637239851, 'samples': 984960, 'steps': 5129, 'loss/train': 1.1807832717895508} 11/06/2021 21:58:35 - INFO - __main__ - Step 5131: {'lr': 0.0004994484114571041, 'samples': 985152, 'steps': 5130, 'loss/train': 1.4355896711349487} 11/06/2021 21:58:36 - INFO - __main__ - Step 5132: {'lr': 0.0004994480590778254, 'samples': 985344, 'steps': 5131, 'loss/train': 1.8569763898849487} 11/06/2021 21:58:36 - INFO - __main__ - Step 5133: {'lr': 0.0004994477065861493, 'samples': 985536, 'steps': 5132, 'loss/train': 2.1365954875946045} 11/06/2021 21:58:37 - INFO - __main__ - Step 5134: {'lr': 0.0004994473539820758, 'samples': 985728, 'steps': 5133, 'loss/train': 1.7661622762680054} 11/06/2021 21:58:37 - INFO - __main__ - Step 5135: {'lr': 0.0004994470012656052, 'samples': 985920, 'steps': 5134, 'loss/train': 2.2697160243988037} 11/06/2021 21:58:37 - INFO - __main__ - Step 5136: {'lr': 0.0004994466484367378, 'samples': 986112, 'steps': 5135, 'loss/train': 2.3424150943756104} 11/06/2021 21:58:38 - INFO - __main__ - Step 5137: {'lr': 0.0004994462954954734, 'samples': 986304, 'steps': 5136, 'loss/train': 1.250626564025879} 11/06/2021 21:58:39 - INFO - __main__ - Step 5138: {'lr': 0.0004994459424418125, 'samples': 986496, 'steps': 5137, 'loss/train': 2.22953462600708} 11/06/2021 21:58:39 - INFO - __main__ - Step 5139: {'lr': 0.000499445589275755, 'samples': 986688, 'steps': 5138, 'loss/train': 2.005419969558716} 11/06/2021 21:58:39 - INFO - __main__ - Step 5140: {'lr': 0.0004994452359973012, 'samples': 986880, 'steps': 5139, 'loss/train': 1.6053614616394043} 11/06/2021 21:58:40 - INFO - __main__ - Step 5141: {'lr': 0.0004994448826064512, 'samples': 987072, 'steps': 5140, 'loss/train': 2.132984161376953} 11/06/2021 21:58:41 - INFO - __main__ - Step 5142: {'lr': 0.0004994445291032053, 'samples': 987264, 'steps': 5141, 'loss/train': 1.4963831901550293} 11/06/2021 21:58:41 - INFO - __main__ - Step 5143: {'lr': 0.0004994441754875634, 'samples': 987456, 'steps': 5142, 'loss/train': 2.0756425857543945} 11/06/2021 21:58:41 - INFO - __main__ - Step 5144: {'lr': 0.0004994438217595259, 'samples': 987648, 'steps': 5143, 'loss/train': 2.1674396991729736} 11/06/2021 21:58:42 - INFO - __main__ - Step 5145: {'lr': 0.0004994434679190928, 'samples': 987840, 'steps': 5144, 'loss/train': 2.1360890865325928} 11/06/2021 21:58:42 - INFO - __main__ - Step 5146: {'lr': 0.0004994431139662643, 'samples': 988032, 'steps': 5145, 'loss/train': 2.855836868286133} 11/06/2021 21:58:43 - INFO - __main__ - Step 5147: {'lr': 0.0004994427599010406, 'samples': 988224, 'steps': 5146, 'loss/train': 2.0516815185546875} 11/06/2021 21:58:44 - INFO - __main__ - Step 5148: {'lr': 0.0004994424057234219, 'samples': 988416, 'steps': 5147, 'loss/train': 1.951188087463379} 11/06/2021 21:58:45 - INFO - __main__ - Step 5149: {'lr': 0.0004994420514334082, 'samples': 988608, 'steps': 5148, 'loss/train': 1.9776523113250732} 11/06/2021 21:58:45 - INFO - __main__ - Step 5150: {'lr': 0.0004994416970309999, 'samples': 988800, 'steps': 5149, 'loss/train': 2.2853457927703857} 11/06/2021 21:58:45 - INFO - __main__ - Step 5151: {'lr': 0.0004994413425161969, 'samples': 988992, 'steps': 5150, 'loss/train': 1.2515465021133423} 11/06/2021 21:58:46 - INFO - __main__ - Step 5152: {'lr': 0.0004994409878889995, 'samples': 989184, 'steps': 5151, 'loss/train': 1.107923984527588} 11/06/2021 21:58:46 - INFO - __main__ - Step 5153: {'lr': 0.0004994406331494079, 'samples': 989376, 'steps': 5152, 'loss/train': 0.8732290267944336} 11/06/2021 21:58:47 - INFO - __main__ - Step 5154: {'lr': 0.0004994402782974222, 'samples': 989568, 'steps': 5153, 'loss/train': 1.9641544818878174} 11/06/2021 21:58:47 - INFO - __main__ - Step 5155: {'lr': 0.0004994399233330426, 'samples': 989760, 'steps': 5154, 'loss/train': 2.2096810340881348} 11/06/2021 21:58:48 - INFO - __main__ - Step 5156: {'lr': 0.000499439568256269, 'samples': 989952, 'steps': 5155, 'loss/train': 1.8779668807983398} 11/06/2021 21:58:48 - INFO - __main__ - Step 5157: {'lr': 0.000499439213067102, 'samples': 990144, 'steps': 5156, 'loss/train': 2.0117697715759277} 11/06/2021 21:58:48 - INFO - __main__ - Step 5158: {'lr': 0.0004994388577655415, 'samples': 990336, 'steps': 5157, 'loss/train': 1.9738129377365112} 11/06/2021 21:58:49 - INFO - __main__ - Step 5159: {'lr': 0.0004994385023515876, 'samples': 990528, 'steps': 5158, 'loss/train': 2.6475677490234375} 11/06/2021 21:58:50 - INFO - __main__ - Step 5160: {'lr': 0.0004994381468252406, 'samples': 990720, 'steps': 5159, 'loss/train': 2.463541269302368} 11/06/2021 21:58:50 - INFO - __main__ - Step 5161: {'lr': 0.0004994377911865007, 'samples': 990912, 'steps': 5160, 'loss/train': 2.4826114177703857} 11/06/2021 21:58:51 - INFO - __main__ - Step 5162: {'lr': 0.0004994374354353679, 'samples': 991104, 'steps': 5161, 'loss/train': 2.0047404766082764} 11/06/2021 21:58:51 - INFO - __main__ - Step 5163: {'lr': 0.0004994370795718425, 'samples': 991296, 'steps': 5162, 'loss/train': 2.0610859394073486} 11/06/2021 21:58:51 - INFO - __main__ - Step 5164: {'lr': 0.0004994367235959245, 'samples': 991488, 'steps': 5163, 'loss/train': 1.4020670652389526} 11/06/2021 21:58:52 - INFO - __main__ - Step 5165: {'lr': 0.0004994363675076143, 'samples': 991680, 'steps': 5164, 'loss/train': 1.8617390394210815} 11/06/2021 21:58:53 - INFO - __main__ - Step 5166: {'lr': 0.0004994360113069118, 'samples': 991872, 'steps': 5165, 'loss/train': 2.2617647647857666} 11/06/2021 21:58:53 - INFO - __main__ - Step 5167: {'lr': 0.0004994356549938173, 'samples': 992064, 'steps': 5166, 'loss/train': 2.5071494579315186} 11/06/2021 21:58:53 - INFO - __main__ - Step 5168: {'lr': 0.000499435298568331, 'samples': 992256, 'steps': 5167, 'loss/train': 2.4085593223571777} 11/06/2021 21:58:54 - INFO - __main__ - Step 5169: {'lr': 0.000499434942030453, 'samples': 992448, 'steps': 5168, 'loss/train': 1.7436469793319702} 11/06/2021 21:58:55 - INFO - __main__ - Step 5170: {'lr': 0.0004994345853801834, 'samples': 992640, 'steps': 5169, 'loss/train': 1.887163758277893} 11/06/2021 21:58:55 - INFO - __main__ - Step 5171: {'lr': 0.0004994342286175225, 'samples': 992832, 'steps': 5170, 'loss/train': 1.4238390922546387} 11/06/2021 21:58:56 - INFO - __main__ - Step 5172: {'lr': 0.0004994338717424704, 'samples': 993024, 'steps': 5171, 'loss/train': 1.6073421239852905} 11/06/2021 21:58:56 - INFO - __main__ - Step 5173: {'lr': 0.0004994335147550272, 'samples': 993216, 'steps': 5172, 'loss/train': 2.328388214111328} 11/06/2021 21:58:56 - INFO - __main__ - Step 5174: {'lr': 0.0004994331576551931, 'samples': 993408, 'steps': 5173, 'loss/train': 2.2518372535705566} 11/06/2021 21:58:57 - INFO - __main__ - Step 5175: {'lr': 0.0004994328004429683, 'samples': 993600, 'steps': 5174, 'loss/train': 1.5607566833496094} 11/06/2021 21:58:58 - INFO - __main__ - Step 5176: {'lr': 0.000499432443118353, 'samples': 993792, 'steps': 5175, 'loss/train': 2.3936564922332764} 11/06/2021 21:58:58 - INFO - __main__ - Step 5177: {'lr': 0.0004994320856813471, 'samples': 993984, 'steps': 5176, 'loss/train': 1.8367432355880737} 11/06/2021 21:58:58 - INFO - __main__ - Step 5178: {'lr': 0.000499431728131951, 'samples': 994176, 'steps': 5177, 'loss/train': 1.9661235809326172} 11/06/2021 21:58:59 - INFO - __main__ - Step 5179: {'lr': 0.0004994313704701648, 'samples': 994368, 'steps': 5178, 'loss/train': 2.193740129470825} 11/06/2021 21:59:00 - INFO - __main__ - Step 5180: {'lr': 0.0004994310126959887, 'samples': 994560, 'steps': 5179, 'loss/train': 1.998255968093872} 11/06/2021 21:59:00 - INFO - __main__ - Step 5181: {'lr': 0.000499430654809423, 'samples': 994752, 'steps': 5180, 'loss/train': 2.1848907470703125} 11/06/2021 21:59:00 - INFO - __main__ - Step 5182: {'lr': 0.0004994302968104675, 'samples': 994944, 'steps': 5181, 'loss/train': 2.1371874809265137} 11/06/2021 21:59:01 - INFO - __main__ - Step 5183: {'lr': 0.0004994299386991227, 'samples': 995136, 'steps': 5182, 'loss/train': 2.302549362182617} 11/06/2021 21:59:01 - INFO - __main__ - Step 5184: {'lr': 0.0004994295804753885, 'samples': 995328, 'steps': 5183, 'loss/train': 1.9894294738769531} 11/06/2021 21:59:02 - INFO - __main__ - Step 5185: {'lr': 0.0004994292221392652, 'samples': 995520, 'steps': 5184, 'loss/train': 2.0415422916412354} 11/06/2021 21:59:02 - INFO - __main__ - Step 5186: {'lr': 0.000499428863690753, 'samples': 995712, 'steps': 5185, 'loss/train': 2.606383800506592} 11/06/2021 21:59:03 - INFO - __main__ - Step 5187: {'lr': 0.0004994285051298519, 'samples': 995904, 'steps': 5186, 'loss/train': 2.254190444946289} 11/06/2021 21:59:03 - INFO - __main__ - Step 5188: {'lr': 0.0004994281464565623, 'samples': 996096, 'steps': 5187, 'loss/train': 1.9112964868545532} 11/06/2021 21:59:03 - INFO - __main__ - Step 5189: {'lr': 0.0004994277876708841, 'samples': 996288, 'steps': 5188, 'loss/train': 1.7943540811538696} 11/06/2021 21:59:05 - INFO - __main__ - Step 5190: {'lr': 0.0004994274287728177, 'samples': 996480, 'steps': 5189, 'loss/train': 2.0679004192352295} 11/06/2021 21:59:05 - INFO - __main__ - Step 5191: {'lr': 0.0004994270697623631, 'samples': 996672, 'steps': 5190, 'loss/train': 1.3824659585952759} 11/06/2021 21:59:06 - INFO - __main__ - Step 5192: {'lr': 0.0004994267106395205, 'samples': 996864, 'steps': 5191, 'loss/train': 1.7324517965316772} 11/06/2021 21:59:06 - INFO - __main__ - Step 5193: {'lr': 0.0004994263514042901, 'samples': 997056, 'steps': 5192, 'loss/train': 0.9526032209396362} 11/06/2021 21:59:06 - INFO - __main__ - Step 5194: {'lr': 0.0004994259920566719, 'samples': 997248, 'steps': 5193, 'loss/train': 0.5770443677902222} 11/06/2021 21:59:07 - INFO - __main__ - Step 5195: {'lr': 0.0004994256325966663, 'samples': 997440, 'steps': 5194, 'loss/train': 1.4841455221176147} 11/06/2021 21:59:08 - INFO - __main__ - Step 5196: {'lr': 0.0004994252730242734, 'samples': 997632, 'steps': 5195, 'loss/train': 2.2125959396362305} 11/06/2021 21:59:08 - INFO - __main__ - Step 5197: {'lr': 0.0004994249133394933, 'samples': 997824, 'steps': 5196, 'loss/train': 1.9928648471832275} 11/06/2021 21:59:08 - INFO - __main__ - Step 5198: {'lr': 0.0004994245535423262, 'samples': 998016, 'steps': 5197, 'loss/train': 2.454911947250366} 11/06/2021 21:59:09 - INFO - __main__ - Step 5199: {'lr': 0.0004994241936327722, 'samples': 998208, 'steps': 5198, 'loss/train': 1.8284403085708618} 11/06/2021 21:59:10 - INFO - __main__ - Step 5200: {'lr': 0.0004994238336108315, 'samples': 998400, 'steps': 5199, 'loss/train': 1.941856861114502} 11/06/2021 21:59:10 - INFO - __main__ - Step 5201: {'lr': 0.0004994234734765043, 'samples': 998592, 'steps': 5200, 'loss/train': 1.9864927530288696} 11/06/2021 21:59:10 - INFO - __main__ - Step 5202: {'lr': 0.0004994231132297907, 'samples': 998784, 'steps': 5201, 'loss/train': 1.9978981018066406} 11/06/2021 21:59:11 - INFO - __main__ - Step 5203: {'lr': 0.0004994227528706909, 'samples': 998976, 'steps': 5202, 'loss/train': 0.5917396545410156} 11/06/2021 21:59:11 - INFO - __main__ - Step 5204: {'lr': 0.0004994223923992052, 'samples': 999168, 'steps': 5203, 'loss/train': 1.8816064596176147} 11/06/2021 21:59:12 - INFO - __main__ - Step 5205: {'lr': 0.0004994220318153334, 'samples': 999360, 'steps': 5204, 'loss/train': 1.95060133934021} 11/06/2021 21:59:13 - INFO - __main__ - Step 5206: {'lr': 0.000499421671119076, 'samples': 999552, 'steps': 5205, 'loss/train': 2.2630045413970947} 11/06/2021 21:59:13 - INFO - __main__ - Step 5207: {'lr': 0.0004994213103104331, 'samples': 999744, 'steps': 5206, 'loss/train': 1.9655598402023315} 11/06/2021 21:59:13 - INFO - __main__ - Step 5208: {'lr': 0.0004994209493894046, 'samples': 999936, 'steps': 5207, 'loss/train': 2.0341320037841797} 11/06/2021 21:59:14 - INFO - __main__ - Step 5209: {'lr': 0.000499420588355991, 'samples': 1000128, 'steps': 5208, 'loss/train': 2.1907718181610107} 11/06/2021 21:59:14 - INFO - __main__ - Step 5210: {'lr': 0.0004994202272101923, 'samples': 1000320, 'steps': 5209, 'loss/train': 2.0395348072052} 11/06/2021 21:59:15 - INFO - __main__ - Step 5211: {'lr': 0.0004994198659520087, 'samples': 1000512, 'steps': 5210, 'loss/train': 1.4296295642852783} 11/06/2021 21:59:15 - INFO - __main__ - Step 5212: {'lr': 0.0004994195045814404, 'samples': 1000704, 'steps': 5211, 'loss/train': 1.9126372337341309} 11/06/2021 21:59:16 - INFO - __main__ - Step 5213: {'lr': 0.0004994191430984876, 'samples': 1000896, 'steps': 5212, 'loss/train': 1.5323981046676636} 11/06/2021 21:59:16 - INFO - __main__ - Step 5214: {'lr': 0.0004994187815031502, 'samples': 1001088, 'steps': 5213, 'loss/train': 2.079664468765259} 11/06/2021 21:59:16 - INFO - __main__ - Step 5215: {'lr': 0.0004994184197954286, 'samples': 1001280, 'steps': 5214, 'loss/train': 2.4426677227020264} 11/06/2021 21:59:18 - INFO - __main__ - Step 5216: {'lr': 0.000499418057975323, 'samples': 1001472, 'steps': 5215, 'loss/train': 1.766026258468628} 11/06/2021 21:59:18 - INFO - __main__ - Step 5217: {'lr': 0.0004994176960428333, 'samples': 1001664, 'steps': 5216, 'loss/train': 2.4069254398345947} 11/06/2021 21:59:18 - INFO - __main__ - Step 5218: {'lr': 0.00049941733399796, 'samples': 1001856, 'steps': 5217, 'loss/train': 2.27449369430542} 11/06/2021 21:59:19 - INFO - __main__ - Step 5219: {'lr': 0.000499416971840703, 'samples': 1002048, 'steps': 5218, 'loss/train': 1.5242971181869507} 11/06/2021 21:59:19 - INFO - __main__ - Step 5220: {'lr': 0.0004994166095710626, 'samples': 1002240, 'steps': 5219, 'loss/train': 1.8974841833114624} 11/06/2021 21:59:20 - INFO - __main__ - Step 5221: {'lr': 0.000499416247189039, 'samples': 1002432, 'steps': 5220, 'loss/train': 1.832592248916626} 11/06/2021 21:59:21 - INFO - __main__ - Step 5222: {'lr': 0.0004994158846946321, 'samples': 1002624, 'steps': 5221, 'loss/train': 1.9344158172607422} 11/06/2021 21:59:21 - INFO - __main__ - Step 5223: {'lr': 0.0004994155220878425, 'samples': 1002816, 'steps': 5222, 'loss/train': 1.6130883693695068} 11/06/2021 21:59:21 - INFO - __main__ - Step 5224: {'lr': 0.0004994151593686699, 'samples': 1003008, 'steps': 5223, 'loss/train': 0.5793285965919495} 11/06/2021 21:59:22 - INFO - __main__ - Step 5225: {'lr': 0.0004994147965371147, 'samples': 1003200, 'steps': 5224, 'loss/train': 2.1755013465881348} 11/06/2021 21:59:23 - INFO - __main__ - Step 5226: {'lr': 0.0004994144335931772, 'samples': 1003392, 'steps': 5225, 'loss/train': 1.7877094745635986} 11/06/2021 21:59:23 - INFO - __main__ - Step 5227: {'lr': 0.0004994140705368573, 'samples': 1003584, 'steps': 5226, 'loss/train': 2.2187447547912598} 11/06/2021 21:59:24 - INFO - __main__ - Step 5228: {'lr': 0.0004994137073681552, 'samples': 1003776, 'steps': 5227, 'loss/train': 1.6575233936309814} 11/06/2021 21:59:24 - INFO - __main__ - Step 5229: {'lr': 0.0004994133440870712, 'samples': 1003968, 'steps': 5228, 'loss/train': 2.3762245178222656} 11/06/2021 21:59:24 - INFO - __main__ - Step 5230: {'lr': 0.0004994129806936054, 'samples': 1004160, 'steps': 5229, 'loss/train': 2.717776298522949} 11/06/2021 21:59:25 - INFO - __main__ - Step 5231: {'lr': 0.000499412617187758, 'samples': 1004352, 'steps': 5230, 'loss/train': 1.9042540788650513} 11/06/2021 21:59:26 - INFO - __main__ - Step 5232: {'lr': 0.0004994122535695291, 'samples': 1004544, 'steps': 5231, 'loss/train': 2.3282763957977295} 11/06/2021 21:59:26 - INFO - __main__ - Step 5233: {'lr': 0.0004994118898389189, 'samples': 1004736, 'steps': 5232, 'loss/train': 1.996087670326233} 11/06/2021 21:59:26 - INFO - __main__ - Step 5234: {'lr': 0.0004994115259959274, 'samples': 1004928, 'steps': 5233, 'loss/train': 1.9831849336624146} 11/06/2021 21:59:27 - INFO - __main__ - Step 5235: {'lr': 0.0004994111620405551, 'samples': 1005120, 'steps': 5234, 'loss/train': 1.956398844718933} 11/06/2021 21:59:28 - INFO - __main__ - Step 5236: {'lr': 0.0004994107979728019, 'samples': 1005312, 'steps': 5235, 'loss/train': 1.0656429529190063} 11/06/2021 21:59:28 - INFO - __main__ - Step 5237: {'lr': 0.0004994104337926681, 'samples': 1005504, 'steps': 5236, 'loss/train': 2.004157066345215} 11/06/2021 21:59:28 - INFO - __main__ - Step 5238: {'lr': 0.0004994100695001537, 'samples': 1005696, 'steps': 5237, 'loss/train': 1.8911778926849365} 11/06/2021 21:59:29 - INFO - __main__ - Step 5239: {'lr': 0.0004994097050952591, 'samples': 1005888, 'steps': 5238, 'loss/train': 2.1261074542999268} 11/06/2021 21:59:29 - INFO - __main__ - Step 5240: {'lr': 0.0004994093405779842, 'samples': 1006080, 'steps': 5239, 'loss/train': 1.9837573766708374} 11/06/2021 21:59:29 - INFO - __main__ - Step 5241: {'lr': 0.0004994089759483294, 'samples': 1006272, 'steps': 5240, 'loss/train': 1.9001221656799316} 11/06/2021 21:59:31 - INFO - __main__ - Step 5242: {'lr': 0.0004994086112062948, 'samples': 1006464, 'steps': 5241, 'loss/train': 1.5564855337142944} 11/06/2021 21:59:31 - INFO - __main__ - Step 5243: {'lr': 0.0004994082463518804, 'samples': 1006656, 'steps': 5242, 'loss/train': 1.7091917991638184} 11/06/2021 21:59:31 - INFO - __main__ - Step 5244: {'lr': 0.0004994078813850865, 'samples': 1006848, 'steps': 5243, 'loss/train': 2.3507444858551025} 11/06/2021 21:59:32 - INFO - __main__ - Step 5245: {'lr': 0.0004994075163059134, 'samples': 1007040, 'steps': 5244, 'loss/train': 1.456217646598816} 11/06/2021 21:59:32 - INFO - __main__ - Step 5246: {'lr': 0.0004994071511143609, 'samples': 1007232, 'steps': 5245, 'loss/train': 2.2415311336517334} 11/06/2021 21:59:33 - INFO - __main__ - Step 5247: {'lr': 0.0004994067858104296, 'samples': 1007424, 'steps': 5246, 'loss/train': 1.7924270629882812} 11/06/2021 21:59:33 - INFO - __main__ - Step 5248: {'lr': 0.0004994064203941195, 'samples': 1007616, 'steps': 5247, 'loss/train': 1.7448370456695557} 11/06/2021 21:59:34 - INFO - __main__ - Step 5249: {'lr': 0.0004994060548654304, 'samples': 1007808, 'steps': 5248, 'loss/train': 2.0024914741516113} 11/06/2021 21:59:34 - INFO - __main__ - Step 5250: {'lr': 0.000499405689224363, 'samples': 1008000, 'steps': 5249, 'loss/train': 2.1484951972961426} 11/06/2021 21:59:35 - INFO - __main__ - Step 5251: {'lr': 0.0004994053234709172, 'samples': 1008192, 'steps': 5250, 'loss/train': 2.0290279388427734} 11/06/2021 21:59:35 - INFO - __main__ - Step 5252: {'lr': 0.0004994049576050933, 'samples': 1008384, 'steps': 5251, 'loss/train': 1.8063730001449585} 11/06/2021 21:59:36 - INFO - __main__ - Step 5253: {'lr': 0.0004994045916268913, 'samples': 1008576, 'steps': 5252, 'loss/train': 1.8525824546813965} 11/06/2021 21:59:36 - INFO - __main__ - Step 5254: {'lr': 0.0004994042255363115, 'samples': 1008768, 'steps': 5253, 'loss/train': 1.7812204360961914} 11/06/2021 21:59:37 - INFO - __main__ - Step 5255: {'lr': 0.0004994038593333539, 'samples': 1008960, 'steps': 5254, 'loss/train': 1.8962163925170898} 11/06/2021 21:59:37 - INFO - __main__ - Step 5256: {'lr': 0.0004994034930180188, 'samples': 1009152, 'steps': 5255, 'loss/train': 1.78254234790802} 11/06/2021 21:59:38 - INFO - __main__ - Step 5257: {'lr': 0.0004994031265903063, 'samples': 1009344, 'steps': 5256, 'loss/train': 2.477855920791626} 11/06/2021 21:59:38 - INFO - __main__ - Step 5258: {'lr': 0.0004994027600502167, 'samples': 1009536, 'steps': 5257, 'loss/train': 1.9207967519760132} 11/06/2021 21:59:39 - INFO - __main__ - Step 5259: {'lr': 0.00049940239339775, 'samples': 1009728, 'steps': 5258, 'loss/train': 1.7611019611358643} 11/06/2021 21:59:39 - INFO - __main__ - Step 5260: {'lr': 0.0004994020266329064, 'samples': 1009920, 'steps': 5259, 'loss/train': 1.293487548828125} 11/06/2021 21:59:39 - INFO - __main__ - Step 5261: {'lr': 0.0004994016597556862, 'samples': 1010112, 'steps': 5260, 'loss/train': 2.4520223140716553} 11/06/2021 21:59:40 - INFO - __main__ - Step 5262: {'lr': 0.0004994012927660894, 'samples': 1010304, 'steps': 5261, 'loss/train': 2.1690826416015625} 11/06/2021 21:59:41 - INFO - __main__ - Step 5263: {'lr': 0.0004994009256641162, 'samples': 1010496, 'steps': 5262, 'loss/train': 1.7376418113708496} 11/06/2021 21:59:41 - INFO - __main__ - Step 5264: {'lr': 0.0004994005584497667, 'samples': 1010688, 'steps': 5263, 'loss/train': 1.8751423358917236} 11/06/2021 21:59:41 - INFO - __main__ - Step 5265: {'lr': 0.0004994001911230413, 'samples': 1010880, 'steps': 5264, 'loss/train': 1.917157769203186} 11/06/2021 21:59:42 - INFO - __main__ - Step 5266: {'lr': 0.00049939982368394, 'samples': 1011072, 'steps': 5265, 'loss/train': 1.825718879699707} 11/06/2021 21:59:42 - INFO - __main__ - Step 5267: {'lr': 0.000499399456132463, 'samples': 1011264, 'steps': 5266, 'loss/train': 2.2549986839294434} 11/06/2021 21:59:44 - INFO - __main__ - Step 5268: {'lr': 0.0004993990884686105, 'samples': 1011456, 'steps': 5267, 'loss/train': 1.5457383394241333} 11/06/2021 21:59:44 - INFO - __main__ - Step 5269: {'lr': 0.0004993987206923825, 'samples': 1011648, 'steps': 5268, 'loss/train': 1.0259302854537964} 11/06/2021 21:59:44 - INFO - __main__ - Step 5270: {'lr': 0.0004993983528037793, 'samples': 1011840, 'steps': 5269, 'loss/train': 1.1901259422302246} 11/06/2021 21:59:45 - INFO - __main__ - Step 5271: {'lr': 0.0004993979848028011, 'samples': 1012032, 'steps': 5270, 'loss/train': 2.2096664905548096} 11/06/2021 21:59:45 - INFO - __main__ - Step 5272: {'lr': 0.000499397616689448, 'samples': 1012224, 'steps': 5271, 'loss/train': 1.2520197629928589} 11/06/2021 21:59:46 - INFO - __main__ - Step 5273: {'lr': 0.0004993972484637202, 'samples': 1012416, 'steps': 5272, 'loss/train': 1.571405053138733} 11/06/2021 21:59:47 - INFO - __main__ - Step 5274: {'lr': 0.0004993968801256178, 'samples': 1012608, 'steps': 5273, 'loss/train': 2.2659175395965576} 11/06/2021 21:59:47 - INFO - __main__ - Step 5275: {'lr': 0.0004993965116751411, 'samples': 1012800, 'steps': 5274, 'loss/train': 1.6691346168518066} 11/06/2021 21:59:47 - INFO - __main__ - Step 5276: {'lr': 0.0004993961431122901, 'samples': 1012992, 'steps': 5275, 'loss/train': 2.171734094619751} 11/06/2021 21:59:48 - INFO - __main__ - Step 5277: {'lr': 0.0004993957744370651, 'samples': 1013184, 'steps': 5276, 'loss/train': 2.206928253173828} 11/06/2021 21:59:49 - INFO - __main__ - Step 5278: {'lr': 0.0004993954056494662, 'samples': 1013376, 'steps': 5277, 'loss/train': 1.2756541967391968} 11/06/2021 21:59:49 - INFO - __main__ - Step 5279: {'lr': 0.0004993950367494936, 'samples': 1013568, 'steps': 5278, 'loss/train': 2.2866268157958984} 11/06/2021 21:59:49 - INFO - __main__ - Step 5280: {'lr': 0.0004993946677371474, 'samples': 1013760, 'steps': 5279, 'loss/train': 2.0941381454467773} 11/06/2021 21:59:50 - INFO - __main__ - Step 5281: {'lr': 0.0004993942986124278, 'samples': 1013952, 'steps': 5280, 'loss/train': 1.8966882228851318} 11/06/2021 21:59:50 - INFO - __main__ - Step 5282: {'lr': 0.000499393929375335, 'samples': 1014144, 'steps': 5281, 'loss/train': 1.9149951934814453} 11/06/2021 21:59:51 - INFO - __main__ - Step 5283: {'lr': 0.0004993935600258691, 'samples': 1014336, 'steps': 5282, 'loss/train': 2.0702826976776123} 11/06/2021 21:59:51 - INFO - __main__ - Step 5284: {'lr': 0.0004993931905640305, 'samples': 1014528, 'steps': 5283, 'loss/train': 2.0960628986358643} 11/06/2021 21:59:52 - INFO - __main__ - Step 5285: {'lr': 0.000499392820989819, 'samples': 1014720, 'steps': 5284, 'loss/train': 1.8415803909301758} 11/06/2021 21:59:52 - INFO - __main__ - Step 5286: {'lr': 0.0004993924513032349, 'samples': 1014912, 'steps': 5285, 'loss/train': 2.124772071838379} 11/06/2021 21:59:52 - INFO - __main__ - Step 5287: {'lr': 0.0004993920815042785, 'samples': 1015104, 'steps': 5286, 'loss/train': 1.629605770111084} 11/06/2021 21:59:53 - INFO - __main__ - Step 5288: {'lr': 0.0004993917115929498, 'samples': 1015296, 'steps': 5287, 'loss/train': 2.094998359680176} 11/06/2021 21:59:54 - INFO - __main__ - Step 5289: {'lr': 0.0004993913415692492, 'samples': 1015488, 'steps': 5288, 'loss/train': 0.9913315176963806} 11/06/2021 21:59:54 - INFO - __main__ - Step 5290: {'lr': 0.0004993909714331766, 'samples': 1015680, 'steps': 5289, 'loss/train': 2.3629143238067627} 11/06/2021 21:59:54 - INFO - __main__ - Step 5291: {'lr': 0.0004993906011847323, 'samples': 1015872, 'steps': 5290, 'loss/train': 1.4902851581573486} 11/06/2021 21:59:55 - INFO - __main__ - Step 5292: {'lr': 0.0004993902308239164, 'samples': 1016064, 'steps': 5291, 'loss/train': 2.1800575256347656} 11/06/2021 21:59:55 - INFO - __main__ - Step 5293: {'lr': 0.0004993898603507292, 'samples': 1016256, 'steps': 5292, 'loss/train': 1.8672491312026978} 11/06/2021 21:59:56 - INFO - __main__ - Step 5294: {'lr': 0.0004993894897651706, 'samples': 1016448, 'steps': 5293, 'loss/train': 1.4217222929000854} 11/06/2021 21:59:57 - INFO - __main__ - Step 5295: {'lr': 0.0004993891190672411, 'samples': 1016640, 'steps': 5294, 'loss/train': 2.300929307937622} 11/06/2021 21:59:57 - INFO - __main__ - Step 5296: {'lr': 0.0004993887482569407, 'samples': 1016832, 'steps': 5295, 'loss/train': 1.8544442653656006} 11/06/2021 21:59:57 - INFO - __main__ - Step 5297: {'lr': 0.0004993883773342695, 'samples': 1017024, 'steps': 5296, 'loss/train': 1.9744642972946167} 11/06/2021 21:59:58 - INFO - __main__ - Step 5298: {'lr': 0.0004993880062992279, 'samples': 1017216, 'steps': 5297, 'loss/train': 1.6615900993347168} 11/06/2021 21:59:58 - INFO - __main__ - Step 5299: {'lr': 0.0004993876351518157, 'samples': 1017408, 'steps': 5298, 'loss/train': 1.865243911743164} 11/06/2021 21:59:59 - INFO - __main__ - Step 5300: {'lr': 0.0004993872638920335, 'samples': 1017600, 'steps': 5299, 'loss/train': 1.881446123123169} 11/06/2021 22:00:00 - INFO - __main__ - Step 5301: {'lr': 0.0004993868925198811, 'samples': 1017792, 'steps': 5300, 'loss/train': 2.0153396129608154} 11/06/2021 22:00:00 - INFO - __main__ - Step 5302: {'lr': 0.0004993865210353588, 'samples': 1017984, 'steps': 5301, 'loss/train': 1.8959330320358276} 11/06/2021 22:00:00 - INFO - __main__ - Step 5303: {'lr': 0.0004993861494384669, 'samples': 1018176, 'steps': 5302, 'loss/train': 1.8540432453155518} 11/06/2021 22:00:01 - INFO - __main__ - Step 5304: {'lr': 0.0004993857777292053, 'samples': 1018368, 'steps': 5303, 'loss/train': 1.6123359203338623} 11/06/2021 22:00:02 - INFO - __main__ - Step 5305: {'lr': 0.0004993854059075745, 'samples': 1018560, 'steps': 5304, 'loss/train': 2.1076877117156982} 11/06/2021 22:00:02 - INFO - __main__ - Step 5306: {'lr': 0.0004993850339735744, 'samples': 1018752, 'steps': 5305, 'loss/train': 1.8283599615097046} 11/06/2021 22:00:02 - INFO - __main__ - Step 5307: {'lr': 0.0004993846619272052, 'samples': 1018944, 'steps': 5306, 'loss/train': 2.057713508605957} 11/06/2021 22:00:03 - INFO - __main__ - Step 5308: {'lr': 0.0004993842897684672, 'samples': 1019136, 'steps': 5307, 'loss/train': 1.3499886989593506} 11/06/2021 22:00:03 - INFO - __main__ - Step 5309: {'lr': 0.0004993839174973604, 'samples': 1019328, 'steps': 5308, 'loss/train': 1.8223806619644165} 11/06/2021 22:00:04 - INFO - __main__ - Step 5310: {'lr': 0.0004993835451138851, 'samples': 1019520, 'steps': 5309, 'loss/train': 1.0041699409484863} 11/06/2021 22:00:04 - INFO - __main__ - Step 5311: {'lr': 0.0004993831726180414, 'samples': 1019712, 'steps': 5310, 'loss/train': 0.6815013289451599} 11/06/2021 22:00:05 - INFO - __main__ - Step 5312: {'lr': 0.0004993828000098296, 'samples': 1019904, 'steps': 5311, 'loss/train': 1.6194506883621216} 11/06/2021 22:00:05 - INFO - __main__ - Step 5313: {'lr': 0.0004993824272892497, 'samples': 1020096, 'steps': 5312, 'loss/train': 1.8829015493392944} 11/06/2021 22:00:05 - INFO - __main__ - Step 5314: {'lr': 0.0004993820544563018, 'samples': 1020288, 'steps': 5313, 'loss/train': 2.2411489486694336} 11/06/2021 22:00:06 - INFO - __main__ - Step 5315: {'lr': 0.0004993816815109863, 'samples': 1020480, 'steps': 5314, 'loss/train': 1.758228063583374} 11/06/2021 22:00:07 - INFO - __main__ - Step 5316: {'lr': 0.0004993813084533033, 'samples': 1020672, 'steps': 5315, 'loss/train': 1.9328948259353638} 11/06/2021 22:00:07 - INFO - __main__ - Step 5317: {'lr': 0.0004993809352832529, 'samples': 1020864, 'steps': 5316, 'loss/train': 1.9451228380203247} 11/06/2021 22:00:08 - INFO - __main__ - Step 5318: {'lr': 0.0004993805620008353, 'samples': 1021056, 'steps': 5317, 'loss/train': 2.118875026702881} 11/06/2021 22:00:08 - INFO - __main__ - Step 5319: {'lr': 0.0004993801886060506, 'samples': 1021248, 'steps': 5318, 'loss/train': 2.0214946269989014} 11/06/2021 22:00:09 - INFO - __main__ - Step 5320: {'lr': 0.0004993798150988991, 'samples': 1021440, 'steps': 5319, 'loss/train': 2.4562036991119385} 11/06/2021 22:00:09 - INFO - __main__ - Step 5321: {'lr': 0.0004993794414793808, 'samples': 1021632, 'steps': 5320, 'loss/train': 1.9569580554962158} 11/06/2021 22:00:10 - INFO - __main__ - Step 5322: {'lr': 0.0004993790677474962, 'samples': 1021824, 'steps': 5321, 'loss/train': 1.3694133758544922} 11/06/2021 22:00:10 - INFO - __main__ - Step 5323: {'lr': 0.0004993786939032451, 'samples': 1022016, 'steps': 5322, 'loss/train': 1.7816057205200195} 11/06/2021 22:00:10 - INFO - __main__ - Step 5324: {'lr': 0.0004993783199466278, 'samples': 1022208, 'steps': 5323, 'loss/train': 2.492426633834839} 11/06/2021 22:00:11 - INFO - __main__ - Step 5325: {'lr': 0.0004993779458776444, 'samples': 1022400, 'steps': 5324, 'loss/train': 2.574648857116699} 11/06/2021 22:00:12 - INFO - __main__ - Step 5326: {'lr': 0.0004993775716962953, 'samples': 1022592, 'steps': 5325, 'loss/train': 1.6549577713012695} 11/06/2021 22:00:12 - INFO - __main__ - Step 5327: {'lr': 0.0004993771974025805, 'samples': 1022784, 'steps': 5326, 'loss/train': 1.884212851524353} 11/06/2021 22:00:12 - INFO - __main__ - Step 5328: {'lr': 0.0004993768229965001, 'samples': 1022976, 'steps': 5327, 'loss/train': 2.4046313762664795} 11/06/2021 22:00:13 - INFO - __main__ - Step 5329: {'lr': 0.0004993764484780543, 'samples': 1023168, 'steps': 5328, 'loss/train': 1.4917925596237183} 11/06/2021 22:00:13 - INFO - __main__ - Step 5330: {'lr': 0.0004993760738472435, 'samples': 1023360, 'steps': 5329, 'loss/train': 1.8615087270736694} 11/06/2021 22:00:14 - INFO - __main__ - Step 5331: {'lr': 0.0004993756991040675, 'samples': 1023552, 'steps': 5330, 'loss/train': 1.8130137920379639} 11/06/2021 22:00:14 - INFO - __main__ - Step 5332: {'lr': 0.0004993753242485268, 'samples': 1023744, 'steps': 5331, 'loss/train': 2.364469051361084} 11/06/2021 22:00:15 - INFO - __main__ - Step 5333: {'lr': 0.0004993749492806214, 'samples': 1023936, 'steps': 5332, 'loss/train': 2.498106002807617} 11/06/2021 22:00:15 - INFO - __main__ - Step 5334: {'lr': 0.0004993745742003515, 'samples': 1024128, 'steps': 5333, 'loss/train': 1.809753656387329} 11/06/2021 22:00:16 - INFO - __main__ - Step 5335: {'lr': 0.0004993741990077172, 'samples': 1024320, 'steps': 5334, 'loss/train': 2.3342111110687256} 11/06/2021 22:00:16 - INFO - __main__ - Step 5336: {'lr': 0.0004993738237027188, 'samples': 1024512, 'steps': 5335, 'loss/train': 2.1999897956848145} 11/06/2021 22:00:17 - INFO - __main__ - Step 5337: {'lr': 0.0004993734482853563, 'samples': 1024704, 'steps': 5336, 'loss/train': 2.134817123413086} 11/06/2021 22:00:17 - INFO - __main__ - Step 5338: {'lr': 0.0004993730727556301, 'samples': 1024896, 'steps': 5337, 'loss/train': 1.9797841310501099} 11/06/2021 22:00:17 - INFO - __main__ - Step 5339: {'lr': 0.0004993726971135402, 'samples': 1025088, 'steps': 5338, 'loss/train': 1.4383511543273926} 11/06/2021 22:00:18 - INFO - __main__ - Step 5340: {'lr': 0.0004993723213590868, 'samples': 1025280, 'steps': 5339, 'loss/train': 2.2036027908325195} 11/06/2021 22:00:19 - INFO - __main__ - Step 5341: {'lr': 0.0004993719454922701, 'samples': 1025472, 'steps': 5340, 'loss/train': 2.0086007118225098} 11/06/2021 22:00:19 - INFO - __main__ - Step 5342: {'lr': 0.0004993715695130902, 'samples': 1025664, 'steps': 5341, 'loss/train': 1.6373944282531738} 11/06/2021 22:00:20 - INFO - __main__ - Step 5343: {'lr': 0.0004993711934215473, 'samples': 1025856, 'steps': 5342, 'loss/train': 1.591683268547058} 11/06/2021 22:00:20 - INFO - __main__ - Step 5344: {'lr': 0.0004993708172176417, 'samples': 1026048, 'steps': 5343, 'loss/train': 1.9710315465927124} 11/06/2021 22:00:20 - INFO - __main__ - Step 5345: {'lr': 0.0004993704409013734, 'samples': 1026240, 'steps': 5344, 'loss/train': 2.084678888320923} 11/06/2021 22:00:21 - INFO - __main__ - Step 5346: {'lr': 0.0004993700644727425, 'samples': 1026432, 'steps': 5345, 'loss/train': 2.1554689407348633} 11/06/2021 22:00:22 - INFO - __main__ - Step 5347: {'lr': 0.0004993696879317495, 'samples': 1026624, 'steps': 5346, 'loss/train': 1.5472863912582397} 11/06/2021 22:00:22 - INFO - __main__ - Step 5348: {'lr': 0.0004993693112783943, 'samples': 1026816, 'steps': 5347, 'loss/train': 1.995734691619873} 11/06/2021 22:00:22 - INFO - __main__ - Step 5349: {'lr': 0.0004993689345126771, 'samples': 1027008, 'steps': 5348, 'loss/train': 2.151108503341675} 11/06/2021 22:00:23 - INFO - __main__ - Step 5350: {'lr': 0.0004993685576345981, 'samples': 1027200, 'steps': 5349, 'loss/train': 2.3165838718414307} 11/06/2021 22:00:24 - INFO - __main__ - Step 5351: {'lr': 0.0004993681806441575, 'samples': 1027392, 'steps': 5350, 'loss/train': 2.325005292892456} 11/06/2021 22:00:24 - INFO - __main__ - Step 5352: {'lr': 0.0004993678035413554, 'samples': 1027584, 'steps': 5351, 'loss/train': 1.954711675643921} 11/06/2021 22:00:24 - INFO - __main__ - Step 5353: {'lr': 0.0004993674263261921, 'samples': 1027776, 'steps': 5352, 'loss/train': 1.639163851737976} 11/06/2021 22:00:25 - INFO - __main__ - Step 5354: {'lr': 0.0004993670489986677, 'samples': 1027968, 'steps': 5353, 'loss/train': 2.0933964252471924} 11/06/2021 22:00:25 - INFO - __main__ - Step 5355: {'lr': 0.0004993666715587823, 'samples': 1028160, 'steps': 5354, 'loss/train': 2.3471012115478516} 11/06/2021 22:00:26 - INFO - __main__ - Step 5356: {'lr': 0.0004993662940065361, 'samples': 1028352, 'steps': 5355, 'loss/train': 1.8415415287017822} 11/06/2021 22:00:26 - INFO - __main__ - Step 5357: {'lr': 0.0004993659163419294, 'samples': 1028544, 'steps': 5356, 'loss/train': 2.263580322265625} 11/06/2021 22:00:27 - INFO - __main__ - Step 5358: {'lr': 0.0004993655385649621, 'samples': 1028736, 'steps': 5357, 'loss/train': 1.8695333003997803} 11/06/2021 22:00:27 - INFO - __main__ - Step 5359: {'lr': 0.0004993651606756347, 'samples': 1028928, 'steps': 5358, 'loss/train': 2.1542937755584717} 11/06/2021 22:00:28 - INFO - __main__ - Step 5360: {'lr': 0.0004993647826739471, 'samples': 1029120, 'steps': 5359, 'loss/train': 1.7459410429000854} 11/06/2021 22:00:28 - INFO - __main__ - Step 5361: {'lr': 0.0004993644045598997, 'samples': 1029312, 'steps': 5360, 'loss/train': 2.4149515628814697} 11/06/2021 22:00:29 - INFO - __main__ - Step 5362: {'lr': 0.0004993640263334924, 'samples': 1029504, 'steps': 5361, 'loss/train': 1.8618431091308594} 11/06/2021 22:00:29 - INFO - __main__ - Step 5363: {'lr': 0.0004993636479947256, 'samples': 1029696, 'steps': 5362, 'loss/train': 1.1702266931533813} 11/06/2021 22:00:30 - INFO - __main__ - Step 5364: {'lr': 0.0004993632695435993, 'samples': 1029888, 'steps': 5363, 'loss/train': 2.124359369277954} 11/06/2021 22:00:30 - INFO - __main__ - Step 5365: {'lr': 0.0004993628909801138, 'samples': 1030080, 'steps': 5364, 'loss/train': 1.8695933818817139} 11/06/2021 22:00:31 - INFO - __main__ - Step 5366: {'lr': 0.0004993625123042694, 'samples': 1030272, 'steps': 5365, 'loss/train': 2.126741409301758} 11/06/2021 22:00:31 - INFO - __main__ - Step 5367: {'lr': 0.0004993621335160659, 'samples': 1030464, 'steps': 5366, 'loss/train': 2.1236331462860107} 11/06/2021 22:00:32 - INFO - __main__ - Step 5368: {'lr': 0.0004993617546155037, 'samples': 1030656, 'steps': 5367, 'loss/train': 1.1388921737670898} 11/06/2021 22:00:32 - INFO - __main__ - Step 5369: {'lr': 0.000499361375602583, 'samples': 1030848, 'steps': 5368, 'loss/train': 2.0226683616638184} 11/06/2021 22:00:32 - INFO - __main__ - Step 5370: {'lr': 0.0004993609964773039, 'samples': 1031040, 'steps': 5369, 'loss/train': 1.933193325996399} 11/06/2021 22:00:33 - INFO - __main__ - Step 5371: {'lr': 0.0004993606172396665, 'samples': 1031232, 'steps': 5370, 'loss/train': 1.9464812278747559} 11/06/2021 22:00:34 - INFO - __main__ - Step 5372: {'lr': 0.0004993602378896712, 'samples': 1031424, 'steps': 5371, 'loss/train': 1.1575310230255127} 11/06/2021 22:00:34 - INFO - __main__ - Step 5373: {'lr': 0.0004993598584273179, 'samples': 1031616, 'steps': 5372, 'loss/train': 2.0827817916870117} 11/06/2021 22:00:34 - INFO - __main__ - Step 5374: {'lr': 0.0004993594788526069, 'samples': 1031808, 'steps': 5373, 'loss/train': 2.1130635738372803} 11/06/2021 22:00:35 - INFO - __main__ - Step 5375: {'lr': 0.0004993590991655384, 'samples': 1032000, 'steps': 5374, 'loss/train': 2.0169715881347656} 11/06/2021 22:00:35 - INFO - __main__ - Step 5376: {'lr': 0.0004993587193661126, 'samples': 1032192, 'steps': 5375, 'loss/train': 2.4737699031829834} 11/06/2021 22:00:36 - INFO - __main__ - Step 5377: {'lr': 0.0004993583394543295, 'samples': 1032384, 'steps': 5376, 'loss/train': 1.8116259574890137} 11/06/2021 22:00:37 - INFO - __main__ - Step 5378: {'lr': 0.0004993579594301895, 'samples': 1032576, 'steps': 5377, 'loss/train': 1.8319172859191895} 11/06/2021 22:00:37 - INFO - __main__ - Step 5379: {'lr': 0.0004993575792936925, 'samples': 1032768, 'steps': 5378, 'loss/train': 1.6784554719924927} 11/06/2021 22:00:37 - INFO - __main__ - Step 5380: {'lr': 0.000499357199044839, 'samples': 1032960, 'steps': 5379, 'loss/train': 1.458201289176941} 11/06/2021 22:00:38 - INFO - __main__ - Step 5381: {'lr': 0.0004993568186836288, 'samples': 1033152, 'steps': 5380, 'loss/train': 1.8276842832565308} 11/06/2021 22:00:39 - INFO - __main__ - Step 5382: {'lr': 0.0004993564382100624, 'samples': 1033344, 'steps': 5381, 'loss/train': 1.3644241094589233} 11/06/2021 22:00:39 - INFO - __main__ - Step 5383: {'lr': 0.0004993560576241398, 'samples': 1033536, 'steps': 5382, 'loss/train': 1.386871337890625} 11/06/2021 22:00:39 - INFO - __main__ - Step 5384: {'lr': 0.0004993556769258612, 'samples': 1033728, 'steps': 5383, 'loss/train': 1.235226035118103} 11/06/2021 22:00:40 - INFO - __main__ - Step 5385: {'lr': 0.0004993552961152268, 'samples': 1033920, 'steps': 5384, 'loss/train': 1.334904670715332} 11/06/2021 22:00:40 - INFO - __main__ - Step 5386: {'lr': 0.0004993549151922367, 'samples': 1034112, 'steps': 5385, 'loss/train': 2.5312018394470215} 11/06/2021 22:00:41 - INFO - __main__ - Step 5387: {'lr': 0.0004993545341568912, 'samples': 1034304, 'steps': 5386, 'loss/train': 1.546046257019043} 11/06/2021 22:00:41 - INFO - __main__ - Step 5388: {'lr': 0.0004993541530091903, 'samples': 1034496, 'steps': 5387, 'loss/train': 1.8282663822174072} 11/06/2021 22:00:42 - INFO - __main__ - Step 5389: {'lr': 0.0004993537717491343, 'samples': 1034688, 'steps': 5388, 'loss/train': 2.0151655673980713} 11/06/2021 22:00:42 - INFO - __main__ - Step 5390: {'lr': 0.0004993533903767235, 'samples': 1034880, 'steps': 5389, 'loss/train': 1.5707823038101196} 11/06/2021 22:00:42 - INFO - __main__ - Step 5391: {'lr': 0.0004993530088919577, 'samples': 1035072, 'steps': 5390, 'loss/train': 1.7136812210083008} 11/06/2021 22:00:43 - INFO - __main__ - Step 5392: {'lr': 0.0004993526272948374, 'samples': 1035264, 'steps': 5391, 'loss/train': 0.6717694401741028} 11/06/2021 22:00:44 - INFO - __main__ - Step 5393: {'lr': 0.0004993522455853626, 'samples': 1035456, 'steps': 5392, 'loss/train': 1.5146379470825195} 11/06/2021 22:00:44 - INFO - __main__ - Step 5394: {'lr': 0.0004993518637635334, 'samples': 1035648, 'steps': 5393, 'loss/train': 1.4863612651824951} 11/06/2021 22:00:45 - INFO - __main__ - Step 5395: {'lr': 0.0004993514818293503, 'samples': 1035840, 'steps': 5394, 'loss/train': 1.7902588844299316} 11/06/2021 22:00:45 - INFO - __main__ - Step 5396: {'lr': 0.0004993510997828132, 'samples': 1036032, 'steps': 5395, 'loss/train': 1.7345638275146484} 11/06/2021 22:00:45 - INFO - __main__ - Step 5397: {'lr': 0.0004993507176239224, 'samples': 1036224, 'steps': 5396, 'loss/train': 1.4879080057144165} 11/06/2021 22:00:46 - INFO - __main__ - Step 5398: {'lr': 0.0004993503353526779, 'samples': 1036416, 'steps': 5397, 'loss/train': 1.566808819770813} 11/06/2021 22:00:47 - INFO - __main__ - Step 5399: {'lr': 0.0004993499529690801, 'samples': 1036608, 'steps': 5398, 'loss/train': 1.6237696409225464} 11/06/2021 22:00:47 - INFO - __main__ - Step 5400: {'lr': 0.000499349570473129, 'samples': 1036800, 'steps': 5399, 'loss/train': 2.0763614177703857} 11/06/2021 22:00:47 - INFO - __main__ - Step 5401: {'lr': 0.0004993491878648249, 'samples': 1036992, 'steps': 5400, 'loss/train': 2.139653444290161} 11/06/2021 22:00:48 - INFO - __main__ - Step 5402: {'lr': 0.0004993488051441677, 'samples': 1037184, 'steps': 5401, 'loss/train': 1.2702760696411133} 11/06/2021 22:00:49 - INFO - __main__ - Step 5403: {'lr': 0.000499348422311158, 'samples': 1037376, 'steps': 5402, 'loss/train': 1.7422914505004883} 11/06/2021 22:00:49 - INFO - __main__ - Step 5404: {'lr': 0.0004993480393657956, 'samples': 1037568, 'steps': 5403, 'loss/train': 1.667367935180664} 11/06/2021 22:00:50 - INFO - __main__ - Step 5405: {'lr': 0.0004993476563080809, 'samples': 1037760, 'steps': 5404, 'loss/train': 1.849706768989563} 11/06/2021 22:00:50 - INFO - __main__ - Step 5406: {'lr': 0.000499347273138014, 'samples': 1037952, 'steps': 5405, 'loss/train': 1.485642433166504} 11/06/2021 22:00:50 - INFO - __main__ - Step 5407: {'lr': 0.000499346889855595, 'samples': 1038144, 'steps': 5406, 'loss/train': 1.8878998756408691} 11/06/2021 22:00:51 - INFO - __main__ - Step 5408: {'lr': 0.0004993465064608242, 'samples': 1038336, 'steps': 5407, 'loss/train': 2.2448551654815674} 11/06/2021 22:00:52 - INFO - __main__ - Step 5409: {'lr': 0.0004993461229537017, 'samples': 1038528, 'steps': 5408, 'loss/train': 1.8952325582504272} 11/06/2021 22:00:52 - INFO - __main__ - Step 5410: {'lr': 0.0004993457393342276, 'samples': 1038720, 'steps': 5409, 'loss/train': 1.7118123769760132} 11/06/2021 22:00:52 - INFO - __main__ - Step 5411: {'lr': 0.0004993453556024023, 'samples': 1038912, 'steps': 5410, 'loss/train': 1.6171423196792603} 11/06/2021 22:00:53 - INFO - __main__ - Step 5412: {'lr': 0.0004993449717582258, 'samples': 1039104, 'steps': 5411, 'loss/train': 1.8556959629058838} 11/06/2021 22:00:53 - INFO - __main__ - Step 5413: {'lr': 0.0004993445878016982, 'samples': 1039296, 'steps': 5412, 'loss/train': 2.079441547393799} 11/06/2021 22:00:54 - INFO - __main__ - Step 5414: {'lr': 0.0004993442037328199, 'samples': 1039488, 'steps': 5413, 'loss/train': 2.005168914794922} 11/06/2021 22:00:54 - INFO - __main__ - Step 5415: {'lr': 0.0004993438195515909, 'samples': 1039680, 'steps': 5414, 'loss/train': 1.7402551174163818} 11/06/2021 22:00:55 - INFO - __main__ - Step 5416: {'lr': 0.0004993434352580115, 'samples': 1039872, 'steps': 5415, 'loss/train': 1.5889499187469482} 11/06/2021 22:00:55 - INFO - __main__ - Step 5417: {'lr': 0.0004993430508520816, 'samples': 1040064, 'steps': 5416, 'loss/train': 2.1305549144744873} 11/06/2021 22:00:55 - INFO - __main__ - Step 5418: {'lr': 0.0004993426663338018, 'samples': 1040256, 'steps': 5417, 'loss/train': 1.9073415994644165} 11/06/2021 22:00:57 - INFO - __main__ - Step 5419: {'lr': 0.0004993422817031719, 'samples': 1040448, 'steps': 5418, 'loss/train': 2.1884820461273193} 11/06/2021 22:00:57 - INFO - __main__ - Step 5420: {'lr': 0.0004993418969601921, 'samples': 1040640, 'steps': 5419, 'loss/train': 2.154372215270996} 11/06/2021 22:00:57 - INFO - __main__ - Step 5421: {'lr': 0.0004993415121048629, 'samples': 1040832, 'steps': 5420, 'loss/train': 1.9942115545272827} 11/06/2021 22:00:58 - INFO - __main__ - Step 5422: {'lr': 0.0004993411271371842, 'samples': 1041024, 'steps': 5421, 'loss/train': 1.9106769561767578} 11/06/2021 22:00:58 - INFO - __main__ - Step 5423: {'lr': 0.0004993407420571563, 'samples': 1041216, 'steps': 5422, 'loss/train': 1.0730233192443848} 11/06/2021 22:00:59 - INFO - __main__ - Step 5424: {'lr': 0.0004993403568647792, 'samples': 1041408, 'steps': 5423, 'loss/train': 2.985499143600464} 11/06/2021 22:01:00 - INFO - __main__ - Step 5425: {'lr': 0.0004993399715600531, 'samples': 1041600, 'steps': 5424, 'loss/train': 1.9801826477050781} 11/06/2021 22:01:00 - INFO - __main__ - Step 5426: {'lr': 0.0004993395861429785, 'samples': 1041792, 'steps': 5425, 'loss/train': 2.2979655265808105} 11/06/2021 22:01:00 - INFO - __main__ - Step 5427: {'lr': 0.0004993392006135552, 'samples': 1041984, 'steps': 5426, 'loss/train': 1.975477933883667} 11/06/2021 22:01:01 - INFO - __main__ - Step 5428: {'lr': 0.0004993388149717834, 'samples': 1042176, 'steps': 5427, 'loss/train': 1.659934639930725} 11/06/2021 22:01:02 - INFO - __main__ - Step 5429: {'lr': 0.0004993384292176636, 'samples': 1042368, 'steps': 5428, 'loss/train': 0.41693881154060364} 11/06/2021 22:01:02 - INFO - __main__ - Step 5430: {'lr': 0.0004993380433511956, 'samples': 1042560, 'steps': 5429, 'loss/train': 1.881492257118225} 11/06/2021 22:01:03 - INFO - __main__ - Step 5431: {'lr': 0.0004993376573723798, 'samples': 1042752, 'steps': 5430, 'loss/train': 2.2154150009155273} 11/06/2021 22:01:03 - INFO - __main__ - Step 5432: {'lr': 0.0004993372712812162, 'samples': 1042944, 'steps': 5431, 'loss/train': 2.0306501388549805} 11/06/2021 22:01:04 - INFO - __main__ - Step 5433: {'lr': 0.0004993368850777052, 'samples': 1043136, 'steps': 5432, 'loss/train': 2.2101669311523438} 11/06/2021 22:01:04 - INFO - __main__ - Step 5434: {'lr': 0.0004993364987618468, 'samples': 1043328, 'steps': 5433, 'loss/train': 2.0793869495391846} 11/06/2021 22:01:05 - INFO - __main__ - Step 5435: {'lr': 0.0004993361123336412, 'samples': 1043520, 'steps': 5434, 'loss/train': 1.7962534427642822} 11/06/2021 22:01:05 - INFO - __main__ - Step 5436: {'lr': 0.0004993357257930887, 'samples': 1043712, 'steps': 5435, 'loss/train': 2.5018160343170166} 11/06/2021 22:01:06 - INFO - __main__ - Step 5437: {'lr': 0.0004993353391401892, 'samples': 1043904, 'steps': 5436, 'loss/train': 2.258634567260742} 11/06/2021 22:01:06 - INFO - __main__ - Step 5438: {'lr': 0.0004993349523749431, 'samples': 1044096, 'steps': 5437, 'loss/train': 1.8289518356323242} 11/06/2021 22:01:07 - INFO - __main__ - Step 5439: {'lr': 0.0004993345654973505, 'samples': 1044288, 'steps': 5438, 'loss/train': 1.9161920547485352} 11/06/2021 22:01:07 - INFO - __main__ - Step 5440: {'lr': 0.0004993341785074116, 'samples': 1044480, 'steps': 5439, 'loss/train': 2.753596305847168} 11/06/2021 22:01:07 - INFO - __main__ - Step 5441: {'lr': 0.0004993337914051266, 'samples': 1044672, 'steps': 5440, 'loss/train': 1.9837926626205444} 11/06/2021 22:01:08 - INFO - __main__ - Step 5442: {'lr': 0.0004993334041904957, 'samples': 1044864, 'steps': 5441, 'loss/train': 0.3798553943634033} 11/06/2021 22:01:09 - INFO - __main__ - Step 5443: {'lr': 0.0004993330168635189, 'samples': 1045056, 'steps': 5442, 'loss/train': 1.5008134841918945} 11/06/2021 22:01:09 - INFO - __main__ - Step 5444: {'lr': 0.0004993326294241966, 'samples': 1045248, 'steps': 5443, 'loss/train': 0.524370014667511} 11/06/2021 22:01:09 - INFO - __main__ - Step 5445: {'lr': 0.0004993322418725286, 'samples': 1045440, 'steps': 5444, 'loss/train': 1.8944960832595825} 11/06/2021 22:01:10 - INFO - __main__ - Step 5446: {'lr': 0.0004993318542085157, 'samples': 1045632, 'steps': 5445, 'loss/train': 2.161842107772827} 11/06/2021 22:01:11 - INFO - __main__ - Step 5447: {'lr': 0.0004993314664321575, 'samples': 1045824, 'steps': 5446, 'loss/train': 2.0262186527252197} 11/06/2021 22:01:11 - INFO - __main__ - Step 5448: {'lr': 0.0004993310785434544, 'samples': 1046016, 'steps': 5447, 'loss/train': 1.3276009559631348} 11/06/2021 22:01:11 - INFO - __main__ - Step 5449: {'lr': 0.0004993306905424067, 'samples': 1046208, 'steps': 5448, 'loss/train': 1.6204453706741333} 11/06/2021 22:01:12 - INFO - __main__ - Step 5450: {'lr': 0.0004993303024290143, 'samples': 1046400, 'steps': 5449, 'loss/train': 2.0517914295196533} 11/06/2021 22:01:12 - INFO - __main__ - Step 5451: {'lr': 0.0004993299142032776, 'samples': 1046592, 'steps': 5450, 'loss/train': 1.5047802925109863} 11/06/2021 22:01:13 - INFO - __main__ - Step 5452: {'lr': 0.0004993295258651966, 'samples': 1046784, 'steps': 5451, 'loss/train': 2.418118953704834} 11/06/2021 22:01:14 - INFO - __main__ - Step 5453: {'lr': 0.0004993291374147716, 'samples': 1046976, 'steps': 5452, 'loss/train': 0.4726817011833191} 11/06/2021 22:01:14 - INFO - __main__ - Step 5454: {'lr': 0.0004993287488520027, 'samples': 1047168, 'steps': 5453, 'loss/train': 2.1030688285827637} 11/06/2021 22:01:14 - INFO - __main__ - Step 5455: {'lr': 0.0004993283601768902, 'samples': 1047360, 'steps': 5454, 'loss/train': 2.6343281269073486} 11/06/2021 22:01:15 - INFO - __main__ - Step 5456: {'lr': 0.0004993279713894342, 'samples': 1047552, 'steps': 5455, 'loss/train': 2.150969982147217} 11/06/2021 22:01:16 - INFO - __main__ - Step 5457: {'lr': 0.0004993275824896348, 'samples': 1047744, 'steps': 5456, 'loss/train': 1.408422827720642} 11/06/2021 22:01:16 - INFO - __main__ - Step 5458: {'lr': 0.0004993271934774922, 'samples': 1047936, 'steps': 5457, 'loss/train': 1.94776451587677} 11/06/2021 22:01:16 - INFO - __main__ - Step 5459: {'lr': 0.0004993268043530067, 'samples': 1048128, 'steps': 5458, 'loss/train': 1.2634351253509521} 11/06/2021 22:01:17 - INFO - __main__ - Step 5460: {'lr': 0.0004993264151161783, 'samples': 1048320, 'steps': 5459, 'loss/train': 1.6682987213134766} 11/06/2021 22:01:17 - INFO - __main__ - Step 5461: {'lr': 0.0004993260257670074, 'samples': 1048512, 'steps': 5460, 'loss/train': 2.2864060401916504} 11/06/2021 22:01:18 - INFO - __main__ - Step 5462: {'lr': 0.000499325636305494, 'samples': 1048704, 'steps': 5461, 'loss/train': 1.3038978576660156} 11/06/2021 22:01:18 - INFO - __main__ - Step 5463: {'lr': 0.0004993252467316382, 'samples': 1048896, 'steps': 5462, 'loss/train': 1.608263373374939} 11/06/2021 22:01:19 - INFO - __main__ - Step 5464: {'lr': 0.0004993248570454404, 'samples': 1049088, 'steps': 5463, 'loss/train': 1.6738258600234985} 11/06/2021 22:01:19 - INFO - __main__ - Step 5465: {'lr': 0.0004993244672469007, 'samples': 1049280, 'steps': 5464, 'loss/train': 2.2814807891845703} 11/06/2021 22:01:19 - INFO - __main__ - Step 5466: {'lr': 0.000499324077336019, 'samples': 1049472, 'steps': 5465, 'loss/train': 1.8330951929092407} 11/06/2021 22:01:20 - INFO - __main__ - Step 5467: {'lr': 0.000499323687312796, 'samples': 1049664, 'steps': 5466, 'loss/train': 1.905300498008728} 11/06/2021 22:01:21 - INFO - __main__ - Step 5468: {'lr': 0.0004993232971772315, 'samples': 1049856, 'steps': 5467, 'loss/train': 1.9987244606018066} 11/06/2021 22:01:21 - INFO - __main__ - Step 5469: {'lr': 0.0004993229069293257, 'samples': 1050048, 'steps': 5468, 'loss/train': 2.0383851528167725} 11/06/2021 22:01:21 - INFO - __main__ - Step 5470: {'lr': 0.0004993225165690789, 'samples': 1050240, 'steps': 5469, 'loss/train': 1.96856689453125} 11/06/2021 22:01:22 - INFO - __main__ - Step 5471: {'lr': 0.0004993221260964912, 'samples': 1050432, 'steps': 5470, 'loss/train': 2.155994415283203} 11/06/2021 22:01:22 - INFO - __main__ - Step 5472: {'lr': 0.0004993217355115628, 'samples': 1050624, 'steps': 5471, 'loss/train': 2.175236463546753} 11/06/2021 22:01:23 - INFO - __main__ - Step 5473: {'lr': 0.0004993213448142939, 'samples': 1050816, 'steps': 5472, 'loss/train': 2.2860300540924072} 11/06/2021 22:01:24 - INFO - __main__ - Step 5474: {'lr': 0.0004993209540046846, 'samples': 1051008, 'steps': 5473, 'loss/train': 2.309166431427002} 11/06/2021 22:01:24 - INFO - __main__ - Step 5475: {'lr': 0.0004993205630827352, 'samples': 1051200, 'steps': 5474, 'loss/train': 1.8463259935379028} 11/06/2021 22:01:24 - INFO - __main__ - Step 5476: {'lr': 0.0004993201720484458, 'samples': 1051392, 'steps': 5475, 'loss/train': 1.8536326885223389} 11/06/2021 22:01:25 - INFO - __main__ - Step 5477: {'lr': 0.0004993197809018165, 'samples': 1051584, 'steps': 5476, 'loss/train': 1.8011512756347656} 11/06/2021 22:01:26 - INFO - __main__ - Step 5478: {'lr': 0.0004993193896428476, 'samples': 1051776, 'steps': 5477, 'loss/train': 1.370781421661377} 11/06/2021 22:01:26 - INFO - __main__ - Step 5479: {'lr': 0.0004993189982715392, 'samples': 1051968, 'steps': 5478, 'loss/train': 1.9263511896133423} 11/06/2021 22:01:26 - INFO - __main__ - Step 5480: {'lr': 0.0004993186067878916, 'samples': 1052160, 'steps': 5479, 'loss/train': 1.3620198965072632} 11/06/2021 22:01:27 - INFO - __main__ - Step 5481: {'lr': 0.0004993182151919049, 'samples': 1052352, 'steps': 5480, 'loss/train': 2.0209407806396484} 11/06/2021 22:01:27 - INFO - __main__ - Step 5482: {'lr': 0.0004993178234835792, 'samples': 1052544, 'steps': 5481, 'loss/train': 1.6855971813201904} 11/06/2021 22:01:28 - INFO - __main__ - Step 5483: {'lr': 0.0004993174316629146, 'samples': 1052736, 'steps': 5482, 'loss/train': 2.446448802947998} 11/06/2021 22:01:28 - INFO - __main__ - Step 5484: {'lr': 0.0004993170397299116, 'samples': 1052928, 'steps': 5483, 'loss/train': 1.8740376234054565} 11/06/2021 22:01:29 - INFO - __main__ - Step 5485: {'lr': 0.0004993166476845701, 'samples': 1053120, 'steps': 5484, 'loss/train': 2.2243385314941406} 11/06/2021 22:01:29 - INFO - __main__ - Step 5486: {'lr': 0.0004993162555268903, 'samples': 1053312, 'steps': 5485, 'loss/train': 1.9227041006088257} 11/06/2021 22:01:29 - INFO - __main__ - Step 5487: {'lr': 0.0004993158632568726, 'samples': 1053504, 'steps': 5486, 'loss/train': 2.1195530891418457} 11/06/2021 22:01:30 - INFO - __main__ - Step 5488: {'lr': 0.000499315470874517, 'samples': 1053696, 'steps': 5487, 'loss/train': 2.2419090270996094} 11/06/2021 22:01:31 - INFO - __main__ - Step 5489: {'lr': 0.0004993150783798236, 'samples': 1053888, 'steps': 5488, 'loss/train': 2.4271090030670166} 11/06/2021 22:01:31 - INFO - __main__ - Step 5490: {'lr': 0.0004993146857727927, 'samples': 1054080, 'steps': 5489, 'loss/train': 1.7117938995361328} 11/06/2021 22:01:31 - INFO - __main__ - Step 5491: {'lr': 0.0004993142930534245, 'samples': 1054272, 'steps': 5490, 'loss/train': 1.232141375541687} 11/06/2021 22:01:32 - INFO - __main__ - Step 5492: {'lr': 0.000499313900221719, 'samples': 1054464, 'steps': 5491, 'loss/train': 1.8107589483261108} 11/06/2021 22:01:33 - INFO - __main__ - Step 5493: {'lr': 0.0004993135072776766, 'samples': 1054656, 'steps': 5492, 'loss/train': 2.035764694213867} 11/06/2021 22:01:33 - INFO - __main__ - Step 5494: {'lr': 0.0004993131142212974, 'samples': 1054848, 'steps': 5493, 'loss/train': 1.9107087850570679} 11/06/2021 22:01:33 - INFO - __main__ - Step 5495: {'lr': 0.0004993127210525815, 'samples': 1055040, 'steps': 5494, 'loss/train': 1.5598526000976562} 11/06/2021 22:01:34 - INFO - __main__ - Step 5496: {'lr': 0.0004993123277715292, 'samples': 1055232, 'steps': 5495, 'loss/train': 1.6145435571670532} 11/06/2021 22:01:34 - INFO - __main__ - Step 5497: {'lr': 0.0004993119343781406, 'samples': 1055424, 'steps': 5496, 'loss/train': 1.697355031967163} 11/06/2021 22:01:35 - INFO - __main__ - Step 5498: {'lr': 0.0004993115408724159, 'samples': 1055616, 'steps': 5497, 'loss/train': 1.3558404445648193} 11/06/2021 22:01:36 - INFO - __main__ - Step 5499: {'lr': 0.0004993111472543552, 'samples': 1055808, 'steps': 5498, 'loss/train': 1.6926814317703247} 11/06/2021 22:01:36 - INFO - __main__ - Step 5500: {'lr': 0.0004993107535239588, 'samples': 1056000, 'steps': 5499, 'loss/train': 1.3640247583389282} 11/06/2021 22:01:36 - INFO - __main__ - Step 5501: {'lr': 0.0004993103596812267, 'samples': 1056192, 'steps': 5500, 'loss/train': 1.8864690065383911} 11/06/2021 22:01:37 - INFO - __main__ - Step 5502: {'lr': 0.0004993099657261594, 'samples': 1056384, 'steps': 5501, 'loss/train': 2.0299696922302246} 11/06/2021 22:01:37 - INFO - __main__ - Step 5503: {'lr': 0.0004993095716587568, 'samples': 1056576, 'steps': 5502, 'loss/train': 2.2388596534729004} 11/06/2021 22:01:38 - INFO - __main__ - Step 5504: {'lr': 0.0004993091774790191, 'samples': 1056768, 'steps': 5503, 'loss/train': 1.5079683065414429} 11/06/2021 22:01:38 - INFO - __main__ - Step 5505: {'lr': 0.0004993087831869466, 'samples': 1056960, 'steps': 5504, 'loss/train': 2.0037267208099365} 11/06/2021 22:01:39 - INFO - __main__ - Step 5506: {'lr': 0.0004993083887825393, 'samples': 1057152, 'steps': 5505, 'loss/train': 2.18703556060791} 11/06/2021 22:01:39 - INFO - __main__ - Step 5507: {'lr': 0.0004993079942657976, 'samples': 1057344, 'steps': 5506, 'loss/train': 1.9802910089492798} 11/06/2021 22:01:39 - INFO - __main__ - Step 5508: {'lr': 0.0004993075996367215, 'samples': 1057536, 'steps': 5507, 'loss/train': 2.6383426189422607} 11/06/2021 22:01:40 - INFO - __main__ - Step 5509: {'lr': 0.0004993072048953113, 'samples': 1057728, 'steps': 5508, 'loss/train': 2.038627862930298} 11/06/2021 22:01:41 - INFO - __main__ - Step 5510: {'lr': 0.0004993068100415671, 'samples': 1057920, 'steps': 5509, 'loss/train': 1.877155065536499} 11/06/2021 22:01:41 - INFO - __main__ - Step 5511: {'lr': 0.000499306415075489, 'samples': 1058112, 'steps': 5510, 'loss/train': 1.7391774654388428} 11/06/2021 22:01:41 - INFO - __main__ - Step 5512: {'lr': 0.0004993060199970774, 'samples': 1058304, 'steps': 5511, 'loss/train': 2.053729772567749} 11/06/2021 22:01:42 - INFO - __main__ - Step 5513: {'lr': 0.0004993056248063323, 'samples': 1058496, 'steps': 5512, 'loss/train': 2.05024790763855} 11/06/2021 22:01:43 - INFO - __main__ - Step 5514: {'lr': 0.000499305229503254, 'samples': 1058688, 'steps': 5513, 'loss/train': 1.8646981716156006} 11/06/2021 22:01:43 - INFO - __main__ - Step 5515: {'lr': 0.0004993048340878425, 'samples': 1058880, 'steps': 5514, 'loss/train': 2.1601507663726807} 11/06/2021 22:01:44 - INFO - __main__ - Step 5516: {'lr': 0.0004993044385600982, 'samples': 1059072, 'steps': 5515, 'loss/train': 1.9711893796920776} 11/06/2021 22:01:44 - INFO - __main__ - Step 5517: {'lr': 0.0004993040429200211, 'samples': 1059264, 'steps': 5516, 'loss/train': 2.012702465057373} 11/06/2021 22:01:44 - INFO - __main__ - Step 5518: {'lr': 0.0004993036471676115, 'samples': 1059456, 'steps': 5517, 'loss/train': 1.4311362504959106} 11/06/2021 22:01:45 - INFO - __main__ - Step 5519: {'lr': 0.0004993032513028695, 'samples': 1059648, 'steps': 5518, 'loss/train': 1.7759687900543213} 11/06/2021 22:01:45 - INFO - __main__ - Step 5520: {'lr': 0.0004993028553257952, 'samples': 1059840, 'steps': 5519, 'loss/train': 2.569220781326294} 11/06/2021 22:01:46 - INFO - __main__ - Step 5521: {'lr': 0.000499302459236389, 'samples': 1060032, 'steps': 5520, 'loss/train': 1.6093146800994873} 11/06/2021 22:01:46 - INFO - __main__ - Step 5522: {'lr': 0.0004993020630346509, 'samples': 1060224, 'steps': 5521, 'loss/train': 2.444692611694336} 11/06/2021 22:01:47 - INFO - __main__ - Step 5523: {'lr': 0.0004993016667205812, 'samples': 1060416, 'steps': 5522, 'loss/train': 2.1542999744415283} 11/06/2021 22:01:48 - INFO - __main__ - Step 5524: {'lr': 0.0004993012702941799, 'samples': 1060608, 'steps': 5523, 'loss/train': 1.754870891571045} 11/06/2021 22:01:48 - INFO - __main__ - Step 5525: {'lr': 0.0004993008737554474, 'samples': 1060800, 'steps': 5524, 'loss/train': 1.7959246635437012} 11/06/2021 22:01:48 - INFO - __main__ - Step 5526: {'lr': 0.0004993004771043837, 'samples': 1060992, 'steps': 5525, 'loss/train': 2.041109800338745} 11/06/2021 22:01:49 - INFO - __main__ - Step 5527: {'lr': 0.0004993000803409891, 'samples': 1061184, 'steps': 5526, 'loss/train': 2.20894455909729} 11/06/2021 22:01:49 - INFO - __main__ - Step 5528: {'lr': 0.0004992996834652638, 'samples': 1061376, 'steps': 5527, 'loss/train': 1.942233681678772} 11/06/2021 22:01:50 - INFO - __main__ - Step 5529: {'lr': 0.0004992992864772079, 'samples': 1061568, 'steps': 5528, 'loss/train': 1.7475767135620117} 11/06/2021 22:01:50 - INFO - __main__ - Step 5530: {'lr': 0.0004992988893768214, 'samples': 1061760, 'steps': 5529, 'loss/train': 2.076091766357422} 11/06/2021 22:01:51 - INFO - __main__ - Step 5531: {'lr': 0.0004992984921641048, 'samples': 1061952, 'steps': 5530, 'loss/train': 2.0082509517669678} 11/06/2021 22:01:51 - INFO - __main__ - Step 5532: {'lr': 0.0004992980948390582, 'samples': 1062144, 'steps': 5531, 'loss/train': 1.9930384159088135} 11/06/2021 22:01:51 - INFO - __main__ - Step 5533: {'lr': 0.0004992976974016817, 'samples': 1062336, 'steps': 5532, 'loss/train': 2.128443479537964} 11/06/2021 22:01:52 - INFO - __main__ - Step 5534: {'lr': 0.0004992972998519755, 'samples': 1062528, 'steps': 5533, 'loss/train': 1.8396596908569336} 11/06/2021 22:01:53 - INFO - __main__ - Step 5535: {'lr': 0.0004992969021899397, 'samples': 1062720, 'steps': 5534, 'loss/train': 1.7834552526474} 11/06/2021 22:01:53 - INFO - __main__ - Step 5536: {'lr': 0.0004992965044155746, 'samples': 1062912, 'steps': 5535, 'loss/train': 1.7400227785110474} 11/06/2021 22:01:54 - INFO - __main__ - Step 5537: {'lr': 0.0004992961065288803, 'samples': 1063104, 'steps': 5536, 'loss/train': 1.9240829944610596} 11/06/2021 22:01:54 - INFO - __main__ - Step 5538: {'lr': 0.0004992957085298571, 'samples': 1063296, 'steps': 5537, 'loss/train': 1.3260631561279297} 11/06/2021 22:01:54 - INFO - __main__ - Step 5539: {'lr': 0.0004992953104185052, 'samples': 1063488, 'steps': 5538, 'loss/train': 1.4942200183868408} 11/06/2021 22:01:55 - INFO - __main__ - Step 5540: {'lr': 0.0004992949121948245, 'samples': 1063680, 'steps': 5539, 'loss/train': 1.48552668094635} 11/06/2021 22:01:56 - INFO - __main__ - Step 5541: {'lr': 0.0004992945138588154, 'samples': 1063872, 'steps': 5540, 'loss/train': 1.611240267753601} 11/06/2021 22:01:56 - INFO - __main__ - Step 5542: {'lr': 0.0004992941154104781, 'samples': 1064064, 'steps': 5541, 'loss/train': 1.437373399734497} 11/06/2021 22:01:56 - INFO - __main__ - Step 5543: {'lr': 0.0004992937168498126, 'samples': 1064256, 'steps': 5542, 'loss/train': 1.7761310338974} 11/06/2021 22:01:57 - INFO - __main__ - Step 5544: {'lr': 0.0004992933181768194, 'samples': 1064448, 'steps': 5543, 'loss/train': 2.0214922428131104} 11/06/2021 22:01:58 - INFO - __main__ - Step 5545: {'lr': 0.0004992929193914983, 'samples': 1064640, 'steps': 5544, 'loss/train': 1.7681033611297607} 11/06/2021 22:01:58 - INFO - __main__ - Step 5546: {'lr': 0.0004992925204938498, 'samples': 1064832, 'steps': 5545, 'loss/train': 2.075996160507202} 11/06/2021 22:01:59 - INFO - __main__ - Step 5547: {'lr': 0.0004992921214838738, 'samples': 1065024, 'steps': 5546, 'loss/train': 1.834026575088501} 11/06/2021 22:01:59 - INFO - __main__ - Step 5548: {'lr': 0.0004992917223615706, 'samples': 1065216, 'steps': 5547, 'loss/train': 1.4636272192001343} 11/06/2021 22:01:59 - INFO - __main__ - Step 5549: {'lr': 0.0004992913231269405, 'samples': 1065408, 'steps': 5548, 'loss/train': 1.9245575666427612} 11/06/2021 22:02:00 - INFO - __main__ - Step 5550: {'lr': 0.0004992909237799835, 'samples': 1065600, 'steps': 5549, 'loss/train': 2.2728660106658936} 11/06/2021 22:02:01 - INFO - __main__ - Step 5551: {'lr': 0.0004992905243206999, 'samples': 1065792, 'steps': 5550, 'loss/train': 1.8605475425720215} 11/06/2021 22:02:01 - INFO - __main__ - Step 5552: {'lr': 0.0004992901247490899, 'samples': 1065984, 'steps': 5551, 'loss/train': 2.065237522125244} 11/06/2021 22:02:01 - INFO - __main__ - Step 5553: {'lr': 0.0004992897250651535, 'samples': 1066176, 'steps': 5552, 'loss/train': 2.103445529937744} 11/06/2021 22:02:02 - INFO - __main__ - Step 5554: {'lr': 0.000499289325268891, 'samples': 1066368, 'steps': 5553, 'loss/train': 1.8733552694320679} 11/06/2021 22:02:03 - INFO - __main__ - Step 5555: {'lr': 0.0004992889253603027, 'samples': 1066560, 'steps': 5554, 'loss/train': 1.6204419136047363} 11/06/2021 22:02:03 - INFO - __main__ - Step 5556: {'lr': 0.0004992885253393885, 'samples': 1066752, 'steps': 5555, 'loss/train': 1.7640080451965332} 11/06/2021 22:02:04 - INFO - __main__ - Step 5557: {'lr': 0.0004992881252061489, 'samples': 1066944, 'steps': 5556, 'loss/train': 1.724705457687378} 11/06/2021 22:02:04 - INFO - __main__ - Step 5558: {'lr': 0.0004992877249605838, 'samples': 1067136, 'steps': 5557, 'loss/train': 2.2034895420074463} 11/06/2021 22:02:04 - INFO - __main__ - Step 5559: {'lr': 0.0004992873246026935, 'samples': 1067328, 'steps': 5558, 'loss/train': 2.039024591445923} 11/06/2021 22:02:05 - INFO - __main__ - Step 5560: {'lr': 0.0004992869241324783, 'samples': 1067520, 'steps': 5559, 'loss/train': 1.9477410316467285} 11/06/2021 22:02:06 - INFO - __main__ - Step 5561: {'lr': 0.000499286523549938, 'samples': 1067712, 'steps': 5560, 'loss/train': 1.9774774312973022} 11/06/2021 22:02:06 - INFO - __main__ - Step 5562: {'lr': 0.0004992861228550733, 'samples': 1067904, 'steps': 5561, 'loss/train': 2.137509822845459} 11/06/2021 22:02:06 - INFO - __main__ - Step 5563: {'lr': 0.0004992857220478841, 'samples': 1068096, 'steps': 5562, 'loss/train': 1.9436842203140259} 11/06/2021 22:02:07 - INFO - __main__ - Step 5564: {'lr': 0.0004992853211283705, 'samples': 1068288, 'steps': 5563, 'loss/train': 2.0045459270477295} 11/06/2021 22:02:08 - INFO - __main__ - Step 5565: {'lr': 0.0004992849200965327, 'samples': 1068480, 'steps': 5564, 'loss/train': 1.8353327512741089} 11/06/2021 22:02:08 - INFO - __main__ - Step 5566: {'lr': 0.0004992845189523711, 'samples': 1068672, 'steps': 5565, 'loss/train': 2.0094947814941406} 11/06/2021 22:02:08 - INFO - __main__ - Step 5567: {'lr': 0.0004992841176958858, 'samples': 1068864, 'steps': 5566, 'loss/train': 1.8892325162887573} 11/06/2021 22:02:09 - INFO - __main__ - Step 5568: {'lr': 0.0004992837163270769, 'samples': 1069056, 'steps': 5567, 'loss/train': 1.280644178390503} 11/06/2021 22:02:09 - INFO - __main__ - Step 5569: {'lr': 0.0004992833148459445, 'samples': 1069248, 'steps': 5568, 'loss/train': 2.168335199356079} 11/06/2021 22:02:10 - INFO - __main__ - Step 5570: {'lr': 0.0004992829132524889, 'samples': 1069440, 'steps': 5569, 'loss/train': 1.7364050149917603} 11/06/2021 22:02:11 - INFO - __main__ - Step 5571: {'lr': 0.0004992825115467102, 'samples': 1069632, 'steps': 5570, 'loss/train': 1.8209835290908813} 11/06/2021 22:02:11 - INFO - __main__ - Step 5572: {'lr': 0.0004992821097286088, 'samples': 1069824, 'steps': 5571, 'loss/train': 2.0928235054016113} 11/06/2021 22:02:11 - INFO - __main__ - Step 5573: {'lr': 0.0004992817077981846, 'samples': 1070016, 'steps': 5572, 'loss/train': 1.939453125} 11/06/2021 22:02:12 - INFO - __main__ - Step 5574: {'lr': 0.000499281305755438, 'samples': 1070208, 'steps': 5573, 'loss/train': 1.4345474243164062} 11/06/2021 22:02:13 - INFO - __main__ - Step 5575: {'lr': 0.0004992809036003691, 'samples': 1070400, 'steps': 5574, 'loss/train': 1.5226601362228394} 11/06/2021 22:02:13 - INFO - __main__ - Step 5576: {'lr': 0.000499280501332978, 'samples': 1070592, 'steps': 5575, 'loss/train': 1.6950485706329346} 11/06/2021 22:02:13 - INFO - __main__ - Step 5577: {'lr': 0.000499280098953265, 'samples': 1070784, 'steps': 5576, 'loss/train': 1.7863175868988037} 11/06/2021 22:02:14 - INFO - __main__ - Step 5578: {'lr': 0.0004992796964612302, 'samples': 1070976, 'steps': 5577, 'loss/train': 2.1370227336883545} 11/06/2021 22:02:14 - INFO - __main__ - Step 5579: {'lr': 0.0004992792938568739, 'samples': 1071168, 'steps': 5578, 'loss/train': 1.6954041719436646} 11/06/2021 22:02:15 - INFO - __main__ - Step 5580: {'lr': 0.0004992788911401961, 'samples': 1071360, 'steps': 5579, 'loss/train': 1.965846300125122} 11/06/2021 22:02:15 - INFO - __main__ - Step 5581: {'lr': 0.0004992784883111972, 'samples': 1071552, 'steps': 5580, 'loss/train': 2.0566160678863525} 11/06/2021 22:02:16 - INFO - __main__ - Step 5582: {'lr': 0.0004992780853698771, 'samples': 1071744, 'steps': 5581, 'loss/train': 2.0826704502105713} 11/06/2021 22:02:16 - INFO - __main__ - Step 5583: {'lr': 0.0004992776823162362, 'samples': 1071936, 'steps': 5582, 'loss/train': 1.6253973245620728} 11/06/2021 22:02:16 - INFO - __main__ - Step 5584: {'lr': 0.0004992772791502746, 'samples': 1072128, 'steps': 5583, 'loss/train': 1.7292557954788208} 11/06/2021 22:02:18 - INFO - __main__ - Step 5585: {'lr': 0.0004992768758719926, 'samples': 1072320, 'steps': 5584, 'loss/train': 2.4336965084075928} 11/06/2021 22:02:18 - INFO - __main__ - Step 5586: {'lr': 0.0004992764724813902, 'samples': 1072512, 'steps': 5585, 'loss/train': 1.5816513299942017} 11/06/2021 22:02:18 - INFO - __main__ - Step 5587: {'lr': 0.0004992760689784677, 'samples': 1072704, 'steps': 5586, 'loss/train': 1.2834484577178955} 11/06/2021 22:02:19 - INFO - __main__ - Step 5588: {'lr': 0.0004992756653632252, 'samples': 1072896, 'steps': 5587, 'loss/train': 2.1524899005889893} 11/06/2021 22:02:19 - INFO - __main__ - Step 5589: {'lr': 0.0004992752616356631, 'samples': 1073088, 'steps': 5588, 'loss/train': 2.2790422439575195} 11/06/2021 22:02:19 - INFO - __main__ - Step 5590: {'lr': 0.0004992748577957812, 'samples': 1073280, 'steps': 5589, 'loss/train': 1.7279716730117798} 11/06/2021 22:02:21 - INFO - __main__ - Step 5591: {'lr': 0.00049927445384358, 'samples': 1073472, 'steps': 5590, 'loss/train': 1.8624963760375977} 11/06/2021 22:02:21 - INFO - __main__ - Step 5592: {'lr': 0.0004992740497790595, 'samples': 1073664, 'steps': 5591, 'loss/train': 2.11755108833313} 11/06/2021 22:02:21 - INFO - __main__ - Step 5593: {'lr': 0.0004992736456022201, 'samples': 1073856, 'steps': 5592, 'loss/train': 6.259105205535889} 11/06/2021 22:02:22 - INFO - __main__ - Step 5594: {'lr': 0.0004992732413130617, 'samples': 1074048, 'steps': 5593, 'loss/train': 2.766812562942505} 11/06/2021 22:02:22 - INFO - __main__ - Step 5595: {'lr': 0.0004992728369115848, 'samples': 1074240, 'steps': 5594, 'loss/train': 0.8529731631278992} 11/06/2021 22:02:22 - INFO - __main__ - Step 5596: {'lr': 0.0004992724323977893, 'samples': 1074432, 'steps': 5595, 'loss/train': 1.5423051118850708} 11/06/2021 22:02:23 - INFO - __main__ - Step 5597: {'lr': 0.0004992720277716755, 'samples': 1074624, 'steps': 5596, 'loss/train': 1.7895461320877075} 11/06/2021 22:02:24 - INFO - __main__ - Step 5598: {'lr': 0.0004992716230332435, 'samples': 1074816, 'steps': 5597, 'loss/train': 1.8931951522827148} 11/06/2021 22:02:24 - INFO - __main__ - Step 5599: {'lr': 0.0004992712181824936, 'samples': 1075008, 'steps': 5598, 'loss/train': 2.938845634460449} 11/06/2021 22:02:24 - INFO - __main__ - Step 5600: {'lr': 0.0004992708132194259, 'samples': 1075200, 'steps': 5599, 'loss/train': 1.7190282344818115} 11/06/2021 22:02:25 - INFO - __main__ - Step 5601: {'lr': 0.0004992704081440407, 'samples': 1075392, 'steps': 5600, 'loss/train': 2.1930367946624756} 11/06/2021 22:02:26 - INFO - __main__ - Step 5602: {'lr': 0.0004992700029563381, 'samples': 1075584, 'steps': 5601, 'loss/train': 1.8844269514083862} 11/06/2021 22:02:26 - INFO - __main__ - Step 5603: {'lr': 0.0004992695976563182, 'samples': 1075776, 'steps': 5602, 'loss/train': 2.1230838298797607} 11/06/2021 22:02:26 - INFO - __main__ - Step 5604: {'lr': 0.0004992691922439814, 'samples': 1075968, 'steps': 5603, 'loss/train': 1.6668339967727661} 11/06/2021 22:02:27 - INFO - __main__ - Step 5605: {'lr': 0.0004992687867193277, 'samples': 1076160, 'steps': 5604, 'loss/train': 1.7833575010299683} 11/06/2021 22:02:27 - INFO - __main__ - Step 5606: {'lr': 0.0004992683810823572, 'samples': 1076352, 'steps': 5605, 'loss/train': 1.914794683456421} 11/06/2021 22:02:28 - INFO - __main__ - Step 5607: {'lr': 0.0004992679753330703, 'samples': 1076544, 'steps': 5606, 'loss/train': 1.698716163635254} 11/06/2021 22:02:28 - INFO - __main__ - Step 5608: {'lr': 0.0004992675694714671, 'samples': 1076736, 'steps': 5607, 'loss/train': 2.1556620597839355} 11/06/2021 22:02:29 - INFO - __main__ - Step 5609: {'lr': 0.0004992671634975477, 'samples': 1076928, 'steps': 5608, 'loss/train': 1.4872366189956665} 11/06/2021 22:02:29 - INFO - __main__ - Step 5610: {'lr': 0.0004992667574113125, 'samples': 1077120, 'steps': 5609, 'loss/train': 1.928450345993042} 11/06/2021 22:02:29 - INFO - __main__ - Step 5611: {'lr': 0.0004992663512127615, 'samples': 1077312, 'steps': 5610, 'loss/train': 2.2512283325195312} 11/06/2021 22:02:31 - INFO - __main__ - Step 5612: {'lr': 0.0004992659449018949, 'samples': 1077504, 'steps': 5611, 'loss/train': 1.50386643409729} 11/06/2021 22:02:31 - INFO - __main__ - Step 5613: {'lr': 0.0004992655384787129, 'samples': 1077696, 'steps': 5612, 'loss/train': 2.3416426181793213} 11/06/2021 22:02:31 - INFO - __main__ - Step 5614: {'lr': 0.0004992651319432157, 'samples': 1077888, 'steps': 5613, 'loss/train': 2.967033863067627} 11/06/2021 22:02:32 - INFO - __main__ - Step 5615: {'lr': 0.0004992647252954035, 'samples': 1078080, 'steps': 5614, 'loss/train': 2.203519582748413} 11/06/2021 22:02:32 - INFO - __main__ - Step 5616: {'lr': 0.0004992643185352765, 'samples': 1078272, 'steps': 5615, 'loss/train': 2.0697097778320312} 11/06/2021 22:02:32 - INFO - __main__ - Step 5617: {'lr': 0.0004992639116628349, 'samples': 1078464, 'steps': 5616, 'loss/train': 2.1669209003448486} 11/06/2021 22:02:34 - INFO - __main__ - Step 5618: {'lr': 0.0004992635046780786, 'samples': 1078656, 'steps': 5617, 'loss/train': 2.434951066970825} 11/06/2021 22:02:34 - INFO - __main__ - Step 5619: {'lr': 0.0004992630975810083, 'samples': 1078848, 'steps': 5618, 'loss/train': 2.055948257446289} 11/06/2021 22:02:35 - INFO - __main__ - Step 5620: {'lr': 0.0004992626903716237, 'samples': 1079040, 'steps': 5619, 'loss/train': 0.3906489312648773} 11/06/2021 22:02:35 - INFO - __main__ - Step 5621: {'lr': 0.0004992622830499252, 'samples': 1079232, 'steps': 5620, 'loss/train': 0.3192686438560486} 11/06/2021 22:02:35 - INFO - __main__ - Step 5622: {'lr': 0.000499261875615913, 'samples': 1079424, 'steps': 5621, 'loss/train': 1.627442479133606} 11/06/2021 22:02:36 - INFO - __main__ - Step 5623: {'lr': 0.0004992614680695872, 'samples': 1079616, 'steps': 5622, 'loss/train': 2.138500452041626} 11/06/2021 22:02:37 - INFO - __main__ - Step 5624: {'lr': 0.0004992610604109481, 'samples': 1079808, 'steps': 5623, 'loss/train': 2.165574312210083} 11/06/2021 22:02:37 - INFO - __main__ - Step 5625: {'lr': 0.0004992606526399957, 'samples': 1080000, 'steps': 5624, 'loss/train': 1.8479472398757935} 11/06/2021 22:02:37 - INFO - __main__ - Step 5626: {'lr': 0.0004992602447567304, 'samples': 1080192, 'steps': 5625, 'loss/train': 2.4740467071533203} 11/06/2021 22:02:38 - INFO - __main__ - Step 5627: {'lr': 0.0004992598367611523, 'samples': 1080384, 'steps': 5626, 'loss/train': 1.5033149719238281} 11/06/2021 22:02:38 - INFO - __main__ - Step 5628: {'lr': 0.0004992594286532615, 'samples': 1080576, 'steps': 5627, 'loss/train': 1.958828330039978} 11/06/2021 22:02:39 - INFO - __main__ - Step 5629: {'lr': 0.0004992590204330583, 'samples': 1080768, 'steps': 5628, 'loss/train': 1.696641445159912} 11/06/2021 22:02:39 - INFO - __main__ - Step 5630: {'lr': 0.0004992586121005427, 'samples': 1080960, 'steps': 5629, 'loss/train': 0.8877306580543518} 11/06/2021 22:02:40 - INFO - __main__ - Step 5631: {'lr': 0.0004992582036557152, 'samples': 1081152, 'steps': 5630, 'loss/train': 2.333393096923828} 11/06/2021 22:02:40 - INFO - __main__ - Step 5632: {'lr': 0.0004992577950985757, 'samples': 1081344, 'steps': 5631, 'loss/train': 2.0259532928466797} 11/06/2021 22:02:40 - INFO - __main__ - Step 5633: {'lr': 0.0004992573864291244, 'samples': 1081536, 'steps': 5632, 'loss/train': 1.8936342000961304} 11/06/2021 22:02:41 - INFO - __main__ - Step 5634: {'lr': 0.0004992569776473616, 'samples': 1081728, 'steps': 5633, 'loss/train': 1.8749663829803467} 11/06/2021 22:02:42 - INFO - __main__ - Step 5635: {'lr': 0.0004992565687532875, 'samples': 1081920, 'steps': 5634, 'loss/train': 1.5571837425231934} 11/06/2021 22:02:42 - INFO - __main__ - Step 5636: {'lr': 0.0004992561597469023, 'samples': 1082112, 'steps': 5635, 'loss/train': 2.0260841846466064} 11/06/2021 22:02:42 - INFO - __main__ - Step 5637: {'lr': 0.0004992557506282061, 'samples': 1082304, 'steps': 5636, 'loss/train': 1.9845712184906006} 11/06/2021 22:02:43 - INFO - __main__ - Step 5638: {'lr': 0.0004992553413971991, 'samples': 1082496, 'steps': 5637, 'loss/train': 1.8022942543029785} 11/06/2021 22:02:44 - INFO - __main__ - Step 5639: {'lr': 0.0004992549320538814, 'samples': 1082688, 'steps': 5638, 'loss/train': 2.3104302883148193} 11/06/2021 22:02:44 - INFO - __main__ - Step 5640: {'lr': 0.0004992545225982533, 'samples': 1082880, 'steps': 5639, 'loss/train': 1.7610740661621094} 11/06/2021 22:02:45 - INFO - __main__ - Step 5641: {'lr': 0.000499254113030315, 'samples': 1083072, 'steps': 5640, 'loss/train': 1.8128447532653809} 11/06/2021 22:02:45 - INFO - __main__ - Step 5642: {'lr': 0.0004992537033500667, 'samples': 1083264, 'steps': 5641, 'loss/train': 1.8965189456939697} 11/06/2021 22:02:45 - INFO - __main__ - Step 5643: {'lr': 0.0004992532935575084, 'samples': 1083456, 'steps': 5642, 'loss/train': 2.2873661518096924} 11/06/2021 22:02:46 - INFO - __main__ - Step 5644: {'lr': 0.0004992528836526405, 'samples': 1083648, 'steps': 5643, 'loss/train': 2.314706325531006} 11/06/2021 22:02:47 - INFO - __main__ - Step 5645: {'lr': 0.0004992524736354631, 'samples': 1083840, 'steps': 5644, 'loss/train': 2.096315622329712} 11/06/2021 22:02:47 - INFO - __main__ - Step 5646: {'lr': 0.0004992520635059762, 'samples': 1084032, 'steps': 5645, 'loss/train': 1.736008644104004} 11/06/2021 22:02:47 - INFO - __main__ - Step 5647: {'lr': 0.0004992516532641804, 'samples': 1084224, 'steps': 5646, 'loss/train': 1.71793794631958} 11/06/2021 22:02:48 - INFO - __main__ - Step 5648: {'lr': 0.0004992512429100757, 'samples': 1084416, 'steps': 5647, 'loss/train': 1.7612890005111694} 11/06/2021 22:02:49 - INFO - __main__ - Step 5649: {'lr': 0.000499250832443662, 'samples': 1084608, 'steps': 5648, 'loss/train': 2.2662413120269775} 11/06/2021 22:02:49 - INFO - __main__ - Step 5650: {'lr': 0.0004992504218649398, 'samples': 1084800, 'steps': 5649, 'loss/train': 1.443153977394104} 11/06/2021 22:02:50 - INFO - __main__ - Step 5651: {'lr': 0.0004992500111739093, 'samples': 1084992, 'steps': 5650, 'loss/train': 2.055508852005005} 11/06/2021 22:02:50 - INFO - __main__ - Step 5652: {'lr': 0.0004992496003705705, 'samples': 1085184, 'steps': 5651, 'loss/train': 2.131808042526245} 11/06/2021 22:02:50 - INFO - __main__ - Step 5653: {'lr': 0.0004992491894549236, 'samples': 1085376, 'steps': 5652, 'loss/train': 1.7326874732971191} 11/06/2021 22:02:51 - INFO - __main__ - Step 5654: {'lr': 0.000499248778426969, 'samples': 1085568, 'steps': 5653, 'loss/train': 2.14742374420166} 11/06/2021 22:02:52 - INFO - __main__ - Step 5655: {'lr': 0.0004992483672867068, 'samples': 1085760, 'steps': 5654, 'loss/train': 1.887015461921692} 11/06/2021 22:02:52 - INFO - __main__ - Step 5656: {'lr': 0.000499247956034137, 'samples': 1085952, 'steps': 5655, 'loss/train': 0.4538571536540985} 11/06/2021 22:02:52 - INFO - __main__ - Step 5657: {'lr': 0.00049924754466926, 'samples': 1086144, 'steps': 5656, 'loss/train': 2.187472105026245} 11/06/2021 22:02:53 - INFO - __main__ - Step 5658: {'lr': 0.0004992471331920758, 'samples': 1086336, 'steps': 5657, 'loss/train': 1.9629387855529785} 11/06/2021 22:02:53 - INFO - __main__ - Step 5659: {'lr': 0.0004992467216025848, 'samples': 1086528, 'steps': 5658, 'loss/train': 2.4174067974090576} 11/06/2021 22:02:54 - INFO - __main__ - Step 5660: {'lr': 0.0004992463099007871, 'samples': 1086720, 'steps': 5659, 'loss/train': 1.6454474925994873} 11/06/2021 22:02:54 - INFO - __main__ - Step 5661: {'lr': 0.0004992458980866827, 'samples': 1086912, 'steps': 5660, 'loss/train': 1.8920400142669678} 11/06/2021 22:02:55 - INFO - __main__ - Step 5662: {'lr': 0.000499245486160272, 'samples': 1087104, 'steps': 5661, 'loss/train': 1.9282915592193604} 11/06/2021 22:02:55 - INFO - __main__ - Step 5663: {'lr': 0.0004992450741215552, 'samples': 1087296, 'steps': 5662, 'loss/train': 1.9078826904296875} 11/06/2021 22:02:55 - INFO - __main__ - Step 5664: {'lr': 0.0004992446619705324, 'samples': 1087488, 'steps': 5663, 'loss/train': 1.9330226182937622} 11/06/2021 22:02:56 - INFO - __main__ - Step 5665: {'lr': 0.0004992442497072037, 'samples': 1087680, 'steps': 5664, 'loss/train': 1.8962855339050293} 11/06/2021 22:02:57 - INFO - __main__ - Step 5666: {'lr': 0.0004992438373315694, 'samples': 1087872, 'steps': 5665, 'loss/train': 1.5366052389144897} 11/06/2021 22:02:57 - INFO - __main__ - Step 5667: {'lr': 0.0004992434248436298, 'samples': 1088064, 'steps': 5666, 'loss/train': 1.221272587776184} 11/06/2021 22:02:57 - INFO - __main__ - Step 5668: {'lr': 0.0004992430122433848, 'samples': 1088256, 'steps': 5667, 'loss/train': 1.9391791820526123} 11/06/2021 22:02:58 - INFO - __main__ - Step 5669: {'lr': 0.0004992425995308349, 'samples': 1088448, 'steps': 5668, 'loss/train': 1.7548898458480835} 11/06/2021 22:02:59 - INFO - __main__ - Step 5670: {'lr': 0.0004992421867059801, 'samples': 1088640, 'steps': 5669, 'loss/train': 1.8856055736541748} 11/06/2021 22:02:59 - INFO - __main__ - Step 5671: {'lr': 0.0004992417737688206, 'samples': 1088832, 'steps': 5670, 'loss/train': 1.850290298461914} 11/06/2021 22:02:59 - INFO - __main__ - Step 5672: {'lr': 0.0004992413607193566, 'samples': 1089024, 'steps': 5671, 'loss/train': 2.0641515254974365} 11/06/2021 22:03:00 - INFO - __main__ - Step 5673: {'lr': 0.0004992409475575882, 'samples': 1089216, 'steps': 5672, 'loss/train': 2.626786231994629} 11/06/2021 22:03:00 - INFO - __main__ - Step 5674: {'lr': 0.0004992405342835158, 'samples': 1089408, 'steps': 5673, 'loss/train': 1.913179636001587} 11/06/2021 22:03:01 - INFO - __main__ - Step 5675: {'lr': 0.0004992401208971394, 'samples': 1089600, 'steps': 5674, 'loss/train': 2.4290151596069336} 11/06/2021 22:03:02 - INFO - __main__ - Step 5676: {'lr': 0.0004992397073984592, 'samples': 1089792, 'steps': 5675, 'loss/train': 1.8842289447784424} 11/06/2021 22:03:02 - INFO - __main__ - Step 5677: {'lr': 0.0004992392937874755, 'samples': 1089984, 'steps': 5676, 'loss/train': 1.9053252935409546} 11/06/2021 22:03:02 - INFO - __main__ - Step 5678: {'lr': 0.0004992388800641885, 'samples': 1090176, 'steps': 5677, 'loss/train': 2.1851115226745605} 11/06/2021 22:03:03 - INFO - __main__ - Step 5679: {'lr': 0.0004992384662285981, 'samples': 1090368, 'steps': 5678, 'loss/train': 1.9551331996917725} 11/06/2021 22:03:04 - INFO - __main__ - Step 5680: {'lr': 0.0004992380522807049, 'samples': 1090560, 'steps': 5679, 'loss/train': 2.1385090351104736} 11/06/2021 22:03:04 - INFO - __main__ - Step 5681: {'lr': 0.0004992376382205088, 'samples': 1090752, 'steps': 5680, 'loss/train': 0.955967903137207} 11/06/2021 22:03:04 - INFO - __main__ - Step 5682: {'lr': 0.00049923722404801, 'samples': 1090944, 'steps': 5681, 'loss/train': 1.333667278289795} 11/06/2021 22:03:05 - INFO - __main__ - Step 5683: {'lr': 0.0004992368097632089, 'samples': 1091136, 'steps': 5682, 'loss/train': 2.1074047088623047} 11/06/2021 22:03:05 - INFO - __main__ - Step 5684: {'lr': 0.0004992363953661054, 'samples': 1091328, 'steps': 5683, 'loss/train': 1.597164273262024} 11/06/2021 22:03:06 - INFO - __main__ - Step 5685: {'lr': 0.0004992359808566999, 'samples': 1091520, 'steps': 5684, 'loss/train': 4.272282600402832} 11/06/2021 22:03:06 - INFO - __main__ - Step 5686: {'lr': 0.0004992355662349925, 'samples': 1091712, 'steps': 5685, 'loss/train': 2.0851457118988037} 11/06/2021 22:03:07 - INFO - __main__ - Step 5687: {'lr': 0.0004992351515009833, 'samples': 1091904, 'steps': 5686, 'loss/train': 1.8593014478683472} 11/06/2021 22:03:07 - INFO - __main__ - Step 5688: {'lr': 0.0004992347366546727, 'samples': 1092096, 'steps': 5687, 'loss/train': 2.107888698577881} 11/06/2021 22:03:07 - INFO - __main__ - Step 5689: {'lr': 0.0004992343216960607, 'samples': 1092288, 'steps': 5688, 'loss/train': 1.92064368724823} 11/06/2021 22:03:09 - INFO - __main__ - Step 5690: {'lr': 0.0004992339066251476, 'samples': 1092480, 'steps': 5689, 'loss/train': 1.6505532264709473} 11/06/2021 22:03:09 - INFO - __main__ - Step 5691: {'lr': 0.0004992334914419337, 'samples': 1092672, 'steps': 5690, 'loss/train': 1.4800523519515991} 11/06/2021 22:03:09 - INFO - __main__ - Step 5692: {'lr': 0.0004992330761464188, 'samples': 1092864, 'steps': 5691, 'loss/train': 1.566269874572754} 11/06/2021 22:03:10 - INFO - __main__ - Step 5693: {'lr': 0.0004992326607386034, 'samples': 1093056, 'steps': 5692, 'loss/train': 2.7816977500915527} 11/06/2021 22:03:10 - INFO - __main__ - Step 5694: {'lr': 0.0004992322452184876, 'samples': 1093248, 'steps': 5693, 'loss/train': 2.2784321308135986} 11/06/2021 22:03:10 - INFO - __main__ - Step 5695: {'lr': 0.0004992318295860718, 'samples': 1093440, 'steps': 5694, 'loss/train': 1.6132631301879883} 11/06/2021 22:03:11 - INFO - __main__ - Step 5696: {'lr': 0.0004992314138413557, 'samples': 1093632, 'steps': 5695, 'loss/train': 1.0506314039230347} 11/06/2021 22:03:12 - INFO - __main__ - Step 5697: {'lr': 0.0004992309979843398, 'samples': 1093824, 'steps': 5696, 'loss/train': 1.6640676259994507} 11/06/2021 22:03:12 - INFO - __main__ - Step 5698: {'lr': 0.0004992305820150243, 'samples': 1094016, 'steps': 5697, 'loss/train': 2.264589548110962} 11/06/2021 22:03:12 - INFO - __main__ - Step 5699: {'lr': 0.0004992301659334095, 'samples': 1094208, 'steps': 5698, 'loss/train': 1.779240369796753} 11/06/2021 22:03:13 - INFO - __main__ - Step 5700: {'lr': 0.0004992297497394953, 'samples': 1094400, 'steps': 5699, 'loss/train': 1.5789347887039185} 11/06/2021 22:03:14 - INFO - __main__ - Step 5701: {'lr': 0.000499229333433282, 'samples': 1094592, 'steps': 5700, 'loss/train': 1.8561713695526123} 11/06/2021 22:03:14 - INFO - __main__ - Step 5702: {'lr': 0.0004992289170147699, 'samples': 1094784, 'steps': 5701, 'loss/train': 2.206258535385132} 11/06/2021 22:03:15 - INFO - __main__ - Step 5703: {'lr': 0.000499228500483959, 'samples': 1094976, 'steps': 5702, 'loss/train': 1.1616374254226685} 11/06/2021 22:03:15 - INFO - __main__ - Step 5704: {'lr': 0.0004992280838408496, 'samples': 1095168, 'steps': 5703, 'loss/train': 2.011932611465454} 11/06/2021 22:03:15 - INFO - __main__ - Step 5705: {'lr': 0.0004992276670854419, 'samples': 1095360, 'steps': 5704, 'loss/train': 2.265223503112793} 11/06/2021 22:03:16 - INFO - __main__ - Step 5706: {'lr': 0.000499227250217736, 'samples': 1095552, 'steps': 5705, 'loss/train': 1.879677414894104} 11/06/2021 22:03:17 - INFO - __main__ - Step 5707: {'lr': 0.0004992268332377323, 'samples': 1095744, 'steps': 5706, 'loss/train': 1.6559354066848755} 11/06/2021 22:03:17 - INFO - __main__ - Step 5708: {'lr': 0.0004992264161454306, 'samples': 1095936, 'steps': 5707, 'loss/train': 2.132072925567627} 11/06/2021 22:03:17 - INFO - __main__ - Step 5709: {'lr': 0.0004992259989408316, 'samples': 1096128, 'steps': 5708, 'loss/train': 3.037383556365967} 11/06/2021 22:03:18 - INFO - __main__ - Step 5710: {'lr': 0.000499225581623935, 'samples': 1096320, 'steps': 5709, 'loss/train': 2.0813891887664795} 11/06/2021 22:03:18 - INFO - __main__ - Step 5711: {'lr': 0.0004992251641947412, 'samples': 1096512, 'steps': 5710, 'loss/train': 1.6070504188537598} 11/06/2021 22:03:19 - INFO - __main__ - Step 5712: {'lr': 0.0004992247466532504, 'samples': 1096704, 'steps': 5711, 'loss/train': 1.8336153030395508} 11/06/2021 22:03:20 - INFO - __main__ - Step 5713: {'lr': 0.0004992243289994629, 'samples': 1096896, 'steps': 5712, 'loss/train': 1.5240449905395508} 11/06/2021 22:03:20 - INFO - __main__ - Step 5714: {'lr': 0.0004992239112333787, 'samples': 1097088, 'steps': 5713, 'loss/train': 2.0184898376464844} 11/06/2021 22:03:20 - INFO - __main__ - Step 5715: {'lr': 0.000499223493354998, 'samples': 1097280, 'steps': 5714, 'loss/train': 1.668257713317871} 11/06/2021 22:03:21 - INFO - __main__ - Step 5716: {'lr': 0.0004992230753643211, 'samples': 1097472, 'steps': 5715, 'loss/train': 2.1014792919158936} 11/06/2021 22:03:22 - INFO - __main__ - Step 5717: {'lr': 0.0004992226572613481, 'samples': 1097664, 'steps': 5716, 'loss/train': 2.5767147541046143} 11/06/2021 22:03:22 - INFO - __main__ - Step 5718: {'lr': 0.0004992222390460792, 'samples': 1097856, 'steps': 5717, 'loss/train': 1.6296945810317993} 11/06/2021 22:03:23 - INFO - __main__ - Step 5719: {'lr': 0.0004992218207185146, 'samples': 1098048, 'steps': 5718, 'loss/train': 1.9573607444763184} 11/06/2021 22:03:23 - INFO - __main__ - Step 5720: {'lr': 0.0004992214022786546, 'samples': 1098240, 'steps': 5719, 'loss/train': 2.1646392345428467} 11/06/2021 22:03:23 - INFO - __main__ - Step 5721: {'lr': 0.0004992209837264991, 'samples': 1098432, 'steps': 5720, 'loss/train': 1.7333484888076782} 11/06/2021 22:03:24 - INFO - __main__ - Step 5722: {'lr': 0.0004992205650620487, 'samples': 1098624, 'steps': 5721, 'loss/train': 0.6370934844017029} 11/06/2021 22:03:25 - INFO - __main__ - Step 5723: {'lr': 0.0004992201462853032, 'samples': 1098816, 'steps': 5722, 'loss/train': 1.629151701927185} 11/06/2021 22:03:25 - INFO - __main__ - Step 5724: {'lr': 0.000499219727396263, 'samples': 1099008, 'steps': 5723, 'loss/train': 1.9365618228912354} 11/06/2021 22:03:25 - INFO - __main__ - Step 5725: {'lr': 0.0004992193083949282, 'samples': 1099200, 'steps': 5724, 'loss/train': 1.9601372480392456} 11/06/2021 22:03:26 - INFO - __main__ - Step 5726: {'lr': 0.000499218889281299, 'samples': 1099392, 'steps': 5725, 'loss/train': 2.3020355701446533} 11/06/2021 22:03:27 - INFO - __main__ - Step 5727: {'lr': 0.0004992184700553756, 'samples': 1099584, 'steps': 5726, 'loss/train': 1.6674587726593018} 11/06/2021 22:03:27 - INFO - __main__ - Step 5728: {'lr': 0.0004992180507171583, 'samples': 1099776, 'steps': 5727, 'loss/train': 1.6533688306808472} 11/06/2021 22:03:28 - INFO - __main__ - Step 5729: {'lr': 0.0004992176312666472, 'samples': 1099968, 'steps': 5728, 'loss/train': 2.2179229259490967} 11/06/2021 22:03:28 - INFO - __main__ - Step 5730: {'lr': 0.0004992172117038424, 'samples': 1100160, 'steps': 5729, 'loss/train': 1.1370854377746582} 11/06/2021 22:03:28 - INFO - __main__ - Step 5731: {'lr': 0.0004992167920287443, 'samples': 1100352, 'steps': 5730, 'loss/train': 1.7039726972579956} 11/06/2021 22:03:29 - INFO - __main__ - Step 5732: {'lr': 0.0004992163722413528, 'samples': 1100544, 'steps': 5731, 'loss/train': 1.9213690757751465} 11/06/2021 22:03:30 - INFO - __main__ - Step 5733: {'lr': 0.0004992159523416683, 'samples': 1100736, 'steps': 5732, 'loss/train': 1.352500081062317} 11/06/2021 22:03:30 - INFO - __main__ - Step 5734: {'lr': 0.000499215532329691, 'samples': 1100928, 'steps': 5733, 'loss/train': 2.3343346118927} 11/06/2021 22:03:30 - INFO - __main__ - Step 5735: {'lr': 0.000499215112205421, 'samples': 1101120, 'steps': 5734, 'loss/train': 2.6258137226104736} 11/06/2021 22:03:31 - INFO - __main__ - Step 5736: {'lr': 0.0004992146919688584, 'samples': 1101312, 'steps': 5735, 'loss/train': 2.2007718086242676} 11/06/2021 22:03:31 - INFO - __main__ - Step 5737: {'lr': 0.0004992142716200036, 'samples': 1101504, 'steps': 5736, 'loss/train': 1.8303169012069702} 11/06/2021 22:03:32 - INFO - __main__ - Step 5738: {'lr': 0.0004992138511588567, 'samples': 1101696, 'steps': 5737, 'loss/train': 1.5697263479232788} 11/06/2021 22:03:33 - INFO - __main__ - Step 5739: {'lr': 0.0004992134305854179, 'samples': 1101888, 'steps': 5738, 'loss/train': 1.949165940284729} 11/06/2021 22:03:33 - INFO - __main__ - Step 5740: {'lr': 0.0004992130098996873, 'samples': 1102080, 'steps': 5739, 'loss/train': 2.357881784439087} 11/06/2021 22:03:33 - INFO - __main__ - Step 5741: {'lr': 0.0004992125891016652, 'samples': 1102272, 'steps': 5740, 'loss/train': 2.1080918312072754} 11/06/2021 22:03:34 - INFO - __main__ - Step 5742: {'lr': 0.0004992121681913518, 'samples': 1102464, 'steps': 5741, 'loss/train': 1.897286295890808} 11/06/2021 22:03:35 - INFO - __main__ - Step 5743: {'lr': 0.0004992117471687472, 'samples': 1102656, 'steps': 5742, 'loss/train': 1.7857768535614014} 11/06/2021 22:03:35 - INFO - __main__ - Step 5744: {'lr': 0.0004992113260338517, 'samples': 1102848, 'steps': 5743, 'loss/train': 1.731091022491455} 11/06/2021 22:03:36 - INFO - __main__ - Step 5745: {'lr': 0.0004992109047866653, 'samples': 1103040, 'steps': 5744, 'loss/train': 0.806926429271698} 11/06/2021 22:03:36 - INFO - __main__ - Step 5746: {'lr': 0.0004992104834271884, 'samples': 1103232, 'steps': 5745, 'loss/train': 1.5223336219787598} 11/06/2021 22:03:36 - INFO - __main__ - Step 5747: {'lr': 0.0004992100619554211, 'samples': 1103424, 'steps': 5746, 'loss/train': 0.8918942809104919} 11/06/2021 22:03:37 - INFO - __main__ - Step 5748: {'lr': 0.0004992096403713635, 'samples': 1103616, 'steps': 5747, 'loss/train': 2.426661491394043} 11/06/2021 22:03:38 - INFO - __main__ - Step 5749: {'lr': 0.000499209218675016, 'samples': 1103808, 'steps': 5748, 'loss/train': 1.9452520608901978} 11/06/2021 22:03:38 - INFO - __main__ - Step 5750: {'lr': 0.0004992087968663786, 'samples': 1104000, 'steps': 5749, 'loss/train': 2.1779873371124268} 11/06/2021 22:03:38 - INFO - __main__ - Step 5751: {'lr': 0.0004992083749454515, 'samples': 1104192, 'steps': 5750, 'loss/train': 2.161979913711548} 11/06/2021 22:03:39 - INFO - __main__ - Step 5752: {'lr': 0.0004992079529122351, 'samples': 1104384, 'steps': 5751, 'loss/train': 2.0824427604675293} 11/06/2021 22:03:39 - INFO - __main__ - Step 5753: {'lr': 0.0004992075307667294, 'samples': 1104576, 'steps': 5752, 'loss/train': 1.917615532875061} 11/06/2021 22:03:40 - INFO - __main__ - Step 5754: {'lr': 0.0004992071085089346, 'samples': 1104768, 'steps': 5753, 'loss/train': 1.9957177639007568} 11/06/2021 22:03:40 - INFO - __main__ - Step 5755: {'lr': 0.0004992066861388509, 'samples': 1104960, 'steps': 5754, 'loss/train': 2.4890198707580566} 11/06/2021 22:03:41 - INFO - __main__ - Step 5756: {'lr': 0.0004992062636564786, 'samples': 1105152, 'steps': 5755, 'loss/train': 2.0366508960723877} 11/06/2021 22:03:41 - INFO - __main__ - Step 5757: {'lr': 0.0004992058410618177, 'samples': 1105344, 'steps': 5756, 'loss/train': 1.8771767616271973} 11/06/2021 22:03:41 - INFO - __main__ - Step 5758: {'lr': 0.0004992054183548685, 'samples': 1105536, 'steps': 5757, 'loss/train': 1.831110954284668} 11/06/2021 22:03:42 - INFO - __main__ - Step 5759: {'lr': 0.0004992049955356313, 'samples': 1105728, 'steps': 5758, 'loss/train': 1.9998888969421387} 11/06/2021 22:03:43 - INFO - __main__ - Step 5760: {'lr': 0.0004992045726041061, 'samples': 1105920, 'steps': 5759, 'loss/train': 1.6390290260314941} 11/06/2021 22:03:43 - INFO - __main__ - Step 5761: {'lr': 0.0004992041495602931, 'samples': 1106112, 'steps': 5760, 'loss/train': 1.8871279954910278} 11/06/2021 22:03:43 - INFO - __main__ - Step 5762: {'lr': 0.0004992037264041927, 'samples': 1106304, 'steps': 5761, 'loss/train': 2.7481179237365723} 11/06/2021 22:03:44 - INFO - __main__ - Step 5763: {'lr': 0.0004992033031358048, 'samples': 1106496, 'steps': 5762, 'loss/train': 2.292053699493408} 11/06/2021 22:03:45 - INFO - __main__ - Step 5764: {'lr': 0.0004992028797551298, 'samples': 1106688, 'steps': 5763, 'loss/train': 1.911152720451355} 11/06/2021 22:03:45 - INFO - __main__ - Step 5765: {'lr': 0.0004992024562621678, 'samples': 1106880, 'steps': 5764, 'loss/train': 2.004040479660034} 11/06/2021 22:03:46 - INFO - __main__ - Step 5766: {'lr': 0.0004992020326569191, 'samples': 1107072, 'steps': 5765, 'loss/train': 2.0469226837158203} 11/06/2021 22:03:46 - INFO - __main__ - Step 5767: {'lr': 0.0004992016089393837, 'samples': 1107264, 'steps': 5766, 'loss/train': 1.93135666847229} 11/06/2021 22:03:46 - INFO - __main__ - Step 5768: {'lr': 0.000499201185109562, 'samples': 1107456, 'steps': 5767, 'loss/train': 2.3704495429992676} 11/06/2021 22:03:47 - INFO - __main__ - Step 5769: {'lr': 0.000499200761167454, 'samples': 1107648, 'steps': 5768, 'loss/train': 1.7773163318634033} 11/06/2021 22:03:48 - INFO - __main__ - Step 5770: {'lr': 0.0004992003371130601, 'samples': 1107840, 'steps': 5769, 'loss/train': 1.401180386543274} 11/06/2021 22:03:48 - INFO - __main__ - Step 5771: {'lr': 0.0004991999129463803, 'samples': 1108032, 'steps': 5770, 'loss/train': 2.0761475563049316} 11/06/2021 22:03:48 - INFO - __main__ - Step 5772: {'lr': 0.0004991994886674148, 'samples': 1108224, 'steps': 5771, 'loss/train': 2.294243574142456} 11/06/2021 22:03:49 - INFO - __main__ - Step 5773: {'lr': 0.000499199064276164, 'samples': 1108416, 'steps': 5772, 'loss/train': 2.8149049282073975} 11/06/2021 22:03:50 - INFO - __main__ - Step 5774: {'lr': 0.0004991986397726278, 'samples': 1108608, 'steps': 5773, 'loss/train': 1.9633965492248535} 11/06/2021 22:03:50 - INFO - __main__ - Step 5775: {'lr': 0.0004991982151568066, 'samples': 1108800, 'steps': 5774, 'loss/train': 2.1700239181518555} 11/06/2021 22:03:50 - INFO - __main__ - Step 5776: {'lr': 0.0004991977904287006, 'samples': 1108992, 'steps': 5775, 'loss/train': 1.5528236627578735} 11/06/2021 22:03:51 - INFO - __main__ - Step 5777: {'lr': 0.0004991973655883099, 'samples': 1109184, 'steps': 5776, 'loss/train': 1.8211041688919067} 11/06/2021 22:03:51 - INFO - __main__ - Step 5778: {'lr': 0.0004991969406356346, 'samples': 1109376, 'steps': 5777, 'loss/train': 2.413984775543213} 11/06/2021 22:03:51 - INFO - __main__ - Step 5779: {'lr': 0.0004991965155706752, 'samples': 1109568, 'steps': 5778, 'loss/train': 1.5177364349365234} 11/06/2021 22:03:52 - INFO - __main__ - Step 5780: {'lr': 0.0004991960903934315, 'samples': 1109760, 'steps': 5779, 'loss/train': 1.6337430477142334} 11/06/2021 22:03:53 - INFO - __main__ - Step 5781: {'lr': 0.0004991956651039039, 'samples': 1109952, 'steps': 5780, 'loss/train': 1.6799068450927734} 11/06/2021 22:03:53 - INFO - __main__ - Step 5782: {'lr': 0.0004991952397020927, 'samples': 1110144, 'steps': 5781, 'loss/train': 1.9769641160964966} 11/06/2021 22:03:54 - INFO - __main__ - Step 5783: {'lr': 0.0004991948141879978, 'samples': 1110336, 'steps': 5782, 'loss/train': 1.8195029497146606} 11/06/2021 22:03:54 - INFO - __main__ - Step 5784: {'lr': 0.0004991943885616198, 'samples': 1110528, 'steps': 5783, 'loss/train': 1.771957516670227} 11/06/2021 22:03:55 - INFO - __main__ - Step 5785: {'lr': 0.0004991939628229585, 'samples': 1110720, 'steps': 5784, 'loss/train': 1.7779532670974731} 11/06/2021 22:03:55 - INFO - __main__ - Step 5786: {'lr': 0.0004991935369720143, 'samples': 1110912, 'steps': 5785, 'loss/train': 2.5987279415130615} 11/06/2021 22:03:56 - INFO - __main__ - Step 5787: {'lr': 0.0004991931110087873, 'samples': 1111104, 'steps': 5786, 'loss/train': 2.056739568710327} 11/06/2021 22:03:56 - INFO - __main__ - Step 5788: {'lr': 0.0004991926849332777, 'samples': 1111296, 'steps': 5787, 'loss/train': 2.098029851913452} 11/06/2021 22:03:56 - INFO - __main__ - Step 5789: {'lr': 0.0004991922587454858, 'samples': 1111488, 'steps': 5788, 'loss/train': 1.8450032472610474} 11/06/2021 22:03:57 - INFO - __main__ - Step 5790: {'lr': 0.0004991918324454117, 'samples': 1111680, 'steps': 5789, 'loss/train': 1.8697025775909424} 11/06/2021 22:03:58 - INFO - __main__ - Step 5791: {'lr': 0.0004991914060330556, 'samples': 1111872, 'steps': 5790, 'loss/train': 2.107398271560669} 11/06/2021 22:03:58 - INFO - __main__ - Step 5792: {'lr': 0.0004991909795084177, 'samples': 1112064, 'steps': 5791, 'loss/train': 1.6011704206466675} 11/06/2021 22:03:58 - INFO - __main__ - Step 5793: {'lr': 0.0004991905528714981, 'samples': 1112256, 'steps': 5792, 'loss/train': 1.8442002534866333} 11/06/2021 22:03:59 - INFO - __main__ - Step 5794: {'lr': 0.0004991901261222971, 'samples': 1112448, 'steps': 5793, 'loss/train': 2.1112143993377686} 11/06/2021 22:04:00 - INFO - __main__ - Step 5795: {'lr': 0.000499189699260815, 'samples': 1112640, 'steps': 5794, 'loss/train': 1.3628276586532593} 11/06/2021 22:04:00 - INFO - __main__ - Step 5796: {'lr': 0.0004991892722870517, 'samples': 1112832, 'steps': 5795, 'loss/train': 1.7392164468765259} 11/06/2021 22:04:00 - INFO - __main__ - Step 5797: {'lr': 0.0004991888452010076, 'samples': 1113024, 'steps': 5796, 'loss/train': 2.1500654220581055} 11/06/2021 22:04:01 - INFO - __main__ - Step 5798: {'lr': 0.000499188418002683, 'samples': 1113216, 'steps': 5797, 'loss/train': 2.0395941734313965} 11/06/2021 22:04:01 - INFO - __main__ - Step 5799: {'lr': 0.0004991879906920779, 'samples': 1113408, 'steps': 5798, 'loss/train': 1.4463250637054443} 11/06/2021 22:04:02 - INFO - __main__ - Step 5800: {'lr': 0.0004991875632691924, 'samples': 1113600, 'steps': 5799, 'loss/train': 2.434201717376709} 11/06/2021 22:04:02 - INFO - __main__ - Step 5801: {'lr': 0.0004991871357340269, 'samples': 1113792, 'steps': 5800, 'loss/train': 2.0661230087280273} 11/06/2021 22:04:03 - INFO - __main__ - Step 5802: {'lr': 0.0004991867080865815, 'samples': 1113984, 'steps': 5801, 'loss/train': 2.2368245124816895} 11/06/2021 22:04:03 - INFO - __main__ - Step 5803: {'lr': 0.0004991862803268564, 'samples': 1114176, 'steps': 5802, 'loss/train': 2.107743501663208} 11/06/2021 22:04:03 - INFO - __main__ - Step 5804: {'lr': 0.0004991858524548519, 'samples': 1114368, 'steps': 5803, 'loss/train': 1.269503116607666} 11/06/2021 22:04:05 - INFO - __main__ - Step 5805: {'lr': 0.000499185424470568, 'samples': 1114560, 'steps': 5804, 'loss/train': 1.8158258199691772} 11/06/2021 22:04:05 - INFO - __main__ - Step 5806: {'lr': 0.0004991849963740052, 'samples': 1114752, 'steps': 5805, 'loss/train': 1.9110217094421387} 11/06/2021 22:04:06 - INFO - __main__ - Step 5807: {'lr': 0.0004991845681651632, 'samples': 1114944, 'steps': 5806, 'loss/train': 1.8827478885650635} 11/06/2021 22:04:06 - INFO - __main__ - Step 5808: {'lr': 0.0004991841398440427, 'samples': 1115136, 'steps': 5807, 'loss/train': 1.9933255910873413} 11/06/2021 22:04:06 - INFO - __main__ - Step 5809: {'lr': 0.0004991837114106436, 'samples': 1115328, 'steps': 5808, 'loss/train': 2.1655633449554443} 11/06/2021 22:04:07 - INFO - __main__ - Step 5810: {'lr': 0.0004991832828649661, 'samples': 1115520, 'steps': 5809, 'loss/train': 1.6817560195922852} 11/06/2021 22:04:08 - INFO - __main__ - Step 5811: {'lr': 0.0004991828542070105, 'samples': 1115712, 'steps': 5810, 'loss/train': 0.6421383023262024} 11/06/2021 22:04:08 - INFO - __main__ - Step 5812: {'lr': 0.000499182425436777, 'samples': 1115904, 'steps': 5811, 'loss/train': 1.786048173904419} 11/06/2021 22:04:08 - INFO - __main__ - Step 5813: {'lr': 0.0004991819965542657, 'samples': 1116096, 'steps': 5812, 'loss/train': 1.7168254852294922} 11/06/2021 22:04:09 - INFO - __main__ - Step 5814: {'lr': 0.0004991815675594768, 'samples': 1116288, 'steps': 5813, 'loss/train': 1.5646218061447144} 11/06/2021 22:04:09 - INFO - __main__ - Step 5815: {'lr': 0.0004991811384524106, 'samples': 1116480, 'steps': 5814, 'loss/train': 1.233889102935791} 11/06/2021 22:04:10 - INFO - __main__ - Step 5816: {'lr': 0.0004991807092330671, 'samples': 1116672, 'steps': 5815, 'loss/train': 1.7671700716018677} 11/06/2021 22:04:10 - INFO - __main__ - Step 5817: {'lr': 0.0004991802799014467, 'samples': 1116864, 'steps': 5816, 'loss/train': 2.299095630645752} 11/06/2021 22:04:11 - INFO - __main__ - Step 5818: {'lr': 0.0004991798504575495, 'samples': 1117056, 'steps': 5817, 'loss/train': 1.8498998880386353} 11/06/2021 22:04:11 - INFO - __main__ - Step 5819: {'lr': 0.0004991794209013758, 'samples': 1117248, 'steps': 5818, 'loss/train': 1.8572213649749756} 11/06/2021 22:04:11 - INFO - __main__ - Step 5820: {'lr': 0.0004991789912329257, 'samples': 1117440, 'steps': 5819, 'loss/train': 0.8670408725738525} 11/06/2021 22:04:12 - INFO - __main__ - Step 5821: {'lr': 0.0004991785614521993, 'samples': 1117632, 'steps': 5820, 'loss/train': 3.0006051063537598} 11/06/2021 22:04:13 - INFO - __main__ - Step 5822: {'lr': 0.0004991781315591969, 'samples': 1117824, 'steps': 5821, 'loss/train': 2.1679015159606934} 11/06/2021 22:04:13 - INFO - __main__ - Step 5823: {'lr': 0.0004991777015539186, 'samples': 1118016, 'steps': 5822, 'loss/train': 2.232862710952759} 11/06/2021 22:04:13 - INFO - __main__ - Step 5824: {'lr': 0.0004991772714363649, 'samples': 1118208, 'steps': 5823, 'loss/train': 1.790226936340332} 11/06/2021 22:04:14 - INFO - __main__ - Step 5825: {'lr': 0.0004991768412065355, 'samples': 1118400, 'steps': 5824, 'loss/train': 1.278944492340088} 11/06/2021 22:04:15 - INFO - __main__ - Step 5826: {'lr': 0.000499176410864431, 'samples': 1118592, 'steps': 5825, 'loss/train': 1.5651650428771973} 11/06/2021 22:04:15 - INFO - __main__ - Step 5827: {'lr': 0.0004991759804100515, 'samples': 1118784, 'steps': 5826, 'loss/train': 1.8785316944122314} 11/06/2021 22:04:16 - INFO - __main__ - Step 5828: {'lr': 0.000499175549843397, 'samples': 1118976, 'steps': 5827, 'loss/train': 2.057476282119751} 11/06/2021 22:04:16 - INFO - __main__ - Step 5829: {'lr': 0.0004991751191644679, 'samples': 1119168, 'steps': 5828, 'loss/train': 2.047145366668701} 11/06/2021 22:04:16 - INFO - __main__ - Step 5830: {'lr': 0.0004991746883732644, 'samples': 1119360, 'steps': 5829, 'loss/train': 1.9324302673339844} 11/06/2021 22:04:17 - INFO - __main__ - Step 5831: {'lr': 0.0004991742574697866, 'samples': 1119552, 'steps': 5830, 'loss/train': 1.9625569581985474} 11/06/2021 22:04:18 - INFO - __main__ - Step 5832: {'lr': 0.0004991738264540347, 'samples': 1119744, 'steps': 5831, 'loss/train': 2.0301690101623535} 11/06/2021 22:04:18 - INFO - __main__ - Step 5833: {'lr': 0.0004991733953260089, 'samples': 1119936, 'steps': 5832, 'loss/train': 2.2081518173217773} 11/06/2021 22:04:18 - INFO - __main__ - Step 5834: {'lr': 0.0004991729640857095, 'samples': 1120128, 'steps': 5833, 'loss/train': 1.706737756729126} 11/06/2021 22:04:19 - INFO - __main__ - Step 5835: {'lr': 0.0004991725327331366, 'samples': 1120320, 'steps': 5834, 'loss/train': 1.6535284519195557} 11/06/2021 22:04:19 - INFO - __main__ - Step 5836: {'lr': 0.0004991721012682903, 'samples': 1120512, 'steps': 5835, 'loss/train': 1.5109832286834717} 11/06/2021 22:04:20 - INFO - __main__ - Step 5837: {'lr': 0.0004991716696911709, 'samples': 1120704, 'steps': 5836, 'loss/train': 1.7255332469940186} 11/06/2021 22:04:21 - INFO - __main__ - Step 5838: {'lr': 0.0004991712380017786, 'samples': 1120896, 'steps': 5837, 'loss/train': 2.3126747608184814} 11/06/2021 22:04:21 - INFO - __main__ - Step 5839: {'lr': 0.0004991708062001137, 'samples': 1121088, 'steps': 5838, 'loss/train': 1.5363658666610718} 11/06/2021 22:04:22 - INFO - __main__ - Step 5840: {'lr': 0.0004991703742861762, 'samples': 1121280, 'steps': 5839, 'loss/train': 2.014793872833252} 11/06/2021 22:04:22 - INFO - __main__ - Step 5841: {'lr': 0.0004991699422599664, 'samples': 1121472, 'steps': 5840, 'loss/train': 5.858007907867432} 11/06/2021 22:04:22 - INFO - __main__ - Step 5842: {'lr': 0.0004991695101214844, 'samples': 1121664, 'steps': 5841, 'loss/train': 6.085140228271484} 11/06/2021 22:04:23 - INFO - __main__ - Step 5843: {'lr': 0.0004991690778707305, 'samples': 1121856, 'steps': 5842, 'loss/train': 1.8172892332077026} 11/06/2021 22:04:24 - INFO - __main__ - Step 5844: {'lr': 0.0004991686455077049, 'samples': 1122048, 'steps': 5843, 'loss/train': 2.0172688961029053} 11/06/2021 22:04:24 - INFO - __main__ - Step 5845: {'lr': 0.0004991682130324078, 'samples': 1122240, 'steps': 5844, 'loss/train': 1.5611968040466309} 11/06/2021 22:04:24 - INFO - __main__ - Step 5846: {'lr': 0.0004991677804448392, 'samples': 1122432, 'steps': 5845, 'loss/train': 2.0921804904937744} 11/06/2021 22:04:25 - INFO - __main__ - Step 5847: {'lr': 0.0004991673477449995, 'samples': 1122624, 'steps': 5846, 'loss/train': 1.8288406133651733} 11/06/2021 22:04:25 - INFO - __main__ - Step 5848: {'lr': 0.0004991669149328889, 'samples': 1122816, 'steps': 5847, 'loss/train': 1.8865562677383423} 11/06/2021 22:04:26 - INFO - __main__ - Step 5849: {'lr': 0.0004991664820085074, 'samples': 1123008, 'steps': 5848, 'loss/train': 2.1795449256896973} 11/06/2021 22:04:27 - INFO - __main__ - Step 5850: {'lr': 0.0004991660489718554, 'samples': 1123200, 'steps': 5849, 'loss/train': 1.6964969635009766} 11/06/2021 22:04:27 - INFO - __main__ - Step 5851: {'lr': 0.0004991656158229331, 'samples': 1123392, 'steps': 5850, 'loss/train': 1.9025615453720093} 11/06/2021 22:04:27 - INFO - __main__ - Step 5852: {'lr': 0.0004991651825617406, 'samples': 1123584, 'steps': 5851, 'loss/train': 2.156768560409546} 11/06/2021 22:04:28 - INFO - __main__ - Step 5853: {'lr': 0.000499164749188278, 'samples': 1123776, 'steps': 5852, 'loss/train': 2.2558553218841553} 11/06/2021 22:04:29 - INFO - __main__ - Step 5854: {'lr': 0.0004991643157025458, 'samples': 1123968, 'steps': 5853, 'loss/train': 1.9887200593948364} 11/06/2021 22:04:29 - INFO - __main__ - Step 5855: {'lr': 0.0004991638821045439, 'samples': 1124160, 'steps': 5854, 'loss/train': 2.0848309993743896} 11/06/2021 22:04:29 - INFO - __main__ - Step 5856: {'lr': 0.0004991634483942725, 'samples': 1124352, 'steps': 5855, 'loss/train': 1.9906128644943237} 11/06/2021 22:04:30 - INFO - __main__ - Step 5857: {'lr': 0.000499163014571732, 'samples': 1124544, 'steps': 5856, 'loss/train': 1.9705578088760376} 11/06/2021 22:04:30 - INFO - __main__ - Step 5858: {'lr': 0.0004991625806369225, 'samples': 1124736, 'steps': 5857, 'loss/train': 1.9932202100753784} 11/06/2021 22:04:32 - INFO - __main__ - Step 5859: {'lr': 0.0004991621465898441, 'samples': 1124928, 'steps': 5858, 'loss/train': 1.7375214099884033} 11/06/2021 22:04:32 - INFO - __main__ - Step 5860: {'lr': 0.0004991617124304971, 'samples': 1125120, 'steps': 5859, 'loss/train': 1.9731206893920898} 11/06/2021 22:04:32 - INFO - __main__ - Step 5861: {'lr': 0.0004991612781588818, 'samples': 1125312, 'steps': 5860, 'loss/train': 0.45900076627731323} 11/06/2021 22:04:33 - INFO - __main__ - Step 5862: {'lr': 0.0004991608437749981, 'samples': 1125504, 'steps': 5861, 'loss/train': 2.0371387004852295} 11/06/2021 22:04:33 - INFO - __main__ - Step 5863: {'lr': 0.0004991604092788465, 'samples': 1125696, 'steps': 5862, 'loss/train': 3.404467821121216} 11/06/2021 22:04:33 - INFO - __main__ - Step 5864: {'lr': 0.000499159974670427, 'samples': 1125888, 'steps': 5863, 'loss/train': 1.9706652164459229} 11/06/2021 22:04:34 - INFO - __main__ - Step 5865: {'lr': 0.00049915953994974, 'samples': 1126080, 'steps': 5864, 'loss/train': 1.71634042263031} 11/06/2021 22:04:35 - INFO - __main__ - Step 5866: {'lr': 0.0004991591051167853, 'samples': 1126272, 'steps': 5865, 'loss/train': 2.338533401489258} 11/06/2021 22:04:35 - INFO - __main__ - Step 5867: {'lr': 0.0004991586701715635, 'samples': 1126464, 'steps': 5866, 'loss/train': 0.339321494102478} 11/06/2021 22:04:36 - INFO - __main__ - Step 5868: {'lr': 0.0004991582351140747, 'samples': 1126656, 'steps': 5867, 'loss/train': 1.8088363409042358} 11/06/2021 22:04:36 - INFO - __main__ - Step 5869: {'lr': 0.000499157799944319, 'samples': 1126848, 'steps': 5868, 'loss/train': 1.787323236465454} 11/06/2021 22:04:36 - INFO - __main__ - Step 5870: {'lr': 0.0004991573646622965, 'samples': 1127040, 'steps': 5869, 'loss/train': 1.9397883415222168} 11/06/2021 22:04:37 - INFO - __main__ - Step 5871: {'lr': 0.0004991569292680078, 'samples': 1127232, 'steps': 5870, 'loss/train': 0.43821990489959717} 11/06/2021 22:04:38 - INFO - __main__ - Step 5872: {'lr': 0.0004991564937614526, 'samples': 1127424, 'steps': 5871, 'loss/train': 2.1050267219543457} 11/06/2021 22:04:38 - INFO - __main__ - Step 5873: {'lr': 0.0004991560581426314, 'samples': 1127616, 'steps': 5872, 'loss/train': 2.0710291862487793} 11/06/2021 22:04:38 - INFO - __main__ - Step 5874: {'lr': 0.0004991556224115444, 'samples': 1127808, 'steps': 5873, 'loss/train': 2.220693349838257} 11/06/2021 22:04:39 - INFO - __main__ - Step 5875: {'lr': 0.0004991551865681916, 'samples': 1128000, 'steps': 5874, 'loss/train': 1.8416770696640015} 11/06/2021 22:04:40 - INFO - __main__ - Step 5876: {'lr': 0.0004991547506125734, 'samples': 1128192, 'steps': 5875, 'loss/train': 2.190176248550415} 11/06/2021 22:04:40 - INFO - __main__ - Step 5877: {'lr': 0.0004991543145446899, 'samples': 1128384, 'steps': 5876, 'loss/train': 1.174869418144226} 11/06/2021 22:04:40 - INFO - __main__ - Step 5878: {'lr': 0.0004991538783645413, 'samples': 1128576, 'steps': 5877, 'loss/train': 2.180773973464966} 11/06/2021 22:04:41 - INFO - __main__ - Step 5879: {'lr': 0.0004991534420721278, 'samples': 1128768, 'steps': 5878, 'loss/train': 1.775754451751709} 11/06/2021 22:04:41 - INFO - __main__ - Step 5880: {'lr': 0.0004991530056674496, 'samples': 1128960, 'steps': 5879, 'loss/train': 1.8799948692321777} 11/06/2021 22:04:42 - INFO - __main__ - Step 5881: {'lr': 0.000499152569150507, 'samples': 1129152, 'steps': 5880, 'loss/train': 2.05820894241333} 11/06/2021 22:04:43 - INFO - __main__ - Step 5882: {'lr': 0.0004991521325213, 'samples': 1129344, 'steps': 5881, 'loss/train': 1.8497810363769531} 11/06/2021 22:04:43 - INFO - __main__ - Step 5883: {'lr': 0.0004991516957798289, 'samples': 1129536, 'steps': 5882, 'loss/train': 2.388411521911621} 11/06/2021 22:04:43 - INFO - __main__ - Step 5884: {'lr': 0.0004991512589260939, 'samples': 1129728, 'steps': 5883, 'loss/train': 2.0570759773254395} 11/06/2021 22:04:44 - INFO - __main__ - Step 5885: {'lr': 0.0004991508219600952, 'samples': 1129920, 'steps': 5884, 'loss/train': 2.236051321029663} 11/06/2021 22:04:45 - INFO - __main__ - Step 5886: {'lr': 0.000499150384881833, 'samples': 1130112, 'steps': 5885, 'loss/train': 1.3894482851028442} 11/06/2021 22:04:45 - INFO - __main__ - Step 5887: {'lr': 0.0004991499476913074, 'samples': 1130304, 'steps': 5886, 'loss/train': 1.8141835927963257} 11/06/2021 22:04:45 - INFO - __main__ - Step 5888: {'lr': 0.0004991495103885187, 'samples': 1130496, 'steps': 5887, 'loss/train': 1.7120885848999023} 11/06/2021 22:04:46 - INFO - __main__ - Step 5889: {'lr': 0.0004991490729734672, 'samples': 1130688, 'steps': 5888, 'loss/train': 1.4462894201278687} 11/06/2021 22:04:46 - INFO - __main__ - Step 5890: {'lr': 0.0004991486354461528, 'samples': 1130880, 'steps': 5889, 'loss/train': 2.2380330562591553} 11/06/2021 22:04:46 - INFO - __main__ - Step 5891: {'lr': 0.000499148197806576, 'samples': 1131072, 'steps': 5890, 'loss/train': 1.384273648262024} 11/06/2021 22:04:47 - INFO - __main__ - Step 5892: {'lr': 0.0004991477600547367, 'samples': 1131264, 'steps': 5891, 'loss/train': 1.8132703304290771} 11/06/2021 22:04:48 - INFO - __main__ - Step 5893: {'lr': 0.0004991473221906354, 'samples': 1131456, 'steps': 5892, 'loss/train': 2.0042428970336914} 11/06/2021 22:04:48 - INFO - __main__ - Step 5894: {'lr': 0.0004991468842142722, 'samples': 1131648, 'steps': 5893, 'loss/train': 2.088541030883789} 11/06/2021 22:04:48 - INFO - __main__ - Step 5895: {'lr': 0.0004991464461256472, 'samples': 1131840, 'steps': 5894, 'loss/train': 1.8709157705307007} 11/06/2021 22:04:49 - INFO - __main__ - Step 5896: {'lr': 0.0004991460079247606, 'samples': 1132032, 'steps': 5895, 'loss/train': 1.9683117866516113} 11/06/2021 22:04:50 - INFO - __main__ - Step 5897: {'lr': 0.0004991455696116128, 'samples': 1132224, 'steps': 5896, 'loss/train': 1.7423728704452515} 11/06/2021 22:04:50 - INFO - __main__ - Step 5898: {'lr': 0.0004991451311862037, 'samples': 1132416, 'steps': 5897, 'loss/train': 1.5214271545410156} 11/06/2021 22:04:50 - INFO - __main__ - Step 5899: {'lr': 0.0004991446926485337, 'samples': 1132608, 'steps': 5898, 'loss/train': 1.8397377729415894} 11/06/2021 22:04:51 - INFO - __main__ - Step 5900: {'lr': 0.0004991442539986029, 'samples': 1132800, 'steps': 5899, 'loss/train': 2.119931697845459} 11/06/2021 22:04:51 - INFO - __main__ - Step 5901: {'lr': 0.0004991438152364117, 'samples': 1132992, 'steps': 5900, 'loss/train': 1.960817813873291} 11/06/2021 22:04:52 - INFO - __main__ - Step 5902: {'lr': 0.0004991433763619599, 'samples': 1133184, 'steps': 5901, 'loss/train': 1.7662793397903442} 11/06/2021 22:04:52 - INFO - __main__ - Step 5903: {'lr': 0.0004991429373752482, 'samples': 1133376, 'steps': 5902, 'loss/train': 1.5504059791564941} 11/06/2021 22:04:53 - INFO - __main__ - Step 5904: {'lr': 0.0004991424982762763, 'samples': 1133568, 'steps': 5903, 'loss/train': 2.349090576171875} 11/06/2021 22:04:53 - INFO - __main__ - Step 5905: {'lr': 0.0004991420590650448, 'samples': 1133760, 'steps': 5904, 'loss/train': 1.9303275346755981} 11/06/2021 22:04:54 - INFO - __main__ - Step 5906: {'lr': 0.0004991416197415537, 'samples': 1133952, 'steps': 5905, 'loss/train': 1.6415051221847534} 11/06/2021 22:04:55 - INFO - __main__ - Step 5907: {'lr': 0.0004991411803058032, 'samples': 1134144, 'steps': 5906, 'loss/train': 1.6599597930908203} 11/06/2021 22:04:55 - INFO - __main__ - Step 5908: {'lr': 0.0004991407407577936, 'samples': 1134336, 'steps': 5907, 'loss/train': 1.926432728767395} 11/06/2021 22:04:55 - INFO - __main__ - Step 5909: {'lr': 0.0004991403010975249, 'samples': 1134528, 'steps': 5908, 'loss/train': 1.6873294115066528} 11/06/2021 22:04:56 - INFO - __main__ - Step 5910: {'lr': 0.0004991398613249976, 'samples': 1134720, 'steps': 5909, 'loss/train': 2.0104522705078125} 11/06/2021 22:04:56 - INFO - __main__ - Step 5911: {'lr': 0.0004991394214402115, 'samples': 1134912, 'steps': 5910, 'loss/train': 1.2183507680892944} 11/06/2021 22:04:57 - INFO - __main__ - Step 5912: {'lr': 0.0004991389814431672, 'samples': 1135104, 'steps': 5911, 'loss/train': 2.336418390274048} 11/06/2021 22:04:57 - INFO - __main__ - Step 5913: {'lr': 0.0004991385413338646, 'samples': 1135296, 'steps': 5912, 'loss/train': 1.6936346292495728} 11/06/2021 22:04:58 - INFO - __main__ - Step 5914: {'lr': 0.0004991381011123041, 'samples': 1135488, 'steps': 5913, 'loss/train': 1.936176061630249} 11/06/2021 22:04:58 - INFO - __main__ - Step 5915: {'lr': 0.0004991376607784857, 'samples': 1135680, 'steps': 5914, 'loss/train': 2.222874402999878} 11/06/2021 22:04:58 - INFO - __main__ - Step 5916: {'lr': 0.0004991372203324098, 'samples': 1135872, 'steps': 5915, 'loss/train': 1.72451913356781} 11/06/2021 22:04:59 - INFO - __main__ - Step 5917: {'lr': 0.0004991367797740765, 'samples': 1136064, 'steps': 5916, 'loss/train': 2.277719736099243} 11/06/2021 22:05:00 - INFO - __main__ - Step 5918: {'lr': 0.0004991363391034861, 'samples': 1136256, 'steps': 5917, 'loss/train': 1.82767915725708} 11/06/2021 22:05:00 - INFO - __main__ - Step 5919: {'lr': 0.0004991358983206386, 'samples': 1136448, 'steps': 5918, 'loss/train': 1.7569984197616577} 11/06/2021 22:05:00 - INFO - __main__ - Step 5920: {'lr': 0.0004991354574255344, 'samples': 1136640, 'steps': 5919, 'loss/train': 1.7401177883148193} 11/06/2021 22:05:01 - INFO - __main__ - Step 5921: {'lr': 0.0004991350164181735, 'samples': 1136832, 'steps': 5920, 'loss/train': 1.826175332069397} 11/06/2021 22:05:02 - INFO - __main__ - Step 5922: {'lr': 0.0004991345752985563, 'samples': 1137024, 'steps': 5921, 'loss/train': 1.4562937021255493} 11/06/2021 22:05:02 - INFO - __main__ - Step 5923: {'lr': 0.0004991341340666828, 'samples': 1137216, 'steps': 5922, 'loss/train': 2.3827600479125977} 11/06/2021 22:05:03 - INFO - __main__ - Step 5924: {'lr': 0.0004991336927225534, 'samples': 1137408, 'steps': 5923, 'loss/train': 1.9076972007751465} 11/06/2021 22:05:03 - INFO - __main__ - Step 5925: {'lr': 0.0004991332512661682, 'samples': 1137600, 'steps': 5924, 'loss/train': 1.9979768991470337} 11/06/2021 22:05:03 - INFO - __main__ - Step 5926: {'lr': 0.0004991328096975273, 'samples': 1137792, 'steps': 5925, 'loss/train': 2.118201971054077} 11/06/2021 22:05:04 - INFO - __main__ - Step 5927: {'lr': 0.0004991323680166312, 'samples': 1137984, 'steps': 5926, 'loss/train': 2.161531925201416} 11/06/2021 22:05:05 - INFO - __main__ - Step 5928: {'lr': 0.0004991319262234797, 'samples': 1138176, 'steps': 5927, 'loss/train': 1.8950947523117065} 11/06/2021 22:05:05 - INFO - __main__ - Step 5929: {'lr': 0.0004991314843180733, 'samples': 1138368, 'steps': 5928, 'loss/train': 1.211108684539795} 11/06/2021 22:05:05 - INFO - __main__ - Step 5930: {'lr': 0.0004991310423004121, 'samples': 1138560, 'steps': 5929, 'loss/train': 2.195322275161743} 11/06/2021 22:05:06 - INFO - __main__ - Step 5931: {'lr': 0.0004991306001704962, 'samples': 1138752, 'steps': 5930, 'loss/train': 2.2083332538604736} 11/06/2021 22:05:06 - INFO - __main__ - Step 5932: {'lr': 0.000499130157928326, 'samples': 1138944, 'steps': 5931, 'loss/train': 1.8265712261199951} 11/06/2021 22:05:07 - INFO - __main__ - Step 5933: {'lr': 0.0004991297155739015, 'samples': 1139136, 'steps': 5932, 'loss/train': 2.3104774951934814} 11/06/2021 22:05:08 - INFO - __main__ - Step 5934: {'lr': 0.0004991292731072231, 'samples': 1139328, 'steps': 5933, 'loss/train': 1.6424787044525146} 11/06/2021 22:05:08 - INFO - __main__ - Step 5935: {'lr': 0.0004991288305282908, 'samples': 1139520, 'steps': 5934, 'loss/train': 1.7371667623519897} 11/06/2021 22:05:08 - INFO - __main__ - Step 5936: {'lr': 0.0004991283878371049, 'samples': 1139712, 'steps': 5935, 'loss/train': 2.0819458961486816} 11/06/2021 22:05:09 - INFO - __main__ - Step 5937: {'lr': 0.0004991279450336656, 'samples': 1139904, 'steps': 5936, 'loss/train': 2.049226999282837} 11/06/2021 22:05:09 - INFO - __main__ - Step 5938: {'lr': 0.0004991275021179732, 'samples': 1140096, 'steps': 5937, 'loss/train': 1.6424188613891602} 11/06/2021 22:05:10 - INFO - __main__ - Step 5939: {'lr': 0.0004991270590900277, 'samples': 1140288, 'steps': 5938, 'loss/train': 2.0688111782073975} 11/06/2021 22:05:11 - INFO - __main__ - Step 5940: {'lr': 0.0004991266159498294, 'samples': 1140480, 'steps': 5939, 'loss/train': 1.3544045686721802} 11/06/2021 22:05:11 - INFO - __main__ - Step 5941: {'lr': 0.0004991261726973784, 'samples': 1140672, 'steps': 5940, 'loss/train': 2.6650118827819824} 11/06/2021 22:05:11 - INFO - __main__ - Step 5942: {'lr': 0.0004991257293326752, 'samples': 1140864, 'steps': 5941, 'loss/train': 1.340391993522644} 11/06/2021 22:05:12 - INFO - __main__ - Step 5943: {'lr': 0.0004991252858557196, 'samples': 1141056, 'steps': 5942, 'loss/train': 1.737450122833252} 11/06/2021 22:05:12 - INFO - __main__ - Step 5944: {'lr': 0.0004991248422665122, 'samples': 1141248, 'steps': 5943, 'loss/train': 2.2285938262939453} 11/06/2021 22:05:13 - INFO - __main__ - Step 5945: {'lr': 0.0004991243985650528, 'samples': 1141440, 'steps': 5944, 'loss/train': 2.2643086910247803} 11/06/2021 22:05:13 - INFO - __main__ - Step 5946: {'lr': 0.0004991239547513419, 'samples': 1141632, 'steps': 5945, 'loss/train': 1.9071139097213745} 11/06/2021 22:05:14 - INFO - __main__ - Step 5947: {'lr': 0.0004991235108253795, 'samples': 1141824, 'steps': 5946, 'loss/train': 1.679957628250122} 11/06/2021 22:05:14 - INFO - __main__ - Step 5948: {'lr': 0.0004991230667871659, 'samples': 1142016, 'steps': 5947, 'loss/train': 0.8383622765541077} 11/06/2021 22:05:16 - INFO - __main__ - Step 5949: {'lr': 0.0004991226226367013, 'samples': 1142208, 'steps': 5948, 'loss/train': 1.7677212953567505} 11/06/2021 22:05:16 - INFO - __main__ - Step 5950: {'lr': 0.0004991221783739859, 'samples': 1142400, 'steps': 5949, 'loss/train': 2.1849448680877686} 11/06/2021 22:05:16 - INFO - __main__ - Step 5951: {'lr': 0.0004991217339990199, 'samples': 1142592, 'steps': 5950, 'loss/train': 1.8535935878753662} 11/06/2021 22:05:17 - INFO - __main__ - Step 5952: {'lr': 0.0004991212895118035, 'samples': 1142784, 'steps': 5951, 'loss/train': 2.292984962463379} 11/06/2021 22:05:17 - INFO - __main__ - Step 5953: {'lr': 0.0004991208449123369, 'samples': 1142976, 'steps': 5952, 'loss/train': 2.318779468536377} 11/06/2021 22:05:17 - INFO - __main__ - Step 5954: {'lr': 0.0004991204002006203, 'samples': 1143168, 'steps': 5953, 'loss/train': 2.651663064956665} 11/06/2021 22:05:18 - INFO - __main__ - Step 5955: {'lr': 0.0004991199553766538, 'samples': 1143360, 'steps': 5954, 'loss/train': 2.468550205230713} 11/06/2021 22:05:19 - INFO - __main__ - Step 5956: {'lr': 0.0004991195104404378, 'samples': 1143552, 'steps': 5955, 'loss/train': 1.74437415599823} 11/06/2021 22:05:19 - INFO - __main__ - Step 5957: {'lr': 0.0004991190653919723, 'samples': 1143744, 'steps': 5956, 'loss/train': 1.5060465335845947} 11/06/2021 22:05:19 - INFO - __main__ - Step 5958: {'lr': 0.0004991186202312576, 'samples': 1143936, 'steps': 5957, 'loss/train': 2.46972393989563} 11/06/2021 22:05:20 - INFO - __main__ - Step 5959: {'lr': 0.0004991181749582941, 'samples': 1144128, 'steps': 5958, 'loss/train': 1.874528408050537} 11/06/2021 22:05:20 - INFO - __main__ - Step 5960: {'lr': 0.0004991177295730815, 'samples': 1144320, 'steps': 5959, 'loss/train': 1.015513300895691} 11/06/2021 22:05:21 - INFO - __main__ - Step 5961: {'lr': 0.0004991172840756204, 'samples': 1144512, 'steps': 5960, 'loss/train': 1.6141202449798584} 11/06/2021 22:05:22 - INFO - __main__ - Step 5962: {'lr': 0.000499116838465911, 'samples': 1144704, 'steps': 5961, 'loss/train': 2.2520675659179688} 11/06/2021 22:05:22 - INFO - __main__ - Step 5963: {'lr': 0.0004991163927439533, 'samples': 1144896, 'steps': 5962, 'loss/train': 2.061602830886841} 11/06/2021 22:05:22 - INFO - __main__ - Step 5964: {'lr': 0.0004991159469097476, 'samples': 1145088, 'steps': 5963, 'loss/train': 1.7788499593734741} 11/06/2021 22:05:23 - INFO - __main__ - Step 5965: {'lr': 0.0004991155009632941, 'samples': 1145280, 'steps': 5964, 'loss/train': 2.016207456588745} 11/06/2021 22:05:23 - INFO - __main__ - Step 5966: {'lr': 0.0004991150549045931, 'samples': 1145472, 'steps': 5965, 'loss/train': 2.390394926071167} 11/06/2021 22:05:24 - INFO - __main__ - Step 5967: {'lr': 0.0004991146087336446, 'samples': 1145664, 'steps': 5966, 'loss/train': 2.1438944339752197} 11/06/2021 22:05:24 - INFO - __main__ - Step 5968: {'lr': 0.0004991141624504489, 'samples': 1145856, 'steps': 5967, 'loss/train': 2.0830070972442627} 11/06/2021 22:05:25 - INFO - __main__ - Step 5969: {'lr': 0.0004991137160550062, 'samples': 1146048, 'steps': 5968, 'loss/train': 1.9556607007980347} 11/06/2021 22:05:25 - INFO - __main__ - Step 5970: {'lr': 0.0004991132695473167, 'samples': 1146240, 'steps': 5969, 'loss/train': 1.7997961044311523} 11/06/2021 22:05:25 - INFO - __main__ - Step 5971: {'lr': 0.0004991128229273807, 'samples': 1146432, 'steps': 5970, 'loss/train': 2.2100391387939453} 11/06/2021 22:05:26 - INFO - __main__ - Step 5972: {'lr': 0.0004991123761951982, 'samples': 1146624, 'steps': 5971, 'loss/train': 2.2720751762390137} 11/06/2021 22:05:27 - INFO - __main__ - Step 5973: {'lr': 0.0004991119293507695, 'samples': 1146816, 'steps': 5972, 'loss/train': 2.5560436248779297} 11/06/2021 22:05:27 - INFO - __main__ - Step 5974: {'lr': 0.0004991114823940948, 'samples': 1147008, 'steps': 5973, 'loss/train': 1.9444048404693604} 11/06/2021 22:05:27 - INFO - __main__ - Step 5975: {'lr': 0.0004991110353251744, 'samples': 1147200, 'steps': 5974, 'loss/train': 1.894894003868103} 11/06/2021 22:05:28 - INFO - __main__ - Step 5976: {'lr': 0.0004991105881440084, 'samples': 1147392, 'steps': 5975, 'loss/train': 1.42281174659729} 11/06/2021 22:05:29 - INFO - __main__ - Step 5977: {'lr': 0.000499110140850597, 'samples': 1147584, 'steps': 5976, 'loss/train': 1.4847767353057861} 11/06/2021 22:05:29 - INFO - __main__ - Step 5978: {'lr': 0.0004991096934449404, 'samples': 1147776, 'steps': 5977, 'loss/train': 1.5438770055770874} 11/06/2021 22:05:30 - INFO - __main__ - Step 5979: {'lr': 0.0004991092459270388, 'samples': 1147968, 'steps': 5978, 'loss/train': 1.8954336643218994} 11/06/2021 22:05:30 - INFO - __main__ - Step 5980: {'lr': 0.0004991087982968924, 'samples': 1148160, 'steps': 5979, 'loss/train': 1.413256287574768} 11/06/2021 22:05:30 - INFO - __main__ - Step 5981: {'lr': 0.0004991083505545014, 'samples': 1148352, 'steps': 5980, 'loss/train': 2.335400104522705} 11/06/2021 22:05:31 - INFO - __main__ - Step 5982: {'lr': 0.0004991079026998662, 'samples': 1148544, 'steps': 5981, 'loss/train': 1.8564774990081787} 11/06/2021 22:05:32 - INFO - __main__ - Step 5983: {'lr': 0.0004991074547329867, 'samples': 1148736, 'steps': 5982, 'loss/train': 2.0363080501556396} 11/06/2021 22:05:32 - INFO - __main__ - Step 5984: {'lr': 0.0004991070066538632, 'samples': 1148928, 'steps': 5983, 'loss/train': 1.7715718746185303} 11/06/2021 22:05:32 - INFO - __main__ - Step 5985: {'lr': 0.0004991065584624959, 'samples': 1149120, 'steps': 5984, 'loss/train': 2.119075059890747} 11/06/2021 22:05:33 - INFO - __main__ - Step 5986: {'lr': 0.0004991061101588851, 'samples': 1149312, 'steps': 5985, 'loss/train': 1.6880245208740234} 11/06/2021 22:05:33 - INFO - __main__ - Step 5987: {'lr': 0.0004991056617430308, 'samples': 1149504, 'steps': 5986, 'loss/train': 1.5863991975784302} 11/06/2021 22:05:34 - INFO - __main__ - Step 5988: {'lr': 0.0004991052132149336, 'samples': 1149696, 'steps': 5987, 'loss/train': 1.918567419052124} 11/06/2021 22:05:35 - INFO - __main__ - Step 5989: {'lr': 0.0004991047645745932, 'samples': 1149888, 'steps': 5988, 'loss/train': 2.1846141815185547} 11/06/2021 22:05:35 - INFO - __main__ - Step 5990: {'lr': 0.0004991043158220101, 'samples': 1150080, 'steps': 5989, 'loss/train': 0.287127822637558} 11/06/2021 22:05:35 - INFO - __main__ - Step 5991: {'lr': 0.0004991038669571844, 'samples': 1150272, 'steps': 5990, 'loss/train': 2.2613940238952637} 11/06/2021 22:05:36 - INFO - __main__ - Step 5992: {'lr': 0.0004991034179801165, 'samples': 1150464, 'steps': 5991, 'loss/train': 2.2237813472747803} 11/06/2021 22:05:37 - INFO - __main__ - Step 5993: {'lr': 0.0004991029688908063, 'samples': 1150656, 'steps': 5992, 'loss/train': 1.6125682592391968} 11/06/2021 22:05:37 - INFO - __main__ - Step 5994: {'lr': 0.0004991025196892542, 'samples': 1150848, 'steps': 5993, 'loss/train': 2.0412638187408447} 11/06/2021 22:05:37 - INFO - __main__ - Step 5995: {'lr': 0.0004991020703754603, 'samples': 1151040, 'steps': 5994, 'loss/train': 2.479659080505371} 11/06/2021 22:05:38 - INFO - __main__ - Step 5996: {'lr': 0.0004991016209494249, 'samples': 1151232, 'steps': 5995, 'loss/train': 1.6935555934906006} 11/06/2021 22:05:38 - INFO - __main__ - Step 5997: {'lr': 0.000499101171411148, 'samples': 1151424, 'steps': 5996, 'loss/train': 1.8328980207443237} 11/06/2021 22:05:39 - INFO - __main__ - Step 5998: {'lr': 0.0004991007217606303, 'samples': 1151616, 'steps': 5997, 'loss/train': 1.736120343208313} 11/06/2021 22:05:39 - INFO - __main__ - Step 5999: {'lr': 0.0004991002719978713, 'samples': 1151808, 'steps': 5998, 'loss/train': 1.7928426265716553} 11/06/2021 22:05:40 - INFO - __main__ - Step 6000: {'lr': 0.0004990998221228718, 'samples': 1152000, 'steps': 5999, 'loss/train': 2.1965417861938477} 11/06/2021 22:05:40 - INFO - __main__ - Step 6001: {'lr': 0.0004990993721356316, 'samples': 1152192, 'steps': 6000, 'loss/train': 1.9689626693725586} 11/06/2021 22:05:41 - INFO - __main__ - Step 6002: {'lr': 0.0004990989220361511, 'samples': 1152384, 'steps': 6001, 'loss/train': 2.398984670639038} 11/06/2021 22:05:42 - INFO - __main__ - Step 6003: {'lr': 0.0004990984718244306, 'samples': 1152576, 'steps': 6002, 'loss/train': 1.8467917442321777} 11/06/2021 22:05:42 - INFO - __main__ - Step 6004: {'lr': 0.00049909802150047, 'samples': 1152768, 'steps': 6003, 'loss/train': 1.9117133617401123} 11/06/2021 22:05:42 - INFO - __main__ - Step 6005: {'lr': 0.0004990975710642699, 'samples': 1152960, 'steps': 6004, 'loss/train': 2.0805423259735107} 11/06/2021 22:05:43 - INFO - __main__ - Step 6006: {'lr': 0.0004990971205158301, 'samples': 1153152, 'steps': 6005, 'loss/train': 1.9245171546936035} 11/06/2021 22:05:43 - INFO - __main__ - Step 6007: {'lr': 0.000499096669855151, 'samples': 1153344, 'steps': 6006, 'loss/train': 1.9373234510421753} 11/06/2021 22:05:44 - INFO - __main__ - Step 6008: {'lr': 0.0004990962190822328, 'samples': 1153536, 'steps': 6007, 'loss/train': 2.0879454612731934} 11/06/2021 22:05:44 - INFO - __main__ - Step 6009: {'lr': 0.0004990957681970757, 'samples': 1153728, 'steps': 6008, 'loss/train': 1.5155565738677979} 11/06/2021 22:05:45 - INFO - __main__ - Step 6010: {'lr': 0.0004990953171996798, 'samples': 1153920, 'steps': 6009, 'loss/train': 1.890929937362671} 11/06/2021 22:05:45 - INFO - __main__ - Step 6011: {'lr': 0.0004990948660900455, 'samples': 1154112, 'steps': 6010, 'loss/train': 1.6468206644058228} 11/06/2021 22:05:45 - INFO - __main__ - Step 6012: {'lr': 0.0004990944148681729, 'samples': 1154304, 'steps': 6011, 'loss/train': 1.8808773756027222} 11/06/2021 22:05:46 - INFO - __main__ - Step 6013: {'lr': 0.0004990939635340621, 'samples': 1154496, 'steps': 6012, 'loss/train': 1.6922487020492554} 11/06/2021 22:05:47 - INFO - __main__ - Step 6014: {'lr': 0.0004990935120877136, 'samples': 1154688, 'steps': 6013, 'loss/train': 1.5045371055603027} 11/06/2021 22:05:47 - INFO - __main__ - Step 6015: {'lr': 0.0004990930605291272, 'samples': 1154880, 'steps': 6014, 'loss/train': 1.9965391159057617} 11/06/2021 22:05:47 - INFO - __main__ - Step 6016: {'lr': 0.0004990926088583034, 'samples': 1155072, 'steps': 6015, 'loss/train': 1.9830745458602905} 11/06/2021 22:05:48 - INFO - __main__ - Step 6017: {'lr': 0.0004990921570752424, 'samples': 1155264, 'steps': 6016, 'loss/train': 1.850488305091858} 11/06/2021 22:05:49 - INFO - __main__ - Step 6018: {'lr': 0.0004990917051799442, 'samples': 1155456, 'steps': 6017, 'loss/train': 2.0463662147521973} 11/06/2021 22:05:49 - INFO - __main__ - Step 6019: {'lr': 0.0004990912531724092, 'samples': 1155648, 'steps': 6018, 'loss/train': 1.561757206916809} 11/06/2021 22:05:50 - INFO - __main__ - Step 6020: {'lr': 0.0004990908010526374, 'samples': 1155840, 'steps': 6019, 'loss/train': 2.3812918663024902} 11/06/2021 22:05:50 - INFO - __main__ - Step 6021: {'lr': 0.0004990903488206292, 'samples': 1156032, 'steps': 6020, 'loss/train': 1.9738842248916626} 11/06/2021 22:05:50 - INFO - __main__ - Step 6022: {'lr': 0.0004990898964763847, 'samples': 1156224, 'steps': 6021, 'loss/train': 1.0704938173294067} 11/06/2021 22:05:51 - INFO - __main__ - Step 6023: {'lr': 0.0004990894440199042, 'samples': 1156416, 'steps': 6022, 'loss/train': 2.2078778743743896} 11/06/2021 22:05:52 - INFO - __main__ - Step 6024: {'lr': 0.0004990889914511878, 'samples': 1156608, 'steps': 6023, 'loss/train': 1.3262114524841309} 11/06/2021 22:05:52 - INFO - __main__ - Step 6025: {'lr': 0.0004990885387702357, 'samples': 1156800, 'steps': 6024, 'loss/train': 2.6244781017303467} 11/06/2021 22:05:52 - INFO - __main__ - Step 6026: {'lr': 0.0004990880859770483, 'samples': 1156992, 'steps': 6025, 'loss/train': 2.0771734714508057} 11/06/2021 22:05:53 - INFO - __main__ - Step 6027: {'lr': 0.0004990876330716256, 'samples': 1157184, 'steps': 6026, 'loss/train': 1.7819007635116577} 11/06/2021 22:05:53 - INFO - __main__ - Step 6028: {'lr': 0.0004990871800539677, 'samples': 1157376, 'steps': 6027, 'loss/train': 2.1311354637145996} 11/06/2021 22:05:54 - INFO - __main__ - Step 6029: {'lr': 0.0004990867269240751, 'samples': 1157568, 'steps': 6028, 'loss/train': 2.2152881622314453} 11/06/2021 22:05:55 - INFO - __main__ - Step 6030: {'lr': 0.0004990862736819478, 'samples': 1157760, 'steps': 6029, 'loss/train': 2.027174949645996} 11/06/2021 22:05:55 - INFO - __main__ - Step 6031: {'lr': 0.000499085820327586, 'samples': 1157952, 'steps': 6030, 'loss/train': 1.750157356262207} 11/06/2021 22:05:55 - INFO - __main__ - Step 6032: {'lr': 0.0004990853668609902, 'samples': 1158144, 'steps': 6031, 'loss/train': 1.8239428997039795} 11/06/2021 22:05:56 - INFO - __main__ - Step 6033: {'lr': 0.0004990849132821602, 'samples': 1158336, 'steps': 6032, 'loss/train': 1.92527437210083} 11/06/2021 22:05:57 - INFO - __main__ - Step 6034: {'lr': 0.0004990844595910965, 'samples': 1158528, 'steps': 6033, 'loss/train': 2.1087069511413574} 11/06/2021 22:05:57 - INFO - __main__ - Step 6035: {'lr': 0.0004990840057877991, 'samples': 1158720, 'steps': 6034, 'loss/train': 1.7086195945739746} 11/06/2021 22:05:57 - INFO - __main__ - Step 6036: {'lr': 0.0004990835518722683, 'samples': 1158912, 'steps': 6035, 'loss/train': 1.736742615699768} 11/06/2021 22:05:58 - INFO - __main__ - Step 6037: {'lr': 0.0004990830978445043, 'samples': 1159104, 'steps': 6036, 'loss/train': 2.061073064804077} 11/06/2021 22:05:58 - INFO - __main__ - Step 6038: {'lr': 0.0004990826437045073, 'samples': 1159296, 'steps': 6037, 'loss/train': 1.51176917552948} 11/06/2021 22:05:59 - INFO - __main__ - Step 6039: {'lr': 0.0004990821894522775, 'samples': 1159488, 'steps': 6038, 'loss/train': 2.080949306488037} 11/06/2021 22:05:59 - INFO - __main__ - Step 6040: {'lr': 0.0004990817350878152, 'samples': 1159680, 'steps': 6039, 'loss/train': 1.2319962978363037} 11/06/2021 22:06:00 - INFO - __main__ - Step 6041: {'lr': 0.0004990812806111205, 'samples': 1159872, 'steps': 6040, 'loss/train': 2.2004785537719727} 11/06/2021 22:06:00 - INFO - __main__ - Step 6042: {'lr': 0.0004990808260221934, 'samples': 1160064, 'steps': 6041, 'loss/train': 2.198143243789673} 11/06/2021 22:06:00 - INFO - __main__ - Step 6043: {'lr': 0.0004990803713210345, 'samples': 1160256, 'steps': 6042, 'loss/train': 2.189502716064453} 11/06/2021 22:06:01 - INFO - __main__ - Step 6044: {'lr': 0.0004990799165076438, 'samples': 1160448, 'steps': 6043, 'loss/train': 0.32099586725234985} 11/06/2021 22:06:02 - INFO - __main__ - Step 6045: {'lr': 0.0004990794615820216, 'samples': 1160640, 'steps': 6044, 'loss/train': 2.055501937866211} 11/06/2021 22:06:02 - INFO - __main__ - Step 6046: {'lr': 0.0004990790065441679, 'samples': 1160832, 'steps': 6045, 'loss/train': 2.4283854961395264} 11/06/2021 22:06:03 - INFO - __main__ - Step 6047: {'lr': 0.0004990785513940832, 'samples': 1161024, 'steps': 6046, 'loss/train': 1.5293772220611572} 11/06/2021 22:06:03 - INFO - __main__ - Step 6048: {'lr': 0.0004990780961317674, 'samples': 1161216, 'steps': 6047, 'loss/train': 1.82656729221344} 11/06/2021 22:06:04 - INFO - __main__ - Step 6049: {'lr': 0.0004990776407572209, 'samples': 1161408, 'steps': 6048, 'loss/train': 2.004995107650757} 11/06/2021 22:06:04 - INFO - __main__ - Step 6050: {'lr': 0.000499077185270444, 'samples': 1161600, 'steps': 6049, 'loss/train': 1.4372830390930176} 11/06/2021 22:06:05 - INFO - __main__ - Step 6051: {'lr': 0.0004990767296714365, 'samples': 1161792, 'steps': 6050, 'loss/train': 1.7014871835708618} 11/06/2021 22:06:05 - INFO - __main__ - Step 6052: {'lr': 0.000499076273960199, 'samples': 1161984, 'steps': 6051, 'loss/train': 2.0586600303649902} 11/06/2021 22:06:05 - INFO - __main__ - Step 6053: {'lr': 0.0004990758181367316, 'samples': 1162176, 'steps': 6052, 'loss/train': 1.914888858795166} 11/06/2021 22:06:06 - INFO - __main__ - Step 6054: {'lr': 0.0004990753622010345, 'samples': 1162368, 'steps': 6053, 'loss/train': 2.2003793716430664} 11/06/2021 22:06:06 - INFO - __main__ - Step 6055: {'lr': 0.0004990749061531079, 'samples': 1162560, 'steps': 6054, 'loss/train': 2.1446890830993652} 11/06/2021 22:06:07 - INFO - __main__ - Step 6056: {'lr': 0.0004990744499929519, 'samples': 1162752, 'steps': 6055, 'loss/train': 1.890893578529358} 11/06/2021 22:06:07 - INFO - __main__ - Step 6057: {'lr': 0.0004990739937205668, 'samples': 1162944, 'steps': 6056, 'loss/train': 1.2735779285430908} 11/06/2021 22:06:08 - INFO - __main__ - Step 6058: {'lr': 0.0004990735373359529, 'samples': 1163136, 'steps': 6057, 'loss/train': 1.9701690673828125} 11/06/2021 22:06:08 - INFO - __main__ - Step 6059: {'lr': 0.0004990730808391102, 'samples': 1163328, 'steps': 6058, 'loss/train': 2.143030881881714} 11/06/2021 22:06:09 - INFO - __main__ - Step 6060: {'lr': 0.0004990726242300391, 'samples': 1163520, 'steps': 6059, 'loss/train': 2.049363613128662} 11/06/2021 22:06:09 - INFO - __main__ - Step 6061: {'lr': 0.0004990721675087397, 'samples': 1163712, 'steps': 6060, 'loss/train': 2.1596553325653076} 11/06/2021 22:06:10 - INFO - __main__ - Step 6062: {'lr': 0.0004990717106752122, 'samples': 1163904, 'steps': 6061, 'loss/train': 2.0180513858795166} 11/06/2021 22:06:10 - INFO - __main__ - Step 6063: {'lr': 0.0004990712537294568, 'samples': 1164096, 'steps': 6062, 'loss/train': 2.0434930324554443} 11/06/2021 22:06:10 - INFO - __main__ - Step 6064: {'lr': 0.0004990707966714738, 'samples': 1164288, 'steps': 6063, 'loss/train': 1.9746204614639282} 11/06/2021 22:06:11 - INFO - __main__ - Step 6065: {'lr': 0.0004990703395012634, 'samples': 1164480, 'steps': 6064, 'loss/train': 2.094088554382324} 11/06/2021 22:06:12 - INFO - __main__ - Step 6066: {'lr': 0.0004990698822188255, 'samples': 1164672, 'steps': 6065, 'loss/train': 1.3795504570007324} 11/06/2021 22:06:12 - INFO - __main__ - Step 6067: {'lr': 0.0004990694248241608, 'samples': 1164864, 'steps': 6066, 'loss/train': 2.1936051845550537} 11/06/2021 22:06:12 - INFO - __main__ - Step 6068: {'lr': 0.0004990689673172691, 'samples': 1165056, 'steps': 6067, 'loss/train': 2.1371805667877197} 11/06/2021 22:06:13 - INFO - __main__ - Step 6069: {'lr': 0.000499068509698151, 'samples': 1165248, 'steps': 6068, 'loss/train': 1.7530931234359741} 11/06/2021 22:06:14 - INFO - __main__ - Step 6070: {'lr': 0.0004990680519668063, 'samples': 1165440, 'steps': 6069, 'loss/train': 2.192474842071533} 11/06/2021 22:06:14 - INFO - __main__ - Step 6071: {'lr': 0.0004990675941232354, 'samples': 1165632, 'steps': 6070, 'loss/train': 1.666622281074524} 11/06/2021 22:06:15 - INFO - __main__ - Step 6072: {'lr': 0.0004990671361674384, 'samples': 1165824, 'steps': 6071, 'loss/train': 1.9727286100387573} 11/06/2021 22:06:15 - INFO - __main__ - Step 6073: {'lr': 0.0004990666780994156, 'samples': 1166016, 'steps': 6072, 'loss/train': 2.116274356842041} 11/06/2021 22:06:15 - INFO - __main__ - Step 6074: {'lr': 0.0004990662199191673, 'samples': 1166208, 'steps': 6073, 'loss/train': 2.271106243133545} 11/06/2021 22:06:16 - INFO - __main__ - Step 6075: {'lr': 0.0004990657616266936, 'samples': 1166400, 'steps': 6074, 'loss/train': 1.8674793243408203} 11/06/2021 22:06:17 - INFO - __main__ - Step 6076: {'lr': 0.0004990653032219947, 'samples': 1166592, 'steps': 6075, 'loss/train': 1.4734982252120972} 11/06/2021 22:06:17 - INFO - __main__ - Step 6077: {'lr': 0.0004990648447050709, 'samples': 1166784, 'steps': 6076, 'loss/train': 1.8218048810958862} 11/06/2021 22:06:17 - INFO - __main__ - Step 6078: {'lr': 0.0004990643860759222, 'samples': 1166976, 'steps': 6077, 'loss/train': 1.642020583152771} 11/06/2021 22:06:18 - INFO - __main__ - Step 6079: {'lr': 0.0004990639273345489, 'samples': 1167168, 'steps': 6078, 'loss/train': 1.3877575397491455} 11/06/2021 22:06:18 - INFO - __main__ - Step 6080: {'lr': 0.0004990634684809513, 'samples': 1167360, 'steps': 6079, 'loss/train': 1.8536438941955566} 11/06/2021 22:06:19 - INFO - __main__ - Step 6081: {'lr': 0.0004990630095151296, 'samples': 1167552, 'steps': 6080, 'loss/train': 2.0661253929138184} 11/06/2021 22:06:19 - INFO - __main__ - Step 6082: {'lr': 0.0004990625504370838, 'samples': 1167744, 'steps': 6081, 'loss/train': 1.9790138006210327} 11/06/2021 22:06:20 - INFO - __main__ - Step 6083: {'lr': 0.0004990620912468143, 'samples': 1167936, 'steps': 6082, 'loss/train': 1.5834496021270752} 11/06/2021 22:06:20 - INFO - __main__ - Step 6084: {'lr': 0.0004990616319443214, 'samples': 1168128, 'steps': 6083, 'loss/train': 1.8926007747650146} 11/06/2021 22:06:20 - INFO - __main__ - Step 6085: {'lr': 0.0004990611725296052, 'samples': 1168320, 'steps': 6084, 'loss/train': 1.9960976839065552} 11/06/2021 22:06:21 - INFO - __main__ - Step 6086: {'lr': 0.0004990607130026657, 'samples': 1168512, 'steps': 6085, 'loss/train': 2.563918113708496} 11/06/2021 22:06:22 - INFO - __main__ - Step 6087: {'lr': 0.0004990602533635033, 'samples': 1168704, 'steps': 6086, 'loss/train': 1.8031913042068481} 11/06/2021 22:06:22 - INFO - __main__ - Step 6088: {'lr': 0.0004990597936121182, 'samples': 1168896, 'steps': 6087, 'loss/train': 1.5920250415802002} 11/06/2021 22:06:22 - INFO - __main__ - Step 6089: {'lr': 0.0004990593337485108, 'samples': 1169088, 'steps': 6088, 'loss/train': 2.1160354614257812} 11/06/2021 22:06:23 - INFO - __main__ - Step 6090: {'lr': 0.0004990588737726809, 'samples': 1169280, 'steps': 6089, 'loss/train': 2.0593221187591553} 11/06/2021 22:06:24 - INFO - __main__ - Step 6091: {'lr': 0.0004990584136846289, 'samples': 1169472, 'steps': 6090, 'loss/train': 2.7556521892547607} 11/06/2021 22:06:24 - INFO - __main__ - Step 6092: {'lr': 0.0004990579534843551, 'samples': 1169664, 'steps': 6091, 'loss/train': 2.4434940814971924} 11/06/2021 22:06:24 - INFO - __main__ - Step 6093: {'lr': 0.0004990574931718597, 'samples': 1169856, 'steps': 6092, 'loss/train': 2.342974901199341} 11/06/2021 22:06:25 - INFO - __main__ - Step 6094: {'lr': 0.0004990570327471427, 'samples': 1170048, 'steps': 6093, 'loss/train': 1.9696969985961914} 11/06/2021 22:06:25 - INFO - __main__ - Step 6095: {'lr': 0.0004990565722102045, 'samples': 1170240, 'steps': 6094, 'loss/train': 1.6603606939315796} 11/06/2021 22:06:26 - INFO - __main__ - Step 6096: {'lr': 0.0004990561115610452, 'samples': 1170432, 'steps': 6095, 'loss/train': 1.3589533567428589} 11/06/2021 22:06:26 - INFO - __main__ - Step 6097: {'lr': 0.0004990556507996652, 'samples': 1170624, 'steps': 6096, 'loss/train': 1.9999220371246338} 11/06/2021 22:06:27 - INFO - __main__ - Step 6098: {'lr': 0.0004990551899260644, 'samples': 1170816, 'steps': 6097, 'loss/train': 1.987586498260498} 11/06/2021 22:06:27 - INFO - __main__ - Step 6099: {'lr': 0.0004990547289402433, 'samples': 1171008, 'steps': 6098, 'loss/train': 1.9792393445968628} 11/06/2021 22:06:27 - INFO - __main__ - Step 6100: {'lr': 0.0004990542678422019, 'samples': 1171200, 'steps': 6099, 'loss/train': 1.7970731258392334} 11/06/2021 22:06:29 - INFO - __main__ - Step 6101: {'lr': 0.0004990538066319406, 'samples': 1171392, 'steps': 6100, 'loss/train': 1.9069502353668213} 11/06/2021 22:06:29 - INFO - __main__ - Step 6102: {'lr': 0.0004990533453094594, 'samples': 1171584, 'steps': 6101, 'loss/train': 1.354381799697876} 11/06/2021 22:06:29 - INFO - __main__ - Step 6103: {'lr': 0.0004990528838747586, 'samples': 1171776, 'steps': 6102, 'loss/train': 1.948202133178711} 11/06/2021 22:06:30 - INFO - __main__ - Step 6104: {'lr': 0.0004990524223278384, 'samples': 1171968, 'steps': 6103, 'loss/train': 1.6591429710388184} 11/06/2021 22:06:30 - INFO - __main__ - Step 6105: {'lr': 0.0004990519606686991, 'samples': 1172160, 'steps': 6104, 'loss/train': 2.1720986366271973} 11/06/2021 22:06:31 - INFO - __main__ - Step 6106: {'lr': 0.0004990514988973408, 'samples': 1172352, 'steps': 6105, 'loss/train': 2.1691129207611084} 11/06/2021 22:06:31 - INFO - __main__ - Step 6107: {'lr': 0.0004990510370137637, 'samples': 1172544, 'steps': 6106, 'loss/train': 1.6799395084381104} 11/06/2021 22:06:32 - INFO - __main__ - Step 6108: {'lr': 0.0004990505750179682, 'samples': 1172736, 'steps': 6107, 'loss/train': 1.9363466501235962} 11/06/2021 22:06:32 - INFO - __main__ - Step 6109: {'lr': 0.0004990501129099542, 'samples': 1172928, 'steps': 6108, 'loss/train': 1.7905744314193726} 11/06/2021 22:06:32 - INFO - __main__ - Step 6110: {'lr': 0.000499049650689722, 'samples': 1173120, 'steps': 6109, 'loss/train': 1.5997885465621948} 11/06/2021 22:06:33 - INFO - __main__ - Step 6111: {'lr': 0.000499049188357272, 'samples': 1173312, 'steps': 6110, 'loss/train': 2.1724469661712646} 11/06/2021 22:06:34 - INFO - __main__ - Step 6112: {'lr': 0.0004990487259126043, 'samples': 1173504, 'steps': 6111, 'loss/train': 1.6976191997528076} 11/06/2021 22:06:34 - INFO - __main__ - Step 6113: {'lr': 0.0004990482633557189, 'samples': 1173696, 'steps': 6112, 'loss/train': 1.716886281967163} 11/06/2021 22:06:34 - INFO - __main__ - Step 6114: {'lr': 0.0004990478006866165, 'samples': 1173888, 'steps': 6113, 'loss/train': 1.6597141027450562} 11/06/2021 22:06:35 - INFO - __main__ - Step 6115: {'lr': 0.0004990473379052968, 'samples': 1174080, 'steps': 6114, 'loss/train': 2.12764835357666} 11/06/2021 22:06:36 - INFO - __main__ - Step 6116: {'lr': 0.0004990468750117602, 'samples': 1174272, 'steps': 6115, 'loss/train': 1.4543788433074951} 11/06/2021 22:06:36 - INFO - __main__ - Step 6117: {'lr': 0.000499046412006007, 'samples': 1174464, 'steps': 6116, 'loss/train': 1.662864327430725} 11/06/2021 22:06:37 - INFO - __main__ - Step 6118: {'lr': 0.0004990459488880372, 'samples': 1174656, 'steps': 6117, 'loss/train': 1.5874830484390259} 11/06/2021 22:06:37 - INFO - __main__ - Step 6119: {'lr': 0.0004990454856578513, 'samples': 1174848, 'steps': 6118, 'loss/train': 1.8817105293273926} 11/06/2021 22:06:37 - INFO - __main__ - Step 6120: {'lr': 0.0004990450223154492, 'samples': 1175040, 'steps': 6119, 'loss/train': 1.8093229532241821} 11/06/2021 22:06:38 - INFO - __main__ - Step 6121: {'lr': 0.0004990445588608313, 'samples': 1175232, 'steps': 6120, 'loss/train': 1.6032285690307617} 11/06/2021 22:06:39 - INFO - __main__ - Step 6122: {'lr': 0.0004990440952939979, 'samples': 1175424, 'steps': 6121, 'loss/train': 2.429919719696045} 11/06/2021 22:06:39 - INFO - __main__ - Step 6123: {'lr': 0.0004990436316149489, 'samples': 1175616, 'steps': 6122, 'loss/train': 1.6632248163223267} 11/06/2021 22:06:40 - INFO - __main__ - Step 6124: {'lr': 0.0004990431678236849, 'samples': 1175808, 'steps': 6123, 'loss/train': 1.8044439554214478} 11/06/2021 22:06:40 - INFO - __main__ - Step 6125: {'lr': 0.0004990427039202057, 'samples': 1176000, 'steps': 6124, 'loss/train': 1.9963434934616089} 11/06/2021 22:06:40 - INFO - __main__ - Step 6126: {'lr': 0.0004990422399045117, 'samples': 1176192, 'steps': 6125, 'loss/train': 1.9155720472335815} 11/06/2021 22:06:41 - INFO - __main__ - Step 6127: {'lr': 0.0004990417757766031, 'samples': 1176384, 'steps': 6126, 'loss/train': 1.775386929512024} 11/06/2021 22:06:42 - INFO - __main__ - Step 6128: {'lr': 0.0004990413115364803, 'samples': 1176576, 'steps': 6127, 'loss/train': 1.5602920055389404} 11/06/2021 22:06:42 - INFO - __main__ - Step 6129: {'lr': 0.0004990408471841431, 'samples': 1176768, 'steps': 6128, 'loss/train': 2.0410501956939697} 11/06/2021 22:06:42 - INFO - __main__ - Step 6130: {'lr': 0.0004990403827195921, 'samples': 1176960, 'steps': 6129, 'loss/train': 2.2790169715881348} 11/06/2021 22:06:43 - INFO - __main__ - Step 6131: {'lr': 0.0004990399181428273, 'samples': 1177152, 'steps': 6130, 'loss/train': 2.0821285247802734} 11/06/2021 22:06:44 - INFO - __main__ - Step 6132: {'lr': 0.000499039453453849, 'samples': 1177344, 'steps': 6131, 'loss/train': 1.8965604305267334} 11/06/2021 22:06:44 - INFO - __main__ - Step 6133: {'lr': 0.0004990389886526573, 'samples': 1177536, 'steps': 6132, 'loss/train': 1.8164457082748413} 11/06/2021 22:06:44 - INFO - __main__ - Step 6134: {'lr': 0.0004990385237392524, 'samples': 1177728, 'steps': 6133, 'loss/train': 1.9925854206085205} 11/06/2021 22:06:45 - INFO - __main__ - Step 6135: {'lr': 0.0004990380587136347, 'samples': 1177920, 'steps': 6134, 'loss/train': 1.2192782163619995} 11/06/2021 22:06:45 - INFO - __main__ - Step 6136: {'lr': 0.0004990375935758042, 'samples': 1178112, 'steps': 6135, 'loss/train': 1.5371315479278564} 11/06/2021 22:06:46 - INFO - __main__ - Step 6137: {'lr': 0.0004990371283257613, 'samples': 1178304, 'steps': 6136, 'loss/train': 1.8077142238616943} 11/06/2021 22:06:47 - INFO - __main__ - Step 6138: {'lr': 0.0004990366629635062, 'samples': 1178496, 'steps': 6137, 'loss/train': 2.013735771179199} 11/06/2021 22:06:47 - INFO - __main__ - Step 6139: {'lr': 0.0004990361974890388, 'samples': 1178688, 'steps': 6138, 'loss/train': 2.858919620513916} 11/06/2021 22:06:47 - INFO - __main__ - Step 6140: {'lr': 0.0004990357319023597, 'samples': 1178880, 'steps': 6139, 'loss/train': 1.368839144706726} 11/06/2021 22:06:48 - INFO - __main__ - Step 6141: {'lr': 0.0004990352662034689, 'samples': 1179072, 'steps': 6140, 'loss/train': 1.362464427947998} 11/06/2021 22:06:49 - INFO - __main__ - Step 6142: {'lr': 0.0004990348003923665, 'samples': 1179264, 'steps': 6141, 'loss/train': 1.9596736431121826} 11/06/2021 22:06:49 - INFO - __main__ - Step 6143: {'lr': 0.000499034334469053, 'samples': 1179456, 'steps': 6142, 'loss/train': 2.131988048553467} 11/06/2021 22:06:50 - INFO - __main__ - Step 6144: {'lr': 0.0004990338684335285, 'samples': 1179648, 'steps': 6143, 'loss/train': 1.4277572631835938} 11/06/2021 22:06:50 - INFO - __main__ - Step 6145: {'lr': 0.0004990334022857932, 'samples': 1179840, 'steps': 6144, 'loss/train': 1.9954853057861328} 11/06/2021 22:06:50 - INFO - __main__ - Step 6146: {'lr': 0.0004990329360258472, 'samples': 1180032, 'steps': 6145, 'loss/train': 1.7441011667251587} 11/06/2021 22:06:51 - INFO - __main__ - Step 6147: {'lr': 0.0004990324696536908, 'samples': 1180224, 'steps': 6146, 'loss/train': 0.7077341079711914} 11/06/2021 22:06:52 - INFO - __main__ - Step 6148: {'lr': 0.0004990320031693242, 'samples': 1180416, 'steps': 6147, 'loss/train': 2.2118520736694336} 11/06/2021 22:06:52 - INFO - __main__ - Step 6149: {'lr': 0.0004990315365727476, 'samples': 1180608, 'steps': 6148, 'loss/train': 2.055332660675049} 11/06/2021 22:06:52 - INFO - __main__ - Step 6150: {'lr': 0.0004990310698639614, 'samples': 1180800, 'steps': 6149, 'loss/train': 2.1165621280670166} 11/06/2021 22:06:53 - INFO - __main__ - Step 6151: {'lr': 0.0004990306030429655, 'samples': 1180992, 'steps': 6150, 'loss/train': 1.5419307947158813} 11/06/2021 22:06:53 - INFO - __main__ - Step 6152: {'lr': 0.0004990301361097603, 'samples': 1181184, 'steps': 6151, 'loss/train': 1.1245484352111816} 11/06/2021 22:06:54 - INFO - __main__ - Step 6153: {'lr': 0.000499029669064346, 'samples': 1181376, 'steps': 6152, 'loss/train': 2.3590290546417236} 11/06/2021 22:06:54 - INFO - __main__ - Step 6154: {'lr': 0.0004990292019067227, 'samples': 1181568, 'steps': 6153, 'loss/train': 1.9370914697647095} 11/06/2021 22:06:55 - INFO - __main__ - Step 6155: {'lr': 0.0004990287346368908, 'samples': 1181760, 'steps': 6154, 'loss/train': 2.3206193447113037} 11/06/2021 22:06:55 - INFO - __main__ - Step 6156: {'lr': 0.0004990282672548503, 'samples': 1181952, 'steps': 6155, 'loss/train': 1.679478645324707} 11/06/2021 22:06:56 - INFO - __main__ - Step 6157: {'lr': 0.0004990277997606016, 'samples': 1182144, 'steps': 6156, 'loss/train': 1.524378776550293} 11/06/2021 22:06:56 - INFO - __main__ - Step 6158: {'lr': 0.0004990273321541447, 'samples': 1182336, 'steps': 6157, 'loss/train': 0.6619194149971008} 11/06/2021 22:06:57 - INFO - __main__ - Step 6159: {'lr': 0.0004990268644354799, 'samples': 1182528, 'steps': 6158, 'loss/train': 1.9400123357772827} 11/06/2021 22:06:57 - INFO - __main__ - Step 6160: {'lr': 0.0004990263966046075, 'samples': 1182720, 'steps': 6159, 'loss/train': 1.8187702894210815} 11/06/2021 22:06:57 - INFO - __main__ - Step 6161: {'lr': 0.0004990259286615276, 'samples': 1182912, 'steps': 6160, 'loss/train': 1.444914698600769} 11/06/2021 22:06:58 - INFO - __main__ - Step 6162: {'lr': 0.0004990254606062406, 'samples': 1183104, 'steps': 6161, 'loss/train': 1.4133529663085938} 11/06/2021 22:06:59 - INFO - __main__ - Step 6163: {'lr': 0.0004990249924387465, 'samples': 1183296, 'steps': 6162, 'loss/train': 1.8153014183044434} 11/06/2021 22:06:59 - INFO - __main__ - Step 6164: {'lr': 0.0004990245241590455, 'samples': 1183488, 'steps': 6163, 'loss/train': 1.761313796043396} 11/06/2021 22:07:00 - INFO - __main__ - Step 6165: {'lr': 0.0004990240557671379, 'samples': 1183680, 'steps': 6164, 'loss/train': 0.9687419533729553} 11/06/2021 22:07:00 - INFO - __main__ - Step 6166: {'lr': 0.000499023587263024, 'samples': 1183872, 'steps': 6165, 'loss/train': 1.6445512771606445} 11/06/2021 22:07:00 - INFO - __main__ - Step 6167: {'lr': 0.0004990231186467039, 'samples': 1184064, 'steps': 6166, 'loss/train': 1.3879804611206055} 11/06/2021 22:07:01 - INFO - __main__ - Step 6168: {'lr': 0.0004990226499181778, 'samples': 1184256, 'steps': 6167, 'loss/train': 2.506103992462158} 11/06/2021 22:07:02 - INFO - __main__ - Step 6169: {'lr': 0.0004990221810774459, 'samples': 1184448, 'steps': 6168, 'loss/train': 2.4504177570343018} 11/06/2021 22:07:02 - INFO - __main__ - Step 6170: {'lr': 0.0004990217121245084, 'samples': 1184640, 'steps': 6169, 'loss/train': 1.9792548418045044} 11/06/2021 22:07:02 - INFO - __main__ - Step 6171: {'lr': 0.0004990212430593657, 'samples': 1184832, 'steps': 6170, 'loss/train': 1.8940435647964478} 11/06/2021 22:07:03 - INFO - __main__ - Step 6172: {'lr': 0.0004990207738820178, 'samples': 1185024, 'steps': 6171, 'loss/train': 1.8361660242080688} 11/06/2021 22:07:04 - INFO - __main__ - Step 6173: {'lr': 0.000499020304592465, 'samples': 1185216, 'steps': 6172, 'loss/train': 2.492899179458618} 11/06/2021 22:07:04 - INFO - __main__ - Step 6174: {'lr': 0.0004990198351907075, 'samples': 1185408, 'steps': 6173, 'loss/train': 1.5003889799118042} 11/06/2021 22:07:04 - INFO - __main__ - Step 6175: {'lr': 0.0004990193656767455, 'samples': 1185600, 'steps': 6174, 'loss/train': 2.0811853408813477} 11/06/2021 22:07:05 - INFO - __main__ - Step 6176: {'lr': 0.0004990188960505792, 'samples': 1185792, 'steps': 6175, 'loss/train': 3.0719258785247803} 11/06/2021 22:07:05 - INFO - __main__ - Step 6177: {'lr': 0.0004990184263122088, 'samples': 1185984, 'steps': 6176, 'loss/train': 1.861528754234314} 11/06/2021 22:07:05 - INFO - __main__ - Step 6178: {'lr': 0.0004990179564616346, 'samples': 1186176, 'steps': 6177, 'loss/train': 5.958117485046387} 11/06/2021 22:07:07 - INFO - __main__ - Step 6179: {'lr': 0.0004990174864988566, 'samples': 1186368, 'steps': 6178, 'loss/train': 1.926222562789917} 11/06/2021 22:07:07 - INFO - __main__ - Step 6180: {'lr': 0.0004990170164238754, 'samples': 1186560, 'steps': 6179, 'loss/train': 1.9009610414505005} 11/06/2021 22:07:07 - INFO - __main__ - Step 6181: {'lr': 0.0004990165462366909, 'samples': 1186752, 'steps': 6180, 'loss/train': 2.4664011001586914} 11/06/2021 22:07:08 - INFO - __main__ - Step 6182: {'lr': 0.0004990160759373033, 'samples': 1186944, 'steps': 6181, 'loss/train': 2.10902738571167} 11/06/2021 22:07:08 - INFO - __main__ - Step 6183: {'lr': 0.0004990156055257129, 'samples': 1187136, 'steps': 6182, 'loss/train': 1.813493251800537} 11/06/2021 22:07:09 - INFO - __main__ - Step 6184: {'lr': 0.00049901513500192, 'samples': 1187328, 'steps': 6183, 'loss/train': 2.25309681892395} 11/06/2021 22:07:09 - INFO - __main__ - Step 6185: {'lr': 0.0004990146643659247, 'samples': 1187520, 'steps': 6184, 'loss/train': 2.2059273719787598} 11/06/2021 22:07:10 - INFO - __main__ - Step 6186: {'lr': 0.0004990141936177272, 'samples': 1187712, 'steps': 6185, 'loss/train': 2.0499684810638428} 11/06/2021 22:07:10 - INFO - __main__ - Step 6187: {'lr': 0.0004990137227573278, 'samples': 1187904, 'steps': 6186, 'loss/train': 3.2400269508361816} 11/06/2021 22:07:10 - INFO - __main__ - Step 6188: {'lr': 0.0004990132517847266, 'samples': 1188096, 'steps': 6187, 'loss/train': 1.8898727893829346} 11/06/2021 22:07:11 - INFO - __main__ - Step 6189: {'lr': 0.0004990127806999239, 'samples': 1188288, 'steps': 6188, 'loss/train': 0.9931958317756653} 11/06/2021 22:07:12 - INFO - __main__ - Step 6190: {'lr': 0.0004990123095029199, 'samples': 1188480, 'steps': 6189, 'loss/train': 1.6618373394012451} 11/06/2021 22:07:12 - INFO - __main__ - Step 6191: {'lr': 0.0004990118381937148, 'samples': 1188672, 'steps': 6190, 'loss/train': 1.4901134967803955} 11/06/2021 22:07:12 - INFO - __main__ - Step 6192: {'lr': 0.0004990113667723088, 'samples': 1188864, 'steps': 6191, 'loss/train': 1.1681599617004395} 11/06/2021 22:07:13 - INFO - __main__ - Step 6193: {'lr': 0.000499010895238702, 'samples': 1189056, 'steps': 6192, 'loss/train': 1.8815301656723022} 11/06/2021 22:07:14 - INFO - __main__ - Step 6194: {'lr': 0.0004990104235928948, 'samples': 1189248, 'steps': 6193, 'loss/train': 1.8837906122207642} 11/06/2021 22:07:14 - INFO - __main__ - Step 6195: {'lr': 0.0004990099518348874, 'samples': 1189440, 'steps': 6194, 'loss/train': 2.526460647583008} 11/06/2021 22:07:14 - INFO - __main__ - Step 6196: {'lr': 0.00049900947996468, 'samples': 1189632, 'steps': 6195, 'loss/train': 1.910827875137329} 11/06/2021 22:07:15 - INFO - __main__ - Step 6197: {'lr': 0.0004990090079822726, 'samples': 1189824, 'steps': 6196, 'loss/train': 2.300349473953247} 11/06/2021 22:07:15 - INFO - __main__ - Step 6198: {'lr': 0.0004990085358876658, 'samples': 1190016, 'steps': 6197, 'loss/train': 1.6927975416183472} 11/06/2021 22:07:16 - INFO - __main__ - Step 6199: {'lr': 0.0004990080636808595, 'samples': 1190208, 'steps': 6198, 'loss/train': 2.1126410961151123} 11/06/2021 22:07:17 - INFO - __main__ - Step 6200: {'lr': 0.000499007591361854, 'samples': 1190400, 'steps': 6199, 'loss/train': 1.71726393699646} 11/06/2021 22:07:17 - INFO - __main__ - Step 6201: {'lr': 0.0004990071189306495, 'samples': 1190592, 'steps': 6200, 'loss/train': 2.024538040161133} 11/06/2021 22:07:17 - INFO - __main__ - Step 6202: {'lr': 0.0004990066463872462, 'samples': 1190784, 'steps': 6201, 'loss/train': 1.7335742712020874} 11/06/2021 22:07:18 - INFO - __main__ - Step 6203: {'lr': 0.0004990061737316445, 'samples': 1190976, 'steps': 6202, 'loss/train': 1.7096867561340332} 11/06/2021 22:07:18 - INFO - __main__ - Step 6204: {'lr': 0.0004990057009638443, 'samples': 1191168, 'steps': 6203, 'loss/train': 1.5593353509902954} 11/06/2021 22:07:19 - INFO - __main__ - Step 6205: {'lr': 0.000499005228083846, 'samples': 1191360, 'steps': 6204, 'loss/train': 2.450568437576294} 11/06/2021 22:07:20 - INFO - __main__ - Step 6206: {'lr': 0.0004990047550916498, 'samples': 1191552, 'steps': 6205, 'loss/train': 2.2195775508880615} 11/06/2021 22:07:20 - INFO - __main__ - Step 6207: {'lr': 0.000499004281987256, 'samples': 1191744, 'steps': 6206, 'loss/train': 1.6635841131210327} 11/06/2021 22:07:20 - INFO - __main__ - Step 6208: {'lr': 0.0004990038087706646, 'samples': 1191936, 'steps': 6207, 'loss/train': 1.9627450704574585} 11/06/2021 22:07:21 - INFO - __main__ - Step 6209: {'lr': 0.000499003335441876, 'samples': 1192128, 'steps': 6208, 'loss/train': 2.254753351211548} 11/06/2021 22:07:22 - INFO - __main__ - Step 6210: {'lr': 0.0004990028620008903, 'samples': 1192320, 'steps': 6209, 'loss/train': 1.8182693719863892} 11/06/2021 22:07:22 - INFO - __main__ - Step 6211: {'lr': 0.0004990023884477077, 'samples': 1192512, 'steps': 6210, 'loss/train': 2.1537516117095947} 11/06/2021 22:07:23 - INFO - __main__ - Step 6212: {'lr': 0.0004990019147823286, 'samples': 1192704, 'steps': 6211, 'loss/train': 1.6900304555892944} 11/06/2021 22:07:23 - INFO - __main__ - Step 6213: {'lr': 0.000499001441004753, 'samples': 1192896, 'steps': 6212, 'loss/train': 0.32827237248420715} 11/06/2021 22:07:23 - INFO - __main__ - Step 6214: {'lr': 0.0004990009671149811, 'samples': 1193088, 'steps': 6213, 'loss/train': 1.9133497476577759} 11/06/2021 22:07:24 - INFO - __main__ - Step 6215: {'lr': 0.0004990004931130133, 'samples': 1193280, 'steps': 6214, 'loss/train': 2.055999994277954} 11/06/2021 22:07:25 - INFO - __main__ - Step 6216: {'lr': 0.0004990000189988497, 'samples': 1193472, 'steps': 6215, 'loss/train': 2.248798370361328} 11/06/2021 22:07:25 - INFO - __main__ - Step 6217: {'lr': 0.0004989995447724907, 'samples': 1193664, 'steps': 6216, 'loss/train': 1.8303406238555908} 11/06/2021 22:07:26 - INFO - __main__ - Step 6218: {'lr': 0.0004989990704339361, 'samples': 1193856, 'steps': 6217, 'loss/train': 1.0167715549468994} 11/06/2021 22:07:26 - INFO - __main__ - Step 6219: {'lr': 0.0004989985959831865, 'samples': 1194048, 'steps': 6218, 'loss/train': 1.4236408472061157} 11/06/2021 22:07:27 - INFO - __main__ - Step 6220: {'lr': 0.0004989981214202419, 'samples': 1194240, 'steps': 6219, 'loss/train': 2.0307483673095703} 11/06/2021 22:07:28 - INFO - __main__ - Step 6221: {'lr': 0.0004989976467451026, 'samples': 1194432, 'steps': 6220, 'loss/train': 2.141268253326416} 11/06/2021 22:07:28 - INFO - __main__ - Step 6222: {'lr': 0.0004989971719577688, 'samples': 1194624, 'steps': 6221, 'loss/train': 1.8791309595108032} 11/06/2021 22:07:28 - INFO - __main__ - Step 6223: {'lr': 0.0004989966970582408, 'samples': 1194816, 'steps': 6222, 'loss/train': 1.7829779386520386} 11/06/2021 22:07:29 - INFO - __main__ - Step 6224: {'lr': 0.0004989962220465187, 'samples': 1195008, 'steps': 6223, 'loss/train': 4.018397331237793} 11/06/2021 22:07:29 - INFO - __main__ - Step 6225: {'lr': 0.0004989957469226027, 'samples': 1195200, 'steps': 6224, 'loss/train': 0.3159443736076355} 11/06/2021 22:07:30 - INFO - __main__ - Step 6226: {'lr': 0.0004989952716864931, 'samples': 1195392, 'steps': 6225, 'loss/train': 2.1702065467834473} 11/06/2021 22:07:30 - INFO - __main__ - Step 6227: {'lr': 0.00049899479633819, 'samples': 1195584, 'steps': 6226, 'loss/train': 1.8308675289154053} 11/06/2021 22:07:31 - INFO - __main__ - Step 6228: {'lr': 0.0004989943208776938, 'samples': 1195776, 'steps': 6227, 'loss/train': 2.180677890777588} 11/06/2021 22:07:31 - INFO - __main__ - Step 6229: {'lr': 0.0004989938453050045, 'samples': 1195968, 'steps': 6228, 'loss/train': 1.6663007736206055} 11/06/2021 22:07:31 - INFO - __main__ - Step 6230: {'lr': 0.0004989933696201225, 'samples': 1196160, 'steps': 6229, 'loss/train': 1.936288595199585} 11/06/2021 22:07:33 - INFO - __main__ - Step 6231: {'lr': 0.0004989928938230478, 'samples': 1196352, 'steps': 6230, 'loss/train': 2.008770227432251} 11/06/2021 22:07:33 - INFO - __main__ - Step 6232: {'lr': 0.0004989924179137808, 'samples': 1196544, 'steps': 6231, 'loss/train': 2.004091501235962} 11/06/2021 22:07:33 - INFO - __main__ - Step 6233: {'lr': 0.0004989919418923218, 'samples': 1196736, 'steps': 6232, 'loss/train': 1.9655420780181885} 11/06/2021 22:07:34 - INFO - __main__ - Step 6234: {'lr': 0.0004989914657586707, 'samples': 1196928, 'steps': 6233, 'loss/train': 1.8493226766586304} 11/06/2021 22:07:34 - INFO - __main__ - Step 6235: {'lr': 0.000498990989512828, 'samples': 1197120, 'steps': 6234, 'loss/train': 2.194037675857544} 11/06/2021 22:07:35 - INFO - __main__ - Step 6236: {'lr': 0.0004989905131547937, 'samples': 1197312, 'steps': 6235, 'loss/train': 2.581127643585205} 11/06/2021 22:07:35 - INFO - __main__ - Step 6237: {'lr': 0.0004989900366845682, 'samples': 1197504, 'steps': 6236, 'loss/train': 2.2226333618164062} 11/06/2021 22:07:36 - INFO - __main__ - Step 6238: {'lr': 0.0004989895601021515, 'samples': 1197696, 'steps': 6237, 'loss/train': 2.466501235961914} 11/06/2021 22:07:36 - INFO - __main__ - Step 6239: {'lr': 0.0004989890834075441, 'samples': 1197888, 'steps': 6238, 'loss/train': 1.5966987609863281} 11/06/2021 22:07:36 - INFO - __main__ - Step 6240: {'lr': 0.000498988606600746, 'samples': 1198080, 'steps': 6239, 'loss/train': 1.8267741203308105} 11/06/2021 22:07:37 - INFO - __main__ - Step 6241: {'lr': 0.0004989881296817575, 'samples': 1198272, 'steps': 6240, 'loss/train': 2.1044561862945557} 11/06/2021 22:07:38 - INFO - __main__ - Step 6242: {'lr': 0.0004989876526505788, 'samples': 1198464, 'steps': 6241, 'loss/train': 1.7580589056015015} 11/06/2021 22:07:38 - INFO - __main__ - Step 6243: {'lr': 0.0004989871755072101, 'samples': 1198656, 'steps': 6242, 'loss/train': 1.8593974113464355} 11/06/2021 22:07:38 - INFO - __main__ - Step 6244: {'lr': 0.0004989866982516516, 'samples': 1198848, 'steps': 6243, 'loss/train': 1.7739224433898926} 11/06/2021 22:07:39 - INFO - __main__ - Step 6245: {'lr': 0.0004989862208839035, 'samples': 1199040, 'steps': 6244, 'loss/train': 2.1439409255981445} 11/06/2021 22:07:40 - INFO - __main__ - Step 6246: {'lr': 0.0004989857434039661, 'samples': 1199232, 'steps': 6245, 'loss/train': 1.7785552740097046} 11/06/2021 22:07:40 - INFO - __main__ - Step 6247: {'lr': 0.0004989852658118395, 'samples': 1199424, 'steps': 6246, 'loss/train': 0.6885305047035217} 11/06/2021 22:07:41 - INFO - __main__ - Step 6248: {'lr': 0.000498984788107524, 'samples': 1199616, 'steps': 6247, 'loss/train': 1.918849229812622} 11/06/2021 22:07:41 - INFO - __main__ - Step 6249: {'lr': 0.0004989843102910198, 'samples': 1199808, 'steps': 6248, 'loss/train': 1.5366085767745972} 11/06/2021 22:07:41 - INFO - __main__ - Step 6250: {'lr': 0.0004989838323623272, 'samples': 1200000, 'steps': 6249, 'loss/train': 2.3803365230560303} 11/06/2021 22:07:42 - INFO - __main__ - Step 6251: {'lr': 0.0004989833543214463, 'samples': 1200192, 'steps': 6250, 'loss/train': 1.9775062799453735} 11/06/2021 22:07:43 - INFO - __main__ - Step 6252: {'lr': 0.0004989828761683774, 'samples': 1200384, 'steps': 6251, 'loss/train': 1.7551454305648804} 11/06/2021 22:07:43 - INFO - __main__ - Step 6253: {'lr': 0.0004989823979031205, 'samples': 1200576, 'steps': 6252, 'loss/train': 1.5888711214065552} 11/06/2021 22:07:43 - INFO - __main__ - Step 6254: {'lr': 0.000498981919525676, 'samples': 1200768, 'steps': 6253, 'loss/train': 1.925179362297058} 11/06/2021 22:07:44 - INFO - __main__ - Step 6255: {'lr': 0.0004989814410360442, 'samples': 1200960, 'steps': 6254, 'loss/train': 0.6360993981361389} 11/06/2021 22:07:45 - INFO - __main__ - Step 6256: {'lr': 0.0004989809624342251, 'samples': 1201152, 'steps': 6255, 'loss/train': 1.8095260858535767} 11/06/2021 22:07:45 - INFO - __main__ - Step 6257: {'lr': 0.000498980483720219, 'samples': 1201344, 'steps': 6256, 'loss/train': 1.7746502161026} 11/06/2021 22:07:46 - INFO - __main__ - Step 6258: {'lr': 0.0004989800048940263, 'samples': 1201536, 'steps': 6257, 'loss/train': 2.096313953399658} 11/06/2021 22:07:46 - INFO - __main__ - Step 6259: {'lr': 0.0004989795259556469, 'samples': 1201728, 'steps': 6258, 'loss/train': 2.1959168910980225} 11/06/2021 22:07:46 - INFO - __main__ - Step 6260: {'lr': 0.0004989790469050813, 'samples': 1201920, 'steps': 6259, 'loss/train': 1.8663018941879272} 11/06/2021 22:07:47 - INFO - __main__ - Step 6261: {'lr': 0.0004989785677423295, 'samples': 1202112, 'steps': 6260, 'loss/train': 1.3765881061553955} 11/06/2021 22:07:47 - INFO - __main__ - Step 6262: {'lr': 0.0004989780884673917, 'samples': 1202304, 'steps': 6261, 'loss/train': 2.113983631134033} 11/06/2021 22:07:48 - INFO - __main__ - Step 6263: {'lr': 0.0004989776090802683, 'samples': 1202496, 'steps': 6262, 'loss/train': 2.6338729858398438} 11/06/2021 22:07:48 - INFO - __main__ - Step 6264: {'lr': 0.0004989771295809594, 'samples': 1202688, 'steps': 6263, 'loss/train': 1.5440324544906616} 11/06/2021 22:07:49 - INFO - __main__ - Step 6265: {'lr': 0.0004989766499694653, 'samples': 1202880, 'steps': 6264, 'loss/train': 2.112800121307373} 11/06/2021 22:07:50 - INFO - __main__ - Step 6266: {'lr': 0.0004989761702457862, 'samples': 1203072, 'steps': 6265, 'loss/train': 1.9621334075927734} 11/06/2021 22:07:50 - INFO - __main__ - Step 6267: {'lr': 0.0004989756904099222, 'samples': 1203264, 'steps': 6266, 'loss/train': 2.0875959396362305} 11/06/2021 22:07:50 - INFO - __main__ - Step 6268: {'lr': 0.0004989752104618736, 'samples': 1203456, 'steps': 6267, 'loss/train': 2.009376287460327} 11/06/2021 22:07:51 - INFO - __main__ - Step 6269: {'lr': 0.0004989747304016407, 'samples': 1203648, 'steps': 6268, 'loss/train': 2.29427433013916} 11/06/2021 22:07:51 - INFO - __main__ - Step 6270: {'lr': 0.0004989742502292235, 'samples': 1203840, 'steps': 6269, 'loss/train': 2.1417438983917236} 11/06/2021 22:07:52 - INFO - __main__ - Step 6271: {'lr': 0.0004989737699446225, 'samples': 1204032, 'steps': 6270, 'loss/train': 1.981139063835144} 11/06/2021 22:07:52 - INFO - __main__ - Step 6272: {'lr': 0.0004989732895478376, 'samples': 1204224, 'steps': 6271, 'loss/train': 1.8909876346588135} 11/06/2021 22:07:53 - INFO - __main__ - Step 6273: {'lr': 0.0004989728090388693, 'samples': 1204416, 'steps': 6272, 'loss/train': 1.6940367221832275} 11/06/2021 22:07:53 - INFO - __main__ - Step 6274: {'lr': 0.0004989723284177177, 'samples': 1204608, 'steps': 6273, 'loss/train': 2.325897216796875} 11/06/2021 22:07:53 - INFO - __main__ - Step 6275: {'lr': 0.0004989718476843828, 'samples': 1204800, 'steps': 6274, 'loss/train': 1.9315811395645142} 11/06/2021 22:07:54 - INFO - __main__ - Step 6276: {'lr': 0.0004989713668388652, 'samples': 1204992, 'steps': 6275, 'loss/train': 1.894089937210083} 11/06/2021 22:07:55 - INFO - __main__ - Step 6277: {'lr': 0.000498970885881165, 'samples': 1205184, 'steps': 6276, 'loss/train': 1.7275776863098145} 11/06/2021 22:07:55 - INFO - __main__ - Step 6278: {'lr': 0.0004989704048112823, 'samples': 1205376, 'steps': 6277, 'loss/train': 2.880239486694336} 11/06/2021 22:07:55 - INFO - __main__ - Step 6279: {'lr': 0.0004989699236292173, 'samples': 1205568, 'steps': 6278, 'loss/train': 1.9111852645874023} 11/06/2021 22:07:56 - INFO - __main__ - Step 6280: {'lr': 0.0004989694423349704, 'samples': 1205760, 'steps': 6279, 'loss/train': 2.0868144035339355} 11/06/2021 22:07:57 - INFO - __main__ - Step 6281: {'lr': 0.0004989689609285417, 'samples': 1205952, 'steps': 6280, 'loss/train': 2.3991446495056152} 11/06/2021 22:07:57 - INFO - __main__ - Step 6282: {'lr': 0.0004989684794099314, 'samples': 1206144, 'steps': 6281, 'loss/train': 1.962192177772522} 11/06/2021 22:07:57 - INFO - __main__ - Step 6283: {'lr': 0.0004989679977791397, 'samples': 1206336, 'steps': 6282, 'loss/train': 2.007171630859375} 11/06/2021 22:07:58 - INFO - __main__ - Step 6284: {'lr': 0.0004989675160361669, 'samples': 1206528, 'steps': 6283, 'loss/train': 2.0830070972442627} 11/06/2021 22:07:58 - INFO - __main__ - Step 6285: {'lr': 0.0004989670341810132, 'samples': 1206720, 'steps': 6284, 'loss/train': 1.7555127143859863} 11/06/2021 22:07:59 - INFO - __main__ - Step 6286: {'lr': 0.0004989665522136789, 'samples': 1206912, 'steps': 6285, 'loss/train': 1.7010811567306519} 11/06/2021 22:08:00 - INFO - __main__ - Step 6287: {'lr': 0.0004989660701341639, 'samples': 1207104, 'steps': 6286, 'loss/train': 1.6270495653152466} 11/06/2021 22:08:00 - INFO - __main__ - Step 6288: {'lr': 0.0004989655879424687, 'samples': 1207296, 'steps': 6287, 'loss/train': 0.33006778359413147} 11/06/2021 22:08:00 - INFO - __main__ - Step 6289: {'lr': 0.0004989651056385936, 'samples': 1207488, 'steps': 6288, 'loss/train': 1.9551554918289185} 11/06/2021 22:08:01 - INFO - __main__ - Step 6290: {'lr': 0.0004989646232225384, 'samples': 1207680, 'steps': 6289, 'loss/train': 5.657275199890137} 11/06/2021 22:08:01 - INFO - __main__ - Step 6291: {'lr': 0.0004989641406943037, 'samples': 1207872, 'steps': 6290, 'loss/train': 1.931122899055481} 11/06/2021 22:08:02 - INFO - __main__ - Step 6292: {'lr': 0.0004989636580538896, 'samples': 1208064, 'steps': 6291, 'loss/train': 1.3750081062316895} 11/06/2021 22:08:02 - INFO - __main__ - Step 6293: {'lr': 0.0004989631753012964, 'samples': 1208256, 'steps': 6292, 'loss/train': 2.0061533451080322} 11/06/2021 22:08:03 - INFO - __main__ - Step 6294: {'lr': 0.0004989626924365242, 'samples': 1208448, 'steps': 6293, 'loss/train': 1.9302148818969727} 11/06/2021 22:08:03 - INFO - __main__ - Step 6295: {'lr': 0.0004989622094595733, 'samples': 1208640, 'steps': 6294, 'loss/train': 2.0548641681671143} 11/06/2021 22:08:03 - INFO - __main__ - Step 6296: {'lr': 0.0004989617263704437, 'samples': 1208832, 'steps': 6295, 'loss/train': 1.7614363431930542} 11/06/2021 22:08:05 - INFO - __main__ - Step 6297: {'lr': 0.0004989612431691359, 'samples': 1209024, 'steps': 6296, 'loss/train': 1.5389564037322998} 11/06/2021 22:08:05 - INFO - __main__ - Step 6298: {'lr': 0.0004989607598556501, 'samples': 1209216, 'steps': 6297, 'loss/train': 2.1703968048095703} 11/06/2021 22:08:05 - INFO - __main__ - Step 6299: {'lr': 0.0004989602764299862, 'samples': 1209408, 'steps': 6298, 'loss/train': 1.22577702999115} 11/06/2021 22:08:06 - INFO - __main__ - Step 6300: {'lr': 0.0004989597928921447, 'samples': 1209600, 'steps': 6299, 'loss/train': 2.2829113006591797} 11/06/2021 22:08:06 - INFO - __main__ - Step 6301: {'lr': 0.0004989593092421258, 'samples': 1209792, 'steps': 6300, 'loss/train': 1.3044848442077637} 11/06/2021 22:08:06 - INFO - __main__ - Step 6302: {'lr': 0.0004989588254799297, 'samples': 1209984, 'steps': 6301, 'loss/train': 1.8990617990493774} 11/06/2021 22:08:07 - INFO - __main__ - Step 6303: {'lr': 0.0004989583416055566, 'samples': 1210176, 'steps': 6302, 'loss/train': 1.9017807245254517} 11/06/2021 22:08:08 - INFO - __main__ - Step 6304: {'lr': 0.0004989578576190068, 'samples': 1210368, 'steps': 6303, 'loss/train': 2.0887370109558105} 11/06/2021 22:08:08 - INFO - __main__ - Step 6305: {'lr': 0.0004989573735202802, 'samples': 1210560, 'steps': 6304, 'loss/train': 1.910634994506836} 11/06/2021 22:08:08 - INFO - __main__ - Step 6306: {'lr': 0.0004989568893093774, 'samples': 1210752, 'steps': 6305, 'loss/train': 1.4305139780044556} 11/06/2021 22:08:09 - INFO - __main__ - Step 6307: {'lr': 0.0004989564049862986, 'samples': 1210944, 'steps': 6306, 'loss/train': 1.7775681018829346} 11/06/2021 22:08:10 - INFO - __main__ - Step 6308: {'lr': 0.0004989559205510436, 'samples': 1211136, 'steps': 6307, 'loss/train': 1.7592074871063232} 11/06/2021 22:08:10 - INFO - __main__ - Step 6309: {'lr': 0.000498955436003613, 'samples': 1211328, 'steps': 6308, 'loss/train': 1.7411454916000366} 11/06/2021 22:08:10 - INFO - __main__ - Step 6310: {'lr': 0.0004989549513440071, 'samples': 1211520, 'steps': 6309, 'loss/train': 2.116926431655884} 11/06/2021 22:08:11 - INFO - __main__ - Step 6311: {'lr': 0.0004989544665722258, 'samples': 1211712, 'steps': 6310, 'loss/train': 1.8361932039260864} 11/06/2021 22:08:11 - INFO - __main__ - Step 6312: {'lr': 0.0004989539816882694, 'samples': 1211904, 'steps': 6311, 'loss/train': 1.3926923274993896} 11/06/2021 22:08:12 - INFO - __main__ - Step 6313: {'lr': 0.0004989534966921382, 'samples': 1212096, 'steps': 6312, 'loss/train': 1.8578460216522217} 11/06/2021 22:08:13 - INFO - __main__ - Step 6314: {'lr': 0.0004989530115838324, 'samples': 1212288, 'steps': 6313, 'loss/train': 2.0698132514953613} 11/06/2021 22:08:13 - INFO - __main__ - Step 6315: {'lr': 0.0004989525263633523, 'samples': 1212480, 'steps': 6314, 'loss/train': 1.3116508722305298} 11/06/2021 22:08:13 - INFO - __main__ - Step 6316: {'lr': 0.0004989520410306979, 'samples': 1212672, 'steps': 6315, 'loss/train': 1.9149450063705444} 11/06/2021 22:08:14 - INFO - __main__ - Step 6317: {'lr': 0.0004989515555858697, 'samples': 1212864, 'steps': 6316, 'loss/train': 2.206059694290161} 11/06/2021 22:08:15 - INFO - __main__ - Step 6318: {'lr': 0.0004989510700288678, 'samples': 1213056, 'steps': 6317, 'loss/train': 1.6991815567016602} 11/06/2021 22:08:15 - INFO - __main__ - Step 6319: {'lr': 0.0004989505843596922, 'samples': 1213248, 'steps': 6318, 'loss/train': 2.7226216793060303} 11/06/2021 22:08:15 - INFO - __main__ - Step 6320: {'lr': 0.0004989500985783434, 'samples': 1213440, 'steps': 6319, 'loss/train': 2.1540679931640625} 11/06/2021 22:08:16 - INFO - __main__ - Step 6321: {'lr': 0.0004989496126848215, 'samples': 1213632, 'steps': 6320, 'loss/train': 1.8948540687561035} 11/06/2021 22:08:16 - INFO - __main__ - Step 6322: {'lr': 0.0004989491266791268, 'samples': 1213824, 'steps': 6321, 'loss/train': 1.8643752336502075} 11/06/2021 22:08:17 - INFO - __main__ - Step 6323: {'lr': 0.0004989486405612595, 'samples': 1214016, 'steps': 6322, 'loss/train': 1.8771063089370728} 11/06/2021 22:08:18 - INFO - __main__ - Step 6324: {'lr': 0.0004989481543312196, 'samples': 1214208, 'steps': 6323, 'loss/train': 1.7344616651535034} 11/06/2021 22:08:18 - INFO - __main__ - Step 6325: {'lr': 0.0004989476679890077, 'samples': 1214400, 'steps': 6324, 'loss/train': 1.7559272050857544} 11/06/2021 22:08:18 - INFO - __main__ - Step 6326: {'lr': 0.0004989471815346237, 'samples': 1214592, 'steps': 6325, 'loss/train': 2.1662731170654297} 11/06/2021 22:08:19 - INFO - __main__ - Step 6327: {'lr': 0.000498946694968068, 'samples': 1214784, 'steps': 6326, 'loss/train': 2.0608224868774414} 11/06/2021 22:08:19 - INFO - __main__ - Step 6328: {'lr': 0.0004989462082893407, 'samples': 1214976, 'steps': 6327, 'loss/train': 2.2688443660736084} 11/06/2021 22:08:20 - INFO - __main__ - Step 6329: {'lr': 0.0004989457214984421, 'samples': 1215168, 'steps': 6328, 'loss/train': 2.0150251388549805} 11/06/2021 22:08:20 - INFO - __main__ - Step 6330: {'lr': 0.0004989452345953725, 'samples': 1215360, 'steps': 6329, 'loss/train': 1.8572430610656738} 11/06/2021 22:08:21 - INFO - __main__ - Step 6331: {'lr': 0.000498944747580132, 'samples': 1215552, 'steps': 6330, 'loss/train': 1.986055850982666} 11/06/2021 22:08:21 - INFO - __main__ - Step 6332: {'lr': 0.0004989442604527208, 'samples': 1215744, 'steps': 6331, 'loss/train': 1.7587941884994507} 11/06/2021 22:08:21 - INFO - __main__ - Step 6333: {'lr': 0.0004989437732131391, 'samples': 1215936, 'steps': 6332, 'loss/train': 1.8428518772125244} 11/06/2021 22:08:23 - INFO - __main__ - Step 6334: {'lr': 0.0004989432858613873, 'samples': 1216128, 'steps': 6333, 'loss/train': 2.1033074855804443} 11/06/2021 22:08:23 - INFO - __main__ - Step 6335: {'lr': 0.0004989427983974653, 'samples': 1216320, 'steps': 6334, 'loss/train': 1.3180185556411743} 11/06/2021 22:08:23 - INFO - __main__ - Step 6336: {'lr': 0.0004989423108213737, 'samples': 1216512, 'steps': 6335, 'loss/train': 2.3260984420776367} 11/06/2021 22:08:24 - INFO - __main__ - Step 6337: {'lr': 0.0004989418231331124, 'samples': 1216704, 'steps': 6336, 'loss/train': 2.056525230407715} 11/06/2021 22:08:24 - INFO - __main__ - Step 6338: {'lr': 0.0004989413353326818, 'samples': 1216896, 'steps': 6337, 'loss/train': 2.151630401611328} 11/06/2021 22:08:25 - INFO - __main__ - Step 6339: {'lr': 0.0004989408474200821, 'samples': 1217088, 'steps': 6338, 'loss/train': 0.4232385456562042} 11/06/2021 22:08:25 - INFO - __main__ - Step 6340: {'lr': 0.0004989403593953135, 'samples': 1217280, 'steps': 6339, 'loss/train': 2.0424396991729736} 11/06/2021 22:08:26 - INFO - __main__ - Step 6341: {'lr': 0.0004989398712583762, 'samples': 1217472, 'steps': 6340, 'loss/train': 1.5621088743209839} 11/06/2021 22:08:26 - INFO - __main__ - Step 6342: {'lr': 0.0004989393830092705, 'samples': 1217664, 'steps': 6341, 'loss/train': 2.555983781814575} 11/06/2021 22:08:26 - INFO - __main__ - Step 6343: {'lr': 0.0004989388946479965, 'samples': 1217856, 'steps': 6342, 'loss/train': 1.820717692375183} 11/06/2021 22:08:27 - INFO - __main__ - Step 6344: {'lr': 0.0004989384061745545, 'samples': 1218048, 'steps': 6343, 'loss/train': 1.8627382516860962} 11/06/2021 22:08:28 - INFO - __main__ - Step 6345: {'lr': 0.0004989379175889447, 'samples': 1218240, 'steps': 6344, 'loss/train': 2.2447197437286377} 11/06/2021 22:08:28 - INFO - __main__ - Step 6346: {'lr': 0.0004989374288911672, 'samples': 1218432, 'steps': 6345, 'loss/train': 2.0814595222473145} 11/06/2021 22:08:28 - INFO - __main__ - Step 6347: {'lr': 0.0004989369400812225, 'samples': 1218624, 'steps': 6346, 'loss/train': 1.2773443460464478} 11/06/2021 22:08:29 - INFO - __main__ - Step 6348: {'lr': 0.0004989364511591106, 'samples': 1218816, 'steps': 6347, 'loss/train': 1.5852837562561035} 11/06/2021 22:08:29 - INFO - __main__ - Step 6349: {'lr': 0.0004989359621248317, 'samples': 1219008, 'steps': 6348, 'loss/train': 1.5508657693862915} 11/06/2021 22:08:30 - INFO - __main__ - Step 6350: {'lr': 0.0004989354729783861, 'samples': 1219200, 'steps': 6349, 'loss/train': 1.6555392742156982} 11/06/2021 22:08:31 - INFO - __main__ - Step 6351: {'lr': 0.0004989349837197742, 'samples': 1219392, 'steps': 6350, 'loss/train': 1.6861871480941772} 11/06/2021 22:08:31 - INFO - __main__ - Step 6352: {'lr': 0.0004989344943489958, 'samples': 1219584, 'steps': 6351, 'loss/train': 2.0413320064544678} 11/06/2021 22:08:31 - INFO - __main__ - Step 6353: {'lr': 0.0004989340048660515, 'samples': 1219776, 'steps': 6352, 'loss/train': 2.444296360015869} 11/06/2021 22:08:32 - INFO - __main__ - Step 6354: {'lr': 0.0004989335152709414, 'samples': 1219968, 'steps': 6353, 'loss/train': 2.159029245376587} 11/06/2021 22:08:33 - INFO - __main__ - Step 6355: {'lr': 0.0004989330255636656, 'samples': 1220160, 'steps': 6354, 'loss/train': 1.5128992795944214} 11/06/2021 22:08:33 - INFO - __main__ - Step 6356: {'lr': 0.0004989325357442245, 'samples': 1220352, 'steps': 6355, 'loss/train': 1.8759398460388184} 11/06/2021 22:08:33 - INFO - __main__ - Step 6357: {'lr': 0.0004989320458126182, 'samples': 1220544, 'steps': 6356, 'loss/train': 1.6384868621826172} 11/06/2021 22:08:34 - INFO - __main__ - Step 6358: {'lr': 0.0004989315557688469, 'samples': 1220736, 'steps': 6357, 'loss/train': 2.0724997520446777} 11/06/2021 22:08:34 - INFO - __main__ - Step 6359: {'lr': 0.000498931065612911, 'samples': 1220928, 'steps': 6358, 'loss/train': 1.3788177967071533} 11/06/2021 22:08:35 - INFO - __main__ - Step 6360: {'lr': 0.0004989305753448106, 'samples': 1221120, 'steps': 6359, 'loss/train': 1.6440093517303467} 11/06/2021 22:08:35 - INFO - __main__ - Step 6361: {'lr': 0.0004989300849645459, 'samples': 1221312, 'steps': 6360, 'loss/train': 0.8680780529975891} 11/06/2021 22:08:36 - INFO - __main__ - Step 6362: {'lr': 0.0004989295944721171, 'samples': 1221504, 'steps': 6361, 'loss/train': 1.834447979927063} 11/06/2021 22:08:36 - INFO - __main__ - Step 6363: {'lr': 0.0004989291038675245, 'samples': 1221696, 'steps': 6362, 'loss/train': 2.109246015548706} 11/06/2021 22:08:36 - INFO - __main__ - Step 6364: {'lr': 0.0004989286131507682, 'samples': 1221888, 'steps': 6363, 'loss/train': 2.3124148845672607} 11/06/2021 22:08:37 - INFO - __main__ - Step 6365: {'lr': 0.0004989281223218486, 'samples': 1222080, 'steps': 6364, 'loss/train': 2.2666964530944824} 11/06/2021 22:08:38 - INFO - __main__ - Step 6366: {'lr': 0.0004989276313807658, 'samples': 1222272, 'steps': 6365, 'loss/train': 2.1086597442626953} 11/06/2021 22:08:38 - INFO - __main__ - Step 6367: {'lr': 0.00049892714032752, 'samples': 1222464, 'steps': 6366, 'loss/train': 1.7961052656173706} 11/06/2021 22:08:39 - INFO - __main__ - Step 6368: {'lr': 0.0004989266491621117, 'samples': 1222656, 'steps': 6367, 'loss/train': 1.7916598320007324} 11/06/2021 22:08:39 - INFO - __main__ - Step 6369: {'lr': 0.0004989261578845406, 'samples': 1222848, 'steps': 6368, 'loss/train': 2.015110731124878} 11/06/2021 22:08:40 - INFO - __main__ - Step 6370: {'lr': 0.0004989256664948073, 'samples': 1223040, 'steps': 6369, 'loss/train': 1.7681583166122437} 11/06/2021 22:08:40 - INFO - __main__ - Step 6371: {'lr': 0.000498925174992912, 'samples': 1223232, 'steps': 6370, 'loss/train': 2.142261266708374} 11/06/2021 22:08:41 - INFO - __main__ - Step 6372: {'lr': 0.0004989246833788549, 'samples': 1223424, 'steps': 6371, 'loss/train': 1.6870710849761963} 11/06/2021 22:08:41 - INFO - __main__ - Step 6373: {'lr': 0.000498924191652636, 'samples': 1223616, 'steps': 6372, 'loss/train': 1.6174527406692505} 11/06/2021 22:08:41 - INFO - __main__ - Step 6374: {'lr': 0.0004989236998142559, 'samples': 1223808, 'steps': 6373, 'loss/train': 1.6731904745101929} 11/06/2021 22:08:42 - INFO - __main__ - Step 6375: {'lr': 0.0004989232078637145, 'samples': 1224000, 'steps': 6374, 'loss/train': 1.9477375745773315} 11/06/2021 22:08:44 - INFO - __main__ - Step 6376: {'lr': 0.0004989227158010123, 'samples': 1224192, 'steps': 6375, 'loss/train': 1.9696638584136963} 11/06/2021 22:08:44 - INFO - __main__ - Step 6377: {'lr': 0.0004989222236261491, 'samples': 1224384, 'steps': 6376, 'loss/train': 2.0389769077301025} 11/06/2021 22:08:45 - INFO - __main__ - Step 6378: {'lr': 0.0004989217313391256, 'samples': 1224576, 'steps': 6377, 'loss/train': 1.981373906135559} 11/06/2021 22:08:45 - INFO - __main__ - Step 6379: {'lr': 0.0004989212389399417, 'samples': 1224768, 'steps': 6378, 'loss/train': 1.8569397926330566} 11/06/2021 22:08:46 - INFO - __main__ - Step 6380: {'lr': 0.0004989207464285978, 'samples': 1224960, 'steps': 6379, 'loss/train': 1.8618382215499878} 11/06/2021 22:08:46 - INFO - __main__ - Step 6381: {'lr': 0.0004989202538050939, 'samples': 1225152, 'steps': 6380, 'loss/train': 1.969157099723816} 11/06/2021 22:08:46 - INFO - __main__ - Step 6382: {'lr': 0.0004989197610694306, 'samples': 1225344, 'steps': 6381, 'loss/train': 4.056380748748779} 11/06/2021 22:08:47 - INFO - __main__ - Step 6383: {'lr': 0.0004989192682216078, 'samples': 1225536, 'steps': 6382, 'loss/train': 2.3175463676452637} 11/06/2021 22:08:48 - INFO - __main__ - Step 6384: {'lr': 0.0004989187752616258, 'samples': 1225728, 'steps': 6383, 'loss/train': 2.1582441329956055} 11/06/2021 22:08:48 - INFO - __main__ - Step 6385: {'lr': 0.0004989182821894849, 'samples': 1225920, 'steps': 6384, 'loss/train': 1.9625952243804932} 11/06/2021 22:08:48 - INFO - __main__ - Step 6386: {'lr': 0.0004989177890051852, 'samples': 1226112, 'steps': 6385, 'loss/train': 2.3441145420074463} 11/06/2021 22:08:49 - INFO - __main__ - Step 6387: {'lr': 0.000498917295708727, 'samples': 1226304, 'steps': 6386, 'loss/train': 2.260986328125} 11/06/2021 22:08:49 - INFO - __main__ - Step 6388: {'lr': 0.0004989168023001105, 'samples': 1226496, 'steps': 6387, 'loss/train': 2.005218029022217} 11/06/2021 22:08:50 - INFO - __main__ - Step 6389: {'lr': 0.0004989163087793359, 'samples': 1226688, 'steps': 6388, 'loss/train': 1.657336950302124} 11/06/2021 22:08:50 - INFO - __main__ - Step 6390: {'lr': 0.0004989158151464036, 'samples': 1226880, 'steps': 6389, 'loss/train': 1.1649143695831299} 11/06/2021 22:08:51 - INFO - __main__ - Step 6391: {'lr': 0.0004989153214013135, 'samples': 1227072, 'steps': 6390, 'loss/train': 2.1878926753997803} 11/06/2021 22:08:51 - INFO - __main__ - Step 6392: {'lr': 0.0004989148275440661, 'samples': 1227264, 'steps': 6391, 'loss/train': 2.312974691390991} 11/06/2021 22:08:51 - INFO - __main__ - Step 6393: {'lr': 0.0004989143335746614, 'samples': 1227456, 'steps': 6392, 'loss/train': 2.0476460456848145} 11/06/2021 22:08:52 - INFO - __main__ - Step 6394: {'lr': 0.0004989138394930998, 'samples': 1227648, 'steps': 6393, 'loss/train': 2.358633518218994} 11/06/2021 22:08:53 - INFO - __main__ - Step 6395: {'lr': 0.0004989133452993816, 'samples': 1227840, 'steps': 6394, 'loss/train': 1.8311744928359985} 11/06/2021 22:08:53 - INFO - __main__ - Step 6396: {'lr': 0.0004989128509935068, 'samples': 1228032, 'steps': 6395, 'loss/train': 1.4626703262329102} 11/06/2021 22:08:53 - INFO - __main__ - Step 6397: {'lr': 0.0004989123565754756, 'samples': 1228224, 'steps': 6396, 'loss/train': 2.180896043777466} 11/06/2021 22:08:54 - INFO - __main__ - Step 6398: {'lr': 0.0004989118620452884, 'samples': 1228416, 'steps': 6397, 'loss/train': 1.6816266775131226} 11/06/2021 22:08:55 - INFO - __main__ - Step 6399: {'lr': 0.0004989113674029454, 'samples': 1228608, 'steps': 6398, 'loss/train': 2.068782091140747} 11/06/2021 22:08:55 - INFO - __main__ - Step 6400: {'lr': 0.0004989108726484469, 'samples': 1228800, 'steps': 6399, 'loss/train': 2.064892530441284} 11/06/2021 22:08:55 - INFO - __main__ - Step 6401: {'lr': 0.0004989103777817928, 'samples': 1228992, 'steps': 6400, 'loss/train': 1.7987390756607056} 11/06/2021 22:08:56 - INFO - __main__ - Step 6402: {'lr': 0.0004989098828029836, 'samples': 1229184, 'steps': 6401, 'loss/train': 2.1822431087493896} 11/06/2021 22:08:56 - INFO - __main__ - Step 6403: {'lr': 0.0004989093877120194, 'samples': 1229376, 'steps': 6402, 'loss/train': 1.9657015800476074} 11/06/2021 22:08:58 - INFO - __main__ - Step 6404: {'lr': 0.0004989088925089005, 'samples': 1229568, 'steps': 6403, 'loss/train': 1.4904416799545288} 11/06/2021 22:08:58 - INFO - __main__ - Step 6405: {'lr': 0.0004989083971936271, 'samples': 1229760, 'steps': 6404, 'loss/train': 2.7258121967315674} 11/06/2021 22:08:58 - INFO - __main__ - Step 6406: {'lr': 0.0004989079017661994, 'samples': 1229952, 'steps': 6405, 'loss/train': 1.961512804031372} 11/06/2021 22:08:59 - INFO - __main__ - Step 6407: {'lr': 0.0004989074062266177, 'samples': 1230144, 'steps': 6406, 'loss/train': 5.536162853240967} 11/06/2021 22:08:59 - INFO - __main__ - Step 6408: {'lr': 0.0004989069105748821, 'samples': 1230336, 'steps': 6407, 'loss/train': 1.5103638172149658} 11/06/2021 22:08:59 - INFO - __main__ - Step 6409: {'lr': 0.0004989064148109929, 'samples': 1230528, 'steps': 6408, 'loss/train': 1.6851413249969482} 11/06/2021 22:09:00 - INFO - __main__ - Step 6410: {'lr': 0.0004989059189349503, 'samples': 1230720, 'steps': 6409, 'loss/train': 1.7821619510650635} 11/06/2021 22:09:01 - INFO - __main__ - Step 6411: {'lr': 0.0004989054229467546, 'samples': 1230912, 'steps': 6410, 'loss/train': 1.9058226346969604} 11/06/2021 22:09:01 - INFO - __main__ - Step 6412: {'lr': 0.0004989049268464058, 'samples': 1231104, 'steps': 6411, 'loss/train': 1.1312766075134277} 11/06/2021 22:09:01 - INFO - __main__ - Step 6413: {'lr': 0.0004989044306339044, 'samples': 1231296, 'steps': 6412, 'loss/train': 2.2100672721862793} 11/06/2021 22:09:02 - INFO - __main__ - Step 6414: {'lr': 0.0004989039343092505, 'samples': 1231488, 'steps': 6413, 'loss/train': 1.6702120304107666} 11/06/2021 22:09:03 - INFO - __main__ - Step 6415: {'lr': 0.0004989034378724443, 'samples': 1231680, 'steps': 6414, 'loss/train': 2.1043949127197266} 11/06/2021 22:09:03 - INFO - __main__ - Step 6416: {'lr': 0.0004989029413234861, 'samples': 1231872, 'steps': 6415, 'loss/train': 1.9456948041915894} 11/06/2021 22:09:03 - INFO - __main__ - Step 6417: {'lr': 0.000498902444662376, 'samples': 1232064, 'steps': 6416, 'loss/train': 2.429739475250244} 11/06/2021 22:09:04 - INFO - __main__ - Step 6418: {'lr': 0.0004989019478891144, 'samples': 1232256, 'steps': 6417, 'loss/train': 2.0660645961761475} 11/06/2021 22:09:04 - INFO - __main__ - Step 6419: {'lr': 0.0004989014510037013, 'samples': 1232448, 'steps': 6418, 'loss/train': 1.0096818208694458} 11/06/2021 22:09:05 - INFO - __main__ - Step 6420: {'lr': 0.0004989009540061373, 'samples': 1232640, 'steps': 6419, 'loss/train': 1.7009047269821167} 11/06/2021 22:09:05 - INFO - __main__ - Step 6421: {'lr': 0.0004989004568964221, 'samples': 1232832, 'steps': 6420, 'loss/train': 1.8998950719833374} 11/06/2021 22:09:06 - INFO - __main__ - Step 6422: {'lr': 0.0004988999596745562, 'samples': 1233024, 'steps': 6421, 'loss/train': 2.2160139083862305} 11/06/2021 22:09:06 - INFO - __main__ - Step 6423: {'lr': 0.00049889946234054, 'samples': 1233216, 'steps': 6422, 'loss/train': 1.466779351234436} 11/06/2021 22:09:07 - INFO - __main__ - Step 6424: {'lr': 0.0004988989648943734, 'samples': 1233408, 'steps': 6423, 'loss/train': 1.9524688720703125} 11/06/2021 22:09:08 - INFO - __main__ - Step 6425: {'lr': 0.0004988984673360568, 'samples': 1233600, 'steps': 6424, 'loss/train': 1.4690804481506348} 11/06/2021 22:09:08 - INFO - __main__ - Step 6426: {'lr': 0.0004988979696655904, 'samples': 1233792, 'steps': 6425, 'loss/train': 1.6141606569290161} 11/06/2021 22:09:08 - INFO - __main__ - Step 6427: {'lr': 0.0004988974718829744, 'samples': 1233984, 'steps': 6426, 'loss/train': 1.9417755603790283} 11/06/2021 22:09:09 - INFO - __main__ - Step 6428: {'lr': 0.0004988969739882091, 'samples': 1234176, 'steps': 6427, 'loss/train': 1.8715691566467285} 11/06/2021 22:09:09 - INFO - __main__ - Step 6429: {'lr': 0.0004988964759812946, 'samples': 1234368, 'steps': 6428, 'loss/train': 1.668321132659912} 11/06/2021 22:09:11 - INFO - __main__ - Step 6430: {'lr': 0.0004988959778622313, 'samples': 1234560, 'steps': 6429, 'loss/train': 1.9378490447998047} 11/06/2021 22:09:11 - INFO - __main__ - Step 6431: {'lr': 0.0004988954796310191, 'samples': 1234752, 'steps': 6430, 'loss/train': 1.9765774011611938} 11/06/2021 22:09:12 - INFO - __main__ - Step 6432: {'lr': 0.0004988949812876586, 'samples': 1234944, 'steps': 6431, 'loss/train': 1.7189645767211914} 11/06/2021 22:09:12 - INFO - __main__ - Step 6433: {'lr': 0.0004988944828321499, 'samples': 1235136, 'steps': 6432, 'loss/train': 1.8814183473587036} 11/06/2021 22:09:12 - INFO - __main__ - Step 6434: {'lr': 0.0004988939842644931, 'samples': 1235328, 'steps': 6433, 'loss/train': 1.60706627368927} 11/06/2021 22:09:13 - INFO - __main__ - Step 6435: {'lr': 0.0004988934855846885, 'samples': 1235520, 'steps': 6434, 'loss/train': 1.5853242874145508} 11/06/2021 22:09:13 - INFO - __main__ - Step 6436: {'lr': 0.0004988929867927363, 'samples': 1235712, 'steps': 6435, 'loss/train': 1.9436169862747192} 11/06/2021 22:09:13 - INFO - __main__ - Step 6437: {'lr': 0.0004988924878886368, 'samples': 1235904, 'steps': 6436, 'loss/train': 1.9640707969665527} 11/06/2021 22:09:15 - INFO - __main__ - Step 6438: {'lr': 0.0004988919888723902, 'samples': 1236096, 'steps': 6437, 'loss/train': 1.9225521087646484} 11/06/2021 22:09:15 - INFO - __main__ - Step 6439: {'lr': 0.0004988914897439968, 'samples': 1236288, 'steps': 6438, 'loss/train': 1.5273348093032837} 11/06/2021 22:09:15 - INFO - __main__ - Step 6440: {'lr': 0.0004988909905034566, 'samples': 1236480, 'steps': 6439, 'loss/train': 1.8240584135055542} 11/06/2021 22:09:16 - INFO - __main__ - Step 6441: {'lr': 0.00049889049115077, 'samples': 1236672, 'steps': 6440, 'loss/train': 1.741806983947754} 11/06/2021 22:09:16 - INFO - __main__ - Step 6442: {'lr': 0.0004988899916859372, 'samples': 1236864, 'steps': 6441, 'loss/train': 2.750673294067383} 11/06/2021 22:09:17 - INFO - __main__ - Step 6443: {'lr': 0.0004988894921089584, 'samples': 1237056, 'steps': 6442, 'loss/train': 1.7780988216400146} 11/06/2021 22:09:17 - INFO - __main__ - Step 6444: {'lr': 0.0004988889924198339, 'samples': 1237248, 'steps': 6443, 'loss/train': 1.9567726850509644} 11/06/2021 22:09:18 - INFO - __main__ - Step 6445: {'lr': 0.0004988884926185637, 'samples': 1237440, 'steps': 6444, 'loss/train': 1.8130319118499756} 11/06/2021 22:09:18 - INFO - __main__ - Step 6446: {'lr': 0.0004988879927051484, 'samples': 1237632, 'steps': 6445, 'loss/train': 1.573569655418396} 11/06/2021 22:09:18 - INFO - __main__ - Step 6447: {'lr': 0.0004988874926795878, 'samples': 1237824, 'steps': 6446, 'loss/train': 1.8743444681167603} 11/06/2021 22:09:19 - INFO - __main__ - Step 6448: {'lr': 0.0004988869925418825, 'samples': 1238016, 'steps': 6447, 'loss/train': 1.5253403186798096} 11/06/2021 22:09:20 - INFO - __main__ - Step 6449: {'lr': 0.0004988864922920325, 'samples': 1238208, 'steps': 6448, 'loss/train': 1.7076308727264404} 11/06/2021 22:09:20 - INFO - __main__ - Step 6450: {'lr': 0.000498885991930038, 'samples': 1238400, 'steps': 6449, 'loss/train': 2.227938413619995} 11/06/2021 22:09:20 - INFO - __main__ - Step 6451: {'lr': 0.0004988854914558994, 'samples': 1238592, 'steps': 6450, 'loss/train': 1.7760518789291382} 11/06/2021 22:09:21 - INFO - __main__ - Step 6452: {'lr': 0.0004988849908696169, 'samples': 1238784, 'steps': 6451, 'loss/train': 2.213900566101074} 11/06/2021 22:09:21 - INFO - __main__ - Step 6453: {'lr': 0.0004988844901711905, 'samples': 1238976, 'steps': 6452, 'loss/train': 2.0823049545288086} 11/06/2021 22:09:22 - INFO - __main__ - Step 6454: {'lr': 0.0004988839893606208, 'samples': 1239168, 'steps': 6453, 'loss/train': 1.9034770727157593} 11/06/2021 22:09:23 - INFO - __main__ - Step 6455: {'lr': 0.0004988834884379076, 'samples': 1239360, 'steps': 6454, 'loss/train': 1.2255651950836182} 11/06/2021 22:09:23 - INFO - __main__ - Step 6456: {'lr': 0.0004988829874030514, 'samples': 1239552, 'steps': 6455, 'loss/train': 2.5880677700042725} 11/06/2021 22:09:23 - INFO - __main__ - Step 6457: {'lr': 0.0004988824862560525, 'samples': 1239744, 'steps': 6456, 'loss/train': 4.904447078704834} 11/06/2021 22:09:24 - INFO - __main__ - Step 6458: {'lr': 0.0004988819849969109, 'samples': 1239936, 'steps': 6457, 'loss/train': 1.9330166578292847} 11/06/2021 22:09:25 - INFO - __main__ - Step 6459: {'lr': 0.0004988814836256269, 'samples': 1240128, 'steps': 6458, 'loss/train': 1.4734045267105103} 11/06/2021 22:09:25 - INFO - __main__ - Step 6460: {'lr': 0.0004988809821422008, 'samples': 1240320, 'steps': 6459, 'loss/train': 2.5002269744873047} 11/06/2021 22:09:25 - INFO - __main__ - Step 6461: {'lr': 0.0004988804805466327, 'samples': 1240512, 'steps': 6460, 'loss/train': 1.7648297548294067} 11/06/2021 22:09:26 - INFO - __main__ - Step 6462: {'lr': 0.000498879978838923, 'samples': 1240704, 'steps': 6461, 'loss/train': 2.2282261848449707} 11/06/2021 22:09:26 - INFO - __main__ - Step 6463: {'lr': 0.0004988794770190717, 'samples': 1240896, 'steps': 6462, 'loss/train': 2.106597661972046} 11/06/2021 22:09:27 - INFO - __main__ - Step 6464: {'lr': 0.0004988789750870792, 'samples': 1241088, 'steps': 6463, 'loss/train': 2.194019317626953} 11/06/2021 22:09:28 - INFO - __main__ - Step 6465: {'lr': 0.0004988784730429457, 'samples': 1241280, 'steps': 6464, 'loss/train': 2.4788918495178223} 11/06/2021 22:09:28 - INFO - __main__ - Step 6466: {'lr': 0.0004988779708866714, 'samples': 1241472, 'steps': 6465, 'loss/train': 2.1936397552490234} 11/06/2021 22:09:28 - INFO - __main__ - Step 6467: {'lr': 0.0004988774686182564, 'samples': 1241664, 'steps': 6466, 'loss/train': 1.6836004257202148} 11/06/2021 22:09:29 - INFO - __main__ - Step 6468: {'lr': 0.0004988769662377013, 'samples': 1241856, 'steps': 6467, 'loss/train': 2.2871172428131104} 11/06/2021 22:09:29 - INFO - __main__ - Step 6469: {'lr': 0.0004988764637450058, 'samples': 1242048, 'steps': 6468, 'loss/train': 2.1349170207977295} 11/06/2021 22:09:30 - INFO - __main__ - Step 6470: {'lr': 0.0004988759611401706, 'samples': 1242240, 'steps': 6469, 'loss/train': 2.350022315979004} 11/06/2021 22:09:30 - INFO - __main__ - Step 6471: {'lr': 0.0004988754584231957, 'samples': 1242432, 'steps': 6470, 'loss/train': 1.986899971961975} 11/06/2021 22:09:31 - INFO - __main__ - Step 6472: {'lr': 0.0004988749555940814, 'samples': 1242624, 'steps': 6471, 'loss/train': 1.6348545551300049} 11/06/2021 22:09:31 - INFO - __main__ - Step 6473: {'lr': 0.0004988744526528277, 'samples': 1242816, 'steps': 6472, 'loss/train': 2.23770809173584} 11/06/2021 22:09:31 - INFO - __main__ - Step 6474: {'lr': 0.0004988739495994352, 'samples': 1243008, 'steps': 6473, 'loss/train': 1.4470930099487305} 11/06/2021 22:09:32 - INFO - __main__ - Step 6475: {'lr': 0.0004988734464339038, 'samples': 1243200, 'steps': 6474, 'loss/train': 1.8867287635803223} 11/06/2021 22:09:33 - INFO - __main__ - Step 6476: {'lr': 0.0004988729431562339, 'samples': 1243392, 'steps': 6475, 'loss/train': 2.184319496154785} 11/06/2021 22:09:33 - INFO - __main__ - Step 6477: {'lr': 0.0004988724397664258, 'samples': 1243584, 'steps': 6476, 'loss/train': 2.0487940311431885} 11/06/2021 22:09:33 - INFO - __main__ - Step 6478: {'lr': 0.0004988719362644795, 'samples': 1243776, 'steps': 6477, 'loss/train': 2.7399888038635254} 11/06/2021 22:09:34 - INFO - __main__ - Step 6479: {'lr': 0.0004988714326503953, 'samples': 1243968, 'steps': 6478, 'loss/train': 1.4962778091430664} 11/06/2021 22:09:35 - INFO - __main__ - Step 6480: {'lr': 0.0004988709289241736, 'samples': 1244160, 'steps': 6479, 'loss/train': 2.1878020763397217} 11/06/2021 22:09:35 - INFO - __main__ - Step 6481: {'lr': 0.0004988704250858145, 'samples': 1244352, 'steps': 6480, 'loss/train': 1.851172685623169} 11/06/2021 22:09:36 - INFO - __main__ - Step 6482: {'lr': 0.0004988699211353182, 'samples': 1244544, 'steps': 6481, 'loss/train': 1.0221189260482788} 11/06/2021 22:09:36 - INFO - __main__ - Step 6483: {'lr': 0.000498869417072685, 'samples': 1244736, 'steps': 6482, 'loss/train': 1.667419672012329} 11/06/2021 22:09:36 - INFO - __main__ - Step 6484: {'lr': 0.000498868912897915, 'samples': 1244928, 'steps': 6483, 'loss/train': 1.5354762077331543} 11/06/2021 22:09:37 - INFO - __main__ - Step 6485: {'lr': 0.0004988684086110085, 'samples': 1245120, 'steps': 6484, 'loss/train': 1.7448598146438599} 11/06/2021 22:09:38 - INFO - __main__ - Step 6486: {'lr': 0.0004988679042119658, 'samples': 1245312, 'steps': 6485, 'loss/train': 1.7529888153076172} 11/06/2021 22:09:38 - INFO - __main__ - Step 6487: {'lr': 0.000498867399700787, 'samples': 1245504, 'steps': 6486, 'loss/train': 2.0920798778533936} 11/06/2021 22:09:38 - INFO - __main__ - Step 6488: {'lr': 0.0004988668950774724, 'samples': 1245696, 'steps': 6487, 'loss/train': 2.620828151702881} 11/06/2021 22:09:39 - INFO - __main__ - Step 6489: {'lr': 0.0004988663903420222, 'samples': 1245888, 'steps': 6488, 'loss/train': 2.216024398803711} 11/06/2021 22:09:39 - INFO - __main__ - Step 6490: {'lr': 0.0004988658854944367, 'samples': 1246080, 'steps': 6489, 'loss/train': 2.130100727081299} 11/06/2021 22:09:40 - INFO - __main__ - Step 6491: {'lr': 0.0004988653805347161, 'samples': 1246272, 'steps': 6490, 'loss/train': 1.8137882947921753} 11/06/2021 22:09:40 - INFO - __main__ - Step 6492: {'lr': 0.0004988648754628605, 'samples': 1246464, 'steps': 6491, 'loss/train': 1.7981539964675903} 11/06/2021 22:09:41 - INFO - __main__ - Step 6493: {'lr': 0.0004988643702788703, 'samples': 1246656, 'steps': 6492, 'loss/train': 2.0300815105438232} 11/06/2021 22:09:41 - INFO - __main__ - Step 6494: {'lr': 0.0004988638649827456, 'samples': 1246848, 'steps': 6493, 'loss/train': 2.65700101852417} 11/06/2021 22:09:41 - INFO - __main__ - Step 6495: {'lr': 0.0004988633595744867, 'samples': 1247040, 'steps': 6494, 'loss/train': 2.369478940963745} 11/06/2021 22:09:43 - INFO - __main__ - Step 6496: {'lr': 0.0004988628540540939, 'samples': 1247232, 'steps': 6495, 'loss/train': 1.5928741693496704} 11/06/2021 22:09:43 - INFO - __main__ - Step 6497: {'lr': 0.0004988623484215673, 'samples': 1247424, 'steps': 6496, 'loss/train': 0.6462783217430115} 11/06/2021 22:09:43 - INFO - __main__ - Step 6498: {'lr': 0.0004988618426769071, 'samples': 1247616, 'steps': 6497, 'loss/train': 1.936950922012329} 11/06/2021 22:09:44 - INFO - __main__ - Step 6499: {'lr': 0.0004988613368201135, 'samples': 1247808, 'steps': 6498, 'loss/train': 2.1868393421173096} 11/06/2021 22:09:44 - INFO - __main__ - Step 6500: {'lr': 0.0004988608308511871, 'samples': 1248000, 'steps': 6499, 'loss/train': 1.6641846895217896} 11/06/2021 22:09:45 - INFO - __main__ - Step 6501: {'lr': 0.0004988603247701276, 'samples': 1248192, 'steps': 6500, 'loss/train': 1.803402066230774} 11/06/2021 22:09:45 - INFO - __main__ - Step 6502: {'lr': 0.0004988598185769357, 'samples': 1248384, 'steps': 6501, 'loss/train': 2.2652242183685303} 11/06/2021 22:09:46 - INFO - __main__ - Step 6503: {'lr': 0.0004988593122716112, 'samples': 1248576, 'steps': 6502, 'loss/train': 1.5960944890975952} 11/06/2021 22:09:46 - INFO - __main__ - Step 6504: {'lr': 0.0004988588058541547, 'samples': 1248768, 'steps': 6503, 'loss/train': 1.9997038841247559} 11/06/2021 22:09:46 - INFO - __main__ - Step 6505: {'lr': 0.0004988582993245661, 'samples': 1248960, 'steps': 6504, 'loss/train': 2.119647264480591} 11/06/2021 22:09:47 - INFO - __main__ - Step 6506: {'lr': 0.0004988577926828459, 'samples': 1249152, 'steps': 6505, 'loss/train': 1.9104197025299072} 11/06/2021 22:09:48 - INFO - __main__ - Step 6507: {'lr': 0.0004988572859289941, 'samples': 1249344, 'steps': 6506, 'loss/train': 0.5368994474411011} 11/06/2021 22:09:48 - INFO - __main__ - Step 6508: {'lr': 0.0004988567790630111, 'samples': 1249536, 'steps': 6507, 'loss/train': 1.2611361742019653} 11/06/2021 22:09:48 - INFO - __main__ - Step 6509: {'lr': 0.0004988562720848973, 'samples': 1249728, 'steps': 6508, 'loss/train': 2.310293674468994} 11/06/2021 22:09:49 - INFO - __main__ - Step 6510: {'lr': 0.0004988557649946525, 'samples': 1249920, 'steps': 6509, 'loss/train': 1.3652870655059814} 11/06/2021 22:09:50 - INFO - __main__ - Step 6511: {'lr': 0.000498855257792277, 'samples': 1250112, 'steps': 6510, 'loss/train': 2.2352139949798584} 11/06/2021 22:09:50 - INFO - __main__ - Step 6512: {'lr': 0.0004988547504777714, 'samples': 1250304, 'steps': 6511, 'loss/train': 1.4359506368637085} 11/06/2021 22:09:50 - INFO - __main__ - Step 6513: {'lr': 0.0004988542430511356, 'samples': 1250496, 'steps': 6512, 'loss/train': 2.3191769123077393} 11/06/2021 22:09:51 - INFO - __main__ - Step 6514: {'lr': 0.0004988537355123699, 'samples': 1250688, 'steps': 6513, 'loss/train': 2.3833720684051514} 11/06/2021 22:09:51 - INFO - __main__ - Step 6515: {'lr': 0.0004988532278614745, 'samples': 1250880, 'steps': 6514, 'loss/train': 1.4373453855514526} 11/06/2021 22:09:52 - INFO - __main__ - Step 6516: {'lr': 0.0004988527200984498, 'samples': 1251072, 'steps': 6515, 'loss/train': 1.5178534984588623} 11/06/2021 22:09:52 - INFO - __main__ - Step 6517: {'lr': 0.0004988522122232958, 'samples': 1251264, 'steps': 6516, 'loss/train': 1.9620566368103027} 11/06/2021 22:09:53 - INFO - __main__ - Step 6518: {'lr': 0.0004988517042360128, 'samples': 1251456, 'steps': 6517, 'loss/train': 1.9547860622406006} 11/06/2021 22:09:53 - INFO - __main__ - Step 6519: {'lr': 0.0004988511961366012, 'samples': 1251648, 'steps': 6518, 'loss/train': 5.9194560050964355} 11/06/2021 22:09:54 - INFO - __main__ - Step 6520: {'lr': 0.000498850687925061, 'samples': 1251840, 'steps': 6519, 'loss/train': 2.1525418758392334} 11/06/2021 22:09:54 - INFO - __main__ - Step 6521: {'lr': 0.0004988501796013926, 'samples': 1252032, 'steps': 6520, 'loss/train': 1.9909714460372925} 11/06/2021 22:09:55 - INFO - __main__ - Step 6522: {'lr': 0.0004988496711655961, 'samples': 1252224, 'steps': 6521, 'loss/train': 1.5848283767700195} 11/06/2021 22:09:55 - INFO - __main__ - Step 6523: {'lr': 0.0004988491626176718, 'samples': 1252416, 'steps': 6522, 'loss/train': 1.9467501640319824} 11/06/2021 22:09:56 - INFO - __main__ - Step 6524: {'lr': 0.0004988486539576198, 'samples': 1252608, 'steps': 6523, 'loss/train': 2.0624489784240723} 11/06/2021 22:09:56 - INFO - __main__ - Step 6525: {'lr': 0.0004988481451854406, 'samples': 1252800, 'steps': 6524, 'loss/train': 1.9637494087219238} 11/06/2021 22:09:56 - INFO - __main__ - Step 6526: {'lr': 0.0004988476363011341, 'samples': 1252992, 'steps': 6525, 'loss/train': 2.017069101333618} 11/06/2021 22:09:57 - INFO - __main__ - Step 6527: {'lr': 0.0004988471273047008, 'samples': 1253184, 'steps': 6526, 'loss/train': 1.8356379270553589} 11/06/2021 22:09:58 - INFO - __main__ - Step 6528: {'lr': 0.0004988466181961408, 'samples': 1253376, 'steps': 6527, 'loss/train': 1.6690013408660889} 11/06/2021 22:09:58 - INFO - __main__ - Step 6529: {'lr': 0.0004988461089754544, 'samples': 1253568, 'steps': 6528, 'loss/train': 1.86026930809021} 11/06/2021 22:09:58 - INFO - __main__ - Step 6530: {'lr': 0.0004988455996426418, 'samples': 1253760, 'steps': 6529, 'loss/train': 1.9897722005844116} 11/06/2021 22:09:59 - INFO - __main__ - Step 6531: {'lr': 0.0004988450901977031, 'samples': 1253952, 'steps': 6530, 'loss/train': 2.079590082168579} 11/06/2021 22:10:00 - INFO - __main__ - Step 6532: {'lr': 0.0004988445806406387, 'samples': 1254144, 'steps': 6531, 'loss/train': 3.077930212020874} 11/06/2021 22:10:00 - INFO - __main__ - Step 6533: {'lr': 0.0004988440709714487, 'samples': 1254336, 'steps': 6532, 'loss/train': 2.232908010482788} 11/06/2021 22:10:01 - INFO - __main__ - Step 6534: {'lr': 0.0004988435611901335, 'samples': 1254528, 'steps': 6533, 'loss/train': 1.569143533706665} 11/06/2021 22:10:01 - INFO - __main__ - Step 6535: {'lr': 0.0004988430512966932, 'samples': 1254720, 'steps': 6534, 'loss/train': 2.3683085441589355} 11/06/2021 22:10:01 - INFO - __main__ - Step 6536: {'lr': 0.000498842541291128, 'samples': 1254912, 'steps': 6535, 'loss/train': 1.7087137699127197} 11/06/2021 22:10:02 - INFO - __main__ - Step 6537: {'lr': 0.0004988420311734383, 'samples': 1255104, 'steps': 6536, 'loss/train': 2.1297061443328857} 11/06/2021 22:10:03 - INFO - __main__ - Step 6538: {'lr': 0.0004988415209436243, 'samples': 1255296, 'steps': 6537, 'loss/train': 1.857115387916565} 11/06/2021 22:10:03 - INFO - __main__ - Step 6539: {'lr': 0.000498841010601686, 'samples': 1255488, 'steps': 6538, 'loss/train': 2.037982225418091} 11/06/2021 22:10:03 - INFO - __main__ - Step 6540: {'lr': 0.0004988405001476237, 'samples': 1255680, 'steps': 6539, 'loss/train': 2.1346702575683594} 11/06/2021 22:10:04 - INFO - __main__ - Step 6541: {'lr': 0.0004988399895814378, 'samples': 1255872, 'steps': 6540, 'loss/train': 1.6024067401885986} 11/06/2021 22:10:05 - INFO - __main__ - Step 6542: {'lr': 0.0004988394789031286, 'samples': 1256064, 'steps': 6541, 'loss/train': 2.4668922424316406} 11/06/2021 22:10:05 - INFO - __main__ - Step 6543: {'lr': 0.000498838968112696, 'samples': 1256256, 'steps': 6542, 'loss/train': 1.6090011596679688} 11/06/2021 22:10:05 - INFO - __main__ - Step 6544: {'lr': 0.0004988384572101403, 'samples': 1256448, 'steps': 6543, 'loss/train': 1.8808883428573608} 11/06/2021 22:10:06 - INFO - __main__ - Step 6545: {'lr': 0.000498837946195462, 'samples': 1256640, 'steps': 6544, 'loss/train': 1.728848934173584} 11/06/2021 22:10:06 - INFO - __main__ - Step 6546: {'lr': 0.0004988374350686611, 'samples': 1256832, 'steps': 6545, 'loss/train': 1.9686365127563477} 11/06/2021 22:10:07 - INFO - __main__ - Step 6547: {'lr': 0.000498836923829738, 'samples': 1257024, 'steps': 6546, 'loss/train': 1.8499354124069214} 11/06/2021 22:10:07 - INFO - __main__ - Step 6548: {'lr': 0.0004988364124786927, 'samples': 1257216, 'steps': 6547, 'loss/train': 1.4615224599838257} 11/06/2021 22:10:08 - INFO - __main__ - Step 6549: {'lr': 0.0004988359010155255, 'samples': 1257408, 'steps': 6548, 'loss/train': 1.7403419017791748} 11/06/2021 22:10:08 - INFO - __main__ - Step 6550: {'lr': 0.0004988353894402368, 'samples': 1257600, 'steps': 6549, 'loss/train': 1.965841293334961} 11/06/2021 22:10:09 - INFO - __main__ - Step 6551: {'lr': 0.0004988348777528267, 'samples': 1257792, 'steps': 6550, 'loss/train': 2.171156883239746} 11/06/2021 22:10:09 - INFO - __main__ - Step 6552: {'lr': 0.0004988343659532954, 'samples': 1257984, 'steps': 6551, 'loss/train': 2.217395782470703} 11/06/2021 22:10:10 - INFO - __main__ - Step 6553: {'lr': 0.0004988338540416432, 'samples': 1258176, 'steps': 6552, 'loss/train': 2.0271527767181396} 11/06/2021 22:10:10 - INFO - __main__ - Step 6554: {'lr': 0.0004988333420178704, 'samples': 1258368, 'steps': 6553, 'loss/train': 1.7768346071243286} 11/06/2021 22:10:11 - INFO - __main__ - Step 6555: {'lr': 0.000498832829881977, 'samples': 1258560, 'steps': 6554, 'loss/train': 2.1017751693725586} 11/06/2021 22:10:11 - INFO - __main__ - Step 6556: {'lr': 0.0004988323176339633, 'samples': 1258752, 'steps': 6555, 'loss/train': 1.467564582824707} 11/06/2021 22:10:11 - INFO - __main__ - Step 6557: {'lr': 0.0004988318052738298, 'samples': 1258944, 'steps': 6556, 'loss/train': 1.9192558526992798} 11/06/2021 22:10:12 - INFO - __main__ - Step 6558: {'lr': 0.0004988312928015763, 'samples': 1259136, 'steps': 6557, 'loss/train': 1.927575945854187} 11/06/2021 22:10:13 - INFO - __main__ - Step 6559: {'lr': 0.0004988307802172035, 'samples': 1259328, 'steps': 6558, 'loss/train': 2.1528429985046387} 11/06/2021 22:10:13 - INFO - __main__ - Step 6560: {'lr': 0.0004988302675207112, 'samples': 1259520, 'steps': 6559, 'loss/train': 2.317401170730591} 11/06/2021 22:10:13 - INFO - __main__ - Step 6561: {'lr': 0.0004988297547121, 'samples': 1259712, 'steps': 6560, 'loss/train': 1.843315839767456} 11/06/2021 22:10:14 - INFO - __main__ - Step 6562: {'lr': 0.0004988292417913698, 'samples': 1259904, 'steps': 6561, 'loss/train': 2.109565019607544} 11/06/2021 22:10:15 - INFO - __main__ - Step 6563: {'lr': 0.0004988287287585211, 'samples': 1260096, 'steps': 6562, 'loss/train': 0.9284586906433105} 11/06/2021 22:10:15 - INFO - __main__ - Step 6564: {'lr': 0.0004988282156135539, 'samples': 1260288, 'steps': 6563, 'loss/train': 1.7303180694580078} 11/06/2021 22:10:15 - INFO - __main__ - Step 6565: {'lr': 0.0004988277023564685, 'samples': 1260480, 'steps': 6564, 'loss/train': 2.28778338432312} 11/06/2021 22:10:16 - INFO - __main__ - Step 6566: {'lr': 0.0004988271889872654, 'samples': 1260672, 'steps': 6565, 'loss/train': 2.138411045074463} 11/06/2021 22:10:16 - INFO - __main__ - Step 6567: {'lr': 0.0004988266755059444, 'samples': 1260864, 'steps': 6566, 'loss/train': 2.42995023727417} 11/06/2021 22:10:17 - INFO - __main__ - Step 6568: {'lr': 0.000498826161912506, 'samples': 1261056, 'steps': 6567, 'loss/train': 1.8563683032989502} 11/06/2021 22:10:17 - INFO - __main__ - Step 6569: {'lr': 0.0004988256482069505, 'samples': 1261248, 'steps': 6568, 'loss/train': 1.7521291971206665} 11/06/2021 22:10:18 - INFO - __main__ - Step 6570: {'lr': 0.0004988251343892779, 'samples': 1261440, 'steps': 6569, 'loss/train': 1.6020236015319824} 11/06/2021 22:10:18 - INFO - __main__ - Step 6571: {'lr': 0.0004988246204594885, 'samples': 1261632, 'steps': 6570, 'loss/train': 1.995780348777771} 11/06/2021 22:10:18 - INFO - __main__ - Step 6572: {'lr': 0.0004988241064175826, 'samples': 1261824, 'steps': 6571, 'loss/train': 2.2202649116516113} 11/06/2021 22:10:19 - INFO - __main__ - Step 6573: {'lr': 0.0004988235922635604, 'samples': 1262016, 'steps': 6572, 'loss/train': 2.3514225482940674} 11/06/2021 22:10:20 - INFO - __main__ - Step 6574: {'lr': 0.0004988230779974221, 'samples': 1262208, 'steps': 6573, 'loss/train': 1.9057080745697021} 11/06/2021 22:10:20 - INFO - __main__ - Step 6575: {'lr': 0.000498822563619168, 'samples': 1262400, 'steps': 6574, 'loss/train': 1.9137215614318848} 11/06/2021 22:10:21 - INFO - __main__ - Step 6576: {'lr': 0.0004988220491287983, 'samples': 1262592, 'steps': 6575, 'loss/train': 1.931260108947754} 11/06/2021 22:10:21 - INFO - __main__ - Step 6577: {'lr': 0.0004988215345263132, 'samples': 1262784, 'steps': 6576, 'loss/train': 1.774949550628662} 11/06/2021 22:10:21 - INFO - __main__ - Step 6578: {'lr': 0.0004988210198117129, 'samples': 1262976, 'steps': 6577, 'loss/train': 1.8798142671585083} 11/06/2021 22:10:22 - INFO - __main__ - Step 6579: {'lr': 0.0004988205049849978, 'samples': 1263168, 'steps': 6578, 'loss/train': 2.2597692012786865} 11/06/2021 22:10:23 - INFO - __main__ - Step 6580: {'lr': 0.0004988199900461679, 'samples': 1263360, 'steps': 6579, 'loss/train': 2.158470869064331} 11/06/2021 22:10:23 - INFO - __main__ - Step 6581: {'lr': 0.0004988194749952237, 'samples': 1263552, 'steps': 6580, 'loss/train': 1.9333513975143433} 11/06/2021 22:10:23 - INFO - __main__ - Step 6582: {'lr': 0.0004988189598321652, 'samples': 1263744, 'steps': 6581, 'loss/train': 2.1383774280548096} 11/06/2021 22:10:24 - INFO - __main__ - Step 6583: {'lr': 0.0004988184445569926, 'samples': 1263936, 'steps': 6582, 'loss/train': 2.0251195430755615} 11/06/2021 22:10:25 - INFO - __main__ - Step 6584: {'lr': 0.0004988179291697064, 'samples': 1264128, 'steps': 6583, 'loss/train': 1.354901671409607} 11/06/2021 22:10:25 - INFO - __main__ - Step 6585: {'lr': 0.0004988174136703066, 'samples': 1264320, 'steps': 6584, 'loss/train': 1.922788381576538} 11/06/2021 22:10:25 - INFO - __main__ - Step 6586: {'lr': 0.0004988168980587936, 'samples': 1264512, 'steps': 6585, 'loss/train': 1.8145787715911865} 11/06/2021 22:10:26 - INFO - __main__ - Step 6587: {'lr': 0.0004988163823351676, 'samples': 1264704, 'steps': 6586, 'loss/train': 1.9621999263763428} 11/06/2021 22:10:26 - INFO - __main__ - Step 6588: {'lr': 0.0004988158664994286, 'samples': 1264896, 'steps': 6587, 'loss/train': 1.8063040971755981} 11/06/2021 22:10:27 - INFO - __main__ - Step 6589: {'lr': 0.0004988153505515771, 'samples': 1265088, 'steps': 6588, 'loss/train': 2.8506147861480713} 11/06/2021 22:10:28 - INFO - __main__ - Step 6590: {'lr': 0.0004988148344916133, 'samples': 1265280, 'steps': 6589, 'loss/train': 1.649754524230957} 11/06/2021 22:10:28 - INFO - __main__ - Step 6591: {'lr': 0.0004988143183195373, 'samples': 1265472, 'steps': 6590, 'loss/train': 1.709517478942871} 11/06/2021 22:10:28 - INFO - __main__ - Step 6592: {'lr': 0.0004988138020353493, 'samples': 1265664, 'steps': 6591, 'loss/train': 1.7125308513641357} 11/06/2021 22:10:29 - INFO - __main__ - Step 6593: {'lr': 0.0004988132856390498, 'samples': 1265856, 'steps': 6592, 'loss/train': 1.3303931951522827} 11/06/2021 22:10:30 - INFO - __main__ - Step 6594: {'lr': 0.0004988127691306388, 'samples': 1266048, 'steps': 6593, 'loss/train': 2.029025077819824} 11/06/2021 22:10:30 - INFO - __main__ - Step 6595: {'lr': 0.0004988122525101166, 'samples': 1266240, 'steps': 6594, 'loss/train': 1.7039332389831543} 11/06/2021 22:10:30 - INFO - __main__ - Step 6596: {'lr': 0.0004988117357774835, 'samples': 1266432, 'steps': 6595, 'loss/train': 2.2546744346618652} 11/06/2021 22:10:31 - INFO - __main__ - Step 6597: {'lr': 0.0004988112189327397, 'samples': 1266624, 'steps': 6596, 'loss/train': 2.0449981689453125} 11/06/2021 22:10:31 - INFO - __main__ - Step 6598: {'lr': 0.0004988107019758853, 'samples': 1266816, 'steps': 6597, 'loss/train': 1.963868260383606} 11/06/2021 22:10:31 - INFO - __main__ - Step 6599: {'lr': 0.0004988101849069208, 'samples': 1267008, 'steps': 6598, 'loss/train': 1.7069015502929688} 11/06/2021 22:10:32 - INFO - __main__ - Step 6600: {'lr': 0.0004988096677258461, 'samples': 1267200, 'steps': 6599, 'loss/train': 2.1868088245391846} 11/06/2021 22:10:33 - INFO - __main__ - Step 6601: {'lr': 0.0004988091504326616, 'samples': 1267392, 'steps': 6600, 'loss/train': 1.8266377449035645} 11/06/2021 22:10:33 - INFO - __main__ - Step 6602: {'lr': 0.0004988086330273676, 'samples': 1267584, 'steps': 6601, 'loss/train': 1.2428114414215088} 11/06/2021 22:10:34 - INFO - __main__ - Step 6603: {'lr': 0.0004988081155099643, 'samples': 1267776, 'steps': 6602, 'loss/train': 1.4701017141342163} 11/06/2021 22:10:34 - INFO - __main__ - Step 6604: {'lr': 0.0004988075978804518, 'samples': 1267968, 'steps': 6603, 'loss/train': 1.7105156183242798} 11/06/2021 22:10:35 - INFO - __main__ - Step 6605: {'lr': 0.0004988070801388306, 'samples': 1268160, 'steps': 6604, 'loss/train': 0.2404092252254486} 11/06/2021 22:10:35 - INFO - __main__ - Step 6606: {'lr': 0.0004988065622851006, 'samples': 1268352, 'steps': 6605, 'loss/train': 1.55886709690094} 11/06/2021 22:10:36 - INFO - __main__ - Step 6607: {'lr': 0.0004988060443192623, 'samples': 1268544, 'steps': 6606, 'loss/train': 1.470017671585083} 11/06/2021 22:10:36 - INFO - __main__ - Step 6608: {'lr': 0.0004988055262413158, 'samples': 1268736, 'steps': 6607, 'loss/train': 1.6374818086624146} 11/06/2021 22:10:36 - INFO - __main__ - Step 6609: {'lr': 0.0004988050080512614, 'samples': 1268928, 'steps': 6608, 'loss/train': 1.785929799079895} 11/06/2021 22:10:37 - INFO - __main__ - Step 6610: {'lr': 0.0004988044897490993, 'samples': 1269120, 'steps': 6609, 'loss/train': 1.8489357233047485} 11/06/2021 22:10:38 - INFO - __main__ - Step 6611: {'lr': 0.0004988039713348297, 'samples': 1269312, 'steps': 6610, 'loss/train': 2.0114858150482178} 11/06/2021 22:10:38 - INFO - __main__ - Step 6612: {'lr': 0.0004988034528084529, 'samples': 1269504, 'steps': 6611, 'loss/train': 1.6072686910629272} 11/06/2021 22:10:38 - INFO - __main__ - Step 6613: {'lr': 0.000498802934169969, 'samples': 1269696, 'steps': 6612, 'loss/train': 2.4517900943756104} 11/06/2021 22:10:39 - INFO - __main__ - Step 6614: {'lr': 0.0004988024154193785, 'samples': 1269888, 'steps': 6613, 'loss/train': 2.244598627090454} 11/06/2021 22:10:40 - INFO - __main__ - Step 6615: {'lr': 0.0004988018965566814, 'samples': 1270080, 'steps': 6614, 'loss/train': 1.5164146423339844} 11/06/2021 22:10:40 - INFO - __main__ - Step 6616: {'lr': 0.000498801377581878, 'samples': 1270272, 'steps': 6615, 'loss/train': 1.9338390827178955} 11/06/2021 22:10:40 - INFO - __main__ - Step 6617: {'lr': 0.0004988008584949686, 'samples': 1270464, 'steps': 6616, 'loss/train': 0.2597677409648895} 11/06/2021 22:10:41 - INFO - __main__ - Step 6618: {'lr': 0.0004988003392959533, 'samples': 1270656, 'steps': 6617, 'loss/train': 2.182772397994995} 11/06/2021 22:10:41 - INFO - __main__ - Step 6619: {'lr': 0.0004987998199848324, 'samples': 1270848, 'steps': 6618, 'loss/train': 1.4753804206848145} 11/06/2021 22:10:42 - INFO - __main__ - Step 6620: {'lr': 0.0004987993005616061, 'samples': 1271040, 'steps': 6619, 'loss/train': 1.8077441453933716} 11/06/2021 22:10:43 - INFO - __main__ - Step 6621: {'lr': 0.0004987987810262747, 'samples': 1271232, 'steps': 6620, 'loss/train': 0.7256439328193665} 11/06/2021 22:10:43 - INFO - __main__ - Step 6622: {'lr': 0.0004987982613788384, 'samples': 1271424, 'steps': 6621, 'loss/train': 1.808817744255066} 11/06/2021 22:10:43 - INFO - __main__ - Step 6623: {'lr': 0.0004987977416192976, 'samples': 1271616, 'steps': 6622, 'loss/train': 2.2159342765808105} 11/06/2021 22:10:44 - INFO - __main__ - Step 6624: {'lr': 0.0004987972217476523, 'samples': 1271808, 'steps': 6623, 'loss/train': 2.1314337253570557} 11/06/2021 22:10:45 - INFO - __main__ - Step 6625: {'lr': 0.0004987967017639027, 'samples': 1272000, 'steps': 6624, 'loss/train': 1.8644086122512817} 11/06/2021 22:10:45 - INFO - __main__ - Step 6626: {'lr': 0.0004987961816680492, 'samples': 1272192, 'steps': 6625, 'loss/train': 1.5684220790863037} 11/06/2021 22:10:45 - INFO - __main__ - Step 6627: {'lr': 0.000498795661460092, 'samples': 1272384, 'steps': 6626, 'loss/train': 2.122796058654785} 11/06/2021 22:10:46 - INFO - __main__ - Step 6628: {'lr': 0.0004987951411400313, 'samples': 1272576, 'steps': 6627, 'loss/train': 1.8228185176849365} 11/06/2021 22:10:46 - INFO - __main__ - Step 6629: {'lr': 0.0004987946207078674, 'samples': 1272768, 'steps': 6628, 'loss/train': 1.77996826171875} 11/06/2021 22:10:47 - INFO - __main__ - Step 6630: {'lr': 0.0004987941001636004, 'samples': 1272960, 'steps': 6629, 'loss/train': 2.308150291442871} 11/06/2021 22:10:47 - INFO - __main__ - Step 6631: {'lr': 0.0004987935795072307, 'samples': 1273152, 'steps': 6630, 'loss/train': 1.8831443786621094} 11/06/2021 22:10:48 - INFO - __main__ - Step 6632: {'lr': 0.0004987930587387584, 'samples': 1273344, 'steps': 6631, 'loss/train': 2.2510716915130615} 11/06/2021 22:10:48 - INFO - __main__ - Step 6633: {'lr': 0.0004987925378581838, 'samples': 1273536, 'steps': 6632, 'loss/train': 1.590349793434143} 11/06/2021 22:10:48 - INFO - __main__ - Step 6634: {'lr': 0.0004987920168655071, 'samples': 1273728, 'steps': 6633, 'loss/train': 1.4046695232391357} 11/06/2021 22:10:49 - INFO - __main__ - Step 6635: {'lr': 0.0004987914957607286, 'samples': 1273920, 'steps': 6634, 'loss/train': 2.134047746658325} 11/06/2021 22:10:50 - INFO - __main__ - Step 6636: {'lr': 0.0004987909745438484, 'samples': 1274112, 'steps': 6635, 'loss/train': 1.8218518495559692} 11/06/2021 22:10:50 - INFO - __main__ - Step 6637: {'lr': 0.000498790453214867, 'samples': 1274304, 'steps': 6636, 'loss/train': 2.182149648666382} 11/06/2021 22:10:51 - INFO - __main__ - Step 6638: {'lr': 0.0004987899317737843, 'samples': 1274496, 'steps': 6637, 'loss/train': 2.091320753097534} 11/06/2021 22:10:51 - INFO - __main__ - Step 6639: {'lr': 0.0004987894102206008, 'samples': 1274688, 'steps': 6638, 'loss/train': 1.5049811601638794} 11/06/2021 22:10:51 - INFO - __main__ - Step 6640: {'lr': 0.0004987888885553166, 'samples': 1274880, 'steps': 6639, 'loss/train': 2.0964086055755615} 11/06/2021 22:10:52 - INFO - __main__ - Step 6641: {'lr': 0.0004987883667779319, 'samples': 1275072, 'steps': 6640, 'loss/train': 0.3089179992675781} 11/06/2021 22:10:53 - INFO - __main__ - Step 6642: {'lr': 0.0004987878448884471, 'samples': 1275264, 'steps': 6641, 'loss/train': 1.5907493829727173} 11/06/2021 22:10:53 - INFO - __main__ - Step 6643: {'lr': 0.0004987873228868622, 'samples': 1275456, 'steps': 6642, 'loss/train': 1.9820168018341064} 11/06/2021 22:10:53 - INFO - __main__ - Step 6644: {'lr': 0.0004987868007731778, 'samples': 1275648, 'steps': 6643, 'loss/train': 1.0979185104370117} 11/06/2021 22:10:54 - INFO - __main__ - Step 6645: {'lr': 0.0004987862785473937, 'samples': 1275840, 'steps': 6644, 'loss/train': 1.9548885822296143} 11/06/2021 22:10:55 - INFO - __main__ - Step 6646: {'lr': 0.0004987857562095103, 'samples': 1276032, 'steps': 6645, 'loss/train': 1.8669013977050781} 11/06/2021 22:10:55 - INFO - __main__ - Step 6647: {'lr': 0.0004987852337595281, 'samples': 1276224, 'steps': 6646, 'loss/train': 1.6619402170181274} 11/06/2021 22:10:55 - INFO - __main__ - Step 6648: {'lr': 0.0004987847111974469, 'samples': 1276416, 'steps': 6647, 'loss/train': 1.4700028896331787} 11/06/2021 22:10:56 - INFO - __main__ - Step 6649: {'lr': 0.0004987841885232674, 'samples': 1276608, 'steps': 6648, 'loss/train': 1.8722918033599854} 11/06/2021 22:10:56 - INFO - __main__ - Step 6650: {'lr': 0.0004987836657369893, 'samples': 1276800, 'steps': 6649, 'loss/train': 2.425621271133423} 11/06/2021 22:10:57 - INFO - __main__ - Step 6651: {'lr': 0.0004987831428386133, 'samples': 1276992, 'steps': 6650, 'loss/train': 1.4940102100372314} 11/06/2021 22:10:58 - INFO - __main__ - Step 6652: {'lr': 0.0004987826198281394, 'samples': 1277184, 'steps': 6651, 'loss/train': 1.6286990642547607} 11/06/2021 22:10:58 - INFO - __main__ - Step 6653: {'lr': 0.0004987820967055678, 'samples': 1277376, 'steps': 6652, 'loss/train': 1.9754180908203125} 11/06/2021 22:10:58 - INFO - __main__ - Step 6654: {'lr': 0.000498781573470899, 'samples': 1277568, 'steps': 6653, 'loss/train': 2.209815502166748} 11/06/2021 22:10:59 - INFO - __main__ - Step 6655: {'lr': 0.000498781050124133, 'samples': 1277760, 'steps': 6654, 'loss/train': 2.132535934448242} 11/06/2021 22:11:00 - INFO - __main__ - Step 6656: {'lr': 0.0004987805266652701, 'samples': 1277952, 'steps': 6655, 'loss/train': 1.9133520126342773} 11/06/2021 22:11:00 - INFO - __main__ - Step 6657: {'lr': 0.0004987800030943105, 'samples': 1278144, 'steps': 6656, 'loss/train': 1.6376458406448364} 11/06/2021 22:11:00 - INFO - __main__ - Step 6658: {'lr': 0.0004987794794112545, 'samples': 1278336, 'steps': 6657, 'loss/train': 1.9937931299209595} 11/06/2021 22:11:01 - INFO - __main__ - Step 6659: {'lr': 0.0004987789556161022, 'samples': 1278528, 'steps': 6658, 'loss/train': 0.2782423198223114} 11/06/2021 22:11:01 - INFO - __main__ - Step 6660: {'lr': 0.0004987784317088541, 'samples': 1278720, 'steps': 6659, 'loss/train': 1.6906732320785522} 11/06/2021 22:11:02 - INFO - __main__ - Step 6661: {'lr': 0.0004987779076895102, 'samples': 1278912, 'steps': 6660, 'loss/train': 2.3083863258361816} 11/06/2021 22:11:03 - INFO - __main__ - Step 6662: {'lr': 0.0004987773835580708, 'samples': 1279104, 'steps': 6661, 'loss/train': 1.6070939302444458} 11/06/2021 22:11:03 - INFO - __main__ - Step 6663: {'lr': 0.0004987768593145362, 'samples': 1279296, 'steps': 6662, 'loss/train': 1.7183184623718262} 11/06/2021 22:11:03 - INFO - __main__ - Step 6664: {'lr': 0.0004987763349589065, 'samples': 1279488, 'steps': 6663, 'loss/train': 1.628963828086853} 11/06/2021 22:11:04 - INFO - __main__ - Step 6665: {'lr': 0.0004987758104911821, 'samples': 1279680, 'steps': 6664, 'loss/train': 1.9477825164794922} 11/06/2021 22:11:05 - INFO - __main__ - Step 6666: {'lr': 0.0004987752859113631, 'samples': 1279872, 'steps': 6665, 'loss/train': 1.3619539737701416} 11/06/2021 22:11:05 - INFO - __main__ - Step 6667: {'lr': 0.0004987747612194499, 'samples': 1280064, 'steps': 6666, 'loss/train': 1.2572715282440186} 11/06/2021 22:11:05 - INFO - __main__ - Step 6668: {'lr': 0.0004987742364154425, 'samples': 1280256, 'steps': 6667, 'loss/train': 2.365995407104492} 11/06/2021 22:11:06 - INFO - __main__ - Step 6669: {'lr': 0.0004987737114993413, 'samples': 1280448, 'steps': 6668, 'loss/train': 1.940123438835144} 11/06/2021 22:11:06 - INFO - __main__ - Step 6670: {'lr': 0.0004987731864711466, 'samples': 1280640, 'steps': 6669, 'loss/train': 1.1229156255722046} 11/06/2021 22:11:07 - INFO - __main__ - Step 6671: {'lr': 0.0004987726613308584, 'samples': 1280832, 'steps': 6670, 'loss/train': 1.7459203004837036} 11/06/2021 22:11:07 - INFO - __main__ - Step 6672: {'lr': 0.0004987721360784772, 'samples': 1281024, 'steps': 6671, 'loss/train': 1.6915955543518066} 11/06/2021 22:11:08 - INFO - __main__ - Step 6673: {'lr': 0.0004987716107140031, 'samples': 1281216, 'steps': 6672, 'loss/train': 2.0414986610412598} 11/06/2021 22:11:08 - INFO - __main__ - Step 6674: {'lr': 0.0004987710852374363, 'samples': 1281408, 'steps': 6673, 'loss/train': 2.4011054039001465} 11/06/2021 22:11:08 - INFO - __main__ - Step 6675: {'lr': 0.0004987705596487771, 'samples': 1281600, 'steps': 6674, 'loss/train': 1.3550761938095093} 11/06/2021 22:11:09 - INFO - __main__ - Step 6676: {'lr': 0.0004987700339480258, 'samples': 1281792, 'steps': 6675, 'loss/train': 1.6839749813079834} 11/06/2021 22:11:10 - INFO - __main__ - Step 6677: {'lr': 0.0004987695081351824, 'samples': 1281984, 'steps': 6676, 'loss/train': 2.2747011184692383} 11/06/2021 22:11:10 - INFO - __main__ - Step 6678: {'lr': 0.0004987689822102474, 'samples': 1282176, 'steps': 6677, 'loss/train': 1.934715747833252} 11/06/2021 22:11:10 - INFO - __main__ - Step 6679: {'lr': 0.000498768456173221, 'samples': 1282368, 'steps': 6678, 'loss/train': 1.7885398864746094} 11/06/2021 22:11:11 - INFO - __main__ - Step 6680: {'lr': 0.0004987679300241033, 'samples': 1282560, 'steps': 6679, 'loss/train': 1.533270239830017} 11/06/2021 22:11:11 - INFO - __main__ - Step 6681: {'lr': 0.0004987674037628945, 'samples': 1282752, 'steps': 6680, 'loss/train': 1.9293286800384521} 11/06/2021 22:11:12 - INFO - __main__ - Step 6682: {'lr': 0.0004987668773895951, 'samples': 1282944, 'steps': 6681, 'loss/train': 1.8046060800552368} 11/06/2021 22:11:13 - INFO - __main__ - Step 6683: {'lr': 0.0004987663509042052, 'samples': 1283136, 'steps': 6682, 'loss/train': 1.674709677696228} 11/06/2021 22:11:13 - INFO - __main__ - Step 6684: {'lr': 0.000498765824306725, 'samples': 1283328, 'steps': 6683, 'loss/train': 1.765952467918396} 11/06/2021 22:11:13 - INFO - __main__ - Step 6685: {'lr': 0.0004987652975971546, 'samples': 1283520, 'steps': 6684, 'loss/train': 2.027494192123413} 11/06/2021 22:11:14 - INFO - __main__ - Step 6686: {'lr': 0.0004987647707754945, 'samples': 1283712, 'steps': 6685, 'loss/train': 1.870924949645996} 11/06/2021 22:11:15 - INFO - __main__ - Step 6687: {'lr': 0.0004987642438417449, 'samples': 1283904, 'steps': 6686, 'loss/train': 1.476467490196228} 11/06/2021 22:11:15 - INFO - __main__ - Step 6688: {'lr': 0.0004987637167959059, 'samples': 1284096, 'steps': 6687, 'loss/train': 2.0917282104492188} 11/06/2021 22:11:15 - INFO - __main__ - Step 6689: {'lr': 0.0004987631896379779, 'samples': 1284288, 'steps': 6688, 'loss/train': 1.4638168811798096} 11/06/2021 22:11:16 - INFO - __main__ - Step 6690: {'lr': 0.0004987626623679609, 'samples': 1284480, 'steps': 6689, 'loss/train': 1.3137454986572266} 11/06/2021 22:11:16 - INFO - __main__ - Step 6691: {'lr': 0.0004987621349858553, 'samples': 1284672, 'steps': 6690, 'loss/train': 2.255805015563965} 11/06/2021 22:11:17 - INFO - __main__ - Step 6692: {'lr': 0.0004987616074916615, 'samples': 1284864, 'steps': 6691, 'loss/train': 1.9453632831573486} 11/06/2021 22:11:18 - INFO - __main__ - Step 6693: {'lr': 0.0004987610798853794, 'samples': 1285056, 'steps': 6692, 'loss/train': 1.9508827924728394} 11/06/2021 22:11:18 - INFO - __main__ - Step 6694: {'lr': 0.0004987605521670094, 'samples': 1285248, 'steps': 6693, 'loss/train': 1.6768591403961182} 11/06/2021 22:11:18 - INFO - __main__ - Step 6695: {'lr': 0.0004987600243365518, 'samples': 1285440, 'steps': 6694, 'loss/train': 1.7393834590911865} 11/06/2021 22:11:19 - INFO - __main__ - Step 6696: {'lr': 0.0004987594963940066, 'samples': 1285632, 'steps': 6695, 'loss/train': 1.7367832660675049} 11/06/2021 22:11:20 - INFO - __main__ - Step 6697: {'lr': 0.0004987589683393744, 'samples': 1285824, 'steps': 6696, 'loss/train': 2.0130209922790527} 11/06/2021 22:11:20 - INFO - __main__ - Step 6698: {'lr': 0.0004987584401726552, 'samples': 1286016, 'steps': 6697, 'loss/train': 2.1210427284240723} 11/06/2021 22:11:20 - INFO - __main__ - Step 6699: {'lr': 0.0004987579118938492, 'samples': 1286208, 'steps': 6698, 'loss/train': 1.9069485664367676} 11/06/2021 22:11:21 - INFO - __main__ - Step 6700: {'lr': 0.0004987573835029569, 'samples': 1286400, 'steps': 6699, 'loss/train': 2.1391384601593018} 11/06/2021 22:11:21 - INFO - __main__ - Step 6701: {'lr': 0.0004987568549999782, 'samples': 1286592, 'steps': 6700, 'loss/train': 1.8220481872558594} 11/06/2021 22:11:22 - INFO - __main__ - Step 6702: {'lr': 0.0004987563263849136, 'samples': 1286784, 'steps': 6701, 'loss/train': 2.194391965866089} 11/06/2021 22:11:22 - INFO - __main__ - Step 6703: {'lr': 0.0004987557976577632, 'samples': 1286976, 'steps': 6702, 'loss/train': 1.930019497871399} 11/06/2021 22:11:23 - INFO - __main__ - Step 6704: {'lr': 0.0004987552688185273, 'samples': 1287168, 'steps': 6703, 'loss/train': 2.2534563541412354} 11/06/2021 22:11:23 - INFO - __main__ - Step 6705: {'lr': 0.0004987547398672061, 'samples': 1287360, 'steps': 6704, 'loss/train': 1.8196769952774048} 11/06/2021 22:11:23 - INFO - __main__ - Step 6706: {'lr': 0.0004987542108037998, 'samples': 1287552, 'steps': 6705, 'loss/train': 2.2005743980407715} 11/06/2021 22:11:24 - INFO - __main__ - Step 6707: {'lr': 0.0004987536816283087, 'samples': 1287744, 'steps': 6706, 'loss/train': 1.494478702545166} 11/06/2021 22:11:25 - INFO - __main__ - Step 6708: {'lr': 0.0004987531523407331, 'samples': 1287936, 'steps': 6707, 'loss/train': 1.9010131359100342} 11/06/2021 22:11:25 - INFO - __main__ - Step 6709: {'lr': 0.0004987526229410732, 'samples': 1288128, 'steps': 6708, 'loss/train': 1.8058475255966187} 11/06/2021 22:11:25 - INFO - __main__ - Step 6710: {'lr': 0.000498752093429329, 'samples': 1288320, 'steps': 6709, 'loss/train': 1.8113231658935547} 11/06/2021 22:11:26 - INFO - __main__ - Step 6711: {'lr': 0.0004987515638055012, 'samples': 1288512, 'steps': 6710, 'loss/train': 1.8796414136886597} 11/06/2021 22:11:27 - INFO - __main__ - Step 6712: {'lr': 0.0004987510340695896, 'samples': 1288704, 'steps': 6711, 'loss/train': 2.1111533641815186} 11/06/2021 22:11:27 - INFO - __main__ - Step 6713: {'lr': 0.0004987505042215948, 'samples': 1288896, 'steps': 6712, 'loss/train': 2.0792810916900635} 11/06/2021 22:11:27 - INFO - __main__ - Step 6714: {'lr': 0.0004987499742615167, 'samples': 1289088, 'steps': 6713, 'loss/train': 1.8710038661956787} 11/06/2021 22:11:28 - INFO - __main__ - Step 6715: {'lr': 0.0004987494441893557, 'samples': 1289280, 'steps': 6714, 'loss/train': 2.19236421585083} 11/06/2021 22:11:28 - INFO - __main__ - Step 6716: {'lr': 0.0004987489140051121, 'samples': 1289472, 'steps': 6715, 'loss/train': 2.288778066635132} 11/06/2021 22:11:29 - INFO - __main__ - Step 6717: {'lr': 0.000498748383708786, 'samples': 1289664, 'steps': 6716, 'loss/train': 1.4987444877624512} 11/06/2021 22:11:30 - INFO - __main__ - Step 6718: {'lr': 0.0004987478533003779, 'samples': 1289856, 'steps': 6717, 'loss/train': 1.6623836755752563} 11/06/2021 22:11:30 - INFO - __main__ - Step 6719: {'lr': 0.0004987473227798877, 'samples': 1290048, 'steps': 6718, 'loss/train': 1.9826804399490356} 11/06/2021 22:11:30 - INFO - __main__ - Step 6720: {'lr': 0.0004987467921473157, 'samples': 1290240, 'steps': 6719, 'loss/train': 1.859378457069397} 11/06/2021 22:11:31 - INFO - __main__ - Step 6721: {'lr': 0.0004987462614026624, 'samples': 1290432, 'steps': 6720, 'loss/train': 1.4481638669967651} 11/06/2021 22:11:32 - INFO - __main__ - Step 6722: {'lr': 0.0004987457305459279, 'samples': 1290624, 'steps': 6721, 'loss/train': 1.673604965209961} 11/06/2021 22:11:32 - INFO - __main__ - Step 6723: {'lr': 0.0004987451995771124, 'samples': 1290816, 'steps': 6722, 'loss/train': 1.731692910194397} 11/06/2021 22:11:32 - INFO - __main__ - Step 6724: {'lr': 0.000498744668496216, 'samples': 1291008, 'steps': 6723, 'loss/train': 1.4091925621032715} 11/06/2021 22:11:33 - INFO - __main__ - Step 6725: {'lr': 0.0004987441373032393, 'samples': 1291200, 'steps': 6724, 'loss/train': 1.264276385307312} 11/06/2021 22:11:33 - INFO - __main__ - Step 6726: {'lr': 0.0004987436059981821, 'samples': 1291392, 'steps': 6725, 'loss/train': 1.7149850130081177} 11/06/2021 22:11:34 - INFO - __main__ - Step 6727: {'lr': 0.0004987430745810451, 'samples': 1291584, 'steps': 6726, 'loss/train': 1.8684431314468384} 11/06/2021 22:11:34 - INFO - __main__ - Step 6728: {'lr': 0.0004987425430518282, 'samples': 1291776, 'steps': 6727, 'loss/train': 1.2485170364379883} 11/06/2021 22:11:35 - INFO - __main__ - Step 6729: {'lr': 0.0004987420114105317, 'samples': 1291968, 'steps': 6728, 'loss/train': 1.7726978063583374} 11/06/2021 22:11:35 - INFO - __main__ - Step 6730: {'lr': 0.000498741479657156, 'samples': 1292160, 'steps': 6729, 'loss/train': 2.122650146484375} 11/06/2021 22:11:35 - INFO - __main__ - Step 6731: {'lr': 0.0004987409477917011, 'samples': 1292352, 'steps': 6730, 'loss/train': 2.2055673599243164} 11/06/2021 22:11:36 - INFO - __main__ - Step 6732: {'lr': 0.0004987404158141675, 'samples': 1292544, 'steps': 6731, 'loss/train': 1.7402311563491821} 11/06/2021 22:11:37 - INFO - __main__ - Step 6733: {'lr': 0.0004987398837245552, 'samples': 1292736, 'steps': 6732, 'loss/train': 2.053032159805298} 11/06/2021 22:11:37 - INFO - __main__ - Step 6734: {'lr': 0.0004987393515228646, 'samples': 1292928, 'steps': 6733, 'loss/train': 1.614290475845337} 11/06/2021 22:11:38 - INFO - __main__ - Step 6735: {'lr': 0.0004987388192090959, 'samples': 1293120, 'steps': 6734, 'loss/train': 1.5254181623458862} 11/06/2021 22:11:38 - INFO - __main__ - Step 6736: {'lr': 0.0004987382867832493, 'samples': 1293312, 'steps': 6735, 'loss/train': 2.606459379196167} 11/06/2021 22:11:38 - INFO - __main__ - Step 6737: {'lr': 0.0004987377542453251, 'samples': 1293504, 'steps': 6736, 'loss/train': 1.3655518293380737} 11/06/2021 22:11:39 - INFO - __main__ - Step 6738: {'lr': 0.0004987372215953234, 'samples': 1293696, 'steps': 6737, 'loss/train': 1.8274635076522827} 11/06/2021 22:11:40 - INFO - __main__ - Step 6739: {'lr': 0.0004987366888332446, 'samples': 1293888, 'steps': 6738, 'loss/train': 1.5153939723968506} 11/06/2021 22:11:40 - INFO - __main__ - Step 6740: {'lr': 0.0004987361559590889, 'samples': 1294080, 'steps': 6739, 'loss/train': 2.4495294094085693} 11/06/2021 22:11:40 - INFO - __main__ - Step 6741: {'lr': 0.0004987356229728566, 'samples': 1294272, 'steps': 6740, 'loss/train': 1.0965790748596191} 11/06/2021 22:11:41 - INFO - __main__ - Step 6742: {'lr': 0.0004987350898745477, 'samples': 1294464, 'steps': 6741, 'loss/train': 1.4959521293640137} 11/06/2021 22:11:42 - INFO - __main__ - Step 6743: {'lr': 0.0004987345566641628, 'samples': 1294656, 'steps': 6742, 'loss/train': 0.2513975501060486} 11/06/2021 22:11:42 - INFO - __main__ - Step 6744: {'lr': 0.0004987340233417019, 'samples': 1294848, 'steps': 6743, 'loss/train': 1.892668604850769} 11/06/2021 22:11:42 - INFO - __main__ - Step 6745: {'lr': 0.0004987334899071652, 'samples': 1295040, 'steps': 6744, 'loss/train': 3.132077217102051} 11/06/2021 22:11:43 - INFO - __main__ - Step 6746: {'lr': 0.000498732956360553, 'samples': 1295232, 'steps': 6745, 'loss/train': 2.2569050788879395} 11/06/2021 22:11:43 - INFO - __main__ - Step 6747: {'lr': 0.0004987324227018657, 'samples': 1295424, 'steps': 6746, 'loss/train': 2.1900782585144043} 11/06/2021 22:11:44 - INFO - __main__ - Step 6748: {'lr': 0.0004987318889311033, 'samples': 1295616, 'steps': 6747, 'loss/train': 2.0776844024658203} 11/06/2021 22:11:45 - INFO - __main__ - Step 6749: {'lr': 0.0004987313550482663, 'samples': 1295808, 'steps': 6748, 'loss/train': 2.524590253829956} 11/06/2021 22:11:45 - INFO - __main__ - Step 6750: {'lr': 0.0004987308210533546, 'samples': 1296000, 'steps': 6749, 'loss/train': 2.416774272918701} 11/06/2021 22:11:45 - INFO - __main__ - Step 6751: {'lr': 0.0004987302869463686, 'samples': 1296192, 'steps': 6750, 'loss/train': 1.5255533456802368} 11/06/2021 22:11:46 - INFO - __main__ - Step 6752: {'lr': 0.0004987297527273088, 'samples': 1296384, 'steps': 6751, 'loss/train': 1.5347144603729248} 11/06/2021 22:11:47 - INFO - __main__ - Step 6753: {'lr': 0.0004987292183961751, 'samples': 1296576, 'steps': 6752, 'loss/train': 1.8391497135162354} 11/06/2021 22:11:47 - INFO - __main__ - Step 6754: {'lr': 0.0004987286839529679, 'samples': 1296768, 'steps': 6753, 'loss/train': 1.9722280502319336} 11/06/2021 22:11:47 - INFO - __main__ - Step 6755: {'lr': 0.0004987281493976873, 'samples': 1296960, 'steps': 6754, 'loss/train': 1.6345213651657104} 11/06/2021 22:11:48 - INFO - __main__ - Step 6756: {'lr': 0.0004987276147303337, 'samples': 1297152, 'steps': 6755, 'loss/train': 1.5767613649368286} 11/06/2021 22:11:48 - INFO - __main__ - Step 6757: {'lr': 0.0004987270799509071, 'samples': 1297344, 'steps': 6756, 'loss/train': 1.8740566968917847} 11/06/2021 22:11:49 - INFO - __main__ - Step 6758: {'lr': 0.0004987265450594082, 'samples': 1297536, 'steps': 6757, 'loss/train': 1.6574358940124512} 11/06/2021 22:11:49 - INFO - __main__ - Step 6759: {'lr': 0.0004987260100558368, 'samples': 1297728, 'steps': 6758, 'loss/train': 2.1299595832824707} 11/06/2021 22:11:50 - INFO - __main__ - Step 6760: {'lr': 0.0004987254749401933, 'samples': 1297920, 'steps': 6759, 'loss/train': 1.5151968002319336} 11/06/2021 22:11:50 - INFO - __main__ - Step 6761: {'lr': 0.000498724939712478, 'samples': 1298112, 'steps': 6760, 'loss/train': 2.2985787391662598} 11/06/2021 22:11:51 - INFO - __main__ - Step 6762: {'lr': 0.000498724404372691, 'samples': 1298304, 'steps': 6761, 'loss/train': 2.6387453079223633} 11/06/2021 22:11:51 - INFO - __main__ - Step 6763: {'lr': 0.0004987238689208327, 'samples': 1298496, 'steps': 6762, 'loss/train': 1.8828072547912598} 11/06/2021 22:11:52 - INFO - __main__ - Step 6764: {'lr': 0.0004987233333569031, 'samples': 1298688, 'steps': 6763, 'loss/train': 2.250027894973755} 11/06/2021 22:11:52 - INFO - __main__ - Step 6765: {'lr': 0.0004987227976809028, 'samples': 1298880, 'steps': 6764, 'loss/train': 1.8417761325836182} 11/06/2021 22:11:53 - INFO - __main__ - Step 6766: {'lr': 0.0004987222618928318, 'samples': 1299072, 'steps': 6765, 'loss/train': 1.9207943677902222} 11/06/2021 22:11:53 - INFO - __main__ - Step 6767: {'lr': 0.0004987217259926904, 'samples': 1299264, 'steps': 6766, 'loss/train': 0.23178565502166748} 11/06/2021 22:11:53 - INFO - __main__ - Step 6768: {'lr': 0.0004987211899804788, 'samples': 1299456, 'steps': 6767, 'loss/train': 1.8414160013198853} 11/06/2021 22:11:55 - INFO - __main__ - Step 6769: {'lr': 0.0004987206538561972, 'samples': 1299648, 'steps': 6768, 'loss/train': 1.939680814743042} 11/06/2021 22:11:55 - INFO - __main__ - Step 6770: {'lr': 0.000498720117619846, 'samples': 1299840, 'steps': 6769, 'loss/train': 1.4610973596572876} 11/06/2021 22:11:55 - INFO - __main__ - Step 6771: {'lr': 0.0004987195812714252, 'samples': 1300032, 'steps': 6770, 'loss/train': 1.526523232460022} 11/06/2021 22:11:56 - INFO - __main__ - Step 6772: {'lr': 0.0004987190448109354, 'samples': 1300224, 'steps': 6771, 'loss/train': 2.2244198322296143} 11/06/2021 22:11:56 - INFO - __main__ - Step 6773: {'lr': 0.0004987185082383765, 'samples': 1300416, 'steps': 6772, 'loss/train': 1.7443127632141113} 11/06/2021 22:11:56 - INFO - __main__ - Step 6774: {'lr': 0.000498717971553749, 'samples': 1300608, 'steps': 6773, 'loss/train': 2.230355978012085} 11/06/2021 22:11:57 - INFO - __main__ - Step 6775: {'lr': 0.0004987174347570529, 'samples': 1300800, 'steps': 6774, 'loss/train': 0.42308324575424194} 11/06/2021 22:11:58 - INFO - __main__ - Step 6776: {'lr': 0.0004987168978482886, 'samples': 1300992, 'steps': 6775, 'loss/train': 1.6637697219848633} 11/06/2021 22:11:58 - INFO - __main__ - Step 6777: {'lr': 0.0004987163608274564, 'samples': 1301184, 'steps': 6776, 'loss/train': 1.6888412237167358} 11/06/2021 22:11:58 - INFO - __main__ - Step 6778: {'lr': 0.0004987158236945563, 'samples': 1301376, 'steps': 6777, 'loss/train': 1.6136714220046997} 11/06/2021 22:11:59 - INFO - __main__ - Step 6779: {'lr': 0.0004987152864495887, 'samples': 1301568, 'steps': 6778, 'loss/train': 2.1859946250915527} 11/06/2021 22:12:00 - INFO - __main__ - Step 6780: {'lr': 0.000498714749092554, 'samples': 1301760, 'steps': 6779, 'loss/train': 1.9430932998657227} 11/06/2021 22:12:00 - INFO - __main__ - Step 6781: {'lr': 0.0004987142116234521, 'samples': 1301952, 'steps': 6780, 'loss/train': 1.1831541061401367} 11/06/2021 22:12:00 - INFO - __main__ - Step 6782: {'lr': 0.0004987136740422835, 'samples': 1302144, 'steps': 6781, 'loss/train': 1.972415566444397} 11/06/2021 22:12:01 - INFO - __main__ - Step 6783: {'lr': 0.0004987131363490483, 'samples': 1302336, 'steps': 6782, 'loss/train': 1.6971096992492676} 11/06/2021 22:12:01 - INFO - __main__ - Step 6784: {'lr': 0.0004987125985437468, 'samples': 1302528, 'steps': 6783, 'loss/train': 2.15081524848938} 11/06/2021 22:12:02 - INFO - __main__ - Step 6785: {'lr': 0.0004987120606263794, 'samples': 1302720, 'steps': 6784, 'loss/train': 2.056392192840576} 11/06/2021 22:12:03 - INFO - __main__ - Step 6786: {'lr': 0.000498711522596946, 'samples': 1302912, 'steps': 6785, 'loss/train': 1.852485179901123} 11/06/2021 22:12:03 - INFO - __main__ - Step 6787: {'lr': 0.000498710984455447, 'samples': 1303104, 'steps': 6786, 'loss/train': 1.6676357984542847} 11/06/2021 22:12:03 - INFO - __main__ - Step 6788: {'lr': 0.0004987104462018828, 'samples': 1303296, 'steps': 6787, 'loss/train': 1.6238676309585571} 11/06/2021 22:12:04 - INFO - __main__ - Step 6789: {'lr': 0.0004987099078362534, 'samples': 1303488, 'steps': 6788, 'loss/train': 1.8077821731567383} 11/06/2021 22:12:05 - INFO - __main__ - Step 6790: {'lr': 0.0004987093693585591, 'samples': 1303680, 'steps': 6789, 'loss/train': 2.1676840782165527} 11/06/2021 22:12:05 - INFO - __main__ - Step 6791: {'lr': 0.0004987088307688004, 'samples': 1303872, 'steps': 6790, 'loss/train': 0.8062730431556702} 11/06/2021 22:12:05 - INFO - __main__ - Step 6792: {'lr': 0.0004987082920669772, 'samples': 1304064, 'steps': 6791, 'loss/train': 1.5815889835357666} 11/06/2021 22:12:06 - INFO - __main__ - Step 6793: {'lr': 0.0004987077532530899, 'samples': 1304256, 'steps': 6792, 'loss/train': 1.743523120880127} 11/06/2021 22:12:06 - INFO - __main__ - Step 6794: {'lr': 0.0004987072143271388, 'samples': 1304448, 'steps': 6793, 'loss/train': 1.1385010480880737} 11/06/2021 22:12:07 - INFO - __main__ - Step 6795: {'lr': 0.000498706675289124, 'samples': 1304640, 'steps': 6794, 'loss/train': 1.5993659496307373} 11/06/2021 22:12:07 - INFO - __main__ - Step 6796: {'lr': 0.0004987061361390458, 'samples': 1304832, 'steps': 6795, 'loss/train': 1.9786431789398193} 11/06/2021 22:12:08 - INFO - __main__ - Step 6797: {'lr': 0.0004987055968769045, 'samples': 1305024, 'steps': 6796, 'loss/train': 1.779247522354126} 11/06/2021 22:12:08 - INFO - __main__ - Step 6798: {'lr': 0.0004987050575027002, 'samples': 1305216, 'steps': 6797, 'loss/train': 1.8261228799819946} 11/06/2021 22:12:08 - INFO - __main__ - Step 6799: {'lr': 0.0004987045180164333, 'samples': 1305408, 'steps': 6798, 'loss/train': 1.5211654901504517} 11/06/2021 22:12:10 - INFO - __main__ - Step 6800: {'lr': 0.0004987039784181041, 'samples': 1305600, 'steps': 6799, 'loss/train': 0.9507125616073608} 11/06/2021 22:12:10 - INFO - __main__ - Step 6801: {'lr': 0.0004987034387077126, 'samples': 1305792, 'steps': 6800, 'loss/train': 1.7004783153533936} 11/06/2021 22:12:10 - INFO - __main__ - Step 6802: {'lr': 0.0004987028988852592, 'samples': 1305984, 'steps': 6801, 'loss/train': 2.2649049758911133} 11/06/2021 22:12:11 - INFO - __main__ - Step 6803: {'lr': 0.0004987023589507441, 'samples': 1306176, 'steps': 6802, 'loss/train': 1.9656171798706055} 11/06/2021 22:12:11 - INFO - __main__ - Step 6804: {'lr': 0.0004987018189041675, 'samples': 1306368, 'steps': 6803, 'loss/train': 1.7351499795913696} 11/06/2021 22:12:12 - INFO - __main__ - Step 6805: {'lr': 0.0004987012787455297, 'samples': 1306560, 'steps': 6804, 'loss/train': 1.2826176881790161} 11/06/2021 22:12:12 - INFO - __main__ - Step 6806: {'lr': 0.000498700738474831, 'samples': 1306752, 'steps': 6805, 'loss/train': 1.076385736465454} 11/06/2021 22:12:13 - INFO - __main__ - Step 6807: {'lr': 0.0004987001980920716, 'samples': 1306944, 'steps': 6806, 'loss/train': 1.8413560390472412} 11/06/2021 22:12:13 - INFO - __main__ - Step 6808: {'lr': 0.0004986996575972517, 'samples': 1307136, 'steps': 6807, 'loss/train': 0.7521131038665771} 11/06/2021 22:12:13 - INFO - __main__ - Step 6809: {'lr': 0.0004986991169903716, 'samples': 1307328, 'steps': 6808, 'loss/train': 2.316514015197754} 11/06/2021 22:12:14 - INFO - __main__ - Step 6810: {'lr': 0.0004986985762714314, 'samples': 1307520, 'steps': 6809, 'loss/train': 2.3181118965148926} 11/06/2021 22:12:15 - INFO - __main__ - Step 6811: {'lr': 0.0004986980354404316, 'samples': 1307712, 'steps': 6810, 'loss/train': 1.841756820678711} 11/06/2021 22:12:15 - INFO - __main__ - Step 6812: {'lr': 0.0004986974944973723, 'samples': 1307904, 'steps': 6811, 'loss/train': 2.5697433948516846} 11/06/2021 22:12:16 - INFO - __main__ - Step 6813: {'lr': 0.0004986969534422537, 'samples': 1308096, 'steps': 6812, 'loss/train': 0.27386459708213806} 11/06/2021 22:12:16 - INFO - __main__ - Step 6814: {'lr': 0.000498696412275076, 'samples': 1308288, 'steps': 6813, 'loss/train': 1.9443211555480957} 11/06/2021 22:12:16 - INFO - __main__ - Step 6815: {'lr': 0.0004986958709958396, 'samples': 1308480, 'steps': 6814, 'loss/train': 1.8300877809524536} 11/06/2021 22:12:18 - INFO - __main__ - Step 6816: {'lr': 0.0004986953296045448, 'samples': 1308672, 'steps': 6815, 'loss/train': 1.716494083404541} 11/06/2021 22:12:18 - INFO - __main__ - Step 6817: {'lr': 0.0004986947881011917, 'samples': 1308864, 'steps': 6816, 'loss/train': 1.5828531980514526} 11/06/2021 22:12:18 - INFO - __main__ - Step 6818: {'lr': 0.0004986942464857804, 'samples': 1309056, 'steps': 6817, 'loss/train': 1.8163785934448242} 11/06/2021 22:12:19 - INFO - __main__ - Step 6819: {'lr': 0.0004986937047583114, 'samples': 1309248, 'steps': 6818, 'loss/train': 1.8825898170471191} 11/06/2021 22:12:19 - INFO - __main__ - Step 6820: {'lr': 0.0004986931629187848, 'samples': 1309440, 'steps': 6819, 'loss/train': 2.2714736461639404} 11/06/2021 22:12:20 - INFO - __main__ - Step 6821: {'lr': 0.0004986926209672011, 'samples': 1309632, 'steps': 6820, 'loss/train': 2.2257802486419678} 11/06/2021 22:12:20 - INFO - __main__ - Step 6822: {'lr': 0.0004986920789035601, 'samples': 1309824, 'steps': 6821, 'loss/train': 1.6289818286895752} 11/06/2021 22:12:21 - INFO - __main__ - Step 6823: {'lr': 0.0004986915367278623, 'samples': 1310016, 'steps': 6822, 'loss/train': 1.8991657495498657} 11/06/2021 22:12:21 - INFO - __main__ - Step 6824: {'lr': 0.0004986909944401082, 'samples': 1310208, 'steps': 6823, 'loss/train': 1.4807871580123901} 11/06/2021 22:12:21 - INFO - __main__ - Step 6825: {'lr': 0.0004986904520402975, 'samples': 1310400, 'steps': 6824, 'loss/train': 1.9092999696731567} 11/06/2021 22:12:23 - INFO - __main__ - Step 6826: {'lr': 0.0004986899095284308, 'samples': 1310592, 'steps': 6825, 'loss/train': 2.1994271278381348} 11/06/2021 22:12:23 - INFO - __main__ - Step 6827: {'lr': 0.0004986893669045083, 'samples': 1310784, 'steps': 6826, 'loss/train': 2.3838884830474854} 11/06/2021 22:12:23 - INFO - __main__ - Step 6828: {'lr': 0.0004986888241685301, 'samples': 1310976, 'steps': 6827, 'loss/train': 1.6222882270812988} 11/06/2021 22:12:24 - INFO - __main__ - Step 6829: {'lr': 0.0004986882813204967, 'samples': 1311168, 'steps': 6828, 'loss/train': 2.6850411891937256} 11/06/2021 22:12:24 - INFO - __main__ - Step 6830: {'lr': 0.0004986877383604081, 'samples': 1311360, 'steps': 6829, 'loss/train': 1.8489203453063965} 11/06/2021 22:12:25 - INFO - __main__ - Step 6831: {'lr': 0.0004986871952882647, 'samples': 1311552, 'steps': 6830, 'loss/train': 1.933595061302185} 11/06/2021 22:12:25 - INFO - __main__ - Step 6832: {'lr': 0.0004986866521040666, 'samples': 1311744, 'steps': 6831, 'loss/train': 1.9676979780197144} 11/06/2021 22:12:26 - INFO - __main__ - Step 6833: {'lr': 0.0004986861088078142, 'samples': 1311936, 'steps': 6832, 'loss/train': 1.3297863006591797} 11/06/2021 22:12:26 - INFO - __main__ - Step 6834: {'lr': 0.0004986855653995077, 'samples': 1312128, 'steps': 6833, 'loss/train': 0.9145995378494263} 11/06/2021 22:12:26 - INFO - __main__ - Step 6835: {'lr': 0.0004986850218791474, 'samples': 1312320, 'steps': 6834, 'loss/train': 2.046581983566284} 11/06/2021 22:12:28 - INFO - __main__ - Step 6836: {'lr': 0.0004986844782467332, 'samples': 1312512, 'steps': 6835, 'loss/train': 1.6971626281738281} 11/06/2021 22:12:28 - INFO - __main__ - Step 6837: {'lr': 0.0004986839345022658, 'samples': 1312704, 'steps': 6836, 'loss/train': 1.80866539478302} 11/06/2021 22:12:29 - INFO - __main__ - Step 6838: {'lr': 0.0004986833906457453, 'samples': 1312896, 'steps': 6837, 'loss/train': 2.049255609512329} 11/06/2021 22:12:29 - INFO - __main__ - Step 6839: {'lr': 0.0004986828466771718, 'samples': 1313088, 'steps': 6838, 'loss/train': 2.1545591354370117} 11/06/2021 22:12:29 - INFO - __main__ - Step 6840: {'lr': 0.0004986823025965457, 'samples': 1313280, 'steps': 6839, 'loss/train': 1.8146533966064453} 11/06/2021 22:12:30 - INFO - __main__ - Step 6841: {'lr': 0.0004986817584038671, 'samples': 1313472, 'steps': 6840, 'loss/train': 1.6725200414657593} 11/06/2021 22:12:31 - INFO - __main__ - Step 6842: {'lr': 0.0004986812140991365, 'samples': 1313664, 'steps': 6841, 'loss/train': 2.010808229446411} 11/06/2021 22:12:31 - INFO - __main__ - Step 6843: {'lr': 0.0004986806696823538, 'samples': 1313856, 'steps': 6842, 'loss/train': 1.6977182626724243} 11/06/2021 22:12:31 - INFO - __main__ - Step 6844: {'lr': 0.0004986801251535195, 'samples': 1314048, 'steps': 6843, 'loss/train': 1.7137373685836792} 11/06/2021 22:12:32 - INFO - __main__ - Step 6845: {'lr': 0.0004986795805126339, 'samples': 1314240, 'steps': 6844, 'loss/train': 1.878450632095337} 11/06/2021 22:12:32 - INFO - __main__ - Step 6846: {'lr': 0.000498679035759697, 'samples': 1314432, 'steps': 6845, 'loss/train': 1.5944817066192627} 11/06/2021 22:12:33 - INFO - __main__ - Step 6847: {'lr': 0.0004986784908947091, 'samples': 1314624, 'steps': 6846, 'loss/train': 1.7078269720077515} 11/06/2021 22:12:34 - INFO - __main__ - Step 6848: {'lr': 0.0004986779459176706, 'samples': 1314816, 'steps': 6847, 'loss/train': 1.9563493728637695} 11/06/2021 22:12:34 - INFO - __main__ - Step 6849: {'lr': 0.0004986774008285816, 'samples': 1315008, 'steps': 6848, 'loss/train': 1.9744795560836792} 11/06/2021 22:12:34 - INFO - __main__ - Step 6850: {'lr': 0.0004986768556274425, 'samples': 1315200, 'steps': 6849, 'loss/train': 1.5663435459136963} 11/06/2021 22:12:35 - INFO - __main__ - Step 6851: {'lr': 0.0004986763103142533, 'samples': 1315392, 'steps': 6850, 'loss/train': 1.9012843370437622} 11/06/2021 22:12:36 - INFO - __main__ - Step 6852: {'lr': 0.0004986757648890145, 'samples': 1315584, 'steps': 6851, 'loss/train': 1.9131489992141724} 11/06/2021 22:12:36 - INFO - __main__ - Step 6853: {'lr': 0.0004986752193517262, 'samples': 1315776, 'steps': 6852, 'loss/train': 1.6437551975250244} 11/06/2021 22:12:36 - INFO - __main__ - Step 6854: {'lr': 0.0004986746737023887, 'samples': 1315968, 'steps': 6853, 'loss/train': 0.30247291922569275} 11/06/2021 22:12:37 - INFO - __main__ - Step 6855: {'lr': 0.0004986741279410023, 'samples': 1316160, 'steps': 6854, 'loss/train': 2.0035195350646973} 11/06/2021 22:12:37 - INFO - __main__ - Step 6856: {'lr': 0.000498673582067567, 'samples': 1316352, 'steps': 6855, 'loss/train': 1.685018539428711} 11/06/2021 22:12:38 - INFO - __main__ - Step 6857: {'lr': 0.0004986730360820833, 'samples': 1316544, 'steps': 6856, 'loss/train': 1.0697712898254395} 11/06/2021 22:12:39 - INFO - __main__ - Step 6858: {'lr': 0.0004986724899845514, 'samples': 1316736, 'steps': 6857, 'loss/train': 1.4516496658325195} 11/06/2021 22:12:39 - INFO - __main__ - Step 6859: {'lr': 0.0004986719437749716, 'samples': 1316928, 'steps': 6858, 'loss/train': 2.176100730895996} 11/06/2021 22:12:39 - INFO - __main__ - Step 6860: {'lr': 0.0004986713974533439, 'samples': 1317120, 'steps': 6859, 'loss/train': 1.1705691814422607} 11/06/2021 22:12:40 - INFO - __main__ - Step 6861: {'lr': 0.0004986708510196688, 'samples': 1317312, 'steps': 6860, 'loss/train': 1.852432131767273} 11/06/2021 22:12:40 - INFO - __main__ - Step 6862: {'lr': 0.0004986703044739464, 'samples': 1317504, 'steps': 6861, 'loss/train': 1.9012736082077026} 11/06/2021 22:12:41 - INFO - __main__ - Step 6863: {'lr': 0.000498669757816177, 'samples': 1317696, 'steps': 6862, 'loss/train': 2.041994333267212} 11/06/2021 22:12:41 - INFO - __main__ - Step 6864: {'lr': 0.0004986692110463609, 'samples': 1317888, 'steps': 6863, 'loss/train': 0.9249697327613831} 11/06/2021 22:12:42 - INFO - __main__ - Step 6865: {'lr': 0.0004986686641644982, 'samples': 1318080, 'steps': 6864, 'loss/train': 1.9790064096450806} 11/06/2021 22:12:42 - INFO - __main__ - Step 6866: {'lr': 0.0004986681171705893, 'samples': 1318272, 'steps': 6865, 'loss/train': 1.9213021993637085} 11/06/2021 22:12:42 - INFO - __main__ - Step 6867: {'lr': 0.0004986675700646343, 'samples': 1318464, 'steps': 6866, 'loss/train': 1.9628852605819702} 11/06/2021 22:12:43 - INFO - __main__ - Step 6868: {'lr': 0.0004986670228466337, 'samples': 1318656, 'steps': 6867, 'loss/train': 2.0977110862731934} 11/06/2021 22:12:44 - INFO - __main__ - Step 6869: {'lr': 0.0004986664755165874, 'samples': 1318848, 'steps': 6868, 'loss/train': 1.4509758949279785} 11/06/2021 22:12:44 - INFO - __main__ - Step 6870: {'lr': 0.000498665928074496, 'samples': 1319040, 'steps': 6869, 'loss/train': 1.9043452739715576} 11/06/2021 22:12:44 - INFO - __main__ - Step 6871: {'lr': 0.0004986653805203594, 'samples': 1319232, 'steps': 6870, 'loss/train': 1.188199520111084} 11/06/2021 22:12:45 - INFO - __main__ - Step 6872: {'lr': 0.0004986648328541781, 'samples': 1319424, 'steps': 6871, 'loss/train': 1.6279737949371338} 11/06/2021 22:12:46 - INFO - __main__ - Step 6873: {'lr': 0.0004986642850759522, 'samples': 1319616, 'steps': 6872, 'loss/train': 1.2891004085540771} 11/06/2021 22:12:46 - INFO - __main__ - Step 6874: {'lr': 0.0004986637371856822, 'samples': 1319808, 'steps': 6873, 'loss/train': 2.0982792377471924} 11/06/2021 22:12:47 - INFO - __main__ - Step 6875: {'lr': 0.000498663189183368, 'samples': 1320000, 'steps': 6874, 'loss/train': 1.6079758405685425} 11/06/2021 22:12:47 - INFO - __main__ - Step 6876: {'lr': 0.0004986626410690099, 'samples': 1320192, 'steps': 6875, 'loss/train': 1.3328086137771606} 11/06/2021 22:12:47 - INFO - __main__ - Step 6877: {'lr': 0.0004986620928426085, 'samples': 1320384, 'steps': 6876, 'loss/train': 1.9018620252609253} 11/06/2021 22:12:48 - INFO - __main__ - Step 6878: {'lr': 0.0004986615445041636, 'samples': 1320576, 'steps': 6877, 'loss/train': 1.5764297246932983} 11/06/2021 22:12:49 - INFO - __main__ - Step 6879: {'lr': 0.0004986609960536757, 'samples': 1320768, 'steps': 6878, 'loss/train': 1.831556797027588} 11/06/2021 22:12:49 - INFO - __main__ - Step 6880: {'lr': 0.000498660447491145, 'samples': 1320960, 'steps': 6879, 'loss/train': 1.3842569589614868} 11/06/2021 22:12:49 - INFO - __main__ - Step 6881: {'lr': 0.0004986598988165718, 'samples': 1321152, 'steps': 6880, 'loss/train': 2.5729262828826904} 11/06/2021 22:12:50 - INFO - __main__ - Step 6882: {'lr': 0.0004986593500299562, 'samples': 1321344, 'steps': 6881, 'loss/train': 2.3645272254943848} 11/06/2021 22:12:51 - INFO - __main__ - Step 6883: {'lr': 0.0004986588011312986, 'samples': 1321536, 'steps': 6882, 'loss/train': 1.524306058883667} 11/06/2021 22:12:51 - INFO - __main__ - Step 6884: {'lr': 0.0004986582521205992, 'samples': 1321728, 'steps': 6883, 'loss/train': 2.1317756175994873} 11/06/2021 22:12:51 - INFO - __main__ - Step 6885: {'lr': 0.0004986577029978581, 'samples': 1321920, 'steps': 6884, 'loss/train': 2.3129236698150635} 11/06/2021 22:12:52 - INFO - __main__ - Step 6886: {'lr': 0.0004986571537630757, 'samples': 1322112, 'steps': 6885, 'loss/train': 1.8439607620239258} 11/06/2021 22:12:52 - INFO - __main__ - Step 6887: {'lr': 0.0004986566044162523, 'samples': 1322304, 'steps': 6886, 'loss/train': 2.0930988788604736} 11/06/2021 22:12:53 - INFO - __main__ - Step 6888: {'lr': 0.0004986560549573881, 'samples': 1322496, 'steps': 6887, 'loss/train': 2.0430047512054443} 11/06/2021 22:12:53 - INFO - __main__ - Step 6889: {'lr': 0.0004986555053864833, 'samples': 1322688, 'steps': 6888, 'loss/train': 1.7372727394104004} 11/06/2021 22:12:54 - INFO - __main__ - Step 6890: {'lr': 0.0004986549557035381, 'samples': 1322880, 'steps': 6889, 'loss/train': 1.7419931888580322} 11/06/2021 22:12:54 - INFO - __main__ - Step 6891: {'lr': 0.0004986544059085528, 'samples': 1323072, 'steps': 6890, 'loss/train': 1.3079571723937988} 11/06/2021 22:12:55 - INFO - __main__ - Step 6892: {'lr': 0.0004986538560015277, 'samples': 1323264, 'steps': 6891, 'loss/train': 2.068556070327759} 11/06/2021 22:12:56 - INFO - __main__ - Step 6893: {'lr': 0.000498653305982463, 'samples': 1323456, 'steps': 6892, 'loss/train': 1.350880742073059} 11/06/2021 22:12:56 - INFO - __main__ - Step 6894: {'lr': 0.0004986527558513591, 'samples': 1323648, 'steps': 6893, 'loss/train': 1.4500173330307007} 11/06/2021 22:12:56 - INFO - __main__ - Step 6895: {'lr': 0.0004986522056082159, 'samples': 1323840, 'steps': 6894, 'loss/train': 2.146724224090576} 11/06/2021 22:12:57 - INFO - __main__ - Step 6896: {'lr': 0.0004986516552530339, 'samples': 1324032, 'steps': 6895, 'loss/train': 1.203369140625} 11/06/2021 22:12:57 - INFO - __main__ - Step 6897: {'lr': 0.0004986511047858134, 'samples': 1324224, 'steps': 6896, 'loss/train': 2.2133257389068604} 11/06/2021 22:12:57 - INFO - __main__ - Step 6898: {'lr': 0.0004986505542065545, 'samples': 1324416, 'steps': 6897, 'loss/train': 2.0129928588867188} 11/06/2021 22:12:58 - INFO - __main__ - Step 6899: {'lr': 0.0004986500035152574, 'samples': 1324608, 'steps': 6898, 'loss/train': 2.3049161434173584} 11/06/2021 22:12:59 - INFO - __main__ - Step 6900: {'lr': 0.0004986494527119226, 'samples': 1324800, 'steps': 6899, 'loss/train': 1.816585898399353} 11/06/2021 22:12:59 - INFO - __main__ - Step 6901: {'lr': 0.0004986489017965501, 'samples': 1324992, 'steps': 6900, 'loss/train': 1.263410210609436} 11/06/2021 22:12:59 - INFO - __main__ - Step 6902: {'lr': 0.0004986483507691403, 'samples': 1325184, 'steps': 6901, 'loss/train': 1.9309344291687012} 11/06/2021 22:13:00 - INFO - __main__ - Step 6903: {'lr': 0.0004986477996296934, 'samples': 1325376, 'steps': 6902, 'loss/train': 1.9613704681396484} 11/06/2021 22:13:01 - INFO - __main__ - Step 6904: {'lr': 0.0004986472483782096, 'samples': 1325568, 'steps': 6903, 'loss/train': 1.877648115158081} 11/06/2021 22:13:01 - INFO - __main__ - Step 6905: {'lr': 0.0004986466970146891, 'samples': 1325760, 'steps': 6904, 'loss/train': 1.900101900100708} 11/06/2021 22:13:01 - INFO - __main__ - Step 6906: {'lr': 0.0004986461455391323, 'samples': 1325952, 'steps': 6905, 'loss/train': 2.0257797241210938} 11/06/2021 22:13:02 - INFO - __main__ - Step 6907: {'lr': 0.0004986455939515395, 'samples': 1326144, 'steps': 6906, 'loss/train': 1.7874330282211304} 11/06/2021 22:13:02 - INFO - __main__ - Step 6908: {'lr': 0.0004986450422519107, 'samples': 1326336, 'steps': 6907, 'loss/train': 1.842711091041565} 11/06/2021 22:13:03 - INFO - __main__ - Step 6909: {'lr': 0.0004986444904402463, 'samples': 1326528, 'steps': 6908, 'loss/train': 2.1983108520507812} 11/06/2021 22:13:03 - INFO - __main__ - Step 6910: {'lr': 0.0004986439385165464, 'samples': 1326720, 'steps': 6909, 'loss/train': 1.821173906326294} 11/06/2021 22:13:04 - INFO - __main__ - Step 6911: {'lr': 0.0004986433864808115, 'samples': 1326912, 'steps': 6910, 'loss/train': 2.3128411769866943} 11/06/2021 22:13:04 - INFO - __main__ - Step 6912: {'lr': 0.0004986428343330418, 'samples': 1327104, 'steps': 6911, 'loss/train': 1.7972272634506226} 11/06/2021 22:13:05 - INFO - __main__ - Step 6913: {'lr': 0.0004986422820732375, 'samples': 1327296, 'steps': 6912, 'loss/train': 2.888493776321411} 11/06/2021 22:13:05 - INFO - __main__ - Step 6914: {'lr': 0.0004986417297013987, 'samples': 1327488, 'steps': 6913, 'loss/train': 1.6202863454818726} 11/06/2021 22:13:06 - INFO - __main__ - Step 6915: {'lr': 0.0004986411772175258, 'samples': 1327680, 'steps': 6914, 'loss/train': 1.704667329788208} 11/06/2021 22:13:06 - INFO - __main__ - Step 6916: {'lr': 0.000498640624621619, 'samples': 1327872, 'steps': 6915, 'loss/train': 1.571314811706543} 11/06/2021 22:13:07 - INFO - __main__ - Step 6917: {'lr': 0.0004986400719136786, 'samples': 1328064, 'steps': 6916, 'loss/train': 1.8842869997024536} 11/06/2021 22:13:07 - INFO - __main__ - Step 6918: {'lr': 0.0004986395190937048, 'samples': 1328256, 'steps': 6917, 'loss/train': 1.7865066528320312} 11/06/2021 22:13:08 - INFO - __main__ - Step 6919: {'lr': 0.000498638966161698, 'samples': 1328448, 'steps': 6918, 'loss/train': 1.9803627729415894} 11/06/2021 22:13:08 - INFO - __main__ - Step 6920: {'lr': 0.0004986384131176583, 'samples': 1328640, 'steps': 6919, 'loss/train': 2.02374529838562} 11/06/2021 22:13:09 - INFO - __main__ - Step 6921: {'lr': 0.0004986378599615858, 'samples': 1328832, 'steps': 6920, 'loss/train': 2.2463250160217285} 11/06/2021 22:13:09 - INFO - __main__ - Step 6922: {'lr': 0.000498637306693481, 'samples': 1329024, 'steps': 6921, 'loss/train': 1.9036977291107178} 11/06/2021 22:13:09 - INFO - __main__ - Step 6923: {'lr': 0.0004986367533133441, 'samples': 1329216, 'steps': 6922, 'loss/train': 1.8560709953308105} 11/06/2021 22:13:10 - INFO - __main__ - Step 6924: {'lr': 0.0004986361998211752, 'samples': 1329408, 'steps': 6923, 'loss/train': 1.860509991645813} 11/06/2021 22:13:11 - INFO - __main__ - Step 6925: {'lr': 0.0004986356462169748, 'samples': 1329600, 'steps': 6924, 'loss/train': 1.627434253692627} 11/06/2021 22:13:11 - INFO - __main__ - Step 6926: {'lr': 0.0004986350925007429, 'samples': 1329792, 'steps': 6925, 'loss/train': 1.9676358699798584} 11/06/2021 22:13:11 - INFO - __main__ - Step 6927: {'lr': 0.00049863453867248, 'samples': 1329984, 'steps': 6926, 'loss/train': 1.5846327543258667} 11/06/2021 22:13:12 - INFO - __main__ - Step 6928: {'lr': 0.0004986339847321862, 'samples': 1330176, 'steps': 6927, 'loss/train': 2.3178396224975586} 11/06/2021 22:13:13 - INFO - __main__ - Step 6929: {'lr': 0.0004986334306798616, 'samples': 1330368, 'steps': 6928, 'loss/train': 2.166189432144165} 11/06/2021 22:13:13 - INFO - __main__ - Step 6930: {'lr': 0.0004986328765155068, 'samples': 1330560, 'steps': 6929, 'loss/train': 2.235844373703003} 11/06/2021 22:13:13 - INFO - __main__ - Step 6931: {'lr': 0.0004986323222391217, 'samples': 1330752, 'steps': 6930, 'loss/train': 1.5991435050964355} 11/06/2021 22:13:14 - INFO - __main__ - Step 6932: {'lr': 0.0004986317678507069, 'samples': 1330944, 'steps': 6931, 'loss/train': 1.773486852645874} 11/06/2021 22:13:14 - INFO - __main__ - Step 6933: {'lr': 0.0004986312133502623, 'samples': 1331136, 'steps': 6932, 'loss/train': 0.26625779271125793} 11/06/2021 22:13:15 - INFO - __main__ - Step 6934: {'lr': 0.0004986306587377884, 'samples': 1331328, 'steps': 6933, 'loss/train': 1.767874836921692} 11/06/2021 22:13:16 - INFO - __main__ - Step 6935: {'lr': 0.0004986301040132853, 'samples': 1331520, 'steps': 6934, 'loss/train': 1.564034342765808} 11/06/2021 22:13:16 - INFO - __main__ - Step 6936: {'lr': 0.0004986295491767533, 'samples': 1331712, 'steps': 6935, 'loss/train': 1.7570186853408813} 11/06/2021 22:13:16 - INFO - __main__ - Step 6937: {'lr': 0.0004986289942281927, 'samples': 1331904, 'steps': 6936, 'loss/train': 1.7670789957046509} 11/06/2021 22:13:17 - INFO - __main__ - Step 6938: {'lr': 0.0004986284391676037, 'samples': 1332096, 'steps': 6937, 'loss/train': 2.3007538318634033} 11/06/2021 22:13:18 - INFO - __main__ - Step 6939: {'lr': 0.0004986278839949866, 'samples': 1332288, 'steps': 6938, 'loss/train': 2.0607035160064697} 11/06/2021 22:13:18 - INFO - __main__ - Step 6940: {'lr': 0.0004986273287103416, 'samples': 1332480, 'steps': 6939, 'loss/train': 2.072021484375} 11/06/2021 22:13:18 - INFO - __main__ - Step 6941: {'lr': 0.0004986267733136689, 'samples': 1332672, 'steps': 6940, 'loss/train': 1.5899384021759033} 11/06/2021 22:13:19 - INFO - __main__ - Step 6942: {'lr': 0.0004986262178049689, 'samples': 1332864, 'steps': 6941, 'loss/train': 2.349639415740967} 11/06/2021 22:13:19 - INFO - __main__ - Step 6943: {'lr': 0.0004986256621842417, 'samples': 1333056, 'steps': 6942, 'loss/train': 1.3075493574142456} 11/06/2021 22:13:20 - INFO - __main__ - Step 6944: {'lr': 0.0004986251064514878, 'samples': 1333248, 'steps': 6943, 'loss/train': 1.4353903532028198} 11/06/2021 22:13:21 - INFO - __main__ - Step 6945: {'lr': 0.000498624550606707, 'samples': 1333440, 'steps': 6944, 'loss/train': 2.1542229652404785} 11/06/2021 22:13:21 - INFO - __main__ - Step 6946: {'lr': 0.0004986239946498999, 'samples': 1333632, 'steps': 6945, 'loss/train': 1.9125726222991943} 11/06/2021 22:13:21 - INFO - __main__ - Step 6947: {'lr': 0.0004986234385810668, 'samples': 1333824, 'steps': 6946, 'loss/train': 1.9208937883377075} 11/06/2021 22:13:22 - INFO - __main__ - Step 6948: {'lr': 0.0004986228824002076, 'samples': 1334016, 'steps': 6947, 'loss/train': 1.7958143949508667} 11/06/2021 22:13:23 - INFO - __main__ - Step 6949: {'lr': 0.0004986223261073228, 'samples': 1334208, 'steps': 6948, 'loss/train': 2.022138833999634} 11/06/2021 22:13:23 - INFO - __main__ - Step 6950: {'lr': 0.0004986217697024128, 'samples': 1334400, 'steps': 6949, 'loss/train': 2.049771785736084} 11/06/2021 22:13:23 - INFO - __main__ - Step 6951: {'lr': 0.0004986212131854775, 'samples': 1334592, 'steps': 6950, 'loss/train': 1.7569395303726196} 11/06/2021 22:13:24 - INFO - __main__ - Step 6952: {'lr': 0.0004986206565565173, 'samples': 1334784, 'steps': 6951, 'loss/train': 1.6181141138076782} 11/06/2021 22:13:24 - INFO - __main__ - Step 6953: {'lr': 0.0004986200998155325, 'samples': 1334976, 'steps': 6952, 'loss/train': 2.3144214153289795} 11/06/2021 22:13:24 - INFO - __main__ - Step 6954: {'lr': 0.0004986195429625234, 'samples': 1335168, 'steps': 6953, 'loss/train': 1.8186416625976562} 11/06/2021 22:13:25 - INFO - __main__ - Step 6955: {'lr': 0.0004986189859974901, 'samples': 1335360, 'steps': 6954, 'loss/train': 1.4414639472961426} 11/06/2021 22:13:26 - INFO - __main__ - Step 6956: {'lr': 0.000498618428920433, 'samples': 1335552, 'steps': 6955, 'loss/train': 1.4519563913345337} 11/06/2021 22:13:26 - INFO - __main__ - Step 6957: {'lr': 0.0004986178717313522, 'samples': 1335744, 'steps': 6956, 'loss/train': 1.7825121879577637} 11/06/2021 22:13:26 - INFO - __main__ - Step 6958: {'lr': 0.000498617314430248, 'samples': 1335936, 'steps': 6957, 'loss/train': 2.2233121395111084} 11/06/2021 22:13:27 - INFO - __main__ - Step 6959: {'lr': 0.0004986167570171208, 'samples': 1336128, 'steps': 6958, 'loss/train': 2.0718345642089844} 11/06/2021 22:13:28 - INFO - __main__ - Step 6960: {'lr': 0.0004986161994919706, 'samples': 1336320, 'steps': 6959, 'loss/train': 1.547726035118103} 11/06/2021 22:13:28 - INFO - __main__ - Step 6961: {'lr': 0.0004986156418547978, 'samples': 1336512, 'steps': 6960, 'loss/train': 2.062274694442749} 11/06/2021 22:13:29 - INFO - __main__ - Step 6962: {'lr': 0.0004986150841056027, 'samples': 1336704, 'steps': 6961, 'loss/train': 2.067039966583252} 11/06/2021 22:13:29 - INFO - __main__ - Step 6963: {'lr': 0.0004986145262443854, 'samples': 1336896, 'steps': 6962, 'loss/train': 1.8536887168884277} 11/06/2021 22:13:29 - INFO - __main__ - Step 6964: {'lr': 0.0004986139682711463, 'samples': 1337088, 'steps': 6963, 'loss/train': 2.060143232345581} 11/06/2021 22:13:31 - INFO - __main__ - Step 6965: {'lr': 0.0004986134101858854, 'samples': 1337280, 'steps': 6964, 'loss/train': 1.9564449787139893} 11/06/2021 22:13:31 - INFO - __main__ - Step 6966: {'lr': 0.0004986128519886033, 'samples': 1337472, 'steps': 6965, 'loss/train': 2.0002098083496094} 11/06/2021 22:13:31 - INFO - __main__ - Step 6967: {'lr': 0.0004986122936793, 'samples': 1337664, 'steps': 6966, 'loss/train': 1.0394262075424194} 11/06/2021 22:13:32 - INFO - __main__ - Step 6968: {'lr': 0.000498611735257976, 'samples': 1337856, 'steps': 6967, 'loss/train': 1.8004080057144165} 11/06/2021 22:13:32 - INFO - __main__ - Step 6969: {'lr': 0.0004986111767246313, 'samples': 1338048, 'steps': 6968, 'loss/train': 1.8207870721817017} 11/06/2021 22:13:33 - INFO - __main__ - Step 6970: {'lr': 0.0004986106180792662, 'samples': 1338240, 'steps': 6969, 'loss/train': 2.2126078605651855} 11/06/2021 22:13:33 - INFO - __main__ - Step 6971: {'lr': 0.000498610059321881, 'samples': 1338432, 'steps': 6970, 'loss/train': 1.8455476760864258} 11/06/2021 22:13:34 - INFO - __main__ - Step 6972: {'lr': 0.000498609500452476, 'samples': 1338624, 'steps': 6971, 'loss/train': 1.8543205261230469} 11/06/2021 22:13:34 - INFO - __main__ - Step 6973: {'lr': 0.0004986089414710513, 'samples': 1338816, 'steps': 6972, 'loss/train': 2.0430335998535156} 11/06/2021 22:13:35 - INFO - __main__ - Step 6974: {'lr': 0.0004986083823776073, 'samples': 1339008, 'steps': 6973, 'loss/train': 1.9079920053482056} 11/06/2021 22:13:35 - INFO - __main__ - Step 6975: {'lr': 0.0004986078231721443, 'samples': 1339200, 'steps': 6974, 'loss/train': 0.6376028060913086} 11/06/2021 22:13:36 - INFO - __main__ - Step 6976: {'lr': 0.0004986072638546623, 'samples': 1339392, 'steps': 6975, 'loss/train': 1.7745708227157593} 11/06/2021 22:13:36 - INFO - __main__ - Step 6977: {'lr': 0.0004986067044251617, 'samples': 1339584, 'steps': 6976, 'loss/train': 2.010833740234375} 11/06/2021 22:13:37 - INFO - __main__ - Step 6978: {'lr': 0.0004986061448836428, 'samples': 1339776, 'steps': 6977, 'loss/train': 1.629277229309082} 11/06/2021 22:13:37 - INFO - __main__ - Step 6979: {'lr': 0.0004986055852301058, 'samples': 1339968, 'steps': 6978, 'loss/train': 1.9664230346679688} 11/06/2021 22:13:39 - INFO - __main__ - Step 6980: {'lr': 0.000498605025464551, 'samples': 1340160, 'steps': 6979, 'loss/train': 1.915958046913147} 11/06/2021 22:13:39 - INFO - __main__ - Step 6981: {'lr': 0.0004986044655869786, 'samples': 1340352, 'steps': 6980, 'loss/train': 1.7974884510040283} 11/06/2021 22:13:40 - INFO - __main__ - Step 6982: {'lr': 0.0004986039055973889, 'samples': 1340544, 'steps': 6981, 'loss/train': 1.9128649234771729} 11/06/2021 22:13:40 - INFO - __main__ - Step 6983: {'lr': 0.000498603345495782, 'samples': 1340736, 'steps': 6982, 'loss/train': 1.8619794845581055} 11/06/2021 22:13:40 - INFO - __main__ - Step 6984: {'lr': 0.0004986027852821583, 'samples': 1340928, 'steps': 6983, 'loss/train': 1.3572192192077637} 11/06/2021 22:13:41 - INFO - __main__ - Step 6985: {'lr': 0.000498602224956518, 'samples': 1341120, 'steps': 6984, 'loss/train': 1.4499187469482422} 11/06/2021 22:13:41 - INFO - __main__ - Step 6986: {'lr': 0.0004986016645188615, 'samples': 1341312, 'steps': 6985, 'loss/train': 1.4657028913497925} 11/06/2021 22:13:41 - INFO - __main__ - Step 6987: {'lr': 0.0004986011039691889, 'samples': 1341504, 'steps': 6986, 'loss/train': 2.016274929046631} 11/06/2021 22:13:43 - INFO - __main__ - Step 6988: {'lr': 0.0004986005433075004, 'samples': 1341696, 'steps': 6987, 'loss/train': 2.058478355407715} 11/06/2021 22:13:43 - INFO - __main__ - Step 6989: {'lr': 0.0004985999825337964, 'samples': 1341888, 'steps': 6988, 'loss/train': 2.3388593196868896} 11/06/2021 22:13:43 - INFO - __main__ - Step 6990: {'lr': 0.000498599421648077, 'samples': 1342080, 'steps': 6989, 'loss/train': 1.34577214717865} 11/06/2021 22:13:44 - INFO - __main__ - Step 6991: {'lr': 0.0004985988606503426, 'samples': 1342272, 'steps': 6990, 'loss/train': 1.556723713874817} 11/06/2021 22:13:44 - INFO - __main__ - Step 6992: {'lr': 0.0004985982995405933, 'samples': 1342464, 'steps': 6991, 'loss/train': 1.9663342237472534} 11/06/2021 22:13:45 - INFO - __main__ - Step 6993: {'lr': 0.0004985977383188296, 'samples': 1342656, 'steps': 6992, 'loss/train': 1.9654285907745361} 11/06/2021 22:13:45 - INFO - __main__ - Step 6994: {'lr': 0.0004985971769850515, 'samples': 1342848, 'steps': 6993, 'loss/train': 1.9241557121276855} 11/06/2021 22:13:46 - INFO - __main__ - Step 6995: {'lr': 0.0004985966155392593, 'samples': 1343040, 'steps': 6994, 'loss/train': 2.185478687286377} 11/06/2021 22:13:46 - INFO - __main__ - Step 6996: {'lr': 0.0004985960539814534, 'samples': 1343232, 'steps': 6995, 'loss/train': 1.901774287223816} 11/06/2021 22:13:46 - INFO - __main__ - Step 6997: {'lr': 0.000498595492311634, 'samples': 1343424, 'steps': 6996, 'loss/train': 1.8927873373031616} 11/06/2021 22:13:47 - INFO - __main__ - Step 6998: {'lr': 0.0004985949305298012, 'samples': 1343616, 'steps': 6997, 'loss/train': 1.5036976337432861} 11/06/2021 22:13:48 - INFO - __main__ - Step 6999: {'lr': 0.0004985943686359554, 'samples': 1343808, 'steps': 6998, 'loss/train': 2.1838197708129883} 11/06/2021 22:13:48 - INFO - __main__ - Step 7000: {'lr': 0.0004985938066300968, 'samples': 1344000, 'steps': 6999, 'loss/train': 1.8023282289505005} 11/06/2021 22:13:48 - INFO - __main__ - Step 7001: {'lr': 0.0004985932445122257, 'samples': 1344192, 'steps': 7000, 'loss/train': 0.9377233982086182} 11/06/2021 22:13:49 - INFO - __main__ - Step 7002: {'lr': 0.0004985926822823422, 'samples': 1344384, 'steps': 7001, 'loss/train': 2.0790352821350098} 11/06/2021 22:13:49 - INFO - __main__ - Step 7003: {'lr': 0.0004985921199404467, 'samples': 1344576, 'steps': 7002, 'loss/train': 1.5738434791564941} 11/06/2021 22:13:50 - INFO - __main__ - Step 7004: {'lr': 0.0004985915574865395, 'samples': 1344768, 'steps': 7003, 'loss/train': 2.0488102436065674} 11/06/2021 22:13:50 - INFO - __main__ - Step 7005: {'lr': 0.0004985909949206209, 'samples': 1344960, 'steps': 7004, 'loss/train': 2.2131500244140625} 11/06/2021 22:13:51 - INFO - __main__ - Step 7006: {'lr': 0.0004985904322426909, 'samples': 1345152, 'steps': 7005, 'loss/train': 2.16741681098938} 11/06/2021 22:13:51 - INFO - __main__ - Step 7007: {'lr': 0.0004985898694527498, 'samples': 1345344, 'steps': 7006, 'loss/train': 1.9892717599868774} 11/06/2021 22:13:52 - INFO - __main__ - Step 7008: {'lr': 0.000498589306550798, 'samples': 1345536, 'steps': 7007, 'loss/train': 1.8207341432571411} 11/06/2021 22:13:53 - INFO - __main__ - Step 7009: {'lr': 0.0004985887435368357, 'samples': 1345728, 'steps': 7008, 'loss/train': 2.600703716278076} 11/06/2021 22:13:53 - INFO - __main__ - Step 7010: {'lr': 0.0004985881804108632, 'samples': 1345920, 'steps': 7009, 'loss/train': 1.830875277519226} 11/06/2021 22:13:53 - INFO - __main__ - Step 7011: {'lr': 0.0004985876171728807, 'samples': 1346112, 'steps': 7010, 'loss/train': 1.6429589986801147} 11/06/2021 22:13:54 - INFO - __main__ - Step 7012: {'lr': 0.0004985870538228884, 'samples': 1346304, 'steps': 7011, 'loss/train': 1.831001877784729} 11/06/2021 22:13:54 - INFO - __main__ - Step 7013: {'lr': 0.0004985864903608866, 'samples': 1346496, 'steps': 7012, 'loss/train': 1.9762578010559082} 11/06/2021 22:13:55 - INFO - __main__ - Step 7014: {'lr': 0.0004985859267868756, 'samples': 1346688, 'steps': 7013, 'loss/train': 2.1162519454956055} 11/06/2021 22:13:55 - INFO - __main__ - Step 7015: {'lr': 0.0004985853631008557, 'samples': 1346880, 'steps': 7014, 'loss/train': 1.9965474605560303} 11/06/2021 22:13:56 - INFO - __main__ - Step 7016: {'lr': 0.000498584799302827, 'samples': 1347072, 'steps': 7015, 'loss/train': 1.330285906791687} 11/06/2021 22:13:56 - INFO - __main__ - Step 7017: {'lr': 0.0004985842353927897, 'samples': 1347264, 'steps': 7016, 'loss/train': 1.870910406112671} 11/06/2021 22:13:57 - INFO - __main__ - Step 7018: {'lr': 0.0004985836713707443, 'samples': 1347456, 'steps': 7017, 'loss/train': 2.007260799407959} 11/06/2021 22:13:58 - INFO - __main__ - Step 7019: {'lr': 0.000498583107236691, 'samples': 1347648, 'steps': 7018, 'loss/train': 1.9178532361984253} 11/06/2021 22:13:58 - INFO - __main__ - Step 7020: {'lr': 0.0004985825429906299, 'samples': 1347840, 'steps': 7019, 'loss/train': 1.8477237224578857} 11/06/2021 22:13:58 - INFO - __main__ - Step 7021: {'lr': 0.0004985819786325614, 'samples': 1348032, 'steps': 7020, 'loss/train': 1.7012065649032593} 11/06/2021 22:13:59 - INFO - __main__ - Step 7022: {'lr': 0.0004985814141624856, 'samples': 1348224, 'steps': 7021, 'loss/train': 1.9966686964035034} 11/06/2021 22:13:59 - INFO - __main__ - Step 7023: {'lr': 0.000498580849580403, 'samples': 1348416, 'steps': 7022, 'loss/train': 1.7305803298950195} 11/06/2021 22:13:59 - INFO - __main__ - Step 7024: {'lr': 0.0004985802848863135, 'samples': 1348608, 'steps': 7023, 'loss/train': 2.0270166397094727} 11/06/2021 22:14:00 - INFO - __main__ - Step 7025: {'lr': 0.0004985797200802176, 'samples': 1348800, 'steps': 7024, 'loss/train': 2.123034715652466} 11/06/2021 22:14:01 - INFO - __main__ - Step 7026: {'lr': 0.0004985791551621158, 'samples': 1348992, 'steps': 7025, 'loss/train': 1.589381456375122} 11/06/2021 22:14:01 - INFO - __main__ - Step 7027: {'lr': 0.0004985785901320078, 'samples': 1349184, 'steps': 7026, 'loss/train': 1.8792399168014526} 11/06/2021 22:14:02 - INFO - __main__ - Step 7028: {'lr': 0.0004985780249898941, 'samples': 1349376, 'steps': 7027, 'loss/train': 1.8955899477005005} 11/06/2021 22:14:02 - INFO - __main__ - Step 7029: {'lr': 0.0004985774597357751, 'samples': 1349568, 'steps': 7028, 'loss/train': 2.0755574703216553} 11/06/2021 22:14:03 - INFO - __main__ - Step 7030: {'lr': 0.0004985768943696509, 'samples': 1349760, 'steps': 7029, 'loss/train': 1.896838903427124} 11/06/2021 22:14:03 - INFO - __main__ - Step 7031: {'lr': 0.0004985763288915217, 'samples': 1349952, 'steps': 7030, 'loss/train': 2.0868396759033203} 11/06/2021 22:14:04 - INFO - __main__ - Step 7032: {'lr': 0.0004985757633013879, 'samples': 1350144, 'steps': 7031, 'loss/train': 1.9611470699310303} 11/06/2021 22:14:04 - INFO - __main__ - Step 7033: {'lr': 0.0004985751975992497, 'samples': 1350336, 'steps': 7032, 'loss/train': 2.039046287536621} 11/06/2021 22:14:04 - INFO - __main__ - Step 7034: {'lr': 0.0004985746317851074, 'samples': 1350528, 'steps': 7033, 'loss/train': 2.1592774391174316} 11/06/2021 22:14:05 - INFO - __main__ - Step 7035: {'lr': 0.0004985740658589612, 'samples': 1350720, 'steps': 7034, 'loss/train': 1.7554975748062134} 11/06/2021 22:14:06 - INFO - __main__ - Step 7036: {'lr': 0.0004985734998208112, 'samples': 1350912, 'steps': 7035, 'loss/train': 1.64145028591156} 11/06/2021 22:14:06 - INFO - __main__ - Step 7037: {'lr': 0.000498572933670658, 'samples': 1351104, 'steps': 7036, 'loss/train': 1.8642250299453735} 11/06/2021 22:14:06 - INFO - __main__ - Step 7038: {'lr': 0.0004985723674085016, 'samples': 1351296, 'steps': 7037, 'loss/train': 1.789941430091858} 11/06/2021 22:14:07 - INFO - __main__ - Step 7039: {'lr': 0.0004985718010343424, 'samples': 1351488, 'steps': 7038, 'loss/train': 2.0578839778900146} 11/06/2021 22:14:08 - INFO - __main__ - Step 7040: {'lr': 0.0004985712345481805, 'samples': 1351680, 'steps': 7039, 'loss/train': 1.8973031044006348} 11/06/2021 22:14:08 - INFO - __main__ - Step 7041: {'lr': 0.0004985706679500163, 'samples': 1351872, 'steps': 7040, 'loss/train': 2.2261483669281006} 11/06/2021 22:14:08 - INFO - __main__ - Step 7042: {'lr': 0.0004985701012398499, 'samples': 1352064, 'steps': 7041, 'loss/train': 2.026362657546997} 11/06/2021 22:14:09 - INFO - __main__ - Step 7043: {'lr': 0.0004985695344176817, 'samples': 1352256, 'steps': 7042, 'loss/train': 2.370086669921875} 11/06/2021 22:14:09 - INFO - __main__ - Step 7044: {'lr': 0.0004985689674835119, 'samples': 1352448, 'steps': 7043, 'loss/train': 1.789720058441162} 11/06/2021 22:14:10 - INFO - __main__ - Step 7045: {'lr': 0.0004985684004373409, 'samples': 1352640, 'steps': 7044, 'loss/train': 1.5516489744186401} 11/06/2021 22:14:10 - INFO - __main__ - Step 7046: {'lr': 0.0004985678332791686, 'samples': 1352832, 'steps': 7045, 'loss/train': 1.1448746919631958} 11/06/2021 22:14:11 - INFO - __main__ - Step 7047: {'lr': 0.0004985672660089956, 'samples': 1353024, 'steps': 7046, 'loss/train': 1.9174386262893677} 11/06/2021 22:14:11 - INFO - __main__ - Step 7048: {'lr': 0.000498566698626822, 'samples': 1353216, 'steps': 7047, 'loss/train': 2.0857248306274414} 11/06/2021 22:14:12 - INFO - __main__ - Step 7049: {'lr': 0.000498566131132648, 'samples': 1353408, 'steps': 7048, 'loss/train': 1.5200681686401367} 11/06/2021 22:14:13 - INFO - __main__ - Step 7050: {'lr': 0.0004985655635264739, 'samples': 1353600, 'steps': 7049, 'loss/train': 2.0432159900665283} 11/06/2021 22:14:13 - INFO - __main__ - Step 7051: {'lr': 0.0004985649958083001, 'samples': 1353792, 'steps': 7050, 'loss/train': 2.034838914871216} 11/06/2021 22:14:13 - INFO - __main__ - Step 7052: {'lr': 0.0004985644279781268, 'samples': 1353984, 'steps': 7051, 'loss/train': 2.010187864303589} 11/06/2021 22:14:14 - INFO - __main__ - Step 7053: {'lr': 0.0004985638600359542, 'samples': 1354176, 'steps': 7052, 'loss/train': 2.2024543285369873} 11/06/2021 22:14:14 - INFO - __main__ - Step 7054: {'lr': 0.0004985632919817824, 'samples': 1354368, 'steps': 7053, 'loss/train': 2.1596481800079346} 11/06/2021 22:14:14 - INFO - __main__ - Step 7055: {'lr': 0.000498562723815612, 'samples': 1354560, 'steps': 7054, 'loss/train': 1.9444608688354492} 11/06/2021 22:14:15 - INFO - __main__ - Step 7056: {'lr': 0.000498562155537443, 'samples': 1354752, 'steps': 7055, 'loss/train': 1.5930293798446655} 11/06/2021 22:14:16 - INFO - __main__ - Step 7057: {'lr': 0.0004985615871472757, 'samples': 1354944, 'steps': 7056, 'loss/train': 2.356843948364258} 11/06/2021 22:14:16 - INFO - __main__ - Step 7058: {'lr': 0.0004985610186451104, 'samples': 1355136, 'steps': 7057, 'loss/train': 2.400442123413086} 11/06/2021 22:14:16 - INFO - __main__ - Step 7059: {'lr': 0.0004985604500309473, 'samples': 1355328, 'steps': 7058, 'loss/train': 1.9683334827423096} 11/06/2021 22:14:17 - INFO - __main__ - Step 7060: {'lr': 0.0004985598813047868, 'samples': 1355520, 'steps': 7059, 'loss/train': 2.3564374446868896} 11/06/2021 22:14:18 - INFO - __main__ - Step 7061: {'lr': 0.000498559312466629, 'samples': 1355712, 'steps': 7060, 'loss/train': 2.06729793548584} 11/06/2021 22:14:18 - INFO - __main__ - Step 7062: {'lr': 0.0004985587435164742, 'samples': 1355904, 'steps': 7061, 'loss/train': 2.2818222045898438} 11/06/2021 22:14:19 - INFO - __main__ - Step 7063: {'lr': 0.0004985581744543226, 'samples': 1356096, 'steps': 7062, 'loss/train': 2.8262219429016113} 11/06/2021 22:14:19 - INFO - __main__ - Step 7064: {'lr': 0.0004985576052801747, 'samples': 1356288, 'steps': 7063, 'loss/train': 2.252566337585449} 11/06/2021 22:14:19 - INFO - __main__ - Step 7065: {'lr': 0.0004985570359940304, 'samples': 1356480, 'steps': 7064, 'loss/train': 1.9672572612762451} 11/06/2021 22:14:20 - INFO - __main__ - Step 7066: {'lr': 0.0004985564665958901, 'samples': 1356672, 'steps': 7065, 'loss/train': 1.8801449537277222} 11/06/2021 22:14:21 - INFO - __main__ - Step 7067: {'lr': 0.0004985558970857543, 'samples': 1356864, 'steps': 7066, 'loss/train': 2.165799379348755} 11/06/2021 22:14:21 - INFO - __main__ - Step 7068: {'lr': 0.000498555327463623, 'samples': 1357056, 'steps': 7067, 'loss/train': 1.7966886758804321} 11/06/2021 22:14:21 - INFO - __main__ - Step 7069: {'lr': 0.0004985547577294963, 'samples': 1357248, 'steps': 7068, 'loss/train': 1.4728955030441284} 11/06/2021 22:14:22 - INFO - __main__ - Step 7070: {'lr': 0.0004985541878833749, 'samples': 1357440, 'steps': 7069, 'loss/train': 1.446349024772644} 11/06/2021 22:14:23 - INFO - __main__ - Step 7071: {'lr': 0.0004985536179252587, 'samples': 1357632, 'steps': 7070, 'loss/train': 1.8138748407363892} 11/06/2021 22:14:23 - INFO - __main__ - Step 7072: {'lr': 0.0004985530478551481, 'samples': 1357824, 'steps': 7071, 'loss/train': 1.72260582447052} 11/06/2021 22:14:23 - INFO - __main__ - Step 7073: {'lr': 0.0004985524776730434, 'samples': 1358016, 'steps': 7072, 'loss/train': 2.2067058086395264} 11/06/2021 22:14:24 - INFO - __main__ - Step 7074: {'lr': 0.0004985519073789447, 'samples': 1358208, 'steps': 7073, 'loss/train': 2.0647716522216797} 11/06/2021 22:14:24 - INFO - __main__ - Step 7075: {'lr': 0.0004985513369728524, 'samples': 1358400, 'steps': 7074, 'loss/train': 1.755244255065918} 11/06/2021 22:14:26 - INFO - __main__ - Step 7076: {'lr': 0.0004985507664547666, 'samples': 1358592, 'steps': 7075, 'loss/train': 2.965153455734253} 11/06/2021 22:14:26 - INFO - __main__ - Step 7077: {'lr': 0.0004985501958246878, 'samples': 1358784, 'steps': 7076, 'loss/train': 2.4291107654571533} 11/06/2021 22:14:26 - INFO - __main__ - Step 7078: {'lr': 0.000498549625082616, 'samples': 1358976, 'steps': 7077, 'loss/train': 1.5984909534454346} 11/06/2021 22:14:27 - INFO - __main__ - Step 7079: {'lr': 0.0004985490542285516, 'samples': 1359168, 'steps': 7078, 'loss/train': 2.098381519317627} 11/06/2021 22:14:27 - INFO - __main__ - Step 7080: {'lr': 0.0004985484832624949, 'samples': 1359360, 'steps': 7079, 'loss/train': 1.983779788017273} 11/06/2021 22:14:27 - INFO - __main__ - Step 7081: {'lr': 0.000498547912184446, 'samples': 1359552, 'steps': 7080, 'loss/train': 2.0644752979278564} 11/06/2021 22:14:28 - INFO - __main__ - Step 7082: {'lr': 0.0004985473409944054, 'samples': 1359744, 'steps': 7081, 'loss/train': 1.7944436073303223} 11/06/2021 22:14:29 - INFO - __main__ - Step 7083: {'lr': 0.000498546769692373, 'samples': 1359936, 'steps': 7082, 'loss/train': 2.2252354621887207} 11/06/2021 22:14:29 - INFO - __main__ - Step 7084: {'lr': 0.0004985461982783494, 'samples': 1360128, 'steps': 7083, 'loss/train': 1.653979778289795} 11/06/2021 22:14:30 - INFO - __main__ - Step 7085: {'lr': 0.0004985456267523346, 'samples': 1360320, 'steps': 7084, 'loss/train': 1.65984308719635} 11/06/2021 22:14:30 - INFO - __main__ - Step 7086: {'lr': 0.0004985450551143291, 'samples': 1360512, 'steps': 7085, 'loss/train': 2.264176607131958} 11/06/2021 22:14:31 - INFO - __main__ - Step 7087: {'lr': 0.000498544483364333, 'samples': 1360704, 'steps': 7086, 'loss/train': 2.1907835006713867} 11/06/2021 22:14:31 - INFO - __main__ - Step 7088: {'lr': 0.0004985439115023465, 'samples': 1360896, 'steps': 7087, 'loss/train': 1.8847299814224243} 11/06/2021 22:14:32 - INFO - __main__ - Step 7089: {'lr': 0.0004985433395283701, 'samples': 1361088, 'steps': 7088, 'loss/train': 1.9816615581512451} 11/06/2021 22:14:32 - INFO - __main__ - Step 7090: {'lr': 0.0004985427674424038, 'samples': 1361280, 'steps': 7089, 'loss/train': 1.8997712135314941} 11/06/2021 22:14:32 - INFO - __main__ - Step 7091: {'lr': 0.000498542195244448, 'samples': 1361472, 'steps': 7090, 'loss/train': 1.139441967010498} 11/06/2021 22:14:33 - INFO - __main__ - Step 7092: {'lr': 0.0004985416229345029, 'samples': 1361664, 'steps': 7091, 'loss/train': 1.571520209312439} 11/06/2021 22:14:34 - INFO - __main__ - Step 7093: {'lr': 0.0004985410505125689, 'samples': 1361856, 'steps': 7092, 'loss/train': 2.138139486312866} 11/06/2021 22:14:34 - INFO - __main__ - Step 7094: {'lr': 0.0004985404779786459, 'samples': 1362048, 'steps': 7093, 'loss/train': 2.261357545852661} 11/06/2021 22:14:35 - INFO - __main__ - Step 7095: {'lr': 0.0004985399053327346, 'samples': 1362240, 'steps': 7094, 'loss/train': 1.7595309019088745} 11/06/2021 22:14:35 - INFO - __main__ - Step 7096: {'lr': 0.000498539332574835, 'samples': 1362432, 'steps': 7095, 'loss/train': 1.8926126956939697} 11/06/2021 22:14:35 - INFO - __main__ - Step 7097: {'lr': 0.0004985387597049474, 'samples': 1362624, 'steps': 7096, 'loss/train': 2.004615306854248} 11/06/2021 22:14:37 - INFO - __main__ - Step 7098: {'lr': 0.0004985381867230721, 'samples': 1362816, 'steps': 7097, 'loss/train': 1.6198385953903198} 11/06/2021 22:14:37 - INFO - __main__ - Step 7099: {'lr': 0.0004985376136292093, 'samples': 1363008, 'steps': 7098, 'loss/train': 1.9833897352218628} 11/06/2021 22:14:37 - INFO - __main__ - Step 7100: {'lr': 0.0004985370404233592, 'samples': 1363200, 'steps': 7099, 'loss/train': 1.9819221496582031} 11/06/2021 22:14:38 - INFO - __main__ - Step 7101: {'lr': 0.0004985364671055223, 'samples': 1363392, 'steps': 7100, 'loss/train': 2.0017964839935303} 11/06/2021 22:14:38 - INFO - __main__ - Step 7102: {'lr': 0.0004985358936756985, 'samples': 1363584, 'steps': 7101, 'loss/train': 1.8916939496994019} 11/06/2021 22:14:38 - INFO - __main__ - Step 7103: {'lr': 0.0004985353201338885, 'samples': 1363776, 'steps': 7102, 'loss/train': 1.6781333684921265} 11/06/2021 22:14:39 - INFO - __main__ - Step 7104: {'lr': 0.0004985347464800921, 'samples': 1363968, 'steps': 7103, 'loss/train': 0.41633597016334534} 11/06/2021 22:14:40 - INFO - __main__ - Step 7105: {'lr': 0.0004985341727143099, 'samples': 1364160, 'steps': 7104, 'loss/train': 1.5315263271331787} 11/06/2021 22:14:40 - INFO - __main__ - Step 7106: {'lr': 0.000498533598836542, 'samples': 1364352, 'steps': 7105, 'loss/train': 1.2057009935379028} 11/06/2021 22:14:40 - INFO - __main__ - Step 7107: {'lr': 0.0004985330248467888, 'samples': 1364544, 'steps': 7106, 'loss/train': 1.7681790590286255} 11/06/2021 22:14:41 - INFO - __main__ - Step 7108: {'lr': 0.0004985324507450504, 'samples': 1364736, 'steps': 7107, 'loss/train': 1.7636619806289673} 11/06/2021 22:14:42 - INFO - __main__ - Step 7109: {'lr': 0.000498531876531327, 'samples': 1364928, 'steps': 7108, 'loss/train': 1.970746397972107} 11/06/2021 22:14:42 - INFO - __main__ - Step 7110: {'lr': 0.0004985313022056191, 'samples': 1365120, 'steps': 7109, 'loss/train': 1.8806291818618774} 11/06/2021 22:14:43 - INFO - __main__ - Step 7111: {'lr': 0.0004985307277679267, 'samples': 1365312, 'steps': 7110, 'loss/train': 1.3008683919906616} 11/06/2021 22:14:43 - INFO - __main__ - Step 7112: {'lr': 0.0004985301532182503, 'samples': 1365504, 'steps': 7111, 'loss/train': 1.9515055418014526} 11/06/2021 22:14:43 - INFO - __main__ - Step 7113: {'lr': 0.0004985295785565901, 'samples': 1365696, 'steps': 7112, 'loss/train': 1.8485970497131348} 11/06/2021 22:14:44 - INFO - __main__ - Step 7114: {'lr': 0.0004985290037829462, 'samples': 1365888, 'steps': 7113, 'loss/train': 3.7923812866210938} 11/06/2021 22:14:45 - INFO - __main__ - Step 7115: {'lr': 0.000498528428897319, 'samples': 1366080, 'steps': 7114, 'loss/train': 1.5386725664138794} 11/06/2021 22:14:45 - INFO - __main__ - Step 7116: {'lr': 0.0004985278538997088, 'samples': 1366272, 'steps': 7115, 'loss/train': 2.0282657146453857} 11/06/2021 22:14:45 - INFO - __main__ - Step 7117: {'lr': 0.0004985272787901156, 'samples': 1366464, 'steps': 7116, 'loss/train': 2.3269381523132324} 11/06/2021 22:14:46 - INFO - __main__ - Step 7118: {'lr': 0.00049852670356854, 'samples': 1366656, 'steps': 7117, 'loss/train': 1.7581599950790405} 11/06/2021 22:14:47 - INFO - __main__ - Step 7119: {'lr': 0.000498526128234982, 'samples': 1366848, 'steps': 7118, 'loss/train': 2.1720402240753174} 11/06/2021 22:14:47 - INFO - __main__ - Step 7120: {'lr': 0.000498525552789442, 'samples': 1367040, 'steps': 7119, 'loss/train': 1.491886019706726} 11/06/2021 22:14:48 - INFO - __main__ - Step 7121: {'lr': 0.0004985249772319202, 'samples': 1367232, 'steps': 7120, 'loss/train': 2.0638139247894287} 11/06/2021 22:14:48 - INFO - __main__ - Step 7122: {'lr': 0.000498524401562417, 'samples': 1367424, 'steps': 7121, 'loss/train': 1.7350369691848755} 11/06/2021 22:14:48 - INFO - __main__ - Step 7123: {'lr': 0.0004985238257809325, 'samples': 1367616, 'steps': 7122, 'loss/train': 2.518477201461792} 11/06/2021 22:14:49 - INFO - __main__ - Step 7124: {'lr': 0.0004985232498874669, 'samples': 1367808, 'steps': 7123, 'loss/train': 1.7711327075958252} 11/06/2021 22:14:50 - INFO - __main__ - Step 7125: {'lr': 0.0004985226738820207, 'samples': 1368000, 'steps': 7124, 'loss/train': 1.2557427883148193} 11/06/2021 22:14:50 - INFO - __main__ - Step 7126: {'lr': 0.0004985220977645939, 'samples': 1368192, 'steps': 7125, 'loss/train': 2.177485942840576} 11/06/2021 22:14:50 - INFO - __main__ - Step 7127: {'lr': 0.0004985215215351869, 'samples': 1368384, 'steps': 7126, 'loss/train': 1.9587048292160034} 11/06/2021 22:14:51 - INFO - __main__ - Step 7128: {'lr': 0.0004985209451937999, 'samples': 1368576, 'steps': 7127, 'loss/train': 2.1263420581817627} 11/06/2021 22:14:51 - INFO - __main__ - Step 7129: {'lr': 0.0004985203687404333, 'samples': 1368768, 'steps': 7128, 'loss/train': 1.8560582399368286} 11/06/2021 22:14:52 - INFO - __main__ - Step 7130: {'lr': 0.0004985197921750871, 'samples': 1368960, 'steps': 7129, 'loss/train': 1.6565303802490234} 11/06/2021 22:14:52 - INFO - __main__ - Step 7131: {'lr': 0.0004985192154977619, 'samples': 1369152, 'steps': 7130, 'loss/train': 1.868842601776123} 11/06/2021 22:14:53 - INFO - __main__ - Step 7132: {'lr': 0.0004985186387084577, 'samples': 1369344, 'steps': 7131, 'loss/train': 1.80448317527771} 11/06/2021 22:14:53 - INFO - __main__ - Step 7133: {'lr': 0.0004985180618071748, 'samples': 1369536, 'steps': 7132, 'loss/train': 1.7268800735473633} 11/06/2021 22:14:53 - INFO - __main__ - Step 7134: {'lr': 0.0004985174847939135, 'samples': 1369728, 'steps': 7133, 'loss/train': 2.039775848388672} 11/06/2021 22:14:55 - INFO - __main__ - Step 7135: {'lr': 0.0004985169076686741, 'samples': 1369920, 'steps': 7134, 'loss/train': 1.5857948064804077} 11/06/2021 22:14:55 - INFO - __main__ - Step 7136: {'lr': 0.0004985163304314568, 'samples': 1370112, 'steps': 7135, 'loss/train': 1.8448445796966553} 11/06/2021 22:14:55 - INFO - __main__ - Step 7137: {'lr': 0.0004985157530822619, 'samples': 1370304, 'steps': 7136, 'loss/train': 1.450709342956543} 11/06/2021 22:14:56 - INFO - __main__ - Step 7138: {'lr': 0.0004985151756210897, 'samples': 1370496, 'steps': 7137, 'loss/train': 1.8486833572387695} 11/06/2021 22:14:56 - INFO - __main__ - Step 7139: {'lr': 0.0004985145980479402, 'samples': 1370688, 'steps': 7138, 'loss/train': 1.828140377998352} 11/06/2021 22:14:57 - INFO - __main__ - Step 7140: {'lr': 0.000498514020362814, 'samples': 1370880, 'steps': 7139, 'loss/train': 1.5421890020370483} 11/06/2021 22:14:57 - INFO - __main__ - Step 7141: {'lr': 0.0004985134425657111, 'samples': 1371072, 'steps': 7140, 'loss/train': 2.6949613094329834} 11/06/2021 22:14:58 - INFO - __main__ - Step 7142: {'lr': 0.000498512864656632, 'samples': 1371264, 'steps': 7141, 'loss/train': 1.5988658666610718} 11/06/2021 22:14:58 - INFO - __main__ - Step 7143: {'lr': 0.0004985122866355768, 'samples': 1371456, 'steps': 7142, 'loss/train': 2.1116602420806885} 11/06/2021 22:14:58 - INFO - __main__ - Step 7144: {'lr': 0.0004985117085025458, 'samples': 1371648, 'steps': 7143, 'loss/train': 2.3443734645843506} 11/06/2021 22:14:59 - INFO - __main__ - Step 7145: {'lr': 0.0004985111302575392, 'samples': 1371840, 'steps': 7144, 'loss/train': 1.9202158451080322} 11/06/2021 22:15:00 - INFO - __main__ - Step 7146: {'lr': 0.0004985105519005573, 'samples': 1372032, 'steps': 7145, 'loss/train': 1.7188575267791748} 11/06/2021 22:15:00 - INFO - __main__ - Step 7147: {'lr': 0.0004985099734316006, 'samples': 1372224, 'steps': 7146, 'loss/train': 1.5859005451202393} 11/06/2021 22:15:00 - INFO - __main__ - Step 7148: {'lr': 0.0004985093948506689, 'samples': 1372416, 'steps': 7147, 'loss/train': 1.5859280824661255} 11/06/2021 22:15:01 - INFO - __main__ - Step 7149: {'lr': 0.0004985088161577628, 'samples': 1372608, 'steps': 7148, 'loss/train': 1.8605209589004517} 11/06/2021 22:15:01 - INFO - __main__ - Step 7150: {'lr': 0.0004985082373528825, 'samples': 1372800, 'steps': 7149, 'loss/train': 2.1094627380371094} 11/06/2021 22:15:02 - INFO - __main__ - Step 7151: {'lr': 0.0004985076584360282, 'samples': 1372992, 'steps': 7150, 'loss/train': 2.0223774909973145} 11/06/2021 22:15:03 - INFO - __main__ - Step 7152: {'lr': 0.0004985070794072002, 'samples': 1373184, 'steps': 7151, 'loss/train': 2.0557656288146973} 11/06/2021 22:15:03 - INFO - __main__ - Step 7153: {'lr': 0.0004985065002663986, 'samples': 1373376, 'steps': 7152, 'loss/train': 1.9789502620697021} 11/06/2021 22:15:03 - INFO - __main__ - Step 7154: {'lr': 0.000498505921013624, 'samples': 1373568, 'steps': 7153, 'loss/train': 1.9508652687072754} 11/06/2021 22:15:04 - INFO - __main__ - Step 7155: {'lr': 0.0004985053416488764, 'samples': 1373760, 'steps': 7154, 'loss/train': 2.078094244003296} 11/06/2021 22:15:05 - INFO - __main__ - Step 7156: {'lr': 0.0004985047621721561, 'samples': 1373952, 'steps': 7155, 'loss/train': 0.8238686323165894} 11/06/2021 22:15:05 - INFO - __main__ - Step 7157: {'lr': 0.0004985041825834634, 'samples': 1374144, 'steps': 7156, 'loss/train': 1.8907822370529175} 11/06/2021 22:15:05 - INFO - __main__ - Step 7158: {'lr': 0.0004985036028827986, 'samples': 1374336, 'steps': 7157, 'loss/train': 2.6892762184143066} 11/06/2021 22:15:06 - INFO - __main__ - Step 7159: {'lr': 0.0004985030230701619, 'samples': 1374528, 'steps': 7158, 'loss/train': 2.3940746784210205} 11/06/2021 22:15:06 - INFO - __main__ - Step 7160: {'lr': 0.0004985024431455534, 'samples': 1374720, 'steps': 7159, 'loss/train': 1.8155590295791626} 11/06/2021 22:15:07 - INFO - __main__ - Step 7161: {'lr': 0.0004985018631089738, 'samples': 1374912, 'steps': 7160, 'loss/train': 2.003696918487549} 11/06/2021 22:15:07 - INFO - __main__ - Step 7162: {'lr': 0.0004985012829604228, 'samples': 1375104, 'steps': 7161, 'loss/train': 1.8686398267745972} 11/06/2021 22:15:08 - INFO - __main__ - Step 7163: {'lr': 0.0004985007026999011, 'samples': 1375296, 'steps': 7162, 'loss/train': 1.7682377099990845} 11/06/2021 22:15:08 - INFO - __main__ - Step 7164: {'lr': 0.0004985001223274089, 'samples': 1375488, 'steps': 7163, 'loss/train': 2.357848644256592} 11/06/2021 22:15:08 - INFO - __main__ - Step 7165: {'lr': 0.0004984995418429463, 'samples': 1375680, 'steps': 7164, 'loss/train': 2.2802202701568604} 11/06/2021 22:15:09 - INFO - __main__ - Step 7166: {'lr': 0.0004984989612465137, 'samples': 1375872, 'steps': 7165, 'loss/train': 2.0252628326416016} 11/06/2021 22:15:10 - INFO - __main__ - Step 7167: {'lr': 0.0004984983805381112, 'samples': 1376064, 'steps': 7166, 'loss/train': 1.8542309999465942} 11/06/2021 22:15:10 - INFO - __main__ - Step 7168: {'lr': 0.0004984977997177393, 'samples': 1376256, 'steps': 7167, 'loss/train': 1.5369564294815063} 11/06/2021 22:15:10 - INFO - __main__ - Step 7169: {'lr': 0.000498497218785398, 'samples': 1376448, 'steps': 7168, 'loss/train': 1.955859899520874} 11/06/2021 22:15:11 - INFO - __main__ - Step 7170: {'lr': 0.0004984966377410878, 'samples': 1376640, 'steps': 7169, 'loss/train': 2.5765957832336426} 11/06/2021 22:15:12 - INFO - __main__ - Step 7171: {'lr': 0.0004984960565848086, 'samples': 1376832, 'steps': 7170, 'loss/train': 2.264620304107666} 11/06/2021 22:15:12 - INFO - __main__ - Step 7172: {'lr': 0.0004984954753165612, 'samples': 1377024, 'steps': 7171, 'loss/train': 1.8249037265777588} 11/06/2021 22:15:13 - INFO - __main__ - Step 7173: {'lr': 0.0004984948939363455, 'samples': 1377216, 'steps': 7172, 'loss/train': 1.3821079730987549} 11/06/2021 22:15:13 - INFO - __main__ - Step 7174: {'lr': 0.0004984943124441617, 'samples': 1377408, 'steps': 7173, 'loss/train': 1.980968952178955} 11/06/2021 22:15:13 - INFO - __main__ - Step 7175: {'lr': 0.0004984937308400104, 'samples': 1377600, 'steps': 7174, 'loss/train': 1.0405446290969849} 11/06/2021 22:15:15 - INFO - __main__ - Step 7176: {'lr': 0.0004984931491238915, 'samples': 1377792, 'steps': 7175, 'loss/train': 2.0840656757354736} 11/06/2021 22:15:15 - INFO - __main__ - Step 7177: {'lr': 0.0004984925672958055, 'samples': 1377984, 'steps': 7176, 'loss/train': 1.7736377716064453} 11/06/2021 22:15:15 - INFO - __main__ - Step 7178: {'lr': 0.0004984919853557526, 'samples': 1378176, 'steps': 7177, 'loss/train': 2.2246837615966797} 11/06/2021 22:15:16 - INFO - __main__ - Step 7179: {'lr': 0.000498491403303733, 'samples': 1378368, 'steps': 7178, 'loss/train': 2.4547572135925293} 11/06/2021 22:15:16 - INFO - __main__ - Step 7180: {'lr': 0.000498490821139747, 'samples': 1378560, 'steps': 7179, 'loss/train': 1.4960209131240845} 11/06/2021 22:15:16 - INFO - __main__ - Step 7181: {'lr': 0.0004984902388637949, 'samples': 1378752, 'steps': 7180, 'loss/train': 1.8791706562042236} 11/06/2021 22:15:17 - INFO - __main__ - Step 7182: {'lr': 0.000498489656475877, 'samples': 1378944, 'steps': 7181, 'loss/train': 2.0455527305603027} 11/06/2021 22:15:18 - INFO - __main__ - Step 7183: {'lr': 0.0004984890739759934, 'samples': 1379136, 'steps': 7182, 'loss/train': 1.8560911417007446} 11/06/2021 22:15:18 - INFO - __main__ - Step 7184: {'lr': 0.0004984884913641444, 'samples': 1379328, 'steps': 7183, 'loss/train': 2.0507397651672363} 11/06/2021 22:15:18 - INFO - __main__ - Step 7185: {'lr': 0.0004984879086403304, 'samples': 1379520, 'steps': 7184, 'loss/train': 2.017399311065674} 11/06/2021 22:15:19 - INFO - __main__ - Step 7186: {'lr': 0.0004984873258045517, 'samples': 1379712, 'steps': 7185, 'loss/train': 1.7196428775787354} 11/06/2021 22:15:19 - INFO - __main__ - Step 7187: {'lr': 0.0004984867428568083, 'samples': 1379904, 'steps': 7186, 'loss/train': 1.79092276096344} 11/06/2021 22:15:20 - INFO - __main__ - Step 7188: {'lr': 0.0004984861597971006, 'samples': 1380096, 'steps': 7187, 'loss/train': 2.169832468032837} 11/06/2021 22:15:21 - INFO - __main__ - Step 7189: {'lr': 0.000498485576625429, 'samples': 1380288, 'steps': 7188, 'loss/train': 1.9039487838745117} 11/06/2021 22:15:21 - INFO - __main__ - Step 7190: {'lr': 0.0004984849933417935, 'samples': 1380480, 'steps': 7189, 'loss/train': 1.6038901805877686} 11/06/2021 22:15:21 - INFO - __main__ - Step 7191: {'lr': 0.0004984844099461945, 'samples': 1380672, 'steps': 7190, 'loss/train': 1.7970324754714966} 11/06/2021 22:15:22 - INFO - __main__ - Step 7192: {'lr': 0.0004984838264386322, 'samples': 1380864, 'steps': 7191, 'loss/train': 1.7720707654953003} 11/06/2021 22:15:23 - INFO - __main__ - Step 7193: {'lr': 0.000498483242819107, 'samples': 1381056, 'steps': 7192, 'loss/train': 1.6350319385528564} 11/06/2021 22:15:23 - INFO - __main__ - Step 7194: {'lr': 0.0004984826590876192, 'samples': 1381248, 'steps': 7193, 'loss/train': 1.857775092124939} 11/06/2021 22:15:23 - INFO - __main__ - Step 7195: {'lr': 0.0004984820752441688, 'samples': 1381440, 'steps': 7194, 'loss/train': 1.664402961730957} 11/06/2021 22:15:24 - INFO - __main__ - Step 7196: {'lr': 0.0004984814912887563, 'samples': 1381632, 'steps': 7195, 'loss/train': 2.1821253299713135} 11/06/2021 22:15:24 - INFO - __main__ - Step 7197: {'lr': 0.0004984809072213818, 'samples': 1381824, 'steps': 7196, 'loss/train': 1.2754813432693481} 11/06/2021 22:15:25 - INFO - __main__ - Step 7198: {'lr': 0.0004984803230420457, 'samples': 1382016, 'steps': 7197, 'loss/train': 1.7471867799758911} 11/06/2021 22:15:25 - INFO - __main__ - Step 7199: {'lr': 0.0004984797387507481, 'samples': 1382208, 'steps': 7198, 'loss/train': 1.889627456665039} 11/06/2021 22:15:26 - INFO - __main__ - Step 7200: {'lr': 0.0004984791543474896, 'samples': 1382400, 'steps': 7199, 'loss/train': 1.987942099571228} 11/06/2021 22:15:26 - INFO - __main__ - Step 7201: {'lr': 0.0004984785698322699, 'samples': 1382592, 'steps': 7200, 'loss/train': 1.3614236116409302} 11/06/2021 22:15:26 - INFO - __main__ - Step 7202: {'lr': 0.0004984779852050898, 'samples': 1382784, 'steps': 7201, 'loss/train': 1.576859712600708} 11/06/2021 22:15:28 - INFO - __main__ - Step 7203: {'lr': 0.0004984774004659493, 'samples': 1382976, 'steps': 7202, 'loss/train': 2.200059413909912} 11/06/2021 22:15:28 - INFO - __main__ - Step 7204: {'lr': 0.0004984768156148489, 'samples': 1383168, 'steps': 7203, 'loss/train': 2.1063573360443115} 11/06/2021 22:15:28 - INFO - __main__ - Step 7205: {'lr': 0.0004984762306517883, 'samples': 1383360, 'steps': 7204, 'loss/train': 2.0486106872558594} 11/06/2021 22:15:29 - INFO - __main__ - Step 7206: {'lr': 0.0004984756455767684, 'samples': 1383552, 'steps': 7205, 'loss/train': 1.8581416606903076} 11/06/2021 22:15:29 - INFO - __main__ - Step 7207: {'lr': 0.0004984750603897892, 'samples': 1383744, 'steps': 7206, 'loss/train': 1.1366732120513916} 11/06/2021 22:15:29 - INFO - __main__ - Step 7208: {'lr': 0.0004984744750908509, 'samples': 1383936, 'steps': 7207, 'loss/train': 1.8445619344711304} 11/06/2021 22:15:30 - INFO - __main__ - Step 7209: {'lr': 0.0004984738896799539, 'samples': 1384128, 'steps': 7208, 'loss/train': 2.4114151000976562} 11/06/2021 22:15:31 - INFO - __main__ - Step 7210: {'lr': 0.0004984733041570983, 'samples': 1384320, 'steps': 7209, 'loss/train': 1.8552770614624023} 11/06/2021 22:15:31 - INFO - __main__ - Step 7211: {'lr': 0.0004984727185222846, 'samples': 1384512, 'steps': 7210, 'loss/train': 2.154346466064453} 11/06/2021 22:15:31 - INFO - __main__ - Step 7212: {'lr': 0.0004984721327755128, 'samples': 1384704, 'steps': 7211, 'loss/train': 2.079803466796875} 11/06/2021 22:15:32 - INFO - __main__ - Step 7213: {'lr': 0.0004984715469167835, 'samples': 1384896, 'steps': 7212, 'loss/train': 1.798585295677185} 11/06/2021 22:15:33 - INFO - __main__ - Step 7214: {'lr': 0.0004984709609460966, 'samples': 1385088, 'steps': 7213, 'loss/train': 1.8755601644515991} 11/06/2021 22:15:33 - INFO - __main__ - Step 7215: {'lr': 0.0004984703748634524, 'samples': 1385280, 'steps': 7214, 'loss/train': 1.9229755401611328} 11/06/2021 22:15:33 - INFO - __main__ - Step 7216: {'lr': 0.0004984697886688514, 'samples': 1385472, 'steps': 7215, 'loss/train': 2.1766183376312256} 11/06/2021 22:15:34 - INFO - __main__ - Step 7217: {'lr': 0.0004984692023622938, 'samples': 1385664, 'steps': 7216, 'loss/train': 2.0930893421173096} 11/06/2021 22:15:34 - INFO - __main__ - Step 7218: {'lr': 0.0004984686159437798, 'samples': 1385856, 'steps': 7217, 'loss/train': 1.6894450187683105} 11/06/2021 22:15:35 - INFO - __main__ - Step 7219: {'lr': 0.0004984680294133096, 'samples': 1386048, 'steps': 7218, 'loss/train': 2.3785271644592285} 11/06/2021 22:15:35 - INFO - __main__ - Step 7220: {'lr': 0.0004984674427708836, 'samples': 1386240, 'steps': 7219, 'loss/train': 2.290306806564331} 11/06/2021 22:15:36 - INFO - __main__ - Step 7221: {'lr': 0.000498466856016502, 'samples': 1386432, 'steps': 7220, 'loss/train': 1.620937705039978} 11/06/2021 22:15:36 - INFO - __main__ - Step 7222: {'lr': 0.000498466269150165, 'samples': 1386624, 'steps': 7221, 'loss/train': 1.4030883312225342} 11/06/2021 22:15:36 - INFO - __main__ - Step 7223: {'lr': 0.000498465682171873, 'samples': 1386816, 'steps': 7222, 'loss/train': 2.2824783325195312} 11/06/2021 22:15:37 - INFO - __main__ - Step 7224: {'lr': 0.0004984650950816262, 'samples': 1387008, 'steps': 7223, 'loss/train': 2.3977138996124268} 11/06/2021 22:15:38 - INFO - __main__ - Step 7225: {'lr': 0.0004984645078794248, 'samples': 1387200, 'steps': 7224, 'loss/train': 1.694911241531372} 11/06/2021 22:15:38 - INFO - __main__ - Step 7226: {'lr': 0.0004984639205652692, 'samples': 1387392, 'steps': 7225, 'loss/train': 2.0143306255340576} 11/06/2021 22:15:39 - INFO - __main__ - Step 7227: {'lr': 0.0004984633331391596, 'samples': 1387584, 'steps': 7226, 'loss/train': 1.6768375635147095} 11/06/2021 22:15:39 - INFO - __main__ - Step 7228: {'lr': 0.0004984627456010962, 'samples': 1387776, 'steps': 7227, 'loss/train': 1.5799198150634766} 11/06/2021 22:15:39 - INFO - __main__ - Step 7229: {'lr': 0.0004984621579510794, 'samples': 1387968, 'steps': 7228, 'loss/train': 1.486791729927063} 11/06/2021 22:15:40 - INFO - __main__ - Step 7230: {'lr': 0.0004984615701891093, 'samples': 1388160, 'steps': 7229, 'loss/train': 3.868530035018921} 11/06/2021 22:15:41 - INFO - __main__ - Step 7231: {'lr': 0.0004984609823151863, 'samples': 1388352, 'steps': 7230, 'loss/train': 0.9350582957267761} 11/06/2021 22:15:41 - INFO - __main__ - Step 7232: {'lr': 0.0004984603943293106, 'samples': 1388544, 'steps': 7231, 'loss/train': 0.9871623516082764} 11/06/2021 22:15:41 - INFO - __main__ - Step 7233: {'lr': 0.0004984598062314824, 'samples': 1388736, 'steps': 7232, 'loss/train': 1.3852111101150513} 11/06/2021 22:15:42 - INFO - __main__ - Step 7234: {'lr': 0.0004984592180217022, 'samples': 1388928, 'steps': 7233, 'loss/train': 1.602423906326294} 11/06/2021 22:15:43 - INFO - __main__ - Step 7235: {'lr': 0.00049845862969997, 'samples': 1389120, 'steps': 7234, 'loss/train': 2.038180351257324} 11/06/2021 22:15:43 - INFO - __main__ - Step 7236: {'lr': 0.0004984580412662862, 'samples': 1389312, 'steps': 7235, 'loss/train': 2.45739483833313} 11/06/2021 22:15:44 - INFO - __main__ - Step 7237: {'lr': 0.000498457452720651, 'samples': 1389504, 'steps': 7236, 'loss/train': 2.169727087020874} 11/06/2021 22:15:44 - INFO - __main__ - Step 7238: {'lr': 0.0004984568640630648, 'samples': 1389696, 'steps': 7237, 'loss/train': 1.3969769477844238} 11/06/2021 22:15:44 - INFO - __main__ - Step 7239: {'lr': 0.0004984562752935278, 'samples': 1389888, 'steps': 7238, 'loss/train': 2.331590175628662} 11/06/2021 22:15:45 - INFO - __main__ - Step 7240: {'lr': 0.0004984556864120401, 'samples': 1390080, 'steps': 7239, 'loss/train': 1.9875437021255493} 11/06/2021 22:15:46 - INFO - __main__ - Step 7241: {'lr': 0.0004984550974186021, 'samples': 1390272, 'steps': 7240, 'loss/train': 2.1947824954986572} 11/06/2021 22:15:46 - INFO - __main__ - Step 7242: {'lr': 0.0004984545083132142, 'samples': 1390464, 'steps': 7241, 'loss/train': 1.9179041385650635} 11/06/2021 22:15:46 - INFO - __main__ - Step 7243: {'lr': 0.0004984539190958765, 'samples': 1390656, 'steps': 7242, 'loss/train': 2.117086887359619} 11/06/2021 22:15:47 - INFO - __main__ - Step 7244: {'lr': 0.0004984533297665892, 'samples': 1390848, 'steps': 7243, 'loss/train': 1.8736786842346191} 11/06/2021 22:15:48 - INFO - __main__ - Step 7245: {'lr': 0.0004984527403253527, 'samples': 1391040, 'steps': 7244, 'loss/train': 2.511869430541992} 11/06/2021 22:15:48 - INFO - __main__ - Step 7246: {'lr': 0.0004984521507721672, 'samples': 1391232, 'steps': 7245, 'loss/train': 2.5657119750976562} 11/06/2021 22:15:48 - INFO - __main__ - Step 7247: {'lr': 0.0004984515611070331, 'samples': 1391424, 'steps': 7246, 'loss/train': 1.9452552795410156} 11/06/2021 22:15:49 - INFO - __main__ - Step 7248: {'lr': 0.0004984509713299505, 'samples': 1391616, 'steps': 7247, 'loss/train': 1.7578961849212646} 11/06/2021 22:15:49 - INFO - __main__ - Step 7249: {'lr': 0.0004984503814409198, 'samples': 1391808, 'steps': 7248, 'loss/train': 1.7365186214447021} 11/06/2021 22:15:49 - INFO - __main__ - Step 7250: {'lr': 0.000498449791439941, 'samples': 1392000, 'steps': 7249, 'loss/train': 2.028444766998291} 11/06/2021 22:15:50 - INFO - __main__ - Step 7251: {'lr': 0.0004984492013270147, 'samples': 1392192, 'steps': 7250, 'loss/train': 1.9997104406356812} 11/06/2021 22:15:51 - INFO - __main__ - Step 7252: {'lr': 0.0004984486111021411, 'samples': 1392384, 'steps': 7251, 'loss/train': 1.7183914184570312} 11/06/2021 22:15:51 - INFO - __main__ - Step 7253: {'lr': 0.0004984480207653202, 'samples': 1392576, 'steps': 7252, 'loss/train': 1.8998106718063354} 11/06/2021 22:15:52 - INFO - __main__ - Step 7254: {'lr': 0.0004984474303165526, 'samples': 1392768, 'steps': 7253, 'loss/train': 1.8647713661193848} 11/06/2021 22:15:52 - INFO - __main__ - Step 7255: {'lr': 0.0004984468397558384, 'samples': 1392960, 'steps': 7254, 'loss/train': 1.3238756656646729} 11/06/2021 22:15:53 - INFO - __main__ - Step 7256: {'lr': 0.0004984462490831778, 'samples': 1393152, 'steps': 7255, 'loss/train': 2.027043104171753} 11/06/2021 22:15:53 - INFO - __main__ - Step 7257: {'lr': 0.0004984456582985713, 'samples': 1393344, 'steps': 7256, 'loss/train': 1.4800242185592651} 11/06/2021 22:15:54 - INFO - __main__ - Step 7258: {'lr': 0.0004984450674020189, 'samples': 1393536, 'steps': 7257, 'loss/train': 2.1049439907073975} 11/06/2021 22:15:54 - INFO - __main__ - Step 7259: {'lr': 0.000498444476393521, 'samples': 1393728, 'steps': 7258, 'loss/train': 2.349273920059204} 11/06/2021 22:15:54 - INFO - __main__ - Step 7260: {'lr': 0.0004984438852730779, 'samples': 1393920, 'steps': 7259, 'loss/train': 1.9256495237350464} 11/06/2021 22:15:55 - INFO - __main__ - Step 7261: {'lr': 0.0004984432940406898, 'samples': 1394112, 'steps': 7260, 'loss/train': 1.3321802616119385} 11/06/2021 22:15:56 - INFO - __main__ - Step 7262: {'lr': 0.0004984427026963569, 'samples': 1394304, 'steps': 7261, 'loss/train': 1.8123741149902344} 11/06/2021 22:15:56 - INFO - __main__ - Step 7263: {'lr': 0.0004984421112400796, 'samples': 1394496, 'steps': 7262, 'loss/train': 1.7774724960327148} 11/06/2021 22:15:56 - INFO - __main__ - Step 7264: {'lr': 0.0004984415196718582, 'samples': 1394688, 'steps': 7263, 'loss/train': 1.5477303266525269} 11/06/2021 22:15:57 - INFO - __main__ - Step 7265: {'lr': 0.0004984409279916929, 'samples': 1394880, 'steps': 7264, 'loss/train': 1.974310278892517} 11/06/2021 22:15:58 - INFO - __main__ - Step 7266: {'lr': 0.0004984403361995839, 'samples': 1395072, 'steps': 7265, 'loss/train': 1.740934133529663} 11/06/2021 22:15:58 - INFO - __main__ - Step 7267: {'lr': 0.0004984397442955315, 'samples': 1395264, 'steps': 7266, 'loss/train': 2.080972194671631} 11/06/2021 22:15:59 - INFO - __main__ - Step 7268: {'lr': 0.0004984391522795359, 'samples': 1395456, 'steps': 7267, 'loss/train': 1.868264079093933} 11/06/2021 22:15:59 - INFO - __main__ - Step 7269: {'lr': 0.0004984385601515977, 'samples': 1395648, 'steps': 7268, 'loss/train': 1.7006977796554565} 11/06/2021 22:15:59 - INFO - __main__ - Step 7270: {'lr': 0.0004984379679117166, 'samples': 1395840, 'steps': 7269, 'loss/train': 2.674793004989624} 11/06/2021 22:16:00 - INFO - __main__ - Step 7271: {'lr': 0.0004984373755598934, 'samples': 1396032, 'steps': 7270, 'loss/train': 1.7683382034301758} 11/06/2021 22:16:01 - INFO - __main__ - Step 7272: {'lr': 0.0004984367830961281, 'samples': 1396224, 'steps': 7271, 'loss/train': 2.167809247970581} 11/06/2021 22:16:01 - INFO - __main__ - Step 7273: {'lr': 0.0004984361905204209, 'samples': 1396416, 'steps': 7272, 'loss/train': 1.7610995769500732} 11/06/2021 22:16:01 - INFO - __main__ - Step 7274: {'lr': 0.0004984355978327724, 'samples': 1396608, 'steps': 7273, 'loss/train': 1.7615541219711304} 11/06/2021 22:16:02 - INFO - __main__ - Step 7275: {'lr': 0.0004984350050331826, 'samples': 1396800, 'steps': 7274, 'loss/train': 1.920972466468811} 11/06/2021 22:16:03 - INFO - __main__ - Step 7276: {'lr': 0.0004984344121216518, 'samples': 1396992, 'steps': 7275, 'loss/train': 1.9374135732650757} 11/06/2021 22:16:03 - INFO - __main__ - Step 7277: {'lr': 0.0004984338190981802, 'samples': 1397184, 'steps': 7276, 'loss/train': 1.6864734888076782} 11/06/2021 22:16:04 - INFO - __main__ - Step 7278: {'lr': 0.0004984332259627682, 'samples': 1397376, 'steps': 7277, 'loss/train': 1.0560840368270874} 11/06/2021 22:16:04 - INFO - __main__ - Step 7279: {'lr': 0.000498432632715416, 'samples': 1397568, 'steps': 7278, 'loss/train': 0.6851865649223328} 11/06/2021 22:16:04 - INFO - __main__ - Step 7280: {'lr': 0.000498432039356124, 'samples': 1397760, 'steps': 7279, 'loss/train': 1.729858160018921} 11/06/2021 22:16:05 - INFO - __main__ - Step 7281: {'lr': 0.0004984314458848923, 'samples': 1397952, 'steps': 7280, 'loss/train': 1.9476943016052246} 11/06/2021 22:16:06 - INFO - __main__ - Step 7282: {'lr': 0.0004984308523017212, 'samples': 1398144, 'steps': 7281, 'loss/train': 1.9683444499969482} 11/06/2021 22:16:06 - INFO - __main__ - Step 7283: {'lr': 0.000498430258606611, 'samples': 1398336, 'steps': 7282, 'loss/train': 2.0607752799987793} 11/06/2021 22:16:06 - INFO - __main__ - Step 7284: {'lr': 0.000498429664799562, 'samples': 1398528, 'steps': 7283, 'loss/train': 2.243265151977539} 11/06/2021 22:16:07 - INFO - __main__ - Step 7285: {'lr': 0.0004984290708805743, 'samples': 1398720, 'steps': 7284, 'loss/train': 1.7015665769577026} 11/06/2021 22:16:07 - INFO - __main__ - Step 7286: {'lr': 0.0004984284768496484, 'samples': 1398912, 'steps': 7285, 'loss/train': 2.0108847618103027} 11/06/2021 22:16:08 - INFO - __main__ - Step 7287: {'lr': 0.0004984278827067844, 'samples': 1399104, 'steps': 7286, 'loss/train': 5.661211967468262} 11/06/2021 22:16:08 - INFO - __main__ - Step 7288: {'lr': 0.0004984272884519827, 'samples': 1399296, 'steps': 7287, 'loss/train': 2.067732572555542} 11/06/2021 22:16:09 - INFO - __main__ - Step 7289: {'lr': 0.0004984266940852434, 'samples': 1399488, 'steps': 7288, 'loss/train': 2.0083210468292236} 11/06/2021 22:16:09 - INFO - __main__ - Step 7290: {'lr': 0.0004984260996065671, 'samples': 1399680, 'steps': 7289, 'loss/train': 1.771776795387268} 11/06/2021 22:16:10 - INFO - __main__ - Step 7291: {'lr': 0.0004984255050159536, 'samples': 1399872, 'steps': 7290, 'loss/train': 2.357623815536499} 11/06/2021 22:16:10 - INFO - __main__ - Step 7292: {'lr': 0.0004984249103134035, 'samples': 1400064, 'steps': 7291, 'loss/train': 1.8594292402267456} 11/06/2021 22:16:11 - INFO - __main__ - Step 7293: {'lr': 0.0004984243154989168, 'samples': 1400256, 'steps': 7292, 'loss/train': 1.5568764209747314} 11/06/2021 22:16:11 - INFO - __main__ - Step 7294: {'lr': 0.0004984237205724942, 'samples': 1400448, 'steps': 7293, 'loss/train': 2.0495903491973877} 11/06/2021 22:16:12 - INFO - __main__ - Step 7295: {'lr': 0.0004984231255341355, 'samples': 1400640, 'steps': 7294, 'loss/train': 1.9171841144561768} 11/06/2021 22:16:12 - INFO - __main__ - Step 7296: {'lr': 0.0004984225303838413, 'samples': 1400832, 'steps': 7295, 'loss/train': 2.2132482528686523} 11/06/2021 22:16:12 - INFO - __main__ - Step 7297: {'lr': 0.0004984219351216116, 'samples': 1401024, 'steps': 7296, 'loss/train': 2.4018189907073975} 11/06/2021 22:16:13 - INFO - __main__ - Step 7298: {'lr': 0.000498421339747447, 'samples': 1401216, 'steps': 7297, 'loss/train': 1.9810959100723267} 11/06/2021 22:16:14 - INFO - __main__ - Step 7299: {'lr': 0.0004984207442613474, 'samples': 1401408, 'steps': 7298, 'loss/train': 2.1550214290618896} 11/06/2021 22:16:14 - INFO - __main__ - Step 7300: {'lr': 0.0004984201486633134, 'samples': 1401600, 'steps': 7299, 'loss/train': 1.7208765745162964} 11/06/2021 22:16:14 - INFO - __main__ - Step 7301: {'lr': 0.0004984195529533451, 'samples': 1401792, 'steps': 7300, 'loss/train': 1.912482500076294} 11/06/2021 22:16:15 - INFO - __main__ - Step 7302: {'lr': 0.0004984189571314426, 'samples': 1401984, 'steps': 7301, 'loss/train': 2.2242941856384277} 11/06/2021 22:16:16 - INFO - __main__ - Step 7303: {'lr': 0.0004984183611976065, 'samples': 1402176, 'steps': 7302, 'loss/train': 1.918655276298523} 11/06/2021 22:16:16 - INFO - __main__ - Step 7304: {'lr': 0.0004984177651518369, 'samples': 1402368, 'steps': 7303, 'loss/train': 2.114764451980591} 11/06/2021 22:16:16 - INFO - __main__ - Step 7305: {'lr': 0.0004984171689941341, 'samples': 1402560, 'steps': 7304, 'loss/train': 1.9402523040771484} 11/06/2021 22:16:17 - INFO - __main__ - Step 7306: {'lr': 0.0004984165727244984, 'samples': 1402752, 'steps': 7305, 'loss/train': 1.6583056449890137} 11/06/2021 22:16:17 - INFO - __main__ - Step 7307: {'lr': 0.0004984159763429299, 'samples': 1402944, 'steps': 7306, 'loss/train': 1.5926767587661743} 11/06/2021 22:16:17 - INFO - __main__ - Step 7308: {'lr': 0.0004984153798494291, 'samples': 1403136, 'steps': 7307, 'loss/train': 1.6592446565628052} 11/06/2021 22:16:18 - INFO - __main__ - Step 7309: {'lr': 0.000498414783243996, 'samples': 1403328, 'steps': 7308, 'loss/train': 1.728884220123291} 11/06/2021 22:16:19 - INFO - __main__ - Step 7310: {'lr': 0.0004984141865266312, 'samples': 1403520, 'steps': 7309, 'loss/train': 0.43410855531692505} 11/06/2021 22:16:19 - INFO - __main__ - Step 7311: {'lr': 0.0004984135896973348, 'samples': 1403712, 'steps': 7310, 'loss/train': 2.0036983489990234} 11/06/2021 22:16:20 - INFO - __main__ - Step 7312: {'lr': 0.000498412992756107, 'samples': 1403904, 'steps': 7311, 'loss/train': 1.7266615629196167} 11/06/2021 22:16:20 - INFO - __main__ - Step 7313: {'lr': 0.0004984123957029482, 'samples': 1404096, 'steps': 7312, 'loss/train': 1.9898042678833008} 11/06/2021 22:16:21 - INFO - __main__ - Step 7314: {'lr': 0.0004984117985378586, 'samples': 1404288, 'steps': 7313, 'loss/train': 2.5833494663238525} 11/06/2021 22:16:21 - INFO - __main__ - Step 7315: {'lr': 0.0004984112012608384, 'samples': 1404480, 'steps': 7314, 'loss/train': 1.9153599739074707} 11/06/2021 22:16:22 - INFO - __main__ - Step 7316: {'lr': 0.000498410603871888, 'samples': 1404672, 'steps': 7315, 'loss/train': 1.9567968845367432} 11/06/2021 22:16:22 - INFO - __main__ - Step 7317: {'lr': 0.0004984100063710076, 'samples': 1404864, 'steps': 7316, 'loss/train': 1.922874927520752} 11/06/2021 22:16:22 - INFO - __main__ - Step 7318: {'lr': 0.0004984094087581975, 'samples': 1405056, 'steps': 7317, 'loss/train': 1.7780122756958008} 11/06/2021 22:16:23 - INFO - __main__ - Step 7319: {'lr': 0.0004984088110334579, 'samples': 1405248, 'steps': 7318, 'loss/train': 1.6674338579177856} 11/06/2021 22:16:24 - INFO - __main__ - Step 7320: {'lr': 0.0004984082131967892, 'samples': 1405440, 'steps': 7319, 'loss/train': 1.5355974435806274} 11/06/2021 22:16:24 - INFO - __main__ - Step 7321: {'lr': 0.0004984076152481916, 'samples': 1405632, 'steps': 7320, 'loss/train': 1.7139782905578613} 11/06/2021 22:16:24 - INFO - __main__ - Step 7322: {'lr': 0.0004984070171876653, 'samples': 1405824, 'steps': 7321, 'loss/train': 2.0687177181243896} 11/06/2021 22:16:25 - INFO - __main__ - Step 7323: {'lr': 0.0004984064190152106, 'samples': 1406016, 'steps': 7322, 'loss/train': 1.93816077709198} 11/06/2021 22:16:26 - INFO - __main__ - Step 7324: {'lr': 0.0004984058207308279, 'samples': 1406208, 'steps': 7323, 'loss/train': 1.9266688823699951} 11/06/2021 22:16:26 - INFO - __main__ - Step 7325: {'lr': 0.0004984052223345174, 'samples': 1406400, 'steps': 7324, 'loss/train': 1.676986575126648} 11/06/2021 22:16:27 - INFO - __main__ - Step 7326: {'lr': 0.0004984046238262792, 'samples': 1406592, 'steps': 7325, 'loss/train': 1.8388060331344604} 11/06/2021 22:16:27 - INFO - __main__ - Step 7327: {'lr': 0.0004984040252061137, 'samples': 1406784, 'steps': 7326, 'loss/train': 2.036705255508423} 11/06/2021 22:16:27 - INFO - __main__ - Step 7328: {'lr': 0.0004984034264740213, 'samples': 1406976, 'steps': 7327, 'loss/train': 1.4209593534469604} 11/06/2021 22:16:28 - INFO - __main__ - Step 7329: {'lr': 0.0004984028276300021, 'samples': 1407168, 'steps': 7328, 'loss/train': 1.2647329568862915} 11/06/2021 22:16:29 - INFO - __main__ - Step 7330: {'lr': 0.0004984022286740565, 'samples': 1407360, 'steps': 7329, 'loss/train': 1.7428815364837646} 11/06/2021 22:16:29 - INFO - __main__ - Step 7331: {'lr': 0.0004984016296061846, 'samples': 1407552, 'steps': 7330, 'loss/train': 1.4878357648849487} 11/06/2021 22:16:29 - INFO - __main__ - Step 7332: {'lr': 0.0004984010304263868, 'samples': 1407744, 'steps': 7331, 'loss/train': 1.8269178867340088} 11/06/2021 22:16:30 - INFO - __main__ - Step 7333: {'lr': 0.0004984004311346632, 'samples': 1407936, 'steps': 7332, 'loss/train': 1.686158299446106} 11/06/2021 22:16:31 - INFO - __main__ - Step 7334: {'lr': 0.0004983998317310143, 'samples': 1408128, 'steps': 7333, 'loss/train': 0.8892830610275269} 11/06/2021 22:16:31 - INFO - __main__ - Step 7335: {'lr': 0.0004983992322154403, 'samples': 1408320, 'steps': 7334, 'loss/train': 1.7509515285491943} 11/06/2021 22:16:31 - INFO - __main__ - Step 7336: {'lr': 0.0004983986325879414, 'samples': 1408512, 'steps': 7335, 'loss/train': 1.561832070350647} 11/06/2021 22:16:32 - INFO - __main__ - Step 7337: {'lr': 0.0004983980328485179, 'samples': 1408704, 'steps': 7336, 'loss/train': 1.7405585050582886} 11/06/2021 22:16:32 - INFO - __main__ - Step 7338: {'lr': 0.0004983974329971702, 'samples': 1408896, 'steps': 7337, 'loss/train': 1.8037465810775757} 11/06/2021 22:16:32 - INFO - __main__ - Step 7339: {'lr': 0.0004983968330338983, 'samples': 1409088, 'steps': 7338, 'loss/train': 1.8311342000961304} 11/06/2021 22:16:33 - INFO - __main__ - Step 7340: {'lr': 0.0004983962329587026, 'samples': 1409280, 'steps': 7339, 'loss/train': 1.649807333946228} 11/06/2021 22:16:34 - INFO - __main__ - Step 7341: {'lr': 0.0004983956327715835, 'samples': 1409472, 'steps': 7340, 'loss/train': 1.9435052871704102} 11/06/2021 22:16:34 - INFO - __main__ - Step 7342: {'lr': 0.000498395032472541, 'samples': 1409664, 'steps': 7341, 'loss/train': 1.9914734363555908} 11/06/2021 22:16:34 - INFO - __main__ - Step 7343: {'lr': 0.0004983944320615757, 'samples': 1409856, 'steps': 7342, 'loss/train': 2.154550552368164} 11/06/2021 22:16:35 - INFO - __main__ - Step 7344: {'lr': 0.0004983938315386877, 'samples': 1410048, 'steps': 7343, 'loss/train': 0.9631898403167725} 11/06/2021 22:16:36 - INFO - __main__ - Step 7345: {'lr': 0.0004983932309038773, 'samples': 1410240, 'steps': 7344, 'loss/train': 1.8528187274932861} 11/06/2021 22:16:36 - INFO - __main__ - Step 7346: {'lr': 0.0004983926301571445, 'samples': 1410432, 'steps': 7345, 'loss/train': 1.4792355298995972} 11/06/2021 22:16:36 - INFO - __main__ - Step 7347: {'lr': 0.00049839202929849, 'samples': 1410624, 'steps': 7346, 'loss/train': 2.132795572280884} 11/06/2021 22:16:37 - INFO - __main__ - Step 7348: {'lr': 0.0004983914283279139, 'samples': 1410816, 'steps': 7347, 'loss/train': 1.8769862651824951} 11/06/2021 22:16:37 - INFO - __main__ - Step 7349: {'lr': 0.0004983908272454164, 'samples': 1411008, 'steps': 7348, 'loss/train': 1.8160367012023926} 11/06/2021 22:16:38 - INFO - __main__ - Step 7350: {'lr': 0.0004983902260509978, 'samples': 1411200, 'steps': 7349, 'loss/train': 0.7067152857780457} 11/06/2021 22:16:38 - INFO - __main__ - Step 7351: {'lr': 0.0004983896247446585, 'samples': 1411392, 'steps': 7350, 'loss/train': 1.9250173568725586} 11/06/2021 22:16:39 - INFO - __main__ - Step 7352: {'lr': 0.0004983890233263986, 'samples': 1411584, 'steps': 7351, 'loss/train': 1.8521647453308105} 11/06/2021 22:16:39 - INFO - __main__ - Step 7353: {'lr': 0.0004983884217962185, 'samples': 1411776, 'steps': 7352, 'loss/train': 1.7597779035568237} 11/06/2021 22:16:40 - INFO - __main__ - Step 7354: {'lr': 0.0004983878201541183, 'samples': 1411968, 'steps': 7353, 'loss/train': 1.017517328262329} 11/06/2021 22:16:41 - INFO - __main__ - Step 7355: {'lr': 0.0004983872184000984, 'samples': 1412160, 'steps': 7354, 'loss/train': 2.1541800498962402} 11/06/2021 22:16:41 - INFO - __main__ - Step 7356: {'lr': 0.0004983866165341592, 'samples': 1412352, 'steps': 7355, 'loss/train': 2.184023380279541} 11/06/2021 22:16:41 - INFO - __main__ - Step 7357: {'lr': 0.0004983860145563006, 'samples': 1412544, 'steps': 7356, 'loss/train': 1.9269651174545288} 11/06/2021 22:16:42 - INFO - __main__ - Step 7358: {'lr': 0.0004983854124665232, 'samples': 1412736, 'steps': 7357, 'loss/train': 0.9575059413909912} 11/06/2021 22:16:42 - INFO - __main__ - Step 7359: {'lr': 0.0004983848102648273, 'samples': 1412928, 'steps': 7358, 'loss/train': 1.8082619905471802} 11/06/2021 22:16:43 - INFO - __main__ - Step 7360: {'lr': 0.0004983842079512128, 'samples': 1413120, 'steps': 7359, 'loss/train': 1.5754534006118774} 11/06/2021 22:16:43 - INFO - __main__ - Step 7361: {'lr': 0.0004983836055256804, 'samples': 1413312, 'steps': 7360, 'loss/train': 1.5222283601760864} 11/06/2021 22:16:44 - INFO - __main__ - Step 7362: {'lr': 0.0004983830029882301, 'samples': 1413504, 'steps': 7361, 'loss/train': 2.070965051651001} 11/06/2021 22:16:44 - INFO - __main__ - Step 7363: {'lr': 0.0004983824003388622, 'samples': 1413696, 'steps': 7362, 'loss/train': 1.8946985006332397} 11/06/2021 22:16:44 - INFO - __main__ - Step 7364: {'lr': 0.0004983817975775771, 'samples': 1413888, 'steps': 7363, 'loss/train': 1.666754126548767} 11/06/2021 22:16:45 - INFO - __main__ - Step 7365: {'lr': 0.000498381194704375, 'samples': 1414080, 'steps': 7364, 'loss/train': 1.879746675491333} 11/06/2021 22:16:46 - INFO - __main__ - Step 7366: {'lr': 0.000498380591719256, 'samples': 1414272, 'steps': 7365, 'loss/train': 1.6441892385482788} 11/06/2021 22:16:46 - INFO - __main__ - Step 7367: {'lr': 0.0004983799886222207, 'samples': 1414464, 'steps': 7366, 'loss/train': 2.2601382732391357} 11/06/2021 22:16:47 - INFO - __main__ - Step 7368: {'lr': 0.0004983793854132693, 'samples': 1414656, 'steps': 7367, 'loss/train': 1.803916335105896} 11/06/2021 22:16:47 - INFO - __main__ - Step 7369: {'lr': 0.0004983787820924019, 'samples': 1414848, 'steps': 7368, 'loss/train': 1.7164603471755981} 11/06/2021 22:16:48 - INFO - __main__ - Step 7370: {'lr': 0.0004983781786596187, 'samples': 1415040, 'steps': 7369, 'loss/train': 1.3875094652175903} 11/06/2021 22:16:48 - INFO - __main__ - Step 7371: {'lr': 0.0004983775751149204, 'samples': 1415232, 'steps': 7370, 'loss/train': 2.1840782165527344} 11/06/2021 22:16:49 - INFO - __main__ - Step 7372: {'lr': 0.0004983769714583067, 'samples': 1415424, 'steps': 7371, 'loss/train': 2.081000328063965} 11/06/2021 22:16:49 - INFO - __main__ - Step 7373: {'lr': 0.0004983763676897784, 'samples': 1415616, 'steps': 7372, 'loss/train': 1.749665379524231} 11/06/2021 22:16:49 - INFO - __main__ - Step 7374: {'lr': 0.0004983757638093355, 'samples': 1415808, 'steps': 7373, 'loss/train': 1.8339048624038696} 11/06/2021 22:16:50 - INFO - __main__ - Step 7375: {'lr': 0.0004983751598169781, 'samples': 1416000, 'steps': 7374, 'loss/train': 1.631373405456543} 11/06/2021 22:16:51 - INFO - __main__ - Step 7376: {'lr': 0.000498374555712707, 'samples': 1416192, 'steps': 7375, 'loss/train': 2.2265982627868652} 11/06/2021 22:16:51 - INFO - __main__ - Step 7377: {'lr': 0.000498373951496522, 'samples': 1416384, 'steps': 7376, 'loss/train': 2.2691292762756348} 11/06/2021 22:16:51 - INFO - __main__ - Step 7378: {'lr': 0.0004983733471684234, 'samples': 1416576, 'steps': 7377, 'loss/train': 2.114935874938965} 11/06/2021 22:16:52 - INFO - __main__ - Step 7379: {'lr': 0.0004983727427284118, 'samples': 1416768, 'steps': 7378, 'loss/train': 1.5053443908691406} 11/06/2021 22:16:52 - INFO - __main__ - Step 7380: {'lr': 0.0004983721381764873, 'samples': 1416960, 'steps': 7379, 'loss/train': 2.0003550052642822} 11/06/2021 22:16:53 - INFO - __main__ - Step 7381: {'lr': 0.00049837153351265, 'samples': 1417152, 'steps': 7380, 'loss/train': 1.6483396291732788} 11/06/2021 22:16:53 - INFO - __main__ - Step 7382: {'lr': 0.0004983709287369004, 'samples': 1417344, 'steps': 7381, 'loss/train': 1.4799424409866333} 11/06/2021 22:16:54 - INFO - __main__ - Step 7383: {'lr': 0.0004983703238492386, 'samples': 1417536, 'steps': 7382, 'loss/train': 1.8734862804412842} 11/06/2021 22:16:54 - INFO - __main__ - Step 7384: {'lr': 0.000498369718849665, 'samples': 1417728, 'steps': 7383, 'loss/train': 2.09423565864563} 11/06/2021 22:16:54 - INFO - __main__ - Step 7385: {'lr': 0.00049836911373818, 'samples': 1417920, 'steps': 7384, 'loss/train': 1.690798044204712} 11/06/2021 22:16:55 - INFO - __main__ - Step 7386: {'lr': 0.0004983685085147836, 'samples': 1418112, 'steps': 7385, 'loss/train': 1.6163359880447388} 11/06/2021 22:16:56 - INFO - __main__ - Step 7387: {'lr': 0.0004983679031794762, 'samples': 1418304, 'steps': 7386, 'loss/train': 1.860256552696228} 11/06/2021 22:16:56 - INFO - __main__ - Step 7388: {'lr': 0.000498367297732258, 'samples': 1418496, 'steps': 7387, 'loss/train': 1.9493474960327148} 11/06/2021 22:16:56 - INFO - __main__ - Step 7389: {'lr': 0.0004983666921731293, 'samples': 1418688, 'steps': 7388, 'loss/train': 1.6616744995117188} 11/06/2021 22:16:57 - INFO - __main__ - Step 7390: {'lr': 0.0004983660865020905, 'samples': 1418880, 'steps': 7389, 'loss/train': 1.901392936706543} 11/06/2021 22:16:58 - INFO - __main__ - Step 7391: {'lr': 0.0004983654807191418, 'samples': 1419072, 'steps': 7390, 'loss/train': 2.3318119049072266} 11/06/2021 22:16:58 - INFO - __main__ - Step 7392: {'lr': 0.0004983648748242833, 'samples': 1419264, 'steps': 7391, 'loss/train': 1.922995686531067} 11/06/2021 22:16:59 - INFO - __main__ - Step 7393: {'lr': 0.0004983642688175155, 'samples': 1419456, 'steps': 7392, 'loss/train': 1.9183114767074585} 11/06/2021 22:16:59 - INFO - __main__ - Step 7394: {'lr': 0.0004983636626988386, 'samples': 1419648, 'steps': 7393, 'loss/train': 2.0506041049957275} 11/06/2021 22:16:59 - INFO - __main__ - Step 7395: {'lr': 0.0004983630564682529, 'samples': 1419840, 'steps': 7394, 'loss/train': 1.6490904092788696} 11/06/2021 22:17:00 - INFO - __main__ - Step 7396: {'lr': 0.0004983624501257585, 'samples': 1420032, 'steps': 7395, 'loss/train': 2.0997846126556396} 11/06/2021 22:17:01 - INFO - __main__ - Step 7397: {'lr': 0.000498361843671356, 'samples': 1420224, 'steps': 7396, 'loss/train': 1.3421412706375122} 11/06/2021 22:17:01 - INFO - __main__ - Step 7398: {'lr': 0.0004983612371050453, 'samples': 1420416, 'steps': 7397, 'loss/train': 1.7991613149642944} 11/06/2021 22:17:01 - INFO - __main__ - Step 7399: {'lr': 0.000498360630426827, 'samples': 1420608, 'steps': 7398, 'loss/train': 2.0792791843414307} 11/06/2021 22:17:02 - INFO - __main__ - Step 7400: {'lr': 0.0004983600236367012, 'samples': 1420800, 'steps': 7399, 'loss/train': 3.388317346572876} 11/06/2021 22:17:03 - INFO - __main__ - Step 7401: {'lr': 0.0004983594167346681, 'samples': 1420992, 'steps': 7400, 'loss/train': 1.5198123455047607} 11/06/2021 22:17:03 - INFO - __main__ - Step 7402: {'lr': 0.0004983588097207283, 'samples': 1421184, 'steps': 7401, 'loss/train': 1.678063988685608} 11/06/2021 22:17:03 - INFO - __main__ - Step 7403: {'lr': 0.0004983582025948816, 'samples': 1421376, 'steps': 7402, 'loss/train': 1.8395053148269653} 11/06/2021 22:17:04 - INFO - __main__ - Step 7404: {'lr': 0.0004983575953571287, 'samples': 1421568, 'steps': 7403, 'loss/train': 0.9479645490646362} 11/06/2021 22:17:04 - INFO - __main__ - Step 7405: {'lr': 0.0004983569880074696, 'samples': 1421760, 'steps': 7404, 'loss/train': 1.9720947742462158} 11/06/2021 22:17:05 - INFO - __main__ - Step 7406: {'lr': 0.0004983563805459048, 'samples': 1421952, 'steps': 7405, 'loss/train': 1.7848727703094482} 11/06/2021 22:17:06 - INFO - __main__ - Step 7407: {'lr': 0.0004983557729724343, 'samples': 1422144, 'steps': 7406, 'loss/train': 1.2195593118667603} 11/06/2021 22:17:06 - INFO - __main__ - Step 7408: {'lr': 0.0004983551652870586, 'samples': 1422336, 'steps': 7407, 'loss/train': 1.9602643251419067} 11/06/2021 22:17:06 - INFO - __main__ - Step 7409: {'lr': 0.000498354557489778, 'samples': 1422528, 'steps': 7408, 'loss/train': 1.9274659156799316} 11/06/2021 22:17:07 - INFO - __main__ - Step 7410: {'lr': 0.0004983539495805925, 'samples': 1422720, 'steps': 7409, 'loss/train': 0.8858946561813354} 11/06/2021 22:17:07 - INFO - __main__ - Step 7411: {'lr': 0.0004983533415595026, 'samples': 1422912, 'steps': 7410, 'loss/train': 2.1833174228668213} 11/06/2021 22:17:08 - INFO - __main__ - Step 7412: {'lr': 0.0004983527334265085, 'samples': 1423104, 'steps': 7411, 'loss/train': 1.8606438636779785} 11/06/2021 22:17:08 - INFO - __main__ - Step 7413: {'lr': 0.0004983521251816105, 'samples': 1423296, 'steps': 7412, 'loss/train': 2.0643343925476074} 11/06/2021 22:17:09 - INFO - __main__ - Step 7414: {'lr': 0.0004983515168248088, 'samples': 1423488, 'steps': 7413, 'loss/train': 2.0445151329040527} 11/06/2021 22:17:09 - INFO - __main__ - Step 7415: {'lr': 0.0004983509083561038, 'samples': 1423680, 'steps': 7414, 'loss/train': 2.069981813430786} 11/06/2021 22:17:09 - INFO - __main__ - Step 7416: {'lr': 0.0004983502997754958, 'samples': 1423872, 'steps': 7415, 'loss/train': 1.200181245803833} 11/06/2021 22:17:11 - INFO - __main__ - Step 7417: {'lr': 0.0004983496910829849, 'samples': 1424064, 'steps': 7416, 'loss/train': 1.8883745670318604} 11/06/2021 22:17:11 - INFO - __main__ - Step 7418: {'lr': 0.0004983490822785715, 'samples': 1424256, 'steps': 7417, 'loss/train': 2.2220919132232666} 11/06/2021 22:17:11 - INFO - __main__ - Step 7419: {'lr': 0.0004983484733622558, 'samples': 1424448, 'steps': 7418, 'loss/train': 1.7355570793151855} 11/06/2021 22:17:12 - INFO - __main__ - Step 7420: {'lr': 0.0004983478643340382, 'samples': 1424640, 'steps': 7419, 'loss/train': 2.0675618648529053} 11/06/2021 22:17:12 - INFO - __main__ - Step 7421: {'lr': 0.0004983472551939186, 'samples': 1424832, 'steps': 7420, 'loss/train': 1.9630166292190552} 11/06/2021 22:17:13 - INFO - __main__ - Step 7422: {'lr': 0.0004983466459418978, 'samples': 1425024, 'steps': 7421, 'loss/train': 2.4349045753479004} 11/06/2021 22:17:13 - INFO - __main__ - Step 7423: {'lr': 0.0004983460365779759, 'samples': 1425216, 'steps': 7422, 'loss/train': 2.1964800357818604} 11/06/2021 22:17:14 - INFO - __main__ - Step 7424: {'lr': 0.0004983454271021529, 'samples': 1425408, 'steps': 7423, 'loss/train': 2.5369303226470947} 11/06/2021 22:17:14 - INFO - __main__ - Step 7425: {'lr': 0.0004983448175144294, 'samples': 1425600, 'steps': 7424, 'loss/train': 2.294490098953247} 11/06/2021 22:17:14 - INFO - __main__ - Step 7426: {'lr': 0.0004983442078148056, 'samples': 1425792, 'steps': 7425, 'loss/train': 1.7043726444244385} 11/06/2021 22:17:15 - INFO - __main__ - Step 7427: {'lr': 0.0004983435980032817, 'samples': 1425984, 'steps': 7426, 'loss/train': 1.0851516723632812} 11/06/2021 22:17:16 - INFO - __main__ - Step 7428: {'lr': 0.0004983429880798579, 'samples': 1426176, 'steps': 7427, 'loss/train': 1.8238978385925293} 11/06/2021 22:17:16 - INFO - __main__ - Step 7429: {'lr': 0.0004983423780445346, 'samples': 1426368, 'steps': 7428, 'loss/train': 2.179898977279663} 11/06/2021 22:17:16 - INFO - __main__ - Step 7430: {'lr': 0.0004983417678973123, 'samples': 1426560, 'steps': 7429, 'loss/train': 2.2200920581817627} 11/06/2021 22:17:17 - INFO - __main__ - Step 7431: {'lr': 0.0004983411576381907, 'samples': 1426752, 'steps': 7430, 'loss/train': 2.471855640411377} 11/06/2021 22:17:18 - INFO - __main__ - Step 7432: {'lr': 0.0004983405472671706, 'samples': 1426944, 'steps': 7431, 'loss/train': 1.9045939445495605} 11/06/2021 22:17:18 - INFO - __main__ - Step 7433: {'lr': 0.000498339936784252, 'samples': 1427136, 'steps': 7432, 'loss/train': 2.156907558441162} 11/06/2021 22:17:18 - INFO - __main__ - Step 7434: {'lr': 0.0004983393261894354, 'samples': 1427328, 'steps': 7433, 'loss/train': 2.357154130935669} 11/06/2021 22:17:19 - INFO - __main__ - Step 7435: {'lr': 0.0004983387154827208, 'samples': 1427520, 'steps': 7434, 'loss/train': 1.69056236743927} 11/06/2021 22:17:19 - INFO - __main__ - Step 7436: {'lr': 0.0004983381046641085, 'samples': 1427712, 'steps': 7435, 'loss/train': 1.9962103366851807} 11/06/2021 22:17:20 - INFO - __main__ - Step 7437: {'lr': 0.0004983374937335991, 'samples': 1427904, 'steps': 7436, 'loss/train': 1.3792200088500977} 11/06/2021 22:17:21 - INFO - __main__ - Step 7438: {'lr': 0.0004983368826911926, 'samples': 1428096, 'steps': 7437, 'loss/train': 1.7087842226028442} 11/06/2021 22:17:21 - INFO - __main__ - Step 7439: {'lr': 0.0004983362715368893, 'samples': 1428288, 'steps': 7438, 'loss/train': 1.5336834192276} 11/06/2021 22:17:21 - INFO - __main__ - Step 7440: {'lr': 0.0004983356602706895, 'samples': 1428480, 'steps': 7439, 'loss/train': 2.1194801330566406} 11/06/2021 22:17:22 - INFO - __main__ - Step 7441: {'lr': 0.0004983350488925936, 'samples': 1428672, 'steps': 7440, 'loss/train': 1.9883663654327393} 11/06/2021 22:17:22 - INFO - __main__ - Step 7442: {'lr': 0.0004983344374026016, 'samples': 1428864, 'steps': 7441, 'loss/train': 1.836441993713379} 11/06/2021 22:17:23 - INFO - __main__ - Step 7443: {'lr': 0.0004983338258007139, 'samples': 1429056, 'steps': 7442, 'loss/train': 1.3787864446640015} 11/06/2021 22:17:24 - INFO - __main__ - Step 7444: {'lr': 0.0004983332140869309, 'samples': 1429248, 'steps': 7443, 'loss/train': 1.911993384361267} 11/06/2021 22:17:24 - INFO - __main__ - Step 7445: {'lr': 0.0004983326022612528, 'samples': 1429440, 'steps': 7444, 'loss/train': 2.0019690990448} 11/06/2021 22:17:24 - INFO - __main__ - Step 7446: {'lr': 0.0004983319903236799, 'samples': 1429632, 'steps': 7445, 'loss/train': 2.305609941482544} 11/06/2021 22:17:25 - INFO - __main__ - Step 7447: {'lr': 0.0004983313782742124, 'samples': 1429824, 'steps': 7446, 'loss/train': 1.3778132200241089} 11/06/2021 22:17:26 - INFO - __main__ - Step 7448: {'lr': 0.0004983307661128505, 'samples': 1430016, 'steps': 7447, 'loss/train': 1.7583122253417969} 11/06/2021 22:17:26 - INFO - __main__ - Step 7449: {'lr': 0.0004983301538395948, 'samples': 1430208, 'steps': 7448, 'loss/train': 1.846149206161499} 11/06/2021 22:17:27 - INFO - __main__ - Step 7450: {'lr': 0.0004983295414544452, 'samples': 1430400, 'steps': 7449, 'loss/train': 1.9692158699035645} 11/06/2021 22:17:27 - INFO - __main__ - Step 7451: {'lr': 0.0004983289289574022, 'samples': 1430592, 'steps': 7450, 'loss/train': 1.7557963132858276} 11/06/2021 22:17:27 - INFO - __main__ - Step 7452: {'lr': 0.000498328316348466, 'samples': 1430784, 'steps': 7451, 'loss/train': 2.0076990127563477} 11/06/2021 22:17:28 - INFO - __main__ - Step 7453: {'lr': 0.0004983277036276369, 'samples': 1430976, 'steps': 7452, 'loss/train': 0.7926499843597412} 11/06/2021 22:17:29 - INFO - __main__ - Step 7454: {'lr': 0.0004983270907949152, 'samples': 1431168, 'steps': 7453, 'loss/train': 1.5046730041503906} 11/06/2021 22:17:29 - INFO - __main__ - Step 7455: {'lr': 0.0004983264778503011, 'samples': 1431360, 'steps': 7454, 'loss/train': 1.3215335607528687} 11/06/2021 22:17:29 - INFO - __main__ - Step 7456: {'lr': 0.0004983258647937949, 'samples': 1431552, 'steps': 7455, 'loss/train': 2.1020658016204834} 11/06/2021 22:17:30 - INFO - __main__ - Step 7457: {'lr': 0.0004983252516253969, 'samples': 1431744, 'steps': 7456, 'loss/train': 1.746657133102417} 11/06/2021 22:17:31 - INFO - __main__ - Step 7458: {'lr': 0.0004983246383451074, 'samples': 1431936, 'steps': 7457, 'loss/train': 2.16377329826355} 11/06/2021 22:17:31 - INFO - __main__ - Step 7459: {'lr': 0.0004983240249529267, 'samples': 1432128, 'steps': 7458, 'loss/train': 2.1467676162719727} 11/06/2021 22:17:31 - INFO - __main__ - Step 7460: {'lr': 0.000498323411448855, 'samples': 1432320, 'steps': 7459, 'loss/train': 1.7520476579666138} 11/06/2021 22:17:32 - INFO - __main__ - Step 7461: {'lr': 0.0004983227978328926, 'samples': 1432512, 'steps': 7460, 'loss/train': 2.5341601371765137} 11/06/2021 22:17:32 - INFO - __main__ - Step 7462: {'lr': 0.0004983221841050397, 'samples': 1432704, 'steps': 7461, 'loss/train': 2.3244829177856445} 11/06/2021 22:17:33 - INFO - __main__ - Step 7463: {'lr': 0.0004983215702652968, 'samples': 1432896, 'steps': 7462, 'loss/train': 1.8480935096740723} 11/06/2021 22:17:34 - INFO - __main__ - Step 7464: {'lr': 0.0004983209563136639, 'samples': 1433088, 'steps': 7463, 'loss/train': 1.939214825630188} 11/06/2021 22:17:34 - INFO - __main__ - Step 7465: {'lr': 0.0004983203422501414, 'samples': 1433280, 'steps': 7464, 'loss/train': 2.0819010734558105} 11/06/2021 22:17:34 - INFO - __main__ - Step 7466: {'lr': 0.0004983197280747297, 'samples': 1433472, 'steps': 7465, 'loss/train': 1.9236360788345337} 11/06/2021 22:17:35 - INFO - __main__ - Step 7467: {'lr': 0.0004983191137874289, 'samples': 1433664, 'steps': 7466, 'loss/train': 1.6283338069915771} 11/06/2021 22:17:35 - INFO - __main__ - Step 7468: {'lr': 0.0004983184993882394, 'samples': 1433856, 'steps': 7467, 'loss/train': 1.6667938232421875} 11/06/2021 22:17:36 - INFO - __main__ - Step 7469: {'lr': 0.0004983178848771613, 'samples': 1434048, 'steps': 7468, 'loss/train': 2.123434066772461} 11/06/2021 22:17:36 - INFO - __main__ - Step 7470: {'lr': 0.0004983172702541951, 'samples': 1434240, 'steps': 7469, 'loss/train': 1.7942487001419067} 11/06/2021 22:17:37 - INFO - __main__ - Step 7471: {'lr': 0.0004983166555193409, 'samples': 1434432, 'steps': 7470, 'loss/train': 2.0577187538146973} 11/06/2021 22:17:37 - INFO - __main__ - Step 7472: {'lr': 0.000498316040672599, 'samples': 1434624, 'steps': 7471, 'loss/train': 1.767512559890747} 11/06/2021 22:17:37 - INFO - __main__ - Step 7473: {'lr': 0.00049831542571397, 'samples': 1434816, 'steps': 7472, 'loss/train': 1.7135177850723267} 11/06/2021 22:17:38 - INFO - __main__ - Step 7474: {'lr': 0.0004983148106434536, 'samples': 1435008, 'steps': 7473, 'loss/train': 1.7563962936401367} 11/06/2021 22:17:39 - INFO - __main__ - Step 7475: {'lr': 0.0004983141954610505, 'samples': 1435200, 'steps': 7474, 'loss/train': 1.7334504127502441} 11/06/2021 22:17:39 - INFO - __main__ - Step 7476: {'lr': 0.0004983135801667608, 'samples': 1435392, 'steps': 7475, 'loss/train': 2.2196826934814453} 11/06/2021 22:17:39 - INFO - __main__ - Step 7477: {'lr': 0.0004983129647605849, 'samples': 1435584, 'steps': 7476, 'loss/train': 1.6023023128509521} 11/06/2021 22:17:40 - INFO - __main__ - Step 7478: {'lr': 0.0004983123492425229, 'samples': 1435776, 'steps': 7477, 'loss/train': 1.4602479934692383} 11/06/2021 22:17:41 - INFO - __main__ - Step 7479: {'lr': 0.0004983117336125753, 'samples': 1435968, 'steps': 7478, 'loss/train': 1.8138916492462158} 11/06/2021 22:17:41 - INFO - __main__ - Step 7480: {'lr': 0.0004983111178707422, 'samples': 1436160, 'steps': 7479, 'loss/train': 1.5433342456817627} 11/06/2021 22:17:41 - INFO - __main__ - Step 7481: {'lr': 0.0004983105020170239, 'samples': 1436352, 'steps': 7480, 'loss/train': 1.7849801778793335} 11/06/2021 22:17:42 - INFO - __main__ - Step 7482: {'lr': 0.0004983098860514209, 'samples': 1436544, 'steps': 7481, 'loss/train': 2.041818380355835} 11/06/2021 22:17:42 - INFO - __main__ - Step 7483: {'lr': 0.0004983092699739331, 'samples': 1436736, 'steps': 7482, 'loss/train': 1.6931136846542358} 11/06/2021 22:17:43 - INFO - __main__ - Step 7484: {'lr': 0.0004983086537845611, 'samples': 1436928, 'steps': 7483, 'loss/train': 1.9418973922729492} 11/06/2021 22:17:44 - INFO - __main__ - Step 7485: {'lr': 0.000498308037483305, 'samples': 1437120, 'steps': 7484, 'loss/train': 1.8237574100494385} 11/06/2021 22:17:44 - INFO - __main__ - Step 7486: {'lr': 0.0004983074210701651, 'samples': 1437312, 'steps': 7485, 'loss/train': 1.779268503189087} 11/06/2021 22:17:44 - INFO - __main__ - Step 7487: {'lr': 0.0004983068045451418, 'samples': 1437504, 'steps': 7486, 'loss/train': 2.059835433959961} 11/06/2021 22:17:45 - INFO - __main__ - Step 7488: {'lr': 0.0004983061879082352, 'samples': 1437696, 'steps': 7487, 'loss/train': 2.341383218765259} 11/06/2021 22:17:46 - INFO - __main__ - Step 7489: {'lr': 0.0004983055711594458, 'samples': 1437888, 'steps': 7488, 'loss/train': 1.6205718517303467} 11/06/2021 22:17:46 - INFO - __main__ - Step 7490: {'lr': 0.0004983049542987736, 'samples': 1438080, 'steps': 7489, 'loss/train': 1.5462263822555542} 11/06/2021 22:17:46 - INFO - __main__ - Step 7491: {'lr': 0.000498304337326219, 'samples': 1438272, 'steps': 7490, 'loss/train': 2.009598970413208} 11/06/2021 22:17:47 - INFO - __main__ - Step 7492: {'lr': 0.0004983037202417824, 'samples': 1438464, 'steps': 7491, 'loss/train': 1.6653988361358643} 11/06/2021 22:17:47 - INFO - __main__ - Step 7493: {'lr': 0.0004983031030454639, 'samples': 1438656, 'steps': 7492, 'loss/train': 1.2993390560150146} 11/06/2021 22:17:48 - INFO - __main__ - Step 7494: {'lr': 0.0004983024857372639, 'samples': 1438848, 'steps': 7493, 'loss/train': 1.8429198265075684} 11/06/2021 22:17:49 - INFO - __main__ - Step 7495: {'lr': 0.0004983018683171826, 'samples': 1439040, 'steps': 7494, 'loss/train': 1.9161350727081299} 11/06/2021 22:17:49 - INFO - __main__ - Step 7496: {'lr': 0.0004983012507852203, 'samples': 1439232, 'steps': 7495, 'loss/train': 1.8230706453323364} 11/06/2021 22:17:49 - INFO - __main__ - Step 7497: {'lr': 0.0004983006331413773, 'samples': 1439424, 'steps': 7496, 'loss/train': 1.6898235082626343} 11/06/2021 22:17:50 - INFO - __main__ - Step 7498: {'lr': 0.0004983000153856539, 'samples': 1439616, 'steps': 7497, 'loss/train': 1.4853415489196777} 11/06/2021 22:17:50 - INFO - __main__ - Step 7499: {'lr': 0.0004982993975180504, 'samples': 1439808, 'steps': 7498, 'loss/train': 2.2079527378082275} 11/06/2021 22:17:51 - INFO - __main__ - Step 7500: {'lr': 0.0004982987795385669, 'samples': 1440000, 'steps': 7499, 'loss/train': 2.117600440979004} 11/06/2021 22:17:51 - INFO - __main__ - Step 7501: {'lr': 0.0004982981614472039, 'samples': 1440192, 'steps': 7500, 'loss/train': 2.0123913288116455} 11/06/2021 22:17:52 - INFO - __main__ - Step 7502: {'lr': 0.0004982975432439615, 'samples': 1440384, 'steps': 7501, 'loss/train': 1.6575013399124146} 11/06/2021 22:17:52 - INFO - __main__ - Step 7503: {'lr': 0.0004982969249288401, 'samples': 1440576, 'steps': 7502, 'loss/train': 0.48864415287971497} 11/06/2021 22:17:53 - INFO - __main__ - Step 7504: {'lr': 0.0004982963065018399, 'samples': 1440768, 'steps': 7503, 'loss/train': 2.1326282024383545} 11/06/2021 22:17:54 - INFO - __main__ - Step 7505: {'lr': 0.0004982956879629612, 'samples': 1440960, 'steps': 7504, 'loss/train': 2.184455633163452} 11/06/2021 22:17:54 - INFO - __main__ - Step 7506: {'lr': 0.0004982950693122044, 'samples': 1441152, 'steps': 7505, 'loss/train': 1.8725202083587646} 11/06/2021 22:17:54 - INFO - __main__ - Step 7507: {'lr': 0.0004982944505495696, 'samples': 1441344, 'steps': 7506, 'loss/train': 1.6214361190795898} 11/06/2021 22:17:55 - INFO - __main__ - Step 7508: {'lr': 0.0004982938316750572, 'samples': 1441536, 'steps': 7507, 'loss/train': 2.0111021995544434} 11/06/2021 22:17:55 - INFO - __main__ - Step 7509: {'lr': 0.0004982932126886674, 'samples': 1441728, 'steps': 7508, 'loss/train': 1.670443058013916} 11/06/2021 22:17:56 - INFO - __main__ - Step 7510: {'lr': 0.0004982925935904004, 'samples': 1441920, 'steps': 7509, 'loss/train': 1.7598836421966553} 11/06/2021 22:17:56 - INFO - __main__ - Step 7511: {'lr': 0.0004982919743802567, 'samples': 1442112, 'steps': 7510, 'loss/train': 2.0992119312286377} 11/06/2021 22:17:57 - INFO - __main__ - Step 7512: {'lr': 0.0004982913550582364, 'samples': 1442304, 'steps': 7511, 'loss/train': 1.790313720703125} 11/06/2021 22:17:57 - INFO - __main__ - Step 7513: {'lr': 0.00049829073562434, 'samples': 1442496, 'steps': 7512, 'loss/train': 1.9669575691223145} 11/06/2021 22:17:57 - INFO - __main__ - Step 7514: {'lr': 0.0004982901160785675, 'samples': 1442688, 'steps': 7513, 'loss/train': 1.6649370193481445} 11/06/2021 22:17:58 - INFO - __main__ - Step 7515: {'lr': 0.0004982894964209193, 'samples': 1442880, 'steps': 7514, 'loss/train': 2.0550930500030518} 11/06/2021 22:17:59 - INFO - __main__ - Step 7516: {'lr': 0.0004982888766513957, 'samples': 1443072, 'steps': 7515, 'loss/train': 1.5787222385406494} 11/06/2021 22:17:59 - INFO - __main__ - Step 7517: {'lr': 0.000498288256769997, 'samples': 1443264, 'steps': 7516, 'loss/train': 1.9650318622589111} 11/06/2021 22:17:59 - INFO - __main__ - Step 7518: {'lr': 0.0004982876367767234, 'samples': 1443456, 'steps': 7517, 'loss/train': 2.108997106552124} 11/06/2021 22:18:00 - INFO - __main__ - Step 7519: {'lr': 0.0004982870166715753, 'samples': 1443648, 'steps': 7518, 'loss/train': 1.5011414289474487} 11/06/2021 22:18:01 - INFO - __main__ - Step 7520: {'lr': 0.0004982863964545529, 'samples': 1443840, 'steps': 7519, 'loss/train': 2.0259969234466553} 11/06/2021 22:18:01 - INFO - __main__ - Step 7521: {'lr': 0.0004982857761256564, 'samples': 1444032, 'steps': 7520, 'loss/train': 2.0949344635009766} 11/06/2021 22:18:01 - INFO - __main__ - Step 7522: {'lr': 0.0004982851556848861, 'samples': 1444224, 'steps': 7521, 'loss/train': 1.771378993988037} 11/06/2021 22:18:02 - INFO - __main__ - Step 7523: {'lr': 0.0004982845351322424, 'samples': 1444416, 'steps': 7522, 'loss/train': 1.6753596067428589} 11/06/2021 22:18:02 - INFO - __main__ - Step 7524: {'lr': 0.0004982839144677257, 'samples': 1444608, 'steps': 7523, 'loss/train': 0.8086962103843689} 11/06/2021 22:18:03 - INFO - __main__ - Step 7525: {'lr': 0.0004982832936913359, 'samples': 1444800, 'steps': 7524, 'loss/train': 1.749570608139038} 11/06/2021 22:18:04 - INFO - __main__ - Step 7526: {'lr': 0.0004982826728030735, 'samples': 1444992, 'steps': 7525, 'loss/train': 1.9992296695709229} 11/06/2021 22:18:04 - INFO - __main__ - Step 7527: {'lr': 0.0004982820518029387, 'samples': 1445184, 'steps': 7526, 'loss/train': 2.0168237686157227} 11/06/2021 22:18:04 - INFO - __main__ - Step 7528: {'lr': 0.000498281430690932, 'samples': 1445376, 'steps': 7527, 'loss/train': 1.9467054605484009} 11/06/2021 22:18:05 - INFO - __main__ - Step 7529: {'lr': 0.0004982808094670534, 'samples': 1445568, 'steps': 7528, 'loss/train': 1.8640716075897217} 11/06/2021 22:18:05 - INFO - __main__ - Step 7530: {'lr': 0.0004982801881313034, 'samples': 1445760, 'steps': 7529, 'loss/train': 1.9027869701385498} 11/06/2021 22:18:06 - INFO - __main__ - Step 7531: {'lr': 0.0004982795666836821, 'samples': 1445952, 'steps': 7530, 'loss/train': 1.8328138589859009} 11/06/2021 22:18:06 - INFO - __main__ - Step 7532: {'lr': 0.00049827894512419, 'samples': 1446144, 'steps': 7531, 'loss/train': 2.27284836769104} 11/06/2021 22:18:07 - INFO - __main__ - Step 7533: {'lr': 0.000498278323452827, 'samples': 1446336, 'steps': 7532, 'loss/train': 1.7987669706344604} 11/06/2021 22:18:07 - INFO - __main__ - Step 7534: {'lr': 0.0004982777016695937, 'samples': 1446528, 'steps': 7533, 'loss/train': 1.1510889530181885} 11/06/2021 22:18:08 - INFO - __main__ - Step 7535: {'lr': 0.0004982770797744904, 'samples': 1446720, 'steps': 7534, 'loss/train': 1.9908004999160767} 11/06/2021 22:18:09 - INFO - __main__ - Step 7536: {'lr': 0.0004982764577675172, 'samples': 1446912, 'steps': 7535, 'loss/train': 2.135575532913208} 11/06/2021 22:18:09 - INFO - __main__ - Step 7537: {'lr': 0.0004982758356486746, 'samples': 1447104, 'steps': 7536, 'loss/train': 1.4963163137435913} 11/06/2021 22:18:09 - INFO - __main__ - Step 7538: {'lr': 0.0004982752134179624, 'samples': 1447296, 'steps': 7537, 'loss/train': 2.040847063064575} 11/06/2021 22:18:10 - INFO - __main__ - Step 7539: {'lr': 0.0004982745910753815, 'samples': 1447488, 'steps': 7538, 'loss/train': 2.0529308319091797} 11/06/2021 22:18:10 - INFO - __main__ - Step 7540: {'lr': 0.0004982739686209319, 'samples': 1447680, 'steps': 7539, 'loss/train': 1.5102424621582031} 11/06/2021 22:18:10 - INFO - __main__ - Step 7541: {'lr': 0.0004982733460546138, 'samples': 1447872, 'steps': 7540, 'loss/train': 2.3759987354278564} 11/06/2021 22:18:11 - INFO - __main__ - Step 7542: {'lr': 0.0004982727233764276, 'samples': 1448064, 'steps': 7541, 'loss/train': 1.264660120010376} 11/06/2021 22:18:12 - INFO - __main__ - Step 7543: {'lr': 0.0004982721005863734, 'samples': 1448256, 'steps': 7542, 'loss/train': 2.076897621154785} 11/06/2021 22:18:12 - INFO - __main__ - Step 7544: {'lr': 0.0004982714776844518, 'samples': 1448448, 'steps': 7543, 'loss/train': 1.9812895059585571} 11/06/2021 22:18:13 - INFO - __main__ - Step 7545: {'lr': 0.0004982708546706628, 'samples': 1448640, 'steps': 7544, 'loss/train': 1.7536218166351318} 11/06/2021 22:18:13 - INFO - __main__ - Step 7546: {'lr': 0.0004982702315450068, 'samples': 1448832, 'steps': 7545, 'loss/train': 2.1761927604675293} 11/06/2021 22:18:14 - INFO - __main__ - Step 7547: {'lr': 0.0004982696083074841, 'samples': 1449024, 'steps': 7546, 'loss/train': 1.766358494758606} 11/06/2021 22:18:14 - INFO - __main__ - Step 7548: {'lr': 0.0004982689849580951, 'samples': 1449216, 'steps': 7547, 'loss/train': 0.7626532912254333} 11/06/2021 22:18:15 - INFO - __main__ - Step 7549: {'lr': 0.0004982683614968396, 'samples': 1449408, 'steps': 7548, 'loss/train': 1.5238088369369507} 11/06/2021 22:18:15 - INFO - __main__ - Step 7550: {'lr': 0.0004982677379237185, 'samples': 1449600, 'steps': 7549, 'loss/train': 1.834945797920227} 11/06/2021 22:18:15 - INFO - __main__ - Step 7551: {'lr': 0.0004982671142387316, 'samples': 1449792, 'steps': 7550, 'loss/train': 1.1642390489578247} 11/06/2021 22:18:17 - INFO - __main__ - Step 7552: {'lr': 0.0004982664904418794, 'samples': 1449984, 'steps': 7551, 'loss/train': 0.791139543056488} 11/06/2021 22:18:17 - INFO - __main__ - Step 7553: {'lr': 0.0004982658665331622, 'samples': 1450176, 'steps': 7552, 'loss/train': 2.0422914028167725} 11/06/2021 22:18:17 - INFO - __main__ - Step 7554: {'lr': 0.0004982652425125802, 'samples': 1450368, 'steps': 7553, 'loss/train': 2.1857669353485107} 11/06/2021 22:18:18 - INFO - __main__ - Step 7555: {'lr': 0.0004982646183801337, 'samples': 1450560, 'steps': 7554, 'loss/train': 2.560479164123535} 11/06/2021 22:18:18 - INFO - __main__ - Step 7556: {'lr': 0.000498263994135823, 'samples': 1450752, 'steps': 7555, 'loss/train': 2.5498902797698975} 11/06/2021 22:18:18 - INFO - __main__ - Step 7557: {'lr': 0.0004982633697796484, 'samples': 1450944, 'steps': 7556, 'loss/train': 2.1660470962524414} 11/06/2021 22:18:19 - INFO - __main__ - Step 7558: {'lr': 0.0004982627453116102, 'samples': 1451136, 'steps': 7557, 'loss/train': 1.7757936716079712} 11/06/2021 22:18:20 - INFO - __main__ - Step 7559: {'lr': 0.0004982621207317086, 'samples': 1451328, 'steps': 7558, 'loss/train': 1.7856800556182861} 11/06/2021 22:18:20 - INFO - __main__ - Step 7560: {'lr': 0.0004982614960399439, 'samples': 1451520, 'steps': 7559, 'loss/train': 1.483529806137085} 11/06/2021 22:18:20 - INFO - __main__ - Step 7561: {'lr': 0.0004982608712363163, 'samples': 1451712, 'steps': 7560, 'loss/train': 2.1394612789154053} 11/06/2021 22:18:21 - INFO - __main__ - Step 7562: {'lr': 0.0004982602463208263, 'samples': 1451904, 'steps': 7561, 'loss/train': 2.436344861984253} 11/06/2021 22:18:22 - INFO - __main__ - Step 7563: {'lr': 0.0004982596212934742, 'samples': 1452096, 'steps': 7562, 'loss/train': 2.3320164680480957} 11/06/2021 22:18:22 - INFO - __main__ - Step 7564: {'lr': 0.00049825899615426, 'samples': 1452288, 'steps': 7563, 'loss/train': 1.0937427282333374} 11/06/2021 22:18:22 - INFO - __main__ - Step 7565: {'lr': 0.000498258370903184, 'samples': 1452480, 'steps': 7564, 'loss/train': 1.8388676643371582} 11/06/2021 22:18:23 - INFO - __main__ - Step 7566: {'lr': 0.0004982577455402467, 'samples': 1452672, 'steps': 7565, 'loss/train': 1.7357767820358276} 11/06/2021 22:18:23 - INFO - __main__ - Step 7567: {'lr': 0.0004982571200654485, 'samples': 1452864, 'steps': 7566, 'loss/train': 1.5761942863464355} 11/06/2021 22:18:24 - INFO - __main__ - Step 7568: {'lr': 0.0004982564944787892, 'samples': 1453056, 'steps': 7567, 'loss/train': 2.1120853424072266} 11/06/2021 22:18:25 - INFO - __main__ - Step 7569: {'lr': 0.0004982558687802695, 'samples': 1453248, 'steps': 7568, 'loss/train': 2.013747453689575} 11/06/2021 22:18:25 - INFO - __main__ - Step 7570: {'lr': 0.0004982552429698894, 'samples': 1453440, 'steps': 7569, 'loss/train': 2.029210090637207} 11/06/2021 22:18:25 - INFO - __main__ - Step 7571: {'lr': 0.0004982546170476494, 'samples': 1453632, 'steps': 7570, 'loss/train': 1.5686355829238892} 11/06/2021 22:18:26 - INFO - __main__ - Step 7572: {'lr': 0.0004982539910135497, 'samples': 1453824, 'steps': 7571, 'loss/train': 1.8889784812927246} 11/06/2021 22:18:27 - INFO - __main__ - Step 7573: {'lr': 0.0004982533648675906, 'samples': 1454016, 'steps': 7572, 'loss/train': 2.56605863571167} 11/06/2021 22:18:27 - INFO - __main__ - Step 7574: {'lr': 0.0004982527386097723, 'samples': 1454208, 'steps': 7573, 'loss/train': 1.9748719930648804} 11/06/2021 22:18:27 - INFO - __main__ - Step 7575: {'lr': 0.0004982521122400953, 'samples': 1454400, 'steps': 7574, 'loss/train': 1.4213409423828125} 11/06/2021 22:18:28 - INFO - __main__ - Step 7576: {'lr': 0.0004982514857585596, 'samples': 1454592, 'steps': 7575, 'loss/train': 1.8690491914749146} 11/06/2021 22:18:28 - INFO - __main__ - Step 7577: {'lr': 0.0004982508591651657, 'samples': 1454784, 'steps': 7576, 'loss/train': 2.0668842792510986} 11/06/2021 22:18:29 - INFO - __main__ - Step 7578: {'lr': 0.0004982502324599137, 'samples': 1454976, 'steps': 7577, 'loss/train': 2.21287202835083} 11/06/2021 22:18:29 - INFO - __main__ - Step 7579: {'lr': 0.000498249605642804, 'samples': 1455168, 'steps': 7578, 'loss/train': 1.7789140939712524} 11/06/2021 22:18:30 - INFO - __main__ - Step 7580: {'lr': 0.0004982489787138369, 'samples': 1455360, 'steps': 7579, 'loss/train': 1.9952287673950195} 11/06/2021 22:18:30 - INFO - __main__ - Step 7581: {'lr': 0.0004982483516730126, 'samples': 1455552, 'steps': 7580, 'loss/train': 1.8334144353866577} 11/06/2021 22:18:30 - INFO - __main__ - Step 7582: {'lr': 0.0004982477245203314, 'samples': 1455744, 'steps': 7581, 'loss/train': 1.755232572555542} 11/06/2021 22:18:31 - INFO - __main__ - Step 7583: {'lr': 0.0004982470972557936, 'samples': 1455936, 'steps': 7582, 'loss/train': 2.4827017784118652} 11/06/2021 22:18:32 - INFO - __main__ - Step 7584: {'lr': 0.0004982464698793995, 'samples': 1456128, 'steps': 7583, 'loss/train': 1.831977128982544} 11/06/2021 22:18:32 - INFO - __main__ - Step 7585: {'lr': 0.0004982458423911495, 'samples': 1456320, 'steps': 7584, 'loss/train': 2.0919387340545654} 11/06/2021 22:18:32 - INFO - __main__ - Step 7586: {'lr': 0.0004982452147910437, 'samples': 1456512, 'steps': 7585, 'loss/train': 2.820591449737549} 11/06/2021 22:18:33 - INFO - __main__ - Step 7587: {'lr': 0.0004982445870790823, 'samples': 1456704, 'steps': 7586, 'loss/train': 1.7887145280838013} 11/06/2021 22:18:34 - INFO - __main__ - Step 7588: {'lr': 0.0004982439592552658, 'samples': 1456896, 'steps': 7587, 'loss/train': 1.2820543050765991} 11/06/2021 22:18:35 - INFO - __main__ - Step 7589: {'lr': 0.0004982433313195945, 'samples': 1457088, 'steps': 7588, 'loss/train': 2.508904218673706} 11/06/2021 22:18:35 - INFO - __main__ - Step 7590: {'lr': 0.0004982427032720685, 'samples': 1457280, 'steps': 7589, 'loss/train': 1.968804121017456} 11/06/2021 22:18:35 - INFO - __main__ - Step 7591: {'lr': 0.0004982420751126882, 'samples': 1457472, 'steps': 7590, 'loss/train': 2.2515952587127686} 11/06/2021 22:18:36 - INFO - __main__ - Step 7592: {'lr': 0.0004982414468414538, 'samples': 1457664, 'steps': 7591, 'loss/train': 2.3000688552856445} 11/06/2021 22:18:36 - INFO - __main__ - Step 7593: {'lr': 0.0004982408184583656, 'samples': 1457856, 'steps': 7592, 'loss/train': 1.6771053075790405} 11/06/2021 22:18:37 - INFO - __main__ - Step 7594: {'lr': 0.000498240189963424, 'samples': 1458048, 'steps': 7593, 'loss/train': 1.5368093252182007} 11/06/2021 22:18:37 - INFO - __main__ - Step 7595: {'lr': 0.0004982395613566291, 'samples': 1458240, 'steps': 7594, 'loss/train': 2.213350772857666} 11/06/2021 22:18:38 - INFO - __main__ - Step 7596: {'lr': 0.0004982389326379814, 'samples': 1458432, 'steps': 7595, 'loss/train': 1.887166142463684} 11/06/2021 22:18:38 - INFO - __main__ - Step 7597: {'lr': 0.000498238303807481, 'samples': 1458624, 'steps': 7596, 'loss/train': 2.0999624729156494} 11/06/2021 22:18:38 - INFO - __main__ - Step 7598: {'lr': 0.0004982376748651283, 'samples': 1458816, 'steps': 7597, 'loss/train': 1.3151088953018188} 11/06/2021 22:18:39 - INFO - __main__ - Step 7599: {'lr': 0.0004982370458109235, 'samples': 1459008, 'steps': 7598, 'loss/train': 1.5262982845306396} 11/06/2021 22:18:40 - INFO - __main__ - Step 7600: {'lr': 0.0004982364166448669, 'samples': 1459200, 'steps': 7599, 'loss/train': 1.9225597381591797} 11/06/2021 22:18:40 - INFO - __main__ - Step 7601: {'lr': 0.0004982357873669588, 'samples': 1459392, 'steps': 7600, 'loss/train': 1.6969056129455566} 11/06/2021 22:18:40 - INFO - __main__ - Step 7602: {'lr': 0.0004982351579771995, 'samples': 1459584, 'steps': 7601, 'loss/train': 1.8867172002792358} 11/06/2021 22:18:41 - INFO - __main__ - Step 7603: {'lr': 0.0004982345284755893, 'samples': 1459776, 'steps': 7602, 'loss/train': 2.111384868621826} 11/06/2021 22:18:41 - INFO - __main__ - Step 7604: {'lr': 0.0004982338988621284, 'samples': 1459968, 'steps': 7603, 'loss/train': 1.420168161392212} 11/06/2021 22:18:42 - INFO - __main__ - Step 7605: {'lr': 0.0004982332691368172, 'samples': 1460160, 'steps': 7604, 'loss/train': 2.9915659427642822} 11/06/2021 22:18:43 - INFO - __main__ - Step 7606: {'lr': 0.0004982326392996559, 'samples': 1460352, 'steps': 7605, 'loss/train': 1.3998850584030151} 11/06/2021 22:18:43 - INFO - __main__ - Step 7607: {'lr': 0.0004982320093506449, 'samples': 1460544, 'steps': 7606, 'loss/train': 1.5841953754425049} 11/06/2021 22:18:43 - INFO - __main__ - Step 7608: {'lr': 0.0004982313792897843, 'samples': 1460736, 'steps': 7607, 'loss/train': 1.2424761056900024} 11/06/2021 22:18:44 - INFO - __main__ - Step 7609: {'lr': 0.0004982307491170744, 'samples': 1460928, 'steps': 7608, 'loss/train': 2.063056707382202} 11/06/2021 22:18:45 - INFO - __main__ - Step 7610: {'lr': 0.0004982301188325156, 'samples': 1461120, 'steps': 7609, 'loss/train': 1.6519533395767212} 11/06/2021 22:18:45 - INFO - __main__ - Step 7611: {'lr': 0.0004982294884361081, 'samples': 1461312, 'steps': 7610, 'loss/train': 1.2619507312774658} 11/06/2021 22:18:45 - INFO - __main__ - Step 7612: {'lr': 0.0004982288579278522, 'samples': 1461504, 'steps': 7611, 'loss/train': 1.7545194625854492} 11/06/2021 22:18:46 - INFO - __main__ - Step 7613: {'lr': 0.0004982282273077483, 'samples': 1461696, 'steps': 7612, 'loss/train': 1.8897991180419922} 11/06/2021 22:18:46 - INFO - __main__ - Step 7614: {'lr': 0.0004982275965757965, 'samples': 1461888, 'steps': 7613, 'loss/train': 1.383413314819336} 11/06/2021 22:18:47 - INFO - __main__ - Step 7615: {'lr': 0.0004982269657319974, 'samples': 1462080, 'steps': 7614, 'loss/train': 1.955425500869751} 11/06/2021 22:18:47 - INFO - __main__ - Step 7616: {'lr': 0.0004982263347763508, 'samples': 1462272, 'steps': 7615, 'loss/train': 1.6622871160507202} 11/06/2021 22:18:48 - INFO - __main__ - Step 7617: {'lr': 0.0004982257037088574, 'samples': 1462464, 'steps': 7616, 'loss/train': 1.82854425907135} 11/06/2021 22:18:48 - INFO - __main__ - Step 7618: {'lr': 0.0004982250725295173, 'samples': 1462656, 'steps': 7617, 'loss/train': 2.0947282314300537} 11/06/2021 22:18:48 - INFO - __main__ - Step 7619: {'lr': 0.0004982244412383307, 'samples': 1462848, 'steps': 7618, 'loss/train': 1.5993701219558716} 11/06/2021 22:18:49 - INFO - __main__ - Step 7620: {'lr': 0.0004982238098352981, 'samples': 1463040, 'steps': 7619, 'loss/train': 2.0519866943359375} 11/06/2021 22:18:50 - INFO - __main__ - Step 7621: {'lr': 0.0004982231783204196, 'samples': 1463232, 'steps': 7620, 'loss/train': 1.768385887145996} 11/06/2021 22:18:50 - INFO - __main__ - Step 7622: {'lr': 0.0004982225466936957, 'samples': 1463424, 'steps': 7621, 'loss/train': 2.114351749420166} 11/06/2021 22:18:51 - INFO - __main__ - Step 7623: {'lr': 0.0004982219149551265, 'samples': 1463616, 'steps': 7622, 'loss/train': 2.196870803833008} 11/06/2021 22:18:51 - INFO - __main__ - Step 7624: {'lr': 0.0004982212831047123, 'samples': 1463808, 'steps': 7623, 'loss/train': 2.04628586769104} 11/06/2021 22:18:51 - INFO - __main__ - Step 7625: {'lr': 0.0004982206511424534, 'samples': 1464000, 'steps': 7624, 'loss/train': 1.2371141910552979} 11/06/2021 22:18:52 - INFO - __main__ - Step 7626: {'lr': 0.0004982200190683502, 'samples': 1464192, 'steps': 7625, 'loss/train': 1.4562172889709473} 11/06/2021 22:18:53 - INFO - __main__ - Step 7627: {'lr': 0.0004982193868824028, 'samples': 1464384, 'steps': 7626, 'loss/train': 1.9983298778533936} 11/06/2021 22:18:53 - INFO - __main__ - Step 7628: {'lr': 0.0004982187545846116, 'samples': 1464576, 'steps': 7627, 'loss/train': 2.135972261428833} 11/06/2021 22:18:53 - INFO - __main__ - Step 7629: {'lr': 0.0004982181221749769, 'samples': 1464768, 'steps': 7628, 'loss/train': 2.405860424041748} 11/06/2021 22:18:54 - INFO - __main__ - Step 7630: {'lr': 0.0004982174896534989, 'samples': 1464960, 'steps': 7629, 'loss/train': 1.3668652772903442} 11/06/2021 22:18:55 - INFO - __main__ - Step 7631: {'lr': 0.0004982168570201779, 'samples': 1465152, 'steps': 7630, 'loss/train': 1.261794924736023} 11/06/2021 22:18:55 - INFO - __main__ - Step 7632: {'lr': 0.0004982162242750143, 'samples': 1465344, 'steps': 7631, 'loss/train': 1.8205006122589111} 11/06/2021 22:18:55 - INFO - __main__ - Step 7633: {'lr': 0.0004982155914180082, 'samples': 1465536, 'steps': 7632, 'loss/train': 1.8356959819793701} 11/06/2021 22:18:56 - INFO - __main__ - Step 7634: {'lr': 0.0004982149584491601, 'samples': 1465728, 'steps': 7633, 'loss/train': 2.227440357208252} 11/06/2021 22:18:56 - INFO - __main__ - Step 7635: {'lr': 0.0004982143253684701, 'samples': 1465920, 'steps': 7634, 'loss/train': 1.8960446119308472} 11/06/2021 22:18:57 - INFO - __main__ - Step 7636: {'lr': 0.0004982136921759385, 'samples': 1466112, 'steps': 7635, 'loss/train': 1.77176833152771} 11/06/2021 22:18:57 - INFO - __main__ - Step 7637: {'lr': 0.0004982130588715657, 'samples': 1466304, 'steps': 7636, 'loss/train': 1.8046529293060303} 11/06/2021 22:18:58 - INFO - __main__ - Step 7638: {'lr': 0.000498212425455352, 'samples': 1466496, 'steps': 7637, 'loss/train': 1.6778465509414673} 11/06/2021 22:18:58 - INFO - __main__ - Step 7639: {'lr': 0.0004982117919272975, 'samples': 1466688, 'steps': 7638, 'loss/train': 1.662865400314331} 11/06/2021 22:18:59 - INFO - __main__ - Step 7640: {'lr': 0.0004982111582874026, 'samples': 1466880, 'steps': 7639, 'loss/train': 1.8962922096252441} 11/06/2021 22:18:59 - INFO - __main__ - Step 7641: {'lr': 0.0004982105245356676, 'samples': 1467072, 'steps': 7640, 'loss/train': 1.8994383811950684} 11/06/2021 22:19:00 - INFO - __main__ - Step 7642: {'lr': 0.0004982098906720928, 'samples': 1467264, 'steps': 7641, 'loss/train': 1.8286490440368652} 11/06/2021 22:19:00 - INFO - __main__ - Step 7643: {'lr': 0.0004982092566966785, 'samples': 1467456, 'steps': 7642, 'loss/train': 1.5329943895339966} 11/06/2021 22:19:01 - INFO - __main__ - Step 7644: {'lr': 0.0004982086226094248, 'samples': 1467648, 'steps': 7643, 'loss/train': 2.1548547744750977} 11/06/2021 22:19:01 - INFO - __main__ - Step 7645: {'lr': 0.0004982079884103322, 'samples': 1467840, 'steps': 7644, 'loss/train': 2.1634531021118164} 11/06/2021 22:19:02 - INFO - __main__ - Step 7646: {'lr': 0.0004982073540994009, 'samples': 1468032, 'steps': 7645, 'loss/train': 2.008009195327759} 11/06/2021 22:19:02 - INFO - __main__ - Step 7647: {'lr': 0.0004982067196766312, 'samples': 1468224, 'steps': 7646, 'loss/train': 1.8204351663589478} 11/06/2021 22:19:03 - INFO - __main__ - Step 7648: {'lr': 0.0004982060851420235, 'samples': 1468416, 'steps': 7647, 'loss/train': 2.1249916553497314} 11/06/2021 22:19:03 - INFO - __main__ - Step 7649: {'lr': 0.0004982054504955778, 'samples': 1468608, 'steps': 7648, 'loss/train': 1.6242306232452393} 11/06/2021 22:19:03 - INFO - __main__ - Step 7650: {'lr': 0.0004982048157372946, 'samples': 1468800, 'steps': 7649, 'loss/train': 1.693130612373352} 11/06/2021 22:19:04 - INFO - __main__ - Step 7651: {'lr': 0.0004982041808671741, 'samples': 1468992, 'steps': 7650, 'loss/train': 2.0973668098449707} 11/06/2021 22:19:05 - INFO - __main__ - Step 7652: {'lr': 0.0004982035458852168, 'samples': 1469184, 'steps': 7651, 'loss/train': 2.164299488067627} 11/06/2021 22:19:05 - INFO - __main__ - Step 7653: {'lr': 0.0004982029107914226, 'samples': 1469376, 'steps': 7652, 'loss/train': 2.062432289123535} 11/06/2021 22:19:05 - INFO - __main__ - Step 7654: {'lr': 0.0004982022755857921, 'samples': 1469568, 'steps': 7653, 'loss/train': 2.0671913623809814} 11/06/2021 22:19:06 - INFO - __main__ - Step 7655: {'lr': 0.0004982016402683255, 'samples': 1469760, 'steps': 7654, 'loss/train': 2.361668348312378} 11/06/2021 22:19:06 - INFO - __main__ - Step 7656: {'lr': 0.000498201004839023, 'samples': 1469952, 'steps': 7655, 'loss/train': 1.3417774438858032} 11/06/2021 22:19:07 - INFO - __main__ - Step 7657: {'lr': 0.000498200369297885, 'samples': 1470144, 'steps': 7656, 'loss/train': 1.8992770910263062} 11/06/2021 22:19:08 - INFO - __main__ - Step 7658: {'lr': 0.0004981997336449118, 'samples': 1470336, 'steps': 7657, 'loss/train': 1.9563990831375122} 11/06/2021 22:19:08 - INFO - __main__ - Step 7659: {'lr': 0.0004981990978801035, 'samples': 1470528, 'steps': 7658, 'loss/train': 2.046013593673706} 11/06/2021 22:19:08 - INFO - __main__ - Step 7660: {'lr': 0.0004981984620034606, 'samples': 1470720, 'steps': 7659, 'loss/train': 1.6640254259109497} 11/06/2021 22:19:09 - INFO - __main__ - Step 7661: {'lr': 0.0004981978260149833, 'samples': 1470912, 'steps': 7660, 'loss/train': 1.9074742794036865} 11/06/2021 22:19:10 - INFO - __main__ - Step 7662: {'lr': 0.0004981971899146719, 'samples': 1471104, 'steps': 7661, 'loss/train': 2.0889925956726074} 11/06/2021 22:19:10 - INFO - __main__ - Step 7663: {'lr': 0.0004981965537025267, 'samples': 1471296, 'steps': 7662, 'loss/train': 2.0140273571014404} 11/06/2021 22:19:10 - INFO - __main__ - Step 7664: {'lr': 0.000498195917378548, 'samples': 1471488, 'steps': 7663, 'loss/train': 1.5945501327514648} 11/06/2021 22:19:11 - INFO - __main__ - Step 7665: {'lr': 0.0004981952809427359, 'samples': 1471680, 'steps': 7664, 'loss/train': 1.7172966003417969} 11/06/2021 22:19:11 - INFO - __main__ - Step 7666: {'lr': 0.0004981946443950909, 'samples': 1471872, 'steps': 7665, 'loss/train': 1.3649691343307495} 11/06/2021 22:19:12 - INFO - __main__ - Step 7667: {'lr': 0.0004981940077356132, 'samples': 1472064, 'steps': 7666, 'loss/train': 0.8751164674758911} 11/06/2021 22:19:12 - INFO - __main__ - Step 7668: {'lr': 0.0004981933709643032, 'samples': 1472256, 'steps': 7667, 'loss/train': 1.916032314300537} 11/06/2021 22:19:13 - INFO - __main__ - Step 7669: {'lr': 0.000498192734081161, 'samples': 1472448, 'steps': 7668, 'loss/train': 1.9548919200897217} 11/06/2021 22:19:13 - INFO - __main__ - Step 7670: {'lr': 0.000498192097086187, 'samples': 1472640, 'steps': 7669, 'loss/train': 1.8466298580169678} 11/06/2021 22:19:13 - INFO - __main__ - Step 7671: {'lr': 0.0004981914599793816, 'samples': 1472832, 'steps': 7670, 'loss/train': 3.0567245483398438} 11/06/2021 22:19:14 - INFO - __main__ - Step 7672: {'lr': 0.0004981908227607448, 'samples': 1473024, 'steps': 7671, 'loss/train': 1.7764103412628174} 11/06/2021 22:19:15 - INFO - __main__ - Step 7673: {'lr': 0.0004981901854302771, 'samples': 1473216, 'steps': 7672, 'loss/train': 2.425710678100586} 11/06/2021 22:19:15 - INFO - __main__ - Step 7674: {'lr': 0.0004981895479879787, 'samples': 1473408, 'steps': 7673, 'loss/train': 1.5297969579696655} 11/06/2021 22:19:15 - INFO - __main__ - Step 7675: {'lr': 0.0004981889104338499, 'samples': 1473600, 'steps': 7674, 'loss/train': 1.673049807548523} 11/06/2021 22:19:16 - INFO - __main__ - Step 7676: {'lr': 0.0004981882727678912, 'samples': 1473792, 'steps': 7675, 'loss/train': 1.8792694807052612} 11/06/2021 22:19:17 - INFO - __main__ - Step 7677: {'lr': 0.0004981876349901025, 'samples': 1473984, 'steps': 7676, 'loss/train': 1.6450796127319336} 11/06/2021 22:19:17 - INFO - __main__ - Step 7678: {'lr': 0.0004981869971004843, 'samples': 1474176, 'steps': 7677, 'loss/train': 1.8612487316131592} 11/06/2021 22:19:18 - INFO - __main__ - Step 7679: {'lr': 0.0004981863590990369, 'samples': 1474368, 'steps': 7678, 'loss/train': 1.9628400802612305} 11/06/2021 22:19:18 - INFO - __main__ - Step 7680: {'lr': 0.0004981857209857605, 'samples': 1474560, 'steps': 7679, 'loss/train': 2.1773593425750732} 11/06/2021 22:19:18 - INFO - __main__ - Step 7681: {'lr': 0.0004981850827606556, 'samples': 1474752, 'steps': 7680, 'loss/train': 2.009646415710449} 11/06/2021 22:19:19 - INFO - __main__ - Step 7682: {'lr': 0.0004981844444237223, 'samples': 1474944, 'steps': 7681, 'loss/train': 1.6620737314224243} 11/06/2021 22:19:20 - INFO - __main__ - Step 7683: {'lr': 0.0004981838059749607, 'samples': 1475136, 'steps': 7682, 'loss/train': 1.8136749267578125} 11/06/2021 22:19:20 - INFO - __main__ - Step 7684: {'lr': 0.0004981831674143716, 'samples': 1475328, 'steps': 7683, 'loss/train': 2.1339547634124756} 11/06/2021 22:19:20 - INFO - __main__ - Step 7685: {'lr': 0.0004981825287419549, 'samples': 1475520, 'steps': 7684, 'loss/train': 1.375860571861267} 11/06/2021 22:19:21 - INFO - __main__ - Step 7686: {'lr': 0.0004981818899577108, 'samples': 1475712, 'steps': 7685, 'loss/train': 1.8953708410263062} 11/06/2021 22:19:21 - INFO - __main__ - Step 7687: {'lr': 0.0004981812510616399, 'samples': 1475904, 'steps': 7686, 'loss/train': 1.9942034482955933} 11/06/2021 22:19:22 - INFO - __main__ - Step 7688: {'lr': 0.0004981806120537424, 'samples': 1476096, 'steps': 7687, 'loss/train': 1.7413487434387207} 11/06/2021 22:19:22 - INFO - __main__ - Step 7689: {'lr': 0.0004981799729340185, 'samples': 1476288, 'steps': 7688, 'loss/train': 1.861914038658142} 11/06/2021 22:19:23 - INFO - __main__ - Step 7690: {'lr': 0.0004981793337024685, 'samples': 1476480, 'steps': 7689, 'loss/train': 1.644492745399475} 11/06/2021 22:19:23 - INFO - __main__ - Step 7691: {'lr': 0.0004981786943590928, 'samples': 1476672, 'steps': 7690, 'loss/train': 1.96816885471344} 11/06/2021 22:19:24 - INFO - __main__ - Step 7692: {'lr': 0.0004981780549038916, 'samples': 1476864, 'steps': 7691, 'loss/train': 1.6738519668579102} 11/06/2021 22:19:25 - INFO - __main__ - Step 7693: {'lr': 0.0004981774153368651, 'samples': 1477056, 'steps': 7692, 'loss/train': 1.7269313335418701} 11/06/2021 22:19:25 - INFO - __main__ - Step 7694: {'lr': 0.0004981767756580138, 'samples': 1477248, 'steps': 7693, 'loss/train': 1.7965588569641113} 11/06/2021 22:19:25 - INFO - __main__ - Step 7695: {'lr': 0.0004981761358673378, 'samples': 1477440, 'steps': 7694, 'loss/train': 2.0630042552948} 11/06/2021 22:19:26 - INFO - __main__ - Step 7696: {'lr': 0.0004981754959648376, 'samples': 1477632, 'steps': 7695, 'loss/train': 1.89278244972229} 11/06/2021 22:19:26 - INFO - __main__ - Step 7697: {'lr': 0.0004981748559505131, 'samples': 1477824, 'steps': 7696, 'loss/train': 1.7540265321731567} 11/06/2021 22:19:27 - INFO - __main__ - Step 7698: {'lr': 0.0004981742158243651, 'samples': 1478016, 'steps': 7697, 'loss/train': 1.4253634214401245} 11/06/2021 22:19:27 - INFO - __main__ - Step 7699: {'lr': 0.0004981735755863934, 'samples': 1478208, 'steps': 7698, 'loss/train': 2.059159517288208} 11/06/2021 22:19:28 - INFO - __main__ - Step 7700: {'lr': 0.0004981729352365986, 'samples': 1478400, 'steps': 7699, 'loss/train': 1.953817367553711} 11/06/2021 22:19:28 - INFO - __main__ - Step 7701: {'lr': 0.0004981722947749811, 'samples': 1478592, 'steps': 7700, 'loss/train': 1.9342195987701416} 11/06/2021 22:19:28 - INFO - __main__ - Step 7702: {'lr': 0.0004981716542015408, 'samples': 1478784, 'steps': 7701, 'loss/train': 1.841950535774231} 11/06/2021 22:19:29 - INFO - __main__ - Step 7703: {'lr': 0.0004981710135162781, 'samples': 1478976, 'steps': 7702, 'loss/train': 1.5683513879776} 11/06/2021 22:19:30 - INFO - __main__ - Step 7704: {'lr': 0.0004981703727191935, 'samples': 1479168, 'steps': 7703, 'loss/train': 1.7264535427093506} 11/06/2021 22:19:30 - INFO - __main__ - Step 7705: {'lr': 0.0004981697318102872, 'samples': 1479360, 'steps': 7704, 'loss/train': 2.026911973953247} 11/06/2021 22:19:31 - INFO - __main__ - Step 7706: {'lr': 0.0004981690907895594, 'samples': 1479552, 'steps': 7705, 'loss/train': 1.9870184659957886} 11/06/2021 22:19:31 - INFO - __main__ - Step 7707: {'lr': 0.0004981684496570104, 'samples': 1479744, 'steps': 7706, 'loss/train': 1.9878273010253906} 11/06/2021 22:19:31 - INFO - __main__ - Step 7708: {'lr': 0.0004981678084126405, 'samples': 1479936, 'steps': 7707, 'loss/train': 1.5540006160736084} 11/06/2021 22:19:32 - INFO - __main__ - Step 7709: {'lr': 0.0004981671670564502, 'samples': 1480128, 'steps': 7708, 'loss/train': 1.544403314590454} 11/06/2021 22:19:33 - INFO - __main__ - Step 7710: {'lr': 0.0004981665255884394, 'samples': 1480320, 'steps': 7709, 'loss/train': 2.273404598236084} 11/06/2021 22:19:33 - INFO - __main__ - Step 7711: {'lr': 0.0004981658840086087, 'samples': 1480512, 'steps': 7710, 'loss/train': 2.1462695598602295} 11/06/2021 22:19:33 - INFO - __main__ - Step 7712: {'lr': 0.0004981652423169582, 'samples': 1480704, 'steps': 7711, 'loss/train': 1.3098105192184448} 11/06/2021 22:19:34 - INFO - __main__ - Step 7713: {'lr': 0.0004981646005134884, 'samples': 1480896, 'steps': 7712, 'loss/train': 1.9249402284622192} 11/06/2021 22:19:35 - INFO - __main__ - Step 7714: {'lr': 0.0004981639585981993, 'samples': 1481088, 'steps': 7713, 'loss/train': 1.955265760421753} 11/06/2021 22:19:35 - INFO - __main__ - Step 7715: {'lr': 0.0004981633165710914, 'samples': 1481280, 'steps': 7714, 'loss/train': 2.254359722137451} 11/06/2021 22:19:35 - INFO - __main__ - Step 7716: {'lr': 0.000498162674432165, 'samples': 1481472, 'steps': 7715, 'loss/train': 1.0957801342010498} 11/06/2021 22:19:36 - INFO - __main__ - Step 7717: {'lr': 0.0004981620321814203, 'samples': 1481664, 'steps': 7716, 'loss/train': 1.9144853353500366} 11/06/2021 22:19:36 - INFO - __main__ - Step 7718: {'lr': 0.0004981613898188576, 'samples': 1481856, 'steps': 7717, 'loss/train': 1.7931559085845947} 11/06/2021 22:19:37 - INFO - __main__ - Step 7719: {'lr': 0.0004981607473444772, 'samples': 1482048, 'steps': 7718, 'loss/train': 1.8735288381576538} 11/06/2021 22:19:37 - INFO - __main__ - Step 7720: {'lr': 0.0004981601047582794, 'samples': 1482240, 'steps': 7719, 'loss/train': 1.727927327156067} 11/06/2021 22:19:38 - INFO - __main__ - Step 7721: {'lr': 0.0004981594620602645, 'samples': 1482432, 'steps': 7720, 'loss/train': 1.948897361755371} 11/06/2021 22:19:38 - INFO - __main__ - Step 7722: {'lr': 0.0004981588192504329, 'samples': 1482624, 'steps': 7721, 'loss/train': 0.8639780879020691} 11/06/2021 22:19:38 - INFO - __main__ - Step 7723: {'lr': 0.0004981581763287845, 'samples': 1482816, 'steps': 7722, 'loss/train': 2.073319673538208} 11/06/2021 22:19:39 - INFO - __main__ - Step 7724: {'lr': 0.0004981575332953201, 'samples': 1483008, 'steps': 7723, 'loss/train': 1.5656884908676147} 11/06/2021 22:19:40 - INFO - __main__ - Step 7725: {'lr': 0.0004981568901500396, 'samples': 1483200, 'steps': 7724, 'loss/train': 1.949126124382019} 11/06/2021 22:19:40 - INFO - __main__ - Step 7726: {'lr': 0.0004981562468929435, 'samples': 1483392, 'steps': 7725, 'loss/train': 2.107515811920166} 11/06/2021 22:19:40 - INFO - __main__ - Step 7727: {'lr': 0.000498155603524032, 'samples': 1483584, 'steps': 7726, 'loss/train': 1.8496572971343994} 11/06/2021 22:19:41 - INFO - __main__ - Step 7728: {'lr': 0.0004981549600433054, 'samples': 1483776, 'steps': 7727, 'loss/train': 2.109135627746582} 11/06/2021 22:19:42 - INFO - __main__ - Step 7729: {'lr': 0.000498154316450764, 'samples': 1483968, 'steps': 7728, 'loss/train': 1.3323477506637573} 11/06/2021 22:19:43 - INFO - __main__ - Step 7730: {'lr': 0.0004981536727464082, 'samples': 1484160, 'steps': 7729, 'loss/train': 1.855273962020874} 11/06/2021 22:19:43 - INFO - __main__ - Step 7731: {'lr': 0.0004981530289302381, 'samples': 1484352, 'steps': 7730, 'loss/train': 0.4355550706386566} 11/06/2021 22:19:43 - INFO - __main__ - Step 7732: {'lr': 0.000498152385002254, 'samples': 1484544, 'steps': 7731, 'loss/train': 2.0949268341064453} 11/06/2021 22:19:44 - INFO - __main__ - Step 7733: {'lr': 0.0004981517409624564, 'samples': 1484736, 'steps': 7732, 'loss/train': 2.226652145385742} 11/06/2021 22:19:44 - INFO - __main__ - Step 7734: {'lr': 0.0004981510968108453, 'samples': 1484928, 'steps': 7733, 'loss/train': 2.1331582069396973} 11/06/2021 22:19:45 - INFO - __main__ - Step 7735: {'lr': 0.0004981504525474214, 'samples': 1485120, 'steps': 7734, 'loss/train': 1.9985811710357666} 11/06/2021 22:19:45 - INFO - __main__ - Step 7736: {'lr': 0.0004981498081721845, 'samples': 1485312, 'steps': 7735, 'loss/train': 2.0506227016448975} 11/06/2021 22:19:46 - INFO - __main__ - Step 7737: {'lr': 0.0004981491636851351, 'samples': 1485504, 'steps': 7736, 'loss/train': 1.7922340631484985} 11/06/2021 22:19:46 - INFO - __main__ - Step 7738: {'lr': 0.0004981485190862737, 'samples': 1485696, 'steps': 7737, 'loss/train': 1.6075444221496582} 11/06/2021 22:19:46 - INFO - __main__ - Step 7739: {'lr': 0.0004981478743756004, 'samples': 1485888, 'steps': 7738, 'loss/train': 1.6328068971633911} 11/06/2021 22:19:47 - INFO - __main__ - Step 7740: {'lr': 0.0004981472295531153, 'samples': 1486080, 'steps': 7739, 'loss/train': 2.0658867359161377} 11/06/2021 22:19:48 - INFO - __main__ - Step 7741: {'lr': 0.000498146584618819, 'samples': 1486272, 'steps': 7740, 'loss/train': 2.0356781482696533} 11/06/2021 22:19:48 - INFO - __main__ - Step 7742: {'lr': 0.0004981459395727117, 'samples': 1486464, 'steps': 7741, 'loss/train': 2.1248600482940674} 11/06/2021 22:19:49 - INFO - __main__ - Step 7743: {'lr': 0.0004981452944147937, 'samples': 1486656, 'steps': 7742, 'loss/train': 2.0457077026367188} 11/06/2021 22:19:49 - INFO - __main__ - Step 7744: {'lr': 0.0004981446491450652, 'samples': 1486848, 'steps': 7743, 'loss/train': 2.0596790313720703} 11/06/2021 22:19:50 - INFO - __main__ - Step 7745: {'lr': 0.0004981440037635266, 'samples': 1487040, 'steps': 7744, 'loss/train': 1.76706862449646} 11/06/2021 22:19:50 - INFO - __main__ - Step 7746: {'lr': 0.0004981433582701781, 'samples': 1487232, 'steps': 7745, 'loss/train': 1.9943227767944336} 11/06/2021 22:19:51 - INFO - __main__ - Step 7747: {'lr': 0.00049814271266502, 'samples': 1487424, 'steps': 7746, 'loss/train': 1.9205338954925537} 11/06/2021 22:19:51 - INFO - __main__ - Step 7748: {'lr': 0.0004981420669480526, 'samples': 1487616, 'steps': 7747, 'loss/train': 1.346489667892456} 11/06/2021 22:19:51 - INFO - __main__ - Step 7749: {'lr': 0.0004981414211192763, 'samples': 1487808, 'steps': 7748, 'loss/train': 2.3146755695343018} 11/06/2021 22:19:52 - INFO - __main__ - Step 7750: {'lr': 0.0004981407751786913, 'samples': 1488000, 'steps': 7749, 'loss/train': 1.654209852218628} 11/06/2021 22:19:53 - INFO - __main__ - Step 7751: {'lr': 0.0004981401291262979, 'samples': 1488192, 'steps': 7750, 'loss/train': 0.34622922539711} 11/06/2021 22:19:53 - INFO - __main__ - Step 7752: {'lr': 0.0004981394829620963, 'samples': 1488384, 'steps': 7751, 'loss/train': 1.7355235815048218} 11/06/2021 22:19:53 - INFO - __main__ - Step 7753: {'lr': 0.0004981388366860869, 'samples': 1488576, 'steps': 7752, 'loss/train': 1.484642744064331} 11/06/2021 22:19:54 - INFO - __main__ - Step 7754: {'lr': 0.0004981381902982702, 'samples': 1488768, 'steps': 7753, 'loss/train': 1.2855114936828613} 11/06/2021 22:19:54 - INFO - __main__ - Step 7755: {'lr': 0.0004981375437986459, 'samples': 1488960, 'steps': 7754, 'loss/train': 1.6895129680633545} 11/06/2021 22:19:55 - INFO - __main__ - Step 7756: {'lr': 0.0004981368971872149, 'samples': 1489152, 'steps': 7755, 'loss/train': 1.9353357553482056} 11/06/2021 22:19:55 - INFO - __main__ - Step 7757: {'lr': 0.0004981362504639772, 'samples': 1489344, 'steps': 7756, 'loss/train': 1.9244894981384277} 11/06/2021 22:19:56 - INFO - __main__ - Step 7758: {'lr': 0.0004981356036289331, 'samples': 1489536, 'steps': 7757, 'loss/train': 1.332854151725769} 11/06/2021 22:19:56 - INFO - __main__ - Step 7759: {'lr': 0.0004981349566820828, 'samples': 1489728, 'steps': 7758, 'loss/train': 1.9951647520065308} 11/06/2021 22:19:57 - INFO - __main__ - Step 7760: {'lr': 0.0004981343096234268, 'samples': 1489920, 'steps': 7759, 'loss/train': 1.7065505981445312} 11/06/2021 22:19:58 - INFO - __main__ - Step 7761: {'lr': 0.0004981336624529654, 'samples': 1490112, 'steps': 7760, 'loss/train': 1.5033037662506104} 11/06/2021 22:19:58 - INFO - __main__ - Step 7762: {'lr': 0.0004981330151706988, 'samples': 1490304, 'steps': 7761, 'loss/train': 1.2406482696533203} 11/06/2021 22:19:58 - INFO - __main__ - Step 7763: {'lr': 0.0004981323677766273, 'samples': 1490496, 'steps': 7762, 'loss/train': 2.313772678375244} 11/06/2021 22:19:59 - INFO - __main__ - Step 7764: {'lr': 0.000498131720270751, 'samples': 1490688, 'steps': 7763, 'loss/train': 1.9050623178482056} 11/06/2021 22:19:59 - INFO - __main__ - Step 7765: {'lr': 0.0004981310726530706, 'samples': 1490880, 'steps': 7764, 'loss/train': 1.5810691118240356} 11/06/2021 22:20:00 - INFO - __main__ - Step 7766: {'lr': 0.0004981304249235861, 'samples': 1491072, 'steps': 7765, 'loss/train': 1.6522554159164429} 11/06/2021 22:20:00 - INFO - __main__ - Step 7767: {'lr': 0.0004981297770822977, 'samples': 1491264, 'steps': 7766, 'loss/train': 2.1629531383514404} 11/06/2021 22:20:01 - INFO - __main__ - Step 7768: {'lr': 0.0004981291291292061, 'samples': 1491456, 'steps': 7767, 'loss/train': 2.0148024559020996} 11/06/2021 22:20:01 - INFO - __main__ - Step 7769: {'lr': 0.0004981284810643112, 'samples': 1491648, 'steps': 7768, 'loss/train': 1.8133976459503174} 11/06/2021 22:20:01 - INFO - __main__ - Step 7770: {'lr': 0.0004981278328876134, 'samples': 1491840, 'steps': 7769, 'loss/train': 1.844529151916504} 11/06/2021 22:20:02 - INFO - __main__ - Step 7771: {'lr': 0.0004981271845991131, 'samples': 1492032, 'steps': 7770, 'loss/train': 1.6602436304092407} 11/06/2021 22:20:03 - INFO - __main__ - Step 7772: {'lr': 0.0004981265361988105, 'samples': 1492224, 'steps': 7771, 'loss/train': 1.7457406520843506} 11/06/2021 22:20:03 - INFO - __main__ - Step 7773: {'lr': 0.000498125887686706, 'samples': 1492416, 'steps': 7772, 'loss/train': 1.8187391757965088} 11/06/2021 22:20:03 - INFO - __main__ - Step 7774: {'lr': 0.0004981252390627997, 'samples': 1492608, 'steps': 7773, 'loss/train': 2.382383108139038} 11/06/2021 22:20:04 - INFO - __main__ - Step 7775: {'lr': 0.000498124590327092, 'samples': 1492800, 'steps': 7774, 'loss/train': 2.054161310195923} 11/06/2021 22:20:05 - INFO - __main__ - Step 7776: {'lr': 0.0004981239414795832, 'samples': 1492992, 'steps': 7775, 'loss/train': 1.5990222692489624} 11/06/2021 22:20:05 - INFO - __main__ - Step 7777: {'lr': 0.0004981232925202736, 'samples': 1493184, 'steps': 7776, 'loss/train': 1.856432318687439} 11/06/2021 22:20:06 - INFO - __main__ - Step 7778: {'lr': 0.0004981226434491635, 'samples': 1493376, 'steps': 7777, 'loss/train': 1.7675282955169678} 11/06/2021 22:20:06 - INFO - __main__ - Step 7779: {'lr': 0.000498121994266253, 'samples': 1493568, 'steps': 7778, 'loss/train': 1.9787876605987549} 11/06/2021 22:20:07 - INFO - __main__ - Step 7780: {'lr': 0.0004981213449715427, 'samples': 1493760, 'steps': 7779, 'loss/train': 2.1368026733398438} 11/06/2021 22:20:07 - INFO - __main__ - Step 7781: {'lr': 0.0004981206955650328, 'samples': 1493952, 'steps': 7780, 'loss/train': 1.7741491794586182} 11/06/2021 22:20:08 - INFO - __main__ - Step 7782: {'lr': 0.0004981200460467234, 'samples': 1494144, 'steps': 7781, 'loss/train': 1.8448156118392944} 11/06/2021 22:20:08 - INFO - __main__ - Step 7783: {'lr': 0.0004981193964166151, 'samples': 1494336, 'steps': 7782, 'loss/train': 2.476020097732544} 11/06/2021 22:20:09 - INFO - __main__ - Step 7784: {'lr': 0.0004981187466747079, 'samples': 1494528, 'steps': 7783, 'loss/train': 1.604864478111267} 11/06/2021 22:20:09 - INFO - __main__ - Step 7785: {'lr': 0.0004981180968210023, 'samples': 1494720, 'steps': 7784, 'loss/train': 1.6935956478118896} 11/06/2021 22:20:09 - INFO - __main__ - Step 7786: {'lr': 0.0004981174468554984, 'samples': 1494912, 'steps': 7785, 'loss/train': 1.7149159908294678} 11/06/2021 22:20:10 - INFO - __main__ - Step 7787: {'lr': 0.0004981167967781968, 'samples': 1495104, 'steps': 7786, 'loss/train': 1.5972424745559692} 11/06/2021 22:20:11 - INFO - __main__ - Step 7788: {'lr': 0.0004981161465890975, 'samples': 1495296, 'steps': 7787, 'loss/train': 1.8924384117126465} 11/06/2021 22:20:11 - INFO - __main__ - Step 7789: {'lr': 0.0004981154962882008, 'samples': 1495488, 'steps': 7788, 'loss/train': 2.01082444190979} 11/06/2021 22:20:11 - INFO - __main__ - Step 7790: {'lr': 0.0004981148458755071, 'samples': 1495680, 'steps': 7789, 'loss/train': 2.0022146701812744} 11/06/2021 22:20:12 - INFO - __main__ - Step 7791: {'lr': 0.0004981141953510169, 'samples': 1495872, 'steps': 7790, 'loss/train': 1.9454317092895508} 11/06/2021 22:20:13 - INFO - __main__ - Step 7792: {'lr': 0.00049811354471473, 'samples': 1496064, 'steps': 7791, 'loss/train': 1.6417571306228638} 11/06/2021 22:20:13 - INFO - __main__ - Step 7793: {'lr': 0.0004981128939666471, 'samples': 1496256, 'steps': 7792, 'loss/train': 2.925060510635376} 11/06/2021 22:20:13 - INFO - __main__ - Step 7794: {'lr': 0.0004981122431067683, 'samples': 1496448, 'steps': 7793, 'loss/train': 2.0619399547576904} 11/06/2021 22:20:14 - INFO - __main__ - Step 7795: {'lr': 0.0004981115921350941, 'samples': 1496640, 'steps': 7794, 'loss/train': 1.3753095865249634} 11/06/2021 22:20:14 - INFO - __main__ - Step 7796: {'lr': 0.0004981109410516245, 'samples': 1496832, 'steps': 7795, 'loss/train': 1.4551738500595093} 11/06/2021 22:20:15 - INFO - __main__ - Step 7797: {'lr': 0.00049811028985636, 'samples': 1497024, 'steps': 7796, 'loss/train': 2.0298280715942383} 11/06/2021 22:20:15 - INFO - __main__ - Step 7798: {'lr': 0.0004981096385493007, 'samples': 1497216, 'steps': 7797, 'loss/train': 1.5669386386871338} 11/06/2021 22:20:16 - INFO - __main__ - Step 7799: {'lr': 0.0004981089871304472, 'samples': 1497408, 'steps': 7798, 'loss/train': 1.8886076211929321} 11/06/2021 22:20:16 - INFO - __main__ - Step 7800: {'lr': 0.0004981083355997995, 'samples': 1497600, 'steps': 7799, 'loss/train': 2.574296236038208} 11/06/2021 22:20:16 - INFO - __main__ - Step 7801: {'lr': 0.0004981076839573581, 'samples': 1497792, 'steps': 7800, 'loss/train': 0.9300814867019653} 11/06/2021 22:20:18 - INFO - __main__ - Step 7802: {'lr': 0.0004981070322031231, 'samples': 1497984, 'steps': 7801, 'loss/train': 1.7975281476974487} 11/06/2021 22:20:18 - INFO - __main__ - Step 7803: {'lr': 0.000498106380337095, 'samples': 1498176, 'steps': 7802, 'loss/train': 1.889115810394287} 11/06/2021 22:20:18 - INFO - __main__ - Step 7804: {'lr': 0.000498105728359274, 'samples': 1498368, 'steps': 7803, 'loss/train': 1.7939035892486572} 11/06/2021 22:20:19 - INFO - __main__ - Step 7805: {'lr': 0.0004981050762696604, 'samples': 1498560, 'steps': 7804, 'loss/train': 0.8266537189483643} 11/06/2021 22:20:19 - INFO - __main__ - Step 7806: {'lr': 0.0004981044240682544, 'samples': 1498752, 'steps': 7805, 'loss/train': 1.2602494955062866} 11/06/2021 22:20:19 - INFO - __main__ - Step 7807: {'lr': 0.0004981037717550564, 'samples': 1498944, 'steps': 7806, 'loss/train': 1.8031574487686157} 11/06/2021 22:20:20 - INFO - __main__ - Step 7808: {'lr': 0.0004981031193300667, 'samples': 1499136, 'steps': 7807, 'loss/train': 1.0529712438583374} 11/06/2021 22:20:21 - INFO - __main__ - Step 7809: {'lr': 0.0004981024667932855, 'samples': 1499328, 'steps': 7808, 'loss/train': 2.0406086444854736} 11/06/2021 22:20:21 - INFO - __main__ - Step 7810: {'lr': 0.0004981018141447133, 'samples': 1499520, 'steps': 7809, 'loss/train': 2.1355226039886475} 11/06/2021 22:20:21 - INFO - __main__ - Step 7811: {'lr': 0.00049810116138435, 'samples': 1499712, 'steps': 7810, 'loss/train': 1.709943413734436} 11/06/2021 22:20:22 - INFO - __main__ - Step 7812: {'lr': 0.0004981005085121963, 'samples': 1499904, 'steps': 7811, 'loss/train': 2.1915555000305176} 11/06/2021 22:20:23 - INFO - __main__ - Step 7813: {'lr': 0.0004980998555282524, 'samples': 1500096, 'steps': 7812, 'loss/train': 1.9955559968948364} 11/06/2021 22:20:23 - INFO - __main__ - Step 7814: {'lr': 0.0004980992024325185, 'samples': 1500288, 'steps': 7813, 'loss/train': 1.8444761037826538} 11/06/2021 22:20:24 - INFO - __main__ - Step 7815: {'lr': 0.0004980985492249949, 'samples': 1500480, 'steps': 7814, 'loss/train': 1.2955152988433838} 11/06/2021 22:20:24 - INFO - __main__ - Step 7816: {'lr': 0.0004980978959056819, 'samples': 1500672, 'steps': 7815, 'loss/train': 2.804755926132202} 11/06/2021 22:20:24 - INFO - __main__ - Step 7817: {'lr': 0.0004980972424745798, 'samples': 1500864, 'steps': 7816, 'loss/train': 2.022976875305176} 11/06/2021 22:20:26 - INFO - __main__ - Step 7818: {'lr': 0.000498096588931689, 'samples': 1501056, 'steps': 7817, 'loss/train': 1.9922711849212646} 11/06/2021 22:20:26 - INFO - __main__ - Step 7819: {'lr': 0.0004980959352770095, 'samples': 1501248, 'steps': 7818, 'loss/train': 2.2206287384033203} 11/06/2021 22:20:27 - INFO - __main__ - Step 7820: {'lr': 0.000498095281510542, 'samples': 1501440, 'steps': 7819, 'loss/train': 2.107832908630371} 11/06/2021 22:20:27 - INFO - __main__ - Step 7821: {'lr': 0.0004980946276322866, 'samples': 1501632, 'steps': 7820, 'loss/train': 2.020620822906494} 11/06/2021 22:20:27 - INFO - __main__ - Step 7822: {'lr': 0.0004980939736422436, 'samples': 1501824, 'steps': 7821, 'loss/train': 1.574951410293579} 11/06/2021 22:20:28 - INFO - __main__ - Step 7823: {'lr': 0.0004980933195404131, 'samples': 1502016, 'steps': 7822, 'loss/train': 1.5748287439346313} 11/06/2021 22:20:29 - INFO - __main__ - Step 7824: {'lr': 0.0004980926653267957, 'samples': 1502208, 'steps': 7823, 'loss/train': 2.000244617462158} 11/06/2021 22:20:29 - INFO - __main__ - Step 7825: {'lr': 0.0004980920110013915, 'samples': 1502400, 'steps': 7824, 'loss/train': 1.830479383468628} 11/06/2021 22:20:29 - INFO - __main__ - Step 7826: {'lr': 0.000498091356564201, 'samples': 1502592, 'steps': 7825, 'loss/train': 2.059532880783081} 11/06/2021 22:20:30 - INFO - __main__ - Step 7827: {'lr': 0.0004980907020152242, 'samples': 1502784, 'steps': 7826, 'loss/train': 2.0572545528411865} 11/06/2021 22:20:30 - INFO - __main__ - Step 7828: {'lr': 0.0004980900473544617, 'samples': 1502976, 'steps': 7827, 'loss/train': 1.7238388061523438} 11/06/2021 22:20:31 - INFO - __main__ - Step 7829: {'lr': 0.0004980893925819137, 'samples': 1503168, 'steps': 7828, 'loss/train': 1.9951441287994385} 11/06/2021 22:20:31 - INFO - __main__ - Step 7830: {'lr': 0.0004980887376975804, 'samples': 1503360, 'steps': 7829, 'loss/train': 1.7326819896697998} 11/06/2021 22:20:32 - INFO - __main__ - Step 7831: {'lr': 0.000498088082701462, 'samples': 1503552, 'steps': 7830, 'loss/train': 2.2184314727783203} 11/06/2021 22:20:32 - INFO - __main__ - Step 7832: {'lr': 0.0004980874275935591, 'samples': 1503744, 'steps': 7831, 'loss/train': 0.8481667041778564} 11/06/2021 22:20:32 - INFO - __main__ - Step 7833: {'lr': 0.0004980867723738717, 'samples': 1503936, 'steps': 7832, 'loss/train': 1.9760394096374512} 11/06/2021 22:20:34 - INFO - __main__ - Step 7834: {'lr': 0.0004980861170424003, 'samples': 1504128, 'steps': 7833, 'loss/train': 2.106902837753296} 11/06/2021 22:20:34 - INFO - __main__ - Step 7835: {'lr': 0.0004980854615991452, 'samples': 1504320, 'steps': 7834, 'loss/train': 2.3520045280456543} 11/06/2021 22:20:34 - INFO - __main__ - Step 7836: {'lr': 0.0004980848060441064, 'samples': 1504512, 'steps': 7835, 'loss/train': 2.106365442276001} 11/06/2021 22:20:35 - INFO - __main__ - Step 7837: {'lr': 0.0004980841503772846, 'samples': 1504704, 'steps': 7836, 'loss/train': 1.764209270477295} 11/06/2021 22:20:35 - INFO - __main__ - Step 7838: {'lr': 0.0004980834945986799, 'samples': 1504896, 'steps': 7837, 'loss/train': 2.0532870292663574} 11/06/2021 22:20:37 - INFO - __main__ - Step 7839: {'lr': 0.0004980828387082925, 'samples': 1505088, 'steps': 7838, 'loss/train': 2.207231283187866} 11/06/2021 22:20:37 - INFO - __main__ - Step 7840: {'lr': 0.000498082182706123, 'samples': 1505280, 'steps': 7839, 'loss/train': 0.6530503034591675} 11/06/2021 22:20:37 - INFO - __main__ - Step 7841: {'lr': 0.0004980815265921713, 'samples': 1505472, 'steps': 7840, 'loss/train': 0.6378718614578247} 11/06/2021 22:20:38 - INFO - __main__ - Step 7842: {'lr': 0.000498080870366438, 'samples': 1505664, 'steps': 7841, 'loss/train': 1.529860258102417} 11/06/2021 22:20:38 - INFO - __main__ - Step 7843: {'lr': 0.0004980802140289232, 'samples': 1505856, 'steps': 7842, 'loss/train': 1.2913402318954468} 11/06/2021 22:20:39 - INFO - __main__ - Step 7844: {'lr': 0.0004980795575796273, 'samples': 1506048, 'steps': 7843, 'loss/train': 1.243632435798645} 11/06/2021 22:20:39 - INFO - __main__ - Step 7845: {'lr': 0.0004980789010185507, 'samples': 1506240, 'steps': 7844, 'loss/train': 1.593994379043579} 11/06/2021 22:20:40 - INFO - __main__ - Step 7846: {'lr': 0.0004980782443456935, 'samples': 1506432, 'steps': 7845, 'loss/train': 1.4024707078933716} 11/06/2021 22:20:40 - INFO - __main__ - Step 7847: {'lr': 0.000498077587561056, 'samples': 1506624, 'steps': 7846, 'loss/train': 1.6562167406082153} 11/06/2021 22:20:40 - INFO - __main__ - Step 7848: {'lr': 0.0004980769306646386, 'samples': 1506816, 'steps': 7847, 'loss/train': 1.7106425762176514} 11/06/2021 22:20:41 - INFO - __main__ - Step 7849: {'lr': 0.0004980762736564417, 'samples': 1507008, 'steps': 7848, 'loss/train': 2.013291358947754} 11/06/2021 22:20:42 - INFO - __main__ - Step 7850: {'lr': 0.0004980756165364653, 'samples': 1507200, 'steps': 7849, 'loss/train': 2.613538980484009} 11/06/2021 22:20:42 - INFO - __main__ - Step 7851: {'lr': 0.0004980749593047099, 'samples': 1507392, 'steps': 7850, 'loss/train': 1.5560020208358765} 11/06/2021 22:20:43 - INFO - __main__ - Step 7852: {'lr': 0.0004980743019611757, 'samples': 1507584, 'steps': 7851, 'loss/train': 1.659212589263916} 11/06/2021 22:20:43 - INFO - __main__ - Step 7853: {'lr': 0.0004980736445058631, 'samples': 1507776, 'steps': 7852, 'loss/train': 1.3701859712600708} 11/06/2021 22:20:43 - INFO - __main__ - Step 7854: {'lr': 0.0004980729869387724, 'samples': 1507968, 'steps': 7853, 'loss/train': 1.2803887128829956} 11/06/2021 22:20:44 - INFO - __main__ - Step 7855: {'lr': 0.0004980723292599037, 'samples': 1508160, 'steps': 7854, 'loss/train': 1.8908063173294067} 11/06/2021 22:20:45 - INFO - __main__ - Step 7856: {'lr': 0.0004980716714692576, 'samples': 1508352, 'steps': 7855, 'loss/train': 1.353319764137268} 11/06/2021 22:20:45 - INFO - __main__ - Step 7857: {'lr': 0.0004980710135668342, 'samples': 1508544, 'steps': 7856, 'loss/train': 1.9040857553482056} 11/06/2021 22:20:45 - INFO - __main__ - Step 7858: {'lr': 0.0004980703555526338, 'samples': 1508736, 'steps': 7857, 'loss/train': 1.8307812213897705} 11/06/2021 22:20:46 - INFO - __main__ - Step 7859: {'lr': 0.0004980696974266566, 'samples': 1508928, 'steps': 7858, 'loss/train': 1.9307920932769775} 11/06/2021 22:20:47 - INFO - __main__ - Step 7860: {'lr': 0.0004980690391889033, 'samples': 1509120, 'steps': 7859, 'loss/train': 1.5426441431045532} 11/06/2021 22:20:47 - INFO - __main__ - Step 7861: {'lr': 0.0004980683808393737, 'samples': 1509312, 'steps': 7860, 'loss/train': 2.0717623233795166} 11/06/2021 22:20:48 - INFO - __main__ - Step 7862: {'lr': 0.0004980677223780683, 'samples': 1509504, 'steps': 7861, 'loss/train': 1.8097648620605469} 11/06/2021 22:20:48 - INFO - __main__ - Step 7863: {'lr': 0.0004980670638049875, 'samples': 1509696, 'steps': 7862, 'loss/train': 2.1986732482910156} 11/06/2021 22:20:48 - INFO - __main__ - Step 7864: {'lr': 0.0004980664051201315, 'samples': 1509888, 'steps': 7863, 'loss/train': 1.8978809118270874} 11/06/2021 22:20:49 - INFO - __main__ - Step 7865: {'lr': 0.0004980657463235006, 'samples': 1510080, 'steps': 7864, 'loss/train': 2.399979829788208} 11/06/2021 22:20:50 - INFO - __main__ - Step 7866: {'lr': 0.0004980650874150951, 'samples': 1510272, 'steps': 7865, 'loss/train': 2.042692184448242} 11/06/2021 22:20:50 - INFO - __main__ - Step 7867: {'lr': 0.0004980644283949152, 'samples': 1510464, 'steps': 7866, 'loss/train': 1.8365942239761353} 11/06/2021 22:20:50 - INFO - __main__ - Step 7868: {'lr': 0.0004980637692629615, 'samples': 1510656, 'steps': 7867, 'loss/train': 1.6458255052566528} 11/06/2021 22:20:51 - INFO - __main__ - Step 7869: {'lr': 0.0004980631100192339, 'samples': 1510848, 'steps': 7868, 'loss/train': 1.41680908203125} 11/06/2021 22:20:52 - INFO - __main__ - Step 7870: {'lr': 0.000498062450663733, 'samples': 1511040, 'steps': 7869, 'loss/train': 1.9412450790405273} 11/06/2021 22:20:53 - INFO - __main__ - Step 7871: {'lr': 0.000498061791196459, 'samples': 1511232, 'steps': 7870, 'loss/train': 1.4640387296676636} 11/06/2021 22:20:53 - INFO - __main__ - Step 7872: {'lr': 0.0004980611316174122, 'samples': 1511424, 'steps': 7871, 'loss/train': 1.4650890827178955} 11/06/2021 22:20:53 - INFO - __main__ - Step 7873: {'lr': 0.0004980604719265928, 'samples': 1511616, 'steps': 7872, 'loss/train': 1.9281331300735474} 11/06/2021 22:20:54 - INFO - __main__ - Step 7874: {'lr': 0.0004980598121240012, 'samples': 1511808, 'steps': 7873, 'loss/train': 1.7270317077636719} 11/06/2021 22:20:54 - INFO - __main__ - Step 7875: {'lr': 0.0004980591522096377, 'samples': 1512000, 'steps': 7874, 'loss/train': 3.885760545730591} 11/06/2021 22:20:55 - INFO - __main__ - Step 7876: {'lr': 0.0004980584921835025, 'samples': 1512192, 'steps': 7875, 'loss/train': 1.878515362739563} 11/06/2021 22:20:56 - INFO - __main__ - Step 7877: {'lr': 0.000498057832045596, 'samples': 1512384, 'steps': 7876, 'loss/train': 2.0955753326416016} 11/06/2021 22:20:56 - INFO - __main__ - Step 7878: {'lr': 0.0004980571717959186, 'samples': 1512576, 'steps': 7877, 'loss/train': 1.7663307189941406} 11/06/2021 22:20:56 - INFO - __main__ - Step 7879: {'lr': 0.0004980565114344704, 'samples': 1512768, 'steps': 7878, 'loss/train': 2.003199577331543} 11/06/2021 22:20:57 - INFO - __main__ - Step 7880: {'lr': 0.0004980558509612516, 'samples': 1512960, 'steps': 7879, 'loss/train': 1.4468077421188354} 11/06/2021 22:20:58 - INFO - __main__ - Step 7881: {'lr': 0.0004980551903762629, 'samples': 1513152, 'steps': 7880, 'loss/train': 1.9286073446273804} 11/06/2021 22:20:58 - INFO - __main__ - Step 7882: {'lr': 0.0004980545296795043, 'samples': 1513344, 'steps': 7881, 'loss/train': 1.9439208507537842} 11/06/2021 22:20:58 - INFO - __main__ - Step 7883: {'lr': 0.0004980538688709761, 'samples': 1513536, 'steps': 7882, 'loss/train': 1.9707108736038208} 11/06/2021 22:20:59 - INFO - __main__ - Step 7884: {'lr': 0.0004980532079506786, 'samples': 1513728, 'steps': 7883, 'loss/train': 2.4057676792144775} 11/06/2021 22:20:59 - INFO - __main__ - Step 7885: {'lr': 0.0004980525469186122, 'samples': 1513920, 'steps': 7884, 'loss/train': 2.0141124725341797} 11/06/2021 22:20:59 - INFO - __main__ - Step 7886: {'lr': 0.0004980518857747772, 'samples': 1514112, 'steps': 7885, 'loss/train': 1.6964260339736938} 11/06/2021 22:21:00 - INFO - __main__ - Step 7887: {'lr': 0.0004980512245191738, 'samples': 1514304, 'steps': 7886, 'loss/train': 2.2796289920806885} 11/06/2021 22:21:01 - INFO - __main__ - Step 7888: {'lr': 0.0004980505631518023, 'samples': 1514496, 'steps': 7887, 'loss/train': 1.9760662317276} 11/06/2021 22:21:01 - INFO - __main__ - Step 7889: {'lr': 0.0004980499016726632, 'samples': 1514688, 'steps': 7888, 'loss/train': 1.9461336135864258} 11/06/2021 22:21:01 - INFO - __main__ - Step 7890: {'lr': 0.0004980492400817564, 'samples': 1514880, 'steps': 7889, 'loss/train': 1.913292407989502} 11/06/2021 22:21:02 - INFO - __main__ - Step 7891: {'lr': 0.0004980485783790827, 'samples': 1515072, 'steps': 7890, 'loss/train': 1.8750466108322144} 11/06/2021 22:21:03 - INFO - __main__ - Step 7892: {'lr': 0.0004980479165646419, 'samples': 1515264, 'steps': 7891, 'loss/train': 2.3821234703063965} 11/06/2021 22:21:04 - INFO - __main__ - Step 7893: {'lr': 0.0004980472546384347, 'samples': 1515456, 'steps': 7892, 'loss/train': 2.1186270713806152} 11/06/2021 22:21:04 - INFO - __main__ - Step 7894: {'lr': 0.0004980465926004613, 'samples': 1515648, 'steps': 7893, 'loss/train': 5.3973493576049805} 11/06/2021 22:21:04 - INFO - __main__ - Step 7895: {'lr': 0.0004980459304507218, 'samples': 1515840, 'steps': 7894, 'loss/train': 2.0702314376831055} 11/06/2021 22:21:05 - INFO - __main__ - Step 7896: {'lr': 0.0004980452681892166, 'samples': 1516032, 'steps': 7895, 'loss/train': 2.7855825424194336} 11/06/2021 22:21:05 - INFO - __main__ - Step 7897: {'lr': 0.0004980446058159461, 'samples': 1516224, 'steps': 7896, 'loss/train': 2.5757217407226562} 11/06/2021 22:21:06 - INFO - __main__ - Step 7898: {'lr': 0.0004980439433309106, 'samples': 1516416, 'steps': 7897, 'loss/train': 1.7188911437988281} 11/06/2021 22:21:06 - INFO - __main__ - Step 7899: {'lr': 0.0004980432807341102, 'samples': 1516608, 'steps': 7898, 'loss/train': 1.881008267402649} 11/06/2021 22:21:07 - INFO - __main__ - Step 7900: {'lr': 0.0004980426180255453, 'samples': 1516800, 'steps': 7899, 'loss/train': 2.025660514831543} 11/06/2021 22:21:07 - INFO - __main__ - Step 7901: {'lr': 0.0004980419552052163, 'samples': 1516992, 'steps': 7900, 'loss/train': 1.8244693279266357} 11/06/2021 22:21:07 - INFO - __main__ - Step 7902: {'lr': 0.0004980412922731234, 'samples': 1517184, 'steps': 7901, 'loss/train': 2.084381103515625} 11/06/2021 22:21:09 - INFO - __main__ - Step 7903: {'lr': 0.0004980406292292669, 'samples': 1517376, 'steps': 7902, 'loss/train': 1.546284556388855} 11/06/2021 22:21:09 - INFO - __main__ - Step 7904: {'lr': 0.0004980399660736472, 'samples': 1517568, 'steps': 7903, 'loss/train': 1.5335673093795776} 11/06/2021 22:21:09 - INFO - __main__ - Step 7905: {'lr': 0.0004980393028062646, 'samples': 1517760, 'steps': 7904, 'loss/train': 2.310431480407715} 11/06/2021 22:21:10 - INFO - __main__ - Step 7906: {'lr': 0.0004980386394271191, 'samples': 1517952, 'steps': 7905, 'loss/train': 1.9592478275299072} 11/06/2021 22:21:10 - INFO - __main__ - Step 7907: {'lr': 0.0004980379759362113, 'samples': 1518144, 'steps': 7906, 'loss/train': 2.000092029571533} 11/06/2021 22:21:11 - INFO - __main__ - Step 7908: {'lr': 0.0004980373123335414, 'samples': 1518336, 'steps': 7907, 'loss/train': 2.0210494995117188} 11/06/2021 22:21:11 - INFO - __main__ - Step 7909: {'lr': 0.0004980366486191098, 'samples': 1518528, 'steps': 7908, 'loss/train': 1.597767949104309} 11/06/2021 22:21:12 - INFO - __main__ - Step 7910: {'lr': 0.0004980359847929167, 'samples': 1518720, 'steps': 7909, 'loss/train': 1.6900368928909302} 11/06/2021 22:21:12 - INFO - __main__ - Step 7911: {'lr': 0.0004980353208549623, 'samples': 1518912, 'steps': 7910, 'loss/train': 2.016561508178711} 11/06/2021 22:21:12 - INFO - __main__ - Step 7912: {'lr': 0.0004980346568052471, 'samples': 1519104, 'steps': 7911, 'loss/train': 1.868909478187561} 11/06/2021 22:21:14 - INFO - __main__ - Step 7913: {'lr': 0.0004980339926437713, 'samples': 1519296, 'steps': 7912, 'loss/train': 1.8896089792251587} 11/06/2021 22:21:14 - INFO - __main__ - Step 7914: {'lr': 0.0004980333283705351, 'samples': 1519488, 'steps': 7913, 'loss/train': 1.9207043647766113} 11/06/2021 22:21:14 - INFO - __main__ - Step 7915: {'lr': 0.000498032663985539, 'samples': 1519680, 'steps': 7914, 'loss/train': 1.3756333589553833} 11/06/2021 22:21:15 - INFO - __main__ - Step 7916: {'lr': 0.0004980319994887833, 'samples': 1519872, 'steps': 7915, 'loss/train': 1.878954291343689} 11/06/2021 22:21:15 - INFO - __main__ - Step 7917: {'lr': 0.0004980313348802681, 'samples': 1520064, 'steps': 7916, 'loss/train': 2.100268602371216} 11/06/2021 22:21:15 - INFO - __main__ - Step 7918: {'lr': 0.0004980306701599938, 'samples': 1520256, 'steps': 7917, 'loss/train': 2.110107421875} 11/06/2021 22:21:17 - INFO - __main__ - Step 7919: {'lr': 0.0004980300053279607, 'samples': 1520448, 'steps': 7918, 'loss/train': 2.0678465366363525} 11/06/2021 22:21:17 - INFO - __main__ - Step 7920: {'lr': 0.0004980293403841693, 'samples': 1520640, 'steps': 7919, 'loss/train': 2.0250751972198486} 11/06/2021 22:21:17 - INFO - __main__ - Step 7921: {'lr': 0.0004980286753286195, 'samples': 1520832, 'steps': 7920, 'loss/train': 0.41942548751831055} 11/06/2021 22:21:18 - INFO - __main__ - Step 7922: {'lr': 0.0004980280101613119, 'samples': 1521024, 'steps': 7921, 'loss/train': 2.067695379257202} 11/06/2021 22:21:18 - INFO - __main__ - Step 7923: {'lr': 0.0004980273448822466, 'samples': 1521216, 'steps': 7922, 'loss/train': 1.7907764911651611} 11/06/2021 22:21:19 - INFO - __main__ - Step 7924: {'lr': 0.000498026679491424, 'samples': 1521408, 'steps': 7923, 'loss/train': 1.5925756692886353} 11/06/2021 22:21:20 - INFO - __main__ - Step 7925: {'lr': 0.0004980260139888445, 'samples': 1521600, 'steps': 7924, 'loss/train': 1.906895637512207} 11/06/2021 22:21:20 - INFO - __main__ - Step 7926: {'lr': 0.0004980253483745083, 'samples': 1521792, 'steps': 7925, 'loss/train': 1.7573232650756836} 11/06/2021 22:21:20 - INFO - __main__ - Step 7927: {'lr': 0.0004980246826484157, 'samples': 1521984, 'steps': 7926, 'loss/train': 1.8057146072387695} 11/06/2021 22:21:21 - INFO - __main__ - Step 7928: {'lr': 0.000498024016810567, 'samples': 1522176, 'steps': 7927, 'loss/train': 0.6858081221580505} 11/06/2021 22:21:22 - INFO - __main__ - Step 7929: {'lr': 0.0004980233508609625, 'samples': 1522368, 'steps': 7928, 'loss/train': 1.501013994216919} 11/06/2021 22:21:22 - INFO - __main__ - Step 7930: {'lr': 0.0004980226847996025, 'samples': 1522560, 'steps': 7929, 'loss/train': 1.761657476425171} 11/06/2021 22:21:23 - INFO - __main__ - Step 7931: {'lr': 0.0004980220186264874, 'samples': 1522752, 'steps': 7930, 'loss/train': 1.874160885810852} 11/06/2021 22:21:23 - INFO - __main__ - Step 7932: {'lr': 0.0004980213523416172, 'samples': 1522944, 'steps': 7931, 'loss/train': 2.012180805206299} 11/06/2021 22:21:23 - INFO - __main__ - Step 7933: {'lr': 0.0004980206859449926, 'samples': 1523136, 'steps': 7932, 'loss/train': 2.158865213394165} 11/06/2021 22:21:24 - INFO - __main__ - Step 7934: {'lr': 0.0004980200194366136, 'samples': 1523328, 'steps': 7933, 'loss/train': 1.6870510578155518} 11/06/2021 22:21:25 - INFO - __main__ - Step 7935: {'lr': 0.0004980193528164806, 'samples': 1523520, 'steps': 7934, 'loss/train': 1.3986356258392334} 11/06/2021 22:21:25 - INFO - __main__ - Step 7936: {'lr': 0.0004980186860845939, 'samples': 1523712, 'steps': 7935, 'loss/train': 2.1106855869293213} 11/06/2021 22:21:25 - INFO - __main__ - Step 7937: {'lr': 0.0004980180192409539, 'samples': 1523904, 'steps': 7936, 'loss/train': 1.40969979763031} 11/06/2021 22:21:26 - INFO - __main__ - Step 7938: {'lr': 0.0004980173522855608, 'samples': 1524096, 'steps': 7937, 'loss/train': 1.793459415435791} 11/06/2021 22:21:26 - INFO - __main__ - Step 7939: {'lr': 0.0004980166852184148, 'samples': 1524288, 'steps': 7938, 'loss/train': 1.5917954444885254} 11/06/2021 22:21:27 - INFO - __main__ - Step 7940: {'lr': 0.0004980160180395164, 'samples': 1524480, 'steps': 7939, 'loss/train': 1.0192344188690186} 11/06/2021 22:21:27 - INFO - __main__ - Step 7941: {'lr': 0.0004980153507488657, 'samples': 1524672, 'steps': 7940, 'loss/train': 1.7049601078033447} 11/06/2021 22:21:28 - INFO - __main__ - Step 7942: {'lr': 0.0004980146833464633, 'samples': 1524864, 'steps': 7941, 'loss/train': 1.757932424545288} 11/06/2021 22:21:28 - INFO - __main__ - Step 7943: {'lr': 0.0004980140158323092, 'samples': 1525056, 'steps': 7942, 'loss/train': 1.4568382501602173} 11/06/2021 22:21:29 - INFO - __main__ - Step 7944: {'lr': 0.0004980133482064038, 'samples': 1525248, 'steps': 7943, 'loss/train': 2.041997194290161} 11/06/2021 22:21:30 - INFO - __main__ - Step 7945: {'lr': 0.0004980126804687474, 'samples': 1525440, 'steps': 7944, 'loss/train': 1.5531878471374512} 11/06/2021 22:21:30 - INFO - __main__ - Step 7946: {'lr': 0.0004980120126193403, 'samples': 1525632, 'steps': 7945, 'loss/train': 1.8834867477416992} 11/06/2021 22:21:30 - INFO - __main__ - Step 7947: {'lr': 0.0004980113446581829, 'samples': 1525824, 'steps': 7946, 'loss/train': 1.9784860610961914} 11/06/2021 22:21:31 - INFO - __main__ - Step 7948: {'lr': 0.0004980106765852753, 'samples': 1526016, 'steps': 7947, 'loss/train': 1.549972653388977} 11/06/2021 22:21:31 - INFO - __main__ - Step 7949: {'lr': 0.0004980100084006181, 'samples': 1526208, 'steps': 7948, 'loss/train': 1.9199172258377075} 11/06/2021 22:21:32 - INFO - __main__ - Step 7950: {'lr': 0.0004980093401042113, 'samples': 1526400, 'steps': 7949, 'loss/train': 1.6837117671966553} 11/06/2021 22:21:32 - INFO - __main__ - Step 7951: {'lr': 0.0004980086716960552, 'samples': 1526592, 'steps': 7950, 'loss/train': 2.027440071105957} 11/06/2021 22:21:33 - INFO - __main__ - Step 7952: {'lr': 0.0004980080031761504, 'samples': 1526784, 'steps': 7951, 'loss/train': 1.549945592880249} 11/06/2021 22:21:33 - INFO - __main__ - Step 7953: {'lr': 0.000498007334544497, 'samples': 1526976, 'steps': 7952, 'loss/train': 2.5235583782196045} 11/06/2021 22:21:33 - INFO - __main__ - Step 7954: {'lr': 0.0004980066658010952, 'samples': 1527168, 'steps': 7953, 'loss/train': 1.8120335340499878} 11/06/2021 22:21:34 - INFO - __main__ - Step 7955: {'lr': 0.0004980059969459455, 'samples': 1527360, 'steps': 7954, 'loss/train': 1.7081891298294067} 11/06/2021 22:21:35 - INFO - __main__ - Step 7956: {'lr': 0.0004980053279790481, 'samples': 1527552, 'steps': 7955, 'loss/train': 1.0960886478424072} 11/06/2021 22:21:35 - INFO - __main__ - Step 7957: {'lr': 0.0004980046589004034, 'samples': 1527744, 'steps': 7956, 'loss/train': 1.416310429573059} 11/06/2021 22:21:35 - INFO - __main__ - Step 7958: {'lr': 0.0004980039897100115, 'samples': 1527936, 'steps': 7957, 'loss/train': 2.159583568572998} 11/06/2021 22:21:36 - INFO - __main__ - Step 7959: {'lr': 0.000498003320407873, 'samples': 1528128, 'steps': 7958, 'loss/train': 2.172380208969116} 11/06/2021 22:21:36 - INFO - __main__ - Step 7960: {'lr': 0.000498002650993988, 'samples': 1528320, 'steps': 7959, 'loss/train': 1.913335919380188} 11/06/2021 22:21:37 - INFO - __main__ - Step 7961: {'lr': 0.0004980019814683568, 'samples': 1528512, 'steps': 7960, 'loss/train': 1.8780403137207031} 11/06/2021 22:21:38 - INFO - __main__ - Step 7962: {'lr': 0.0004980013118309796, 'samples': 1528704, 'steps': 7961, 'loss/train': 1.8526026010513306} 11/06/2021 22:21:38 - INFO - __main__ - Step 7963: {'lr': 0.000498000642081857, 'samples': 1528896, 'steps': 7962, 'loss/train': 1.8182376623153687} 11/06/2021 22:21:38 - INFO - __main__ - Step 7964: {'lr': 0.0004979999722209891, 'samples': 1529088, 'steps': 7963, 'loss/train': 1.9147931337356567} 11/06/2021 22:21:39 - INFO - __main__ - Step 7965: {'lr': 0.0004979993022483762, 'samples': 1529280, 'steps': 7964, 'loss/train': 1.8008043766021729} 11/06/2021 22:21:40 - INFO - __main__ - Step 7966: {'lr': 0.0004979986321640187, 'samples': 1529472, 'steps': 7965, 'loss/train': 1.427006483078003} 11/06/2021 22:21:40 - INFO - __main__ - Step 7967: {'lr': 0.0004979979619679168, 'samples': 1529664, 'steps': 7966, 'loss/train': 1.8842092752456665} 11/06/2021 22:21:41 - INFO - __main__ - Step 7968: {'lr': 0.0004979972916600708, 'samples': 1529856, 'steps': 7967, 'loss/train': 2.3407888412475586} 11/06/2021 22:21:41 - INFO - __main__ - Step 7969: {'lr': 0.0004979966212404812, 'samples': 1530048, 'steps': 7968, 'loss/train': 1.6167200803756714} 11/06/2021 22:21:41 - INFO - __main__ - Step 7970: {'lr': 0.0004979959507091479, 'samples': 1530240, 'steps': 7969, 'loss/train': 1.8462982177734375} 11/06/2021 22:21:42 - INFO - __main__ - Step 7971: {'lr': 0.0004979952800660717, 'samples': 1530432, 'steps': 7970, 'loss/train': 2.2095303535461426} 11/06/2021 22:21:43 - INFO - __main__ - Step 7972: {'lr': 0.0004979946093112525, 'samples': 1530624, 'steps': 7971, 'loss/train': 2.2119836807250977} 11/06/2021 22:21:44 - INFO - __main__ - Step 7973: {'lr': 0.0004979939384446908, 'samples': 1530816, 'steps': 7972, 'loss/train': 1.7757575511932373} 11/06/2021 22:21:44 - INFO - __main__ - Step 7974: {'lr': 0.0004979932674663869, 'samples': 1531008, 'steps': 7973, 'loss/train': 1.04556405544281} 11/06/2021 22:21:44 - INFO - __main__ - Step 7975: {'lr': 0.000497992596376341, 'samples': 1531200, 'steps': 7974, 'loss/train': 1.6081430912017822} 11/06/2021 22:21:45 - INFO - __main__ - Step 7976: {'lr': 0.0004979919251745535, 'samples': 1531392, 'steps': 7975, 'loss/train': 2.162876844406128} 11/06/2021 22:21:46 - INFO - __main__ - Step 7977: {'lr': 0.0004979912538610247, 'samples': 1531584, 'steps': 7976, 'loss/train': 1.4090156555175781} 11/06/2021 22:21:46 - INFO - __main__ - Step 7978: {'lr': 0.0004979905824357548, 'samples': 1531776, 'steps': 7977, 'loss/train': 1.850123405456543} 11/06/2021 22:21:46 - INFO - __main__ - Step 7979: {'lr': 0.0004979899108987442, 'samples': 1531968, 'steps': 7978, 'loss/train': 2.130842685699463} 11/06/2021 22:21:47 - INFO - __main__ - Step 7980: {'lr': 0.0004979892392499932, 'samples': 1532160, 'steps': 7979, 'loss/train': 1.864031434059143} 11/06/2021 22:21:47 - INFO - __main__ - Step 7981: {'lr': 0.0004979885674895021, 'samples': 1532352, 'steps': 7980, 'loss/train': 1.6603336334228516} 11/06/2021 22:21:48 - INFO - __main__ - Step 7982: {'lr': 0.0004979878956172711, 'samples': 1532544, 'steps': 7981, 'loss/train': 1.4955323934555054} 11/06/2021 22:21:48 - INFO - __main__ - Step 7983: {'lr': 0.0004979872236333005, 'samples': 1532736, 'steps': 7982, 'loss/train': 2.017420530319214} 11/06/2021 22:21:49 - INFO - __main__ - Step 7984: {'lr': 0.0004979865515375908, 'samples': 1532928, 'steps': 7983, 'loss/train': 1.8274989128112793} 11/06/2021 22:21:49 - INFO - __main__ - Step 7985: {'lr': 0.0004979858793301422, 'samples': 1533120, 'steps': 7984, 'loss/train': 1.6632914543151855} 11/06/2021 22:21:49 - INFO - __main__ - Step 7986: {'lr': 0.000497985207010955, 'samples': 1533312, 'steps': 7985, 'loss/train': 1.2843949794769287} 11/06/2021 22:21:50 - INFO - __main__ - Step 7987: {'lr': 0.0004979845345800294, 'samples': 1533504, 'steps': 7986, 'loss/train': 1.5527995824813843} 11/06/2021 22:21:51 - INFO - __main__ - Step 7988: {'lr': 0.0004979838620373659, 'samples': 1533696, 'steps': 7987, 'loss/train': 1.7927266359329224} 11/06/2021 22:21:52 - INFO - __main__ - Step 7989: {'lr': 0.0004979831893829646, 'samples': 1533888, 'steps': 7988, 'loss/train': 0.755244791507721} 11/06/2021 22:21:52 - INFO - __main__ - Step 7990: {'lr': 0.0004979825166168259, 'samples': 1534080, 'steps': 7989, 'loss/train': 1.9054588079452515} 11/06/2021 22:21:52 - INFO - __main__ - Step 7991: {'lr': 0.0004979818437389502, 'samples': 1534272, 'steps': 7990, 'loss/train': 1.586555004119873} 11/06/2021 22:21:53 - INFO - __main__ - Step 7992: {'lr': 0.0004979811707493377, 'samples': 1534464, 'steps': 7991, 'loss/train': 2.3696506023406982} 11/06/2021 22:21:53 - INFO - __main__ - Step 7993: {'lr': 0.0004979804976479887, 'samples': 1534656, 'steps': 7992, 'loss/train': 2.9696648120880127} 11/06/2021 22:21:54 - INFO - __main__ - Step 7994: {'lr': 0.0004979798244349034, 'samples': 1534848, 'steps': 7993, 'loss/train': 0.5555250644683838} 11/06/2021 22:21:54 - INFO - __main__ - Step 7995: {'lr': 0.0004979791511100823, 'samples': 1535040, 'steps': 7994, 'loss/train': 2.1885805130004883} 11/06/2021 22:21:55 - INFO - __main__ - Step 7996: {'lr': 0.0004979784776735257, 'samples': 1535232, 'steps': 7995, 'loss/train': 2.107032060623169} 11/06/2021 22:21:55 - INFO - __main__ - Step 7997: {'lr': 0.0004979778041252338, 'samples': 1535424, 'steps': 7996, 'loss/train': 1.827415943145752} 11/06/2021 22:21:55 - INFO - __main__ - Step 7998: {'lr': 0.0004979771304652068, 'samples': 1535616, 'steps': 7997, 'loss/train': 1.842453122138977} 11/06/2021 22:21:56 - INFO - __main__ - Step 7999: {'lr': 0.0004979764566934452, 'samples': 1535808, 'steps': 7998, 'loss/train': 1.6808557510375977} 11/06/2021 22:21:57 - INFO - __main__ - Step 8000: {'lr': 0.0004979757828099492, 'samples': 1536000, 'steps': 7999, 'loss/train': 1.729453206062317} 11/06/2021 22:21:57 - INFO - __main__ - Step 8001: {'lr': 0.0004979751088147192, 'samples': 1536192, 'steps': 8000, 'loss/train': 1.9405218362808228} 11/06/2021 22:21:58 - INFO - __main__ - Step 8002: {'lr': 0.0004979744347077555, 'samples': 1536384, 'steps': 8001, 'loss/train': 1.666454553604126} 11/06/2021 22:21:58 - INFO - __main__ - Step 8003: {'lr': 0.0004979737604890582, 'samples': 1536576, 'steps': 8002, 'loss/train': 2.0916402339935303} 11/06/2021 22:22:00 - INFO - __main__ - Step 8004: {'lr': 0.0004979730861586278, 'samples': 1536768, 'steps': 8003, 'loss/train': 2.936959743499756} 11/06/2021 22:22:00 - INFO - __main__ - Step 8005: {'lr': 0.0004979724117164646, 'samples': 1536960, 'steps': 8004, 'loss/train': 2.1846797466278076} 11/06/2021 22:22:00 - INFO - __main__ - Step 8006: {'lr': 0.0004979717371625689, 'samples': 1537152, 'steps': 8005, 'loss/train': 1.4431344270706177} 11/06/2021 22:22:01 - INFO - __main__ - Step 8007: {'lr': 0.0004979710624969408, 'samples': 1537344, 'steps': 8006, 'loss/train': 1.6920371055603027} 11/06/2021 22:22:01 - INFO - __main__ - Step 8008: {'lr': 0.000497970387719581, 'samples': 1537536, 'steps': 8007, 'loss/train': 1.7911888360977173} 11/06/2021 22:22:01 - INFO - __main__ - Step 8009: {'lr': 0.0004979697128304893, 'samples': 1537728, 'steps': 8008, 'loss/train': 2.065598249435425} 11/06/2021 22:22:02 - INFO - __main__ - Step 8010: {'lr': 0.0004979690378296665, 'samples': 1537920, 'steps': 8009, 'loss/train': 1.9170726537704468} 11/06/2021 22:22:02 - INFO - __main__ - Step 8011: {'lr': 0.0004979683627171125, 'samples': 1538112, 'steps': 8010, 'loss/train': 2.038038730621338} 11/06/2021 22:22:03 - INFO - __main__ - Step 8012: {'lr': 0.0004979676874928278, 'samples': 1538304, 'steps': 8011, 'loss/train': 1.8985741138458252} 11/06/2021 22:22:04 - INFO - __main__ - Step 8013: {'lr': 0.0004979670121568129, 'samples': 1538496, 'steps': 8012, 'loss/train': 2.169818878173828} 11/06/2021 22:22:04 - INFO - __main__ - Step 8014: {'lr': 0.0004979663367090676, 'samples': 1538688, 'steps': 8013, 'loss/train': 2.037712335586548} 11/06/2021 22:22:04 - INFO - __main__ - Step 8015: {'lr': 0.0004979656611495927, 'samples': 1538880, 'steps': 8014, 'loss/train': 2.2237470149993896} 11/06/2021 22:22:05 - INFO - __main__ - Step 8016: {'lr': 0.0004979649854783883, 'samples': 1539072, 'steps': 8015, 'loss/train': 1.822688341140747} 11/06/2021 22:22:06 - INFO - __main__ - Step 8017: {'lr': 0.0004979643096954545, 'samples': 1539264, 'steps': 8016, 'loss/train': 2.502737283706665} 11/06/2021 22:22:06 - INFO - __main__ - Step 8018: {'lr': 0.000497963633800792, 'samples': 1539456, 'steps': 8017, 'loss/train': 1.937386393547058} 11/06/2021 22:22:06 - INFO - __main__ - Step 8019: {'lr': 0.0004979629577944009, 'samples': 1539648, 'steps': 8018, 'loss/train': 1.8304017782211304} 11/06/2021 22:22:07 - INFO - __main__ - Step 8020: {'lr': 0.0004979622816762815, 'samples': 1539840, 'steps': 8019, 'loss/train': 2.0212595462799072} 11/06/2021 22:22:07 - INFO - __main__ - Step 8021: {'lr': 0.0004979616054464341, 'samples': 1540032, 'steps': 8020, 'loss/train': 1.852665662765503} 11/06/2021 22:22:08 - INFO - __main__ - Step 8022: {'lr': 0.000497960929104859, 'samples': 1540224, 'steps': 8021, 'loss/train': 2.02396821975708} 11/06/2021 22:22:08 - INFO - __main__ - Step 8023: {'lr': 0.0004979602526515566, 'samples': 1540416, 'steps': 8022, 'loss/train': 2.089334726333618} 11/06/2021 22:22:09 - INFO - __main__ - Step 8024: {'lr': 0.0004979595760865271, 'samples': 1540608, 'steps': 8023, 'loss/train': 1.602588176727295} 11/06/2021 22:22:09 - INFO - __main__ - Step 8025: {'lr': 0.0004979588994097708, 'samples': 1540800, 'steps': 8024, 'loss/train': 1.3973878622055054} 11/06/2021 22:22:09 - INFO - __main__ - Step 8026: {'lr': 0.0004979582226212881, 'samples': 1540992, 'steps': 8025, 'loss/train': 1.1899759769439697} 11/06/2021 22:22:10 - INFO - __main__ - Step 8027: {'lr': 0.0004979575457210792, 'samples': 1541184, 'steps': 8026, 'loss/train': 1.6932975053787231} 11/06/2021 22:22:11 - INFO - __main__ - Step 8028: {'lr': 0.0004979568687091446, 'samples': 1541376, 'steps': 8027, 'loss/train': 1.7348906993865967} 11/06/2021 22:22:11 - INFO - __main__ - Step 8029: {'lr': 0.0004979561915854843, 'samples': 1541568, 'steps': 8028, 'loss/train': 1.5938066244125366} 11/06/2021 22:22:12 - INFO - __main__ - Step 8030: {'lr': 0.0004979555143500988, 'samples': 1541760, 'steps': 8029, 'loss/train': 2.0716259479522705} 11/06/2021 22:22:12 - INFO - __main__ - Step 8031: {'lr': 0.0004979548370029884, 'samples': 1541952, 'steps': 8030, 'loss/train': 1.7080976963043213} 11/06/2021 22:22:12 - INFO - __main__ - Step 8032: {'lr': 0.0004979541595441534, 'samples': 1542144, 'steps': 8031, 'loss/train': 1.9131830930709839} 11/06/2021 22:22:13 - INFO - __main__ - Step 8033: {'lr': 0.000497953481973594, 'samples': 1542336, 'steps': 8032, 'loss/train': 1.6197700500488281} 11/06/2021 22:22:14 - INFO - __main__ - Step 8034: {'lr': 0.0004979528042913106, 'samples': 1542528, 'steps': 8033, 'loss/train': 1.7297489643096924} 11/06/2021 22:22:14 - INFO - __main__ - Step 8035: {'lr': 0.0004979521264973036, 'samples': 1542720, 'steps': 8034, 'loss/train': 2.004364490509033} 11/06/2021 22:22:14 - INFO - __main__ - Step 8036: {'lr': 0.0004979514485915731, 'samples': 1542912, 'steps': 8035, 'loss/train': 1.4310439825057983} 11/06/2021 22:22:15 - INFO - __main__ - Step 8037: {'lr': 0.0004979507705741195, 'samples': 1543104, 'steps': 8036, 'loss/train': 1.9910510778427124} 11/06/2021 22:22:16 - INFO - __main__ - Step 8038: {'lr': 0.0004979500924449431, 'samples': 1543296, 'steps': 8037, 'loss/train': 1.46047043800354} 11/06/2021 22:22:16 - INFO - __main__ - Step 8039: {'lr': 0.0004979494142040444, 'samples': 1543488, 'steps': 8038, 'loss/train': 1.9380624294281006} 11/06/2021 22:22:16 - INFO - __main__ - Step 8040: {'lr': 0.0004979487358514233, 'samples': 1543680, 'steps': 8039, 'loss/train': 1.9118831157684326} 11/06/2021 22:22:17 - INFO - __main__ - Step 8041: {'lr': 0.0004979480573870803, 'samples': 1543872, 'steps': 8040, 'loss/train': 1.2592707872390747} 11/06/2021 22:22:17 - INFO - __main__ - Step 8042: {'lr': 0.000497947378811016, 'samples': 1544064, 'steps': 8041, 'loss/train': 2.1490068435668945} 11/06/2021 22:22:18 - INFO - __main__ - Step 8043: {'lr': 0.0004979467001232302, 'samples': 1544256, 'steps': 8042, 'loss/train': 1.6307603120803833} 11/06/2021 22:22:18 - INFO - __main__ - Step 8044: {'lr': 0.0004979460213237235, 'samples': 1544448, 'steps': 8043, 'loss/train': 1.8183969259262085} 11/06/2021 22:22:19 - INFO - __main__ - Step 8045: {'lr': 0.0004979453424124961, 'samples': 1544640, 'steps': 8044, 'loss/train': 1.2424389123916626} 11/06/2021 22:22:19 - INFO - __main__ - Step 8046: {'lr': 0.0004979446633895484, 'samples': 1544832, 'steps': 8045, 'loss/train': 1.352750301361084} 11/06/2021 22:22:20 - INFO - __main__ - Step 8047: {'lr': 0.0004979439842548808, 'samples': 1545024, 'steps': 8046, 'loss/train': 1.8693174123764038} 11/06/2021 22:22:20 - INFO - __main__ - Step 8048: {'lr': 0.0004979433050084933, 'samples': 1545216, 'steps': 8047, 'loss/train': 1.2310196161270142} 11/06/2021 22:22:21 - INFO - __main__ - Step 8049: {'lr': 0.0004979426256503863, 'samples': 1545408, 'steps': 8048, 'loss/train': 2.2753522396087646} 11/06/2021 22:22:21 - INFO - __main__ - Step 8050: {'lr': 0.0004979419461805603, 'samples': 1545600, 'steps': 8049, 'loss/train': 2.1069955825805664} 11/06/2021 22:22:22 - INFO - __main__ - Step 8051: {'lr': 0.0004979412665990156, 'samples': 1545792, 'steps': 8050, 'loss/train': 1.705078125} 11/06/2021 22:22:22 - INFO - __main__ - Step 8052: {'lr': 0.0004979405869057522, 'samples': 1545984, 'steps': 8051, 'loss/train': 1.9316350221633911} 11/06/2021 22:22:23 - INFO - __main__ - Step 8053: {'lr': 0.0004979399071007707, 'samples': 1546176, 'steps': 8052, 'loss/train': 2.099993944168091} 11/06/2021 22:22:23 - INFO - __main__ - Step 8054: {'lr': 0.0004979392271840712, 'samples': 1546368, 'steps': 8053, 'loss/train': 1.937470555305481} 11/06/2021 22:22:24 - INFO - __main__ - Step 8055: {'lr': 0.0004979385471556542, 'samples': 1546560, 'steps': 8054, 'loss/train': 2.03237247467041} 11/06/2021 22:22:24 - INFO - __main__ - Step 8056: {'lr': 0.00049793786701552, 'samples': 1546752, 'steps': 8055, 'loss/train': 1.7011387348175049} 11/06/2021 22:22:24 - INFO - __main__ - Step 8057: {'lr': 0.0004979371867636687, 'samples': 1546944, 'steps': 8056, 'loss/train': 1.647082805633545} 11/06/2021 22:22:25 - INFO - __main__ - Step 8058: {'lr': 0.0004979365064001007, 'samples': 1547136, 'steps': 8057, 'loss/train': 1.8520019054412842} 11/06/2021 22:22:26 - INFO - __main__ - Step 8059: {'lr': 0.0004979358259248164, 'samples': 1547328, 'steps': 8058, 'loss/train': 1.324270486831665} 11/06/2021 22:22:26 - INFO - __main__ - Step 8060: {'lr': 0.000497935145337816, 'samples': 1547520, 'steps': 8059, 'loss/train': 1.6463035345077515} 11/06/2021 22:22:27 - INFO - __main__ - Step 8061: {'lr': 0.0004979344646390999, 'samples': 1547712, 'steps': 8060, 'loss/train': 2.0036861896514893} 11/06/2021 22:22:27 - INFO - __main__ - Step 8062: {'lr': 0.0004979337838286684, 'samples': 1547904, 'steps': 8061, 'loss/train': 1.1817820072174072} 11/06/2021 22:22:27 - INFO - __main__ - Step 8063: {'lr': 0.0004979331029065216, 'samples': 1548096, 'steps': 8062, 'loss/train': 1.7082180976867676} 11/06/2021 22:22:28 - INFO - __main__ - Step 8064: {'lr': 0.00049793242187266, 'samples': 1548288, 'steps': 8063, 'loss/train': 1.778818964958191} 11/06/2021 22:22:29 - INFO - __main__ - Step 8065: {'lr': 0.000497931740727084, 'samples': 1548480, 'steps': 8064, 'loss/train': 6.108870983123779} 11/06/2021 22:22:29 - INFO - __main__ - Step 8066: {'lr': 0.0004979310594697937, 'samples': 1548672, 'steps': 8065, 'loss/train': 2.9927406311035156} 11/06/2021 22:22:30 - INFO - __main__ - Step 8067: {'lr': 0.0004979303781007896, 'samples': 1548864, 'steps': 8066, 'loss/train': 1.902443528175354} 11/06/2021 22:22:30 - INFO - __main__ - Step 8068: {'lr': 0.0004979296966200718, 'samples': 1549056, 'steps': 8067, 'loss/train': 1.7086400985717773} 11/06/2021 22:22:30 - INFO - __main__ - Step 8069: {'lr': 0.0004979290150276407, 'samples': 1549248, 'steps': 8068, 'loss/train': 1.9971203804016113} 11/06/2021 22:22:31 - INFO - __main__ - Step 8070: {'lr': 0.0004979283333234966, 'samples': 1549440, 'steps': 8069, 'loss/train': 1.9657219648361206} 11/06/2021 22:22:32 - INFO - __main__ - Step 8071: {'lr': 0.0004979276515076399, 'samples': 1549632, 'steps': 8070, 'loss/train': 1.041121482849121} 11/06/2021 22:22:32 - INFO - __main__ - Step 8072: {'lr': 0.0004979269695800707, 'samples': 1549824, 'steps': 8071, 'loss/train': 2.119021415710449} 11/06/2021 22:22:32 - INFO - __main__ - Step 8073: {'lr': 0.0004979262875407896, 'samples': 1550016, 'steps': 8072, 'loss/train': 1.3283685445785522} 11/06/2021 22:22:33 - INFO - __main__ - Step 8074: {'lr': 0.0004979256053897966, 'samples': 1550208, 'steps': 8073, 'loss/train': 2.044589042663574} 11/06/2021 22:22:35 - INFO - __main__ - Step 8075: {'lr': 0.0004979249231270923, 'samples': 1550400, 'steps': 8074, 'loss/train': 1.9413076639175415} 11/06/2021 22:22:35 - INFO - __main__ - Step 8076: {'lr': 0.0004979242407526766, 'samples': 1550592, 'steps': 8075, 'loss/train': 1.5758004188537598} 11/06/2021 22:22:35 - INFO - __main__ - Step 8077: {'lr': 0.0004979235582665503, 'samples': 1550784, 'steps': 8076, 'loss/train': 2.4612207412719727} 11/06/2021 22:22:36 - INFO - __main__ - Step 8078: {'lr': 0.0004979228756687135, 'samples': 1550976, 'steps': 8077, 'loss/train': 2.0157461166381836} 11/06/2021 22:22:36 - INFO - __main__ - Step 8079: {'lr': 0.0004979221929591663, 'samples': 1551168, 'steps': 8078, 'loss/train': 2.025535821914673} 11/06/2021 22:22:36 - INFO - __main__ - Step 8080: {'lr': 0.0004979215101379093, 'samples': 1551360, 'steps': 8079, 'loss/train': 1.8989232778549194} 11/06/2021 22:22:37 - INFO - __main__ - Step 8081: {'lr': 0.0004979208272049426, 'samples': 1551552, 'steps': 8080, 'loss/train': 1.8116631507873535} 11/06/2021 22:22:37 - INFO - __main__ - Step 8082: {'lr': 0.0004979201441602665, 'samples': 1551744, 'steps': 8081, 'loss/train': 1.6327272653579712} 11/06/2021 22:22:38 - INFO - __main__ - Step 8083: {'lr': 0.0004979194610038816, 'samples': 1551936, 'steps': 8082, 'loss/train': 2.1452510356903076} 11/06/2021 22:22:39 - INFO - __main__ - Step 8084: {'lr': 0.000497918777735788, 'samples': 1552128, 'steps': 8083, 'loss/train': 1.8749728202819824} 11/06/2021 22:22:39 - INFO - __main__ - Step 8085: {'lr': 0.000497918094355986, 'samples': 1552320, 'steps': 8084, 'loss/train': 1.904146671295166} 11/06/2021 22:22:39 - INFO - __main__ - Step 8086: {'lr': 0.000497917410864476, 'samples': 1552512, 'steps': 8085, 'loss/train': 1.7037712335586548} 11/06/2021 22:22:40 - INFO - __main__ - Step 8087: {'lr': 0.0004979167272612581, 'samples': 1552704, 'steps': 8086, 'loss/train': 1.7576018571853638} 11/06/2021 22:22:41 - INFO - __main__ - Step 8088: {'lr': 0.0004979160435463328, 'samples': 1552896, 'steps': 8087, 'loss/train': 1.7359386682510376} 11/06/2021 22:22:41 - INFO - __main__ - Step 8089: {'lr': 0.0004979153597197003, 'samples': 1553088, 'steps': 8088, 'loss/train': 1.035143256187439} 11/06/2021 22:22:41 - INFO - __main__ - Step 8090: {'lr': 0.0004979146757813611, 'samples': 1553280, 'steps': 8089, 'loss/train': 1.9019252061843872} 11/06/2021 22:22:42 - INFO - __main__ - Step 8091: {'lr': 0.0004979139917313153, 'samples': 1553472, 'steps': 8090, 'loss/train': 1.6100707054138184} 11/06/2021 22:22:42 - INFO - __main__ - Step 8092: {'lr': 0.0004979133075695634, 'samples': 1553664, 'steps': 8091, 'loss/train': 1.4543089866638184} 11/06/2021 22:22:43 - INFO - __main__ - Step 8093: {'lr': 0.0004979126232961054, 'samples': 1553856, 'steps': 8092, 'loss/train': 1.6169660091400146} 11/06/2021 22:22:43 - INFO - __main__ - Step 8094: {'lr': 0.0004979119389109419, 'samples': 1554048, 'steps': 8093, 'loss/train': 1.693862795829773} 11/06/2021 22:22:44 - INFO - __main__ - Step 8095: {'lr': 0.000497911254414073, 'samples': 1554240, 'steps': 8094, 'loss/train': 1.6328054666519165} 11/06/2021 22:22:44 - INFO - __main__ - Step 8096: {'lr': 0.0004979105698054992, 'samples': 1554432, 'steps': 8095, 'loss/train': 2.0879011154174805} 11/06/2021 22:22:44 - INFO - __main__ - Step 8097: {'lr': 0.0004979098850852208, 'samples': 1554624, 'steps': 8096, 'loss/train': 1.8429865837097168} 11/06/2021 22:22:46 - INFO - __main__ - Step 8098: {'lr': 0.0004979092002532379, 'samples': 1554816, 'steps': 8097, 'loss/train': 2.1915647983551025} 11/06/2021 22:22:46 - INFO - __main__ - Step 8099: {'lr': 0.0004979085153095509, 'samples': 1555008, 'steps': 8098, 'loss/train': 2.256697177886963} 11/06/2021 22:22:46 - INFO - __main__ - Step 8100: {'lr': 0.0004979078302541604, 'samples': 1555200, 'steps': 8099, 'loss/train': 1.6514664888381958} 11/06/2021 22:22:47 - INFO - __main__ - Step 8101: {'lr': 0.0004979071450870662, 'samples': 1555392, 'steps': 8100, 'loss/train': 1.7830842733383179} 11/06/2021 22:22:47 - INFO - __main__ - Step 8102: {'lr': 0.0004979064598082689, 'samples': 1555584, 'steps': 8101, 'loss/train': 2.325873851776123} 11/06/2021 22:22:48 - INFO - __main__ - Step 8103: {'lr': 0.0004979057744177689, 'samples': 1555776, 'steps': 8102, 'loss/train': 1.6292482614517212} 11/06/2021 22:22:48 - INFO - __main__ - Step 8104: {'lr': 0.0004979050889155663, 'samples': 1555968, 'steps': 8103, 'loss/train': 1.9589861631393433} 11/06/2021 22:22:49 - INFO - __main__ - Step 8105: {'lr': 0.0004979044033016616, 'samples': 1556160, 'steps': 8104, 'loss/train': 1.5695750713348389} 11/06/2021 22:22:49 - INFO - __main__ - Step 8106: {'lr': 0.0004979037175760548, 'samples': 1556352, 'steps': 8105, 'loss/train': 2.045214891433716} 11/06/2021 22:22:49 - INFO - __main__ - Step 8107: {'lr': 0.0004979030317387466, 'samples': 1556544, 'steps': 8106, 'loss/train': 2.292895555496216} 11/06/2021 22:22:50 - INFO - __main__ - Step 8108: {'lr': 0.0004979023457897371, 'samples': 1556736, 'steps': 8107, 'loss/train': 2.0541205406188965} 11/06/2021 22:22:51 - INFO - __main__ - Step 8109: {'lr': 0.0004979016597290264, 'samples': 1556928, 'steps': 8108, 'loss/train': 2.090756893157959} 11/06/2021 22:22:51 - INFO - __main__ - Step 8110: {'lr': 0.0004979009735566152, 'samples': 1557120, 'steps': 8109, 'loss/train': 1.956437110900879} 11/06/2021 22:22:51 - INFO - __main__ - Step 8111: {'lr': 0.0004979002872725037, 'samples': 1557312, 'steps': 8110, 'loss/train': 0.9265539050102234} 11/06/2021 22:22:52 - INFO - __main__ - Step 8112: {'lr': 0.0004978996008766922, 'samples': 1557504, 'steps': 8111, 'loss/train': 1.8253365755081177} 11/06/2021 22:22:53 - INFO - __main__ - Step 8113: {'lr': 0.0004978989143691808, 'samples': 1557696, 'steps': 8112, 'loss/train': 2.0821549892425537} 11/06/2021 22:22:54 - INFO - __main__ - Step 8114: {'lr': 0.00049789822774997, 'samples': 1557888, 'steps': 8113, 'loss/train': 1.468117356300354} 11/06/2021 22:22:54 - INFO - __main__ - Step 8115: {'lr': 0.0004978975410190601, 'samples': 1558080, 'steps': 8114, 'loss/train': 2.1024975776672363} 11/06/2021 22:22:54 - INFO - __main__ - Step 8116: {'lr': 0.0004978968541764515, 'samples': 1558272, 'steps': 8115, 'loss/train': 2.9898436069488525} 11/06/2021 22:22:55 - INFO - __main__ - Step 8117: {'lr': 0.0004978961672221444, 'samples': 1558464, 'steps': 8116, 'loss/train': 2.3070497512817383} 11/06/2021 22:22:55 - INFO - __main__ - Step 8118: {'lr': 0.000497895480156139, 'samples': 1558656, 'steps': 8117, 'loss/train': 1.9509214162826538} 11/06/2021 22:22:56 - INFO - __main__ - Step 8119: {'lr': 0.0004978947929784358, 'samples': 1558848, 'steps': 8118, 'loss/train': 1.3840776681900024} 11/06/2021 22:22:57 - INFO - __main__ - Step 8120: {'lr': 0.0004978941056890349, 'samples': 1559040, 'steps': 8119, 'loss/train': 1.7440425157546997} 11/06/2021 22:22:57 - INFO - __main__ - Step 8121: {'lr': 0.0004978934182879369, 'samples': 1559232, 'steps': 8120, 'loss/train': 1.8195523023605347} 11/06/2021 22:22:57 - INFO - __main__ - Step 8122: {'lr': 0.0004978927307751419, 'samples': 1559424, 'steps': 8121, 'loss/train': 1.3453588485717773} 11/06/2021 22:22:58 - INFO - __main__ - Step 8123: {'lr': 0.0004978920431506501, 'samples': 1559616, 'steps': 8122, 'loss/train': 1.8378467559814453} 11/06/2021 22:22:58 - INFO - __main__ - Step 8124: {'lr': 0.0004978913554144623, 'samples': 1559808, 'steps': 8123, 'loss/train': 2.123873472213745} 11/06/2021 22:22:59 - INFO - __main__ - Step 8125: {'lr': 0.0004978906675665782, 'samples': 1560000, 'steps': 8124, 'loss/train': 2.085545063018799} 11/06/2021 22:22:59 - INFO - __main__ - Step 8126: {'lr': 0.0004978899796069985, 'samples': 1560192, 'steps': 8125, 'loss/train': 1.9374345541000366} 11/06/2021 22:23:00 - INFO - __main__ - Step 8127: {'lr': 0.0004978892915357234, 'samples': 1560384, 'steps': 8126, 'loss/train': 2.0188064575195312} 11/06/2021 22:23:00 - INFO - __main__ - Step 8128: {'lr': 0.0004978886033527532, 'samples': 1560576, 'steps': 8127, 'loss/train': 2.549909830093384} 11/06/2021 22:23:00 - INFO - __main__ - Step 8129: {'lr': 0.0004978879150580882, 'samples': 1560768, 'steps': 8128, 'loss/train': 1.8509951829910278} 11/06/2021 22:23:01 - INFO - __main__ - Step 8130: {'lr': 0.0004978872266517288, 'samples': 1560960, 'steps': 8129, 'loss/train': 1.873234748840332} 11/06/2021 22:23:02 - INFO - __main__ - Step 8131: {'lr': 0.0004978865381336752, 'samples': 1561152, 'steps': 8130, 'loss/train': 1.8826571702957153} 11/06/2021 22:23:03 - INFO - __main__ - Step 8132: {'lr': 0.0004978858495039277, 'samples': 1561344, 'steps': 8131, 'loss/train': 1.34197998046875} 11/06/2021 22:23:03 - INFO - __main__ - Step 8133: {'lr': 0.0004978851607624867, 'samples': 1561536, 'steps': 8132, 'loss/train': 3.694458484649658} 11/06/2021 22:23:03 - INFO - __main__ - Step 8134: {'lr': 0.0004978844719093525, 'samples': 1561728, 'steps': 8133, 'loss/train': 2.0985944271087646} 11/06/2021 22:23:04 - INFO - __main__ - Step 8135: {'lr': 0.0004978837829445254, 'samples': 1561920, 'steps': 8134, 'loss/train': 1.7875434160232544} 11/06/2021 22:23:04 - INFO - __main__ - Step 8136: {'lr': 0.0004978830938680056, 'samples': 1562112, 'steps': 8135, 'loss/train': 1.9818612337112427} 11/06/2021 22:23:05 - INFO - __main__ - Step 8137: {'lr': 0.0004978824046797935, 'samples': 1562304, 'steps': 8136, 'loss/train': 2.647218942642212} 11/06/2021 22:23:05 - INFO - __main__ - Step 8138: {'lr': 0.0004978817153798895, 'samples': 1562496, 'steps': 8137, 'loss/train': 1.852639079093933} 11/06/2021 22:23:06 - INFO - __main__ - Step 8139: {'lr': 0.0004978810259682939, 'samples': 1562688, 'steps': 8138, 'loss/train': 1.697789192199707} 11/06/2021 22:23:06 - INFO - __main__ - Step 8140: {'lr': 0.0004978803364450068, 'samples': 1562880, 'steps': 8139, 'loss/train': 1.768872618675232} 11/06/2021 22:23:06 - INFO - __main__ - Step 8141: {'lr': 0.0004978796468100286, 'samples': 1563072, 'steps': 8140, 'loss/train': 1.7271705865859985} 11/06/2021 22:23:07 - INFO - __main__ - Step 8142: {'lr': 0.0004978789570633598, 'samples': 1563264, 'steps': 8141, 'loss/train': 2.176313638687134} 11/06/2021 22:23:08 - INFO - __main__ - Step 8143: {'lr': 0.0004978782672050004, 'samples': 1563456, 'steps': 8142, 'loss/train': 2.180102586746216} 11/06/2021 22:23:08 - INFO - __main__ - Step 8144: {'lr': 0.000497877577234951, 'samples': 1563648, 'steps': 8143, 'loss/train': 2.109963893890381} 11/06/2021 22:23:08 - INFO - __main__ - Step 8145: {'lr': 0.0004978768871532117, 'samples': 1563840, 'steps': 8144, 'loss/train': 2.1114203929901123} 11/06/2021 22:23:09 - INFO - __main__ - Step 8146: {'lr': 0.0004978761969597831, 'samples': 1564032, 'steps': 8145, 'loss/train': 2.357698917388916} 11/06/2021 22:23:09 - INFO - __main__ - Step 8147: {'lr': 0.0004978755066546651, 'samples': 1564224, 'steps': 8146, 'loss/train': 1.9668238162994385} 11/06/2021 22:23:10 - INFO - __main__ - Step 8148: {'lr': 0.0004978748162378583, 'samples': 1564416, 'steps': 8147, 'loss/train': 1.837101936340332} 11/06/2021 22:23:11 - INFO - __main__ - Step 8149: {'lr': 0.0004978741257093629, 'samples': 1564608, 'steps': 8148, 'loss/train': 2.2903544902801514} 11/06/2021 22:23:11 - INFO - __main__ - Step 8150: {'lr': 0.0004978734350691793, 'samples': 1564800, 'steps': 8149, 'loss/train': 1.4162994623184204} 11/06/2021 22:23:11 - INFO - __main__ - Step 8151: {'lr': 0.0004978727443173077, 'samples': 1564992, 'steps': 8150, 'loss/train': 1.6277093887329102} 11/06/2021 22:23:12 - INFO - __main__ - Step 8152: {'lr': 0.0004978720534537485, 'samples': 1565184, 'steps': 8151, 'loss/train': 1.715814471244812} 11/06/2021 22:23:13 - INFO - __main__ - Step 8153: {'lr': 0.000497871362478502, 'samples': 1565376, 'steps': 8152, 'loss/train': 1.4183335304260254} 11/06/2021 22:23:13 - INFO - __main__ - Step 8154: {'lr': 0.0004978706713915684, 'samples': 1565568, 'steps': 8153, 'loss/train': 1.7779927253723145} 11/06/2021 22:23:13 - INFO - __main__ - Step 8155: {'lr': 0.0004978699801929481, 'samples': 1565760, 'steps': 8154, 'loss/train': 1.3929790258407593} 11/06/2021 22:23:14 - INFO - __main__ - Step 8156: {'lr': 0.0004978692888826415, 'samples': 1565952, 'steps': 8155, 'loss/train': 1.6693589687347412} 11/06/2021 22:23:14 - INFO - __main__ - Step 8157: {'lr': 0.0004978685974606488, 'samples': 1566144, 'steps': 8156, 'loss/train': 1.4203959703445435} 11/06/2021 22:23:15 - INFO - __main__ - Step 8158: {'lr': 0.0004978679059269704, 'samples': 1566336, 'steps': 8157, 'loss/train': 1.9157639741897583} 11/06/2021 22:23:16 - INFO - __main__ - Step 8159: {'lr': 0.0004978672142816064, 'samples': 1566528, 'steps': 8158, 'loss/train': 1.5440396070480347} 11/06/2021 22:23:16 - INFO - __main__ - Step 8160: {'lr': 0.0004978665225245573, 'samples': 1566720, 'steps': 8159, 'loss/train': 1.5769977569580078} 11/06/2021 22:23:16 - INFO - __main__ - Step 8161: {'lr': 0.0004978658306558234, 'samples': 1566912, 'steps': 8160, 'loss/train': 1.9604636430740356} 11/06/2021 22:23:17 - INFO - __main__ - Step 8162: {'lr': 0.000497865138675405, 'samples': 1567104, 'steps': 8161, 'loss/train': 1.6201629638671875} 11/06/2021 22:23:18 - INFO - __main__ - Step 8163: {'lr': 0.0004978644465833024, 'samples': 1567296, 'steps': 8162, 'loss/train': 1.5474094152450562} 11/06/2021 22:23:18 - INFO - __main__ - Step 8164: {'lr': 0.000497863754379516, 'samples': 1567488, 'steps': 8163, 'loss/train': 2.2988481521606445} 11/06/2021 22:23:18 - INFO - __main__ - Step 8165: {'lr': 0.0004978630620640458, 'samples': 1567680, 'steps': 8164, 'loss/train': 1.2239776849746704} 11/06/2021 22:23:19 - INFO - __main__ - Step 8166: {'lr': 0.0004978623696368924, 'samples': 1567872, 'steps': 8165, 'loss/train': 1.8691977262496948} 11/06/2021 22:23:19 - INFO - __main__ - Step 8167: {'lr': 0.0004978616770980561, 'samples': 1568064, 'steps': 8166, 'loss/train': 1.9919401407241821} 11/06/2021 22:23:19 - INFO - __main__ - Step 8168: {'lr': 0.0004978609844475371, 'samples': 1568256, 'steps': 8167, 'loss/train': 2.040534257888794} 11/06/2021 22:23:21 - INFO - __main__ - Step 8169: {'lr': 0.0004978602916853359, 'samples': 1568448, 'steps': 8168, 'loss/train': 1.9862347841262817} 11/06/2021 22:23:21 - INFO - __main__ - Step 8170: {'lr': 0.0004978595988114525, 'samples': 1568640, 'steps': 8169, 'loss/train': 1.8854844570159912} 11/06/2021 22:23:21 - INFO - __main__ - Step 8171: {'lr': 0.0004978589058258874, 'samples': 1568832, 'steps': 8170, 'loss/train': 1.8785438537597656} 11/06/2021 22:23:22 - INFO - __main__ - Step 8172: {'lr': 0.0004978582127286409, 'samples': 1569024, 'steps': 8171, 'loss/train': 2.3725521564483643} 11/06/2021 22:23:22 - INFO - __main__ - Step 8173: {'lr': 0.0004978575195197135, 'samples': 1569216, 'steps': 8172, 'loss/train': 2.0266950130462646} 11/06/2021 22:23:23 - INFO - __main__ - Step 8174: {'lr': 0.0004978568261991051, 'samples': 1569408, 'steps': 8173, 'loss/train': 1.6820038557052612} 11/06/2021 22:23:23 - INFO - __main__ - Step 8175: {'lr': 0.0004978561327668164, 'samples': 1569600, 'steps': 8174, 'loss/train': 1.5046055316925049} 11/06/2021 22:23:24 - INFO - __main__ - Step 8176: {'lr': 0.0004978554392228475, 'samples': 1569792, 'steps': 8175, 'loss/train': 1.718424916267395} 11/06/2021 22:23:24 - INFO - __main__ - Step 8177: {'lr': 0.0004978547455671986, 'samples': 1569984, 'steps': 8176, 'loss/train': 1.3242571353912354} 11/06/2021 22:23:24 - INFO - __main__ - Step 8178: {'lr': 0.0004978540517998704, 'samples': 1570176, 'steps': 8177, 'loss/train': 1.511248230934143} 11/06/2021 22:23:25 - INFO - __main__ - Step 8179: {'lr': 0.0004978533579208629, 'samples': 1570368, 'steps': 8178, 'loss/train': 1.66165292263031} 11/06/2021 22:23:26 - INFO - __main__ - Step 8180: {'lr': 0.0004978526639301766, 'samples': 1570560, 'steps': 8179, 'loss/train': 2.0218098163604736} 11/06/2021 22:23:26 - INFO - __main__ - Step 8181: {'lr': 0.0004978519698278116, 'samples': 1570752, 'steps': 8180, 'loss/train': 2.0883474349975586} 11/06/2021 22:23:26 - INFO - __main__ - Step 8182: {'lr': 0.0004978512756137684, 'samples': 1570944, 'steps': 8181, 'loss/train': 2.5265679359436035} 11/06/2021 22:23:27 - INFO - __main__ - Step 8183: {'lr': 0.0004978505812880472, 'samples': 1571136, 'steps': 8182, 'loss/train': 1.8759024143218994} 11/06/2021 22:23:28 - INFO - __main__ - Step 8184: {'lr': 0.0004978498868506483, 'samples': 1571328, 'steps': 8183, 'loss/train': 2.0600368976593018} 11/06/2021 22:23:28 - INFO - __main__ - Step 8185: {'lr': 0.0004978491923015721, 'samples': 1571520, 'steps': 8184, 'loss/train': 1.7194626331329346} 11/06/2021 22:23:28 - INFO - __main__ - Step 8186: {'lr': 0.0004978484976408189, 'samples': 1571712, 'steps': 8185, 'loss/train': 1.817257285118103} 11/06/2021 22:23:29 - INFO - __main__ - Step 8187: {'lr': 0.000497847802868389, 'samples': 1571904, 'steps': 8186, 'loss/train': 1.9279881715774536} 11/06/2021 22:23:29 - INFO - __main__ - Step 8188: {'lr': 0.0004978471079842827, 'samples': 1572096, 'steps': 8187, 'loss/train': 1.9306005239486694} 11/06/2021 22:23:30 - INFO - __main__ - Step 8189: {'lr': 0.0004978464129885003, 'samples': 1572288, 'steps': 8188, 'loss/train': 1.9209128618240356} 11/06/2021 22:23:31 - INFO - __main__ - Step 8190: {'lr': 0.0004978457178810422, 'samples': 1572480, 'steps': 8189, 'loss/train': 0.7898921966552734} 11/06/2021 22:23:31 - INFO - __main__ - Step 8191: {'lr': 0.0004978450226619085, 'samples': 1572672, 'steps': 8190, 'loss/train': 1.9690088033676147} 11/06/2021 22:23:31 - INFO - __main__ - Step 8192: {'lr': 0.0004978443273310997, 'samples': 1572864, 'steps': 8191, 'loss/train': 1.8891528844833374} 11/06/2021 22:23:32 - INFO - __main__ - Step 8193: {'lr': 0.0004978436318886162, 'samples': 1573056, 'steps': 8192, 'loss/train': 1.6559553146362305} 11/06/2021 22:23:32 - INFO - __main__ - Step 8194: {'lr': 0.0004978429363344581, 'samples': 1573248, 'steps': 8193, 'loss/train': 0.9482629299163818} 11/06/2021 22:23:33 - INFO - __main__ - Step 8195: {'lr': 0.0004978422406686257, 'samples': 1573440, 'steps': 8194, 'loss/train': 1.9039665460586548} 11/06/2021 22:23:33 - INFO - __main__ - Step 8196: {'lr': 0.0004978415448911196, 'samples': 1573632, 'steps': 8195, 'loss/train': 1.7499048709869385} 11/06/2021 22:23:34 - INFO - __main__ - Step 8197: {'lr': 0.0004978408490019398, 'samples': 1573824, 'steps': 8196, 'loss/train': 1.526663064956665} 11/06/2021 22:23:34 - INFO - __main__ - Step 8198: {'lr': 0.0004978401530010868, 'samples': 1574016, 'steps': 8197, 'loss/train': 2.0458552837371826} 11/06/2021 22:23:35 - INFO - __main__ - Step 8199: {'lr': 0.0004978394568885608, 'samples': 1574208, 'steps': 8198, 'loss/train': 1.7852051258087158} 11/06/2021 22:23:36 - INFO - __main__ - Step 8200: {'lr': 0.0004978387606643621, 'samples': 1574400, 'steps': 8199, 'loss/train': 1.7068928480148315} 11/06/2021 22:23:36 - INFO - __main__ - Step 8201: {'lr': 0.0004978380643284912, 'samples': 1574592, 'steps': 8200, 'loss/train': 1.7486140727996826} 11/06/2021 22:23:36 - INFO - __main__ - Step 8202: {'lr': 0.0004978373678809482, 'samples': 1574784, 'steps': 8201, 'loss/train': 6.046335220336914} 11/06/2021 22:23:37 - INFO - __main__ - Step 8203: {'lr': 0.0004978366713217336, 'samples': 1574976, 'steps': 8202, 'loss/train': 1.418635368347168} 11/06/2021 22:23:37 - INFO - __main__ - Step 8204: {'lr': 0.0004978359746508476, 'samples': 1575168, 'steps': 8203, 'loss/train': 2.039991855621338} 11/06/2021 22:23:38 - INFO - __main__ - Step 8205: {'lr': 0.0004978352778682905, 'samples': 1575360, 'steps': 8204, 'loss/train': 1.6890138387680054} 11/06/2021 22:23:38 - INFO - __main__ - Step 8206: {'lr': 0.0004978345809740626, 'samples': 1575552, 'steps': 8205, 'loss/train': 1.5831921100616455} 11/06/2021 22:23:39 - INFO - __main__ - Step 8207: {'lr': 0.0004978338839681644, 'samples': 1575744, 'steps': 8206, 'loss/train': 1.284598469734192} 11/06/2021 22:23:39 - INFO - __main__ - Step 8208: {'lr': 0.000497833186850596, 'samples': 1575936, 'steps': 8207, 'loss/train': 1.9825032949447632} 11/06/2021 22:23:39 - INFO - __main__ - Step 8209: {'lr': 0.0004978324896213577, 'samples': 1576128, 'steps': 8208, 'loss/train': 1.068908929824829} 11/06/2021 22:23:40 - INFO - __main__ - Step 8210: {'lr': 0.00049783179228045, 'samples': 1576320, 'steps': 8209, 'loss/train': 2.0612127780914307} 11/06/2021 22:23:41 - INFO - __main__ - Step 8211: {'lr': 0.0004978310948278731, 'samples': 1576512, 'steps': 8210, 'loss/train': 1.5374093055725098} 11/06/2021 22:23:41 - INFO - __main__ - Step 8212: {'lr': 0.0004978303972636275, 'samples': 1576704, 'steps': 8211, 'loss/train': 1.6517690420150757} 11/06/2021 22:23:41 - INFO - __main__ - Step 8213: {'lr': 0.0004978296995877132, 'samples': 1576896, 'steps': 8212, 'loss/train': 1.9710806608200073} 11/06/2021 22:23:42 - INFO - __main__ - Step 8214: {'lr': 0.0004978290018001306, 'samples': 1577088, 'steps': 8213, 'loss/train': 1.794217824935913} 11/06/2021 22:23:42 - INFO - __main__ - Step 8215: {'lr': 0.0004978283039008801, 'samples': 1577280, 'steps': 8214, 'loss/train': 2.031745433807373} 11/06/2021 22:23:43 - INFO - __main__ - Step 8216: {'lr': 0.000497827605889962, 'samples': 1577472, 'steps': 8215, 'loss/train': 1.8105554580688477} 11/06/2021 22:23:44 - INFO - __main__ - Step 8217: {'lr': 0.0004978269077673766, 'samples': 1577664, 'steps': 8216, 'loss/train': 1.3719156980514526} 11/06/2021 22:23:44 - INFO - __main__ - Step 8218: {'lr': 0.0004978262095331243, 'samples': 1577856, 'steps': 8217, 'loss/train': 1.7575129270553589} 11/06/2021 22:23:44 - INFO - __main__ - Step 8219: {'lr': 0.0004978255111872053, 'samples': 1578048, 'steps': 8218, 'loss/train': 1.9451090097427368} 11/06/2021 22:23:45 - INFO - __main__ - Step 8220: {'lr': 0.0004978248127296198, 'samples': 1578240, 'steps': 8219, 'loss/train': 2.2303905487060547} 11/06/2021 22:23:46 - INFO - __main__ - Step 8221: {'lr': 0.0004978241141603685, 'samples': 1578432, 'steps': 8220, 'loss/train': 2.1511523723602295} 11/06/2021 22:23:46 - INFO - __main__ - Step 8222: {'lr': 0.0004978234154794514, 'samples': 1578624, 'steps': 8221, 'loss/train': 1.9754325151443481} 11/06/2021 22:23:46 - INFO - __main__ - Step 8223: {'lr': 0.0004978227166868689, 'samples': 1578816, 'steps': 8222, 'loss/train': 1.4744144678115845} 11/06/2021 22:23:47 - INFO - __main__ - Step 8224: {'lr': 0.0004978220177826212, 'samples': 1579008, 'steps': 8223, 'loss/train': 1.977313756942749} 11/06/2021 22:23:47 - INFO - __main__ - Step 8225: {'lr': 0.0004978213187667087, 'samples': 1579200, 'steps': 8224, 'loss/train': 1.9514577388763428} 11/06/2021 22:23:48 - INFO - __main__ - Step 8226: {'lr': 0.0004978206196391319, 'samples': 1579392, 'steps': 8225, 'loss/train': 1.6040639877319336} 11/06/2021 22:23:48 - INFO - __main__ - Step 8227: {'lr': 0.0004978199203998909, 'samples': 1579584, 'steps': 8226, 'loss/train': 2.2450778484344482} 11/06/2021 22:23:49 - INFO - __main__ - Step 8228: {'lr': 0.0004978192210489861, 'samples': 1579776, 'steps': 8227, 'loss/train': 1.8342416286468506} 11/06/2021 22:23:49 - INFO - __main__ - Step 8229: {'lr': 0.0004978185215864177, 'samples': 1579968, 'steps': 8228, 'loss/train': 1.8177608251571655} 11/06/2021 22:23:50 - INFO - __main__ - Step 8230: {'lr': 0.0004978178220121862, 'samples': 1580160, 'steps': 8229, 'loss/train': 1.1857390403747559} 11/06/2021 22:23:50 - INFO - __main__ - Step 8231: {'lr': 0.0004978171223262917, 'samples': 1580352, 'steps': 8230, 'loss/train': 1.4002374410629272} 11/06/2021 22:23:51 - INFO - __main__ - Step 8232: {'lr': 0.0004978164225287346, 'samples': 1580544, 'steps': 8231, 'loss/train': 1.7684195041656494} 11/06/2021 22:23:51 - INFO - __main__ - Step 8233: {'lr': 0.0004978157226195153, 'samples': 1580736, 'steps': 8232, 'loss/train': 1.4781603813171387} 11/06/2021 22:23:52 - INFO - __main__ - Step 8234: {'lr': 0.0004978150225986342, 'samples': 1580928, 'steps': 8233, 'loss/train': 1.919776439666748} 11/06/2021 22:23:52 - INFO - __main__ - Step 8235: {'lr': 0.0004978143224660913, 'samples': 1581120, 'steps': 8234, 'loss/train': 2.3026692867279053} 11/06/2021 22:23:52 - INFO - __main__ - Step 8236: {'lr': 0.0004978136222218872, 'samples': 1581312, 'steps': 8235, 'loss/train': 1.9693598747253418} 11/06/2021 22:23:53 - INFO - __main__ - Step 8237: {'lr': 0.000497812921866022, 'samples': 1581504, 'steps': 8236, 'loss/train': 2.0580477714538574} 11/06/2021 22:23:54 - INFO - __main__ - Step 8238: {'lr': 0.0004978122213984961, 'samples': 1581696, 'steps': 8237, 'loss/train': 1.808411955833435} 11/06/2021 22:23:54 - INFO - __main__ - Step 8239: {'lr': 0.00049781152081931, 'samples': 1581888, 'steps': 8238, 'loss/train': 1.9752439260482788} 11/06/2021 22:23:54 - INFO - __main__ - Step 8240: {'lr': 0.0004978108201284638, 'samples': 1582080, 'steps': 8239, 'loss/train': 0.9977089166641235} 11/06/2021 22:23:55 - INFO - __main__ - Step 8241: {'lr': 0.0004978101193259578, 'samples': 1582272, 'steps': 8240, 'loss/train': 0.3795441687107086} 11/06/2021 22:23:56 - INFO - __main__ - Step 8242: {'lr': 0.0004978094184117924, 'samples': 1582464, 'steps': 8241, 'loss/train': 1.8929859399795532} 11/06/2021 22:23:56 - INFO - __main__ - Step 8243: {'lr': 0.0004978087173859679, 'samples': 1582656, 'steps': 8242, 'loss/train': 1.606785774230957} 11/06/2021 22:23:56 - INFO - __main__ - Step 8244: {'lr': 0.0004978080162484846, 'samples': 1582848, 'steps': 8243, 'loss/train': 2.046319007873535} 11/06/2021 22:23:57 - INFO - __main__ - Step 8245: {'lr': 0.000497807314999343, 'samples': 1583040, 'steps': 8244, 'loss/train': 1.9589273929595947} 11/06/2021 22:23:57 - INFO - __main__ - Step 8246: {'lr': 0.000497806613638543, 'samples': 1583232, 'steps': 8245, 'loss/train': 1.904953956604004} 11/06/2021 22:23:58 - INFO - __main__ - Step 8247: {'lr': 0.0004978059121660853, 'samples': 1583424, 'steps': 8246, 'loss/train': 1.978081226348877} 11/06/2021 22:23:59 - INFO - __main__ - Step 8248: {'lr': 0.0004978052105819701, 'samples': 1583616, 'steps': 8247, 'loss/train': 2.0313427448272705} 11/06/2021 22:23:59 - INFO - __main__ - Step 8249: {'lr': 0.0004978045088861976, 'samples': 1583808, 'steps': 8248, 'loss/train': 2.3198392391204834} 11/06/2021 22:23:59 - INFO - __main__ - Step 8250: {'lr': 0.0004978038070787683, 'samples': 1584000, 'steps': 8249, 'loss/train': 2.009343147277832} 11/06/2021 22:24:00 - INFO - __main__ - Step 8251: {'lr': 0.0004978031051596824, 'samples': 1584192, 'steps': 8250, 'loss/train': 1.775676965713501} 11/06/2021 22:24:01 - INFO - __main__ - Step 8252: {'lr': 0.0004978024031289402, 'samples': 1584384, 'steps': 8251, 'loss/train': 1.8589789867401123} 11/06/2021 22:24:01 - INFO - __main__ - Step 8253: {'lr': 0.0004978017009865421, 'samples': 1584576, 'steps': 8252, 'loss/train': 1.619974970817566} 11/06/2021 22:24:01 - INFO - __main__ - Step 8254: {'lr': 0.0004978009987324884, 'samples': 1584768, 'steps': 8253, 'loss/train': 1.9142787456512451} 11/06/2021 22:24:02 - INFO - __main__ - Step 8255: {'lr': 0.0004978002963667794, 'samples': 1584960, 'steps': 8254, 'loss/train': 1.2939250469207764} 11/06/2021 22:24:02 - INFO - __main__ - Step 8256: {'lr': 0.0004977995938894153, 'samples': 1585152, 'steps': 8255, 'loss/train': 2.2025363445281982} 11/06/2021 22:24:03 - INFO - __main__ - Step 8257: {'lr': 0.0004977988913003966, 'samples': 1585344, 'steps': 8256, 'loss/train': 2.0969882011413574} 11/06/2021 22:24:03 - INFO - __main__ - Step 8258: {'lr': 0.0004977981885997235, 'samples': 1585536, 'steps': 8257, 'loss/train': 1.3908742666244507} 11/06/2021 22:24:04 - INFO - __main__ - Step 8259: {'lr': 0.0004977974857873964, 'samples': 1585728, 'steps': 8258, 'loss/train': 1.7966902256011963} 11/06/2021 22:24:04 - INFO - __main__ - Step 8260: {'lr': 0.0004977967828634157, 'samples': 1585920, 'steps': 8259, 'loss/train': 1.8227860927581787} 11/06/2021 22:24:05 - INFO - __main__ - Step 8261: {'lr': 0.0004977960798277814, 'samples': 1586112, 'steps': 8260, 'loss/train': 1.9070138931274414} 11/06/2021 22:24:05 - INFO - __main__ - Step 8262: {'lr': 0.0004977953766804941, 'samples': 1586304, 'steps': 8261, 'loss/train': 1.7750216722488403} 11/06/2021 22:24:06 - INFO - __main__ - Step 8263: {'lr': 0.0004977946734215541, 'samples': 1586496, 'steps': 8262, 'loss/train': 1.3663748502731323} 11/06/2021 22:24:06 - INFO - __main__ - Step 8264: {'lr': 0.0004977939700509615, 'samples': 1586688, 'steps': 8263, 'loss/train': 2.318755626678467} 11/06/2021 22:24:07 - INFO - __main__ - Step 8265: {'lr': 0.0004977932665687168, 'samples': 1586880, 'steps': 8264, 'loss/train': 1.6168603897094727} 11/06/2021 22:24:07 - INFO - __main__ - Step 8266: {'lr': 0.0004977925629748203, 'samples': 1587072, 'steps': 8265, 'loss/train': 1.8348907232284546} 11/06/2021 22:24:07 - INFO - __main__ - Step 8267: {'lr': 0.0004977918592692723, 'samples': 1587264, 'steps': 8266, 'loss/train': 2.0127944946289062} 11/06/2021 22:24:08 - INFO - __main__ - Step 8268: {'lr': 0.0004977911554520731, 'samples': 1587456, 'steps': 8267, 'loss/train': 2.4717700481414795} 11/06/2021 22:24:09 - INFO - __main__ - Step 8269: {'lr': 0.000497790451523223, 'samples': 1587648, 'steps': 8268, 'loss/train': 1.848479151725769} 11/06/2021 22:24:09 - INFO - __main__ - Step 8270: {'lr': 0.0004977897474827224, 'samples': 1587840, 'steps': 8269, 'loss/train': 2.058631420135498} 11/06/2021 22:24:09 - INFO - __main__ - Step 8271: {'lr': 0.0004977890433305716, 'samples': 1588032, 'steps': 8270, 'loss/train': 2.1743407249450684} 11/06/2021 22:24:10 - INFO - __main__ - Step 8272: {'lr': 0.0004977883390667707, 'samples': 1588224, 'steps': 8271, 'loss/train': 1.7695982456207275} 11/06/2021 22:24:11 - INFO - __main__ - Step 8273: {'lr': 0.0004977876346913204, 'samples': 1588416, 'steps': 8272, 'loss/train': 1.5789722204208374} 11/06/2021 22:24:11 - INFO - __main__ - Step 8274: {'lr': 0.0004977869302042207, 'samples': 1588608, 'steps': 8273, 'loss/train': 1.8004614114761353} 11/06/2021 22:24:12 - INFO - __main__ - Step 8275: {'lr': 0.0004977862256054721, 'samples': 1588800, 'steps': 8274, 'loss/train': 1.8264741897583008} 11/06/2021 22:24:12 - INFO - __main__ - Step 8276: {'lr': 0.0004977855208950748, 'samples': 1588992, 'steps': 8275, 'loss/train': 1.8771971464157104} 11/06/2021 22:24:12 - INFO - __main__ - Step 8277: {'lr': 0.0004977848160730292, 'samples': 1589184, 'steps': 8276, 'loss/train': 1.6610857248306274} 11/06/2021 22:24:13 - INFO - __main__ - Step 8278: {'lr': 0.0004977841111393356, 'samples': 1589376, 'steps': 8277, 'loss/train': 1.7846906185150146} 11/06/2021 22:24:14 - INFO - __main__ - Step 8279: {'lr': 0.0004977834060939943, 'samples': 1589568, 'steps': 8278, 'loss/train': 1.6243547201156616} 11/06/2021 22:24:14 - INFO - __main__ - Step 8280: {'lr': 0.0004977827009370056, 'samples': 1589760, 'steps': 8279, 'loss/train': 1.42178475856781} 11/06/2021 22:24:14 - INFO - __main__ - Step 8281: {'lr': 0.0004977819956683698, 'samples': 1589952, 'steps': 8280, 'loss/train': 1.8669449090957642} 11/06/2021 22:24:15 - INFO - __main__ - Step 8282: {'lr': 0.0004977812902880873, 'samples': 1590144, 'steps': 8281, 'loss/train': 2.0661544799804688} 11/06/2021 22:24:15 - INFO - __main__ - Step 8283: {'lr': 0.0004977805847961584, 'samples': 1590336, 'steps': 8282, 'loss/train': 1.9494577646255493} 11/06/2021 22:24:16 - INFO - __main__ - Step 8284: {'lr': 0.0004977798791925834, 'samples': 1590528, 'steps': 8283, 'loss/train': 1.9344879388809204} 11/06/2021 22:24:16 - INFO - __main__ - Step 8285: {'lr': 0.0004977791734773624, 'samples': 1590720, 'steps': 8284, 'loss/train': 1.7498496770858765} 11/06/2021 22:24:17 - INFO - __main__ - Step 8286: {'lr': 0.0004977784676504962, 'samples': 1590912, 'steps': 8285, 'loss/train': 1.4974678754806519} 11/06/2021 22:24:17 - INFO - __main__ - Step 8287: {'lr': 0.0004977777617119847, 'samples': 1591104, 'steps': 8286, 'loss/train': 1.8675819635391235} 11/06/2021 22:24:18 - INFO - __main__ - Step 8288: {'lr': 0.0004977770556618284, 'samples': 1591296, 'steps': 8287, 'loss/train': 2.119424343109131} 11/06/2021 22:24:18 - INFO - __main__ - Step 8289: {'lr': 0.0004977763495000276, 'samples': 1591488, 'steps': 8288, 'loss/train': 1.7665525674819946} 11/06/2021 22:24:19 - INFO - __main__ - Step 8290: {'lr': 0.0004977756432265827, 'samples': 1591680, 'steps': 8289, 'loss/train': 1.5919955968856812} 11/06/2021 22:24:19 - INFO - __main__ - Step 8291: {'lr': 0.0004977749368414937, 'samples': 1591872, 'steps': 8290, 'loss/train': 1.6842032670974731} 11/06/2021 22:24:20 - INFO - __main__ - Step 8292: {'lr': 0.0004977742303447613, 'samples': 1592064, 'steps': 8291, 'loss/train': 2.4035115242004395} 11/06/2021 22:24:20 - INFO - __main__ - Step 8293: {'lr': 0.0004977735237363855, 'samples': 1592256, 'steps': 8292, 'loss/train': 1.7982686758041382} 11/06/2021 22:24:21 - INFO - __main__ - Step 8294: {'lr': 0.0004977728170163669, 'samples': 1592448, 'steps': 8293, 'loss/train': 2.0274014472961426} 11/06/2021 22:24:21 - INFO - __main__ - Step 8295: {'lr': 0.0004977721101847057, 'samples': 1592640, 'steps': 8294, 'loss/train': 2.058189868927002} 11/06/2021 22:24:22 - INFO - __main__ - Step 8296: {'lr': 0.0004977714032414021, 'samples': 1592832, 'steps': 8295, 'loss/train': 1.709094762802124} 11/06/2021 22:24:22 - INFO - __main__ - Step 8297: {'lr': 0.0004977706961864566, 'samples': 1593024, 'steps': 8296, 'loss/train': 2.004429578781128} 11/06/2021 22:24:22 - INFO - __main__ - Step 8298: {'lr': 0.0004977699890198695, 'samples': 1593216, 'steps': 8297, 'loss/train': 0.9048164486885071} 11/06/2021 22:24:23 - INFO - __main__ - Step 8299: {'lr': 0.0004977692817416411, 'samples': 1593408, 'steps': 8298, 'loss/train': 1.415739893913269} 11/06/2021 22:24:24 - INFO - __main__ - Step 8300: {'lr': 0.0004977685743517715, 'samples': 1593600, 'steps': 8299, 'loss/train': 1.8536925315856934} 11/06/2021 22:24:24 - INFO - __main__ - Step 8301: {'lr': 0.0004977678668502614, 'samples': 1593792, 'steps': 8300, 'loss/train': 2.0791187286376953} 11/06/2021 22:24:24 - INFO - __main__ - Step 8302: {'lr': 0.0004977671592371108, 'samples': 1593984, 'steps': 8301, 'loss/train': 0.7785729765892029} 11/06/2021 22:24:25 - INFO - __main__ - Step 8303: {'lr': 0.0004977664515123201, 'samples': 1594176, 'steps': 8302, 'loss/train': 1.8622348308563232} 11/06/2021 22:24:26 - INFO - __main__ - Step 8304: {'lr': 0.0004977657436758898, 'samples': 1594368, 'steps': 8303, 'loss/train': 1.9687023162841797} 11/06/2021 22:24:26 - INFO - __main__ - Step 8305: {'lr': 0.00049776503572782, 'samples': 1594560, 'steps': 8304, 'loss/train': 1.782356858253479} 11/06/2021 22:24:27 - INFO - __main__ - Step 8306: {'lr': 0.0004977643276681111, 'samples': 1594752, 'steps': 8305, 'loss/train': 1.6314135789871216} 11/06/2021 22:24:27 - INFO - __main__ - Step 8307: {'lr': 0.0004977636194967634, 'samples': 1594944, 'steps': 8306, 'loss/train': 2.114375591278076} 11/06/2021 22:24:27 - INFO - __main__ - Step 8308: {'lr': 0.0004977629112137773, 'samples': 1595136, 'steps': 8307, 'loss/train': 1.4448978900909424} 11/06/2021 22:24:28 - INFO - __main__ - Step 8309: {'lr': 0.000497762202819153, 'samples': 1595328, 'steps': 8308, 'loss/train': 2.2745602130889893} 11/06/2021 22:24:29 - INFO - __main__ - Step 8310: {'lr': 0.0004977614943128909, 'samples': 1595520, 'steps': 8309, 'loss/train': 2.14582896232605} 11/06/2021 22:24:29 - INFO - __main__ - Step 8311: {'lr': 0.0004977607856949913, 'samples': 1595712, 'steps': 8310, 'loss/train': 1.7548679113388062} 11/06/2021 22:24:29 - INFO - __main__ - Step 8312: {'lr': 0.0004977600769654545, 'samples': 1595904, 'steps': 8311, 'loss/train': 1.5743494033813477} 11/06/2021 22:24:30 - INFO - __main__ - Step 8313: {'lr': 0.0004977593681242808, 'samples': 1596096, 'steps': 8312, 'loss/train': 2.279632806777954} 11/06/2021 22:24:30 - INFO - __main__ - Step 8314: {'lr': 0.0004977586591714706, 'samples': 1596288, 'steps': 8313, 'loss/train': 2.477036714553833} 11/06/2021 22:24:31 - INFO - __main__ - Step 8315: {'lr': 0.0004977579501070241, 'samples': 1596480, 'steps': 8314, 'loss/train': 1.5748059749603271} 11/06/2021 22:24:31 - INFO - __main__ - Step 8316: {'lr': 0.0004977572409309418, 'samples': 1596672, 'steps': 8315, 'loss/train': 1.7749756574630737} 11/06/2021 22:24:32 - INFO - __main__ - Step 8317: {'lr': 0.0004977565316432238, 'samples': 1596864, 'steps': 8316, 'loss/train': 2.0671162605285645} 11/06/2021 22:24:32 - INFO - __main__ - Step 8318: {'lr': 0.0004977558222438707, 'samples': 1597056, 'steps': 8317, 'loss/train': 1.9523950815200806} 11/06/2021 22:24:33 - INFO - __main__ - Step 8319: {'lr': 0.0004977551127328824, 'samples': 1597248, 'steps': 8318, 'loss/train': 1.767422080039978} 11/06/2021 22:24:34 - INFO - __main__ - Step 8320: {'lr': 0.0004977544031102597, 'samples': 1597440, 'steps': 8319, 'loss/train': 1.7613978385925293} 11/06/2021 22:24:34 - INFO - __main__ - Step 8321: {'lr': 0.0004977536933760025, 'samples': 1597632, 'steps': 8320, 'loss/train': 1.8801629543304443} 11/06/2021 22:24:34 - INFO - __main__ - Step 8322: {'lr': 0.0004977529835301115, 'samples': 1597824, 'steps': 8321, 'loss/train': 1.653210163116455} 11/06/2021 22:24:35 - INFO - __main__ - Step 8323: {'lr': 0.0004977522735725866, 'samples': 1598016, 'steps': 8322, 'loss/train': 1.4604030847549438} 11/06/2021 22:24:35 - INFO - __main__ - Step 8324: {'lr': 0.0004977515635034285, 'samples': 1598208, 'steps': 8323, 'loss/train': 1.7190172672271729} 11/06/2021 22:24:36 - INFO - __main__ - Step 8325: {'lr': 0.0004977508533226374, 'samples': 1598400, 'steps': 8324, 'loss/train': 1.7144981622695923} 11/06/2021 22:24:36 - INFO - __main__ - Step 8326: {'lr': 0.0004977501430302136, 'samples': 1598592, 'steps': 8325, 'loss/train': 2.134070873260498} 11/06/2021 22:24:37 - INFO - __main__ - Step 8327: {'lr': 0.0004977494326261573, 'samples': 1598784, 'steps': 8326, 'loss/train': 2.0011684894561768} 11/06/2021 22:24:37 - INFO - __main__ - Step 8328: {'lr': 0.000497748722110469, 'samples': 1598976, 'steps': 8327, 'loss/train': 3.0373826026916504} 11/06/2021 22:24:38 - INFO - __main__ - Step 8329: {'lr': 0.0004977480114831489, 'samples': 1599168, 'steps': 8328, 'loss/train': 1.5927302837371826} 11/06/2021 22:24:38 - INFO - __main__ - Step 8330: {'lr': 0.0004977473007441973, 'samples': 1599360, 'steps': 8329, 'loss/train': 1.6622514724731445} 11/06/2021 22:24:39 - INFO - __main__ - Step 8331: {'lr': 0.0004977465898936147, 'samples': 1599552, 'steps': 8330, 'loss/train': 2.2087743282318115} 11/06/2021 22:24:39 - INFO - __main__ - Step 8332: {'lr': 0.0004977458789314014, 'samples': 1599744, 'steps': 8331, 'loss/train': 2.013927936553955} 11/06/2021 22:24:40 - INFO - __main__ - Step 8333: {'lr': 0.0004977451678575575, 'samples': 1599936, 'steps': 8332, 'loss/train': 1.207032561302185} 11/06/2021 22:24:40 - INFO - __main__ - Step 8334: {'lr': 0.0004977444566720834, 'samples': 1600128, 'steps': 8333, 'loss/train': 1.9208728075027466} 11/06/2021 22:24:40 - INFO - __main__ - Step 8335: {'lr': 0.0004977437453749795, 'samples': 1600320, 'steps': 8334, 'loss/train': 1.8534126281738281} 11/06/2021 22:24:42 - INFO - __main__ - Step 8336: {'lr': 0.0004977430339662462, 'samples': 1600512, 'steps': 8335, 'loss/train': 2.3053481578826904} 11/06/2021 22:24:42 - INFO - __main__ - Step 8337: {'lr': 0.0004977423224458837, 'samples': 1600704, 'steps': 8336, 'loss/train': 1.7347732782363892} 11/06/2021 22:24:42 - INFO - __main__ - Step 8338: {'lr': 0.0004977416108138922, 'samples': 1600896, 'steps': 8337, 'loss/train': 1.3853802680969238} 11/06/2021 22:24:43 - INFO - __main__ - Step 8339: {'lr': 0.0004977408990702722, 'samples': 1601088, 'steps': 8338, 'loss/train': 1.826993465423584} 11/06/2021 22:24:43 - INFO - __main__ - Step 8340: {'lr': 0.0004977401872150241, 'samples': 1601280, 'steps': 8339, 'loss/train': 1.857647180557251} 11/06/2021 22:24:44 - INFO - __main__ - Step 8341: {'lr': 0.000497739475248148, 'samples': 1601472, 'steps': 8340, 'loss/train': 1.9734106063842773} 11/06/2021 22:24:44 - INFO - __main__ - Step 8342: {'lr': 0.0004977387631696443, 'samples': 1601664, 'steps': 8341, 'loss/train': 1.5935910940170288} 11/06/2021 22:24:45 - INFO - __main__ - Step 8343: {'lr': 0.0004977380509795133, 'samples': 1601856, 'steps': 8342, 'loss/train': 1.8237578868865967} 11/06/2021 22:24:45 - INFO - __main__ - Step 8344: {'lr': 0.0004977373386777554, 'samples': 1602048, 'steps': 8343, 'loss/train': 2.0518319606781006} 11/06/2021 22:24:45 - INFO - __main__ - Step 8345: {'lr': 0.0004977366262643709, 'samples': 1602240, 'steps': 8344, 'loss/train': 2.2713077068328857} 11/06/2021 22:24:46 - INFO - __main__ - Step 8346: {'lr': 0.0004977359137393601, 'samples': 1602432, 'steps': 8345, 'loss/train': 1.824563980102539} 11/06/2021 22:24:47 - INFO - __main__ - Step 8347: {'lr': 0.0004977352011027233, 'samples': 1602624, 'steps': 8346, 'loss/train': 2.087007999420166} 11/06/2021 22:24:47 - INFO - __main__ - Step 8348: {'lr': 0.0004977344883544608, 'samples': 1602816, 'steps': 8347, 'loss/train': 1.990113615989685} 11/06/2021 22:24:47 - INFO - __main__ - Step 8349: {'lr': 0.0004977337754945731, 'samples': 1603008, 'steps': 8348, 'loss/train': 1.9418188333511353} 11/06/2021 22:24:48 - INFO - __main__ - Step 8350: {'lr': 0.0004977330625230603, 'samples': 1603200, 'steps': 8349, 'loss/train': 1.706646203994751} 11/06/2021 22:24:49 - INFO - __main__ - Step 8351: {'lr': 0.0004977323494399227, 'samples': 1603392, 'steps': 8350, 'loss/train': 2.1837000846862793} 11/06/2021 22:24:49 - INFO - __main__ - Step 8352: {'lr': 0.0004977316362451608, 'samples': 1603584, 'steps': 8351, 'loss/train': 1.3790884017944336} 11/06/2021 22:24:50 - INFO - __main__ - Step 8353: {'lr': 0.0004977309229387749, 'samples': 1603776, 'steps': 8352, 'loss/train': 2.0610451698303223} 11/06/2021 22:24:50 - INFO - __main__ - Step 8354: {'lr': 0.0004977302095207653, 'samples': 1603968, 'steps': 8353, 'loss/train': 1.990189552307129} 11/06/2021 22:24:50 - INFO - __main__ - Step 8355: {'lr': 0.0004977294959911322, 'samples': 1604160, 'steps': 8354, 'loss/train': 1.6748875379562378} 11/06/2021 22:24:51 - INFO - __main__ - Step 8356: {'lr': 0.0004977287823498761, 'samples': 1604352, 'steps': 8355, 'loss/train': 2.276111602783203} 11/06/2021 22:24:52 - INFO - __main__ - Step 8357: {'lr': 0.0004977280685969971, 'samples': 1604544, 'steps': 8356, 'loss/train': 2.2838523387908936} 11/06/2021 22:24:52 - INFO - __main__ - Step 8358: {'lr': 0.0004977273547324958, 'samples': 1604736, 'steps': 8357, 'loss/train': 2.2760424613952637} 11/06/2021 22:24:52 - INFO - __main__ - Step 8359: {'lr': 0.0004977266407563722, 'samples': 1604928, 'steps': 8358, 'loss/train': 3.90104079246521} 11/06/2021 22:24:53 - INFO - __main__ - Step 8360: {'lr': 0.0004977259266686269, 'samples': 1605120, 'steps': 8359, 'loss/train': 1.3751955032348633} 11/06/2021 22:24:54 - INFO - __main__ - Step 8361: {'lr': 0.0004977252124692601, 'samples': 1605312, 'steps': 8360, 'loss/train': 1.8619705438613892} 11/06/2021 22:24:54 - INFO - __main__ - Step 8362: {'lr': 0.0004977244981582723, 'samples': 1605504, 'steps': 8361, 'loss/train': 1.7409098148345947} 11/06/2021 22:24:54 - INFO - __main__ - Step 8363: {'lr': 0.0004977237837356634, 'samples': 1605696, 'steps': 8362, 'loss/train': 1.7340418100357056} 11/06/2021 22:24:55 - INFO - __main__ - Step 8364: {'lr': 0.0004977230692014341, 'samples': 1605888, 'steps': 8363, 'loss/train': 1.5807855129241943} 11/06/2021 22:24:55 - INFO - __main__ - Step 8365: {'lr': 0.0004977223545555847, 'samples': 1606080, 'steps': 8364, 'loss/train': 1.9046440124511719} 11/06/2021 22:24:56 - INFO - __main__ - Step 8366: {'lr': 0.0004977216397981153, 'samples': 1606272, 'steps': 8365, 'loss/train': 1.9120842218399048} 11/06/2021 22:24:56 - INFO - __main__ - Step 8367: {'lr': 0.0004977209249290264, 'samples': 1606464, 'steps': 8366, 'loss/train': 1.9572821855545044} 11/06/2021 22:24:57 - INFO - __main__ - Step 8368: {'lr': 0.0004977202099483184, 'samples': 1606656, 'steps': 8367, 'loss/train': 2.009401559829712} 11/06/2021 22:24:57 - INFO - __main__ - Step 8369: {'lr': 0.0004977194948559913, 'samples': 1606848, 'steps': 8368, 'loss/train': 1.6611480712890625} 11/06/2021 22:24:58 - INFO - __main__ - Step 8370: {'lr': 0.0004977187796520457, 'samples': 1607040, 'steps': 8369, 'loss/train': 1.6625375747680664} 11/06/2021 22:24:58 - INFO - __main__ - Step 8371: {'lr': 0.0004977180643364819, 'samples': 1607232, 'steps': 8370, 'loss/train': 1.967087984085083} 11/06/2021 22:24:59 - INFO - __main__ - Step 8372: {'lr': 0.0004977173489093, 'samples': 1607424, 'steps': 8371, 'loss/train': 1.945002555847168} 11/06/2021 22:24:59 - INFO - __main__ - Step 8373: {'lr': 0.0004977166333705005, 'samples': 1607616, 'steps': 8372, 'loss/train': 1.0230144262313843} 11/06/2021 22:25:00 - INFO - __main__ - Step 8374: {'lr': 0.0004977159177200839, 'samples': 1607808, 'steps': 8373, 'loss/train': 1.9832271337509155} 11/06/2021 22:25:00 - INFO - __main__ - Step 8375: {'lr': 0.0004977152019580502, 'samples': 1608000, 'steps': 8374, 'loss/train': 1.5450023412704468} 11/06/2021 22:25:00 - INFO - __main__ - Step 8376: {'lr': 0.0004977144860843998, 'samples': 1608192, 'steps': 8375, 'loss/train': 1.7388193607330322} 11/06/2021 22:25:01 - INFO - __main__ - Step 8377: {'lr': 0.0004977137700991332, 'samples': 1608384, 'steps': 8376, 'loss/train': 1.4530218839645386} 11/06/2021 22:25:02 - INFO - __main__ - Step 8378: {'lr': 0.0004977130540022506, 'samples': 1608576, 'steps': 8377, 'loss/train': 1.5566548109054565} 11/06/2021 22:25:02 - INFO - __main__ - Step 8379: {'lr': 0.0004977123377937523, 'samples': 1608768, 'steps': 8378, 'loss/train': 1.7430671453475952} 11/06/2021 22:25:02 - INFO - __main__ - Step 8380: {'lr': 0.0004977116214736385, 'samples': 1608960, 'steps': 8379, 'loss/train': 1.1904152631759644} 11/06/2021 22:25:03 - INFO - __main__ - Step 8381: {'lr': 0.0004977109050419097, 'samples': 1609152, 'steps': 8380, 'loss/train': 2.402639150619507} 11/06/2021 22:25:04 - INFO - __main__ - Step 8382: {'lr': 0.0004977101884985663, 'samples': 1609344, 'steps': 8381, 'loss/train': 1.059557557106018} 11/06/2021 22:25:04 - INFO - __main__ - Step 8383: {'lr': 0.0004977094718436085, 'samples': 1609536, 'steps': 8382, 'loss/train': 1.9162335395812988} 11/06/2021 22:25:04 - INFO - __main__ - Step 8384: {'lr': 0.0004977087550770366, 'samples': 1609728, 'steps': 8383, 'loss/train': 1.9313476085662842} 11/06/2021 22:25:05 - INFO - __main__ - Step 8385: {'lr': 0.000497708038198851, 'samples': 1609920, 'steps': 8384, 'loss/train': 1.6063017845153809} 11/06/2021 22:25:05 - INFO - __main__ - Step 8386: {'lr': 0.0004977073212090519, 'samples': 1610112, 'steps': 8385, 'loss/train': 1.8818663358688354} 11/06/2021 22:25:06 - INFO - __main__ - Step 8387: {'lr': 0.0004977066041076398, 'samples': 1610304, 'steps': 8386, 'loss/train': 1.9069045782089233} 11/06/2021 22:25:06 - INFO - __main__ - Step 8388: {'lr': 0.0004977058868946148, 'samples': 1610496, 'steps': 8387, 'loss/train': 2.062808036804199} 11/06/2021 22:25:07 - INFO - __main__ - Step 8389: {'lr': 0.0004977051695699775, 'samples': 1610688, 'steps': 8388, 'loss/train': 1.8666964769363403} 11/06/2021 22:25:07 - INFO - __main__ - Step 8390: {'lr': 0.000497704452133728, 'samples': 1610880, 'steps': 8389, 'loss/train': 2.0732336044311523} 11/06/2021 22:25:08 - INFO - __main__ - Step 8391: {'lr': 0.0004977037345858667, 'samples': 1611072, 'steps': 8390, 'loss/train': 2.5039222240448} 11/06/2021 22:25:09 - INFO - __main__ - Step 8392: {'lr': 0.0004977030169263938, 'samples': 1611264, 'steps': 8391, 'loss/train': 1.8938769102096558} 11/06/2021 22:25:09 - INFO - __main__ - Step 8393: {'lr': 0.0004977022991553099, 'samples': 1611456, 'steps': 8392, 'loss/train': 1.8383945226669312} 11/06/2021 22:25:09 - INFO - __main__ - Step 8394: {'lr': 0.0004977015812726151, 'samples': 1611648, 'steps': 8393, 'loss/train': 1.9845424890518188} 11/06/2021 22:25:10 - INFO - __main__ - Step 8395: {'lr': 0.0004977008632783098, 'samples': 1611840, 'steps': 8394, 'loss/train': 1.7527318000793457} 11/06/2021 22:25:10 - INFO - __main__ - Step 8396: {'lr': 0.0004977001451723944, 'samples': 1612032, 'steps': 8395, 'loss/train': 0.8850865364074707} 11/06/2021 22:25:10 - INFO - __main__ - Step 8397: {'lr': 0.000497699426954869, 'samples': 1612224, 'steps': 8396, 'loss/train': 1.9200553894042969} 11/06/2021 22:25:11 - INFO - __main__ - Step 8398: {'lr': 0.0004976987086257342, 'samples': 1612416, 'steps': 8397, 'loss/train': 1.0962588787078857} 11/06/2021 22:25:12 - INFO - __main__ - Step 8399: {'lr': 0.0004976979901849901, 'samples': 1612608, 'steps': 8398, 'loss/train': 1.736077070236206} 11/06/2021 22:25:12 - INFO - __main__ - Step 8400: {'lr': 0.000497697271632637, 'samples': 1612800, 'steps': 8399, 'loss/train': 1.651288390159607} 11/06/2021 22:25:12 - INFO - __main__ - Step 8401: {'lr': 0.0004976965529686756, 'samples': 1612992, 'steps': 8400, 'loss/train': 1.851194977760315} 11/06/2021 22:25:13 - INFO - __main__ - Step 8402: {'lr': 0.0004976958341931057, 'samples': 1613184, 'steps': 8401, 'loss/train': 1.0811164379119873} 11/06/2021 22:25:14 - INFO - __main__ - Step 8403: {'lr': 0.000497695115305928, 'samples': 1613376, 'steps': 8402, 'loss/train': 1.9029980897903442} 11/06/2021 22:25:14 - INFO - __main__ - Step 8404: {'lr': 0.0004976943963071426, 'samples': 1613568, 'steps': 8403, 'loss/train': 1.7651731967926025} 11/06/2021 22:25:15 - INFO - __main__ - Step 8405: {'lr': 0.0004976936771967501, 'samples': 1613760, 'steps': 8404, 'loss/train': 2.381589412689209} 11/06/2021 22:25:15 - INFO - __main__ - Step 8406: {'lr': 0.0004976929579747505, 'samples': 1613952, 'steps': 8405, 'loss/train': 1.7756294012069702} 11/06/2021 22:25:15 - INFO - __main__ - Step 8407: {'lr': 0.0004976922386411444, 'samples': 1614144, 'steps': 8406, 'loss/train': 2.137291669845581} 11/06/2021 22:25:16 - INFO - __main__ - Step 8408: {'lr': 0.0004976915191959319, 'samples': 1614336, 'steps': 8407, 'loss/train': 1.879096508026123} 11/06/2021 22:25:17 - INFO - __main__ - Step 8409: {'lr': 0.0004976907996391135, 'samples': 1614528, 'steps': 8408, 'loss/train': 1.870473861694336} 11/06/2021 22:25:17 - INFO - __main__ - Step 8410: {'lr': 0.0004976900799706894, 'samples': 1614720, 'steps': 8409, 'loss/train': 2.050117015838623} 11/06/2021 22:25:17 - INFO - __main__ - Step 8411: {'lr': 0.00049768936019066, 'samples': 1614912, 'steps': 8410, 'loss/train': 1.824702262878418} 11/06/2021 22:25:18 - INFO - __main__ - Step 8412: {'lr': 0.0004976886402990255, 'samples': 1615104, 'steps': 8411, 'loss/train': 1.77139413356781} 11/06/2021 22:25:19 - INFO - __main__ - Step 8413: {'lr': 0.0004976879202957864, 'samples': 1615296, 'steps': 8412, 'loss/train': 1.8707973957061768} 11/06/2021 22:25:19 - INFO - __main__ - Step 8414: {'lr': 0.000497687200180943, 'samples': 1615488, 'steps': 8413, 'loss/train': 1.7304356098175049} 11/06/2021 22:25:19 - INFO - __main__ - Step 8415: {'lr': 0.0004976864799544954, 'samples': 1615680, 'steps': 8414, 'loss/train': 1.8535863161087036} 11/06/2021 22:25:20 - INFO - __main__ - Step 8416: {'lr': 0.0004976857596164443, 'samples': 1615872, 'steps': 8415, 'loss/train': 1.761738896369934} 11/06/2021 22:25:20 - INFO - __main__ - Step 8417: {'lr': 0.0004976850391667897, 'samples': 1616064, 'steps': 8416, 'loss/train': 1.0257487297058105} 11/06/2021 22:25:21 - INFO - __main__ - Step 8418: {'lr': 0.0004976843186055321, 'samples': 1616256, 'steps': 8417, 'loss/train': 1.9058445692062378} 11/06/2021 22:25:21 - INFO - __main__ - Step 8419: {'lr': 0.0004976835979326718, 'samples': 1616448, 'steps': 8418, 'loss/train': 1.5903434753417969} 11/06/2021 22:25:22 - INFO - __main__ - Step 8420: {'lr': 0.0004976828771482089, 'samples': 1616640, 'steps': 8419, 'loss/train': 1.9769971370697021} 11/06/2021 22:25:22 - INFO - __main__ - Step 8421: {'lr': 0.0004976821562521441, 'samples': 1616832, 'steps': 8420, 'loss/train': 1.9733790159225464} 11/06/2021 22:25:22 - INFO - __main__ - Step 8422: {'lr': 0.0004976814352444775, 'samples': 1617024, 'steps': 8421, 'loss/train': 1.1663745641708374} 11/06/2021 22:25:24 - INFO - __main__ - Step 8423: {'lr': 0.0004976807141252094, 'samples': 1617216, 'steps': 8422, 'loss/train': 1.938113808631897} 11/06/2021 22:25:24 - INFO - __main__ - Step 8424: {'lr': 0.0004976799928943403, 'samples': 1617408, 'steps': 8423, 'loss/train': 0.9520623683929443} 11/06/2021 22:25:25 - INFO - __main__ - Step 8425: {'lr': 0.0004976792715518703, 'samples': 1617600, 'steps': 8424, 'loss/train': 2.006859302520752} 11/06/2021 22:25:25 - INFO - __main__ - Step 8426: {'lr': 0.0004976785500978, 'samples': 1617792, 'steps': 8425, 'loss/train': 1.5716438293457031} 11/06/2021 22:25:25 - INFO - __main__ - Step 8427: {'lr': 0.0004976778285321294, 'samples': 1617984, 'steps': 8426, 'loss/train': 1.7659454345703125} 11/06/2021 22:25:26 - INFO - __main__ - Step 8428: {'lr': 0.0004976771068548591, 'samples': 1618176, 'steps': 8427, 'loss/train': 2.2531676292419434} 11/06/2021 22:25:27 - INFO - __main__ - Step 8429: {'lr': 0.0004976763850659893, 'samples': 1618368, 'steps': 8428, 'loss/train': 0.3412085175514221} 11/06/2021 22:25:27 - INFO - __main__ - Step 8430: {'lr': 0.0004976756631655203, 'samples': 1618560, 'steps': 8429, 'loss/train': 1.789984941482544} 11/06/2021 22:25:27 - INFO - __main__ - Step 8431: {'lr': 0.0004976749411534525, 'samples': 1618752, 'steps': 8430, 'loss/train': 1.566307783126831} 11/06/2021 22:25:28 - INFO - __main__ - Step 8432: {'lr': 0.0004976742190297862, 'samples': 1618944, 'steps': 8431, 'loss/train': 2.2874772548675537} 11/06/2021 22:25:28 - INFO - __main__ - Step 8433: {'lr': 0.0004976734967945217, 'samples': 1619136, 'steps': 8432, 'loss/train': 1.724574327468872} 11/06/2021 22:25:29 - INFO - __main__ - Step 8434: {'lr': 0.0004976727744476593, 'samples': 1619328, 'steps': 8433, 'loss/train': 2.225064516067505} 11/06/2021 22:25:29 - INFO - __main__ - Step 8435: {'lr': 0.0004976720519891994, 'samples': 1619520, 'steps': 8434, 'loss/train': 1.7268136739730835} 11/06/2021 22:25:30 - INFO - __main__ - Step 8436: {'lr': 0.0004976713294191423, 'samples': 1619712, 'steps': 8435, 'loss/train': 1.76718270778656} 11/06/2021 22:25:30 - INFO - __main__ - Step 8437: {'lr': 0.0004976706067374885, 'samples': 1619904, 'steps': 8436, 'loss/train': 1.7768479585647583} 11/06/2021 22:25:31 - INFO - __main__ - Step 8438: {'lr': 0.0004976698839442379, 'samples': 1620096, 'steps': 8437, 'loss/train': 1.0732344388961792} 11/06/2021 22:25:31 - INFO - __main__ - Step 8439: {'lr': 0.0004976691610393911, 'samples': 1620288, 'steps': 8438, 'loss/train': 2.1499216556549072} 11/06/2021 22:25:32 - INFO - __main__ - Step 8440: {'lr': 0.0004976684380229485, 'samples': 1620480, 'steps': 8439, 'loss/train': 1.8291432857513428} 11/06/2021 22:25:32 - INFO - __main__ - Step 8441: {'lr': 0.0004976677148949102, 'samples': 1620672, 'steps': 8440, 'loss/train': 1.7826381921768188} 11/06/2021 22:25:33 - INFO - __main__ - Step 8442: {'lr': 0.0004976669916552768, 'samples': 1620864, 'steps': 8441, 'loss/train': 1.9281136989593506} 11/06/2021 22:25:33 - INFO - __main__ - Step 8443: {'lr': 0.0004976662683040484, 'samples': 1621056, 'steps': 8442, 'loss/train': 1.7835140228271484} 11/06/2021 22:25:33 - INFO - __main__ - Step 8444: {'lr': 0.0004976655448412254, 'samples': 1621248, 'steps': 8443, 'loss/train': 2.6480824947357178} 11/06/2021 22:25:34 - INFO - __main__ - Step 8445: {'lr': 0.0004976648212668081, 'samples': 1621440, 'steps': 8444, 'loss/train': 1.505518913269043} 11/06/2021 22:25:35 - INFO - __main__ - Step 8446: {'lr': 0.0004976640975807969, 'samples': 1621632, 'steps': 8445, 'loss/train': 1.532829761505127} 11/06/2021 22:25:35 - INFO - __main__ - Step 8447: {'lr': 0.0004976633737831921, 'samples': 1621824, 'steps': 8446, 'loss/train': 1.6985620260238647} 11/06/2021 22:25:35 - INFO - __main__ - Step 8448: {'lr': 0.000497662649873994, 'samples': 1622016, 'steps': 8447, 'loss/train': 1.2889354228973389} 11/06/2021 22:25:36 - INFO - __main__ - Step 8449: {'lr': 0.0004976619258532029, 'samples': 1622208, 'steps': 8448, 'loss/train': 1.8851784467697144} 11/06/2021 22:25:37 - INFO - __main__ - Step 8450: {'lr': 0.0004976612017208191, 'samples': 1622400, 'steps': 8449, 'loss/train': 2.305765151977539} 11/06/2021 22:25:37 - INFO - __main__ - Step 8451: {'lr': 0.000497660477476843, 'samples': 1622592, 'steps': 8450, 'loss/train': 1.9551142454147339} 11/06/2021 22:25:37 - INFO - __main__ - Step 8452: {'lr': 0.000497659753121275, 'samples': 1622784, 'steps': 8451, 'loss/train': 2.0676944255828857} 11/06/2021 22:25:38 - INFO - __main__ - Step 8453: {'lr': 0.0004976590286541152, 'samples': 1622976, 'steps': 8452, 'loss/train': 1.929355502128601} 11/06/2021 22:25:38 - INFO - __main__ - Step 8454: {'lr': 0.0004976583040753643, 'samples': 1623168, 'steps': 8453, 'loss/train': 1.959873080253601} 11/06/2021 22:25:39 - INFO - __main__ - Step 8455: {'lr': 0.0004976575793850223, 'samples': 1623360, 'steps': 8454, 'loss/train': 1.7598445415496826} 11/06/2021 22:25:40 - INFO - __main__ - Step 8456: {'lr': 0.0004976568545830894, 'samples': 1623552, 'steps': 8455, 'loss/train': 1.6411019563674927} 11/06/2021 22:25:40 - INFO - __main__ - Step 8457: {'lr': 0.0004976561296695663, 'samples': 1623744, 'steps': 8456, 'loss/train': 1.8063533306121826} 11/06/2021 22:25:40 - INFO - __main__ - Step 8458: {'lr': 0.0004976554046444532, 'samples': 1623936, 'steps': 8457, 'loss/train': 1.4065788984298706} 11/06/2021 22:25:41 - INFO - __main__ - Step 8459: {'lr': 0.0004976546795077503, 'samples': 1624128, 'steps': 8458, 'loss/train': 2.1664505004882812} 11/06/2021 22:25:42 - INFO - __main__ - Step 8460: {'lr': 0.0004976539542594582, 'samples': 1624320, 'steps': 8459, 'loss/train': 1.423108458518982} 11/06/2021 22:25:42 - INFO - __main__ - Step 8461: {'lr': 0.0004976532288995768, 'samples': 1624512, 'steps': 8460, 'loss/train': 1.727888822555542} 11/06/2021 22:25:42 - INFO - __main__ - Step 8462: {'lr': 0.0004976525034281069, 'samples': 1624704, 'steps': 8461, 'loss/train': 1.7131630182266235} 11/06/2021 22:25:43 - INFO - __main__ - Step 8463: {'lr': 0.0004976517778450486, 'samples': 1624896, 'steps': 8462, 'loss/train': 1.9994878768920898} 11/06/2021 22:25:43 - INFO - __main__ - Step 8464: {'lr': 0.000497651052150402, 'samples': 1625088, 'steps': 8463, 'loss/train': 1.3184614181518555} 11/06/2021 22:25:44 - INFO - __main__ - Step 8465: {'lr': 0.0004976503263441679, 'samples': 1625280, 'steps': 8464, 'loss/train': 1.885838270187378} 11/06/2021 22:25:44 - INFO - __main__ - Step 8466: {'lr': 0.0004976496004263463, 'samples': 1625472, 'steps': 8465, 'loss/train': 1.80532968044281} 11/06/2021 22:25:45 - INFO - __main__ - Step 8467: {'lr': 0.0004976488743969376, 'samples': 1625664, 'steps': 8466, 'loss/train': 1.5060994625091553} 11/06/2021 22:25:45 - INFO - __main__ - Step 8468: {'lr': 0.0004976481482559421, 'samples': 1625856, 'steps': 8467, 'loss/train': 1.5242974758148193} 11/06/2021 22:25:46 - INFO - __main__ - Step 8469: {'lr': 0.0004976474220033602, 'samples': 1626048, 'steps': 8468, 'loss/train': 1.838634729385376} 11/06/2021 22:25:46 - INFO - __main__ - Step 8470: {'lr': 0.0004976466956391922, 'samples': 1626240, 'steps': 8469, 'loss/train': 1.2237740755081177} 11/06/2021 22:25:47 - INFO - __main__ - Step 8471: {'lr': 0.0004976459691634384, 'samples': 1626432, 'steps': 8470, 'loss/train': 1.4460675716400146} 11/06/2021 22:25:47 - INFO - __main__ - Step 8472: {'lr': 0.0004976452425760992, 'samples': 1626624, 'steps': 8471, 'loss/train': 2.1150126457214355} 11/06/2021 22:25:48 - INFO - __main__ - Step 8473: {'lr': 0.0004976445158771748, 'samples': 1626816, 'steps': 8472, 'loss/train': 1.7219116687774658} 11/06/2021 22:25:48 - INFO - __main__ - Step 8474: {'lr': 0.0004976437890666657, 'samples': 1627008, 'steps': 8473, 'loss/train': 1.7514417171478271} 11/06/2021 22:25:48 - INFO - __main__ - Step 8475: {'lr': 0.0004976430621445721, 'samples': 1627200, 'steps': 8474, 'loss/train': 1.6506552696228027} 11/06/2021 22:25:50 - INFO - __main__ - Step 8476: {'lr': 0.0004976423351108943, 'samples': 1627392, 'steps': 8475, 'loss/train': 1.3242745399475098} 11/06/2021 22:25:50 - INFO - __main__ - Step 8477: {'lr': 0.0004976416079656328, 'samples': 1627584, 'steps': 8476, 'loss/train': 2.3073718547821045} 11/06/2021 22:25:50 - INFO - __main__ - Step 8478: {'lr': 0.0004976408807087876, 'samples': 1627776, 'steps': 8477, 'loss/train': 1.0934592485427856} 11/06/2021 22:25:51 - INFO - __main__ - Step 8479: {'lr': 0.0004976401533403594, 'samples': 1627968, 'steps': 8478, 'loss/train': 1.5833334922790527} 11/06/2021 22:25:51 - INFO - __main__ - Step 8480: {'lr': 0.0004976394258603484, 'samples': 1628160, 'steps': 8479, 'loss/train': 1.3912901878356934} 11/06/2021 22:25:52 - INFO - __main__ - Step 8481: {'lr': 0.0004976386982687549, 'samples': 1628352, 'steps': 8480, 'loss/train': 1.8894435167312622} 11/06/2021 22:25:52 - INFO - __main__ - Step 8482: {'lr': 0.0004976379705655791, 'samples': 1628544, 'steps': 8481, 'loss/train': 1.7393689155578613} 11/06/2021 22:25:53 - INFO - __main__ - Step 8483: {'lr': 0.0004976372427508215, 'samples': 1628736, 'steps': 8482, 'loss/train': 1.9663249254226685} 11/06/2021 22:25:53 - INFO - __main__ - Step 8484: {'lr': 0.0004976365148244824, 'samples': 1628928, 'steps': 8483, 'loss/train': 2.2855916023254395} 11/06/2021 22:25:53 - INFO - __main__ - Step 8485: {'lr': 0.0004976357867865621, 'samples': 1629120, 'steps': 8484, 'loss/train': 1.8762072324752808} 11/06/2021 22:25:54 - INFO - __main__ - Step 8486: {'lr': 0.0004976350586370609, 'samples': 1629312, 'steps': 8485, 'loss/train': 1.8550057411193848} 11/06/2021 22:25:55 - INFO - __main__ - Step 8487: {'lr': 0.0004976343303759792, 'samples': 1629504, 'steps': 8486, 'loss/train': 1.618645191192627} 11/06/2021 22:25:55 - INFO - __main__ - Step 8488: {'lr': 0.0004976336020033174, 'samples': 1629696, 'steps': 8487, 'loss/train': 1.672871470451355} 11/06/2021 22:25:55 - INFO - __main__ - Step 8489: {'lr': 0.0004976328735190755, 'samples': 1629888, 'steps': 8488, 'loss/train': 1.8050670623779297} 11/06/2021 22:25:56 - INFO - __main__ - Step 8490: {'lr': 0.0004976321449232542, 'samples': 1630080, 'steps': 8489, 'loss/train': 1.4703295230865479} 11/06/2021 22:25:56 - INFO - __main__ - Step 8491: {'lr': 0.0004976314162158536, 'samples': 1630272, 'steps': 8490, 'loss/train': 1.5319691896438599} 11/06/2021 22:25:57 - INFO - __main__ - Step 8492: {'lr': 0.0004976306873968741, 'samples': 1630464, 'steps': 8491, 'loss/train': 1.876915454864502} 11/06/2021 22:25:57 - INFO - __main__ - Step 8493: {'lr': 0.0004976299584663161, 'samples': 1630656, 'steps': 8492, 'loss/train': 1.787011981010437} 11/06/2021 22:25:58 - INFO - __main__ - Step 8494: {'lr': 0.0004976292294241798, 'samples': 1630848, 'steps': 8493, 'loss/train': 1.3954660892486572} 11/06/2021 22:25:58 - INFO - __main__ - Step 8495: {'lr': 0.0004976285002704656, 'samples': 1631040, 'steps': 8494, 'loss/train': 2.078723669052124} 11/06/2021 22:25:59 - INFO - __main__ - Step 8496: {'lr': 0.0004976277710051739, 'samples': 1631232, 'steps': 8495, 'loss/train': 2.3327043056488037} 11/06/2021 22:26:00 - INFO - __main__ - Step 8497: {'lr': 0.0004976270416283049, 'samples': 1631424, 'steps': 8496, 'loss/train': 1.9234745502471924} 11/06/2021 22:26:00 - INFO - __main__ - Step 8498: {'lr': 0.000497626312139859, 'samples': 1631616, 'steps': 8497, 'loss/train': 1.7845796346664429} 11/06/2021 22:26:00 - INFO - __main__ - Step 8499: {'lr': 0.0004976255825398365, 'samples': 1631808, 'steps': 8498, 'loss/train': 2.1495463848114014} 11/06/2021 22:26:01 - INFO - __main__ - Step 8500: {'lr': 0.0004976248528282376, 'samples': 1632000, 'steps': 8499, 'loss/train': 2.1509108543395996} 11/06/2021 22:26:01 - INFO - __main__ - Step 8501: {'lr': 0.000497624123005063, 'samples': 1632192, 'steps': 8500, 'loss/train': 1.78944993019104} 11/06/2021 22:26:02 - INFO - __main__ - Step 8502: {'lr': 0.0004976233930703126, 'samples': 1632384, 'steps': 8501, 'loss/train': 1.7581290006637573} 11/06/2021 22:26:02 - INFO - __main__ - Step 8503: {'lr': 0.000497622663023987, 'samples': 1632576, 'steps': 8502, 'loss/train': 1.6255358457565308} 11/06/2021 22:26:03 - INFO - __main__ - Step 8504: {'lr': 0.0004976219328660864, 'samples': 1632768, 'steps': 8503, 'loss/train': 1.977895975112915} 11/06/2021 22:26:03 - INFO - __main__ - Step 8505: {'lr': 0.0004976212025966112, 'samples': 1632960, 'steps': 8504, 'loss/train': 3.297560453414917} 11/06/2021 22:26:03 - INFO - __main__ - Step 8506: {'lr': 0.0004976204722155617, 'samples': 1633152, 'steps': 8505, 'loss/train': 1.7833354473114014} 11/06/2021 22:26:05 - INFO - __main__ - Step 8507: {'lr': 0.0004976197417229383, 'samples': 1633344, 'steps': 8506, 'loss/train': 1.4364817142486572} 11/06/2021 22:26:05 - INFO - __main__ - Step 8508: {'lr': 0.0004976190111187412, 'samples': 1633536, 'steps': 8507, 'loss/train': 2.1614036560058594} 11/06/2021 22:26:05 - INFO - __main__ - Step 8509: {'lr': 0.0004976182804029708, 'samples': 1633728, 'steps': 8508, 'loss/train': 1.8485922813415527} 11/06/2021 22:26:06 - INFO - __main__ - Step 8510: {'lr': 0.0004976175495756274, 'samples': 1633920, 'steps': 8509, 'loss/train': 1.9994100332260132} 11/06/2021 22:26:06 - INFO - __main__ - Step 8511: {'lr': 0.0004976168186367115, 'samples': 1634112, 'steps': 8510, 'loss/train': 1.4582551717758179} 11/06/2021 22:26:07 - INFO - __main__ - Step 8512: {'lr': 0.0004976160875862231, 'samples': 1634304, 'steps': 8511, 'loss/train': 1.6600501537322998} 11/06/2021 22:26:07 - INFO - __main__ - Step 8513: {'lr': 0.0004976153564241628, 'samples': 1634496, 'steps': 8512, 'loss/train': 1.6046333312988281} 11/06/2021 22:26:08 - INFO - __main__ - Step 8514: {'lr': 0.0004976146251505309, 'samples': 1634688, 'steps': 8513, 'loss/train': 2.3201334476470947} 11/06/2021 22:26:08 - INFO - __main__ - Step 8515: {'lr': 0.0004976138937653275, 'samples': 1634880, 'steps': 8514, 'loss/train': 1.6256712675094604} 11/06/2021 22:26:08 - INFO - __main__ - Step 8516: {'lr': 0.0004976131622685532, 'samples': 1635072, 'steps': 8515, 'loss/train': 1.8171868324279785} 11/06/2021 22:26:09 - INFO - __main__ - Step 8517: {'lr': 0.0004976124306602083, 'samples': 1635264, 'steps': 8516, 'loss/train': 1.7854278087615967} 11/06/2021 22:26:10 - INFO - __main__ - Step 8518: {'lr': 0.0004976116989402929, 'samples': 1635456, 'steps': 8517, 'loss/train': 1.7013543844223022} 11/06/2021 22:26:10 - INFO - __main__ - Step 8519: {'lr': 0.0004976109671088076, 'samples': 1635648, 'steps': 8518, 'loss/train': 1.9890172481536865} 11/06/2021 22:26:10 - INFO - __main__ - Step 8520: {'lr': 0.0004976102351657526, 'samples': 1635840, 'steps': 8519, 'loss/train': 1.8006373643875122} 11/06/2021 22:26:11 - INFO - __main__ - Step 8521: {'lr': 0.0004976095031111283, 'samples': 1636032, 'steps': 8520, 'loss/train': 1.9364084005355835} 11/06/2021 22:26:11 - INFO - __main__ - Step 8522: {'lr': 0.0004976087709449348, 'samples': 1636224, 'steps': 8521, 'loss/train': 1.6207133531570435} 11/06/2021 22:26:12 - INFO - __main__ - Step 8523: {'lr': 0.0004976080386671728, 'samples': 1636416, 'steps': 8522, 'loss/train': 2.250715732574463} 11/06/2021 22:26:13 - INFO - __main__ - Step 8524: {'lr': 0.0004976073062778423, 'samples': 1636608, 'steps': 8523, 'loss/train': 1.8443859815597534} 11/06/2021 22:26:13 - INFO - __main__ - Step 8525: {'lr': 0.0004976065737769439, 'samples': 1636800, 'steps': 8524, 'loss/train': 0.9607848525047302} 11/06/2021 22:26:13 - INFO - __main__ - Step 8526: {'lr': 0.0004976058411644777, 'samples': 1636992, 'steps': 8525, 'loss/train': 1.7340941429138184} 11/06/2021 22:26:14 - INFO - __main__ - Step 8527: {'lr': 0.0004976051084404443, 'samples': 1637184, 'steps': 8526, 'loss/train': 2.0823657512664795} 11/06/2021 22:26:15 - INFO - __main__ - Step 8528: {'lr': 0.0004976043756048436, 'samples': 1637376, 'steps': 8527, 'loss/train': 1.8343414068222046} 11/06/2021 22:26:15 - INFO - __main__ - Step 8529: {'lr': 0.0004976036426576763, 'samples': 1637568, 'steps': 8528, 'loss/train': 2.1914784908294678} 11/06/2021 22:26:15 - INFO - __main__ - Step 8530: {'lr': 0.0004976029095989427, 'samples': 1637760, 'steps': 8529, 'loss/train': 3.2090442180633545} 11/06/2021 22:26:16 - INFO - __main__ - Step 8531: {'lr': 0.000497602176428643, 'samples': 1637952, 'steps': 8530, 'loss/train': 2.355168342590332} 11/06/2021 22:26:16 - INFO - __main__ - Step 8532: {'lr': 0.0004976014431467775, 'samples': 1638144, 'steps': 8531, 'loss/train': 1.1861897706985474} 11/06/2021 22:26:17 - INFO - __main__ - Step 8533: {'lr': 0.0004976007097533467, 'samples': 1638336, 'steps': 8532, 'loss/train': 2.105987071990967} 11/06/2021 22:26:17 - INFO - __main__ - Step 8534: {'lr': 0.0004975999762483509, 'samples': 1638528, 'steps': 8533, 'loss/train': 1.797425389289856} 11/06/2021 22:26:18 - INFO - __main__ - Step 8535: {'lr': 0.0004975992426317902, 'samples': 1638720, 'steps': 8534, 'loss/train': 1.2677090167999268} 11/06/2021 22:26:18 - INFO - __main__ - Step 8536: {'lr': 0.0004975985089036652, 'samples': 1638912, 'steps': 8535, 'loss/train': 1.7440499067306519} 11/06/2021 22:26:18 - INFO - __main__ - Step 8537: {'lr': 0.0004975977750639761, 'samples': 1639104, 'steps': 8536, 'loss/train': 3.189857244491577} 11/06/2021 22:26:19 - INFO - __main__ - Step 8538: {'lr': 0.0004975970411127233, 'samples': 1639296, 'steps': 8537, 'loss/train': 1.8964868783950806} 11/06/2021 22:26:20 - INFO - __main__ - Step 8539: {'lr': 0.0004975963070499071, 'samples': 1639488, 'steps': 8538, 'loss/train': 2.1125876903533936} 11/06/2021 22:26:20 - INFO - __main__ - Step 8540: {'lr': 0.0004975955728755277, 'samples': 1639680, 'steps': 8539, 'loss/train': 1.6929922103881836} 11/06/2021 22:26:21 - INFO - __main__ - Step 8541: {'lr': 0.0004975948385895858, 'samples': 1639872, 'steps': 8540, 'loss/train': 1.6374714374542236} 11/06/2021 22:26:21 - INFO - __main__ - Step 8542: {'lr': 0.0004975941041920813, 'samples': 1640064, 'steps': 8541, 'loss/train': 2.006967067718506} 11/06/2021 22:26:22 - INFO - __main__ - Step 8543: {'lr': 0.0004975933696830147, 'samples': 1640256, 'steps': 8542, 'loss/train': 1.9213240146636963} 11/06/2021 22:26:22 - INFO - __main__ - Step 8544: {'lr': 0.0004975926350623864, 'samples': 1640448, 'steps': 8543, 'loss/train': 2.296802043914795} 11/06/2021 22:26:23 - INFO - __main__ - Step 8545: {'lr': 0.0004975919003301967, 'samples': 1640640, 'steps': 8544, 'loss/train': 1.7071173191070557} 11/06/2021 22:26:23 - INFO - __main__ - Step 8546: {'lr': 0.0004975911654864459, 'samples': 1640832, 'steps': 8545, 'loss/train': 2.5078957080841064} 11/06/2021 22:26:23 - INFO - __main__ - Step 8547: {'lr': 0.0004975904305311344, 'samples': 1641024, 'steps': 8546, 'loss/train': 2.413429021835327} 11/06/2021 22:26:25 - INFO - __main__ - Step 8548: {'lr': 0.0004975896954642623, 'samples': 1641216, 'steps': 8547, 'loss/train': 1.545986294746399} 11/06/2021 22:26:25 - INFO - __main__ - Step 8549: {'lr': 0.0004975889602858303, 'samples': 1641408, 'steps': 8548, 'loss/train': 1.858424425125122} 11/06/2021 22:26:26 - INFO - __main__ - Step 8550: {'lr': 0.0004975882249958385, 'samples': 1641600, 'steps': 8549, 'loss/train': 2.576998472213745} 11/06/2021 22:26:26 - INFO - __main__ - Step 8551: {'lr': 0.0004975874895942872, 'samples': 1641792, 'steps': 8550, 'loss/train': 1.2816332578659058} 11/06/2021 22:26:26 - INFO - __main__ - Step 8552: {'lr': 0.0004975867540811768, 'samples': 1641984, 'steps': 8551, 'loss/train': 1.6916189193725586} 11/06/2021 22:26:27 - INFO - __main__ - Step 8553: {'lr': 0.0004975860184565076, 'samples': 1642176, 'steps': 8552, 'loss/train': 1.8677070140838623} 11/06/2021 22:26:27 - INFO - __main__ - Step 8554: {'lr': 0.0004975852827202801, 'samples': 1642368, 'steps': 8553, 'loss/train': 1.7044568061828613} 11/06/2021 22:26:28 - INFO - __main__ - Step 8555: {'lr': 0.0004975845468724944, 'samples': 1642560, 'steps': 8554, 'loss/train': 1.236737847328186} 11/06/2021 22:26:29 - INFO - __main__ - Step 8556: {'lr': 0.0004975838109131509, 'samples': 1642752, 'steps': 8555, 'loss/train': 1.6801140308380127} 11/06/2021 22:26:29 - INFO - __main__ - Step 8557: {'lr': 0.0004975830748422499, 'samples': 1642944, 'steps': 8556, 'loss/train': 1.9419505596160889} 11/06/2021 22:26:29 - INFO - __main__ - Step 8558: {'lr': 0.0004975823386597918, 'samples': 1643136, 'steps': 8557, 'loss/train': 1.7988413572311401} 11/06/2021 22:26:30 - INFO - __main__ - Step 8559: {'lr': 0.000497581602365777, 'samples': 1643328, 'steps': 8558, 'loss/train': 2.347743511199951} 11/06/2021 22:26:30 - INFO - __main__ - Step 8560: {'lr': 0.0004975808659602058, 'samples': 1643520, 'steps': 8559, 'loss/train': 1.103050947189331} 11/06/2021 22:26:31 - INFO - __main__ - Step 8561: {'lr': 0.0004975801294430784, 'samples': 1643712, 'steps': 8560, 'loss/train': 2.005465030670166} 11/06/2021 22:26:31 - INFO - __main__ - Step 8562: {'lr': 0.0004975793928143952, 'samples': 1643904, 'steps': 8561, 'loss/train': 1.6958545446395874} 11/06/2021 22:26:32 - INFO - __main__ - Step 8563: {'lr': 0.0004975786560741566, 'samples': 1644096, 'steps': 8562, 'loss/train': 1.6469171047210693} 11/06/2021 22:26:32 - INFO - __main__ - Step 8564: {'lr': 0.0004975779192223629, 'samples': 1644288, 'steps': 8563, 'loss/train': 1.7488797903060913} 11/06/2021 22:26:32 - INFO - __main__ - Step 8565: {'lr': 0.0004975771822590143, 'samples': 1644480, 'steps': 8564, 'loss/train': 1.531661868095398} 11/06/2021 22:26:33 - INFO - __main__ - Step 8566: {'lr': 0.0004975764451841114, 'samples': 1644672, 'steps': 8565, 'loss/train': 1.9583240747451782} 11/06/2021 22:26:34 - INFO - __main__ - Step 8567: {'lr': 0.0004975757079976542, 'samples': 1644864, 'steps': 8566, 'loss/train': 2.6760129928588867} 11/06/2021 22:26:34 - INFO - __main__ - Step 8568: {'lr': 0.0004975749706996433, 'samples': 1645056, 'steps': 8567, 'loss/train': 1.9086993932724} 11/06/2021 22:26:34 - INFO - __main__ - Step 8569: {'lr': 0.0004975742332900789, 'samples': 1645248, 'steps': 8568, 'loss/train': 1.3222005367279053} 11/06/2021 22:26:35 - INFO - __main__ - Step 8570: {'lr': 0.0004975734957689614, 'samples': 1645440, 'steps': 8569, 'loss/train': 2.3594229221343994} 11/06/2021 22:26:36 - INFO - __main__ - Step 8571: {'lr': 0.0004975727581362911, 'samples': 1645632, 'steps': 8570, 'loss/train': 1.3136167526245117} 11/06/2021 22:26:36 - INFO - __main__ - Step 8572: {'lr': 0.0004975720203920683, 'samples': 1645824, 'steps': 8571, 'loss/train': 2.3485770225524902} 11/06/2021 22:26:36 - INFO - __main__ - Step 8573: {'lr': 0.0004975712825362934, 'samples': 1646016, 'steps': 8572, 'loss/train': 2.9186878204345703} 11/06/2021 22:26:37 - INFO - __main__ - Step 8574: {'lr': 0.0004975705445689668, 'samples': 1646208, 'steps': 8573, 'loss/train': 1.8523027896881104} 11/06/2021 22:26:37 - INFO - __main__ - Step 8575: {'lr': 0.0004975698064900886, 'samples': 1646400, 'steps': 8574, 'loss/train': 2.420170783996582} 11/06/2021 22:26:38 - INFO - __main__ - Step 8576: {'lr': 0.0004975690682996592, 'samples': 1646592, 'steps': 8575, 'loss/train': 1.9467846155166626} 11/06/2021 22:26:38 - INFO - __main__ - Step 8577: {'lr': 0.0004975683299976791, 'samples': 1646784, 'steps': 8576, 'loss/train': 1.946389079093933} 11/06/2021 22:26:39 - INFO - __main__ - Step 8578: {'lr': 0.0004975675915841485, 'samples': 1646976, 'steps': 8577, 'loss/train': 2.061795711517334} 11/06/2021 22:26:39 - INFO - __main__ - Step 8579: {'lr': 0.0004975668530590679, 'samples': 1647168, 'steps': 8578, 'loss/train': 2.067392110824585} 11/06/2021 22:26:40 - INFO - __main__ - Step 8580: {'lr': 0.0004975661144224374, 'samples': 1647360, 'steps': 8579, 'loss/train': 2.0112924575805664} 11/06/2021 22:26:41 - INFO - __main__ - Step 8581: {'lr': 0.0004975653756742574, 'samples': 1647552, 'steps': 8580, 'loss/train': 1.770460605621338} 11/06/2021 22:26:42 - INFO - __main__ - Step 8582: {'lr': 0.0004975646368145282, 'samples': 1647744, 'steps': 8581, 'loss/train': 2.0862138271331787} 11/06/2021 22:26:42 - INFO - __main__ - Step 8583: {'lr': 0.0004975638978432503, 'samples': 1647936, 'steps': 8582, 'loss/train': 1.6407994031906128} 11/06/2021 22:26:42 - INFO - __main__ - Step 8584: {'lr': 0.0004975631587604239, 'samples': 1648128, 'steps': 8583, 'loss/train': 1.8305928707122803} 11/06/2021 22:26:43 - INFO - __main__ - Step 8585: {'lr': 0.0004975624195660494, 'samples': 1648320, 'steps': 8584, 'loss/train': 1.8244094848632812} 11/06/2021 22:26:43 - INFO - __main__ - Step 8586: {'lr': 0.0004975616802601271, 'samples': 1648512, 'steps': 8585, 'loss/train': 1.8164951801300049} 11/06/2021 22:26:44 - INFO - __main__ - Step 8587: {'lr': 0.0004975609408426572, 'samples': 1648704, 'steps': 8586, 'loss/train': 1.84238600730896} 11/06/2021 22:26:45 - INFO - __main__ - Step 8588: {'lr': 0.0004975602013136403, 'samples': 1648896, 'steps': 8587, 'loss/train': 1.9760645627975464} 11/06/2021 22:26:45 - INFO - __main__ - Step 8589: {'lr': 0.0004975594616730766, 'samples': 1649088, 'steps': 8588, 'loss/train': 1.6991413831710815} 11/06/2021 22:26:45 - INFO - __main__ - Step 8590: {'lr': 0.0004975587219209663, 'samples': 1649280, 'steps': 8589, 'loss/train': 1.3211325407028198} 11/06/2021 22:26:46 - INFO - __main__ - Step 8591: {'lr': 0.0004975579820573099, 'samples': 1649472, 'steps': 8590, 'loss/train': 1.5335558652877808} 11/06/2021 22:26:46 - INFO - __main__ - Step 8592: {'lr': 0.0004975572420821078, 'samples': 1649664, 'steps': 8591, 'loss/train': 1.8905811309814453} 11/06/2021 22:26:47 - INFO - __main__ - Step 8593: {'lr': 0.0004975565019953601, 'samples': 1649856, 'steps': 8592, 'loss/train': 1.5297328233718872} 11/06/2021 22:26:47 - INFO - __main__ - Step 8594: {'lr': 0.0004975557617970673, 'samples': 1650048, 'steps': 8593, 'loss/train': 1.714114785194397} 11/06/2021 22:26:48 - INFO - __main__ - Step 8595: {'lr': 0.0004975550214872296, 'samples': 1650240, 'steps': 8594, 'loss/train': 1.8319971561431885} 11/06/2021 22:26:48 - INFO - __main__ - Step 8596: {'lr': 0.0004975542810658476, 'samples': 1650432, 'steps': 8595, 'loss/train': 1.850310206413269} 11/06/2021 22:26:48 - INFO - __main__ - Step 8597: {'lr': 0.0004975535405329213, 'samples': 1650624, 'steps': 8596, 'loss/train': 1.9651538133621216} 11/06/2021 22:26:49 - INFO - __main__ - Step 8598: {'lr': 0.0004975527998884513, 'samples': 1650816, 'steps': 8597, 'loss/train': 1.1798516511917114} 11/06/2021 22:26:50 - INFO - __main__ - Step 8599: {'lr': 0.0004975520591324378, 'samples': 1651008, 'steps': 8598, 'loss/train': 1.6260879039764404} 11/06/2021 22:26:51 - INFO - __main__ - Step 8600: {'lr': 0.0004975513182648812, 'samples': 1651200, 'steps': 8599, 'loss/train': 1.9237269163131714} 11/06/2021 22:26:51 - INFO - __main__ - Step 8601: {'lr': 0.0004975505772857818, 'samples': 1651392, 'steps': 8600, 'loss/train': 1.373760461807251} 11/06/2021 22:26:51 - INFO - __main__ - Step 8602: {'lr': 0.0004975498361951398, 'samples': 1651584, 'steps': 8601, 'loss/train': 1.5592304468154907} 11/06/2021 22:26:52 - INFO - __main__ - Step 8603: {'lr': 0.0004975490949929558, 'samples': 1651776, 'steps': 8602, 'loss/train': 1.782822608947754} 11/06/2021 22:26:53 - INFO - __main__ - Step 8604: {'lr': 0.00049754835367923, 'samples': 1651968, 'steps': 8603, 'loss/train': 1.2513318061828613} 11/06/2021 22:26:53 - INFO - __main__ - Step 8605: {'lr': 0.0004975476122539627, 'samples': 1652160, 'steps': 8604, 'loss/train': 1.4960685968399048} 11/06/2021 22:26:53 - INFO - __main__ - Step 8606: {'lr': 0.0004975468707171542, 'samples': 1652352, 'steps': 8605, 'loss/train': 1.4390000104904175} 11/06/2021 22:26:54 - INFO - __main__ - Step 8607: {'lr': 0.000497546129068805, 'samples': 1652544, 'steps': 8606, 'loss/train': 1.4670497179031372} 11/06/2021 22:26:54 - INFO - __main__ - Step 8608: {'lr': 0.0004975453873089153, 'samples': 1652736, 'steps': 8607, 'loss/train': 2.2388570308685303} 11/06/2021 22:26:55 - INFO - __main__ - Step 8609: {'lr': 0.0004975446454374854, 'samples': 1652928, 'steps': 8608, 'loss/train': 1.8201464414596558} 11/06/2021 22:26:55 - INFO - __main__ - Step 8610: {'lr': 0.0004975439034545158, 'samples': 1653120, 'steps': 8609, 'loss/train': 1.0466943979263306} 11/06/2021 22:26:56 - INFO - __main__ - Step 8611: {'lr': 0.0004975431613600067, 'samples': 1653312, 'steps': 8610, 'loss/train': 1.8127079010009766} 11/06/2021 22:26:56 - INFO - __main__ - Step 8612: {'lr': 0.0004975424191539585, 'samples': 1653504, 'steps': 8611, 'loss/train': 1.8894764184951782} 11/06/2021 22:26:57 - INFO - __main__ - Step 8613: {'lr': 0.0004975416768363715, 'samples': 1653696, 'steps': 8612, 'loss/train': 1.5986841917037964} 11/06/2021 22:26:57 - INFO - __main__ - Step 8614: {'lr': 0.0004975409344072459, 'samples': 1653888, 'steps': 8613, 'loss/train': 1.9542250633239746} 11/06/2021 22:26:58 - INFO - __main__ - Step 8615: {'lr': 0.0004975401918665823, 'samples': 1654080, 'steps': 8614, 'loss/train': 1.8345215320587158} 11/06/2021 22:26:58 - INFO - __main__ - Step 8616: {'lr': 0.0004975394492143808, 'samples': 1654272, 'steps': 8615, 'loss/train': 2.525263547897339} 11/06/2021 22:26:59 - INFO - __main__ - Step 8617: {'lr': 0.0004975387064506421, 'samples': 1654464, 'steps': 8616, 'loss/train': 1.956400752067566} 11/06/2021 22:26:59 - INFO - __main__ - Step 8618: {'lr': 0.000497537963575366, 'samples': 1654656, 'steps': 8617, 'loss/train': 1.734102725982666} 11/06/2021 22:26:59 - INFO - __main__ - Step 8619: {'lr': 0.0004975372205885533, 'samples': 1654848, 'steps': 8618, 'loss/train': 1.9009313583374023} 11/06/2021 22:27:00 - INFO - __main__ - Step 8620: {'lr': 0.0004975364774902041, 'samples': 1655040, 'steps': 8619, 'loss/train': 2.392646551132202} 11/06/2021 22:27:01 - INFO - __main__ - Step 8621: {'lr': 0.0004975357342803187, 'samples': 1655232, 'steps': 8620, 'loss/train': 1.483763575553894} 11/06/2021 22:27:01 - INFO - __main__ - Step 8622: {'lr': 0.0004975349909588976, 'samples': 1655424, 'steps': 8621, 'loss/train': 1.836098313331604} 11/06/2021 22:27:02 - INFO - __main__ - Step 8623: {'lr': 0.000497534247525941, 'samples': 1655616, 'steps': 8622, 'loss/train': 1.932421326637268} 11/06/2021 22:27:02 - INFO - __main__ - Step 8624: {'lr': 0.0004975335039814493, 'samples': 1655808, 'steps': 8623, 'loss/train': 1.0708638429641724} 11/06/2021 22:27:03 - INFO - __main__ - Step 8625: {'lr': 0.0004975327603254229, 'samples': 1656000, 'steps': 8624, 'loss/train': 1.1579554080963135} 11/06/2021 22:27:03 - INFO - __main__ - Step 8626: {'lr': 0.000497532016557862, 'samples': 1656192, 'steps': 8625, 'loss/train': 1.4248061180114746} 11/06/2021 22:27:04 - INFO - __main__ - Step 8627: {'lr': 0.0004975312726787671, 'samples': 1656384, 'steps': 8626, 'loss/train': 2.444399833679199} 11/06/2021 22:27:04 - INFO - __main__ - Step 8628: {'lr': 0.0004975305286881383, 'samples': 1656576, 'steps': 8627, 'loss/train': 1.7895426750183105} 11/06/2021 22:27:04 - INFO - __main__ - Step 8629: {'lr': 0.0004975297845859761, 'samples': 1656768, 'steps': 8628, 'loss/train': 1.8516370058059692} 11/06/2021 22:27:05 - INFO - __main__ - Step 8630: {'lr': 0.0004975290403722807, 'samples': 1656960, 'steps': 8629, 'loss/train': 1.1760151386260986} 11/06/2021 22:27:06 - INFO - __main__ - Step 8631: {'lr': 0.0004975282960470527, 'samples': 1657152, 'steps': 8630, 'loss/train': 0.9543409943580627} 11/06/2021 22:27:06 - INFO - __main__ - Step 8632: {'lr': 0.0004975275516102922, 'samples': 1657344, 'steps': 8631, 'loss/train': 1.7859746217727661} 11/06/2021 22:27:06 - INFO - __main__ - Step 8633: {'lr': 0.0004975268070619996, 'samples': 1657536, 'steps': 8632, 'loss/train': 1.8400938510894775} 11/06/2021 22:27:07 - INFO - __main__ - Step 8634: {'lr': 0.0004975260624021752, 'samples': 1657728, 'steps': 8633, 'loss/train': 1.3847429752349854} 11/06/2021 22:27:07 - INFO - __main__ - Step 8635: {'lr': 0.0004975253176308194, 'samples': 1657920, 'steps': 8634, 'loss/train': 1.8140875101089478} 11/06/2021 22:27:08 - INFO - __main__ - Step 8636: {'lr': 0.0004975245727479325, 'samples': 1658112, 'steps': 8635, 'loss/train': 1.8529094457626343} 11/06/2021 22:27:09 - INFO - __main__ - Step 8637: {'lr': 0.0004975238277535149, 'samples': 1658304, 'steps': 8636, 'loss/train': 1.9702752828598022} 11/06/2021 22:27:09 - INFO - __main__ - Step 8638: {'lr': 0.0004975230826475669, 'samples': 1658496, 'steps': 8637, 'loss/train': 1.9060972929000854} 11/06/2021 22:27:09 - INFO - __main__ - Step 8639: {'lr': 0.0004975223374300887, 'samples': 1658688, 'steps': 8638, 'loss/train': 2.0410006046295166} 11/06/2021 22:27:10 - INFO - __main__ - Step 8640: {'lr': 0.0004975215921010808, 'samples': 1658880, 'steps': 8639, 'loss/train': 1.9541411399841309} 11/06/2021 22:27:11 - INFO - __main__ - Step 8641: {'lr': 0.0004975208466605435, 'samples': 1659072, 'steps': 8640, 'loss/train': 1.7635051012039185} 11/06/2021 22:27:12 - INFO - __main__ - Step 8642: {'lr': 0.0004975201011084773, 'samples': 1659264, 'steps': 8641, 'loss/train': 1.650898814201355} 11/06/2021 22:27:12 - INFO - __main__ - Step 8643: {'lr': 0.0004975193554448821, 'samples': 1659456, 'steps': 8642, 'loss/train': 1.8950550556182861} 11/06/2021 22:27:12 - INFO - __main__ - Step 8644: {'lr': 0.0004975186096697585, 'samples': 1659648, 'steps': 8643, 'loss/train': 2.5472521781921387} 11/06/2021 22:27:13 - INFO - __main__ - Step 8645: {'lr': 0.000497517863783107, 'samples': 1659840, 'steps': 8644, 'loss/train': 1.438095211982727} 11/06/2021 22:27:13 - INFO - __main__ - Step 8646: {'lr': 0.0004975171177849277, 'samples': 1660032, 'steps': 8645, 'loss/train': 2.031336545944214} 11/06/2021 22:27:14 - INFO - __main__ - Step 8647: {'lr': 0.000497516371675221, 'samples': 1660224, 'steps': 8646, 'loss/train': 2.2316641807556152} 11/06/2021 22:27:15 - INFO - __main__ - Step 8648: {'lr': 0.0004975156254539873, 'samples': 1660416, 'steps': 8647, 'loss/train': 2.0939579010009766} 11/06/2021 22:27:15 - INFO - __main__ - Step 8649: {'lr': 0.0004975148791212269, 'samples': 1660608, 'steps': 8648, 'loss/train': 1.8442882299423218} 11/06/2021 22:27:15 - INFO - __main__ - Step 8650: {'lr': 0.00049751413267694, 'samples': 1660800, 'steps': 8649, 'loss/train': 2.065793752670288} 11/06/2021 22:27:16 - INFO - __main__ - Step 8651: {'lr': 0.000497513386121127, 'samples': 1660992, 'steps': 8650, 'loss/train': 1.510509729385376} 11/06/2021 22:27:17 - INFO - __main__ - Step 8652: {'lr': 0.0004975126394537884, 'samples': 1661184, 'steps': 8651, 'loss/train': 1.9904800653457642} 11/06/2021 22:27:17 - INFO - __main__ - Step 8653: {'lr': 0.0004975118926749245, 'samples': 1661376, 'steps': 8652, 'loss/train': 2.4816510677337646} 11/06/2021 22:27:17 - INFO - __main__ - Step 8654: {'lr': 0.0004975111457845354, 'samples': 1661568, 'steps': 8653, 'loss/train': 1.7848095893859863} 11/06/2021 22:27:18 - INFO - __main__ - Step 8655: {'lr': 0.0004975103987826217, 'samples': 1661760, 'steps': 8654, 'loss/train': 1.5138027667999268} 11/06/2021 22:27:18 - INFO - __main__ - Step 8656: {'lr': 0.0004975096516691836, 'samples': 1661952, 'steps': 8655, 'loss/train': 2.094496965408325} 11/06/2021 22:27:19 - INFO - __main__ - Step 8657: {'lr': 0.0004975089044442215, 'samples': 1662144, 'steps': 8656, 'loss/train': 2.499422073364258} 11/06/2021 22:27:19 - INFO - __main__ - Step 8658: {'lr': 0.0004975081571077357, 'samples': 1662336, 'steps': 8657, 'loss/train': 2.3265016078948975} 11/06/2021 22:27:20 - INFO - __main__ - Step 8659: {'lr': 0.0004975074096597265, 'samples': 1662528, 'steps': 8658, 'loss/train': 1.801966905593872} 11/06/2021 22:27:20 - INFO - __main__ - Step 8660: {'lr': 0.0004975066621001943, 'samples': 1662720, 'steps': 8659, 'loss/train': 1.1997767686843872} 11/06/2021 22:27:20 - INFO - __main__ - Step 8661: {'lr': 0.0004975059144291394, 'samples': 1662912, 'steps': 8660, 'loss/train': 1.4698797464370728} 11/06/2021 22:27:21 - INFO - __main__ - Step 8662: {'lr': 0.0004975051666465622, 'samples': 1663104, 'steps': 8661, 'loss/train': 2.032155990600586} 11/06/2021 22:27:22 - INFO - __main__ - Step 8663: {'lr': 0.0004975044187524629, 'samples': 1663296, 'steps': 8662, 'loss/train': 1.661537766456604} 11/06/2021 22:27:22 - INFO - __main__ - Step 8664: {'lr': 0.000497503670746842, 'samples': 1663488, 'steps': 8663, 'loss/train': 1.3858249187469482} 11/06/2021 22:27:23 - INFO - __main__ - Step 8665: {'lr': 0.0004975029226296998, 'samples': 1663680, 'steps': 8664, 'loss/train': 2.2269275188446045} 11/06/2021 22:27:23 - INFO - __main__ - Step 8666: {'lr': 0.0004975021744010365, 'samples': 1663872, 'steps': 8665, 'loss/train': 2.003019332885742} 11/06/2021 22:27:23 - INFO - __main__ - Step 8667: {'lr': 0.0004975014260608527, 'samples': 1664064, 'steps': 8666, 'loss/train': 1.5768145322799683} 11/06/2021 22:27:24 - INFO - __main__ - Step 8668: {'lr': 0.0004975006776091484, 'samples': 1664256, 'steps': 8667, 'loss/train': 1.9532525539398193} 11/06/2021 22:27:25 - INFO - __main__ - Step 8669: {'lr': 0.0004974999290459243, 'samples': 1664448, 'steps': 8668, 'loss/train': 1.587928295135498} 11/06/2021 22:27:25 - INFO - __main__ - Step 8670: {'lr': 0.0004974991803711803, 'samples': 1664640, 'steps': 8669, 'loss/train': 1.7508021593093872} 11/06/2021 22:27:25 - INFO - __main__ - Step 8671: {'lr': 0.0004974984315849172, 'samples': 1664832, 'steps': 8670, 'loss/train': 1.5539518594741821} 11/06/2021 22:27:26 - INFO - __main__ - Step 8672: {'lr': 0.000497497682687135, 'samples': 1665024, 'steps': 8671, 'loss/train': 1.5913190841674805} 11/06/2021 22:27:27 - INFO - __main__ - Step 8673: {'lr': 0.0004974969336778343, 'samples': 1665216, 'steps': 8672, 'loss/train': 1.3758063316345215} 11/06/2021 22:27:27 - INFO - __main__ - Step 8674: {'lr': 0.0004974961845570152, 'samples': 1665408, 'steps': 8673, 'loss/train': 1.925683856010437} 11/06/2021 22:27:27 - INFO - __main__ - Step 8675: {'lr': 0.0004974954353246781, 'samples': 1665600, 'steps': 8674, 'loss/train': 1.983332872390747} 11/06/2021 22:27:28 - INFO - __main__ - Step 8676: {'lr': 0.0004974946859808235, 'samples': 1665792, 'steps': 8675, 'loss/train': 1.5062406063079834} 11/06/2021 22:27:28 - INFO - __main__ - Step 8677: {'lr': 0.0004974939365254515, 'samples': 1665984, 'steps': 8676, 'loss/train': 1.4310129880905151} 11/06/2021 22:27:29 - INFO - __main__ - Step 8678: {'lr': 0.0004974931869585626, 'samples': 1666176, 'steps': 8677, 'loss/train': 1.1025075912475586} 11/06/2021 22:27:29 - INFO - __main__ - Step 8679: {'lr': 0.0004974924372801572, 'samples': 1666368, 'steps': 8678, 'loss/train': 1.5145713090896606} 11/06/2021 22:27:30 - INFO - __main__ - Step 8680: {'lr': 0.0004974916874902353, 'samples': 1666560, 'steps': 8679, 'loss/train': 1.819180965423584} 11/06/2021 22:27:30 - INFO - __main__ - Step 8681: {'lr': 0.0004974909375887976, 'samples': 1666752, 'steps': 8680, 'loss/train': 1.7356830835342407} 11/06/2021 22:27:31 - INFO - __main__ - Step 8682: {'lr': 0.0004974901875758444, 'samples': 1666944, 'steps': 8681, 'loss/train': 2.002690315246582} 11/06/2021 22:27:31 - INFO - __main__ - Step 8683: {'lr': 0.0004974894374513757, 'samples': 1667136, 'steps': 8682, 'loss/train': 2.2562174797058105} 11/06/2021 22:27:32 - INFO - __main__ - Step 8684: {'lr': 0.0004974886872153922, 'samples': 1667328, 'steps': 8683, 'loss/train': 1.799873948097229} 11/06/2021 22:27:32 - INFO - __main__ - Step 8685: {'lr': 0.0004974879368678942, 'samples': 1667520, 'steps': 8684, 'loss/train': 2.1796953678131104} 11/06/2021 22:27:32 - INFO - __main__ - Step 8686: {'lr': 0.0004974871864088818, 'samples': 1667712, 'steps': 8685, 'loss/train': 1.7284247875213623} 11/06/2021 22:27:33 - INFO - __main__ - Step 8687: {'lr': 0.0004974864358383555, 'samples': 1667904, 'steps': 8686, 'loss/train': 2.1161437034606934} 11/06/2021 22:27:34 - INFO - __main__ - Step 8688: {'lr': 0.0004974856851563158, 'samples': 1668096, 'steps': 8687, 'loss/train': 1.6163461208343506} 11/06/2021 22:27:34 - INFO - __main__ - Step 8689: {'lr': 0.0004974849343627628, 'samples': 1668288, 'steps': 8688, 'loss/train': 2.1865105628967285} 11/06/2021 22:27:35 - INFO - __main__ - Step 8690: {'lr': 0.0004974841834576968, 'samples': 1668480, 'steps': 8689, 'loss/train': 2.0201218128204346} 11/06/2021 22:27:35 - INFO - __main__ - Step 8691: {'lr': 0.0004974834324411183, 'samples': 1668672, 'steps': 8690, 'loss/train': 1.748487949371338} 11/06/2021 22:27:35 - INFO - __main__ - Step 8692: {'lr': 0.0004974826813130276, 'samples': 1668864, 'steps': 8691, 'loss/train': 1.8078097105026245} 11/06/2021 22:27:36 - INFO - __main__ - Step 8693: {'lr': 0.000497481930073425, 'samples': 1669056, 'steps': 8692, 'loss/train': 1.9291731119155884} 11/06/2021 22:27:37 - INFO - __main__ - Step 8694: {'lr': 0.000497481178722311, 'samples': 1669248, 'steps': 8693, 'loss/train': 1.6936627626419067} 11/06/2021 22:27:37 - INFO - __main__ - Step 8695: {'lr': 0.0004974804272596857, 'samples': 1669440, 'steps': 8694, 'loss/train': 1.6667145490646362} 11/06/2021 22:27:37 - INFO - __main__ - Step 8696: {'lr': 0.0004974796756855494, 'samples': 1669632, 'steps': 8695, 'loss/train': 1.6822011470794678} 11/06/2021 22:27:38 - INFO - __main__ - Step 8697: {'lr': 0.0004974789239999027, 'samples': 1669824, 'steps': 8696, 'loss/train': 1.7184191942214966} 11/06/2021 22:27:38 - INFO - __main__ - Step 8698: {'lr': 0.0004974781722027459, 'samples': 1670016, 'steps': 8697, 'loss/train': 1.8397278785705566} 11/06/2021 22:27:39 - INFO - __main__ - Step 8699: {'lr': 0.0004974774202940791, 'samples': 1670208, 'steps': 8698, 'loss/train': 1.9065308570861816} 11/06/2021 22:27:40 - INFO - __main__ - Step 8700: {'lr': 0.000497476668273903, 'samples': 1670400, 'steps': 8699, 'loss/train': 0.9457817673683167} 11/06/2021 22:27:40 - INFO - __main__ - Step 8701: {'lr': 0.0004974759161422175, 'samples': 1670592, 'steps': 8700, 'loss/train': 1.5651483535766602} 11/06/2021 22:27:40 - INFO - __main__ - Step 8702: {'lr': 0.0004974751638990233, 'samples': 1670784, 'steps': 8701, 'loss/train': 1.9571999311447144} 11/06/2021 22:27:41 - INFO - __main__ - Step 8703: {'lr': 0.0004974744115443206, 'samples': 1670976, 'steps': 8702, 'loss/train': 1.1419717073440552} 11/06/2021 22:27:42 - INFO - __main__ - Step 8704: {'lr': 0.0004974736590781097, 'samples': 1671168, 'steps': 8703, 'loss/train': 2.3399507999420166} 11/06/2021 22:27:42 - INFO - __main__ - Step 8705: {'lr': 0.000497472906500391, 'samples': 1671360, 'steps': 8704, 'loss/train': 1.9851784706115723} 11/06/2021 22:27:43 - INFO - __main__ - Step 8706: {'lr': 0.0004974721538111649, 'samples': 1671552, 'steps': 8705, 'loss/train': 1.7644202709197998} 11/06/2021 22:27:43 - INFO - __main__ - Step 8707: {'lr': 0.0004974714010104315, 'samples': 1671744, 'steps': 8706, 'loss/train': 2.234442949295044} 11/06/2021 22:27:43 - INFO - __main__ - Step 8708: {'lr': 0.0004974706480981914, 'samples': 1671936, 'steps': 8707, 'loss/train': 1.5944517850875854} 11/06/2021 22:27:44 - INFO - __main__ - Step 8709: {'lr': 0.0004974698950744449, 'samples': 1672128, 'steps': 8708, 'loss/train': 2.5472395420074463} 11/06/2021 22:27:45 - INFO - __main__ - Step 8710: {'lr': 0.0004974691419391922, 'samples': 1672320, 'steps': 8709, 'loss/train': 1.8900412321090698} 11/06/2021 22:27:45 - INFO - __main__ - Step 8711: {'lr': 0.0004974683886924339, 'samples': 1672512, 'steps': 8710, 'loss/train': 1.7456995248794556} 11/06/2021 22:27:45 - INFO - __main__ - Step 8712: {'lr': 0.00049746763533417, 'samples': 1672704, 'steps': 8711, 'loss/train': 2.188673257827759} 11/06/2021 22:27:46 - INFO - __main__ - Step 8713: {'lr': 0.000497466881864401, 'samples': 1672896, 'steps': 8712, 'loss/train': 1.39998459815979} 11/06/2021 22:27:46 - INFO - __main__ - Step 8714: {'lr': 0.0004974661282831272, 'samples': 1673088, 'steps': 8713, 'loss/train': 1.890992522239685} 11/06/2021 22:27:47 - INFO - __main__ - Step 8715: {'lr': 0.0004974653745903491, 'samples': 1673280, 'steps': 8714, 'loss/train': 1.2863065004348755} 11/06/2021 22:27:47 - INFO - __main__ - Step 8716: {'lr': 0.0004974646207860668, 'samples': 1673472, 'steps': 8715, 'loss/train': 2.333582878112793} 11/06/2021 22:27:48 - INFO - __main__ - Step 8717: {'lr': 0.0004974638668702809, 'samples': 1673664, 'steps': 8716, 'loss/train': 0.5336604714393616} 11/06/2021 22:27:48 - INFO - __main__ - Step 8718: {'lr': 0.0004974631128429915, 'samples': 1673856, 'steps': 8717, 'loss/train': 2.640268564224243} 11/06/2021 22:27:48 - INFO - __main__ - Step 8719: {'lr': 0.0004974623587041991, 'samples': 1674048, 'steps': 8718, 'loss/train': 2.1433913707733154} 11/06/2021 22:27:49 - INFO - __main__ - Step 8720: {'lr': 0.000497461604453904, 'samples': 1674240, 'steps': 8719, 'loss/train': 1.5393385887145996} 11/06/2021 22:27:50 - INFO - __main__ - Step 8721: {'lr': 0.0004974608500921064, 'samples': 1674432, 'steps': 8720, 'loss/train': 2.028142213821411} 11/06/2021 22:27:50 - INFO - __main__ - Step 8722: {'lr': 0.0004974600956188068, 'samples': 1674624, 'steps': 8721, 'loss/train': 1.8649543523788452} 11/06/2021 22:27:50 - INFO - __main__ - Step 8723: {'lr': 0.0004974593410340056, 'samples': 1674816, 'steps': 8722, 'loss/train': 1.037925124168396} 11/06/2021 22:27:51 - INFO - __main__ - Step 8724: {'lr': 0.000497458586337703, 'samples': 1675008, 'steps': 8723, 'loss/train': 1.7672851085662842} 11/06/2021 22:27:52 - INFO - __main__ - Step 8725: {'lr': 0.0004974578315298993, 'samples': 1675200, 'steps': 8724, 'loss/train': 1.966469407081604} 11/06/2021 22:27:52 - INFO - __main__ - Step 8726: {'lr': 0.000497457076610595, 'samples': 1675392, 'steps': 8725, 'loss/train': 1.7877483367919922} 11/06/2021 22:27:53 - INFO - __main__ - Step 8727: {'lr': 0.0004974563215797903, 'samples': 1675584, 'steps': 8726, 'loss/train': 1.3742201328277588} 11/06/2021 22:27:53 - INFO - __main__ - Step 8728: {'lr': 0.0004974555664374857, 'samples': 1675776, 'steps': 8727, 'loss/train': 2.265516757965088} 11/06/2021 22:27:53 - INFO - __main__ - Step 8729: {'lr': 0.0004974548111836812, 'samples': 1675968, 'steps': 8728, 'loss/train': 1.7990782260894775} 11/06/2021 22:27:54 - INFO - __main__ - Step 8730: {'lr': 0.0004974540558183776, 'samples': 1676160, 'steps': 8729, 'loss/train': 1.5700663328170776} 11/06/2021 22:27:55 - INFO - __main__ - Step 8731: {'lr': 0.0004974533003415751, 'samples': 1676352, 'steps': 8730, 'loss/train': 2.584472894668579} 11/06/2021 22:27:55 - INFO - __main__ - Step 8732: {'lr': 0.0004974525447532737, 'samples': 1676544, 'steps': 8731, 'loss/train': 0.7004466652870178} 11/06/2021 22:27:55 - INFO - __main__ - Step 8733: {'lr': 0.0004974517890534742, 'samples': 1676736, 'steps': 8732, 'loss/train': 2.029585361480713} 11/06/2021 22:27:56 - INFO - __main__ - Step 8734: {'lr': 0.0004974510332421767, 'samples': 1676928, 'steps': 8733, 'loss/train': 2.3354976177215576} 11/06/2021 22:27:57 - INFO - __main__ - Step 8735: {'lr': 0.0004974502773193815, 'samples': 1677120, 'steps': 8734, 'loss/train': 1.779792308807373} 11/06/2021 22:27:57 - INFO - __main__ - Step 8736: {'lr': 0.0004974495212850892, 'samples': 1677312, 'steps': 8735, 'loss/train': 1.7999184131622314} 11/06/2021 22:27:58 - INFO - __main__ - Step 8737: {'lr': 0.0004974487651392998, 'samples': 1677504, 'steps': 8736, 'loss/train': 2.0719504356384277} 11/06/2021 22:27:58 - INFO - __main__ - Step 8738: {'lr': 0.0004974480088820139, 'samples': 1677696, 'steps': 8737, 'loss/train': 1.93135666847229} 11/06/2021 22:27:59 - INFO - __main__ - Step 8739: {'lr': 0.0004974472525132316, 'samples': 1677888, 'steps': 8738, 'loss/train': 2.191235303878784} 11/06/2021 22:27:59 - INFO - __main__ - Step 8740: {'lr': 0.0004974464960329536, 'samples': 1678080, 'steps': 8739, 'loss/train': 1.711877465248108} 11/06/2021 22:28:00 - INFO - __main__ - Step 8741: {'lr': 0.0004974457394411798, 'samples': 1678272, 'steps': 8740, 'loss/train': 1.1441311836242676} 11/06/2021 22:28:00 - INFO - __main__ - Step 8742: {'lr': 0.0004974449827379109, 'samples': 1678464, 'steps': 8741, 'loss/train': 2.325981616973877} 11/06/2021 22:28:01 - INFO - __main__ - Step 8743: {'lr': 0.000497444225923147, 'samples': 1678656, 'steps': 8742, 'loss/train': 1.826312780380249} 11/06/2021 22:28:01 - INFO - __main__ - Step 8744: {'lr': 0.0004974434689968887, 'samples': 1678848, 'steps': 8743, 'loss/train': 2.2900896072387695} 11/06/2021 22:28:01 - INFO - __main__ - Step 8745: {'lr': 0.0004974427119591361, 'samples': 1679040, 'steps': 8744, 'loss/train': 2.352595090866089} 11/06/2021 22:28:02 - INFO - __main__ - Step 8746: {'lr': 0.0004974419548098897, 'samples': 1679232, 'steps': 8745, 'loss/train': 1.2469137907028198} 11/06/2021 22:28:03 - INFO - __main__ - Step 8747: {'lr': 0.0004974411975491498, 'samples': 1679424, 'steps': 8746, 'loss/train': 1.8148316144943237} 11/06/2021 22:28:03 - INFO - __main__ - Step 8748: {'lr': 0.0004974404401769167, 'samples': 1679616, 'steps': 8747, 'loss/train': 2.2442069053649902} 11/06/2021 22:28:03 - INFO - __main__ - Step 8749: {'lr': 0.0004974396826931906, 'samples': 1679808, 'steps': 8748, 'loss/train': 2.0725631713867188} 11/06/2021 22:28:04 - INFO - __main__ - Step 8750: {'lr': 0.0004974389250979722, 'samples': 1680000, 'steps': 8749, 'loss/train': 1.7763142585754395} 11/06/2021 22:28:05 - INFO - __main__ - Step 8751: {'lr': 0.0004974381673912614, 'samples': 1680192, 'steps': 8750, 'loss/train': 2.2424070835113525} 11/06/2021 22:28:05 - INFO - __main__ - Step 8752: {'lr': 0.000497437409573059, 'samples': 1680384, 'steps': 8751, 'loss/train': 1.8414136171340942} 11/06/2021 22:28:05 - INFO - __main__ - Step 8753: {'lr': 0.000497436651643365, 'samples': 1680576, 'steps': 8752, 'loss/train': 1.7458195686340332} 11/06/2021 22:28:06 - INFO - __main__ - Step 8754: {'lr': 0.00049743589360218, 'samples': 1680768, 'steps': 8753, 'loss/train': 2.1746933460235596} 11/06/2021 22:28:06 - INFO - __main__ - Step 8755: {'lr': 0.0004974351354495041, 'samples': 1680960, 'steps': 8754, 'loss/train': 1.9091649055480957} 11/06/2021 22:28:07 - INFO - __main__ - Step 8756: {'lr': 0.0004974343771853377, 'samples': 1681152, 'steps': 8755, 'loss/train': 1.7813023328781128} 11/06/2021 22:28:07 - INFO - __main__ - Step 8757: {'lr': 0.0004974336188096813, 'samples': 1681344, 'steps': 8756, 'loss/train': 1.627087116241455} 11/06/2021 22:28:08 - INFO - __main__ - Step 8758: {'lr': 0.0004974328603225351, 'samples': 1681536, 'steps': 8757, 'loss/train': 1.492101788520813} 11/06/2021 22:28:08 - INFO - __main__ - Step 8759: {'lr': 0.0004974321017238994, 'samples': 1681728, 'steps': 8758, 'loss/train': 1.5569329261779785} 11/06/2021 22:28:09 - INFO - __main__ - Step 8760: {'lr': 0.0004974313430137747, 'samples': 1681920, 'steps': 8759, 'loss/train': 1.9305700063705444} 11/06/2021 22:28:10 - INFO - __main__ - Step 8761: {'lr': 0.0004974305841921612, 'samples': 1682112, 'steps': 8760, 'loss/train': 1.7186388969421387} 11/06/2021 22:28:10 - INFO - __main__ - Step 8762: {'lr': 0.0004974298252590593, 'samples': 1682304, 'steps': 8761, 'loss/train': 1.9467127323150635} 11/06/2021 22:28:10 - INFO - __main__ - Step 8763: {'lr': 0.0004974290662144694, 'samples': 1682496, 'steps': 8762, 'loss/train': 1.675337791442871} 11/06/2021 22:28:11 - INFO - __main__ - Step 8764: {'lr': 0.0004974283070583917, 'samples': 1682688, 'steps': 8763, 'loss/train': 1.5082322359085083} 11/06/2021 22:28:11 - INFO - __main__ - Step 8765: {'lr': 0.0004974275477908266, 'samples': 1682880, 'steps': 8764, 'loss/train': 2.1924538612365723} 11/06/2021 22:28:12 - INFO - __main__ - Step 8766: {'lr': 0.0004974267884117746, 'samples': 1683072, 'steps': 8765, 'loss/train': 0.8508917093276978} 11/06/2021 22:28:12 - INFO - __main__ - Step 8767: {'lr': 0.0004974260289212358, 'samples': 1683264, 'steps': 8766, 'loss/train': 1.8229551315307617} 11/06/2021 22:28:13 - INFO - __main__ - Step 8768: {'lr': 0.0004974252693192106, 'samples': 1683456, 'steps': 8767, 'loss/train': 1.7984812259674072} 11/06/2021 22:28:13 - INFO - __main__ - Step 8769: {'lr': 0.0004974245096056995, 'samples': 1683648, 'steps': 8768, 'loss/train': 1.664389729499817} 11/06/2021 22:28:13 - INFO - __main__ - Step 8770: {'lr': 0.0004974237497807027, 'samples': 1683840, 'steps': 8769, 'loss/train': 0.3073934018611908} 11/06/2021 22:28:14 - INFO - __main__ - Step 8771: {'lr': 0.0004974229898442207, 'samples': 1684032, 'steps': 8770, 'loss/train': 1.7521414756774902} 11/06/2021 22:28:15 - INFO - __main__ - Step 8772: {'lr': 0.0004974222297962535, 'samples': 1684224, 'steps': 8771, 'loss/train': 1.6954983472824097} 11/06/2021 22:28:15 - INFO - __main__ - Step 8773: {'lr': 0.0004974214696368017, 'samples': 1684416, 'steps': 8772, 'loss/train': 1.7648489475250244} 11/06/2021 22:28:15 - INFO - __main__ - Step 8774: {'lr': 0.0004974207093658657, 'samples': 1684608, 'steps': 8773, 'loss/train': 1.5386720895767212} 11/06/2021 22:28:16 - INFO - __main__ - Step 8775: {'lr': 0.0004974199489834457, 'samples': 1684800, 'steps': 8774, 'loss/train': 1.757877230644226} 11/06/2021 22:28:17 - INFO - __main__ - Step 8776: {'lr': 0.0004974191884895421, 'samples': 1684992, 'steps': 8775, 'loss/train': 1.9185556173324585} 11/06/2021 22:28:17 - INFO - __main__ - Step 8777: {'lr': 0.0004974184278841552, 'samples': 1685184, 'steps': 8776, 'loss/train': 2.359382152557373} 11/06/2021 22:28:18 - INFO - __main__ - Step 8778: {'lr': 0.0004974176671672854, 'samples': 1685376, 'steps': 8777, 'loss/train': 1.3928838968276978} 11/06/2021 22:28:18 - INFO - __main__ - Step 8779: {'lr': 0.000497416906338933, 'samples': 1685568, 'steps': 8778, 'loss/train': 2.1888182163238525} 11/06/2021 22:28:18 - INFO - __main__ - Step 8780: {'lr': 0.0004974161453990985, 'samples': 1685760, 'steps': 8779, 'loss/train': 1.9797148704528809} 11/06/2021 22:28:19 - INFO - __main__ - Step 8781: {'lr': 0.0004974153843477819, 'samples': 1685952, 'steps': 8780, 'loss/train': 1.6259188652038574} 11/06/2021 22:28:20 - INFO - __main__ - Step 8782: {'lr': 0.0004974146231849838, 'samples': 1686144, 'steps': 8781, 'loss/train': 1.612627387046814} 11/06/2021 22:28:20 - INFO - __main__ - Step 8783: {'lr': 0.0004974138619107046, 'samples': 1686336, 'steps': 8782, 'loss/train': 1.9349783658981323} 11/06/2021 22:28:20 - INFO - __main__ - Step 8784: {'lr': 0.0004974131005249444, 'samples': 1686528, 'steps': 8783, 'loss/train': 1.3217498064041138} 11/06/2021 22:28:21 - INFO - __main__ - Step 8785: {'lr': 0.0004974123390277037, 'samples': 1686720, 'steps': 8784, 'loss/train': 0.9705907702445984} 11/06/2021 22:28:21 - INFO - __main__ - Step 8786: {'lr': 0.0004974115774189829, 'samples': 1686912, 'steps': 8785, 'loss/train': 1.2650182247161865} 11/06/2021 22:28:22 - INFO - __main__ - Step 8787: {'lr': 0.0004974108156987822, 'samples': 1687104, 'steps': 8786, 'loss/train': 1.8180323839187622} 11/06/2021 22:28:23 - INFO - __main__ - Step 8788: {'lr': 0.000497410053867102, 'samples': 1687296, 'steps': 8787, 'loss/train': 2.028273105621338} 11/06/2021 22:28:23 - INFO - __main__ - Step 8789: {'lr': 0.0004974092919239427, 'samples': 1687488, 'steps': 8788, 'loss/train': 1.8566056489944458} 11/06/2021 22:28:23 - INFO - __main__ - Step 8790: {'lr': 0.0004974085298693045, 'samples': 1687680, 'steps': 8789, 'loss/train': 1.7841997146606445} 11/06/2021 22:28:24 - INFO - __main__ - Step 8791: {'lr': 0.0004974077677031879, 'samples': 1687872, 'steps': 8790, 'loss/train': 1.7749842405319214} 11/06/2021 22:28:25 - INFO - __main__ - Step 8792: {'lr': 0.0004974070054255932, 'samples': 1688064, 'steps': 8791, 'loss/train': 1.3394575119018555} 11/06/2021 22:28:25 - INFO - __main__ - Step 8793: {'lr': 0.0004974062430365206, 'samples': 1688256, 'steps': 8792, 'loss/train': 2.017791509628296} 11/06/2021 22:28:25 - INFO - __main__ - Step 8794: {'lr': 0.0004974054805359706, 'samples': 1688448, 'steps': 8793, 'loss/train': 1.5951640605926514} 11/06/2021 22:28:26 - INFO - __main__ - Step 8795: {'lr': 0.0004974047179239436, 'samples': 1688640, 'steps': 8794, 'loss/train': 1.9513581991195679} 11/06/2021 22:28:26 - INFO - __main__ - Step 8796: {'lr': 0.0004974039552004398, 'samples': 1688832, 'steps': 8795, 'loss/train': 1.227823257446289} 11/06/2021 22:28:27 - INFO - __main__ - Step 8797: {'lr': 0.0004974031923654596, 'samples': 1689024, 'steps': 8796, 'loss/train': 1.9323481321334839} 11/06/2021 22:28:27 - INFO - __main__ - Step 8798: {'lr': 0.0004974024294190034, 'samples': 1689216, 'steps': 8797, 'loss/train': 1.6938538551330566} 11/06/2021 22:28:28 - INFO - __main__ - Step 8799: {'lr': 0.0004974016663610713, 'samples': 1689408, 'steps': 8798, 'loss/train': 1.885172724723816} 11/06/2021 22:28:28 - INFO - __main__ - Step 8800: {'lr': 0.000497400903191664, 'samples': 1689600, 'steps': 8799, 'loss/train': 1.5480501651763916} 11/06/2021 22:28:28 - INFO - __main__ - Step 8801: {'lr': 0.0004974001399107816, 'samples': 1689792, 'steps': 8800, 'loss/train': 2.519402265548706} 11/06/2021 22:28:29 - INFO - __main__ - Step 8802: {'lr': 0.0004973993765184246, 'samples': 1689984, 'steps': 8801, 'loss/train': 1.6957279443740845} 11/06/2021 22:28:30 - INFO - __main__ - Step 8803: {'lr': 0.0004973986130145931, 'samples': 1690176, 'steps': 8802, 'loss/train': 1.698414921760559} 11/06/2021 22:28:30 - INFO - __main__ - Step 8804: {'lr': 0.0004973978493992877, 'samples': 1690368, 'steps': 8803, 'loss/train': 1.4111169576644897} 11/06/2021 22:28:30 - INFO - __main__ - Step 8805: {'lr': 0.0004973970856725086, 'samples': 1690560, 'steps': 8804, 'loss/train': 1.862740159034729} 11/06/2021 22:28:31 - INFO - __main__ - Step 8806: {'lr': 0.0004973963218342563, 'samples': 1690752, 'steps': 8805, 'loss/train': 1.8943990468978882} 11/06/2021 22:28:32 - INFO - __main__ - Step 8807: {'lr': 0.000497395557884531, 'samples': 1690944, 'steps': 8806, 'loss/train': 2.2840816974639893} 11/06/2021 22:28:32 - INFO - __main__ - Step 8808: {'lr': 0.000497394793823333, 'samples': 1691136, 'steps': 8807, 'loss/train': 1.346774697303772} 11/06/2021 22:28:33 - INFO - __main__ - Step 8809: {'lr': 0.0004973940296506627, 'samples': 1691328, 'steps': 8808, 'loss/train': 1.4218136072158813} 11/06/2021 22:28:33 - INFO - __main__ - Step 8810: {'lr': 0.0004973932653665206, 'samples': 1691520, 'steps': 8809, 'loss/train': 0.507257878780365} 11/06/2021 22:28:33 - INFO - __main__ - Step 8811: {'lr': 0.0004973925009709068, 'samples': 1691712, 'steps': 8810, 'loss/train': 1.9113545417785645} 11/06/2021 22:28:34 - INFO - __main__ - Step 8812: {'lr': 0.0004973917364638218, 'samples': 1691904, 'steps': 8811, 'loss/train': 1.7712737321853638} 11/06/2021 22:28:34 - INFO - __main__ - Step 8813: {'lr': 0.0004973909718452659, 'samples': 1692096, 'steps': 8812, 'loss/train': 2.098034381866455} 11/06/2021 22:28:35 - INFO - __main__ - Step 8814: {'lr': 0.0004973902071152396, 'samples': 1692288, 'steps': 8813, 'loss/train': 1.7307425737380981} 11/06/2021 22:28:35 - INFO - __main__ - Step 8815: {'lr': 0.0004973894422737428, 'samples': 1692480, 'steps': 8814, 'loss/train': 1.682647466659546} 11/06/2021 22:28:36 - INFO - __main__ - Step 8816: {'lr': 0.0004973886773207763, 'samples': 1692672, 'steps': 8815, 'loss/train': 1.6713685989379883} 11/06/2021 22:28:37 - INFO - __main__ - Step 8817: {'lr': 0.0004973879122563403, 'samples': 1692864, 'steps': 8816, 'loss/train': 2.685642957687378} 11/06/2021 22:28:37 - INFO - __main__ - Step 8818: {'lr': 0.000497387147080435, 'samples': 1693056, 'steps': 8817, 'loss/train': 1.6026307344436646} 11/06/2021 22:28:37 - INFO - __main__ - Step 8819: {'lr': 0.000497386381793061, 'samples': 1693248, 'steps': 8818, 'loss/train': 1.8597676753997803} 11/06/2021 22:28:38 - INFO - __main__ - Step 8820: {'lr': 0.0004973856163942185, 'samples': 1693440, 'steps': 8819, 'loss/train': 1.8286590576171875} 11/06/2021 22:28:38 - INFO - __main__ - Step 8821: {'lr': 0.0004973848508839077, 'samples': 1693632, 'steps': 8820, 'loss/train': 1.868646264076233} 11/06/2021 22:28:38 - INFO - __main__ - Step 8822: {'lr': 0.0004973840852621293, 'samples': 1693824, 'steps': 8821, 'loss/train': 1.4488940238952637} 11/06/2021 22:28:39 - INFO - __main__ - Step 8823: {'lr': 0.0004973833195288834, 'samples': 1694016, 'steps': 8822, 'loss/train': 1.7625603675842285} 11/06/2021 22:28:40 - INFO - __main__ - Step 8824: {'lr': 0.0004973825536841703, 'samples': 1694208, 'steps': 8823, 'loss/train': 1.8687225580215454} 11/06/2021 22:28:40 - INFO - __main__ - Step 8825: {'lr': 0.0004973817877279906, 'samples': 1694400, 'steps': 8824, 'loss/train': 1.4771027565002441} 11/06/2021 22:28:41 - INFO - __main__ - Step 8826: {'lr': 0.0004973810216603443, 'samples': 1694592, 'steps': 8825, 'loss/train': 0.3064444959163666} 11/06/2021 22:28:41 - INFO - __main__ - Step 8827: {'lr': 0.000497380255481232, 'samples': 1694784, 'steps': 8826, 'loss/train': 1.9688664674758911} 11/06/2021 22:28:42 - INFO - __main__ - Step 8828: {'lr': 0.000497379489190654, 'samples': 1694976, 'steps': 8827, 'loss/train': 1.279405117034912} 11/06/2021 22:28:42 - INFO - __main__ - Step 8829: {'lr': 0.0004973787227886106, 'samples': 1695168, 'steps': 8828, 'loss/train': 1.0162687301635742} 11/06/2021 22:28:43 - INFO - __main__ - Step 8830: {'lr': 0.0004973779562751022, 'samples': 1695360, 'steps': 8829, 'loss/train': 1.7933107614517212} 11/06/2021 22:28:43 - INFO - __main__ - Step 8831: {'lr': 0.0004973771896501292, 'samples': 1695552, 'steps': 8830, 'loss/train': 2.0314056873321533} 11/06/2021 22:28:43 - INFO - __main__ - Step 8832: {'lr': 0.0004973764229136917, 'samples': 1695744, 'steps': 8831, 'loss/train': 1.8036915063858032} 11/06/2021 22:28:44 - INFO - __main__ - Step 8833: {'lr': 0.0004973756560657901, 'samples': 1695936, 'steps': 8832, 'loss/train': 1.417937994003296} 11/06/2021 22:28:45 - INFO - __main__ - Step 8834: {'lr': 0.0004973748891064251, 'samples': 1696128, 'steps': 8833, 'loss/train': 1.0796078443527222} 11/06/2021 22:28:45 - INFO - __main__ - Step 8835: {'lr': 0.0004973741220355967, 'samples': 1696320, 'steps': 8834, 'loss/train': 1.756775975227356} 11/06/2021 22:28:45 - INFO - __main__ - Step 8836: {'lr': 0.0004973733548533052, 'samples': 1696512, 'steps': 8835, 'loss/train': 1.7436381578445435} 11/06/2021 22:28:46 - INFO - __main__ - Step 8837: {'lr': 0.0004973725875595513, 'samples': 1696704, 'steps': 8836, 'loss/train': 2.336061716079712} 11/06/2021 22:28:47 - INFO - __main__ - Step 8838: {'lr': 0.000497371820154335, 'samples': 1696896, 'steps': 8837, 'loss/train': 1.9740123748779297} 11/06/2021 22:28:47 - INFO - __main__ - Step 8839: {'lr': 0.0004973710526376569, 'samples': 1697088, 'steps': 8838, 'loss/train': 0.7141327261924744} 11/06/2021 22:28:47 - INFO - __main__ - Step 8840: {'lr': 0.000497370285009517, 'samples': 1697280, 'steps': 8839, 'loss/train': 1.4266327619552612} 11/06/2021 22:28:48 - INFO - __main__ - Step 8841: {'lr': 0.000497369517269916, 'samples': 1697472, 'steps': 8840, 'loss/train': 2.1014244556427} 11/06/2021 22:28:48 - INFO - __main__ - Step 8842: {'lr': 0.0004973687494188541, 'samples': 1697664, 'steps': 8841, 'loss/train': 1.9083147048950195} 11/06/2021 22:28:49 - INFO - __main__ - Step 8843: {'lr': 0.0004973679814563318, 'samples': 1697856, 'steps': 8842, 'loss/train': 1.9293524026870728} 11/06/2021 22:28:50 - INFO - __main__ - Step 8844: {'lr': 0.0004973672133823491, 'samples': 1698048, 'steps': 8843, 'loss/train': 1.9651559591293335} 11/06/2021 22:28:50 - INFO - __main__ - Step 8845: {'lr': 0.0004973664451969066, 'samples': 1698240, 'steps': 8844, 'loss/train': 2.337369203567505} 11/06/2021 22:28:50 - INFO - __main__ - Step 8846: {'lr': 0.0004973656769000046, 'samples': 1698432, 'steps': 8845, 'loss/train': 1.0025054216384888} 11/06/2021 22:28:51 - INFO - __main__ - Step 8847: {'lr': 0.0004973649084916435, 'samples': 1698624, 'steps': 8846, 'loss/train': 2.6142988204956055} 11/06/2021 22:28:52 - INFO - __main__ - Step 8848: {'lr': 0.0004973641399718236, 'samples': 1698816, 'steps': 8847, 'loss/train': 2.0085182189941406} 11/06/2021 22:28:52 - INFO - __main__ - Step 8849: {'lr': 0.0004973633713405451, 'samples': 1699008, 'steps': 8848, 'loss/train': 1.4306745529174805} 11/06/2021 22:28:53 - INFO - __main__ - Step 8850: {'lr': 0.0004973626025978086, 'samples': 1699200, 'steps': 8849, 'loss/train': 1.551592469215393} 11/06/2021 22:28:53 - INFO - __main__ - Step 8851: {'lr': 0.0004973618337436143, 'samples': 1699392, 'steps': 8850, 'loss/train': 1.7350374460220337} 11/06/2021 22:28:53 - INFO - __main__ - Step 8852: {'lr': 0.0004973610647779626, 'samples': 1699584, 'steps': 8851, 'loss/train': 1.860516905784607} 11/06/2021 22:28:54 - INFO - __main__ - Step 8853: {'lr': 0.0004973602957008537, 'samples': 1699776, 'steps': 8852, 'loss/train': 1.765555739402771} 11/06/2021 22:28:55 - INFO - __main__ - Step 8854: {'lr': 0.0004973595265122883, 'samples': 1699968, 'steps': 8853, 'loss/train': 2.4081969261169434} 11/06/2021 22:28:55 - INFO - __main__ - Step 8855: {'lr': 0.0004973587572122663, 'samples': 1700160, 'steps': 8854, 'loss/train': 1.8850605487823486} 11/06/2021 22:28:55 - INFO - __main__ - Step 8856: {'lr': 0.0004973579878007884, 'samples': 1700352, 'steps': 8855, 'loss/train': 1.608045220375061} 11/06/2021 22:28:56 - INFO - __main__ - Step 8857: {'lr': 0.0004973572182778546, 'samples': 1700544, 'steps': 8856, 'loss/train': 1.670106291770935} 11/06/2021 22:28:56 - INFO - __main__ - Step 8858: {'lr': 0.0004973564486434656, 'samples': 1700736, 'steps': 8857, 'loss/train': 1.6520261764526367} 11/06/2021 22:28:57 - INFO - __main__ - Step 8859: {'lr': 0.0004973556788976217, 'samples': 1700928, 'steps': 8858, 'loss/train': 1.5881348848342896} 11/06/2021 22:28:57 - INFO - __main__ - Step 8860: {'lr': 0.000497354909040323, 'samples': 1701120, 'steps': 8859, 'loss/train': 1.7174054384231567} 11/06/2021 22:28:58 - INFO - __main__ - Step 8861: {'lr': 0.00049735413907157, 'samples': 1701312, 'steps': 8860, 'loss/train': 1.8992524147033691} 11/06/2021 22:28:58 - INFO - __main__ - Step 8862: {'lr': 0.0004973533689913631, 'samples': 1701504, 'steps': 8861, 'loss/train': 1.570049524307251} 11/06/2021 22:28:58 - INFO - __main__ - Step 8863: {'lr': 0.0004973525987997026, 'samples': 1701696, 'steps': 8862, 'loss/train': 2.445530891418457} 11/06/2021 22:28:59 - INFO - __main__ - Step 8864: {'lr': 0.0004973518284965888, 'samples': 1701888, 'steps': 8863, 'loss/train': 1.9217685461044312} 11/06/2021 22:29:00 - INFO - __main__ - Step 8865: {'lr': 0.0004973510580820221, 'samples': 1702080, 'steps': 8864, 'loss/train': 1.8503520488739014} 11/06/2021 22:29:00 - INFO - __main__ - Step 8866: {'lr': 0.0004973502875560028, 'samples': 1702272, 'steps': 8865, 'loss/train': 1.635284185409546} 11/06/2021 22:29:01 - INFO - __main__ - Step 8867: {'lr': 0.0004973495169185313, 'samples': 1702464, 'steps': 8866, 'loss/train': 1.2922062873840332} 11/06/2021 22:29:01 - INFO - __main__ - Step 8868: {'lr': 0.0004973487461696079, 'samples': 1702656, 'steps': 8867, 'loss/train': 1.878747820854187} 11/06/2021 22:29:02 - INFO - __main__ - Step 8869: {'lr': 0.000497347975309233, 'samples': 1702848, 'steps': 8868, 'loss/train': 1.7777513265609741} 11/06/2021 22:29:02 - INFO - __main__ - Step 8870: {'lr': 0.0004973472043374069, 'samples': 1703040, 'steps': 8869, 'loss/train': 1.8679721355438232} 11/06/2021 22:29:03 - INFO - __main__ - Step 8871: {'lr': 0.00049734643325413, 'samples': 1703232, 'steps': 8870, 'loss/train': 1.7162529230117798} 11/06/2021 22:29:03 - INFO - __main__ - Step 8872: {'lr': 0.0004973456620594026, 'samples': 1703424, 'steps': 8871, 'loss/train': 1.8824801445007324} 11/06/2021 22:29:03 - INFO - __main__ - Step 8873: {'lr': 0.0004973448907532251, 'samples': 1703616, 'steps': 8872, 'loss/train': 1.3267327547073364} 11/06/2021 22:29:04 - INFO - __main__ - Step 8874: {'lr': 0.0004973441193355978, 'samples': 1703808, 'steps': 8873, 'loss/train': 2.032459259033203} 11/06/2021 22:29:05 - INFO - __main__ - Step 8875: {'lr': 0.0004973433478065209, 'samples': 1704000, 'steps': 8874, 'loss/train': 1.6638219356536865} 11/06/2021 22:29:05 - INFO - __main__ - Step 8876: {'lr': 0.0004973425761659951, 'samples': 1704192, 'steps': 8875, 'loss/train': 1.3877533674240112} 11/06/2021 22:29:05 - INFO - __main__ - Step 8877: {'lr': 0.0004973418044140204, 'samples': 1704384, 'steps': 8876, 'loss/train': 1.6968107223510742} 11/06/2021 22:29:06 - INFO - __main__ - Step 8878: {'lr': 0.0004973410325505974, 'samples': 1704576, 'steps': 8877, 'loss/train': 0.47580868005752563} 11/06/2021 22:29:07 - INFO - __main__ - Step 8879: {'lr': 0.0004973402605757263, 'samples': 1704768, 'steps': 8878, 'loss/train': 1.8513538837432861} 11/06/2021 22:29:07 - INFO - __main__ - Step 8880: {'lr': 0.0004973394884894075, 'samples': 1704960, 'steps': 8879, 'loss/train': 1.8675156831741333} 11/06/2021 22:29:08 - INFO - __main__ - Step 8881: {'lr': 0.0004973387162916415, 'samples': 1705152, 'steps': 8880, 'loss/train': 1.1261378526687622} 11/06/2021 22:29:08 - INFO - __main__ - Step 8882: {'lr': 0.0004973379439824283, 'samples': 1705344, 'steps': 8881, 'loss/train': 2.2020576000213623} 11/06/2021 22:29:09 - INFO - __main__ - Step 8883: {'lr': 0.0004973371715617685, 'samples': 1705536, 'steps': 8882, 'loss/train': 1.241743803024292} 11/06/2021 22:29:09 - INFO - __main__ - Step 8884: {'lr': 0.0004973363990296624, 'samples': 1705728, 'steps': 8883, 'loss/train': 1.7667030096054077} 11/06/2021 22:29:10 - INFO - __main__ - Step 8885: {'lr': 0.0004973356263861103, 'samples': 1705920, 'steps': 8884, 'loss/train': 1.5185878276824951} 11/06/2021 22:29:10 - INFO - __main__ - Step 8886: {'lr': 0.0004973348536311126, 'samples': 1706112, 'steps': 8885, 'loss/train': 2.001230001449585} 11/06/2021 22:29:11 - INFO - __main__ - Step 8887: {'lr': 0.0004973340807646696, 'samples': 1706304, 'steps': 8886, 'loss/train': 2.3790831565856934} 11/06/2021 22:29:11 - INFO - __main__ - Step 8888: {'lr': 0.0004973333077867817, 'samples': 1706496, 'steps': 8887, 'loss/train': 1.9295786619186401} 11/06/2021 22:29:11 - INFO - __main__ - Step 8889: {'lr': 0.0004973325346974493, 'samples': 1706688, 'steps': 8888, 'loss/train': 1.372741937637329} 11/06/2021 22:29:13 - INFO - __main__ - Step 8890: {'lr': 0.0004973317614966726, 'samples': 1706880, 'steps': 8889, 'loss/train': 2.034008502960205} 11/06/2021 22:29:13 - INFO - __main__ - Step 8891: {'lr': 0.000497330988184452, 'samples': 1707072, 'steps': 8890, 'loss/train': 1.9604440927505493} 11/06/2021 22:29:13 - INFO - __main__ - Step 8892: {'lr': 0.000497330214760788, 'samples': 1707264, 'steps': 8891, 'loss/train': 0.9298360347747803} 11/06/2021 22:29:14 - INFO - __main__ - Step 8893: {'lr': 0.0004973294412256807, 'samples': 1707456, 'steps': 8892, 'loss/train': 1.6712414026260376} 11/06/2021 22:29:14 - INFO - __main__ - Step 8894: {'lr': 0.0004973286675791305, 'samples': 1707648, 'steps': 8893, 'loss/train': 1.9249519109725952} 11/06/2021 22:29:15 - INFO - __main__ - Step 8895: {'lr': 0.000497327893821138, 'samples': 1707840, 'steps': 8894, 'loss/train': 1.5401326417922974} 11/06/2021 22:29:15 - INFO - __main__ - Step 8896: {'lr': 0.0004973271199517033, 'samples': 1708032, 'steps': 8895, 'loss/train': 1.5258654356002808} 11/06/2021 22:29:16 - INFO - __main__ - Step 8897: {'lr': 0.0004973263459708268, 'samples': 1708224, 'steps': 8896, 'loss/train': 1.3488224744796753} 11/06/2021 22:29:16 - INFO - __main__ - Step 8898: {'lr': 0.0004973255718785088, 'samples': 1708416, 'steps': 8897, 'loss/train': 1.7817871570587158} 11/06/2021 22:29:16 - INFO - __main__ - Step 8899: {'lr': 0.0004973247976747499, 'samples': 1708608, 'steps': 8898, 'loss/train': 1.7609913349151611} 11/06/2021 22:29:17 - INFO - __main__ - Step 8900: {'lr': 0.00049732402335955, 'samples': 1708800, 'steps': 8899, 'loss/train': 1.659672737121582} 11/06/2021 22:29:18 - INFO - __main__ - Step 8901: {'lr': 0.0004973232489329099, 'samples': 1708992, 'steps': 8900, 'loss/train': 1.438830852508545} 11/06/2021 22:29:18 - INFO - __main__ - Step 8902: {'lr': 0.0004973224743948298, 'samples': 1709184, 'steps': 8901, 'loss/train': 2.05006742477417} 11/06/2021 22:29:18 - INFO - __main__ - Step 8903: {'lr': 0.00049732169974531, 'samples': 1709376, 'steps': 8902, 'loss/train': 1.8295646905899048} 11/06/2021 22:29:19 - INFO - __main__ - Step 8904: {'lr': 0.0004973209249843507, 'samples': 1709568, 'steps': 8903, 'loss/train': 1.4633625745773315} 11/06/2021 22:29:19 - INFO - __main__ - Step 8905: {'lr': 0.0004973201501119525, 'samples': 1709760, 'steps': 8904, 'loss/train': 1.7192811965942383} 11/06/2021 22:29:20 - INFO - __main__ - Step 8906: {'lr': 0.0004973193751281156, 'samples': 1709952, 'steps': 8905, 'loss/train': 1.834681510925293} 11/06/2021 22:29:20 - INFO - __main__ - Step 8907: {'lr': 0.0004973186000328405, 'samples': 1710144, 'steps': 8906, 'loss/train': 1.5164721012115479} 11/06/2021 22:29:21 - INFO - __main__ - Step 8908: {'lr': 0.0004973178248261274, 'samples': 1710336, 'steps': 8907, 'loss/train': 2.1482808589935303} 11/06/2021 22:29:21 - INFO - __main__ - Step 8909: {'lr': 0.0004973170495079768, 'samples': 1710528, 'steps': 8908, 'loss/train': 2.774986505508423} 11/06/2021 22:29:22 - INFO - __main__ - Step 8910: {'lr': 0.0004973162740783888, 'samples': 1710720, 'steps': 8909, 'loss/train': 1.804287075996399} 11/06/2021 22:29:23 - INFO - __main__ - Step 8911: {'lr': 0.000497315498537364, 'samples': 1710912, 'steps': 8910, 'loss/train': 1.4533205032348633} 11/06/2021 22:29:23 - INFO - __main__ - Step 8912: {'lr': 0.0004973147228849027, 'samples': 1711104, 'steps': 8911, 'loss/train': 1.924820065498352} 11/06/2021 22:29:23 - INFO - __main__ - Step 8913: {'lr': 0.0004973139471210051, 'samples': 1711296, 'steps': 8912, 'loss/train': 1.9860786199569702} 11/06/2021 22:29:24 - INFO - __main__ - Step 8914: {'lr': 0.0004973131712456717, 'samples': 1711488, 'steps': 8913, 'loss/train': 1.800611972808838} 11/06/2021 22:29:24 - INFO - __main__ - Step 8915: {'lr': 0.0004973123952589027, 'samples': 1711680, 'steps': 8914, 'loss/train': 2.1335561275482178} 11/06/2021 22:29:25 - INFO - __main__ - Step 8916: {'lr': 0.0004973116191606987, 'samples': 1711872, 'steps': 8915, 'loss/train': 2.246720552444458} 11/06/2021 22:29:25 - INFO - __main__ - Step 8917: {'lr': 0.0004973108429510598, 'samples': 1712064, 'steps': 8916, 'loss/train': 1.5170209407806396} 11/06/2021 22:29:26 - INFO - __main__ - Step 8918: {'lr': 0.0004973100666299864, 'samples': 1712256, 'steps': 8917, 'loss/train': 1.8479684591293335} 11/06/2021 22:29:26 - INFO - __main__ - Step 8919: {'lr': 0.000497309290197479, 'samples': 1712448, 'steps': 8918, 'loss/train': 1.6200121641159058} 11/06/2021 22:29:27 - INFO - __main__ - Step 8920: {'lr': 0.0004973085136535379, 'samples': 1712640, 'steps': 8919, 'loss/train': 1.8492422103881836} 11/06/2021 22:29:27 - INFO - __main__ - Step 8921: {'lr': 0.0004973077369981633, 'samples': 1712832, 'steps': 8920, 'loss/train': 1.80856192111969} 11/06/2021 22:29:28 - INFO - __main__ - Step 8922: {'lr': 0.0004973069602313557, 'samples': 1713024, 'steps': 8921, 'loss/train': 2.0224783420562744} 11/06/2021 22:29:28 - INFO - __main__ - Step 8923: {'lr': 0.0004973061833531154, 'samples': 1713216, 'steps': 8922, 'loss/train': 1.5250040292739868} 11/06/2021 22:29:28 - INFO - __main__ - Step 8924: {'lr': 0.0004973054063634428, 'samples': 1713408, 'steps': 8923, 'loss/train': 1.9554498195648193} 11/06/2021 22:29:29 - INFO - __main__ - Step 8925: {'lr': 0.0004973046292623382, 'samples': 1713600, 'steps': 8924, 'loss/train': 1.6448603868484497} 11/06/2021 22:29:30 - INFO - __main__ - Step 8926: {'lr': 0.0004973038520498017, 'samples': 1713792, 'steps': 8925, 'loss/train': 1.983763575553894} 11/06/2021 22:29:30 - INFO - __main__ - Step 8927: {'lr': 0.0004973030747258342, 'samples': 1713984, 'steps': 8926, 'loss/train': 1.5370417833328247} 11/06/2021 22:29:31 - INFO - __main__ - Step 8928: {'lr': 0.0004973022972904356, 'samples': 1714176, 'steps': 8927, 'loss/train': 2.2001898288726807} 11/06/2021 22:29:31 - INFO - __main__ - Step 8929: {'lr': 0.0004973015197436063, 'samples': 1714368, 'steps': 8928, 'loss/train': 3.589097023010254} 11/06/2021 22:29:31 - INFO - __main__ - Step 8930: {'lr': 0.0004973007420853471, 'samples': 1714560, 'steps': 8929, 'loss/train': 1.7380764484405518} 11/06/2021 22:29:32 - INFO - __main__ - Step 8931: {'lr': 0.0004972999643156577, 'samples': 1714752, 'steps': 8930, 'loss/train': 1.9435979127883911} 11/06/2021 22:29:33 - INFO - __main__ - Step 8932: {'lr': 0.0004972991864345389, 'samples': 1714944, 'steps': 8931, 'loss/train': 1.745599627494812} 11/06/2021 22:29:33 - INFO - __main__ - Step 8933: {'lr': 0.0004972984084419908, 'samples': 1715136, 'steps': 8932, 'loss/train': 1.9273251295089722} 11/06/2021 22:29:34 - INFO - __main__ - Step 8934: {'lr': 0.0004972976303380139, 'samples': 1715328, 'steps': 8933, 'loss/train': 1.3530958890914917} 11/06/2021 22:29:34 - INFO - __main__ - Step 8935: {'lr': 0.0004972968521226085, 'samples': 1715520, 'steps': 8934, 'loss/train': 2.121579885482788} 11/06/2021 22:29:34 - INFO - __main__ - Step 8936: {'lr': 0.0004972960737957749, 'samples': 1715712, 'steps': 8935, 'loss/train': 1.9268077611923218} 11/06/2021 22:29:35 - INFO - __main__ - Step 8937: {'lr': 0.0004972952953575136, 'samples': 1715904, 'steps': 8936, 'loss/train': 1.8524378538131714} 11/06/2021 22:29:36 - INFO - __main__ - Step 8938: {'lr': 0.0004972945168078248, 'samples': 1716096, 'steps': 8937, 'loss/train': 0.22625112533569336} 11/06/2021 22:29:36 - INFO - __main__ - Step 8939: {'lr': 0.000497293738146709, 'samples': 1716288, 'steps': 8938, 'loss/train': 1.5457402467727661} 11/06/2021 22:29:36 - INFO - __main__ - Step 8940: {'lr': 0.0004972929593741662, 'samples': 1716480, 'steps': 8939, 'loss/train': 1.9634605646133423} 11/06/2021 22:29:37 - INFO - __main__ - Step 8941: {'lr': 0.0004972921804901973, 'samples': 1716672, 'steps': 8940, 'loss/train': 1.6684246063232422} 11/06/2021 22:29:38 - INFO - __main__ - Step 8942: {'lr': 0.0004972914014948023, 'samples': 1716864, 'steps': 8941, 'loss/train': 2.112210512161255} 11/06/2021 22:29:38 - INFO - __main__ - Step 8943: {'lr': 0.0004972906223879815, 'samples': 1717056, 'steps': 8942, 'loss/train': 2.020785331726074} 11/06/2021 22:29:39 - INFO - __main__ - Step 8944: {'lr': 0.0004972898431697355, 'samples': 1717248, 'steps': 8943, 'loss/train': 1.4316191673278809} 11/06/2021 22:29:39 - INFO - __main__ - Step 8945: {'lr': 0.0004972890638400644, 'samples': 1717440, 'steps': 8944, 'loss/train': 3.3050966262817383} 11/06/2021 22:29:39 - INFO - __main__ - Step 8946: {'lr': 0.0004972882843989687, 'samples': 1717632, 'steps': 8945, 'loss/train': 1.6966694593429565} 11/06/2021 22:29:40 - INFO - __main__ - Step 8947: {'lr': 0.0004972875048464487, 'samples': 1717824, 'steps': 8946, 'loss/train': 1.5771774053573608} 11/06/2021 22:29:41 - INFO - __main__ - Step 8948: {'lr': 0.0004972867251825048, 'samples': 1718016, 'steps': 8947, 'loss/train': 2.001603841781616} 11/06/2021 22:29:41 - INFO - __main__ - Step 8949: {'lr': 0.0004972859454071373, 'samples': 1718208, 'steps': 8948, 'loss/train': 1.704656720161438} 11/06/2021 22:29:41 - INFO - __main__ - Step 8950: {'lr': 0.0004972851655203465, 'samples': 1718400, 'steps': 8949, 'loss/train': 2.2928240299224854} 11/06/2021 22:29:42 - INFO - __main__ - Step 8951: {'lr': 0.000497284385522133, 'samples': 1718592, 'steps': 8950, 'loss/train': 2.0876405239105225} 11/06/2021 22:29:42 - INFO - __main__ - Step 8952: {'lr': 0.0004972836054124968, 'samples': 1718784, 'steps': 8951, 'loss/train': 1.9462846517562866} 11/06/2021 22:29:43 - INFO - __main__ - Step 8953: {'lr': 0.0004972828251914384, 'samples': 1718976, 'steps': 8952, 'loss/train': 1.3022961616516113} 11/06/2021 22:29:43 - INFO - __main__ - Step 8954: {'lr': 0.0004972820448589584, 'samples': 1719168, 'steps': 8953, 'loss/train': 1.0215758085250854} 11/06/2021 22:29:44 - INFO - __main__ - Step 8955: {'lr': 0.0004972812644150567, 'samples': 1719360, 'steps': 8954, 'loss/train': 1.7224310636520386} 11/06/2021 22:29:44 - INFO - __main__ - Step 8956: {'lr': 0.000497280483859734, 'samples': 1719552, 'steps': 8955, 'loss/train': 1.3424328565597534} 11/06/2021 22:29:45 - INFO - __main__ - Step 8957: {'lr': 0.0004972797031929904, 'samples': 1719744, 'steps': 8956, 'loss/train': 1.9782111644744873} 11/06/2021 22:29:45 - INFO - __main__ - Step 8958: {'lr': 0.0004972789224148266, 'samples': 1719936, 'steps': 8957, 'loss/train': 2.0518736839294434} 11/06/2021 22:29:46 - INFO - __main__ - Step 8959: {'lr': 0.0004972781415252426, 'samples': 1720128, 'steps': 8958, 'loss/train': 2.4421885013580322} 11/06/2021 22:29:46 - INFO - __main__ - Step 8960: {'lr': 0.0004972773605242388, 'samples': 1720320, 'steps': 8959, 'loss/train': 1.4392913579940796} 11/06/2021 22:29:47 - INFO - __main__ - Step 8961: {'lr': 0.0004972765794118158, 'samples': 1720512, 'steps': 8960, 'loss/train': 1.1508708000183105} 11/06/2021 22:29:47 - INFO - __main__ - Step 8962: {'lr': 0.0004972757981879737, 'samples': 1720704, 'steps': 8961, 'loss/train': 1.7767452001571655} 11/06/2021 22:29:48 - INFO - __main__ - Step 8963: {'lr': 0.000497275016852713, 'samples': 1720896, 'steps': 8962, 'loss/train': 2.000983953475952} 11/06/2021 22:29:48 - INFO - __main__ - Step 8964: {'lr': 0.0004972742354060339, 'samples': 1721088, 'steps': 8963, 'loss/train': 1.6095494031906128} 11/06/2021 22:29:49 - INFO - __main__ - Step 8965: {'lr': 0.0004972734538479369, 'samples': 1721280, 'steps': 8964, 'loss/train': 2.106870174407959} 11/06/2021 22:29:49 - INFO - __main__ - Step 8966: {'lr': 0.0004972726721784223, 'samples': 1721472, 'steps': 8965, 'loss/train': 1.3510915040969849} 11/06/2021 22:29:49 - INFO - __main__ - Step 8967: {'lr': 0.0004972718903974904, 'samples': 1721664, 'steps': 8966, 'loss/train': 2.6114096641540527} 11/06/2021 22:29:50 - INFO - __main__ - Step 8968: {'lr': 0.0004972711085051417, 'samples': 1721856, 'steps': 8967, 'loss/train': 1.257983922958374} 11/06/2021 22:29:51 - INFO - __main__ - Step 8969: {'lr': 0.0004972703265013764, 'samples': 1722048, 'steps': 8968, 'loss/train': 1.7631484270095825} 11/06/2021 22:29:51 - INFO - __main__ - Step 8970: {'lr': 0.0004972695443861949, 'samples': 1722240, 'steps': 8969, 'loss/train': 1.826780080795288} 11/06/2021 22:29:51 - INFO - __main__ - Step 8971: {'lr': 0.0004972687621595975, 'samples': 1722432, 'steps': 8970, 'loss/train': 1.8841552734375} 11/06/2021 22:29:52 - INFO - __main__ - Step 8972: {'lr': 0.0004972679798215847, 'samples': 1722624, 'steps': 8971, 'loss/train': 1.433433175086975} 11/06/2021 22:29:53 - INFO - __main__ - Step 8973: {'lr': 0.0004972671973721567, 'samples': 1722816, 'steps': 8972, 'loss/train': 1.9559110403060913} 11/06/2021 22:29:53 - INFO - __main__ - Step 8974: {'lr': 0.000497266414811314, 'samples': 1723008, 'steps': 8973, 'loss/train': 2.148409605026245} 11/06/2021 22:29:54 - INFO - __main__ - Step 8975: {'lr': 0.0004972656321390568, 'samples': 1723200, 'steps': 8974, 'loss/train': 1.603281855583191} 11/06/2021 22:29:54 - INFO - __main__ - Step 8976: {'lr': 0.0004972648493553856, 'samples': 1723392, 'steps': 8975, 'loss/train': 1.3855030536651611} 11/06/2021 22:29:54 - INFO - __main__ - Step 8977: {'lr': 0.0004972640664603006, 'samples': 1723584, 'steps': 8976, 'loss/train': 1.4343771934509277} 11/06/2021 22:29:55 - INFO - __main__ - Step 8978: {'lr': 0.0004972632834538023, 'samples': 1723776, 'steps': 8977, 'loss/train': 1.4176015853881836} 11/06/2021 22:29:56 - INFO - __main__ - Step 8979: {'lr': 0.0004972625003358908, 'samples': 1723968, 'steps': 8978, 'loss/train': 1.4474366903305054} 11/06/2021 22:29:56 - INFO - __main__ - Step 8980: {'lr': 0.0004972617171065668, 'samples': 1724160, 'steps': 8979, 'loss/train': 0.7004616260528564} 11/06/2021 22:29:56 - INFO - __main__ - Step 8981: {'lr': 0.0004972609337658305, 'samples': 1724352, 'steps': 8980, 'loss/train': 1.7616783380508423} 11/06/2021 22:29:57 - INFO - __main__ - Step 8982: {'lr': 0.0004972601503136822, 'samples': 1724544, 'steps': 8981, 'loss/train': 1.6659023761749268} 11/06/2021 22:29:57 - INFO - __main__ - Step 8983: {'lr': 0.0004972593667501222, 'samples': 1724736, 'steps': 8982, 'loss/train': 1.7344557046890259} 11/06/2021 22:29:58 - INFO - __main__ - Step 8984: {'lr': 0.0004972585830751511, 'samples': 1724928, 'steps': 8983, 'loss/train': 1.725408673286438} 11/06/2021 22:29:59 - INFO - __main__ - Step 8985: {'lr': 0.0004972577992887689, 'samples': 1725120, 'steps': 8984, 'loss/train': 1.803905725479126} 11/06/2021 22:29:59 - INFO - __main__ - Step 8986: {'lr': 0.0004972570153909763, 'samples': 1725312, 'steps': 8985, 'loss/train': 2.232513427734375} 11/06/2021 22:29:59 - INFO - __main__ - Step 8987: {'lr': 0.0004972562313817735, 'samples': 1725504, 'steps': 8986, 'loss/train': 0.8127360343933105} 11/06/2021 22:30:00 - INFO - __main__ - Step 8988: {'lr': 0.0004972554472611609, 'samples': 1725696, 'steps': 8987, 'loss/train': 2.031003713607788} 11/06/2021 22:30:01 - INFO - __main__ - Step 8989: {'lr': 0.0004972546630291387, 'samples': 1725888, 'steps': 8988, 'loss/train': 1.9944013357162476} 11/06/2021 22:30:01 - INFO - __main__ - Step 8990: {'lr': 0.0004972538786857073, 'samples': 1726080, 'steps': 8989, 'loss/train': 2.087233781814575} 11/06/2021 22:30:01 - INFO - __main__ - Step 8991: {'lr': 0.0004972530942308673, 'samples': 1726272, 'steps': 8990, 'loss/train': 2.0797486305236816} 11/06/2021 22:30:02 - INFO - __main__ - Step 8992: {'lr': 0.0004972523096646188, 'samples': 1726464, 'steps': 8991, 'loss/train': 1.4353597164154053} 11/06/2021 22:30:02 - INFO - __main__ - Step 8993: {'lr': 0.0004972515249869622, 'samples': 1726656, 'steps': 8992, 'loss/train': 1.706042766571045} 11/06/2021 22:30:03 - INFO - __main__ - Step 8994: {'lr': 0.000497250740197898, 'samples': 1726848, 'steps': 8993, 'loss/train': 1.5139485597610474} 11/06/2021 22:30:04 - INFO - __main__ - Step 8995: {'lr': 0.0004972499552974263, 'samples': 1727040, 'steps': 8994, 'loss/train': 1.8484820127487183} 11/06/2021 22:30:04 - INFO - __main__ - Step 8996: {'lr': 0.0004972491702855477, 'samples': 1727232, 'steps': 8995, 'loss/train': 1.559495210647583} 11/06/2021 22:30:04 - INFO - __main__ - Step 8997: {'lr': 0.0004972483851622623, 'samples': 1727424, 'steps': 8996, 'loss/train': 2.627495527267456} 11/06/2021 22:30:05 - INFO - __main__ - Step 8998: {'lr': 0.0004972475999275707, 'samples': 1727616, 'steps': 8997, 'loss/train': 1.676787257194519} 11/06/2021 22:30:06 - INFO - __main__ - Step 8999: {'lr': 0.0004972468145814729, 'samples': 1727808, 'steps': 8998, 'loss/train': 1.9144270420074463} 11/06/2021 22:30:06 - INFO - __main__ - Step 9000: {'lr': 0.0004972460291239697, 'samples': 1728000, 'steps': 8999, 'loss/train': 2.2997002601623535} 11/06/2021 22:30:06 - INFO - __main__ - Step 9001: {'lr': 0.0004972452435550613, 'samples': 1728192, 'steps': 9000, 'loss/train': 1.8456697463989258} 11/06/2021 22:30:07 - INFO - __main__ - Step 9002: {'lr': 0.000497244457874748, 'samples': 1728384, 'steps': 9001, 'loss/train': 1.736275315284729} 11/06/2021 22:30:07 - INFO - __main__ - Step 9003: {'lr': 0.0004972436720830301, 'samples': 1728576, 'steps': 9002, 'loss/train': 1.504355549812317} 11/06/2021 22:30:08 - INFO - __main__ - Step 9004: {'lr': 0.000497242886179908, 'samples': 1728768, 'steps': 9003, 'loss/train': 1.5996493101119995} 11/06/2021 22:30:08 - INFO - __main__ - Step 9005: {'lr': 0.0004972421001653822, 'samples': 1728960, 'steps': 9004, 'loss/train': 1.5102970600128174} 11/06/2021 22:30:09 - INFO - __main__ - Step 9006: {'lr': 0.0004972413140394528, 'samples': 1729152, 'steps': 9005, 'loss/train': 2.118058204650879} 11/06/2021 22:30:09 - INFO - __main__ - Step 9007: {'lr': 0.0004972405278021203, 'samples': 1729344, 'steps': 9006, 'loss/train': 2.190336227416992} 11/06/2021 22:30:09 - INFO - __main__ - Step 9008: {'lr': 0.000497239741453385, 'samples': 1729536, 'steps': 9007, 'loss/train': 1.8450963497161865} 11/06/2021 22:30:10 - INFO - __main__ - Step 9009: {'lr': 0.0004972389549932473, 'samples': 1729728, 'steps': 9008, 'loss/train': 2.0399389266967773} 11/06/2021 22:30:11 - INFO - __main__ - Step 9010: {'lr': 0.0004972381684217077, 'samples': 1729920, 'steps': 9009, 'loss/train': 1.8316234350204468} 11/06/2021 22:30:11 - INFO - __main__ - Step 9011: {'lr': 0.0004972373817387662, 'samples': 1730112, 'steps': 9010, 'loss/train': 0.9349846839904785} 11/06/2021 22:30:11 - INFO - __main__ - Step 9012: {'lr': 0.0004972365949444234, 'samples': 1730304, 'steps': 9011, 'loss/train': 1.6661643981933594} 11/06/2021 22:30:12 - INFO - __main__ - Step 9013: {'lr': 0.0004972358080386796, 'samples': 1730496, 'steps': 9012, 'loss/train': 1.3828058242797852} 11/06/2021 22:30:12 - INFO - __main__ - Step 9014: {'lr': 0.0004972350210215353, 'samples': 1730688, 'steps': 9013, 'loss/train': 1.3896691799163818} 11/06/2021 22:30:13 - INFO - __main__ - Step 9015: {'lr': 0.0004972342338929906, 'samples': 1730880, 'steps': 9014, 'loss/train': 1.6471271514892578} 11/06/2021 22:30:13 - INFO - __main__ - Step 9016: {'lr': 0.000497233446653046, 'samples': 1731072, 'steps': 9015, 'loss/train': 1.893143892288208} 11/06/2021 22:30:14 - INFO - __main__ - Step 9017: {'lr': 0.0004972326593017017, 'samples': 1731264, 'steps': 9016, 'loss/train': 1.5417567491531372} 11/06/2021 22:30:14 - INFO - __main__ - Step 9018: {'lr': 0.0004972318718389583, 'samples': 1731456, 'steps': 9017, 'loss/train': 2.1753854751586914} 11/06/2021 22:30:14 - INFO - __main__ - Step 9019: {'lr': 0.000497231084264816, 'samples': 1731648, 'steps': 9018, 'loss/train': 1.7801073789596558} 11/06/2021 22:30:16 - INFO - __main__ - Step 9020: {'lr': 0.0004972302965792752, 'samples': 1731840, 'steps': 9019, 'loss/train': 1.927689790725708} 11/06/2021 22:30:16 - INFO - __main__ - Step 9021: {'lr': 0.0004972295087823362, 'samples': 1732032, 'steps': 9020, 'loss/train': 1.54207444190979} 11/06/2021 22:30:16 - INFO - __main__ - Step 9022: {'lr': 0.0004972287208739995, 'samples': 1732224, 'steps': 9021, 'loss/train': 1.5854400396347046} 11/06/2021 22:30:17 - INFO - __main__ - Step 9023: {'lr': 0.0004972279328542652, 'samples': 1732416, 'steps': 9022, 'loss/train': 2.2019128799438477} 11/06/2021 22:30:17 - INFO - __main__ - Step 9024: {'lr': 0.000497227144723134, 'samples': 1732608, 'steps': 9023, 'loss/train': 1.814888596534729} 11/06/2021 22:30:18 - INFO - __main__ - Step 9025: {'lr': 0.0004972263564806059, 'samples': 1732800, 'steps': 9024, 'loss/train': 2.2460262775421143} 11/06/2021 22:30:18 - INFO - __main__ - Step 9026: {'lr': 0.0004972255681266816, 'samples': 1732992, 'steps': 9025, 'loss/train': 2.0175039768218994} 11/06/2021 22:30:19 - INFO - __main__ - Step 9027: {'lr': 0.0004972247796613611, 'samples': 1733184, 'steps': 9026, 'loss/train': 1.2223052978515625} 11/06/2021 22:30:19 - INFO - __main__ - Step 9028: {'lr': 0.000497223991084645, 'samples': 1733376, 'steps': 9027, 'loss/train': 1.550275206565857} 11/06/2021 22:30:19 - INFO - __main__ - Step 9029: {'lr': 0.0004972232023965335, 'samples': 1733568, 'steps': 9028, 'loss/train': 1.8038307428359985} 11/06/2021 22:30:20 - INFO - __main__ - Step 9030: {'lr': 0.0004972224135970271, 'samples': 1733760, 'steps': 9029, 'loss/train': 1.7093403339385986} 11/06/2021 22:30:21 - INFO - __main__ - Step 9031: {'lr': 0.0004972216246861262, 'samples': 1733952, 'steps': 9030, 'loss/train': 2.1160690784454346} 11/06/2021 22:30:21 - INFO - __main__ - Step 9032: {'lr': 0.0004972208356638309, 'samples': 1734144, 'steps': 9031, 'loss/train': 0.9576528072357178} 11/06/2021 22:30:21 - INFO - __main__ - Step 9033: {'lr': 0.0004972200465301418, 'samples': 1734336, 'steps': 9032, 'loss/train': 2.587282657623291} 11/06/2021 22:30:22 - INFO - __main__ - Step 9034: {'lr': 0.0004972192572850592, 'samples': 1734528, 'steps': 9033, 'loss/train': 1.5511127710342407} 11/06/2021 22:30:22 - INFO - __main__ - Step 9035: {'lr': 0.0004972184679285833, 'samples': 1734720, 'steps': 9034, 'loss/train': 1.6166201829910278} 11/06/2021 22:30:23 - INFO - __main__ - Step 9036: {'lr': 0.0004972176784607146, 'samples': 1734912, 'steps': 9035, 'loss/train': 1.4050244092941284} 11/06/2021 22:30:24 - INFO - __main__ - Step 9037: {'lr': 0.0004972168888814533, 'samples': 1735104, 'steps': 9036, 'loss/train': 2.019019842147827} 11/06/2021 22:30:24 - INFO - __main__ - Step 9038: {'lr': 0.0004972160991908001, 'samples': 1735296, 'steps': 9037, 'loss/train': 1.9093737602233887} 11/06/2021 22:30:24 - INFO - __main__ - Step 9039: {'lr': 0.0004972153093887551, 'samples': 1735488, 'steps': 9038, 'loss/train': 1.671932578086853} 11/06/2021 22:30:25 - INFO - __main__ - Step 9040: {'lr': 0.0004972145194753186, 'samples': 1735680, 'steps': 9039, 'loss/train': 1.3389232158660889} 11/06/2021 22:30:26 - INFO - __main__ - Step 9041: {'lr': 0.0004972137294504912, 'samples': 1735872, 'steps': 9040, 'loss/train': 2.4330387115478516} 11/06/2021 22:30:26 - INFO - __main__ - Step 9042: {'lr': 0.000497212939314273, 'samples': 1736064, 'steps': 9041, 'loss/train': 1.3045024871826172} 11/06/2021 22:30:26 - INFO - __main__ - Step 9043: {'lr': 0.0004972121490666644, 'samples': 1736256, 'steps': 9042, 'loss/train': 1.6939752101898193} 11/06/2021 22:30:27 - INFO - __main__ - Step 9044: {'lr': 0.000497211358707666, 'samples': 1736448, 'steps': 9043, 'loss/train': 1.862133502960205} 11/06/2021 22:30:27 - INFO - __main__ - Step 9045: {'lr': 0.0004972105682372779, 'samples': 1736640, 'steps': 9044, 'loss/train': 2.044528007507324} 11/06/2021 22:30:29 - INFO - __main__ - Step 9046: {'lr': 0.0004972097776555005, 'samples': 1736832, 'steps': 9045, 'loss/train': 1.8090245723724365} 11/06/2021 22:30:29 - INFO - __main__ - Step 9047: {'lr': 0.0004972089869623342, 'samples': 1737024, 'steps': 9046, 'loss/train': 1.5942614078521729} 11/06/2021 22:30:29 - INFO - __main__ - Step 9048: {'lr': 0.0004972081961577793, 'samples': 1737216, 'steps': 9047, 'loss/train': 2.0264124870300293} 11/06/2021 22:30:30 - INFO - __main__ - Step 9049: {'lr': 0.0004972074052418363, 'samples': 1737408, 'steps': 9048, 'loss/train': 1.723054051399231} 11/06/2021 22:30:30 - INFO - __main__ - Step 9050: {'lr': 0.0004972066142145055, 'samples': 1737600, 'steps': 9049, 'loss/train': 1.216286540031433} 11/06/2021 22:30:30 - INFO - __main__ - Step 9051: {'lr': 0.0004972058230757871, 'samples': 1737792, 'steps': 9050, 'loss/train': 1.870058536529541} 11/06/2021 22:30:31 - INFO - __main__ - Step 9052: {'lr': 0.0004972050318256815, 'samples': 1737984, 'steps': 9051, 'loss/train': 0.3041784465312958} 11/06/2021 22:30:32 - INFO - __main__ - Step 9053: {'lr': 0.0004972042404641893, 'samples': 1738176, 'steps': 9052, 'loss/train': 1.6829752922058105} 11/06/2021 22:30:32 - INFO - __main__ - Step 9054: {'lr': 0.0004972034489913106, 'samples': 1738368, 'steps': 9053, 'loss/train': 1.7771741151809692} 11/06/2021 22:30:32 - INFO - __main__ - Step 9055: {'lr': 0.0004972026574070459, 'samples': 1738560, 'steps': 9054, 'loss/train': 2.233604907989502} 11/06/2021 22:30:33 - INFO - __main__ - Step 9056: {'lr': 0.0004972018657113953, 'samples': 1738752, 'steps': 9055, 'loss/train': 2.35441517829895} 11/06/2021 22:30:34 - INFO - __main__ - Step 9057: {'lr': 0.0004972010739043596, 'samples': 1738944, 'steps': 9056, 'loss/train': 1.8315443992614746} 11/06/2021 22:30:34 - INFO - __main__ - Step 9058: {'lr': 0.0004972002819859388, 'samples': 1739136, 'steps': 9057, 'loss/train': 1.4996757507324219} 11/06/2021 22:30:34 - INFO - __main__ - Step 9059: {'lr': 0.0004971994899561334, 'samples': 1739328, 'steps': 9058, 'loss/train': 2.07483172416687} 11/06/2021 22:30:35 - INFO - __main__ - Step 9060: {'lr': 0.0004971986978149437, 'samples': 1739520, 'steps': 9059, 'loss/train': 1.568691372871399} 11/06/2021 22:30:35 - INFO - __main__ - Step 9061: {'lr': 0.0004971979055623701, 'samples': 1739712, 'steps': 9060, 'loss/train': 1.295201063156128} 11/06/2021 22:30:36 - INFO - __main__ - Step 9062: {'lr': 0.0004971971131984129, 'samples': 1739904, 'steps': 9061, 'loss/train': 2.6647539138793945} 11/06/2021 22:30:37 - INFO - __main__ - Step 9063: {'lr': 0.0004971963207230725, 'samples': 1740096, 'steps': 9062, 'loss/train': 1.6942551136016846} 11/06/2021 22:30:37 - INFO - __main__ - Step 9064: {'lr': 0.0004971955281363493, 'samples': 1740288, 'steps': 9063, 'loss/train': 1.9272085428237915} 11/06/2021 22:30:37 - INFO - __main__ - Step 9065: {'lr': 0.0004971947354382436, 'samples': 1740480, 'steps': 9064, 'loss/train': 1.482258677482605} 11/06/2021 22:30:38 - INFO - __main__ - Step 9066: {'lr': 0.0004971939426287557, 'samples': 1740672, 'steps': 9065, 'loss/train': 2.1765670776367188} 11/06/2021 22:30:39 - INFO - __main__ - Step 9067: {'lr': 0.0004971931497078861, 'samples': 1740864, 'steps': 9066, 'loss/train': 1.9422248601913452} 11/06/2021 22:30:39 - INFO - __main__ - Step 9068: {'lr': 0.000497192356675635, 'samples': 1741056, 'steps': 9067, 'loss/train': 0.8091076612472534} 11/06/2021 22:30:39 - INFO - __main__ - Step 9069: {'lr': 0.0004971915635320029, 'samples': 1741248, 'steps': 9068, 'loss/train': 2.0615551471710205} 11/06/2021 22:30:40 - INFO - __main__ - Step 9070: {'lr': 0.0004971907702769901, 'samples': 1741440, 'steps': 9069, 'loss/train': 1.8505113124847412} 11/06/2021 22:30:40 - INFO - __main__ - Step 9071: {'lr': 0.000497189976910597, 'samples': 1741632, 'steps': 9070, 'loss/train': 2.2380783557891846} 11/06/2021 22:30:41 - INFO - __main__ - Step 9072: {'lr': 0.0004971891834328238, 'samples': 1741824, 'steps': 9071, 'loss/train': 0.3026748597621918} 11/06/2021 22:30:41 - INFO - __main__ - Step 9073: {'lr': 0.000497188389843671, 'samples': 1742016, 'steps': 9072, 'loss/train': 1.1784673929214478} 11/06/2021 22:30:42 - INFO - __main__ - Step 9074: {'lr': 0.0004971875961431389, 'samples': 1742208, 'steps': 9073, 'loss/train': 1.7177537679672241} 11/06/2021 22:30:42 - INFO - __main__ - Step 9075: {'lr': 0.000497186802331228, 'samples': 1742400, 'steps': 9074, 'loss/train': 2.291787624359131} 11/06/2021 22:30:43 - INFO - __main__ - Step 9076: {'lr': 0.0004971860084079385, 'samples': 1742592, 'steps': 9075, 'loss/train': 1.1271553039550781} 11/06/2021 22:30:43 - INFO - __main__ - Step 9077: {'lr': 0.0004971852143732707, 'samples': 1742784, 'steps': 9076, 'loss/train': 1.6700618267059326} 11/06/2021 22:30:44 - INFO - __main__ - Step 9078: {'lr': 0.0004971844202272251, 'samples': 1742976, 'steps': 9077, 'loss/train': 1.7252506017684937} 11/06/2021 22:30:44 - INFO - __main__ - Step 9079: {'lr': 0.000497183625969802, 'samples': 1743168, 'steps': 9078, 'loss/train': 1.921452283859253} 11/06/2021 22:30:45 - INFO - __main__ - Step 9080: {'lr': 0.0004971828316010019, 'samples': 1743360, 'steps': 9079, 'loss/train': 1.8389270305633545} 11/06/2021 22:30:45 - INFO - __main__ - Step 9081: {'lr': 0.0004971820371208248, 'samples': 1743552, 'steps': 9080, 'loss/train': 1.7379286289215088} 11/06/2021 22:30:45 - INFO - __main__ - Step 9082: {'lr': 0.0004971812425292716, 'samples': 1743744, 'steps': 9081, 'loss/train': 2.1627986431121826} 11/06/2021 22:30:46 - INFO - __main__ - Step 9083: {'lr': 0.000497180447826342, 'samples': 1743936, 'steps': 9082, 'loss/train': 1.5003117322921753} 11/06/2021 22:30:47 - INFO - __main__ - Step 9084: {'lr': 0.0004971796530120371, 'samples': 1744128, 'steps': 9083, 'loss/train': 1.885785460472107} 11/06/2021 22:30:47 - INFO - __main__ - Step 9085: {'lr': 0.0004971788580863566, 'samples': 1744320, 'steps': 9084, 'loss/train': 1.6980434656143188} 11/06/2021 22:30:47 - INFO - __main__ - Step 9086: {'lr': 0.0004971780630493012, 'samples': 1744512, 'steps': 9085, 'loss/train': 1.8362492322921753} 11/06/2021 22:30:48 - INFO - __main__ - Step 9087: {'lr': 0.000497177267900871, 'samples': 1744704, 'steps': 9086, 'loss/train': 1.9315646886825562} 11/06/2021 22:30:49 - INFO - __main__ - Step 9088: {'lr': 0.0004971764726410668, 'samples': 1744896, 'steps': 9087, 'loss/train': 1.8160526752471924} 11/06/2021 22:30:49 - INFO - __main__ - Step 9089: {'lr': 0.0004971756772698886, 'samples': 1745088, 'steps': 9088, 'loss/train': 1.7330033779144287} 11/06/2021 22:30:50 - INFO - __main__ - Step 9090: {'lr': 0.0004971748817873367, 'samples': 1745280, 'steps': 9089, 'loss/train': 1.8694642782211304} 11/06/2021 22:30:50 - INFO - __main__ - Step 9091: {'lr': 0.0004971740861934117, 'samples': 1745472, 'steps': 9090, 'loss/train': 1.788362741470337} 11/06/2021 22:30:50 - INFO - __main__ - Step 9092: {'lr': 0.000497173290488114, 'samples': 1745664, 'steps': 9091, 'loss/train': 1.6177334785461426} 11/06/2021 22:30:52 - INFO - __main__ - Step 9093: {'lr': 0.0004971724946714437, 'samples': 1745856, 'steps': 9092, 'loss/train': 1.626973032951355} 11/06/2021 22:30:52 - INFO - __main__ - Step 9094: {'lr': 0.0004971716987434014, 'samples': 1746048, 'steps': 9093, 'loss/train': 1.3091498613357544} 11/06/2021 22:30:52 - INFO - __main__ - Step 9095: {'lr': 0.0004971709027039872, 'samples': 1746240, 'steps': 9094, 'loss/train': 1.3420382738113403} 11/06/2021 22:30:53 - INFO - __main__ - Step 9096: {'lr': 0.0004971701065532017, 'samples': 1746432, 'steps': 9095, 'loss/train': 1.4940237998962402} 11/06/2021 22:30:53 - INFO - __main__ - Step 9097: {'lr': 0.0004971693102910451, 'samples': 1746624, 'steps': 9096, 'loss/train': 2.1180343627929688} 11/06/2021 22:30:53 - INFO - __main__ - Step 9098: {'lr': 0.0004971685139175179, 'samples': 1746816, 'steps': 9097, 'loss/train': 2.4166030883789062} 11/06/2021 22:30:54 - INFO - __main__ - Step 9099: {'lr': 0.0004971677174326204, 'samples': 1747008, 'steps': 9098, 'loss/train': 5.840542316436768} 11/06/2021 22:30:55 - INFO - __main__ - Step 9100: {'lr': 0.0004971669208363529, 'samples': 1747200, 'steps': 9099, 'loss/train': 5.970412731170654} 11/06/2021 22:30:55 - INFO - __main__ - Step 9101: {'lr': 0.0004971661241287157, 'samples': 1747392, 'steps': 9100, 'loss/train': 2.0452849864959717} 11/06/2021 22:30:55 - INFO - __main__ - Step 9102: {'lr': 0.0004971653273097094, 'samples': 1747584, 'steps': 9101, 'loss/train': 1.074893593788147} 11/06/2021 22:30:56 - INFO - __main__ - Step 9103: {'lr': 0.0004971645303793342, 'samples': 1747776, 'steps': 9102, 'loss/train': 1.511845588684082} 11/06/2021 22:30:56 - INFO - __main__ - Step 9104: {'lr': 0.0004971637333375904, 'samples': 1747968, 'steps': 9103, 'loss/train': 1.4513171911239624} 11/06/2021 22:30:57 - INFO - __main__ - Step 9105: {'lr': 0.0004971629361844785, 'samples': 1748160, 'steps': 9104, 'loss/train': 1.3599135875701904} 11/06/2021 22:30:58 - INFO - __main__ - Step 9106: {'lr': 0.0004971621389199988, 'samples': 1748352, 'steps': 9105, 'loss/train': 1.6643626689910889} 11/06/2021 22:30:58 - INFO - __main__ - Step 9107: {'lr': 0.0004971613415441516, 'samples': 1748544, 'steps': 9106, 'loss/train': 1.0446491241455078} 11/06/2021 22:30:58 - INFO - __main__ - Step 9108: {'lr': 0.0004971605440569374, 'samples': 1748736, 'steps': 9107, 'loss/train': 1.783698558807373} 11/06/2021 22:30:59 - INFO - __main__ - Step 9109: {'lr': 0.0004971597464583563, 'samples': 1748928, 'steps': 9108, 'loss/train': 1.7281625270843506} 11/06/2021 22:31:00 - INFO - __main__ - Step 9110: {'lr': 0.0004971589487484091, 'samples': 1749120, 'steps': 9109, 'loss/train': 1.642425537109375} 11/06/2021 22:31:00 - INFO - __main__ - Step 9111: {'lr': 0.0004971581509270956, 'samples': 1749312, 'steps': 9110, 'loss/train': 1.2902659177780151} 11/06/2021 22:31:01 - INFO - __main__ - Step 9112: {'lr': 0.0004971573529944167, 'samples': 1749504, 'steps': 9111, 'loss/train': 2.028707981109619} 11/06/2021 22:31:01 - INFO - __main__ - Step 9113: {'lr': 0.0004971565549503723, 'samples': 1749696, 'steps': 9112, 'loss/train': 2.295679807662964} 11/06/2021 22:31:01 - INFO - __main__ - Step 9114: {'lr': 0.0004971557567949631, 'samples': 1749888, 'steps': 9113, 'loss/train': 1.6835025548934937} 11/06/2021 22:31:02 - INFO - __main__ - Step 9115: {'lr': 0.0004971549585281893, 'samples': 1750080, 'steps': 9114, 'loss/train': 1.3464192152023315} 11/06/2021 22:31:03 - INFO - __main__ - Step 9116: {'lr': 0.0004971541601500513, 'samples': 1750272, 'steps': 9115, 'loss/train': 1.7078288793563843} 11/06/2021 22:31:03 - INFO - __main__ - Step 9117: {'lr': 0.0004971533616605495, 'samples': 1750464, 'steps': 9116, 'loss/train': 1.4062687158584595} 11/06/2021 22:31:03 - INFO - __main__ - Step 9118: {'lr': 0.0004971525630596841, 'samples': 1750656, 'steps': 9117, 'loss/train': 1.889479637145996} 11/06/2021 22:31:04 - INFO - __main__ - Step 9119: {'lr': 0.0004971517643474556, 'samples': 1750848, 'steps': 9118, 'loss/train': 2.2042081356048584} 11/06/2021 22:31:04 - INFO - __main__ - Step 9120: {'lr': 0.0004971509655238643, 'samples': 1751040, 'steps': 9119, 'loss/train': 1.9173781871795654} 11/06/2021 22:31:05 - INFO - __main__ - Step 9121: {'lr': 0.0004971501665889107, 'samples': 1751232, 'steps': 9120, 'loss/train': 1.5485329627990723} 11/06/2021 22:31:05 - INFO - __main__ - Step 9122: {'lr': 0.000497149367542595, 'samples': 1751424, 'steps': 9121, 'loss/train': 2.2474119663238525} 11/06/2021 22:31:06 - INFO - __main__ - Step 9123: {'lr': 0.0004971485683849176, 'samples': 1751616, 'steps': 9122, 'loss/train': 1.9738703966140747} 11/06/2021 22:31:06 - INFO - __main__ - Step 9124: {'lr': 0.0004971477691158788, 'samples': 1751808, 'steps': 9123, 'loss/train': 1.9322994947433472} 11/06/2021 22:31:06 - INFO - __main__ - Step 9125: {'lr': 0.0004971469697354792, 'samples': 1752000, 'steps': 9124, 'loss/train': 2.1170740127563477} 11/06/2021 22:31:08 - INFO - __main__ - Step 9126: {'lr': 0.0004971461702437188, 'samples': 1752192, 'steps': 9125, 'loss/train': 1.9848999977111816} 11/06/2021 22:31:08 - INFO - __main__ - Step 9127: {'lr': 0.0004971453706405981, 'samples': 1752384, 'steps': 9126, 'loss/train': 1.6258814334869385} 11/06/2021 22:31:08 - INFO - __main__ - Step 9128: {'lr': 0.0004971445709261177, 'samples': 1752576, 'steps': 9127, 'loss/train': 1.3074932098388672} 11/06/2021 22:31:09 - INFO - __main__ - Step 9129: {'lr': 0.0004971437711002777, 'samples': 1752768, 'steps': 9128, 'loss/train': 1.880787968635559} 11/06/2021 22:31:09 - INFO - __main__ - Step 9130: {'lr': 0.0004971429711630786, 'samples': 1752960, 'steps': 9129, 'loss/train': 1.60108482837677} 11/06/2021 22:31:10 - INFO - __main__ - Step 9131: {'lr': 0.0004971421711145207, 'samples': 1753152, 'steps': 9130, 'loss/train': 1.9594632387161255} 11/06/2021 22:31:10 - INFO - __main__ - Step 9132: {'lr': 0.0004971413709546043, 'samples': 1753344, 'steps': 9131, 'loss/train': 1.4151302576065063} 11/06/2021 22:31:11 - INFO - __main__ - Step 9133: {'lr': 0.0004971405706833297, 'samples': 1753536, 'steps': 9132, 'loss/train': 2.054569721221924} 11/06/2021 22:31:11 - INFO - __main__ - Step 9134: {'lr': 0.0004971397703006974, 'samples': 1753728, 'steps': 9133, 'loss/train': 2.073517322540283} 11/06/2021 22:31:11 - INFO - __main__ - Step 9135: {'lr': 0.0004971389698067079, 'samples': 1753920, 'steps': 9134, 'loss/train': 2.4211151599884033} 11/06/2021 22:31:12 - INFO - __main__ - Step 9136: {'lr': 0.0004971381692013612, 'samples': 1754112, 'steps': 9135, 'loss/train': 1.4416491985321045} 11/06/2021 22:31:13 - INFO - __main__ - Step 9137: {'lr': 0.000497137368484658, 'samples': 1754304, 'steps': 9136, 'loss/train': 1.7312275171279907} 11/06/2021 22:31:13 - INFO - __main__ - Step 9138: {'lr': 0.0004971365676565984, 'samples': 1754496, 'steps': 9137, 'loss/train': 1.9141130447387695} 11/06/2021 22:31:13 - INFO - __main__ - Step 9139: {'lr': 0.000497135766717183, 'samples': 1754688, 'steps': 9138, 'loss/train': 1.8034145832061768} 11/06/2021 22:31:14 - INFO - __main__ - Step 9140: {'lr': 0.000497134965666412, 'samples': 1754880, 'steps': 9139, 'loss/train': 1.8870742321014404} 11/06/2021 22:31:14 - INFO - __main__ - Step 9141: {'lr': 0.0004971341645042857, 'samples': 1755072, 'steps': 9140, 'loss/train': 1.999513030052185} 11/06/2021 22:31:15 - INFO - __main__ - Step 9142: {'lr': 0.0004971333632308047, 'samples': 1755264, 'steps': 9141, 'loss/train': 2.0353739261627197} 11/06/2021 22:31:15 - INFO - __main__ - Step 9143: {'lr': 0.0004971325618459691, 'samples': 1755456, 'steps': 9142, 'loss/train': 2.1731367111206055} 11/06/2021 22:31:16 - INFO - __main__ - Step 9144: {'lr': 0.0004971317603497795, 'samples': 1755648, 'steps': 9143, 'loss/train': 1.4460906982421875} 11/06/2021 22:31:16 - INFO - __main__ - Step 9145: {'lr': 0.000497130958742236, 'samples': 1755840, 'steps': 9144, 'loss/train': 1.5800894498825073} 11/06/2021 22:31:17 - INFO - __main__ - Step 9146: {'lr': 0.0004971301570233392, 'samples': 1756032, 'steps': 9145, 'loss/train': 2.155123472213745} 11/06/2021 22:31:18 - INFO - __main__ - Step 9147: {'lr': 0.0004971293551930894, 'samples': 1756224, 'steps': 9146, 'loss/train': 1.0763784646987915} 11/06/2021 22:31:18 - INFO - __main__ - Step 9148: {'lr': 0.0004971285532514868, 'samples': 1756416, 'steps': 9147, 'loss/train': 1.7974958419799805} 11/06/2021 22:31:18 - INFO - __main__ - Step 9149: {'lr': 0.000497127751198532, 'samples': 1756608, 'steps': 9148, 'loss/train': 1.7774125337600708} 11/06/2021 22:31:19 - INFO - __main__ - Step 9150: {'lr': 0.0004971269490342252, 'samples': 1756800, 'steps': 9149, 'loss/train': 1.6433255672454834} 11/06/2021 22:31:19 - INFO - __main__ - Step 9151: {'lr': 0.0004971261467585669, 'samples': 1756992, 'steps': 9150, 'loss/train': 2.028750419616699} 11/06/2021 22:31:20 - INFO - __main__ - Step 9152: {'lr': 0.0004971253443715572, 'samples': 1757184, 'steps': 9151, 'loss/train': 1.2226204872131348} 11/06/2021 22:31:20 - INFO - __main__ - Step 9153: {'lr': 0.0004971245418731966, 'samples': 1757376, 'steps': 9152, 'loss/train': 2.202749490737915} 11/06/2021 22:31:21 - INFO - __main__ - Step 9154: {'lr': 0.0004971237392634857, 'samples': 1757568, 'steps': 9153, 'loss/train': 1.7844159603118896} 11/06/2021 22:31:21 - INFO - __main__ - Step 9155: {'lr': 0.0004971229365424246, 'samples': 1757760, 'steps': 9154, 'loss/train': 1.973191738128662} 11/06/2021 22:31:21 - INFO - __main__ - Step 9156: {'lr': 0.0004971221337100137, 'samples': 1757952, 'steps': 9155, 'loss/train': 1.785252332687378} 11/06/2021 22:31:23 - INFO - __main__ - Step 9157: {'lr': 0.0004971213307662534, 'samples': 1758144, 'steps': 9156, 'loss/train': 2.1560020446777344} 11/06/2021 22:31:23 - INFO - __main__ - Step 9158: {'lr': 0.000497120527711144, 'samples': 1758336, 'steps': 9157, 'loss/train': 1.1199406385421753} 11/06/2021 22:31:23 - INFO - __main__ - Step 9159: {'lr': 0.0004971197245446859, 'samples': 1758528, 'steps': 9158, 'loss/train': 2.273866891860962} 11/06/2021 22:31:24 - INFO - __main__ - Step 9160: {'lr': 0.0004971189212668794, 'samples': 1758720, 'steps': 9159, 'loss/train': 2.1585633754730225} 11/06/2021 22:31:24 - INFO - __main__ - Step 9161: {'lr': 0.0004971181178777251, 'samples': 1758912, 'steps': 9160, 'loss/train': 2.7556064128875732} 11/06/2021 22:31:24 - INFO - __main__ - Step 9162: {'lr': 0.0004971173143772231, 'samples': 1759104, 'steps': 9161, 'loss/train': 1.7603553533554077} 11/06/2021 22:31:25 - INFO - __main__ - Step 9163: {'lr': 0.0004971165107653738, 'samples': 1759296, 'steps': 9162, 'loss/train': 1.7750883102416992} 11/06/2021 22:31:26 - INFO - __main__ - Step 9164: {'lr': 0.0004971157070421776, 'samples': 1759488, 'steps': 9163, 'loss/train': 2.1838340759277344} 11/06/2021 22:31:26 - INFO - __main__ - Step 9165: {'lr': 0.000497114903207635, 'samples': 1759680, 'steps': 9164, 'loss/train': 2.311770439147949} 11/06/2021 22:31:26 - INFO - __main__ - Step 9166: {'lr': 0.0004971140992617462, 'samples': 1759872, 'steps': 9165, 'loss/train': 1.977513074874878} 11/06/2021 22:31:27 - INFO - __main__ - Step 9167: {'lr': 0.0004971132952045115, 'samples': 1760064, 'steps': 9166, 'loss/train': 1.9859895706176758} 11/06/2021 22:31:28 - INFO - __main__ - Step 9168: {'lr': 0.0004971124910359315, 'samples': 1760256, 'steps': 9167, 'loss/train': 1.7666734457015991} 11/06/2021 22:31:28 - INFO - __main__ - Step 9169: {'lr': 0.0004971116867560064, 'samples': 1760448, 'steps': 9168, 'loss/train': 1.7961926460266113} 11/06/2021 22:31:28 - INFO - __main__ - Step 9170: {'lr': 0.0004971108823647365, 'samples': 1760640, 'steps': 9169, 'loss/train': 1.904386281967163} 11/06/2021 22:31:29 - INFO - __main__ - Step 9171: {'lr': 0.0004971100778621223, 'samples': 1760832, 'steps': 9170, 'loss/train': 1.477123498916626} 11/06/2021 22:31:29 - INFO - __main__ - Step 9172: {'lr': 0.0004971092732481641, 'samples': 1761024, 'steps': 9171, 'loss/train': 1.5037059783935547} 11/06/2021 22:31:30 - INFO - __main__ - Step 9173: {'lr': 0.0004971084685228623, 'samples': 1761216, 'steps': 9172, 'loss/train': 1.8202991485595703} 11/06/2021 22:31:30 - INFO - __main__ - Step 9174: {'lr': 0.0004971076636862172, 'samples': 1761408, 'steps': 9173, 'loss/train': 1.7181813716888428} 11/06/2021 22:31:31 - INFO - __main__ - Step 9175: {'lr': 0.0004971068587382293, 'samples': 1761600, 'steps': 9174, 'loss/train': 1.4500627517700195} 11/06/2021 22:31:31 - INFO - __main__ - Step 9176: {'lr': 0.0004971060536788988, 'samples': 1761792, 'steps': 9175, 'loss/train': 1.9381426572799683} 11/06/2021 22:31:31 - INFO - __main__ - Step 9177: {'lr': 0.000497105248508226, 'samples': 1761984, 'steps': 9176, 'loss/train': 1.841038465499878} 11/06/2021 22:31:33 - INFO - __main__ - Step 9178: {'lr': 0.0004971044432262115, 'samples': 1762176, 'steps': 9177, 'loss/train': 1.351794958114624} 11/06/2021 22:31:33 - INFO - __main__ - Step 9179: {'lr': 0.0004971036378328556, 'samples': 1762368, 'steps': 9178, 'loss/train': 1.7219154834747314} 11/06/2021 22:31:33 - INFO - __main__ - Step 9180: {'lr': 0.0004971028323281586, 'samples': 1762560, 'steps': 9179, 'loss/train': 2.6767475605010986} 11/06/2021 22:31:34 - INFO - __main__ - Step 9181: {'lr': 0.0004971020267121208, 'samples': 1762752, 'steps': 9180, 'loss/train': 2.2747514247894287} 11/06/2021 22:31:34 - INFO - __main__ - Step 9182: {'lr': 0.0004971012209847427, 'samples': 1762944, 'steps': 9181, 'loss/train': 1.0986586809158325} 11/06/2021 22:31:35 - INFO - __main__ - Step 9183: {'lr': 0.0004971004151460245, 'samples': 1763136, 'steps': 9182, 'loss/train': 1.2503761053085327} 11/06/2021 22:31:35 - INFO - __main__ - Step 9184: {'lr': 0.0004970996091959668, 'samples': 1763328, 'steps': 9183, 'loss/train': 1.487623691558838} 11/06/2021 22:31:36 - INFO - __main__ - Step 9185: {'lr': 0.0004970988031345698, 'samples': 1763520, 'steps': 9184, 'loss/train': 1.718331217765808} 11/06/2021 22:31:36 - INFO - __main__ - Step 9186: {'lr': 0.0004970979969618338, 'samples': 1763712, 'steps': 9185, 'loss/train': 1.1969623565673828} 11/06/2021 22:31:36 - INFO - __main__ - Step 9187: {'lr': 0.0004970971906777593, 'samples': 1763904, 'steps': 9186, 'loss/train': 1.9592634439468384} 11/06/2021 22:31:37 - INFO - __main__ - Step 9188: {'lr': 0.0004970963842823468, 'samples': 1764096, 'steps': 9187, 'loss/train': 1.1920486688613892} 11/06/2021 22:31:38 - INFO - __main__ - Step 9189: {'lr': 0.0004970955777755963, 'samples': 1764288, 'steps': 9188, 'loss/train': 1.6599271297454834} 11/06/2021 22:31:39 - INFO - __main__ - Step 9190: {'lr': 0.0004970947711575083, 'samples': 1764480, 'steps': 9189, 'loss/train': 1.8432694673538208} 11/06/2021 22:31:39 - INFO - __main__ - Step 9191: {'lr': 0.0004970939644280833, 'samples': 1764672, 'steps': 9190, 'loss/train': 1.6999132633209229} 11/06/2021 22:31:39 - INFO - __main__ - Step 9192: {'lr': 0.0004970931575873215, 'samples': 1764864, 'steps': 9191, 'loss/train': 1.533847451210022} 11/06/2021 22:31:40 - INFO - __main__ - Step 9193: {'lr': 0.0004970923506352234, 'samples': 1765056, 'steps': 9192, 'loss/train': 1.8772602081298828} 11/06/2021 22:31:40 - INFO - __main__ - Step 9194: {'lr': 0.0004970915435717893, 'samples': 1765248, 'steps': 9193, 'loss/train': 1.8430668115615845} 11/06/2021 22:31:41 - INFO - __main__ - Step 9195: {'lr': 0.0004970907363970196, 'samples': 1765440, 'steps': 9194, 'loss/train': 1.7536627054214478} 11/06/2021 22:31:41 - INFO - __main__ - Step 9196: {'lr': 0.0004970899291109145, 'samples': 1765632, 'steps': 9195, 'loss/train': 1.858238697052002} 11/06/2021 22:31:42 - INFO - __main__ - Step 9197: {'lr': 0.0004970891217134746, 'samples': 1765824, 'steps': 9196, 'loss/train': 2.1001169681549072} 11/06/2021 22:31:42 - INFO - __main__ - Step 9198: {'lr': 0.0004970883142047001, 'samples': 1766016, 'steps': 9197, 'loss/train': 2.0097129344940186} 11/06/2021 22:31:43 - INFO - __main__ - Step 9199: {'lr': 0.0004970875065845914, 'samples': 1766208, 'steps': 9198, 'loss/train': 2.0182600021362305} 11/06/2021 22:31:44 - INFO - __main__ - Step 9200: {'lr': 0.000497086698853149, 'samples': 1766400, 'steps': 9199, 'loss/train': 1.785837173461914} 11/06/2021 22:31:44 - INFO - __main__ - Step 9201: {'lr': 0.0004970858910103731, 'samples': 1766592, 'steps': 9200, 'loss/train': 1.773368000984192} 11/06/2021 22:31:44 - INFO - __main__ - Step 9202: {'lr': 0.0004970850830562641, 'samples': 1766784, 'steps': 9201, 'loss/train': 2.1283977031707764} 11/06/2021 22:31:45 - INFO - __main__ - Step 9203: {'lr': 0.0004970842749908223, 'samples': 1766976, 'steps': 9202, 'loss/train': 1.710228443145752} 11/06/2021 22:31:45 - INFO - __main__ - Step 9204: {'lr': 0.0004970834668140482, 'samples': 1767168, 'steps': 9203, 'loss/train': 1.6465903520584106} 11/06/2021 22:31:46 - INFO - __main__ - Step 9205: {'lr': 0.0004970826585259421, 'samples': 1767360, 'steps': 9204, 'loss/train': 1.8432360887527466} 11/06/2021 22:31:46 - INFO - __main__ - Step 9206: {'lr': 0.0004970818501265044, 'samples': 1767552, 'steps': 9205, 'loss/train': 1.9140604734420776} 11/06/2021 22:31:47 - INFO - __main__ - Step 9207: {'lr': 0.0004970810416157354, 'samples': 1767744, 'steps': 9206, 'loss/train': 1.4957774877548218} 11/06/2021 22:31:47 - INFO - __main__ - Step 9208: {'lr': 0.0004970802329936355, 'samples': 1767936, 'steps': 9207, 'loss/train': 1.110896110534668} 11/06/2021 22:31:47 - INFO - __main__ - Step 9209: {'lr': 0.000497079424260205, 'samples': 1768128, 'steps': 9208, 'loss/train': 2.041264057159424} 11/06/2021 22:31:48 - INFO - __main__ - Step 9210: {'lr': 0.0004970786154154444, 'samples': 1768320, 'steps': 9209, 'loss/train': 1.8915854692459106} 11/06/2021 22:31:49 - INFO - __main__ - Step 9211: {'lr': 0.000497077806459354, 'samples': 1768512, 'steps': 9210, 'loss/train': 1.495437502861023} 11/06/2021 22:31:49 - INFO - __main__ - Step 9212: {'lr': 0.0004970769973919341, 'samples': 1768704, 'steps': 9211, 'loss/train': 1.2148029804229736} 11/06/2021 22:31:50 - INFO - __main__ - Step 9213: {'lr': 0.0004970761882131851, 'samples': 1768896, 'steps': 9212, 'loss/train': 3.2807438373565674} 11/06/2021 22:31:50 - INFO - __main__ - Step 9214: {'lr': 0.0004970753789231074, 'samples': 1769088, 'steps': 9213, 'loss/train': 1.3468689918518066} 11/06/2021 22:31:50 - INFO - __main__ - Step 9215: {'lr': 0.0004970745695217014, 'samples': 1769280, 'steps': 9214, 'loss/train': 2.1012730598449707} 11/06/2021 22:31:51 - INFO - __main__ - Step 9216: {'lr': 0.0004970737600089673, 'samples': 1769472, 'steps': 9215, 'loss/train': 2.018321990966797} 11/06/2021 22:31:52 - INFO - __main__ - Step 9217: {'lr': 0.0004970729503849057, 'samples': 1769664, 'steps': 9216, 'loss/train': 0.4318278431892395} 11/06/2021 22:31:52 - INFO - __main__ - Step 9218: {'lr': 0.0004970721406495168, 'samples': 1769856, 'steps': 9217, 'loss/train': 1.8669426441192627} 11/06/2021 22:31:53 - INFO - __main__ - Step 9219: {'lr': 0.000497071330802801, 'samples': 1770048, 'steps': 9218, 'loss/train': 1.9158834218978882} 11/06/2021 22:31:53 - INFO - __main__ - Step 9220: {'lr': 0.0004970705208447587, 'samples': 1770240, 'steps': 9219, 'loss/train': 1.9058424234390259} 11/06/2021 22:31:53 - INFO - __main__ - Step 9221: {'lr': 0.0004970697107753902, 'samples': 1770432, 'steps': 9220, 'loss/train': 2.554187059402466} 11/06/2021 22:31:54 - INFO - __main__ - Step 9222: {'lr': 0.0004970689005946959, 'samples': 1770624, 'steps': 9221, 'loss/train': 1.937746286392212} 11/06/2021 22:31:55 - INFO - __main__ - Step 9223: {'lr': 0.0004970680903026762, 'samples': 1770816, 'steps': 9222, 'loss/train': 1.8598175048828125} 11/06/2021 22:31:55 - INFO - __main__ - Step 9224: {'lr': 0.0004970672798993313, 'samples': 1771008, 'steps': 9223, 'loss/train': 1.6042256355285645} 11/06/2021 22:31:55 - INFO - __main__ - Step 9225: {'lr': 0.0004970664693846618, 'samples': 1771200, 'steps': 9224, 'loss/train': 1.7904118299484253} 11/06/2021 22:31:56 - INFO - __main__ - Step 9226: {'lr': 0.000497065658758668, 'samples': 1771392, 'steps': 9225, 'loss/train': 1.8984659910202026} 11/06/2021 22:31:57 - INFO - __main__ - Step 9227: {'lr': 0.0004970648480213502, 'samples': 1771584, 'steps': 9226, 'loss/train': 2.082733631134033} 11/06/2021 22:31:57 - INFO - __main__ - Step 9228: {'lr': 0.0004970640371727088, 'samples': 1771776, 'steps': 9227, 'loss/train': 1.6446478366851807} 11/06/2021 22:31:57 - INFO - __main__ - Step 9229: {'lr': 0.0004970632262127441, 'samples': 1771968, 'steps': 9228, 'loss/train': 1.608752965927124} 11/06/2021 22:31:58 - INFO - __main__ - Step 9230: {'lr': 0.0004970624151414565, 'samples': 1772160, 'steps': 9229, 'loss/train': 1.8369768857955933} 11/06/2021 22:31:58 - INFO - __main__ - Step 9231: {'lr': 0.0004970616039588465, 'samples': 1772352, 'steps': 9230, 'loss/train': 1.8356631994247437} 11/06/2021 22:31:59 - INFO - __main__ - Step 9232: {'lr': 0.0004970607926649143, 'samples': 1772544, 'steps': 9231, 'loss/train': 1.8199883699417114} 11/06/2021 22:31:59 - INFO - __main__ - Step 9233: {'lr': 0.0004970599812596603, 'samples': 1772736, 'steps': 9232, 'loss/train': 1.2998876571655273} 11/06/2021 22:32:00 - INFO - __main__ - Step 9234: {'lr': 0.0004970591697430849, 'samples': 1772928, 'steps': 9233, 'loss/train': 1.8944025039672852} 11/06/2021 22:32:00 - INFO - __main__ - Step 9235: {'lr': 0.0004970583581151885, 'samples': 1773120, 'steps': 9234, 'loss/train': 1.8377676010131836} 11/06/2021 22:32:00 - INFO - __main__ - Step 9236: {'lr': 0.0004970575463759713, 'samples': 1773312, 'steps': 9235, 'loss/train': 1.6329200267791748} 11/06/2021 22:32:02 - INFO - __main__ - Step 9237: {'lr': 0.0004970567345254339, 'samples': 1773504, 'steps': 9236, 'loss/train': 1.544152021408081} 11/06/2021 22:32:02 - INFO - __main__ - Step 9238: {'lr': 0.0004970559225635765, 'samples': 1773696, 'steps': 9237, 'loss/train': 1.5652451515197754} 11/06/2021 22:32:02 - INFO - __main__ - Step 9239: {'lr': 0.0004970551104903995, 'samples': 1773888, 'steps': 9238, 'loss/train': 2.267305374145508} 11/06/2021 22:32:03 - INFO - __main__ - Step 9240: {'lr': 0.0004970542983059033, 'samples': 1774080, 'steps': 9239, 'loss/train': 1.8805835247039795} 11/06/2021 22:32:03 - INFO - __main__ - Step 9241: {'lr': 0.0004970534860100883, 'samples': 1774272, 'steps': 9240, 'loss/train': 1.7890807390213013} 11/06/2021 22:32:04 - INFO - __main__ - Step 9242: {'lr': 0.0004970526736029547, 'samples': 1774464, 'steps': 9241, 'loss/train': 1.4509460926055908} 11/06/2021 22:32:04 - INFO - __main__ - Step 9243: {'lr': 0.000497051861084503, 'samples': 1774656, 'steps': 9242, 'loss/train': 1.3258929252624512} 11/06/2021 22:32:05 - INFO - __main__ - Step 9244: {'lr': 0.0004970510484547336, 'samples': 1774848, 'steps': 9243, 'loss/train': 1.9906156063079834} 11/06/2021 22:32:05 - INFO - __main__ - Step 9245: {'lr': 0.0004970502357136468, 'samples': 1775040, 'steps': 9244, 'loss/train': 1.9643011093139648} 11/06/2021 22:32:05 - INFO - __main__ - Step 9246: {'lr': 0.0004970494228612429, 'samples': 1775232, 'steps': 9245, 'loss/train': 1.9511194229125977} 11/06/2021 22:32:06 - INFO - __main__ - Step 9247: {'lr': 0.0004970486098975224, 'samples': 1775424, 'steps': 9246, 'loss/train': 1.7946999073028564} 11/06/2021 22:32:07 - INFO - __main__ - Step 9248: {'lr': 0.0004970477968224856, 'samples': 1775616, 'steps': 9247, 'loss/train': 1.9336384534835815} 11/06/2021 22:32:07 - INFO - __main__ - Step 9249: {'lr': 0.000497046983636133, 'samples': 1775808, 'steps': 9248, 'loss/train': 1.3553489446640015} 11/06/2021 22:32:07 - INFO - __main__ - Step 9250: {'lr': 0.0004970461703384647, 'samples': 1776000, 'steps': 9249, 'loss/train': 1.9446792602539062} 11/06/2021 22:32:08 - INFO - __main__ - Step 9251: {'lr': 0.0004970453569294812, 'samples': 1776192, 'steps': 9250, 'loss/train': 1.8456064462661743} 11/06/2021 22:32:08 - INFO - __main__ - Step 9252: {'lr': 0.000497044543409183, 'samples': 1776384, 'steps': 9251, 'loss/train': 1.9062843322753906} 11/06/2021 22:32:09 - INFO - __main__ - Step 9253: {'lr': 0.0004970437297775702, 'samples': 1776576, 'steps': 9252, 'loss/train': 1.9113454818725586} 11/06/2021 22:32:10 - INFO - __main__ - Step 9254: {'lr': 0.0004970429160346433, 'samples': 1776768, 'steps': 9253, 'loss/train': 1.5249048471450806} 11/06/2021 22:32:10 - INFO - __main__ - Step 9255: {'lr': 0.0004970421021804027, 'samples': 1776960, 'steps': 9254, 'loss/train': 2.172207832336426} 11/06/2021 22:32:10 - INFO - __main__ - Step 9256: {'lr': 0.0004970412882148488, 'samples': 1777152, 'steps': 9255, 'loss/train': 1.1907613277435303} 11/06/2021 22:32:11 - INFO - __main__ - Step 9257: {'lr': 0.0004970404741379818, 'samples': 1777344, 'steps': 9256, 'loss/train': 1.6788753271102905} 11/06/2021 22:32:12 - INFO - __main__ - Step 9258: {'lr': 0.0004970396599498023, 'samples': 1777536, 'steps': 9257, 'loss/train': 1.072892189025879} 11/06/2021 22:32:12 - INFO - __main__ - Step 9259: {'lr': 0.0004970388456503105, 'samples': 1777728, 'steps': 9258, 'loss/train': 1.4916554689407349} 11/06/2021 22:32:13 - INFO - __main__ - Step 9260: {'lr': 0.0004970380312395069, 'samples': 1777920, 'steps': 9259, 'loss/train': 1.7013155221939087} 11/06/2021 22:32:13 - INFO - __main__ - Step 9261: {'lr': 0.0004970372167173915, 'samples': 1778112, 'steps': 9260, 'loss/train': 2.010669708251953} 11/06/2021 22:32:13 - INFO - __main__ - Step 9262: {'lr': 0.0004970364020839652, 'samples': 1778304, 'steps': 9261, 'loss/train': 1.8238394260406494} 11/06/2021 22:32:14 - INFO - __main__ - Step 9263: {'lr': 0.0004970355873392281, 'samples': 1778496, 'steps': 9262, 'loss/train': 1.3550951480865479} 11/06/2021 22:32:15 - INFO - __main__ - Step 9264: {'lr': 0.0004970347724831804, 'samples': 1778688, 'steps': 9263, 'loss/train': 1.880463719367981} 11/06/2021 22:32:15 - INFO - __main__ - Step 9265: {'lr': 0.0004970339575158228, 'samples': 1778880, 'steps': 9264, 'loss/train': 1.692347764968872} 11/06/2021 22:32:15 - INFO - __main__ - Step 9266: {'lr': 0.0004970331424371555, 'samples': 1779072, 'steps': 9265, 'loss/train': 1.760794997215271} 11/06/2021 22:32:16 - INFO - __main__ - Step 9267: {'lr': 0.0004970323272471788, 'samples': 1779264, 'steps': 9266, 'loss/train': 1.9668833017349243} 11/06/2021 22:32:16 - INFO - __main__ - Step 9268: {'lr': 0.0004970315119458931, 'samples': 1779456, 'steps': 9267, 'loss/train': 1.905427098274231} 11/06/2021 22:32:17 - INFO - __main__ - Step 9269: {'lr': 0.000497030696533299, 'samples': 1779648, 'steps': 9268, 'loss/train': 1.6687836647033691} 11/06/2021 22:32:17 - INFO - __main__ - Step 9270: {'lr': 0.0004970298810093965, 'samples': 1779840, 'steps': 9269, 'loss/train': 1.7328218221664429} 11/06/2021 22:32:18 - INFO - __main__ - Step 9271: {'lr': 0.0004970290653741863, 'samples': 1780032, 'steps': 9270, 'loss/train': 1.6917048692703247} 11/06/2021 22:32:18 - INFO - __main__ - Step 9272: {'lr': 0.0004970282496276684, 'samples': 1780224, 'steps': 9271, 'loss/train': 1.8625608682632446} 11/06/2021 22:32:19 - INFO - __main__ - Step 9273: {'lr': 0.0004970274337698436, 'samples': 1780416, 'steps': 9272, 'loss/train': 1.789774775505066} 11/06/2021 22:32:19 - INFO - __main__ - Step 9274: {'lr': 0.000497026617800712, 'samples': 1780608, 'steps': 9273, 'loss/train': 1.5155421495437622} 11/06/2021 22:32:20 - INFO - __main__ - Step 9275: {'lr': 0.000497025801720274, 'samples': 1780800, 'steps': 9274, 'loss/train': 1.8197972774505615} 11/06/2021 22:32:20 - INFO - __main__ - Step 9276: {'lr': 0.00049702498552853, 'samples': 1780992, 'steps': 9275, 'loss/train': 2.1639773845672607} 11/06/2021 22:32:21 - INFO - __main__ - Step 9277: {'lr': 0.0004970241692254803, 'samples': 1781184, 'steps': 9276, 'loss/train': 1.6572617292404175} 11/06/2021 22:32:21 - INFO - __main__ - Step 9278: {'lr': 0.0004970233528111253, 'samples': 1781376, 'steps': 9277, 'loss/train': 1.9275052547454834} 11/06/2021 22:32:22 - INFO - __main__ - Step 9279: {'lr': 0.0004970225362854654, 'samples': 1781568, 'steps': 9278, 'loss/train': 1.4207885265350342} 11/06/2021 22:32:22 - INFO - __main__ - Step 9280: {'lr': 0.0004970217196485011, 'samples': 1781760, 'steps': 9279, 'loss/train': 1.7480263710021973} 11/06/2021 22:32:23 - INFO - __main__ - Step 9281: {'lr': 0.0004970209029002325, 'samples': 1781952, 'steps': 9280, 'loss/train': 2.001077890396118} 11/06/2021 22:32:23 - INFO - __main__ - Step 9282: {'lr': 0.0004970200860406601, 'samples': 1782144, 'steps': 9281, 'loss/train': 1.4385563135147095} 11/06/2021 22:32:23 - INFO - __main__ - Step 9283: {'lr': 0.0004970192690697843, 'samples': 1782336, 'steps': 9282, 'loss/train': 1.1170109510421753} 11/06/2021 22:32:24 - INFO - __main__ - Step 9284: {'lr': 0.0004970184519876053, 'samples': 1782528, 'steps': 9283, 'loss/train': 1.6639217138290405} 11/06/2021 22:32:25 - INFO - __main__ - Step 9285: {'lr': 0.0004970176347941237, 'samples': 1782720, 'steps': 9284, 'loss/train': 1.5901360511779785} 11/06/2021 22:32:25 - INFO - __main__ - Step 9286: {'lr': 0.0004970168174893398, 'samples': 1782912, 'steps': 9285, 'loss/train': 2.016771078109741} 11/06/2021 22:32:25 - INFO - __main__ - Step 9287: {'lr': 0.0004970160000732539, 'samples': 1783104, 'steps': 9286, 'loss/train': 1.7004362344741821} 11/06/2021 22:32:26 - INFO - __main__ - Step 9288: {'lr': 0.0004970151825458664, 'samples': 1783296, 'steps': 9287, 'loss/train': 2.1699607372283936} 11/06/2021 22:32:27 - INFO - __main__ - Step 9289: {'lr': 0.0004970143649071777, 'samples': 1783488, 'steps': 9288, 'loss/train': 1.9318630695343018} 11/06/2021 22:32:27 - INFO - __main__ - Step 9290: {'lr': 0.0004970135471571881, 'samples': 1783680, 'steps': 9289, 'loss/train': 1.4492002725601196} 11/06/2021 22:32:28 - INFO - __main__ - Step 9291: {'lr': 0.000497012729295898, 'samples': 1783872, 'steps': 9290, 'loss/train': 1.6423429250717163} 11/06/2021 22:32:28 - INFO - __main__ - Step 9292: {'lr': 0.0004970119113233078, 'samples': 1784064, 'steps': 9291, 'loss/train': 1.7522389888763428} 11/06/2021 22:32:28 - INFO - __main__ - Step 9293: {'lr': 0.0004970110932394178, 'samples': 1784256, 'steps': 9292, 'loss/train': 1.4317411184310913} 11/06/2021 22:32:29 - INFO - __main__ - Step 9294: {'lr': 0.0004970102750442285, 'samples': 1784448, 'steps': 9293, 'loss/train': 2.1833393573760986} 11/06/2021 22:32:30 - INFO - __main__ - Step 9295: {'lr': 0.0004970094567377402, 'samples': 1784640, 'steps': 9294, 'loss/train': 0.5875868797302246} 11/06/2021 22:32:30 - INFO - __main__ - Step 9296: {'lr': 0.0004970086383199532, 'samples': 1784832, 'steps': 9295, 'loss/train': 2.128361701965332} 11/06/2021 22:32:30 - INFO - __main__ - Step 9297: {'lr': 0.0004970078197908678, 'samples': 1785024, 'steps': 9296, 'loss/train': 1.418079137802124} 11/06/2021 22:32:31 - INFO - __main__ - Step 9298: {'lr': 0.0004970070011504846, 'samples': 1785216, 'steps': 9297, 'loss/train': 1.8061836957931519} 11/06/2021 22:32:31 - INFO - __main__ - Step 9299: {'lr': 0.0004970061823988038, 'samples': 1785408, 'steps': 9298, 'loss/train': 1.9081430435180664} 11/06/2021 22:32:32 - INFO - __main__ - Step 9300: {'lr': 0.0004970053635358259, 'samples': 1785600, 'steps': 9299, 'loss/train': 1.5807913541793823} 11/06/2021 22:32:33 - INFO - __main__ - Step 9301: {'lr': 0.0004970045445615512, 'samples': 1785792, 'steps': 9300, 'loss/train': 1.5762659311294556} 11/06/2021 22:32:33 - INFO - __main__ - Step 9302: {'lr': 0.00049700372547598, 'samples': 1785984, 'steps': 9301, 'loss/train': 1.5313711166381836} 11/06/2021 22:32:33 - INFO - __main__ - Step 9303: {'lr': 0.0004970029062791128, 'samples': 1786176, 'steps': 9302, 'loss/train': 1.933624505996704} 11/06/2021 22:32:34 - INFO - __main__ - Step 9304: {'lr': 0.0004970020869709498, 'samples': 1786368, 'steps': 9303, 'loss/train': 1.6305302381515503} 11/06/2021 22:32:35 - INFO - __main__ - Step 9305: {'lr': 0.0004970012675514915, 'samples': 1786560, 'steps': 9304, 'loss/train': 1.5275381803512573} 11/06/2021 22:32:35 - INFO - __main__ - Step 9306: {'lr': 0.0004970004480207384, 'samples': 1786752, 'steps': 9305, 'loss/train': 1.6243155002593994} 11/06/2021 22:32:35 - INFO - __main__ - Step 9307: {'lr': 0.0004969996283786905, 'samples': 1786944, 'steps': 9306, 'loss/train': 1.8101203441619873} 11/06/2021 22:32:36 - INFO - __main__ - Step 9308: {'lr': 0.0004969988086253486, 'samples': 1787136, 'steps': 9307, 'loss/train': 1.9703173637390137} 11/06/2021 22:32:36 - INFO - __main__ - Step 9309: {'lr': 0.0004969979887607125, 'samples': 1787328, 'steps': 9308, 'loss/train': 1.6925435066223145} 11/06/2021 22:32:37 - INFO - __main__ - Step 9310: {'lr': 0.0004969971687847832, 'samples': 1787520, 'steps': 9309, 'loss/train': 2.585664749145508} 11/06/2021 22:32:37 - INFO - __main__ - Step 9311: {'lr': 0.0004969963486975607, 'samples': 1787712, 'steps': 9310, 'loss/train': 1.2041494846343994} 11/06/2021 22:32:38 - INFO - __main__ - Step 9312: {'lr': 0.0004969955284990455, 'samples': 1787904, 'steps': 9311, 'loss/train': 1.6597546339035034} 11/06/2021 22:32:38 - INFO - __main__ - Step 9313: {'lr': 0.0004969947081892379, 'samples': 1788096, 'steps': 9312, 'loss/train': 1.6976758241653442} 11/06/2021 22:32:38 - INFO - __main__ - Step 9314: {'lr': 0.0004969938877681383, 'samples': 1788288, 'steps': 9313, 'loss/train': 1.9452763795852661} 11/06/2021 22:32:39 - INFO - __main__ - Step 9315: {'lr': 0.0004969930672357471, 'samples': 1788480, 'steps': 9314, 'loss/train': 2.183551788330078} 11/06/2021 22:32:40 - INFO - __main__ - Step 9316: {'lr': 0.0004969922465920645, 'samples': 1788672, 'steps': 9315, 'loss/train': 2.362039089202881} 11/06/2021 22:32:40 - INFO - __main__ - Step 9317: {'lr': 0.0004969914258370912, 'samples': 1788864, 'steps': 9316, 'loss/train': 2.2192182540893555} 11/06/2021 22:32:41 - INFO - __main__ - Step 9318: {'lr': 0.0004969906049708272, 'samples': 1789056, 'steps': 9317, 'loss/train': 1.708990454673767} 11/06/2021 22:32:41 - INFO - __main__ - Step 9319: {'lr': 0.0004969897839932732, 'samples': 1789248, 'steps': 9318, 'loss/train': 1.988546371459961} 11/06/2021 22:32:41 - INFO - __main__ - Step 9320: {'lr': 0.0004969889629044293, 'samples': 1789440, 'steps': 9319, 'loss/train': 1.3980268239974976} 11/06/2021 22:32:42 - INFO - __main__ - Step 9321: {'lr': 0.000496988141704296, 'samples': 1789632, 'steps': 9320, 'loss/train': 6.165026664733887} 11/06/2021 22:32:43 - INFO - __main__ - Step 9322: {'lr': 0.0004969873203928737, 'samples': 1789824, 'steps': 9321, 'loss/train': 1.2539154291152954} 11/06/2021 22:32:43 - INFO - __main__ - Step 9323: {'lr': 0.0004969864989701626, 'samples': 1790016, 'steps': 9322, 'loss/train': 1.7540534734725952} 11/06/2021 22:32:43 - INFO - __main__ - Step 9324: {'lr': 0.0004969856774361634, 'samples': 1790208, 'steps': 9323, 'loss/train': 1.9644792079925537} 11/06/2021 22:32:44 - INFO - __main__ - Step 9325: {'lr': 0.0004969848557908761, 'samples': 1790400, 'steps': 9324, 'loss/train': 1.722193717956543} 11/06/2021 22:32:44 - INFO - __main__ - Step 9326: {'lr': 0.0004969840340343013, 'samples': 1790592, 'steps': 9325, 'loss/train': 2.381248712539673} 11/06/2021 22:32:45 - INFO - __main__ - Step 9327: {'lr': 0.0004969832121664394, 'samples': 1790784, 'steps': 9326, 'loss/train': 1.3223377466201782} 11/06/2021 22:32:46 - INFO - __main__ - Step 9328: {'lr': 0.0004969823901872906, 'samples': 1790976, 'steps': 9327, 'loss/train': 2.1059629917144775} 11/06/2021 22:32:46 - INFO - __main__ - Step 9329: {'lr': 0.0004969815680968552, 'samples': 1791168, 'steps': 9328, 'loss/train': 1.7567250728607178} 11/06/2021 22:32:46 - INFO - __main__ - Step 9330: {'lr': 0.0004969807458951339, 'samples': 1791360, 'steps': 9329, 'loss/train': 1.10848867893219} 11/06/2021 22:32:47 - INFO - __main__ - Step 9331: {'lr': 0.0004969799235821268, 'samples': 1791552, 'steps': 9330, 'loss/train': 1.527244210243225} 11/06/2021 22:32:48 - INFO - __main__ - Step 9332: {'lr': 0.0004969791011578344, 'samples': 1791744, 'steps': 9331, 'loss/train': 1.9717392921447754} 11/06/2021 22:32:48 - INFO - __main__ - Step 9333: {'lr': 0.000496978278622257, 'samples': 1791936, 'steps': 9332, 'loss/train': 1.1061983108520508} 11/06/2021 22:32:48 - INFO - __main__ - Step 9334: {'lr': 0.000496977455975395, 'samples': 1792128, 'steps': 9333, 'loss/train': 1.6116451025009155} 11/06/2021 22:32:49 - INFO - __main__ - Step 9335: {'lr': 0.0004969766332172488, 'samples': 1792320, 'steps': 9334, 'loss/train': 2.009783983230591} 11/06/2021 22:32:49 - INFO - __main__ - Step 9336: {'lr': 0.0004969758103478187, 'samples': 1792512, 'steps': 9335, 'loss/train': 1.6776036024093628} 11/06/2021 22:32:50 - INFO - __main__ - Step 9337: {'lr': 0.0004969749873671051, 'samples': 1792704, 'steps': 9336, 'loss/train': 1.6824532747268677} 11/06/2021 22:32:50 - INFO - __main__ - Step 9338: {'lr': 0.0004969741642751085, 'samples': 1792896, 'steps': 9337, 'loss/train': 1.9121274948120117} 11/06/2021 22:32:51 - INFO - __main__ - Step 9339: {'lr': 0.000496973341071829, 'samples': 1793088, 'steps': 9338, 'loss/train': 1.8858976364135742} 11/06/2021 22:32:51 - INFO - __main__ - Step 9340: {'lr': 0.0004969725177572672, 'samples': 1793280, 'steps': 9339, 'loss/train': 1.5066814422607422} 11/06/2021 22:32:51 - INFO - __main__ - Step 9341: {'lr': 0.0004969716943314234, 'samples': 1793472, 'steps': 9340, 'loss/train': 1.8776342868804932} 11/06/2021 22:32:52 - INFO - __main__ - Step 9342: {'lr': 0.0004969708707942979, 'samples': 1793664, 'steps': 9341, 'loss/train': 2.180288314819336} 11/06/2021 22:32:53 - INFO - __main__ - Step 9343: {'lr': 0.0004969700471458913, 'samples': 1793856, 'steps': 9342, 'loss/train': 1.4142295122146606} 11/06/2021 22:32:53 - INFO - __main__ - Step 9344: {'lr': 0.0004969692233862036, 'samples': 1794048, 'steps': 9343, 'loss/train': 1.8621501922607422} 11/06/2021 22:32:53 - INFO - __main__ - Step 9345: {'lr': 0.0004969683995152355, 'samples': 1794240, 'steps': 9344, 'loss/train': 3.186241626739502} 11/06/2021 22:32:54 - INFO - __main__ - Step 9346: {'lr': 0.0004969675755329872, 'samples': 1794432, 'steps': 9345, 'loss/train': 1.875054121017456} 11/06/2021 22:32:55 - INFO - __main__ - Step 9347: {'lr': 0.0004969667514394592, 'samples': 1794624, 'steps': 9346, 'loss/train': 2.312483072280884} 11/06/2021 22:32:55 - INFO - __main__ - Step 9348: {'lr': 0.0004969659272346517, 'samples': 1794816, 'steps': 9347, 'loss/train': 1.7761608362197876} 11/06/2021 22:32:56 - INFO - __main__ - Step 9349: {'lr': 0.0004969651029185652, 'samples': 1795008, 'steps': 9348, 'loss/train': 1.646713137626648} 11/06/2021 22:32:56 - INFO - __main__ - Step 9350: {'lr': 0.0004969642784912001, 'samples': 1795200, 'steps': 9349, 'loss/train': 3.636963367462158} 11/06/2021 22:32:57 - INFO - __main__ - Step 9351: {'lr': 0.0004969634539525566, 'samples': 1795392, 'steps': 9350, 'loss/train': 2.115077495574951} 11/06/2021 22:32:57 - INFO - __main__ - Step 9352: {'lr': 0.0004969626293026353, 'samples': 1795584, 'steps': 9351, 'loss/train': 1.7228342294692993} 11/06/2021 22:32:58 - INFO - __main__ - Step 9353: {'lr': 0.0004969618045414363, 'samples': 1795776, 'steps': 9352, 'loss/train': 1.540665626525879} 11/06/2021 22:32:58 - INFO - __main__ - Step 9354: {'lr': 0.0004969609796689602, 'samples': 1795968, 'steps': 9353, 'loss/train': 1.7847797870635986} 11/06/2021 22:32:59 - INFO - __main__ - Step 9355: {'lr': 0.0004969601546852073, 'samples': 1796160, 'steps': 9354, 'loss/train': 1.4905754327774048} 11/06/2021 22:32:59 - INFO - __main__ - Step 9356: {'lr': 0.0004969593295901779, 'samples': 1796352, 'steps': 9355, 'loss/train': 1.998396873474121} 11/06/2021 22:32:59 - INFO - __main__ - Step 9357: {'lr': 0.0004969585043838725, 'samples': 1796544, 'steps': 9356, 'loss/train': 2.060382127761841} 11/06/2021 22:33:00 - INFO - __main__ - Step 9358: {'lr': 0.0004969576790662914, 'samples': 1796736, 'steps': 9357, 'loss/train': 1.823445200920105} 11/06/2021 22:33:01 - INFO - __main__ - Step 9359: {'lr': 0.0004969568536374349, 'samples': 1796928, 'steps': 9358, 'loss/train': 1.6767795085906982} 11/06/2021 22:33:01 - INFO - __main__ - Step 9360: {'lr': 0.0004969560280973036, 'samples': 1797120, 'steps': 9359, 'loss/train': 2.0261526107788086} 11/06/2021 22:33:01 - INFO - __main__ - Step 9361: {'lr': 0.0004969552024458976, 'samples': 1797312, 'steps': 9360, 'loss/train': 1.6986967325210571} 11/06/2021 22:33:02 - INFO - __main__ - Step 9362: {'lr': 0.0004969543766832176, 'samples': 1797504, 'steps': 9361, 'loss/train': 1.3113603591918945} 11/06/2021 22:33:03 - INFO - __main__ - Step 9363: {'lr': 0.0004969535508092635, 'samples': 1797696, 'steps': 9362, 'loss/train': 1.4548332691192627} 11/06/2021 22:33:03 - INFO - __main__ - Step 9364: {'lr': 0.0004969527248240361, 'samples': 1797888, 'steps': 9363, 'loss/train': 2.1433870792388916} 11/06/2021 22:33:03 - INFO - __main__ - Step 9365: {'lr': 0.0004969518987275356, 'samples': 1798080, 'steps': 9364, 'loss/train': 1.7857718467712402} 11/06/2021 22:33:04 - INFO - __main__ - Step 9366: {'lr': 0.0004969510725197624, 'samples': 1798272, 'steps': 9365, 'loss/train': 1.3735690116882324} 11/06/2021 22:33:04 - INFO - __main__ - Step 9367: {'lr': 0.0004969502462007167, 'samples': 1798464, 'steps': 9366, 'loss/train': 2.3249306678771973} 11/06/2021 22:33:05 - INFO - __main__ - Step 9368: {'lr': 0.0004969494197703992, 'samples': 1798656, 'steps': 9367, 'loss/train': 1.9111498594284058} 11/06/2021 22:33:05 - INFO - __main__ - Step 9369: {'lr': 0.00049694859322881, 'samples': 1798848, 'steps': 9368, 'loss/train': 1.7562119960784912} 11/06/2021 22:33:06 - INFO - __main__ - Step 9370: {'lr': 0.0004969477665759496, 'samples': 1799040, 'steps': 9369, 'loss/train': 1.8081409931182861} 11/06/2021 22:33:06 - INFO - __main__ - Step 9371: {'lr': 0.0004969469398118184, 'samples': 1799232, 'steps': 9370, 'loss/train': 1.3036324977874756} 11/06/2021 22:33:07 - INFO - __main__ - Step 9372: {'lr': 0.0004969461129364167, 'samples': 1799424, 'steps': 9371, 'loss/train': 1.8562147617340088} 11/06/2021 22:33:08 - INFO - __main__ - Step 9373: {'lr': 0.0004969452859497449, 'samples': 1799616, 'steps': 9372, 'loss/train': 1.927634596824646} 11/06/2021 22:33:08 - INFO - __main__ - Step 9374: {'lr': 0.0004969444588518034, 'samples': 1799808, 'steps': 9373, 'loss/train': 2.170872449874878} 11/06/2021 22:33:08 - INFO - __main__ - Step 9375: {'lr': 0.0004969436316425924, 'samples': 1800000, 'steps': 9374, 'loss/train': 1.852042317390442} 11/06/2021 22:33:09 - INFO - __main__ - Step 9376: {'lr': 0.0004969428043221125, 'samples': 1800192, 'steps': 9375, 'loss/train': 1.5770512819290161} 11/06/2021 22:33:09 - INFO - __main__ - Step 9377: {'lr': 0.000496941976890364, 'samples': 1800384, 'steps': 9376, 'loss/train': 1.6627328395843506} 11/06/2021 22:33:09 - INFO - __main__ - Step 9378: {'lr': 0.0004969411493473472, 'samples': 1800576, 'steps': 9377, 'loss/train': 1.592746376991272} 11/06/2021 22:33:11 - INFO - __main__ - Step 9379: {'lr': 0.0004969403216930626, 'samples': 1800768, 'steps': 9378, 'loss/train': 1.5629326105117798} 11/06/2021 22:33:11 - INFO - __main__ - Step 9380: {'lr': 0.0004969394939275105, 'samples': 1800960, 'steps': 9379, 'loss/train': 1.9716914892196655} 11/06/2021 22:33:11 - INFO - __main__ - Step 9381: {'lr': 0.0004969386660506912, 'samples': 1801152, 'steps': 9380, 'loss/train': 1.612454891204834} 11/06/2021 22:33:12 - INFO - __main__ - Step 9382: {'lr': 0.0004969378380626051, 'samples': 1801344, 'steps': 9381, 'loss/train': 1.764012336730957} 11/06/2021 22:33:12 - INFO - __main__ - Step 9383: {'lr': 0.0004969370099632528, 'samples': 1801536, 'steps': 9382, 'loss/train': 1.9213483333587646} 11/06/2021 22:33:13 - INFO - __main__ - Step 9384: {'lr': 0.0004969361817526343, 'samples': 1801728, 'steps': 9383, 'loss/train': 1.2012592554092407} 11/06/2021 22:33:14 - INFO - __main__ - Step 9385: {'lr': 0.0004969353534307504, 'samples': 1801920, 'steps': 9384, 'loss/train': 2.151750087738037} 11/06/2021 22:33:14 - INFO - __main__ - Step 9386: {'lr': 0.000496934524997601, 'samples': 1802112, 'steps': 9385, 'loss/train': 0.9660016894340515} 11/06/2021 22:33:14 - INFO - __main__ - Step 9387: {'lr': 0.0004969336964531869, 'samples': 1802304, 'steps': 9386, 'loss/train': 1.7892236709594727} 11/06/2021 22:33:15 - INFO - __main__ - Step 9388: {'lr': 0.0004969328677975083, 'samples': 1802496, 'steps': 9387, 'loss/train': 1.6319315433502197} 11/06/2021 22:33:15 - INFO - __main__ - Step 9389: {'lr': 0.0004969320390305654, 'samples': 1802688, 'steps': 9388, 'loss/train': 2.0772619247436523} 11/06/2021 22:33:16 - INFO - __main__ - Step 9390: {'lr': 0.0004969312101523588, 'samples': 1802880, 'steps': 9389, 'loss/train': 1.8143119812011719} 11/06/2021 22:33:17 - INFO - __main__ - Step 9391: {'lr': 0.0004969303811628888, 'samples': 1803072, 'steps': 9390, 'loss/train': 1.7659382820129395} 11/06/2021 22:33:17 - INFO - __main__ - Step 9392: {'lr': 0.0004969295520621558, 'samples': 1803264, 'steps': 9391, 'loss/train': 2.3056082725524902} 11/06/2021 22:33:17 - INFO - __main__ - Step 9393: {'lr': 0.0004969287228501602, 'samples': 1803456, 'steps': 9392, 'loss/train': 1.8238893747329712} 11/06/2021 22:33:18 - INFO - __main__ - Step 9394: {'lr': 0.0004969278935269022, 'samples': 1803648, 'steps': 9393, 'loss/train': 0.7025987505912781} 11/06/2021 22:33:19 - INFO - __main__ - Step 9395: {'lr': 0.0004969270640923823, 'samples': 1803840, 'steps': 9394, 'loss/train': 1.8956180810928345} 11/06/2021 22:33:19 - INFO - __main__ - Step 9396: {'lr': 0.0004969262345466011, 'samples': 1804032, 'steps': 9395, 'loss/train': 1.8042242527008057} 11/06/2021 22:33:19 - INFO - __main__ - Step 9397: {'lr': 0.0004969254048895585, 'samples': 1804224, 'steps': 9396, 'loss/train': 1.8003126382827759} 11/06/2021 22:33:20 - INFO - __main__ - Step 9398: {'lr': 0.0004969245751212552, 'samples': 1804416, 'steps': 9397, 'loss/train': 1.8957905769348145} 11/06/2021 22:33:20 - INFO - __main__ - Step 9399: {'lr': 0.0004969237452416915, 'samples': 1804608, 'steps': 9398, 'loss/train': 1.0701171159744263} 11/06/2021 22:33:21 - INFO - __main__ - Step 9400: {'lr': 0.0004969229152508678, 'samples': 1804800, 'steps': 9399, 'loss/train': 1.8020981550216675} 11/06/2021 22:33:22 - INFO - __main__ - Step 9401: {'lr': 0.0004969220851487844, 'samples': 1804992, 'steps': 9400, 'loss/train': 0.40175554156303406} 11/06/2021 22:33:22 - INFO - __main__ - Step 9402: {'lr': 0.0004969212549354418, 'samples': 1805184, 'steps': 9401, 'loss/train': 1.8108609914779663} 11/06/2021 22:33:22 - INFO - __main__ - Step 9403: {'lr': 0.0004969204246108402, 'samples': 1805376, 'steps': 9402, 'loss/train': 1.1330207586288452} 11/06/2021 22:33:23 - INFO - __main__ - Step 9404: {'lr': 0.0004969195941749801, 'samples': 1805568, 'steps': 9403, 'loss/train': 1.2011972665786743} 11/06/2021 22:33:23 - INFO - __main__ - Step 9405: {'lr': 0.000496918763627862, 'samples': 1805760, 'steps': 9404, 'loss/train': 1.7720991373062134} 11/06/2021 22:33:24 - INFO - __main__ - Step 9406: {'lr': 0.0004969179329694859, 'samples': 1805952, 'steps': 9405, 'loss/train': 1.7720216512680054} 11/06/2021 22:33:24 - INFO - __main__ - Step 9407: {'lr': 0.0004969171021998525, 'samples': 1806144, 'steps': 9406, 'loss/train': 1.1517207622528076} 11/06/2021 22:33:25 - INFO - __main__ - Step 9408: {'lr': 0.0004969162713189619, 'samples': 1806336, 'steps': 9407, 'loss/train': 1.5206893682479858} 11/06/2021 22:33:25 - INFO - __main__ - Step 9409: {'lr': 0.0004969154403268148, 'samples': 1806528, 'steps': 9408, 'loss/train': 1.8082882165908813} 11/06/2021 22:33:25 - INFO - __main__ - Step 9410: {'lr': 0.0004969146092234114, 'samples': 1806720, 'steps': 9409, 'loss/train': 1.3360158205032349} 11/06/2021 22:33:26 - INFO - __main__ - Step 9411: {'lr': 0.000496913778008752, 'samples': 1806912, 'steps': 9410, 'loss/train': 2.1113593578338623} 11/06/2021 22:33:27 - INFO - __main__ - Step 9412: {'lr': 0.0004969129466828371, 'samples': 1807104, 'steps': 9411, 'loss/train': 2.4222865104675293} 11/06/2021 22:33:27 - INFO - __main__ - Step 9413: {'lr': 0.0004969121152456671, 'samples': 1807296, 'steps': 9412, 'loss/train': 1.631757378578186} 11/06/2021 22:33:27 - INFO - __main__ - Step 9414: {'lr': 0.0004969112836972423, 'samples': 1807488, 'steps': 9413, 'loss/train': 1.678197979927063} 11/06/2021 22:33:28 - INFO - __main__ - Step 9415: {'lr': 0.000496910452037563, 'samples': 1807680, 'steps': 9414, 'loss/train': 2.7772085666656494} 11/06/2021 22:33:29 - INFO - __main__ - Step 9416: {'lr': 0.0004969096202666297, 'samples': 1807872, 'steps': 9415, 'loss/train': 1.9640326499938965} 11/06/2021 22:33:29 - INFO - __main__ - Step 9417: {'lr': 0.0004969087883844428, 'samples': 1808064, 'steps': 9416, 'loss/train': 1.4064429998397827} 11/06/2021 22:33:30 - INFO - __main__ - Step 9418: {'lr': 0.0004969079563910025, 'samples': 1808256, 'steps': 9417, 'loss/train': 1.8658392429351807} 11/06/2021 22:33:30 - INFO - __main__ - Step 9419: {'lr': 0.0004969071242863093, 'samples': 1808448, 'steps': 9418, 'loss/train': 2.186412811279297} 11/06/2021 22:33:30 - INFO - __main__ - Step 9420: {'lr': 0.0004969062920703636, 'samples': 1808640, 'steps': 9419, 'loss/train': 1.6242451667785645} 11/06/2021 22:33:31 - INFO - __main__ - Step 9421: {'lr': 0.0004969054597431658, 'samples': 1808832, 'steps': 9420, 'loss/train': 2.4329140186309814} 11/06/2021 22:33:32 - INFO - __main__ - Step 9422: {'lr': 0.0004969046273047161, 'samples': 1809024, 'steps': 9421, 'loss/train': 1.61650812625885} 11/06/2021 22:33:32 - INFO - __main__ - Step 9423: {'lr': 0.0004969037947550151, 'samples': 1809216, 'steps': 9422, 'loss/train': 1.550029993057251} 11/06/2021 22:33:32 - INFO - __main__ - Step 9424: {'lr': 0.000496902962094063, 'samples': 1809408, 'steps': 9423, 'loss/train': 1.670718789100647} 11/06/2021 22:33:33 - INFO - __main__ - Step 9425: {'lr': 0.0004969021293218602, 'samples': 1809600, 'steps': 9424, 'loss/train': 1.6841366291046143} 11/06/2021 22:33:34 - INFO - __main__ - Step 9426: {'lr': 0.0004969012964384071, 'samples': 1809792, 'steps': 9425, 'loss/train': 2.19000244140625} 11/06/2021 22:33:34 - INFO - __main__ - Step 9427: {'lr': 0.0004969004634437042, 'samples': 1809984, 'steps': 9426, 'loss/train': 1.5909390449523926} 11/06/2021 22:33:34 - INFO - __main__ - Step 9428: {'lr': 0.0004968996303377517, 'samples': 1810176, 'steps': 9427, 'loss/train': 2.3377866744995117} 11/06/2021 22:33:35 - INFO - __main__ - Step 9429: {'lr': 0.00049689879712055, 'samples': 1810368, 'steps': 9428, 'loss/train': 1.959466576576233} 11/06/2021 22:33:35 - INFO - __main__ - Step 9430: {'lr': 0.0004968979637920995, 'samples': 1810560, 'steps': 9429, 'loss/train': 2.1791741847991943} 11/06/2021 22:33:35 - INFO - __main__ - Step 9431: {'lr': 0.0004968971303524007, 'samples': 1810752, 'steps': 9430, 'loss/train': 1.2849524021148682} 11/06/2021 22:33:36 - INFO - __main__ - Step 9432: {'lr': 0.0004968962968014537, 'samples': 1810944, 'steps': 9431, 'loss/train': 1.7930939197540283} 11/06/2021 22:33:37 - INFO - __main__ - Step 9433: {'lr': 0.0004968954631392592, 'samples': 1811136, 'steps': 9432, 'loss/train': 1.8857711553573608} 11/06/2021 22:33:37 - INFO - __main__ - Step 9434: {'lr': 0.0004968946293658173, 'samples': 1811328, 'steps': 9433, 'loss/train': 1.6014503240585327} 11/06/2021 22:33:37 - INFO - __main__ - Step 9435: {'lr': 0.0004968937954811284, 'samples': 1811520, 'steps': 9434, 'loss/train': 1.701316237449646} 11/06/2021 22:33:38 - INFO - __main__ - Step 9436: {'lr': 0.0004968929614851932, 'samples': 1811712, 'steps': 9435, 'loss/train': 1.106537938117981} 11/06/2021 22:33:39 - INFO - __main__ - Step 9437: {'lr': 0.0004968921273780118, 'samples': 1811904, 'steps': 9436, 'loss/train': 1.812904715538025} 11/06/2021 22:33:39 - INFO - __main__ - Step 9438: {'lr': 0.0004968912931595845, 'samples': 1812096, 'steps': 9437, 'loss/train': 1.294390082359314} 11/06/2021 22:33:39 - INFO - __main__ - Step 9439: {'lr': 0.0004968904588299118, 'samples': 1812288, 'steps': 9438, 'loss/train': 1.2054589986801147} 11/06/2021 22:33:40 - INFO - __main__ - Step 9440: {'lr': 0.0004968896243889941, 'samples': 1812480, 'steps': 9439, 'loss/train': 1.6848827600479126} 11/06/2021 22:33:40 - INFO - __main__ - Step 9441: {'lr': 0.0004968887898368318, 'samples': 1812672, 'steps': 9440, 'loss/train': 2.0966432094573975} 11/06/2021 22:33:41 - INFO - __main__ - Step 9442: {'lr': 0.0004968879551734252, 'samples': 1812864, 'steps': 9441, 'loss/train': 1.6406598091125488} 11/06/2021 22:33:42 - INFO - __main__ - Step 9443: {'lr': 0.0004968871203987746, 'samples': 1813056, 'steps': 9442, 'loss/train': 1.871316909790039} 11/06/2021 22:33:42 - INFO - __main__ - Step 9444: {'lr': 0.0004968862855128806, 'samples': 1813248, 'steps': 9443, 'loss/train': 1.394817590713501} 11/06/2021 22:33:42 - INFO - __main__ - Step 9445: {'lr': 0.0004968854505157434, 'samples': 1813440, 'steps': 9444, 'loss/train': 1.726036787033081} 11/06/2021 22:33:43 - INFO - __main__ - Step 9446: {'lr': 0.0004968846154073634, 'samples': 1813632, 'steps': 9445, 'loss/train': 0.7175917625427246} 11/06/2021 22:33:44 - INFO - __main__ - Step 9447: {'lr': 0.0004968837801877411, 'samples': 1813824, 'steps': 9446, 'loss/train': 1.9097816944122314} 11/06/2021 22:33:44 - INFO - __main__ - Step 9448: {'lr': 0.0004968829448568766, 'samples': 1814016, 'steps': 9447, 'loss/train': 1.6947396993637085} 11/06/2021 22:33:44 - INFO - __main__ - Step 9449: {'lr': 0.0004968821094147706, 'samples': 1814208, 'steps': 9448, 'loss/train': 1.652908205986023} 11/06/2021 22:33:45 - INFO - __main__ - Step 9450: {'lr': 0.0004968812738614232, 'samples': 1814400, 'steps': 9449, 'loss/train': 1.8884683847427368} 11/06/2021 22:33:45 - INFO - __main__ - Step 9451: {'lr': 0.000496880438196835, 'samples': 1814592, 'steps': 9450, 'loss/train': 1.878915786743164} 11/06/2021 22:33:46 - INFO - __main__ - Step 9452: {'lr': 0.0004968796024210064, 'samples': 1814784, 'steps': 9451, 'loss/train': 1.934395670890808} 11/06/2021 22:33:46 - INFO - __main__ - Step 9453: {'lr': 0.0004968787665339375, 'samples': 1814976, 'steps': 9452, 'loss/train': 1.7748756408691406} 11/06/2021 22:33:47 - INFO - __main__ - Step 9454: {'lr': 0.0004968779305356289, 'samples': 1815168, 'steps': 9453, 'loss/train': 1.8355199098587036} 11/06/2021 22:33:47 - INFO - __main__ - Step 9455: {'lr': 0.0004968770944260808, 'samples': 1815360, 'steps': 9454, 'loss/train': 1.931564450263977} 11/06/2021 22:33:47 - INFO - __main__ - Step 9456: {'lr': 0.0004968762582052938, 'samples': 1815552, 'steps': 9455, 'loss/train': 1.959632396697998} 11/06/2021 22:33:48 - INFO - __main__ - Step 9457: {'lr': 0.0004968754218732682, 'samples': 1815744, 'steps': 9456, 'loss/train': 1.8338159322738647} 11/06/2021 22:33:49 - INFO - __main__ - Step 9458: {'lr': 0.0004968745854300043, 'samples': 1815936, 'steps': 9457, 'loss/train': 1.7420405149459839} 11/06/2021 22:33:49 - INFO - __main__ - Step 9459: {'lr': 0.0004968737488755025, 'samples': 1816128, 'steps': 9458, 'loss/train': 1.8314000368118286} 11/06/2021 22:33:49 - INFO - __main__ - Step 9460: {'lr': 0.0004968729122097632, 'samples': 1816320, 'steps': 9459, 'loss/train': 1.9877064228057861} 11/06/2021 22:33:50 - INFO - __main__ - Step 9461: {'lr': 0.0004968720754327867, 'samples': 1816512, 'steps': 9460, 'loss/train': 1.5283275842666626} 11/06/2021 22:33:51 - INFO - __main__ - Step 9462: {'lr': 0.0004968712385445737, 'samples': 1816704, 'steps': 9461, 'loss/train': 1.5519593954086304} 11/06/2021 22:33:51 - INFO - __main__ - Step 9463: {'lr': 0.0004968704015451241, 'samples': 1816896, 'steps': 9462, 'loss/train': 2.173038959503174} 11/06/2021 22:33:51 - INFO - __main__ - Step 9464: {'lr': 0.0004968695644344387, 'samples': 1817088, 'steps': 9463, 'loss/train': 2.0512888431549072} 11/06/2021 22:33:52 - INFO - __main__ - Step 9465: {'lr': 0.0004968687272125174, 'samples': 1817280, 'steps': 9464, 'loss/train': 1.5592188835144043} 11/06/2021 22:33:52 - INFO - __main__ - Step 9466: {'lr': 0.0004968678898793611, 'samples': 1817472, 'steps': 9465, 'loss/train': 1.7354072332382202} 11/06/2021 22:33:53 - INFO - __main__ - Step 9467: {'lr': 0.0004968670524349699, 'samples': 1817664, 'steps': 9466, 'loss/train': 1.9592326879501343} 11/06/2021 22:33:54 - INFO - __main__ - Step 9468: {'lr': 0.0004968662148793441, 'samples': 1817856, 'steps': 9467, 'loss/train': 1.6401126384735107} 11/06/2021 22:33:54 - INFO - __main__ - Step 9469: {'lr': 0.0004968653772124843, 'samples': 1818048, 'steps': 9468, 'loss/train': 1.6802074909210205} 11/06/2021 22:33:54 - INFO - __main__ - Step 9470: {'lr': 0.0004968645394343908, 'samples': 1818240, 'steps': 9469, 'loss/train': 1.5846213102340698} 11/06/2021 22:33:55 - INFO - __main__ - Step 9471: {'lr': 0.0004968637015450639, 'samples': 1818432, 'steps': 9470, 'loss/train': 1.8173738718032837} 11/06/2021 22:33:55 - INFO - __main__ - Step 9472: {'lr': 0.000496862863544504, 'samples': 1818624, 'steps': 9471, 'loss/train': 1.4764689207077026} 11/06/2021 22:33:56 - INFO - __main__ - Step 9473: {'lr': 0.0004968620254327114, 'samples': 1818816, 'steps': 9472, 'loss/train': 1.9476569890975952} 11/06/2021 22:33:56 - INFO - __main__ - Step 9474: {'lr': 0.0004968611872096868, 'samples': 1819008, 'steps': 9473, 'loss/train': 1.7643886804580688} 11/06/2021 22:33:57 - INFO - __main__ - Step 9475: {'lr': 0.0004968603488754302, 'samples': 1819200, 'steps': 9474, 'loss/train': 1.442833423614502} 11/06/2021 22:33:57 - INFO - __main__ - Step 9476: {'lr': 0.0004968595104299422, 'samples': 1819392, 'steps': 9475, 'loss/train': 2.2946574687957764} 11/06/2021 22:33:57 - INFO - __main__ - Step 9477: {'lr': 0.000496858671873223, 'samples': 1819584, 'steps': 9476, 'loss/train': 2.2401070594787598} 11/06/2021 22:33:58 - INFO - __main__ - Step 9478: {'lr': 0.0004968578332052733, 'samples': 1819776, 'steps': 9477, 'loss/train': 1.696811556816101} 11/06/2021 22:33:59 - INFO - __main__ - Step 9479: {'lr': 0.0004968569944260932, 'samples': 1819968, 'steps': 9478, 'loss/train': 2.2511560916900635} 11/06/2021 22:33:59 - INFO - __main__ - Step 9480: {'lr': 0.0004968561555356831, 'samples': 1820160, 'steps': 9479, 'loss/train': 1.6641721725463867} 11/06/2021 22:33:59 - INFO - __main__ - Step 9481: {'lr': 0.0004968553165340435, 'samples': 1820352, 'steps': 9480, 'loss/train': 1.7705086469650269} 11/06/2021 22:34:00 - INFO - __main__ - Step 9482: {'lr': 0.0004968544774211746, 'samples': 1820544, 'steps': 9481, 'loss/train': 1.7317209243774414} 11/06/2021 22:34:01 - INFO - __main__ - Step 9483: {'lr': 0.0004968536381970769, 'samples': 1820736, 'steps': 9482, 'loss/train': 1.7108385562896729} 11/06/2021 22:34:01 - INFO - __main__ - Step 9484: {'lr': 0.0004968527988617508, 'samples': 1820928, 'steps': 9483, 'loss/train': 1.9148114919662476} 11/06/2021 22:34:02 - INFO - __main__ - Step 9485: {'lr': 0.0004968519594151966, 'samples': 1821120, 'steps': 9484, 'loss/train': 2.120004653930664} 11/06/2021 22:34:02 - INFO - __main__ - Step 9486: {'lr': 0.0004968511198574147, 'samples': 1821312, 'steps': 9485, 'loss/train': 2.175170660018921} 11/06/2021 22:34:02 - INFO - __main__ - Step 9487: {'lr': 0.0004968502801884056, 'samples': 1821504, 'steps': 9486, 'loss/train': 1.964535117149353} 11/06/2021 22:34:03 - INFO - __main__ - Step 9488: {'lr': 0.0004968494404081695, 'samples': 1821696, 'steps': 9487, 'loss/train': 1.3058781623840332} 11/06/2021 22:34:04 - INFO - __main__ - Step 9489: {'lr': 0.0004968486005167069, 'samples': 1821888, 'steps': 9488, 'loss/train': 1.7480806112289429} 11/06/2021 22:34:04 - INFO - __main__ - Step 9490: {'lr': 0.000496847760514018, 'samples': 1822080, 'steps': 9489, 'loss/train': 1.46340012550354} 11/06/2021 22:34:04 - INFO - __main__ - Step 9491: {'lr': 0.0004968469204001035, 'samples': 1822272, 'steps': 9490, 'loss/train': 2.2288196086883545} 11/06/2021 22:34:05 - INFO - __main__ - Step 9492: {'lr': 0.0004968460801749635, 'samples': 1822464, 'steps': 9491, 'loss/train': 1.707292914390564} 11/06/2021 22:34:05 - INFO - __main__ - Step 9493: {'lr': 0.0004968452398385984, 'samples': 1822656, 'steps': 9492, 'loss/train': 1.734236240386963} 11/06/2021 22:34:06 - INFO - __main__ - Step 9494: {'lr': 0.0004968443993910086, 'samples': 1822848, 'steps': 9493, 'loss/train': 1.8103415966033936} 11/06/2021 22:34:07 - INFO - __main__ - Step 9495: {'lr': 0.0004968435588321947, 'samples': 1823040, 'steps': 9494, 'loss/train': 1.4598149061203003} 11/06/2021 22:34:07 - INFO - __main__ - Step 9496: {'lr': 0.0004968427181621567, 'samples': 1823232, 'steps': 9495, 'loss/train': 1.8984959125518799} 11/06/2021 22:34:07 - INFO - __main__ - Step 9497: {'lr': 0.0004968418773808954, 'samples': 1823424, 'steps': 9496, 'loss/train': 1.705300211906433} 11/06/2021 22:34:08 - INFO - __main__ - Step 9498: {'lr': 0.0004968410364884109, 'samples': 1823616, 'steps': 9497, 'loss/train': 1.6887762546539307} 11/06/2021 22:34:09 - INFO - __main__ - Step 9499: {'lr': 0.0004968401954847035, 'samples': 1823808, 'steps': 9498, 'loss/train': 1.4879417419433594} 11/06/2021 22:34:09 - INFO - __main__ - Step 9500: {'lr': 0.0004968393543697739, 'samples': 1824000, 'steps': 9499, 'loss/train': 1.49965238571167} 11/06/2021 22:34:09 - INFO - __main__ - Step 9501: {'lr': 0.0004968385131436222, 'samples': 1824192, 'steps': 9500, 'loss/train': 1.9283766746520996} 11/06/2021 22:34:10 - INFO - __main__ - Step 9502: {'lr': 0.0004968376718062488, 'samples': 1824384, 'steps': 9501, 'loss/train': 1.9032224416732788} 11/06/2021 22:34:10 - INFO - __main__ - Step 9503: {'lr': 0.0004968368303576542, 'samples': 1824576, 'steps': 9502, 'loss/train': 2.1025607585906982} 11/06/2021 22:34:11 - INFO - __main__ - Step 9504: {'lr': 0.0004968359887978389, 'samples': 1824768, 'steps': 9503, 'loss/train': 0.9440047144889832} 11/06/2021 22:34:11 - INFO - __main__ - Step 9505: {'lr': 0.0004968351471268029, 'samples': 1824960, 'steps': 9504, 'loss/train': 2.4170825481414795} 11/06/2021 22:34:12 - INFO - __main__ - Step 9506: {'lr': 0.0004968343053445469, 'samples': 1825152, 'steps': 9505, 'loss/train': 1.723289132118225} 11/06/2021 22:34:12 - INFO - __main__ - Step 9507: {'lr': 0.0004968334634510712, 'samples': 1825344, 'steps': 9506, 'loss/train': 2.1578383445739746} 11/06/2021 22:34:13 - INFO - __main__ - Step 9508: {'lr': 0.000496832621446376, 'samples': 1825536, 'steps': 9507, 'loss/train': 1.3967469930648804} 11/06/2021 22:34:13 - INFO - __main__ - Step 9509: {'lr': 0.000496831779330462, 'samples': 1825728, 'steps': 9508, 'loss/train': 1.5260154008865356} 11/06/2021 22:34:14 - INFO - __main__ - Step 9510: {'lr': 0.0004968309371033293, 'samples': 1825920, 'steps': 9509, 'loss/train': 1.9803547859191895} 11/06/2021 22:34:14 - INFO - __main__ - Step 9511: {'lr': 0.0004968300947649784, 'samples': 1826112, 'steps': 9510, 'loss/train': 1.811102271080017} 11/06/2021 22:34:15 - INFO - __main__ - Step 9512: {'lr': 0.0004968292523154096, 'samples': 1826304, 'steps': 9511, 'loss/train': 1.6655768156051636} 11/06/2021 22:34:15 - INFO - __main__ - Step 9513: {'lr': 0.0004968284097546235, 'samples': 1826496, 'steps': 9512, 'loss/train': 1.665718913078308} 11/06/2021 22:34:15 - INFO - __main__ - Step 9514: {'lr': 0.0004968275670826204, 'samples': 1826688, 'steps': 9513, 'loss/train': 1.7052531242370605} 11/06/2021 22:34:16 - INFO - __main__ - Step 9515: {'lr': 0.0004968267242994003, 'samples': 1826880, 'steps': 9514, 'loss/train': 1.3750792741775513} 11/06/2021 22:34:17 - INFO - __main__ - Step 9516: {'lr': 0.0004968258814049641, 'samples': 1827072, 'steps': 9515, 'loss/train': 1.3806614875793457} 11/06/2021 22:34:17 - INFO - __main__ - Step 9517: {'lr': 0.0004968250383993119, 'samples': 1827264, 'steps': 9516, 'loss/train': 2.3476414680480957} 11/06/2021 22:34:17 - INFO - __main__ - Step 9518: {'lr': 0.0004968241952824442, 'samples': 1827456, 'steps': 9517, 'loss/train': 1.6158838272094727} 11/06/2021 22:34:18 - INFO - __main__ - Step 9519: {'lr': 0.0004968233520543613, 'samples': 1827648, 'steps': 9518, 'loss/train': 2.4204187393188477} 11/06/2021 22:34:19 - INFO - __main__ - Step 9520: {'lr': 0.0004968225087150636, 'samples': 1827840, 'steps': 9519, 'loss/train': 1.876865267753601} 11/06/2021 22:34:19 - INFO - __main__ - Step 9521: {'lr': 0.0004968216652645515, 'samples': 1828032, 'steps': 9520, 'loss/train': 2.089869737625122} 11/06/2021 22:34:20 - INFO - __main__ - Step 9522: {'lr': 0.0004968208217028254, 'samples': 1828224, 'steps': 9521, 'loss/train': 1.1889524459838867} 11/06/2021 22:34:20 - INFO - __main__ - Step 9523: {'lr': 0.0004968199780298855, 'samples': 1828416, 'steps': 9522, 'loss/train': 1.8859481811523438} 11/06/2021 22:34:20 - INFO - __main__ - Step 9524: {'lr': 0.0004968191342457325, 'samples': 1828608, 'steps': 9523, 'loss/train': 1.6368975639343262} 11/06/2021 22:34:22 - INFO - __main__ - Step 9525: {'lr': 0.0004968182903503665, 'samples': 1828800, 'steps': 9524, 'loss/train': 1.5924121141433716} 11/06/2021 22:34:22 - INFO - __main__ - Step 9526: {'lr': 0.0004968174463437881, 'samples': 1828992, 'steps': 9525, 'loss/train': 1.3622881174087524} 11/06/2021 22:34:22 - INFO - __main__ - Step 9527: {'lr': 0.0004968166022259974, 'samples': 1829184, 'steps': 9526, 'loss/train': 1.6611684560775757} 11/06/2021 22:34:23 - INFO - __main__ - Step 9528: {'lr': 0.0004968157579969951, 'samples': 1829376, 'steps': 9527, 'loss/train': 1.8583652973175049} 11/06/2021 22:34:23 - INFO - __main__ - Step 9529: {'lr': 0.0004968149136567814, 'samples': 1829568, 'steps': 9528, 'loss/train': 1.5691323280334473} 11/06/2021 22:34:23 - INFO - __main__ - Step 9530: {'lr': 0.0004968140692053567, 'samples': 1829760, 'steps': 9529, 'loss/train': 1.7236870527267456} 11/06/2021 22:34:24 - INFO - __main__ - Step 9531: {'lr': 0.0004968132246427212, 'samples': 1829952, 'steps': 9530, 'loss/train': 2.275637149810791} 11/06/2021 22:34:25 - INFO - __main__ - Step 9532: {'lr': 0.0004968123799688757, 'samples': 1830144, 'steps': 9531, 'loss/train': 1.7203338146209717} 11/06/2021 22:34:25 - INFO - __main__ - Step 9533: {'lr': 0.0004968115351838203, 'samples': 1830336, 'steps': 9532, 'loss/train': 2.002017021179199} 11/06/2021 22:34:26 - INFO - __main__ - Step 9534: {'lr': 0.0004968106902875554, 'samples': 1830528, 'steps': 9533, 'loss/train': 1.5515137910842896} 11/06/2021 22:34:26 - INFO - __main__ - Step 9535: {'lr': 0.0004968098452800815, 'samples': 1830720, 'steps': 9534, 'loss/train': 1.9174106121063232} 11/06/2021 22:34:26 - INFO - __main__ - Step 9536: {'lr': 0.0004968090001613987, 'samples': 1830912, 'steps': 9535, 'loss/train': 1.4403159618377686} 11/06/2021 22:34:27 - INFO - __main__ - Step 9537: {'lr': 0.0004968081549315078, 'samples': 1831104, 'steps': 9536, 'loss/train': 1.640446662902832} 11/06/2021 22:34:28 - INFO - __main__ - Step 9538: {'lr': 0.0004968073095904088, 'samples': 1831296, 'steps': 9537, 'loss/train': 1.7125294208526611} 11/06/2021 22:34:28 - INFO - __main__ - Step 9539: {'lr': 0.0004968064641381022, 'samples': 1831488, 'steps': 9538, 'loss/train': 2.010976552963257} 11/06/2021 22:34:28 - INFO - __main__ - Step 9540: {'lr': 0.0004968056185745886, 'samples': 1831680, 'steps': 9539, 'loss/train': 2.09594464302063} 11/06/2021 22:34:29 - INFO - __main__ - Step 9541: {'lr': 0.000496804772899868, 'samples': 1831872, 'steps': 9540, 'loss/train': 1.843455195426941} 11/06/2021 22:34:30 - INFO - __main__ - Step 9542: {'lr': 0.0004968039271139412, 'samples': 1832064, 'steps': 9541, 'loss/train': 2.009514331817627} 11/06/2021 22:34:30 - INFO - __main__ - Step 9543: {'lr': 0.0004968030812168082, 'samples': 1832256, 'steps': 9542, 'loss/train': 1.7577793598175049} 11/06/2021 22:34:30 - INFO - __main__ - Step 9544: {'lr': 0.0004968022352084695, 'samples': 1832448, 'steps': 9543, 'loss/train': 2.0828323364257812} 11/06/2021 22:34:31 - INFO - __main__ - Step 9545: {'lr': 0.0004968013890889256, 'samples': 1832640, 'steps': 9544, 'loss/train': 1.7458568811416626} 11/06/2021 22:34:31 - INFO - __main__ - Step 9546: {'lr': 0.0004968005428581767, 'samples': 1832832, 'steps': 9545, 'loss/train': 2.1240222454071045} 11/06/2021 22:34:32 - INFO - __main__ - Step 9547: {'lr': 0.0004967996965162235, 'samples': 1833024, 'steps': 9546, 'loss/train': 1.8390196561813354} 11/06/2021 22:34:32 - INFO - __main__ - Step 9548: {'lr': 0.0004967988500630661, 'samples': 1833216, 'steps': 9547, 'loss/train': 1.910079836845398} 11/06/2021 22:34:33 - INFO - __main__ - Step 9549: {'lr': 0.0004967980034987048, 'samples': 1833408, 'steps': 9548, 'loss/train': 2.003260374069214} 11/06/2021 22:34:33 - INFO - __main__ - Step 9550: {'lr': 0.0004967971568231402, 'samples': 1833600, 'steps': 9549, 'loss/train': 1.5235668420791626} 11/06/2021 22:34:33 - INFO - __main__ - Step 9551: {'lr': 0.0004967963100363726, 'samples': 1833792, 'steps': 9550, 'loss/train': 1.7840920686721802} 11/06/2021 22:34:35 - INFO - __main__ - Step 9552: {'lr': 0.0004967954631384025, 'samples': 1833984, 'steps': 9551, 'loss/train': 1.2505114078521729} 11/06/2021 22:34:35 - INFO - __main__ - Step 9553: {'lr': 0.00049679461612923, 'samples': 1834176, 'steps': 9552, 'loss/train': 1.4225518703460693} 11/06/2021 22:34:36 - INFO - __main__ - Step 9554: {'lr': 0.0004967937690088558, 'samples': 1834368, 'steps': 9553, 'loss/train': 2.004608154296875} 11/06/2021 22:34:36 - INFO - __main__ - Step 9555: {'lr': 0.0004967929217772801, 'samples': 1834560, 'steps': 9554, 'loss/train': 2.0895955562591553} 11/06/2021 22:34:36 - INFO - __main__ - Step 9556: {'lr': 0.0004967920744345033, 'samples': 1834752, 'steps': 9555, 'loss/train': 0.44858700037002563} 11/06/2021 22:34:37 - INFO - __main__ - Step 9557: {'lr': 0.0004967912269805257, 'samples': 1834944, 'steps': 9556, 'loss/train': 2.1345250606536865} 11/06/2021 22:34:38 - INFO - __main__ - Step 9558: {'lr': 0.000496790379415348, 'samples': 1835136, 'steps': 9557, 'loss/train': 1.6229137182235718} 11/06/2021 22:34:38 - INFO - __main__ - Step 9559: {'lr': 0.0004967895317389702, 'samples': 1835328, 'steps': 9558, 'loss/train': 1.6342477798461914} 11/06/2021 22:34:38 - INFO - __main__ - Step 9560: {'lr': 0.0004967886839513929, 'samples': 1835520, 'steps': 9559, 'loss/train': 1.9332375526428223} 11/06/2021 22:34:39 - INFO - __main__ - Step 9561: {'lr': 0.0004967878360526163, 'samples': 1835712, 'steps': 9560, 'loss/train': 1.733110785484314} 11/06/2021 22:34:39 - INFO - __main__ - Step 9562: {'lr': 0.0004967869880426411, 'samples': 1835904, 'steps': 9561, 'loss/train': 1.7204447984695435} 11/06/2021 22:34:40 - INFO - __main__ - Step 9563: {'lr': 0.0004967861399214674, 'samples': 1836096, 'steps': 9562, 'loss/train': 1.7751929759979248} 11/06/2021 22:34:40 - INFO - __main__ - Step 9564: {'lr': 0.0004967852916890958, 'samples': 1836288, 'steps': 9563, 'loss/train': 1.7982103824615479} 11/06/2021 22:34:41 - INFO - __main__ - Step 9565: {'lr': 0.0004967844433455263, 'samples': 1836480, 'steps': 9564, 'loss/train': 2.2210209369659424} 11/06/2021 22:34:41 - INFO - __main__ - Step 9566: {'lr': 0.0004967835948907598, 'samples': 1836672, 'steps': 9565, 'loss/train': 1.8117777109146118} 11/06/2021 22:34:41 - INFO - __main__ - Step 9567: {'lr': 0.0004967827463247962, 'samples': 1836864, 'steps': 9566, 'loss/train': 1.7700427770614624} 11/06/2021 22:34:42 - INFO - __main__ - Step 9568: {'lr': 0.0004967818976476363, 'samples': 1837056, 'steps': 9567, 'loss/train': 1.9625296592712402} 11/06/2021 22:34:43 - INFO - __main__ - Step 9569: {'lr': 0.0004967810488592801, 'samples': 1837248, 'steps': 9568, 'loss/train': 1.7147427797317505} 11/06/2021 22:34:43 - INFO - __main__ - Step 9570: {'lr': 0.0004967801999597283, 'samples': 1837440, 'steps': 9569, 'loss/train': 1.954040765762329} 11/06/2021 22:34:43 - INFO - __main__ - Step 9571: {'lr': 0.0004967793509489811, 'samples': 1837632, 'steps': 9570, 'loss/train': 2.0516250133514404} 11/06/2021 22:34:44 - INFO - __main__ - Step 9572: {'lr': 0.0004967785018270389, 'samples': 1837824, 'steps': 9571, 'loss/train': 1.2908155918121338} 11/06/2021 22:34:45 - INFO - __main__ - Step 9573: {'lr': 0.0004967776525939022, 'samples': 1838016, 'steps': 9572, 'loss/train': 1.586142897605896} 11/06/2021 22:34:45 - INFO - __main__ - Step 9574: {'lr': 0.0004967768032495712, 'samples': 1838208, 'steps': 9573, 'loss/train': 1.9211750030517578} 11/06/2021 22:34:46 - INFO - __main__ - Step 9575: {'lr': 0.0004967759537940464, 'samples': 1838400, 'steps': 9574, 'loss/train': 6.170900821685791} 11/06/2021 22:34:46 - INFO - __main__ - Step 9576: {'lr': 0.0004967751042273282, 'samples': 1838592, 'steps': 9575, 'loss/train': 2.606065273284912} 11/06/2021 22:34:46 - INFO - __main__ - Step 9577: {'lr': 0.000496774254549417, 'samples': 1838784, 'steps': 9576, 'loss/train': 1.8838642835617065} 11/06/2021 22:34:47 - INFO - __main__ - Step 9578: {'lr': 0.0004967734047603131, 'samples': 1838976, 'steps': 9577, 'loss/train': 1.8725879192352295} 11/06/2021 22:34:48 - INFO - __main__ - Step 9579: {'lr': 0.0004967725548600168, 'samples': 1839168, 'steps': 9578, 'loss/train': 1.7218821048736572} 11/06/2021 22:34:48 - INFO - __main__ - Step 9580: {'lr': 0.0004967717048485287, 'samples': 1839360, 'steps': 9579, 'loss/train': 2.031545400619507} 11/06/2021 22:34:48 - INFO - __main__ - Step 9581: {'lr': 0.000496770854725849, 'samples': 1839552, 'steps': 9580, 'loss/train': 1.9540653228759766} 11/06/2021 22:34:49 - INFO - __main__ - Step 9582: {'lr': 0.0004967700044919783, 'samples': 1839744, 'steps': 9581, 'loss/train': 2.139894485473633} 11/06/2021 22:34:49 - INFO - __main__ - Step 9583: {'lr': 0.0004967691541469167, 'samples': 1839936, 'steps': 9582, 'loss/train': 1.7633771896362305} 11/06/2021 22:34:50 - INFO - __main__ - Step 9584: {'lr': 0.0004967683036906648, 'samples': 1840128, 'steps': 9583, 'loss/train': 2.204907178878784} 11/06/2021 22:34:50 - INFO - __main__ - Step 9585: {'lr': 0.0004967674531232229, 'samples': 1840320, 'steps': 9584, 'loss/train': 1.9411033391952515} 11/06/2021 22:34:51 - INFO - __main__ - Step 9586: {'lr': 0.0004967666024445913, 'samples': 1840512, 'steps': 9585, 'loss/train': 1.1677578687667847} 11/06/2021 22:34:51 - INFO - __main__ - Step 9587: {'lr': 0.0004967657516547707, 'samples': 1840704, 'steps': 9586, 'loss/train': 1.5230824947357178} 11/06/2021 22:34:51 - INFO - __main__ - Step 9588: {'lr': 0.0004967649007537611, 'samples': 1840896, 'steps': 9587, 'loss/train': 1.3074101209640503} 11/06/2021 22:34:52 - INFO - __main__ - Step 9589: {'lr': 0.0004967640497415631, 'samples': 1841088, 'steps': 9588, 'loss/train': 2.527937412261963} 11/06/2021 22:34:53 - INFO - __main__ - Step 9590: {'lr': 0.000496763198618177, 'samples': 1841280, 'steps': 9589, 'loss/train': 2.1263439655303955} 11/06/2021 22:34:53 - INFO - __main__ - Step 9591: {'lr': 0.0004967623473836032, 'samples': 1841472, 'steps': 9590, 'loss/train': 1.8565070629119873} 11/06/2021 22:34:53 - INFO - __main__ - Step 9592: {'lr': 0.0004967614960378421, 'samples': 1841664, 'steps': 9591, 'loss/train': 2.0305135250091553} 11/06/2021 22:34:54 - INFO - __main__ - Step 9593: {'lr': 0.000496760644580894, 'samples': 1841856, 'steps': 9592, 'loss/train': 1.905313491821289} 11/06/2021 22:34:55 - INFO - __main__ - Step 9594: {'lr': 0.0004967597930127595, 'samples': 1842048, 'steps': 9593, 'loss/train': 2.1325924396514893} 11/06/2021 22:34:55 - INFO - __main__ - Step 9595: {'lr': 0.0004967589413334387, 'samples': 1842240, 'steps': 9594, 'loss/train': 1.6247498989105225} 11/06/2021 22:34:56 - INFO - __main__ - Step 9596: {'lr': 0.0004967580895429322, 'samples': 1842432, 'steps': 9595, 'loss/train': 1.7681033611297607} 11/06/2021 22:34:56 - INFO - __main__ - Step 9597: {'lr': 0.0004967572376412405, 'samples': 1842624, 'steps': 9596, 'loss/train': 1.8701852560043335} 11/06/2021 22:34:56 - INFO - __main__ - Step 9598: {'lr': 0.0004967563856283636, 'samples': 1842816, 'steps': 9597, 'loss/train': 1.6864864826202393} 11/06/2021 22:34:58 - INFO - __main__ - Step 9599: {'lr': 0.000496755533504302, 'samples': 1843008, 'steps': 9598, 'loss/train': 1.842852234840393} 11/06/2021 22:34:58 - INFO - __main__ - Step 9600: {'lr': 0.0004967546812690563, 'samples': 1843200, 'steps': 9599, 'loss/train': 1.65439772605896} 11/06/2021 22:34:58 - INFO - __main__ - Step 9601: {'lr': 0.0004967538289226267, 'samples': 1843392, 'steps': 9600, 'loss/train': 1.5017353296279907} 11/06/2021 22:34:59 - INFO - __main__ - Step 9602: {'lr': 0.0004967529764650137, 'samples': 1843584, 'steps': 9601, 'loss/train': 1.895011067390442} 11/06/2021 22:34:59 - INFO - __main__ - Step 9603: {'lr': 0.0004967521238962175, 'samples': 1843776, 'steps': 9602, 'loss/train': 1.7376617193222046} 11/06/2021 22:34:59 - INFO - __main__ - Step 9604: {'lr': 0.0004967512712162387, 'samples': 1843968, 'steps': 9603, 'loss/train': 1.7263696193695068} 11/06/2021 22:35:00 - INFO - __main__ - Step 9605: {'lr': 0.0004967504184250775, 'samples': 1844160, 'steps': 9604, 'loss/train': 2.917311906814575} 11/06/2021 22:35:01 - INFO - __main__ - Step 9606: {'lr': 0.0004967495655227344, 'samples': 1844352, 'steps': 9605, 'loss/train': 2.3843181133270264} 11/06/2021 22:35:01 - INFO - __main__ - Step 9607: {'lr': 0.0004967487125092098, 'samples': 1844544, 'steps': 9606, 'loss/train': 1.349475622177124} 11/06/2021 22:35:02 - INFO - __main__ - Step 9608: {'lr': 0.0004967478593845041, 'samples': 1844736, 'steps': 9607, 'loss/train': 1.40731942653656} 11/06/2021 22:35:02 - INFO - __main__ - Step 9609: {'lr': 0.0004967470061486175, 'samples': 1844928, 'steps': 9608, 'loss/train': 1.621782660484314} 11/06/2021 22:35:02 - INFO - __main__ - Step 9610: {'lr': 0.0004967461528015506, 'samples': 1845120, 'steps': 9609, 'loss/train': 1.7241379022598267} 11/06/2021 22:35:03 - INFO - __main__ - Step 9611: {'lr': 0.0004967452993433036, 'samples': 1845312, 'steps': 9610, 'loss/train': 1.7397297620773315} 11/06/2021 22:35:04 - INFO - __main__ - Step 9612: {'lr': 0.0004967444457738769, 'samples': 1845504, 'steps': 9611, 'loss/train': 1.5325270891189575} 11/06/2021 22:35:04 - INFO - __main__ - Step 9613: {'lr': 0.0004967435920932711, 'samples': 1845696, 'steps': 9612, 'loss/train': 1.8796131610870361} 11/06/2021 22:35:04 - INFO - __main__ - Step 9614: {'lr': 0.0004967427383014865, 'samples': 1845888, 'steps': 9613, 'loss/train': 2.0587081909179688} 11/06/2021 22:35:05 - INFO - __main__ - Step 9615: {'lr': 0.0004967418843985233, 'samples': 1846080, 'steps': 9614, 'loss/train': 1.6247023344039917} 11/06/2021 22:35:06 - INFO - __main__ - Step 9616: {'lr': 0.0004967410303843821, 'samples': 1846272, 'steps': 9615, 'loss/train': 2.14107346534729} 11/06/2021 22:35:06 - INFO - __main__ - Step 9617: {'lr': 0.0004967401762590631, 'samples': 1846464, 'steps': 9616, 'loss/train': 1.674919605255127} 11/06/2021 22:35:06 - INFO - __main__ - Step 9618: {'lr': 0.0004967393220225668, 'samples': 1846656, 'steps': 9617, 'loss/train': 1.9346179962158203} 11/06/2021 22:35:07 - INFO - __main__ - Step 9619: {'lr': 0.0004967384676748936, 'samples': 1846848, 'steps': 9618, 'loss/train': 1.5423815250396729} 11/06/2021 22:35:07 - INFO - __main__ - Step 9620: {'lr': 0.0004967376132160438, 'samples': 1847040, 'steps': 9619, 'loss/train': 2.3580563068389893} 11/06/2021 22:35:08 - INFO - __main__ - Step 9621: {'lr': 0.000496736758646018, 'samples': 1847232, 'steps': 9620, 'loss/train': 1.7103986740112305} 11/06/2021 22:35:09 - INFO - __main__ - Step 9622: {'lr': 0.0004967359039648163, 'samples': 1847424, 'steps': 9621, 'loss/train': 1.5623164176940918} 11/06/2021 22:35:09 - INFO - __main__ - Step 9623: {'lr': 0.0004967350491724392, 'samples': 1847616, 'steps': 9622, 'loss/train': 1.5282793045043945} 11/06/2021 22:35:09 - INFO - __main__ - Step 9624: {'lr': 0.0004967341942688872, 'samples': 1847808, 'steps': 9623, 'loss/train': 1.75148344039917} 11/06/2021 22:35:10 - INFO - __main__ - Step 9625: {'lr': 0.0004967333392541604, 'samples': 1848000, 'steps': 9624, 'loss/train': 1.2734038829803467} 11/06/2021 22:35:11 - INFO - __main__ - Step 9626: {'lr': 0.0004967324841282596, 'samples': 1848192, 'steps': 9625, 'loss/train': 1.7880160808563232} 11/06/2021 22:35:11 - INFO - __main__ - Step 9627: {'lr': 0.0004967316288911847, 'samples': 1848384, 'steps': 9626, 'loss/train': 1.675197958946228} 11/06/2021 22:35:11 - INFO - __main__ - Step 9628: {'lr': 0.0004967307735429365, 'samples': 1848576, 'steps': 9627, 'loss/train': 2.0006182193756104} 11/06/2021 22:35:12 - INFO - __main__ - Step 9629: {'lr': 0.0004967299180835153, 'samples': 1848768, 'steps': 9628, 'loss/train': 1.4124494791030884} 11/06/2021 22:35:12 - INFO - __main__ - Step 9630: {'lr': 0.0004967290625129212, 'samples': 1848960, 'steps': 9629, 'loss/train': 1.5025792121887207} 11/06/2021 22:35:12 - INFO - __main__ - Step 9631: {'lr': 0.0004967282068311548, 'samples': 1849152, 'steps': 9630, 'loss/train': 2.0721802711486816} 11/06/2021 22:35:14 - INFO - __main__ - Step 9632: {'lr': 0.0004967273510382166, 'samples': 1849344, 'steps': 9631, 'loss/train': 1.781577229499817} 11/06/2021 22:35:14 - INFO - __main__ - Step 9633: {'lr': 0.0004967264951341069, 'samples': 1849536, 'steps': 9632, 'loss/train': 1.7446000576019287} 11/06/2021 22:35:14 - INFO - __main__ - Step 9634: {'lr': 0.0004967256391188258, 'samples': 1849728, 'steps': 9633, 'loss/train': 1.9075095653533936} 11/06/2021 22:35:15 - INFO - __main__ - Step 9635: {'lr': 0.0004967247829923742, 'samples': 1849920, 'steps': 9634, 'loss/train': 2.3468635082244873} 11/06/2021 22:35:15 - INFO - __main__ - Step 9636: {'lr': 0.0004967239267547521, 'samples': 1850112, 'steps': 9635, 'loss/train': 1.420114278793335} 11/06/2021 22:35:16 - INFO - __main__ - Step 9637: {'lr': 0.00049672307040596, 'samples': 1850304, 'steps': 9636, 'loss/train': 1.8249211311340332} 11/06/2021 22:35:17 - INFO - __main__ - Step 9638: {'lr': 0.0004967222139459983, 'samples': 1850496, 'steps': 9637, 'loss/train': 0.8801685571670532} 11/06/2021 22:35:17 - INFO - __main__ - Step 9639: {'lr': 0.0004967213573748674, 'samples': 1850688, 'steps': 9638, 'loss/train': 1.8244924545288086} 11/06/2021 22:35:17 - INFO - __main__ - Step 9640: {'lr': 0.0004967205006925677, 'samples': 1850880, 'steps': 9639, 'loss/train': 1.8652440309524536} 11/06/2021 22:35:18 - INFO - __main__ - Step 9641: {'lr': 0.0004967196438990995, 'samples': 1851072, 'steps': 9640, 'loss/train': 1.3965966701507568} 11/06/2021 22:35:18 - INFO - __main__ - Step 9642: {'lr': 0.0004967187869944632, 'samples': 1851264, 'steps': 9641, 'loss/train': 2.0054564476013184} 11/06/2021 22:35:19 - INFO - __main__ - Step 9643: {'lr': 0.0004967179299786593, 'samples': 1851456, 'steps': 9642, 'loss/train': 0.7685950994491577} 11/06/2021 22:35:19 - INFO - __main__ - Step 9644: {'lr': 0.000496717072851688, 'samples': 1851648, 'steps': 9643, 'loss/train': 1.9576327800750732} 11/06/2021 22:35:20 - INFO - __main__ - Step 9645: {'lr': 0.0004967162156135499, 'samples': 1851840, 'steps': 9644, 'loss/train': 1.0496654510498047} 11/06/2021 22:35:20 - INFO - __main__ - Step 9646: {'lr': 0.0004967153582642452, 'samples': 1852032, 'steps': 9645, 'loss/train': 5.934883117675781} 11/06/2021 22:35:21 - INFO - __main__ - Step 9647: {'lr': 0.0004967145008037744, 'samples': 1852224, 'steps': 9646, 'loss/train': 1.2428544759750366} 11/06/2021 22:35:21 - INFO - __main__ - Step 9648: {'lr': 0.000496713643232138, 'samples': 1852416, 'steps': 9647, 'loss/train': 1.7407283782958984} 11/06/2021 22:35:22 - INFO - __main__ - Step 9649: {'lr': 0.000496712785549336, 'samples': 1852608, 'steps': 9648, 'loss/train': 1.9216516017913818} 11/06/2021 22:35:22 - INFO - __main__ - Step 9650: {'lr': 0.0004967119277553692, 'samples': 1852800, 'steps': 9649, 'loss/train': 1.5039646625518799} 11/06/2021 22:35:23 - INFO - __main__ - Step 9651: {'lr': 0.0004967110698502377, 'samples': 1852992, 'steps': 9650, 'loss/train': 1.7824867963790894} 11/06/2021 22:35:23 - INFO - __main__ - Step 9652: {'lr': 0.000496710211833942, 'samples': 1853184, 'steps': 9651, 'loss/train': 1.6737051010131836} 11/06/2021 22:35:24 - INFO - __main__ - Step 9653: {'lr': 0.0004967093537064825, 'samples': 1853376, 'steps': 9652, 'loss/train': 2.1513617038726807} 11/06/2021 22:35:24 - INFO - __main__ - Step 9654: {'lr': 0.0004967084954678597, 'samples': 1853568, 'steps': 9653, 'loss/train': 2.3092923164367676} 11/06/2021 22:35:25 - INFO - __main__ - Step 9655: {'lr': 0.0004967076371180738, 'samples': 1853760, 'steps': 9654, 'loss/train': 1.796209454536438} 11/06/2021 22:35:25 - INFO - __main__ - Step 9656: {'lr': 0.0004967067786571251, 'samples': 1853952, 'steps': 9655, 'loss/train': 1.7951509952545166} 11/06/2021 22:35:25 - INFO - __main__ - Step 9657: {'lr': 0.0004967059200850142, 'samples': 1854144, 'steps': 9656, 'loss/train': 1.7078092098236084} 11/06/2021 22:35:26 - INFO - __main__ - Step 9658: {'lr': 0.0004967050614017415, 'samples': 1854336, 'steps': 9657, 'loss/train': 2.276421546936035} 11/06/2021 22:35:27 - INFO - __main__ - Step 9659: {'lr': 0.0004967042026073073, 'samples': 1854528, 'steps': 9658, 'loss/train': 2.434033155441284} 11/06/2021 22:35:27 - INFO - __main__ - Step 9660: {'lr': 0.000496703343701712, 'samples': 1854720, 'steps': 9659, 'loss/train': 2.1363980770111084} 11/06/2021 22:35:28 - INFO - __main__ - Step 9661: {'lr': 0.0004967024846849558, 'samples': 1854912, 'steps': 9660, 'loss/train': 1.4073458909988403} 11/06/2021 22:35:28 - INFO - __main__ - Step 9662: {'lr': 0.0004967016255570394, 'samples': 1855104, 'steps': 9661, 'loss/train': 1.8968331813812256} 11/06/2021 22:35:28 - INFO - __main__ - Step 9663: {'lr': 0.0004967007663179632, 'samples': 1855296, 'steps': 9662, 'loss/train': 1.9019638299942017} 11/06/2021 22:35:29 - INFO - __main__ - Step 9664: {'lr': 0.0004966999069677272, 'samples': 1855488, 'steps': 9663, 'loss/train': 1.987317681312561} 11/06/2021 22:35:30 - INFO - __main__ - Step 9665: {'lr': 0.0004966990475063321, 'samples': 1855680, 'steps': 9664, 'loss/train': 1.3810869455337524} 11/06/2021 22:35:30 - INFO - __main__ - Step 9666: {'lr': 0.0004966981879337783, 'samples': 1855872, 'steps': 9665, 'loss/train': 1.1948487758636475} 11/06/2021 22:35:30 - INFO - __main__ - Step 9667: {'lr': 0.0004966973282500661, 'samples': 1856064, 'steps': 9666, 'loss/train': 1.8805911540985107} 11/06/2021 22:35:31 - INFO - __main__ - Step 9668: {'lr': 0.0004966964684551958, 'samples': 1856256, 'steps': 9667, 'loss/train': 1.9904563426971436} 11/06/2021 22:35:32 - INFO - __main__ - Step 9669: {'lr': 0.0004966956085491679, 'samples': 1856448, 'steps': 9668, 'loss/train': 1.7589201927185059} 11/06/2021 22:35:32 - INFO - __main__ - Step 9670: {'lr': 0.0004966947485319828, 'samples': 1856640, 'steps': 9669, 'loss/train': 2.156191825866699} 11/06/2021 22:35:32 - INFO - __main__ - Step 9671: {'lr': 0.0004966938884036408, 'samples': 1856832, 'steps': 9670, 'loss/train': 1.8533520698547363} 11/06/2021 22:35:33 - INFO - __main__ - Step 9672: {'lr': 0.0004966930281641423, 'samples': 1857024, 'steps': 9671, 'loss/train': 2.1764333248138428} 11/06/2021 22:35:33 - INFO - __main__ - Step 9673: {'lr': 0.0004966921678134879, 'samples': 1857216, 'steps': 9672, 'loss/train': 2.0473735332489014} 11/06/2021 22:35:34 - INFO - __main__ - Step 9674: {'lr': 0.0004966913073516777, 'samples': 1857408, 'steps': 9673, 'loss/train': 2.0089006423950195} 11/06/2021 22:35:35 - INFO - __main__ - Step 9675: {'lr': 0.0004966904467787123, 'samples': 1857600, 'steps': 9674, 'loss/train': 1.5391348600387573} 11/06/2021 22:35:35 - INFO - __main__ - Step 9676: {'lr': 0.0004966895860945918, 'samples': 1857792, 'steps': 9675, 'loss/train': 0.35723814368247986} 11/06/2021 22:35:35 - INFO - __main__ - Step 9677: {'lr': 0.0004966887252993169, 'samples': 1857984, 'steps': 9676, 'loss/train': 1.9116827249526978} 11/06/2021 22:35:36 - INFO - __main__ - Step 9678: {'lr': 0.0004966878643928879, 'samples': 1858176, 'steps': 9677, 'loss/train': 2.0364251136779785} 11/06/2021 22:35:37 - INFO - __main__ - Step 9679: {'lr': 0.0004966870033753051, 'samples': 1858368, 'steps': 9678, 'loss/train': 2.400313138961792} 11/06/2021 22:35:37 - INFO - __main__ - Step 9680: {'lr': 0.0004966861422465689, 'samples': 1858560, 'steps': 9679, 'loss/train': 1.7770127058029175} 11/06/2021 22:35:37 - INFO - __main__ - Step 9681: {'lr': 0.0004966852810066798, 'samples': 1858752, 'steps': 9680, 'loss/train': 1.5626760721206665} 11/06/2021 22:35:38 - INFO - __main__ - Step 9682: {'lr': 0.0004966844196556382, 'samples': 1858944, 'steps': 9681, 'loss/train': 1.113672137260437} 11/06/2021 22:35:38 - INFO - __main__ - Step 9683: {'lr': 0.0004966835581934442, 'samples': 1859136, 'steps': 9682, 'loss/train': 1.9191310405731201} 11/06/2021 22:35:39 - INFO - __main__ - Step 9684: {'lr': 0.0004966826966200985, 'samples': 1859328, 'steps': 9683, 'loss/train': 1.801482915878296} 11/06/2021 22:35:40 - INFO - __main__ - Step 9685: {'lr': 0.0004966818349356015, 'samples': 1859520, 'steps': 9684, 'loss/train': 2.271620273590088} 11/06/2021 22:35:40 - INFO - __main__ - Step 9686: {'lr': 0.0004966809731399533, 'samples': 1859712, 'steps': 9685, 'loss/train': 1.5356147289276123} 11/06/2021 22:35:40 - INFO - __main__ - Step 9687: {'lr': 0.0004966801112331545, 'samples': 1859904, 'steps': 9686, 'loss/train': 1.8079742193222046} 11/06/2021 22:35:41 - INFO - __main__ - Step 9688: {'lr': 0.0004966792492152054, 'samples': 1860096, 'steps': 9687, 'loss/train': 1.7540109157562256} 11/06/2021 22:35:41 - INFO - __main__ - Step 9689: {'lr': 0.0004966783870861066, 'samples': 1860288, 'steps': 9688, 'loss/train': 1.8654999732971191} 11/06/2021 22:35:42 - INFO - __main__ - Step 9690: {'lr': 0.0004966775248458582, 'samples': 1860480, 'steps': 9689, 'loss/train': 1.8638839721679688} 11/06/2021 22:35:42 - INFO - __main__ - Step 9691: {'lr': 0.0004966766624944607, 'samples': 1860672, 'steps': 9690, 'loss/train': 1.839411735534668} 11/06/2021 22:35:43 - INFO - __main__ - Step 9692: {'lr': 0.0004966758000319147, 'samples': 1860864, 'steps': 9691, 'loss/train': 1.5078877210617065} 11/06/2021 22:35:43 - INFO - __main__ - Step 9693: {'lr': 0.0004966749374582202, 'samples': 1861056, 'steps': 9692, 'loss/train': 2.1486093997955322} 11/06/2021 22:35:43 - INFO - __main__ - Step 9694: {'lr': 0.0004966740747733778, 'samples': 1861248, 'steps': 9693, 'loss/train': 2.061601161956787} 11/06/2021 22:35:44 - INFO - __main__ - Step 9695: {'lr': 0.0004966732119773879, 'samples': 1861440, 'steps': 9694, 'loss/train': 1.997467279434204} 11/06/2021 22:35:45 - INFO - __main__ - Step 9696: {'lr': 0.0004966723490702509, 'samples': 1861632, 'steps': 9695, 'loss/train': 1.515207052230835} 11/06/2021 22:35:45 - INFO - __main__ - Step 9697: {'lr': 0.000496671486051967, 'samples': 1861824, 'steps': 9696, 'loss/train': 1.542418122291565} 11/06/2021 22:35:45 - INFO - __main__ - Step 9698: {'lr': 0.0004966706229225368, 'samples': 1862016, 'steps': 9697, 'loss/train': 1.7097152471542358} 11/06/2021 22:35:46 - INFO - __main__ - Step 9699: {'lr': 0.0004966697596819607, 'samples': 1862208, 'steps': 9698, 'loss/train': 1.9859890937805176} 11/06/2021 22:35:47 - INFO - __main__ - Step 9700: {'lr': 0.0004966688963302389, 'samples': 1862400, 'steps': 9699, 'loss/train': 1.6902731657028198} 11/06/2021 22:35:47 - INFO - __main__ - Step 9701: {'lr': 0.000496668032867372, 'samples': 1862592, 'steps': 9700, 'loss/train': 2.2249433994293213} 11/06/2021 22:35:48 - INFO - __main__ - Step 9702: {'lr': 0.0004966671692933603, 'samples': 1862784, 'steps': 9701, 'loss/train': 2.1209845542907715} 11/06/2021 22:35:48 - INFO - __main__ - Step 9703: {'lr': 0.0004966663056082041, 'samples': 1862976, 'steps': 9702, 'loss/train': 2.0231058597564697} 11/06/2021 22:35:48 - INFO - __main__ - Step 9704: {'lr': 0.0004966654418119039, 'samples': 1863168, 'steps': 9703, 'loss/train': 1.7849528789520264} 11/06/2021 22:35:49 - INFO - __main__ - Step 9705: {'lr': 0.00049666457790446, 'samples': 1863360, 'steps': 9704, 'loss/train': 1.677087664604187} 11/06/2021 22:35:49 - INFO - __main__ - Step 9706: {'lr': 0.000496663713885873, 'samples': 1863552, 'steps': 9705, 'loss/train': 1.6094074249267578} 11/06/2021 22:35:50 - INFO - __main__ - Step 9707: {'lr': 0.0004966628497561431, 'samples': 1863744, 'steps': 9706, 'loss/train': 1.4655191898345947} 11/06/2021 22:35:50 - INFO - __main__ - Step 9708: {'lr': 0.0004966619855152706, 'samples': 1863936, 'steps': 9707, 'loss/train': 1.6858545541763306} 11/06/2021 22:35:51 - INFO - __main__ - Step 9709: {'lr': 0.0004966611211632561, 'samples': 1864128, 'steps': 9708, 'loss/train': 2.7526583671569824} 11/06/2021 22:35:51 - INFO - __main__ - Step 9710: {'lr': 0.0004966602567000999, 'samples': 1864320, 'steps': 9709, 'loss/train': 1.8276556730270386} 11/06/2021 22:35:52 - INFO - __main__ - Step 9711: {'lr': 0.0004966593921258023, 'samples': 1864512, 'steps': 9710, 'loss/train': 2.059382677078247} 11/06/2021 22:35:52 - INFO - __main__ - Step 9712: {'lr': 0.000496658527440364, 'samples': 1864704, 'steps': 9711, 'loss/train': 1.3967629671096802} 11/06/2021 22:35:53 - INFO - __main__ - Step 9713: {'lr': 0.000496657662643785, 'samples': 1864896, 'steps': 9712, 'loss/train': 1.2711286544799805} 11/06/2021 22:35:53 - INFO - __main__ - Step 9714: {'lr': 0.000496656797736066, 'samples': 1865088, 'steps': 9713, 'loss/train': 1.6706697940826416} 11/06/2021 22:35:53 - INFO - __main__ - Step 9715: {'lr': 0.0004966559327172071, 'samples': 1865280, 'steps': 9714, 'loss/train': 1.8991764783859253} 11/06/2021 22:35:54 - INFO - __main__ - Step 9716: {'lr': 0.0004966550675872089, 'samples': 1865472, 'steps': 9715, 'loss/train': 0.8012776970863342} 11/06/2021 22:35:55 - INFO - __main__ - Step 9717: {'lr': 0.0004966542023460718, 'samples': 1865664, 'steps': 9716, 'loss/train': 1.5615832805633545} 11/06/2021 22:35:55 - INFO - __main__ - Step 9718: {'lr': 0.000496653336993796, 'samples': 1865856, 'steps': 9717, 'loss/train': 1.8252805471420288} 11/06/2021 22:35:55 - INFO - __main__ - Step 9719: {'lr': 0.0004966524715303821, 'samples': 1866048, 'steps': 9718, 'loss/train': 1.7340925931930542} 11/06/2021 22:35:56 - INFO - __main__ - Step 9720: {'lr': 0.0004966516059558304, 'samples': 1866240, 'steps': 9719, 'loss/train': 1.9591999053955078} 11/06/2021 22:35:57 - INFO - __main__ - Step 9721: {'lr': 0.0004966507402701413, 'samples': 1866432, 'steps': 9720, 'loss/train': 1.574877142906189} 11/06/2021 22:35:57 - INFO - __main__ - Step 9722: {'lr': 0.0004966498744733151, 'samples': 1866624, 'steps': 9721, 'loss/train': 1.7162398099899292} 11/06/2021 22:35:58 - INFO - __main__ - Step 9723: {'lr': 0.0004966490085653523, 'samples': 1866816, 'steps': 9722, 'loss/train': 1.3233344554901123} 11/06/2021 22:35:58 - INFO - __main__ - Step 9724: {'lr': 0.0004966481425462533, 'samples': 1867008, 'steps': 9723, 'loss/train': 2.0228872299194336} 11/06/2021 22:35:58 - INFO - __main__ - Step 9725: {'lr': 0.0004966472764160183, 'samples': 1867200, 'steps': 9724, 'loss/train': 2.029508590698242} 11/06/2021 22:35:59 - INFO - __main__ - Step 9726: {'lr': 0.000496646410174648, 'samples': 1867392, 'steps': 9725, 'loss/train': 2.0226128101348877} 11/06/2021 22:36:00 - INFO - __main__ - Step 9727: {'lr': 0.0004966455438221427, 'samples': 1867584, 'steps': 9726, 'loss/train': 1.7145023345947266} 11/06/2021 22:36:00 - INFO - __main__ - Step 9728: {'lr': 0.0004966446773585026, 'samples': 1867776, 'steps': 9727, 'loss/train': 1.7100311517715454} 11/06/2021 22:36:00 - INFO - __main__ - Step 9729: {'lr': 0.0004966438107837283, 'samples': 1867968, 'steps': 9728, 'loss/train': 1.7535374164581299} 11/06/2021 22:36:01 - INFO - __main__ - Step 9730: {'lr': 0.00049664294409782, 'samples': 1868160, 'steps': 9729, 'loss/train': 1.442859172821045} 11/06/2021 22:36:01 - INFO - __main__ - Step 9731: {'lr': 0.0004966420773007782, 'samples': 1868352, 'steps': 9730, 'loss/train': 1.5532867908477783} 11/06/2021 22:36:02 - INFO - __main__ - Step 9732: {'lr': 0.0004966412103926034, 'samples': 1868544, 'steps': 9731, 'loss/train': 1.9052324295043945} 11/06/2021 22:36:02 - INFO - __main__ - Step 9733: {'lr': 0.0004966403433732958, 'samples': 1868736, 'steps': 9732, 'loss/train': 1.7569751739501953} 11/06/2021 22:36:03 - INFO - __main__ - Step 9734: {'lr': 0.0004966394762428559, 'samples': 1868928, 'steps': 9733, 'loss/train': 1.3910471200942993} 11/06/2021 22:36:03 - INFO - __main__ - Step 9735: {'lr': 0.0004966386090012841, 'samples': 1869120, 'steps': 9734, 'loss/train': 1.8945108652114868} 11/06/2021 22:36:03 - INFO - __main__ - Step 9736: {'lr': 0.0004966377416485806, 'samples': 1869312, 'steps': 9735, 'loss/train': 2.4226298332214355} 11/06/2021 22:36:05 - INFO - __main__ - Step 9737: {'lr': 0.0004966368741847461, 'samples': 1869504, 'steps': 9736, 'loss/train': 1.5357730388641357} 11/06/2021 22:36:05 - INFO - __main__ - Step 9738: {'lr': 0.0004966360066097807, 'samples': 1869696, 'steps': 9737, 'loss/train': 1.4675438404083252} 11/06/2021 22:36:05 - INFO - __main__ - Step 9739: {'lr': 0.0004966351389236851, 'samples': 1869888, 'steps': 9738, 'loss/train': 1.7604131698608398} 11/06/2021 22:36:06 - INFO - __main__ - Step 9740: {'lr': 0.0004966342711264593, 'samples': 1870080, 'steps': 9739, 'loss/train': 2.1939752101898193} 11/06/2021 22:36:06 - INFO - __main__ - Step 9741: {'lr': 0.000496633403218104, 'samples': 1870272, 'steps': 9740, 'loss/train': 1.855295181274414} 11/06/2021 22:36:07 - INFO - __main__ - Step 9742: {'lr': 0.0004966325351986195, 'samples': 1870464, 'steps': 9741, 'loss/train': 2.2889838218688965} 11/06/2021 22:36:07 - INFO - __main__ - Step 9743: {'lr': 0.0004966316670680062, 'samples': 1870656, 'steps': 9742, 'loss/train': 1.9611015319824219} 11/06/2021 22:36:08 - INFO - __main__ - Step 9744: {'lr': 0.0004966307988262644, 'samples': 1870848, 'steps': 9743, 'loss/train': 1.9820667505264282} 11/06/2021 22:36:08 - INFO - __main__ - Step 9745: {'lr': 0.0004966299304733947, 'samples': 1871040, 'steps': 9744, 'loss/train': 1.8761732578277588} 11/06/2021 22:36:08 - INFO - __main__ - Step 9746: {'lr': 0.0004966290620093972, 'samples': 1871232, 'steps': 9745, 'loss/train': 1.9568790197372437} 11/06/2021 22:36:09 - INFO - __main__ - Step 9747: {'lr': 0.0004966281934342725, 'samples': 1871424, 'steps': 9746, 'loss/train': 1.7533879280090332} 11/06/2021 22:36:10 - INFO - __main__ - Step 9748: {'lr': 0.000496627324748021, 'samples': 1871616, 'steps': 9747, 'loss/train': 2.03949236869812} 11/06/2021 22:36:10 - INFO - __main__ - Step 9749: {'lr': 0.000496626455950643, 'samples': 1871808, 'steps': 9748, 'loss/train': 2.50878643989563} 11/06/2021 22:36:10 - INFO - __main__ - Step 9750: {'lr': 0.000496625587042139, 'samples': 1872000, 'steps': 9749, 'loss/train': 1.5603324174880981} 11/06/2021 22:36:11 - INFO - __main__ - Step 9751: {'lr': 0.0004966247180225092, 'samples': 1872192, 'steps': 9750, 'loss/train': 1.700454592704773} 11/06/2021 22:36:11 - INFO - __main__ - Step 9752: {'lr': 0.0004966238488917542, 'samples': 1872384, 'steps': 9751, 'loss/train': 0.7267507910728455} 11/06/2021 22:36:12 - INFO - __main__ - Step 9753: {'lr': 0.0004966229796498742, 'samples': 1872576, 'steps': 9752, 'loss/train': 1.431842565536499} 11/06/2021 22:36:13 - INFO - __main__ - Step 9754: {'lr': 0.0004966221102968698, 'samples': 1872768, 'steps': 9753, 'loss/train': 1.7751144170761108} 11/06/2021 22:36:13 - INFO - __main__ - Step 9755: {'lr': 0.0004966212408327412, 'samples': 1872960, 'steps': 9754, 'loss/train': 1.7627856731414795} 11/06/2021 22:36:13 - INFO - __main__ - Step 9756: {'lr': 0.0004966203712574889, 'samples': 1873152, 'steps': 9755, 'loss/train': 2.068781852722168} 11/06/2021 22:36:14 - INFO - __main__ - Step 9757: {'lr': 0.0004966195015711132, 'samples': 1873344, 'steps': 9756, 'loss/train': 1.8314954042434692} 11/06/2021 22:36:15 - INFO - __main__ - Step 9758: {'lr': 0.0004966186317736146, 'samples': 1873536, 'steps': 9757, 'loss/train': 1.5385262966156006} 11/06/2021 22:36:15 - INFO - __main__ - Step 9759: {'lr': 0.0004966177618649935, 'samples': 1873728, 'steps': 9758, 'loss/train': 2.23832106590271} 11/06/2021 22:36:15 - INFO - __main__ - Step 9760: {'lr': 0.0004966168918452503, 'samples': 1873920, 'steps': 9759, 'loss/train': 2.2789201736450195} 11/06/2021 22:36:16 - INFO - __main__ - Step 9761: {'lr': 0.0004966160217143852, 'samples': 1874112, 'steps': 9760, 'loss/train': 1.4507635831832886} 11/06/2021 22:36:16 - INFO - __main__ - Step 9762: {'lr': 0.0004966151514723988, 'samples': 1874304, 'steps': 9761, 'loss/train': 1.7444387674331665} 11/06/2021 22:36:17 - INFO - __main__ - Step 9763: {'lr': 0.0004966142811192914, 'samples': 1874496, 'steps': 9762, 'loss/train': 2.5484769344329834} 11/06/2021 22:36:17 - INFO - __main__ - Step 9764: {'lr': 0.0004966134106550634, 'samples': 1874688, 'steps': 9763, 'loss/train': 2.3604612350463867} 11/06/2021 22:36:18 - INFO - __main__ - Step 9765: {'lr': 0.0004966125400797152, 'samples': 1874880, 'steps': 9764, 'loss/train': 1.8499844074249268} 11/06/2021 22:36:18 - INFO - __main__ - Step 9766: {'lr': 0.0004966116693932472, 'samples': 1875072, 'steps': 9765, 'loss/train': 2.1143555641174316} 11/06/2021 22:36:18 - INFO - __main__ - Step 9767: {'lr': 0.0004966107985956598, 'samples': 1875264, 'steps': 9766, 'loss/train': 1.2633049488067627} 11/06/2021 22:36:19 - INFO - __main__ - Step 9768: {'lr': 0.0004966099276869534, 'samples': 1875456, 'steps': 9767, 'loss/train': 1.519136667251587} 11/06/2021 22:36:20 - INFO - __main__ - Step 9769: {'lr': 0.0004966090566671283, 'samples': 1875648, 'steps': 9768, 'loss/train': 1.8271578550338745} 11/06/2021 22:36:20 - INFO - __main__ - Step 9770: {'lr': 0.000496608185536185, 'samples': 1875840, 'steps': 9769, 'loss/train': 1.837522268295288} 11/06/2021 22:36:20 - INFO - __main__ - Step 9771: {'lr': 0.0004966073142941239, 'samples': 1876032, 'steps': 9770, 'loss/train': 2.2010583877563477} 11/06/2021 22:36:21 - INFO - __main__ - Step 9772: {'lr': 0.0004966064429409452, 'samples': 1876224, 'steps': 9771, 'loss/train': 1.9248576164245605} 11/06/2021 22:36:22 - INFO - __main__ - Step 9773: {'lr': 0.0004966055714766496, 'samples': 1876416, 'steps': 9772, 'loss/train': 2.22214412689209} 11/06/2021 22:36:22 - INFO - __main__ - Step 9774: {'lr': 0.0004966046999012373, 'samples': 1876608, 'steps': 9773, 'loss/train': 1.0785945653915405} 11/06/2021 22:36:22 - INFO - __main__ - Step 9775: {'lr': 0.0004966038282147087, 'samples': 1876800, 'steps': 9774, 'loss/train': 1.5319066047668457} 11/06/2021 22:36:23 - INFO - __main__ - Step 9776: {'lr': 0.0004966029564170643, 'samples': 1876992, 'steps': 9775, 'loss/train': 1.8373252153396606} 11/06/2021 22:36:23 - INFO - __main__ - Step 9777: {'lr': 0.0004966020845083044, 'samples': 1877184, 'steps': 9776, 'loss/train': 1.9333018064498901} 11/06/2021 22:36:24 - INFO - __main__ - Step 9778: {'lr': 0.0004966012124884292, 'samples': 1877376, 'steps': 9777, 'loss/train': 1.7784291505813599} 11/06/2021 22:36:24 - INFO - __main__ - Step 9779: {'lr': 0.0004966003403574395, 'samples': 1877568, 'steps': 9778, 'loss/train': 2.0057530403137207} 11/06/2021 22:36:25 - INFO - __main__ - Step 9780: {'lr': 0.0004965994681153355, 'samples': 1877760, 'steps': 9779, 'loss/train': 1.3653117418289185} 11/06/2021 22:36:25 - INFO - __main__ - Step 9781: {'lr': 0.0004965985957621175, 'samples': 1877952, 'steps': 9780, 'loss/train': 1.5184459686279297} 11/06/2021 22:36:26 - INFO - __main__ - Step 9782: {'lr': 0.0004965977232977861, 'samples': 1878144, 'steps': 9781, 'loss/train': 1.0429720878601074} 11/06/2021 22:36:26 - INFO - __main__ - Step 9783: {'lr': 0.0004965968507223414, 'samples': 1878336, 'steps': 9782, 'loss/train': 1.5216965675354004} 11/06/2021 22:36:27 - INFO - __main__ - Step 9784: {'lr': 0.000496595978035784, 'samples': 1878528, 'steps': 9783, 'loss/train': 1.9085612297058105} 11/06/2021 22:36:27 - INFO - __main__ - Step 9785: {'lr': 0.0004965951052381144, 'samples': 1878720, 'steps': 9784, 'loss/train': 1.6528103351593018} 11/06/2021 22:36:28 - INFO - __main__ - Step 9786: {'lr': 0.0004965942323293328, 'samples': 1878912, 'steps': 9785, 'loss/train': 1.703324317932129} 11/06/2021 22:36:28 - INFO - __main__ - Step 9787: {'lr': 0.0004965933593094395, 'samples': 1879104, 'steps': 9786, 'loss/train': 1.8389208316802979} 11/06/2021 22:36:28 - INFO - __main__ - Step 9788: {'lr': 0.0004965924861784352, 'samples': 1879296, 'steps': 9787, 'loss/train': 2.069736957550049} 11/06/2021 22:36:29 - INFO - __main__ - Step 9789: {'lr': 0.0004965916129363201, 'samples': 1879488, 'steps': 9788, 'loss/train': 1.9471229314804077} 11/06/2021 22:36:30 - INFO - __main__ - Step 9790: {'lr': 0.0004965907395830945, 'samples': 1879680, 'steps': 9789, 'loss/train': 1.7488764524459839} 11/06/2021 22:36:30 - INFO - __main__ - Step 9791: {'lr': 0.000496589866118759, 'samples': 1879872, 'steps': 9790, 'loss/train': 2.344801902770996} 11/06/2021 22:36:30 - INFO - __main__ - Step 9792: {'lr': 0.000496588992543314, 'samples': 1880064, 'steps': 9791, 'loss/train': 1.8304022550582886} 11/06/2021 22:36:31 - INFO - __main__ - Step 9793: {'lr': 0.0004965881188567597, 'samples': 1880256, 'steps': 9792, 'loss/train': 1.6867663860321045} 11/06/2021 22:36:32 - INFO - __main__ - Step 9794: {'lr': 0.0004965872450590965, 'samples': 1880448, 'steps': 9793, 'loss/train': 1.4658312797546387} 11/06/2021 22:36:32 - INFO - __main__ - Step 9795: {'lr': 0.0004965863711503251, 'samples': 1880640, 'steps': 9794, 'loss/train': 2.786965847015381} 11/06/2021 22:36:33 - INFO - __main__ - Step 9796: {'lr': 0.0004965854971304457, 'samples': 1880832, 'steps': 9795, 'loss/train': 1.9178855419158936} 11/06/2021 22:36:33 - INFO - __main__ - Step 9797: {'lr': 0.0004965846229994586, 'samples': 1881024, 'steps': 9796, 'loss/train': 1.2724156379699707} 11/06/2021 22:36:34 - INFO - __main__ - Step 9798: {'lr': 0.0004965837487573641, 'samples': 1881216, 'steps': 9797, 'loss/train': 2.09515380859375} 11/06/2021 22:36:35 - INFO - __main__ - Step 9799: {'lr': 0.000496582874404163, 'samples': 1881408, 'steps': 9798, 'loss/train': 0.2936389148235321} 11/06/2021 22:36:35 - INFO - __main__ - Step 9800: {'lr': 0.0004965819999398554, 'samples': 1881600, 'steps': 9799, 'loss/train': 1.556630253791809} 11/06/2021 22:36:35 - INFO - __main__ - Step 9801: {'lr': 0.0004965811253644418, 'samples': 1881792, 'steps': 9800, 'loss/train': 1.5148653984069824} 11/06/2021 22:36:36 - INFO - __main__ - Step 9802: {'lr': 0.0004965802506779225, 'samples': 1881984, 'steps': 9801, 'loss/train': 1.8393759727478027} 11/06/2021 22:36:36 - INFO - __main__ - Step 9803: {'lr': 0.0004965793758802978, 'samples': 1882176, 'steps': 9802, 'loss/train': 1.732246994972229} 11/06/2021 22:36:37 - INFO - __main__ - Step 9804: {'lr': 0.0004965785009715684, 'samples': 1882368, 'steps': 9803, 'loss/train': 1.4041547775268555} 11/06/2021 22:36:37 - INFO - __main__ - Step 9805: {'lr': 0.0004965776259517345, 'samples': 1882560, 'steps': 9804, 'loss/train': 1.8803133964538574} 11/06/2021 22:36:38 - INFO - __main__ - Step 9806: {'lr': 0.0004965767508207966, 'samples': 1882752, 'steps': 9805, 'loss/train': 1.2590349912643433} 11/06/2021 22:36:38 - INFO - __main__ - Step 9807: {'lr': 0.000496575875578755, 'samples': 1882944, 'steps': 9806, 'loss/train': 2.212789535522461} 11/06/2021 22:36:38 - INFO - __main__ - Step 9808: {'lr': 0.00049657500022561, 'samples': 1883136, 'steps': 9807, 'loss/train': 1.9373009204864502} 11/06/2021 22:36:39 - INFO - __main__ - Step 9809: {'lr': 0.0004965741247613622, 'samples': 1883328, 'steps': 9808, 'loss/train': 0.9651688933372498} 11/06/2021 22:36:40 - INFO - __main__ - Step 9810: {'lr': 0.0004965732491860119, 'samples': 1883520, 'steps': 9809, 'loss/train': 2.099937915802002} 11/06/2021 22:36:40 - INFO - __main__ - Step 9811: {'lr': 0.0004965723734995594, 'samples': 1883712, 'steps': 9810, 'loss/train': 1.7372392416000366} 11/06/2021 22:36:40 - INFO - __main__ - Step 9812: {'lr': 0.0004965714977020053, 'samples': 1883904, 'steps': 9811, 'loss/train': 1.6284079551696777} 11/06/2021 22:36:41 - INFO - __main__ - Step 9813: {'lr': 0.0004965706217933499, 'samples': 1884096, 'steps': 9812, 'loss/train': 1.849391222000122} 11/06/2021 22:36:41 - INFO - __main__ - Step 9814: {'lr': 0.0004965697457735936, 'samples': 1884288, 'steps': 9813, 'loss/train': 1.648018717765808} 11/06/2021 22:36:42 - INFO - __main__ - Step 9815: {'lr': 0.0004965688696427366, 'samples': 1884480, 'steps': 9814, 'loss/train': 1.7322882413864136} 11/06/2021 22:36:42 - INFO - __main__ - Step 9816: {'lr': 0.0004965679934007797, 'samples': 1884672, 'steps': 9815, 'loss/train': 1.7675822973251343} 11/06/2021 22:36:43 - INFO - __main__ - Step 9817: {'lr': 0.0004965671170477229, 'samples': 1884864, 'steps': 9816, 'loss/train': 1.9023000001907349} 11/06/2021 22:36:43 - INFO - __main__ - Step 9818: {'lr': 0.0004965662405835668, 'samples': 1885056, 'steps': 9817, 'loss/train': 1.9403313398361206} 11/06/2021 22:36:43 - INFO - __main__ - Step 9819: {'lr': 0.0004965653640083118, 'samples': 1885248, 'steps': 9818, 'loss/train': 0.30051112174987793} 11/06/2021 22:36:45 - INFO - __main__ - Step 9820: {'lr': 0.0004965644873219583, 'samples': 1885440, 'steps': 9819, 'loss/train': 1.5804340839385986} 11/06/2021 22:36:45 - INFO - __main__ - Step 9821: {'lr': 0.0004965636105245066, 'samples': 1885632, 'steps': 9820, 'loss/train': 3.80881404876709} 11/06/2021 22:36:45 - INFO - __main__ - Step 9822: {'lr': 0.000496562733615957, 'samples': 1885824, 'steps': 9821, 'loss/train': 1.808201551437378} 11/06/2021 22:36:46 - INFO - __main__ - Step 9823: {'lr': 0.0004965618565963102, 'samples': 1886016, 'steps': 9822, 'loss/train': 1.3552080392837524} 11/06/2021 22:36:46 - INFO - __main__ - Step 9824: {'lr': 0.0004965609794655664, 'samples': 1886208, 'steps': 9823, 'loss/train': 1.8378883600234985} 11/06/2021 22:36:47 - INFO - __main__ - Step 9825: {'lr': 0.0004965601022237261, 'samples': 1886400, 'steps': 9824, 'loss/train': 1.9869000911712646} 11/06/2021 22:36:47 - INFO - __main__ - Step 9826: {'lr': 0.0004965592248707895, 'samples': 1886592, 'steps': 9825, 'loss/train': 1.6496671438217163} 11/06/2021 22:36:48 - INFO - __main__ - Step 9827: {'lr': 0.0004965583474067571, 'samples': 1886784, 'steps': 9826, 'loss/train': 1.5552690029144287} 11/06/2021 22:36:48 - INFO - __main__ - Step 9828: {'lr': 0.0004965574698316294, 'samples': 1886976, 'steps': 9827, 'loss/train': 1.9440776109695435} 11/06/2021 22:36:48 - INFO - __main__ - Step 9829: {'lr': 0.0004965565921454067, 'samples': 1887168, 'steps': 9828, 'loss/train': 1.336730718612671} 11/06/2021 22:36:49 - INFO - __main__ - Step 9830: {'lr': 0.0004965557143480893, 'samples': 1887360, 'steps': 9829, 'loss/train': 0.4165128171443939} 11/06/2021 22:36:50 - INFO - __main__ - Step 9831: {'lr': 0.0004965548364396779, 'samples': 1887552, 'steps': 9830, 'loss/train': 1.5435590744018555} 11/06/2021 22:36:50 - INFO - __main__ - Step 9832: {'lr': 0.0004965539584201725, 'samples': 1887744, 'steps': 9831, 'loss/train': 1.4158097505569458} 11/06/2021 22:36:50 - INFO - __main__ - Step 9833: {'lr': 0.0004965530802895738, 'samples': 1887936, 'steps': 9832, 'loss/train': 1.8684687614440918} 11/06/2021 22:36:51 - INFO - __main__ - Step 9834: {'lr': 0.000496552202047882, 'samples': 1888128, 'steps': 9833, 'loss/train': 2.1222503185272217} 11/06/2021 22:36:52 - INFO - __main__ - Step 9835: {'lr': 0.0004965513236950977, 'samples': 1888320, 'steps': 9834, 'loss/train': 1.6643342971801758} 11/06/2021 22:36:52 - INFO - __main__ - Step 9836: {'lr': 0.0004965504452312211, 'samples': 1888512, 'steps': 9835, 'loss/train': 1.892832636833191} 11/06/2021 22:36:53 - INFO - __main__ - Step 9837: {'lr': 0.0004965495666562527, 'samples': 1888704, 'steps': 9836, 'loss/train': 1.7342534065246582} 11/06/2021 22:36:53 - INFO - __main__ - Step 9838: {'lr': 0.0004965486879701928, 'samples': 1888896, 'steps': 9837, 'loss/train': 1.782392978668213} 11/06/2021 22:36:53 - INFO - __main__ - Step 9839: {'lr': 0.000496547809173042, 'samples': 1889088, 'steps': 9838, 'loss/train': 5.928685665130615} 11/06/2021 22:36:54 - INFO - __main__ - Step 9840: {'lr': 0.0004965469302648005, 'samples': 1889280, 'steps': 9839, 'loss/train': 2.1997225284576416} 11/06/2021 22:36:55 - INFO - __main__ - Step 9841: {'lr': 0.0004965460512454688, 'samples': 1889472, 'steps': 9840, 'loss/train': 1.9922882318496704} 11/06/2021 22:36:55 - INFO - __main__ - Step 9842: {'lr': 0.0004965451721150471, 'samples': 1889664, 'steps': 9841, 'loss/train': 1.385520100593567} 11/06/2021 22:36:55 - INFO - __main__ - Step 9843: {'lr': 0.0004965442928735361, 'samples': 1889856, 'steps': 9842, 'loss/train': 1.5797951221466064} 11/06/2021 22:36:56 - INFO - __main__ - Step 9844: {'lr': 0.000496543413520936, 'samples': 1890048, 'steps': 9843, 'loss/train': 1.9805771112442017} 11/06/2021 22:36:56 - INFO - __main__ - Step 9845: {'lr': 0.0004965425340572472, 'samples': 1890240, 'steps': 9844, 'loss/train': 2.120030403137207} 11/06/2021 22:36:57 - INFO - __main__ - Step 9846: {'lr': 0.0004965416544824703, 'samples': 1890432, 'steps': 9845, 'loss/train': 1.6327179670333862} 11/06/2021 22:36:58 - INFO - __main__ - Step 9847: {'lr': 0.0004965407747966053, 'samples': 1890624, 'steps': 9846, 'loss/train': 1.7754491567611694} 11/06/2021 22:36:58 - INFO - __main__ - Step 9848: {'lr': 0.000496539894999653, 'samples': 1890816, 'steps': 9847, 'loss/train': 1.8676059246063232} 11/06/2021 22:36:59 - INFO - __main__ - Step 9849: {'lr': 0.0004965390150916136, 'samples': 1891008, 'steps': 9848, 'loss/train': 1.9253339767456055} 11/06/2021 22:36:59 - INFO - __main__ - Step 9850: {'lr': 0.0004965381350724874, 'samples': 1891200, 'steps': 9849, 'loss/train': 1.7396340370178223} 11/06/2021 22:36:59 - INFO - __main__ - Step 9851: {'lr': 0.000496537254942275, 'samples': 1891392, 'steps': 9850, 'loss/train': 1.8926670551300049} 11/06/2021 22:37:00 - INFO - __main__ - Step 9852: {'lr': 0.0004965363747009767, 'samples': 1891584, 'steps': 9851, 'loss/train': 1.3262872695922852} 11/06/2021 22:37:01 - INFO - __main__ - Step 9853: {'lr': 0.000496535494348593, 'samples': 1891776, 'steps': 9852, 'loss/train': 1.9677116870880127} 11/06/2021 22:37:01 - INFO - __main__ - Step 9854: {'lr': 0.0004965346138851241, 'samples': 1891968, 'steps': 9853, 'loss/train': 1.1874512434005737} 11/06/2021 22:37:01 - INFO - __main__ - Step 9855: {'lr': 0.0004965337333105706, 'samples': 1892160, 'steps': 9854, 'loss/train': 2.6719613075256348} 11/06/2021 22:37:02 - INFO - __main__ - Step 9856: {'lr': 0.0004965328526249328, 'samples': 1892352, 'steps': 9855, 'loss/train': 1.6148555278778076} 11/06/2021 22:37:03 - INFO - __main__ - Step 9857: {'lr': 0.000496531971828211, 'samples': 1892544, 'steps': 9856, 'loss/train': 1.9275423288345337} 11/06/2021 22:37:03 - INFO - __main__ - Step 9858: {'lr': 0.0004965310909204058, 'samples': 1892736, 'steps': 9857, 'loss/train': 2.38356876373291} 11/06/2021 22:37:03 - INFO - __main__ - Step 9859: {'lr': 0.0004965302099015175, 'samples': 1892928, 'steps': 9858, 'loss/train': 1.5586031675338745} 11/06/2021 22:37:04 - INFO - __main__ - Step 9860: {'lr': 0.0004965293287715464, 'samples': 1893120, 'steps': 9859, 'loss/train': 1.8850369453430176} 11/06/2021 22:37:04 - INFO - __main__ - Step 9861: {'lr': 0.0004965284475304931, 'samples': 1893312, 'steps': 9860, 'loss/train': 2.069514274597168} 11/06/2021 22:37:06 - INFO - __main__ - Step 9862: {'lr': 0.0004965275661783579, 'samples': 1893504, 'steps': 9861, 'loss/train': 2.3399882316589355} 11/06/2021 22:37:06 - INFO - __main__ - Step 9863: {'lr': 0.0004965266847151411, 'samples': 1893696, 'steps': 9862, 'loss/train': 1.8240324258804321} 11/06/2021 22:37:07 - INFO - __main__ - Step 9864: {'lr': 0.0004965258031408432, 'samples': 1893888, 'steps': 9863, 'loss/train': 1.4843631982803345} 11/06/2021 22:37:07 - INFO - __main__ - Step 9865: {'lr': 0.0004965249214554645, 'samples': 1894080, 'steps': 9864, 'loss/train': 1.9368473291397095} 11/06/2021 22:37:07 - INFO - __main__ - Step 9866: {'lr': 0.0004965240396590055, 'samples': 1894272, 'steps': 9865, 'loss/train': 1.9536662101745605} 11/06/2021 22:37:08 - INFO - __main__ - Step 9867: {'lr': 0.0004965231577514666, 'samples': 1894464, 'steps': 9866, 'loss/train': 2.018059253692627} 11/06/2021 22:37:08 - INFO - __main__ - Step 9868: {'lr': 0.0004965222757328482, 'samples': 1894656, 'steps': 9867, 'loss/train': 2.956916093826294} 11/06/2021 22:37:09 - INFO - __main__ - Step 9869: {'lr': 0.0004965213936031507, 'samples': 1894848, 'steps': 9868, 'loss/train': 2.127044677734375} 11/06/2021 22:37:09 - INFO - __main__ - Step 9870: {'lr': 0.0004965205113623744, 'samples': 1895040, 'steps': 9869, 'loss/train': 1.807254433631897} 11/06/2021 22:37:10 - INFO - __main__ - Step 9871: {'lr': 0.0004965196290105197, 'samples': 1895232, 'steps': 9870, 'loss/train': 1.9776854515075684} 11/06/2021 22:37:10 - INFO - __main__ - Step 9872: {'lr': 0.0004965187465475873, 'samples': 1895424, 'steps': 9871, 'loss/train': 1.997361660003662} 11/06/2021 22:37:11 - INFO - __main__ - Step 9873: {'lr': 0.0004965178639735772, 'samples': 1895616, 'steps': 9872, 'loss/train': 1.981247067451477} 11/06/2021 22:37:11 - INFO - __main__ - Step 9874: {'lr': 0.0004965169812884898, 'samples': 1895808, 'steps': 9873, 'loss/train': 1.2999768257141113} 11/06/2021 22:37:12 - INFO - __main__ - Step 9875: {'lr': 0.0004965160984923259, 'samples': 1896000, 'steps': 9874, 'loss/train': 2.367384433746338} 11/06/2021 22:37:12 - INFO - __main__ - Step 9876: {'lr': 0.0004965152155850855, 'samples': 1896192, 'steps': 9875, 'loss/train': 1.639930248260498} 11/06/2021 22:37:13 - INFO - __main__ - Step 9877: {'lr': 0.0004965143325667692, 'samples': 1896384, 'steps': 9876, 'loss/train': 2.1641268730163574} 11/06/2021 22:37:13 - INFO - __main__ - Step 9878: {'lr': 0.0004965134494373773, 'samples': 1896576, 'steps': 9877, 'loss/train': 1.5240188837051392} 11/06/2021 22:37:13 - INFO - __main__ - Step 9879: {'lr': 0.0004965125661969103, 'samples': 1896768, 'steps': 9878, 'loss/train': 1.7663499116897583} 11/06/2021 22:37:14 - INFO - __main__ - Step 9880: {'lr': 0.0004965116828453685, 'samples': 1896960, 'steps': 9879, 'loss/train': 1.9683094024658203} 11/06/2021 22:37:15 - INFO - __main__ - Step 9881: {'lr': 0.0004965107993827524, 'samples': 1897152, 'steps': 9880, 'loss/train': 1.8044612407684326} 11/06/2021 22:37:15 - INFO - __main__ - Step 9882: {'lr': 0.0004965099158090624, 'samples': 1897344, 'steps': 9881, 'loss/train': 1.7531816959381104} 11/06/2021 22:37:15 - INFO - __main__ - Step 9883: {'lr': 0.0004965090321242987, 'samples': 1897536, 'steps': 9882, 'loss/train': 1.9520115852355957} 11/06/2021 22:37:16 - INFO - __main__ - Step 9884: {'lr': 0.0004965081483284618, 'samples': 1897728, 'steps': 9883, 'loss/train': 2.066235065460205} 11/06/2021 22:37:17 - INFO - __main__ - Step 9885: {'lr': 0.0004965072644215522, 'samples': 1897920, 'steps': 9884, 'loss/train': 1.9941020011901855} 11/06/2021 22:37:17 - INFO - __main__ - Step 9886: {'lr': 0.0004965063804035703, 'samples': 1898112, 'steps': 9885, 'loss/train': 1.8429374694824219} 11/06/2021 22:37:17 - INFO - __main__ - Step 9887: {'lr': 0.0004965054962745163, 'samples': 1898304, 'steps': 9886, 'loss/train': 2.1564157009124756} 11/06/2021 22:37:18 - INFO - __main__ - Step 9888: {'lr': 0.0004965046120343908, 'samples': 1898496, 'steps': 9887, 'loss/train': 1.4476630687713623} 11/06/2021 22:37:18 - INFO - __main__ - Step 9889: {'lr': 0.0004965037276831942, 'samples': 1898688, 'steps': 9888, 'loss/train': 1.0114924907684326} 11/06/2021 22:37:19 - INFO - __main__ - Step 9890: {'lr': 0.0004965028432209267, 'samples': 1898880, 'steps': 9889, 'loss/train': 1.0621055364608765} 11/06/2021 22:37:19 - INFO - __main__ - Step 9891: {'lr': 0.0004965019586475888, 'samples': 1899072, 'steps': 9890, 'loss/train': 1.3111342191696167} 11/06/2021 22:37:20 - INFO - __main__ - Step 9892: {'lr': 0.000496501073963181, 'samples': 1899264, 'steps': 9891, 'loss/train': 1.5163756608963013} 11/06/2021 22:37:20 - INFO - __main__ - Step 9893: {'lr': 0.0004965001891677037, 'samples': 1899456, 'steps': 9892, 'loss/train': 1.4711591005325317} 11/06/2021 22:37:21 - INFO - __main__ - Step 9894: {'lr': 0.000496499304261157, 'samples': 1899648, 'steps': 9893, 'loss/train': 2.3137621879577637} 11/06/2021 22:37:22 - INFO - __main__ - Step 9895: {'lr': 0.0004964984192435417, 'samples': 1899840, 'steps': 9894, 'loss/train': 1.8023244142532349} 11/06/2021 22:37:22 - INFO - __main__ - Step 9896: {'lr': 0.000496497534114858, 'samples': 1900032, 'steps': 9895, 'loss/train': 1.5675020217895508} 11/06/2021 22:37:23 - INFO - __main__ - Step 9897: {'lr': 0.0004964966488751062, 'samples': 1900224, 'steps': 9896, 'loss/train': 1.4843635559082031} 11/06/2021 22:37:23 - INFO - __main__ - Step 9898: {'lr': 0.000496495763524287, 'samples': 1900416, 'steps': 9897, 'loss/train': 2.4118194580078125} 11/06/2021 22:37:24 - INFO - __main__ - Step 9899: {'lr': 0.0004964948780624005, 'samples': 1900608, 'steps': 9898, 'loss/train': 1.6140505075454712} 11/06/2021 22:37:24 - INFO - __main__ - Step 9900: {'lr': 0.0004964939924894472, 'samples': 1900800, 'steps': 9899, 'loss/train': 1.5659464597702026} 11/06/2021 22:37:24 - INFO - __main__ - Step 9901: {'lr': 0.0004964931068054274, 'samples': 1900992, 'steps': 9900, 'loss/train': 0.7185819149017334} 11/06/2021 22:37:25 - INFO - __main__ - Step 9902: {'lr': 0.0004964922210103418, 'samples': 1901184, 'steps': 9901, 'loss/train': 0.6721516251564026} 11/06/2021 22:37:26 - INFO - __main__ - Step 9903: {'lr': 0.0004964913351041905, 'samples': 1901376, 'steps': 9902, 'loss/train': 2.110550880432129} 11/06/2021 22:37:26 - INFO - __main__ - Step 9904: {'lr': 0.000496490449086974, 'samples': 1901568, 'steps': 9903, 'loss/train': 2.038620948791504} 11/06/2021 22:37:26 - INFO - __main__ - Step 9905: {'lr': 0.0004964895629586928, 'samples': 1901760, 'steps': 9904, 'loss/train': 2.2174274921417236} 11/06/2021 22:37:27 - INFO - __main__ - Step 9906: {'lr': 0.0004964886767193471, 'samples': 1901952, 'steps': 9905, 'loss/train': 1.2710689306259155} 11/06/2021 22:37:27 - INFO - __main__ - Step 9907: {'lr': 0.0004964877903689375, 'samples': 1902144, 'steps': 9906, 'loss/train': 1.9977413415908813} 11/06/2021 22:37:28 - INFO - __main__ - Step 9908: {'lr': 0.0004964869039074643, 'samples': 1902336, 'steps': 9907, 'loss/train': 1.3191601037979126} 11/06/2021 22:37:29 - INFO - __main__ - Step 9909: {'lr': 0.000496486017334928, 'samples': 1902528, 'steps': 9908, 'loss/train': 2.083883047103882} 11/06/2021 22:37:29 - INFO - __main__ - Step 9910: {'lr': 0.0004964851306513287, 'samples': 1902720, 'steps': 9909, 'loss/train': 0.8083091974258423} 11/06/2021 22:37:29 - INFO - __main__ - Step 9911: {'lr': 0.0004964842438566671, 'samples': 1902912, 'steps': 9910, 'loss/train': 1.8291444778442383} 11/06/2021 22:37:30 - INFO - __main__ - Step 9912: {'lr': 0.0004964833569509434, 'samples': 1903104, 'steps': 9911, 'loss/train': 1.752215027809143} 11/06/2021 22:37:31 - INFO - __main__ - Step 9913: {'lr': 0.0004964824699341582, 'samples': 1903296, 'steps': 9912, 'loss/train': 1.807940125465393} 11/06/2021 22:37:31 - INFO - __main__ - Step 9914: {'lr': 0.0004964815828063118, 'samples': 1903488, 'steps': 9913, 'loss/train': 1.4366697072982788} 11/06/2021 22:37:31 - INFO - __main__ - Step 9915: {'lr': 0.0004964806955674046, 'samples': 1903680, 'steps': 9914, 'loss/train': 2.080610752105713} 11/06/2021 22:37:32 - INFO - __main__ - Step 9916: {'lr': 0.0004964798082174371, 'samples': 1903872, 'steps': 9915, 'loss/train': 1.5181411504745483} 11/06/2021 22:37:32 - INFO - __main__ - Step 9917: {'lr': 0.0004964789207564094, 'samples': 1904064, 'steps': 9916, 'loss/train': 1.8686342239379883} 11/06/2021 22:37:33 - INFO - __main__ - Step 9918: {'lr': 0.0004964780331843223, 'samples': 1904256, 'steps': 9917, 'loss/train': 2.0889878273010254} 11/06/2021 22:37:33 - INFO - __main__ - Step 9919: {'lr': 0.0004964771455011758, 'samples': 1904448, 'steps': 9918, 'loss/train': 1.7543959617614746} 11/06/2021 22:37:34 - INFO - __main__ - Step 9920: {'lr': 0.0004964762577069707, 'samples': 1904640, 'steps': 9919, 'loss/train': 1.4055709838867188} 11/06/2021 22:37:34 - INFO - __main__ - Step 9921: {'lr': 0.0004964753698017071, 'samples': 1904832, 'steps': 9920, 'loss/train': 1.6933525800704956} 11/06/2021 22:37:34 - INFO - __main__ - Step 9922: {'lr': 0.0004964744817853855, 'samples': 1905024, 'steps': 9921, 'loss/train': 1.261832356452942} 11/06/2021 22:37:35 - INFO - __main__ - Step 9923: {'lr': 0.0004964735936580063, 'samples': 1905216, 'steps': 9922, 'loss/train': 2.1547069549560547} 11/06/2021 22:37:36 - INFO - __main__ - Step 9924: {'lr': 0.00049647270541957, 'samples': 1905408, 'steps': 9923, 'loss/train': 1.4039684534072876} 11/06/2021 22:37:36 - INFO - __main__ - Step 9925: {'lr': 0.0004964718170700767, 'samples': 1905600, 'steps': 9924, 'loss/train': 1.7743269205093384} 11/06/2021 22:37:37 - INFO - __main__ - Step 9926: {'lr': 0.0004964709286095271, 'samples': 1905792, 'steps': 9925, 'loss/train': 1.6846635341644287} 11/06/2021 22:37:37 - INFO - __main__ - Step 9927: {'lr': 0.0004964700400379215, 'samples': 1905984, 'steps': 9926, 'loss/train': 1.7219330072402954} 11/06/2021 22:37:37 - INFO - __main__ - Step 9928: {'lr': 0.0004964691513552604, 'samples': 1906176, 'steps': 9927, 'loss/train': 1.6441329717636108} 11/06/2021 22:37:38 - INFO - __main__ - Step 9929: {'lr': 0.000496468262561544, 'samples': 1906368, 'steps': 9928, 'loss/train': 1.6577142477035522} 11/06/2021 22:37:38 - INFO - __main__ - Step 9930: {'lr': 0.0004964673736567728, 'samples': 1906560, 'steps': 9929, 'loss/train': 1.7367271184921265} 11/06/2021 22:37:39 - INFO - __main__ - Step 9931: {'lr': 0.0004964664846409473, 'samples': 1906752, 'steps': 9930, 'loss/train': 1.7608524560928345} 11/06/2021 22:37:39 - INFO - __main__ - Step 9932: {'lr': 0.0004964655955140677, 'samples': 1906944, 'steps': 9931, 'loss/train': 1.8868800401687622} 11/06/2021 22:37:40 - INFO - __main__ - Step 9933: {'lr': 0.0004964647062761345, 'samples': 1907136, 'steps': 9932, 'loss/train': 2.024883270263672} 11/06/2021 22:37:41 - INFO - __main__ - Step 9934: {'lr': 0.0004964638169271482, 'samples': 1907328, 'steps': 9933, 'loss/train': 1.661513328552246} 11/06/2021 22:37:41 - INFO - __main__ - Step 9935: {'lr': 0.0004964629274671091, 'samples': 1907520, 'steps': 9934, 'loss/train': 1.6273504495620728} 11/06/2021 22:37:42 - INFO - __main__ - Step 9936: {'lr': 0.0004964620378960175, 'samples': 1907712, 'steps': 9935, 'loss/train': 1.556241750717163} 11/06/2021 22:37:42 - INFO - __main__ - Step 9937: {'lr': 0.000496461148213874, 'samples': 1907904, 'steps': 9936, 'loss/train': 2.6705877780914307} 11/06/2021 22:37:42 - INFO - __main__ - Step 9938: {'lr': 0.0004964602584206788, 'samples': 1908096, 'steps': 9937, 'loss/train': 1.8545628786087036} 11/06/2021 22:37:43 - INFO - __main__ - Step 9939: {'lr': 0.0004964593685164326, 'samples': 1908288, 'steps': 9938, 'loss/train': 1.645344614982605} 11/06/2021 22:37:44 - INFO - __main__ - Step 9940: {'lr': 0.0004964584785011355, 'samples': 1908480, 'steps': 9939, 'loss/train': 1.6063035726547241} 11/06/2021 22:37:44 - INFO - __main__ - Step 9941: {'lr': 0.000496457588374788, 'samples': 1908672, 'steps': 9940, 'loss/train': 0.9683166146278381} 11/06/2021 22:37:44 - INFO - __main__ - Step 9942: {'lr': 0.0004964566981373905, 'samples': 1908864, 'steps': 9941, 'loss/train': 1.5615383386611938} 11/06/2021 22:37:45 - INFO - __main__ - Step 9943: {'lr': 0.0004964558077889435, 'samples': 1909056, 'steps': 9942, 'loss/train': 1.4332380294799805} 11/06/2021 22:37:45 - INFO - __main__ - Step 9944: {'lr': 0.0004964549173294472, 'samples': 1909248, 'steps': 9943, 'loss/train': 1.4659695625305176} 11/06/2021 22:37:46 - INFO - __main__ - Step 9945: {'lr': 0.0004964540267589023, 'samples': 1909440, 'steps': 9944, 'loss/train': 1.1614114046096802} 11/06/2021 22:37:46 - INFO - __main__ - Step 9946: {'lr': 0.0004964531360773088, 'samples': 1909632, 'steps': 9945, 'loss/train': 1.5709928274154663} 11/06/2021 22:37:47 - INFO - __main__ - Step 9947: {'lr': 0.0004964522452846675, 'samples': 1909824, 'steps': 9946, 'loss/train': 1.8149683475494385} 11/06/2021 22:37:47 - INFO - __main__ - Step 9948: {'lr': 0.0004964513543809785, 'samples': 1910016, 'steps': 9947, 'loss/train': 1.0438923835754395} 11/06/2021 22:37:48 - INFO - __main__ - Step 9949: {'lr': 0.0004964504633662424, 'samples': 1910208, 'steps': 9948, 'loss/train': 0.8558833599090576} 11/06/2021 22:37:49 - INFO - __main__ - Step 9950: {'lr': 0.0004964495722404595, 'samples': 1910400, 'steps': 9949, 'loss/train': 1.9778287410736084} 11/06/2021 22:37:49 - INFO - __main__ - Step 9951: {'lr': 0.0004964486810036301, 'samples': 1910592, 'steps': 9950, 'loss/train': 2.1700186729431152} 11/06/2021 22:37:49 - INFO - __main__ - Step 9952: {'lr': 0.000496447789655755, 'samples': 1910784, 'steps': 9951, 'loss/train': 1.5003530979156494} 11/06/2021 22:37:50 - INFO - __main__ - Step 9953: {'lr': 0.0004964468981968341, 'samples': 1910976, 'steps': 9952, 'loss/train': 1.8400135040283203} 11/06/2021 22:37:50 - INFO - __main__ - Step 9954: {'lr': 0.0004964460066268681, 'samples': 1911168, 'steps': 9953, 'loss/train': 1.9778108596801758} 11/06/2021 22:37:51 - INFO - __main__ - Step 9955: {'lr': 0.0004964451149458573, 'samples': 1911360, 'steps': 9954, 'loss/train': 1.6574676036834717} 11/06/2021 22:37:51 - INFO - __main__ - Step 9956: {'lr': 0.0004964442231538023, 'samples': 1911552, 'steps': 9955, 'loss/train': 1.1613150835037231} 11/06/2021 22:37:52 - INFO - __main__ - Step 9957: {'lr': 0.000496443331250703, 'samples': 1911744, 'steps': 9956, 'loss/train': 1.88170325756073} 11/06/2021 22:37:52 - INFO - __main__ - Step 9958: {'lr': 0.0004964424392365604, 'samples': 1911936, 'steps': 9957, 'loss/train': 2.3807883262634277} 11/06/2021 22:37:52 - INFO - __main__ - Step 9959: {'lr': 0.0004964415471113747, 'samples': 1912128, 'steps': 9958, 'loss/train': 2.4166524410247803} 11/06/2021 22:37:53 - INFO - __main__ - Step 9960: {'lr': 0.0004964406548751461, 'samples': 1912320, 'steps': 9959, 'loss/train': 2.397183656692505} 11/06/2021 22:37:54 - INFO - __main__ - Step 9961: {'lr': 0.0004964397625278751, 'samples': 1912512, 'steps': 9960, 'loss/train': 1.7461156845092773} 11/06/2021 22:37:54 - INFO - __main__ - Step 9962: {'lr': 0.0004964388700695623, 'samples': 1912704, 'steps': 9961, 'loss/train': 1.654305100440979} 11/06/2021 22:37:55 - INFO - __main__ - Step 9963: {'lr': 0.0004964379775002078, 'samples': 1912896, 'steps': 9962, 'loss/train': 2.360861301422119} 11/06/2021 22:37:55 - INFO - __main__ - Step 9964: {'lr': 0.0004964370848198122, 'samples': 1913088, 'steps': 9963, 'loss/train': 2.1414263248443604} 11/06/2021 22:37:55 - INFO - __main__ - Step 9965: {'lr': 0.0004964361920283759, 'samples': 1913280, 'steps': 9964, 'loss/train': 1.4624086618423462} 11/06/2021 22:37:56 - INFO - __main__ - Step 9966: {'lr': 0.0004964352991258992, 'samples': 1913472, 'steps': 9965, 'loss/train': 1.942596435546875} 11/06/2021 22:37:57 - INFO - __main__ - Step 9967: {'lr': 0.0004964344061123826, 'samples': 1913664, 'steps': 9966, 'loss/train': 1.6361198425292969} 11/06/2021 22:37:57 - INFO - __main__ - Step 9968: {'lr': 0.0004964335129878264, 'samples': 1913856, 'steps': 9967, 'loss/train': 1.1940803527832031} 11/06/2021 22:37:57 - INFO - __main__ - Step 9969: {'lr': 0.0004964326197522311, 'samples': 1914048, 'steps': 9968, 'loss/train': 1.8905918598175049} 11/06/2021 22:37:58 - INFO - __main__ - Step 9970: {'lr': 0.0004964317264055971, 'samples': 1914240, 'steps': 9969, 'loss/train': 1.6931695938110352} 11/06/2021 22:37:59 - INFO - __main__ - Step 9971: {'lr': 0.0004964308329479247, 'samples': 1914432, 'steps': 9970, 'loss/train': 2.065845489501953} 11/06/2021 22:37:59 - INFO - __main__ - Step 9972: {'lr': 0.0004964299393792143, 'samples': 1914624, 'steps': 9971, 'loss/train': 2.043527603149414} 11/06/2021 22:37:59 - INFO - __main__ - Step 9973: {'lr': 0.0004964290456994666, 'samples': 1914816, 'steps': 9972, 'loss/train': 1.4954372644424438} 11/06/2021 22:38:00 - INFO - __main__ - Step 9974: {'lr': 0.0004964281519086816, 'samples': 1915008, 'steps': 9973, 'loss/train': 2.400644302368164} 11/06/2021 22:38:00 - INFO - __main__ - Step 9975: {'lr': 0.0004964272580068599, 'samples': 1915200, 'steps': 9974, 'loss/train': 1.6218897104263306} 11/06/2021 22:38:01 - INFO - __main__ - Step 9976: {'lr': 0.0004964263639940018, 'samples': 1915392, 'steps': 9975, 'loss/train': 2.0930655002593994} 11/06/2021 22:38:02 - INFO - __main__ - Step 9977: {'lr': 0.000496425469870108, 'samples': 1915584, 'steps': 9976, 'loss/train': 1.391600251197815} 11/06/2021 22:38:02 - INFO - __main__ - Step 9978: {'lr': 0.0004964245756351786, 'samples': 1915776, 'steps': 9977, 'loss/train': 1.8200713396072388} 11/06/2021 22:38:02 - INFO - __main__ - Step 9979: {'lr': 0.000496423681289214, 'samples': 1915968, 'steps': 9978, 'loss/train': 1.7776697874069214} 11/06/2021 22:38:03 - INFO - __main__ - Step 9980: {'lr': 0.0004964227868322148, 'samples': 1916160, 'steps': 9979, 'loss/train': 1.082550287246704} 11/06/2021 22:38:04 - INFO - __main__ - Step 9981: {'lr': 0.0004964218922641812, 'samples': 1916352, 'steps': 9980, 'loss/train': 1.5953223705291748} 11/06/2021 22:38:04 - INFO - __main__ - Step 9982: {'lr': 0.0004964209975851137, 'samples': 1916544, 'steps': 9981, 'loss/train': 1.3076783418655396} 11/06/2021 22:38:04 - INFO - __main__ - Step 9983: {'lr': 0.0004964201027950129, 'samples': 1916736, 'steps': 9982, 'loss/train': 1.7255864143371582} 11/06/2021 22:38:05 - INFO - __main__ - Step 9984: {'lr': 0.0004964192078938788, 'samples': 1916928, 'steps': 9983, 'loss/train': 2.1289093494415283} 11/06/2021 22:38:05 - INFO - __main__ - Step 9985: {'lr': 0.0004964183128817121, 'samples': 1917120, 'steps': 9984, 'loss/train': 1.9382929801940918} 11/06/2021 22:38:06 - INFO - __main__ - Step 9986: {'lr': 0.000496417417758513, 'samples': 1917312, 'steps': 9985, 'loss/train': 1.6477100849151611} 11/06/2021 22:38:06 - INFO - __main__ - Step 9987: {'lr': 0.000496416522524282, 'samples': 1917504, 'steps': 9986, 'loss/train': 1.6857631206512451} 11/06/2021 22:38:07 - INFO - __main__ - Step 9988: {'lr': 0.0004964156271790197, 'samples': 1917696, 'steps': 9987, 'loss/train': 1.4530490636825562} 11/06/2021 22:38:07 - INFO - __main__ - Step 9989: {'lr': 0.0004964147317227262, 'samples': 1917888, 'steps': 9988, 'loss/train': 1.7035175561904907} 11/06/2021 22:38:07 - INFO - __main__ - Step 9990: {'lr': 0.000496413836155402, 'samples': 1918080, 'steps': 9989, 'loss/train': 1.9417301416397095} 11/06/2021 22:38:09 - INFO - __main__ - Step 9991: {'lr': 0.0004964129404770476, 'samples': 1918272, 'steps': 9990, 'loss/train': 1.67684006690979} 11/06/2021 22:38:09 - INFO - __main__ - Step 9992: {'lr': 0.0004964120446876633, 'samples': 1918464, 'steps': 9991, 'loss/train': 1.8245000839233398} 11/06/2021 22:38:09 - INFO - __main__ - Step 9993: {'lr': 0.0004964111487872495, 'samples': 1918656, 'steps': 9992, 'loss/train': 1.7816718816757202} 11/06/2021 22:38:10 - INFO - __main__ - Step 9994: {'lr': 0.0004964102527758067, 'samples': 1918848, 'steps': 9993, 'loss/train': 1.7993488311767578} 11/06/2021 22:38:10 - INFO - __main__ - Step 9995: {'lr': 0.0004964093566533352, 'samples': 1919040, 'steps': 9994, 'loss/train': 1.1521903276443481} 11/06/2021 22:38:11 - INFO - __main__ - Step 9996: {'lr': 0.0004964084604198354, 'samples': 1919232, 'steps': 9995, 'loss/train': 1.9137816429138184} 11/06/2021 22:38:11 - INFO - __main__ - Step 9997: {'lr': 0.0004964075640753079, 'samples': 1919424, 'steps': 9996, 'loss/train': 1.5610769987106323} 11/06/2021 22:38:12 - INFO - __main__ - Step 9998: {'lr': 0.0004964066676197528, 'samples': 1919616, 'steps': 9997, 'loss/train': 1.6594487428665161} 11/06/2021 22:38:12 - INFO - __main__ - Step 9999: {'lr': 0.0004964057710531707, 'samples': 1919808, 'steps': 9998, 'loss/train': 1.531420111656189} 11/06/2021 22:38:12 - INFO - __main__ - Step 10000: {'lr': 0.0004964048743755621, 'samples': 1920000, 'steps': 9999, 'loss/train': 1.2586203813552856} 11/06/2021 22:38:13 - INFO - __main__ - Step 10001: {'lr': 0.0004964039775869272, 'samples': 1920192, 'steps': 10000, 'loss/train': 1.3523963689804077} 11/06/2021 22:38:14 - INFO - __main__ - Step 10002: {'lr': 0.0004964030806872664, 'samples': 1920384, 'steps': 10001, 'loss/train': 1.9878268241882324} 11/06/2021 22:38:14 - INFO - __main__ - Step 10003: {'lr': 0.0004964021836765802, 'samples': 1920576, 'steps': 10002, 'loss/train': 1.8906978368759155} 11/06/2021 22:38:14 - INFO - __main__ - Step 10004: {'lr': 0.000496401286554869, 'samples': 1920768, 'steps': 10003, 'loss/train': 1.310309648513794} 11/06/2021 22:38:15 - INFO - __main__ - Step 10005: {'lr': 0.000496400389322133, 'samples': 1920960, 'steps': 10004, 'loss/train': 1.6377642154693604} 11/06/2021 22:38:15 - INFO - __main__ - Step 10006: {'lr': 0.000496399491978373, 'samples': 1921152, 'steps': 10005, 'loss/train': 2.1881837844848633} 11/06/2021 22:38:16 - INFO - __main__ - Step 10007: {'lr': 0.0004963985945235891, 'samples': 1921344, 'steps': 10006, 'loss/train': 1.2533279657363892} 11/06/2021 22:38:16 - INFO - __main__ - Step 10008: {'lr': 0.0004963976969577819, 'samples': 1921536, 'steps': 10007, 'loss/train': 1.3222609758377075} 11/06/2021 22:38:17 - INFO - __main__ - Step 10009: {'lr': 0.0004963967992809516, 'samples': 1921728, 'steps': 10008, 'loss/train': 1.7431193590164185} 11/06/2021 22:38:17 - INFO - __main__ - Step 10010: {'lr': 0.0004963959014930988, 'samples': 1921920, 'steps': 10009, 'loss/train': 2.0606842041015625} 11/06/2021 22:38:17 - INFO - __main__ - Step 10011: {'lr': 0.0004963950035942237, 'samples': 1922112, 'steps': 10010, 'loss/train': 1.9937307834625244} 11/06/2021 22:38:19 - INFO - __main__ - Step 10012: {'lr': 0.0004963941055843268, 'samples': 1922304, 'steps': 10011, 'loss/train': 1.7773168087005615} 11/06/2021 22:38:19 - INFO - __main__ - Step 10013: {'lr': 0.0004963932074634087, 'samples': 1922496, 'steps': 10012, 'loss/train': 1.3404390811920166} 11/06/2021 22:38:19 - INFO - __main__ - Step 10014: {'lr': 0.0004963923092314694, 'samples': 1922688, 'steps': 10013, 'loss/train': 1.3710219860076904} 11/06/2021 22:38:20 - INFO - __main__ - Step 10015: {'lr': 0.0004963914108885097, 'samples': 1922880, 'steps': 10014, 'loss/train': 1.7717443704605103} 11/06/2021 22:38:20 - INFO - __main__ - Step 10016: {'lr': 0.0004963905124345297, 'samples': 1923072, 'steps': 10015, 'loss/train': 1.6650587320327759} 11/06/2021 22:38:21 - INFO - __main__ - Step 10017: {'lr': 0.00049638961386953, 'samples': 1923264, 'steps': 10016, 'loss/train': 1.4850966930389404} 11/06/2021 22:38:21 - INFO - __main__ - Step 10018: {'lr': 0.000496388715193511, 'samples': 1923456, 'steps': 10017, 'loss/train': 1.0550109148025513} 11/06/2021 22:38:22 - INFO - __main__ - Step 10019: {'lr': 0.000496387816406473, 'samples': 1923648, 'steps': 10018, 'loss/train': 1.6358630657196045} 11/06/2021 22:38:22 - INFO - __main__ - Step 10020: {'lr': 0.0004963869175084164, 'samples': 1923840, 'steps': 10019, 'loss/train': 1.7765165567398071} 11/06/2021 22:38:22 - INFO - __main__ - Step 10021: {'lr': 0.0004963860184993416, 'samples': 1924032, 'steps': 10020, 'loss/train': 1.717958688735962} 11/06/2021 22:38:23 - INFO - __main__ - Step 10022: {'lr': 0.0004963851193792492, 'samples': 1924224, 'steps': 10021, 'loss/train': 1.7283834218978882} 11/06/2021 22:38:24 - INFO - __main__ - Step 10023: {'lr': 0.0004963842201481394, 'samples': 1924416, 'steps': 10022, 'loss/train': 1.514644742012024} 11/06/2021 22:38:24 - INFO - __main__ - Step 10024: {'lr': 0.0004963833208060128, 'samples': 1924608, 'steps': 10023, 'loss/train': 1.3506485223770142} 11/06/2021 22:38:24 - INFO - __main__ - Step 10025: {'lr': 0.0004963824213528696, 'samples': 1924800, 'steps': 10024, 'loss/train': 1.9384859800338745} 11/06/2021 22:38:25 - INFO - __main__ - Step 10026: {'lr': 0.0004963815217887102, 'samples': 1924992, 'steps': 10025, 'loss/train': 2.230543851852417} 11/06/2021 22:38:26 - INFO - __main__ - Step 10027: {'lr': 0.0004963806221135351, 'samples': 1925184, 'steps': 10026, 'loss/train': 1.5828680992126465} 11/06/2021 22:38:26 - INFO - __main__ - Step 10028: {'lr': 0.0004963797223273448, 'samples': 1925376, 'steps': 10027, 'loss/train': 2.0957136154174805} 11/06/2021 22:38:27 - INFO - __main__ - Step 10029: {'lr': 0.0004963788224301395, 'samples': 1925568, 'steps': 10028, 'loss/train': 2.020629405975342} 11/06/2021 22:38:27 - INFO - __main__ - Step 10030: {'lr': 0.0004963779224219197, 'samples': 1925760, 'steps': 10029, 'loss/train': 1.7997504472732544} 11/06/2021 22:38:27 - INFO - __main__ - Step 10031: {'lr': 0.0004963770223026858, 'samples': 1925952, 'steps': 10030, 'loss/train': 1.9878836870193481} 11/06/2021 22:38:28 - INFO - __main__ - Step 10032: {'lr': 0.0004963761220724384, 'samples': 1926144, 'steps': 10031, 'loss/train': 1.9155443906784058} 11/06/2021 22:38:29 - INFO - __main__ - Step 10033: {'lr': 0.0004963752217311775, 'samples': 1926336, 'steps': 10032, 'loss/train': 1.7525060176849365} 11/06/2021 22:38:29 - INFO - __main__ - Step 10034: {'lr': 0.0004963743212789038, 'samples': 1926528, 'steps': 10033, 'loss/train': 2.0401864051818848} 11/06/2021 22:38:29 - INFO - __main__ - Step 10035: {'lr': 0.0004963734207156178, 'samples': 1926720, 'steps': 10034, 'loss/train': 1.524163007736206} 11/06/2021 22:38:30 - INFO - __main__ - Step 10036: {'lr': 0.0004963725200413195, 'samples': 1926912, 'steps': 10035, 'loss/train': 1.768740177154541} 11/06/2021 22:38:30 - INFO - __main__ - Step 10037: {'lr': 0.0004963716192560097, 'samples': 1927104, 'steps': 10036, 'loss/train': 0.7059550285339355} 11/06/2021 22:38:31 - INFO - __main__ - Step 10038: {'lr': 0.0004963707183596885, 'samples': 1927296, 'steps': 10037, 'loss/train': 1.979498267173767} 11/06/2021 22:38:32 - INFO - __main__ - Step 10039: {'lr': 0.0004963698173523566, 'samples': 1927488, 'steps': 10038, 'loss/train': 1.3461339473724365} 11/06/2021 22:38:32 - INFO - __main__ - Step 10040: {'lr': 0.0004963689162340142, 'samples': 1927680, 'steps': 10039, 'loss/train': 1.8551632165908813} 11/06/2021 22:38:32 - INFO - __main__ - Step 10041: {'lr': 0.0004963680150046618, 'samples': 1927872, 'steps': 10040, 'loss/train': 1.8479679822921753} 11/06/2021 22:38:33 - INFO - __main__ - Step 10042: {'lr': 0.0004963671136642997, 'samples': 1928064, 'steps': 10041, 'loss/train': 1.8179422616958618} 11/06/2021 22:38:34 - INFO - __main__ - Step 10043: {'lr': 0.0004963662122129284, 'samples': 1928256, 'steps': 10042, 'loss/train': 1.5947130918502808} 11/06/2021 22:38:34 - INFO - __main__ - Step 10044: {'lr': 0.0004963653106505483, 'samples': 1928448, 'steps': 10043, 'loss/train': 2.2334413528442383} 11/06/2021 22:38:34 - INFO - __main__ - Step 10045: {'lr': 0.0004963644089771598, 'samples': 1928640, 'steps': 10044, 'loss/train': 1.9353092908859253} 11/06/2021 22:38:35 - INFO - __main__ - Step 10046: {'lr': 0.0004963635071927633, 'samples': 1928832, 'steps': 10045, 'loss/train': 1.261295199394226} 11/06/2021 22:38:35 - INFO - __main__ - Step 10047: {'lr': 0.0004963626052973592, 'samples': 1929024, 'steps': 10046, 'loss/train': 1.8938654661178589} 11/06/2021 22:38:36 - INFO - __main__ - Step 10048: {'lr': 0.0004963617032909479, 'samples': 1929216, 'steps': 10047, 'loss/train': 1.5131046772003174} 11/06/2021 22:38:36 - INFO - __main__ - Step 10049: {'lr': 0.0004963608011735298, 'samples': 1929408, 'steps': 10048, 'loss/train': 1.5676438808441162} 11/06/2021 22:38:37 - INFO - __main__ - Step 10050: {'lr': 0.0004963598989451053, 'samples': 1929600, 'steps': 10049, 'loss/train': 1.8830301761627197} 11/06/2021 22:38:37 - INFO - __main__ - Step 10051: {'lr': 0.000496358996605675, 'samples': 1929792, 'steps': 10050, 'loss/train': 1.9643739461898804} 11/06/2021 22:38:37 - INFO - __main__ - Step 10052: {'lr': 0.0004963580941552391, 'samples': 1929984, 'steps': 10051, 'loss/train': 1.7612366676330566} 11/06/2021 22:38:38 - INFO - __main__ - Step 10053: {'lr': 0.0004963571915937979, 'samples': 1930176, 'steps': 10052, 'loss/train': 1.8770025968551636} 11/06/2021 22:38:39 - INFO - __main__ - Step 10054: {'lr': 0.000496356288921352, 'samples': 1930368, 'steps': 10053, 'loss/train': 1.7088673114776611} 11/06/2021 22:38:39 - INFO - __main__ - Step 10055: {'lr': 0.0004963553861379018, 'samples': 1930560, 'steps': 10054, 'loss/train': 2.023212194442749} 11/06/2021 22:38:39 - INFO - __main__ - Step 10056: {'lr': 0.0004963544832434476, 'samples': 1930752, 'steps': 10055, 'loss/train': 1.9566535949707031} 11/06/2021 22:38:40 - INFO - __main__ - Step 10057: {'lr': 0.00049635358023799, 'samples': 1930944, 'steps': 10056, 'loss/train': 1.7718608379364014} 11/06/2021 22:38:41 - INFO - __main__ - Step 10058: {'lr': 0.0004963526771215291, 'samples': 1931136, 'steps': 10057, 'loss/train': 1.9035671949386597} 11/06/2021 22:38:41 - INFO - __main__ - Step 10059: {'lr': 0.0004963517738940656, 'samples': 1931328, 'steps': 10058, 'loss/train': 1.7526215314865112} 11/06/2021 22:38:41 - INFO - __main__ - Step 10060: {'lr': 0.0004963508705555998, 'samples': 1931520, 'steps': 10059, 'loss/train': 1.9995636940002441} 11/06/2021 22:38:42 - INFO - __main__ - Step 10061: {'lr': 0.000496349967106132, 'samples': 1931712, 'steps': 10060, 'loss/train': 1.6497392654418945} 11/06/2021 22:38:42 - INFO - __main__ - Step 10062: {'lr': 0.0004963490635456629, 'samples': 1931904, 'steps': 10061, 'loss/train': 1.680986762046814} 11/06/2021 22:38:42 - INFO - __main__ - Step 10063: {'lr': 0.0004963481598741925, 'samples': 1932096, 'steps': 10062, 'loss/train': 1.8724141120910645} 11/06/2021 22:38:43 - INFO - __main__ - Step 10064: {'lr': 0.0004963472560917216, 'samples': 1932288, 'steps': 10063, 'loss/train': 1.6809192895889282} 11/06/2021 22:38:44 - INFO - __main__ - Step 10065: {'lr': 0.0004963463521982503, 'samples': 1932480, 'steps': 10064, 'loss/train': 1.8800909519195557} 11/06/2021 22:38:44 - INFO - __main__ - Step 10066: {'lr': 0.0004963454481937791, 'samples': 1932672, 'steps': 10065, 'loss/train': 1.9918426275253296} 11/06/2021 22:38:45 - INFO - __main__ - Step 10067: {'lr': 0.0004963445440783086, 'samples': 1932864, 'steps': 10066, 'loss/train': 1.8763664960861206} 11/06/2021 22:38:45 - INFO - __main__ - Step 10068: {'lr': 0.0004963436398518389, 'samples': 1933056, 'steps': 10067, 'loss/train': 1.3873255252838135} 11/06/2021 22:38:46 - INFO - __main__ - Step 10069: {'lr': 0.0004963427355143706, 'samples': 1933248, 'steps': 10068, 'loss/train': 1.4686658382415771} 11/06/2021 22:38:46 - INFO - __main__ - Step 10070: {'lr': 0.0004963418310659041, 'samples': 1933440, 'steps': 10069, 'loss/train': 1.890317440032959} 11/06/2021 22:38:47 - INFO - __main__ - Step 10071: {'lr': 0.0004963409265064398, 'samples': 1933632, 'steps': 10070, 'loss/train': 1.8712046146392822} 11/06/2021 22:38:47 - INFO - __main__ - Step 10072: {'lr': 0.0004963400218359781, 'samples': 1933824, 'steps': 10071, 'loss/train': 1.9994356632232666} 11/06/2021 22:38:47 - INFO - __main__ - Step 10073: {'lr': 0.0004963391170545193, 'samples': 1934016, 'steps': 10072, 'loss/train': 1.4278088808059692} 11/06/2021 22:38:48 - INFO - __main__ - Step 10074: {'lr': 0.0004963382121620639, 'samples': 1934208, 'steps': 10073, 'loss/train': 1.821343183517456} 11/06/2021 22:38:49 - INFO - __main__ - Step 10075: {'lr': 0.0004963373071586123, 'samples': 1934400, 'steps': 10074, 'loss/train': 2.091646909713745} 11/06/2021 22:38:49 - INFO - __main__ - Step 10076: {'lr': 0.000496336402044165, 'samples': 1934592, 'steps': 10075, 'loss/train': 1.838132619857788} 11/06/2021 22:38:49 - INFO - __main__ - Step 10077: {'lr': 0.0004963354968187222, 'samples': 1934784, 'steps': 10076, 'loss/train': 1.6847151517868042} 11/06/2021 22:38:50 - INFO - __main__ - Step 10078: {'lr': 0.0004963345914822845, 'samples': 1934976, 'steps': 10077, 'loss/train': 1.5948516130447388} 11/06/2021 22:38:51 - INFO - __main__ - Step 10079: {'lr': 0.0004963336860348521, 'samples': 1935168, 'steps': 10078, 'loss/train': 1.8252551555633545} 11/06/2021 22:38:51 - INFO - __main__ - Step 10080: {'lr': 0.0004963327804764257, 'samples': 1935360, 'steps': 10079, 'loss/train': 1.9088850021362305} 11/06/2021 22:38:52 - INFO - __main__ - Step 10081: {'lr': 0.0004963318748070056, 'samples': 1935552, 'steps': 10080, 'loss/train': 1.7089757919311523} 11/06/2021 22:38:52 - INFO - __main__ - Step 10082: {'lr': 0.0004963309690265921, 'samples': 1935744, 'steps': 10081, 'loss/train': 1.9372551441192627} 11/06/2021 22:38:52 - INFO - __main__ - Step 10083: {'lr': 0.0004963300631351856, 'samples': 1935936, 'steps': 10082, 'loss/train': 1.6880172491073608} 11/06/2021 22:38:53 - INFO - __main__ - Step 10084: {'lr': 0.0004963291571327866, 'samples': 1936128, 'steps': 10083, 'loss/train': 1.8040343523025513} 11/06/2021 22:38:54 - INFO - __main__ - Step 10085: {'lr': 0.0004963282510193955, 'samples': 1936320, 'steps': 10084, 'loss/train': 1.7773646116256714} 11/06/2021 22:38:54 - INFO - __main__ - Step 10086: {'lr': 0.0004963273447950126, 'samples': 1936512, 'steps': 10085, 'loss/train': 1.7499951124191284} 11/06/2021 22:38:54 - INFO - __main__ - Step 10087: {'lr': 0.0004963264384596386, 'samples': 1936704, 'steps': 10086, 'loss/train': 1.3979496955871582} 11/06/2021 22:38:55 - INFO - __main__ - Step 10088: {'lr': 0.0004963255320132735, 'samples': 1936896, 'steps': 10087, 'loss/train': 2.1118416786193848} 11/06/2021 22:38:55 - INFO - __main__ - Step 10089: {'lr': 0.0004963246254559181, 'samples': 1937088, 'steps': 10088, 'loss/train': 1.7357311248779297} 11/06/2021 22:38:56 - INFO - __main__ - Step 10090: {'lr': 0.0004963237187875724, 'samples': 1937280, 'steps': 10089, 'loss/train': 1.6976773738861084} 11/06/2021 22:38:56 - INFO - __main__ - Step 10091: {'lr': 0.0004963228120082372, 'samples': 1937472, 'steps': 10090, 'loss/train': 2.041823148727417} 11/06/2021 22:38:57 - INFO - __main__ - Step 10092: {'lr': 0.0004963219051179127, 'samples': 1937664, 'steps': 10091, 'loss/train': 1.77364182472229} 11/06/2021 22:38:57 - INFO - __main__ - Step 10093: {'lr': 0.0004963209981165993, 'samples': 1937856, 'steps': 10092, 'loss/train': 2.122255802154541} 11/06/2021 22:38:57 - INFO - __main__ - Step 10094: {'lr': 0.0004963200910042976, 'samples': 1938048, 'steps': 10093, 'loss/train': 1.8102829456329346} 11/06/2021 22:38:59 - INFO - __main__ - Step 10095: {'lr': 0.0004963191837810077, 'samples': 1938240, 'steps': 10094, 'loss/train': 1.7681382894515991} 11/06/2021 22:38:59 - INFO - __main__ - Step 10096: {'lr': 0.0004963182764467303, 'samples': 1938432, 'steps': 10095, 'loss/train': 1.8665366172790527} 11/06/2021 22:38:59 - INFO - __main__ - Step 10097: {'lr': 0.0004963173690014656, 'samples': 1938624, 'steps': 10096, 'loss/train': 2.1923022270202637} 11/06/2021 22:39:00 - INFO - __main__ - Step 10098: {'lr': 0.0004963164614452142, 'samples': 1938816, 'steps': 10097, 'loss/train': 1.7837992906570435} 11/06/2021 22:39:00 - INFO - __main__ - Step 10099: {'lr': 0.0004963155537779764, 'samples': 1939008, 'steps': 10098, 'loss/train': 1.7613967657089233} 11/06/2021 22:39:01 - INFO - __main__ - Step 10100: {'lr': 0.0004963146459997525, 'samples': 1939200, 'steps': 10099, 'loss/train': 0.7903871536254883} 11/06/2021 22:39:01 - INFO - __main__ - Step 10101: {'lr': 0.0004963137381105431, 'samples': 1939392, 'steps': 10100, 'loss/train': 2.0459229946136475} 11/06/2021 22:39:02 - INFO - __main__ - Step 10102: {'lr': 0.0004963128301103485, 'samples': 1939584, 'steps': 10101, 'loss/train': 1.6579885482788086} 11/06/2021 22:39:02 - INFO - __main__ - Step 10103: {'lr': 0.0004963119219991691, 'samples': 1939776, 'steps': 10102, 'loss/train': 1.9690332412719727} 11/06/2021 22:39:02 - INFO - __main__ - Step 10104: {'lr': 0.0004963110137770054, 'samples': 1939968, 'steps': 10103, 'loss/train': 1.717148780822754} 11/06/2021 22:39:04 - INFO - __main__ - Step 10105: {'lr': 0.0004963101054438578, 'samples': 1940160, 'steps': 10104, 'loss/train': 2.5613667964935303} 11/06/2021 22:39:04 - INFO - __main__ - Step 10106: {'lr': 0.0004963091969997265, 'samples': 1940352, 'steps': 10105, 'loss/train': 1.79701566696167} 11/06/2021 22:39:05 - INFO - __main__ - Step 10107: {'lr': 0.0004963082884446123, 'samples': 1940544, 'steps': 10106, 'loss/train': 1.8763283491134644} 11/06/2021 22:39:05 - INFO - __main__ - Step 10108: {'lr': 0.0004963073797785153, 'samples': 1940736, 'steps': 10107, 'loss/train': 1.9007415771484375} 11/06/2021 22:39:06 - INFO - __main__ - Step 10109: {'lr': 0.000496306471001436, 'samples': 1940928, 'steps': 10108, 'loss/train': 0.9993707537651062} 11/06/2021 22:39:06 - INFO - __main__ - Step 10110: {'lr': 0.0004963055621133748, 'samples': 1941120, 'steps': 10109, 'loss/train': 1.6748651266098022} 11/06/2021 22:39:06 - INFO - __main__ - Step 10111: {'lr': 0.0004963046531143321, 'samples': 1941312, 'steps': 10110, 'loss/train': 1.998529076576233} 11/06/2021 22:39:07 - INFO - __main__ - Step 10112: {'lr': 0.0004963037440043083, 'samples': 1941504, 'steps': 10111, 'loss/train': 1.8323516845703125} 11/06/2021 22:39:08 - INFO - __main__ - Step 10113: {'lr': 0.0004963028347833038, 'samples': 1941696, 'steps': 10112, 'loss/train': 2.443237066268921} 11/06/2021 22:39:08 - INFO - __main__ - Step 10114: {'lr': 0.0004963019254513191, 'samples': 1941888, 'steps': 10113, 'loss/train': 1.495759129524231} 11/06/2021 22:39:08 - INFO - __main__ - Step 10115: {'lr': 0.0004963010160083546, 'samples': 1942080, 'steps': 10114, 'loss/train': 1.9651312828063965} 11/06/2021 22:39:09 - INFO - __main__ - Step 10116: {'lr': 0.0004963001064544106, 'samples': 1942272, 'steps': 10115, 'loss/train': 1.4049568176269531} 11/06/2021 22:39:10 - INFO - __main__ - Step 10117: {'lr': 0.0004962991967894876, 'samples': 1942464, 'steps': 10116, 'loss/train': 1.3363804817199707} 11/06/2021 22:39:10 - INFO - __main__ - Step 10118: {'lr': 0.0004962982870135859, 'samples': 1942656, 'steps': 10117, 'loss/train': 1.6501933336257935} 11/06/2021 22:39:11 - INFO - __main__ - Step 10119: {'lr': 0.0004962973771267061, 'samples': 1942848, 'steps': 10118, 'loss/train': 1.7426259517669678} 11/06/2021 22:39:11 - INFO - __main__ - Step 10120: {'lr': 0.0004962964671288484, 'samples': 1943040, 'steps': 10119, 'loss/train': 1.8889999389648438} 11/06/2021 22:39:11 - INFO - __main__ - Step 10121: {'lr': 0.0004962955570200135, 'samples': 1943232, 'steps': 10120, 'loss/train': 1.7648907899856567} 11/06/2021 22:39:12 - INFO - __main__ - Step 10122: {'lr': 0.0004962946468002014, 'samples': 1943424, 'steps': 10121, 'loss/train': 2.164283514022827} 11/06/2021 22:39:13 - INFO - __main__ - Step 10123: {'lr': 0.0004962937364694129, 'samples': 1943616, 'steps': 10122, 'loss/train': 2.0995631217956543} 11/06/2021 22:39:13 - INFO - __main__ - Step 10124: {'lr': 0.0004962928260276481, 'samples': 1943808, 'steps': 10123, 'loss/train': 0.9570446014404297} 11/06/2021 22:39:13 - INFO - __main__ - Step 10125: {'lr': 0.0004962919154749077, 'samples': 1944000, 'steps': 10124, 'loss/train': 1.9521758556365967} 11/06/2021 22:39:14 - INFO - __main__ - Step 10126: {'lr': 0.0004962910048111919, 'samples': 1944192, 'steps': 10125, 'loss/train': 1.8387959003448486} 11/06/2021 22:39:14 - INFO - __main__ - Step 10127: {'lr': 0.0004962900940365012, 'samples': 1944384, 'steps': 10126, 'loss/train': 1.178709864616394} 11/06/2021 22:39:15 - INFO - __main__ - Step 10128: {'lr': 0.0004962891831508359, 'samples': 1944576, 'steps': 10127, 'loss/train': 1.920640230178833} 11/06/2021 22:39:15 - INFO - __main__ - Step 10129: {'lr': 0.0004962882721541965, 'samples': 1944768, 'steps': 10128, 'loss/train': 1.703497290611267} 11/06/2021 22:39:16 - INFO - __main__ - Step 10130: {'lr': 0.0004962873610465835, 'samples': 1944960, 'steps': 10129, 'loss/train': 1.4937912225723267} 11/06/2021 22:39:16 - INFO - __main__ - Step 10131: {'lr': 0.0004962864498279972, 'samples': 1945152, 'steps': 10130, 'loss/train': 1.5674489736557007} 11/06/2021 22:39:17 - INFO - __main__ - Step 10132: {'lr': 0.000496285538498438, 'samples': 1945344, 'steps': 10131, 'loss/train': 1.649821162223816} 11/06/2021 22:39:18 - INFO - __main__ - Step 10133: {'lr': 0.0004962846270579062, 'samples': 1945536, 'steps': 10132, 'loss/train': 1.809035062789917} 11/06/2021 22:39:18 - INFO - __main__ - Step 10134: {'lr': 0.0004962837155064025, 'samples': 1945728, 'steps': 10133, 'loss/train': 1.8651890754699707} 11/06/2021 22:39:18 - INFO - __main__ - Step 10135: {'lr': 0.0004962828038439272, 'samples': 1945920, 'steps': 10134, 'loss/train': 1.8161650896072388} 11/06/2021 22:39:19 - INFO - __main__ - Step 10136: {'lr': 0.0004962818920704805, 'samples': 1946112, 'steps': 10135, 'loss/train': 2.1492648124694824} 11/06/2021 22:39:19 - INFO - __main__ - Step 10137: {'lr': 0.0004962809801860632, 'samples': 1946304, 'steps': 10136, 'loss/train': 1.2017062902450562} 11/06/2021 22:39:20 - INFO - __main__ - Step 10138: {'lr': 0.0004962800681906753, 'samples': 1946496, 'steps': 10137, 'loss/train': 1.8302303552627563} 11/06/2021 22:39:20 - INFO - __main__ - Step 10139: {'lr': 0.0004962791560843175, 'samples': 1946688, 'steps': 10138, 'loss/train': 1.9001319408416748} 11/06/2021 22:39:21 - INFO - __main__ - Step 10140: {'lr': 0.00049627824386699, 'samples': 1946880, 'steps': 10139, 'loss/train': 1.766602873802185} 11/06/2021 22:39:21 - INFO - __main__ - Step 10141: {'lr': 0.0004962773315386935, 'samples': 1947072, 'steps': 10140, 'loss/train': 2.360720157623291} 11/06/2021 22:39:21 - INFO - __main__ - Step 10142: {'lr': 0.0004962764190994282, 'samples': 1947264, 'steps': 10141, 'loss/train': 1.9311211109161377} 11/06/2021 22:39:22 - INFO - __main__ - Step 10143: {'lr': 0.0004962755065491944, 'samples': 1947456, 'steps': 10142, 'loss/train': 1.6629748344421387} 11/06/2021 22:39:23 - INFO - __main__ - Step 10144: {'lr': 0.0004962745938879928, 'samples': 1947648, 'steps': 10143, 'loss/train': 2.1842613220214844} 11/06/2021 22:39:23 - INFO - __main__ - Step 10145: {'lr': 0.0004962736811158236, 'samples': 1947840, 'steps': 10144, 'loss/train': 1.8755285739898682} 11/06/2021 22:39:23 - INFO - __main__ - Step 10146: {'lr': 0.0004962727682326873, 'samples': 1948032, 'steps': 10145, 'loss/train': 1.2577018737792969} 11/06/2021 22:39:24 - INFO - __main__ - Step 10147: {'lr': 0.0004962718552385843, 'samples': 1948224, 'steps': 10146, 'loss/train': 1.5316803455352783} 11/06/2021 22:39:25 - INFO - __main__ - Step 10148: {'lr': 0.000496270942133515, 'samples': 1948416, 'steps': 10147, 'loss/train': 2.5291173458099365} 11/06/2021 22:39:25 - INFO - __main__ - Step 10149: {'lr': 0.0004962700289174798, 'samples': 1948608, 'steps': 10148, 'loss/train': 1.7601622343063354} 11/06/2021 22:39:26 - INFO - __main__ - Step 10150: {'lr': 0.0004962691155904791, 'samples': 1948800, 'steps': 10149, 'loss/train': 1.9934760332107544} 11/06/2021 22:39:26 - INFO - __main__ - Step 10151: {'lr': 0.0004962682021525134, 'samples': 1948992, 'steps': 10150, 'loss/train': 1.820966124534607} 11/06/2021 22:39:26 - INFO - __main__ - Step 10152: {'lr': 0.000496267288603583, 'samples': 1949184, 'steps': 10151, 'loss/train': 1.7241519689559937} 11/06/2021 22:39:27 - INFO - __main__ - Step 10153: {'lr': 0.0004962663749436883, 'samples': 1949376, 'steps': 10152, 'loss/train': 1.250434160232544} 11/06/2021 22:39:28 - INFO - __main__ - Step 10154: {'lr': 0.0004962654611728299, 'samples': 1949568, 'steps': 10153, 'loss/train': 1.601660966873169} 11/06/2021 22:39:28 - INFO - __main__ - Step 10155: {'lr': 0.000496264547291008, 'samples': 1949760, 'steps': 10154, 'loss/train': 1.6472324132919312} 11/06/2021 22:39:28 - INFO - __main__ - Step 10156: {'lr': 0.0004962636332982232, 'samples': 1949952, 'steps': 10155, 'loss/train': 0.9378107786178589} 11/06/2021 22:39:29 - INFO - __main__ - Step 10157: {'lr': 0.0004962627191944756, 'samples': 1950144, 'steps': 10156, 'loss/train': 1.7389650344848633} 11/06/2021 22:39:29 - INFO - __main__ - Step 10158: {'lr': 0.000496261804979766, 'samples': 1950336, 'steps': 10157, 'loss/train': 1.7131973505020142} 11/06/2021 22:39:30 - INFO - __main__ - Step 10159: {'lr': 0.0004962608906540946, 'samples': 1950528, 'steps': 10158, 'loss/train': 1.5908807516098022} 11/06/2021 22:39:30 - INFO - __main__ - Step 10160: {'lr': 0.0004962599762174618, 'samples': 1950720, 'steps': 10159, 'loss/train': 1.8197942972183228} 11/06/2021 22:39:31 - INFO - __main__ - Step 10161: {'lr': 0.0004962590616698681, 'samples': 1950912, 'steps': 10160, 'loss/train': 1.3586111068725586} 11/06/2021 22:39:31 - INFO - __main__ - Step 10162: {'lr': 0.0004962581470113138, 'samples': 1951104, 'steps': 10161, 'loss/train': 1.4758445024490356} 11/06/2021 22:39:31 - INFO - __main__ - Step 10163: {'lr': 0.0004962572322417994, 'samples': 1951296, 'steps': 10162, 'loss/train': 2.033109426498413} 11/06/2021 22:39:33 - INFO - __main__ - Step 10164: {'lr': 0.0004962563173613254, 'samples': 1951488, 'steps': 10163, 'loss/train': 2.092374324798584} 11/06/2021 22:39:33 - INFO - __main__ - Step 10165: {'lr': 0.000496255402369892, 'samples': 1951680, 'steps': 10164, 'loss/train': 1.1480693817138672} 11/06/2021 22:39:33 - INFO - __main__ - Step 10166: {'lr': 0.0004962544872674997, 'samples': 1951872, 'steps': 10165, 'loss/train': 1.6419637203216553} 11/06/2021 22:39:34 - INFO - __main__ - Step 10167: {'lr': 0.000496253572054149, 'samples': 1952064, 'steps': 10166, 'loss/train': 2.001441240310669} 11/06/2021 22:39:34 - INFO - __main__ - Step 10168: {'lr': 0.0004962526567298402, 'samples': 1952256, 'steps': 10167, 'loss/train': 0.8515803217887878} 11/06/2021 22:39:35 - INFO - __main__ - Step 10169: {'lr': 0.0004962517412945738, 'samples': 1952448, 'steps': 10168, 'loss/train': 1.8724167346954346} 11/06/2021 22:39:35 - INFO - __main__ - Step 10170: {'lr': 0.00049625082574835, 'samples': 1952640, 'steps': 10169, 'loss/train': 1.8901619911193848} 11/06/2021 22:39:36 - INFO - __main__ - Step 10171: {'lr': 0.0004962499100911696, 'samples': 1952832, 'steps': 10170, 'loss/train': 1.726631999015808} 11/06/2021 22:39:36 - INFO - __main__ - Step 10172: {'lr': 0.0004962489943230326, 'samples': 1953024, 'steps': 10171, 'loss/train': 1.2755616903305054} 11/06/2021 22:39:36 - INFO - __main__ - Step 10173: {'lr': 0.0004962480784439397, 'samples': 1953216, 'steps': 10172, 'loss/train': 1.9039602279663086} 11/06/2021 22:39:38 - INFO - __main__ - Step 10174: {'lr': 0.0004962471624538913, 'samples': 1953408, 'steps': 10173, 'loss/train': 2.694628953933716} 11/06/2021 22:39:38 - INFO - __main__ - Step 10175: {'lr': 0.0004962462463528875, 'samples': 1953600, 'steps': 10174, 'loss/train': 1.5455642938613892} 11/06/2021 22:39:38 - INFO - __main__ - Step 10176: {'lr': 0.0004962453301409291, 'samples': 1953792, 'steps': 10175, 'loss/train': 2.612506866455078} 11/06/2021 22:39:39 - INFO - __main__ - Step 10177: {'lr': 0.0004962444138180164, 'samples': 1953984, 'steps': 10176, 'loss/train': 1.5094280242919922} 11/06/2021 22:39:39 - INFO - __main__ - Step 10178: {'lr': 0.0004962434973841497, 'samples': 1954176, 'steps': 10177, 'loss/train': 1.740395426750183} 11/06/2021 22:39:39 - INFO - __main__ - Step 10179: {'lr': 0.0004962425808393295, 'samples': 1954368, 'steps': 10178, 'loss/train': 1.4363911151885986} 11/06/2021 22:39:40 - INFO - __main__ - Step 10180: {'lr': 0.000496241664183556, 'samples': 1954560, 'steps': 10179, 'loss/train': 1.8234004974365234} 11/06/2021 22:39:41 - INFO - __main__ - Step 10181: {'lr': 0.0004962407474168301, 'samples': 1954752, 'steps': 10180, 'loss/train': 2.219465970993042} 11/06/2021 22:39:41 - INFO - __main__ - Step 10182: {'lr': 0.0004962398305391518, 'samples': 1954944, 'steps': 10181, 'loss/train': 1.733228087425232} 11/06/2021 22:39:41 - INFO - __main__ - Step 10183: {'lr': 0.0004962389135505217, 'samples': 1955136, 'steps': 10182, 'loss/train': 1.9149515628814697} 11/06/2021 22:39:42 - INFO - __main__ - Step 10184: {'lr': 0.00049623799645094, 'samples': 1955328, 'steps': 10183, 'loss/train': 1.2486367225646973} 11/06/2021 22:39:43 - INFO - __main__ - Step 10185: {'lr': 0.0004962370792404073, 'samples': 1955520, 'steps': 10184, 'loss/train': 1.7573951482772827} 11/06/2021 22:39:43 - INFO - __main__ - Step 10186: {'lr': 0.000496236161918924, 'samples': 1955712, 'steps': 10185, 'loss/train': 2.166736125946045} 11/06/2021 22:39:43 - INFO - __main__ - Step 10187: {'lr': 0.0004962352444864904, 'samples': 1955904, 'steps': 10186, 'loss/train': 1.8243874311447144} 11/06/2021 22:39:44 - INFO - __main__ - Step 10188: {'lr': 0.0004962343269431072, 'samples': 1956096, 'steps': 10187, 'loss/train': 1.3378742933273315} 11/06/2021 22:39:44 - INFO - __main__ - Step 10189: {'lr': 0.0004962334092887744, 'samples': 1956288, 'steps': 10188, 'loss/train': 1.5770504474639893} 11/06/2021 22:39:45 - INFO - __main__ - Step 10190: {'lr': 0.0004962324915234928, 'samples': 1956480, 'steps': 10189, 'loss/train': 1.703421711921692} 11/06/2021 22:39:46 - INFO - __main__ - Step 10191: {'lr': 0.0004962315736472626, 'samples': 1956672, 'steps': 10190, 'loss/train': 1.7759064435958862} 11/06/2021 22:39:46 - INFO - __main__ - Step 10192: {'lr': 0.0004962306556600842, 'samples': 1956864, 'steps': 10191, 'loss/train': 1.7851485013961792} 11/06/2021 22:39:46 - INFO - __main__ - Step 10193: {'lr': 0.0004962297375619581, 'samples': 1957056, 'steps': 10192, 'loss/train': 1.8357267379760742} 11/06/2021 22:39:47 - INFO - __main__ - Step 10194: {'lr': 0.0004962288193528846, 'samples': 1957248, 'steps': 10193, 'loss/train': 1.8837649822235107} 11/06/2021 22:39:48 - INFO - __main__ - Step 10195: {'lr': 0.0004962279010328642, 'samples': 1957440, 'steps': 10194, 'loss/train': 1.5941526889801025} 11/06/2021 22:39:48 - INFO - __main__ - Step 10196: {'lr': 0.0004962269826018974, 'samples': 1957632, 'steps': 10195, 'loss/train': 1.2281931638717651} 11/06/2021 22:39:48 - INFO - __main__ - Step 10197: {'lr': 0.0004962260640599845, 'samples': 1957824, 'steps': 10196, 'loss/train': 1.6017422676086426} 11/06/2021 22:39:49 - INFO - __main__ - Step 10198: {'lr': 0.0004962251454071259, 'samples': 1958016, 'steps': 10197, 'loss/train': 1.4461283683776855} 11/06/2021 22:39:49 - INFO - __main__ - Step 10199: {'lr': 0.0004962242266433221, 'samples': 1958208, 'steps': 10198, 'loss/train': 1.918282151222229} 11/06/2021 22:39:51 - INFO - __main__ - Step 10200: {'lr': 0.0004962233077685734, 'samples': 1958400, 'steps': 10199, 'loss/train': 1.5870373249053955} 11/06/2021 22:39:51 - INFO - __main__ - Step 10201: {'lr': 0.0004962223887828803, 'samples': 1958592, 'steps': 10200, 'loss/train': 1.9776827096939087} 11/06/2021 22:39:52 - INFO - __main__ - Step 10202: {'lr': 0.0004962214696862432, 'samples': 1958784, 'steps': 10201, 'loss/train': 1.859332799911499} 11/06/2021 22:39:52 - INFO - __main__ - Step 10203: {'lr': 0.0004962205504786626, 'samples': 1958976, 'steps': 10202, 'loss/train': 1.5587915182113647} 11/06/2021 22:39:52 - INFO - __main__ - Step 10204: {'lr': 0.0004962196311601386, 'samples': 1959168, 'steps': 10203, 'loss/train': 2.007300615310669} 11/06/2021 22:39:53 - INFO - __main__ - Step 10205: {'lr': 0.000496218711730672, 'samples': 1959360, 'steps': 10204, 'loss/train': 1.5240142345428467} 11/06/2021 22:39:53 - INFO - __main__ - Step 10206: {'lr': 0.000496217792190263, 'samples': 1959552, 'steps': 10205, 'loss/train': 1.8262863159179688} 11/06/2021 22:39:54 - INFO - __main__ - Step 10207: {'lr': 0.0004962168725389121, 'samples': 1959744, 'steps': 10206, 'loss/train': 1.8134788274765015} 11/06/2021 22:39:54 - INFO - __main__ - Step 10208: {'lr': 0.0004962159527766196, 'samples': 1959936, 'steps': 10207, 'loss/train': 1.8182828426361084} 11/06/2021 22:39:55 - INFO - __main__ - Step 10209: {'lr': 0.000496215032903386, 'samples': 1960128, 'steps': 10208, 'loss/train': 2.6121604442596436} 11/06/2021 22:39:55 - INFO - __main__ - Step 10210: {'lr': 0.0004962141129192118, 'samples': 1960320, 'steps': 10209, 'loss/train': 1.8937207460403442} 11/06/2021 22:39:56 - INFO - __main__ - Step 10211: {'lr': 0.0004962131928240972, 'samples': 1960512, 'steps': 10210, 'loss/train': 2.1297640800476074} 11/06/2021 22:39:56 - INFO - __main__ - Step 10212: {'lr': 0.0004962122726180428, 'samples': 1960704, 'steps': 10211, 'loss/train': 1.8525015115737915} 11/06/2021 22:39:57 - INFO - __main__ - Step 10213: {'lr': 0.000496211352301049, 'samples': 1960896, 'steps': 10212, 'loss/train': 2.0309906005859375} 11/06/2021 22:39:57 - INFO - __main__ - Step 10214: {'lr': 0.0004962104318731161, 'samples': 1961088, 'steps': 10213, 'loss/train': 1.6677395105361938} 11/06/2021 22:39:58 - INFO - __main__ - Step 10215: {'lr': 0.0004962095113342445, 'samples': 1961280, 'steps': 10214, 'loss/train': 1.9678434133529663} 11/06/2021 22:39:58 - INFO - __main__ - Step 10216: {'lr': 0.0004962085906844348, 'samples': 1961472, 'steps': 10215, 'loss/train': 1.773742437362671} 11/06/2021 22:39:58 - INFO - __main__ - Step 10217: {'lr': 0.0004962076699236873, 'samples': 1961664, 'steps': 10216, 'loss/train': 1.3391557931900024} 11/06/2021 22:39:59 - INFO - __main__ - Step 10218: {'lr': 0.0004962067490520024, 'samples': 1961856, 'steps': 10217, 'loss/train': 1.5753556489944458} 11/06/2021 22:40:00 - INFO - __main__ - Step 10219: {'lr': 0.0004962058280693805, 'samples': 1962048, 'steps': 10218, 'loss/train': 1.1909321546554565} 11/06/2021 22:40:00 - INFO - __main__ - Step 10220: {'lr': 0.0004962049069758221, 'samples': 1962240, 'steps': 10219, 'loss/train': 2.3441052436828613} 11/06/2021 22:40:00 - INFO - __main__ - Step 10221: {'lr': 0.0004962039857713276, 'samples': 1962432, 'steps': 10220, 'loss/train': 2.0248935222625732} 11/06/2021 22:40:01 - INFO - __main__ - Step 10222: {'lr': 0.0004962030644558974, 'samples': 1962624, 'steps': 10221, 'loss/train': 2.0207021236419678} 11/06/2021 22:40:02 - INFO - __main__ - Step 10223: {'lr': 0.0004962021430295319, 'samples': 1962816, 'steps': 10222, 'loss/train': 2.0765750408172607} 11/06/2021 22:40:02 - INFO - __main__ - Step 10224: {'lr': 0.0004962012214922314, 'samples': 1963008, 'steps': 10223, 'loss/train': 1.82279372215271} 11/06/2021 22:40:02 - INFO - __main__ - Step 10225: {'lr': 0.0004962002998439966, 'samples': 1963200, 'steps': 10224, 'loss/train': 2.235682725906372} 11/06/2021 22:40:03 - INFO - __main__ - Step 10226: {'lr': 0.0004961993780848276, 'samples': 1963392, 'steps': 10225, 'loss/train': 1.372536301612854} 11/06/2021 22:40:03 - INFO - __main__ - Step 10227: {'lr': 0.000496198456214725, 'samples': 1963584, 'steps': 10226, 'loss/train': 2.1596105098724365} 11/06/2021 22:40:04 - INFO - __main__ - Step 10228: {'lr': 0.0004961975342336891, 'samples': 1963776, 'steps': 10227, 'loss/train': 1.9234440326690674} 11/06/2021 22:40:05 - INFO - __main__ - Step 10229: {'lr': 0.0004961966121417204, 'samples': 1963968, 'steps': 10228, 'loss/train': 1.7350918054580688} 11/06/2021 22:40:05 - INFO - __main__ - Step 10230: {'lr': 0.0004961956899388195, 'samples': 1964160, 'steps': 10229, 'loss/train': 1.6040425300598145} 11/06/2021 22:40:05 - INFO - __main__ - Step 10231: {'lr': 0.0004961947676249864, 'samples': 1964352, 'steps': 10230, 'loss/train': 1.6913650035858154} 11/06/2021 22:40:06 - INFO - __main__ - Step 10232: {'lr': 0.0004961938452002218, 'samples': 1964544, 'steps': 10231, 'loss/train': 1.3374961614608765} 11/06/2021 22:40:06 - INFO - __main__ - Step 10233: {'lr': 0.0004961929226645261, 'samples': 1964736, 'steps': 10232, 'loss/train': 1.7283188104629517} 11/06/2021 22:40:07 - INFO - __main__ - Step 10234: {'lr': 0.0004961920000178996, 'samples': 1964928, 'steps': 10233, 'loss/train': 1.5170456171035767} 11/06/2021 22:40:07 - INFO - __main__ - Step 10235: {'lr': 0.0004961910772603429, 'samples': 1965120, 'steps': 10234, 'loss/train': 1.6485928297042847} 11/06/2021 22:40:08 - INFO - __main__ - Step 10236: {'lr': 0.0004961901543918563, 'samples': 1965312, 'steps': 10235, 'loss/train': 1.1824220418930054} 11/06/2021 22:40:08 - INFO - __main__ - Step 10237: {'lr': 0.0004961892314124401, 'samples': 1965504, 'steps': 10236, 'loss/train': 1.3500611782073975} 11/06/2021 22:40:08 - INFO - __main__ - Step 10238: {'lr': 0.0004961883083220948, 'samples': 1965696, 'steps': 10237, 'loss/train': 1.5080724954605103} 11/06/2021 22:40:09 - INFO - __main__ - Step 10239: {'lr': 0.0004961873851208209, 'samples': 1965888, 'steps': 10238, 'loss/train': 1.8465136289596558} 11/06/2021 22:40:10 - INFO - __main__ - Step 10240: {'lr': 0.0004961864618086188, 'samples': 1966080, 'steps': 10239, 'loss/train': 1.8443377017974854} 11/06/2021 22:40:10 - INFO - __main__ - Step 10241: {'lr': 0.0004961855383854889, 'samples': 1966272, 'steps': 10240, 'loss/train': 1.4369864463806152} 11/06/2021 22:40:10 - INFO - __main__ - Step 10242: {'lr': 0.0004961846148514315, 'samples': 1966464, 'steps': 10241, 'loss/train': 1.790596842765808} 11/06/2021 22:40:11 - INFO - __main__ - Step 10243: {'lr': 0.0004961836912064472, 'samples': 1966656, 'steps': 10242, 'loss/train': 1.661517858505249} 11/06/2021 22:40:12 - INFO - __main__ - Step 10244: {'lr': 0.0004961827674505363, 'samples': 1966848, 'steps': 10243, 'loss/train': 0.9283877015113831} 11/06/2021 22:40:12 - INFO - __main__ - Step 10245: {'lr': 0.0004961818435836993, 'samples': 1967040, 'steps': 10244, 'loss/train': 2.5082814693450928} 11/06/2021 22:40:13 - INFO - __main__ - Step 10246: {'lr': 0.0004961809196059365, 'samples': 1967232, 'steps': 10245, 'loss/train': 0.8113244771957397} 11/06/2021 22:40:13 - INFO - __main__ - Step 10247: {'lr': 0.0004961799955172483, 'samples': 1967424, 'steps': 10246, 'loss/train': 1.7437840700149536} 11/06/2021 22:40:13 - INFO - __main__ - Step 10248: {'lr': 0.0004961790713176353, 'samples': 1967616, 'steps': 10247, 'loss/train': 1.9482618570327759} 11/06/2021 22:40:14 - INFO - __main__ - Step 10249: {'lr': 0.0004961781470070978, 'samples': 1967808, 'steps': 10248, 'loss/train': 1.7293133735656738} 11/06/2021 22:40:15 - INFO - __main__ - Step 10250: {'lr': 0.0004961772225856362, 'samples': 1968000, 'steps': 10249, 'loss/train': 1.6525590419769287} 11/06/2021 22:40:15 - INFO - __main__ - Step 10251: {'lr': 0.0004961762980532509, 'samples': 1968192, 'steps': 10250, 'loss/train': 1.758000135421753} 11/06/2021 22:40:15 - INFO - __main__ - Step 10252: {'lr': 0.0004961753734099425, 'samples': 1968384, 'steps': 10251, 'loss/train': 1.3765827417373657} 11/06/2021 22:40:16 - INFO - __main__ - Step 10253: {'lr': 0.0004961744486557112, 'samples': 1968576, 'steps': 10252, 'loss/train': 1.9734203815460205} 11/06/2021 22:40:17 - INFO - __main__ - Step 10254: {'lr': 0.0004961735237905574, 'samples': 1968768, 'steps': 10253, 'loss/train': 1.5639272928237915} 11/06/2021 22:40:17 - INFO - __main__ - Step 10255: {'lr': 0.0004961725988144816, 'samples': 1968960, 'steps': 10254, 'loss/train': 1.7732101678848267} 11/06/2021 22:40:17 - INFO - __main__ - Step 10256: {'lr': 0.0004961716737274844, 'samples': 1969152, 'steps': 10255, 'loss/train': 1.2973039150238037} 11/06/2021 22:40:18 - INFO - __main__ - Step 10257: {'lr': 0.0004961707485295659, 'samples': 1969344, 'steps': 10256, 'loss/train': 1.6465765237808228} 11/06/2021 22:40:18 - INFO - __main__ - Step 10258: {'lr': 0.0004961698232207268, 'samples': 1969536, 'steps': 10257, 'loss/train': 1.0470538139343262} 11/06/2021 22:40:18 - INFO - __main__ - Step 10259: {'lr': 0.0004961688978009672, 'samples': 1969728, 'steps': 10258, 'loss/train': 1.853024959564209} 11/06/2021 22:40:19 - INFO - __main__ - Step 10260: {'lr': 0.0004961679722702879, 'samples': 1969920, 'steps': 10259, 'loss/train': 1.3597272634506226} 11/06/2021 22:40:20 - INFO - __main__ - Step 10261: {'lr': 0.0004961670466286889, 'samples': 1970112, 'steps': 10260, 'loss/train': 1.9587664604187012} 11/06/2021 22:40:20 - INFO - __main__ - Step 10262: {'lr': 0.000496166120876171, 'samples': 1970304, 'steps': 10261, 'loss/train': 1.988135814666748} 11/06/2021 22:40:21 - INFO - __main__ - Step 10263: {'lr': 0.0004961651950127343, 'samples': 1970496, 'steps': 10262, 'loss/train': 1.9414300918579102} 11/06/2021 22:40:21 - INFO - __main__ - Step 10264: {'lr': 0.0004961642690383794, 'samples': 1970688, 'steps': 10263, 'loss/train': 1.9191449880599976} 11/06/2021 22:40:22 - INFO - __main__ - Step 10265: {'lr': 0.0004961633429531068, 'samples': 1970880, 'steps': 10264, 'loss/train': 2.015342950820923} 11/06/2021 22:40:22 - INFO - __main__ - Step 10266: {'lr': 0.0004961624167569166, 'samples': 1971072, 'steps': 10265, 'loss/train': 1.3776711225509644} 11/06/2021 22:40:23 - INFO - __main__ - Step 10267: {'lr': 0.0004961614904498095, 'samples': 1971264, 'steps': 10266, 'loss/train': 2.4013566970825195} 11/06/2021 22:40:23 - INFO - __main__ - Step 10268: {'lr': 0.0004961605640317858, 'samples': 1971456, 'steps': 10267, 'loss/train': 1.6062275171279907} 11/06/2021 22:40:23 - INFO - __main__ - Step 10269: {'lr': 0.0004961596375028461, 'samples': 1971648, 'steps': 10268, 'loss/train': 1.3712677955627441} 11/06/2021 22:40:24 - INFO - __main__ - Step 10270: {'lr': 0.0004961587108629906, 'samples': 1971840, 'steps': 10269, 'loss/train': 1.1406784057617188} 11/06/2021 22:40:25 - INFO - __main__ - Step 10271: {'lr': 0.0004961577841122197, 'samples': 1972032, 'steps': 10270, 'loss/train': 1.5826045274734497} 11/06/2021 22:40:25 - INFO - __main__ - Step 10272: {'lr': 0.000496156857250534, 'samples': 1972224, 'steps': 10271, 'loss/train': 1.612403154373169} 11/06/2021 22:40:25 - INFO - __main__ - Step 10273: {'lr': 0.0004961559302779338, 'samples': 1972416, 'steps': 10272, 'loss/train': 1.9364298582077026} 11/06/2021 22:40:26 - INFO - __main__ - Step 10274: {'lr': 0.0004961550031944194, 'samples': 1972608, 'steps': 10273, 'loss/train': 1.6951217651367188} 11/06/2021 22:40:27 - INFO - __main__ - Step 10275: {'lr': 0.0004961540759999914, 'samples': 1972800, 'steps': 10274, 'loss/train': 2.1520490646362305} 11/06/2021 22:40:27 - INFO - __main__ - Step 10276: {'lr': 0.0004961531486946502, 'samples': 1972992, 'steps': 10275, 'loss/train': 1.9737766981124878} 11/06/2021 22:40:28 - INFO - __main__ - Step 10277: {'lr': 0.0004961522212783962, 'samples': 1973184, 'steps': 10276, 'loss/train': 1.5989540815353394} 11/06/2021 22:40:28 - INFO - __main__ - Step 10278: {'lr': 0.00049615129375123, 'samples': 1973376, 'steps': 10277, 'loss/train': 0.26742538809776306} 11/06/2021 22:40:29 - INFO - __main__ - Step 10279: {'lr': 0.0004961503661131515, 'samples': 1973568, 'steps': 10278, 'loss/train': 1.548420786857605} 11/06/2021 22:40:30 - INFO - __main__ - Step 10280: {'lr': 0.0004961494383641616, 'samples': 1973760, 'steps': 10279, 'loss/train': 1.5787936449050903} 11/06/2021 22:40:30 - INFO - __main__ - Step 10281: {'lr': 0.0004961485105042606, 'samples': 1973952, 'steps': 10280, 'loss/train': 2.117009401321411} 11/06/2021 22:40:30 - INFO - __main__ - Step 10282: {'lr': 0.0004961475825334488, 'samples': 1974144, 'steps': 10281, 'loss/train': 1.2511065006256104} 11/06/2021 22:40:31 - INFO - __main__ - Step 10283: {'lr': 0.0004961466544517267, 'samples': 1974336, 'steps': 10282, 'loss/train': 2.109795570373535} 11/06/2021 22:40:31 - INFO - __main__ - Step 10284: {'lr': 0.0004961457262590948, 'samples': 1974528, 'steps': 10283, 'loss/train': 1.5047543048858643} 11/06/2021 22:40:31 - INFO - __main__ - Step 10285: {'lr': 0.0004961447979555533, 'samples': 1974720, 'steps': 10284, 'loss/train': 1.6194933652877808} 11/06/2021 22:40:32 - INFO - __main__ - Step 10286: {'lr': 0.000496143869541103, 'samples': 1974912, 'steps': 10285, 'loss/train': 1.41692054271698} 11/06/2021 22:40:33 - INFO - __main__ - Step 10287: {'lr': 0.0004961429410157437, 'samples': 1975104, 'steps': 10286, 'loss/train': 1.688002586364746} 11/06/2021 22:40:33 - INFO - __main__ - Step 10288: {'lr': 0.0004961420123794764, 'samples': 1975296, 'steps': 10287, 'loss/train': 1.7760323286056519} 11/06/2021 22:40:34 - INFO - __main__ - Step 10289: {'lr': 0.0004961410836323014, 'samples': 1975488, 'steps': 10288, 'loss/train': 1.8118081092834473} 11/06/2021 22:40:34 - INFO - __main__ - Step 10290: {'lr': 0.0004961401547742189, 'samples': 1975680, 'steps': 10289, 'loss/train': 1.0009771585464478} 11/06/2021 22:40:35 - INFO - __main__ - Step 10291: {'lr': 0.0004961392258052294, 'samples': 1975872, 'steps': 10290, 'loss/train': 1.9054882526397705} 11/06/2021 22:40:35 - INFO - __main__ - Step 10292: {'lr': 0.0004961382967253335, 'samples': 1976064, 'steps': 10291, 'loss/train': 1.815101981163025} 11/06/2021 22:40:36 - INFO - __main__ - Step 10293: {'lr': 0.0004961373675345315, 'samples': 1976256, 'steps': 10292, 'loss/train': 1.6685025691986084} 11/06/2021 22:40:36 - INFO - __main__ - Step 10294: {'lr': 0.0004961364382328236, 'samples': 1976448, 'steps': 10293, 'loss/train': 1.8122519254684448} 11/06/2021 22:40:36 - INFO - __main__ - Step 10295: {'lr': 0.0004961355088202106, 'samples': 1976640, 'steps': 10294, 'loss/train': 1.760857105255127} 11/06/2021 22:40:37 - INFO - __main__ - Step 10296: {'lr': 0.0004961345792966926, 'samples': 1976832, 'steps': 10295, 'loss/train': 2.0797855854034424} 11/06/2021 22:40:38 - INFO - __main__ - Step 10297: {'lr': 0.0004961336496622702, 'samples': 1977024, 'steps': 10296, 'loss/train': 0.4716089367866516} 11/06/2021 22:40:38 - INFO - __main__ - Step 10298: {'lr': 0.0004961327199169438, 'samples': 1977216, 'steps': 10297, 'loss/train': 1.8852925300598145} 11/06/2021 22:40:39 - INFO - __main__ - Step 10299: {'lr': 0.0004961317900607138, 'samples': 1977408, 'steps': 10298, 'loss/train': 2.788236141204834} 11/06/2021 22:40:39 - INFO - __main__ - Step 10300: {'lr': 0.0004961308600935807, 'samples': 1977600, 'steps': 10299, 'loss/train': 1.8009984493255615} 11/06/2021 22:40:39 - INFO - __main__ - Step 10301: {'lr': 0.0004961299300155446, 'samples': 1977792, 'steps': 10300, 'loss/train': 1.9995626211166382} 11/06/2021 22:40:40 - INFO - __main__ - Step 10302: {'lr': 0.0004961289998266064, 'samples': 1977984, 'steps': 10301, 'loss/train': 1.1890935897827148} 11/06/2021 22:40:41 - INFO - __main__ - Step 10303: {'lr': 0.0004961280695267662, 'samples': 1978176, 'steps': 10302, 'loss/train': 1.4275264739990234} 11/06/2021 22:40:41 - INFO - __main__ - Step 10304: {'lr': 0.0004961271391160243, 'samples': 1978368, 'steps': 10303, 'loss/train': 1.8430927991867065} 11/06/2021 22:40:41 - INFO - __main__ - Step 10305: {'lr': 0.0004961262085943815, 'samples': 1978560, 'steps': 10304, 'loss/train': 1.3772211074829102} 11/06/2021 22:40:42 - INFO - __main__ - Step 10306: {'lr': 0.000496125277961838, 'samples': 1978752, 'steps': 10305, 'loss/train': 1.3290560245513916} 11/06/2021 22:40:43 - INFO - __main__ - Step 10307: {'lr': 0.0004961243472183942, 'samples': 1978944, 'steps': 10306, 'loss/train': 1.4642337560653687} 11/06/2021 22:40:43 - INFO - __main__ - Step 10308: {'lr': 0.0004961234163640507, 'samples': 1979136, 'steps': 10307, 'loss/train': 1.9507293701171875} 11/06/2021 22:40:43 - INFO - __main__ - Step 10309: {'lr': 0.0004961224853988076, 'samples': 1979328, 'steps': 10308, 'loss/train': 2.0494842529296875} 11/06/2021 22:40:44 - INFO - __main__ - Step 10310: {'lr': 0.0004961215543226657, 'samples': 1979520, 'steps': 10309, 'loss/train': 1.839885950088501} 11/06/2021 22:40:44 - INFO - __main__ - Step 10311: {'lr': 0.0004961206231356251, 'samples': 1979712, 'steps': 10310, 'loss/train': 1.5155304670333862} 11/06/2021 22:40:45 - INFO - __main__ - Step 10312: {'lr': 0.0004961196918376864, 'samples': 1979904, 'steps': 10311, 'loss/train': 1.361377239227295} 11/06/2021 22:40:46 - INFO - __main__ - Step 10313: {'lr': 0.0004961187604288498, 'samples': 1980096, 'steps': 10312, 'loss/train': 1.2928553819656372} 11/06/2021 22:40:46 - INFO - __main__ - Step 10314: {'lr': 0.0004961178289091161, 'samples': 1980288, 'steps': 10313, 'loss/train': 1.8654111623764038} 11/06/2021 22:40:46 - INFO - __main__ - Step 10315: {'lr': 0.0004961168972784855, 'samples': 1980480, 'steps': 10314, 'loss/train': 1.5969183444976807} 11/06/2021 22:40:47 - INFO - __main__ - Step 10316: {'lr': 0.0004961159655369582, 'samples': 1980672, 'steps': 10315, 'loss/train': 1.5698388814926147} 11/06/2021 22:40:48 - INFO - __main__ - Step 10317: {'lr': 0.0004961150336845351, 'samples': 1980864, 'steps': 10316, 'loss/train': 1.988451600074768} 11/06/2021 22:40:48 - INFO - __main__ - Step 10318: {'lr': 0.0004961141017212162, 'samples': 1981056, 'steps': 10317, 'loss/train': 1.8295103311538696} 11/06/2021 22:40:48 - INFO - __main__ - Step 10319: {'lr': 0.0004961131696470021, 'samples': 1981248, 'steps': 10318, 'loss/train': 1.9807548522949219} 11/06/2021 22:40:49 - INFO - __main__ - Step 10320: {'lr': 0.0004961122374618933, 'samples': 1981440, 'steps': 10319, 'loss/train': 1.6604183912277222} 11/06/2021 22:40:49 - INFO - __main__ - Step 10321: {'lr': 0.00049611130516589, 'samples': 1981632, 'steps': 10320, 'loss/train': 1.369404911994934} 11/06/2021 22:40:49 - INFO - __main__ - Step 10322: {'lr': 0.0004961103727589929, 'samples': 1981824, 'steps': 10321, 'loss/train': 1.0568779706954956} 11/06/2021 22:40:50 - INFO - __main__ - Step 10323: {'lr': 0.0004961094402412021, 'samples': 1982016, 'steps': 10322, 'loss/train': 1.5058151483535767} 11/06/2021 22:40:51 - INFO - __main__ - Step 10324: {'lr': 0.0004961085076125182, 'samples': 1982208, 'steps': 10323, 'loss/train': 1.933563470840454} 11/06/2021 22:40:51 - INFO - __main__ - Step 10325: {'lr': 0.0004961075748729418, 'samples': 1982400, 'steps': 10324, 'loss/train': 1.765858769416809} 11/06/2021 22:40:51 - INFO - __main__ - Step 10326: {'lr': 0.0004961066420224729, 'samples': 1982592, 'steps': 10325, 'loss/train': 1.309130072593689} 11/06/2021 22:40:52 - INFO - __main__ - Step 10327: {'lr': 0.0004961057090611123, 'samples': 1982784, 'steps': 10326, 'loss/train': 1.8366987705230713} 11/06/2021 22:40:53 - INFO - __main__ - Step 10328: {'lr': 0.0004961047759888601, 'samples': 1982976, 'steps': 10327, 'loss/train': 1.7899473905563354} 11/06/2021 22:40:53 - INFO - __main__ - Step 10329: {'lr': 0.000496103842805717, 'samples': 1983168, 'steps': 10328, 'loss/train': 1.4649665355682373} 11/06/2021 22:40:54 - INFO - __main__ - Step 10330: {'lr': 0.0004961029095116833, 'samples': 1983360, 'steps': 10329, 'loss/train': 2.0712430477142334} 11/06/2021 22:40:54 - INFO - __main__ - Step 10331: {'lr': 0.0004961019761067594, 'samples': 1983552, 'steps': 10330, 'loss/train': 2.5029542446136475} 11/06/2021 22:40:54 - INFO - __main__ - Step 10332: {'lr': 0.0004961010425909458, 'samples': 1983744, 'steps': 10331, 'loss/train': 2.1280171871185303} 11/06/2021 22:40:55 - INFO - __main__ - Step 10333: {'lr': 0.0004961001089642428, 'samples': 1983936, 'steps': 10332, 'loss/train': 1.8614065647125244} 11/06/2021 22:40:56 - INFO - __main__ - Step 10334: {'lr': 0.000496099175226651, 'samples': 1984128, 'steps': 10333, 'loss/train': 2.124863862991333} 11/06/2021 22:40:56 - INFO - __main__ - Step 10335: {'lr': 0.0004960982413781705, 'samples': 1984320, 'steps': 10334, 'loss/train': 1.0853912830352783} 11/06/2021 22:40:56 - INFO - __main__ - Step 10336: {'lr': 0.0004960973074188021, 'samples': 1984512, 'steps': 10335, 'loss/train': 1.535399317741394} 11/06/2021 22:40:57 - INFO - __main__ - Step 10337: {'lr': 0.000496096373348546, 'samples': 1984704, 'steps': 10336, 'loss/train': 1.792807936668396} 11/06/2021 22:40:58 - INFO - __main__ - Step 10338: {'lr': 0.0004960954391674026, 'samples': 1984896, 'steps': 10337, 'loss/train': 1.693973183631897} 11/06/2021 22:40:58 - INFO - __main__ - Step 10339: {'lr': 0.0004960945048753725, 'samples': 1985088, 'steps': 10338, 'loss/train': 2.293344020843506} 11/06/2021 22:40:58 - INFO - __main__ - Step 10340: {'lr': 0.000496093570472456, 'samples': 1985280, 'steps': 10339, 'loss/train': 1.3442373275756836} 11/06/2021 22:40:59 - INFO - __main__ - Step 10341: {'lr': 0.0004960926359586535, 'samples': 1985472, 'steps': 10340, 'loss/train': 1.554911732673645} 11/06/2021 22:40:59 - INFO - __main__ - Step 10342: {'lr': 0.0004960917013339656, 'samples': 1985664, 'steps': 10341, 'loss/train': 2.2928225994110107} 11/06/2021 22:41:00 - INFO - __main__ - Step 10343: {'lr': 0.0004960907665983923, 'samples': 1985856, 'steps': 10342, 'loss/train': 1.8382941484451294} 11/06/2021 22:41:00 - INFO - __main__ - Step 10344: {'lr': 0.0004960898317519345, 'samples': 1986048, 'steps': 10343, 'loss/train': 1.9304149150848389} 11/06/2021 22:41:01 - INFO - __main__ - Step 10345: {'lr': 0.0004960888967945924, 'samples': 1986240, 'steps': 10344, 'loss/train': 1.7264574766159058} 11/06/2021 22:41:01 - INFO - __main__ - Step 10346: {'lr': 0.0004960879617263664, 'samples': 1986432, 'steps': 10345, 'loss/train': 1.2233792543411255} 11/06/2021 22:41:02 - INFO - __main__ - Step 10347: {'lr': 0.000496087026547257, 'samples': 1986624, 'steps': 10346, 'loss/train': 1.9374359846115112} 11/06/2021 22:41:02 - INFO - __main__ - Step 10348: {'lr': 0.0004960860912572645, 'samples': 1986816, 'steps': 10347, 'loss/train': 1.7407712936401367} 11/06/2021 22:41:03 - INFO - __main__ - Step 10349: {'lr': 0.0004960851558563895, 'samples': 1987008, 'steps': 10348, 'loss/train': 1.7582014799118042} 11/06/2021 22:41:04 - INFO - __main__ - Step 10350: {'lr': 0.0004960842203446322, 'samples': 1987200, 'steps': 10349, 'loss/train': 0.7253603935241699} 11/06/2021 22:41:04 - INFO - __main__ - Step 10351: {'lr': 0.0004960832847219933, 'samples': 1987392, 'steps': 10350, 'loss/train': 1.7652232646942139} 11/06/2021 22:41:04 - INFO - __main__ - Step 10352: {'lr': 0.000496082348988473, 'samples': 1987584, 'steps': 10351, 'loss/train': 1.9226405620574951} 11/06/2021 22:41:05 - INFO - __main__ - Step 10353: {'lr': 0.0004960814131440717, 'samples': 1987776, 'steps': 10352, 'loss/train': 2.1182456016540527} 11/06/2021 22:41:06 - INFO - __main__ - Step 10354: {'lr': 0.0004960804771887901, 'samples': 1987968, 'steps': 10353, 'loss/train': 1.939481496810913} 11/06/2021 22:41:06 - INFO - __main__ - Step 10355: {'lr': 0.0004960795411226283, 'samples': 1988160, 'steps': 10354, 'loss/train': 2.0060768127441406} 11/06/2021 22:41:06 - INFO - __main__ - Step 10356: {'lr': 0.0004960786049455868, 'samples': 1988352, 'steps': 10355, 'loss/train': 1.5645967721939087} 11/06/2021 22:41:07 - INFO - __main__ - Step 10357: {'lr': 0.0004960776686576663, 'samples': 1988544, 'steps': 10356, 'loss/train': 1.7053444385528564} 11/06/2021 22:41:07 - INFO - __main__ - Step 10358: {'lr': 0.0004960767322588668, 'samples': 1988736, 'steps': 10357, 'loss/train': 1.8968679904937744} 11/06/2021 22:41:08 - INFO - __main__ - Step 10359: {'lr': 0.000496075795749189, 'samples': 1988928, 'steps': 10358, 'loss/train': 1.6383986473083496} 11/06/2021 22:41:08 - INFO - __main__ - Step 10360: {'lr': 0.0004960748591286332, 'samples': 1989120, 'steps': 10359, 'loss/train': 2.3410348892211914} 11/06/2021 22:41:09 - INFO - __main__ - Step 10361: {'lr': 0.0004960739223971999, 'samples': 1989312, 'steps': 10360, 'loss/train': 2.367793560028076} 11/06/2021 22:41:09 - INFO - __main__ - Step 10362: {'lr': 0.0004960729855548895, 'samples': 1989504, 'steps': 10361, 'loss/train': 1.650147795677185} 11/06/2021 22:41:10 - INFO - __main__ - Step 10363: {'lr': 0.0004960720486017025, 'samples': 1989696, 'steps': 10362, 'loss/train': 0.8240565657615662} 11/06/2021 22:41:11 - INFO - __main__ - Step 10364: {'lr': 0.0004960711115376391, 'samples': 1989888, 'steps': 10363, 'loss/train': 2.253253698348999} 11/06/2021 22:41:11 - INFO - __main__ - Step 10365: {'lr': 0.0004960701743626999, 'samples': 1990080, 'steps': 10364, 'loss/train': 1.3892946243286133} 11/06/2021 22:41:11 - INFO - __main__ - Step 10366: {'lr': 0.0004960692370768853, 'samples': 1990272, 'steps': 10365, 'loss/train': 1.6433247327804565} 11/06/2021 22:41:12 - INFO - __main__ - Step 10367: {'lr': 0.0004960682996801956, 'samples': 1990464, 'steps': 10366, 'loss/train': 1.6244982481002808} 11/06/2021 22:41:12 - INFO - __main__ - Step 10368: {'lr': 0.0004960673621726314, 'samples': 1990656, 'steps': 10367, 'loss/train': 1.839497685432434} 11/06/2021 22:41:13 - INFO - __main__ - Step 10369: {'lr': 0.000496066424554193, 'samples': 1990848, 'steps': 10368, 'loss/train': 1.401164174079895} 11/06/2021 22:41:14 - INFO - __main__ - Step 10370: {'lr': 0.0004960654868248809, 'samples': 1991040, 'steps': 10369, 'loss/train': 2.057040214538574} 11/06/2021 22:41:14 - INFO - __main__ - Step 10371: {'lr': 0.0004960645489846955, 'samples': 1991232, 'steps': 10370, 'loss/train': 2.0657546520233154} 11/06/2021 22:41:14 - INFO - __main__ - Step 10372: {'lr': 0.0004960636110336371, 'samples': 1991424, 'steps': 10371, 'loss/train': 1.8819034099578857} 11/06/2021 22:41:15 - INFO - __main__ - Step 10373: {'lr': 0.0004960626729717064, 'samples': 1991616, 'steps': 10372, 'loss/train': 1.3991193771362305} 11/06/2021 22:41:15 - INFO - __main__ - Step 10374: {'lr': 0.0004960617347989036, 'samples': 1991808, 'steps': 10373, 'loss/train': 1.9511737823486328} 11/06/2021 22:41:17 - INFO - __main__ - Step 10375: {'lr': 0.0004960607965152292, 'samples': 1992000, 'steps': 10374, 'loss/train': 1.4117087125778198} 11/06/2021 22:41:17 - INFO - __main__ - Step 10376: {'lr': 0.0004960598581206835, 'samples': 1992192, 'steps': 10375, 'loss/train': 1.0104448795318604} 11/06/2021 22:41:18 - INFO - __main__ - Step 10377: {'lr': 0.000496058919615267, 'samples': 1992384, 'steps': 10376, 'loss/train': 1.3164684772491455} 11/06/2021 22:41:18 - INFO - __main__ - Step 10378: {'lr': 0.0004960579809989803, 'samples': 1992576, 'steps': 10377, 'loss/train': 1.1793992519378662} 11/06/2021 22:41:19 - INFO - __main__ - Step 10379: {'lr': 0.0004960570422718237, 'samples': 1992768, 'steps': 10378, 'loss/train': 0.8159379959106445} 11/06/2021 22:41:19 - INFO - __main__ - Step 10380: {'lr': 0.0004960561034337975, 'samples': 1992960, 'steps': 10379, 'loss/train': 2.050475835800171} 11/06/2021 22:41:19 - INFO - __main__ - Step 10381: {'lr': 0.0004960551644849022, 'samples': 1993152, 'steps': 10380, 'loss/train': 1.5734912157058716} 11/06/2021 22:41:20 - INFO - __main__ - Step 10382: {'lr': 0.0004960542254251382, 'samples': 1993344, 'steps': 10381, 'loss/train': 1.4547319412231445} 11/06/2021 22:41:21 - INFO - __main__ - Step 10383: {'lr': 0.0004960532862545061, 'samples': 1993536, 'steps': 10382, 'loss/train': 1.9347224235534668} 11/06/2021 22:41:21 - INFO - __main__ - Step 10384: {'lr': 0.0004960523469730061, 'samples': 1993728, 'steps': 10383, 'loss/train': 1.8350725173950195} 11/06/2021 22:41:21 - INFO - __main__ - Step 10385: {'lr': 0.0004960514075806387, 'samples': 1993920, 'steps': 10384, 'loss/train': 2.118435859680176} 11/06/2021 22:41:22 - INFO - __main__ - Step 10386: {'lr': 0.0004960504680774043, 'samples': 1994112, 'steps': 10385, 'loss/train': 1.8689095973968506} 11/06/2021 22:41:23 - INFO - __main__ - Step 10387: {'lr': 0.0004960495284633034, 'samples': 1994304, 'steps': 10386, 'loss/train': 0.7011041045188904} 11/06/2021 22:41:23 - INFO - __main__ - Step 10388: {'lr': 0.0004960485887383363, 'samples': 1994496, 'steps': 10387, 'loss/train': 1.9033364057540894} 11/06/2021 22:41:23 - INFO - __main__ - Step 10389: {'lr': 0.0004960476489025037, 'samples': 1994688, 'steps': 10388, 'loss/train': 1.7955809831619263} 11/06/2021 22:41:24 - INFO - __main__ - Step 10390: {'lr': 0.0004960467089558057, 'samples': 1994880, 'steps': 10389, 'loss/train': 1.9313750267028809} 11/06/2021 22:41:24 - INFO - __main__ - Step 10391: {'lr': 0.0004960457688982428, 'samples': 1995072, 'steps': 10390, 'loss/train': 1.3544516563415527} 11/06/2021 22:41:25 - INFO - __main__ - Step 10392: {'lr': 0.0004960448287298156, 'samples': 1995264, 'steps': 10391, 'loss/train': 1.409073829650879} 11/06/2021 22:41:26 - INFO - __main__ - Step 10393: {'lr': 0.0004960438884505242, 'samples': 1995456, 'steps': 10392, 'loss/train': 1.8170816898345947} 11/06/2021 22:41:26 - INFO - __main__ - Step 10394: {'lr': 0.0004960429480603694, 'samples': 1995648, 'steps': 10393, 'loss/train': 1.6994777917861938} 11/06/2021 22:41:26 - INFO - __main__ - Step 10395: {'lr': 0.0004960420075593515, 'samples': 1995840, 'steps': 10394, 'loss/train': 1.7487272024154663} 11/06/2021 22:41:27 - INFO - __main__ - Step 10396: {'lr': 0.0004960410669474708, 'samples': 1996032, 'steps': 10395, 'loss/train': 1.8390122652053833} 11/06/2021 22:41:28 - INFO - __main__ - Step 10397: {'lr': 0.0004960401262247277, 'samples': 1996224, 'steps': 10396, 'loss/train': 1.0365424156188965} 11/06/2021 22:41:28 - INFO - __main__ - Step 10398: {'lr': 0.0004960391853911228, 'samples': 1996416, 'steps': 10397, 'loss/train': 1.6385880708694458} 11/06/2021 22:41:29 - INFO - __main__ - Step 10399: {'lr': 0.0004960382444466564, 'samples': 1996608, 'steps': 10398, 'loss/train': 1.1302746534347534} 11/06/2021 22:41:29 - INFO - __main__ - Step 10400: {'lr': 0.0004960373033913289, 'samples': 1996800, 'steps': 10399, 'loss/train': 1.6192007064819336} 11/06/2021 22:41:29 - INFO - __main__ - Step 10401: {'lr': 0.0004960363622251409, 'samples': 1996992, 'steps': 10400, 'loss/train': 1.7356743812561035} 11/06/2021 22:41:30 - INFO - __main__ - Step 10402: {'lr': 0.0004960354209480927, 'samples': 1997184, 'steps': 10401, 'loss/train': 1.772154688835144} 11/06/2021 22:41:31 - INFO - __main__ - Step 10403: {'lr': 0.0004960344795601847, 'samples': 1997376, 'steps': 10402, 'loss/train': 2.327993869781494} 11/06/2021 22:41:31 - INFO - __main__ - Step 10404: {'lr': 0.0004960335380614174, 'samples': 1997568, 'steps': 10403, 'loss/train': 1.9825865030288696} 11/06/2021 22:41:31 - INFO - __main__ - Step 10405: {'lr': 0.0004960325964517912, 'samples': 1997760, 'steps': 10404, 'loss/train': 1.8894060850143433} 11/06/2021 22:41:32 - INFO - __main__ - Step 10406: {'lr': 0.0004960316547313064, 'samples': 1997952, 'steps': 10405, 'loss/train': 1.8726656436920166} 11/06/2021 22:41:33 - INFO - __main__ - Step 10407: {'lr': 0.0004960307128999636, 'samples': 1998144, 'steps': 10406, 'loss/train': 1.447789192199707} 11/06/2021 22:41:33 - INFO - __main__ - Step 10408: {'lr': 0.0004960297709577632, 'samples': 1998336, 'steps': 10407, 'loss/train': 2.224510431289673} 11/06/2021 22:41:34 - INFO - __main__ - Step 10409: {'lr': 0.0004960288289047054, 'samples': 1998528, 'steps': 10408, 'loss/train': 1.373806118965149} 11/06/2021 22:41:34 - INFO - __main__ - Step 10410: {'lr': 0.000496027886740791, 'samples': 1998720, 'steps': 10409, 'loss/train': 1.4621202945709229} 11/06/2021 22:41:34 - INFO - __main__ - Step 10411: {'lr': 0.0004960269444660201, 'samples': 1998912, 'steps': 10410, 'loss/train': 1.9223895072937012} 11/06/2021 22:41:35 - INFO - __main__ - Step 10412: {'lr': 0.0004960260020803934, 'samples': 1999104, 'steps': 10411, 'loss/train': 1.9031519889831543} 11/06/2021 22:41:36 - INFO - __main__ - Step 10413: {'lr': 0.0004960250595839111, 'samples': 1999296, 'steps': 10412, 'loss/train': 1.6633156538009644} 11/06/2021 22:41:36 - INFO - __main__ - Step 10414: {'lr': 0.0004960241169765737, 'samples': 1999488, 'steps': 10413, 'loss/train': 1.947817087173462} 11/06/2021 22:41:36 - INFO - __main__ - Step 10415: {'lr': 0.0004960231742583817, 'samples': 1999680, 'steps': 10414, 'loss/train': 1.6474344730377197} 11/06/2021 22:41:37 - INFO - __main__ - Step 10416: {'lr': 0.0004960222314293354, 'samples': 1999872, 'steps': 10415, 'loss/train': 1.6221576929092407} 11/06/2021 22:41:37 - INFO - __main__ - Step 10417: {'lr': 0.0004960212884894353, 'samples': 2000064, 'steps': 10416, 'loss/train': 1.5149108171463013} 11/06/2021 22:41:38 - INFO - __main__ - Step 10418: {'lr': 0.0004960203454386817, 'samples': 2000256, 'steps': 10417, 'loss/train': 0.9100244641304016} 11/06/2021 22:41:38 - INFO - __main__ - Step 10419: {'lr': 0.0004960194022770753, 'samples': 2000448, 'steps': 10418, 'loss/train': 1.549153447151184} 11/06/2021 22:41:39 - INFO - __main__ - Step 10420: {'lr': 0.0004960184590046162, 'samples': 2000640, 'steps': 10419, 'loss/train': 1.8034104108810425} 11/06/2021 22:41:39 - INFO - __main__ - Step 10421: {'lr': 0.0004960175156213051, 'samples': 2000832, 'steps': 10420, 'loss/train': 0.9687737226486206} 11/06/2021 22:41:39 - INFO - __main__ - Step 10422: {'lr': 0.0004960165721271422, 'samples': 2001024, 'steps': 10421, 'loss/train': 1.8846495151519775} 11/06/2021 22:41:41 - INFO - __main__ - Step 10423: {'lr': 0.000496015628522128, 'samples': 2001216, 'steps': 10422, 'loss/train': 1.9125192165374756} 11/06/2021 22:41:41 - INFO - __main__ - Step 10424: {'lr': 0.000496014684806263, 'samples': 2001408, 'steps': 10423, 'loss/train': 1.8119043111801147} 11/06/2021 22:41:41 - INFO - __main__ - Step 10425: {'lr': 0.0004960137409795477, 'samples': 2001600, 'steps': 10424, 'loss/train': 2.1726603507995605} 11/06/2021 22:41:42 - INFO - __main__ - Step 10426: {'lr': 0.0004960127970419822, 'samples': 2001792, 'steps': 10425, 'loss/train': 1.9231642484664917} 11/06/2021 22:41:42 - INFO - __main__ - Step 10427: {'lr': 0.0004960118529935674, 'samples': 2001984, 'steps': 10426, 'loss/train': 2.1175436973571777} 11/06/2021 22:41:43 - INFO - __main__ - Step 10428: {'lr': 0.0004960109088343032, 'samples': 2002176, 'steps': 10427, 'loss/train': 2.0986135005950928} 11/06/2021 22:41:43 - INFO - __main__ - Step 10429: {'lr': 0.0004960099645641903, 'samples': 2002368, 'steps': 10428, 'loss/train': 1.7129641771316528} 11/06/2021 22:41:44 - INFO - __main__ - Step 10430: {'lr': 0.0004960090201832293, 'samples': 2002560, 'steps': 10429, 'loss/train': 1.1784454584121704} 11/06/2021 22:41:44 - INFO - __main__ - Step 10431: {'lr': 0.0004960080756914203, 'samples': 2002752, 'steps': 10430, 'loss/train': 1.253516674041748} 11/06/2021 22:41:44 - INFO - __main__ - Step 10432: {'lr': 0.0004960071310887638, 'samples': 2002944, 'steps': 10431, 'loss/train': 1.8971530199050903} 11/06/2021 22:41:45 - INFO - __main__ - Step 10433: {'lr': 0.0004960061863752604, 'samples': 2003136, 'steps': 10432, 'loss/train': 2.065613031387329} 11/06/2021 22:41:46 - INFO - __main__ - Step 10434: {'lr': 0.0004960052415509103, 'samples': 2003328, 'steps': 10433, 'loss/train': 1.7110601663589478} 11/06/2021 22:41:46 - INFO - __main__ - Step 10435: {'lr': 0.0004960042966157141, 'samples': 2003520, 'steps': 10434, 'loss/train': 1.3858282566070557} 11/06/2021 22:41:47 - INFO - __main__ - Step 10436: {'lr': 0.0004960033515696722, 'samples': 2003712, 'steps': 10435, 'loss/train': 1.303482174873352} 11/06/2021 22:41:47 - INFO - __main__ - Step 10437: {'lr': 0.0004960024064127849, 'samples': 2003904, 'steps': 10436, 'loss/train': 1.2062408924102783} 11/06/2021 22:41:47 - INFO - __main__ - Step 10438: {'lr': 0.0004960014611450527, 'samples': 2004096, 'steps': 10437, 'loss/train': 1.7718604803085327} 11/06/2021 22:41:48 - INFO - __main__ - Step 10439: {'lr': 0.0004960005157664762, 'samples': 2004288, 'steps': 10438, 'loss/train': 1.6597301959991455} 11/06/2021 22:41:49 - INFO - __main__ - Step 10440: {'lr': 0.0004959995702770555, 'samples': 2004480, 'steps': 10439, 'loss/train': 1.9467318058013916} 11/06/2021 22:41:49 - INFO - __main__ - Step 10441: {'lr': 0.0004959986246767913, 'samples': 2004672, 'steps': 10440, 'loss/train': 2.24183988571167} 11/06/2021 22:41:49 - INFO - __main__ - Step 10442: {'lr': 0.0004959976789656838, 'samples': 2004864, 'steps': 10441, 'loss/train': 1.8617457151412964} 11/06/2021 22:41:50 - INFO - __main__ - Step 10443: {'lr': 0.0004959967331437336, 'samples': 2005056, 'steps': 10442, 'loss/train': 2.394559621810913} 11/06/2021 22:41:51 - INFO - __main__ - Step 10444: {'lr': 0.0004959957872109411, 'samples': 2005248, 'steps': 10443, 'loss/train': 2.127652168273926} 11/06/2021 22:41:51 - INFO - __main__ - Step 10445: {'lr': 0.0004959948411673066, 'samples': 2005440, 'steps': 10444, 'loss/train': 1.733871579170227} 11/06/2021 22:41:51 - INFO - __main__ - Step 10446: {'lr': 0.0004959938950128308, 'samples': 2005632, 'steps': 10445, 'loss/train': 1.8825232982635498} 11/06/2021 22:41:52 - INFO - __main__ - Step 10447: {'lr': 0.0004959929487475138, 'samples': 2005824, 'steps': 10446, 'loss/train': 1.3970377445220947} 11/06/2021 22:41:52 - INFO - __main__ - Step 10448: {'lr': 0.0004959920023713563, 'samples': 2006016, 'steps': 10447, 'loss/train': 1.3850414752960205} 11/06/2021 22:41:52 - INFO - __main__ - Step 10449: {'lr': 0.0004959910558843584, 'samples': 2006208, 'steps': 10448, 'loss/train': 1.893357753753662} 11/06/2021 22:41:54 - INFO - __main__ - Step 10450: {'lr': 0.0004959901092865208, 'samples': 2006400, 'steps': 10449, 'loss/train': 1.3027065992355347} 11/06/2021 22:41:54 - INFO - __main__ - Step 10451: {'lr': 0.0004959891625778438, 'samples': 2006592, 'steps': 10450, 'loss/train': 2.1753814220428467} 11/06/2021 22:41:54 - INFO - __main__ - Step 10452: {'lr': 0.0004959882157583281, 'samples': 2006784, 'steps': 10451, 'loss/train': 1.65367591381073} 11/06/2021 22:41:55 - INFO - __main__ - Step 10453: {'lr': 0.0004959872688279737, 'samples': 2006976, 'steps': 10452, 'loss/train': 1.8160592317581177} 11/06/2021 22:41:55 - INFO - __main__ - Step 10454: {'lr': 0.0004959863217867814, 'samples': 2007168, 'steps': 10453, 'loss/train': 1.6027116775512695} 11/06/2021 22:41:56 - INFO - __main__ - Step 10455: {'lr': 0.0004959853746347513, 'samples': 2007360, 'steps': 10454, 'loss/train': 1.9197403192520142} 11/06/2021 22:41:57 - INFO - __main__ - Step 10456: {'lr': 0.0004959844273718841, 'samples': 2007552, 'steps': 10455, 'loss/train': 1.6267117261886597} 11/06/2021 22:41:57 - INFO - __main__ - Step 10457: {'lr': 0.00049598347999818, 'samples': 2007744, 'steps': 10456, 'loss/train': 1.6874116659164429} 11/06/2021 22:41:58 - INFO - __main__ - Step 10458: {'lr': 0.0004959825325136396, 'samples': 2007936, 'steps': 10457, 'loss/train': 1.9572356939315796} 11/06/2021 22:41:58 - INFO - __main__ - Step 10459: {'lr': 0.0004959815849182633, 'samples': 2008128, 'steps': 10458, 'loss/train': 1.2736876010894775} 11/06/2021 22:41:58 - INFO - __main__ - Step 10460: {'lr': 0.0004959806372120515, 'samples': 2008320, 'steps': 10459, 'loss/train': 1.9439113140106201} 11/06/2021 22:41:59 - INFO - __main__ - Step 10461: {'lr': 0.0004959796893950045, 'samples': 2008512, 'steps': 10460, 'loss/train': 2.2238245010375977} 11/06/2021 22:42:00 - INFO - __main__ - Step 10462: {'lr': 0.0004959787414671229, 'samples': 2008704, 'steps': 10461, 'loss/train': 1.8460416793823242} 11/06/2021 22:42:00 - INFO - __main__ - Step 10463: {'lr': 0.000495977793428407, 'samples': 2008896, 'steps': 10462, 'loss/train': 1.9288679361343384} 11/06/2021 22:42:00 - INFO - __main__ - Step 10464: {'lr': 0.0004959768452788575, 'samples': 2009088, 'steps': 10463, 'loss/train': 1.7732502222061157} 11/06/2021 22:42:01 - INFO - __main__ - Step 10465: {'lr': 0.0004959758970184745, 'samples': 2009280, 'steps': 10464, 'loss/train': 2.398732900619507} 11/06/2021 22:42:02 - INFO - __main__ - Step 10466: {'lr': 0.0004959749486472587, 'samples': 2009472, 'steps': 10465, 'loss/train': 1.7253568172454834} 11/06/2021 22:42:02 - INFO - __main__ - Step 10467: {'lr': 0.0004959740001652102, 'samples': 2009664, 'steps': 10466, 'loss/train': 1.70595121383667} 11/06/2021 22:42:02 - INFO - __main__ - Step 10468: {'lr': 0.0004959730515723298, 'samples': 2009856, 'steps': 10467, 'loss/train': 1.9712741374969482} 11/06/2021 22:42:03 - INFO - __main__ - Step 10469: {'lr': 0.0004959721028686175, 'samples': 2010048, 'steps': 10468, 'loss/train': 1.4620518684387207} 11/06/2021 22:42:03 - INFO - __main__ - Step 10470: {'lr': 0.0004959711540540741, 'samples': 2010240, 'steps': 10469, 'loss/train': 1.9116092920303345} 11/06/2021 22:42:04 - INFO - __main__ - Step 10471: {'lr': 0.0004959702051286999, 'samples': 2010432, 'steps': 10470, 'loss/train': 2.0080511569976807} 11/06/2021 22:42:04 - INFO - __main__ - Step 10472: {'lr': 0.0004959692560924954, 'samples': 2010624, 'steps': 10471, 'loss/train': 1.8201708793640137} 11/06/2021 22:42:05 - INFO - __main__ - Step 10473: {'lr': 0.0004959683069454608, 'samples': 2010816, 'steps': 10472, 'loss/train': 1.4937697649002075} 11/06/2021 22:42:05 - INFO - __main__ - Step 10474: {'lr': 0.0004959673576875967, 'samples': 2011008, 'steps': 10473, 'loss/train': 1.4996908903121948} 11/06/2021 22:42:06 - INFO - __main__ - Step 10475: {'lr': 0.0004959664083189035, 'samples': 2011200, 'steps': 10474, 'loss/train': 1.8766359090805054} 11/06/2021 22:42:07 - INFO - __main__ - Step 10476: {'lr': 0.0004959654588393818, 'samples': 2011392, 'steps': 10475, 'loss/train': 2.539846181869507} 11/06/2021 22:42:07 - INFO - __main__ - Step 10477: {'lr': 0.0004959645092490316, 'samples': 2011584, 'steps': 10476, 'loss/train': 2.0664992332458496} 11/06/2021 22:42:07 - INFO - __main__ - Step 10478: {'lr': 0.0004959635595478537, 'samples': 2011776, 'steps': 10477, 'loss/train': 2.1357581615448} 11/06/2021 22:42:08 - INFO - __main__ - Step 10479: {'lr': 0.0004959626097358485, 'samples': 2011968, 'steps': 10478, 'loss/train': 1.7041106224060059} 11/06/2021 22:42:08 - INFO - __main__ - Step 10480: {'lr': 0.0004959616598130162, 'samples': 2012160, 'steps': 10479, 'loss/train': 1.5367162227630615} 11/06/2021 22:42:09 - INFO - __main__ - Step 10481: {'lr': 0.0004959607097793575, 'samples': 2012352, 'steps': 10480, 'loss/train': 0.8361077904701233} 11/06/2021 22:42:09 - INFO - __main__ - Step 10482: {'lr': 0.0004959597596348726, 'samples': 2012544, 'steps': 10481, 'loss/train': 1.425622582435608} 11/06/2021 22:42:10 - INFO - __main__ - Step 10483: {'lr': 0.0004959588093795621, 'samples': 2012736, 'steps': 10482, 'loss/train': 1.8034032583236694} 11/06/2021 22:42:10 - INFO - __main__ - Step 10484: {'lr': 0.0004959578590134262, 'samples': 2012928, 'steps': 10483, 'loss/train': 1.7551143169403076} 11/06/2021 22:42:10 - INFO - __main__ - Step 10485: {'lr': 0.0004959569085364657, 'samples': 2013120, 'steps': 10484, 'loss/train': 1.8540403842926025} 11/06/2021 22:42:11 - INFO - __main__ - Step 10486: {'lr': 0.0004959559579486807, 'samples': 2013312, 'steps': 10485, 'loss/train': 1.9935126304626465} 11/06/2021 22:42:12 - INFO - __main__ - Step 10487: {'lr': 0.0004959550072500718, 'samples': 2013504, 'steps': 10486, 'loss/train': 1.6862492561340332} 11/06/2021 22:42:12 - INFO - __main__ - Step 10488: {'lr': 0.0004959540564406393, 'samples': 2013696, 'steps': 10487, 'loss/train': 1.781096339225769} 11/06/2021 22:42:12 - INFO - __main__ - Step 10489: {'lr': 0.0004959531055203837, 'samples': 2013888, 'steps': 10488, 'loss/train': 1.8593416213989258} 11/06/2021 22:42:13 - INFO - __main__ - Step 10490: {'lr': 0.0004959521544893055, 'samples': 2014080, 'steps': 10489, 'loss/train': 1.739823579788208} 11/06/2021 22:42:13 - INFO - __main__ - Step 10491: {'lr': 0.000495951203347405, 'samples': 2014272, 'steps': 10490, 'loss/train': 1.9146326780319214} 11/06/2021 22:42:14 - INFO - __main__ - Step 10492: {'lr': 0.0004959502520946827, 'samples': 2014464, 'steps': 10491, 'loss/train': 2.1610023975372314} 11/06/2021 22:42:14 - INFO - __main__ - Step 10493: {'lr': 0.000495949300731139, 'samples': 2014656, 'steps': 10492, 'loss/train': 1.6168802976608276} 11/06/2021 22:42:15 - INFO - __main__ - Step 10494: {'lr': 0.0004959483492567744, 'samples': 2014848, 'steps': 10493, 'loss/train': 1.6835715770721436} 11/06/2021 22:42:15 - INFO - __main__ - Step 10495: {'lr': 0.0004959473976715892, 'samples': 2015040, 'steps': 10494, 'loss/train': 1.6348047256469727} 11/06/2021 22:42:15 - INFO - __main__ - Step 10496: {'lr': 0.0004959464459755839, 'samples': 2015232, 'steps': 10495, 'loss/train': 1.2869349718093872} 11/06/2021 22:42:17 - INFO - __main__ - Step 10497: {'lr': 0.0004959454941687589, 'samples': 2015424, 'steps': 10496, 'loss/train': 1.733815312385559} 11/06/2021 22:42:17 - INFO - __main__ - Step 10498: {'lr': 0.0004959445422511148, 'samples': 2015616, 'steps': 10497, 'loss/train': 1.9846241474151611} 11/06/2021 22:42:17 - INFO - __main__ - Step 10499: {'lr': 0.0004959435902226517, 'samples': 2015808, 'steps': 10498, 'loss/train': 0.9969847202301025} 11/06/2021 22:42:18 - INFO - __main__ - Step 10500: {'lr': 0.0004959426380833703, 'samples': 2016000, 'steps': 10499, 'loss/train': 1.6832598447799683} 11/06/2021 22:42:18 - INFO - __main__ - Step 10501: {'lr': 0.0004959416858332709, 'samples': 2016192, 'steps': 10500, 'loss/train': 1.6795216798782349} 11/06/2021 22:42:19 - INFO - __main__ - Step 10502: {'lr': 0.000495940733472354, 'samples': 2016384, 'steps': 10501, 'loss/train': 2.3554139137268066} 11/06/2021 22:42:19 - INFO - __main__ - Step 10503: {'lr': 0.00049593978100062, 'samples': 2016576, 'steps': 10502, 'loss/train': 1.8953803777694702} 11/06/2021 22:42:20 - INFO - __main__ - Step 10504: {'lr': 0.0004959388284180694, 'samples': 2016768, 'steps': 10503, 'loss/train': 2.1253671646118164} 11/06/2021 22:42:20 - INFO - __main__ - Step 10505: {'lr': 0.0004959378757247024, 'samples': 2016960, 'steps': 10504, 'loss/train': 1.8000476360321045} 11/06/2021 22:42:20 - INFO - __main__ - Step 10506: {'lr': 0.0004959369229205197, 'samples': 2017152, 'steps': 10505, 'loss/train': 0.36025920510292053} 11/06/2021 22:42:21 - INFO - __main__ - Step 10507: {'lr': 0.0004959359700055216, 'samples': 2017344, 'steps': 10506, 'loss/train': 1.7624599933624268} 11/06/2021 22:42:22 - INFO - __main__ - Step 10508: {'lr': 0.0004959350169797085, 'samples': 2017536, 'steps': 10507, 'loss/train': 1.786071538925171} 11/06/2021 22:42:22 - INFO - __main__ - Step 10509: {'lr': 0.000495934063843081, 'samples': 2017728, 'steps': 10508, 'loss/train': 2.0693178176879883} 11/06/2021 22:42:23 - INFO - __main__ - Step 10510: {'lr': 0.0004959331105956393, 'samples': 2017920, 'steps': 10509, 'loss/train': 1.65921950340271} 11/06/2021 22:42:23 - INFO - __main__ - Step 10511: {'lr': 0.000495932157237384, 'samples': 2018112, 'steps': 10510, 'loss/train': 1.9849227666854858} 11/06/2021 22:42:23 - INFO - __main__ - Step 10512: {'lr': 0.0004959312037683154, 'samples': 2018304, 'steps': 10511, 'loss/train': 1.6077988147735596} 11/06/2021 22:42:24 - INFO - __main__ - Step 10513: {'lr': 0.0004959302501884341, 'samples': 2018496, 'steps': 10512, 'loss/train': 1.6553412675857544} 11/06/2021 22:42:25 - INFO - __main__ - Step 10514: {'lr': 0.0004959292964977403, 'samples': 2018688, 'steps': 10513, 'loss/train': 1.3393796682357788} 11/06/2021 22:42:25 - INFO - __main__ - Step 10515: {'lr': 0.0004959283426962345, 'samples': 2018880, 'steps': 10514, 'loss/train': 1.1275650262832642} 11/06/2021 22:42:25 - INFO - __main__ - Step 10516: {'lr': 0.0004959273887839175, 'samples': 2019072, 'steps': 10515, 'loss/train': 1.4780120849609375} 11/06/2021 22:42:26 - INFO - __main__ - Step 10517: {'lr': 0.000495926434760789, 'samples': 2019264, 'steps': 10516, 'loss/train': 1.8599119186401367} 11/06/2021 22:42:27 - INFO - __main__ - Step 10518: {'lr': 0.0004959254806268501, 'samples': 2019456, 'steps': 10517, 'loss/train': 1.910403847694397} 11/06/2021 22:42:27 - INFO - __main__ - Step 10519: {'lr': 0.0004959245263821009, 'samples': 2019648, 'steps': 10518, 'loss/train': 1.8654783964157104} 11/06/2021 22:42:27 - INFO - __main__ - Step 10520: {'lr': 0.0004959235720265419, 'samples': 2019840, 'steps': 10519, 'loss/train': 1.3041726350784302} 11/06/2021 22:42:28 - INFO - __main__ - Step 10521: {'lr': 0.0004959226175601736, 'samples': 2020032, 'steps': 10520, 'loss/train': 1.7402362823486328} 11/06/2021 22:42:28 - INFO - __main__ - Step 10522: {'lr': 0.0004959216629829964, 'samples': 2020224, 'steps': 10521, 'loss/train': 1.8779191970825195} 11/06/2021 22:42:29 - INFO - __main__ - Step 10523: {'lr': 0.0004959207082950105, 'samples': 2020416, 'steps': 10522, 'loss/train': 1.9938749074935913} 11/06/2021 22:42:29 - INFO - __main__ - Step 10524: {'lr': 0.0004959197534962166, 'samples': 2020608, 'steps': 10523, 'loss/train': 1.7240444421768188} 11/06/2021 22:42:30 - INFO - __main__ - Step 10525: {'lr': 0.0004959187985866152, 'samples': 2020800, 'steps': 10524, 'loss/train': 1.3333297967910767} 11/06/2021 22:42:30 - INFO - __main__ - Step 10526: {'lr': 0.0004959178435662064, 'samples': 2020992, 'steps': 10525, 'loss/train': 3.172778606414795} 11/06/2021 22:42:30 - INFO - __main__ - Step 10527: {'lr': 0.0004959168884349909, 'samples': 2021184, 'steps': 10526, 'loss/train': 2.005765438079834} 11/06/2021 22:42:32 - INFO - __main__ - Step 10528: {'lr': 0.0004959159331929691, 'samples': 2021376, 'steps': 10527, 'loss/train': 1.9616241455078125} 11/06/2021 22:42:32 - INFO - __main__ - Step 10529: {'lr': 0.0004959149778401412, 'samples': 2021568, 'steps': 10528, 'loss/train': 0.44385576248168945} 11/06/2021 22:42:32 - INFO - __main__ - Step 10530: {'lr': 0.000495914022376508, 'samples': 2021760, 'steps': 10529, 'loss/train': 1.7894715070724487} 11/06/2021 22:42:33 - INFO - __main__ - Step 10531: {'lr': 0.0004959130668020696, 'samples': 2021952, 'steps': 10530, 'loss/train': 1.43883216381073} 11/06/2021 22:42:33 - INFO - __main__ - Step 10532: {'lr': 0.0004959121111168266, 'samples': 2022144, 'steps': 10531, 'loss/train': 1.801062822341919} 11/06/2021 22:42:34 - INFO - __main__ - Step 10533: {'lr': 0.0004959111553207794, 'samples': 2022336, 'steps': 10532, 'loss/train': 1.905466914176941} 11/06/2021 22:42:34 - INFO - __main__ - Step 10534: {'lr': 0.0004959101994139284, 'samples': 2022528, 'steps': 10533, 'loss/train': 1.9462462663650513} 11/06/2021 22:42:35 - INFO - __main__ - Step 10535: {'lr': 0.0004959092433962742, 'samples': 2022720, 'steps': 10534, 'loss/train': 1.9173693656921387} 11/06/2021 22:42:35 - INFO - __main__ - Step 10536: {'lr': 0.0004959082872678169, 'samples': 2022912, 'steps': 10535, 'loss/train': 0.9070050716400146} 11/06/2021 22:42:35 - INFO - __main__ - Step 10537: {'lr': 0.0004959073310285572, 'samples': 2023104, 'steps': 10536, 'loss/train': 1.65950345993042} 11/06/2021 22:42:37 - INFO - __main__ - Step 10538: {'lr': 0.0004959063746784955, 'samples': 2023296, 'steps': 10537, 'loss/train': 1.9055320024490356} 11/06/2021 22:42:37 - INFO - __main__ - Step 10539: {'lr': 0.0004959054182176321, 'samples': 2023488, 'steps': 10538, 'loss/train': 1.639905333518982} 11/06/2021 22:42:38 - INFO - __main__ - Step 10540: {'lr': 0.0004959044616459676, 'samples': 2023680, 'steps': 10539, 'loss/train': 1.4272217750549316} 11/06/2021 22:42:38 - INFO - __main__ - Step 10541: {'lr': 0.0004959035049635023, 'samples': 2023872, 'steps': 10540, 'loss/train': 0.9154389500617981} 11/06/2021 22:42:38 - INFO - __main__ - Step 10542: {'lr': 0.0004959025481702366, 'samples': 2024064, 'steps': 10541, 'loss/train': 1.519034504890442} 11/06/2021 22:42:39 - INFO - __main__ - Step 10543: {'lr': 0.0004959015912661712, 'samples': 2024256, 'steps': 10542, 'loss/train': 1.571833848953247} 11/06/2021 22:42:40 - INFO - __main__ - Step 10544: {'lr': 0.0004959006342513062, 'samples': 2024448, 'steps': 10543, 'loss/train': 0.902131974697113} 11/06/2021 22:42:40 - INFO - __main__ - Step 10545: {'lr': 0.0004958996771256422, 'samples': 2024640, 'steps': 10544, 'loss/train': 1.765254259109497} 11/06/2021 22:42:40 - INFO - __main__ - Step 10546: {'lr': 0.0004958987198891796, 'samples': 2024832, 'steps': 10545, 'loss/train': 1.8613026142120361} 11/06/2021 22:42:41 - INFO - __main__ - Step 10547: {'lr': 0.0004958977625419187, 'samples': 2025024, 'steps': 10546, 'loss/train': 1.9227455854415894} 11/06/2021 22:42:41 - INFO - __main__ - Step 10548: {'lr': 0.0004958968050838603, 'samples': 2025216, 'steps': 10547, 'loss/train': 1.920836091041565} 11/06/2021 22:42:42 - INFO - __main__ - Step 10549: {'lr': 0.0004958958475150044, 'samples': 2025408, 'steps': 10548, 'loss/train': 1.9864680767059326} 11/06/2021 22:42:43 - INFO - __main__ - Step 10550: {'lr': 0.0004958948898353516, 'samples': 2025600, 'steps': 10549, 'loss/train': 1.8649414777755737} 11/06/2021 22:42:43 - INFO - __main__ - Step 10551: {'lr': 0.0004958939320449026, 'samples': 2025792, 'steps': 10550, 'loss/train': 2.0411078929901123} 11/06/2021 22:42:43 - INFO - __main__ - Step 10552: {'lr': 0.0004958929741436574, 'samples': 2025984, 'steps': 10551, 'loss/train': 1.3231031894683838} 11/06/2021 22:42:44 - INFO - __main__ - Step 10553: {'lr': 0.0004958920161316167, 'samples': 2026176, 'steps': 10552, 'loss/train': 1.855413556098938} 11/06/2021 22:42:45 - INFO - __main__ - Step 10554: {'lr': 0.0004958910580087808, 'samples': 2026368, 'steps': 10553, 'loss/train': 2.308320999145508} 11/06/2021 22:42:45 - INFO - __main__ - Step 10555: {'lr': 0.0004958900997751502, 'samples': 2026560, 'steps': 10554, 'loss/train': 1.9437730312347412} 11/06/2021 22:42:45 - INFO - __main__ - Step 10556: {'lr': 0.0004958891414307253, 'samples': 2026752, 'steps': 10555, 'loss/train': 2.246142864227295} 11/06/2021 22:42:46 - INFO - __main__ - Step 10557: {'lr': 0.0004958881829755066, 'samples': 2026944, 'steps': 10556, 'loss/train': 2.26411771774292} 11/06/2021 22:42:46 - INFO - __main__ - Step 10558: {'lr': 0.0004958872244094944, 'samples': 2027136, 'steps': 10557, 'loss/train': 1.6010162830352783} 11/06/2021 22:42:46 - INFO - __main__ - Step 10559: {'lr': 0.0004958862657326893, 'samples': 2027328, 'steps': 10558, 'loss/train': 1.5600662231445312} 11/06/2021 22:42:48 - INFO - __main__ - Step 10560: {'lr': 0.0004958853069450916, 'samples': 2027520, 'steps': 10559, 'loss/train': 1.8983862400054932} 11/06/2021 22:42:48 - INFO - __main__ - Step 10561: {'lr': 0.0004958843480467017, 'samples': 2027712, 'steps': 10560, 'loss/train': 0.7014676332473755} 11/06/2021 22:42:48 - INFO - __main__ - Step 10562: {'lr': 0.0004958833890375202, 'samples': 2027904, 'steps': 10561, 'loss/train': 1.9279460906982422} 11/06/2021 22:42:49 - INFO - __main__ - Step 10563: {'lr': 0.0004958824299175474, 'samples': 2028096, 'steps': 10562, 'loss/train': 1.799846887588501} 11/06/2021 22:42:49 - INFO - __main__ - Step 10564: {'lr': 0.0004958814706867838, 'samples': 2028288, 'steps': 10563, 'loss/train': 1.8568742275238037} 11/06/2021 22:42:50 - INFO - __main__ - Step 10565: {'lr': 0.0004958805113452298, 'samples': 2028480, 'steps': 10564, 'loss/train': 1.6105812788009644} 11/06/2021 22:42:50 - INFO - __main__ - Step 10566: {'lr': 0.0004958795518928858, 'samples': 2028672, 'steps': 10565, 'loss/train': 1.1961236000061035} 11/06/2021 22:42:51 - INFO - __main__ - Step 10567: {'lr': 0.0004958785923297522, 'samples': 2028864, 'steps': 10566, 'loss/train': 1.8169901371002197} 11/06/2021 22:42:51 - INFO - __main__ - Step 10568: {'lr': 0.0004958776326558298, 'samples': 2029056, 'steps': 10567, 'loss/train': 1.83092200756073} 11/06/2021 22:42:51 - INFO - __main__ - Step 10569: {'lr': 0.0004958766728711184, 'samples': 2029248, 'steps': 10568, 'loss/train': 1.380372166633606} 11/06/2021 22:42:52 - INFO - __main__ - Step 10570: {'lr': 0.000495875712975619, 'samples': 2029440, 'steps': 10569, 'loss/train': 1.897265076637268} 11/06/2021 22:42:53 - INFO - __main__ - Step 10571: {'lr': 0.0004958747529693316, 'samples': 2029632, 'steps': 10570, 'loss/train': 1.3889024257659912} 11/06/2021 22:42:53 - INFO - __main__ - Step 10572: {'lr': 0.000495873792852257, 'samples': 2029824, 'steps': 10571, 'loss/train': 1.0971200466156006} 11/06/2021 22:42:53 - INFO - __main__ - Step 10573: {'lr': 0.0004958728326243954, 'samples': 2030016, 'steps': 10572, 'loss/train': 1.5941991806030273} 11/06/2021 22:42:54 - INFO - __main__ - Step 10574: {'lr': 0.0004958718722857473, 'samples': 2030208, 'steps': 10573, 'loss/train': 1.6007957458496094} 11/06/2021 22:42:55 - INFO - __main__ - Step 10575: {'lr': 0.0004958709118363131, 'samples': 2030400, 'steps': 10574, 'loss/train': 1.9947025775909424} 11/06/2021 22:42:55 - INFO - __main__ - Step 10576: {'lr': 0.0004958699512760933, 'samples': 2030592, 'steps': 10575, 'loss/train': 1.9763280153274536} 11/06/2021 22:42:55 - INFO - __main__ - Step 10577: {'lr': 0.0004958689906050882, 'samples': 2030784, 'steps': 10576, 'loss/train': 1.577843189239502} 11/06/2021 22:42:56 - INFO - __main__ - Step 10578: {'lr': 0.0004958680298232983, 'samples': 2030976, 'steps': 10577, 'loss/train': 1.4245156049728394} 11/06/2021 22:42:56 - INFO - __main__ - Step 10579: {'lr': 0.0004958670689307242, 'samples': 2031168, 'steps': 10578, 'loss/train': 2.1011838912963867} 11/06/2021 22:42:57 - INFO - __main__ - Step 10580: {'lr': 0.0004958661079273662, 'samples': 2031360, 'steps': 10579, 'loss/train': 1.68887197971344} 11/06/2021 22:42:58 - INFO - __main__ - Step 10581: {'lr': 0.0004958651468132246, 'samples': 2031552, 'steps': 10580, 'loss/train': 1.6149951219558716} 11/06/2021 22:42:58 - INFO - __main__ - Step 10582: {'lr': 0.0004958641855883001, 'samples': 2031744, 'steps': 10581, 'loss/train': 1.577684760093689} 11/06/2021 22:42:58 - INFO - __main__ - Step 10583: {'lr': 0.0004958632242525929, 'samples': 2031936, 'steps': 10582, 'loss/train': 1.9628639221191406} 11/06/2021 22:42:59 - INFO - __main__ - Step 10584: {'lr': 0.0004958622628061035, 'samples': 2032128, 'steps': 10583, 'loss/train': 1.6041213274002075} 11/06/2021 22:42:59 - INFO - __main__ - Step 10585: {'lr': 0.0004958613012488324, 'samples': 2032320, 'steps': 10584, 'loss/train': 0.9554458856582642} 11/06/2021 22:43:00 - INFO - __main__ - Step 10586: {'lr': 0.00049586033958078, 'samples': 2032512, 'steps': 10585, 'loss/train': 2.0318799018859863} 11/06/2021 22:43:00 - INFO - __main__ - Step 10587: {'lr': 0.0004958593778019468, 'samples': 2032704, 'steps': 10586, 'loss/train': 1.9779229164123535} 11/06/2021 22:43:01 - INFO - __main__ - Step 10588: {'lr': 0.0004958584159123331, 'samples': 2032896, 'steps': 10587, 'loss/train': 2.021127939224243} 11/06/2021 22:43:01 - INFO - __main__ - Step 10589: {'lr': 0.0004958574539119392, 'samples': 2033088, 'steps': 10588, 'loss/train': 1.864471435546875} 11/06/2021 22:43:01 - INFO - __main__ - Step 10590: {'lr': 0.0004958564918007659, 'samples': 2033280, 'steps': 10589, 'loss/train': 0.6293484568595886} 11/06/2021 22:43:03 - INFO - __main__ - Step 10591: {'lr': 0.0004958555295788135, 'samples': 2033472, 'steps': 10590, 'loss/train': 1.892540454864502} 11/06/2021 22:43:03 - INFO - __main__ - Step 10592: {'lr': 0.0004958545672460824, 'samples': 2033664, 'steps': 10591, 'loss/train': 2.0386414527893066} 11/06/2021 22:43:03 - INFO - __main__ - Step 10593: {'lr': 0.0004958536048025729, 'samples': 2033856, 'steps': 10592, 'loss/train': 2.018533706665039} 11/06/2021 22:43:04 - INFO - __main__ - Step 10594: {'lr': 0.0004958526422482857, 'samples': 2034048, 'steps': 10593, 'loss/train': 1.8704800605773926} 11/06/2021 22:43:04 - INFO - __main__ - Step 10595: {'lr': 0.000495851679583221, 'samples': 2034240, 'steps': 10594, 'loss/train': 1.2096422910690308} 11/06/2021 22:43:05 - INFO - __main__ - Step 10596: {'lr': 0.0004958507168073793, 'samples': 2034432, 'steps': 10595, 'loss/train': 1.69189453125} 11/06/2021 22:43:05 - INFO - __main__ - Step 10597: {'lr': 0.0004958497539207611, 'samples': 2034624, 'steps': 10596, 'loss/train': 1.6835681200027466} 11/06/2021 22:43:06 - INFO - __main__ - Step 10598: {'lr': 0.0004958487909233669, 'samples': 2034816, 'steps': 10597, 'loss/train': 1.550663948059082} 11/06/2021 22:43:06 - INFO - __main__ - Step 10599: {'lr': 0.0004958478278151969, 'samples': 2035008, 'steps': 10598, 'loss/train': 1.3910499811172485} 11/06/2021 22:43:06 - INFO - __main__ - Step 10600: {'lr': 0.0004958468645962517, 'samples': 2035200, 'steps': 10599, 'loss/train': 0.5436376333236694} 11/06/2021 22:43:08 - INFO - __main__ - Step 10601: {'lr': 0.0004958459012665317, 'samples': 2035392, 'steps': 10600, 'loss/train': 1.8061374425888062} 11/06/2021 22:43:08 - INFO - __main__ - Step 10602: {'lr': 0.0004958449378260374, 'samples': 2035584, 'steps': 10601, 'loss/train': 1.4709066152572632} 11/06/2021 22:43:08 - INFO - __main__ - Step 10603: {'lr': 0.000495843974274769, 'samples': 2035776, 'steps': 10602, 'loss/train': 1.4589587450027466} 11/06/2021 22:43:09 - INFO - __main__ - Step 10604: {'lr': 0.0004958430106127272, 'samples': 2035968, 'steps': 10603, 'loss/train': 2.1720454692840576} 11/06/2021 22:43:09 - INFO - __main__ - Step 10605: {'lr': 0.0004958420468399123, 'samples': 2036160, 'steps': 10604, 'loss/train': 1.612654209136963} 11/06/2021 22:43:10 - INFO - __main__ - Step 10606: {'lr': 0.0004958410829563248, 'samples': 2036352, 'steps': 10605, 'loss/train': 1.6653211116790771} 11/06/2021 22:43:10 - INFO - __main__ - Step 10607: {'lr': 0.0004958401189619652, 'samples': 2036544, 'steps': 10606, 'loss/train': 1.9401401281356812} 11/06/2021 22:43:11 - INFO - __main__ - Step 10608: {'lr': 0.0004958391548568336, 'samples': 2036736, 'steps': 10607, 'loss/train': 2.150745391845703} 11/06/2021 22:43:11 - INFO - __main__ - Step 10609: {'lr': 0.0004958381906409308, 'samples': 2036928, 'steps': 10608, 'loss/train': 1.7484651803970337} 11/06/2021 22:43:11 - INFO - __main__ - Step 10610: {'lr': 0.0004958372263142571, 'samples': 2037120, 'steps': 10609, 'loss/train': 1.8919156789779663} 11/06/2021 22:43:12 - INFO - __main__ - Step 10611: {'lr': 0.0004958362618768129, 'samples': 2037312, 'steps': 10610, 'loss/train': 1.3154926300048828} 11/06/2021 22:43:14 - INFO - __main__ - Step 10612: {'lr': 0.0004958352973285987, 'samples': 2037504, 'steps': 10611, 'loss/train': 1.8000268936157227} 11/06/2021 22:43:14 - INFO - __main__ - Step 10613: {'lr': 0.000495834332669615, 'samples': 2037696, 'steps': 10612, 'loss/train': 0.241807758808136} 11/06/2021 22:43:15 - INFO - __main__ - Step 10614: {'lr': 0.0004958333678998622, 'samples': 2037888, 'steps': 10613, 'loss/train': 1.521072268486023} 11/06/2021 22:43:15 - INFO - __main__ - Step 10615: {'lr': 0.0004958324030193404, 'samples': 2038080, 'steps': 10614, 'loss/train': 1.6696323156356812} 11/06/2021 22:43:15 - INFO - __main__ - Step 10616: {'lr': 0.0004958314380280504, 'samples': 2038272, 'steps': 10615, 'loss/train': 1.4865412712097168} 11/06/2021 22:43:16 - INFO - __main__ - Step 10617: {'lr': 0.0004958304729259927, 'samples': 2038464, 'steps': 10616, 'loss/train': 1.9140490293502808} 11/06/2021 22:43:16 - INFO - __main__ - Step 10618: {'lr': 0.0004958295077131674, 'samples': 2038656, 'steps': 10617, 'loss/train': 1.3753329515457153} 11/06/2021 22:43:17 - INFO - __main__ - Step 10619: {'lr': 0.0004958285423895752, 'samples': 2038848, 'steps': 10618, 'loss/train': 1.8530157804489136} 11/06/2021 22:43:18 - INFO - __main__ - Step 10620: {'lr': 0.0004958275769552165, 'samples': 2039040, 'steps': 10619, 'loss/train': 1.2794376611709595} 11/06/2021 22:43:18 - INFO - __main__ - Step 10621: {'lr': 0.0004958266114100917, 'samples': 2039232, 'steps': 10620, 'loss/train': 1.9506590366363525} 11/06/2021 22:43:18 - INFO - __main__ - Step 10622: {'lr': 0.0004958256457542011, 'samples': 2039424, 'steps': 10621, 'loss/train': 1.7138190269470215} 11/06/2021 22:43:19 - INFO - __main__ - Step 10623: {'lr': 0.0004958246799875453, 'samples': 2039616, 'steps': 10622, 'loss/train': 1.564102292060852} 11/06/2021 22:43:19 - INFO - __main__ - Step 10624: {'lr': 0.0004958237141101247, 'samples': 2039808, 'steps': 10623, 'loss/train': 0.428017258644104} 11/06/2021 22:43:19 - INFO - __main__ - Step 10625: {'lr': 0.0004958227481219399, 'samples': 2040000, 'steps': 10624, 'loss/train': 1.5035067796707153} 11/06/2021 22:43:21 - INFO - __main__ - Step 10626: {'lr': 0.0004958217820229909, 'samples': 2040192, 'steps': 10625, 'loss/train': 1.7794239521026611} 11/06/2021 22:43:21 - INFO - __main__ - Step 10627: {'lr': 0.0004958208158132785, 'samples': 2040384, 'steps': 10626, 'loss/train': 1.4251916408538818} 11/06/2021 22:43:21 - INFO - __main__ - Step 10628: {'lr': 0.000495819849492803, 'samples': 2040576, 'steps': 10627, 'loss/train': 1.7260355949401855} 11/06/2021 22:43:22 - INFO - __main__ - Step 10629: {'lr': 0.0004958188830615649, 'samples': 2040768, 'steps': 10628, 'loss/train': 2.004051446914673} 11/06/2021 22:43:22 - INFO - __main__ - Step 10630: {'lr': 0.0004958179165195646, 'samples': 2040960, 'steps': 10629, 'loss/train': 1.7460932731628418} 11/06/2021 22:43:23 - INFO - __main__ - Step 10631: {'lr': 0.0004958169498668026, 'samples': 2041152, 'steps': 10630, 'loss/train': 1.8938854932785034} 11/06/2021 22:43:23 - INFO - __main__ - Step 10632: {'lr': 0.0004958159831032793, 'samples': 2041344, 'steps': 10631, 'loss/train': 1.446541428565979} 11/06/2021 22:43:24 - INFO - __main__ - Step 10633: {'lr': 0.000495815016228995, 'samples': 2041536, 'steps': 10632, 'loss/train': 1.7085440158843994} 11/06/2021 22:43:24 - INFO - __main__ - Step 10634: {'lr': 0.0004958140492439502, 'samples': 2041728, 'steps': 10633, 'loss/train': 1.8917224407196045} 11/06/2021 22:43:24 - INFO - __main__ - Step 10635: {'lr': 0.0004958130821481455, 'samples': 2041920, 'steps': 10634, 'loss/train': 1.9071261882781982} 11/06/2021 22:43:25 - INFO - __main__ - Step 10636: {'lr': 0.0004958121149415812, 'samples': 2042112, 'steps': 10635, 'loss/train': 1.352231502532959} 11/06/2021 22:43:26 - INFO - __main__ - Step 10637: {'lr': 0.0004958111476242577, 'samples': 2042304, 'steps': 10636, 'loss/train': 1.8399983644485474} 11/06/2021 22:43:26 - INFO - __main__ - Step 10638: {'lr': 0.0004958101801961755, 'samples': 2042496, 'steps': 10637, 'loss/train': 2.4199862480163574} 11/06/2021 22:43:26 - INFO - __main__ - Step 10639: {'lr': 0.0004958092126573352, 'samples': 2042688, 'steps': 10638, 'loss/train': 1.832000970840454} 11/06/2021 22:43:27 - INFO - __main__ - Step 10640: {'lr': 0.0004958082450077369, 'samples': 2042880, 'steps': 10639, 'loss/train': 2.0270063877105713} 11/06/2021 22:43:28 - INFO - __main__ - Step 10641: {'lr': 0.0004958072772473812, 'samples': 2043072, 'steps': 10640, 'loss/train': 1.655840277671814} 11/06/2021 22:43:28 - INFO - __main__ - Step 10642: {'lr': 0.0004958063093762684, 'samples': 2043264, 'steps': 10641, 'loss/train': 2.09566330909729} 11/06/2021 22:43:29 - INFO - __main__ - Step 10643: {'lr': 0.0004958053413943993, 'samples': 2043456, 'steps': 10642, 'loss/train': 1.8067339658737183} 11/06/2021 22:43:29 - INFO - __main__ - Step 10644: {'lr': 0.0004958043733017741, 'samples': 2043648, 'steps': 10643, 'loss/train': 1.5341635942459106} 11/06/2021 22:43:29 - INFO - __main__ - Step 10645: {'lr': 0.0004958034050983932, 'samples': 2043840, 'steps': 10644, 'loss/train': 1.7686036825180054} 11/06/2021 22:43:30 - INFO - __main__ - Step 10646: {'lr': 0.0004958024367842569, 'samples': 2044032, 'steps': 10645, 'loss/train': 1.8402926921844482} 11/06/2021 22:43:31 - INFO - __main__ - Step 10647: {'lr': 0.000495801468359366, 'samples': 2044224, 'steps': 10646, 'loss/train': 1.6536637544631958} 11/06/2021 22:43:31 - INFO - __main__ - Step 10648: {'lr': 0.0004958004998237207, 'samples': 2044416, 'steps': 10647, 'loss/train': 2.0047109127044678} 11/06/2021 22:43:31 - INFO - __main__ - Step 10649: {'lr': 0.0004957995311773215, 'samples': 2044608, 'steps': 10648, 'loss/train': 1.8218034505844116} 11/06/2021 22:43:32 - INFO - __main__ - Step 10650: {'lr': 0.0004957985624201688, 'samples': 2044800, 'steps': 10649, 'loss/train': 1.3269506692886353} 11/06/2021 22:43:32 - INFO - __main__ - Step 10651: {'lr': 0.0004957975935522632, 'samples': 2044992, 'steps': 10650, 'loss/train': 1.8232018947601318} 11/06/2021 22:43:33 - INFO - __main__ - Step 10652: {'lr': 0.0004957966245736048, 'samples': 2045184, 'steps': 10651, 'loss/train': 1.26266348361969} 11/06/2021 22:43:34 - INFO - __main__ - Step 10653: {'lr': 0.0004957956554841943, 'samples': 2045376, 'steps': 10652, 'loss/train': 1.7997419834136963} 11/06/2021 22:43:34 - INFO - __main__ - Step 10654: {'lr': 0.0004957946862840321, 'samples': 2045568, 'steps': 10653, 'loss/train': 1.7390581369400024} 11/06/2021 22:43:34 - INFO - __main__ - Step 10655: {'lr': 0.0004957937169731186, 'samples': 2045760, 'steps': 10654, 'loss/train': 1.672598958015442} 11/06/2021 22:43:35 - INFO - __main__ - Step 10656: {'lr': 0.0004957927475514542, 'samples': 2045952, 'steps': 10655, 'loss/train': 1.358737826347351} 11/06/2021 22:43:36 - INFO - __main__ - Step 10657: {'lr': 0.0004957917780190395, 'samples': 2046144, 'steps': 10656, 'loss/train': 1.9060399532318115} 11/06/2021 22:43:36 - INFO - __main__ - Step 10658: {'lr': 0.0004957908083758747, 'samples': 2046336, 'steps': 10657, 'loss/train': 1.6999403238296509} 11/06/2021 22:43:36 - INFO - __main__ - Step 10659: {'lr': 0.0004957898386219603, 'samples': 2046528, 'steps': 10658, 'loss/train': 1.6076505184173584} 11/06/2021 22:43:37 - INFO - __main__ - Step 10660: {'lr': 0.000495788868757297, 'samples': 2046720, 'steps': 10659, 'loss/train': 1.7159380912780762} 11/06/2021 22:43:37 - INFO - __main__ - Step 10661: {'lr': 0.0004957878987818849, 'samples': 2046912, 'steps': 10660, 'loss/train': 1.4848712682724} 11/06/2021 22:43:39 - INFO - __main__ - Step 10662: {'lr': 0.0004957869286957246, 'samples': 2047104, 'steps': 10661, 'loss/train': 1.9166350364685059} 11/06/2021 22:43:39 - INFO - __main__ - Step 10663: {'lr': 0.0004957859584988164, 'samples': 2047296, 'steps': 10662, 'loss/train': 1.3673206567764282} 11/06/2021 22:43:39 - INFO - __main__ - Step 10664: {'lr': 0.0004957849881911609, 'samples': 2047488, 'steps': 10663, 'loss/train': 1.7530118227005005} 11/06/2021 22:43:40 - INFO - __main__ - Step 10665: {'lr': 0.0004957840177727585, 'samples': 2047680, 'steps': 10664, 'loss/train': 1.5396183729171753} 11/06/2021 22:43:40 - INFO - __main__ - Step 10666: {'lr': 0.0004957830472436097, 'samples': 2047872, 'steps': 10665, 'loss/train': 2.27144718170166} 11/06/2021 22:43:40 - INFO - __main__ - Step 10667: {'lr': 0.0004957820766037147, 'samples': 2048064, 'steps': 10666, 'loss/train': 2.53610897064209} 11/06/2021 22:43:41 - INFO - __main__ - Step 10668: {'lr': 0.0004957811058530742, 'samples': 2048256, 'steps': 10667, 'loss/train': 2.2089219093322754} 11/06/2021 22:43:42 - INFO - __main__ - Step 10669: {'lr': 0.0004957801349916884, 'samples': 2048448, 'steps': 10668, 'loss/train': 2.4702768325805664} 11/06/2021 22:43:42 - INFO - __main__ - Step 10670: {'lr': 0.000495779164019558, 'samples': 2048640, 'steps': 10669, 'loss/train': 0.2976198196411133} 11/06/2021 22:43:42 - INFO - __main__ - Step 10671: {'lr': 0.0004957781929366832, 'samples': 2048832, 'steps': 10670, 'loss/train': 2.503723382949829} 11/06/2021 22:43:43 - INFO - __main__ - Step 10672: {'lr': 0.0004957772217430646, 'samples': 2049024, 'steps': 10671, 'loss/train': 1.338193416595459} 11/06/2021 22:43:44 - INFO - __main__ - Step 10673: {'lr': 0.0004957762504387025, 'samples': 2049216, 'steps': 10672, 'loss/train': 1.5494346618652344} 11/06/2021 22:43:45 - INFO - __main__ - Step 10674: {'lr': 0.0004957752790235976, 'samples': 2049408, 'steps': 10673, 'loss/train': 2.082453489303589} 11/06/2021 22:43:45 - INFO - __main__ - Step 10675: {'lr': 0.00049577430749775, 'samples': 2049600, 'steps': 10674, 'loss/train': 1.9406588077545166} 11/06/2021 22:43:45 - INFO - __main__ - Step 10676: {'lr': 0.0004957733358611602, 'samples': 2049792, 'steps': 10675, 'loss/train': 1.727967381477356} 11/06/2021 22:43:46 - INFO - __main__ - Step 10677: {'lr': 0.0004957723641138289, 'samples': 2049984, 'steps': 10676, 'loss/train': 1.9043892621994019} 11/06/2021 22:43:47 - INFO - __main__ - Step 10678: {'lr': 0.0004957713922557563, 'samples': 2050176, 'steps': 10677, 'loss/train': 1.6342339515686035} 11/06/2021 22:43:47 - INFO - __main__ - Step 10679: {'lr': 0.0004957704202869429, 'samples': 2050368, 'steps': 10678, 'loss/train': 2.188232898712158} 11/06/2021 22:43:47 - INFO - __main__ - Step 10680: {'lr': 0.0004957694482073891, 'samples': 2050560, 'steps': 10679, 'loss/train': 2.068922758102417} 11/06/2021 22:43:48 - INFO - __main__ - Step 10681: {'lr': 0.0004957684760170955, 'samples': 2050752, 'steps': 10680, 'loss/train': 1.8861031532287598} 11/06/2021 22:43:48 - INFO - __main__ - Step 10682: {'lr': 0.0004957675037160624, 'samples': 2050944, 'steps': 10681, 'loss/train': 1.979127049446106} 11/06/2021 22:43:49 - INFO - __main__ - Step 10683: {'lr': 0.0004957665313042902, 'samples': 2051136, 'steps': 10682, 'loss/train': 1.728943109512329} 11/06/2021 22:43:49 - INFO - __main__ - Step 10684: {'lr': 0.0004957655587817793, 'samples': 2051328, 'steps': 10683, 'loss/train': 1.295758605003357} 11/06/2021 22:43:50 - INFO - __main__ - Step 10685: {'lr': 0.0004957645861485304, 'samples': 2051520, 'steps': 10684, 'loss/train': 1.4567430019378662} 11/06/2021 22:43:50 - INFO - __main__ - Step 10686: {'lr': 0.0004957636134045437, 'samples': 2051712, 'steps': 10685, 'loss/train': 1.5401474237442017} 11/06/2021 22:43:50 - INFO - __main__ - Step 10687: {'lr': 0.0004957626405498196, 'samples': 2051904, 'steps': 10686, 'loss/train': 2.0036814212799072} 11/06/2021 22:43:51 - INFO - __main__ - Step 10688: {'lr': 0.0004957616675843588, 'samples': 2052096, 'steps': 10687, 'loss/train': 1.8221772909164429} 11/06/2021 22:43:52 - INFO - __main__ - Step 10689: {'lr': 0.0004957606945081615, 'samples': 2052288, 'steps': 10688, 'loss/train': 1.658942461013794} 11/06/2021 22:43:52 - INFO - __main__ - Step 10690: {'lr': 0.0004957597213212284, 'samples': 2052480, 'steps': 10689, 'loss/train': 1.7910033464431763} 11/06/2021 22:43:52 - INFO - __main__ - Step 10691: {'lr': 0.0004957587480235595, 'samples': 2052672, 'steps': 10690, 'loss/train': 1.7230714559555054} 11/06/2021 22:43:53 - INFO - __main__ - Step 10692: {'lr': 0.0004957577746151556, 'samples': 2052864, 'steps': 10691, 'loss/train': 1.9783008098602295} 11/06/2021 22:43:54 - INFO - __main__ - Step 10693: {'lr': 0.0004957568010960171, 'samples': 2053056, 'steps': 10692, 'loss/train': 1.6153764724731445} 11/06/2021 22:43:54 - INFO - __main__ - Step 10694: {'lr': 0.0004957558274661444, 'samples': 2053248, 'steps': 10693, 'loss/train': 1.5456531047821045} 11/06/2021 22:43:55 - INFO - __main__ - Step 10695: {'lr': 0.0004957548537255378, 'samples': 2053440, 'steps': 10694, 'loss/train': 2.194444417953491} 11/06/2021 22:43:55 - INFO - __main__ - Step 10696: {'lr': 0.000495753879874198, 'samples': 2053632, 'steps': 10695, 'loss/train': 1.8297362327575684} 11/06/2021 22:43:55 - INFO - __main__ - Step 10697: {'lr': 0.0004957529059121251, 'samples': 2053824, 'steps': 10696, 'loss/train': 2.081062078475952} 11/06/2021 22:43:56 - INFO - __main__ - Step 10698: {'lr': 0.0004957519318393199, 'samples': 2054016, 'steps': 10697, 'loss/train': 1.8401142358779907} 11/06/2021 22:43:57 - INFO - __main__ - Step 10699: {'lr': 0.0004957509576557826, 'samples': 2054208, 'steps': 10698, 'loss/train': 1.7377288341522217} 11/06/2021 22:43:57 - INFO - __main__ - Step 10700: {'lr': 0.0004957499833615137, 'samples': 2054400, 'steps': 10699, 'loss/train': 2.147418260574341} 11/06/2021 22:43:57 - INFO - __main__ - Step 10701: {'lr': 0.0004957490089565137, 'samples': 2054592, 'steps': 10700, 'loss/train': 1.7337613105773926} 11/06/2021 22:43:58 - INFO - __main__ - Step 10702: {'lr': 0.0004957480344407829, 'samples': 2054784, 'steps': 10701, 'loss/train': 1.7109180688858032} 11/06/2021 22:43:58 - INFO - __main__ - Step 10703: {'lr': 0.0004957470598143218, 'samples': 2054976, 'steps': 10702, 'loss/train': 1.4152930974960327} 11/06/2021 22:43:59 - INFO - __main__ - Step 10704: {'lr': 0.000495746085077131, 'samples': 2055168, 'steps': 10703, 'loss/train': 1.6608555316925049} 11/06/2021 22:44:00 - INFO - __main__ - Step 10705: {'lr': 0.0004957451102292108, 'samples': 2055360, 'steps': 10704, 'loss/train': 1.8776973485946655} 11/06/2021 22:44:00 - INFO - __main__ - Step 10706: {'lr': 0.0004957441352705616, 'samples': 2055552, 'steps': 10705, 'loss/train': 1.9885625839233398} 11/06/2021 22:44:00 - INFO - __main__ - Step 10707: {'lr': 0.0004957431602011839, 'samples': 2055744, 'steps': 10706, 'loss/train': 1.5255722999572754} 11/06/2021 22:44:01 - INFO - __main__ - Step 10708: {'lr': 0.0004957421850210781, 'samples': 2055936, 'steps': 10707, 'loss/train': 1.827954649925232} 11/06/2021 22:44:02 - INFO - __main__ - Step 10709: {'lr': 0.0004957412097302446, 'samples': 2056128, 'steps': 10708, 'loss/train': 2.0723941326141357} 11/06/2021 22:44:02 - INFO - __main__ - Step 10710: {'lr': 0.000495740234328684, 'samples': 2056320, 'steps': 10709, 'loss/train': 2.509693145751953} 11/06/2021 22:44:02 - INFO - __main__ - Step 10711: {'lr': 0.0004957392588163967, 'samples': 2056512, 'steps': 10710, 'loss/train': 1.9670145511627197} 11/06/2021 22:44:03 - INFO - __main__ - Step 10712: {'lr': 0.000495738283193383, 'samples': 2056704, 'steps': 10711, 'loss/train': 1.6758942604064941} 11/06/2021 22:44:03 - INFO - __main__ - Step 10713: {'lr': 0.0004957373074596434, 'samples': 2056896, 'steps': 10712, 'loss/train': 1.8210440874099731} 11/06/2021 22:44:04 - INFO - __main__ - Step 10714: {'lr': 0.0004957363316151784, 'samples': 2057088, 'steps': 10713, 'loss/train': 1.4150325059890747} 11/06/2021 22:44:04 - INFO - __main__ - Step 10715: {'lr': 0.0004957353556599884, 'samples': 2057280, 'steps': 10714, 'loss/train': 1.9680533409118652} 11/06/2021 22:44:05 - INFO - __main__ - Step 10716: {'lr': 0.0004957343795940738, 'samples': 2057472, 'steps': 10715, 'loss/train': 2.222113847732544} 11/06/2021 22:44:05 - INFO - __main__ - Step 10717: {'lr': 0.0004957334034174351, 'samples': 2057664, 'steps': 10716, 'loss/train': 1.4754172563552856} 11/06/2021 22:44:06 - INFO - __main__ - Step 10718: {'lr': 0.0004957324271300728, 'samples': 2057856, 'steps': 10717, 'loss/train': 1.6506346464157104} 11/06/2021 22:44:07 - INFO - __main__ - Step 10719: {'lr': 0.0004957314507319871, 'samples': 2058048, 'steps': 10718, 'loss/train': 1.6497141122817993} 11/06/2021 22:44:07 - INFO - __main__ - Step 10720: {'lr': 0.0004957304742231787, 'samples': 2058240, 'steps': 10719, 'loss/train': 1.7145750522613525} 11/06/2021 22:44:07 - INFO - __main__ - Step 10721: {'lr': 0.0004957294976036479, 'samples': 2058432, 'steps': 10720, 'loss/train': 1.7986055612564087} 11/06/2021 22:44:08 - INFO - __main__ - Step 10722: {'lr': 0.0004957285208733953, 'samples': 2058624, 'steps': 10721, 'loss/train': 1.592078447341919} 11/06/2021 22:44:08 - INFO - __main__ - Step 10723: {'lr': 0.0004957275440324211, 'samples': 2058816, 'steps': 10722, 'loss/train': 1.7179454565048218} 11/06/2021 22:44:09 - INFO - __main__ - Step 10724: {'lr': 0.0004957265670807258, 'samples': 2059008, 'steps': 10723, 'loss/train': 1.8023722171783447} 11/06/2021 22:44:09 - INFO - __main__ - Step 10725: {'lr': 0.0004957255900183101, 'samples': 2059200, 'steps': 10724, 'loss/train': 2.032273292541504} 11/06/2021 22:44:10 - INFO - __main__ - Step 10726: {'lr': 0.000495724612845174, 'samples': 2059392, 'steps': 10725, 'loss/train': 1.7515367269515991} 11/06/2021 22:44:10 - INFO - __main__ - Step 10727: {'lr': 0.0004957236355613184, 'samples': 2059584, 'steps': 10726, 'loss/train': 1.7511495351791382} 11/06/2021 22:44:10 - INFO - __main__ - Step 10728: {'lr': 0.0004957226581667434, 'samples': 2059776, 'steps': 10727, 'loss/train': 2.232154369354248} 11/06/2021 22:44:11 - INFO - __main__ - Step 10729: {'lr': 0.0004957216806614496, 'samples': 2059968, 'steps': 10728, 'loss/train': 2.1359822750091553} 11/06/2021 22:44:12 - INFO - __main__ - Step 10730: {'lr': 0.0004957207030454374, 'samples': 2060160, 'steps': 10729, 'loss/train': 2.018582582473755} 11/06/2021 22:44:12 - INFO - __main__ - Step 10731: {'lr': 0.0004957197253187073, 'samples': 2060352, 'steps': 10730, 'loss/train': 1.6088690757751465} 11/06/2021 22:44:12 - INFO - __main__ - Step 10732: {'lr': 0.0004957187474812595, 'samples': 2060544, 'steps': 10731, 'loss/train': 1.1297942399978638} 11/06/2021 22:44:13 - INFO - __main__ - Step 10733: {'lr': 0.0004957177695330948, 'samples': 2060736, 'steps': 10732, 'loss/train': 1.5375380516052246} 11/06/2021 22:44:13 - INFO - __main__ - Step 10734: {'lr': 0.0004957167914742134, 'samples': 2060928, 'steps': 10733, 'loss/train': 1.797157883644104} 11/06/2021 22:44:14 - INFO - __main__ - Step 10735: {'lr': 0.0004957158133046158, 'samples': 2061120, 'steps': 10734, 'loss/train': 2.323326587677002} 11/06/2021 22:44:15 - INFO - __main__ - Step 10736: {'lr': 0.0004957148350243025, 'samples': 2061312, 'steps': 10735, 'loss/train': 2.2027082443237305} 11/06/2021 22:44:15 - INFO - __main__ - Step 10737: {'lr': 0.0004957138566332738, 'samples': 2061504, 'steps': 10736, 'loss/train': 1.9741092920303345} 11/06/2021 22:44:15 - INFO - __main__ - Step 10738: {'lr': 0.0004957128781315303, 'samples': 2061696, 'steps': 10737, 'loss/train': 1.5470525026321411} 11/06/2021 22:44:16 - INFO - __main__ - Step 10739: {'lr': 0.0004957118995190723, 'samples': 2061888, 'steps': 10738, 'loss/train': 1.9221845865249634} 11/06/2021 22:44:17 - INFO - __main__ - Step 10740: {'lr': 0.0004957109207959004, 'samples': 2062080, 'steps': 10739, 'loss/train': 1.5594979524612427} 11/06/2021 22:44:17 - INFO - __main__ - Step 10741: {'lr': 0.0004957099419620149, 'samples': 2062272, 'steps': 10740, 'loss/train': 1.0245342254638672} 11/06/2021 22:44:17 - INFO - __main__ - Step 10742: {'lr': 0.0004957089630174163, 'samples': 2062464, 'steps': 10741, 'loss/train': 1.913503885269165} 11/06/2021 22:44:18 - INFO - __main__ - Step 10743: {'lr': 0.0004957079839621051, 'samples': 2062656, 'steps': 10742, 'loss/train': 2.051661968231201} 11/06/2021 22:44:18 - INFO - __main__ - Step 10744: {'lr': 0.0004957070047960816, 'samples': 2062848, 'steps': 10743, 'loss/train': 2.0730690956115723} 11/06/2021 22:44:19 - INFO - __main__ - Step 10745: {'lr': 0.0004957060255193462, 'samples': 2063040, 'steps': 10744, 'loss/train': 2.2329344749450684} 11/06/2021 22:44:19 - INFO - __main__ - Step 10746: {'lr': 0.0004957050461318997, 'samples': 2063232, 'steps': 10745, 'loss/train': 1.7052743434906006} 11/06/2021 22:44:20 - INFO - __main__ - Step 10747: {'lr': 0.0004957040666337422, 'samples': 2063424, 'steps': 10746, 'loss/train': 1.7934210300445557} 11/06/2021 22:44:20 - INFO - __main__ - Step 10748: {'lr': 0.0004957030870248742, 'samples': 2063616, 'steps': 10747, 'loss/train': 1.2029649019241333} 11/06/2021 22:44:21 - INFO - __main__ - Step 10749: {'lr': 0.0004957021073052962, 'samples': 2063808, 'steps': 10748, 'loss/train': 1.534114122390747} 11/06/2021 22:44:21 - INFO - __main__ - Step 10750: {'lr': 0.0004957011274750086, 'samples': 2064000, 'steps': 10749, 'loss/train': 0.9407898783683777} 11/06/2021 22:44:22 - INFO - __main__ - Step 10751: {'lr': 0.0004957001475340119, 'samples': 2064192, 'steps': 10750, 'loss/train': 1.5393911600112915} 11/06/2021 22:44:22 - INFO - __main__ - Step 10752: {'lr': 0.0004956991674823065, 'samples': 2064384, 'steps': 10751, 'loss/train': 1.8228400945663452} 11/06/2021 22:44:23 - INFO - __main__ - Step 10753: {'lr': 0.0004956981873198928, 'samples': 2064576, 'steps': 10752, 'loss/train': 1.507121205329895} 11/06/2021 22:44:23 - INFO - __main__ - Step 10754: {'lr': 0.0004956972070467712, 'samples': 2064768, 'steps': 10753, 'loss/train': 1.64657461643219} 11/06/2021 22:44:23 - INFO - __main__ - Step 10755: {'lr': 0.0004956962266629424, 'samples': 2064960, 'steps': 10754, 'loss/train': 1.6212588548660278} 11/06/2021 22:44:24 - INFO - __main__ - Step 10756: {'lr': 0.0004956952461684066, 'samples': 2065152, 'steps': 10755, 'loss/train': 2.1202139854431152} 11/06/2021 22:44:25 - INFO - __main__ - Step 10757: {'lr': 0.0004956942655631644, 'samples': 2065344, 'steps': 10756, 'loss/train': 1.5507880449295044} 11/06/2021 22:44:25 - INFO - __main__ - Step 10758: {'lr': 0.0004956932848472161, 'samples': 2065536, 'steps': 10757, 'loss/train': 1.8729193210601807} 11/06/2021 22:44:25 - INFO - __main__ - Step 10759: {'lr': 0.0004956923040205622, 'samples': 2065728, 'steps': 10758, 'loss/train': 1.7279722690582275} 11/06/2021 22:44:26 - INFO - __main__ - Step 10760: {'lr': 0.0004956913230832031, 'samples': 2065920, 'steps': 10759, 'loss/train': 2.134770154953003} 11/06/2021 22:44:27 - INFO - __main__ - Step 10761: {'lr': 0.0004956903420351393, 'samples': 2066112, 'steps': 10760, 'loss/train': 1.6158196926116943} 11/06/2021 22:44:28 - INFO - __main__ - Step 10762: {'lr': 0.0004956893608763713, 'samples': 2066304, 'steps': 10761, 'loss/train': 1.8567177057266235} 11/06/2021 22:44:28 - INFO - __main__ - Step 10763: {'lr': 0.0004956883796068993, 'samples': 2066496, 'steps': 10762, 'loss/train': 1.9238213300704956} 11/06/2021 22:44:28 - INFO - __main__ - Step 10764: {'lr': 0.000495687398226724, 'samples': 2066688, 'steps': 10763, 'loss/train': 1.661841869354248} 11/06/2021 22:44:29 - INFO - __main__ - Step 10765: {'lr': 0.0004956864167358458, 'samples': 2066880, 'steps': 10764, 'loss/train': 1.6980998516082764} 11/06/2021 22:44:29 - INFO - __main__ - Step 10766: {'lr': 0.000495685435134265, 'samples': 2067072, 'steps': 10765, 'loss/train': 1.7710494995117188} 11/06/2021 22:44:30 - INFO - __main__ - Step 10767: {'lr': 0.0004956844534219822, 'samples': 2067264, 'steps': 10766, 'loss/train': 1.8926702737808228} 11/06/2021 22:44:31 - INFO - __main__ - Step 10768: {'lr': 0.0004956834715989977, 'samples': 2067456, 'steps': 10767, 'loss/train': 1.8057781457901} 11/06/2021 22:44:31 - INFO - __main__ - Step 10769: {'lr': 0.0004956824896653122, 'samples': 2067648, 'steps': 10768, 'loss/train': 1.9344799518585205} 11/06/2021 22:44:31 - INFO - __main__ - Step 10770: {'lr': 0.0004956815076209257, 'samples': 2067840, 'steps': 10769, 'loss/train': 1.7666970491409302} 11/06/2021 22:44:32 - INFO - __main__ - Step 10771: {'lr': 0.0004956805254658391, 'samples': 2068032, 'steps': 10770, 'loss/train': 1.9876887798309326} 11/06/2021 22:44:32 - INFO - __main__ - Step 10772: {'lr': 0.0004956795432000526, 'samples': 2068224, 'steps': 10771, 'loss/train': 1.9097800254821777} 11/06/2021 22:44:33 - INFO - __main__ - Step 10773: {'lr': 0.0004956785608235667, 'samples': 2068416, 'steps': 10772, 'loss/train': 1.9691766500473022} 11/06/2021 22:44:33 - INFO - __main__ - Step 10774: {'lr': 0.0004956775783363817, 'samples': 2068608, 'steps': 10773, 'loss/train': 1.7181099653244019} 11/06/2021 22:44:34 - INFO - __main__ - Step 10775: {'lr': 0.0004956765957384984, 'samples': 2068800, 'steps': 10774, 'loss/train': 1.4132951498031616} 11/06/2021 22:44:34 - INFO - __main__ - Step 10776: {'lr': 0.0004956756130299169, 'samples': 2068992, 'steps': 10775, 'loss/train': 1.20347261428833} 11/06/2021 22:44:34 - INFO - __main__ - Step 10777: {'lr': 0.0004956746302106378, 'samples': 2069184, 'steps': 10776, 'loss/train': 1.4337294101715088} 11/06/2021 22:44:35 - INFO - __main__ - Step 10778: {'lr': 0.0004956736472806614, 'samples': 2069376, 'steps': 10777, 'loss/train': 1.8119988441467285} 11/06/2021 22:44:36 - INFO - __main__ - Step 10779: {'lr': 0.0004956726642399883, 'samples': 2069568, 'steps': 10778, 'loss/train': 1.28266179561615} 11/06/2021 22:44:36 - INFO - __main__ - Step 10780: {'lr': 0.0004956716810886189, 'samples': 2069760, 'steps': 10779, 'loss/train': 1.9151232242584229} 11/06/2021 22:44:36 - INFO - __main__ - Step 10781: {'lr': 0.0004956706978265536, 'samples': 2069952, 'steps': 10780, 'loss/train': 1.7361626625061035} 11/06/2021 22:44:37 - INFO - __main__ - Step 10782: {'lr': 0.0004956697144537929, 'samples': 2070144, 'steps': 10781, 'loss/train': 1.9696669578552246} 11/06/2021 22:44:38 - INFO - __main__ - Step 10783: {'lr': 0.0004956687309703372, 'samples': 2070336, 'steps': 10782, 'loss/train': 2.0651187896728516} 11/06/2021 22:44:38 - INFO - __main__ - Step 10784: {'lr': 0.0004956677473761871, 'samples': 2070528, 'steps': 10783, 'loss/train': 1.8574343919754028} 11/06/2021 22:44:38 - INFO - __main__ - Step 10785: {'lr': 0.0004956667636713427, 'samples': 2070720, 'steps': 10784, 'loss/train': 1.5868165493011475} 11/06/2021 22:44:39 - INFO - __main__ - Step 10786: {'lr': 0.0004956657798558047, 'samples': 2070912, 'steps': 10785, 'loss/train': 1.4748395681381226} 11/06/2021 22:44:39 - INFO - __main__ - Step 10787: {'lr': 0.0004956647959295735, 'samples': 2071104, 'steps': 10786, 'loss/train': 1.701545000076294} 11/06/2021 22:44:40 - INFO - __main__ - Step 10788: {'lr': 0.0004956638118926495, 'samples': 2071296, 'steps': 10787, 'loss/train': 1.976759910583496} 11/06/2021 22:44:40 - INFO - __main__ - Step 10789: {'lr': 0.0004956628277450333, 'samples': 2071488, 'steps': 10788, 'loss/train': 2.1454813480377197} 11/06/2021 22:44:41 - INFO - __main__ - Step 10790: {'lr': 0.0004956618434867251, 'samples': 2071680, 'steps': 10789, 'loss/train': 1.5759694576263428} 11/06/2021 22:44:41 - INFO - __main__ - Step 10791: {'lr': 0.0004956608591177256, 'samples': 2071872, 'steps': 10790, 'loss/train': 1.8092042207717896} 11/06/2021 22:44:41 - INFO - __main__ - Step 10792: {'lr': 0.0004956598746380349, 'samples': 2072064, 'steps': 10791, 'loss/train': 1.6778979301452637} 11/06/2021 22:44:43 - INFO - __main__ - Step 10793: {'lr': 0.0004956588900476538, 'samples': 2072256, 'steps': 10792, 'loss/train': 1.459761381149292} 11/06/2021 22:44:43 - INFO - __main__ - Step 10794: {'lr': 0.0004956579053465826, 'samples': 2072448, 'steps': 10793, 'loss/train': 1.5399609804153442} 11/06/2021 22:44:43 - INFO - __main__ - Step 10795: {'lr': 0.0004956569205348217, 'samples': 2072640, 'steps': 10794, 'loss/train': 1.4473868608474731} 11/06/2021 22:44:44 - INFO - __main__ - Step 10796: {'lr': 0.0004956559356123717, 'samples': 2072832, 'steps': 10795, 'loss/train': 1.825037956237793} 11/06/2021 22:44:44 - INFO - __main__ - Step 10797: {'lr': 0.0004956549505792327, 'samples': 2073024, 'steps': 10796, 'loss/train': 2.264521360397339} 11/06/2021 22:44:45 - INFO - __main__ - Step 10798: {'lr': 0.0004956539654354055, 'samples': 2073216, 'steps': 10797, 'loss/train': 1.450287938117981} 11/06/2021 22:44:45 - INFO - __main__ - Step 10799: {'lr': 0.0004956529801808904, 'samples': 2073408, 'steps': 10798, 'loss/train': 1.9415364265441895} 11/06/2021 22:44:46 - INFO - __main__ - Step 10800: {'lr': 0.0004956519948156879, 'samples': 2073600, 'steps': 10799, 'loss/train': 1.7281900644302368} 11/06/2021 22:44:46 - INFO - __main__ - Step 10801: {'lr': 0.0004956510093397983, 'samples': 2073792, 'steps': 10800, 'loss/train': 1.6848071813583374} 11/06/2021 22:44:46 - INFO - __main__ - Step 10802: {'lr': 0.0004956500237532222, 'samples': 2073984, 'steps': 10801, 'loss/train': 2.119485855102539} 11/06/2021 22:44:47 - INFO - __main__ - Step 10803: {'lr': 0.0004956490380559601, 'samples': 2074176, 'steps': 10802, 'loss/train': 1.7440723180770874} 11/06/2021 22:44:48 - INFO - __main__ - Step 10804: {'lr': 0.0004956480522480121, 'samples': 2074368, 'steps': 10803, 'loss/train': 1.6248648166656494} 11/06/2021 22:44:48 - INFO - __main__ - Step 10805: {'lr': 0.000495647066329379, 'samples': 2074560, 'steps': 10804, 'loss/train': 2.111401081085205} 11/06/2021 22:44:49 - INFO - __main__ - Step 10806: {'lr': 0.0004956460803000612, 'samples': 2074752, 'steps': 10805, 'loss/train': 2.2455363273620605} 11/06/2021 22:44:49 - INFO - __main__ - Step 10807: {'lr': 0.0004956450941600589, 'samples': 2074944, 'steps': 10806, 'loss/train': 4.325562477111816} 11/06/2021 22:44:49 - INFO - __main__ - Step 10808: {'lr': 0.0004956441079093729, 'samples': 2075136, 'steps': 10807, 'loss/train': 1.592270851135254} 11/06/2021 22:44:50 - INFO - __main__ - Step 10809: {'lr': 0.0004956431215480034, 'samples': 2075328, 'steps': 10808, 'loss/train': 1.865347981452942} 11/06/2021 22:44:51 - INFO - __main__ - Step 10810: {'lr': 0.0004956421350759508, 'samples': 2075520, 'steps': 10809, 'loss/train': 1.3117084503173828} 11/06/2021 22:44:51 - INFO - __main__ - Step 10811: {'lr': 0.0004956411484932158, 'samples': 2075712, 'steps': 10810, 'loss/train': 1.8176014423370361} 11/06/2021 22:44:51 - INFO - __main__ - Step 10812: {'lr': 0.0004956401617997985, 'samples': 2075904, 'steps': 10811, 'loss/train': 1.2668240070343018} 11/06/2021 22:44:52 - INFO - __main__ - Step 10813: {'lr': 0.0004956391749956997, 'samples': 2076096, 'steps': 10812, 'loss/train': 2.1930673122406006} 11/06/2021 22:44:53 - INFO - __main__ - Step 10814: {'lr': 0.0004956381880809195, 'samples': 2076288, 'steps': 10813, 'loss/train': 2.0894155502319336} 11/06/2021 22:44:53 - INFO - __main__ - Step 10815: {'lr': 0.0004956372010554587, 'samples': 2076480, 'steps': 10814, 'loss/train': 1.8282910585403442} 11/06/2021 22:44:54 - INFO - __main__ - Step 10816: {'lr': 0.0004956362139193174, 'samples': 2076672, 'steps': 10815, 'loss/train': 2.062333345413208} 11/06/2021 22:44:54 - INFO - __main__ - Step 10817: {'lr': 0.0004956352266724964, 'samples': 2076864, 'steps': 10816, 'loss/train': 1.4473098516464233} 11/06/2021 22:44:54 - INFO - __main__ - Step 10818: {'lr': 0.0004956342393149959, 'samples': 2077056, 'steps': 10817, 'loss/train': 1.7274878025054932} 11/06/2021 22:44:56 - INFO - __main__ - Step 10819: {'lr': 0.0004956332518468163, 'samples': 2077248, 'steps': 10818, 'loss/train': 1.570633053779602} 11/06/2021 22:44:56 - INFO - __main__ - Step 10820: {'lr': 0.0004956322642679583, 'samples': 2077440, 'steps': 10819, 'loss/train': 1.7297214269638062} 11/06/2021 22:44:56 - INFO - __main__ - Step 10821: {'lr': 0.000495631276578422, 'samples': 2077632, 'steps': 10820, 'loss/train': 4.519792079925537} 11/06/2021 22:44:57 - INFO - __main__ - Step 10822: {'lr': 0.0004956302887782082, 'samples': 2077824, 'steps': 10821, 'loss/train': 1.0446696281433105} 11/06/2021 22:44:57 - INFO - __main__ - Step 10823: {'lr': 0.0004956293008673172, 'samples': 2078016, 'steps': 10822, 'loss/train': 0.5035202503204346} 11/06/2021 22:44:57 - INFO - __main__ - Step 10824: {'lr': 0.0004956283128457493, 'samples': 2078208, 'steps': 10823, 'loss/train': 0.34573525190353394} 11/06/2021 22:44:58 - INFO - __main__ - Step 10825: {'lr': 0.0004956273247135051, 'samples': 2078400, 'steps': 10824, 'loss/train': 1.6415156126022339} 11/06/2021 22:44:59 - INFO - __main__ - Step 10826: {'lr': 0.0004956263364705851, 'samples': 2078592, 'steps': 10825, 'loss/train': 1.6332308053970337} 11/06/2021 22:44:59 - INFO - __main__ - Step 10827: {'lr': 0.0004956253481169895, 'samples': 2078784, 'steps': 10826, 'loss/train': 2.319352865219116} 11/06/2021 22:44:59 - INFO - __main__ - Step 10828: {'lr': 0.0004956243596527191, 'samples': 2078976, 'steps': 10827, 'loss/train': 1.773661494255066} 11/06/2021 22:45:00 - INFO - __main__ - Step 10829: {'lr': 0.000495623371077774, 'samples': 2079168, 'steps': 10828, 'loss/train': 1.8046337366104126} 11/06/2021 22:45:01 - INFO - __main__ - Step 10830: {'lr': 0.000495622382392155, 'samples': 2079360, 'steps': 10829, 'loss/train': 1.06633460521698} 11/06/2021 22:45:01 - INFO - __main__ - Step 10831: {'lr': 0.0004956213935958621, 'samples': 2079552, 'steps': 10830, 'loss/train': 1.7313029766082764} 11/06/2021 22:45:02 - INFO - __main__ - Step 10832: {'lr': 0.0004956204046888961, 'samples': 2079744, 'steps': 10831, 'loss/train': 1.655548095703125} 11/06/2021 22:45:02 - INFO - __main__ - Step 10833: {'lr': 0.0004956194156712574, 'samples': 2079936, 'steps': 10832, 'loss/train': 2.0152714252471924} 11/06/2021 22:45:02 - INFO - __main__ - Step 10834: {'lr': 0.0004956184265429463, 'samples': 2080128, 'steps': 10833, 'loss/train': 2.010787010192871} 11/06/2021 22:45:03 - INFO - __main__ - Step 10835: {'lr': 0.0004956174373039634, 'samples': 2080320, 'steps': 10834, 'loss/train': 1.838008999824524} 11/06/2021 22:45:04 - INFO - __main__ - Step 10836: {'lr': 0.0004956164479543089, 'samples': 2080512, 'steps': 10835, 'loss/train': 1.985622525215149} 11/06/2021 22:45:04 - INFO - __main__ - Step 10837: {'lr': 0.0004956154584939836, 'samples': 2080704, 'steps': 10836, 'loss/train': 1.8421908617019653} 11/06/2021 22:45:04 - INFO - __main__ - Step 10838: {'lr': 0.0004956144689229877, 'samples': 2080896, 'steps': 10837, 'loss/train': 2.1746902465820312} 11/06/2021 22:45:05 - INFO - __main__ - Step 10839: {'lr': 0.0004956134792413218, 'samples': 2081088, 'steps': 10838, 'loss/train': 1.700150966644287} 11/06/2021 22:45:05 - INFO - __main__ - Step 10840: {'lr': 0.0004956124894489861, 'samples': 2081280, 'steps': 10839, 'loss/train': 2.058382749557495} 11/06/2021 22:45:06 - INFO - __main__ - Step 10841: {'lr': 0.0004956114995459813, 'samples': 2081472, 'steps': 10840, 'loss/train': 6.1868720054626465} 11/06/2021 22:45:06 - INFO - __main__ - Step 10842: {'lr': 0.0004956105095323077, 'samples': 2081664, 'steps': 10841, 'loss/train': 1.7854186296463013} 11/06/2021 22:45:07 - INFO - __main__ - Step 10843: {'lr': 0.0004956095194079658, 'samples': 2081856, 'steps': 10842, 'loss/train': 1.8103166818618774} 11/06/2021 22:45:07 - INFO - __main__ - Step 10844: {'lr': 0.000495608529172956, 'samples': 2082048, 'steps': 10843, 'loss/train': 1.3330121040344238} 11/06/2021 22:45:08 - INFO - __main__ - Step 10845: {'lr': 0.0004956075388272789, 'samples': 2082240, 'steps': 10844, 'loss/train': 1.5644762516021729} 11/06/2021 22:45:09 - INFO - __main__ - Step 10846: {'lr': 0.0004956065483709348, 'samples': 2082432, 'steps': 10845, 'loss/train': 1.6880342960357666} 11/06/2021 22:45:09 - INFO - __main__ - Step 10847: {'lr': 0.0004956055578039241, 'samples': 2082624, 'steps': 10846, 'loss/train': 1.727042317390442} 11/06/2021 22:45:09 - INFO - __main__ - Step 10848: {'lr': 0.0004956045671262475, 'samples': 2082816, 'steps': 10847, 'loss/train': 1.3795113563537598} 11/06/2021 22:45:10 - INFO - __main__ - Step 10849: {'lr': 0.0004956035763379051, 'samples': 2083008, 'steps': 10848, 'loss/train': 1.9024347066879272} 11/06/2021 22:45:10 - INFO - __main__ - Step 10850: {'lr': 0.0004956025854388976, 'samples': 2083200, 'steps': 10849, 'loss/train': 0.8960413336753845} 11/06/2021 22:45:11 - INFO - __main__ - Step 10851: {'lr': 0.0004956015944292253, 'samples': 2083392, 'steps': 10850, 'loss/train': 2.1259398460388184} 11/06/2021 22:45:11 - INFO - __main__ - Step 10852: {'lr': 0.0004956006033088888, 'samples': 2083584, 'steps': 10851, 'loss/train': 1.9654005765914917} 11/06/2021 22:45:12 - INFO - __main__ - Step 10853: {'lr': 0.0004955996120778884, 'samples': 2083776, 'steps': 10852, 'loss/train': 1.8148349523544312} 11/06/2021 22:45:12 - INFO - __main__ - Step 10854: {'lr': 0.0004955986207362246, 'samples': 2083968, 'steps': 10853, 'loss/train': 0.7852979898452759} 11/06/2021 22:45:12 - INFO - __main__ - Step 10855: {'lr': 0.0004955976292838979, 'samples': 2084160, 'steps': 10854, 'loss/train': 1.4349844455718994} 11/06/2021 22:45:14 - INFO - __main__ - Step 10856: {'lr': 0.0004955966377209086, 'samples': 2084352, 'steps': 10855, 'loss/train': 1.8268946409225464} 11/06/2021 22:45:14 - INFO - __main__ - Step 10857: {'lr': 0.0004955956460472573, 'samples': 2084544, 'steps': 10856, 'loss/train': 2.225966453552246} 11/06/2021 22:45:14 - INFO - __main__ - Step 10858: {'lr': 0.0004955946542629444, 'samples': 2084736, 'steps': 10857, 'loss/train': 1.7703862190246582} 11/06/2021 22:45:15 - INFO - __main__ - Step 10859: {'lr': 0.0004955936623679703, 'samples': 2084928, 'steps': 10858, 'loss/train': 2.2991316318511963} 11/06/2021 22:45:15 - INFO - __main__ - Step 10860: {'lr': 0.0004955926703623356, 'samples': 2085120, 'steps': 10859, 'loss/train': 2.4180328845977783} 11/06/2021 22:45:15 - INFO - __main__ - Step 10861: {'lr': 0.0004955916782460405, 'samples': 2085312, 'steps': 10860, 'loss/train': 2.1252551078796387} 11/06/2021 22:45:16 - INFO - __main__ - Step 10862: {'lr': 0.0004955906860190857, 'samples': 2085504, 'steps': 10861, 'loss/train': 1.9650782346725464} 11/06/2021 22:45:17 - INFO - __main__ - Step 10863: {'lr': 0.0004955896936814714, 'samples': 2085696, 'steps': 10862, 'loss/train': 1.8924416303634644} 11/06/2021 22:45:17 - INFO - __main__ - Step 10864: {'lr': 0.0004955887012331982, 'samples': 2085888, 'steps': 10863, 'loss/train': 1.7115322351455688} 11/06/2021 22:45:17 - INFO - __main__ - Step 10865: {'lr': 0.0004955877086742666, 'samples': 2086080, 'steps': 10864, 'loss/train': 1.3016084432601929} 11/06/2021 22:45:18 - INFO - __main__ - Step 10866: {'lr': 0.0004955867160046769, 'samples': 2086272, 'steps': 10865, 'loss/train': 1.6526113748550415} 11/06/2021 22:45:19 - INFO - __main__ - Step 10867: {'lr': 0.0004955857232244297, 'samples': 2086464, 'steps': 10866, 'loss/train': 1.5143797397613525} 11/06/2021 22:45:19 - INFO - __main__ - Step 10868: {'lr': 0.0004955847303335253, 'samples': 2086656, 'steps': 10867, 'loss/train': 1.8986718654632568} 11/06/2021 22:45:19 - INFO - __main__ - Step 10869: {'lr': 0.0004955837373319641, 'samples': 2086848, 'steps': 10868, 'loss/train': 1.6956390142440796} 11/06/2021 22:45:20 - INFO - __main__ - Step 10870: {'lr': 0.0004955827442197468, 'samples': 2087040, 'steps': 10869, 'loss/train': 1.9190312623977661} 11/06/2021 22:45:20 - INFO - __main__ - Step 10871: {'lr': 0.0004955817509968737, 'samples': 2087232, 'steps': 10870, 'loss/train': 1.3590363264083862} 11/06/2021 22:45:21 - INFO - __main__ - Step 10872: {'lr': 0.0004955807576633452, 'samples': 2087424, 'steps': 10871, 'loss/train': 1.7737996578216553} 11/06/2021 22:45:22 - INFO - __main__ - Step 10873: {'lr': 0.0004955797642191618, 'samples': 2087616, 'steps': 10872, 'loss/train': 1.8622442483901978} 11/06/2021 22:45:22 - INFO - __main__ - Step 10874: {'lr': 0.000495578770664324, 'samples': 2087808, 'steps': 10873, 'loss/train': 1.9006373882293701} 11/06/2021 22:45:22 - INFO - __main__ - Step 10875: {'lr': 0.0004955777769988322, 'samples': 2088000, 'steps': 10874, 'loss/train': 2.0556890964508057} 11/06/2021 22:45:23 - INFO - __main__ - Step 10876: {'lr': 0.0004955767832226868, 'samples': 2088192, 'steps': 10875, 'loss/train': 1.6354632377624512} 11/06/2021 22:45:24 - INFO - __main__ - Step 10877: {'lr': 0.0004955757893358884, 'samples': 2088384, 'steps': 10876, 'loss/train': 1.777341365814209} 11/06/2021 22:45:24 - INFO - __main__ - Step 10878: {'lr': 0.0004955747953384372, 'samples': 2088576, 'steps': 10877, 'loss/train': 1.4678032398223877} 11/06/2021 22:45:25 - INFO - __main__ - Step 10879: {'lr': 0.0004955738012303338, 'samples': 2088768, 'steps': 10878, 'loss/train': 2.07291579246521} 11/06/2021 22:45:25 - INFO - __main__ - Step 10880: {'lr': 0.0004955728070115787, 'samples': 2088960, 'steps': 10879, 'loss/train': 1.9695255756378174} 11/06/2021 22:45:25 - INFO - __main__ - Step 10881: {'lr': 0.0004955718126821722, 'samples': 2089152, 'steps': 10880, 'loss/train': 2.657900094985962} 11/06/2021 22:45:26 - INFO - __main__ - Step 10882: {'lr': 0.0004955708182421149, 'samples': 2089344, 'steps': 10881, 'loss/train': 1.587023377418518} 11/06/2021 22:45:26 - INFO - __main__ - Step 10883: {'lr': 0.0004955698236914071, 'samples': 2089536, 'steps': 10882, 'loss/train': 1.4646183252334595} 11/06/2021 22:45:27 - INFO - __main__ - Step 10884: {'lr': 0.0004955688290300494, 'samples': 2089728, 'steps': 10883, 'loss/train': 2.305067539215088} 11/06/2021 22:45:27 - INFO - __main__ - Step 10885: {'lr': 0.0004955678342580421, 'samples': 2089920, 'steps': 10884, 'loss/train': 2.04618501663208} 11/06/2021 22:45:28 - INFO - __main__ - Step 10886: {'lr': 0.0004955668393753858, 'samples': 2090112, 'steps': 10885, 'loss/train': 1.8852858543395996} 11/06/2021 22:45:28 - INFO - __main__ - Step 10887: {'lr': 0.0004955658443820809, 'samples': 2090304, 'steps': 10886, 'loss/train': 1.9681472778320312} 11/06/2021 22:45:28 - INFO - __main__ - Step 10888: {'lr': 0.0004955648492781277, 'samples': 2090496, 'steps': 10887, 'loss/train': 2.0879921913146973} 11/06/2021 22:45:30 - INFO - __main__ - Step 10889: {'lr': 0.0004955638540635269, 'samples': 2090688, 'steps': 10888, 'loss/train': 1.7729140520095825} 11/06/2021 22:45:30 - INFO - __main__ - Step 10890: {'lr': 0.0004955628587382788, 'samples': 2090880, 'steps': 10889, 'loss/train': 1.8550975322723389} 11/06/2021 22:45:30 - INFO - __main__ - Step 10891: {'lr': 0.0004955618633023837, 'samples': 2091072, 'steps': 10890, 'loss/train': 1.8905608654022217} 11/06/2021 22:45:31 - INFO - __main__ - Step 10892: {'lr': 0.0004955608677558424, 'samples': 2091264, 'steps': 10891, 'loss/train': 1.5222508907318115} 11/06/2021 22:45:31 - INFO - __main__ - Step 10893: {'lr': 0.0004955598720986551, 'samples': 2091456, 'steps': 10892, 'loss/train': 1.770163655281067} 11/06/2021 22:45:32 - INFO - __main__ - Step 10894: {'lr': 0.0004955588763308223, 'samples': 2091648, 'steps': 10893, 'loss/train': 1.9412416219711304} 11/06/2021 22:45:32 - INFO - __main__ - Step 10895: {'lr': 0.0004955578804523445, 'samples': 2091840, 'steps': 10894, 'loss/train': 1.9182251691818237} 11/06/2021 22:45:33 - INFO - __main__ - Step 10896: {'lr': 0.000495556884463222, 'samples': 2092032, 'steps': 10895, 'loss/train': 1.7548394203186035} 11/06/2021 22:45:33 - INFO - __main__ - Step 10897: {'lr': 0.0004955558883634555, 'samples': 2092224, 'steps': 10896, 'loss/train': 2.1378374099731445} 11/06/2021 22:45:33 - INFO - __main__ - Step 10898: {'lr': 0.0004955548921530452, 'samples': 2092416, 'steps': 10897, 'loss/train': 1.4862196445465088} 11/06/2021 22:45:34 - INFO - __main__ - Step 10899: {'lr': 0.0004955538958319917, 'samples': 2092608, 'steps': 10898, 'loss/train': 1.4818750619888306} 11/06/2021 22:45:35 - INFO - __main__ - Step 10900: {'lr': 0.0004955528994002954, 'samples': 2092800, 'steps': 10899, 'loss/train': 2.25352144241333} 11/06/2021 22:45:35 - INFO - __main__ - Step 10901: {'lr': 0.0004955519028579568, 'samples': 2092992, 'steps': 10900, 'loss/train': 1.904826045036316} 11/06/2021 22:45:35 - INFO - __main__ - Step 10902: {'lr': 0.0004955509062049763, 'samples': 2093184, 'steps': 10901, 'loss/train': 2.2646710872650146} 11/06/2021 22:45:36 - INFO - __main__ - Step 10903: {'lr': 0.0004955499094413542, 'samples': 2093376, 'steps': 10902, 'loss/train': 1.8453540802001953} 11/06/2021 22:45:37 - INFO - __main__ - Step 10904: {'lr': 0.0004955489125670912, 'samples': 2093568, 'steps': 10903, 'loss/train': 1.2901415824890137} 11/06/2021 22:45:37 - INFO - __main__ - Step 10905: {'lr': 0.0004955479155821877, 'samples': 2093760, 'steps': 10904, 'loss/train': 1.7908188104629517} 11/06/2021 22:45:37 - INFO - __main__ - Step 10906: {'lr': 0.000495546918486644, 'samples': 2093952, 'steps': 10905, 'loss/train': 1.9787135124206543} 11/06/2021 22:45:38 - INFO - __main__ - Step 10907: {'lr': 0.0004955459212804607, 'samples': 2094144, 'steps': 10906, 'loss/train': 0.762311577796936} 11/06/2021 22:45:38 - INFO - __main__ - Step 10908: {'lr': 0.0004955449239636382, 'samples': 2094336, 'steps': 10907, 'loss/train': 1.5875378847122192} 11/06/2021 22:45:38 - INFO - __main__ - Step 10909: {'lr': 0.000495543926536177, 'samples': 2094528, 'steps': 10908, 'loss/train': 1.9852023124694824} 11/06/2021 22:45:39 - INFO - __main__ - Step 10910: {'lr': 0.0004955429289980774, 'samples': 2094720, 'steps': 10909, 'loss/train': 1.7083479166030884} 11/06/2021 22:45:40 - INFO - __main__ - Step 10911: {'lr': 0.00049554193134934, 'samples': 2094912, 'steps': 10910, 'loss/train': 2.226473808288574} 11/06/2021 22:45:40 - INFO - __main__ - Step 10912: {'lr': 0.0004955409335899651, 'samples': 2095104, 'steps': 10911, 'loss/train': 1.7257845401763916} 11/06/2021 22:45:40 - INFO - __main__ - Step 10913: {'lr': 0.0004955399357199534, 'samples': 2095296, 'steps': 10912, 'loss/train': 1.5614092350006104} 11/06/2021 22:45:41 - INFO - __main__ - Step 10914: {'lr': 0.0004955389377393051, 'samples': 2095488, 'steps': 10913, 'loss/train': 1.6544125080108643} 11/06/2021 22:45:42 - INFO - __main__ - Step 10915: {'lr': 0.0004955379396480207, 'samples': 2095680, 'steps': 10914, 'loss/train': 2.452582836151123} 11/06/2021 22:45:42 - INFO - __main__ - Step 10916: {'lr': 0.0004955369414461007, 'samples': 2095872, 'steps': 10915, 'loss/train': 1.0178364515304565} 11/06/2021 22:45:43 - INFO - __main__ - Step 10917: {'lr': 0.0004955359431335456, 'samples': 2096064, 'steps': 10916, 'loss/train': 0.8691990375518799} 11/06/2021 22:45:43 - INFO - __main__ - Step 10918: {'lr': 0.0004955349447103559, 'samples': 2096256, 'steps': 10917, 'loss/train': 1.4804587364196777} 11/06/2021 22:45:43 - INFO - __main__ - Step 10919: {'lr': 0.0004955339461765318, 'samples': 2096448, 'steps': 10918, 'loss/train': 1.8954075574874878} 11/06/2021 22:45:44 - INFO - __main__ - Step 10920: {'lr': 0.0004955329475320739, 'samples': 2096640, 'steps': 10919, 'loss/train': 1.3000982999801636} 11/06/2021 22:45:45 - INFO - __main__ - Step 10921: {'lr': 0.0004955319487769827, 'samples': 2096832, 'steps': 10920, 'loss/train': 1.6396160125732422} 11/06/2021 22:45:45 - INFO - __main__ - Step 10922: {'lr': 0.0004955309499112586, 'samples': 2097024, 'steps': 10921, 'loss/train': 1.534606695175171} 11/06/2021 22:45:45 - INFO - __main__ - Step 10923: {'lr': 0.000495529950934902, 'samples': 2097216, 'steps': 10922, 'loss/train': 1.6064701080322266} 11/06/2021 22:45:46 - INFO - __main__ - Step 10924: {'lr': 0.0004955289518479134, 'samples': 2097408, 'steps': 10923, 'loss/train': 1.5156863927841187} 11/06/2021 22:45:47 - INFO - __main__ - Step 10925: {'lr': 0.0004955279526502931, 'samples': 2097600, 'steps': 10924, 'loss/train': 1.9213401079177856} 11/06/2021 22:45:47 - INFO - __main__ - Step 10926: {'lr': 0.0004955269533420419, 'samples': 2097792, 'steps': 10925, 'loss/train': 1.5095373392105103} 11/06/2021 22:45:47 - INFO - __main__ - Step 10927: {'lr': 0.00049552595392316, 'samples': 2097984, 'steps': 10926, 'loss/train': 1.9724854230880737} 11/06/2021 22:45:48 - INFO - __main__ - Step 10928: {'lr': 0.0004955249543936479, 'samples': 2098176, 'steps': 10927, 'loss/train': 1.7889426946640015} 11/06/2021 22:45:48 - INFO - __main__ - Step 10929: {'lr': 0.000495523954753506, 'samples': 2098368, 'steps': 10928, 'loss/train': 2.7239041328430176} 11/06/2021 22:45:49 - INFO - __main__ - Step 10930: {'lr': 0.0004955229550027347, 'samples': 2098560, 'steps': 10929, 'loss/train': 1.7207810878753662} 11/06/2021 22:45:49 - INFO - __main__ - Step 10931: {'lr': 0.0004955219551413347, 'samples': 2098752, 'steps': 10930, 'loss/train': 1.4292515516281128} 11/06/2021 22:45:50 - INFO - __main__ - Step 10932: {'lr': 0.0004955209551693063, 'samples': 2098944, 'steps': 10931, 'loss/train': 2.1134376525878906} 11/06/2021 22:45:50 - INFO - __main__ - Step 10933: {'lr': 0.0004955199550866498, 'samples': 2099136, 'steps': 10932, 'loss/train': 1.5380351543426514} 11/06/2021 22:45:51 - INFO - __main__ - Step 10934: {'lr': 0.000495518954893366, 'samples': 2099328, 'steps': 10933, 'loss/train': 1.7026230096817017} 11/06/2021 22:45:51 - INFO - __main__ - Step 10935: {'lr': 0.000495517954589455, 'samples': 2099520, 'steps': 10934, 'loss/train': 1.979361891746521} 11/06/2021 22:45:52 - INFO - __main__ - Step 10936: {'lr': 0.0004955169541749173, 'samples': 2099712, 'steps': 10935, 'loss/train': 1.7494332790374756} 11/06/2021 22:45:52 - INFO - __main__ - Step 10937: {'lr': 0.0004955159536497536, 'samples': 2099904, 'steps': 10936, 'loss/train': 1.8730812072753906} 11/06/2021 22:45:53 - INFO - __main__ - Step 10938: {'lr': 0.0004955149530139643, 'samples': 2100096, 'steps': 10937, 'loss/train': 1.707970142364502} 11/06/2021 22:45:53 - INFO - __main__ - Step 10939: {'lr': 0.0004955139522675496, 'samples': 2100288, 'steps': 10938, 'loss/train': 2.6687307357788086} 11/06/2021 22:45:53 - INFO - __main__ - Step 10940: {'lr': 0.0004955129514105101, 'samples': 2100480, 'steps': 10939, 'loss/train': 2.0217690467834473} 11/06/2021 22:45:54 - INFO - __main__ - Step 10941: {'lr': 0.0004955119504428464, 'samples': 2100672, 'steps': 10940, 'loss/train': 1.6573915481567383} 11/06/2021 22:45:55 - INFO - __main__ - Step 10942: {'lr': 0.0004955109493645587, 'samples': 2100864, 'steps': 10941, 'loss/train': 1.8716685771942139} 11/06/2021 22:45:55 - INFO - __main__ - Step 10943: {'lr': 0.0004955099481756475, 'samples': 2101056, 'steps': 10942, 'loss/train': 2.186129331588745} 11/06/2021 22:45:55 - INFO - __main__ - Step 10944: {'lr': 0.0004955089468761133, 'samples': 2101248, 'steps': 10943, 'loss/train': 2.171231985092163} 11/06/2021 22:45:56 - INFO - __main__ - Step 10945: {'lr': 0.0004955079454659567, 'samples': 2101440, 'steps': 10944, 'loss/train': 1.6056174039840698} 11/06/2021 22:45:57 - INFO - __main__ - Step 10946: {'lr': 0.0004955069439451778, 'samples': 2101632, 'steps': 10945, 'loss/train': 1.8130086660385132} 11/06/2021 22:45:57 - INFO - __main__ - Step 10947: {'lr': 0.0004955059423137774, 'samples': 2101824, 'steps': 10946, 'loss/train': 1.8952678442001343} 11/06/2021 22:45:57 - INFO - __main__ - Step 10948: {'lr': 0.0004955049405717558, 'samples': 2102016, 'steps': 10947, 'loss/train': 1.9916630983352661} 11/06/2021 22:45:58 - INFO - __main__ - Step 10949: {'lr': 0.0004955039387191135, 'samples': 2102208, 'steps': 10948, 'loss/train': 1.791800618171692} 11/06/2021 22:45:58 - INFO - __main__ - Step 10950: {'lr': 0.0004955029367558508, 'samples': 2102400, 'steps': 10949, 'loss/train': 1.4076131582260132} 11/06/2021 22:45:59 - INFO - __main__ - Step 10951: {'lr': 0.0004955019346819684, 'samples': 2102592, 'steps': 10950, 'loss/train': 1.7060812711715698} 11/06/2021 22:45:59 - INFO - __main__ - Step 10952: {'lr': 0.0004955009324974666, 'samples': 2102784, 'steps': 10951, 'loss/train': 1.271490216255188} 11/06/2021 22:46:00 - INFO - __main__ - Step 10953: {'lr': 0.0004954999302023458, 'samples': 2102976, 'steps': 10952, 'loss/train': 2.104248046875} 11/06/2021 22:46:00 - INFO - __main__ - Step 10954: {'lr': 0.0004954989277966064, 'samples': 2103168, 'steps': 10953, 'loss/train': 1.4961109161376953} 11/06/2021 22:46:01 - INFO - __main__ - Step 10955: {'lr': 0.0004954979252802491, 'samples': 2103360, 'steps': 10954, 'loss/train': 1.854745626449585} 11/06/2021 22:46:02 - INFO - __main__ - Step 10956: {'lr': 0.0004954969226532743, 'samples': 2103552, 'steps': 10955, 'loss/train': 1.7922343015670776} 11/06/2021 22:46:02 - INFO - __main__ - Step 10957: {'lr': 0.0004954959199156824, 'samples': 2103744, 'steps': 10956, 'loss/train': 2.0796141624450684} 11/06/2021 22:46:02 - INFO - __main__ - Step 10958: {'lr': 0.0004954949170674736, 'samples': 2103936, 'steps': 10957, 'loss/train': 2.0407614707946777} 11/06/2021 22:46:03 - INFO - __main__ - Step 10959: {'lr': 0.0004954939141086488, 'samples': 2104128, 'steps': 10958, 'loss/train': 1.2875730991363525} 11/06/2021 22:46:03 - INFO - __main__ - Step 10960: {'lr': 0.0004954929110392081, 'samples': 2104320, 'steps': 10959, 'loss/train': 1.918273687362671} 11/06/2021 22:46:04 - INFO - __main__ - Step 10961: {'lr': 0.0004954919078591521, 'samples': 2104512, 'steps': 10960, 'loss/train': 1.7532039880752563} 11/06/2021 22:46:05 - INFO - __main__ - Step 10962: {'lr': 0.0004954909045684812, 'samples': 2104704, 'steps': 10961, 'loss/train': 1.595262050628662} 11/06/2021 22:46:05 - INFO - __main__ - Step 10963: {'lr': 0.000495489901167196, 'samples': 2104896, 'steps': 10962, 'loss/train': 1.8555185794830322} 11/06/2021 22:46:05 - INFO - __main__ - Step 10964: {'lr': 0.0004954888976552968, 'samples': 2105088, 'steps': 10963, 'loss/train': 1.559324026107788} 11/06/2021 22:46:06 - INFO - __main__ - Step 10965: {'lr': 0.0004954878940327841, 'samples': 2105280, 'steps': 10964, 'loss/train': 1.616891860961914} 11/06/2021 22:46:07 - INFO - __main__ - Step 10966: {'lr': 0.0004954868902996582, 'samples': 2105472, 'steps': 10965, 'loss/train': 1.9043447971343994} 11/06/2021 22:46:07 - INFO - __main__ - Step 10967: {'lr': 0.0004954858864559199, 'samples': 2105664, 'steps': 10966, 'loss/train': 2.002821922302246} 11/06/2021 22:46:07 - INFO - __main__ - Step 10968: {'lr': 0.0004954848825015694, 'samples': 2105856, 'steps': 10967, 'loss/train': 1.3038442134857178} 11/06/2021 22:46:08 - INFO - __main__ - Step 10969: {'lr': 0.0004954838784366071, 'samples': 2106048, 'steps': 10968, 'loss/train': 1.6041873693466187} 11/06/2021 22:46:08 - INFO - __main__ - Step 10970: {'lr': 0.0004954828742610336, 'samples': 2106240, 'steps': 10969, 'loss/train': 2.28267502784729} 11/06/2021 22:46:09 - INFO - __main__ - Step 10971: {'lr': 0.0004954818699748493, 'samples': 2106432, 'steps': 10970, 'loss/train': 1.9312834739685059} 11/06/2021 22:46:09 - INFO - __main__ - Step 10972: {'lr': 0.0004954808655780546, 'samples': 2106624, 'steps': 10971, 'loss/train': 1.7641648054122925} 11/06/2021 22:46:10 - INFO - __main__ - Step 10973: {'lr': 0.0004954798610706502, 'samples': 2106816, 'steps': 10972, 'loss/train': 1.8444775342941284} 11/06/2021 22:46:10 - INFO - __main__ - Step 10974: {'lr': 0.0004954788564526362, 'samples': 2107008, 'steps': 10973, 'loss/train': 1.9866970777511597} 11/06/2021 22:46:10 - INFO - __main__ - Step 10975: {'lr': 0.0004954778517240133, 'samples': 2107200, 'steps': 10974, 'loss/train': 1.7459683418273926} 11/06/2021 22:46:11 - INFO - __main__ - Step 10976: {'lr': 0.0004954768468847818, 'samples': 2107392, 'steps': 10975, 'loss/train': 3.977201223373413} 11/06/2021 22:46:12 - INFO - __main__ - Step 10977: {'lr': 0.0004954758419349422, 'samples': 2107584, 'steps': 10976, 'loss/train': 1.9195420742034912} 11/06/2021 22:46:12 - INFO - __main__ - Step 10978: {'lr': 0.000495474836874495, 'samples': 2107776, 'steps': 10977, 'loss/train': 1.9030547142028809} 11/06/2021 22:46:12 - INFO - __main__ - Step 10979: {'lr': 0.0004954738317034408, 'samples': 2107968, 'steps': 10978, 'loss/train': 1.9866000413894653} 11/06/2021 22:46:13 - INFO - __main__ - Step 10980: {'lr': 0.0004954728264217796, 'samples': 2108160, 'steps': 10979, 'loss/train': 1.738264560699463} 11/06/2021 22:46:13 - INFO - __main__ - Step 10981: {'lr': 0.0004954718210295123, 'samples': 2108352, 'steps': 10980, 'loss/train': 1.8209415674209595} 11/06/2021 22:46:14 - INFO - __main__ - Step 10982: {'lr': 0.0004954708155266392, 'samples': 2108544, 'steps': 10981, 'loss/train': 1.931311011314392} 11/06/2021 22:46:15 - INFO - __main__ - Step 10983: {'lr': 0.0004954698099131606, 'samples': 2108736, 'steps': 10982, 'loss/train': 2.182440996170044} 11/06/2021 22:46:15 - INFO - __main__ - Step 10984: {'lr': 0.0004954688041890772, 'samples': 2108928, 'steps': 10983, 'loss/train': 1.8009920120239258} 11/06/2021 22:46:15 - INFO - __main__ - Step 10985: {'lr': 0.0004954677983543893, 'samples': 2109120, 'steps': 10984, 'loss/train': 1.65086829662323} 11/06/2021 22:46:16 - INFO - __main__ - Step 10986: {'lr': 0.0004954667924090974, 'samples': 2109312, 'steps': 10985, 'loss/train': 2.294792413711548} 11/06/2021 22:46:17 - INFO - __main__ - Step 10987: {'lr': 0.000495465786353202, 'samples': 2109504, 'steps': 10986, 'loss/train': 1.9113411903381348} 11/06/2021 22:46:17 - INFO - __main__ - Step 10988: {'lr': 0.0004954647801867035, 'samples': 2109696, 'steps': 10987, 'loss/train': 1.2942930459976196} 11/06/2021 22:46:17 - INFO - __main__ - Step 10989: {'lr': 0.0004954637739096023, 'samples': 2109888, 'steps': 10988, 'loss/train': 2.1266255378723145} 11/06/2021 22:46:18 - INFO - __main__ - Step 10990: {'lr': 0.0004954627675218989, 'samples': 2110080, 'steps': 10989, 'loss/train': 1.2343058586120605} 11/06/2021 22:46:18 - INFO - __main__ - Step 10991: {'lr': 0.0004954617610235939, 'samples': 2110272, 'steps': 10990, 'loss/train': 1.7390135526657104} 11/06/2021 22:46:18 - INFO - __main__ - Step 10992: {'lr': 0.0004954607544146875, 'samples': 2110464, 'steps': 10991, 'loss/train': 1.8976117372512817} 11/06/2021 22:46:20 - INFO - __main__ - Step 10993: {'lr': 0.0004954597476951804, 'samples': 2110656, 'steps': 10992, 'loss/train': 1.9760702848434448} 11/06/2021 22:46:20 - INFO - __main__ - Step 10994: {'lr': 0.0004954587408650727, 'samples': 2110848, 'steps': 10993, 'loss/train': 2.1973133087158203} 11/06/2021 22:46:21 - INFO - __main__ - Step 10995: {'lr': 0.0004954577339243652, 'samples': 2111040, 'steps': 10994, 'loss/train': 1.842615008354187} 11/06/2021 22:46:21 - INFO - __main__ - Step 10996: {'lr': 0.0004954567268730582, 'samples': 2111232, 'steps': 10995, 'loss/train': 1.7507246732711792} 11/06/2021 22:46:21 - INFO - __main__ - Step 10997: {'lr': 0.0004954557197111522, 'samples': 2111424, 'steps': 10996, 'loss/train': 4.712644577026367} 11/06/2021 22:46:22 - INFO - __main__ - Step 10998: {'lr': 0.0004954547124386477, 'samples': 2111616, 'steps': 10997, 'loss/train': 2.954491138458252} 11/06/2021 22:46:23 - INFO - __main__ - Step 10999: {'lr': 0.0004954537050555451, 'samples': 2111808, 'steps': 10998, 'loss/train': 1.7333556413650513} 11/06/2021 22:46:23 - INFO - __main__ - Step 11000: {'lr': 0.0004954526975618447, 'samples': 2112000, 'steps': 10999, 'loss/train': 1.7559417486190796} 11/06/2021 22:46:23 - INFO - __main__ - Step 11001: {'lr': 0.0004954516899575473, 'samples': 2112192, 'steps': 11000, 'loss/train': 1.4720401763916016} 11/06/2021 22:46:24 - INFO - __main__ - Step 11002: {'lr': 0.000495450682242653, 'samples': 2112384, 'steps': 11001, 'loss/train': 1.7932754755020142} 11/06/2021 22:46:24 - INFO - __main__ - Step 11003: {'lr': 0.0004954496744171624, 'samples': 2112576, 'steps': 11002, 'loss/train': 2.697819232940674} 11/06/2021 22:46:25 - INFO - __main__ - Step 11004: {'lr': 0.0004954486664810762, 'samples': 2112768, 'steps': 11003, 'loss/train': 1.973361849784851} 11/06/2021 22:46:25 - INFO - __main__ - Step 11005: {'lr': 0.0004954476584343945, 'samples': 2112960, 'steps': 11004, 'loss/train': 1.4034092426300049} 11/06/2021 22:46:26 - INFO - __main__ - Step 11006: {'lr': 0.0004954466502771178, 'samples': 2113152, 'steps': 11005, 'loss/train': 1.6337881088256836} 11/06/2021 22:46:26 - INFO - __main__ - Step 11007: {'lr': 0.0004954456420092466, 'samples': 2113344, 'steps': 11006, 'loss/train': 1.6302669048309326} 11/06/2021 22:46:27 - INFO - __main__ - Step 11008: {'lr': 0.0004954446336307814, 'samples': 2113536, 'steps': 11007, 'loss/train': 1.5346612930297852} 11/06/2021 22:46:28 - INFO - __main__ - Step 11009: {'lr': 0.0004954436251417227, 'samples': 2113728, 'steps': 11008, 'loss/train': 1.931561827659607} 11/06/2021 22:46:29 - INFO - __main__ - Step 11010: {'lr': 0.0004954426165420709, 'samples': 2113920, 'steps': 11009, 'loss/train': 2.0047390460968018} 11/06/2021 22:46:29 - INFO - __main__ - Step 11011: {'lr': 0.0004954416078318263, 'samples': 2114112, 'steps': 11010, 'loss/train': 1.6063743829727173} 11/06/2021 22:46:29 - INFO - __main__ - Step 11012: {'lr': 0.0004954405990109897, 'samples': 2114304, 'steps': 11011, 'loss/train': 1.1053050756454468} 11/06/2021 22:46:30 - INFO - __main__ - Step 11013: {'lr': 0.0004954395900795611, 'samples': 2114496, 'steps': 11012, 'loss/train': 1.6776947975158691} 11/06/2021 22:46:30 - INFO - __main__ - Step 11014: {'lr': 0.0004954385810375415, 'samples': 2114688, 'steps': 11013, 'loss/train': 0.4628012776374817} 11/06/2021 22:46:31 - INFO - __main__ - Step 11015: {'lr': 0.0004954375718849308, 'samples': 2114880, 'steps': 11014, 'loss/train': 1.808875560760498} 11/06/2021 22:46:31 - INFO - __main__ - Step 11016: {'lr': 0.0004954365626217299, 'samples': 2115072, 'steps': 11015, 'loss/train': 1.8625273704528809} 11/06/2021 22:46:32 - INFO - __main__ - Step 11017: {'lr': 0.0004954355532479391, 'samples': 2115264, 'steps': 11016, 'loss/train': 1.388060450553894} 11/06/2021 22:46:32 - INFO - __main__ - Step 11018: {'lr': 0.0004954345437635587, 'samples': 2115456, 'steps': 11017, 'loss/train': 1.43634033203125} 11/06/2021 22:46:32 - INFO - __main__ - Step 11019: {'lr': 0.0004954335341685893, 'samples': 2115648, 'steps': 11018, 'loss/train': 1.6223043203353882} 11/06/2021 22:46:33 - INFO - __main__ - Step 11020: {'lr': 0.0004954325244630315, 'samples': 2115840, 'steps': 11019, 'loss/train': 1.9205574989318848} 11/06/2021 22:46:34 - INFO - __main__ - Step 11021: {'lr': 0.0004954315146468854, 'samples': 2116032, 'steps': 11020, 'loss/train': 2.2265076637268066} 11/06/2021 22:46:34 - INFO - __main__ - Step 11022: {'lr': 0.0004954305047201517, 'samples': 2116224, 'steps': 11021, 'loss/train': 1.7414882183074951} 11/06/2021 22:46:34 - INFO - __main__ - Step 11023: {'lr': 0.0004954294946828308, 'samples': 2116416, 'steps': 11022, 'loss/train': 2.187809944152832} 11/06/2021 22:46:35 - INFO - __main__ - Step 11024: {'lr': 0.0004954284845349232, 'samples': 2116608, 'steps': 11023, 'loss/train': 1.778928279876709} 11/06/2021 22:46:35 - INFO - __main__ - Step 11025: {'lr': 0.0004954274742764292, 'samples': 2116800, 'steps': 11024, 'loss/train': 1.018846869468689} 11/06/2021 22:46:36 - INFO - __main__ - Step 11026: {'lr': 0.0004954264639073495, 'samples': 2116992, 'steps': 11025, 'loss/train': 4.848390579223633} 11/06/2021 22:46:37 - INFO - __main__ - Step 11027: {'lr': 0.0004954254534276843, 'samples': 2117184, 'steps': 11026, 'loss/train': 1.92079496383667} 11/06/2021 22:46:37 - INFO - __main__ - Step 11028: {'lr': 0.0004954244428374343, 'samples': 2117376, 'steps': 11027, 'loss/train': 1.617121934890747} 11/06/2021 22:46:37 - INFO - __main__ - Step 11029: {'lr': 0.0004954234321365998, 'samples': 2117568, 'steps': 11028, 'loss/train': 2.104278087615967} 11/06/2021 22:46:38 - INFO - __main__ - Step 11030: {'lr': 0.0004954224213251813, 'samples': 2117760, 'steps': 11029, 'loss/train': 1.8662179708480835} 11/06/2021 22:46:39 - INFO - __main__ - Step 11031: {'lr': 0.0004954214104031791, 'samples': 2117952, 'steps': 11030, 'loss/train': 1.559154748916626} 11/06/2021 22:46:39 - INFO - __main__ - Step 11032: {'lr': 0.0004954203993705939, 'samples': 2118144, 'steps': 11031, 'loss/train': 1.7694008350372314} 11/06/2021 22:46:39 - INFO - __main__ - Step 11033: {'lr': 0.0004954193882274261, 'samples': 2118336, 'steps': 11032, 'loss/train': 1.8850313425064087} 11/06/2021 22:46:40 - INFO - __main__ - Step 11034: {'lr': 0.000495418376973676, 'samples': 2118528, 'steps': 11033, 'loss/train': 1.6931498050689697} 11/06/2021 22:46:40 - INFO - __main__ - Step 11035: {'lr': 0.0004954173656093443, 'samples': 2118720, 'steps': 11034, 'loss/train': 1.525468111038208} 11/06/2021 22:46:41 - INFO - __main__ - Step 11036: {'lr': 0.0004954163541344312, 'samples': 2118912, 'steps': 11035, 'loss/train': 1.5203142166137695} 11/06/2021 22:46:41 - INFO - __main__ - Step 11037: {'lr': 0.0004954153425489374, 'samples': 2119104, 'steps': 11036, 'loss/train': 1.7933595180511475} 11/06/2021 22:46:42 - INFO - __main__ - Step 11038: {'lr': 0.0004954143308528631, 'samples': 2119296, 'steps': 11037, 'loss/train': 1.0918980836868286} 11/06/2021 22:46:42 - INFO - __main__ - Step 11039: {'lr': 0.000495413319046209, 'samples': 2119488, 'steps': 11038, 'loss/train': 2.03767728805542} 11/06/2021 22:46:42 - INFO - __main__ - Step 11040: {'lr': 0.0004954123071289754, 'samples': 2119680, 'steps': 11039, 'loss/train': 1.9475619792938232} 11/06/2021 22:46:43 - INFO - __main__ - Step 11041: {'lr': 0.0004954112951011628, 'samples': 2119872, 'steps': 11040, 'loss/train': 1.5636467933654785} 11/06/2021 22:46:44 - INFO - __main__ - Step 11042: {'lr': 0.0004954102829627717, 'samples': 2120064, 'steps': 11041, 'loss/train': 2.0013155937194824} 11/06/2021 22:46:44 - INFO - __main__ - Step 11043: {'lr': 0.0004954092707138024, 'samples': 2120256, 'steps': 11042, 'loss/train': 1.427595853805542} 11/06/2021 22:46:44 - INFO - __main__ - Step 11044: {'lr': 0.0004954082583542557, 'samples': 2120448, 'steps': 11043, 'loss/train': 1.2948354482650757} 11/06/2021 22:46:45 - INFO - __main__ - Step 11045: {'lr': 0.0004954072458841315, 'samples': 2120640, 'steps': 11044, 'loss/train': 1.7006868124008179} 11/06/2021 22:46:46 - INFO - __main__ - Step 11046: {'lr': 0.0004954062333034308, 'samples': 2120832, 'steps': 11045, 'loss/train': 1.9174220561981201} 11/06/2021 22:46:46 - INFO - __main__ - Step 11047: {'lr': 0.0004954052206121538, 'samples': 2121024, 'steps': 11046, 'loss/train': 2.2105274200439453} 11/06/2021 22:46:47 - INFO - __main__ - Step 11048: {'lr': 0.000495404207810301, 'samples': 2121216, 'steps': 11047, 'loss/train': 1.7449531555175781} 11/06/2021 22:46:47 - INFO - __main__ - Step 11049: {'lr': 0.0004954031948978729, 'samples': 2121408, 'steps': 11048, 'loss/train': 1.3999689817428589} 11/06/2021 22:46:47 - INFO - __main__ - Step 11050: {'lr': 0.0004954021818748698, 'samples': 2121600, 'steps': 11049, 'loss/train': 2.2191524505615234} 11/06/2021 22:46:48 - INFO - __main__ - Step 11051: {'lr': 0.0004954011687412923, 'samples': 2121792, 'steps': 11050, 'loss/train': 1.5955177545547485} 11/06/2021 22:46:49 - INFO - __main__ - Step 11052: {'lr': 0.0004954001554971409, 'samples': 2121984, 'steps': 11051, 'loss/train': 1.4700356721878052} 11/06/2021 22:46:49 - INFO - __main__ - Step 11053: {'lr': 0.0004953991421424159, 'samples': 2122176, 'steps': 11052, 'loss/train': 1.7988489866256714} 11/06/2021 22:46:49 - INFO - __main__ - Step 11054: {'lr': 0.0004953981286771178, 'samples': 2122368, 'steps': 11053, 'loss/train': 0.8524397015571594} 11/06/2021 22:46:50 - INFO - __main__ - Step 11055: {'lr': 0.0004953971151012471, 'samples': 2122560, 'steps': 11054, 'loss/train': 1.956466794013977} 11/06/2021 22:46:51 - INFO - __main__ - Step 11056: {'lr': 0.0004953961014148043, 'samples': 2122752, 'steps': 11055, 'loss/train': 1.7897028923034668} 11/06/2021 22:46:51 - INFO - __main__ - Step 11057: {'lr': 0.0004953950876177897, 'samples': 2122944, 'steps': 11056, 'loss/train': 2.5609962940216064} 11/06/2021 22:46:51 - INFO - __main__ - Step 11058: {'lr': 0.000495394073710204, 'samples': 2123136, 'steps': 11057, 'loss/train': 1.6628973484039307} 11/06/2021 22:46:52 - INFO - __main__ - Step 11059: {'lr': 0.0004953930596920474, 'samples': 2123328, 'steps': 11058, 'loss/train': 0.9463735222816467} 11/06/2021 22:46:52 - INFO - __main__ - Step 11060: {'lr': 0.0004953920455633206, 'samples': 2123520, 'steps': 11059, 'loss/train': 1.7199558019638062} 11/06/2021 22:46:53 - INFO - __main__ - Step 11061: {'lr': 0.0004953910313240239, 'samples': 2123712, 'steps': 11060, 'loss/train': 2.0685226917266846} 11/06/2021 22:46:53 - INFO - __main__ - Step 11062: {'lr': 0.0004953900169741577, 'samples': 2123904, 'steps': 11061, 'loss/train': 1.852885127067566} 11/06/2021 22:46:54 - INFO - __main__ - Step 11063: {'lr': 0.0004953890025137226, 'samples': 2124096, 'steps': 11062, 'loss/train': 1.0211209058761597} 11/06/2021 22:46:54 - INFO - __main__ - Step 11064: {'lr': 0.000495387987942719, 'samples': 2124288, 'steps': 11063, 'loss/train': 1.8901044130325317} 11/06/2021 22:46:55 - INFO - __main__ - Step 11065: {'lr': 0.0004953869732611474, 'samples': 2124480, 'steps': 11064, 'loss/train': 1.3837004899978638} 11/06/2021 22:46:56 - INFO - __main__ - Step 11066: {'lr': 0.0004953859584690081, 'samples': 2124672, 'steps': 11065, 'loss/train': 1.3166968822479248} 11/06/2021 22:46:56 - INFO - __main__ - Step 11067: {'lr': 0.0004953849435663018, 'samples': 2124864, 'steps': 11066, 'loss/train': 1.1097654104232788} 11/06/2021 22:46:56 - INFO - __main__ - Step 11068: {'lr': 0.0004953839285530287, 'samples': 2125056, 'steps': 11067, 'loss/train': 1.8698753118515015} 11/06/2021 22:46:57 - INFO - __main__ - Step 11069: {'lr': 0.0004953829134291895, 'samples': 2125248, 'steps': 11068, 'loss/train': 2.0392954349517822} 11/06/2021 22:46:57 - INFO - __main__ - Step 11070: {'lr': 0.0004953818981947845, 'samples': 2125440, 'steps': 11069, 'loss/train': 1.821742057800293} 11/06/2021 22:46:58 - INFO - __main__ - Step 11071: {'lr': 0.0004953808828498142, 'samples': 2125632, 'steps': 11070, 'loss/train': 1.4613873958587646} 11/06/2021 22:46:58 - INFO - __main__ - Step 11072: {'lr': 0.0004953798673942791, 'samples': 2125824, 'steps': 11071, 'loss/train': 1.947845220565796} 11/06/2021 22:46:59 - INFO - __main__ - Step 11073: {'lr': 0.0004953788518281796, 'samples': 2126016, 'steps': 11072, 'loss/train': 2.3288230895996094} 11/06/2021 22:46:59 - INFO - __main__ - Step 11074: {'lr': 0.0004953778361515163, 'samples': 2126208, 'steps': 11073, 'loss/train': 1.7177420854568481} 11/06/2021 22:46:59 - INFO - __main__ - Step 11075: {'lr': 0.0004953768203642893, 'samples': 2126400, 'steps': 11074, 'loss/train': 1.5442180633544922} 11/06/2021 22:47:00 - INFO - __main__ - Step 11076: {'lr': 0.0004953758044664994, 'samples': 2126592, 'steps': 11075, 'loss/train': 2.2615766525268555} 11/06/2021 22:47:01 - INFO - __main__ - Step 11077: {'lr': 0.0004953747884581469, 'samples': 2126784, 'steps': 11076, 'loss/train': 2.156431198120117} 11/06/2021 22:47:01 - INFO - __main__ - Step 11078: {'lr': 0.0004953737723392324, 'samples': 2126976, 'steps': 11077, 'loss/train': 2.2842390537261963} 11/06/2021 22:47:01 - INFO - __main__ - Step 11079: {'lr': 0.0004953727561097562, 'samples': 2127168, 'steps': 11078, 'loss/train': 1.461220383644104} 11/06/2021 22:47:02 - INFO - __main__ - Step 11080: {'lr': 0.0004953717397697189, 'samples': 2127360, 'steps': 11079, 'loss/train': 1.6877996921539307} 11/06/2021 22:47:02 - INFO - __main__ - Step 11081: {'lr': 0.0004953707233191207, 'samples': 2127552, 'steps': 11080, 'loss/train': 1.1885510683059692} 11/06/2021 22:47:03 - INFO - __main__ - Step 11082: {'lr': 0.0004953697067579624, 'samples': 2127744, 'steps': 11081, 'loss/train': 1.5058166980743408} 11/06/2021 22:47:03 - INFO - __main__ - Step 11083: {'lr': 0.0004953686900862442, 'samples': 2127936, 'steps': 11082, 'loss/train': 1.6675527095794678} 11/06/2021 22:47:04 - INFO - __main__ - Step 11084: {'lr': 0.0004953676733039668, 'samples': 2128128, 'steps': 11083, 'loss/train': 1.9385559558868408} 11/06/2021 22:47:04 - INFO - __main__ - Step 11085: {'lr': 0.0004953666564111303, 'samples': 2128320, 'steps': 11084, 'loss/train': 1.6214722394943237} 11/06/2021 22:47:05 - INFO - __main__ - Step 11086: {'lr': 0.0004953656394077355, 'samples': 2128512, 'steps': 11085, 'loss/train': 1.1860450506210327} 11/06/2021 22:47:06 - INFO - __main__ - Step 11087: {'lr': 0.0004953646222937828, 'samples': 2128704, 'steps': 11086, 'loss/train': 1.8663421869277954} 11/06/2021 22:47:06 - INFO - __main__ - Step 11088: {'lr': 0.0004953636050692724, 'samples': 2128896, 'steps': 11087, 'loss/train': 1.5686103105545044} 11/06/2021 22:47:06 - INFO - __main__ - Step 11089: {'lr': 0.0004953625877342051, 'samples': 2129088, 'steps': 11088, 'loss/train': 1.8276753425598145} 11/06/2021 22:47:07 - INFO - __main__ - Step 11090: {'lr': 0.0004953615702885812, 'samples': 2129280, 'steps': 11089, 'loss/train': 1.5646089315414429} 11/06/2021 22:47:07 - INFO - __main__ - Step 11091: {'lr': 0.0004953605527324011, 'samples': 2129472, 'steps': 11090, 'loss/train': 1.7502714395523071} 11/06/2021 22:47:08 - INFO - __main__ - Step 11092: {'lr': 0.0004953595350656653, 'samples': 2129664, 'steps': 11091, 'loss/train': 1.3079643249511719} 11/06/2021 22:47:08 - INFO - __main__ - Step 11093: {'lr': 0.0004953585172883743, 'samples': 2129856, 'steps': 11092, 'loss/train': 1.8320194482803345} 11/06/2021 22:47:09 - INFO - __main__ - Step 11094: {'lr': 0.0004953574994005286, 'samples': 2130048, 'steps': 11093, 'loss/train': 1.7098597288131714} 11/06/2021 22:47:09 - INFO - __main__ - Step 11095: {'lr': 0.0004953564814021285, 'samples': 2130240, 'steps': 11094, 'loss/train': 1.9974864721298218} 11/06/2021 22:47:09 - INFO - __main__ - Step 11096: {'lr': 0.0004953554632931746, 'samples': 2130432, 'steps': 11095, 'loss/train': 1.8548189401626587} 11/06/2021 22:47:10 - INFO - __main__ - Step 11097: {'lr': 0.0004953544450736674, 'samples': 2130624, 'steps': 11096, 'loss/train': 1.8645987510681152} 11/06/2021 22:47:11 - INFO - __main__ - Step 11098: {'lr': 0.0004953534267436072, 'samples': 2130816, 'steps': 11097, 'loss/train': 2.160127878189087} 11/06/2021 22:47:11 - INFO - __main__ - Step 11099: {'lr': 0.0004953524083029945, 'samples': 2131008, 'steps': 11098, 'loss/train': 2.143897533416748} 11/06/2021 22:47:11 - INFO - __main__ - Step 11100: {'lr': 0.0004953513897518298, 'samples': 2131200, 'steps': 11099, 'loss/train': 1.7265970706939697} 11/06/2021 22:47:12 - INFO - __main__ - Step 11101: {'lr': 0.0004953503710901136, 'samples': 2131392, 'steps': 11100, 'loss/train': 1.6248968839645386} 11/06/2021 22:47:13 - INFO - __main__ - Step 11102: {'lr': 0.0004953493523178463, 'samples': 2131584, 'steps': 11101, 'loss/train': 1.7727121114730835} 11/06/2021 22:47:13 - INFO - __main__ - Step 11103: {'lr': 0.0004953483334350283, 'samples': 2131776, 'steps': 11102, 'loss/train': 1.5690078735351562} 11/06/2021 22:47:13 - INFO - __main__ - Step 11104: {'lr': 0.0004953473144416602, 'samples': 2131968, 'steps': 11103, 'loss/train': 1.7810686826705933} 11/06/2021 22:47:14 - INFO - __main__ - Step 11105: {'lr': 0.0004953462953377424, 'samples': 2132160, 'steps': 11104, 'loss/train': 1.8869816064834595} 11/06/2021 22:47:14 - INFO - __main__ - Step 11106: {'lr': 0.0004953452761232753, 'samples': 2132352, 'steps': 11105, 'loss/train': 2.1093719005584717} 11/06/2021 22:47:15 - INFO - __main__ - Step 11107: {'lr': 0.0004953442567982593, 'samples': 2132544, 'steps': 11106, 'loss/train': 1.606833815574646} 11/06/2021 22:47:15 - INFO - __main__ - Step 11108: {'lr': 0.0004953432373626951, 'samples': 2132736, 'steps': 11107, 'loss/train': 1.9414929151535034} 11/06/2021 22:47:16 - INFO - __main__ - Step 11109: {'lr': 0.0004953422178165831, 'samples': 2132928, 'steps': 11108, 'loss/train': 1.8882904052734375} 11/06/2021 22:47:16 - INFO - __main__ - Step 11110: {'lr': 0.0004953411981599235, 'samples': 2133120, 'steps': 11109, 'loss/train': 1.7762526273727417} 11/06/2021 22:47:17 - INFO - __main__ - Step 11111: {'lr': 0.0004953401783927171, 'samples': 2133312, 'steps': 11110, 'loss/train': 1.9979164600372314} 11/06/2021 22:47:17 - INFO - __main__ - Step 11112: {'lr': 0.000495339158514964, 'samples': 2133504, 'steps': 11111, 'loss/train': 1.818439245223999} 11/06/2021 22:47:18 - INFO - __main__ - Step 11113: {'lr': 0.0004953381385266651, 'samples': 2133696, 'steps': 11112, 'loss/train': 1.8462194204330444} 11/06/2021 22:47:18 - INFO - __main__ - Step 11114: {'lr': 0.0004953371184278205, 'samples': 2133888, 'steps': 11113, 'loss/train': 1.7984886169433594} 11/06/2021 22:47:19 - INFO - __main__ - Step 11115: {'lr': 0.0004953360982184308, 'samples': 2134080, 'steps': 11114, 'loss/train': 2.2713496685028076} 11/06/2021 22:47:19 - INFO - __main__ - Step 11116: {'lr': 0.0004953350778984963, 'samples': 2134272, 'steps': 11115, 'loss/train': 1.8427363634109497} 11/06/2021 22:47:19 - INFO - __main__ - Step 11117: {'lr': 0.0004953340574680177, 'samples': 2134464, 'steps': 11116, 'loss/train': 1.2288328409194946} 11/06/2021 22:47:20 - INFO - __main__ - Step 11118: {'lr': 0.0004953330369269955, 'samples': 2134656, 'steps': 11117, 'loss/train': 1.9006093740463257} 11/06/2021 22:47:21 - INFO - __main__ - Step 11119: {'lr': 0.0004953320162754298, 'samples': 2134848, 'steps': 11118, 'loss/train': 1.618245244026184} 11/06/2021 22:47:21 - INFO - __main__ - Step 11120: {'lr': 0.0004953309955133214, 'samples': 2135040, 'steps': 11119, 'loss/train': 1.8152966499328613} 11/06/2021 22:47:22 - INFO - __main__ - Step 11121: {'lr': 0.0004953299746406707, 'samples': 2135232, 'steps': 11120, 'loss/train': 1.3982417583465576} 11/06/2021 22:47:22 - INFO - __main__ - Step 11122: {'lr': 0.000495328953657478, 'samples': 2135424, 'steps': 11121, 'loss/train': 1.8065004348754883} 11/06/2021 22:47:23 - INFO - __main__ - Step 11123: {'lr': 0.0004953279325637438, 'samples': 2135616, 'steps': 11122, 'loss/train': 1.807936191558838} 11/06/2021 22:47:23 - INFO - __main__ - Step 11124: {'lr': 0.0004953269113594687, 'samples': 2135808, 'steps': 11123, 'loss/train': 1.684847116470337} 11/06/2021 22:47:24 - INFO - __main__ - Step 11125: {'lr': 0.0004953258900446531, 'samples': 2136000, 'steps': 11124, 'loss/train': 1.3972996473312378} 11/06/2021 22:47:24 - INFO - __main__ - Step 11126: {'lr': 0.0004953248686192975, 'samples': 2136192, 'steps': 11125, 'loss/train': 1.6219148635864258} 11/06/2021 22:47:24 - INFO - __main__ - Step 11127: {'lr': 0.0004953238470834022, 'samples': 2136384, 'steps': 11126, 'loss/train': 0.5669246912002563} 11/06/2021 22:47:25 - INFO - __main__ - Step 11128: {'lr': 0.0004953228254369677, 'samples': 2136576, 'steps': 11127, 'loss/train': 1.5386314392089844} 11/06/2021 22:47:26 - INFO - __main__ - Step 11129: {'lr': 0.0004953218036799946, 'samples': 2136768, 'steps': 11128, 'loss/train': 1.7558674812316895} 11/06/2021 22:47:26 - INFO - __main__ - Step 11130: {'lr': 0.0004953207818124833, 'samples': 2136960, 'steps': 11129, 'loss/train': 1.2307705879211426} 11/06/2021 22:47:26 - INFO - __main__ - Step 11131: {'lr': 0.0004953197598344342, 'samples': 2137152, 'steps': 11130, 'loss/train': 2.116920232772827} 11/06/2021 22:47:27 - INFO - __main__ - Step 11132: {'lr': 0.0004953187377458478, 'samples': 2137344, 'steps': 11131, 'loss/train': 2.0362181663513184} 11/06/2021 22:47:28 - INFO - __main__ - Step 11133: {'lr': 0.0004953177155467246, 'samples': 2137536, 'steps': 11132, 'loss/train': 1.7032719850540161} 11/06/2021 22:47:28 - INFO - __main__ - Step 11134: {'lr': 0.0004953166932370651, 'samples': 2137728, 'steps': 11133, 'loss/train': 1.8138678073883057} 11/06/2021 22:47:28 - INFO - __main__ - Step 11135: {'lr': 0.0004953156708168695, 'samples': 2137920, 'steps': 11134, 'loss/train': 1.8151832818984985} 11/06/2021 22:47:29 - INFO - __main__ - Step 11136: {'lr': 0.0004953146482861385, 'samples': 2138112, 'steps': 11135, 'loss/train': 1.7565672397613525} 11/06/2021 22:47:29 - INFO - __main__ - Step 11137: {'lr': 0.0004953136256448725, 'samples': 2138304, 'steps': 11136, 'loss/train': 1.687888741493225} 11/06/2021 22:47:29 - INFO - __main__ - Step 11138: {'lr': 0.0004953126028930721, 'samples': 2138496, 'steps': 11137, 'loss/train': 1.2880886793136597} 11/06/2021 22:47:31 - INFO - __main__ - Step 11139: {'lr': 0.0004953115800307375, 'samples': 2138688, 'steps': 11138, 'loss/train': 1.8301455974578857} 11/06/2021 22:47:31 - INFO - __main__ - Step 11140: {'lr': 0.0004953105570578693, 'samples': 2138880, 'steps': 11139, 'loss/train': 1.5267943143844604} 11/06/2021 22:47:31 - INFO - __main__ - Step 11141: {'lr': 0.000495309533974468, 'samples': 2139072, 'steps': 11140, 'loss/train': 1.7597218751907349} 11/06/2021 22:47:32 - INFO - __main__ - Step 11142: {'lr': 0.0004953085107805339, 'samples': 2139264, 'steps': 11141, 'loss/train': 0.6988817453384399} 11/06/2021 22:47:32 - INFO - __main__ - Step 11143: {'lr': 0.0004953074874760677, 'samples': 2139456, 'steps': 11142, 'loss/train': 1.4152743816375732} 11/06/2021 22:47:33 - INFO - __main__ - Step 11144: {'lr': 0.0004953064640610697, 'samples': 2139648, 'steps': 11143, 'loss/train': 1.5432703495025635} 11/06/2021 22:47:33 - INFO - __main__ - Step 11145: {'lr': 0.0004953054405355404, 'samples': 2139840, 'steps': 11144, 'loss/train': 2.103456735610962} 11/06/2021 22:47:34 - INFO - __main__ - Step 11146: {'lr': 0.0004953044168994802, 'samples': 2140032, 'steps': 11145, 'loss/train': 1.8357820510864258} 11/06/2021 22:47:34 - INFO - __main__ - Step 11147: {'lr': 0.0004953033931528897, 'samples': 2140224, 'steps': 11146, 'loss/train': 1.426553726196289} 11/06/2021 22:47:34 - INFO - __main__ - Step 11148: {'lr': 0.0004953023692957691, 'samples': 2140416, 'steps': 11147, 'loss/train': 0.9832262992858887} 11/06/2021 22:47:35 - INFO - __main__ - Step 11149: {'lr': 0.0004953013453281193, 'samples': 2140608, 'steps': 11148, 'loss/train': 2.1371872425079346} 11/06/2021 22:47:36 - INFO - __main__ - Step 11150: {'lr': 0.0004953003212499403, 'samples': 2140800, 'steps': 11149, 'loss/train': 2.0583853721618652} 11/06/2021 22:47:36 - INFO - __main__ - Step 11151: {'lr': 0.0004952992970612328, 'samples': 2140992, 'steps': 11150, 'loss/train': 1.6691161394119263} 11/06/2021 22:47:36 - INFO - __main__ - Step 11152: {'lr': 0.0004952982727619973, 'samples': 2141184, 'steps': 11151, 'loss/train': 1.672674298286438} 11/06/2021 22:47:37 - INFO - __main__ - Step 11153: {'lr': 0.000495297248352234, 'samples': 2141376, 'steps': 11152, 'loss/train': 0.8161516785621643} 11/06/2021 22:47:38 - INFO - __main__ - Step 11154: {'lr': 0.0004952962238319436, 'samples': 2141568, 'steps': 11153, 'loss/train': 1.0644028186798096} 11/06/2021 22:47:38 - INFO - __main__ - Step 11155: {'lr': 0.0004952951992011266, 'samples': 2141760, 'steps': 11154, 'loss/train': 1.5536161661148071} 11/06/2021 22:47:39 - INFO - __main__ - Step 11156: {'lr': 0.0004952941744597834, 'samples': 2141952, 'steps': 11155, 'loss/train': 1.6991686820983887} 11/06/2021 22:47:39 - INFO - __main__ - Step 11157: {'lr': 0.0004952931496079143, 'samples': 2142144, 'steps': 11156, 'loss/train': 1.9701721668243408} 11/06/2021 22:47:39 - INFO - __main__ - Step 11158: {'lr': 0.00049529212464552, 'samples': 2142336, 'steps': 11157, 'loss/train': 1.5034873485565186} 11/06/2021 22:47:40 - INFO - __main__ - Step 11159: {'lr': 0.0004952910995726008, 'samples': 2142528, 'steps': 11158, 'loss/train': 1.8932499885559082} 11/06/2021 22:47:41 - INFO - __main__ - Step 11160: {'lr': 0.0004952900743891573, 'samples': 2142720, 'steps': 11159, 'loss/train': 1.5492379665374756} 11/06/2021 22:47:41 - INFO - __main__ - Step 11161: {'lr': 0.0004952890490951898, 'samples': 2142912, 'steps': 11160, 'loss/train': 1.9090741872787476} 11/06/2021 22:47:41 - INFO - __main__ - Step 11162: {'lr': 0.0004952880236906988, 'samples': 2143104, 'steps': 11161, 'loss/train': 1.2465680837631226} 11/06/2021 22:47:42 - INFO - __main__ - Step 11163: {'lr': 0.0004952869981756848, 'samples': 2143296, 'steps': 11162, 'loss/train': 1.629738450050354} 11/06/2021 22:47:42 - INFO - __main__ - Step 11164: {'lr': 0.0004952859725501484, 'samples': 2143488, 'steps': 11163, 'loss/train': 1.934152364730835} 11/06/2021 22:47:43 - INFO - __main__ - Step 11165: {'lr': 0.0004952849468140898, 'samples': 2143680, 'steps': 11164, 'loss/train': 1.5853830575942993} 11/06/2021 22:47:43 - INFO - __main__ - Step 11166: {'lr': 0.0004952839209675096, 'samples': 2143872, 'steps': 11165, 'loss/train': 1.383769154548645} 11/06/2021 22:47:44 - INFO - __main__ - Step 11167: {'lr': 0.0004952828950104083, 'samples': 2144064, 'steps': 11166, 'loss/train': 1.3340117931365967} 11/06/2021 22:47:44 - INFO - __main__ - Step 11168: {'lr': 0.0004952818689427863, 'samples': 2144256, 'steps': 11167, 'loss/train': 1.787481665611267} 11/06/2021 22:47:44 - INFO - __main__ - Step 11169: {'lr': 0.0004952808427646441, 'samples': 2144448, 'steps': 11168, 'loss/train': 1.6143748760223389} 11/06/2021 22:47:45 - INFO - __main__ - Step 11170: {'lr': 0.000495279816475982, 'samples': 2144640, 'steps': 11169, 'loss/train': 1.741720199584961} 11/06/2021 22:47:46 - INFO - __main__ - Step 11171: {'lr': 0.0004952787900768008, 'samples': 2144832, 'steps': 11170, 'loss/train': 1.8251887559890747} 11/06/2021 22:47:46 - INFO - __main__ - Step 11172: {'lr': 0.0004952777635671006, 'samples': 2145024, 'steps': 11171, 'loss/train': 2.326918125152588} 11/06/2021 22:47:46 - INFO - __main__ - Step 11173: {'lr': 0.0004952767369468821, 'samples': 2145216, 'steps': 11172, 'loss/train': 1.6599018573760986} 11/06/2021 22:47:47 - INFO - __main__ - Step 11174: {'lr': 0.0004952757102161457, 'samples': 2145408, 'steps': 11173, 'loss/train': 1.3439342975616455} 11/06/2021 22:47:48 - INFO - __main__ - Step 11175: {'lr': 0.0004952746833748918, 'samples': 2145600, 'steps': 11174, 'loss/train': 1.5779838562011719} 11/06/2021 22:47:48 - INFO - __main__ - Step 11176: {'lr': 0.0004952736564231209, 'samples': 2145792, 'steps': 11175, 'loss/train': 1.711625099182129} 11/06/2021 22:47:49 - INFO - __main__ - Step 11177: {'lr': 0.0004952726293608335, 'samples': 2145984, 'steps': 11176, 'loss/train': 1.7261056900024414} 11/06/2021 22:47:49 - INFO - __main__ - Step 11178: {'lr': 0.0004952716021880301, 'samples': 2146176, 'steps': 11177, 'loss/train': 2.207341194152832} 11/06/2021 22:47:49 - INFO - __main__ - Step 11179: {'lr': 0.0004952705749047111, 'samples': 2146368, 'steps': 11178, 'loss/train': 1.6642247438430786} 11/06/2021 22:47:50 - INFO - __main__ - Step 11180: {'lr': 0.0004952695475108768, 'samples': 2146560, 'steps': 11179, 'loss/train': 1.309320092201233} 11/06/2021 22:47:51 - INFO - __main__ - Step 11181: {'lr': 0.000495268520006528, 'samples': 2146752, 'steps': 11180, 'loss/train': 1.2601646184921265} 11/06/2021 22:47:51 - INFO - __main__ - Step 11182: {'lr': 0.000495267492391665, 'samples': 2146944, 'steps': 11181, 'loss/train': 1.730563759803772} 11/06/2021 22:47:52 - INFO - __main__ - Step 11183: {'lr': 0.0004952664646662882, 'samples': 2147136, 'steps': 11182, 'loss/train': 1.8496640920639038} 11/06/2021 22:47:52 - INFO - __main__ - Step 11184: {'lr': 0.000495265436830398, 'samples': 2147328, 'steps': 11183, 'loss/train': 1.282710075378418} 11/06/2021 22:47:52 - INFO - __main__ - Step 11185: {'lr': 0.0004952644088839951, 'samples': 2147520, 'steps': 11184, 'loss/train': 0.9519102573394775} 11/06/2021 22:47:54 - INFO - __main__ - Step 11186: {'lr': 0.0004952633808270797, 'samples': 2147712, 'steps': 11185, 'loss/train': 2.1716973781585693} 11/06/2021 22:47:54 - INFO - __main__ - Step 11187: {'lr': 0.0004952623526596526, 'samples': 2147904, 'steps': 11186, 'loss/train': 1.5648038387298584} 11/06/2021 22:47:54 - INFO - __main__ - Step 11188: {'lr': 0.000495261324381714, 'samples': 2148096, 'steps': 11187, 'loss/train': 1.8554600477218628} 11/06/2021 22:47:55 - INFO - __main__ - Step 11189: {'lr': 0.0004952602959932644, 'samples': 2148288, 'steps': 11188, 'loss/train': 0.2998165488243103} 11/06/2021 22:47:55 - INFO - __main__ - Step 11190: {'lr': 0.0004952592674943043, 'samples': 2148480, 'steps': 11189, 'loss/train': 1.513887882232666} 11/06/2021 22:47:56 - INFO - __main__ - Step 11191: {'lr': 0.0004952582388848343, 'samples': 2148672, 'steps': 11190, 'loss/train': 1.61158287525177} 11/06/2021 22:47:56 - INFO - __main__ - Step 11192: {'lr': 0.0004952572101648545, 'samples': 2148864, 'steps': 11191, 'loss/train': 1.4567087888717651} 11/06/2021 22:47:57 - INFO - __main__ - Step 11193: {'lr': 0.0004952561813343657, 'samples': 2149056, 'steps': 11192, 'loss/train': 1.598156452178955} 11/06/2021 22:47:57 - INFO - __main__ - Step 11194: {'lr': 0.0004952551523933682, 'samples': 2149248, 'steps': 11193, 'loss/train': 2.243593692779541} 11/06/2021 22:47:58 - INFO - __main__ - Step 11195: {'lr': 0.0004952541233418626, 'samples': 2149440, 'steps': 11194, 'loss/train': 1.6861417293548584} 11/06/2021 22:47:59 - INFO - __main__ - Step 11196: {'lr': 0.0004952530941798492, 'samples': 2149632, 'steps': 11195, 'loss/train': 1.3837568759918213} 11/06/2021 22:47:59 - INFO - __main__ - Step 11197: {'lr': 0.0004952520649073286, 'samples': 2149824, 'steps': 11196, 'loss/train': 1.8483117818832397} 11/06/2021 22:47:59 - INFO - __main__ - Step 11198: {'lr': 0.0004952510355243012, 'samples': 2150016, 'steps': 11197, 'loss/train': 1.94416081905365} 11/06/2021 22:48:00 - INFO - __main__ - Step 11199: {'lr': 0.0004952500060307674, 'samples': 2150208, 'steps': 11198, 'loss/train': 1.9236924648284912} 11/06/2021 22:48:00 - INFO - __main__ - Step 11200: {'lr': 0.0004952489764267278, 'samples': 2150400, 'steps': 11199, 'loss/train': 2.271418333053589} 11/06/2021 22:48:01 - INFO - __main__ - Step 11201: {'lr': 0.0004952479467121827, 'samples': 2150592, 'steps': 11200, 'loss/train': 1.6901092529296875} 11/06/2021 22:48:01 - INFO - __main__ - Step 11202: {'lr': 0.0004952469168871327, 'samples': 2150784, 'steps': 11201, 'loss/train': 1.9666694402694702} 11/06/2021 22:48:02 - INFO - __main__ - Step 11203: {'lr': 0.0004952458869515782, 'samples': 2150976, 'steps': 11202, 'loss/train': 1.9771145582199097} 11/06/2021 22:48:02 - INFO - __main__ - Step 11204: {'lr': 0.0004952448569055198, 'samples': 2151168, 'steps': 11203, 'loss/train': 1.4397884607315063} 11/06/2021 22:48:02 - INFO - __main__ - Step 11205: {'lr': 0.0004952438267489578, 'samples': 2151360, 'steps': 11204, 'loss/train': 1.73050856590271} 11/06/2021 22:48:03 - INFO - __main__ - Step 11206: {'lr': 0.0004952427964818927, 'samples': 2151552, 'steps': 11205, 'loss/train': 1.7064839601516724} 11/06/2021 22:48:04 - INFO - __main__ - Step 11207: {'lr': 0.0004952417661043249, 'samples': 2151744, 'steps': 11206, 'loss/train': 1.7774326801300049} 11/06/2021 22:48:04 - INFO - __main__ - Step 11208: {'lr': 0.0004952407356162551, 'samples': 2151936, 'steps': 11207, 'loss/train': 1.5006650686264038} 11/06/2021 22:48:04 - INFO - __main__ - Step 11209: {'lr': 0.0004952397050176835, 'samples': 2152128, 'steps': 11208, 'loss/train': 1.9831914901733398} 11/06/2021 22:48:05 - INFO - __main__ - Step 11210: {'lr': 0.0004952386743086107, 'samples': 2152320, 'steps': 11209, 'loss/train': 1.5023480653762817} 11/06/2021 22:48:05 - INFO - __main__ - Step 11211: {'lr': 0.0004952376434890372, 'samples': 2152512, 'steps': 11210, 'loss/train': 1.4154393672943115} 11/06/2021 22:48:06 - INFO - __main__ - Step 11212: {'lr': 0.0004952366125589633, 'samples': 2152704, 'steps': 11211, 'loss/train': 1.498394250869751} 11/06/2021 22:48:07 - INFO - __main__ - Step 11213: {'lr': 0.0004952355815183897, 'samples': 2152896, 'steps': 11212, 'loss/train': 1.85457444190979} 11/06/2021 22:48:07 - INFO - __main__ - Step 11214: {'lr': 0.0004952345503673166, 'samples': 2153088, 'steps': 11213, 'loss/train': 1.908733606338501} 11/06/2021 22:48:07 - INFO - __main__ - Step 11215: {'lr': 0.0004952335191057447, 'samples': 2153280, 'steps': 11214, 'loss/train': 1.707075834274292} 11/06/2021 22:48:08 - INFO - __main__ - Step 11216: {'lr': 0.0004952324877336743, 'samples': 2153472, 'steps': 11215, 'loss/train': 1.5228673219680786} 11/06/2021 22:48:09 - INFO - __main__ - Step 11217: {'lr': 0.0004952314562511059, 'samples': 2153664, 'steps': 11216, 'loss/train': 2.17525053024292} 11/06/2021 22:48:09 - INFO - __main__ - Step 11218: {'lr': 0.00049523042465804, 'samples': 2153856, 'steps': 11217, 'loss/train': 1.8961803913116455} 11/06/2021 22:48:09 - INFO - __main__ - Step 11219: {'lr': 0.0004952293929544771, 'samples': 2154048, 'steps': 11218, 'loss/train': 1.9099823236465454} 11/06/2021 22:48:10 - INFO - __main__ - Step 11220: {'lr': 0.0004952283611404176, 'samples': 2154240, 'steps': 11219, 'loss/train': 1.5863221883773804} 11/06/2021 22:48:10 - INFO - __main__ - Step 11221: {'lr': 0.0004952273292158619, 'samples': 2154432, 'steps': 11220, 'loss/train': 1.5212019681930542} 11/06/2021 22:48:11 - INFO - __main__ - Step 11222: {'lr': 0.0004952262971808106, 'samples': 2154624, 'steps': 11221, 'loss/train': 1.0951485633850098} 11/06/2021 22:48:11 - INFO - __main__ - Step 11223: {'lr': 0.0004952252650352642, 'samples': 2154816, 'steps': 11222, 'loss/train': 1.8635188341140747} 11/06/2021 22:48:12 - INFO - __main__ - Step 11224: {'lr': 0.000495224232779223, 'samples': 2155008, 'steps': 11223, 'loss/train': 1.7595049142837524} 11/06/2021 22:48:12 - INFO - __main__ - Step 11225: {'lr': 0.0004952232004126876, 'samples': 2155200, 'steps': 11224, 'loss/train': 1.5507785081863403} 11/06/2021 22:48:12 - INFO - __main__ - Step 11226: {'lr': 0.0004952221679356583, 'samples': 2155392, 'steps': 11225, 'loss/train': 1.63408625125885} 11/06/2021 22:48:13 - INFO - __main__ - Step 11227: {'lr': 0.0004952211353481358, 'samples': 2155584, 'steps': 11226, 'loss/train': 1.6727948188781738} 11/06/2021 22:48:14 - INFO - __main__ - Step 11228: {'lr': 0.0004952201026501204, 'samples': 2155776, 'steps': 11227, 'loss/train': 1.2963249683380127} 11/06/2021 22:48:14 - INFO - __main__ - Step 11229: {'lr': 0.0004952190698416126, 'samples': 2155968, 'steps': 11228, 'loss/train': 1.5582791566848755} 11/06/2021 22:48:14 - INFO - __main__ - Step 11230: {'lr': 0.0004952180369226129, 'samples': 2156160, 'steps': 11229, 'loss/train': 2.1736087799072266} 11/06/2021 22:48:15 - INFO - __main__ - Step 11231: {'lr': 0.0004952170038931217, 'samples': 2156352, 'steps': 11230, 'loss/train': 1.376936912536621} 11/06/2021 22:48:15 - INFO - __main__ - Step 11232: {'lr': 0.0004952159707531395, 'samples': 2156544, 'steps': 11231, 'loss/train': 1.1369280815124512} 11/06/2021 22:48:17 - INFO - __main__ - Step 11233: {'lr': 0.0004952149375026668, 'samples': 2156736, 'steps': 11232, 'loss/train': 1.7504469156265259} 11/06/2021 22:48:17 - INFO - __main__ - Step 11234: {'lr': 0.000495213904141704, 'samples': 2156928, 'steps': 11233, 'loss/train': 1.4631931781768799} 11/06/2021 22:48:17 - INFO - __main__ - Step 11235: {'lr': 0.0004952128706702516, 'samples': 2157120, 'steps': 11234, 'loss/train': 2.062126636505127} 11/06/2021 22:48:18 - INFO - __main__ - Step 11236: {'lr': 0.0004952118370883101, 'samples': 2157312, 'steps': 11235, 'loss/train': 2.1615216732025146} 11/06/2021 22:48:18 - INFO - __main__ - Step 11237: {'lr': 0.0004952108033958798, 'samples': 2157504, 'steps': 11236, 'loss/train': 2.1354172229766846} 11/06/2021 22:48:19 - INFO - __main__ - Step 11238: {'lr': 0.0004952097695929614, 'samples': 2157696, 'steps': 11237, 'loss/train': 1.028824806213379} 11/06/2021 22:48:19 - INFO - __main__ - Step 11239: {'lr': 0.0004952087356795553, 'samples': 2157888, 'steps': 11238, 'loss/train': 1.6951218843460083} 11/06/2021 22:48:20 - INFO - __main__ - Step 11240: {'lr': 0.0004952077016556619, 'samples': 2158080, 'steps': 11239, 'loss/train': 1.766661524772644} 11/06/2021 22:48:20 - INFO - __main__ - Step 11241: {'lr': 0.0004952066675212816, 'samples': 2158272, 'steps': 11240, 'loss/train': 1.3343703746795654} 11/06/2021 22:48:20 - INFO - __main__ - Step 11242: {'lr': 0.0004952056332764151, 'samples': 2158464, 'steps': 11241, 'loss/train': 1.303107500076294} 11/06/2021 22:48:21 - INFO - __main__ - Step 11243: {'lr': 0.0004952045989210627, 'samples': 2158656, 'steps': 11242, 'loss/train': 1.767581820487976} 11/06/2021 22:48:22 - INFO - __main__ - Step 11244: {'lr': 0.0004952035644552249, 'samples': 2158848, 'steps': 11243, 'loss/train': 1.4197368621826172} 11/06/2021 22:48:22 - INFO - __main__ - Step 11245: {'lr': 0.000495202529878902, 'samples': 2159040, 'steps': 11244, 'loss/train': 1.4061685800552368} 11/06/2021 22:48:22 - INFO - __main__ - Step 11246: {'lr': 0.0004952014951920948, 'samples': 2159232, 'steps': 11245, 'loss/train': 2.45194673538208} 11/06/2021 22:48:23 - INFO - __main__ - Step 11247: {'lr': 0.0004952004603948034, 'samples': 2159424, 'steps': 11246, 'loss/train': 2.113942861557007} 11/06/2021 22:48:24 - INFO - __main__ - Step 11248: {'lr': 0.0004951994254870286, 'samples': 2159616, 'steps': 11247, 'loss/train': 1.3976362943649292} 11/06/2021 22:48:24 - INFO - __main__ - Step 11249: {'lr': 0.0004951983904687708, 'samples': 2159808, 'steps': 11248, 'loss/train': 1.9836783409118652} 11/06/2021 22:48:24 - INFO - __main__ - Step 11250: {'lr': 0.0004951973553400303, 'samples': 2160000, 'steps': 11249, 'loss/train': 2.1718714237213135} 11/06/2021 22:48:25 - INFO - __main__ - Step 11251: {'lr': 0.0004951963201008077, 'samples': 2160192, 'steps': 11250, 'loss/train': 1.9737859964370728} 11/06/2021 22:48:25 - INFO - __main__ - Step 11252: {'lr': 0.0004951952847511033, 'samples': 2160384, 'steps': 11251, 'loss/train': 1.3120644092559814} 11/06/2021 22:48:25 - INFO - __main__ - Step 11253: {'lr': 0.0004951942492909177, 'samples': 2160576, 'steps': 11252, 'loss/train': 1.7218273878097534} 11/06/2021 22:48:26 - INFO - __main__ - Step 11254: {'lr': 0.0004951932137202515, 'samples': 2160768, 'steps': 11253, 'loss/train': 1.8016955852508545} 11/06/2021 22:48:27 - INFO - __main__ - Step 11255: {'lr': 0.0004951921780391049, 'samples': 2160960, 'steps': 11254, 'loss/train': 1.4004523754119873} 11/06/2021 22:48:27 - INFO - __main__ - Step 11256: {'lr': 0.0004951911422474785, 'samples': 2161152, 'steps': 11255, 'loss/train': 1.1996997594833374} 11/06/2021 22:48:28 - INFO - __main__ - Step 11257: {'lr': 0.0004951901063453728, 'samples': 2161344, 'steps': 11256, 'loss/train': 1.6938501596450806} 11/06/2021 22:48:28 - INFO - __main__ - Step 11258: {'lr': 0.0004951890703327883, 'samples': 2161536, 'steps': 11257, 'loss/train': 1.6818426847457886} 11/06/2021 22:48:29 - INFO - __main__ - Step 11259: {'lr': 0.0004951880342097251, 'samples': 2161728, 'steps': 11258, 'loss/train': 1.2888652086257935} 11/06/2021 22:48:29 - INFO - __main__ - Step 11260: {'lr': 0.0004951869979761842, 'samples': 2161920, 'steps': 11259, 'loss/train': 1.692081332206726} 11/06/2021 22:48:30 - INFO - __main__ - Step 11261: {'lr': 0.0004951859616321658, 'samples': 2162112, 'steps': 11260, 'loss/train': 1.8977314233779907} 11/06/2021 22:48:30 - INFO - __main__ - Step 11262: {'lr': 0.0004951849251776703, 'samples': 2162304, 'steps': 11261, 'loss/train': 3.5356547832489014} 11/06/2021 22:48:30 - INFO - __main__ - Step 11263: {'lr': 0.0004951838886126983, 'samples': 2162496, 'steps': 11262, 'loss/train': 1.3212937116622925} 11/06/2021 22:48:31 - INFO - __main__ - Step 11264: {'lr': 0.0004951828519372503, 'samples': 2162688, 'steps': 11263, 'loss/train': 1.470284104347229} 11/06/2021 22:48:32 - INFO - __main__ - Step 11265: {'lr': 0.0004951818151513267, 'samples': 2162880, 'steps': 11264, 'loss/train': 1.6359577178955078} 11/06/2021 22:48:32 - INFO - __main__ - Step 11266: {'lr': 0.0004951807782549277, 'samples': 2163072, 'steps': 11265, 'loss/train': 1.4494744539260864} 11/06/2021 22:48:32 - INFO - __main__ - Step 11267: {'lr': 0.0004951797412480544, 'samples': 2163264, 'steps': 11266, 'loss/train': 1.7457062005996704} 11/06/2021 22:48:33 - INFO - __main__ - Step 11268: {'lr': 0.0004951787041307066, 'samples': 2163456, 'steps': 11267, 'loss/train': 1.7130874395370483} 11/06/2021 22:48:34 - INFO - __main__ - Step 11269: {'lr': 0.0004951776669028851, 'samples': 2163648, 'steps': 11268, 'loss/train': 1.1853411197662354} 11/06/2021 22:48:34 - INFO - __main__ - Step 11270: {'lr': 0.0004951766295645904, 'samples': 2163840, 'steps': 11269, 'loss/train': 0.6745047569274902} 11/06/2021 22:48:34 - INFO - __main__ - Step 11271: {'lr': 0.000495175592115823, 'samples': 2164032, 'steps': 11270, 'loss/train': 2.009004592895508} 11/06/2021 22:48:35 - INFO - __main__ - Step 11272: {'lr': 0.0004951745545565831, 'samples': 2164224, 'steps': 11271, 'loss/train': 1.5138862133026123} 11/06/2021 22:48:35 - INFO - __main__ - Step 11273: {'lr': 0.0004951735168868713, 'samples': 2164416, 'steps': 11272, 'loss/train': 1.8810991048812866} 11/06/2021 22:48:36 - INFO - __main__ - Step 11274: {'lr': 0.0004951724791066881, 'samples': 2164608, 'steps': 11273, 'loss/train': 1.8236666917800903} 11/06/2021 22:48:37 - INFO - __main__ - Step 11275: {'lr': 0.0004951714412160342, 'samples': 2164800, 'steps': 11274, 'loss/train': 1.84593665599823} 11/06/2021 22:48:37 - INFO - __main__ - Step 11276: {'lr': 0.0004951704032149096, 'samples': 2164992, 'steps': 11275, 'loss/train': 1.9748361110687256} 11/06/2021 22:48:37 - INFO - __main__ - Step 11277: {'lr': 0.000495169365103315, 'samples': 2165184, 'steps': 11276, 'loss/train': 1.7609519958496094} 11/06/2021 22:48:38 - INFO - __main__ - Step 11278: {'lr': 0.0004951683268812511, 'samples': 2165376, 'steps': 11277, 'loss/train': 2.1392531394958496} 11/06/2021 22:48:38 - INFO - __main__ - Step 11279: {'lr': 0.0004951672885487178, 'samples': 2165568, 'steps': 11278, 'loss/train': 1.3590013980865479} 11/06/2021 22:48:39 - INFO - __main__ - Step 11280: {'lr': 0.0004951662501057161, 'samples': 2165760, 'steps': 11279, 'loss/train': 1.762505054473877} 11/06/2021 22:48:39 - INFO - __main__ - Step 11281: {'lr': 0.0004951652115522462, 'samples': 2165952, 'steps': 11280, 'loss/train': 1.7629950046539307} 11/06/2021 22:48:40 - INFO - __main__ - Step 11282: {'lr': 0.0004951641728883087, 'samples': 2166144, 'steps': 11281, 'loss/train': 2.0722081661224365} 11/06/2021 22:48:40 - INFO - __main__ - Step 11283: {'lr': 0.000495163134113904, 'samples': 2166336, 'steps': 11282, 'loss/train': 1.4518979787826538} 11/06/2021 22:48:40 - INFO - __main__ - Step 11284: {'lr': 0.0004951620952290325, 'samples': 2166528, 'steps': 11283, 'loss/train': 1.5402740240097046} 11/06/2021 22:48:41 - INFO - __main__ - Step 11285: {'lr': 0.0004951610562336949, 'samples': 2166720, 'steps': 11284, 'loss/train': 1.6244908571243286} 11/06/2021 22:48:42 - INFO - __main__ - Step 11286: {'lr': 0.0004951600171278914, 'samples': 2166912, 'steps': 11285, 'loss/train': 1.898524522781372} 11/06/2021 22:48:42 - INFO - __main__ - Step 11287: {'lr': 0.0004951589779116225, 'samples': 2167104, 'steps': 11286, 'loss/train': 1.5938304662704468} 11/06/2021 22:48:42 - INFO - __main__ - Step 11288: {'lr': 0.0004951579385848889, 'samples': 2167296, 'steps': 11287, 'loss/train': 1.7559034824371338} 11/06/2021 22:48:43 - INFO - __main__ - Step 11289: {'lr': 0.0004951568991476908, 'samples': 2167488, 'steps': 11288, 'loss/train': 1.3978345394134521} 11/06/2021 22:48:44 - INFO - __main__ - Step 11290: {'lr': 0.0004951558596000289, 'samples': 2167680, 'steps': 11289, 'loss/train': 2.016148328781128} 11/06/2021 22:48:44 - INFO - __main__ - Step 11291: {'lr': 0.0004951548199419035, 'samples': 2167872, 'steps': 11290, 'loss/train': 1.7956223487854004} 11/06/2021 22:48:45 - INFO - __main__ - Step 11292: {'lr': 0.0004951537801733152, 'samples': 2168064, 'steps': 11291, 'loss/train': 2.4459264278411865} 11/06/2021 22:48:45 - INFO - __main__ - Step 11293: {'lr': 0.0004951527402942643, 'samples': 2168256, 'steps': 11292, 'loss/train': 1.4760860204696655} 11/06/2021 22:48:45 - INFO - __main__ - Step 11294: {'lr': 0.0004951517003047512, 'samples': 2168448, 'steps': 11293, 'loss/train': 1.7521895170211792} 11/06/2021 22:48:46 - INFO - __main__ - Step 11295: {'lr': 0.0004951506602047767, 'samples': 2168640, 'steps': 11294, 'loss/train': 1.6032721996307373} 11/06/2021 22:48:47 - INFO - __main__ - Step 11296: {'lr': 0.0004951496199943412, 'samples': 2168832, 'steps': 11295, 'loss/train': 1.7261722087860107} 11/06/2021 22:48:47 - INFO - __main__ - Step 11297: {'lr': 0.0004951485796734448, 'samples': 2169024, 'steps': 11296, 'loss/train': 1.5861694812774658} 11/06/2021 22:48:47 - INFO - __main__ - Step 11298: {'lr': 0.0004951475392420884, 'samples': 2169216, 'steps': 11297, 'loss/train': 2.0938913822174072} 11/06/2021 22:48:48 - INFO - __main__ - Step 11299: {'lr': 0.0004951464987002724, 'samples': 2169408, 'steps': 11298, 'loss/train': 1.6789658069610596} 11/06/2021 22:48:49 - INFO - __main__ - Step 11300: {'lr': 0.000495145458047997, 'samples': 2169600, 'steps': 11299, 'loss/train': 1.7498912811279297} 11/06/2021 22:48:49 - INFO - __main__ - Step 11301: {'lr': 0.0004951444172852629, 'samples': 2169792, 'steps': 11300, 'loss/train': 1.6389604806900024} 11/06/2021 22:48:50 - INFO - __main__ - Step 11302: {'lr': 0.0004951433764120705, 'samples': 2169984, 'steps': 11301, 'loss/train': 1.9538257122039795} 11/06/2021 22:48:50 - INFO - __main__ - Step 11303: {'lr': 0.0004951423354284202, 'samples': 2170176, 'steps': 11302, 'loss/train': 1.9088263511657715} 11/06/2021 22:48:50 - INFO - __main__ - Step 11304: {'lr': 0.0004951412943343126, 'samples': 2170368, 'steps': 11303, 'loss/train': 1.584460735321045} 11/06/2021 22:48:51 - INFO - __main__ - Step 11305: {'lr': 0.0004951402531297482, 'samples': 2170560, 'steps': 11304, 'loss/train': 2.0329270362854004} 11/06/2021 22:48:52 - INFO - __main__ - Step 11306: {'lr': 0.0004951392118147273, 'samples': 2170752, 'steps': 11305, 'loss/train': 1.8602527379989624} 11/06/2021 22:48:52 - INFO - __main__ - Step 11307: {'lr': 0.0004951381703892506, 'samples': 2170944, 'steps': 11306, 'loss/train': 2.025909185409546} 11/06/2021 22:48:52 - INFO - __main__ - Step 11308: {'lr': 0.0004951371288533182, 'samples': 2171136, 'steps': 11307, 'loss/train': 2.1841092109680176} 11/06/2021 22:48:53 - INFO - __main__ - Step 11309: {'lr': 0.0004951360872069309, 'samples': 2171328, 'steps': 11308, 'loss/train': 1.3344480991363525} 11/06/2021 22:48:53 - INFO - __main__ - Step 11310: {'lr': 0.0004951350454500891, 'samples': 2171520, 'steps': 11309, 'loss/train': 1.7531185150146484} 11/06/2021 22:48:54 - INFO - __main__ - Step 11311: {'lr': 0.0004951340035827932, 'samples': 2171712, 'steps': 11310, 'loss/train': 1.6113486289978027} 11/06/2021 22:48:54 - INFO - __main__ - Step 11312: {'lr': 0.0004951329616050437, 'samples': 2171904, 'steps': 11311, 'loss/train': 1.9962801933288574} 11/06/2021 22:48:55 - INFO - __main__ - Step 11313: {'lr': 0.000495131919516841, 'samples': 2172096, 'steps': 11312, 'loss/train': 1.5207346677780151} 11/06/2021 22:48:55 - INFO - __main__ - Step 11314: {'lr': 0.0004951308773181856, 'samples': 2172288, 'steps': 11313, 'loss/train': 2.1913249492645264} 11/06/2021 22:48:55 - INFO - __main__ - Step 11315: {'lr': 0.0004951298350090782, 'samples': 2172480, 'steps': 11314, 'loss/train': 1.2448691129684448} 11/06/2021 22:48:56 - INFO - __main__ - Step 11316: {'lr': 0.000495128792589519, 'samples': 2172672, 'steps': 11315, 'loss/train': 2.215278387069702} 11/06/2021 22:48:57 - INFO - __main__ - Step 11317: {'lr': 0.0004951277500595085, 'samples': 2172864, 'steps': 11316, 'loss/train': 1.4738010168075562} 11/06/2021 22:48:57 - INFO - __main__ - Step 11318: {'lr': 0.0004951267074190473, 'samples': 2173056, 'steps': 11317, 'loss/train': 1.4282978773117065} 11/06/2021 22:48:58 - INFO - __main__ - Step 11319: {'lr': 0.0004951256646681356, 'samples': 2173248, 'steps': 11318, 'loss/train': 1.662163496017456} 11/06/2021 22:48:58 - INFO - __main__ - Step 11320: {'lr': 0.0004951246218067744, 'samples': 2173440, 'steps': 11319, 'loss/train': 1.9382797479629517} 11/06/2021 22:48:59 - INFO - __main__ - Step 11321: {'lr': 0.0004951235788349636, 'samples': 2173632, 'steps': 11320, 'loss/train': 3.9597983360290527} 11/06/2021 22:48:59 - INFO - __main__ - Step 11322: {'lr': 0.0004951225357527038, 'samples': 2173824, 'steps': 11321, 'loss/train': 1.7996548414230347} 11/06/2021 22:49:00 - INFO - __main__ - Step 11323: {'lr': 0.0004951214925599957, 'samples': 2174016, 'steps': 11322, 'loss/train': 1.6424283981323242} 11/06/2021 22:49:00 - INFO - __main__ - Step 11324: {'lr': 0.0004951204492568397, 'samples': 2174208, 'steps': 11323, 'loss/train': 1.757283329963684} 11/06/2021 22:49:00 - INFO - __main__ - Step 11325: {'lr': 0.0004951194058432361, 'samples': 2174400, 'steps': 11324, 'loss/train': 1.688281774520874} 11/06/2021 22:49:01 - INFO - __main__ - Step 11326: {'lr': 0.0004951183623191855, 'samples': 2174592, 'steps': 11325, 'loss/train': 1.493397831916809} 11/06/2021 22:49:02 - INFO - __main__ - Step 11327: {'lr': 0.0004951173186846884, 'samples': 2174784, 'steps': 11326, 'loss/train': 3.6629881858825684} 11/06/2021 22:49:02 - INFO - __main__ - Step 11328: {'lr': 0.0004951162749397452, 'samples': 2174976, 'steps': 11327, 'loss/train': 3.2852323055267334} 11/06/2021 22:49:03 - INFO - __main__ - Step 11329: {'lr': 0.0004951152310843564, 'samples': 2175168, 'steps': 11328, 'loss/train': 1.8344948291778564} 11/06/2021 22:49:03 - INFO - __main__ - Step 11330: {'lr': 0.0004951141871185224, 'samples': 2175360, 'steps': 11329, 'loss/train': 2.280247449874878} 11/06/2021 22:49:03 - INFO - __main__ - Step 11331: {'lr': 0.0004951131430422438, 'samples': 2175552, 'steps': 11330, 'loss/train': 1.4350394010543823} 11/06/2021 22:49:05 - INFO - __main__ - Step 11332: {'lr': 0.0004951120988555209, 'samples': 2175744, 'steps': 11331, 'loss/train': 2.2273924350738525} 11/06/2021 22:49:05 - INFO - __main__ - Step 11333: {'lr': 0.0004951110545583543, 'samples': 2175936, 'steps': 11332, 'loss/train': 1.8531851768493652} 11/06/2021 22:49:05 - INFO - __main__ - Step 11334: {'lr': 0.0004951100101507445, 'samples': 2176128, 'steps': 11333, 'loss/train': 1.8258116245269775} 11/06/2021 22:49:06 - INFO - __main__ - Step 11335: {'lr': 0.0004951089656326919, 'samples': 2176320, 'steps': 11334, 'loss/train': 0.3128710389137268} 11/06/2021 22:49:06 - INFO - __main__ - Step 11336: {'lr': 0.0004951079210041969, 'samples': 2176512, 'steps': 11335, 'loss/train': 1.8034968376159668} 11/06/2021 22:49:06 - INFO - __main__ - Step 11337: {'lr': 0.0004951068762652602, 'samples': 2176704, 'steps': 11336, 'loss/train': 1.5365264415740967} 11/06/2021 22:49:07 - INFO - __main__ - Step 11338: {'lr': 0.000495105831415882, 'samples': 2176896, 'steps': 11337, 'loss/train': 1.9638972282409668} 11/06/2021 22:49:08 - INFO - __main__ - Step 11339: {'lr': 0.0004951047864560629, 'samples': 2177088, 'steps': 11338, 'loss/train': 1.64912748336792} 11/06/2021 22:49:08 - INFO - __main__ - Step 11340: {'lr': 0.0004951037413858034, 'samples': 2177280, 'steps': 11339, 'loss/train': 1.4875028133392334} 11/06/2021 22:49:08 - INFO - __main__ - Step 11341: {'lr': 0.000495102696205104, 'samples': 2177472, 'steps': 11340, 'loss/train': 1.8226743936538696} 11/06/2021 22:49:09 - INFO - __main__ - Step 11342: {'lr': 0.000495101650913965, 'samples': 2177664, 'steps': 11341, 'loss/train': 1.8261113166809082} 11/06/2021 22:49:10 - INFO - __main__ - Step 11343: {'lr': 0.000495100605512387, 'samples': 2177856, 'steps': 11342, 'loss/train': 1.8620246648788452} 11/06/2021 22:49:10 - INFO - __main__ - Step 11344: {'lr': 0.0004950995600003705, 'samples': 2178048, 'steps': 11343, 'loss/train': 1.7309132814407349} 11/06/2021 22:49:10 - INFO - __main__ - Step 11345: {'lr': 0.0004950985143779159, 'samples': 2178240, 'steps': 11344, 'loss/train': 1.5737980604171753} 11/06/2021 22:49:11 - INFO - __main__ - Step 11346: {'lr': 0.0004950974686450237, 'samples': 2178432, 'steps': 11345, 'loss/train': 1.4471863508224487} 11/06/2021 22:49:11 - INFO - __main__ - Step 11347: {'lr': 0.0004950964228016944, 'samples': 2178624, 'steps': 11346, 'loss/train': 1.8836588859558105} 11/06/2021 22:49:12 - INFO - __main__ - Step 11348: {'lr': 0.0004950953768479284, 'samples': 2178816, 'steps': 11347, 'loss/train': 1.526648998260498} 11/06/2021 22:49:12 - INFO - __main__ - Step 11349: {'lr': 0.0004950943307837261, 'samples': 2179008, 'steps': 11348, 'loss/train': 1.9713560342788696} 11/06/2021 22:49:13 - INFO - __main__ - Step 11350: {'lr': 0.0004950932846090882, 'samples': 2179200, 'steps': 11349, 'loss/train': 1.60298490524292} 11/06/2021 22:49:13 - INFO - __main__ - Step 11351: {'lr': 0.000495092238324015, 'samples': 2179392, 'steps': 11350, 'loss/train': 1.1730953454971313} 11/06/2021 22:49:13 - INFO - __main__ - Step 11352: {'lr': 0.0004950911919285071, 'samples': 2179584, 'steps': 11351, 'loss/train': 1.5877281427383423} 11/06/2021 22:49:15 - INFO - __main__ - Step 11353: {'lr': 0.0004950901454225647, 'samples': 2179776, 'steps': 11352, 'loss/train': 1.8872724771499634} 11/06/2021 22:49:15 - INFO - __main__ - Step 11354: {'lr': 0.0004950890988061886, 'samples': 2179968, 'steps': 11353, 'loss/train': 2.0779507160186768} 11/06/2021 22:49:15 - INFO - __main__ - Step 11355: {'lr': 0.0004950880520793791, 'samples': 2180160, 'steps': 11354, 'loss/train': 1.030417561531067} 11/06/2021 22:49:16 - INFO - __main__ - Step 11356: {'lr': 0.0004950870052421368, 'samples': 2180352, 'steps': 11355, 'loss/train': 1.806678295135498} 11/06/2021 22:49:16 - INFO - __main__ - Step 11357: {'lr': 0.000495085958294462, 'samples': 2180544, 'steps': 11356, 'loss/train': 1.621626377105713} 11/06/2021 22:49:17 - INFO - __main__ - Step 11358: {'lr': 0.0004950849112363553, 'samples': 2180736, 'steps': 11357, 'loss/train': 0.5637978911399841} 11/06/2021 22:49:17 - INFO - __main__ - Step 11359: {'lr': 0.000495083864067817, 'samples': 2180928, 'steps': 11358, 'loss/train': 1.728484869003296} 11/06/2021 22:49:18 - INFO - __main__ - Step 11360: {'lr': 0.0004950828167888478, 'samples': 2181120, 'steps': 11359, 'loss/train': 1.9310498237609863} 11/06/2021 22:49:18 - INFO - __main__ - Step 11361: {'lr': 0.0004950817693994481, 'samples': 2181312, 'steps': 11360, 'loss/train': 1.8141530752182007} 11/06/2021 22:49:18 - INFO - __main__ - Step 11362: {'lr': 0.0004950807218996182, 'samples': 2181504, 'steps': 11361, 'loss/train': 2.154552936553955} 11/06/2021 22:49:19 - INFO - __main__ - Step 11363: {'lr': 0.0004950796742893588, 'samples': 2181696, 'steps': 11362, 'loss/train': 1.7956241369247437} 11/06/2021 22:49:20 - INFO - __main__ - Step 11364: {'lr': 0.0004950786265686702, 'samples': 2181888, 'steps': 11363, 'loss/train': 1.6461005210876465} 11/06/2021 22:49:20 - INFO - __main__ - Step 11365: {'lr': 0.000495077578737553, 'samples': 2182080, 'steps': 11364, 'loss/train': 1.5644863843917847} 11/06/2021 22:49:20 - INFO - __main__ - Step 11366: {'lr': 0.0004950765307960076, 'samples': 2182272, 'steps': 11365, 'loss/train': 1.4656745195388794} 11/06/2021 22:49:21 - INFO - __main__ - Step 11367: {'lr': 0.0004950754827440346, 'samples': 2182464, 'steps': 11366, 'loss/train': 1.9674415588378906} 11/06/2021 22:49:21 - INFO - __main__ - Step 11368: {'lr': 0.0004950744345816342, 'samples': 2182656, 'steps': 11367, 'loss/train': 1.2006498575210571} 11/06/2021 22:49:22 - INFO - __main__ - Step 11369: {'lr': 0.0004950733863088072, 'samples': 2182848, 'steps': 11368, 'loss/train': 2.536311388015747} 11/06/2021 22:49:22 - INFO - __main__ - Step 11370: {'lr': 0.0004950723379255538, 'samples': 2183040, 'steps': 11369, 'loss/train': 1.9103096723556519} 11/06/2021 22:49:23 - INFO - __main__ - Step 11371: {'lr': 0.0004950712894318748, 'samples': 2183232, 'steps': 11370, 'loss/train': 1.7062427997589111} 11/06/2021 22:49:23 - INFO - __main__ - Step 11372: {'lr': 0.0004950702408277702, 'samples': 2183424, 'steps': 11371, 'loss/train': 1.4938647747039795} 11/06/2021 22:49:23 - INFO - __main__ - Step 11373: {'lr': 0.0004950691921132409, 'samples': 2183616, 'steps': 11372, 'loss/train': 1.9969819784164429} 11/06/2021 22:49:24 - INFO - __main__ - Step 11374: {'lr': 0.000495068143288287, 'samples': 2183808, 'steps': 11373, 'loss/train': 1.8315733671188354} 11/06/2021 22:49:25 - INFO - __main__ - Step 11375: {'lr': 0.0004950670943529094, 'samples': 2184000, 'steps': 11374, 'loss/train': 0.9384849071502686} 11/06/2021 22:49:25 - INFO - __main__ - Step 11376: {'lr': 0.0004950660453071082, 'samples': 2184192, 'steps': 11375, 'loss/train': 1.814307689666748} 11/06/2021 22:49:25 - INFO - __main__ - Step 11377: {'lr': 0.0004950649961508841, 'samples': 2184384, 'steps': 11376, 'loss/train': 2.194822311401367} 11/06/2021 22:49:26 - INFO - __main__ - Step 11378: {'lr': 0.0004950639468842375, 'samples': 2184576, 'steps': 11377, 'loss/train': 2.181577205657959} 11/06/2021 22:49:27 - INFO - __main__ - Step 11379: {'lr': 0.0004950628975071688, 'samples': 2184768, 'steps': 11378, 'loss/train': 1.6916229724884033} 11/06/2021 22:49:27 - INFO - __main__ - Step 11380: {'lr': 0.0004950618480196785, 'samples': 2184960, 'steps': 11379, 'loss/train': 1.5891834497451782} 11/06/2021 22:49:28 - INFO - __main__ - Step 11381: {'lr': 0.0004950607984217674, 'samples': 2185152, 'steps': 11380, 'loss/train': 1.64052414894104} 11/06/2021 22:49:28 - INFO - __main__ - Step 11382: {'lr': 0.0004950597487134354, 'samples': 2185344, 'steps': 11381, 'loss/train': 2.620680570602417} 11/06/2021 22:49:28 - INFO - __main__ - Step 11383: {'lr': 0.0004950586988946834, 'samples': 2185536, 'steps': 11382, 'loss/train': 1.9298031330108643} 11/06/2021 22:49:29 - INFO - __main__ - Step 11384: {'lr': 0.0004950576489655116, 'samples': 2185728, 'steps': 11383, 'loss/train': 1.8474303483963013} 11/06/2021 22:49:30 - INFO - __main__ - Step 11385: {'lr': 0.0004950565989259207, 'samples': 2185920, 'steps': 11384, 'loss/train': 1.4204002618789673} 11/06/2021 22:49:30 - INFO - __main__ - Step 11386: {'lr': 0.000495055548775911, 'samples': 2186112, 'steps': 11385, 'loss/train': 1.618093729019165} 11/06/2021 22:49:30 - INFO - __main__ - Step 11387: {'lr': 0.0004950544985154831, 'samples': 2186304, 'steps': 11386, 'loss/train': 2.119515895843506} 11/06/2021 22:49:31 - INFO - __main__ - Step 11388: {'lr': 0.0004950534481446375, 'samples': 2186496, 'steps': 11387, 'loss/train': 2.0047428607940674} 11/06/2021 22:49:31 - INFO - __main__ - Step 11389: {'lr': 0.0004950523976633745, 'samples': 2186688, 'steps': 11388, 'loss/train': 1.9979164600372314} 11/06/2021 22:49:32 - INFO - __main__ - Step 11390: {'lr': 0.0004950513470716947, 'samples': 2186880, 'steps': 11389, 'loss/train': 1.188913345336914} 11/06/2021 22:49:32 - INFO - __main__ - Step 11391: {'lr': 0.0004950502963695985, 'samples': 2187072, 'steps': 11390, 'loss/train': 1.6476091146469116} 11/06/2021 22:49:33 - INFO - __main__ - Step 11392: {'lr': 0.0004950492455570865, 'samples': 2187264, 'steps': 11391, 'loss/train': 7.051599502563477} 11/06/2021 22:49:33 - INFO - __main__ - Step 11393: {'lr': 0.000495048194634159, 'samples': 2187456, 'steps': 11392, 'loss/train': 1.676889419555664} 11/06/2021 22:49:33 - INFO - __main__ - Step 11394: {'lr': 0.0004950471436008167, 'samples': 2187648, 'steps': 11393, 'loss/train': 1.9109240770339966} 11/06/2021 22:49:35 - INFO - __main__ - Step 11395: {'lr': 0.0004950460924570598, 'samples': 2187840, 'steps': 11394, 'loss/train': 1.9026075601577759} 11/06/2021 22:49:35 - INFO - __main__ - Step 11396: {'lr': 0.0004950450412028889, 'samples': 2188032, 'steps': 11395, 'loss/train': 1.8585050106048584} 11/06/2021 22:49:35 - INFO - __main__ - Step 11397: {'lr': 0.0004950439898383047, 'samples': 2188224, 'steps': 11396, 'loss/train': 2.125458002090454} 11/06/2021 22:49:36 - INFO - __main__ - Step 11398: {'lr': 0.0004950429383633073, 'samples': 2188416, 'steps': 11397, 'loss/train': 1.5339622497558594} 11/06/2021 22:49:36 - INFO - __main__ - Step 11399: {'lr': 0.0004950418867778973, 'samples': 2188608, 'steps': 11398, 'loss/train': 1.875319480895996} 11/06/2021 22:49:37 - INFO - __main__ - Step 11400: {'lr': 0.0004950408350820752, 'samples': 2188800, 'steps': 11399, 'loss/train': 1.5913536548614502} 11/06/2021 22:49:37 - INFO - __main__ - Step 11401: {'lr': 0.0004950397832758415, 'samples': 2188992, 'steps': 11400, 'loss/train': 1.3669840097427368} 11/06/2021 22:49:38 - INFO - __main__ - Step 11402: {'lr': 0.0004950387313591968, 'samples': 2189184, 'steps': 11401, 'loss/train': 1.8628370761871338} 11/06/2021 22:49:38 - INFO - __main__ - Step 11403: {'lr': 0.0004950376793321413, 'samples': 2189376, 'steps': 11402, 'loss/train': 1.7271697521209717} 11/06/2021 22:49:38 - INFO - __main__ - Step 11404: {'lr': 0.0004950366271946756, 'samples': 2189568, 'steps': 11403, 'loss/train': 2.1682770252227783} 11/06/2021 22:49:39 - INFO - __main__ - Step 11405: {'lr': 0.0004950355749468001, 'samples': 2189760, 'steps': 11404, 'loss/train': 1.8250188827514648} 11/06/2021 22:49:40 - INFO - __main__ - Step 11406: {'lr': 0.0004950345225885155, 'samples': 2189952, 'steps': 11405, 'loss/train': 1.594653844833374} 11/06/2021 22:49:40 - INFO - __main__ - Step 11407: {'lr': 0.0004950334701198222, 'samples': 2190144, 'steps': 11406, 'loss/train': 1.9516605138778687} 11/06/2021 22:49:40 - INFO - __main__ - Step 11408: {'lr': 0.0004950324175407204, 'samples': 2190336, 'steps': 11407, 'loss/train': 1.3681046962738037} 11/06/2021 22:49:41 - INFO - __main__ - Step 11409: {'lr': 0.0004950313648512108, 'samples': 2190528, 'steps': 11408, 'loss/train': 1.6536760330200195} 11/06/2021 22:49:42 - INFO - __main__ - Step 11410: {'lr': 0.0004950303120512939, 'samples': 2190720, 'steps': 11409, 'loss/train': 1.9302911758422852} 11/06/2021 22:49:42 - INFO - __main__ - Step 11411: {'lr': 0.0004950292591409701, 'samples': 2190912, 'steps': 11410, 'loss/train': 2.0202114582061768} 11/06/2021 22:49:42 - INFO - __main__ - Step 11412: {'lr': 0.0004950282061202399, 'samples': 2191104, 'steps': 11411, 'loss/train': 1.8923609256744385} 11/06/2021 22:49:43 - INFO - __main__ - Step 11413: {'lr': 0.0004950271529891038, 'samples': 2191296, 'steps': 11412, 'loss/train': 1.6358274221420288} 11/06/2021 22:49:43 - INFO - __main__ - Step 11414: {'lr': 0.0004950260997475623, 'samples': 2191488, 'steps': 11413, 'loss/train': 1.7055529356002808} 11/06/2021 22:49:43 - INFO - __main__ - Step 11415: {'lr': 0.0004950250463956157, 'samples': 2191680, 'steps': 11414, 'loss/train': 1.4822801351547241} 11/06/2021 22:49:44 - INFO - __main__ - Step 11416: {'lr': 0.0004950239929332646, 'samples': 2191872, 'steps': 11415, 'loss/train': 1.7098220586776733} 11/06/2021 22:49:45 - INFO - __main__ - Step 11417: {'lr': 0.0004950229393605095, 'samples': 2192064, 'steps': 11416, 'loss/train': 1.437166690826416} 11/06/2021 22:49:45 - INFO - __main__ - Step 11418: {'lr': 0.0004950218856773509, 'samples': 2192256, 'steps': 11417, 'loss/train': 1.7919132709503174} 11/06/2021 22:49:46 - INFO - __main__ - Step 11419: {'lr': 0.0004950208318837892, 'samples': 2192448, 'steps': 11418, 'loss/train': 1.5548012256622314} 11/06/2021 22:49:46 - INFO - __main__ - Step 11420: {'lr': 0.0004950197779798248, 'samples': 2192640, 'steps': 11419, 'loss/train': 1.615087628364563} 11/06/2021 22:49:47 - INFO - __main__ - Step 11421: {'lr': 0.0004950187239654584, 'samples': 2192832, 'steps': 11420, 'loss/train': 1.5422042608261108} 11/06/2021 22:49:47 - INFO - __main__ - Step 11422: {'lr': 0.0004950176698406903, 'samples': 2193024, 'steps': 11421, 'loss/train': 1.7497607469558716} 11/06/2021 22:49:48 - INFO - __main__ - Step 11423: {'lr': 0.000495016615605521, 'samples': 2193216, 'steps': 11422, 'loss/train': 1.8676695823669434} 11/06/2021 22:49:48 - INFO - __main__ - Step 11424: {'lr': 0.0004950155612599511, 'samples': 2193408, 'steps': 11423, 'loss/train': 1.7228354215621948} 11/06/2021 22:49:48 - INFO - __main__ - Step 11425: {'lr': 0.0004950145068039808, 'samples': 2193600, 'steps': 11424, 'loss/train': 2.0538418292999268} 11/06/2021 22:49:50 - INFO - __main__ - Step 11426: {'lr': 0.0004950134522376108, 'samples': 2193792, 'steps': 11425, 'loss/train': 2.0715749263763428} 11/06/2021 22:49:50 - INFO - __main__ - Step 11427: {'lr': 0.0004950123975608415, 'samples': 2193984, 'steps': 11426, 'loss/train': 1.7007184028625488} 11/06/2021 22:49:50 - INFO - __main__ - Step 11428: {'lr': 0.0004950113427736734, 'samples': 2194176, 'steps': 11427, 'loss/train': 3.19921612739563} 11/06/2021 22:49:51 - INFO - __main__ - Step 11429: {'lr': 0.000495010287876107, 'samples': 2194368, 'steps': 11428, 'loss/train': 1.4021397829055786} 11/06/2021 22:49:51 - INFO - __main__ - Step 11430: {'lr': 0.0004950092328681428, 'samples': 2194560, 'steps': 11429, 'loss/train': 1.3537622690200806} 11/06/2021 22:49:52 - INFO - __main__ - Step 11431: {'lr': 0.0004950081777497812, 'samples': 2194752, 'steps': 11430, 'loss/train': 2.075549840927124} 11/06/2021 22:49:52 - INFO - __main__ - Step 11432: {'lr': 0.0004950071225210226, 'samples': 2194944, 'steps': 11431, 'loss/train': 2.2437236309051514} 11/06/2021 22:49:53 - INFO - __main__ - Step 11433: {'lr': 0.0004950060671818676, 'samples': 2195136, 'steps': 11432, 'loss/train': 1.8015081882476807} 11/06/2021 22:49:53 - INFO - __main__ - Step 11434: {'lr': 0.0004950050117323167, 'samples': 2195328, 'steps': 11433, 'loss/train': 1.7495261430740356} 11/06/2021 22:49:54 - INFO - __main__ - Step 11435: {'lr': 0.0004950039561723703, 'samples': 2195520, 'steps': 11434, 'loss/train': 2.0817348957061768} 11/06/2021 22:49:54 - INFO - __main__ - Step 11436: {'lr': 0.0004950029005020289, 'samples': 2195712, 'steps': 11435, 'loss/train': 1.7753431797027588} 11/06/2021 22:49:55 - INFO - __main__ - Step 11437: {'lr': 0.0004950018447212929, 'samples': 2195904, 'steps': 11436, 'loss/train': 2.0094332695007324} 11/06/2021 22:49:55 - INFO - __main__ - Step 11438: {'lr': 0.000495000788830163, 'samples': 2196096, 'steps': 11437, 'loss/train': 1.0826306343078613} 11/06/2021 22:49:56 - INFO - __main__ - Step 11439: {'lr': 0.0004949997328286394, 'samples': 2196288, 'steps': 11438, 'loss/train': 1.5447814464569092} 11/06/2021 22:49:56 - INFO - __main__ - Step 11440: {'lr': 0.0004949986767167228, 'samples': 2196480, 'steps': 11439, 'loss/train': 1.7402944564819336} 11/06/2021 22:49:56 - INFO - __main__ - Step 11441: {'lr': 0.0004949976204944135, 'samples': 2196672, 'steps': 11440, 'loss/train': 1.6975988149642944} 11/06/2021 22:49:57 - INFO - __main__ - Step 11442: {'lr': 0.0004949965641617121, 'samples': 2196864, 'steps': 11441, 'loss/train': 1.6777021884918213} 11/06/2021 22:49:58 - INFO - __main__ - Step 11443: {'lr': 0.000494995507718619, 'samples': 2197056, 'steps': 11442, 'loss/train': 1.3952085971832275} 11/06/2021 22:49:58 - INFO - __main__ - Step 11444: {'lr': 0.0004949944511651347, 'samples': 2197248, 'steps': 11443, 'loss/train': 1.7834969758987427} 11/06/2021 22:49:58 - INFO - __main__ - Step 11445: {'lr': 0.0004949933945012597, 'samples': 2197440, 'steps': 11444, 'loss/train': 1.753387212753296} 11/06/2021 22:49:59 - INFO - __main__ - Step 11446: {'lr': 0.0004949923377269945, 'samples': 2197632, 'steps': 11445, 'loss/train': 2.039705753326416} 11/06/2021 22:49:59 - INFO - __main__ - Step 11447: {'lr': 0.0004949912808423394, 'samples': 2197824, 'steps': 11446, 'loss/train': 1.4728835821151733} 11/06/2021 22:50:01 - INFO - __main__ - Step 11448: {'lr': 0.000494990223847295, 'samples': 2198016, 'steps': 11447, 'loss/train': 1.628504991531372} 11/06/2021 22:50:01 - INFO - __main__ - Step 11449: {'lr': 0.000494989166741862, 'samples': 2198208, 'steps': 11448, 'loss/train': 2.0440473556518555} 11/06/2021 22:50:01 - INFO - __main__ - Step 11450: {'lr': 0.0004949881095260405, 'samples': 2198400, 'steps': 11449, 'loss/train': 1.660749912261963} 11/06/2021 22:50:02 - INFO - __main__ - Step 11451: {'lr': 0.0004949870521998312, 'samples': 2198592, 'steps': 11450, 'loss/train': 1.8337377309799194} 11/06/2021 22:50:02 - INFO - __main__ - Step 11452: {'lr': 0.0004949859947632344, 'samples': 2198784, 'steps': 11451, 'loss/train': 1.8013213872909546} 11/06/2021 22:50:03 - INFO - __main__ - Step 11453: {'lr': 0.0004949849372162509, 'samples': 2198976, 'steps': 11452, 'loss/train': 1.1063177585601807} 11/06/2021 22:50:03 - INFO - __main__ - Step 11454: {'lr': 0.0004949838795588808, 'samples': 2199168, 'steps': 11453, 'loss/train': 1.6007579565048218} 11/06/2021 22:50:04 - INFO - __main__ - Step 11455: {'lr': 0.0004949828217911248, 'samples': 2199360, 'steps': 11454, 'loss/train': 1.6126108169555664} 11/06/2021 22:50:04 - INFO - __main__ - Step 11456: {'lr': 0.0004949817639129832, 'samples': 2199552, 'steps': 11455, 'loss/train': 1.85826575756073} 11/06/2021 22:50:04 - INFO - __main__ - Step 11457: {'lr': 0.0004949807059244568, 'samples': 2199744, 'steps': 11456, 'loss/train': 1.6572095155715942} 11/06/2021 22:50:05 - INFO - __main__ - Step 11458: {'lr': 0.0004949796478255458, 'samples': 2199936, 'steps': 11457, 'loss/train': 1.5031503438949585} 11/06/2021 22:50:06 - INFO - __main__ - Step 11459: {'lr': 0.0004949785896162507, 'samples': 2200128, 'steps': 11458, 'loss/train': 1.6806023120880127} 11/06/2021 22:50:06 - INFO - __main__ - Step 11460: {'lr': 0.0004949775312965721, 'samples': 2200320, 'steps': 11459, 'loss/train': 1.5449903011322021} 11/06/2021 22:50:06 - INFO - __main__ - Step 11461: {'lr': 0.0004949764728665103, 'samples': 2200512, 'steps': 11460, 'loss/train': 1.651967167854309} 11/06/2021 22:50:07 - INFO - __main__ - Step 11462: {'lr': 0.000494975414326066, 'samples': 2200704, 'steps': 11461, 'loss/train': 1.8669898509979248} 11/06/2021 22:50:07 - INFO - __main__ - Step 11463: {'lr': 0.0004949743556752395, 'samples': 2200896, 'steps': 11462, 'loss/train': 1.9016015529632568} 11/06/2021 22:50:08 - INFO - __main__ - Step 11464: {'lr': 0.0004949732969140313, 'samples': 2201088, 'steps': 11463, 'loss/train': 1.5884931087493896} 11/06/2021 22:50:09 - INFO - __main__ - Step 11465: {'lr': 0.000494972238042442, 'samples': 2201280, 'steps': 11464, 'loss/train': 1.8496793508529663} 11/06/2021 22:50:09 - INFO - __main__ - Step 11466: {'lr': 0.0004949711790604719, 'samples': 2201472, 'steps': 11465, 'loss/train': 1.83021879196167} 11/06/2021 22:50:09 - INFO - __main__ - Step 11467: {'lr': 0.0004949701199681217, 'samples': 2201664, 'steps': 11466, 'loss/train': 1.7761445045471191} 11/06/2021 22:50:10 - INFO - __main__ - Step 11468: {'lr': 0.0004949690607653916, 'samples': 2201856, 'steps': 11467, 'loss/train': 1.5645625591278076} 11/06/2021 22:50:10 - INFO - __main__ - Step 11469: {'lr': 0.0004949680014522822, 'samples': 2202048, 'steps': 11468, 'loss/train': 1.7550435066223145} 11/06/2021 22:50:11 - INFO - __main__ - Step 11470: {'lr': 0.0004949669420287941, 'samples': 2202240, 'steps': 11469, 'loss/train': 1.595177412033081} 11/06/2021 22:50:11 - INFO - __main__ - Step 11471: {'lr': 0.0004949658824949277, 'samples': 2202432, 'steps': 11470, 'loss/train': 1.6120365858078003} 11/06/2021 22:50:12 - INFO - __main__ - Step 11472: {'lr': 0.0004949648228506834, 'samples': 2202624, 'steps': 11471, 'loss/train': 1.6954563856124878} 11/06/2021 22:50:12 - INFO - __main__ - Step 11473: {'lr': 0.0004949637630960618, 'samples': 2202816, 'steps': 11472, 'loss/train': 1.4135258197784424} 11/06/2021 22:50:13 - INFO - __main__ - Step 11474: {'lr': 0.0004949627032310632, 'samples': 2203008, 'steps': 11473, 'loss/train': 1.074235200881958} 11/06/2021 22:50:13 - INFO - __main__ - Step 11475: {'lr': 0.0004949616432556882, 'samples': 2203200, 'steps': 11474, 'loss/train': 2.001767635345459} 11/06/2021 22:50:14 - INFO - __main__ - Step 11476: {'lr': 0.0004949605831699373, 'samples': 2203392, 'steps': 11475, 'loss/train': 1.4366241693496704} 11/06/2021 22:50:14 - INFO - __main__ - Step 11477: {'lr': 0.000494959522973811, 'samples': 2203584, 'steps': 11476, 'loss/train': 1.5947993993759155} 11/06/2021 22:50:14 - INFO - __main__ - Step 11478: {'lr': 0.0004949584626673096, 'samples': 2203776, 'steps': 11477, 'loss/train': 1.586674451828003} 11/06/2021 22:50:15 - INFO - __main__ - Step 11479: {'lr': 0.0004949574022504338, 'samples': 2203968, 'steps': 11478, 'loss/train': 1.7831062078475952} 11/06/2021 22:50:16 - INFO - __main__ - Step 11480: {'lr': 0.0004949563417231838, 'samples': 2204160, 'steps': 11479, 'loss/train': 2.0792524814605713} 11/06/2021 22:50:16 - INFO - __main__ - Step 11481: {'lr': 0.0004949552810855605, 'samples': 2204352, 'steps': 11480, 'loss/train': 1.7512733936309814} 11/06/2021 22:50:16 - INFO - __main__ - Step 11482: {'lr': 0.000494954220337564, 'samples': 2204544, 'steps': 11481, 'loss/train': 1.9360369443893433} 11/06/2021 22:50:17 - INFO - __main__ - Step 11483: {'lr': 0.0004949531594791948, 'samples': 2204736, 'steps': 11482, 'loss/train': 0.5724799633026123} 11/06/2021 22:50:18 - INFO - __main__ - Step 11484: {'lr': 0.0004949520985104536, 'samples': 2204928, 'steps': 11483, 'loss/train': 1.9736790657043457} 11/06/2021 22:50:18 - INFO - __main__ - Step 11485: {'lr': 0.0004949510374313409, 'samples': 2205120, 'steps': 11484, 'loss/train': 1.3778972625732422} 11/06/2021 22:50:18 - INFO - __main__ - Step 11486: {'lr': 0.0004949499762418568, 'samples': 2205312, 'steps': 11485, 'loss/train': 1.658096194267273} 11/06/2021 22:50:19 - INFO - __main__ - Step 11487: {'lr': 0.0004949489149420021, 'samples': 2205504, 'steps': 11486, 'loss/train': 1.9003406763076782} 11/06/2021 22:50:19 - INFO - __main__ - Step 11488: {'lr': 0.0004949478535317773, 'samples': 2205696, 'steps': 11487, 'loss/train': 1.31734299659729} 11/06/2021 22:50:20 - INFO - __main__ - Step 11489: {'lr': 0.0004949467920111827, 'samples': 2205888, 'steps': 11488, 'loss/train': 2.2677342891693115} 11/06/2021 22:50:21 - INFO - __main__ - Step 11490: {'lr': 0.0004949457303802189, 'samples': 2206080, 'steps': 11489, 'loss/train': 1.4590067863464355} 11/06/2021 22:50:21 - INFO - __main__ - Step 11491: {'lr': 0.0004949446686388862, 'samples': 2206272, 'steps': 11490, 'loss/train': 1.8408727645874023} 11/06/2021 22:50:21 - INFO - __main__ - Step 11492: {'lr': 0.0004949436067871854, 'samples': 2206464, 'steps': 11491, 'loss/train': 2.519899845123291} 11/06/2021 22:50:22 - INFO - __main__ - Step 11493: {'lr': 0.0004949425448251166, 'samples': 2206656, 'steps': 11492, 'loss/train': 1.598381757736206} 11/06/2021 22:50:22 - INFO - __main__ - Step 11494: {'lr': 0.0004949414827526805, 'samples': 2206848, 'steps': 11493, 'loss/train': 1.779862642288208} 11/06/2021 22:50:23 - INFO - __main__ - Step 11495: {'lr': 0.0004949404205698777, 'samples': 2207040, 'steps': 11494, 'loss/train': 1.9242050647735596} 11/06/2021 22:50:23 - INFO - __main__ - Step 11496: {'lr': 0.0004949393582767084, 'samples': 2207232, 'steps': 11495, 'loss/train': 1.6788808107376099} 11/06/2021 22:50:24 - INFO - __main__ - Step 11497: {'lr': 0.0004949382958731733, 'samples': 2207424, 'steps': 11496, 'loss/train': 1.5965083837509155} 11/06/2021 22:50:24 - INFO - __main__ - Step 11498: {'lr': 0.0004949372333592728, 'samples': 2207616, 'steps': 11497, 'loss/train': 2.236138105392456} 11/06/2021 22:50:24 - INFO - __main__ - Step 11499: {'lr': 0.0004949361707350072, 'samples': 2207808, 'steps': 11498, 'loss/train': 1.7686853408813477} 11/06/2021 22:50:25 - INFO - __main__ - Step 11500: {'lr': 0.0004949351080003773, 'samples': 2208000, 'steps': 11499, 'loss/train': 1.9962258338928223} 11/06/2021 22:50:26 - INFO - __main__ - Step 11501: {'lr': 0.0004949340451553833, 'samples': 2208192, 'steps': 11500, 'loss/train': 1.7358423471450806} 11/06/2021 22:50:26 - INFO - __main__ - Step 11502: {'lr': 0.0004949329822000259, 'samples': 2208384, 'steps': 11501, 'loss/train': 1.7392240762710571} 11/06/2021 22:50:26 - INFO - __main__ - Step 11503: {'lr': 0.0004949319191343053, 'samples': 2208576, 'steps': 11502, 'loss/train': 2.329840660095215} 11/06/2021 22:50:27 - INFO - __main__ - Step 11504: {'lr': 0.0004949308559582224, 'samples': 2208768, 'steps': 11503, 'loss/train': 1.5851150751113892} 11/06/2021 22:50:28 - INFO - __main__ - Step 11505: {'lr': 0.0004949297926717772, 'samples': 2208960, 'steps': 11504, 'loss/train': 1.932198166847229} 11/06/2021 22:50:28 - INFO - __main__ - Step 11506: {'lr': 0.0004949287292749705, 'samples': 2209152, 'steps': 11505, 'loss/train': 1.7171591520309448} 11/06/2021 22:50:29 - INFO - __main__ - Step 11507: {'lr': 0.0004949276657678028, 'samples': 2209344, 'steps': 11506, 'loss/train': 2.190236806869507} 11/06/2021 22:50:29 - INFO - __main__ - Step 11508: {'lr': 0.0004949266021502744, 'samples': 2209536, 'steps': 11507, 'loss/train': 1.2840288877487183} 11/06/2021 22:50:29 - INFO - __main__ - Step 11509: {'lr': 0.0004949255384223859, 'samples': 2209728, 'steps': 11508, 'loss/train': 0.29179802536964417} 11/06/2021 22:50:30 - INFO - __main__ - Step 11510: {'lr': 0.0004949244745841377, 'samples': 2209920, 'steps': 11509, 'loss/train': 1.3277264833450317} 11/06/2021 22:50:31 - INFO - __main__ - Step 11511: {'lr': 0.0004949234106355302, 'samples': 2210112, 'steps': 11510, 'loss/train': 2.052576780319214} 11/06/2021 22:50:31 - INFO - __main__ - Step 11512: {'lr': 0.0004949223465765642, 'samples': 2210304, 'steps': 11511, 'loss/train': 1.9434692859649658} 11/06/2021 22:50:32 - INFO - __main__ - Step 11513: {'lr': 0.0004949212824072398, 'samples': 2210496, 'steps': 11512, 'loss/train': 2.3121955394744873} 11/06/2021 22:50:32 - INFO - __main__ - Step 11514: {'lr': 0.0004949202181275577, 'samples': 2210688, 'steps': 11513, 'loss/train': 1.9893686771392822} 11/06/2021 22:50:32 - INFO - __main__ - Step 11515: {'lr': 0.0004949191537375184, 'samples': 2210880, 'steps': 11514, 'loss/train': 1.3614085912704468} 11/06/2021 22:50:33 - INFO - __main__ - Step 11516: {'lr': 0.0004949180892371223, 'samples': 2211072, 'steps': 11515, 'loss/train': 1.9664455652236938} 11/06/2021 22:50:34 - INFO - __main__ - Step 11517: {'lr': 0.0004949170246263697, 'samples': 2211264, 'steps': 11516, 'loss/train': 2.041748046875} 11/06/2021 22:50:34 - INFO - __main__ - Step 11518: {'lr': 0.0004949159599052614, 'samples': 2211456, 'steps': 11517, 'loss/train': 1.993459939956665} 11/06/2021 22:50:34 - INFO - __main__ - Step 11519: {'lr': 0.0004949148950737978, 'samples': 2211648, 'steps': 11518, 'loss/train': 1.6282458305358887} 11/06/2021 22:50:35 - INFO - __main__ - Step 11520: {'lr': 0.0004949138301319793, 'samples': 2211840, 'steps': 11519, 'loss/train': 1.2587100267410278} 11/06/2021 22:50:35 - INFO - __main__ - Step 11521: {'lr': 0.0004949127650798063, 'samples': 2212032, 'steps': 11520, 'loss/train': 1.619840145111084} 11/06/2021 22:50:36 - INFO - __main__ - Step 11522: {'lr': 0.0004949116999172795, 'samples': 2212224, 'steps': 11521, 'loss/train': 1.572475790977478} 11/06/2021 22:50:36 - INFO - __main__ - Step 11523: {'lr': 0.0004949106346443992, 'samples': 2212416, 'steps': 11522, 'loss/train': 1.6941105127334595} 11/06/2021 22:50:37 - INFO - __main__ - Step 11524: {'lr': 0.0004949095692611661, 'samples': 2212608, 'steps': 11523, 'loss/train': 1.9662777185440063} 11/06/2021 22:50:37 - INFO - __main__ - Step 11525: {'lr': 0.0004949085037675803, 'samples': 2212800, 'steps': 11524, 'loss/train': 1.7247838973999023} 11/06/2021 22:50:37 - INFO - __main__ - Step 11526: {'lr': 0.0004949074381636427, 'samples': 2212992, 'steps': 11525, 'loss/train': 1.9012329578399658} 11/06/2021 22:50:38 - INFO - __main__ - Step 11527: {'lr': 0.0004949063724493534, 'samples': 2213184, 'steps': 11526, 'loss/train': 1.9752020835876465} 11/06/2021 22:50:39 - INFO - __main__ - Step 11528: {'lr': 0.0004949053066247133, 'samples': 2213376, 'steps': 11527, 'loss/train': 1.4446111917495728} 11/06/2021 22:50:39 - INFO - __main__ - Step 11529: {'lr': 0.0004949042406897225, 'samples': 2213568, 'steps': 11528, 'loss/train': 1.739598274230957} 11/06/2021 22:50:40 - INFO - __main__ - Step 11530: {'lr': 0.0004949031746443816, 'samples': 2213760, 'steps': 11529, 'loss/train': 1.8068318367004395} 11/06/2021 22:50:40 - INFO - __main__ - Step 11531: {'lr': 0.0004949021084886912, 'samples': 2213952, 'steps': 11530, 'loss/train': 1.7446298599243164} 11/06/2021 22:50:41 - INFO - __main__ - Step 11532: {'lr': 0.0004949010422226517, 'samples': 2214144, 'steps': 11531, 'loss/train': 1.2922008037567139} 11/06/2021 22:50:41 - INFO - __main__ - Step 11533: {'lr': 0.0004948999758462634, 'samples': 2214336, 'steps': 11532, 'loss/train': 2.062342882156372} 11/06/2021 22:50:42 - INFO - __main__ - Step 11534: {'lr': 0.000494898909359527, 'samples': 2214528, 'steps': 11533, 'loss/train': 1.4486849308013916} 11/06/2021 22:50:42 - INFO - __main__ - Step 11535: {'lr': 0.0004948978427624431, 'samples': 2214720, 'steps': 11534, 'loss/train': 1.7897515296936035} 11/06/2021 22:50:42 - INFO - __main__ - Step 11536: {'lr': 0.0004948967760550119, 'samples': 2214912, 'steps': 11535, 'loss/train': 1.818204402923584} 11/06/2021 22:50:43 - INFO - __main__ - Step 11537: {'lr': 0.000494895709237234, 'samples': 2215104, 'steps': 11536, 'loss/train': 1.166475534439087} 11/06/2021 22:50:44 - INFO - __main__ - Step 11538: {'lr': 0.0004948946423091099, 'samples': 2215296, 'steps': 11537, 'loss/train': 1.8267381191253662} 11/06/2021 22:50:44 - INFO - __main__ - Step 11539: {'lr': 0.0004948935752706401, 'samples': 2215488, 'steps': 11538, 'loss/train': 2.1393001079559326} 11/06/2021 22:50:44 - INFO - __main__ - Step 11540: {'lr': 0.0004948925081218248, 'samples': 2215680, 'steps': 11539, 'loss/train': 1.0833170413970947} 11/06/2021 22:50:45 - INFO - __main__ - Step 11541: {'lr': 0.000494891440862665, 'samples': 2215872, 'steps': 11540, 'loss/train': 1.8711026906967163} 11/06/2021 22:50:46 - INFO - __main__ - Step 11542: {'lr': 0.0004948903734931608, 'samples': 2216064, 'steps': 11541, 'loss/train': 1.5871480703353882} 11/06/2021 22:50:46 - INFO - __main__ - Step 11543: {'lr': 0.0004948893060133128, 'samples': 2216256, 'steps': 11542, 'loss/train': 1.5107953548431396} 11/06/2021 22:50:47 - INFO - __main__ - Step 11544: {'lr': 0.0004948882384231213, 'samples': 2216448, 'steps': 11543, 'loss/train': 1.1852807998657227} 11/06/2021 22:50:47 - INFO - __main__ - Step 11545: {'lr': 0.0004948871707225871, 'samples': 2216640, 'steps': 11544, 'loss/train': 1.8642654418945312} 11/06/2021 22:50:47 - INFO - __main__ - Step 11546: {'lr': 0.0004948861029117104, 'samples': 2216832, 'steps': 11545, 'loss/train': 1.524583339691162} 11/06/2021 22:50:48 - INFO - __main__ - Step 11547: {'lr': 0.0004948850349904919, 'samples': 2217024, 'steps': 11546, 'loss/train': 1.7309693098068237} 11/06/2021 22:50:49 - INFO - __main__ - Step 11548: {'lr': 0.0004948839669589319, 'samples': 2217216, 'steps': 11547, 'loss/train': 1.4048861265182495} 11/06/2021 22:50:49 - INFO - __main__ - Step 11549: {'lr': 0.000494882898817031, 'samples': 2217408, 'steps': 11548, 'loss/train': 1.8817989826202393} 11/06/2021 22:50:49 - INFO - __main__ - Step 11550: {'lr': 0.0004948818305647897, 'samples': 2217600, 'steps': 11549, 'loss/train': 1.4648607969284058} 11/06/2021 22:50:50 - INFO - __main__ - Step 11551: {'lr': 0.0004948807622022083, 'samples': 2217792, 'steps': 11550, 'loss/train': 2.3621954917907715} 11/06/2021 22:50:50 - INFO - __main__ - Step 11552: {'lr': 0.0004948796937292875, 'samples': 2217984, 'steps': 11551, 'loss/train': 1.87119722366333} 11/06/2021 22:50:51 - INFO - __main__ - Step 11553: {'lr': 0.0004948786251460277, 'samples': 2218176, 'steps': 11552, 'loss/train': 2.0248570442199707} 11/06/2021 22:50:51 - INFO - __main__ - Step 11554: {'lr': 0.0004948775564524294, 'samples': 2218368, 'steps': 11553, 'loss/train': 2.0253686904907227} 11/06/2021 22:50:52 - INFO - __main__ - Step 11555: {'lr': 0.000494876487648493, 'samples': 2218560, 'steps': 11554, 'loss/train': 1.359278917312622} 11/06/2021 22:50:52 - INFO - __main__ - Step 11556: {'lr': 0.0004948754187342189, 'samples': 2218752, 'steps': 11555, 'loss/train': 2.02559757232666} 11/06/2021 22:50:52 - INFO - __main__ - Step 11557: {'lr': 0.0004948743497096079, 'samples': 2218944, 'steps': 11556, 'loss/train': 1.019589900970459} 11/06/2021 22:50:54 - INFO - __main__ - Step 11558: {'lr': 0.0004948732805746604, 'samples': 2219136, 'steps': 11557, 'loss/train': 1.860040545463562} 11/06/2021 22:50:54 - INFO - __main__ - Step 11559: {'lr': 0.0004948722113293766, 'samples': 2219328, 'steps': 11558, 'loss/train': 1.786012053489685} 11/06/2021 22:50:54 - INFO - __main__ - Step 11560: {'lr': 0.000494871141973757, 'samples': 2219520, 'steps': 11559, 'loss/train': 1.9296960830688477} 11/06/2021 22:50:55 - INFO - __main__ - Step 11561: {'lr': 0.0004948700725078025, 'samples': 2219712, 'steps': 11560, 'loss/train': 1.883889079093933} 11/06/2021 22:50:55 - INFO - __main__ - Step 11562: {'lr': 0.0004948690029315133, 'samples': 2219904, 'steps': 11561, 'loss/train': 1.7466262578964233} 11/06/2021 22:50:56 - INFO - __main__ - Step 11563: {'lr': 0.0004948679332448899, 'samples': 2220096, 'steps': 11562, 'loss/train': 1.7426663637161255} 11/06/2021 22:50:57 - INFO - __main__ - Step 11564: {'lr': 0.0004948668634479327, 'samples': 2220288, 'steps': 11563, 'loss/train': 2.0808141231536865} 11/06/2021 22:50:57 - INFO - __main__ - Step 11565: {'lr': 0.0004948657935406423, 'samples': 2220480, 'steps': 11564, 'loss/train': 1.635694980621338} 11/06/2021 22:50:57 - INFO - __main__ - Step 11566: {'lr': 0.0004948647235230192, 'samples': 2220672, 'steps': 11565, 'loss/train': 1.6016157865524292} 11/06/2021 22:50:58 - INFO - __main__ - Step 11567: {'lr': 0.0004948636533950638, 'samples': 2220864, 'steps': 11566, 'loss/train': 1.8231465816497803} 11/06/2021 22:50:59 - INFO - __main__ - Step 11568: {'lr': 0.0004948625831567766, 'samples': 2221056, 'steps': 11567, 'loss/train': 1.6762038469314575} 11/06/2021 22:50:59 - INFO - __main__ - Step 11569: {'lr': 0.000494861512808158, 'samples': 2221248, 'steps': 11568, 'loss/train': 1.9984252452850342} 11/06/2021 22:50:59 - INFO - __main__ - Step 11570: {'lr': 0.0004948604423492088, 'samples': 2221440, 'steps': 11569, 'loss/train': 1.8539700508117676} 11/06/2021 22:51:00 - INFO - __main__ - Step 11571: {'lr': 0.0004948593717799292, 'samples': 2221632, 'steps': 11570, 'loss/train': 1.6391667127609253} 11/06/2021 22:51:00 - INFO - __main__ - Step 11572: {'lr': 0.0004948583011003196, 'samples': 2221824, 'steps': 11571, 'loss/train': 1.7608556747436523} 11/06/2021 22:51:00 - INFO - __main__ - Step 11573: {'lr': 0.0004948572303103808, 'samples': 2222016, 'steps': 11572, 'loss/train': 1.8692219257354736} 11/06/2021 22:51:01 - INFO - __main__ - Step 11574: {'lr': 0.0004948561594101129, 'samples': 2222208, 'steps': 11573, 'loss/train': 1.743513584136963} 11/06/2021 22:51:02 - INFO - __main__ - Step 11575: {'lr': 0.0004948550883995168, 'samples': 2222400, 'steps': 11574, 'loss/train': 1.3404990434646606} 11/06/2021 22:51:02 - INFO - __main__ - Step 11576: {'lr': 0.0004948540172785927, 'samples': 2222592, 'steps': 11575, 'loss/train': 1.5622731447219849} 11/06/2021 22:51:02 - INFO - __main__ - Step 11577: {'lr': 0.0004948529460473412, 'samples': 2222784, 'steps': 11576, 'loss/train': 1.3603805303573608} 11/06/2021 22:51:03 - INFO - __main__ - Step 11578: {'lr': 0.0004948518747057626, 'samples': 2222976, 'steps': 11577, 'loss/train': 1.4765523672103882} 11/06/2021 22:51:04 - INFO - __main__ - Step 11579: {'lr': 0.0004948508032538578, 'samples': 2223168, 'steps': 11578, 'loss/train': 1.918787956237793} 11/06/2021 22:51:04 - INFO - __main__ - Step 11580: {'lr': 0.0004948497316916267, 'samples': 2223360, 'steps': 11579, 'loss/train': 2.0067150592803955} 11/06/2021 22:51:04 - INFO - __main__ - Step 11581: {'lr': 0.0004948486600190702, 'samples': 2223552, 'steps': 11580, 'loss/train': 1.6319156885147095} 11/06/2021 22:51:05 - INFO - __main__ - Step 11582: {'lr': 0.0004948475882361888, 'samples': 2223744, 'steps': 11581, 'loss/train': 1.4225183725357056} 11/06/2021 22:51:05 - INFO - __main__ - Step 11583: {'lr': 0.0004948465163429828, 'samples': 2223936, 'steps': 11582, 'loss/train': 1.5520676374435425} 11/06/2021 22:51:06 - INFO - __main__ - Step 11584: {'lr': 0.0004948454443394527, 'samples': 2224128, 'steps': 11583, 'loss/train': 1.5501137971878052} 11/06/2021 22:51:07 - INFO - __main__ - Step 11585: {'lr': 0.000494844372225599, 'samples': 2224320, 'steps': 11584, 'loss/train': 1.650390625} 11/06/2021 22:51:07 - INFO - __main__ - Step 11586: {'lr': 0.0004948433000014222, 'samples': 2224512, 'steps': 11585, 'loss/train': 1.4051388502120972} 11/06/2021 22:51:07 - INFO - __main__ - Step 11587: {'lr': 0.0004948422276669228, 'samples': 2224704, 'steps': 11586, 'loss/train': 1.91586434841156} 11/06/2021 22:51:08 - INFO - __main__ - Step 11588: {'lr': 0.0004948411552221012, 'samples': 2224896, 'steps': 11587, 'loss/train': 2.1239876747131348} 11/06/2021 22:51:09 - INFO - __main__ - Step 11589: {'lr': 0.000494840082666958, 'samples': 2225088, 'steps': 11588, 'loss/train': 1.9688409566879272} 11/06/2021 22:51:09 - INFO - __main__ - Step 11590: {'lr': 0.0004948390100014937, 'samples': 2225280, 'steps': 11589, 'loss/train': 1.567671775817871} 11/06/2021 22:51:09 - INFO - __main__ - Step 11591: {'lr': 0.0004948379372257086, 'samples': 2225472, 'steps': 11590, 'loss/train': 1.2471174001693726} 11/06/2021 22:51:10 - INFO - __main__ - Step 11592: {'lr': 0.0004948368643396035, 'samples': 2225664, 'steps': 11591, 'loss/train': 1.732541799545288} 11/06/2021 22:51:10 - INFO - __main__ - Step 11593: {'lr': 0.0004948357913431786, 'samples': 2225856, 'steps': 11592, 'loss/train': 1.8311717510223389} 11/06/2021 22:51:11 - INFO - __main__ - Step 11594: {'lr': 0.0004948347182364344, 'samples': 2226048, 'steps': 11593, 'loss/train': 2.0419249534606934} 11/06/2021 22:51:11 - INFO - __main__ - Step 11595: {'lr': 0.0004948336450193715, 'samples': 2226240, 'steps': 11594, 'loss/train': 0.42051059007644653} 11/06/2021 22:51:12 - INFO - __main__ - Step 11596: {'lr': 0.0004948325716919904, 'samples': 2226432, 'steps': 11595, 'loss/train': 2.2684004306793213} 11/06/2021 22:51:12 - INFO - __main__ - Step 11597: {'lr': 0.0004948314982542914, 'samples': 2226624, 'steps': 11596, 'loss/train': 1.7309365272521973} 11/06/2021 22:51:13 - INFO - __main__ - Step 11598: {'lr': 0.0004948304247062752, 'samples': 2226816, 'steps': 11597, 'loss/train': 1.4194977283477783} 11/06/2021 22:51:14 - INFO - __main__ - Step 11599: {'lr': 0.0004948293510479421, 'samples': 2227008, 'steps': 11598, 'loss/train': 0.8019549250602722} 11/06/2021 22:51:14 - INFO - __main__ - Step 11600: {'lr': 0.0004948282772792927, 'samples': 2227200, 'steps': 11599, 'loss/train': 1.8629350662231445} 11/06/2021 22:51:14 - INFO - __main__ - Step 11601: {'lr': 0.0004948272034003275, 'samples': 2227392, 'steps': 11600, 'loss/train': 1.8861653804779053} 11/06/2021 22:51:15 - INFO - __main__ - Step 11602: {'lr': 0.000494826129411047, 'samples': 2227584, 'steps': 11601, 'loss/train': 1.1293734312057495} 11/06/2021 22:51:15 - INFO - __main__ - Step 11603: {'lr': 0.0004948250553114516, 'samples': 2227776, 'steps': 11602, 'loss/train': 1.6663720607757568} 11/06/2021 22:51:15 - INFO - __main__ - Step 11604: {'lr': 0.0004948239811015416, 'samples': 2227968, 'steps': 11603, 'loss/train': 1.6033754348754883} 11/06/2021 22:51:16 - INFO - __main__ - Step 11605: {'lr': 0.0004948229067813179, 'samples': 2228160, 'steps': 11604, 'loss/train': 1.3643414974212646} 11/06/2021 22:51:17 - INFO - __main__ - Step 11606: {'lr': 0.0004948218323507807, 'samples': 2228352, 'steps': 11605, 'loss/train': 2.118734836578369} 11/06/2021 22:51:17 - INFO - __main__ - Step 11607: {'lr': 0.0004948207578099306, 'samples': 2228544, 'steps': 11606, 'loss/train': 1.5554416179656982} 11/06/2021 22:51:17 - INFO - __main__ - Step 11608: {'lr': 0.000494819683158768, 'samples': 2228736, 'steps': 11607, 'loss/train': 1.433336615562439} 11/06/2021 22:51:18 - INFO - __main__ - Step 11609: {'lr': 0.0004948186083972934, 'samples': 2228928, 'steps': 11608, 'loss/train': 1.552068829536438} 11/06/2021 22:51:19 - INFO - __main__ - Step 11610: {'lr': 0.0004948175335255075, 'samples': 2229120, 'steps': 11609, 'loss/train': 1.240715503692627} 11/06/2021 22:51:19 - INFO - __main__ - Step 11611: {'lr': 0.0004948164585434104, 'samples': 2229312, 'steps': 11610, 'loss/train': 1.7166132926940918} 11/06/2021 22:51:20 - INFO - __main__ - Step 11612: {'lr': 0.0004948153834510028, 'samples': 2229504, 'steps': 11611, 'loss/train': 1.7382307052612305} 11/06/2021 22:51:20 - INFO - __main__ - Step 11613: {'lr': 0.0004948143082482852, 'samples': 2229696, 'steps': 11612, 'loss/train': 1.9601466655731201} 11/06/2021 22:51:20 - INFO - __main__ - Step 11614: {'lr': 0.0004948132329352582, 'samples': 2229888, 'steps': 11613, 'loss/train': 1.7500718832015991} 11/06/2021 22:51:21 - INFO - __main__ - Step 11615: {'lr': 0.0004948121575119219, 'samples': 2230080, 'steps': 11614, 'loss/train': 5.872844696044922} 11/06/2021 22:51:22 - INFO - __main__ - Step 11616: {'lr': 0.0004948110819782771, 'samples': 2230272, 'steps': 11615, 'loss/train': 1.6911870241165161} 11/06/2021 22:51:22 - INFO - __main__ - Step 11617: {'lr': 0.0004948100063343243, 'samples': 2230464, 'steps': 11616, 'loss/train': 1.594088077545166} 11/06/2021 22:51:22 - INFO - __main__ - Step 11618: {'lr': 0.0004948089305800638, 'samples': 2230656, 'steps': 11617, 'loss/train': 1.9734327793121338} 11/06/2021 22:51:23 - INFO - __main__ - Step 11619: {'lr': 0.0004948078547154962, 'samples': 2230848, 'steps': 11618, 'loss/train': 1.6274333000183105} 11/06/2021 22:51:24 - INFO - __main__ - Step 11620: {'lr': 0.0004948067787406219, 'samples': 2231040, 'steps': 11619, 'loss/train': 2.051081418991089} 11/06/2021 22:51:24 - INFO - __main__ - Step 11621: {'lr': 0.0004948057026554415, 'samples': 2231232, 'steps': 11620, 'loss/train': 1.4982271194458008} 11/06/2021 22:51:24 - INFO - __main__ - Step 11622: {'lr': 0.0004948046264599554, 'samples': 2231424, 'steps': 11621, 'loss/train': 1.4409050941467285} 11/06/2021 22:51:25 - INFO - __main__ - Step 11623: {'lr': 0.0004948035501541641, 'samples': 2231616, 'steps': 11622, 'loss/train': 2.061194658279419} 11/06/2021 22:51:25 - INFO - __main__ - Step 11624: {'lr': 0.0004948024737380681, 'samples': 2231808, 'steps': 11623, 'loss/train': 1.7897642850875854} 11/06/2021 22:51:26 - INFO - __main__ - Step 11625: {'lr': 0.000494801397211668, 'samples': 2232000, 'steps': 11624, 'loss/train': 1.5584590435028076} 11/06/2021 22:51:27 - INFO - __main__ - Step 11626: {'lr': 0.000494800320574964, 'samples': 2232192, 'steps': 11625, 'loss/train': 1.4107680320739746} 11/06/2021 22:51:27 - INFO - __main__ - Step 11627: {'lr': 0.0004947992438279568, 'samples': 2232384, 'steps': 11626, 'loss/train': 1.7100175619125366} 11/06/2021 22:51:27 - INFO - __main__ - Step 11628: {'lr': 0.0004947981669706469, 'samples': 2232576, 'steps': 11627, 'loss/train': 1.3465707302093506} 11/06/2021 22:51:28 - INFO - __main__ - Step 11629: {'lr': 0.0004947970900030346, 'samples': 2232768, 'steps': 11628, 'loss/train': 1.6569414138793945} 11/06/2021 22:51:28 - INFO - __main__ - Step 11630: {'lr': 0.0004947960129251206, 'samples': 2232960, 'steps': 11629, 'loss/train': 1.8013361692428589} 11/06/2021 22:51:29 - INFO - __main__ - Step 11631: {'lr': 0.0004947949357369054, 'samples': 2233152, 'steps': 11630, 'loss/train': 1.4865167140960693} 11/06/2021 22:51:29 - INFO - __main__ - Step 11632: {'lr': 0.0004947938584383892, 'samples': 2233344, 'steps': 11631, 'loss/train': 2.1401309967041016} 11/06/2021 22:51:30 - INFO - __main__ - Step 11633: {'lr': 0.0004947927810295728, 'samples': 2233536, 'steps': 11632, 'loss/train': 1.6527354717254639} 11/06/2021 22:51:30 - INFO - __main__ - Step 11634: {'lr': 0.0004947917035104564, 'samples': 2233728, 'steps': 11633, 'loss/train': 1.9105935096740723} 11/06/2021 22:51:30 - INFO - __main__ - Step 11635: {'lr': 0.0004947906258810407, 'samples': 2233920, 'steps': 11634, 'loss/train': 1.906631350517273} 11/06/2021 22:51:31 - INFO - __main__ - Step 11636: {'lr': 0.0004947895481413262, 'samples': 2234112, 'steps': 11635, 'loss/train': 2.1798081398010254} 11/06/2021 22:51:32 - INFO - __main__ - Step 11637: {'lr': 0.0004947884702913133, 'samples': 2234304, 'steps': 11636, 'loss/train': 1.9816797971725464} 11/06/2021 22:51:32 - INFO - __main__ - Step 11638: {'lr': 0.0004947873923310024, 'samples': 2234496, 'steps': 11637, 'loss/train': 1.7533513307571411} 11/06/2021 22:51:32 - INFO - __main__ - Step 11639: {'lr': 0.0004947863142603941, 'samples': 2234688, 'steps': 11638, 'loss/train': 1.5233229398727417} 11/06/2021 22:51:33 - INFO - __main__ - Step 11640: {'lr': 0.0004947852360794889, 'samples': 2234880, 'steps': 11639, 'loss/train': 1.628521203994751} 11/06/2021 22:51:34 - INFO - __main__ - Step 11641: {'lr': 0.0004947841577882873, 'samples': 2235072, 'steps': 11640, 'loss/train': 1.8481824398040771} 11/06/2021 22:51:34 - INFO - __main__ - Step 11642: {'lr': 0.0004947830793867896, 'samples': 2235264, 'steps': 11641, 'loss/train': 2.2054293155670166} 11/06/2021 22:51:34 - INFO - __main__ - Step 11643: {'lr': 0.0004947820008749965, 'samples': 2235456, 'steps': 11642, 'loss/train': 1.8366459608078003} 11/06/2021 22:51:35 - INFO - __main__ - Step 11644: {'lr': 0.0004947809222529084, 'samples': 2235648, 'steps': 11643, 'loss/train': 1.237697720527649} 11/06/2021 22:51:35 - INFO - __main__ - Step 11645: {'lr': 0.0004947798435205258, 'samples': 2235840, 'steps': 11644, 'loss/train': 1.8298448324203491} 11/06/2021 22:51:36 - INFO - __main__ - Step 11646: {'lr': 0.0004947787646778491, 'samples': 2236032, 'steps': 11645, 'loss/train': 1.7138854265213013} 11/06/2021 22:51:36 - INFO - __main__ - Step 11647: {'lr': 0.0004947776857248791, 'samples': 2236224, 'steps': 11646, 'loss/train': 1.7913033962249756} 11/06/2021 22:51:37 - INFO - __main__ - Step 11648: {'lr': 0.0004947766066616157, 'samples': 2236416, 'steps': 11647, 'loss/train': 1.7357981204986572} 11/06/2021 22:51:37 - INFO - __main__ - Step 11649: {'lr': 0.00049477552748806, 'samples': 2236608, 'steps': 11648, 'loss/train': 1.6491725444793701} 11/06/2021 22:51:38 - INFO - __main__ - Step 11650: {'lr': 0.0004947744482042122, 'samples': 2236800, 'steps': 11649, 'loss/train': 1.3429278135299683} 11/06/2021 22:51:38 - INFO - __main__ - Step 11651: {'lr': 0.0004947733688100728, 'samples': 2236992, 'steps': 11650, 'loss/train': 1.4097791910171509} 11/06/2021 22:51:39 - INFO - __main__ - Step 11652: {'lr': 0.0004947722893056423, 'samples': 2237184, 'steps': 11651, 'loss/train': 1.9435794353485107} 11/06/2021 22:51:39 - INFO - __main__ - Step 11653: {'lr': 0.0004947712096909211, 'samples': 2237376, 'steps': 11652, 'loss/train': 1.272664189338684} 11/06/2021 22:51:40 - INFO - __main__ - Step 11654: {'lr': 0.0004947701299659097, 'samples': 2237568, 'steps': 11653, 'loss/train': 1.6220483779907227} 11/06/2021 22:51:40 - INFO - __main__ - Step 11655: {'lr': 0.0004947690501306088, 'samples': 2237760, 'steps': 11654, 'loss/train': 2.2113709449768066} 11/06/2021 22:51:40 - INFO - __main__ - Step 11656: {'lr': 0.0004947679701850187, 'samples': 2237952, 'steps': 11655, 'loss/train': 1.6100585460662842} 11/06/2021 22:51:41 - INFO - __main__ - Step 11657: {'lr': 0.00049476689012914, 'samples': 2238144, 'steps': 11656, 'loss/train': 1.5602444410324097} 11/06/2021 22:51:42 - INFO - __main__ - Step 11658: {'lr': 0.0004947658099629731, 'samples': 2238336, 'steps': 11657, 'loss/train': 1.4081251621246338} 11/06/2021 22:51:42 - INFO - __main__ - Step 11659: {'lr': 0.0004947647296865184, 'samples': 2238528, 'steps': 11658, 'loss/train': 1.445473074913025} 11/06/2021 22:51:42 - INFO - __main__ - Step 11660: {'lr': 0.0004947636492997765, 'samples': 2238720, 'steps': 11659, 'loss/train': 1.6339030265808105} 11/06/2021 22:51:43 - INFO - __main__ - Step 11661: {'lr': 0.0004947625688027479, 'samples': 2238912, 'steps': 11660, 'loss/train': 1.8317729234695435} 11/06/2021 22:51:44 - INFO - __main__ - Step 11662: {'lr': 0.0004947614881954332, 'samples': 2239104, 'steps': 11661, 'loss/train': 1.262568712234497} 11/06/2021 22:51:44 - INFO - __main__ - Step 11663: {'lr': 0.0004947604074778325, 'samples': 2239296, 'steps': 11662, 'loss/train': 1.5094019174575806} 11/06/2021 22:51:45 - INFO - __main__ - Step 11664: {'lr': 0.0004947593266499468, 'samples': 2239488, 'steps': 11663, 'loss/train': 1.4688637256622314} 11/06/2021 22:51:45 - INFO - __main__ - Step 11665: {'lr': 0.0004947582457117762, 'samples': 2239680, 'steps': 11664, 'loss/train': 2.3043243885040283} 11/06/2021 22:51:45 - INFO - __main__ - Step 11666: {'lr': 0.0004947571646633214, 'samples': 2239872, 'steps': 11665, 'loss/train': 1.4841505289077759} 11/06/2021 22:51:46 - INFO - __main__ - Step 11667: {'lr': 0.0004947560835045826, 'samples': 2240064, 'steps': 11666, 'loss/train': 1.7168904542922974} 11/06/2021 22:51:47 - INFO - __main__ - Step 11668: {'lr': 0.0004947550022355606, 'samples': 2240256, 'steps': 11667, 'loss/train': 1.6431477069854736} 11/06/2021 22:51:47 - INFO - __main__ - Step 11669: {'lr': 0.0004947539208562558, 'samples': 2240448, 'steps': 11668, 'loss/train': 0.3917955160140991} 11/06/2021 22:51:47 - INFO - __main__ - Step 11670: {'lr': 0.0004947528393666686, 'samples': 2240640, 'steps': 11669, 'loss/train': 1.4626667499542236} 11/06/2021 22:51:48 - INFO - __main__ - Step 11671: {'lr': 0.0004947517577667996, 'samples': 2240832, 'steps': 11670, 'loss/train': 1.9244534969329834} 11/06/2021 22:51:48 - INFO - __main__ - Step 11672: {'lr': 0.0004947506760566492, 'samples': 2241024, 'steps': 11671, 'loss/train': 1.6915565729141235} 11/06/2021 22:51:49 - INFO - __main__ - Step 11673: {'lr': 0.0004947495942362179, 'samples': 2241216, 'steps': 11672, 'loss/train': 1.8372933864593506} 11/06/2021 22:51:49 - INFO - __main__ - Step 11674: {'lr': 0.0004947485123055063, 'samples': 2241408, 'steps': 11673, 'loss/train': 1.7340296506881714} 11/06/2021 22:51:50 - INFO - __main__ - Step 11675: {'lr': 0.0004947474302645147, 'samples': 2241600, 'steps': 11674, 'loss/train': 2.306028127670288} 11/06/2021 22:51:50 - INFO - __main__ - Step 11676: {'lr': 0.0004947463481132438, 'samples': 2241792, 'steps': 11675, 'loss/train': 1.5373907089233398} 11/06/2021 22:51:50 - INFO - __main__ - Step 11677: {'lr': 0.0004947452658516938, 'samples': 2241984, 'steps': 11676, 'loss/train': 1.0355230569839478} 11/06/2021 22:51:52 - INFO - __main__ - Step 11678: {'lr': 0.0004947441834798655, 'samples': 2242176, 'steps': 11677, 'loss/train': 1.9624687433242798} 11/06/2021 22:51:52 - INFO - __main__ - Step 11679: {'lr': 0.0004947431009977592, 'samples': 2242368, 'steps': 11678, 'loss/train': 2.2089884281158447} 11/06/2021 22:51:52 - INFO - __main__ - Step 11680: {'lr': 0.0004947420184053755, 'samples': 2242560, 'steps': 11679, 'loss/train': 1.4249625205993652} 11/06/2021 22:51:53 - INFO - __main__ - Step 11681: {'lr': 0.0004947409357027148, 'samples': 2242752, 'steps': 11680, 'loss/train': 1.3146612644195557} 11/06/2021 22:51:53 - INFO - __main__ - Step 11682: {'lr': 0.0004947398528897775, 'samples': 2242944, 'steps': 11681, 'loss/train': 1.6115343570709229} 11/06/2021 22:51:53 - INFO - __main__ - Step 11683: {'lr': 0.0004947387699665643, 'samples': 2243136, 'steps': 11682, 'loss/train': 2.1556644439697266} 11/06/2021 22:51:54 - INFO - __main__ - Step 11684: {'lr': 0.0004947376869330755, 'samples': 2243328, 'steps': 11683, 'loss/train': 2.4655892848968506} 11/06/2021 22:51:55 - INFO - __main__ - Step 11685: {'lr': 0.0004947366037893118, 'samples': 2243520, 'steps': 11684, 'loss/train': 1.4725700616836548} 11/06/2021 22:51:55 - INFO - __main__ - Step 11686: {'lr': 0.0004947355205352735, 'samples': 2243712, 'steps': 11685, 'loss/train': 1.7428011894226074} 11/06/2021 22:51:55 - INFO - __main__ - Step 11687: {'lr': 0.0004947344371709611, 'samples': 2243904, 'steps': 11686, 'loss/train': 1.42606520652771} 11/06/2021 22:51:56 - INFO - __main__ - Step 11688: {'lr': 0.0004947333536963753, 'samples': 2244096, 'steps': 11687, 'loss/train': 1.7653391361236572} 11/06/2021 22:51:57 - INFO - __main__ - Step 11689: {'lr': 0.0004947322701115163, 'samples': 2244288, 'steps': 11688, 'loss/train': 2.0010745525360107} 11/06/2021 22:51:57 - INFO - __main__ - Step 11690: {'lr': 0.0004947311864163847, 'samples': 2244480, 'steps': 11689, 'loss/train': 1.4831074476242065} 11/06/2021 22:51:58 - INFO - __main__ - Step 11691: {'lr': 0.000494730102610981, 'samples': 2244672, 'steps': 11690, 'loss/train': 1.9865690469741821} 11/06/2021 22:51:58 - INFO - __main__ - Step 11692: {'lr': 0.0004947290186953057, 'samples': 2244864, 'steps': 11691, 'loss/train': 1.3742139339447021} 11/06/2021 22:51:58 - INFO - __main__ - Step 11693: {'lr': 0.0004947279346693594, 'samples': 2245056, 'steps': 11692, 'loss/train': 1.701667308807373} 11/06/2021 22:51:59 - INFO - __main__ - Step 11694: {'lr': 0.0004947268505331424, 'samples': 2245248, 'steps': 11693, 'loss/train': 1.9292877912521362} 11/06/2021 22:52:00 - INFO - __main__ - Step 11695: {'lr': 0.0004947257662866551, 'samples': 2245440, 'steps': 11694, 'loss/train': 2.0025382041931152} 11/06/2021 22:52:00 - INFO - __main__ - Step 11696: {'lr': 0.0004947246819298984, 'samples': 2245632, 'steps': 11695, 'loss/train': 1.10530424118042} 11/06/2021 22:52:00 - INFO - __main__ - Step 11697: {'lr': 0.0004947235974628723, 'samples': 2245824, 'steps': 11696, 'loss/train': 1.703428864479065} 11/06/2021 22:52:01 - INFO - __main__ - Step 11698: {'lr': 0.0004947225128855777, 'samples': 2246016, 'steps': 11697, 'loss/train': 1.4115582704544067} 11/06/2021 22:52:02 - INFO - __main__ - Step 11699: {'lr': 0.0004947214281980149, 'samples': 2246208, 'steps': 11698, 'loss/train': 1.6013803482055664} 11/06/2021 22:52:02 - INFO - __main__ - Step 11700: {'lr': 0.0004947203434001843, 'samples': 2246400, 'steps': 11699, 'loss/train': 0.8683410882949829} 11/06/2021 22:52:02 - INFO - __main__ - Step 11701: {'lr': 0.0004947192584920866, 'samples': 2246592, 'steps': 11700, 'loss/train': 1.931963324546814} 11/06/2021 22:52:03 - INFO - __main__ - Step 11702: {'lr': 0.000494718173473722, 'samples': 2246784, 'steps': 11701, 'loss/train': 1.411441683769226} 11/06/2021 22:52:03 - INFO - __main__ - Step 11703: {'lr': 0.0004947170883450913, 'samples': 2246976, 'steps': 11702, 'loss/train': 1.5308102369308472} 11/06/2021 22:52:04 - INFO - __main__ - Step 11704: {'lr': 0.000494716003106195, 'samples': 2247168, 'steps': 11703, 'loss/train': 2.047659158706665} 11/06/2021 22:52:05 - INFO - __main__ - Step 11705: {'lr': 0.0004947149177570332, 'samples': 2247360, 'steps': 11704, 'loss/train': 1.6442228555679321} 11/06/2021 22:52:05 - INFO - __main__ - Step 11706: {'lr': 0.0004947138322976067, 'samples': 2247552, 'steps': 11705, 'loss/train': 1.791576623916626} 11/06/2021 22:52:05 - INFO - __main__ - Step 11707: {'lr': 0.000494712746727916, 'samples': 2247744, 'steps': 11706, 'loss/train': 1.8355753421783447} 11/06/2021 22:52:06 - INFO - __main__ - Step 11708: {'lr': 0.0004947116610479614, 'samples': 2247936, 'steps': 11707, 'loss/train': 1.7016065120697021} 11/06/2021 22:52:06 - INFO - __main__ - Step 11709: {'lr': 0.0004947105752577436, 'samples': 2248128, 'steps': 11708, 'loss/train': 1.3050994873046875} 11/06/2021 22:52:07 - INFO - __main__ - Step 11710: {'lr': 0.0004947094893572629, 'samples': 2248320, 'steps': 11709, 'loss/train': 1.8479158878326416} 11/06/2021 22:52:08 - INFO - __main__ - Step 11711: {'lr': 0.00049470840334652, 'samples': 2248512, 'steps': 11710, 'loss/train': 1.892972707748413} 11/06/2021 22:52:08 - INFO - __main__ - Step 11712: {'lr': 0.0004947073172255151, 'samples': 2248704, 'steps': 11711, 'loss/train': 2.1024181842803955} 11/06/2021 22:52:08 - INFO - __main__ - Step 11713: {'lr': 0.000494706230994249, 'samples': 2248896, 'steps': 11712, 'loss/train': 1.7337501049041748} 11/06/2021 22:52:09 - INFO - __main__ - Step 11714: {'lr': 0.000494705144652722, 'samples': 2249088, 'steps': 11713, 'loss/train': 2.053222894668579} 11/06/2021 22:52:10 - INFO - __main__ - Step 11715: {'lr': 0.0004947040582009346, 'samples': 2249280, 'steps': 11714, 'loss/train': 1.5705617666244507} 11/06/2021 22:52:10 - INFO - __main__ - Step 11716: {'lr': 0.0004947029716388875, 'samples': 2249472, 'steps': 11715, 'loss/train': 1.1817337274551392} 11/06/2021 22:52:11 - INFO - __main__ - Step 11717: {'lr': 0.0004947018849665809, 'samples': 2249664, 'steps': 11716, 'loss/train': 1.7236446142196655} 11/06/2021 22:52:11 - INFO - __main__ - Step 11718: {'lr': 0.0004947007981840153, 'samples': 2249856, 'steps': 11717, 'loss/train': 1.561780571937561} 11/06/2021 22:52:11 - INFO - __main__ - Step 11719: {'lr': 0.0004946997112911914, 'samples': 2250048, 'steps': 11718, 'loss/train': 1.9812180995941162} 11/06/2021 22:52:12 - INFO - __main__ - Step 11720: {'lr': 0.0004946986242881096, 'samples': 2250240, 'steps': 11719, 'loss/train': 2.1320106983184814} 11/06/2021 22:52:13 - INFO - __main__ - Step 11721: {'lr': 0.0004946975371747704, 'samples': 2250432, 'steps': 11720, 'loss/train': 2.156588315963745} 11/06/2021 22:52:13 - INFO - __main__ - Step 11722: {'lr': 0.0004946964499511742, 'samples': 2250624, 'steps': 11721, 'loss/train': 1.9945005178451538} 11/06/2021 22:52:13 - INFO - __main__ - Step 11723: {'lr': 0.0004946953626173216, 'samples': 2250816, 'steps': 11722, 'loss/train': 1.00374174118042} 11/06/2021 22:52:14 - INFO - __main__ - Step 11724: {'lr': 0.0004946942751732129, 'samples': 2251008, 'steps': 11723, 'loss/train': 1.2549642324447632} 11/06/2021 22:52:14 - INFO - __main__ - Step 11725: {'lr': 0.000494693187618849, 'samples': 2251200, 'steps': 11724, 'loss/train': 1.6982334852218628} 11/06/2021 22:52:15 - INFO - __main__ - Step 11726: {'lr': 0.0004946920999542299, 'samples': 2251392, 'steps': 11725, 'loss/train': 5.76206636428833} 11/06/2021 22:52:15 - INFO - __main__ - Step 11727: {'lr': 0.0004946910121793564, 'samples': 2251584, 'steps': 11726, 'loss/train': 1.8769844770431519} 11/06/2021 22:52:16 - INFO - __main__ - Step 11728: {'lr': 0.0004946899242942289, 'samples': 2251776, 'steps': 11727, 'loss/train': 1.1642225980758667} 11/06/2021 22:52:16 - INFO - __main__ - Step 11729: {'lr': 0.000494688836298848, 'samples': 2251968, 'steps': 11728, 'loss/train': 1.9926396608352661} 11/06/2021 22:52:16 - INFO - __main__ - Step 11730: {'lr': 0.0004946877481932139, 'samples': 2252160, 'steps': 11729, 'loss/train': 1.9353605508804321} 11/06/2021 22:52:17 - INFO - __main__ - Step 11731: {'lr': 0.0004946866599773274, 'samples': 2252352, 'steps': 11730, 'loss/train': 1.884238839149475} 11/06/2021 22:52:18 - INFO - __main__ - Step 11732: {'lr': 0.0004946855716511888, 'samples': 2252544, 'steps': 11731, 'loss/train': 2.3865725994110107} 11/06/2021 22:52:18 - INFO - __main__ - Step 11733: {'lr': 0.0004946844832147987, 'samples': 2252736, 'steps': 11732, 'loss/train': 1.7648876905441284} 11/06/2021 22:52:18 - INFO - __main__ - Step 11734: {'lr': 0.0004946833946681575, 'samples': 2252928, 'steps': 11733, 'loss/train': 1.9588221311569214} 11/06/2021 22:52:19 - INFO - __main__ - Step 11735: {'lr': 0.0004946823060112658, 'samples': 2253120, 'steps': 11734, 'loss/train': 1.0032458305358887} 11/06/2021 22:52:20 - INFO - __main__ - Step 11736: {'lr': 0.000494681217244124, 'samples': 2253312, 'steps': 11735, 'loss/train': 1.9610958099365234} 11/06/2021 22:52:20 - INFO - __main__ - Step 11737: {'lr': 0.0004946801283667326, 'samples': 2253504, 'steps': 11736, 'loss/train': 1.6213114261627197} 11/06/2021 22:52:21 - INFO - __main__ - Step 11738: {'lr': 0.0004946790393790921, 'samples': 2253696, 'steps': 11737, 'loss/train': 1.8059797286987305} 11/06/2021 22:52:21 - INFO - __main__ - Step 11739: {'lr': 0.0004946779502812031, 'samples': 2253888, 'steps': 11738, 'loss/train': 2.2257320880889893} 11/06/2021 22:52:21 - INFO - __main__ - Step 11740: {'lr': 0.0004946768610730659, 'samples': 2254080, 'steps': 11739, 'loss/train': 1.6921347379684448} 11/06/2021 22:52:22 - INFO - __main__ - Step 11741: {'lr': 0.0004946757717546812, 'samples': 2254272, 'steps': 11740, 'loss/train': 1.7304112911224365} 11/06/2021 22:52:23 - INFO - __main__ - Step 11742: {'lr': 0.0004946746823260491, 'samples': 2254464, 'steps': 11741, 'loss/train': 0.8344317078590393} 11/06/2021 22:52:23 - INFO - __main__ - Step 11743: {'lr': 0.0004946735927871706, 'samples': 2254656, 'steps': 11742, 'loss/train': 1.438720941543579} 11/06/2021 22:52:23 - INFO - __main__ - Step 11744: {'lr': 0.0004946725031380459, 'samples': 2254848, 'steps': 11743, 'loss/train': 2.020857572555542} 11/06/2021 22:52:24 - INFO - __main__ - Step 11745: {'lr': 0.0004946714133786756, 'samples': 2255040, 'steps': 11744, 'loss/train': 1.5829726457595825} 11/06/2021 22:52:24 - INFO - __main__ - Step 11746: {'lr': 0.00049467032350906, 'samples': 2255232, 'steps': 11745, 'loss/train': 1.7040636539459229} 11/06/2021 22:52:25 - INFO - __main__ - Step 11747: {'lr': 0.0004946692335291999, 'samples': 2255424, 'steps': 11746, 'loss/train': 1.7753183841705322} 11/06/2021 22:52:26 - INFO - __main__ - Step 11748: {'lr': 0.0004946681434390955, 'samples': 2255616, 'steps': 11747, 'loss/train': 2.8313393592834473} 11/06/2021 22:52:26 - INFO - __main__ - Step 11749: {'lr': 0.0004946670532387474, 'samples': 2255808, 'steps': 11748, 'loss/train': 1.4932595491409302} 11/06/2021 22:52:26 - INFO - __main__ - Step 11750: {'lr': 0.0004946659629281561, 'samples': 2256000, 'steps': 11749, 'loss/train': 1.6414493322372437} 11/06/2021 22:52:27 - INFO - __main__ - Step 11751: {'lr': 0.0004946648725073222, 'samples': 2256192, 'steps': 11750, 'loss/train': 1.8771973848342896} 11/06/2021 22:52:27 - INFO - __main__ - Step 11752: {'lr': 0.0004946637819762459, 'samples': 2256384, 'steps': 11751, 'loss/train': 1.819173812866211} 11/06/2021 22:52:28 - INFO - __main__ - Step 11753: {'lr': 0.000494662691334928, 'samples': 2256576, 'steps': 11752, 'loss/train': 1.7248607873916626} 11/06/2021 22:52:29 - INFO - __main__ - Step 11754: {'lr': 0.0004946616005833689, 'samples': 2256768, 'steps': 11753, 'loss/train': 1.390769362449646} 11/06/2021 22:52:29 - INFO - __main__ - Step 11755: {'lr': 0.0004946605097215691, 'samples': 2256960, 'steps': 11754, 'loss/train': 1.4197173118591309} 11/06/2021 22:52:29 - INFO - __main__ - Step 11756: {'lr': 0.0004946594187495289, 'samples': 2257152, 'steps': 11755, 'loss/train': 1.846717357635498} 11/06/2021 22:52:30 - INFO - __main__ - Step 11757: {'lr': 0.0004946583276672489, 'samples': 2257344, 'steps': 11756, 'loss/train': 1.6870449781417847} 11/06/2021 22:52:31 - INFO - __main__ - Step 11758: {'lr': 0.0004946572364747298, 'samples': 2257536, 'steps': 11757, 'loss/train': 1.6424260139465332} 11/06/2021 22:52:31 - INFO - __main__ - Step 11759: {'lr': 0.0004946561451719719, 'samples': 2257728, 'steps': 11758, 'loss/train': 1.8925001621246338} 11/06/2021 22:52:32 - INFO - __main__ - Step 11760: {'lr': 0.0004946550537589757, 'samples': 2257920, 'steps': 11759, 'loss/train': 1.6655386686325073} 11/06/2021 22:52:32 - INFO - __main__ - Step 11761: {'lr': 0.0004946539622357417, 'samples': 2258112, 'steps': 11760, 'loss/train': 1.9449368715286255} 11/06/2021 22:52:32 - INFO - __main__ - Step 11762: {'lr': 0.0004946528706022703, 'samples': 2258304, 'steps': 11761, 'loss/train': 2.078275442123413} 11/06/2021 22:52:33 - INFO - __main__ - Step 11763: {'lr': 0.0004946517788585622, 'samples': 2258496, 'steps': 11762, 'loss/train': 1.4620633125305176} 11/06/2021 22:52:34 - INFO - __main__ - Step 11764: {'lr': 0.0004946506870046178, 'samples': 2258688, 'steps': 11763, 'loss/train': 1.66291344165802} 11/06/2021 22:52:34 - INFO - __main__ - Step 11765: {'lr': 0.0004946495950404375, 'samples': 2258880, 'steps': 11764, 'loss/train': 1.7311233282089233} 11/06/2021 22:52:34 - INFO - __main__ - Step 11766: {'lr': 0.0004946485029660219, 'samples': 2259072, 'steps': 11765, 'loss/train': 1.9496246576309204} 11/06/2021 22:52:35 - INFO - __main__ - Step 11767: {'lr': 0.0004946474107813715, 'samples': 2259264, 'steps': 11766, 'loss/train': 1.842336654663086} 11/06/2021 22:52:35 - INFO - __main__ - Step 11768: {'lr': 0.0004946463184864867, 'samples': 2259456, 'steps': 11767, 'loss/train': 1.8693801164627075} 11/06/2021 22:52:36 - INFO - __main__ - Step 11769: {'lr': 0.000494645226081368, 'samples': 2259648, 'steps': 11768, 'loss/train': 1.6797724962234497} 11/06/2021 22:52:36 - INFO - __main__ - Step 11770: {'lr': 0.000494644133566016, 'samples': 2259840, 'steps': 11769, 'loss/train': 1.4810084104537964} 11/06/2021 22:52:37 - INFO - __main__ - Step 11771: {'lr': 0.0004946430409404311, 'samples': 2260032, 'steps': 11770, 'loss/train': 1.5809372663497925} 11/06/2021 22:52:37 - INFO - __main__ - Step 11772: {'lr': 0.0004946419482046139, 'samples': 2260224, 'steps': 11771, 'loss/train': 1.3877747058868408} 11/06/2021 22:52:38 - INFO - __main__ - Step 11773: {'lr': 0.0004946408553585648, 'samples': 2260416, 'steps': 11772, 'loss/train': 1.1509099006652832} 11/06/2021 22:52:38 - INFO - __main__ - Step 11774: {'lr': 0.0004946397624022843, 'samples': 2260608, 'steps': 11773, 'loss/train': 2.2780723571777344} 11/06/2021 22:52:39 - INFO - __main__ - Step 11775: {'lr': 0.0004946386693357728, 'samples': 2260800, 'steps': 11774, 'loss/train': 1.5526584386825562} 11/06/2021 22:52:39 - INFO - __main__ - Step 11776: {'lr': 0.0004946375761590309, 'samples': 2260992, 'steps': 11775, 'loss/train': 2.048732280731201} 11/06/2021 22:52:39 - INFO - __main__ - Step 11777: {'lr': 0.0004946364828720592, 'samples': 2261184, 'steps': 11776, 'loss/train': 1.518662929534912} 11/06/2021 22:52:40 - INFO - __main__ - Step 11778: {'lr': 0.000494635389474858, 'samples': 2261376, 'steps': 11777, 'loss/train': 2.078202962875366} 11/06/2021 22:52:41 - INFO - __main__ - Step 11779: {'lr': 0.0004946342959674278, 'samples': 2261568, 'steps': 11778, 'loss/train': 1.6700574159622192} 11/06/2021 22:52:41 - INFO - __main__ - Step 11780: {'lr': 0.0004946332023497693, 'samples': 2261760, 'steps': 11779, 'loss/train': 2.0931074619293213} 11/06/2021 22:52:42 - INFO - __main__ - Step 11781: {'lr': 0.0004946321086218828, 'samples': 2261952, 'steps': 11780, 'loss/train': 1.8575305938720703} 11/06/2021 22:52:42 - INFO - __main__ - Step 11782: {'lr': 0.0004946310147837689, 'samples': 2262144, 'steps': 11781, 'loss/train': 1.1181639432907104} 11/06/2021 22:52:42 - INFO - __main__ - Step 11783: {'lr': 0.0004946299208354279, 'samples': 2262336, 'steps': 11782, 'loss/train': 1.7613064050674438} 11/06/2021 22:52:43 - INFO - __main__ - Step 11784: {'lr': 0.0004946288267768605, 'samples': 2262528, 'steps': 11783, 'loss/train': 1.5127317905426025} 11/06/2021 22:52:44 - INFO - __main__ - Step 11785: {'lr': 0.0004946277326080672, 'samples': 2262720, 'steps': 11784, 'loss/train': 1.9431092739105225} 11/06/2021 22:52:44 - INFO - __main__ - Step 11786: {'lr': 0.0004946266383290483, 'samples': 2262912, 'steps': 11785, 'loss/train': 2.022489547729492} 11/06/2021 22:52:44 - INFO - __main__ - Step 11787: {'lr': 0.0004946255439398045, 'samples': 2263104, 'steps': 11786, 'loss/train': 1.829660177230835} 11/06/2021 22:52:45 - INFO - __main__ - Step 11788: {'lr': 0.0004946244494403361, 'samples': 2263296, 'steps': 11787, 'loss/train': 1.7674616575241089} 11/06/2021 22:52:45 - INFO - __main__ - Step 11789: {'lr': 0.0004946233548306438, 'samples': 2263488, 'steps': 11788, 'loss/train': 1.8133692741394043} 11/06/2021 22:52:46 - INFO - __main__ - Step 11790: {'lr': 0.000494622260110728, 'samples': 2263680, 'steps': 11789, 'loss/train': 1.611851453781128} 11/06/2021 22:52:46 - INFO - __main__ - Step 11791: {'lr': 0.0004946211652805891, 'samples': 2263872, 'steps': 11790, 'loss/train': 1.1969035863876343} 11/06/2021 22:52:47 - INFO - __main__ - Step 11792: {'lr': 0.0004946200703402278, 'samples': 2264064, 'steps': 11791, 'loss/train': 2.1586251258850098} 11/06/2021 22:52:47 - INFO - __main__ - Step 11793: {'lr': 0.0004946189752896443, 'samples': 2264256, 'steps': 11792, 'loss/train': 1.361173152923584} 11/06/2021 22:52:47 - INFO - __main__ - Step 11794: {'lr': 0.0004946178801288394, 'samples': 2264448, 'steps': 11793, 'loss/train': 1.3458670377731323} 11/06/2021 22:52:48 - INFO - __main__ - Step 11795: {'lr': 0.0004946167848578134, 'samples': 2264640, 'steps': 11794, 'loss/train': 1.4906079769134521} 11/06/2021 22:52:49 - INFO - __main__ - Step 11796: {'lr': 0.0004946156894765669, 'samples': 2264832, 'steps': 11795, 'loss/train': 1.776194453239441} 11/06/2021 22:52:49 - INFO - __main__ - Step 11797: {'lr': 0.0004946145939851004, 'samples': 2265024, 'steps': 11796, 'loss/train': 1.1889792680740356} 11/06/2021 22:52:49 - INFO - __main__ - Step 11798: {'lr': 0.0004946134983834142, 'samples': 2265216, 'steps': 11797, 'loss/train': 1.3798962831497192} 11/06/2021 22:52:50 - INFO - __main__ - Step 11799: {'lr': 0.0004946124026715089, 'samples': 2265408, 'steps': 11798, 'loss/train': 1.1393816471099854} 11/06/2021 22:52:51 - INFO - __main__ - Step 11800: {'lr': 0.0004946113068493851, 'samples': 2265600, 'steps': 11799, 'loss/train': 1.938144326210022} 11/06/2021 22:52:51 - INFO - __main__ - Step 11801: {'lr': 0.0004946102109170433, 'samples': 2265792, 'steps': 11800, 'loss/train': 1.085551381111145} 11/06/2021 22:52:52 - INFO - __main__ - Step 11802: {'lr': 0.0004946091148744838, 'samples': 2265984, 'steps': 11801, 'loss/train': 1.9782894849777222} 11/06/2021 22:52:52 - INFO - __main__ - Step 11803: {'lr': 0.0004946080187217072, 'samples': 2266176, 'steps': 11802, 'loss/train': 1.8065892457962036} 11/06/2021 22:52:52 - INFO - __main__ - Step 11804: {'lr': 0.0004946069224587141, 'samples': 2266368, 'steps': 11803, 'loss/train': 1.8885177373886108} 11/06/2021 22:52:53 - INFO - __main__ - Step 11805: {'lr': 0.0004946058260855049, 'samples': 2266560, 'steps': 11804, 'loss/train': 1.6581742763519287} 11/06/2021 22:52:54 - INFO - __main__ - Step 11806: {'lr': 0.00049460472960208, 'samples': 2266752, 'steps': 11805, 'loss/train': 1.8669217824935913} 11/06/2021 22:52:54 - INFO - __main__ - Step 11807: {'lr': 0.00049460363300844, 'samples': 2266944, 'steps': 11806, 'loss/train': 1.47137451171875} 11/06/2021 22:52:54 - INFO - __main__ - Step 11808: {'lr': 0.0004946025363045854, 'samples': 2267136, 'steps': 11807, 'loss/train': 1.8952033519744873} 11/06/2021 22:52:55 - INFO - __main__ - Step 11809: {'lr': 0.0004946014394905167, 'samples': 2267328, 'steps': 11808, 'loss/train': 1.2856425046920776} 11/06/2021 22:52:56 - INFO - __main__ - Step 11810: {'lr': 0.0004946003425662343, 'samples': 2267520, 'steps': 11809, 'loss/train': 2.00896954536438} 11/06/2021 22:52:56 - INFO - __main__ - Step 11811: {'lr': 0.0004945992455317389, 'samples': 2267712, 'steps': 11810, 'loss/train': 1.724480390548706} 11/06/2021 22:52:56 - INFO - __main__ - Step 11812: {'lr': 0.0004945981483870307, 'samples': 2267904, 'steps': 11811, 'loss/train': 1.5671536922454834} 11/06/2021 22:52:57 - INFO - __main__ - Step 11813: {'lr': 0.0004945970511321104, 'samples': 2268096, 'steps': 11812, 'loss/train': 2.0063626766204834} 11/06/2021 22:52:57 - INFO - __main__ - Step 11814: {'lr': 0.0004945959537669784, 'samples': 2268288, 'steps': 11813, 'loss/train': 1.2561020851135254} 11/06/2021 22:52:57 - INFO - __main__ - Step 11815: {'lr': 0.0004945948562916353, 'samples': 2268480, 'steps': 11814, 'loss/train': 1.8354805707931519} 11/06/2021 22:52:59 - INFO - __main__ - Step 11816: {'lr': 0.0004945937587060815, 'samples': 2268672, 'steps': 11815, 'loss/train': 1.270841360092163} 11/06/2021 22:53:00 - INFO - __main__ - Step 11817: {'lr': 0.0004945926610103175, 'samples': 2268864, 'steps': 11816, 'loss/train': 1.2347999811172485} 11/06/2021 22:53:00 - INFO - __main__ - Step 11818: {'lr': 0.0004945915632043439, 'samples': 2269056, 'steps': 11817, 'loss/train': 1.340549349784851} 11/06/2021 22:53:00 - INFO - __main__ - Step 11819: {'lr': 0.0004945904652881611, 'samples': 2269248, 'steps': 11818, 'loss/train': 1.905003547668457} 11/06/2021 22:53:01 - INFO - __main__ - Step 11820: {'lr': 0.0004945893672617695, 'samples': 2269440, 'steps': 11819, 'loss/train': 1.818457841873169} 11/06/2021 22:53:01 - INFO - __main__ - Step 11821: {'lr': 0.0004945882691251699, 'samples': 2269632, 'steps': 11820, 'loss/train': 1.8213856220245361} 11/06/2021 22:53:02 - INFO - __main__ - Step 11822: {'lr': 0.0004945871708783625, 'samples': 2269824, 'steps': 11821, 'loss/train': 1.8481453657150269} 11/06/2021 22:53:02 - INFO - __main__ - Step 11823: {'lr': 0.0004945860725213477, 'samples': 2270016, 'steps': 11822, 'loss/train': 1.7589327096939087} 11/06/2021 22:53:03 - INFO - __main__ - Step 11824: {'lr': 0.0004945849740541265, 'samples': 2270208, 'steps': 11823, 'loss/train': 1.8017772436141968} 11/06/2021 22:53:03 - INFO - __main__ - Step 11825: {'lr': 0.000494583875476699, 'samples': 2270400, 'steps': 11824, 'loss/train': 1.8388363122940063} 11/06/2021 22:53:03 - INFO - __main__ - Step 11826: {'lr': 0.0004945827767890657, 'samples': 2270592, 'steps': 11825, 'loss/train': 1.1017407178878784} 11/06/2021 22:53:04 - INFO - __main__ - Step 11827: {'lr': 0.0004945816779912272, 'samples': 2270784, 'steps': 11826, 'loss/train': 1.1215801239013672} 11/06/2021 22:53:05 - INFO - __main__ - Step 11828: {'lr': 0.000494580579083184, 'samples': 2270976, 'steps': 11827, 'loss/train': 1.8502867221832275} 11/06/2021 22:53:05 - INFO - __main__ - Step 11829: {'lr': 0.0004945794800649366, 'samples': 2271168, 'steps': 11828, 'loss/train': 1.7619470357894897} 11/06/2021 22:53:06 - INFO - __main__ - Step 11830: {'lr': 0.0004945783809364853, 'samples': 2271360, 'steps': 11829, 'loss/train': 1.7739301919937134} 11/06/2021 22:53:06 - INFO - __main__ - Step 11831: {'lr': 0.0004945772816978309, 'samples': 2271552, 'steps': 11830, 'loss/train': 1.8250958919525146} 11/06/2021 22:53:06 - INFO - __main__ - Step 11832: {'lr': 0.0004945761823489737, 'samples': 2271744, 'steps': 11831, 'loss/train': 2.298485040664673} 11/06/2021 22:53:07 - INFO - __main__ - Step 11833: {'lr': 0.0004945750828899144, 'samples': 2271936, 'steps': 11832, 'loss/train': 1.6959924697875977} 11/06/2021 22:53:08 - INFO - __main__ - Step 11834: {'lr': 0.0004945739833206531, 'samples': 2272128, 'steps': 11833, 'loss/train': 1.4420230388641357} 11/06/2021 22:53:08 - INFO - __main__ - Step 11835: {'lr': 0.0004945728836411907, 'samples': 2272320, 'steps': 11834, 'loss/train': 1.9085041284561157} 11/06/2021 22:53:08 - INFO - __main__ - Step 11836: {'lr': 0.0004945717838515275, 'samples': 2272512, 'steps': 11835, 'loss/train': 1.7910774946212769} 11/06/2021 22:53:09 - INFO - __main__ - Step 11837: {'lr': 0.0004945706839516639, 'samples': 2272704, 'steps': 11836, 'loss/train': 0.9751549363136292} 11/06/2021 22:53:10 - INFO - __main__ - Step 11838: {'lr': 0.0004945695839416006, 'samples': 2272896, 'steps': 11837, 'loss/train': 1.1652768850326538} 11/06/2021 22:53:10 - INFO - __main__ - Step 11839: {'lr': 0.0004945684838213382, 'samples': 2273088, 'steps': 11838, 'loss/train': 2.4562861919403076} 11/06/2021 22:53:10 - INFO - __main__ - Step 11840: {'lr': 0.0004945673835908767, 'samples': 2273280, 'steps': 11839, 'loss/train': 1.785056233406067} 11/06/2021 22:53:11 - INFO - __main__ - Step 11841: {'lr': 0.0004945662832502171, 'samples': 2273472, 'steps': 11840, 'loss/train': 1.478073000907898} 11/06/2021 22:53:11 - INFO - __main__ - Step 11842: {'lr': 0.0004945651827993597, 'samples': 2273664, 'steps': 11841, 'loss/train': 1.5558642148971558} 11/06/2021 22:53:11 - INFO - __main__ - Step 11843: {'lr': 0.000494564082238305, 'samples': 2273856, 'steps': 11842, 'loss/train': 1.5626791715621948} 11/06/2021 22:53:13 - INFO - __main__ - Step 11844: {'lr': 0.0004945629815670535, 'samples': 2274048, 'steps': 11843, 'loss/train': 1.923411250114441} 11/06/2021 22:53:13 - INFO - __main__ - Step 11845: {'lr': 0.0004945618807856056, 'samples': 2274240, 'steps': 11844, 'loss/train': 1.2191367149353027} 11/06/2021 22:53:13 - INFO - __main__ - Step 11846: {'lr': 0.000494560779893962, 'samples': 2274432, 'steps': 11845, 'loss/train': 1.695072054862976} 11/06/2021 22:53:14 - INFO - __main__ - Step 11847: {'lr': 0.0004945596788921231, 'samples': 2274624, 'steps': 11846, 'loss/train': 1.8576159477233887} 11/06/2021 22:53:14 - INFO - __main__ - Step 11848: {'lr': 0.0004945585777800893, 'samples': 2274816, 'steps': 11847, 'loss/train': 1.8127880096435547} 11/06/2021 22:53:15 - INFO - __main__ - Step 11849: {'lr': 0.0004945574765578612, 'samples': 2275008, 'steps': 11848, 'loss/train': 1.4972808361053467} 11/06/2021 22:53:15 - INFO - __main__ - Step 11850: {'lr': 0.0004945563752254393, 'samples': 2275200, 'steps': 11849, 'loss/train': 1.7864632606506348} 11/06/2021 22:53:16 - INFO - __main__ - Step 11851: {'lr': 0.000494555273782824, 'samples': 2275392, 'steps': 11850, 'loss/train': 1.0610276460647583} 11/06/2021 22:53:16 - INFO - __main__ - Step 11852: {'lr': 0.000494554172230016, 'samples': 2275584, 'steps': 11851, 'loss/train': 1.3797191381454468} 11/06/2021 22:53:16 - INFO - __main__ - Step 11853: {'lr': 0.0004945530705670156, 'samples': 2275776, 'steps': 11852, 'loss/train': 1.7779545783996582} 11/06/2021 22:53:18 - INFO - __main__ - Step 11854: {'lr': 0.0004945519687938234, 'samples': 2275968, 'steps': 11853, 'loss/train': 1.6782175302505493} 11/06/2021 22:53:18 - INFO - __main__ - Step 11855: {'lr': 0.0004945508669104397, 'samples': 2276160, 'steps': 11854, 'loss/train': 2.7653017044067383} 11/06/2021 22:53:18 - INFO - __main__ - Step 11856: {'lr': 0.0004945497649168654, 'samples': 2276352, 'steps': 11855, 'loss/train': 1.727919340133667} 11/06/2021 22:53:19 - INFO - __main__ - Step 11857: {'lr': 0.0004945486628131006, 'samples': 2276544, 'steps': 11856, 'loss/train': 1.342666506767273} 11/06/2021 22:53:19 - INFO - __main__ - Step 11858: {'lr': 0.0004945475605991459, 'samples': 2276736, 'steps': 11857, 'loss/train': 0.3159283399581909} 11/06/2021 22:53:20 - INFO - __main__ - Step 11859: {'lr': 0.0004945464582750019, 'samples': 2276928, 'steps': 11858, 'loss/train': 1.7237508296966553} 11/06/2021 22:53:20 - INFO - __main__ - Step 11860: {'lr': 0.000494545355840669, 'samples': 2277120, 'steps': 11859, 'loss/train': 1.4004329442977905} 11/06/2021 22:53:21 - INFO - __main__ - Step 11861: {'lr': 0.0004945442532961478, 'samples': 2277312, 'steps': 11860, 'loss/train': 1.3610728979110718} 11/06/2021 22:53:21 - INFO - __main__ - Step 11862: {'lr': 0.0004945431506414386, 'samples': 2277504, 'steps': 11861, 'loss/train': 2.1039764881134033} 11/06/2021 22:53:21 - INFO - __main__ - Step 11863: {'lr': 0.0004945420478765422, 'samples': 2277696, 'steps': 11862, 'loss/train': 2.395843505859375} 11/06/2021 22:53:22 - INFO - __main__ - Step 11864: {'lr': 0.0004945409450014588, 'samples': 2277888, 'steps': 11863, 'loss/train': 1.4284261465072632} 11/06/2021 22:53:23 - INFO - __main__ - Step 11865: {'lr': 0.0004945398420161892, 'samples': 2278080, 'steps': 11864, 'loss/train': 1.9594630002975464} 11/06/2021 22:53:23 - INFO - __main__ - Step 11866: {'lr': 0.0004945387389207335, 'samples': 2278272, 'steps': 11865, 'loss/train': 1.446509599685669} 11/06/2021 22:53:23 - INFO - __main__ - Step 11867: {'lr': 0.0004945376357150926, 'samples': 2278464, 'steps': 11866, 'loss/train': 1.6203410625457764} 11/06/2021 22:53:24 - INFO - __main__ - Step 11868: {'lr': 0.0004945365323992668, 'samples': 2278656, 'steps': 11867, 'loss/train': 1.354360818862915} 11/06/2021 22:53:24 - INFO - __main__ - Step 11869: {'lr': 0.0004945354289732565, 'samples': 2278848, 'steps': 11868, 'loss/train': 1.6801567077636719} 11/06/2021 22:53:25 - INFO - __main__ - Step 11870: {'lr': 0.0004945343254370623, 'samples': 2279040, 'steps': 11869, 'loss/train': 1.8512502908706665} 11/06/2021 22:53:26 - INFO - __main__ - Step 11871: {'lr': 0.0004945332217906848, 'samples': 2279232, 'steps': 11870, 'loss/train': 1.6721792221069336} 11/06/2021 22:53:26 - INFO - __main__ - Step 11872: {'lr': 0.0004945321180341244, 'samples': 2279424, 'steps': 11871, 'loss/train': 1.7960259914398193} 11/06/2021 22:53:26 - INFO - __main__ - Step 11873: {'lr': 0.0004945310141673816, 'samples': 2279616, 'steps': 11872, 'loss/train': 1.7067826986312866} 11/06/2021 22:53:27 - INFO - __main__ - Step 11874: {'lr': 0.0004945299101904568, 'samples': 2279808, 'steps': 11873, 'loss/train': 1.8478683233261108} 11/06/2021 22:53:28 - INFO - __main__ - Step 11875: {'lr': 0.0004945288061033507, 'samples': 2280000, 'steps': 11874, 'loss/train': 1.8449006080627441} 11/06/2021 22:53:28 - INFO - __main__ - Step 11876: {'lr': 0.0004945277019060637, 'samples': 2280192, 'steps': 11875, 'loss/train': 1.7363343238830566} 11/06/2021 22:53:28 - INFO - __main__ - Step 11877: {'lr': 0.0004945265975985962, 'samples': 2280384, 'steps': 11876, 'loss/train': 1.287642478942871} 11/06/2021 22:53:29 - INFO - __main__ - Step 11878: {'lr': 0.0004945254931809489, 'samples': 2280576, 'steps': 11877, 'loss/train': 1.5857558250427246} 11/06/2021 22:53:29 - INFO - __main__ - Step 11879: {'lr': 0.000494524388653122, 'samples': 2280768, 'steps': 11878, 'loss/train': 1.3704111576080322} 11/06/2021 22:53:30 - INFO - __main__ - Step 11880: {'lr': 0.0004945232840151164, 'samples': 2280960, 'steps': 11879, 'loss/train': 1.9039316177368164} 11/06/2021 22:53:31 - INFO - __main__ - Step 11881: {'lr': 0.0004945221792669322, 'samples': 2281152, 'steps': 11880, 'loss/train': 1.6828198432922363} 11/06/2021 22:53:31 - INFO - __main__ - Step 11882: {'lr': 0.0004945210744085702, 'samples': 2281344, 'steps': 11881, 'loss/train': 2.2412431240081787} 11/06/2021 22:53:31 - INFO - __main__ - Step 11883: {'lr': 0.0004945199694400308, 'samples': 2281536, 'steps': 11882, 'loss/train': 1.4356671571731567} 11/06/2021 22:53:32 - INFO - __main__ - Step 11884: {'lr': 0.0004945188643613144, 'samples': 2281728, 'steps': 11883, 'loss/train': 1.609714388847351} 11/06/2021 22:53:33 - INFO - __main__ - Step 11885: {'lr': 0.0004945177591724216, 'samples': 2281920, 'steps': 11884, 'loss/train': 1.3597244024276733} 11/06/2021 22:53:33 - INFO - __main__ - Step 11886: {'lr': 0.0004945166538733529, 'samples': 2282112, 'steps': 11885, 'loss/train': 1.8882817029953003} 11/06/2021 22:53:33 - INFO - __main__ - Step 11887: {'lr': 0.0004945155484641087, 'samples': 2282304, 'steps': 11886, 'loss/train': 1.242563009262085} 11/06/2021 22:53:34 - INFO - __main__ - Step 11888: {'lr': 0.0004945144429446897, 'samples': 2282496, 'steps': 11887, 'loss/train': 1.8521512746810913} 11/06/2021 22:53:34 - INFO - __main__ - Step 11889: {'lr': 0.000494513337315096, 'samples': 2282688, 'steps': 11888, 'loss/train': 1.7061456441879272} 11/06/2021 22:53:35 - INFO - __main__ - Step 11890: {'lr': 0.0004945122315753286, 'samples': 2282880, 'steps': 11889, 'loss/train': 1.742601990699768} 11/06/2021 22:53:35 - INFO - __main__ - Step 11891: {'lr': 0.0004945111257253877, 'samples': 2283072, 'steps': 11890, 'loss/train': 1.7886587381362915} 11/06/2021 22:53:36 - INFO - __main__ - Step 11892: {'lr': 0.0004945100197652738, 'samples': 2283264, 'steps': 11891, 'loss/train': 2.3092379570007324} 11/06/2021 22:53:36 - INFO - __main__ - Step 11893: {'lr': 0.0004945089136949876, 'samples': 2283456, 'steps': 11892, 'loss/train': 1.7748466730117798} 11/06/2021 22:53:37 - INFO - __main__ - Step 11894: {'lr': 0.0004945078075145292, 'samples': 2283648, 'steps': 11893, 'loss/train': 2.0001227855682373} 11/06/2021 22:53:37 - INFO - __main__ - Step 11895: {'lr': 0.0004945067012238996, 'samples': 2283840, 'steps': 11894, 'loss/train': 1.475651502609253} 11/06/2021 22:53:38 - INFO - __main__ - Step 11896: {'lr': 0.000494505594823099, 'samples': 2284032, 'steps': 11895, 'loss/train': 0.5591633915901184} 11/06/2021 22:53:38 - INFO - __main__ - Step 11897: {'lr': 0.0004945044883121279, 'samples': 2284224, 'steps': 11896, 'loss/train': 1.6657626628875732} 11/06/2021 22:53:39 - INFO - __main__ - Step 11898: {'lr': 0.0004945033816909868, 'samples': 2284416, 'steps': 11897, 'loss/train': 1.5739197731018066} 11/06/2021 22:53:39 - INFO - __main__ - Step 11899: {'lr': 0.0004945022749596764, 'samples': 2284608, 'steps': 11898, 'loss/train': 1.6990892887115479} 11/06/2021 22:53:39 - INFO - __main__ - Step 11900: {'lr': 0.000494501168118197, 'samples': 2284800, 'steps': 11899, 'loss/train': 1.785287618637085} 11/06/2021 22:53:40 - INFO - __main__ - Step 11901: {'lr': 0.0004945000611665491, 'samples': 2284992, 'steps': 11900, 'loss/train': 1.6788626909255981} 11/06/2021 22:53:41 - INFO - __main__ - Step 11902: {'lr': 0.0004944989541047333, 'samples': 2285184, 'steps': 11901, 'loss/train': 1.9078575372695923} 11/06/2021 22:53:41 - INFO - __main__ - Step 11903: {'lr': 0.0004944978469327499, 'samples': 2285376, 'steps': 11902, 'loss/train': 1.8978601694107056} 11/06/2021 22:53:41 - INFO - __main__ - Step 11904: {'lr': 0.0004944967396505998, 'samples': 2285568, 'steps': 11903, 'loss/train': 1.5183367729187012} 11/06/2021 22:53:42 - INFO - __main__ - Step 11905: {'lr': 0.000494495632258283, 'samples': 2285760, 'steps': 11904, 'loss/train': 1.6283320188522339} 11/06/2021 22:53:43 - INFO - __main__ - Step 11906: {'lr': 0.0004944945247558004, 'samples': 2285952, 'steps': 11905, 'loss/train': 1.0076183080673218} 11/06/2021 22:53:43 - INFO - __main__ - Step 11907: {'lr': 0.0004944934171431522, 'samples': 2286144, 'steps': 11906, 'loss/train': 1.794429898262024} 11/06/2021 22:53:43 - INFO - __main__ - Step 11908: {'lr': 0.0004944923094203391, 'samples': 2286336, 'steps': 11907, 'loss/train': 1.9065196514129639} 11/06/2021 22:53:44 - INFO - __main__ - Step 11909: {'lr': 0.0004944912015873616, 'samples': 2286528, 'steps': 11908, 'loss/train': 1.092329740524292} 11/06/2021 22:53:44 - INFO - __main__ - Step 11910: {'lr': 0.0004944900936442201, 'samples': 2286720, 'steps': 11909, 'loss/train': 1.7471907138824463} 11/06/2021 22:53:45 - INFO - __main__ - Step 11911: {'lr': 0.000494488985590915, 'samples': 2286912, 'steps': 11910, 'loss/train': 2.4433507919311523} 11/06/2021 22:53:46 - INFO - __main__ - Step 11912: {'lr': 0.0004944878774274472, 'samples': 2287104, 'steps': 11911, 'loss/train': 1.9557836055755615} 11/06/2021 22:53:46 - INFO - __main__ - Step 11913: {'lr': 0.0004944867691538167, 'samples': 2287296, 'steps': 11912, 'loss/train': 1.8252263069152832} 11/06/2021 22:53:46 - INFO - __main__ - Step 11914: {'lr': 0.0004944856607700243, 'samples': 2287488, 'steps': 11913, 'loss/train': 1.3514317274093628} 11/06/2021 22:53:47 - INFO - __main__ - Step 11915: {'lr': 0.0004944845522760706, 'samples': 2287680, 'steps': 11914, 'loss/train': 1.4288444519042969} 11/06/2021 22:53:47 - INFO - __main__ - Step 11916: {'lr': 0.0004944834436719557, 'samples': 2287872, 'steps': 11915, 'loss/train': 1.7817126512527466} 11/06/2021 22:53:48 - INFO - __main__ - Step 11917: {'lr': 0.0004944823349576805, 'samples': 2288064, 'steps': 11916, 'loss/train': 0.9959566593170166} 11/06/2021 22:53:48 - INFO - __main__ - Step 11918: {'lr': 0.0004944812261332452, 'samples': 2288256, 'steps': 11917, 'loss/train': 1.2782503366470337} 11/06/2021 22:53:49 - INFO - __main__ - Step 11919: {'lr': 0.0004944801171986505, 'samples': 2288448, 'steps': 11918, 'loss/train': 1.4812965393066406} 11/06/2021 22:53:49 - INFO - __main__ - Step 11920: {'lr': 0.0004944790081538969, 'samples': 2288640, 'steps': 11919, 'loss/train': 1.7780016660690308} 11/06/2021 22:53:49 - INFO - __main__ - Step 11921: {'lr': 0.0004944778989989847, 'samples': 2288832, 'steps': 11920, 'loss/train': 2.459728240966797} 11/06/2021 22:53:50 - INFO - __main__ - Step 11922: {'lr': 0.0004944767897339146, 'samples': 2289024, 'steps': 11921, 'loss/train': 1.5695812702178955} 11/06/2021 22:53:51 - INFO - __main__ - Step 11923: {'lr': 0.000494475680358687, 'samples': 2289216, 'steps': 11922, 'loss/train': 1.9413129091262817} 11/06/2021 22:53:51 - INFO - __main__ - Step 11924: {'lr': 0.0004944745708733025, 'samples': 2289408, 'steps': 11923, 'loss/train': 2.0285587310791016} 11/06/2021 22:53:52 - INFO - __main__ - Step 11925: {'lr': 0.0004944734612777615, 'samples': 2289600, 'steps': 11924, 'loss/train': 1.9409503936767578} 11/06/2021 22:53:52 - INFO - __main__ - Step 11926: {'lr': 0.0004944723515720645, 'samples': 2289792, 'steps': 11925, 'loss/train': 1.747592806816101} 11/06/2021 22:53:53 - INFO - __main__ - Step 11927: {'lr': 0.000494471241756212, 'samples': 2289984, 'steps': 11926, 'loss/train': 1.674291729927063} 11/06/2021 22:53:53 - INFO - __main__ - Step 11928: {'lr': 0.0004944701318302046, 'samples': 2290176, 'steps': 11927, 'loss/train': 1.5972325801849365} 11/06/2021 22:53:54 - INFO - __main__ - Step 11929: {'lr': 0.0004944690217940427, 'samples': 2290368, 'steps': 11928, 'loss/train': 1.5960736274719238} 11/06/2021 22:53:54 - INFO - __main__ - Step 11930: {'lr': 0.0004944679116477269, 'samples': 2290560, 'steps': 11929, 'loss/train': 1.5236477851867676} 11/06/2021 22:53:54 - INFO - __main__ - Step 11931: {'lr': 0.0004944668013912575, 'samples': 2290752, 'steps': 11930, 'loss/train': 1.5838426351547241} 11/06/2021 22:53:55 - INFO - __main__ - Step 11932: {'lr': 0.0004944656910246352, 'samples': 2290944, 'steps': 11931, 'loss/train': 1.651598572731018} 11/06/2021 22:53:56 - INFO - __main__ - Step 11933: {'lr': 0.0004944645805478605, 'samples': 2291136, 'steps': 11932, 'loss/train': 1.3770673274993896} 11/06/2021 22:53:56 - INFO - __main__ - Step 11934: {'lr': 0.0004944634699609338, 'samples': 2291328, 'steps': 11933, 'loss/train': 2.044022560119629} 11/06/2021 22:53:56 - INFO - __main__ - Step 11935: {'lr': 0.0004944623592638555, 'samples': 2291520, 'steps': 11934, 'loss/train': 1.4433174133300781} 11/06/2021 22:53:57 - INFO - __main__ - Step 11936: {'lr': 0.0004944612484566263, 'samples': 2291712, 'steps': 11935, 'loss/train': 1.43559730052948} 11/06/2021 22:53:57 - INFO - __main__ - Step 11937: {'lr': 0.0004944601375392467, 'samples': 2291904, 'steps': 11936, 'loss/train': 2.0127391815185547} 11/06/2021 22:53:58 - INFO - __main__ - Step 11938: {'lr': 0.000494459026511717, 'samples': 2292096, 'steps': 11937, 'loss/train': 1.7522636651992798} 11/06/2021 22:53:59 - INFO - __main__ - Step 11939: {'lr': 0.000494457915374038, 'samples': 2292288, 'steps': 11938, 'loss/train': 1.910089373588562} 11/06/2021 22:53:59 - INFO - __main__ - Step 11940: {'lr': 0.00049445680412621, 'samples': 2292480, 'steps': 11939, 'loss/train': 1.6067532300949097} 11/06/2021 22:53:59 - INFO - __main__ - Step 11941: {'lr': 0.0004944556927682335, 'samples': 2292672, 'steps': 11940, 'loss/train': 1.7804417610168457} 11/06/2021 22:54:00 - INFO - __main__ - Step 11942: {'lr': 0.000494454581300109, 'samples': 2292864, 'steps': 11941, 'loss/train': 1.387055516242981} 11/06/2021 22:54:00 - INFO - __main__ - Step 11943: {'lr': 0.0004944534697218371, 'samples': 2293056, 'steps': 11942, 'loss/train': 1.4685114622116089} 11/06/2021 22:54:01 - INFO - __main__ - Step 11944: {'lr': 0.0004944523580334183, 'samples': 2293248, 'steps': 11943, 'loss/train': 0.5896292924880981} 11/06/2021 22:54:01 - INFO - __main__ - Step 11945: {'lr': 0.0004944512462348528, 'samples': 2293440, 'steps': 11944, 'loss/train': 1.9006822109222412} 11/06/2021 22:54:02 - INFO - __main__ - Step 11946: {'lr': 0.0004944501343261416, 'samples': 2293632, 'steps': 11945, 'loss/train': 1.7416223287582397} 11/06/2021 22:54:02 - INFO - __main__ - Step 11947: {'lr': 0.0004944490223072848, 'samples': 2293824, 'steps': 11946, 'loss/train': 0.48262572288513184} 11/06/2021 22:54:03 - INFO - __main__ - Step 11948: {'lr': 0.0004944479101782831, 'samples': 2294016, 'steps': 11947, 'loss/train': 1.9362751245498657} 11/06/2021 22:54:03 - INFO - __main__ - Step 11949: {'lr': 0.0004944467979391369, 'samples': 2294208, 'steps': 11948, 'loss/train': 1.4246046543121338} 11/06/2021 22:54:04 - INFO - __main__ - Step 11950: {'lr': 0.0004944456855898469, 'samples': 2294400, 'steps': 11949, 'loss/train': 1.5375300645828247} 11/06/2021 22:54:04 - INFO - __main__ - Step 11951: {'lr': 0.0004944445731304133, 'samples': 2294592, 'steps': 11950, 'loss/train': 2.0238983631134033} 11/06/2021 22:54:04 - INFO - __main__ - Step 11952: {'lr': 0.0004944434605608367, 'samples': 2294784, 'steps': 11951, 'loss/train': 1.2838270664215088} 11/06/2021 22:54:05 - INFO - __main__ - Step 11953: {'lr': 0.0004944423478811177, 'samples': 2294976, 'steps': 11952, 'loss/train': 1.7840559482574463} 11/06/2021 22:54:06 - INFO - __main__ - Step 11954: {'lr': 0.0004944412350912567, 'samples': 2295168, 'steps': 11953, 'loss/train': 1.7490830421447754} 11/06/2021 22:54:06 - INFO - __main__ - Step 11955: {'lr': 0.0004944401221912544, 'samples': 2295360, 'steps': 11954, 'loss/train': 1.8949376344680786} 11/06/2021 22:54:06 - INFO - __main__ - Step 11956: {'lr': 0.0004944390091811111, 'samples': 2295552, 'steps': 11955, 'loss/train': 1.7346197366714478} 11/06/2021 22:54:07 - INFO - __main__ - Step 11957: {'lr': 0.0004944378960608272, 'samples': 2295744, 'steps': 11956, 'loss/train': 1.9037965536117554} 11/06/2021 22:54:07 - INFO - __main__ - Step 11958: {'lr': 0.0004944367828304035, 'samples': 2295936, 'steps': 11957, 'loss/train': 1.8399511575698853} 11/06/2021 22:54:08 - INFO - __main__ - Step 11959: {'lr': 0.0004944356694898404, 'samples': 2296128, 'steps': 11958, 'loss/train': 0.14074699580669403} 11/06/2021 22:54:09 - INFO - __main__ - Step 11960: {'lr': 0.0004944345560391382, 'samples': 2296320, 'steps': 11959, 'loss/train': 1.9995806217193604} 11/06/2021 22:54:09 - INFO - __main__ - Step 11961: {'lr': 0.0004944334424782977, 'samples': 2296512, 'steps': 11960, 'loss/train': 1.5391980409622192} 11/06/2021 22:54:09 - INFO - __main__ - Step 11962: {'lr': 0.0004944323288073192, 'samples': 2296704, 'steps': 11961, 'loss/train': 1.4151784181594849} 11/06/2021 22:54:10 - INFO - __main__ - Step 11963: {'lr': 0.0004944312150262033, 'samples': 2296896, 'steps': 11962, 'loss/train': 1.0756903886795044} 11/06/2021 22:54:11 - INFO - __main__ - Step 11964: {'lr': 0.0004944301011349505, 'samples': 2297088, 'steps': 11963, 'loss/train': 1.644909143447876} 11/06/2021 22:54:11 - INFO - __main__ - Step 11965: {'lr': 0.0004944289871335612, 'samples': 2297280, 'steps': 11964, 'loss/train': 2.0946547985076904} 11/06/2021 22:54:11 - INFO - __main__ - Step 11966: {'lr': 0.0004944278730220359, 'samples': 2297472, 'steps': 11965, 'loss/train': 1.4997888803482056} 11/06/2021 22:54:12 - INFO - __main__ - Step 11967: {'lr': 0.0004944267588003754, 'samples': 2297664, 'steps': 11966, 'loss/train': 1.587903618812561} 11/06/2021 22:54:12 - INFO - __main__ - Step 11968: {'lr': 0.0004944256444685798, 'samples': 2297856, 'steps': 11967, 'loss/train': 1.6351916790008545} 11/06/2021 22:54:13 - INFO - __main__ - Step 11969: {'lr': 0.0004944245300266498, 'samples': 2298048, 'steps': 11968, 'loss/train': 1.4442037343978882} 11/06/2021 22:54:13 - INFO - __main__ - Step 11970: {'lr': 0.0004944234154745859, 'samples': 2298240, 'steps': 11969, 'loss/train': 1.8854682445526123} 11/06/2021 22:54:14 - INFO - __main__ - Step 11971: {'lr': 0.0004944223008123886, 'samples': 2298432, 'steps': 11970, 'loss/train': 1.832446813583374} 11/06/2021 22:54:14 - INFO - __main__ - Step 11972: {'lr': 0.0004944211860400582, 'samples': 2298624, 'steps': 11971, 'loss/train': 1.448323130607605} 11/06/2021 22:54:15 - INFO - __main__ - Step 11973: {'lr': 0.0004944200711575956, 'samples': 2298816, 'steps': 11972, 'loss/train': 1.9672355651855469} 11/06/2021 22:54:16 - INFO - __main__ - Step 11974: {'lr': 0.0004944189561650011, 'samples': 2299008, 'steps': 11973, 'loss/train': 1.4000511169433594} 11/06/2021 22:54:16 - INFO - __main__ - Step 11975: {'lr': 0.0004944178410622751, 'samples': 2299200, 'steps': 11974, 'loss/train': 1.234156847000122} 11/06/2021 22:54:17 - INFO - __main__ - Step 11976: {'lr': 0.0004944167258494181, 'samples': 2299392, 'steps': 11975, 'loss/train': 1.449386477470398} 11/06/2021 22:54:17 - INFO - __main__ - Step 11977: {'lr': 0.0004944156105264308, 'samples': 2299584, 'steps': 11976, 'loss/train': 0.3471572995185852} 11/06/2021 22:54:17 - INFO - __main__ - Step 11978: {'lr': 0.0004944144950933137, 'samples': 2299776, 'steps': 11977, 'loss/train': 1.291796088218689} 11/06/2021 22:54:18 - INFO - __main__ - Step 11979: {'lr': 0.000494413379550067, 'samples': 2299968, 'steps': 11978, 'loss/train': 1.4064840078353882} 11/06/2021 22:54:19 - INFO - __main__ - Step 11980: {'lr': 0.0004944122638966916, 'samples': 2300160, 'steps': 11979, 'loss/train': 2.1589348316192627} 11/06/2021 22:54:19 - INFO - __main__ - Step 11981: {'lr': 0.0004944111481331876, 'samples': 2300352, 'steps': 11980, 'loss/train': 2.178071975708008} 11/06/2021 22:54:19 - INFO - __main__ - Step 11982: {'lr': 0.0004944100322595558, 'samples': 2300544, 'steps': 11981, 'loss/train': 1.5777186155319214} 11/06/2021 22:54:20 - INFO - __main__ - Step 11983: {'lr': 0.0004944089162757968, 'samples': 2300736, 'steps': 11982, 'loss/train': 1.8397846221923828} 11/06/2021 22:54:21 - INFO - __main__ - Step 11984: {'lr': 0.0004944078001819106, 'samples': 2300928, 'steps': 11983, 'loss/train': 1.9047267436981201} 11/06/2021 22:54:21 - INFO - __main__ - Step 11985: {'lr': 0.0004944066839778983, 'samples': 2301120, 'steps': 11984, 'loss/train': 1.7153035402297974} 11/06/2021 22:54:21 - INFO - __main__ - Step 11986: {'lr': 0.0004944055676637599, 'samples': 2301312, 'steps': 11985, 'loss/train': 1.4917117357254028} 11/06/2021 22:54:22 - INFO - __main__ - Step 11987: {'lr': 0.0004944044512394962, 'samples': 2301504, 'steps': 11986, 'loss/train': 1.906925916671753} 11/06/2021 22:54:22 - INFO - __main__ - Step 11988: {'lr': 0.0004944033347051076, 'samples': 2301696, 'steps': 11987, 'loss/train': 1.9712414741516113} 11/06/2021 22:54:23 - INFO - __main__ - Step 11989: {'lr': 0.0004944022180605947, 'samples': 2301888, 'steps': 11988, 'loss/train': 1.898970127105713} 11/06/2021 22:54:24 - INFO - __main__ - Step 11990: {'lr': 0.0004944011013059579, 'samples': 2302080, 'steps': 11989, 'loss/train': 1.1535645723342896} 11/06/2021 22:54:24 - INFO - __main__ - Step 11991: {'lr': 0.0004943999844411977, 'samples': 2302272, 'steps': 11990, 'loss/train': 1.2547049522399902} 11/06/2021 22:54:24 - INFO - __main__ - Step 11992: {'lr': 0.0004943988674663147, 'samples': 2302464, 'steps': 11991, 'loss/train': 1.8330069780349731} 11/06/2021 22:54:25 - INFO - __main__ - Step 11993: {'lr': 0.0004943977503813092, 'samples': 2302656, 'steps': 11992, 'loss/train': 1.3840059041976929} 11/06/2021 22:54:25 - INFO - __main__ - Step 11994: {'lr': 0.000494396633186182, 'samples': 2302848, 'steps': 11993, 'loss/train': 2.0180881023406982} 11/06/2021 22:54:26 - INFO - __main__ - Step 11995: {'lr': 0.0004943955158809334, 'samples': 2303040, 'steps': 11994, 'loss/train': 1.7665796279907227} 11/06/2021 22:54:26 - INFO - __main__ - Step 11996: {'lr': 0.0004943943984655639, 'samples': 2303232, 'steps': 11995, 'loss/train': 1.3726966381072998} 11/06/2021 22:54:27 - INFO - __main__ - Step 11997: {'lr': 0.0004943932809400741, 'samples': 2303424, 'steps': 11996, 'loss/train': 1.9251682758331299} 11/06/2021 22:54:27 - INFO - __main__ - Step 11998: {'lr': 0.0004943921633044644, 'samples': 2303616, 'steps': 11997, 'loss/train': 1.752882480621338} 11/06/2021 22:54:27 - INFO - __main__ - Step 11999: {'lr': 0.0004943910455587354, 'samples': 2303808, 'steps': 11998, 'loss/train': 1.246006727218628} 11/06/2021 22:54:28 - INFO - __main__ - Step 12000: {'lr': 0.0004943899277028877, 'samples': 2304000, 'steps': 11999, 'loss/train': 1.6304447650909424} 11/06/2021 22:54:29 - INFO - __main__ - Step 12001: {'lr': 0.0004943888097369216, 'samples': 2304192, 'steps': 12000, 'loss/train': 1.6695126295089722} 11/06/2021 22:54:29 - INFO - __main__ - Step 12002: {'lr': 0.0004943876916608375, 'samples': 2304384, 'steps': 12001, 'loss/train': 1.794386625289917} 11/06/2021 22:54:29 - INFO - __main__ - Step 12003: {'lr': 0.0004943865734746364, 'samples': 2304576, 'steps': 12002, 'loss/train': 1.8876904249191284} 11/06/2021 22:54:30 - INFO - __main__ - Step 12004: {'lr': 0.0004943854551783182, 'samples': 2304768, 'steps': 12003, 'loss/train': 1.6697297096252441} 11/06/2021 22:54:31 - INFO - __main__ - Step 12005: {'lr': 0.0004943843367718838, 'samples': 2304960, 'steps': 12004, 'loss/train': 1.6420907974243164} 11/06/2021 22:54:31 - INFO - __main__ - Step 12006: {'lr': 0.0004943832182553336, 'samples': 2305152, 'steps': 12005, 'loss/train': 1.553113341331482} 11/06/2021 22:54:32 - INFO - __main__ - Step 12007: {'lr': 0.000494382099628668, 'samples': 2305344, 'steps': 12006, 'loss/train': 2.0208699703216553} 11/06/2021 22:54:32 - INFO - __main__ - Step 12008: {'lr': 0.0004943809808918877, 'samples': 2305536, 'steps': 12007, 'loss/train': 1.575126051902771} 11/06/2021 22:54:32 - INFO - __main__ - Step 12009: {'lr': 0.000494379862044993, 'samples': 2305728, 'steps': 12008, 'loss/train': 1.9213340282440186} 11/06/2021 22:54:33 - INFO - __main__ - Step 12010: {'lr': 0.0004943787430879846, 'samples': 2305920, 'steps': 12009, 'loss/train': 1.99528169631958} 11/06/2021 22:54:34 - INFO - __main__ - Step 12011: {'lr': 0.0004943776240208628, 'samples': 2306112, 'steps': 12010, 'loss/train': 1.7493822574615479} 11/06/2021 22:54:34 - INFO - __main__ - Step 12012: {'lr': 0.0004943765048436283, 'samples': 2306304, 'steps': 12011, 'loss/train': 1.5333633422851562} 11/06/2021 22:54:34 - INFO - __main__ - Step 12013: {'lr': 0.0004943753855562815, 'samples': 2306496, 'steps': 12012, 'loss/train': 1.6219375133514404} 11/06/2021 22:54:35 - INFO - __main__ - Step 12014: {'lr': 0.000494374266158823, 'samples': 2306688, 'steps': 12013, 'loss/train': 1.9051744937896729} 11/06/2021 22:54:36 - INFO - __main__ - Step 12015: {'lr': 0.0004943731466512531, 'samples': 2306880, 'steps': 12014, 'loss/train': 1.5404677391052246} 11/06/2021 22:54:37 - INFO - __main__ - Step 12016: {'lr': 0.0004943720270335724, 'samples': 2307072, 'steps': 12015, 'loss/train': 1.5498056411743164} 11/06/2021 22:54:37 - INFO - __main__ - Step 12017: {'lr': 0.0004943709073057816, 'samples': 2307264, 'steps': 12016, 'loss/train': 1.53780198097229} 11/06/2021 22:54:37 - INFO - __main__ - Step 12018: {'lr': 0.000494369787467881, 'samples': 2307456, 'steps': 12017, 'loss/train': 1.328963041305542} 11/06/2021 22:54:38 - INFO - __main__ - Step 12019: {'lr': 0.000494368667519871, 'samples': 2307648, 'steps': 12018, 'loss/train': 1.014737844467163} 11/06/2021 22:54:38 - INFO - __main__ - Step 12020: {'lr': 0.0004943675474617524, 'samples': 2307840, 'steps': 12019, 'loss/train': 1.956551432609558} 11/06/2021 22:54:39 - INFO - __main__ - Step 12021: {'lr': 0.0004943664272935255, 'samples': 2308032, 'steps': 12020, 'loss/train': 1.9309561252593994} 11/06/2021 22:54:39 - INFO - __main__ - Step 12022: {'lr': 0.0004943653070151909, 'samples': 2308224, 'steps': 12021, 'loss/train': 1.6873791217803955} 11/06/2021 22:54:40 - INFO - __main__ - Step 12023: {'lr': 0.000494364186626749, 'samples': 2308416, 'steps': 12022, 'loss/train': 0.8956676125526428} 11/06/2021 22:54:40 - INFO - __main__ - Step 12024: {'lr': 0.0004943630661282004, 'samples': 2308608, 'steps': 12023, 'loss/train': 1.492911696434021} 11/06/2021 22:54:40 - INFO - __main__ - Step 12025: {'lr': 0.0004943619455195456, 'samples': 2308800, 'steps': 12024, 'loss/train': 2.243229627609253} 11/06/2021 22:54:41 - INFO - __main__ - Step 12026: {'lr': 0.000494360824800785, 'samples': 2308992, 'steps': 12025, 'loss/train': 2.4765336513519287} 11/06/2021 22:54:42 - INFO - __main__ - Step 12027: {'lr': 0.0004943597039719192, 'samples': 2309184, 'steps': 12026, 'loss/train': 2.0413081645965576} 11/06/2021 22:54:42 - INFO - __main__ - Step 12028: {'lr': 0.0004943585830329487, 'samples': 2309376, 'steps': 12027, 'loss/train': 1.6951693296432495} 11/06/2021 22:54:42 - INFO - __main__ - Step 12029: {'lr': 0.0004943574619838741, 'samples': 2309568, 'steps': 12028, 'loss/train': 1.7877787351608276} 11/06/2021 22:54:43 - INFO - __main__ - Step 12030: {'lr': 0.0004943563408246957, 'samples': 2309760, 'steps': 12029, 'loss/train': 2.547342300415039} 11/06/2021 22:54:44 - INFO - __main__ - Step 12031: {'lr': 0.000494355219555414, 'samples': 2309952, 'steps': 12030, 'loss/train': 1.9224157333374023} 11/06/2021 22:54:44 - INFO - __main__ - Step 12032: {'lr': 0.0004943540981760298, 'samples': 2310144, 'steps': 12031, 'loss/train': 1.732265591621399} 11/06/2021 22:54:45 - INFO - __main__ - Step 12033: {'lr': 0.0004943529766865434, 'samples': 2310336, 'steps': 12032, 'loss/train': 1.4456995725631714} 11/06/2021 22:54:45 - INFO - __main__ - Step 12034: {'lr': 0.0004943518550869552, 'samples': 2310528, 'steps': 12033, 'loss/train': 1.0278202295303345} 11/06/2021 22:54:45 - INFO - __main__ - Step 12035: {'lr': 0.0004943507333772659, 'samples': 2310720, 'steps': 12034, 'loss/train': 1.7194404602050781} 11/06/2021 22:54:46 - INFO - __main__ - Step 12036: {'lr': 0.0004943496115574758, 'samples': 2310912, 'steps': 12035, 'loss/train': 1.838610053062439} 11/06/2021 22:54:47 - INFO - __main__ - Step 12037: {'lr': 0.0004943484896275857, 'samples': 2311104, 'steps': 12036, 'loss/train': 1.8748886585235596} 11/06/2021 22:54:47 - INFO - __main__ - Step 12038: {'lr': 0.0004943473675875959, 'samples': 2311296, 'steps': 12037, 'loss/train': 1.785986304283142} 11/06/2021 22:54:47 - INFO - __main__ - Step 12039: {'lr': 0.0004943462454375069, 'samples': 2311488, 'steps': 12038, 'loss/train': 1.6484863758087158} 11/06/2021 22:54:48 - INFO - __main__ - Step 12040: {'lr': 0.0004943451231773192, 'samples': 2311680, 'steps': 12039, 'loss/train': 1.8287440538406372} 11/06/2021 22:54:48 - INFO - __main__ - Step 12041: {'lr': 0.0004943440008070336, 'samples': 2311872, 'steps': 12040, 'loss/train': 1.2634427547454834} 11/06/2021 22:54:49 - INFO - __main__ - Step 12042: {'lr': 0.0004943428783266502, 'samples': 2312064, 'steps': 12041, 'loss/train': 1.7805235385894775} 11/06/2021 22:54:49 - INFO - __main__ - Step 12043: {'lr': 0.0004943417557361696, 'samples': 2312256, 'steps': 12042, 'loss/train': 1.758927583694458} 11/06/2021 22:54:50 - INFO - __main__ - Step 12044: {'lr': 0.0004943406330355925, 'samples': 2312448, 'steps': 12043, 'loss/train': 1.159117341041565} 11/06/2021 22:54:50 - INFO - __main__ - Step 12045: {'lr': 0.0004943395102249192, 'samples': 2312640, 'steps': 12044, 'loss/train': 1.8468574285507202} 11/06/2021 22:54:51 - INFO - __main__ - Step 12046: {'lr': 0.0004943383873041503, 'samples': 2312832, 'steps': 12045, 'loss/train': 1.8814785480499268} 11/06/2021 22:54:51 - INFO - __main__ - Step 12047: {'lr': 0.0004943372642732864, 'samples': 2313024, 'steps': 12046, 'loss/train': 2.093292236328125} 11/06/2021 22:54:52 - INFO - __main__ - Step 12048: {'lr': 0.0004943361411323277, 'samples': 2313216, 'steps': 12047, 'loss/train': 1.5713038444519043} 11/06/2021 22:54:52 - INFO - __main__ - Step 12049: {'lr': 0.0004943350178812751, 'samples': 2313408, 'steps': 12048, 'loss/train': 1.8695523738861084} 11/06/2021 22:54:52 - INFO - __main__ - Step 12050: {'lr': 0.0004943338945201288, 'samples': 2313600, 'steps': 12049, 'loss/train': 1.4861245155334473} 11/06/2021 22:54:53 - INFO - __main__ - Step 12051: {'lr': 0.0004943327710488894, 'samples': 2313792, 'steps': 12050, 'loss/train': 1.4577797651290894} 11/06/2021 22:54:54 - INFO - __main__ - Step 12052: {'lr': 0.0004943316474675575, 'samples': 2313984, 'steps': 12051, 'loss/train': 1.26167631149292} 11/06/2021 22:54:54 - INFO - __main__ - Step 12053: {'lr': 0.0004943305237761335, 'samples': 2314176, 'steps': 12052, 'loss/train': 1.674010157585144} 11/06/2021 22:54:54 - INFO - __main__ - Step 12054: {'lr': 0.0004943293999746179, 'samples': 2314368, 'steps': 12053, 'loss/train': 1.585003137588501} 11/06/2021 22:54:55 - INFO - __main__ - Step 12055: {'lr': 0.0004943282760630114, 'samples': 2314560, 'steps': 12054, 'loss/train': 1.8338981866836548} 11/06/2021 22:54:56 - INFO - __main__ - Step 12056: {'lr': 0.0004943271520413141, 'samples': 2314752, 'steps': 12055, 'loss/train': 1.6926106214523315} 11/06/2021 22:54:56 - INFO - __main__ - Step 12057: {'lr': 0.0004943260279095269, 'samples': 2314944, 'steps': 12056, 'loss/train': 1.6934748888015747} 11/06/2021 22:54:57 - INFO - __main__ - Step 12058: {'lr': 0.0004943249036676501, 'samples': 2315136, 'steps': 12057, 'loss/train': 1.7814065217971802} 11/06/2021 22:54:57 - INFO - __main__ - Step 12059: {'lr': 0.0004943237793156844, 'samples': 2315328, 'steps': 12058, 'loss/train': 2.4570651054382324} 11/06/2021 22:54:57 - INFO - __main__ - Step 12060: {'lr': 0.00049432265485363, 'samples': 2315520, 'steps': 12059, 'loss/train': 1.1607862710952759} 11/06/2021 22:54:58 - INFO - __main__ - Step 12061: {'lr': 0.0004943215302814877, 'samples': 2315712, 'steps': 12060, 'loss/train': 1.2177666425704956} 11/06/2021 22:54:59 - INFO - __main__ - Step 12062: {'lr': 0.0004943204055992579, 'samples': 2315904, 'steps': 12061, 'loss/train': 1.5392464399337769} 11/06/2021 22:54:59 - INFO - __main__ - Step 12063: {'lr': 0.0004943192808069411, 'samples': 2316096, 'steps': 12062, 'loss/train': 1.2292014360427856} 11/06/2021 22:54:59 - INFO - __main__ - Step 12064: {'lr': 0.0004943181559045378, 'samples': 2316288, 'steps': 12063, 'loss/train': 1.9744259119033813} 11/06/2021 22:55:00 - INFO - __main__ - Step 12065: {'lr': 0.0004943170308920483, 'samples': 2316480, 'steps': 12064, 'loss/train': 1.6766237020492554} 11/06/2021 22:55:00 - INFO - __main__ - Step 12066: {'lr': 0.0004943159057694736, 'samples': 2316672, 'steps': 12065, 'loss/train': 1.6280395984649658} 11/06/2021 22:55:02 - INFO - __main__ - Step 12067: {'lr': 0.0004943147805368138, 'samples': 2316864, 'steps': 12066, 'loss/train': 1.6543623208999634} 11/06/2021 22:55:02 - INFO - __main__ - Step 12068: {'lr': 0.0004943136551940695, 'samples': 2317056, 'steps': 12067, 'loss/train': 1.1795415878295898} 11/06/2021 22:55:02 - INFO - __main__ - Step 12069: {'lr': 0.0004943125297412413, 'samples': 2317248, 'steps': 12068, 'loss/train': 1.8180369138717651} 11/06/2021 22:55:03 - INFO - __main__ - Step 12070: {'lr': 0.0004943114041783296, 'samples': 2317440, 'steps': 12069, 'loss/train': 1.788710355758667} 11/06/2021 22:55:03 - INFO - __main__ - Step 12071: {'lr': 0.000494310278505335, 'samples': 2317632, 'steps': 12070, 'loss/train': 1.761925458908081} 11/06/2021 22:55:03 - INFO - __main__ - Step 12072: {'lr': 0.0004943091527222579, 'samples': 2317824, 'steps': 12071, 'loss/train': 1.7475976943969727} 11/06/2021 22:55:04 - INFO - __main__ - Step 12073: {'lr': 0.0004943080268290989, 'samples': 2318016, 'steps': 12072, 'loss/train': 1.6183298826217651} 11/06/2021 22:55:05 - INFO - __main__ - Step 12074: {'lr': 0.0004943069008258584, 'samples': 2318208, 'steps': 12073, 'loss/train': 1.4593292474746704} 11/06/2021 22:55:05 - INFO - __main__ - Step 12075: {'lr': 0.0004943057747125371, 'samples': 2318400, 'steps': 12074, 'loss/train': 1.8841195106506348} 11/06/2021 22:55:06 - INFO - __main__ - Step 12076: {'lr': 0.0004943046484891352, 'samples': 2318592, 'steps': 12075, 'loss/train': 1.3961201906204224} 11/06/2021 22:55:06 - INFO - __main__ - Step 12077: {'lr': 0.0004943035221556536, 'samples': 2318784, 'steps': 12076, 'loss/train': 1.4814202785491943} 11/06/2021 22:55:07 - INFO - __main__ - Step 12078: {'lr': 0.0004943023957120926, 'samples': 2318976, 'steps': 12077, 'loss/train': 1.8115047216415405} 11/06/2021 22:55:07 - INFO - __main__ - Step 12079: {'lr': 0.0004943012691584526, 'samples': 2319168, 'steps': 12078, 'loss/train': 1.6468000411987305} 11/06/2021 22:55:08 - INFO - __main__ - Step 12080: {'lr': 0.0004943001424947343, 'samples': 2319360, 'steps': 12079, 'loss/train': 2.1361162662506104} 11/06/2021 22:55:08 - INFO - __main__ - Step 12081: {'lr': 0.000494299015720938, 'samples': 2319552, 'steps': 12080, 'loss/train': 1.2885714769363403} 11/06/2021 22:55:08 - INFO - __main__ - Step 12082: {'lr': 0.0004942978888370645, 'samples': 2319744, 'steps': 12081, 'loss/train': 2.0774118900299072} 11/06/2021 22:55:10 - INFO - __main__ - Step 12083: {'lr': 0.000494296761843114, 'samples': 2319936, 'steps': 12082, 'loss/train': 1.5701794624328613} 11/06/2021 22:55:10 - INFO - __main__ - Step 12084: {'lr': 0.0004942956347390872, 'samples': 2320128, 'steps': 12083, 'loss/train': 1.1753841638565063} 11/06/2021 22:55:11 - INFO - __main__ - Step 12085: {'lr': 0.0004942945075249845, 'samples': 2320320, 'steps': 12084, 'loss/train': 1.6543172597885132} 11/06/2021 22:55:11 - INFO - __main__ - Step 12086: {'lr': 0.0004942933802008066, 'samples': 2320512, 'steps': 12085, 'loss/train': 2.0663843154907227} 11/06/2021 22:55:12 - INFO - __main__ - Step 12087: {'lr': 0.0004942922527665538, 'samples': 2320704, 'steps': 12086, 'loss/train': 1.5923895835876465} 11/06/2021 22:55:12 - INFO - __main__ - Step 12088: {'lr': 0.0004942911252222267, 'samples': 2320896, 'steps': 12087, 'loss/train': 0.7742838263511658} 11/06/2021 22:55:12 - INFO - __main__ - Step 12089: {'lr': 0.0004942899975678257, 'samples': 2321088, 'steps': 12088, 'loss/train': 1.9447110891342163} 11/06/2021 22:55:13 - INFO - __main__ - Step 12090: {'lr': 0.0004942888698033515, 'samples': 2321280, 'steps': 12089, 'loss/train': 1.889116883277893} 11/06/2021 22:55:14 - INFO - __main__ - Step 12091: {'lr': 0.0004942877419288045, 'samples': 2321472, 'steps': 12090, 'loss/train': 1.8266593217849731} 11/06/2021 22:55:14 - INFO - __main__ - Step 12092: {'lr': 0.0004942866139441851, 'samples': 2321664, 'steps': 12091, 'loss/train': 1.2201491594314575} 11/06/2021 22:55:14 - INFO - __main__ - Step 12093: {'lr': 0.0004942854858494941, 'samples': 2321856, 'steps': 12092, 'loss/train': 1.8048791885375977} 11/06/2021 22:55:15 - INFO - __main__ - Step 12094: {'lr': 0.0004942843576447316, 'samples': 2322048, 'steps': 12093, 'loss/train': 1.6451750993728638} 11/06/2021 22:55:15 - INFO - __main__ - Step 12095: {'lr': 0.0004942832293298986, 'samples': 2322240, 'steps': 12094, 'loss/train': 1.459704041481018} 11/06/2021 22:55:16 - INFO - __main__ - Step 12096: {'lr': 0.0004942821009049952, 'samples': 2322432, 'steps': 12095, 'loss/train': 2.0811355113983154} 11/06/2021 22:55:16 - INFO - __main__ - Step 12097: {'lr': 0.0004942809723700221, 'samples': 2322624, 'steps': 12096, 'loss/train': 1.5584614276885986} 11/06/2021 22:55:17 - INFO - __main__ - Step 12098: {'lr': 0.0004942798437249797, 'samples': 2322816, 'steps': 12097, 'loss/train': 1.5112113952636719} 11/06/2021 22:55:17 - INFO - __main__ - Step 12099: {'lr': 0.0004942787149698687, 'samples': 2323008, 'steps': 12098, 'loss/train': 1.9871211051940918} 11/06/2021 22:55:17 - INFO - __main__ - Step 12100: {'lr': 0.0004942775861046893, 'samples': 2323200, 'steps': 12099, 'loss/train': 2.562870740890503} 11/06/2021 22:55:18 - INFO - __main__ - Step 12101: {'lr': 0.0004942764571294422, 'samples': 2323392, 'steps': 12100, 'loss/train': 1.39982008934021} 11/06/2021 22:55:19 - INFO - __main__ - Step 12102: {'lr': 0.0004942753280441281, 'samples': 2323584, 'steps': 12101, 'loss/train': 2.016968250274658} 11/06/2021 22:55:19 - INFO - __main__ - Step 12103: {'lr': 0.0004942741988487471, 'samples': 2323776, 'steps': 12102, 'loss/train': 1.8526740074157715} 11/06/2021 22:55:19 - INFO - __main__ - Step 12104: {'lr': 0.0004942730695433001, 'samples': 2323968, 'steps': 12103, 'loss/train': 1.4173227548599243} 11/06/2021 22:55:20 - INFO - __main__ - Step 12105: {'lr': 0.0004942719401277873, 'samples': 2324160, 'steps': 12104, 'loss/train': 1.0924817323684692} 11/06/2021 22:55:21 - INFO - __main__ - Step 12106: {'lr': 0.0004942708106022094, 'samples': 2324352, 'steps': 12105, 'loss/train': 1.5277026891708374} 11/06/2021 22:55:21 - INFO - __main__ - Step 12107: {'lr': 0.0004942696809665668, 'samples': 2324544, 'steps': 12106, 'loss/train': 1.9903008937835693} 11/06/2021 22:55:21 - INFO - __main__ - Step 12108: {'lr': 0.0004942685512208599, 'samples': 2324736, 'steps': 12107, 'loss/train': 1.1875405311584473} 11/06/2021 22:55:22 - INFO - __main__ - Step 12109: {'lr': 0.0004942674213650896, 'samples': 2324928, 'steps': 12108, 'loss/train': 1.4761584997177124} 11/06/2021 22:55:22 - INFO - __main__ - Step 12110: {'lr': 0.000494266291399256, 'samples': 2325120, 'steps': 12109, 'loss/train': 1.6016489267349243} 11/06/2021 22:55:23 - INFO - __main__ - Step 12111: {'lr': 0.0004942651613233599, 'samples': 2325312, 'steps': 12110, 'loss/train': 1.8227951526641846} 11/06/2021 22:55:24 - INFO - __main__ - Step 12112: {'lr': 0.0004942640311374017, 'samples': 2325504, 'steps': 12111, 'loss/train': 1.8934657573699951} 11/06/2021 22:55:24 - INFO - __main__ - Step 12113: {'lr': 0.0004942629008413818, 'samples': 2325696, 'steps': 12112, 'loss/train': 1.4961217641830444} 11/06/2021 22:55:24 - INFO - __main__ - Step 12114: {'lr': 0.0004942617704353008, 'samples': 2325888, 'steps': 12113, 'loss/train': 1.5559004545211792} 11/06/2021 22:55:25 - INFO - __main__ - Step 12115: {'lr': 0.0004942606399191593, 'samples': 2326080, 'steps': 12114, 'loss/train': 1.4733009338378906} 11/06/2021 22:55:25 - INFO - __main__ - Step 12116: {'lr': 0.0004942595092929577, 'samples': 2326272, 'steps': 12115, 'loss/train': 1.756181240081787} 11/06/2021 22:55:27 - INFO - __main__ - Step 12117: {'lr': 0.0004942583785566965, 'samples': 2326464, 'steps': 12116, 'loss/train': 1.810673475265503} 11/06/2021 22:55:27 - INFO - __main__ - Step 12118: {'lr': 0.0004942572477103763, 'samples': 2326656, 'steps': 12117, 'loss/train': 1.341216802597046} 11/06/2021 22:55:27 - INFO - __main__ - Step 12119: {'lr': 0.0004942561167539975, 'samples': 2326848, 'steps': 12118, 'loss/train': 1.2394644021987915} 11/06/2021 22:55:28 - INFO - __main__ - Step 12120: {'lr': 0.0004942549856875606, 'samples': 2327040, 'steps': 12119, 'loss/train': 1.7882914543151855} 11/06/2021 22:55:28 - INFO - __main__ - Step 12121: {'lr': 0.0004942538545110663, 'samples': 2327232, 'steps': 12120, 'loss/train': 1.967432975769043} 11/06/2021 22:55:29 - INFO - __main__ - Step 12122: {'lr': 0.0004942527232245149, 'samples': 2327424, 'steps': 12121, 'loss/train': 2.7210159301757812} 11/06/2021 22:55:30 - INFO - __main__ - Step 12123: {'lr': 0.0004942515918279071, 'samples': 2327616, 'steps': 12122, 'loss/train': 1.1575158834457397} 11/06/2021 22:55:30 - INFO - __main__ - Step 12124: {'lr': 0.0004942504603212433, 'samples': 2327808, 'steps': 12123, 'loss/train': 1.8473913669586182} 11/06/2021 22:55:30 - INFO - __main__ - Step 12125: {'lr': 0.0004942493287045239, 'samples': 2328000, 'steps': 12124, 'loss/train': 1.3073318004608154} 11/06/2021 22:55:31 - INFO - __main__ - Step 12126: {'lr': 0.0004942481969777495, 'samples': 2328192, 'steps': 12125, 'loss/train': 2.258380889892578} 11/06/2021 22:55:31 - INFO - __main__ - Step 12127: {'lr': 0.0004942470651409207, 'samples': 2328384, 'steps': 12126, 'loss/train': 1.8729878664016724} 11/06/2021 22:55:32 - INFO - __main__ - Step 12128: {'lr': 0.000494245933194038, 'samples': 2328576, 'steps': 12127, 'loss/train': 1.1568888425827026} 11/06/2021 22:55:33 - INFO - __main__ - Step 12129: {'lr': 0.0004942448011371018, 'samples': 2328768, 'steps': 12128, 'loss/train': 1.8721987009048462} 11/06/2021 22:55:33 - INFO - __main__ - Step 12130: {'lr': 0.0004942436689701126, 'samples': 2328960, 'steps': 12129, 'loss/train': 1.7959274053573608} 11/06/2021 22:55:33 - INFO - __main__ - Step 12131: {'lr': 0.000494242536693071, 'samples': 2329152, 'steps': 12130, 'loss/train': 1.718927264213562} 11/06/2021 22:55:34 - INFO - __main__ - Step 12132: {'lr': 0.0004942414043059776, 'samples': 2329344, 'steps': 12131, 'loss/train': 1.8697373867034912} 11/06/2021 22:55:35 - INFO - __main__ - Step 12133: {'lr': 0.0004942402718088326, 'samples': 2329536, 'steps': 12132, 'loss/train': 1.8126624822616577} 11/06/2021 22:55:35 - INFO - __main__ - Step 12134: {'lr': 0.0004942391392016368, 'samples': 2329728, 'steps': 12133, 'loss/train': 1.6488497257232666} 11/06/2021 22:55:35 - INFO - __main__ - Step 12135: {'lr': 0.0004942380064843906, 'samples': 2329920, 'steps': 12134, 'loss/train': 1.8256616592407227} 11/06/2021 22:55:36 - INFO - __main__ - Step 12136: {'lr': 0.0004942368736570946, 'samples': 2330112, 'steps': 12135, 'loss/train': 1.4627264738082886} 11/06/2021 22:55:36 - INFO - __main__ - Step 12137: {'lr': 0.0004942357407197491, 'samples': 2330304, 'steps': 12136, 'loss/train': 1.6289029121398926} 11/06/2021 22:55:37 - INFO - __main__ - Step 12138: {'lr': 0.0004942346076723548, 'samples': 2330496, 'steps': 12137, 'loss/train': 1.9620447158813477} 11/06/2021 22:55:38 - INFO - __main__ - Step 12139: {'lr': 0.0004942334745149122, 'samples': 2330688, 'steps': 12138, 'loss/train': 2.0332870483398438} 11/06/2021 22:55:38 - INFO - __main__ - Step 12140: {'lr': 0.0004942323412474218, 'samples': 2330880, 'steps': 12139, 'loss/train': 1.301571249961853} 11/06/2021 22:55:38 - INFO - __main__ - Step 12141: {'lr': 0.000494231207869884, 'samples': 2331072, 'steps': 12140, 'loss/train': 1.325974941253662} 11/06/2021 22:55:39 - INFO - __main__ - Step 12142: {'lr': 0.0004942300743822993, 'samples': 2331264, 'steps': 12141, 'loss/train': 1.419464111328125} 11/06/2021 22:55:40 - INFO - __main__ - Step 12143: {'lr': 0.0004942289407846684, 'samples': 2331456, 'steps': 12142, 'loss/train': 0.5399057865142822} 11/06/2021 22:55:40 - INFO - __main__ - Step 12144: {'lr': 0.0004942278070769917, 'samples': 2331648, 'steps': 12143, 'loss/train': 1.7828236818313599} 11/06/2021 22:55:40 - INFO - __main__ - Step 12145: {'lr': 0.0004942266732592697, 'samples': 2331840, 'steps': 12144, 'loss/train': 1.8341064453125} 11/06/2021 22:55:41 - INFO - __main__ - Step 12146: {'lr': 0.0004942255393315029, 'samples': 2332032, 'steps': 12145, 'loss/train': 1.1581087112426758} 11/06/2021 22:55:41 - INFO - __main__ - Step 12147: {'lr': 0.000494224405293692, 'samples': 2332224, 'steps': 12146, 'loss/train': 1.870377779006958} 11/06/2021 22:55:41 - INFO - __main__ - Step 12148: {'lr': 0.0004942232711458372, 'samples': 2332416, 'steps': 12147, 'loss/train': 1.5845227241516113} 11/06/2021 22:55:42 - INFO - __main__ - Step 12149: {'lr': 0.0004942221368879391, 'samples': 2332608, 'steps': 12148, 'loss/train': 1.2885611057281494} 11/06/2021 22:55:43 - INFO - __main__ - Step 12150: {'lr': 0.0004942210025199985, 'samples': 2332800, 'steps': 12149, 'loss/train': 1.941924810409546} 11/06/2021 22:55:43 - INFO - __main__ - Step 12151: {'lr': 0.0004942198680420155, 'samples': 2332992, 'steps': 12150, 'loss/train': 1.2128046751022339} 11/06/2021 22:55:43 - INFO - __main__ - Step 12152: {'lr': 0.0004942187334539908, 'samples': 2333184, 'steps': 12151, 'loss/train': 1.949316382408142} 11/06/2021 22:55:44 - INFO - __main__ - Step 12153: {'lr': 0.0004942175987559251, 'samples': 2333376, 'steps': 12152, 'loss/train': 3.1983020305633545} 11/06/2021 22:55:45 - INFO - __main__ - Step 12154: {'lr': 0.0004942164639478185, 'samples': 2333568, 'steps': 12153, 'loss/train': 2.0099000930786133} 11/06/2021 22:55:45 - INFO - __main__ - Step 12155: {'lr': 0.0004942153290296718, 'samples': 2333760, 'steps': 12154, 'loss/train': 2.0011119842529297} 11/06/2021 22:55:45 - INFO - __main__ - Step 12156: {'lr': 0.0004942141940014854, 'samples': 2333952, 'steps': 12155, 'loss/train': 1.7409350872039795} 11/06/2021 22:55:46 - INFO - __main__ - Step 12157: {'lr': 0.0004942130588632599, 'samples': 2334144, 'steps': 12156, 'loss/train': 2.0252602100372314} 11/06/2021 22:55:46 - INFO - __main__ - Step 12158: {'lr': 0.0004942119236149958, 'samples': 2334336, 'steps': 12157, 'loss/train': 1.6826229095458984} 11/06/2021 22:55:47 - INFO - __main__ - Step 12159: {'lr': 0.0004942107882566936, 'samples': 2334528, 'steps': 12158, 'loss/train': 1.7889363765716553} 11/06/2021 22:55:47 - INFO - __main__ - Step 12160: {'lr': 0.0004942096527883538, 'samples': 2334720, 'steps': 12159, 'loss/train': 1.4643501043319702} 11/06/2021 22:55:48 - INFO - __main__ - Step 12161: {'lr': 0.0004942085172099768, 'samples': 2334912, 'steps': 12160, 'loss/train': 1.5718570947647095} 11/06/2021 22:55:48 - INFO - __main__ - Step 12162: {'lr': 0.0004942073815215632, 'samples': 2335104, 'steps': 12161, 'loss/train': 1.6668059825897217} 11/06/2021 22:55:49 - INFO - __main__ - Step 12163: {'lr': 0.0004942062457231136, 'samples': 2335296, 'steps': 12162, 'loss/train': 1.9814776182174683} 11/06/2021 22:55:50 - INFO - __main__ - Step 12164: {'lr': 0.0004942051098146284, 'samples': 2335488, 'steps': 12163, 'loss/train': 1.9286428689956665} 11/06/2021 22:55:50 - INFO - __main__ - Step 12165: {'lr': 0.0004942039737961081, 'samples': 2335680, 'steps': 12164, 'loss/train': 1.8754050731658936} 11/06/2021 22:55:50 - INFO - __main__ - Step 12166: {'lr': 0.0004942028376675533, 'samples': 2335872, 'steps': 12165, 'loss/train': 1.3288780450820923} 11/06/2021 22:55:51 - INFO - __main__ - Step 12167: {'lr': 0.0004942017014289645, 'samples': 2336064, 'steps': 12166, 'loss/train': 1.694525957107544} 11/06/2021 22:55:51 - INFO - __main__ - Step 12168: {'lr': 0.0004942005650803421, 'samples': 2336256, 'steps': 12167, 'loss/train': 1.5838422775268555} 11/06/2021 22:55:51 - INFO - __main__ - Step 12169: {'lr': 0.0004941994286216867, 'samples': 2336448, 'steps': 12168, 'loss/train': 1.7149986028671265} 11/06/2021 22:55:52 - INFO - __main__ - Step 12170: {'lr': 0.0004941982920529989, 'samples': 2336640, 'steps': 12169, 'loss/train': 1.529640793800354} 11/06/2021 22:55:53 - INFO - __main__ - Step 12171: {'lr': 0.0004941971553742791, 'samples': 2336832, 'steps': 12170, 'loss/train': 1.0539520978927612} 11/06/2021 22:55:53 - INFO - __main__ - Step 12172: {'lr': 0.0004941960185855278, 'samples': 2337024, 'steps': 12171, 'loss/train': 2.0904853343963623} 11/06/2021 22:55:53 - INFO - __main__ - Step 12173: {'lr': 0.0004941948816867455, 'samples': 2337216, 'steps': 12172, 'loss/train': 1.5987201929092407} 11/06/2021 22:55:54 - INFO - __main__ - Step 12174: {'lr': 0.0004941937446779328, 'samples': 2337408, 'steps': 12173, 'loss/train': 1.9558372497558594} 11/06/2021 22:55:55 - INFO - __main__ - Step 12175: {'lr': 0.0004941926075590901, 'samples': 2337600, 'steps': 12174, 'loss/train': 1.9326342344284058} 11/06/2021 22:55:55 - INFO - __main__ - Step 12176: {'lr': 0.0004941914703302181, 'samples': 2337792, 'steps': 12175, 'loss/train': 1.5912054777145386} 11/06/2021 22:55:55 - INFO - __main__ - Step 12177: {'lr': 0.0004941903329913172, 'samples': 2337984, 'steps': 12176, 'loss/train': 1.657957911491394} 11/06/2021 22:55:56 - INFO - __main__ - Step 12178: {'lr': 0.0004941891955423878, 'samples': 2338176, 'steps': 12177, 'loss/train': 2.281742811203003} 11/06/2021 22:55:56 - INFO - __main__ - Step 12179: {'lr': 0.0004941880579834306, 'samples': 2338368, 'steps': 12178, 'loss/train': 1.9655019044876099} 11/06/2021 22:55:57 - INFO - __main__ - Step 12180: {'lr': 0.0004941869203144459, 'samples': 2338560, 'steps': 12179, 'loss/train': 0.7616149187088013} 11/06/2021 22:55:58 - INFO - __main__ - Step 12181: {'lr': 0.0004941857825354344, 'samples': 2338752, 'steps': 12180, 'loss/train': 1.607176423072815} 11/06/2021 22:55:58 - INFO - __main__ - Step 12182: {'lr': 0.0004941846446463966, 'samples': 2338944, 'steps': 12181, 'loss/train': 1.9033536911010742} 11/06/2021 22:55:58 - INFO - __main__ - Step 12183: {'lr': 0.000494183506647333, 'samples': 2339136, 'steps': 12182, 'loss/train': 1.972430944442749} 11/06/2021 22:55:59 - INFO - __main__ - Step 12184: {'lr': 0.000494182368538244, 'samples': 2339328, 'steps': 12183, 'loss/train': 1.4964816570281982} 11/06/2021 22:56:00 - INFO - __main__ - Step 12185: {'lr': 0.0004941812303191302, 'samples': 2339520, 'steps': 12184, 'loss/train': 1.8058403730392456} 11/06/2021 22:56:00 - INFO - __main__ - Step 12186: {'lr': 0.0004941800919899921, 'samples': 2339712, 'steps': 12185, 'loss/train': 1.9677493572235107} 11/06/2021 22:56:00 - INFO - __main__ - Step 12187: {'lr': 0.0004941789535508303, 'samples': 2339904, 'steps': 12186, 'loss/train': 1.774878978729248} 11/06/2021 22:56:01 - INFO - __main__ - Step 12188: {'lr': 0.0004941778150016451, 'samples': 2340096, 'steps': 12187, 'loss/train': 1.5408471822738647} 11/06/2021 22:56:01 - INFO - __main__ - Step 12189: {'lr': 0.0004941766763424373, 'samples': 2340288, 'steps': 12188, 'loss/train': 1.433286190032959} 11/06/2021 22:56:02 - INFO - __main__ - Step 12190: {'lr': 0.0004941755375732071, 'samples': 2340480, 'steps': 12189, 'loss/train': 1.7328075170516968} 11/06/2021 22:56:02 - INFO - __main__ - Step 12191: {'lr': 0.0004941743986939553, 'samples': 2340672, 'steps': 12190, 'loss/train': 1.647670030593872} 11/06/2021 22:56:03 - INFO - __main__ - Step 12192: {'lr': 0.0004941732597046822, 'samples': 2340864, 'steps': 12191, 'loss/train': 1.4415473937988281} 11/06/2021 22:56:03 - INFO - __main__ - Step 12193: {'lr': 0.0004941721206053885, 'samples': 2341056, 'steps': 12192, 'loss/train': 3.6256818771362305} 11/06/2021 22:56:04 - INFO - __main__ - Step 12194: {'lr': 0.0004941709813960745, 'samples': 2341248, 'steps': 12193, 'loss/train': 1.5252056121826172} 11/06/2021 22:56:05 - INFO - __main__ - Step 12195: {'lr': 0.0004941698420767408, 'samples': 2341440, 'steps': 12194, 'loss/train': 1.3925776481628418} 11/06/2021 22:56:05 - INFO - __main__ - Step 12196: {'lr': 0.0004941687026473881, 'samples': 2341632, 'steps': 12195, 'loss/train': 0.3151058852672577} 11/06/2021 22:56:05 - INFO - __main__ - Step 12197: {'lr': 0.0004941675631080166, 'samples': 2341824, 'steps': 12196, 'loss/train': 2.0778465270996094} 11/06/2021 22:56:06 - INFO - __main__ - Step 12198: {'lr': 0.000494166423458627, 'samples': 2342016, 'steps': 12197, 'loss/train': 2.043381929397583} 11/06/2021 22:56:06 - INFO - __main__ - Step 12199: {'lr': 0.0004941652836992198, 'samples': 2342208, 'steps': 12198, 'loss/train': 1.0915753841400146} 11/06/2021 22:56:06 - INFO - __main__ - Step 12200: {'lr': 0.0004941641438297955, 'samples': 2342400, 'steps': 12199, 'loss/train': 1.6756670475006104} 11/06/2021 22:56:07 - INFO - __main__ - Step 12201: {'lr': 0.0004941630038503545, 'samples': 2342592, 'steps': 12200, 'loss/train': 1.9415992498397827} 11/06/2021 22:56:08 - INFO - __main__ - Step 12202: {'lr': 0.0004941618637608976, 'samples': 2342784, 'steps': 12201, 'loss/train': 2.4285848140716553} 11/06/2021 22:56:08 - INFO - __main__ - Step 12203: {'lr': 0.000494160723561425, 'samples': 2342976, 'steps': 12202, 'loss/train': 0.7406458854675293} 11/06/2021 22:56:08 - INFO - __main__ - Step 12204: {'lr': 0.0004941595832519374, 'samples': 2343168, 'steps': 12203, 'loss/train': 1.799933910369873} 11/06/2021 22:56:09 - INFO - __main__ - Step 12205: {'lr': 0.0004941584428324352, 'samples': 2343360, 'steps': 12204, 'loss/train': 1.6155633926391602} 11/06/2021 22:56:10 - INFO - __main__ - Step 12206: {'lr': 0.000494157302302919, 'samples': 2343552, 'steps': 12205, 'loss/train': 1.4476592540740967} 11/06/2021 22:56:10 - INFO - __main__ - Step 12207: {'lr': 0.0004941561616633893, 'samples': 2343744, 'steps': 12206, 'loss/train': 1.3193042278289795} 11/06/2021 22:56:11 - INFO - __main__ - Step 12208: {'lr': 0.0004941550209138466, 'samples': 2343936, 'steps': 12207, 'loss/train': 0.9178276062011719} 11/06/2021 22:56:11 - INFO - __main__ - Step 12209: {'lr': 0.0004941538800542915, 'samples': 2344128, 'steps': 12208, 'loss/train': 2.1289122104644775} 11/06/2021 22:56:11 - INFO - __main__ - Step 12210: {'lr': 0.0004941527390847243, 'samples': 2344320, 'steps': 12209, 'loss/train': 1.1334985494613647} 11/06/2021 22:56:12 - INFO - __main__ - Step 12211: {'lr': 0.0004941515980051457, 'samples': 2344512, 'steps': 12210, 'loss/train': 1.8072400093078613} 11/06/2021 22:56:13 - INFO - __main__ - Step 12212: {'lr': 0.0004941504568155561, 'samples': 2344704, 'steps': 12211, 'loss/train': 2.0195960998535156} 11/06/2021 22:56:13 - INFO - __main__ - Step 12213: {'lr': 0.0004941493155159562, 'samples': 2344896, 'steps': 12212, 'loss/train': 1.8710730075836182} 11/06/2021 22:56:13 - INFO - __main__ - Step 12214: {'lr': 0.0004941481741063462, 'samples': 2345088, 'steps': 12213, 'loss/train': 6.926916599273682} 11/06/2021 22:56:14 - INFO - __main__ - Step 12215: {'lr': 0.000494147032586727, 'samples': 2345280, 'steps': 12214, 'loss/train': 1.6592782735824585} 11/06/2021 22:56:14 - INFO - __main__ - Step 12216: {'lr': 0.0004941458909570988, 'samples': 2345472, 'steps': 12215, 'loss/train': 2.1829493045806885} 11/06/2021 22:56:14 - INFO - __main__ - Step 12217: {'lr': 0.0004941447492174622, 'samples': 2345664, 'steps': 12216, 'loss/train': 1.9610031843185425} 11/06/2021 22:56:15 - INFO - __main__ - Step 12218: {'lr': 0.0004941436073678179, 'samples': 2345856, 'steps': 12217, 'loss/train': 1.678484320640564} 11/06/2021 22:56:16 - INFO - __main__ - Step 12219: {'lr': 0.0004941424654081661, 'samples': 2346048, 'steps': 12218, 'loss/train': 1.9036375284194946} 11/06/2021 22:56:16 - INFO - __main__ - Step 12220: {'lr': 0.0004941413233385075, 'samples': 2346240, 'steps': 12219, 'loss/train': 1.5224061012268066} 11/06/2021 22:56:16 - INFO - __main__ - Step 12221: {'lr': 0.0004941401811588426, 'samples': 2346432, 'steps': 12220, 'loss/train': 1.5389779806137085} 11/06/2021 22:56:17 - INFO - __main__ - Step 12222: {'lr': 0.0004941390388691719, 'samples': 2346624, 'steps': 12221, 'loss/train': 1.6822108030319214} 11/06/2021 22:56:18 - INFO - __main__ - Step 12223: {'lr': 0.0004941378964694959, 'samples': 2346816, 'steps': 12222, 'loss/train': 1.6283819675445557} 11/06/2021 22:56:18 - INFO - __main__ - Step 12224: {'lr': 0.0004941367539598152, 'samples': 2347008, 'steps': 12223, 'loss/train': 2.0302186012268066} 11/06/2021 22:56:19 - INFO - __main__ - Step 12225: {'lr': 0.0004941356113401301, 'samples': 2347200, 'steps': 12224, 'loss/train': 1.9460519552230835} 11/06/2021 22:56:19 - INFO - __main__ - Step 12226: {'lr': 0.0004941344686104414, 'samples': 2347392, 'steps': 12225, 'loss/train': 1.5134607553482056} 11/06/2021 22:56:19 - INFO - __main__ - Step 12227: {'lr': 0.0004941333257707495, 'samples': 2347584, 'steps': 12226, 'loss/train': 1.468488097190857} 11/06/2021 22:56:20 - INFO - __main__ - Step 12228: {'lr': 0.0004941321828210548, 'samples': 2347776, 'steps': 12227, 'loss/train': 1.6733677387237549} 11/06/2021 22:56:21 - INFO - __main__ - Step 12229: {'lr': 0.000494131039761358, 'samples': 2347968, 'steps': 12228, 'loss/train': 1.7591148614883423} 11/06/2021 22:56:21 - INFO - __main__ - Step 12230: {'lr': 0.0004941298965916594, 'samples': 2348160, 'steps': 12229, 'loss/train': 7.689951419830322} 11/06/2021 22:56:21 - INFO - __main__ - Step 12231: {'lr': 0.0004941287533119597, 'samples': 2348352, 'steps': 12230, 'loss/train': 1.1296072006225586} 11/06/2021 22:56:22 - INFO - __main__ - Step 12232: {'lr': 0.0004941276099222593, 'samples': 2348544, 'steps': 12231, 'loss/train': 1.8858994245529175} 11/06/2021 22:56:22 - INFO - __main__ - Step 12233: {'lr': 0.0004941264664225589, 'samples': 2348736, 'steps': 12232, 'loss/train': 1.4118883609771729} 11/06/2021 22:56:23 - INFO - __main__ - Step 12234: {'lr': 0.0004941253228128588, 'samples': 2348928, 'steps': 12233, 'loss/train': 2.075518846511841} 11/06/2021 22:56:24 - INFO - __main__ - Step 12235: {'lr': 0.0004941241790931595, 'samples': 2349120, 'steps': 12234, 'loss/train': 1.6000714302062988} 11/06/2021 22:56:24 - INFO - __main__ - Step 12236: {'lr': 0.0004941230352634617, 'samples': 2349312, 'steps': 12235, 'loss/train': 1.9275367259979248} 11/06/2021 22:56:24 - INFO - __main__ - Step 12237: {'lr': 0.0004941218913237658, 'samples': 2349504, 'steps': 12236, 'loss/train': 2.1970300674438477} 11/06/2021 22:56:25 - INFO - __main__ - Step 12238: {'lr': 0.0004941207472740724, 'samples': 2349696, 'steps': 12237, 'loss/train': 1.9084161520004272} 11/06/2021 22:56:26 - INFO - __main__ - Step 12239: {'lr': 0.000494119603114382, 'samples': 2349888, 'steps': 12238, 'loss/train': 2.308227062225342} 11/06/2021 22:56:26 - INFO - __main__ - Step 12240: {'lr': 0.000494118458844695, 'samples': 2350080, 'steps': 12239, 'loss/train': 2.078315019607544} 11/06/2021 22:56:26 - INFO - __main__ - Step 12241: {'lr': 0.0004941173144650119, 'samples': 2350272, 'steps': 12240, 'loss/train': 1.4600110054016113} 11/06/2021 22:56:27 - INFO - __main__ - Step 12242: {'lr': 0.0004941161699753335, 'samples': 2350464, 'steps': 12241, 'loss/train': 2.5496790409088135} 11/06/2021 22:56:27 - INFO - __main__ - Step 12243: {'lr': 0.00049411502537566, 'samples': 2350656, 'steps': 12242, 'loss/train': 1.7416751384735107} 11/06/2021 22:56:28 - INFO - __main__ - Step 12244: {'lr': 0.0004941138806659921, 'samples': 2350848, 'steps': 12243, 'loss/train': 1.8958357572555542} 11/06/2021 22:56:29 - INFO - __main__ - Step 12245: {'lr': 0.00049411273584633, 'samples': 2351040, 'steps': 12244, 'loss/train': 1.8675121068954468} 11/06/2021 22:56:29 - INFO - __main__ - Step 12246: {'lr': 0.0004941115909166748, 'samples': 2351232, 'steps': 12245, 'loss/train': 1.8117626905441284} 11/06/2021 22:56:29 - INFO - __main__ - Step 12247: {'lr': 0.0004941104458770266, 'samples': 2351424, 'steps': 12246, 'loss/train': 1.715092420578003} 11/06/2021 22:56:30 - INFO - __main__ - Step 12248: {'lr': 0.0004941093007273859, 'samples': 2351616, 'steps': 12247, 'loss/train': 2.001997470855713} 11/06/2021 22:56:30 - INFO - __main__ - Step 12249: {'lr': 0.0004941081554677534, 'samples': 2351808, 'steps': 12248, 'loss/train': 1.8709745407104492} 11/06/2021 22:56:31 - INFO - __main__ - Step 12250: {'lr': 0.0004941070100981295, 'samples': 2352000, 'steps': 12249, 'loss/train': 2.0071144104003906} 11/06/2021 22:56:32 - INFO - __main__ - Step 12251: {'lr': 0.0004941058646185148, 'samples': 2352192, 'steps': 12250, 'loss/train': 1.7076789140701294} 11/06/2021 22:56:32 - INFO - __main__ - Step 12252: {'lr': 0.0004941047190289096, 'samples': 2352384, 'steps': 12251, 'loss/train': 2.314211845397949} 11/06/2021 22:56:32 - INFO - __main__ - Step 12253: {'lr': 0.0004941035733293148, 'samples': 2352576, 'steps': 12252, 'loss/train': 2.0742650032043457} 11/06/2021 22:56:33 - INFO - __main__ - Step 12254: {'lr': 0.0004941024275197305, 'samples': 2352768, 'steps': 12253, 'loss/train': 1.7907441854476929} 11/06/2021 22:56:34 - INFO - __main__ - Step 12255: {'lr': 0.0004941012816001575, 'samples': 2352960, 'steps': 12254, 'loss/train': 1.6902923583984375} 11/06/2021 22:56:34 - INFO - __main__ - Step 12256: {'lr': 0.0004941001355705963, 'samples': 2353152, 'steps': 12255, 'loss/train': 1.4194196462631226} 11/06/2021 22:56:34 - INFO - __main__ - Step 12257: {'lr': 0.0004940989894310473, 'samples': 2353344, 'steps': 12256, 'loss/train': 2.029238224029541} 11/06/2021 22:56:35 - INFO - __main__ - Step 12258: {'lr': 0.000494097843181511, 'samples': 2353536, 'steps': 12257, 'loss/train': 1.5298515558242798} 11/06/2021 22:56:35 - INFO - __main__ - Step 12259: {'lr': 0.0004940966968219881, 'samples': 2353728, 'steps': 12258, 'loss/train': 1.807405948638916} 11/06/2021 22:56:35 - INFO - __main__ - Step 12260: {'lr': 0.0004940955503524789, 'samples': 2353920, 'steps': 12259, 'loss/train': 1.857525110244751} 11/06/2021 22:56:36 - INFO - __main__ - Step 12261: {'lr': 0.000494094403772984, 'samples': 2354112, 'steps': 12260, 'loss/train': 1.2936711311340332} 11/06/2021 22:56:37 - INFO - __main__ - Step 12262: {'lr': 0.0004940932570835039, 'samples': 2354304, 'steps': 12261, 'loss/train': 1.5933314561843872} 11/06/2021 22:56:37 - INFO - __main__ - Step 12263: {'lr': 0.0004940921102840393, 'samples': 2354496, 'steps': 12262, 'loss/train': 1.8495118618011475} 11/06/2021 22:56:37 - INFO - __main__ - Step 12264: {'lr': 0.0004940909633745905, 'samples': 2354688, 'steps': 12263, 'loss/train': 1.749647617340088} 11/06/2021 22:56:38 - INFO - __main__ - Step 12265: {'lr': 0.000494089816355158, 'samples': 2354880, 'steps': 12264, 'loss/train': 1.2525612115859985} 11/06/2021 22:56:39 - INFO - __main__ - Step 12266: {'lr': 0.0004940886692257424, 'samples': 2355072, 'steps': 12265, 'loss/train': 1.6700724363327026} 11/06/2021 22:56:40 - INFO - __main__ - Step 12267: {'lr': 0.0004940875219863443, 'samples': 2355264, 'steps': 12266, 'loss/train': 1.503882646560669} 11/06/2021 22:56:40 - INFO - __main__ - Step 12268: {'lr': 0.0004940863746369641, 'samples': 2355456, 'steps': 12267, 'loss/train': 1.7664172649383545} 11/06/2021 22:56:40 - INFO - __main__ - Step 12269: {'lr': 0.0004940852271776023, 'samples': 2355648, 'steps': 12268, 'loss/train': 1.7359364032745361} 11/06/2021 22:56:41 - INFO - __main__ - Step 12270: {'lr': 0.0004940840796082594, 'samples': 2355840, 'steps': 12269, 'loss/train': 1.848609447479248} 11/06/2021 22:56:41 - INFO - __main__ - Step 12271: {'lr': 0.0004940829319289361, 'samples': 2356032, 'steps': 12270, 'loss/train': 1.8408467769622803} 11/06/2021 22:56:42 - INFO - __main__ - Step 12272: {'lr': 0.0004940817841396327, 'samples': 2356224, 'steps': 12271, 'loss/train': 1.7952295541763306} 11/06/2021 22:56:42 - INFO - __main__ - Step 12273: {'lr': 0.0004940806362403499, 'samples': 2356416, 'steps': 12272, 'loss/train': 2.3083720207214355} 11/06/2021 22:56:43 - INFO - __main__ - Step 12274: {'lr': 0.0004940794882310882, 'samples': 2356608, 'steps': 12273, 'loss/train': 1.8015666007995605} 11/06/2021 22:56:43 - INFO - __main__ - Step 12275: {'lr': 0.000494078340111848, 'samples': 2356800, 'steps': 12274, 'loss/train': 1.8969393968582153} 11/06/2021 22:56:43 - INFO - __main__ - Step 12276: {'lr': 0.0004940771918826298, 'samples': 2356992, 'steps': 12275, 'loss/train': 1.4344086647033691} 11/06/2021 22:56:44 - INFO - __main__ - Step 12277: {'lr': 0.0004940760435434341, 'samples': 2357184, 'steps': 12276, 'loss/train': 1.9090864658355713} 11/06/2021 22:56:45 - INFO - __main__ - Step 12278: {'lr': 0.0004940748950942618, 'samples': 2357376, 'steps': 12277, 'loss/train': 1.6862670183181763} 11/06/2021 22:56:45 - INFO - __main__ - Step 12279: {'lr': 0.0004940737465351128, 'samples': 2357568, 'steps': 12278, 'loss/train': 1.9722408056259155} 11/06/2021 22:56:45 - INFO - __main__ - Step 12280: {'lr': 0.0004940725978659881, 'samples': 2357760, 'steps': 12279, 'loss/train': 1.9200348854064941} 11/06/2021 22:56:46 - INFO - __main__ - Step 12281: {'lr': 0.000494071449086888, 'samples': 2357952, 'steps': 12280, 'loss/train': 1.9199883937835693} 11/06/2021 22:56:47 - INFO - __main__ - Step 12282: {'lr': 0.0004940703001978131, 'samples': 2358144, 'steps': 12281, 'loss/train': 1.2483066320419312} 11/06/2021 22:56:47 - INFO - __main__ - Step 12283: {'lr': 0.0004940691511987639, 'samples': 2358336, 'steps': 12282, 'loss/train': 1.7513079643249512} 11/06/2021 22:56:47 - INFO - __main__ - Step 12284: {'lr': 0.0004940680020897409, 'samples': 2358528, 'steps': 12283, 'loss/train': 5.813735485076904} 11/06/2021 22:56:48 - INFO - __main__ - Step 12285: {'lr': 0.0004940668528707446, 'samples': 2358720, 'steps': 12284, 'loss/train': 1.4615005254745483} 11/06/2021 22:56:48 - INFO - __main__ - Step 12286: {'lr': 0.0004940657035417755, 'samples': 2358912, 'steps': 12285, 'loss/train': 2.1803793907165527} 11/06/2021 22:56:49 - INFO - __main__ - Step 12287: {'lr': 0.0004940645541028343, 'samples': 2359104, 'steps': 12286, 'loss/train': 3.126666307449341} 11/06/2021 22:56:50 - INFO - __main__ - Step 12288: {'lr': 0.0004940634045539213, 'samples': 2359296, 'steps': 12287, 'loss/train': 1.47324800491333} 11/06/2021 22:56:50 - INFO - __main__ - Step 12289: {'lr': 0.000494062254895037, 'samples': 2359488, 'steps': 12288, 'loss/train': 1.7035387754440308} 11/06/2021 22:56:50 - INFO - __main__ - Step 12290: {'lr': 0.0004940611051261822, 'samples': 2359680, 'steps': 12289, 'loss/train': 1.6604784727096558} 11/06/2021 22:56:51 - INFO - __main__ - Step 12291: {'lr': 0.000494059955247357, 'samples': 2359872, 'steps': 12290, 'loss/train': 1.833449125289917} 11/06/2021 22:56:51 - INFO - __main__ - Step 12292: {'lr': 0.0004940588052585624, 'samples': 2360064, 'steps': 12291, 'loss/train': 1.7617274522781372} 11/06/2021 22:56:52 - INFO - __main__ - Step 12293: {'lr': 0.0004940576551597985, 'samples': 2360256, 'steps': 12292, 'loss/train': 1.5269970893859863} 11/06/2021 22:56:52 - INFO - __main__ - Step 12294: {'lr': 0.000494056504951066, 'samples': 2360448, 'steps': 12293, 'loss/train': 0.993115246295929} 11/06/2021 22:56:53 - INFO - __main__ - Step 12295: {'lr': 0.0004940553546323655, 'samples': 2360640, 'steps': 12294, 'loss/train': 1.4290341138839722} 11/06/2021 22:56:53 - INFO - __main__ - Step 12296: {'lr': 0.0004940542042036974, 'samples': 2360832, 'steps': 12295, 'loss/train': 1.7313815355300903} 11/06/2021 22:56:53 - INFO - __main__ - Step 12297: {'lr': 0.0004940530536650621, 'samples': 2361024, 'steps': 12296, 'loss/train': 1.907927393913269} 11/06/2021 22:56:54 - INFO - __main__ - Step 12298: {'lr': 0.0004940519030164605, 'samples': 2361216, 'steps': 12297, 'loss/train': 1.4239506721496582} 11/06/2021 22:56:55 - INFO - __main__ - Step 12299: {'lr': 0.0004940507522578927, 'samples': 2361408, 'steps': 12298, 'loss/train': 2.017463207244873} 11/06/2021 22:56:55 - INFO - __main__ - Step 12300: {'lr': 0.0004940496013893594, 'samples': 2361600, 'steps': 12299, 'loss/train': 1.8298779726028442} 11/06/2021 22:56:55 - INFO - __main__ - Step 12301: {'lr': 0.0004940484504108612, 'samples': 2361792, 'steps': 12300, 'loss/train': 1.4371073246002197} 11/06/2021 22:56:56 - INFO - __main__ - Step 12302: {'lr': 0.0004940472993223985, 'samples': 2361984, 'steps': 12301, 'loss/train': 1.8154419660568237} 11/06/2021 22:56:56 - INFO - __main__ - Step 12303: {'lr': 0.0004940461481239719, 'samples': 2362176, 'steps': 12302, 'loss/train': 1.6027586460113525} 11/06/2021 22:56:57 - INFO - __main__ - Step 12304: {'lr': 0.0004940449968155818, 'samples': 2362368, 'steps': 12303, 'loss/train': 2.39496111869812} 11/06/2021 22:56:58 - INFO - __main__ - Step 12305: {'lr': 0.0004940438453972288, 'samples': 2362560, 'steps': 12304, 'loss/train': 1.6671775579452515} 11/06/2021 22:56:58 - INFO - __main__ - Step 12306: {'lr': 0.0004940426938689135, 'samples': 2362752, 'steps': 12305, 'loss/train': 1.91392183303833} 11/06/2021 22:56:58 - INFO - __main__ - Step 12307: {'lr': 0.0004940415422306361, 'samples': 2362944, 'steps': 12306, 'loss/train': 1.1845285892486572} 11/06/2021 22:56:59 - INFO - __main__ - Step 12308: {'lr': 0.0004940403904823976, 'samples': 2363136, 'steps': 12307, 'loss/train': 1.8613651990890503} 11/06/2021 22:57:00 - INFO - __main__ - Step 12309: {'lr': 0.0004940392386241981, 'samples': 2363328, 'steps': 12308, 'loss/train': 2.2593603134155273} 11/06/2021 22:57:00 - INFO - __main__ - Step 12310: {'lr': 0.0004940380866560384, 'samples': 2363520, 'steps': 12309, 'loss/train': 1.829590916633606} 11/06/2021 22:57:00 - INFO - __main__ - Step 12311: {'lr': 0.0004940369345779187, 'samples': 2363712, 'steps': 12310, 'loss/train': 1.535210371017456} 11/06/2021 22:57:01 - INFO - __main__ - Step 12312: {'lr': 0.00049403578238984, 'samples': 2363904, 'steps': 12311, 'loss/train': 1.498138189315796} 11/06/2021 22:57:01 - INFO - __main__ - Step 12313: {'lr': 0.0004940346300918024, 'samples': 2364096, 'steps': 12312, 'loss/train': 1.6373891830444336} 11/06/2021 22:57:02 - INFO - __main__ - Step 12314: {'lr': 0.0004940334776838065, 'samples': 2364288, 'steps': 12313, 'loss/train': 2.0152111053466797} 11/06/2021 22:57:02 - INFO - __main__ - Step 12315: {'lr': 0.000494032325165853, 'samples': 2364480, 'steps': 12314, 'loss/train': 1.4806876182556152} 11/06/2021 22:57:03 - INFO - __main__ - Step 12316: {'lr': 0.0004940311725379423, 'samples': 2364672, 'steps': 12315, 'loss/train': 1.7899569272994995} 11/06/2021 22:57:03 - INFO - __main__ - Step 12317: {'lr': 0.0004940300198000748, 'samples': 2364864, 'steps': 12316, 'loss/train': 1.8684685230255127} 11/06/2021 22:57:03 - INFO - __main__ - Step 12318: {'lr': 0.0004940288669522513, 'samples': 2365056, 'steps': 12317, 'loss/train': 1.5197453498840332} 11/06/2021 22:57:05 - INFO - __main__ - Step 12319: {'lr': 0.000494027713994472, 'samples': 2365248, 'steps': 12318, 'loss/train': 1.7379388809204102} 11/06/2021 22:57:05 - INFO - __main__ - Step 12320: {'lr': 0.0004940265609267377, 'samples': 2365440, 'steps': 12319, 'loss/train': 1.384351372718811} 11/06/2021 22:57:05 - INFO - __main__ - Step 12321: {'lr': 0.0004940254077490487, 'samples': 2365632, 'steps': 12320, 'loss/train': 2.0284841060638428} 11/06/2021 22:57:06 - INFO - __main__ - Step 12322: {'lr': 0.0004940242544614056, 'samples': 2365824, 'steps': 12321, 'loss/train': 1.541722297668457} 11/06/2021 22:57:06 - INFO - __main__ - Step 12323: {'lr': 0.0004940231010638091, 'samples': 2366016, 'steps': 12322, 'loss/train': 1.9831137657165527} 11/06/2021 22:57:07 - INFO - __main__ - Step 12324: {'lr': 0.0004940219475562593, 'samples': 2366208, 'steps': 12323, 'loss/train': 0.3079480528831482} 11/06/2021 22:57:08 - INFO - __main__ - Step 12325: {'lr': 0.0004940207939387573, 'samples': 2366400, 'steps': 12324, 'loss/train': 1.5259820222854614} 11/06/2021 22:57:08 - INFO - __main__ - Step 12326: {'lr': 0.0004940196402113031, 'samples': 2366592, 'steps': 12325, 'loss/train': 1.683254361152649} 11/06/2021 22:57:08 - INFO - __main__ - Step 12327: {'lr': 0.0004940184863738975, 'samples': 2366784, 'steps': 12326, 'loss/train': 1.8076988458633423} 11/06/2021 22:57:09 - INFO - __main__ - Step 12328: {'lr': 0.0004940173324265407, 'samples': 2366976, 'steps': 12327, 'loss/train': 1.4422165155410767} 11/06/2021 22:57:10 - INFO - __main__ - Step 12329: {'lr': 0.0004940161783692338, 'samples': 2367168, 'steps': 12328, 'loss/train': 1.9659552574157715} 11/06/2021 22:57:10 - INFO - __main__ - Step 12330: {'lr': 0.0004940150242019768, 'samples': 2367360, 'steps': 12329, 'loss/train': 1.24222731590271} 11/06/2021 22:57:11 - INFO - __main__ - Step 12331: {'lr': 0.0004940138699247704, 'samples': 2367552, 'steps': 12330, 'loss/train': 2.182091474533081} 11/06/2021 22:57:11 - INFO - __main__ - Step 12332: {'lr': 0.0004940127155376151, 'samples': 2367744, 'steps': 12331, 'loss/train': 1.8635755777359009} 11/06/2021 22:57:11 - INFO - __main__ - Step 12333: {'lr': 0.0004940115610405114, 'samples': 2367936, 'steps': 12332, 'loss/train': 1.9940561056137085} 11/06/2021 22:57:12 - INFO - __main__ - Step 12334: {'lr': 0.0004940104064334599, 'samples': 2368128, 'steps': 12333, 'loss/train': 1.4782226085662842} 11/06/2021 22:57:12 - INFO - __main__ - Step 12335: {'lr': 0.0004940092517164612, 'samples': 2368320, 'steps': 12334, 'loss/train': 1.205909252166748} 11/06/2021 22:57:13 - INFO - __main__ - Step 12336: {'lr': 0.0004940080968895155, 'samples': 2368512, 'steps': 12335, 'loss/train': 2.1496191024780273} 11/06/2021 22:57:13 - INFO - __main__ - Step 12337: {'lr': 0.0004940069419526236, 'samples': 2368704, 'steps': 12336, 'loss/train': 1.5507055521011353} 11/06/2021 22:57:14 - INFO - __main__ - Step 12338: {'lr': 0.0004940057869057859, 'samples': 2368896, 'steps': 12337, 'loss/train': 0.8827195763587952} 11/06/2021 22:57:14 - INFO - __main__ - Step 12339: {'lr': 0.000494004631749003, 'samples': 2369088, 'steps': 12338, 'loss/train': 1.7027802467346191} 11/06/2021 22:57:14 - INFO - __main__ - Step 12340: {'lr': 0.0004940034764822754, 'samples': 2369280, 'steps': 12339, 'loss/train': 1.6618696451187134} 11/06/2021 22:57:15 - INFO - __main__ - Step 12341: {'lr': 0.0004940023211056036, 'samples': 2369472, 'steps': 12340, 'loss/train': 1.9544622898101807} 11/06/2021 22:57:16 - INFO - __main__ - Step 12342: {'lr': 0.0004940011656189881, 'samples': 2369664, 'steps': 12341, 'loss/train': 1.8119089603424072} 11/06/2021 22:57:16 - INFO - __main__ - Step 12343: {'lr': 0.0004940000100224295, 'samples': 2369856, 'steps': 12342, 'loss/train': 1.2512726783752441} 11/06/2021 22:57:17 - INFO - __main__ - Step 12344: {'lr': 0.0004939988543159282, 'samples': 2370048, 'steps': 12343, 'loss/train': 1.5961833000183105} 11/06/2021 22:57:17 - INFO - __main__ - Step 12345: {'lr': 0.0004939976984994847, 'samples': 2370240, 'steps': 12344, 'loss/train': 1.501064419746399} 11/06/2021 22:57:18 - INFO - __main__ - Step 12346: {'lr': 0.0004939965425730996, 'samples': 2370432, 'steps': 12345, 'loss/train': 1.6873294115066528} 11/06/2021 22:57:18 - INFO - __main__ - Step 12347: {'lr': 0.0004939953865367735, 'samples': 2370624, 'steps': 12346, 'loss/train': 1.1613168716430664} 11/06/2021 22:57:19 - INFO - __main__ - Step 12348: {'lr': 0.0004939942303905069, 'samples': 2370816, 'steps': 12347, 'loss/train': 1.5683062076568604} 11/06/2021 22:57:19 - INFO - __main__ - Step 12349: {'lr': 0.0004939930741343002, 'samples': 2371008, 'steps': 12348, 'loss/train': 1.0350655317306519} 11/06/2021 22:57:19 - INFO - __main__ - Step 12350: {'lr': 0.000493991917768154, 'samples': 2371200, 'steps': 12349, 'loss/train': 1.5559569597244263} 11/06/2021 22:57:20 - INFO - __main__ - Step 12351: {'lr': 0.0004939907612920688, 'samples': 2371392, 'steps': 12350, 'loss/train': 1.799564003944397} 11/06/2021 22:57:21 - INFO - __main__ - Step 12352: {'lr': 0.0004939896047060451, 'samples': 2371584, 'steps': 12351, 'loss/train': 1.0660388469696045} 11/06/2021 22:57:21 - INFO - __main__ - Step 12353: {'lr': 0.0004939884480100836, 'samples': 2371776, 'steps': 12352, 'loss/train': 1.4161072969436646} 11/06/2021 22:57:21 - INFO - __main__ - Step 12354: {'lr': 0.0004939872912041844, 'samples': 2371968, 'steps': 12353, 'loss/train': 1.7622441053390503} 11/06/2021 22:57:22 - INFO - __main__ - Step 12355: {'lr': 0.0004939861342883485, 'samples': 2372160, 'steps': 12354, 'loss/train': 1.4472774267196655} 11/06/2021 22:57:22 - INFO - __main__ - Step 12356: {'lr': 0.0004939849772625761, 'samples': 2372352, 'steps': 12355, 'loss/train': 1.7829986810684204} 11/06/2021 22:57:23 - INFO - __main__ - Step 12357: {'lr': 0.0004939838201268679, 'samples': 2372544, 'steps': 12356, 'loss/train': 2.1662769317626953} 11/06/2021 22:57:24 - INFO - __main__ - Step 12358: {'lr': 0.0004939826628812244, 'samples': 2372736, 'steps': 12357, 'loss/train': 1.5202739238739014} 11/06/2021 22:57:24 - INFO - __main__ - Step 12359: {'lr': 0.000493981505525646, 'samples': 2372928, 'steps': 12358, 'loss/train': 1.9102394580841064} 11/06/2021 22:57:24 - INFO - __main__ - Step 12360: {'lr': 0.0004939803480601333, 'samples': 2373120, 'steps': 12359, 'loss/train': 1.4958027601242065} 11/06/2021 22:57:25 - INFO - __main__ - Step 12361: {'lr': 0.0004939791904846869, 'samples': 2373312, 'steps': 12360, 'loss/train': 1.0917266607284546} 11/06/2021 22:57:26 - INFO - __main__ - Step 12362: {'lr': 0.0004939780327993072, 'samples': 2373504, 'steps': 12361, 'loss/train': 1.7016918659210205} 11/06/2021 22:57:26 - INFO - __main__ - Step 12363: {'lr': 0.0004939768750039946, 'samples': 2373696, 'steps': 12362, 'loss/train': 1.3413039445877075} 11/06/2021 22:57:26 - INFO - __main__ - Step 12364: {'lr': 0.00049397571709875, 'samples': 2373888, 'steps': 12363, 'loss/train': 2.0355639457702637} 11/06/2021 22:57:27 - INFO - __main__ - Step 12365: {'lr': 0.0004939745590835736, 'samples': 2374080, 'steps': 12364, 'loss/train': 2.2837467193603516} 11/06/2021 22:57:27 - INFO - __main__ - Step 12366: {'lr': 0.0004939734009584661, 'samples': 2374272, 'steps': 12365, 'loss/train': 2.075040578842163} 11/06/2021 22:57:28 - INFO - __main__ - Step 12367: {'lr': 0.0004939722427234279, 'samples': 2374464, 'steps': 12366, 'loss/train': 1.9329279661178589} 11/06/2021 22:57:28 - INFO - __main__ - Step 12368: {'lr': 0.0004939710843784596, 'samples': 2374656, 'steps': 12367, 'loss/train': 1.624833106994629} 11/06/2021 22:57:29 - INFO - __main__ - Step 12369: {'lr': 0.0004939699259235617, 'samples': 2374848, 'steps': 12368, 'loss/train': 1.8931273221969604} 11/06/2021 22:57:29 - INFO - __main__ - Step 12370: {'lr': 0.0004939687673587346, 'samples': 2375040, 'steps': 12369, 'loss/train': 1.79799222946167} 11/06/2021 22:57:29 - INFO - __main__ - Step 12371: {'lr': 0.0004939676086839791, 'samples': 2375232, 'steps': 12370, 'loss/train': 1.7580457925796509} 11/06/2021 22:57:30 - INFO - __main__ - Step 12372: {'lr': 0.0004939664498992955, 'samples': 2375424, 'steps': 12371, 'loss/train': 1.5857473611831665} 11/06/2021 22:57:31 - INFO - __main__ - Step 12373: {'lr': 0.0004939652910046844, 'samples': 2375616, 'steps': 12372, 'loss/train': 1.6514397859573364} 11/06/2021 22:57:31 - INFO - __main__ - Step 12374: {'lr': 0.0004939641320001462, 'samples': 2375808, 'steps': 12373, 'loss/train': 1.8684816360473633} 11/06/2021 22:57:31 - INFO - __main__ - Step 12375: {'lr': 0.0004939629728856817, 'samples': 2376000, 'steps': 12374, 'loss/train': 1.2412973642349243} 11/06/2021 22:57:32 - INFO - __main__ - Step 12376: {'lr': 0.0004939618136612911, 'samples': 2376192, 'steps': 12375, 'loss/train': 1.7167083024978638} 11/06/2021 22:57:33 - INFO - __main__ - Step 12377: {'lr': 0.0004939606543269751, 'samples': 2376384, 'steps': 12376, 'loss/train': 1.4133563041687012} 11/06/2021 22:57:33 - INFO - __main__ - Step 12378: {'lr': 0.0004939594948827343, 'samples': 2376576, 'steps': 12377, 'loss/train': 1.8891206979751587} 11/06/2021 22:57:34 - INFO - __main__ - Step 12379: {'lr': 0.000493958335328569, 'samples': 2376768, 'steps': 12378, 'loss/train': 1.7850967645645142} 11/06/2021 22:57:34 - INFO - __main__ - Step 12380: {'lr': 0.0004939571756644799, 'samples': 2376960, 'steps': 12379, 'loss/train': 1.6386349201202393} 11/06/2021 22:57:34 - INFO - __main__ - Step 12381: {'lr': 0.0004939560158904675, 'samples': 2377152, 'steps': 12380, 'loss/train': 0.9704276323318481} 11/06/2021 22:57:36 - INFO - __main__ - Step 12382: {'lr': 0.0004939548560065322, 'samples': 2377344, 'steps': 12381, 'loss/train': 1.7817976474761963} 11/06/2021 22:57:36 - INFO - __main__ - Step 12383: {'lr': 0.0004939536960126746, 'samples': 2377536, 'steps': 12382, 'loss/train': 1.5621471405029297} 11/06/2021 22:57:37 - INFO - __main__ - Step 12384: {'lr': 0.0004939525359088953, 'samples': 2377728, 'steps': 12383, 'loss/train': 1.8215516805648804} 11/06/2021 22:57:37 - INFO - __main__ - Step 12385: {'lr': 0.0004939513756951946, 'samples': 2377920, 'steps': 12384, 'loss/train': 1.7546138763427734} 11/06/2021 22:57:37 - INFO - __main__ - Step 12386: {'lr': 0.0004939502153715733, 'samples': 2378112, 'steps': 12385, 'loss/train': 1.881219744682312} 11/06/2021 22:57:38 - INFO - __main__ - Step 12387: {'lr': 0.0004939490549380318, 'samples': 2378304, 'steps': 12386, 'loss/train': 2.4455485343933105} 11/06/2021 22:57:38 - INFO - __main__ - Step 12388: {'lr': 0.0004939478943945706, 'samples': 2378496, 'steps': 12387, 'loss/train': 1.8239833116531372} 11/06/2021 22:57:39 - INFO - __main__ - Step 12389: {'lr': 0.0004939467337411903, 'samples': 2378688, 'steps': 12388, 'loss/train': 1.7355788946151733} 11/06/2021 22:57:39 - INFO - __main__ - Step 12390: {'lr': 0.0004939455729778912, 'samples': 2378880, 'steps': 12389, 'loss/train': 1.7384986877441406} 11/06/2021 22:57:40 - INFO - __main__ - Step 12391: {'lr': 0.0004939444121046741, 'samples': 2379072, 'steps': 12390, 'loss/train': 1.171319603919983} 11/06/2021 22:57:40 - INFO - __main__ - Step 12392: {'lr': 0.0004939432511215395, 'samples': 2379264, 'steps': 12391, 'loss/train': 1.6471529006958008} 11/06/2021 22:57:40 - INFO - __main__ - Step 12393: {'lr': 0.0004939420900284876, 'samples': 2379456, 'steps': 12392, 'loss/train': 1.4227564334869385} 11/06/2021 22:57:41 - INFO - __main__ - Step 12394: {'lr': 0.0004939409288255194, 'samples': 2379648, 'steps': 12393, 'loss/train': 1.9644057750701904} 11/06/2021 22:57:42 - INFO - __main__ - Step 12395: {'lr': 0.000493939767512635, 'samples': 2379840, 'steps': 12394, 'loss/train': 1.767325758934021} 11/06/2021 22:57:42 - INFO - __main__ - Step 12396: {'lr': 0.0004939386060898353, 'samples': 2380032, 'steps': 12395, 'loss/train': 2.075618267059326} 11/06/2021 22:57:42 - INFO - __main__ - Step 12397: {'lr': 0.0004939374445571206, 'samples': 2380224, 'steps': 12396, 'loss/train': 1.4473564624786377} 11/06/2021 22:57:43 - INFO - __main__ - Step 12398: {'lr': 0.0004939362829144913, 'samples': 2380416, 'steps': 12397, 'loss/train': 1.165976643562317} 11/06/2021 22:57:44 - INFO - __main__ - Step 12399: {'lr': 0.0004939351211619481, 'samples': 2380608, 'steps': 12398, 'loss/train': 2.2611637115478516} 11/06/2021 22:57:44 - INFO - __main__ - Step 12400: {'lr': 0.0004939339592994916, 'samples': 2380800, 'steps': 12399, 'loss/train': 2.222313165664673} 11/06/2021 22:57:45 - INFO - __main__ - Step 12401: {'lr': 0.0004939327973271222, 'samples': 2380992, 'steps': 12400, 'loss/train': 1.8177175521850586} 11/06/2021 22:57:45 - INFO - __main__ - Step 12402: {'lr': 0.0004939316352448403, 'samples': 2381184, 'steps': 12401, 'loss/train': 1.0791970491409302} 11/06/2021 22:57:45 - INFO - __main__ - Step 12403: {'lr': 0.0004939304730526467, 'samples': 2381376, 'steps': 12402, 'loss/train': 1.8736491203308105} 11/06/2021 22:57:46 - INFO - __main__ - Step 12404: {'lr': 0.0004939293107505418, 'samples': 2381568, 'steps': 12403, 'loss/train': 2.0087807178497314} 11/06/2021 22:57:47 - INFO - __main__ - Step 12405: {'lr': 0.0004939281483385261, 'samples': 2381760, 'steps': 12404, 'loss/train': 1.4684945344924927} 11/06/2021 22:57:47 - INFO - __main__ - Step 12406: {'lr': 0.0004939269858166001, 'samples': 2381952, 'steps': 12405, 'loss/train': 1.8479887247085571} 11/06/2021 22:57:47 - INFO - __main__ - Step 12407: {'lr': 0.0004939258231847644, 'samples': 2382144, 'steps': 12406, 'loss/train': 1.5185550451278687} 11/06/2021 22:57:48 - INFO - __main__ - Step 12408: {'lr': 0.0004939246604430195, 'samples': 2382336, 'steps': 12407, 'loss/train': 1.8069547414779663} 11/06/2021 22:57:48 - INFO - __main__ - Step 12409: {'lr': 0.0004939234975913659, 'samples': 2382528, 'steps': 12408, 'loss/train': 1.5137650966644287} 11/06/2021 22:57:49 - INFO - __main__ - Step 12410: {'lr': 0.0004939223346298042, 'samples': 2382720, 'steps': 12409, 'loss/train': 0.2782944142818451} 11/06/2021 22:57:50 - INFO - __main__ - Step 12411: {'lr': 0.0004939211715583347, 'samples': 2382912, 'steps': 12410, 'loss/train': 1.9376318454742432} 11/06/2021 22:57:50 - INFO - __main__ - Step 12412: {'lr': 0.0004939200083769582, 'samples': 2383104, 'steps': 12411, 'loss/train': 1.265969157218933} 11/06/2021 22:57:50 - INFO - __main__ - Step 12413: {'lr': 0.000493918845085675, 'samples': 2383296, 'steps': 12412, 'loss/train': 1.7577980756759644} 11/06/2021 22:57:51 - INFO - __main__ - Step 12414: {'lr': 0.000493917681684486, 'samples': 2383488, 'steps': 12413, 'loss/train': 0.3489714562892914} 11/06/2021 22:57:52 - INFO - __main__ - Step 12415: {'lr': 0.0004939165181733911, 'samples': 2383680, 'steps': 12414, 'loss/train': 1.9580268859863281} 11/06/2021 22:57:52 - INFO - __main__ - Step 12416: {'lr': 0.0004939153545523914, 'samples': 2383872, 'steps': 12415, 'loss/train': 1.4286139011383057} 11/06/2021 22:57:52 - INFO - __main__ - Step 12417: {'lr': 0.0004939141908214871, 'samples': 2384064, 'steps': 12416, 'loss/train': 1.9150186777114868} 11/06/2021 22:57:53 - INFO - __main__ - Step 12418: {'lr': 0.000493913026980679, 'samples': 2384256, 'steps': 12417, 'loss/train': 1.4130078554153442} 11/06/2021 22:57:53 - INFO - __main__ - Step 12419: {'lr': 0.0004939118630299672, 'samples': 2384448, 'steps': 12418, 'loss/train': 1.799317479133606} 11/06/2021 22:57:54 - INFO - __main__ - Step 12420: {'lr': 0.0004939106989693527, 'samples': 2384640, 'steps': 12419, 'loss/train': 2.154881477355957} 11/06/2021 22:57:55 - INFO - __main__ - Step 12421: {'lr': 0.0004939095347988357, 'samples': 2384832, 'steps': 12420, 'loss/train': 1.9314367771148682} 11/06/2021 22:57:55 - INFO - __main__ - Step 12422: {'lr': 0.0004939083705184169, 'samples': 2385024, 'steps': 12421, 'loss/train': 1.7732396125793457} 11/06/2021 22:57:55 - INFO - __main__ - Step 12423: {'lr': 0.0004939072061280967, 'samples': 2385216, 'steps': 12422, 'loss/train': 1.4663667678833008} 11/06/2021 22:57:56 - INFO - __main__ - Step 12424: {'lr': 0.0004939060416278756, 'samples': 2385408, 'steps': 12423, 'loss/train': 1.8774303197860718} 11/06/2021 22:57:57 - INFO - __main__ - Step 12425: {'lr': 0.0004939048770177543, 'samples': 2385600, 'steps': 12424, 'loss/train': 1.5539430379867554} 11/06/2021 22:57:57 - INFO - __main__ - Step 12426: {'lr': 0.0004939037122977332, 'samples': 2385792, 'steps': 12425, 'loss/train': 1.527437448501587} 11/06/2021 22:57:57 - INFO - __main__ - Step 12427: {'lr': 0.0004939025474678129, 'samples': 2385984, 'steps': 12426, 'loss/train': 1.8269412517547607} 11/06/2021 22:57:58 - INFO - __main__ - Step 12428: {'lr': 0.0004939013825279939, 'samples': 2386176, 'steps': 12427, 'loss/train': 1.5649765729904175} 11/06/2021 22:57:58 - INFO - __main__ - Step 12429: {'lr': 0.0004939002174782766, 'samples': 2386368, 'steps': 12428, 'loss/train': 0.8925514221191406} 11/06/2021 22:57:59 - INFO - __main__ - Step 12430: {'lr': 0.0004938990523186616, 'samples': 2386560, 'steps': 12429, 'loss/train': 0.6605421304702759} 11/06/2021 22:57:59 - INFO - __main__ - Step 12431: {'lr': 0.0004938978870491495, 'samples': 2386752, 'steps': 12430, 'loss/train': 1.8025730848312378} 11/06/2021 22:58:00 - INFO - __main__ - Step 12432: {'lr': 0.0004938967216697409, 'samples': 2386944, 'steps': 12431, 'loss/train': 1.7200829982757568} 11/06/2021 22:58:00 - INFO - __main__ - Step 12433: {'lr': 0.0004938955561804361, 'samples': 2387136, 'steps': 12432, 'loss/train': 1.6927978992462158} 11/06/2021 22:58:00 - INFO - __main__ - Step 12434: {'lr': 0.0004938943905812357, 'samples': 2387328, 'steps': 12433, 'loss/train': 1.6915475130081177} 11/06/2021 22:58:02 - INFO - __main__ - Step 12435: {'lr': 0.0004938932248721401, 'samples': 2387520, 'steps': 12434, 'loss/train': 1.565805196762085} 11/06/2021 22:58:02 - INFO - __main__ - Step 12436: {'lr': 0.0004938920590531503, 'samples': 2387712, 'steps': 12435, 'loss/train': 1.8280435800552368} 11/06/2021 22:58:02 - INFO - __main__ - Step 12437: {'lr': 0.0004938908931242663, 'samples': 2387904, 'steps': 12436, 'loss/train': 1.499329924583435} 11/06/2021 22:58:03 - INFO - __main__ - Step 12438: {'lr': 0.0004938897270854889, 'samples': 2388096, 'steps': 12437, 'loss/train': 0.8522184491157532} 11/06/2021 22:58:03 - INFO - __main__ - Step 12439: {'lr': 0.0004938885609368184, 'samples': 2388288, 'steps': 12438, 'loss/train': 1.0399724245071411} 11/06/2021 22:58:03 - INFO - __main__ - Step 12440: {'lr': 0.0004938873946782557, 'samples': 2388480, 'steps': 12439, 'loss/train': 1.3930256366729736} 11/06/2021 22:58:04 - INFO - __main__ - Step 12441: {'lr': 0.000493886228309801, 'samples': 2388672, 'steps': 12440, 'loss/train': 2.077786922454834} 11/06/2021 22:58:05 - INFO - __main__ - Step 12442: {'lr': 0.0004938850618314549, 'samples': 2388864, 'steps': 12441, 'loss/train': 1.7731544971466064} 11/06/2021 22:58:05 - INFO - __main__ - Step 12443: {'lr': 0.000493883895243218, 'samples': 2389056, 'steps': 12442, 'loss/train': 1.930917501449585} 11/06/2021 22:58:05 - INFO - __main__ - Step 12444: {'lr': 0.0004938827285450908, 'samples': 2389248, 'steps': 12443, 'loss/train': 1.6230531930923462} 11/06/2021 22:58:06 - INFO - __main__ - Step 12445: {'lr': 0.0004938815617370737, 'samples': 2389440, 'steps': 12444, 'loss/train': 1.7806710004806519} 11/06/2021 22:58:07 - INFO - __main__ - Step 12446: {'lr': 0.0004938803948191674, 'samples': 2389632, 'steps': 12445, 'loss/train': 1.8226611614227295} 11/06/2021 22:58:07 - INFO - __main__ - Step 12447: {'lr': 0.0004938792277913724, 'samples': 2389824, 'steps': 12446, 'loss/train': 1.6907552480697632} 11/06/2021 22:58:07 - INFO - __main__ - Step 12448: {'lr': 0.0004938780606536891, 'samples': 2390016, 'steps': 12447, 'loss/train': 1.9399850368499756} 11/06/2021 22:58:08 - INFO - __main__ - Step 12449: {'lr': 0.0004938768934061182, 'samples': 2390208, 'steps': 12448, 'loss/train': 1.9037350416183472} 11/06/2021 22:58:08 - INFO - __main__ - Step 12450: {'lr': 0.0004938757260486601, 'samples': 2390400, 'steps': 12449, 'loss/train': 1.6695767641067505} 11/06/2021 22:58:09 - INFO - __main__ - Step 12451: {'lr': 0.0004938745585813153, 'samples': 2390592, 'steps': 12450, 'loss/train': 1.9863166809082031} 11/06/2021 22:58:09 - INFO - __main__ - Step 12452: {'lr': 0.0004938733910040845, 'samples': 2390784, 'steps': 12451, 'loss/train': 1.697218418121338} 11/06/2021 22:58:10 - INFO - __main__ - Step 12453: {'lr': 0.000493872223316968, 'samples': 2390976, 'steps': 12452, 'loss/train': 1.4199427366256714} 11/06/2021 22:58:10 - INFO - __main__ - Step 12454: {'lr': 0.0004938710555199664, 'samples': 2391168, 'steps': 12453, 'loss/train': 1.2885547876358032} 11/06/2021 22:58:10 - INFO - __main__ - Step 12455: {'lr': 0.0004938698876130804, 'samples': 2391360, 'steps': 12454, 'loss/train': 1.8421812057495117} 11/06/2021 22:58:12 - INFO - __main__ - Step 12456: {'lr': 0.0004938687195963104, 'samples': 2391552, 'steps': 12455, 'loss/train': 1.8967642784118652} 11/06/2021 22:58:12 - INFO - __main__ - Step 12457: {'lr': 0.0004938675514696569, 'samples': 2391744, 'steps': 12456, 'loss/train': 0.5866230726242065} 11/06/2021 22:58:12 - INFO - __main__ - Step 12458: {'lr': 0.0004938663832331204, 'samples': 2391936, 'steps': 12457, 'loss/train': 1.3953553438186646} 11/06/2021 22:58:13 - INFO - __main__ - Step 12459: {'lr': 0.0004938652148867014, 'samples': 2392128, 'steps': 12458, 'loss/train': 0.3347637355327606} 11/06/2021 22:58:13 - INFO - __main__ - Step 12460: {'lr': 0.0004938640464304006, 'samples': 2392320, 'steps': 12459, 'loss/train': 1.7510279417037964} 11/06/2021 22:58:14 - INFO - __main__ - Step 12461: {'lr': 0.0004938628778642185, 'samples': 2392512, 'steps': 12460, 'loss/train': 1.7119464874267578} 11/06/2021 22:58:15 - INFO - __main__ - Step 12462: {'lr': 0.0004938617091881554, 'samples': 2392704, 'steps': 12461, 'loss/train': 1.8808752298355103} 11/06/2021 22:58:15 - INFO - __main__ - Step 12463: {'lr': 0.000493860540402212, 'samples': 2392896, 'steps': 12462, 'loss/train': 1.534140706062317} 11/06/2021 22:58:15 - INFO - __main__ - Step 12464: {'lr': 0.0004938593715063888, 'samples': 2393088, 'steps': 12463, 'loss/train': 1.430334448814392} 11/06/2021 22:58:16 - INFO - __main__ - Step 12465: {'lr': 0.0004938582025006864, 'samples': 2393280, 'steps': 12464, 'loss/train': 1.060693621635437} 11/06/2021 22:58:16 - INFO - __main__ - Step 12466: {'lr': 0.0004938570333851052, 'samples': 2393472, 'steps': 12465, 'loss/train': 1.7933906316757202} 11/06/2021 22:58:17 - INFO - __main__ - Step 12467: {'lr': 0.0004938558641596458, 'samples': 2393664, 'steps': 12466, 'loss/train': 0.8866246938705444} 11/06/2021 22:58:18 - INFO - __main__ - Step 12468: {'lr': 0.0004938546948243087, 'samples': 2393856, 'steps': 12467, 'loss/train': 1.1550602912902832} 11/06/2021 22:58:18 - INFO - __main__ - Step 12469: {'lr': 0.0004938535253790944, 'samples': 2394048, 'steps': 12468, 'loss/train': 1.569968342781067} 11/06/2021 22:58:18 - INFO - __main__ - Step 12470: {'lr': 0.0004938523558240035, 'samples': 2394240, 'steps': 12469, 'loss/train': 1.5518238544464111} 11/06/2021 22:58:19 - INFO - __main__ - Step 12471: {'lr': 0.0004938511861590365, 'samples': 2394432, 'steps': 12470, 'loss/train': 1.680008053779602} 11/06/2021 22:58:20 - INFO - __main__ - Step 12472: {'lr': 0.000493850016384194, 'samples': 2394624, 'steps': 12471, 'loss/train': 1.5120447874069214} 11/06/2021 22:58:20 - INFO - __main__ - Step 12473: {'lr': 0.0004938488464994764, 'samples': 2394816, 'steps': 12472, 'loss/train': 1.2448487281799316} 11/06/2021 22:58:20 - INFO - __main__ - Step 12474: {'lr': 0.0004938476765048842, 'samples': 2395008, 'steps': 12473, 'loss/train': 1.75505530834198} 11/06/2021 22:58:21 - INFO - __main__ - Step 12475: {'lr': 0.0004938465064004181, 'samples': 2395200, 'steps': 12474, 'loss/train': 1.4993715286254883} 11/06/2021 22:58:21 - INFO - __main__ - Step 12476: {'lr': 0.0004938453361860785, 'samples': 2395392, 'steps': 12475, 'loss/train': 1.6002764701843262} 11/06/2021 22:58:22 - INFO - __main__ - Step 12477: {'lr': 0.0004938441658618659, 'samples': 2395584, 'steps': 12476, 'loss/train': 1.6196297407150269} 11/06/2021 22:58:22 - INFO - __main__ - Step 12478: {'lr': 0.0004938429954277809, 'samples': 2395776, 'steps': 12477, 'loss/train': 1.990778923034668} 11/06/2021 22:58:23 - INFO - __main__ - Step 12479: {'lr': 0.000493841824883824, 'samples': 2395968, 'steps': 12478, 'loss/train': 1.7635631561279297} 11/06/2021 22:58:23 - INFO - __main__ - Step 12480: {'lr': 0.0004938406542299956, 'samples': 2396160, 'steps': 12479, 'loss/train': 1.3563812971115112} 11/06/2021 22:58:23 - INFO - __main__ - Step 12481: {'lr': 0.0004938394834662966, 'samples': 2396352, 'steps': 12480, 'loss/train': 0.9850826859474182} 11/06/2021 22:58:24 - INFO - __main__ - Step 12482: {'lr': 0.0004938383125927272, 'samples': 2396544, 'steps': 12481, 'loss/train': 1.9261223077774048} 11/06/2021 22:58:25 - INFO - __main__ - Step 12483: {'lr': 0.0004938371416092881, 'samples': 2396736, 'steps': 12482, 'loss/train': 1.8350012302398682} 11/06/2021 22:58:25 - INFO - __main__ - Step 12484: {'lr': 0.0004938359705159796, 'samples': 2396928, 'steps': 12483, 'loss/train': 1.5597666501998901} 11/06/2021 22:58:25 - INFO - __main__ - Step 12485: {'lr': 0.0004938347993128025, 'samples': 2397120, 'steps': 12484, 'loss/train': 1.588280439376831} 11/06/2021 22:58:26 - INFO - __main__ - Step 12486: {'lr': 0.0004938336279997571, 'samples': 2397312, 'steps': 12485, 'loss/train': 1.8706063032150269} 11/06/2021 22:58:27 - INFO - __main__ - Step 12487: {'lr': 0.0004938324565768441, 'samples': 2397504, 'steps': 12486, 'loss/train': 1.8012641668319702} 11/06/2021 22:58:27 - INFO - __main__ - Step 12488: {'lr': 0.0004938312850440639, 'samples': 2397696, 'steps': 12487, 'loss/train': 0.9766202569007874} 11/06/2021 22:58:28 - INFO - __main__ - Step 12489: {'lr': 0.0004938301134014172, 'samples': 2397888, 'steps': 12488, 'loss/train': 1.6374305486679077} 11/06/2021 22:58:28 - INFO - __main__ - Step 12490: {'lr': 0.0004938289416489042, 'samples': 2398080, 'steps': 12489, 'loss/train': 2.0924293994903564} 11/06/2021 22:58:28 - INFO - __main__ - Step 12491: {'lr': 0.0004938277697865259, 'samples': 2398272, 'steps': 12490, 'loss/train': 1.4682444334030151} 11/06/2021 22:58:29 - INFO - __main__ - Step 12492: {'lr': 0.0004938265978142824, 'samples': 2398464, 'steps': 12491, 'loss/train': 2.014890193939209} 11/06/2021 22:58:30 - INFO - __main__ - Step 12493: {'lr': 0.0004938254257321745, 'samples': 2398656, 'steps': 12492, 'loss/train': 1.3816254138946533} 11/06/2021 22:58:30 - INFO - __main__ - Step 12494: {'lr': 0.0004938242535402025, 'samples': 2398848, 'steps': 12493, 'loss/train': 1.4478893280029297} 11/06/2021 22:58:30 - INFO - __main__ - Step 12495: {'lr': 0.0004938230812383672, 'samples': 2399040, 'steps': 12494, 'loss/train': 4.631295680999756} 11/06/2021 22:58:31 - INFO - __main__ - Step 12496: {'lr': 0.0004938219088266688, 'samples': 2399232, 'steps': 12495, 'loss/train': 1.651924729347229} 11/06/2021 22:58:31 - INFO - __main__ - Step 12497: {'lr': 0.0004938207363051082, 'samples': 2399424, 'steps': 12496, 'loss/train': 1.7738444805145264} 11/06/2021 22:58:32 - INFO - __main__ - Step 12498: {'lr': 0.0004938195636736857, 'samples': 2399616, 'steps': 12497, 'loss/train': 1.413539171218872} 11/06/2021 22:58:33 - INFO - __main__ - Step 12499: {'lr': 0.0004938183909324017, 'samples': 2399808, 'steps': 12498, 'loss/train': 1.9373109340667725} 11/06/2021 22:58:33 - INFO - __main__ - Step 12500: {'lr': 0.0004938172180812571, 'samples': 2400000, 'steps': 12499, 'loss/train': 2.0888149738311768} 11/06/2021 22:58:34 - INFO - __main__ - Step 12501: {'lr': 0.000493816045120252, 'samples': 2400192, 'steps': 12500, 'loss/train': 1.6998056173324585} 11/06/2021 22:58:34 - INFO - __main__ - Step 12502: {'lr': 0.0004938148720493873, 'samples': 2400384, 'steps': 12501, 'loss/train': 1.5378485918045044} 11/06/2021 22:58:35 - INFO - __main__ - Step 12503: {'lr': 0.0004938136988686634, 'samples': 2400576, 'steps': 12502, 'loss/train': 0.36686891317367554} 11/06/2021 22:58:35 - INFO - __main__ - Step 12504: {'lr': 0.0004938125255780808, 'samples': 2400768, 'steps': 12503, 'loss/train': 1.6974081993103027} 11/06/2021 22:58:36 - INFO - __main__ - Step 12505: {'lr': 0.0004938113521776401, 'samples': 2400960, 'steps': 12504, 'loss/train': 1.7452573776245117} 11/06/2021 22:58:36 - INFO - __main__ - Step 12506: {'lr': 0.0004938101786673416, 'samples': 2401152, 'steps': 12505, 'loss/train': 1.861094355583191} 11/06/2021 22:58:36 - INFO - __main__ - Step 12507: {'lr': 0.0004938090050471861, 'samples': 2401344, 'steps': 12506, 'loss/train': 1.3893704414367676} 11/06/2021 22:58:37 - INFO - __main__ - Step 12508: {'lr': 0.000493807831317174, 'samples': 2401536, 'steps': 12507, 'loss/train': 1.7757277488708496} 11/06/2021 22:58:38 - INFO - __main__ - Step 12509: {'lr': 0.0004938066574773058, 'samples': 2401728, 'steps': 12508, 'loss/train': 2.139897346496582} 11/06/2021 22:58:38 - INFO - __main__ - Step 12510: {'lr': 0.0004938054835275822, 'samples': 2401920, 'steps': 12509, 'loss/train': 1.3797563314437866} 11/06/2021 22:58:38 - INFO - __main__ - Step 12511: {'lr': 0.0004938043094680036, 'samples': 2402112, 'steps': 12510, 'loss/train': 1.6733269691467285} 11/06/2021 22:58:39 - INFO - __main__ - Step 12512: {'lr': 0.0004938031352985704, 'samples': 2402304, 'steps': 12511, 'loss/train': 1.8873792886734009} 11/06/2021 22:58:40 - INFO - __main__ - Step 12513: {'lr': 0.0004938019610192835, 'samples': 2402496, 'steps': 12512, 'loss/train': 2.266671657562256} 11/06/2021 22:58:40 - INFO - __main__ - Step 12514: {'lr': 0.0004938007866301429, 'samples': 2402688, 'steps': 12513, 'loss/train': 1.8477656841278076} 11/06/2021 22:58:40 - INFO - __main__ - Step 12515: {'lr': 0.0004937996121311496, 'samples': 2402880, 'steps': 12514, 'loss/train': 1.2845678329467773} 11/06/2021 22:58:41 - INFO - __main__ - Step 12516: {'lr': 0.000493798437522304, 'samples': 2403072, 'steps': 12515, 'loss/train': 1.3590694665908813} 11/06/2021 22:58:41 - INFO - __main__ - Step 12517: {'lr': 0.0004937972628036065, 'samples': 2403264, 'steps': 12516, 'loss/train': 1.4171817302703857} 11/06/2021 22:58:42 - INFO - __main__ - Step 12518: {'lr': 0.0004937960879750578, 'samples': 2403456, 'steps': 12517, 'loss/train': 1.5596864223480225} 11/06/2021 22:58:43 - INFO - __main__ - Step 12519: {'lr': 0.0004937949130366582, 'samples': 2403648, 'steps': 12518, 'loss/train': 1.6341605186462402} 11/06/2021 22:58:43 - INFO - __main__ - Step 12520: {'lr': 0.0004937937379884085, 'samples': 2403840, 'steps': 12519, 'loss/train': 1.1687759160995483} 11/06/2021 22:58:43 - INFO - __main__ - Step 12521: {'lr': 0.0004937925628303091, 'samples': 2404032, 'steps': 12520, 'loss/train': 1.7421510219573975} 11/06/2021 22:58:44 - INFO - __main__ - Step 12522: {'lr': 0.0004937913875623605, 'samples': 2404224, 'steps': 12521, 'loss/train': 1.12235426902771} 11/06/2021 22:58:44 - INFO - __main__ - Step 12523: {'lr': 0.0004937902121845633, 'samples': 2404416, 'steps': 12522, 'loss/train': 1.6094740629196167} 11/06/2021 22:58:45 - INFO - __main__ - Step 12524: {'lr': 0.000493789036696918, 'samples': 2404608, 'steps': 12523, 'loss/train': 1.749121069908142} 11/06/2021 22:58:45 - INFO - __main__ - Step 12525: {'lr': 0.000493787861099425, 'samples': 2404800, 'steps': 12524, 'loss/train': 1.0136215686798096} 11/06/2021 22:58:46 - INFO - __main__ - Step 12526: {'lr': 0.0004937866853920851, 'samples': 2404992, 'steps': 12525, 'loss/train': 1.7210983037948608} 11/06/2021 22:58:46 - INFO - __main__ - Step 12527: {'lr': 0.0004937855095748985, 'samples': 2405184, 'steps': 12526, 'loss/train': 1.8808726072311401} 11/06/2021 22:58:46 - INFO - __main__ - Step 12528: {'lr': 0.0004937843336478661, 'samples': 2405376, 'steps': 12527, 'loss/train': 1.6538246870040894} 11/06/2021 22:58:48 - INFO - __main__ - Step 12529: {'lr': 0.0004937831576109881, 'samples': 2405568, 'steps': 12528, 'loss/train': 1.4507884979248047} 11/06/2021 22:58:48 - INFO - __main__ - Step 12530: {'lr': 0.0004937819814642653, 'samples': 2405760, 'steps': 12529, 'loss/train': 1.7369753122329712} 11/06/2021 22:58:49 - INFO - __main__ - Step 12531: {'lr': 0.000493780805207698, 'samples': 2405952, 'steps': 12530, 'loss/train': 1.33586585521698} 11/06/2021 22:58:49 - INFO - __main__ - Step 12532: {'lr': 0.000493779628841287, 'samples': 2406144, 'steps': 12531, 'loss/train': 2.222987413406372} 11/06/2021 22:58:49 - INFO - __main__ - Step 12533: {'lr': 0.0004937784523650324, 'samples': 2406336, 'steps': 12532, 'loss/train': 0.8576833605766296} 11/06/2021 22:58:50 - INFO - __main__ - Step 12534: {'lr': 0.0004937772757789352, 'samples': 2406528, 'steps': 12533, 'loss/train': 0.9707418084144592} 11/06/2021 22:58:51 - INFO - __main__ - Step 12535: {'lr': 0.0004937760990829956, 'samples': 2406720, 'steps': 12534, 'loss/train': 1.9520204067230225} 11/06/2021 22:58:51 - INFO - __main__ - Step 12536: {'lr': 0.0004937749222772143, 'samples': 2406912, 'steps': 12535, 'loss/train': 1.7065337896347046} 11/06/2021 22:58:51 - INFO - __main__ - Step 12537: {'lr': 0.0004937737453615918, 'samples': 2407104, 'steps': 12536, 'loss/train': 1.8596389293670654} 11/06/2021 22:58:52 - INFO - __main__ - Step 12538: {'lr': 0.0004937725683361286, 'samples': 2407296, 'steps': 12537, 'loss/train': 1.6391980648040771} 11/06/2021 22:58:52 - INFO - __main__ - Step 12539: {'lr': 0.0004937713912008252, 'samples': 2407488, 'steps': 12538, 'loss/train': 0.9928711652755737} 11/06/2021 22:58:53 - INFO - __main__ - Step 12540: {'lr': 0.0004937702139556822, 'samples': 2407680, 'steps': 12539, 'loss/train': 2.0436432361602783} 11/06/2021 22:58:54 - INFO - __main__ - Step 12541: {'lr': 0.0004937690366007, 'samples': 2407872, 'steps': 12540, 'loss/train': 1.766875982284546} 11/06/2021 22:58:54 - INFO - __main__ - Step 12542: {'lr': 0.0004937678591358794, 'samples': 2408064, 'steps': 12541, 'loss/train': 1.9210069179534912} 11/06/2021 22:58:54 - INFO - __main__ - Step 12543: {'lr': 0.0004937666815612207, 'samples': 2408256, 'steps': 12542, 'loss/train': 1.523587703704834} 11/06/2021 22:58:55 - INFO - __main__ - Step 12544: {'lr': 0.0004937655038767245, 'samples': 2408448, 'steps': 12543, 'loss/train': 1.4020729064941406} 11/06/2021 22:58:56 - INFO - __main__ - Step 12545: {'lr': 0.0004937643260823914, 'samples': 2408640, 'steps': 12544, 'loss/train': 1.2543262243270874} 11/06/2021 22:58:56 - INFO - __main__ - Step 12546: {'lr': 0.0004937631481782218, 'samples': 2408832, 'steps': 12545, 'loss/train': 1.850257396697998} 11/06/2021 22:58:56 - INFO - __main__ - Step 12547: {'lr': 0.0004937619701642162, 'samples': 2409024, 'steps': 12546, 'loss/train': 2.321528434753418} 11/06/2021 22:58:57 - INFO - __main__ - Step 12548: {'lr': 0.0004937607920403752, 'samples': 2409216, 'steps': 12547, 'loss/train': 2.2833588123321533} 11/06/2021 22:58:57 - INFO - __main__ - Step 12549: {'lr': 0.0004937596138066996, 'samples': 2409408, 'steps': 12548, 'loss/train': 1.803352952003479} 11/06/2021 22:58:58 - INFO - __main__ - Step 12550: {'lr': 0.0004937584354631894, 'samples': 2409600, 'steps': 12549, 'loss/train': 2.182974100112915} 11/06/2021 22:58:58 - INFO - __main__ - Step 12551: {'lr': 0.0004937572570098455, 'samples': 2409792, 'steps': 12550, 'loss/train': 1.5153863430023193} 11/06/2021 22:58:59 - INFO - __main__ - Step 12552: {'lr': 0.0004937560784466685, 'samples': 2409984, 'steps': 12551, 'loss/train': 1.986527681350708} 11/06/2021 22:58:59 - INFO - __main__ - Step 12553: {'lr': 0.0004937548997736586, 'samples': 2410176, 'steps': 12552, 'loss/train': 1.3949472904205322} 11/06/2021 22:58:59 - INFO - __main__ - Step 12554: {'lr': 0.0004937537209908165, 'samples': 2410368, 'steps': 12553, 'loss/train': 1.5035032033920288} 11/06/2021 22:59:00 - INFO - __main__ - Step 12555: {'lr': 0.0004937525420981428, 'samples': 2410560, 'steps': 12554, 'loss/train': 2.261537790298462} 11/06/2021 22:59:01 - INFO - __main__ - Step 12556: {'lr': 0.0004937513630956379, 'samples': 2410752, 'steps': 12555, 'loss/train': 3.3203577995300293} 11/06/2021 22:59:01 - INFO - __main__ - Step 12557: {'lr': 0.0004937501839833024, 'samples': 2410944, 'steps': 12556, 'loss/train': 1.5435712337493896} 11/06/2021 22:59:01 - INFO - __main__ - Step 12558: {'lr': 0.0004937490047611369, 'samples': 2411136, 'steps': 12557, 'loss/train': 1.9833227396011353} 11/06/2021 22:59:02 - INFO - __main__ - Step 12559: {'lr': 0.0004937478254291418, 'samples': 2411328, 'steps': 12558, 'loss/train': 1.6904947757720947} 11/06/2021 22:59:02 - INFO - __main__ - Step 12560: {'lr': 0.0004937466459873178, 'samples': 2411520, 'steps': 12559, 'loss/train': 1.6554052829742432} 11/06/2021 22:59:03 - INFO - __main__ - Step 12561: {'lr': 0.0004937454664356652, 'samples': 2411712, 'steps': 12560, 'loss/train': 1.7198843955993652} 11/06/2021 22:59:04 - INFO - __main__ - Step 12562: {'lr': 0.0004937442867741848, 'samples': 2411904, 'steps': 12561, 'loss/train': 1.9236637353897095} 11/06/2021 22:59:04 - INFO - __main__ - Step 12563: {'lr': 0.0004937431070028768, 'samples': 2412096, 'steps': 12562, 'loss/train': 1.9572961330413818} 11/06/2021 22:59:04 - INFO - __main__ - Step 12564: {'lr': 0.0004937419271217419, 'samples': 2412288, 'steps': 12563, 'loss/train': 5.857589244842529} 11/06/2021 22:59:05 - INFO - __main__ - Step 12565: {'lr': 0.0004937407471307807, 'samples': 2412480, 'steps': 12564, 'loss/train': 1.5907723903656006} 11/06/2021 22:59:05 - INFO - __main__ - Step 12566: {'lr': 0.0004937395670299938, 'samples': 2412672, 'steps': 12565, 'loss/train': 1.3296524286270142} 11/06/2021 22:59:06 - INFO - __main__ - Step 12567: {'lr': 0.0004937383868193815, 'samples': 2412864, 'steps': 12566, 'loss/train': 1.7416648864746094} 11/06/2021 22:59:06 - INFO - __main__ - Step 12568: {'lr': 0.0004937372064989445, 'samples': 2413056, 'steps': 12567, 'loss/train': 1.3068746328353882} 11/06/2021 22:59:07 - INFO - __main__ - Step 12569: {'lr': 0.0004937360260686833, 'samples': 2413248, 'steps': 12568, 'loss/train': 1.8562533855438232} 11/06/2021 22:59:07 - INFO - __main__ - Step 12570: {'lr': 0.0004937348455285983, 'samples': 2413440, 'steps': 12569, 'loss/train': 1.5558427572250366} 11/06/2021 22:59:08 - INFO - __main__ - Step 12571: {'lr': 0.0004937336648786903, 'samples': 2413632, 'steps': 12570, 'loss/train': 1.8795865774154663} 11/06/2021 22:59:08 - INFO - __main__ - Step 12572: {'lr': 0.0004937324841189595, 'samples': 2413824, 'steps': 12571, 'loss/train': 1.9527837038040161} 11/06/2021 22:59:09 - INFO - __main__ - Step 12573: {'lr': 0.0004937313032494068, 'samples': 2414016, 'steps': 12572, 'loss/train': 1.613889217376709} 11/06/2021 22:59:09 - INFO - __main__ - Step 12574: {'lr': 0.0004937301222700324, 'samples': 2414208, 'steps': 12573, 'loss/train': 1.2357040643692017} 11/06/2021 22:59:10 - INFO - __main__ - Step 12575: {'lr': 0.0004937289411808369, 'samples': 2414400, 'steps': 12574, 'loss/train': 2.0651497840881348} 11/06/2021 22:59:10 - INFO - __main__ - Step 12576: {'lr': 0.000493727759981821, 'samples': 2414592, 'steps': 12575, 'loss/train': 1.5525745153427124} 11/06/2021 22:59:11 - INFO - __main__ - Step 12577: {'lr': 0.0004937265786729851, 'samples': 2414784, 'steps': 12576, 'loss/train': 0.8309584856033325} 11/06/2021 22:59:11 - INFO - __main__ - Step 12578: {'lr': 0.0004937253972543298, 'samples': 2414976, 'steps': 12577, 'loss/train': 2.5759201049804688} 11/06/2021 22:59:12 - INFO - __main__ - Step 12579: {'lr': 0.0004937242157258555, 'samples': 2415168, 'steps': 12578, 'loss/train': 2.0868546962738037} 11/06/2021 22:59:12 - INFO - __main__ - Step 12580: {'lr': 0.000493723034087563, 'samples': 2415360, 'steps': 12579, 'loss/train': 0.8102706074714661} 11/06/2021 22:59:12 - INFO - __main__ - Step 12581: {'lr': 0.0004937218523394525, 'samples': 2415552, 'steps': 12580, 'loss/train': 1.7701175212860107} 11/06/2021 22:59:13 - INFO - __main__ - Step 12582: {'lr': 0.0004937206704815248, 'samples': 2415744, 'steps': 12581, 'loss/train': 1.8174623250961304} 11/06/2021 22:59:14 - INFO - __main__ - Step 12583: {'lr': 0.0004937194885137803, 'samples': 2415936, 'steps': 12582, 'loss/train': 1.7041975259780884} 11/06/2021 22:59:14 - INFO - __main__ - Step 12584: {'lr': 0.0004937183064362196, 'samples': 2416128, 'steps': 12583, 'loss/train': 1.7389198541641235} 11/06/2021 22:59:14 - INFO - __main__ - Step 12585: {'lr': 0.0004937171242488431, 'samples': 2416320, 'steps': 12584, 'loss/train': 2.084197521209717} 11/06/2021 22:59:15 - INFO - __main__ - Step 12586: {'lr': 0.0004937159419516515, 'samples': 2416512, 'steps': 12585, 'loss/train': 1.9468644857406616} 11/06/2021 22:59:15 - INFO - __main__ - Step 12587: {'lr': 0.0004937147595446452, 'samples': 2416704, 'steps': 12586, 'loss/train': 1.4281747341156006} 11/06/2021 22:59:16 - INFO - __main__ - Step 12588: {'lr': 0.0004937135770278248, 'samples': 2416896, 'steps': 12587, 'loss/train': 1.688107967376709} 11/06/2021 22:59:17 - INFO - __main__ - Step 12589: {'lr': 0.0004937123944011908, 'samples': 2417088, 'steps': 12588, 'loss/train': 1.6565439701080322} 11/06/2021 22:59:17 - INFO - __main__ - Step 12590: {'lr': 0.0004937112116647439, 'samples': 2417280, 'steps': 12589, 'loss/train': 1.904050588607788} 11/06/2021 22:59:17 - INFO - __main__ - Step 12591: {'lr': 0.0004937100288184843, 'samples': 2417472, 'steps': 12590, 'loss/train': 1.9118624925613403} 11/06/2021 22:59:18 - INFO - __main__ - Step 12592: {'lr': 0.0004937088458624128, 'samples': 2417664, 'steps': 12591, 'loss/train': 1.8203307390213013} 11/06/2021 22:59:19 - INFO - __main__ - Step 12593: {'lr': 0.0004937076627965299, 'samples': 2417856, 'steps': 12592, 'loss/train': 2.2025625705718994} 11/06/2021 22:59:19 - INFO - __main__ - Step 12594: {'lr': 0.000493706479620836, 'samples': 2418048, 'steps': 12593, 'loss/train': 1.668437123298645} 11/06/2021 22:59:19 - INFO - __main__ - Step 12595: {'lr': 0.0004937052963353318, 'samples': 2418240, 'steps': 12594, 'loss/train': 0.7377382516860962} 11/06/2021 22:59:20 - INFO - __main__ - Step 12596: {'lr': 0.0004937041129400177, 'samples': 2418432, 'steps': 12595, 'loss/train': 2.0309805870056152} 11/06/2021 22:59:20 - INFO - __main__ - Step 12597: {'lr': 0.0004937029294348943, 'samples': 2418624, 'steps': 12596, 'loss/train': 2.111126661300659} 11/06/2021 22:59:21 - INFO - __main__ - Step 12598: {'lr': 0.0004937017458199621, 'samples': 2418816, 'steps': 12597, 'loss/train': 1.5420552492141724} 11/06/2021 22:59:21 - INFO - __main__ - Step 12599: {'lr': 0.0004937005620952217, 'samples': 2419008, 'steps': 12598, 'loss/train': 1.6897568702697754} 11/06/2021 22:59:22 - INFO - __main__ - Step 12600: {'lr': 0.0004936993782606735, 'samples': 2419200, 'steps': 12599, 'loss/train': 2.092362880706787} 11/06/2021 22:59:22 - INFO - __main__ - Step 12601: {'lr': 0.0004936981943163182, 'samples': 2419392, 'steps': 12600, 'loss/train': 1.7093287706375122} 11/06/2021 22:59:22 - INFO - __main__ - Step 12602: {'lr': 0.0004936970102621563, 'samples': 2419584, 'steps': 12601, 'loss/train': 1.4950854778289795} 11/06/2021 22:59:23 - INFO - __main__ - Step 12603: {'lr': 0.0004936958260981883, 'samples': 2419776, 'steps': 12602, 'loss/train': 1.867283582687378} 11/06/2021 22:59:24 - INFO - __main__ - Step 12604: {'lr': 0.0004936946418244146, 'samples': 2419968, 'steps': 12603, 'loss/train': 1.928859829902649} 11/06/2021 22:59:24 - INFO - __main__ - Step 12605: {'lr': 0.000493693457440836, 'samples': 2420160, 'steps': 12604, 'loss/train': 2.0806899070739746} 11/06/2021 22:59:25 - INFO - __main__ - Step 12606: {'lr': 0.0004936922729474526, 'samples': 2420352, 'steps': 12605, 'loss/train': 1.991079330444336} 11/06/2021 22:59:25 - INFO - __main__ - Step 12607: {'lr': 0.0004936910883442655, 'samples': 2420544, 'steps': 12606, 'loss/train': 1.499205231666565} 11/06/2021 22:59:25 - INFO - __main__ - Step 12608: {'lr': 0.0004936899036312749, 'samples': 2420736, 'steps': 12607, 'loss/train': 1.6349520683288574} 11/06/2021 22:59:26 - INFO - __main__ - Step 12609: {'lr': 0.0004936887188084813, 'samples': 2420928, 'steps': 12608, 'loss/train': 1.5434484481811523} 11/06/2021 22:59:27 - INFO - __main__ - Step 12610: {'lr': 0.0004936875338758855, 'samples': 2421120, 'steps': 12609, 'loss/train': 1.8176034688949585} 11/06/2021 22:59:27 - INFO - __main__ - Step 12611: {'lr': 0.0004936863488334877, 'samples': 2421312, 'steps': 12610, 'loss/train': 1.5530451536178589} 11/06/2021 22:59:27 - INFO - __main__ - Step 12612: {'lr': 0.0004936851636812886, 'samples': 2421504, 'steps': 12611, 'loss/train': 1.6083686351776123} 11/06/2021 22:59:28 - INFO - __main__ - Step 12613: {'lr': 0.0004936839784192888, 'samples': 2421696, 'steps': 12612, 'loss/train': 1.6676959991455078} 11/06/2021 22:59:29 - INFO - __main__ - Step 12614: {'lr': 0.0004936827930474887, 'samples': 2421888, 'steps': 12613, 'loss/train': 1.6996138095855713} 11/06/2021 22:59:29 - INFO - __main__ - Step 12615: {'lr': 0.0004936816075658889, 'samples': 2422080, 'steps': 12614, 'loss/train': 1.4866139888763428} 11/06/2021 22:59:30 - INFO - __main__ - Step 12616: {'lr': 0.00049368042197449, 'samples': 2422272, 'steps': 12615, 'loss/train': 1.4212651252746582} 11/06/2021 22:59:30 - INFO - __main__ - Step 12617: {'lr': 0.0004936792362732924, 'samples': 2422464, 'steps': 12616, 'loss/train': 1.7673671245574951} 11/06/2021 22:59:30 - INFO - __main__ - Step 12618: {'lr': 0.0004936780504622967, 'samples': 2422656, 'steps': 12617, 'loss/train': 0.7368472218513489} 11/06/2021 22:59:31 - INFO - __main__ - Step 12619: {'lr': 0.0004936768645415033, 'samples': 2422848, 'steps': 12618, 'loss/train': 1.724778652191162} 11/06/2021 22:59:32 - INFO - __main__ - Step 12620: {'lr': 0.0004936756785109131, 'samples': 2423040, 'steps': 12619, 'loss/train': 2.020928382873535} 11/06/2021 22:59:32 - INFO - __main__ - Step 12621: {'lr': 0.0004936744923705263, 'samples': 2423232, 'steps': 12620, 'loss/train': 1.8747974634170532} 11/06/2021 22:59:32 - INFO - __main__ - Step 12622: {'lr': 0.0004936733061203435, 'samples': 2423424, 'steps': 12621, 'loss/train': 1.8702856302261353} 11/06/2021 22:59:33 - INFO - __main__ - Step 12623: {'lr': 0.0004936721197603653, 'samples': 2423616, 'steps': 12622, 'loss/train': 2.058539628982544} 11/06/2021 22:59:34 - INFO - __main__ - Step 12624: {'lr': 0.0004936709332905923, 'samples': 2423808, 'steps': 12623, 'loss/train': 1.5451576709747314} 11/06/2021 22:59:34 - INFO - __main__ - Step 12625: {'lr': 0.0004936697467110248, 'samples': 2424000, 'steps': 12624, 'loss/train': 1.5256503820419312} 11/06/2021 22:59:34 - INFO - __main__ - Step 12626: {'lr': 0.0004936685600216635, 'samples': 2424192, 'steps': 12625, 'loss/train': 1.8777118921279907} 11/06/2021 22:59:35 - INFO - __main__ - Step 12627: {'lr': 0.0004936673732225088, 'samples': 2424384, 'steps': 12626, 'loss/train': 1.737566590309143} 11/06/2021 22:59:35 - INFO - __main__ - Step 12628: {'lr': 0.0004936661863135615, 'samples': 2424576, 'steps': 12627, 'loss/train': 1.7391878366470337} 11/06/2021 22:59:35 - INFO - __main__ - Step 12629: {'lr': 0.000493664999294822, 'samples': 2424768, 'steps': 12628, 'loss/train': 1.8638137578964233} 11/06/2021 22:59:37 - INFO - __main__ - Step 12630: {'lr': 0.0004936638121662908, 'samples': 2424960, 'steps': 12629, 'loss/train': 0.9604647159576416} 11/06/2021 22:59:37 - INFO - __main__ - Step 12631: {'lr': 0.0004936626249279683, 'samples': 2425152, 'steps': 12630, 'loss/train': 1.9379112720489502} 11/06/2021 22:59:37 - INFO - __main__ - Step 12632: {'lr': 0.0004936614375798553, 'samples': 2425344, 'steps': 12631, 'loss/train': 0.7208963632583618} 11/06/2021 22:59:38 - INFO - __main__ - Step 12633: {'lr': 0.0004936602501219522, 'samples': 2425536, 'steps': 12632, 'loss/train': 1.7763961553573608} 11/06/2021 22:59:38 - INFO - __main__ - Step 12634: {'lr': 0.0004936590625542595, 'samples': 2425728, 'steps': 12633, 'loss/train': 1.4927427768707275} 11/06/2021 22:59:39 - INFO - __main__ - Step 12635: {'lr': 0.0004936578748767779, 'samples': 2425920, 'steps': 12634, 'loss/train': 1.5714725255966187} 11/06/2021 22:59:40 - INFO - __main__ - Step 12636: {'lr': 0.0004936566870895078, 'samples': 2426112, 'steps': 12635, 'loss/train': 1.6926263570785522} 11/06/2021 22:59:40 - INFO - __main__ - Step 12637: {'lr': 0.0004936554991924496, 'samples': 2426304, 'steps': 12636, 'loss/train': 0.9753442406654358} 11/06/2021 22:59:40 - INFO - __main__ - Step 12638: {'lr': 0.0004936543111856041, 'samples': 2426496, 'steps': 12637, 'loss/train': 1.6684590578079224} 11/06/2021 22:59:41 - INFO - __main__ - Step 12639: {'lr': 0.0004936531230689717, 'samples': 2426688, 'steps': 12638, 'loss/train': 1.6691287755966187} 11/06/2021 22:59:42 - INFO - __main__ - Step 12640: {'lr': 0.000493651934842553, 'samples': 2426880, 'steps': 12639, 'loss/train': 1.628665566444397} 11/06/2021 22:59:42 - INFO - __main__ - Step 12641: {'lr': 0.0004936507465063486, 'samples': 2427072, 'steps': 12640, 'loss/train': 1.266121745109558} 11/06/2021 22:59:42 - INFO - __main__ - Step 12642: {'lr': 0.0004936495580603588, 'samples': 2427264, 'steps': 12641, 'loss/train': 1.7384616136550903} 11/06/2021 22:59:43 - INFO - __main__ - Step 12643: {'lr': 0.0004936483695045842, 'samples': 2427456, 'steps': 12642, 'loss/train': 1.6067787408828735} 11/06/2021 22:59:43 - INFO - __main__ - Step 12644: {'lr': 0.0004936471808390254, 'samples': 2427648, 'steps': 12643, 'loss/train': 1.7227269411087036} 11/06/2021 22:59:44 - INFO - __main__ - Step 12645: {'lr': 0.0004936459920636832, 'samples': 2427840, 'steps': 12644, 'loss/train': 1.4939666986465454} 11/06/2021 22:59:44 - INFO - __main__ - Step 12646: {'lr': 0.0004936448031785576, 'samples': 2428032, 'steps': 12645, 'loss/train': 1.5351349115371704} 11/06/2021 22:59:45 - INFO - __main__ - Step 12647: {'lr': 0.0004936436141836496, 'samples': 2428224, 'steps': 12646, 'loss/train': 1.6882507801055908} 11/06/2021 22:59:45 - INFO - __main__ - Step 12648: {'lr': 0.0004936424250789594, 'samples': 2428416, 'steps': 12647, 'loss/train': 2.1467108726501465} 11/06/2021 22:59:45 - INFO - __main__ - Step 12649: {'lr': 0.0004936412358644878, 'samples': 2428608, 'steps': 12648, 'loss/train': 1.2412645816802979} 11/06/2021 22:59:47 - INFO - __main__ - Step 12650: {'lr': 0.0004936400465402351, 'samples': 2428800, 'steps': 12649, 'loss/train': 1.8610800504684448} 11/06/2021 22:59:47 - INFO - __main__ - Step 12651: {'lr': 0.0004936388571062021, 'samples': 2428992, 'steps': 12650, 'loss/train': 0.7149181365966797} 11/06/2021 22:59:47 - INFO - __main__ - Step 12652: {'lr': 0.0004936376675623892, 'samples': 2429184, 'steps': 12651, 'loss/train': 1.40529203414917} 11/06/2021 22:59:48 - INFO - __main__ - Step 12653: {'lr': 0.0004936364779087967, 'samples': 2429376, 'steps': 12652, 'loss/train': 1.6117287874221802} 11/06/2021 22:59:48 - INFO - __main__ - Step 12654: {'lr': 0.0004936352881454256, 'samples': 2429568, 'steps': 12653, 'loss/train': 1.6259452104568481} 11/06/2021 22:59:49 - INFO - __main__ - Step 12655: {'lr': 0.000493634098272276, 'samples': 2429760, 'steps': 12654, 'loss/train': 0.9939618706703186} 11/06/2021 22:59:49 - INFO - __main__ - Step 12656: {'lr': 0.0004936329082893488, 'samples': 2429952, 'steps': 12655, 'loss/train': 1.6605446338653564} 11/06/2021 22:59:50 - INFO - __main__ - Step 12657: {'lr': 0.0004936317181966443, 'samples': 2430144, 'steps': 12656, 'loss/train': 1.8564437627792358} 11/06/2021 22:59:50 - INFO - __main__ - Step 12658: {'lr': 0.000493630527994163, 'samples': 2430336, 'steps': 12657, 'loss/train': 1.9009116888046265} 11/06/2021 22:59:50 - INFO - __main__ - Step 12659: {'lr': 0.0004936293376819058, 'samples': 2430528, 'steps': 12658, 'loss/train': 1.7733248472213745} 11/06/2021 22:59:52 - INFO - __main__ - Step 12660: {'lr': 0.0004936281472598728, 'samples': 2430720, 'steps': 12659, 'loss/train': 1.9527031183242798} 11/06/2021 22:59:52 - INFO - __main__ - Step 12661: {'lr': 0.0004936269567280648, 'samples': 2430912, 'steps': 12660, 'loss/train': 1.9170464277267456} 11/06/2021 22:59:52 - INFO - __main__ - Step 12662: {'lr': 0.0004936257660864822, 'samples': 2431104, 'steps': 12661, 'loss/train': 1.2196903228759766} 11/06/2021 22:59:53 - INFO - __main__ - Step 12663: {'lr': 0.0004936245753351256, 'samples': 2431296, 'steps': 12662, 'loss/train': 1.9647890329360962} 11/06/2021 22:59:53 - INFO - __main__ - Step 12664: {'lr': 0.0004936233844739955, 'samples': 2431488, 'steps': 12663, 'loss/train': 1.9114571809768677} 11/06/2021 22:59:54 - INFO - __main__ - Step 12665: {'lr': 0.0004936221935030924, 'samples': 2431680, 'steps': 12664, 'loss/train': 1.2946553230285645} 11/06/2021 22:59:54 - INFO - __main__ - Step 12666: {'lr': 0.000493621002422417, 'samples': 2431872, 'steps': 12665, 'loss/train': 1.6806825399398804} 11/06/2021 22:59:55 - INFO - __main__ - Step 12667: {'lr': 0.0004936198112319698, 'samples': 2432064, 'steps': 12666, 'loss/train': 1.6497493982315063} 11/06/2021 22:59:55 - INFO - __main__ - Step 12668: {'lr': 0.0004936186199317511, 'samples': 2432256, 'steps': 12667, 'loss/train': 0.71175217628479} 11/06/2021 22:59:55 - INFO - __main__ - Step 12669: {'lr': 0.0004936174285217618, 'samples': 2432448, 'steps': 12668, 'loss/train': 2.324061393737793} 11/06/2021 22:59:56 - INFO - __main__ - Step 12670: {'lr': 0.0004936162370020021, 'samples': 2432640, 'steps': 12669, 'loss/train': 1.2230273485183716} 11/06/2021 22:59:57 - INFO - __main__ - Step 12671: {'lr': 0.0004936150453724727, 'samples': 2432832, 'steps': 12670, 'loss/train': 1.6716129779815674} 11/06/2021 22:59:57 - INFO - __main__ - Step 12672: {'lr': 0.0004936138536331742, 'samples': 2433024, 'steps': 12671, 'loss/train': 1.5442752838134766} 11/06/2021 22:59:58 - INFO - __main__ - Step 12673: {'lr': 0.000493612661784107, 'samples': 2433216, 'steps': 12672, 'loss/train': 1.818224310874939} 11/06/2021 22:59:58 - INFO - __main__ - Step 12674: {'lr': 0.0004936114698252717, 'samples': 2433408, 'steps': 12673, 'loss/train': 1.5145035982131958} 11/06/2021 22:59:58 - INFO - __main__ - Step 12675: {'lr': 0.0004936102777566688, 'samples': 2433600, 'steps': 12674, 'loss/train': 0.6831874847412109} 11/06/2021 22:59:59 - INFO - __main__ - Step 12676: {'lr': 0.0004936090855782989, 'samples': 2433792, 'steps': 12675, 'loss/train': 1.201375961303711} 11/06/2021 23:00:00 - INFO - __main__ - Step 12677: {'lr': 0.0004936078932901625, 'samples': 2433984, 'steps': 12676, 'loss/train': 1.8787459135055542} 11/06/2021 23:00:00 - INFO - __main__ - Step 12678: {'lr': 0.0004936067008922602, 'samples': 2434176, 'steps': 12677, 'loss/train': 2.3367536067962646} 11/06/2021 23:00:00 - INFO - __main__ - Step 12679: {'lr': 0.0004936055083845924, 'samples': 2434368, 'steps': 12678, 'loss/train': 1.7321808338165283} 11/06/2021 23:00:01 - INFO - __main__ - Step 12680: {'lr': 0.0004936043157671597, 'samples': 2434560, 'steps': 12679, 'loss/train': 1.1493476629257202} 11/06/2021 23:00:02 - INFO - __main__ - Step 12681: {'lr': 0.0004936031230399628, 'samples': 2434752, 'steps': 12680, 'loss/train': 1.7847900390625} 11/06/2021 23:00:02 - INFO - __main__ - Step 12682: {'lr': 0.000493601930203002, 'samples': 2434944, 'steps': 12681, 'loss/train': 1.8895950317382812} 11/06/2021 23:00:03 - INFO - __main__ - Step 12683: {'lr': 0.0004936007372562778, 'samples': 2435136, 'steps': 12682, 'loss/train': 1.703395128250122} 11/06/2021 23:00:03 - INFO - __main__ - Step 12684: {'lr': 0.0004935995441997911, 'samples': 2435328, 'steps': 12683, 'loss/train': 1.5084177255630493} 11/06/2021 23:00:03 - INFO - __main__ - Step 12685: {'lr': 0.000493598351033542, 'samples': 2435520, 'steps': 12684, 'loss/train': 1.2693252563476562} 11/06/2021 23:00:04 - INFO - __main__ - Step 12686: {'lr': 0.0004935971577575313, 'samples': 2435712, 'steps': 12685, 'loss/train': 1.772753357887268} 11/06/2021 23:00:05 - INFO - __main__ - Step 12687: {'lr': 0.0004935959643717595, 'samples': 2435904, 'steps': 12686, 'loss/train': 1.3232539892196655} 11/06/2021 23:00:05 - INFO - __main__ - Step 12688: {'lr': 0.0004935947708762272, 'samples': 2436096, 'steps': 12687, 'loss/train': 1.1217036247253418} 11/06/2021 23:00:05 - INFO - __main__ - Step 12689: {'lr': 0.0004935935772709348, 'samples': 2436288, 'steps': 12688, 'loss/train': 2.837244987487793} 11/06/2021 23:00:06 - INFO - __main__ - Step 12690: {'lr': 0.0004935923835558829, 'samples': 2436480, 'steps': 12689, 'loss/train': 1.7220159769058228} 11/06/2021 23:00:07 - INFO - __main__ - Step 12691: {'lr': 0.0004935911897310719, 'samples': 2436672, 'steps': 12690, 'loss/train': 1.8013556003570557} 11/06/2021 23:00:07 - INFO - __main__ - Step 12692: {'lr': 0.0004935899957965027, 'samples': 2436864, 'steps': 12691, 'loss/train': 1.6648582220077515} 11/06/2021 23:00:07 - INFO - __main__ - Step 12693: {'lr': 0.0004935888017521754, 'samples': 2437056, 'steps': 12692, 'loss/train': 2.0813040733337402} 11/06/2021 23:00:08 - INFO - __main__ - Step 12694: {'lr': 0.0004935876075980908, 'samples': 2437248, 'steps': 12693, 'loss/train': 1.7684749364852905} 11/06/2021 23:00:08 - INFO - __main__ - Step 12695: {'lr': 0.0004935864133342495, 'samples': 2437440, 'steps': 12694, 'loss/train': 1.6431411504745483} 11/06/2021 23:00:09 - INFO - __main__ - Step 12696: {'lr': 0.0004935852189606517, 'samples': 2437632, 'steps': 12695, 'loss/train': 1.6132627725601196} 11/06/2021 23:00:09 - INFO - __main__ - Step 12697: {'lr': 0.0004935840244772984, 'samples': 2437824, 'steps': 12696, 'loss/train': 1.5991240739822388} 11/06/2021 23:00:10 - INFO - __main__ - Step 12698: {'lr': 0.0004935828298841898, 'samples': 2438016, 'steps': 12697, 'loss/train': 1.5647183656692505} 11/06/2021 23:00:10 - INFO - __main__ - Step 12699: {'lr': 0.0004935816351813265, 'samples': 2438208, 'steps': 12698, 'loss/train': 1.92881441116333} 11/06/2021 23:00:11 - INFO - __main__ - Step 12700: {'lr': 0.0004935804403687091, 'samples': 2438400, 'steps': 12699, 'loss/train': 1.8408360481262207} 11/06/2021 23:00:11 - INFO - __main__ - Step 12701: {'lr': 0.0004935792454463381, 'samples': 2438592, 'steps': 12700, 'loss/train': 1.49015212059021} 11/06/2021 23:00:12 - INFO - __main__ - Step 12702: {'lr': 0.000493578050414214, 'samples': 2438784, 'steps': 12701, 'loss/train': 1.8693770170211792} 11/06/2021 23:00:12 - INFO - __main__ - Step 12703: {'lr': 0.0004935768552723375, 'samples': 2438976, 'steps': 12702, 'loss/train': 1.6412677764892578} 11/06/2021 23:00:13 - INFO - __main__ - Step 12704: {'lr': 0.000493575660020709, 'samples': 2439168, 'steps': 12703, 'loss/train': 1.6965235471725464} 11/06/2021 23:00:13 - INFO - __main__ - Step 12705: {'lr': 0.000493574464659329, 'samples': 2439360, 'steps': 12704, 'loss/train': 1.4000598192214966} 11/06/2021 23:00:13 - INFO - __main__ - Step 12706: {'lr': 0.0004935732691881981, 'samples': 2439552, 'steps': 12705, 'loss/train': 1.8921767473220825} 11/06/2021 23:00:14 - INFO - __main__ - Step 12707: {'lr': 0.0004935720736073169, 'samples': 2439744, 'steps': 12706, 'loss/train': 1.973684310913086} 11/06/2021 23:00:15 - INFO - __main__ - Step 12708: {'lr': 0.0004935708779166859, 'samples': 2439936, 'steps': 12707, 'loss/train': 1.2805671691894531} 11/06/2021 23:00:15 - INFO - __main__ - Step 12709: {'lr': 0.0004935696821163056, 'samples': 2440128, 'steps': 12708, 'loss/train': 1.9279905557632446} 11/06/2021 23:00:15 - INFO - __main__ - Step 12710: {'lr': 0.0004935684862061766, 'samples': 2440320, 'steps': 12709, 'loss/train': 1.654013991355896} 11/06/2021 23:00:16 - INFO - __main__ - Step 12711: {'lr': 0.0004935672901862993, 'samples': 2440512, 'steps': 12710, 'loss/train': 1.4374395608901978} 11/06/2021 23:00:17 - INFO - __main__ - Step 12712: {'lr': 0.0004935660940566744, 'samples': 2440704, 'steps': 12711, 'loss/train': 1.640486240386963} 11/06/2021 23:00:17 - INFO - __main__ - Step 12713: {'lr': 0.0004935648978173024, 'samples': 2440896, 'steps': 12712, 'loss/train': 1.7238794565200806} 11/06/2021 23:00:17 - INFO - __main__ - Step 12714: {'lr': 0.0004935637014681837, 'samples': 2441088, 'steps': 12713, 'loss/train': 1.8015981912612915} 11/06/2021 23:00:18 - INFO - __main__ - Step 12715: {'lr': 0.0004935625050093191, 'samples': 2441280, 'steps': 12714, 'loss/train': 1.7736315727233887} 11/06/2021 23:00:18 - INFO - __main__ - Step 12716: {'lr': 0.000493561308440709, 'samples': 2441472, 'steps': 12715, 'loss/train': 1.336901068687439} 11/06/2021 23:00:19 - INFO - __main__ - Step 12717: {'lr': 0.0004935601117623538, 'samples': 2441664, 'steps': 12716, 'loss/train': 1.488775610923767} 11/06/2021 23:00:19 - INFO - __main__ - Step 12718: {'lr': 0.0004935589149742542, 'samples': 2441856, 'steps': 12717, 'loss/train': 2.1799232959747314} 11/06/2021 23:00:20 - INFO - __main__ - Step 12719: {'lr': 0.0004935577180764108, 'samples': 2442048, 'steps': 12718, 'loss/train': 1.3624755144119263} 11/06/2021 23:00:20 - INFO - __main__ - Step 12720: {'lr': 0.000493556521068824, 'samples': 2442240, 'steps': 12719, 'loss/train': 1.883273720741272} 11/06/2021 23:00:21 - INFO - __main__ - Step 12721: {'lr': 0.0004935553239514943, 'samples': 2442432, 'steps': 12720, 'loss/train': 1.8159416913986206} 11/06/2021 23:00:22 - INFO - __main__ - Step 12722: {'lr': 0.0004935541267244225, 'samples': 2442624, 'steps': 12721, 'loss/train': 1.7805633544921875} 11/06/2021 23:00:22 - INFO - __main__ - Step 12723: {'lr': 0.0004935529293876088, 'samples': 2442816, 'steps': 12722, 'loss/train': 1.8284835815429688} 11/06/2021 23:00:22 - INFO - __main__ - Step 12724: {'lr': 0.000493551731941054, 'samples': 2443008, 'steps': 12723, 'loss/train': 1.8245835304260254} 11/06/2021 23:00:23 - INFO - __main__ - Step 12725: {'lr': 0.0004935505343847586, 'samples': 2443200, 'steps': 12724, 'loss/train': 1.665300965309143} 11/06/2021 23:00:23 - INFO - __main__ - Step 12726: {'lr': 0.000493549336718723, 'samples': 2443392, 'steps': 12725, 'loss/train': 1.5313256978988647} 11/06/2021 23:00:23 - INFO - __main__ - Step 12727: {'lr': 0.0004935481389429479, 'samples': 2443584, 'steps': 12726, 'loss/train': 1.2466456890106201} 11/06/2021 23:00:24 - INFO - __main__ - Step 12728: {'lr': 0.0004935469410574337, 'samples': 2443776, 'steps': 12727, 'loss/train': 1.5720294713974} 11/06/2021 23:00:25 - INFO - __main__ - Step 12729: {'lr': 0.000493545743062181, 'samples': 2443968, 'steps': 12728, 'loss/train': 1.831040382385254} 11/06/2021 23:00:25 - INFO - __main__ - Step 12730: {'lr': 0.0004935445449571903, 'samples': 2444160, 'steps': 12729, 'loss/train': 1.5608617067337036} 11/06/2021 23:00:26 - INFO - __main__ - Step 12731: {'lr': 0.0004935433467424624, 'samples': 2444352, 'steps': 12730, 'loss/train': 1.843248963356018} 11/06/2021 23:00:26 - INFO - __main__ - Step 12732: {'lr': 0.0004935421484179974, 'samples': 2444544, 'steps': 12731, 'loss/train': 1.858343243598938} 11/06/2021 23:00:27 - INFO - __main__ - Step 12733: {'lr': 0.0004935409499837962, 'samples': 2444736, 'steps': 12732, 'loss/train': 2.449708938598633} 11/06/2021 23:00:27 - INFO - __main__ - Step 12734: {'lr': 0.0004935397514398591, 'samples': 2444928, 'steps': 12733, 'loss/train': 1.2555254697799683} 11/06/2021 23:00:27 - INFO - __main__ - Step 12735: {'lr': 0.0004935385527861869, 'samples': 2445120, 'steps': 12734, 'loss/train': 1.8298964500427246} 11/06/2021 23:00:28 - INFO - __main__ - Step 12736: {'lr': 0.0004935373540227798, 'samples': 2445312, 'steps': 12735, 'loss/train': 1.8937606811523438} 11/06/2021 23:00:28 - INFO - __main__ - Step 12737: {'lr': 0.0004935361551496387, 'samples': 2445504, 'steps': 12736, 'loss/train': 1.5406252145767212} 11/06/2021 23:00:29 - INFO - __main__ - Step 12738: {'lr': 0.0004935349561667638, 'samples': 2445696, 'steps': 12737, 'loss/train': 1.3331429958343506} 11/06/2021 23:00:30 - INFO - __main__ - Step 12739: {'lr': 0.000493533757074156, 'samples': 2445888, 'steps': 12738, 'loss/train': 1.462608814239502} 11/06/2021 23:00:30 - INFO - __main__ - Step 12740: {'lr': 0.0004935325578718155, 'samples': 2446080, 'steps': 12739, 'loss/train': 1.7719459533691406} 11/06/2021 23:00:30 - INFO - __main__ - Step 12741: {'lr': 0.000493531358559743, 'samples': 2446272, 'steps': 12740, 'loss/train': 1.849948525428772} 11/06/2021 23:00:31 - INFO - __main__ - Step 12742: {'lr': 0.0004935301591379391, 'samples': 2446464, 'steps': 12741, 'loss/train': 2.125699520111084} 11/06/2021 23:00:32 - INFO - __main__ - Step 12743: {'lr': 0.0004935289596064042, 'samples': 2446656, 'steps': 12742, 'loss/train': 1.4492217302322388} 11/06/2021 23:00:32 - INFO - __main__ - Step 12744: {'lr': 0.0004935277599651389, 'samples': 2446848, 'steps': 12743, 'loss/train': 1.4659162759780884} 11/06/2021 23:00:32 - INFO - __main__ - Step 12745: {'lr': 0.0004935265602141437, 'samples': 2447040, 'steps': 12744, 'loss/train': 2.072901964187622} 11/06/2021 23:00:33 - INFO - __main__ - Step 12746: {'lr': 0.0004935253603534193, 'samples': 2447232, 'steps': 12745, 'loss/train': 1.745668649673462} 11/06/2021 23:00:33 - INFO - __main__ - Step 12747: {'lr': 0.0004935241603829661, 'samples': 2447424, 'steps': 12746, 'loss/train': 1.9850726127624512} 11/06/2021 23:00:34 - INFO - __main__ - Step 12748: {'lr': 0.0004935229603027847, 'samples': 2447616, 'steps': 12747, 'loss/train': 1.3477630615234375} 11/06/2021 23:00:34 - INFO - __main__ - Step 12749: {'lr': 0.0004935217601128755, 'samples': 2447808, 'steps': 12748, 'loss/train': 1.7236028909683228} 11/06/2021 23:00:35 - INFO - __main__ - Step 12750: {'lr': 0.0004935205598132393, 'samples': 2448000, 'steps': 12749, 'loss/train': 1.637121319770813} 11/06/2021 23:00:35 - INFO - __main__ - Step 12751: {'lr': 0.0004935193594038764, 'samples': 2448192, 'steps': 12750, 'loss/train': 1.4060014486312866} 11/06/2021 23:00:35 - INFO - __main__ - Step 12752: {'lr': 0.0004935181588847876, 'samples': 2448384, 'steps': 12751, 'loss/train': 1.7447482347488403} 11/06/2021 23:00:36 - INFO - __main__ - Step 12753: {'lr': 0.0004935169582559731, 'samples': 2448576, 'steps': 12752, 'loss/train': 2.0324788093566895} 11/06/2021 23:00:37 - INFO - __main__ - Step 12754: {'lr': 0.0004935157575174336, 'samples': 2448768, 'steps': 12753, 'loss/train': 1.7007369995117188} 11/06/2021 23:00:37 - INFO - __main__ - Step 12755: {'lr': 0.0004935145566691698, 'samples': 2448960, 'steps': 12754, 'loss/train': 1.9936100244522095} 11/06/2021 23:00:37 - INFO - __main__ - Step 12756: {'lr': 0.000493513355711182, 'samples': 2449152, 'steps': 12755, 'loss/train': 1.6351784467697144} 11/06/2021 23:00:38 - INFO - __main__ - Step 12757: {'lr': 0.0004935121546434708, 'samples': 2449344, 'steps': 12756, 'loss/train': 1.7019697427749634} 11/06/2021 23:00:39 - INFO - __main__ - Step 12758: {'lr': 0.0004935109534660368, 'samples': 2449536, 'steps': 12757, 'loss/train': 1.8127813339233398} 11/06/2021 23:00:39 - INFO - __main__ - Step 12759: {'lr': 0.0004935097521788805, 'samples': 2449728, 'steps': 12758, 'loss/train': 0.67855304479599} 11/06/2021 23:00:40 - INFO - __main__ - Step 12760: {'lr': 0.0004935085507820026, 'samples': 2449920, 'steps': 12759, 'loss/train': 2.1449859142303467} 11/06/2021 23:00:40 - INFO - __main__ - Step 12761: {'lr': 0.0004935073492754034, 'samples': 2450112, 'steps': 12760, 'loss/train': 1.3522168397903442} 11/06/2021 23:00:40 - INFO - __main__ - Step 12762: {'lr': 0.0004935061476590835, 'samples': 2450304, 'steps': 12761, 'loss/train': 0.9896982908248901} 11/06/2021 23:00:41 - INFO - __main__ - Step 12763: {'lr': 0.0004935049459330437, 'samples': 2450496, 'steps': 12762, 'loss/train': 1.8502204418182373} 11/06/2021 23:00:42 - INFO - __main__ - Step 12764: {'lr': 0.0004935037440972841, 'samples': 2450688, 'steps': 12763, 'loss/train': 2.6931633949279785} 11/06/2021 23:00:42 - INFO - __main__ - Step 12765: {'lr': 0.0004935025421518056, 'samples': 2450880, 'steps': 12764, 'loss/train': 1.5279968976974487} 11/06/2021 23:00:42 - INFO - __main__ - Step 12766: {'lr': 0.0004935013400966086, 'samples': 2451072, 'steps': 12765, 'loss/train': 1.7780354022979736} 11/06/2021 23:00:43 - INFO - __main__ - Step 12767: {'lr': 0.0004935001379316935, 'samples': 2451264, 'steps': 12766, 'loss/train': 1.5596141815185547} 11/06/2021 23:00:43 - INFO - __main__ - Step 12768: {'lr': 0.0004934989356570611, 'samples': 2451456, 'steps': 12767, 'loss/train': 1.7943576574325562} 11/06/2021 23:00:44 - INFO - __main__ - Step 12769: {'lr': 0.0004934977332727118, 'samples': 2451648, 'steps': 12768, 'loss/train': 1.703242301940918} 11/06/2021 23:00:44 - INFO - __main__ - Step 12770: {'lr': 0.0004934965307786464, 'samples': 2451840, 'steps': 12769, 'loss/train': 1.6350740194320679} 11/06/2021 23:00:45 - INFO - __main__ - Step 12771: {'lr': 0.0004934953281748649, 'samples': 2452032, 'steps': 12770, 'loss/train': 1.350691318511963} 11/06/2021 23:00:45 - INFO - __main__ - Step 12772: {'lr': 0.0004934941254613684, 'samples': 2452224, 'steps': 12771, 'loss/train': 1.6434681415557861} 11/06/2021 23:00:45 - INFO - __main__ - Step 12773: {'lr': 0.0004934929226381572, 'samples': 2452416, 'steps': 12772, 'loss/train': 1.567505121231079} 11/06/2021 23:00:47 - INFO - __main__ - Step 12774: {'lr': 0.0004934917197052317, 'samples': 2452608, 'steps': 12773, 'loss/train': 1.6525659561157227} 11/06/2021 23:00:47 - INFO - __main__ - Step 12775: {'lr': 0.0004934905166625926, 'samples': 2452800, 'steps': 12774, 'loss/train': 1.8389402627944946} 11/06/2021 23:00:48 - INFO - __main__ - Step 12776: {'lr': 0.0004934893135102405, 'samples': 2452992, 'steps': 12775, 'loss/train': 0.4809941351413727} 11/06/2021 23:00:48 - INFO - __main__ - Step 12777: {'lr': 0.0004934881102481759, 'samples': 2453184, 'steps': 12776, 'loss/train': 1.6852707862854004} 11/06/2021 23:00:48 - INFO - __main__ - Step 12778: {'lr': 0.0004934869068763992, 'samples': 2453376, 'steps': 12777, 'loss/train': 1.8732150793075562} 11/06/2021 23:00:49 - INFO - __main__ - Step 12779: {'lr': 0.0004934857033949112, 'samples': 2453568, 'steps': 12778, 'loss/train': 1.4058620929718018} 11/06/2021 23:00:49 - INFO - __main__ - Step 12780: {'lr': 0.0004934844998037122, 'samples': 2453760, 'steps': 12779, 'loss/train': 1.6926714181900024} 11/06/2021 23:00:50 - INFO - __main__ - Step 12781: {'lr': 0.0004934832961028028, 'samples': 2453952, 'steps': 12780, 'loss/train': 1.8323222398757935} 11/06/2021 23:00:50 - INFO - __main__ - Step 12782: {'lr': 0.0004934820922921836, 'samples': 2454144, 'steps': 12781, 'loss/train': 1.9833849668502808} 11/06/2021 23:00:51 - INFO - __main__ - Step 12783: {'lr': 0.0004934808883718553, 'samples': 2454336, 'steps': 12782, 'loss/train': 1.3475416898727417} 11/06/2021 23:00:52 - INFO - __main__ - Step 12784: {'lr': 0.0004934796843418181, 'samples': 2454528, 'steps': 12783, 'loss/train': 1.8701623678207397} 11/06/2021 23:00:52 - INFO - __main__ - Step 12785: {'lr': 0.0004934784802020728, 'samples': 2454720, 'steps': 12784, 'loss/train': 1.69837486743927} 11/06/2021 23:00:52 - INFO - __main__ - Step 12786: {'lr': 0.0004934772759526198, 'samples': 2454912, 'steps': 12785, 'loss/train': 1.8089841604232788} 11/06/2021 23:00:53 - INFO - __main__ - Step 12787: {'lr': 0.0004934760715934597, 'samples': 2455104, 'steps': 12786, 'loss/train': 1.5334937572479248} 11/06/2021 23:00:53 - INFO - __main__ - Step 12788: {'lr': 0.0004934748671245931, 'samples': 2455296, 'steps': 12787, 'loss/train': 1.691651463508606} 11/06/2021 23:00:54 - INFO - __main__ - Step 12789: {'lr': 0.0004934736625460203, 'samples': 2455488, 'steps': 12788, 'loss/train': 1.8990588188171387} 11/06/2021 23:00:54 - INFO - __main__ - Step 12790: {'lr': 0.0004934724578577422, 'samples': 2455680, 'steps': 12789, 'loss/train': 1.2063190937042236} 11/06/2021 23:00:55 - INFO - __main__ - Step 12791: {'lr': 0.0004934712530597591, 'samples': 2455872, 'steps': 12790, 'loss/train': 1.6075855493545532} 11/06/2021 23:00:55 - INFO - __main__ - Step 12792: {'lr': 0.0004934700481520717, 'samples': 2456064, 'steps': 12791, 'loss/train': 1.9511923789978027} 11/06/2021 23:00:55 - INFO - __main__ - Step 12793: {'lr': 0.0004934688431346804, 'samples': 2456256, 'steps': 12792, 'loss/train': 1.5929433107376099} 11/06/2021 23:00:56 - INFO - __main__ - Step 12794: {'lr': 0.0004934676380075857, 'samples': 2456448, 'steps': 12793, 'loss/train': 1.243303894996643} 11/06/2021 23:00:57 - INFO - __main__ - Step 12795: {'lr': 0.0004934664327707884, 'samples': 2456640, 'steps': 12794, 'loss/train': 2.0900375843048096} 11/06/2021 23:00:57 - INFO - __main__ - Step 12796: {'lr': 0.0004934652274242888, 'samples': 2456832, 'steps': 12795, 'loss/train': 1.7989342212677002} 11/06/2021 23:00:58 - INFO - __main__ - Step 12797: {'lr': 0.0004934640219680875, 'samples': 2457024, 'steps': 12796, 'loss/train': 0.9342484474182129} 11/06/2021 23:00:58 - INFO - __main__ - Step 12798: {'lr': 0.0004934628164021851, 'samples': 2457216, 'steps': 12797, 'loss/train': 0.24721074104309082} 11/06/2021 23:00:59 - INFO - __main__ - Step 12799: {'lr': 0.0004934616107265821, 'samples': 2457408, 'steps': 12798, 'loss/train': 2.198434591293335} 11/06/2021 23:00:59 - INFO - __main__ - Step 12800: {'lr': 0.0004934604049412791, 'samples': 2457600, 'steps': 12799, 'loss/train': 1.550801157951355} 11/06/2021 23:01:00 - INFO - __main__ - Step 12801: {'lr': 0.0004934591990462766, 'samples': 2457792, 'steps': 12800, 'loss/train': 1.3108569383621216} 11/06/2021 23:01:00 - INFO - __main__ - Step 12802: {'lr': 0.0004934579930415751, 'samples': 2457984, 'steps': 12801, 'loss/train': 1.9333668947219849} 11/06/2021 23:01:00 - INFO - __main__ - Step 12803: {'lr': 0.0004934567869271751, 'samples': 2458176, 'steps': 12802, 'loss/train': 1.9801455736160278} 11/06/2021 23:01:01 - INFO - __main__ - Step 12804: {'lr': 0.0004934555807030774, 'samples': 2458368, 'steps': 12803, 'loss/train': 1.053701400756836} 11/06/2021 23:01:02 - INFO - __main__ - Step 12805: {'lr': 0.0004934543743692822, 'samples': 2458560, 'steps': 12804, 'loss/train': 1.4716850519180298} 11/06/2021 23:01:02 - INFO - __main__ - Step 12806: {'lr': 0.0004934531679257903, 'samples': 2458752, 'steps': 12805, 'loss/train': 1.5304038524627686} 11/06/2021 23:01:03 - INFO - __main__ - Step 12807: {'lr': 0.0004934519613726022, 'samples': 2458944, 'steps': 12806, 'loss/train': 1.6712217330932617} 11/06/2021 23:01:03 - INFO - __main__ - Step 12808: {'lr': 0.0004934507547097183, 'samples': 2459136, 'steps': 12807, 'loss/train': 1.6593072414398193} 11/06/2021 23:01:03 - INFO - __main__ - Step 12809: {'lr': 0.0004934495479371393, 'samples': 2459328, 'steps': 12808, 'loss/train': 4.7572102546691895} 11/06/2021 23:01:04 - INFO - __main__ - Step 12810: {'lr': 0.0004934483410548658, 'samples': 2459520, 'steps': 12809, 'loss/train': 0.9556400775909424} 11/06/2021 23:01:05 - INFO - __main__ - Step 12811: {'lr': 0.0004934471340628981, 'samples': 2459712, 'steps': 12810, 'loss/train': 0.8056848049163818} 11/06/2021 23:01:05 - INFO - __main__ - Step 12812: {'lr': 0.000493445926961237, 'samples': 2459904, 'steps': 12811, 'loss/train': 1.6957072019577026} 11/06/2021 23:01:05 - INFO - __main__ - Step 12813: {'lr': 0.0004934447197498828, 'samples': 2460096, 'steps': 12812, 'loss/train': 1.9989805221557617} 11/06/2021 23:01:06 - INFO - __main__ - Step 12814: {'lr': 0.0004934435124288362, 'samples': 2460288, 'steps': 12813, 'loss/train': 2.132337808609009} 11/06/2021 23:01:07 - INFO - __main__ - Step 12815: {'lr': 0.0004934423049980977, 'samples': 2460480, 'steps': 12814, 'loss/train': 1.7412266731262207} 11/06/2021 23:01:07 - INFO - __main__ - Step 12816: {'lr': 0.0004934410974576679, 'samples': 2460672, 'steps': 12815, 'loss/train': 1.9368259906768799} 11/06/2021 23:01:07 - INFO - __main__ - Step 12817: {'lr': 0.0004934398898075472, 'samples': 2460864, 'steps': 12816, 'loss/train': 1.8307538032531738} 11/06/2021 23:01:08 - INFO - __main__ - Step 12818: {'lr': 0.0004934386820477363, 'samples': 2461056, 'steps': 12817, 'loss/train': 1.4713562726974487} 11/06/2021 23:01:08 - INFO - __main__ - Step 12819: {'lr': 0.0004934374741782357, 'samples': 2461248, 'steps': 12818, 'loss/train': 1.4008110761642456} 11/06/2021 23:01:09 - INFO - __main__ - Step 12820: {'lr': 0.000493436266199046, 'samples': 2461440, 'steps': 12819, 'loss/train': 1.4729948043823242} 11/06/2021 23:01:09 - INFO - __main__ - Step 12821: {'lr': 0.0004934350581101676, 'samples': 2461632, 'steps': 12820, 'loss/train': 1.7127310037612915} 11/06/2021 23:01:10 - INFO - __main__ - Step 12822: {'lr': 0.0004934338499116011, 'samples': 2461824, 'steps': 12821, 'loss/train': 1.610058307647705} 11/06/2021 23:01:10 - INFO - __main__ - Step 12823: {'lr': 0.0004934326416033471, 'samples': 2462016, 'steps': 12822, 'loss/train': 1.9114627838134766} 11/06/2021 23:01:10 - INFO - __main__ - Step 12824: {'lr': 0.0004934314331854061, 'samples': 2462208, 'steps': 12823, 'loss/train': 1.826889157295227} 11/06/2021 23:01:12 - INFO - __main__ - Step 12825: {'lr': 0.0004934302246577786, 'samples': 2462400, 'steps': 12824, 'loss/train': 1.7399663925170898} 11/06/2021 23:01:12 - INFO - __main__ - Step 12826: {'lr': 0.0004934290160204652, 'samples': 2462592, 'steps': 12825, 'loss/train': 1.5818183422088623} 11/06/2021 23:01:12 - INFO - __main__ - Step 12827: {'lr': 0.0004934278072734666, 'samples': 2462784, 'steps': 12826, 'loss/train': 1.4330865144729614} 11/06/2021 23:01:13 - INFO - __main__ - Step 12828: {'lr': 0.000493426598416783, 'samples': 2462976, 'steps': 12827, 'loss/train': 1.77826988697052} 11/06/2021 23:01:13 - INFO - __main__ - Step 12829: {'lr': 0.0004934253894504152, 'samples': 2463168, 'steps': 12828, 'loss/train': 1.1493207216262817} 11/06/2021 23:01:14 - INFO - __main__ - Step 12830: {'lr': 0.0004934241803743637, 'samples': 2463360, 'steps': 12829, 'loss/train': 1.8719111680984497} 11/06/2021 23:01:14 - INFO - __main__ - Step 12831: {'lr': 0.000493422971188629, 'samples': 2463552, 'steps': 12830, 'loss/train': 1.228978157043457} 11/06/2021 23:01:15 - INFO - __main__ - Step 12832: {'lr': 0.0004934217618932117, 'samples': 2463744, 'steps': 12831, 'loss/train': 1.8399741649627686} 11/06/2021 23:01:15 - INFO - __main__ - Step 12833: {'lr': 0.0004934205524881123, 'samples': 2463936, 'steps': 12832, 'loss/train': 1.9681713581085205} 11/06/2021 23:01:15 - INFO - __main__ - Step 12834: {'lr': 0.0004934193429733312, 'samples': 2464128, 'steps': 12833, 'loss/train': 0.9652214050292969} 11/06/2021 23:01:17 - INFO - __main__ - Step 12835: {'lr': 0.0004934181333488693, 'samples': 2464320, 'steps': 12834, 'loss/train': 1.6454654932022095} 11/06/2021 23:01:18 - INFO - __main__ - Step 12836: {'lr': 0.0004934169236147268, 'samples': 2464512, 'steps': 12835, 'loss/train': 1.410776138305664} 11/06/2021 23:01:18 - INFO - __main__ - Step 12837: {'lr': 0.0004934157137709044, 'samples': 2464704, 'steps': 12836, 'loss/train': 1.5523715019226074} 11/06/2021 23:01:18 - INFO - __main__ - Step 12838: {'lr': 0.0004934145038174028, 'samples': 2464896, 'steps': 12837, 'loss/train': 1.6767199039459229} 11/06/2021 23:01:19 - INFO - __main__ - Step 12839: {'lr': 0.0004934132937542223, 'samples': 2465088, 'steps': 12838, 'loss/train': 1.64377760887146} 11/06/2021 23:01:19 - INFO - __main__ - Step 12840: {'lr': 0.0004934120835813634, 'samples': 2465280, 'steps': 12839, 'loss/train': 5.639301776885986} 11/06/2021 23:01:19 - INFO - __main__ - Step 12841: {'lr': 0.0004934108732988269, 'samples': 2465472, 'steps': 12840, 'loss/train': 5.523209095001221} 11/06/2021 23:01:20 - INFO - __main__ - Step 12842: {'lr': 0.0004934096629066133, 'samples': 2465664, 'steps': 12841, 'loss/train': 5.597353458404541} 11/06/2021 23:01:21 - INFO - __main__ - Step 12843: {'lr': 0.0004934084524047229, 'samples': 2465856, 'steps': 12842, 'loss/train': 2.109654188156128} 11/06/2021 23:01:21 - INFO - __main__ - Step 12844: {'lr': 0.0004934072417931564, 'samples': 2466048, 'steps': 12843, 'loss/train': 1.7585923671722412} 11/06/2021 23:01:22 - INFO - __main__ - Step 12845: {'lr': 0.0004934060310719145, 'samples': 2466240, 'steps': 12844, 'loss/train': 1.6276121139526367} 11/06/2021 23:01:22 - INFO - __main__ - Step 12846: {'lr': 0.0004934048202409974, 'samples': 2466432, 'steps': 12845, 'loss/train': 1.8930996656417847} 11/06/2021 23:01:22 - INFO - __main__ - Step 12847: {'lr': 0.000493403609300406, 'samples': 2466624, 'steps': 12846, 'loss/train': 1.744299054145813} 11/06/2021 23:01:23 - INFO - __main__ - Step 12848: {'lr': 0.0004934023982501406, 'samples': 2466816, 'steps': 12847, 'loss/train': 1.2807785272598267} 11/06/2021 23:01:24 - INFO - __main__ - Step 12849: {'lr': 0.000493401187090202, 'samples': 2467008, 'steps': 12848, 'loss/train': 1.8994098901748657} 11/06/2021 23:01:24 - INFO - __main__ - Step 12850: {'lr': 0.0004933999758205904, 'samples': 2467200, 'steps': 12849, 'loss/train': 1.5451972484588623} 11/06/2021 23:01:24 - INFO - __main__ - Step 12851: {'lr': 0.0004933987644413066, 'samples': 2467392, 'steps': 12850, 'loss/train': 1.0521020889282227} 11/06/2021 23:01:25 - INFO - __main__ - Step 12852: {'lr': 0.0004933975529523511, 'samples': 2467584, 'steps': 12851, 'loss/train': 1.6571444272994995} 11/06/2021 23:01:26 - INFO - __main__ - Step 12853: {'lr': 0.0004933963413537244, 'samples': 2467776, 'steps': 12852, 'loss/train': 1.570617437362671} 11/06/2021 23:01:26 - INFO - __main__ - Step 12854: {'lr': 0.000493395129645427, 'samples': 2467968, 'steps': 12853, 'loss/train': 0.8766611218452454} 11/06/2021 23:01:26 - INFO - __main__ - Step 12855: {'lr': 0.0004933939178274596, 'samples': 2468160, 'steps': 12854, 'loss/train': 1.5354561805725098} 11/06/2021 23:01:27 - INFO - __main__ - Step 12856: {'lr': 0.0004933927058998226, 'samples': 2468352, 'steps': 12855, 'loss/train': 1.74991774559021} 11/06/2021 23:01:27 - INFO - __main__ - Step 12857: {'lr': 0.0004933914938625166, 'samples': 2468544, 'steps': 12856, 'loss/train': 1.6767778396606445} 11/06/2021 23:01:28 - INFO - __main__ - Step 12858: {'lr': 0.0004933902817155422, 'samples': 2468736, 'steps': 12857, 'loss/train': 1.899521827697754} 11/06/2021 23:01:29 - INFO - __main__ - Step 12859: {'lr': 0.0004933890694588998, 'samples': 2468928, 'steps': 12858, 'loss/train': 1.837016224861145} 11/06/2021 23:01:29 - INFO - __main__ - Step 12860: {'lr': 0.0004933878570925901, 'samples': 2469120, 'steps': 12859, 'loss/train': 1.6478303670883179} 11/06/2021 23:01:29 - INFO - __main__ - Step 12861: {'lr': 0.0004933866446166136, 'samples': 2469312, 'steps': 12860, 'loss/train': 1.84537935256958} 11/06/2021 23:01:30 - INFO - __main__ - Step 12862: {'lr': 0.0004933854320309708, 'samples': 2469504, 'steps': 12861, 'loss/train': 2.0039706230163574} 11/06/2021 23:01:31 - INFO - __main__ - Step 12863: {'lr': 0.0004933842193356624, 'samples': 2469696, 'steps': 12862, 'loss/train': 2.835719585418701} 11/06/2021 23:01:31 - INFO - __main__ - Step 12864: {'lr': 0.0004933830065306887, 'samples': 2469888, 'steps': 12863, 'loss/train': 2.0461788177490234} 11/06/2021 23:01:31 - INFO - __main__ - Step 12865: {'lr': 0.0004933817936160504, 'samples': 2470080, 'steps': 12864, 'loss/train': 1.4113506078720093} 11/06/2021 23:01:32 - INFO - __main__ - Step 12866: {'lr': 0.0004933805805917479, 'samples': 2470272, 'steps': 12865, 'loss/train': 1.4473645687103271} 11/06/2021 23:01:32 - INFO - __main__ - Step 12867: {'lr': 0.000493379367457782, 'samples': 2470464, 'steps': 12866, 'loss/train': 1.7717403173446655} 11/06/2021 23:01:32 - INFO - __main__ - Step 12868: {'lr': 0.0004933781542141532, 'samples': 2470656, 'steps': 12867, 'loss/train': 1.985740303993225} 11/06/2021 23:01:34 - INFO - __main__ - Step 12869: {'lr': 0.0004933769408608618, 'samples': 2470848, 'steps': 12868, 'loss/train': 1.6742669343948364} 11/06/2021 23:01:34 - INFO - __main__ - Step 12870: {'lr': 0.0004933757273979086, 'samples': 2471040, 'steps': 12869, 'loss/train': 0.415326327085495} 11/06/2021 23:01:34 - INFO - __main__ - Step 12871: {'lr': 0.0004933745138252939, 'samples': 2471232, 'steps': 12870, 'loss/train': 1.893794298171997} 11/06/2021 23:01:35 - INFO - __main__ - Step 12872: {'lr': 0.0004933733001430186, 'samples': 2471424, 'steps': 12871, 'loss/train': 1.7198106050491333} 11/06/2021 23:01:35 - INFO - __main__ - Step 12873: {'lr': 0.000493372086351083, 'samples': 2471616, 'steps': 12872, 'loss/train': 2.0763978958129883} 11/06/2021 23:01:36 - INFO - __main__ - Step 12874: {'lr': 0.0004933708724494877, 'samples': 2471808, 'steps': 12873, 'loss/train': 1.426766276359558} 11/06/2021 23:01:36 - INFO - __main__ - Step 12875: {'lr': 0.0004933696584382331, 'samples': 2472000, 'steps': 12874, 'loss/train': 1.5616261959075928} 11/06/2021 23:01:37 - INFO - __main__ - Step 12876: {'lr': 0.00049336844431732, 'samples': 2472192, 'steps': 12875, 'loss/train': 1.1413694620132446} 11/06/2021 23:01:37 - INFO - __main__ - Step 12877: {'lr': 0.0004933672300867488, 'samples': 2472384, 'steps': 12876, 'loss/train': 1.5705578327178955} 11/06/2021 23:01:37 - INFO - __main__ - Step 12878: {'lr': 0.0004933660157465202, 'samples': 2472576, 'steps': 12877, 'loss/train': 1.6181583404541016} 11/06/2021 23:01:39 - INFO - __main__ - Step 12879: {'lr': 0.0004933648012966344, 'samples': 2472768, 'steps': 12878, 'loss/train': 2.557011842727661} 11/06/2021 23:01:39 - INFO - __main__ - Step 12880: {'lr': 0.0004933635867370923, 'samples': 2472960, 'steps': 12879, 'loss/train': 1.745104193687439} 11/06/2021 23:01:39 - INFO - __main__ - Step 12881: {'lr': 0.0004933623720678944, 'samples': 2473152, 'steps': 12880, 'loss/train': 1.816332459449768} 11/06/2021 23:01:40 - INFO - __main__ - Step 12882: {'lr': 0.000493361157289041, 'samples': 2473344, 'steps': 12881, 'loss/train': 1.7329145669937134} 11/06/2021 23:01:40 - INFO - __main__ - Step 12883: {'lr': 0.000493359942400533, 'samples': 2473536, 'steps': 12882, 'loss/train': 1.764685034751892} 11/06/2021 23:01:41 - INFO - __main__ - Step 12884: {'lr': 0.0004933587274023706, 'samples': 2473728, 'steps': 12883, 'loss/train': 1.9591374397277832} 11/06/2021 23:01:41 - INFO - __main__ - Step 12885: {'lr': 0.0004933575122945547, 'samples': 2473920, 'steps': 12884, 'loss/train': 1.8801758289337158} 11/06/2021 23:01:42 - INFO - __main__ - Step 12886: {'lr': 0.0004933562970770855, 'samples': 2474112, 'steps': 12885, 'loss/train': 1.7536741495132446} 11/06/2021 23:01:42 - INFO - __main__ - Step 12887: {'lr': 0.0004933550817499638, 'samples': 2474304, 'steps': 12886, 'loss/train': 1.2631752490997314} 11/06/2021 23:01:42 - INFO - __main__ - Step 12888: {'lr': 0.00049335386631319, 'samples': 2474496, 'steps': 12887, 'loss/train': 2.017979621887207} 11/06/2021 23:01:43 - INFO - __main__ - Step 12889: {'lr': 0.0004933526507667648, 'samples': 2474688, 'steps': 12888, 'loss/train': 2.223954200744629} 11/06/2021 23:01:44 - INFO - __main__ - Step 12890: {'lr': 0.0004933514351106885, 'samples': 2474880, 'steps': 12889, 'loss/train': 1.3178850412368774} 11/06/2021 23:01:44 - INFO - __main__ - Step 12891: {'lr': 0.0004933502193449618, 'samples': 2475072, 'steps': 12890, 'loss/train': 1.6767654418945312} 11/06/2021 23:01:44 - INFO - __main__ - Step 12892: {'lr': 0.0004933490034695853, 'samples': 2475264, 'steps': 12891, 'loss/train': 1.598570466041565} 11/06/2021 23:01:45 - INFO - __main__ - Step 12893: {'lr': 0.0004933477874845595, 'samples': 2475456, 'steps': 12892, 'loss/train': 1.9039117097854614} 11/06/2021 23:01:46 - INFO - __main__ - Step 12894: {'lr': 0.000493346571389885, 'samples': 2475648, 'steps': 12893, 'loss/train': 1.4883739948272705} 11/06/2021 23:01:46 - INFO - __main__ - Step 12895: {'lr': 0.0004933453551855622, 'samples': 2475840, 'steps': 12894, 'loss/train': 1.6035698652267456} 11/06/2021 23:01:47 - INFO - __main__ - Step 12896: {'lr': 0.0004933441388715919, 'samples': 2476032, 'steps': 12895, 'loss/train': 1.7423779964447021} 11/06/2021 23:01:47 - INFO - __main__ - Step 12897: {'lr': 0.0004933429224479743, 'samples': 2476224, 'steps': 12896, 'loss/train': 1.6877235174179077} 11/06/2021 23:01:48 - INFO - __main__ - Step 12898: {'lr': 0.0004933417059147102, 'samples': 2476416, 'steps': 12897, 'loss/train': 1.5222456455230713} 11/06/2021 23:01:48 - INFO - __main__ - Step 12899: {'lr': 0.0004933404892718, 'samples': 2476608, 'steps': 12898, 'loss/train': 0.5296525955200195} 11/06/2021 23:01:50 - INFO - __main__ - Step 12900: {'lr': 0.0004933392725192444, 'samples': 2476800, 'steps': 12899, 'loss/train': 2.3114047050476074} 11/06/2021 23:01:50 - INFO - __main__ - Step 12901: {'lr': 0.000493338055657044, 'samples': 2476992, 'steps': 12900, 'loss/train': 1.6073651313781738} 11/06/2021 23:01:50 - INFO - __main__ - Step 12902: {'lr': 0.0004933368386851991, 'samples': 2477184, 'steps': 12901, 'loss/train': 0.23120321333408356} 11/06/2021 23:01:51 - INFO - __main__ - Step 12903: {'lr': 0.0004933356216037104, 'samples': 2477376, 'steps': 12902, 'loss/train': 1.398972749710083} 11/06/2021 23:01:51 - INFO - __main__ - Step 12904: {'lr': 0.0004933344044125784, 'samples': 2477568, 'steps': 12903, 'loss/train': 1.585076093673706} 11/06/2021 23:01:52 - INFO - __main__ - Step 12905: {'lr': 0.0004933331871118037, 'samples': 2477760, 'steps': 12904, 'loss/train': 1.9329192638397217} 11/06/2021 23:01:52 - INFO - __main__ - Step 12906: {'lr': 0.0004933319697013869, 'samples': 2477952, 'steps': 12905, 'loss/train': 1.082170844078064} 11/06/2021 23:01:53 - INFO - __main__ - Step 12907: {'lr': 0.0004933307521813282, 'samples': 2478144, 'steps': 12906, 'loss/train': 1.3846449851989746} 11/06/2021 23:01:53 - INFO - __main__ - Step 12908: {'lr': 0.0004933295345516287, 'samples': 2478336, 'steps': 12907, 'loss/train': 1.5094048976898193} 11/06/2021 23:01:53 - INFO - __main__ - Step 12909: {'lr': 0.0004933283168122886, 'samples': 2478528, 'steps': 12908, 'loss/train': 1.7825679779052734} 11/06/2021 23:01:54 - INFO - __main__ - Step 12910: {'lr': 0.0004933270989633084, 'samples': 2478720, 'steps': 12909, 'loss/train': 2.0155463218688965} 11/06/2021 23:01:55 - INFO - __main__ - Step 12911: {'lr': 0.0004933258810046889, 'samples': 2478912, 'steps': 12910, 'loss/train': 1.8153250217437744} 11/06/2021 23:01:55 - INFO - __main__ - Step 12912: {'lr': 0.0004933246629364304, 'samples': 2479104, 'steps': 12911, 'loss/train': 1.8701039552688599} 11/06/2021 23:01:55 - INFO - __main__ - Step 12913: {'lr': 0.0004933234447585337, 'samples': 2479296, 'steps': 12912, 'loss/train': 1.6900960206985474} 11/06/2021 23:01:56 - INFO - __main__ - Step 12914: {'lr': 0.0004933222264709991, 'samples': 2479488, 'steps': 12913, 'loss/train': 1.2670124769210815} 11/06/2021 23:01:57 - INFO - __main__ - Step 12915: {'lr': 0.0004933210080738273, 'samples': 2479680, 'steps': 12914, 'loss/train': 1.0174174308776855} 11/06/2021 23:01:57 - INFO - __main__ - Step 12916: {'lr': 0.0004933197895670187, 'samples': 2479872, 'steps': 12915, 'loss/train': 1.1427866220474243} 11/06/2021 23:01:58 - INFO - __main__ - Step 12917: {'lr': 0.0004933185709505741, 'samples': 2480064, 'steps': 12916, 'loss/train': 1.5414581298828125} 11/06/2021 23:01:58 - INFO - __main__ - Step 12918: {'lr': 0.0004933173522244939, 'samples': 2480256, 'steps': 12917, 'loss/train': 1.7401179075241089} 11/06/2021 23:01:58 - INFO - __main__ - Step 12919: {'lr': 0.0004933161333887786, 'samples': 2480448, 'steps': 12918, 'loss/train': 0.29492583870887756} 11/06/2021 23:02:00 - INFO - __main__ - Step 12920: {'lr': 0.0004933149144434288, 'samples': 2480640, 'steps': 12919, 'loss/train': 1.3319393396377563} 11/06/2021 23:02:00 - INFO - __main__ - Step 12921: {'lr': 0.0004933136953884451, 'samples': 2480832, 'steps': 12920, 'loss/train': 1.2883689403533936} 11/06/2021 23:02:00 - INFO - __main__ - Step 12922: {'lr': 0.0004933124762238279, 'samples': 2481024, 'steps': 12921, 'loss/train': 1.720632791519165} 11/06/2021 23:02:01 - INFO - __main__ - Step 12923: {'lr': 0.000493311256949578, 'samples': 2481216, 'steps': 12922, 'loss/train': 1.5766397714614868} 11/06/2021 23:02:01 - INFO - __main__ - Step 12924: {'lr': 0.0004933100375656957, 'samples': 2481408, 'steps': 12923, 'loss/train': 1.3622182607650757} 11/06/2021 23:02:02 - INFO - __main__ - Step 12925: {'lr': 0.0004933088180721817, 'samples': 2481600, 'steps': 12924, 'loss/train': 1.876354694366455} 11/06/2021 23:02:02 - INFO - __main__ - Step 12926: {'lr': 0.0004933075984690365, 'samples': 2481792, 'steps': 12925, 'loss/train': 1.9899784326553345} 11/06/2021 23:02:03 - INFO - __main__ - Step 12927: {'lr': 0.0004933063787562606, 'samples': 2481984, 'steps': 12926, 'loss/train': 1.0341933965682983} 11/06/2021 23:02:03 - INFO - __main__ - Step 12928: {'lr': 0.0004933051589338547, 'samples': 2482176, 'steps': 12927, 'loss/train': 1.815047025680542} 11/06/2021 23:02:03 - INFO - __main__ - Step 12929: {'lr': 0.0004933039390018192, 'samples': 2482368, 'steps': 12928, 'loss/train': 1.346707820892334} 11/06/2021 23:02:04 - INFO - __main__ - Step 12930: {'lr': 0.0004933027189601547, 'samples': 2482560, 'steps': 12929, 'loss/train': 1.6749612092971802} 11/06/2021 23:02:05 - INFO - __main__ - Step 12931: {'lr': 0.0004933014988088616, 'samples': 2482752, 'steps': 12930, 'loss/train': 1.7851786613464355} 11/06/2021 23:02:05 - INFO - __main__ - Step 12932: {'lr': 0.0004933002785479408, 'samples': 2482944, 'steps': 12931, 'loss/train': 1.2040504217147827} 11/06/2021 23:02:06 - INFO - __main__ - Step 12933: {'lr': 0.0004932990581773926, 'samples': 2483136, 'steps': 12932, 'loss/train': 1.582513451576233} 11/06/2021 23:02:06 - INFO - __main__ - Step 12934: {'lr': 0.0004932978376972175, 'samples': 2483328, 'steps': 12933, 'loss/train': 0.32476386427879333} 11/06/2021 23:02:06 - INFO - __main__ - Step 12935: {'lr': 0.0004932966171074163, 'samples': 2483520, 'steps': 12934, 'loss/train': 1.2251867055892944} 11/06/2021 23:02:07 - INFO - __main__ - Step 12936: {'lr': 0.0004932953964079893, 'samples': 2483712, 'steps': 12935, 'loss/train': 1.569036841392517} 11/06/2021 23:02:08 - INFO - __main__ - Step 12937: {'lr': 0.0004932941755989372, 'samples': 2483904, 'steps': 12936, 'loss/train': 1.9879359006881714} 11/06/2021 23:02:08 - INFO - __main__ - Step 12938: {'lr': 0.0004932929546802605, 'samples': 2484096, 'steps': 12937, 'loss/train': 1.4933514595031738} 11/06/2021 23:02:08 - INFO - __main__ - Step 12939: {'lr': 0.0004932917336519597, 'samples': 2484288, 'steps': 12938, 'loss/train': 1.2084016799926758} 11/06/2021 23:02:09 - INFO - __main__ - Step 12940: {'lr': 0.0004932905125140354, 'samples': 2484480, 'steps': 12939, 'loss/train': 1.5175234079360962} 11/06/2021 23:02:10 - INFO - __main__ - Step 12941: {'lr': 0.0004932892912664882, 'samples': 2484672, 'steps': 12940, 'loss/train': 1.7356916666030884} 11/06/2021 23:02:10 - INFO - __main__ - Step 12942: {'lr': 0.0004932880699093186, 'samples': 2484864, 'steps': 12941, 'loss/train': 1.6481016874313354} 11/06/2021 23:02:10 - INFO - __main__ - Step 12943: {'lr': 0.0004932868484425271, 'samples': 2485056, 'steps': 12942, 'loss/train': 1.8215060234069824} 11/06/2021 23:02:11 - INFO - __main__ - Step 12944: {'lr': 0.0004932856268661143, 'samples': 2485248, 'steps': 12943, 'loss/train': 1.5122387409210205} 11/06/2021 23:02:11 - INFO - __main__ - Step 12945: {'lr': 0.0004932844051800808, 'samples': 2485440, 'steps': 12944, 'loss/train': 1.9089007377624512} 11/06/2021 23:02:12 - INFO - __main__ - Step 12946: {'lr': 0.000493283183384427, 'samples': 2485632, 'steps': 12945, 'loss/train': 1.1102064847946167} 11/06/2021 23:02:13 - INFO - __main__ - Step 12947: {'lr': 0.0004932819614791537, 'samples': 2485824, 'steps': 12946, 'loss/train': 1.6587165594100952} 11/06/2021 23:02:13 - INFO - __main__ - Step 12948: {'lr': 0.0004932807394642612, 'samples': 2486016, 'steps': 12947, 'loss/train': 1.7909165620803833} 11/06/2021 23:02:13 - INFO - __main__ - Step 12949: {'lr': 0.0004932795173397501, 'samples': 2486208, 'steps': 12948, 'loss/train': 2.574796199798584} 11/06/2021 23:02:14 - INFO - __main__ - Step 12950: {'lr': 0.0004932782951056211, 'samples': 2486400, 'steps': 12949, 'loss/train': 2.0372557640075684} 11/06/2021 23:02:15 - INFO - __main__ - Step 12951: {'lr': 0.0004932770727618747, 'samples': 2486592, 'steps': 12950, 'loss/train': 3.691620349884033} 11/06/2021 23:02:15 - INFO - __main__ - Step 12952: {'lr': 0.0004932758503085114, 'samples': 2486784, 'steps': 12951, 'loss/train': 1.6988760232925415} 11/06/2021 23:02:15 - INFO - __main__ - Step 12953: {'lr': 0.0004932746277455317, 'samples': 2486976, 'steps': 12952, 'loss/train': 1.674808382987976} 11/06/2021 23:02:16 - INFO - __main__ - Step 12954: {'lr': 0.0004932734050729362, 'samples': 2487168, 'steps': 12953, 'loss/train': 1.2582000494003296} 11/06/2021 23:02:16 - INFO - __main__ - Step 12955: {'lr': 0.0004932721822907255, 'samples': 2487360, 'steps': 12954, 'loss/train': 1.9092744588851929} 11/06/2021 23:02:17 - INFO - __main__ - Step 12956: {'lr': 0.0004932709593989, 'samples': 2487552, 'steps': 12955, 'loss/train': 0.7507480978965759} 11/06/2021 23:02:17 - INFO - __main__ - Step 12957: {'lr': 0.0004932697363974604, 'samples': 2487744, 'steps': 12956, 'loss/train': 1.7332943677902222} 11/06/2021 23:02:18 - INFO - __main__ - Step 12958: {'lr': 0.0004932685132864072, 'samples': 2487936, 'steps': 12957, 'loss/train': 1.3571699857711792} 11/06/2021 23:02:18 - INFO - __main__ - Step 12959: {'lr': 0.0004932672900657411, 'samples': 2488128, 'steps': 12958, 'loss/train': 1.7823020219802856} 11/06/2021 23:02:18 - INFO - __main__ - Step 12960: {'lr': 0.0004932660667354623, 'samples': 2488320, 'steps': 12959, 'loss/train': 1.8756752014160156} 11/06/2021 23:02:20 - INFO - __main__ - Step 12961: {'lr': 0.0004932648432955717, 'samples': 2488512, 'steps': 12960, 'loss/train': 1.2283211946487427} 11/06/2021 23:02:20 - INFO - __main__ - Step 12962: {'lr': 0.0004932636197460698, 'samples': 2488704, 'steps': 12961, 'loss/train': 1.6361844539642334} 11/06/2021 23:02:20 - INFO - __main__ - Step 12963: {'lr': 0.0004932623960869569, 'samples': 2488896, 'steps': 12962, 'loss/train': 2.1100404262542725} 11/06/2021 23:02:21 - INFO - __main__ - Step 12964: {'lr': 0.0004932611723182338, 'samples': 2489088, 'steps': 12963, 'loss/train': 2.5909249782562256} 11/06/2021 23:02:21 - INFO - __main__ - Step 12965: {'lr': 0.000493259948439901, 'samples': 2489280, 'steps': 12964, 'loss/train': 1.8154479265213013} 11/06/2021 23:02:21 - INFO - __main__ - Step 12966: {'lr': 0.0004932587244519589, 'samples': 2489472, 'steps': 12965, 'loss/train': 1.5690723657608032} 11/06/2021 23:02:22 - INFO - __main__ - Step 12967: {'lr': 0.0004932575003544083, 'samples': 2489664, 'steps': 12966, 'loss/train': 1.8327417373657227} 11/06/2021 23:02:23 - INFO - __main__ - Step 12968: {'lr': 0.0004932562761472496, 'samples': 2489856, 'steps': 12967, 'loss/train': 1.6555702686309814} 11/06/2021 23:02:23 - INFO - __main__ - Step 12969: {'lr': 0.0004932550518304833, 'samples': 2490048, 'steps': 12968, 'loss/train': 1.708749532699585} 11/06/2021 23:02:23 - INFO - __main__ - Step 12970: {'lr': 0.0004932538274041101, 'samples': 2490240, 'steps': 12969, 'loss/train': 1.741576910018921} 11/06/2021 23:02:24 - INFO - __main__ - Step 12971: {'lr': 0.0004932526028681304, 'samples': 2490432, 'steps': 12970, 'loss/train': 1.9435254335403442} 11/06/2021 23:02:25 - INFO - __main__ - Step 12972: {'lr': 0.0004932513782225449, 'samples': 2490624, 'steps': 12971, 'loss/train': 1.1495224237442017} 11/06/2021 23:02:26 - INFO - __main__ - Step 12973: {'lr': 0.000493250153467354, 'samples': 2490816, 'steps': 12972, 'loss/train': 1.953014850616455} 11/06/2021 23:02:26 - INFO - __main__ - Step 12974: {'lr': 0.0004932489286025584, 'samples': 2491008, 'steps': 12973, 'loss/train': 1.809768557548523} 11/06/2021 23:02:26 - INFO - __main__ - Step 12975: {'lr': 0.0004932477036281586, 'samples': 2491200, 'steps': 12974, 'loss/train': 1.7049349546432495} 11/06/2021 23:02:27 - INFO - __main__ - Step 12976: {'lr': 0.0004932464785441552, 'samples': 2491392, 'steps': 12975, 'loss/train': 1.7377121448516846} 11/06/2021 23:02:27 - INFO - __main__ - Step 12977: {'lr': 0.0004932452533505486, 'samples': 2491584, 'steps': 12976, 'loss/train': 1.471701979637146} 11/06/2021 23:02:27 - INFO - __main__ - Step 12978: {'lr': 0.0004932440280473395, 'samples': 2491776, 'steps': 12977, 'loss/train': 1.8254519701004028} 11/06/2021 23:02:28 - INFO - __main__ - Step 12979: {'lr': 0.0004932428026345282, 'samples': 2491968, 'steps': 12978, 'loss/train': 1.869467854499817} 11/06/2021 23:02:29 - INFO - __main__ - Step 12980: {'lr': 0.0004932415771121157, 'samples': 2492160, 'steps': 12979, 'loss/train': 2.1289100646972656} 11/06/2021 23:02:29 - INFO - __main__ - Step 12981: {'lr': 0.0004932403514801021, 'samples': 2492352, 'steps': 12980, 'loss/train': 1.9706026315689087} 11/06/2021 23:02:29 - INFO - __main__ - Step 12982: {'lr': 0.0004932391257384883, 'samples': 2492544, 'steps': 12981, 'loss/train': 1.9861183166503906} 11/06/2021 23:02:30 - INFO - __main__ - Step 12983: {'lr': 0.0004932378998872746, 'samples': 2492736, 'steps': 12982, 'loss/train': 2.142361640930176} 11/06/2021 23:02:31 - INFO - __main__ - Step 12984: {'lr': 0.0004932366739264618, 'samples': 2492928, 'steps': 12983, 'loss/train': 0.2792867422103882} 11/06/2021 23:02:31 - INFO - __main__ - Step 12985: {'lr': 0.0004932354478560502, 'samples': 2493120, 'steps': 12984, 'loss/train': 1.9411041736602783} 11/06/2021 23:02:31 - INFO - __main__ - Step 12986: {'lr': 0.0004932342216760405, 'samples': 2493312, 'steps': 12985, 'loss/train': 1.882585883140564} 11/06/2021 23:02:32 - INFO - __main__ - Step 12987: {'lr': 0.0004932329953864331, 'samples': 2493504, 'steps': 12986, 'loss/train': 1.9840530157089233} 11/06/2021 23:02:32 - INFO - __main__ - Step 12988: {'lr': 0.0004932317689872287, 'samples': 2493696, 'steps': 12987, 'loss/train': 1.5482817888259888} 11/06/2021 23:02:33 - INFO - __main__ - Step 12989: {'lr': 0.000493230542478428, 'samples': 2493888, 'steps': 12988, 'loss/train': 1.229072093963623} 11/06/2021 23:02:34 - INFO - __main__ - Step 12990: {'lr': 0.0004932293158600312, 'samples': 2494080, 'steps': 12989, 'loss/train': 2.037853717803955} 11/06/2021 23:02:34 - INFO - __main__ - Step 12991: {'lr': 0.0004932280891320391, 'samples': 2494272, 'steps': 12990, 'loss/train': 1.9727879762649536} 11/06/2021 23:02:34 - INFO - __main__ - Step 12992: {'lr': 0.0004932268622944521, 'samples': 2494464, 'steps': 12991, 'loss/train': 2.1666083335876465} 11/06/2021 23:02:35 - INFO - __main__ - Step 12993: {'lr': 0.0004932256353472709, 'samples': 2494656, 'steps': 12992, 'loss/train': 1.4905357360839844} 11/06/2021 23:02:36 - INFO - __main__ - Step 12994: {'lr': 0.0004932244082904959, 'samples': 2494848, 'steps': 12993, 'loss/train': 1.4867180585861206} 11/06/2021 23:02:36 - INFO - __main__ - Step 12995: {'lr': 0.0004932231811241278, 'samples': 2495040, 'steps': 12994, 'loss/train': 0.779282808303833} 11/06/2021 23:02:36 - INFO - __main__ - Step 12996: {'lr': 0.0004932219538481672, 'samples': 2495232, 'steps': 12995, 'loss/train': 1.931758165359497} 11/06/2021 23:02:37 - INFO - __main__ - Step 12997: {'lr': 0.0004932207264626143, 'samples': 2495424, 'steps': 12996, 'loss/train': 1.4099301099777222} 11/06/2021 23:02:37 - INFO - __main__ - Step 12998: {'lr': 0.00049321949896747, 'samples': 2495616, 'steps': 12997, 'loss/train': 1.4822603464126587} 11/06/2021 23:02:39 - INFO - __main__ - Step 12999: {'lr': 0.0004932182713627348, 'samples': 2495808, 'steps': 12998, 'loss/train': 1.0354619026184082} 11/06/2021 23:02:39 - INFO - __main__ - Step 13000: {'lr': 0.0004932170436484091, 'samples': 2496000, 'steps': 12999, 'loss/train': 0.504642903804779} 11/06/2021 23:02:39 - INFO - __main__ - Step 13001: {'lr': 0.0004932158158244937, 'samples': 2496192, 'steps': 13000, 'loss/train': 0.3501379191875458} 11/06/2021 23:02:40 - INFO - __main__ - Step 13002: {'lr': 0.0004932145878909889, 'samples': 2496384, 'steps': 13001, 'loss/train': 1.9297839403152466} 11/06/2021 23:02:40 - INFO - __main__ - Step 13003: {'lr': 0.0004932133598478953, 'samples': 2496576, 'steps': 13002, 'loss/train': 1.891198754310608} 11/06/2021 23:02:41 - INFO - __main__ - Step 13004: {'lr': 0.0004932121316952136, 'samples': 2496768, 'steps': 13003, 'loss/train': 1.5406192541122437} 11/06/2021 23:02:41 - INFO - __main__ - Step 13005: {'lr': 0.0004932109034329442, 'samples': 2496960, 'steps': 13004, 'loss/train': 1.6352239847183228} 11/06/2021 23:02:42 - INFO - __main__ - Step 13006: {'lr': 0.0004932096750610879, 'samples': 2497152, 'steps': 13005, 'loss/train': 1.8479753732681274} 11/06/2021 23:02:42 - INFO - __main__ - Step 13007: {'lr': 0.0004932084465796449, 'samples': 2497344, 'steps': 13006, 'loss/train': 1.7984743118286133} 11/06/2021 23:02:43 - INFO - __main__ - Step 13008: {'lr': 0.000493207217988616, 'samples': 2497536, 'steps': 13007, 'loss/train': 1.575510025024414} 11/06/2021 23:02:44 - INFO - __main__ - Step 13009: {'lr': 0.0004932059892880016, 'samples': 2497728, 'steps': 13008, 'loss/train': 1.1041618585586548} 11/06/2021 23:02:44 - INFO - __main__ - Step 13010: {'lr': 0.0004932047604778025, 'samples': 2497920, 'steps': 13009, 'loss/train': 0.6211431622505188} 11/06/2021 23:02:45 - INFO - __main__ - Step 13011: {'lr': 0.0004932035315580188, 'samples': 2498112, 'steps': 13010, 'loss/train': 2.066260576248169} 11/06/2021 23:02:45 - INFO - __main__ - Step 13012: {'lr': 0.0004932023025286516, 'samples': 2498304, 'steps': 13011, 'loss/train': 1.6674903631210327} 11/06/2021 23:02:45 - INFO - __main__ - Step 13013: {'lr': 0.0004932010733897012, 'samples': 2498496, 'steps': 13012, 'loss/train': 1.5579228401184082} 11/06/2021 23:02:46 - INFO - __main__ - Step 13014: {'lr': 0.000493199844141168, 'samples': 2498688, 'steps': 13013, 'loss/train': 1.8701536655426025} 11/06/2021 23:02:47 - INFO - __main__ - Step 13015: {'lr': 0.0004931986147830527, 'samples': 2498880, 'steps': 13014, 'loss/train': 1.726219654083252} 11/06/2021 23:02:47 - INFO - __main__ - Step 13016: {'lr': 0.000493197385315356, 'samples': 2499072, 'steps': 13015, 'loss/train': 1.4233412742614746} 11/06/2021 23:02:47 - INFO - __main__ - Step 13017: {'lr': 0.0004931961557380782, 'samples': 2499264, 'steps': 13016, 'loss/train': 1.2359046936035156} 11/06/2021 23:02:48 - INFO - __main__ - Step 13018: {'lr': 0.00049319492605122, 'samples': 2499456, 'steps': 13017, 'loss/train': 1.8722537755966187} 11/06/2021 23:02:48 - INFO - __main__ - Step 13019: {'lr': 0.000493193696254782, 'samples': 2499648, 'steps': 13018, 'loss/train': 1.5985851287841797} 11/06/2021 23:02:49 - INFO - __main__ - Step 13020: {'lr': 0.0004931924663487646, 'samples': 2499840, 'steps': 13019, 'loss/train': 1.6530154943466187} 11/06/2021 23:02:50 - INFO - __main__ - Step 13021: {'lr': 0.0004931912363331683, 'samples': 2500032, 'steps': 13020, 'loss/train': 1.8538419008255005} 11/06/2021 23:02:50 - INFO - __main__ - Step 13022: {'lr': 0.000493190006207994, 'samples': 2500224, 'steps': 13021, 'loss/train': 1.3083375692367554} 11/06/2021 23:02:50 - INFO - __main__ - Step 13023: {'lr': 0.0004931887759732419, 'samples': 2500416, 'steps': 13022, 'loss/train': 1.6392176151275635} 11/06/2021 23:02:51 - INFO - __main__ - Step 13024: {'lr': 0.0004931875456289128, 'samples': 2500608, 'steps': 13023, 'loss/train': 1.1660975217819214} 11/06/2021 23:02:52 - INFO - __main__ - Step 13025: {'lr': 0.000493186315175007, 'samples': 2500800, 'steps': 13024, 'loss/train': 1.6558512449264526} 11/06/2021 23:02:52 - INFO - __main__ - Step 13026: {'lr': 0.0004931850846115253, 'samples': 2500992, 'steps': 13025, 'loss/train': 1.7009488344192505} 11/06/2021 23:02:52 - INFO - __main__ - Step 13027: {'lr': 0.0004931838539384681, 'samples': 2501184, 'steps': 13026, 'loss/train': 1.87421452999115} 11/06/2021 23:02:53 - INFO - __main__ - Step 13028: {'lr': 0.0004931826231558361, 'samples': 2501376, 'steps': 13027, 'loss/train': 1.698980450630188} 11/06/2021 23:02:53 - INFO - __main__ - Step 13029: {'lr': 0.0004931813922636297, 'samples': 2501568, 'steps': 13028, 'loss/train': 2.1995387077331543} 11/06/2021 23:02:53 - INFO - __main__ - Step 13030: {'lr': 0.0004931801612618494, 'samples': 2501760, 'steps': 13029, 'loss/train': 0.9826458692550659} 11/06/2021 23:02:54 - INFO - __main__ - Step 13031: {'lr': 0.0004931789301504961, 'samples': 2501952, 'steps': 13030, 'loss/train': 1.9155287742614746} 11/06/2021 23:02:55 - INFO - __main__ - Step 13032: {'lr': 0.00049317769892957, 'samples': 2502144, 'steps': 13031, 'loss/train': 2.2658934593200684} 11/06/2021 23:02:55 - INFO - __main__ - Step 13033: {'lr': 0.0004931764675990718, 'samples': 2502336, 'steps': 13032, 'loss/train': 1.7716729640960693} 11/06/2021 23:02:56 - INFO - __main__ - Step 13034: {'lr': 0.000493175236159002, 'samples': 2502528, 'steps': 13033, 'loss/train': 2.120985269546509} 11/06/2021 23:02:56 - INFO - __main__ - Step 13035: {'lr': 0.0004931740046093612, 'samples': 2502720, 'steps': 13034, 'loss/train': 1.8945417404174805} 11/06/2021 23:02:57 - INFO - __main__ - Step 13036: {'lr': 0.0004931727729501499, 'samples': 2502912, 'steps': 13035, 'loss/train': 1.613181471824646} 11/06/2021 23:02:57 - INFO - __main__ - Step 13037: {'lr': 0.0004931715411813689, 'samples': 2503104, 'steps': 13036, 'loss/train': 1.5703870058059692} 11/06/2021 23:02:58 - INFO - __main__ - Step 13038: {'lr': 0.0004931703093030183, 'samples': 2503296, 'steps': 13037, 'loss/train': 1.6445674896240234} 11/06/2021 23:02:58 - INFO - __main__ - Step 13039: {'lr': 0.0004931690773150991, 'samples': 2503488, 'steps': 13038, 'loss/train': 1.3863145112991333} 11/06/2021 23:02:58 - INFO - __main__ - Step 13040: {'lr': 0.0004931678452176116, 'samples': 2503680, 'steps': 13039, 'loss/train': 1.5152404308319092} 11/06/2021 23:02:59 - INFO - __main__ - Step 13041: {'lr': 0.0004931666130105563, 'samples': 2503872, 'steps': 13040, 'loss/train': 1.6495338678359985} 11/06/2021 23:03:00 - INFO - __main__ - Step 13042: {'lr': 0.0004931653806939341, 'samples': 2504064, 'steps': 13041, 'loss/train': 1.6312777996063232} 11/06/2021 23:03:00 - INFO - __main__ - Step 13043: {'lr': 0.0004931641482677452, 'samples': 2504256, 'steps': 13042, 'loss/train': 1.6087427139282227} 11/06/2021 23:03:01 - INFO - __main__ - Step 13044: {'lr': 0.0004931629157319904, 'samples': 2504448, 'steps': 13043, 'loss/train': 1.2086093425750732} 11/06/2021 23:03:01 - INFO - __main__ - Step 13045: {'lr': 0.00049316168308667, 'samples': 2504640, 'steps': 13044, 'loss/train': 0.33910736441612244} 11/06/2021 23:03:02 - INFO - __main__ - Step 13046: {'lr': 0.0004931604503317846, 'samples': 2504832, 'steps': 13045, 'loss/train': 1.6171785593032837} 11/06/2021 23:03:02 - INFO - __main__ - Step 13047: {'lr': 0.0004931592174673351, 'samples': 2505024, 'steps': 13046, 'loss/train': 1.6778055429458618} 11/06/2021 23:03:03 - INFO - __main__ - Step 13048: {'lr': 0.0004931579844933218, 'samples': 2505216, 'steps': 13047, 'loss/train': 1.8838071823120117} 11/06/2021 23:03:03 - INFO - __main__ - Step 13049: {'lr': 0.0004931567514097451, 'samples': 2505408, 'steps': 13048, 'loss/train': 1.89003586769104} 11/06/2021 23:03:03 - INFO - __main__ - Step 13050: {'lr': 0.0004931555182166059, 'samples': 2505600, 'steps': 13049, 'loss/train': 1.5584383010864258} 11/06/2021 23:03:04 - INFO - __main__ - Step 13051: {'lr': 0.0004931542849139044, 'samples': 2505792, 'steps': 13050, 'loss/train': 2.8212499618530273} 11/06/2021 23:03:05 - INFO - __main__ - Step 13052: {'lr': 0.0004931530515016415, 'samples': 2505984, 'steps': 13051, 'loss/train': 1.6236906051635742} 11/06/2021 23:03:05 - INFO - __main__ - Step 13053: {'lr': 0.0004931518179798175, 'samples': 2506176, 'steps': 13052, 'loss/train': 1.8090659379959106} 11/06/2021 23:03:05 - INFO - __main__ - Step 13054: {'lr': 0.000493150584348433, 'samples': 2506368, 'steps': 13053, 'loss/train': 1.5024513006210327} 11/06/2021 23:03:06 - INFO - __main__ - Step 13055: {'lr': 0.0004931493506074886, 'samples': 2506560, 'steps': 13054, 'loss/train': 1.271149754524231} 11/06/2021 23:03:07 - INFO - __main__ - Step 13056: {'lr': 0.0004931481167569849, 'samples': 2506752, 'steps': 13055, 'loss/train': 1.1166818141937256} 11/06/2021 23:03:07 - INFO - __main__ - Step 13057: {'lr': 0.0004931468827969223, 'samples': 2506944, 'steps': 13056, 'loss/train': 1.8440206050872803} 11/06/2021 23:03:08 - INFO - __main__ - Step 13058: {'lr': 0.0004931456487273017, 'samples': 2507136, 'steps': 13057, 'loss/train': 1.7418556213378906} 11/06/2021 23:03:08 - INFO - __main__ - Step 13059: {'lr': 0.0004931444145481233, 'samples': 2507328, 'steps': 13058, 'loss/train': 1.3444815874099731} 11/06/2021 23:03:08 - INFO - __main__ - Step 13060: {'lr': 0.0004931431802593877, 'samples': 2507520, 'steps': 13059, 'loss/train': 2.082310438156128} 11/06/2021 23:03:09 - INFO - __main__ - Step 13061: {'lr': 0.0004931419458610956, 'samples': 2507712, 'steps': 13060, 'loss/train': 0.22066687047481537} 11/06/2021 23:03:10 - INFO - __main__ - Step 13062: {'lr': 0.0004931407113532476, 'samples': 2507904, 'steps': 13061, 'loss/train': 1.8766014575958252} 11/06/2021 23:03:10 - INFO - __main__ - Step 13063: {'lr': 0.000493139476735844, 'samples': 2508096, 'steps': 13062, 'loss/train': 2.6808207035064697} 11/06/2021 23:03:10 - INFO - __main__ - Step 13064: {'lr': 0.0004931382420088855, 'samples': 2508288, 'steps': 13063, 'loss/train': 1.8290174007415771} 11/06/2021 23:03:11 - INFO - __main__ - Step 13065: {'lr': 0.0004931370071723728, 'samples': 2508480, 'steps': 13064, 'loss/train': 1.6822478771209717} 11/06/2021 23:03:11 - INFO - __main__ - Step 13066: {'lr': 0.0004931357722263061, 'samples': 2508672, 'steps': 13065, 'loss/train': 1.7262883186340332} 11/06/2021 23:03:12 - INFO - __main__ - Step 13067: {'lr': 0.0004931345371706863, 'samples': 2508864, 'steps': 13066, 'loss/train': 1.8077123165130615} 11/06/2021 23:03:13 - INFO - __main__ - Step 13068: {'lr': 0.0004931333020055139, 'samples': 2509056, 'steps': 13067, 'loss/train': 1.600835919380188} 11/06/2021 23:03:13 - INFO - __main__ - Step 13069: {'lr': 0.0004931320667307893, 'samples': 2509248, 'steps': 13068, 'loss/train': 2.081484794616699} 11/06/2021 23:03:13 - INFO - __main__ - Step 13070: {'lr': 0.0004931308313465132, 'samples': 2509440, 'steps': 13069, 'loss/train': 1.9141942262649536} 11/06/2021 23:03:14 - INFO - __main__ - Step 13071: {'lr': 0.000493129595852686, 'samples': 2509632, 'steps': 13070, 'loss/train': 3.5768423080444336} 11/06/2021 23:03:15 - INFO - __main__ - Step 13072: {'lr': 0.0004931283602493084, 'samples': 2509824, 'steps': 13071, 'loss/train': 1.5225051641464233} 11/06/2021 23:03:15 - INFO - __main__ - Step 13073: {'lr': 0.0004931271245363809, 'samples': 2510016, 'steps': 13072, 'loss/train': 1.574705958366394} 11/06/2021 23:03:15 - INFO - __main__ - Step 13074: {'lr': 0.0004931258887139041, 'samples': 2510208, 'steps': 13073, 'loss/train': 1.5260777473449707} 11/06/2021 23:03:16 - INFO - __main__ - Step 13075: {'lr': 0.0004931246527818785, 'samples': 2510400, 'steps': 13074, 'loss/train': 1.26536226272583} 11/06/2021 23:03:16 - INFO - __main__ - Step 13076: {'lr': 0.0004931234167403047, 'samples': 2510592, 'steps': 13075, 'loss/train': 1.4206626415252686} 11/06/2021 23:03:17 - INFO - __main__ - Step 13077: {'lr': 0.0004931221805891833, 'samples': 2510784, 'steps': 13076, 'loss/train': 1.6260327100753784} 11/06/2021 23:03:17 - INFO - __main__ - Step 13078: {'lr': 0.0004931209443285147, 'samples': 2510976, 'steps': 13077, 'loss/train': 1.759495735168457} 11/06/2021 23:03:18 - INFO - __main__ - Step 13079: {'lr': 0.0004931197079582996, 'samples': 2511168, 'steps': 13078, 'loss/train': 1.1801636219024658} 11/06/2021 23:03:18 - INFO - __main__ - Step 13080: {'lr': 0.0004931184714785385, 'samples': 2511360, 'steps': 13079, 'loss/train': 1.9777021408081055} 11/06/2021 23:03:18 - INFO - __main__ - Step 13081: {'lr': 0.000493117234889232, 'samples': 2511552, 'steps': 13080, 'loss/train': 1.551526665687561} 11/06/2021 23:03:20 - INFO - __main__ - Step 13082: {'lr': 0.0004931159981903805, 'samples': 2511744, 'steps': 13081, 'loss/train': 1.3697116374969482} 11/06/2021 23:03:20 - INFO - __main__ - Step 13083: {'lr': 0.0004931147613819848, 'samples': 2511936, 'steps': 13082, 'loss/train': 1.5000561475753784} 11/06/2021 23:03:20 - INFO - __main__ - Step 13084: {'lr': 0.0004931135244640453, 'samples': 2512128, 'steps': 13083, 'loss/train': 1.6628344058990479} 11/06/2021 23:03:21 - INFO - __main__ - Step 13085: {'lr': 0.0004931122874365627, 'samples': 2512320, 'steps': 13084, 'loss/train': 1.448068618774414} 11/06/2021 23:03:21 - INFO - __main__ - Step 13086: {'lr': 0.0004931110502995374, 'samples': 2512512, 'steps': 13085, 'loss/train': 1.518585443496704} 11/06/2021 23:03:22 - INFO - __main__ - Step 13087: {'lr': 0.0004931098130529699, 'samples': 2512704, 'steps': 13086, 'loss/train': 2.025705099105835} 11/06/2021 23:03:22 - INFO - __main__ - Step 13088: {'lr': 0.000493108575696861, 'samples': 2512896, 'steps': 13087, 'loss/train': 1.6481513977050781} 11/06/2021 23:03:23 - INFO - __main__ - Step 13089: {'lr': 0.0004931073382312111, 'samples': 2513088, 'steps': 13088, 'loss/train': 1.394457221031189} 11/06/2021 23:03:23 - INFO - __main__ - Step 13090: {'lr': 0.0004931061006560207, 'samples': 2513280, 'steps': 13089, 'loss/train': 1.560387134552002} 11/06/2021 23:03:23 - INFO - __main__ - Step 13091: {'lr': 0.0004931048629712905, 'samples': 2513472, 'steps': 13090, 'loss/train': 1.8698866367340088} 11/06/2021 23:03:24 - INFO - __main__ - Step 13092: {'lr': 0.000493103625177021, 'samples': 2513664, 'steps': 13091, 'loss/train': 2.115483522415161} 11/06/2021 23:03:25 - INFO - __main__ - Step 13093: {'lr': 0.0004931023872732128, 'samples': 2513856, 'steps': 13092, 'loss/train': 1.7177084684371948} 11/06/2021 23:03:25 - INFO - __main__ - Step 13094: {'lr': 0.0004931011492598664, 'samples': 2514048, 'steps': 13093, 'loss/train': 0.9922665953636169} 11/06/2021 23:03:25 - INFO - __main__ - Step 13095: {'lr': 0.0004930999111369824, 'samples': 2514240, 'steps': 13094, 'loss/train': 1.6571738719940186} 11/06/2021 23:03:26 - INFO - __main__ - Step 13096: {'lr': 0.0004930986729045613, 'samples': 2514432, 'steps': 13095, 'loss/train': 1.8621472120285034} 11/06/2021 23:03:26 - INFO - __main__ - Step 13097: {'lr': 0.0004930974345626036, 'samples': 2514624, 'steps': 13096, 'loss/train': 1.177014708518982} 11/06/2021 23:03:27 - INFO - __main__ - Step 13098: {'lr': 0.00049309619611111, 'samples': 2514816, 'steps': 13097, 'loss/train': 1.6075189113616943} 11/06/2021 23:03:28 - INFO - __main__ - Step 13099: {'lr': 0.000493094957550081, 'samples': 2515008, 'steps': 13098, 'loss/train': 1.3369756937026978} 11/06/2021 23:03:28 - INFO - __main__ - Step 13100: {'lr': 0.0004930937188795172, 'samples': 2515200, 'steps': 13099, 'loss/train': 1.7597624063491821} 11/06/2021 23:03:28 - INFO - __main__ - Step 13101: {'lr': 0.0004930924800994192, 'samples': 2515392, 'steps': 13100, 'loss/train': 1.1955498456954956} 11/06/2021 23:03:29 - INFO - __main__ - Step 13102: {'lr': 0.0004930912412097874, 'samples': 2515584, 'steps': 13101, 'loss/train': 2.0432753562927246} 11/06/2021 23:03:30 - INFO - __main__ - Step 13103: {'lr': 0.0004930900022106224, 'samples': 2515776, 'steps': 13102, 'loss/train': 1.897919774055481} 11/06/2021 23:03:30 - INFO - __main__ - Step 13104: {'lr': 0.0004930887631019248, 'samples': 2515968, 'steps': 13103, 'loss/train': 1.3024754524230957} 11/06/2021 23:03:30 - INFO - __main__ - Step 13105: {'lr': 0.0004930875238836951, 'samples': 2516160, 'steps': 13104, 'loss/train': 1.7677634954452515} 11/06/2021 23:03:31 - INFO - __main__ - Step 13106: {'lr': 0.000493086284555934, 'samples': 2516352, 'steps': 13105, 'loss/train': 2.061279773712158} 11/06/2021 23:03:31 - INFO - __main__ - Step 13107: {'lr': 0.0004930850451186421, 'samples': 2516544, 'steps': 13106, 'loss/train': 1.8709264993667603} 11/06/2021 23:03:32 - INFO - __main__ - Step 13108: {'lr': 0.0004930838055718196, 'samples': 2516736, 'steps': 13107, 'loss/train': 1.6638407707214355} 11/06/2021 23:03:32 - INFO - __main__ - Step 13109: {'lr': 0.0004930825659154674, 'samples': 2516928, 'steps': 13108, 'loss/train': 0.959062397480011} 11/06/2021 23:03:33 - INFO - __main__ - Step 13110: {'lr': 0.000493081326149586, 'samples': 2517120, 'steps': 13109, 'loss/train': 1.8210927248001099} 11/06/2021 23:03:33 - INFO - __main__ - Step 13111: {'lr': 0.0004930800862741758, 'samples': 2517312, 'steps': 13110, 'loss/train': 1.239546775817871} 11/06/2021 23:03:33 - INFO - __main__ - Step 13112: {'lr': 0.0004930788462892375, 'samples': 2517504, 'steps': 13111, 'loss/train': 1.4559482336044312} 11/06/2021 23:03:34 - INFO - __main__ - Step 13113: {'lr': 0.0004930776061947716, 'samples': 2517696, 'steps': 13112, 'loss/train': 1.6347291469573975} 11/06/2021 23:03:35 - INFO - __main__ - Step 13114: {'lr': 0.0004930763659907788, 'samples': 2517888, 'steps': 13113, 'loss/train': 1.7659398317337036} 11/06/2021 23:03:35 - INFO - __main__ - Step 13115: {'lr': 0.0004930751256772593, 'samples': 2518080, 'steps': 13114, 'loss/train': 1.8895224332809448} 11/06/2021 23:03:35 - INFO - __main__ - Step 13116: {'lr': 0.0004930738852542141, 'samples': 2518272, 'steps': 13115, 'loss/train': 1.4143397808074951} 11/06/2021 23:03:36 - INFO - __main__ - Step 13117: {'lr': 0.0004930726447216435, 'samples': 2518464, 'steps': 13116, 'loss/train': 2.018646240234375} 11/06/2021 23:03:37 - INFO - __main__ - Step 13118: {'lr': 0.0004930714040795481, 'samples': 2518656, 'steps': 13117, 'loss/train': 1.5822020769119263} 11/06/2021 23:03:37 - INFO - __main__ - Step 13119: {'lr': 0.0004930701633279285, 'samples': 2518848, 'steps': 13118, 'loss/train': 1.5482772588729858} 11/06/2021 23:03:38 - INFO - __main__ - Step 13120: {'lr': 0.0004930689224667853, 'samples': 2519040, 'steps': 13119, 'loss/train': 1.1191784143447876} 11/06/2021 23:03:38 - INFO - __main__ - Step 13121: {'lr': 0.0004930676814961189, 'samples': 2519232, 'steps': 13120, 'loss/train': 1.5763609409332275} 11/06/2021 23:03:38 - INFO - __main__ - Step 13122: {'lr': 0.00049306644041593, 'samples': 2519424, 'steps': 13121, 'loss/train': 1.3790456056594849} 11/06/2021 23:03:40 - INFO - __main__ - Step 13123: {'lr': 0.0004930651992262191, 'samples': 2519616, 'steps': 13122, 'loss/train': 1.3503462076187134} 11/06/2021 23:03:40 - INFO - __main__ - Step 13124: {'lr': 0.0004930639579269866, 'samples': 2519808, 'steps': 13123, 'loss/train': 1.4450675249099731} 11/06/2021 23:03:41 - INFO - __main__ - Step 13125: {'lr': 0.0004930627165182335, 'samples': 2520000, 'steps': 13124, 'loss/train': 1.5567551851272583} 11/06/2021 23:03:41 - INFO - __main__ - Step 13126: {'lr': 0.00049306147499996, 'samples': 2520192, 'steps': 13125, 'loss/train': 1.7951613664627075} 11/06/2021 23:03:42 - INFO - __main__ - Step 13127: {'lr': 0.0004930602333721667, 'samples': 2520384, 'steps': 13126, 'loss/train': 1.6020070314407349} 11/06/2021 23:03:42 - INFO - __main__ - Step 13128: {'lr': 0.0004930589916348542, 'samples': 2520576, 'steps': 13127, 'loss/train': 1.8576689958572388} 11/06/2021 23:03:42 - INFO - __main__ - Step 13129: {'lr': 0.0004930577497880231, 'samples': 2520768, 'steps': 13128, 'loss/train': 1.8128331899642944} 11/06/2021 23:03:43 - INFO - __main__ - Step 13130: {'lr': 0.000493056507831674, 'samples': 2520960, 'steps': 13129, 'loss/train': 1.8191680908203125} 11/06/2021 23:03:44 - INFO - __main__ - Step 13131: {'lr': 0.0004930552657658073, 'samples': 2521152, 'steps': 13130, 'loss/train': 1.0424447059631348} 11/06/2021 23:03:44 - INFO - __main__ - Step 13132: {'lr': 0.0004930540235904237, 'samples': 2521344, 'steps': 13131, 'loss/train': 0.9502253532409668} 11/06/2021 23:03:44 - INFO - __main__ - Step 13133: {'lr': 0.0004930527813055237, 'samples': 2521536, 'steps': 13132, 'loss/train': 2.1043529510498047} 11/06/2021 23:03:45 - INFO - __main__ - Step 13134: {'lr': 0.0004930515389111078, 'samples': 2521728, 'steps': 13133, 'loss/train': 1.6203279495239258} 11/06/2021 23:03:46 - INFO - __main__ - Step 13135: {'lr': 0.0004930502964071767, 'samples': 2521920, 'steps': 13134, 'loss/train': 1.872376561164856} 11/06/2021 23:03:46 - INFO - __main__ - Step 13136: {'lr': 0.0004930490537937309, 'samples': 2522112, 'steps': 13135, 'loss/train': 1.530863881111145} 11/06/2021 23:03:46 - INFO - __main__ - Step 13137: {'lr': 0.0004930478110707709, 'samples': 2522304, 'steps': 13136, 'loss/train': 1.446254014968872} 11/06/2021 23:03:47 - INFO - __main__ - Step 13138: {'lr': 0.0004930465682382973, 'samples': 2522496, 'steps': 13137, 'loss/train': 2.3736038208007812} 11/06/2021 23:03:47 - INFO - __main__ - Step 13139: {'lr': 0.0004930453252963107, 'samples': 2522688, 'steps': 13138, 'loss/train': 1.7954354286193848} 11/06/2021 23:03:48 - INFO - __main__ - Step 13140: {'lr': 0.0004930440822448115, 'samples': 2522880, 'steps': 13139, 'loss/train': 1.4115676879882812} 11/06/2021 23:03:48 - INFO - __main__ - Step 13141: {'lr': 0.0004930428390838006, 'samples': 2523072, 'steps': 13140, 'loss/train': 1.7960898876190186} 11/06/2021 23:03:49 - INFO - __main__ - Step 13142: {'lr': 0.0004930415958132782, 'samples': 2523264, 'steps': 13141, 'loss/train': 1.5703538656234741} 11/06/2021 23:03:49 - INFO - __main__ - Step 13143: {'lr': 0.0004930403524332451, 'samples': 2523456, 'steps': 13142, 'loss/train': 1.3508336544036865} 11/06/2021 23:03:50 - INFO - __main__ - Step 13144: {'lr': 0.0004930391089437017, 'samples': 2523648, 'steps': 13143, 'loss/train': 1.7475541830062866} 11/06/2021 23:03:51 - INFO - __main__ - Step 13145: {'lr': 0.0004930378653446487, 'samples': 2523840, 'steps': 13144, 'loss/train': 1.8159362077713013} 11/06/2021 23:03:51 - INFO - __main__ - Step 13146: {'lr': 0.0004930366216360865, 'samples': 2524032, 'steps': 13145, 'loss/train': 1.6725317239761353} 11/06/2021 23:03:51 - INFO - __main__ - Step 13147: {'lr': 0.0004930353778180158, 'samples': 2524224, 'steps': 13146, 'loss/train': 1.5883946418762207} 11/06/2021 23:03:52 - INFO - __main__ - Step 13148: {'lr': 0.0004930341338904371, 'samples': 2524416, 'steps': 13147, 'loss/train': 1.889960765838623} 11/06/2021 23:03:52 - INFO - __main__ - Step 13149: {'lr': 0.000493032889853351, 'samples': 2524608, 'steps': 13148, 'loss/train': 1.9227193593978882} 11/06/2021 23:03:53 - INFO - __main__ - Step 13150: {'lr': 0.0004930316457067579, 'samples': 2524800, 'steps': 13149, 'loss/train': 1.51223886013031} 11/06/2021 23:03:53 - INFO - __main__ - Step 13151: {'lr': 0.0004930304014506586, 'samples': 2524992, 'steps': 13150, 'loss/train': 1.603259801864624} 11/06/2021 23:03:54 - INFO - __main__ - Step 13152: {'lr': 0.0004930291570850536, 'samples': 2525184, 'steps': 13151, 'loss/train': 1.576635479927063} 11/06/2021 23:03:54 - INFO - __main__ - Step 13153: {'lr': 0.0004930279126099433, 'samples': 2525376, 'steps': 13152, 'loss/train': 1.0832215547561646} 11/06/2021 23:03:54 - INFO - __main__ - Step 13154: {'lr': 0.0004930266680253284, 'samples': 2525568, 'steps': 13153, 'loss/train': 1.7394440174102783} 11/06/2021 23:03:55 - INFO - __main__ - Step 13155: {'lr': 0.0004930254233312095, 'samples': 2525760, 'steps': 13154, 'loss/train': 1.2529510259628296} 11/06/2021 23:03:56 - INFO - __main__ - Step 13156: {'lr': 0.000493024178527587, 'samples': 2525952, 'steps': 13155, 'loss/train': 2.2869997024536133} 11/06/2021 23:03:56 - INFO - __main__ - Step 13157: {'lr': 0.0004930229336144616, 'samples': 2526144, 'steps': 13156, 'loss/train': 1.7819799184799194} 11/06/2021 23:03:56 - INFO - __main__ - Step 13158: {'lr': 0.0004930216885918339, 'samples': 2526336, 'steps': 13157, 'loss/train': 1.7473498582839966} 11/06/2021 23:03:57 - INFO - __main__ - Step 13159: {'lr': 0.0004930204434597042, 'samples': 2526528, 'steps': 13158, 'loss/train': 1.48399019241333} 11/06/2021 23:03:57 - INFO - __main__ - Step 13160: {'lr': 0.0004930191982180734, 'samples': 2526720, 'steps': 13159, 'loss/train': 1.5737640857696533} 11/06/2021 23:03:58 - INFO - __main__ - Step 13161: {'lr': 0.0004930179528669418, 'samples': 2526912, 'steps': 13160, 'loss/train': 2.018446683883667} 11/06/2021 23:03:59 - INFO - __main__ - Step 13162: {'lr': 0.0004930167074063101, 'samples': 2527104, 'steps': 13161, 'loss/train': 1.7737540006637573} 11/06/2021 23:03:59 - INFO - __main__ - Step 13163: {'lr': 0.0004930154618361789, 'samples': 2527296, 'steps': 13162, 'loss/train': 1.867935299873352} 11/06/2021 23:03:59 - INFO - __main__ - Step 13164: {'lr': 0.0004930142161565486, 'samples': 2527488, 'steps': 13163, 'loss/train': 2.0381977558135986} 11/06/2021 23:04:00 - INFO - __main__ - Step 13165: {'lr': 0.0004930129703674198, 'samples': 2527680, 'steps': 13164, 'loss/train': 1.8840910196304321} 11/06/2021 23:04:01 - INFO - __main__ - Step 13166: {'lr': 0.0004930117244687931, 'samples': 2527872, 'steps': 13165, 'loss/train': 1.4074156284332275} 11/06/2021 23:04:01 - INFO - __main__ - Step 13167: {'lr': 0.0004930104784606692, 'samples': 2528064, 'steps': 13166, 'loss/train': 2.063450813293457} 11/06/2021 23:04:01 - INFO - __main__ - Step 13168: {'lr': 0.0004930092323430484, 'samples': 2528256, 'steps': 13167, 'loss/train': 2.0765469074249268} 11/06/2021 23:04:02 - INFO - __main__ - Step 13169: {'lr': 0.0004930079861159315, 'samples': 2528448, 'steps': 13168, 'loss/train': 1.3868030309677124} 11/06/2021 23:04:02 - INFO - __main__ - Step 13170: {'lr': 0.0004930067397793188, 'samples': 2528640, 'steps': 13169, 'loss/train': 1.732783317565918} 11/06/2021 23:04:03 - INFO - __main__ - Step 13171: {'lr': 0.0004930054933332111, 'samples': 2528832, 'steps': 13170, 'loss/train': 1.683578610420227} 11/06/2021 23:04:03 - INFO - __main__ - Step 13172: {'lr': 0.0004930042467776089, 'samples': 2529024, 'steps': 13171, 'loss/train': 1.720359206199646} 11/06/2021 23:04:04 - INFO - __main__ - Step 13173: {'lr': 0.0004930030001125128, 'samples': 2529216, 'steps': 13172, 'loss/train': 1.2218900918960571} 11/06/2021 23:04:04 - INFO - __main__ - Step 13174: {'lr': 0.000493001753337923, 'samples': 2529408, 'steps': 13173, 'loss/train': 1.542048692703247} 11/06/2021 23:04:04 - INFO - __main__ - Step 13175: {'lr': 0.0004930005064538406, 'samples': 2529600, 'steps': 13174, 'loss/train': 1.7872868776321411} 11/06/2021 23:04:05 - INFO - __main__ - Step 13176: {'lr': 0.0004929992594602659, 'samples': 2529792, 'steps': 13175, 'loss/train': 1.104972243309021} 11/06/2021 23:04:06 - INFO - __main__ - Step 13177: {'lr': 0.0004929980123571995, 'samples': 2529984, 'steps': 13176, 'loss/train': 1.122475504875183} 11/06/2021 23:04:06 - INFO - __main__ - Step 13178: {'lr': 0.000492996765144642, 'samples': 2530176, 'steps': 13177, 'loss/train': 1.7249083518981934} 11/06/2021 23:04:07 - INFO - __main__ - Step 13179: {'lr': 0.0004929955178225938, 'samples': 2530368, 'steps': 13178, 'loss/train': 1.1565667390823364} 11/06/2021 23:04:07 - INFO - __main__ - Step 13180: {'lr': 0.0004929942703910556, 'samples': 2530560, 'steps': 13179, 'loss/train': 2.3176705837249756} 11/06/2021 23:04:07 - INFO - __main__ - Step 13181: {'lr': 0.0004929930228500279, 'samples': 2530752, 'steps': 13180, 'loss/train': 1.5887360572814941} 11/06/2021 23:04:09 - INFO - __main__ - Step 13182: {'lr': 0.0004929917751995114, 'samples': 2530944, 'steps': 13181, 'loss/train': 1.8893414735794067} 11/06/2021 23:04:09 - INFO - __main__ - Step 13183: {'lr': 0.0004929905274395064, 'samples': 2531136, 'steps': 13182, 'loss/train': 1.7040241956710815} 11/06/2021 23:04:09 - INFO - __main__ - Step 13184: {'lr': 0.0004929892795700137, 'samples': 2531328, 'steps': 13183, 'loss/train': 1.8834102153778076} 11/06/2021 23:04:10 - INFO - __main__ - Step 13185: {'lr': 0.0004929880315910338, 'samples': 2531520, 'steps': 13184, 'loss/train': 0.39340120553970337} 11/06/2021 23:04:10 - INFO - __main__ - Step 13186: {'lr': 0.0004929867835025672, 'samples': 2531712, 'steps': 13185, 'loss/train': 0.29085573554039} 11/06/2021 23:04:10 - INFO - __main__ - Step 13187: {'lr': 0.0004929855353046145, 'samples': 2531904, 'steps': 13186, 'loss/train': 1.7704241275787354} 11/06/2021 23:04:11 - INFO - __main__ - Step 13188: {'lr': 0.0004929842869971763, 'samples': 2532096, 'steps': 13187, 'loss/train': 1.63791024684906} 11/06/2021 23:04:12 - INFO - __main__ - Step 13189: {'lr': 0.000492983038580253, 'samples': 2532288, 'steps': 13188, 'loss/train': 0.9975517392158508} 11/06/2021 23:04:12 - INFO - __main__ - Step 13190: {'lr': 0.0004929817900538455, 'samples': 2532480, 'steps': 13189, 'loss/train': 1.4178115129470825} 11/06/2021 23:04:12 - INFO - __main__ - Step 13191: {'lr': 0.000492980541417954, 'samples': 2532672, 'steps': 13190, 'loss/train': 1.3342525959014893} 11/06/2021 23:04:13 - INFO - __main__ - Step 13192: {'lr': 0.0004929792926725794, 'samples': 2532864, 'steps': 13191, 'loss/train': 1.589043140411377} 11/06/2021 23:04:14 - INFO - __main__ - Step 13193: {'lr': 0.000492978043817722, 'samples': 2533056, 'steps': 13192, 'loss/train': 1.7581723928451538} 11/06/2021 23:04:14 - INFO - __main__ - Step 13194: {'lr': 0.0004929767948533823, 'samples': 2533248, 'steps': 13193, 'loss/train': 1.6882779598236084} 11/06/2021 23:04:15 - INFO - __main__ - Step 13195: {'lr': 0.0004929755457795612, 'samples': 2533440, 'steps': 13194, 'loss/train': 1.444657802581787} 11/06/2021 23:04:15 - INFO - __main__ - Step 13196: {'lr': 0.0004929742965962589, 'samples': 2533632, 'steps': 13195, 'loss/train': 1.9157907962799072} 11/06/2021 23:04:15 - INFO - __main__ - Step 13197: {'lr': 0.0004929730473034763, 'samples': 2533824, 'steps': 13196, 'loss/train': 1.561155915260315} 11/06/2021 23:04:16 - INFO - __main__ - Step 13198: {'lr': 0.0004929717979012136, 'samples': 2534016, 'steps': 13197, 'loss/train': 1.3841657638549805} 11/06/2021 23:04:17 - INFO - __main__ - Step 13199: {'lr': 0.0004929705483894717, 'samples': 2534208, 'steps': 13198, 'loss/train': 1.5719326734542847} 11/06/2021 23:04:17 - INFO - __main__ - Step 13200: {'lr': 0.000492969298768251, 'samples': 2534400, 'steps': 13199, 'loss/train': 1.4586410522460938} 11/06/2021 23:04:17 - INFO - __main__ - Step 13201: {'lr': 0.000492968049037552, 'samples': 2534592, 'steps': 13200, 'loss/train': 1.2632533311843872} 11/06/2021 23:04:18 - INFO - __main__ - Step 13202: {'lr': 0.0004929667991973754, 'samples': 2534784, 'steps': 13201, 'loss/train': 1.835030436515808} 11/06/2021 23:04:19 - INFO - __main__ - Step 13203: {'lr': 0.0004929655492477218, 'samples': 2534976, 'steps': 13202, 'loss/train': 2.025620937347412} 11/06/2021 23:04:19 - INFO - __main__ - Step 13204: {'lr': 0.0004929642991885916, 'samples': 2535168, 'steps': 13203, 'loss/train': 1.8089247941970825} 11/06/2021 23:04:20 - INFO - __main__ - Step 13205: {'lr': 0.0004929630490199854, 'samples': 2535360, 'steps': 13204, 'loss/train': 1.419128656387329} 11/06/2021 23:04:20 - INFO - __main__ - Step 13206: {'lr': 0.0004929617987419039, 'samples': 2535552, 'steps': 13205, 'loss/train': 1.9509419202804565} 11/06/2021 23:04:20 - INFO - __main__ - Step 13207: {'lr': 0.0004929605483543474, 'samples': 2535744, 'steps': 13206, 'loss/train': 1.7414957284927368} 11/06/2021 23:04:21 - INFO - __main__ - Step 13208: {'lr': 0.0004929592978573168, 'samples': 2535936, 'steps': 13207, 'loss/train': 1.3497000932693481} 11/06/2021 23:04:22 - INFO - __main__ - Step 13209: {'lr': 0.0004929580472508124, 'samples': 2536128, 'steps': 13208, 'loss/train': 1.4849274158477783} 11/06/2021 23:04:22 - INFO - __main__ - Step 13210: {'lr': 0.0004929567965348347, 'samples': 2536320, 'steps': 13209, 'loss/train': 1.7661024332046509} 11/06/2021 23:04:22 - INFO - __main__ - Step 13211: {'lr': 0.0004929555457093847, 'samples': 2536512, 'steps': 13210, 'loss/train': 1.7493226528167725} 11/06/2021 23:04:23 - INFO - __main__ - Step 13212: {'lr': 0.0004929542947744625, 'samples': 2536704, 'steps': 13211, 'loss/train': 1.1652683019638062} 11/06/2021 23:04:23 - INFO - __main__ - Step 13213: {'lr': 0.0004929530437300689, 'samples': 2536896, 'steps': 13212, 'loss/train': 1.901105284690857} 11/06/2021 23:04:24 - INFO - __main__ - Step 13214: {'lr': 0.0004929517925762045, 'samples': 2537088, 'steps': 13213, 'loss/train': 1.7152559757232666} 11/06/2021 23:04:24 - INFO - __main__ - Step 13215: {'lr': 0.0004929505413128696, 'samples': 2537280, 'steps': 13214, 'loss/train': 1.5080220699310303} 11/06/2021 23:04:25 - INFO - __main__ - Step 13216: {'lr': 0.000492949289940065, 'samples': 2537472, 'steps': 13215, 'loss/train': 1.7063556909561157} 11/06/2021 23:04:25 - INFO - __main__ - Step 13217: {'lr': 0.0004929480384577912, 'samples': 2537664, 'steps': 13216, 'loss/train': 1.8721396923065186} 11/06/2021 23:04:25 - INFO - __main__ - Step 13218: {'lr': 0.0004929467868660487, 'samples': 2537856, 'steps': 13217, 'loss/train': 1.3065961599349976} 11/06/2021 23:04:27 - INFO - __main__ - Step 13219: {'lr': 0.0004929455351648383, 'samples': 2538048, 'steps': 13218, 'loss/train': 1.533849835395813} 11/06/2021 23:04:27 - INFO - __main__ - Step 13220: {'lr': 0.0004929442833541603, 'samples': 2538240, 'steps': 13219, 'loss/train': 1.7814805507659912} 11/06/2021 23:04:27 - INFO - __main__ - Step 13221: {'lr': 0.0004929430314340154, 'samples': 2538432, 'steps': 13220, 'loss/train': 1.5206717252731323} 11/06/2021 23:04:28 - INFO - __main__ - Step 13222: {'lr': 0.000492941779404404, 'samples': 2538624, 'steps': 13221, 'loss/train': 1.3217058181762695} 11/06/2021 23:04:28 - INFO - __main__ - Step 13223: {'lr': 0.0004929405272653269, 'samples': 2538816, 'steps': 13222, 'loss/train': 1.758650302886963} 11/06/2021 23:04:29 - INFO - __main__ - Step 13224: {'lr': 0.0004929392750167845, 'samples': 2539008, 'steps': 13223, 'loss/train': 0.7512359023094177} 11/06/2021 23:04:29 - INFO - __main__ - Step 13225: {'lr': 0.0004929380226587774, 'samples': 2539200, 'steps': 13224, 'loss/train': 1.9796491861343384} 11/06/2021 23:04:30 - INFO - __main__ - Step 13226: {'lr': 0.0004929367701913062, 'samples': 2539392, 'steps': 13225, 'loss/train': 0.7901928424835205} 11/06/2021 23:04:30 - INFO - __main__ - Step 13227: {'lr': 0.0004929355176143714, 'samples': 2539584, 'steps': 13226, 'loss/train': 1.6296266317367554} 11/06/2021 23:04:30 - INFO - __main__ - Step 13228: {'lr': 0.0004929342649279736, 'samples': 2539776, 'steps': 13227, 'loss/train': 2.2642016410827637} 11/06/2021 23:04:31 - INFO - __main__ - Step 13229: {'lr': 0.0004929330121321134, 'samples': 2539968, 'steps': 13228, 'loss/train': 1.9576764106750488} 11/06/2021 23:04:32 - INFO - __main__ - Step 13230: {'lr': 0.0004929317592267913, 'samples': 2540160, 'steps': 13229, 'loss/train': 1.3771681785583496} 11/06/2021 23:04:32 - INFO - __main__ - Step 13231: {'lr': 0.000492930506212008, 'samples': 2540352, 'steps': 13230, 'loss/train': 1.4580309391021729} 11/06/2021 23:04:32 - INFO - __main__ - Step 13232: {'lr': 0.0004929292530877638, 'samples': 2540544, 'steps': 13231, 'loss/train': 1.5122207403182983} 11/06/2021 23:04:33 - INFO - __main__ - Step 13233: {'lr': 0.0004929279998540596, 'samples': 2540736, 'steps': 13232, 'loss/train': 1.770660400390625} 11/06/2021 23:04:34 - INFO - __main__ - Step 13234: {'lr': 0.0004929267465108956, 'samples': 2540928, 'steps': 13233, 'loss/train': 0.6602691411972046} 11/06/2021 23:04:34 - INFO - __main__ - Step 13235: {'lr': 0.0004929254930582728, 'samples': 2541120, 'steps': 13234, 'loss/train': 1.746626853942871} 11/06/2021 23:04:35 - INFO - __main__ - Step 13236: {'lr': 0.0004929242394961914, 'samples': 2541312, 'steps': 13235, 'loss/train': 1.5053402185440063} 11/06/2021 23:04:35 - INFO - __main__ - Step 13237: {'lr': 0.000492922985824652, 'samples': 2541504, 'steps': 13236, 'loss/train': 1.3967210054397583} 11/06/2021 23:04:35 - INFO - __main__ - Step 13238: {'lr': 0.0004929217320436553, 'samples': 2541696, 'steps': 13237, 'loss/train': 1.8607147932052612} 11/06/2021 23:04:36 - INFO - __main__ - Step 13239: {'lr': 0.0004929204781532018, 'samples': 2541888, 'steps': 13238, 'loss/train': 1.915743112564087} 11/06/2021 23:04:37 - INFO - __main__ - Step 13240: {'lr': 0.0004929192241532921, 'samples': 2542080, 'steps': 13239, 'loss/train': 1.5825774669647217} 11/06/2021 23:04:37 - INFO - __main__ - Step 13241: {'lr': 0.0004929179700439269, 'samples': 2542272, 'steps': 13240, 'loss/train': 0.8731245994567871} 11/06/2021 23:04:37 - INFO - __main__ - Step 13242: {'lr': 0.0004929167158251065, 'samples': 2542464, 'steps': 13241, 'loss/train': 1.7549324035644531} 11/06/2021 23:04:38 - INFO - __main__ - Step 13243: {'lr': 0.0004929154614968315, 'samples': 2542656, 'steps': 13242, 'loss/train': 0.5492236614227295} 11/06/2021 23:04:39 - INFO - __main__ - Step 13244: {'lr': 0.0004929142070591026, 'samples': 2542848, 'steps': 13243, 'loss/train': 1.614729404449463} 11/06/2021 23:04:39 - INFO - __main__ - Step 13245: {'lr': 0.0004929129525119203, 'samples': 2543040, 'steps': 13244, 'loss/train': 1.537818193435669} 11/06/2021 23:04:40 - INFO - __main__ - Step 13246: {'lr': 0.0004929116978552851, 'samples': 2543232, 'steps': 13245, 'loss/train': 1.7951061725616455} 11/06/2021 23:04:40 - INFO - __main__ - Step 13247: {'lr': 0.0004929104430891978, 'samples': 2543424, 'steps': 13246, 'loss/train': 1.5061917304992676} 11/06/2021 23:04:40 - INFO - __main__ - Step 13248: {'lr': 0.0004929091882136587, 'samples': 2543616, 'steps': 13247, 'loss/train': 1.6849573850631714} 11/06/2021 23:04:41 - INFO - __main__ - Step 13249: {'lr': 0.0004929079332286685, 'samples': 2543808, 'steps': 13248, 'loss/train': 1.7223663330078125} 11/06/2021 23:04:42 - INFO - __main__ - Step 13250: {'lr': 0.0004929066781342277, 'samples': 2544000, 'steps': 13249, 'loss/train': 1.109410285949707} 11/06/2021 23:04:42 - INFO - __main__ - Step 13251: {'lr': 0.0004929054229303369, 'samples': 2544192, 'steps': 13250, 'loss/train': 1.5786045789718628} 11/06/2021 23:04:42 - INFO - __main__ - Step 13252: {'lr': 0.0004929041676169967, 'samples': 2544384, 'steps': 13251, 'loss/train': 1.4510022401809692} 11/06/2021 23:04:43 - INFO - __main__ - Step 13253: {'lr': 0.0004929029121942077, 'samples': 2544576, 'steps': 13252, 'loss/train': 1.5690830945968628} 11/06/2021 23:04:43 - INFO - __main__ - Step 13254: {'lr': 0.0004929016566619703, 'samples': 2544768, 'steps': 13253, 'loss/train': 1.475024938583374} 11/06/2021 23:04:44 - INFO - __main__ - Step 13255: {'lr': 0.0004929004010202851, 'samples': 2544960, 'steps': 13254, 'loss/train': 1.462422490119934} 11/06/2021 23:04:44 - INFO - __main__ - Step 13256: {'lr': 0.0004928991452691528, 'samples': 2545152, 'steps': 13255, 'loss/train': 1.8692861795425415} 11/06/2021 23:04:45 - INFO - __main__ - Step 13257: {'lr': 0.0004928978894085739, 'samples': 2545344, 'steps': 13256, 'loss/train': 1.8582710027694702} 11/06/2021 23:04:45 - INFO - __main__ - Step 13258: {'lr': 0.000492896633438549, 'samples': 2545536, 'steps': 13257, 'loss/train': 2.016894578933716} 11/06/2021 23:04:46 - INFO - __main__ - Step 13259: {'lr': 0.0004928953773590785, 'samples': 2545728, 'steps': 13258, 'loss/train': 1.6744557619094849} 11/06/2021 23:04:47 - INFO - __main__ - Step 13260: {'lr': 0.0004928941211701632, 'samples': 2545920, 'steps': 13259, 'loss/train': 0.2616555988788605} 11/06/2021 23:04:47 - INFO - __main__ - Step 13261: {'lr': 0.0004928928648718035, 'samples': 2546112, 'steps': 13260, 'loss/train': 1.6083446741104126} 11/06/2021 23:04:47 - INFO - __main__ - Step 13262: {'lr': 0.0004928916084640001, 'samples': 2546304, 'steps': 13261, 'loss/train': 1.1888530254364014} 11/06/2021 23:04:48 - INFO - __main__ - Step 13263: {'lr': 0.0004928903519467534, 'samples': 2546496, 'steps': 13262, 'loss/train': 2.4009814262390137} 11/06/2021 23:04:48 - INFO - __main__ - Step 13264: {'lr': 0.0004928890953200641, 'samples': 2546688, 'steps': 13263, 'loss/train': 1.301665186882019} 11/06/2021 23:04:49 - INFO - __main__ - Step 13265: {'lr': 0.0004928878385839327, 'samples': 2546880, 'steps': 13264, 'loss/train': 1.293091058731079} 11/06/2021 23:04:49 - INFO - __main__ - Step 13266: {'lr': 0.0004928865817383597, 'samples': 2547072, 'steps': 13265, 'loss/train': 1.251847267150879} 11/06/2021 23:04:50 - INFO - __main__ - Step 13267: {'lr': 0.0004928853247833459, 'samples': 2547264, 'steps': 13266, 'loss/train': 1.894822120666504} 11/06/2021 23:04:50 - INFO - __main__ - Step 13268: {'lr': 0.0004928840677188918, 'samples': 2547456, 'steps': 13267, 'loss/train': 1.5503804683685303} 11/06/2021 23:04:50 - INFO - __main__ - Step 13269: {'lr': 0.0004928828105449977, 'samples': 2547648, 'steps': 13268, 'loss/train': 1.885050892829895} 11/06/2021 23:04:52 - INFO - __main__ - Step 13270: {'lr': 0.0004928815532616644, 'samples': 2547840, 'steps': 13269, 'loss/train': 1.5068950653076172} 11/06/2021 23:04:52 - INFO - __main__ - Step 13271: {'lr': 0.0004928802958688924, 'samples': 2548032, 'steps': 13270, 'loss/train': 1.590198040008545} 11/06/2021 23:04:52 - INFO - __main__ - Step 13272: {'lr': 0.0004928790383666823, 'samples': 2548224, 'steps': 13271, 'loss/train': 1.786717176437378} 11/06/2021 23:04:53 - INFO - __main__ - Step 13273: {'lr': 0.0004928777807550348, 'samples': 2548416, 'steps': 13272, 'loss/train': 1.5326136350631714} 11/06/2021 23:04:53 - INFO - __main__ - Step 13274: {'lr': 0.0004928765230339502, 'samples': 2548608, 'steps': 13273, 'loss/train': 2.4051101207733154} 11/06/2021 23:04:54 - INFO - __main__ - Step 13275: {'lr': 0.000492875265203429, 'samples': 2548800, 'steps': 13274, 'loss/train': 2.304696559906006} 11/06/2021 23:04:54 - INFO - __main__ - Step 13276: {'lr': 0.0004928740072634722, 'samples': 2548992, 'steps': 13275, 'loss/train': 2.0685250759124756} 11/06/2021 23:04:55 - INFO - __main__ - Step 13277: {'lr': 0.0004928727492140801, 'samples': 2549184, 'steps': 13276, 'loss/train': 1.733903408050537} 11/06/2021 23:04:55 - INFO - __main__ - Step 13278: {'lr': 0.0004928714910552533, 'samples': 2549376, 'steps': 13277, 'loss/train': 1.8066354990005493} 11/06/2021 23:04:55 - INFO - __main__ - Step 13279: {'lr': 0.0004928702327869922, 'samples': 2549568, 'steps': 13278, 'loss/train': 1.9759420156478882} 11/06/2021 23:04:56 - INFO - __main__ - Step 13280: {'lr': 0.0004928689744092976, 'samples': 2549760, 'steps': 13279, 'loss/train': 1.8017091751098633} 11/06/2021 23:04:57 - INFO - __main__ - Step 13281: {'lr': 0.0004928677159221701, 'samples': 2549952, 'steps': 13280, 'loss/train': 1.4251042604446411} 11/06/2021 23:04:57 - INFO - __main__ - Step 13282: {'lr': 0.00049286645732561, 'samples': 2550144, 'steps': 13281, 'loss/train': 1.3295786380767822} 11/06/2021 23:04:57 - INFO - __main__ - Step 13283: {'lr': 0.0004928651986196181, 'samples': 2550336, 'steps': 13282, 'loss/train': 1.1968010663986206} 11/06/2021 23:04:58 - INFO - __main__ - Step 13284: {'lr': 0.0004928639398041948, 'samples': 2550528, 'steps': 13283, 'loss/train': 1.974739670753479} 11/06/2021 23:04:59 - INFO - __main__ - Step 13285: {'lr': 0.0004928626808793409, 'samples': 2550720, 'steps': 13284, 'loss/train': 1.5590357780456543} 11/06/2021 23:04:59 - INFO - __main__ - Step 13286: {'lr': 0.0004928614218450568, 'samples': 2550912, 'steps': 13285, 'loss/train': 1.4196864366531372} 11/06/2021 23:05:00 - INFO - __main__ - Step 13287: {'lr': 0.000492860162701343, 'samples': 2551104, 'steps': 13286, 'loss/train': 1.4201176166534424} 11/06/2021 23:05:00 - INFO - __main__ - Step 13288: {'lr': 0.0004928589034482001, 'samples': 2551296, 'steps': 13287, 'loss/train': 1.8625463247299194} 11/06/2021 23:05:00 - INFO - __main__ - Step 13289: {'lr': 0.000492857644085629, 'samples': 2551488, 'steps': 13288, 'loss/train': 1.8042540550231934} 11/06/2021 23:05:01 - INFO - __main__ - Step 13290: {'lr': 0.0004928563846136296, 'samples': 2551680, 'steps': 13289, 'loss/train': 1.6024283170700073} 11/06/2021 23:05:02 - INFO - __main__ - Step 13291: {'lr': 0.0004928551250322032, 'samples': 2551872, 'steps': 13290, 'loss/train': 1.5354256629943848} 11/06/2021 23:05:02 - INFO - __main__ - Step 13292: {'lr': 0.0004928538653413499, 'samples': 2552064, 'steps': 13291, 'loss/train': 2.1231536865234375} 11/06/2021 23:05:02 - INFO - __main__ - Step 13293: {'lr': 0.0004928526055410704, 'samples': 2552256, 'steps': 13292, 'loss/train': 1.7034943103790283} 11/06/2021 23:05:03 - INFO - __main__ - Step 13294: {'lr': 0.0004928513456313653, 'samples': 2552448, 'steps': 13293, 'loss/train': 1.6685292720794678} 11/06/2021 23:05:03 - INFO - __main__ - Step 13295: {'lr': 0.000492850085612235, 'samples': 2552640, 'steps': 13294, 'loss/train': 1.6578960418701172} 11/06/2021 23:05:04 - INFO - __main__ - Step 13296: {'lr': 0.0004928488254836804, 'samples': 2552832, 'steps': 13295, 'loss/train': 1.1260663270950317} 11/06/2021 23:05:04 - INFO - __main__ - Step 13297: {'lr': 0.0004928475652457017, 'samples': 2553024, 'steps': 13296, 'loss/train': 1.738349199295044} 11/06/2021 23:05:05 - INFO - __main__ - Step 13298: {'lr': 0.0004928463048982998, 'samples': 2553216, 'steps': 13297, 'loss/train': 1.6107333898544312} 11/06/2021 23:05:05 - INFO - __main__ - Step 13299: {'lr': 0.0004928450444414749, 'samples': 2553408, 'steps': 13298, 'loss/train': 2.0628111362457275} 11/06/2021 23:05:06 - INFO - __main__ - Step 13300: {'lr': 0.0004928437838752278, 'samples': 2553600, 'steps': 13299, 'loss/train': 1.5923441648483276} 11/06/2021 23:05:07 - INFO - __main__ - Step 13301: {'lr': 0.0004928425231995593, 'samples': 2553792, 'steps': 13300, 'loss/train': 1.7554380893707275} 11/06/2021 23:05:07 - INFO - __main__ - Step 13302: {'lr': 0.0004928412624144694, 'samples': 2553984, 'steps': 13301, 'loss/train': 1.8865063190460205} 11/06/2021 23:05:07 - INFO - __main__ - Step 13303: {'lr': 0.0004928400015199591, 'samples': 2554176, 'steps': 13302, 'loss/train': 1.7683358192443848} 11/06/2021 23:05:08 - INFO - __main__ - Step 13304: {'lr': 0.0004928387405160288, 'samples': 2554368, 'steps': 13303, 'loss/train': 1.6909922361373901} 11/06/2021 23:05:08 - INFO - __main__ - Step 13305: {'lr': 0.0004928374794026792, 'samples': 2554560, 'steps': 13304, 'loss/train': 1.3994184732437134} 11/06/2021 23:05:09 - INFO - __main__ - Step 13306: {'lr': 0.0004928362181799107, 'samples': 2554752, 'steps': 13305, 'loss/train': 1.705430030822754} 11/06/2021 23:05:09 - INFO - __main__ - Step 13307: {'lr': 0.0004928349568477239, 'samples': 2554944, 'steps': 13306, 'loss/train': 1.6792200803756714} 11/06/2021 23:05:10 - INFO - __main__ - Step 13308: {'lr': 0.0004928336954061195, 'samples': 2555136, 'steps': 13307, 'loss/train': 1.2497096061706543} 11/06/2021 23:05:10 - INFO - __main__ - Step 13309: {'lr': 0.000492832433855098, 'samples': 2555328, 'steps': 13308, 'loss/train': 1.3893240690231323} 11/06/2021 23:05:10 - INFO - __main__ - Step 13310: {'lr': 0.0004928311721946599, 'samples': 2555520, 'steps': 13309, 'loss/train': 1.5529297590255737} 11/06/2021 23:05:11 - INFO - __main__ - Step 13311: {'lr': 0.0004928299104248059, 'samples': 2555712, 'steps': 13310, 'loss/train': 1.6530280113220215} 11/06/2021 23:05:12 - INFO - __main__ - Step 13312: {'lr': 0.0004928286485455365, 'samples': 2555904, 'steps': 13311, 'loss/train': 1.1228559017181396} 11/06/2021 23:05:12 - INFO - __main__ - Step 13313: {'lr': 0.0004928273865568521, 'samples': 2556096, 'steps': 13312, 'loss/train': 1.5058236122131348} 11/06/2021 23:05:13 - INFO - __main__ - Step 13314: {'lr': 0.0004928261244587536, 'samples': 2556288, 'steps': 13313, 'loss/train': 1.8624576330184937} 11/06/2021 23:05:13 - INFO - __main__ - Step 13315: {'lr': 0.0004928248622512412, 'samples': 2556480, 'steps': 13314, 'loss/train': 1.4049829244613647} 11/06/2021 23:05:14 - INFO - __main__ - Step 13316: {'lr': 0.0004928235999343159, 'samples': 2556672, 'steps': 13315, 'loss/train': 1.9419476985931396} 11/06/2021 23:05:14 - INFO - __main__ - Step 13317: {'lr': 0.0004928223375079778, 'samples': 2556864, 'steps': 13316, 'loss/train': 1.7618058919906616} 11/06/2021 23:05:14 - INFO - __main__ - Step 13318: {'lr': 0.0004928210749722278, 'samples': 2557056, 'steps': 13317, 'loss/train': 1.606778621673584} 11/06/2021 23:05:15 - INFO - __main__ - Step 13319: {'lr': 0.0004928198123270664, 'samples': 2557248, 'steps': 13318, 'loss/train': 1.5688247680664062} 11/06/2021 23:05:15 - INFO - __main__ - Step 13320: {'lr': 0.0004928185495724942, 'samples': 2557440, 'steps': 13319, 'loss/train': 1.9147883653640747} 11/06/2021 23:05:16 - INFO - __main__ - Step 13321: {'lr': 0.0004928172867085115, 'samples': 2557632, 'steps': 13320, 'loss/train': 1.3374717235565186} 11/06/2021 23:05:17 - INFO - __main__ - Step 13322: {'lr': 0.0004928160237351192, 'samples': 2557824, 'steps': 13321, 'loss/train': 1.7408499717712402} 11/06/2021 23:05:17 - INFO - __main__ - Step 13323: {'lr': 0.0004928147606523179, 'samples': 2558016, 'steps': 13322, 'loss/train': 1.20220947265625} 11/06/2021 23:05:17 - INFO - __main__ - Step 13324: {'lr': 0.0004928134974601078, 'samples': 2558208, 'steps': 13323, 'loss/train': 1.730553388595581} 11/06/2021 23:05:18 - INFO - __main__ - Step 13325: {'lr': 0.0004928122341584897, 'samples': 2558400, 'steps': 13324, 'loss/train': 1.704958200454712} 11/06/2021 23:05:18 - INFO - __main__ - Step 13326: {'lr': 0.0004928109707474643, 'samples': 2558592, 'steps': 13325, 'loss/train': 1.6393741369247437} 11/06/2021 23:05:19 - INFO - __main__ - Step 13327: {'lr': 0.0004928097072270319, 'samples': 2558784, 'steps': 13326, 'loss/train': 1.696864366531372} 11/06/2021 23:05:19 - INFO - __main__ - Step 13328: {'lr': 0.0004928084435971932, 'samples': 2558976, 'steps': 13327, 'loss/train': 1.6887147426605225} 11/06/2021 23:05:20 - INFO - __main__ - Step 13329: {'lr': 0.0004928071798579488, 'samples': 2559168, 'steps': 13328, 'loss/train': 1.7880932092666626} 11/06/2021 23:05:20 - INFO - __main__ - Step 13330: {'lr': 0.0004928059160092993, 'samples': 2559360, 'steps': 13329, 'loss/train': 1.7646552324295044} 11/06/2021 23:05:20 - INFO - __main__ - Step 13331: {'lr': 0.000492804652051245, 'samples': 2559552, 'steps': 13330, 'loss/train': 1.588904857635498} 11/06/2021 23:05:22 - INFO - __main__ - Step 13332: {'lr': 0.0004928033879837868, 'samples': 2559744, 'steps': 13331, 'loss/train': 1.295880675315857} 11/06/2021 23:05:22 - INFO - __main__ - Step 13333: {'lr': 0.0004928021238069251, 'samples': 2559936, 'steps': 13332, 'loss/train': 1.5167969465255737} 11/06/2021 23:05:22 - INFO - __main__ - Step 13334: {'lr': 0.0004928008595206605, 'samples': 2560128, 'steps': 13333, 'loss/train': 0.6749269366264343} 11/06/2021 23:05:23 - INFO - __main__ - Step 13335: {'lr': 0.0004927995951249937, 'samples': 2560320, 'steps': 13334, 'loss/train': 1.3417226076126099} 11/06/2021 23:05:23 - INFO - __main__ - Step 13336: {'lr': 0.0004927983306199251, 'samples': 2560512, 'steps': 13335, 'loss/train': 0.18615388870239258} 11/06/2021 23:05:24 - INFO - __main__ - Step 13337: {'lr': 0.0004927970660054552, 'samples': 2560704, 'steps': 13336, 'loss/train': 1.5057356357574463} 11/06/2021 23:05:25 - INFO - __main__ - Step 13338: {'lr': 0.0004927958012815849, 'samples': 2560896, 'steps': 13337, 'loss/train': 1.761964201927185} 11/06/2021 23:05:25 - INFO - __main__ - Step 13339: {'lr': 0.0004927945364483144, 'samples': 2561088, 'steps': 13338, 'loss/train': 1.5232926607131958} 11/06/2021 23:05:25 - INFO - __main__ - Step 13340: {'lr': 0.0004927932715056444, 'samples': 2561280, 'steps': 13339, 'loss/train': 1.5843162536621094} 11/06/2021 23:05:26 - INFO - __main__ - Step 13341: {'lr': 0.0004927920064535756, 'samples': 2561472, 'steps': 13340, 'loss/train': 1.808425784111023} 11/06/2021 23:05:27 - INFO - __main__ - Step 13342: {'lr': 0.0004927907412921084, 'samples': 2561664, 'steps': 13341, 'loss/train': 1.7012680768966675} 11/06/2021 23:05:27 - INFO - __main__ - Step 13343: {'lr': 0.0004927894760212435, 'samples': 2561856, 'steps': 13342, 'loss/train': 1.866380214691162} 11/06/2021 23:05:27 - INFO - __main__ - Step 13344: {'lr': 0.0004927882106409813, 'samples': 2562048, 'steps': 13343, 'loss/train': 0.6757373213768005} 11/06/2021 23:05:28 - INFO - __main__ - Step 13345: {'lr': 0.0004927869451513226, 'samples': 2562240, 'steps': 13344, 'loss/train': 1.5792934894561768} 11/06/2021 23:05:28 - INFO - __main__ - Step 13346: {'lr': 0.0004927856795522678, 'samples': 2562432, 'steps': 13345, 'loss/train': 1.1496813297271729} 11/06/2021 23:05:29 - INFO - __main__ - Step 13347: {'lr': 0.0004927844138438175, 'samples': 2562624, 'steps': 13346, 'loss/train': 1.646971344947815} 11/06/2021 23:05:29 - INFO - __main__ - Step 13348: {'lr': 0.0004927831480259723, 'samples': 2562816, 'steps': 13347, 'loss/train': 1.720847725868225} 11/06/2021 23:05:30 - INFO - __main__ - Step 13349: {'lr': 0.0004927818820987328, 'samples': 2563008, 'steps': 13348, 'loss/train': 1.679519534111023} 11/06/2021 23:05:30 - INFO - __main__ - Step 13350: {'lr': 0.0004927806160620995, 'samples': 2563200, 'steps': 13349, 'loss/train': 2.182821750640869} 11/06/2021 23:05:30 - INFO - __main__ - Step 13351: {'lr': 0.0004927793499160729, 'samples': 2563392, 'steps': 13350, 'loss/train': 2.1079049110412598} 11/06/2021 23:05:31 - INFO - __main__ - Step 13352: {'lr': 0.000492778083660654, 'samples': 2563584, 'steps': 13351, 'loss/train': 1.3878504037857056} 11/06/2021 23:05:32 - INFO - __main__ - Step 13353: {'lr': 0.0004927768172958427, 'samples': 2563776, 'steps': 13352, 'loss/train': 1.9145451784133911} 11/06/2021 23:05:32 - INFO - __main__ - Step 13354: {'lr': 0.00049277555082164, 'samples': 2563968, 'steps': 13353, 'loss/train': 1.463181495666504} 11/06/2021 23:05:33 - INFO - __main__ - Step 13355: {'lr': 0.0004927742842380465, 'samples': 2564160, 'steps': 13354, 'loss/train': 2.1269948482513428} 11/06/2021 23:05:33 - INFO - __main__ - Step 13356: {'lr': 0.0004927730175450626, 'samples': 2564352, 'steps': 13355, 'loss/train': 1.5894335508346558} 11/06/2021 23:05:34 - INFO - __main__ - Step 13357: {'lr': 0.0004927717507426887, 'samples': 2564544, 'steps': 13356, 'loss/train': 1.5174648761749268} 11/06/2021 23:05:34 - INFO - __main__ - Step 13358: {'lr': 0.0004927704838309259, 'samples': 2564736, 'steps': 13357, 'loss/train': 1.6864207983016968} 11/06/2021 23:05:35 - INFO - __main__ - Step 13359: {'lr': 0.0004927692168097743, 'samples': 2564928, 'steps': 13358, 'loss/train': 1.3122807741165161} 11/06/2021 23:05:35 - INFO - __main__ - Step 13360: {'lr': 0.0004927679496792347, 'samples': 2565120, 'steps': 13359, 'loss/train': 1.738892674446106} 11/06/2021 23:05:35 - INFO - __main__ - Step 13361: {'lr': 0.0004927666824393076, 'samples': 2565312, 'steps': 13360, 'loss/train': 0.4052657186985016} 11/06/2021 23:05:36 - INFO - __main__ - Step 13362: {'lr': 0.0004927654150899937, 'samples': 2565504, 'steps': 13361, 'loss/train': 2.3916163444519043} 11/06/2021 23:05:37 - INFO - __main__ - Step 13363: {'lr': 0.0004927641476312932, 'samples': 2565696, 'steps': 13362, 'loss/train': 1.5969182252883911} 11/06/2021 23:05:37 - INFO - __main__ - Step 13364: {'lr': 0.000492762880063207, 'samples': 2565888, 'steps': 13363, 'loss/train': 1.332525610923767} 11/06/2021 23:05:37 - INFO - __main__ - Step 13365: {'lr': 0.0004927616123857357, 'samples': 2566080, 'steps': 13364, 'loss/train': 1.9494463205337524} 11/06/2021 23:05:38 - INFO - __main__ - Step 13366: {'lr': 0.0004927603445988797, 'samples': 2566272, 'steps': 13365, 'loss/train': 1.918675422668457} 11/06/2021 23:05:39 - INFO - __main__ - Step 13367: {'lr': 0.0004927590767026396, 'samples': 2566464, 'steps': 13366, 'loss/train': 2.5392587184906006} 11/06/2021 23:05:39 - INFO - __main__ - Step 13368: {'lr': 0.0004927578086970161, 'samples': 2566656, 'steps': 13367, 'loss/train': 1.7140586376190186} 11/06/2021 23:05:39 - INFO - __main__ - Step 13369: {'lr': 0.0004927565405820096, 'samples': 2566848, 'steps': 13368, 'loss/train': 1.7201263904571533} 11/06/2021 23:05:40 - INFO - __main__ - Step 13370: {'lr': 0.0004927552723576207, 'samples': 2567040, 'steps': 13369, 'loss/train': 1.939740777015686} 11/06/2021 23:05:40 - INFO - __main__ - Step 13371: {'lr': 0.0004927540040238501, 'samples': 2567232, 'steps': 13370, 'loss/train': 1.2085801362991333} 11/06/2021 23:05:41 - INFO - __main__ - Step 13372: {'lr': 0.0004927527355806983, 'samples': 2567424, 'steps': 13371, 'loss/train': 1.3057010173797607} 11/06/2021 23:05:41 - INFO - __main__ - Step 13373: {'lr': 0.0004927514670281659, 'samples': 2567616, 'steps': 13372, 'loss/train': 1.3879728317260742} 11/06/2021 23:05:42 - INFO - __main__ - Step 13374: {'lr': 0.0004927501983662534, 'samples': 2567808, 'steps': 13373, 'loss/train': 2.229905128479004} 11/06/2021 23:05:42 - INFO - __main__ - Step 13375: {'lr': 0.0004927489295949613, 'samples': 2568000, 'steps': 13374, 'loss/train': 1.403855562210083} 11/06/2021 23:05:43 - INFO - __main__ - Step 13376: {'lr': 0.0004927476607142904, 'samples': 2568192, 'steps': 13375, 'loss/train': 1.8529839515686035} 11/06/2021 23:05:44 - INFO - __main__ - Step 13377: {'lr': 0.0004927463917242411, 'samples': 2568384, 'steps': 13376, 'loss/train': 1.6956440210342407} 11/06/2021 23:05:44 - INFO - __main__ - Step 13378: {'lr': 0.0004927451226248141, 'samples': 2568576, 'steps': 13377, 'loss/train': 1.5961147546768188} 11/06/2021 23:05:44 - INFO - __main__ - Step 13379: {'lr': 0.0004927438534160098, 'samples': 2568768, 'steps': 13378, 'loss/train': 1.2757984399795532} 11/06/2021 23:05:45 - INFO - __main__ - Step 13380: {'lr': 0.0004927425840978289, 'samples': 2568960, 'steps': 13379, 'loss/train': 1.361778736114502} 11/06/2021 23:05:45 - INFO - __main__ - Step 13381: {'lr': 0.0004927413146702719, 'samples': 2569152, 'steps': 13380, 'loss/train': 1.2567278146743774} 11/06/2021 23:05:45 - INFO - __main__ - Step 13382: {'lr': 0.0004927400451333394, 'samples': 2569344, 'steps': 13381, 'loss/train': 1.9279859066009521} 11/06/2021 23:05:46 - INFO - __main__ - Step 13383: {'lr': 0.0004927387754870321, 'samples': 2569536, 'steps': 13382, 'loss/train': 0.35177692770957947} 11/06/2021 23:05:47 - INFO - __main__ - Step 13384: {'lr': 0.0004927375057313504, 'samples': 2569728, 'steps': 13383, 'loss/train': 2.1717875003814697} 11/06/2021 23:05:47 - INFO - __main__ - Step 13385: {'lr': 0.0004927362358662948, 'samples': 2569920, 'steps': 13384, 'loss/train': 1.7761980295181274} 11/06/2021 23:05:47 - INFO - __main__ - Step 13386: {'lr': 0.0004927349658918662, 'samples': 2570112, 'steps': 13385, 'loss/train': 1.3248966932296753} 11/06/2021 23:05:48 - INFO - __main__ - Step 13387: {'lr': 0.0004927336958080648, 'samples': 2570304, 'steps': 13386, 'loss/train': 1.88877272605896} 11/06/2021 23:05:49 - INFO - __main__ - Step 13388: {'lr': 0.0004927324256148914, 'samples': 2570496, 'steps': 13387, 'loss/train': 1.2974435091018677} 11/06/2021 23:05:49 - INFO - __main__ - Step 13389: {'lr': 0.0004927311553123465, 'samples': 2570688, 'steps': 13388, 'loss/train': 1.5624191761016846} 11/06/2021 23:05:50 - INFO - __main__ - Step 13390: {'lr': 0.0004927298849004307, 'samples': 2570880, 'steps': 13389, 'loss/train': 1.5980331897735596} 11/06/2021 23:05:50 - INFO - __main__ - Step 13391: {'lr': 0.0004927286143791447, 'samples': 2571072, 'steps': 13390, 'loss/train': 2.0183682441711426} 11/06/2021 23:05:50 - INFO - __main__ - Step 13392: {'lr': 0.0004927273437484888, 'samples': 2571264, 'steps': 13391, 'loss/train': 1.7328522205352783} 11/06/2021 23:05:51 - INFO - __main__ - Step 13393: {'lr': 0.0004927260730084636, 'samples': 2571456, 'steps': 13392, 'loss/train': 1.669248342514038} 11/06/2021 23:05:52 - INFO - __main__ - Step 13394: {'lr': 0.0004927248021590699, 'samples': 2571648, 'steps': 13393, 'loss/train': 2.133970022201538} 11/06/2021 23:05:52 - INFO - __main__ - Step 13395: {'lr': 0.0004927235312003082, 'samples': 2571840, 'steps': 13394, 'loss/train': 1.0649173259735107} 11/06/2021 23:05:52 - INFO - __main__ - Step 13396: {'lr': 0.0004927222601321789, 'samples': 2572032, 'steps': 13395, 'loss/train': 2.6536405086517334} 11/06/2021 23:05:53 - INFO - __main__ - Step 13397: {'lr': 0.0004927209889546828, 'samples': 2572224, 'steps': 13396, 'loss/train': 1.37748122215271} 11/06/2021 23:05:54 - INFO - __main__ - Step 13398: {'lr': 0.0004927197176678203, 'samples': 2572416, 'steps': 13397, 'loss/train': 1.8754839897155762} 11/06/2021 23:05:54 - INFO - __main__ - Step 13399: {'lr': 0.000492718446271592, 'samples': 2572608, 'steps': 13398, 'loss/train': 1.372225284576416} 11/06/2021 23:05:55 - INFO - __main__ - Step 13400: {'lr': 0.0004927171747659986, 'samples': 2572800, 'steps': 13399, 'loss/train': 1.8654577732086182} 11/06/2021 23:05:55 - INFO - __main__ - Step 13401: {'lr': 0.0004927159031510405, 'samples': 2572992, 'steps': 13400, 'loss/train': 0.6154890060424805} 11/06/2021 23:05:55 - INFO - __main__ - Step 13402: {'lr': 0.0004927146314267184, 'samples': 2573184, 'steps': 13401, 'loss/train': 1.4731422662734985} 11/06/2021 23:05:57 - INFO - __main__ - Step 13403: {'lr': 0.000492713359593033, 'samples': 2573376, 'steps': 13402, 'loss/train': 1.8210397958755493} 11/06/2021 23:05:57 - INFO - __main__ - Step 13404: {'lr': 0.0004927120876499846, 'samples': 2573568, 'steps': 13403, 'loss/train': 2.1863057613372803} 11/06/2021 23:05:57 - INFO - __main__ - Step 13405: {'lr': 0.0004927108155975738, 'samples': 2573760, 'steps': 13404, 'loss/train': 1.5386931896209717} 11/06/2021 23:05:58 - INFO - __main__ - Step 13406: {'lr': 0.0004927095434358012, 'samples': 2573952, 'steps': 13405, 'loss/train': 2.4172680377960205} 11/06/2021 23:05:58 - INFO - __main__ - Step 13407: {'lr': 0.0004927082711646676, 'samples': 2574144, 'steps': 13406, 'loss/train': 1.7488776445388794} 11/06/2021 23:05:58 - INFO - __main__ - Step 13408: {'lr': 0.0004927069987841733, 'samples': 2574336, 'steps': 13407, 'loss/train': 2.008625030517578} 11/06/2021 23:05:59 - INFO - __main__ - Step 13409: {'lr': 0.0004927057262943189, 'samples': 2574528, 'steps': 13408, 'loss/train': 1.6773982048034668} 11/06/2021 23:06:00 - INFO - __main__ - Step 13410: {'lr': 0.0004927044536951052, 'samples': 2574720, 'steps': 13409, 'loss/train': 2.1111257076263428} 11/06/2021 23:06:00 - INFO - __main__ - Step 13411: {'lr': 0.0004927031809865324, 'samples': 2574912, 'steps': 13410, 'loss/train': 1.8370585441589355} 11/06/2021 23:06:00 - INFO - __main__ - Step 13412: {'lr': 0.0004927019081686015, 'samples': 2575104, 'steps': 13411, 'loss/train': 1.4143866300582886} 11/06/2021 23:06:01 - INFO - __main__ - Step 13413: {'lr': 0.0004927006352413128, 'samples': 2575296, 'steps': 13412, 'loss/train': 1.1940877437591553} 11/06/2021 23:06:02 - INFO - __main__ - Step 13414: {'lr': 0.000492699362204667, 'samples': 2575488, 'steps': 13413, 'loss/train': 1.8130630254745483} 11/06/2021 23:06:02 - INFO - __main__ - Step 13415: {'lr': 0.0004926980890586645, 'samples': 2575680, 'steps': 13414, 'loss/train': 1.6337990760803223} 11/06/2021 23:06:02 - INFO - __main__ - Step 13416: {'lr': 0.000492696815803306, 'samples': 2575872, 'steps': 13415, 'loss/train': 1.7381833791732788} 11/06/2021 23:06:03 - INFO - __main__ - Step 13417: {'lr': 0.0004926955424385921, 'samples': 2576064, 'steps': 13416, 'loss/train': 1.8581897020339966} 11/06/2021 23:06:03 - INFO - __main__ - Step 13418: {'lr': 0.0004926942689645234, 'samples': 2576256, 'steps': 13417, 'loss/train': 0.9027330875396729} 11/06/2021 23:06:04 - INFO - __main__ - Step 13419: {'lr': 0.0004926929953811003, 'samples': 2576448, 'steps': 13418, 'loss/train': 1.8112411499023438} 11/06/2021 23:06:05 - INFO - __main__ - Step 13420: {'lr': 0.0004926917216883235, 'samples': 2576640, 'steps': 13419, 'loss/train': 1.7219598293304443} 11/06/2021 23:06:05 - INFO - __main__ - Step 13421: {'lr': 0.0004926904478861937, 'samples': 2576832, 'steps': 13420, 'loss/train': 1.8902779817581177} 11/06/2021 23:06:05 - INFO - __main__ - Step 13422: {'lr': 0.0004926891739747111, 'samples': 2577024, 'steps': 13421, 'loss/train': 1.9297341108322144} 11/06/2021 23:06:06 - INFO - __main__ - Step 13423: {'lr': 0.0004926878999538766, 'samples': 2577216, 'steps': 13422, 'loss/train': 1.187242865562439} 11/06/2021 23:06:07 - INFO - __main__ - Step 13424: {'lr': 0.0004926866258236907, 'samples': 2577408, 'steps': 13423, 'loss/train': 2.304779529571533} 11/06/2021 23:06:07 - INFO - __main__ - Step 13425: {'lr': 0.000492685351584154, 'samples': 2577600, 'steps': 13424, 'loss/train': 2.2769415378570557} 11/06/2021 23:06:07 - INFO - __main__ - Step 13426: {'lr': 0.000492684077235267, 'samples': 2577792, 'steps': 13425, 'loss/train': 0.9876941442489624} 11/06/2021 23:06:08 - INFO - __main__ - Step 13427: {'lr': 0.0004926828027770302, 'samples': 2577984, 'steps': 13426, 'loss/train': 1.4643722772598267} 11/06/2021 23:06:08 - INFO - __main__ - Step 13428: {'lr': 0.0004926815282094443, 'samples': 2578176, 'steps': 13427, 'loss/train': 2.2351608276367188} 11/06/2021 23:06:09 - INFO - __main__ - Step 13429: {'lr': 0.00049268025353251, 'samples': 2578368, 'steps': 13428, 'loss/train': 1.725936770439148} 11/06/2021 23:06:09 - INFO - __main__ - Step 13430: {'lr': 0.0004926789787462276, 'samples': 2578560, 'steps': 13429, 'loss/train': 1.8152636289596558} 11/06/2021 23:06:10 - INFO - __main__ - Step 13431: {'lr': 0.0004926777038505978, 'samples': 2578752, 'steps': 13430, 'loss/train': 1.673268437385559} 11/06/2021 23:06:10 - INFO - __main__ - Step 13432: {'lr': 0.0004926764288456212, 'samples': 2578944, 'steps': 13431, 'loss/train': 1.6883317232131958} 11/06/2021 23:06:10 - INFO - __main__ - Step 13433: {'lr': 0.0004926751537312982, 'samples': 2579136, 'steps': 13432, 'loss/train': 1.6265524625778198} 11/06/2021 23:06:11 - INFO - __main__ - Step 13434: {'lr': 0.0004926738785076297, 'samples': 2579328, 'steps': 13433, 'loss/train': 1.6185946464538574} 11/06/2021 23:06:12 - INFO - __main__ - Step 13435: {'lr': 0.000492672603174616, 'samples': 2579520, 'steps': 13434, 'loss/train': 1.6138404607772827} 11/06/2021 23:06:12 - INFO - __main__ - Step 13436: {'lr': 0.0004926713277322579, 'samples': 2579712, 'steps': 13435, 'loss/train': 1.8457807302474976} 11/06/2021 23:06:13 - INFO - __main__ - Step 13437: {'lr': 0.0004926700521805557, 'samples': 2579904, 'steps': 13436, 'loss/train': 1.5972094535827637} 11/06/2021 23:06:13 - INFO - __main__ - Step 13438: {'lr': 0.0004926687765195102, 'samples': 2580096, 'steps': 13437, 'loss/train': 1.67529296875} 11/06/2021 23:06:14 - INFO - __main__ - Step 13439: {'lr': 0.0004926675007491218, 'samples': 2580288, 'steps': 13438, 'loss/train': 1.481308937072754} 11/06/2021 23:06:14 - INFO - __main__ - Step 13440: {'lr': 0.0004926662248693912, 'samples': 2580480, 'steps': 13439, 'loss/train': 1.9427311420440674} 11/06/2021 23:06:15 - INFO - __main__ - Step 13441: {'lr': 0.000492664948880319, 'samples': 2580672, 'steps': 13440, 'loss/train': 1.513269305229187} 11/06/2021 23:06:15 - INFO - __main__ - Step 13442: {'lr': 0.0004926636727819057, 'samples': 2580864, 'steps': 13441, 'loss/train': 2.367624282836914} 11/06/2021 23:06:15 - INFO - __main__ - Step 13443: {'lr': 0.0004926623965741519, 'samples': 2581056, 'steps': 13442, 'loss/train': 1.679581880569458} 11/06/2021 23:06:16 - INFO - __main__ - Step 13444: {'lr': 0.0004926611202570582, 'samples': 2581248, 'steps': 13443, 'loss/train': 1.8118009567260742} 11/06/2021 23:06:17 - INFO - __main__ - Step 13445: {'lr': 0.0004926598438306252, 'samples': 2581440, 'steps': 13444, 'loss/train': 1.1700359582901} 11/06/2021 23:06:17 - INFO - __main__ - Step 13446: {'lr': 0.0004926585672948532, 'samples': 2581632, 'steps': 13445, 'loss/train': 1.750053882598877} 11/06/2021 23:06:17 - INFO - __main__ - Step 13447: {'lr': 0.0004926572906497432, 'samples': 2581824, 'steps': 13446, 'loss/train': 1.81050705909729} 11/06/2021 23:06:18 - INFO - __main__ - Step 13448: {'lr': 0.0004926560138952955, 'samples': 2582016, 'steps': 13447, 'loss/train': 1.7286831140518188} 11/06/2021 23:06:19 - INFO - __main__ - Step 13449: {'lr': 0.0004926547370315106, 'samples': 2582208, 'steps': 13448, 'loss/train': 1.6813172101974487} 11/06/2021 23:06:19 - INFO - __main__ - Step 13450: {'lr': 0.0004926534600583894, 'samples': 2582400, 'steps': 13449, 'loss/train': 3.3553547859191895} 11/06/2021 23:06:20 - INFO - __main__ - Step 13451: {'lr': 0.0004926521829759323, 'samples': 2582592, 'steps': 13450, 'loss/train': 2.9841389656066895} 11/06/2021 23:06:20 - INFO - __main__ - Step 13452: {'lr': 0.0004926509057841397, 'samples': 2582784, 'steps': 13451, 'loss/train': 2.232959032058716} 11/06/2021 23:06:20 - INFO - __main__ - Step 13453: {'lr': 0.0004926496284830125, 'samples': 2582976, 'steps': 13452, 'loss/train': 0.9120754599571228} 11/06/2021 23:06:21 - INFO - __main__ - Step 13454: {'lr': 0.0004926483510725511, 'samples': 2583168, 'steps': 13453, 'loss/train': 0.20907138288021088} 11/06/2021 23:06:22 - INFO - __main__ - Step 13455: {'lr': 0.000492647073552756, 'samples': 2583360, 'steps': 13454, 'loss/train': 0.17836831510066986} 11/06/2021 23:06:22 - INFO - __main__ - Step 13456: {'lr': 0.000492645795923628, 'samples': 2583552, 'steps': 13455, 'loss/train': 1.9313207864761353} 11/06/2021 23:06:23 - INFO - __main__ - Step 13457: {'lr': 0.0004926445181851675, 'samples': 2583744, 'steps': 13456, 'loss/train': 1.514086365699768} 11/06/2021 23:06:23 - INFO - __main__ - Step 13458: {'lr': 0.0004926432403373752, 'samples': 2583936, 'steps': 13457, 'loss/train': 2.219507932662964} 11/06/2021 23:06:23 - INFO - __main__ - Step 13459: {'lr': 0.0004926419623802515, 'samples': 2584128, 'steps': 13458, 'loss/train': 1.9155421257019043} 11/06/2021 23:06:24 - INFO - __main__ - Step 13460: {'lr': 0.0004926406843137971, 'samples': 2584320, 'steps': 13459, 'loss/train': 1.7302833795547485} 11/06/2021 23:06:25 - INFO - __main__ - Step 13461: {'lr': 0.0004926394061380126, 'samples': 2584512, 'steps': 13460, 'loss/train': 1.796723484992981} 11/06/2021 23:06:25 - INFO - __main__ - Step 13462: {'lr': 0.0004926381278528984, 'samples': 2584704, 'steps': 13461, 'loss/train': 1.6994954347610474} 11/06/2021 23:06:25 - INFO - __main__ - Step 13463: {'lr': 0.0004926368494584553, 'samples': 2584896, 'steps': 13462, 'loss/train': 0.4757920205593109} 11/06/2021 23:06:26 - INFO - __main__ - Step 13464: {'lr': 0.0004926355709546838, 'samples': 2585088, 'steps': 13463, 'loss/train': 1.7972795963287354} 11/06/2021 23:06:26 - INFO - __main__ - Step 13465: {'lr': 0.0004926342923415844, 'samples': 2585280, 'steps': 13464, 'loss/train': 1.8750228881835938} 11/06/2021 23:06:27 - INFO - __main__ - Step 13466: {'lr': 0.0004926330136191577, 'samples': 2585472, 'steps': 13465, 'loss/train': 2.082622766494751} 11/06/2021 23:06:28 - INFO - __main__ - Step 13467: {'lr': 0.0004926317347874044, 'samples': 2585664, 'steps': 13466, 'loss/train': 1.9300429821014404} 11/06/2021 23:06:28 - INFO - __main__ - Step 13468: {'lr': 0.000492630455846325, 'samples': 2585856, 'steps': 13467, 'loss/train': 1.32011079788208} 11/06/2021 23:06:28 - INFO - __main__ - Step 13469: {'lr': 0.0004926291767959199, 'samples': 2586048, 'steps': 13468, 'loss/train': 3.767948865890503} 11/06/2021 23:06:29 - INFO - __main__ - Step 13470: {'lr': 0.00049262789763619, 'samples': 2586240, 'steps': 13469, 'loss/train': 1.0086846351623535} 11/06/2021 23:06:30 - INFO - __main__ - Step 13471: {'lr': 0.0004926266183671356, 'samples': 2586432, 'steps': 13470, 'loss/train': 1.8389583826065063} 11/06/2021 23:06:30 - INFO - __main__ - Step 13472: {'lr': 0.0004926253389887575, 'samples': 2586624, 'steps': 13471, 'loss/train': 1.7652056217193604} 11/06/2021 23:06:31 - INFO - __main__ - Step 13473: {'lr': 0.0004926240595010561, 'samples': 2586816, 'steps': 13472, 'loss/train': 1.13629949092865} 11/06/2021 23:06:31 - INFO - __main__ - Step 13474: {'lr': 0.000492622779904032, 'samples': 2587008, 'steps': 13473, 'loss/train': 1.531381368637085} 11/06/2021 23:06:31 - INFO - __main__ - Step 13475: {'lr': 0.000492621500197686, 'samples': 2587200, 'steps': 13474, 'loss/train': 1.4940797090530396} 11/06/2021 23:06:32 - INFO - __main__ - Step 13476: {'lr': 0.0004926202203820182, 'samples': 2587392, 'steps': 13475, 'loss/train': 1.8899670839309692} 11/06/2021 23:06:33 - INFO - __main__ - Step 13477: {'lr': 0.0004926189404570297, 'samples': 2587584, 'steps': 13476, 'loss/train': 1.735036015510559} 11/06/2021 23:06:33 - INFO - __main__ - Step 13478: {'lr': 0.0004926176604227208, 'samples': 2587776, 'steps': 13477, 'loss/train': 1.7047059535980225} 11/06/2021 23:06:33 - INFO - __main__ - Step 13479: {'lr': 0.0004926163802790922, 'samples': 2587968, 'steps': 13478, 'loss/train': 2.1699039936065674} 11/06/2021 23:06:34 - INFO - __main__ - Step 13480: {'lr': 0.0004926151000261442, 'samples': 2588160, 'steps': 13479, 'loss/train': 1.8087481260299683} 11/06/2021 23:06:34 - INFO - __main__ - Step 13481: {'lr': 0.0004926138196638777, 'samples': 2588352, 'steps': 13480, 'loss/train': 1.056817889213562} 11/06/2021 23:06:35 - INFO - __main__ - Step 13482: {'lr': 0.0004926125391922932, 'samples': 2588544, 'steps': 13481, 'loss/train': 1.3383749723434448} 11/06/2021 23:06:36 - INFO - __main__ - Step 13483: {'lr': 0.0004926112586113912, 'samples': 2588736, 'steps': 13482, 'loss/train': 1.4926176071166992} 11/06/2021 23:06:36 - INFO - __main__ - Step 13484: {'lr': 0.0004926099779211723, 'samples': 2588928, 'steps': 13483, 'loss/train': 1.5623829364776611} 11/06/2021 23:06:36 - INFO - __main__ - Step 13485: {'lr': 0.0004926086971216371, 'samples': 2589120, 'steps': 13484, 'loss/train': 1.7945960760116577} 11/06/2021 23:06:37 - INFO - __main__ - Step 13486: {'lr': 0.0004926074162127862, 'samples': 2589312, 'steps': 13485, 'loss/train': 1.6965285539627075} 11/06/2021 23:06:38 - INFO - __main__ - Step 13487: {'lr': 0.0004926061351946201, 'samples': 2589504, 'steps': 13486, 'loss/train': 2.568559169769287} 11/06/2021 23:06:39 - INFO - __main__ - Step 13488: {'lr': 0.0004926048540671394, 'samples': 2589696, 'steps': 13487, 'loss/train': 0.929760754108429} 11/06/2021 23:06:39 - INFO - __main__ - Step 13489: {'lr': 0.0004926035728303447, 'samples': 2589888, 'steps': 13488, 'loss/train': 1.9634153842926025} 11/06/2021 23:06:39 - INFO - __main__ - Step 13490: {'lr': 0.0004926022914842366, 'samples': 2590080, 'steps': 13489, 'loss/train': 0.19946490228176117} 11/06/2021 23:06:40 - INFO - __main__ - Step 13491: {'lr': 0.0004926010100288156, 'samples': 2590272, 'steps': 13490, 'loss/train': 1.7599334716796875} 11/06/2021 23:06:41 - INFO - __main__ - Step 13492: {'lr': 0.0004925997284640823, 'samples': 2590464, 'steps': 13491, 'loss/train': 1.668752908706665} 11/06/2021 23:06:41 - INFO - __main__ - Step 13493: {'lr': 0.0004925984467900374, 'samples': 2590656, 'steps': 13492, 'loss/train': 1.4135771989822388} 11/06/2021 23:06:41 - INFO - __main__ - Step 13494: {'lr': 0.0004925971650066814, 'samples': 2590848, 'steps': 13493, 'loss/train': 2.2493417263031006} 11/06/2021 23:06:42 - INFO - __main__ - Step 13495: {'lr': 0.0004925958831140147, 'samples': 2591040, 'steps': 13494, 'loss/train': 1.576822280883789} 11/06/2021 23:06:42 - INFO - __main__ - Step 13496: {'lr': 0.0004925946011120382, 'samples': 2591232, 'steps': 13495, 'loss/train': 2.3239166736602783} 11/06/2021 23:06:43 - INFO - __main__ - Step 13497: {'lr': 0.0004925933190007523, 'samples': 2591424, 'steps': 13496, 'loss/train': 1.5112223625183105} 11/06/2021 23:06:44 - INFO - __main__ - Step 13498: {'lr': 0.0004925920367801575, 'samples': 2591616, 'steps': 13497, 'loss/train': 1.7464356422424316} 11/06/2021 23:06:44 - INFO - __main__ - Step 13499: {'lr': 0.0004925907544502545, 'samples': 2591808, 'steps': 13498, 'loss/train': 1.7748905420303345} 11/06/2021 23:06:44 - INFO - __main__ - Step 13500: {'lr': 0.000492589472011044, 'samples': 2592000, 'steps': 13499, 'loss/train': 2.0992441177368164} 11/06/2021 23:06:45 - INFO - __main__ - Step 13501: {'lr': 0.0004925881894625263, 'samples': 2592192, 'steps': 13500, 'loss/train': 1.2124189138412476} 11/06/2021 23:06:46 - INFO - __main__ - Step 13502: {'lr': 0.0004925869068047021, 'samples': 2592384, 'steps': 13501, 'loss/train': 1.7404388189315796} 11/06/2021 23:06:46 - INFO - __main__ - Step 13503: {'lr': 0.000492585624037572, 'samples': 2592576, 'steps': 13502, 'loss/train': 1.5108038187026978} 11/06/2021 23:06:46 - INFO - __main__ - Step 13504: {'lr': 0.0004925843411611366, 'samples': 2592768, 'steps': 13503, 'loss/train': 1.60819673538208} 11/06/2021 23:06:47 - INFO - __main__ - Step 13505: {'lr': 0.0004925830581753964, 'samples': 2592960, 'steps': 13504, 'loss/train': 2.066511631011963} 11/06/2021 23:06:47 - INFO - __main__ - Step 13506: {'lr': 0.000492581775080352, 'samples': 2593152, 'steps': 13505, 'loss/train': 1.5650289058685303} 11/06/2021 23:06:48 - INFO - __main__ - Step 13507: {'lr': 0.000492580491876004, 'samples': 2593344, 'steps': 13506, 'loss/train': 1.7949551343917847} 11/06/2021 23:06:48 - INFO - __main__ - Step 13508: {'lr': 0.000492579208562353, 'samples': 2593536, 'steps': 13507, 'loss/train': 1.567403793334961} 11/06/2021 23:06:49 - INFO - __main__ - Step 13509: {'lr': 0.0004925779251393995, 'samples': 2593728, 'steps': 13508, 'loss/train': 2.0398895740509033} 11/06/2021 23:06:49 - INFO - __main__ - Step 13510: {'lr': 0.0004925766416071441, 'samples': 2593920, 'steps': 13509, 'loss/train': 1.7081353664398193} 11/06/2021 23:06:49 - INFO - __main__ - Step 13511: {'lr': 0.0004925753579655876, 'samples': 2594112, 'steps': 13510, 'loss/train': 1.5593316555023193} 11/06/2021 23:06:51 - INFO - __main__ - Step 13512: {'lr': 0.0004925740742147302, 'samples': 2594304, 'steps': 13511, 'loss/train': 1.8523802757263184} 11/06/2021 23:06:51 - INFO - __main__ - Step 13513: {'lr': 0.0004925727903545727, 'samples': 2594496, 'steps': 13512, 'loss/train': 1.662975788116455} 11/06/2021 23:06:51 - INFO - __main__ - Step 13514: {'lr': 0.0004925715063851157, 'samples': 2594688, 'steps': 13513, 'loss/train': 1.6688578128814697} 11/06/2021 23:06:52 - INFO - __main__ - Step 13515: {'lr': 0.0004925702223063597, 'samples': 2594880, 'steps': 13514, 'loss/train': 0.6073575019836426} 11/06/2021 23:06:52 - INFO - __main__ - Step 13516: {'lr': 0.0004925689381183052, 'samples': 2595072, 'steps': 13515, 'loss/train': 1.6856118440628052} 11/06/2021 23:06:52 - INFO - __main__ - Step 13517: {'lr': 0.0004925676538209531, 'samples': 2595264, 'steps': 13516, 'loss/train': 2.147181510925293} 11/06/2021 23:06:53 - INFO - __main__ - Step 13518: {'lr': 0.0004925663694143036, 'samples': 2595456, 'steps': 13517, 'loss/train': 1.5106788873672485} 11/06/2021 23:06:54 - INFO - __main__ - Step 13519: {'lr': 0.0004925650848983575, 'samples': 2595648, 'steps': 13518, 'loss/train': 1.8200280666351318} 11/06/2021 23:06:54 - INFO - __main__ - Step 13520: {'lr': 0.0004925638002731153, 'samples': 2595840, 'steps': 13519, 'loss/train': 1.5269951820373535} 11/06/2021 23:06:54 - INFO - __main__ - Step 13521: {'lr': 0.0004925625155385775, 'samples': 2596032, 'steps': 13520, 'loss/train': 1.926583170890808} 11/06/2021 23:06:55 - INFO - __main__ - Step 13522: {'lr': 0.0004925612306947449, 'samples': 2596224, 'steps': 13521, 'loss/train': 1.5432051420211792} 11/06/2021 23:06:56 - INFO - __main__ - Step 13523: {'lr': 0.0004925599457416179, 'samples': 2596416, 'steps': 13522, 'loss/train': 1.8082655668258667} 11/06/2021 23:06:56 - INFO - __main__ - Step 13524: {'lr': 0.0004925586606791972, 'samples': 2596608, 'steps': 13523, 'loss/train': 1.771362543106079} 11/06/2021 23:06:56 - INFO - __main__ - Step 13525: {'lr': 0.0004925573755074832, 'samples': 2596800, 'steps': 13524, 'loss/train': 1.9710201025009155} 11/06/2021 23:06:57 - INFO - __main__ - Step 13526: {'lr': 0.0004925560902264766, 'samples': 2596992, 'steps': 13525, 'loss/train': 1.8510551452636719} 11/06/2021 23:06:57 - INFO - __main__ - Step 13527: {'lr': 0.000492554804836178, 'samples': 2597184, 'steps': 13526, 'loss/train': 1.7917557954788208} 11/06/2021 23:06:59 - INFO - __main__ - Step 13528: {'lr': 0.000492553519336588, 'samples': 2597376, 'steps': 13527, 'loss/train': 1.3388181924819946} 11/06/2021 23:06:59 - INFO - __main__ - Step 13529: {'lr': 0.000492552233727707, 'samples': 2597568, 'steps': 13528, 'loss/train': 1.9206374883651733} 11/06/2021 23:07:00 - INFO - __main__ - Step 13530: {'lr': 0.0004925509480095358, 'samples': 2597760, 'steps': 13529, 'loss/train': 1.7673835754394531} 11/06/2021 23:07:00 - INFO - __main__ - Step 13531: {'lr': 0.0004925496621820749, 'samples': 2597952, 'steps': 13530, 'loss/train': 1.4747488498687744} 11/06/2021 23:07:00 - INFO - __main__ - Step 13532: {'lr': 0.0004925483762453249, 'samples': 2598144, 'steps': 13531, 'loss/train': 0.7193357944488525} 11/06/2021 23:07:01 - INFO - __main__ - Step 13533: {'lr': 0.0004925470901992863, 'samples': 2598336, 'steps': 13532, 'loss/train': 1.6905871629714966} 11/06/2021 23:07:02 - INFO - __main__ - Step 13534: {'lr': 0.0004925458040439596, 'samples': 2598528, 'steps': 13533, 'loss/train': 1.825111985206604} 11/06/2021 23:07:02 - INFO - __main__ - Step 13535: {'lr': 0.0004925445177793457, 'samples': 2598720, 'steps': 13534, 'loss/train': 1.879388689994812} 11/06/2021 23:07:02 - INFO - __main__ - Step 13536: {'lr': 0.0004925432314054448, 'samples': 2598912, 'steps': 13535, 'loss/train': 1.898144006729126} 11/06/2021 23:07:03 - INFO - __main__ - Step 13537: {'lr': 0.0004925419449222578, 'samples': 2599104, 'steps': 13536, 'loss/train': 2.1349470615386963} 11/06/2021 23:07:04 - INFO - __main__ - Step 13538: {'lr': 0.0004925406583297851, 'samples': 2599296, 'steps': 13537, 'loss/train': 1.8483484983444214} 11/06/2021 23:07:04 - INFO - __main__ - Step 13539: {'lr': 0.0004925393716280274, 'samples': 2599488, 'steps': 13538, 'loss/train': 1.1511164903640747} 11/06/2021 23:07:04 - INFO - __main__ - Step 13540: {'lr': 0.0004925380848169851, 'samples': 2599680, 'steps': 13539, 'loss/train': 1.4362045526504517} 11/06/2021 23:07:05 - INFO - __main__ - Step 13541: {'lr': 0.0004925367978966588, 'samples': 2599872, 'steps': 13540, 'loss/train': 2.184403419494629} 11/06/2021 23:07:05 - INFO - __main__ - Step 13542: {'lr': 0.0004925355108670493, 'samples': 2600064, 'steps': 13541, 'loss/train': 1.5568304061889648} 11/06/2021 23:07:06 - INFO - __main__ - Step 13543: {'lr': 0.0004925342237281571, 'samples': 2600256, 'steps': 13542, 'loss/train': 1.6341681480407715} 11/06/2021 23:07:06 - INFO - __main__ - Step 13544: {'lr': 0.0004925329364799825, 'samples': 2600448, 'steps': 13543, 'loss/train': 1.4727433919906616} 11/06/2021 23:07:07 - INFO - __main__ - Step 13545: {'lr': 0.0004925316491225265, 'samples': 2600640, 'steps': 13544, 'loss/train': 1.715566873550415} 11/06/2021 23:07:07 - INFO - __main__ - Step 13546: {'lr': 0.0004925303616557893, 'samples': 2600832, 'steps': 13545, 'loss/train': 1.6563913822174072} 11/06/2021 23:07:08 - INFO - __main__ - Step 13547: {'lr': 0.0004925290740797718, 'samples': 2601024, 'steps': 13546, 'loss/train': 1.9867644309997559} 11/06/2021 23:07:08 - INFO - __main__ - Step 13548: {'lr': 0.0004925277863944745, 'samples': 2601216, 'steps': 13547, 'loss/train': 2.0575523376464844} 11/06/2021 23:07:09 - INFO - __main__ - Step 13549: {'lr': 0.0004925264985998978, 'samples': 2601408, 'steps': 13548, 'loss/train': 1.7261626720428467} 11/06/2021 23:07:09 - INFO - __main__ - Step 13550: {'lr': 0.0004925252106960425, 'samples': 2601600, 'steps': 13549, 'loss/train': 1.596949577331543} 11/06/2021 23:07:10 - INFO - __main__ - Step 13551: {'lr': 0.000492523922682909, 'samples': 2601792, 'steps': 13550, 'loss/train': 1.3621716499328613} 11/06/2021 23:07:10 - INFO - __main__ - Step 13552: {'lr': 0.0004925226345604979, 'samples': 2601984, 'steps': 13551, 'loss/train': 1.458478569984436} 11/06/2021 23:07:10 - INFO - __main__ - Step 13553: {'lr': 0.0004925213463288099, 'samples': 2602176, 'steps': 13552, 'loss/train': 1.587988257408142} 11/06/2021 23:07:11 - INFO - __main__ - Step 13554: {'lr': 0.0004925200579878456, 'samples': 2602368, 'steps': 13553, 'loss/train': 1.3299733400344849} 11/06/2021 23:07:12 - INFO - __main__ - Step 13555: {'lr': 0.0004925187695376055, 'samples': 2602560, 'steps': 13554, 'loss/train': 1.4566415548324585} 11/06/2021 23:07:12 - INFO - __main__ - Step 13556: {'lr': 0.0004925174809780901, 'samples': 2602752, 'steps': 13555, 'loss/train': 1.6076452732086182} 11/06/2021 23:07:12 - INFO - __main__ - Step 13557: {'lr': 0.0004925161923093001, 'samples': 2602944, 'steps': 13556, 'loss/train': 1.5300875902175903} 11/06/2021 23:07:13 - INFO - __main__ - Step 13558: {'lr': 0.000492514903531236, 'samples': 2603136, 'steps': 13557, 'loss/train': 1.9013261795043945} 11/06/2021 23:07:14 - INFO - __main__ - Step 13559: {'lr': 0.0004925136146438986, 'samples': 2603328, 'steps': 13558, 'loss/train': 1.5588353872299194} 11/06/2021 23:07:14 - INFO - __main__ - Step 13560: {'lr': 0.0004925123256472881, 'samples': 2603520, 'steps': 13559, 'loss/train': 1.529636263847351} 11/06/2021 23:07:14 - INFO - __main__ - Step 13561: {'lr': 0.0004925110365414054, 'samples': 2603712, 'steps': 13560, 'loss/train': 1.6311484575271606} 11/06/2021 23:07:15 - INFO - __main__ - Step 13562: {'lr': 0.0004925097473262509, 'samples': 2603904, 'steps': 13561, 'loss/train': 1.773667573928833} 11/06/2021 23:07:15 - INFO - __main__ - Step 13563: {'lr': 0.0004925084580018253, 'samples': 2604096, 'steps': 13562, 'loss/train': 1.412859559059143} 11/06/2021 23:07:16 - INFO - __main__ - Step 13564: {'lr': 0.0004925071685681292, 'samples': 2604288, 'steps': 13563, 'loss/train': 2.0537023544311523} 11/06/2021 23:07:16 - INFO - __main__ - Step 13565: {'lr': 0.000492505879025163, 'samples': 2604480, 'steps': 13564, 'loss/train': 1.4484230279922485} 11/06/2021 23:07:17 - INFO - __main__ - Step 13566: {'lr': 0.0004925045893729274, 'samples': 2604672, 'steps': 13565, 'loss/train': 1.8159236907958984} 11/06/2021 23:07:17 - INFO - __main__ - Step 13567: {'lr': 0.000492503299611423, 'samples': 2604864, 'steps': 13566, 'loss/train': 1.2261930704116821} 11/06/2021 23:07:18 - INFO - __main__ - Step 13568: {'lr': 0.0004925020097406504, 'samples': 2605056, 'steps': 13567, 'loss/train': 1.8330031633377075} 11/06/2021 23:07:19 - INFO - __main__ - Step 13569: {'lr': 0.00049250071976061, 'samples': 2605248, 'steps': 13568, 'loss/train': 1.5868868827819824} 11/06/2021 23:07:19 - INFO - __main__ - Step 13570: {'lr': 0.0004924994296713026, 'samples': 2605440, 'steps': 13569, 'loss/train': 1.7566038370132446} 11/06/2021 23:07:19 - INFO - __main__ - Step 13571: {'lr': 0.0004924981394727288, 'samples': 2605632, 'steps': 13570, 'loss/train': 1.4361501932144165} 11/06/2021 23:07:20 - INFO - __main__ - Step 13572: {'lr': 0.0004924968491648889, 'samples': 2605824, 'steps': 13571, 'loss/train': 1.6027421951293945} 11/06/2021 23:07:20 - INFO - __main__ - Step 13573: {'lr': 0.0004924955587477837, 'samples': 2606016, 'steps': 13572, 'loss/train': 1.6203863620758057} 11/06/2021 23:07:21 - INFO - __main__ - Step 13574: {'lr': 0.0004924942682214138, 'samples': 2606208, 'steps': 13573, 'loss/train': 1.7938021421432495} 11/06/2021 23:07:21 - INFO - __main__ - Step 13575: {'lr': 0.0004924929775857798, 'samples': 2606400, 'steps': 13574, 'loss/train': 0.9622952938079834} 11/06/2021 23:07:22 - INFO - __main__ - Step 13576: {'lr': 0.0004924916868408821, 'samples': 2606592, 'steps': 13575, 'loss/train': 1.0422208309173584} 11/06/2021 23:07:22 - INFO - __main__ - Step 13577: {'lr': 0.0004924903959867214, 'samples': 2606784, 'steps': 13576, 'loss/train': 1.6397504806518555} 11/06/2021 23:07:22 - INFO - __main__ - Step 13578: {'lr': 0.0004924891050232984, 'samples': 2606976, 'steps': 13577, 'loss/train': 1.8606226444244385} 11/06/2021 23:07:23 - INFO - __main__ - Step 13579: {'lr': 0.0004924878139506134, 'samples': 2607168, 'steps': 13578, 'loss/train': 1.482366681098938} 11/06/2021 23:07:24 - INFO - __main__ - Step 13580: {'lr': 0.0004924865227686671, 'samples': 2607360, 'steps': 13579, 'loss/train': 1.4506975412368774} 11/06/2021 23:07:25 - INFO - __main__ - Step 13581: {'lr': 0.0004924852314774602, 'samples': 2607552, 'steps': 13580, 'loss/train': 2.16103196144104} 11/06/2021 23:07:25 - INFO - __main__ - Step 13582: {'lr': 0.0004924839400769932, 'samples': 2607744, 'steps': 13581, 'loss/train': 1.665818452835083} 11/06/2021 23:07:25 - INFO - __main__ - Step 13583: {'lr': 0.0004924826485672667, 'samples': 2607936, 'steps': 13582, 'loss/train': 0.9627843499183655} 11/06/2021 23:07:26 - INFO - __main__ - Step 13584: {'lr': 0.0004924813569482812, 'samples': 2608128, 'steps': 13583, 'loss/train': 1.1385881900787354} 11/06/2021 23:07:27 - INFO - __main__ - Step 13585: {'lr': 0.0004924800652200373, 'samples': 2608320, 'steps': 13584, 'loss/train': 1.7369372844696045} 11/06/2021 23:07:27 - INFO - __main__ - Step 13586: {'lr': 0.0004924787733825357, 'samples': 2608512, 'steps': 13585, 'loss/train': 2.386915922164917} 11/06/2021 23:07:27 - INFO - __main__ - Step 13587: {'lr': 0.0004924774814357768, 'samples': 2608704, 'steps': 13586, 'loss/train': 1.7782405614852905} 11/06/2021 23:07:28 - INFO - __main__ - Step 13588: {'lr': 0.0004924761893797615, 'samples': 2608896, 'steps': 13587, 'loss/train': 1.8888113498687744} 11/06/2021 23:07:28 - INFO - __main__ - Step 13589: {'lr': 0.00049247489721449, 'samples': 2609088, 'steps': 13588, 'loss/train': 5.811814785003662} 11/06/2021 23:07:29 - INFO - __main__ - Step 13590: {'lr': 0.0004924736049399631, 'samples': 2609280, 'steps': 13589, 'loss/train': 1.3031723499298096} 11/06/2021 23:07:29 - INFO - __main__ - Step 13591: {'lr': 0.0004924723125561813, 'samples': 2609472, 'steps': 13590, 'loss/train': 1.8062859773635864} 11/06/2021 23:07:30 - INFO - __main__ - Step 13592: {'lr': 0.0004924710200631453, 'samples': 2609664, 'steps': 13591, 'loss/train': 1.475710153579712} 11/06/2021 23:07:30 - INFO - __main__ - Step 13593: {'lr': 0.0004924697274608556, 'samples': 2609856, 'steps': 13592, 'loss/train': 1.6289951801300049} 11/06/2021 23:07:31 - INFO - __main__ - Step 13594: {'lr': 0.0004924684347493126, 'samples': 2610048, 'steps': 13593, 'loss/train': 1.741284966468811} 11/06/2021 23:07:32 - INFO - __main__ - Step 13595: {'lr': 0.0004924671419285172, 'samples': 2610240, 'steps': 13594, 'loss/train': 1.871199369430542} 11/06/2021 23:07:32 - INFO - __main__ - Step 13596: {'lr': 0.0004924658489984699, 'samples': 2610432, 'steps': 13595, 'loss/train': 1.5763531923294067} 11/06/2021 23:07:32 - INFO - __main__ - Step 13597: {'lr': 0.0004924645559591712, 'samples': 2610624, 'steps': 13596, 'loss/train': 1.8481301069259644} 11/06/2021 23:07:33 - INFO - __main__ - Step 13598: {'lr': 0.0004924632628106217, 'samples': 2610816, 'steps': 13597, 'loss/train': 1.9725139141082764} 11/06/2021 23:07:33 - INFO - __main__ - Step 13599: {'lr': 0.000492461969552822, 'samples': 2611008, 'steps': 13598, 'loss/train': 1.4551993608474731} 11/06/2021 23:07:33 - INFO - __main__ - Step 13600: {'lr': 0.0004924606761857726, 'samples': 2611200, 'steps': 13599, 'loss/train': 1.2987780570983887} 11/06/2021 23:07:34 - INFO - __main__ - Step 13601: {'lr': 0.0004924593827094744, 'samples': 2611392, 'steps': 13600, 'loss/train': 1.725089430809021} 11/06/2021 23:07:35 - INFO - __main__ - Step 13602: {'lr': 0.0004924580891239274, 'samples': 2611584, 'steps': 13601, 'loss/train': 1.9037177562713623} 11/06/2021 23:07:35 - INFO - __main__ - Step 13603: {'lr': 0.0004924567954291328, 'samples': 2611776, 'steps': 13602, 'loss/train': 1.6505497694015503} 11/06/2021 23:07:35 - INFO - __main__ - Step 13604: {'lr': 0.0004924555016250908, 'samples': 2611968, 'steps': 13603, 'loss/train': 1.9242398738861084} 11/06/2021 23:07:36 - INFO - __main__ - Step 13605: {'lr': 0.0004924542077118021, 'samples': 2612160, 'steps': 13604, 'loss/train': 1.6586657762527466} 11/06/2021 23:07:38 - INFO - __main__ - Step 13606: {'lr': 0.0004924529136892673, 'samples': 2612352, 'steps': 13605, 'loss/train': 1.6785545349121094} 11/06/2021 23:07:38 - INFO - __main__ - Step 13607: {'lr': 0.0004924516195574869, 'samples': 2612544, 'steps': 13606, 'loss/train': 0.57390958070755} 11/06/2021 23:07:39 - INFO - __main__ - Step 13608: {'lr': 0.0004924503253164614, 'samples': 2612736, 'steps': 13607, 'loss/train': 1.4627878665924072} 11/06/2021 23:07:39 - INFO - __main__ - Step 13609: {'lr': 0.0004924490309661918, 'samples': 2612928, 'steps': 13608, 'loss/train': 0.6431192755699158} 11/06/2021 23:07:39 - INFO - __main__ - Step 13610: {'lr': 0.0004924477365066783, 'samples': 2613120, 'steps': 13609, 'loss/train': 1.783947229385376} 11/06/2021 23:07:40 - INFO - __main__ - Step 13611: {'lr': 0.0004924464419379217, 'samples': 2613312, 'steps': 13610, 'loss/train': 1.6398215293884277} 11/06/2021 23:07:41 - INFO - __main__ - Step 13612: {'lr': 0.0004924451472599222, 'samples': 2613504, 'steps': 13611, 'loss/train': 0.23156870901584625} 11/06/2021 23:07:41 - INFO - __main__ - Step 13613: {'lr': 0.000492443852472681, 'samples': 2613696, 'steps': 13612, 'loss/train': 1.843045711517334} 11/06/2021 23:07:41 - INFO - __main__ - Step 13614: {'lr': 0.000492442557576198, 'samples': 2613888, 'steps': 13613, 'loss/train': 1.7973626852035522} 11/06/2021 23:07:42 - INFO - __main__ - Step 13615: {'lr': 0.0004924412625704744, 'samples': 2614080, 'steps': 13614, 'loss/train': 2.342195749282837} 11/06/2021 23:07:42 - INFO - __main__ - Step 13616: {'lr': 0.0004924399674555103, 'samples': 2614272, 'steps': 13615, 'loss/train': 1.5847399234771729} 11/06/2021 23:07:42 - INFO - __main__ - Step 13617: {'lr': 0.0004924386722313066, 'samples': 2614464, 'steps': 13616, 'loss/train': 1.906503438949585} 11/06/2021 23:07:43 - INFO - __main__ - Step 13618: {'lr': 0.0004924373768978638, 'samples': 2614656, 'steps': 13617, 'loss/train': 2.0704801082611084} 11/06/2021 23:07:44 - INFO - __main__ - Step 13619: {'lr': 0.0004924360814551825, 'samples': 2614848, 'steps': 13618, 'loss/train': 1.9433788061141968} 11/06/2021 23:07:44 - INFO - __main__ - Step 13620: {'lr': 0.0004924347859032631, 'samples': 2615040, 'steps': 13619, 'loss/train': 1.7289719581604004} 11/06/2021 23:07:44 - INFO - __main__ - Step 13621: {'lr': 0.0004924334902421065, 'samples': 2615232, 'steps': 13620, 'loss/train': 2.022456407546997} 11/06/2021 23:07:45 - INFO - __main__ - Step 13622: {'lr': 0.0004924321944717129, 'samples': 2615424, 'steps': 13621, 'loss/train': 1.9337289333343506} 11/06/2021 23:07:46 - INFO - __main__ - Step 13623: {'lr': 0.0004924308985920832, 'samples': 2615616, 'steps': 13622, 'loss/train': 2.2961039543151855} 11/06/2021 23:07:46 - INFO - __main__ - Step 13624: {'lr': 0.0004924296026032179, 'samples': 2615808, 'steps': 13623, 'loss/train': 1.2842239141464233} 11/06/2021 23:07:46 - INFO - __main__ - Step 13625: {'lr': 0.0004924283065051176, 'samples': 2616000, 'steps': 13624, 'loss/train': 1.8332487344741821} 11/06/2021 23:07:47 - INFO - __main__ - Step 13626: {'lr': 0.0004924270102977827, 'samples': 2616192, 'steps': 13625, 'loss/train': 2.1077988147735596} 11/06/2021 23:07:47 - INFO - __main__ - Step 13627: {'lr': 0.0004924257139812141, 'samples': 2616384, 'steps': 13626, 'loss/train': 1.8396973609924316} 11/06/2021 23:07:49 - INFO - __main__ - Step 13628: {'lr': 0.0004924244175554121, 'samples': 2616576, 'steps': 13627, 'loss/train': 1.9551353454589844} 11/06/2021 23:07:49 - INFO - __main__ - Step 13629: {'lr': 0.0004924231210203775, 'samples': 2616768, 'steps': 13628, 'loss/train': 1.9722517728805542} 11/06/2021 23:07:49 - INFO - __main__ - Step 13630: {'lr': 0.0004924218243761106, 'samples': 2616960, 'steps': 13629, 'loss/train': 1.3070924282073975} 11/06/2021 23:07:50 - INFO - __main__ - Step 13631: {'lr': 0.0004924205276226123, 'samples': 2617152, 'steps': 13630, 'loss/train': 1.8307408094406128} 11/06/2021 23:07:50 - INFO - __main__ - Step 13632: {'lr': 0.000492419230759883, 'samples': 2617344, 'steps': 13631, 'loss/train': 1.8106156587600708} 11/06/2021 23:07:50 - INFO - __main__ - Step 13633: {'lr': 0.0004924179337879234, 'samples': 2617536, 'steps': 13632, 'loss/train': 1.3615984916687012} 11/06/2021 23:07:51 - INFO - __main__ - Step 13634: {'lr': 0.000492416636706734, 'samples': 2617728, 'steps': 13633, 'loss/train': 1.84334397315979} 11/06/2021 23:07:52 - INFO - __main__ - Step 13635: {'lr': 0.0004924153395163153, 'samples': 2617920, 'steps': 13634, 'loss/train': 1.8057781457901} 11/06/2021 23:07:52 - INFO - __main__ - Step 13636: {'lr': 0.0004924140422166681, 'samples': 2618112, 'steps': 13635, 'loss/train': 1.1118844747543335} 11/06/2021 23:07:52 - INFO - __main__ - Step 13637: {'lr': 0.0004924127448077929, 'samples': 2618304, 'steps': 13636, 'loss/train': 1.283229112625122} 11/06/2021 23:07:53 - INFO - __main__ - Step 13638: {'lr': 0.0004924114472896902, 'samples': 2618496, 'steps': 13637, 'loss/train': 1.547655701637268} 11/06/2021 23:07:54 - INFO - __main__ - Step 13639: {'lr': 0.0004924101496623606, 'samples': 2618688, 'steps': 13638, 'loss/train': 1.6274465322494507} 11/06/2021 23:07:54 - INFO - __main__ - Step 13640: {'lr': 0.0004924088519258049, 'samples': 2618880, 'steps': 13639, 'loss/train': 1.567894697189331} 11/06/2021 23:07:55 - INFO - __main__ - Step 13641: {'lr': 0.0004924075540800233, 'samples': 2619072, 'steps': 13640, 'loss/train': 1.5722965002059937} 11/06/2021 23:07:55 - INFO - __main__ - Step 13642: {'lr': 0.0004924062561250167, 'samples': 2619264, 'steps': 13641, 'loss/train': 1.4436631202697754} 11/06/2021 23:07:55 - INFO - __main__ - Step 13643: {'lr': 0.0004924049580607855, 'samples': 2619456, 'steps': 13642, 'loss/train': 1.4802452325820923} 11/06/2021 23:07:56 - INFO - __main__ - Step 13644: {'lr': 0.0004924036598873305, 'samples': 2619648, 'steps': 13643, 'loss/train': 1.4207649230957031} 11/06/2021 23:07:57 - INFO - __main__ - Step 13645: {'lr': 0.0004924023616046521, 'samples': 2619840, 'steps': 13644, 'loss/train': 0.14849276840686798} 11/06/2021 23:07:57 - INFO - __main__ - Step 13646: {'lr': 0.000492401063212751, 'samples': 2620032, 'steps': 13645, 'loss/train': 1.5605554580688477} 11/06/2021 23:07:57 - INFO - __main__ - Step 13647: {'lr': 0.0004923997647116276, 'samples': 2620224, 'steps': 13646, 'loss/train': 0.61855548620224} 11/06/2021 23:07:58 - INFO - __main__ - Step 13648: {'lr': 0.0004923984661012827, 'samples': 2620416, 'steps': 13647, 'loss/train': 1.0472122430801392} 11/06/2021 23:07:58 - INFO - __main__ - Step 13649: {'lr': 0.0004923971673817167, 'samples': 2620608, 'steps': 13648, 'loss/train': 2.2823336124420166} 11/06/2021 23:07:59 - INFO - __main__ - Step 13650: {'lr': 0.0004923958685529303, 'samples': 2620800, 'steps': 13649, 'loss/train': 1.8961249589920044} 11/06/2021 23:07:59 - INFO - __main__ - Step 13651: {'lr': 0.0004923945696149241, 'samples': 2620992, 'steps': 13650, 'loss/train': 1.9037821292877197} 11/06/2021 23:08:00 - INFO - __main__ - Step 13652: {'lr': 0.0004923932705676986, 'samples': 2621184, 'steps': 13651, 'loss/train': 1.549149990081787} 11/06/2021 23:08:00 - INFO - __main__ - Step 13653: {'lr': 0.0004923919714112545, 'samples': 2621376, 'steps': 13652, 'loss/train': 1.5010502338409424} 11/06/2021 23:08:01 - INFO - __main__ - Step 13654: {'lr': 0.0004923906721455922, 'samples': 2621568, 'steps': 13653, 'loss/train': 1.2696095705032349} 11/06/2021 23:08:01 - INFO - __main__ - Step 13655: {'lr': 0.0004923893727707125, 'samples': 2621760, 'steps': 13654, 'loss/train': 2.0763657093048096} 11/06/2021 23:08:02 - INFO - __main__ - Step 13656: {'lr': 0.0004923880732866159, 'samples': 2621952, 'steps': 13655, 'loss/train': 1.3228414058685303} 11/06/2021 23:08:02 - INFO - __main__ - Step 13657: {'lr': 0.0004923867736933029, 'samples': 2622144, 'steps': 13656, 'loss/train': 1.621604323387146} 11/06/2021 23:08:02 - INFO - __main__ - Step 13658: {'lr': 0.0004923854739907743, 'samples': 2622336, 'steps': 13657, 'loss/train': 1.2240080833435059} 11/06/2021 23:08:03 - INFO - __main__ - Step 13659: {'lr': 0.0004923841741790304, 'samples': 2622528, 'steps': 13658, 'loss/train': 1.7025880813598633} 11/06/2021 23:08:04 - INFO - __main__ - Step 13660: {'lr': 0.0004923828742580719, 'samples': 2622720, 'steps': 13659, 'loss/train': 0.8166983723640442} 11/06/2021 23:08:04 - INFO - __main__ - Step 13661: {'lr': 0.0004923815742278996, 'samples': 2622912, 'steps': 13660, 'loss/train': 1.8481582403182983} 11/06/2021 23:08:05 - INFO - __main__ - Step 13662: {'lr': 0.0004923802740885139, 'samples': 2623104, 'steps': 13661, 'loss/train': 1.9549199342727661} 11/06/2021 23:08:05 - INFO - __main__ - Step 13663: {'lr': 0.0004923789738399152, 'samples': 2623296, 'steps': 13662, 'loss/train': 1.5047688484191895} 11/06/2021 23:08:05 - INFO - __main__ - Step 13664: {'lr': 0.0004923776734821044, 'samples': 2623488, 'steps': 13663, 'loss/train': 1.784568190574646} 11/06/2021 23:08:07 - INFO - __main__ - Step 13665: {'lr': 0.0004923763730150819, 'samples': 2623680, 'steps': 13664, 'loss/train': 1.6027264595031738} 11/06/2021 23:08:07 - INFO - __main__ - Step 13666: {'lr': 0.0004923750724388483, 'samples': 2623872, 'steps': 13665, 'loss/train': 1.826322078704834} 11/06/2021 23:08:07 - INFO - __main__ - Step 13667: {'lr': 0.0004923737717534044, 'samples': 2624064, 'steps': 13666, 'loss/train': 1.9176068305969238} 11/06/2021 23:08:08 - INFO - __main__ - Step 13668: {'lr': 0.0004923724709587504, 'samples': 2624256, 'steps': 13667, 'loss/train': 2.2338030338287354} 11/06/2021 23:08:08 - INFO - __main__ - Step 13669: {'lr': 0.0004923711700548873, 'samples': 2624448, 'steps': 13668, 'loss/train': 1.8430683612823486} 11/06/2021 23:08:09 - INFO - __main__ - Step 13670: {'lr': 0.0004923698690418154, 'samples': 2624640, 'steps': 13669, 'loss/train': 0.9396169781684875} 11/06/2021 23:08:09 - INFO - __main__ - Step 13671: {'lr': 0.0004923685679195355, 'samples': 2624832, 'steps': 13670, 'loss/train': 1.6337071657180786} 11/06/2021 23:08:10 - INFO - __main__ - Step 13672: {'lr': 0.0004923672666880479, 'samples': 2625024, 'steps': 13671, 'loss/train': 1.3676971197128296} 11/06/2021 23:08:10 - INFO - __main__ - Step 13673: {'lr': 0.0004923659653473533, 'samples': 2625216, 'steps': 13672, 'loss/train': 1.6104556322097778} 11/06/2021 23:08:10 - INFO - __main__ - Step 13674: {'lr': 0.0004923646638974524, 'samples': 2625408, 'steps': 13673, 'loss/train': 1.3942358493804932} 11/06/2021 23:08:11 - INFO - __main__ - Step 13675: {'lr': 0.0004923633623383459, 'samples': 2625600, 'steps': 13674, 'loss/train': 1.8581507205963135} 11/06/2021 23:08:12 - INFO - __main__ - Step 13676: {'lr': 0.0004923620606700341, 'samples': 2625792, 'steps': 13675, 'loss/train': 1.0747668743133545} 11/06/2021 23:08:12 - INFO - __main__ - Step 13677: {'lr': 0.0004923607588925177, 'samples': 2625984, 'steps': 13676, 'loss/train': 1.7954381704330444} 11/06/2021 23:08:13 - INFO - __main__ - Step 13678: {'lr': 0.0004923594570057972, 'samples': 2626176, 'steps': 13677, 'loss/train': 1.8228819370269775} 11/06/2021 23:08:13 - INFO - __main__ - Step 13679: {'lr': 0.0004923581550098733, 'samples': 2626368, 'steps': 13678, 'loss/train': 1.6954731941223145} 11/06/2021 23:08:13 - INFO - __main__ - Step 13680: {'lr': 0.0004923568529047466, 'samples': 2626560, 'steps': 13679, 'loss/train': 1.7315030097961426} 11/06/2021 23:08:14 - INFO - __main__ - Step 13681: {'lr': 0.0004923555506904176, 'samples': 2626752, 'steps': 13680, 'loss/train': 1.701892614364624} 11/06/2021 23:08:15 - INFO - __main__ - Step 13682: {'lr': 0.0004923542483668869, 'samples': 2626944, 'steps': 13681, 'loss/train': 1.860314130783081} 11/06/2021 23:08:15 - INFO - __main__ - Step 13683: {'lr': 0.0004923529459341553, 'samples': 2627136, 'steps': 13682, 'loss/train': 1.5771002769470215} 11/06/2021 23:08:15 - INFO - __main__ - Step 13684: {'lr': 0.000492351643392223, 'samples': 2627328, 'steps': 13683, 'loss/train': 2.214824914932251} 11/06/2021 23:08:16 - INFO - __main__ - Step 13685: {'lr': 0.0004923503407410908, 'samples': 2627520, 'steps': 13684, 'loss/train': 1.6392228603363037} 11/06/2021 23:08:17 - INFO - __main__ - Step 13686: {'lr': 0.0004923490379807594, 'samples': 2627712, 'steps': 13685, 'loss/train': 1.5940680503845215} 11/06/2021 23:08:17 - INFO - __main__ - Step 13687: {'lr': 0.0004923477351112291, 'samples': 2627904, 'steps': 13686, 'loss/train': 1.3699396848678589} 11/06/2021 23:08:17 - INFO - __main__ - Step 13688: {'lr': 0.0004923464321325008, 'samples': 2628096, 'steps': 13687, 'loss/train': 2.2057156562805176} 11/06/2021 23:08:18 - INFO - __main__ - Step 13689: {'lr': 0.0004923451290445749, 'samples': 2628288, 'steps': 13688, 'loss/train': 1.5960655212402344} 11/06/2021 23:08:18 - INFO - __main__ - Step 13690: {'lr': 0.000492343825847452, 'samples': 2628480, 'steps': 13689, 'loss/train': 2.2023274898529053} 11/06/2021 23:08:19 - INFO - __main__ - Step 13691: {'lr': 0.0004923425225411328, 'samples': 2628672, 'steps': 13690, 'loss/train': 1.8006644248962402} 11/06/2021 23:08:19 - INFO - __main__ - Step 13692: {'lr': 0.0004923412191256176, 'samples': 2628864, 'steps': 13691, 'loss/train': 1.6855508089065552} 11/06/2021 23:08:20 - INFO - __main__ - Step 13693: {'lr': 0.0004923399156009073, 'samples': 2629056, 'steps': 13692, 'loss/train': 1.91086745262146} 11/06/2021 23:08:20 - INFO - __main__ - Step 13694: {'lr': 0.0004923386119670024, 'samples': 2629248, 'steps': 13693, 'loss/train': 2.243964910507202} 11/06/2021 23:08:20 - INFO - __main__ - Step 13695: {'lr': 0.0004923373082239035, 'samples': 2629440, 'steps': 13694, 'loss/train': 1.8091576099395752} 11/06/2021 23:08:21 - INFO - __main__ - Step 13696: {'lr': 0.000492336004371611, 'samples': 2629632, 'steps': 13695, 'loss/train': 1.891566276550293} 11/06/2021 23:08:22 - INFO - __main__ - Step 13697: {'lr': 0.0004923347004101257, 'samples': 2629824, 'steps': 13696, 'loss/train': 1.8870787620544434} 11/06/2021 23:08:22 - INFO - __main__ - Step 13698: {'lr': 0.0004923333963394482, 'samples': 2630016, 'steps': 13697, 'loss/train': 1.609986424446106} 11/06/2021 23:08:22 - INFO - __main__ - Step 13699: {'lr': 0.000492332092159579, 'samples': 2630208, 'steps': 13698, 'loss/train': 1.5685456991195679} 11/06/2021 23:08:23 - INFO - __main__ - Step 13700: {'lr': 0.0004923307878705186, 'samples': 2630400, 'steps': 13699, 'loss/train': 1.4887804985046387} 11/06/2021 23:08:24 - INFO - __main__ - Step 13701: {'lr': 0.0004923294834722678, 'samples': 2630592, 'steps': 13700, 'loss/train': 1.6135252714157104} 11/06/2021 23:08:24 - INFO - __main__ - Step 13702: {'lr': 0.000492328178964827, 'samples': 2630784, 'steps': 13701, 'loss/train': 1.7054436206817627} 11/06/2021 23:08:25 - INFO - __main__ - Step 13703: {'lr': 0.0004923268743481969, 'samples': 2630976, 'steps': 13702, 'loss/train': 1.3338326215744019} 11/06/2021 23:08:25 - INFO - __main__ - Step 13704: {'lr': 0.000492325569622378, 'samples': 2631168, 'steps': 13703, 'loss/train': 1.5167887210845947} 11/06/2021 23:08:25 - INFO - __main__ - Step 13705: {'lr': 0.0004923242647873709, 'samples': 2631360, 'steps': 13704, 'loss/train': 1.7764374017715454} 11/06/2021 23:08:27 - INFO - __main__ - Step 13706: {'lr': 0.0004923229598431763, 'samples': 2631552, 'steps': 13705, 'loss/train': 1.981156349182129} 11/06/2021 23:08:27 - INFO - __main__ - Step 13707: {'lr': 0.0004923216547897948, 'samples': 2631744, 'steps': 13706, 'loss/train': 1.6128805875778198} 11/06/2021 23:08:27 - INFO - __main__ - Step 13708: {'lr': 0.0004923203496272267, 'samples': 2631936, 'steps': 13707, 'loss/train': 1.7675461769104004} 11/06/2021 23:08:28 - INFO - __main__ - Step 13709: {'lr': 0.0004923190443554729, 'samples': 2632128, 'steps': 13708, 'loss/train': 1.66963529586792} 11/06/2021 23:08:28 - INFO - __main__ - Step 13710: {'lr': 0.0004923177389745339, 'samples': 2632320, 'steps': 13709, 'loss/train': 0.4389937222003937} 11/06/2021 23:08:29 - INFO - __main__ - Step 13711: {'lr': 0.0004923164334844103, 'samples': 2632512, 'steps': 13710, 'loss/train': 1.7726091146469116} 11/06/2021 23:08:29 - INFO - __main__ - Step 13712: {'lr': 0.0004923151278851025, 'samples': 2632704, 'steps': 13711, 'loss/train': 0.9735046029090881} 11/06/2021 23:08:30 - INFO - __main__ - Step 13713: {'lr': 0.0004923138221766114, 'samples': 2632896, 'steps': 13712, 'loss/train': 1.4027715921401978} 11/06/2021 23:08:30 - INFO - __main__ - Step 13714: {'lr': 0.0004923125163589373, 'samples': 2633088, 'steps': 13713, 'loss/train': 1.875281572341919} 11/06/2021 23:08:30 - INFO - __main__ - Step 13715: {'lr': 0.0004923112104320811, 'samples': 2633280, 'steps': 13714, 'loss/train': 1.6900665760040283} 11/06/2021 23:08:32 - INFO - __main__ - Step 13716: {'lr': 0.000492309904396043, 'samples': 2633472, 'steps': 13715, 'loss/train': 1.1452865600585938} 11/06/2021 23:08:32 - INFO - __main__ - Step 13717: {'lr': 0.0004923085982508239, 'samples': 2633664, 'steps': 13716, 'loss/train': 1.9100697040557861} 11/06/2021 23:08:32 - INFO - __main__ - Step 13718: {'lr': 0.0004923072919964243, 'samples': 2633856, 'steps': 13717, 'loss/train': 4.524446964263916} 11/06/2021 23:08:33 - INFO - __main__ - Step 13719: {'lr': 0.0004923059856328447, 'samples': 2634048, 'steps': 13718, 'loss/train': 1.5310945510864258} 11/06/2021 23:08:33 - INFO - __main__ - Step 13720: {'lr': 0.0004923046791600859, 'samples': 2634240, 'steps': 13719, 'loss/train': 1.5483694076538086} 11/06/2021 23:08:34 - INFO - __main__ - Step 13721: {'lr': 0.0004923033725781482, 'samples': 2634432, 'steps': 13720, 'loss/train': 1.6037310361862183} 11/06/2021 23:08:34 - INFO - __main__ - Step 13722: {'lr': 0.0004923020658870324, 'samples': 2634624, 'steps': 13721, 'loss/train': 1.3916311264038086} 11/06/2021 23:08:35 - INFO - __main__ - Step 13723: {'lr': 0.000492300759086739, 'samples': 2634816, 'steps': 13722, 'loss/train': 0.19058111310005188} 11/06/2021 23:08:35 - INFO - __main__ - Step 13724: {'lr': 0.0004922994521772687, 'samples': 2635008, 'steps': 13723, 'loss/train': 1.856695532798767} 11/06/2021 23:08:35 - INFO - __main__ - Step 13725: {'lr': 0.000492298145158622, 'samples': 2635200, 'steps': 13724, 'loss/train': 1.6110961437225342} 11/06/2021 23:08:36 - INFO - __main__ - Step 13726: {'lr': 0.0004922968380307994, 'samples': 2635392, 'steps': 13725, 'loss/train': 2.1884765625} 11/06/2021 23:08:37 - INFO - __main__ - Step 13727: {'lr': 0.0004922955307938016, 'samples': 2635584, 'steps': 13726, 'loss/train': 1.4792718887329102} 11/06/2021 23:08:37 - INFO - __main__ - Step 13728: {'lr': 0.0004922942234476292, 'samples': 2635776, 'steps': 13727, 'loss/train': 0.9615477323532104} 11/06/2021 23:08:38 - INFO - __main__ - Step 13729: {'lr': 0.0004922929159922828, 'samples': 2635968, 'steps': 13728, 'loss/train': 1.467282772064209} 11/06/2021 23:08:38 - INFO - __main__ - Step 13730: {'lr': 0.0004922916084277629, 'samples': 2636160, 'steps': 13729, 'loss/train': 1.4172507524490356} 11/06/2021 23:08:38 - INFO - __main__ - Step 13731: {'lr': 0.0004922903007540701, 'samples': 2636352, 'steps': 13730, 'loss/train': 1.9334453344345093} 11/06/2021 23:08:40 - INFO - __main__ - Step 13732: {'lr': 0.0004922889929712051, 'samples': 2636544, 'steps': 13731, 'loss/train': 1.7975131273269653} 11/06/2021 23:08:40 - INFO - __main__ - Step 13733: {'lr': 0.0004922876850791684, 'samples': 2636736, 'steps': 13732, 'loss/train': 1.5812453031539917} 11/06/2021 23:08:40 - INFO - __main__ - Step 13734: {'lr': 0.0004922863770779606, 'samples': 2636928, 'steps': 13733, 'loss/train': 1.761626124382019} 11/06/2021 23:08:41 - INFO - __main__ - Step 13735: {'lr': 0.0004922850689675823, 'samples': 2637120, 'steps': 13734, 'loss/train': 0.9880645275115967} 11/06/2021 23:08:41 - INFO - __main__ - Step 13736: {'lr': 0.0004922837607480341, 'samples': 2637312, 'steps': 13735, 'loss/train': 1.812021255493164} 11/06/2021 23:08:41 - INFO - __main__ - Step 13737: {'lr': 0.0004922824524193166, 'samples': 2637504, 'steps': 13736, 'loss/train': 1.655940055847168} 11/06/2021 23:08:42 - INFO - __main__ - Step 13738: {'lr': 0.0004922811439814303, 'samples': 2637696, 'steps': 13737, 'loss/train': 3.0965065956115723} 11/06/2021 23:08:43 - INFO - __main__ - Step 13739: {'lr': 0.0004922798354343758, 'samples': 2637888, 'steps': 13738, 'loss/train': 1.1980981826782227} 11/06/2021 23:08:43 - INFO - __main__ - Step 13740: {'lr': 0.0004922785267781539, 'samples': 2638080, 'steps': 13739, 'loss/train': 1.4556937217712402} 11/06/2021 23:08:43 - INFO - __main__ - Step 13741: {'lr': 0.000492277218012765, 'samples': 2638272, 'steps': 13740, 'loss/train': 1.6142842769622803} 11/06/2021 23:08:44 - INFO - __main__ - Step 13742: {'lr': 0.0004922759091382097, 'samples': 2638464, 'steps': 13741, 'loss/train': 2.2603096961975098} 11/06/2021 23:08:45 - INFO - __main__ - Step 13743: {'lr': 0.0004922746001544885, 'samples': 2638656, 'steps': 13742, 'loss/train': 1.3596349954605103} 11/06/2021 23:08:45 - INFO - __main__ - Step 13744: {'lr': 0.0004922732910616023, 'samples': 2638848, 'steps': 13743, 'loss/train': 1.9842960834503174} 11/06/2021 23:08:46 - INFO - __main__ - Step 13745: {'lr': 0.0004922719818595514, 'samples': 2639040, 'steps': 13744, 'loss/train': 2.0623526573181152} 11/06/2021 23:08:46 - INFO - __main__ - Step 13746: {'lr': 0.0004922706725483364, 'samples': 2639232, 'steps': 13745, 'loss/train': 1.9149062633514404} 11/06/2021 23:08:46 - INFO - __main__ - Step 13747: {'lr': 0.0004922693631279581, 'samples': 2639424, 'steps': 13746, 'loss/train': 2.0381317138671875} 11/06/2021 23:08:47 - INFO - __main__ - Step 13748: {'lr': 0.000492268053598417, 'samples': 2639616, 'steps': 13747, 'loss/train': 2.627610445022583} 11/06/2021 23:08:48 - INFO - __main__ - Step 13749: {'lr': 0.0004922667439597136, 'samples': 2639808, 'steps': 13748, 'loss/train': 1.306639552116394} 11/06/2021 23:08:48 - INFO - __main__ - Step 13750: {'lr': 0.0004922654342118484, 'samples': 2640000, 'steps': 13749, 'loss/train': 1.3434464931488037} 11/06/2021 23:08:48 - INFO - __main__ - Step 13751: {'lr': 0.0004922641243548223, 'samples': 2640192, 'steps': 13750, 'loss/train': 2.0326554775238037} 11/06/2021 23:08:49 - INFO - __main__ - Step 13752: {'lr': 0.0004922628143886358, 'samples': 2640384, 'steps': 13751, 'loss/train': 2.049717903137207} 11/06/2021 23:08:49 - INFO - __main__ - Step 13753: {'lr': 0.0004922615043132892, 'samples': 2640576, 'steps': 13752, 'loss/train': 1.6888577938079834} 11/06/2021 23:08:50 - INFO - __main__ - Step 13754: {'lr': 0.0004922601941287835, 'samples': 2640768, 'steps': 13753, 'loss/train': 0.9408217668533325} 11/06/2021 23:08:50 - INFO - __main__ - Step 13755: {'lr': 0.0004922588838351189, 'samples': 2640960, 'steps': 13754, 'loss/train': 1.6507468223571777} 11/06/2021 23:08:51 - INFO - __main__ - Step 13756: {'lr': 0.0004922575734322963, 'samples': 2641152, 'steps': 13755, 'loss/train': 1.7679812908172607} 11/06/2021 23:08:51 - INFO - __main__ - Step 13757: {'lr': 0.0004922562629203161, 'samples': 2641344, 'steps': 13756, 'loss/train': 1.271485447883606} 11/06/2021 23:08:51 - INFO - __main__ - Step 13758: {'lr': 0.0004922549522991791, 'samples': 2641536, 'steps': 13757, 'loss/train': 1.955517053604126} 11/06/2021 23:08:53 - INFO - __main__ - Step 13759: {'lr': 0.0004922536415688856, 'samples': 2641728, 'steps': 13758, 'loss/train': 2.06683349609375} 11/06/2021 23:08:53 - INFO - __main__ - Step 13760: {'lr': 0.0004922523307294364, 'samples': 2641920, 'steps': 13759, 'loss/train': 1.354589581489563} 11/06/2021 23:08:53 - INFO - __main__ - Step 13761: {'lr': 0.0004922510197808321, 'samples': 2642112, 'steps': 13760, 'loss/train': 1.9096133708953857} 11/06/2021 23:08:54 - INFO - __main__ - Step 13762: {'lr': 0.0004922497087230732, 'samples': 2642304, 'steps': 13761, 'loss/train': 1.9483925104141235} 11/06/2021 23:08:54 - INFO - __main__ - Step 13763: {'lr': 0.0004922483975561603, 'samples': 2642496, 'steps': 13762, 'loss/train': 2.073777437210083} 11/06/2021 23:08:55 - INFO - __main__ - Step 13764: {'lr': 0.000492247086280094, 'samples': 2642688, 'steps': 13763, 'loss/train': 1.6551817655563354} 11/06/2021 23:08:55 - INFO - __main__ - Step 13765: {'lr': 0.0004922457748948749, 'samples': 2642880, 'steps': 13764, 'loss/train': 1.7662451267242432} 11/06/2021 23:08:56 - INFO - __main__ - Step 13766: {'lr': 0.0004922444634005037, 'samples': 2643072, 'steps': 13765, 'loss/train': 1.595479965209961} 11/06/2021 23:08:56 - INFO - __main__ - Step 13767: {'lr': 0.0004922431517969808, 'samples': 2643264, 'steps': 13766, 'loss/train': 1.6051268577575684} 11/06/2021 23:08:57 - INFO - __main__ - Step 13768: {'lr': 0.0004922418400843068, 'samples': 2643456, 'steps': 13767, 'loss/train': 0.942782461643219} 11/06/2021 23:08:58 - INFO - __main__ - Step 13769: {'lr': 0.0004922405282624825, 'samples': 2643648, 'steps': 13768, 'loss/train': 0.8302810788154602} 11/06/2021 23:08:58 - INFO - __main__ - Step 13770: {'lr': 0.0004922392163315083, 'samples': 2643840, 'steps': 13769, 'loss/train': 2.0029447078704834} 11/06/2021 23:08:58 - INFO - __main__ - Step 13771: {'lr': 0.0004922379042913848, 'samples': 2644032, 'steps': 13770, 'loss/train': 1.7818113565444946} 11/06/2021 23:08:59 - INFO - __main__ - Step 13772: {'lr': 0.0004922365921421126, 'samples': 2644224, 'steps': 13771, 'loss/train': 2.0600786209106445} 11/06/2021 23:08:59 - INFO - __main__ - Step 13773: {'lr': 0.0004922352798836924, 'samples': 2644416, 'steps': 13772, 'loss/train': 2.0496914386749268} 11/06/2021 23:09:00 - INFO - __main__ - Step 13774: {'lr': 0.0004922339675161248, 'samples': 2644608, 'steps': 13773, 'loss/train': 1.9038617610931396} 11/06/2021 23:09:00 - INFO - __main__ - Step 13775: {'lr': 0.0004922326550394102, 'samples': 2644800, 'steps': 13774, 'loss/train': 1.3252359628677368} 11/06/2021 23:09:01 - INFO - __main__ - Step 13776: {'lr': 0.0004922313424535494, 'samples': 2644992, 'steps': 13775, 'loss/train': 1.8318666219711304} 11/06/2021 23:09:01 - INFO - __main__ - Step 13777: {'lr': 0.0004922300297585428, 'samples': 2645184, 'steps': 13776, 'loss/train': 2.127047061920166} 11/06/2021 23:09:01 - INFO - __main__ - Step 13778: {'lr': 0.0004922287169543911, 'samples': 2645376, 'steps': 13777, 'loss/train': 1.2675795555114746} 11/06/2021 23:09:02 - INFO - __main__ - Step 13779: {'lr': 0.0004922274040410949, 'samples': 2645568, 'steps': 13778, 'loss/train': 1.7878961563110352} 11/06/2021 23:09:03 - INFO - __main__ - Step 13780: {'lr': 0.0004922260910186548, 'samples': 2645760, 'steps': 13779, 'loss/train': 1.97649085521698} 11/06/2021 23:09:03 - INFO - __main__ - Step 13781: {'lr': 0.0004922247778870714, 'samples': 2645952, 'steps': 13780, 'loss/train': 1.6250098943710327} 11/06/2021 23:09:04 - INFO - __main__ - Step 13782: {'lr': 0.0004922234646463451, 'samples': 2646144, 'steps': 13781, 'loss/train': 1.9466784000396729} 11/06/2021 23:09:04 - INFO - __main__ - Step 13783: {'lr': 0.0004922221512964767, 'samples': 2646336, 'steps': 13782, 'loss/train': 1.1106783151626587} 11/06/2021 23:09:04 - INFO - __main__ - Step 13784: {'lr': 0.0004922208378374668, 'samples': 2646528, 'steps': 13783, 'loss/train': 2.0252909660339355} 11/06/2021 23:09:05 - INFO - __main__ - Step 13785: {'lr': 0.0004922195242693159, 'samples': 2646720, 'steps': 13784, 'loss/train': 1.6497094631195068} 11/06/2021 23:09:06 - INFO - __main__ - Step 13786: {'lr': 0.0004922182105920246, 'samples': 2646912, 'steps': 13785, 'loss/train': 1.4474611282348633} 11/06/2021 23:09:06 - INFO - __main__ - Step 13787: {'lr': 0.0004922168968055935, 'samples': 2647104, 'steps': 13786, 'loss/train': 2.291621446609497} 11/06/2021 23:09:06 - INFO - __main__ - Step 13788: {'lr': 0.0004922155829100233, 'samples': 2647296, 'steps': 13787, 'loss/train': 2.148526668548584} 11/06/2021 23:09:07 - INFO - __main__ - Step 13789: {'lr': 0.0004922142689053144, 'samples': 2647488, 'steps': 13788, 'loss/train': 1.9192496538162231} 11/06/2021 23:09:08 - INFO - __main__ - Step 13790: {'lr': 0.0004922129547914675, 'samples': 2647680, 'steps': 13789, 'loss/train': 2.097891330718994} 11/06/2021 23:09:08 - INFO - __main__ - Step 13791: {'lr': 0.0004922116405684832, 'samples': 2647872, 'steps': 13790, 'loss/train': 1.7721515893936157} 11/06/2021 23:09:08 - INFO - __main__ - Step 13792: {'lr': 0.0004922103262363621, 'samples': 2648064, 'steps': 13791, 'loss/train': 1.5414934158325195} 11/06/2021 23:09:09 - INFO - __main__ - Step 13793: {'lr': 0.0004922090117951047, 'samples': 2648256, 'steps': 13792, 'loss/train': 2.323873519897461} 11/06/2021 23:09:09 - INFO - __main__ - Step 13794: {'lr': 0.0004922076972447117, 'samples': 2648448, 'steps': 13793, 'loss/train': 1.157942533493042} 11/06/2021 23:09:10 - INFO - __main__ - Step 13795: {'lr': 0.0004922063825851836, 'samples': 2648640, 'steps': 13794, 'loss/train': 1.609661340713501} 11/06/2021 23:09:10 - INFO - __main__ - Step 13796: {'lr': 0.0004922050678165211, 'samples': 2648832, 'steps': 13795, 'loss/train': 0.9388356804847717} 11/06/2021 23:09:11 - INFO - __main__ - Step 13797: {'lr': 0.0004922037529387247, 'samples': 2649024, 'steps': 13796, 'loss/train': 1.7672147750854492} 11/06/2021 23:09:11 - INFO - __main__ - Step 13798: {'lr': 0.000492202437951795, 'samples': 2649216, 'steps': 13797, 'loss/train': 1.639959454536438} 11/06/2021 23:09:11 - INFO - __main__ - Step 13799: {'lr': 0.0004922011228557327, 'samples': 2649408, 'steps': 13798, 'loss/train': 1.4303568601608276} 11/06/2021 23:09:13 - INFO - __main__ - Step 13800: {'lr': 0.0004921998076505383, 'samples': 2649600, 'steps': 13799, 'loss/train': 1.8134762048721313} 11/06/2021 23:09:13 - INFO - __main__ - Step 13801: {'lr': 0.0004921984923362124, 'samples': 2649792, 'steps': 13800, 'loss/train': 1.5940922498703003} 11/06/2021 23:09:13 - INFO - __main__ - Step 13802: {'lr': 0.0004921971769127555, 'samples': 2649984, 'steps': 13801, 'loss/train': 1.1051990985870361} 11/06/2021 23:09:14 - INFO - __main__ - Step 13803: {'lr': 0.0004921958613801683, 'samples': 2650176, 'steps': 13802, 'loss/train': 1.5847387313842773} 11/06/2021 23:09:14 - INFO - __main__ - Step 13804: {'lr': 0.0004921945457384516, 'samples': 2650368, 'steps': 13803, 'loss/train': 1.840270757675171} 11/06/2021 23:09:15 - INFO - __main__ - Step 13805: {'lr': 0.0004921932299876055, 'samples': 2650560, 'steps': 13804, 'loss/train': 1.712327241897583} 11/06/2021 23:09:15 - INFO - __main__ - Step 13806: {'lr': 0.000492191914127631, 'samples': 2650752, 'steps': 13805, 'loss/train': 2.4084267616271973} 11/06/2021 23:09:16 - INFO - __main__ - Step 13807: {'lr': 0.0004921905981585286, 'samples': 2650944, 'steps': 13806, 'loss/train': 1.3894824981689453} 11/06/2021 23:09:16 - INFO - __main__ - Step 13808: {'lr': 0.0004921892820802988, 'samples': 2651136, 'steps': 13807, 'loss/train': 1.7391257286071777} 11/06/2021 23:09:16 - INFO - __main__ - Step 13809: {'lr': 0.0004921879658929422, 'samples': 2651328, 'steps': 13808, 'loss/train': 4.748656272888184} 11/06/2021 23:09:17 - INFO - __main__ - Step 13810: {'lr': 0.0004921866495964594, 'samples': 2651520, 'steps': 13809, 'loss/train': 1.1638247966766357} 11/06/2021 23:09:18 - INFO - __main__ - Step 13811: {'lr': 0.0004921853331908512, 'samples': 2651712, 'steps': 13810, 'loss/train': 1.9072991609573364} 11/06/2021 23:09:18 - INFO - __main__ - Step 13812: {'lr': 0.000492184016676118, 'samples': 2651904, 'steps': 13811, 'loss/train': 2.309782028198242} 11/06/2021 23:09:18 - INFO - __main__ - Step 13813: {'lr': 0.0004921827000522603, 'samples': 2652096, 'steps': 13812, 'loss/train': 1.5911425352096558} 11/06/2021 23:09:19 - INFO - __main__ - Step 13814: {'lr': 0.0004921813833192788, 'samples': 2652288, 'steps': 13813, 'loss/train': 1.9693739414215088} 11/06/2021 23:09:19 - INFO - __main__ - Step 13815: {'lr': 0.0004921800664771743, 'samples': 2652480, 'steps': 13814, 'loss/train': 1.7779452800750732} 11/06/2021 23:09:20 - INFO - __main__ - Step 13816: {'lr': 0.0004921787495259471, 'samples': 2652672, 'steps': 13815, 'loss/train': 1.5039100646972656} 11/06/2021 23:09:21 - INFO - __main__ - Step 13817: {'lr': 0.0004921774324655978, 'samples': 2652864, 'steps': 13816, 'loss/train': 1.054052472114563} 11/06/2021 23:09:21 - INFO - __main__ - Step 13818: {'lr': 0.0004921761152961271, 'samples': 2653056, 'steps': 13817, 'loss/train': 2.0499536991119385} 11/06/2021 23:09:21 - INFO - __main__ - Step 13819: {'lr': 0.0004921747980175357, 'samples': 2653248, 'steps': 13818, 'loss/train': 1.8573100566864014} 11/06/2021 23:09:22 - INFO - __main__ - Step 13820: {'lr': 0.0004921734806298241, 'samples': 2653440, 'steps': 13819, 'loss/train': 1.531470775604248} 11/06/2021 23:09:23 - INFO - __main__ - Step 13821: {'lr': 0.0004921721631329927, 'samples': 2653632, 'steps': 13820, 'loss/train': 3.0055086612701416} 11/06/2021 23:09:23 - INFO - __main__ - Step 13822: {'lr': 0.0004921708455270424, 'samples': 2653824, 'steps': 13821, 'loss/train': 1.5940738916397095} 11/06/2021 23:09:23 - INFO - __main__ - Step 13823: {'lr': 0.0004921695278119736, 'samples': 2654016, 'steps': 13822, 'loss/train': 2.001136541366577} 11/06/2021 23:09:24 - INFO - __main__ - Step 13824: {'lr': 0.0004921682099877869, 'samples': 2654208, 'steps': 13823, 'loss/train': 1.7149298191070557} 11/06/2021 23:09:24 - INFO - __main__ - Step 13825: {'lr': 0.000492166892054483, 'samples': 2654400, 'steps': 13824, 'loss/train': 1.6777122020721436} 11/06/2021 23:09:25 - INFO - __main__ - Step 13826: {'lr': 0.0004921655740120623, 'samples': 2654592, 'steps': 13825, 'loss/train': 1.5764762163162231} 11/06/2021 23:09:25 - INFO - __main__ - Step 13827: {'lr': 0.0004921642558605257, 'samples': 2654784, 'steps': 13826, 'loss/train': 1.0914108753204346} 11/06/2021 23:09:26 - INFO - __main__ - Step 13828: {'lr': 0.0004921629375998736, 'samples': 2654976, 'steps': 13827, 'loss/train': 1.5501351356506348} 11/06/2021 23:09:26 - INFO - __main__ - Step 13829: {'lr': 0.0004921616192301065, 'samples': 2655168, 'steps': 13828, 'loss/train': 0.7807660102844238} 11/06/2021 23:09:26 - INFO - __main__ - Step 13830: {'lr': 0.0004921603007512253, 'samples': 2655360, 'steps': 13829, 'loss/train': 1.8146405220031738} 11/06/2021 23:09:28 - INFO - __main__ - Step 13831: {'lr': 0.0004921589821632302, 'samples': 2655552, 'steps': 13830, 'loss/train': 1.731347680091858} 11/06/2021 23:09:28 - INFO - __main__ - Step 13832: {'lr': 0.0004921576634661221, 'samples': 2655744, 'steps': 13831, 'loss/train': 1.670548439025879} 11/06/2021 23:09:28 - INFO - __main__ - Step 13833: {'lr': 0.0004921563446599015, 'samples': 2655936, 'steps': 13832, 'loss/train': 1.5320069789886475} 11/06/2021 23:09:29 - INFO - __main__ - Step 13834: {'lr': 0.000492155025744569, 'samples': 2656128, 'steps': 13833, 'loss/train': 1.0628443956375122} 11/06/2021 23:09:29 - INFO - __main__ - Step 13835: {'lr': 0.0004921537067201252, 'samples': 2656320, 'steps': 13834, 'loss/train': 2.1405911445617676} 11/06/2021 23:09:30 - INFO - __main__ - Step 13836: {'lr': 0.0004921523875865706, 'samples': 2656512, 'steps': 13835, 'loss/train': 1.0246696472167969} 11/06/2021 23:09:30 - INFO - __main__ - Step 13837: {'lr': 0.000492151068343906, 'samples': 2656704, 'steps': 13836, 'loss/train': 1.8948359489440918} 11/06/2021 23:09:31 - INFO - __main__ - Step 13838: {'lr': 0.0004921497489921318, 'samples': 2656896, 'steps': 13837, 'loss/train': 1.7364670038223267} 11/06/2021 23:09:31 - INFO - __main__ - Step 13839: {'lr': 0.0004921484295312485, 'samples': 2657088, 'steps': 13838, 'loss/train': 1.4942817687988281} 11/06/2021 23:09:31 - INFO - __main__ - Step 13840: {'lr': 0.0004921471099612571, 'samples': 2657280, 'steps': 13839, 'loss/train': 1.2678571939468384} 11/06/2021 23:09:32 - INFO - __main__ - Step 13841: {'lr': 0.0004921457902821578, 'samples': 2657472, 'steps': 13840, 'loss/train': 1.7979927062988281} 11/06/2021 23:09:33 - INFO - __main__ - Step 13842: {'lr': 0.0004921444704939514, 'samples': 2657664, 'steps': 13841, 'loss/train': 1.2086461782455444} 11/06/2021 23:09:33 - INFO - __main__ - Step 13843: {'lr': 0.0004921431505966384, 'samples': 2657856, 'steps': 13842, 'loss/train': 2.1114842891693115} 11/06/2021 23:09:33 - INFO - __main__ - Step 13844: {'lr': 0.0004921418305902194, 'samples': 2658048, 'steps': 13843, 'loss/train': 1.7268887758255005} 11/06/2021 23:09:34 - INFO - __main__ - Step 13845: {'lr': 0.0004921405104746951, 'samples': 2658240, 'steps': 13844, 'loss/train': 1.4159009456634521} 11/06/2021 23:09:35 - INFO - __main__ - Step 13846: {'lr': 0.0004921391902500661, 'samples': 2658432, 'steps': 13845, 'loss/train': 0.772130012512207} 11/06/2021 23:09:35 - INFO - __main__ - Step 13847: {'lr': 0.0004921378699163328, 'samples': 2658624, 'steps': 13846, 'loss/train': 1.6413114070892334} 11/06/2021 23:09:35 - INFO - __main__ - Step 13848: {'lr': 0.0004921365494734959, 'samples': 2658816, 'steps': 13847, 'loss/train': 2.0871777534484863} 11/06/2021 23:09:36 - INFO - __main__ - Step 13849: {'lr': 0.0004921352289215561, 'samples': 2659008, 'steps': 13848, 'loss/train': 1.6378384828567505} 11/06/2021 23:09:36 - INFO - __main__ - Step 13850: {'lr': 0.0004921339082605137, 'samples': 2659200, 'steps': 13849, 'loss/train': 1.849760890007019} 11/06/2021 23:09:37 - INFO - __main__ - Step 13851: {'lr': 0.0004921325874903697, 'samples': 2659392, 'steps': 13850, 'loss/train': 1.766753077507019} 11/06/2021 23:09:38 - INFO - __main__ - Step 13852: {'lr': 0.0004921312666111245, 'samples': 2659584, 'steps': 13851, 'loss/train': 1.397715449333191} 11/06/2021 23:09:38 - INFO - __main__ - Step 13853: {'lr': 0.0004921299456227785, 'samples': 2659776, 'steps': 13852, 'loss/train': 1.6967663764953613} 11/06/2021 23:09:38 - INFO - __main__ - Step 13854: {'lr': 0.0004921286245253327, 'samples': 2659968, 'steps': 13853, 'loss/train': 1.3301624059677124} 11/06/2021 23:09:39 - INFO - __main__ - Step 13855: {'lr': 0.0004921273033187874, 'samples': 2660160, 'steps': 13854, 'loss/train': 2.2361197471618652} 11/06/2021 23:09:39 - INFO - __main__ - Step 13856: {'lr': 0.0004921259820031431, 'samples': 2660352, 'steps': 13855, 'loss/train': 1.5160555839538574} 11/06/2021 23:09:40 - INFO - __main__ - Step 13857: {'lr': 0.0004921246605784008, 'samples': 2660544, 'steps': 13856, 'loss/train': 1.797963261604309} 11/06/2021 23:09:40 - INFO - __main__ - Step 13858: {'lr': 0.0004921233390445608, 'samples': 2660736, 'steps': 13857, 'loss/train': 1.6788036823272705} 11/06/2021 23:09:41 - INFO - __main__ - Step 13859: {'lr': 0.0004921220174016238, 'samples': 2660928, 'steps': 13858, 'loss/train': 1.196427345275879} 11/06/2021 23:09:41 - INFO - __main__ - Step 13860: {'lr': 0.0004921206956495903, 'samples': 2661120, 'steps': 13859, 'loss/train': 0.5547469854354858} 11/06/2021 23:09:42 - INFO - __main__ - Step 13861: {'lr': 0.000492119373788461, 'samples': 2661312, 'steps': 13860, 'loss/train': 1.9061776399612427} 11/06/2021 23:09:42 - INFO - __main__ - Step 13862: {'lr': 0.0004921180518182363, 'samples': 2661504, 'steps': 13861, 'loss/train': 1.7613266706466675} 11/06/2021 23:09:43 - INFO - __main__ - Step 13863: {'lr': 0.0004921167297389171, 'samples': 2661696, 'steps': 13862, 'loss/train': 1.3482214212417603} 11/06/2021 23:09:43 - INFO - __main__ - Step 13864: {'lr': 0.0004921154075505038, 'samples': 2661888, 'steps': 13863, 'loss/train': 1.8354885578155518} 11/06/2021 23:09:44 - INFO - __main__ - Step 13865: {'lr': 0.0004921140852529969, 'samples': 2662080, 'steps': 13864, 'loss/train': 1.8951635360717773} 11/06/2021 23:09:44 - INFO - __main__ - Step 13866: {'lr': 0.0004921127628463972, 'samples': 2662272, 'steps': 13865, 'loss/train': 1.8773669004440308} 11/06/2021 23:09:44 - INFO - __main__ - Step 13867: {'lr': 0.0004921114403307053, 'samples': 2662464, 'steps': 13866, 'loss/train': 1.8073745965957642} 11/06/2021 23:09:45 - INFO - __main__ - Step 13868: {'lr': 0.0004921101177059218, 'samples': 2662656, 'steps': 13867, 'loss/train': 2.099315881729126} 11/06/2021 23:09:46 - INFO - __main__ - Step 13869: {'lr': 0.0004921087949720471, 'samples': 2662848, 'steps': 13868, 'loss/train': 1.927840232849121} 11/06/2021 23:09:46 - INFO - __main__ - Step 13870: {'lr': 0.0004921074721290819, 'samples': 2663040, 'steps': 13869, 'loss/train': 1.8711230754852295} 11/06/2021 23:09:46 - INFO - __main__ - Step 13871: {'lr': 0.0004921061491770268, 'samples': 2663232, 'steps': 13870, 'loss/train': 1.5287330150604248} 11/06/2021 23:09:47 - INFO - __main__ - Step 13872: {'lr': 0.0004921048261158825, 'samples': 2663424, 'steps': 13871, 'loss/train': 1.6624870300292969} 11/06/2021 23:09:48 - INFO - __main__ - Step 13873: {'lr': 0.0004921035029456493, 'samples': 2663616, 'steps': 13872, 'loss/train': 1.8428698778152466} 11/06/2021 23:09:48 - INFO - __main__ - Step 13874: {'lr': 0.0004921021796663282, 'samples': 2663808, 'steps': 13873, 'loss/train': 1.4383753538131714} 11/06/2021 23:09:49 - INFO - __main__ - Step 13875: {'lr': 0.0004921008562779195, 'samples': 2664000, 'steps': 13874, 'loss/train': 1.4360547065734863} 11/06/2021 23:09:49 - INFO - __main__ - Step 13876: {'lr': 0.0004920995327804239, 'samples': 2664192, 'steps': 13875, 'loss/train': 1.6160893440246582} 11/06/2021 23:09:49 - INFO - __main__ - Step 13877: {'lr': 0.000492098209173842, 'samples': 2664384, 'steps': 13876, 'loss/train': 1.0459492206573486} 11/06/2021 23:09:50 - INFO - __main__ - Step 13878: {'lr': 0.0004920968854581745, 'samples': 2664576, 'steps': 13877, 'loss/train': 1.7214491367340088} 11/06/2021 23:09:51 - INFO - __main__ - Step 13879: {'lr': 0.0004920955616334216, 'samples': 2664768, 'steps': 13878, 'loss/train': 1.4879608154296875} 11/06/2021 23:09:51 - INFO - __main__ - Step 13880: {'lr': 0.0004920942376995844, 'samples': 2664960, 'steps': 13879, 'loss/train': 1.7275727987289429} 11/06/2021 23:09:51 - INFO - __main__ - Step 13881: {'lr': 0.0004920929136566632, 'samples': 2665152, 'steps': 13880, 'loss/train': 1.556997299194336} 11/06/2021 23:09:52 - INFO - __main__ - Step 13882: {'lr': 0.0004920915895046587, 'samples': 2665344, 'steps': 13881, 'loss/train': 1.629291296005249} 11/06/2021 23:09:52 - INFO - __main__ - Step 13883: {'lr': 0.0004920902652435715, 'samples': 2665536, 'steps': 13882, 'loss/train': 2.0257277488708496} 11/06/2021 23:09:53 - INFO - __main__ - Step 13884: {'lr': 0.0004920889408734021, 'samples': 2665728, 'steps': 13883, 'loss/train': 1.8871136903762817} 11/06/2021 23:09:53 - INFO - __main__ - Step 13885: {'lr': 0.0004920876163941511, 'samples': 2665920, 'steps': 13884, 'loss/train': 1.1255569458007812} 11/06/2021 23:09:54 - INFO - __main__ - Step 13886: {'lr': 0.0004920862918058192, 'samples': 2666112, 'steps': 13885, 'loss/train': 1.7489358186721802} 11/06/2021 23:09:54 - INFO - __main__ - Step 13887: {'lr': 0.000492084967108407, 'samples': 2666304, 'steps': 13886, 'loss/train': 2.234173536300659} 11/06/2021 23:09:54 - INFO - __main__ - Step 13888: {'lr': 0.000492083642301915, 'samples': 2666496, 'steps': 13887, 'loss/train': 1.7307212352752686} 11/06/2021 23:09:56 - INFO - __main__ - Step 13889: {'lr': 0.0004920823173863439, 'samples': 2666688, 'steps': 13888, 'loss/train': 1.4370955228805542} 11/06/2021 23:09:56 - INFO - __main__ - Step 13890: {'lr': 0.0004920809923616942, 'samples': 2666880, 'steps': 13889, 'loss/train': 1.8873080015182495} 11/06/2021 23:09:56 - INFO - __main__ - Step 13891: {'lr': 0.0004920796672279666, 'samples': 2667072, 'steps': 13890, 'loss/train': 1.8920522928237915} 11/06/2021 23:09:57 - INFO - __main__ - Step 13892: {'lr': 0.0004920783419851615, 'samples': 2667264, 'steps': 13891, 'loss/train': 5.647333145141602} 11/06/2021 23:09:57 - INFO - __main__ - Step 13893: {'lr': 0.0004920770166332798, 'samples': 2667456, 'steps': 13892, 'loss/train': 1.972109317779541} 11/06/2021 23:09:57 - INFO - __main__ - Step 13894: {'lr': 0.0004920756911723219, 'samples': 2667648, 'steps': 13893, 'loss/train': 2.5524847507476807} 11/06/2021 23:09:59 - INFO - __main__ - Step 13895: {'lr': 0.0004920743656022884, 'samples': 2667840, 'steps': 13894, 'loss/train': 1.815747618675232} 11/06/2021 23:09:59 - INFO - __main__ - Step 13896: {'lr': 0.0004920730399231799, 'samples': 2668032, 'steps': 13895, 'loss/train': 1.4550343751907349} 11/06/2021 23:09:59 - INFO - __main__ - Step 13897: {'lr': 0.000492071714134997, 'samples': 2668224, 'steps': 13896, 'loss/train': 1.4864250421524048} 11/06/2021 23:10:00 - INFO - __main__ - Step 13898: {'lr': 0.0004920703882377403, 'samples': 2668416, 'steps': 13897, 'loss/train': 1.741361141204834} 11/06/2021 23:10:00 - INFO - __main__ - Step 13899: {'lr': 0.0004920690622314105, 'samples': 2668608, 'steps': 13898, 'loss/train': 2.994898796081543} 11/06/2021 23:10:01 - INFO - __main__ - Step 13900: {'lr': 0.0004920677361160081, 'samples': 2668800, 'steps': 13899, 'loss/train': 1.8634357452392578} 11/06/2021 23:10:01 - INFO - __main__ - Step 13901: {'lr': 0.0004920664098915337, 'samples': 2668992, 'steps': 13900, 'loss/train': 1.6608809232711792} 11/06/2021 23:10:02 - INFO - __main__ - Step 13902: {'lr': 0.000492065083557988, 'samples': 2669184, 'steps': 13901, 'loss/train': 1.932618260383606} 11/06/2021 23:10:02 - INFO - __main__ - Step 13903: {'lr': 0.0004920637571153713, 'samples': 2669376, 'steps': 13902, 'loss/train': 1.4454110860824585} 11/06/2021 23:10:02 - INFO - __main__ - Step 13904: {'lr': 0.0004920624305636846, 'samples': 2669568, 'steps': 13903, 'loss/train': 1.8593177795410156} 11/06/2021 23:10:03 - INFO - __main__ - Step 13905: {'lr': 0.0004920611039029283, 'samples': 2669760, 'steps': 13904, 'loss/train': 1.7864686250686646} 11/06/2021 23:10:05 - INFO - __main__ - Step 13906: {'lr': 0.0004920597771331029, 'samples': 2669952, 'steps': 13905, 'loss/train': 2.002392292022705} 11/06/2021 23:10:05 - INFO - __main__ - Step 13907: {'lr': 0.0004920584502542091, 'samples': 2670144, 'steps': 13906, 'loss/train': 0.30893194675445557} 11/06/2021 23:10:06 - INFO - __main__ - Step 13908: {'lr': 0.0004920571232662475, 'samples': 2670336, 'steps': 13907, 'loss/train': 1.9505276679992676} 11/06/2021 23:10:06 - INFO - __main__ - Step 13909: {'lr': 0.0004920557961692188, 'samples': 2670528, 'steps': 13908, 'loss/train': 1.9398505687713623} 11/06/2021 23:10:06 - INFO - __main__ - Step 13910: {'lr': 0.0004920544689631233, 'samples': 2670720, 'steps': 13909, 'loss/train': 2.0230226516723633} 11/06/2021 23:10:07 - INFO - __main__ - Step 13911: {'lr': 0.000492053141647962, 'samples': 2670912, 'steps': 13910, 'loss/train': 1.8889321088790894} 11/06/2021 23:10:07 - INFO - __main__ - Step 13912: {'lr': 0.0004920518142237352, 'samples': 2671104, 'steps': 13911, 'loss/train': 1.33417809009552} 11/06/2021 23:10:07 - INFO - __main__ - Step 13913: {'lr': 0.0004920504866904436, 'samples': 2671296, 'steps': 13912, 'loss/train': 1.29556143283844} 11/06/2021 23:10:09 - INFO - __main__ - Step 13914: {'lr': 0.0004920491590480878, 'samples': 2671488, 'steps': 13913, 'loss/train': 1.9679369926452637} 11/06/2021 23:10:09 - INFO - __main__ - Step 13915: {'lr': 0.0004920478312966683, 'samples': 2671680, 'steps': 13914, 'loss/train': 1.3328757286071777} 11/06/2021 23:10:09 - INFO - __main__ - Step 13916: {'lr': 0.0004920465034361859, 'samples': 2671872, 'steps': 13915, 'loss/train': 1.5501303672790527} 11/06/2021 23:10:10 - INFO - __main__ - Step 13917: {'lr': 0.000492045175466641, 'samples': 2672064, 'steps': 13916, 'loss/train': 1.871131181716919} 11/06/2021 23:10:10 - INFO - __main__ - Step 13918: {'lr': 0.0004920438473880344, 'samples': 2672256, 'steps': 13917, 'loss/train': 2.702326536178589} 11/06/2021 23:10:11 - INFO - __main__ - Step 13919: {'lr': 0.0004920425192003663, 'samples': 2672448, 'steps': 13918, 'loss/train': 1.7303717136383057} 11/06/2021 23:10:11 - INFO - __main__ - Step 13920: {'lr': 0.0004920411909036379, 'samples': 2672640, 'steps': 13919, 'loss/train': 1.9704726934432983} 11/06/2021 23:10:12 - INFO - __main__ - Step 13921: {'lr': 0.0004920398624978493, 'samples': 2672832, 'steps': 13920, 'loss/train': 1.4717364311218262} 11/06/2021 23:10:12 - INFO - __main__ - Step 13922: {'lr': 0.0004920385339830012, 'samples': 2673024, 'steps': 13921, 'loss/train': 1.7774920463562012} 11/06/2021 23:10:12 - INFO - __main__ - Step 13923: {'lr': 0.0004920372053590945, 'samples': 2673216, 'steps': 13922, 'loss/train': 1.853239893913269} 11/06/2021 23:10:13 - INFO - __main__ - Step 13924: {'lr': 0.0004920358766261294, 'samples': 2673408, 'steps': 13923, 'loss/train': 1.9390660524368286} 11/06/2021 23:10:14 - INFO - __main__ - Step 13925: {'lr': 0.0004920345477841067, 'samples': 2673600, 'steps': 13924, 'loss/train': 2.0693888664245605} 11/06/2021 23:10:14 - INFO - __main__ - Step 13926: {'lr': 0.000492033218833027, 'samples': 2673792, 'steps': 13925, 'loss/train': 1.8519387245178223} 11/06/2021 23:10:14 - INFO - __main__ - Step 13927: {'lr': 0.0004920318897728909, 'samples': 2673984, 'steps': 13926, 'loss/train': 1.803276538848877} 11/06/2021 23:10:15 - INFO - __main__ - Step 13928: {'lr': 0.0004920305606036988, 'samples': 2674176, 'steps': 13927, 'loss/train': 1.768867015838623} 11/06/2021 23:10:16 - INFO - __main__ - Step 13929: {'lr': 0.0004920292313254516, 'samples': 2674368, 'steps': 13928, 'loss/train': 2.1696088314056396} 11/06/2021 23:10:16 - INFO - __main__ - Step 13930: {'lr': 0.0004920279019381497, 'samples': 2674560, 'steps': 13929, 'loss/train': 2.3505563735961914} 11/06/2021 23:10:16 - INFO - __main__ - Step 13931: {'lr': 0.0004920265724417938, 'samples': 2674752, 'steps': 13930, 'loss/train': 0.4778132736682892} 11/06/2021 23:10:17 - INFO - __main__ - Step 13932: {'lr': 0.0004920252428363845, 'samples': 2674944, 'steps': 13931, 'loss/train': 1.7751408815383911} 11/06/2021 23:10:17 - INFO - __main__ - Step 13933: {'lr': 0.0004920239131219223, 'samples': 2675136, 'steps': 13932, 'loss/train': 0.7648064494132996} 11/06/2021 23:10:18 - INFO - __main__ - Step 13934: {'lr': 0.0004920225832984079, 'samples': 2675328, 'steps': 13933, 'loss/train': 1.5931404829025269} 11/06/2021 23:10:18 - INFO - __main__ - Step 13935: {'lr': 0.0004920212533658419, 'samples': 2675520, 'steps': 13934, 'loss/train': 2.218579053878784} 11/06/2021 23:10:19 - INFO - __main__ - Step 13936: {'lr': 0.0004920199233242247, 'samples': 2675712, 'steps': 13935, 'loss/train': 1.7814935445785522} 11/06/2021 23:10:19 - INFO - __main__ - Step 13937: {'lr': 0.0004920185931735572, 'samples': 2675904, 'steps': 13936, 'loss/train': 1.8334205150604248} 11/06/2021 23:10:20 - INFO - __main__ - Step 13938: {'lr': 0.0004920172629138399, 'samples': 2676096, 'steps': 13937, 'loss/train': 1.7788243293762207} 11/06/2021 23:10:20 - INFO - __main__ - Step 13939: {'lr': 0.0004920159325450731, 'samples': 2676288, 'steps': 13938, 'loss/train': 1.6450157165527344} 11/06/2021 23:10:21 - INFO - __main__ - Step 13940: {'lr': 0.0004920146020672578, 'samples': 2676480, 'steps': 13939, 'loss/train': 1.6414921283721924} 11/06/2021 23:10:21 - INFO - __main__ - Step 13941: {'lr': 0.0004920132714803946, 'samples': 2676672, 'steps': 13940, 'loss/train': 1.8297538757324219} 11/06/2021 23:10:22 - INFO - __main__ - Step 13942: {'lr': 0.0004920119407844838, 'samples': 2676864, 'steps': 13941, 'loss/train': 1.148298978805542} 11/06/2021 23:10:22 - INFO - __main__ - Step 13943: {'lr': 0.0004920106099795262, 'samples': 2677056, 'steps': 13942, 'loss/train': 2.2544357776641846} 11/06/2021 23:10:22 - INFO - __main__ - Step 13944: {'lr': 0.0004920092790655224, 'samples': 2677248, 'steps': 13943, 'loss/train': 2.0402214527130127} 11/06/2021 23:10:24 - INFO - __main__ - Step 13945: {'lr': 0.0004920079480424728, 'samples': 2677440, 'steps': 13944, 'loss/train': 1.6051249504089355} 11/06/2021 23:10:24 - INFO - __main__ - Step 13946: {'lr': 0.0004920066169103783, 'samples': 2677632, 'steps': 13945, 'loss/train': 1.0051872730255127} 11/06/2021 23:10:24 - INFO - __main__ - Step 13947: {'lr': 0.0004920052856692394, 'samples': 2677824, 'steps': 13946, 'loss/train': 1.7988791465759277} 11/06/2021 23:10:25 - INFO - __main__ - Step 13948: {'lr': 0.0004920039543190565, 'samples': 2678016, 'steps': 13947, 'loss/train': 1.3910020589828491} 11/06/2021 23:10:25 - INFO - __main__ - Step 13949: {'lr': 0.0004920026228598303, 'samples': 2678208, 'steps': 13948, 'loss/train': 1.5859302282333374} 11/06/2021 23:10:26 - INFO - __main__ - Step 13950: {'lr': 0.0004920012912915616, 'samples': 2678400, 'steps': 13949, 'loss/train': 1.9164284467697144} 11/06/2021 23:10:26 - INFO - __main__ - Step 13951: {'lr': 0.0004919999596142508, 'samples': 2678592, 'steps': 13950, 'loss/train': 1.7975435256958008} 11/06/2021 23:10:27 - INFO - __main__ - Step 13952: {'lr': 0.0004919986278278986, 'samples': 2678784, 'steps': 13951, 'loss/train': 1.8759640455245972} 11/06/2021 23:10:27 - INFO - __main__ - Step 13953: {'lr': 0.0004919972959325055, 'samples': 2678976, 'steps': 13952, 'loss/train': 2.344235420227051} 11/06/2021 23:10:27 - INFO - __main__ - Step 13954: {'lr': 0.0004919959639280722, 'samples': 2679168, 'steps': 13953, 'loss/train': 3.453373432159424} 11/06/2021 23:10:28 - INFO - __main__ - Step 13955: {'lr': 0.0004919946318145992, 'samples': 2679360, 'steps': 13954, 'loss/train': 2.145124673843384} 11/06/2021 23:10:29 - INFO - __main__ - Step 13956: {'lr': 0.0004919932995920872, 'samples': 2679552, 'steps': 13955, 'loss/train': 1.7001293897628784} 11/06/2021 23:10:29 - INFO - __main__ - Step 13957: {'lr': 0.0004919919672605366, 'samples': 2679744, 'steps': 13956, 'loss/train': 1.792694330215454} 11/06/2021 23:10:29 - INFO - __main__ - Step 13958: {'lr': 0.0004919906348199483, 'samples': 2679936, 'steps': 13957, 'loss/train': 1.8448896408081055} 11/06/2021 23:10:30 - INFO - __main__ - Step 13959: {'lr': 0.0004919893022703228, 'samples': 2680128, 'steps': 13958, 'loss/train': 1.7649340629577637} 11/06/2021 23:10:31 - INFO - __main__ - Step 13960: {'lr': 0.0004919879696116605, 'samples': 2680320, 'steps': 13959, 'loss/train': 1.3440182209014893} 11/06/2021 23:10:31 - INFO - __main__ - Step 13961: {'lr': 0.0004919866368439624, 'samples': 2680512, 'steps': 13960, 'loss/train': 2.0946123600006104} 11/06/2021 23:10:31 - INFO - __main__ - Step 13962: {'lr': 0.0004919853039672287, 'samples': 2680704, 'steps': 13961, 'loss/train': 1.806232213973999} 11/06/2021 23:10:32 - INFO - __main__ - Step 13963: {'lr': 0.00049198397098146, 'samples': 2680896, 'steps': 13962, 'loss/train': 2.475085496902466} 11/06/2021 23:10:32 - INFO - __main__ - Step 13964: {'lr': 0.0004919826378866573, 'samples': 2681088, 'steps': 13963, 'loss/train': 1.6275722980499268} 11/06/2021 23:10:33 - INFO - __main__ - Step 13965: {'lr': 0.0004919813046828209, 'samples': 2681280, 'steps': 13964, 'loss/train': 1.4893933534622192} 11/06/2021 23:10:34 - INFO - __main__ - Step 13966: {'lr': 0.0004919799713699514, 'samples': 2681472, 'steps': 13965, 'loss/train': 1.8989421129226685} 11/06/2021 23:10:34 - INFO - __main__ - Step 13967: {'lr': 0.0004919786379480494, 'samples': 2681664, 'steps': 13966, 'loss/train': 2.0461950302124023} 11/06/2021 23:10:34 - INFO - __main__ - Step 13968: {'lr': 0.0004919773044171158, 'samples': 2681856, 'steps': 13967, 'loss/train': 1.7117947340011597} 11/06/2021 23:10:35 - INFO - __main__ - Step 13969: {'lr': 0.0004919759707771507, 'samples': 2682048, 'steps': 13968, 'loss/train': 1.7647126913070679} 11/06/2021 23:10:35 - INFO - __main__ - Step 13970: {'lr': 0.0004919746370281551, 'samples': 2682240, 'steps': 13969, 'loss/train': 1.784766435623169} 11/06/2021 23:10:36 - INFO - __main__ - Step 13971: {'lr': 0.0004919733031701295, 'samples': 2682432, 'steps': 13970, 'loss/train': 1.7901809215545654} 11/06/2021 23:10:37 - INFO - __main__ - Step 13972: {'lr': 0.0004919719692030743, 'samples': 2682624, 'steps': 13971, 'loss/train': 1.4720513820648193} 11/06/2021 23:10:37 - INFO - __main__ - Step 13973: {'lr': 0.0004919706351269904, 'samples': 2682816, 'steps': 13972, 'loss/train': 1.9798334836959839} 11/06/2021 23:10:37 - INFO - __main__ - Step 13974: {'lr': 0.0004919693009418782, 'samples': 2683008, 'steps': 13973, 'loss/train': 2.224214553833008} 11/06/2021 23:10:38 - INFO - __main__ - Step 13975: {'lr': 0.0004919679666477384, 'samples': 2683200, 'steps': 13974, 'loss/train': 1.7048498392105103} 11/06/2021 23:10:39 - INFO - __main__ - Step 13976: {'lr': 0.0004919666322445715, 'samples': 2683392, 'steps': 13975, 'loss/train': 1.784834384918213} 11/06/2021 23:10:39 - INFO - __main__ - Step 13977: {'lr': 0.0004919652977323783, 'samples': 2683584, 'steps': 13976, 'loss/train': 1.4972141981124878} 11/06/2021 23:10:39 - INFO - __main__ - Step 13978: {'lr': 0.0004919639631111592, 'samples': 2683776, 'steps': 13977, 'loss/train': 1.7331345081329346} 11/06/2021 23:10:40 - INFO - __main__ - Step 13979: {'lr': 0.0004919626283809149, 'samples': 2683968, 'steps': 13978, 'loss/train': 1.7867261171340942} 11/06/2021 23:10:40 - INFO - __main__ - Step 13980: {'lr': 0.0004919612935416459, 'samples': 2684160, 'steps': 13979, 'loss/train': 1.5536366701126099} 11/06/2021 23:10:41 - INFO - __main__ - Step 13981: {'lr': 0.000491959958593353, 'samples': 2684352, 'steps': 13980, 'loss/train': 1.6696289777755737} 11/06/2021 23:10:41 - INFO - __main__ - Step 13982: {'lr': 0.0004919586235360365, 'samples': 2684544, 'steps': 13981, 'loss/train': 1.6226602792739868} 11/06/2021 23:10:42 - INFO - __main__ - Step 13983: {'lr': 0.0004919572883696974, 'samples': 2684736, 'steps': 13982, 'loss/train': 1.9969899654388428} 11/06/2021 23:10:42 - INFO - __main__ - Step 13984: {'lr': 0.0004919559530943359, 'samples': 2684928, 'steps': 13983, 'loss/train': 1.8502788543701172} 11/06/2021 23:10:43 - INFO - __main__ - Step 13985: {'lr': 0.0004919546177099528, 'samples': 2685120, 'steps': 13984, 'loss/train': 2.2144908905029297} 11/06/2021 23:10:44 - INFO - __main__ - Step 13986: {'lr': 0.0004919532822165487, 'samples': 2685312, 'steps': 13985, 'loss/train': 1.7305831909179688} 11/06/2021 23:10:44 - INFO - __main__ - Step 13987: {'lr': 0.0004919519466141242, 'samples': 2685504, 'steps': 13986, 'loss/train': 2.206554889678955} 11/06/2021 23:10:44 - INFO - __main__ - Step 13988: {'lr': 0.0004919506109026799, 'samples': 2685696, 'steps': 13987, 'loss/train': 2.002794027328491} 11/06/2021 23:10:45 - INFO - __main__ - Step 13989: {'lr': 0.0004919492750822163, 'samples': 2685888, 'steps': 13988, 'loss/train': 1.8074073791503906} 11/06/2021 23:10:45 - INFO - __main__ - Step 13990: {'lr': 0.0004919479391527343, 'samples': 2686080, 'steps': 13989, 'loss/train': 1.7749497890472412} 11/06/2021 23:10:45 - INFO - __main__ - Step 13991: {'lr': 0.0004919466031142342, 'samples': 2686272, 'steps': 13990, 'loss/train': 1.2076705694198608} 11/06/2021 23:10:46 - INFO - __main__ - Step 13992: {'lr': 0.0004919452669667166, 'samples': 2686464, 'steps': 13991, 'loss/train': 3.3132309913635254} 11/06/2021 23:10:47 - INFO - __main__ - Step 13993: {'lr': 0.0004919439307101822, 'samples': 2686656, 'steps': 13992, 'loss/train': 3.050110101699829} 11/06/2021 23:10:47 - INFO - __main__ - Step 13994: {'lr': 0.0004919425943446317, 'samples': 2686848, 'steps': 13993, 'loss/train': 1.7559268474578857} 11/06/2021 23:10:48 - INFO - __main__ - Step 13995: {'lr': 0.0004919412578700654, 'samples': 2687040, 'steps': 13994, 'loss/train': 1.2231806516647339} 11/06/2021 23:10:48 - INFO - __main__ - Step 13996: {'lr': 0.0004919399212864843, 'samples': 2687232, 'steps': 13995, 'loss/train': 1.859178066253662} 11/06/2021 23:10:48 - INFO - __main__ - Step 13997: {'lr': 0.0004919385845938888, 'samples': 2687424, 'steps': 13996, 'loss/train': 1.9321575164794922} 11/06/2021 23:10:49 - INFO - __main__ - Step 13998: {'lr': 0.0004919372477922794, 'samples': 2687616, 'steps': 13997, 'loss/train': 1.7246944904327393} 11/06/2021 23:10:50 - INFO - __main__ - Step 13999: {'lr': 0.0004919359108816569, 'samples': 2687808, 'steps': 13998, 'loss/train': 2.0384068489074707} 11/06/2021 23:10:50 - INFO - __main__ - Step 14000: {'lr': 0.0004919345738620218, 'samples': 2688000, 'steps': 13999, 'loss/train': 1.2765216827392578} 11/06/2021 23:10:50 - INFO - __main__ - Step 14001: {'lr': 0.0004919332367333747, 'samples': 2688192, 'steps': 14000, 'loss/train': 1.9448779821395874} 11/06/2021 23:10:51 - INFO - __main__ - Step 14002: {'lr': 0.0004919318994957162, 'samples': 2688384, 'steps': 14001, 'loss/train': 1.9251288175582886} 11/06/2021 23:10:52 - INFO - __main__ - Step 14003: {'lr': 0.0004919305621490469, 'samples': 2688576, 'steps': 14002, 'loss/train': 2.482295274734497} 11/06/2021 23:10:52 - INFO - __main__ - Step 14004: {'lr': 0.0004919292246933675, 'samples': 2688768, 'steps': 14003, 'loss/train': 1.9574697017669678} 11/06/2021 23:10:52 - INFO - __main__ - Step 14005: {'lr': 0.0004919278871286785, 'samples': 2688960, 'steps': 14004, 'loss/train': 1.896154761314392} 11/06/2021 23:10:53 - INFO - __main__ - Step 14006: {'lr': 0.0004919265494549805, 'samples': 2689152, 'steps': 14005, 'loss/train': 1.633683681488037} 11/06/2021 23:10:53 - INFO - __main__ - Step 14007: {'lr': 0.0004919252116722742, 'samples': 2689344, 'steps': 14006, 'loss/train': 1.4971553087234497} 11/06/2021 23:10:54 - INFO - __main__ - Step 14008: {'lr': 0.0004919238737805601, 'samples': 2689536, 'steps': 14007, 'loss/train': 1.8007830381393433} 11/06/2021 23:10:55 - INFO - __main__ - Step 14009: {'lr': 0.0004919225357798387, 'samples': 2689728, 'steps': 14008, 'loss/train': 1.9640558958053589} 11/06/2021 23:10:55 - INFO - __main__ - Step 14010: {'lr': 0.000491921197670111, 'samples': 2689920, 'steps': 14009, 'loss/train': 1.3045183420181274} 11/06/2021 23:10:55 - INFO - __main__ - Step 14011: {'lr': 0.0004919198594513771, 'samples': 2690112, 'steps': 14010, 'loss/train': 1.3401371240615845} 11/06/2021 23:10:56 - INFO - __main__ - Step 14012: {'lr': 0.0004919185211236379, 'samples': 2690304, 'steps': 14011, 'loss/train': 1.836495280265808} 11/06/2021 23:10:57 - INFO - __main__ - Step 14013: {'lr': 0.000491917182686894, 'samples': 2690496, 'steps': 14012, 'loss/train': 1.7330890893936157} 11/06/2021 23:10:57 - INFO - __main__ - Step 14014: {'lr': 0.0004919158441411459, 'samples': 2690688, 'steps': 14013, 'loss/train': 1.8115290403366089} 11/06/2021 23:10:57 - INFO - __main__ - Step 14015: {'lr': 0.0004919145054863943, 'samples': 2690880, 'steps': 14014, 'loss/train': 1.6873303651809692} 11/06/2021 23:10:58 - INFO - __main__ - Step 14016: {'lr': 0.0004919131667226398, 'samples': 2691072, 'steps': 14015, 'loss/train': 1.5199148654937744} 11/06/2021 23:10:58 - INFO - __main__ - Step 14017: {'lr': 0.0004919118278498828, 'samples': 2691264, 'steps': 14016, 'loss/train': 1.7640634775161743} 11/06/2021 23:10:58 - INFO - __main__ - Step 14018: {'lr': 0.0004919104888681242, 'samples': 2691456, 'steps': 14017, 'loss/train': 2.0215377807617188} 11/06/2021 23:10:59 - INFO - __main__ - Step 14019: {'lr': 0.0004919091497773643, 'samples': 2691648, 'steps': 14018, 'loss/train': 1.64264976978302} 11/06/2021 23:11:00 - INFO - __main__ - Step 14020: {'lr': 0.0004919078105776041, 'samples': 2691840, 'steps': 14019, 'loss/train': 1.904662013053894} 11/06/2021 23:11:00 - INFO - __main__ - Step 14021: {'lr': 0.0004919064712688439, 'samples': 2692032, 'steps': 14020, 'loss/train': 1.8053239583969116} 11/06/2021 23:11:00 - INFO - __main__ - Step 14022: {'lr': 0.0004919051318510844, 'samples': 2692224, 'steps': 14021, 'loss/train': 2.048910617828369} 11/06/2021 23:11:01 - INFO - __main__ - Step 14023: {'lr': 0.0004919037923243261, 'samples': 2692416, 'steps': 14022, 'loss/train': 1.2819080352783203} 11/06/2021 23:11:02 - INFO - __main__ - Step 14024: {'lr': 0.0004919024526885697, 'samples': 2692608, 'steps': 14023, 'loss/train': 1.1374632120132446} 11/06/2021 23:11:02 - INFO - __main__ - Step 14025: {'lr': 0.0004919011129438158, 'samples': 2692800, 'steps': 14024, 'loss/train': 1.4526000022888184} 11/06/2021 23:11:03 - INFO - __main__ - Step 14026: {'lr': 0.0004918997730900649, 'samples': 2692992, 'steps': 14025, 'loss/train': 1.2924232482910156} 11/06/2021 23:11:03 - INFO - __main__ - Step 14027: {'lr': 0.0004918984331273178, 'samples': 2693184, 'steps': 14026, 'loss/train': 2.08219313621521} 11/06/2021 23:11:03 - INFO - __main__ - Step 14028: {'lr': 0.0004918970930555751, 'samples': 2693376, 'steps': 14027, 'loss/train': 1.7137112617492676} 11/06/2021 23:11:04 - INFO - __main__ - Step 14029: {'lr': 0.0004918957528748371, 'samples': 2693568, 'steps': 14028, 'loss/train': 1.5124090909957886} 11/06/2021 23:11:05 - INFO - __main__ - Step 14030: {'lr': 0.0004918944125851047, 'samples': 2693760, 'steps': 14029, 'loss/train': 1.7956403493881226} 11/06/2021 23:11:05 - INFO - __main__ - Step 14031: {'lr': 0.0004918930721863784, 'samples': 2693952, 'steps': 14030, 'loss/train': 0.82200026512146} 11/06/2021 23:11:05 - INFO - __main__ - Step 14032: {'lr': 0.0004918917316786589, 'samples': 2694144, 'steps': 14031, 'loss/train': 1.9360613822937012} 11/06/2021 23:11:06 - INFO - __main__ - Step 14033: {'lr': 0.0004918903910619465, 'samples': 2694336, 'steps': 14032, 'loss/train': 1.157104253768921} 11/06/2021 23:11:07 - INFO - __main__ - Step 14034: {'lr': 0.0004918890503362422, 'samples': 2694528, 'steps': 14033, 'loss/train': 1.9156838655471802} 11/06/2021 23:11:07 - INFO - __main__ - Step 14035: {'lr': 0.0004918877095015465, 'samples': 2694720, 'steps': 14034, 'loss/train': 2.019812822341919} 11/06/2021 23:11:07 - INFO - __main__ - Step 14036: {'lr': 0.0004918863685578598, 'samples': 2694912, 'steps': 14035, 'loss/train': 1.6677758693695068} 11/06/2021 23:11:08 - INFO - __main__ - Step 14037: {'lr': 0.0004918850275051829, 'samples': 2695104, 'steps': 14036, 'loss/train': 2.457850217819214} 11/06/2021 23:11:08 - INFO - __main__ - Step 14038: {'lr': 0.0004918836863435162, 'samples': 2695296, 'steps': 14037, 'loss/train': 2.003868341445923} 11/06/2021 23:11:09 - INFO - __main__ - Step 14039: {'lr': 0.0004918823450728606, 'samples': 2695488, 'steps': 14038, 'loss/train': 1.5999841690063477} 11/06/2021 23:11:10 - INFO - __main__ - Step 14040: {'lr': 0.0004918810036932164, 'samples': 2695680, 'steps': 14039, 'loss/train': 1.710452675819397} 11/06/2021 23:11:10 - INFO - __main__ - Step 14041: {'lr': 0.0004918796622045844, 'samples': 2695872, 'steps': 14040, 'loss/train': 1.9019668102264404} 11/06/2021 23:11:10 - INFO - __main__ - Step 14042: {'lr': 0.0004918783206069652, 'samples': 2696064, 'steps': 14041, 'loss/train': 1.7770543098449707} 11/06/2021 23:11:11 - INFO - __main__ - Step 14043: {'lr': 0.0004918769789003593, 'samples': 2696256, 'steps': 14042, 'loss/train': 1.4025808572769165} 11/06/2021 23:11:11 - INFO - __main__ - Step 14044: {'lr': 0.0004918756370847674, 'samples': 2696448, 'steps': 14043, 'loss/train': 1.5960426330566406} 11/06/2021 23:11:12 - INFO - __main__ - Step 14045: {'lr': 0.0004918742951601902, 'samples': 2696640, 'steps': 14044, 'loss/train': 1.9002901315689087} 11/06/2021 23:11:12 - INFO - __main__ - Step 14046: {'lr': 0.000491872953126628, 'samples': 2696832, 'steps': 14045, 'loss/train': 1.677998661994934} 11/06/2021 23:11:13 - INFO - __main__ - Step 14047: {'lr': 0.0004918716109840817, 'samples': 2697024, 'steps': 14046, 'loss/train': 1.461251139640808} 11/06/2021 23:11:13 - INFO - __main__ - Step 14048: {'lr': 0.0004918702687325517, 'samples': 2697216, 'steps': 14047, 'loss/train': 1.8989447355270386} 11/06/2021 23:11:13 - INFO - __main__ - Step 14049: {'lr': 0.0004918689263720388, 'samples': 2697408, 'steps': 14048, 'loss/train': 1.1831153631210327} 11/06/2021 23:11:14 - INFO - __main__ - Step 14050: {'lr': 0.0004918675839025434, 'samples': 2697600, 'steps': 14049, 'loss/train': 1.7708138227462769} 11/06/2021 23:11:15 - INFO - __main__ - Step 14051: {'lr': 0.0004918662413240662, 'samples': 2697792, 'steps': 14050, 'loss/train': 1.32278311252594} 11/06/2021 23:11:15 - INFO - __main__ - Step 14052: {'lr': 0.0004918648986366078, 'samples': 2697984, 'steps': 14051, 'loss/train': 1.6384484767913818} 11/06/2021 23:11:15 - INFO - __main__ - Step 14053: {'lr': 0.0004918635558401687, 'samples': 2698176, 'steps': 14052, 'loss/train': 2.0729172229766846} 11/06/2021 23:11:16 - INFO - __main__ - Step 14054: {'lr': 0.0004918622129347498, 'samples': 2698368, 'steps': 14053, 'loss/train': 1.6105602979660034} 11/06/2021 23:11:17 - INFO - __main__ - Step 14055: {'lr': 0.0004918608699203515, 'samples': 2698560, 'steps': 14054, 'loss/train': 2.5296216011047363} 11/06/2021 23:11:17 - INFO - __main__ - Step 14056: {'lr': 0.0004918595267969744, 'samples': 2698752, 'steps': 14055, 'loss/train': 1.7558401823043823} 11/06/2021 23:11:18 - INFO - __main__ - Step 14057: {'lr': 0.0004918581835646191, 'samples': 2698944, 'steps': 14056, 'loss/train': 5.832611083984375} 11/06/2021 23:11:18 - INFO - __main__ - Step 14058: {'lr': 0.0004918568402232863, 'samples': 2699136, 'steps': 14057, 'loss/train': 1.5368565320968628} 11/06/2021 23:11:18 - INFO - __main__ - Step 14059: {'lr': 0.0004918554967729764, 'samples': 2699328, 'steps': 14058, 'loss/train': 2.169316291809082} 11/06/2021 23:11:19 - INFO - __main__ - Step 14060: {'lr': 0.0004918541532136902, 'samples': 2699520, 'steps': 14059, 'loss/train': 2.3635847568511963} 11/06/2021 23:11:20 - INFO - __main__ - Step 14061: {'lr': 0.0004918528095454283, 'samples': 2699712, 'steps': 14060, 'loss/train': 1.7770613431930542} 11/06/2021 23:11:20 - INFO - __main__ - Step 14062: {'lr': 0.0004918514657681913, 'samples': 2699904, 'steps': 14061, 'loss/train': 1.688179850578308} 11/06/2021 23:11:20 - INFO - __main__ - Step 14063: {'lr': 0.0004918501218819796, 'samples': 2700096, 'steps': 14062, 'loss/train': 1.860587239265442} 11/06/2021 23:11:21 - INFO - __main__ - Step 14064: {'lr': 0.0004918487778867941, 'samples': 2700288, 'steps': 14063, 'loss/train': 1.8392343521118164} 11/06/2021 23:11:21 - INFO - __main__ - Step 14065: {'lr': 0.0004918474337826353, 'samples': 2700480, 'steps': 14064, 'loss/train': 1.5922415256500244} 11/06/2021 23:11:22 - INFO - __main__ - Step 14066: {'lr': 0.0004918460895695037, 'samples': 2700672, 'steps': 14065, 'loss/train': 1.585329294204712} 11/06/2021 23:11:23 - INFO - __main__ - Step 14067: {'lr': 0.0004918447452474, 'samples': 2700864, 'steps': 14066, 'loss/train': 3.7865922451019287} 11/06/2021 23:11:23 - INFO - __main__ - Step 14068: {'lr': 0.0004918434008163247, 'samples': 2701056, 'steps': 14067, 'loss/train': 1.2407586574554443} 11/06/2021 23:11:23 - INFO - __main__ - Step 14069: {'lr': 0.0004918420562762786, 'samples': 2701248, 'steps': 14068, 'loss/train': 1.7555322647094727} 11/06/2021 23:11:24 - INFO - __main__ - Step 14070: {'lr': 0.0004918407116272622, 'samples': 2701440, 'steps': 14069, 'loss/train': 1.0781822204589844} 11/06/2021 23:11:24 - INFO - __main__ - Step 14071: {'lr': 0.000491839366869276, 'samples': 2701632, 'steps': 14070, 'loss/train': 2.1765549182891846} 11/06/2021 23:11:25 - INFO - __main__ - Step 14072: {'lr': 0.000491838022002321, 'samples': 2701824, 'steps': 14071, 'loss/train': 1.3922725915908813} 11/06/2021 23:11:25 - INFO - __main__ - Step 14073: {'lr': 0.0004918366770263972, 'samples': 2702016, 'steps': 14072, 'loss/train': 1.583871603012085} 11/06/2021 23:11:26 - INFO - __main__ - Step 14074: {'lr': 0.0004918353319415057, 'samples': 2702208, 'steps': 14073, 'loss/train': 1.4345612525939941} 11/06/2021 23:11:26 - INFO - __main__ - Step 14075: {'lr': 0.0004918339867476469, 'samples': 2702400, 'steps': 14074, 'loss/train': 1.5994923114776611} 11/06/2021 23:11:26 - INFO - __main__ - Step 14076: {'lr': 0.0004918326414448214, 'samples': 2702592, 'steps': 14075, 'loss/train': 2.2489395141601562} 11/06/2021 23:11:28 - INFO - __main__ - Step 14077: {'lr': 0.0004918312960330299, 'samples': 2702784, 'steps': 14076, 'loss/train': 1.737290382385254} 11/06/2021 23:11:28 - INFO - __main__ - Step 14078: {'lr': 0.0004918299505122729, 'samples': 2702976, 'steps': 14077, 'loss/train': 1.7684190273284912} 11/06/2021 23:11:28 - INFO - __main__ - Step 14079: {'lr': 0.000491828604882551, 'samples': 2703168, 'steps': 14078, 'loss/train': 1.933387279510498} 11/06/2021 23:11:29 - INFO - __main__ - Step 14080: {'lr': 0.0004918272591438649, 'samples': 2703360, 'steps': 14079, 'loss/train': 2.6290011405944824} 11/06/2021 23:11:29 - INFO - __main__ - Step 14081: {'lr': 0.0004918259132962153, 'samples': 2703552, 'steps': 14080, 'loss/train': 1.1858018636703491} 11/06/2021 23:11:29 - INFO - __main__ - Step 14082: {'lr': 0.0004918245673396025, 'samples': 2703744, 'steps': 14081, 'loss/train': 1.8530510663986206} 11/06/2021 23:11:31 - INFO - __main__ - Step 14083: {'lr': 0.0004918232212740274, 'samples': 2703936, 'steps': 14082, 'loss/train': 0.8487131595611572} 11/06/2021 23:11:31 - INFO - __main__ - Step 14084: {'lr': 0.0004918218750994904, 'samples': 2704128, 'steps': 14083, 'loss/train': 2.605128288269043} 11/06/2021 23:11:32 - INFO - __main__ - Step 14085: {'lr': 0.0004918205288159923, 'samples': 2704320, 'steps': 14084, 'loss/train': 1.6674420833587646} 11/06/2021 23:11:32 - INFO - __main__ - Step 14086: {'lr': 0.0004918191824235335, 'samples': 2704512, 'steps': 14085, 'loss/train': 1.54026198387146} 11/06/2021 23:11:32 - INFO - __main__ - Step 14087: {'lr': 0.0004918178359221147, 'samples': 2704704, 'steps': 14086, 'loss/train': 2.31728196144104} 11/06/2021 23:11:33 - INFO - __main__ - Step 14088: {'lr': 0.0004918164893117366, 'samples': 2704896, 'steps': 14087, 'loss/train': 1.713340401649475} 11/06/2021 23:11:34 - INFO - __main__ - Step 14089: {'lr': 0.0004918151425923996, 'samples': 2705088, 'steps': 14088, 'loss/train': 1.4485218524932861} 11/06/2021 23:11:34 - INFO - __main__ - Step 14090: {'lr': 0.0004918137957641046, 'samples': 2705280, 'steps': 14089, 'loss/train': 1.635068416595459} 11/06/2021 23:11:34 - INFO - __main__ - Step 14091: {'lr': 0.000491812448826852, 'samples': 2705472, 'steps': 14090, 'loss/train': 1.8842805624008179} 11/06/2021 23:11:35 - INFO - __main__ - Step 14092: {'lr': 0.0004918111017806424, 'samples': 2705664, 'steps': 14091, 'loss/train': 1.5843989849090576} 11/06/2021 23:11:35 - INFO - __main__ - Step 14093: {'lr': 0.0004918097546254764, 'samples': 2705856, 'steps': 14092, 'loss/train': 1.812479019165039} 11/06/2021 23:11:36 - INFO - __main__ - Step 14094: {'lr': 0.0004918084073613547, 'samples': 2706048, 'steps': 14093, 'loss/train': 1.6815907955169678} 11/06/2021 23:11:36 - INFO - __main__ - Step 14095: {'lr': 0.0004918070599882778, 'samples': 2706240, 'steps': 14094, 'loss/train': 1.7472896575927734} 11/06/2021 23:11:37 - INFO - __main__ - Step 14096: {'lr': 0.0004918057125062465, 'samples': 2706432, 'steps': 14095, 'loss/train': 1.8674776554107666} 11/06/2021 23:11:37 - INFO - __main__ - Step 14097: {'lr': 0.0004918043649152612, 'samples': 2706624, 'steps': 14096, 'loss/train': 1.9627310037612915} 11/06/2021 23:11:37 - INFO - __main__ - Step 14098: {'lr': 0.0004918030172153225, 'samples': 2706816, 'steps': 14097, 'loss/train': 1.6795316934585571} 11/06/2021 23:11:38 - INFO - __main__ - Step 14099: {'lr': 0.0004918016694064313, 'samples': 2707008, 'steps': 14098, 'loss/train': 1.9474259614944458} 11/06/2021 23:11:39 - INFO - __main__ - Step 14100: {'lr': 0.0004918003214885877, 'samples': 2707200, 'steps': 14099, 'loss/train': 1.1974012851715088} 11/06/2021 23:11:39 - INFO - __main__ - Step 14101: {'lr': 0.0004917989734617928, 'samples': 2707392, 'steps': 14100, 'loss/train': 1.9953140020370483} 11/06/2021 23:11:39 - INFO - __main__ - Step 14102: {'lr': 0.0004917976253260471, 'samples': 2707584, 'steps': 14101, 'loss/train': 1.6123121976852417} 11/06/2021 23:11:40 - INFO - __main__ - Step 14103: {'lr': 0.000491796277081351, 'samples': 2707776, 'steps': 14102, 'loss/train': 1.817934274673462} 11/06/2021 23:11:40 - INFO - __main__ - Step 14104: {'lr': 0.0004917949287277052, 'samples': 2707968, 'steps': 14103, 'loss/train': 2.0339081287384033} 11/06/2021 23:11:41 - INFO - __main__ - Step 14105: {'lr': 0.0004917935802651104, 'samples': 2708160, 'steps': 14104, 'loss/train': 1.8835687637329102} 11/06/2021 23:11:42 - INFO - __main__ - Step 14106: {'lr': 0.0004917922316935671, 'samples': 2708352, 'steps': 14105, 'loss/train': 1.9598747491836548} 11/06/2021 23:11:42 - INFO - __main__ - Step 14107: {'lr': 0.000491790883013076, 'samples': 2708544, 'steps': 14106, 'loss/train': 1.3012551069259644} 11/06/2021 23:11:42 - INFO - __main__ - Step 14108: {'lr': 0.0004917895342236377, 'samples': 2708736, 'steps': 14107, 'loss/train': 1.5593042373657227} 11/06/2021 23:11:43 - INFO - __main__ - Step 14109: {'lr': 0.0004917881853252527, 'samples': 2708928, 'steps': 14108, 'loss/train': 1.7348185777664185} 11/06/2021 23:11:44 - INFO - __main__ - Step 14110: {'lr': 0.0004917868363179216, 'samples': 2709120, 'steps': 14109, 'loss/train': 2.0034899711608887} 11/06/2021 23:11:44 - INFO - __main__ - Step 14111: {'lr': 0.0004917854872016451, 'samples': 2709312, 'steps': 14110, 'loss/train': 1.7014143466949463} 11/06/2021 23:11:44 - INFO - __main__ - Step 14112: {'lr': 0.000491784137976424, 'samples': 2709504, 'steps': 14111, 'loss/train': 0.5290675163269043} 11/06/2021 23:11:45 - INFO - __main__ - Step 14113: {'lr': 0.0004917827886422586, 'samples': 2709696, 'steps': 14112, 'loss/train': 1.854404330253601} 11/06/2021 23:11:45 - INFO - __main__ - Step 14114: {'lr': 0.0004917814391991494, 'samples': 2709888, 'steps': 14113, 'loss/train': 1.8280285596847534} 11/06/2021 23:11:46 - INFO - __main__ - Step 14115: {'lr': 0.0004917800896470974, 'samples': 2710080, 'steps': 14114, 'loss/train': 1.9659100770950317} 11/06/2021 23:11:46 - INFO - __main__ - Step 14116: {'lr': 0.000491778739986103, 'samples': 2710272, 'steps': 14115, 'loss/train': 2.029839515686035} 11/06/2021 23:11:47 - INFO - __main__ - Step 14117: {'lr': 0.0004917773902161669, 'samples': 2710464, 'steps': 14116, 'loss/train': 1.8698660135269165} 11/06/2021 23:11:47 - INFO - __main__ - Step 14118: {'lr': 0.0004917760403372895, 'samples': 2710656, 'steps': 14117, 'loss/train': 1.1892834901809692} 11/06/2021 23:11:47 - INFO - __main__ - Step 14119: {'lr': 0.0004917746903494717, 'samples': 2710848, 'steps': 14118, 'loss/train': 0.9476760029792786} 11/06/2021 23:11:48 - INFO - __main__ - Step 14120: {'lr': 0.0004917733402527138, 'samples': 2711040, 'steps': 14119, 'loss/train': 1.6532700061798096} 11/06/2021 23:11:49 - INFO - __main__ - Step 14121: {'lr': 0.0004917719900470167, 'samples': 2711232, 'steps': 14120, 'loss/train': 1.880062222480774} 11/06/2021 23:11:49 - INFO - __main__ - Step 14122: {'lr': 0.0004917706397323808, 'samples': 2711424, 'steps': 14121, 'loss/train': 2.203761100769043} 11/06/2021 23:11:49 - INFO - __main__ - Step 14123: {'lr': 0.0004917692893088067, 'samples': 2711616, 'steps': 14122, 'loss/train': 1.9275217056274414} 11/06/2021 23:11:50 - INFO - __main__ - Step 14124: {'lr': 0.0004917679387762952, 'samples': 2711808, 'steps': 14123, 'loss/train': 1.9265042543411255} 11/06/2021 23:11:51 - INFO - __main__ - Step 14125: {'lr': 0.0004917665881348467, 'samples': 2712000, 'steps': 14124, 'loss/train': 1.2948485612869263} 11/06/2021 23:11:51 - INFO - __main__ - Step 14126: {'lr': 0.000491765237384462, 'samples': 2712192, 'steps': 14125, 'loss/train': 1.6665414571762085} 11/06/2021 23:11:52 - INFO - __main__ - Step 14127: {'lr': 0.0004917638865251416, 'samples': 2712384, 'steps': 14126, 'loss/train': 0.629708468914032} 11/06/2021 23:11:52 - INFO - __main__ - Step 14128: {'lr': 0.0004917625355568861, 'samples': 2712576, 'steps': 14127, 'loss/train': 1.6347920894622803} 11/06/2021 23:11:53 - INFO - __main__ - Step 14129: {'lr': 0.0004917611844796962, 'samples': 2712768, 'steps': 14128, 'loss/train': 1.388734221458435} 11/06/2021 23:11:54 - INFO - __main__ - Step 14130: {'lr': 0.0004917598332935724, 'samples': 2712960, 'steps': 14129, 'loss/train': 1.3727388381958008} 11/06/2021 23:11:54 - INFO - __main__ - Step 14131: {'lr': 0.0004917584819985153, 'samples': 2713152, 'steps': 14130, 'loss/train': 1.5905683040618896} 11/06/2021 23:11:54 - INFO - __main__ - Step 14132: {'lr': 0.0004917571305945256, 'samples': 2713344, 'steps': 14131, 'loss/train': 1.9821319580078125} 11/06/2021 23:11:55 - INFO - __main__ - Step 14133: {'lr': 0.0004917557790816039, 'samples': 2713536, 'steps': 14132, 'loss/train': 1.3257533311843872} 11/06/2021 23:11:55 - INFO - __main__ - Step 14134: {'lr': 0.0004917544274597507, 'samples': 2713728, 'steps': 14133, 'loss/train': 1.8089780807495117} 11/06/2021 23:11:56 - INFO - __main__ - Step 14135: {'lr': 0.0004917530757289668, 'samples': 2713920, 'steps': 14134, 'loss/train': 1.6792985200881958} 11/06/2021 23:11:56 - INFO - __main__ - Step 14136: {'lr': 0.0004917517238892526, 'samples': 2714112, 'steps': 14135, 'loss/train': 1.4552292823791504} 11/06/2021 23:11:57 - INFO - __main__ - Step 14137: {'lr': 0.0004917503719406087, 'samples': 2714304, 'steps': 14136, 'loss/train': 1.8645858764648438} 11/06/2021 23:11:57 - INFO - __main__ - Step 14138: {'lr': 0.000491749019883036, 'samples': 2714496, 'steps': 14137, 'loss/train': 1.6772507429122925} 11/06/2021 23:11:58 - INFO - __main__ - Step 14139: {'lr': 0.0004917476677165349, 'samples': 2714688, 'steps': 14138, 'loss/train': 1.839224100112915} 11/06/2021 23:11:58 - INFO - __main__ - Step 14140: {'lr': 0.0004917463154411059, 'samples': 2714880, 'steps': 14139, 'loss/train': 1.2386482954025269} 11/06/2021 23:11:59 - INFO - __main__ - Step 14141: {'lr': 0.0004917449630567499, 'samples': 2715072, 'steps': 14140, 'loss/train': 1.9079393148422241} 11/06/2021 23:11:59 - INFO - __main__ - Step 14142: {'lr': 0.0004917436105634673, 'samples': 2715264, 'steps': 14141, 'loss/train': 1.7619988918304443} 11/06/2021 23:12:00 - INFO - __main__ - Step 14143: {'lr': 0.0004917422579612587, 'samples': 2715456, 'steps': 14142, 'loss/train': 1.625560998916626} 11/06/2021 23:12:00 - INFO - __main__ - Step 14144: {'lr': 0.0004917409052501248, 'samples': 2715648, 'steps': 14143, 'loss/train': 1.4624155759811401} 11/06/2021 23:12:00 - INFO - __main__ - Step 14145: {'lr': 0.0004917395524300661, 'samples': 2715840, 'steps': 14144, 'loss/train': 1.0638208389282227} 11/06/2021 23:12:02 - INFO - __main__ - Step 14146: {'lr': 0.0004917381995010834, 'samples': 2716032, 'steps': 14145, 'loss/train': 1.523756980895996} 11/06/2021 23:12:02 - INFO - __main__ - Step 14147: {'lr': 0.0004917368464631772, 'samples': 2716224, 'steps': 14146, 'loss/train': 1.4266034364700317} 11/06/2021 23:12:02 - INFO - __main__ - Step 14148: {'lr': 0.0004917354933163481, 'samples': 2716416, 'steps': 14147, 'loss/train': 2.06866455078125} 11/06/2021 23:12:03 - INFO - __main__ - Step 14149: {'lr': 0.0004917341400605967, 'samples': 2716608, 'steps': 14148, 'loss/train': 1.4128144979476929} 11/06/2021 23:12:03 - INFO - __main__ - Step 14150: {'lr': 0.0004917327866959236, 'samples': 2716800, 'steps': 14149, 'loss/train': 1.0061789751052856} 11/06/2021 23:12:04 - INFO - __main__ - Step 14151: {'lr': 0.0004917314332223295, 'samples': 2716992, 'steps': 14150, 'loss/train': 1.8678841590881348} 11/06/2021 23:12:04 - INFO - __main__ - Step 14152: {'lr': 0.0004917300796398148, 'samples': 2717184, 'steps': 14151, 'loss/train': 1.5144931077957153} 11/06/2021 23:12:05 - INFO - __main__ - Step 14153: {'lr': 0.0004917287259483805, 'samples': 2717376, 'steps': 14152, 'loss/train': 1.836294412612915} 11/06/2021 23:12:05 - INFO - __main__ - Step 14154: {'lr': 0.0004917273721480268, 'samples': 2717568, 'steps': 14153, 'loss/train': 1.3567187786102295} 11/06/2021 23:12:05 - INFO - __main__ - Step 14155: {'lr': 0.0004917260182387545, 'samples': 2717760, 'steps': 14154, 'loss/train': 1.4603012800216675} 11/06/2021 23:12:07 - INFO - __main__ - Step 14156: {'lr': 0.0004917246642205642, 'samples': 2717952, 'steps': 14155, 'loss/train': 1.4038804769515991} 11/06/2021 23:12:07 - INFO - __main__ - Step 14157: {'lr': 0.0004917233100934565, 'samples': 2718144, 'steps': 14156, 'loss/train': 1.8869411945343018} 11/06/2021 23:12:07 - INFO - __main__ - Step 14158: {'lr': 0.0004917219558574319, 'samples': 2718336, 'steps': 14157, 'loss/train': 1.7700613737106323} 11/06/2021 23:12:08 - INFO - __main__ - Step 14159: {'lr': 0.0004917206015124913, 'samples': 2718528, 'steps': 14158, 'loss/train': 0.9594613313674927} 11/06/2021 23:12:08 - INFO - __main__ - Step 14160: {'lr': 0.000491719247058635, 'samples': 2718720, 'steps': 14159, 'loss/train': 1.2669697999954224} 11/06/2021 23:12:08 - INFO - __main__ - Step 14161: {'lr': 0.0004917178924958638, 'samples': 2718912, 'steps': 14160, 'loss/train': 2.0762956142425537} 11/06/2021 23:12:09 - INFO - __main__ - Step 14162: {'lr': 0.0004917165378241782, 'samples': 2719104, 'steps': 14161, 'loss/train': 1.8425633907318115} 11/06/2021 23:12:10 - INFO - __main__ - Step 14163: {'lr': 0.0004917151830435789, 'samples': 2719296, 'steps': 14162, 'loss/train': 1.3636668920516968} 11/06/2021 23:12:10 - INFO - __main__ - Step 14164: {'lr': 0.0004917138281540664, 'samples': 2719488, 'steps': 14163, 'loss/train': 2.4625284671783447} 11/06/2021 23:12:10 - INFO - __main__ - Step 14165: {'lr': 0.0004917124731556415, 'samples': 2719680, 'steps': 14164, 'loss/train': 2.0274415016174316} 11/06/2021 23:12:11 - INFO - __main__ - Step 14166: {'lr': 0.0004917111180483046, 'samples': 2719872, 'steps': 14165, 'loss/train': 1.421899676322937} 11/06/2021 23:12:12 - INFO - __main__ - Step 14167: {'lr': 0.0004917097628320564, 'samples': 2720064, 'steps': 14166, 'loss/train': 1.8181909322738647} 11/06/2021 23:12:12 - INFO - __main__ - Step 14168: {'lr': 0.0004917084075068975, 'samples': 2720256, 'steps': 14167, 'loss/train': 1.8530933856964111} 11/06/2021 23:12:12 - INFO - __main__ - Step 14169: {'lr': 0.0004917070520728286, 'samples': 2720448, 'steps': 14168, 'loss/train': 1.7252720594406128} 11/06/2021 23:12:13 - INFO - __main__ - Step 14170: {'lr': 0.0004917056965298501, 'samples': 2720640, 'steps': 14169, 'loss/train': 1.3021842241287231} 11/06/2021 23:12:13 - INFO - __main__ - Step 14171: {'lr': 0.0004917043408779629, 'samples': 2720832, 'steps': 14170, 'loss/train': 1.746435523033142} 11/06/2021 23:12:14 - INFO - __main__ - Step 14172: {'lr': 0.0004917029851171674, 'samples': 2721024, 'steps': 14171, 'loss/train': 0.15152797102928162} 11/06/2021 23:12:15 - INFO - __main__ - Step 14173: {'lr': 0.0004917016292474642, 'samples': 2721216, 'steps': 14172, 'loss/train': 2.0078635215759277} 11/06/2021 23:12:15 - INFO - __main__ - Step 14174: {'lr': 0.000491700273268854, 'samples': 2721408, 'steps': 14173, 'loss/train': 1.525383472442627} 11/06/2021 23:12:15 - INFO - __main__ - Step 14175: {'lr': 0.0004916989171813374, 'samples': 2721600, 'steps': 14174, 'loss/train': 1.6893455982208252} 11/06/2021 23:12:16 - INFO - __main__ - Step 14176: {'lr': 0.000491697560984915, 'samples': 2721792, 'steps': 14175, 'loss/train': 1.4403749704360962} 11/06/2021 23:12:17 - INFO - __main__ - Step 14177: {'lr': 0.0004916962046795874, 'samples': 2721984, 'steps': 14176, 'loss/train': 2.55890154838562} 11/06/2021 23:12:17 - INFO - __main__ - Step 14178: {'lr': 0.0004916948482653553, 'samples': 2722176, 'steps': 14177, 'loss/train': 1.7743308544158936} 11/06/2021 23:12:17 - INFO - __main__ - Step 14179: {'lr': 0.0004916934917422191, 'samples': 2722368, 'steps': 14178, 'loss/train': 1.5195140838623047} 11/06/2021 23:12:18 - INFO - __main__ - Step 14180: {'lr': 0.0004916921351101796, 'samples': 2722560, 'steps': 14179, 'loss/train': 1.529984951019287} 11/06/2021 23:12:18 - INFO - __main__ - Step 14181: {'lr': 0.0004916907783692374, 'samples': 2722752, 'steps': 14180, 'loss/train': 1.9970260858535767} 11/06/2021 23:12:18 - INFO - __main__ - Step 14182: {'lr': 0.000491689421519393, 'samples': 2722944, 'steps': 14181, 'loss/train': 1.5570197105407715} 11/06/2021 23:12:19 - INFO - __main__ - Step 14183: {'lr': 0.0004916880645606471, 'samples': 2723136, 'steps': 14182, 'loss/train': 1.396638035774231} 11/06/2021 23:12:20 - INFO - __main__ - Step 14184: {'lr': 0.0004916867074930002, 'samples': 2723328, 'steps': 14183, 'loss/train': 1.960241436958313} 11/06/2021 23:12:20 - INFO - __main__ - Step 14185: {'lr': 0.0004916853503164531, 'samples': 2723520, 'steps': 14184, 'loss/train': 1.312820553779602} 11/06/2021 23:12:20 - INFO - __main__ - Step 14186: {'lr': 0.0004916839930310063, 'samples': 2723712, 'steps': 14185, 'loss/train': 1.5989493131637573} 11/06/2021 23:12:21 - INFO - __main__ - Step 14187: {'lr': 0.0004916826356366605, 'samples': 2723904, 'steps': 14186, 'loss/train': 2.3531341552734375} 11/06/2021 23:12:22 - INFO - __main__ - Step 14188: {'lr': 0.0004916812781334161, 'samples': 2724096, 'steps': 14187, 'loss/train': 1.5331050157546997} 11/06/2021 23:12:22 - INFO - __main__ - Step 14189: {'lr': 0.0004916799205212739, 'samples': 2724288, 'steps': 14188, 'loss/train': 1.6796480417251587} 11/06/2021 23:12:23 - INFO - __main__ - Step 14190: {'lr': 0.0004916785628002345, 'samples': 2724480, 'steps': 14189, 'loss/train': 1.338007926940918} 11/06/2021 23:12:23 - INFO - __main__ - Step 14191: {'lr': 0.0004916772049702984, 'samples': 2724672, 'steps': 14190, 'loss/train': 2.13338565826416} 11/06/2021 23:12:23 - INFO - __main__ - Step 14192: {'lr': 0.0004916758470314662, 'samples': 2724864, 'steps': 14191, 'loss/train': 1.8545008897781372} 11/06/2021 23:12:24 - INFO - __main__ - Step 14193: {'lr': 0.0004916744889837388, 'samples': 2725056, 'steps': 14192, 'loss/train': 1.7253385782241821} 11/06/2021 23:12:25 - INFO - __main__ - Step 14194: {'lr': 0.0004916731308271165, 'samples': 2725248, 'steps': 14193, 'loss/train': 1.8852863311767578} 11/06/2021 23:12:25 - INFO - __main__ - Step 14195: {'lr': 0.0004916717725616, 'samples': 2725440, 'steps': 14194, 'loss/train': 1.7347588539123535} 11/06/2021 23:12:25 - INFO - __main__ - Step 14196: {'lr': 0.0004916704141871899, 'samples': 2725632, 'steps': 14195, 'loss/train': 1.6380163431167603} 11/06/2021 23:12:26 - INFO - __main__ - Step 14197: {'lr': 0.000491669055703887, 'samples': 2725824, 'steps': 14196, 'loss/train': 1.5754060745239258} 11/06/2021 23:12:27 - INFO - __main__ - Step 14198: {'lr': 0.0004916676971116916, 'samples': 2726016, 'steps': 14197, 'loss/train': 1.435011863708496} 11/06/2021 23:12:27 - INFO - __main__ - Step 14199: {'lr': 0.0004916663384106045, 'samples': 2726208, 'steps': 14198, 'loss/train': 1.975164771080017} 11/06/2021 23:12:28 - INFO - __main__ - Step 14200: {'lr': 0.0004916649796006263, 'samples': 2726400, 'steps': 14199, 'loss/train': 1.1312105655670166} 11/06/2021 23:12:28 - INFO - __main__ - Step 14201: {'lr': 0.0004916636206817575, 'samples': 2726592, 'steps': 14200, 'loss/train': 1.805939793586731} 11/06/2021 23:12:28 - INFO - __main__ - Step 14202: {'lr': 0.0004916622616539988, 'samples': 2726784, 'steps': 14201, 'loss/train': 1.1680221557617188} 11/06/2021 23:12:29 - INFO - __main__ - Step 14203: {'lr': 0.000491660902517351, 'samples': 2726976, 'steps': 14202, 'loss/train': 1.5965232849121094} 11/06/2021 23:12:30 - INFO - __main__ - Step 14204: {'lr': 0.0004916595432718143, 'samples': 2727168, 'steps': 14203, 'loss/train': 1.660688877105713} 11/06/2021 23:12:30 - INFO - __main__ - Step 14205: {'lr': 0.0004916581839173897, 'samples': 2727360, 'steps': 14204, 'loss/train': 2.3039207458496094} 11/06/2021 23:12:31 - INFO - __main__ - Step 14206: {'lr': 0.0004916568244540776, 'samples': 2727552, 'steps': 14205, 'loss/train': 1.4516968727111816} 11/06/2021 23:12:31 - INFO - __main__ - Step 14207: {'lr': 0.0004916554648818787, 'samples': 2727744, 'steps': 14206, 'loss/train': 1.7223396301269531} 11/06/2021 23:12:31 - INFO - __main__ - Step 14208: {'lr': 0.0004916541052007936, 'samples': 2727936, 'steps': 14207, 'loss/train': 1.0482618808746338} 11/06/2021 23:12:32 - INFO - __main__ - Step 14209: {'lr': 0.0004916527454108227, 'samples': 2728128, 'steps': 14208, 'loss/train': 1.7698308229446411} 11/06/2021 23:12:33 - INFO - __main__ - Step 14210: {'lr': 0.0004916513855119669, 'samples': 2728320, 'steps': 14209, 'loss/train': 2.295707941055298} 11/06/2021 23:12:33 - INFO - __main__ - Step 14211: {'lr': 0.0004916500255042268, 'samples': 2728512, 'steps': 14210, 'loss/train': 1.637235164642334} 11/06/2021 23:12:33 - INFO - __main__ - Step 14212: {'lr': 0.0004916486653876029, 'samples': 2728704, 'steps': 14211, 'loss/train': 1.552703619003296} 11/06/2021 23:12:34 - INFO - __main__ - Step 14213: {'lr': 0.0004916473051620958, 'samples': 2728896, 'steps': 14212, 'loss/train': 2.0507657527923584} 11/06/2021 23:12:35 - INFO - __main__ - Step 14214: {'lr': 0.0004916459448277062, 'samples': 2729088, 'steps': 14213, 'loss/train': 2.118166446685791} 11/06/2021 23:12:35 - INFO - __main__ - Step 14215: {'lr': 0.0004916445843844346, 'samples': 2729280, 'steps': 14214, 'loss/train': 1.8859989643096924} 11/06/2021 23:12:36 - INFO - __main__ - Step 14216: {'lr': 0.0004916432238322818, 'samples': 2729472, 'steps': 14215, 'loss/train': 1.5724775791168213} 11/06/2021 23:12:36 - INFO - __main__ - Step 14217: {'lr': 0.0004916418631712481, 'samples': 2729664, 'steps': 14216, 'loss/train': 1.3125375509262085} 11/06/2021 23:12:36 - INFO - __main__ - Step 14218: {'lr': 0.0004916405024013344, 'samples': 2729856, 'steps': 14217, 'loss/train': 1.8595951795578003} 11/06/2021 23:12:37 - INFO - __main__ - Step 14219: {'lr': 0.0004916391415225413, 'samples': 2730048, 'steps': 14218, 'loss/train': 1.1561071872711182} 11/06/2021 23:12:38 - INFO - __main__ - Step 14220: {'lr': 0.0004916377805348692, 'samples': 2730240, 'steps': 14219, 'loss/train': 1.8915315866470337} 11/06/2021 23:12:38 - INFO - __main__ - Step 14221: {'lr': 0.000491636419438319, 'samples': 2730432, 'steps': 14220, 'loss/train': 1.322358250617981} 11/06/2021 23:12:38 - INFO - __main__ - Step 14222: {'lr': 0.000491635058232891, 'samples': 2730624, 'steps': 14221, 'loss/train': 1.5473992824554443} 11/06/2021 23:12:39 - INFO - __main__ - Step 14223: {'lr': 0.0004916336969185861, 'samples': 2730816, 'steps': 14222, 'loss/train': 0.6478114724159241} 11/06/2021 23:12:40 - INFO - __main__ - Step 14224: {'lr': 0.0004916323354954047, 'samples': 2731008, 'steps': 14223, 'loss/train': 1.2803329229354858} 11/06/2021 23:12:40 - INFO - __main__ - Step 14225: {'lr': 0.0004916309739633475, 'samples': 2731200, 'steps': 14224, 'loss/train': 1.5547466278076172} 11/06/2021 23:12:40 - INFO - __main__ - Step 14226: {'lr': 0.0004916296123224151, 'samples': 2731392, 'steps': 14225, 'loss/train': 2.0657129287719727} 11/06/2021 23:12:41 - INFO - __main__ - Step 14227: {'lr': 0.0004916282505726082, 'samples': 2731584, 'steps': 14226, 'loss/train': 1.7537693977355957} 11/06/2021 23:12:41 - INFO - __main__ - Step 14228: {'lr': 0.0004916268887139272, 'samples': 2731776, 'steps': 14227, 'loss/train': 1.8315138816833496} 11/06/2021 23:12:42 - INFO - __main__ - Step 14229: {'lr': 0.000491625526746373, 'samples': 2731968, 'steps': 14228, 'loss/train': 2.130446434020996} 11/06/2021 23:12:43 - INFO - __main__ - Step 14230: {'lr': 0.000491624164669946, 'samples': 2732160, 'steps': 14229, 'loss/train': 2.0926835536956787} 11/06/2021 23:12:43 - INFO - __main__ - Step 14231: {'lr': 0.0004916228024846469, 'samples': 2732352, 'steps': 14230, 'loss/train': 1.9206883907318115} 11/06/2021 23:12:43 - INFO - __main__ - Step 14232: {'lr': 0.0004916214401904763, 'samples': 2732544, 'steps': 14231, 'loss/train': 1.461196780204773} 11/06/2021 23:12:44 - INFO - __main__ - Step 14233: {'lr': 0.0004916200777874348, 'samples': 2732736, 'steps': 14232, 'loss/train': 1.8770463466644287} 11/06/2021 23:12:45 - INFO - __main__ - Step 14234: {'lr': 0.000491618715275523, 'samples': 2732928, 'steps': 14233, 'loss/train': 2.0279908180236816} 11/06/2021 23:12:46 - INFO - __main__ - Step 14235: {'lr': 0.0004916173526547415, 'samples': 2733120, 'steps': 14234, 'loss/train': 1.2167682647705078} 11/06/2021 23:12:46 - INFO - __main__ - Step 14236: {'lr': 0.000491615989925091, 'samples': 2733312, 'steps': 14235, 'loss/train': 1.5853732824325562} 11/06/2021 23:12:46 - INFO - __main__ - Step 14237: {'lr': 0.0004916146270865721, 'samples': 2733504, 'steps': 14236, 'loss/train': 2.5321567058563232} 11/06/2021 23:12:47 - INFO - __main__ - Step 14238: {'lr': 0.0004916132641391854, 'samples': 2733696, 'steps': 14237, 'loss/train': 1.8508648872375488} 11/06/2021 23:12:47 - INFO - __main__ - Step 14239: {'lr': 0.0004916119010829314, 'samples': 2733888, 'steps': 14238, 'loss/train': 1.860944390296936} 11/06/2021 23:12:47 - INFO - __main__ - Step 14240: {'lr': 0.0004916105379178108, 'samples': 2734080, 'steps': 14239, 'loss/train': 1.910184621810913} 11/06/2021 23:12:48 - INFO - __main__ - Step 14241: {'lr': 0.0004916091746438243, 'samples': 2734272, 'steps': 14240, 'loss/train': 1.9223930835723877} 11/06/2021 23:12:49 - INFO - __main__ - Step 14242: {'lr': 0.0004916078112609724, 'samples': 2734464, 'steps': 14241, 'loss/train': 1.7084510326385498} 11/06/2021 23:12:49 - INFO - __main__ - Step 14243: {'lr': 0.0004916064477692557, 'samples': 2734656, 'steps': 14242, 'loss/train': 0.43883585929870605} 11/06/2021 23:12:49 - INFO - __main__ - Step 14244: {'lr': 0.0004916050841686748, 'samples': 2734848, 'steps': 14243, 'loss/train': 1.6944113969802856} 11/06/2021 23:12:50 - INFO - __main__ - Step 14245: {'lr': 0.0004916037204592306, 'samples': 2735040, 'steps': 14244, 'loss/train': 1.1687214374542236} 11/06/2021 23:12:51 - INFO - __main__ - Step 14246: {'lr': 0.0004916023566409233, 'samples': 2735232, 'steps': 14245, 'loss/train': 2.218296766281128} 11/06/2021 23:12:51 - INFO - __main__ - Step 14247: {'lr': 0.0004916009927137538, 'samples': 2735424, 'steps': 14246, 'loss/train': 1.7040735483169556} 11/06/2021 23:12:52 - INFO - __main__ - Step 14248: {'lr': 0.0004915996286777226, 'samples': 2735616, 'steps': 14247, 'loss/train': 1.2565326690673828} 11/06/2021 23:12:52 - INFO - __main__ - Step 14249: {'lr': 0.0004915982645328304, 'samples': 2735808, 'steps': 14248, 'loss/train': 1.6930842399597168} 11/06/2021 23:12:52 - INFO - __main__ - Step 14250: {'lr': 0.0004915969002790777, 'samples': 2736000, 'steps': 14249, 'loss/train': 1.771859049797058} 11/06/2021 23:12:53 - INFO - __main__ - Step 14251: {'lr': 0.0004915955359164651, 'samples': 2736192, 'steps': 14250, 'loss/train': 1.4663689136505127} 11/06/2021 23:12:54 - INFO - __main__ - Step 14252: {'lr': 0.0004915941714449933, 'samples': 2736384, 'steps': 14251, 'loss/train': 1.6903026103973389} 11/06/2021 23:12:54 - INFO - __main__ - Step 14253: {'lr': 0.000491592806864663, 'samples': 2736576, 'steps': 14252, 'loss/train': 1.7190121412277222} 11/06/2021 23:12:54 - INFO - __main__ - Step 14254: {'lr': 0.0004915914421754746, 'samples': 2736768, 'steps': 14253, 'loss/train': 0.6711317300796509} 11/06/2021 23:12:55 - INFO - __main__ - Step 14255: {'lr': 0.0004915900773774289, 'samples': 2736960, 'steps': 14254, 'loss/train': 1.9526695013046265} 11/06/2021 23:12:55 - INFO - __main__ - Step 14256: {'lr': 0.0004915887124705263, 'samples': 2737152, 'steps': 14255, 'loss/train': 2.055246353149414} 11/06/2021 23:12:56 - INFO - __main__ - Step 14257: {'lr': 0.0004915873474547677, 'samples': 2737344, 'steps': 14256, 'loss/train': 1.7780791521072388} 11/06/2021 23:12:57 - INFO - __main__ - Step 14258: {'lr': 0.0004915859823301535, 'samples': 2737536, 'steps': 14257, 'loss/train': 1.91822350025177} 11/06/2021 23:12:57 - INFO - __main__ - Step 14259: {'lr': 0.0004915846170966845, 'samples': 2737728, 'steps': 14258, 'loss/train': 1.2504817247390747} 11/06/2021 23:12:57 - INFO - __main__ - Step 14260: {'lr': 0.000491583251754361, 'samples': 2737920, 'steps': 14259, 'loss/train': 2.1529762744903564} 11/06/2021 23:12:58 - INFO - __main__ - Step 14261: {'lr': 0.0004915818863031839, 'samples': 2738112, 'steps': 14260, 'loss/train': 2.2141220569610596} 11/06/2021 23:12:58 - INFO - __main__ - Step 14262: {'lr': 0.0004915805207431537, 'samples': 2738304, 'steps': 14261, 'loss/train': 1.4041943550109863} 11/06/2021 23:12:59 - INFO - __main__ - Step 14263: {'lr': 0.0004915791550742712, 'samples': 2738496, 'steps': 14262, 'loss/train': 2.012119770050049} 11/06/2021 23:12:59 - INFO - __main__ - Step 14264: {'lr': 0.0004915777892965368, 'samples': 2738688, 'steps': 14263, 'loss/train': 1.8272478580474854} 11/06/2021 23:13:00 - INFO - __main__ - Step 14265: {'lr': 0.0004915764234099511, 'samples': 2738880, 'steps': 14264, 'loss/train': 1.6100664138793945} 11/06/2021 23:13:00 - INFO - __main__ - Step 14266: {'lr': 0.0004915750574145148, 'samples': 2739072, 'steps': 14265, 'loss/train': 1.9792180061340332} 11/06/2021 23:13:00 - INFO - __main__ - Step 14267: {'lr': 0.0004915736913102285, 'samples': 2739264, 'steps': 14266, 'loss/train': 1.8761942386627197} 11/06/2021 23:13:02 - INFO - __main__ - Step 14268: {'lr': 0.0004915723250970928, 'samples': 2739456, 'steps': 14267, 'loss/train': 2.067959785461426} 11/06/2021 23:13:02 - INFO - __main__ - Step 14269: {'lr': 0.0004915709587751084, 'samples': 2739648, 'steps': 14268, 'loss/train': 2.4542698860168457} 11/06/2021 23:13:02 - INFO - __main__ - Step 14270: {'lr': 0.0004915695923442759, 'samples': 2739840, 'steps': 14269, 'loss/train': 1.489783763885498} 11/06/2021 23:13:03 - INFO - __main__ - Step 14271: {'lr': 0.0004915682258045958, 'samples': 2740032, 'steps': 14270, 'loss/train': 1.8175733089447021} 11/06/2021 23:13:03 - INFO - __main__ - Step 14272: {'lr': 0.0004915668591560688, 'samples': 2740224, 'steps': 14271, 'loss/train': 1.3902310132980347} 11/06/2021 23:13:04 - INFO - __main__ - Step 14273: {'lr': 0.0004915654923986955, 'samples': 2740416, 'steps': 14272, 'loss/train': 1.5795223712921143} 11/06/2021 23:13:04 - INFO - __main__ - Step 14274: {'lr': 0.0004915641255324764, 'samples': 2740608, 'steps': 14273, 'loss/train': 1.8649059534072876} 11/06/2021 23:13:05 - INFO - __main__ - Step 14275: {'lr': 0.0004915627585574124, 'samples': 2740800, 'steps': 14274, 'loss/train': 1.643608570098877} 11/06/2021 23:13:05 - INFO - __main__ - Step 14276: {'lr': 0.0004915613914735038, 'samples': 2740992, 'steps': 14275, 'loss/train': 2.0051262378692627} 11/06/2021 23:13:05 - INFO - __main__ - Step 14277: {'lr': 0.0004915600242807516, 'samples': 2741184, 'steps': 14276, 'loss/train': 1.455479383468628} 11/06/2021 23:13:06 - INFO - __main__ - Step 14278: {'lr': 0.000491558656979156, 'samples': 2741376, 'steps': 14277, 'loss/train': 1.6889221668243408} 11/06/2021 23:13:07 - INFO - __main__ - Step 14279: {'lr': 0.0004915572895687179, 'samples': 2741568, 'steps': 14278, 'loss/train': 1.9221125841140747} 11/06/2021 23:13:07 - INFO - __main__ - Step 14280: {'lr': 0.0004915559220494376, 'samples': 2741760, 'steps': 14279, 'loss/train': 1.842481017112732} 11/06/2021 23:13:07 - INFO - __main__ - Step 14281: {'lr': 0.0004915545544213161, 'samples': 2741952, 'steps': 14280, 'loss/train': 1.5338388681411743} 11/06/2021 23:13:08 - INFO - __main__ - Step 14282: {'lr': 0.0004915531866843539, 'samples': 2742144, 'steps': 14281, 'loss/train': 1.903611660003662} 11/06/2021 23:13:09 - INFO - __main__ - Step 14283: {'lr': 0.0004915518188385514, 'samples': 2742336, 'steps': 14282, 'loss/train': 1.557709813117981} 11/06/2021 23:13:09 - INFO - __main__ - Step 14284: {'lr': 0.0004915504508839095, 'samples': 2742528, 'steps': 14283, 'loss/train': 1.7310574054718018} 11/06/2021 23:13:09 - INFO - __main__ - Step 14285: {'lr': 0.0004915490828204287, 'samples': 2742720, 'steps': 14284, 'loss/train': 1.8797122240066528} 11/06/2021 23:13:10 - INFO - __main__ - Step 14286: {'lr': 0.0004915477146481095, 'samples': 2742912, 'steps': 14285, 'loss/train': 1.9240556955337524} 11/06/2021 23:13:10 - INFO - __main__ - Step 14287: {'lr': 0.0004915463463669527, 'samples': 2743104, 'steps': 14286, 'loss/train': 2.307543992996216} 11/06/2021 23:13:11 - INFO - __main__ - Step 14288: {'lr': 0.0004915449779769589, 'samples': 2743296, 'steps': 14287, 'loss/train': 2.076409339904785} 11/06/2021 23:13:12 - INFO - __main__ - Step 14289: {'lr': 0.0004915436094781285, 'samples': 2743488, 'steps': 14288, 'loss/train': 1.6683952808380127} 11/06/2021 23:13:12 - INFO - __main__ - Step 14290: {'lr': 0.0004915422408704624, 'samples': 2743680, 'steps': 14289, 'loss/train': 1.3777574300765991} 11/06/2021 23:13:12 - INFO - __main__ - Step 14291: {'lr': 0.0004915408721539612, 'samples': 2743872, 'steps': 14290, 'loss/train': 1.556388020515442} 11/06/2021 23:13:13 - INFO - __main__ - Step 14292: {'lr': 0.0004915395033286251, 'samples': 2744064, 'steps': 14291, 'loss/train': 1.8184715509414673} 11/06/2021 23:13:13 - INFO - __main__ - Step 14293: {'lr': 0.0004915381343944552, 'samples': 2744256, 'steps': 14292, 'loss/train': 1.5744520425796509} 11/06/2021 23:13:14 - INFO - __main__ - Step 14294: {'lr': 0.0004915367653514521, 'samples': 2744448, 'steps': 14293, 'loss/train': 1.5268954038619995} 11/06/2021 23:13:14 - INFO - __main__ - Step 14295: {'lr': 0.0004915353961996161, 'samples': 2744640, 'steps': 14294, 'loss/train': 1.4393970966339111} 11/06/2021 23:13:15 - INFO - __main__ - Step 14296: {'lr': 0.000491534026938948, 'samples': 2744832, 'steps': 14295, 'loss/train': 1.8665683269500732} 11/06/2021 23:13:15 - INFO - __main__ - Step 14297: {'lr': 0.0004915326575694484, 'samples': 2745024, 'steps': 14296, 'loss/train': 2.0807924270629883} 11/06/2021 23:13:15 - INFO - __main__ - Step 14298: {'lr': 0.0004915312880911178, 'samples': 2745216, 'steps': 14297, 'loss/train': 1.800079345703125} 11/06/2021 23:13:16 - INFO - __main__ - Step 14299: {'lr': 0.000491529918503957, 'samples': 2745408, 'steps': 14298, 'loss/train': 1.6191036701202393} 11/06/2021 23:13:17 - INFO - __main__ - Step 14300: {'lr': 0.0004915285488079666, 'samples': 2745600, 'steps': 14299, 'loss/train': 1.6357394456863403} 11/06/2021 23:13:17 - INFO - __main__ - Step 14301: {'lr': 0.0004915271790031471, 'samples': 2745792, 'steps': 14300, 'loss/train': 1.8493192195892334} 11/06/2021 23:13:17 - INFO - __main__ - Step 14302: {'lr': 0.0004915258090894993, 'samples': 2745984, 'steps': 14301, 'loss/train': 1.3855708837509155} 11/06/2021 23:13:18 - INFO - __main__ - Step 14303: {'lr': 0.0004915244390670236, 'samples': 2746176, 'steps': 14302, 'loss/train': 1.6835711002349854} 11/06/2021 23:13:19 - INFO - __main__ - Step 14304: {'lr': 0.0004915230689357206, 'samples': 2746368, 'steps': 14303, 'loss/train': 1.9127839803695679} 11/06/2021 23:13:19 - INFO - __main__ - Step 14305: {'lr': 0.0004915216986955913, 'samples': 2746560, 'steps': 14304, 'loss/train': 1.7443958520889282} 11/06/2021 23:13:19 - INFO - __main__ - Step 14306: {'lr': 0.0004915203283466359, 'samples': 2746752, 'steps': 14305, 'loss/train': 1.8569912910461426} 11/06/2021 23:13:20 - INFO - __main__ - Step 14307: {'lr': 0.0004915189578888552, 'samples': 2746944, 'steps': 14306, 'loss/train': 2.002859354019165} 11/06/2021 23:13:20 - INFO - __main__ - Step 14308: {'lr': 0.0004915175873222497, 'samples': 2747136, 'steps': 14307, 'loss/train': 1.7802067995071411} 11/06/2021 23:13:21 - INFO - __main__ - Step 14309: {'lr': 0.0004915162166468201, 'samples': 2747328, 'steps': 14308, 'loss/train': 1.61421799659729} 11/06/2021 23:13:22 - INFO - __main__ - Step 14310: {'lr': 0.0004915148458625671, 'samples': 2747520, 'steps': 14309, 'loss/train': 1.5801818370819092} 11/06/2021 23:13:22 - INFO - __main__ - Step 14311: {'lr': 0.0004915134749694912, 'samples': 2747712, 'steps': 14310, 'loss/train': 1.24630606174469} 11/06/2021 23:13:22 - INFO - __main__ - Step 14312: {'lr': 0.000491512103967593, 'samples': 2747904, 'steps': 14311, 'loss/train': 1.8175888061523438} 11/06/2021 23:13:23 - INFO - __main__ - Step 14313: {'lr': 0.0004915107328568733, 'samples': 2748096, 'steps': 14312, 'loss/train': 1.7059366703033447} 11/06/2021 23:13:23 - INFO - __main__ - Step 14314: {'lr': 0.0004915093616373326, 'samples': 2748288, 'steps': 14313, 'loss/train': 1.2261815071105957} 11/06/2021 23:13:24 - INFO - __main__ - Step 14315: {'lr': 0.0004915079903089714, 'samples': 2748480, 'steps': 14314, 'loss/train': 1.8357681035995483} 11/06/2021 23:13:24 - INFO - __main__ - Step 14316: {'lr': 0.0004915066188717905, 'samples': 2748672, 'steps': 14315, 'loss/train': 1.671004056930542} 11/06/2021 23:13:25 - INFO - __main__ - Step 14317: {'lr': 0.0004915052473257904, 'samples': 2748864, 'steps': 14316, 'loss/train': 1.6903188228607178} 11/06/2021 23:13:25 - INFO - __main__ - Step 14318: {'lr': 0.0004915038756709717, 'samples': 2749056, 'steps': 14317, 'loss/train': 1.4683510065078735} 11/06/2021 23:13:25 - INFO - __main__ - Step 14319: {'lr': 0.0004915025039073352, 'samples': 2749248, 'steps': 14318, 'loss/train': 1.8368884325027466} 11/06/2021 23:13:26 - INFO - __main__ - Step 14320: {'lr': 0.0004915011320348814, 'samples': 2749440, 'steps': 14319, 'loss/train': 1.3511337041854858} 11/06/2021 23:13:27 - INFO - __main__ - Step 14321: {'lr': 0.0004914997600536108, 'samples': 2749632, 'steps': 14320, 'loss/train': 1.4233803749084473} 11/06/2021 23:13:27 - INFO - __main__ - Step 14322: {'lr': 0.0004914983879635242, 'samples': 2749824, 'steps': 14321, 'loss/train': 1.6688144207000732} 11/06/2021 23:13:28 - INFO - __main__ - Step 14323: {'lr': 0.0004914970157646222, 'samples': 2750016, 'steps': 14322, 'loss/train': 1.5857484340667725} 11/06/2021 23:13:28 - INFO - __main__ - Step 14324: {'lr': 0.0004914956434569054, 'samples': 2750208, 'steps': 14323, 'loss/train': 1.8138678073883057} 11/06/2021 23:13:29 - INFO - __main__ - Step 14325: {'lr': 0.0004914942710403743, 'samples': 2750400, 'steps': 14324, 'loss/train': 2.092402935028076} 11/06/2021 23:13:29 - INFO - __main__ - Step 14326: {'lr': 0.0004914928985150296, 'samples': 2750592, 'steps': 14325, 'loss/train': 1.820569396018982} 11/06/2021 23:13:30 - INFO - __main__ - Step 14327: {'lr': 0.0004914915258808719, 'samples': 2750784, 'steps': 14326, 'loss/train': 1.4420838356018066} 11/06/2021 23:13:30 - INFO - __main__ - Step 14328: {'lr': 0.0004914901531379019, 'samples': 2750976, 'steps': 14327, 'loss/train': 1.843706727027893} 11/06/2021 23:13:30 - INFO - __main__ - Step 14329: {'lr': 0.0004914887802861201, 'samples': 2751168, 'steps': 14328, 'loss/train': 1.7319104671478271} 11/06/2021 23:13:32 - INFO - __main__ - Step 14330: {'lr': 0.0004914874073255273, 'samples': 2751360, 'steps': 14329, 'loss/train': 1.3918893337249756} 11/06/2021 23:13:32 - INFO - __main__ - Step 14331: {'lr': 0.0004914860342561239, 'samples': 2751552, 'steps': 14330, 'loss/train': 1.587130069732666} 11/06/2021 23:13:32 - INFO - __main__ - Step 14332: {'lr': 0.0004914846610779107, 'samples': 2751744, 'steps': 14331, 'loss/train': 1.7063366174697876} 11/06/2021 23:13:33 - INFO - __main__ - Step 14333: {'lr': 0.0004914832877908881, 'samples': 2751936, 'steps': 14332, 'loss/train': 1.7004281282424927} 11/06/2021 23:13:33 - INFO - __main__ - Step 14334: {'lr': 0.0004914819143950571, 'samples': 2752128, 'steps': 14333, 'loss/train': 1.0245263576507568} 11/06/2021 23:13:33 - INFO - __main__ - Step 14335: {'lr': 0.0004914805408904179, 'samples': 2752320, 'steps': 14334, 'loss/train': 1.9465148448944092} 11/06/2021 23:13:34 - INFO - __main__ - Step 14336: {'lr': 0.0004914791672769713, 'samples': 2752512, 'steps': 14335, 'loss/train': 1.6169592142105103} 11/06/2021 23:13:35 - INFO - __main__ - Step 14337: {'lr': 0.000491477793554718, 'samples': 2752704, 'steps': 14336, 'loss/train': 1.842124342918396} 11/06/2021 23:13:35 - INFO - __main__ - Step 14338: {'lr': 0.0004914764197236584, 'samples': 2752896, 'steps': 14337, 'loss/train': 2.156404495239258} 11/06/2021 23:13:35 - INFO - __main__ - Step 14339: {'lr': 0.0004914750457837933, 'samples': 2753088, 'steps': 14338, 'loss/train': 2.6054229736328125} 11/06/2021 23:13:36 - INFO - __main__ - Step 14340: {'lr': 0.0004914736717351233, 'samples': 2753280, 'steps': 14339, 'loss/train': 0.8791477084159851} 11/06/2021 23:13:37 - INFO - __main__ - Step 14341: {'lr': 0.000491472297577649, 'samples': 2753472, 'steps': 14340, 'loss/train': 2.0689876079559326} 11/06/2021 23:13:37 - INFO - __main__ - Step 14342: {'lr': 0.000491470923311371, 'samples': 2753664, 'steps': 14341, 'loss/train': 2.8645334243774414} 11/06/2021 23:13:38 - INFO - __main__ - Step 14343: {'lr': 0.0004914695489362899, 'samples': 2753856, 'steps': 14342, 'loss/train': 1.755697250366211} 11/06/2021 23:13:38 - INFO - __main__ - Step 14344: {'lr': 0.0004914681744524064, 'samples': 2754048, 'steps': 14343, 'loss/train': 1.9771595001220703} 11/06/2021 23:13:38 - INFO - __main__ - Step 14345: {'lr': 0.0004914667998597211, 'samples': 2754240, 'steps': 14344, 'loss/train': 1.9034637212753296} 11/06/2021 23:13:39 - INFO - __main__ - Step 14346: {'lr': 0.0004914654251582344, 'samples': 2754432, 'steps': 14345, 'loss/train': 2.0097978115081787} 11/06/2021 23:13:40 - INFO - __main__ - Step 14347: {'lr': 0.0004914640503479473, 'samples': 2754624, 'steps': 14346, 'loss/train': 1.8135510683059692} 11/06/2021 23:13:40 - INFO - __main__ - Step 14348: {'lr': 0.0004914626754288601, 'samples': 2754816, 'steps': 14347, 'loss/train': 2.062073230743408} 11/06/2021 23:13:40 - INFO - __main__ - Step 14349: {'lr': 0.0004914613004009736, 'samples': 2755008, 'steps': 14348, 'loss/train': 1.3072386980056763} 11/06/2021 23:13:41 - INFO - __main__ - Step 14350: {'lr': 0.0004914599252642884, 'samples': 2755200, 'steps': 14349, 'loss/train': 1.7098357677459717} 11/06/2021 23:13:41 - INFO - __main__ - Step 14351: {'lr': 0.000491458550018805, 'samples': 2755392, 'steps': 14350, 'loss/train': 1.7779287099838257} 11/06/2021 23:13:42 - INFO - __main__ - Step 14352: {'lr': 0.0004914571746645242, 'samples': 2755584, 'steps': 14351, 'loss/train': 2.2135913372039795} 11/06/2021 23:13:43 - INFO - __main__ - Step 14353: {'lr': 0.0004914557992014465, 'samples': 2755776, 'steps': 14352, 'loss/train': 1.0863014459609985} 11/06/2021 23:13:43 - INFO - __main__ - Step 14354: {'lr': 0.0004914544236295725, 'samples': 2755968, 'steps': 14353, 'loss/train': 1.349368691444397} 11/06/2021 23:13:43 - INFO - __main__ - Step 14355: {'lr': 0.0004914530479489029, 'samples': 2756160, 'steps': 14354, 'loss/train': 1.2938106060028076} 11/06/2021 23:13:44 - INFO - __main__ - Step 14356: {'lr': 0.0004914516721594382, 'samples': 2756352, 'steps': 14355, 'loss/train': 1.8118442296981812} 11/06/2021 23:13:45 - INFO - __main__ - Step 14357: {'lr': 0.0004914502962611792, 'samples': 2756544, 'steps': 14356, 'loss/train': 1.9531524181365967} 11/06/2021 23:13:45 - INFO - __main__ - Step 14358: {'lr': 0.0004914489202541264, 'samples': 2756736, 'steps': 14357, 'loss/train': 1.827507734298706} 11/06/2021 23:13:45 - INFO - __main__ - Step 14359: {'lr': 0.0004914475441382804, 'samples': 2756928, 'steps': 14358, 'loss/train': 1.6210410594940186} 11/06/2021 23:13:46 - INFO - __main__ - Step 14360: {'lr': 0.0004914461679136419, 'samples': 2757120, 'steps': 14359, 'loss/train': 1.5644958019256592} 11/06/2021 23:13:46 - INFO - __main__ - Step 14361: {'lr': 0.0004914447915802115, 'samples': 2757312, 'steps': 14360, 'loss/train': 1.3498966693878174} 11/06/2021 23:13:47 - INFO - __main__ - Step 14362: {'lr': 0.0004914434151379898, 'samples': 2757504, 'steps': 14361, 'loss/train': 1.2043393850326538} 11/06/2021 23:13:47 - INFO - __main__ - Step 14363: {'lr': 0.0004914420385869773, 'samples': 2757696, 'steps': 14362, 'loss/train': 1.314516544342041} 11/06/2021 23:13:48 - INFO - __main__ - Step 14364: {'lr': 0.0004914406619271749, 'samples': 2757888, 'steps': 14363, 'loss/train': 1.803429126739502} 11/06/2021 23:13:48 - INFO - __main__ - Step 14365: {'lr': 0.0004914392851585829, 'samples': 2758080, 'steps': 14364, 'loss/train': 1.7351176738739014} 11/06/2021 23:13:48 - INFO - __main__ - Step 14366: {'lr': 0.0004914379082812023, 'samples': 2758272, 'steps': 14365, 'loss/train': 1.1345579624176025} 11/06/2021 23:13:49 - INFO - __main__ - Step 14367: {'lr': 0.0004914365312950333, 'samples': 2758464, 'steps': 14366, 'loss/train': 2.234651803970337} 11/06/2021 23:13:50 - INFO - __main__ - Step 14368: {'lr': 0.0004914351542000768, 'samples': 2758656, 'steps': 14367, 'loss/train': 1.3707553148269653} 11/06/2021 23:13:50 - INFO - __main__ - Step 14369: {'lr': 0.0004914337769963334, 'samples': 2758848, 'steps': 14368, 'loss/train': 1.8129569292068481} 11/06/2021 23:13:51 - INFO - __main__ - Step 14370: {'lr': 0.0004914323996838036, 'samples': 2759040, 'steps': 14369, 'loss/train': 1.368338942527771} 11/06/2021 23:13:51 - INFO - __main__ - Step 14371: {'lr': 0.0004914310222624881, 'samples': 2759232, 'steps': 14370, 'loss/train': 1.6486504077911377} 11/06/2021 23:13:51 - INFO - __main__ - Step 14372: {'lr': 0.0004914296447323875, 'samples': 2759424, 'steps': 14371, 'loss/train': 1.776429295539856} 11/06/2021 23:13:52 - INFO - __main__ - Step 14373: {'lr': 0.0004914282670935025, 'samples': 2759616, 'steps': 14372, 'loss/train': 1.8240113258361816} 11/06/2021 23:13:53 - INFO - __main__ - Step 14374: {'lr': 0.0004914268893458336, 'samples': 2759808, 'steps': 14373, 'loss/train': 1.7623484134674072} 11/06/2021 23:13:53 - INFO - __main__ - Step 14375: {'lr': 0.0004914255114893814, 'samples': 2760000, 'steps': 14374, 'loss/train': 1.851312518119812} 11/06/2021 23:13:53 - INFO - __main__ - Step 14376: {'lr': 0.0004914241335241467, 'samples': 2760192, 'steps': 14375, 'loss/train': 1.748906135559082} 11/06/2021 23:13:54 - INFO - __main__ - Step 14377: {'lr': 0.0004914227554501299, 'samples': 2760384, 'steps': 14376, 'loss/train': 0.8111178278923035} 11/06/2021 23:13:55 - INFO - __main__ - Step 14378: {'lr': 0.0004914213772673319, 'samples': 2760576, 'steps': 14377, 'loss/train': 0.3423219919204712} 11/06/2021 23:13:56 - INFO - __main__ - Step 14379: {'lr': 0.0004914199989757529, 'samples': 2760768, 'steps': 14378, 'loss/train': 1.715069055557251} 11/06/2021 23:13:56 - INFO - __main__ - Step 14380: {'lr': 0.000491418620575394, 'samples': 2760960, 'steps': 14379, 'loss/train': 1.3890851736068726} 11/06/2021 23:13:57 - INFO - __main__ - Step 14381: {'lr': 0.0004914172420662556, 'samples': 2761152, 'steps': 14380, 'loss/train': 1.8425896167755127} 11/06/2021 23:13:57 - INFO - __main__ - Step 14382: {'lr': 0.0004914158634483381, 'samples': 2761344, 'steps': 14381, 'loss/train': 1.8168810606002808} 11/06/2021 23:13:57 - INFO - __main__ - Step 14383: {'lr': 0.0004914144847216425, 'samples': 2761536, 'steps': 14382, 'loss/train': 1.2000967264175415} 11/06/2021 23:13:58 - INFO - __main__ - Step 14384: {'lr': 0.0004914131058861693, 'samples': 2761728, 'steps': 14383, 'loss/train': 1.7531344890594482} 11/06/2021 23:13:59 - INFO - __main__ - Step 14385: {'lr': 0.000491411726941919, 'samples': 2761920, 'steps': 14384, 'loss/train': 1.7564046382904053} 11/06/2021 23:13:59 - INFO - __main__ - Step 14386: {'lr': 0.0004914103478888922, 'samples': 2762112, 'steps': 14385, 'loss/train': 1.1627519130706787} 11/06/2021 23:13:59 - INFO - __main__ - Step 14387: {'lr': 0.0004914089687270898, 'samples': 2762304, 'steps': 14386, 'loss/train': 1.7761569023132324} 11/06/2021 23:14:00 - INFO - __main__ - Step 14388: {'lr': 0.0004914075894565122, 'samples': 2762496, 'steps': 14387, 'loss/train': 1.060185194015503} 11/06/2021 23:14:00 - INFO - __main__ - Step 14389: {'lr': 0.00049140621007716, 'samples': 2762688, 'steps': 14388, 'loss/train': 1.5696700811386108} 11/06/2021 23:14:01 - INFO - __main__ - Step 14390: {'lr': 0.0004914048305890339, 'samples': 2762880, 'steps': 14389, 'loss/train': 1.3252995014190674} 11/06/2021 23:14:01 - INFO - __main__ - Step 14391: {'lr': 0.0004914034509921345, 'samples': 2763072, 'steps': 14390, 'loss/train': 2.1088061332702637} 11/06/2021 23:14:02 - INFO - __main__ - Step 14392: {'lr': 0.0004914020712864626, 'samples': 2763264, 'steps': 14391, 'loss/train': 1.9577943086624146} 11/06/2021 23:14:02 - INFO - __main__ - Step 14393: {'lr': 0.0004914006914720184, 'samples': 2763456, 'steps': 14392, 'loss/train': 1.1342967748641968} 11/06/2021 23:14:03 - INFO - __main__ - Step 14394: {'lr': 0.0004913993115488029, 'samples': 2763648, 'steps': 14393, 'loss/train': 1.1955634355545044} 11/06/2021 23:14:03 - INFO - __main__ - Step 14395: {'lr': 0.0004913979315168167, 'samples': 2763840, 'steps': 14394, 'loss/train': 1.5130412578582764} 11/06/2021 23:14:04 - INFO - __main__ - Step 14396: {'lr': 0.0004913965513760601, 'samples': 2764032, 'steps': 14395, 'loss/train': 1.6799837350845337} 11/06/2021 23:14:04 - INFO - __main__ - Step 14397: {'lr': 0.0004913951711265341, 'samples': 2764224, 'steps': 14396, 'loss/train': 1.8832666873931885} 11/06/2021 23:14:05 - INFO - __main__ - Step 14398: {'lr': 0.0004913937907682391, 'samples': 2764416, 'steps': 14397, 'loss/train': 1.6365022659301758} 11/06/2021 23:14:05 - INFO - __main__ - Step 14399: {'lr': 0.0004913924103011757, 'samples': 2764608, 'steps': 14398, 'loss/train': 1.8931413888931274} 11/06/2021 23:14:05 - INFO - __main__ - Step 14400: {'lr': 0.0004913910297253448, 'samples': 2764800, 'steps': 14399, 'loss/train': 1.8184783458709717} 11/06/2021 23:14:06 - INFO - __main__ - Step 14401: {'lr': 0.0004913896490407467, 'samples': 2764992, 'steps': 14400, 'loss/train': 1.6293751001358032} 11/06/2021 23:14:07 - INFO - __main__ - Step 14402: {'lr': 0.0004913882682473821, 'samples': 2765184, 'steps': 14401, 'loss/train': 1.6088601350784302} 11/06/2021 23:14:07 - INFO - __main__ - Step 14403: {'lr': 0.0004913868873452519, 'samples': 2765376, 'steps': 14402, 'loss/train': 1.4631474018096924} 11/06/2021 23:14:07 - INFO - __main__ - Step 14404: {'lr': 0.0004913855063343563, 'samples': 2765568, 'steps': 14403, 'loss/train': 1.457092523574829} 11/06/2021 23:14:08 - INFO - __main__ - Step 14405: {'lr': 0.0004913841252146961, 'samples': 2765760, 'steps': 14404, 'loss/train': 1.6985125541687012} 11/06/2021 23:14:09 - INFO - __main__ - Step 14406: {'lr': 0.000491382743986272, 'samples': 2765952, 'steps': 14405, 'loss/train': 2.084760904312134} 11/06/2021 23:14:09 - INFO - __main__ - Step 14407: {'lr': 0.0004913813626490845, 'samples': 2766144, 'steps': 14406, 'loss/train': 1.3225696086883545} 11/06/2021 23:14:09 - INFO - __main__ - Step 14408: {'lr': 0.0004913799812031343, 'samples': 2766336, 'steps': 14407, 'loss/train': 1.6497886180877686} 11/06/2021 23:14:10 - INFO - __main__ - Step 14409: {'lr': 0.0004913785996484221, 'samples': 2766528, 'steps': 14408, 'loss/train': 1.065016746520996} 11/06/2021 23:14:10 - INFO - __main__ - Step 14410: {'lr': 0.0004913772179849483, 'samples': 2766720, 'steps': 14409, 'loss/train': 1.7048285007476807} 11/06/2021 23:14:11 - INFO - __main__ - Step 14411: {'lr': 0.0004913758362127137, 'samples': 2766912, 'steps': 14410, 'loss/train': 1.4540810585021973} 11/06/2021 23:14:11 - INFO - __main__ - Step 14412: {'lr': 0.0004913744543317189, 'samples': 2767104, 'steps': 14411, 'loss/train': 1.4666005373001099} 11/06/2021 23:14:12 - INFO - __main__ - Step 14413: {'lr': 0.0004913730723419645, 'samples': 2767296, 'steps': 14412, 'loss/train': 1.2720320224761963} 11/06/2021 23:14:12 - INFO - __main__ - Step 14414: {'lr': 0.000491371690243451, 'samples': 2767488, 'steps': 14413, 'loss/train': 1.3805193901062012} 11/06/2021 23:14:12 - INFO - __main__ - Step 14415: {'lr': 0.0004913703080361793, 'samples': 2767680, 'steps': 14414, 'loss/train': 2.1130664348602295} 11/06/2021 23:14:14 - INFO - __main__ - Step 14416: {'lr': 0.0004913689257201499, 'samples': 2767872, 'steps': 14415, 'loss/train': 1.317294955253601} 11/06/2021 23:14:14 - INFO - __main__ - Step 14417: {'lr': 0.0004913675432953633, 'samples': 2768064, 'steps': 14416, 'loss/train': 1.1042780876159668} 11/06/2021 23:14:14 - INFO - __main__ - Step 14418: {'lr': 0.0004913661607618202, 'samples': 2768256, 'steps': 14417, 'loss/train': 2.0876216888427734} 11/06/2021 23:14:15 - INFO - __main__ - Step 14419: {'lr': 0.0004913647781195212, 'samples': 2768448, 'steps': 14418, 'loss/train': 1.5809930562973022} 11/06/2021 23:14:15 - INFO - __main__ - Step 14420: {'lr': 0.000491363395368467, 'samples': 2768640, 'steps': 14419, 'loss/train': 0.730297863483429} 11/06/2021 23:14:15 - INFO - __main__ - Step 14421: {'lr': 0.0004913620125086581, 'samples': 2768832, 'steps': 14420, 'loss/train': 5.8645501136779785} 11/06/2021 23:14:16 - INFO - __main__ - Step 14422: {'lr': 0.0004913606295400953, 'samples': 2769024, 'steps': 14421, 'loss/train': 2.153425455093384} 11/06/2021 23:14:17 - INFO - __main__ - Step 14423: {'lr': 0.000491359246462779, 'samples': 2769216, 'steps': 14422, 'loss/train': 1.7173231840133667} 11/06/2021 23:14:17 - INFO - __main__ - Step 14424: {'lr': 0.0004913578632767101, 'samples': 2769408, 'steps': 14423, 'loss/train': 1.7363717555999756} 11/06/2021 23:14:18 - INFO - __main__ - Step 14425: {'lr': 0.0004913564799818891, 'samples': 2769600, 'steps': 14424, 'loss/train': 1.796966314315796} 11/06/2021 23:14:18 - INFO - __main__ - Step 14426: {'lr': 0.0004913550965783165, 'samples': 2769792, 'steps': 14425, 'loss/train': 1.8378410339355469} 11/06/2021 23:14:19 - INFO - __main__ - Step 14427: {'lr': 0.000491353713065993, 'samples': 2769984, 'steps': 14426, 'loss/train': 1.9022510051727295} 11/06/2021 23:14:19 - INFO - __main__ - Step 14428: {'lr': 0.0004913523294449193, 'samples': 2770176, 'steps': 14427, 'loss/train': 1.5578607320785522} 11/06/2021 23:14:20 - INFO - __main__ - Step 14429: {'lr': 0.0004913509457150959, 'samples': 2770368, 'steps': 14428, 'loss/train': 1.9274928569793701} 11/06/2021 23:14:20 - INFO - __main__ - Step 14430: {'lr': 0.0004913495618765235, 'samples': 2770560, 'steps': 14429, 'loss/train': 1.6800792217254639} 11/06/2021 23:14:20 - INFO - __main__ - Step 14431: {'lr': 0.0004913481779292027, 'samples': 2770752, 'steps': 14430, 'loss/train': 1.7277616262435913} 11/06/2021 23:14:21 - INFO - __main__ - Step 14432: {'lr': 0.0004913467938731341, 'samples': 2770944, 'steps': 14431, 'loss/train': 1.655822992324829} 11/06/2021 23:14:22 - INFO - __main__ - Step 14433: {'lr': 0.0004913454097083185, 'samples': 2771136, 'steps': 14432, 'loss/train': 1.733009696006775} 11/06/2021 23:14:22 - INFO - __main__ - Step 14434: {'lr': 0.0004913440254347563, 'samples': 2771328, 'steps': 14433, 'loss/train': 2.231706380844116} 11/06/2021 23:14:23 - INFO - __main__ - Step 14435: {'lr': 0.0004913426410524482, 'samples': 2771520, 'steps': 14434, 'loss/train': 0.5304341316223145} 11/06/2021 23:14:23 - INFO - __main__ - Step 14436: {'lr': 0.0004913412565613948, 'samples': 2771712, 'steps': 14435, 'loss/train': 1.5568188428878784} 11/06/2021 23:14:23 - INFO - __main__ - Step 14437: {'lr': 0.0004913398719615968, 'samples': 2771904, 'steps': 14436, 'loss/train': 1.682983160018921} 11/06/2021 23:14:24 - INFO - __main__ - Step 14438: {'lr': 0.0004913384872530548, 'samples': 2772096, 'steps': 14437, 'loss/train': 1.6241941452026367} 11/06/2021 23:14:25 - INFO - __main__ - Step 14439: {'lr': 0.0004913371024357694, 'samples': 2772288, 'steps': 14438, 'loss/train': 1.7349668741226196} 11/06/2021 23:14:25 - INFO - __main__ - Step 14440: {'lr': 0.0004913357175097412, 'samples': 2772480, 'steps': 14439, 'loss/train': 2.172203302383423} 11/06/2021 23:14:25 - INFO - __main__ - Step 14441: {'lr': 0.0004913343324749708, 'samples': 2772672, 'steps': 14440, 'loss/train': 0.5017638206481934} 11/06/2021 23:14:26 - INFO - __main__ - Step 14442: {'lr': 0.000491332947331459, 'samples': 2772864, 'steps': 14441, 'loss/train': 1.8418453931808472} 11/06/2021 23:14:27 - INFO - __main__ - Step 14443: {'lr': 0.0004913315620792061, 'samples': 2773056, 'steps': 14442, 'loss/train': 1.784220576286316} 11/06/2021 23:14:27 - INFO - __main__ - Step 14444: {'lr': 0.0004913301767182131, 'samples': 2773248, 'steps': 14443, 'loss/train': 1.6641216278076172} 11/06/2021 23:14:27 - INFO - __main__ - Step 14445: {'lr': 0.0004913287912484804, 'samples': 2773440, 'steps': 14444, 'loss/train': 1.1389085054397583} 11/06/2021 23:14:28 - INFO - __main__ - Step 14446: {'lr': 0.0004913274056700087, 'samples': 2773632, 'steps': 14445, 'loss/train': 1.5397385358810425} 11/06/2021 23:14:28 - INFO - __main__ - Step 14447: {'lr': 0.0004913260199827986, 'samples': 2773824, 'steps': 14446, 'loss/train': 2.1198136806488037} 11/06/2021 23:14:29 - INFO - __main__ - Step 14448: {'lr': 0.0004913246341868506, 'samples': 2774016, 'steps': 14447, 'loss/train': 1.6462889909744263} 11/06/2021 23:14:30 - INFO - __main__ - Step 14449: {'lr': 0.0004913232482821656, 'samples': 2774208, 'steps': 14448, 'loss/train': 1.6559960842132568} 11/06/2021 23:14:30 - INFO - __main__ - Step 14450: {'lr': 0.0004913218622687439, 'samples': 2774400, 'steps': 14449, 'loss/train': 1.4591246843338013} 11/06/2021 23:14:30 - INFO - __main__ - Step 14451: {'lr': 0.0004913204761465864, 'samples': 2774592, 'steps': 14450, 'loss/train': 1.9061264991760254} 11/06/2021 23:14:31 - INFO - __main__ - Step 14452: {'lr': 0.0004913190899156936, 'samples': 2774784, 'steps': 14451, 'loss/train': 1.592616081237793} 11/06/2021 23:14:31 - INFO - __main__ - Step 14453: {'lr': 0.0004913177035760661, 'samples': 2774976, 'steps': 14452, 'loss/train': 1.2767666578292847} 11/06/2021 23:14:32 - INFO - __main__ - Step 14454: {'lr': 0.0004913163171277046, 'samples': 2775168, 'steps': 14453, 'loss/train': 1.2585607767105103} 11/06/2021 23:14:32 - INFO - __main__ - Step 14455: {'lr': 0.0004913149305706097, 'samples': 2775360, 'steps': 14454, 'loss/train': 1.6764591932296753} 11/06/2021 23:14:33 - INFO - __main__ - Step 14456: {'lr': 0.0004913135439047821, 'samples': 2775552, 'steps': 14455, 'loss/train': 1.7716397047042847} 11/06/2021 23:14:33 - INFO - __main__ - Step 14457: {'lr': 0.0004913121571302222, 'samples': 2775744, 'steps': 14456, 'loss/train': 1.690000057220459} 11/06/2021 23:14:33 - INFO - __main__ - Step 14458: {'lr': 0.0004913107702469308, 'samples': 2775936, 'steps': 14457, 'loss/train': 1.5991852283477783} 11/06/2021 23:14:34 - INFO - __main__ - Step 14459: {'lr': 0.0004913093832549085, 'samples': 2776128, 'steps': 14458, 'loss/train': 1.8135044574737549} 11/06/2021 23:14:35 - INFO - __main__ - Step 14460: {'lr': 0.000491307996154156, 'samples': 2776320, 'steps': 14459, 'loss/train': 1.87995445728302} 11/06/2021 23:14:35 - INFO - __main__ - Step 14461: {'lr': 0.0004913066089446737, 'samples': 2776512, 'steps': 14460, 'loss/train': 1.2176554203033447} 11/06/2021 23:14:35 - INFO - __main__ - Step 14462: {'lr': 0.0004913052216264624, 'samples': 2776704, 'steps': 14461, 'loss/train': 1.4758261442184448} 11/06/2021 23:14:36 - INFO - __main__ - Step 14463: {'lr': 0.0004913038341995227, 'samples': 2776896, 'steps': 14462, 'loss/train': 1.7246683835983276} 11/06/2021 23:14:37 - INFO - __main__ - Step 14464: {'lr': 0.0004913024466638553, 'samples': 2777088, 'steps': 14463, 'loss/train': 1.7339403629302979} 11/06/2021 23:14:37 - INFO - __main__ - Step 14465: {'lr': 0.0004913010590194607, 'samples': 2777280, 'steps': 14464, 'loss/train': 1.6634852886199951} 11/06/2021 23:14:37 - INFO - __main__ - Step 14466: {'lr': 0.0004912996712663396, 'samples': 2777472, 'steps': 14465, 'loss/train': 1.24350106716156} 11/06/2021 23:14:38 - INFO - __main__ - Step 14467: {'lr': 0.0004912982834044924, 'samples': 2777664, 'steps': 14466, 'loss/train': 1.8824312686920166} 11/06/2021 23:14:38 - INFO - __main__ - Step 14468: {'lr': 0.0004912968954339202, 'samples': 2777856, 'steps': 14467, 'loss/train': 1.555679202079773} 11/06/2021 23:14:39 - INFO - __main__ - Step 14469: {'lr': 0.0004912955073546231, 'samples': 2778048, 'steps': 14468, 'loss/train': 1.743302345275879} 11/06/2021 23:14:40 - INFO - __main__ - Step 14470: {'lr': 0.0004912941191666021, 'samples': 2778240, 'steps': 14469, 'loss/train': 1.9434269666671753} 11/06/2021 23:14:40 - INFO - __main__ - Step 14471: {'lr': 0.0004912927308698576, 'samples': 2778432, 'steps': 14470, 'loss/train': 1.7462414503097534} 11/06/2021 23:14:40 - INFO - __main__ - Step 14472: {'lr': 0.0004912913424643904, 'samples': 2778624, 'steps': 14471, 'loss/train': 1.8691980838775635} 11/06/2021 23:14:41 - INFO - __main__ - Step 14473: {'lr': 0.0004912899539502011, 'samples': 2778816, 'steps': 14472, 'loss/train': 1.504024624824524} 11/06/2021 23:14:42 - INFO - __main__ - Step 14474: {'lr': 0.0004912885653272902, 'samples': 2779008, 'steps': 14473, 'loss/train': 0.9449915289878845} 11/06/2021 23:14:42 - INFO - __main__ - Step 14475: {'lr': 0.0004912871765956583, 'samples': 2779200, 'steps': 14474, 'loss/train': 1.7823172807693481} 11/06/2021 23:14:43 - INFO - __main__ - Step 14476: {'lr': 0.0004912857877553062, 'samples': 2779392, 'steps': 14475, 'loss/train': 1.6257444620132446} 11/06/2021 23:14:43 - INFO - __main__ - Step 14477: {'lr': 0.0004912843988062345, 'samples': 2779584, 'steps': 14476, 'loss/train': 1.8921669721603394} 11/06/2021 23:14:43 - INFO - __main__ - Step 14478: {'lr': 0.0004912830097484437, 'samples': 2779776, 'steps': 14477, 'loss/train': 1.9633761644363403} 11/06/2021 23:14:44 - INFO - __main__ - Step 14479: {'lr': 0.0004912816205819346, 'samples': 2779968, 'steps': 14478, 'loss/train': 1.5476384162902832} 11/06/2021 23:14:45 - INFO - __main__ - Step 14480: {'lr': 0.0004912802313067076, 'samples': 2780160, 'steps': 14479, 'loss/train': 1.160788893699646} 11/06/2021 23:14:45 - INFO - __main__ - Step 14481: {'lr': 0.0004912788419227635, 'samples': 2780352, 'steps': 14480, 'loss/train': 1.3065866231918335} 11/06/2021 23:14:45 - INFO - __main__ - Step 14482: {'lr': 0.000491277452430103, 'samples': 2780544, 'steps': 14481, 'loss/train': 1.6148895025253296} 11/06/2021 23:14:46 - INFO - __main__ - Step 14483: {'lr': 0.0004912760628287264, 'samples': 2780736, 'steps': 14482, 'loss/train': 0.9406113028526306} 11/06/2021 23:14:46 - INFO - __main__ - Step 14484: {'lr': 0.0004912746731186346, 'samples': 2780928, 'steps': 14483, 'loss/train': 1.4715304374694824} 11/06/2021 23:14:48 - INFO - __main__ - Step 14485: {'lr': 0.0004912732832998281, 'samples': 2781120, 'steps': 14484, 'loss/train': 1.742849349975586} 11/06/2021 23:14:48 - INFO - __main__ - Step 14486: {'lr': 0.0004912718933723077, 'samples': 2781312, 'steps': 14485, 'loss/train': 1.7313225269317627} 11/06/2021 23:14:48 - INFO - __main__ - Step 14487: {'lr': 0.0004912705033360738, 'samples': 2781504, 'steps': 14486, 'loss/train': 1.3565199375152588} 11/06/2021 23:14:49 - INFO - __main__ - Step 14488: {'lr': 0.0004912691131911272, 'samples': 2781696, 'steps': 14487, 'loss/train': 1.073344111442566} 11/06/2021 23:14:49 - INFO - __main__ - Step 14489: {'lr': 0.0004912677229374684, 'samples': 2781888, 'steps': 14488, 'loss/train': 1.5861188173294067} 11/06/2021 23:14:50 - INFO - __main__ - Step 14490: {'lr': 0.0004912663325750982, 'samples': 2782080, 'steps': 14489, 'loss/train': 1.8165336847305298} 11/06/2021 23:14:50 - INFO - __main__ - Step 14491: {'lr': 0.000491264942104017, 'samples': 2782272, 'steps': 14490, 'loss/train': 0.944426953792572} 11/06/2021 23:14:51 - INFO - __main__ - Step 14492: {'lr': 0.0004912635515242257, 'samples': 2782464, 'steps': 14491, 'loss/train': 0.730577826499939} 11/06/2021 23:14:51 - INFO - __main__ - Step 14493: {'lr': 0.0004912621608357246, 'samples': 2782656, 'steps': 14492, 'loss/train': 2.3794353008270264} 11/06/2021 23:14:52 - INFO - __main__ - Step 14494: {'lr': 0.0004912607700385146, 'samples': 2782848, 'steps': 14493, 'loss/train': 1.526907205581665} 11/06/2021 23:14:52 - INFO - __main__ - Step 14495: {'lr': 0.0004912593791325962, 'samples': 2783040, 'steps': 14494, 'loss/train': 0.6017135977745056} 11/06/2021 23:14:52 - INFO - __main__ - Step 14496: {'lr': 0.00049125798811797, 'samples': 2783232, 'steps': 14495, 'loss/train': 1.5702513456344604} 11/06/2021 23:14:53 - INFO - __main__ - Step 14497: {'lr': 0.0004912565969946367, 'samples': 2783424, 'steps': 14496, 'loss/train': 1.7380554676055908} 11/06/2021 23:14:54 - INFO - __main__ - Step 14498: {'lr': 0.0004912552057625969, 'samples': 2783616, 'steps': 14497, 'loss/train': 1.791672706604004} 11/06/2021 23:14:54 - INFO - __main__ - Step 14499: {'lr': 0.0004912538144218512, 'samples': 2783808, 'steps': 14498, 'loss/train': 1.8101961612701416} 11/06/2021 23:14:54 - INFO - __main__ - Step 14500: {'lr': 0.0004912524229724002, 'samples': 2784000, 'steps': 14499, 'loss/train': 1.174378752708435} 11/06/2021 23:14:55 - INFO - __main__ - Step 14501: {'lr': 0.0004912510314142447, 'samples': 2784192, 'steps': 14500, 'loss/train': 1.9984477758407593} 11/06/2021 23:14:56 - INFO - __main__ - Step 14502: {'lr': 0.0004912496397473852, 'samples': 2784384, 'steps': 14501, 'loss/train': 2.0241353511810303} 11/06/2021 23:14:56 - INFO - __main__ - Step 14503: {'lr': 0.0004912482479718223, 'samples': 2784576, 'steps': 14502, 'loss/train': 1.441446304321289} 11/06/2021 23:14:56 - INFO - __main__ - Step 14504: {'lr': 0.0004912468560875566, 'samples': 2784768, 'steps': 14503, 'loss/train': 1.7977668046951294} 11/06/2021 23:14:57 - INFO - __main__ - Step 14505: {'lr': 0.0004912454640945889, 'samples': 2784960, 'steps': 14504, 'loss/train': 1.8055917024612427} 11/06/2021 23:14:57 - INFO - __main__ - Step 14506: {'lr': 0.0004912440719929196, 'samples': 2785152, 'steps': 14505, 'loss/train': 1.3446134328842163} 11/06/2021 23:14:58 - INFO - __main__ - Step 14507: {'lr': 0.0004912426797825495, 'samples': 2785344, 'steps': 14506, 'loss/train': 1.7469252347946167} 11/06/2021 23:14:59 - INFO - __main__ - Step 14508: {'lr': 0.0004912412874634792, 'samples': 2785536, 'steps': 14507, 'loss/train': 1.3988752365112305} 11/06/2021 23:14:59 - INFO - __main__ - Step 14509: {'lr': 0.0004912398950357094, 'samples': 2785728, 'steps': 14508, 'loss/train': 2.2110788822174072} 11/06/2021 23:14:59 - INFO - __main__ - Step 14510: {'lr': 0.0004912385024992404, 'samples': 2785920, 'steps': 14509, 'loss/train': 1.4885571002960205} 11/06/2021 23:15:00 - INFO - __main__ - Step 14511: {'lr': 0.0004912371098540733, 'samples': 2786112, 'steps': 14510, 'loss/train': 2.1764931678771973} 11/06/2021 23:15:01 - INFO - __main__ - Step 14512: {'lr': 0.0004912357171002082, 'samples': 2786304, 'steps': 14511, 'loss/train': 1.1966614723205566} 11/06/2021 23:15:01 - INFO - __main__ - Step 14513: {'lr': 0.0004912343242376462, 'samples': 2786496, 'steps': 14512, 'loss/train': 1.392907738685608} 11/06/2021 23:15:01 - INFO - __main__ - Step 14514: {'lr': 0.0004912329312663877, 'samples': 2786688, 'steps': 14513, 'loss/train': 1.479630947113037} 11/06/2021 23:15:02 - INFO - __main__ - Step 14515: {'lr': 0.0004912315381864333, 'samples': 2786880, 'steps': 14514, 'loss/train': 1.7795805931091309} 11/06/2021 23:15:02 - INFO - __main__ - Step 14516: {'lr': 0.0004912301449977837, 'samples': 2787072, 'steps': 14515, 'loss/train': 1.7214689254760742} 11/06/2021 23:15:02 - INFO - __main__ - Step 14517: {'lr': 0.0004912287517004397, 'samples': 2787264, 'steps': 14516, 'loss/train': 1.5596206188201904} 11/06/2021 23:15:03 - INFO - __main__ - Step 14518: {'lr': 0.0004912273582944015, 'samples': 2787456, 'steps': 14517, 'loss/train': 1.3602560758590698} 11/06/2021 23:15:04 - INFO - __main__ - Step 14519: {'lr': 0.0004912259647796701, 'samples': 2787648, 'steps': 14518, 'loss/train': 1.7231316566467285} 11/06/2021 23:15:04 - INFO - __main__ - Step 14520: {'lr': 0.000491224571156246, 'samples': 2787840, 'steps': 14519, 'loss/train': 1.2232226133346558} 11/06/2021 23:15:05 - INFO - __main__ - Step 14521: {'lr': 0.0004912231774241298, 'samples': 2788032, 'steps': 14520, 'loss/train': 1.797451138496399} 11/06/2021 23:15:05 - INFO - __main__ - Step 14522: {'lr': 0.0004912217835833222, 'samples': 2788224, 'steps': 14521, 'loss/train': 0.9428983926773071} 11/06/2021 23:15:06 - INFO - __main__ - Step 14523: {'lr': 0.0004912203896338238, 'samples': 2788416, 'steps': 14522, 'loss/train': 1.4348275661468506} 11/06/2021 23:15:06 - INFO - __main__ - Step 14524: {'lr': 0.0004912189955756351, 'samples': 2788608, 'steps': 14523, 'loss/train': 1.5228583812713623} 11/06/2021 23:15:07 - INFO - __main__ - Step 14525: {'lr': 0.000491217601408757, 'samples': 2788800, 'steps': 14524, 'loss/train': 1.8126468658447266} 11/06/2021 23:15:07 - INFO - __main__ - Step 14526: {'lr': 0.0004912162071331898, 'samples': 2788992, 'steps': 14525, 'loss/train': 1.6764172315597534} 11/06/2021 23:15:07 - INFO - __main__ - Step 14527: {'lr': 0.0004912148127489345, 'samples': 2789184, 'steps': 14526, 'loss/train': 1.4271752834320068} 11/06/2021 23:15:08 - INFO - __main__ - Step 14528: {'lr': 0.0004912134182559913, 'samples': 2789376, 'steps': 14527, 'loss/train': 1.7659759521484375} 11/06/2021 23:15:09 - INFO - __main__ - Step 14529: {'lr': 0.0004912120236543611, 'samples': 2789568, 'steps': 14528, 'loss/train': 1.7609317302703857} 11/06/2021 23:15:09 - INFO - __main__ - Step 14530: {'lr': 0.0004912106289440446, 'samples': 2789760, 'steps': 14529, 'loss/train': 1.904273271560669} 11/06/2021 23:15:09 - INFO - __main__ - Step 14531: {'lr': 0.0004912092341250422, 'samples': 2789952, 'steps': 14530, 'loss/train': 1.7561595439910889} 11/06/2021 23:15:10 - INFO - __main__ - Step 14532: {'lr': 0.0004912078391973547, 'samples': 2790144, 'steps': 14531, 'loss/train': 1.3714295625686646} 11/06/2021 23:15:11 - INFO - __main__ - Step 14533: {'lr': 0.0004912064441609827, 'samples': 2790336, 'steps': 14532, 'loss/train': 1.4325453042984009} 11/06/2021 23:15:11 - INFO - __main__ - Step 14534: {'lr': 0.0004912050490159268, 'samples': 2790528, 'steps': 14533, 'loss/train': 1.7847373485565186} 11/06/2021 23:15:11 - INFO - __main__ - Step 14535: {'lr': 0.0004912036537621877, 'samples': 2790720, 'steps': 14534, 'loss/train': 2.0088417530059814} 11/06/2021 23:15:12 - INFO - __main__ - Step 14536: {'lr': 0.0004912022583997658, 'samples': 2790912, 'steps': 14535, 'loss/train': 1.3345286846160889} 11/06/2021 23:15:12 - INFO - __main__ - Step 14537: {'lr': 0.0004912008629286619, 'samples': 2791104, 'steps': 14536, 'loss/train': 1.6118730306625366} 11/06/2021 23:15:12 - INFO - __main__ - Step 14538: {'lr': 0.0004911994673488766, 'samples': 2791296, 'steps': 14537, 'loss/train': 1.377058982849121} 11/06/2021 23:15:14 - INFO - __main__ - Step 14539: {'lr': 0.0004911980716604107, 'samples': 2791488, 'steps': 14538, 'loss/train': 1.6407101154327393} 11/06/2021 23:15:14 - INFO - __main__ - Step 14540: {'lr': 0.0004911966758632645, 'samples': 2791680, 'steps': 14539, 'loss/train': 1.8519309759140015} 11/06/2021 23:15:14 - INFO - __main__ - Step 14541: {'lr': 0.000491195279957439, 'samples': 2791872, 'steps': 14540, 'loss/train': 1.6462745666503906} 11/06/2021 23:15:15 - INFO - __main__ - Step 14542: {'lr': 0.0004911938839429344, 'samples': 2792064, 'steps': 14541, 'loss/train': 1.8177210092544556} 11/06/2021 23:15:15 - INFO - __main__ - Step 14543: {'lr': 0.0004911924878197517, 'samples': 2792256, 'steps': 14542, 'loss/train': 2.063103437423706} 11/06/2021 23:15:16 - INFO - __main__ - Step 14544: {'lr': 0.0004911910915878913, 'samples': 2792448, 'steps': 14543, 'loss/train': 2.019076347351074} 11/06/2021 23:15:16 - INFO - __main__ - Step 14545: {'lr': 0.000491189695247354, 'samples': 2792640, 'steps': 14544, 'loss/train': 1.723437786102295} 11/06/2021 23:15:17 - INFO - __main__ - Step 14546: {'lr': 0.0004911882987981404, 'samples': 2792832, 'steps': 14545, 'loss/train': 1.8162659406661987} 11/06/2021 23:15:17 - INFO - __main__ - Step 14547: {'lr': 0.0004911869022402508, 'samples': 2793024, 'steps': 14546, 'loss/train': 1.8774888515472412} 11/06/2021 23:15:17 - INFO - __main__ - Step 14548: {'lr': 0.0004911855055736863, 'samples': 2793216, 'steps': 14547, 'loss/train': 1.6280256509780884} 11/06/2021 23:15:18 - INFO - __main__ - Step 14549: {'lr': 0.0004911841087984473, 'samples': 2793408, 'steps': 14548, 'loss/train': 1.9488242864608765} 11/06/2021 23:15:19 - INFO - __main__ - Step 14550: {'lr': 0.0004911827119145345, 'samples': 2793600, 'steps': 14549, 'loss/train': 2.1403818130493164} 11/06/2021 23:15:19 - INFO - __main__ - Step 14551: {'lr': 0.0004911813149219485, 'samples': 2793792, 'steps': 14550, 'loss/train': 1.7999236583709717} 11/06/2021 23:15:19 - INFO - __main__ - Step 14552: {'lr': 0.0004911799178206899, 'samples': 2793984, 'steps': 14551, 'loss/train': 1.94169020652771} 11/06/2021 23:15:20 - INFO - __main__ - Step 14553: {'lr': 0.0004911785206107592, 'samples': 2794176, 'steps': 14552, 'loss/train': 1.5757359266281128} 11/06/2021 23:15:21 - INFO - __main__ - Step 14554: {'lr': 0.0004911771232921575, 'samples': 2794368, 'steps': 14553, 'loss/train': 1.7550246715545654} 11/06/2021 23:15:21 - INFO - __main__ - Step 14555: {'lr': 0.0004911757258648849, 'samples': 2794560, 'steps': 14554, 'loss/train': 1.0412113666534424} 11/06/2021 23:15:22 - INFO - __main__ - Step 14556: {'lr': 0.0004911743283289423, 'samples': 2794752, 'steps': 14555, 'loss/train': 1.524915337562561} 11/06/2021 23:15:22 - INFO - __main__ - Step 14557: {'lr': 0.0004911729306843302, 'samples': 2794944, 'steps': 14556, 'loss/train': 1.9681403636932373} 11/06/2021 23:15:23 - INFO - __main__ - Step 14558: {'lr': 0.0004911715329310493, 'samples': 2795136, 'steps': 14557, 'loss/train': 1.4831455945968628} 11/06/2021 23:15:23 - INFO - __main__ - Step 14559: {'lr': 0.0004911701350691002, 'samples': 2795328, 'steps': 14558, 'loss/train': 0.9031453132629395} 11/06/2021 23:15:24 - INFO - __main__ - Step 14560: {'lr': 0.0004911687370984836, 'samples': 2795520, 'steps': 14559, 'loss/train': 1.7917287349700928} 11/06/2021 23:15:24 - INFO - __main__ - Step 14561: {'lr': 0.0004911673390192002, 'samples': 2795712, 'steps': 14560, 'loss/train': 1.3398245573043823} 11/06/2021 23:15:25 - INFO - __main__ - Step 14562: {'lr': 0.0004911659408312505, 'samples': 2795904, 'steps': 14561, 'loss/train': 1.7333766222000122} 11/06/2021 23:15:25 - INFO - __main__ - Step 14563: {'lr': 0.000491164542534635, 'samples': 2796096, 'steps': 14562, 'loss/train': 1.9331468343734741} 11/06/2021 23:15:25 - INFO - __main__ - Step 14564: {'lr': 0.0004911631441293546, 'samples': 2796288, 'steps': 14563, 'loss/train': 1.3357164859771729} 11/06/2021 23:15:26 - INFO - __main__ - Step 14565: {'lr': 0.0004911617456154097, 'samples': 2796480, 'steps': 14564, 'loss/train': 1.6416891813278198} 11/06/2021 23:15:27 - INFO - __main__ - Step 14566: {'lr': 0.0004911603469928012, 'samples': 2796672, 'steps': 14565, 'loss/train': 2.241854667663574} 11/06/2021 23:15:27 - INFO - __main__ - Step 14567: {'lr': 0.0004911589482615294, 'samples': 2796864, 'steps': 14566, 'loss/train': 0.8703035116195679} 11/06/2021 23:15:27 - INFO - __main__ - Step 14568: {'lr': 0.0004911575494215952, 'samples': 2797056, 'steps': 14567, 'loss/train': 1.3895127773284912} 11/06/2021 23:15:28 - INFO - __main__ - Step 14569: {'lr': 0.0004911561504729992, 'samples': 2797248, 'steps': 14568, 'loss/train': 3.218695878982544} 11/06/2021 23:15:28 - INFO - __main__ - Step 14570: {'lr': 0.0004911547514157417, 'samples': 2797440, 'steps': 14569, 'loss/train': 1.9162862300872803} 11/06/2021 23:15:29 - INFO - __main__ - Step 14571: {'lr': 0.0004911533522498239, 'samples': 2797632, 'steps': 14570, 'loss/train': 1.7870599031448364} 11/06/2021 23:15:29 - INFO - __main__ - Step 14572: {'lr': 0.0004911519529752459, 'samples': 2797824, 'steps': 14571, 'loss/train': 2.2179548740386963} 11/06/2021 23:15:30 - INFO - __main__ - Step 14573: {'lr': 0.0004911505535920086, 'samples': 2798016, 'steps': 14572, 'loss/train': 1.6157692670822144} 11/06/2021 23:15:30 - INFO - __main__ - Step 14574: {'lr': 0.0004911491541001126, 'samples': 2798208, 'steps': 14573, 'loss/train': 1.6204320192337036} 11/06/2021 23:15:30 - INFO - __main__ - Step 14575: {'lr': 0.0004911477544995585, 'samples': 2798400, 'steps': 14574, 'loss/train': 1.705611228942871} 11/06/2021 23:15:31 - INFO - __main__ - Step 14576: {'lr': 0.000491146354790347, 'samples': 2798592, 'steps': 14575, 'loss/train': 1.5545910596847534} 11/06/2021 23:15:32 - INFO - __main__ - Step 14577: {'lr': 0.0004911449549724786, 'samples': 2798784, 'steps': 14576, 'loss/train': 1.5566593408584595} 11/06/2021 23:15:32 - INFO - __main__ - Step 14578: {'lr': 0.0004911435550459541, 'samples': 2798976, 'steps': 14577, 'loss/train': 2.1471126079559326} 11/06/2021 23:15:32 - INFO - __main__ - Step 14579: {'lr': 0.0004911421550107739, 'samples': 2799168, 'steps': 14578, 'loss/train': 1.5826555490493774} 11/06/2021 23:15:33 - INFO - __main__ - Step 14580: {'lr': 0.0004911407548669389, 'samples': 2799360, 'steps': 14579, 'loss/train': 1.8544542789459229} 11/06/2021 23:15:34 - INFO - __main__ - Step 14581: {'lr': 0.0004911393546144495, 'samples': 2799552, 'steps': 14580, 'loss/train': 1.4082012176513672} 11/06/2021 23:15:34 - INFO - __main__ - Step 14582: {'lr': 0.0004911379542533065, 'samples': 2799744, 'steps': 14581, 'loss/train': 1.3058255910873413} 11/06/2021 23:15:35 - INFO - __main__ - Step 14583: {'lr': 0.0004911365537835105, 'samples': 2799936, 'steps': 14582, 'loss/train': 0.7232246398925781} 11/06/2021 23:15:35 - INFO - __main__ - Step 14584: {'lr': 0.000491135153205062, 'samples': 2800128, 'steps': 14583, 'loss/train': 1.6225132942199707} 11/06/2021 23:15:35 - INFO - __main__ - Step 14585: {'lr': 0.0004911337525179616, 'samples': 2800320, 'steps': 14584, 'loss/train': 1.6813801527023315} 11/06/2021 23:15:36 - INFO - __main__ - Step 14586: {'lr': 0.0004911323517222103, 'samples': 2800512, 'steps': 14585, 'loss/train': 1.8164918422698975} 11/06/2021 23:15:37 - INFO - __main__ - Step 14587: {'lr': 0.0004911309508178084, 'samples': 2800704, 'steps': 14586, 'loss/train': 1.225614309310913} 11/06/2021 23:15:37 - INFO - __main__ - Step 14588: {'lr': 0.0004911295498047565, 'samples': 2800896, 'steps': 14587, 'loss/train': 2.0302164554595947} 11/06/2021 23:15:37 - INFO - __main__ - Step 14589: {'lr': 0.0004911281486830554, 'samples': 2801088, 'steps': 14588, 'loss/train': 1.7795565128326416} 11/06/2021 23:15:38 - INFO - __main__ - Step 14590: {'lr': 0.0004911267474527058, 'samples': 2801280, 'steps': 14589, 'loss/train': 1.4832676649093628} 11/06/2021 23:15:39 - INFO - __main__ - Step 14591: {'lr': 0.000491125346113708, 'samples': 2801472, 'steps': 14590, 'loss/train': 1.647063136100769} 11/06/2021 23:15:39 - INFO - __main__ - Step 14592: {'lr': 0.000491123944666063, 'samples': 2801664, 'steps': 14591, 'loss/train': 1.9390480518341064} 11/06/2021 23:15:39 - INFO - __main__ - Step 14593: {'lr': 0.0004911225431097712, 'samples': 2801856, 'steps': 14592, 'loss/train': 1.7286971807479858} 11/06/2021 23:15:40 - INFO - __main__ - Step 14594: {'lr': 0.0004911211414448333, 'samples': 2802048, 'steps': 14593, 'loss/train': 1.6285250186920166} 11/06/2021 23:15:40 - INFO - __main__ - Step 14595: {'lr': 0.0004911197396712501, 'samples': 2802240, 'steps': 14594, 'loss/train': 1.946718692779541} 11/06/2021 23:15:41 - INFO - __main__ - Step 14596: {'lr': 0.0004911183377890218, 'samples': 2802432, 'steps': 14595, 'loss/train': 1.3001048564910889} 11/06/2021 23:15:42 - INFO - __main__ - Step 14597: {'lr': 0.0004911169357981496, 'samples': 2802624, 'steps': 14596, 'loss/train': 1.6624016761779785} 11/06/2021 23:15:42 - INFO - __main__ - Step 14598: {'lr': 0.0004911155336986335, 'samples': 2802816, 'steps': 14597, 'loss/train': 2.089146614074707} 11/06/2021 23:15:42 - INFO - __main__ - Step 14599: {'lr': 0.0004911141314904747, 'samples': 2803008, 'steps': 14598, 'loss/train': 1.3329648971557617} 11/06/2021 23:15:43 - INFO - __main__ - Step 14600: {'lr': 0.0004911127291736735, 'samples': 2803200, 'steps': 14599, 'loss/train': 1.4260157346725464} 11/06/2021 23:15:43 - INFO - __main__ - Step 14601: {'lr': 0.0004911113267482307, 'samples': 2803392, 'steps': 14600, 'loss/train': 1.7082431316375732} 11/06/2021 23:15:44 - INFO - __main__ - Step 14602: {'lr': 0.0004911099242141467, 'samples': 2803584, 'steps': 14601, 'loss/train': 1.4876233339309692} 11/06/2021 23:15:44 - INFO - __main__ - Step 14603: {'lr': 0.0004911085215714224, 'samples': 2803776, 'steps': 14602, 'loss/train': 1.9533902406692505} 11/06/2021 23:15:45 - INFO - __main__ - Step 14604: {'lr': 0.0004911071188200584, 'samples': 2803968, 'steps': 14603, 'loss/train': 2.090561866760254} 11/06/2021 23:15:45 - INFO - __main__ - Step 14605: {'lr': 0.0004911057159600551, 'samples': 2804160, 'steps': 14604, 'loss/train': 1.5187530517578125} 11/06/2021 23:15:45 - INFO - __main__ - Step 14606: {'lr': 0.0004911043129914133, 'samples': 2804352, 'steps': 14605, 'loss/train': 0.9737354516983032} 11/06/2021 23:15:46 - INFO - __main__ - Step 14607: {'lr': 0.0004911029099141336, 'samples': 2804544, 'steps': 14606, 'loss/train': 1.8305327892303467} 11/06/2021 23:15:47 - INFO - __main__ - Step 14608: {'lr': 0.0004911015067282168, 'samples': 2804736, 'steps': 14607, 'loss/train': 1.8033746480941772} 11/06/2021 23:15:47 - INFO - __main__ - Step 14609: {'lr': 0.0004911001034336633, 'samples': 2804928, 'steps': 14608, 'loss/train': 1.6968247890472412} 11/06/2021 23:15:47 - INFO - __main__ - Step 14610: {'lr': 0.0004910987000304737, 'samples': 2805120, 'steps': 14609, 'loss/train': 1.5940757989883423} 11/06/2021 23:15:48 - INFO - __main__ - Step 14611: {'lr': 0.0004910972965186488, 'samples': 2805312, 'steps': 14610, 'loss/train': 1.7843865156173706} 11/06/2021 23:15:48 - INFO - __main__ - Step 14612: {'lr': 0.0004910958928981893, 'samples': 2805504, 'steps': 14611, 'loss/train': 1.8916642665863037} 11/06/2021 23:15:49 - INFO - __main__ - Step 14613: {'lr': 0.0004910944891690956, 'samples': 2805696, 'steps': 14612, 'loss/train': 1.5665946006774902} 11/06/2021 23:15:49 - INFO - __main__ - Step 14614: {'lr': 0.0004910930853313686, 'samples': 2805888, 'steps': 14613, 'loss/train': 1.1098638772964478} 11/06/2021 23:15:50 - INFO - __main__ - Step 14615: {'lr': 0.0004910916813850086, 'samples': 2806080, 'steps': 14614, 'loss/train': 1.3663274049758911} 11/06/2021 23:15:50 - INFO - __main__ - Step 14616: {'lr': 0.0004910902773300164, 'samples': 2806272, 'steps': 14615, 'loss/train': 1.2368197441101074} 11/06/2021 23:15:51 - INFO - __main__ - Step 14617: {'lr': 0.0004910888731663928, 'samples': 2806464, 'steps': 14616, 'loss/train': 1.519328236579895} 11/06/2021 23:15:52 - INFO - __main__ - Step 14618: {'lr': 0.0004910874688941381, 'samples': 2806656, 'steps': 14617, 'loss/train': 1.4356929063796997} 11/06/2021 23:15:52 - INFO - __main__ - Step 14619: {'lr': 0.0004910860645132532, 'samples': 2806848, 'steps': 14618, 'loss/train': 1.3084660768508911} 11/06/2021 23:15:52 - INFO - __main__ - Step 14620: {'lr': 0.0004910846600237386, 'samples': 2807040, 'steps': 14619, 'loss/train': 1.374779224395752} 11/06/2021 23:15:53 - INFO - __main__ - Step 14621: {'lr': 0.0004910832554255951, 'samples': 2807232, 'steps': 14620, 'loss/train': 1.7540532350540161} 11/06/2021 23:15:53 - INFO - __main__ - Step 14622: {'lr': 0.0004910818507188231, 'samples': 2807424, 'steps': 14621, 'loss/train': 1.5536149740219116} 11/06/2021 23:15:54 - INFO - __main__ - Step 14623: {'lr': 0.0004910804459034233, 'samples': 2807616, 'steps': 14622, 'loss/train': 1.8700112104415894} 11/06/2021 23:15:55 - INFO - __main__ - Step 14624: {'lr': 0.0004910790409793965, 'samples': 2807808, 'steps': 14623, 'loss/train': 1.8716275691986084} 11/06/2021 23:15:55 - INFO - __main__ - Step 14625: {'lr': 0.000491077635946743, 'samples': 2808000, 'steps': 14624, 'loss/train': 1.835631012916565} 11/06/2021 23:15:55 - INFO - __main__ - Step 14626: {'lr': 0.0004910762308054638, 'samples': 2808192, 'steps': 14625, 'loss/train': 1.5419883728027344} 11/06/2021 23:15:56 - INFO - __main__ - Step 14627: {'lr': 0.0004910748255555593, 'samples': 2808384, 'steps': 14626, 'loss/train': 0.7711548805236816} 11/06/2021 23:15:56 - INFO - __main__ - Step 14628: {'lr': 0.0004910734201970302, 'samples': 2808576, 'steps': 14627, 'loss/train': 1.3061710596084595} 11/06/2021 23:15:57 - INFO - __main__ - Step 14629: {'lr': 0.0004910720147298772, 'samples': 2808768, 'steps': 14628, 'loss/train': 1.5132771730422974} 11/06/2021 23:15:58 - INFO - __main__ - Step 14630: {'lr': 0.0004910706091541009, 'samples': 2808960, 'steps': 14629, 'loss/train': 1.1088963747024536} 11/06/2021 23:15:58 - INFO - __main__ - Step 14631: {'lr': 0.0004910692034697018, 'samples': 2809152, 'steps': 14630, 'loss/train': 1.5924564599990845} 11/06/2021 23:15:58 - INFO - __main__ - Step 14632: {'lr': 0.0004910677976766807, 'samples': 2809344, 'steps': 14631, 'loss/train': 1.2352012395858765} 11/06/2021 23:15:59 - INFO - __main__ - Step 14633: {'lr': 0.0004910663917750382, 'samples': 2809536, 'steps': 14632, 'loss/train': 1.8392850160598755} 11/06/2021 23:15:59 - INFO - __main__ - Step 14634: {'lr': 0.0004910649857647748, 'samples': 2809728, 'steps': 14633, 'loss/train': 1.6638622283935547} 11/06/2021 23:16:00 - INFO - __main__ - Step 14635: {'lr': 0.0004910635796458913, 'samples': 2809920, 'steps': 14634, 'loss/train': 2.037661075592041} 11/06/2021 23:16:00 - INFO - __main__ - Step 14636: {'lr': 0.0004910621734183882, 'samples': 2810112, 'steps': 14635, 'loss/train': 0.8195475935935974} 11/06/2021 23:16:01 - INFO - __main__ - Step 14637: {'lr': 0.0004910607670822663, 'samples': 2810304, 'steps': 14636, 'loss/train': 1.5160691738128662} 11/06/2021 23:16:01 - INFO - __main__ - Step 14638: {'lr': 0.0004910593606375261, 'samples': 2810496, 'steps': 14637, 'loss/train': 2.147937297821045} 11/06/2021 23:16:02 - INFO - __main__ - Step 14639: {'lr': 0.0004910579540841683, 'samples': 2810688, 'steps': 14638, 'loss/train': 0.8111184239387512} 11/06/2021 23:16:03 - INFO - __main__ - Step 14640: {'lr': 0.0004910565474221934, 'samples': 2810880, 'steps': 14639, 'loss/train': 1.6241923570632935} 11/06/2021 23:16:03 - INFO - __main__ - Step 14641: {'lr': 0.0004910551406516022, 'samples': 2811072, 'steps': 14640, 'loss/train': 2.166555404663086} 11/06/2021 23:16:03 - INFO - __main__ - Step 14642: {'lr': 0.0004910537337723954, 'samples': 2811264, 'steps': 14641, 'loss/train': 1.8136534690856934} 11/06/2021 23:16:04 - INFO - __main__ - Step 14643: {'lr': 0.0004910523267845733, 'samples': 2811456, 'steps': 14642, 'loss/train': 1.5173137187957764} 11/06/2021 23:16:04 - INFO - __main__ - Step 14644: {'lr': 0.0004910509196881369, 'samples': 2811648, 'steps': 14643, 'loss/train': 1.1134371757507324} 11/06/2021 23:16:05 - INFO - __main__ - Step 14645: {'lr': 0.0004910495124830866, 'samples': 2811840, 'steps': 14644, 'loss/train': 1.4934502840042114} 11/06/2021 23:16:05 - INFO - __main__ - Step 14646: {'lr': 0.0004910481051694231, 'samples': 2812032, 'steps': 14645, 'loss/train': 1.9416497945785522} 11/06/2021 23:16:06 - INFO - __main__ - Step 14647: {'lr': 0.0004910466977471471, 'samples': 2812224, 'steps': 14646, 'loss/train': 1.6827425956726074} 11/06/2021 23:16:06 - INFO - __main__ - Step 14648: {'lr': 0.0004910452902162592, 'samples': 2812416, 'steps': 14647, 'loss/train': 1.602482795715332} 11/06/2021 23:16:06 - INFO - __main__ - Step 14649: {'lr': 0.0004910438825767599, 'samples': 2812608, 'steps': 14648, 'loss/train': 1.5670403242111206} 11/06/2021 23:16:08 - INFO - __main__ - Step 14650: {'lr': 0.00049104247482865, 'samples': 2812800, 'steps': 14649, 'loss/train': 1.4646493196487427} 11/06/2021 23:16:08 - INFO - __main__ - Step 14651: {'lr': 0.0004910410669719301, 'samples': 2812992, 'steps': 14650, 'loss/train': 1.5942716598510742} 11/06/2021 23:16:08 - INFO - __main__ - Step 14652: {'lr': 0.0004910396590066008, 'samples': 2813184, 'steps': 14651, 'loss/train': 1.6040905714035034} 11/06/2021 23:16:09 - INFO - __main__ - Step 14653: {'lr': 0.0004910382509326627, 'samples': 2813376, 'steps': 14652, 'loss/train': 2.0839693546295166} 11/06/2021 23:16:09 - INFO - __main__ - Step 14654: {'lr': 0.0004910368427501166, 'samples': 2813568, 'steps': 14653, 'loss/train': 1.966593861579895} 11/06/2021 23:16:10 - INFO - __main__ - Step 14655: {'lr': 0.000491035434458963, 'samples': 2813760, 'steps': 14654, 'loss/train': 2.07354998588562} 11/06/2021 23:16:10 - INFO - __main__ - Step 14656: {'lr': 0.0004910340260592024, 'samples': 2813952, 'steps': 14655, 'loss/train': 1.6633130311965942} 11/06/2021 23:16:11 - INFO - __main__ - Step 14657: {'lr': 0.0004910326175508357, 'samples': 2814144, 'steps': 14656, 'loss/train': 2.252934694290161} 11/06/2021 23:16:11 - INFO - __main__ - Step 14658: {'lr': 0.0004910312089338634, 'samples': 2814336, 'steps': 14657, 'loss/train': 1.369263768196106} 11/06/2021 23:16:11 - INFO - __main__ - Step 14659: {'lr': 0.0004910298002082863, 'samples': 2814528, 'steps': 14658, 'loss/train': 1.274910807609558} 11/06/2021 23:16:12 - INFO - __main__ - Step 14660: {'lr': 0.0004910283913741047, 'samples': 2814720, 'steps': 14659, 'loss/train': 1.555715560913086} 11/06/2021 23:16:13 - INFO - __main__ - Step 14661: {'lr': 0.0004910269824313194, 'samples': 2814912, 'steps': 14660, 'loss/train': 1.4081352949142456} 11/06/2021 23:16:13 - INFO - __main__ - Step 14662: {'lr': 0.0004910255733799312, 'samples': 2815104, 'steps': 14661, 'loss/train': 1.7849104404449463} 11/06/2021 23:16:13 - INFO - __main__ - Step 14663: {'lr': 0.0004910241642199406, 'samples': 2815296, 'steps': 14662, 'loss/train': 1.4840021133422852} 11/06/2021 23:16:14 - INFO - __main__ - Step 14664: {'lr': 0.0004910227549513481, 'samples': 2815488, 'steps': 14663, 'loss/train': 1.7678169012069702} 11/06/2021 23:16:14 - INFO - __main__ - Step 14665: {'lr': 0.0004910213455741546, 'samples': 2815680, 'steps': 14664, 'loss/train': 1.6775918006896973} 11/06/2021 23:16:15 - INFO - __main__ - Step 14666: {'lr': 0.0004910199360883605, 'samples': 2815872, 'steps': 14665, 'loss/train': 1.950029969215393} 11/06/2021 23:16:16 - INFO - __main__ - Step 14667: {'lr': 0.0004910185264939667, 'samples': 2816064, 'steps': 14666, 'loss/train': 1.7304264307022095} 11/06/2021 23:16:16 - INFO - __main__ - Step 14668: {'lr': 0.0004910171167909734, 'samples': 2816256, 'steps': 14667, 'loss/train': 1.4172935485839844} 11/06/2021 23:16:16 - INFO - __main__ - Step 14669: {'lr': 0.0004910157069793816, 'samples': 2816448, 'steps': 14668, 'loss/train': 1.8779479265213013} 11/06/2021 23:16:17 - INFO - __main__ - Step 14670: {'lr': 0.000491014297059192, 'samples': 2816640, 'steps': 14669, 'loss/train': 1.943215012550354} 11/06/2021 23:16:18 - INFO - __main__ - Step 14671: {'lr': 0.000491012887030405, 'samples': 2816832, 'steps': 14670, 'loss/train': 1.6946277618408203} 11/06/2021 23:16:18 - INFO - __main__ - Step 14672: {'lr': 0.0004910114768930212, 'samples': 2817024, 'steps': 14671, 'loss/train': 1.4919730424880981} 11/06/2021 23:16:18 - INFO - __main__ - Step 14673: {'lr': 0.0004910100666470415, 'samples': 2817216, 'steps': 14672, 'loss/train': 1.942678689956665} 11/06/2021 23:16:19 - INFO - __main__ - Step 14674: {'lr': 0.0004910086562924663, 'samples': 2817408, 'steps': 14673, 'loss/train': 1.5062963962554932} 11/06/2021 23:16:19 - INFO - __main__ - Step 14675: {'lr': 0.0004910072458292963, 'samples': 2817600, 'steps': 14674, 'loss/train': 1.1559689044952393} 11/06/2021 23:16:20 - INFO - __main__ - Step 14676: {'lr': 0.0004910058352575322, 'samples': 2817792, 'steps': 14675, 'loss/train': 1.5200366973876953} 11/06/2021 23:16:20 - INFO - __main__ - Step 14677: {'lr': 0.0004910044245771745, 'samples': 2817984, 'steps': 14676, 'loss/train': 1.2778904438018799} 11/06/2021 23:16:21 - INFO - __main__ - Step 14678: {'lr': 0.0004910030137882241, 'samples': 2818176, 'steps': 14677, 'loss/train': 2.667948007583618} 11/06/2021 23:16:21 - INFO - __main__ - Step 14679: {'lr': 0.0004910016028906813, 'samples': 2818368, 'steps': 14678, 'loss/train': 2.185955047607422} 11/06/2021 23:16:22 - INFO - __main__ - Step 14680: {'lr': 0.000491000191884547, 'samples': 2818560, 'steps': 14679, 'loss/train': 1.723203420639038} 11/06/2021 23:16:23 - INFO - __main__ - Step 14681: {'lr': 0.0004909987807698217, 'samples': 2818752, 'steps': 14680, 'loss/train': 1.139530062675476} 11/06/2021 23:16:23 - INFO - __main__ - Step 14682: {'lr': 0.000490997369546506, 'samples': 2818944, 'steps': 14681, 'loss/train': 1.7487425804138184} 11/06/2021 23:16:23 - INFO - __main__ - Step 14683: {'lr': 0.0004909959582146007, 'samples': 2819136, 'steps': 14682, 'loss/train': 1.6787686347961426} 11/06/2021 23:16:24 - INFO - __main__ - Step 14684: {'lr': 0.0004909945467741063, 'samples': 2819328, 'steps': 14683, 'loss/train': 1.1271356344223022} 11/06/2021 23:16:24 - INFO - __main__ - Step 14685: {'lr': 0.0004909931352250235, 'samples': 2819520, 'steps': 14684, 'loss/train': 3.540257453918457} 11/06/2021 23:16:24 - INFO - __main__ - Step 14686: {'lr': 0.0004909917235673529, 'samples': 2819712, 'steps': 14685, 'loss/train': 1.8830620050430298} 11/06/2021 23:16:25 - INFO - __main__ - Step 14687: {'lr': 0.0004909903118010951, 'samples': 2819904, 'steps': 14686, 'loss/train': 1.717665433883667} 11/06/2021 23:16:26 - INFO - __main__ - Step 14688: {'lr': 0.0004909888999262509, 'samples': 2820096, 'steps': 14687, 'loss/train': 0.35629066824913025} 11/06/2021 23:16:26 - INFO - __main__ - Step 14689: {'lr': 0.0004909874879428207, 'samples': 2820288, 'steps': 14688, 'loss/train': 1.5923266410827637} 11/06/2021 23:16:26 - INFO - __main__ - Step 14690: {'lr': 0.0004909860758508052, 'samples': 2820480, 'steps': 14689, 'loss/train': 1.798073172569275} 11/06/2021 23:16:27 - INFO - __main__ - Step 14691: {'lr': 0.0004909846636502053, 'samples': 2820672, 'steps': 14690, 'loss/train': 1.2456002235412598} 11/06/2021 23:16:28 - INFO - __main__ - Step 14692: {'lr': 0.0004909832513410213, 'samples': 2820864, 'steps': 14691, 'loss/train': 1.8775510787963867} 11/06/2021 23:16:28 - INFO - __main__ - Step 14693: {'lr': 0.000490981838923254, 'samples': 2821056, 'steps': 14692, 'loss/train': 2.1554715633392334} 11/06/2021 23:16:29 - INFO - __main__ - Step 14694: {'lr': 0.000490980426396904, 'samples': 2821248, 'steps': 14693, 'loss/train': 1.8785821199417114} 11/06/2021 23:16:29 - INFO - __main__ - Step 14695: {'lr': 0.0004909790137619719, 'samples': 2821440, 'steps': 14694, 'loss/train': 1.473230242729187} 11/06/2021 23:16:29 - INFO - __main__ - Step 14696: {'lr': 0.0004909776010184585, 'samples': 2821632, 'steps': 14695, 'loss/train': 1.770862340927124} 11/06/2021 23:16:30 - INFO - __main__ - Step 14697: {'lr': 0.0004909761881663642, 'samples': 2821824, 'steps': 14696, 'loss/train': 1.7851510047912598} 11/06/2021 23:16:31 - INFO - __main__ - Step 14698: {'lr': 0.0004909747752056897, 'samples': 2822016, 'steps': 14697, 'loss/train': 1.8535454273223877} 11/06/2021 23:16:31 - INFO - __main__ - Step 14699: {'lr': 0.0004909733621364358, 'samples': 2822208, 'steps': 14698, 'loss/train': 1.4395469427108765} 11/06/2021 23:16:31 - INFO - __main__ - Step 14700: {'lr': 0.0004909719489586029, 'samples': 2822400, 'steps': 14699, 'loss/train': 1.9752013683319092} 11/06/2021 23:16:32 - INFO - __main__ - Step 14701: {'lr': 0.0004909705356721919, 'samples': 2822592, 'steps': 14700, 'loss/train': 1.4780222177505493} 11/06/2021 23:16:32 - INFO - __main__ - Step 14702: {'lr': 0.0004909691222772032, 'samples': 2822784, 'steps': 14701, 'loss/train': 1.6931217908859253} 11/06/2021 23:16:33 - INFO - __main__ - Step 14703: {'lr': 0.0004909677087736375, 'samples': 2822976, 'steps': 14702, 'loss/train': 1.33567214012146} 11/06/2021 23:16:33 - INFO - __main__ - Step 14704: {'lr': 0.0004909662951614955, 'samples': 2823168, 'steps': 14703, 'loss/train': 1.3383984565734863} 11/06/2021 23:16:34 - INFO - __main__ - Step 14705: {'lr': 0.0004909648814407779, 'samples': 2823360, 'steps': 14704, 'loss/train': 1.8960798978805542} 11/06/2021 23:16:34 - INFO - __main__ - Step 14706: {'lr': 0.0004909634676114851, 'samples': 2823552, 'steps': 14705, 'loss/train': 1.757364273071289} 11/06/2021 23:16:34 - INFO - __main__ - Step 14707: {'lr': 0.000490962053673618, 'samples': 2823744, 'steps': 14706, 'loss/train': 1.1211416721343994} 11/06/2021 23:16:36 - INFO - __main__ - Step 14708: {'lr': 0.0004909606396271771, 'samples': 2823936, 'steps': 14707, 'loss/train': 1.9302036762237549} 11/06/2021 23:16:36 - INFO - __main__ - Step 14709: {'lr': 0.000490959225472163, 'samples': 2824128, 'steps': 14708, 'loss/train': 1.3645907640457153} 11/06/2021 23:16:36 - INFO - __main__ - Step 14710: {'lr': 0.0004909578112085764, 'samples': 2824320, 'steps': 14709, 'loss/train': 1.793792724609375} 11/06/2021 23:16:37 - INFO - __main__ - Step 14711: {'lr': 0.0004909563968364179, 'samples': 2824512, 'steps': 14710, 'loss/train': 1.516960859298706} 11/06/2021 23:16:37 - INFO - __main__ - Step 14712: {'lr': 0.0004909549823556883, 'samples': 2824704, 'steps': 14711, 'loss/train': 1.5521596670150757} 11/06/2021 23:16:37 - INFO - __main__ - Step 14713: {'lr': 0.000490953567766388, 'samples': 2824896, 'steps': 14712, 'loss/train': 1.9512124061584473} 11/06/2021 23:16:38 - INFO - __main__ - Step 14714: {'lr': 0.0004909521530685177, 'samples': 2825088, 'steps': 14713, 'loss/train': 2.1272761821746826} 11/06/2021 23:16:39 - INFO - __main__ - Step 14715: {'lr': 0.0004909507382620782, 'samples': 2825280, 'steps': 14714, 'loss/train': 1.705784559249878} 11/06/2021 23:16:39 - INFO - __main__ - Step 14716: {'lr': 0.0004909493233470699, 'samples': 2825472, 'steps': 14715, 'loss/train': 1.458823323249817} 11/06/2021 23:16:40 - INFO - __main__ - Step 14717: {'lr': 0.0004909479083234936, 'samples': 2825664, 'steps': 14716, 'loss/train': 1.4705344438552856} 11/06/2021 23:16:40 - INFO - __main__ - Step 14718: {'lr': 0.0004909464931913499, 'samples': 2825856, 'steps': 14717, 'loss/train': 2.149104118347168} 11/06/2021 23:16:41 - INFO - __main__ - Step 14719: {'lr': 0.0004909450779506393, 'samples': 2826048, 'steps': 14718, 'loss/train': 1.4514827728271484} 11/06/2021 23:16:41 - INFO - __main__ - Step 14720: {'lr': 0.0004909436626013628, 'samples': 2826240, 'steps': 14719, 'loss/train': 1.459384799003601} 11/06/2021 23:16:42 - INFO - __main__ - Step 14721: {'lr': 0.0004909422471435207, 'samples': 2826432, 'steps': 14720, 'loss/train': 1.9327574968338013} 11/06/2021 23:16:42 - INFO - __main__ - Step 14722: {'lr': 0.0004909408315771136, 'samples': 2826624, 'steps': 14721, 'loss/train': 1.8887290954589844} 11/06/2021 23:16:42 - INFO - __main__ - Step 14723: {'lr': 0.0004909394159021425, 'samples': 2826816, 'steps': 14722, 'loss/train': 1.6616466045379639} 11/06/2021 23:16:43 - INFO - __main__ - Step 14724: {'lr': 0.0004909380001186077, 'samples': 2827008, 'steps': 14723, 'loss/train': 1.528913974761963} 11/06/2021 23:16:44 - INFO - __main__ - Step 14725: {'lr': 0.00049093658422651, 'samples': 2827200, 'steps': 14724, 'loss/train': 1.4247432947158813} 11/06/2021 23:16:44 - INFO - __main__ - Step 14726: {'lr': 0.00049093516822585, 'samples': 2827392, 'steps': 14725, 'loss/train': 2.329897165298462} 11/06/2021 23:16:44 - INFO - __main__ - Step 14727: {'lr': 0.0004909337521166282, 'samples': 2827584, 'steps': 14726, 'loss/train': 1.4586058855056763} 11/06/2021 23:16:45 - INFO - __main__ - Step 14728: {'lr': 0.0004909323358988455, 'samples': 2827776, 'steps': 14727, 'loss/train': 1.297971487045288} 11/06/2021 23:16:45 - INFO - __main__ - Step 14729: {'lr': 0.0004909309195725024, 'samples': 2827968, 'steps': 14728, 'loss/train': 1.9658069610595703} 11/06/2021 23:16:46 - INFO - __main__ - Step 14730: {'lr': 0.0004909295031375996, 'samples': 2828160, 'steps': 14729, 'loss/train': 1.5267490148544312} 11/06/2021 23:16:47 - INFO - __main__ - Step 14731: {'lr': 0.0004909280865941375, 'samples': 2828352, 'steps': 14730, 'loss/train': 1.8899013996124268} 11/06/2021 23:16:47 - INFO - __main__ - Step 14732: {'lr': 0.0004909266699421171, 'samples': 2828544, 'steps': 14731, 'loss/train': 1.7061964273452759} 11/06/2021 23:16:47 - INFO - __main__ - Step 14733: {'lr': 0.0004909252531815388, 'samples': 2828736, 'steps': 14732, 'loss/train': 1.6438379287719727} 11/06/2021 23:16:48 - INFO - __main__ - Step 14734: {'lr': 0.0004909238363124033, 'samples': 2828928, 'steps': 14733, 'loss/train': 1.719048023223877} 11/06/2021 23:16:49 - INFO - __main__ - Step 14735: {'lr': 0.0004909224193347112, 'samples': 2829120, 'steps': 14734, 'loss/train': 1.9832409620285034} 11/06/2021 23:16:49 - INFO - __main__ - Step 14736: {'lr': 0.0004909210022484633, 'samples': 2829312, 'steps': 14735, 'loss/train': 2.6759371757507324} 11/06/2021 23:16:49 - INFO - __main__ - Step 14737: {'lr': 0.00049091958505366, 'samples': 2829504, 'steps': 14736, 'loss/train': 1.7943952083587646} 11/06/2021 23:16:50 - INFO - __main__ - Step 14738: {'lr': 0.000490918167750302, 'samples': 2829696, 'steps': 14737, 'loss/train': 1.9066545963287354} 11/06/2021 23:16:50 - INFO - __main__ - Step 14739: {'lr': 0.00049091675033839, 'samples': 2829888, 'steps': 14738, 'loss/train': 1.0074669122695923} 11/06/2021 23:16:51 - INFO - __main__ - Step 14740: {'lr': 0.0004909153328179248, 'samples': 2830080, 'steps': 14739, 'loss/train': 1.9582070112228394} 11/06/2021 23:16:51 - INFO - __main__ - Step 14741: {'lr': 0.0004909139151889067, 'samples': 2830272, 'steps': 14740, 'loss/train': 2.478091239929199} 11/06/2021 23:16:52 - INFO - __main__ - Step 14742: {'lr': 0.0004909124974513366, 'samples': 2830464, 'steps': 14741, 'loss/train': 1.6102755069732666} 11/06/2021 23:16:52 - INFO - __main__ - Step 14743: {'lr': 0.000490911079605215, 'samples': 2830656, 'steps': 14742, 'loss/train': 1.9308674335479736} 11/06/2021 23:16:53 - INFO - __main__ - Step 14744: {'lr': 0.0004909096616505426, 'samples': 2830848, 'steps': 14743, 'loss/train': 1.605130672454834} 11/06/2021 23:16:53 - INFO - __main__ - Step 14745: {'lr': 0.00049090824358732, 'samples': 2831040, 'steps': 14744, 'loss/train': 1.5920521020889282} 11/06/2021 23:16:54 - INFO - __main__ - Step 14746: {'lr': 0.0004909068254155479, 'samples': 2831232, 'steps': 14745, 'loss/train': 1.7607147693634033} 11/06/2021 23:16:55 - INFO - __main__ - Step 14747: {'lr': 0.0004909054071352269, 'samples': 2831424, 'steps': 14746, 'loss/train': 1.231070876121521} 11/06/2021 23:16:55 - INFO - __main__ - Step 14748: {'lr': 0.0004909039887463576, 'samples': 2831616, 'steps': 14747, 'loss/train': 1.5784339904785156} 11/06/2021 23:16:55 - INFO - __main__ - Step 14749: {'lr': 0.0004909025702489407, 'samples': 2831808, 'steps': 14748, 'loss/train': 0.686924934387207} 11/06/2021 23:16:56 - INFO - __main__ - Step 14750: {'lr': 0.0004909011516429768, 'samples': 2832000, 'steps': 14749, 'loss/train': 1.3042925596237183} 11/06/2021 23:16:57 - INFO - __main__ - Step 14751: {'lr': 0.0004908997329284667, 'samples': 2832192, 'steps': 14750, 'loss/train': 1.7104178667068481} 11/06/2021 23:16:57 - INFO - __main__ - Step 14752: {'lr': 0.0004908983141054107, 'samples': 2832384, 'steps': 14751, 'loss/train': 1.4332627058029175} 11/06/2021 23:16:57 - INFO - __main__ - Step 14753: {'lr': 0.0004908968951738098, 'samples': 2832576, 'steps': 14752, 'loss/train': 2.133230209350586} 11/06/2021 23:16:58 - INFO - __main__ - Step 14754: {'lr': 0.0004908954761336643, 'samples': 2832768, 'steps': 14753, 'loss/train': 1.732865333557129} 11/06/2021 23:16:58 - INFO - __main__ - Step 14755: {'lr': 0.0004908940569849751, 'samples': 2832960, 'steps': 14754, 'loss/train': 1.9178493022918701} 11/06/2021 23:16:59 - INFO - __main__ - Step 14756: {'lr': 0.0004908926377277428, 'samples': 2833152, 'steps': 14755, 'loss/train': 1.5390150547027588} 11/06/2021 23:16:59 - INFO - __main__ - Step 14757: {'lr': 0.000490891218361968, 'samples': 2833344, 'steps': 14756, 'loss/train': 1.8167674541473389} 11/06/2021 23:17:00 - INFO - __main__ - Step 14758: {'lr': 0.0004908897988876512, 'samples': 2833536, 'steps': 14757, 'loss/train': 1.5798640251159668} 11/06/2021 23:17:00 - INFO - __main__ - Step 14759: {'lr': 0.0004908883793047934, 'samples': 2833728, 'steps': 14758, 'loss/train': 1.2919437885284424} 11/06/2021 23:17:01 - INFO - __main__ - Step 14760: {'lr': 0.0004908869596133948, 'samples': 2833920, 'steps': 14759, 'loss/train': 2.303706407546997} 11/06/2021 23:17:01 - INFO - __main__ - Step 14761: {'lr': 0.0004908855398134563, 'samples': 2834112, 'steps': 14760, 'loss/train': 1.3456342220306396} 11/06/2021 23:17:02 - INFO - __main__ - Step 14762: {'lr': 0.0004908841199049785, 'samples': 2834304, 'steps': 14761, 'loss/train': 1.4965943098068237} 11/06/2021 23:17:02 - INFO - __main__ - Step 14763: {'lr': 0.0004908826998879621, 'samples': 2834496, 'steps': 14762, 'loss/train': 1.7584352493286133} 11/06/2021 23:17:03 - INFO - __main__ - Step 14764: {'lr': 0.0004908812797624077, 'samples': 2834688, 'steps': 14763, 'loss/train': 1.936519980430603} 11/06/2021 23:17:03 - INFO - __main__ - Step 14765: {'lr': 0.0004908798595283159, 'samples': 2834880, 'steps': 14764, 'loss/train': 2.077310562133789} 11/06/2021 23:17:03 - INFO - __main__ - Step 14766: {'lr': 0.0004908784391856872, 'samples': 2835072, 'steps': 14765, 'loss/train': 1.6648650169372559} 11/06/2021 23:17:05 - INFO - __main__ - Step 14767: {'lr': 0.0004908770187345225, 'samples': 2835264, 'steps': 14766, 'loss/train': 1.7535183429718018} 11/06/2021 23:17:05 - INFO - __main__ - Step 14768: {'lr': 0.0004908755981748223, 'samples': 2835456, 'steps': 14767, 'loss/train': 1.9513096809387207} 11/06/2021 23:17:05 - INFO - __main__ - Step 14769: {'lr': 0.0004908741775065873, 'samples': 2835648, 'steps': 14768, 'loss/train': 2.233029365539551} 11/06/2021 23:17:06 - INFO - __main__ - Step 14770: {'lr': 0.0004908727567298181, 'samples': 2835840, 'steps': 14769, 'loss/train': 1.4736902713775635} 11/06/2021 23:17:06 - INFO - __main__ - Step 14771: {'lr': 0.0004908713358445154, 'samples': 2836032, 'steps': 14770, 'loss/train': 5.864256381988525} 11/06/2021 23:17:06 - INFO - __main__ - Step 14772: {'lr': 0.0004908699148506797, 'samples': 2836224, 'steps': 14771, 'loss/train': 1.9193412065505981} 11/06/2021 23:17:07 - INFO - __main__ - Step 14773: {'lr': 0.0004908684937483119, 'samples': 2836416, 'steps': 14772, 'loss/train': 1.799191951751709} 11/06/2021 23:17:08 - INFO - __main__ - Step 14774: {'lr': 0.0004908670725374122, 'samples': 2836608, 'steps': 14773, 'loss/train': 1.3834148645401} 11/06/2021 23:17:08 - INFO - __main__ - Step 14775: {'lr': 0.0004908656512179817, 'samples': 2836800, 'steps': 14774, 'loss/train': 1.5654276609420776} 11/06/2021 23:17:08 - INFO - __main__ - Step 14776: {'lr': 0.0004908642297900209, 'samples': 2836992, 'steps': 14775, 'loss/train': 1.6006273031234741} 11/06/2021 23:17:09 - INFO - __main__ - Step 14777: {'lr': 0.0004908628082535303, 'samples': 2837184, 'steps': 14776, 'loss/train': 1.8874455690383911} 11/06/2021 23:17:10 - INFO - __main__ - Step 14778: {'lr': 0.0004908613866085106, 'samples': 2837376, 'steps': 14777, 'loss/train': 1.3304411172866821} 11/06/2021 23:17:10 - INFO - __main__ - Step 14779: {'lr': 0.0004908599648549626, 'samples': 2837568, 'steps': 14778, 'loss/train': 1.500115990638733} 11/06/2021 23:17:11 - INFO - __main__ - Step 14780: {'lr': 0.0004908585429928867, 'samples': 2837760, 'steps': 14779, 'loss/train': 1.748668909072876} 11/06/2021 23:17:11 - INFO - __main__ - Step 14781: {'lr': 0.0004908571210222837, 'samples': 2837952, 'steps': 14780, 'loss/train': 1.470587968826294} 11/06/2021 23:17:11 - INFO - __main__ - Step 14782: {'lr': 0.0004908556989431543, 'samples': 2838144, 'steps': 14781, 'loss/train': 0.8396479487419128} 11/06/2021 23:17:12 - INFO - __main__ - Step 14783: {'lr': 0.0004908542767554988, 'samples': 2838336, 'steps': 14782, 'loss/train': 1.8558242321014404} 11/06/2021 23:17:13 - INFO - __main__ - Step 14784: {'lr': 0.0004908528544593184, 'samples': 2838528, 'steps': 14783, 'loss/train': 1.9126049280166626} 11/06/2021 23:17:13 - INFO - __main__ - Step 14785: {'lr': 0.0004908514320546132, 'samples': 2838720, 'steps': 14784, 'loss/train': 1.7966852188110352} 11/06/2021 23:17:13 - INFO - __main__ - Step 14786: {'lr': 0.000490850009541384, 'samples': 2838912, 'steps': 14785, 'loss/train': 1.829959750175476} 11/06/2021 23:17:14 - INFO - __main__ - Step 14787: {'lr': 0.0004908485869196317, 'samples': 2839104, 'steps': 14786, 'loss/train': 1.6409757137298584} 11/06/2021 23:17:15 - INFO - __main__ - Step 14788: {'lr': 0.0004908471641893566, 'samples': 2839296, 'steps': 14787, 'loss/train': 1.4668761491775513} 11/06/2021 23:17:15 - INFO - __main__ - Step 14789: {'lr': 0.0004908457413505596, 'samples': 2839488, 'steps': 14788, 'loss/train': 1.7249447107315063} 11/06/2021 23:17:16 - INFO - __main__ - Step 14790: {'lr': 0.0004908443184032411, 'samples': 2839680, 'steps': 14789, 'loss/train': 1.671472430229187} 11/06/2021 23:17:16 - INFO - __main__ - Step 14791: {'lr': 0.0004908428953474019, 'samples': 2839872, 'steps': 14790, 'loss/train': 1.2302889823913574} 11/06/2021 23:17:16 - INFO - __main__ - Step 14792: {'lr': 0.0004908414721830427, 'samples': 2840064, 'steps': 14791, 'loss/train': 1.5707032680511475} 11/06/2021 23:17:17 - INFO - __main__ - Step 14793: {'lr': 0.000490840048910164, 'samples': 2840256, 'steps': 14792, 'loss/train': 1.8218657970428467} 11/06/2021 23:17:18 - INFO - __main__ - Step 14794: {'lr': 0.0004908386255287664, 'samples': 2840448, 'steps': 14793, 'loss/train': 2.0798017978668213} 11/06/2021 23:17:18 - INFO - __main__ - Step 14795: {'lr': 0.0004908372020388508, 'samples': 2840640, 'steps': 14794, 'loss/train': 1.7213116884231567} 11/06/2021 23:17:18 - INFO - __main__ - Step 14796: {'lr': 0.0004908357784404175, 'samples': 2840832, 'steps': 14795, 'loss/train': 1.5220595598220825} 11/06/2021 23:17:19 - INFO - __main__ - Step 14797: {'lr': 0.0004908343547334674, 'samples': 2841024, 'steps': 14796, 'loss/train': 1.586082100868225} 11/06/2021 23:17:19 - INFO - __main__ - Step 14798: {'lr': 0.0004908329309180011, 'samples': 2841216, 'steps': 14797, 'loss/train': 2.087536573410034} 11/06/2021 23:17:20 - INFO - __main__ - Step 14799: {'lr': 0.0004908315069940191, 'samples': 2841408, 'steps': 14798, 'loss/train': 1.841811180114746} 11/06/2021 23:17:20 - INFO - __main__ - Step 14800: {'lr': 0.0004908300829615222, 'samples': 2841600, 'steps': 14799, 'loss/train': 1.4489543437957764} 11/06/2021 23:17:21 - INFO - __main__ - Step 14801: {'lr': 0.000490828658820511, 'samples': 2841792, 'steps': 14800, 'loss/train': 1.7258611917495728} 11/06/2021 23:17:21 - INFO - __main__ - Step 14802: {'lr': 0.0004908272345709861, 'samples': 2841984, 'steps': 14801, 'loss/train': 1.824029564857483} 11/06/2021 23:17:21 - INFO - __main__ - Step 14803: {'lr': 0.0004908258102129481, 'samples': 2842176, 'steps': 14802, 'loss/train': 1.134230375289917} 11/06/2021 23:17:23 - INFO - __main__ - Step 14804: {'lr': 0.0004908243857463978, 'samples': 2842368, 'steps': 14803, 'loss/train': 1.6424596309661865} 11/06/2021 23:17:23 - INFO - __main__ - Step 14805: {'lr': 0.0004908229611713357, 'samples': 2842560, 'steps': 14804, 'loss/train': 1.7975839376449585} 11/06/2021 23:17:23 - INFO - __main__ - Step 14806: {'lr': 0.0004908215364877625, 'samples': 2842752, 'steps': 14805, 'loss/train': 1.8489753007888794} 11/06/2021 23:17:24 - INFO - __main__ - Step 14807: {'lr': 0.0004908201116956788, 'samples': 2842944, 'steps': 14806, 'loss/train': 4.51678466796875} 11/06/2021 23:17:24 - INFO - __main__ - Step 14808: {'lr': 0.0004908186867950854, 'samples': 2843136, 'steps': 14807, 'loss/train': 1.53168785572052} 11/06/2021 23:17:24 - INFO - __main__ - Step 14809: {'lr': 0.0004908172617859826, 'samples': 2843328, 'steps': 14808, 'loss/train': 1.772919774055481} 11/06/2021 23:17:25 - INFO - __main__ - Step 14810: {'lr': 0.0004908158366683714, 'samples': 2843520, 'steps': 14809, 'loss/train': 2.220637798309326} 11/06/2021 23:17:26 - INFO - __main__ - Step 14811: {'lr': 0.0004908144114422523, 'samples': 2843712, 'steps': 14810, 'loss/train': 0.5302165150642395} 11/06/2021 23:17:26 - INFO - __main__ - Step 14812: {'lr': 0.000490812986107626, 'samples': 2843904, 'steps': 14811, 'loss/train': 1.51813542842865} 11/06/2021 23:17:26 - INFO - __main__ - Step 14813: {'lr': 0.000490811560664493, 'samples': 2844096, 'steps': 14812, 'loss/train': 1.3996822834014893} 11/06/2021 23:17:27 - INFO - __main__ - Step 14814: {'lr': 0.000490810135112854, 'samples': 2844288, 'steps': 14813, 'loss/train': 1.816659688949585} 11/06/2021 23:17:28 - INFO - __main__ - Step 14815: {'lr': 0.0004908087094527097, 'samples': 2844480, 'steps': 14814, 'loss/train': 1.3649659156799316} 11/06/2021 23:17:28 - INFO - __main__ - Step 14816: {'lr': 0.0004908072836840607, 'samples': 2844672, 'steps': 14815, 'loss/train': 1.6869769096374512} 11/06/2021 23:17:29 - INFO - __main__ - Step 14817: {'lr': 0.0004908058578069077, 'samples': 2844864, 'steps': 14816, 'loss/train': 1.6598951816558838} 11/06/2021 23:17:29 - INFO - __main__ - Step 14818: {'lr': 0.0004908044318212512, 'samples': 2845056, 'steps': 14817, 'loss/train': 1.5969890356063843} 11/06/2021 23:17:29 - INFO - __main__ - Step 14819: {'lr': 0.000490803005727092, 'samples': 2845248, 'steps': 14818, 'loss/train': 1.8578797578811646} 11/06/2021 23:17:30 - INFO - __main__ - Step 14820: {'lr': 0.0004908015795244307, 'samples': 2845440, 'steps': 14819, 'loss/train': 1.8108375072479248} 11/06/2021 23:17:31 - INFO - __main__ - Step 14821: {'lr': 0.0004908001532132679, 'samples': 2845632, 'steps': 14820, 'loss/train': 1.6355654001235962} 11/06/2021 23:17:31 - INFO - __main__ - Step 14822: {'lr': 0.0004907987267936042, 'samples': 2845824, 'steps': 14821, 'loss/train': 1.8750498294830322} 11/06/2021 23:17:31 - INFO - __main__ - Step 14823: {'lr': 0.0004907973002654404, 'samples': 2846016, 'steps': 14822, 'loss/train': 1.708099603652954} 11/06/2021 23:17:32 - INFO - __main__ - Step 14824: {'lr': 0.0004907958736287771, 'samples': 2846208, 'steps': 14823, 'loss/train': 1.5635958909988403} 11/06/2021 23:17:33 - INFO - __main__ - Step 14825: {'lr': 0.0004907944468836148, 'samples': 2846400, 'steps': 14824, 'loss/train': 3.393544912338257} 11/06/2021 23:17:33 - INFO - __main__ - Step 14826: {'lr': 0.0004907930200299543, 'samples': 2846592, 'steps': 14825, 'loss/train': 1.548439383506775} 11/06/2021 23:17:33 - INFO - __main__ - Step 14827: {'lr': 0.0004907915930677961, 'samples': 2846784, 'steps': 14826, 'loss/train': 1.3716732263565063} 11/06/2021 23:17:34 - INFO - __main__ - Step 14828: {'lr': 0.000490790165997141, 'samples': 2846976, 'steps': 14827, 'loss/train': 1.7341458797454834} 11/06/2021 23:17:34 - INFO - __main__ - Step 14829: {'lr': 0.0004907887388179896, 'samples': 2847168, 'steps': 14828, 'loss/train': 1.6612669229507446} 11/06/2021 23:17:35 - INFO - __main__ - Step 14830: {'lr': 0.0004907873115303424, 'samples': 2847360, 'steps': 14829, 'loss/train': 1.5637840032577515} 11/06/2021 23:17:36 - INFO - __main__ - Step 14831: {'lr': 0.0004907858841342002, 'samples': 2847552, 'steps': 14830, 'loss/train': 1.3963515758514404} 11/06/2021 23:17:36 - INFO - __main__ - Step 14832: {'lr': 0.0004907844566295637, 'samples': 2847744, 'steps': 14831, 'loss/train': 1.5587491989135742} 11/06/2021 23:17:36 - INFO - __main__ - Step 14833: {'lr': 0.0004907830290164332, 'samples': 2847936, 'steps': 14832, 'loss/train': 1.662847638130188} 11/06/2021 23:17:37 - INFO - __main__ - Step 14834: {'lr': 0.0004907816012948098, 'samples': 2848128, 'steps': 14833, 'loss/train': 2.050675630569458} 11/06/2021 23:17:37 - INFO - __main__ - Step 14835: {'lr': 0.0004907801734646938, 'samples': 2848320, 'steps': 14834, 'loss/train': 1.9373985528945923} 11/06/2021 23:17:38 - INFO - __main__ - Step 14836: {'lr': 0.000490778745526086, 'samples': 2848512, 'steps': 14835, 'loss/train': 1.7804934978485107} 11/06/2021 23:17:38 - INFO - __main__ - Step 14837: {'lr': 0.000490777317478987, 'samples': 2848704, 'steps': 14836, 'loss/train': 1.345388412475586} 11/06/2021 23:17:39 - INFO - __main__ - Step 14838: {'lr': 0.0004907758893233975, 'samples': 2848896, 'steps': 14837, 'loss/train': 1.093638300895691} 11/06/2021 23:17:39 - INFO - __main__ - Step 14839: {'lr': 0.0004907744610593181, 'samples': 2849088, 'steps': 14838, 'loss/train': 1.936640739440918} 11/06/2021 23:17:39 - INFO - __main__ - Step 14840: {'lr': 0.0004907730326867495, 'samples': 2849280, 'steps': 14839, 'loss/train': 1.6983401775360107} 11/06/2021 23:17:40 - INFO - __main__ - Step 14841: {'lr': 0.0004907716042056921, 'samples': 2849472, 'steps': 14840, 'loss/train': 2.199077844619751} 11/06/2021 23:17:41 - INFO - __main__ - Step 14842: {'lr': 0.0004907701756161469, 'samples': 2849664, 'steps': 14841, 'loss/train': 2.0116066932678223} 11/06/2021 23:17:41 - INFO - __main__ - Step 14843: {'lr': 0.0004907687469181143, 'samples': 2849856, 'steps': 14842, 'loss/train': 1.461774230003357} 11/06/2021 23:17:41 - INFO - __main__ - Step 14844: {'lr': 0.000490767318111595, 'samples': 2850048, 'steps': 14843, 'loss/train': 1.6642917394638062} 11/06/2021 23:17:42 - INFO - __main__ - Step 14845: {'lr': 0.0004907658891965897, 'samples': 2850240, 'steps': 14844, 'loss/train': 1.9356948137283325} 11/06/2021 23:17:43 - INFO - __main__ - Step 14846: {'lr': 0.000490764460173099, 'samples': 2850432, 'steps': 14845, 'loss/train': 1.7684423923492432} 11/06/2021 23:17:43 - INFO - __main__ - Step 14847: {'lr': 0.0004907630310411236, 'samples': 2850624, 'steps': 14846, 'loss/train': 1.5917563438415527} 11/06/2021 23:17:43 - INFO - __main__ - Step 14848: {'lr': 0.000490761601800664, 'samples': 2850816, 'steps': 14847, 'loss/train': 1.7513176202774048} 11/06/2021 23:17:44 - INFO - __main__ - Step 14849: {'lr': 0.000490760172451721, 'samples': 2851008, 'steps': 14848, 'loss/train': 1.5119004249572754} 11/06/2021 23:17:44 - INFO - __main__ - Step 14850: {'lr': 0.0004907587429942952, 'samples': 2851200, 'steps': 14849, 'loss/train': 2.091411590576172} 11/06/2021 23:17:45 - INFO - __main__ - Step 14851: {'lr': 0.0004907573134283872, 'samples': 2851392, 'steps': 14850, 'loss/train': 1.808640718460083} 11/06/2021 23:17:45 - INFO - __main__ - Step 14852: {'lr': 0.0004907558837539976, 'samples': 2851584, 'steps': 14851, 'loss/train': 1.7864621877670288} 11/06/2021 23:17:46 - INFO - __main__ - Step 14853: {'lr': 0.0004907544539711272, 'samples': 2851776, 'steps': 14852, 'loss/train': 1.476940631866455} 11/06/2021 23:17:46 - INFO - __main__ - Step 14854: {'lr': 0.0004907530240797765, 'samples': 2851968, 'steps': 14853, 'loss/train': 1.674166202545166} 11/06/2021 23:17:47 - INFO - __main__ - Step 14855: {'lr': 0.0004907515940799463, 'samples': 2852160, 'steps': 14854, 'loss/train': 1.497134804725647} 11/06/2021 23:17:48 - INFO - __main__ - Step 14856: {'lr': 0.000490750163971637, 'samples': 2852352, 'steps': 14855, 'loss/train': 1.8654690980911255} 11/06/2021 23:17:48 - INFO - __main__ - Step 14857: {'lr': 0.0004907487337548495, 'samples': 2852544, 'steps': 14856, 'loss/train': 1.693071722984314} 11/06/2021 23:17:49 - INFO - __main__ - Step 14858: {'lr': 0.0004907473034295843, 'samples': 2852736, 'steps': 14857, 'loss/train': 1.270855188369751} 11/06/2021 23:17:49 - INFO - __main__ - Step 14859: {'lr': 0.0004907458729958422, 'samples': 2852928, 'steps': 14858, 'loss/train': 1.4801957607269287} 11/06/2021 23:17:49 - INFO - __main__ - Step 14860: {'lr': 0.0004907444424536235, 'samples': 2853120, 'steps': 14859, 'loss/train': 1.5060356855392456} 11/06/2021 23:17:50 - INFO - __main__ - Step 14861: {'lr': 0.0004907430118029293, 'samples': 2853312, 'steps': 14860, 'loss/train': 1.8055636882781982} 11/06/2021 23:17:51 - INFO - __main__ - Step 14862: {'lr': 0.0004907415810437598, 'samples': 2853504, 'steps': 14861, 'loss/train': 1.6325714588165283} 11/06/2021 23:17:51 - INFO - __main__ - Step 14863: {'lr': 0.0004907401501761159, 'samples': 2853696, 'steps': 14862, 'loss/train': 1.5240354537963867} 11/06/2021 23:17:51 - INFO - __main__ - Step 14864: {'lr': 0.0004907387191999984, 'samples': 2853888, 'steps': 14863, 'loss/train': 1.4284332990646362} 11/06/2021 23:17:52 - INFO - __main__ - Step 14865: {'lr': 0.0004907372881154075, 'samples': 2854080, 'steps': 14864, 'loss/train': 1.6523523330688477} 11/06/2021 23:17:53 - INFO - __main__ - Step 14866: {'lr': 0.0004907358569223442, 'samples': 2854272, 'steps': 14865, 'loss/train': 1.5434261560440063} 11/06/2021 23:17:53 - INFO - __main__ - Step 14867: {'lr': 0.000490734425620809, 'samples': 2854464, 'steps': 14866, 'loss/train': 1.671189785003662} 11/06/2021 23:17:53 - INFO - __main__ - Step 14868: {'lr': 0.0004907329942108027, 'samples': 2854656, 'steps': 14867, 'loss/train': 1.6853291988372803} 11/06/2021 23:17:54 - INFO - __main__ - Step 14869: {'lr': 0.0004907315626923258, 'samples': 2854848, 'steps': 14868, 'loss/train': 1.1505197286605835} 11/06/2021 23:17:54 - INFO - __main__ - Step 14870: {'lr': 0.0004907301310653789, 'samples': 2855040, 'steps': 14869, 'loss/train': 1.6277196407318115} 11/06/2021 23:17:55 - INFO - __main__ - Step 14871: {'lr': 0.0004907286993299627, 'samples': 2855232, 'steps': 14870, 'loss/train': 2.135899305343628} 11/06/2021 23:17:56 - INFO - __main__ - Step 14872: {'lr': 0.0004907272674860779, 'samples': 2855424, 'steps': 14871, 'loss/train': 2.0826239585876465} 11/06/2021 23:17:56 - INFO - __main__ - Step 14873: {'lr': 0.0004907258355337251, 'samples': 2855616, 'steps': 14872, 'loss/train': 1.8687199354171753} 11/06/2021 23:17:56 - INFO - __main__ - Step 14874: {'lr': 0.0004907244034729049, 'samples': 2855808, 'steps': 14873, 'loss/train': 1.6615655422210693} 11/06/2021 23:17:57 - INFO - __main__ - Step 14875: {'lr': 0.0004907229713036181, 'samples': 2856000, 'steps': 14874, 'loss/train': 1.7533270120620728} 11/06/2021 23:17:57 - INFO - __main__ - Step 14876: {'lr': 0.0004907215390258652, 'samples': 2856192, 'steps': 14875, 'loss/train': 1.654037356376648} 11/06/2021 23:17:58 - INFO - __main__ - Step 14877: {'lr': 0.0004907201066396469, 'samples': 2856384, 'steps': 14876, 'loss/train': 1.8480337858200073} 11/06/2021 23:17:58 - INFO - __main__ - Step 14878: {'lr': 0.0004907186741449638, 'samples': 2856576, 'steps': 14877, 'loss/train': 1.6371917724609375} 11/06/2021 23:17:59 - INFO - __main__ - Step 14879: {'lr': 0.0004907172415418166, 'samples': 2856768, 'steps': 14878, 'loss/train': 1.5165156126022339} 11/06/2021 23:17:59 - INFO - __main__ - Step 14880: {'lr': 0.0004907158088302059, 'samples': 2856960, 'steps': 14879, 'loss/train': 1.6989234685897827} 11/06/2021 23:17:59 - INFO - __main__ - Step 14881: {'lr': 0.0004907143760101325, 'samples': 2857152, 'steps': 14880, 'loss/train': 1.368466854095459} 11/06/2021 23:18:01 - INFO - __main__ - Step 14882: {'lr': 0.0004907129430815968, 'samples': 2857344, 'steps': 14881, 'loss/train': 1.8604637384414673} 11/06/2021 23:18:01 - INFO - __main__ - Step 14883: {'lr': 0.0004907115100445996, 'samples': 2857536, 'steps': 14882, 'loss/train': 1.6678239107131958} 11/06/2021 23:18:01 - INFO - __main__ - Step 14884: {'lr': 0.0004907100768991415, 'samples': 2857728, 'steps': 14883, 'loss/train': 2.169818878173828} 11/06/2021 23:18:02 - INFO - __main__ - Step 14885: {'lr': 0.0004907086436452231, 'samples': 2857920, 'steps': 14884, 'loss/train': 1.5355026721954346} 11/06/2021 23:18:02 - INFO - __main__ - Step 14886: {'lr': 0.0004907072102828451, 'samples': 2858112, 'steps': 14885, 'loss/train': 1.5460067987442017} 11/06/2021 23:18:03 - INFO - __main__ - Step 14887: {'lr': 0.0004907057768120082, 'samples': 2858304, 'steps': 14886, 'loss/train': 1.5537009239196777} 11/06/2021 23:18:03 - INFO - __main__ - Step 14888: {'lr': 0.000490704343232713, 'samples': 2858496, 'steps': 14887, 'loss/train': 1.7863291501998901} 11/06/2021 23:18:04 - INFO - __main__ - Step 14889: {'lr': 0.0004907029095449602, 'samples': 2858688, 'steps': 14888, 'loss/train': 1.847086787223816} 11/06/2021 23:18:04 - INFO - __main__ - Step 14890: {'lr': 0.0004907014757487503, 'samples': 2858880, 'steps': 14889, 'loss/train': 1.9282914400100708} 11/06/2021 23:18:04 - INFO - __main__ - Step 14891: {'lr': 0.0004907000418440839, 'samples': 2859072, 'steps': 14890, 'loss/train': 1.711198091506958} 11/06/2021 23:18:05 - INFO - __main__ - Step 14892: {'lr': 0.000490698607830962, 'samples': 2859264, 'steps': 14891, 'loss/train': 1.5940576791763306} 11/06/2021 23:18:06 - INFO - __main__ - Step 14893: {'lr': 0.0004906971737093849, 'samples': 2859456, 'steps': 14892, 'loss/train': 1.9280325174331665} 11/06/2021 23:18:06 - INFO - __main__ - Step 14894: {'lr': 0.0004906957394793534, 'samples': 2859648, 'steps': 14893, 'loss/train': 1.427196979522705} 11/06/2021 23:18:06 - INFO - __main__ - Step 14895: {'lr': 0.0004906943051408682, 'samples': 2859840, 'steps': 14894, 'loss/train': 1.3355380296707153} 11/06/2021 23:18:07 - INFO - __main__ - Step 14896: {'lr': 0.0004906928706939296, 'samples': 2860032, 'steps': 14895, 'loss/train': 1.7179498672485352} 11/06/2021 23:18:08 - INFO - __main__ - Step 14897: {'lr': 0.0004906914361385387, 'samples': 2860224, 'steps': 14896, 'loss/train': 1.7625758647918701} 11/06/2021 23:18:08 - INFO - __main__ - Step 14898: {'lr': 0.0004906900014746959, 'samples': 2860416, 'steps': 14897, 'loss/train': 1.622707724571228} 11/06/2021 23:18:09 - INFO - __main__ - Step 14899: {'lr': 0.000490688566702402, 'samples': 2860608, 'steps': 14898, 'loss/train': 1.7065162658691406} 11/06/2021 23:18:09 - INFO - __main__ - Step 14900: {'lr': 0.0004906871318216575, 'samples': 2860800, 'steps': 14899, 'loss/train': 1.6930841207504272} 11/06/2021 23:18:09 - INFO - __main__ - Step 14901: {'lr': 0.000490685696832463, 'samples': 2860992, 'steps': 14900, 'loss/train': 1.9356876611709595} 11/06/2021 23:18:10 - INFO - __main__ - Step 14902: {'lr': 0.0004906842617348193, 'samples': 2861184, 'steps': 14901, 'loss/train': 1.8025461435317993} 11/06/2021 23:18:11 - INFO - __main__ - Step 14903: {'lr': 0.000490682826528727, 'samples': 2861376, 'steps': 14902, 'loss/train': 1.7793097496032715} 11/06/2021 23:18:11 - INFO - __main__ - Step 14904: {'lr': 0.0004906813912141868, 'samples': 2861568, 'steps': 14903, 'loss/train': 2.1294806003570557} 11/06/2021 23:18:11 - INFO - __main__ - Step 14905: {'lr': 0.0004906799557911992, 'samples': 2861760, 'steps': 14904, 'loss/train': 1.366396427154541} 11/06/2021 23:18:12 - INFO - __main__ - Step 14906: {'lr': 0.0004906785202597649, 'samples': 2861952, 'steps': 14905, 'loss/train': 1.2872095108032227} 11/06/2021 23:18:13 - INFO - __main__ - Step 14907: {'lr': 0.0004906770846198846, 'samples': 2862144, 'steps': 14906, 'loss/train': 1.8077151775360107} 11/06/2021 23:18:13 - INFO - __main__ - Step 14908: {'lr': 0.0004906756488715589, 'samples': 2862336, 'steps': 14907, 'loss/train': 1.617505669593811} 11/06/2021 23:18:14 - INFO - __main__ - Step 14909: {'lr': 0.0004906742130147884, 'samples': 2862528, 'steps': 14908, 'loss/train': 1.6927767992019653} 11/06/2021 23:18:14 - INFO - __main__ - Step 14910: {'lr': 0.0004906727770495739, 'samples': 2862720, 'steps': 14909, 'loss/train': 1.645467758178711} 11/06/2021 23:18:14 - INFO - __main__ - Step 14911: {'lr': 0.000490671340975916, 'samples': 2862912, 'steps': 14910, 'loss/train': 1.7554125785827637} 11/06/2021 23:18:15 - INFO - __main__ - Step 14912: {'lr': 0.0004906699047938153, 'samples': 2863104, 'steps': 14911, 'loss/train': 1.9322426319122314} 11/06/2021 23:18:16 - INFO - __main__ - Step 14913: {'lr': 0.0004906684685032724, 'samples': 2863296, 'steps': 14912, 'loss/train': 0.9366849064826965} 11/06/2021 23:18:16 - INFO - __main__ - Step 14914: {'lr': 0.0004906670321042881, 'samples': 2863488, 'steps': 14913, 'loss/train': 1.394436240196228} 11/06/2021 23:18:16 - INFO - __main__ - Step 14915: {'lr': 0.0004906655955968628, 'samples': 2863680, 'steps': 14914, 'loss/train': 1.2023478746414185} 11/06/2021 23:18:17 - INFO - __main__ - Step 14916: {'lr': 0.0004906641589809973, 'samples': 2863872, 'steps': 14915, 'loss/train': 2.0545525550842285} 11/06/2021 23:18:17 - INFO - __main__ - Step 14917: {'lr': 0.0004906627222566924, 'samples': 2864064, 'steps': 14916, 'loss/train': 1.4498852491378784} 11/06/2021 23:18:18 - INFO - __main__ - Step 14918: {'lr': 0.0004906612854239485, 'samples': 2864256, 'steps': 14917, 'loss/train': 1.5706671476364136} 11/06/2021 23:18:19 - INFO - __main__ - Step 14919: {'lr': 0.0004906598484827663, 'samples': 2864448, 'steps': 14918, 'loss/train': 1.427777647972107} 11/06/2021 23:18:19 - INFO - __main__ - Step 14920: {'lr': 0.0004906584114331465, 'samples': 2864640, 'steps': 14919, 'loss/train': 1.9566161632537842} 11/06/2021 23:18:19 - INFO - __main__ - Step 14921: {'lr': 0.0004906569742750899, 'samples': 2864832, 'steps': 14920, 'loss/train': 1.7183939218521118} 11/06/2021 23:18:20 - INFO - __main__ - Step 14922: {'lr': 0.0004906555370085968, 'samples': 2865024, 'steps': 14921, 'loss/train': 1.8103671073913574} 11/06/2021 23:18:21 - INFO - __main__ - Step 14923: {'lr': 0.000490654099633668, 'samples': 2865216, 'steps': 14922, 'loss/train': 1.5817921161651611} 11/06/2021 23:18:21 - INFO - __main__ - Step 14924: {'lr': 0.0004906526621503043, 'samples': 2865408, 'steps': 14923, 'loss/train': 1.145911455154419} 11/06/2021 23:18:21 - INFO - __main__ - Step 14925: {'lr': 0.0004906512245585062, 'samples': 2865600, 'steps': 14924, 'loss/train': 2.2841901779174805} 11/06/2021 23:18:22 - INFO - __main__ - Step 14926: {'lr': 0.0004906497868582743, 'samples': 2865792, 'steps': 14925, 'loss/train': 1.4507989883422852} 11/06/2021 23:18:22 - INFO - __main__ - Step 14927: {'lr': 0.0004906483490496093, 'samples': 2865984, 'steps': 14926, 'loss/train': 1.791601538658142} 11/06/2021 23:18:23 - INFO - __main__ - Step 14928: {'lr': 0.000490646911132512, 'samples': 2866176, 'steps': 14927, 'loss/train': 1.7833375930786133} 11/06/2021 23:18:23 - INFO - __main__ - Step 14929: {'lr': 0.0004906454731069828, 'samples': 2866368, 'steps': 14928, 'loss/train': 1.2282170057296753} 11/06/2021 23:18:24 - INFO - __main__ - Step 14930: {'lr': 0.0004906440349730226, 'samples': 2866560, 'steps': 14929, 'loss/train': 1.1155771017074585} 11/06/2021 23:18:24 - INFO - __main__ - Step 14931: {'lr': 0.0004906425967306317, 'samples': 2866752, 'steps': 14930, 'loss/train': 1.8701684474945068} 11/06/2021 23:18:24 - INFO - __main__ - Step 14932: {'lr': 0.0004906411583798112, 'samples': 2866944, 'steps': 14931, 'loss/train': 1.1279278993606567} 11/06/2021 23:18:26 - INFO - __main__ - Step 14933: {'lr': 0.0004906397199205614, 'samples': 2867136, 'steps': 14932, 'loss/train': 1.0893620252609253} 11/06/2021 23:18:26 - INFO - __main__ - Step 14934: {'lr': 0.000490638281352883, 'samples': 2867328, 'steps': 14933, 'loss/train': 1.8908346891403198} 11/06/2021 23:18:26 - INFO - __main__ - Step 14935: {'lr': 0.0004906368426767767, 'samples': 2867520, 'steps': 14934, 'loss/train': 0.6824740767478943} 11/06/2021 23:18:27 - INFO - __main__ - Step 14936: {'lr': 0.0004906354038922432, 'samples': 2867712, 'steps': 14935, 'loss/train': 1.8815536499023438} 11/06/2021 23:18:27 - INFO - __main__ - Step 14937: {'lr': 0.000490633964999283, 'samples': 2867904, 'steps': 14936, 'loss/train': 1.1689196825027466} 11/06/2021 23:18:28 - INFO - __main__ - Step 14938: {'lr': 0.000490632525997897, 'samples': 2868096, 'steps': 14937, 'loss/train': 1.6703389883041382} 11/06/2021 23:18:28 - INFO - __main__ - Step 14939: {'lr': 0.0004906310868880856, 'samples': 2868288, 'steps': 14938, 'loss/train': 1.6411389112472534} 11/06/2021 23:18:29 - INFO - __main__ - Step 14940: {'lr': 0.0004906296476698496, 'samples': 2868480, 'steps': 14939, 'loss/train': 1.2749041318893433} 11/06/2021 23:18:29 - INFO - __main__ - Step 14941: {'lr': 0.0004906282083431897, 'samples': 2868672, 'steps': 14940, 'loss/train': 1.5104907751083374} 11/06/2021 23:18:29 - INFO - __main__ - Step 14942: {'lr': 0.0004906267689081063, 'samples': 2868864, 'steps': 14941, 'loss/train': 1.9198617935180664} 11/06/2021 23:18:30 - INFO - __main__ - Step 14943: {'lr': 0.0004906253293646002, 'samples': 2869056, 'steps': 14942, 'loss/train': 1.7235310077667236} 11/06/2021 23:18:31 - INFO - __main__ - Step 14944: {'lr': 0.0004906238897126721, 'samples': 2869248, 'steps': 14943, 'loss/train': 1.3015567064285278} 11/06/2021 23:18:31 - INFO - __main__ - Step 14945: {'lr': 0.0004906224499523225, 'samples': 2869440, 'steps': 14944, 'loss/train': 1.208406686782837} 11/06/2021 23:18:31 - INFO - __main__ - Step 14946: {'lr': 0.0004906210100835522, 'samples': 2869632, 'steps': 14945, 'loss/train': 1.7185678482055664} 11/06/2021 23:18:32 - INFO - __main__ - Step 14947: {'lr': 0.0004906195701063617, 'samples': 2869824, 'steps': 14946, 'loss/train': 1.6271605491638184} 11/06/2021 23:18:33 - INFO - __main__ - Step 14948: {'lr': 0.0004906181300207518, 'samples': 2870016, 'steps': 14947, 'loss/train': 1.1316771507263184} 11/06/2021 23:18:33 - INFO - __main__ - Step 14949: {'lr': 0.0004906166898267231, 'samples': 2870208, 'steps': 14948, 'loss/train': 1.8571279048919678} 11/06/2021 23:18:34 - INFO - __main__ - Step 14950: {'lr': 0.0004906152495242763, 'samples': 2870400, 'steps': 14949, 'loss/train': 1.3722232580184937} 11/06/2021 23:18:34 - INFO - __main__ - Step 14951: {'lr': 0.0004906138091134118, 'samples': 2870592, 'steps': 14950, 'loss/train': 1.7835454940795898} 11/06/2021 23:18:34 - INFO - __main__ - Step 14952: {'lr': 0.0004906123685941306, 'samples': 2870784, 'steps': 14951, 'loss/train': 1.6602545976638794} 11/06/2021 23:18:35 - INFO - __main__ - Step 14953: {'lr': 0.000490610927966433, 'samples': 2870976, 'steps': 14952, 'loss/train': 1.557942509651184} 11/06/2021 23:18:36 - INFO - __main__ - Step 14954: {'lr': 0.00049060948723032, 'samples': 2871168, 'steps': 14953, 'loss/train': 1.7364459037780762} 11/06/2021 23:18:36 - INFO - __main__ - Step 14955: {'lr': 0.000490608046385792, 'samples': 2871360, 'steps': 14954, 'loss/train': 2.2763864994049072} 11/06/2021 23:18:36 - INFO - __main__ - Step 14956: {'lr': 0.0004906066054328498, 'samples': 2871552, 'steps': 14955, 'loss/train': 1.585302472114563} 11/06/2021 23:18:37 - INFO - __main__ - Step 14957: {'lr': 0.0004906051643714939, 'samples': 2871744, 'steps': 14956, 'loss/train': 1.91652250289917} 11/06/2021 23:18:38 - INFO - __main__ - Step 14958: {'lr': 0.000490603723201725, 'samples': 2871936, 'steps': 14957, 'loss/train': 3.7174441814422607} 11/06/2021 23:18:38 - INFO - __main__ - Step 14959: {'lr': 0.0004906022819235438, 'samples': 2872128, 'steps': 14958, 'loss/train': 1.5262293815612793} 11/06/2021 23:18:38 - INFO - __main__ - Step 14960: {'lr': 0.000490600840536951, 'samples': 2872320, 'steps': 14959, 'loss/train': 1.9472852945327759} 11/06/2021 23:18:39 - INFO - __main__ - Step 14961: {'lr': 0.0004905993990419471, 'samples': 2872512, 'steps': 14960, 'loss/train': 1.6929538249969482} 11/06/2021 23:18:39 - INFO - __main__ - Step 14962: {'lr': 0.0004905979574385328, 'samples': 2872704, 'steps': 14961, 'loss/train': 1.4080133438110352} 11/06/2021 23:18:39 - INFO - __main__ - Step 14963: {'lr': 0.0004905965157267088, 'samples': 2872896, 'steps': 14962, 'loss/train': 1.8805720806121826} 11/06/2021 23:18:41 - INFO - __main__ - Step 14964: {'lr': 0.0004905950739064758, 'samples': 2873088, 'steps': 14963, 'loss/train': 1.8650027513504028} 11/06/2021 23:18:41 - INFO - __main__ - Step 14965: {'lr': 0.0004905936319778343, 'samples': 2873280, 'steps': 14964, 'loss/train': 1.3455829620361328} 11/06/2021 23:18:41 - INFO - __main__ - Step 14966: {'lr': 0.000490592189940785, 'samples': 2873472, 'steps': 14965, 'loss/train': 1.7468905448913574} 11/06/2021 23:18:42 - INFO - __main__ - Step 14967: {'lr': 0.0004905907477953286, 'samples': 2873664, 'steps': 14966, 'loss/train': 1.8428308963775635} 11/06/2021 23:18:42 - INFO - __main__ - Step 14968: {'lr': 0.0004905893055414658, 'samples': 2873856, 'steps': 14967, 'loss/train': 1.5465810298919678} 11/06/2021 23:18:43 - INFO - __main__ - Step 14969: {'lr': 0.0004905878631791971, 'samples': 2874048, 'steps': 14968, 'loss/train': 1.9906973838806152} 11/06/2021 23:18:43 - INFO - __main__ - Step 14970: {'lr': 0.0004905864207085232, 'samples': 2874240, 'steps': 14969, 'loss/train': 1.7254399061203003} 11/06/2021 23:18:44 - INFO - __main__ - Step 14971: {'lr': 0.0004905849781294448, 'samples': 2874432, 'steps': 14970, 'loss/train': 1.8917148113250732} 11/06/2021 23:18:44 - INFO - __main__ - Step 14972: {'lr': 0.0004905835354419625, 'samples': 2874624, 'steps': 14971, 'loss/train': 1.7392559051513672} 11/06/2021 23:18:44 - INFO - __main__ - Step 14973: {'lr': 0.0004905820926460769, 'samples': 2874816, 'steps': 14972, 'loss/train': 1.6358309984207153} 11/06/2021 23:18:45 - INFO - __main__ - Step 14974: {'lr': 0.0004905806497417888, 'samples': 2875008, 'steps': 14973, 'loss/train': 1.6739000082015991} 11/06/2021 23:18:46 - INFO - __main__ - Step 14975: {'lr': 0.0004905792067290988, 'samples': 2875200, 'steps': 14974, 'loss/train': 1.421985387802124} 11/06/2021 23:18:46 - INFO - __main__ - Step 14976: {'lr': 0.0004905777636080075, 'samples': 2875392, 'steps': 14975, 'loss/train': 1.8452321290969849} 11/06/2021 23:18:47 - INFO - __main__ - Step 14977: {'lr': 0.0004905763203785157, 'samples': 2875584, 'steps': 14976, 'loss/train': 1.7335155010223389} 11/06/2021 23:18:47 - INFO - __main__ - Step 14978: {'lr': 0.0004905748770406237, 'samples': 2875776, 'steps': 14977, 'loss/train': 1.7326487302780151} 11/06/2021 23:18:47 - INFO - __main__ - Step 14979: {'lr': 0.0004905734335943325, 'samples': 2875968, 'steps': 14978, 'loss/train': 1.5935941934585571} 11/06/2021 23:18:48 - INFO - __main__ - Step 14980: {'lr': 0.0004905719900396426, 'samples': 2876160, 'steps': 14979, 'loss/train': 1.410271406173706} 11/06/2021 23:18:49 - INFO - __main__ - Step 14981: {'lr': 0.0004905705463765546, 'samples': 2876352, 'steps': 14980, 'loss/train': 1.5823677778244019} 11/06/2021 23:18:49 - INFO - __main__ - Step 14982: {'lr': 0.0004905691026050692, 'samples': 2876544, 'steps': 14981, 'loss/train': 1.2565804719924927} 11/06/2021 23:18:49 - INFO - __main__ - Step 14983: {'lr': 0.0004905676587251873, 'samples': 2876736, 'steps': 14982, 'loss/train': 1.1940447092056274} 11/06/2021 23:18:50 - INFO - __main__ - Step 14984: {'lr': 0.0004905662147369091, 'samples': 2876928, 'steps': 14983, 'loss/train': 2.0281472206115723} 11/06/2021 23:18:50 - INFO - __main__ - Step 14985: {'lr': 0.0004905647706402356, 'samples': 2877120, 'steps': 14984, 'loss/train': 1.4245983362197876} 11/06/2021 23:18:51 - INFO - __main__ - Step 14986: {'lr': 0.0004905633264351673, 'samples': 2877312, 'steps': 14985, 'loss/train': 1.687376856803894} 11/06/2021 23:18:51 - INFO - __main__ - Step 14987: {'lr': 0.0004905618821217048, 'samples': 2877504, 'steps': 14986, 'loss/train': 1.4128832817077637} 11/06/2021 23:18:52 - INFO - __main__ - Step 14988: {'lr': 0.0004905604376998489, 'samples': 2877696, 'steps': 14987, 'loss/train': 1.4120585918426514} 11/06/2021 23:18:52 - INFO - __main__ - Step 14989: {'lr': 0.0004905589931696002, 'samples': 2877888, 'steps': 14988, 'loss/train': 1.6565570831298828} 11/06/2021 23:18:53 - INFO - __main__ - Step 14990: {'lr': 0.0004905575485309593, 'samples': 2878080, 'steps': 14989, 'loss/train': 1.8112573623657227} 11/06/2021 23:18:54 - INFO - __main__ - Step 14991: {'lr': 0.0004905561037839269, 'samples': 2878272, 'steps': 14990, 'loss/train': 1.1115037202835083} 11/06/2021 23:18:54 - INFO - __main__ - Step 14992: {'lr': 0.0004905546589285036, 'samples': 2878464, 'steps': 14991, 'loss/train': 1.6998533010482788} 11/06/2021 23:18:54 - INFO - __main__ - Step 14993: {'lr': 0.0004905532139646901, 'samples': 2878656, 'steps': 14992, 'loss/train': 0.203787699341774} 11/06/2021 23:18:55 - INFO - __main__ - Step 14994: {'lr': 0.000490551768892487, 'samples': 2878848, 'steps': 14993, 'loss/train': 2.0352189540863037} 11/06/2021 23:18:55 - INFO - __main__ - Step 14995: {'lr': 0.000490550323711895, 'samples': 2879040, 'steps': 14994, 'loss/train': 1.996849536895752} 11/06/2021 23:18:56 - INFO - __main__ - Step 14996: {'lr': 0.0004905488784229147, 'samples': 2879232, 'steps': 14995, 'loss/train': 1.7386120557785034} 11/06/2021 23:18:57 - INFO - __main__ - Step 14997: {'lr': 0.000490547433025547, 'samples': 2879424, 'steps': 14996, 'loss/train': 1.7791500091552734} 11/06/2021 23:18:57 - INFO - __main__ - Step 14998: {'lr': 0.0004905459875197921, 'samples': 2879616, 'steps': 14997, 'loss/train': 1.2726280689239502} 11/06/2021 23:18:57 - INFO - __main__ - Step 14999: {'lr': 0.000490544541905651, 'samples': 2879808, 'steps': 14998, 'loss/train': 1.6460684537887573} 11/06/2021 23:18:58 - INFO - __main__ - Step 15000: {'lr': 0.0004905430961831242, 'samples': 2880000, 'steps': 14999, 'loss/train': 0.5816349983215332} 11/06/2021 23:18:58 - INFO - __main__ - Evaluating and saving model checkpoint 11/06/2021 23:22:11 - INFO - __main__ - Step 15000: {'loss/eval': 1.610079050064087, 'perplexity': 5.003206729888916} 11/06/2021 23:22:42 - WARNING - huggingface_hub.repository - remote: ---------------------------------------------------------- remote: Your push was accepted, but with warnings: remote: - warning : empty or missing yaml metadata in card (lvwerra/codeparrot-small) remote: help: please find help at https://huggingface.co/docs/hub/model-repos remote: ---------------------------------------------------------- remote: Please find the documentation at: remote: https://huggingface.co/docs/hub/model-repos(B remote: ---------------------------------------------------------- To https://huggingface.co/lvwerra/codeparrot-small * [new branch] proud-haze-135 -> proud-haze-135 11/06/2021 23:22:43 - INFO - __main__ - Step 15001: {'lr': 0.0004905416503522123, 'samples': 2880192, 'steps': 15000, 'loss/train': 1.465524435043335} 11/06/2021 23:22:44 - INFO - __main__ - Step 15002: {'lr': 0.0004905402044129162, 'samples': 2880384, 'steps': 15001, 'loss/train': 2.2312591075897217} 11/06/2021 23:22:44 - INFO - __main__ - Step 15003: {'lr': 0.0004905387583652363, 'samples': 2880576, 'steps': 15002, 'loss/train': 1.7260355949401855} 11/06/2021 23:22:44 - INFO - __main__ - Step 15004: {'lr': 0.0004905373122091734, 'samples': 2880768, 'steps': 15003, 'loss/train': 0.4969237148761749} 11/06/2021 23:22:45 - INFO - __main__ - Step 15005: {'lr': 0.0004905358659447281, 'samples': 2880960, 'steps': 15004, 'loss/train': 1.7188736200332642} 11/06/2021 23:22:45 - INFO - __main__ - Step 15006: {'lr': 0.000490534419571901, 'samples': 2881152, 'steps': 15005, 'loss/train': 1.781702995300293} 11/06/2021 23:22:46 - INFO - __main__ - Step 15007: {'lr': 0.0004905329730906929, 'samples': 2881344, 'steps': 15006, 'loss/train': 1.1889454126358032} 11/06/2021 23:22:46 - INFO - __main__ - Step 15008: {'lr': 0.0004905315265011043, 'samples': 2881536, 'steps': 15007, 'loss/train': 1.3070452213287354} 11/06/2021 23:22:47 - INFO - __main__ - Step 15009: {'lr': 0.0004905300798031359, 'samples': 2881728, 'steps': 15008, 'loss/train': 2.0110666751861572} 11/06/2021 23:22:47 - INFO - __main__ - Step 15010: {'lr': 0.0004905286329967883, 'samples': 2881920, 'steps': 15009, 'loss/train': 1.7228916883468628} 11/06/2021 23:22:48 - INFO - __main__ - Step 15011: {'lr': 0.0004905271860820622, 'samples': 2882112, 'steps': 15010, 'loss/train': 1.8122127056121826} 11/06/2021 23:22:49 - INFO - __main__ - Step 15012: {'lr': 0.0004905257390589585, 'samples': 2882304, 'steps': 15011, 'loss/train': 1.7922598123550415} 11/06/2021 23:22:49 - INFO - __main__ - Step 15013: {'lr': 0.0004905242919274774, 'samples': 2882496, 'steps': 15012, 'loss/train': 1.3931684494018555} 11/06/2021 23:22:49 - INFO - __main__ - Step 15014: {'lr': 0.0004905228446876197, 'samples': 2882688, 'steps': 15013, 'loss/train': 1.860660195350647} 11/06/2021 23:22:50 - INFO - __main__ - Step 15015: {'lr': 0.0004905213973393863, 'samples': 2882880, 'steps': 15014, 'loss/train': 2.3514349460601807} 11/06/2021 23:22:50 - INFO - __main__ - Step 15016: {'lr': 0.0004905199498827776, 'samples': 2883072, 'steps': 15015, 'loss/train': 1.8527649641036987} 11/06/2021 23:22:51 - INFO - __main__ - Step 15017: {'lr': 0.0004905185023177942, 'samples': 2883264, 'steps': 15016, 'loss/train': 1.7189619541168213} 11/06/2021 23:22:51 - INFO - __main__ - Step 15018: {'lr': 0.0004905170546444371, 'samples': 2883456, 'steps': 15017, 'loss/train': 1.7839937210083008} 11/06/2021 23:22:52 - INFO - __main__ - Step 15019: {'lr': 0.0004905156068627065, 'samples': 2883648, 'steps': 15018, 'loss/train': 1.8625094890594482} 11/06/2021 23:22:52 - INFO - __main__ - Step 15020: {'lr': 0.0004905141589726035, 'samples': 2883840, 'steps': 15019, 'loss/train': 1.4955047369003296} 11/06/2021 23:22:52 - INFO - __main__ - Step 15021: {'lr': 0.0004905127109741284, 'samples': 2884032, 'steps': 15020, 'loss/train': 1.9690922498703003} 11/06/2021 23:22:54 - INFO - __main__ - Step 15022: {'lr': 0.000490511262867282, 'samples': 2884224, 'steps': 15021, 'loss/train': 1.777396559715271} 11/06/2021 23:22:54 - INFO - __main__ - Step 15023: {'lr': 0.000490509814652065, 'samples': 2884416, 'steps': 15022, 'loss/train': 2.3741092681884766} 11/06/2021 23:22:54 - INFO - __main__ - Step 15024: {'lr': 0.0004905083663284779, 'samples': 2884608, 'steps': 15023, 'loss/train': 1.2763044834136963} 11/06/2021 23:22:55 - INFO - __main__ - Step 15025: {'lr': 0.0004905069178965214, 'samples': 2884800, 'steps': 15024, 'loss/train': 1.7052578926086426} 11/06/2021 23:22:55 - INFO - __main__ - Step 15026: {'lr': 0.0004905054693561963, 'samples': 2884992, 'steps': 15025, 'loss/train': 1.5424304008483887} 11/06/2021 23:22:55 - INFO - __main__ - Step 15027: {'lr': 0.0004905040207075032, 'samples': 2885184, 'steps': 15026, 'loss/train': 1.7792967557907104} 11/06/2021 23:22:56 - INFO - __main__ - Step 15028: {'lr': 0.0004905025719504426, 'samples': 2885376, 'steps': 15027, 'loss/train': 2.0818092823028564} 11/06/2021 23:22:57 - INFO - __main__ - Step 15029: {'lr': 0.0004905011230850152, 'samples': 2885568, 'steps': 15028, 'loss/train': 1.9193378686904907} 11/06/2021 23:22:57 - INFO - __main__ - Step 15030: {'lr': 0.0004904996741112218, 'samples': 2885760, 'steps': 15029, 'loss/train': 1.4404072761535645} 11/06/2021 23:22:57 - INFO - __main__ - Step 15031: {'lr': 0.0004904982250290629, 'samples': 2885952, 'steps': 15030, 'loss/train': 1.7105932235717773} 11/06/2021 23:22:58 - INFO - __main__ - Step 15032: {'lr': 0.0004904967758385393, 'samples': 2886144, 'steps': 15031, 'loss/train': 1.7881110906600952} 11/06/2021 23:22:59 - INFO - __main__ - Step 15033: {'lr': 0.0004904953265396515, 'samples': 2886336, 'steps': 15032, 'loss/train': 1.9477510452270508} 11/06/2021 23:23:00 - INFO - __main__ - Step 15034: {'lr': 0.0004904938771324002, 'samples': 2886528, 'steps': 15033, 'loss/train': 0.2995307445526123} 11/06/2021 23:23:00 - INFO - __main__ - Step 15035: {'lr': 0.0004904924276167861, 'samples': 2886720, 'steps': 15034, 'loss/train': 1.766963243484497} 11/06/2021 23:23:00 - INFO - __main__ - Step 15036: {'lr': 0.0004904909779928099, 'samples': 2886912, 'steps': 15035, 'loss/train': 1.1945898532867432} 11/06/2021 23:23:01 - INFO - __main__ - Step 15037: {'lr': 0.000490489528260472, 'samples': 2887104, 'steps': 15036, 'loss/train': 1.4538508653640747} 11/06/2021 23:23:01 - INFO - __main__ - Step 15038: {'lr': 0.0004904880784197734, 'samples': 2887296, 'steps': 15037, 'loss/train': 1.6707526445388794} 11/06/2021 23:23:02 - INFO - __main__ - Step 15039: {'lr': 0.0004904866284707144, 'samples': 2887488, 'steps': 15038, 'loss/train': 1.706793189048767} 11/06/2021 23:23:02 - INFO - __main__ - Step 15040: {'lr': 0.000490485178413296, 'samples': 2887680, 'steps': 15039, 'loss/train': 1.7125017642974854} 11/06/2021 23:23:03 - INFO - __main__ - Step 15041: {'lr': 0.0004904837282475186, 'samples': 2887872, 'steps': 15040, 'loss/train': 1.610682487487793} 11/06/2021 23:23:03 - INFO - __main__ - Step 15042: {'lr': 0.000490482277973383, 'samples': 2888064, 'steps': 15041, 'loss/train': 2.108546495437622} 11/06/2021 23:23:03 - INFO - __main__ - Step 15043: {'lr': 0.0004904808275908898, 'samples': 2888256, 'steps': 15042, 'loss/train': 1.7590423822402954} 11/06/2021 23:23:05 - INFO - __main__ - Step 15044: {'lr': 0.0004904793771000396, 'samples': 2888448, 'steps': 15043, 'loss/train': 1.7148654460906982} 11/06/2021 23:23:05 - INFO - __main__ - Step 15045: {'lr': 0.0004904779265008331, 'samples': 2888640, 'steps': 15044, 'loss/train': 1.8898462057113647} 11/06/2021 23:23:05 - INFO - __main__ - Step 15046: {'lr': 0.000490476475793271, 'samples': 2888832, 'steps': 15045, 'loss/train': 0.4562050402164459} 11/06/2021 23:23:06 - INFO - __main__ - Step 15047: {'lr': 0.0004904750249773538, 'samples': 2889024, 'steps': 15046, 'loss/train': 1.8167861700057983} 11/06/2021 23:23:06 - INFO - __main__ - Step 15048: {'lr': 0.0004904735740530825, 'samples': 2889216, 'steps': 15047, 'loss/train': 1.7291828393936157} 11/06/2021 23:23:07 - INFO - __main__ - Step 15049: {'lr': 0.0004904721230204573, 'samples': 2889408, 'steps': 15048, 'loss/train': 1.8959709405899048} 11/06/2021 23:23:07 - INFO - __main__ - Step 15050: {'lr': 0.0004904706718794791, 'samples': 2889600, 'steps': 15049, 'loss/train': 1.5438092947006226} 11/06/2021 23:23:08 - INFO - __main__ - Step 15051: {'lr': 0.0004904692206301487, 'samples': 2889792, 'steps': 15050, 'loss/train': 1.4828404188156128} 11/06/2021 23:23:08 - INFO - __main__ - Step 15052: {'lr': 0.0004904677692724664, 'samples': 2889984, 'steps': 15051, 'loss/train': 1.3407013416290283} 11/06/2021 23:23:08 - INFO - __main__ - Step 15053: {'lr': 0.000490466317806433, 'samples': 2890176, 'steps': 15052, 'loss/train': 1.669877290725708} 11/06/2021 23:23:09 - INFO - __main__ - Step 15054: {'lr': 0.0004904648662320493, 'samples': 2890368, 'steps': 15053, 'loss/train': 1.7544317245483398} 11/06/2021 23:23:10 - INFO - __main__ - Step 15055: {'lr': 0.0004904634145493159, 'samples': 2890560, 'steps': 15054, 'loss/train': 1.4797533750534058} 11/06/2021 23:23:10 - INFO - __main__ - Step 15056: {'lr': 0.0004904619627582332, 'samples': 2890752, 'steps': 15055, 'loss/train': 1.6231729984283447} 11/06/2021 23:23:11 - INFO - __main__ - Step 15057: {'lr': 0.0004904605108588023, 'samples': 2890944, 'steps': 15056, 'loss/train': 1.5444527864456177} 11/06/2021 23:23:11 - INFO - __main__ - Step 15058: {'lr': 0.0004904590588510234, 'samples': 2891136, 'steps': 15057, 'loss/train': 2.30820631980896} 11/06/2021 23:23:11 - INFO - __main__ - Step 15059: {'lr': 0.0004904576067348975, 'samples': 2891328, 'steps': 15058, 'loss/train': 2.174032688140869} 11/06/2021 23:23:12 - INFO - __main__ - Step 15060: {'lr': 0.000490456154510425, 'samples': 2891520, 'steps': 15059, 'loss/train': 1.4963994026184082} 11/06/2021 23:23:13 - INFO - __main__ - Step 15061: {'lr': 0.0004904547021776067, 'samples': 2891712, 'steps': 15060, 'loss/train': 1.3455215692520142} 11/06/2021 23:23:13 - INFO - __main__ - Step 15062: {'lr': 0.0004904532497364432, 'samples': 2891904, 'steps': 15061, 'loss/train': 1.4568783044815063} 11/06/2021 23:23:13 - INFO - __main__ - Step 15063: {'lr': 0.0004904517971869352, 'samples': 2892096, 'steps': 15062, 'loss/train': 1.2581560611724854} 11/06/2021 23:23:14 - INFO - __main__ - Step 15064: {'lr': 0.0004904503445290833, 'samples': 2892288, 'steps': 15063, 'loss/train': 1.4970574378967285} 11/06/2021 23:23:15 - INFO - __main__ - Step 15065: {'lr': 0.0004904488917628882, 'samples': 2892480, 'steps': 15064, 'loss/train': 1.9729082584381104} 11/06/2021 23:23:15 - INFO - __main__ - Step 15066: {'lr': 0.0004904474388883507, 'samples': 2892672, 'steps': 15065, 'loss/train': 1.9454787969589233} 11/06/2021 23:23:15 - INFO - __main__ - Step 15067: {'lr': 0.000490445985905471, 'samples': 2892864, 'steps': 15066, 'loss/train': 1.946716547012329} 11/06/2021 23:23:16 - INFO - __main__ - Step 15068: {'lr': 0.0004904445328142503, 'samples': 2893056, 'steps': 15067, 'loss/train': 1.4555275440216064} 11/06/2021 23:23:16 - INFO - __main__ - Step 15069: {'lr': 0.0004904430796146889, 'samples': 2893248, 'steps': 15068, 'loss/train': 1.6384254693984985} 11/06/2021 23:23:17 - INFO - __main__ - Step 15070: {'lr': 0.0004904416263067876, 'samples': 2893440, 'steps': 15069, 'loss/train': 1.866790533065796} 11/06/2021 23:23:18 - INFO - __main__ - Step 15071: {'lr': 0.0004904401728905469, 'samples': 2893632, 'steps': 15070, 'loss/train': 1.6796128749847412} 11/06/2021 23:23:18 - INFO - __main__ - Step 15072: {'lr': 0.0004904387193659677, 'samples': 2893824, 'steps': 15071, 'loss/train': 1.5711565017700195} 11/06/2021 23:23:18 - INFO - __main__ - Step 15073: {'lr': 0.0004904372657330504, 'samples': 2894016, 'steps': 15072, 'loss/train': 1.440606713294983} 11/06/2021 23:23:19 - INFO - __main__ - Step 15074: {'lr': 0.0004904358119917959, 'samples': 2894208, 'steps': 15073, 'loss/train': 2.0183684825897217} 11/06/2021 23:23:20 - INFO - __main__ - Step 15075: {'lr': 0.0004904343581422047, 'samples': 2894400, 'steps': 15074, 'loss/train': 1.6127521991729736} 11/06/2021 23:23:20 - INFO - __main__ - Step 15076: {'lr': 0.0004904329041842774, 'samples': 2894592, 'steps': 15075, 'loss/train': 1.8341926336288452} 11/06/2021 23:23:20 - INFO - __main__ - Step 15077: {'lr': 0.0004904314501180148, 'samples': 2894784, 'steps': 15076, 'loss/train': 1.8274245262145996} 11/06/2021 23:23:21 - INFO - __main__ - Step 15078: {'lr': 0.0004904299959434175, 'samples': 2894976, 'steps': 15077, 'loss/train': 1.727868676185608} 11/06/2021 23:23:21 - INFO - __main__ - Step 15079: {'lr': 0.0004904285416604862, 'samples': 2895168, 'steps': 15078, 'loss/train': 1.6105992794036865} 11/06/2021 23:23:21 - INFO - __main__ - Step 15080: {'lr': 0.0004904270872692215, 'samples': 2895360, 'steps': 15079, 'loss/train': 1.6114802360534668} 11/06/2021 23:23:23 - INFO - __main__ - Step 15081: {'lr': 0.0004904256327696241, 'samples': 2895552, 'steps': 15080, 'loss/train': 0.8484119772911072} 11/06/2021 23:23:23 - INFO - __main__ - Step 15082: {'lr': 0.0004904241781616945, 'samples': 2895744, 'steps': 15081, 'loss/train': 1.8067805767059326} 11/06/2021 23:23:24 - INFO - __main__ - Step 15083: {'lr': 0.0004904227234454335, 'samples': 2895936, 'steps': 15082, 'loss/train': 1.6496418714523315} 11/06/2021 23:23:24 - INFO - __main__ - Step 15084: {'lr': 0.0004904212686208418, 'samples': 2896128, 'steps': 15083, 'loss/train': 1.4202336072921753} 11/06/2021 23:23:24 - INFO - __main__ - Step 15085: {'lr': 0.00049041981368792, 'samples': 2896320, 'steps': 15084, 'loss/train': 1.955782175064087} 11/06/2021 23:23:25 - INFO - __main__ - Step 15086: {'lr': 0.0004904183586466686, 'samples': 2896512, 'steps': 15085, 'loss/train': 0.5780248641967773} 11/06/2021 23:23:26 - INFO - __main__ - Step 15087: {'lr': 0.0004904169034970885, 'samples': 2896704, 'steps': 15086, 'loss/train': 1.998367190361023} 11/06/2021 23:23:26 - INFO - __main__ - Step 15088: {'lr': 0.0004904154482391803, 'samples': 2896896, 'steps': 15087, 'loss/train': 2.0256190299987793} 11/06/2021 23:23:26 - INFO - __main__ - Step 15089: {'lr': 0.0004904139928729445, 'samples': 2897088, 'steps': 15088, 'loss/train': 1.4871602058410645} 11/06/2021 23:23:27 - INFO - __main__ - Step 15090: {'lr': 0.0004904125373983819, 'samples': 2897280, 'steps': 15089, 'loss/train': 1.7294107675552368} 11/06/2021 23:23:28 - INFO - __main__ - Step 15091: {'lr': 0.0004904110818154931, 'samples': 2897472, 'steps': 15090, 'loss/train': 1.508729100227356} 11/06/2021 23:23:28 - INFO - __main__ - Step 15092: {'lr': 0.0004904096261242789, 'samples': 2897664, 'steps': 15091, 'loss/train': 1.496836543083191} 11/06/2021 23:23:28 - INFO - __main__ - Step 15093: {'lr': 0.0004904081703247397, 'samples': 2897856, 'steps': 15092, 'loss/train': 1.4800703525543213} 11/06/2021 23:23:29 - INFO - __main__ - Step 15094: {'lr': 0.0004904067144168763, 'samples': 2898048, 'steps': 15093, 'loss/train': 1.7554247379302979} 11/06/2021 23:23:29 - INFO - __main__ - Step 15095: {'lr': 0.0004904052584006895, 'samples': 2898240, 'steps': 15094, 'loss/train': 1.2802367210388184} 11/06/2021 23:23:30 - INFO - __main__ - Step 15096: {'lr': 0.0004904038022761797, 'samples': 2898432, 'steps': 15095, 'loss/train': 2.180551767349243} 11/06/2021 23:23:31 - INFO - __main__ - Step 15097: {'lr': 0.0004904023460433475, 'samples': 2898624, 'steps': 15096, 'loss/train': 1.744247555732727} 11/06/2021 23:23:31 - INFO - __main__ - Step 15098: {'lr': 0.0004904008897021939, 'samples': 2898816, 'steps': 15097, 'loss/train': 1.8757344484329224} 11/06/2021 23:23:31 - INFO - __main__ - Step 15099: {'lr': 0.0004903994332527193, 'samples': 2899008, 'steps': 15098, 'loss/train': 1.5103914737701416} 11/06/2021 23:23:32 - INFO - __main__ - Step 15100: {'lr': 0.0004903979766949244, 'samples': 2899200, 'steps': 15099, 'loss/train': 1.7478224039077759} 11/06/2021 23:23:32 - INFO - __main__ - Step 15101: {'lr': 0.00049039652002881, 'samples': 2899392, 'steps': 15100, 'loss/train': 1.6020162105560303} 11/06/2021 23:23:33 - INFO - __main__ - Step 15102: {'lr': 0.0004903950632543766, 'samples': 2899584, 'steps': 15101, 'loss/train': 1.921471357345581} 11/06/2021 23:23:33 - INFO - __main__ - Step 15103: {'lr': 0.0004903936063716248, 'samples': 2899776, 'steps': 15102, 'loss/train': 1.7595106363296509} 11/06/2021 23:23:34 - INFO - __main__ - Step 15104: {'lr': 0.0004903921493805554, 'samples': 2899968, 'steps': 15103, 'loss/train': 1.4765055179595947} 11/06/2021 23:23:34 - INFO - __main__ - Step 15105: {'lr': 0.000490390692281169, 'samples': 2900160, 'steps': 15104, 'loss/train': 1.2814937829971313} 11/06/2021 23:23:34 - INFO - __main__ - Step 15106: {'lr': 0.0004903892350734663, 'samples': 2900352, 'steps': 15105, 'loss/train': 1.4163860082626343} 11/06/2021 23:23:36 - INFO - __main__ - Step 15107: {'lr': 0.0004903877777574479, 'samples': 2900544, 'steps': 15106, 'loss/train': 1.5000590085983276} 11/06/2021 23:23:36 - INFO - __main__ - Step 15108: {'lr': 0.0004903863203331145, 'samples': 2900736, 'steps': 15107, 'loss/train': 1.54270339012146} 11/06/2021 23:23:36 - INFO - __main__ - Step 15109: {'lr': 0.0004903848628004667, 'samples': 2900928, 'steps': 15108, 'loss/train': 1.929103136062622} 11/06/2021 23:23:37 - INFO - __main__ - Step 15110: {'lr': 0.0004903834051595052, 'samples': 2901120, 'steps': 15109, 'loss/train': 1.6631858348846436} 11/06/2021 23:23:37 - INFO - __main__ - Step 15111: {'lr': 0.0004903819474102306, 'samples': 2901312, 'steps': 15110, 'loss/train': 1.7396637201309204} 11/06/2021 23:23:38 - INFO - __main__ - Step 15112: {'lr': 0.0004903804895526437, 'samples': 2901504, 'steps': 15111, 'loss/train': 1.462821125984192} 11/06/2021 23:23:38 - INFO - __main__ - Step 15113: {'lr': 0.0004903790315867449, 'samples': 2901696, 'steps': 15112, 'loss/train': 1.5667641162872314} 11/06/2021 23:23:39 - INFO - __main__ - Step 15114: {'lr': 0.0004903775735125352, 'samples': 2901888, 'steps': 15113, 'loss/train': 1.7718415260314941} 11/06/2021 23:23:39 - INFO - __main__ - Step 15115: {'lr': 0.0004903761153300149, 'samples': 2902080, 'steps': 15114, 'loss/train': 1.5683375597000122} 11/06/2021 23:23:39 - INFO - __main__ - Step 15116: {'lr': 0.000490374657039185, 'samples': 2902272, 'steps': 15115, 'loss/train': 1.12655770778656} 11/06/2021 23:23:40 - INFO - __main__ - Step 15117: {'lr': 0.0004903731986400459, 'samples': 2902464, 'steps': 15116, 'loss/train': 1.7532262802124023} 11/06/2021 23:23:41 - INFO - __main__ - Step 15118: {'lr': 0.0004903717401325983, 'samples': 2902656, 'steps': 15117, 'loss/train': 1.716330885887146} 11/06/2021 23:23:41 - INFO - __main__ - Step 15119: {'lr': 0.000490370281516843, 'samples': 2902848, 'steps': 15118, 'loss/train': 1.9191677570343018} 11/06/2021 23:23:42 - INFO - __main__ - Step 15120: {'lr': 0.0004903688227927806, 'samples': 2903040, 'steps': 15119, 'loss/train': 2.0227084159851074} 11/06/2021 23:23:42 - INFO - __main__ - Step 15121: {'lr': 0.0004903673639604116, 'samples': 2903232, 'steps': 15120, 'loss/train': 1.3265833854675293} 11/06/2021 23:23:43 - INFO - __main__ - Step 15122: {'lr': 0.0004903659050197369, 'samples': 2903424, 'steps': 15121, 'loss/train': 1.751603364944458} 11/06/2021 23:23:43 - INFO - __main__ - Step 15123: {'lr': 0.0004903644459707569, 'samples': 2903616, 'steps': 15122, 'loss/train': 1.6488754749298096} 11/06/2021 23:23:44 - INFO - __main__ - Step 15124: {'lr': 0.0004903629868134725, 'samples': 2903808, 'steps': 15123, 'loss/train': 1.1973127126693726} 11/06/2021 23:23:44 - INFO - __main__ - Step 15125: {'lr': 0.0004903615275478841, 'samples': 2904000, 'steps': 15124, 'loss/train': 1.4743258953094482} 11/06/2021 23:23:44 - INFO - __main__ - Step 15126: {'lr': 0.0004903600681739926, 'samples': 2904192, 'steps': 15125, 'loss/train': 1.5869134664535522} 11/06/2021 23:23:45 - INFO - __main__ - Step 15127: {'lr': 0.0004903586086917986, 'samples': 2904384, 'steps': 15126, 'loss/train': 1.4058135747909546} 11/06/2021 23:23:46 - INFO - __main__ - Step 15128: {'lr': 0.0004903571491013027, 'samples': 2904576, 'steps': 15127, 'loss/train': 0.8983421325683594} 11/06/2021 23:23:46 - INFO - __main__ - Step 15129: {'lr': 0.0004903556894025055, 'samples': 2904768, 'steps': 15128, 'loss/train': 1.5951757431030273} 11/06/2021 23:23:46 - INFO - __main__ - Step 15130: {'lr': 0.0004903542295954077, 'samples': 2904960, 'steps': 15129, 'loss/train': 1.553074836730957} 11/06/2021 23:23:47 - INFO - __main__ - Step 15131: {'lr': 0.0004903527696800102, 'samples': 2905152, 'steps': 15130, 'loss/train': 1.154365062713623} 11/06/2021 23:23:47 - INFO - __main__ - Step 15132: {'lr': 0.0004903513096563133, 'samples': 2905344, 'steps': 15131, 'loss/train': 1.9353693723678589} 11/06/2021 23:23:48 - INFO - __main__ - Step 15133: {'lr': 0.0004903498495243178, 'samples': 2905536, 'steps': 15132, 'loss/train': 2.002183675765991} 11/06/2021 23:23:49 - INFO - __main__ - Step 15134: {'lr': 0.0004903483892840244, 'samples': 2905728, 'steps': 15133, 'loss/train': 1.6886216402053833} 11/06/2021 23:23:49 - INFO - __main__ - Step 15135: {'lr': 0.0004903469289354338, 'samples': 2905920, 'steps': 15134, 'loss/train': 1.5800846815109253} 11/06/2021 23:23:49 - INFO - __main__ - Step 15136: {'lr': 0.0004903454684785465, 'samples': 2906112, 'steps': 15135, 'loss/train': 1.4386088848114014} 11/06/2021 23:23:50 - INFO - __main__ - Step 15137: {'lr': 0.0004903440079133633, 'samples': 2906304, 'steps': 15136, 'loss/train': 1.7239024639129639} 11/06/2021 23:23:51 - INFO - __main__ - Step 15138: {'lr': 0.0004903425472398846, 'samples': 2906496, 'steps': 15137, 'loss/train': 1.53848135471344} 11/06/2021 23:23:51 - INFO - __main__ - Step 15139: {'lr': 0.0004903410864581115, 'samples': 2906688, 'steps': 15138, 'loss/train': 2.4064629077911377} 11/06/2021 23:23:52 - INFO - __main__ - Step 15140: {'lr': 0.0004903396255680443, 'samples': 2906880, 'steps': 15139, 'loss/train': 1.433778166770935} 11/06/2021 23:23:52 - INFO - __main__ - Step 15141: {'lr': 0.0004903381645696838, 'samples': 2907072, 'steps': 15140, 'loss/train': 1.3583641052246094} 11/06/2021 23:23:52 - INFO - __main__ - Step 15142: {'lr': 0.0004903367034630307, 'samples': 2907264, 'steps': 15141, 'loss/train': 2.072575092315674} 11/06/2021 23:23:53 - INFO - __main__ - Step 15143: {'lr': 0.0004903352422480855, 'samples': 2907456, 'steps': 15142, 'loss/train': 1.4871820211410522} 11/06/2021 23:23:54 - INFO - __main__ - Step 15144: {'lr': 0.000490333780924849, 'samples': 2907648, 'steps': 15143, 'loss/train': 2.0281856060028076} 11/06/2021 23:23:54 - INFO - __main__ - Step 15145: {'lr': 0.0004903323194933218, 'samples': 2907840, 'steps': 15144, 'loss/train': 1.3873077630996704} 11/06/2021 23:23:54 - INFO - __main__ - Step 15146: {'lr': 0.0004903308579535045, 'samples': 2908032, 'steps': 15145, 'loss/train': 1.6294296979904175} 11/06/2021 23:23:55 - INFO - __main__ - Step 15147: {'lr': 0.0004903293963053979, 'samples': 2908224, 'steps': 15146, 'loss/train': 1.5592012405395508} 11/06/2021 23:23:56 - INFO - __main__ - Step 15148: {'lr': 0.0004903279345490026, 'samples': 2908416, 'steps': 15147, 'loss/train': 1.804236650466919} 11/06/2021 23:23:56 - INFO - __main__ - Step 15149: {'lr': 0.0004903264726843191, 'samples': 2908608, 'steps': 15148, 'loss/train': 1.6943762302398682} 11/06/2021 23:23:57 - INFO - __main__ - Step 15150: {'lr': 0.0004903250107113483, 'samples': 2908800, 'steps': 15149, 'loss/train': 1.375742793083191} 11/06/2021 23:23:57 - INFO - __main__ - Step 15151: {'lr': 0.0004903235486300908, 'samples': 2908992, 'steps': 15150, 'loss/train': 1.674008846282959} 11/06/2021 23:23:57 - INFO - __main__ - Step 15152: {'lr': 0.0004903220864405471, 'samples': 2909184, 'steps': 15151, 'loss/train': 1.1988589763641357} 11/06/2021 23:23:58 - INFO - __main__ - Step 15153: {'lr': 0.000490320624142718, 'samples': 2909376, 'steps': 15152, 'loss/train': 1.791925072669983} 11/06/2021 23:23:59 - INFO - __main__ - Step 15154: {'lr': 0.0004903191617366043, 'samples': 2909568, 'steps': 15153, 'loss/train': 1.7116177082061768} 11/06/2021 23:23:59 - INFO - __main__ - Step 15155: {'lr': 0.0004903176992222063, 'samples': 2909760, 'steps': 15154, 'loss/train': 1.5216213464736938} 11/06/2021 23:23:59 - INFO - __main__ - Step 15156: {'lr': 0.000490316236599525, 'samples': 2909952, 'steps': 15155, 'loss/train': 1.7094593048095703} 11/06/2021 23:24:00 - INFO - __main__ - Step 15157: {'lr': 0.0004903147738685609, 'samples': 2910144, 'steps': 15156, 'loss/train': 1.712797999382019} 11/06/2021 23:24:00 - INFO - __main__ - Step 15158: {'lr': 0.0004903133110293145, 'samples': 2910336, 'steps': 15157, 'loss/train': 0.5006943941116333} 11/06/2021 23:24:01 - INFO - __main__ - Step 15159: {'lr': 0.0004903118480817868, 'samples': 2910528, 'steps': 15158, 'loss/train': 1.5716854333877563} 11/06/2021 23:24:01 - INFO - __main__ - Step 15160: {'lr': 0.0004903103850259781, 'samples': 2910720, 'steps': 15159, 'loss/train': 1.470730185508728} 11/06/2021 23:24:02 - INFO - __main__ - Step 15161: {'lr': 0.0004903089218618895, 'samples': 2910912, 'steps': 15160, 'loss/train': 1.3954046964645386} 11/06/2021 23:24:02 - INFO - __main__ - Step 15162: {'lr': 0.0004903074585895212, 'samples': 2911104, 'steps': 15161, 'loss/train': 1.755196213722229} 11/06/2021 23:24:02 - INFO - __main__ - Step 15163: {'lr': 0.0004903059952088742, 'samples': 2911296, 'steps': 15162, 'loss/train': 1.6570621728897095} 11/06/2021 23:24:03 - INFO - __main__ - Step 15164: {'lr': 0.0004903045317199489, 'samples': 2911488, 'steps': 15163, 'loss/train': 1.5844322443008423} 11/06/2021 23:24:04 - INFO - __main__ - Step 15165: {'lr': 0.0004903030681227463, 'samples': 2911680, 'steps': 15164, 'loss/train': 1.630523681640625} 11/06/2021 23:24:04 - INFO - __main__ - Step 15166: {'lr': 0.0004903016044172666, 'samples': 2911872, 'steps': 15165, 'loss/train': 1.6046708822250366} 11/06/2021 23:24:04 - INFO - __main__ - Step 15167: {'lr': 0.0004903001406035109, 'samples': 2912064, 'steps': 15166, 'loss/train': 1.7047207355499268} 11/06/2021 23:24:05 - INFO - __main__ - Step 15168: {'lr': 0.0004902986766814795, 'samples': 2912256, 'steps': 15167, 'loss/train': 1.479922890663147} 11/06/2021 23:24:06 - INFO - __main__ - Step 15169: {'lr': 0.0004902972126511734, 'samples': 2912448, 'steps': 15168, 'loss/train': 1.8230383396148682} 11/06/2021 23:24:06 - INFO - __main__ - Step 15170: {'lr': 0.0004902957485125929, 'samples': 2912640, 'steps': 15169, 'loss/train': 1.3802613019943237} 11/06/2021 23:24:07 - INFO - __main__ - Step 15171: {'lr': 0.0004902942842657389, 'samples': 2912832, 'steps': 15170, 'loss/train': 1.3913418054580688} 11/06/2021 23:24:07 - INFO - __main__ - Step 15172: {'lr': 0.0004902928199106121, 'samples': 2913024, 'steps': 15171, 'loss/train': 1.4685401916503906} 11/06/2021 23:24:07 - INFO - __main__ - Step 15173: {'lr': 0.000490291355447213, 'samples': 2913216, 'steps': 15172, 'loss/train': 1.9173450469970703} 11/06/2021 23:24:08 - INFO - __main__ - Step 15174: {'lr': 0.0004902898908755424, 'samples': 2913408, 'steps': 15173, 'loss/train': 3.7974798679351807} 11/06/2021 23:24:09 - INFO - __main__ - Step 15175: {'lr': 0.0004902884261956007, 'samples': 2913600, 'steps': 15174, 'loss/train': 1.4008654356002808} 11/06/2021 23:24:09 - INFO - __main__ - Step 15176: {'lr': 0.0004902869614073889, 'samples': 2913792, 'steps': 15175, 'loss/train': 1.5890427827835083} 11/06/2021 23:24:09 - INFO - __main__ - Step 15177: {'lr': 0.0004902854965109074, 'samples': 2913984, 'steps': 15176, 'loss/train': 1.566736102104187} 11/06/2021 23:24:10 - INFO - __main__ - Step 15178: {'lr': 0.0004902840315061571, 'samples': 2914176, 'steps': 15177, 'loss/train': 1.907994270324707} 11/06/2021 23:24:10 - INFO - __main__ - Step 15179: {'lr': 0.0004902825663931384, 'samples': 2914368, 'steps': 15178, 'loss/train': 1.6074315309524536} 11/06/2021 23:24:11 - INFO - __main__ - Step 15180: {'lr': 0.0004902811011718521, 'samples': 2914560, 'steps': 15179, 'loss/train': 1.366045355796814} 11/06/2021 23:24:12 - INFO - __main__ - Step 15181: {'lr': 0.0004902796358422989, 'samples': 2914752, 'steps': 15180, 'loss/train': 1.674401044845581} 11/06/2021 23:24:12 - INFO - __main__ - Step 15182: {'lr': 0.0004902781704044793, 'samples': 2914944, 'steps': 15181, 'loss/train': 1.410227656364441} 11/06/2021 23:24:12 - INFO - __main__ - Step 15183: {'lr': 0.0004902767048583942, 'samples': 2915136, 'steps': 15182, 'loss/train': 1.924155354499817} 11/06/2021 23:24:13 - INFO - __main__ - Step 15184: {'lr': 0.000490275239204044, 'samples': 2915328, 'steps': 15183, 'loss/train': 1.9920772314071655} 11/06/2021 23:24:14 - INFO - __main__ - Step 15185: {'lr': 0.0004902737734414296, 'samples': 2915520, 'steps': 15184, 'loss/train': 1.6780331134796143} 11/06/2021 23:24:14 - INFO - __main__ - Step 15186: {'lr': 0.0004902723075705514, 'samples': 2915712, 'steps': 15185, 'loss/train': 1.4955074787139893} 11/06/2021 23:24:14 - INFO - __main__ - Step 15187: {'lr': 0.0004902708415914103, 'samples': 2915904, 'steps': 15186, 'loss/train': 1.7642159461975098} 11/06/2021 23:24:15 - INFO - __main__ - Step 15188: {'lr': 0.0004902693755040069, 'samples': 2916096, 'steps': 15187, 'loss/train': 1.1258271932601929} 11/06/2021 23:24:15 - INFO - __main__ - Step 15189: {'lr': 0.0004902679093083418, 'samples': 2916288, 'steps': 15188, 'loss/train': 1.6583342552185059} 11/06/2021 23:24:16 - INFO - __main__ - Step 15190: {'lr': 0.0004902664430044156, 'samples': 2916480, 'steps': 15189, 'loss/train': 1.885223388671875} 11/06/2021 23:24:16 - INFO - __main__ - Step 15191: {'lr': 0.0004902649765922292, 'samples': 2916672, 'steps': 15190, 'loss/train': 1.5591681003570557} 11/06/2021 23:24:17 - INFO - __main__ - Step 15192: {'lr': 0.0004902635100717831, 'samples': 2916864, 'steps': 15191, 'loss/train': 1.050796627998352} 11/06/2021 23:24:17 - INFO - __main__ - Step 15193: {'lr': 0.0004902620434430778, 'samples': 2917056, 'steps': 15192, 'loss/train': 0.23766328394412994} 11/06/2021 23:24:17 - INFO - __main__ - Step 15194: {'lr': 0.0004902605767061142, 'samples': 2917248, 'steps': 15193, 'loss/train': 1.7011011838912964} 11/06/2021 23:24:19 - INFO - __main__ - Step 15195: {'lr': 0.000490259109860893, 'samples': 2917440, 'steps': 15194, 'loss/train': 1.8796442747116089} 11/06/2021 23:24:20 - INFO - __main__ - Step 15196: {'lr': 0.0004902576429074146, 'samples': 2917632, 'steps': 15195, 'loss/train': 1.5257242918014526} 11/06/2021 23:24:20 - INFO - __main__ - Step 15197: {'lr': 0.0004902561758456799, 'samples': 2917824, 'steps': 15196, 'loss/train': 1.9276056289672852} 11/06/2021 23:24:20 - INFO - __main__ - Step 15198: {'lr': 0.0004902547086756895, 'samples': 2918016, 'steps': 15197, 'loss/train': 1.3162223100662231} 11/06/2021 23:24:21 - INFO - __main__ - Step 15199: {'lr': 0.000490253241397444, 'samples': 2918208, 'steps': 15198, 'loss/train': 1.277166724205017} 11/06/2021 23:24:21 - INFO - __main__ - Step 15200: {'lr': 0.0004902517740109441, 'samples': 2918400, 'steps': 15199, 'loss/train': 1.857283592224121} 11/06/2021 23:24:22 - INFO - __main__ - Step 15201: {'lr': 0.0004902503065161905, 'samples': 2918592, 'steps': 15200, 'loss/train': 2.3311972618103027} 11/06/2021 23:24:23 - INFO - __main__ - Step 15202: {'lr': 0.0004902488389131837, 'samples': 2918784, 'steps': 15201, 'loss/train': 1.7067315578460693} 11/06/2021 23:24:23 - INFO - __main__ - Step 15203: {'lr': 0.0004902473712019246, 'samples': 2918976, 'steps': 15202, 'loss/train': 1.788569450378418} 11/06/2021 23:24:23 - INFO - __main__ - Step 15204: {'lr': 0.0004902459033824137, 'samples': 2919168, 'steps': 15203, 'loss/train': 1.7561739683151245} 11/06/2021 23:24:24 - INFO - __main__ - Step 15205: {'lr': 0.0004902444354546516, 'samples': 2919360, 'steps': 15204, 'loss/train': 1.9418166875839233} 11/06/2021 23:24:24 - INFO - __main__ - Step 15206: {'lr': 0.0004902429674186392, 'samples': 2919552, 'steps': 15205, 'loss/train': 1.4655617475509644} 11/06/2021 23:24:25 - INFO - __main__ - Step 15207: {'lr': 0.000490241499274377, 'samples': 2919744, 'steps': 15206, 'loss/train': 1.264890193939209} 11/06/2021 23:24:26 - INFO - __main__ - Step 15208: {'lr': 0.0004902400310218657, 'samples': 2919936, 'steps': 15207, 'loss/train': 1.5054603815078735} 11/06/2021 23:24:26 - INFO - __main__ - Step 15209: {'lr': 0.0004902385626611059, 'samples': 2920128, 'steps': 15208, 'loss/train': 1.2494571208953857} 11/06/2021 23:24:26 - INFO - __main__ - Step 15210: {'lr': 0.0004902370941920984, 'samples': 2920320, 'steps': 15209, 'loss/train': 1.8583664894104004} 11/06/2021 23:24:27 - INFO - __main__ - Step 15211: {'lr': 0.0004902356256148437, 'samples': 2920512, 'steps': 15210, 'loss/train': 1.6945366859436035} 11/06/2021 23:24:27 - INFO - __main__ - Step 15212: {'lr': 0.0004902341569293425, 'samples': 2920704, 'steps': 15211, 'loss/train': 1.7645387649536133} 11/06/2021 23:24:28 - INFO - __main__ - Step 15213: {'lr': 0.0004902326881355955, 'samples': 2920896, 'steps': 15212, 'loss/train': 1.8450555801391602} 11/06/2021 23:24:28 - INFO - __main__ - Step 15214: {'lr': 0.0004902312192336034, 'samples': 2921088, 'steps': 15213, 'loss/train': 2.3038129806518555} 11/06/2021 23:24:29 - INFO - __main__ - Step 15215: {'lr': 0.000490229750223367, 'samples': 2921280, 'steps': 15214, 'loss/train': 0.8343202471733093} 11/06/2021 23:24:29 - INFO - __main__ - Step 15216: {'lr': 0.0004902282811048864, 'samples': 2921472, 'steps': 15215, 'loss/train': 1.9115432500839233} 11/06/2021 23:24:29 - INFO - __main__ - Step 15217: {'lr': 0.000490226811878163, 'samples': 2921664, 'steps': 15216, 'loss/train': 1.8696986436843872} 11/06/2021 23:24:30 - INFO - __main__ - Step 15218: {'lr': 0.0004902253425431969, 'samples': 2921856, 'steps': 15217, 'loss/train': 1.5307645797729492} 11/06/2021 23:24:31 - INFO - __main__ - Step 15219: {'lr': 0.000490223873099989, 'samples': 2922048, 'steps': 15218, 'loss/train': 1.9581738710403442} 11/06/2021 23:24:31 - INFO - __main__ - Step 15220: {'lr': 0.00049022240354854, 'samples': 2922240, 'steps': 15219, 'loss/train': 3.2477519512176514} 11/06/2021 23:24:32 - INFO - __main__ - Step 15221: {'lr': 0.0004902209338888503, 'samples': 2922432, 'steps': 15220, 'loss/train': 2.0752274990081787} 11/06/2021 23:24:32 - INFO - __main__ - Step 15222: {'lr': 0.000490219464120921, 'samples': 2922624, 'steps': 15221, 'loss/train': 1.8632875680923462} 11/06/2021 23:24:33 - INFO - __main__ - Step 15223: {'lr': 0.0004902179942447524, 'samples': 2922816, 'steps': 15222, 'loss/train': 1.554335117340088} 11/06/2021 23:24:33 - INFO - __main__ - Step 15224: {'lr': 0.0004902165242603452, 'samples': 2923008, 'steps': 15223, 'loss/train': 1.9520310163497925} 11/06/2021 23:24:34 - INFO - __main__ - Step 15225: {'lr': 0.0004902150541677003, 'samples': 2923200, 'steps': 15224, 'loss/train': 2.0851187705993652} 11/06/2021 23:24:34 - INFO - __main__ - Step 15226: {'lr': 0.0004902135839668181, 'samples': 2923392, 'steps': 15225, 'loss/train': 2.010134220123291} 11/06/2021 23:24:34 - INFO - __main__ - Step 15227: {'lr': 0.0004902121136576994, 'samples': 2923584, 'steps': 15226, 'loss/train': 1.7734330892562866} 11/06/2021 23:24:35 - INFO - __main__ - Step 15228: {'lr': 0.0004902106432403448, 'samples': 2923776, 'steps': 15227, 'loss/train': 1.354243516921997} 11/06/2021 23:24:36 - INFO - __main__ - Step 15229: {'lr': 0.0004902091727147551, 'samples': 2923968, 'steps': 15228, 'loss/train': 1.7039768695831299} 11/06/2021 23:24:36 - INFO - __main__ - Step 15230: {'lr': 0.0004902077020809307, 'samples': 2924160, 'steps': 15229, 'loss/train': 1.8697679042816162} 11/06/2021 23:24:36 - INFO - __main__ - Step 15231: {'lr': 0.0004902062313388725, 'samples': 2924352, 'steps': 15230, 'loss/train': 1.9331737756729126} 11/06/2021 23:24:37 - INFO - __main__ - Step 15232: {'lr': 0.0004902047604885811, 'samples': 2924544, 'steps': 15231, 'loss/train': 1.9354302883148193} 11/06/2021 23:24:37 - INFO - __main__ - Step 15233: {'lr': 0.0004902032895300571, 'samples': 2924736, 'steps': 15232, 'loss/train': 1.3947269916534424} 11/06/2021 23:24:38 - INFO - __main__ - Step 15234: {'lr': 0.0004902018184633012, 'samples': 2924928, 'steps': 15233, 'loss/train': 1.6846981048583984} 11/06/2021 23:24:39 - INFO - __main__ - Step 15235: {'lr': 0.0004902003472883141, 'samples': 2925120, 'steps': 15234, 'loss/train': 1.5643565654754639} 11/06/2021 23:24:39 - INFO - __main__ - Step 15236: {'lr': 0.0004901988760050964, 'samples': 2925312, 'steps': 15235, 'loss/train': 1.710155963897705} 11/06/2021 23:24:39 - INFO - __main__ - Step 15237: {'lr': 0.0004901974046136488, 'samples': 2925504, 'steps': 15236, 'loss/train': 1.4140377044677734} 11/06/2021 23:24:40 - INFO - __main__ - Step 15238: {'lr': 0.000490195933113972, 'samples': 2925696, 'steps': 15237, 'loss/train': 1.148417353630066} 11/06/2021 23:24:41 - INFO - __main__ - Step 15239: {'lr': 0.0004901944615060665, 'samples': 2925888, 'steps': 15238, 'loss/train': 1.7428512573242188} 11/06/2021 23:24:41 - INFO - __main__ - Step 15240: {'lr': 0.0004901929897899331, 'samples': 2926080, 'steps': 15239, 'loss/train': 1.6310620307922363} 11/06/2021 23:24:41 - INFO - __main__ - Step 15241: {'lr': 0.0004901915179655726, 'samples': 2926272, 'steps': 15240, 'loss/train': 1.1041312217712402} 11/06/2021 23:24:42 - INFO - __main__ - Step 15242: {'lr': 0.0004901900460329853, 'samples': 2926464, 'steps': 15241, 'loss/train': 1.5519336462020874} 11/06/2021 23:24:42 - INFO - __main__ - Step 15243: {'lr': 0.0004901885739921723, 'samples': 2926656, 'steps': 15242, 'loss/train': 2.025369644165039} 11/06/2021 23:24:43 - INFO - __main__ - Step 15244: {'lr': 0.0004901871018431339, 'samples': 2926848, 'steps': 15243, 'loss/train': 1.429811716079712} 11/06/2021 23:24:43 - INFO - __main__ - Step 15245: {'lr': 0.0004901856295858708, 'samples': 2927040, 'steps': 15244, 'loss/train': 1.7202751636505127} 11/06/2021 23:24:44 - INFO - __main__ - Step 15246: {'lr': 0.0004901841572203839, 'samples': 2927232, 'steps': 15245, 'loss/train': 1.7216240167617798} 11/06/2021 23:24:44 - INFO - __main__ - Step 15247: {'lr': 0.0004901826847466738, 'samples': 2927424, 'steps': 15246, 'loss/train': 1.255531668663025} 11/06/2021 23:24:44 - INFO - __main__ - Step 15248: {'lr': 0.000490181212164741, 'samples': 2927616, 'steps': 15247, 'loss/train': 2.0545222759246826} 11/06/2021 23:24:46 - INFO - __main__ - Step 15249: {'lr': 0.0004901797394745861, 'samples': 2927808, 'steps': 15248, 'loss/train': 1.667556643486023} 11/06/2021 23:24:46 - INFO - __main__ - Step 15250: {'lr': 0.0004901782666762102, 'samples': 2928000, 'steps': 15249, 'loss/train': 1.852941870689392} 11/06/2021 23:24:46 - INFO - __main__ - Step 15251: {'lr': 0.0004901767937696135, 'samples': 2928192, 'steps': 15250, 'loss/train': 2.0363755226135254} 11/06/2021 23:24:47 - INFO - __main__ - Step 15252: {'lr': 0.0004901753207547969, 'samples': 2928384, 'steps': 15251, 'loss/train': 1.3573280572891235} 11/06/2021 23:24:47 - INFO - __main__ - Step 15253: {'lr': 0.000490173847631761, 'samples': 2928576, 'steps': 15252, 'loss/train': 1.8147984743118286} 11/06/2021 23:24:47 - INFO - __main__ - Step 15254: {'lr': 0.0004901723744005065, 'samples': 2928768, 'steps': 15253, 'loss/train': 1.7918343544006348} 11/06/2021 23:24:48 - INFO - __main__ - Step 15255: {'lr': 0.0004901709010610339, 'samples': 2928960, 'steps': 15254, 'loss/train': 1.5869040489196777} 11/06/2021 23:24:49 - INFO - __main__ - Step 15256: {'lr': 0.0004901694276133441, 'samples': 2929152, 'steps': 15255, 'loss/train': 1.790469765663147} 11/06/2021 23:24:49 - INFO - __main__ - Step 15257: {'lr': 0.0004901679540574377, 'samples': 2929344, 'steps': 15256, 'loss/train': 1.7606391906738281} 11/06/2021 23:24:49 - INFO - __main__ - Step 15258: {'lr': 0.0004901664803933153, 'samples': 2929536, 'steps': 15257, 'loss/train': 1.265826940536499} 11/06/2021 23:24:50 - INFO - __main__ - Step 15259: {'lr': 0.0004901650066209775, 'samples': 2929728, 'steps': 15258, 'loss/train': 1.744316577911377} 11/06/2021 23:24:51 - INFO - __main__ - Step 15260: {'lr': 0.0004901635327404252, 'samples': 2929920, 'steps': 15259, 'loss/train': 1.8547157049179077} 11/06/2021 23:24:51 - INFO - __main__ - Step 15261: {'lr': 0.0004901620587516587, 'samples': 2930112, 'steps': 15260, 'loss/train': 1.576607346534729} 11/06/2021 23:24:52 - INFO - __main__ - Step 15262: {'lr': 0.0004901605846546791, 'samples': 2930304, 'steps': 15261, 'loss/train': 1.0735132694244385} 11/06/2021 23:24:52 - INFO - __main__ - Step 15263: {'lr': 0.0004901591104494868, 'samples': 2930496, 'steps': 15262, 'loss/train': 2.1972012519836426} 11/06/2021 23:24:52 - INFO - __main__ - Step 15264: {'lr': 0.0004901576361360825, 'samples': 2930688, 'steps': 15263, 'loss/train': 1.0414632558822632} 11/06/2021 23:24:54 - INFO - __main__ - Step 15265: {'lr': 0.0004901561617144667, 'samples': 2930880, 'steps': 15264, 'loss/train': 1.8631575107574463} 11/06/2021 23:24:54 - INFO - __main__ - Step 15266: {'lr': 0.0004901546871846405, 'samples': 2931072, 'steps': 15265, 'loss/train': 0.9033583402633667} 11/06/2021 23:24:54 - INFO - __main__ - Step 15267: {'lr': 0.0004901532125466041, 'samples': 2931264, 'steps': 15266, 'loss/train': 1.4277762174606323} 11/06/2021 23:24:55 - INFO - __main__ - Step 15268: {'lr': 0.0004901517378003584, 'samples': 2931456, 'steps': 15267, 'loss/train': 0.7227610349655151} 11/06/2021 23:24:55 - INFO - __main__ - Step 15269: {'lr': 0.0004901502629459042, 'samples': 2931648, 'steps': 15268, 'loss/train': 0.25619640946388245} 11/06/2021 23:24:56 - INFO - __main__ - Step 15270: {'lr': 0.000490148787983242, 'samples': 2931840, 'steps': 15269, 'loss/train': 1.9297094345092773} 11/06/2021 23:24:56 - INFO - __main__ - Step 15271: {'lr': 0.0004901473129123723, 'samples': 2932032, 'steps': 15270, 'loss/train': 1.359814167022705} 11/06/2021 23:24:57 - INFO - __main__ - Step 15272: {'lr': 0.0004901458377332959, 'samples': 2932224, 'steps': 15271, 'loss/train': 1.641144037246704} 11/06/2021 23:24:57 - INFO - __main__ - Step 15273: {'lr': 0.0004901443624460136, 'samples': 2932416, 'steps': 15272, 'loss/train': 1.692050576210022} 11/06/2021 23:24:57 - INFO - __main__ - Step 15274: {'lr': 0.000490142887050526, 'samples': 2932608, 'steps': 15273, 'loss/train': 0.9697434902191162} 11/06/2021 23:24:59 - INFO - __main__ - Step 15275: {'lr': 0.0004901414115468335, 'samples': 2932800, 'steps': 15274, 'loss/train': 1.5721526145935059} 11/06/2021 23:24:59 - INFO - __main__ - Step 15276: {'lr': 0.0004901399359349372, 'samples': 2932992, 'steps': 15275, 'loss/train': 1.9338434934616089} 11/06/2021 23:24:59 - INFO - __main__ - Step 15277: {'lr': 0.0004901384602148376, 'samples': 2933184, 'steps': 15276, 'loss/train': 1.4072198867797852} 11/06/2021 23:25:00 - INFO - __main__ - Step 15278: {'lr': 0.0004901369843865351, 'samples': 2933376, 'steps': 15277, 'loss/train': 1.9054416418075562} 11/06/2021 23:25:00 - INFO - __main__ - Step 15279: {'lr': 0.0004901355084500307, 'samples': 2933568, 'steps': 15278, 'loss/train': 1.816591501235962} 11/06/2021 23:25:00 - INFO - __main__ - Step 15280: {'lr': 0.000490134032405325, 'samples': 2933760, 'steps': 15279, 'loss/train': 1.780461311340332} 11/06/2021 23:25:01 - INFO - __main__ - Step 15281: {'lr': 0.0004901325562524185, 'samples': 2933952, 'steps': 15280, 'loss/train': 0.4465474784374237} 11/06/2021 23:25:02 - INFO - __main__ - Step 15282: {'lr': 0.0004901310799913121, 'samples': 2934144, 'steps': 15281, 'loss/train': 1.4008394479751587} 11/06/2021 23:25:02 - INFO - __main__ - Step 15283: {'lr': 0.0004901296036220062, 'samples': 2934336, 'steps': 15282, 'loss/train': 1.2545615434646606} 11/06/2021 23:25:03 - INFO - __main__ - Step 15284: {'lr': 0.0004901281271445016, 'samples': 2934528, 'steps': 15283, 'loss/train': 1.5060338973999023} 11/06/2021 23:25:03 - INFO - __main__ - Step 15285: {'lr': 0.000490126650558799, 'samples': 2934720, 'steps': 15284, 'loss/train': 1.5489078760147095} 11/06/2021 23:25:03 - INFO - __main__ - Step 15286: {'lr': 0.000490125173864899, 'samples': 2934912, 'steps': 15285, 'loss/train': 1.5084561109542847} 11/06/2021 23:25:05 - INFO - __main__ - Step 15287: {'lr': 0.0004901236970628024, 'samples': 2935104, 'steps': 15286, 'loss/train': 1.5067825317382812} 11/06/2021 23:25:05 - INFO - __main__ - Step 15288: {'lr': 0.0004901222201525099, 'samples': 2935296, 'steps': 15287, 'loss/train': 1.3645248413085938} 11/06/2021 23:25:05 - INFO - __main__ - Step 15289: {'lr': 0.0004901207431340218, 'samples': 2935488, 'steps': 15288, 'loss/train': 1.7662934064865112} 11/06/2021 23:25:06 - INFO - __main__ - Step 15290: {'lr': 0.000490119266007339, 'samples': 2935680, 'steps': 15289, 'loss/train': 1.3966121673583984} 11/06/2021 23:25:06 - INFO - __main__ - Step 15291: {'lr': 0.0004901177887724623, 'samples': 2935872, 'steps': 15290, 'loss/train': 1.620540976524353} 11/06/2021 23:25:08 - INFO - __main__ - Step 15292: {'lr': 0.0004901163114293921, 'samples': 2936064, 'steps': 15291, 'loss/train': 1.8138169050216675} 11/06/2021 23:25:08 - INFO - __main__ - Step 15293: {'lr': 0.0004901148339781293, 'samples': 2936256, 'steps': 15292, 'loss/train': 1.7931532859802246} 11/06/2021 23:25:09 - INFO - __main__ - Step 15294: {'lr': 0.0004901133564186744, 'samples': 2936448, 'steps': 15293, 'loss/train': 1.5500938892364502} 11/06/2021 23:25:09 - INFO - __main__ - Step 15295: {'lr': 0.0004901118787510281, 'samples': 2936640, 'steps': 15294, 'loss/train': 3.1649906635284424} 11/06/2021 23:25:10 - INFO - __main__ - Step 15296: {'lr': 0.0004901104009751912, 'samples': 2936832, 'steps': 15295, 'loss/train': 0.5988634824752808} 11/06/2021 23:25:10 - INFO - __main__ - Step 15297: {'lr': 0.0004901089230911642, 'samples': 2937024, 'steps': 15296, 'loss/train': 0.5126264095306396} 11/06/2021 23:25:11 - INFO - __main__ - Step 15298: {'lr': 0.0004901074450989479, 'samples': 2937216, 'steps': 15297, 'loss/train': 1.6376423835754395} 11/06/2021 23:25:11 - INFO - __main__ - Step 15299: {'lr': 0.0004901059669985427, 'samples': 2937408, 'steps': 15298, 'loss/train': 2.4724743366241455} 11/06/2021 23:25:12 - INFO - __main__ - Step 15300: {'lr': 0.0004901044887899496, 'samples': 2937600, 'steps': 15299, 'loss/train': 1.8324450254440308} 11/06/2021 23:25:12 - INFO - __main__ - Step 15301: {'lr': 0.0004901030104731691, 'samples': 2937792, 'steps': 15300, 'loss/train': 2.0078108310699463} 11/06/2021 23:25:12 - INFO - __main__ - Step 15302: {'lr': 0.0004901015320482019, 'samples': 2937984, 'steps': 15301, 'loss/train': 1.5919190645217896} 11/06/2021 23:25:13 - INFO - __main__ - Step 15303: {'lr': 0.0004901000535150486, 'samples': 2938176, 'steps': 15302, 'loss/train': 1.7838540077209473} 11/06/2021 23:25:14 - INFO - __main__ - Step 15304: {'lr': 0.0004900985748737101, 'samples': 2938368, 'steps': 15303, 'loss/train': 1.9150638580322266} 11/06/2021 23:25:14 - INFO - __main__ - Step 15305: {'lr': 0.0004900970961241866, 'samples': 2938560, 'steps': 15304, 'loss/train': 1.3544918298721313} 11/06/2021 23:25:14 - INFO - __main__ - Step 15306: {'lr': 0.0004900956172664792, 'samples': 2938752, 'steps': 15305, 'loss/train': 1.8071099519729614} 11/06/2021 23:25:15 - INFO - __main__ - Step 15307: {'lr': 0.0004900941383005884, 'samples': 2938944, 'steps': 15306, 'loss/train': 2.145068883895874} 11/06/2021 23:25:16 - INFO - __main__ - Step 15308: {'lr': 0.0004900926592265149, 'samples': 2939136, 'steps': 15307, 'loss/train': 1.8237600326538086} 11/06/2021 23:25:16 - INFO - __main__ - Step 15309: {'lr': 0.0004900911800442593, 'samples': 2939328, 'steps': 15308, 'loss/train': 1.6788253784179688} 11/06/2021 23:25:16 - INFO - __main__ - Step 15310: {'lr': 0.0004900897007538225, 'samples': 2939520, 'steps': 15309, 'loss/train': 1.255226731300354} 11/06/2021 23:25:17 - INFO - __main__ - Step 15311: {'lr': 0.0004900882213552049, 'samples': 2939712, 'steps': 15310, 'loss/train': 2.2887020111083984} 11/06/2021 23:25:17 - INFO - __main__ - Step 15312: {'lr': 0.0004900867418484072, 'samples': 2939904, 'steps': 15311, 'loss/train': 1.4309970140457153} 11/06/2021 23:25:18 - INFO - __main__ - Step 15313: {'lr': 0.0004900852622334301, 'samples': 2940096, 'steps': 15312, 'loss/train': 1.7414692640304565} 11/06/2021 23:25:18 - INFO - __main__ - Step 15314: {'lr': 0.0004900837825102743, 'samples': 2940288, 'steps': 15313, 'loss/train': 2.0665924549102783} 11/06/2021 23:25:19 - INFO - __main__ - Step 15315: {'lr': 0.0004900823026789405, 'samples': 2940480, 'steps': 15314, 'loss/train': 1.498655080795288} 11/06/2021 23:25:19 - INFO - __main__ - Step 15316: {'lr': 0.0004900808227394293, 'samples': 2940672, 'steps': 15315, 'loss/train': 1.51724112033844} 11/06/2021 23:25:20 - INFO - __main__ - Step 15317: {'lr': 0.0004900793426917412, 'samples': 2940864, 'steps': 15316, 'loss/train': 1.9836689233779907} 11/06/2021 23:25:21 - INFO - __main__ - Step 15318: {'lr': 0.0004900778625358774, 'samples': 2941056, 'steps': 15317, 'loss/train': 1.8022664785385132} 11/06/2021 23:25:21 - INFO - __main__ - Step 15319: {'lr': 0.000490076382271838, 'samples': 2941248, 'steps': 15318, 'loss/train': 1.9844396114349365} 11/06/2021 23:25:21 - INFO - __main__ - Step 15320: {'lr': 0.0004900749018996238, 'samples': 2941440, 'steps': 15319, 'loss/train': 1.9721046686172485} 11/06/2021 23:25:22 - INFO - __main__ - Step 15321: {'lr': 0.0004900734214192358, 'samples': 2941632, 'steps': 15320, 'loss/train': 1.9147781133651733} 11/06/2021 23:25:22 - INFO - __main__ - Step 15322: {'lr': 0.0004900719408306743, 'samples': 2941824, 'steps': 15321, 'loss/train': 1.805896520614624} 11/06/2021 23:25:22 - INFO - __main__ - Step 15323: {'lr': 0.0004900704601339401, 'samples': 2942016, 'steps': 15322, 'loss/train': 1.896042823791504} 11/06/2021 23:25:24 - INFO - __main__ - Step 15324: {'lr': 0.0004900689793290339, 'samples': 2942208, 'steps': 15323, 'loss/train': 1.7801188230514526} 11/06/2021 23:25:24 - INFO - __main__ - Step 15325: {'lr': 0.0004900674984159562, 'samples': 2942400, 'steps': 15324, 'loss/train': 1.7624114751815796} 11/06/2021 23:25:25 - INFO - __main__ - Step 15326: {'lr': 0.0004900660173947079, 'samples': 2942592, 'steps': 15325, 'loss/train': 2.018181324005127} 11/06/2021 23:25:25 - INFO - __main__ - Step 15327: {'lr': 0.0004900645362652895, 'samples': 2942784, 'steps': 15326, 'loss/train': 1.8817495107650757} 11/06/2021 23:25:25 - INFO - __main__ - Step 15328: {'lr': 0.0004900630550277018, 'samples': 2942976, 'steps': 15327, 'loss/train': 2.0773236751556396} 11/06/2021 23:25:26 - INFO - __main__ - Step 15329: {'lr': 0.0004900615736819452, 'samples': 2943168, 'steps': 15328, 'loss/train': 1.9412237405776978} 11/06/2021 23:25:27 - INFO - __main__ - Step 15330: {'lr': 0.0004900600922280207, 'samples': 2943360, 'steps': 15329, 'loss/train': 1.6500606536865234} 11/06/2021 23:25:27 - INFO - __main__ - Step 15331: {'lr': 0.0004900586106659289, 'samples': 2943552, 'steps': 15330, 'loss/train': 2.294801712036133} 11/06/2021 23:25:27 - INFO - __main__ - Step 15332: {'lr': 0.0004900571289956703, 'samples': 2943744, 'steps': 15331, 'loss/train': 0.9480341076850891} 11/06/2021 23:25:28 - INFO - __main__ - Step 15333: {'lr': 0.0004900556472172457, 'samples': 2943936, 'steps': 15332, 'loss/train': 2.6426515579223633} 11/06/2021 23:25:28 - INFO - __main__ - Step 15334: {'lr': 0.0004900541653306557, 'samples': 2944128, 'steps': 15333, 'loss/train': 2.0877530574798584} 11/06/2021 23:25:29 - INFO - __main__ - Step 15335: {'lr': 0.0004900526833359009, 'samples': 2944320, 'steps': 15334, 'loss/train': 1.6220190525054932} 11/06/2021 23:25:30 - INFO - __main__ - Step 15336: {'lr': 0.0004900512012329822, 'samples': 2944512, 'steps': 15335, 'loss/train': 1.7684022188186646} 11/06/2021 23:25:30 - INFO - __main__ - Step 15337: {'lr': 0.0004900497190219002, 'samples': 2944704, 'steps': 15336, 'loss/train': 1.8836572170257568} 11/06/2021 23:25:31 - INFO - __main__ - Step 15338: {'lr': 0.0004900482367026554, 'samples': 2944896, 'steps': 15337, 'loss/train': 0.36830881237983704} 11/06/2021 23:25:31 - INFO - __main__ - Step 15339: {'lr': 0.0004900467542752485, 'samples': 2945088, 'steps': 15338, 'loss/train': 1.869981288909912} 11/06/2021 23:25:31 - INFO - __main__ - Step 15340: {'lr': 0.0004900452717396803, 'samples': 2945280, 'steps': 15339, 'loss/train': 1.5828317403793335} 11/06/2021 23:25:32 - INFO - __main__ - Step 15341: {'lr': 0.0004900437890959515, 'samples': 2945472, 'steps': 15340, 'loss/train': 1.6785833835601807} 11/06/2021 23:25:33 - INFO - __main__ - Step 15342: {'lr': 0.0004900423063440625, 'samples': 2945664, 'steps': 15341, 'loss/train': 1.9623292684555054} 11/06/2021 23:25:33 - INFO - __main__ - Step 15343: {'lr': 0.0004900408234840142, 'samples': 2945856, 'steps': 15342, 'loss/train': 1.8435673713684082} 11/06/2021 23:25:33 - INFO - __main__ - Step 15344: {'lr': 0.0004900393405158073, 'samples': 2946048, 'steps': 15343, 'loss/train': 2.0135602951049805} 11/06/2021 23:25:34 - INFO - __main__ - Step 15345: {'lr': 0.0004900378574394423, 'samples': 2946240, 'steps': 15344, 'loss/train': 1.5645862817764282} 11/06/2021 23:25:34 - INFO - __main__ - Step 15346: {'lr': 0.00049003637425492, 'samples': 2946432, 'steps': 15345, 'loss/train': 6.070239543914795} 11/06/2021 23:25:35 - INFO - __main__ - Step 15347: {'lr': 0.0004900348909622409, 'samples': 2946624, 'steps': 15346, 'loss/train': 2.0285749435424805} 11/06/2021 23:25:35 - INFO - __main__ - Step 15348: {'lr': 0.0004900334075614059, 'samples': 2946816, 'steps': 15347, 'loss/train': 1.966596245765686} 11/06/2021 23:25:36 - INFO - __main__ - Step 15349: {'lr': 0.0004900319240524155, 'samples': 2947008, 'steps': 15348, 'loss/train': 1.8650213479995728} 11/06/2021 23:25:36 - INFO - __main__ - Step 15350: {'lr': 0.0004900304404352704, 'samples': 2947200, 'steps': 15349, 'loss/train': 2.0036559104919434} 11/06/2021 23:25:36 - INFO - __main__ - Step 15351: {'lr': 0.0004900289567099713, 'samples': 2947392, 'steps': 15350, 'loss/train': 1.567614197731018} 11/06/2021 23:25:38 - INFO - __main__ - Step 15352: {'lr': 0.000490027472876519, 'samples': 2947584, 'steps': 15351, 'loss/train': 1.430046558380127} 11/06/2021 23:25:38 - INFO - __main__ - Step 15353: {'lr': 0.0004900259889349138, 'samples': 2947776, 'steps': 15352, 'loss/train': 2.044808864593506} 11/06/2021 23:25:38 - INFO - __main__ - Step 15354: {'lr': 0.0004900245048851567, 'samples': 2947968, 'steps': 15353, 'loss/train': 0.9513130784034729} 11/06/2021 23:25:39 - INFO - __main__ - Step 15355: {'lr': 0.0004900230207272483, 'samples': 2948160, 'steps': 15354, 'loss/train': 1.701623558998108} 11/06/2021 23:25:39 - INFO - __main__ - Step 15356: {'lr': 0.000490021536461189, 'samples': 2948352, 'steps': 15355, 'loss/train': 1.8428715467453003} 11/06/2021 23:25:40 - INFO - __main__ - Step 15357: {'lr': 0.00049002005208698, 'samples': 2948544, 'steps': 15356, 'loss/train': 1.4935647249221802} 11/06/2021 23:25:40 - INFO - __main__ - Step 15358: {'lr': 0.0004900185676046214, 'samples': 2948736, 'steps': 15357, 'loss/train': 1.9879461526870728} 11/06/2021 23:25:41 - INFO - __main__ - Step 15359: {'lr': 0.0004900170830141144, 'samples': 2948928, 'steps': 15358, 'loss/train': 1.5759130716323853} 11/06/2021 23:25:41 - INFO - __main__ - Step 15360: {'lr': 0.0004900155983154592, 'samples': 2949120, 'steps': 15359, 'loss/train': 1.0437538623809814} 11/06/2021 23:25:41 - INFO - __main__ - Step 15361: {'lr': 0.0004900141135086569, 'samples': 2949312, 'steps': 15360, 'loss/train': 1.6635363101959229} 11/06/2021 23:25:43 - INFO - __main__ - Step 15362: {'lr': 0.0004900126285937077, 'samples': 2949504, 'steps': 15361, 'loss/train': 1.9337184429168701} 11/06/2021 23:25:43 - INFO - __main__ - Step 15363: {'lr': 0.0004900111435706127, 'samples': 2949696, 'steps': 15362, 'loss/train': 1.4485169649124146} 11/06/2021 23:25:43 - INFO - __main__ - Step 15364: {'lr': 0.0004900096584393723, 'samples': 2949888, 'steps': 15363, 'loss/train': 1.5810778141021729} 11/06/2021 23:25:44 - INFO - __main__ - Step 15365: {'lr': 0.0004900081731999872, 'samples': 2950080, 'steps': 15364, 'loss/train': 1.6182117462158203} 11/06/2021 23:25:44 - INFO - __main__ - Step 15366: {'lr': 0.0004900066878524582, 'samples': 2950272, 'steps': 15365, 'loss/train': 1.547028660774231} 11/06/2021 23:25:45 - INFO - __main__ - Step 15367: {'lr': 0.0004900052023967859, 'samples': 2950464, 'steps': 15366, 'loss/train': 1.7055131196975708} 11/06/2021 23:25:45 - INFO - __main__ - Step 15368: {'lr': 0.0004900037168329709, 'samples': 2950656, 'steps': 15367, 'loss/train': 1.550373911857605} 11/06/2021 23:25:46 - INFO - __main__ - Step 15369: {'lr': 0.000490002231161014, 'samples': 2950848, 'steps': 15368, 'loss/train': 1.7046164274215698} 11/06/2021 23:25:46 - INFO - __main__ - Step 15370: {'lr': 0.0004900007453809157, 'samples': 2951040, 'steps': 15369, 'loss/train': 1.5732970237731934} 11/06/2021 23:25:46 - INFO - __main__ - Step 15371: {'lr': 0.0004899992594926769, 'samples': 2951232, 'steps': 15370, 'loss/train': 1.3955873250961304} 11/06/2021 23:25:47 - INFO - __main__ - Step 15372: {'lr': 0.000489997773496298, 'samples': 2951424, 'steps': 15371, 'loss/train': 1.5523087978363037} 11/06/2021 23:25:48 - INFO - __main__ - Step 15373: {'lr': 0.0004899962873917798, 'samples': 2951616, 'steps': 15372, 'loss/train': 1.710436463356018} 11/06/2021 23:25:48 - INFO - __main__ - Step 15374: {'lr': 0.000489994801179123, 'samples': 2951808, 'steps': 15373, 'loss/train': 1.610868215560913} 11/06/2021 23:25:49 - INFO - __main__ - Step 15375: {'lr': 0.0004899933148583284, 'samples': 2952000, 'steps': 15374, 'loss/train': 1.4452327489852905} 11/06/2021 23:25:49 - INFO - __main__ - Step 15376: {'lr': 0.0004899918284293964, 'samples': 2952192, 'steps': 15375, 'loss/train': 1.6878875494003296} 11/06/2021 23:25:50 - INFO - __main__ - Step 15377: {'lr': 0.0004899903418923278, 'samples': 2952384, 'steps': 15376, 'loss/train': 1.6669107675552368} 11/06/2021 23:25:50 - INFO - __main__ - Step 15378: {'lr': 0.0004899888552471232, 'samples': 2952576, 'steps': 15377, 'loss/train': 1.7093682289123535} 11/06/2021 23:25:51 - INFO - __main__ - Step 15379: {'lr': 0.0004899873684937833, 'samples': 2952768, 'steps': 15378, 'loss/train': 1.2271426916122437} 11/06/2021 23:25:51 - INFO - __main__ - Step 15380: {'lr': 0.0004899858816323089, 'samples': 2952960, 'steps': 15379, 'loss/train': 1.6182414293289185} 11/06/2021 23:25:51 - INFO - __main__ - Step 15381: {'lr': 0.0004899843946627006, 'samples': 2953152, 'steps': 15380, 'loss/train': 1.7214139699935913} 11/06/2021 23:25:52 - INFO - __main__ - Step 15382: {'lr': 0.0004899829075849589, 'samples': 2953344, 'steps': 15381, 'loss/train': 2.161836624145508} 11/06/2021 23:25:53 - INFO - __main__ - Step 15383: {'lr': 0.0004899814203990847, 'samples': 2953536, 'steps': 15382, 'loss/train': 1.879669189453125} 11/06/2021 23:25:53 - INFO - __main__ - Step 15384: {'lr': 0.0004899799331050785, 'samples': 2953728, 'steps': 15383, 'loss/train': 1.885219693183899} 11/06/2021 23:25:53 - INFO - __main__ - Step 15385: {'lr': 0.0004899784457029411, 'samples': 2953920, 'steps': 15384, 'loss/train': 1.2572287321090698} 11/06/2021 23:25:54 - INFO - __main__ - Step 15386: {'lr': 0.000489976958192673, 'samples': 2954112, 'steps': 15385, 'loss/train': 1.6987992525100708} 11/06/2021 23:25:54 - INFO - __main__ - Step 15387: {'lr': 0.0004899754705742752, 'samples': 2954304, 'steps': 15386, 'loss/train': 2.0075464248657227} 11/06/2021 23:25:55 - INFO - __main__ - Step 15388: {'lr': 0.0004899739828477481, 'samples': 2954496, 'steps': 15387, 'loss/train': 1.6312947273254395} 11/06/2021 23:25:56 - INFO - __main__ - Step 15389: {'lr': 0.0004899724950130923, 'samples': 2954688, 'steps': 15388, 'loss/train': 1.7411924600601196} 11/06/2021 23:25:56 - INFO - __main__ - Step 15390: {'lr': 0.0004899710070703087, 'samples': 2954880, 'steps': 15389, 'loss/train': 5.825747489929199} 11/06/2021 23:25:56 - INFO - __main__ - Step 15391: {'lr': 0.0004899695190193978, 'samples': 2955072, 'steps': 15390, 'loss/train': 1.3837753534317017} 11/06/2021 23:25:57 - INFO - __main__ - Step 15392: {'lr': 0.0004899680308603604, 'samples': 2955264, 'steps': 15391, 'loss/train': 1.489019751548767} 11/06/2021 23:25:57 - INFO - __main__ - Step 15393: {'lr': 0.000489966542593197, 'samples': 2955456, 'steps': 15392, 'loss/train': 1.6282072067260742} 11/06/2021 23:25:58 - INFO - __main__ - Step 15394: {'lr': 0.0004899650542179085, 'samples': 2955648, 'steps': 15393, 'loss/train': 1.1773505210876465} 11/06/2021 23:25:58 - INFO - __main__ - Step 15395: {'lr': 0.0004899635657344954, 'samples': 2955840, 'steps': 15394, 'loss/train': 1.7524396181106567} 11/06/2021 23:25:59 - INFO - __main__ - Step 15396: {'lr': 0.0004899620771429585, 'samples': 2956032, 'steps': 15395, 'loss/train': 1.4660807847976685} 11/06/2021 23:25:59 - INFO - __main__ - Step 15397: {'lr': 0.0004899605884432983, 'samples': 2956224, 'steps': 15396, 'loss/train': 1.6448665857315063} 11/06/2021 23:25:59 - INFO - __main__ - Step 15398: {'lr': 0.0004899590996355155, 'samples': 2956416, 'steps': 15397, 'loss/train': 1.1277403831481934} 11/06/2021 23:26:00 - INFO - __main__ - Step 15399: {'lr': 0.000489957610719611, 'samples': 2956608, 'steps': 15398, 'loss/train': 1.7842482328414917} 11/06/2021 23:26:01 - INFO - __main__ - Step 15400: {'lr': 0.0004899561216955852, 'samples': 2956800, 'steps': 15399, 'loss/train': 1.7123888731002808} 11/06/2021 23:26:01 - INFO - __main__ - Step 15401: {'lr': 0.0004899546325634388, 'samples': 2956992, 'steps': 15400, 'loss/train': 1.379862666130066} 11/06/2021 23:26:01 - INFO - __main__ - Step 15402: {'lr': 0.0004899531433231728, 'samples': 2957184, 'steps': 15401, 'loss/train': 1.210938572883606} 11/06/2021 23:26:02 - INFO - __main__ - Step 15403: {'lr': 0.0004899516539747874, 'samples': 2957376, 'steps': 15402, 'loss/train': 1.3789384365081787} 11/06/2021 23:26:02 - INFO - __main__ - Step 15404: {'lr': 0.0004899501645182835, 'samples': 2957568, 'steps': 15403, 'loss/train': 1.301833987236023} 11/06/2021 23:26:03 - INFO - __main__ - Step 15405: {'lr': 0.0004899486749536618, 'samples': 2957760, 'steps': 15404, 'loss/train': 2.503183603286743} 11/06/2021 23:26:04 - INFO - __main__ - Step 15406: {'lr': 0.000489947185280923, 'samples': 2957952, 'steps': 15405, 'loss/train': 1.348501443862915} 11/06/2021 23:26:04 - INFO - __main__ - Step 15407: {'lr': 0.0004899456955000676, 'samples': 2958144, 'steps': 15406, 'loss/train': 1.6351882219314575} 11/06/2021 23:26:04 - INFO - __main__ - Step 15408: {'lr': 0.0004899442056110964, 'samples': 2958336, 'steps': 15407, 'loss/train': 1.4293309450149536} 11/06/2021 23:26:05 - INFO - __main__ - Step 15409: {'lr': 0.00048994271561401, 'samples': 2958528, 'steps': 15408, 'loss/train': 1.3212099075317383} 11/06/2021 23:26:06 - INFO - __main__ - Step 15410: {'lr': 0.0004899412255088091, 'samples': 2958720, 'steps': 15409, 'loss/train': 1.6057772636413574} 11/06/2021 23:26:06 - INFO - __main__ - Step 15411: {'lr': 0.0004899397352954945, 'samples': 2958912, 'steps': 15410, 'loss/train': 1.645572304725647} 11/06/2021 23:26:06 - INFO - __main__ - Step 15412: {'lr': 0.0004899382449740667, 'samples': 2959104, 'steps': 15411, 'loss/train': 1.9118947982788086} 11/06/2021 23:26:07 - INFO - __main__ - Step 15413: {'lr': 0.0004899367545445264, 'samples': 2959296, 'steps': 15412, 'loss/train': 1.989318609237671} 11/06/2021 23:26:07 - INFO - __main__ - Step 15414: {'lr': 0.0004899352640068743, 'samples': 2959488, 'steps': 15413, 'loss/train': 1.6085702180862427} 11/06/2021 23:26:08 - INFO - __main__ - Step 15415: {'lr': 0.0004899337733611113, 'samples': 2959680, 'steps': 15414, 'loss/train': 1.6412628889083862} 11/06/2021 23:26:09 - INFO - __main__ - Step 15416: {'lr': 0.0004899322826072375, 'samples': 2959872, 'steps': 15415, 'loss/train': 1.6715970039367676} 11/06/2021 23:26:09 - INFO - __main__ - Step 15417: {'lr': 0.0004899307917452542, 'samples': 2960064, 'steps': 15416, 'loss/train': 0.1822996884584427} 11/06/2021 23:26:09 - INFO - __main__ - Step 15418: {'lr': 0.0004899293007751616, 'samples': 2960256, 'steps': 15417, 'loss/train': 1.2673732042312622} 11/06/2021 23:26:10 - INFO - __main__ - Step 15419: {'lr': 0.0004899278096969605, 'samples': 2960448, 'steps': 15418, 'loss/train': 2.0934507846832275} 11/06/2021 23:26:11 - INFO - __main__ - Step 15420: {'lr': 0.0004899263185106518, 'samples': 2960640, 'steps': 15419, 'loss/train': 1.9533450603485107} 11/06/2021 23:26:11 - INFO - __main__ - Step 15421: {'lr': 0.000489924827216236, 'samples': 2960832, 'steps': 15420, 'loss/train': 1.7429298162460327} 11/06/2021 23:26:11 - INFO - __main__ - Step 15422: {'lr': 0.0004899233358137137, 'samples': 2961024, 'steps': 15421, 'loss/train': 1.7253789901733398} 11/06/2021 23:26:12 - INFO - __main__ - Step 15423: {'lr': 0.0004899218443030857, 'samples': 2961216, 'steps': 15422, 'loss/train': 0.9601808786392212} 11/06/2021 23:26:12 - INFO - __main__ - Step 15424: {'lr': 0.0004899203526843526, 'samples': 2961408, 'steps': 15423, 'loss/train': 1.4191993474960327} 11/06/2021 23:26:13 - INFO - __main__ - Step 15425: {'lr': 0.000489918860957515, 'samples': 2961600, 'steps': 15424, 'loss/train': 1.5568323135375977} 11/06/2021 23:26:14 - INFO - __main__ - Step 15426: {'lr': 0.0004899173691225737, 'samples': 2961792, 'steps': 15425, 'loss/train': 1.8269211053848267} 11/06/2021 23:26:14 - INFO - __main__ - Step 15427: {'lr': 0.0004899158771795295, 'samples': 2961984, 'steps': 15426, 'loss/train': 1.576122522354126} 11/06/2021 23:26:14 - INFO - __main__ - Step 15428: {'lr': 0.0004899143851283827, 'samples': 2962176, 'steps': 15427, 'loss/train': 1.3698642253875732} 11/06/2021 23:26:15 - INFO - __main__ - Step 15429: {'lr': 0.0004899128929691343, 'samples': 2962368, 'steps': 15428, 'loss/train': 1.8090990781784058} 11/06/2021 23:26:15 - INFO - __main__ - Step 15430: {'lr': 0.0004899114007017849, 'samples': 2962560, 'steps': 15429, 'loss/train': 1.9394744634628296} 11/06/2021 23:26:16 - INFO - __main__ - Step 15431: {'lr': 0.000489909908326335, 'samples': 2962752, 'steps': 15430, 'loss/train': 1.6963614225387573} 11/06/2021 23:26:17 - INFO - __main__ - Step 15432: {'lr': 0.0004899084158427855, 'samples': 2962944, 'steps': 15431, 'loss/train': 1.5980172157287598} 11/06/2021 23:26:17 - INFO - __main__ - Step 15433: {'lr': 0.0004899069232511368, 'samples': 2963136, 'steps': 15432, 'loss/train': 2.0468621253967285} 11/06/2021 23:26:17 - INFO - __main__ - Step 15434: {'lr': 0.0004899054305513899, 'samples': 2963328, 'steps': 15433, 'loss/train': 2.0248918533325195} 11/06/2021 23:26:18 - INFO - __main__ - Step 15435: {'lr': 0.0004899039377435452, 'samples': 2963520, 'steps': 15434, 'loss/train': 2.0467848777770996} 11/06/2021 23:26:19 - INFO - __main__ - Step 15436: {'lr': 0.0004899024448276036, 'samples': 2963712, 'steps': 15435, 'loss/train': 1.7609314918518066} 11/06/2021 23:26:19 - INFO - __main__ - Step 15437: {'lr': 0.0004899009518035657, 'samples': 2963904, 'steps': 15436, 'loss/train': 1.8569064140319824} 11/06/2021 23:26:19 - INFO - __main__ - Step 15438: {'lr': 0.000489899458671432, 'samples': 2964096, 'steps': 15437, 'loss/train': 1.7970716953277588} 11/06/2021 23:26:20 - INFO - __main__ - Step 15439: {'lr': 0.0004898979654312034, 'samples': 2964288, 'steps': 15438, 'loss/train': 1.7832825183868408} 11/06/2021 23:26:20 - INFO - __main__ - Step 15440: {'lr': 0.0004898964720828804, 'samples': 2964480, 'steps': 15439, 'loss/train': 1.3016754388809204} 11/06/2021 23:26:21 - INFO - __main__ - Step 15441: {'lr': 0.0004898949786264638, 'samples': 2964672, 'steps': 15440, 'loss/train': 1.9825522899627686} 11/06/2021 23:26:22 - INFO - __main__ - Step 15442: {'lr': 0.0004898934850619542, 'samples': 2964864, 'steps': 15441, 'loss/train': 1.5393202304840088} 11/06/2021 23:26:22 - INFO - __main__ - Step 15443: {'lr': 0.0004898919913893522, 'samples': 2965056, 'steps': 15442, 'loss/train': 1.3558751344680786} 11/06/2021 23:26:22 - INFO - __main__ - Step 15444: {'lr': 0.0004898904976086588, 'samples': 2965248, 'steps': 15443, 'loss/train': 1.950610876083374} 11/06/2021 23:26:23 - INFO - __main__ - Step 15445: {'lr': 0.0004898890037198743, 'samples': 2965440, 'steps': 15444, 'loss/train': 1.397348403930664} 11/06/2021 23:26:24 - INFO - __main__ - Step 15446: {'lr': 0.0004898875097229995, 'samples': 2965632, 'steps': 15445, 'loss/train': 2.1958858966827393} 11/06/2021 23:26:24 - INFO - __main__ - Step 15447: {'lr': 0.0004898860156180351, 'samples': 2965824, 'steps': 15446, 'loss/train': 1.3978043794631958} 11/06/2021 23:26:24 - INFO - __main__ - Step 15448: {'lr': 0.0004898845214049818, 'samples': 2966016, 'steps': 15447, 'loss/train': 0.5018529295921326} 11/06/2021 23:26:25 - INFO - __main__ - Step 15449: {'lr': 0.0004898830270838403, 'samples': 2966208, 'steps': 15448, 'loss/train': 1.4868112802505493} 11/06/2021 23:26:25 - INFO - __main__ - Step 15450: {'lr': 0.0004898815326546111, 'samples': 2966400, 'steps': 15449, 'loss/train': 1.6243109703063965} 11/06/2021 23:26:26 - INFO - __main__ - Step 15451: {'lr': 0.0004898800381172951, 'samples': 2966592, 'steps': 15450, 'loss/train': 1.2126389741897583} 11/06/2021 23:26:26 - INFO - __main__ - Step 15452: {'lr': 0.0004898785434718927, 'samples': 2966784, 'steps': 15451, 'loss/train': 1.6377086639404297} 11/06/2021 23:26:27 - INFO - __main__ - Step 15453: {'lr': 0.0004898770487184047, 'samples': 2966976, 'steps': 15452, 'loss/train': 2.266317844390869} 11/06/2021 23:26:27 - INFO - __main__ - Step 15454: {'lr': 0.000489875553856832, 'samples': 2967168, 'steps': 15453, 'loss/train': 2.4078965187072754} 11/06/2021 23:26:27 - INFO - __main__ - Step 15455: {'lr': 0.000489874058887175, 'samples': 2967360, 'steps': 15454, 'loss/train': 2.1697914600372314} 11/06/2021 23:26:29 - INFO - __main__ - Step 15456: {'lr': 0.0004898725638094345, 'samples': 2967552, 'steps': 15455, 'loss/train': 1.5495775938034058} 11/06/2021 23:26:29 - INFO - __main__ - Step 15457: {'lr': 0.0004898710686236109, 'samples': 2967744, 'steps': 15456, 'loss/train': 1.7793976068496704} 11/06/2021 23:26:29 - INFO - __main__ - Step 15458: {'lr': 0.0004898695733297054, 'samples': 2967936, 'steps': 15457, 'loss/train': 1.8634929656982422} 11/06/2021 23:26:30 - INFO - __main__ - Step 15459: {'lr': 0.0004898680779277182, 'samples': 2968128, 'steps': 15458, 'loss/train': 1.4359500408172607} 11/06/2021 23:26:30 - INFO - __main__ - Step 15460: {'lr': 0.0004898665824176502, 'samples': 2968320, 'steps': 15459, 'loss/train': 1.47426176071167} 11/06/2021 23:26:31 - INFO - __main__ - Step 15461: {'lr': 0.000489865086799502, 'samples': 2968512, 'steps': 15460, 'loss/train': 1.6190659999847412} 11/06/2021 23:26:31 - INFO - __main__ - Step 15462: {'lr': 0.0004898635910732743, 'samples': 2968704, 'steps': 15461, 'loss/train': 1.3541996479034424} 11/06/2021 23:26:32 - INFO - __main__ - Step 15463: {'lr': 0.0004898620952389677, 'samples': 2968896, 'steps': 15462, 'loss/train': 0.9938245415687561} 11/06/2021 23:26:32 - INFO - __main__ - Step 15464: {'lr': 0.000489860599296583, 'samples': 2969088, 'steps': 15463, 'loss/train': 1.6961749792099} 11/06/2021 23:26:32 - INFO - __main__ - Step 15465: {'lr': 0.0004898591032461208, 'samples': 2969280, 'steps': 15464, 'loss/train': 1.7099051475524902} 11/06/2021 23:26:33 - INFO - __main__ - Step 15466: {'lr': 0.0004898576070875818, 'samples': 2969472, 'steps': 15465, 'loss/train': 2.1200191974639893} 11/06/2021 23:26:34 - INFO - __main__ - Step 15467: {'lr': 0.0004898561108209667, 'samples': 2969664, 'steps': 15466, 'loss/train': 1.7490513324737549} 11/06/2021 23:26:34 - INFO - __main__ - Step 15468: {'lr': 0.0004898546144462762, 'samples': 2969856, 'steps': 15467, 'loss/train': 1.9029560089111328} 11/06/2021 23:26:34 - INFO - __main__ - Step 15469: {'lr': 0.0004898531179635108, 'samples': 2970048, 'steps': 15468, 'loss/train': 1.0643110275268555} 11/06/2021 23:26:35 - INFO - __main__ - Step 15470: {'lr': 0.0004898516213726712, 'samples': 2970240, 'steps': 15469, 'loss/train': 2.0877974033355713} 11/06/2021 23:26:35 - INFO - __main__ - Step 15471: {'lr': 0.0004898501246737583, 'samples': 2970432, 'steps': 15470, 'loss/train': 1.450588345527649} 11/06/2021 23:26:36 - INFO - __main__ - Step 15472: {'lr': 0.0004898486278667725, 'samples': 2970624, 'steps': 15471, 'loss/train': 1.8085523843765259} 11/06/2021 23:26:36 - INFO - __main__ - Step 15473: {'lr': 0.0004898471309517148, 'samples': 2970816, 'steps': 15472, 'loss/train': 1.8764305114746094} 11/06/2021 23:26:37 - INFO - __main__ - Step 15474: {'lr': 0.0004898456339285857, 'samples': 2971008, 'steps': 15473, 'loss/train': 1.699832558631897} 11/06/2021 23:26:37 - INFO - __main__ - Step 15475: {'lr': 0.0004898441367973856, 'samples': 2971200, 'steps': 15474, 'loss/train': 1.7935590744018555} 11/06/2021 23:26:38 - INFO - __main__ - Step 15476: {'lr': 0.0004898426395581156, 'samples': 2971392, 'steps': 15475, 'loss/train': 2.0866262912750244} 11/06/2021 23:26:39 - INFO - __main__ - Step 15477: {'lr': 0.0004898411422107762, 'samples': 2971584, 'steps': 15476, 'loss/train': 1.2381302118301392} 11/06/2021 23:26:39 - INFO - __main__ - Step 15478: {'lr': 0.0004898396447553681, 'samples': 2971776, 'steps': 15477, 'loss/train': 2.092162847518921} 11/06/2021 23:26:39 - INFO - __main__ - Step 15479: {'lr': 0.000489838147191892, 'samples': 2971968, 'steps': 15478, 'loss/train': 1.3541998863220215} 11/06/2021 23:26:40 - INFO - __main__ - Step 15480: {'lr': 0.0004898366495203483, 'samples': 2972160, 'steps': 15479, 'loss/train': 1.967870831489563} 11/06/2021 23:26:40 - INFO - __main__ - Step 15481: {'lr': 0.0004898351517407381, 'samples': 2972352, 'steps': 15480, 'loss/train': 1.0641893148422241} 11/06/2021 23:26:41 - INFO - __main__ - Step 15482: {'lr': 0.0004898336538530619, 'samples': 2972544, 'steps': 15481, 'loss/train': 1.6130701303482056} 11/06/2021 23:26:41 - INFO - __main__ - Step 15483: {'lr': 0.0004898321558573203, 'samples': 2972736, 'steps': 15482, 'loss/train': 1.8925131559371948} 11/06/2021 23:26:42 - INFO - __main__ - Step 15484: {'lr': 0.000489830657753514, 'samples': 2972928, 'steps': 15483, 'loss/train': 1.3299318552017212} 11/06/2021 23:26:42 - INFO - __main__ - Step 15485: {'lr': 0.0004898291595416438, 'samples': 2973120, 'steps': 15484, 'loss/train': 1.7662090063095093} 11/06/2021 23:26:42 - INFO - __main__ - Step 15486: {'lr': 0.0004898276612217102, 'samples': 2973312, 'steps': 15485, 'loss/train': 1.9618338346481323} 11/06/2021 23:26:43 - INFO - __main__ - Step 15487: {'lr': 0.0004898261627937139, 'samples': 2973504, 'steps': 15486, 'loss/train': 1.4506701231002808} 11/06/2021 23:26:44 - INFO - __main__ - Step 15488: {'lr': 0.0004898246642576559, 'samples': 2973696, 'steps': 15487, 'loss/train': 1.4949243068695068} 11/06/2021 23:26:44 - INFO - __main__ - Step 15489: {'lr': 0.0004898231656135362, 'samples': 2973888, 'steps': 15488, 'loss/train': 1.7489192485809326} 11/06/2021 23:26:44 - INFO - __main__ - Step 15490: {'lr': 0.0004898216668613562, 'samples': 2974080, 'steps': 15489, 'loss/train': 1.7685637474060059} 11/06/2021 23:26:45 - INFO - __main__ - Step 15491: {'lr': 0.0004898201680011161, 'samples': 2974272, 'steps': 15490, 'loss/train': 1.6107263565063477} 11/06/2021 23:26:45 - INFO - __main__ - Step 15492: {'lr': 0.0004898186690328168, 'samples': 2974464, 'steps': 15491, 'loss/train': 1.5681654214859009} 11/06/2021 23:26:46 - INFO - __main__ - Step 15493: {'lr': 0.000489817169956459, 'samples': 2974656, 'steps': 15492, 'loss/train': 1.5541188716888428} 11/06/2021 23:26:46 - INFO - __main__ - Step 15494: {'lr': 0.0004898156707720432, 'samples': 2974848, 'steps': 15493, 'loss/train': 1.5298535823822021} 11/06/2021 23:26:47 - INFO - __main__ - Step 15495: {'lr': 0.0004898141714795701, 'samples': 2975040, 'steps': 15494, 'loss/train': 1.4067524671554565} 11/06/2021 23:26:47 - INFO - __main__ - Step 15496: {'lr': 0.0004898126720790405, 'samples': 2975232, 'steps': 15495, 'loss/train': 1.936280369758606} 11/06/2021 23:26:47 - INFO - __main__ - Step 15497: {'lr': 0.0004898111725704549, 'samples': 2975424, 'steps': 15496, 'loss/train': 1.3486140966415405} 11/06/2021 23:26:49 - INFO - __main__ - Step 15498: {'lr': 0.0004898096729538142, 'samples': 2975616, 'steps': 15497, 'loss/train': 0.8642235398292542} 11/06/2021 23:26:49 - INFO - __main__ - Step 15499: {'lr': 0.000489808173229119, 'samples': 2975808, 'steps': 15498, 'loss/train': 1.4171953201293945} 11/06/2021 23:26:49 - INFO - __main__ - Step 15500: {'lr': 0.0004898066733963699, 'samples': 2976000, 'steps': 15499, 'loss/train': 1.831260085105896} 11/06/2021 23:26:50 - INFO - __main__ - Step 15501: {'lr': 0.0004898051734555676, 'samples': 2976192, 'steps': 15500, 'loss/train': 1.9325761795043945} 11/06/2021 23:26:50 - INFO - __main__ - Step 15502: {'lr': 0.0004898036734067127, 'samples': 2976384, 'steps': 15501, 'loss/train': 1.47348153591156} 11/06/2021 23:26:51 - INFO - __main__ - Step 15503: {'lr': 0.000489802173249806, 'samples': 2976576, 'steps': 15502, 'loss/train': 1.6190365552902222} 11/06/2021 23:26:51 - INFO - __main__ - Step 15504: {'lr': 0.0004898006729848482, 'samples': 2976768, 'steps': 15503, 'loss/train': 2.0817368030548096} 11/06/2021 23:26:52 - INFO - __main__ - Step 15505: {'lr': 0.0004897991726118399, 'samples': 2976960, 'steps': 15504, 'loss/train': 1.478041172027588} 11/06/2021 23:26:52 - INFO - __main__ - Step 15506: {'lr': 0.0004897976721307818, 'samples': 2977152, 'steps': 15505, 'loss/train': 1.7012150287628174} 11/06/2021 23:26:52 - INFO - __main__ - Step 15507: {'lr': 0.0004897961715416746, 'samples': 2977344, 'steps': 15506, 'loss/train': 1.763744592666626} 11/06/2021 23:26:54 - INFO - __main__ - Step 15508: {'lr': 0.0004897946708445189, 'samples': 2977536, 'steps': 15507, 'loss/train': 1.7468358278274536} 11/06/2021 23:26:54 - INFO - __main__ - Step 15509: {'lr': 0.0004897931700393154, 'samples': 2977728, 'steps': 15508, 'loss/train': 1.7361414432525635} 11/06/2021 23:26:54 - INFO - __main__ - Step 15510: {'lr': 0.0004897916691260648, 'samples': 2977920, 'steps': 15509, 'loss/train': 1.7605504989624023} 11/06/2021 23:26:55 - INFO - __main__ - Step 15511: {'lr': 0.0004897901681047679, 'samples': 2978112, 'steps': 15510, 'loss/train': 1.6326351165771484} 11/06/2021 23:26:55 - INFO - __main__ - Step 15512: {'lr': 0.0004897886669754251, 'samples': 2978304, 'steps': 15511, 'loss/train': 0.6337660551071167} 11/06/2021 23:26:56 - INFO - __main__ - Step 15513: {'lr': 0.0004897871657380373, 'samples': 2978496, 'steps': 15512, 'loss/train': 1.2955080270767212} 11/06/2021 23:26:56 - INFO - __main__ - Step 15514: {'lr': 0.0004897856643926051, 'samples': 2978688, 'steps': 15513, 'loss/train': 1.6480381488800049} 11/06/2021 23:26:57 - INFO - __main__ - Step 15515: {'lr': 0.0004897841629391291, 'samples': 2978880, 'steps': 15514, 'loss/train': 1.8163772821426392} 11/06/2021 23:26:57 - INFO - __main__ - Step 15516: {'lr': 0.0004897826613776101, 'samples': 2979072, 'steps': 15515, 'loss/train': 0.7295439839363098} 11/06/2021 23:26:57 - INFO - __main__ - Step 15517: {'lr': 0.0004897811597080488, 'samples': 2979264, 'steps': 15516, 'loss/train': 1.664631962776184} 11/06/2021 23:26:58 - INFO - __main__ - Step 15518: {'lr': 0.0004897796579304458, 'samples': 2979456, 'steps': 15517, 'loss/train': 2.0190608501434326} 11/06/2021 23:26:59 - INFO - __main__ - Step 15519: {'lr': 0.0004897781560448017, 'samples': 2979648, 'steps': 15518, 'loss/train': 2.086932897567749} 11/06/2021 23:26:59 - INFO - __main__ - Step 15520: {'lr': 0.0004897766540511173, 'samples': 2979840, 'steps': 15519, 'loss/train': 1.5068973302841187} 11/06/2021 23:26:59 - INFO - __main__ - Step 15521: {'lr': 0.0004897751519493933, 'samples': 2980032, 'steps': 15520, 'loss/train': 1.2962274551391602} 11/06/2021 23:27:00 - INFO - __main__ - Step 15522: {'lr': 0.0004897736497396303, 'samples': 2980224, 'steps': 15521, 'loss/train': 1.6107763051986694} 11/06/2021 23:27:00 - INFO - __main__ - Step 15523: {'lr': 0.000489772147421829, 'samples': 2980416, 'steps': 15522, 'loss/train': 1.9014201164245605} 11/06/2021 23:27:01 - INFO - __main__ - Step 15524: {'lr': 0.0004897706449959899, 'samples': 2980608, 'steps': 15523, 'loss/train': 0.9415351748466492} 11/06/2021 23:27:02 - INFO - __main__ - Step 15525: {'lr': 0.000489769142462114, 'samples': 2980800, 'steps': 15524, 'loss/train': 1.7568836212158203} 11/06/2021 23:27:02 - INFO - __main__ - Step 15526: {'lr': 0.0004897676398202018, 'samples': 2980992, 'steps': 15525, 'loss/train': 1.8288424015045166} 11/06/2021 23:27:02 - INFO - __main__ - Step 15527: {'lr': 0.000489766137070254, 'samples': 2981184, 'steps': 15526, 'loss/train': 1.7062444686889648} 11/06/2021 23:27:03 - INFO - __main__ - Step 15528: {'lr': 0.0004897646342122713, 'samples': 2981376, 'steps': 15527, 'loss/train': 1.133548378944397} 11/06/2021 23:27:04 - INFO - __main__ - Step 15529: {'lr': 0.0004897631312462544, 'samples': 2981568, 'steps': 15528, 'loss/train': 1.6974700689315796} 11/06/2021 23:27:04 - INFO - __main__ - Step 15530: {'lr': 0.0004897616281722038, 'samples': 2981760, 'steps': 15529, 'loss/train': 1.8842085599899292} 11/06/2021 23:27:04 - INFO - __main__ - Step 15531: {'lr': 0.0004897601249901204, 'samples': 2981952, 'steps': 15530, 'loss/train': 1.8136495351791382} 11/06/2021 23:27:05 - INFO - __main__ - Step 15532: {'lr': 0.0004897586217000047, 'samples': 2982144, 'steps': 15531, 'loss/train': 1.5027600526809692} 11/06/2021 23:27:05 - INFO - __main__ - Step 15533: {'lr': 0.0004897571183018576, 'samples': 2982336, 'steps': 15532, 'loss/train': 1.7708982229232788} 11/06/2021 23:27:06 - INFO - __main__ - Step 15534: {'lr': 0.0004897556147956796, 'samples': 2982528, 'steps': 15533, 'loss/train': 1.2645087242126465} 11/06/2021 23:27:06 - INFO - __main__ - Step 15535: {'lr': 0.0004897541111814714, 'samples': 2982720, 'steps': 15534, 'loss/train': 1.635704755783081} 11/06/2021 23:27:07 - INFO - __main__ - Step 15536: {'lr': 0.0004897526074592337, 'samples': 2982912, 'steps': 15535, 'loss/train': 1.6044889688491821} 11/06/2021 23:27:07 - INFO - __main__ - Step 15537: {'lr': 0.0004897511036289671, 'samples': 2983104, 'steps': 15536, 'loss/train': 1.1408244371414185} 11/06/2021 23:27:07 - INFO - __main__ - Step 15538: {'lr': 0.0004897495996906725, 'samples': 2983296, 'steps': 15537, 'loss/train': 1.5581918954849243} 11/06/2021 23:27:08 - INFO - __main__ - Step 15539: {'lr': 0.0004897480956443503, 'samples': 2983488, 'steps': 15538, 'loss/train': 1.4675657749176025} 11/06/2021 23:27:09 - INFO - __main__ - Step 15540: {'lr': 0.0004897465914900013, 'samples': 2983680, 'steps': 15539, 'loss/train': 1.6012563705444336} 11/06/2021 23:27:09 - INFO - __main__ - Step 15541: {'lr': 0.0004897450872276263, 'samples': 2983872, 'steps': 15540, 'loss/train': 1.357732892036438} 11/06/2021 23:27:10 - INFO - __main__ - Step 15542: {'lr': 0.0004897435828572258, 'samples': 2984064, 'steps': 15541, 'loss/train': 1.6430869102478027} 11/06/2021 23:27:10 - INFO - __main__ - Step 15543: {'lr': 0.0004897420783788006, 'samples': 2984256, 'steps': 15542, 'loss/train': 1.0916229486465454} 11/06/2021 23:27:10 - INFO - __main__ - Step 15544: {'lr': 0.0004897405737923511, 'samples': 2984448, 'steps': 15543, 'loss/train': 1.7726327180862427} 11/06/2021 23:27:11 - INFO - __main__ - Step 15545: {'lr': 0.0004897390690978785, 'samples': 2984640, 'steps': 15544, 'loss/train': 1.823258876800537} 11/06/2021 23:27:12 - INFO - __main__ - Step 15546: {'lr': 0.000489737564295383, 'samples': 2984832, 'steps': 15545, 'loss/train': 1.5735461711883545} 11/06/2021 23:27:12 - INFO - __main__ - Step 15547: {'lr': 0.0004897360593848655, 'samples': 2985024, 'steps': 15546, 'loss/train': 1.367332935333252} 11/06/2021 23:27:12 - INFO - __main__ - Step 15548: {'lr': 0.0004897345543663266, 'samples': 2985216, 'steps': 15547, 'loss/train': 0.22535571455955505} 11/06/2021 23:27:13 - INFO - __main__ - Step 15549: {'lr': 0.000489733049239767, 'samples': 2985408, 'steps': 15548, 'loss/train': 1.6396257877349854} 11/06/2021 23:27:14 - INFO - __main__ - Step 15550: {'lr': 0.0004897315440051874, 'samples': 2985600, 'steps': 15549, 'loss/train': 1.6046090126037598} 11/06/2021 23:27:14 - INFO - __main__ - Step 15551: {'lr': 0.0004897300386625885, 'samples': 2985792, 'steps': 15550, 'loss/train': 1.0172430276870728} 11/06/2021 23:27:15 - INFO - __main__ - Step 15552: {'lr': 0.0004897285332119709, 'samples': 2985984, 'steps': 15551, 'loss/train': 0.5910050272941589} 11/06/2021 23:27:15 - INFO - __main__ - Step 15553: {'lr': 0.0004897270276533355, 'samples': 2986176, 'steps': 15552, 'loss/train': 1.4002232551574707} 11/06/2021 23:27:16 - INFO - __main__ - Step 15554: {'lr': 0.0004897255219866825, 'samples': 2986368, 'steps': 15553, 'loss/train': 1.4650905132293701} 11/06/2021 23:27:16 - INFO - __main__ - Step 15555: {'lr': 0.000489724016212013, 'samples': 2986560, 'steps': 15554, 'loss/train': 0.7079336643218994} 11/06/2021 23:27:17 - INFO - __main__ - Step 15556: {'lr': 0.0004897225103293277, 'samples': 2986752, 'steps': 15555, 'loss/train': 1.5303819179534912} 11/06/2021 23:27:17 - INFO - __main__ - Step 15557: {'lr': 0.0004897210043386269, 'samples': 2986944, 'steps': 15556, 'loss/train': 1.7852060794830322} 11/06/2021 23:27:18 - INFO - __main__ - Step 15558: {'lr': 0.0004897194982399117, 'samples': 2987136, 'steps': 15557, 'loss/train': 1.5519731044769287} 11/06/2021 23:27:18 - INFO - __main__ - Step 15559: {'lr': 0.0004897179920331826, 'samples': 2987328, 'steps': 15558, 'loss/train': 1.4157203435897827} 11/06/2021 23:27:18 - INFO - __main__ - Step 15560: {'lr': 0.0004897164857184401, 'samples': 2987520, 'steps': 15559, 'loss/train': 1.7595874071121216} 11/06/2021 23:27:19 - INFO - __main__ - Step 15561: {'lr': 0.0004897149792956852, 'samples': 2987712, 'steps': 15560, 'loss/train': 1.732194185256958} 11/06/2021 23:27:20 - INFO - __main__ - Step 15562: {'lr': 0.0004897134727649184, 'samples': 2987904, 'steps': 15561, 'loss/train': 1.4084118604660034} 11/06/2021 23:27:20 - INFO - __main__ - Step 15563: {'lr': 0.0004897119661261403, 'samples': 2988096, 'steps': 15562, 'loss/train': 1.490355134010315} 11/06/2021 23:27:20 - INFO - __main__ - Step 15564: {'lr': 0.0004897104593793518, 'samples': 2988288, 'steps': 15563, 'loss/train': 1.779022455215454} 11/06/2021 23:27:21 - INFO - __main__ - Step 15565: {'lr': 0.0004897089525245535, 'samples': 2988480, 'steps': 15564, 'loss/train': 1.8259437084197998} 11/06/2021 23:27:22 - INFO - __main__ - Step 15566: {'lr': 0.000489707445561746, 'samples': 2988672, 'steps': 15565, 'loss/train': 1.727378010749817} 11/06/2021 23:27:22 - INFO - __main__ - Step 15567: {'lr': 0.0004897059384909299, 'samples': 2988864, 'steps': 15566, 'loss/train': 1.8309595584869385} 11/06/2021 23:27:23 - INFO - __main__ - Step 15568: {'lr': 0.0004897044313121061, 'samples': 2989056, 'steps': 15567, 'loss/train': 5.84039306640625} 11/06/2021 23:27:23 - INFO - __main__ - Step 15569: {'lr': 0.0004897029240252753, 'samples': 2989248, 'steps': 15568, 'loss/train': 1.787092685699463} 11/06/2021 23:27:23 - INFO - __main__ - Step 15570: {'lr': 0.000489701416630438, 'samples': 2989440, 'steps': 15569, 'loss/train': 1.3923110961914062} 11/06/2021 23:27:24 - INFO - __main__ - Step 15571: {'lr': 0.0004896999091275948, 'samples': 2989632, 'steps': 15570, 'loss/train': 0.7924749255180359} 11/06/2021 23:27:25 - INFO - __main__ - Step 15572: {'lr': 0.0004896984015167466, 'samples': 2989824, 'steps': 15571, 'loss/train': 1.4174809455871582} 11/06/2021 23:27:25 - INFO - __main__ - Step 15573: {'lr': 0.0004896968937978941, 'samples': 2990016, 'steps': 15572, 'loss/train': 1.8383560180664062} 11/06/2021 23:27:25 - INFO - __main__ - Step 15574: {'lr': 0.0004896953859710379, 'samples': 2990208, 'steps': 15573, 'loss/train': 1.6366064548492432} 11/06/2021 23:27:26 - INFO - __main__ - Step 15575: {'lr': 0.0004896938780361784, 'samples': 2990400, 'steps': 15574, 'loss/train': 1.5936790704727173} 11/06/2021 23:27:26 - INFO - __main__ - Step 15576: {'lr': 0.0004896923699933167, 'samples': 2990592, 'steps': 15575, 'loss/train': 1.7249995470046997} 11/06/2021 23:27:27 - INFO - __main__ - Step 15577: {'lr': 0.0004896908618424533, 'samples': 2990784, 'steps': 15576, 'loss/train': 2.0188241004943848} 11/06/2021 23:27:28 - INFO - __main__ - Step 15578: {'lr': 0.0004896893535835889, 'samples': 2990976, 'steps': 15577, 'loss/train': 1.7882033586502075} 11/06/2021 23:27:28 - INFO - __main__ - Step 15579: {'lr': 0.0004896878452167241, 'samples': 2991168, 'steps': 15578, 'loss/train': 1.9642932415008545} 11/06/2021 23:27:28 - INFO - __main__ - Step 15580: {'lr': 0.0004896863367418598, 'samples': 2991360, 'steps': 15579, 'loss/train': 1.1707956790924072} 11/06/2021 23:27:29 - INFO - __main__ - Step 15581: {'lr': 0.0004896848281589966, 'samples': 2991552, 'steps': 15580, 'loss/train': 1.590392827987671} 11/06/2021 23:27:29 - INFO - __main__ - Step 15582: {'lr': 0.0004896833194681349, 'samples': 2991744, 'steps': 15581, 'loss/train': 2.1214358806610107} 11/06/2021 23:27:30 - INFO - __main__ - Step 15583: {'lr': 0.0004896818106692757, 'samples': 2991936, 'steps': 15582, 'loss/train': 1.77946937084198} 11/06/2021 23:27:30 - INFO - __main__ - Step 15584: {'lr': 0.0004896803017624196, 'samples': 2992128, 'steps': 15583, 'loss/train': 1.9012962579727173} 11/06/2021 23:27:31 - INFO - __main__ - Step 15585: {'lr': 0.0004896787927475671, 'samples': 2992320, 'steps': 15584, 'loss/train': 1.8630757331848145} 11/06/2021 23:27:31 - INFO - __main__ - Step 15586: {'lr': 0.0004896772836247192, 'samples': 2992512, 'steps': 15585, 'loss/train': 1.8197124004364014} 11/06/2021 23:27:31 - INFO - __main__ - Step 15587: {'lr': 0.0004896757743938764, 'samples': 2992704, 'steps': 15586, 'loss/train': 1.4077614545822144} 11/06/2021 23:27:33 - INFO - __main__ - Step 15588: {'lr': 0.0004896742650550393, 'samples': 2992896, 'steps': 15587, 'loss/train': 1.6958098411560059} 11/06/2021 23:27:33 - INFO - __main__ - Step 15589: {'lr': 0.0004896727556082086, 'samples': 2993088, 'steps': 15588, 'loss/train': 1.470612645149231} 11/06/2021 23:27:34 - INFO - __main__ - Step 15590: {'lr': 0.0004896712460533854, 'samples': 2993280, 'steps': 15589, 'loss/train': 0.6105215549468994} 11/06/2021 23:27:34 - INFO - __main__ - Step 15591: {'lr': 0.0004896697363905697, 'samples': 2993472, 'steps': 15590, 'loss/train': 1.653200387954712} 11/06/2021 23:27:34 - INFO - __main__ - Step 15592: {'lr': 0.0004896682266197626, 'samples': 2993664, 'steps': 15591, 'loss/train': 1.4414596557617188} 11/06/2021 23:27:35 - INFO - __main__ - Step 15593: {'lr': 0.0004896667167409648, 'samples': 2993856, 'steps': 15592, 'loss/train': 1.5426613092422485} 11/06/2021 23:27:36 - INFO - __main__ - Step 15594: {'lr': 0.0004896652067541767, 'samples': 2994048, 'steps': 15593, 'loss/train': 1.6049456596374512} 11/06/2021 23:27:36 - INFO - __main__ - Step 15595: {'lr': 0.0004896636966593993, 'samples': 2994240, 'steps': 15594, 'loss/train': 1.4910203218460083} 11/06/2021 23:27:36 - INFO - __main__ - Step 15596: {'lr': 0.0004896621864566331, 'samples': 2994432, 'steps': 15595, 'loss/train': 1.6735210418701172} 11/06/2021 23:27:37 - INFO - __main__ - Step 15597: {'lr': 0.0004896606761458788, 'samples': 2994624, 'steps': 15596, 'loss/train': 1.6641515493392944} 11/06/2021 23:27:38 - INFO - __main__ - Step 15598: {'lr': 0.0004896591657271371, 'samples': 2994816, 'steps': 15597, 'loss/train': 1.6537212133407593} 11/06/2021 23:27:38 - INFO - __main__ - Step 15599: {'lr': 0.0004896576552004087, 'samples': 2995008, 'steps': 15598, 'loss/train': 2.4560248851776123} 11/06/2021 23:27:38 - INFO - __main__ - Step 15600: {'lr': 0.0004896561445656943, 'samples': 2995200, 'steps': 15599, 'loss/train': 1.4348224401474} 11/06/2021 23:27:39 - INFO - __main__ - Step 15601: {'lr': 0.0004896546338229945, 'samples': 2995392, 'steps': 15600, 'loss/train': 1.6731430292129517} 11/06/2021 23:27:39 - INFO - __main__ - Step 15602: {'lr': 0.00048965312297231, 'samples': 2995584, 'steps': 15601, 'loss/train': 1.7863247394561768} 11/06/2021 23:27:39 - INFO - __main__ - Step 15603: {'lr': 0.0004896516120136415, 'samples': 2995776, 'steps': 15602, 'loss/train': 1.5890967845916748} 11/06/2021 23:27:40 - INFO - __main__ - Step 15604: {'lr': 0.0004896501009469896, 'samples': 2995968, 'steps': 15603, 'loss/train': 1.6799132823944092} 11/06/2021 23:27:41 - INFO - __main__ - Step 15605: {'lr': 0.0004896485897723552, 'samples': 2996160, 'steps': 15604, 'loss/train': 1.8443701267242432} 11/06/2021 23:27:41 - INFO - __main__ - Step 15606: {'lr': 0.0004896470784897388, 'samples': 2996352, 'steps': 15605, 'loss/train': 1.6562256813049316} 11/06/2021 23:27:42 - INFO - __main__ - Step 15607: {'lr': 0.0004896455670991411, 'samples': 2996544, 'steps': 15606, 'loss/train': 1.9659929275512695} 11/06/2021 23:27:42 - INFO - __main__ - Step 15608: {'lr': 0.0004896440556005628, 'samples': 2996736, 'steps': 15607, 'loss/train': 1.765615463256836} 11/06/2021 23:27:43 - INFO - __main__ - Step 15609: {'lr': 0.0004896425439940047, 'samples': 2996928, 'steps': 15608, 'loss/train': 1.5836148262023926} 11/06/2021 23:27:43 - INFO - __main__ - Step 15610: {'lr': 0.0004896410322794673, 'samples': 2997120, 'steps': 15609, 'loss/train': 1.9341999292373657} 11/06/2021 23:27:44 - INFO - __main__ - Step 15611: {'lr': 0.0004896395204569512, 'samples': 2997312, 'steps': 15610, 'loss/train': 2.4479007720947266} 11/06/2021 23:27:44 - INFO - __main__ - Step 15612: {'lr': 0.0004896380085264573, 'samples': 2997504, 'steps': 15611, 'loss/train': 1.4598520994186401} 11/06/2021 23:27:45 - INFO - __main__ - Step 15613: {'lr': 0.0004896364964879864, 'samples': 2997696, 'steps': 15612, 'loss/train': 1.7066932916641235} 11/06/2021 23:27:46 - INFO - __main__ - Step 15614: {'lr': 0.0004896349843415389, 'samples': 2997888, 'steps': 15613, 'loss/train': 1.8551546335220337} 11/06/2021 23:27:46 - INFO - __main__ - Step 15615: {'lr': 0.0004896334720871156, 'samples': 2998080, 'steps': 15614, 'loss/train': 1.7236617803573608} 11/06/2021 23:27:46 - INFO - __main__ - Step 15616: {'lr': 0.0004896319597247169, 'samples': 2998272, 'steps': 15615, 'loss/train': 1.543269157409668} 11/06/2021 23:27:47 - INFO - __main__ - Step 15617: {'lr': 0.0004896304472543439, 'samples': 2998464, 'steps': 15616, 'loss/train': 1.2389189004898071} 11/06/2021 23:27:47 - INFO - __main__ - Step 15618: {'lr': 0.0004896289346759973, 'samples': 2998656, 'steps': 15617, 'loss/train': 1.4939310550689697} 11/06/2021 23:27:47 - INFO - __main__ - Step 15619: {'lr': 0.0004896274219896773, 'samples': 2998848, 'steps': 15618, 'loss/train': 1.4448155164718628} 11/06/2021 23:27:49 - INFO - __main__ - Step 15620: {'lr': 0.000489625909195385, 'samples': 2999040, 'steps': 15619, 'loss/train': 1.5308935642242432} 11/06/2021 23:27:49 - INFO - __main__ - Step 15621: {'lr': 0.0004896243962931211, 'samples': 2999232, 'steps': 15620, 'loss/train': 1.9983116388320923} 11/06/2021 23:27:49 - INFO - __main__ - Step 15622: {'lr': 0.0004896228832828861, 'samples': 2999424, 'steps': 15621, 'loss/train': 1.443530559539795} 11/06/2021 23:27:50 - INFO - __main__ - Step 15623: {'lr': 0.0004896213701646806, 'samples': 2999616, 'steps': 15622, 'loss/train': 1.5524290800094604} 11/06/2021 23:27:50 - INFO - __main__ - Step 15624: {'lr': 0.0004896198569385055, 'samples': 2999808, 'steps': 15623, 'loss/train': 1.7541719675064087} 11/06/2021 23:27:51 - INFO - __main__ - Step 15625: {'lr': 0.0004896183436043613, 'samples': 3000000, 'steps': 15624, 'loss/train': 1.905300498008728} 11/06/2021 23:27:51 - INFO - __main__ - Step 15626: {'lr': 0.0004896168301622488, 'samples': 3000192, 'steps': 15625, 'loss/train': 1.9676408767700195} 11/06/2021 23:27:52 - INFO - __main__ - Step 15627: {'lr': 0.0004896153166121688, 'samples': 3000384, 'steps': 15626, 'loss/train': 1.2466262578964233} 11/06/2021 23:27:52 - INFO - __main__ - Step 15628: {'lr': 0.0004896138029541217, 'samples': 3000576, 'steps': 15627, 'loss/train': 1.6408860683441162} 11/06/2021 23:27:52 - INFO - __main__ - Step 15629: {'lr': 0.0004896122891881083, 'samples': 3000768, 'steps': 15628, 'loss/train': 1.4601120948791504} 11/06/2021 23:27:53 - INFO - __main__ - Step 15630: {'lr': 0.0004896107753141293, 'samples': 3000960, 'steps': 15629, 'loss/train': 1.7563142776489258} 11/06/2021 23:27:54 - INFO - __main__ - Step 15631: {'lr': 0.0004896092613321854, 'samples': 3001152, 'steps': 15630, 'loss/train': 2.3639023303985596} 11/06/2021 23:27:54 - INFO - __main__ - Step 15632: {'lr': 0.0004896077472422773, 'samples': 3001344, 'steps': 15631, 'loss/train': 1.8587177991867065} 11/06/2021 23:27:54 - INFO - __main__ - Step 15633: {'lr': 0.0004896062330444057, 'samples': 3001536, 'steps': 15632, 'loss/train': 1.8706539869308472} 11/06/2021 23:27:55 - INFO - __main__ - Step 15634: {'lr': 0.0004896047187385711, 'samples': 3001728, 'steps': 15633, 'loss/train': 1.3826926946640015} 11/06/2021 23:27:56 - INFO - __main__ - Step 15635: {'lr': 0.0004896032043247744, 'samples': 3001920, 'steps': 15634, 'loss/train': 1.9541095495224} 11/06/2021 23:27:56 - INFO - __main__ - Step 15636: {'lr': 0.0004896016898030161, 'samples': 3002112, 'steps': 15635, 'loss/train': 1.0642521381378174} 11/06/2021 23:27:57 - INFO - __main__ - Step 15637: {'lr': 0.0004896001751732971, 'samples': 3002304, 'steps': 15636, 'loss/train': 0.9917707443237305} 11/06/2021 23:27:57 - INFO - __main__ - Step 15638: {'lr': 0.0004895986604356178, 'samples': 3002496, 'steps': 15637, 'loss/train': 1.6697584390640259} 11/06/2021 23:27:57 - INFO - __main__ - Step 15639: {'lr': 0.0004895971455899792, 'samples': 3002688, 'steps': 15638, 'loss/train': 1.6163402795791626} 11/06/2021 23:27:58 - INFO - __main__ - Step 15640: {'lr': 0.0004895956306363818, 'samples': 3002880, 'steps': 15639, 'loss/train': 0.30415403842926025} 11/06/2021 23:27:59 - INFO - __main__ - Step 15641: {'lr': 0.0004895941155748263, 'samples': 3003072, 'steps': 15640, 'loss/train': 1.7623761892318726} 11/06/2021 23:27:59 - INFO - __main__ - Step 15642: {'lr': 0.0004895926004053133, 'samples': 3003264, 'steps': 15641, 'loss/train': 1.5293596982955933} 11/06/2021 23:27:59 - INFO - __main__ - Step 15643: {'lr': 0.0004895910851278436, 'samples': 3003456, 'steps': 15642, 'loss/train': 1.518384575843811} 11/06/2021 23:28:00 - INFO - __main__ - Step 15644: {'lr': 0.0004895895697424179, 'samples': 3003648, 'steps': 15643, 'loss/train': 1.6720932722091675} 11/06/2021 23:28:00 - INFO - __main__ - Step 15645: {'lr': 0.0004895880542490369, 'samples': 3003840, 'steps': 15644, 'loss/train': 1.7592436075210571} 11/06/2021 23:28:01 - INFO - __main__ - Step 15646: {'lr': 0.0004895865386477011, 'samples': 3004032, 'steps': 15645, 'loss/train': 0.32387545704841614} 11/06/2021 23:28:02 - INFO - __main__ - Step 15647: {'lr': 0.0004895850229384113, 'samples': 3004224, 'steps': 15646, 'loss/train': 1.4493197202682495} 11/06/2021 23:28:02 - INFO - __main__ - Step 15648: {'lr': 0.0004895835071211682, 'samples': 3004416, 'steps': 15647, 'loss/train': 1.6152663230895996} 11/06/2021 23:28:02 - INFO - __main__ - Step 15649: {'lr': 0.0004895819911959725, 'samples': 3004608, 'steps': 15648, 'loss/train': 0.838192343711853} 11/06/2021 23:28:03 - INFO - __main__ - Step 15650: {'lr': 0.0004895804751628249, 'samples': 3004800, 'steps': 15649, 'loss/train': 1.9193270206451416} 11/06/2021 23:28:04 - INFO - __main__ - Step 15651: {'lr': 0.0004895789590217259, 'samples': 3004992, 'steps': 15650, 'loss/train': 1.8412024974822998} 11/06/2021 23:28:04 - INFO - __main__ - Step 15652: {'lr': 0.0004895774427726764, 'samples': 3005184, 'steps': 15651, 'loss/train': 1.4921084642410278} 11/06/2021 23:28:04 - INFO - __main__ - Step 15653: {'lr': 0.000489575926415677, 'samples': 3005376, 'steps': 15652, 'loss/train': 0.7553501725196838} 11/06/2021 23:28:05 - INFO - __main__ - Step 15654: {'lr': 0.0004895744099507284, 'samples': 3005568, 'steps': 15653, 'loss/train': 1.9209213256835938} 11/06/2021 23:28:05 - INFO - __main__ - Step 15655: {'lr': 0.0004895728933778313, 'samples': 3005760, 'steps': 15654, 'loss/train': 1.9080966711044312} 11/06/2021 23:28:06 - INFO - __main__ - Step 15656: {'lr': 0.0004895713766969863, 'samples': 3005952, 'steps': 15655, 'loss/train': 1.7056766748428345} 11/06/2021 23:28:06 - INFO - __main__ - Step 15657: {'lr': 0.0004895698599081942, 'samples': 3006144, 'steps': 15656, 'loss/train': 0.8188896775245667} 11/06/2021 23:28:07 - INFO - __main__ - Step 15658: {'lr': 0.0004895683430114555, 'samples': 3006336, 'steps': 15657, 'loss/train': 1.6825709342956543} 11/06/2021 23:28:07 - INFO - __main__ - Step 15659: {'lr': 0.0004895668260067711, 'samples': 3006528, 'steps': 15658, 'loss/train': 1.427496314048767} 11/06/2021 23:28:07 - INFO - __main__ - Step 15660: {'lr': 0.0004895653088941416, 'samples': 3006720, 'steps': 15659, 'loss/train': 1.5132235288619995} 11/06/2021 23:28:09 - INFO - __main__ - Step 15661: {'lr': 0.0004895637916735675, 'samples': 3006912, 'steps': 15660, 'loss/train': 2.0918915271759033} 11/06/2021 23:28:09 - INFO - __main__ - Step 15662: {'lr': 0.0004895622743450497, 'samples': 3007104, 'steps': 15661, 'loss/train': 1.9682589769363403} 11/06/2021 23:28:09 - INFO - __main__ - Step 15663: {'lr': 0.000489560756908589, 'samples': 3007296, 'steps': 15662, 'loss/train': 1.5035189390182495} 11/06/2021 23:28:10 - INFO - __main__ - Step 15664: {'lr': 0.0004895592393641858, 'samples': 3007488, 'steps': 15663, 'loss/train': 2.122490406036377} 11/06/2021 23:28:10 - INFO - __main__ - Step 15665: {'lr': 0.0004895577217118408, 'samples': 3007680, 'steps': 15664, 'loss/train': 1.6128400564193726} 11/06/2021 23:28:10 - INFO - __main__ - Step 15666: {'lr': 0.000489556203951555, 'samples': 3007872, 'steps': 15665, 'loss/train': 1.7595479488372803} 11/06/2021 23:28:11 - INFO - __main__ - Step 15667: {'lr': 0.0004895546860833287, 'samples': 3008064, 'steps': 15666, 'loss/train': 1.5410199165344238} 11/06/2021 23:28:12 - INFO - __main__ - Step 15668: {'lr': 0.000489553168107163, 'samples': 3008256, 'steps': 15667, 'loss/train': 1.6218812465667725} 11/06/2021 23:28:12 - INFO - __main__ - Step 15669: {'lr': 0.0004895516500230581, 'samples': 3008448, 'steps': 15668, 'loss/train': 1.6784151792526245} 11/06/2021 23:28:12 - INFO - __main__ - Step 15670: {'lr': 0.000489550131831015, 'samples': 3008640, 'steps': 15669, 'loss/train': 1.0655637979507446} 11/06/2021 23:28:13 - INFO - __main__ - Step 15671: {'lr': 0.0004895486135310343, 'samples': 3008832, 'steps': 15670, 'loss/train': 1.7083613872528076} 11/06/2021 23:28:14 - INFO - __main__ - Step 15672: {'lr': 0.0004895470951231166, 'samples': 3009024, 'steps': 15671, 'loss/train': 1.3929165601730347} 11/06/2021 23:28:14 - INFO - __main__ - Step 15673: {'lr': 0.0004895455766072629, 'samples': 3009216, 'steps': 15672, 'loss/train': 1.3593981266021729} 11/06/2021 23:28:15 - INFO - __main__ - Step 15674: {'lr': 0.0004895440579834736, 'samples': 3009408, 'steps': 15673, 'loss/train': 2.2362141609191895} 11/06/2021 23:28:15 - INFO - __main__ - Step 15675: {'lr': 0.0004895425392517493, 'samples': 3009600, 'steps': 15674, 'loss/train': 1.799658179283142} 11/06/2021 23:28:15 - INFO - __main__ - Step 15676: {'lr': 0.0004895410204120909, 'samples': 3009792, 'steps': 15675, 'loss/train': 0.6833590865135193} 11/06/2021 23:28:16 - INFO - __main__ - Step 15677: {'lr': 0.000489539501464499, 'samples': 3009984, 'steps': 15676, 'loss/train': 1.8790018558502197} 11/06/2021 23:28:17 - INFO - __main__ - Step 15678: {'lr': 0.0004895379824089743, 'samples': 3010176, 'steps': 15677, 'loss/train': 1.4177005290985107} 11/06/2021 23:28:17 - INFO - __main__ - Step 15679: {'lr': 0.0004895364632455175, 'samples': 3010368, 'steps': 15678, 'loss/train': 1.675038456916809} 11/06/2021 23:28:18 - INFO - __main__ - Step 15680: {'lr': 0.0004895349439741292, 'samples': 3010560, 'steps': 15679, 'loss/train': 1.6020910739898682} 11/06/2021 23:28:18 - INFO - __main__ - Step 15681: {'lr': 0.0004895334245948103, 'samples': 3010752, 'steps': 15680, 'loss/train': 1.6132880449295044} 11/06/2021 23:28:18 - INFO - __main__ - Step 15682: {'lr': 0.0004895319051075612, 'samples': 3010944, 'steps': 15681, 'loss/train': 1.2223986387252808} 11/06/2021 23:28:19 - INFO - __main__ - Step 15683: {'lr': 0.0004895303855123828, 'samples': 3011136, 'steps': 15682, 'loss/train': 1.8160881996154785} 11/06/2021 23:28:20 - INFO - __main__ - Step 15684: {'lr': 0.0004895288658092757, 'samples': 3011328, 'steps': 15683, 'loss/train': 1.8735129833221436} 11/06/2021 23:28:20 - INFO - __main__ - Step 15685: {'lr': 0.0004895273459982406, 'samples': 3011520, 'steps': 15684, 'loss/train': 2.2502872943878174} 11/06/2021 23:28:20 - INFO - __main__ - Step 15686: {'lr': 0.0004895258260792781, 'samples': 3011712, 'steps': 15685, 'loss/train': 1.674849271774292} 11/06/2021 23:28:21 - INFO - __main__ - Step 15687: {'lr': 0.0004895243060523889, 'samples': 3011904, 'steps': 15686, 'loss/train': 1.7530903816223145} 11/06/2021 23:28:22 - INFO - __main__ - Step 15688: {'lr': 0.0004895227859175739, 'samples': 3012096, 'steps': 15687, 'loss/train': 1.8542940616607666} 11/06/2021 23:28:22 - INFO - __main__ - Step 15689: {'lr': 0.0004895212656748336, 'samples': 3012288, 'steps': 15688, 'loss/train': 1.7547863721847534} 11/06/2021 23:28:23 - INFO - __main__ - Step 15690: {'lr': 0.0004895197453241687, 'samples': 3012480, 'steps': 15689, 'loss/train': 1.8645319938659668} 11/06/2021 23:28:23 - INFO - __main__ - Step 15691: {'lr': 0.0004895182248655798, 'samples': 3012672, 'steps': 15690, 'loss/train': 1.0751885175704956} 11/06/2021 23:28:23 - INFO - __main__ - Step 15692: {'lr': 0.0004895167042990678, 'samples': 3012864, 'steps': 15691, 'loss/train': 1.3243579864501953} 11/06/2021 23:28:24 - INFO - __main__ - Step 15693: {'lr': 0.0004895151836246332, 'samples': 3013056, 'steps': 15692, 'loss/train': 0.8072070479393005} 11/06/2021 23:28:25 - INFO - __main__ - Step 15694: {'lr': 0.0004895136628422767, 'samples': 3013248, 'steps': 15693, 'loss/train': 1.9552663564682007} 11/06/2021 23:28:25 - INFO - __main__ - Step 15695: {'lr': 0.0004895121419519992, 'samples': 3013440, 'steps': 15694, 'loss/train': 1.4452375173568726} 11/06/2021 23:28:25 - INFO - __main__ - Step 15696: {'lr': 0.0004895106209538011, 'samples': 3013632, 'steps': 15695, 'loss/train': 1.48054039478302} 11/06/2021 23:28:26 - INFO - __main__ - Step 15697: {'lr': 0.0004895090998476833, 'samples': 3013824, 'steps': 15696, 'loss/train': 1.7760313749313354} 11/06/2021 23:28:27 - INFO - __main__ - Step 15698: {'lr': 0.0004895075786336463, 'samples': 3014016, 'steps': 15697, 'loss/train': 3.4259486198425293} 11/06/2021 23:28:27 - INFO - __main__ - Step 15699: {'lr': 0.000489506057311691, 'samples': 3014208, 'steps': 15698, 'loss/train': 1.669292688369751} 11/06/2021 23:28:28 - INFO - __main__ - Step 15700: {'lr': 0.0004895045358818179, 'samples': 3014400, 'steps': 15699, 'loss/train': 1.6699222326278687} 11/06/2021 23:28:28 - INFO - __main__ - Step 15701: {'lr': 0.0004895030143440278, 'samples': 3014592, 'steps': 15700, 'loss/train': 1.9887073040008545} 11/06/2021 23:28:28 - INFO - __main__ - Step 15702: {'lr': 0.0004895014926983212, 'samples': 3014784, 'steps': 15701, 'loss/train': 2.682663679122925} 11/06/2021 23:28:30 - INFO - __main__ - Step 15703: {'lr': 0.0004894999709446991, 'samples': 3014976, 'steps': 15702, 'loss/train': 1.6473217010498047} 11/06/2021 23:28:31 - INFO - __main__ - Step 15704: {'lr': 0.0004894984490831619, 'samples': 3015168, 'steps': 15703, 'loss/train': 1.7859365940093994} 11/06/2021 23:28:31 - INFO - __main__ - Step 15705: {'lr': 0.0004894969271137104, 'samples': 3015360, 'steps': 15704, 'loss/train': 1.9558829069137573} 11/06/2021 23:28:31 - INFO - __main__ - Step 15706: {'lr': 0.0004894954050363452, 'samples': 3015552, 'steps': 15705, 'loss/train': 0.8179440498352051} 11/06/2021 23:28:32 - INFO - __main__ - Step 15707: {'lr': 0.0004894938828510672, 'samples': 3015744, 'steps': 15706, 'loss/train': 1.4025057554244995} 11/06/2021 23:28:32 - INFO - __main__ - Step 15708: {'lr': 0.000489492360557877, 'samples': 3015936, 'steps': 15707, 'loss/train': 1.384521722793579} 11/06/2021 23:28:32 - INFO - __main__ - Step 15709: {'lr': 0.0004894908381567751, 'samples': 3016128, 'steps': 15708, 'loss/train': 1.8304907083511353} 11/06/2021 23:28:33 - INFO - __main__ - Step 15710: {'lr': 0.0004894893156477623, 'samples': 3016320, 'steps': 15709, 'loss/train': 1.7960162162780762} 11/06/2021 23:28:34 - INFO - __main__ - Step 15711: {'lr': 0.0004894877930308395, 'samples': 3016512, 'steps': 15710, 'loss/train': 0.9428093433380127} 11/06/2021 23:28:34 - INFO - __main__ - Step 15712: {'lr': 0.0004894862703060071, 'samples': 3016704, 'steps': 15711, 'loss/train': 1.633742094039917} 11/06/2021 23:28:35 - INFO - __main__ - Step 15713: {'lr': 0.0004894847474732658, 'samples': 3016896, 'steps': 15712, 'loss/train': 1.6129860877990723} 11/06/2021 23:28:35 - INFO - __main__ - Step 15714: {'lr': 0.0004894832245326165, 'samples': 3017088, 'steps': 15713, 'loss/train': 1.7154533863067627} 11/06/2021 23:28:36 - INFO - __main__ - Step 15715: {'lr': 0.0004894817014840597, 'samples': 3017280, 'steps': 15714, 'loss/train': 0.8715106844902039} 11/06/2021 23:28:36 - INFO - __main__ - Step 15716: {'lr': 0.0004894801783275961, 'samples': 3017472, 'steps': 15715, 'loss/train': 1.5612425804138184} 11/06/2021 23:28:37 - INFO - __main__ - Step 15717: {'lr': 0.0004894786550632264, 'samples': 3017664, 'steps': 15716, 'loss/train': 1.774351954460144} 11/06/2021 23:28:37 - INFO - __main__ - Step 15718: {'lr': 0.0004894771316909514, 'samples': 3017856, 'steps': 15717, 'loss/train': 1.6608457565307617} 11/06/2021 23:28:38 - INFO - __main__ - Step 15719: {'lr': 0.0004894756082107717, 'samples': 3018048, 'steps': 15718, 'loss/train': 1.3828167915344238} 11/06/2021 23:28:38 - INFO - __main__ - Step 15720: {'lr': 0.0004894740846226879, 'samples': 3018240, 'steps': 15719, 'loss/train': 1.9708067178726196} 11/06/2021 23:28:39 - INFO - __main__ - Step 15721: {'lr': 0.0004894725609267009, 'samples': 3018432, 'steps': 15720, 'loss/train': 1.6421191692352295} 11/06/2021 23:28:39 - INFO - __main__ - Step 15722: {'lr': 0.0004894710371228111, 'samples': 3018624, 'steps': 15721, 'loss/train': 1.9015141725540161} 11/06/2021 23:28:40 - INFO - __main__ - Step 15723: {'lr': 0.0004894695132110196, 'samples': 3018816, 'steps': 15722, 'loss/train': 1.9826589822769165} 11/06/2021 23:28:40 - INFO - __main__ - Step 15724: {'lr': 0.0004894679891913266, 'samples': 3019008, 'steps': 15723, 'loss/train': 1.9029018878936768} 11/06/2021 23:28:40 - INFO - __main__ - Step 15725: {'lr': 0.000489466465063733, 'samples': 3019200, 'steps': 15724, 'loss/train': 1.4145636558532715} 11/06/2021 23:28:41 - INFO - __main__ - Step 15726: {'lr': 0.0004894649408282396, 'samples': 3019392, 'steps': 15725, 'loss/train': 1.6551494598388672} 11/06/2021 23:28:42 - INFO - __main__ - Step 15727: {'lr': 0.000489463416484847, 'samples': 3019584, 'steps': 15726, 'loss/train': 1.8325507640838623} 11/06/2021 23:28:42 - INFO - __main__ - Step 15728: {'lr': 0.0004894618920335558, 'samples': 3019776, 'steps': 15727, 'loss/train': 1.8709338903427124} 11/06/2021 23:28:43 - INFO - __main__ - Step 15729: {'lr': 0.0004894603674743668, 'samples': 3019968, 'steps': 15728, 'loss/train': 1.7608121633529663} 11/06/2021 23:28:43 - INFO - __main__ - Step 15730: {'lr': 0.0004894588428072808, 'samples': 3020160, 'steps': 15729, 'loss/train': 1.6138921976089478} 11/06/2021 23:28:43 - INFO - __main__ - Step 15731: {'lr': 0.0004894573180322982, 'samples': 3020352, 'steps': 15730, 'loss/train': 1.558252215385437} 11/06/2021 23:28:44 - INFO - __main__ - Step 15732: {'lr': 0.0004894557931494199, 'samples': 3020544, 'steps': 15731, 'loss/train': 1.370026707649231} 11/06/2021 23:28:45 - INFO - __main__ - Step 15733: {'lr': 0.0004894542681586465, 'samples': 3020736, 'steps': 15732, 'loss/train': 1.6896443367004395} 11/06/2021 23:28:45 - INFO - __main__ - Step 15734: {'lr': 0.0004894527430599786, 'samples': 3020928, 'steps': 15733, 'loss/train': 1.5409351587295532} 11/06/2021 23:28:45 - INFO - __main__ - Step 15735: {'lr': 0.0004894512178534171, 'samples': 3021120, 'steps': 15734, 'loss/train': 1.5735464096069336} 11/06/2021 23:28:46 - INFO - __main__ - Step 15736: {'lr': 0.0004894496925389625, 'samples': 3021312, 'steps': 15735, 'loss/train': 2.1567623615264893} 11/06/2021 23:28:47 - INFO - __main__ - Step 15737: {'lr': 0.0004894481671166155, 'samples': 3021504, 'steps': 15736, 'loss/train': 1.9780317544937134} 11/06/2021 23:28:47 - INFO - __main__ - Step 15738: {'lr': 0.0004894466415863771, 'samples': 3021696, 'steps': 15737, 'loss/train': 1.610995888710022} 11/06/2021 23:28:48 - INFO - __main__ - Step 15739: {'lr': 0.0004894451159482476, 'samples': 3021888, 'steps': 15738, 'loss/train': 1.5764869451522827} 11/06/2021 23:28:48 - INFO - __main__ - Step 15740: {'lr': 0.0004894435902022277, 'samples': 3022080, 'steps': 15739, 'loss/train': 1.3114335536956787} 11/06/2021 23:28:48 - INFO - __main__ - Step 15741: {'lr': 0.0004894420643483184, 'samples': 3022272, 'steps': 15740, 'loss/train': 1.8506090641021729} 11/06/2021 23:28:50 - INFO - __main__ - Step 15742: {'lr': 0.0004894405383865201, 'samples': 3022464, 'steps': 15741, 'loss/train': 1.5209318399429321} 11/06/2021 23:28:50 - INFO - __main__ - Step 15743: {'lr': 0.0004894390123168337, 'samples': 3022656, 'steps': 15742, 'loss/train': 1.4575647115707397} 11/06/2021 23:28:50 - INFO - __main__ - Step 15744: {'lr': 0.0004894374861392596, 'samples': 3022848, 'steps': 15743, 'loss/train': 2.133335828781128} 11/06/2021 23:28:51 - INFO - __main__ - Step 15745: {'lr': 0.0004894359598537987, 'samples': 3023040, 'steps': 15744, 'loss/train': 1.7577073574066162} 11/06/2021 23:28:52 - INFO - __main__ - Step 15746: {'lr': 0.0004894344334604517, 'samples': 3023232, 'steps': 15745, 'loss/train': 0.9471118450164795} 11/06/2021 23:28:52 - INFO - __main__ - Step 15747: {'lr': 0.0004894329069592192, 'samples': 3023424, 'steps': 15746, 'loss/train': 1.9907408952713013} 11/06/2021 23:28:53 - INFO - __main__ - Step 15748: {'lr': 0.000489431380350102, 'samples': 3023616, 'steps': 15747, 'loss/train': 2.8583734035491943} 11/06/2021 23:28:53 - INFO - __main__ - Step 15749: {'lr': 0.0004894298536331007, 'samples': 3023808, 'steps': 15748, 'loss/train': 0.6665930151939392} 11/06/2021 23:28:53 - INFO - __main__ - Step 15750: {'lr': 0.000489428326808216, 'samples': 3024000, 'steps': 15749, 'loss/train': 1.0621652603149414} 11/06/2021 23:28:54 - INFO - __main__ - Step 15751: {'lr': 0.0004894267998754486, 'samples': 3024192, 'steps': 15750, 'loss/train': 1.5037894248962402} 11/06/2021 23:28:55 - INFO - __main__ - Step 15752: {'lr': 0.0004894252728347992, 'samples': 3024384, 'steps': 15751, 'loss/train': 1.6653718948364258} 11/06/2021 23:28:55 - INFO - __main__ - Step 15753: {'lr': 0.0004894237456862684, 'samples': 3024576, 'steps': 15752, 'loss/train': 1.7493863105773926} 11/06/2021 23:28:55 - INFO - __main__ - Step 15754: {'lr': 0.000489422218429857, 'samples': 3024768, 'steps': 15753, 'loss/train': 1.5917749404907227} 11/06/2021 23:28:56 - INFO - __main__ - Step 15755: {'lr': 0.0004894206910655656, 'samples': 3024960, 'steps': 15754, 'loss/train': 1.5232791900634766} 11/06/2021 23:28:56 - INFO - __main__ - Step 15756: {'lr': 0.0004894191635933949, 'samples': 3025152, 'steps': 15755, 'loss/train': 2.1717846393585205} 11/06/2021 23:28:57 - INFO - __main__ - Step 15757: {'lr': 0.0004894176360133456, 'samples': 3025344, 'steps': 15756, 'loss/train': 1.9181817770004272} 11/06/2021 23:28:58 - INFO - __main__ - Step 15758: {'lr': 0.0004894161083254186, 'samples': 3025536, 'steps': 15757, 'loss/train': 2.2715911865234375} 11/06/2021 23:28:58 - INFO - __main__ - Step 15759: {'lr': 0.0004894145805296143, 'samples': 3025728, 'steps': 15758, 'loss/train': 1.8335258960723877} 11/06/2021 23:28:58 - INFO - __main__ - Step 15760: {'lr': 0.0004894130526259334, 'samples': 3025920, 'steps': 15759, 'loss/train': 1.284491777420044} 11/06/2021 23:28:59 - INFO - __main__ - Step 15761: {'lr': 0.0004894115246143768, 'samples': 3026112, 'steps': 15760, 'loss/train': 1.778687834739685} 11/06/2021 23:29:00 - INFO - __main__ - Step 15762: {'lr': 0.0004894099964949449, 'samples': 3026304, 'steps': 15761, 'loss/train': 1.093184471130371} 11/06/2021 23:29:00 - INFO - __main__ - Step 15763: {'lr': 0.0004894084682676387, 'samples': 3026496, 'steps': 15762, 'loss/train': 1.5207335948944092} 11/06/2021 23:29:01 - INFO - __main__ - Step 15764: {'lr': 0.0004894069399324586, 'samples': 3026688, 'steps': 15763, 'loss/train': 1.5955839157104492} 11/06/2021 23:29:01 - INFO - __main__ - Step 15765: {'lr': 0.0004894054114894055, 'samples': 3026880, 'steps': 15764, 'loss/train': 1.9538586139678955} 11/06/2021 23:29:01 - INFO - __main__ - Step 15766: {'lr': 0.00048940388293848, 'samples': 3027072, 'steps': 15765, 'loss/train': 1.9052053689956665} 11/06/2021 23:29:02 - INFO - __main__ - Step 15767: {'lr': 0.000489402354279683, 'samples': 3027264, 'steps': 15766, 'loss/train': 1.6094344854354858} 11/06/2021 23:29:03 - INFO - __main__ - Step 15768: {'lr': 0.0004894008255130147, 'samples': 3027456, 'steps': 15767, 'loss/train': 1.5516875982284546} 11/06/2021 23:29:03 - INFO - __main__ - Step 15769: {'lr': 0.0004893992966384762, 'samples': 3027648, 'steps': 15768, 'loss/train': 2.32871675491333} 11/06/2021 23:29:03 - INFO - __main__ - Step 15770: {'lr': 0.0004893977676560682, 'samples': 3027840, 'steps': 15769, 'loss/train': 1.62233567237854} 11/06/2021 23:29:04 - INFO - __main__ - Step 15771: {'lr': 0.000489396238565791, 'samples': 3028032, 'steps': 15770, 'loss/train': 1.8400375843048096} 11/06/2021 23:29:05 - INFO - __main__ - Step 15772: {'lr': 0.0004893947093676458, 'samples': 3028224, 'steps': 15771, 'loss/train': 1.4160466194152832} 11/06/2021 23:29:05 - INFO - __main__ - Step 15773: {'lr': 0.0004893931800616329, 'samples': 3028416, 'steps': 15772, 'loss/train': 1.8800396919250488} 11/06/2021 23:29:06 - INFO - __main__ - Step 15774: {'lr': 0.0004893916506477532, 'samples': 3028608, 'steps': 15773, 'loss/train': 1.4598236083984375} 11/06/2021 23:29:06 - INFO - __main__ - Step 15775: {'lr': 0.0004893901211260073, 'samples': 3028800, 'steps': 15774, 'loss/train': 1.329248309135437} 11/06/2021 23:29:06 - INFO - __main__ - Step 15776: {'lr': 0.0004893885914963958, 'samples': 3028992, 'steps': 15775, 'loss/train': 1.977927565574646} 11/06/2021 23:29:07 - INFO - __main__ - Step 15777: {'lr': 0.0004893870617589196, 'samples': 3029184, 'steps': 15776, 'loss/train': 2.081711530685425} 11/06/2021 23:29:08 - INFO - __main__ - Step 15778: {'lr': 0.0004893855319135791, 'samples': 3029376, 'steps': 15777, 'loss/train': 1.1884722709655762} 11/06/2021 23:29:08 - INFO - __main__ - Step 15779: {'lr': 0.0004893840019603754, 'samples': 3029568, 'steps': 15778, 'loss/train': 1.492553949356079} 11/06/2021 23:29:08 - INFO - __main__ - Step 15780: {'lr': 0.0004893824718993088, 'samples': 3029760, 'steps': 15779, 'loss/train': 1.4190195798873901} 11/06/2021 23:29:09 - INFO - __main__ - Step 15781: {'lr': 0.0004893809417303803, 'samples': 3029952, 'steps': 15780, 'loss/train': 1.5243641138076782} 11/06/2021 23:29:09 - INFO - __main__ - Step 15782: {'lr': 0.0004893794114535905, 'samples': 3030144, 'steps': 15781, 'loss/train': 1.7579809427261353} 11/06/2021 23:29:10 - INFO - __main__ - Step 15783: {'lr': 0.0004893778810689399, 'samples': 3030336, 'steps': 15782, 'loss/train': 5.995852470397949} 11/06/2021 23:29:11 - INFO - __main__ - Step 15784: {'lr': 0.0004893763505764292, 'samples': 3030528, 'steps': 15783, 'loss/train': 1.6866079568862915} 11/06/2021 23:29:11 - INFO - __main__ - Step 15785: {'lr': 0.0004893748199760594, 'samples': 3030720, 'steps': 15784, 'loss/train': 2.882068157196045} 11/06/2021 23:29:11 - INFO - __main__ - Step 15786: {'lr': 0.0004893732892678309, 'samples': 3030912, 'steps': 15785, 'loss/train': 1.8355077505111694} 11/06/2021 23:29:12 - INFO - __main__ - Step 15787: {'lr': 0.0004893717584517445, 'samples': 3031104, 'steps': 15786, 'loss/train': 1.4316880702972412} 11/06/2021 23:29:12 - INFO - __main__ - Step 15788: {'lr': 0.000489370227527801, 'samples': 3031296, 'steps': 15787, 'loss/train': 1.8777408599853516} 11/06/2021 23:29:13 - INFO - __main__ - Step 15789: {'lr': 0.0004893686964960009, 'samples': 3031488, 'steps': 15788, 'loss/train': 1.4991892576217651} 11/06/2021 23:29:13 - INFO - __main__ - Step 15790: {'lr': 0.0004893671653563448, 'samples': 3031680, 'steps': 15789, 'loss/train': 1.8356255292892456} 11/06/2021 23:29:14 - INFO - __main__ - Step 15791: {'lr': 0.0004893656341088338, 'samples': 3031872, 'steps': 15790, 'loss/train': 1.778326392173767} 11/06/2021 23:29:14 - INFO - __main__ - Step 15792: {'lr': 0.0004893641027534682, 'samples': 3032064, 'steps': 15791, 'loss/train': 1.4932310581207275} 11/06/2021 23:29:15 - INFO - __main__ - Step 15793: {'lr': 0.0004893625712902489, 'samples': 3032256, 'steps': 15792, 'loss/train': 1.2762092351913452} 11/06/2021 23:29:16 - INFO - __main__ - Step 15794: {'lr': 0.0004893610397191764, 'samples': 3032448, 'steps': 15793, 'loss/train': 1.7906181812286377} 11/06/2021 23:29:16 - INFO - __main__ - Step 15795: {'lr': 0.0004893595080402517, 'samples': 3032640, 'steps': 15794, 'loss/train': 1.450931191444397} 11/06/2021 23:29:16 - INFO - __main__ - Step 15796: {'lr': 0.0004893579762534751, 'samples': 3032832, 'steps': 15795, 'loss/train': 1.3250974416732788} 11/06/2021 23:29:17 - INFO - __main__ - Step 15797: {'lr': 0.0004893564443588476, 'samples': 3033024, 'steps': 15796, 'loss/train': 2.3122026920318604} 11/06/2021 23:29:17 - INFO - __main__ - Step 15798: {'lr': 0.0004893549123563697, 'samples': 3033216, 'steps': 15797, 'loss/train': 1.8526376485824585} 11/06/2021 23:29:18 - INFO - __main__ - Step 15799: {'lr': 0.0004893533802460422, 'samples': 3033408, 'steps': 15798, 'loss/train': 1.828189492225647} 11/06/2021 23:29:18 - INFO - __main__ - Step 15800: {'lr': 0.0004893518480278658, 'samples': 3033600, 'steps': 15799, 'loss/train': 1.659311294555664} 11/06/2021 23:29:19 - INFO - __main__ - Step 15801: {'lr': 0.0004893503157018412, 'samples': 3033792, 'steps': 15800, 'loss/train': 2.1041367053985596} 11/06/2021 23:29:19 - INFO - __main__ - Step 15802: {'lr': 0.000489348783267969, 'samples': 3033984, 'steps': 15801, 'loss/train': 1.7049843072891235} 11/06/2021 23:29:19 - INFO - __main__ - Step 15803: {'lr': 0.0004893472507262499, 'samples': 3034176, 'steps': 15802, 'loss/train': 1.8830581903457642} 11/06/2021 23:29:20 - INFO - __main__ - Step 15804: {'lr': 0.0004893457180766846, 'samples': 3034368, 'steps': 15803, 'loss/train': 1.8706032037734985} 11/06/2021 23:29:21 - INFO - __main__ - Step 15805: {'lr': 0.0004893441853192739, 'samples': 3034560, 'steps': 15804, 'loss/train': 1.941593885421753} 11/06/2021 23:29:21 - INFO - __main__ - Step 15806: {'lr': 0.0004893426524540183, 'samples': 3034752, 'steps': 15805, 'loss/train': 1.842943549156189} 11/06/2021 23:29:21 - INFO - __main__ - Step 15807: {'lr': 0.0004893411194809186, 'samples': 3034944, 'steps': 15806, 'loss/train': 1.782511830329895} 11/06/2021 23:29:22 - INFO - __main__ - Step 15808: {'lr': 0.0004893395863999755, 'samples': 3035136, 'steps': 15807, 'loss/train': 2.1695151329040527} 11/06/2021 23:29:22 - INFO - __main__ - Step 15809: {'lr': 0.0004893380532111898, 'samples': 3035328, 'steps': 15808, 'loss/train': 1.792184591293335} 11/06/2021 23:29:23 - INFO - __main__ - Step 15810: {'lr': 0.0004893365199145619, 'samples': 3035520, 'steps': 15809, 'loss/train': 1.3812013864517212} 11/06/2021 23:29:24 - INFO - __main__ - Step 15811: {'lr': 0.0004893349865100927, 'samples': 3035712, 'steps': 15810, 'loss/train': 1.7286027669906616} 11/06/2021 23:29:24 - INFO - __main__ - Step 15812: {'lr': 0.0004893334529977828, 'samples': 3035904, 'steps': 15811, 'loss/train': 1.761993646621704} 11/06/2021 23:29:24 - INFO - __main__ - Step 15813: {'lr': 0.0004893319193776331, 'samples': 3036096, 'steps': 15812, 'loss/train': 1.3216885328292847} 11/06/2021 23:29:25 - INFO - __main__ - Step 15814: {'lr': 0.000489330385649644, 'samples': 3036288, 'steps': 15813, 'loss/train': 1.5407154560089111} 11/06/2021 23:29:26 - INFO - __main__ - Step 15815: {'lr': 0.0004893288518138163, 'samples': 3036480, 'steps': 15814, 'loss/train': 1.7777429819107056} 11/06/2021 23:29:26 - INFO - __main__ - Step 15816: {'lr': 0.0004893273178701508, 'samples': 3036672, 'steps': 15815, 'loss/train': 1.6873787641525269} 11/06/2021 23:29:27 - INFO - __main__ - Step 15817: {'lr': 0.0004893257838186481, 'samples': 3036864, 'steps': 15816, 'loss/train': 2.015655755996704} 11/06/2021 23:29:27 - INFO - __main__ - Step 15818: {'lr': 0.0004893242496593089, 'samples': 3037056, 'steps': 15817, 'loss/train': 1.5864753723144531} 11/06/2021 23:29:27 - INFO - __main__ - Step 15819: {'lr': 0.0004893227153921338, 'samples': 3037248, 'steps': 15818, 'loss/train': 1.5181615352630615} 11/06/2021 23:29:28 - INFO - __main__ - Step 15820: {'lr': 0.0004893211810171237, 'samples': 3037440, 'steps': 15819, 'loss/train': 1.953935980796814} 11/06/2021 23:29:29 - INFO - __main__ - Step 15821: {'lr': 0.0004893196465342791, 'samples': 3037632, 'steps': 15820, 'loss/train': 1.703386664390564} 11/06/2021 23:29:29 - INFO - __main__ - Step 15822: {'lr': 0.0004893181119436007, 'samples': 3037824, 'steps': 15821, 'loss/train': 1.765951156616211} 11/06/2021 23:29:30 - INFO - __main__ - Step 15823: {'lr': 0.0004893165772450893, 'samples': 3038016, 'steps': 15822, 'loss/train': 1.1574418544769287} 11/06/2021 23:29:30 - INFO - __main__ - Step 15824: {'lr': 0.0004893150424387456, 'samples': 3038208, 'steps': 15823, 'loss/train': 2.260741949081421} 11/06/2021 23:29:30 - INFO - __main__ - Step 15825: {'lr': 0.0004893135075245702, 'samples': 3038400, 'steps': 15824, 'loss/train': 1.0996348857879639} 11/06/2021 23:29:31 - INFO - __main__ - Step 15826: {'lr': 0.0004893119725025639, 'samples': 3038592, 'steps': 15825, 'loss/train': 2.232999324798584} 11/06/2021 23:29:32 - INFO - __main__ - Step 15827: {'lr': 0.0004893104373727272, 'samples': 3038784, 'steps': 15826, 'loss/train': 1.5678027868270874} 11/06/2021 23:29:32 - INFO - __main__ - Step 15828: {'lr': 0.0004893089021350609, 'samples': 3038976, 'steps': 15827, 'loss/train': 1.6784107685089111} 11/06/2021 23:29:32 - INFO - __main__ - Step 15829: {'lr': 0.0004893073667895658, 'samples': 3039168, 'steps': 15828, 'loss/train': 1.2464898824691772} 11/06/2021 23:29:33 - INFO - __main__ - Step 15830: {'lr': 0.0004893058313362424, 'samples': 3039360, 'steps': 15829, 'loss/train': 1.7435942888259888} 11/06/2021 23:29:34 - INFO - __main__ - Step 15831: {'lr': 0.0004893042957750916, 'samples': 3039552, 'steps': 15830, 'loss/train': 1.2910295724868774} 11/06/2021 23:29:34 - INFO - __main__ - Step 15832: {'lr': 0.0004893027601061138, 'samples': 3039744, 'steps': 15831, 'loss/train': 1.3004798889160156} 11/06/2021 23:29:35 - INFO - __main__ - Step 15833: {'lr': 0.00048930122432931, 'samples': 3039936, 'steps': 15832, 'loss/train': 1.926435112953186} 11/06/2021 23:29:35 - INFO - __main__ - Step 15834: {'lr': 0.0004892996884446807, 'samples': 3040128, 'steps': 15833, 'loss/train': 1.350760817527771} 11/06/2021 23:29:35 - INFO - __main__ - Step 15835: {'lr': 0.0004892981524522267, 'samples': 3040320, 'steps': 15834, 'loss/train': 1.922597050666809} 11/06/2021 23:29:36 - INFO - __main__ - Step 15836: {'lr': 0.0004892966163519487, 'samples': 3040512, 'steps': 15835, 'loss/train': 1.553952932357788} 11/06/2021 23:29:37 - INFO - __main__ - Step 15837: {'lr': 0.0004892950801438472, 'samples': 3040704, 'steps': 15836, 'loss/train': 1.5477640628814697} 11/06/2021 23:29:37 - INFO - __main__ - Step 15838: {'lr': 0.0004892935438279231, 'samples': 3040896, 'steps': 15837, 'loss/train': 1.4451688528060913} 11/06/2021 23:29:37 - INFO - __main__ - Step 15839: {'lr': 0.0004892920074041771, 'samples': 3041088, 'steps': 15838, 'loss/train': 1.5197027921676636} 11/06/2021 23:29:38 - INFO - __main__ - Step 15840: {'lr': 0.0004892904708726096, 'samples': 3041280, 'steps': 15839, 'loss/train': 1.6010912656784058} 11/06/2021 23:29:38 - INFO - __main__ - Step 15841: {'lr': 0.0004892889342332218, 'samples': 3041472, 'steps': 15840, 'loss/train': 1.8907824754714966} 11/06/2021 23:29:39 - INFO - __main__ - Step 15842: {'lr': 0.000489287397486014, 'samples': 3041664, 'steps': 15841, 'loss/train': 1.3404470682144165} 11/06/2021 23:29:39 - INFO - __main__ - Step 15843: {'lr': 0.0004892858606309868, 'samples': 3041856, 'steps': 15842, 'loss/train': 1.9197810888290405} 11/06/2021 23:29:40 - INFO - __main__ - Step 15844: {'lr': 0.0004892843236681412, 'samples': 3042048, 'steps': 15843, 'loss/train': 1.7280125617980957} 11/06/2021 23:29:40 - INFO - __main__ - Step 15845: {'lr': 0.0004892827865974779, 'samples': 3042240, 'steps': 15844, 'loss/train': 1.8534923791885376} 11/06/2021 23:29:40 - INFO - __main__ - Step 15846: {'lr': 0.0004892812494189973, 'samples': 3042432, 'steps': 15845, 'loss/train': 1.8682761192321777} 11/06/2021 23:29:41 - INFO - __main__ - Step 15847: {'lr': 0.0004892797121327003, 'samples': 3042624, 'steps': 15846, 'loss/train': 1.5605151653289795} 11/06/2021 23:29:42 - INFO - __main__ - Step 15848: {'lr': 0.0004892781747385876, 'samples': 3042816, 'steps': 15847, 'loss/train': 1.5748695135116577} 11/06/2021 23:29:42 - INFO - __main__ - Step 15849: {'lr': 0.0004892766372366598, 'samples': 3043008, 'steps': 15848, 'loss/train': 1.6908271312713623} 11/06/2021 23:29:43 - INFO - __main__ - Step 15850: {'lr': 0.0004892750996269177, 'samples': 3043200, 'steps': 15849, 'loss/train': 1.7898610830307007} 11/06/2021 23:29:43 - INFO - __main__ - Step 15851: {'lr': 0.0004892735619093618, 'samples': 3043392, 'steps': 15850, 'loss/train': 1.9036415815353394} 11/06/2021 23:29:44 - INFO - __main__ - Step 15852: {'lr': 0.0004892720240839931, 'samples': 3043584, 'steps': 15851, 'loss/train': 1.692862629890442} 11/06/2021 23:29:44 - INFO - __main__ - Step 15853: {'lr': 0.0004892704861508121, 'samples': 3043776, 'steps': 15852, 'loss/train': 1.807676076889038} 11/06/2021 23:29:45 - INFO - __main__ - Step 15854: {'lr': 0.0004892689481098193, 'samples': 3043968, 'steps': 15853, 'loss/train': 1.5801000595092773} 11/06/2021 23:29:45 - INFO - __main__ - Step 15855: {'lr': 0.0004892674099610158, 'samples': 3044160, 'steps': 15854, 'loss/train': 1.7234050035476685} 11/06/2021 23:29:45 - INFO - __main__ - Step 15856: {'lr': 0.000489265871704402, 'samples': 3044352, 'steps': 15855, 'loss/train': 1.906105399131775} 11/06/2021 23:29:46 - INFO - __main__ - Step 15857: {'lr': 0.0004892643333399788, 'samples': 3044544, 'steps': 15856, 'loss/train': 1.3132715225219727} 11/06/2021 23:29:47 - INFO - __main__ - Step 15858: {'lr': 0.0004892627948677467, 'samples': 3044736, 'steps': 15857, 'loss/train': 1.1623653173446655} 11/06/2021 23:29:47 - INFO - __main__ - Step 15859: {'lr': 0.0004892612562877066, 'samples': 3044928, 'steps': 15858, 'loss/train': 1.407994270324707} 11/06/2021 23:29:48 - INFO - __main__ - Step 15860: {'lr': 0.0004892597175998589, 'samples': 3045120, 'steps': 15859, 'loss/train': 1.7048394680023193} 11/06/2021 23:29:48 - INFO - __main__ - Step 15861: {'lr': 0.0004892581788042045, 'samples': 3045312, 'steps': 15860, 'loss/train': 1.7856895923614502} 11/06/2021 23:29:49 - INFO - __main__ - Step 15862: {'lr': 0.0004892566399007441, 'samples': 3045504, 'steps': 15861, 'loss/train': 1.6186326742172241} 11/06/2021 23:29:49 - INFO - __main__ - Step 15863: {'lr': 0.0004892551008894784, 'samples': 3045696, 'steps': 15862, 'loss/train': 1.5280269384384155} 11/06/2021 23:29:50 - INFO - __main__ - Step 15864: {'lr': 0.0004892535617704079, 'samples': 3045888, 'steps': 15863, 'loss/train': 1.7535699605941772} 11/06/2021 23:29:50 - INFO - __main__ - Step 15865: {'lr': 0.0004892520225435336, 'samples': 3046080, 'steps': 15864, 'loss/train': 1.7523764371871948} 11/06/2021 23:29:50 - INFO - __main__ - Step 15866: {'lr': 0.000489250483208856, 'samples': 3046272, 'steps': 15865, 'loss/train': 1.8032313585281372} 11/06/2021 23:29:51 - INFO - __main__ - Step 15867: {'lr': 0.0004892489437663758, 'samples': 3046464, 'steps': 15866, 'loss/train': 1.9503204822540283} 11/06/2021 23:29:52 - INFO - __main__ - Step 15868: {'lr': 0.0004892474042160936, 'samples': 3046656, 'steps': 15867, 'loss/train': 1.6175391674041748} 11/06/2021 23:29:52 - INFO - __main__ - Step 15869: {'lr': 0.0004892458645580103, 'samples': 3046848, 'steps': 15868, 'loss/train': 1.700390338897705} 11/06/2021 23:29:53 - INFO - __main__ - Step 15870: {'lr': 0.0004892443247921265, 'samples': 3047040, 'steps': 15869, 'loss/train': 1.867167353630066} 11/06/2021 23:29:53 - INFO - __main__ - Step 15871: {'lr': 0.0004892427849184428, 'samples': 3047232, 'steps': 15870, 'loss/train': 2.022533416748047} 11/06/2021 23:29:53 - INFO - __main__ - Step 15872: {'lr': 0.0004892412449369602, 'samples': 3047424, 'steps': 15871, 'loss/train': 2.038414478302002} 11/06/2021 23:29:54 - INFO - __main__ - Step 15873: {'lr': 0.0004892397048476791, 'samples': 3047616, 'steps': 15872, 'loss/train': 2.008455276489258} 11/06/2021 23:29:55 - INFO - __main__ - Step 15874: {'lr': 0.0004892381646506002, 'samples': 3047808, 'steps': 15873, 'loss/train': 2.3255984783172607} 11/06/2021 23:29:55 - INFO - __main__ - Step 15875: {'lr': 0.0004892366243457244, 'samples': 3048000, 'steps': 15874, 'loss/train': 1.7657088041305542} 11/06/2021 23:29:55 - INFO - __main__ - Step 15876: {'lr': 0.0004892350839330522, 'samples': 3048192, 'steps': 15875, 'loss/train': 1.2648745775222778} 11/06/2021 23:29:56 - INFO - __main__ - Step 15877: {'lr': 0.0004892335434125844, 'samples': 3048384, 'steps': 15876, 'loss/train': 1.686819314956665} 11/06/2021 23:29:57 - INFO - __main__ - Step 15878: {'lr': 0.0004892320027843216, 'samples': 3048576, 'steps': 15877, 'loss/train': 1.774713158607483} 11/06/2021 23:29:57 - INFO - __main__ - Step 15879: {'lr': 0.0004892304620482646, 'samples': 3048768, 'steps': 15878, 'loss/train': 1.8812906742095947} 11/06/2021 23:29:57 - INFO - __main__ - Step 15880: {'lr': 0.000489228921204414, 'samples': 3048960, 'steps': 15879, 'loss/train': 1.9336180686950684} 11/06/2021 23:29:58 - INFO - __main__ - Step 15881: {'lr': 0.0004892273802527706, 'samples': 3049152, 'steps': 15880, 'loss/train': 1.9039198160171509} 11/06/2021 23:29:58 - INFO - __main__ - Step 15882: {'lr': 0.000489225839193335, 'samples': 3049344, 'steps': 15881, 'loss/train': 2.681311845779419} 11/06/2021 23:29:59 - INFO - __main__ - Step 15883: {'lr': 0.0004892242980261079, 'samples': 3049536, 'steps': 15882, 'loss/train': 1.3233872652053833} 11/06/2021 23:30:00 - INFO - __main__ - Step 15884: {'lr': 0.0004892227567510901, 'samples': 3049728, 'steps': 15883, 'loss/train': 1.6366331577301025} 11/06/2021 23:30:00 - INFO - __main__ - Step 15885: {'lr': 0.0004892212153682822, 'samples': 3049920, 'steps': 15884, 'loss/train': 2.0259757041931152} 11/06/2021 23:30:00 - INFO - __main__ - Step 15886: {'lr': 0.0004892196738776848, 'samples': 3050112, 'steps': 15885, 'loss/train': 1.6380057334899902} 11/06/2021 23:30:01 - INFO - __main__ - Step 15887: {'lr': 0.0004892181322792989, 'samples': 3050304, 'steps': 15886, 'loss/train': 1.6762605905532837} 11/06/2021 23:30:01 - INFO - __main__ - Step 15888: {'lr': 0.0004892165905731248, 'samples': 3050496, 'steps': 15887, 'loss/train': 1.6492608785629272} 11/06/2021 23:30:02 - INFO - __main__ - Step 15889: {'lr': 0.0004892150487591635, 'samples': 3050688, 'steps': 15888, 'loss/train': 1.8389976024627686} 11/06/2021 23:30:02 - INFO - __main__ - Step 15890: {'lr': 0.0004892135068374156, 'samples': 3050880, 'steps': 15889, 'loss/train': 1.6473220586776733} 11/06/2021 23:30:03 - INFO - __main__ - Step 15891: {'lr': 0.0004892119648078817, 'samples': 3051072, 'steps': 15890, 'loss/train': 1.6009021997451782} 11/06/2021 23:30:03 - INFO - __main__ - Step 15892: {'lr': 0.0004892104226705627, 'samples': 3051264, 'steps': 15891, 'loss/train': 1.201027512550354} 11/06/2021 23:30:04 - INFO - __main__ - Step 15893: {'lr': 0.0004892088804254591, 'samples': 3051456, 'steps': 15892, 'loss/train': 1.556810736656189} 11/06/2021 23:30:05 - INFO - __main__ - Step 15894: {'lr': 0.0004892073380725716, 'samples': 3051648, 'steps': 15893, 'loss/train': 1.3976141214370728} 11/06/2021 23:30:05 - INFO - __main__ - Step 15895: {'lr': 0.0004892057956119012, 'samples': 3051840, 'steps': 15894, 'loss/train': 1.6373045444488525} 11/06/2021 23:30:05 - INFO - __main__ - Step 15896: {'lr': 0.0004892042530434482, 'samples': 3052032, 'steps': 15895, 'loss/train': 1.500960350036621} 11/06/2021 23:30:06 - INFO - __main__ - Step 15897: {'lr': 0.0004892027103672134, 'samples': 3052224, 'steps': 15896, 'loss/train': 2.280747175216675} 11/06/2021 23:30:06 - INFO - __main__ - Step 15898: {'lr': 0.0004892011675831976, 'samples': 3052416, 'steps': 15897, 'loss/train': 1.7541916370391846} 11/06/2021 23:30:07 - INFO - __main__ - Step 15899: {'lr': 0.0004891996246914014, 'samples': 3052608, 'steps': 15898, 'loss/train': 1.63882315158844} 11/06/2021 23:30:07 - INFO - __main__ - Step 15900: {'lr': 0.0004891980816918257, 'samples': 3052800, 'steps': 15899, 'loss/train': 1.629404902458191} 11/06/2021 23:30:08 - INFO - __main__ - Step 15901: {'lr': 0.0004891965385844709, 'samples': 3052992, 'steps': 15900, 'loss/train': 1.5046894550323486} 11/06/2021 23:30:08 - INFO - __main__ - Step 15902: {'lr': 0.0004891949953693378, 'samples': 3053184, 'steps': 15901, 'loss/train': 1.5247341394424438} 11/06/2021 23:30:08 - INFO - __main__ - Step 15903: {'lr': 0.0004891934520464273, 'samples': 3053376, 'steps': 15902, 'loss/train': 1.785117268562317} 11/06/2021 23:30:09 - INFO - __main__ - Step 15904: {'lr': 0.0004891919086157398, 'samples': 3053568, 'steps': 15903, 'loss/train': 1.4606293439865112} 11/06/2021 23:30:10 - INFO - __main__ - Step 15905: {'lr': 0.000489190365077276, 'samples': 3053760, 'steps': 15904, 'loss/train': 1.7940731048583984} 11/06/2021 23:30:10 - INFO - __main__ - Step 15906: {'lr': 0.0004891888214310369, 'samples': 3053952, 'steps': 15905, 'loss/train': 1.8179190158843994} 11/06/2021 23:30:11 - INFO - __main__ - Step 15907: {'lr': 0.000489187277677023, 'samples': 3054144, 'steps': 15906, 'loss/train': 1.6493617296218872} 11/06/2021 23:30:11 - INFO - __main__ - Step 15908: {'lr': 0.000489185733815235, 'samples': 3054336, 'steps': 15907, 'loss/train': 1.645919919013977} 11/06/2021 23:30:11 - INFO - __main__ - Step 15909: {'lr': 0.0004891841898456735, 'samples': 3054528, 'steps': 15908, 'loss/train': 1.8453730344772339} 11/06/2021 23:30:12 - INFO - __main__ - Step 15910: {'lr': 0.0004891826457683394, 'samples': 3054720, 'steps': 15909, 'loss/train': 1.7440708875656128} 11/06/2021 23:30:13 - INFO - __main__ - Step 15911: {'lr': 0.0004891811015832332, 'samples': 3054912, 'steps': 15910, 'loss/train': 1.6581617593765259} 11/06/2021 23:30:13 - INFO - __main__ - Step 15912: {'lr': 0.0004891795572903557, 'samples': 3055104, 'steps': 15911, 'loss/train': 1.5487264394760132} 11/06/2021 23:30:14 - INFO - __main__ - Step 15913: {'lr': 0.0004891780128897077, 'samples': 3055296, 'steps': 15912, 'loss/train': 1.7143360376358032} 11/06/2021 23:30:14 - INFO - __main__ - Step 15914: {'lr': 0.0004891764683812896, 'samples': 3055488, 'steps': 15913, 'loss/train': 1.7877535820007324} 11/06/2021 23:30:15 - INFO - __main__ - Step 15915: {'lr': 0.0004891749237651024, 'samples': 3055680, 'steps': 15914, 'loss/train': 2.2019176483154297} 11/06/2021 23:30:16 - INFO - __main__ - Step 15916: {'lr': 0.0004891733790411466, 'samples': 3055872, 'steps': 15915, 'loss/train': 1.6177648305892944} 11/06/2021 23:30:16 - INFO - __main__ - Step 15917: {'lr': 0.000489171834209423, 'samples': 3056064, 'steps': 15916, 'loss/train': 2.0790741443634033} 11/06/2021 23:30:16 - INFO - __main__ - Step 15918: {'lr': 0.0004891702892699323, 'samples': 3056256, 'steps': 15917, 'loss/train': 1.2524698972702026} 11/06/2021 23:30:17 - INFO - __main__ - Step 15919: {'lr': 0.0004891687442226751, 'samples': 3056448, 'steps': 15918, 'loss/train': 2.0985350608825684} 11/06/2021 23:30:17 - INFO - __main__ - Step 15920: {'lr': 0.0004891671990676522, 'samples': 3056640, 'steps': 15919, 'loss/train': 1.7934945821762085} 11/06/2021 23:30:18 - INFO - __main__ - Step 15921: {'lr': 0.0004891656538048642, 'samples': 3056832, 'steps': 15920, 'loss/train': 0.5140384435653687} 11/06/2021 23:30:18 - INFO - __main__ - Step 15922: {'lr': 0.0004891641084343118, 'samples': 3057024, 'steps': 15921, 'loss/train': 1.4971164464950562} 11/06/2021 23:30:19 - INFO - __main__ - Step 15923: {'lr': 0.0004891625629559959, 'samples': 3057216, 'steps': 15922, 'loss/train': 2.0432043075561523} 11/06/2021 23:30:19 - INFO - __main__ - Step 15924: {'lr': 0.0004891610173699169, 'samples': 3057408, 'steps': 15923, 'loss/train': 1.4933257102966309} 11/06/2021 23:30:20 - INFO - __main__ - Step 15925: {'lr': 0.0004891594716760757, 'samples': 3057600, 'steps': 15924, 'loss/train': 2.0862317085266113} 11/06/2021 23:30:20 - INFO - __main__ - Step 15926: {'lr': 0.0004891579258744728, 'samples': 3057792, 'steps': 15925, 'loss/train': 1.788038969039917} 11/06/2021 23:30:21 - INFO - __main__ - Step 15927: {'lr': 0.0004891563799651092, 'samples': 3057984, 'steps': 15926, 'loss/train': 1.711025357246399} 11/06/2021 23:30:21 - INFO - __main__ - Step 15928: {'lr': 0.0004891548339479854, 'samples': 3058176, 'steps': 15927, 'loss/train': 1.7549539804458618} 11/06/2021 23:30:22 - INFO - __main__ - Step 15929: {'lr': 0.0004891532878231021, 'samples': 3058368, 'steps': 15928, 'loss/train': 1.9058785438537598} 11/06/2021 23:30:22 - INFO - __main__ - Step 15930: {'lr': 0.00048915174159046, 'samples': 3058560, 'steps': 15929, 'loss/train': 1.3126921653747559} 11/06/2021 23:30:23 - INFO - __main__ - Step 15931: {'lr': 0.0004891501952500599, 'samples': 3058752, 'steps': 15930, 'loss/train': 1.5705853700637817} 11/06/2021 23:30:23 - INFO - __main__ - Step 15932: {'lr': 0.0004891486488019023, 'samples': 3058944, 'steps': 15931, 'loss/train': 1.8984884023666382} 11/06/2021 23:30:24 - INFO - __main__ - Step 15933: {'lr': 0.000489147102245988, 'samples': 3059136, 'steps': 15932, 'loss/train': 1.7881126403808594} 11/06/2021 23:30:24 - INFO - __main__ - Step 15934: {'lr': 0.0004891455555823179, 'samples': 3059328, 'steps': 15933, 'loss/train': 0.8325879573822021} 11/06/2021 23:30:24 - INFO - __main__ - Step 15935: {'lr': 0.0004891440088108923, 'samples': 3059520, 'steps': 15934, 'loss/train': 1.4407297372817993} 11/06/2021 23:30:26 - INFO - __main__ - Step 15936: {'lr': 0.0004891424619317121, 'samples': 3059712, 'steps': 15935, 'loss/train': 1.791204810142517} 11/06/2021 23:30:26 - INFO - __main__ - Step 15937: {'lr': 0.000489140914944778, 'samples': 3059904, 'steps': 15936, 'loss/train': 1.459701657295227} 11/06/2021 23:30:26 - INFO - __main__ - Step 15938: {'lr': 0.0004891393678500909, 'samples': 3060096, 'steps': 15937, 'loss/train': 1.4462066888809204} 11/06/2021 23:30:27 - INFO - __main__ - Step 15939: {'lr': 0.0004891378206476511, 'samples': 3060288, 'steps': 15938, 'loss/train': 1.4875487089157104} 11/06/2021 23:30:27 - INFO - __main__ - Step 15940: {'lr': 0.0004891362733374595, 'samples': 3060480, 'steps': 15939, 'loss/train': 1.7356269359588623} 11/06/2021 23:30:28 - INFO - __main__ - Step 15941: {'lr': 0.0004891347259195168, 'samples': 3060672, 'steps': 15940, 'loss/train': 1.6053783893585205} 11/06/2021 23:30:28 - INFO - __main__ - Step 15942: {'lr': 0.0004891331783938238, 'samples': 3060864, 'steps': 15941, 'loss/train': 1.529934287071228} 11/06/2021 23:30:29 - INFO - __main__ - Step 15943: {'lr': 0.000489131630760381, 'samples': 3061056, 'steps': 15942, 'loss/train': 1.0856684446334839} 11/06/2021 23:30:29 - INFO - __main__ - Step 15944: {'lr': 0.000489130083019189, 'samples': 3061248, 'steps': 15943, 'loss/train': 1.3536427021026611} 11/06/2021 23:30:29 - INFO - __main__ - Step 15945: {'lr': 0.000489128535170249, 'samples': 3061440, 'steps': 15944, 'loss/train': 1.3053251504898071} 11/06/2021 23:30:30 - INFO - __main__ - Step 15946: {'lr': 0.0004891269872135611, 'samples': 3061632, 'steps': 15945, 'loss/train': 2.2223825454711914} 11/06/2021 23:30:31 - INFO - __main__ - Step 15947: {'lr': 0.0004891254391491264, 'samples': 3061824, 'steps': 15946, 'loss/train': 1.6169859170913696} 11/06/2021 23:30:31 - INFO - __main__ - Step 15948: {'lr': 0.0004891238909769454, 'samples': 3062016, 'steps': 15947, 'loss/train': 1.6971629858016968} 11/06/2021 23:30:32 - INFO - __main__ - Step 15949: {'lr': 0.0004891223426970189, 'samples': 3062208, 'steps': 15948, 'loss/train': 1.6054120063781738} 11/06/2021 23:30:32 - INFO - __main__ - Step 15950: {'lr': 0.0004891207943093476, 'samples': 3062400, 'steps': 15949, 'loss/train': 2.358388662338257} 11/06/2021 23:30:32 - INFO - __main__ - Step 15951: {'lr': 0.000489119245813932, 'samples': 3062592, 'steps': 15950, 'loss/train': 2.2552406787872314} 11/06/2021 23:30:34 - INFO - __main__ - Step 15952: {'lr': 0.0004891176972107731, 'samples': 3062784, 'steps': 15951, 'loss/train': 1.8012076616287231} 11/06/2021 23:30:34 - INFO - __main__ - Step 15953: {'lr': 0.0004891161484998715, 'samples': 3062976, 'steps': 15952, 'loss/train': 1.947579264640808} 11/06/2021 23:30:34 - INFO - __main__ - Step 15954: {'lr': 0.0004891145996812279, 'samples': 3063168, 'steps': 15953, 'loss/train': 1.6745511293411255} 11/06/2021 23:30:35 - INFO - __main__ - Step 15955: {'lr': 0.0004891130507548427, 'samples': 3063360, 'steps': 15954, 'loss/train': 2.0542542934417725} 11/06/2021 23:30:35 - INFO - __main__ - Step 15956: {'lr': 0.000489111501720717, 'samples': 3063552, 'steps': 15955, 'loss/train': 1.9397313594818115} 11/06/2021 23:30:36 - INFO - __main__ - Step 15957: {'lr': 0.0004891099525788514, 'samples': 3063744, 'steps': 15956, 'loss/train': 1.229641079902649} 11/06/2021 23:30:36 - INFO - __main__ - Step 15958: {'lr': 0.0004891084033292464, 'samples': 3063936, 'steps': 15957, 'loss/train': 1.033172369003296} 11/06/2021 23:30:37 - INFO - __main__ - Step 15959: {'lr': 0.0004891068539719031, 'samples': 3064128, 'steps': 15958, 'loss/train': 1.7503483295440674} 11/06/2021 23:30:37 - INFO - __main__ - Step 15960: {'lr': 0.0004891053045068217, 'samples': 3064320, 'steps': 15959, 'loss/train': 2.0735301971435547} 11/06/2021 23:30:38 - INFO - __main__ - Step 15961: {'lr': 0.0004891037549340032, 'samples': 3064512, 'steps': 15960, 'loss/train': 1.7163405418395996} 11/06/2021 23:30:38 - INFO - __main__ - Step 15962: {'lr': 0.0004891022052534482, 'samples': 3064704, 'steps': 15961, 'loss/train': 1.817496418952942} 11/06/2021 23:30:39 - INFO - __main__ - Step 15963: {'lr': 0.0004891006554651574, 'samples': 3064896, 'steps': 15962, 'loss/train': 1.744676113128662} 11/06/2021 23:30:39 - INFO - __main__ - Step 15964: {'lr': 0.0004890991055691318, 'samples': 3065088, 'steps': 15963, 'loss/train': 1.987470030784607} 11/06/2021 23:30:40 - INFO - __main__ - Step 15965: {'lr': 0.0004890975555653716, 'samples': 3065280, 'steps': 15964, 'loss/train': 1.5478122234344482} 11/06/2021 23:30:40 - INFO - __main__ - Step 15966: {'lr': 0.0004890960054538778, 'samples': 3065472, 'steps': 15965, 'loss/train': 1.4275331497192383} 11/06/2021 23:30:40 - INFO - __main__ - Step 15967: {'lr': 0.000489094455234651, 'samples': 3065664, 'steps': 15966, 'loss/train': 1.8712700605392456} 11/06/2021 23:30:41 - INFO - __main__ - Step 15968: {'lr': 0.0004890929049076919, 'samples': 3065856, 'steps': 15967, 'loss/train': 1.5248794555664062} 11/06/2021 23:30:42 - INFO - __main__ - Step 15969: {'lr': 0.0004890913544730013, 'samples': 3066048, 'steps': 15968, 'loss/train': 1.28171706199646} 11/06/2021 23:30:42 - INFO - __main__ - Step 15970: {'lr': 0.0004890898039305798, 'samples': 3066240, 'steps': 15969, 'loss/train': 1.9266890287399292} 11/06/2021 23:30:43 - INFO - __main__ - Step 15971: {'lr': 0.000489088253280428, 'samples': 3066432, 'steps': 15970, 'loss/train': 1.6399247646331787} 11/06/2021 23:30:43 - INFO - __main__ - Step 15972: {'lr': 0.0004890867025225469, 'samples': 3066624, 'steps': 15971, 'loss/train': 1.434366226196289} 11/06/2021 23:30:43 - INFO - __main__ - Step 15973: {'lr': 0.000489085151656937, 'samples': 3066816, 'steps': 15972, 'loss/train': 1.9939874410629272} 11/06/2021 23:30:44 - INFO - __main__ - Step 15974: {'lr': 0.000489083600683599, 'samples': 3067008, 'steps': 15973, 'loss/train': 1.6070252656936646} 11/06/2021 23:30:45 - INFO - __main__ - Step 15975: {'lr': 0.0004890820496025335, 'samples': 3067200, 'steps': 15974, 'loss/train': 2.040097236633301} 11/06/2021 23:30:45 - INFO - __main__ - Step 15976: {'lr': 0.0004890804984137415, 'samples': 3067392, 'steps': 15975, 'loss/train': 1.497314691543579} 11/06/2021 23:30:45 - INFO - __main__ - Step 15977: {'lr': 0.0004890789471172233, 'samples': 3067584, 'steps': 15976, 'loss/train': 1.9788398742675781} 11/06/2021 23:30:46 - INFO - __main__ - Step 15978: {'lr': 0.00048907739571298, 'samples': 3067776, 'steps': 15977, 'loss/train': 2.1698567867279053} 11/06/2021 23:30:47 - INFO - __main__ - Step 15979: {'lr': 0.000489075844201012, 'samples': 3067968, 'steps': 15978, 'loss/train': 1.803942084312439} 11/06/2021 23:30:47 - INFO - __main__ - Step 15980: {'lr': 0.0004890742925813202, 'samples': 3068160, 'steps': 15979, 'loss/train': 1.9059771299362183} 11/06/2021 23:30:47 - INFO - __main__ - Step 15981: {'lr': 0.0004890727408539051, 'samples': 3068352, 'steps': 15980, 'loss/train': 1.6169204711914062} 11/06/2021 23:30:48 - INFO - __main__ - Step 15982: {'lr': 0.0004890711890187676, 'samples': 3068544, 'steps': 15981, 'loss/train': 1.6662590503692627} 11/06/2021 23:30:48 - INFO - __main__ - Step 15983: {'lr': 0.0004890696370759085, 'samples': 3068736, 'steps': 15982, 'loss/train': 1.6316263675689697} 11/06/2021 23:30:49 - INFO - __main__ - Step 15984: {'lr': 0.0004890680850253281, 'samples': 3068928, 'steps': 15983, 'loss/train': 1.7431037425994873} 11/06/2021 23:30:50 - INFO - __main__ - Step 15985: {'lr': 0.0004890665328670273, 'samples': 3069120, 'steps': 15984, 'loss/train': 1.7892478704452515} 11/06/2021 23:30:50 - INFO - __main__ - Step 15986: {'lr': 0.0004890649806010067, 'samples': 3069312, 'steps': 15985, 'loss/train': 1.5585905313491821} 11/06/2021 23:30:50 - INFO - __main__ - Step 15987: {'lr': 0.0004890634282272673, 'samples': 3069504, 'steps': 15986, 'loss/train': 1.4312716722488403} 11/06/2021 23:30:51 - INFO - __main__ - Step 15988: {'lr': 0.0004890618757458096, 'samples': 3069696, 'steps': 15987, 'loss/train': 1.3658758401870728} 11/06/2021 23:30:52 - INFO - __main__ - Step 15989: {'lr': 0.0004890603231566343, 'samples': 3069888, 'steps': 15988, 'loss/train': 1.3280619382858276} 11/06/2021 23:30:52 - INFO - __main__ - Step 15990: {'lr': 0.000489058770459742, 'samples': 3070080, 'steps': 15989, 'loss/train': 0.21883493661880493} 11/06/2021 23:30:52 - INFO - __main__ - Step 15991: {'lr': 0.0004890572176551337, 'samples': 3070272, 'steps': 15990, 'loss/train': 1.0338857173919678} 11/06/2021 23:30:53 - INFO - __main__ - Step 15992: {'lr': 0.0004890556647428097, 'samples': 3070464, 'steps': 15991, 'loss/train': 1.804748296737671} 11/06/2021 23:30:53 - INFO - __main__ - Step 15993: {'lr': 0.0004890541117227711, 'samples': 3070656, 'steps': 15992, 'loss/train': 1.1383863687515259} 11/06/2021 23:30:54 - INFO - __main__ - Step 15994: {'lr': 0.0004890525585950181, 'samples': 3070848, 'steps': 15993, 'loss/train': 2.0593109130859375} 11/06/2021 23:30:54 - INFO - __main__ - Step 15995: {'lr': 0.000489051005359552, 'samples': 3071040, 'steps': 15994, 'loss/train': 1.7306606769561768} 11/06/2021 23:30:55 - INFO - __main__ - Step 15996: {'lr': 0.0004890494520163731, 'samples': 3071232, 'steps': 15995, 'loss/train': 1.410784363746643} 11/06/2021 23:30:55 - INFO - __main__ - Step 15997: {'lr': 0.0004890478985654823, 'samples': 3071424, 'steps': 15996, 'loss/train': 1.6601943969726562} 11/06/2021 23:30:56 - INFO - __main__ - Step 15998: {'lr': 0.0004890463450068801, 'samples': 3071616, 'steps': 15997, 'loss/train': 1.836921215057373} 11/06/2021 23:30:57 - INFO - __main__ - Step 15999: {'lr': 0.0004890447913405673, 'samples': 3071808, 'steps': 15998, 'loss/train': 1.627102255821228} 11/06/2021 23:30:57 - INFO - __main__ - Step 16000: {'lr': 0.0004890432375665447, 'samples': 3072000, 'steps': 15999, 'loss/train': 1.4169437885284424} 11/06/2021 23:30:57 - INFO - __main__ - Step 16001: {'lr': 0.0004890416836848127, 'samples': 3072192, 'steps': 16000, 'loss/train': 1.6639877557754517} 11/06/2021 23:30:58 - INFO - __main__ - Step 16002: {'lr': 0.0004890401296953723, 'samples': 3072384, 'steps': 16001, 'loss/train': 1.9044790267944336} 11/06/2021 23:30:58 - INFO - __main__ - Step 16003: {'lr': 0.0004890385755982243, 'samples': 3072576, 'steps': 16002, 'loss/train': 1.7246227264404297} 11/06/2021 23:30:58 - INFO - __main__ - Step 16004: {'lr': 0.0004890370213933691, 'samples': 3072768, 'steps': 16003, 'loss/train': 1.6332004070281982} 11/06/2021 23:30:59 - INFO - __main__ - Step 16005: {'lr': 0.0004890354670808074, 'samples': 3072960, 'steps': 16004, 'loss/train': 2.4110894203186035} 11/06/2021 23:31:00 - INFO - __main__ - Step 16006: {'lr': 0.0004890339126605401, 'samples': 3073152, 'steps': 16005, 'loss/train': 1.834177851676941} 11/06/2021 23:31:00 - INFO - __main__ - Step 16007: {'lr': 0.0004890323581325677, 'samples': 3073344, 'steps': 16006, 'loss/train': 1.876891016960144} 11/06/2021 23:31:00 - INFO - __main__ - Step 16008: {'lr': 0.0004890308034968911, 'samples': 3073536, 'steps': 16007, 'loss/train': 1.4352622032165527} 11/06/2021 23:31:01 - INFO - __main__ - Step 16009: {'lr': 0.0004890292487535108, 'samples': 3073728, 'steps': 16008, 'loss/train': 2.2022879123687744} 11/06/2021 23:31:02 - INFO - __main__ - Step 16010: {'lr': 0.0004890276939024278, 'samples': 3073920, 'steps': 16009, 'loss/train': 0.9346103072166443} 11/06/2021 23:31:02 - INFO - __main__ - Step 16011: {'lr': 0.0004890261389436424, 'samples': 3074112, 'steps': 16010, 'loss/train': 1.6814231872558594} 11/06/2021 23:31:03 - INFO - __main__ - Step 16012: {'lr': 0.0004890245838771557, 'samples': 3074304, 'steps': 16011, 'loss/train': 1.8951748609542847} 11/06/2021 23:31:03 - INFO - __main__ - Step 16013: {'lr': 0.0004890230287029681, 'samples': 3074496, 'steps': 16012, 'loss/train': 1.8367836475372314} 11/06/2021 23:31:03 - INFO - __main__ - Step 16014: {'lr': 0.0004890214734210805, 'samples': 3074688, 'steps': 16013, 'loss/train': 1.9447275400161743} 11/06/2021 23:31:04 - INFO - __main__ - Step 16015: {'lr': 0.0004890199180314935, 'samples': 3074880, 'steps': 16014, 'loss/train': 1.739290714263916} 11/06/2021 23:31:05 - INFO - __main__ - Step 16016: {'lr': 0.0004890183625342078, 'samples': 3075072, 'steps': 16015, 'loss/train': 1.9823334217071533} 11/06/2021 23:31:05 - INFO - __main__ - Step 16017: {'lr': 0.0004890168069292241, 'samples': 3075264, 'steps': 16016, 'loss/train': 1.898835301399231} 11/06/2021 23:31:05 - INFO - __main__ - Step 16018: {'lr': 0.000489015251216543, 'samples': 3075456, 'steps': 16017, 'loss/train': 1.9296048879623413} 11/06/2021 23:31:06 - INFO - __main__ - Step 16019: {'lr': 0.0004890136953961654, 'samples': 3075648, 'steps': 16018, 'loss/train': 1.3645771741867065} 11/06/2021 23:31:07 - INFO - __main__ - Step 16020: {'lr': 0.000489012139468092, 'samples': 3075840, 'steps': 16019, 'loss/train': 1.6032259464263916} 11/06/2021 23:31:07 - INFO - __main__ - Step 16021: {'lr': 0.0004890105834323233, 'samples': 3076032, 'steps': 16020, 'loss/train': 1.5474944114685059} 11/06/2021 23:31:07 - INFO - __main__ - Step 16022: {'lr': 0.0004890090272888602, 'samples': 3076224, 'steps': 16021, 'loss/train': 1.8480674028396606} 11/06/2021 23:31:08 - INFO - __main__ - Step 16023: {'lr': 0.0004890074710377033, 'samples': 3076416, 'steps': 16022, 'loss/train': 1.5558422803878784} 11/06/2021 23:31:08 - INFO - __main__ - Step 16024: {'lr': 0.0004890059146788532, 'samples': 3076608, 'steps': 16023, 'loss/train': 1.956721305847168} 11/06/2021 23:31:09 - INFO - __main__ - Step 16025: {'lr': 0.000489004358212311, 'samples': 3076800, 'steps': 16024, 'loss/train': 1.4700076580047607} 11/06/2021 23:31:10 - INFO - __main__ - Step 16026: {'lr': 0.0004890028016380769, 'samples': 3076992, 'steps': 16025, 'loss/train': 1.5509119033813477} 11/06/2021 23:31:10 - INFO - __main__ - Step 16027: {'lr': 0.0004890012449561518, 'samples': 3077184, 'steps': 16026, 'loss/train': 1.683590292930603} 11/06/2021 23:31:10 - INFO - __main__ - Step 16028: {'lr': 0.0004889996881665366, 'samples': 3077376, 'steps': 16027, 'loss/train': 1.3393259048461914} 11/06/2021 23:31:11 - INFO - __main__ - Step 16029: {'lr': 0.0004889981312692317, 'samples': 3077568, 'steps': 16028, 'loss/train': 1.7740288972854614} 11/06/2021 23:31:11 - INFO - __main__ - Step 16030: {'lr': 0.000488996574264238, 'samples': 3077760, 'steps': 16029, 'loss/train': 1.773305892944336} 11/06/2021 23:31:12 - INFO - __main__ - Step 16031: {'lr': 0.000488995017151556, 'samples': 3077952, 'steps': 16030, 'loss/train': 0.440768837928772} 11/06/2021 23:31:12 - INFO - __main__ - Step 16032: {'lr': 0.0004889934599311867, 'samples': 3078144, 'steps': 16031, 'loss/train': 2.0566556453704834} 11/06/2021 23:31:13 - INFO - __main__ - Step 16033: {'lr': 0.0004889919026031306, 'samples': 3078336, 'steps': 16032, 'loss/train': 1.4847581386566162} 11/06/2021 23:31:13 - INFO - __main__ - Step 16034: {'lr': 0.0004889903451673884, 'samples': 3078528, 'steps': 16033, 'loss/train': 1.6132980585098267} 11/06/2021 23:31:14 - INFO - __main__ - Step 16035: {'lr': 0.0004889887876239608, 'samples': 3078720, 'steps': 16034, 'loss/train': 1.4533771276474} 11/06/2021 23:31:15 - INFO - __main__ - Step 16036: {'lr': 0.0004889872299728486, 'samples': 3078912, 'steps': 16035, 'loss/train': 1.3205077648162842} 11/06/2021 23:31:15 - INFO - __main__ - Step 16037: {'lr': 0.0004889856722140525, 'samples': 3079104, 'steps': 16036, 'loss/train': 1.8552273511886597} 11/06/2021 23:31:15 - INFO - __main__ - Step 16038: {'lr': 0.000488984114347573, 'samples': 3079296, 'steps': 16037, 'loss/train': 1.5201712846755981} 11/06/2021 23:31:16 - INFO - __main__ - Step 16039: {'lr': 0.000488982556373411, 'samples': 3079488, 'steps': 16038, 'loss/train': 1.3169190883636475} 11/06/2021 23:31:16 - INFO - __main__ - Step 16040: {'lr': 0.0004889809982915672, 'samples': 3079680, 'steps': 16039, 'loss/train': 1.6098225116729736} 11/06/2021 23:31:17 - INFO - __main__ - Step 16041: {'lr': 0.0004889794401020422, 'samples': 3079872, 'steps': 16040, 'loss/train': 1.4014536142349243} 11/06/2021 23:31:17 - INFO - __main__ - Step 16042: {'lr': 0.0004889778818048368, 'samples': 3080064, 'steps': 16041, 'loss/train': 1.5248942375183105} 11/06/2021 23:31:18 - INFO - __main__ - Step 16043: {'lr': 0.0004889763233999516, 'samples': 3080256, 'steps': 16042, 'loss/train': 1.8693872690200806} 11/06/2021 23:31:18 - INFO - __main__ - Step 16044: {'lr': 0.0004889747648873874, 'samples': 3080448, 'steps': 16043, 'loss/train': 1.9268662929534912} 11/06/2021 23:31:18 - INFO - __main__ - Step 16045: {'lr': 0.0004889732062671448, 'samples': 3080640, 'steps': 16044, 'loss/train': 1.6837748289108276} 11/06/2021 23:31:19 - INFO - __main__ - Step 16046: {'lr': 0.0004889716475392247, 'samples': 3080832, 'steps': 16045, 'loss/train': 1.4768024682998657} 11/06/2021 23:31:20 - INFO - __main__ - Step 16047: {'lr': 0.0004889700887036275, 'samples': 3081024, 'steps': 16046, 'loss/train': 2.1141278743743896} 11/06/2021 23:31:20 - INFO - __main__ - Step 16048: {'lr': 0.0004889685297603541, 'samples': 3081216, 'steps': 16047, 'loss/train': 1.835368275642395} 11/06/2021 23:31:20 - INFO - __main__ - Step 16049: {'lr': 0.0004889669707094052, 'samples': 3081408, 'steps': 16048, 'loss/train': 1.6546882390975952} 11/06/2021 23:31:21 - INFO - __main__ - Step 16050: {'lr': 0.0004889654115507815, 'samples': 3081600, 'steps': 16049, 'loss/train': 1.2408573627471924} 11/06/2021 23:31:22 - INFO - __main__ - Step 16051: {'lr': 0.0004889638522844836, 'samples': 3081792, 'steps': 16050, 'loss/train': 1.5190356969833374} 11/06/2021 23:31:22 - INFO - __main__ - Step 16052: {'lr': 0.0004889622929105123, 'samples': 3081984, 'steps': 16051, 'loss/train': 2.227968215942383} 11/06/2021 23:31:23 - INFO - __main__ - Step 16053: {'lr': 0.0004889607334288683, 'samples': 3082176, 'steps': 16052, 'loss/train': 1.310071349143982} 11/06/2021 23:31:23 - INFO - __main__ - Step 16054: {'lr': 0.0004889591738395522, 'samples': 3082368, 'steps': 16053, 'loss/train': 1.535683512687683} 11/06/2021 23:31:24 - INFO - __main__ - Step 16055: {'lr': 0.0004889576141425649, 'samples': 3082560, 'steps': 16054, 'loss/train': 1.4733257293701172} 11/06/2021 23:31:24 - INFO - __main__ - Step 16056: {'lr': 0.0004889560543379069, 'samples': 3082752, 'steps': 16055, 'loss/train': 0.716278612613678} 11/06/2021 23:31:25 - INFO - __main__ - Step 16057: {'lr': 0.000488954494425579, 'samples': 3082944, 'steps': 16056, 'loss/train': 1.6750446557998657} 11/06/2021 23:31:25 - INFO - __main__ - Step 16058: {'lr': 0.000488952934405582, 'samples': 3083136, 'steps': 16057, 'loss/train': 2.0339643955230713} 11/06/2021 23:31:26 - INFO - __main__ - Step 16059: {'lr': 0.0004889513742779164, 'samples': 3083328, 'steps': 16058, 'loss/train': 1.7793983221054077} 11/06/2021 23:31:26 - INFO - __main__ - Step 16060: {'lr': 0.0004889498140425829, 'samples': 3083520, 'steps': 16059, 'loss/train': 1.7236640453338623} 11/06/2021 23:31:26 - INFO - __main__ - Step 16061: {'lr': 0.0004889482536995825, 'samples': 3083712, 'steps': 16060, 'loss/train': 1.4017524719238281} 11/06/2021 23:31:28 - INFO - __main__ - Step 16062: {'lr': 0.0004889466932489157, 'samples': 3083904, 'steps': 16061, 'loss/train': 1.0112061500549316} 11/06/2021 23:31:28 - INFO - __main__ - Step 16063: {'lr': 0.0004889451326905831, 'samples': 3084096, 'steps': 16062, 'loss/train': 1.7582470178604126} 11/06/2021 23:31:28 - INFO - __main__ - Step 16064: {'lr': 0.0004889435720245855, 'samples': 3084288, 'steps': 16063, 'loss/train': 1.3274977207183838} 11/06/2021 23:31:29 - INFO - __main__ - Step 16065: {'lr': 0.0004889420112509237, 'samples': 3084480, 'steps': 16064, 'loss/train': 0.20192763209342957} 11/06/2021 23:31:29 - INFO - __main__ - Step 16066: {'lr': 0.0004889404503695983, 'samples': 3084672, 'steps': 16065, 'loss/train': 1.569493293762207} 11/06/2021 23:31:29 - INFO - __main__ - Step 16067: {'lr': 0.0004889388893806099, 'samples': 3084864, 'steps': 16066, 'loss/train': 1.8277945518493652} 11/06/2021 23:31:31 - INFO - __main__ - Step 16068: {'lr': 0.0004889373282839594, 'samples': 3085056, 'steps': 16067, 'loss/train': 1.9361584186553955} 11/06/2021 23:31:31 - INFO - __main__ - Step 16069: {'lr': 0.0004889357670796474, 'samples': 3085248, 'steps': 16068, 'loss/train': 1.7485822439193726} 11/06/2021 23:31:31 - INFO - __main__ - Step 16070: {'lr': 0.0004889342057676748, 'samples': 3085440, 'steps': 16069, 'loss/train': 1.6493250131607056} 11/06/2021 23:31:32 - INFO - __main__ - Step 16071: {'lr': 0.000488932644348042, 'samples': 3085632, 'steps': 16070, 'loss/train': 1.7905906438827515} 11/06/2021 23:31:32 - INFO - __main__ - Step 16072: {'lr': 0.0004889310828207498, 'samples': 3085824, 'steps': 16071, 'loss/train': 1.781125783920288} 11/06/2021 23:31:33 - INFO - __main__ - Step 16073: {'lr': 0.000488929521185799, 'samples': 3086016, 'steps': 16072, 'loss/train': 1.8665831089019775} 11/06/2021 23:31:33 - INFO - __main__ - Step 16074: {'lr': 0.0004889279594431903, 'samples': 3086208, 'steps': 16073, 'loss/train': 1.7310936450958252} 11/06/2021 23:31:34 - INFO - __main__ - Step 16075: {'lr': 0.0004889263975929242, 'samples': 3086400, 'steps': 16074, 'loss/train': 1.4840593338012695} 11/06/2021 23:31:34 - INFO - __main__ - Step 16076: {'lr': 0.0004889248356350016, 'samples': 3086592, 'steps': 16075, 'loss/train': 1.7702844142913818} 11/06/2021 23:31:34 - INFO - __main__ - Step 16077: {'lr': 0.0004889232735694232, 'samples': 3086784, 'steps': 16076, 'loss/train': 1.6397343873977661} 11/06/2021 23:31:35 - INFO - __main__ - Step 16078: {'lr': 0.0004889217113961896, 'samples': 3086976, 'steps': 16077, 'loss/train': 1.6018173694610596} 11/06/2021 23:31:36 - INFO - __main__ - Step 16079: {'lr': 0.0004889201491153016, 'samples': 3087168, 'steps': 16078, 'loss/train': 1.2941079139709473} 11/06/2021 23:31:36 - INFO - __main__ - Step 16080: {'lr': 0.0004889185867267599, 'samples': 3087360, 'steps': 16079, 'loss/train': 1.3023624420166016} 11/06/2021 23:31:36 - INFO - __main__ - Step 16081: {'lr': 0.0004889170242305652, 'samples': 3087552, 'steps': 16080, 'loss/train': 0.6730976104736328} 11/06/2021 23:31:37 - INFO - __main__ - Step 16082: {'lr': 0.0004889154616267181, 'samples': 3087744, 'steps': 16081, 'loss/train': 1.7039793729782104} 11/06/2021 23:31:38 - INFO - __main__ - Step 16083: {'lr': 0.0004889138989152194, 'samples': 3087936, 'steps': 16082, 'loss/train': 1.3806297779083252} 11/06/2021 23:31:38 - INFO - __main__ - Step 16084: {'lr': 0.0004889123360960698, 'samples': 3088128, 'steps': 16083, 'loss/train': 1.7928431034088135} 11/06/2021 23:31:39 - INFO - __main__ - Step 16085: {'lr': 0.0004889107731692699, 'samples': 3088320, 'steps': 16084, 'loss/train': 2.2805731296539307} 11/06/2021 23:31:39 - INFO - __main__ - Step 16086: {'lr': 0.0004889092101348206, 'samples': 3088512, 'steps': 16085, 'loss/train': 1.1894793510437012} 11/06/2021 23:31:39 - INFO - __main__ - Step 16087: {'lr': 0.0004889076469927225, 'samples': 3088704, 'steps': 16086, 'loss/train': 1.330406904220581} 11/06/2021 23:31:40 - INFO - __main__ - Step 16088: {'lr': 0.0004889060837429762, 'samples': 3088896, 'steps': 16087, 'loss/train': 1.6548304557800293} 11/06/2021 23:31:41 - INFO - __main__ - Step 16089: {'lr': 0.0004889045203855826, 'samples': 3089088, 'steps': 16088, 'loss/train': 1.9373817443847656} 11/06/2021 23:31:41 - INFO - __main__ - Step 16090: {'lr': 0.0004889029569205423, 'samples': 3089280, 'steps': 16089, 'loss/train': 1.7468717098236084} 11/06/2021 23:31:41 - INFO - __main__ - Step 16091: {'lr': 0.0004889013933478559, 'samples': 3089472, 'steps': 16090, 'loss/train': 1.9759714603424072} 11/06/2021 23:31:42 - INFO - __main__ - Step 16092: {'lr': 0.0004888998296675243, 'samples': 3089664, 'steps': 16091, 'loss/train': 1.7408642768859863} 11/06/2021 23:31:42 - INFO - __main__ - Step 16093: {'lr': 0.0004888982658795482, 'samples': 3089856, 'steps': 16092, 'loss/train': 1.9443445205688477} 11/06/2021 23:31:43 - INFO - __main__ - Step 16094: {'lr': 0.0004888967019839282, 'samples': 3090048, 'steps': 16093, 'loss/train': 1.6487983465194702} 11/06/2021 23:31:43 - INFO - __main__ - Step 16095: {'lr': 0.000488895137980665, 'samples': 3090240, 'steps': 16094, 'loss/train': 1.88912034034729} 11/06/2021 23:31:44 - INFO - __main__ - Step 16096: {'lr': 0.0004888935738697593, 'samples': 3090432, 'steps': 16095, 'loss/train': 1.3398206233978271} 11/06/2021 23:31:44 - INFO - __main__ - Step 16097: {'lr': 0.0004888920096512118, 'samples': 3090624, 'steps': 16096, 'loss/train': 1.280200481414795} 11/06/2021 23:31:45 - INFO - __main__ - Step 16098: {'lr': 0.0004888904453250233, 'samples': 3090816, 'steps': 16097, 'loss/train': 1.5382581949234009} 11/06/2021 23:31:46 - INFO - __main__ - Step 16099: {'lr': 0.0004888888808911946, 'samples': 3091008, 'steps': 16098, 'loss/train': 2.1261627674102783} 11/06/2021 23:31:46 - INFO - __main__ - Step 16100: {'lr': 0.0004888873163497261, 'samples': 3091200, 'steps': 16099, 'loss/train': 2.079089641571045} 11/06/2021 23:31:46 - INFO - __main__ - Step 16101: {'lr': 0.0004888857517006186, 'samples': 3091392, 'steps': 16100, 'loss/train': 2.2009711265563965} 11/06/2021 23:31:47 - INFO - __main__ - Step 16102: {'lr': 0.000488884186943873, 'samples': 3091584, 'steps': 16101, 'loss/train': 0.620243489742279} 11/06/2021 23:31:48 - INFO - __main__ - Step 16103: {'lr': 0.0004888826220794899, 'samples': 3091776, 'steps': 16102, 'loss/train': 1.991464376449585} 11/06/2021 23:31:48 - INFO - __main__ - Step 16104: {'lr': 0.0004888810571074698, 'samples': 3091968, 'steps': 16103, 'loss/train': 1.6059728860855103} 11/06/2021 23:31:48 - INFO - __main__ - Step 16105: {'lr': 0.0004888794920278137, 'samples': 3092160, 'steps': 16104, 'loss/train': 1.831572413444519} 11/06/2021 23:31:49 - INFO - __main__ - Step 16106: {'lr': 0.0004888779268405223, 'samples': 3092352, 'steps': 16105, 'loss/train': 1.3617275953292847} 11/06/2021 23:31:49 - INFO - __main__ - Step 16107: {'lr': 0.0004888763615455959, 'samples': 3092544, 'steps': 16106, 'loss/train': 1.7099063396453857} 11/06/2021 23:31:49 - INFO - __main__ - Step 16108: {'lr': 0.0004888747961430358, 'samples': 3092736, 'steps': 16107, 'loss/train': 1.6582728624343872} 11/06/2021 23:31:51 - INFO - __main__ - Step 16109: {'lr': 0.0004888732306328422, 'samples': 3092928, 'steps': 16108, 'loss/train': 1.6152935028076172} 11/06/2021 23:31:52 - INFO - __main__ - Step 16110: {'lr': 0.000488871665015016, 'samples': 3093120, 'steps': 16109, 'loss/train': 1.057140588760376} 11/06/2021 23:31:52 - INFO - __main__ - Step 16111: {'lr': 0.0004888700992895581, 'samples': 3093312, 'steps': 16110, 'loss/train': 1.769897699356079} 11/06/2021 23:31:52 - INFO - __main__ - Step 16112: {'lr': 0.0004888685334564688, 'samples': 3093504, 'steps': 16111, 'loss/train': 1.4606374502182007} 11/06/2021 23:31:53 - INFO - __main__ - Step 16113: {'lr': 0.0004888669675157492, 'samples': 3093696, 'steps': 16112, 'loss/train': 1.8256492614746094} 11/06/2021 23:31:53 - INFO - __main__ - Step 16114: {'lr': 0.0004888654014673998, 'samples': 3093888, 'steps': 16113, 'loss/train': 1.8334771394729614} 11/06/2021 23:31:53 - INFO - __main__ - Step 16115: {'lr': 0.0004888638353114212, 'samples': 3094080, 'steps': 16114, 'loss/train': 1.4679832458496094} 11/06/2021 23:31:54 - INFO - __main__ - Step 16116: {'lr': 0.0004888622690478144, 'samples': 3094272, 'steps': 16115, 'loss/train': 1.6196460723876953} 11/06/2021 23:31:55 - INFO - __main__ - Step 16117: {'lr': 0.0004888607026765799, 'samples': 3094464, 'steps': 16116, 'loss/train': 1.916323184967041} 11/06/2021 23:31:55 - INFO - __main__ - Step 16118: {'lr': 0.0004888591361977184, 'samples': 3094656, 'steps': 16117, 'loss/train': 1.9670770168304443} 11/06/2021 23:31:55 - INFO - __main__ - Step 16119: {'lr': 0.0004888575696112308, 'samples': 3094848, 'steps': 16118, 'loss/train': 1.839583158493042} 11/06/2021 23:31:56 - INFO - __main__ - Step 16120: {'lr': 0.0004888560029171175, 'samples': 3095040, 'steps': 16119, 'loss/train': 1.7502155303955078} 11/06/2021 23:31:57 - INFO - __main__ - Step 16121: {'lr': 0.0004888544361153794, 'samples': 3095232, 'steps': 16120, 'loss/train': 1.7904021739959717} 11/06/2021 23:31:57 - INFO - __main__ - Step 16122: {'lr': 0.0004888528692060173, 'samples': 3095424, 'steps': 16121, 'loss/train': 2.108914852142334} 11/06/2021 23:31:58 - INFO - __main__ - Step 16123: {'lr': 0.0004888513021890316, 'samples': 3095616, 'steps': 16122, 'loss/train': 1.400886058807373} 11/06/2021 23:31:58 - INFO - __main__ - Step 16124: {'lr': 0.0004888497350644234, 'samples': 3095808, 'steps': 16123, 'loss/train': 1.7776603698730469} 11/06/2021 23:31:58 - INFO - __main__ - Step 16125: {'lr': 0.000488848167832193, 'samples': 3096000, 'steps': 16124, 'loss/train': 1.5604819059371948} 11/06/2021 23:31:59 - INFO - __main__ - Step 16126: {'lr': 0.0004888466004923413, 'samples': 3096192, 'steps': 16125, 'loss/train': 1.3210574388504028} 11/06/2021 23:32:00 - INFO - __main__ - Step 16127: {'lr': 0.0004888450330448692, 'samples': 3096384, 'steps': 16126, 'loss/train': 1.4687092304229736} 11/06/2021 23:32:00 - INFO - __main__ - Step 16128: {'lr': 0.000488843465489777, 'samples': 3096576, 'steps': 16127, 'loss/train': 1.5735976696014404} 11/06/2021 23:32:00 - INFO - __main__ - Step 16129: {'lr': 0.0004888418978270658, 'samples': 3096768, 'steps': 16128, 'loss/train': 1.747856855392456} 11/06/2021 23:32:01 - INFO - __main__ - Step 16130: {'lr': 0.000488840330056736, 'samples': 3096960, 'steps': 16129, 'loss/train': 2.0261762142181396} 11/06/2021 23:32:02 - INFO - __main__ - Step 16131: {'lr': 0.0004888387621787885, 'samples': 3097152, 'steps': 16130, 'loss/train': 2.1754300594329834} 11/06/2021 23:32:02 - INFO - __main__ - Step 16132: {'lr': 0.0004888371941932239, 'samples': 3097344, 'steps': 16131, 'loss/train': 1.5788387060165405} 11/06/2021 23:32:03 - INFO - __main__ - Step 16133: {'lr': 0.000488835626100043, 'samples': 3097536, 'steps': 16132, 'loss/train': 1.9797744750976562} 11/06/2021 23:32:03 - INFO - __main__ - Step 16134: {'lr': 0.0004888340578992464, 'samples': 3097728, 'steps': 16133, 'loss/train': 1.6595001220703125} 11/06/2021 23:32:03 - INFO - __main__ - Step 16135: {'lr': 0.0004888324895908349, 'samples': 3097920, 'steps': 16134, 'loss/train': 1.8906220197677612} 11/06/2021 23:32:04 - INFO - __main__ - Step 16136: {'lr': 0.0004888309211748091, 'samples': 3098112, 'steps': 16135, 'loss/train': 1.909759759902954} 11/06/2021 23:32:05 - INFO - __main__ - Step 16137: {'lr': 0.0004888293526511697, 'samples': 3098304, 'steps': 16136, 'loss/train': 1.5360649824142456} 11/06/2021 23:32:05 - INFO - __main__ - Step 16138: {'lr': 0.0004888277840199177, 'samples': 3098496, 'steps': 16137, 'loss/train': 1.0076518058776855} 11/06/2021 23:32:05 - INFO - __main__ - Step 16139: {'lr': 0.0004888262152810534, 'samples': 3098688, 'steps': 16138, 'loss/train': 1.4442768096923828} 11/06/2021 23:32:06 - INFO - __main__ - Step 16140: {'lr': 0.0004888246464345779, 'samples': 3098880, 'steps': 16139, 'loss/train': 1.477372646331787} 11/06/2021 23:32:06 - INFO - __main__ - Step 16141: {'lr': 0.0004888230774804915, 'samples': 3099072, 'steps': 16140, 'loss/train': 1.4299622774124146} 11/06/2021 23:32:07 - INFO - __main__ - Step 16142: {'lr': 0.0004888215084187952, 'samples': 3099264, 'steps': 16141, 'loss/train': 1.1523048877716064} 11/06/2021 23:32:07 - INFO - __main__ - Step 16143: {'lr': 0.0004888199392494896, 'samples': 3099456, 'steps': 16142, 'loss/train': 2.0774753093719482} 11/06/2021 23:32:08 - INFO - __main__ - Step 16144: {'lr': 0.0004888183699725755, 'samples': 3099648, 'steps': 16143, 'loss/train': 1.240419864654541} 11/06/2021 23:32:08 - INFO - __main__ - Step 16145: {'lr': 0.0004888168005880533, 'samples': 3099840, 'steps': 16144, 'loss/train': 1.6220906972885132} 11/06/2021 23:32:09 - INFO - __main__ - Step 16146: {'lr': 0.0004888152310959242, 'samples': 3100032, 'steps': 16145, 'loss/train': 1.7612665891647339} 11/06/2021 23:32:10 - INFO - __main__ - Step 16147: {'lr': 0.0004888136614961885, 'samples': 3100224, 'steps': 16146, 'loss/train': 1.4522687196731567} 11/06/2021 23:32:10 - INFO - __main__ - Step 16148: {'lr': 0.000488812091788847, 'samples': 3100416, 'steps': 16147, 'loss/train': 1.4766018390655518} 11/06/2021 23:32:10 - INFO - __main__ - Step 16149: {'lr': 0.0004888105219739005, 'samples': 3100608, 'steps': 16148, 'loss/train': 1.5540204048156738} 11/06/2021 23:32:11 - INFO - __main__ - Step 16150: {'lr': 0.0004888089520513497, 'samples': 3100800, 'steps': 16149, 'loss/train': 1.726388931274414} 11/06/2021 23:32:11 - INFO - __main__ - Step 16151: {'lr': 0.0004888073820211952, 'samples': 3100992, 'steps': 16150, 'loss/train': 1.5844863653182983} 11/06/2021 23:32:12 - INFO - __main__ - Step 16152: {'lr': 0.0004888058118834379, 'samples': 3101184, 'steps': 16151, 'loss/train': 1.4964817762374878} 11/06/2021 23:32:13 - INFO - __main__ - Step 16153: {'lr': 0.0004888042416380784, 'samples': 3101376, 'steps': 16152, 'loss/train': 2.1015524864196777} 11/06/2021 23:32:13 - INFO - __main__ - Step 16154: {'lr': 0.0004888026712851172, 'samples': 3101568, 'steps': 16153, 'loss/train': 1.3865811824798584} 11/06/2021 23:32:13 - INFO - __main__ - Step 16155: {'lr': 0.0004888011008245554, 'samples': 3101760, 'steps': 16154, 'loss/train': 1.5103989839553833} 11/06/2021 23:32:14 - INFO - __main__ - Step 16156: {'lr': 0.0004887995302563934, 'samples': 3101952, 'steps': 16155, 'loss/train': 1.9059851169586182} 11/06/2021 23:32:14 - INFO - __main__ - Step 16157: {'lr': 0.000488797959580632, 'samples': 3102144, 'steps': 16156, 'loss/train': 2.0351128578186035} 11/06/2021 23:32:15 - INFO - __main__ - Step 16158: {'lr': 0.000488796388797272, 'samples': 3102336, 'steps': 16157, 'loss/train': 0.9961912631988525} 11/06/2021 23:32:15 - INFO - __main__ - Step 16159: {'lr': 0.0004887948179063139, 'samples': 3102528, 'steps': 16158, 'loss/train': 1.8558497428894043} 11/06/2021 23:32:16 - INFO - __main__ - Step 16160: {'lr': 0.0004887932469077587, 'samples': 3102720, 'steps': 16159, 'loss/train': 1.6343353986740112} 11/06/2021 23:32:16 - INFO - __main__ - Step 16161: {'lr': 0.0004887916758016069, 'samples': 3102912, 'steps': 16160, 'loss/train': 1.7102398872375488} 11/06/2021 23:32:16 - INFO - __main__ - Step 16162: {'lr': 0.0004887901045878592, 'samples': 3103104, 'steps': 16161, 'loss/train': 1.9250577688217163} 11/06/2021 23:32:18 - INFO - __main__ - Step 16163: {'lr': 0.0004887885332665165, 'samples': 3103296, 'steps': 16162, 'loss/train': 0.9902682900428772} 11/06/2021 23:32:18 - INFO - __main__ - Step 16164: {'lr': 0.0004887869618375793, 'samples': 3103488, 'steps': 16163, 'loss/train': 2.02260160446167} 11/06/2021 23:32:18 - INFO - __main__ - Step 16165: {'lr': 0.0004887853903010483, 'samples': 3103680, 'steps': 16164, 'loss/train': 1.4178218841552734} 11/06/2021 23:32:19 - INFO - __main__ - Step 16166: {'lr': 0.0004887838186569244, 'samples': 3103872, 'steps': 16165, 'loss/train': 1.004329800605774} 11/06/2021 23:32:19 - INFO - __main__ - Step 16167: {'lr': 0.0004887822469052081, 'samples': 3104064, 'steps': 16166, 'loss/train': 1.5798732042312622} 11/06/2021 23:32:20 - INFO - __main__ - Step 16168: {'lr': 0.0004887806750459002, 'samples': 3104256, 'steps': 16167, 'loss/train': 0.9959037899971008} 11/06/2021 23:32:20 - INFO - __main__ - Step 16169: {'lr': 0.0004887791030790016, 'samples': 3104448, 'steps': 16168, 'loss/train': 1.3781802654266357} 11/06/2021 23:32:21 - INFO - __main__ - Step 16170: {'lr': 0.0004887775310045126, 'samples': 3104640, 'steps': 16169, 'loss/train': 1.5973459482192993} 11/06/2021 23:32:21 - INFO - __main__ - Step 16171: {'lr': 0.0004887759588224342, 'samples': 3104832, 'steps': 16170, 'loss/train': 1.6264896392822266} 11/06/2021 23:32:21 - INFO - __main__ - Step 16172: {'lr': 0.000488774386532767, 'samples': 3105024, 'steps': 16171, 'loss/train': 1.583954930305481} 11/06/2021 23:32:23 - INFO - __main__ - Step 16173: {'lr': 0.0004887728141355118, 'samples': 3105216, 'steps': 16172, 'loss/train': 1.6285706758499146} 11/06/2021 23:32:23 - INFO - __main__ - Step 16174: {'lr': 0.0004887712416306693, 'samples': 3105408, 'steps': 16173, 'loss/train': 1.1248019933700562} 11/06/2021 23:32:23 - INFO - __main__ - Step 16175: {'lr': 0.00048876966901824, 'samples': 3105600, 'steps': 16174, 'loss/train': 1.661145567893982} 11/06/2021 23:32:24 - INFO - __main__ - Step 16176: {'lr': 0.0004887680962982249, 'samples': 3105792, 'steps': 16175, 'loss/train': 1.8155841827392578} 11/06/2021 23:32:24 - INFO - __main__ - Step 16177: {'lr': 0.0004887665234706247, 'samples': 3105984, 'steps': 16176, 'loss/train': 1.6168017387390137} 11/06/2021 23:32:25 - INFO - __main__ - Step 16178: {'lr': 0.0004887649505354398, 'samples': 3106176, 'steps': 16177, 'loss/train': 1.7428489923477173} 11/06/2021 23:32:25 - INFO - __main__ - Step 16179: {'lr': 0.000488763377492671, 'samples': 3106368, 'steps': 16178, 'loss/train': 1.9398001432418823} 11/06/2021 23:32:26 - INFO - __main__ - Step 16180: {'lr': 0.0004887618043423194, 'samples': 3106560, 'steps': 16179, 'loss/train': 1.2829269170761108} 11/06/2021 23:32:26 - INFO - __main__ - Step 16181: {'lr': 0.0004887602310843852, 'samples': 3106752, 'steps': 16180, 'loss/train': 2.0834195613861084} 11/06/2021 23:32:26 - INFO - __main__ - Step 16182: {'lr': 0.0004887586577188694, 'samples': 3106944, 'steps': 16181, 'loss/train': 1.844342827796936} 11/06/2021 23:32:27 - INFO - __main__ - Step 16183: {'lr': 0.0004887570842457726, 'samples': 3107136, 'steps': 16182, 'loss/train': 1.410559058189392} 11/06/2021 23:32:28 - INFO - __main__ - Step 16184: {'lr': 0.0004887555106650956, 'samples': 3107328, 'steps': 16183, 'loss/train': 1.3475863933563232} 11/06/2021 23:32:28 - INFO - __main__ - Step 16185: {'lr': 0.000488753936976839, 'samples': 3107520, 'steps': 16184, 'loss/train': 5.809494495391846} 11/06/2021 23:32:29 - INFO - __main__ - Step 16186: {'lr': 0.0004887523631810036, 'samples': 3107712, 'steps': 16185, 'loss/train': 1.431221604347229} 11/06/2021 23:32:29 - INFO - __main__ - Step 16187: {'lr': 0.00048875078927759, 'samples': 3107904, 'steps': 16186, 'loss/train': 1.6781052350997925} 11/06/2021 23:32:29 - INFO - __main__ - Step 16188: {'lr': 0.000488749215266599, 'samples': 3108096, 'steps': 16187, 'loss/train': 2.205307960510254} 11/06/2021 23:32:30 - INFO - __main__ - Step 16189: {'lr': 0.0004887476411480314, 'samples': 3108288, 'steps': 16188, 'loss/train': 1.7173161506652832} 11/06/2021 23:32:31 - INFO - __main__ - Step 16190: {'lr': 0.0004887460669218877, 'samples': 3108480, 'steps': 16189, 'loss/train': 1.8591818809509277} 11/06/2021 23:32:31 - INFO - __main__ - Step 16191: {'lr': 0.0004887444925881688, 'samples': 3108672, 'steps': 16190, 'loss/train': 1.212321162223816} 11/06/2021 23:32:31 - INFO - __main__ - Step 16192: {'lr': 0.0004887429181468752, 'samples': 3108864, 'steps': 16191, 'loss/train': 1.6692414283752441} 11/06/2021 23:32:32 - INFO - __main__ - Step 16193: {'lr': 0.0004887413435980077, 'samples': 3109056, 'steps': 16192, 'loss/train': 1.82635498046875} 11/06/2021 23:32:32 - INFO - __main__ - Step 16194: {'lr': 0.0004887397689415672, 'samples': 3109248, 'steps': 16193, 'loss/train': 2.222364902496338} 11/06/2021 23:32:33 - INFO - __main__ - Step 16195: {'lr': 0.0004887381941775541, 'samples': 3109440, 'steps': 16194, 'loss/train': 1.727850079536438} 11/06/2021 23:32:34 - INFO - __main__ - Step 16196: {'lr': 0.0004887366193059693, 'samples': 3109632, 'steps': 16195, 'loss/train': 1.849395751953125} 11/06/2021 23:32:34 - INFO - __main__ - Step 16197: {'lr': 0.0004887350443268134, 'samples': 3109824, 'steps': 16196, 'loss/train': 1.7201744318008423} 11/06/2021 23:32:34 - INFO - __main__ - Step 16198: {'lr': 0.0004887334692400872, 'samples': 3110016, 'steps': 16197, 'loss/train': 1.6349120140075684} 11/06/2021 23:32:35 - INFO - __main__ - Step 16199: {'lr': 0.0004887318940457915, 'samples': 3110208, 'steps': 16198, 'loss/train': 2.126889944076538} 11/06/2021 23:32:36 - INFO - __main__ - Step 16200: {'lr': 0.0004887303187439267, 'samples': 3110400, 'steps': 16199, 'loss/train': 1.782393455505371} 11/06/2021 23:32:36 - INFO - __main__ - Step 16201: {'lr': 0.0004887287433344939, 'samples': 3110592, 'steps': 16200, 'loss/train': 1.2824130058288574} 11/06/2021 23:32:36 - INFO - __main__ - Step 16202: {'lr': 0.0004887271678174935, 'samples': 3110784, 'steps': 16201, 'loss/train': 1.416832447052002} 11/06/2021 23:32:37 - INFO - __main__ - Step 16203: {'lr': 0.0004887255921929264, 'samples': 3110976, 'steps': 16202, 'loss/train': 1.795426368713379} 11/06/2021 23:32:37 - INFO - __main__ - Step 16204: {'lr': 0.0004887240164607931, 'samples': 3111168, 'steps': 16203, 'loss/train': 2.0454015731811523} 11/06/2021 23:32:38 - INFO - __main__ - Step 16205: {'lr': 0.0004887224406210945, 'samples': 3111360, 'steps': 16204, 'loss/train': 1.9707239866256714} 11/06/2021 23:32:38 - INFO - __main__ - Step 16206: {'lr': 0.0004887208646738312, 'samples': 3111552, 'steps': 16205, 'loss/train': 1.7705655097961426} 11/06/2021 23:32:39 - INFO - __main__ - Step 16207: {'lr': 0.000488719288619004, 'samples': 3111744, 'steps': 16206, 'loss/train': 1.1489686965942383} 11/06/2021 23:32:39 - INFO - __main__ - Step 16208: {'lr': 0.0004887177124566136, 'samples': 3111936, 'steps': 16207, 'loss/train': 1.93928861618042} 11/06/2021 23:32:40 - INFO - __main__ - Step 16209: {'lr': 0.0004887161361866607, 'samples': 3112128, 'steps': 16208, 'loss/train': 1.5540908575057983} 11/06/2021 23:32:40 - INFO - __main__ - Step 16210: {'lr': 0.000488714559809146, 'samples': 3112320, 'steps': 16209, 'loss/train': 1.17920982837677} 11/06/2021 23:32:41 - INFO - __main__ - Step 16211: {'lr': 0.0004887129833240703, 'samples': 3112512, 'steps': 16210, 'loss/train': 1.7366347312927246} 11/06/2021 23:32:41 - INFO - __main__ - Step 16212: {'lr': 0.000488711406731434, 'samples': 3112704, 'steps': 16211, 'loss/train': 2.5286495685577393} 11/06/2021 23:32:42 - INFO - __main__ - Step 16213: {'lr': 0.0004887098300312381, 'samples': 3112896, 'steps': 16212, 'loss/train': 1.7790120840072632} 11/06/2021 23:32:42 - INFO - __main__ - Step 16214: {'lr': 0.0004887082532234832, 'samples': 3113088, 'steps': 16213, 'loss/train': 1.5670133829116821} 11/06/2021 23:32:42 - INFO - __main__ - Step 16215: {'lr': 0.0004887066763081702, 'samples': 3113280, 'steps': 16214, 'loss/train': 1.74317467212677} 11/06/2021 23:32:43 - INFO - __main__ - Step 16216: {'lr': 0.0004887050992852995, 'samples': 3113472, 'steps': 16215, 'loss/train': 1.506385087966919} 11/06/2021 23:32:44 - INFO - __main__ - Step 16217: {'lr': 0.000488703522154872, 'samples': 3113664, 'steps': 16216, 'loss/train': 2.2613461017608643} 11/06/2021 23:32:44 - INFO - __main__ - Step 16218: {'lr': 0.0004887019449168884, 'samples': 3113856, 'steps': 16217, 'loss/train': 1.4380708932876587} 11/06/2021 23:32:44 - INFO - __main__ - Step 16219: {'lr': 0.0004887003675713493, 'samples': 3114048, 'steps': 16218, 'loss/train': 1.0357764959335327} 11/06/2021 23:32:45 - INFO - __main__ - Step 16220: {'lr': 0.0004886987901182556, 'samples': 3114240, 'steps': 16219, 'loss/train': 1.8827407360076904} 11/06/2021 23:32:46 - INFO - __main__ - Step 16221: {'lr': 0.0004886972125576079, 'samples': 3114432, 'steps': 16220, 'loss/train': 1.24242103099823} 11/06/2021 23:32:46 - INFO - __main__ - Step 16222: {'lr': 0.0004886956348894069, 'samples': 3114624, 'steps': 16221, 'loss/train': 1.3726708889007568} 11/06/2021 23:32:47 - INFO - __main__ - Step 16223: {'lr': 0.0004886940571136533, 'samples': 3114816, 'steps': 16222, 'loss/train': 1.7079288959503174} 11/06/2021 23:32:47 - INFO - __main__ - Step 16224: {'lr': 0.0004886924792303479, 'samples': 3115008, 'steps': 16223, 'loss/train': 1.7370092868804932} 11/06/2021 23:32:47 - INFO - __main__ - Step 16225: {'lr': 0.0004886909012394913, 'samples': 3115200, 'steps': 16224, 'loss/train': 1.8899927139282227} 11/06/2021 23:32:48 - INFO - __main__ - Step 16226: {'lr': 0.0004886893231410844, 'samples': 3115392, 'steps': 16225, 'loss/train': 2.2528281211853027} 11/06/2021 23:32:49 - INFO - __main__ - Step 16227: {'lr': 0.0004886877449351276, 'samples': 3115584, 'steps': 16226, 'loss/train': 1.8323493003845215} 11/06/2021 23:32:49 - INFO - __main__ - Step 16228: {'lr': 0.0004886861666216219, 'samples': 3115776, 'steps': 16227, 'loss/train': 1.1392507553100586} 11/06/2021 23:32:49 - INFO - __main__ - Step 16229: {'lr': 0.0004886845882005679, 'samples': 3115968, 'steps': 16228, 'loss/train': 1.4808542728424072} 11/06/2021 23:32:50 - INFO - __main__ - Step 16230: {'lr': 0.0004886830096719662, 'samples': 3116160, 'steps': 16229, 'loss/train': 2.0018463134765625} 11/06/2021 23:32:51 - INFO - __main__ - Step 16231: {'lr': 0.0004886814310358176, 'samples': 3116352, 'steps': 16230, 'loss/train': 1.9077577590942383} 11/06/2021 23:32:51 - INFO - __main__ - Step 16232: {'lr': 0.000488679852292123, 'samples': 3116544, 'steps': 16231, 'loss/train': 1.6845556497573853} 11/06/2021 23:32:52 - INFO - __main__ - Step 16233: {'lr': 0.0004886782734408828, 'samples': 3116736, 'steps': 16232, 'loss/train': 1.8097872734069824} 11/06/2021 23:32:52 - INFO - __main__ - Step 16234: {'lr': 0.0004886766944820979, 'samples': 3116928, 'steps': 16233, 'loss/train': 1.961445927619934} 11/06/2021 23:32:52 - INFO - __main__ - Step 16235: {'lr': 0.0004886751154157689, 'samples': 3117120, 'steps': 16234, 'loss/train': 1.6174712181091309} 11/06/2021 23:32:53 - INFO - __main__ - Step 16236: {'lr': 0.0004886735362418967, 'samples': 3117312, 'steps': 16235, 'loss/train': 1.9013572931289673} 11/06/2021 23:32:54 - INFO - __main__ - Step 16237: {'lr': 0.0004886719569604818, 'samples': 3117504, 'steps': 16236, 'loss/train': 1.3297522068023682} 11/06/2021 23:32:54 - INFO - __main__ - Step 16238: {'lr': 0.000488670377571525, 'samples': 3117696, 'steps': 16237, 'loss/train': 1.2586119174957275} 11/06/2021 23:32:54 - INFO - __main__ - Step 16239: {'lr': 0.0004886687980750271, 'samples': 3117888, 'steps': 16238, 'loss/train': 1.6158231496810913} 11/06/2021 23:32:55 - INFO - __main__ - Step 16240: {'lr': 0.0004886672184709886, 'samples': 3118080, 'steps': 16239, 'loss/train': 1.4461692571640015} 11/06/2021 23:32:55 - INFO - __main__ - Step 16241: {'lr': 0.0004886656387594104, 'samples': 3118272, 'steps': 16240, 'loss/train': 1.8390427827835083} 11/06/2021 23:32:56 - INFO - __main__ - Step 16242: {'lr': 0.0004886640589402932, 'samples': 3118464, 'steps': 16241, 'loss/train': 1.874056339263916} 11/06/2021 23:32:56 - INFO - __main__ - Step 16243: {'lr': 0.0004886624790136375, 'samples': 3118656, 'steps': 16242, 'loss/train': 1.866011619567871} 11/06/2021 23:32:57 - INFO - __main__ - Step 16244: {'lr': 0.0004886608989794443, 'samples': 3118848, 'steps': 16243, 'loss/train': 1.8699171543121338} 11/06/2021 23:32:57 - INFO - __main__ - Step 16245: {'lr': 0.0004886593188377142, 'samples': 3119040, 'steps': 16244, 'loss/train': 1.6724590063095093} 11/06/2021 23:32:57 - INFO - __main__ - Step 16246: {'lr': 0.0004886577385884478, 'samples': 3119232, 'steps': 16245, 'loss/train': 1.6517481803894043} 11/06/2021 23:32:59 - INFO - __main__ - Step 16247: {'lr': 0.0004886561582316458, 'samples': 3119424, 'steps': 16246, 'loss/train': 2.0058236122131348} 11/06/2021 23:32:59 - INFO - __main__ - Step 16248: {'lr': 0.0004886545777673093, 'samples': 3119616, 'steps': 16247, 'loss/train': 1.7186330556869507} 11/06/2021 23:32:59 - INFO - __main__ - Step 16249: {'lr': 0.0004886529971954385, 'samples': 3119808, 'steps': 16248, 'loss/train': 1.8039087057113647} 11/06/2021 23:33:00 - INFO - __main__ - Step 16250: {'lr': 0.0004886514165160345, 'samples': 3120000, 'steps': 16249, 'loss/train': 1.7493343353271484} 11/06/2021 23:33:00 - INFO - __main__ - Step 16251: {'lr': 0.0004886498357290979, 'samples': 3120192, 'steps': 16250, 'loss/train': 2.166086196899414} 11/06/2021 23:33:01 - INFO - __main__ - Step 16252: {'lr': 0.0004886482548346291, 'samples': 3120384, 'steps': 16251, 'loss/train': 2.0029032230377197} 11/06/2021 23:33:01 - INFO - __main__ - Step 16253: {'lr': 0.0004886466738326293, 'samples': 3120576, 'steps': 16252, 'loss/train': 1.9279859066009521} 11/06/2021 23:33:02 - INFO - __main__ - Step 16254: {'lr': 0.000488645092723099, 'samples': 3120768, 'steps': 16253, 'loss/train': 1.5185385942459106} 11/06/2021 23:33:02 - INFO - __main__ - Step 16255: {'lr': 0.0004886435115060388, 'samples': 3120960, 'steps': 16254, 'loss/train': 1.7781718969345093} 11/06/2021 23:33:02 - INFO - __main__ - Step 16256: {'lr': 0.0004886419301814495, 'samples': 3121152, 'steps': 16255, 'loss/train': 1.7876300811767578} 11/06/2021 23:33:03 - INFO - __main__ - Step 16257: {'lr': 0.0004886403487493319, 'samples': 3121344, 'steps': 16256, 'loss/train': 1.7171152830123901} 11/06/2021 23:33:04 - INFO - __main__ - Step 16258: {'lr': 0.0004886387672096866, 'samples': 3121536, 'steps': 16257, 'loss/train': 1.8690319061279297} 11/06/2021 23:33:04 - INFO - __main__ - Step 16259: {'lr': 0.0004886371855625143, 'samples': 3121728, 'steps': 16258, 'loss/train': 1.2483769655227661} 11/06/2021 23:33:04 - INFO - __main__ - Step 16260: {'lr': 0.0004886356038078159, 'samples': 3121920, 'steps': 16259, 'loss/train': 2.029406785964966} 11/06/2021 23:33:05 - INFO - __main__ - Step 16261: {'lr': 0.0004886340219455919, 'samples': 3122112, 'steps': 16260, 'loss/train': 1.9332122802734375} 11/06/2021 23:33:06 - INFO - __main__ - Step 16262: {'lr': 0.0004886324399758431, 'samples': 3122304, 'steps': 16261, 'loss/train': 1.5385150909423828} 11/06/2021 23:33:07 - INFO - __main__ - Step 16263: {'lr': 0.0004886308578985702, 'samples': 3122496, 'steps': 16262, 'loss/train': 1.4591851234436035} 11/06/2021 23:33:07 - INFO - __main__ - Step 16264: {'lr': 0.0004886292757137739, 'samples': 3122688, 'steps': 16263, 'loss/train': 1.4665359258651733} 11/06/2021 23:33:07 - INFO - __main__ - Step 16265: {'lr': 0.0004886276934214551, 'samples': 3122880, 'steps': 16264, 'loss/train': 1.3488205671310425} 11/06/2021 23:33:08 - INFO - __main__ - Step 16266: {'lr': 0.0004886261110216141, 'samples': 3123072, 'steps': 16265, 'loss/train': 1.357380986213684} 11/06/2021 23:33:08 - INFO - __main__ - Step 16267: {'lr': 0.000488624528514252, 'samples': 3123264, 'steps': 16266, 'loss/train': 1.822448968887329} 11/06/2021 23:33:08 - INFO - __main__ - Step 16268: {'lr': 0.0004886229458993693, 'samples': 3123456, 'steps': 16267, 'loss/train': 1.8266669511795044} 11/06/2021 23:33:09 - INFO - __main__ - Step 16269: {'lr': 0.0004886213631769669, 'samples': 3123648, 'steps': 16268, 'loss/train': 1.6113994121551514} 11/06/2021 23:33:10 - INFO - __main__ - Step 16270: {'lr': 0.0004886197803470453, 'samples': 3123840, 'steps': 16269, 'loss/train': 2.0622076988220215} 11/06/2021 23:33:10 - INFO - __main__ - Step 16271: {'lr': 0.0004886181974096052, 'samples': 3124032, 'steps': 16270, 'loss/train': 1.6397887468338013} 11/06/2021 23:33:10 - INFO - __main__ - Step 16272: {'lr': 0.0004886166143646476, 'samples': 3124224, 'steps': 16271, 'loss/train': 1.217058777809143} 11/06/2021 23:33:11 - INFO - __main__ - Step 16273: {'lr': 0.000488615031212173, 'samples': 3124416, 'steps': 16272, 'loss/train': 1.726121187210083} 11/06/2021 23:33:12 - INFO - __main__ - Step 16274: {'lr': 0.0004886134479521821, 'samples': 3124608, 'steps': 16273, 'loss/train': 1.4393755197525024} 11/06/2021 23:33:12 - INFO - __main__ - Step 16275: {'lr': 0.0004886118645846757, 'samples': 3124800, 'steps': 16274, 'loss/train': 2.080183506011963} 11/06/2021 23:33:13 - INFO - __main__ - Step 16276: {'lr': 0.0004886102811096544, 'samples': 3124992, 'steps': 16275, 'loss/train': 1.4109009504318237} 11/06/2021 23:33:13 - INFO - __main__ - Step 16277: {'lr': 0.0004886086975271191, 'samples': 3125184, 'steps': 16276, 'loss/train': 0.7202199101448059} 11/06/2021 23:33:13 - INFO - __main__ - Step 16278: {'lr': 0.0004886071138370704, 'samples': 3125376, 'steps': 16277, 'loss/train': 1.714578628540039} 11/06/2021 23:33:14 - INFO - __main__ - Step 16279: {'lr': 0.000488605530039509, 'samples': 3125568, 'steps': 16278, 'loss/train': 2.0787177085876465} 11/06/2021 23:33:15 - INFO - __main__ - Step 16280: {'lr': 0.0004886039461344356, 'samples': 3125760, 'steps': 16279, 'loss/train': 1.3188015222549438} 11/06/2021 23:33:15 - INFO - __main__ - Step 16281: {'lr': 0.0004886023621218509, 'samples': 3125952, 'steps': 16280, 'loss/train': 0.2149624228477478} 11/06/2021 23:33:15 - INFO - __main__ - Step 16282: {'lr': 0.0004886007780017557, 'samples': 3126144, 'steps': 16281, 'loss/train': 1.910459280014038} 11/06/2021 23:33:16 - INFO - __main__ - Step 16283: {'lr': 0.0004885991937741506, 'samples': 3126336, 'steps': 16282, 'loss/train': 1.2737128734588623} 11/06/2021 23:33:17 - INFO - __main__ - Step 16284: {'lr': 0.0004885976094390366, 'samples': 3126528, 'steps': 16283, 'loss/train': 1.7941306829452515} 11/06/2021 23:33:17 - INFO - __main__ - Step 16285: {'lr': 0.000488596024996414, 'samples': 3126720, 'steps': 16284, 'loss/train': 1.8777819871902466} 11/06/2021 23:33:18 - INFO - __main__ - Step 16286: {'lr': 0.0004885944404462838, 'samples': 3126912, 'steps': 16285, 'loss/train': 1.9789512157440186} 11/06/2021 23:33:18 - INFO - __main__ - Step 16287: {'lr': 0.0004885928557886466, 'samples': 3127104, 'steps': 16286, 'loss/train': 1.347070574760437} 11/06/2021 23:33:18 - INFO - __main__ - Step 16288: {'lr': 0.0004885912710235031, 'samples': 3127296, 'steps': 16287, 'loss/train': 1.7394129037857056} 11/06/2021 23:33:19 - INFO - __main__ - Step 16289: {'lr': 0.0004885896861508541, 'samples': 3127488, 'steps': 16288, 'loss/train': 1.1697927713394165} 11/06/2021 23:33:20 - INFO - __main__ - Step 16290: {'lr': 0.0004885881011707003, 'samples': 3127680, 'steps': 16289, 'loss/train': 1.684749960899353} 11/06/2021 23:33:20 - INFO - __main__ - Step 16291: {'lr': 0.0004885865160830422, 'samples': 3127872, 'steps': 16290, 'loss/train': 1.4211009740829468} 11/06/2021 23:33:20 - INFO - __main__ - Step 16292: {'lr': 0.0004885849308878809, 'samples': 3128064, 'steps': 16291, 'loss/train': 1.1847530603408813} 11/06/2021 23:33:21 - INFO - __main__ - Step 16293: {'lr': 0.0004885833455852169, 'samples': 3128256, 'steps': 16292, 'loss/train': 3.3935234546661377} 11/06/2021 23:33:21 - INFO - __main__ - Step 16294: {'lr': 0.0004885817601750509, 'samples': 3128448, 'steps': 16293, 'loss/train': 1.7725669145584106} 11/06/2021 23:33:22 - INFO - __main__ - Step 16295: {'lr': 0.0004885801746573836, 'samples': 3128640, 'steps': 16294, 'loss/train': 1.7294930219650269} 11/06/2021 23:33:22 - INFO - __main__ - Step 16296: {'lr': 0.0004885785890322158, 'samples': 3128832, 'steps': 16295, 'loss/train': 1.4970622062683105} 11/06/2021 23:33:23 - INFO - __main__ - Step 16297: {'lr': 0.0004885770032995482, 'samples': 3129024, 'steps': 16296, 'loss/train': 2.0539581775665283} 11/06/2021 23:33:23 - INFO - __main__ - Step 16298: {'lr': 0.0004885754174593814, 'samples': 3129216, 'steps': 16297, 'loss/train': 1.6017131805419922} 11/06/2021 23:33:24 - INFO - __main__ - Step 16299: {'lr': 0.0004885738315117162, 'samples': 3129408, 'steps': 16298, 'loss/train': 1.718274474143982} 11/06/2021 23:33:25 - INFO - __main__ - Step 16300: {'lr': 0.0004885722454565534, 'samples': 3129600, 'steps': 16299, 'loss/train': 1.6284193992614746} 11/06/2021 23:33:25 - INFO - __main__ - Step 16301: {'lr': 0.0004885706592938936, 'samples': 3129792, 'steps': 16300, 'loss/train': 1.8225423097610474} 11/06/2021 23:33:26 - INFO - __main__ - Step 16302: {'lr': 0.0004885690730237375, 'samples': 3129984, 'steps': 16301, 'loss/train': 5.086080551147461} 11/06/2021 23:33:26 - INFO - __main__ - Step 16303: {'lr': 0.0004885674866460858, 'samples': 3130176, 'steps': 16302, 'loss/train': 1.6252819299697876} 11/06/2021 23:33:26 - INFO - __main__ - Step 16304: {'lr': 0.0004885659001609393, 'samples': 3130368, 'steps': 16303, 'loss/train': 1.6549954414367676} 11/06/2021 23:33:27 - INFO - __main__ - Step 16305: {'lr': 0.0004885643135682987, 'samples': 3130560, 'steps': 16304, 'loss/train': 1.7220511436462402} 11/06/2021 23:33:28 - INFO - __main__ - Step 16306: {'lr': 0.0004885627268681648, 'samples': 3130752, 'steps': 16305, 'loss/train': 1.3783352375030518} 11/06/2021 23:33:28 - INFO - __main__ - Step 16307: {'lr': 0.0004885611400605381, 'samples': 3130944, 'steps': 16306, 'loss/train': 1.6854547262191772} 11/06/2021 23:33:28 - INFO - __main__ - Step 16308: {'lr': 0.0004885595531454195, 'samples': 3131136, 'steps': 16307, 'loss/train': 1.55535888671875} 11/06/2021 23:33:29 - INFO - __main__ - Step 16309: {'lr': 0.0004885579661228097, 'samples': 3131328, 'steps': 16308, 'loss/train': 1.7425678968429565} 11/06/2021 23:33:29 - INFO - __main__ - Step 16310: {'lr': 0.0004885563789927092, 'samples': 3131520, 'steps': 16309, 'loss/train': 1.6261141300201416} 11/06/2021 23:33:30 - INFO - __main__ - Step 16311: {'lr': 0.0004885547917551189, 'samples': 3131712, 'steps': 16310, 'loss/train': 1.6285531520843506} 11/06/2021 23:33:30 - INFO - __main__ - Step 16312: {'lr': 0.0004885532044100396, 'samples': 3131904, 'steps': 16311, 'loss/train': 2.0965306758880615} 11/06/2021 23:33:31 - INFO - __main__ - Step 16313: {'lr': 0.0004885516169574719, 'samples': 3132096, 'steps': 16312, 'loss/train': 1.7948811054229736} 11/06/2021 23:33:31 - INFO - __main__ - Step 16314: {'lr': 0.0004885500293974165, 'samples': 3132288, 'steps': 16313, 'loss/train': 1.4973307847976685} 11/06/2021 23:33:32 - INFO - __main__ - Step 16315: {'lr': 0.0004885484417298741, 'samples': 3132480, 'steps': 16314, 'loss/train': 1.1816359758377075} 11/06/2021 23:33:32 - INFO - __main__ - Step 16316: {'lr': 0.0004885468539548455, 'samples': 3132672, 'steps': 16315, 'loss/train': 1.3534947633743286} 11/06/2021 23:33:33 - INFO - __main__ - Step 16317: {'lr': 0.0004885452660723313, 'samples': 3132864, 'steps': 16316, 'loss/train': 1.4789822101593018} 11/06/2021 23:33:33 - INFO - __main__ - Step 16318: {'lr': 0.0004885436780823324, 'samples': 3133056, 'steps': 16317, 'loss/train': 1.8241922855377197} 11/06/2021 23:33:34 - INFO - __main__ - Step 16319: {'lr': 0.0004885420899848492, 'samples': 3133248, 'steps': 16318, 'loss/train': 1.9675356149673462} 11/06/2021 23:33:34 - INFO - __main__ - Step 16320: {'lr': 0.0004885405017798828, 'samples': 3133440, 'steps': 16319, 'loss/train': 2.023437023162842} 11/06/2021 23:33:34 - INFO - __main__ - Step 16321: {'lr': 0.0004885389134674337, 'samples': 3133632, 'steps': 16320, 'loss/train': 1.4244306087493896} 11/06/2021 23:33:35 - INFO - __main__ - Step 16322: {'lr': 0.0004885373250475026, 'samples': 3133824, 'steps': 16321, 'loss/train': 1.2528997659683228} 11/06/2021 23:33:36 - INFO - __main__ - Step 16323: {'lr': 0.0004885357365200903, 'samples': 3134016, 'steps': 16322, 'loss/train': 1.5098944902420044} 11/06/2021 23:33:36 - INFO - __main__ - Step 16324: {'lr': 0.0004885341478851975, 'samples': 3134208, 'steps': 16323, 'loss/train': 1.332453966140747} 11/06/2021 23:33:37 - INFO - __main__ - Step 16325: {'lr': 0.0004885325591428248, 'samples': 3134400, 'steps': 16324, 'loss/train': 0.9680758118629456} 11/06/2021 23:33:37 - INFO - __main__ - Step 16326: {'lr': 0.0004885309702929731, 'samples': 3134592, 'steps': 16325, 'loss/train': 2.230714797973633} 11/06/2021 23:33:38 - INFO - __main__ - Step 16327: {'lr': 0.000488529381335643, 'samples': 3134784, 'steps': 16326, 'loss/train': 1.5488425493240356} 11/06/2021 23:33:38 - INFO - __main__ - Step 16328: {'lr': 0.0004885277922708352, 'samples': 3134976, 'steps': 16327, 'loss/train': 1.5818017721176147} 11/06/2021 23:33:39 - INFO - __main__ - Step 16329: {'lr': 0.0004885262030985504, 'samples': 3135168, 'steps': 16328, 'loss/train': 1.6726022958755493} 11/06/2021 23:33:39 - INFO - __main__ - Step 16330: {'lr': 0.0004885246138187896, 'samples': 3135360, 'steps': 16329, 'loss/train': 1.5830050706863403} 11/06/2021 23:33:39 - INFO - __main__ - Step 16331: {'lr': 0.0004885230244315531, 'samples': 3135552, 'steps': 16330, 'loss/train': 1.6243305206298828} 11/06/2021 23:33:40 - INFO - __main__ - Step 16332: {'lr': 0.0004885214349368419, 'samples': 3135744, 'steps': 16331, 'loss/train': 1.477682113647461} 11/06/2021 23:33:41 - INFO - __main__ - Step 16333: {'lr': 0.0004885198453346565, 'samples': 3135936, 'steps': 16332, 'loss/train': 1.5040385723114014} 11/06/2021 23:33:41 - INFO - __main__ - Step 16334: {'lr': 0.0004885182556249978, 'samples': 3136128, 'steps': 16333, 'loss/train': 1.872546672821045} 11/06/2021 23:33:41 - INFO - __main__ - Step 16335: {'lr': 0.0004885166658078666, 'samples': 3136320, 'steps': 16334, 'loss/train': 1.3030028343200684} 11/06/2021 23:33:42 - INFO - __main__ - Step 16336: {'lr': 0.0004885150758832632, 'samples': 3136512, 'steps': 16335, 'loss/train': 1.7622499465942383} 11/06/2021 23:33:43 - INFO - __main__ - Step 16337: {'lr': 0.0004885134858511888, 'samples': 3136704, 'steps': 16336, 'loss/train': 1.5407360792160034} 11/06/2021 23:33:43 - INFO - __main__ - Step 16338: {'lr': 0.0004885118957116438, 'samples': 3136896, 'steps': 16337, 'loss/train': 1.1537578105926514} 11/06/2021 23:33:44 - INFO - __main__ - Step 16339: {'lr': 0.000488510305464629, 'samples': 3137088, 'steps': 16338, 'loss/train': 1.9556260108947754} 11/06/2021 23:33:44 - INFO - __main__ - Step 16340: {'lr': 0.0004885087151101453, 'samples': 3137280, 'steps': 16339, 'loss/train': 1.7774170637130737} 11/06/2021 23:33:44 - INFO - __main__ - Step 16341: {'lr': 0.0004885071246481931, 'samples': 3137472, 'steps': 16340, 'loss/train': 1.5609866380691528} 11/06/2021 23:33:45 - INFO - __main__ - Step 16342: {'lr': 0.0004885055340787733, 'samples': 3137664, 'steps': 16341, 'loss/train': 2.0309407711029053} 11/06/2021 23:33:46 - INFO - __main__ - Step 16343: {'lr': 0.0004885039434018866, 'samples': 3137856, 'steps': 16342, 'loss/train': 1.1930187940597534} 11/06/2021 23:33:46 - INFO - __main__ - Step 16344: {'lr': 0.0004885023526175337, 'samples': 3138048, 'steps': 16343, 'loss/train': 1.741970181465149} 11/06/2021 23:33:46 - INFO - __main__ - Step 16345: {'lr': 0.0004885007617257154, 'samples': 3138240, 'steps': 16344, 'loss/train': 1.7933660745620728} 11/06/2021 23:33:47 - INFO - __main__ - Step 16346: {'lr': 0.0004884991707264322, 'samples': 3138432, 'steps': 16345, 'loss/train': 0.6066716909408569} 11/06/2021 23:33:47 - INFO - __main__ - Step 16347: {'lr': 0.000488497579619685, 'samples': 3138624, 'steps': 16346, 'loss/train': 1.3735432624816895} 11/06/2021 23:33:48 - INFO - __main__ - Step 16348: {'lr': 0.0004884959884054745, 'samples': 3138816, 'steps': 16347, 'loss/train': 1.762027621269226} 11/06/2021 23:33:48 - INFO - __main__ - Step 16349: {'lr': 0.0004884943970838014, 'samples': 3139008, 'steps': 16348, 'loss/train': 1.6400388479232788} 11/06/2021 23:33:49 - INFO - __main__ - Step 16350: {'lr': 0.0004884928056546663, 'samples': 3139200, 'steps': 16349, 'loss/train': 1.1514432430267334} 11/06/2021 23:33:49 - INFO - __main__ - Step 16351: {'lr': 0.0004884912141180701, 'samples': 3139392, 'steps': 16350, 'loss/train': 1.8475518226623535} 11/06/2021 23:33:49 - INFO - __main__ - Step 16352: {'lr': 0.0004884896224740136, 'samples': 3139584, 'steps': 16351, 'loss/train': 2.076098918914795} 11/06/2021 23:33:51 - INFO - __main__ - Step 16353: {'lr': 0.0004884880307224972, 'samples': 3139776, 'steps': 16352, 'loss/train': 1.0980122089385986} 11/06/2021 23:33:52 - INFO - __main__ - Step 16354: {'lr': 0.0004884864388635217, 'samples': 3139968, 'steps': 16353, 'loss/train': 2.0552711486816406} 11/06/2021 23:33:52 - INFO - __main__ - Step 16355: {'lr': 0.0004884848468970879, 'samples': 3140160, 'steps': 16354, 'loss/train': 1.639860987663269} 11/06/2021 23:33:52 - INFO - __main__ - Step 16356: {'lr': 0.0004884832548231966, 'samples': 3140352, 'steps': 16355, 'loss/train': 1.732427716255188} 11/06/2021 23:33:53 - INFO - __main__ - Step 16357: {'lr': 0.0004884816626418484, 'samples': 3140544, 'steps': 16356, 'loss/train': 1.1739200353622437} 11/06/2021 23:33:53 - INFO - __main__ - Step 16358: {'lr': 0.000488480070353044, 'samples': 3140736, 'steps': 16357, 'loss/train': 1.8249778747558594} 11/06/2021 23:33:53 - INFO - __main__ - Step 16359: {'lr': 0.0004884784779567843, 'samples': 3140928, 'steps': 16358, 'loss/train': 1.5144319534301758} 11/06/2021 23:33:55 - INFO - __main__ - Step 16360: {'lr': 0.0004884768854530696, 'samples': 3141120, 'steps': 16359, 'loss/train': 1.685762882232666} 11/06/2021 23:33:55 - INFO - __main__ - Step 16361: {'lr': 0.0004884752928419012, 'samples': 3141312, 'steps': 16360, 'loss/train': 1.7363823652267456} 11/06/2021 23:33:55 - INFO - __main__ - Step 16362: {'lr': 0.0004884737001232793, 'samples': 3141504, 'steps': 16361, 'loss/train': 1.8232380151748657} 11/06/2021 23:33:56 - INFO - __main__ - Step 16363: {'lr': 0.000488472107297205, 'samples': 3141696, 'steps': 16362, 'loss/train': 1.5497318506240845} 11/06/2021 23:33:56 - INFO - __main__ - Step 16364: {'lr': 0.0004884705143636788, 'samples': 3141888, 'steps': 16363, 'loss/train': 0.7053847312927246} 11/06/2021 23:33:57 - INFO - __main__ - Step 16365: {'lr': 0.0004884689213227013, 'samples': 3142080, 'steps': 16364, 'loss/train': 1.7800610065460205} 11/06/2021 23:33:57 - INFO - __main__ - Step 16366: {'lr': 0.0004884673281742736, 'samples': 3142272, 'steps': 16365, 'loss/train': 1.5720760822296143} 11/06/2021 23:33:58 - INFO - __main__ - Step 16367: {'lr': 0.0004884657349183961, 'samples': 3142464, 'steps': 16366, 'loss/train': 1.697306513786316} 11/06/2021 23:33:58 - INFO - __main__ - Step 16368: {'lr': 0.0004884641415550696, 'samples': 3142656, 'steps': 16367, 'loss/train': 1.6368969678878784} 11/06/2021 23:33:58 - INFO - __main__ - Step 16369: {'lr': 0.0004884625480842949, 'samples': 3142848, 'steps': 16368, 'loss/train': 1.6560715436935425} 11/06/2021 23:34:00 - INFO - __main__ - Step 16370: {'lr': 0.0004884609545060726, 'samples': 3143040, 'steps': 16369, 'loss/train': 1.2425155639648438} 11/06/2021 23:34:00 - INFO - __main__ - Step 16371: {'lr': 0.0004884593608204035, 'samples': 3143232, 'steps': 16370, 'loss/train': 1.0383766889572144} 11/06/2021 23:34:00 - INFO - __main__ - Step 16372: {'lr': 0.0004884577670272882, 'samples': 3143424, 'steps': 16371, 'loss/train': 1.6095656156539917} 11/06/2021 23:34:01 - INFO - __main__ - Step 16373: {'lr': 0.0004884561731267278, 'samples': 3143616, 'steps': 16372, 'loss/train': 2.9992105960845947} 11/06/2021 23:34:01 - INFO - __main__ - Step 16374: {'lr': 0.0004884545791187224, 'samples': 3143808, 'steps': 16373, 'loss/train': 1.24136483669281} 11/06/2021 23:34:02 - INFO - __main__ - Step 16375: {'lr': 0.0004884529850032732, 'samples': 3144000, 'steps': 16374, 'loss/train': 0.2827925384044647} 11/06/2021 23:34:03 - INFO - __main__ - Step 16376: {'lr': 0.0004884513907803808, 'samples': 3144192, 'steps': 16375, 'loss/train': 2.1391053199768066} 11/06/2021 23:34:03 - INFO - __main__ - Step 16377: {'lr': 0.0004884497964500457, 'samples': 3144384, 'steps': 16376, 'loss/train': 1.8426679372787476} 11/06/2021 23:34:03 - INFO - __main__ - Step 16378: {'lr': 0.000488448202012269, 'samples': 3144576, 'steps': 16377, 'loss/train': 1.9780805110931396} 11/06/2021 23:34:04 - INFO - __main__ - Step 16379: {'lr': 0.0004884466074670512, 'samples': 3144768, 'steps': 16378, 'loss/train': 1.9465229511260986} 11/06/2021 23:34:05 - INFO - __main__ - Step 16380: {'lr': 0.0004884450128143929, 'samples': 3144960, 'steps': 16379, 'loss/train': 1.7582536935806274} 11/06/2021 23:34:05 - INFO - __main__ - Step 16381: {'lr': 0.000488443418054295, 'samples': 3145152, 'steps': 16380, 'loss/train': 1.4283581972122192} 11/06/2021 23:34:06 - INFO - __main__ - Step 16382: {'lr': 0.0004884418231867583, 'samples': 3145344, 'steps': 16381, 'loss/train': 1.8179876804351807} 11/06/2021 23:34:06 - INFO - __main__ - Step 16383: {'lr': 0.0004884402282117833, 'samples': 3145536, 'steps': 16382, 'loss/train': 0.8137797117233276} 11/06/2021 23:34:06 - INFO - __main__ - Step 16384: {'lr': 0.0004884386331293708, 'samples': 3145728, 'steps': 16383, 'loss/train': 2.007157564163208} 11/06/2021 23:34:07 - INFO - __main__ - Step 16385: {'lr': 0.0004884370379395215, 'samples': 3145920, 'steps': 16384, 'loss/train': 1.2791186571121216} 11/06/2021 23:34:08 - INFO - __main__ - Step 16386: {'lr': 0.0004884354426422363, 'samples': 3146112, 'steps': 16385, 'loss/train': 2.190890073776245} 11/06/2021 23:34:08 - INFO - __main__ - Step 16387: {'lr': 0.0004884338472375156, 'samples': 3146304, 'steps': 16386, 'loss/train': 1.5550997257232666} 11/06/2021 23:34:08 - INFO - __main__ - Step 16388: {'lr': 0.0004884322517253604, 'samples': 3146496, 'steps': 16387, 'loss/train': 1.888465404510498} 11/06/2021 23:34:09 - INFO - __main__ - Step 16389: {'lr': 0.0004884306561057713, 'samples': 3146688, 'steps': 16388, 'loss/train': 1.8606306314468384} 11/06/2021 23:34:09 - INFO - __main__ - Step 16390: {'lr': 0.000488429060378749, 'samples': 3146880, 'steps': 16389, 'loss/train': 1.8474323749542236} 11/06/2021 23:34:09 - INFO - __main__ - Step 16391: {'lr': 0.0004884274645442942, 'samples': 3147072, 'steps': 16390, 'loss/train': 1.6243537664413452} 11/06/2021 23:34:10 - INFO - __main__ - Step 16392: {'lr': 0.0004884258686024077, 'samples': 3147264, 'steps': 16391, 'loss/train': 2.085355758666992} 11/06/2021 23:34:11 - INFO - __main__ - Step 16393: {'lr': 0.0004884242725530902, 'samples': 3147456, 'steps': 16392, 'loss/train': 1.4458762407302856} 11/06/2021 23:34:11 - INFO - __main__ - Step 16394: {'lr': 0.0004884226763963423, 'samples': 3147648, 'steps': 16393, 'loss/train': 1.4372462034225464} 11/06/2021 23:34:12 - INFO - __main__ - Step 16395: {'lr': 0.000488421080132165, 'samples': 3147840, 'steps': 16394, 'loss/train': 1.3019970655441284} 11/06/2021 23:34:12 - INFO - __main__ - Step 16396: {'lr': 0.0004884194837605587, 'samples': 3148032, 'steps': 16395, 'loss/train': 1.6716073751449585} 11/06/2021 23:34:13 - INFO - __main__ - Step 16397: {'lr': 0.0004884178872815243, 'samples': 3148224, 'steps': 16396, 'loss/train': 1.440213680267334} 11/06/2021 23:34:13 - INFO - __main__ - Step 16398: {'lr': 0.0004884162906950624, 'samples': 3148416, 'steps': 16397, 'loss/train': 1.1032469272613525} 11/06/2021 23:34:14 - INFO - __main__ - Step 16399: {'lr': 0.000488414694001174, 'samples': 3148608, 'steps': 16398, 'loss/train': 1.7716431617736816} 11/06/2021 23:34:14 - INFO - __main__ - Step 16400: {'lr': 0.0004884130971998595, 'samples': 3148800, 'steps': 16399, 'loss/train': 1.464718222618103} 11/06/2021 23:34:14 - INFO - __main__ - Step 16401: {'lr': 0.0004884115002911197, 'samples': 3148992, 'steps': 16400, 'loss/train': 2.2289278507232666} 11/06/2021 23:34:15 - INFO - __main__ - Step 16402: {'lr': 0.0004884099032749554, 'samples': 3149184, 'steps': 16401, 'loss/train': 1.3836721181869507} 11/06/2021 23:34:16 - INFO - __main__ - Step 16403: {'lr': 0.0004884083061513672, 'samples': 3149376, 'steps': 16402, 'loss/train': 1.9239706993103027} 11/06/2021 23:34:16 - INFO - __main__ - Step 16404: {'lr': 0.0004884067089203559, 'samples': 3149568, 'steps': 16403, 'loss/train': 1.8463945388793945} 11/06/2021 23:34:16 - INFO - __main__ - Step 16405: {'lr': 0.0004884051115819224, 'samples': 3149760, 'steps': 16404, 'loss/train': 2.2028331756591797} 11/06/2021 23:34:17 - INFO - __main__ - Step 16406: {'lr': 0.000488403514136067, 'samples': 3149952, 'steps': 16405, 'loss/train': 1.876776099205017} 11/06/2021 23:34:18 - INFO - __main__ - Step 16407: {'lr': 0.0004884019165827909, 'samples': 3150144, 'steps': 16406, 'loss/train': 2.1374125480651855} 11/06/2021 23:34:18 - INFO - __main__ - Step 16408: {'lr': 0.0004884003189220945, 'samples': 3150336, 'steps': 16407, 'loss/train': 1.7482624053955078} 11/06/2021 23:34:19 - INFO - __main__ - Step 16409: {'lr': 0.0004883987211539785, 'samples': 3150528, 'steps': 16408, 'loss/train': 1.7053028345108032} 11/06/2021 23:34:19 - INFO - __main__ - Step 16410: {'lr': 0.0004883971232784438, 'samples': 3150720, 'steps': 16409, 'loss/train': 1.538672685623169} 11/06/2021 23:34:19 - INFO - __main__ - Step 16411: {'lr': 0.0004883955252954909, 'samples': 3150912, 'steps': 16410, 'loss/train': 1.7705274820327759} 11/06/2021 23:34:20 - INFO - __main__ - Step 16412: {'lr': 0.0004883939272051208, 'samples': 3151104, 'steps': 16411, 'loss/train': 0.9273844957351685} 11/06/2021 23:34:21 - INFO - __main__ - Step 16413: {'lr': 0.000488392329007334, 'samples': 3151296, 'steps': 16412, 'loss/train': 2.2184743881225586} 11/06/2021 23:34:21 - INFO - __main__ - Step 16414: {'lr': 0.0004883907307021314, 'samples': 3151488, 'steps': 16413, 'loss/train': 1.27170729637146} 11/06/2021 23:34:21 - INFO - __main__ - Step 16415: {'lr': 0.0004883891322895134, 'samples': 3151680, 'steps': 16414, 'loss/train': 1.9354814291000366} 11/06/2021 23:34:22 - INFO - __main__ - Step 16416: {'lr': 0.000488387533769481, 'samples': 3151872, 'steps': 16415, 'loss/train': 1.3187230825424194} 11/06/2021 23:34:22 - INFO - __main__ - Step 16417: {'lr': 0.000488385935142035, 'samples': 3152064, 'steps': 16416, 'loss/train': 1.2268887758255005} 11/06/2021 23:34:23 - INFO - __main__ - Step 16418: {'lr': 0.0004883843364071759, 'samples': 3152256, 'steps': 16417, 'loss/train': 1.435240387916565} 11/06/2021 23:34:23 - INFO - __main__ - Step 16419: {'lr': 0.0004883827375649045, 'samples': 3152448, 'steps': 16418, 'loss/train': 1.6669983863830566} 11/06/2021 23:34:24 - INFO - __main__ - Step 16420: {'lr': 0.0004883811386152216, 'samples': 3152640, 'steps': 16419, 'loss/train': 1.5936235189437866} 11/06/2021 23:34:24 - INFO - __main__ - Step 16421: {'lr': 0.0004883795395581277, 'samples': 3152832, 'steps': 16420, 'loss/train': 1.746578574180603} 11/06/2021 23:34:24 - INFO - __main__ - Step 16422: {'lr': 0.0004883779403936237, 'samples': 3153024, 'steps': 16421, 'loss/train': 2.0167765617370605} 11/06/2021 23:34:26 - INFO - __main__ - Step 16423: {'lr': 0.0004883763411217103, 'samples': 3153216, 'steps': 16422, 'loss/train': 1.6873087882995605} 11/06/2021 23:34:26 - INFO - __main__ - Step 16424: {'lr': 0.0004883747417423882, 'samples': 3153408, 'steps': 16423, 'loss/train': 1.9148794412612915} 11/06/2021 23:34:27 - INFO - __main__ - Step 16425: {'lr': 0.000488373142255658, 'samples': 3153600, 'steps': 16424, 'loss/train': 1.9186967611312866} 11/06/2021 23:34:27 - INFO - __main__ - Step 16426: {'lr': 0.0004883715426615207, 'samples': 3153792, 'steps': 16425, 'loss/train': 1.168741226196289} 11/06/2021 23:34:27 - INFO - __main__ - Step 16427: {'lr': 0.0004883699429599768, 'samples': 3153984, 'steps': 16426, 'loss/train': 1.5318204164505005} 11/06/2021 23:34:28 - INFO - __main__ - Step 16428: {'lr': 0.0004883683431510272, 'samples': 3154176, 'steps': 16427, 'loss/train': 0.9183443784713745} 11/06/2021 23:34:29 - INFO - __main__ - Step 16429: {'lr': 0.0004883667432346723, 'samples': 3154368, 'steps': 16428, 'loss/train': 1.2394202947616577} 11/06/2021 23:34:29 - INFO - __main__ - Step 16430: {'lr': 0.0004883651432109132, 'samples': 3154560, 'steps': 16429, 'loss/train': 1.7920042276382446} 11/06/2021 23:34:29 - INFO - __main__ - Step 16431: {'lr': 0.0004883635430797502, 'samples': 3154752, 'steps': 16430, 'loss/train': 1.7659144401550293} 11/06/2021 23:34:30 - INFO - __main__ - Step 16432: {'lr': 0.0004883619428411846, 'samples': 3154944, 'steps': 16431, 'loss/train': 2.057718276977539} 11/06/2021 23:34:30 - INFO - __main__ - Step 16433: {'lr': 0.0004883603424952165, 'samples': 3155136, 'steps': 16432, 'loss/train': 1.8419935703277588} 11/06/2021 23:34:31 - INFO - __main__ - Step 16434: {'lr': 0.0004883587420418471, 'samples': 3155328, 'steps': 16433, 'loss/train': 1.7195664644241333} 11/06/2021 23:34:32 - INFO - __main__ - Step 16435: {'lr': 0.0004883571414810769, 'samples': 3155520, 'steps': 16434, 'loss/train': 1.438564658164978} 11/06/2021 23:34:32 - INFO - __main__ - Step 16436: {'lr': 0.0004883555408129066, 'samples': 3155712, 'steps': 16435, 'loss/train': 1.904558777809143} 11/06/2021 23:34:32 - INFO - __main__ - Step 16437: {'lr': 0.0004883539400373369, 'samples': 3155904, 'steps': 16436, 'loss/train': 1.708992600440979} 11/06/2021 23:34:33 - INFO - __main__ - Step 16438: {'lr': 0.0004883523391543687, 'samples': 3156096, 'steps': 16437, 'loss/train': 2.0826616287231445} 11/06/2021 23:34:34 - INFO - __main__ - Step 16439: {'lr': 0.0004883507381640026, 'samples': 3156288, 'steps': 16438, 'loss/train': 1.71291184425354} 11/06/2021 23:34:34 - INFO - __main__ - Step 16440: {'lr': 0.0004883491370662393, 'samples': 3156480, 'steps': 16439, 'loss/train': 2.3351950645446777} 11/06/2021 23:34:34 - INFO - __main__ - Step 16441: {'lr': 0.0004883475358610794, 'samples': 3156672, 'steps': 16440, 'loss/train': 1.700568437576294} 11/06/2021 23:34:35 - INFO - __main__ - Step 16442: {'lr': 0.000488345934548524, 'samples': 3156864, 'steps': 16441, 'loss/train': 1.2319788932800293} 11/06/2021 23:34:35 - INFO - __main__ - Step 16443: {'lr': 0.0004883443331285736, 'samples': 3157056, 'steps': 16442, 'loss/train': 1.9203612804412842} 11/06/2021 23:34:36 - INFO - __main__ - Step 16444: {'lr': 0.0004883427316012289, 'samples': 3157248, 'steps': 16443, 'loss/train': 1.382559061050415} 11/06/2021 23:34:36 - INFO - __main__ - Step 16445: {'lr': 0.0004883411299664906, 'samples': 3157440, 'steps': 16444, 'loss/train': 1.7996220588684082} 11/06/2021 23:34:37 - INFO - __main__ - Step 16446: {'lr': 0.0004883395282243595, 'samples': 3157632, 'steps': 16445, 'loss/train': 1.54921293258667} 11/06/2021 23:34:37 - INFO - __main__ - Step 16447: {'lr': 0.0004883379263748363, 'samples': 3157824, 'steps': 16446, 'loss/train': 1.4112991094589233} 11/06/2021 23:34:37 - INFO - __main__ - Step 16448: {'lr': 0.0004883363244179217, 'samples': 3158016, 'steps': 16447, 'loss/train': 2.3577237129211426} 11/06/2021 23:34:38 - INFO - __main__ - Step 16449: {'lr': 0.0004883347223536164, 'samples': 3158208, 'steps': 16448, 'loss/train': 1.258867859840393} 11/06/2021 23:34:39 - INFO - __main__ - Step 16450: {'lr': 0.0004883331201819211, 'samples': 3158400, 'steps': 16449, 'loss/train': 2.188420057296753} 11/06/2021 23:34:39 - INFO - __main__ - Step 16451: {'lr': 0.0004883315179028366, 'samples': 3158592, 'steps': 16450, 'loss/train': 1.7778352499008179} 11/06/2021 23:34:39 - INFO - __main__ - Step 16452: {'lr': 0.0004883299155163636, 'samples': 3158784, 'steps': 16451, 'loss/train': 1.679057002067566} 11/06/2021 23:34:40 - INFO - __main__ - Step 16453: {'lr': 0.0004883283130225029, 'samples': 3158976, 'steps': 16452, 'loss/train': 1.8446472883224487} 11/06/2021 23:34:40 - INFO - __main__ - Step 16454: {'lr': 0.0004883267104212551, 'samples': 3159168, 'steps': 16453, 'loss/train': 0.8045353889465332} 11/06/2021 23:34:42 - INFO - __main__ - Step 16455: {'lr': 0.0004883251077126209, 'samples': 3159360, 'steps': 16454, 'loss/train': 1.5657854080200195} 11/06/2021 23:34:42 - INFO - __main__ - Step 16456: {'lr': 0.0004883235048966011, 'samples': 3159552, 'steps': 16455, 'loss/train': 0.303096204996109} 11/06/2021 23:34:43 - INFO - __main__ - Step 16457: {'lr': 0.0004883219019731964, 'samples': 3159744, 'steps': 16456, 'loss/train': 1.406226634979248} 11/06/2021 23:34:43 - INFO - __main__ - Step 16458: {'lr': 0.0004883202989424076, 'samples': 3159936, 'steps': 16457, 'loss/train': 5.184366226196289} 11/06/2021 23:34:43 - INFO - __main__ - Step 16459: {'lr': 0.0004883186958042354, 'samples': 3160128, 'steps': 16458, 'loss/train': 8.043232917785645} 11/06/2021 23:34:44 - INFO - __main__ - Step 16460: {'lr': 0.0004883170925586804, 'samples': 3160320, 'steps': 16459, 'loss/train': 8.05807876586914} 11/06/2021 23:34:44 - INFO - __main__ - Step 16461: {'lr': 0.0004883154892057433, 'samples': 3160512, 'steps': 16460, 'loss/train': 1.4330828189849854} 11/06/2021 23:34:45 - INFO - __main__ - Step 16462: {'lr': 0.000488313885745425, 'samples': 3160704, 'steps': 16461, 'loss/train': 1.8377580642700195} 11/06/2021 23:34:46 - INFO - __main__ - Step 16463: {'lr': 0.0004883122821777261, 'samples': 3160896, 'steps': 16462, 'loss/train': 1.6536674499511719} 11/06/2021 23:34:46 - INFO - __main__ - Step 16464: {'lr': 0.0004883106785026475, 'samples': 3161088, 'steps': 16463, 'loss/train': 1.4309262037277222} 11/06/2021 23:34:46 - INFO - __main__ - Step 16465: {'lr': 0.0004883090747201897, 'samples': 3161280, 'steps': 16464, 'loss/train': 1.9212692975997925} 11/06/2021 23:34:47 - INFO - __main__ - Step 16466: {'lr': 0.0004883074708303534, 'samples': 3161472, 'steps': 16465, 'loss/train': 8.428525924682617} 11/06/2021 23:34:47 - INFO - __main__ - Step 16467: {'lr': 0.0004883058668331396, 'samples': 3161664, 'steps': 16466, 'loss/train': 1.6166472434997559} 11/06/2021 23:34:48 - INFO - __main__ - Step 16468: {'lr': 0.0004883042627285488, 'samples': 3161856, 'steps': 16467, 'loss/train': 1.0819371938705444} 11/06/2021 23:34:48 - INFO - __main__ - Step 16469: {'lr': 0.0004883026585165817, 'samples': 3162048, 'steps': 16468, 'loss/train': 2.0714237689971924} 11/06/2021 23:34:49 - INFO - __main__ - Step 16470: {'lr': 0.0004883010541972392, 'samples': 3162240, 'steps': 16469, 'loss/train': 1.4929072856903076} 11/06/2021 23:34:49 - INFO - __main__ - Step 16471: {'lr': 0.0004882994497705219, 'samples': 3162432, 'steps': 16470, 'loss/train': 2.0204741954803467} 11/06/2021 23:34:49 - INFO - __main__ - Step 16472: {'lr': 0.0004882978452364305, 'samples': 3162624, 'steps': 16471, 'loss/train': 2.0556139945983887} 11/06/2021 23:34:51 - INFO - __main__ - Step 16473: {'lr': 0.0004882962405949658, 'samples': 3162816, 'steps': 16472, 'loss/train': 1.3556331396102905} 11/06/2021 23:34:51 - INFO - __main__ - Step 16474: {'lr': 0.0004882946358461285, 'samples': 3163008, 'steps': 16473, 'loss/train': 2.0677337646484375} 11/06/2021 23:34:51 - INFO - __main__ - Step 16475: {'lr': 0.0004882930309899192, 'samples': 3163200, 'steps': 16474, 'loss/train': 2.228142738342285} 11/06/2021 23:34:52 - INFO - __main__ - Step 16476: {'lr': 0.000488291426026339, 'samples': 3163392, 'steps': 16475, 'loss/train': 1.9927802085876465} 11/06/2021 23:34:52 - INFO - __main__ - Step 16477: {'lr': 0.0004882898209553881, 'samples': 3163584, 'steps': 16476, 'loss/train': 2.140395164489746} 11/06/2021 23:34:52 - INFO - __main__ - Step 16478: {'lr': 0.0004882882157770676, 'samples': 3163776, 'steps': 16477, 'loss/train': 1.4509589672088623} 11/06/2021 23:34:53 - INFO - __main__ - Step 16479: {'lr': 0.000488286610491378, 'samples': 3163968, 'steps': 16478, 'loss/train': 1.8325351476669312} 11/06/2021 23:34:54 - INFO - __main__ - Step 16480: {'lr': 0.0004882850050983203, 'samples': 3164160, 'steps': 16479, 'loss/train': 1.5992738008499146} 11/06/2021 23:34:54 - INFO - __main__ - Step 16481: {'lr': 0.0004882833995978949, 'samples': 3164352, 'steps': 16480, 'loss/train': 1.4408437013626099} 11/06/2021 23:34:54 - INFO - __main__ - Step 16482: {'lr': 0.0004882817939901027, 'samples': 3164544, 'steps': 16481, 'loss/train': 1.7016760110855103} 11/06/2021 23:34:55 - INFO - __main__ - Step 16483: {'lr': 0.0004882801882749445, 'samples': 3164736, 'steps': 16482, 'loss/train': 1.6935230493545532} 11/06/2021 23:34:56 - INFO - __main__ - Step 16484: {'lr': 0.0004882785824524209, 'samples': 3164928, 'steps': 16483, 'loss/train': 1.9659548997879028} 11/06/2021 23:34:56 - INFO - __main__ - Step 16485: {'lr': 0.0004882769765225326, 'samples': 3165120, 'steps': 16484, 'loss/train': 1.4633811712265015} 11/06/2021 23:34:57 - INFO - __main__ - Step 16486: {'lr': 0.00048827537048528035, 'samples': 3165312, 'steps': 16485, 'loss/train': 1.8877742290496826} 11/06/2021 23:34:57 - INFO - __main__ - Step 16487: {'lr': 0.00048827376434066493, 'samples': 3165504, 'steps': 16486, 'loss/train': 1.910869836807251} 11/06/2021 23:34:57 - INFO - __main__ - Step 16488: {'lr': 0.0004882721580886871, 'samples': 3165696, 'steps': 16487, 'loss/train': 1.7500070333480835} 11/06/2021 23:34:58 - INFO - __main__ - Step 16489: {'lr': 0.00048827055172934744, 'samples': 3165888, 'steps': 16488, 'loss/train': 1.7136714458465576} 11/06/2021 23:34:59 - INFO - __main__ - Step 16490: {'lr': 0.0004882689452626468, 'samples': 3166080, 'steps': 16489, 'loss/train': 1.9867051839828491} 11/06/2021 23:34:59 - INFO - __main__ - Step 16491: {'lr': 0.00048826733868858577, 'samples': 3166272, 'steps': 16490, 'loss/train': 0.20762591063976288} 11/06/2021 23:34:59 - INFO - __main__ - Step 16492: {'lr': 0.00048826573200716516, 'samples': 3166464, 'steps': 16491, 'loss/train': 1.4593125581741333} 11/06/2021 23:35:00 - INFO - __main__ - Step 16493: {'lr': 0.0004882641252183857, 'samples': 3166656, 'steps': 16492, 'loss/train': 1.8210448026657104} 11/06/2021 23:35:00 - INFO - __main__ - Step 16494: {'lr': 0.0004882625183222481, 'samples': 3166848, 'steps': 16493, 'loss/train': 1.7978031635284424} 11/06/2021 23:35:01 - INFO - __main__ - Step 16495: {'lr': 0.00048826091131875317, 'samples': 3167040, 'steps': 16494, 'loss/train': 1.5826845169067383} 11/06/2021 23:35:02 - INFO - __main__ - Step 16496: {'lr': 0.00048825930420790144, 'samples': 3167232, 'steps': 16495, 'loss/train': 1.638830304145813} 11/06/2021 23:35:02 - INFO - __main__ - Step 16497: {'lr': 0.0004882576969896938, 'samples': 3167424, 'steps': 16496, 'loss/train': 1.9118237495422363} 11/06/2021 23:35:02 - INFO - __main__ - Step 16498: {'lr': 0.00048825608966413095, 'samples': 3167616, 'steps': 16497, 'loss/train': 1.3743228912353516} 11/06/2021 23:35:03 - INFO - __main__ - Step 16499: {'lr': 0.0004882544822312135, 'samples': 3167808, 'steps': 16498, 'loss/train': 1.554957628250122} 11/06/2021 23:35:04 - INFO - __main__ - Step 16500: {'lr': 0.00048825287469094224, 'samples': 3168000, 'steps': 16499, 'loss/train': 2.003634452819824} 11/06/2021 23:35:04 - INFO - __main__ - Step 16501: {'lr': 0.000488251267043318, 'samples': 3168192, 'steps': 16500, 'loss/train': 1.7607531547546387} 11/06/2021 23:35:04 - INFO - __main__ - Step 16502: {'lr': 0.00048824965928834143, 'samples': 3168384, 'steps': 16501, 'loss/train': 1.72561776638031} 11/06/2021 23:35:05 - INFO - __main__ - Step 16503: {'lr': 0.0004882480514260131, 'samples': 3168576, 'steps': 16502, 'loss/train': 1.7174887657165527} 11/06/2021 23:35:05 - INFO - __main__ - Step 16504: {'lr': 0.000488246443456334, 'samples': 3168768, 'steps': 16503, 'loss/train': 1.427158236503601} 11/06/2021 23:35:06 - INFO - __main__ - Step 16505: {'lr': 0.0004882448353793048, 'samples': 3168960, 'steps': 16504, 'loss/train': 1.661441445350647} 11/06/2021 23:35:07 - INFO - __main__ - Step 16506: {'lr': 0.000488243227194926, 'samples': 3169152, 'steps': 16505, 'loss/train': 1.6269912719726562} 11/06/2021 23:35:07 - INFO - __main__ - Step 16507: {'lr': 0.00048824161890319854, 'samples': 3169344, 'steps': 16506, 'loss/train': 1.6323604583740234} 11/06/2021 23:35:07 - INFO - __main__ - Step 16508: {'lr': 0.00048824001050412304, 'samples': 3169536, 'steps': 16507, 'loss/train': 1.1805651187896729} 11/06/2021 23:35:08 - INFO - __main__ - Step 16509: {'lr': 0.0004882384019977003, 'samples': 3169728, 'steps': 16508, 'loss/train': 1.7466378211975098} 11/06/2021 23:35:09 - INFO - __main__ - Step 16510: {'lr': 0.000488236793383931, 'samples': 3169920, 'steps': 16509, 'loss/train': 1.939573049545288} 11/06/2021 23:35:09 - INFO - __main__ - Step 16511: {'lr': 0.00048823518466281586, 'samples': 3170112, 'steps': 16510, 'loss/train': 1.5818568468093872} 11/06/2021 23:35:09 - INFO - __main__ - Step 16512: {'lr': 0.0004882335758343557, 'samples': 3170304, 'steps': 16511, 'loss/train': 1.3444141149520874} 11/06/2021 23:35:10 - INFO - __main__ - Step 16513: {'lr': 0.0004882319668985511, 'samples': 3170496, 'steps': 16512, 'loss/train': 1.5488812923431396} 11/06/2021 23:35:10 - INFO - __main__ - Step 16514: {'lr': 0.00048823035785540284, 'samples': 3170688, 'steps': 16513, 'loss/train': 1.5068031549453735} 11/06/2021 23:35:10 - INFO - __main__ - Step 16515: {'lr': 0.0004882287487049117, 'samples': 3170880, 'steps': 16514, 'loss/train': 2.055966854095459} 11/06/2021 23:35:11 - INFO - __main__ - Step 16516: {'lr': 0.00048822713944707833, 'samples': 3171072, 'steps': 16515, 'loss/train': 1.9989991188049316} 11/06/2021 23:35:12 - INFO - __main__ - Step 16517: {'lr': 0.0004882255300819035, 'samples': 3171264, 'steps': 16516, 'loss/train': 1.4609509706497192} 11/06/2021 23:35:12 - INFO - __main__ - Step 16518: {'lr': 0.0004882239206093879, 'samples': 3171456, 'steps': 16517, 'loss/train': 1.803931713104248} 11/06/2021 23:35:12 - INFO - __main__ - Step 16519: {'lr': 0.0004882223110295323, 'samples': 3171648, 'steps': 16518, 'loss/train': 1.8688184022903442} 11/06/2021 23:35:13 - INFO - __main__ - Step 16520: {'lr': 0.00048822070134233743, 'samples': 3171840, 'steps': 16519, 'loss/train': 2.412503719329834} 11/06/2021 23:35:14 - INFO - __main__ - Step 16521: {'lr': 0.000488219091547804, 'samples': 3172032, 'steps': 16520, 'loss/train': 1.5332022905349731} 11/06/2021 23:35:14 - INFO - __main__ - Step 16522: {'lr': 0.0004882174816459326, 'samples': 3172224, 'steps': 16521, 'loss/train': 1.0638723373413086} 11/06/2021 23:35:15 - INFO - __main__ - Step 16523: {'lr': 0.0004882158716367242, 'samples': 3172416, 'steps': 16522, 'loss/train': 1.8770873546600342} 11/06/2021 23:35:15 - INFO - __main__ - Step 16524: {'lr': 0.0004882142615201793, 'samples': 3172608, 'steps': 16523, 'loss/train': 1.3749724626541138} 11/06/2021 23:35:15 - INFO - __main__ - Step 16525: {'lr': 0.00048821265129629887, 'samples': 3172800, 'steps': 16524, 'loss/train': 1.3542416095733643} 11/06/2021 23:35:16 - INFO - __main__ - Step 16526: {'lr': 0.0004882110409650834, 'samples': 3172992, 'steps': 16525, 'loss/train': 1.317636489868164} 11/06/2021 23:35:17 - INFO - __main__ - Step 16527: {'lr': 0.0004882094305265338, 'samples': 3173184, 'steps': 16526, 'loss/train': 1.8596268892288208} 11/06/2021 23:35:17 - INFO - __main__ - Step 16528: {'lr': 0.00048820781998065054, 'samples': 3173376, 'steps': 16527, 'loss/train': 1.1366714239120483} 11/06/2021 23:35:17 - INFO - __main__ - Step 16529: {'lr': 0.00048820620932743465, 'samples': 3173568, 'steps': 16528, 'loss/train': 1.765393853187561} 11/06/2021 23:35:18 - INFO - __main__ - Step 16530: {'lr': 0.0004882045985668867, 'samples': 3173760, 'steps': 16529, 'loss/train': 1.457262635231018} 11/06/2021 23:35:19 - INFO - __main__ - Step 16531: {'lr': 0.0004882029876990074, 'samples': 3173952, 'steps': 16530, 'loss/train': 1.3767226934432983} 11/06/2021 23:35:19 - INFO - __main__ - Step 16532: {'lr': 0.0004882013767237975, 'samples': 3174144, 'steps': 16531, 'loss/train': 1.6697412729263306} 11/06/2021 23:35:20 - INFO - __main__ - Step 16533: {'lr': 0.0004881997656412578, 'samples': 3174336, 'steps': 16532, 'loss/train': 1.7419915199279785} 11/06/2021 23:35:20 - INFO - __main__ - Step 16534: {'lr': 0.0004881981544513889, 'samples': 3174528, 'steps': 16533, 'loss/train': 1.6078473329544067} 11/06/2021 23:35:20 - INFO - __main__ - Step 16535: {'lr': 0.0004881965431541916, 'samples': 3174720, 'steps': 16534, 'loss/train': 1.4096086025238037} 11/06/2021 23:35:21 - INFO - __main__ - Step 16536: {'lr': 0.0004881949317496667, 'samples': 3174912, 'steps': 16535, 'loss/train': 1.502119541168213} 11/06/2021 23:35:22 - INFO - __main__ - Step 16537: {'lr': 0.0004881933202378147, 'samples': 3175104, 'steps': 16536, 'loss/train': 1.3648459911346436} 11/06/2021 23:35:22 - INFO - __main__ - Step 16538: {'lr': 0.0004881917086186365, 'samples': 3175296, 'steps': 16537, 'loss/train': 2.249582290649414} 11/06/2021 23:35:22 - INFO - __main__ - Step 16539: {'lr': 0.0004881900968921328, 'samples': 3175488, 'steps': 16538, 'loss/train': 1.8910950422286987} 11/06/2021 23:35:23 - INFO - __main__ - Step 16540: {'lr': 0.00048818848505830436, 'samples': 3175680, 'steps': 16539, 'loss/train': 0.856475293636322} 11/06/2021 23:35:24 - INFO - __main__ - Step 16541: {'lr': 0.0004881868731171518, 'samples': 3175872, 'steps': 16540, 'loss/train': 1.7443568706512451} 11/06/2021 23:35:24 - INFO - __main__ - Step 16542: {'lr': 0.000488185261068676, 'samples': 3176064, 'steps': 16541, 'loss/train': 1.7981657981872559} 11/06/2021 23:35:25 - INFO - __main__ - Step 16543: {'lr': 0.0004881836489128776, 'samples': 3176256, 'steps': 16542, 'loss/train': 1.6968001127243042} 11/06/2021 23:35:25 - INFO - __main__ - Step 16544: {'lr': 0.00048818203664975727, 'samples': 3176448, 'steps': 16543, 'loss/train': 1.6674453020095825} 11/06/2021 23:35:25 - INFO - __main__ - Step 16545: {'lr': 0.00048818042427931573, 'samples': 3176640, 'steps': 16544, 'loss/train': 1.6024539470672607} 11/06/2021 23:35:26 - INFO - __main__ - Step 16546: {'lr': 0.00048817881180155385, 'samples': 3176832, 'steps': 16545, 'loss/train': 1.7665468454360962} 11/06/2021 23:35:27 - INFO - __main__ - Step 16547: {'lr': 0.0004881771992164722, 'samples': 3177024, 'steps': 16546, 'loss/train': 1.7142109870910645} 11/06/2021 23:35:27 - INFO - __main__ - Step 16548: {'lr': 0.0004881755865240717, 'samples': 3177216, 'steps': 16547, 'loss/train': 1.6377567052841187} 11/06/2021 23:35:27 - INFO - __main__ - Step 16549: {'lr': 0.0004881739737243528, 'samples': 3177408, 'steps': 16548, 'loss/train': 1.363764762878418} 11/06/2021 23:35:28 - INFO - __main__ - Step 16550: {'lr': 0.00048817236081731655, 'samples': 3177600, 'steps': 16549, 'loss/train': 1.6557155847549438} 11/06/2021 23:35:28 - INFO - __main__ - Step 16551: {'lr': 0.0004881707478029634, 'samples': 3177792, 'steps': 16550, 'loss/train': 1.6136027574539185} 11/06/2021 23:35:29 - INFO - __main__ - Step 16552: {'lr': 0.0004881691346812942, 'samples': 3177984, 'steps': 16551, 'loss/train': 1.5945550203323364} 11/06/2021 23:35:30 - INFO - __main__ - Step 16553: {'lr': 0.0004881675214523097, 'samples': 3178176, 'steps': 16552, 'loss/train': 1.704119086265564} 11/06/2021 23:35:30 - INFO - __main__ - Step 16554: {'lr': 0.00048816590811601054, 'samples': 3178368, 'steps': 16553, 'loss/train': 1.5714160203933716} 11/06/2021 23:35:30 - INFO - __main__ - Step 16555: {'lr': 0.0004881642946723975, 'samples': 3178560, 'steps': 16554, 'loss/train': 1.7432516813278198} 11/06/2021 23:35:31 - INFO - __main__ - Step 16556: {'lr': 0.00048816268112147134, 'samples': 3178752, 'steps': 16555, 'loss/train': 0.19290374219417572} 11/06/2021 23:35:32 - INFO - __main__ - Step 16557: {'lr': 0.00048816106746323273, 'samples': 3178944, 'steps': 16556, 'loss/train': 1.346990942955017} 11/06/2021 23:35:32 - INFO - __main__ - Step 16558: {'lr': 0.00048815945369768245, 'samples': 3179136, 'steps': 16557, 'loss/train': 1.5871580839157104} 11/06/2021 23:35:32 - INFO - __main__ - Step 16559: {'lr': 0.00048815783982482115, 'samples': 3179328, 'steps': 16558, 'loss/train': 1.4788908958435059} 11/06/2021 23:35:33 - INFO - __main__ - Step 16560: {'lr': 0.0004881562258446496, 'samples': 3179520, 'steps': 16559, 'loss/train': 1.6638139486312866} 11/06/2021 23:35:33 - INFO - __main__ - Step 16561: {'lr': 0.00048815461175716855, 'samples': 3179712, 'steps': 16560, 'loss/train': 1.913263201713562} 11/06/2021 23:35:35 - INFO - __main__ - Step 16562: {'lr': 0.00048815299756237873, 'samples': 3179904, 'steps': 16561, 'loss/train': 1.9439252614974976} 11/06/2021 23:35:35 - INFO - __main__ - Step 16563: {'lr': 0.0004881513832602808, 'samples': 3180096, 'steps': 16562, 'loss/train': 1.476514458656311} 11/06/2021 23:35:35 - INFO - __main__ - Step 16564: {'lr': 0.0004881497688508756, 'samples': 3180288, 'steps': 16563, 'loss/train': 1.613339900970459} 11/06/2021 23:35:36 - INFO - __main__ - Step 16565: {'lr': 0.0004881481543341637, 'samples': 3180480, 'steps': 16564, 'loss/train': 1.7296475172042847} 11/06/2021 23:35:36 - INFO - __main__ - Step 16566: {'lr': 0.000488146539710146, 'samples': 3180672, 'steps': 16565, 'loss/train': 1.3595619201660156} 11/06/2021 23:35:36 - INFO - __main__ - Step 16567: {'lr': 0.00048814492497882306, 'samples': 3180864, 'steps': 16566, 'loss/train': 1.9055120944976807} 11/06/2021 23:35:38 - INFO - __main__ - Step 16568: {'lr': 0.00048814331014019577, 'samples': 3181056, 'steps': 16567, 'loss/train': 1.4055638313293457} 11/06/2021 23:35:38 - INFO - __main__ - Step 16569: {'lr': 0.0004881416951942647, 'samples': 3181248, 'steps': 16568, 'loss/train': 1.8955738544464111} 11/06/2021 23:35:38 - INFO - __main__ - Step 16570: {'lr': 0.0004881400801410307, 'samples': 3181440, 'steps': 16569, 'loss/train': 1.4749326705932617} 11/06/2021 23:35:39 - INFO - __main__ - Step 16571: {'lr': 0.0004881384649804945, 'samples': 3181632, 'steps': 16570, 'loss/train': 1.6895521879196167} 11/06/2021 23:35:39 - INFO - __main__ - Step 16572: {'lr': 0.0004881368497126567, 'samples': 3181824, 'steps': 16571, 'loss/train': 1.8323323726654053} 11/06/2021 23:35:40 - INFO - __main__ - Step 16573: {'lr': 0.00048813523433751814, 'samples': 3182016, 'steps': 16572, 'loss/train': 1.8884717226028442} 11/06/2021 23:35:40 - INFO - __main__ - Step 16574: {'lr': 0.00048813361885507956, 'samples': 3182208, 'steps': 16573, 'loss/train': 1.50578773021698} 11/06/2021 23:35:41 - INFO - __main__ - Step 16575: {'lr': 0.00048813200326534156, 'samples': 3182400, 'steps': 16574, 'loss/train': 1.8170469999313354} 11/06/2021 23:35:41 - INFO - __main__ - Step 16576: {'lr': 0.00048813038756830506, 'samples': 3182592, 'steps': 16575, 'loss/train': 1.7620586156845093} 11/06/2021 23:35:41 - INFO - __main__ - Step 16577: {'lr': 0.00048812877176397066, 'samples': 3182784, 'steps': 16576, 'loss/train': 1.3254033327102661} 11/06/2021 23:35:42 - INFO - __main__ - Step 16578: {'lr': 0.00048812715585233905, 'samples': 3182976, 'steps': 16577, 'loss/train': 1.8124728202819824} 11/06/2021 23:35:43 - INFO - __main__ - Step 16579: {'lr': 0.000488125539833411, 'samples': 3183168, 'steps': 16578, 'loss/train': 1.7202956676483154} 11/06/2021 23:35:44 - INFO - __main__ - Step 16580: {'lr': 0.0004881239237071873, 'samples': 3183360, 'steps': 16579, 'loss/train': 1.9533145427703857} 11/06/2021 23:35:44 - INFO - __main__ - Step 16581: {'lr': 0.0004881223074736687, 'samples': 3183552, 'steps': 16580, 'loss/train': 1.7170475721359253} 11/06/2021 23:35:44 - INFO - __main__ - Step 16582: {'lr': 0.00048812069113285573, 'samples': 3183744, 'steps': 16581, 'loss/train': 1.8036795854568481} 11/06/2021 23:35:45 - INFO - __main__ - Step 16583: {'lr': 0.00048811907468474934, 'samples': 3183936, 'steps': 16582, 'loss/train': 1.9189525842666626} 11/06/2021 23:35:46 - INFO - __main__ - Step 16584: {'lr': 0.00048811745812935015, 'samples': 3184128, 'steps': 16583, 'loss/train': 1.163478970527649} 11/06/2021 23:35:46 - INFO - __main__ - Step 16585: {'lr': 0.00048811584146665895, 'samples': 3184320, 'steps': 16584, 'loss/train': 2.10164213180542} 11/06/2021 23:35:47 - INFO - __main__ - Step 16586: {'lr': 0.0004881142246966763, 'samples': 3184512, 'steps': 16585, 'loss/train': 2.115177631378174} 11/06/2021 23:35:47 - INFO - __main__ - Step 16587: {'lr': 0.00048811260781940317, 'samples': 3184704, 'steps': 16586, 'loss/train': 1.7502573728561401} 11/06/2021 23:35:47 - INFO - __main__ - Step 16588: {'lr': 0.00048811099083484016, 'samples': 3184896, 'steps': 16587, 'loss/train': 2.0302722454071045} 11/06/2021 23:35:48 - INFO - __main__ - Step 16589: {'lr': 0.000488109373742988, 'samples': 3185088, 'steps': 16588, 'loss/train': 1.9502590894699097} 11/06/2021 23:35:49 - INFO - __main__ - Step 16590: {'lr': 0.0004881077565438474, 'samples': 3185280, 'steps': 16589, 'loss/train': 2.074025869369507} 11/06/2021 23:35:49 - INFO - __main__ - Step 16591: {'lr': 0.0004881061392374192, 'samples': 3185472, 'steps': 16590, 'loss/train': 1.1119537353515625} 11/06/2021 23:35:49 - INFO - __main__ - Step 16592: {'lr': 0.000488104521823704, 'samples': 3185664, 'steps': 16591, 'loss/train': 2.0072836875915527} 11/06/2021 23:35:50 - INFO - __main__ - Step 16593: {'lr': 0.00048810290430270257, 'samples': 3185856, 'steps': 16592, 'loss/train': 1.3514482975006104} 11/06/2021 23:35:50 - INFO - __main__ - Step 16594: {'lr': 0.0004881012866744156, 'samples': 3186048, 'steps': 16593, 'loss/train': 1.91445791721344} 11/06/2021 23:35:51 - INFO - __main__ - Step 16595: {'lr': 0.00048809966893884396, 'samples': 3186240, 'steps': 16594, 'loss/train': 1.6212115287780762} 11/06/2021 23:35:52 - INFO - __main__ - Step 16596: {'lr': 0.00048809805109598813, 'samples': 3186432, 'steps': 16595, 'loss/train': 1.7508177757263184} 11/06/2021 23:35:52 - INFO - __main__ - Step 16597: {'lr': 0.0004880964331458492, 'samples': 3186624, 'steps': 16596, 'loss/train': 1.6479634046554565} 11/06/2021 23:35:52 - INFO - __main__ - Step 16598: {'lr': 0.0004880948150884276, 'samples': 3186816, 'steps': 16597, 'loss/train': 1.4154289960861206} 11/06/2021 23:35:53 - INFO - __main__ - Step 16599: {'lr': 0.00048809319692372406, 'samples': 3187008, 'steps': 16598, 'loss/train': 1.8749748468399048} 11/06/2021 23:35:53 - INFO - __main__ - Step 16600: {'lr': 0.0004880915786517395, 'samples': 3187200, 'steps': 16599, 'loss/train': 1.1239466667175293} 11/06/2021 23:35:54 - INFO - __main__ - Step 16601: {'lr': 0.00048808996027247453, 'samples': 3187392, 'steps': 16600, 'loss/train': 1.6338905096054077} 11/06/2021 23:35:55 - INFO - __main__ - Step 16602: {'lr': 0.0004880883417859299, 'samples': 3187584, 'steps': 16601, 'loss/train': 1.2137621641159058} 11/06/2021 23:35:55 - INFO - __main__ - Step 16603: {'lr': 0.0004880867231921063, 'samples': 3187776, 'steps': 16602, 'loss/train': 1.6289089918136597} 11/06/2021 23:35:55 - INFO - __main__ - Step 16604: {'lr': 0.0004880851044910045, 'samples': 3187968, 'steps': 16603, 'loss/train': 1.77559494972229} 11/06/2021 23:35:56 - INFO - __main__ - Step 16605: {'lr': 0.0004880834856826253, 'samples': 3188160, 'steps': 16604, 'loss/train': 1.5778307914733887} 11/06/2021 23:35:56 - INFO - __main__ - Step 16606: {'lr': 0.0004880818667669693, 'samples': 3188352, 'steps': 16605, 'loss/train': 1.3657011985778809} 11/06/2021 23:35:57 - INFO - __main__ - Step 16607: {'lr': 0.00048808024774403726, 'samples': 3188544, 'steps': 16606, 'loss/train': 1.502026915550232} 11/06/2021 23:35:57 - INFO - __main__ - Step 16608: {'lr': 0.00048807862861382996, 'samples': 3188736, 'steps': 16607, 'loss/train': 1.240162968635559} 11/06/2021 23:35:58 - INFO - __main__ - Step 16609: {'lr': 0.0004880770093763481, 'samples': 3188928, 'steps': 16608, 'loss/train': 1.8014652729034424} 11/06/2021 23:35:58 - INFO - __main__ - Step 16610: {'lr': 0.0004880753900315924, 'samples': 3189120, 'steps': 16609, 'loss/train': 1.908936619758606} 11/06/2021 23:35:59 - INFO - __main__ - Step 16611: {'lr': 0.00048807377057956365, 'samples': 3189312, 'steps': 16610, 'loss/train': 1.7892169952392578} 11/06/2021 23:35:59 - INFO - __main__ - Step 16612: {'lr': 0.00048807215102026247, 'samples': 3189504, 'steps': 16611, 'loss/train': 1.5934950113296509} 11/06/2021 23:36:00 - INFO - __main__ - Step 16613: {'lr': 0.00048807053135368973, 'samples': 3189696, 'steps': 16612, 'loss/train': 2.044344902038574} 11/06/2021 23:36:00 - INFO - __main__ - Step 16614: {'lr': 0.00048806891157984604, 'samples': 3189888, 'steps': 16613, 'loss/train': 2.3011348247528076} 11/06/2021 23:36:01 - INFO - __main__ - Step 16615: {'lr': 0.0004880672916987322, 'samples': 3190080, 'steps': 16614, 'loss/train': 1.4086577892303467} 11/06/2021 23:36:01 - INFO - __main__ - Step 16616: {'lr': 0.0004880656717103489, 'samples': 3190272, 'steps': 16615, 'loss/train': 1.8714768886566162} 11/06/2021 23:36:02 - INFO - __main__ - Step 16617: {'lr': 0.0004880640516146968, 'samples': 3190464, 'steps': 16616, 'loss/train': 1.5197786092758179} 11/06/2021 23:36:02 - INFO - __main__ - Step 16618: {'lr': 0.0004880624314117768, 'samples': 3190656, 'steps': 16617, 'loss/train': 1.993930459022522} 11/06/2021 23:36:03 - INFO - __main__ - Step 16619: {'lr': 0.0004880608111015895, 'samples': 3190848, 'steps': 16618, 'loss/train': 1.221077799797058} 11/06/2021 23:36:03 - INFO - __main__ - Step 16620: {'lr': 0.00048805919068413574, 'samples': 3191040, 'steps': 16619, 'loss/train': 3.036583185195923} 11/06/2021 23:36:03 - INFO - __main__ - Step 16621: {'lr': 0.0004880575701594161, 'samples': 3191232, 'steps': 16620, 'loss/train': 6.032649040222168} 11/06/2021 23:36:04 - INFO - __main__ - Step 16622: {'lr': 0.0004880559495274315, 'samples': 3191424, 'steps': 16621, 'loss/train': 1.834834337234497} 11/06/2021 23:36:05 - INFO - __main__ - Step 16623: {'lr': 0.00048805432878818247, 'samples': 3191616, 'steps': 16622, 'loss/train': 1.6473767757415771} 11/06/2021 23:36:05 - INFO - __main__ - Step 16624: {'lr': 0.0004880527079416698, 'samples': 3191808, 'steps': 16623, 'loss/train': 2.2454183101654053} 11/06/2021 23:36:05 - INFO - __main__ - Step 16625: {'lr': 0.00048805108698789435, 'samples': 3192000, 'steps': 16624, 'loss/train': 1.9317262172698975} 11/06/2021 23:36:06 - INFO - __main__ - Step 16626: {'lr': 0.00048804946592685667, 'samples': 3192192, 'steps': 16625, 'loss/train': 1.8206801414489746} 11/06/2021 23:36:06 - INFO - __main__ - Step 16627: {'lr': 0.0004880478447585576, 'samples': 3192384, 'steps': 16626, 'loss/train': 1.9301916360855103} 11/06/2021 23:36:07 - INFO - __main__ - Step 16628: {'lr': 0.00048804622348299785, 'samples': 3192576, 'steps': 16627, 'loss/train': 1.4593732357025146} 11/06/2021 23:36:08 - INFO - __main__ - Step 16629: {'lr': 0.0004880446021001782, 'samples': 3192768, 'steps': 16628, 'loss/train': 1.5824449062347412} 11/06/2021 23:36:08 - INFO - __main__ - Step 16630: {'lr': 0.00048804298061009925, 'samples': 3192960, 'steps': 16629, 'loss/train': 1.4731026887893677} 11/06/2021 23:36:08 - INFO - __main__ - Step 16631: {'lr': 0.0004880413590127619, 'samples': 3193152, 'steps': 16630, 'loss/train': 2.0067667961120605} 11/06/2021 23:36:09 - INFO - __main__ - Step 16632: {'lr': 0.0004880397373081666, 'samples': 3193344, 'steps': 16631, 'loss/train': 1.5343999862670898} 11/06/2021 23:36:10 - INFO - __main__ - Step 16633: {'lr': 0.0004880381154963145, 'samples': 3193536, 'steps': 16632, 'loss/train': 1.626529574394226} 11/06/2021 23:36:10 - INFO - __main__ - Step 16634: {'lr': 0.0004880364935772059, 'samples': 3193728, 'steps': 16633, 'loss/train': 1.5364887714385986} 11/06/2021 23:36:10 - INFO - __main__ - Step 16635: {'lr': 0.00048803487155084184, 'samples': 3193920, 'steps': 16634, 'loss/train': 1.7955859899520874} 11/06/2021 23:36:11 - INFO - __main__ - Step 16636: {'lr': 0.00048803324941722295, 'samples': 3194112, 'steps': 16635, 'loss/train': 2.057948350906372} 11/06/2021 23:36:11 - INFO - __main__ - Step 16637: {'lr': 0.0004880316271763499, 'samples': 3194304, 'steps': 16636, 'loss/train': 1.8370440006256104} 11/06/2021 23:36:11 - INFO - __main__ - Step 16638: {'lr': 0.0004880300048282235, 'samples': 3194496, 'steps': 16637, 'loss/train': 1.8288404941558838} 11/06/2021 23:36:13 - INFO - __main__ - Step 16639: {'lr': 0.00048802838237284443, 'samples': 3194688, 'steps': 16638, 'loss/train': 2.116184949874878} 11/06/2021 23:36:13 - INFO - __main__ - Step 16640: {'lr': 0.0004880267598102135, 'samples': 3194880, 'steps': 16639, 'loss/train': 1.812418818473816} 11/06/2021 23:36:13 - INFO - __main__ - Step 16641: {'lr': 0.0004880251371403313, 'samples': 3195072, 'steps': 16640, 'loss/train': 1.6309278011322021} 11/06/2021 23:36:14 - INFO - __main__ - Step 16642: {'lr': 0.0004880235143631987, 'samples': 3195264, 'steps': 16641, 'loss/train': 1.4941754341125488} 11/06/2021 23:36:14 - INFO - __main__ - Step 16643: {'lr': 0.0004880218914788164, 'samples': 3195456, 'steps': 16642, 'loss/train': 1.2120184898376465} 11/06/2021 23:36:15 - INFO - __main__ - Step 16644: {'lr': 0.00048802026848718505, 'samples': 3195648, 'steps': 16643, 'loss/train': 2.3507211208343506} 11/06/2021 23:36:15 - INFO - __main__ - Step 16645: {'lr': 0.0004880186453883054, 'samples': 3195840, 'steps': 16644, 'loss/train': 1.6673327684402466} 11/06/2021 23:36:16 - INFO - __main__ - Step 16646: {'lr': 0.00048801702218217834, 'samples': 3196032, 'steps': 16645, 'loss/train': 0.5201470255851746} 11/06/2021 23:36:16 - INFO - __main__ - Step 16647: {'lr': 0.0004880153988688044, 'samples': 3196224, 'steps': 16646, 'loss/train': 1.4588996171951294} 11/06/2021 23:36:16 - INFO - __main__ - Step 16648: {'lr': 0.0004880137754481845, 'samples': 3196416, 'steps': 16647, 'loss/train': 1.8681379556655884} 11/06/2021 23:36:18 - INFO - __main__ - Step 16649: {'lr': 0.0004880121519203191, 'samples': 3196608, 'steps': 16648, 'loss/train': 1.4541462659835815} 11/06/2021 23:36:18 - INFO - __main__ - Step 16650: {'lr': 0.0004880105282852092, 'samples': 3196800, 'steps': 16649, 'loss/train': 1.4326448440551758} 11/06/2021 23:36:18 - INFO - __main__ - Step 16651: {'lr': 0.0004880089045428554, 'samples': 3196992, 'steps': 16650, 'loss/train': 1.7155417203903198} 11/06/2021 23:36:19 - INFO - __main__ - Step 16652: {'lr': 0.0004880072806932585, 'samples': 3197184, 'steps': 16651, 'loss/train': 1.279304027557373} 11/06/2021 23:36:19 - INFO - __main__ - Step 16653: {'lr': 0.00048800565673641917, 'samples': 3197376, 'steps': 16652, 'loss/train': 1.7802386283874512} 11/06/2021 23:36:20 - INFO - __main__ - Step 16654: {'lr': 0.0004880040326723382, 'samples': 3197568, 'steps': 16653, 'loss/train': 1.3103575706481934} 11/06/2021 23:36:20 - INFO - __main__ - Step 16655: {'lr': 0.0004880024085010162, 'samples': 3197760, 'steps': 16654, 'loss/train': 1.8908525705337524} 11/06/2021 23:36:21 - INFO - __main__ - Step 16656: {'lr': 0.00048800078422245406, 'samples': 3197952, 'steps': 16655, 'loss/train': 1.29991614818573} 11/06/2021 23:36:21 - INFO - __main__ - Step 16657: {'lr': 0.0004879991598366524, 'samples': 3198144, 'steps': 16656, 'loss/train': 1.5738615989685059} 11/06/2021 23:36:21 - INFO - __main__ - Step 16658: {'lr': 0.000487997535343612, 'samples': 3198336, 'steps': 16657, 'loss/train': 1.7001148462295532} 11/06/2021 23:36:22 - INFO - __main__ - Step 16659: {'lr': 0.0004879959107433336, 'samples': 3198528, 'steps': 16658, 'loss/train': 2.02748966217041} 11/06/2021 23:36:23 - INFO - __main__ - Step 16660: {'lr': 0.00048799428603581786, 'samples': 3198720, 'steps': 16659, 'loss/train': 1.5537936687469482} 11/06/2021 23:36:23 - INFO - __main__ - Step 16661: {'lr': 0.0004879926612210656, 'samples': 3198912, 'steps': 16660, 'loss/train': 1.3373011350631714} 11/06/2021 23:36:23 - INFO - __main__ - Step 16662: {'lr': 0.0004879910362990775, 'samples': 3199104, 'steps': 16661, 'loss/train': 1.371505856513977} 11/06/2021 23:36:24 - INFO - __main__ - Step 16663: {'lr': 0.0004879894112698544, 'samples': 3199296, 'steps': 16662, 'loss/train': 1.5822465419769287} 11/06/2021 23:36:24 - INFO - __main__ - Step 16664: {'lr': 0.0004879877861333969, 'samples': 3199488, 'steps': 16663, 'loss/train': 1.791337013244629} 11/06/2021 23:36:25 - INFO - __main__ - Step 16665: {'lr': 0.00048798616088970573, 'samples': 3199680, 'steps': 16664, 'loss/train': 1.502487063407898} 11/06/2021 23:36:26 - INFO - __main__ - Step 16666: {'lr': 0.0004879845355387817, 'samples': 3199872, 'steps': 16665, 'loss/train': 1.4889453649520874} 11/06/2021 23:36:26 - INFO - __main__ - Step 16667: {'lr': 0.00048798291008062553, 'samples': 3200064, 'steps': 16666, 'loss/train': 1.684088110923767} 11/06/2021 23:36:26 - INFO - __main__ - Step 16668: {'lr': 0.0004879812845152379, 'samples': 3200256, 'steps': 16667, 'loss/train': 1.8223564624786377} 11/06/2021 23:36:27 - INFO - __main__ - Step 16669: {'lr': 0.0004879796588426195, 'samples': 3200448, 'steps': 16668, 'loss/train': 1.8264342546463013} 11/06/2021 23:36:28 - INFO - __main__ - Step 16670: {'lr': 0.0004879780330627713, 'samples': 3200640, 'steps': 16669, 'loss/train': 1.4178427457809448} 11/06/2021 23:36:28 - INFO - __main__ - Step 16671: {'lr': 0.0004879764071756938, 'samples': 3200832, 'steps': 16670, 'loss/train': 1.6339678764343262} 11/06/2021 23:36:28 - INFO - __main__ - Step 16672: {'lr': 0.00048797478118138777, 'samples': 3201024, 'steps': 16671, 'loss/train': 1.8383632898330688} 11/06/2021 23:36:29 - INFO - __main__ - Step 16673: {'lr': 0.000487973155079854, 'samples': 3201216, 'steps': 16672, 'loss/train': 1.7812671661376953} 11/06/2021 23:36:29 - INFO - __main__ - Step 16674: {'lr': 0.0004879715288710932, 'samples': 3201408, 'steps': 16673, 'loss/train': 0.6627610325813293} 11/06/2021 23:36:30 - INFO - __main__ - Step 16675: {'lr': 0.0004879699025551061, 'samples': 3201600, 'steps': 16674, 'loss/train': 0.5288013815879822} 11/06/2021 23:36:31 - INFO - __main__ - Step 16676: {'lr': 0.0004879682761318934, 'samples': 3201792, 'steps': 16675, 'loss/train': 1.3023256063461304} 11/06/2021 23:36:31 - INFO - __main__ - Step 16677: {'lr': 0.00048796664960145596, 'samples': 3201984, 'steps': 16676, 'loss/train': 1.4314196109771729} 11/06/2021 23:36:31 - INFO - __main__ - Step 16678: {'lr': 0.00048796502296379437, 'samples': 3202176, 'steps': 16677, 'loss/train': 1.5249167680740356} 11/06/2021 23:36:32 - INFO - __main__ - Step 16679: {'lr': 0.0004879633962189094, 'samples': 3202368, 'steps': 16678, 'loss/train': 1.4833424091339111} 11/06/2021 23:36:33 - INFO - __main__ - Step 16680: {'lr': 0.0004879617693668018, 'samples': 3202560, 'steps': 16679, 'loss/train': 2.0442960262298584} 11/06/2021 23:36:33 - INFO - __main__ - Step 16681: {'lr': 0.00048796014240747227, 'samples': 3202752, 'steps': 16680, 'loss/train': 2.0574045181274414} 11/06/2021 23:36:33 - INFO - __main__ - Step 16682: {'lr': 0.0004879585153409216, 'samples': 3202944, 'steps': 16681, 'loss/train': 2.0046627521514893} 11/06/2021 23:36:34 - INFO - __main__ - Step 16683: {'lr': 0.0004879568881671505, 'samples': 3203136, 'steps': 16682, 'loss/train': 1.7298773527145386} 11/06/2021 23:36:34 - INFO - __main__ - Step 16684: {'lr': 0.0004879552608861597, 'samples': 3203328, 'steps': 16683, 'loss/train': 2.0869789123535156} 11/06/2021 23:36:35 - INFO - __main__ - Step 16685: {'lr': 0.00048795363349794996, 'samples': 3203520, 'steps': 16684, 'loss/train': 1.313889741897583} 11/06/2021 23:36:36 - INFO - __main__ - Step 16686: {'lr': 0.00048795200600252193, 'samples': 3203712, 'steps': 16685, 'loss/train': 1.5983039140701294} 11/06/2021 23:36:36 - INFO - __main__ - Step 16687: {'lr': 0.00048795037839987644, 'samples': 3203904, 'steps': 16686, 'loss/train': 1.7276376485824585} 11/06/2021 23:36:36 - INFO - __main__ - Step 16688: {'lr': 0.0004879487506900141, 'samples': 3204096, 'steps': 16687, 'loss/train': 1.1915156841278076} 11/06/2021 23:36:37 - INFO - __main__ - Step 16689: {'lr': 0.0004879471228729358, 'samples': 3204288, 'steps': 16688, 'loss/train': 1.5921833515167236} 11/06/2021 23:36:37 - INFO - __main__ - Step 16690: {'lr': 0.0004879454949486422, 'samples': 3204480, 'steps': 16689, 'loss/train': 1.0146256685256958} 11/06/2021 23:36:38 - INFO - __main__ - Step 16691: {'lr': 0.000487943866917134, 'samples': 3204672, 'steps': 16690, 'loss/train': 1.430779218673706} 11/06/2021 23:36:38 - INFO - __main__ - Step 16692: {'lr': 0.00048794223877841197, 'samples': 3204864, 'steps': 16691, 'loss/train': 1.9642324447631836} 11/06/2021 23:36:39 - INFO - __main__ - Step 16693: {'lr': 0.00048794061053247686, 'samples': 3205056, 'steps': 16692, 'loss/train': 1.744354009628296} 11/06/2021 23:36:39 - INFO - __main__ - Step 16694: {'lr': 0.0004879389821793294, 'samples': 3205248, 'steps': 16693, 'loss/train': 1.6574643850326538} 11/06/2021 23:36:39 - INFO - __main__ - Step 16695: {'lr': 0.00048793735371897027, 'samples': 3205440, 'steps': 16694, 'loss/train': 1.3107839822769165} 11/06/2021 23:36:41 - INFO - __main__ - Step 16696: {'lr': 0.00048793572515140024, 'samples': 3205632, 'steps': 16695, 'loss/train': 1.8585938215255737} 11/06/2021 23:36:41 - INFO - __main__ - Step 16697: {'lr': 0.00048793409647662, 'samples': 3205824, 'steps': 16696, 'loss/train': 1.749839186668396} 11/06/2021 23:36:41 - INFO - __main__ - Step 16698: {'lr': 0.0004879324676946304, 'samples': 3206016, 'steps': 16697, 'loss/train': 6.0391693115234375} 11/06/2021 23:36:42 - INFO - __main__ - Step 16699: {'lr': 0.0004879308388054321, 'samples': 3206208, 'steps': 16698, 'loss/train': 1.5355502367019653} 11/06/2021 23:36:42 - INFO - __main__ - Step 16700: {'lr': 0.0004879292098090258, 'samples': 3206400, 'steps': 16699, 'loss/train': 1.8935089111328125} 11/06/2021 23:36:42 - INFO - __main__ - Step 16701: {'lr': 0.00048792758070541234, 'samples': 3206592, 'steps': 16700, 'loss/train': 1.749446153640747} 11/06/2021 23:36:43 - INFO - __main__ - Step 16702: {'lr': 0.00048792595149459226, 'samples': 3206784, 'steps': 16701, 'loss/train': 5.876288890838623} 11/06/2021 23:36:44 - INFO - __main__ - Step 16703: {'lr': 0.0004879243221765665, 'samples': 3206976, 'steps': 16702, 'loss/train': 5.527763366699219} 11/06/2021 23:36:44 - INFO - __main__ - Step 16704: {'lr': 0.00048792269275133574, 'samples': 3207168, 'steps': 16703, 'loss/train': 1.829983115196228} 11/06/2021 23:36:45 - INFO - __main__ - Step 16705: {'lr': 0.0004879210632189006, 'samples': 3207360, 'steps': 16704, 'loss/train': 1.1864635944366455} 11/06/2021 23:36:45 - INFO - __main__ - Step 16706: {'lr': 0.0004879194335792619, 'samples': 3207552, 'steps': 16705, 'loss/train': 1.3388835191726685} 11/06/2021 23:36:45 - INFO - __main__ - Step 16707: {'lr': 0.0004879178038324205, 'samples': 3207744, 'steps': 16706, 'loss/train': 1.0322556495666504} 11/06/2021 23:36:46 - INFO - __main__ - Step 16708: {'lr': 0.0004879161739783769, 'samples': 3207936, 'steps': 16707, 'loss/train': 1.4637303352355957} 11/06/2021 23:36:47 - INFO - __main__ - Step 16709: {'lr': 0.00048791454401713195, 'samples': 3208128, 'steps': 16708, 'loss/train': 2.0983338356018066} 11/06/2021 23:36:47 - INFO - __main__ - Step 16710: {'lr': 0.00048791291394868644, 'samples': 3208320, 'steps': 16709, 'loss/train': 2.0529398918151855} 11/06/2021 23:36:47 - INFO - __main__ - Step 16711: {'lr': 0.000487911283773041, 'samples': 3208512, 'steps': 16710, 'loss/train': 1.6955748796463013} 11/06/2021 23:36:48 - INFO - __main__ - Step 16712: {'lr': 0.0004879096534901964, 'samples': 3208704, 'steps': 16711, 'loss/train': 1.505379319190979} 11/06/2021 23:36:49 - INFO - __main__ - Step 16713: {'lr': 0.00048790802310015336, 'samples': 3208896, 'steps': 16712, 'loss/train': 1.625261664390564} 11/06/2021 23:36:49 - INFO - __main__ - Step 16714: {'lr': 0.0004879063926029127, 'samples': 3209088, 'steps': 16713, 'loss/train': 1.708368182182312} 11/06/2021 23:36:50 - INFO - __main__ - Step 16715: {'lr': 0.00048790476199847506, 'samples': 3209280, 'steps': 16714, 'loss/train': 1.6214886903762817} 11/06/2021 23:36:50 - INFO - __main__ - Step 16716: {'lr': 0.0004879031312868412, 'samples': 3209472, 'steps': 16715, 'loss/train': 1.734784722328186} 11/06/2021 23:36:50 - INFO - __main__ - Step 16717: {'lr': 0.00048790150046801187, 'samples': 3209664, 'steps': 16716, 'loss/train': 1.8624287843704224} 11/06/2021 23:36:51 - INFO - __main__ - Step 16718: {'lr': 0.0004878998695419877, 'samples': 3209856, 'steps': 16717, 'loss/train': 1.689518690109253} 11/06/2021 23:36:52 - INFO - __main__ - Step 16719: {'lr': 0.0004878982385087697, 'samples': 3210048, 'steps': 16718, 'loss/train': 2.1184022426605225} 11/06/2021 23:36:52 - INFO - __main__ - Step 16720: {'lr': 0.0004878966073683583, 'samples': 3210240, 'steps': 16719, 'loss/train': 1.6878275871276855} 11/06/2021 23:36:52 - INFO - __main__ - Step 16721: {'lr': 0.0004878949761207544, 'samples': 3210432, 'steps': 16720, 'loss/train': 1.9824398756027222} 11/06/2021 23:36:53 - INFO - __main__ - Step 16722: {'lr': 0.0004878933447659587, 'samples': 3210624, 'steps': 16721, 'loss/train': 2.208752155303955} 11/06/2021 23:36:53 - INFO - __main__ - Step 16723: {'lr': 0.0004878917133039719, 'samples': 3210816, 'steps': 16722, 'loss/train': 1.5395011901855469} 11/06/2021 23:36:54 - INFO - __main__ - Step 16724: {'lr': 0.00048789008173479476, 'samples': 3211008, 'steps': 16723, 'loss/train': 1.7593508958816528} 11/06/2021 23:36:54 - INFO - __main__ - Step 16725: {'lr': 0.0004878884500584281, 'samples': 3211200, 'steps': 16724, 'loss/train': 1.9849613904953003} 11/06/2021 23:36:55 - INFO - __main__ - Step 16726: {'lr': 0.0004878868182748725, 'samples': 3211392, 'steps': 16725, 'loss/train': 1.6343092918395996} 11/06/2021 23:36:55 - INFO - __main__ - Step 16727: {'lr': 0.0004878851863841287, 'samples': 3211584, 'steps': 16726, 'loss/train': 1.4157750606536865} 11/06/2021 23:36:55 - INFO - __main__ - Step 16728: {'lr': 0.00048788355438619764, 'samples': 3211776, 'steps': 16727, 'loss/train': 1.9465490579605103} 11/06/2021 23:36:57 - INFO - __main__ - Step 16729: {'lr': 0.00048788192228107986, 'samples': 3211968, 'steps': 16728, 'loss/train': 1.3417645692825317} 11/06/2021 23:36:57 - INFO - __main__ - Step 16730: {'lr': 0.00048788029006877623, 'samples': 3212160, 'steps': 16729, 'loss/train': 1.5599322319030762} 11/06/2021 23:36:57 - INFO - __main__ - Step 16731: {'lr': 0.0004878786577492873, 'samples': 3212352, 'steps': 16730, 'loss/train': 2.0975825786590576} 11/06/2021 23:36:58 - INFO - __main__ - Step 16732: {'lr': 0.00048787702532261396, 'samples': 3212544, 'steps': 16731, 'loss/train': 1.131968379020691} 11/06/2021 23:36:58 - INFO - __main__ - Step 16733: {'lr': 0.0004878753927887569, 'samples': 3212736, 'steps': 16732, 'loss/train': 1.7023571729660034} 11/06/2021 23:36:59 - INFO - __main__ - Step 16734: {'lr': 0.0004878737601477169, 'samples': 3212928, 'steps': 16733, 'loss/train': 1.4870761632919312} 11/06/2021 23:36:59 - INFO - __main__ - Step 16735: {'lr': 0.0004878721273994946, 'samples': 3213120, 'steps': 16734, 'loss/train': 1.8174546957015991} 11/06/2021 23:37:00 - INFO - __main__ - Step 16736: {'lr': 0.00048787049454409085, 'samples': 3213312, 'steps': 16735, 'loss/train': 1.6233512163162231} 11/06/2021 23:37:00 - INFO - __main__ - Step 16737: {'lr': 0.0004878688615815063, 'samples': 3213504, 'steps': 16736, 'loss/train': 1.541832447052002} 11/06/2021 23:37:00 - INFO - __main__ - Step 16738: {'lr': 0.0004878672285117417, 'samples': 3213696, 'steps': 16737, 'loss/train': 1.9334702491760254} 11/06/2021 23:37:01 - INFO - __main__ - Step 16739: {'lr': 0.0004878655953347978, 'samples': 3213888, 'steps': 16738, 'loss/train': 0.17051845788955688} 11/06/2021 23:37:02 - INFO - __main__ - Step 16740: {'lr': 0.0004878639620506753, 'samples': 3214080, 'steps': 16739, 'loss/train': 1.7930675745010376} 11/06/2021 23:37:02 - INFO - __main__ - Step 16741: {'lr': 0.00048786232865937504, 'samples': 3214272, 'steps': 16740, 'loss/train': 1.6320091485977173} 11/06/2021 23:37:02 - INFO - __main__ - Step 16742: {'lr': 0.0004878606951608976, 'samples': 3214464, 'steps': 16741, 'loss/train': 2.05916428565979} 11/06/2021 23:37:03 - INFO - __main__ - Step 16743: {'lr': 0.00048785906155524386, 'samples': 3214656, 'steps': 16742, 'loss/train': 1.2321640253067017} 11/06/2021 23:37:05 - INFO - __main__ - Step 16744: {'lr': 0.0004878574278424145, 'samples': 3214848, 'steps': 16743, 'loss/train': 1.7712355852127075} 11/06/2021 23:37:05 - INFO - __main__ - Step 16745: {'lr': 0.0004878557940224102, 'samples': 3215040, 'steps': 16744, 'loss/train': 1.3392685651779175} 11/06/2021 23:37:05 - INFO - __main__ - Step 16746: {'lr': 0.0004878541600952318, 'samples': 3215232, 'steps': 16745, 'loss/train': 1.2699354887008667} 11/06/2021 23:37:06 - INFO - __main__ - Step 16747: {'lr': 0.00048785252606087996, 'samples': 3215424, 'steps': 16746, 'loss/train': 1.778057336807251} 11/06/2021 23:37:06 - INFO - __main__ - Step 16748: {'lr': 0.0004878508919193555, 'samples': 3215616, 'steps': 16747, 'loss/train': 1.7373347282409668} 11/06/2021 23:37:06 - INFO - __main__ - Step 16749: {'lr': 0.000487849257670659, 'samples': 3215808, 'steps': 16748, 'loss/train': 1.591823935508728} 11/06/2021 23:37:07 - INFO - __main__ - Step 16750: {'lr': 0.0004878476233147914, 'samples': 3216000, 'steps': 16749, 'loss/train': 1.612733006477356} 11/06/2021 23:37:08 - INFO - __main__ - Step 16751: {'lr': 0.00048784598885175324, 'samples': 3216192, 'steps': 16750, 'loss/train': 1.5317606925964355} 11/06/2021 23:37:08 - INFO - __main__ - Step 16752: {'lr': 0.00048784435428154537, 'samples': 3216384, 'steps': 16751, 'loss/train': 1.491845965385437} 11/06/2021 23:37:08 - INFO - __main__ - Step 16753: {'lr': 0.0004878427196041686, 'samples': 3216576, 'steps': 16752, 'loss/train': 1.9473744630813599} 11/06/2021 23:37:09 - INFO - __main__ - Step 16754: {'lr': 0.00048784108481962347, 'samples': 3216768, 'steps': 16753, 'loss/train': 1.4702725410461426} 11/06/2021 23:37:09 - INFO - __main__ - Step 16755: {'lr': 0.00048783944992791085, 'samples': 3216960, 'steps': 16754, 'loss/train': 1.2084376811981201} 11/06/2021 23:37:10 - INFO - __main__ - Step 16756: {'lr': 0.00048783781492903145, 'samples': 3217152, 'steps': 16755, 'loss/train': 1.8387744426727295} 11/06/2021 23:37:11 - INFO - __main__ - Step 16757: {'lr': 0.00048783617982298594, 'samples': 3217344, 'steps': 16756, 'loss/train': 1.6829814910888672} 11/06/2021 23:37:11 - INFO - __main__ - Step 16758: {'lr': 0.00048783454460977517, 'samples': 3217536, 'steps': 16757, 'loss/train': 1.8421169519424438} 11/06/2021 23:37:11 - INFO - __main__ - Step 16759: {'lr': 0.00048783290928939985, 'samples': 3217728, 'steps': 16758, 'loss/train': 1.8196167945861816} 11/06/2021 23:37:12 - INFO - __main__ - Step 16760: {'lr': 0.00048783127386186064, 'samples': 3217920, 'steps': 16759, 'loss/train': 2.0818989276885986} 11/06/2021 23:37:12 - INFO - __main__ - Step 16761: {'lr': 0.00048782963832715834, 'samples': 3218112, 'steps': 16760, 'loss/train': 1.8237532377243042} 11/06/2021 23:37:13 - INFO - __main__ - Step 16762: {'lr': 0.0004878280026852937, 'samples': 3218304, 'steps': 16761, 'loss/train': 1.4099204540252686} 11/06/2021 23:37:14 - INFO - __main__ - Step 16763: {'lr': 0.00048782636693626736, 'samples': 3218496, 'steps': 16762, 'loss/train': 1.4822304248809814} 11/06/2021 23:37:14 - INFO - __main__ - Step 16764: {'lr': 0.0004878247310800802, 'samples': 3218688, 'steps': 16763, 'loss/train': 2.277588367462158} 11/06/2021 23:37:14 - INFO - __main__ - Step 16765: {'lr': 0.0004878230951167328, 'samples': 3218880, 'steps': 16764, 'loss/train': 1.6301047801971436} 11/06/2021 23:37:15 - INFO - __main__ - Step 16766: {'lr': 0.0004878214590462261, 'samples': 3219072, 'steps': 16765, 'loss/train': 1.8547323942184448} 11/06/2021 23:37:15 - INFO - __main__ - Step 16767: {'lr': 0.0004878198228685607, 'samples': 3219264, 'steps': 16766, 'loss/train': 0.8852767944335938} 11/06/2021 23:37:16 - INFO - __main__ - Step 16768: {'lr': 0.00048781818658373734, 'samples': 3219456, 'steps': 16767, 'loss/train': 1.3451430797576904} 11/06/2021 23:37:16 - INFO - __main__ - Step 16769: {'lr': 0.00048781655019175676, 'samples': 3219648, 'steps': 16768, 'loss/train': 2.1409761905670166} 11/06/2021 23:37:17 - INFO - __main__ - Step 16770: {'lr': 0.00048781491369261965, 'samples': 3219840, 'steps': 16769, 'loss/train': 1.503929615020752} 11/06/2021 23:37:17 - INFO - __main__ - Step 16771: {'lr': 0.00048781327708632695, 'samples': 3220032, 'steps': 16770, 'loss/train': 1.9142979383468628} 11/06/2021 23:37:17 - INFO - __main__ - Step 16772: {'lr': 0.0004878116403728792, 'samples': 3220224, 'steps': 16771, 'loss/train': 1.3144738674163818} 11/06/2021 23:37:19 - INFO - __main__ - Step 16773: {'lr': 0.0004878100035522771, 'samples': 3220416, 'steps': 16772, 'loss/train': 1.3985767364501953} 11/06/2021 23:37:19 - INFO - __main__ - Step 16774: {'lr': 0.00048780836662452154, 'samples': 3220608, 'steps': 16773, 'loss/train': 1.2412461042404175} 11/06/2021 23:37:19 - INFO - __main__ - Step 16775: {'lr': 0.00048780672958961325, 'samples': 3220800, 'steps': 16774, 'loss/train': 1.7060500383377075} 11/06/2021 23:37:20 - INFO - __main__ - Step 16776: {'lr': 0.0004878050924475529, 'samples': 3220992, 'steps': 16775, 'loss/train': 1.980871558189392} 11/06/2021 23:37:20 - INFO - __main__ - Step 16777: {'lr': 0.00048780345519834124, 'samples': 3221184, 'steps': 16776, 'loss/train': 1.8724604845046997} 11/06/2021 23:37:21 - INFO - __main__ - Step 16778: {'lr': 0.000487801817841979, 'samples': 3221376, 'steps': 16777, 'loss/train': 0.38344258069992065} 11/06/2021 23:37:21 - INFO - __main__ - Step 16779: {'lr': 0.0004878001803784669, 'samples': 3221568, 'steps': 16778, 'loss/train': 1.6661590337753296} 11/06/2021 23:37:22 - INFO - __main__ - Step 16780: {'lr': 0.00048779854280780576, 'samples': 3221760, 'steps': 16779, 'loss/train': 2.1896467208862305} 11/06/2021 23:37:22 - INFO - __main__ - Step 16781: {'lr': 0.00048779690512999627, 'samples': 3221952, 'steps': 16780, 'loss/train': 1.5965747833251953} 11/06/2021 23:37:22 - INFO - __main__ - Step 16782: {'lr': 0.0004877952673450391, 'samples': 3222144, 'steps': 16781, 'loss/train': 1.6310534477233887} 11/06/2021 23:37:23 - INFO - __main__ - Step 16783: {'lr': 0.0004877936294529351, 'samples': 3222336, 'steps': 16782, 'loss/train': 2.207812786102295} 11/06/2021 23:37:24 - INFO - __main__ - Step 16784: {'lr': 0.00048779199145368494, 'samples': 3222528, 'steps': 16783, 'loss/train': 1.5550755262374878} 11/06/2021 23:37:24 - INFO - __main__ - Step 16785: {'lr': 0.0004877903533472894, 'samples': 3222720, 'steps': 16784, 'loss/train': 1.640083909034729} 11/06/2021 23:37:24 - INFO - __main__ - Step 16786: {'lr': 0.0004877887151337492, 'samples': 3222912, 'steps': 16785, 'loss/train': 1.9522141218185425} 11/06/2021 23:37:25 - INFO - __main__ - Step 16787: {'lr': 0.0004877870768130651, 'samples': 3223104, 'steps': 16786, 'loss/train': 1.743882417678833} 11/06/2021 23:37:26 - INFO - __main__ - Step 16788: {'lr': 0.0004877854383852377, 'samples': 3223296, 'steps': 16787, 'loss/train': 1.766150712966919} 11/06/2021 23:37:26 - INFO - __main__ - Step 16789: {'lr': 0.000487783799850268, 'samples': 3223488, 'steps': 16788, 'loss/train': 2.058748960494995} 11/06/2021 23:37:27 - INFO - __main__ - Step 16790: {'lr': 0.00048778216120815644, 'samples': 3223680, 'steps': 16789, 'loss/train': 1.553648829460144} 11/06/2021 23:37:27 - INFO - __main__ - Step 16791: {'lr': 0.00048778052245890404, 'samples': 3223872, 'steps': 16790, 'loss/train': 1.9870336055755615} 11/06/2021 23:37:27 - INFO - __main__ - Step 16792: {'lr': 0.0004877788836025113, 'samples': 3224064, 'steps': 16791, 'loss/train': 1.8651750087738037} 11/06/2021 23:37:28 - INFO - __main__ - Step 16793: {'lr': 0.0004877772446389791, 'samples': 3224256, 'steps': 16792, 'loss/train': 2.0042312145233154} 11/06/2021 23:37:29 - INFO - __main__ - Step 16794: {'lr': 0.0004877756055683082, 'samples': 3224448, 'steps': 16793, 'loss/train': 1.699658989906311} 11/06/2021 23:37:29 - INFO - __main__ - Step 16795: {'lr': 0.0004877739663904992, 'samples': 3224640, 'steps': 16794, 'loss/train': 1.4867655038833618} 11/06/2021 23:37:29 - INFO - __main__ - Step 16796: {'lr': 0.00048777232710555296, 'samples': 3224832, 'steps': 16795, 'loss/train': 0.9855947494506836} 11/06/2021 23:37:30 - INFO - __main__ - Step 16797: {'lr': 0.0004877706877134702, 'samples': 3225024, 'steps': 16796, 'loss/train': 1.5652852058410645} 11/06/2021 23:37:30 - INFO - __main__ - Step 16798: {'lr': 0.0004877690482142516, 'samples': 3225216, 'steps': 16797, 'loss/train': 1.4271130561828613} 11/06/2021 23:37:31 - INFO - __main__ - Step 16799: {'lr': 0.0004877674086078979, 'samples': 3225408, 'steps': 16798, 'loss/train': 1.6108903884887695} 11/06/2021 23:37:32 - INFO - __main__ - Step 16800: {'lr': 0.0004877657688944099, 'samples': 3225600, 'steps': 16799, 'loss/train': 1.6870945692062378} 11/06/2021 23:37:32 - INFO - __main__ - Step 16801: {'lr': 0.0004877641290737884, 'samples': 3225792, 'steps': 16800, 'loss/train': 1.2492268085479736} 11/06/2021 23:37:32 - INFO - __main__ - Step 16802: {'lr': 0.000487762489146034, 'samples': 3225984, 'steps': 16801, 'loss/train': 1.245894193649292} 11/06/2021 23:37:33 - INFO - __main__ - Step 16803: {'lr': 0.0004877608491111475, 'samples': 3226176, 'steps': 16802, 'loss/train': 1.588663101196289} 11/06/2021 23:37:34 - INFO - __main__ - Step 16804: {'lr': 0.0004877592089691296, 'samples': 3226368, 'steps': 16803, 'loss/train': 1.9212385416030884} 11/06/2021 23:37:34 - INFO - __main__ - Step 16805: {'lr': 0.00048775756871998106, 'samples': 3226560, 'steps': 16804, 'loss/train': 1.954315185546875} 11/06/2021 23:37:34 - INFO - __main__ - Step 16806: {'lr': 0.0004877559283637026, 'samples': 3226752, 'steps': 16805, 'loss/train': 1.4202922582626343} 11/06/2021 23:37:35 - INFO - __main__ - Step 16807: {'lr': 0.0004877542879002951, 'samples': 3226944, 'steps': 16806, 'loss/train': 1.4899168014526367} 11/06/2021 23:37:35 - INFO - __main__ - Step 16808: {'lr': 0.0004877526473297591, 'samples': 3227136, 'steps': 16807, 'loss/train': 1.6989758014678955} 11/06/2021 23:37:36 - INFO - __main__ - Step 16809: {'lr': 0.0004877510066520954, 'samples': 3227328, 'steps': 16808, 'loss/train': 1.953574299812317} 11/06/2021 23:37:37 - INFO - __main__ - Step 16810: {'lr': 0.0004877493658673048, 'samples': 3227520, 'steps': 16809, 'loss/train': 1.6780657768249512} 11/06/2021 23:37:37 - INFO - __main__ - Step 16811: {'lr': 0.00048774772497538806, 'samples': 3227712, 'steps': 16810, 'loss/train': 2.09407114982605} 11/06/2021 23:37:37 - INFO - __main__ - Step 16812: {'lr': 0.0004877460839763458, 'samples': 3227904, 'steps': 16811, 'loss/train': 1.4879274368286133} 11/06/2021 23:37:38 - INFO - __main__ - Step 16813: {'lr': 0.0004877444428701788, 'samples': 3228096, 'steps': 16812, 'loss/train': 1.0444165468215942} 11/06/2021 23:37:38 - INFO - __main__ - Step 16814: {'lr': 0.0004877428016568879, 'samples': 3228288, 'steps': 16813, 'loss/train': 1.6901205778121948} 11/06/2021 23:37:38 - INFO - __main__ - Step 16815: {'lr': 0.00048774116033647373, 'samples': 3228480, 'steps': 16814, 'loss/train': 1.6762943267822266} 11/06/2021 23:37:40 - INFO - __main__ - Step 16816: {'lr': 0.0004877395189089371, 'samples': 3228672, 'steps': 16815, 'loss/train': 2.3396244049072266} 11/06/2021 23:37:40 - INFO - __main__ - Step 16817: {'lr': 0.00048773787737427867, 'samples': 3228864, 'steps': 16816, 'loss/train': 1.7871525287628174} 11/06/2021 23:37:40 - INFO - __main__ - Step 16818: {'lr': 0.0004877362357324992, 'samples': 3229056, 'steps': 16817, 'loss/train': 1.87598717212677} 11/06/2021 23:37:41 - INFO - __main__ - Step 16819: {'lr': 0.0004877345939835995, 'samples': 3229248, 'steps': 16818, 'loss/train': 1.668867826461792} 11/06/2021 23:37:41 - INFO - __main__ - Step 16820: {'lr': 0.0004877329521275802, 'samples': 3229440, 'steps': 16819, 'loss/train': 1.777286171913147} 11/06/2021 23:37:41 - INFO - __main__ - Step 16821: {'lr': 0.0004877313101644422, 'samples': 3229632, 'steps': 16820, 'loss/train': 2.054732084274292} 11/06/2021 23:37:43 - INFO - __main__ - Step 16822: {'lr': 0.000487729668094186, 'samples': 3229824, 'steps': 16821, 'loss/train': 1.3124911785125732} 11/06/2021 23:37:43 - INFO - __main__ - Step 16823: {'lr': 0.0004877280259168125, 'samples': 3230016, 'steps': 16822, 'loss/train': 1.8719760179519653} 11/06/2021 23:37:43 - INFO - __main__ - Step 16824: {'lr': 0.0004877263836323226, 'samples': 3230208, 'steps': 16823, 'loss/train': 1.8889883756637573} 11/06/2021 23:37:44 - INFO - __main__ - Step 16825: {'lr': 0.00048772474124071663, 'samples': 3230400, 'steps': 16824, 'loss/train': 1.6828360557556152} 11/06/2021 23:37:44 - INFO - __main__ - Step 16826: {'lr': 0.0004877230987419957, 'samples': 3230592, 'steps': 16825, 'loss/train': 1.12043035030365} 11/06/2021 23:37:45 - INFO - __main__ - Step 16827: {'lr': 0.00048772145613616035, 'samples': 3230784, 'steps': 16826, 'loss/train': 1.7970967292785645} 11/06/2021 23:37:45 - INFO - __main__ - Step 16828: {'lr': 0.00048771981342321145, 'samples': 3230976, 'steps': 16827, 'loss/train': 1.1399004459381104} 11/06/2021 23:37:46 - INFO - __main__ - Step 16829: {'lr': 0.0004877181706031496, 'samples': 3231168, 'steps': 16828, 'loss/train': 1.6806825399398804} 11/06/2021 23:37:46 - INFO - __main__ - Step 16830: {'lr': 0.00048771652767597563, 'samples': 3231360, 'steps': 16829, 'loss/train': 1.6623328924179077} 11/06/2021 23:37:47 - INFO - __main__ - Step 16831: {'lr': 0.0004877148846416903, 'samples': 3231552, 'steps': 16830, 'loss/train': 1.4326417446136475} 11/06/2021 23:37:48 - INFO - __main__ - Step 16832: {'lr': 0.0004877132415002943, 'samples': 3231744, 'steps': 16831, 'loss/train': 1.1790060997009277} 11/06/2021 23:37:48 - INFO - __main__ - Step 16833: {'lr': 0.00048771159825178827, 'samples': 3231936, 'steps': 16832, 'loss/train': 2.1430394649505615} 11/06/2021 23:37:48 - INFO - __main__ - Step 16834: {'lr': 0.0004877099548961732, 'samples': 3232128, 'steps': 16833, 'loss/train': 1.4152675867080688} 11/06/2021 23:37:49 - INFO - __main__ - Step 16835: {'lr': 0.0004877083114334496, 'samples': 3232320, 'steps': 16834, 'loss/train': 1.6590317487716675} 11/06/2021 23:37:49 - INFO - __main__ - Step 16836: {'lr': 0.0004877066678636184, 'samples': 3232512, 'steps': 16835, 'loss/train': 1.7234406471252441} 11/06/2021 23:37:50 - INFO - __main__ - Step 16837: {'lr': 0.00048770502418668017, 'samples': 3232704, 'steps': 16836, 'loss/train': 1.8991540670394897} 11/06/2021 23:37:50 - INFO - __main__ - Step 16838: {'lr': 0.00048770338040263574, 'samples': 3232896, 'steps': 16837, 'loss/train': 2.06113338470459} 11/06/2021 23:37:51 - INFO - __main__ - Step 16839: {'lr': 0.00048770173651148586, 'samples': 3233088, 'steps': 16838, 'loss/train': 1.2442201375961304} 11/06/2021 23:37:51 - INFO - __main__ - Step 16840: {'lr': 0.0004877000925132312, 'samples': 3233280, 'steps': 16839, 'loss/train': 2.102421760559082} 11/06/2021 23:37:51 - INFO - __main__ - Step 16841: {'lr': 0.0004876984484078726, 'samples': 3233472, 'steps': 16840, 'loss/train': 1.9662206172943115} 11/06/2021 23:37:52 - INFO - __main__ - Step 16842: {'lr': 0.0004876968041954107, 'samples': 3233664, 'steps': 16841, 'loss/train': 2.2481026649475098} 11/06/2021 23:37:53 - INFO - __main__ - Step 16843: {'lr': 0.00048769515987584624, 'samples': 3233856, 'steps': 16842, 'loss/train': 1.3896993398666382} 11/06/2021 23:37:53 - INFO - __main__ - Step 16844: {'lr': 0.0004876935154491801, 'samples': 3234048, 'steps': 16843, 'loss/train': 1.4437320232391357} 11/06/2021 23:37:54 - INFO - __main__ - Step 16845: {'lr': 0.00048769187091541287, 'samples': 3234240, 'steps': 16844, 'loss/train': 1.5791386365890503} 11/06/2021 23:37:54 - INFO - __main__ - Step 16846: {'lr': 0.0004876902262745454, 'samples': 3234432, 'steps': 16845, 'loss/train': 2.008009672164917} 11/06/2021 23:37:54 - INFO - __main__ - Step 16847: {'lr': 0.00048768858152657837, 'samples': 3234624, 'steps': 16846, 'loss/train': 1.5482451915740967} 11/06/2021 23:37:55 - INFO - __main__ - Step 16848: {'lr': 0.0004876869366715125, 'samples': 3234816, 'steps': 16847, 'loss/train': 1.9071651697158813} 11/06/2021 23:37:56 - INFO - __main__ - Step 16849: {'lr': 0.0004876852917093486, 'samples': 3235008, 'steps': 16848, 'loss/train': 1.9252054691314697} 11/06/2021 23:37:56 - INFO - __main__ - Step 16850: {'lr': 0.0004876836466400874, 'samples': 3235200, 'steps': 16849, 'loss/train': 1.199202537536621} 11/06/2021 23:37:56 - INFO - __main__ - Step 16851: {'lr': 0.00048768200146372955, 'samples': 3235392, 'steps': 16850, 'loss/train': 1.9420405626296997} 11/06/2021 23:37:57 - INFO - __main__ - Step 16852: {'lr': 0.00048768035618027597, 'samples': 3235584, 'steps': 16851, 'loss/train': 1.7523447275161743} 11/06/2021 23:37:58 - INFO - __main__ - Step 16853: {'lr': 0.00048767871078972717, 'samples': 3235776, 'steps': 16852, 'loss/train': 1.6883947849273682} 11/06/2021 23:37:58 - INFO - __main__ - Step 16854: {'lr': 0.000487677065292084, 'samples': 3235968, 'steps': 16853, 'loss/train': 1.7545671463012695} 11/06/2021 23:37:58 - INFO - __main__ - Step 16855: {'lr': 0.0004876754196873473, 'samples': 3236160, 'steps': 16854, 'loss/train': 1.6839386224746704} 11/06/2021 23:37:59 - INFO - __main__ - Step 16856: {'lr': 0.00048767377397551773, 'samples': 3236352, 'steps': 16855, 'loss/train': 2.0163164138793945} 11/06/2021 23:37:59 - INFO - __main__ - Step 16857: {'lr': 0.00048767212815659593, 'samples': 3236544, 'steps': 16856, 'loss/train': 1.7172329425811768} 11/06/2021 23:38:00 - INFO - __main__ - Step 16858: {'lr': 0.0004876704822305828, 'samples': 3236736, 'steps': 16857, 'loss/train': 1.4067416191101074} 11/06/2021 23:38:00 - INFO - __main__ - Step 16859: {'lr': 0.00048766883619747906, 'samples': 3236928, 'steps': 16858, 'loss/train': 1.5666910409927368} 11/06/2021 23:38:01 - INFO - __main__ - Step 16860: {'lr': 0.00048766719005728534, 'samples': 3237120, 'steps': 16859, 'loss/train': 1.6902843713760376} 11/06/2021 23:38:01 - INFO - __main__ - Step 16861: {'lr': 0.0004876655438100024, 'samples': 3237312, 'steps': 16860, 'loss/train': 1.7778635025024414} 11/06/2021 23:38:01 - INFO - __main__ - Step 16862: {'lr': 0.00048766389745563113, 'samples': 3237504, 'steps': 16861, 'loss/train': 1.489890217781067} 11/06/2021 23:38:02 - INFO - __main__ - Step 16863: {'lr': 0.00048766225099417215, 'samples': 3237696, 'steps': 16862, 'loss/train': 1.5627225637435913} 11/06/2021 23:38:03 - INFO - __main__ - Step 16864: {'lr': 0.0004876606044256262, 'samples': 3237888, 'steps': 16863, 'loss/train': 1.9307020902633667} 11/06/2021 23:38:03 - INFO - __main__ - Step 16865: {'lr': 0.0004876589577499941, 'samples': 3238080, 'steps': 16864, 'loss/train': 2.422271251678467} 11/06/2021 23:38:04 - INFO - __main__ - Step 16866: {'lr': 0.0004876573109672765, 'samples': 3238272, 'steps': 16865, 'loss/train': 1.67001473903656} 11/06/2021 23:38:04 - INFO - __main__ - Step 16867: {'lr': 0.0004876556640774742, 'samples': 3238464, 'steps': 16866, 'loss/train': 1.8609672784805298} 11/06/2021 23:38:05 - INFO - __main__ - Step 16868: {'lr': 0.0004876540170805879, 'samples': 3238656, 'steps': 16867, 'loss/train': 1.879353642463684} 11/06/2021 23:38:05 - INFO - __main__ - Step 16869: {'lr': 0.00048765236997661845, 'samples': 3238848, 'steps': 16868, 'loss/train': 1.293416142463684} 11/06/2021 23:38:06 - INFO - __main__ - Step 16870: {'lr': 0.0004876507227655664, 'samples': 3239040, 'steps': 16869, 'loss/train': 1.5355979204177856} 11/06/2021 23:38:06 - INFO - __main__ - Step 16871: {'lr': 0.00048764907544743264, 'samples': 3239232, 'steps': 16870, 'loss/train': 2.0187036991119385} 11/06/2021 23:38:06 - INFO - __main__ - Step 16872: {'lr': 0.0004876474280222179, 'samples': 3239424, 'steps': 16871, 'loss/train': 2.1045546531677246} 11/06/2021 23:38:07 - INFO - __main__ - Step 16873: {'lr': 0.00048764578048992284, 'samples': 3239616, 'steps': 16872, 'loss/train': 1.7573167085647583} 11/06/2021 23:38:08 - INFO - __main__ - Step 16874: {'lr': 0.0004876441328505483, 'samples': 3239808, 'steps': 16873, 'loss/train': 1.5508947372436523} 11/06/2021 23:38:08 - INFO - __main__ - Step 16875: {'lr': 0.000487642485104095, 'samples': 3240000, 'steps': 16874, 'loss/train': 1.8752604722976685} 11/06/2021 23:38:08 - INFO - __main__ - Step 16876: {'lr': 0.00048764083725056365, 'samples': 3240192, 'steps': 16875, 'loss/train': 1.2444089651107788} 11/06/2021 23:38:09 - INFO - __main__ - Step 16877: {'lr': 0.00048763918928995496, 'samples': 3240384, 'steps': 16876, 'loss/train': 0.795091450214386} 11/06/2021 23:38:10 - INFO - __main__ - Step 16878: {'lr': 0.00048763754122226977, 'samples': 3240576, 'steps': 16877, 'loss/train': 1.2658640146255493} 11/06/2021 23:38:10 - INFO - __main__ - Step 16879: {'lr': 0.00048763589304750876, 'samples': 3240768, 'steps': 16878, 'loss/train': 1.3780895471572876} 11/06/2021 23:38:11 - INFO - __main__ - Step 16880: {'lr': 0.0004876342447656727, 'samples': 3240960, 'steps': 16879, 'loss/train': 1.6554217338562012} 11/06/2021 23:38:11 - INFO - __main__ - Step 16881: {'lr': 0.00048763259637676226, 'samples': 3241152, 'steps': 16880, 'loss/train': 1.099325180053711} 11/06/2021 23:38:11 - INFO - __main__ - Step 16882: {'lr': 0.00048763094788077834, 'samples': 3241344, 'steps': 16881, 'loss/train': 1.8760645389556885} 11/06/2021 23:38:12 - INFO - __main__ - Step 16883: {'lr': 0.0004876292992777215, 'samples': 3241536, 'steps': 16882, 'loss/train': 1.7464240789413452} 11/06/2021 23:38:13 - INFO - __main__ - Step 16884: {'lr': 0.00048762765056759255, 'samples': 3241728, 'steps': 16883, 'loss/train': 1.7331184148788452} 11/06/2021 23:38:13 - INFO - __main__ - Step 16885: {'lr': 0.00048762600175039227, 'samples': 3241920, 'steps': 16884, 'loss/train': 1.990671157836914} 11/06/2021 23:38:13 - INFO - __main__ - Step 16886: {'lr': 0.0004876243528261214, 'samples': 3242112, 'steps': 16885, 'loss/train': 1.2171714305877686} 11/06/2021 23:38:14 - INFO - __main__ - Step 16887: {'lr': 0.0004876227037947807, 'samples': 3242304, 'steps': 16886, 'loss/train': 1.3176175355911255} 11/06/2021 23:38:14 - INFO - __main__ - Step 16888: {'lr': 0.0004876210546563707, 'samples': 3242496, 'steps': 16887, 'loss/train': 1.400641679763794} 11/06/2021 23:38:15 - INFO - __main__ - Step 16889: {'lr': 0.0004876194054108926, 'samples': 3242688, 'steps': 16888, 'loss/train': 1.618189811706543} 11/06/2021 23:38:15 - INFO - __main__ - Step 16890: {'lr': 0.0004876177560583466, 'samples': 3242880, 'steps': 16889, 'loss/train': 1.4612959623336792} 11/06/2021 23:38:16 - INFO - __main__ - Step 16891: {'lr': 0.00048761610659873387, 'samples': 3243072, 'steps': 16890, 'loss/train': 1.4026546478271484} 11/06/2021 23:38:16 - INFO - __main__ - Step 16892: {'lr': 0.0004876144570320549, 'samples': 3243264, 'steps': 16891, 'loss/train': 1.5020729303359985} 11/06/2021 23:38:16 - INFO - __main__ - Step 16893: {'lr': 0.0004876128073583106, 'samples': 3243456, 'steps': 16892, 'loss/train': 1.9043554067611694} 11/06/2021 23:38:18 - INFO - __main__ - Step 16894: {'lr': 0.00048761115757750155, 'samples': 3243648, 'steps': 16893, 'loss/train': 1.3672549724578857} 11/06/2021 23:38:18 - INFO - __main__ - Step 16895: {'lr': 0.00048760950768962863, 'samples': 3243840, 'steps': 16894, 'loss/train': 1.6721173524856567} 11/06/2021 23:38:18 - INFO - __main__ - Step 16896: {'lr': 0.00048760785769469254, 'samples': 3244032, 'steps': 16895, 'loss/train': 1.9637107849121094} 11/06/2021 23:38:19 - INFO - __main__ - Step 16897: {'lr': 0.00048760620759269403, 'samples': 3244224, 'steps': 16896, 'loss/train': 0.48643603920936584} 11/06/2021 23:38:19 - INFO - __main__ - Step 16898: {'lr': 0.00048760455738363376, 'samples': 3244416, 'steps': 16897, 'loss/train': 2.0343399047851562} 11/06/2021 23:38:19 - INFO - __main__ - Step 16899: {'lr': 0.0004876029070675126, 'samples': 3244608, 'steps': 16898, 'loss/train': 1.2345243692398071} 11/06/2021 23:38:20 - INFO - __main__ - Step 16900: {'lr': 0.0004876012566443312, 'samples': 3244800, 'steps': 16899, 'loss/train': 2.1287524700164795} 11/06/2021 23:38:21 - INFO - __main__ - Step 16901: {'lr': 0.00048759960611409036, 'samples': 3244992, 'steps': 16900, 'loss/train': 3.2728943824768066} 11/06/2021 23:38:21 - INFO - __main__ - Step 16902: {'lr': 0.00048759795547679083, 'samples': 3245184, 'steps': 16901, 'loss/train': 1.828813910484314} 11/06/2021 23:38:21 - INFO - __main__ - Step 16903: {'lr': 0.00048759630473243327, 'samples': 3245376, 'steps': 16902, 'loss/train': 1.6020948886871338} 11/06/2021 23:38:22 - INFO - __main__ - Step 16904: {'lr': 0.00048759465388101855, 'samples': 3245568, 'steps': 16903, 'loss/train': 1.3073654174804688} 11/06/2021 23:38:23 - INFO - __main__ - Step 16905: {'lr': 0.0004875930029225473, 'samples': 3245760, 'steps': 16904, 'loss/train': 1.9156172275543213} 11/06/2021 23:38:23 - INFO - __main__ - Step 16906: {'lr': 0.0004875913518570203, 'samples': 3245952, 'steps': 16905, 'loss/train': 1.7705250978469849} 11/06/2021 23:38:24 - INFO - __main__ - Step 16907: {'lr': 0.0004875897006844383, 'samples': 3246144, 'steps': 16906, 'loss/train': 1.3052102327346802} 11/06/2021 23:38:24 - INFO - __main__ - Step 16908: {'lr': 0.00048758804940480203, 'samples': 3246336, 'steps': 16907, 'loss/train': 1.4894630908966064} 11/06/2021 23:38:24 - INFO - __main__ - Step 16909: {'lr': 0.0004875863980181123, 'samples': 3246528, 'steps': 16908, 'loss/train': 1.704583764076233} 11/06/2021 23:38:25 - INFO - __main__ - Step 16910: {'lr': 0.0004875847465243698, 'samples': 3246720, 'steps': 16909, 'loss/train': 1.5926655530929565} 11/06/2021 23:38:26 - INFO - __main__ - Step 16911: {'lr': 0.00048758309492357533, 'samples': 3246912, 'steps': 16910, 'loss/train': 1.5082074403762817} 11/06/2021 23:38:26 - INFO - __main__ - Step 16912: {'lr': 0.0004875814432157295, 'samples': 3247104, 'steps': 16911, 'loss/train': 0.24572615325450897} 11/06/2021 23:38:26 - INFO - __main__ - Step 16913: {'lr': 0.0004875797914008332, 'samples': 3247296, 'steps': 16912, 'loss/train': 1.7498927116394043} 11/06/2021 23:38:27 - INFO - __main__ - Step 16914: {'lr': 0.00048757813947888706, 'samples': 3247488, 'steps': 16913, 'loss/train': 1.81270432472229} 11/06/2021 23:38:28 - INFO - __main__ - Step 16915: {'lr': 0.0004875764874498919, 'samples': 3247680, 'steps': 16914, 'loss/train': 1.2739442586898804} 11/06/2021 23:38:28 - INFO - __main__ - Step 16916: {'lr': 0.00048757483531384837, 'samples': 3247872, 'steps': 16915, 'loss/train': 1.8057383298873901} 11/06/2021 23:38:28 - INFO - __main__ - Step 16917: {'lr': 0.0004875731830707574, 'samples': 3248064, 'steps': 16916, 'loss/train': 1.790678858757019} 11/06/2021 23:38:29 - INFO - __main__ - Step 16918: {'lr': 0.00048757153072061954, 'samples': 3248256, 'steps': 16917, 'loss/train': 1.7016808986663818} 11/06/2021 23:38:29 - INFO - __main__ - Step 16919: {'lr': 0.0004875698782634357, 'samples': 3248448, 'steps': 16918, 'loss/train': 1.927436113357544} 11/06/2021 23:38:30 - INFO - __main__ - Step 16920: {'lr': 0.00048756822569920647, 'samples': 3248640, 'steps': 16919, 'loss/train': 1.947782278060913} 11/06/2021 23:38:31 - INFO - __main__ - Step 16921: {'lr': 0.0004875665730279326, 'samples': 3248832, 'steps': 16920, 'loss/train': 1.8256161212921143} 11/06/2021 23:38:31 - INFO - __main__ - Step 16922: {'lr': 0.000487564920249615, 'samples': 3249024, 'steps': 16921, 'loss/train': 1.3792588710784912} 11/06/2021 23:38:31 - INFO - __main__ - Step 16923: {'lr': 0.00048756326736425427, 'samples': 3249216, 'steps': 16922, 'loss/train': 1.6660298109054565} 11/06/2021 23:38:32 - INFO - __main__ - Step 16924: {'lr': 0.00048756161437185126, 'samples': 3249408, 'steps': 16923, 'loss/train': 1.9878901243209839} 11/06/2021 23:38:33 - INFO - __main__ - Step 16925: {'lr': 0.0004875599612724066, 'samples': 3249600, 'steps': 16924, 'loss/train': 1.48038649559021} 11/06/2021 23:38:33 - INFO - __main__ - Step 16926: {'lr': 0.00048755830806592105, 'samples': 3249792, 'steps': 16925, 'loss/train': 1.4531468152999878} 11/06/2021 23:38:33 - INFO - __main__ - Step 16927: {'lr': 0.00048755665475239547, 'samples': 3249984, 'steps': 16926, 'loss/train': 1.3796802759170532} 11/06/2021 23:38:34 - INFO - __main__ - Step 16928: {'lr': 0.0004875550013318305, 'samples': 3250176, 'steps': 16927, 'loss/train': 1.5753812789916992} 11/06/2021 23:38:34 - INFO - __main__ - Step 16929: {'lr': 0.0004875533478042269, 'samples': 3250368, 'steps': 16928, 'loss/train': 1.530529260635376} 11/06/2021 23:38:34 - INFO - __main__ - Step 16930: {'lr': 0.00048755169416958544, 'samples': 3250560, 'steps': 16929, 'loss/train': 0.7025906443595886} 11/06/2021 23:38:35 - INFO - __main__ - Step 16931: {'lr': 0.00048755004042790685, 'samples': 3250752, 'steps': 16930, 'loss/train': 1.3983508348464966} 11/06/2021 23:38:36 - INFO - __main__ - Step 16932: {'lr': 0.00048754838657919186, 'samples': 3250944, 'steps': 16931, 'loss/train': 1.5572248697280884} 11/06/2021 23:38:36 - INFO - __main__ - Step 16933: {'lr': 0.00048754673262344124, 'samples': 3251136, 'steps': 16932, 'loss/train': 1.7678862810134888} 11/06/2021 23:38:36 - INFO - __main__ - Step 16934: {'lr': 0.00048754507856065574, 'samples': 3251328, 'steps': 16933, 'loss/train': 1.5842539072036743} 11/06/2021 23:38:37 - INFO - __main__ - Step 16935: {'lr': 0.0004875434243908361, 'samples': 3251520, 'steps': 16934, 'loss/train': 1.8356921672821045} 11/06/2021 23:38:38 - INFO - __main__ - Step 16936: {'lr': 0.00048754177011398303, 'samples': 3251712, 'steps': 16935, 'loss/train': 1.8403395414352417} 11/06/2021 23:38:38 - INFO - __main__ - Step 16937: {'lr': 0.0004875401157300973, 'samples': 3251904, 'steps': 16936, 'loss/train': 1.9707876443862915} 11/06/2021 23:38:38 - INFO - __main__ - Step 16938: {'lr': 0.00048753846123917964, 'samples': 3252096, 'steps': 16937, 'loss/train': 1.436141014099121} 11/06/2021 23:38:39 - INFO - __main__ - Step 16939: {'lr': 0.0004875368066412309, 'samples': 3252288, 'steps': 16938, 'loss/train': 1.4638780355453491} 11/06/2021 23:38:39 - INFO - __main__ - Step 16940: {'lr': 0.00048753515193625165, 'samples': 3252480, 'steps': 16939, 'loss/train': 1.765663981437683} 11/06/2021 23:38:40 - INFO - __main__ - Step 16941: {'lr': 0.00048753349712424277, 'samples': 3252672, 'steps': 16940, 'loss/train': 1.8791602849960327} 11/06/2021 23:38:40 - INFO - __main__ - Step 16942: {'lr': 0.00048753184220520497, 'samples': 3252864, 'steps': 16941, 'loss/train': 1.836533784866333} 11/06/2021 23:38:41 - INFO - __main__ - Step 16943: {'lr': 0.000487530187179139, 'samples': 3253056, 'steps': 16942, 'loss/train': 1.2585967779159546} 11/06/2021 23:38:41 - INFO - __main__ - Step 16944: {'lr': 0.00048752853204604555, 'samples': 3253248, 'steps': 16943, 'loss/train': 1.238154649734497} 11/06/2021 23:38:42 - INFO - __main__ - Step 16945: {'lr': 0.00048752687680592545, 'samples': 3253440, 'steps': 16944, 'loss/train': 1.5847913026809692} 11/06/2021 23:38:43 - INFO - __main__ - Step 16946: {'lr': 0.00048752522145877937, 'samples': 3253632, 'steps': 16945, 'loss/train': 1.8298143148422241} 11/06/2021 23:38:43 - INFO - __main__ - Step 16947: {'lr': 0.0004875235660046081, 'samples': 3253824, 'steps': 16946, 'loss/train': 1.9328862428665161} 11/06/2021 23:38:43 - INFO - __main__ - Step 16948: {'lr': 0.0004875219104434124, 'samples': 3254016, 'steps': 16947, 'loss/train': 1.6540714502334595} 11/06/2021 23:38:44 - INFO - __main__ - Step 16949: {'lr': 0.0004875202547751929, 'samples': 3254208, 'steps': 16948, 'loss/train': 1.6022636890411377} 11/06/2021 23:38:44 - INFO - __main__ - Step 16950: {'lr': 0.00048751859899995054, 'samples': 3254400, 'steps': 16949, 'loss/train': 1.9451600313186646} 11/06/2021 23:38:44 - INFO - __main__ - Step 16951: {'lr': 0.0004875169431176859, 'samples': 3254592, 'steps': 16950, 'loss/train': 1.8131561279296875} 11/06/2021 23:38:45 - INFO - __main__ - Step 16952: {'lr': 0.0004875152871283999, 'samples': 3254784, 'steps': 16951, 'loss/train': 0.9848697185516357} 11/06/2021 23:38:46 - INFO - __main__ - Step 16953: {'lr': 0.0004875136310320931, 'samples': 3254976, 'steps': 16952, 'loss/train': 1.7647054195404053} 11/06/2021 23:38:46 - INFO - __main__ - Step 16954: {'lr': 0.0004875119748287663, 'samples': 3255168, 'steps': 16953, 'loss/train': 1.5684469938278198} 11/06/2021 23:38:46 - INFO - __main__ - Step 16955: {'lr': 0.0004875103185184203, 'samples': 3255360, 'steps': 16954, 'loss/train': 1.8279472589492798} 11/06/2021 23:38:47 - INFO - __main__ - Step 16956: {'lr': 0.00048750866210105583, 'samples': 3255552, 'steps': 16955, 'loss/train': 1.563040018081665} 11/06/2021 23:38:48 - INFO - __main__ - Step 16957: {'lr': 0.0004875070055766736, 'samples': 3255744, 'steps': 16956, 'loss/train': 1.38356614112854} 11/06/2021 23:38:48 - INFO - __main__ - Step 16958: {'lr': 0.0004875053489452743, 'samples': 3255936, 'steps': 16957, 'loss/train': 1.8866616487503052} 11/06/2021 23:38:48 - INFO - __main__ - Step 16959: {'lr': 0.00048750369220685886, 'samples': 3256128, 'steps': 16958, 'loss/train': 2.1793675422668457} 11/06/2021 23:38:49 - INFO - __main__ - Step 16960: {'lr': 0.0004875020353614279, 'samples': 3256320, 'steps': 16959, 'loss/train': 1.5973029136657715} 11/06/2021 23:38:49 - INFO - __main__ - Step 16961: {'lr': 0.0004875003784089822, 'samples': 3256512, 'steps': 16960, 'loss/train': 0.8916923403739929} 11/06/2021 23:38:50 - INFO - __main__ - Step 16962: {'lr': 0.00048749872134952243, 'samples': 3256704, 'steps': 16961, 'loss/train': 1.789965033531189} 11/06/2021 23:38:51 - INFO - __main__ - Step 16963: {'lr': 0.0004874970641830495, 'samples': 3256896, 'steps': 16962, 'loss/train': 1.8704382181167603} 11/06/2021 23:38:51 - INFO - __main__ - Step 16964: {'lr': 0.000487495406909564, 'samples': 3257088, 'steps': 16963, 'loss/train': 1.7221460342407227} 11/06/2021 23:38:51 - INFO - __main__ - Step 16965: {'lr': 0.00048749374952906677, 'samples': 3257280, 'steps': 16964, 'loss/train': 1.4671553373336792} 11/06/2021 23:38:52 - INFO - __main__ - Step 16966: {'lr': 0.0004874920920415584, 'samples': 3257472, 'steps': 16965, 'loss/train': 1.615160346031189} 11/06/2021 23:38:52 - INFO - __main__ - Step 16967: {'lr': 0.0004874904344470399, 'samples': 3257664, 'steps': 16966, 'loss/train': 2.103879690170288} 11/06/2021 23:38:53 - INFO - __main__ - Step 16968: {'lr': 0.00048748877674551183, 'samples': 3257856, 'steps': 16967, 'loss/train': 1.7058079242706299} 11/06/2021 23:38:53 - INFO - __main__ - Step 16969: {'lr': 0.00048748711893697495, 'samples': 3258048, 'steps': 16968, 'loss/train': 1.2930617332458496} 11/06/2021 23:38:54 - INFO - __main__ - Step 16970: {'lr': 0.0004874854610214301, 'samples': 3258240, 'steps': 16969, 'loss/train': 1.7520792484283447} 11/06/2021 23:38:54 - INFO - __main__ - Step 16971: {'lr': 0.00048748380299887793, 'samples': 3258432, 'steps': 16970, 'loss/train': 1.4183677434921265} 11/06/2021 23:38:54 - INFO - __main__ - Step 16972: {'lr': 0.0004874821448693192, 'samples': 3258624, 'steps': 16971, 'loss/train': 1.4067054986953735} 11/06/2021 23:38:56 - INFO - __main__ - Step 16973: {'lr': 0.00048748048663275475, 'samples': 3258816, 'steps': 16972, 'loss/train': 1.6782017946243286} 11/06/2021 23:38:56 - INFO - __main__ - Step 16974: {'lr': 0.00048747882828918524, 'samples': 3259008, 'steps': 16973, 'loss/train': 1.7959372997283936} 11/06/2021 23:38:56 - INFO - __main__ - Step 16975: {'lr': 0.0004874771698386113, 'samples': 3259200, 'steps': 16974, 'loss/train': 1.3942209482192993} 11/06/2021 23:38:57 - INFO - __main__ - Step 16976: {'lr': 0.00048747551128103397, 'samples': 3259392, 'steps': 16975, 'loss/train': 1.8319814205169678} 11/06/2021 23:38:57 - INFO - __main__ - Step 16977: {'lr': 0.00048747385261645377, 'samples': 3259584, 'steps': 16976, 'loss/train': 1.7434256076812744} 11/06/2021 23:38:58 - INFO - __main__ - Step 16978: {'lr': 0.0004874721938448715, 'samples': 3259776, 'steps': 16977, 'loss/train': 1.804638147354126} 11/06/2021 23:38:58 - INFO - __main__ - Step 16979: {'lr': 0.000487470534966288, 'samples': 3259968, 'steps': 16978, 'loss/train': 1.7611198425292969} 11/06/2021 23:38:59 - INFO - __main__ - Step 16980: {'lr': 0.0004874688759807039, 'samples': 3260160, 'steps': 16979, 'loss/train': 1.878199577331543} 11/06/2021 23:38:59 - INFO - __main__ - Step 16981: {'lr': 0.00048746721688812004, 'samples': 3260352, 'steps': 16980, 'loss/train': 1.5293978452682495} 11/06/2021 23:38:59 - INFO - __main__ - Step 16982: {'lr': 0.00048746555768853703, 'samples': 3260544, 'steps': 16981, 'loss/train': 0.42321673035621643} 11/06/2021 23:39:00 - INFO - __main__ - Step 16983: {'lr': 0.00048746389838195573, 'samples': 3260736, 'steps': 16982, 'loss/train': 2.233102560043335} 11/06/2021 23:39:01 - INFO - __main__ - Step 16984: {'lr': 0.0004874622389683768, 'samples': 3260928, 'steps': 16983, 'loss/train': 1.3476516008377075} 11/06/2021 23:39:01 - INFO - __main__ - Step 16985: {'lr': 0.0004874605794478012, 'samples': 3261120, 'steps': 16984, 'loss/train': 2.141995668411255} 11/06/2021 23:39:01 - INFO - __main__ - Step 16986: {'lr': 0.0004874589198202294, 'samples': 3261312, 'steps': 16985, 'loss/train': 1.3058313131332397} 11/06/2021 23:39:02 - INFO - __main__ - Step 16987: {'lr': 0.0004874572600856624, 'samples': 3261504, 'steps': 16986, 'loss/train': 2.046851873397827} 11/06/2021 23:39:03 - INFO - __main__ - Step 16988: {'lr': 0.0004874556002441007, 'samples': 3261696, 'steps': 16987, 'loss/train': 1.474866271018982} 11/06/2021 23:39:03 - INFO - __main__ - Step 16989: {'lr': 0.0004874539402955452, 'samples': 3261888, 'steps': 16988, 'loss/train': 1.660361647605896} 11/06/2021 23:39:04 - INFO - __main__ - Step 16990: {'lr': 0.00048745228023999666, 'samples': 3262080, 'steps': 16989, 'loss/train': 1.2000677585601807} 11/06/2021 23:39:04 - INFO - __main__ - Step 16991: {'lr': 0.0004874506200774557, 'samples': 3262272, 'steps': 16990, 'loss/train': 0.9482690095901489} 11/06/2021 23:39:04 - INFO - __main__ - Step 16992: {'lr': 0.00048744895980792327, 'samples': 3262464, 'steps': 16991, 'loss/train': 1.494470477104187} 11/06/2021 23:39:05 - INFO - __main__ - Step 16993: {'lr': 0.00048744729943139993, 'samples': 3262656, 'steps': 16992, 'loss/train': 2.6627840995788574} 11/06/2021 23:39:06 - INFO - __main__ - Step 16994: {'lr': 0.0004874456389478865, 'samples': 3262848, 'steps': 16993, 'loss/train': 1.9466887712478638} 11/06/2021 23:39:06 - INFO - __main__ - Step 16995: {'lr': 0.00048744397835738377, 'samples': 3263040, 'steps': 16994, 'loss/train': 1.755508542060852} 11/06/2021 23:39:06 - INFO - __main__ - Step 16996: {'lr': 0.00048744231765989246, 'samples': 3263232, 'steps': 16995, 'loss/train': 1.8926562070846558} 11/06/2021 23:39:07 - INFO - __main__ - Step 16997: {'lr': 0.0004874406568554132, 'samples': 3263424, 'steps': 16996, 'loss/train': 2.095759868621826} 11/06/2021 23:39:07 - INFO - __main__ - Step 16998: {'lr': 0.0004874389959439469, 'samples': 3263616, 'steps': 16997, 'loss/train': 1.821245789527893} 11/06/2021 23:39:08 - INFO - __main__ - Step 16999: {'lr': 0.0004874373349254943, 'samples': 3263808, 'steps': 16998, 'loss/train': 1.4103591442108154} 11/06/2021 23:39:09 - INFO - __main__ - Step 17000: {'lr': 0.00048743567380005604, 'samples': 3264000, 'steps': 16999, 'loss/train': 1.9653171300888062} 11/06/2021 23:39:09 - INFO - __main__ - Step 17001: {'lr': 0.000487434012567633, 'samples': 3264192, 'steps': 17000, 'loss/train': 1.2199618816375732} 11/06/2021 23:39:09 - INFO - __main__ - Step 17002: {'lr': 0.0004874323512282258, 'samples': 3264384, 'steps': 17001, 'loss/train': 1.7762051820755005} 11/06/2021 23:39:10 - INFO - __main__ - Step 17003: {'lr': 0.00048743068978183523, 'samples': 3264576, 'steps': 17002, 'loss/train': 1.578262448310852} 11/06/2021 23:39:11 - INFO - __main__ - Step 17004: {'lr': 0.00048742902822846215, 'samples': 3264768, 'steps': 17003, 'loss/train': 1.8485451936721802} 11/06/2021 23:39:11 - INFO - __main__ - Step 17005: {'lr': 0.0004874273665681071, 'samples': 3264960, 'steps': 17004, 'loss/train': 1.8046890497207642} 11/06/2021 23:39:12 - INFO - __main__ - Step 17006: {'lr': 0.00048742570480077096, 'samples': 3265152, 'steps': 17005, 'loss/train': 0.393635630607605} 11/06/2021 23:39:12 - INFO - __main__ - Step 17007: {'lr': 0.0004874240429264545, 'samples': 3265344, 'steps': 17006, 'loss/train': 2.030019760131836} 11/06/2021 23:39:12 - INFO - __main__ - Step 17008: {'lr': 0.00048742238094515844, 'samples': 3265536, 'steps': 17007, 'loss/train': 1.7320748567581177} 11/06/2021 23:39:13 - INFO - __main__ - Step 17009: {'lr': 0.00048742071885688354, 'samples': 3265728, 'steps': 17008, 'loss/train': 1.4661654233932495} 11/06/2021 23:39:14 - INFO - __main__ - Step 17010: {'lr': 0.00048741905666163047, 'samples': 3265920, 'steps': 17009, 'loss/train': 1.0361342430114746} 11/06/2021 23:39:14 - INFO - __main__ - Step 17011: {'lr': 0.00048741739435940003, 'samples': 3266112, 'steps': 17010, 'loss/train': 0.8936920166015625} 11/06/2021 23:39:14 - INFO - __main__ - Step 17012: {'lr': 0.000487415731950193, 'samples': 3266304, 'steps': 17011, 'loss/train': 1.6268352270126343} 11/06/2021 23:39:15 - INFO - __main__ - Step 17013: {'lr': 0.0004874140694340101, 'samples': 3266496, 'steps': 17012, 'loss/train': 1.5285117626190186} 11/06/2021 23:39:16 - INFO - __main__ - Step 17014: {'lr': 0.0004874124068108521, 'samples': 3266688, 'steps': 17013, 'loss/train': 1.6101415157318115} 11/06/2021 23:39:16 - INFO - __main__ - Step 17015: {'lr': 0.00048741074408071975, 'samples': 3266880, 'steps': 17014, 'loss/train': 1.7432715892791748} 11/06/2021 23:39:17 - INFO - __main__ - Step 17016: {'lr': 0.00048740908124361373, 'samples': 3267072, 'steps': 17015, 'loss/train': 1.9835989475250244} 11/06/2021 23:39:17 - INFO - __main__ - Step 17017: {'lr': 0.0004874074182995349, 'samples': 3267264, 'steps': 17016, 'loss/train': 1.999614953994751} 11/06/2021 23:39:17 - INFO - __main__ - Step 17018: {'lr': 0.0004874057552484839, 'samples': 3267456, 'steps': 17017, 'loss/train': 1.9977906942367554} 11/06/2021 23:39:18 - INFO - __main__ - Step 17019: {'lr': 0.00048740409209046154, 'samples': 3267648, 'steps': 17018, 'loss/train': 1.9863004684448242} 11/06/2021 23:39:19 - INFO - __main__ - Step 17020: {'lr': 0.0004874024288254686, 'samples': 3267840, 'steps': 17019, 'loss/train': 1.82778799533844} 11/06/2021 23:39:19 - INFO - __main__ - Step 17021: {'lr': 0.00048740076545350573, 'samples': 3268032, 'steps': 17020, 'loss/train': 3.3035099506378174} 11/06/2021 23:39:19 - INFO - __main__ - Step 17022: {'lr': 0.00048739910197457376, 'samples': 3268224, 'steps': 17021, 'loss/train': 1.5491119623184204} 11/06/2021 23:39:20 - INFO - __main__ - Step 17023: {'lr': 0.00048739743838867344, 'samples': 3268416, 'steps': 17022, 'loss/train': 1.543113112449646} 11/06/2021 23:39:20 - INFO - __main__ - Step 17024: {'lr': 0.00048739577469580545, 'samples': 3268608, 'steps': 17023, 'loss/train': 1.7079392671585083} 11/06/2021 23:39:21 - INFO - __main__ - Step 17025: {'lr': 0.0004873941108959706, 'samples': 3268800, 'steps': 17024, 'loss/train': 1.5123034715652466} 11/06/2021 23:39:21 - INFO - __main__ - Step 17026: {'lr': 0.0004873924469891697, 'samples': 3268992, 'steps': 17025, 'loss/train': 2.073899507522583} 11/06/2021 23:39:22 - INFO - __main__ - Step 17027: {'lr': 0.00048739078297540335, 'samples': 3269184, 'steps': 17026, 'loss/train': 1.565974235534668} 11/06/2021 23:39:22 - INFO - __main__ - Step 17028: {'lr': 0.00048738911885467243, 'samples': 3269376, 'steps': 17027, 'loss/train': 1.5371849536895752} 11/06/2021 23:39:22 - INFO - __main__ - Step 17029: {'lr': 0.00048738745462697754, 'samples': 3269568, 'steps': 17028, 'loss/train': 1.8682217597961426} 11/06/2021 23:39:24 - INFO - __main__ - Step 17030: {'lr': 0.0004873857902923196, 'samples': 3269760, 'steps': 17029, 'loss/train': 1.9690535068511963} 11/06/2021 23:39:24 - INFO - __main__ - Step 17031: {'lr': 0.00048738412585069927, 'samples': 3269952, 'steps': 17030, 'loss/train': 0.5062768459320068} 11/06/2021 23:39:24 - INFO - __main__ - Step 17032: {'lr': 0.00048738246130211734, 'samples': 3270144, 'steps': 17031, 'loss/train': 1.5573618412017822} 11/06/2021 23:39:25 - INFO - __main__ - Step 17033: {'lr': 0.00048738079664657454, 'samples': 3270336, 'steps': 17032, 'loss/train': 1.1958006620407104} 11/06/2021 23:39:25 - INFO - __main__ - Step 17034: {'lr': 0.00048737913188407156, 'samples': 3270528, 'steps': 17033, 'loss/train': 1.5426795482635498} 11/06/2021 23:39:26 - INFO - __main__ - Step 17035: {'lr': 0.00048737746701460927, 'samples': 3270720, 'steps': 17034, 'loss/train': 1.8075716495513916} 11/06/2021 23:39:26 - INFO - __main__ - Step 17036: {'lr': 0.0004873758020381883, 'samples': 3270912, 'steps': 17035, 'loss/train': 1.7565505504608154} 11/06/2021 23:39:27 - INFO - __main__ - Step 17037: {'lr': 0.00048737413695480947, 'samples': 3271104, 'steps': 17036, 'loss/train': 2.1112308502197266} 11/06/2021 23:39:27 - INFO - __main__ - Step 17038: {'lr': 0.00048737247176447354, 'samples': 3271296, 'steps': 17037, 'loss/train': 1.8859508037567139} 11/06/2021 23:39:27 - INFO - __main__ - Step 17039: {'lr': 0.0004873708064671812, 'samples': 3271488, 'steps': 17038, 'loss/train': 1.6623574495315552} 11/06/2021 23:39:28 - INFO - __main__ - Step 17040: {'lr': 0.0004873691410629333, 'samples': 3271680, 'steps': 17039, 'loss/train': 1.8081897497177124} 11/06/2021 23:39:29 - INFO - __main__ - Step 17041: {'lr': 0.0004873674755517304, 'samples': 3271872, 'steps': 17040, 'loss/train': 1.7349417209625244} 11/06/2021 23:39:29 - INFO - __main__ - Step 17042: {'lr': 0.00048736580993357357, 'samples': 3272064, 'steps': 17041, 'loss/train': 1.7138926982879639} 11/06/2021 23:39:29 - INFO - __main__ - Step 17043: {'lr': 0.0004873641442084632, 'samples': 3272256, 'steps': 17042, 'loss/train': 1.552172303199768} 11/06/2021 23:39:30 - INFO - __main__ - Step 17044: {'lr': 0.00048736247837640037, 'samples': 3272448, 'steps': 17043, 'loss/train': 1.3372807502746582} 11/06/2021 23:39:30 - INFO - __main__ - Step 17045: {'lr': 0.0004873608124373855, 'samples': 3272640, 'steps': 17044, 'loss/train': 1.974311113357544} 11/06/2021 23:39:31 - INFO - __main__ - Step 17046: {'lr': 0.00048735914639141964, 'samples': 3272832, 'steps': 17045, 'loss/train': 1.299238920211792} 11/06/2021 23:39:32 - INFO - __main__ - Step 17047: {'lr': 0.00048735748023850337, 'samples': 3273024, 'steps': 17046, 'loss/train': 1.769322156906128} 11/06/2021 23:39:32 - INFO - __main__ - Step 17048: {'lr': 0.00048735581397863745, 'samples': 3273216, 'steps': 17047, 'loss/train': 1.0597976446151733} 11/06/2021 23:39:32 - INFO - __main__ - Step 17049: {'lr': 0.0004873541476118227, 'samples': 3273408, 'steps': 17048, 'loss/train': 5.835102081298828} 11/06/2021 23:39:33 - INFO - __main__ - Step 17050: {'lr': 0.00048735248113805976, 'samples': 3273600, 'steps': 17049, 'loss/train': 1.1489970684051514} 11/06/2021 23:39:34 - INFO - __main__ - Step 17051: {'lr': 0.0004873508145573495, 'samples': 3273792, 'steps': 17050, 'loss/train': 1.3549208641052246} 11/06/2021 23:39:34 - INFO - __main__ - Step 17052: {'lr': 0.00048734914786969266, 'samples': 3273984, 'steps': 17051, 'loss/train': 1.75826895236969} 11/06/2021 23:39:34 - INFO - __main__ - Step 17053: {'lr': 0.00048734748107509, 'samples': 3274176, 'steps': 17052, 'loss/train': 1.9871326684951782} 11/06/2021 23:39:35 - INFO - __main__ - Step 17054: {'lr': 0.0004873458141735421, 'samples': 3274368, 'steps': 17053, 'loss/train': 1.1350014209747314} 11/06/2021 23:39:35 - INFO - __main__ - Step 17055: {'lr': 0.0004873441471650499, 'samples': 3274560, 'steps': 17054, 'loss/train': 1.7989593744277954} 11/06/2021 23:39:35 - INFO - __main__ - Step 17056: {'lr': 0.00048734248004961414, 'samples': 3274752, 'steps': 17055, 'loss/train': 1.6534098386764526} 11/06/2021 23:39:37 - INFO - __main__ - Step 17057: {'lr': 0.00048734081282723543, 'samples': 3274944, 'steps': 17056, 'loss/train': 1.4881923198699951} 11/06/2021 23:39:37 - INFO - __main__ - Step 17058: {'lr': 0.00048733914549791465, 'samples': 3275136, 'steps': 17057, 'loss/train': 1.5087233781814575} 11/06/2021 23:39:37 - INFO - __main__ - Step 17059: {'lr': 0.0004873374780616525, 'samples': 3275328, 'steps': 17058, 'loss/train': 1.679807186126709} 11/06/2021 23:39:38 - INFO - __main__ - Step 17060: {'lr': 0.00048733581051844976, 'samples': 3275520, 'steps': 17059, 'loss/train': 0.27972447872161865} 11/06/2021 23:39:38 - INFO - __main__ - Step 17061: {'lr': 0.00048733414286830716, 'samples': 3275712, 'steps': 17060, 'loss/train': 1.3394430875778198} 11/06/2021 23:39:39 - INFO - __main__ - Step 17062: {'lr': 0.00048733247511122547, 'samples': 3275904, 'steps': 17061, 'loss/train': 1.8187850713729858} 11/06/2021 23:39:40 - INFO - __main__ - Step 17063: {'lr': 0.00048733080724720545, 'samples': 3276096, 'steps': 17062, 'loss/train': 1.4330551624298096} 11/06/2021 23:39:40 - INFO - __main__ - Step 17064: {'lr': 0.00048732913927624776, 'samples': 3276288, 'steps': 17063, 'loss/train': 1.9322031736373901} 11/06/2021 23:39:40 - INFO - __main__ - Step 17065: {'lr': 0.0004873274711983533, 'samples': 3276480, 'steps': 17064, 'loss/train': 1.0093766450881958} 11/06/2021 23:39:41 - INFO - __main__ - Step 17066: {'lr': 0.0004873258030135227, 'samples': 3276672, 'steps': 17065, 'loss/train': 1.3425406217575073} 11/06/2021 23:39:42 - INFO - __main__ - Step 17067: {'lr': 0.0004873241347217567, 'samples': 3276864, 'steps': 17066, 'loss/train': 2.148686647415161} 11/06/2021 23:39:42 - INFO - __main__ - Step 17068: {'lr': 0.0004873224663230562, 'samples': 3277056, 'steps': 17067, 'loss/train': 1.3120168447494507} 11/06/2021 23:39:42 - INFO - __main__ - Step 17069: {'lr': 0.0004873207978174219, 'samples': 3277248, 'steps': 17068, 'loss/train': 1.5943224430084229} 11/06/2021 23:39:43 - INFO - __main__ - Step 17070: {'lr': 0.00048731912920485444, 'samples': 3277440, 'steps': 17069, 'loss/train': 0.7484023571014404} 11/06/2021 23:39:43 - INFO - __main__ - Step 17071: {'lr': 0.0004873174604853546, 'samples': 3277632, 'steps': 17070, 'loss/train': 1.536741018295288} 11/06/2021 23:39:44 - INFO - __main__ - Step 17072: {'lr': 0.00048731579165892325, 'samples': 3277824, 'steps': 17071, 'loss/train': 1.413906455039978} 11/06/2021 23:39:45 - INFO - __main__ - Step 17073: {'lr': 0.000487314122725561, 'samples': 3278016, 'steps': 17072, 'loss/train': 1.606602668762207} 11/06/2021 23:39:45 - INFO - __main__ - Step 17074: {'lr': 0.00048731245368526877, 'samples': 3278208, 'steps': 17073, 'loss/train': 1.9211560487747192} 11/06/2021 23:39:45 - INFO - __main__ - Step 17075: {'lr': 0.0004873107845380471, 'samples': 3278400, 'steps': 17074, 'loss/train': 1.820310115814209} 11/06/2021 23:39:46 - INFO - __main__ - Step 17076: {'lr': 0.00048730911528389686, 'samples': 3278592, 'steps': 17075, 'loss/train': 1.947972059249878} 11/06/2021 23:39:47 - INFO - __main__ - Step 17077: {'lr': 0.0004873074459228188, 'samples': 3278784, 'steps': 17076, 'loss/train': 1.7204302549362183} 11/06/2021 23:39:47 - INFO - __main__ - Step 17078: {'lr': 0.0004873057764548138, 'samples': 3278976, 'steps': 17077, 'loss/train': 1.879388689994812} 11/06/2021 23:39:48 - INFO - __main__ - Step 17079: {'lr': 0.00048730410687988237, 'samples': 3279168, 'steps': 17078, 'loss/train': 1.2664668560028076} 11/06/2021 23:39:48 - INFO - __main__ - Step 17080: {'lr': 0.00048730243719802535, 'samples': 3279360, 'steps': 17079, 'loss/train': 1.9700970649719238} 11/06/2021 23:39:48 - INFO - __main__ - Step 17081: {'lr': 0.00048730076740924355, 'samples': 3279552, 'steps': 17080, 'loss/train': 1.0473175048828125} 11/06/2021 23:39:49 - INFO - __main__ - Step 17082: {'lr': 0.0004872990975135377, 'samples': 3279744, 'steps': 17081, 'loss/train': 1.3515214920043945} 11/06/2021 23:39:49 - INFO - __main__ - Step 17083: {'lr': 0.0004872974275109085, 'samples': 3279936, 'steps': 17082, 'loss/train': 1.463983178138733} 11/06/2021 23:39:50 - INFO - __main__ - Step 17084: {'lr': 0.00048729575740135675, 'samples': 3280128, 'steps': 17083, 'loss/train': 1.5484230518341064} 11/06/2021 23:39:51 - INFO - __main__ - Step 17085: {'lr': 0.0004872940871848832, 'samples': 3280320, 'steps': 17084, 'loss/train': 2.2791638374328613} 11/06/2021 23:39:51 - INFO - __main__ - Step 17086: {'lr': 0.00048729241686148864, 'samples': 3280512, 'steps': 17085, 'loss/train': 1.8296653032302856} 11/06/2021 23:39:51 - INFO - __main__ - Step 17087: {'lr': 0.0004872907464311737, 'samples': 3280704, 'steps': 17086, 'loss/train': 1.4752380847930908} 11/06/2021 23:39:52 - INFO - __main__ - Step 17088: {'lr': 0.0004872890758939392, 'samples': 3280896, 'steps': 17087, 'loss/train': 1.206477165222168} 11/06/2021 23:39:53 - INFO - __main__ - Step 17089: {'lr': 0.00048728740524978597, 'samples': 3281088, 'steps': 17088, 'loss/train': 1.7436742782592773} 11/06/2021 23:39:53 - INFO - __main__ - Step 17090: {'lr': 0.00048728573449871473, 'samples': 3281280, 'steps': 17089, 'loss/train': 1.3805345296859741} 11/06/2021 23:39:53 - INFO - __main__ - Step 17091: {'lr': 0.0004872840636407261, 'samples': 3281472, 'steps': 17090, 'loss/train': 1.2210899591445923} 11/06/2021 23:39:54 - INFO - __main__ - Step 17092: {'lr': 0.00048728239267582096, 'samples': 3281664, 'steps': 17091, 'loss/train': 1.1213079690933228} 11/06/2021 23:39:54 - INFO - __main__ - Step 17093: {'lr': 0.00048728072160400006, 'samples': 3281856, 'steps': 17092, 'loss/train': 1.8094351291656494} 11/06/2021 23:39:55 - INFO - __main__ - Step 17094: {'lr': 0.0004872790504252641, 'samples': 3282048, 'steps': 17093, 'loss/train': 2.475142240524292} 11/06/2021 23:39:55 - INFO - __main__ - Step 17095: {'lr': 0.0004872773791396139, 'samples': 3282240, 'steps': 17094, 'loss/train': 1.4693809747695923} 11/06/2021 23:39:56 - INFO - __main__ - Step 17096: {'lr': 0.0004872757077470502, 'samples': 3282432, 'steps': 17095, 'loss/train': 2.0089285373687744} 11/06/2021 23:39:56 - INFO - __main__ - Step 17097: {'lr': 0.0004872740362475737, 'samples': 3282624, 'steps': 17096, 'loss/train': 1.8510346412658691} 11/06/2021 23:39:57 - INFO - __main__ - Step 17098: {'lr': 0.0004872723646411851, 'samples': 3282816, 'steps': 17097, 'loss/train': 1.2082724571228027} 11/06/2021 23:39:58 - INFO - __main__ - Step 17099: {'lr': 0.0004872706929278853, 'samples': 3283008, 'steps': 17098, 'loss/train': 1.7203577756881714} 11/06/2021 23:39:58 - INFO - __main__ - Step 17100: {'lr': 0.000487269021107675, 'samples': 3283200, 'steps': 17099, 'loss/train': 1.7792285680770874} 11/06/2021 23:39:59 - INFO - __main__ - Step 17101: {'lr': 0.0004872673491805549, 'samples': 3283392, 'steps': 17100, 'loss/train': 1.7158050537109375} 11/06/2021 23:39:59 - INFO - __main__ - Step 17102: {'lr': 0.0004872656771465259, 'samples': 3283584, 'steps': 17101, 'loss/train': 1.6568636894226074} 11/06/2021 23:39:59 - INFO - __main__ - Step 17103: {'lr': 0.00048726400500558856, 'samples': 3283776, 'steps': 17102, 'loss/train': 1.9495559930801392} 11/06/2021 23:40:00 - INFO - __main__ - Step 17104: {'lr': 0.0004872623327577437, 'samples': 3283968, 'steps': 17103, 'loss/train': 1.5772558450698853} 11/06/2021 23:40:01 - INFO - __main__ - Step 17105: {'lr': 0.0004872606604029921, 'samples': 3284160, 'steps': 17104, 'loss/train': 0.8391998410224915} 11/06/2021 23:40:01 - INFO - __main__ - Step 17106: {'lr': 0.00048725898794133455, 'samples': 3284352, 'steps': 17105, 'loss/train': 1.9225443601608276} 11/06/2021 23:40:01 - INFO - __main__ - Step 17107: {'lr': 0.00048725731537277173, 'samples': 3284544, 'steps': 17106, 'loss/train': 1.894831657409668} 11/06/2021 23:40:02 - INFO - __main__ - Step 17108: {'lr': 0.0004872556426973044, 'samples': 3284736, 'steps': 17107, 'loss/train': 1.5045108795166016} 11/06/2021 23:40:02 - INFO - __main__ - Step 17109: {'lr': 0.0004872539699149334, 'samples': 3284928, 'steps': 17108, 'loss/train': 1.9141845703125} 11/06/2021 23:40:03 - INFO - __main__ - Step 17110: {'lr': 0.0004872522970256594, 'samples': 3285120, 'steps': 17109, 'loss/train': 1.4091365337371826} 11/06/2021 23:40:03 - INFO - __main__ - Step 17111: {'lr': 0.00048725062402948314, 'samples': 3285312, 'steps': 17110, 'loss/train': 1.5130109786987305} 11/06/2021 23:40:04 - INFO - __main__ - Step 17112: {'lr': 0.00048724895092640546, 'samples': 3285504, 'steps': 17111, 'loss/train': 1.1645400524139404} 11/06/2021 23:40:04 - INFO - __main__ - Step 17113: {'lr': 0.00048724727771642706, 'samples': 3285696, 'steps': 17112, 'loss/train': 1.6924716234207153} 11/06/2021 23:40:05 - INFO - __main__ - Step 17114: {'lr': 0.00048724560439954867, 'samples': 3285888, 'steps': 17113, 'loss/train': 1.7980364561080933} 11/06/2021 23:40:06 - INFO - __main__ - Step 17115: {'lr': 0.00048724393097577113, 'samples': 3286080, 'steps': 17114, 'loss/train': 1.552628517150879} 11/06/2021 23:40:06 - INFO - __main__ - Step 17116: {'lr': 0.0004872422574450951, 'samples': 3286272, 'steps': 17115, 'loss/train': 1.8925765752792358} 11/06/2021 23:40:06 - INFO - __main__ - Step 17117: {'lr': 0.0004872405838075213, 'samples': 3286464, 'steps': 17116, 'loss/train': 1.6938984394073486} 11/06/2021 23:40:07 - INFO - __main__ - Step 17118: {'lr': 0.00048723891006305066, 'samples': 3286656, 'steps': 17117, 'loss/train': 1.3686352968215942} 11/06/2021 23:40:07 - INFO - __main__ - Step 17119: {'lr': 0.0004872372362116838, 'samples': 3286848, 'steps': 17118, 'loss/train': 1.6649097204208374} 11/06/2021 23:40:08 - INFO - __main__ - Step 17120: {'lr': 0.0004872355622534215, 'samples': 3287040, 'steps': 17119, 'loss/train': 0.5623729825019836} 11/06/2021 23:40:09 - INFO - __main__ - Step 17121: {'lr': 0.0004872338881882644, 'samples': 3287232, 'steps': 17120, 'loss/train': 1.4882264137268066} 11/06/2021 23:40:09 - INFO - __main__ - Step 17122: {'lr': 0.00048723221401621354, 'samples': 3287424, 'steps': 17121, 'loss/train': 2.0259337425231934} 11/06/2021 23:40:09 - INFO - __main__ - Step 17123: {'lr': 0.0004872305397372694, 'samples': 3287616, 'steps': 17122, 'loss/train': 1.6014872789382935} 11/06/2021 23:40:10 - INFO - __main__ - Step 17124: {'lr': 0.0004872288653514329, 'samples': 3287808, 'steps': 17123, 'loss/train': 1.8238401412963867} 11/06/2021 23:40:10 - INFO - __main__ - Step 17125: {'lr': 0.0004872271908587047, 'samples': 3288000, 'steps': 17124, 'loss/train': 1.2414836883544922} 11/06/2021 23:40:11 - INFO - __main__ - Step 17126: {'lr': 0.0004872255162590856, 'samples': 3288192, 'steps': 17125, 'loss/train': 1.3290209770202637} 11/06/2021 23:40:11 - INFO - __main__ - Step 17127: {'lr': 0.0004872238415525764, 'samples': 3288384, 'steps': 17126, 'loss/train': 1.592273235321045} 11/06/2021 23:40:12 - INFO - __main__ - Step 17128: {'lr': 0.0004872221667391777, 'samples': 3288576, 'steps': 17127, 'loss/train': 1.5573184490203857} 11/06/2021 23:40:12 - INFO - __main__ - Step 17129: {'lr': 0.00048722049181889037, 'samples': 3288768, 'steps': 17128, 'loss/train': 1.9449520111083984} 11/06/2021 23:40:12 - INFO - __main__ - Step 17130: {'lr': 0.0004872188167917152, 'samples': 3288960, 'steps': 17129, 'loss/train': 2.0175869464874268} 11/06/2021 23:40:13 - INFO - __main__ - Step 17131: {'lr': 0.00048721714165765286, 'samples': 3289152, 'steps': 17130, 'loss/train': 0.9070045948028564} 11/06/2021 23:40:14 - INFO - __main__ - Step 17132: {'lr': 0.00048721546641670413, 'samples': 3289344, 'steps': 17131, 'loss/train': 1.587490200996399} 11/06/2021 23:40:14 - INFO - __main__ - Step 17133: {'lr': 0.00048721379106886976, 'samples': 3289536, 'steps': 17132, 'loss/train': 1.65146005153656} 11/06/2021 23:40:14 - INFO - __main__ - Step 17134: {'lr': 0.0004872121156141506, 'samples': 3289728, 'steps': 17133, 'loss/train': 1.6445494890213013} 11/06/2021 23:40:15 - INFO - __main__ - Step 17135: {'lr': 0.0004872104400525472, 'samples': 3289920, 'steps': 17134, 'loss/train': 1.1517709493637085} 11/06/2021 23:40:16 - INFO - __main__ - Step 17136: {'lr': 0.0004872087643840605, 'samples': 3290112, 'steps': 17135, 'loss/train': 1.2968612909317017} 11/06/2021 23:40:16 - INFO - __main__ - Step 17137: {'lr': 0.00048720708860869116, 'samples': 3290304, 'steps': 17136, 'loss/train': 1.8563275337219238} 11/06/2021 23:40:17 - INFO - __main__ - Step 17138: {'lr': 0.00048720541272644004, 'samples': 3290496, 'steps': 17137, 'loss/train': 1.7054293155670166} 11/06/2021 23:40:17 - INFO - __main__ - Step 17139: {'lr': 0.00048720373673730773, 'samples': 3290688, 'steps': 17138, 'loss/train': 1.8172038793563843} 11/06/2021 23:40:17 - INFO - __main__ - Step 17140: {'lr': 0.00048720206064129516, 'samples': 3290880, 'steps': 17139, 'loss/train': 1.6057360172271729} 11/06/2021 23:40:18 - INFO - __main__ - Step 17141: {'lr': 0.0004872003844384029, 'samples': 3291072, 'steps': 17140, 'loss/train': 1.466621994972229} 11/06/2021 23:40:19 - INFO - __main__ - Step 17142: {'lr': 0.0004871987081286319, 'samples': 3291264, 'steps': 17141, 'loss/train': 1.6234965324401855} 11/06/2021 23:40:19 - INFO - __main__ - Step 17143: {'lr': 0.0004871970317119828, 'samples': 3291456, 'steps': 17142, 'loss/train': 0.9775916337966919} 11/06/2021 23:40:20 - INFO - __main__ - Step 17144: {'lr': 0.00048719535518845634, 'samples': 3291648, 'steps': 17143, 'loss/train': 0.35061636567115784} 11/06/2021 23:40:20 - INFO - __main__ - Step 17145: {'lr': 0.0004871936785580533, 'samples': 3291840, 'steps': 17144, 'loss/train': 1.0786106586456299} 11/06/2021 23:40:21 - INFO - __main__ - Step 17146: {'lr': 0.0004871920018207745, 'samples': 3292032, 'steps': 17145, 'loss/train': 1.6169389486312866} 11/06/2021 23:40:22 - INFO - __main__ - Step 17147: {'lr': 0.0004871903249766206, 'samples': 3292224, 'steps': 17146, 'loss/train': 1.4403924942016602} 11/06/2021 23:40:23 - INFO - __main__ - Step 17148: {'lr': 0.0004871886480255925, 'samples': 3292416, 'steps': 17147, 'loss/train': 2.0448710918426514} 11/06/2021 23:40:23 - INFO - __main__ - Step 17149: {'lr': 0.0004871869709676907, 'samples': 3292608, 'steps': 17148, 'loss/train': 1.7501180171966553} 11/06/2021 23:40:23 - INFO - __main__ - Step 17150: {'lr': 0.0004871852938029162, 'samples': 3292800, 'steps': 17149, 'loss/train': 1.0432761907577515} 11/06/2021 23:40:24 - INFO - __main__ - Step 17151: {'lr': 0.00048718361653126975, 'samples': 3292992, 'steps': 17150, 'loss/train': 1.692865252494812} 11/06/2021 23:40:24 - INFO - __main__ - Step 17152: {'lr': 0.0004871819391527519, 'samples': 3293184, 'steps': 17151, 'loss/train': 1.8303561210632324} 11/06/2021 23:40:24 - INFO - __main__ - Step 17153: {'lr': 0.0004871802616673636, 'samples': 3293376, 'steps': 17152, 'loss/train': 1.8023126125335693} 11/06/2021 23:40:25 - INFO - __main__ - Step 17154: {'lr': 0.00048717858407510545, 'samples': 3293568, 'steps': 17153, 'loss/train': 1.8174192905426025} 11/06/2021 23:40:26 - INFO - __main__ - Step 17155: {'lr': 0.0004871769063759783, 'samples': 3293760, 'steps': 17154, 'loss/train': 0.8695468306541443} 11/06/2021 23:40:26 - INFO - __main__ - Step 17156: {'lr': 0.000487175228569983, 'samples': 3293952, 'steps': 17155, 'loss/train': 1.9590970277786255} 11/06/2021 23:40:26 - INFO - __main__ - Step 17157: {'lr': 0.0004871735506571201, 'samples': 3294144, 'steps': 17156, 'loss/train': 1.572733759880066} 11/06/2021 23:40:27 - INFO - __main__ - Step 17158: {'lr': 0.00048717187263739046, 'samples': 3294336, 'steps': 17157, 'loss/train': 2.0862021446228027} 11/06/2021 23:40:28 - INFO - __main__ - Step 17159: {'lr': 0.00048717019451079493, 'samples': 3294528, 'steps': 17158, 'loss/train': 1.4682984352111816} 11/06/2021 23:40:28 - INFO - __main__ - Step 17160: {'lr': 0.00048716851627733404, 'samples': 3294720, 'steps': 17159, 'loss/train': 1.938184142112732} 11/06/2021 23:40:28 - INFO - __main__ - Step 17161: {'lr': 0.00048716683793700876, 'samples': 3294912, 'steps': 17160, 'loss/train': 1.7588785886764526} 11/06/2021 23:40:29 - INFO - __main__ - Step 17162: {'lr': 0.00048716515948981975, 'samples': 3295104, 'steps': 17161, 'loss/train': 1.657987117767334} 11/06/2021 23:40:29 - INFO - __main__ - Step 17163: {'lr': 0.0004871634809357678, 'samples': 3295296, 'steps': 17162, 'loss/train': 2.02968430519104} 11/06/2021 23:40:29 - INFO - __main__ - Step 17164: {'lr': 0.00048716180227485365, 'samples': 3295488, 'steps': 17163, 'loss/train': 1.6158461570739746} 11/06/2021 23:40:31 - INFO - __main__ - Step 17165: {'lr': 0.000487160123507078, 'samples': 3295680, 'steps': 17164, 'loss/train': 1.3516970872879028} 11/06/2021 23:40:31 - INFO - __main__ - Step 17166: {'lr': 0.00048715844463244166, 'samples': 3295872, 'steps': 17165, 'loss/train': 1.0224634408950806} 11/06/2021 23:40:32 - INFO - __main__ - Step 17167: {'lr': 0.0004871567656509454, 'samples': 3296064, 'steps': 17166, 'loss/train': 1.6286890506744385} 11/06/2021 23:40:32 - INFO - __main__ - Step 17168: {'lr': 0.00048715508656259, 'samples': 3296256, 'steps': 17167, 'loss/train': 1.054592490196228} 11/06/2021 23:40:32 - INFO - __main__ - Step 17169: {'lr': 0.00048715340736737615, 'samples': 3296448, 'steps': 17168, 'loss/train': 1.4753445386886597} 11/06/2021 23:40:33 - INFO - __main__ - Step 17170: {'lr': 0.0004871517280653046, 'samples': 3296640, 'steps': 17169, 'loss/train': 0.6704396605491638} 11/06/2021 23:40:34 - INFO - __main__ - Step 17171: {'lr': 0.0004871500486563761, 'samples': 3296832, 'steps': 17170, 'loss/train': 1.4900462627410889} 11/06/2021 23:40:34 - INFO - __main__ - Step 17172: {'lr': 0.0004871483691405916, 'samples': 3297024, 'steps': 17171, 'loss/train': 1.7745710611343384} 11/06/2021 23:40:34 - INFO - __main__ - Step 17173: {'lr': 0.0004871466895179516, 'samples': 3297216, 'steps': 17172, 'loss/train': 1.7504075765609741} 11/06/2021 23:40:35 - INFO - __main__ - Step 17174: {'lr': 0.000487145009788457, 'samples': 3297408, 'steps': 17173, 'loss/train': 1.5627763271331787} 11/06/2021 23:40:36 - INFO - __main__ - Step 17175: {'lr': 0.0004871433299521085, 'samples': 3297600, 'steps': 17174, 'loss/train': 1.7707425355911255} 11/06/2021 23:40:36 - INFO - __main__ - Step 17176: {'lr': 0.00048714165000890685, 'samples': 3297792, 'steps': 17175, 'loss/train': 1.890425682067871} 11/06/2021 23:40:36 - INFO - __main__ - Step 17177: {'lr': 0.00048713996995885286, 'samples': 3297984, 'steps': 17176, 'loss/train': 1.6484146118164062} 11/06/2021 23:40:37 - INFO - __main__ - Step 17178: {'lr': 0.0004871382898019472, 'samples': 3298176, 'steps': 17177, 'loss/train': 1.687612771987915} 11/06/2021 23:40:37 - INFO - __main__ - Step 17179: {'lr': 0.0004871366095381908, 'samples': 3298368, 'steps': 17178, 'loss/train': 1.811246633529663} 11/06/2021 23:40:37 - INFO - __main__ - Step 17180: {'lr': 0.00048713492916758425, 'samples': 3298560, 'steps': 17179, 'loss/train': 1.000144124031067} 11/06/2021 23:40:39 - INFO - __main__ - Step 17181: {'lr': 0.00048713324869012833, 'samples': 3298752, 'steps': 17180, 'loss/train': 1.4316315650939941} 11/06/2021 23:40:39 - INFO - __main__ - Step 17182: {'lr': 0.0004871315681058238, 'samples': 3298944, 'steps': 17181, 'loss/train': 1.5694489479064941} 11/06/2021 23:40:39 - INFO - __main__ - Step 17183: {'lr': 0.0004871298874146716, 'samples': 3299136, 'steps': 17182, 'loss/train': 1.7600518465042114} 11/06/2021 23:40:40 - INFO - __main__ - Step 17184: {'lr': 0.00048712820661667215, 'samples': 3299328, 'steps': 17183, 'loss/train': 1.9780511856079102} 11/06/2021 23:40:40 - INFO - __main__ - Step 17185: {'lr': 0.0004871265257118265, 'samples': 3299520, 'steps': 17184, 'loss/train': 1.7205005884170532} 11/06/2021 23:40:41 - INFO - __main__ - Step 17186: {'lr': 0.0004871248447001352, 'samples': 3299712, 'steps': 17185, 'loss/train': 1.8531748056411743} 11/06/2021 23:40:41 - INFO - __main__ - Step 17187: {'lr': 0.0004871231635815992, 'samples': 3299904, 'steps': 17186, 'loss/train': 1.939202904701233} 11/06/2021 23:40:42 - INFO - __main__ - Step 17188: {'lr': 0.0004871214823562191, 'samples': 3300096, 'steps': 17187, 'loss/train': 1.5620630979537964} 11/06/2021 23:40:42 - INFO - __main__ - Step 17189: {'lr': 0.0004871198010239958, 'samples': 3300288, 'steps': 17188, 'loss/train': 1.8698750734329224} 11/06/2021 23:40:42 - INFO - __main__ - Step 17190: {'lr': 0.0004871181195849299, 'samples': 3300480, 'steps': 17189, 'loss/train': 1.9810320138931274} 11/06/2021 23:40:43 - INFO - __main__ - Step 17191: {'lr': 0.00048711643803902227, 'samples': 3300672, 'steps': 17190, 'loss/train': 1.5719085931777954} 11/06/2021 23:40:44 - INFO - __main__ - Step 17192: {'lr': 0.00048711475638627363, 'samples': 3300864, 'steps': 17191, 'loss/train': 2.2363479137420654} 11/06/2021 23:40:44 - INFO - __main__ - Step 17193: {'lr': 0.0004871130746266847, 'samples': 3301056, 'steps': 17192, 'loss/train': 1.8176665306091309} 11/06/2021 23:40:45 - INFO - __main__ - Step 17194: {'lr': 0.00048711139276025626, 'samples': 3301248, 'steps': 17193, 'loss/train': 1.7995778322219849} 11/06/2021 23:40:45 - INFO - __main__ - Step 17195: {'lr': 0.00048710971078698916, 'samples': 3301440, 'steps': 17194, 'loss/train': 1.1480748653411865} 11/06/2021 23:40:46 - INFO - __main__ - Step 17196: {'lr': 0.0004871080287068841, 'samples': 3301632, 'steps': 17195, 'loss/train': 2.0318057537078857} 11/06/2021 23:40:46 - INFO - __main__ - Step 17197: {'lr': 0.00048710634651994176, 'samples': 3301824, 'steps': 17196, 'loss/train': 1.7191940546035767} 11/06/2021 23:40:47 - INFO - __main__ - Step 17198: {'lr': 0.0004871046642261629, 'samples': 3302016, 'steps': 17197, 'loss/train': 1.612969994544983} 11/06/2021 23:40:47 - INFO - __main__ - Step 17199: {'lr': 0.0004871029818255485, 'samples': 3302208, 'steps': 17198, 'loss/train': 2.0721495151519775} 11/06/2021 23:40:47 - INFO - __main__ - Step 17200: {'lr': 0.0004871012993180991, 'samples': 3302400, 'steps': 17199, 'loss/train': 1.4163411855697632} 11/06/2021 23:40:48 - INFO - __main__ - Step 17201: {'lr': 0.0004870996167038154, 'samples': 3302592, 'steps': 17200, 'loss/train': 1.7400089502334595} 11/06/2021 23:40:49 - INFO - __main__ - Step 17202: {'lr': 0.0004870979339826984, 'samples': 3302784, 'steps': 17201, 'loss/train': 2.5178351402282715} 11/06/2021 23:40:49 - INFO - __main__ - Step 17203: {'lr': 0.00048709625115474865, 'samples': 3302976, 'steps': 17202, 'loss/train': 1.433853030204773} 11/06/2021 23:40:49 - INFO - __main__ - Step 17204: {'lr': 0.00048709456821996705, 'samples': 3303168, 'steps': 17203, 'loss/train': 2.0640363693237305} 11/06/2021 23:40:50 - INFO - __main__ - Step 17205: {'lr': 0.0004870928851783543, 'samples': 3303360, 'steps': 17204, 'loss/train': 1.7238097190856934} 11/06/2021 23:40:50 - INFO - __main__ - Step 17206: {'lr': 0.00048709120202991107, 'samples': 3303552, 'steps': 17205, 'loss/train': 1.600435733795166} 11/06/2021 23:40:51 - INFO - __main__ - Step 17207: {'lr': 0.0004870895187746383, 'samples': 3303744, 'steps': 17206, 'loss/train': 0.9599513411521912} 11/06/2021 23:40:51 - INFO - __main__ - Step 17208: {'lr': 0.00048708783541253655, 'samples': 3303936, 'steps': 17207, 'loss/train': 1.8640570640563965} 11/06/2021 23:40:52 - INFO - __main__ - Step 17209: {'lr': 0.00048708615194360675, 'samples': 3304128, 'steps': 17208, 'loss/train': 1.4890220165252686} 11/06/2021 23:40:52 - INFO - __main__ - Step 17210: {'lr': 0.0004870844683678496, 'samples': 3304320, 'steps': 17209, 'loss/train': 1.492447018623352} 11/06/2021 23:40:52 - INFO - __main__ - Step 17211: {'lr': 0.0004870827846852658, 'samples': 3304512, 'steps': 17210, 'loss/train': 1.8645015954971313} 11/06/2021 23:40:53 - INFO - __main__ - Step 17212: {'lr': 0.00048708110089585617, 'samples': 3304704, 'steps': 17211, 'loss/train': 1.7372667789459229} 11/06/2021 23:40:54 - INFO - __main__ - Step 17213: {'lr': 0.00048707941699962143, 'samples': 3304896, 'steps': 17212, 'loss/train': 2.233567953109741} 11/06/2021 23:40:54 - INFO - __main__ - Step 17214: {'lr': 0.0004870777329965624, 'samples': 3305088, 'steps': 17213, 'loss/train': 1.0954010486602783} 11/06/2021 23:40:54 - INFO - __main__ - Step 17215: {'lr': 0.00048707604888667983, 'samples': 3305280, 'steps': 17214, 'loss/train': 0.9866694211959839} 11/06/2021 23:40:55 - INFO - __main__ - Step 17216: {'lr': 0.0004870743646699744, 'samples': 3305472, 'steps': 17215, 'loss/train': 2.072942018508911} 11/06/2021 23:40:56 - INFO - __main__ - Step 17217: {'lr': 0.0004870726803464469, 'samples': 3305664, 'steps': 17216, 'loss/train': 1.3955477476119995} 11/06/2021 23:40:56 - INFO - __main__ - Step 17218: {'lr': 0.00048707099591609816, 'samples': 3305856, 'steps': 17217, 'loss/train': 1.746077060699463} 11/06/2021 23:40:57 - INFO - __main__ - Step 17219: {'lr': 0.0004870693113789289, 'samples': 3306048, 'steps': 17218, 'loss/train': 1.7401597499847412} 11/06/2021 23:40:57 - INFO - __main__ - Step 17220: {'lr': 0.00048706762673493987, 'samples': 3306240, 'steps': 17219, 'loss/train': 1.4887704849243164} 11/06/2021 23:40:57 - INFO - __main__ - Step 17221: {'lr': 0.00048706594198413177, 'samples': 3306432, 'steps': 17220, 'loss/train': 1.8607337474822998} 11/06/2021 23:40:58 - INFO - __main__ - Step 17222: {'lr': 0.0004870642571265054, 'samples': 3306624, 'steps': 17221, 'loss/train': 1.514723539352417} 11/06/2021 23:40:59 - INFO - __main__ - Step 17223: {'lr': 0.0004870625721620616, 'samples': 3306816, 'steps': 17222, 'loss/train': 1.485309362411499} 11/06/2021 23:40:59 - INFO - __main__ - Step 17224: {'lr': 0.00048706088709080103, 'samples': 3307008, 'steps': 17223, 'loss/train': 1.781569004058838} 11/06/2021 23:40:59 - INFO - __main__ - Step 17225: {'lr': 0.00048705920191272447, 'samples': 3307200, 'steps': 17224, 'loss/train': 1.8365702629089355} 11/06/2021 23:41:00 - INFO - __main__ - Step 17226: {'lr': 0.0004870575166278327, 'samples': 3307392, 'steps': 17225, 'loss/train': 1.4333207607269287} 11/06/2021 23:41:00 - INFO - __main__ - Step 17227: {'lr': 0.0004870558312361265, 'samples': 3307584, 'steps': 17226, 'loss/train': 1.2213234901428223} 11/06/2021 23:41:01 - INFO - __main__ - Step 17228: {'lr': 0.0004870541457376066, 'samples': 3307776, 'steps': 17227, 'loss/train': 1.2647724151611328} 11/06/2021 23:41:01 - INFO - __main__ - Step 17229: {'lr': 0.0004870524601322737, 'samples': 3307968, 'steps': 17228, 'loss/train': 1.4165525436401367} 11/06/2021 23:41:02 - INFO - __main__ - Step 17230: {'lr': 0.00048705077442012866, 'samples': 3308160, 'steps': 17229, 'loss/train': 1.423817753791809} 11/06/2021 23:41:02 - INFO - __main__ - Step 17231: {'lr': 0.0004870490886011723, 'samples': 3308352, 'steps': 17230, 'loss/train': 1.821036696434021} 11/06/2021 23:41:02 - INFO - __main__ - Step 17232: {'lr': 0.0004870474026754051, 'samples': 3308544, 'steps': 17231, 'loss/train': 1.5790460109710693} 11/06/2021 23:41:04 - INFO - __main__ - Step 17233: {'lr': 0.00048704571664282806, 'samples': 3308736, 'steps': 17232, 'loss/train': 2.0383269786834717} 11/06/2021 23:41:04 - INFO - __main__ - Step 17234: {'lr': 0.0004870440305034419, 'samples': 3308928, 'steps': 17233, 'loss/train': 1.1566420793533325} 11/06/2021 23:41:04 - INFO - __main__ - Step 17235: {'lr': 0.00048704234425724736, 'samples': 3309120, 'steps': 17234, 'loss/train': 1.1532448530197144} 11/06/2021 23:41:05 - INFO - __main__ - Step 17236: {'lr': 0.0004870406579042452, 'samples': 3309312, 'steps': 17235, 'loss/train': 1.7704163789749146} 11/06/2021 23:41:05 - INFO - __main__ - Step 17237: {'lr': 0.00048703897144443615, 'samples': 3309504, 'steps': 17236, 'loss/train': 1.2826061248779297} 11/06/2021 23:41:05 - INFO - __main__ - Step 17238: {'lr': 0.000487037284877821, 'samples': 3309696, 'steps': 17237, 'loss/train': 1.5143266916275024} 11/06/2021 23:41:06 - INFO - __main__ - Step 17239: {'lr': 0.00048703559820440054, 'samples': 3309888, 'steps': 17238, 'loss/train': 1.221914291381836} 11/06/2021 23:41:07 - INFO - __main__ - Step 17240: {'lr': 0.0004870339114241755, 'samples': 3310080, 'steps': 17239, 'loss/train': 1.3863588571548462} 11/06/2021 23:41:07 - INFO - __main__ - Step 17241: {'lr': 0.00048703222453714656, 'samples': 3310272, 'steps': 17240, 'loss/train': 1.3659840822219849} 11/06/2021 23:41:07 - INFO - __main__ - Step 17242: {'lr': 0.0004870305375433146, 'samples': 3310464, 'steps': 17241, 'loss/train': 1.8747817277908325} 11/06/2021 23:41:08 - INFO - __main__ - Step 17243: {'lr': 0.0004870288504426804, 'samples': 3310656, 'steps': 17242, 'loss/train': 1.3708194494247437} 11/06/2021 23:41:09 - INFO - __main__ - Step 17244: {'lr': 0.0004870271632352446, 'samples': 3310848, 'steps': 17243, 'loss/train': 1.7464226484298706} 11/06/2021 23:41:09 - INFO - __main__ - Step 17245: {'lr': 0.000487025475921008, 'samples': 3311040, 'steps': 17244, 'loss/train': 1.4908338785171509} 11/06/2021 23:41:10 - INFO - __main__ - Step 17246: {'lr': 0.00048702378849997143, 'samples': 3311232, 'steps': 17245, 'loss/train': 1.377392292022705} 11/06/2021 23:41:10 - INFO - __main__ - Step 17247: {'lr': 0.0004870221009721356, 'samples': 3311424, 'steps': 17246, 'loss/train': 2.0922882556915283} 11/06/2021 23:41:10 - INFO - __main__ - Step 17248: {'lr': 0.00048702041333750117, 'samples': 3311616, 'steps': 17247, 'loss/train': 1.4370702505111694} 11/06/2021 23:41:11 - INFO - __main__ - Step 17249: {'lr': 0.0004870187255960691, 'samples': 3311808, 'steps': 17248, 'loss/train': 2.13187313079834} 11/06/2021 23:41:12 - INFO - __main__ - Step 17250: {'lr': 0.00048701703774784, 'samples': 3312000, 'steps': 17249, 'loss/train': 0.7868022918701172} 11/06/2021 23:41:12 - INFO - __main__ - Step 17251: {'lr': 0.0004870153497928147, 'samples': 3312192, 'steps': 17250, 'loss/train': 1.8932836055755615} 11/06/2021 23:41:13 - INFO - __main__ - Step 17252: {'lr': 0.00048701366173099396, 'samples': 3312384, 'steps': 17251, 'loss/train': 2.3235738277435303} 11/06/2021 23:41:13 - INFO - __main__ - Step 17253: {'lr': 0.0004870119735623785, 'samples': 3312576, 'steps': 17252, 'loss/train': 1.8664605617523193} 11/06/2021 23:41:13 - INFO - __main__ - Step 17254: {'lr': 0.00048701028528696914, 'samples': 3312768, 'steps': 17253, 'loss/train': 1.6442269086837769} 11/06/2021 23:41:14 - INFO - __main__ - Step 17255: {'lr': 0.0004870085969047665, 'samples': 3312960, 'steps': 17254, 'loss/train': 1.5786818265914917} 11/06/2021 23:41:15 - INFO - __main__ - Step 17256: {'lr': 0.00048700690841577154, 'samples': 3313152, 'steps': 17255, 'loss/train': 1.9361462593078613} 11/06/2021 23:41:15 - INFO - __main__ - Step 17257: {'lr': 0.0004870052198199849, 'samples': 3313344, 'steps': 17256, 'loss/train': 1.8143517971038818} 11/06/2021 23:41:15 - INFO - __main__ - Step 17258: {'lr': 0.00048700353111740734, 'samples': 3313536, 'steps': 17257, 'loss/train': 1.4683324098587036} 11/06/2021 23:41:16 - INFO - __main__ - Step 17259: {'lr': 0.0004870018423080397, 'samples': 3313728, 'steps': 17258, 'loss/train': 2.0775022506713867} 11/06/2021 23:41:17 - INFO - __main__ - Step 17260: {'lr': 0.00048700015339188266, 'samples': 3313920, 'steps': 17259, 'loss/train': 1.94370698928833} 11/06/2021 23:41:17 - INFO - __main__ - Step 17261: {'lr': 0.0004869984643689369, 'samples': 3314112, 'steps': 17260, 'loss/train': 2.0206258296966553} 11/06/2021 23:41:18 - INFO - __main__ - Step 17262: {'lr': 0.00048699677523920346, 'samples': 3314304, 'steps': 17261, 'loss/train': 1.3398237228393555} 11/06/2021 23:41:18 - INFO - __main__ - Step 17263: {'lr': 0.00048699508600268284, 'samples': 3314496, 'steps': 17262, 'loss/train': 1.1747856140136719} 11/06/2021 23:41:18 - INFO - __main__ - Step 17264: {'lr': 0.00048699339665937594, 'samples': 3314688, 'steps': 17263, 'loss/train': 1.7770746946334839} 11/06/2021 23:41:19 - INFO - __main__ - Step 17265: {'lr': 0.0004869917072092834, 'samples': 3314880, 'steps': 17264, 'loss/train': 1.789361596107483} 11/06/2021 23:41:20 - INFO - __main__ - Step 17266: {'lr': 0.00048699001765240615, 'samples': 3315072, 'steps': 17265, 'loss/train': 1.4973350763320923} 11/06/2021 23:41:20 - INFO - __main__ - Step 17267: {'lr': 0.00048698832798874477, 'samples': 3315264, 'steps': 17266, 'loss/train': 1.6333647966384888} 11/06/2021 23:41:20 - INFO - __main__ - Step 17268: {'lr': 0.0004869866382183001, 'samples': 3315456, 'steps': 17267, 'loss/train': 1.2706983089447021} 11/06/2021 23:41:21 - INFO - __main__ - Step 17269: {'lr': 0.00048698494834107297, 'samples': 3315648, 'steps': 17268, 'loss/train': 1.8148884773254395} 11/06/2021 23:41:21 - INFO - __main__ - Step 17270: {'lr': 0.000486983258357064, 'samples': 3315840, 'steps': 17269, 'loss/train': 2.033538341522217} 11/06/2021 23:41:22 - INFO - __main__ - Step 17271: {'lr': 0.00048698156826627414, 'samples': 3316032, 'steps': 17270, 'loss/train': 1.4283595085144043} 11/06/2021 23:41:23 - INFO - __main__ - Step 17272: {'lr': 0.00048697987806870397, 'samples': 3316224, 'steps': 17271, 'loss/train': 1.6567562818527222} 11/06/2021 23:41:23 - INFO - __main__ - Step 17273: {'lr': 0.0004869781877643543, 'samples': 3316416, 'steps': 17272, 'loss/train': 2.0340371131896973} 11/06/2021 23:41:24 - INFO - __main__ - Step 17274: {'lr': 0.000486976497353226, 'samples': 3316608, 'steps': 17273, 'loss/train': 1.683788537979126} 11/06/2021 23:41:24 - INFO - __main__ - Step 17275: {'lr': 0.0004869748068353197, 'samples': 3316800, 'steps': 17274, 'loss/train': 1.2586175203323364} 11/06/2021 23:41:25 - INFO - __main__ - Step 17276: {'lr': 0.00048697311621063625, 'samples': 3316992, 'steps': 17275, 'loss/train': 0.27130571007728577} 11/06/2021 23:41:25 - INFO - __main__ - Step 17277: {'lr': 0.0004869714254791763, 'samples': 3317184, 'steps': 17276, 'loss/train': 1.1046879291534424} 11/06/2021 23:41:26 - INFO - __main__ - Step 17278: {'lr': 0.00048696973464094076, 'samples': 3317376, 'steps': 17277, 'loss/train': 1.5609723329544067} 11/06/2021 23:41:26 - INFO - __main__ - Step 17279: {'lr': 0.00048696804369593023, 'samples': 3317568, 'steps': 17278, 'loss/train': 1.604174017906189} 11/06/2021 23:41:26 - INFO - __main__ - Step 17280: {'lr': 0.0004869663526441456, 'samples': 3317760, 'steps': 17279, 'loss/train': 1.6268011331558228} 11/06/2021 23:41:28 - INFO - __main__ - Step 17281: {'lr': 0.0004869646614855876, 'samples': 3317952, 'steps': 17280, 'loss/train': 2.258241653442383} 11/06/2021 23:41:28 - INFO - __main__ - Step 17282: {'lr': 0.0004869629702202569, 'samples': 3318144, 'steps': 17281, 'loss/train': 1.8233325481414795} 11/06/2021 23:41:28 - INFO - __main__ - Step 17283: {'lr': 0.0004869612788481544, 'samples': 3318336, 'steps': 17282, 'loss/train': 1.6801536083221436} 11/06/2021 23:41:29 - INFO - __main__ - Step 17284: {'lr': 0.00048695958736928084, 'samples': 3318528, 'steps': 17283, 'loss/train': 1.0620797872543335} 11/06/2021 23:41:29 - INFO - __main__ - Step 17285: {'lr': 0.00048695789578363693, 'samples': 3318720, 'steps': 17284, 'loss/train': 2.001478910446167} 11/06/2021 23:41:29 - INFO - __main__ - Step 17286: {'lr': 0.00048695620409122345, 'samples': 3318912, 'steps': 17285, 'loss/train': 1.4987901449203491} 11/06/2021 23:41:30 - INFO - __main__ - Step 17287: {'lr': 0.00048695451229204115, 'samples': 3319104, 'steps': 17286, 'loss/train': 1.8941746950149536} 11/06/2021 23:41:31 - INFO - __main__ - Step 17288: {'lr': 0.0004869528203860908, 'samples': 3319296, 'steps': 17287, 'loss/train': 1.8230624198913574} 11/06/2021 23:41:31 - INFO - __main__ - Step 17289: {'lr': 0.0004869511283733732, 'samples': 3319488, 'steps': 17288, 'loss/train': 2.0317223072052} 11/06/2021 23:41:32 - INFO - __main__ - Step 17290: {'lr': 0.000486949436253889, 'samples': 3319680, 'steps': 17289, 'loss/train': 1.3601343631744385} 11/06/2021 23:41:32 - INFO - __main__ - Step 17291: {'lr': 0.0004869477440276391, 'samples': 3319872, 'steps': 17290, 'loss/train': 1.6629352569580078} 11/06/2021 23:41:33 - INFO - __main__ - Step 17292: {'lr': 0.00048694605169462415, 'samples': 3320064, 'steps': 17291, 'loss/train': 1.6145473718643188} 11/06/2021 23:41:33 - INFO - __main__ - Step 17293: {'lr': 0.00048694435925484506, 'samples': 3320256, 'steps': 17292, 'loss/train': 1.7590662240982056} 11/06/2021 23:41:34 - INFO - __main__ - Step 17294: {'lr': 0.0004869426667083024, 'samples': 3320448, 'steps': 17293, 'loss/train': 1.9693127870559692} 11/06/2021 23:41:34 - INFO - __main__ - Step 17295: {'lr': 0.00048694097405499703, 'samples': 3320640, 'steps': 17294, 'loss/train': 1.6262609958648682} 11/06/2021 23:41:34 - INFO - __main__ - Step 17296: {'lr': 0.0004869392812949298, 'samples': 3320832, 'steps': 17295, 'loss/train': 1.807649850845337} 11/06/2021 23:41:35 - INFO - __main__ - Step 17297: {'lr': 0.00048693758842810133, 'samples': 3321024, 'steps': 17296, 'loss/train': 1.6670494079589844} 11/06/2021 23:41:36 - INFO - __main__ - Step 17298: {'lr': 0.00048693589545451243, 'samples': 3321216, 'steps': 17297, 'loss/train': 1.5862675905227661} 11/06/2021 23:41:36 - INFO - __main__ - Step 17299: {'lr': 0.00048693420237416393, 'samples': 3321408, 'steps': 17298, 'loss/train': 1.3745173215866089} 11/06/2021 23:41:36 - INFO - __main__ - Step 17300: {'lr': 0.00048693250918705643, 'samples': 3321600, 'steps': 17299, 'loss/train': 1.0434212684631348} 11/06/2021 23:41:37 - INFO - __main__ - Step 17301: {'lr': 0.0004869308158931909, 'samples': 3321792, 'steps': 17300, 'loss/train': 1.2677903175354004} 11/06/2021 23:41:37 - INFO - __main__ - Step 17302: {'lr': 0.00048692912249256794, 'samples': 3321984, 'steps': 17301, 'loss/train': 1.7817742824554443} 11/06/2021 23:41:38 - INFO - __main__ - Step 17303: {'lr': 0.00048692742898518836, 'samples': 3322176, 'steps': 17302, 'loss/train': 0.8955352306365967} 11/06/2021 23:41:38 - INFO - __main__ - Step 17304: {'lr': 0.000486925735371053, 'samples': 3322368, 'steps': 17303, 'loss/train': 1.3537814617156982} 11/06/2021 23:41:39 - INFO - __main__ - Step 17305: {'lr': 0.00048692404165016256, 'samples': 3322560, 'steps': 17304, 'loss/train': 1.636925458908081} 11/06/2021 23:41:39 - INFO - __main__ - Step 17306: {'lr': 0.0004869223478225178, 'samples': 3322752, 'steps': 17305, 'loss/train': 1.4810558557510376} 11/06/2021 23:41:39 - INFO - __main__ - Step 17307: {'lr': 0.00048692065388811944, 'samples': 3322944, 'steps': 17306, 'loss/train': 2.2091891765594482} 11/06/2021 23:41:40 - INFO - __main__ - Step 17308: {'lr': 0.0004869189598469683, 'samples': 3323136, 'steps': 17307, 'loss/train': 1.5620085000991821} 11/06/2021 23:41:41 - INFO - __main__ - Step 17309: {'lr': 0.00048691726569906514, 'samples': 3323328, 'steps': 17308, 'loss/train': 1.429172158241272} 11/06/2021 23:41:41 - INFO - __main__ - Step 17310: {'lr': 0.0004869155714444107, 'samples': 3323520, 'steps': 17309, 'loss/train': 1.6215598583221436} 11/06/2021 23:41:42 - INFO - __main__ - Step 17311: {'lr': 0.00048691387708300584, 'samples': 3323712, 'steps': 17310, 'loss/train': 2.614412307739258} 11/06/2021 23:41:42 - INFO - __main__ - Step 17312: {'lr': 0.00048691218261485113, 'samples': 3323904, 'steps': 17311, 'loss/train': 1.9944758415222168} 11/06/2021 23:41:43 - INFO - __main__ - Step 17313: {'lr': 0.00048691048803994755, 'samples': 3324096, 'steps': 17312, 'loss/train': 1.879599928855896} 11/06/2021 23:41:43 - INFO - __main__ - Step 17314: {'lr': 0.00048690879335829565, 'samples': 3324288, 'steps': 17313, 'loss/train': 1.1751701831817627} 11/06/2021 23:41:44 - INFO - __main__ - Step 17315: {'lr': 0.00048690709856989635, 'samples': 3324480, 'steps': 17314, 'loss/train': 2.111276388168335} 11/06/2021 23:41:44 - INFO - __main__ - Step 17316: {'lr': 0.00048690540367475046, 'samples': 3324672, 'steps': 17315, 'loss/train': 1.5609761476516724} 11/06/2021 23:41:45 - INFO - __main__ - Step 17317: {'lr': 0.00048690370867285847, 'samples': 3324864, 'steps': 17316, 'loss/train': 1.1299504041671753} 11/06/2021 23:41:45 - INFO - __main__ - Step 17318: {'lr': 0.00048690201356422146, 'samples': 3325056, 'steps': 17317, 'loss/train': 2.0143861770629883} 11/06/2021 23:41:46 - INFO - __main__ - Step 17319: {'lr': 0.00048690031834884004, 'samples': 3325248, 'steps': 17318, 'loss/train': 1.9749722480773926} 11/06/2021 23:41:46 - INFO - __main__ - Step 17320: {'lr': 0.00048689862302671495, 'samples': 3325440, 'steps': 17319, 'loss/train': 1.8871047496795654} 11/06/2021 23:41:47 - INFO - __main__ - Step 17321: {'lr': 0.000486896927597847, 'samples': 3325632, 'steps': 17320, 'loss/train': 1.9571623802185059} 11/06/2021 23:41:47 - INFO - __main__ - Step 17322: {'lr': 0.00048689523206223693, 'samples': 3325824, 'steps': 17321, 'loss/train': 1.0182833671569824} 11/06/2021 23:41:47 - INFO - __main__ - Step 17323: {'lr': 0.00048689353641988563, 'samples': 3326016, 'steps': 17322, 'loss/train': 1.170893907546997} 11/06/2021 23:41:48 - INFO - __main__ - Step 17324: {'lr': 0.0004868918406707937, 'samples': 3326208, 'steps': 17323, 'loss/train': 1.7133080959320068} 11/06/2021 23:41:48 - INFO - __main__ - Step 17325: {'lr': 0.00048689014481496197, 'samples': 3326400, 'steps': 17324, 'loss/train': 1.9357455968856812} 11/06/2021 23:41:49 - INFO - __main__ - Step 17326: {'lr': 0.0004868884488523911, 'samples': 3326592, 'steps': 17325, 'loss/train': 1.7280464172363281} 11/06/2021 23:41:49 - INFO - __main__ - Step 17327: {'lr': 0.0004868867527830821, 'samples': 3326784, 'steps': 17326, 'loss/train': 2.280465602874756} 11/06/2021 23:41:50 - INFO - __main__ - Step 17328: {'lr': 0.0004868850566070355, 'samples': 3326976, 'steps': 17327, 'loss/train': 1.5123412609100342} 11/06/2021 23:41:51 - INFO - __main__ - Step 17329: {'lr': 0.00048688336032425217, 'samples': 3327168, 'steps': 17328, 'loss/train': 2.5029807090759277} 11/06/2021 23:41:51 - INFO - __main__ - Step 17330: {'lr': 0.0004868816639347328, 'samples': 3327360, 'steps': 17329, 'loss/train': 1.3121029138565063} 11/06/2021 23:41:52 - INFO - __main__ - Step 17331: {'lr': 0.0004868799674384783, 'samples': 3327552, 'steps': 17330, 'loss/train': 1.6100544929504395} 11/06/2021 23:41:52 - INFO - __main__ - Step 17332: {'lr': 0.0004868782708354893, 'samples': 3327744, 'steps': 17331, 'loss/train': 1.734197735786438} 11/06/2021 23:41:52 - INFO - __main__ - Step 17333: {'lr': 0.0004868765741257666, 'samples': 3327936, 'steps': 17332, 'loss/train': 0.5606326460838318} 11/06/2021 23:41:53 - INFO - __main__ - Step 17334: {'lr': 0.00048687487730931096, 'samples': 3328128, 'steps': 17333, 'loss/train': 1.4768435955047607} 11/06/2021 23:41:54 - INFO - __main__ - Step 17335: {'lr': 0.00048687318038612317, 'samples': 3328320, 'steps': 17334, 'loss/train': 1.0746694803237915} 11/06/2021 23:41:54 - INFO - __main__ - Step 17336: {'lr': 0.000486871483356204, 'samples': 3328512, 'steps': 17335, 'loss/train': 1.6409631967544556} 11/06/2021 23:41:55 - INFO - __main__ - Step 17337: {'lr': 0.00048686978621955416, 'samples': 3328704, 'steps': 17336, 'loss/train': 1.2255845069885254} 11/06/2021 23:41:55 - INFO - __main__ - Step 17338: {'lr': 0.00048686808897617447, 'samples': 3328896, 'steps': 17337, 'loss/train': 1.829113483428955} 11/06/2021 23:41:55 - INFO - __main__ - Step 17339: {'lr': 0.00048686639162606564, 'samples': 3329088, 'steps': 17338, 'loss/train': 1.5854146480560303} 11/06/2021 23:41:56 - INFO - __main__ - Step 17340: {'lr': 0.0004868646941692285, 'samples': 3329280, 'steps': 17339, 'loss/train': 1.6102768182754517} 11/06/2021 23:41:57 - INFO - __main__ - Step 17341: {'lr': 0.0004868629966056638, 'samples': 3329472, 'steps': 17340, 'loss/train': 1.1662577390670776} 11/06/2021 23:41:57 - INFO - __main__ - Step 17342: {'lr': 0.0004868612989353722, 'samples': 3329664, 'steps': 17341, 'loss/train': 1.8177578449249268} 11/06/2021 23:41:57 - INFO - __main__ - Step 17343: {'lr': 0.0004868596011583547, 'samples': 3329856, 'steps': 17342, 'loss/train': 2.0990030765533447} 11/06/2021 23:41:58 - INFO - __main__ - Step 17344: {'lr': 0.00048685790327461184, 'samples': 3330048, 'steps': 17343, 'loss/train': 1.853256106376648} 11/06/2021 23:41:59 - INFO - __main__ - Step 17345: {'lr': 0.0004868562052841444, 'samples': 3330240, 'steps': 17344, 'loss/train': 1.188179612159729} 11/06/2021 23:41:59 - INFO - __main__ - Step 17346: {'lr': 0.00048685450718695335, 'samples': 3330432, 'steps': 17345, 'loss/train': 1.7476564645767212} 11/06/2021 23:42:00 - INFO - __main__ - Step 17347: {'lr': 0.00048685280898303916, 'samples': 3330624, 'steps': 17346, 'loss/train': 1.694496512413025} 11/06/2021 23:42:00 - INFO - __main__ - Step 17348: {'lr': 0.00048685111067240283, 'samples': 3330816, 'steps': 17347, 'loss/train': 1.716973900794983} 11/06/2021 23:42:00 - INFO - __main__ - Step 17349: {'lr': 0.00048684941225504507, 'samples': 3331008, 'steps': 17348, 'loss/train': 2.0338494777679443} 11/06/2021 23:42:01 - INFO - __main__ - Step 17350: {'lr': 0.0004868477137309666, 'samples': 3331200, 'steps': 17349, 'loss/train': 0.9896562695503235} 11/06/2021 23:42:02 - INFO - __main__ - Step 17351: {'lr': 0.00048684601510016817, 'samples': 3331392, 'steps': 17350, 'loss/train': 1.7950968742370605} 11/06/2021 23:42:02 - INFO - __main__ - Step 17352: {'lr': 0.00048684431636265065, 'samples': 3331584, 'steps': 17351, 'loss/train': 1.272135615348816} 11/06/2021 23:42:02 - INFO - __main__ - Step 17353: {'lr': 0.00048684261751841463, 'samples': 3331776, 'steps': 17352, 'loss/train': 1.1567683219909668} 11/06/2021 23:42:03 - INFO - __main__ - Step 17354: {'lr': 0.000486840918567461, 'samples': 3331968, 'steps': 17353, 'loss/train': 1.702507734298706} 11/06/2021 23:42:03 - INFO - __main__ - Step 17355: {'lr': 0.0004868392195097906, 'samples': 3332160, 'steps': 17354, 'loss/train': 1.6192466020584106} 11/06/2021 23:42:04 - INFO - __main__ - Step 17356: {'lr': 0.0004868375203454041, 'samples': 3332352, 'steps': 17355, 'loss/train': 1.33048415184021} 11/06/2021 23:42:05 - INFO - __main__ - Step 17357: {'lr': 0.00048683582107430227, 'samples': 3332544, 'steps': 17356, 'loss/train': 1.4887734651565552} 11/06/2021 23:42:05 - INFO - __main__ - Step 17358: {'lr': 0.0004868341216964858, 'samples': 3332736, 'steps': 17357, 'loss/train': 1.8216099739074707} 11/06/2021 23:42:05 - INFO - __main__ - Step 17359: {'lr': 0.00048683242221195553, 'samples': 3332928, 'steps': 17358, 'loss/train': 1.927954912185669} 11/06/2021 23:42:06 - INFO - __main__ - Step 17360: {'lr': 0.00048683072262071224, 'samples': 3333120, 'steps': 17359, 'loss/train': 1.390279769897461} 11/06/2021 23:42:07 - INFO - __main__ - Step 17361: {'lr': 0.00048682902292275667, 'samples': 3333312, 'steps': 17360, 'loss/train': 1.5254342555999756} 11/06/2021 23:42:07 - INFO - __main__ - Step 17362: {'lr': 0.00048682732311808964, 'samples': 3333504, 'steps': 17361, 'loss/train': 1.4730674028396606} 11/06/2021 23:42:07 - INFO - __main__ - Step 17363: {'lr': 0.00048682562320671185, 'samples': 3333696, 'steps': 17362, 'loss/train': 2.2368764877319336} 11/06/2021 23:42:08 - INFO - __main__ - Step 17364: {'lr': 0.00048682392318862407, 'samples': 3333888, 'steps': 17363, 'loss/train': 1.7727394104003906} 11/06/2021 23:42:08 - INFO - __main__ - Step 17365: {'lr': 0.00048682222306382705, 'samples': 3334080, 'steps': 17364, 'loss/train': 1.5252821445465088} 11/06/2021 23:42:09 - INFO - __main__ - Step 17366: {'lr': 0.0004868205228323217, 'samples': 3334272, 'steps': 17365, 'loss/train': 2.325129508972168} 11/06/2021 23:42:09 - INFO - __main__ - Step 17367: {'lr': 0.0004868188224941086, 'samples': 3334464, 'steps': 17366, 'loss/train': 1.7640018463134766} 11/06/2021 23:42:10 - INFO - __main__ - Step 17368: {'lr': 0.0004868171220491886, 'samples': 3334656, 'steps': 17367, 'loss/train': 1.7070350646972656} 11/06/2021 23:42:10 - INFO - __main__ - Step 17369: {'lr': 0.00048681542149756253, 'samples': 3334848, 'steps': 17368, 'loss/train': 1.49398672580719} 11/06/2021 23:42:10 - INFO - __main__ - Step 17370: {'lr': 0.00048681372083923103, 'samples': 3335040, 'steps': 17369, 'loss/train': 1.4385478496551514} 11/06/2021 23:42:11 - INFO - __main__ - Step 17371: {'lr': 0.0004868120200741949, 'samples': 3335232, 'steps': 17370, 'loss/train': 1.6209843158721924} 11/06/2021 23:42:12 - INFO - __main__ - Step 17372: {'lr': 0.0004868103192024549, 'samples': 3335424, 'steps': 17371, 'loss/train': 1.3664069175720215} 11/06/2021 23:42:12 - INFO - __main__ - Step 17373: {'lr': 0.0004868086182240119, 'samples': 3335616, 'steps': 17372, 'loss/train': 1.0094228982925415} 11/06/2021 23:42:12 - INFO - __main__ - Step 17374: {'lr': 0.00048680691713886653, 'samples': 3335808, 'steps': 17373, 'loss/train': 1.8448344469070435} 11/06/2021 23:42:13 - INFO - __main__ - Step 17375: {'lr': 0.00048680521594701964, 'samples': 3336000, 'steps': 17374, 'loss/train': 1.669421672821045} 11/06/2021 23:42:13 - INFO - __main__ - Step 17376: {'lr': 0.00048680351464847207, 'samples': 3336192, 'steps': 17375, 'loss/train': 1.7565194368362427} 11/06/2021 23:42:14 - INFO - __main__ - Step 17377: {'lr': 0.00048680181324322437, 'samples': 3336384, 'steps': 17376, 'loss/train': 1.2871259450912476} 11/06/2021 23:42:15 - INFO - __main__ - Step 17378: {'lr': 0.00048680011173127746, 'samples': 3336576, 'steps': 17377, 'loss/train': 5.979392051696777} 11/06/2021 23:42:15 - INFO - __main__ - Step 17379: {'lr': 0.00048679841011263204, 'samples': 3336768, 'steps': 17378, 'loss/train': 1.605148434638977} 11/06/2021 23:42:15 - INFO - __main__ - Step 17380: {'lr': 0.00048679670838728894, 'samples': 3336960, 'steps': 17379, 'loss/train': 1.5023822784423828} 11/06/2021 23:42:16 - INFO - __main__ - Step 17381: {'lr': 0.0004867950065552489, 'samples': 3337152, 'steps': 17380, 'loss/train': 1.3871039152145386} 11/06/2021 23:42:17 - INFO - __main__ - Step 17382: {'lr': 0.00048679330461651275, 'samples': 3337344, 'steps': 17381, 'loss/train': 1.6298061609268188} 11/06/2021 23:42:17 - INFO - __main__ - Step 17383: {'lr': 0.00048679160257108107, 'samples': 3337536, 'steps': 17382, 'loss/train': 0.9420875906944275} 11/06/2021 23:42:17 - INFO - __main__ - Step 17384: {'lr': 0.00048678990041895484, 'samples': 3337728, 'steps': 17383, 'loss/train': 1.7141159772872925} 11/06/2021 23:42:18 - INFO - __main__ - Step 17385: {'lr': 0.00048678819816013467, 'samples': 3337920, 'steps': 17384, 'loss/train': 1.5371354818344116} 11/06/2021 23:42:18 - INFO - __main__ - Step 17386: {'lr': 0.0004867864957946214, 'samples': 3338112, 'steps': 17385, 'loss/train': 1.0904605388641357} 11/06/2021 23:42:19 - INFO - __main__ - Step 17387: {'lr': 0.0004867847933224158, 'samples': 3338304, 'steps': 17386, 'loss/train': 1.6519601345062256} 11/06/2021 23:42:20 - INFO - __main__ - Step 17388: {'lr': 0.0004867830907435187, 'samples': 3338496, 'steps': 17387, 'loss/train': 5.8294854164123535} 11/06/2021 23:42:20 - INFO - __main__ - Step 17389: {'lr': 0.0004867813880579307, 'samples': 3338688, 'steps': 17388, 'loss/train': 1.8847965002059937} 11/06/2021 23:42:20 - INFO - __main__ - Step 17390: {'lr': 0.0004867796852656527, 'samples': 3338880, 'steps': 17389, 'loss/train': 1.6456767320632935} 11/06/2021 23:42:21 - INFO - __main__ - Step 17391: {'lr': 0.00048677798236668537, 'samples': 3339072, 'steps': 17390, 'loss/train': 1.7139424085617065} 11/06/2021 23:42:21 - INFO - __main__ - Step 17392: {'lr': 0.00048677627936102966, 'samples': 3339264, 'steps': 17391, 'loss/train': 1.8065518140792847} 11/06/2021 23:42:22 - INFO - __main__ - Step 17393: {'lr': 0.0004867745762486861, 'samples': 3339456, 'steps': 17392, 'loss/train': 2.5346922874450684} 11/06/2021 23:42:22 - INFO - __main__ - Step 17394: {'lr': 0.0004867728730296556, 'samples': 3339648, 'steps': 17393, 'loss/train': 1.5756046772003174} 11/06/2021 23:42:23 - INFO - __main__ - Step 17395: {'lr': 0.0004867711697039389, 'samples': 3339840, 'steps': 17394, 'loss/train': 1.0653622150421143} 11/06/2021 23:42:23 - INFO - __main__ - Step 17396: {'lr': 0.00048676946627153675, 'samples': 3340032, 'steps': 17395, 'loss/train': 1.5175786018371582} 11/06/2021 23:42:24 - INFO - __main__ - Step 17397: {'lr': 0.00048676776273244994, 'samples': 3340224, 'steps': 17396, 'loss/train': 1.1968095302581787} 11/06/2021 23:42:24 - INFO - __main__ - Step 17398: {'lr': 0.00048676605908667926, 'samples': 3340416, 'steps': 17397, 'loss/train': 1.3067097663879395} 11/06/2021 23:42:25 - INFO - __main__ - Step 17399: {'lr': 0.00048676435533422536, 'samples': 3340608, 'steps': 17398, 'loss/train': 2.1273090839385986} 11/06/2021 23:42:25 - INFO - __main__ - Step 17400: {'lr': 0.00048676265147508917, 'samples': 3340800, 'steps': 17399, 'loss/train': 1.887537956237793} 11/06/2021 23:42:26 - INFO - __main__ - Step 17401: {'lr': 0.00048676094750927144, 'samples': 3340992, 'steps': 17400, 'loss/train': 1.2887080907821655} 11/06/2021 23:42:26 - INFO - __main__ - Step 17402: {'lr': 0.0004867592434367728, 'samples': 3341184, 'steps': 17401, 'loss/train': 1.5143498182296753} 11/06/2021 23:42:26 - INFO - __main__ - Step 17403: {'lr': 0.0004867575392575941, 'samples': 3341376, 'steps': 17402, 'loss/train': 1.996867299079895} 11/06/2021 23:42:27 - INFO - __main__ - Step 17404: {'lr': 0.0004867558349717361, 'samples': 3341568, 'steps': 17403, 'loss/train': 1.7546355724334717} 11/06/2021 23:42:28 - INFO - __main__ - Step 17405: {'lr': 0.0004867541305791996, 'samples': 3341760, 'steps': 17404, 'loss/train': 1.6053557395935059} 11/06/2021 23:42:28 - INFO - __main__ - Step 17406: {'lr': 0.00048675242607998533, 'samples': 3341952, 'steps': 17405, 'loss/train': 1.5730146169662476} 11/06/2021 23:42:28 - INFO - __main__ - Step 17407: {'lr': 0.00048675072147409405, 'samples': 3342144, 'steps': 17406, 'loss/train': 1.472575068473816} 11/06/2021 23:42:29 - INFO - __main__ - Step 17408: {'lr': 0.0004867490167615266, 'samples': 3342336, 'steps': 17407, 'loss/train': 1.358944296836853} 11/06/2021 23:42:30 - INFO - __main__ - Step 17409: {'lr': 0.0004867473119422837, 'samples': 3342528, 'steps': 17408, 'loss/train': 1.0930287837982178} 11/06/2021 23:42:30 - INFO - __main__ - Step 17410: {'lr': 0.00048674560701636606, 'samples': 3342720, 'steps': 17409, 'loss/train': 1.447057843208313} 11/06/2021 23:42:30 - INFO - __main__ - Step 17411: {'lr': 0.0004867439019837745, 'samples': 3342912, 'steps': 17410, 'loss/train': 1.6748888492584229} 11/06/2021 23:42:31 - INFO - __main__ - Step 17412: {'lr': 0.00048674219684450985, 'samples': 3343104, 'steps': 17411, 'loss/train': 1.5127593278884888} 11/06/2021 23:42:31 - INFO - __main__ - Step 17413: {'lr': 0.00048674049159857277, 'samples': 3343296, 'steps': 17412, 'loss/train': 1.2442376613616943} 11/06/2021 23:42:32 - INFO - __main__ - Step 17414: {'lr': 0.0004867387862459641, 'samples': 3343488, 'steps': 17413, 'loss/train': 1.1401019096374512} 11/06/2021 23:42:33 - INFO - __main__ - Step 17415: {'lr': 0.0004867370807866845, 'samples': 3343680, 'steps': 17414, 'loss/train': 1.5754122734069824} 11/06/2021 23:42:33 - INFO - __main__ - Step 17416: {'lr': 0.000486735375220735, 'samples': 3343872, 'steps': 17415, 'loss/train': 1.994115948677063} 11/06/2021 23:42:33 - INFO - __main__ - Step 17417: {'lr': 0.00048673366954811605, 'samples': 3344064, 'steps': 17416, 'loss/train': 1.5085574388504028} 11/06/2021 23:42:34 - INFO - __main__ - Step 17418: {'lr': 0.0004867319637688286, 'samples': 3344256, 'steps': 17417, 'loss/train': 1.8701719045639038} 11/06/2021 23:42:34 - INFO - __main__ - Step 17419: {'lr': 0.0004867302578828734, 'samples': 3344448, 'steps': 17418, 'loss/train': 1.6169779300689697} 11/06/2021 23:42:35 - INFO - __main__ - Step 17420: {'lr': 0.0004867285518902512, 'samples': 3344640, 'steps': 17419, 'loss/train': 0.6736646294593811} 11/06/2021 23:42:35 - INFO - __main__ - Step 17421: {'lr': 0.0004867268457909627, 'samples': 3344832, 'steps': 17420, 'loss/train': 1.7313899993896484} 11/06/2021 23:42:36 - INFO - __main__ - Step 17422: {'lr': 0.0004867251395850088, 'samples': 3345024, 'steps': 17421, 'loss/train': 1.910513162612915} 11/06/2021 23:42:36 - INFO - __main__ - Step 17423: {'lr': 0.00048672343327239024, 'samples': 3345216, 'steps': 17422, 'loss/train': 1.4655964374542236} 11/06/2021 23:42:36 - INFO - __main__ - Step 17424: {'lr': 0.00048672172685310767, 'samples': 3345408, 'steps': 17423, 'loss/train': 1.7403993606567383} 11/06/2021 23:42:38 - INFO - __main__ - Step 17425: {'lr': 0.000486720020327162, 'samples': 3345600, 'steps': 17424, 'loss/train': 1.7060999870300293} 11/06/2021 23:42:38 - INFO - __main__ - Step 17426: {'lr': 0.00048671831369455386, 'samples': 3345792, 'steps': 17425, 'loss/train': 1.8607850074768066} 11/06/2021 23:42:38 - INFO - __main__ - Step 17427: {'lr': 0.0004867166069552842, 'samples': 3345984, 'steps': 17426, 'loss/train': 2.2483069896698} 11/06/2021 23:42:39 - INFO - __main__ - Step 17428: {'lr': 0.00048671490010935366, 'samples': 3346176, 'steps': 17427, 'loss/train': 1.5636916160583496} 11/06/2021 23:42:39 - INFO - __main__ - Step 17429: {'lr': 0.00048671319315676305, 'samples': 3346368, 'steps': 17428, 'loss/train': 1.3616186380386353} 11/06/2021 23:42:39 - INFO - __main__ - Step 17430: {'lr': 0.00048671148609751307, 'samples': 3346560, 'steps': 17429, 'loss/train': 2.639005422592163} 11/06/2021 23:42:41 - INFO - __main__ - Step 17431: {'lr': 0.0004867097789316046, 'samples': 3346752, 'steps': 17430, 'loss/train': 1.3610199689865112} 11/06/2021 23:42:41 - INFO - __main__ - Step 17432: {'lr': 0.0004867080716590384, 'samples': 3346944, 'steps': 17431, 'loss/train': 1.6907752752304077} 11/06/2021 23:42:41 - INFO - __main__ - Step 17433: {'lr': 0.0004867063642798151, 'samples': 3347136, 'steps': 17432, 'loss/train': 0.8346073627471924} 11/06/2021 23:42:42 - INFO - __main__ - Step 17434: {'lr': 0.0004867046567939356, 'samples': 3347328, 'steps': 17433, 'loss/train': 0.30705493688583374} 11/06/2021 23:42:42 - INFO - __main__ - Step 17435: {'lr': 0.00048670294920140063, 'samples': 3347520, 'steps': 17434, 'loss/train': 1.677636981010437} 11/06/2021 23:42:42 - INFO - __main__ - Step 17436: {'lr': 0.00048670124150221094, 'samples': 3347712, 'steps': 17435, 'loss/train': 1.5929261445999146} 11/06/2021 23:42:43 - INFO - __main__ - Step 17437: {'lr': 0.00048669953369636737, 'samples': 3347904, 'steps': 17436, 'loss/train': 1.458001971244812} 11/06/2021 23:42:44 - INFO - __main__ - Step 17438: {'lr': 0.00048669782578387067, 'samples': 3348096, 'steps': 17437, 'loss/train': 1.7061351537704468} 11/06/2021 23:42:44 - INFO - __main__ - Step 17439: {'lr': 0.00048669611776472153, 'samples': 3348288, 'steps': 17438, 'loss/train': 1.6646876335144043} 11/06/2021 23:42:44 - INFO - __main__ - Step 17440: {'lr': 0.00048669440963892074, 'samples': 3348480, 'steps': 17439, 'loss/train': 1.8037528991699219} 11/06/2021 23:42:45 - INFO - __main__ - Step 17441: {'lr': 0.00048669270140646914, 'samples': 3348672, 'steps': 17440, 'loss/train': 1.4020469188690186} 11/06/2021 23:42:46 - INFO - __main__ - Step 17442: {'lr': 0.0004866909930673675, 'samples': 3348864, 'steps': 17441, 'loss/train': 1.5615054368972778} 11/06/2021 23:42:46 - INFO - __main__ - Step 17443: {'lr': 0.00048668928462161653, 'samples': 3349056, 'steps': 17442, 'loss/train': 1.6594544649124146} 11/06/2021 23:42:46 - INFO - __main__ - Step 17444: {'lr': 0.000486687576069217, 'samples': 3349248, 'steps': 17443, 'loss/train': 1.418339729309082} 11/06/2021 23:42:47 - INFO - __main__ - Step 17445: {'lr': 0.00048668586741016967, 'samples': 3349440, 'steps': 17444, 'loss/train': 1.9371867179870605} 11/06/2021 23:42:47 - INFO - __main__ - Step 17446: {'lr': 0.0004866841586444754, 'samples': 3349632, 'steps': 17445, 'loss/train': 1.3474305868148804} 11/06/2021 23:42:49 - INFO - __main__ - Step 17447: {'lr': 0.0004866824497721349, 'samples': 3349824, 'steps': 17446, 'loss/train': 0.8789267539978027} 11/06/2021 23:42:49 - INFO - __main__ - Step 17448: {'lr': 0.0004866807407931489, 'samples': 3350016, 'steps': 17447, 'loss/train': 1.4513202905654907} 11/06/2021 23:42:49 - INFO - __main__ - Step 17449: {'lr': 0.0004866790317075182, 'samples': 3350208, 'steps': 17448, 'loss/train': 1.5578341484069824} 11/06/2021 23:42:50 - INFO - __main__ - Step 17450: {'lr': 0.00048667732251524365, 'samples': 3350400, 'steps': 17449, 'loss/train': 1.9759602546691895} 11/06/2021 23:42:50 - INFO - __main__ - Step 17451: {'lr': 0.0004866756132163259, 'samples': 3350592, 'steps': 17450, 'loss/train': 1.5698901414871216} 11/06/2021 23:42:50 - INFO - __main__ - Step 17452: {'lr': 0.0004866739038107658, 'samples': 3350784, 'steps': 17451, 'loss/train': 1.6254311800003052} 11/06/2021 23:42:51 - INFO - __main__ - Step 17453: {'lr': 0.000486672194298564, 'samples': 3350976, 'steps': 17452, 'loss/train': 1.6596328020095825} 11/06/2021 23:42:52 - INFO - __main__ - Step 17454: {'lr': 0.00048667048467972146, 'samples': 3351168, 'steps': 17453, 'loss/train': 1.0186972618103027} 11/06/2021 23:42:52 - INFO - __main__ - Step 17455: {'lr': 0.00048666877495423885, 'samples': 3351360, 'steps': 17454, 'loss/train': 1.5029501914978027} 11/06/2021 23:42:52 - INFO - __main__ - Step 17456: {'lr': 0.0004866670651221169, 'samples': 3351552, 'steps': 17455, 'loss/train': 1.6649607419967651} 11/06/2021 23:42:53 - INFO - __main__ - Step 17457: {'lr': 0.0004866653551833564, 'samples': 3351744, 'steps': 17456, 'loss/train': 1.0854140520095825} 11/06/2021 23:42:53 - INFO - __main__ - Step 17458: {'lr': 0.00048666364513795816, 'samples': 3351936, 'steps': 17457, 'loss/train': 1.361505389213562} 11/06/2021 23:42:54 - INFO - __main__ - Step 17459: {'lr': 0.00048666193498592304, 'samples': 3352128, 'steps': 17458, 'loss/train': 1.6825581789016724} 11/06/2021 23:42:55 - INFO - __main__ - Step 17460: {'lr': 0.0004866602247272516, 'samples': 3352320, 'steps': 17459, 'loss/train': 1.7726678848266602} 11/06/2021 23:42:55 - INFO - __main__ - Step 17461: {'lr': 0.0004866585143619447, 'samples': 3352512, 'steps': 17460, 'loss/train': 1.8011826276779175} 11/06/2021 23:42:55 - INFO - __main__ - Step 17462: {'lr': 0.00048665680389000315, 'samples': 3352704, 'steps': 17461, 'loss/train': 1.5437105894088745} 11/06/2021 23:42:56 - INFO - __main__ - Step 17463: {'lr': 0.0004866550933114277, 'samples': 3352896, 'steps': 17462, 'loss/train': 1.5845763683319092} 11/06/2021 23:42:57 - INFO - __main__ - Step 17464: {'lr': 0.00048665338262621915, 'samples': 3353088, 'steps': 17463, 'loss/train': 1.909666895866394} 11/06/2021 23:42:57 - INFO - __main__ - Step 17465: {'lr': 0.00048665167183437817, 'samples': 3353280, 'steps': 17464, 'loss/train': 1.7902839183807373} 11/06/2021 23:42:57 - INFO - __main__ - Step 17466: {'lr': 0.00048664996093590563, 'samples': 3353472, 'steps': 17465, 'loss/train': 1.6980420351028442} 11/06/2021 23:42:58 - INFO - __main__ - Step 17467: {'lr': 0.0004866482499308023, 'samples': 3353664, 'steps': 17466, 'loss/train': 1.636150598526001} 11/06/2021 23:42:58 - INFO - __main__ - Step 17468: {'lr': 0.0004866465388190689, 'samples': 3353856, 'steps': 17467, 'loss/train': 1.7070634365081787} 11/06/2021 23:42:59 - INFO - __main__ - Step 17469: {'lr': 0.0004866448276007062, 'samples': 3354048, 'steps': 17468, 'loss/train': 1.6868599653244019} 11/06/2021 23:42:59 - INFO - __main__ - Step 17470: {'lr': 0.000486643116275715, 'samples': 3354240, 'steps': 17469, 'loss/train': 1.1477397680282593} 11/06/2021 23:43:00 - INFO - __main__ - Step 17471: {'lr': 0.00048664140484409613, 'samples': 3354432, 'steps': 17470, 'loss/train': 2.090226173400879} 11/06/2021 23:43:00 - INFO - __main__ - Step 17472: {'lr': 0.0004866396933058502, 'samples': 3354624, 'steps': 17471, 'loss/train': 1.5251487493515015} 11/06/2021 23:43:00 - INFO - __main__ - Step 17473: {'lr': 0.00048663798166097814, 'samples': 3354816, 'steps': 17472, 'loss/train': 1.5208324193954468} 11/06/2021 23:43:01 - INFO - __main__ - Step 17474: {'lr': 0.0004866362699094806, 'samples': 3355008, 'steps': 17473, 'loss/train': 1.9953933954238892} 11/06/2021 23:43:02 - INFO - __main__ - Step 17475: {'lr': 0.0004866345580513585, 'samples': 3355200, 'steps': 17474, 'loss/train': 1.5545861721038818} 11/06/2021 23:43:02 - INFO - __main__ - Step 17476: {'lr': 0.0004866328460866124, 'samples': 3355392, 'steps': 17475, 'loss/train': 1.7343254089355469} 11/06/2021 23:43:03 - INFO - __main__ - Step 17477: {'lr': 0.0004866311340152433, 'samples': 3355584, 'steps': 17476, 'loss/train': 1.4653677940368652} 11/06/2021 23:43:03 - INFO - __main__ - Step 17478: {'lr': 0.0004866294218372518, 'samples': 3355776, 'steps': 17477, 'loss/train': 0.9723559617996216} 11/06/2021 23:43:03 - INFO - __main__ - Step 17479: {'lr': 0.0004866277095526387, 'samples': 3355968, 'steps': 17478, 'loss/train': 2.013335943222046} 11/06/2021 23:43:04 - INFO - __main__ - Step 17480: {'lr': 0.00048662599716140485, 'samples': 3356160, 'steps': 17479, 'loss/train': 0.8285233974456787} 11/06/2021 23:43:05 - INFO - __main__ - Step 17481: {'lr': 0.00048662428466355104, 'samples': 3356352, 'steps': 17480, 'loss/train': 1.3983826637268066} 11/06/2021 23:43:05 - INFO - __main__ - Step 17482: {'lr': 0.0004866225720590779, 'samples': 3356544, 'steps': 17481, 'loss/train': 1.461501955986023} 11/06/2021 23:43:05 - INFO - __main__ - Step 17483: {'lr': 0.00048662085934798627, 'samples': 3356736, 'steps': 17482, 'loss/train': 2.3746488094329834} 11/06/2021 23:43:06 - INFO - __main__ - Step 17484: {'lr': 0.00048661914653027694, 'samples': 3356928, 'steps': 17483, 'loss/train': 1.401358962059021} 11/06/2021 23:43:07 - INFO - __main__ - Step 17485: {'lr': 0.0004866174336059507, 'samples': 3357120, 'steps': 17484, 'loss/train': 1.5222047567367554} 11/06/2021 23:43:07 - INFO - __main__ - Step 17486: {'lr': 0.00048661572057500833, 'samples': 3357312, 'steps': 17485, 'loss/train': 1.8743535280227661} 11/06/2021 23:43:07 - INFO - __main__ - Step 17487: {'lr': 0.00048661400743745057, 'samples': 3357504, 'steps': 17486, 'loss/train': 1.2900893688201904} 11/06/2021 23:43:08 - INFO - __main__ - Step 17488: {'lr': 0.00048661229419327806, 'samples': 3357696, 'steps': 17487, 'loss/train': 2.1348865032196045} 11/06/2021 23:43:08 - INFO - __main__ - Step 17489: {'lr': 0.0004866105808424918, 'samples': 3357888, 'steps': 17488, 'loss/train': 1.581131100654602} 11/06/2021 23:43:09 - INFO - __main__ - Step 17490: {'lr': 0.0004866088673850925, 'samples': 3358080, 'steps': 17489, 'loss/train': 1.3917348384857178} 11/06/2021 23:43:09 - INFO - __main__ - Step 17491: {'lr': 0.0004866071538210808, 'samples': 3358272, 'steps': 17490, 'loss/train': 1.749387502670288} 11/06/2021 23:43:10 - INFO - __main__ - Step 17492: {'lr': 0.0004866054401504576, 'samples': 3358464, 'steps': 17491, 'loss/train': 1.3572897911071777} 11/06/2021 23:43:10 - INFO - __main__ - Step 17493: {'lr': 0.0004866037263732237, 'samples': 3358656, 'steps': 17492, 'loss/train': 1.8439452648162842} 11/06/2021 23:43:10 - INFO - __main__ - Step 17494: {'lr': 0.00048660201248937974, 'samples': 3358848, 'steps': 17493, 'loss/train': 1.8274236917495728} 11/06/2021 23:43:12 - INFO - __main__ - Step 17495: {'lr': 0.0004866002984989266, 'samples': 3359040, 'steps': 17494, 'loss/train': 1.4386416673660278} 11/06/2021 23:43:12 - INFO - __main__ - Step 17496: {'lr': 0.000486598584401865, 'samples': 3359232, 'steps': 17495, 'loss/train': 1.7358335256576538} 11/06/2021 23:43:12 - INFO - __main__ - Step 17497: {'lr': 0.0004865968701981958, 'samples': 3359424, 'steps': 17496, 'loss/train': 1.8483222723007202} 11/06/2021 23:43:13 - INFO - __main__ - Step 17498: {'lr': 0.0004865951558879196, 'samples': 3359616, 'steps': 17497, 'loss/train': 1.65242600440979} 11/06/2021 23:43:13 - INFO - __main__ - Step 17499: {'lr': 0.00048659344147103725, 'samples': 3359808, 'steps': 17498, 'loss/train': 1.2766807079315186} 11/06/2021 23:43:14 - INFO - __main__ - Step 17500: {'lr': 0.0004865917269475496, 'samples': 3360000, 'steps': 17499, 'loss/train': 1.5660682916641235} 11/06/2021 23:43:14 - INFO - __main__ - Step 17501: {'lr': 0.00048659001231745734, 'samples': 3360192, 'steps': 17500, 'loss/train': 1.4074636697769165} 11/06/2021 23:43:15 - INFO - __main__ - Step 17502: {'lr': 0.0004865882975807614, 'samples': 3360384, 'steps': 17501, 'loss/train': 1.3170610666275024} 11/06/2021 23:43:15 - INFO - __main__ - Step 17503: {'lr': 0.00048658658273746224, 'samples': 3360576, 'steps': 17502, 'loss/train': 1.7989710569381714} 11/06/2021 23:43:15 - INFO - __main__ - Step 17504: {'lr': 0.00048658486778756097, 'samples': 3360768, 'steps': 17503, 'loss/train': 1.5153037309646606} 11/06/2021 23:43:16 - INFO - __main__ - Step 17505: {'lr': 0.0004865831527310581, 'samples': 3360960, 'steps': 17504, 'loss/train': 1.4141513109207153} 11/06/2021 23:43:17 - INFO - __main__ - Step 17506: {'lr': 0.00048658143756795456, 'samples': 3361152, 'steps': 17505, 'loss/train': 1.6811589002609253} 11/06/2021 23:43:17 - INFO - __main__ - Step 17507: {'lr': 0.0004865797222982511, 'samples': 3361344, 'steps': 17506, 'loss/train': 1.6422348022460938} 11/06/2021 23:43:17 - INFO - __main__ - Step 17508: {'lr': 0.0004865780069219484, 'samples': 3361536, 'steps': 17507, 'loss/train': 1.3525933027267456} 11/06/2021 23:43:18 - INFO - __main__ - Step 17509: {'lr': 0.00048657629143904733, 'samples': 3361728, 'steps': 17508, 'loss/train': 1.62557053565979} 11/06/2021 23:43:19 - INFO - __main__ - Step 17510: {'lr': 0.0004865745758495487, 'samples': 3361920, 'steps': 17509, 'loss/train': 1.487194299697876} 11/06/2021 23:43:19 - INFO - __main__ - Step 17511: {'lr': 0.00048657286015345313, 'samples': 3362112, 'steps': 17510, 'loss/train': 1.7892093658447266} 11/06/2021 23:43:19 - INFO - __main__ - Step 17512: {'lr': 0.00048657114435076153, 'samples': 3362304, 'steps': 17511, 'loss/train': 1.6680396795272827} 11/06/2021 23:43:20 - INFO - __main__ - Step 17513: {'lr': 0.00048656942844147464, 'samples': 3362496, 'steps': 17512, 'loss/train': 3.819169282913208} 11/06/2021 23:43:20 - INFO - __main__ - Step 17514: {'lr': 0.00048656771242559316, 'samples': 3362688, 'steps': 17513, 'loss/train': 1.190946340560913} 11/06/2021 23:43:20 - INFO - __main__ - Step 17515: {'lr': 0.0004865659963031179, 'samples': 3362880, 'steps': 17514, 'loss/train': 1.7531355619430542} 11/06/2021 23:43:22 - INFO - __main__ - Step 17516: {'lr': 0.0004865642800740497, 'samples': 3363072, 'steps': 17515, 'loss/train': 1.5190026760101318} 11/06/2021 23:43:22 - INFO - __main__ - Step 17517: {'lr': 0.0004865625637383893, 'samples': 3363264, 'steps': 17516, 'loss/train': 1.2556828260421753} 11/06/2021 23:43:22 - INFO - __main__ - Step 17518: {'lr': 0.00048656084729613747, 'samples': 3363456, 'steps': 17517, 'loss/train': 1.2229467630386353} 11/06/2021 23:43:23 - INFO - __main__ - Step 17519: {'lr': 0.0004865591307472949, 'samples': 3363648, 'steps': 17518, 'loss/train': 1.7990758419036865} 11/06/2021 23:43:23 - INFO - __main__ - Step 17520: {'lr': 0.0004865574140918625, 'samples': 3363840, 'steps': 17519, 'loss/train': 1.7815418243408203} 11/06/2021 23:43:24 - INFO - __main__ - Step 17521: {'lr': 0.00048655569732984096, 'samples': 3364032, 'steps': 17520, 'loss/train': 1.8178268671035767} 11/06/2021 23:43:24 - INFO - __main__ - Step 17522: {'lr': 0.000486553980461231, 'samples': 3364224, 'steps': 17521, 'loss/train': 1.6401214599609375} 11/06/2021 23:43:25 - INFO - __main__ - Step 17523: {'lr': 0.0004865522634860335, 'samples': 3364416, 'steps': 17522, 'loss/train': 1.9060850143432617} 11/06/2021 23:43:25 - INFO - __main__ - Step 17524: {'lr': 0.00048655054640424936, 'samples': 3364608, 'steps': 17523, 'loss/train': 1.5981122255325317} 11/06/2021 23:43:25 - INFO - __main__ - Step 17525: {'lr': 0.00048654882921587907, 'samples': 3364800, 'steps': 17524, 'loss/train': 1.5546878576278687} 11/06/2021 23:43:27 - INFO - __main__ - Step 17526: {'lr': 0.00048654711192092347, 'samples': 3364992, 'steps': 17525, 'loss/train': 1.4803307056427002} 11/06/2021 23:43:27 - INFO - __main__ - Step 17527: {'lr': 0.0004865453945193835, 'samples': 3365184, 'steps': 17526, 'loss/train': 1.9428281784057617} 11/06/2021 23:43:28 - INFO - __main__ - Step 17528: {'lr': 0.00048654367701125975, 'samples': 3365376, 'steps': 17527, 'loss/train': 1.6251273155212402} 11/06/2021 23:43:28 - INFO - __main__ - Step 17529: {'lr': 0.0004865419593965531, 'samples': 3365568, 'steps': 17528, 'loss/train': 1.402294397354126} 11/06/2021 23:43:28 - INFO - __main__ - Step 17530: {'lr': 0.0004865402416752642, 'samples': 3365760, 'steps': 17529, 'loss/train': 1.7213939428329468} 11/06/2021 23:43:29 - INFO - __main__ - Step 17531: {'lr': 0.0004865385238473941, 'samples': 3365952, 'steps': 17530, 'loss/train': 1.3953073024749756} 11/06/2021 23:43:30 - INFO - __main__ - Step 17532: {'lr': 0.00048653680591294324, 'samples': 3366144, 'steps': 17531, 'loss/train': 1.9114352464675903} 11/06/2021 23:43:30 - INFO - __main__ - Step 17533: {'lr': 0.00048653508787191256, 'samples': 3366336, 'steps': 17532, 'loss/train': 1.8594201803207397} 11/06/2021 23:43:30 - INFO - __main__ - Step 17534: {'lr': 0.00048653336972430297, 'samples': 3366528, 'steps': 17533, 'loss/train': 1.5995358228683472} 11/06/2021 23:43:31 - INFO - __main__ - Step 17535: {'lr': 0.0004865316514701149, 'samples': 3366720, 'steps': 17534, 'loss/train': 1.5479685068130493} 11/06/2021 23:43:32 - INFO - __main__ - Step 17536: {'lr': 0.0004865299331093495, 'samples': 3366912, 'steps': 17535, 'loss/train': 1.3341885805130005} 11/06/2021 23:43:32 - INFO - __main__ - Step 17537: {'lr': 0.0004865282146420072, 'samples': 3367104, 'steps': 17536, 'loss/train': 0.8059349060058594} 11/06/2021 23:43:32 - INFO - __main__ - Step 17538: {'lr': 0.000486526496068089, 'samples': 3367296, 'steps': 17537, 'loss/train': 1.6529076099395752} 11/06/2021 23:43:33 - INFO - __main__ - Step 17539: {'lr': 0.0004865247773875956, 'samples': 3367488, 'steps': 17538, 'loss/train': 1.8571265935897827} 11/06/2021 23:43:33 - INFO - __main__ - Step 17540: {'lr': 0.0004865230586005278, 'samples': 3367680, 'steps': 17539, 'loss/train': 1.6223375797271729} 11/06/2021 23:43:33 - INFO - __main__ - Step 17541: {'lr': 0.00048652133970688633, 'samples': 3367872, 'steps': 17540, 'loss/train': 0.8137564063072205} 11/06/2021 23:43:35 - INFO - __main__ - Step 17542: {'lr': 0.00048651962070667197, 'samples': 3368064, 'steps': 17541, 'loss/train': 1.4748502969741821} 11/06/2021 23:43:35 - INFO - __main__ - Step 17543: {'lr': 0.00048651790159988563, 'samples': 3368256, 'steps': 17542, 'loss/train': 1.915865421295166} 11/06/2021 23:43:35 - INFO - __main__ - Step 17544: {'lr': 0.0004865161823865279, 'samples': 3368448, 'steps': 17543, 'loss/train': 1.7708138227462769} 11/06/2021 23:43:36 - INFO - __main__ - Step 17545: {'lr': 0.0004865144630665996, 'samples': 3368640, 'steps': 17544, 'loss/train': 1.7487539052963257} 11/06/2021 23:43:36 - INFO - __main__ - Step 17546: {'lr': 0.0004865127436401016, 'samples': 3368832, 'steps': 17545, 'loss/train': 0.35892412066459656} 11/06/2021 23:43:37 - INFO - __main__ - Step 17547: {'lr': 0.00048651102410703464, 'samples': 3369024, 'steps': 17546, 'loss/train': 1.7375950813293457} 11/06/2021 23:43:37 - INFO - __main__ - Step 17548: {'lr': 0.00048650930446739936, 'samples': 3369216, 'steps': 17547, 'loss/train': 1.6162850856781006} 11/06/2021 23:43:38 - INFO - __main__ - Step 17549: {'lr': 0.00048650758472119666, 'samples': 3369408, 'steps': 17548, 'loss/train': 1.576440453529358} 11/06/2021 23:43:38 - INFO - __main__ - Step 17550: {'lr': 0.0004865058648684273, 'samples': 3369600, 'steps': 17549, 'loss/train': 1.9157060384750366} 11/06/2021 23:43:38 - INFO - __main__ - Step 17551: {'lr': 0.00048650414490909207, 'samples': 3369792, 'steps': 17550, 'loss/train': 1.5747535228729248} 11/06/2021 23:43:40 - INFO - __main__ - Step 17552: {'lr': 0.00048650242484319175, 'samples': 3369984, 'steps': 17551, 'loss/train': 1.121293544769287} 11/06/2021 23:43:40 - INFO - __main__ - Step 17553: {'lr': 0.000486500704670727, 'samples': 3370176, 'steps': 17552, 'loss/train': 1.5847798585891724} 11/06/2021 23:43:40 - INFO - __main__ - Step 17554: {'lr': 0.0004864989843916987, 'samples': 3370368, 'steps': 17553, 'loss/train': 1.2869236469268799} 11/06/2021 23:43:41 - INFO - __main__ - Step 17555: {'lr': 0.0004864972640061077, 'samples': 3370560, 'steps': 17554, 'loss/train': 2.118384838104248} 11/06/2021 23:43:41 - INFO - __main__ - Step 17556: {'lr': 0.00048649554351395453, 'samples': 3370752, 'steps': 17555, 'loss/train': 1.5035825967788696} 11/06/2021 23:43:42 - INFO - __main__ - Step 17557: {'lr': 0.00048649382291524024, 'samples': 3370944, 'steps': 17556, 'loss/train': 1.856012225151062} 11/06/2021 23:43:42 - INFO - __main__ - Step 17558: {'lr': 0.0004864921022099654, 'samples': 3371136, 'steps': 17557, 'loss/train': 1.5416059494018555} 11/06/2021 23:43:43 - INFO - __main__ - Step 17559: {'lr': 0.00048649038139813097, 'samples': 3371328, 'steps': 17558, 'loss/train': 2.0964815616607666} 11/06/2021 23:43:43 - INFO - __main__ - Step 17560: {'lr': 0.00048648866047973756, 'samples': 3371520, 'steps': 17559, 'loss/train': 1.6892518997192383} 11/06/2021 23:43:43 - INFO - __main__ - Step 17561: {'lr': 0.000486486939454786, 'samples': 3371712, 'steps': 17560, 'loss/train': 1.6873033046722412} 11/06/2021 23:43:44 - INFO - __main__ - Step 17562: {'lr': 0.0004864852183232771, 'samples': 3371904, 'steps': 17561, 'loss/train': 1.7886723279953003} 11/06/2021 23:43:45 - INFO - __main__ - Step 17563: {'lr': 0.0004864834970852116, 'samples': 3372096, 'steps': 17562, 'loss/train': 1.7234721183776855} 11/06/2021 23:43:45 - INFO - __main__ - Step 17564: {'lr': 0.0004864817757405903, 'samples': 3372288, 'steps': 17563, 'loss/train': 1.9006706476211548} 11/06/2021 23:43:45 - INFO - __main__ - Step 17565: {'lr': 0.0004864800542894139, 'samples': 3372480, 'steps': 17564, 'loss/train': 1.8513051271438599} 11/06/2021 23:43:46 - INFO - __main__ - Step 17566: {'lr': 0.0004864783327316833, 'samples': 3372672, 'steps': 17565, 'loss/train': 1.8104248046875} 11/06/2021 23:43:46 - INFO - __main__ - Step 17567: {'lr': 0.0004864766110673992, 'samples': 3372864, 'steps': 17566, 'loss/train': 1.8728556632995605} 11/06/2021 23:43:47 - INFO - __main__ - Step 17568: {'lr': 0.00048647488929656237, 'samples': 3373056, 'steps': 17567, 'loss/train': 2.102489709854126} 11/06/2021 23:43:47 - INFO - __main__ - Step 17569: {'lr': 0.00048647316741917365, 'samples': 3373248, 'steps': 17568, 'loss/train': 1.702294945716858} 11/06/2021 23:43:48 - INFO - __main__ - Step 17570: {'lr': 0.0004864714454352337, 'samples': 3373440, 'steps': 17569, 'loss/train': 1.3519634008407593} 11/06/2021 23:43:48 - INFO - __main__ - Step 17571: {'lr': 0.00048646972334474343, 'samples': 3373632, 'steps': 17570, 'loss/train': 1.59146249294281} 11/06/2021 23:43:49 - INFO - __main__ - Step 17572: {'lr': 0.0004864680011477035, 'samples': 3373824, 'steps': 17571, 'loss/train': 2.0730271339416504} 11/06/2021 23:43:50 - INFO - __main__ - Step 17573: {'lr': 0.00048646627884411475, 'samples': 3374016, 'steps': 17572, 'loss/train': 1.1947308778762817} 11/06/2021 23:43:50 - INFO - __main__ - Step 17574: {'lr': 0.00048646455643397803, 'samples': 3374208, 'steps': 17573, 'loss/train': 1.3798284530639648} 11/06/2021 23:43:50 - INFO - __main__ - Step 17575: {'lr': 0.0004864628339172939, 'samples': 3374400, 'steps': 17574, 'loss/train': 1.0694053173065186} 11/06/2021 23:43:51 - INFO - __main__ - Step 17576: {'lr': 0.00048646111129406336, 'samples': 3374592, 'steps': 17575, 'loss/train': 1.4696110486984253} 11/06/2021 23:43:51 - INFO - __main__ - Step 17577: {'lr': 0.00048645938856428704, 'samples': 3374784, 'steps': 17576, 'loss/train': 1.9787176847457886} 11/06/2021 23:43:52 - INFO - __main__ - Step 17578: {'lr': 0.0004864576657279658, 'samples': 3374976, 'steps': 17577, 'loss/train': 1.9284498691558838} 11/06/2021 23:43:52 - INFO - __main__ - Step 17579: {'lr': 0.0004864559427851003, 'samples': 3375168, 'steps': 17578, 'loss/train': 1.340123176574707} 11/06/2021 23:43:53 - INFO - __main__ - Step 17580: {'lr': 0.0004864542197356915, 'samples': 3375360, 'steps': 17579, 'loss/train': 1.7503732442855835} 11/06/2021 23:43:53 - INFO - __main__ - Step 17581: {'lr': 0.00048645249657974007, 'samples': 3375552, 'steps': 17580, 'loss/train': 1.566757082939148} 11/06/2021 23:43:53 - INFO - __main__ - Step 17582: {'lr': 0.00048645077331724675, 'samples': 3375744, 'steps': 17581, 'loss/train': 1.6894330978393555} 11/06/2021 23:43:54 - INFO - __main__ - Step 17583: {'lr': 0.00048644904994821236, 'samples': 3375936, 'steps': 17582, 'loss/train': 1.7292306423187256} 11/06/2021 23:43:55 - INFO - __main__ - Step 17584: {'lr': 0.0004864473264726377, 'samples': 3376128, 'steps': 17583, 'loss/train': 1.413895606994629} 11/06/2021 23:43:55 - INFO - __main__ - Step 17585: {'lr': 0.00048644560289052354, 'samples': 3376320, 'steps': 17584, 'loss/train': 0.6992756128311157} 11/06/2021 23:43:55 - INFO - __main__ - Step 17586: {'lr': 0.0004864438792018706, 'samples': 3376512, 'steps': 17585, 'loss/train': 1.55809485912323} 11/06/2021 23:43:56 - INFO - __main__ - Step 17587: {'lr': 0.0004864421554066797, 'samples': 3376704, 'steps': 17586, 'loss/train': 1.9367111921310425} 11/06/2021 23:43:56 - INFO - __main__ - Step 17588: {'lr': 0.00048644043150495165, 'samples': 3376896, 'steps': 17587, 'loss/train': 1.8680219650268555} 11/06/2021 23:43:57 - INFO - __main__ - Step 17589: {'lr': 0.00048643870749668717, 'samples': 3377088, 'steps': 17588, 'loss/train': 1.4389081001281738} 11/06/2021 23:43:58 - INFO - __main__ - Step 17590: {'lr': 0.000486436983381887, 'samples': 3377280, 'steps': 17589, 'loss/train': 1.7606362104415894} 11/06/2021 23:43:58 - INFO - __main__ - Step 17591: {'lr': 0.0004864352591605521, 'samples': 3377472, 'steps': 17590, 'loss/train': 1.7748441696166992} 11/06/2021 23:43:58 - INFO - __main__ - Step 17592: {'lr': 0.00048643353483268306, 'samples': 3377664, 'steps': 17591, 'loss/train': 1.770916223526001} 11/06/2021 23:43:59 - INFO - __main__ - Step 17593: {'lr': 0.00048643181039828066, 'samples': 3377856, 'steps': 17592, 'loss/train': 1.82923424243927} 11/06/2021 23:44:00 - INFO - __main__ - Step 17594: {'lr': 0.00048643008585734575, 'samples': 3378048, 'steps': 17593, 'loss/train': 1.802532434463501} 11/06/2021 23:44:00 - INFO - __main__ - Step 17595: {'lr': 0.00048642836120987913, 'samples': 3378240, 'steps': 17594, 'loss/train': 2.2226874828338623} 11/06/2021 23:44:00 - INFO - __main__ - Step 17596: {'lr': 0.0004864266364558816, 'samples': 3378432, 'steps': 17595, 'loss/train': 1.8610355854034424} 11/06/2021 23:44:01 - INFO - __main__ - Step 17597: {'lr': 0.00048642491159535373, 'samples': 3378624, 'steps': 17596, 'loss/train': 1.6837226152420044} 11/06/2021 23:44:01 - INFO - __main__ - Step 17598: {'lr': 0.0004864231866282965, 'samples': 3378816, 'steps': 17597, 'loss/train': 2.052727460861206} 11/06/2021 23:44:02 - INFO - __main__ - Step 17599: {'lr': 0.0004864214615547107, 'samples': 3379008, 'steps': 17598, 'loss/train': 0.28922533988952637} 11/06/2021 23:44:02 - INFO - __main__ - Step 17600: {'lr': 0.000486419736374597, 'samples': 3379200, 'steps': 17599, 'loss/train': 1.3674170970916748} 11/06/2021 23:44:03 - INFO - __main__ - Step 17601: {'lr': 0.0004864180110879562, 'samples': 3379392, 'steps': 17600, 'loss/train': 1.1933001279830933} 11/06/2021 23:44:03 - INFO - __main__ - Step 17602: {'lr': 0.00048641628569478916, 'samples': 3379584, 'steps': 17601, 'loss/train': 1.8393548727035522} 11/06/2021 23:44:03 - INFO - __main__ - Step 17603: {'lr': 0.00048641456019509643, 'samples': 3379776, 'steps': 17602, 'loss/train': 1.6176362037658691} 11/06/2021 23:44:04 - INFO - __main__ - Step 17604: {'lr': 0.0004864128345888791, 'samples': 3379968, 'steps': 17603, 'loss/train': 1.574226975440979} 11/06/2021 23:44:05 - INFO - __main__ - Step 17605: {'lr': 0.0004864111088761377, 'samples': 3380160, 'steps': 17604, 'loss/train': 1.8493263721466064} 11/06/2021 23:44:05 - INFO - __main__ - Step 17606: {'lr': 0.00048640938305687315, 'samples': 3380352, 'steps': 17605, 'loss/train': 1.4664735794067383} 11/06/2021 23:44:06 - INFO - __main__ - Step 17607: {'lr': 0.00048640765713108615, 'samples': 3380544, 'steps': 17606, 'loss/train': 0.9904863238334656} 11/06/2021 23:44:06 - INFO - __main__ - Step 17608: {'lr': 0.00048640593109877754, 'samples': 3380736, 'steps': 17607, 'loss/train': 1.7418029308319092} 11/06/2021 23:44:07 - INFO - __main__ - Step 17609: {'lr': 0.00048640420495994806, 'samples': 3380928, 'steps': 17608, 'loss/train': 1.5066266059875488} 11/06/2021 23:44:07 - INFO - __main__ - Step 17610: {'lr': 0.0004864024787145985, 'samples': 3381120, 'steps': 17609, 'loss/train': 1.6443487405776978} 11/06/2021 23:44:08 - INFO - __main__ - Step 17611: {'lr': 0.00048640075236272963, 'samples': 3381312, 'steps': 17610, 'loss/train': 1.636483073234558} 11/06/2021 23:44:08 - INFO - __main__ - Step 17612: {'lr': 0.00048639902590434214, 'samples': 3381504, 'steps': 17611, 'loss/train': 2.005004405975342} 11/06/2021 23:44:08 - INFO - __main__ - Step 17613: {'lr': 0.000486397299339437, 'samples': 3381696, 'steps': 17612, 'loss/train': 1.4395687580108643} 11/06/2021 23:44:09 - INFO - __main__ - Step 17614: {'lr': 0.0004863955726680149, 'samples': 3381888, 'steps': 17613, 'loss/train': 1.2516988515853882} 11/06/2021 23:44:10 - INFO - __main__ - Step 17615: {'lr': 0.0004863938458900765, 'samples': 3382080, 'steps': 17614, 'loss/train': 1.3464889526367188} 11/06/2021 23:44:10 - INFO - __main__ - Step 17616: {'lr': 0.0004863921190056227, 'samples': 3382272, 'steps': 17615, 'loss/train': 1.286136507987976} 11/06/2021 23:44:10 - INFO - __main__ - Step 17617: {'lr': 0.0004863903920146544, 'samples': 3382464, 'steps': 17616, 'loss/train': 2.09637188911438} 11/06/2021 23:44:11 - INFO - __main__ - Step 17618: {'lr': 0.00048638866491717214, 'samples': 3382656, 'steps': 17617, 'loss/train': 1.8491672277450562} 11/06/2021 23:44:12 - INFO - __main__ - Step 17619: {'lr': 0.00048638693771317675, 'samples': 3382848, 'steps': 17618, 'loss/train': 1.8306879997253418} 11/06/2021 23:44:12 - INFO - __main__ - Step 17620: {'lr': 0.0004863852104026691, 'samples': 3383040, 'steps': 17619, 'loss/train': 2.254307746887207} 11/06/2021 23:44:12 - INFO - __main__ - Step 17621: {'lr': 0.00048638348298564996, 'samples': 3383232, 'steps': 17620, 'loss/train': 1.5837358236312866} 11/06/2021 23:44:13 - INFO - __main__ - Step 17622: {'lr': 0.00048638175546212, 'samples': 3383424, 'steps': 17621, 'loss/train': 1.2425532341003418} 11/06/2021 23:44:13 - INFO - __main__ - Step 17623: {'lr': 0.00048638002783208013, 'samples': 3383616, 'steps': 17622, 'loss/train': 1.2094799280166626} 11/06/2021 23:44:13 - INFO - __main__ - Step 17624: {'lr': 0.000486378300095531, 'samples': 3383808, 'steps': 17623, 'loss/train': 1.8858073949813843} 11/06/2021 23:44:15 - INFO - __main__ - Step 17625: {'lr': 0.0004863765722524735, 'samples': 3384000, 'steps': 17624, 'loss/train': 1.8143595457077026} 11/06/2021 23:44:15 - INFO - __main__ - Step 17626: {'lr': 0.0004863748443029083, 'samples': 3384192, 'steps': 17625, 'loss/train': 1.8148971796035767} 11/06/2021 23:44:15 - INFO - __main__ - Step 17627: {'lr': 0.00048637311624683634, 'samples': 3384384, 'steps': 17626, 'loss/train': 1.8352149724960327} 11/06/2021 23:44:16 - INFO - __main__ - Step 17628: {'lr': 0.0004863713880842583, 'samples': 3384576, 'steps': 17627, 'loss/train': 1.9543405771255493} 11/06/2021 23:44:16 - INFO - __main__ - Step 17629: {'lr': 0.0004863696598151749, 'samples': 3384768, 'steps': 17628, 'loss/train': 1.5006835460662842} 11/06/2021 23:44:17 - INFO - __main__ - Step 17630: {'lr': 0.00048636793143958695, 'samples': 3384960, 'steps': 17629, 'loss/train': 1.918142318725586} 11/06/2021 23:44:17 - INFO - __main__ - Step 17631: {'lr': 0.00048636620295749533, 'samples': 3385152, 'steps': 17630, 'loss/train': 1.7116636037826538} 11/06/2021 23:44:18 - INFO - __main__ - Step 17632: {'lr': 0.00048636447436890075, 'samples': 3385344, 'steps': 17631, 'loss/train': 2.3437066078186035} 11/06/2021 23:44:18 - INFO - __main__ - Step 17633: {'lr': 0.0004863627456738039, 'samples': 3385536, 'steps': 17632, 'loss/train': 1.3445450067520142} 11/06/2021 23:44:18 - INFO - __main__ - Step 17634: {'lr': 0.00048636101687220566, 'samples': 3385728, 'steps': 17633, 'loss/train': 1.811186671257019} 11/06/2021 23:44:20 - INFO - __main__ - Step 17635: {'lr': 0.0004863592879641069, 'samples': 3385920, 'steps': 17634, 'loss/train': 1.7794731855392456} 11/06/2021 23:44:20 - INFO - __main__ - Step 17636: {'lr': 0.0004863575589495082, 'samples': 3386112, 'steps': 17635, 'loss/train': 0.17728441953659058} 11/06/2021 23:44:20 - INFO - __main__ - Step 17637: {'lr': 0.00048635582982841047, 'samples': 3386304, 'steps': 17636, 'loss/train': 1.6846859455108643} 11/06/2021 23:44:21 - INFO - __main__ - Step 17638: {'lr': 0.0004863541006008144, 'samples': 3386496, 'steps': 17637, 'loss/train': 1.8092470169067383} 11/06/2021 23:44:21 - INFO - __main__ - Step 17639: {'lr': 0.0004863523712667209, 'samples': 3386688, 'steps': 17638, 'loss/train': 5.441341400146484} 11/06/2021 23:44:22 - INFO - __main__ - Step 17640: {'lr': 0.00048635064182613063, 'samples': 3386880, 'steps': 17639, 'loss/train': 1.7276121377944946} 11/06/2021 23:44:22 - INFO - __main__ - Step 17641: {'lr': 0.00048634891227904435, 'samples': 3387072, 'steps': 17640, 'loss/train': 1.31169855594635} 11/06/2021 23:44:23 - INFO - __main__ - Step 17642: {'lr': 0.00048634718262546297, 'samples': 3387264, 'steps': 17641, 'loss/train': 1.7238352298736572} 11/06/2021 23:44:23 - INFO - __main__ - Step 17643: {'lr': 0.0004863454528653872, 'samples': 3387456, 'steps': 17642, 'loss/train': 1.691017985343933} 11/06/2021 23:44:23 - INFO - __main__ - Step 17644: {'lr': 0.0004863437229988178, 'samples': 3387648, 'steps': 17643, 'loss/train': 1.94411301612854} 11/06/2021 23:44:24 - INFO - __main__ - Step 17645: {'lr': 0.00048634199302575554, 'samples': 3387840, 'steps': 17644, 'loss/train': 2.1407902240753174} 11/06/2021 23:44:25 - INFO - __main__ - Step 17646: {'lr': 0.00048634026294620125, 'samples': 3388032, 'steps': 17645, 'loss/train': 1.1031410694122314} 11/06/2021 23:44:25 - INFO - __main__ - Step 17647: {'lr': 0.00048633853276015566, 'samples': 3388224, 'steps': 17646, 'loss/train': 1.7941488027572632} 11/06/2021 23:44:25 - INFO - __main__ - Step 17648: {'lr': 0.00048633680246761956, 'samples': 3388416, 'steps': 17647, 'loss/train': 1.232820987701416} 11/06/2021 23:44:26 - INFO - __main__ - Step 17649: {'lr': 0.00048633507206859383, 'samples': 3388608, 'steps': 17648, 'loss/train': 1.629302740097046} 11/06/2021 23:44:26 - INFO - __main__ - Step 17650: {'lr': 0.00048633334156307907, 'samples': 3388800, 'steps': 17649, 'loss/train': 1.1769516468048096} 11/06/2021 23:44:27 - INFO - __main__ - Step 17651: {'lr': 0.0004863316109510762, 'samples': 3388992, 'steps': 17650, 'loss/train': 1.9917153120040894} 11/06/2021 23:44:28 - INFO - __main__ - Step 17652: {'lr': 0.00048632988023258596, 'samples': 3389184, 'steps': 17651, 'loss/train': 1.4906460046768188} 11/06/2021 23:44:28 - INFO - __main__ - Step 17653: {'lr': 0.00048632814940760907, 'samples': 3389376, 'steps': 17652, 'loss/train': 1.6637530326843262} 11/06/2021 23:44:28 - INFO - __main__ - Step 17654: {'lr': 0.00048632641847614645, 'samples': 3389568, 'steps': 17653, 'loss/train': 1.3956372737884521} 11/06/2021 23:44:29 - INFO - __main__ - Step 17655: {'lr': 0.0004863246874381987, 'samples': 3389760, 'steps': 17654, 'loss/train': 1.1515908241271973} 11/06/2021 23:44:30 - INFO - __main__ - Step 17656: {'lr': 0.00048632295629376675, 'samples': 3389952, 'steps': 17655, 'loss/train': 1.2790088653564453} 11/06/2021 23:44:30 - INFO - __main__ - Step 17657: {'lr': 0.00048632122504285133, 'samples': 3390144, 'steps': 17656, 'loss/train': 1.4435092210769653} 11/06/2021 23:44:31 - INFO - __main__ - Step 17658: {'lr': 0.0004863194936854531, 'samples': 3390336, 'steps': 17657, 'loss/train': 1.6653163433074951} 11/06/2021 23:44:31 - INFO - __main__ - Step 17659: {'lr': 0.0004863177622215731, 'samples': 3390528, 'steps': 17658, 'loss/train': 1.276356816291809} 11/06/2021 23:44:31 - INFO - __main__ - Step 17660: {'lr': 0.00048631603065121186, 'samples': 3390720, 'steps': 17659, 'loss/train': 0.5060484409332275} 11/06/2021 23:44:32 - INFO - __main__ - Step 17661: {'lr': 0.00048631429897437033, 'samples': 3390912, 'steps': 17660, 'loss/train': 1.867882251739502} 11/06/2021 23:44:33 - INFO - __main__ - Step 17662: {'lr': 0.0004863125671910492, 'samples': 3391104, 'steps': 17661, 'loss/train': 1.676186203956604} 11/06/2021 23:44:33 - INFO - __main__ - Step 17663: {'lr': 0.00048631083530124934, 'samples': 3391296, 'steps': 17662, 'loss/train': 1.7655481100082397} 11/06/2021 23:44:33 - INFO - __main__ - Step 17664: {'lr': 0.00048630910330497133, 'samples': 3391488, 'steps': 17663, 'loss/train': 2.426825523376465} 11/06/2021 23:44:34 - INFO - __main__ - Step 17665: {'lr': 0.0004863073712022162, 'samples': 3391680, 'steps': 17664, 'loss/train': 1.5191713571548462} 11/06/2021 23:44:34 - INFO - __main__ - Step 17666: {'lr': 0.00048630563899298453, 'samples': 3391872, 'steps': 17665, 'loss/train': 1.772326946258545} 11/06/2021 23:44:35 - INFO - __main__ - Step 17667: {'lr': 0.00048630390667727725, 'samples': 3392064, 'steps': 17666, 'loss/train': 2.8871684074401855} 11/06/2021 23:44:35 - INFO - __main__ - Step 17668: {'lr': 0.00048630217425509503, 'samples': 3392256, 'steps': 17667, 'loss/train': 1.6198441982269287} 11/06/2021 23:44:36 - INFO - __main__ - Step 17669: {'lr': 0.00048630044172643874, 'samples': 3392448, 'steps': 17668, 'loss/train': 1.4763474464416504} 11/06/2021 23:44:36 - INFO - __main__ - Step 17670: {'lr': 0.0004862987090913091, 'samples': 3392640, 'steps': 17669, 'loss/train': 1.4978746175765991} 11/06/2021 23:44:37 - INFO - __main__ - Step 17671: {'lr': 0.0004862969763497069, 'samples': 3392832, 'steps': 17670, 'loss/train': 1.3769848346710205} 11/06/2021 23:44:38 - INFO - __main__ - Step 17672: {'lr': 0.0004862952435016329, 'samples': 3393024, 'steps': 17671, 'loss/train': 1.7096163034439087} 11/06/2021 23:44:38 - INFO - __main__ - Step 17673: {'lr': 0.00048629351054708795, 'samples': 3393216, 'steps': 17672, 'loss/train': 0.3188741207122803} 11/06/2021 23:44:38 - INFO - __main__ - Step 17674: {'lr': 0.0004862917774860728, 'samples': 3393408, 'steps': 17673, 'loss/train': 1.1482633352279663} 11/06/2021 23:44:39 - INFO - __main__ - Step 17675: {'lr': 0.0004862900443185882, 'samples': 3393600, 'steps': 17674, 'loss/train': 1.705519199371338} 11/06/2021 23:44:39 - INFO - __main__ - Step 17676: {'lr': 0.00048628831104463496, 'samples': 3393792, 'steps': 17675, 'loss/train': 1.3742202520370483} 11/06/2021 23:44:40 - INFO - __main__ - Step 17677: {'lr': 0.0004862865776642138, 'samples': 3393984, 'steps': 17676, 'loss/train': 1.4632993936538696} 11/06/2021 23:44:40 - INFO - __main__ - Step 17678: {'lr': 0.00048628484417732567, 'samples': 3394176, 'steps': 17677, 'loss/train': 1.8968327045440674} 11/06/2021 23:44:41 - INFO - __main__ - Step 17679: {'lr': 0.00048628311058397113, 'samples': 3394368, 'steps': 17678, 'loss/train': 1.8257449865341187} 11/06/2021 23:44:41 - INFO - __main__ - Step 17680: {'lr': 0.0004862813768841511, 'samples': 3394560, 'steps': 17679, 'loss/train': 1.2477961778640747} 11/06/2021 23:44:41 - INFO - __main__ - Step 17681: {'lr': 0.0004862796430778663, 'samples': 3394752, 'steps': 17680, 'loss/train': 1.7231652736663818} 11/06/2021 23:44:42 - INFO - __main__ - Step 17682: {'lr': 0.0004862779091651176, 'samples': 3394944, 'steps': 17681, 'loss/train': 1.2031580209732056} 11/06/2021 23:44:44 - INFO - __main__ - Step 17683: {'lr': 0.0004862761751459057, 'samples': 3395136, 'steps': 17682, 'loss/train': 1.4472994804382324} 11/06/2021 23:44:44 - INFO - __main__ - Step 17684: {'lr': 0.0004862744410202314, 'samples': 3395328, 'steps': 17683, 'loss/train': 1.509667158126831} 11/06/2021 23:44:44 - INFO - __main__ - Step 17685: {'lr': 0.00048627270678809544, 'samples': 3395520, 'steps': 17684, 'loss/train': 1.8475730419158936} 11/06/2021 23:44:45 - INFO - __main__ - Step 17686: {'lr': 0.0004862709724494987, 'samples': 3395712, 'steps': 17685, 'loss/train': 0.83989018201828} 11/06/2021 23:44:45 - INFO - __main__ - Step 17687: {'lr': 0.0004862692380044419, 'samples': 3395904, 'steps': 17686, 'loss/train': 0.8706456422805786} 11/06/2021 23:44:45 - INFO - __main__ - Step 17688: {'lr': 0.0004862675034529258, 'samples': 3396096, 'steps': 17687, 'loss/train': 2.1186492443084717} 11/06/2021 23:44:46 - INFO - __main__ - Step 17689: {'lr': 0.0004862657687949512, 'samples': 3396288, 'steps': 17688, 'loss/train': 1.95449960231781} 11/06/2021 23:44:47 - INFO - __main__ - Step 17690: {'lr': 0.00048626403403051894, 'samples': 3396480, 'steps': 17689, 'loss/train': 1.7684614658355713} 11/06/2021 23:44:47 - INFO - __main__ - Step 17691: {'lr': 0.00048626229915962974, 'samples': 3396672, 'steps': 17690, 'loss/train': 1.9429171085357666} 11/06/2021 23:44:47 - INFO - __main__ - Step 17692: {'lr': 0.00048626056418228436, 'samples': 3396864, 'steps': 17691, 'loss/train': 1.7060483694076538} 11/06/2021 23:44:48 - INFO - __main__ - Step 17693: {'lr': 0.0004862588290984836, 'samples': 3397056, 'steps': 17692, 'loss/train': 1.6680606603622437} 11/06/2021 23:44:48 - INFO - __main__ - Step 17694: {'lr': 0.0004862570939082283, 'samples': 3397248, 'steps': 17693, 'loss/train': 1.6972788572311401} 11/06/2021 23:44:49 - INFO - __main__ - Step 17695: {'lr': 0.0004862553586115192, 'samples': 3397440, 'steps': 17694, 'loss/train': 1.4430689811706543} 11/06/2021 23:44:49 - INFO - __main__ - Step 17696: {'lr': 0.00048625362320835707, 'samples': 3397632, 'steps': 17695, 'loss/train': 1.8841452598571777} 11/06/2021 23:44:50 - INFO - __main__ - Step 17697: {'lr': 0.00048625188769874274, 'samples': 3397824, 'steps': 17696, 'loss/train': 1.7126444578170776} 11/06/2021 23:44:50 - INFO - __main__ - Step 17698: {'lr': 0.0004862501520826769, 'samples': 3398016, 'steps': 17697, 'loss/train': 1.623423457145691} 11/06/2021 23:44:50 - INFO - __main__ - Step 17699: {'lr': 0.0004862484163601604, 'samples': 3398208, 'steps': 17698, 'loss/train': 1.2866634130477905} 11/06/2021 23:44:51 - INFO - __main__ - Step 17700: {'lr': 0.000486246680531194, 'samples': 3398400, 'steps': 17699, 'loss/train': 1.8399791717529297} 11/06/2021 23:44:52 - INFO - __main__ - Step 17701: {'lr': 0.0004862449445957785, 'samples': 3398592, 'steps': 17700, 'loss/train': 2.2770066261291504} 11/06/2021 23:44:52 - INFO - __main__ - Step 17702: {'lr': 0.00048624320855391467, 'samples': 3398784, 'steps': 17701, 'loss/train': 1.8665575981140137} 11/06/2021 23:44:53 - INFO - __main__ - Step 17703: {'lr': 0.00048624147240560335, 'samples': 3398976, 'steps': 17702, 'loss/train': 1.7524605989456177} 11/06/2021 23:44:53 - INFO - __main__ - Step 17704: {'lr': 0.00048623973615084516, 'samples': 3399168, 'steps': 17703, 'loss/train': 1.6480399370193481} 11/06/2021 23:44:53 - INFO - __main__ - Step 17705: {'lr': 0.0004862379997896411, 'samples': 3399360, 'steps': 17704, 'loss/train': 1.4674162864685059} 11/06/2021 23:44:55 - INFO - __main__ - Step 17706: {'lr': 0.0004862362633219918, 'samples': 3399552, 'steps': 17705, 'loss/train': 1.4373513460159302} 11/06/2021 23:44:55 - INFO - __main__ - Step 17707: {'lr': 0.000486234526747898, 'samples': 3399744, 'steps': 17706, 'loss/train': 1.3480507135391235} 11/06/2021 23:44:55 - INFO - __main__ - Step 17708: {'lr': 0.0004862327900673607, 'samples': 3399936, 'steps': 17707, 'loss/train': 1.7012041807174683} 11/06/2021 23:44:56 - INFO - __main__ - Step 17709: {'lr': 0.00048623105328038054, 'samples': 3400128, 'steps': 17708, 'loss/train': 1.5123977661132812} 11/06/2021 23:44:56 - INFO - __main__ - Step 17710: {'lr': 0.0004862293163869582, 'samples': 3400320, 'steps': 17709, 'loss/train': 1.932036280632019} 11/06/2021 23:44:56 - INFO - __main__ - Step 17711: {'lr': 0.00048622757938709466, 'samples': 3400512, 'steps': 17710, 'loss/train': 1.562470555305481} 11/06/2021 23:44:57 - INFO - __main__ - Step 17712: {'lr': 0.0004862258422807906, 'samples': 3400704, 'steps': 17711, 'loss/train': 2.740880012512207} 11/06/2021 23:44:58 - INFO - __main__ - Step 17713: {'lr': 0.0004862241050680468, 'samples': 3400896, 'steps': 17712, 'loss/train': 1.3904601335525513} 11/06/2021 23:44:58 - INFO - __main__ - Step 17714: {'lr': 0.00048622236774886415, 'samples': 3401088, 'steps': 17713, 'loss/train': 1.7398821115493774} 11/06/2021 23:44:59 - INFO - __main__ - Step 17715: {'lr': 0.00048622063032324324, 'samples': 3401280, 'steps': 17714, 'loss/train': 1.4588117599487305} 11/06/2021 23:44:59 - INFO - __main__ - Step 17716: {'lr': 0.000486218892791185, 'samples': 3401472, 'steps': 17715, 'loss/train': 1.9582970142364502} 11/06/2021 23:45:00 - INFO - __main__ - Step 17717: {'lr': 0.00048621715515269017, 'samples': 3401664, 'steps': 17716, 'loss/train': 1.8855799436569214} 11/06/2021 23:45:00 - INFO - __main__ - Step 17718: {'lr': 0.0004862154174077595, 'samples': 3401856, 'steps': 17717, 'loss/train': 1.602186918258667} 11/06/2021 23:45:01 - INFO - __main__ - Step 17719: {'lr': 0.00048621367955639395, 'samples': 3402048, 'steps': 17718, 'loss/train': 1.7233091592788696} 11/06/2021 23:45:01 - INFO - __main__ - Step 17720: {'lr': 0.00048621194159859403, 'samples': 3402240, 'steps': 17719, 'loss/train': 1.776808738708496} 11/06/2021 23:45:01 - INFO - __main__ - Step 17721: {'lr': 0.0004862102035343607, 'samples': 3402432, 'steps': 17720, 'loss/train': 1.7947196960449219} 11/06/2021 23:45:02 - INFO - __main__ - Step 17722: {'lr': 0.0004862084653636947, 'samples': 3402624, 'steps': 17721, 'loss/train': 1.6159709692001343} 11/06/2021 23:45:03 - INFO - __main__ - Step 17723: {'lr': 0.00048620672708659675, 'samples': 3402816, 'steps': 17722, 'loss/train': 1.3404769897460938} 11/06/2021 23:45:03 - INFO - __main__ - Step 17724: {'lr': 0.0004862049887030677, 'samples': 3403008, 'steps': 17723, 'loss/train': 1.8909834623336792} 11/06/2021 23:45:04 - INFO - __main__ - Step 17725: {'lr': 0.0004862032502131084, 'samples': 3403200, 'steps': 17724, 'loss/train': 1.3636215925216675} 11/06/2021 23:45:04 - INFO - __main__ - Step 17726: {'lr': 0.00048620151161671955, 'samples': 3403392, 'steps': 17725, 'loss/train': 1.3039368391036987} 11/06/2021 23:45:04 - INFO - __main__ - Step 17727: {'lr': 0.00048619977291390186, 'samples': 3403584, 'steps': 17726, 'loss/train': 1.5515477657318115} 11/06/2021 23:45:05 - INFO - __main__ - Step 17728: {'lr': 0.00048619803410465624, 'samples': 3403776, 'steps': 17727, 'loss/train': 1.6064467430114746} 11/06/2021 23:45:06 - INFO - __main__ - Step 17729: {'lr': 0.00048619629518898344, 'samples': 3403968, 'steps': 17728, 'loss/train': 1.9508514404296875} 11/06/2021 23:45:06 - INFO - __main__ - Step 17730: {'lr': 0.00048619455616688426, 'samples': 3404160, 'steps': 17729, 'loss/train': 1.5443555116653442} 11/06/2021 23:45:06 - INFO - __main__ - Step 17731: {'lr': 0.0004861928170383594, 'samples': 3404352, 'steps': 17730, 'loss/train': 2.0993919372558594} 11/06/2021 23:45:07 - INFO - __main__ - Step 17732: {'lr': 0.0004861910778034098, 'samples': 3404544, 'steps': 17731, 'loss/train': 1.164543628692627} 11/06/2021 23:45:08 - INFO - __main__ - Step 17733: {'lr': 0.00048618933846203606, 'samples': 3404736, 'steps': 17732, 'loss/train': 1.5191380977630615} 11/06/2021 23:45:08 - INFO - __main__ - Step 17734: {'lr': 0.00048618759901423905, 'samples': 3404928, 'steps': 17733, 'loss/train': 1.3447216749191284} 11/06/2021 23:45:08 - INFO - __main__ - Step 17735: {'lr': 0.0004861858594600196, 'samples': 3405120, 'steps': 17734, 'loss/train': 1.69388747215271} 11/06/2021 23:45:09 - INFO - __main__ - Step 17736: {'lr': 0.0004861841197993784, 'samples': 3405312, 'steps': 17735, 'loss/train': 1.55830979347229} 11/06/2021 23:45:09 - INFO - __main__ - Step 17737: {'lr': 0.0004861823800323163, 'samples': 3405504, 'steps': 17736, 'loss/train': 1.009899377822876} 11/06/2021 23:45:10 - INFO - __main__ - Step 17738: {'lr': 0.00048618064015883405, 'samples': 3405696, 'steps': 17737, 'loss/train': 1.4308483600616455} 11/06/2021 23:45:10 - INFO - __main__ - Step 17739: {'lr': 0.0004861789001789325, 'samples': 3405888, 'steps': 17738, 'loss/train': 1.650288462638855} 11/06/2021 23:45:11 - INFO - __main__ - Step 17740: {'lr': 0.00048617716009261236, 'samples': 3406080, 'steps': 17739, 'loss/train': 1.3938745260238647} 11/06/2021 23:45:11 - INFO - __main__ - Step 17741: {'lr': 0.00048617541989987435, 'samples': 3406272, 'steps': 17740, 'loss/train': 2.2797935009002686} 11/06/2021 23:45:11 - INFO - __main__ - Step 17742: {'lr': 0.00048617367960071946, 'samples': 3406464, 'steps': 17741, 'loss/train': 1.6652027368545532} 11/06/2021 23:45:12 - INFO - __main__ - Step 17743: {'lr': 0.0004861719391951483, 'samples': 3406656, 'steps': 17742, 'loss/train': 2.367107629776001} 11/06/2021 23:45:13 - INFO - __main__ - Step 17744: {'lr': 0.0004861701986831617, 'samples': 3406848, 'steps': 17743, 'loss/train': 1.67668616771698} 11/06/2021 23:45:13 - INFO - __main__ - Step 17745: {'lr': 0.0004861684580647605, 'samples': 3407040, 'steps': 17744, 'loss/train': 1.9319639205932617} 11/06/2021 23:45:13 - INFO - __main__ - Step 17746: {'lr': 0.0004861667173399453, 'samples': 3407232, 'steps': 17745, 'loss/train': 1.9441924095153809} 11/06/2021 23:45:14 - INFO - __main__ - Step 17747: {'lr': 0.0004861649765087172, 'samples': 3407424, 'steps': 17746, 'loss/train': 1.659930944442749} 11/06/2021 23:45:15 - INFO - __main__ - Step 17748: {'lr': 0.0004861632355710767, 'samples': 3407616, 'steps': 17747, 'loss/train': 1.866965889930725} 11/06/2021 23:45:15 - INFO - __main__ - Step 17749: {'lr': 0.00048616149452702473, 'samples': 3407808, 'steps': 17748, 'loss/train': 1.0669561624526978} 11/06/2021 23:45:16 - INFO - __main__ - Step 17750: {'lr': 0.00048615975337656204, 'samples': 3408000, 'steps': 17749, 'loss/train': 1.2315871715545654} 11/06/2021 23:45:16 - INFO - __main__ - Step 17751: {'lr': 0.00048615801211968936, 'samples': 3408192, 'steps': 17750, 'loss/train': 1.1719317436218262} 11/06/2021 23:45:16 - INFO - __main__ - Step 17752: {'lr': 0.00048615627075640754, 'samples': 3408384, 'steps': 17751, 'loss/train': 1.6517837047576904} 11/06/2021 23:45:17 - INFO - __main__ - Step 17753: {'lr': 0.00048615452928671746, 'samples': 3408576, 'steps': 17752, 'loss/train': 1.3489247560501099} 11/06/2021 23:45:18 - INFO - __main__ - Step 17754: {'lr': 0.00048615278771061966, 'samples': 3408768, 'steps': 17753, 'loss/train': 1.8767962455749512} 11/06/2021 23:45:18 - INFO - __main__ - Step 17755: {'lr': 0.0004861510460281151, 'samples': 3408960, 'steps': 17754, 'loss/train': 1.8693060874938965} 11/06/2021 23:45:18 - INFO - __main__ - Step 17756: {'lr': 0.0004861493042392045, 'samples': 3409152, 'steps': 17755, 'loss/train': 1.853451132774353} 11/06/2021 23:45:19 - INFO - __main__ - Step 17757: {'lr': 0.00048614756234388866, 'samples': 3409344, 'steps': 17756, 'loss/train': 2.1281392574310303} 11/06/2021 23:45:19 - INFO - __main__ - Step 17758: {'lr': 0.00048614582034216844, 'samples': 3409536, 'steps': 17757, 'loss/train': 1.9277595281600952} 11/06/2021 23:45:20 - INFO - __main__ - Step 17759: {'lr': 0.0004861440782340445, 'samples': 3409728, 'steps': 17758, 'loss/train': 1.6594657897949219} 11/06/2021 23:45:20 - INFO - __main__ - Step 17760: {'lr': 0.0004861423360195177, 'samples': 3409920, 'steps': 17759, 'loss/train': 1.752420425415039} 11/06/2021 23:45:21 - INFO - __main__ - Step 17761: {'lr': 0.0004861405936985888, 'samples': 3410112, 'steps': 17760, 'loss/train': 1.9742324352264404} 11/06/2021 23:45:21 - INFO - __main__ - Step 17762: {'lr': 0.0004861388512712586, 'samples': 3410304, 'steps': 17761, 'loss/train': 1.6699413061141968} 11/06/2021 23:45:21 - INFO - __main__ - Step 17763: {'lr': 0.0004861371087375279, 'samples': 3410496, 'steps': 17762, 'loss/train': 1.9181797504425049} 11/06/2021 23:45:22 - INFO - __main__ - Step 17764: {'lr': 0.0004861353660973974, 'samples': 3410688, 'steps': 17763, 'loss/train': 2.064819812774658} 11/06/2021 23:45:23 - INFO - __main__ - Step 17765: {'lr': 0.00048613362335086797, 'samples': 3410880, 'steps': 17764, 'loss/train': 1.6789594888687134} 11/06/2021 23:45:23 - INFO - __main__ - Step 17766: {'lr': 0.00048613188049794045, 'samples': 3411072, 'steps': 17765, 'loss/train': 1.2881523370742798} 11/06/2021 23:45:23 - INFO - __main__ - Step 17767: {'lr': 0.00048613013753861546, 'samples': 3411264, 'steps': 17766, 'loss/train': 1.05593740940094} 11/06/2021 23:45:24 - INFO - __main__ - Step 17768: {'lr': 0.0004861283944728939, 'samples': 3411456, 'steps': 17767, 'loss/train': 1.651064395904541} 11/06/2021 23:45:25 - INFO - __main__ - Step 17769: {'lr': 0.0004861266513007765, 'samples': 3411648, 'steps': 17768, 'loss/train': 1.3704620599746704} 11/06/2021 23:45:25 - INFO - __main__ - Step 17770: {'lr': 0.00048612490802226415, 'samples': 3411840, 'steps': 17769, 'loss/train': 1.651924729347229} 11/06/2021 23:45:26 - INFO - __main__ - Step 17771: {'lr': 0.0004861231646373575, 'samples': 3412032, 'steps': 17770, 'loss/train': 1.6909313201904297} 11/06/2021 23:45:26 - INFO - __main__ - Step 17772: {'lr': 0.0004861214211460574, 'samples': 3412224, 'steps': 17771, 'loss/train': 1.8236727714538574} 11/06/2021 23:45:26 - INFO - __main__ - Step 17773: {'lr': 0.00048611967754836466, 'samples': 3412416, 'steps': 17772, 'loss/train': 1.6562209129333496} 11/06/2021 23:45:27 - INFO - __main__ - Step 17774: {'lr': 0.00048611793384428006, 'samples': 3412608, 'steps': 17773, 'loss/train': 1.7618428468704224} 11/06/2021 23:45:28 - INFO - __main__ - Step 17775: {'lr': 0.00048611619003380426, 'samples': 3412800, 'steps': 17774, 'loss/train': 1.5506091117858887} 11/06/2021 23:45:28 - INFO - __main__ - Step 17776: {'lr': 0.0004861144461169382, 'samples': 3412992, 'steps': 17775, 'loss/train': 1.73026442527771} 11/06/2021 23:45:28 - INFO - __main__ - Step 17777: {'lr': 0.00048611270209368264, 'samples': 3413184, 'steps': 17776, 'loss/train': 1.560634732246399} 11/06/2021 23:45:29 - INFO - __main__ - Step 17778: {'lr': 0.0004861109579640384, 'samples': 3413376, 'steps': 17777, 'loss/train': 1.7123799324035645} 11/06/2021 23:45:30 - INFO - __main__ - Step 17779: {'lr': 0.0004861092137280061, 'samples': 3413568, 'steps': 17778, 'loss/train': 2.3202054500579834} 11/06/2021 23:45:30 - INFO - __main__ - Step 17780: {'lr': 0.00048610746938558666, 'samples': 3413760, 'steps': 17779, 'loss/train': 1.6584490537643433} 11/06/2021 23:45:30 - INFO - __main__ - Step 17781: {'lr': 0.0004861057249367808, 'samples': 3413952, 'steps': 17780, 'loss/train': 1.4049021005630493} 11/06/2021 23:45:31 - INFO - __main__ - Step 17782: {'lr': 0.00048610398038158943, 'samples': 3414144, 'steps': 17781, 'loss/train': 1.194156527519226} 11/06/2021 23:45:31 - INFO - __main__ - Step 17783: {'lr': 0.00048610223572001315, 'samples': 3414336, 'steps': 17782, 'loss/train': 1.766744613647461} 11/06/2021 23:45:32 - INFO - __main__ - Step 17784: {'lr': 0.0004861004909520529, 'samples': 3414528, 'steps': 17783, 'loss/train': 1.6957181692123413} 11/06/2021 23:45:32 - INFO - __main__ - Step 17785: {'lr': 0.00048609874607770945, 'samples': 3414720, 'steps': 17784, 'loss/train': 1.5701578855514526} 11/06/2021 23:45:33 - INFO - __main__ - Step 17786: {'lr': 0.0004860970010969835, 'samples': 3414912, 'steps': 17785, 'loss/train': 1.456776738166809} 11/06/2021 23:45:33 - INFO - __main__ - Step 17787: {'lr': 0.0004860952560098759, 'samples': 3415104, 'steps': 17786, 'loss/train': 2.101297378540039} 11/06/2021 23:45:34 - INFO - __main__ - Step 17788: {'lr': 0.0004860935108163874, 'samples': 3415296, 'steps': 17787, 'loss/train': 1.8379590511322021} 11/06/2021 23:45:34 - INFO - __main__ - Step 17789: {'lr': 0.0004860917655165188, 'samples': 3415488, 'steps': 17788, 'loss/train': 1.5483107566833496} 11/06/2021 23:45:35 - INFO - __main__ - Step 17790: {'lr': 0.00048609002011027093, 'samples': 3415680, 'steps': 17789, 'loss/train': 1.2295900583267212} 11/06/2021 23:45:36 - INFO - __main__ - Step 17791: {'lr': 0.0004860882745976445, 'samples': 3415872, 'steps': 17790, 'loss/train': 1.2207252979278564} 11/06/2021 23:45:36 - INFO - __main__ - Step 17792: {'lr': 0.00048608652897864034, 'samples': 3416064, 'steps': 17791, 'loss/train': 1.811275601387024} 11/06/2021 23:45:36 - INFO - __main__ - Step 17793: {'lr': 0.0004860847832532593, 'samples': 3416256, 'steps': 17792, 'loss/train': 1.5424718856811523} 11/06/2021 23:45:37 - INFO - __main__ - Step 17794: {'lr': 0.00048608303742150204, 'samples': 3416448, 'steps': 17793, 'loss/train': 1.4225600957870483} 11/06/2021 23:45:37 - INFO - __main__ - Step 17795: {'lr': 0.0004860812914833694, 'samples': 3416640, 'steps': 17794, 'loss/train': 1.653011441230774} 11/06/2021 23:45:38 - INFO - __main__ - Step 17796: {'lr': 0.00048607954543886225, 'samples': 3416832, 'steps': 17795, 'loss/train': 1.1417803764343262} 11/06/2021 23:45:38 - INFO - __main__ - Step 17797: {'lr': 0.00048607779928798125, 'samples': 3417024, 'steps': 17796, 'loss/train': 1.5692245960235596} 11/06/2021 23:45:39 - INFO - __main__ - Step 17798: {'lr': 0.0004860760530307272, 'samples': 3417216, 'steps': 17797, 'loss/train': 2.135267734527588} 11/06/2021 23:45:39 - INFO - __main__ - Step 17799: {'lr': 0.00048607430666710097, 'samples': 3417408, 'steps': 17798, 'loss/train': 1.1966557502746582} 11/06/2021 23:45:40 - INFO - __main__ - Step 17800: {'lr': 0.00048607256019710327, 'samples': 3417600, 'steps': 17799, 'loss/train': 1.4628567695617676} 11/06/2021 23:45:41 - INFO - __main__ - Step 17801: {'lr': 0.0004860708136207349, 'samples': 3417792, 'steps': 17800, 'loss/train': 1.8049674034118652} 11/06/2021 23:45:42 - INFO - __main__ - Step 17802: {'lr': 0.0004860690669379967, 'samples': 3417984, 'steps': 17801, 'loss/train': 1.4776289463043213} 11/06/2021 23:45:42 - INFO - __main__ - Step 17803: {'lr': 0.00048606732014888946, 'samples': 3418176, 'steps': 17802, 'loss/train': 1.5583086013793945} 11/06/2021 23:45:42 - INFO - __main__ - Step 17804: {'lr': 0.0004860655732534138, 'samples': 3418368, 'steps': 17803, 'loss/train': 1.8255856037139893} 11/06/2021 23:45:43 - INFO - __main__ - Step 17805: {'lr': 0.00048606382625157075, 'samples': 3418560, 'steps': 17804, 'loss/train': 2.160865306854248} 11/06/2021 23:45:43 - INFO - __main__ - Step 17806: {'lr': 0.00048606207914336097, 'samples': 3418752, 'steps': 17805, 'loss/train': 1.7955251932144165} 11/06/2021 23:45:43 - INFO - __main__ - Step 17807: {'lr': 0.0004860603319287853, 'samples': 3418944, 'steps': 17806, 'loss/train': 1.7676409482955933} 11/06/2021 23:45:44 - INFO - __main__ - Step 17808: {'lr': 0.0004860585846078444, 'samples': 3419136, 'steps': 17807, 'loss/train': 1.7928861379623413} 11/06/2021 23:45:45 - INFO - __main__ - Step 17809: {'lr': 0.00048605683718053915, 'samples': 3419328, 'steps': 17808, 'loss/train': 1.0694918632507324} 11/06/2021 23:45:45 - INFO - __main__ - Step 17810: {'lr': 0.0004860550896468704, 'samples': 3419520, 'steps': 17809, 'loss/train': 1.7536789178848267} 11/06/2021 23:45:45 - INFO - __main__ - Step 17811: {'lr': 0.00048605334200683883, 'samples': 3419712, 'steps': 17810, 'loss/train': 1.910631537437439} 11/06/2021 23:45:46 - INFO - __main__ - Step 17812: {'lr': 0.0004860515942604452, 'samples': 3419904, 'steps': 17811, 'loss/train': 1.5907909870147705} 11/06/2021 23:45:47 - INFO - __main__ - Step 17813: {'lr': 0.00048604984640769047, 'samples': 3420096, 'steps': 17812, 'loss/train': 1.6828159093856812} 11/06/2021 23:45:47 - INFO - __main__ - Step 17814: {'lr': 0.00048604809844857524, 'samples': 3420288, 'steps': 17813, 'loss/train': 1.5735582113265991} 11/06/2021 23:45:48 - INFO - __main__ - Step 17815: {'lr': 0.0004860463503831004, 'samples': 3420480, 'steps': 17814, 'loss/train': 1.7284306287765503} 11/06/2021 23:45:48 - INFO - __main__ - Step 17816: {'lr': 0.0004860446022112668, 'samples': 3420672, 'steps': 17815, 'loss/train': 1.8749446868896484} 11/06/2021 23:45:48 - INFO - __main__ - Step 17817: {'lr': 0.00048604285393307503, 'samples': 3420864, 'steps': 17816, 'loss/train': 1.7494878768920898} 11/06/2021 23:45:50 - INFO - __main__ - Step 17818: {'lr': 0.000486041105548526, 'samples': 3421056, 'steps': 17817, 'loss/train': 1.6738638877868652} 11/06/2021 23:45:50 - INFO - __main__ - Step 17819: {'lr': 0.00048603935705762057, 'samples': 3421248, 'steps': 17818, 'loss/train': 1.633705496788025} 11/06/2021 23:45:50 - INFO - __main__ - Step 17820: {'lr': 0.0004860376084603594, 'samples': 3421440, 'steps': 17819, 'loss/train': 1.3418101072311401} 11/06/2021 23:45:51 - INFO - __main__ - Step 17821: {'lr': 0.00048603585975674334, 'samples': 3421632, 'steps': 17820, 'loss/train': 0.8659923076629639} 11/06/2021 23:45:51 - INFO - __main__ - Step 17822: {'lr': 0.0004860341109467732, 'samples': 3421824, 'steps': 17821, 'loss/train': 1.9501591920852661} 11/06/2021 23:45:52 - INFO - __main__ - Step 17823: {'lr': 0.00048603236203044963, 'samples': 3422016, 'steps': 17822, 'loss/train': 1.3790924549102783} 11/06/2021 23:45:52 - INFO - __main__ - Step 17824: {'lr': 0.00048603061300777365, 'samples': 3422208, 'steps': 17823, 'loss/train': 1.8731638193130493} 11/06/2021 23:45:53 - INFO - __main__ - Step 17825: {'lr': 0.0004860288638787458, 'samples': 3422400, 'steps': 17824, 'loss/train': 1.725066900253296} 11/06/2021 23:45:53 - INFO - __main__ - Step 17826: {'lr': 0.000486027114643367, 'samples': 3422592, 'steps': 17825, 'loss/train': 1.745100498199463} 11/06/2021 23:45:53 - INFO - __main__ - Step 17827: {'lr': 0.0004860253653016381, 'samples': 3422784, 'steps': 17826, 'loss/train': 1.3238799571990967} 11/06/2021 23:45:54 - INFO - __main__ - Step 17828: {'lr': 0.00048602361585355975, 'samples': 3422976, 'steps': 17827, 'loss/train': 1.8760740756988525} 11/06/2021 23:45:55 - INFO - __main__ - Step 17829: {'lr': 0.0004860218662991328, 'samples': 3423168, 'steps': 17828, 'loss/train': 1.3989863395690918} 11/06/2021 23:45:55 - INFO - __main__ - Step 17830: {'lr': 0.0004860201166383581, 'samples': 3423360, 'steps': 17829, 'loss/train': 1.4480235576629639} 11/06/2021 23:45:56 - INFO - __main__ - Step 17831: {'lr': 0.00048601836687123636, 'samples': 3423552, 'steps': 17830, 'loss/train': 1.3605883121490479} 11/06/2021 23:45:56 - INFO - __main__ - Step 17832: {'lr': 0.00048601661699776834, 'samples': 3423744, 'steps': 17831, 'loss/train': 1.4478638172149658} 11/06/2021 23:45:56 - INFO - __main__ - Step 17833: {'lr': 0.0004860148670179549, 'samples': 3423936, 'steps': 17832, 'loss/train': 1.6285169124603271} 11/06/2021 23:45:57 - INFO - __main__ - Step 17834: {'lr': 0.0004860131169317968, 'samples': 3424128, 'steps': 17833, 'loss/train': 1.9183335304260254} 11/06/2021 23:45:58 - INFO - __main__ - Step 17835: {'lr': 0.0004860113667392948, 'samples': 3424320, 'steps': 17834, 'loss/train': 1.6294353008270264} 11/06/2021 23:45:58 - INFO - __main__ - Step 17836: {'lr': 0.00048600961644044977, 'samples': 3424512, 'steps': 17835, 'loss/train': 1.7146687507629395} 11/06/2021 23:45:58 - INFO - __main__ - Step 17837: {'lr': 0.0004860078660352625, 'samples': 3424704, 'steps': 17836, 'loss/train': 1.918862223625183} 11/06/2021 23:45:59 - INFO - __main__ - Step 17838: {'lr': 0.0004860061155237336, 'samples': 3424896, 'steps': 17837, 'loss/train': 1.628971815109253} 11/06/2021 23:46:00 - INFO - __main__ - Step 17839: {'lr': 0.0004860043649058641, 'samples': 3425088, 'steps': 17838, 'loss/train': 1.5848469734191895} 11/06/2021 23:46:00 - INFO - __main__ - Step 17840: {'lr': 0.00048600261418165456, 'samples': 3425280, 'steps': 17839, 'loss/train': 2.30672025680542} 11/06/2021 23:46:00 - INFO - __main__ - Step 17841: {'lr': 0.00048600086335110593, 'samples': 3425472, 'steps': 17840, 'loss/train': 1.490289568901062} 11/06/2021 23:46:01 - INFO - __main__ - Step 17842: {'lr': 0.000485999112414219, 'samples': 3425664, 'steps': 17841, 'loss/train': 1.1723436117172241} 11/06/2021 23:46:01 - INFO - __main__ - Step 17843: {'lr': 0.0004859973613709945, 'samples': 3425856, 'steps': 17842, 'loss/train': 1.2183293104171753} 11/06/2021 23:46:02 - INFO - __main__ - Step 17844: {'lr': 0.0004859956102214332, 'samples': 3426048, 'steps': 17843, 'loss/train': 1.6532636880874634} 11/06/2021 23:46:02 - INFO - __main__ - Step 17845: {'lr': 0.00048599385896553595, 'samples': 3426240, 'steps': 17844, 'loss/train': 1.6545504331588745} 11/06/2021 23:46:03 - INFO - __main__ - Step 17846: {'lr': 0.0004859921076033034, 'samples': 3426432, 'steps': 17845, 'loss/train': 1.8391602039337158} 11/06/2021 23:46:03 - INFO - __main__ - Step 17847: {'lr': 0.00048599035613473656, 'samples': 3426624, 'steps': 17846, 'loss/train': 1.792570948600769} 11/06/2021 23:46:04 - INFO - __main__ - Step 17848: {'lr': 0.0004859886045598361, 'samples': 3426816, 'steps': 17847, 'loss/train': 1.6841845512390137} 11/06/2021 23:46:05 - INFO - __main__ - Step 17849: {'lr': 0.0004859868528786028, 'samples': 3427008, 'steps': 17848, 'loss/train': 1.8438422679901123} 11/06/2021 23:46:06 - INFO - __main__ - Step 17850: {'lr': 0.0004859851010910374, 'samples': 3427200, 'steps': 17849, 'loss/train': 1.7643811702728271} 11/06/2021 23:46:06 - INFO - __main__ - Step 17851: {'lr': 0.0004859833491971409, 'samples': 3427392, 'steps': 17850, 'loss/train': 1.7371845245361328} 11/06/2021 23:46:06 - INFO - __main__ - Step 17852: {'lr': 0.0004859815971969138, 'samples': 3427584, 'steps': 17851, 'loss/train': 1.7148878574371338} 11/06/2021 23:46:07 - INFO - __main__ - Step 17853: {'lr': 0.0004859798450903571, 'samples': 3427776, 'steps': 17852, 'loss/train': 1.6091628074645996} 11/06/2021 23:46:07 - INFO - __main__ - Step 17854: {'lr': 0.00048597809287747153, 'samples': 3427968, 'steps': 17853, 'loss/train': 1.4261943101882935} 11/06/2021 23:46:07 - INFO - __main__ - Step 17855: {'lr': 0.0004859763405582579, 'samples': 3428160, 'steps': 17854, 'loss/train': 1.8740367889404297} 11/06/2021 23:46:09 - INFO - __main__ - Step 17856: {'lr': 0.00048597458813271686, 'samples': 3428352, 'steps': 17855, 'loss/train': 1.8373491764068604} 11/06/2021 23:46:09 - INFO - __main__ - Step 17857: {'lr': 0.0004859728356008494, 'samples': 3428544, 'steps': 17856, 'loss/train': 1.820371389389038} 11/06/2021 23:46:09 - INFO - __main__ - Step 17858: {'lr': 0.00048597108296265625, 'samples': 3428736, 'steps': 17857, 'loss/train': 1.5134706497192383} 11/06/2021 23:46:10 - INFO - __main__ - Step 17859: {'lr': 0.00048596933021813815, 'samples': 3428928, 'steps': 17858, 'loss/train': 1.1924867630004883} 11/06/2021 23:46:10 - INFO - __main__ - Step 17860: {'lr': 0.0004859675773672959, 'samples': 3429120, 'steps': 17859, 'loss/train': 2.6600170135498047} 11/06/2021 23:46:10 - INFO - __main__ - Step 17861: {'lr': 0.00048596582441013026, 'samples': 3429312, 'steps': 17860, 'loss/train': 1.945163607597351} 11/06/2021 23:46:12 - INFO - __main__ - Step 17862: {'lr': 0.0004859640713466421, 'samples': 3429504, 'steps': 17861, 'loss/train': 1.5751862525939941} 11/06/2021 23:46:12 - INFO - __main__ - Step 17863: {'lr': 0.0004859623181768321, 'samples': 3429696, 'steps': 17862, 'loss/train': 1.3978326320648193} 11/06/2021 23:46:12 - INFO - __main__ - Step 17864: {'lr': 0.0004859605649007012, 'samples': 3429888, 'steps': 17863, 'loss/train': 0.8799737095832825} 11/06/2021 23:46:13 - INFO - __main__ - Step 17865: {'lr': 0.00048595881151825015, 'samples': 3430080, 'steps': 17864, 'loss/train': 1.6823805570602417} 11/06/2021 23:46:13 - INFO - __main__ - Step 17866: {'lr': 0.00048595705802947963, 'samples': 3430272, 'steps': 17865, 'loss/train': 1.4125962257385254} 11/06/2021 23:46:14 - INFO - __main__ - Step 17867: {'lr': 0.0004859553044343905, 'samples': 3430464, 'steps': 17866, 'loss/train': 0.9429519176483154} 11/06/2021 23:46:14 - INFO - __main__ - Step 17868: {'lr': 0.0004859535507329836, 'samples': 3430656, 'steps': 17867, 'loss/train': 1.0948766469955444} 11/06/2021 23:46:15 - INFO - __main__ - Step 17869: {'lr': 0.0004859517969252596, 'samples': 3430848, 'steps': 17868, 'loss/train': 1.6480631828308105} 11/06/2021 23:46:15 - INFO - __main__ - Step 17870: {'lr': 0.0004859500430112194, 'samples': 3431040, 'steps': 17869, 'loss/train': 1.7616888284683228} 11/06/2021 23:46:15 - INFO - __main__ - Step 17871: {'lr': 0.0004859482889908637, 'samples': 3431232, 'steps': 17870, 'loss/train': 1.6862744092941284} 11/06/2021 23:46:17 - INFO - __main__ - Step 17872: {'lr': 0.0004859465348641934, 'samples': 3431424, 'steps': 17871, 'loss/train': 1.6967068910598755} 11/06/2021 23:46:17 - INFO - __main__ - Step 17873: {'lr': 0.0004859447806312093, 'samples': 3431616, 'steps': 17872, 'loss/train': 1.1371982097625732} 11/06/2021 23:46:17 - INFO - __main__ - Step 17874: {'lr': 0.000485943026291912, 'samples': 3431808, 'steps': 17873, 'loss/train': 1.5849249362945557} 11/06/2021 23:46:18 - INFO - __main__ - Step 17875: {'lr': 0.0004859412718463025, 'samples': 3432000, 'steps': 17874, 'loss/train': 1.2743723392486572} 11/06/2021 23:46:18 - INFO - __main__ - Step 17876: {'lr': 0.00048593951729438144, 'samples': 3432192, 'steps': 17875, 'loss/train': 1.4073165655136108} 11/06/2021 23:46:19 - INFO - __main__ - Step 17877: {'lr': 0.0004859377626361497, 'samples': 3432384, 'steps': 17876, 'loss/train': 0.862178385257721} 11/06/2021 23:46:20 - INFO - __main__ - Step 17878: {'lr': 0.00048593600787160806, 'samples': 3432576, 'steps': 17877, 'loss/train': 1.8523950576782227} 11/06/2021 23:46:20 - INFO - __main__ - Step 17879: {'lr': 0.0004859342530007572, 'samples': 3432768, 'steps': 17878, 'loss/train': 2.0718274116516113} 11/06/2021 23:46:20 - INFO - __main__ - Step 17880: {'lr': 0.0004859324980235982, 'samples': 3432960, 'steps': 17879, 'loss/train': 1.829959750175476} 11/06/2021 23:46:21 - INFO - __main__ - Step 17881: {'lr': 0.0004859307429401315, 'samples': 3433152, 'steps': 17880, 'loss/train': 2.3833680152893066} 11/06/2021 23:46:22 - INFO - __main__ - Step 17882: {'lr': 0.0004859289877503581, 'samples': 3433344, 'steps': 17881, 'loss/train': 2.83292293548584} 11/06/2021 23:46:22 - INFO - __main__ - Step 17883: {'lr': 0.00048592723245427874, 'samples': 3433536, 'steps': 17882, 'loss/train': 1.8581055402755737} 11/06/2021 23:46:22 - INFO - __main__ - Step 17884: {'lr': 0.00048592547705189414, 'samples': 3433728, 'steps': 17883, 'loss/train': 1.7676963806152344} 11/06/2021 23:46:23 - INFO - __main__ - Step 17885: {'lr': 0.00048592372154320526, 'samples': 3433920, 'steps': 17884, 'loss/train': 1.4454072713851929} 11/06/2021 23:46:23 - INFO - __main__ - Step 17886: {'lr': 0.0004859219659282127, 'samples': 3434112, 'steps': 17885, 'loss/train': 1.591639518737793} 11/06/2021 23:46:23 - INFO - __main__ - Step 17887: {'lr': 0.00048592021020691745, 'samples': 3434304, 'steps': 17886, 'loss/train': 1.7790858745574951} 11/06/2021 23:46:25 - INFO - __main__ - Step 17888: {'lr': 0.00048591845437932014, 'samples': 3434496, 'steps': 17887, 'loss/train': 1.646702766418457} 11/06/2021 23:46:25 - INFO - __main__ - Step 17889: {'lr': 0.0004859166984454216, 'samples': 3434688, 'steps': 17888, 'loss/train': 1.685816764831543} 11/06/2021 23:46:25 - INFO - __main__ - Step 17890: {'lr': 0.0004859149424052226, 'samples': 3434880, 'steps': 17889, 'loss/train': 1.917184829711914} 11/06/2021 23:46:26 - INFO - __main__ - Step 17891: {'lr': 0.00048591318625872403, 'samples': 3435072, 'steps': 17890, 'loss/train': 1.9429192543029785} 11/06/2021 23:46:26 - INFO - __main__ - Step 17892: {'lr': 0.00048591143000592665, 'samples': 3435264, 'steps': 17891, 'loss/train': 1.4648727178573608} 11/06/2021 23:46:27 - INFO - __main__ - Step 17893: {'lr': 0.00048590967364683116, 'samples': 3435456, 'steps': 17892, 'loss/train': 1.7647534608840942} 11/06/2021 23:46:27 - INFO - __main__ - Step 17894: {'lr': 0.0004859079171814384, 'samples': 3435648, 'steps': 17893, 'loss/train': 2.2489683628082275} 11/06/2021 23:46:28 - INFO - __main__ - Step 17895: {'lr': 0.00048590616060974917, 'samples': 3435840, 'steps': 17894, 'loss/train': 2.1065878868103027} 11/06/2021 23:46:28 - INFO - __main__ - Step 17896: {'lr': 0.00048590440393176434, 'samples': 3436032, 'steps': 17895, 'loss/train': 2.224180221557617} 11/06/2021 23:46:28 - INFO - __main__ - Step 17897: {'lr': 0.00048590264714748455, 'samples': 3436224, 'steps': 17896, 'loss/train': 1.2746485471725464} 11/06/2021 23:46:29 - INFO - __main__ - Step 17898: {'lr': 0.0004859008902569107, 'samples': 3436416, 'steps': 17897, 'loss/train': 1.3870652914047241} 11/06/2021 23:46:30 - INFO - __main__ - Step 17899: {'lr': 0.00048589913326004355, 'samples': 3436608, 'steps': 17898, 'loss/train': 1.7949775457382202} 11/06/2021 23:46:30 - INFO - __main__ - Step 17900: {'lr': 0.0004858973761568839, 'samples': 3436800, 'steps': 17899, 'loss/train': 1.4077565670013428} 11/06/2021 23:46:30 - INFO - __main__ - Step 17901: {'lr': 0.0004858956189474325, 'samples': 3436992, 'steps': 17900, 'loss/train': 1.6623355150222778} 11/06/2021 23:46:31 - INFO - __main__ - Step 17902: {'lr': 0.0004858938616316902, 'samples': 3437184, 'steps': 17901, 'loss/train': 1.7787035703659058} 11/06/2021 23:46:31 - INFO - __main__ - Step 17903: {'lr': 0.00048589210420965775, 'samples': 3437376, 'steps': 17902, 'loss/train': 1.1888090372085571} 11/06/2021 23:46:32 - INFO - __main__ - Step 17904: {'lr': 0.0004858903466813359, 'samples': 3437568, 'steps': 17903, 'loss/train': 2.1995456218719482} 11/06/2021 23:46:33 - INFO - __main__ - Step 17905: {'lr': 0.0004858885890467256, 'samples': 3437760, 'steps': 17904, 'loss/train': 1.9049638509750366} 11/06/2021 23:46:33 - INFO - __main__ - Step 17906: {'lr': 0.00048588683130582755, 'samples': 3437952, 'steps': 17905, 'loss/train': 2.155893087387085} 11/06/2021 23:46:33 - INFO - __main__ - Step 17907: {'lr': 0.00048588507345864246, 'samples': 3438144, 'steps': 17906, 'loss/train': 1.5869488716125488} 11/06/2021 23:46:34 - INFO - __main__ - Step 17908: {'lr': 0.00048588331550517125, 'samples': 3438336, 'steps': 17907, 'loss/train': 2.066535472869873} 11/06/2021 23:46:35 - INFO - __main__ - Step 17909: {'lr': 0.0004858815574454146, 'samples': 3438528, 'steps': 17908, 'loss/train': 1.5409685373306274} 11/06/2021 23:46:35 - INFO - __main__ - Step 17910: {'lr': 0.0004858797992793734, 'samples': 3438720, 'steps': 17909, 'loss/train': 3.083186626434326} 11/06/2021 23:46:35 - INFO - __main__ - Step 17911: {'lr': 0.0004858780410070484, 'samples': 3438912, 'steps': 17910, 'loss/train': 1.5256142616271973} 11/06/2021 23:46:36 - INFO - __main__ - Step 17912: {'lr': 0.0004858762826284404, 'samples': 3439104, 'steps': 17911, 'loss/train': 1.5381463766098022} 11/06/2021 23:46:36 - INFO - __main__ - Step 17913: {'lr': 0.00048587452414355014, 'samples': 3439296, 'steps': 17912, 'loss/train': 1.493998408317566} 11/06/2021 23:46:38 - INFO - __main__ - Step 17914: {'lr': 0.00048587276555237853, 'samples': 3439488, 'steps': 17913, 'loss/train': 1.5796653032302856} 11/06/2021 23:46:38 - INFO - __main__ - Step 17915: {'lr': 0.00048587100685492626, 'samples': 3439680, 'steps': 17914, 'loss/train': 1.4091830253601074} 11/06/2021 23:46:39 - INFO - __main__ - Step 17916: {'lr': 0.00048586924805119416, 'samples': 3439872, 'steps': 17915, 'loss/train': 1.8840548992156982} 11/06/2021 23:46:39 - INFO - __main__ - Step 17917: {'lr': 0.00048586748914118303, 'samples': 3440064, 'steps': 17916, 'loss/train': 1.3594805002212524} 11/06/2021 23:46:39 - INFO - __main__ - Step 17918: {'lr': 0.0004858657301248936, 'samples': 3440256, 'steps': 17917, 'loss/train': 1.8550138473510742} 11/06/2021 23:46:40 - INFO - __main__ - Step 17919: {'lr': 0.00048586397100232673, 'samples': 3440448, 'steps': 17918, 'loss/train': 1.944078803062439} 11/06/2021 23:46:40 - INFO - __main__ - Step 17920: {'lr': 0.00048586221177348323, 'samples': 3440640, 'steps': 17919, 'loss/train': 1.8348121643066406} 11/06/2021 23:46:40 - INFO - __main__ - Step 17921: {'lr': 0.00048586045243836386, 'samples': 3440832, 'steps': 17920, 'loss/train': 1.7867494821548462} 11/06/2021 23:46:41 - INFO - __main__ - Step 17922: {'lr': 0.0004858586929969693, 'samples': 3441024, 'steps': 17921, 'loss/train': 1.8128037452697754} 11/06/2021 23:46:42 - INFO - __main__ - Step 17923: {'lr': 0.0004858569334493006, 'samples': 3441216, 'steps': 17922, 'loss/train': 0.7357732653617859} 11/06/2021 23:46:42 - INFO - __main__ - Step 17924: {'lr': 0.0004858551737953583, 'samples': 3441408, 'steps': 17923, 'loss/train': 1.7629271745681763} 11/06/2021 23:46:42 - INFO - __main__ - Step 17925: {'lr': 0.00048585341403514337, 'samples': 3441600, 'steps': 17924, 'loss/train': 1.7613294124603271} 11/06/2021 23:46:43 - INFO - __main__ - Step 17926: {'lr': 0.0004858516541686565, 'samples': 3441792, 'steps': 17925, 'loss/train': 2.0726709365844727} 11/06/2021 23:46:45 - INFO - __main__ - Step 17927: {'lr': 0.0004858498941958985, 'samples': 3441984, 'steps': 17926, 'loss/train': 1.4030810594558716} 11/06/2021 23:46:45 - INFO - __main__ - Step 17928: {'lr': 0.00048584813411687016, 'samples': 3442176, 'steps': 17927, 'loss/train': 1.4754228591918945} 11/06/2021 23:46:45 - INFO - __main__ - Step 17929: {'lr': 0.00048584637393157235, 'samples': 3442368, 'steps': 17928, 'loss/train': 1.5624349117279053} 11/06/2021 23:46:46 - INFO - __main__ - Step 17930: {'lr': 0.00048584461364000576, 'samples': 3442560, 'steps': 17929, 'loss/train': 1.3149511814117432} 11/06/2021 23:46:46 - INFO - __main__ - Step 17931: {'lr': 0.00048584285324217125, 'samples': 3442752, 'steps': 17930, 'loss/train': 1.7911587953567505} 11/06/2021 23:46:46 - INFO - __main__ - Step 17932: {'lr': 0.00048584109273806954, 'samples': 3442944, 'steps': 17931, 'loss/train': 1.8528138399124146} 11/06/2021 23:46:47 - INFO - __main__ - Step 17933: {'lr': 0.00048583933212770154, 'samples': 3443136, 'steps': 17932, 'loss/train': 1.8300704956054688} 11/06/2021 23:46:48 - INFO - __main__ - Step 17934: {'lr': 0.00048583757141106796, 'samples': 3443328, 'steps': 17933, 'loss/train': 1.5763187408447266} 11/06/2021 23:46:48 - INFO - __main__ - Step 17935: {'lr': 0.00048583581058816956, 'samples': 3443520, 'steps': 17934, 'loss/train': 1.7421388626098633} 11/06/2021 23:46:48 - INFO - __main__ - Step 17936: {'lr': 0.00048583404965900725, 'samples': 3443712, 'steps': 17935, 'loss/train': 1.6953171491622925} 11/06/2021 23:46:49 - INFO - __main__ - Step 17937: {'lr': 0.0004858322886235817, 'samples': 3443904, 'steps': 17936, 'loss/train': 1.5555347204208374} 11/06/2021 23:46:49 - INFO - __main__ - Step 17938: {'lr': 0.0004858305274818938, 'samples': 3444096, 'steps': 17937, 'loss/train': 1.8018510341644287} 11/06/2021 23:46:50 - INFO - __main__ - Step 17939: {'lr': 0.0004858287662339443, 'samples': 3444288, 'steps': 17938, 'loss/train': 1.823989748954773} 11/06/2021 23:46:51 - INFO - __main__ - Step 17940: {'lr': 0.00048582700487973397, 'samples': 3444480, 'steps': 17939, 'loss/train': 5.880978584289551} 11/06/2021 23:46:51 - INFO - __main__ - Step 17941: {'lr': 0.00048582524341926365, 'samples': 3444672, 'steps': 17940, 'loss/train': 1.577499270439148} 11/06/2021 23:46:51 - INFO - __main__ - Step 17942: {'lr': 0.0004858234818525341, 'samples': 3444864, 'steps': 17941, 'loss/train': 1.5800806283950806} 11/06/2021 23:46:52 - INFO - __main__ - Step 17943: {'lr': 0.0004858217201795462, 'samples': 3445056, 'steps': 17942, 'loss/train': 1.5680005550384521} 11/06/2021 23:46:52 - INFO - __main__ - Step 17944: {'lr': 0.0004858199584003006, 'samples': 3445248, 'steps': 17943, 'loss/train': 1.8235677480697632} 11/06/2021 23:46:53 - INFO - __main__ - Step 17945: {'lr': 0.00048581819651479814, 'samples': 3445440, 'steps': 17944, 'loss/train': 1.6470332145690918} 11/06/2021 23:46:53 - INFO - __main__ - Step 17946: {'lr': 0.0004858164345230397, 'samples': 3445632, 'steps': 17945, 'loss/train': 1.935632348060608} 11/06/2021 23:46:54 - INFO - __main__ - Step 17947: {'lr': 0.000485814672425026, 'samples': 3445824, 'steps': 17946, 'loss/train': 1.5922514200210571} 11/06/2021 23:46:54 - INFO - __main__ - Step 17948: {'lr': 0.0004858129102207578, 'samples': 3446016, 'steps': 17947, 'loss/train': 1.7044637203216553} 11/06/2021 23:46:54 - INFO - __main__ - Step 17949: {'lr': 0.0004858111479102359, 'samples': 3446208, 'steps': 17948, 'loss/train': 1.8843209743499756} 11/06/2021 23:46:55 - INFO - __main__ - Step 17950: {'lr': 0.00048580938549346134, 'samples': 3446400, 'steps': 17949, 'loss/train': 1.5622127056121826} 11/06/2021 23:46:56 - INFO - __main__ - Step 17951: {'lr': 0.00048580762297043456, 'samples': 3446592, 'steps': 17950, 'loss/train': 1.3768843412399292} 11/06/2021 23:46:56 - INFO - __main__ - Step 17952: {'lr': 0.00048580586034115646, 'samples': 3446784, 'steps': 17951, 'loss/train': 1.6338905096054077} 11/06/2021 23:46:57 - INFO - __main__ - Step 17953: {'lr': 0.000485804097605628, 'samples': 3446976, 'steps': 17952, 'loss/train': 1.5911237001419067} 11/06/2021 23:46:57 - INFO - __main__ - Step 17954: {'lr': 0.00048580233476384975, 'samples': 3447168, 'steps': 17953, 'loss/train': 1.5193482637405396} 11/06/2021 23:46:58 - INFO - __main__ - Step 17955: {'lr': 0.0004858005718158227, 'samples': 3447360, 'steps': 17954, 'loss/train': 1.441080093383789} 11/06/2021 23:46:58 - INFO - __main__ - Step 17956: {'lr': 0.0004857988087615475, 'samples': 3447552, 'steps': 17955, 'loss/train': 1.336949110031128} 11/06/2021 23:46:59 - INFO - __main__ - Step 17957: {'lr': 0.000485797045601025, 'samples': 3447744, 'steps': 17956, 'loss/train': 1.5038697719573975} 11/06/2021 23:46:59 - INFO - __main__ - Step 17958: {'lr': 0.000485795282334256, 'samples': 3447936, 'steps': 17957, 'loss/train': 1.2776819467544556} 11/06/2021 23:46:59 - INFO - __main__ - Step 17959: {'lr': 0.00048579351896124127, 'samples': 3448128, 'steps': 17958, 'loss/train': 1.4108787775039673} 11/06/2021 23:47:00 - INFO - __main__ - Step 17960: {'lr': 0.0004857917554819816, 'samples': 3448320, 'steps': 17959, 'loss/train': 1.811353087425232} 11/06/2021 23:47:01 - INFO - __main__ - Step 17961: {'lr': 0.00048578999189647786, 'samples': 3448512, 'steps': 17960, 'loss/train': 0.4753451347351074} 11/06/2021 23:47:01 - INFO - __main__ - Step 17962: {'lr': 0.00048578822820473074, 'samples': 3448704, 'steps': 17961, 'loss/train': 1.7303311824798584} 11/06/2021 23:47:01 - INFO - __main__ - Step 17963: {'lr': 0.00048578646440674113, 'samples': 3448896, 'steps': 17962, 'loss/train': 1.7330888509750366} 11/06/2021 23:47:02 - INFO - __main__ - Step 17964: {'lr': 0.0004857847005025097, 'samples': 3449088, 'steps': 17963, 'loss/train': 1.6312006711959839} 11/06/2021 23:47:03 - INFO - __main__ - Step 17965: {'lr': 0.0004857829364920374, 'samples': 3449280, 'steps': 17964, 'loss/train': 1.7061258554458618} 11/06/2021 23:47:04 - INFO - __main__ - Step 17966: {'lr': 0.0004857811723753249, 'samples': 3449472, 'steps': 17965, 'loss/train': 2.2474277019500732} 11/06/2021 23:47:04 - INFO - __main__ - Step 17967: {'lr': 0.00048577940815237305, 'samples': 3449664, 'steps': 17966, 'loss/train': 1.5776370763778687} 11/06/2021 23:47:04 - INFO - __main__ - Step 17968: {'lr': 0.00048577764382318265, 'samples': 3449856, 'steps': 17967, 'loss/train': 1.24309504032135} 11/06/2021 23:47:05 - INFO - __main__ - Step 17969: {'lr': 0.0004857758793877545, 'samples': 3450048, 'steps': 17968, 'loss/train': 1.6525585651397705} 11/06/2021 23:47:05 - INFO - __main__ - Step 17970: {'lr': 0.00048577411484608936, 'samples': 3450240, 'steps': 17969, 'loss/train': 1.6471604108810425} 11/06/2021 23:47:06 - INFO - __main__ - Step 17971: {'lr': 0.000485772350198188, 'samples': 3450432, 'steps': 17970, 'loss/train': 1.7846794128417969} 11/06/2021 23:47:06 - INFO - __main__ - Step 17972: {'lr': 0.00048577058544405126, 'samples': 3450624, 'steps': 17971, 'loss/train': 1.542449951171875} 11/06/2021 23:47:07 - INFO - __main__ - Step 17973: {'lr': 0.00048576882058368, 'samples': 3450816, 'steps': 17972, 'loss/train': 1.9146580696105957} 11/06/2021 23:47:07 - INFO - __main__ - Step 17974: {'lr': 0.0004857670556170749, 'samples': 3451008, 'steps': 17973, 'loss/train': 1.7699105739593506} 11/06/2021 23:47:07 - INFO - __main__ - Step 17975: {'lr': 0.0004857652905442368, 'samples': 3451200, 'steps': 17974, 'loss/train': 1.6144200563430786} 11/06/2021 23:47:08 - INFO - __main__ - Step 17976: {'lr': 0.0004857635253651665, 'samples': 3451392, 'steps': 17975, 'loss/train': 1.4524669647216797} 11/06/2021 23:47:09 - INFO - __main__ - Step 17977: {'lr': 0.00048576176007986485, 'samples': 3451584, 'steps': 17976, 'loss/train': 1.5237561464309692} 11/06/2021 23:47:09 - INFO - __main__ - Step 17978: {'lr': 0.00048575999468833256, 'samples': 3451776, 'steps': 17977, 'loss/train': 1.4829519987106323} 11/06/2021 23:47:10 - INFO - __main__ - Step 17979: {'lr': 0.0004857582291905704, 'samples': 3451968, 'steps': 17978, 'loss/train': 1.7393509149551392} 11/06/2021 23:47:10 - INFO - __main__ - Step 17980: {'lr': 0.00048575646358657934, 'samples': 3452160, 'steps': 17979, 'loss/train': 1.6369855403900146} 11/06/2021 23:47:11 - INFO - __main__ - Step 17981: {'lr': 0.00048575469787635997, 'samples': 3452352, 'steps': 17980, 'loss/train': 1.7648541927337646} 11/06/2021 23:47:11 - INFO - __main__ - Step 17982: {'lr': 0.00048575293205991313, 'samples': 3452544, 'steps': 17981, 'loss/train': 1.6027305126190186} 11/06/2021 23:47:11 - INFO - __main__ - Step 17983: {'lr': 0.0004857511661372397, 'samples': 3452736, 'steps': 17982, 'loss/train': 1.7817578315734863} 11/06/2021 23:47:12 - INFO - __main__ - Step 17984: {'lr': 0.00048574940010834045, 'samples': 3452928, 'steps': 17983, 'loss/train': 1.475091814994812} 11/06/2021 23:47:12 - INFO - __main__ - Step 17985: {'lr': 0.0004857476339732161, 'samples': 3453120, 'steps': 17984, 'loss/train': 1.626041293144226} 11/06/2021 23:47:13 - INFO - __main__ - Step 17986: {'lr': 0.0004857458677318676, 'samples': 3453312, 'steps': 17985, 'loss/train': 2.0042805671691895} 11/06/2021 23:47:14 - INFO - __main__ - Step 17987: {'lr': 0.0004857441013842956, 'samples': 3453504, 'steps': 17986, 'loss/train': 1.843093991279602} 11/06/2021 23:47:14 - INFO - __main__ - Step 17988: {'lr': 0.0004857423349305009, 'samples': 3453696, 'steps': 17987, 'loss/train': 1.5723204612731934} 11/06/2021 23:47:14 - INFO - __main__ - Step 17989: {'lr': 0.00048574056837048443, 'samples': 3453888, 'steps': 17988, 'loss/train': 2.2022082805633545} 11/06/2021 23:47:15 - INFO - __main__ - Step 17990: {'lr': 0.0004857388017042468, 'samples': 3454080, 'steps': 17989, 'loss/train': 1.4242730140686035} 11/06/2021 23:47:15 - INFO - __main__ - Step 17991: {'lr': 0.000485737034931789, 'samples': 3454272, 'steps': 17990, 'loss/train': 1.8824354410171509} 11/06/2021 23:47:16 - INFO - __main__ - Step 17992: {'lr': 0.00048573526805311166, 'samples': 3454464, 'steps': 17991, 'loss/train': 1.777707576751709} 11/06/2021 23:47:17 - INFO - __main__ - Step 17993: {'lr': 0.0004857335010682157, 'samples': 3454656, 'steps': 17992, 'loss/train': 1.7429853677749634} 11/06/2021 23:47:17 - INFO - __main__ - Step 17994: {'lr': 0.0004857317339771018, 'samples': 3454848, 'steps': 17993, 'loss/train': 1.6594009399414062} 11/06/2021 23:47:17 - INFO - __main__ - Step 17995: {'lr': 0.0004857299667797709, 'samples': 3455040, 'steps': 17994, 'loss/train': 1.633614420890808} 11/06/2021 23:47:18 - INFO - __main__ - Step 17996: {'lr': 0.0004857281994762236, 'samples': 3455232, 'steps': 17995, 'loss/train': 1.6758201122283936} 11/06/2021 23:47:19 - INFO - __main__ - Step 17997: {'lr': 0.00048572643206646097, 'samples': 3455424, 'steps': 17996, 'loss/train': 2.020440101623535} 11/06/2021 23:47:19 - INFO - __main__ - Step 17998: {'lr': 0.0004857246645504835, 'samples': 3455616, 'steps': 17997, 'loss/train': 1.609384536743164} 11/06/2021 23:47:19 - INFO - __main__ - Step 17999: {'lr': 0.00048572289692829217, 'samples': 3455808, 'steps': 17998, 'loss/train': 0.954354465007782} 11/06/2021 23:47:20 - INFO - __main__ - Step 18000: {'lr': 0.00048572112919988776, 'samples': 3456000, 'steps': 17999, 'loss/train': 1.7683265209197998} 11/06/2021 23:47:20 - INFO - __main__ - Step 18001: {'lr': 0.00048571936136527106, 'samples': 3456192, 'steps': 18000, 'loss/train': 1.2579536437988281} 11/06/2021 23:47:21 - INFO - __main__ - Step 18002: {'lr': 0.0004857175934244428, 'samples': 3456384, 'steps': 18001, 'loss/train': 1.649789571762085} 11/06/2021 23:47:21 - INFO - __main__ - Step 18003: {'lr': 0.0004857158253774039, 'samples': 3456576, 'steps': 18002, 'loss/train': 1.7836414575576782} 11/06/2021 23:47:22 - INFO - __main__ - Step 18004: {'lr': 0.0004857140572241551, 'samples': 3456768, 'steps': 18003, 'loss/train': 1.7444621324539185} 11/06/2021 23:47:22 - INFO - __main__ - Step 18005: {'lr': 0.00048571228896469713, 'samples': 3456960, 'steps': 18004, 'loss/train': 2.0532634258270264} 11/06/2021 23:47:23 - INFO - __main__ - Step 18006: {'lr': 0.0004857105205990308, 'samples': 3457152, 'steps': 18005, 'loss/train': 1.4198518991470337} 11/06/2021 23:47:24 - INFO - __main__ - Step 18007: {'lr': 0.00048570875212715706, 'samples': 3457344, 'steps': 18006, 'loss/train': 1.4929587841033936} 11/06/2021 23:47:24 - INFO - __main__ - Step 18008: {'lr': 0.0004857069835490765, 'samples': 3457536, 'steps': 18007, 'loss/train': 1.977075457572937} 11/06/2021 23:47:24 - INFO - __main__ - Step 18009: {'lr': 0.00048570521486479004, 'samples': 3457728, 'steps': 18008, 'loss/train': 1.3553253412246704} 11/06/2021 23:47:25 - INFO - __main__ - Step 18010: {'lr': 0.0004857034460742984, 'samples': 3457920, 'steps': 18009, 'loss/train': 2.5607073307037354} 11/06/2021 23:47:25 - INFO - __main__ - Step 18011: {'lr': 0.0004857016771776025, 'samples': 3458112, 'steps': 18010, 'loss/train': 1.862199068069458} 11/06/2021 23:47:27 - INFO - __main__ - Step 18012: {'lr': 0.000485699908174703, 'samples': 3458304, 'steps': 18011, 'loss/train': 1.1634882688522339} 11/06/2021 23:47:27 - INFO - __main__ - Step 18013: {'lr': 0.0004856981390656008, 'samples': 3458496, 'steps': 18012, 'loss/train': 2.178666114807129} 11/06/2021 23:47:27 - INFO - __main__ - Step 18014: {'lr': 0.00048569636985029664, 'samples': 3458688, 'steps': 18013, 'loss/train': 1.3716199398040771} 11/06/2021 23:47:28 - INFO - __main__ - Step 18015: {'lr': 0.00048569460052879136, 'samples': 3458880, 'steps': 18014, 'loss/train': 1.917330265045166} 11/06/2021 23:47:28 - INFO - __main__ - Step 18016: {'lr': 0.0004856928311010857, 'samples': 3459072, 'steps': 18015, 'loss/train': 1.8656532764434814} 11/06/2021 23:47:28 - INFO - __main__ - Step 18017: {'lr': 0.00048569106156718045, 'samples': 3459264, 'steps': 18016, 'loss/train': 1.8904410600662231} 11/06/2021 23:47:29 - INFO - __main__ - Step 18018: {'lr': 0.00048568929192707657, 'samples': 3459456, 'steps': 18017, 'loss/train': 1.839964509010315} 11/06/2021 23:47:30 - INFO - __main__ - Step 18019: {'lr': 0.0004856875221807746, 'samples': 3459648, 'steps': 18018, 'loss/train': 1.3662374019622803} 11/06/2021 23:47:30 - INFO - __main__ - Step 18020: {'lr': 0.0004856857523282755, 'samples': 3459840, 'steps': 18019, 'loss/train': 1.7677093744277954} 11/06/2021 23:47:30 - INFO - __main__ - Step 18021: {'lr': 0.0004856839823695801, 'samples': 3460032, 'steps': 18020, 'loss/train': 0.6586083173751831} 11/06/2021 23:47:31 - INFO - __main__ - Step 18022: {'lr': 0.00048568221230468905, 'samples': 3460224, 'steps': 18021, 'loss/train': 1.783107042312622} 11/06/2021 23:47:31 - INFO - __main__ - Step 18023: {'lr': 0.0004856804421336033, 'samples': 3460416, 'steps': 18022, 'loss/train': 1.4246373176574707} 11/06/2021 23:47:32 - INFO - __main__ - Step 18024: {'lr': 0.0004856786718563235, 'samples': 3460608, 'steps': 18023, 'loss/train': 1.6949611902236938} 11/06/2021 23:47:32 - INFO - __main__ - Step 18025: {'lr': 0.0004856769014728506, 'samples': 3460800, 'steps': 18024, 'loss/train': 1.8121979236602783} 11/06/2021 23:47:33 - INFO - __main__ - Step 18026: {'lr': 0.0004856751309831853, 'samples': 3460992, 'steps': 18025, 'loss/train': 1.5363273620605469} 11/06/2021 23:47:33 - INFO - __main__ - Step 18027: {'lr': 0.00048567336038732843, 'samples': 3461184, 'steps': 18026, 'loss/train': 1.299382209777832} 11/06/2021 23:47:33 - INFO - __main__ - Step 18028: {'lr': 0.0004856715896852808, 'samples': 3461376, 'steps': 18027, 'loss/train': 1.1882447004318237} 11/06/2021 23:47:35 - INFO - __main__ - Step 18029: {'lr': 0.0004856698188770432, 'samples': 3461568, 'steps': 18028, 'loss/train': 1.8871138095855713} 11/06/2021 23:47:35 - INFO - __main__ - Step 18030: {'lr': 0.0004856680479626163, 'samples': 3461760, 'steps': 18029, 'loss/train': 1.3500367403030396} 11/06/2021 23:47:35 - INFO - __main__ - Step 18031: {'lr': 0.0004856662769420012, 'samples': 3461952, 'steps': 18030, 'loss/train': 1.501436471939087} 11/06/2021 23:47:36 - INFO - __main__ - Step 18032: {'lr': 0.0004856645058151984, 'samples': 3462144, 'steps': 18031, 'loss/train': 1.7649421691894531} 11/06/2021 23:47:36 - INFO - __main__ - Step 18033: {'lr': 0.0004856627345822088, 'samples': 3462336, 'steps': 18032, 'loss/train': 1.8708281517028809} 11/06/2021 23:47:37 - INFO - __main__ - Step 18034: {'lr': 0.0004856609632430332, 'samples': 3462528, 'steps': 18033, 'loss/train': 1.811106562614441} 11/06/2021 23:47:37 - INFO - __main__ - Step 18035: {'lr': 0.00048565919179767246, 'samples': 3462720, 'steps': 18034, 'loss/train': 2.175701856613159} 11/06/2021 23:47:38 - INFO - __main__ - Step 18036: {'lr': 0.0004856574202461273, 'samples': 3462912, 'steps': 18035, 'loss/train': 1.7209889888763428} 11/06/2021 23:47:38 - INFO - __main__ - Step 18037: {'lr': 0.0004856556485883985, 'samples': 3463104, 'steps': 18036, 'loss/train': 1.8823835849761963} 11/06/2021 23:47:38 - INFO - __main__ - Step 18038: {'lr': 0.000485653876824487, 'samples': 3463296, 'steps': 18037, 'loss/train': 1.6220828294754028} 11/06/2021 23:47:39 - INFO - __main__ - Step 18039: {'lr': 0.00048565210495439337, 'samples': 3463488, 'steps': 18038, 'loss/train': 1.2623989582061768} 11/06/2021 23:47:40 - INFO - __main__ - Step 18040: {'lr': 0.00048565033297811867, 'samples': 3463680, 'steps': 18039, 'loss/train': 1.4432919025421143} 11/06/2021 23:47:40 - INFO - __main__ - Step 18041: {'lr': 0.0004856485608956635, 'samples': 3463872, 'steps': 18040, 'loss/train': 1.4835745096206665} 11/06/2021 23:47:40 - INFO - __main__ - Step 18042: {'lr': 0.00048564678870702873, 'samples': 3464064, 'steps': 18041, 'loss/train': 1.4812495708465576} 11/06/2021 23:47:41 - INFO - __main__ - Step 18043: {'lr': 0.00048564501641221516, 'samples': 3464256, 'steps': 18042, 'loss/train': 1.9328662157058716} 11/06/2021 23:47:41 - INFO - __main__ - Step 18044: {'lr': 0.00048564324401122357, 'samples': 3464448, 'steps': 18043, 'loss/train': 1.3001339435577393} 11/06/2021 23:47:42 - INFO - __main__ - Step 18045: {'lr': 0.0004856414715040548, 'samples': 3464640, 'steps': 18044, 'loss/train': 1.5383938550949097} 11/06/2021 23:47:43 - INFO - __main__ - Step 18046: {'lr': 0.0004856396988907096, 'samples': 3464832, 'steps': 18045, 'loss/train': 0.6114344596862793} 11/06/2021 23:47:43 - INFO - __main__ - Step 18047: {'lr': 0.00048563792617118876, 'samples': 3465024, 'steps': 18046, 'loss/train': 1.156066656112671} 11/06/2021 23:47:43 - INFO - __main__ - Step 18048: {'lr': 0.00048563615334549316, 'samples': 3465216, 'steps': 18047, 'loss/train': 1.3910892009735107} 11/06/2021 23:47:44 - INFO - __main__ - Step 18049: {'lr': 0.0004856343804136235, 'samples': 3465408, 'steps': 18048, 'loss/train': 1.6260871887207031} 11/06/2021 23:47:45 - INFO - __main__ - Step 18050: {'lr': 0.0004856326073755806, 'samples': 3465600, 'steps': 18049, 'loss/train': 1.7786577939987183} 11/06/2021 23:47:45 - INFO - __main__ - Step 18051: {'lr': 0.0004856308342313653, 'samples': 3465792, 'steps': 18050, 'loss/train': 1.674890398979187} 11/06/2021 23:47:46 - INFO - __main__ - Step 18052: {'lr': 0.00048562906098097847, 'samples': 3465984, 'steps': 18051, 'loss/train': 1.9408053159713745} 11/06/2021 23:47:46 - INFO - __main__ - Step 18053: {'lr': 0.0004856272876244208, 'samples': 3466176, 'steps': 18052, 'loss/train': 1.9470312595367432} 11/06/2021 23:47:46 - INFO - __main__ - Step 18054: {'lr': 0.000485625514161693, 'samples': 3466368, 'steps': 18053, 'loss/train': 1.8458075523376465} 11/06/2021 23:47:47 - INFO - __main__ - Step 18055: {'lr': 0.00048562374059279604, 'samples': 3466560, 'steps': 18054, 'loss/train': 1.438137173652649} 11/06/2021 23:47:48 - INFO - __main__ - Step 18056: {'lr': 0.00048562196691773066, 'samples': 3466752, 'steps': 18055, 'loss/train': 1.7463195323944092} 11/06/2021 23:47:48 - INFO - __main__ - Step 18057: {'lr': 0.00048562019313649766, 'samples': 3466944, 'steps': 18056, 'loss/train': 1.234910249710083} 11/06/2021 23:47:49 - INFO - __main__ - Step 18058: {'lr': 0.0004856184192490979, 'samples': 3467136, 'steps': 18057, 'loss/train': 1.2480186223983765} 11/06/2021 23:47:49 - INFO - __main__ - Step 18059: {'lr': 0.000485616645255532, 'samples': 3467328, 'steps': 18058, 'loss/train': 1.3768389225006104} 11/06/2021 23:47:50 - INFO - __main__ - Step 18060: {'lr': 0.0004856148711558009, 'samples': 3467520, 'steps': 18059, 'loss/train': 1.8986867666244507} 11/06/2021 23:47:50 - INFO - __main__ - Step 18061: {'lr': 0.00048561309694990543, 'samples': 3467712, 'steps': 18060, 'loss/train': 1.514434576034546} 11/06/2021 23:47:51 - INFO - __main__ - Step 18062: {'lr': 0.00048561132263784634, 'samples': 3467904, 'steps': 18061, 'loss/train': 1.9755878448486328} 11/06/2021 23:47:51 - INFO - __main__ - Step 18063: {'lr': 0.00048560954821962434, 'samples': 3468096, 'steps': 18062, 'loss/train': 1.7900665998458862} 11/06/2021 23:47:51 - INFO - __main__ - Step 18064: {'lr': 0.0004856077736952404, 'samples': 3468288, 'steps': 18063, 'loss/train': 2.1173062324523926} 11/06/2021 23:47:52 - INFO - __main__ - Step 18065: {'lr': 0.00048560599906469513, 'samples': 3468480, 'steps': 18064, 'loss/train': 1.6794781684875488} 11/06/2021 23:47:53 - INFO - __main__ - Step 18066: {'lr': 0.00048560422432798956, 'samples': 3468672, 'steps': 18065, 'loss/train': 1.2855576276779175} 11/06/2021 23:47:53 - INFO - __main__ - Step 18067: {'lr': 0.0004856024494851243, 'samples': 3468864, 'steps': 18066, 'loss/train': 0.3096511662006378} 11/06/2021 23:47:54 - INFO - __main__ - Step 18068: {'lr': 0.00048560067453610025, 'samples': 3469056, 'steps': 18067, 'loss/train': 1.4566000699996948} 11/06/2021 23:47:54 - INFO - __main__ - Step 18069: {'lr': 0.00048559889948091814, 'samples': 3469248, 'steps': 18068, 'loss/train': 1.557026743888855} 11/06/2021 23:47:54 - INFO - __main__ - Step 18070: {'lr': 0.0004855971243195788, 'samples': 3469440, 'steps': 18069, 'loss/train': 2.022430181503296} 11/06/2021 23:47:55 - INFO - __main__ - Step 18071: {'lr': 0.00048559534905208304, 'samples': 3469632, 'steps': 18070, 'loss/train': 1.8570542335510254} 11/06/2021 23:47:56 - INFO - __main__ - Step 18072: {'lr': 0.0004855935736784316, 'samples': 3469824, 'steps': 18071, 'loss/train': 1.6132595539093018} 11/06/2021 23:47:56 - INFO - __main__ - Step 18073: {'lr': 0.00048559179819862537, 'samples': 3470016, 'steps': 18072, 'loss/train': 2.393339157104492} 11/06/2021 23:47:57 - INFO - __main__ - Step 18074: {'lr': 0.0004855900226126651, 'samples': 3470208, 'steps': 18073, 'loss/train': 1.808935523033142} 11/06/2021 23:47:57 - INFO - __main__ - Step 18075: {'lr': 0.00048558824692055156, 'samples': 3470400, 'steps': 18074, 'loss/train': 1.6323792934417725} 11/06/2021 23:47:58 - INFO - __main__ - Step 18076: {'lr': 0.0004855864711222857, 'samples': 3470592, 'steps': 18075, 'loss/train': 1.7564913034439087} 11/06/2021 23:47:58 - INFO - __main__ - Step 18077: {'lr': 0.0004855846952178682, 'samples': 3470784, 'steps': 18076, 'loss/train': 1.3942724466323853} 11/06/2021 23:47:59 - INFO - __main__ - Step 18078: {'lr': 0.0004855829192072998, 'samples': 3470976, 'steps': 18077, 'loss/train': 1.9024382829666138} 11/06/2021 23:47:59 - INFO - __main__ - Step 18079: {'lr': 0.00048558114309058144, 'samples': 3471168, 'steps': 18078, 'loss/train': 1.360500693321228} 11/06/2021 23:47:59 - INFO - __main__ - Step 18080: {'lr': 0.00048557936686771376, 'samples': 3471360, 'steps': 18079, 'loss/train': 1.4804255962371826} 11/06/2021 23:48:00 - INFO - __main__ - Step 18081: {'lr': 0.0004855775905386977, 'samples': 3471552, 'steps': 18080, 'loss/train': 1.54103422164917} 11/06/2021 23:48:01 - INFO - __main__ - Step 18082: {'lr': 0.000485575814103534, 'samples': 3471744, 'steps': 18081, 'loss/train': 1.574988842010498} 11/06/2021 23:48:01 - INFO - __main__ - Step 18083: {'lr': 0.0004855740375622235, 'samples': 3471936, 'steps': 18082, 'loss/train': 1.7808576822280884} 11/06/2021 23:48:01 - INFO - __main__ - Step 18084: {'lr': 0.00048557226091476704, 'samples': 3472128, 'steps': 18083, 'loss/train': 1.7548338174819946} 11/06/2021 23:48:02 - INFO - __main__ - Step 18085: {'lr': 0.0004855704841611652, 'samples': 3472320, 'steps': 18084, 'loss/train': 1.2830348014831543} 11/06/2021 23:48:04 - INFO - __main__ - Step 18086: {'lr': 0.00048556870730141906, 'samples': 3472512, 'steps': 18085, 'loss/train': 1.6364481449127197} 11/06/2021 23:48:04 - INFO - __main__ - Step 18087: {'lr': 0.00048556693033552926, 'samples': 3472704, 'steps': 18086, 'loss/train': 1.6514034271240234} 11/06/2021 23:48:05 - INFO - __main__ - Step 18088: {'lr': 0.0004855651532634966, 'samples': 3472896, 'steps': 18087, 'loss/train': 1.5415527820587158} 11/06/2021 23:48:05 - INFO - __main__ - Step 18089: {'lr': 0.00048556337608532196, 'samples': 3473088, 'steps': 18088, 'loss/train': 1.6693471670150757} 11/06/2021 23:48:05 - INFO - __main__ - Step 18090: {'lr': 0.00048556159880100604, 'samples': 3473280, 'steps': 18089, 'loss/train': 1.8687024116516113} 11/06/2021 23:48:06 - INFO - __main__ - Step 18091: {'lr': 0.00048555982141054976, 'samples': 3473472, 'steps': 18090, 'loss/train': 1.8448222875595093} 11/06/2021 23:48:06 - INFO - __main__ - Step 18092: {'lr': 0.0004855580439139539, 'samples': 3473664, 'steps': 18091, 'loss/train': 1.816220998764038} 11/06/2021 23:48:06 - INFO - __main__ - Step 18093: {'lr': 0.00048555626631121906, 'samples': 3473856, 'steps': 18092, 'loss/train': 1.991423487663269} 11/06/2021 23:48:07 - INFO - __main__ - Step 18094: {'lr': 0.0004855544886023463, 'samples': 3474048, 'steps': 18093, 'loss/train': 1.56856107711792} 11/06/2021 23:48:08 - INFO - __main__ - Step 18095: {'lr': 0.00048555271078733637, 'samples': 3474240, 'steps': 18094, 'loss/train': 1.76602303981781} 11/06/2021 23:48:08 - INFO - __main__ - Step 18096: {'lr': 0.00048555093286618996, 'samples': 3474432, 'steps': 18095, 'loss/train': 1.4684127569198608} 11/06/2021 23:48:08 - INFO - __main__ - Step 18097: {'lr': 0.0004855491548389079, 'samples': 3474624, 'steps': 18096, 'loss/train': 1.608511209487915} 11/06/2021 23:48:09 - INFO - __main__ - Step 18098: {'lr': 0.0004855473767054911, 'samples': 3474816, 'steps': 18097, 'loss/train': 1.434108018875122} 11/06/2021 23:48:10 - INFO - __main__ - Step 18099: {'lr': 0.00048554559846594026, 'samples': 3475008, 'steps': 18098, 'loss/train': 1.5190415382385254} 11/06/2021 23:48:10 - INFO - __main__ - Step 18100: {'lr': 0.0004855438201202562, 'samples': 3475200, 'steps': 18099, 'loss/train': 1.5204631090164185} 11/06/2021 23:48:11 - INFO - __main__ - Step 18101: {'lr': 0.0004855420416684398, 'samples': 3475392, 'steps': 18100, 'loss/train': 1.687787652015686} 11/06/2021 23:48:11 - INFO - __main__ - Step 18102: {'lr': 0.0004855402631104917, 'samples': 3475584, 'steps': 18101, 'loss/train': 1.7101457118988037} 11/06/2021 23:48:11 - INFO - __main__ - Step 18103: {'lr': 0.0004855384844464128, 'samples': 3475776, 'steps': 18102, 'loss/train': 0.6320077776908875} 11/06/2021 23:48:12 - INFO - __main__ - Step 18104: {'lr': 0.00048553670567620395, 'samples': 3475968, 'steps': 18103, 'loss/train': 5.830017566680908} 11/06/2021 23:48:13 - INFO - __main__ - Step 18105: {'lr': 0.0004855349267998659, 'samples': 3476160, 'steps': 18104, 'loss/train': 1.7267546653747559} 11/06/2021 23:48:13 - INFO - __main__ - Step 18106: {'lr': 0.0004855331478173994, 'samples': 3476352, 'steps': 18105, 'loss/train': 1.6141504049301147} 11/06/2021 23:48:13 - INFO - __main__ - Step 18107: {'lr': 0.0004855313687288053, 'samples': 3476544, 'steps': 18106, 'loss/train': 1.6809107065200806} 11/06/2021 23:48:14 - INFO - __main__ - Step 18108: {'lr': 0.00048552958953408437, 'samples': 3476736, 'steps': 18107, 'loss/train': 1.5354968309402466} 11/06/2021 23:48:14 - INFO - __main__ - Step 18109: {'lr': 0.0004855278102332375, 'samples': 3476928, 'steps': 18108, 'loss/train': 1.51222562789917} 11/06/2021 23:48:15 - INFO - __main__ - Step 18110: {'lr': 0.0004855260308262654, 'samples': 3477120, 'steps': 18109, 'loss/train': 1.4141589403152466} 11/06/2021 23:48:15 - INFO - __main__ - Step 18111: {'lr': 0.00048552425131316893, 'samples': 3477312, 'steps': 18110, 'loss/train': 1.4904797077178955} 11/06/2021 23:48:16 - INFO - __main__ - Step 18112: {'lr': 0.0004855224716939488, 'samples': 3477504, 'steps': 18111, 'loss/train': 1.6454395055770874} 11/06/2021 23:48:16 - INFO - __main__ - Step 18113: {'lr': 0.0004855206919686059, 'samples': 3477696, 'steps': 18112, 'loss/train': 1.3504629135131836} 11/06/2021 23:48:16 - INFO - __main__ - Step 18114: {'lr': 0.0004855189121371411, 'samples': 3477888, 'steps': 18113, 'loss/train': 2.2587153911590576} 11/06/2021 23:48:17 - INFO - __main__ - Step 18115: {'lr': 0.00048551713219955505, 'samples': 3478080, 'steps': 18114, 'loss/train': 1.7638386487960815} 11/06/2021 23:48:18 - INFO - __main__ - Step 18116: {'lr': 0.00048551535215584865, 'samples': 3478272, 'steps': 18115, 'loss/train': 2.0963571071624756} 11/06/2021 23:48:18 - INFO - __main__ - Step 18117: {'lr': 0.00048551357200602265, 'samples': 3478464, 'steps': 18116, 'loss/train': 2.086501359939575} 11/06/2021 23:48:18 - INFO - __main__ - Step 18118: {'lr': 0.0004855117917500778, 'samples': 3478656, 'steps': 18117, 'loss/train': 1.9306416511535645} 11/06/2021 23:48:19 - INFO - __main__ - Step 18119: {'lr': 0.000485510011388015, 'samples': 3478848, 'steps': 18118, 'loss/train': 1.378063440322876} 11/06/2021 23:48:20 - INFO - __main__ - Step 18120: {'lr': 0.00048550823091983507, 'samples': 3479040, 'steps': 18119, 'loss/train': 1.7439087629318237} 11/06/2021 23:48:20 - INFO - __main__ - Step 18121: {'lr': 0.00048550645034553877, 'samples': 3479232, 'steps': 18120, 'loss/train': 1.3175225257873535} 11/06/2021 23:48:21 - INFO - __main__ - Step 18122: {'lr': 0.00048550466966512684, 'samples': 3479424, 'steps': 18121, 'loss/train': 1.7445465326309204} 11/06/2021 23:48:21 - INFO - __main__ - Step 18123: {'lr': 0.0004855028888786002, 'samples': 3479616, 'steps': 18122, 'loss/train': 1.4202873706817627} 11/06/2021 23:48:21 - INFO - __main__ - Step 18124: {'lr': 0.00048550110798595953, 'samples': 3479808, 'steps': 18123, 'loss/train': 1.469502568244934} 11/06/2021 23:48:23 - INFO - __main__ - Step 18125: {'lr': 0.0004854993269872057, 'samples': 3480000, 'steps': 18124, 'loss/train': 1.5502510070800781} 11/06/2021 23:48:23 - INFO - __main__ - Step 18126: {'lr': 0.0004854975458823396, 'samples': 3480192, 'steps': 18125, 'loss/train': 1.8614351749420166} 11/06/2021 23:48:23 - INFO - __main__ - Step 18127: {'lr': 0.0004854957646713618, 'samples': 3480384, 'steps': 18126, 'loss/train': 1.5096491575241089} 11/06/2021 23:48:24 - INFO - __main__ - Step 18128: {'lr': 0.00048549398335427337, 'samples': 3480576, 'steps': 18127, 'loss/train': 1.1223576068878174} 11/06/2021 23:48:24 - INFO - __main__ - Step 18129: {'lr': 0.0004854922019310749, 'samples': 3480768, 'steps': 18128, 'loss/train': 1.9551997184753418} 11/06/2021 23:48:25 - INFO - __main__ - Step 18130: {'lr': 0.0004854904204017673, 'samples': 3480960, 'steps': 18129, 'loss/train': 1.7446837425231934} 11/06/2021 23:48:25 - INFO - __main__ - Step 18131: {'lr': 0.0004854886387663514, 'samples': 3481152, 'steps': 18130, 'loss/train': 1.8030678033828735} 11/06/2021 23:48:26 - INFO - __main__ - Step 18132: {'lr': 0.0004854868570248279, 'samples': 3481344, 'steps': 18131, 'loss/train': 1.1974247694015503} 11/06/2021 23:48:26 - INFO - __main__ - Step 18133: {'lr': 0.00048548507517719766, 'samples': 3481536, 'steps': 18132, 'loss/train': 1.9085829257965088} 11/06/2021 23:48:26 - INFO - __main__ - Step 18134: {'lr': 0.0004854832932234615, 'samples': 3481728, 'steps': 18133, 'loss/train': 2.1923718452453613} 11/06/2021 23:48:27 - INFO - __main__ - Step 18135: {'lr': 0.0004854815111636202, 'samples': 3481920, 'steps': 18134, 'loss/train': 1.6197718381881714} 11/06/2021 23:48:28 - INFO - __main__ - Step 18136: {'lr': 0.00048547972899767454, 'samples': 3482112, 'steps': 18135, 'loss/train': 1.999104380607605} 11/06/2021 23:48:28 - INFO - __main__ - Step 18137: {'lr': 0.0004854779467256254, 'samples': 3482304, 'steps': 18136, 'loss/train': 1.6554611921310425} 11/06/2021 23:48:28 - INFO - __main__ - Step 18138: {'lr': 0.00048547616434747344, 'samples': 3482496, 'steps': 18137, 'loss/train': 1.7606922388076782} 11/06/2021 23:48:29 - INFO - __main__ - Step 18139: {'lr': 0.0004854743818632196, 'samples': 3482688, 'steps': 18138, 'loss/train': 0.9624412655830383} 11/06/2021 23:48:30 - INFO - __main__ - Step 18140: {'lr': 0.0004854725992728647, 'samples': 3482880, 'steps': 18139, 'loss/train': 1.5731804370880127} 11/06/2021 23:48:30 - INFO - __main__ - Step 18141: {'lr': 0.00048547081657640935, 'samples': 3483072, 'steps': 18140, 'loss/train': 1.5403448343276978} 11/06/2021 23:48:30 - INFO - __main__ - Step 18142: {'lr': 0.00048546903377385457, 'samples': 3483264, 'steps': 18141, 'loss/train': 1.5687254667282104} 11/06/2021 23:48:31 - INFO - __main__ - Step 18143: {'lr': 0.00048546725086520107, 'samples': 3483456, 'steps': 18142, 'loss/train': 1.6424126625061035} 11/06/2021 23:48:31 - INFO - __main__ - Step 18144: {'lr': 0.00048546546785044965, 'samples': 3483648, 'steps': 18143, 'loss/train': 1.4351774454116821} 11/06/2021 23:48:32 - INFO - __main__ - Step 18145: {'lr': 0.00048546368472960114, 'samples': 3483840, 'steps': 18144, 'loss/train': 1.7238658666610718} 11/06/2021 23:48:33 - INFO - __main__ - Step 18146: {'lr': 0.00048546190150265634, 'samples': 3484032, 'steps': 18145, 'loss/train': 1.5560191869735718} 11/06/2021 23:48:33 - INFO - __main__ - Step 18147: {'lr': 0.00048546011816961597, 'samples': 3484224, 'steps': 18146, 'loss/train': 1.6255768537521362} 11/06/2021 23:48:33 - INFO - __main__ - Step 18148: {'lr': 0.00048545833473048094, 'samples': 3484416, 'steps': 18147, 'loss/train': 1.2934846878051758} 11/06/2021 23:48:34 - INFO - __main__ - Step 18149: {'lr': 0.00048545655118525206, 'samples': 3484608, 'steps': 18148, 'loss/train': 1.6034213304519653} 11/06/2021 23:48:34 - INFO - __main__ - Step 18150: {'lr': 0.00048545476753393004, 'samples': 3484800, 'steps': 18149, 'loss/train': 1.2205133438110352} 11/06/2021 23:48:35 - INFO - __main__ - Step 18151: {'lr': 0.0004854529837765158, 'samples': 3484992, 'steps': 18150, 'loss/train': 1.9434911012649536} 11/06/2021 23:48:35 - INFO - __main__ - Step 18152: {'lr': 0.00048545119991301, 'samples': 3485184, 'steps': 18151, 'loss/train': 1.9763034582138062} 11/06/2021 23:48:36 - INFO - __main__ - Step 18153: {'lr': 0.0004854494159434135, 'samples': 3485376, 'steps': 18152, 'loss/train': 1.8863459825515747} 11/06/2021 23:48:36 - INFO - __main__ - Step 18154: {'lr': 0.0004854476318677272, 'samples': 3485568, 'steps': 18153, 'loss/train': 1.7033888101577759} 11/06/2021 23:48:36 - INFO - __main__ - Step 18155: {'lr': 0.00048544584768595185, 'samples': 3485760, 'steps': 18154, 'loss/train': 1.6517353057861328} 11/06/2021 23:48:37 - INFO - __main__ - Step 18156: {'lr': 0.00048544406339808823, 'samples': 3485952, 'steps': 18155, 'loss/train': 1.6695486307144165} 11/06/2021 23:48:38 - INFO - __main__ - Step 18157: {'lr': 0.00048544227900413706, 'samples': 3486144, 'steps': 18156, 'loss/train': 1.761330008506775} 11/06/2021 23:48:38 - INFO - __main__ - Step 18158: {'lr': 0.0004854404945040993, 'samples': 3486336, 'steps': 18157, 'loss/train': 1.339409589767456} 11/06/2021 23:48:38 - INFO - __main__ - Step 18159: {'lr': 0.0004854387098979757, 'samples': 3486528, 'steps': 18158, 'loss/train': 1.1606993675231934} 11/06/2021 23:48:39 - INFO - __main__ - Step 18160: {'lr': 0.000485436925185767, 'samples': 3486720, 'steps': 18159, 'loss/train': 2.2796413898468018} 11/06/2021 23:48:40 - INFO - __main__ - Step 18161: {'lr': 0.00048543514036747404, 'samples': 3486912, 'steps': 18160, 'loss/train': 1.5005325078964233} 11/06/2021 23:48:40 - INFO - __main__ - Step 18162: {'lr': 0.00048543335544309776, 'samples': 3487104, 'steps': 18161, 'loss/train': 1.5606443881988525} 11/06/2021 23:48:41 - INFO - __main__ - Step 18163: {'lr': 0.00048543157041263876, 'samples': 3487296, 'steps': 18162, 'loss/train': 1.3241081237792969} 11/06/2021 23:48:41 - INFO - __main__ - Step 18164: {'lr': 0.0004854297852760979, 'samples': 3487488, 'steps': 18163, 'loss/train': 1.3601914644241333} 11/06/2021 23:48:41 - INFO - __main__ - Step 18165: {'lr': 0.000485428000033476, 'samples': 3487680, 'steps': 18164, 'loss/train': 1.6403565406799316} 11/06/2021 23:48:42 - INFO - __main__ - Step 18166: {'lr': 0.00048542621468477393, 'samples': 3487872, 'steps': 18165, 'loss/train': 1.740988850593567} 11/06/2021 23:48:43 - INFO - __main__ - Step 18167: {'lr': 0.0004854244292299924, 'samples': 3488064, 'steps': 18166, 'loss/train': 1.6933728456497192} 11/06/2021 23:48:43 - INFO - __main__ - Step 18168: {'lr': 0.0004854226436691323, 'samples': 3488256, 'steps': 18167, 'loss/train': 2.1075711250305176} 11/06/2021 23:48:43 - INFO - __main__ - Step 18169: {'lr': 0.0004854208580021944, 'samples': 3488448, 'steps': 18168, 'loss/train': 1.6747791767120361} 11/06/2021 23:48:44 - INFO - __main__ - Step 18170: {'lr': 0.00048541907222917946, 'samples': 3488640, 'steps': 18169, 'loss/train': 1.5540307760238647} 11/06/2021 23:48:45 - INFO - __main__ - Step 18171: {'lr': 0.0004854172863500883, 'samples': 3488832, 'steps': 18170, 'loss/train': 1.7926479578018188} 11/06/2021 23:48:45 - INFO - __main__ - Step 18172: {'lr': 0.00048541550036492175, 'samples': 3489024, 'steps': 18171, 'loss/train': 1.5982003211975098} 11/06/2021 23:48:45 - INFO - __main__ - Step 18173: {'lr': 0.00048541371427368064, 'samples': 3489216, 'steps': 18172, 'loss/train': 1.605031967163086} 11/06/2021 23:48:46 - INFO - __main__ - Step 18174: {'lr': 0.0004854119280763657, 'samples': 3489408, 'steps': 18173, 'loss/train': 2.03631329536438} 11/06/2021 23:48:46 - INFO - __main__ - Step 18175: {'lr': 0.00048541014177297783, 'samples': 3489600, 'steps': 18174, 'loss/train': 1.3513402938842773} 11/06/2021 23:48:46 - INFO - __main__ - Step 18176: {'lr': 0.0004854083553635178, 'samples': 3489792, 'steps': 18175, 'loss/train': 2.2017154693603516} 11/06/2021 23:48:48 - INFO - __main__ - Step 18177: {'lr': 0.00048540656884798626, 'samples': 3489984, 'steps': 18176, 'loss/train': 1.6572513580322266} 11/06/2021 23:48:48 - INFO - __main__ - Step 18178: {'lr': 0.0004854047822263843, 'samples': 3490176, 'steps': 18177, 'loss/train': 1.844321608543396} 11/06/2021 23:48:48 - INFO - __main__ - Step 18179: {'lr': 0.00048540299549871256, 'samples': 3490368, 'steps': 18178, 'loss/train': 1.7020295858383179} 11/06/2021 23:48:49 - INFO - __main__ - Step 18180: {'lr': 0.0004854012086649718, 'samples': 3490560, 'steps': 18179, 'loss/train': 0.9979807734489441} 11/06/2021 23:48:49 - INFO - __main__ - Step 18181: {'lr': 0.00048539942172516295, 'samples': 3490752, 'steps': 18180, 'loss/train': 1.2583417892456055} 11/06/2021 23:48:50 - INFO - __main__ - Step 18182: {'lr': 0.00048539763467928665, 'samples': 3490944, 'steps': 18181, 'loss/train': 1.8729602098464966} 11/06/2021 23:48:50 - INFO - __main__ - Step 18183: {'lr': 0.0004853958475273439, 'samples': 3491136, 'steps': 18182, 'loss/train': 1.4147400856018066} 11/06/2021 23:48:51 - INFO - __main__ - Step 18184: {'lr': 0.0004853940602693354, 'samples': 3491328, 'steps': 18183, 'loss/train': 1.536829948425293} 11/06/2021 23:48:51 - INFO - __main__ - Step 18185: {'lr': 0.00048539227290526194, 'samples': 3491520, 'steps': 18184, 'loss/train': 1.2431416511535645} 11/06/2021 23:48:51 - INFO - __main__ - Step 18186: {'lr': 0.00048539048543512443, 'samples': 3491712, 'steps': 18185, 'loss/train': 1.6529532670974731} 11/06/2021 23:48:52 - INFO - __main__ - Step 18187: {'lr': 0.0004853886978589235, 'samples': 3491904, 'steps': 18186, 'loss/train': 1.608051061630249} 11/06/2021 23:48:53 - INFO - __main__ - Step 18188: {'lr': 0.0004853869101766601, 'samples': 3492096, 'steps': 18187, 'loss/train': 1.734632134437561} 11/06/2021 23:48:53 - INFO - __main__ - Step 18189: {'lr': 0.000485385122388335, 'samples': 3492288, 'steps': 18188, 'loss/train': 1.8401646614074707} 11/06/2021 23:48:53 - INFO - __main__ - Step 18190: {'lr': 0.000485383334493949, 'samples': 3492480, 'steps': 18189, 'loss/train': 1.6987266540527344} 11/06/2021 23:48:54 - INFO - __main__ - Step 18191: {'lr': 0.00048538154649350286, 'samples': 3492672, 'steps': 18190, 'loss/train': 1.5722284317016602} 11/06/2021 23:48:55 - INFO - __main__ - Step 18192: {'lr': 0.00048537975838699744, 'samples': 3492864, 'steps': 18191, 'loss/train': 1.4871503114700317} 11/06/2021 23:48:55 - INFO - __main__ - Step 18193: {'lr': 0.0004853779701744335, 'samples': 3493056, 'steps': 18192, 'loss/train': 1.6807981729507446} 11/06/2021 23:48:56 - INFO - __main__ - Step 18194: {'lr': 0.000485376181855812, 'samples': 3493248, 'steps': 18193, 'loss/train': 1.6861616373062134} 11/06/2021 23:48:56 - INFO - __main__ - Step 18195: {'lr': 0.00048537439343113354, 'samples': 3493440, 'steps': 18194, 'loss/train': 1.648136854171753} 11/06/2021 23:48:56 - INFO - __main__ - Step 18196: {'lr': 0.000485372604900399, 'samples': 3493632, 'steps': 18195, 'loss/train': 2.172215223312378} 11/06/2021 23:48:57 - INFO - __main__ - Step 18197: {'lr': 0.0004853708162636092, 'samples': 3493824, 'steps': 18196, 'loss/train': 1.964274525642395} 11/06/2021 23:48:58 - INFO - __main__ - Step 18198: {'lr': 0.00048536902752076494, 'samples': 3494016, 'steps': 18197, 'loss/train': 1.6822305917739868} 11/06/2021 23:48:58 - INFO - __main__ - Step 18199: {'lr': 0.00048536723867186705, 'samples': 3494208, 'steps': 18198, 'loss/train': 1.7284069061279297} 11/06/2021 23:48:58 - INFO - __main__ - Step 18200: {'lr': 0.0004853654497169163, 'samples': 3494400, 'steps': 18199, 'loss/train': 1.4523322582244873} 11/06/2021 23:48:59 - INFO - __main__ - Step 18201: {'lr': 0.00048536366065591354, 'samples': 3494592, 'steps': 18200, 'loss/train': 1.2845325469970703} 11/06/2021 23:49:00 - INFO - __main__ - Step 18202: {'lr': 0.00048536187148885956, 'samples': 3494784, 'steps': 18201, 'loss/train': 1.542911171913147} 11/06/2021 23:49:00 - INFO - __main__ - Step 18203: {'lr': 0.0004853600822157551, 'samples': 3494976, 'steps': 18202, 'loss/train': 1.685224175453186} 11/06/2021 23:49:00 - INFO - __main__ - Step 18204: {'lr': 0.000485358292836601, 'samples': 3495168, 'steps': 18203, 'loss/train': 2.2215311527252197} 11/06/2021 23:49:01 - INFO - __main__ - Step 18205: {'lr': 0.0004853565033513982, 'samples': 3495360, 'steps': 18204, 'loss/train': 1.5674799680709839} 11/06/2021 23:49:01 - INFO - __main__ - Step 18206: {'lr': 0.0004853547137601473, 'samples': 3495552, 'steps': 18205, 'loss/train': 1.1350946426391602} 11/06/2021 23:49:01 - INFO - __main__ - Step 18207: {'lr': 0.0004853529240628493, 'samples': 3495744, 'steps': 18206, 'loss/train': 1.7013306617736816} 11/06/2021 23:49:02 - INFO - __main__ - Step 18208: {'lr': 0.00048535113425950474, 'samples': 3495936, 'steps': 18207, 'loss/train': 1.4981727600097656} 11/06/2021 23:49:03 - INFO - __main__ - Step 18209: {'lr': 0.0004853493443501147, 'samples': 3496128, 'steps': 18208, 'loss/train': 1.4328523874282837} 11/06/2021 23:49:03 - INFO - __main__ - Step 18210: {'lr': 0.0004853475543346798, 'samples': 3496320, 'steps': 18209, 'loss/train': 1.8389379978179932} 11/06/2021 23:49:04 - INFO - __main__ - Step 18211: {'lr': 0.000485345764213201, 'samples': 3496512, 'steps': 18210, 'loss/train': 1.694120168685913} 11/06/2021 23:49:04 - INFO - __main__ - Step 18212: {'lr': 0.00048534397398567895, 'samples': 3496704, 'steps': 18211, 'loss/train': 1.3739075660705566} 11/06/2021 23:49:05 - INFO - __main__ - Step 18213: {'lr': 0.00048534218365211456, 'samples': 3496896, 'steps': 18212, 'loss/train': 1.0421340465545654} 11/06/2021 23:49:05 - INFO - __main__ - Step 18214: {'lr': 0.0004853403932125087, 'samples': 3497088, 'steps': 18213, 'loss/train': 1.7056995630264282} 11/06/2021 23:49:06 - INFO - __main__ - Step 18215: {'lr': 0.00048533860266686203, 'samples': 3497280, 'steps': 18214, 'loss/train': 1.6891775131225586} 11/06/2021 23:49:06 - INFO - __main__ - Step 18216: {'lr': 0.0004853368120151754, 'samples': 3497472, 'steps': 18215, 'loss/train': 2.0772011280059814} 11/06/2021 23:49:06 - INFO - __main__ - Step 18217: {'lr': 0.00048533502125744967, 'samples': 3497664, 'steps': 18216, 'loss/train': 1.8108474016189575} 11/06/2021 23:49:07 - INFO - __main__ - Step 18218: {'lr': 0.0004853332303936856, 'samples': 3497856, 'steps': 18217, 'loss/train': 1.2649246454238892} 11/06/2021 23:49:08 - INFO - __main__ - Step 18219: {'lr': 0.000485331439423884, 'samples': 3498048, 'steps': 18218, 'loss/train': 1.977997899055481} 11/06/2021 23:49:08 - INFO - __main__ - Step 18220: {'lr': 0.00048532964834804566, 'samples': 3498240, 'steps': 18219, 'loss/train': 1.0719574689865112} 11/06/2021 23:49:08 - INFO - __main__ - Step 18221: {'lr': 0.00048532785716617145, 'samples': 3498432, 'steps': 18220, 'loss/train': 1.237172245979309} 11/06/2021 23:49:09 - INFO - __main__ - Step 18222: {'lr': 0.0004853260658782621, 'samples': 3498624, 'steps': 18221, 'loss/train': 2.3552660942077637} 11/06/2021 23:49:10 - INFO - __main__ - Step 18223: {'lr': 0.0004853242744843185, 'samples': 3498816, 'steps': 18222, 'loss/train': 1.9997755289077759} 11/06/2021 23:49:10 - INFO - __main__ - Step 18224: {'lr': 0.0004853224829843414, 'samples': 3499008, 'steps': 18223, 'loss/train': 1.524446964263916} 11/06/2021 23:49:10 - INFO - __main__ - Step 18225: {'lr': 0.00048532069137833156, 'samples': 3499200, 'steps': 18224, 'loss/train': 1.5553361177444458} 11/06/2021 23:49:11 - INFO - __main__ - Step 18226: {'lr': 0.00048531889966628997, 'samples': 3499392, 'steps': 18225, 'loss/train': 1.2492296695709229} 11/06/2021 23:49:11 - INFO - __main__ - Step 18227: {'lr': 0.00048531710784821726, 'samples': 3499584, 'steps': 18226, 'loss/train': 1.625084638595581} 11/06/2021 23:49:12 - INFO - __main__ - Step 18228: {'lr': 0.0004853153159241143, 'samples': 3499776, 'steps': 18227, 'loss/train': 2.811218500137329} 11/06/2021 23:49:12 - INFO - __main__ - Step 18229: {'lr': 0.0004853135238939818, 'samples': 3499968, 'steps': 18228, 'loss/train': 1.670412540435791} 11/06/2021 23:49:13 - INFO - __main__ - Step 18230: {'lr': 0.0004853117317578207, 'samples': 3500160, 'steps': 18229, 'loss/train': 1.9822686910629272} 11/06/2021 23:49:13 - INFO - __main__ - Step 18231: {'lr': 0.00048530993951563186, 'samples': 3500352, 'steps': 18230, 'loss/train': 1.3493831157684326} 11/06/2021 23:49:14 - INFO - __main__ - Step 18232: {'lr': 0.0004853081471674159, 'samples': 3500544, 'steps': 18231, 'loss/train': 1.558580994606018} 11/06/2021 23:49:14 - INFO - __main__ - Step 18233: {'lr': 0.00048530635471317373, 'samples': 3500736, 'steps': 18232, 'loss/train': 1.388440489768982} 11/06/2021 23:49:15 - INFO - __main__ - Step 18234: {'lr': 0.0004853045621529062, 'samples': 3500928, 'steps': 18233, 'loss/train': 1.910845160484314} 11/06/2021 23:49:15 - INFO - __main__ - Step 18235: {'lr': 0.000485302769486614, 'samples': 3501120, 'steps': 18234, 'loss/train': 0.8962222337722778} 11/06/2021 23:49:16 - INFO - __main__ - Step 18236: {'lr': 0.000485300976714298, 'samples': 3501312, 'steps': 18235, 'loss/train': 1.7857457399368286} 11/06/2021 23:49:16 - INFO - __main__ - Step 18237: {'lr': 0.00048529918383595906, 'samples': 3501504, 'steps': 18236, 'loss/train': 1.4741995334625244} 11/06/2021 23:49:16 - INFO - __main__ - Step 18238: {'lr': 0.0004852973908515979, 'samples': 3501696, 'steps': 18237, 'loss/train': 1.837266206741333} 11/06/2021 23:49:17 - INFO - __main__ - Step 18239: {'lr': 0.0004852955977612154, 'samples': 3501888, 'steps': 18238, 'loss/train': 1.7000993490219116} 11/06/2021 23:49:18 - INFO - __main__ - Step 18240: {'lr': 0.0004852938045648123, 'samples': 3502080, 'steps': 18239, 'loss/train': 1.7775219678878784} 11/06/2021 23:49:18 - INFO - __main__ - Step 18241: {'lr': 0.0004852920112623895, 'samples': 3502272, 'steps': 18240, 'loss/train': 1.721117615699768} 11/06/2021 23:49:18 - INFO - __main__ - Step 18242: {'lr': 0.00048529021785394765, 'samples': 3502464, 'steps': 18241, 'loss/train': 1.123333215713501} 11/06/2021 23:49:19 - INFO - __main__ - Step 18243: {'lr': 0.00048528842433948776, 'samples': 3502656, 'steps': 18242, 'loss/train': 2.0293686389923096} 11/06/2021 23:49:20 - INFO - __main__ - Step 18244: {'lr': 0.00048528663071901047, 'samples': 3502848, 'steps': 18243, 'loss/train': 1.9051157236099243} 11/06/2021 23:49:20 - INFO - __main__ - Step 18245: {'lr': 0.0004852848369925167, 'samples': 3503040, 'steps': 18244, 'loss/train': 1.5317625999450684} 11/06/2021 23:49:21 - INFO - __main__ - Step 18246: {'lr': 0.00048528304316000723, 'samples': 3503232, 'steps': 18245, 'loss/train': 1.8175814151763916} 11/06/2021 23:49:21 - INFO - __main__ - Step 18247: {'lr': 0.0004852812492214828, 'samples': 3503424, 'steps': 18246, 'loss/train': 1.8638849258422852} 11/06/2021 23:49:21 - INFO - __main__ - Step 18248: {'lr': 0.0004852794551769443, 'samples': 3503616, 'steps': 18247, 'loss/train': 1.8089745044708252} 11/06/2021 23:49:22 - INFO - __main__ - Step 18249: {'lr': 0.0004852776610263925, 'samples': 3503808, 'steps': 18248, 'loss/train': 1.6794166564941406} 11/06/2021 23:49:23 - INFO - __main__ - Step 18250: {'lr': 0.0004852758667698282, 'samples': 3504000, 'steps': 18249, 'loss/train': 1.3067717552185059} 11/06/2021 23:49:23 - INFO - __main__ - Step 18251: {'lr': 0.00048527407240725223, 'samples': 3504192, 'steps': 18250, 'loss/train': 2.015906810760498} 11/06/2021 23:49:23 - INFO - __main__ - Step 18252: {'lr': 0.0004852722779386654, 'samples': 3504384, 'steps': 18251, 'loss/train': 0.8979371786117554} 11/06/2021 23:49:24 - INFO - __main__ - Step 18253: {'lr': 0.00048527048336406855, 'samples': 3504576, 'steps': 18252, 'loss/train': 1.830647587776184} 11/06/2021 23:49:24 - INFO - __main__ - Step 18254: {'lr': 0.00048526868868346243, 'samples': 3504768, 'steps': 18253, 'loss/train': 1.9769741296768188} 11/06/2021 23:49:25 - INFO - __main__ - Step 18255: {'lr': 0.0004852668938968478, 'samples': 3504960, 'steps': 18254, 'loss/train': 1.7995449304580688} 11/06/2021 23:49:25 - INFO - __main__ - Step 18256: {'lr': 0.0004852650990042256, 'samples': 3505152, 'steps': 18255, 'loss/train': 1.704633116722107} 11/06/2021 23:49:26 - INFO - __main__ - Step 18257: {'lr': 0.0004852633040055966, 'samples': 3505344, 'steps': 18256, 'loss/train': 0.9128971695899963} 11/06/2021 23:49:26 - INFO - __main__ - Step 18258: {'lr': 0.00048526150890096153, 'samples': 3505536, 'steps': 18257, 'loss/train': 1.5316613912582397} 11/06/2021 23:49:26 - INFO - __main__ - Step 18259: {'lr': 0.0004852597136903213, 'samples': 3505728, 'steps': 18258, 'loss/train': 1.312677264213562} 11/06/2021 23:49:28 - INFO - __main__ - Step 18260: {'lr': 0.0004852579183736766, 'samples': 3505920, 'steps': 18259, 'loss/train': 1.4250472784042358} 11/06/2021 23:49:28 - INFO - __main__ - Step 18261: {'lr': 0.00048525612295102836, 'samples': 3506112, 'steps': 18260, 'loss/train': 1.6988489627838135} 11/06/2021 23:49:28 - INFO - __main__ - Step 18262: {'lr': 0.00048525432742237736, 'samples': 3506304, 'steps': 18261, 'loss/train': 1.2453324794769287} 11/06/2021 23:49:29 - INFO - __main__ - Step 18263: {'lr': 0.00048525253178772435, 'samples': 3506496, 'steps': 18262, 'loss/train': 0.8945806622505188} 11/06/2021 23:49:29 - INFO - __main__ - Step 18264: {'lr': 0.0004852507360470702, 'samples': 3506688, 'steps': 18263, 'loss/train': 1.7507184743881226} 11/06/2021 23:49:30 - INFO - __main__ - Step 18265: {'lr': 0.0004852489402004157, 'samples': 3506880, 'steps': 18264, 'loss/train': 1.6482011079788208} 11/06/2021 23:49:30 - INFO - __main__ - Step 18266: {'lr': 0.0004852471442477617, 'samples': 3507072, 'steps': 18265, 'loss/train': 1.300549864768982} 11/06/2021 23:49:31 - INFO - __main__ - Step 18267: {'lr': 0.0004852453481891089, 'samples': 3507264, 'steps': 18266, 'loss/train': 1.42025887966156} 11/06/2021 23:49:31 - INFO - __main__ - Step 18268: {'lr': 0.00048524355202445827, 'samples': 3507456, 'steps': 18267, 'loss/train': 1.4363129138946533} 11/06/2021 23:49:31 - INFO - __main__ - Step 18269: {'lr': 0.0004852417557538104, 'samples': 3507648, 'steps': 18268, 'loss/train': 1.8137376308441162} 11/06/2021 23:49:32 - INFO - __main__ - Step 18270: {'lr': 0.00048523995937716625, 'samples': 3507840, 'steps': 18269, 'loss/train': 1.8509377241134644} 11/06/2021 23:49:33 - INFO - __main__ - Step 18271: {'lr': 0.0004852381628945267, 'samples': 3508032, 'steps': 18270, 'loss/train': 1.3168461322784424} 11/06/2021 23:49:33 - INFO - __main__ - Step 18272: {'lr': 0.0004852363663058924, 'samples': 3508224, 'steps': 18271, 'loss/train': 1.526947259902954} 11/06/2021 23:49:33 - INFO - __main__ - Step 18273: {'lr': 0.0004852345696112642, 'samples': 3508416, 'steps': 18272, 'loss/train': 1.773221492767334} 11/06/2021 23:49:34 - INFO - __main__ - Step 18274: {'lr': 0.00048523277281064295, 'samples': 3508608, 'steps': 18273, 'loss/train': 1.244134545326233} 11/06/2021 23:49:35 - INFO - __main__ - Step 18275: {'lr': 0.0004852309759040294, 'samples': 3508800, 'steps': 18274, 'loss/train': 1.7051624059677124} 11/06/2021 23:49:35 - INFO - __main__ - Step 18276: {'lr': 0.00048522917889142446, 'samples': 3508992, 'steps': 18275, 'loss/train': 1.3719818592071533} 11/06/2021 23:49:36 - INFO - __main__ - Step 18277: {'lr': 0.00048522738177282887, 'samples': 3509184, 'steps': 18276, 'loss/train': 1.7115142345428467} 11/06/2021 23:49:36 - INFO - __main__ - Step 18278: {'lr': 0.0004852255845482435, 'samples': 3509376, 'steps': 18277, 'loss/train': 1.9277666807174683} 11/06/2021 23:49:36 - INFO - __main__ - Step 18279: {'lr': 0.0004852237872176691, 'samples': 3509568, 'steps': 18278, 'loss/train': 1.5939767360687256} 11/06/2021 23:49:37 - INFO - __main__ - Step 18280: {'lr': 0.00048522198978110645, 'samples': 3509760, 'steps': 18279, 'loss/train': 1.7465685606002808} 11/06/2021 23:49:38 - INFO - __main__ - Step 18281: {'lr': 0.0004852201922385564, 'samples': 3509952, 'steps': 18280, 'loss/train': 1.4472709894180298} 11/06/2021 23:49:38 - INFO - __main__ - Step 18282: {'lr': 0.00048521839459001977, 'samples': 3510144, 'steps': 18281, 'loss/train': 4.651975154876709} 11/06/2021 23:49:38 - INFO - __main__ - Step 18283: {'lr': 0.0004852165968354973, 'samples': 3510336, 'steps': 18282, 'loss/train': 1.702797532081604} 11/06/2021 23:49:39 - INFO - __main__ - Step 18284: {'lr': 0.00048521479897499, 'samples': 3510528, 'steps': 18283, 'loss/train': 0.8926244378089905} 11/06/2021 23:49:39 - INFO - __main__ - Step 18285: {'lr': 0.0004852130010084984, 'samples': 3510720, 'steps': 18284, 'loss/train': 1.6344717741012573} 11/06/2021 23:49:40 - INFO - __main__ - Step 18286: {'lr': 0.0004852112029360235, 'samples': 3510912, 'steps': 18285, 'loss/train': 1.0488026142120361} 11/06/2021 23:49:41 - INFO - __main__ - Step 18287: {'lr': 0.0004852094047575661, 'samples': 3511104, 'steps': 18286, 'loss/train': 0.9467637538909912} 11/06/2021 23:49:41 - INFO - __main__ - Step 18288: {'lr': 0.00048520760647312696, 'samples': 3511296, 'steps': 18287, 'loss/train': 1.3265576362609863} 11/06/2021 23:49:41 - INFO - __main__ - Step 18289: {'lr': 0.00048520580808270687, 'samples': 3511488, 'steps': 18288, 'loss/train': 1.6178340911865234} 11/06/2021 23:49:42 - INFO - __main__ - Step 18290: {'lr': 0.0004852040095863067, 'samples': 3511680, 'steps': 18289, 'loss/train': 1.1068137884140015} 11/06/2021 23:49:43 - INFO - __main__ - Step 18291: {'lr': 0.0004852022109839273, 'samples': 3511872, 'steps': 18290, 'loss/train': 1.9344698190689087} 11/06/2021 23:49:43 - INFO - __main__ - Step 18292: {'lr': 0.0004852004122755693, 'samples': 3512064, 'steps': 18291, 'loss/train': 1.2475080490112305} 11/06/2021 23:49:43 - INFO - __main__ - Step 18293: {'lr': 0.00048519861346123363, 'samples': 3512256, 'steps': 18292, 'loss/train': 1.8506364822387695} 11/06/2021 23:49:44 - INFO - __main__ - Step 18294: {'lr': 0.0004851968145409211, 'samples': 3512448, 'steps': 18293, 'loss/train': 1.8689836263656616} 11/06/2021 23:49:44 - INFO - __main__ - Step 18295: {'lr': 0.00048519501551463255, 'samples': 3512640, 'steps': 18294, 'loss/train': 1.9257316589355469} 11/06/2021 23:49:45 - INFO - __main__ - Step 18296: {'lr': 0.0004851932163823688, 'samples': 3512832, 'steps': 18295, 'loss/train': 1.4464284181594849} 11/06/2021 23:49:45 - INFO - __main__ - Step 18297: {'lr': 0.0004851914171441305, 'samples': 3513024, 'steps': 18296, 'loss/train': 1.7276997566223145} 11/06/2021 23:49:46 - INFO - __main__ - Step 18298: {'lr': 0.00048518961779991866, 'samples': 3513216, 'steps': 18297, 'loss/train': 2.2795534133911133} 11/06/2021 23:49:46 - INFO - __main__ - Step 18299: {'lr': 0.00048518781834973405, 'samples': 3513408, 'steps': 18298, 'loss/train': 1.8023676872253418} 11/06/2021 23:49:46 - INFO - __main__ - Step 18300: {'lr': 0.0004851860187935773, 'samples': 3513600, 'steps': 18299, 'loss/train': 1.8043395280838013} 11/06/2021 23:49:47 - INFO - __main__ - Step 18301: {'lr': 0.0004851842191314494, 'samples': 3513792, 'steps': 18300, 'loss/train': 1.7995017766952515} 11/06/2021 23:49:48 - INFO - __main__ - Step 18302: {'lr': 0.0004851824193633512, 'samples': 3513984, 'steps': 18301, 'loss/train': 1.2116223573684692} 11/06/2021 23:49:48 - INFO - __main__ - Step 18303: {'lr': 0.00048518061948928337, 'samples': 3514176, 'steps': 18302, 'loss/train': 1.6613613367080688} 11/06/2021 23:49:48 - INFO - __main__ - Step 18304: {'lr': 0.0004851788195092468, 'samples': 3514368, 'steps': 18303, 'loss/train': 1.6209909915924072} 11/06/2021 23:49:49 - INFO - __main__ - Step 18305: {'lr': 0.00048517701942324225, 'samples': 3514560, 'steps': 18304, 'loss/train': 1.4889389276504517} 11/06/2021 23:49:50 - INFO - __main__ - Step 18306: {'lr': 0.00048517521923127063, 'samples': 3514752, 'steps': 18305, 'loss/train': 1.5987910032272339} 11/06/2021 23:49:50 - INFO - __main__ - Step 18307: {'lr': 0.00048517341893333267, 'samples': 3514944, 'steps': 18306, 'loss/train': 1.4218535423278809} 11/06/2021 23:49:51 - INFO - __main__ - Step 18308: {'lr': 0.0004851716185294291, 'samples': 3515136, 'steps': 18307, 'loss/train': 1.3307141065597534} 11/06/2021 23:49:51 - INFO - __main__ - Step 18309: {'lr': 0.00048516981801956097, 'samples': 3515328, 'steps': 18308, 'loss/train': 1.0911080837249756} 11/06/2021 23:49:51 - INFO - __main__ - Step 18310: {'lr': 0.00048516801740372886, 'samples': 3515520, 'steps': 18309, 'loss/train': 1.5803437232971191} 11/06/2021 23:49:52 - INFO - __main__ - Step 18311: {'lr': 0.0004851662166819337, 'samples': 3515712, 'steps': 18310, 'loss/train': 1.7632087469100952} 11/06/2021 23:49:53 - INFO - __main__ - Step 18312: {'lr': 0.00048516441585417624, 'samples': 3515904, 'steps': 18311, 'loss/train': 1.3670587539672852} 11/06/2021 23:49:53 - INFO - __main__ - Step 18313: {'lr': 0.0004851626149204573, 'samples': 3516096, 'steps': 18312, 'loss/train': 1.3371531963348389} 11/06/2021 23:49:53 - INFO - __main__ - Step 18314: {'lr': 0.0004851608138807778, 'samples': 3516288, 'steps': 18313, 'loss/train': 1.4870017766952515} 11/06/2021 23:49:54 - INFO - __main__ - Step 18315: {'lr': 0.0004851590127351384, 'samples': 3516480, 'steps': 18314, 'loss/train': 1.754389762878418} 11/06/2021 23:49:54 - INFO - __main__ - Step 18316: {'lr': 0.0004851572114835401, 'samples': 3516672, 'steps': 18315, 'loss/train': 1.351464867591858} 11/06/2021 23:49:55 - INFO - __main__ - Step 18317: {'lr': 0.0004851554101259834, 'samples': 3516864, 'steps': 18316, 'loss/train': 1.2993682622909546} 11/06/2021 23:49:56 - INFO - __main__ - Step 18318: {'lr': 0.00048515360866246943, 'samples': 3517056, 'steps': 18317, 'loss/train': 1.4569814205169678} 11/06/2021 23:49:56 - INFO - __main__ - Step 18319: {'lr': 0.00048515180709299884, 'samples': 3517248, 'steps': 18318, 'loss/train': 1.4474055767059326} 11/06/2021 23:49:56 - INFO - __main__ - Step 18320: {'lr': 0.0004851500054175725, 'samples': 3517440, 'steps': 18319, 'loss/train': 1.655856966972351} 11/06/2021 23:49:57 - INFO - __main__ - Step 18321: {'lr': 0.00048514820363619116, 'samples': 3517632, 'steps': 18320, 'loss/train': 1.7874832153320312} 11/06/2021 23:49:58 - INFO - __main__ - Step 18322: {'lr': 0.0004851464017488556, 'samples': 3517824, 'steps': 18321, 'loss/train': 1.4972641468048096} 11/06/2021 23:49:58 - INFO - __main__ - Step 18323: {'lr': 0.0004851445997555668, 'samples': 3518016, 'steps': 18322, 'loss/train': 1.730514407157898} 11/06/2021 23:49:59 - INFO - __main__ - Step 18324: {'lr': 0.00048514279765632547, 'samples': 3518208, 'steps': 18323, 'loss/train': 1.7246754169464111} 11/06/2021 23:49:59 - INFO - __main__ - Step 18325: {'lr': 0.0004851409954511324, 'samples': 3518400, 'steps': 18324, 'loss/train': 1.5894545316696167} 11/06/2021 23:49:59 - INFO - __main__ - Step 18326: {'lr': 0.0004851391931399884, 'samples': 3518592, 'steps': 18325, 'loss/train': 1.7464122772216797} 11/06/2021 23:50:00 - INFO - __main__ - Step 18327: {'lr': 0.0004851373907228943, 'samples': 3518784, 'steps': 18326, 'loss/train': 0.5367693901062012} 11/06/2021 23:50:01 - INFO - __main__ - Step 18328: {'lr': 0.00048513558819985106, 'samples': 3518976, 'steps': 18327, 'loss/train': 1.6977030038833618} 11/06/2021 23:50:01 - INFO - __main__ - Step 18329: {'lr': 0.0004851337855708592, 'samples': 3519168, 'steps': 18328, 'loss/train': 2.0396475791931152} 11/06/2021 23:50:01 - INFO - __main__ - Step 18330: {'lr': 0.0004851319828359198, 'samples': 3519360, 'steps': 18329, 'loss/train': 1.1017811298370361} 11/06/2021 23:50:02 - INFO - __main__ - Step 18331: {'lr': 0.0004851301799950334, 'samples': 3519552, 'steps': 18330, 'loss/train': 1.9549344778060913} 11/06/2021 23:50:03 - INFO - __main__ - Step 18332: {'lr': 0.00048512837704820107, 'samples': 3519744, 'steps': 18331, 'loss/train': 1.0040029287338257} 11/06/2021 23:50:03 - INFO - __main__ - Step 18333: {'lr': 0.00048512657399542346, 'samples': 3519936, 'steps': 18332, 'loss/train': 1.3319391012191772} 11/06/2021 23:50:04 - INFO - __main__ - Step 18334: {'lr': 0.0004851247708367015, 'samples': 3520128, 'steps': 18333, 'loss/train': 2.4227659702301025} 11/06/2021 23:50:04 - INFO - __main__ - Step 18335: {'lr': 0.000485122967572036, 'samples': 3520320, 'steps': 18334, 'loss/train': 1.6032307147979736} 11/06/2021 23:50:04 - INFO - __main__ - Step 18336: {'lr': 0.0004851211642014276, 'samples': 3520512, 'steps': 18335, 'loss/train': 1.2883901596069336} 11/06/2021 23:50:05 - INFO - __main__ - Step 18337: {'lr': 0.0004851193607248773, 'samples': 3520704, 'steps': 18336, 'loss/train': 1.7592421770095825} 11/06/2021 23:50:06 - INFO - __main__ - Step 18338: {'lr': 0.00048511755714238585, 'samples': 3520896, 'steps': 18337, 'loss/train': 1.8046785593032837} 11/06/2021 23:50:06 - INFO - __main__ - Step 18339: {'lr': 0.0004851157534539541, 'samples': 3521088, 'steps': 18338, 'loss/train': 1.5575534105300903} 11/06/2021 23:50:07 - INFO - __main__ - Step 18340: {'lr': 0.0004851139496595827, 'samples': 3521280, 'steps': 18339, 'loss/train': 2.251190423965454} 11/06/2021 23:50:07 - INFO - __main__ - Step 18341: {'lr': 0.00048511214575927265, 'samples': 3521472, 'steps': 18340, 'loss/train': 2.2171289920806885} 11/06/2021 23:50:07 - INFO - __main__ - Step 18342: {'lr': 0.0004851103417530247, 'samples': 3521664, 'steps': 18341, 'loss/train': 1.7757937908172607} 11/06/2021 23:50:08 - INFO - __main__ - Step 18343: {'lr': 0.0004851085376408396, 'samples': 3521856, 'steps': 18342, 'loss/train': 2.004495143890381} 11/06/2021 23:50:09 - INFO - __main__ - Step 18344: {'lr': 0.0004851067334227183, 'samples': 3522048, 'steps': 18343, 'loss/train': 2.009399175643921} 11/06/2021 23:50:10 - INFO - __main__ - Step 18345: {'lr': 0.0004851049290986615, 'samples': 3522240, 'steps': 18344, 'loss/train': 1.5748186111450195} 11/06/2021 23:50:10 - INFO - __main__ - Step 18346: {'lr': 0.00048510312466867, 'samples': 3522432, 'steps': 18345, 'loss/train': 1.7785025835037231} 11/06/2021 23:50:10 - INFO - __main__ - Step 18347: {'lr': 0.0004851013201327448, 'samples': 3522624, 'steps': 18346, 'loss/train': 1.7600922584533691} 11/06/2021 23:50:11 - INFO - __main__ - Step 18348: {'lr': 0.0004850995154908864, 'samples': 3522816, 'steps': 18347, 'loss/train': 1.7795250415802002} 11/06/2021 23:50:11 - INFO - __main__ - Step 18349: {'lr': 0.0004850977107430959, 'samples': 3523008, 'steps': 18348, 'loss/train': 1.8570597171783447} 11/06/2021 23:50:12 - INFO - __main__ - Step 18350: {'lr': 0.000485095905889374, 'samples': 3523200, 'steps': 18349, 'loss/train': 1.9649345874786377} 11/06/2021 23:50:12 - INFO - __main__ - Step 18351: {'lr': 0.00048509410092972144, 'samples': 3523392, 'steps': 18350, 'loss/train': 1.5948915481567383} 11/06/2021 23:50:13 - INFO - __main__ - Step 18352: {'lr': 0.0004850922958641392, 'samples': 3523584, 'steps': 18351, 'loss/train': 1.2234508991241455} 11/06/2021 23:50:13 - INFO - __main__ - Step 18353: {'lr': 0.0004850904906926279, 'samples': 3523776, 'steps': 18352, 'loss/train': 1.9055534601211548} 11/06/2021 23:50:14 - INFO - __main__ - Step 18354: {'lr': 0.0004850886854151885, 'samples': 3523968, 'steps': 18353, 'loss/train': 1.0320509672164917} 11/06/2021 23:50:14 - INFO - __main__ - Step 18355: {'lr': 0.0004850868800318218, 'samples': 3524160, 'steps': 18354, 'loss/train': 0.8012345433235168} 11/06/2021 23:50:15 - INFO - __main__ - Step 18356: {'lr': 0.00048508507454252846, 'samples': 3524352, 'steps': 18355, 'loss/train': 1.7290698289871216} 11/06/2021 23:50:15 - INFO - __main__ - Step 18357: {'lr': 0.00048508326894730955, 'samples': 3524544, 'steps': 18356, 'loss/train': 1.7813465595245361} 11/06/2021 23:50:16 - INFO - __main__ - Step 18358: {'lr': 0.00048508146324616566, 'samples': 3524736, 'steps': 18357, 'loss/train': 1.5460205078125} 11/06/2021 23:50:16 - INFO - __main__ - Step 18359: {'lr': 0.0004850796574390977, 'samples': 3524928, 'steps': 18358, 'loss/train': 2.392427682876587} 11/06/2021 23:50:16 - INFO - __main__ - Step 18360: {'lr': 0.0004850778515261065, 'samples': 3525120, 'steps': 18359, 'loss/train': 1.592339038848877} 11/06/2021 23:50:17 - INFO - __main__ - Step 18361: {'lr': 0.0004850760455071929, 'samples': 3525312, 'steps': 18360, 'loss/train': 1.4301173686981201} 11/06/2021 23:50:18 - INFO - __main__ - Step 18362: {'lr': 0.0004850742393823576, 'samples': 3525504, 'steps': 18361, 'loss/train': 2.1455750465393066} 11/06/2021 23:50:18 - INFO - __main__ - Step 18363: {'lr': 0.0004850724331516014, 'samples': 3525696, 'steps': 18362, 'loss/train': 1.851510763168335} 11/06/2021 23:50:18 - INFO - __main__ - Step 18364: {'lr': 0.0004850706268149253, 'samples': 3525888, 'steps': 18363, 'loss/train': 1.5681486129760742} 11/06/2021 23:50:19 - INFO - __main__ - Step 18365: {'lr': 0.00048506882037233, 'samples': 3526080, 'steps': 18364, 'loss/train': 1.4631696939468384} 11/06/2021 23:50:20 - INFO - __main__ - Step 18366: {'lr': 0.0004850670138238162, 'samples': 3526272, 'steps': 18365, 'loss/train': 1.7207934856414795} 11/06/2021 23:50:20 - INFO - __main__ - Step 18367: {'lr': 0.00048506520716938496, 'samples': 3526464, 'steps': 18366, 'loss/train': 1.616496205329895} 11/06/2021 23:50:21 - INFO - __main__ - Step 18368: {'lr': 0.00048506340040903697, 'samples': 3526656, 'steps': 18367, 'loss/train': 1.8938003778457642} 11/06/2021 23:50:21 - INFO - __main__ - Step 18369: {'lr': 0.00048506159354277294, 'samples': 3526848, 'steps': 18368, 'loss/train': 1.2933012247085571} 11/06/2021 23:50:21 - INFO - __main__ - Step 18370: {'lr': 0.00048505978657059385, 'samples': 3527040, 'steps': 18369, 'loss/train': 1.8358917236328125} 11/06/2021 23:50:22 - INFO - __main__ - Step 18371: {'lr': 0.0004850579794925004, 'samples': 3527232, 'steps': 18370, 'loss/train': 1.0218453407287598} 11/06/2021 23:50:23 - INFO - __main__ - Step 18372: {'lr': 0.0004850561723084935, 'samples': 3527424, 'steps': 18371, 'loss/train': 1.8021912574768066} 11/06/2021 23:50:23 - INFO - __main__ - Step 18373: {'lr': 0.0004850543650185739, 'samples': 3527616, 'steps': 18372, 'loss/train': 1.755092978477478} 11/06/2021 23:50:23 - INFO - __main__ - Step 18374: {'lr': 0.0004850525576227425, 'samples': 3527808, 'steps': 18373, 'loss/train': 1.9047927856445312} 11/06/2021 23:50:24 - INFO - __main__ - Step 18375: {'lr': 0.000485050750121, 'samples': 3528000, 'steps': 18374, 'loss/train': 1.681142807006836} 11/06/2021 23:50:24 - INFO - __main__ - Step 18376: {'lr': 0.0004850489425133472, 'samples': 3528192, 'steps': 18375, 'loss/train': 2.1112258434295654} 11/06/2021 23:50:25 - INFO - __main__ - Step 18377: {'lr': 0.000485047134799785, 'samples': 3528384, 'steps': 18376, 'loss/train': 1.5793359279632568} 11/06/2021 23:50:25 - INFO - __main__ - Step 18378: {'lr': 0.00048504532698031416, 'samples': 3528576, 'steps': 18377, 'loss/train': 1.8018134832382202} 11/06/2021 23:50:26 - INFO - __main__ - Step 18379: {'lr': 0.0004850435190549356, 'samples': 3528768, 'steps': 18378, 'loss/train': 1.5398601293563843} 11/06/2021 23:50:26 - INFO - __main__ - Step 18380: {'lr': 0.00048504171102365, 'samples': 3528960, 'steps': 18379, 'loss/train': 2.1345901489257812} 11/06/2021 23:50:26 - INFO - __main__ - Step 18381: {'lr': 0.0004850399028864583, 'samples': 3529152, 'steps': 18380, 'loss/train': 1.6549445390701294} 11/06/2021 23:50:27 - INFO - __main__ - Step 18382: {'lr': 0.0004850380946433611, 'samples': 3529344, 'steps': 18381, 'loss/train': 1.5185497999191284} 11/06/2021 23:50:28 - INFO - __main__ - Step 18383: {'lr': 0.00048503628629435947, 'samples': 3529536, 'steps': 18382, 'loss/train': 0.985446572303772} 11/06/2021 23:50:28 - INFO - __main__ - Step 18384: {'lr': 0.0004850344778394541, 'samples': 3529728, 'steps': 18383, 'loss/train': 1.2629421949386597} 11/06/2021 23:50:29 - INFO - __main__ - Step 18385: {'lr': 0.0004850326692786459, 'samples': 3529920, 'steps': 18384, 'loss/train': 1.4181469678878784} 11/06/2021 23:50:29 - INFO - __main__ - Step 18386: {'lr': 0.00048503086061193546, 'samples': 3530112, 'steps': 18385, 'loss/train': 1.1277774572372437} 11/06/2021 23:50:30 - INFO - __main__ - Step 18387: {'lr': 0.0004850290518393238, 'samples': 3530304, 'steps': 18386, 'loss/train': 1.889421820640564} 11/06/2021 23:50:30 - INFO - __main__ - Step 18388: {'lr': 0.0004850272429608117, 'samples': 3530496, 'steps': 18387, 'loss/train': 1.9355649948120117} 11/06/2021 23:50:31 - INFO - __main__ - Step 18389: {'lr': 0.0004850254339764, 'samples': 3530688, 'steps': 18388, 'loss/train': 1.574332356452942} 11/06/2021 23:50:31 - INFO - __main__ - Step 18390: {'lr': 0.00048502362488608933, 'samples': 3530880, 'steps': 18389, 'loss/train': 1.6066454648971558} 11/06/2021 23:50:31 - INFO - __main__ - Step 18391: {'lr': 0.0004850218156898807, 'samples': 3531072, 'steps': 18390, 'loss/train': 1.316588044166565} 11/06/2021 23:50:32 - INFO - __main__ - Step 18392: {'lr': 0.00048502000638777487, 'samples': 3531264, 'steps': 18391, 'loss/train': 1.8195085525512695} 11/06/2021 23:50:33 - INFO - __main__ - Step 18393: {'lr': 0.0004850181969797727, 'samples': 3531456, 'steps': 18392, 'loss/train': 2.008401870727539} 11/06/2021 23:50:33 - INFO - __main__ - Step 18394: {'lr': 0.00048501638746587493, 'samples': 3531648, 'steps': 18393, 'loss/train': 1.5043723583221436} 11/06/2021 23:50:33 - INFO - __main__ - Step 18395: {'lr': 0.0004850145778460824, 'samples': 3531840, 'steps': 18394, 'loss/train': 1.7984856367111206} 11/06/2021 23:50:34 - INFO - __main__ - Step 18396: {'lr': 0.00048501276812039585, 'samples': 3532032, 'steps': 18395, 'loss/train': 1.2785509824752808} 11/06/2021 23:50:34 - INFO - __main__ - Step 18397: {'lr': 0.00048501095828881627, 'samples': 3532224, 'steps': 18396, 'loss/train': 2.095855951309204} 11/06/2021 23:50:35 - INFO - __main__ - Step 18398: {'lr': 0.00048500914835134434, 'samples': 3532416, 'steps': 18397, 'loss/train': 0.8654271364212036} 11/06/2021 23:50:36 - INFO - __main__ - Step 18399: {'lr': 0.00048500733830798094, 'samples': 3532608, 'steps': 18398, 'loss/train': 1.7500832080841064} 11/06/2021 23:50:36 - INFO - __main__ - Step 18400: {'lr': 0.00048500552815872687, 'samples': 3532800, 'steps': 18399, 'loss/train': 1.6068241596221924} 11/06/2021 23:50:36 - INFO - __main__ - Step 18401: {'lr': 0.0004850037179035829, 'samples': 3532992, 'steps': 18400, 'loss/train': 1.3907315731048584} 11/06/2021 23:50:37 - INFO - __main__ - Step 18402: {'lr': 0.00048500190754254994, 'samples': 3533184, 'steps': 18401, 'loss/train': 1.8806012868881226} 11/06/2021 23:50:38 - INFO - __main__ - Step 18403: {'lr': 0.00048500009707562865, 'samples': 3533376, 'steps': 18402, 'loss/train': 1.9587904214859009} 11/06/2021 23:50:38 - INFO - __main__ - Step 18404: {'lr': 0.00048499828650281994, 'samples': 3533568, 'steps': 18403, 'loss/train': 1.476645588874817} 11/06/2021 23:50:38 - INFO - __main__ - Step 18405: {'lr': 0.00048499647582412475, 'samples': 3533760, 'steps': 18404, 'loss/train': 1.2877929210662842} 11/06/2021 23:50:39 - INFO - __main__ - Step 18406: {'lr': 0.0004849946650395437, 'samples': 3533952, 'steps': 18405, 'loss/train': 1.7419734001159668} 11/06/2021 23:50:39 - INFO - __main__ - Step 18407: {'lr': 0.0004849928541490777, 'samples': 3534144, 'steps': 18406, 'loss/train': 1.5446574687957764} 11/06/2021 23:50:40 - INFO - __main__ - Step 18408: {'lr': 0.0004849910431527275, 'samples': 3534336, 'steps': 18407, 'loss/train': 1.6804839372634888} 11/06/2021 23:50:41 - INFO - __main__ - Step 18409: {'lr': 0.000484989232050494, 'samples': 3534528, 'steps': 18408, 'loss/train': 1.753796935081482} 11/06/2021 23:50:41 - INFO - __main__ - Step 18410: {'lr': 0.00048498742084237796, 'samples': 3534720, 'steps': 18409, 'loss/train': 1.7677087783813477} 11/06/2021 23:50:41 - INFO - __main__ - Step 18411: {'lr': 0.00048498560952838025, 'samples': 3534912, 'steps': 18410, 'loss/train': 1.3275538682937622} 11/06/2021 23:50:42 - INFO - __main__ - Step 18412: {'lr': 0.00048498379810850157, 'samples': 3535104, 'steps': 18411, 'loss/train': 1.63711416721344} 11/06/2021 23:50:43 - INFO - __main__ - Step 18413: {'lr': 0.0004849819865827429, 'samples': 3535296, 'steps': 18412, 'loss/train': 1.4949513673782349} 11/06/2021 23:50:43 - INFO - __main__ - Step 18414: {'lr': 0.0004849801749511049, 'samples': 3535488, 'steps': 18413, 'loss/train': 1.6665228605270386} 11/06/2021 23:50:43 - INFO - __main__ - Step 18415: {'lr': 0.00048497836321358855, 'samples': 3535680, 'steps': 18414, 'loss/train': 1.7287325859069824} 11/06/2021 23:50:44 - INFO - __main__ - Step 18416: {'lr': 0.00048497655137019454, 'samples': 3535872, 'steps': 18415, 'loss/train': 1.747482180595398} 11/06/2021 23:50:44 - INFO - __main__ - Step 18417: {'lr': 0.0004849747394209237, 'samples': 3536064, 'steps': 18416, 'loss/train': 1.5815653800964355} 11/06/2021 23:50:45 - INFO - __main__ - Step 18418: {'lr': 0.00048497292736577685, 'samples': 3536256, 'steps': 18417, 'loss/train': 1.05122971534729} 11/06/2021 23:50:45 - INFO - __main__ - Step 18419: {'lr': 0.0004849711152047549, 'samples': 3536448, 'steps': 18418, 'loss/train': 1.965767502784729} 11/06/2021 23:50:46 - INFO - __main__ - Step 18420: {'lr': 0.0004849693029378585, 'samples': 3536640, 'steps': 18419, 'loss/train': 1.3536920547485352} 11/06/2021 23:50:46 - INFO - __main__ - Step 18421: {'lr': 0.0004849674905650886, 'samples': 3536832, 'steps': 18420, 'loss/train': 1.2444932460784912} 11/06/2021 23:50:46 - INFO - __main__ - Step 18422: {'lr': 0.000484965678086446, 'samples': 3537024, 'steps': 18421, 'loss/train': 2.4452176094055176} 11/06/2021 23:50:47 - INFO - __main__ - Step 18423: {'lr': 0.0004849638655019315, 'samples': 3537216, 'steps': 18422, 'loss/train': 1.608303189277649} 11/06/2021 23:50:48 - INFO - __main__ - Step 18424: {'lr': 0.0004849620528115458, 'samples': 3537408, 'steps': 18423, 'loss/train': 1.6960731744766235} 11/06/2021 23:50:48 - INFO - __main__ - Step 18425: {'lr': 0.0004849602400152899, 'samples': 3537600, 'steps': 18424, 'loss/train': 2.0159389972686768} 11/06/2021 23:50:49 - INFO - __main__ - Step 18426: {'lr': 0.0004849584271131646, 'samples': 3537792, 'steps': 18425, 'loss/train': 2.1302671432495117} 11/06/2021 23:50:49 - INFO - __main__ - Step 18427: {'lr': 0.00048495661410517056, 'samples': 3537984, 'steps': 18426, 'loss/train': 1.5923632383346558} 11/06/2021 23:50:49 - INFO - __main__ - Step 18428: {'lr': 0.0004849548009913087, 'samples': 3538176, 'steps': 18427, 'loss/train': 1.6678215265274048} 11/06/2021 23:50:50 - INFO - __main__ - Step 18429: {'lr': 0.00048495298777157994, 'samples': 3538368, 'steps': 18428, 'loss/train': 1.6185996532440186} 11/06/2021 23:50:50 - INFO - __main__ - Step 18430: {'lr': 0.0004849511744459849, 'samples': 3538560, 'steps': 18429, 'loss/train': 1.3633182048797607} 11/06/2021 23:50:51 - INFO - __main__ - Step 18431: {'lr': 0.00048494936101452446, 'samples': 3538752, 'steps': 18430, 'loss/train': 1.138119101524353} 11/06/2021 23:50:51 - INFO - __main__ - Step 18432: {'lr': 0.00048494754747719954, 'samples': 3538944, 'steps': 18431, 'loss/train': 1.6822428703308105} 11/06/2021 23:50:52 - INFO - __main__ - Step 18433: {'lr': 0.00048494573383401084, 'samples': 3539136, 'steps': 18432, 'loss/train': 1.2326446771621704} 11/06/2021 23:50:53 - INFO - __main__ - Step 18434: {'lr': 0.0004849439200849592, 'samples': 3539328, 'steps': 18433, 'loss/train': 1.6924725770950317} 11/06/2021 23:50:53 - INFO - __main__ - Step 18435: {'lr': 0.0004849421062300455, 'samples': 3539520, 'steps': 18434, 'loss/train': 1.6470112800598145} 11/06/2021 23:50:53 - INFO - __main__ - Step 18436: {'lr': 0.0004849402922692705, 'samples': 3539712, 'steps': 18435, 'loss/train': 1.5454944372177124} 11/06/2021 23:50:54 - INFO - __main__ - Step 18437: {'lr': 0.000484938478202635, 'samples': 3539904, 'steps': 18436, 'loss/train': 1.955151915550232} 11/06/2021 23:50:54 - INFO - __main__ - Step 18438: {'lr': 0.0004849366640301399, 'samples': 3540096, 'steps': 18437, 'loss/train': 1.923080563545227} 11/06/2021 23:50:55 - INFO - __main__ - Step 18439: {'lr': 0.00048493484975178593, 'samples': 3540288, 'steps': 18438, 'loss/train': 1.9832425117492676} 11/06/2021 23:50:56 - INFO - __main__ - Step 18440: {'lr': 0.00048493303536757394, 'samples': 3540480, 'steps': 18439, 'loss/train': 1.5207114219665527} 11/06/2021 23:50:56 - INFO - __main__ - Step 18441: {'lr': 0.00048493122087750473, 'samples': 3540672, 'steps': 18440, 'loss/train': 1.084029197692871} 11/06/2021 23:50:56 - INFO - __main__ - Step 18442: {'lr': 0.0004849294062815792, 'samples': 3540864, 'steps': 18441, 'loss/train': 1.1858590841293335} 11/06/2021 23:50:57 - INFO - __main__ - Step 18443: {'lr': 0.000484927591579798, 'samples': 3541056, 'steps': 18442, 'loss/train': 0.3680558204650879} 11/06/2021 23:50:58 - INFO - __main__ - Step 18444: {'lr': 0.0004849257767721622, 'samples': 3541248, 'steps': 18443, 'loss/train': 1.4493937492370605} 11/06/2021 23:50:58 - INFO - __main__ - Step 18445: {'lr': 0.00048492396185867236, 'samples': 3541440, 'steps': 18444, 'loss/train': 1.2673674821853638} 11/06/2021 23:50:59 - INFO - __main__ - Step 18446: {'lr': 0.0004849221468393294, 'samples': 3541632, 'steps': 18445, 'loss/train': 1.6834259033203125} 11/06/2021 23:50:59 - INFO - __main__ - Step 18447: {'lr': 0.00048492033171413425, 'samples': 3541824, 'steps': 18446, 'loss/train': 1.5090739727020264} 11/06/2021 23:50:59 - INFO - __main__ - Step 18448: {'lr': 0.00048491851648308756, 'samples': 3542016, 'steps': 18447, 'loss/train': 1.581689476966858} 11/06/2021 23:51:00 - INFO - __main__ - Step 18449: {'lr': 0.00048491670114619026, 'samples': 3542208, 'steps': 18448, 'loss/train': 1.6174651384353638} 11/06/2021 23:51:01 - INFO - __main__ - Step 18450: {'lr': 0.000484914885703443, 'samples': 3542400, 'steps': 18449, 'loss/train': 0.9411190748214722} 11/06/2021 23:51:01 - INFO - __main__ - Step 18451: {'lr': 0.00048491307015484684, 'samples': 3542592, 'steps': 18450, 'loss/train': 1.7741544246673584} 11/06/2021 23:51:01 - INFO - __main__ - Step 18452: {'lr': 0.0004849112545004024, 'samples': 3542784, 'steps': 18451, 'loss/train': 2.107996940612793} 11/06/2021 23:51:02 - INFO - __main__ - Step 18453: {'lr': 0.00048490943874011054, 'samples': 3542976, 'steps': 18452, 'loss/train': 1.9751431941986084} 11/06/2021 23:51:03 - INFO - __main__ - Step 18454: {'lr': 0.00048490762287397215, 'samples': 3543168, 'steps': 18453, 'loss/train': 1.6343867778778076} 11/06/2021 23:51:03 - INFO - __main__ - Step 18455: {'lr': 0.00048490580690198804, 'samples': 3543360, 'steps': 18454, 'loss/train': 1.3063688278198242} 11/06/2021 23:51:03 - INFO - __main__ - Step 18456: {'lr': 0.000484903990824159, 'samples': 3543552, 'steps': 18455, 'loss/train': 1.9505548477172852} 11/06/2021 23:51:04 - INFO - __main__ - Step 18457: {'lr': 0.0004849021746404859, 'samples': 3543744, 'steps': 18456, 'loss/train': 1.9116475582122803} 11/06/2021 23:51:04 - INFO - __main__ - Step 18458: {'lr': 0.00048490035835096936, 'samples': 3543936, 'steps': 18457, 'loss/train': 1.0650830268859863} 11/06/2021 23:51:05 - INFO - __main__ - Step 18459: {'lr': 0.0004848985419556104, 'samples': 3544128, 'steps': 18458, 'loss/train': 1.4827680587768555} 11/06/2021 23:51:05 - INFO - __main__ - Step 18460: {'lr': 0.0004848967254544099, 'samples': 3544320, 'steps': 18459, 'loss/train': 1.7137441635131836} 11/06/2021 23:51:06 - INFO - __main__ - Step 18461: {'lr': 0.00048489490884736844, 'samples': 3544512, 'steps': 18460, 'loss/train': 1.8879387378692627} 11/06/2021 23:51:06 - INFO - __main__ - Step 18462: {'lr': 0.00048489309213448696, 'samples': 3544704, 'steps': 18461, 'loss/train': 1.5725513696670532} 11/06/2021 23:51:07 - INFO - __main__ - Step 18463: {'lr': 0.00048489127531576627, 'samples': 3544896, 'steps': 18462, 'loss/train': 1.7514148950576782} 11/06/2021 23:51:08 - INFO - __main__ - Step 18464: {'lr': 0.0004848894583912072, 'samples': 3545088, 'steps': 18463, 'loss/train': 1.7304996252059937} 11/06/2021 23:51:08 - INFO - __main__ - Step 18465: {'lr': 0.00048488764136081063, 'samples': 3545280, 'steps': 18464, 'loss/train': 1.7402286529541016} 11/06/2021 23:51:08 - INFO - __main__ - Step 18466: {'lr': 0.00048488582422457726, 'samples': 3545472, 'steps': 18465, 'loss/train': 1.6294695138931274} 11/06/2021 23:51:09 - INFO - __main__ - Step 18467: {'lr': 0.000484884006982508, 'samples': 3545664, 'steps': 18466, 'loss/train': 1.3782211542129517} 11/06/2021 23:51:09 - INFO - __main__ - Step 18468: {'lr': 0.0004848821896346036, 'samples': 3545856, 'steps': 18467, 'loss/train': 1.301269769668579} 11/06/2021 23:51:10 - INFO - __main__ - Step 18469: {'lr': 0.0004848803721808649, 'samples': 3546048, 'steps': 18468, 'loss/train': 1.5572654008865356} 11/06/2021 23:51:10 - INFO - __main__ - Step 18470: {'lr': 0.0004848785546212927, 'samples': 3546240, 'steps': 18469, 'loss/train': 1.7710777521133423} 11/06/2021 23:51:11 - INFO - __main__ - Step 18471: {'lr': 0.00048487673695588794, 'samples': 3546432, 'steps': 18470, 'loss/train': 0.349107563495636} 11/06/2021 23:51:11 - INFO - __main__ - Step 18472: {'lr': 0.00048487491918465135, 'samples': 3546624, 'steps': 18471, 'loss/train': 1.540006399154663} 11/06/2021 23:51:12 - INFO - __main__ - Step 18473: {'lr': 0.00048487310130758366, 'samples': 3546816, 'steps': 18472, 'loss/train': 1.5236613750457764} 11/06/2021 23:51:12 - INFO - __main__ - Step 18474: {'lr': 0.00048487128332468576, 'samples': 3547008, 'steps': 18473, 'loss/train': 2.0391478538513184} 11/06/2021 23:51:13 - INFO - __main__ - Step 18475: {'lr': 0.00048486946523595856, 'samples': 3547200, 'steps': 18474, 'loss/train': 1.7095692157745361} 11/06/2021 23:51:13 - INFO - __main__ - Step 18476: {'lr': 0.00048486764704140276, 'samples': 3547392, 'steps': 18475, 'loss/train': 1.4590822458267212} 11/06/2021 23:51:14 - INFO - __main__ - Step 18477: {'lr': 0.00048486582874101924, 'samples': 3547584, 'steps': 18476, 'loss/train': 1.449753761291504} 11/06/2021 23:51:14 - INFO - __main__ - Step 18478: {'lr': 0.0004848640103348088, 'samples': 3547776, 'steps': 18477, 'loss/train': 1.4348716735839844} 11/06/2021 23:51:14 - INFO - __main__ - Step 18479: {'lr': 0.00048486219182277226, 'samples': 3547968, 'steps': 18478, 'loss/train': 2.534212827682495} 11/06/2021 23:51:15 - INFO - __main__ - Step 18480: {'lr': 0.00048486037320491043, 'samples': 3548160, 'steps': 18479, 'loss/train': 1.7648648023605347} 11/06/2021 23:51:16 - INFO - __main__ - Step 18481: {'lr': 0.0004848585544812242, 'samples': 3548352, 'steps': 18480, 'loss/train': 1.6988426446914673} 11/06/2021 23:51:16 - INFO - __main__ - Step 18482: {'lr': 0.0004848567356517143, 'samples': 3548544, 'steps': 18481, 'loss/train': 1.972807765007019} 11/06/2021 23:51:16 - INFO - __main__ - Step 18483: {'lr': 0.00048485491671638146, 'samples': 3548736, 'steps': 18482, 'loss/train': 2.1826088428497314} 11/06/2021 23:51:17 - INFO - __main__ - Step 18484: {'lr': 0.0004848530976752268, 'samples': 3548928, 'steps': 18483, 'loss/train': 1.9444767236709595} 11/06/2021 23:51:18 - INFO - __main__ - Step 18485: {'lr': 0.0004848512785282508, 'samples': 3549120, 'steps': 18484, 'loss/train': 1.6333898305892944} 11/06/2021 23:51:18 - INFO - __main__ - Step 18486: {'lr': 0.00048484945927545456, 'samples': 3549312, 'steps': 18485, 'loss/train': 0.7610716223716736} 11/06/2021 23:51:18 - INFO - __main__ - Step 18487: {'lr': 0.0004848476399168387, 'samples': 3549504, 'steps': 18486, 'loss/train': 1.0934349298477173} 11/06/2021 23:51:19 - INFO - __main__ - Step 18488: {'lr': 0.0004848458204524042, 'samples': 3549696, 'steps': 18487, 'loss/train': 1.1723536252975464} 11/06/2021 23:51:19 - INFO - __main__ - Step 18489: {'lr': 0.00048484400088215173, 'samples': 3549888, 'steps': 18488, 'loss/train': 1.8059593439102173} 11/06/2021 23:51:20 - INFO - __main__ - Step 18490: {'lr': 0.0004848421812060821, 'samples': 3550080, 'steps': 18489, 'loss/train': 1.5580663681030273} 11/06/2021 23:51:21 - INFO - __main__ - Step 18491: {'lr': 0.0004848403614241964, 'samples': 3550272, 'steps': 18490, 'loss/train': 1.4911553859710693} 11/06/2021 23:51:21 - INFO - __main__ - Step 18492: {'lr': 0.00048483854153649514, 'samples': 3550464, 'steps': 18491, 'loss/train': 1.4916285276412964} 11/06/2021 23:51:21 - INFO - __main__ - Step 18493: {'lr': 0.0004848367215429793, 'samples': 3550656, 'steps': 18492, 'loss/train': 0.5384908318519592} 11/06/2021 23:51:22 - INFO - __main__ - Step 18494: {'lr': 0.0004848349014436496, 'samples': 3550848, 'steps': 18493, 'loss/train': 1.4912225008010864} 11/06/2021 23:51:23 - INFO - __main__ - Step 18495: {'lr': 0.00048483308123850697, 'samples': 3551040, 'steps': 18494, 'loss/train': 1.5729341506958008} 11/06/2021 23:51:23 - INFO - __main__ - Step 18496: {'lr': 0.00048483126092755215, 'samples': 3551232, 'steps': 18495, 'loss/train': 1.8693089485168457} 11/06/2021 23:51:24 - INFO - __main__ - Step 18497: {'lr': 0.000484829440510786, 'samples': 3551424, 'steps': 18496, 'loss/train': 1.8200191259384155} 11/06/2021 23:51:24 - INFO - __main__ - Step 18498: {'lr': 0.0004848276199882093, 'samples': 3551616, 'steps': 18497, 'loss/train': 1.7423537969589233} 11/06/2021 23:51:24 - INFO - __main__ - Step 18499: {'lr': 0.0004848257993598229, 'samples': 3551808, 'steps': 18498, 'loss/train': 0.9620110392570496} 11/06/2021 23:51:25 - INFO - __main__ - Step 18500: {'lr': 0.00048482397862562764, 'samples': 3552000, 'steps': 18499, 'loss/train': 1.7833219766616821} 11/06/2021 23:51:25 - INFO - __main__ - Step 18501: {'lr': 0.00048482215778562434, 'samples': 3552192, 'steps': 18500, 'loss/train': 1.6852136850357056} 11/06/2021 23:51:27 - INFO - __main__ - Step 18502: {'lr': 0.00048482033683981376, 'samples': 3552384, 'steps': 18501, 'loss/train': 1.7665677070617676} 11/06/2021 23:51:27 - INFO - __main__ - Step 18503: {'lr': 0.0004848185157881968, 'samples': 3552576, 'steps': 18502, 'loss/train': 1.2151190042495728} 11/06/2021 23:51:28 - INFO - __main__ - Step 18504: {'lr': 0.0004848166946307742, 'samples': 3552768, 'steps': 18503, 'loss/train': 0.9729933142662048} 11/06/2021 23:51:28 - INFO - __main__ - Step 18505: {'lr': 0.0004848148733675468, 'samples': 3552960, 'steps': 18504, 'loss/train': 0.7332248687744141} 11/06/2021 23:51:28 - INFO - __main__ - Step 18506: {'lr': 0.0004848130519985155, 'samples': 3553152, 'steps': 18505, 'loss/train': 1.4994479417800903} 11/06/2021 23:51:29 - INFO - __main__ - Step 18507: {'lr': 0.000484811230523681, 'samples': 3553344, 'steps': 18506, 'loss/train': 0.5550244450569153} 11/06/2021 23:51:29 - INFO - __main__ - Step 18508: {'lr': 0.00048480940894304425, 'samples': 3553536, 'steps': 18507, 'loss/train': 2.118533134460449} 11/06/2021 23:51:30 - INFO - __main__ - Step 18509: {'lr': 0.000484807587256606, 'samples': 3553728, 'steps': 18508, 'loss/train': 1.753688097000122} 11/06/2021 23:51:30 - INFO - __main__ - Step 18510: {'lr': 0.00048480576546436707, 'samples': 3553920, 'steps': 18509, 'loss/train': 1.6554456949234009} 11/06/2021 23:51:31 - INFO - __main__ - Step 18511: {'lr': 0.0004848039435663282, 'samples': 3554112, 'steps': 18510, 'loss/train': 1.341917634010315} 11/06/2021 23:51:31 - INFO - __main__ - Step 18512: {'lr': 0.0004848021215624904, 'samples': 3554304, 'steps': 18511, 'loss/train': 2.3233754634857178} 11/06/2021 23:51:31 - INFO - __main__ - Step 18513: {'lr': 0.0004848002994528543, 'samples': 3554496, 'steps': 18512, 'loss/train': 1.3712459802627563} 11/06/2021 23:51:32 - INFO - __main__ - Step 18514: {'lr': 0.0004847984772374209, 'samples': 3554688, 'steps': 18513, 'loss/train': 1.0572799444198608} 11/06/2021 23:51:33 - INFO - __main__ - Step 18515: {'lr': 0.0004847966549161909, 'samples': 3554880, 'steps': 18514, 'loss/train': 1.137990117073059} 11/06/2021 23:51:33 - INFO - __main__ - Step 18516: {'lr': 0.0004847948324891651, 'samples': 3555072, 'steps': 18515, 'loss/train': 1.7233134508132935} 11/06/2021 23:51:34 - INFO - __main__ - Step 18517: {'lr': 0.00048479300995634447, 'samples': 3555264, 'steps': 18516, 'loss/train': 1.5515546798706055} 11/06/2021 23:51:34 - INFO - __main__ - Step 18518: {'lr': 0.0004847911873177296, 'samples': 3555456, 'steps': 18517, 'loss/train': 0.4498330056667328} 11/06/2021 23:51:35 - INFO - __main__ - Step 18519: {'lr': 0.0004847893645733216, 'samples': 3555648, 'steps': 18518, 'loss/train': 1.9441230297088623} 11/06/2021 23:51:35 - INFO - __main__ - Step 18520: {'lr': 0.000484787541723121, 'samples': 3555840, 'steps': 18519, 'loss/train': 1.6475194692611694} 11/06/2021 23:51:36 - INFO - __main__ - Step 18521: {'lr': 0.0004847857187671288, 'samples': 3556032, 'steps': 18520, 'loss/train': 1.5261662006378174} 11/06/2021 23:51:36 - INFO - __main__ - Step 18522: {'lr': 0.00048478389570534575, 'samples': 3556224, 'steps': 18521, 'loss/train': 2.240158796310425} 11/06/2021 23:51:36 - INFO - __main__ - Step 18523: {'lr': 0.0004847820725377728, 'samples': 3556416, 'steps': 18522, 'loss/train': 1.3520618677139282} 11/06/2021 23:51:37 - INFO - __main__ - Step 18524: {'lr': 0.0004847802492644106, 'samples': 3556608, 'steps': 18523, 'loss/train': 1.5715515613555908} 11/06/2021 23:51:38 - INFO - __main__ - Step 18525: {'lr': 0.00048477842588526, 'samples': 3556800, 'steps': 18524, 'loss/train': 2.274082899093628} 11/06/2021 23:51:38 - INFO - __main__ - Step 18526: {'lr': 0.000484776602400322, 'samples': 3556992, 'steps': 18525, 'loss/train': 1.5804866552352905} 11/06/2021 23:51:38 - INFO - __main__ - Step 18527: {'lr': 0.00048477477880959715, 'samples': 3557184, 'steps': 18526, 'loss/train': 1.8836625814437866} 11/06/2021 23:51:39 - INFO - __main__ - Step 18528: {'lr': 0.00048477295511308645, 'samples': 3557376, 'steps': 18527, 'loss/train': 2.796858072280884} 11/06/2021 23:51:40 - INFO - __main__ - Step 18529: {'lr': 0.0004847711313107907, 'samples': 3557568, 'steps': 18528, 'loss/train': 1.5906380414962769} 11/06/2021 23:51:40 - INFO - __main__ - Step 18530: {'lr': 0.0004847693074027106, 'samples': 3557760, 'steps': 18529, 'loss/train': 1.6240956783294678} 11/06/2021 23:51:40 - INFO - __main__ - Step 18531: {'lr': 0.0004847674833888472, 'samples': 3557952, 'steps': 18530, 'loss/train': 1.0976725816726685} 11/06/2021 23:51:41 - INFO - __main__ - Step 18532: {'lr': 0.0004847656592692012, 'samples': 3558144, 'steps': 18531, 'loss/train': 1.590442419052124} 11/06/2021 23:51:41 - INFO - __main__ - Step 18533: {'lr': 0.00048476383504377337, 'samples': 3558336, 'steps': 18532, 'loss/train': 2.1366772651672363} 11/06/2021 23:51:41 - INFO - __main__ - Step 18534: {'lr': 0.00048476201071256453, 'samples': 3558528, 'steps': 18533, 'loss/train': 1.71946120262146} 11/06/2021 23:51:43 - INFO - __main__ - Step 18535: {'lr': 0.0004847601862755756, 'samples': 3558720, 'steps': 18534, 'loss/train': 1.520142674446106} 11/06/2021 23:51:43 - INFO - __main__ - Step 18536: {'lr': 0.0004847583617328074, 'samples': 3558912, 'steps': 18535, 'loss/train': 1.7262592315673828} 11/06/2021 23:51:43 - INFO - __main__ - Step 18537: {'lr': 0.00048475653708426067, 'samples': 3559104, 'steps': 18536, 'loss/train': 1.5982173681259155} 11/06/2021 23:51:44 - INFO - __main__ - Step 18538: {'lr': 0.00048475471232993625, 'samples': 3559296, 'steps': 18537, 'loss/train': 1.012058138847351} 11/06/2021 23:51:44 - INFO - __main__ - Step 18539: {'lr': 0.000484752887469835, 'samples': 3559488, 'steps': 18538, 'loss/train': 1.8425029516220093} 11/06/2021 23:51:45 - INFO - __main__ - Step 18540: {'lr': 0.0004847510625039577, 'samples': 3559680, 'steps': 18539, 'loss/train': 1.090240716934204} 11/06/2021 23:51:45 - INFO - __main__ - Step 18541: {'lr': 0.00048474923743230513, 'samples': 3559872, 'steps': 18540, 'loss/train': 1.447522759437561} 11/06/2021 23:51:46 - INFO - __main__ - Step 18542: {'lr': 0.0004847474122548783, 'samples': 3560064, 'steps': 18541, 'loss/train': 1.5654059648513794} 11/06/2021 23:51:46 - INFO - __main__ - Step 18543: {'lr': 0.00048474558697167783, 'samples': 3560256, 'steps': 18542, 'loss/train': 0.9320666790008545} 11/06/2021 23:51:46 - INFO - __main__ - Step 18544: {'lr': 0.0004847437615827046, 'samples': 3560448, 'steps': 18543, 'loss/train': 1.538326621055603} 11/06/2021 23:51:47 - INFO - __main__ - Step 18545: {'lr': 0.0004847419360879596, 'samples': 3560640, 'steps': 18544, 'loss/train': 1.6450234651565552} 11/06/2021 23:51:48 - INFO - __main__ - Step 18546: {'lr': 0.00048474011048744336, 'samples': 3560832, 'steps': 18545, 'loss/train': 0.8614462614059448} 11/06/2021 23:51:48 - INFO - __main__ - Step 18547: {'lr': 0.0004847382847811569, 'samples': 3561024, 'steps': 18546, 'loss/train': 1.2386878728866577} 11/06/2021 23:51:49 - INFO - __main__ - Step 18548: {'lr': 0.00048473645896910094, 'samples': 3561216, 'steps': 18547, 'loss/train': 1.8863868713378906} 11/06/2021 23:51:49 - INFO - __main__ - Step 18549: {'lr': 0.0004847346330512764, 'samples': 3561408, 'steps': 18548, 'loss/train': 1.5607517957687378} 11/06/2021 23:51:50 - INFO - __main__ - Step 18550: {'lr': 0.0004847328070276841, 'samples': 3561600, 'steps': 18549, 'loss/train': 1.6773111820220947} 11/06/2021 23:51:50 - INFO - __main__ - Step 18551: {'lr': 0.00048473098089832475, 'samples': 3561792, 'steps': 18550, 'loss/train': 1.3553521633148193} 11/06/2021 23:51:51 - INFO - __main__ - Step 18552: {'lr': 0.0004847291546631992, 'samples': 3561984, 'steps': 18551, 'loss/train': 1.1575710773468018} 11/06/2021 23:51:51 - INFO - __main__ - Step 18553: {'lr': 0.0004847273283223084, 'samples': 3562176, 'steps': 18552, 'loss/train': 1.689874291419983} 11/06/2021 23:51:51 - INFO - __main__ - Step 18554: {'lr': 0.0004847255018756531, 'samples': 3562368, 'steps': 18553, 'loss/train': 1.8873575925827026} 11/06/2021 23:51:52 - INFO - __main__ - Step 18555: {'lr': 0.0004847236753232341, 'samples': 3562560, 'steps': 18554, 'loss/train': 1.9002426862716675} 11/06/2021 23:51:53 - INFO - __main__ - Step 18556: {'lr': 0.0004847218486650522, 'samples': 3562752, 'steps': 18555, 'loss/train': 1.6513395309448242} 11/06/2021 23:51:53 - INFO - __main__ - Step 18557: {'lr': 0.00048472002190110827, 'samples': 3562944, 'steps': 18556, 'loss/train': 1.091086506843567} 11/06/2021 23:51:53 - INFO - __main__ - Step 18558: {'lr': 0.0004847181950314031, 'samples': 3563136, 'steps': 18557, 'loss/train': 1.3922399282455444} 11/06/2021 23:51:54 - INFO - __main__ - Step 18559: {'lr': 0.00048471636805593756, 'samples': 3563328, 'steps': 18558, 'loss/train': 1.6785709857940674} 11/06/2021 23:51:55 - INFO - __main__ - Step 18560: {'lr': 0.0004847145409747125, 'samples': 3563520, 'steps': 18559, 'loss/train': 1.2590919733047485} 11/06/2021 23:51:55 - INFO - __main__ - Step 18561: {'lr': 0.00048471271378772857, 'samples': 3563712, 'steps': 18560, 'loss/train': 0.998810887336731} 11/06/2021 23:51:56 - INFO - __main__ - Step 18562: {'lr': 0.00048471088649498675, 'samples': 3563904, 'steps': 18561, 'loss/train': 1.4806946516036987} 11/06/2021 23:51:56 - INFO - __main__ - Step 18563: {'lr': 0.0004847090590964879, 'samples': 3564096, 'steps': 18562, 'loss/train': 1.5437051057815552} 11/06/2021 23:51:56 - INFO - __main__ - Step 18564: {'lr': 0.00048470723159223266, 'samples': 3564288, 'steps': 18563, 'loss/train': 1.4612468481063843} 11/06/2021 23:51:57 - INFO - __main__ - Step 18565: {'lr': 0.00048470540398222207, 'samples': 3564480, 'steps': 18564, 'loss/train': 2.0015370845794678} 11/06/2021 23:51:58 - INFO - __main__ - Step 18566: {'lr': 0.00048470357626645676, 'samples': 3564672, 'steps': 18565, 'loss/train': 1.15360689163208} 11/06/2021 23:51:58 - INFO - __main__ - Step 18567: {'lr': 0.0004847017484449377, 'samples': 3564864, 'steps': 18566, 'loss/train': 1.3309861421585083} 11/06/2021 23:51:58 - INFO - __main__ - Step 18568: {'lr': 0.0004846999205176657, 'samples': 3565056, 'steps': 18567, 'loss/train': 1.6591993570327759} 11/06/2021 23:51:59 - INFO - __main__ - Step 18569: {'lr': 0.00048469809248464135, 'samples': 3565248, 'steps': 18568, 'loss/train': 1.9893124103546143} 11/06/2021 23:51:59 - INFO - __main__ - Step 18570: {'lr': 0.0004846962643458658, 'samples': 3565440, 'steps': 18569, 'loss/train': 1.7136503458023071} 11/06/2021 23:52:00 - INFO - __main__ - Step 18571: {'lr': 0.00048469443610133975, 'samples': 3565632, 'steps': 18570, 'loss/train': 1.7042827606201172} 11/06/2021 23:52:00 - INFO - __main__ - Step 18572: {'lr': 0.00048469260775106394, 'samples': 3565824, 'steps': 18571, 'loss/train': 1.9384015798568726} 11/06/2021 23:52:01 - INFO - __main__ - Step 18573: {'lr': 0.0004846907792950393, 'samples': 3566016, 'steps': 18572, 'loss/train': 1.6838961839675903} 11/06/2021 23:52:01 - INFO - __main__ - Step 18574: {'lr': 0.00048468895073326663, 'samples': 3566208, 'steps': 18573, 'loss/train': 1.2497248649597168} 11/06/2021 23:52:01 - INFO - __main__ - Step 18575: {'lr': 0.0004846871220657467, 'samples': 3566400, 'steps': 18574, 'loss/train': 1.6817874908447266} 11/06/2021 23:52:02 - INFO - __main__ - Step 18576: {'lr': 0.0004846852932924804, 'samples': 3566592, 'steps': 18575, 'loss/train': 1.5133883953094482} 11/06/2021 23:52:03 - INFO - __main__ - Step 18577: {'lr': 0.00048468346441346853, 'samples': 3566784, 'steps': 18576, 'loss/train': 2.222951650619507} 11/06/2021 23:52:03 - INFO - __main__ - Step 18578: {'lr': 0.0004846816354287119, 'samples': 3566976, 'steps': 18577, 'loss/train': 1.6888341903686523} 11/06/2021 23:52:03 - INFO - __main__ - Step 18579: {'lr': 0.0004846798063382114, 'samples': 3567168, 'steps': 18578, 'loss/train': 1.3981465101242065} 11/06/2021 23:52:04 - INFO - __main__ - Step 18580: {'lr': 0.0004846779771419677, 'samples': 3567360, 'steps': 18579, 'loss/train': 1.2701265811920166} 11/06/2021 23:52:05 - INFO - __main__ - Step 18581: {'lr': 0.0004846761478399818, 'samples': 3567552, 'steps': 18580, 'loss/train': 2.1845619678497314} 11/06/2021 23:52:06 - INFO - __main__ - Step 18582: {'lr': 0.0004846743184322544, 'samples': 3567744, 'steps': 18581, 'loss/train': 1.6259300708770752} 11/06/2021 23:52:06 - INFO - __main__ - Step 18583: {'lr': 0.00048467248891878644, 'samples': 3567936, 'steps': 18582, 'loss/train': 4.3415937423706055} 11/06/2021 23:52:06 - INFO - __main__ - Step 18584: {'lr': 0.00048467065929957867, 'samples': 3568128, 'steps': 18583, 'loss/train': 4.058262348175049} 11/06/2021 23:52:07 - INFO - __main__ - Step 18585: {'lr': 0.00048466882957463186, 'samples': 3568320, 'steps': 18584, 'loss/train': 1.3394839763641357} 11/06/2021 23:52:07 - INFO - __main__ - Step 18586: {'lr': 0.0004846669997439469, 'samples': 3568512, 'steps': 18585, 'loss/train': 1.6075658798217773} 11/06/2021 23:52:08 - INFO - __main__ - Step 18587: {'lr': 0.0004846651698075246, 'samples': 3568704, 'steps': 18586, 'loss/train': 1.5133603811264038} 11/06/2021 23:52:08 - INFO - __main__ - Step 18588: {'lr': 0.00048466333976536594, 'samples': 3568896, 'steps': 18587, 'loss/train': 1.4529722929000854} 11/06/2021 23:52:09 - INFO - __main__ - Step 18589: {'lr': 0.0004846615096174715, 'samples': 3569088, 'steps': 18588, 'loss/train': 1.5087271928787231} 11/06/2021 23:52:09 - INFO - __main__ - Step 18590: {'lr': 0.00048465967936384217, 'samples': 3569280, 'steps': 18589, 'loss/train': 1.7790948152542114} 11/06/2021 23:52:09 - INFO - __main__ - Step 18591: {'lr': 0.00048465784900447885, 'samples': 3569472, 'steps': 18590, 'loss/train': 1.938625693321228} 11/06/2021 23:52:11 - INFO - __main__ - Step 18592: {'lr': 0.00048465601853938224, 'samples': 3569664, 'steps': 18591, 'loss/train': 1.7642910480499268} 11/06/2021 23:52:11 - INFO - __main__ - Step 18593: {'lr': 0.0004846541879685533, 'samples': 3569856, 'steps': 18592, 'loss/train': 1.766231656074524} 11/06/2021 23:52:11 - INFO - __main__ - Step 18594: {'lr': 0.0004846523572919929, 'samples': 3570048, 'steps': 18593, 'loss/train': 0.5659541487693787} 11/06/2021 23:52:12 - INFO - __main__ - Step 18595: {'lr': 0.00048465052650970166, 'samples': 3570240, 'steps': 18594, 'loss/train': 1.2477904558181763} 11/06/2021 23:52:12 - INFO - __main__ - Step 18596: {'lr': 0.00048464869562168055, 'samples': 3570432, 'steps': 18595, 'loss/train': 2.0898544788360596} 11/06/2021 23:52:12 - INFO - __main__ - Step 18597: {'lr': 0.0004846468646279304, 'samples': 3570624, 'steps': 18596, 'loss/train': 1.4772709608078003} 11/06/2021 23:52:14 - INFO - __main__ - Step 18598: {'lr': 0.0004846450335284519, 'samples': 3570816, 'steps': 18597, 'loss/train': 1.557440161705017} 11/06/2021 23:52:14 - INFO - __main__ - Step 18599: {'lr': 0.00048464320232324604, 'samples': 3571008, 'steps': 18598, 'loss/train': 0.901743471622467} 11/06/2021 23:52:14 - INFO - __main__ - Step 18600: {'lr': 0.00048464137101231355, 'samples': 3571200, 'steps': 18599, 'loss/train': 1.3562065362930298} 11/06/2021 23:52:15 - INFO - __main__ - Step 18601: {'lr': 0.0004846395395956553, 'samples': 3571392, 'steps': 18600, 'loss/train': 1.8090925216674805} 11/06/2021 23:52:15 - INFO - __main__ - Step 18602: {'lr': 0.00048463770807327206, 'samples': 3571584, 'steps': 18601, 'loss/train': 1.758899211883545} 11/06/2021 23:52:16 - INFO - __main__ - Step 18603: {'lr': 0.00048463587644516473, 'samples': 3571776, 'steps': 18602, 'loss/train': 1.4766874313354492} 11/06/2021 23:52:16 - INFO - __main__ - Step 18604: {'lr': 0.00048463404471133404, 'samples': 3571968, 'steps': 18603, 'loss/train': 1.383787989616394} 11/06/2021 23:52:17 - INFO - __main__ - Step 18605: {'lr': 0.00048463221287178094, 'samples': 3572160, 'steps': 18604, 'loss/train': 1.5332286357879639} 11/06/2021 23:52:17 - INFO - __main__ - Step 18606: {'lr': 0.0004846303809265061, 'samples': 3572352, 'steps': 18605, 'loss/train': 3.0497324466705322} 11/06/2021 23:52:18 - INFO - __main__ - Step 18607: {'lr': 0.00048462854887551044, 'samples': 3572544, 'steps': 18606, 'loss/train': 1.8126506805419922} 11/06/2021 23:52:18 - INFO - __main__ - Step 18608: {'lr': 0.0004846267167187949, 'samples': 3572736, 'steps': 18607, 'loss/train': 1.7569042444229126} 11/06/2021 23:52:19 - INFO - __main__ - Step 18609: {'lr': 0.00048462488445636005, 'samples': 3572928, 'steps': 18608, 'loss/train': 1.6716196537017822} 11/06/2021 23:52:19 - INFO - __main__ - Step 18610: {'lr': 0.0004846230520882069, 'samples': 3573120, 'steps': 18609, 'loss/train': 1.321014642715454} 11/06/2021 23:52:20 - INFO - __main__ - Step 18611: {'lr': 0.00048462121961433623, 'samples': 3573312, 'steps': 18610, 'loss/train': 1.7713549137115479} 11/06/2021 23:52:20 - INFO - __main__ - Step 18612: {'lr': 0.00048461938703474886, 'samples': 3573504, 'steps': 18611, 'loss/train': 1.7149677276611328} 11/06/2021 23:52:21 - INFO - __main__ - Step 18613: {'lr': 0.00048461755434944554, 'samples': 3573696, 'steps': 18612, 'loss/train': 2.0057523250579834} 11/06/2021 23:52:21 - INFO - __main__ - Step 18614: {'lr': 0.00048461572155842725, 'samples': 3573888, 'steps': 18613, 'loss/train': 1.244421362876892} 11/06/2021 23:52:22 - INFO - __main__ - Step 18615: {'lr': 0.00048461388866169474, 'samples': 3574080, 'steps': 18614, 'loss/train': 1.7953641414642334} 11/06/2021 23:52:22 - INFO - __main__ - Step 18616: {'lr': 0.00048461205565924884, 'samples': 3574272, 'steps': 18615, 'loss/train': 1.3679205179214478} 11/06/2021 23:52:22 - INFO - __main__ - Step 18617: {'lr': 0.0004846102225510903, 'samples': 3574464, 'steps': 18616, 'loss/train': 1.6835120916366577} 11/06/2021 23:52:23 - INFO - __main__ - Step 18618: {'lr': 0.00048460838933722005, 'samples': 3574656, 'steps': 18617, 'loss/train': 1.978041172027588} 11/06/2021 23:52:24 - INFO - __main__ - Step 18619: {'lr': 0.0004846065560176389, 'samples': 3574848, 'steps': 18618, 'loss/train': 1.8935999870300293} 11/06/2021 23:52:24 - INFO - __main__ - Step 18620: {'lr': 0.00048460472259234764, 'samples': 3575040, 'steps': 18619, 'loss/train': 1.8517574071884155} 11/06/2021 23:52:24 - INFO - __main__ - Step 18621: {'lr': 0.0004846028890613471, 'samples': 3575232, 'steps': 18620, 'loss/train': 1.4925874471664429} 11/06/2021 23:52:25 - INFO - __main__ - Step 18622: {'lr': 0.00048460105542463805, 'samples': 3575424, 'steps': 18621, 'loss/train': 1.666589617729187} 11/06/2021 23:52:26 - INFO - __main__ - Step 18623: {'lr': 0.00048459922168222146, 'samples': 3575616, 'steps': 18622, 'loss/train': 1.772215723991394} 11/06/2021 23:52:26 - INFO - __main__ - Step 18624: {'lr': 0.00048459738783409814, 'samples': 3575808, 'steps': 18623, 'loss/train': 1.7241744995117188} 11/06/2021 23:52:27 - INFO - __main__ - Step 18625: {'lr': 0.0004845955538802688, 'samples': 3576000, 'steps': 18624, 'loss/train': 1.6278271675109863} 11/06/2021 23:52:27 - INFO - __main__ - Step 18626: {'lr': 0.0004845937198207343, 'samples': 3576192, 'steps': 18625, 'loss/train': 1.2264925241470337} 11/06/2021 23:52:27 - INFO - __main__ - Step 18627: {'lr': 0.0004845918856554955, 'samples': 3576384, 'steps': 18626, 'loss/train': 1.4115939140319824} 11/06/2021 23:52:28 - INFO - __main__ - Step 18628: {'lr': 0.00048459005138455326, 'samples': 3576576, 'steps': 18627, 'loss/train': 1.6529324054718018} 11/06/2021 23:52:29 - INFO - __main__ - Step 18629: {'lr': 0.0004845882170079083, 'samples': 3576768, 'steps': 18628, 'loss/train': 1.8150955438613892} 11/06/2021 23:52:29 - INFO - __main__ - Step 18630: {'lr': 0.00048458638252556153, 'samples': 3576960, 'steps': 18629, 'loss/train': 1.786418080329895} 11/06/2021 23:52:29 - INFO - __main__ - Step 18631: {'lr': 0.0004845845479375138, 'samples': 3577152, 'steps': 18630, 'loss/train': 1.7051968574523926} 11/06/2021 23:52:30 - INFO - __main__ - Step 18632: {'lr': 0.00048458271324376586, 'samples': 3577344, 'steps': 18631, 'loss/train': 1.5298038721084595} 11/06/2021 23:52:30 - INFO - __main__ - Step 18633: {'lr': 0.0004845808784443185, 'samples': 3577536, 'steps': 18632, 'loss/train': 1.4666179418563843} 11/06/2021 23:52:31 - INFO - __main__ - Step 18634: {'lr': 0.00048457904353917277, 'samples': 3577728, 'steps': 18633, 'loss/train': 1.4519388675689697} 11/06/2021 23:52:31 - INFO - __main__ - Step 18635: {'lr': 0.0004845772085283292, 'samples': 3577920, 'steps': 18634, 'loss/train': 1.0929064750671387} 11/06/2021 23:52:32 - INFO - __main__ - Step 18636: {'lr': 0.00048457537341178885, 'samples': 3578112, 'steps': 18635, 'loss/train': 1.5843586921691895} 11/06/2021 23:52:32 - INFO - __main__ - Step 18637: {'lr': 0.0004845735381895524, 'samples': 3578304, 'steps': 18636, 'loss/train': 0.8894321322441101} 11/06/2021 23:52:33 - INFO - __main__ - Step 18638: {'lr': 0.0004845717028616208, 'samples': 3578496, 'steps': 18637, 'loss/train': 1.1412678956985474} 11/06/2021 23:52:34 - INFO - __main__ - Step 18639: {'lr': 0.00048456986742799474, 'samples': 3578688, 'steps': 18638, 'loss/train': 1.6525942087173462} 11/06/2021 23:52:34 - INFO - __main__ - Step 18640: {'lr': 0.00048456803188867513, 'samples': 3578880, 'steps': 18639, 'loss/train': 1.3935282230377197} 11/06/2021 23:52:34 - INFO - __main__ - Step 18641: {'lr': 0.00048456619624366284, 'samples': 3579072, 'steps': 18640, 'loss/train': 1.646254301071167} 11/06/2021 23:52:35 - INFO - __main__ - Step 18642: {'lr': 0.0004845643604929586, 'samples': 3579264, 'steps': 18641, 'loss/train': 1.6623493432998657} 11/06/2021 23:52:35 - INFO - __main__ - Step 18643: {'lr': 0.00048456252463656326, 'samples': 3579456, 'steps': 18642, 'loss/train': 2.4156510829925537} 11/06/2021 23:52:36 - INFO - __main__ - Step 18644: {'lr': 0.00048456068867447767, 'samples': 3579648, 'steps': 18643, 'loss/train': 1.8837863206863403} 11/06/2021 23:52:36 - INFO - __main__ - Step 18645: {'lr': 0.0004845588526067027, 'samples': 3579840, 'steps': 18644, 'loss/train': 2.037102460861206} 11/06/2021 23:52:37 - INFO - __main__ - Step 18646: {'lr': 0.00048455701643323914, 'samples': 3580032, 'steps': 18645, 'loss/train': 1.6288491487503052} 11/06/2021 23:52:37 - INFO - __main__ - Step 18647: {'lr': 0.00048455518015408773, 'samples': 3580224, 'steps': 18646, 'loss/train': 1.586424708366394} 11/06/2021 23:52:37 - INFO - __main__ - Step 18648: {'lr': 0.00048455334376924943, 'samples': 3580416, 'steps': 18647, 'loss/train': 1.1939408779144287} 11/06/2021 23:52:39 - INFO - __main__ - Step 18649: {'lr': 0.000484551507278725, 'samples': 3580608, 'steps': 18648, 'loss/train': 1.620247483253479} 11/06/2021 23:52:39 - INFO - __main__ - Step 18650: {'lr': 0.0004845496706825152, 'samples': 3580800, 'steps': 18649, 'loss/train': 0.9347361922264099} 11/06/2021 23:52:40 - INFO - __main__ - Step 18651: {'lr': 0.0004845478339806211, 'samples': 3580992, 'steps': 18650, 'loss/train': 1.3260414600372314} 11/06/2021 23:52:40 - INFO - __main__ - Step 18652: {'lr': 0.00048454599717304327, 'samples': 3581184, 'steps': 18651, 'loss/train': 1.604457139968872} 11/06/2021 23:52:40 - INFO - __main__ - Step 18653: {'lr': 0.0004845441602597826, 'samples': 3581376, 'steps': 18652, 'loss/train': 1.8285481929779053} 11/06/2021 23:52:41 - INFO - __main__ - Step 18654: {'lr': 0.00048454232324084004, 'samples': 3581568, 'steps': 18653, 'loss/train': 1.283732295036316} 11/06/2021 23:52:42 - INFO - __main__ - Step 18655: {'lr': 0.0004845404861162163, 'samples': 3581760, 'steps': 18654, 'loss/train': 0.30630865693092346} 11/06/2021 23:52:42 - INFO - __main__ - Step 18656: {'lr': 0.00048453864888591214, 'samples': 3581952, 'steps': 18655, 'loss/train': 0.8285514712333679} 11/06/2021 23:52:42 - INFO - __main__ - Step 18657: {'lr': 0.0004845368115499286, 'samples': 3582144, 'steps': 18656, 'loss/train': 1.719491720199585} 11/06/2021 23:52:43 - INFO - __main__ - Step 18658: {'lr': 0.0004845349741082663, 'samples': 3582336, 'steps': 18657, 'loss/train': 1.749588131904602} 11/06/2021 23:52:43 - INFO - __main__ - Step 18659: {'lr': 0.00048453313656092624, 'samples': 3582528, 'steps': 18658, 'loss/train': 1.7905672788619995} 11/06/2021 23:52:43 - INFO - __main__ - Step 18660: {'lr': 0.0004845312989079091, 'samples': 3582720, 'steps': 18659, 'loss/train': 1.651228666305542} 11/06/2021 23:52:45 - INFO - __main__ - Step 18661: {'lr': 0.0004845294611492158, 'samples': 3582912, 'steps': 18660, 'loss/train': 1.5173380374908447} 11/06/2021 23:52:45 - INFO - __main__ - Step 18662: {'lr': 0.00048452762328484724, 'samples': 3583104, 'steps': 18661, 'loss/train': 1.6837129592895508} 11/06/2021 23:52:45 - INFO - __main__ - Step 18663: {'lr': 0.000484525785314804, 'samples': 3583296, 'steps': 18662, 'loss/train': 1.4421535730361938} 11/06/2021 23:52:46 - INFO - __main__ - Step 18664: {'lr': 0.0004845239472390872, 'samples': 3583488, 'steps': 18663, 'loss/train': 1.7902997732162476} 11/06/2021 23:52:46 - INFO - __main__ - Step 18665: {'lr': 0.0004845221090576974, 'samples': 3583680, 'steps': 18664, 'loss/train': 2.102501153945923} 11/06/2021 23:52:47 - INFO - __main__ - Step 18666: {'lr': 0.0004845202707706356, 'samples': 3583872, 'steps': 18665, 'loss/train': 2.0066657066345215} 11/06/2021 23:52:47 - INFO - __main__ - Step 18667: {'lr': 0.0004845184323779026, 'samples': 3584064, 'steps': 18666, 'loss/train': 1.397498369216919} 11/06/2021 23:52:48 - INFO - __main__ - Step 18668: {'lr': 0.0004845165938794992, 'samples': 3584256, 'steps': 18667, 'loss/train': 1.3317439556121826} 11/06/2021 23:52:48 - INFO - __main__ - Step 18669: {'lr': 0.0004845147552754263, 'samples': 3584448, 'steps': 18668, 'loss/train': 1.9309141635894775} 11/06/2021 23:52:48 - INFO - __main__ - Step 18670: {'lr': 0.0004845129165656846, 'samples': 3584640, 'steps': 18669, 'loss/train': 1.3592866659164429} 11/06/2021 23:52:49 - INFO - __main__ - Step 18671: {'lr': 0.00048451107775027505, 'samples': 3584832, 'steps': 18670, 'loss/train': 1.5930179357528687} 11/06/2021 23:52:50 - INFO - __main__ - Step 18672: {'lr': 0.0004845092388291984, 'samples': 3585024, 'steps': 18671, 'loss/train': 1.7972415685653687} 11/06/2021 23:52:50 - INFO - __main__ - Step 18673: {'lr': 0.0004845073998024555, 'samples': 3585216, 'steps': 18672, 'loss/train': 2.1059470176696777} 11/06/2021 23:52:50 - INFO - __main__ - Step 18674: {'lr': 0.0004845055606700472, 'samples': 3585408, 'steps': 18673, 'loss/train': 1.6786866188049316} 11/06/2021 23:52:51 - INFO - __main__ - Step 18675: {'lr': 0.0004845037214319743, 'samples': 3585600, 'steps': 18674, 'loss/train': 1.7961063385009766} 11/06/2021 23:52:52 - INFO - __main__ - Step 18676: {'lr': 0.00048450188208823766, 'samples': 3585792, 'steps': 18675, 'loss/train': 1.6116433143615723} 11/06/2021 23:52:52 - INFO - __main__ - Step 18677: {'lr': 0.00048450004263883806, 'samples': 3585984, 'steps': 18676, 'loss/train': 1.9659193754196167} 11/06/2021 23:52:52 - INFO - __main__ - Step 18678: {'lr': 0.00048449820308377634, 'samples': 3586176, 'steps': 18677, 'loss/train': 1.7020381689071655} 11/06/2021 23:52:53 - INFO - __main__ - Step 18679: {'lr': 0.00048449636342305343, 'samples': 3586368, 'steps': 18678, 'loss/train': 1.582143783569336} 11/06/2021 23:52:53 - INFO - __main__ - Step 18680: {'lr': 0.00048449452365667003, 'samples': 3586560, 'steps': 18679, 'loss/train': 1.3421880006790161} 11/06/2021 23:52:54 - INFO - __main__ - Step 18681: {'lr': 0.00048449268378462695, 'samples': 3586752, 'steps': 18680, 'loss/train': 1.1730694770812988} 11/06/2021 23:52:55 - INFO - __main__ - Step 18682: {'lr': 0.00048449084380692523, 'samples': 3586944, 'steps': 18681, 'loss/train': 1.537729024887085} 11/06/2021 23:52:55 - INFO - __main__ - Step 18683: {'lr': 0.0004844890037235654, 'samples': 3587136, 'steps': 18682, 'loss/train': 1.7355479001998901} 11/06/2021 23:52:55 - INFO - __main__ - Step 18684: {'lr': 0.00048448716353454856, 'samples': 3587328, 'steps': 18683, 'loss/train': 1.7572896480560303} 11/06/2021 23:52:56 - INFO - __main__ - Step 18685: {'lr': 0.0004844853232398754, 'samples': 3587520, 'steps': 18684, 'loss/train': 1.7151175737380981} 11/06/2021 23:52:57 - INFO - __main__ - Step 18686: {'lr': 0.00048448348283954674, 'samples': 3587712, 'steps': 18685, 'loss/train': 1.8041529655456543} 11/06/2021 23:52:57 - INFO - __main__ - Step 18687: {'lr': 0.00048448164233356344, 'samples': 3587904, 'steps': 18686, 'loss/train': 1.7413592338562012} 11/06/2021 23:52:57 - INFO - __main__ - Step 18688: {'lr': 0.0004844798017219264, 'samples': 3588096, 'steps': 18687, 'loss/train': 1.6515896320343018} 11/06/2021 23:52:58 - INFO - __main__ - Step 18689: {'lr': 0.00048447796100463625, 'samples': 3588288, 'steps': 18688, 'loss/train': 1.8758485317230225} 11/06/2021 23:52:58 - INFO - __main__ - Step 18690: {'lr': 0.0004844761201816941, 'samples': 3588480, 'steps': 18689, 'loss/train': 1.7205249071121216} 11/06/2021 23:52:59 - INFO - __main__ - Step 18691: {'lr': 0.0004844742792531005, 'samples': 3588672, 'steps': 18690, 'loss/train': 1.1845340728759766} 11/06/2021 23:53:00 - INFO - __main__ - Step 18692: {'lr': 0.00048447243821885644, 'samples': 3588864, 'steps': 18691, 'loss/train': 1.7539175748825073} 11/06/2021 23:53:00 - INFO - __main__ - Step 18693: {'lr': 0.0004844705970789628, 'samples': 3589056, 'steps': 18692, 'loss/train': 1.7191652059555054} 11/06/2021 23:53:00 - INFO - __main__ - Step 18694: {'lr': 0.0004844687558334202, 'samples': 3589248, 'steps': 18693, 'loss/train': 1.35236394405365} 11/06/2021 23:53:01 - INFO - __main__ - Step 18695: {'lr': 0.0004844669144822297, 'samples': 3589440, 'steps': 18694, 'loss/train': 1.8748408555984497} 11/06/2021 23:53:02 - INFO - __main__ - Step 18696: {'lr': 0.000484465073025392, 'samples': 3589632, 'steps': 18695, 'loss/train': 1.7773936986923218} 11/06/2021 23:53:02 - INFO - __main__ - Step 18697: {'lr': 0.00048446323146290795, 'samples': 3589824, 'steps': 18696, 'loss/train': 1.5781259536743164} 11/06/2021 23:53:02 - INFO - __main__ - Step 18698: {'lr': 0.0004844613897947784, 'samples': 3590016, 'steps': 18697, 'loss/train': 1.8103524446487427} 11/06/2021 23:53:03 - INFO - __main__ - Step 18699: {'lr': 0.00048445954802100414, 'samples': 3590208, 'steps': 18698, 'loss/train': 1.5242774486541748} 11/06/2021 23:53:03 - INFO - __main__ - Step 18700: {'lr': 0.000484457706141586, 'samples': 3590400, 'steps': 18699, 'loss/train': 1.4086936712265015} 11/06/2021 23:53:03 - INFO - __main__ - Step 18701: {'lr': 0.0004844558641565249, 'samples': 3590592, 'steps': 18700, 'loss/train': 1.9146695137023926} 11/06/2021 23:53:04 - INFO - __main__ - Step 18702: {'lr': 0.00048445402206582155, 'samples': 3590784, 'steps': 18701, 'loss/train': 1.5071773529052734} 11/06/2021 23:53:05 - INFO - __main__ - Step 18703: {'lr': 0.0004844521798694768, 'samples': 3590976, 'steps': 18702, 'loss/train': 1.537892460823059} 11/06/2021 23:53:05 - INFO - __main__ - Step 18704: {'lr': 0.0004844503375674916, 'samples': 3591168, 'steps': 18703, 'loss/train': 1.0163267850875854} 11/06/2021 23:53:05 - INFO - __main__ - Step 18705: {'lr': 0.0004844484951598667, 'samples': 3591360, 'steps': 18704, 'loss/train': 1.5912625789642334} 11/06/2021 23:53:06 - INFO - __main__ - Step 18706: {'lr': 0.00048444665264660286, 'samples': 3591552, 'steps': 18705, 'loss/train': 1.7461742162704468} 11/06/2021 23:53:07 - INFO - __main__ - Step 18707: {'lr': 0.000484444810027701, 'samples': 3591744, 'steps': 18706, 'loss/train': 1.5173417329788208} 11/06/2021 23:53:07 - INFO - __main__ - Step 18708: {'lr': 0.00048444296730316196, 'samples': 3591936, 'steps': 18707, 'loss/train': 0.8280439972877502} 11/06/2021 23:53:08 - INFO - __main__ - Step 18709: {'lr': 0.0004844411244729865, 'samples': 3592128, 'steps': 18708, 'loss/train': 1.8678555488586426} 11/06/2021 23:53:08 - INFO - __main__ - Step 18710: {'lr': 0.00048443928153717555, 'samples': 3592320, 'steps': 18709, 'loss/train': 2.2123522758483887} 11/06/2021 23:53:08 - INFO - __main__ - Step 18711: {'lr': 0.00048443743849572974, 'samples': 3592512, 'steps': 18710, 'loss/train': 1.4127544164657593} 11/06/2021 23:53:09 - INFO - __main__ - Step 18712: {'lr': 0.00048443559534865017, 'samples': 3592704, 'steps': 18711, 'loss/train': 1.2353262901306152} 11/06/2021 23:53:10 - INFO - __main__ - Step 18713: {'lr': 0.0004844337520959375, 'samples': 3592896, 'steps': 18712, 'loss/train': 1.248792290687561} 11/06/2021 23:53:10 - INFO - __main__ - Step 18714: {'lr': 0.00048443190873759256, 'samples': 3593088, 'steps': 18713, 'loss/train': 0.6155982613563538} 11/06/2021 23:53:10 - INFO - __main__ - Step 18715: {'lr': 0.00048443006527361626, 'samples': 3593280, 'steps': 18714, 'loss/train': 1.6806596517562866} 11/06/2021 23:53:11 - INFO - __main__ - Step 18716: {'lr': 0.0004844282217040094, 'samples': 3593472, 'steps': 18715, 'loss/train': 1.9265437126159668} 11/06/2021 23:53:12 - INFO - __main__ - Step 18717: {'lr': 0.00048442637802877277, 'samples': 3593664, 'steps': 18716, 'loss/train': 1.060073971748352} 11/06/2021 23:53:12 - INFO - __main__ - Step 18718: {'lr': 0.0004844245342479072, 'samples': 3593856, 'steps': 18717, 'loss/train': 1.22144615650177} 11/06/2021 23:53:12 - INFO - __main__ - Step 18719: {'lr': 0.00048442269036141363, 'samples': 3594048, 'steps': 18718, 'loss/train': 2.08038592338562} 11/06/2021 23:53:13 - INFO - __main__ - Step 18720: {'lr': 0.0004844208463692928, 'samples': 3594240, 'steps': 18719, 'loss/train': 1.9108844995498657} 11/06/2021 23:53:13 - INFO - __main__ - Step 18721: {'lr': 0.00048441900227154557, 'samples': 3594432, 'steps': 18720, 'loss/train': 1.4159252643585205} 11/06/2021 23:53:14 - INFO - __main__ - Step 18722: {'lr': 0.00048441715806817265, 'samples': 3594624, 'steps': 18721, 'loss/train': 1.447057843208313} 11/06/2021 23:53:14 - INFO - __main__ - Step 18723: {'lr': 0.0004844153137591751, 'samples': 3594816, 'steps': 18722, 'loss/train': 1.7769197225570679} 11/06/2021 23:53:15 - INFO - __main__ - Step 18724: {'lr': 0.00048441346934455356, 'samples': 3595008, 'steps': 18723, 'loss/train': 0.7423035502433777} 11/06/2021 23:53:15 - INFO - __main__ - Step 18725: {'lr': 0.0004844116248243089, 'samples': 3595200, 'steps': 18724, 'loss/train': 2.38761568069458} 11/06/2021 23:53:15 - INFO - __main__ - Step 18726: {'lr': 0.0004844097801984421, 'samples': 3595392, 'steps': 18725, 'loss/train': 1.1686735153198242} 11/06/2021 23:53:16 - INFO - __main__ - Step 18727: {'lr': 0.0004844079354669537, 'samples': 3595584, 'steps': 18726, 'loss/train': 1.2005784511566162} 11/06/2021 23:53:17 - INFO - __main__ - Step 18728: {'lr': 0.0004844060906298448, 'samples': 3595776, 'steps': 18727, 'loss/train': 1.632523536682129} 11/06/2021 23:53:17 - INFO - __main__ - Step 18729: {'lr': 0.0004844042456871162, 'samples': 3595968, 'steps': 18728, 'loss/train': 1.1527280807495117} 11/06/2021 23:53:17 - INFO - __main__ - Step 18730: {'lr': 0.0004844024006387685, 'samples': 3596160, 'steps': 18729, 'loss/train': 1.233432412147522} 11/06/2021 23:53:18 - INFO - __main__ - Step 18731: {'lr': 0.00048440055548480275, 'samples': 3596352, 'steps': 18730, 'loss/train': 1.747187852859497} 11/06/2021 23:53:18 - INFO - __main__ - Step 18732: {'lr': 0.0004843987102252198, 'samples': 3596544, 'steps': 18731, 'loss/train': 1.3266741037368774} 11/06/2021 23:53:19 - INFO - __main__ - Step 18733: {'lr': 0.0004843968648600204, 'samples': 3596736, 'steps': 18732, 'loss/train': 1.6883686780929565} 11/06/2021 23:53:20 - INFO - __main__ - Step 18734: {'lr': 0.00048439501938920534, 'samples': 3596928, 'steps': 18733, 'loss/train': 1.7252039909362793} 11/06/2021 23:53:20 - INFO - __main__ - Step 18735: {'lr': 0.0004843931738127755, 'samples': 3597120, 'steps': 18734, 'loss/train': 1.1375492811203003} 11/06/2021 23:53:20 - INFO - __main__ - Step 18736: {'lr': 0.0004843913281307317, 'samples': 3597312, 'steps': 18735, 'loss/train': 1.8562067747116089} 11/06/2021 23:53:21 - INFO - __main__ - Step 18737: {'lr': 0.0004843894823430749, 'samples': 3597504, 'steps': 18736, 'loss/train': 1.7384228706359863} 11/06/2021 23:53:22 - INFO - __main__ - Step 18738: {'lr': 0.00048438763644980564, 'samples': 3597696, 'steps': 18737, 'loss/train': 1.3184276819229126} 11/06/2021 23:53:22 - INFO - __main__ - Step 18739: {'lr': 0.0004843857904509251, 'samples': 3597888, 'steps': 18738, 'loss/train': 1.289018154144287} 11/06/2021 23:53:23 - INFO - __main__ - Step 18740: {'lr': 0.00048438394434643386, 'samples': 3598080, 'steps': 18739, 'loss/train': 1.8547766208648682} 11/06/2021 23:53:23 - INFO - __main__ - Step 18741: {'lr': 0.0004843820981363328, 'samples': 3598272, 'steps': 18740, 'loss/train': 1.6665148735046387} 11/06/2021 23:53:23 - INFO - __main__ - Step 18742: {'lr': 0.00048438025182062286, 'samples': 3598464, 'steps': 18741, 'loss/train': 1.8995819091796875} 11/06/2021 23:53:24 - INFO - __main__ - Step 18743: {'lr': 0.00048437840539930466, 'samples': 3598656, 'steps': 18742, 'loss/train': 2.344691514968872} 11/06/2021 23:53:25 - INFO - __main__ - Step 18744: {'lr': 0.0004843765588723793, 'samples': 3598848, 'steps': 18743, 'loss/train': 1.6986535787582397} 11/06/2021 23:53:25 - INFO - __main__ - Step 18745: {'lr': 0.00048437471223984743, 'samples': 3599040, 'steps': 18744, 'loss/train': 1.527302861213684} 11/06/2021 23:53:25 - INFO - __main__ - Step 18746: {'lr': 0.00048437286550170996, 'samples': 3599232, 'steps': 18745, 'loss/train': 1.0772827863693237} 11/06/2021 23:53:26 - INFO - __main__ - Step 18747: {'lr': 0.00048437101865796763, 'samples': 3599424, 'steps': 18746, 'loss/train': 1.4782168865203857} 11/06/2021 23:53:26 - INFO - __main__ - Step 18748: {'lr': 0.0004843691717086214, 'samples': 3599616, 'steps': 18747, 'loss/train': 1.4430029392242432} 11/06/2021 23:53:27 - INFO - __main__ - Step 18749: {'lr': 0.000484367324653672, 'samples': 3599808, 'steps': 18748, 'loss/train': 1.6161614656448364} 11/06/2021 23:53:27 - INFO - __main__ - Step 18750: {'lr': 0.0004843654774931203, 'samples': 3600000, 'steps': 18749, 'loss/train': 1.6915335655212402} 11/06/2021 23:53:28 - INFO - __main__ - Step 18751: {'lr': 0.00048436363022696715, 'samples': 3600192, 'steps': 18750, 'loss/train': 1.6915794610977173} 11/06/2021 23:53:28 - INFO - __main__ - Step 18752: {'lr': 0.0004843617828552134, 'samples': 3600384, 'steps': 18751, 'loss/train': 1.9114357233047485} 11/06/2021 23:53:29 - INFO - __main__ - Step 18753: {'lr': 0.00048435993537785976, 'samples': 3600576, 'steps': 18752, 'loss/train': 0.9509969353675842} 11/06/2021 23:53:30 - INFO - __main__ - Step 18754: {'lr': 0.0004843580877949072, 'samples': 3600768, 'steps': 18753, 'loss/train': 1.7542425394058228} 11/06/2021 23:53:30 - INFO - __main__ - Step 18755: {'lr': 0.0004843562401063565, 'samples': 3600960, 'steps': 18754, 'loss/train': 0.9853984713554382} 11/06/2021 23:53:30 - INFO - __main__ - Step 18756: {'lr': 0.0004843543923122085, 'samples': 3601152, 'steps': 18755, 'loss/train': 1.426131010055542} 11/06/2021 23:53:31 - INFO - __main__ - Step 18757: {'lr': 0.000484352544412464, 'samples': 3601344, 'steps': 18756, 'loss/train': 1.6753501892089844} 11/06/2021 23:53:31 - INFO - __main__ - Step 18758: {'lr': 0.0004843506964071239, 'samples': 3601536, 'steps': 18757, 'loss/train': 1.2865328788757324} 11/06/2021 23:53:32 - INFO - __main__ - Step 18759: {'lr': 0.000484348848296189, 'samples': 3601728, 'steps': 18758, 'loss/train': 1.4032695293426514} 11/06/2021 23:53:32 - INFO - __main__ - Step 18760: {'lr': 0.00048434700007966006, 'samples': 3601920, 'steps': 18759, 'loss/train': 1.8957061767578125} 11/06/2021 23:53:33 - INFO - __main__ - Step 18761: {'lr': 0.000484345151757538, 'samples': 3602112, 'steps': 18760, 'loss/train': 1.5884819030761719} 11/06/2021 23:53:33 - INFO - __main__ - Step 18762: {'lr': 0.0004843433033298237, 'samples': 3602304, 'steps': 18761, 'loss/train': 0.3044757544994354} 11/06/2021 23:53:33 - INFO - __main__ - Step 18763: {'lr': 0.00048434145479651783, 'samples': 3602496, 'steps': 18762, 'loss/train': 2.1938703060150146} 11/06/2021 23:53:34 - INFO - __main__ - Step 18764: {'lr': 0.00048433960615762136, 'samples': 3602688, 'steps': 18763, 'loss/train': 2.144761562347412} 11/06/2021 23:53:35 - INFO - __main__ - Step 18765: {'lr': 0.0004843377574131351, 'samples': 3602880, 'steps': 18764, 'loss/train': 1.4775487184524536} 11/06/2021 23:53:35 - INFO - __main__ - Step 18766: {'lr': 0.0004843359085630598, 'samples': 3603072, 'steps': 18765, 'loss/train': 1.5913273096084595} 11/06/2021 23:53:35 - INFO - __main__ - Step 18767: {'lr': 0.0004843340596073964, 'samples': 3603264, 'steps': 18766, 'loss/train': 1.588987946510315} 11/06/2021 23:53:36 - INFO - __main__ - Step 18768: {'lr': 0.0004843322105461457, 'samples': 3603456, 'steps': 18767, 'loss/train': 1.364917278289795} 11/06/2021 23:53:37 - INFO - __main__ - Step 18769: {'lr': 0.0004843303613793085, 'samples': 3603648, 'steps': 18768, 'loss/train': 1.2008689641952515} 11/06/2021 23:53:37 - INFO - __main__ - Step 18770: {'lr': 0.00048432851210688567, 'samples': 3603840, 'steps': 18769, 'loss/train': 1.6347686052322388} 11/06/2021 23:53:37 - INFO - __main__ - Step 18771: {'lr': 0.00048432666272887805, 'samples': 3604032, 'steps': 18770, 'loss/train': 1.4727251529693604} 11/06/2021 23:53:38 - INFO - __main__ - Step 18772: {'lr': 0.0004843248132452864, 'samples': 3604224, 'steps': 18771, 'loss/train': 0.3945901691913605} 11/06/2021 23:53:38 - INFO - __main__ - Step 18773: {'lr': 0.0004843229636561116, 'samples': 3604416, 'steps': 18772, 'loss/train': 0.9401273131370544} 11/06/2021 23:53:39 - INFO - __main__ - Step 18774: {'lr': 0.00048432111396135447, 'samples': 3604608, 'steps': 18773, 'loss/train': 1.7124041318893433} 11/06/2021 23:53:40 - INFO - __main__ - Step 18775: {'lr': 0.0004843192641610159, 'samples': 3604800, 'steps': 18774, 'loss/train': 1.9197239875793457} 11/06/2021 23:53:40 - INFO - __main__ - Step 18776: {'lr': 0.00048431741425509676, 'samples': 3604992, 'steps': 18775, 'loss/train': 1.211233377456665} 11/06/2021 23:53:40 - INFO - __main__ - Step 18777: {'lr': 0.0004843155642435977, 'samples': 3605184, 'steps': 18776, 'loss/train': 1.7263365983963013} 11/06/2021 23:53:41 - INFO - __main__ - Step 18778: {'lr': 0.0004843137141265197, 'samples': 3605376, 'steps': 18777, 'loss/train': 1.467863917350769} 11/06/2021 23:53:41 - INFO - __main__ - Step 18779: {'lr': 0.00048431186390386356, 'samples': 3605568, 'steps': 18778, 'loss/train': 1.4806694984436035} 11/06/2021 23:53:42 - INFO - __main__ - Step 18780: {'lr': 0.0004843100135756301, 'samples': 3605760, 'steps': 18779, 'loss/train': 1.1883167028427124} 11/06/2021 23:53:42 - INFO - __main__ - Step 18781: {'lr': 0.0004843081631418202, 'samples': 3605952, 'steps': 18780, 'loss/train': 1.6708265542984009} 11/06/2021 23:53:43 - INFO - __main__ - Step 18782: {'lr': 0.00048430631260243465, 'samples': 3606144, 'steps': 18781, 'loss/train': 1.7834099531173706} 11/06/2021 23:53:43 - INFO - __main__ - Step 18783: {'lr': 0.00048430446195747424, 'samples': 3606336, 'steps': 18782, 'loss/train': 1.7206259965896606} 11/06/2021 23:53:43 - INFO - __main__ - Step 18784: {'lr': 0.00048430261120693986, 'samples': 3606528, 'steps': 18783, 'loss/train': 0.8681101202964783} 11/06/2021 23:53:44 - INFO - __main__ - Step 18785: {'lr': 0.0004843007603508324, 'samples': 3606720, 'steps': 18784, 'loss/train': 1.4189516305923462} 11/06/2021 23:53:45 - INFO - __main__ - Step 18786: {'lr': 0.00048429890938915255, 'samples': 3606912, 'steps': 18785, 'loss/train': 1.232398271560669} 11/06/2021 23:53:45 - INFO - __main__ - Step 18787: {'lr': 0.0004842970583219013, 'samples': 3607104, 'steps': 18786, 'loss/train': 1.7327327728271484} 11/06/2021 23:53:45 - INFO - __main__ - Step 18788: {'lr': 0.0004842952071490794, 'samples': 3607296, 'steps': 18787, 'loss/train': 1.8691179752349854} 11/06/2021 23:53:46 - INFO - __main__ - Step 18789: {'lr': 0.0004842933558706877, 'samples': 3607488, 'steps': 18788, 'loss/train': 1.6241750717163086} 11/06/2021 23:53:47 - INFO - __main__ - Step 18790: {'lr': 0.000484291504486727, 'samples': 3607680, 'steps': 18789, 'loss/train': 2.0066754817962646} 11/06/2021 23:53:47 - INFO - __main__ - Step 18791: {'lr': 0.0004842896529971982, 'samples': 3607872, 'steps': 18790, 'loss/train': 1.6360875368118286} 11/06/2021 23:53:48 - INFO - __main__ - Step 18792: {'lr': 0.00048428780140210204, 'samples': 3608064, 'steps': 18791, 'loss/train': 1.7113112211227417} 11/06/2021 23:53:48 - INFO - __main__ - Step 18793: {'lr': 0.0004842859497014394, 'samples': 3608256, 'steps': 18792, 'loss/train': 0.3076252043247223} 11/06/2021 23:53:48 - INFO - __main__ - Step 18794: {'lr': 0.0004842840978952112, 'samples': 3608448, 'steps': 18793, 'loss/train': 0.9483229517936707} 11/06/2021 23:53:49 - INFO - __main__ - Step 18795: {'lr': 0.00048428224598341815, 'samples': 3608640, 'steps': 18794, 'loss/train': 1.8937662839889526} 11/06/2021 23:53:50 - INFO - __main__ - Step 18796: {'lr': 0.0004842803939660612, 'samples': 3608832, 'steps': 18795, 'loss/train': 1.622977614402771} 11/06/2021 23:53:50 - INFO - __main__ - Step 18797: {'lr': 0.00048427854184314103, 'samples': 3609024, 'steps': 18796, 'loss/train': 0.8508211970329285} 11/06/2021 23:53:50 - INFO - __main__ - Step 18798: {'lr': 0.0004842766896146586, 'samples': 3609216, 'steps': 18797, 'loss/train': 1.125472068786621} 11/06/2021 23:53:51 - INFO - __main__ - Step 18799: {'lr': 0.0004842748372806147, 'samples': 3609408, 'steps': 18798, 'loss/train': 1.7149622440338135} 11/06/2021 23:53:51 - INFO - __main__ - Step 18800: {'lr': 0.00048427298484101023, 'samples': 3609600, 'steps': 18799, 'loss/train': 0.749356746673584} 11/06/2021 23:53:52 - INFO - __main__ - Step 18801: {'lr': 0.0004842711322958459, 'samples': 3609792, 'steps': 18800, 'loss/train': 2.0503411293029785} 11/06/2021 23:53:53 - INFO - __main__ - Step 18802: {'lr': 0.0004842692796451226, 'samples': 3609984, 'steps': 18801, 'loss/train': 1.4108389616012573} 11/06/2021 23:53:53 - INFO - __main__ - Step 18803: {'lr': 0.0004842674268888413, 'samples': 3610176, 'steps': 18802, 'loss/train': 1.3129260540008545} 11/06/2021 23:53:53 - INFO - __main__ - Step 18804: {'lr': 0.0004842655740270026, 'samples': 3610368, 'steps': 18803, 'loss/train': 1.4389371871948242} 11/06/2021 23:53:54 - INFO - __main__ - Step 18805: {'lr': 0.0004842637210596075, 'samples': 3610560, 'steps': 18804, 'loss/train': 1.9069589376449585} 11/06/2021 23:53:55 - INFO - __main__ - Step 18806: {'lr': 0.0004842618679866567, 'samples': 3610752, 'steps': 18805, 'loss/train': 1.330498218536377} 11/06/2021 23:53:55 - INFO - __main__ - Step 18807: {'lr': 0.0004842600148081512, 'samples': 3610944, 'steps': 18806, 'loss/train': 1.689779281616211} 11/06/2021 23:53:55 - INFO - __main__ - Step 18808: {'lr': 0.00048425816152409173, 'samples': 3611136, 'steps': 18807, 'loss/train': 1.5551064014434814} 11/06/2021 23:53:56 - INFO - __main__ - Step 18809: {'lr': 0.00048425630813447916, 'samples': 3611328, 'steps': 18808, 'loss/train': 1.4787465333938599} 11/06/2021 23:53:56 - INFO - __main__ - Step 18810: {'lr': 0.0004842544546393143, 'samples': 3611520, 'steps': 18809, 'loss/train': 1.7897770404815674} 11/06/2021 23:53:57 - INFO - __main__ - Step 18811: {'lr': 0.00048425260103859797, 'samples': 3611712, 'steps': 18810, 'loss/train': 1.685746192932129} 11/06/2021 23:53:57 - INFO - __main__ - Step 18812: {'lr': 0.0004842507473323311, 'samples': 3611904, 'steps': 18811, 'loss/train': 1.5817826986312866} 11/06/2021 23:53:58 - INFO - __main__ - Step 18813: {'lr': 0.00048424889352051436, 'samples': 3612096, 'steps': 18812, 'loss/train': 1.8189197778701782} 11/06/2021 23:53:58 - INFO - __main__ - Step 18814: {'lr': 0.00048424703960314876, 'samples': 3612288, 'steps': 18813, 'loss/train': 1.7752747535705566} 11/06/2021 23:53:59 - INFO - __main__ - Step 18815: {'lr': 0.00048424518558023505, 'samples': 3612480, 'steps': 18814, 'loss/train': 1.612984538078308} 11/06/2021 23:54:00 - INFO - __main__ - Step 18816: {'lr': 0.00048424333145177405, 'samples': 3612672, 'steps': 18815, 'loss/train': 1.6740766763687134} 11/06/2021 23:54:00 - INFO - __main__ - Step 18817: {'lr': 0.00048424147721776666, 'samples': 3612864, 'steps': 18816, 'loss/train': 2.1559345722198486} 11/06/2021 23:54:00 - INFO - __main__ - Step 18818: {'lr': 0.00048423962287821366, 'samples': 3613056, 'steps': 18817, 'loss/train': 0.7726074457168579} 11/06/2021 23:54:01 - INFO - __main__ - Step 18819: {'lr': 0.00048423776843311585, 'samples': 3613248, 'steps': 18818, 'loss/train': 1.7637563943862915} 11/06/2021 23:54:01 - INFO - __main__ - Step 18820: {'lr': 0.00048423591388247416, 'samples': 3613440, 'steps': 18819, 'loss/train': 1.7249056100845337} 11/06/2021 23:54:01 - INFO - __main__ - Step 18821: {'lr': 0.0004842340592262894, 'samples': 3613632, 'steps': 18820, 'loss/train': 1.6943449974060059} 11/06/2021 23:54:03 - INFO - __main__ - Step 18822: {'lr': 0.00048423220446456233, 'samples': 3613824, 'steps': 18821, 'loss/train': 1.352774739265442} 11/06/2021 23:54:03 - INFO - __main__ - Step 18823: {'lr': 0.0004842303495972939, 'samples': 3614016, 'steps': 18822, 'loss/train': 2.108214855194092} 11/06/2021 23:54:04 - INFO - __main__ - Step 18824: {'lr': 0.00048422849462448483, 'samples': 3614208, 'steps': 18823, 'loss/train': 1.8044021129608154} 11/06/2021 23:54:04 - INFO - __main__ - Step 18825: {'lr': 0.0004842266395461361, 'samples': 3614400, 'steps': 18824, 'loss/train': 6.426365375518799} 11/06/2021 23:54:04 - INFO - __main__ - Step 18826: {'lr': 0.0004842247843622484, 'samples': 3614592, 'steps': 18825, 'loss/train': 5.2774271965026855} 11/06/2021 23:54:05 - INFO - __main__ - Step 18827: {'lr': 0.0004842229290728226, 'samples': 3614784, 'steps': 18826, 'loss/train': 1.6318795680999756} 11/06/2021 23:54:05 - INFO - __main__ - Step 18828: {'lr': 0.0004842210736778596, 'samples': 3614976, 'steps': 18827, 'loss/train': 1.7528266906738281} 11/06/2021 23:54:06 - INFO - __main__ - Step 18829: {'lr': 0.0004842192181773602, 'samples': 3615168, 'steps': 18828, 'loss/train': 1.5569349527359009} 11/06/2021 23:54:07 - INFO - __main__ - Step 18830: {'lr': 0.0004842173625713252, 'samples': 3615360, 'steps': 18829, 'loss/train': 1.8088799715042114} 11/06/2021 23:54:07 - INFO - __main__ - Step 18831: {'lr': 0.0004842155068597556, 'samples': 3615552, 'steps': 18830, 'loss/train': 1.3919082880020142} 11/06/2021 23:54:07 - INFO - __main__ - Step 18832: {'lr': 0.0004842136510426519, 'samples': 3615744, 'steps': 18831, 'loss/train': 1.83372962474823} 11/06/2021 23:54:08 - INFO - __main__ - Step 18833: {'lr': 0.00048421179512001536, 'samples': 3615936, 'steps': 18832, 'loss/train': 1.6518522500991821} 11/06/2021 23:54:08 - INFO - __main__ - Step 18834: {'lr': 0.0004842099390918464, 'samples': 3616128, 'steps': 18833, 'loss/train': 1.6888887882232666} 11/06/2021 23:54:09 - INFO - __main__ - Step 18835: {'lr': 0.00048420808295814624, 'samples': 3616320, 'steps': 18834, 'loss/train': 1.783157467842102} 11/06/2021 23:54:09 - INFO - __main__ - Step 18836: {'lr': 0.00048420622671891533, 'samples': 3616512, 'steps': 18835, 'loss/train': 1.1677378416061401} 11/06/2021 23:54:10 - INFO - __main__ - Step 18837: {'lr': 0.00048420437037415486, 'samples': 3616704, 'steps': 18836, 'loss/train': 1.6292400360107422} 11/06/2021 23:54:10 - INFO - __main__ - Step 18838: {'lr': 0.00048420251392386547, 'samples': 3616896, 'steps': 18837, 'loss/train': 1.4948904514312744} 11/06/2021 23:54:11 - INFO - __main__ - Step 18839: {'lr': 0.0004842006573680481, 'samples': 3617088, 'steps': 18838, 'loss/train': 2.066965103149414} 11/06/2021 23:54:11 - INFO - __main__ - Step 18840: {'lr': 0.0004841988007067034, 'samples': 3617280, 'steps': 18839, 'loss/train': 1.4668853282928467} 11/06/2021 23:54:12 - INFO - __main__ - Step 18841: {'lr': 0.00048419694393983244, 'samples': 3617472, 'steps': 18840, 'loss/train': 1.542981505393982} 11/06/2021 23:54:12 - INFO - __main__ - Step 18842: {'lr': 0.00048419508706743587, 'samples': 3617664, 'steps': 18841, 'loss/train': 1.729238748550415} 11/06/2021 23:54:12 - INFO - __main__ - Step 18843: {'lr': 0.00048419323008951467, 'samples': 3617856, 'steps': 18842, 'loss/train': 1.618919014930725} 11/06/2021 23:54:13 - INFO - __main__ - Step 18844: {'lr': 0.00048419137300606963, 'samples': 3618048, 'steps': 18843, 'loss/train': 1.8637679815292358} 11/06/2021 23:54:14 - INFO - __main__ - Step 18845: {'lr': 0.00048418951581710154, 'samples': 3618240, 'steps': 18844, 'loss/train': 1.0846983194351196} 11/06/2021 23:54:14 - INFO - __main__ - Step 18846: {'lr': 0.00048418765852261124, 'samples': 3618432, 'steps': 18845, 'loss/train': 1.3119678497314453} 11/06/2021 23:54:14 - INFO - __main__ - Step 18847: {'lr': 0.0004841858011225996, 'samples': 3618624, 'steps': 18846, 'loss/train': 2.0200302600860596} 11/06/2021 23:54:15 - INFO - __main__ - Step 18848: {'lr': 0.0004841839436170675, 'samples': 3618816, 'steps': 18847, 'loss/train': 1.3020848035812378} 11/06/2021 23:54:16 - INFO - __main__ - Step 18849: {'lr': 0.0004841820860060157, 'samples': 3619008, 'steps': 18848, 'loss/train': 1.4073768854141235} 11/06/2021 23:54:16 - INFO - __main__ - Step 18850: {'lr': 0.0004841802282894451, 'samples': 3619200, 'steps': 18849, 'loss/train': 1.74467134475708} 11/06/2021 23:54:16 - INFO - __main__ - Step 18851: {'lr': 0.0004841783704673565, 'samples': 3619392, 'steps': 18850, 'loss/train': 1.4385926723480225} 11/06/2021 23:54:17 - INFO - __main__ - Step 18852: {'lr': 0.00048417651253975067, 'samples': 3619584, 'steps': 18851, 'loss/train': 1.2946964502334595} 11/06/2021 23:54:17 - INFO - __main__ - Step 18853: {'lr': 0.00048417465450662856, 'samples': 3619776, 'steps': 18852, 'loss/train': 1.5022872686386108} 11/06/2021 23:54:18 - INFO - __main__ - Step 18854: {'lr': 0.0004841727963679909, 'samples': 3619968, 'steps': 18853, 'loss/train': 2.3088507652282715} 11/06/2021 23:54:19 - INFO - __main__ - Step 18855: {'lr': 0.0004841709381238387, 'samples': 3620160, 'steps': 18854, 'loss/train': 1.64644455909729} 11/06/2021 23:54:19 - INFO - __main__ - Step 18856: {'lr': 0.0004841690797741726, 'samples': 3620352, 'steps': 18855, 'loss/train': 1.808630347251892} 11/06/2021 23:54:19 - INFO - __main__ - Step 18857: {'lr': 0.0004841672213189936, 'samples': 3620544, 'steps': 18856, 'loss/train': 1.4299774169921875} 11/06/2021 23:54:20 - INFO - __main__ - Step 18858: {'lr': 0.00048416536275830245, 'samples': 3620736, 'steps': 18857, 'loss/train': 1.275566577911377} 11/06/2021 23:54:20 - INFO - __main__ - Step 18859: {'lr': 0.00048416350409209995, 'samples': 3620928, 'steps': 18858, 'loss/train': 1.8275693655014038} 11/06/2021 23:54:21 - INFO - __main__ - Step 18860: {'lr': 0.000484161645320387, 'samples': 3621120, 'steps': 18859, 'loss/train': 0.7486388087272644} 11/06/2021 23:54:22 - INFO - __main__ - Step 18861: {'lr': 0.0004841597864431645, 'samples': 3621312, 'steps': 18860, 'loss/train': 1.5714701414108276} 11/06/2021 23:54:22 - INFO - __main__ - Step 18862: {'lr': 0.00048415792746043314, 'samples': 3621504, 'steps': 18861, 'loss/train': 2.1515393257141113} 11/06/2021 23:54:22 - INFO - __main__ - Step 18863: {'lr': 0.00048415606837219383, 'samples': 3621696, 'steps': 18862, 'loss/train': 0.8803495168685913} 11/06/2021 23:54:23 - INFO - __main__ - Step 18864: {'lr': 0.00048415420917844744, 'samples': 3621888, 'steps': 18863, 'loss/train': 1.059100866317749} 11/06/2021 23:54:23 - INFO - __main__ - Step 18865: {'lr': 0.00048415234987919474, 'samples': 3622080, 'steps': 18864, 'loss/train': 1.3690611124038696} 11/06/2021 23:54:24 - INFO - __main__ - Step 18866: {'lr': 0.0004841504904744367, 'samples': 3622272, 'steps': 18865, 'loss/train': 1.7441192865371704} 11/06/2021 23:54:24 - INFO - __main__ - Step 18867: {'lr': 0.0004841486309641739, 'samples': 3622464, 'steps': 18866, 'loss/train': 1.8263840675354004} 11/06/2021 23:54:25 - INFO - __main__ - Step 18868: {'lr': 0.00048414677134840753, 'samples': 3622656, 'steps': 18867, 'loss/train': 1.3380950689315796} 11/06/2021 23:54:25 - INFO - __main__ - Step 18869: {'lr': 0.00048414491162713814, 'samples': 3622848, 'steps': 18868, 'loss/train': 1.482108473777771} 11/06/2021 23:54:25 - INFO - __main__ - Step 18870: {'lr': 0.00048414305180036665, 'samples': 3623040, 'steps': 18869, 'loss/train': 1.6186736822128296} 11/06/2021 23:54:26 - INFO - __main__ - Step 18871: {'lr': 0.0004841411918680939, 'samples': 3623232, 'steps': 18870, 'loss/train': 1.5286184549331665} 11/06/2021 23:54:27 - INFO - __main__ - Step 18872: {'lr': 0.0004841393318303208, 'samples': 3623424, 'steps': 18871, 'loss/train': 1.8360174894332886} 11/06/2021 23:54:27 - INFO - __main__ - Step 18873: {'lr': 0.0004841374716870481, 'samples': 3623616, 'steps': 18872, 'loss/train': 1.7237398624420166} 11/06/2021 23:54:27 - INFO - __main__ - Step 18874: {'lr': 0.00048413561143827665, 'samples': 3623808, 'steps': 18873, 'loss/train': 1.8056116104125977} 11/06/2021 23:54:28 - INFO - __main__ - Step 18875: {'lr': 0.00048413375108400736, 'samples': 3624000, 'steps': 18874, 'loss/train': 1.8317943811416626} 11/06/2021 23:54:29 - INFO - __main__ - Step 18876: {'lr': 0.000484131890624241, 'samples': 3624192, 'steps': 18875, 'loss/train': 1.52738356590271} 11/06/2021 23:54:29 - INFO - __main__ - Step 18877: {'lr': 0.00048413003005897835, 'samples': 3624384, 'steps': 18876, 'loss/train': 1.5932716131210327} 11/06/2021 23:54:29 - INFO - __main__ - Step 18878: {'lr': 0.0004841281693882204, 'samples': 3624576, 'steps': 18877, 'loss/train': 1.2580933570861816} 11/06/2021 23:54:30 - INFO - __main__ - Step 18879: {'lr': 0.0004841263086119679, 'samples': 3624768, 'steps': 18878, 'loss/train': 1.666326642036438} 11/06/2021 23:54:30 - INFO - __main__ - Step 18880: {'lr': 0.00048412444773022166, 'samples': 3624960, 'steps': 18879, 'loss/train': 1.489951491355896} 11/06/2021 23:54:31 - INFO - __main__ - Step 18881: {'lr': 0.0004841225867429826, 'samples': 3625152, 'steps': 18880, 'loss/train': 1.7064945697784424} 11/06/2021 23:54:32 - INFO - __main__ - Step 18882: {'lr': 0.0004841207256502515, 'samples': 3625344, 'steps': 18881, 'loss/train': 1.4198405742645264} 11/06/2021 23:54:32 - INFO - __main__ - Step 18883: {'lr': 0.0004841188644520292, 'samples': 3625536, 'steps': 18882, 'loss/train': 2.1660537719726562} 11/06/2021 23:54:32 - INFO - __main__ - Step 18884: {'lr': 0.0004841170031483165, 'samples': 3625728, 'steps': 18883, 'loss/train': 1.5875812768936157} 11/06/2021 23:54:33 - INFO - __main__ - Step 18885: {'lr': 0.0004841151417391144, 'samples': 3625920, 'steps': 18884, 'loss/train': 1.3188947439193726} 11/06/2021 23:54:33 - INFO - __main__ - Step 18886: {'lr': 0.00048411328022442357, 'samples': 3626112, 'steps': 18885, 'loss/train': 1.1708683967590332} 11/06/2021 23:54:34 - INFO - __main__ - Step 18887: {'lr': 0.000484111418604245, 'samples': 3626304, 'steps': 18886, 'loss/train': 0.6864486336708069} 11/06/2021 23:54:34 - INFO - __main__ - Step 18888: {'lr': 0.00048410955687857926, 'samples': 3626496, 'steps': 18887, 'loss/train': 1.1706008911132812} 11/06/2021 23:54:35 - INFO - __main__ - Step 18889: {'lr': 0.0004841076950474275, 'samples': 3626688, 'steps': 18888, 'loss/train': 1.363743543624878} 11/06/2021 23:54:35 - INFO - __main__ - Step 18890: {'lr': 0.0004841058331107904, 'samples': 3626880, 'steps': 18889, 'loss/train': 1.5939115285873413} 11/06/2021 23:54:35 - INFO - __main__ - Step 18891: {'lr': 0.00048410397106866883, 'samples': 3627072, 'steps': 18890, 'loss/train': 1.363234281539917} 11/06/2021 23:54:36 - INFO - __main__ - Step 18892: {'lr': 0.0004841021089210636, 'samples': 3627264, 'steps': 18891, 'loss/train': 1.4709104299545288} 11/06/2021 23:54:37 - INFO - __main__ - Step 18893: {'lr': 0.0004841002466679756, 'samples': 3627456, 'steps': 18892, 'loss/train': 1.4250143766403198} 11/06/2021 23:54:37 - INFO - __main__ - Step 18894: {'lr': 0.00048409838430940556, 'samples': 3627648, 'steps': 18893, 'loss/train': 1.4562376737594604} 11/06/2021 23:54:37 - INFO - __main__ - Step 18895: {'lr': 0.00048409652184535447, 'samples': 3627840, 'steps': 18894, 'loss/train': 1.5139254331588745} 11/06/2021 23:54:38 - INFO - __main__ - Step 18896: {'lr': 0.0004840946592758231, 'samples': 3628032, 'steps': 18895, 'loss/train': 1.6306174993515015} 11/06/2021 23:54:39 - INFO - __main__ - Step 18897: {'lr': 0.00048409279660081226, 'samples': 3628224, 'steps': 18896, 'loss/train': 1.2062115669250488} 11/06/2021 23:54:39 - INFO - __main__ - Step 18898: {'lr': 0.0004840909338203229, 'samples': 3628416, 'steps': 18897, 'loss/train': 1.950105905532837} 11/06/2021 23:54:40 - INFO - __main__ - Step 18899: {'lr': 0.0004840890709343557, 'samples': 3628608, 'steps': 18898, 'loss/train': 1.9823861122131348} 11/06/2021 23:54:40 - INFO - __main__ - Step 18900: {'lr': 0.0004840872079429116, 'samples': 3628800, 'steps': 18899, 'loss/train': 1.6326419115066528} 11/06/2021 23:54:40 - INFO - __main__ - Step 18901: {'lr': 0.00048408534484599143, 'samples': 3628992, 'steps': 18900, 'loss/train': 1.2868921756744385} 11/06/2021 23:54:41 - INFO - __main__ - Step 18902: {'lr': 0.00048408348164359594, 'samples': 3629184, 'steps': 18901, 'loss/train': 1.845752477645874} 11/06/2021 23:54:42 - INFO - __main__ - Step 18903: {'lr': 0.00048408161833572613, 'samples': 3629376, 'steps': 18902, 'loss/train': 2.1983730792999268} 11/06/2021 23:54:42 - INFO - __main__ - Step 18904: {'lr': 0.0004840797549223827, 'samples': 3629568, 'steps': 18903, 'loss/train': 1.6073050498962402} 11/06/2021 23:54:42 - INFO - __main__ - Step 18905: {'lr': 0.00048407789140356654, 'samples': 3629760, 'steps': 18904, 'loss/train': 1.3466079235076904} 11/06/2021 23:54:43 - INFO - __main__ - Step 18906: {'lr': 0.00048407602777927856, 'samples': 3629952, 'steps': 18905, 'loss/train': 1.2008774280548096} 11/06/2021 23:54:44 - INFO - __main__ - Step 18907: {'lr': 0.0004840741640495195, 'samples': 3630144, 'steps': 18906, 'loss/train': 2.7398083209991455} 11/06/2021 23:54:44 - INFO - __main__ - Step 18908: {'lr': 0.0004840723002142902, 'samples': 3630336, 'steps': 18907, 'loss/train': 1.9664089679718018} 11/06/2021 23:54:45 - INFO - __main__ - Step 18909: {'lr': 0.0004840704362735916, 'samples': 3630528, 'steps': 18908, 'loss/train': 1.7534114122390747} 11/06/2021 23:54:45 - INFO - __main__ - Step 18910: {'lr': 0.0004840685722274244, 'samples': 3630720, 'steps': 18909, 'loss/train': 1.252254605293274} 11/06/2021 23:54:45 - INFO - __main__ - Step 18911: {'lr': 0.0004840667080757896, 'samples': 3630912, 'steps': 18910, 'loss/train': 1.4434303045272827} 11/06/2021 23:54:46 - INFO - __main__ - Step 18912: {'lr': 0.00048406484381868786, 'samples': 3631104, 'steps': 18911, 'loss/train': 1.8143805265426636} 11/06/2021 23:54:47 - INFO - __main__ - Step 18913: {'lr': 0.0004840629794561202, 'samples': 3631296, 'steps': 18912, 'loss/train': 1.431362509727478} 11/06/2021 23:54:47 - INFO - __main__ - Step 18914: {'lr': 0.0004840611149880873, 'samples': 3631488, 'steps': 18913, 'loss/train': 0.25539538264274597} 11/06/2021 23:54:48 - INFO - __main__ - Step 18915: {'lr': 0.0004840592504145901, 'samples': 3631680, 'steps': 18914, 'loss/train': 1.6547143459320068} 11/06/2021 23:54:48 - INFO - __main__ - Step 18916: {'lr': 0.0004840573857356294, 'samples': 3631872, 'steps': 18915, 'loss/train': 1.5618704557418823} 11/06/2021 23:54:48 - INFO - __main__ - Step 18917: {'lr': 0.0004840555209512061, 'samples': 3632064, 'steps': 18916, 'loss/train': 1.6179587841033936} 11/06/2021 23:54:49 - INFO - __main__ - Step 18918: {'lr': 0.00048405365606132096, 'samples': 3632256, 'steps': 18917, 'loss/train': 1.8452666997909546} 11/06/2021 23:54:50 - INFO - __main__ - Step 18919: {'lr': 0.00048405179106597487, 'samples': 3632448, 'steps': 18918, 'loss/train': 1.6771806478500366} 11/06/2021 23:54:50 - INFO - __main__ - Step 18920: {'lr': 0.0004840499259651686, 'samples': 3632640, 'steps': 18919, 'loss/train': 2.0507524013519287} 11/06/2021 23:54:51 - INFO - __main__ - Step 18921: {'lr': 0.0004840480607589031, 'samples': 3632832, 'steps': 18920, 'loss/train': 2.5290257930755615} 11/06/2021 23:54:51 - INFO - __main__ - Step 18922: {'lr': 0.0004840461954471792, 'samples': 3633024, 'steps': 18921, 'loss/train': 1.8817951679229736} 11/06/2021 23:54:52 - INFO - __main__ - Step 18923: {'lr': 0.00048404433002999757, 'samples': 3633216, 'steps': 18922, 'loss/train': 1.3463006019592285} 11/06/2021 23:54:52 - INFO - __main__ - Step 18924: {'lr': 0.0004840424645073593, 'samples': 3633408, 'steps': 18923, 'loss/train': 1.3242696523666382} 11/06/2021 23:54:53 - INFO - __main__ - Step 18925: {'lr': 0.000484040598879265, 'samples': 3633600, 'steps': 18924, 'loss/train': 1.3132871389389038} 11/06/2021 23:54:53 - INFO - __main__ - Step 18926: {'lr': 0.0004840387331457157, 'samples': 3633792, 'steps': 18925, 'loss/train': 1.4895330667495728} 11/06/2021 23:54:53 - INFO - __main__ - Step 18927: {'lr': 0.00048403686730671215, 'samples': 3633984, 'steps': 18926, 'loss/train': 1.6102023124694824} 11/06/2021 23:54:54 - INFO - __main__ - Step 18928: {'lr': 0.0004840350013622552, 'samples': 3634176, 'steps': 18927, 'loss/train': 1.062371015548706} 11/06/2021 23:54:55 - INFO - __main__ - Step 18929: {'lr': 0.0004840331353123456, 'samples': 3634368, 'steps': 18928, 'loss/train': 1.8671513795852661} 11/06/2021 23:54:55 - INFO - __main__ - Step 18930: {'lr': 0.00048403126915698435, 'samples': 3634560, 'steps': 18929, 'loss/train': 1.2960901260375977} 11/06/2021 23:54:55 - INFO - __main__ - Step 18931: {'lr': 0.00048402940289617223, 'samples': 3634752, 'steps': 18930, 'loss/train': 1.1204884052276611} 11/06/2021 23:54:56 - INFO - __main__ - Step 18932: {'lr': 0.00048402753652991007, 'samples': 3634944, 'steps': 18931, 'loss/train': 1.4577025175094604} 11/06/2021 23:54:57 - INFO - __main__ - Step 18933: {'lr': 0.0004840256700581988, 'samples': 3635136, 'steps': 18932, 'loss/train': 1.4760710000991821} 11/06/2021 23:54:57 - INFO - __main__ - Step 18934: {'lr': 0.000484023803481039, 'samples': 3635328, 'steps': 18933, 'loss/train': 1.0608885288238525} 11/06/2021 23:54:58 - INFO - __main__ - Step 18935: {'lr': 0.00048402193679843175, 'samples': 3635520, 'steps': 18934, 'loss/train': 1.0119448900222778} 11/06/2021 23:54:58 - INFO - __main__ - Step 18936: {'lr': 0.00048402007001037786, 'samples': 3635712, 'steps': 18935, 'loss/train': 1.3457058668136597} 11/06/2021 23:54:58 - INFO - __main__ - Step 18937: {'lr': 0.0004840182031168781, 'samples': 3635904, 'steps': 18936, 'loss/train': 1.7942875623703003} 11/06/2021 23:54:59 - INFO - __main__ - Step 18938: {'lr': 0.0004840163361179334, 'samples': 3636096, 'steps': 18937, 'loss/train': 1.9397997856140137} 11/06/2021 23:55:00 - INFO - __main__ - Step 18939: {'lr': 0.00048401446901354453, 'samples': 3636288, 'steps': 18938, 'loss/train': 1.4605997800827026} 11/06/2021 23:55:00 - INFO - __main__ - Step 18940: {'lr': 0.0004840126018037123, 'samples': 3636480, 'steps': 18939, 'loss/train': 2.040832996368408} 11/06/2021 23:55:00 - INFO - __main__ - Step 18941: {'lr': 0.0004840107344884377, 'samples': 3636672, 'steps': 18940, 'loss/train': 1.2393549680709839} 11/06/2021 23:55:01 - INFO - __main__ - Step 18942: {'lr': 0.0004840088670677214, 'samples': 3636864, 'steps': 18941, 'loss/train': 1.561964988708496} 11/06/2021 23:55:01 - INFO - __main__ - Step 18943: {'lr': 0.0004840069995415643, 'samples': 3637056, 'steps': 18942, 'loss/train': 1.3233683109283447} 11/06/2021 23:55:02 - INFO - __main__ - Step 18944: {'lr': 0.0004840051319099673, 'samples': 3637248, 'steps': 18943, 'loss/train': 1.9745053052902222} 11/06/2021 23:55:02 - INFO - __main__ - Step 18945: {'lr': 0.0004840032641729312, 'samples': 3637440, 'steps': 18944, 'loss/train': 1.7221941947937012} 11/06/2021 23:55:03 - INFO - __main__ - Step 18946: {'lr': 0.0004840013963304568, 'samples': 3637632, 'steps': 18945, 'loss/train': 1.6417832374572754} 11/06/2021 23:55:03 - INFO - __main__ - Step 18947: {'lr': 0.000483999528382545, 'samples': 3637824, 'steps': 18946, 'loss/train': 1.5244332551956177} 11/06/2021 23:55:03 - INFO - __main__ - Step 18948: {'lr': 0.00048399766032919666, 'samples': 3638016, 'steps': 18947, 'loss/train': 1.8366427421569824} 11/06/2021 23:55:05 - INFO - __main__ - Step 18949: {'lr': 0.0004839957921704126, 'samples': 3638208, 'steps': 18948, 'loss/train': 1.1807835102081299} 11/06/2021 23:55:05 - INFO - __main__ - Step 18950: {'lr': 0.0004839939239061936, 'samples': 3638400, 'steps': 18949, 'loss/train': 2.372195243835449} 11/06/2021 23:55:05 - INFO - __main__ - Step 18951: {'lr': 0.00048399205553654046, 'samples': 3638592, 'steps': 18950, 'loss/train': 1.5446722507476807} 11/06/2021 23:55:06 - INFO - __main__ - Step 18952: {'lr': 0.0004839901870614543, 'samples': 3638784, 'steps': 18951, 'loss/train': 1.4252386093139648} 11/06/2021 23:55:06 - INFO - __main__ - Step 18953: {'lr': 0.0004839883184809356, 'samples': 3638976, 'steps': 18952, 'loss/train': 1.5092815160751343} 11/06/2021 23:55:07 - INFO - __main__ - Step 18954: {'lr': 0.00048398644979498543, 'samples': 3639168, 'steps': 18953, 'loss/train': 1.7473726272583008} 11/06/2021 23:55:07 - INFO - __main__ - Step 18955: {'lr': 0.0004839845810036047, 'samples': 3639360, 'steps': 18954, 'loss/train': 1.6918965578079224} 11/06/2021 23:55:08 - INFO - __main__ - Step 18956: {'lr': 0.00048398271210679393, 'samples': 3639552, 'steps': 18955, 'loss/train': 1.3582264184951782} 11/06/2021 23:55:08 - INFO - __main__ - Step 18957: {'lr': 0.0004839808431045543, 'samples': 3639744, 'steps': 18956, 'loss/train': 1.5944629907608032} 11/06/2021 23:55:08 - INFO - __main__ - Step 18958: {'lr': 0.00048397897399688643, 'samples': 3639936, 'steps': 18957, 'loss/train': 1.8873305320739746} 11/06/2021 23:55:09 - INFO - __main__ - Step 18959: {'lr': 0.0004839771047837913, 'samples': 3640128, 'steps': 18958, 'loss/train': 1.7735779285430908} 11/06/2021 23:55:10 - INFO - __main__ - Step 18960: {'lr': 0.00048397523546526966, 'samples': 3640320, 'steps': 18959, 'loss/train': 1.5042515993118286} 11/06/2021 23:55:10 - INFO - __main__ - Step 18961: {'lr': 0.0004839733660413224, 'samples': 3640512, 'steps': 18960, 'loss/train': 1.004091739654541} 11/06/2021 23:55:10 - INFO - __main__ - Step 18962: {'lr': 0.0004839714965119504, 'samples': 3640704, 'steps': 18961, 'loss/train': 1.425974726676941} 11/06/2021 23:55:11 - INFO - __main__ - Step 18963: {'lr': 0.0004839696268771544, 'samples': 3640896, 'steps': 18962, 'loss/train': 0.9318681955337524} 11/06/2021 23:55:12 - INFO - __main__ - Step 18964: {'lr': 0.0004839677571369353, 'samples': 3641088, 'steps': 18963, 'loss/train': 1.7548048496246338} 11/06/2021 23:55:12 - INFO - __main__ - Step 18965: {'lr': 0.000483965887291294, 'samples': 3641280, 'steps': 18964, 'loss/train': 1.1884113550186157} 11/06/2021 23:55:13 - INFO - __main__ - Step 18966: {'lr': 0.0004839640173402312, 'samples': 3641472, 'steps': 18965, 'loss/train': 0.5352546572685242} 11/06/2021 23:55:13 - INFO - __main__ - Step 18967: {'lr': 0.00048396214728374786, 'samples': 3641664, 'steps': 18966, 'loss/train': 1.5210620164871216} 11/06/2021 23:55:13 - INFO - __main__ - Step 18968: {'lr': 0.00048396027712184475, 'samples': 3641856, 'steps': 18967, 'loss/train': 2.4723968505859375} 11/06/2021 23:55:14 - INFO - __main__ - Step 18969: {'lr': 0.0004839584068545228, 'samples': 3642048, 'steps': 18968, 'loss/train': 1.6791962385177612} 11/06/2021 23:55:15 - INFO - __main__ - Step 18970: {'lr': 0.0004839565364817828, 'samples': 3642240, 'steps': 18969, 'loss/train': 1.571922779083252} 11/06/2021 23:55:15 - INFO - __main__ - Step 18971: {'lr': 0.0004839546660036256, 'samples': 3642432, 'steps': 18970, 'loss/train': 1.4152837991714478} 11/06/2021 23:55:15 - INFO - __main__ - Step 18972: {'lr': 0.000483952795420052, 'samples': 3642624, 'steps': 18971, 'loss/train': 1.4770593643188477} 11/06/2021 23:55:16 - INFO - __main__ - Step 18973: {'lr': 0.0004839509247310629, 'samples': 3642816, 'steps': 18972, 'loss/train': 1.4914158582687378} 11/06/2021 23:55:16 - INFO - __main__ - Step 18974: {'lr': 0.00048394905393665913, 'samples': 3643008, 'steps': 18973, 'loss/train': 1.941078543663025} 11/06/2021 23:55:17 - INFO - __main__ - Step 18975: {'lr': 0.00048394718303684147, 'samples': 3643200, 'steps': 18974, 'loss/train': 1.4603451490402222} 11/06/2021 23:55:17 - INFO - __main__ - Step 18976: {'lr': 0.00048394531203161084, 'samples': 3643392, 'steps': 18975, 'loss/train': 1.721853256225586} 11/06/2021 23:55:18 - INFO - __main__ - Step 18977: {'lr': 0.00048394344092096816, 'samples': 3643584, 'steps': 18976, 'loss/train': 1.8595176935195923} 11/06/2021 23:55:18 - INFO - __main__ - Step 18978: {'lr': 0.0004839415697049141, 'samples': 3643776, 'steps': 18977, 'loss/train': 1.1810476779937744} 11/06/2021 23:55:19 - INFO - __main__ - Step 18979: {'lr': 0.00048393969838344956, 'samples': 3643968, 'steps': 18978, 'loss/train': 4.6848673820495605} 11/06/2021 23:55:20 - INFO - __main__ - Step 18980: {'lr': 0.0004839378269565754, 'samples': 3644160, 'steps': 18979, 'loss/train': 1.5944328308105469} 11/06/2021 23:55:20 - INFO - __main__ - Step 18981: {'lr': 0.00048393595542429253, 'samples': 3644352, 'steps': 18980, 'loss/train': 1.6038635969161987} 11/06/2021 23:55:21 - INFO - __main__ - Step 18982: {'lr': 0.0004839340837866016, 'samples': 3644544, 'steps': 18981, 'loss/train': 2.6181952953338623} 11/06/2021 23:55:21 - INFO - __main__ - Step 18983: {'lr': 0.00048393221204350376, 'samples': 3644736, 'steps': 18982, 'loss/train': 1.9288345575332642} 11/06/2021 23:55:21 - INFO - __main__ - Step 18984: {'lr': 0.0004839303401949996, 'samples': 3644928, 'steps': 18983, 'loss/train': 2.2249767780303955} 11/06/2021 23:55:22 - INFO - __main__ - Step 18985: {'lr': 0.00048392846824109, 'samples': 3645120, 'steps': 18984, 'loss/train': 1.4523152112960815} 11/06/2021 23:55:23 - INFO - __main__ - Step 18986: {'lr': 0.00048392659618177585, 'samples': 3645312, 'steps': 18985, 'loss/train': 1.7560415267944336} 11/06/2021 23:55:23 - INFO - __main__ - Step 18987: {'lr': 0.000483924724017058, 'samples': 3645504, 'steps': 18986, 'loss/train': 1.8481801748275757} 11/06/2021 23:55:23 - INFO - __main__ - Step 18988: {'lr': 0.00048392285174693727, 'samples': 3645696, 'steps': 18987, 'loss/train': 0.914047360420227} 11/06/2021 23:55:24 - INFO - __main__ - Step 18989: {'lr': 0.0004839209793714146, 'samples': 3645888, 'steps': 18988, 'loss/train': 1.7691324949264526} 11/06/2021 23:55:24 - INFO - __main__ - Step 18990: {'lr': 0.00048391910689049057, 'samples': 3646080, 'steps': 18989, 'loss/train': 1.0342262983322144} 11/06/2021 23:55:25 - INFO - __main__ - Step 18991: {'lr': 0.00048391723430416634, 'samples': 3646272, 'steps': 18990, 'loss/train': 1.5997928380966187} 11/06/2021 23:55:26 - INFO - __main__ - Step 18992: {'lr': 0.00048391536161244254, 'samples': 3646464, 'steps': 18991, 'loss/train': 1.5818288326263428} 11/06/2021 23:55:26 - INFO - __main__ - Step 18993: {'lr': 0.0004839134888153202, 'samples': 3646656, 'steps': 18992, 'loss/train': 1.9287071228027344} 11/06/2021 23:55:26 - INFO - __main__ - Step 18994: {'lr': 0.00048391161591279994, 'samples': 3646848, 'steps': 18993, 'loss/train': 1.191074252128601} 11/06/2021 23:55:27 - INFO - __main__ - Step 18995: {'lr': 0.0004839097429048827, 'samples': 3647040, 'steps': 18994, 'loss/train': 1.541513442993164} 11/06/2021 23:55:28 - INFO - __main__ - Step 18996: {'lr': 0.00048390786979156944, 'samples': 3647232, 'steps': 18995, 'loss/train': 1.4567269086837769} 11/06/2021 23:55:28 - INFO - __main__ - Step 18997: {'lr': 0.0004839059965728608, 'samples': 3647424, 'steps': 18996, 'loss/train': 1.8564058542251587} 11/06/2021 23:55:28 - INFO - __main__ - Step 18998: {'lr': 0.0004839041232487578, 'samples': 3647616, 'steps': 18997, 'loss/train': 0.6478285193443298} 11/06/2021 23:55:29 - INFO - __main__ - Step 18999: {'lr': 0.0004839022498192612, 'samples': 3647808, 'steps': 18998, 'loss/train': 1.9509828090667725} 11/06/2021 23:55:29 - INFO - __main__ - Step 19000: {'lr': 0.0004839003762843718, 'samples': 3648000, 'steps': 18999, 'loss/train': 1.717009425163269} 11/06/2021 23:55:30 - INFO - __main__ - Step 19001: {'lr': 0.00048389850264409054, 'samples': 3648192, 'steps': 19000, 'loss/train': 2.1301968097686768} 11/06/2021 23:55:30 - INFO - __main__ - Step 19002: {'lr': 0.00048389662889841825, 'samples': 3648384, 'steps': 19001, 'loss/train': 1.4481666088104248} 11/06/2021 23:55:31 - INFO - __main__ - Step 19003: {'lr': 0.0004838947550473557, 'samples': 3648576, 'steps': 19002, 'loss/train': 1.5534554719924927} 11/06/2021 23:55:31 - INFO - __main__ - Step 19004: {'lr': 0.00048389288109090383, 'samples': 3648768, 'steps': 19003, 'loss/train': 1.1992428302764893} 11/06/2021 23:55:31 - INFO - __main__ - Step 19005: {'lr': 0.0004838910070290634, 'samples': 3648960, 'steps': 19004, 'loss/train': 1.6658971309661865} 11/06/2021 23:55:33 - INFO - __main__ - Step 19006: {'lr': 0.00048388913286183535, 'samples': 3649152, 'steps': 19005, 'loss/train': 1.3170843124389648} 11/06/2021 23:55:33 - INFO - __main__ - Step 19007: {'lr': 0.0004838872585892204, 'samples': 3649344, 'steps': 19006, 'loss/train': 1.4023375511169434} 11/06/2021 23:55:33 - INFO - __main__ - Step 19008: {'lr': 0.00048388538421121946, 'samples': 3649536, 'steps': 19007, 'loss/train': 1.516270637512207} 11/06/2021 23:55:34 - INFO - __main__ - Step 19009: {'lr': 0.00048388350972783346, 'samples': 3649728, 'steps': 19008, 'loss/train': 1.362441062927246} 11/06/2021 23:55:34 - INFO - __main__ - Step 19010: {'lr': 0.000483881635139063, 'samples': 3649920, 'steps': 19009, 'loss/train': 1.2651129961013794} 11/06/2021 23:55:35 - INFO - __main__ - Step 19011: {'lr': 0.00048387976044490924, 'samples': 3650112, 'steps': 19010, 'loss/train': 1.373202919960022} 11/06/2021 23:55:35 - INFO - __main__ - Step 19012: {'lr': 0.0004838778856453728, 'samples': 3650304, 'steps': 19011, 'loss/train': 1.5069514513015747} 11/06/2021 23:55:36 - INFO - __main__ - Step 19013: {'lr': 0.00048387601074045464, 'samples': 3650496, 'steps': 19012, 'loss/train': 1.5526649951934814} 11/06/2021 23:55:36 - INFO - __main__ - Step 19014: {'lr': 0.0004838741357301555, 'samples': 3650688, 'steps': 19013, 'loss/train': 1.4588598012924194} 11/06/2021 23:55:36 - INFO - __main__ - Step 19015: {'lr': 0.00048387226061447633, 'samples': 3650880, 'steps': 19014, 'loss/train': 1.072666049003601} 11/06/2021 23:55:37 - INFO - __main__ - Step 19016: {'lr': 0.0004838703853934179, 'samples': 3651072, 'steps': 19015, 'loss/train': 1.7657874822616577} 11/06/2021 23:55:38 - INFO - __main__ - Step 19017: {'lr': 0.0004838685100669811, 'samples': 3651264, 'steps': 19016, 'loss/train': 1.2980928421020508} 11/06/2021 23:55:38 - INFO - __main__ - Step 19018: {'lr': 0.0004838666346351667, 'samples': 3651456, 'steps': 19017, 'loss/train': 1.271425724029541} 11/06/2021 23:55:38 - INFO - __main__ - Step 19019: {'lr': 0.0004838647590979757, 'samples': 3651648, 'steps': 19018, 'loss/train': 1.669057011604309} 11/06/2021 23:55:39 - INFO - __main__ - Step 19020: {'lr': 0.00048386288345540876, 'samples': 3651840, 'steps': 19019, 'loss/train': 1.4595625400543213} 11/06/2021 23:55:39 - INFO - __main__ - Step 19021: {'lr': 0.00048386100770746686, 'samples': 3652032, 'steps': 19020, 'loss/train': 1.5734845399856567} 11/06/2021 23:55:40 - INFO - __main__ - Step 19022: {'lr': 0.00048385913185415076, 'samples': 3652224, 'steps': 19021, 'loss/train': 1.6069128513336182} 11/06/2021 23:55:40 - INFO - __main__ - Step 19023: {'lr': 0.00048385725589546137, 'samples': 3652416, 'steps': 19022, 'loss/train': 1.7868753671646118} 11/06/2021 23:55:41 - INFO - __main__ - Step 19024: {'lr': 0.0004838553798313995, 'samples': 3652608, 'steps': 19023, 'loss/train': 1.7515647411346436} 11/06/2021 23:55:41 - INFO - __main__ - Step 19025: {'lr': 0.000483853503661966, 'samples': 3652800, 'steps': 19024, 'loss/train': 1.6738646030426025} 11/06/2021 23:55:42 - INFO - __main__ - Step 19026: {'lr': 0.00048385162738716174, 'samples': 3652992, 'steps': 19025, 'loss/train': 1.5762358903884888} 11/06/2021 23:55:42 - INFO - __main__ - Step 19027: {'lr': 0.00048384975100698756, 'samples': 3653184, 'steps': 19026, 'loss/train': 1.628257155418396} 11/06/2021 23:55:43 - INFO - __main__ - Step 19028: {'lr': 0.0004838478745214443, 'samples': 3653376, 'steps': 19027, 'loss/train': 1.753998041152954} 11/06/2021 23:55:43 - INFO - __main__ - Step 19029: {'lr': 0.00048384599793053275, 'samples': 3653568, 'steps': 19028, 'loss/train': 1.5790235996246338} 11/06/2021 23:55:44 - INFO - __main__ - Step 19030: {'lr': 0.0004838441212342538, 'samples': 3653760, 'steps': 19029, 'loss/train': 1.3828831911087036} 11/06/2021 23:55:44 - INFO - __main__ - Step 19031: {'lr': 0.0004838422444326084, 'samples': 3653952, 'steps': 19030, 'loss/train': 1.9452617168426514} 11/06/2021 23:55:45 - INFO - __main__ - Step 19032: {'lr': 0.0004838403675255971, 'samples': 3654144, 'steps': 19031, 'loss/train': 1.5351001024246216} 11/06/2021 23:55:45 - INFO - __main__ - Step 19033: {'lr': 0.0004838384905132211, 'samples': 3654336, 'steps': 19032, 'loss/train': 1.7516456842422485} 11/06/2021 23:55:46 - INFO - __main__ - Step 19034: {'lr': 0.000483836613395481, 'samples': 3654528, 'steps': 19033, 'loss/train': 0.5482653379440308} 11/06/2021 23:55:46 - INFO - __main__ - Step 19035: {'lr': 0.0004838347361723778, 'samples': 3654720, 'steps': 19034, 'loss/train': 1.3136703968048096} 11/06/2021 23:55:46 - INFO - __main__ - Step 19036: {'lr': 0.0004838328588439123, 'samples': 3654912, 'steps': 19035, 'loss/train': 1.8250452280044556} 11/06/2021 23:55:47 - INFO - __main__ - Step 19037: {'lr': 0.0004838309814100852, 'samples': 3655104, 'steps': 19036, 'loss/train': 1.3839877843856812} 11/06/2021 23:55:48 - INFO - __main__ - Step 19038: {'lr': 0.0004838291038708975, 'samples': 3655296, 'steps': 19037, 'loss/train': 1.6864336729049683} 11/06/2021 23:55:48 - INFO - __main__ - Step 19039: {'lr': 0.00048382722622635014, 'samples': 3655488, 'steps': 19038, 'loss/train': 1.4272637367248535} 11/06/2021 23:55:48 - INFO - __main__ - Step 19040: {'lr': 0.0004838253484764437, 'samples': 3655680, 'steps': 19039, 'loss/train': 1.7983784675598145} 11/06/2021 23:55:49 - INFO - __main__ - Step 19041: {'lr': 0.0004838234706211792, 'samples': 3655872, 'steps': 19040, 'loss/train': 1.689130187034607} 11/06/2021 23:55:50 - INFO - __main__ - Step 19042: {'lr': 0.00048382159266055746, 'samples': 3656064, 'steps': 19041, 'loss/train': 1.1757745742797852} 11/06/2021 23:55:50 - INFO - __main__ - Step 19043: {'lr': 0.0004838197145945793, 'samples': 3656256, 'steps': 19042, 'loss/train': 1.7386882305145264} 11/06/2021 23:55:51 - INFO - __main__ - Step 19044: {'lr': 0.0004838178364232456, 'samples': 3656448, 'steps': 19043, 'loss/train': 1.588568091392517} 11/06/2021 23:55:51 - INFO - __main__ - Step 19045: {'lr': 0.00048381595814655723, 'samples': 3656640, 'steps': 19044, 'loss/train': 1.907985806465149} 11/06/2021 23:55:51 - INFO - __main__ - Step 19046: {'lr': 0.000483814079764515, 'samples': 3656832, 'steps': 19045, 'loss/train': 1.874314308166504} 11/06/2021 23:55:52 - INFO - __main__ - Step 19047: {'lr': 0.00048381220127711967, 'samples': 3657024, 'steps': 19046, 'loss/train': 1.4793670177459717} 11/06/2021 23:55:53 - INFO - __main__ - Step 19048: {'lr': 0.0004838103226843722, 'samples': 3657216, 'steps': 19047, 'loss/train': 1.9645042419433594} 11/06/2021 23:55:53 - INFO - __main__ - Step 19049: {'lr': 0.00048380844398627343, 'samples': 3657408, 'steps': 19048, 'loss/train': 1.7406543493270874} 11/06/2021 23:55:53 - INFO - __main__ - Step 19050: {'lr': 0.0004838065651828242, 'samples': 3657600, 'steps': 19049, 'loss/train': 1.7106494903564453} 11/06/2021 23:55:54 - INFO - __main__ - Step 19051: {'lr': 0.0004838046862740253, 'samples': 3657792, 'steps': 19050, 'loss/train': 1.5313799381256104} 11/06/2021 23:55:55 - INFO - __main__ - Step 19052: {'lr': 0.0004838028072598777, 'samples': 3657984, 'steps': 19051, 'loss/train': 1.718366265296936} 11/06/2021 23:55:55 - INFO - __main__ - Step 19053: {'lr': 0.00048380092814038204, 'samples': 3658176, 'steps': 19052, 'loss/train': 2.046135425567627} 11/06/2021 23:55:55 - INFO - __main__ - Step 19054: {'lr': 0.0004837990489155394, 'samples': 3658368, 'steps': 19053, 'loss/train': 1.7225055694580078} 11/06/2021 23:55:56 - INFO - __main__ - Step 19055: {'lr': 0.00048379716958535043, 'samples': 3658560, 'steps': 19054, 'loss/train': 1.7493064403533936} 11/06/2021 23:55:56 - INFO - __main__ - Step 19056: {'lr': 0.00048379529014981604, 'samples': 3658752, 'steps': 19055, 'loss/train': 1.859459638595581} 11/06/2021 23:55:57 - INFO - __main__ - Step 19057: {'lr': 0.0004837934106089372, 'samples': 3658944, 'steps': 19056, 'loss/train': 0.9592066407203674} 11/06/2021 23:55:58 - INFO - __main__ - Step 19058: {'lr': 0.0004837915309627146, 'samples': 3659136, 'steps': 19057, 'loss/train': 1.9939336776733398} 11/06/2021 23:55:58 - INFO - __main__ - Step 19059: {'lr': 0.00048378965121114917, 'samples': 3659328, 'steps': 19058, 'loss/train': 1.3354649543762207} 11/06/2021 23:55:58 - INFO - __main__ - Step 19060: {'lr': 0.00048378777135424166, 'samples': 3659520, 'steps': 19059, 'loss/train': 1.3405288457870483} 11/06/2021 23:55:59 - INFO - __main__ - Step 19061: {'lr': 0.0004837858913919931, 'samples': 3659712, 'steps': 19060, 'loss/train': 1.7382433414459229} 11/06/2021 23:56:00 - INFO - __main__ - Step 19062: {'lr': 0.0004837840113244042, 'samples': 3659904, 'steps': 19061, 'loss/train': 1.5361404418945312} 11/06/2021 23:56:00 - INFO - __main__ - Step 19063: {'lr': 0.00048378213115147573, 'samples': 3660096, 'steps': 19062, 'loss/train': 0.8109406232833862} 11/06/2021 23:56:00 - INFO - __main__ - Step 19064: {'lr': 0.00048378025087320877, 'samples': 3660288, 'steps': 19063, 'loss/train': 1.951269507408142} 11/06/2021 23:56:01 - INFO - __main__ - Step 19065: {'lr': 0.0004837783704896039, 'samples': 3660480, 'steps': 19064, 'loss/train': 1.256687879562378} 11/06/2021 23:56:01 - INFO - __main__ - Step 19066: {'lr': 0.0004837764900006623, 'samples': 3660672, 'steps': 19065, 'loss/train': 1.4727357625961304} 11/06/2021 23:56:01 - INFO - __main__ - Step 19067: {'lr': 0.0004837746094063844, 'samples': 3660864, 'steps': 19066, 'loss/train': 1.9289908409118652} 11/06/2021 23:56:02 - INFO - __main__ - Step 19068: {'lr': 0.00048377272870677135, 'samples': 3661056, 'steps': 19067, 'loss/train': 1.5988587141036987} 11/06/2021 23:56:03 - INFO - __main__ - Step 19069: {'lr': 0.000483770847901824, 'samples': 3661248, 'steps': 19068, 'loss/train': 1.1608555316925049} 11/06/2021 23:56:03 - INFO - __main__ - Step 19070: {'lr': 0.000483768966991543, 'samples': 3661440, 'steps': 19069, 'loss/train': 1.5548107624053955} 11/06/2021 23:56:03 - INFO - __main__ - Step 19071: {'lr': 0.0004837670859759294, 'samples': 3661632, 'steps': 19070, 'loss/train': 1.667305827140808} 11/06/2021 23:56:04 - INFO - __main__ - Step 19072: {'lr': 0.0004837652048549839, 'samples': 3661824, 'steps': 19071, 'loss/train': 1.9622819423675537} 11/06/2021 23:56:05 - INFO - __main__ - Step 19073: {'lr': 0.00048376332362870745, 'samples': 3662016, 'steps': 19072, 'loss/train': 1.4648205041885376} 11/06/2021 23:56:05 - INFO - __main__ - Step 19074: {'lr': 0.00048376144229710083, 'samples': 3662208, 'steps': 19073, 'loss/train': 1.6623486280441284} 11/06/2021 23:56:06 - INFO - __main__ - Step 19075: {'lr': 0.00048375956086016495, 'samples': 3662400, 'steps': 19074, 'loss/train': 1.842500925064087} 11/06/2021 23:56:06 - INFO - __main__ - Step 19076: {'lr': 0.0004837576793179005, 'samples': 3662592, 'steps': 19075, 'loss/train': 0.8432311415672302} 11/06/2021 23:56:06 - INFO - __main__ - Step 19077: {'lr': 0.00048375579767030854, 'samples': 3662784, 'steps': 19076, 'loss/train': 1.2110167741775513} 11/06/2021 23:56:07 - INFO - __main__ - Step 19078: {'lr': 0.0004837539159173898, 'samples': 3662976, 'steps': 19077, 'loss/train': 1.565323829650879} 11/06/2021 23:56:08 - INFO - __main__ - Step 19079: {'lr': 0.00048375203405914515, 'samples': 3663168, 'steps': 19078, 'loss/train': 1.4068197011947632} 11/06/2021 23:56:08 - INFO - __main__ - Step 19080: {'lr': 0.00048375015209557547, 'samples': 3663360, 'steps': 19079, 'loss/train': 1.9403846263885498} 11/06/2021 23:56:08 - INFO - __main__ - Step 19081: {'lr': 0.00048374827002668156, 'samples': 3663552, 'steps': 19080, 'loss/train': 1.3041592836380005} 11/06/2021 23:56:09 - INFO - __main__ - Step 19082: {'lr': 0.0004837463878524643, 'samples': 3663744, 'steps': 19081, 'loss/train': 2.0662670135498047} 11/06/2021 23:56:10 - INFO - __main__ - Step 19083: {'lr': 0.0004837445055729245, 'samples': 3663936, 'steps': 19082, 'loss/train': 2.4334311485290527} 11/06/2021 23:56:10 - INFO - __main__ - Step 19084: {'lr': 0.00048374262318806306, 'samples': 3664128, 'steps': 19083, 'loss/train': 1.4202038049697876} 11/06/2021 23:56:10 - INFO - __main__ - Step 19085: {'lr': 0.00048374074069788077, 'samples': 3664320, 'steps': 19084, 'loss/train': 0.9480411410331726} 11/06/2021 23:56:11 - INFO - __main__ - Step 19086: {'lr': 0.0004837388581023785, 'samples': 3664512, 'steps': 19085, 'loss/train': 0.9188286662101746} 11/06/2021 23:56:11 - INFO - __main__ - Step 19087: {'lr': 0.0004837369754015571, 'samples': 3664704, 'steps': 19086, 'loss/train': 1.6863188743591309} 11/06/2021 23:56:12 - INFO - __main__ - Step 19088: {'lr': 0.0004837350925954175, 'samples': 3664896, 'steps': 19087, 'loss/train': 1.5236670970916748} 11/06/2021 23:56:12 - INFO - __main__ - Step 19089: {'lr': 0.00048373320968396043, 'samples': 3665088, 'steps': 19088, 'loss/train': 1.9557360410690308} 11/06/2021 23:56:13 - INFO - __main__ - Step 19090: {'lr': 0.0004837313266671868, 'samples': 3665280, 'steps': 19089, 'loss/train': 1.2220089435577393} 11/06/2021 23:56:13 - INFO - __main__ - Step 19091: {'lr': 0.0004837294435450974, 'samples': 3665472, 'steps': 19090, 'loss/train': 1.4970214366912842} 11/06/2021 23:56:13 - INFO - __main__ - Step 19092: {'lr': 0.00048372756031769316, 'samples': 3665664, 'steps': 19091, 'loss/train': 1.794909119606018} 11/06/2021 23:56:14 - INFO - __main__ - Step 19093: {'lr': 0.00048372567698497487, 'samples': 3665856, 'steps': 19092, 'loss/train': 1.7139893770217896} 11/06/2021 23:56:15 - INFO - __main__ - Step 19094: {'lr': 0.0004837237935469434, 'samples': 3666048, 'steps': 19093, 'loss/train': 1.5381810665130615} 11/06/2021 23:56:15 - INFO - __main__ - Step 19095: {'lr': 0.00048372191000359955, 'samples': 3666240, 'steps': 19094, 'loss/train': 1.4186919927597046} 11/06/2021 23:56:15 - INFO - __main__ - Step 19096: {'lr': 0.00048372002635494425, 'samples': 3666432, 'steps': 19095, 'loss/train': 1.651479721069336} 11/06/2021 23:56:16 - INFO - __main__ - Step 19097: {'lr': 0.00048371814260097834, 'samples': 3666624, 'steps': 19096, 'loss/train': 1.3919228315353394} 11/06/2021 23:56:16 - INFO - __main__ - Step 19098: {'lr': 0.0004837162587417027, 'samples': 3666816, 'steps': 19097, 'loss/train': 2.0199766159057617} 11/06/2021 23:56:17 - INFO - __main__ - Step 19099: {'lr': 0.000483714374777118, 'samples': 3667008, 'steps': 19098, 'loss/train': 1.6757065057754517} 11/06/2021 23:56:18 - INFO - __main__ - Step 19100: {'lr': 0.00048371249070722525, 'samples': 3667200, 'steps': 19099, 'loss/train': 1.7928451299667358} 11/06/2021 23:56:18 - INFO - __main__ - Step 19101: {'lr': 0.0004837106065320253, 'samples': 3667392, 'steps': 19100, 'loss/train': 1.8815168142318726} 11/06/2021 23:56:18 - INFO - __main__ - Step 19102: {'lr': 0.00048370872225151886, 'samples': 3667584, 'steps': 19101, 'loss/train': 1.4708551168441772} 11/06/2021 23:56:19 - INFO - __main__ - Step 19103: {'lr': 0.0004837068378657069, 'samples': 3667776, 'steps': 19102, 'loss/train': 1.1429966688156128} 11/06/2021 23:56:20 - INFO - __main__ - Step 19104: {'lr': 0.0004837049533745903, 'samples': 3667968, 'steps': 19103, 'loss/train': 1.8435603380203247} 11/06/2021 23:56:20 - INFO - __main__ - Step 19105: {'lr': 0.00048370306877816983, 'samples': 3668160, 'steps': 19104, 'loss/train': 1.6147713661193848} 11/06/2021 23:56:20 - INFO - __main__ - Step 19106: {'lr': 0.00048370118407644637, 'samples': 3668352, 'steps': 19105, 'loss/train': 1.210199236869812} 11/06/2021 23:56:21 - INFO - __main__ - Step 19107: {'lr': 0.0004836992992694208, 'samples': 3668544, 'steps': 19106, 'loss/train': 1.3318425416946411} 11/06/2021 23:56:21 - INFO - __main__ - Step 19108: {'lr': 0.00048369741435709383, 'samples': 3668736, 'steps': 19107, 'loss/train': 1.3522213697433472} 11/06/2021 23:56:22 - INFO - __main__ - Step 19109: {'lr': 0.0004836955293394665, 'samples': 3668928, 'steps': 19108, 'loss/train': 1.4460760354995728} 11/06/2021 23:56:22 - INFO - __main__ - Step 19110: {'lr': 0.00048369364421653953, 'samples': 3669120, 'steps': 19109, 'loss/train': 2.3319251537323} 11/06/2021 23:56:23 - INFO - __main__ - Step 19111: {'lr': 0.00048369175898831384, 'samples': 3669312, 'steps': 19110, 'loss/train': 1.8790931701660156} 11/06/2021 23:56:23 - INFO - __main__ - Step 19112: {'lr': 0.0004836898736547902, 'samples': 3669504, 'steps': 19111, 'loss/train': 1.8408292531967163} 11/06/2021 23:56:24 - INFO - __main__ - Step 19113: {'lr': 0.0004836879882159696, 'samples': 3669696, 'steps': 19112, 'loss/train': 1.7793866395950317} 11/06/2021 23:56:24 - INFO - __main__ - Step 19114: {'lr': 0.0004836861026718527, 'samples': 3669888, 'steps': 19113, 'loss/train': 1.2182018756866455} 11/06/2021 23:56:25 - INFO - __main__ - Step 19115: {'lr': 0.00048368421702244045, 'samples': 3670080, 'steps': 19114, 'loss/train': 1.8022509813308716} 11/06/2021 23:56:25 - INFO - __main__ - Step 19116: {'lr': 0.00048368233126773377, 'samples': 3670272, 'steps': 19115, 'loss/train': 1.5703195333480835} 11/06/2021 23:56:26 - INFO - __main__ - Step 19117: {'lr': 0.0004836804454077334, 'samples': 3670464, 'steps': 19116, 'loss/train': 1.6894001960754395} 11/06/2021 23:56:26 - INFO - __main__ - Step 19118: {'lr': 0.0004836785594424402, 'samples': 3670656, 'steps': 19117, 'loss/train': 1.8371540307998657} 11/06/2021 23:56:27 - INFO - __main__ - Step 19119: {'lr': 0.0004836766733718551, 'samples': 3670848, 'steps': 19118, 'loss/train': 1.038614273071289} 11/06/2021 23:56:27 - INFO - __main__ - Step 19120: {'lr': 0.0004836747871959789, 'samples': 3671040, 'steps': 19119, 'loss/train': 1.801282286643982} 11/06/2021 23:56:28 - INFO - __main__ - Step 19121: {'lr': 0.0004836729009148124, 'samples': 3671232, 'steps': 19120, 'loss/train': 1.3906354904174805} 11/06/2021 23:56:28 - INFO - __main__ - Step 19122: {'lr': 0.0004836710145283565, 'samples': 3671424, 'steps': 19121, 'loss/train': 1.6386430263519287} 11/06/2021 23:56:28 - INFO - __main__ - Step 19123: {'lr': 0.0004836691280366121, 'samples': 3671616, 'steps': 19122, 'loss/train': 1.5289560556411743} 11/06/2021 23:56:29 - INFO - __main__ - Step 19124: {'lr': 0.00048366724143958, 'samples': 3671808, 'steps': 19123, 'loss/train': 1.3992829322814941} 11/06/2021 23:56:30 - INFO - __main__ - Step 19125: {'lr': 0.0004836653547372609, 'samples': 3672000, 'steps': 19124, 'loss/train': 1.7073944807052612} 11/06/2021 23:56:30 - INFO - __main__ - Step 19126: {'lr': 0.00048366346792965597, 'samples': 3672192, 'steps': 19125, 'loss/train': 1.8707301616668701} 11/06/2021 23:56:30 - INFO - __main__ - Step 19127: {'lr': 0.0004836615810167658, 'samples': 3672384, 'steps': 19126, 'loss/train': 1.809815764427185} 11/06/2021 23:56:31 - INFO - __main__ - Step 19128: {'lr': 0.00048365969399859134, 'samples': 3672576, 'steps': 19127, 'loss/train': 0.8023385405540466} 11/06/2021 23:56:31 - INFO - __main__ - Step 19129: {'lr': 0.00048365780687513346, 'samples': 3672768, 'steps': 19128, 'loss/train': 0.9183880686759949} 11/06/2021 23:56:32 - INFO - __main__ - Step 19130: {'lr': 0.00048365591964639294, 'samples': 3672960, 'steps': 19129, 'loss/train': 1.3006956577301025} 11/06/2021 23:56:33 - INFO - __main__ - Step 19131: {'lr': 0.0004836540323123707, 'samples': 3673152, 'steps': 19130, 'loss/train': 1.5788049697875977} 11/06/2021 23:56:33 - INFO - __main__ - Step 19132: {'lr': 0.00048365214487306753, 'samples': 3673344, 'steps': 19131, 'loss/train': 2.2502143383026123} 11/06/2021 23:56:33 - INFO - __main__ - Step 19133: {'lr': 0.00048365025732848433, 'samples': 3673536, 'steps': 19132, 'loss/train': 1.1418144702911377} 11/06/2021 23:56:34 - INFO - __main__ - Step 19134: {'lr': 0.0004836483696786219, 'samples': 3673728, 'steps': 19133, 'loss/train': 0.19124336540699005} 11/06/2021 23:56:35 - INFO - __main__ - Step 19135: {'lr': 0.00048364648192348117, 'samples': 3673920, 'steps': 19134, 'loss/train': 1.739778757095337} 11/06/2021 23:56:35 - INFO - __main__ - Step 19136: {'lr': 0.0004836445940630629, 'samples': 3674112, 'steps': 19135, 'loss/train': 1.533406138420105} 11/06/2021 23:56:36 - INFO - __main__ - Step 19137: {'lr': 0.0004836427060973679, 'samples': 3674304, 'steps': 19136, 'loss/train': 1.5858798027038574} 11/06/2021 23:56:36 - INFO - __main__ - Step 19138: {'lr': 0.00048364081802639724, 'samples': 3674496, 'steps': 19137, 'loss/train': 1.5668922662734985} 11/06/2021 23:56:36 - INFO - __main__ - Step 19139: {'lr': 0.00048363892985015157, 'samples': 3674688, 'steps': 19138, 'loss/train': 1.4601547718048096} 11/06/2021 23:56:37 - INFO - __main__ - Step 19140: {'lr': 0.00048363704156863187, 'samples': 3674880, 'steps': 19139, 'loss/train': 1.4852017164230347} 11/06/2021 23:56:38 - INFO - __main__ - Step 19141: {'lr': 0.0004836351531818388, 'samples': 3675072, 'steps': 19140, 'loss/train': 1.2616417407989502} 11/06/2021 23:56:38 - INFO - __main__ - Step 19142: {'lr': 0.00048363326468977343, 'samples': 3675264, 'steps': 19141, 'loss/train': 2.229715585708618} 11/06/2021 23:56:38 - INFO - __main__ - Step 19143: {'lr': 0.00048363137609243654, 'samples': 3675456, 'steps': 19142, 'loss/train': 2.004793643951416} 11/06/2021 23:56:39 - INFO - __main__ - Step 19144: {'lr': 0.0004836294873898289, 'samples': 3675648, 'steps': 19143, 'loss/train': 1.3080763816833496} 11/06/2021 23:56:40 - INFO - __main__ - Step 19145: {'lr': 0.00048362759858195146, 'samples': 3675840, 'steps': 19144, 'loss/train': 1.3160992860794067} 11/06/2021 23:56:41 - INFO - __main__ - Step 19146: {'lr': 0.0004836257096688049, 'samples': 3676032, 'steps': 19145, 'loss/train': 1.665747880935669} 11/06/2021 23:56:41 - INFO - __main__ - Step 19147: {'lr': 0.00048362382065039034, 'samples': 3676224, 'steps': 19146, 'loss/train': 1.2606112957000732} 11/06/2021 23:56:41 - INFO - __main__ - Step 19148: {'lr': 0.00048362193152670847, 'samples': 3676416, 'steps': 19147, 'loss/train': 1.4134140014648438} 11/06/2021 23:56:42 - INFO - __main__ - Step 19149: {'lr': 0.0004836200422977601, 'samples': 3676608, 'steps': 19148, 'loss/train': 2.2859280109405518} 11/06/2021 23:56:42 - INFO - __main__ - Step 19150: {'lr': 0.00048361815296354624, 'samples': 3676800, 'steps': 19149, 'loss/train': 1.893393874168396} 11/06/2021 23:56:43 - INFO - __main__ - Step 19151: {'lr': 0.00048361626352406756, 'samples': 3676992, 'steps': 19150, 'loss/train': 1.5060352087020874} 11/06/2021 23:56:43 - INFO - __main__ - Step 19152: {'lr': 0.00048361437397932504, 'samples': 3677184, 'steps': 19151, 'loss/train': 2.0820343494415283} 11/06/2021 23:56:44 - INFO - __main__ - Step 19153: {'lr': 0.0004836124843293195, 'samples': 3677376, 'steps': 19152, 'loss/train': 1.5182925462722778} 11/06/2021 23:56:44 - INFO - __main__ - Step 19154: {'lr': 0.00048361059457405176, 'samples': 3677568, 'steps': 19153, 'loss/train': 1.160764217376709} 11/06/2021 23:56:45 - INFO - __main__ - Step 19155: {'lr': 0.0004836087047135227, 'samples': 3677760, 'steps': 19154, 'loss/train': 1.0252954959869385} 11/06/2021 23:56:46 - INFO - __main__ - Step 19156: {'lr': 0.0004836068147477331, 'samples': 3677952, 'steps': 19155, 'loss/train': 1.408125638961792} 11/06/2021 23:56:46 - INFO - __main__ - Step 19157: {'lr': 0.0004836049246766839, 'samples': 3678144, 'steps': 19156, 'loss/train': 1.8385653495788574} 11/06/2021 23:56:47 - INFO - __main__ - Step 19158: {'lr': 0.000483603034500376, 'samples': 3678336, 'steps': 19157, 'loss/train': 2.144331216812134} 11/06/2021 23:56:47 - INFO - __main__ - Step 19159: {'lr': 0.0004836011442188101, 'samples': 3678528, 'steps': 19158, 'loss/train': 2.2374393939971924} 11/06/2021 23:56:47 - INFO - __main__ - Step 19160: {'lr': 0.00048359925383198714, 'samples': 3678720, 'steps': 19159, 'loss/train': 1.9010136127471924} 11/06/2021 23:56:48 - INFO - __main__ - Step 19161: {'lr': 0.000483597363339908, 'samples': 3678912, 'steps': 19160, 'loss/train': 2.1621627807617188} 11/06/2021 23:56:49 - INFO - __main__ - Step 19162: {'lr': 0.0004835954727425734, 'samples': 3679104, 'steps': 19161, 'loss/train': 0.9236301779747009} 11/06/2021 23:56:49 - INFO - __main__ - Step 19163: {'lr': 0.0004835935820399844, 'samples': 3679296, 'steps': 19162, 'loss/train': 1.932969331741333} 11/06/2021 23:56:49 - INFO - __main__ - Step 19164: {'lr': 0.0004835916912321417, 'samples': 3679488, 'steps': 19163, 'loss/train': 1.723209261894226} 11/06/2021 23:56:50 - INFO - __main__ - Step 19165: {'lr': 0.0004835898003190462, 'samples': 3679680, 'steps': 19164, 'loss/train': 1.830366611480713} 11/06/2021 23:56:50 - INFO - __main__ - Step 19166: {'lr': 0.00048358790930069876, 'samples': 3679872, 'steps': 19165, 'loss/train': 1.4188361167907715} 11/06/2021 23:56:51 - INFO - __main__ - Step 19167: {'lr': 0.0004835860181771001, 'samples': 3680064, 'steps': 19166, 'loss/train': 1.7501378059387207} 11/06/2021 23:56:51 - INFO - __main__ - Step 19168: {'lr': 0.0004835841269482513, 'samples': 3680256, 'steps': 19167, 'loss/train': 1.277912974357605} 11/06/2021 23:56:52 - INFO - __main__ - Step 19169: {'lr': 0.00048358223561415306, 'samples': 3680448, 'steps': 19168, 'loss/train': 1.2338846921920776} 11/06/2021 23:56:52 - INFO - __main__ - Step 19170: {'lr': 0.0004835803441748062, 'samples': 3680640, 'steps': 19169, 'loss/train': 1.7084600925445557} 11/06/2021 23:56:53 - INFO - __main__ - Step 19171: {'lr': 0.0004835784526302117, 'samples': 3680832, 'steps': 19170, 'loss/train': 1.6762709617614746} 11/06/2021 23:56:54 - INFO - __main__ - Step 19172: {'lr': 0.0004835765609803704, 'samples': 3681024, 'steps': 19171, 'loss/train': 1.5952175855636597} 11/06/2021 23:56:54 - INFO - __main__ - Step 19173: {'lr': 0.00048357466922528306, 'samples': 3681216, 'steps': 19172, 'loss/train': 1.9963054656982422} 11/06/2021 23:56:54 - INFO - __main__ - Step 19174: {'lr': 0.00048357277736495055, 'samples': 3681408, 'steps': 19173, 'loss/train': 1.8751789331436157} 11/06/2021 23:56:55 - INFO - __main__ - Step 19175: {'lr': 0.0004835708853993738, 'samples': 3681600, 'steps': 19174, 'loss/train': 2.0888760089874268} 11/06/2021 23:56:55 - INFO - __main__ - Step 19176: {'lr': 0.0004835689933285536, 'samples': 3681792, 'steps': 19175, 'loss/train': 1.4569809436798096} 11/06/2021 23:56:55 - INFO - __main__ - Step 19177: {'lr': 0.0004835671011524908, 'samples': 3681984, 'steps': 19176, 'loss/train': 1.6103253364562988} 11/06/2021 23:56:56 - INFO - __main__ - Step 19178: {'lr': 0.0004835652088711863, 'samples': 3682176, 'steps': 19177, 'loss/train': 1.5094871520996094} 11/06/2021 23:56:57 - INFO - __main__ - Step 19179: {'lr': 0.0004835633164846409, 'samples': 3682368, 'steps': 19178, 'loss/train': 0.7678648233413696} 11/06/2021 23:56:57 - INFO - __main__ - Step 19180: {'lr': 0.00048356142399285545, 'samples': 3682560, 'steps': 19179, 'loss/train': 2.044360399246216} 11/06/2021 23:56:57 - INFO - __main__ - Step 19181: {'lr': 0.00048355953139583087, 'samples': 3682752, 'steps': 19180, 'loss/train': 1.4923588037490845} 11/06/2021 23:56:58 - INFO - __main__ - Step 19182: {'lr': 0.00048355763869356794, 'samples': 3682944, 'steps': 19181, 'loss/train': 1.9290509223937988} 11/06/2021 23:56:59 - INFO - __main__ - Step 19183: {'lr': 0.0004835557458860675, 'samples': 3683136, 'steps': 19182, 'loss/train': 1.8096939325332642} 11/06/2021 23:56:59 - INFO - __main__ - Step 19184: {'lr': 0.00048355385297333054, 'samples': 3683328, 'steps': 19183, 'loss/train': 1.6977989673614502} 11/06/2021 23:57:00 - INFO - __main__ - Step 19185: {'lr': 0.0004835519599553578, 'samples': 3683520, 'steps': 19184, 'loss/train': 1.9109872579574585} 11/06/2021 23:57:00 - INFO - __main__ - Step 19186: {'lr': 0.0004835500668321501, 'samples': 3683712, 'steps': 19185, 'loss/train': 1.4003887176513672} 11/06/2021 23:57:00 - INFO - __main__ - Step 19187: {'lr': 0.0004835481736037084, 'samples': 3683904, 'steps': 19186, 'loss/train': 1.6025032997131348} 11/06/2021 23:57:01 - INFO - __main__ - Step 19188: {'lr': 0.0004835462802700334, 'samples': 3684096, 'steps': 19187, 'loss/train': 1.638126015663147} 11/06/2021 23:57:02 - INFO - __main__ - Step 19189: {'lr': 0.00048354438683112614, 'samples': 3684288, 'steps': 19188, 'loss/train': 1.3936848640441895} 11/06/2021 23:57:02 - INFO - __main__ - Step 19190: {'lr': 0.00048354249328698743, 'samples': 3684480, 'steps': 19189, 'loss/train': 1.5004618167877197} 11/06/2021 23:57:02 - INFO - __main__ - Step 19191: {'lr': 0.000483540599637618, 'samples': 3684672, 'steps': 19190, 'loss/train': 1.6228737831115723} 11/06/2021 23:57:03 - INFO - __main__ - Step 19192: {'lr': 0.00048353870588301875, 'samples': 3684864, 'steps': 19191, 'loss/train': 1.393079400062561} 11/06/2021 23:57:04 - INFO - __main__ - Step 19193: {'lr': 0.00048353681202319056, 'samples': 3685056, 'steps': 19192, 'loss/train': 1.1348565816879272} 11/06/2021 23:57:04 - INFO - __main__ - Step 19194: {'lr': 0.0004835349180581343, 'samples': 3685248, 'steps': 19193, 'loss/train': 1.383333444595337} 11/06/2021 23:57:05 - INFO - __main__ - Step 19195: {'lr': 0.0004835330239878509, 'samples': 3685440, 'steps': 19194, 'loss/train': 1.5683242082595825} 11/06/2021 23:57:05 - INFO - __main__ - Step 19196: {'lr': 0.00048353112981234104, 'samples': 3685632, 'steps': 19195, 'loss/train': 1.6847158670425415} 11/06/2021 23:57:05 - INFO - __main__ - Step 19197: {'lr': 0.0004835292355316057, 'samples': 3685824, 'steps': 19196, 'loss/train': 1.350108027458191} 11/06/2021 23:57:06 - INFO - __main__ - Step 19198: {'lr': 0.0004835273411456456, 'samples': 3686016, 'steps': 19197, 'loss/train': 1.6436011791229248} 11/06/2021 23:57:07 - INFO - __main__ - Step 19199: {'lr': 0.00048352544665446174, 'samples': 3686208, 'steps': 19198, 'loss/train': 1.8072590827941895} 11/06/2021 23:57:07 - INFO - __main__ - Step 19200: {'lr': 0.000483523552058055, 'samples': 3686400, 'steps': 19199, 'loss/train': 1.4258383512496948} 11/06/2021 23:57:07 - INFO - __main__ - Step 19201: {'lr': 0.00048352165735642607, 'samples': 3686592, 'steps': 19200, 'loss/train': 1.8473409414291382} 11/06/2021 23:57:08 - INFO - __main__ - Step 19202: {'lr': 0.00048351976254957585, 'samples': 3686784, 'steps': 19201, 'loss/train': 1.810278296470642} 11/06/2021 23:57:08 - INFO - __main__ - Step 19203: {'lr': 0.0004835178676375053, 'samples': 3686976, 'steps': 19202, 'loss/train': 1.8077994585037231} 11/06/2021 23:57:09 - INFO - __main__ - Step 19204: {'lr': 0.0004835159726202151, 'samples': 3687168, 'steps': 19203, 'loss/train': 0.6947793364524841} 11/06/2021 23:57:09 - INFO - __main__ - Step 19205: {'lr': 0.0004835140774977063, 'samples': 3687360, 'steps': 19204, 'loss/train': 2.199883222579956} 11/06/2021 23:57:10 - INFO - __main__ - Step 19206: {'lr': 0.0004835121822699796, 'samples': 3687552, 'steps': 19205, 'loss/train': 1.3530951738357544} 11/06/2021 23:57:10 - INFO - __main__ - Step 19207: {'lr': 0.000483510286937036, 'samples': 3687744, 'steps': 19206, 'loss/train': 1.5403071641921997} 11/06/2021 23:57:10 - INFO - __main__ - Step 19208: {'lr': 0.0004835083914988762, 'samples': 3687936, 'steps': 19207, 'loss/train': 1.8505449295043945} 11/06/2021 23:57:11 - INFO - __main__ - Step 19209: {'lr': 0.0004835064959555011, 'samples': 3688128, 'steps': 19208, 'loss/train': 2.985590934753418} 11/06/2021 23:57:12 - INFO - __main__ - Step 19210: {'lr': 0.00048350460030691165, 'samples': 3688320, 'steps': 19209, 'loss/train': 1.5929882526397705} 11/06/2021 23:57:12 - INFO - __main__ - Step 19211: {'lr': 0.00048350270455310864, 'samples': 3688512, 'steps': 19210, 'loss/train': 1.5911760330200195} 11/06/2021 23:57:13 - INFO - __main__ - Step 19212: {'lr': 0.00048350080869409285, 'samples': 3688704, 'steps': 19211, 'loss/train': 1.439744234085083} 11/06/2021 23:57:13 - INFO - __main__ - Step 19213: {'lr': 0.0004834989127298652, 'samples': 3688896, 'steps': 19212, 'loss/train': 2.0591835975646973} 11/06/2021 23:57:14 - INFO - __main__ - Step 19214: {'lr': 0.00048349701666042656, 'samples': 3689088, 'steps': 19213, 'loss/train': 2.5805773735046387} 11/06/2021 23:57:14 - INFO - __main__ - Step 19215: {'lr': 0.00048349512048577784, 'samples': 3689280, 'steps': 19214, 'loss/train': 1.7143360376358032} 11/06/2021 23:57:15 - INFO - __main__ - Step 19216: {'lr': 0.00048349322420591966, 'samples': 3689472, 'steps': 19215, 'loss/train': 1.678261637687683} 11/06/2021 23:57:15 - INFO - __main__ - Step 19217: {'lr': 0.00048349132782085316, 'samples': 3689664, 'steps': 19216, 'loss/train': 2.175565004348755} 11/06/2021 23:57:15 - INFO - __main__ - Step 19218: {'lr': 0.00048348943133057903, 'samples': 3689856, 'steps': 19217, 'loss/train': 1.2457647323608398} 11/06/2021 23:57:17 - INFO - __main__ - Step 19219: {'lr': 0.0004834875347350982, 'samples': 3690048, 'steps': 19218, 'loss/train': 2.0584850311279297} 11/06/2021 23:57:17 - INFO - __main__ - Step 19220: {'lr': 0.00048348563803441146, 'samples': 3690240, 'steps': 19219, 'loss/train': 1.630115270614624} 11/06/2021 23:57:17 - INFO - __main__ - Step 19221: {'lr': 0.0004834837412285197, 'samples': 3690432, 'steps': 19220, 'loss/train': 2.0529630184173584} 11/06/2021 23:57:18 - INFO - __main__ - Step 19222: {'lr': 0.00048348184431742377, 'samples': 3690624, 'steps': 19221, 'loss/train': 1.4660913944244385} 11/06/2021 23:57:18 - INFO - __main__ - Step 19223: {'lr': 0.00048347994730112457, 'samples': 3690816, 'steps': 19222, 'loss/train': 1.7038263082504272} 11/06/2021 23:57:19 - INFO - __main__ - Step 19224: {'lr': 0.00048347805017962274, 'samples': 3691008, 'steps': 19223, 'loss/train': 1.5961427688598633} 11/06/2021 23:57:19 - INFO - __main__ - Step 19225: {'lr': 0.00048347615295291947, 'samples': 3691200, 'steps': 19224, 'loss/train': 1.2532870769500732} 11/06/2021 23:57:20 - INFO - __main__ - Step 19226: {'lr': 0.0004834742556210154, 'samples': 3691392, 'steps': 19225, 'loss/train': 1.9617180824279785} 11/06/2021 23:57:20 - INFO - __main__ - Step 19227: {'lr': 0.00048347235818391144, 'samples': 3691584, 'steps': 19226, 'loss/train': 1.223244071006775} 11/06/2021 23:57:20 - INFO - __main__ - Step 19228: {'lr': 0.0004834704606416084, 'samples': 3691776, 'steps': 19227, 'loss/train': 1.8296163082122803} 11/06/2021 23:57:21 - INFO - __main__ - Step 19229: {'lr': 0.00048346856299410725, 'samples': 3691968, 'steps': 19228, 'loss/train': 1.7569893598556519} 11/06/2021 23:57:22 - INFO - __main__ - Step 19230: {'lr': 0.0004834666652414087, 'samples': 3692160, 'steps': 19229, 'loss/train': 1.1428344249725342} 11/06/2021 23:57:22 - INFO - __main__ - Step 19231: {'lr': 0.0004834647673835137, 'samples': 3692352, 'steps': 19230, 'loss/train': 1.7404580116271973} 11/06/2021 23:57:23 - INFO - __main__ - Step 19232: {'lr': 0.00048346286942042307, 'samples': 3692544, 'steps': 19231, 'loss/train': 1.6448150873184204} 11/06/2021 23:57:23 - INFO - __main__ - Step 19233: {'lr': 0.0004834609713521377, 'samples': 3692736, 'steps': 19232, 'loss/train': 1.1691974401474} 11/06/2021 23:57:24 - INFO - __main__ - Step 19234: {'lr': 0.0004834590731786584, 'samples': 3692928, 'steps': 19233, 'loss/train': 1.48292076587677} 11/06/2021 23:57:24 - INFO - __main__ - Step 19235: {'lr': 0.000483457174899986, 'samples': 3693120, 'steps': 19234, 'loss/train': 1.780066728591919} 11/06/2021 23:57:25 - INFO - __main__ - Step 19236: {'lr': 0.00048345527651612145, 'samples': 3693312, 'steps': 19235, 'loss/train': 1.37631356716156} 11/06/2021 23:57:25 - INFO - __main__ - Step 19237: {'lr': 0.00048345337802706555, 'samples': 3693504, 'steps': 19236, 'loss/train': 1.759711503982544} 11/06/2021 23:57:25 - INFO - __main__ - Step 19238: {'lr': 0.0004834514794328192, 'samples': 3693696, 'steps': 19237, 'loss/train': 1.649255394935608} 11/06/2021 23:57:26 - INFO - __main__ - Step 19239: {'lr': 0.00048344958073338315, 'samples': 3693888, 'steps': 19238, 'loss/train': 0.8222588896751404} 11/06/2021 23:57:27 - INFO - __main__ - Step 19240: {'lr': 0.00048344768192875833, 'samples': 3694080, 'steps': 19239, 'loss/train': 1.4921916723251343} 11/06/2021 23:57:27 - INFO - __main__ - Step 19241: {'lr': 0.00048344578301894557, 'samples': 3694272, 'steps': 19240, 'loss/train': 1.9483696222305298} 11/06/2021 23:57:27 - INFO - __main__ - Step 19242: {'lr': 0.0004834438840039458, 'samples': 3694464, 'steps': 19241, 'loss/train': 1.5134330987930298} 11/06/2021 23:57:28 - INFO - __main__ - Step 19243: {'lr': 0.0004834419848837598, 'samples': 3694656, 'steps': 19242, 'loss/train': 1.6340835094451904} 11/06/2021 23:57:28 - INFO - __main__ - Step 19244: {'lr': 0.00048344008565838844, 'samples': 3694848, 'steps': 19243, 'loss/train': 1.8357398509979248} 11/06/2021 23:57:29 - INFO - __main__ - Step 19245: {'lr': 0.00048343818632783255, 'samples': 3695040, 'steps': 19244, 'loss/train': 1.5533130168914795} 11/06/2021 23:57:30 - INFO - __main__ - Step 19246: {'lr': 0.00048343628689209305, 'samples': 3695232, 'steps': 19245, 'loss/train': 1.3802316188812256} 11/06/2021 23:57:30 - INFO - __main__ - Step 19247: {'lr': 0.00048343438735117076, 'samples': 3695424, 'steps': 19246, 'loss/train': 1.439875841140747} 11/06/2021 23:57:30 - INFO - __main__ - Step 19248: {'lr': 0.00048343248770506655, 'samples': 3695616, 'steps': 19247, 'loss/train': 2.032688856124878} 11/06/2021 23:57:31 - INFO - __main__ - Step 19249: {'lr': 0.0004834305879537812, 'samples': 3695808, 'steps': 19248, 'loss/train': 2.1166164875030518} 11/06/2021 23:57:31 - INFO - __main__ - Step 19250: {'lr': 0.00048342868809731567, 'samples': 3696000, 'steps': 19249, 'loss/train': 5.924430847167969} 11/06/2021 23:57:32 - INFO - __main__ - Step 19251: {'lr': 0.0004834267881356708, 'samples': 3696192, 'steps': 19250, 'loss/train': 1.91569983959198} 11/06/2021 23:57:32 - INFO - __main__ - Step 19252: {'lr': 0.0004834248880688474, 'samples': 3696384, 'steps': 19251, 'loss/train': 1.692755103111267} 11/06/2021 23:57:33 - INFO - __main__ - Step 19253: {'lr': 0.00048342298789684637, 'samples': 3696576, 'steps': 19252, 'loss/train': 0.9209264516830444} 11/06/2021 23:57:33 - INFO - __main__ - Step 19254: {'lr': 0.0004834210876196685, 'samples': 3696768, 'steps': 19253, 'loss/train': 1.80112886428833} 11/06/2021 23:57:33 - INFO - __main__ - Step 19255: {'lr': 0.0004834191872373147, 'samples': 3696960, 'steps': 19254, 'loss/train': 1.6527858972549438} 11/06/2021 23:57:35 - INFO - __main__ - Step 19256: {'lr': 0.0004834172867497858, 'samples': 3697152, 'steps': 19255, 'loss/train': 1.832580804824829} 11/06/2021 23:57:35 - INFO - __main__ - Step 19257: {'lr': 0.0004834153861570827, 'samples': 3697344, 'steps': 19256, 'loss/train': 1.6807947158813477} 11/06/2021 23:57:36 - INFO - __main__ - Step 19258: {'lr': 0.00048341348545920623, 'samples': 3697536, 'steps': 19257, 'loss/train': 1.3076280355453491} 11/06/2021 23:57:36 - INFO - __main__ - Step 19259: {'lr': 0.0004834115846561572, 'samples': 3697728, 'steps': 19258, 'loss/train': 1.2784446477890015} 11/06/2021 23:57:36 - INFO - __main__ - Step 19260: {'lr': 0.0004834096837479366, 'samples': 3697920, 'steps': 19259, 'loss/train': 1.541551113128662} 11/06/2021 23:57:37 - INFO - __main__ - Step 19261: {'lr': 0.00048340778273454514, 'samples': 3698112, 'steps': 19260, 'loss/train': 1.8904051780700684} 11/06/2021 23:57:37 - INFO - __main__ - Step 19262: {'lr': 0.00048340588161598373, 'samples': 3698304, 'steps': 19261, 'loss/train': 1.777864694595337} 11/06/2021 23:57:38 - INFO - __main__ - Step 19263: {'lr': 0.00048340398039225325, 'samples': 3698496, 'steps': 19262, 'loss/train': 1.7768715620040894} 11/06/2021 23:57:38 - INFO - __main__ - Step 19264: {'lr': 0.0004834020790633545, 'samples': 3698688, 'steps': 19263, 'loss/train': 1.5909192562103271} 11/06/2021 23:57:39 - INFO - __main__ - Step 19265: {'lr': 0.00048340017762928843, 'samples': 3698880, 'steps': 19264, 'loss/train': 1.390834093093872} 11/06/2021 23:57:39 - INFO - __main__ - Step 19266: {'lr': 0.00048339827609005583, 'samples': 3699072, 'steps': 19265, 'loss/train': 1.6098217964172363} 11/06/2021 23:57:40 - INFO - __main__ - Step 19267: {'lr': 0.00048339637444565756, 'samples': 3699264, 'steps': 19266, 'loss/train': 1.617615818977356} 11/06/2021 23:57:41 - INFO - __main__ - Step 19268: {'lr': 0.0004833944726960945, 'samples': 3699456, 'steps': 19267, 'loss/train': 2.376567840576172} 11/06/2021 23:57:41 - INFO - __main__ - Step 19269: {'lr': 0.00048339257084136747, 'samples': 3699648, 'steps': 19268, 'loss/train': 1.5636578798294067} 11/06/2021 23:57:41 - INFO - __main__ - Step 19270: {'lr': 0.0004833906688814774, 'samples': 3699840, 'steps': 19269, 'loss/train': 1.7293047904968262} 11/06/2021 23:57:42 - INFO - __main__ - Step 19271: {'lr': 0.00048338876681642504, 'samples': 3700032, 'steps': 19270, 'loss/train': 2.3842742443084717} 11/06/2021 23:57:42 - INFO - __main__ - Step 19272: {'lr': 0.0004833868646462113, 'samples': 3700224, 'steps': 19271, 'loss/train': 0.9048125147819519} 11/06/2021 23:57:43 - INFO - __main__ - Step 19273: {'lr': 0.00048338496237083705, 'samples': 3700416, 'steps': 19272, 'loss/train': 1.2667889595031738} 11/06/2021 23:57:43 - INFO - __main__ - Step 19274: {'lr': 0.00048338305999030313, 'samples': 3700608, 'steps': 19273, 'loss/train': 1.076546311378479} 11/06/2021 23:57:44 - INFO - __main__ - Step 19275: {'lr': 0.00048338115750461044, 'samples': 3700800, 'steps': 19274, 'loss/train': 0.9132834076881409} 11/06/2021 23:57:44 - INFO - __main__ - Step 19276: {'lr': 0.0004833792549137598, 'samples': 3700992, 'steps': 19275, 'loss/train': 0.9541102051734924} 11/06/2021 23:57:44 - INFO - __main__ - Step 19277: {'lr': 0.00048337735221775204, 'samples': 3701184, 'steps': 19276, 'loss/train': 1.4680448770523071} 11/06/2021 23:57:45 - INFO - __main__ - Step 19278: {'lr': 0.000483375449416588, 'samples': 3701376, 'steps': 19277, 'loss/train': 1.483697533607483} 11/06/2021 23:57:46 - INFO - __main__ - Step 19279: {'lr': 0.0004833735465102687, 'samples': 3701568, 'steps': 19278, 'loss/train': 1.5289688110351562} 11/06/2021 23:57:46 - INFO - __main__ - Step 19280: {'lr': 0.0004833716434987948, 'samples': 3701760, 'steps': 19279, 'loss/train': 1.0586185455322266} 11/06/2021 23:57:47 - INFO - __main__ - Step 19281: {'lr': 0.0004833697403821672, 'samples': 3701952, 'steps': 19280, 'loss/train': 1.5551531314849854} 11/06/2021 23:57:47 - INFO - __main__ - Step 19282: {'lr': 0.0004833678371603869, 'samples': 3702144, 'steps': 19281, 'loss/train': 1.4676300287246704} 11/06/2021 23:57:47 - INFO - __main__ - Step 19283: {'lr': 0.0004833659338334546, 'samples': 3702336, 'steps': 19282, 'loss/train': 1.9228662252426147} 11/06/2021 23:57:48 - INFO - __main__ - Step 19284: {'lr': 0.0004833640304013712, 'samples': 3702528, 'steps': 19283, 'loss/train': 1.4458892345428467} 11/06/2021 23:57:49 - INFO - __main__ - Step 19285: {'lr': 0.0004833621268641376, 'samples': 3702720, 'steps': 19284, 'loss/train': 1.6027082204818726} 11/06/2021 23:57:49 - INFO - __main__ - Step 19286: {'lr': 0.0004833602232217546, 'samples': 3702912, 'steps': 19285, 'loss/train': 1.810238242149353} 11/06/2021 23:57:49 - INFO - __main__ - Step 19287: {'lr': 0.0004833583194742231, 'samples': 3703104, 'steps': 19286, 'loss/train': 2.003950595855713} 11/06/2021 23:57:50 - INFO - __main__ - Step 19288: {'lr': 0.00048335641562154396, 'samples': 3703296, 'steps': 19287, 'loss/train': 1.7352495193481445} 11/06/2021 23:57:51 - INFO - __main__ - Step 19289: {'lr': 0.00048335451166371803, 'samples': 3703488, 'steps': 19288, 'loss/train': 1.9271100759506226} 11/06/2021 23:57:51 - INFO - __main__ - Step 19290: {'lr': 0.0004833526076007461, 'samples': 3703680, 'steps': 19289, 'loss/train': 1.5342708826065063} 11/06/2021 23:57:52 - INFO - __main__ - Step 19291: {'lr': 0.0004833507034326291, 'samples': 3703872, 'steps': 19290, 'loss/train': 1.6989701986312866} 11/06/2021 23:57:52 - INFO - __main__ - Step 19292: {'lr': 0.0004833487991593679, 'samples': 3704064, 'steps': 19291, 'loss/train': 1.7094693183898926} 11/06/2021 23:57:52 - INFO - __main__ - Step 19293: {'lr': 0.0004833468947809633, 'samples': 3704256, 'steps': 19292, 'loss/train': 1.513824462890625} 11/06/2021 23:57:53 - INFO - __main__ - Step 19294: {'lr': 0.0004833449902974162, 'samples': 3704448, 'steps': 19293, 'loss/train': 1.898005723953247} 11/06/2021 23:57:54 - INFO - __main__ - Step 19295: {'lr': 0.00048334308570872745, 'samples': 3704640, 'steps': 19294, 'loss/train': 1.7322596311569214} 11/06/2021 23:57:54 - INFO - __main__ - Step 19296: {'lr': 0.00048334118101489793, 'samples': 3704832, 'steps': 19295, 'loss/train': 1.4130336046218872} 11/06/2021 23:57:54 - INFO - __main__ - Step 19297: {'lr': 0.00048333927621592844, 'samples': 3705024, 'steps': 19296, 'loss/train': 1.6878700256347656} 11/06/2021 23:57:55 - INFO - __main__ - Step 19298: {'lr': 0.00048333737131181986, 'samples': 3705216, 'steps': 19297, 'loss/train': 1.5635979175567627} 11/06/2021 23:57:55 - INFO - __main__ - Step 19299: {'lr': 0.00048333546630257315, 'samples': 3705408, 'steps': 19298, 'loss/train': 1.3481976985931396} 11/06/2021 23:57:56 - INFO - __main__ - Step 19300: {'lr': 0.000483333561188189, 'samples': 3705600, 'steps': 19299, 'loss/train': 1.2055301666259766} 11/06/2021 23:57:56 - INFO - __main__ - Step 19301: {'lr': 0.00048333165596866837, 'samples': 3705792, 'steps': 19300, 'loss/train': 1.9592984914779663} 11/06/2021 23:57:57 - INFO - __main__ - Step 19302: {'lr': 0.00048332975064401207, 'samples': 3705984, 'steps': 19301, 'loss/train': 1.6503535509109497} 11/06/2021 23:57:57 - INFO - __main__ - Step 19303: {'lr': 0.000483327845214221, 'samples': 3706176, 'steps': 19302, 'loss/train': 1.752747893333435} 11/06/2021 23:57:57 - INFO - __main__ - Step 19304: {'lr': 0.00048332593967929607, 'samples': 3706368, 'steps': 19303, 'loss/train': 1.8258517980575562} 11/06/2021 23:57:58 - INFO - __main__ - Step 19305: {'lr': 0.000483324034039238, 'samples': 3706560, 'steps': 19304, 'loss/train': 2.0873513221740723} 11/06/2021 23:57:59 - INFO - __main__ - Step 19306: {'lr': 0.00048332212829404775, 'samples': 3706752, 'steps': 19305, 'loss/train': 1.607957363128662} 11/06/2021 23:57:59 - INFO - __main__ - Step 19307: {'lr': 0.0004833202224437261, 'samples': 3706944, 'steps': 19306, 'loss/train': 1.5633206367492676} 11/06/2021 23:57:59 - INFO - __main__ - Step 19308: {'lr': 0.000483318316488274, 'samples': 3707136, 'steps': 19307, 'loss/train': 1.7169498205184937} 11/06/2021 23:58:00 - INFO - __main__ - Step 19309: {'lr': 0.00048331641042769223, 'samples': 3707328, 'steps': 19308, 'loss/train': 2.3175976276397705} 11/06/2021 23:58:01 - INFO - __main__ - Step 19310: {'lr': 0.00048331450426198177, 'samples': 3707520, 'steps': 19309, 'loss/train': 1.9806184768676758} 11/06/2021 23:58:01 - INFO - __main__ - Step 19311: {'lr': 0.0004833125979911434, 'samples': 3707712, 'steps': 19310, 'loss/train': 1.7473728656768799} 11/06/2021 23:58:01 - INFO - __main__ - Step 19312: {'lr': 0.0004833106916151778, 'samples': 3707904, 'steps': 19311, 'loss/train': 1.586052656173706} 11/06/2021 23:58:02 - INFO - __main__ - Step 19313: {'lr': 0.00048330878513408616, 'samples': 3708096, 'steps': 19312, 'loss/train': 1.7465753555297852} 11/06/2021 23:58:02 - INFO - __main__ - Step 19314: {'lr': 0.00048330687854786914, 'samples': 3708288, 'steps': 19313, 'loss/train': 1.9451851844787598} 11/06/2021 23:58:03 - INFO - __main__ - Step 19315: {'lr': 0.00048330497185652765, 'samples': 3708480, 'steps': 19314, 'loss/train': 1.6248364448547363} 11/06/2021 23:58:04 - INFO - __main__ - Step 19316: {'lr': 0.00048330306506006257, 'samples': 3708672, 'steps': 19315, 'loss/train': 1.7191998958587646} 11/06/2021 23:58:04 - INFO - __main__ - Step 19317: {'lr': 0.00048330115815847465, 'samples': 3708864, 'steps': 19316, 'loss/train': 1.3119690418243408} 11/06/2021 23:58:04 - INFO - __main__ - Step 19318: {'lr': 0.0004832992511517649, 'samples': 3709056, 'steps': 19317, 'loss/train': 1.7842353582382202} 11/06/2021 23:58:05 - INFO - __main__ - Step 19319: {'lr': 0.00048329734403993406, 'samples': 3709248, 'steps': 19318, 'loss/train': 1.7536782026290894} 11/06/2021 23:58:06 - INFO - __main__ - Step 19320: {'lr': 0.00048329543682298307, 'samples': 3709440, 'steps': 19319, 'loss/train': 1.9086477756500244} 11/06/2021 23:58:06 - INFO - __main__ - Step 19321: {'lr': 0.0004832935295009127, 'samples': 3709632, 'steps': 19320, 'loss/train': 1.843841791152954} 11/06/2021 23:58:06 - INFO - __main__ - Step 19322: {'lr': 0.0004832916220737239, 'samples': 3709824, 'steps': 19321, 'loss/train': 1.6211942434310913} 11/06/2021 23:58:07 - INFO - __main__ - Step 19323: {'lr': 0.0004832897145414175, 'samples': 3710016, 'steps': 19322, 'loss/train': 1.4943122863769531} 11/06/2021 23:58:07 - INFO - __main__ - Step 19324: {'lr': 0.0004832878069039943, 'samples': 3710208, 'steps': 19323, 'loss/train': 1.5772778987884521} 11/06/2021 23:58:08 - INFO - __main__ - Step 19325: {'lr': 0.0004832858991614553, 'samples': 3710400, 'steps': 19324, 'loss/train': 1.6283198595046997} 11/06/2021 23:58:08 - INFO - __main__ - Step 19326: {'lr': 0.00048328399131380127, 'samples': 3710592, 'steps': 19325, 'loss/train': 1.879858136177063} 11/06/2021 23:58:09 - INFO - __main__ - Step 19327: {'lr': 0.00048328208336103305, 'samples': 3710784, 'steps': 19326, 'loss/train': 1.4845051765441895} 11/06/2021 23:58:09 - INFO - __main__ - Step 19328: {'lr': 0.0004832801753031515, 'samples': 3710976, 'steps': 19327, 'loss/train': 1.7683420181274414} 11/06/2021 23:58:10 - INFO - __main__ - Step 19329: {'lr': 0.00048327826714015756, 'samples': 3711168, 'steps': 19328, 'loss/train': 3.4698245525360107} 11/06/2021 23:58:10 - INFO - __main__ - Step 19330: {'lr': 0.00048327635887205196, 'samples': 3711360, 'steps': 19329, 'loss/train': 1.1537224054336548} 11/06/2021 23:58:11 - INFO - __main__ - Step 19331: {'lr': 0.00048327445049883567, 'samples': 3711552, 'steps': 19330, 'loss/train': 1.5621854066848755} 11/06/2021 23:58:11 - INFO - __main__ - Step 19332: {'lr': 0.0004832725420205095, 'samples': 3711744, 'steps': 19331, 'loss/train': 1.1450562477111816} 11/06/2021 23:58:12 - INFO - __main__ - Step 19333: {'lr': 0.00048327063343707433, 'samples': 3711936, 'steps': 19332, 'loss/train': 1.4444999694824219} 11/06/2021 23:58:12 - INFO - __main__ - Step 19334: {'lr': 0.000483268724748531, 'samples': 3712128, 'steps': 19333, 'loss/train': 1.5540878772735596} 11/06/2021 23:58:12 - INFO - __main__ - Step 19335: {'lr': 0.0004832668159548804, 'samples': 3712320, 'steps': 19334, 'loss/train': 1.3487190008163452} 11/06/2021 23:58:13 - INFO - __main__ - Step 19336: {'lr': 0.00048326490705612337, 'samples': 3712512, 'steps': 19335, 'loss/train': 1.3741018772125244} 11/06/2021 23:58:14 - INFO - __main__ - Step 19337: {'lr': 0.0004832629980522608, 'samples': 3712704, 'steps': 19336, 'loss/train': 1.6172581911087036} 11/06/2021 23:58:14 - INFO - __main__ - Step 19338: {'lr': 0.00048326108894329345, 'samples': 3712896, 'steps': 19337, 'loss/train': 1.6094346046447754} 11/06/2021 23:58:15 - INFO - __main__ - Step 19339: {'lr': 0.00048325917972922227, 'samples': 3713088, 'steps': 19338, 'loss/train': 1.8732949495315552} 11/06/2021 23:58:15 - INFO - __main__ - Step 19340: {'lr': 0.00048325727041004815, 'samples': 3713280, 'steps': 19339, 'loss/train': 2.8026809692382812} 11/06/2021 23:58:16 - INFO - __main__ - Step 19341: {'lr': 0.0004832553609857719, 'samples': 3713472, 'steps': 19340, 'loss/train': 1.4218841791152954} 11/06/2021 23:58:16 - INFO - __main__ - Step 19342: {'lr': 0.0004832534514563943, 'samples': 3713664, 'steps': 19341, 'loss/train': 1.700616717338562} 11/06/2021 23:58:17 - INFO - __main__ - Step 19343: {'lr': 0.0004832515418219164, 'samples': 3713856, 'steps': 19342, 'loss/train': 1.542962908744812} 11/06/2021 23:58:17 - INFO - __main__ - Step 19344: {'lr': 0.0004832496320823389, 'samples': 3714048, 'steps': 19343, 'loss/train': 1.2601964473724365} 11/06/2021 23:58:17 - INFO - __main__ - Step 19345: {'lr': 0.0004832477222376627, 'samples': 3714240, 'steps': 19344, 'loss/train': 1.3057348728179932} 11/06/2021 23:58:18 - INFO - __main__ - Step 19346: {'lr': 0.0004832458122878888, 'samples': 3714432, 'steps': 19345, 'loss/train': 0.8863457441329956} 11/06/2021 23:58:19 - INFO - __main__ - Step 19347: {'lr': 0.0004832439022330178, 'samples': 3714624, 'steps': 19346, 'loss/train': 0.9812582731246948} 11/06/2021 23:58:19 - INFO - __main__ - Step 19348: {'lr': 0.00048324199207305075, 'samples': 3714816, 'steps': 19347, 'loss/train': 1.7008509635925293} 11/06/2021 23:58:19 - INFO - __main__ - Step 19349: {'lr': 0.0004832400818079884, 'samples': 3715008, 'steps': 19348, 'loss/train': 1.88282310962677} 11/06/2021 23:58:20 - INFO - __main__ - Step 19350: {'lr': 0.00048323817143783174, 'samples': 3715200, 'steps': 19349, 'loss/train': 1.5020644664764404} 11/06/2021 23:58:20 - INFO - __main__ - Step 19351: {'lr': 0.0004832362609625815, 'samples': 3715392, 'steps': 19350, 'loss/train': 1.5163544416427612} 11/06/2021 23:58:21 - INFO - __main__ - Step 19352: {'lr': 0.0004832343503822386, 'samples': 3715584, 'steps': 19351, 'loss/train': 1.6568862199783325} 11/06/2021 23:58:21 - INFO - __main__ - Step 19353: {'lr': 0.000483232439696804, 'samples': 3715776, 'steps': 19352, 'loss/train': 1.1262316703796387} 11/06/2021 23:58:22 - INFO - __main__ - Step 19354: {'lr': 0.0004832305289062784, 'samples': 3715968, 'steps': 19353, 'loss/train': 0.9058158993721008} 11/06/2021 23:58:22 - INFO - __main__ - Step 19355: {'lr': 0.00048322861801066265, 'samples': 3716160, 'steps': 19354, 'loss/train': 1.8485344648361206} 11/06/2021 23:58:23 - INFO - __main__ - Step 19356: {'lr': 0.00048322670700995775, 'samples': 3716352, 'steps': 19355, 'loss/train': 2.099125862121582} 11/06/2021 23:58:24 - INFO - __main__ - Step 19357: {'lr': 0.0004832247959041645, 'samples': 3716544, 'steps': 19356, 'loss/train': 1.2258687019348145} 11/06/2021 23:58:24 - INFO - __main__ - Step 19358: {'lr': 0.0004832228846932838, 'samples': 3716736, 'steps': 19357, 'loss/train': 1.7507163286209106} 11/06/2021 23:58:24 - INFO - __main__ - Step 19359: {'lr': 0.0004832209733773164, 'samples': 3716928, 'steps': 19358, 'loss/train': 1.7394636869430542} 11/06/2021 23:58:25 - INFO - __main__ - Step 19360: {'lr': 0.0004832190619562632, 'samples': 3717120, 'steps': 19359, 'loss/train': 1.5605862140655518} 11/06/2021 23:58:25 - INFO - __main__ - Step 19361: {'lr': 0.00048321715043012515, 'samples': 3717312, 'steps': 19360, 'loss/train': 1.5524048805236816} 11/06/2021 23:58:26 - INFO - __main__ - Step 19362: {'lr': 0.00048321523879890307, 'samples': 3717504, 'steps': 19361, 'loss/train': 1.7804820537567139} 11/06/2021 23:58:27 - INFO - __main__ - Step 19363: {'lr': 0.00048321332706259773, 'samples': 3717696, 'steps': 19362, 'loss/train': 1.6952815055847168} 11/06/2021 23:58:27 - INFO - __main__ - Step 19364: {'lr': 0.0004832114152212101, 'samples': 3717888, 'steps': 19363, 'loss/train': 1.2942943572998047} 11/06/2021 23:58:27 - INFO - __main__ - Step 19365: {'lr': 0.000483209503274741, 'samples': 3718080, 'steps': 19364, 'loss/train': 2.1306514739990234} 11/06/2021 23:58:28 - INFO - __main__ - Step 19366: {'lr': 0.0004832075912231913, 'samples': 3718272, 'steps': 19365, 'loss/train': 1.71169114112854} 11/06/2021 23:58:29 - INFO - __main__ - Step 19367: {'lr': 0.0004832056790665619, 'samples': 3718464, 'steps': 19366, 'loss/train': 0.859806478023529} 11/06/2021 23:58:29 - INFO - __main__ - Step 19368: {'lr': 0.0004832037668048536, 'samples': 3718656, 'steps': 19367, 'loss/train': 0.6691350340843201} 11/06/2021 23:58:29 - INFO - __main__ - Step 19369: {'lr': 0.00048320185443806717, 'samples': 3718848, 'steps': 19368, 'loss/train': 1.5839430093765259} 11/06/2021 23:58:30 - INFO - __main__ - Step 19370: {'lr': 0.0004831999419662037, 'samples': 3719040, 'steps': 19369, 'loss/train': 1.3964022397994995} 11/06/2021 23:58:30 - INFO - __main__ - Step 19371: {'lr': 0.0004831980293892639, 'samples': 3719232, 'steps': 19370, 'loss/train': 1.7141057252883911} 11/06/2021 23:58:31 - INFO - __main__ - Step 19372: {'lr': 0.0004831961167072487, 'samples': 3719424, 'steps': 19371, 'loss/train': 1.4822651147842407} 11/06/2021 23:58:31 - INFO - __main__ - Step 19373: {'lr': 0.0004831942039201589, 'samples': 3719616, 'steps': 19372, 'loss/train': 1.7279945611953735} 11/06/2021 23:58:32 - INFO - __main__ - Step 19374: {'lr': 0.0004831922910279954, 'samples': 3719808, 'steps': 19373, 'loss/train': 1.1288522481918335} 11/06/2021 23:58:32 - INFO - __main__ - Step 19375: {'lr': 0.000483190378030759, 'samples': 3720000, 'steps': 19374, 'loss/train': 1.5655479431152344} 11/06/2021 23:58:33 - INFO - __main__ - Step 19376: {'lr': 0.0004831884649284507, 'samples': 3720192, 'steps': 19375, 'loss/train': 1.5669931173324585} 11/06/2021 23:58:33 - INFO - __main__ - Step 19377: {'lr': 0.00048318655172107126, 'samples': 3720384, 'steps': 19376, 'loss/train': 1.8472486734390259} 11/06/2021 23:58:34 - INFO - __main__ - Step 19378: {'lr': 0.0004831846384086215, 'samples': 3720576, 'steps': 19377, 'loss/train': 1.4276505708694458} 11/06/2021 23:58:34 - INFO - __main__ - Step 19379: {'lr': 0.0004831827249911024, 'samples': 3720768, 'steps': 19378, 'loss/train': 2.790935516357422} 11/06/2021 23:58:35 - INFO - __main__ - Step 19380: {'lr': 0.0004831808114685147, 'samples': 3720960, 'steps': 19379, 'loss/train': 2.847355842590332} 11/06/2021 23:58:35 - INFO - __main__ - Step 19381: {'lr': 0.00048317889784085935, 'samples': 3721152, 'steps': 19380, 'loss/train': 1.9903427362442017} 11/06/2021 23:58:35 - INFO - __main__ - Step 19382: {'lr': 0.0004831769841081372, 'samples': 3721344, 'steps': 19381, 'loss/train': 1.8123948574066162} 11/06/2021 23:58:36 - INFO - __main__ - Step 19383: {'lr': 0.00048317507027034913, 'samples': 3721536, 'steps': 19382, 'loss/train': 1.783972978591919} 11/06/2021 23:58:37 - INFO - __main__ - Step 19384: {'lr': 0.0004831731563274959, 'samples': 3721728, 'steps': 19383, 'loss/train': 1.8129934072494507} 11/06/2021 23:58:37 - INFO - __main__ - Step 19385: {'lr': 0.0004831712422795785, 'samples': 3721920, 'steps': 19384, 'loss/train': 1.562839388847351} 11/06/2021 23:58:37 - INFO - __main__ - Step 19386: {'lr': 0.00048316932812659776, 'samples': 3722112, 'steps': 19385, 'loss/train': 1.5851092338562012} 11/06/2021 23:58:38 - INFO - __main__ - Step 19387: {'lr': 0.00048316741386855445, 'samples': 3722304, 'steps': 19386, 'loss/train': 1.541589379310608} 11/06/2021 23:58:39 - INFO - __main__ - Step 19388: {'lr': 0.0004831654995054495, 'samples': 3722496, 'steps': 19387, 'loss/train': 1.533311128616333} 11/06/2021 23:58:40 - INFO - __main__ - Step 19389: {'lr': 0.0004831635850372838, 'samples': 3722688, 'steps': 19388, 'loss/train': 1.626739740371704} 11/06/2021 23:58:40 - INFO - __main__ - Step 19390: {'lr': 0.00048316167046405826, 'samples': 3722880, 'steps': 19389, 'loss/train': 1.7666641473770142} 11/06/2021 23:58:40 - INFO - __main__ - Step 19391: {'lr': 0.0004831597557857735, 'samples': 3723072, 'steps': 19390, 'loss/train': 2.5333597660064697} 11/06/2021 23:58:41 - INFO - __main__ - Step 19392: {'lr': 0.00048315784100243063, 'samples': 3723264, 'steps': 19391, 'loss/train': 2.226302146911621} 11/06/2021 23:58:41 - INFO - __main__ - Step 19393: {'lr': 0.0004831559261140305, 'samples': 3723456, 'steps': 19392, 'loss/train': 1.3086782693862915} 11/06/2021 23:58:42 - INFO - __main__ - Step 19394: {'lr': 0.0004831540111205739, 'samples': 3723648, 'steps': 19393, 'loss/train': 2.0022213459014893} 11/06/2021 23:58:42 - INFO - __main__ - Step 19395: {'lr': 0.00048315209602206165, 'samples': 3723840, 'steps': 19394, 'loss/train': 1.3035863637924194} 11/06/2021 23:58:43 - INFO - __main__ - Step 19396: {'lr': 0.0004831501808184947, 'samples': 3724032, 'steps': 19395, 'loss/train': 1.9788427352905273} 11/06/2021 23:58:43 - INFO - __main__ - Step 19397: {'lr': 0.0004831482655098738, 'samples': 3724224, 'steps': 19396, 'loss/train': 1.7932040691375732} 11/06/2021 23:58:43 - INFO - __main__ - Step 19398: {'lr': 0.00048314635009619997, 'samples': 3724416, 'steps': 19397, 'loss/train': 1.363989233970642} 11/06/2021 23:58:45 - INFO - __main__ - Step 19399: {'lr': 0.0004831444345774739, 'samples': 3724608, 'steps': 19398, 'loss/train': 1.9155510663986206} 11/06/2021 23:58:45 - INFO - __main__ - Step 19400: {'lr': 0.00048314251895369663, 'samples': 3724800, 'steps': 19399, 'loss/train': 1.7524795532226562} 11/06/2021 23:58:45 - INFO - __main__ - Step 19401: {'lr': 0.000483140603224869, 'samples': 3724992, 'steps': 19400, 'loss/train': 1.5705088376998901} 11/06/2021 23:58:46 - INFO - __main__ - Step 19402: {'lr': 0.00048313868739099166, 'samples': 3725184, 'steps': 19401, 'loss/train': 1.9813029766082764} 11/06/2021 23:58:46 - INFO - __main__ - Step 19403: {'lr': 0.0004831367714520657, 'samples': 3725376, 'steps': 19402, 'loss/train': 1.216731071472168} 11/06/2021 23:58:47 - INFO - __main__ - Step 19404: {'lr': 0.0004831348554080919, 'samples': 3725568, 'steps': 19403, 'loss/train': 1.8201124668121338} 11/06/2021 23:58:47 - INFO - __main__ - Step 19405: {'lr': 0.0004831329392590711, 'samples': 3725760, 'steps': 19404, 'loss/train': 1.7847330570220947} 11/06/2021 23:58:48 - INFO - __main__ - Step 19406: {'lr': 0.00048313102300500424, 'samples': 3725952, 'steps': 19405, 'loss/train': 0.5406389236450195} 11/06/2021 23:58:48 - INFO - __main__ - Step 19407: {'lr': 0.00048312910664589215, 'samples': 3726144, 'steps': 19406, 'loss/train': 1.7577673196792603} 11/06/2021 23:58:49 - INFO - __main__ - Step 19408: {'lr': 0.0004831271901817357, 'samples': 3726336, 'steps': 19407, 'loss/train': 1.2659802436828613} 11/06/2021 23:58:50 - INFO - __main__ - Step 19409: {'lr': 0.00048312527361253567, 'samples': 3726528, 'steps': 19408, 'loss/train': 1.87831449508667} 11/06/2021 23:58:50 - INFO - __main__ - Step 19410: {'lr': 0.000483123356938293, 'samples': 3726720, 'steps': 19409, 'loss/train': 1.4174208641052246} 11/06/2021 23:58:50 - INFO - __main__ - Step 19411: {'lr': 0.00048312144015900856, 'samples': 3726912, 'steps': 19410, 'loss/train': 1.6749889850616455} 11/06/2021 23:58:51 - INFO - __main__ - Step 19412: {'lr': 0.00048311952327468325, 'samples': 3727104, 'steps': 19411, 'loss/train': 1.7777684926986694} 11/06/2021 23:58:51 - INFO - __main__ - Step 19413: {'lr': 0.00048311760628531777, 'samples': 3727296, 'steps': 19412, 'loss/train': 1.2066028118133545} 11/06/2021 23:58:52 - INFO - __main__ - Step 19414: {'lr': 0.00048311568919091316, 'samples': 3727488, 'steps': 19413, 'loss/train': 2.1470677852630615} 11/06/2021 23:58:52 - INFO - __main__ - Step 19415: {'lr': 0.00048311377199147023, 'samples': 3727680, 'steps': 19414, 'loss/train': 1.9940916299819946} 11/06/2021 23:58:53 - INFO - __main__ - Step 19416: {'lr': 0.00048311185468698974, 'samples': 3727872, 'steps': 19415, 'loss/train': 2.1364340782165527} 11/06/2021 23:58:53 - INFO - __main__ - Step 19417: {'lr': 0.00048310993727747277, 'samples': 3728064, 'steps': 19416, 'loss/train': 1.5032535791397095} 11/06/2021 23:58:53 - INFO - __main__ - Step 19418: {'lr': 0.00048310801976292, 'samples': 3728256, 'steps': 19417, 'loss/train': 1.4930044412612915} 11/06/2021 23:58:54 - INFO - __main__ - Step 19419: {'lr': 0.0004831061021433323, 'samples': 3728448, 'steps': 19418, 'loss/train': 1.7884739637374878} 11/06/2021 23:58:55 - INFO - __main__ - Step 19420: {'lr': 0.00048310418441871065, 'samples': 3728640, 'steps': 19419, 'loss/train': 1.911501169204712} 11/06/2021 23:58:55 - INFO - __main__ - Step 19421: {'lr': 0.00048310226658905585, 'samples': 3728832, 'steps': 19420, 'loss/train': 2.056422472000122} 11/06/2021 23:58:55 - INFO - __main__ - Step 19422: {'lr': 0.00048310034865436876, 'samples': 3729024, 'steps': 19421, 'loss/train': 1.5690569877624512} 11/06/2021 23:58:56 - INFO - __main__ - Step 19423: {'lr': 0.0004830984306146503, 'samples': 3729216, 'steps': 19422, 'loss/train': 1.1024407148361206} 11/06/2021 23:58:56 - INFO - __main__ - Step 19424: {'lr': 0.0004830965124699012, 'samples': 3729408, 'steps': 19423, 'loss/train': 1.1547913551330566} 11/06/2021 23:58:59 - INFO - __main__ - Step 19425: {'lr': 0.00048309459422012243, 'samples': 3729600, 'steps': 19424, 'loss/train': 1.695124626159668} 11/06/2021 23:58:59 - INFO - __main__ - Step 19426: {'lr': 0.0004830926758653148, 'samples': 3729792, 'steps': 19425, 'loss/train': 1.4162240028381348} 11/06/2021 23:58:59 - INFO - __main__ - Step 19427: {'lr': 0.00048309075740547925, 'samples': 3729984, 'steps': 19426, 'loss/train': 1.3688523769378662} 11/06/2021 23:59:00 - INFO - __main__ - Step 19428: {'lr': 0.0004830888388406166, 'samples': 3730176, 'steps': 19427, 'loss/train': 1.8020360469818115} 11/06/2021 23:59:00 - INFO - __main__ - Step 19429: {'lr': 0.00048308692017072773, 'samples': 3730368, 'steps': 19428, 'loss/train': 1.153029441833496} 11/06/2021 23:59:01 - INFO - __main__ - Step 19430: {'lr': 0.00048308500139581344, 'samples': 3730560, 'steps': 19429, 'loss/train': 1.0884881019592285} 11/06/2021 23:59:01 - INFO - __main__ - Step 19431: {'lr': 0.00048308308251587476, 'samples': 3730752, 'steps': 19430, 'loss/train': 1.7762088775634766} 11/06/2021 23:59:01 - INFO - __main__ - Step 19432: {'lr': 0.00048308116353091234, 'samples': 3730944, 'steps': 19431, 'loss/train': 1.8368065357208252} 11/06/2021 23:59:02 - INFO - __main__ - Step 19433: {'lr': 0.00048307924444092716, 'samples': 3731136, 'steps': 19432, 'loss/train': 1.6261667013168335} 11/06/2021 23:59:02 - INFO - __main__ - Step 19434: {'lr': 0.0004830773252459201, 'samples': 3731328, 'steps': 19433, 'loss/train': 1.6783347129821777} 11/06/2021 23:59:03 - INFO - __main__ - Step 19435: {'lr': 0.00048307540594589194, 'samples': 3731520, 'steps': 19434, 'loss/train': 1.5496079921722412} 11/06/2021 23:59:03 - INFO - __main__ - Step 19436: {'lr': 0.0004830734865408437, 'samples': 3731712, 'steps': 19435, 'loss/train': 1.2190485000610352} 11/06/2021 23:59:04 - INFO - __main__ - Step 19437: {'lr': 0.000483071567030776, 'samples': 3731904, 'steps': 19436, 'loss/train': 1.7005469799041748} 11/06/2021 23:59:05 - INFO - __main__ - Step 19438: {'lr': 0.00048306964741568994, 'samples': 3732096, 'steps': 19437, 'loss/train': 1.5111713409423828} 11/06/2021 23:59:05 - INFO - __main__ - Step 19439: {'lr': 0.00048306772769558624, 'samples': 3732288, 'steps': 19438, 'loss/train': 1.892134189605713} 11/06/2021 23:59:05 - INFO - __main__ - Step 19440: {'lr': 0.0004830658078704659, 'samples': 3732480, 'steps': 19439, 'loss/train': 1.6170473098754883} 11/06/2021 23:59:06 - INFO - __main__ - Step 19441: {'lr': 0.0004830638879403296, 'samples': 3732672, 'steps': 19440, 'loss/train': 1.8451327085494995} 11/06/2021 23:59:06 - INFO - __main__ - Step 19442: {'lr': 0.00048306196790517844, 'samples': 3732864, 'steps': 19441, 'loss/train': 1.871138334274292} 11/06/2021 23:59:07 - INFO - __main__ - Step 19443: {'lr': 0.0004830600477650131, 'samples': 3733056, 'steps': 19442, 'loss/train': 1.3609321117401123} 11/06/2021 23:59:07 - INFO - __main__ - Step 19444: {'lr': 0.0004830581275198344, 'samples': 3733248, 'steps': 19443, 'loss/train': 1.0425359010696411} 11/06/2021 23:59:08 - INFO - __main__ - Step 19445: {'lr': 0.00048305620716964336, 'samples': 3733440, 'steps': 19444, 'loss/train': 2.1450159549713135} 11/06/2021 23:59:08 - INFO - __main__ - Step 19446: {'lr': 0.00048305428671444083, 'samples': 3733632, 'steps': 19445, 'loss/train': 1.0744739770889282} 11/06/2021 23:59:09 - INFO - __main__ - Step 19447: {'lr': 0.00048305236615422763, 'samples': 3733824, 'steps': 19446, 'loss/train': 1.7108923196792603} 11/06/2021 23:59:09 - INFO - __main__ - Step 19448: {'lr': 0.00048305044548900463, 'samples': 3734016, 'steps': 19447, 'loss/train': 1.3483003377914429} 11/06/2021 23:59:10 - INFO - __main__ - Step 19449: {'lr': 0.0004830485247187727, 'samples': 3734208, 'steps': 19448, 'loss/train': 1.7191352844238281} 11/06/2021 23:59:10 - INFO - __main__ - Step 19450: {'lr': 0.0004830466038435327, 'samples': 3734400, 'steps': 19449, 'loss/train': 1.6840806007385254} 11/06/2021 23:59:11 - INFO - __main__ - Step 19451: {'lr': 0.0004830446828632854, 'samples': 3734592, 'steps': 19450, 'loss/train': 1.22652006149292} 11/06/2021 23:59:11 - INFO - __main__ - Step 19452: {'lr': 0.00048304276177803186, 'samples': 3734784, 'steps': 19451, 'loss/train': 1.2331140041351318} 11/06/2021 23:59:12 - INFO - __main__ - Step 19453: {'lr': 0.00048304084058777285, 'samples': 3734976, 'steps': 19452, 'loss/train': 1.9303547143936157} 11/06/2021 23:59:13 - INFO - __main__ - Step 19454: {'lr': 0.00048303891929250923, 'samples': 3735168, 'steps': 19453, 'loss/train': 1.3610291481018066} 11/06/2021 23:59:13 - INFO - __main__ - Step 19455: {'lr': 0.0004830369978922418, 'samples': 3735360, 'steps': 19454, 'loss/train': 1.2890371084213257} 11/06/2021 23:59:13 - INFO - __main__ - Step 19456: {'lr': 0.00048303507638697155, 'samples': 3735552, 'steps': 19455, 'loss/train': 1.549730658531189} 11/06/2021 23:59:14 - INFO - __main__ - Step 19457: {'lr': 0.0004830331547766993, 'samples': 3735744, 'steps': 19456, 'loss/train': 1.8960416316986084} 11/06/2021 23:59:14 - INFO - __main__ - Step 19458: {'lr': 0.0004830312330614259, 'samples': 3735936, 'steps': 19457, 'loss/train': 1.7021613121032715} 11/06/2021 23:59:15 - INFO - __main__ - Step 19459: {'lr': 0.00048302931124115226, 'samples': 3736128, 'steps': 19458, 'loss/train': 1.5494282245635986} 11/06/2021 23:59:15 - INFO - __main__ - Step 19460: {'lr': 0.0004830273893158791, 'samples': 3736320, 'steps': 19459, 'loss/train': 1.4215049743652344} 11/06/2021 23:59:16 - INFO - __main__ - Step 19461: {'lr': 0.0004830254672856075, 'samples': 3736512, 'steps': 19460, 'loss/train': 1.396445393562317} 11/06/2021 23:59:16 - INFO - __main__ - Step 19462: {'lr': 0.00048302354515033813, 'samples': 3736704, 'steps': 19461, 'loss/train': 1.8570998907089233} 11/06/2021 23:59:16 - INFO - __main__ - Step 19463: {'lr': 0.00048302162291007203, 'samples': 3736896, 'steps': 19462, 'loss/train': 1.4819624423980713} 11/06/2021 23:59:17 - INFO - __main__ - Step 19464: {'lr': 0.00048301970056480994, 'samples': 3737088, 'steps': 19463, 'loss/train': 1.6561782360076904} 11/06/2021 23:59:18 - INFO - __main__ - Step 19465: {'lr': 0.00048301777811455274, 'samples': 3737280, 'steps': 19464, 'loss/train': 1.935529351234436} 11/06/2021 23:59:18 - INFO - __main__ - Step 19466: {'lr': 0.0004830158555593014, 'samples': 3737472, 'steps': 19465, 'loss/train': 1.6366750001907349} 11/06/2021 23:59:18 - INFO - __main__ - Step 19467: {'lr': 0.00048301393289905663, 'samples': 3737664, 'steps': 19466, 'loss/train': 1.6262980699539185} 11/06/2021 23:59:19 - INFO - __main__ - Step 19468: {'lr': 0.00048301201013381946, 'samples': 3737856, 'steps': 19467, 'loss/train': 1.4352784156799316} 11/06/2021 23:59:20 - INFO - __main__ - Step 19469: {'lr': 0.00048301008726359064, 'samples': 3738048, 'steps': 19468, 'loss/train': 1.9483211040496826} 11/06/2021 23:59:20 - INFO - __main__ - Step 19470: {'lr': 0.00048300816428837104, 'samples': 3738240, 'steps': 19469, 'loss/train': 1.8832329511642456} 11/06/2021 23:59:20 - INFO - __main__ - Step 19471: {'lr': 0.00048300624120816153, 'samples': 3738432, 'steps': 19470, 'loss/train': 1.5686419010162354} 11/06/2021 23:59:21 - INFO - __main__ - Step 19472: {'lr': 0.0004830043180229631, 'samples': 3738624, 'steps': 19471, 'loss/train': 1.8613708019256592} 11/06/2021 23:59:21 - INFO - __main__ - Step 19473: {'lr': 0.0004830023947327764, 'samples': 3738816, 'steps': 19472, 'loss/train': 1.726355791091919} 11/06/2021 23:59:22 - INFO - __main__ - Step 19474: {'lr': 0.0004830004713376025, 'samples': 3739008, 'steps': 19473, 'loss/train': 1.6512629985809326} 11/06/2021 23:59:23 - INFO - __main__ - Step 19475: {'lr': 0.00048299854783744224, 'samples': 3739200, 'steps': 19474, 'loss/train': 1.8223868608474731} 11/06/2021 23:59:23 - INFO - __main__ - Step 19476: {'lr': 0.0004829966242322963, 'samples': 3739392, 'steps': 19475, 'loss/train': 1.6881176233291626} 11/06/2021 23:59:23 - INFO - __main__ - Step 19477: {'lr': 0.00048299470052216576, 'samples': 3739584, 'steps': 19476, 'loss/train': 1.664871096611023} 11/06/2021 23:59:24 - INFO - __main__ - Step 19478: {'lr': 0.0004829927767070514, 'samples': 3739776, 'steps': 19477, 'loss/train': 1.8570330142974854} 11/06/2021 23:59:25 - INFO - __main__ - Step 19479: {'lr': 0.0004829908527869541, 'samples': 3739968, 'steps': 19478, 'loss/train': 0.8422459363937378} 11/06/2021 23:59:25 - INFO - __main__ - Step 19480: {'lr': 0.0004829889287618746, 'samples': 3740160, 'steps': 19479, 'loss/train': 1.5826501846313477} 11/06/2021 23:59:26 - INFO - __main__ - Step 19481: {'lr': 0.000482987004631814, 'samples': 3740352, 'steps': 19480, 'loss/train': 1.462388515472412} 11/06/2021 23:59:26 - INFO - __main__ - Step 19482: {'lr': 0.000482985080396773, 'samples': 3740544, 'steps': 19481, 'loss/train': 1.5462687015533447} 11/06/2021 23:59:27 - INFO - __main__ - Step 19483: {'lr': 0.00048298315605675257, 'samples': 3740736, 'steps': 19482, 'loss/train': 1.6107537746429443} 11/06/2021 23:59:27 - INFO - __main__ - Step 19484: {'lr': 0.0004829812316117535, 'samples': 3740928, 'steps': 19483, 'loss/train': 1.9696018695831299} 11/06/2021 23:59:28 - INFO - __main__ - Step 19485: {'lr': 0.0004829793070617767, 'samples': 3741120, 'steps': 19484, 'loss/train': 1.904404878616333} 11/06/2021 23:59:28 - INFO - __main__ - Step 19486: {'lr': 0.000482977382406823, 'samples': 3741312, 'steps': 19485, 'loss/train': 1.7050572633743286} 11/06/2021 23:59:29 - INFO - __main__ - Step 19487: {'lr': 0.00048297545764689327, 'samples': 3741504, 'steps': 19486, 'loss/train': 1.5216996669769287} 11/06/2021 23:59:29 - INFO - __main__ - Step 19488: {'lr': 0.00048297353278198843, 'samples': 3741696, 'steps': 19487, 'loss/train': 1.2509301900863647} 11/06/2021 23:59:29 - INFO - __main__ - Step 19489: {'lr': 0.00048297160781210925, 'samples': 3741888, 'steps': 19488, 'loss/train': 1.639148235321045} 11/06/2021 23:59:31 - INFO - __main__ - Step 19490: {'lr': 0.00048296968273725673, 'samples': 3742080, 'steps': 19489, 'loss/train': 1.5672905445098877} 11/06/2021 23:59:31 - INFO - __main__ - Step 19491: {'lr': 0.0004829677575574316, 'samples': 3742272, 'steps': 19490, 'loss/train': 1.7949920892715454} 11/06/2021 23:59:31 - INFO - __main__ - Step 19492: {'lr': 0.0004829658322726348, 'samples': 3742464, 'steps': 19491, 'loss/train': 2.019479751586914} 11/06/2021 23:59:32 - INFO - __main__ - Step 19493: {'lr': 0.00048296390688286724, 'samples': 3742656, 'steps': 19492, 'loss/train': 2.085145950317383} 11/06/2021 23:59:32 - INFO - __main__ - Step 19494: {'lr': 0.00048296198138812974, 'samples': 3742848, 'steps': 19493, 'loss/train': 1.4425673484802246} 11/06/2021 23:59:33 - INFO - __main__ - Step 19495: {'lr': 0.00048296005578842314, 'samples': 3743040, 'steps': 19494, 'loss/train': 1.797430157661438} 11/06/2021 23:59:34 - INFO - __main__ - Step 19496: {'lr': 0.0004829581300837483, 'samples': 3743232, 'steps': 19495, 'loss/train': 1.7276138067245483} 11/06/2021 23:59:34 - INFO - __main__ - Step 19497: {'lr': 0.00048295620427410614, 'samples': 3743424, 'steps': 19496, 'loss/train': 1.0747604370117188} 11/06/2021 23:59:34 - INFO - __main__ - Step 19498: {'lr': 0.00048295427835949757, 'samples': 3743616, 'steps': 19497, 'loss/train': 1.531845211982727} 11/06/2021 23:59:35 - INFO - __main__ - Step 19499: {'lr': 0.0004829523523399233, 'samples': 3743808, 'steps': 19498, 'loss/train': 5.932285785675049} 11/06/2021 23:59:35 - INFO - __main__ - Step 19500: {'lr': 0.0004829504262153844, 'samples': 3744000, 'steps': 19499, 'loss/train': 1.0767468214035034} 11/06/2021 23:59:36 - INFO - __main__ - Step 19501: {'lr': 0.00048294849998588155, 'samples': 3744192, 'steps': 19500, 'loss/train': 2.2038071155548096} 11/06/2021 23:59:36 - INFO - __main__ - Step 19502: {'lr': 0.0004829465736514157, 'samples': 3744384, 'steps': 19501, 'loss/train': 1.7361345291137695} 11/06/2021 23:59:37 - INFO - __main__ - Step 19503: {'lr': 0.0004829446472119878, 'samples': 3744576, 'steps': 19502, 'loss/train': 1.3352776765823364} 11/06/2021 23:59:37 - INFO - __main__ - Step 19504: {'lr': 0.0004829427206675986, 'samples': 3744768, 'steps': 19503, 'loss/train': 1.8171507120132446} 11/06/2021 23:59:37 - INFO - __main__ - Step 19505: {'lr': 0.000482940794018249, 'samples': 3744960, 'steps': 19504, 'loss/train': 1.5799142122268677} 11/06/2021 23:59:39 - INFO - __main__ - Step 19506: {'lr': 0.00048293886726393984, 'samples': 3745152, 'steps': 19505, 'loss/train': 1.4450923204421997} 11/06/2021 23:59:39 - INFO - __main__ - Step 19507: {'lr': 0.00048293694040467205, 'samples': 3745344, 'steps': 19506, 'loss/train': 1.731113076210022} 11/06/2021 23:59:39 - INFO - __main__ - Step 19508: {'lr': 0.00048293501344044644, 'samples': 3745536, 'steps': 19507, 'loss/train': 1.6001505851745605} 11/06/2021 23:59:40 - INFO - __main__ - Step 19509: {'lr': 0.00048293308637126393, 'samples': 3745728, 'steps': 19508, 'loss/train': 0.7838894724845886} 11/06/2021 23:59:40 - INFO - __main__ - Step 19510: {'lr': 0.0004829311591971254, 'samples': 3745920, 'steps': 19509, 'loss/train': 1.8190715312957764} 11/06/2021 23:59:41 - INFO - __main__ - Step 19511: {'lr': 0.0004829292319180316, 'samples': 3746112, 'steps': 19510, 'loss/train': 1.5588487386703491} 11/06/2021 23:59:41 - INFO - __main__ - Step 19512: {'lr': 0.00048292730453398355, 'samples': 3746304, 'steps': 19511, 'loss/train': 1.3858730792999268} 11/06/2021 23:59:42 - INFO - __main__ - Step 19513: {'lr': 0.00048292537704498203, 'samples': 3746496, 'steps': 19512, 'loss/train': 1.3733470439910889} 11/06/2021 23:59:42 - INFO - __main__ - Step 19514: {'lr': 0.00048292344945102795, 'samples': 3746688, 'steps': 19513, 'loss/train': 1.670436143875122} 11/06/2021 23:59:43 - INFO - __main__ - Step 19515: {'lr': 0.0004829215217521221, 'samples': 3746880, 'steps': 19514, 'loss/train': 1.8147488832473755} 11/06/2021 23:59:44 - INFO - __main__ - Step 19516: {'lr': 0.00048291959394826546, 'samples': 3747072, 'steps': 19515, 'loss/train': 1.9264278411865234} 11/06/2021 23:59:44 - INFO - __main__ - Step 19517: {'lr': 0.00048291766603945885, 'samples': 3747264, 'steps': 19516, 'loss/train': 1.1423556804656982} 11/06/2021 23:59:44 - INFO - __main__ - Step 19518: {'lr': 0.0004829157380257031, 'samples': 3747456, 'steps': 19517, 'loss/train': 1.3138654232025146} 11/06/2021 23:59:45 - INFO - __main__ - Step 19519: {'lr': 0.0004829138099069991, 'samples': 3747648, 'steps': 19518, 'loss/train': 1.420883059501648} 11/06/2021 23:59:45 - INFO - __main__ - Step 19520: {'lr': 0.0004829118816833478, 'samples': 3747840, 'steps': 19519, 'loss/train': 2.512575626373291} 11/06/2021 23:59:45 - INFO - __main__ - Step 19521: {'lr': 0.00048290995335474997, 'samples': 3748032, 'steps': 19520, 'loss/train': 1.6682121753692627} 11/06/2021 23:59:47 - INFO - __main__ - Step 19522: {'lr': 0.0004829080249212064, 'samples': 3748224, 'steps': 19521, 'loss/train': 1.5365606546401978} 11/06/2021 23:59:47 - INFO - __main__ - Step 19523: {'lr': 0.00048290609638271823, 'samples': 3748416, 'steps': 19522, 'loss/train': 1.6803691387176514} 11/06/2021 23:59:47 - INFO - __main__ - Step 19524: {'lr': 0.00048290416773928615, 'samples': 3748608, 'steps': 19523, 'loss/train': 1.8752951622009277} 11/06/2021 23:59:48 - INFO - __main__ - Step 19525: {'lr': 0.00048290223899091094, 'samples': 3748800, 'steps': 19524, 'loss/train': 1.8965718746185303} 11/06/2021 23:59:48 - INFO - __main__ - Step 19526: {'lr': 0.0004829003101375937, 'samples': 3748992, 'steps': 19525, 'loss/train': 1.26532781124115} 11/06/2021 23:59:49 - INFO - __main__ - Step 19527: {'lr': 0.00048289838117933505, 'samples': 3749184, 'steps': 19526, 'loss/train': 1.8481258153915405} 11/06/2021 23:59:49 - INFO - __main__ - Step 19528: {'lr': 0.0004828964521161361, 'samples': 3749376, 'steps': 19527, 'loss/train': 1.413066029548645} 11/06/2021 23:59:50 - INFO - __main__ - Step 19529: {'lr': 0.0004828945229479975, 'samples': 3749568, 'steps': 19528, 'loss/train': 1.5402287244796753} 11/06/2021 23:59:50 - INFO - __main__ - Step 19530: {'lr': 0.0004828925936749202, 'samples': 3749760, 'steps': 19529, 'loss/train': 1.5571092367172241} 11/06/2021 23:59:50 - INFO - __main__ - Step 19531: {'lr': 0.0004828906642969052, 'samples': 3749952, 'steps': 19530, 'loss/train': 1.5103822946548462} 11/06/2021 23:59:51 - INFO - __main__ - Step 19532: {'lr': 0.00048288873481395323, 'samples': 3750144, 'steps': 19531, 'loss/train': 1.55597984790802} 11/06/2021 23:59:52 - INFO - __main__ - Step 19533: {'lr': 0.0004828868052260652, 'samples': 3750336, 'steps': 19532, 'loss/train': 1.014290690422058} 11/06/2021 23:59:52 - INFO - __main__ - Step 19534: {'lr': 0.0004828848755332419, 'samples': 3750528, 'steps': 19533, 'loss/train': 1.9036535024642944} 11/06/2021 23:59:52 - INFO - __main__ - Step 19535: {'lr': 0.0004828829457354843, 'samples': 3750720, 'steps': 19534, 'loss/train': 1.6254938840866089} 11/06/2021 23:59:53 - INFO - __main__ - Step 19536: {'lr': 0.0004828810158327933, 'samples': 3750912, 'steps': 19535, 'loss/train': 1.509027361869812} 11/06/2021 23:59:54 - INFO - __main__ - Step 19537: {'lr': 0.00048287908582516964, 'samples': 3751104, 'steps': 19536, 'loss/train': 1.6718204021453857} 11/06/2021 23:59:54 - INFO - __main__ - Step 19538: {'lr': 0.00048287715571261424, 'samples': 3751296, 'steps': 19537, 'loss/train': 1.5779062509536743} 11/06/2021 23:59:54 - INFO - __main__ - Step 19539: {'lr': 0.00048287522549512806, 'samples': 3751488, 'steps': 19538, 'loss/train': 1.7152458429336548} 11/06/2021 23:59:55 - INFO - __main__ - Step 19540: {'lr': 0.0004828732951727119, 'samples': 3751680, 'steps': 19539, 'loss/train': 1.4148067235946655} 11/06/2021 23:59:55 - INFO - __main__ - Step 19541: {'lr': 0.00048287136474536657, 'samples': 3751872, 'steps': 19540, 'loss/train': 2.09765362739563} 11/06/2021 23:59:56 - INFO - __main__ - Step 19542: {'lr': 0.000482869434213093, 'samples': 3752064, 'steps': 19541, 'loss/train': 1.6245235204696655} 11/06/2021 23:59:57 - INFO - __main__ - Step 19543: {'lr': 0.0004828675035758921, 'samples': 3752256, 'steps': 19542, 'loss/train': 1.6489465236663818} 11/06/2021 23:59:57 - INFO - __main__ - Step 19544: {'lr': 0.00048286557283376465, 'samples': 3752448, 'steps': 19543, 'loss/train': 1.6993317604064941} 11/06/2021 23:59:57 - INFO - __main__ - Step 19545: {'lr': 0.0004828636419867116, 'samples': 3752640, 'steps': 19544, 'loss/train': 2.0288567543029785} 11/06/2021 23:59:58 - INFO - __main__ - Step 19546: {'lr': 0.00048286171103473376, 'samples': 3752832, 'steps': 19545, 'loss/train': 1.3089454174041748} 11/07/2021 00:00:00 - INFO - __main__ - Step 19547: {'lr': 0.00048285977997783203, 'samples': 3753024, 'steps': 19546, 'loss/train': 1.9391834735870361} 11/07/2021 00:00:00 - INFO - __main__ - Step 19548: {'lr': 0.0004828578488160073, 'samples': 3753216, 'steps': 19547, 'loss/train': 1.6727211475372314} 11/07/2021 00:00:00 - INFO - __main__ - Step 19549: {'lr': 0.0004828559175492604, 'samples': 3753408, 'steps': 19548, 'loss/train': 1.9620425701141357} 11/07/2021 00:00:01 - INFO - __main__ - Step 19550: {'lr': 0.0004828539861775922, 'samples': 3753600, 'steps': 19549, 'loss/train': 2.087019205093384} 11/07/2021 00:00:01 - INFO - __main__ - Step 19551: {'lr': 0.0004828520547010036, 'samples': 3753792, 'steps': 19550, 'loss/train': 1.887309193611145} 11/07/2021 00:00:01 - INFO - __main__ - Step 19552: {'lr': 0.0004828501231194955, 'samples': 3753984, 'steps': 19551, 'loss/train': 1.9961159229278564} 11/07/2021 00:00:02 - INFO - __main__ - Step 19553: {'lr': 0.0004828481914330687, 'samples': 3754176, 'steps': 19552, 'loss/train': 1.8260117769241333} 11/07/2021 00:00:02 - INFO - __main__ - Step 19554: {'lr': 0.000482846259641724, 'samples': 3754368, 'steps': 19553, 'loss/train': 1.7672523260116577} 11/07/2021 00:00:03 - INFO - __main__ - Step 19555: {'lr': 0.0004828443277454625, 'samples': 3754560, 'steps': 19554, 'loss/train': 1.7993131875991821} 11/07/2021 00:00:04 - INFO - __main__ - Step 19556: {'lr': 0.0004828423957442849, 'samples': 3754752, 'steps': 19555, 'loss/train': 1.248684048652649} 11/07/2021 00:00:04 - INFO - __main__ - Step 19557: {'lr': 0.00048284046363819213, 'samples': 3754944, 'steps': 19556, 'loss/train': 1.7798717021942139} 11/07/2021 00:00:04 - INFO - __main__ - Step 19558: {'lr': 0.000482838531427185, 'samples': 3755136, 'steps': 19557, 'loss/train': 2.2403976917266846} 11/07/2021 00:00:05 - INFO - __main__ - Step 19559: {'lr': 0.00048283659911126445, 'samples': 3755328, 'steps': 19558, 'loss/train': 1.5755736827850342} 11/07/2021 00:00:06 - INFO - __main__ - Step 19560: {'lr': 0.0004828346666904313, 'samples': 3755520, 'steps': 19559, 'loss/train': 1.0093544721603394} 11/07/2021 00:00:06 - INFO - __main__ - Step 19561: {'lr': 0.00048283273416468644, 'samples': 3755712, 'steps': 19560, 'loss/train': 1.3120677471160889} 11/07/2021 00:00:06 - INFO - __main__ - Step 19562: {'lr': 0.0004828308015340307, 'samples': 3755904, 'steps': 19561, 'loss/train': 1.2885136604309082} 11/07/2021 00:00:07 - INFO - __main__ - Step 19563: {'lr': 0.0004828288687984651, 'samples': 3756096, 'steps': 19562, 'loss/train': 1.5286688804626465} 11/07/2021 00:00:07 - INFO - __main__ - Step 19564: {'lr': 0.0004828269359579903, 'samples': 3756288, 'steps': 19563, 'loss/train': 1.8564814329147339} 11/07/2021 00:00:08 - INFO - __main__ - Step 19565: {'lr': 0.00048282500301260735, 'samples': 3756480, 'steps': 19564, 'loss/train': 1.8124595880508423} 11/07/2021 00:00:09 - INFO - __main__ - Step 19566: {'lr': 0.000482823069962317, 'samples': 3756672, 'steps': 19565, 'loss/train': 1.629968285560608} 11/07/2021 00:00:09 - INFO - __main__ - Step 19567: {'lr': 0.0004828211368071202, 'samples': 3756864, 'steps': 19566, 'loss/train': 1.3446823358535767} 11/07/2021 00:00:09 - INFO - __main__ - Step 19568: {'lr': 0.0004828192035470178, 'samples': 3757056, 'steps': 19567, 'loss/train': 1.5391641855239868} 11/07/2021 00:00:10 - INFO - __main__ - Step 19569: {'lr': 0.00048281727018201063, 'samples': 3757248, 'steps': 19568, 'loss/train': 1.3339598178863525} 11/07/2021 00:00:11 - INFO - __main__ - Step 19570: {'lr': 0.00048281533671209955, 'samples': 3757440, 'steps': 19569, 'loss/train': 1.5045450925827026} 11/07/2021 00:00:11 - INFO - __main__ - Step 19571: {'lr': 0.0004828134031372855, 'samples': 3757632, 'steps': 19570, 'loss/train': 1.9711503982543945} 11/07/2021 00:00:11 - INFO - __main__ - Step 19572: {'lr': 0.00048281146945756937, 'samples': 3757824, 'steps': 19571, 'loss/train': 2.07639217376709} 11/07/2021 00:00:12 - INFO - __main__ - Step 19573: {'lr': 0.00048280953567295196, 'samples': 3758016, 'steps': 19572, 'loss/train': 1.6146386861801147} 11/07/2021 00:00:12 - INFO - __main__ - Step 19574: {'lr': 0.0004828076017834342, 'samples': 3758208, 'steps': 19573, 'loss/train': 1.6972767114639282} 11/07/2021 00:00:12 - INFO - __main__ - Step 19575: {'lr': 0.00048280566778901684, 'samples': 3758400, 'steps': 19574, 'loss/train': 1.4834022521972656} 11/07/2021 00:00:13 - INFO - __main__ - Step 19576: {'lr': 0.00048280373368970086, 'samples': 3758592, 'steps': 19575, 'loss/train': 2.030294418334961} 11/07/2021 00:00:14 - INFO - __main__ - Step 19577: {'lr': 0.0004828017994854872, 'samples': 3758784, 'steps': 19576, 'loss/train': 1.447499394416809} 11/07/2021 00:00:14 - INFO - __main__ - Step 19578: {'lr': 0.0004827998651763765, 'samples': 3758976, 'steps': 19577, 'loss/train': 1.5339030027389526} 11/07/2021 00:00:15 - INFO - __main__ - Step 19579: {'lr': 0.0004827979307623699, 'samples': 3759168, 'steps': 19578, 'loss/train': 1.9877415895462036} 11/07/2021 00:00:15 - INFO - __main__ - Step 19580: {'lr': 0.0004827959962434681, 'samples': 3759360, 'steps': 19579, 'loss/train': 1.5659407377243042} 11/07/2021 00:00:16 - INFO - __main__ - Step 19581: {'lr': 0.00048279406161967197, 'samples': 3759552, 'steps': 19580, 'loss/train': 2.1685266494750977} 11/07/2021 00:00:16 - INFO - __main__ - Step 19582: {'lr': 0.0004827921268909825, 'samples': 3759744, 'steps': 19581, 'loss/train': 2.012640953063965} 11/07/2021 00:00:17 - INFO - __main__ - Step 19583: {'lr': 0.0004827901920574005, 'samples': 3759936, 'steps': 19582, 'loss/train': 1.866450548171997} 11/07/2021 00:00:17 - INFO - __main__ - Step 19584: {'lr': 0.0004827882571189268, 'samples': 3760128, 'steps': 19583, 'loss/train': 1.6104263067245483} 11/07/2021 00:00:17 - INFO - __main__ - Step 19585: {'lr': 0.00048278632207556226, 'samples': 3760320, 'steps': 19584, 'loss/train': 1.3908783197402954} 11/07/2021 00:00:19 - INFO - __main__ - Step 19586: {'lr': 0.00048278438692730784, 'samples': 3760512, 'steps': 19585, 'loss/train': 1.4279026985168457} 11/07/2021 00:00:19 - INFO - __main__ - Step 19587: {'lr': 0.00048278245167416434, 'samples': 3760704, 'steps': 19586, 'loss/train': 1.6072896718978882} 11/07/2021 00:00:19 - INFO - __main__ - Step 19588: {'lr': 0.0004827805163161327, 'samples': 3760896, 'steps': 19587, 'loss/train': 1.6472357511520386} 11/07/2021 00:00:20 - INFO - __main__ - Step 19589: {'lr': 0.0004827785808532137, 'samples': 3761088, 'steps': 19588, 'loss/train': 2.886787176132202} 11/07/2021 00:00:20 - INFO - __main__ - Step 19590: {'lr': 0.0004827766452854083, 'samples': 3761280, 'steps': 19589, 'loss/train': 1.579707145690918} 11/07/2021 00:00:20 - INFO - __main__ - Step 19591: {'lr': 0.0004827747096127173, 'samples': 3761472, 'steps': 19590, 'loss/train': 1.1529046297073364} 11/07/2021 00:00:21 - INFO - __main__ - Step 19592: {'lr': 0.00048277277383514165, 'samples': 3761664, 'steps': 19591, 'loss/train': 1.4161953926086426} 11/07/2021 00:00:22 - INFO - __main__ - Step 19593: {'lr': 0.00048277083795268216, 'samples': 3761856, 'steps': 19592, 'loss/train': 2.133817434310913} 11/07/2021 00:00:22 - INFO - __main__ - Step 19594: {'lr': 0.0004827689019653397, 'samples': 3762048, 'steps': 19593, 'loss/train': 1.4859519004821777} 11/07/2021 00:00:22 - INFO - __main__ - Step 19595: {'lr': 0.00048276696587311525, 'samples': 3762240, 'steps': 19594, 'loss/train': 1.380395770072937} 11/07/2021 00:00:23 - INFO - __main__ - Step 19596: {'lr': 0.00048276502967600955, 'samples': 3762432, 'steps': 19595, 'loss/train': 1.3329964876174927} 11/07/2021 00:00:24 - INFO - __main__ - Step 19597: {'lr': 0.00048276309337402345, 'samples': 3762624, 'steps': 19596, 'loss/train': 1.8443204164505005} 11/07/2021 00:00:24 - INFO - __main__ - Step 19598: {'lr': 0.000482761156967158, 'samples': 3762816, 'steps': 19597, 'loss/train': 1.332189917564392} 11/07/2021 00:00:25 - INFO - __main__ - Step 19599: {'lr': 0.0004827592204554139, 'samples': 3763008, 'steps': 19598, 'loss/train': 1.2490015029907227} 11/07/2021 00:00:25 - INFO - __main__ - Step 19600: {'lr': 0.00048275728383879215, 'samples': 3763200, 'steps': 19599, 'loss/train': 1.4692695140838623} 11/07/2021 00:00:25 - INFO - __main__ - Step 19601: {'lr': 0.0004827553471172935, 'samples': 3763392, 'steps': 19600, 'loss/train': 1.5272186994552612} 11/07/2021 00:00:26 - INFO - __main__ - Step 19602: {'lr': 0.00048275341029091885, 'samples': 3763584, 'steps': 19601, 'loss/train': 1.5676167011260986} 11/07/2021 00:00:27 - INFO - __main__ - Step 19603: {'lr': 0.0004827514733596692, 'samples': 3763776, 'steps': 19602, 'loss/train': 1.6951826810836792} 11/07/2021 00:00:27 - INFO - __main__ - Step 19604: {'lr': 0.00048274953632354524, 'samples': 3763968, 'steps': 19603, 'loss/train': 1.893367886543274} 11/07/2021 00:00:27 - INFO - __main__ - Step 19605: {'lr': 0.000482747599182548, 'samples': 3764160, 'steps': 19604, 'loss/train': 1.5247362852096558} 11/07/2021 00:00:28 - INFO - __main__ - Step 19606: {'lr': 0.00048274566193667824, 'samples': 3764352, 'steps': 19605, 'loss/train': 1.5493274927139282} 11/07/2021 00:00:29 - INFO - __main__ - Step 19607: {'lr': 0.0004827437245859369, 'samples': 3764544, 'steps': 19606, 'loss/train': 1.8292338848114014} 11/07/2021 00:00:29 - INFO - __main__ - Step 19608: {'lr': 0.0004827417871303248, 'samples': 3764736, 'steps': 19607, 'loss/train': 1.5020709037780762} 11/07/2021 00:00:29 - INFO - __main__ - Step 19609: {'lr': 0.00048273984956984285, 'samples': 3764928, 'steps': 19608, 'loss/train': 1.4957561492919922} 11/07/2021 00:00:30 - INFO - __main__ - Step 19610: {'lr': 0.0004827379119044919, 'samples': 3765120, 'steps': 19609, 'loss/train': 1.7140650749206543} 11/07/2021 00:00:30 - INFO - __main__ - Step 19611: {'lr': 0.00048273597413427284, 'samples': 3765312, 'steps': 19610, 'loss/train': 1.3099769353866577} 11/07/2021 00:00:30 - INFO - __main__ - Step 19612: {'lr': 0.00048273403625918653, 'samples': 3765504, 'steps': 19611, 'loss/train': 1.9554831981658936} 11/07/2021 00:00:31 - INFO - __main__ - Step 19613: {'lr': 0.0004827320982792339, 'samples': 3765696, 'steps': 19612, 'loss/train': 1.5525972843170166} 11/07/2021 00:00:32 - INFO - __main__ - Step 19614: {'lr': 0.00048273016019441585, 'samples': 3765888, 'steps': 19613, 'loss/train': 1.6005520820617676} 11/07/2021 00:00:32 - INFO - __main__ - Step 19615: {'lr': 0.00048272822200473304, 'samples': 3766080, 'steps': 19614, 'loss/train': 1.727546215057373} 11/07/2021 00:00:32 - INFO - __main__ - Step 19616: {'lr': 0.0004827262837101866, 'samples': 3766272, 'steps': 19615, 'loss/train': 1.532706618309021} 11/07/2021 00:00:33 - INFO - __main__ - Step 19617: {'lr': 0.0004827243453107772, 'samples': 3766464, 'steps': 19616, 'loss/train': 1.8534809350967407} 11/07/2021 00:00:34 - INFO - __main__ - Step 19618: {'lr': 0.0004827224068065058, 'samples': 3766656, 'steps': 19617, 'loss/train': 1.6946210861206055} 11/07/2021 00:00:34 - INFO - __main__ - Step 19619: {'lr': 0.0004827204681973733, 'samples': 3766848, 'steps': 19618, 'loss/train': 1.5758514404296875} 11/07/2021 00:00:35 - INFO - __main__ - Step 19620: {'lr': 0.00048271852948338057, 'samples': 3767040, 'steps': 19619, 'loss/train': 1.195085048675537} 11/07/2021 00:00:35 - INFO - __main__ - Step 19621: {'lr': 0.00048271659066452847, 'samples': 3767232, 'steps': 19620, 'loss/train': 1.3183504343032837} 11/07/2021 00:00:35 - INFO - __main__ - Step 19622: {'lr': 0.0004827146517408178, 'samples': 3767424, 'steps': 19621, 'loss/train': 1.5881609916687012} 11/07/2021 00:00:36 - INFO - __main__ - Step 19623: {'lr': 0.0004827127127122495, 'samples': 3767616, 'steps': 19622, 'loss/train': 1.7038828134536743} 11/07/2021 00:00:37 - INFO - __main__ - Step 19624: {'lr': 0.00048271077357882455, 'samples': 3767808, 'steps': 19623, 'loss/train': 1.7045823335647583} 11/07/2021 00:00:37 - INFO - __main__ - Step 19625: {'lr': 0.00048270883434054364, 'samples': 3768000, 'steps': 19624, 'loss/train': 1.2590144872665405} 11/07/2021 00:00:37 - INFO - __main__ - Step 19626: {'lr': 0.00048270689499740774, 'samples': 3768192, 'steps': 19625, 'loss/train': 1.5791674852371216} 11/07/2021 00:00:38 - INFO - __main__ - Step 19627: {'lr': 0.0004827049555494176, 'samples': 3768384, 'steps': 19626, 'loss/train': 1.9357750415802002} 11/07/2021 00:00:39 - INFO - __main__ - Step 19628: {'lr': 0.00048270301599657436, 'samples': 3768576, 'steps': 19627, 'loss/train': 1.0141645669937134} 11/07/2021 00:00:39 - INFO - __main__ - Step 19629: {'lr': 0.0004827010763388786, 'samples': 3768768, 'steps': 19628, 'loss/train': 1.4488344192504883} 11/07/2021 00:00:39 - INFO - __main__ - Step 19630: {'lr': 0.00048269913657633147, 'samples': 3768960, 'steps': 19629, 'loss/train': 1.4897490739822388} 11/07/2021 00:00:40 - INFO - __main__ - Step 19631: {'lr': 0.00048269719670893357, 'samples': 3769152, 'steps': 19630, 'loss/train': 1.842172384262085} 11/07/2021 00:00:40 - INFO - __main__ - Step 19632: {'lr': 0.00048269525673668595, 'samples': 3769344, 'steps': 19631, 'loss/train': 1.7020254135131836} 11/07/2021 00:00:41 - INFO - __main__ - Step 19633: {'lr': 0.00048269331665958947, 'samples': 3769536, 'steps': 19632, 'loss/train': 1.7664917707443237} 11/07/2021 00:00:42 - INFO - __main__ - Step 19634: {'lr': 0.00048269137647764495, 'samples': 3769728, 'steps': 19633, 'loss/train': 1.8792760372161865} 11/07/2021 00:00:42 - INFO - __main__ - Step 19635: {'lr': 0.00048268943619085325, 'samples': 3769920, 'steps': 19634, 'loss/train': 1.3441412448883057} 11/07/2021 00:00:42 - INFO - __main__ - Step 19636: {'lr': 0.00048268749579921536, 'samples': 3770112, 'steps': 19635, 'loss/train': 1.066512107849121} 11/07/2021 00:00:43 - INFO - __main__ - Step 19637: {'lr': 0.00048268555530273197, 'samples': 3770304, 'steps': 19636, 'loss/train': 2.038100481033325} 11/07/2021 00:00:43 - INFO - __main__ - Step 19638: {'lr': 0.0004826836147014041, 'samples': 3770496, 'steps': 19637, 'loss/train': 1.2827800512313843} 11/07/2021 00:00:44 - INFO - __main__ - Step 19639: {'lr': 0.0004826816739952326, 'samples': 3770688, 'steps': 19638, 'loss/train': 1.3848270177841187} 11/07/2021 00:00:44 - INFO - __main__ - Step 19640: {'lr': 0.0004826797331842183, 'samples': 3770880, 'steps': 19639, 'loss/train': 1.6651759147644043} 11/07/2021 00:00:45 - INFO - __main__ - Step 19641: {'lr': 0.0004826777922683622, 'samples': 3771072, 'steps': 19640, 'loss/train': 1.8310139179229736} 11/07/2021 00:00:45 - INFO - __main__ - Step 19642: {'lr': 0.0004826758512476649, 'samples': 3771264, 'steps': 19641, 'loss/train': 1.6785047054290771} 11/07/2021 00:00:45 - INFO - __main__ - Step 19643: {'lr': 0.0004826739101221276, 'samples': 3771456, 'steps': 19642, 'loss/train': 1.0879031419754028} 11/07/2021 00:00:46 - INFO - __main__ - Step 19644: {'lr': 0.000482671968891751, 'samples': 3771648, 'steps': 19643, 'loss/train': 1.537487268447876} 11/07/2021 00:00:47 - INFO - __main__ - Step 19645: {'lr': 0.000482670027556536, 'samples': 3771840, 'steps': 19644, 'loss/train': 1.6221253871917725} 11/07/2021 00:00:47 - INFO - __main__ - Step 19646: {'lr': 0.0004826680861164834, 'samples': 3772032, 'steps': 19645, 'loss/train': 1.3264529705047607} 11/07/2021 00:00:47 - INFO - __main__ - Step 19647: {'lr': 0.00048266614457159426, 'samples': 3772224, 'steps': 19646, 'loss/train': 1.3037407398223877} 11/07/2021 00:00:48 - INFO - __main__ - Step 19648: {'lr': 0.0004826642029218693, 'samples': 3772416, 'steps': 19647, 'loss/train': 1.428112268447876} 11/07/2021 00:00:49 - INFO - __main__ - Step 19649: {'lr': 0.00048266226116730937, 'samples': 3772608, 'steps': 19648, 'loss/train': 1.5353032350540161} 11/07/2021 00:00:49 - INFO - __main__ - Step 19650: {'lr': 0.00048266031930791555, 'samples': 3772800, 'steps': 19649, 'loss/train': 1.3815804719924927} 11/07/2021 00:00:49 - INFO - __main__ - Step 19651: {'lr': 0.0004826583773436884, 'samples': 3772992, 'steps': 19650, 'loss/train': 1.9578912258148193} 11/07/2021 00:00:50 - INFO - __main__ - Step 19652: {'lr': 0.00048265643527462915, 'samples': 3773184, 'steps': 19651, 'loss/train': 1.3772332668304443} 11/07/2021 00:00:50 - INFO - __main__ - Step 19653: {'lr': 0.00048265449310073847, 'samples': 3773376, 'steps': 19652, 'loss/train': 1.2029601335525513} 11/07/2021 00:00:51 - INFO - __main__ - Step 19654: {'lr': 0.0004826525508220172, 'samples': 3773568, 'steps': 19653, 'loss/train': 1.0753741264343262} 11/07/2021 00:00:52 - INFO - __main__ - Step 19655: {'lr': 0.0004826506084384663, 'samples': 3773760, 'steps': 19654, 'loss/train': 1.2957723140716553} 11/07/2021 00:00:52 - INFO - __main__ - Step 19656: {'lr': 0.00048264866595008665, 'samples': 3773952, 'steps': 19655, 'loss/train': 1.1646695137023926} 11/07/2021 00:00:52 - INFO - __main__ - Step 19657: {'lr': 0.0004826467233568791, 'samples': 3774144, 'steps': 19656, 'loss/train': 1.3596159219741821} 11/07/2021 00:00:53 - INFO - __main__ - Step 19658: {'lr': 0.00048264478065884454, 'samples': 3774336, 'steps': 19657, 'loss/train': 1.6303644180297852} 11/07/2021 00:00:54 - INFO - __main__ - Step 19659: {'lr': 0.0004826428378559838, 'samples': 3774528, 'steps': 19658, 'loss/train': 1.8354287147521973} 11/07/2021 00:00:54 - INFO - __main__ - Step 19660: {'lr': 0.00048264089494829776, 'samples': 3774720, 'steps': 19659, 'loss/train': 1.5434880256652832} 11/07/2021 00:00:54 - INFO - __main__ - Step 19661: {'lr': 0.0004826389519357874, 'samples': 3774912, 'steps': 19660, 'loss/train': 1.5627186298370361} 11/07/2021 00:00:55 - INFO - __main__ - Step 19662: {'lr': 0.00048263700881845346, 'samples': 3775104, 'steps': 19661, 'loss/train': 2.0011045932769775} 11/07/2021 00:00:55 - INFO - __main__ - Step 19663: {'lr': 0.00048263506559629687, 'samples': 3775296, 'steps': 19662, 'loss/train': 1.4480820894241333} 11/07/2021 00:00:56 - INFO - __main__ - Step 19664: {'lr': 0.00048263312226931853, 'samples': 3775488, 'steps': 19663, 'loss/train': 2.256391763687134} 11/07/2021 00:00:56 - INFO - __main__ - Step 19665: {'lr': 0.0004826311788375193, 'samples': 3775680, 'steps': 19664, 'loss/train': 1.180256962776184} 11/07/2021 00:00:57 - INFO - __main__ - Step 19666: {'lr': 0.00048262923530090007, 'samples': 3775872, 'steps': 19665, 'loss/train': 1.7530802488327026} 11/07/2021 00:00:57 - INFO - __main__ - Step 19667: {'lr': 0.0004826272916594616, 'samples': 3776064, 'steps': 19666, 'loss/train': 1.6622743606567383} 11/07/2021 00:00:58 - INFO - __main__ - Step 19668: {'lr': 0.000482625347913205, 'samples': 3776256, 'steps': 19667, 'loss/train': 1.727860450744629} 11/07/2021 00:00:59 - INFO - __main__ - Step 19669: {'lr': 0.0004826234040621309, 'samples': 3776448, 'steps': 19668, 'loss/train': 1.6675604581832886} 11/07/2021 00:00:59 - INFO - __main__ - Step 19670: {'lr': 0.00048262146010624035, 'samples': 3776640, 'steps': 19669, 'loss/train': 1.950370192527771} 11/07/2021 00:00:59 - INFO - __main__ - Step 19671: {'lr': 0.0004826195160455341, 'samples': 3776832, 'steps': 19670, 'loss/train': 1.9079294204711914} 11/07/2021 00:01:00 - INFO - __main__ - Step 19672: {'lr': 0.00048261757188001314, 'samples': 3777024, 'steps': 19671, 'loss/train': 1.0746430158615112} 11/07/2021 00:01:00 - INFO - __main__ - Step 19673: {'lr': 0.00048261562760967824, 'samples': 3777216, 'steps': 19672, 'loss/train': 1.936623215675354} 11/07/2021 00:01:00 - INFO - __main__ - Step 19674: {'lr': 0.0004826136832345304, 'samples': 3777408, 'steps': 19673, 'loss/train': 1.9064000844955444} 11/07/2021 00:01:01 - INFO - __main__ - Step 19675: {'lr': 0.00048261173875457035, 'samples': 3777600, 'steps': 19674, 'loss/train': 2.2525835037231445} 11/07/2021 00:01:02 - INFO - __main__ - Step 19676: {'lr': 0.0004826097941697991, 'samples': 3777792, 'steps': 19675, 'loss/train': 1.700140118598938} 11/07/2021 00:01:02 - INFO - __main__ - Step 19677: {'lr': 0.0004826078494802174, 'samples': 3777984, 'steps': 19676, 'loss/train': 1.8438869714736938} 11/07/2021 00:01:02 - INFO - __main__ - Step 19678: {'lr': 0.00048260590468582624, 'samples': 3778176, 'steps': 19677, 'loss/train': 1.5107479095458984} 11/07/2021 00:01:03 - INFO - __main__ - Step 19679: {'lr': 0.0004826039597866265, 'samples': 3778368, 'steps': 19678, 'loss/train': 1.5821765661239624} 11/07/2021 00:01:04 - INFO - __main__ - Step 19680: {'lr': 0.00048260201478261887, 'samples': 3778560, 'steps': 19679, 'loss/train': 1.5964267253875732} 11/07/2021 00:01:04 - INFO - __main__ - Step 19681: {'lr': 0.0004826000696738045, 'samples': 3778752, 'steps': 19680, 'loss/train': 1.6593513488769531} 11/07/2021 00:01:05 - INFO - __main__ - Step 19682: {'lr': 0.000482598124460184, 'samples': 3778944, 'steps': 19681, 'loss/train': 0.817847490310669} 11/07/2021 00:01:05 - INFO - __main__ - Step 19683: {'lr': 0.00048259617914175846, 'samples': 3779136, 'steps': 19682, 'loss/train': 1.4910651445388794} 11/07/2021 00:01:05 - INFO - __main__ - Step 19684: {'lr': 0.00048259423371852867, 'samples': 3779328, 'steps': 19683, 'loss/train': 1.6026617288589478} 11/07/2021 00:01:06 - INFO - __main__ - Step 19685: {'lr': 0.0004825922881904955, 'samples': 3779520, 'steps': 19684, 'loss/train': 1.4363633394241333} 11/07/2021 00:01:07 - INFO - __main__ - Step 19686: {'lr': 0.00048259034255765984, 'samples': 3779712, 'steps': 19685, 'loss/train': 1.4773492813110352} 11/07/2021 00:01:07 - INFO - __main__ - Step 19687: {'lr': 0.00048258839682002253, 'samples': 3779904, 'steps': 19686, 'loss/train': 0.8131615519523621} 11/07/2021 00:01:07 - INFO - __main__ - Step 19688: {'lr': 0.00048258645097758445, 'samples': 3780096, 'steps': 19687, 'loss/train': 1.3371987342834473} 11/07/2021 00:01:08 - INFO - __main__ - Step 19689: {'lr': 0.0004825845050303466, 'samples': 3780288, 'steps': 19688, 'loss/train': 1.4941506385803223} 11/07/2021 00:01:09 - INFO - __main__ - Step 19690: {'lr': 0.00048258255897830967, 'samples': 3780480, 'steps': 19689, 'loss/train': 1.6974849700927734} 11/07/2021 00:01:09 - INFO - __main__ - Step 19691: {'lr': 0.0004825806128214747, 'samples': 3780672, 'steps': 19690, 'loss/train': 1.7099860906600952} 11/07/2021 00:01:09 - INFO - __main__ - Step 19692: {'lr': 0.00048257866655984237, 'samples': 3780864, 'steps': 19691, 'loss/train': 1.6285266876220703} 11/07/2021 00:01:10 - INFO - __main__ - Step 19693: {'lr': 0.0004825767201934138, 'samples': 3781056, 'steps': 19692, 'loss/train': 1.9358177185058594} 11/07/2021 00:01:10 - INFO - __main__ - Step 19694: {'lr': 0.0004825747737221897, 'samples': 3781248, 'steps': 19693, 'loss/train': 1.5827686786651611} 11/07/2021 00:01:11 - INFO - __main__ - Step 19695: {'lr': 0.000482572827146171, 'samples': 3781440, 'steps': 19694, 'loss/train': 1.8137426376342773} 11/07/2021 00:01:11 - INFO - __main__ - Step 19696: {'lr': 0.00048257088046535864, 'samples': 3781632, 'steps': 19695, 'loss/train': 1.3611456155776978} 11/07/2021 00:01:12 - INFO - __main__ - Step 19697: {'lr': 0.0004825689336797534, 'samples': 3781824, 'steps': 19696, 'loss/train': 1.5496810674667358} 11/07/2021 00:01:12 - INFO - __main__ - Step 19698: {'lr': 0.00048256698678935615, 'samples': 3782016, 'steps': 19697, 'loss/train': 1.8349579572677612} 11/07/2021 00:01:13 - INFO - __main__ - Step 19699: {'lr': 0.00048256503979416776, 'samples': 3782208, 'steps': 19698, 'loss/train': 1.5465449094772339} 11/07/2021 00:01:13 - INFO - __main__ - Step 19700: {'lr': 0.0004825630926941892, 'samples': 3782400, 'steps': 19699, 'loss/train': 1.3311418294906616} 11/07/2021 00:01:14 - INFO - __main__ - Step 19701: {'lr': 0.0004825611454894213, 'samples': 3782592, 'steps': 19700, 'loss/train': 1.6302708387374878} 11/07/2021 00:01:14 - INFO - __main__ - Step 19702: {'lr': 0.000482559198179865, 'samples': 3782784, 'steps': 19701, 'loss/train': 1.5560293197631836} 11/07/2021 00:01:15 - INFO - __main__ - Step 19703: {'lr': 0.00048255725076552103, 'samples': 3782976, 'steps': 19702, 'loss/train': 1.8703819513320923} 11/07/2021 00:01:15 - INFO - __main__ - Step 19704: {'lr': 0.0004825553032463904, 'samples': 3783168, 'steps': 19703, 'loss/train': 1.48812735080719} 11/07/2021 00:01:15 - INFO - __main__ - Step 19705: {'lr': 0.00048255335562247395, 'samples': 3783360, 'steps': 19704, 'loss/train': 1.8443870544433594} 11/07/2021 00:01:16 - INFO - __main__ - Step 19706: {'lr': 0.0004825514078937725, 'samples': 3783552, 'steps': 19705, 'loss/train': 1.9899576902389526} 11/07/2021 00:01:17 - INFO - __main__ - Step 19707: {'lr': 0.000482549460060287, 'samples': 3783744, 'steps': 19706, 'loss/train': 1.6891878843307495} 11/07/2021 00:01:17 - INFO - __main__ - Step 19708: {'lr': 0.0004825475121220183, 'samples': 3783936, 'steps': 19707, 'loss/train': 1.7115875482559204} 11/07/2021 00:01:17 - INFO - __main__ - Step 19709: {'lr': 0.0004825455640789672, 'samples': 3784128, 'steps': 19708, 'loss/train': 2.0251336097717285} 11/07/2021 00:01:18 - INFO - __main__ - Step 19710: {'lr': 0.00048254361593113475, 'samples': 3784320, 'steps': 19709, 'loss/train': 1.9587410688400269} 11/07/2021 00:01:19 - INFO - __main__ - Step 19711: {'lr': 0.0004825416676785217, 'samples': 3784512, 'steps': 19710, 'loss/train': 2.022176742553711} 11/07/2021 00:01:19 - INFO - __main__ - Step 19712: {'lr': 0.000482539719321129, 'samples': 3784704, 'steps': 19711, 'loss/train': 2.0017166137695312} 11/07/2021 00:01:19 - INFO - __main__ - Step 19713: {'lr': 0.00048253777085895745, 'samples': 3784896, 'steps': 19712, 'loss/train': 1.0912806987762451} 11/07/2021 00:01:20 - INFO - __main__ - Step 19714: {'lr': 0.000482535822292008, 'samples': 3785088, 'steps': 19713, 'loss/train': 1.5614527463912964} 11/07/2021 00:01:20 - INFO - __main__ - Step 19715: {'lr': 0.0004825338736202815, 'samples': 3785280, 'steps': 19714, 'loss/train': 1.2009432315826416} 11/07/2021 00:01:21 - INFO - __main__ - Step 19716: {'lr': 0.00048253192484377884, 'samples': 3785472, 'steps': 19715, 'loss/train': 1.6705501079559326} 11/07/2021 00:01:21 - INFO - __main__ - Step 19717: {'lr': 0.0004825299759625008, 'samples': 3785664, 'steps': 19716, 'loss/train': 1.6005103588104248} 11/07/2021 00:01:22 - INFO - __main__ - Step 19718: {'lr': 0.0004825280269764484, 'samples': 3785856, 'steps': 19717, 'loss/train': 1.6898348331451416} 11/07/2021 00:01:22 - INFO - __main__ - Step 19719: {'lr': 0.0004825260778856224, 'samples': 3786048, 'steps': 19718, 'loss/train': 1.2682095766067505} 11/07/2021 00:01:22 - INFO - __main__ - Step 19720: {'lr': 0.0004825241286900238, 'samples': 3786240, 'steps': 19719, 'loss/train': 1.398619294166565} 11/07/2021 00:01:24 - INFO - __main__ - Step 19721: {'lr': 0.0004825221793896535, 'samples': 3786432, 'steps': 19720, 'loss/train': 1.7995117902755737} 11/07/2021 00:01:24 - INFO - __main__ - Step 19722: {'lr': 0.0004825202299845122, 'samples': 3786624, 'steps': 19721, 'loss/train': 1.4003227949142456} 11/07/2021 00:01:24 - INFO - __main__ - Step 19723: {'lr': 0.00048251828047460077, 'samples': 3786816, 'steps': 19722, 'loss/train': 1.677733063697815} 11/07/2021 00:01:25 - INFO - __main__ - Step 19724: {'lr': 0.0004825163308599203, 'samples': 3787008, 'steps': 19723, 'loss/train': 1.8848963975906372} 11/07/2021 00:01:25 - INFO - __main__ - Step 19725: {'lr': 0.0004825143811404716, 'samples': 3787200, 'steps': 19724, 'loss/train': 1.6684176921844482} 11/07/2021 00:01:26 - INFO - __main__ - Step 19726: {'lr': 0.00048251243131625543, 'samples': 3787392, 'steps': 19725, 'loss/train': 1.2181226015090942} 11/07/2021 00:01:27 - INFO - __main__ - Step 19727: {'lr': 0.0004825104813872728, 'samples': 3787584, 'steps': 19726, 'loss/train': 1.703728437423706} 11/07/2021 00:01:27 - INFO - __main__ - Step 19728: {'lr': 0.0004825085313535245, 'samples': 3787776, 'steps': 19727, 'loss/train': 1.7474498748779297} 11/07/2021 00:01:27 - INFO - __main__ - Step 19729: {'lr': 0.00048250658121501145, 'samples': 3787968, 'steps': 19728, 'loss/train': 0.8166286945343018} 11/07/2021 00:01:28 - INFO - __main__ - Step 19730: {'lr': 0.00048250463097173447, 'samples': 3788160, 'steps': 19729, 'loss/train': 1.8923614025115967} 11/07/2021 00:01:29 - INFO - __main__ - Step 19731: {'lr': 0.0004825026806236946, 'samples': 3788352, 'steps': 19730, 'loss/train': 1.692439079284668} 11/07/2021 00:01:29 - INFO - __main__ - Step 19732: {'lr': 0.00048250073017089257, 'samples': 3788544, 'steps': 19731, 'loss/train': 1.34873366355896} 11/07/2021 00:01:30 - INFO - __main__ - Step 19733: {'lr': 0.00048249877961332923, 'samples': 3788736, 'steps': 19732, 'loss/train': 1.4355651140213013} 11/07/2021 00:01:30 - INFO - __main__ - Step 19734: {'lr': 0.0004824968289510056, 'samples': 3788928, 'steps': 19733, 'loss/train': 3.0089304447174072} 11/07/2021 00:01:30 - INFO - __main__ - Step 19735: {'lr': 0.0004824948781839225, 'samples': 3789120, 'steps': 19734, 'loss/train': 1.7068781852722168} 11/07/2021 00:01:31 - INFO - __main__ - Step 19736: {'lr': 0.0004824929273120807, 'samples': 3789312, 'steps': 19735, 'loss/train': 2.0317676067352295} 11/07/2021 00:01:32 - INFO - __main__ - Step 19737: {'lr': 0.0004824909763354813, 'samples': 3789504, 'steps': 19736, 'loss/train': 1.9025362730026245} 11/07/2021 00:01:32 - INFO - __main__ - Step 19738: {'lr': 0.00048248902525412497, 'samples': 3789696, 'steps': 19737, 'loss/train': 1.912577748298645} 11/07/2021 00:01:32 - INFO - __main__ - Step 19739: {'lr': 0.0004824870740680127, 'samples': 3789888, 'steps': 19738, 'loss/train': 2.0602569580078125} 11/07/2021 00:01:33 - INFO - __main__ - Step 19740: {'lr': 0.0004824851227771453, 'samples': 3790080, 'steps': 19739, 'loss/train': 1.4696168899536133} 11/07/2021 00:01:33 - INFO - __main__ - Step 19741: {'lr': 0.00048248317138152374, 'samples': 3790272, 'steps': 19740, 'loss/train': 2.4418163299560547} 11/07/2021 00:01:34 - INFO - __main__ - Step 19742: {'lr': 0.00048248121988114887, 'samples': 3790464, 'steps': 19741, 'loss/train': 1.8438314199447632} 11/07/2021 00:01:34 - INFO - __main__ - Step 19743: {'lr': 0.00048247926827602153, 'samples': 3790656, 'steps': 19742, 'loss/train': 1.5355260372161865} 11/07/2021 00:01:35 - INFO - __main__ - Step 19744: {'lr': 0.0004824773165661426, 'samples': 3790848, 'steps': 19743, 'loss/train': 1.5468591451644897} 11/07/2021 00:01:35 - INFO - __main__ - Step 19745: {'lr': 0.000482475364751513, 'samples': 3791040, 'steps': 19744, 'loss/train': 1.805572271347046} 11/07/2021 00:01:35 - INFO - __main__ - Step 19746: {'lr': 0.0004824734128321335, 'samples': 3791232, 'steps': 19745, 'loss/train': 1.6561604738235474} 11/07/2021 00:01:36 - INFO - __main__ - Step 19747: {'lr': 0.0004824714608080052, 'samples': 3791424, 'steps': 19746, 'loss/train': 1.387292504310608} 11/07/2021 00:01:37 - INFO - __main__ - Step 19748: {'lr': 0.00048246950867912873, 'samples': 3791616, 'steps': 19747, 'loss/train': 1.7688283920288086} 11/07/2021 00:01:37 - INFO - __main__ - Step 19749: {'lr': 0.0004824675564455052, 'samples': 3791808, 'steps': 19748, 'loss/train': 4.455977916717529} 11/07/2021 00:01:38 - INFO - __main__ - Step 19750: {'lr': 0.0004824656041071353, 'samples': 3792000, 'steps': 19749, 'loss/train': 1.741172432899475} 11/07/2021 00:01:38 - INFO - __main__ - Step 19751: {'lr': 0.00048246365166402003, 'samples': 3792192, 'steps': 19750, 'loss/train': 1.72153902053833} 11/07/2021 00:01:39 - INFO - __main__ - Step 19752: {'lr': 0.00048246169911616015, 'samples': 3792384, 'steps': 19751, 'loss/train': 1.7890325784683228} 11/07/2021 00:01:39 - INFO - __main__ - Step 19753: {'lr': 0.00048245974646355673, 'samples': 3792576, 'steps': 19752, 'loss/train': 1.2916069030761719} 11/07/2021 00:01:40 - INFO - __main__ - Step 19754: {'lr': 0.00048245779370621045, 'samples': 3792768, 'steps': 19753, 'loss/train': 1.873506784439087} 11/07/2021 00:01:40 - INFO - __main__ - Step 19755: {'lr': 0.0004824558408441223, 'samples': 3792960, 'steps': 19754, 'loss/train': 1.3477427959442139} 11/07/2021 00:01:40 - INFO - __main__ - Step 19756: {'lr': 0.00048245388787729316, 'samples': 3793152, 'steps': 19755, 'loss/train': 1.5922755002975464} 11/07/2021 00:01:41 - INFO - __main__ - Step 19757: {'lr': 0.00048245193480572383, 'samples': 3793344, 'steps': 19756, 'loss/train': 1.3742207288742065} 11/07/2021 00:01:42 - INFO - __main__ - Step 19758: {'lr': 0.0004824499816294152, 'samples': 3793536, 'steps': 19757, 'loss/train': 1.4603769779205322} 11/07/2021 00:01:42 - INFO - __main__ - Step 19759: {'lr': 0.0004824480283483683, 'samples': 3793728, 'steps': 19758, 'loss/train': 1.604667067527771} 11/07/2021 00:01:42 - INFO - __main__ - Step 19760: {'lr': 0.0004824460749625839, 'samples': 3793920, 'steps': 19759, 'loss/train': 1.4646573066711426} 11/07/2021 00:01:43 - INFO - __main__ - Step 19761: {'lr': 0.00048244412147206283, 'samples': 3794112, 'steps': 19760, 'loss/train': 1.8121919631958008} 11/07/2021 00:01:43 - INFO - __main__ - Step 19762: {'lr': 0.00048244216787680607, 'samples': 3794304, 'steps': 19761, 'loss/train': 1.9884823560714722} 11/07/2021 00:01:44 - INFO - __main__ - Step 19763: {'lr': 0.0004824402141768145, 'samples': 3794496, 'steps': 19762, 'loss/train': 1.7847453355789185} 11/07/2021 00:01:45 - INFO - __main__ - Step 19764: {'lr': 0.0004824382603720888, 'samples': 3794688, 'steps': 19763, 'loss/train': 1.5830997228622437} 11/07/2021 00:01:45 - INFO - __main__ - Step 19765: {'lr': 0.00048243630646263016, 'samples': 3794880, 'steps': 19764, 'loss/train': 0.8904869556427002} 11/07/2021 00:01:46 - INFO - __main__ - Step 19766: {'lr': 0.00048243435244843926, 'samples': 3795072, 'steps': 19765, 'loss/train': 2.7253293991088867} 11/07/2021 00:01:46 - INFO - __main__ - Step 19767: {'lr': 0.000482432398329517, 'samples': 3795264, 'steps': 19766, 'loss/train': 1.6028205156326294} 11/07/2021 00:01:47 - INFO - __main__ - Step 19768: {'lr': 0.00048243044410586433, 'samples': 3795456, 'steps': 19767, 'loss/train': 1.8924862146377563} 11/07/2021 00:01:47 - INFO - __main__ - Step 19769: {'lr': 0.00048242848977748205, 'samples': 3795648, 'steps': 19768, 'loss/train': 1.5213522911071777} 11/07/2021 00:01:48 - INFO - __main__ - Step 19770: {'lr': 0.0004824265353443711, 'samples': 3795840, 'steps': 19769, 'loss/train': 1.878341794013977} 11/07/2021 00:01:48 - INFO - __main__ - Step 19771: {'lr': 0.00048242458080653233, 'samples': 3796032, 'steps': 19770, 'loss/train': 1.079317569732666} 11/07/2021 00:01:48 - INFO - __main__ - Step 19772: {'lr': 0.0004824226261639666, 'samples': 3796224, 'steps': 19771, 'loss/train': 1.2419971227645874} 11/07/2021 00:01:49 - INFO - __main__ - Step 19773: {'lr': 0.00048242067141667487, 'samples': 3796416, 'steps': 19772, 'loss/train': 1.6807929277420044} 11/07/2021 00:01:50 - INFO - __main__ - Step 19774: {'lr': 0.00048241871656465795, 'samples': 3796608, 'steps': 19773, 'loss/train': 1.5872936248779297} 11/07/2021 00:01:50 - INFO - __main__ - Step 19775: {'lr': 0.0004824167616079168, 'samples': 3796800, 'steps': 19774, 'loss/train': 0.9091289639472961} 11/07/2021 00:01:50 - INFO - __main__ - Step 19776: {'lr': 0.0004824148065464522, 'samples': 3796992, 'steps': 19775, 'loss/train': 0.828349232673645} 11/07/2021 00:01:51 - INFO - __main__ - Step 19777: {'lr': 0.00048241285138026505, 'samples': 3797184, 'steps': 19776, 'loss/train': 1.399821400642395} 11/07/2021 00:01:52 - INFO - __main__ - Step 19778: {'lr': 0.00048241089610935627, 'samples': 3797376, 'steps': 19777, 'loss/train': 1.6102732419967651} 11/07/2021 00:01:52 - INFO - __main__ - Step 19779: {'lr': 0.0004824089407337267, 'samples': 3797568, 'steps': 19778, 'loss/train': 1.6403769254684448} 11/07/2021 00:01:52 - INFO - __main__ - Step 19780: {'lr': 0.00048240698525337726, 'samples': 3797760, 'steps': 19779, 'loss/train': 1.7121052742004395} 11/07/2021 00:01:53 - INFO - __main__ - Step 19781: {'lr': 0.0004824050296683089, 'samples': 3797952, 'steps': 19780, 'loss/train': 1.6432030200958252} 11/07/2021 00:01:53 - INFO - __main__ - Step 19782: {'lr': 0.0004824030739785223, 'samples': 3798144, 'steps': 19781, 'loss/train': 1.6968430280685425} 11/07/2021 00:01:53 - INFO - __main__ - Step 19783: {'lr': 0.00048240111818401854, 'samples': 3798336, 'steps': 19782, 'loss/train': 1.702683925628662} 11/07/2021 00:01:55 - INFO - __main__ - Step 19784: {'lr': 0.0004823991622847984, 'samples': 3798528, 'steps': 19783, 'loss/train': 1.420841932296753} 11/07/2021 00:01:55 - INFO - __main__ - Step 19785: {'lr': 0.0004823972062808628, 'samples': 3798720, 'steps': 19784, 'loss/train': 2.2331371307373047} 11/07/2021 00:01:55 - INFO - __main__ - Step 19786: {'lr': 0.0004823952501722126, 'samples': 3798912, 'steps': 19785, 'loss/train': 1.8033604621887207} 11/07/2021 00:01:56 - INFO - __main__ - Step 19787: {'lr': 0.00048239329395884865, 'samples': 3799104, 'steps': 19786, 'loss/train': 1.480093240737915} 11/07/2021 00:01:56 - INFO - __main__ - Step 19788: {'lr': 0.00048239133764077193, 'samples': 3799296, 'steps': 19787, 'loss/train': 1.7165354490280151} 11/07/2021 00:01:57 - INFO - __main__ - Step 19789: {'lr': 0.00048238938121798313, 'samples': 3799488, 'steps': 19788, 'loss/train': 1.7748910188674927} 11/07/2021 00:01:57 - INFO - __main__ - Step 19790: {'lr': 0.00048238742469048344, 'samples': 3799680, 'steps': 19789, 'loss/train': 1.333566427230835} 11/07/2021 00:01:58 - INFO - __main__ - Step 19791: {'lr': 0.00048238546805827345, 'samples': 3799872, 'steps': 19790, 'loss/train': 1.7777206897735596} 11/07/2021 00:01:58 - INFO - __main__ - Step 19792: {'lr': 0.00048238351132135415, 'samples': 3800064, 'steps': 19791, 'loss/train': 1.79335355758667} 11/07/2021 00:01:59 - INFO - __main__ - Step 19793: {'lr': 0.0004823815544797265, 'samples': 3800256, 'steps': 19792, 'loss/train': 1.7088088989257812} 11/07/2021 00:02:00 - INFO - __main__ - Step 19794: {'lr': 0.0004823795975333912, 'samples': 3800448, 'steps': 19793, 'loss/train': 1.7255698442459106} 11/07/2021 00:02:00 - INFO - __main__ - Step 19795: {'lr': 0.0004823776404823493, 'samples': 3800640, 'steps': 19794, 'loss/train': 0.7900757193565369} 11/07/2021 00:02:00 - INFO - __main__ - Step 19796: {'lr': 0.00048237568332660163, 'samples': 3800832, 'steps': 19795, 'loss/train': 1.8653993606567383} 11/07/2021 00:02:01 - INFO - __main__ - Step 19797: {'lr': 0.0004823737260661491, 'samples': 3801024, 'steps': 19796, 'loss/train': 2.3506391048431396} 11/07/2021 00:02:01 - INFO - __main__ - Step 19798: {'lr': 0.00048237176870099256, 'samples': 3801216, 'steps': 19797, 'loss/train': 1.4605416059494019} 11/07/2021 00:02:01 - INFO - __main__ - Step 19799: {'lr': 0.0004823698112311328, 'samples': 3801408, 'steps': 19798, 'loss/train': 1.5340802669525146} 11/07/2021 00:02:02 - INFO - __main__ - Step 19800: {'lr': 0.00048236785365657076, 'samples': 3801600, 'steps': 19799, 'loss/train': 1.685361623764038} 11/07/2021 00:02:03 - INFO - __main__ - Step 19801: {'lr': 0.00048236589597730744, 'samples': 3801792, 'steps': 19800, 'loss/train': 1.3878642320632935} 11/07/2021 00:02:03 - INFO - __main__ - Step 19802: {'lr': 0.00048236393819334363, 'samples': 3801984, 'steps': 19801, 'loss/train': 1.7285722494125366} 11/07/2021 00:02:03 - INFO - __main__ - Step 19803: {'lr': 0.0004823619803046802, 'samples': 3802176, 'steps': 19802, 'loss/train': 1.3159254789352417} 11/07/2021 00:02:04 - INFO - __main__ - Step 19804: {'lr': 0.00048236002231131803, 'samples': 3802368, 'steps': 19803, 'loss/train': 1.5569478273391724} 11/07/2021 00:02:05 - INFO - __main__ - Step 19805: {'lr': 0.00048235806421325803, 'samples': 3802560, 'steps': 19804, 'loss/train': 1.056659460067749} 11/07/2021 00:02:05 - INFO - __main__ - Step 19806: {'lr': 0.0004823561060105011, 'samples': 3802752, 'steps': 19805, 'loss/train': 1.6271388530731201} 11/07/2021 00:02:06 - INFO - __main__ - Step 19807: {'lr': 0.00048235414770304803, 'samples': 3802944, 'steps': 19806, 'loss/train': 1.493626356124878} 11/07/2021 00:02:06 - INFO - __main__ - Step 19808: {'lr': 0.00048235218929089987, 'samples': 3803136, 'steps': 19807, 'loss/train': 0.7152420878410339} 11/07/2021 00:02:06 - INFO - __main__ - Step 19809: {'lr': 0.00048235023077405724, 'samples': 3803328, 'steps': 19808, 'loss/train': 1.6068490743637085} 11/07/2021 00:02:07 - INFO - __main__ - Step 19810: {'lr': 0.0004823482721525213, 'samples': 3803520, 'steps': 19809, 'loss/train': 1.579464316368103} 11/07/2021 00:02:08 - INFO - __main__ - Step 19811: {'lr': 0.0004823463134262928, 'samples': 3803712, 'steps': 19810, 'loss/train': 1.4074981212615967} 11/07/2021 00:02:08 - INFO - __main__ - Step 19812: {'lr': 0.00048234435459537265, 'samples': 3803904, 'steps': 19811, 'loss/train': 1.2117066383361816} 11/07/2021 00:02:08 - INFO - __main__ - Step 19813: {'lr': 0.0004823423956597617, 'samples': 3804096, 'steps': 19812, 'loss/train': 1.3268944025039673} 11/07/2021 00:02:09 - INFO - __main__ - Step 19814: {'lr': 0.0004823404366194608, 'samples': 3804288, 'steps': 19813, 'loss/train': 1.8814929723739624} 11/07/2021 00:02:10 - INFO - __main__ - Step 19815: {'lr': 0.0004823384774744709, 'samples': 3804480, 'steps': 19814, 'loss/train': 1.1520098447799683} 11/07/2021 00:02:10 - INFO - __main__ - Step 19816: {'lr': 0.000482336518224793, 'samples': 3804672, 'steps': 19815, 'loss/train': 1.8974096775054932} 11/07/2021 00:02:11 - INFO - __main__ - Step 19817: {'lr': 0.00048233455887042764, 'samples': 3804864, 'steps': 19816, 'loss/train': 0.9365611672401428} 11/07/2021 00:02:11 - INFO - __main__ - Step 19818: {'lr': 0.0004823325994113761, 'samples': 3805056, 'steps': 19817, 'loss/train': 1.7103739976882935} 11/07/2021 00:02:11 - INFO - __main__ - Step 19819: {'lr': 0.00048233063984763895, 'samples': 3805248, 'steps': 19818, 'loss/train': 1.6624081134796143} 11/07/2021 00:02:12 - INFO - __main__ - Step 19820: {'lr': 0.0004823286801792173, 'samples': 3805440, 'steps': 19819, 'loss/train': 1.8712867498397827} 11/07/2021 00:02:13 - INFO - __main__ - Step 19821: {'lr': 0.0004823267204061118, 'samples': 3805632, 'steps': 19820, 'loss/train': 1.5832374095916748} 11/07/2021 00:02:13 - INFO - __main__ - Step 19822: {'lr': 0.0004823247605283236, 'samples': 3805824, 'steps': 19821, 'loss/train': 1.4720865488052368} 11/07/2021 00:02:13 - INFO - __main__ - Step 19823: {'lr': 0.0004823228005458534, 'samples': 3806016, 'steps': 19822, 'loss/train': 1.4312366247177124} 11/07/2021 00:02:14 - INFO - __main__ - Step 19824: {'lr': 0.00048232084045870204, 'samples': 3806208, 'steps': 19823, 'loss/train': 1.6238641738891602} 11/07/2021 00:02:14 - INFO - __main__ - Step 19825: {'lr': 0.00048231888026687065, 'samples': 3806400, 'steps': 19824, 'loss/train': 1.13937509059906} 11/07/2021 00:02:15 - INFO - __main__ - Step 19826: {'lr': 0.00048231691997035987, 'samples': 3806592, 'steps': 19825, 'loss/train': 1.6825331449508667} 11/07/2021 00:02:15 - INFO - __main__ - Step 19827: {'lr': 0.00048231495956917067, 'samples': 3806784, 'steps': 19826, 'loss/train': 1.4865928888320923} 11/07/2021 00:02:16 - INFO - __main__ - Step 19828: {'lr': 0.00048231299906330397, 'samples': 3806976, 'steps': 19827, 'loss/train': 1.6881486177444458} 11/07/2021 00:02:16 - INFO - __main__ - Step 19829: {'lr': 0.0004823110384527606, 'samples': 3807168, 'steps': 19828, 'loss/train': 1.7928725481033325} 11/07/2021 00:02:16 - INFO - __main__ - Step 19830: {'lr': 0.0004823090777375414, 'samples': 3807360, 'steps': 19829, 'loss/train': 2.1048223972320557} 11/07/2021 00:02:18 - INFO - __main__ - Step 19831: {'lr': 0.0004823071169176474, 'samples': 3807552, 'steps': 19830, 'loss/train': 1.4070188999176025} 11/07/2021 00:02:18 - INFO - __main__ - Step 19832: {'lr': 0.00048230515599307933, 'samples': 3807744, 'steps': 19831, 'loss/train': 1.6805315017700195} 11/07/2021 00:02:18 - INFO - __main__ - Step 19833: {'lr': 0.0004823031949638382, 'samples': 3807936, 'steps': 19832, 'loss/train': 1.3850849866867065} 11/07/2021 00:02:19 - INFO - __main__ - Step 19834: {'lr': 0.0004823012338299248, 'samples': 3808128, 'steps': 19833, 'loss/train': 1.5451176166534424} 11/07/2021 00:02:19 - INFO - __main__ - Step 19835: {'lr': 0.0004822992725913401, 'samples': 3808320, 'steps': 19834, 'loss/train': 1.776845932006836} 11/07/2021 00:02:20 - INFO - __main__ - Step 19836: {'lr': 0.00048229731124808484, 'samples': 3808512, 'steps': 19835, 'loss/train': 1.3738198280334473} 11/07/2021 00:02:20 - INFO - __main__ - Step 19837: {'lr': 0.00048229534980016007, 'samples': 3808704, 'steps': 19836, 'loss/train': 1.5505852699279785} 11/07/2021 00:02:21 - INFO - __main__ - Step 19838: {'lr': 0.0004822933882475666, 'samples': 3808896, 'steps': 19837, 'loss/train': 1.8858225345611572} 11/07/2021 00:02:21 - INFO - __main__ - Step 19839: {'lr': 0.00048229142659030527, 'samples': 3809088, 'steps': 19838, 'loss/train': 1.6088569164276123} 11/07/2021 00:02:21 - INFO - __main__ - Step 19840: {'lr': 0.000482289464828377, 'samples': 3809280, 'steps': 19839, 'loss/train': 2.1285462379455566} 11/07/2021 00:02:22 - INFO - __main__ - Step 19841: {'lr': 0.00048228750296178276, 'samples': 3809472, 'steps': 19840, 'loss/train': 1.7898335456848145} 11/07/2021 00:02:23 - INFO - __main__ - Step 19842: {'lr': 0.0004822855409905233, 'samples': 3809664, 'steps': 19841, 'loss/train': 1.2840629816055298} 11/07/2021 00:02:23 - INFO - __main__ - Step 19843: {'lr': 0.00048228357891459954, 'samples': 3809856, 'steps': 19842, 'loss/train': 1.5152689218521118} 11/07/2021 00:02:24 - INFO - __main__ - Step 19844: {'lr': 0.0004822816167340124, 'samples': 3810048, 'steps': 19843, 'loss/train': 1.5541553497314453} 11/07/2021 00:02:24 - INFO - __main__ - Step 19845: {'lr': 0.00048227965444876277, 'samples': 3810240, 'steps': 19844, 'loss/train': 1.1225873231887817} 11/07/2021 00:02:24 - INFO - __main__ - Step 19846: {'lr': 0.0004822776920588515, 'samples': 3810432, 'steps': 19845, 'loss/train': 1.8110250234603882} 11/07/2021 00:02:25 - INFO - __main__ - Step 19847: {'lr': 0.0004822757295642795, 'samples': 3810624, 'steps': 19846, 'loss/train': 1.0797271728515625} 11/07/2021 00:02:26 - INFO - __main__ - Step 19848: {'lr': 0.00048227376696504765, 'samples': 3810816, 'steps': 19847, 'loss/train': 2.2480528354644775} 11/07/2021 00:02:27 - INFO - __main__ - Step 19849: {'lr': 0.0004822718042611568, 'samples': 3811008, 'steps': 19848, 'loss/train': 1.7323418855667114} 11/07/2021 00:02:27 - INFO - __main__ - Step 19850: {'lr': 0.0004822698414526079, 'samples': 3811200, 'steps': 19849, 'loss/train': 1.9446306228637695} 11/07/2021 00:02:27 - INFO - __main__ - Step 19851: {'lr': 0.0004822678785394017, 'samples': 3811392, 'steps': 19850, 'loss/train': 0.8418163657188416} 11/07/2021 00:02:28 - INFO - __main__ - Step 19852: {'lr': 0.0004822659155215393, 'samples': 3811584, 'steps': 19851, 'loss/train': 1.6243457794189453} 11/07/2021 00:02:28 - INFO - __main__ - Step 19853: {'lr': 0.00048226395239902133, 'samples': 3811776, 'steps': 19852, 'loss/train': 1.8577585220336914} 11/07/2021 00:02:28 - INFO - __main__ - Step 19854: {'lr': 0.00048226198917184886, 'samples': 3811968, 'steps': 19853, 'loss/train': 1.4468683004379272} 11/07/2021 00:02:29 - INFO - __main__ - Step 19855: {'lr': 0.00048226002584002276, 'samples': 3812160, 'steps': 19854, 'loss/train': 1.4608242511749268} 11/07/2021 00:02:30 - INFO - __main__ - Step 19856: {'lr': 0.00048225806240354387, 'samples': 3812352, 'steps': 19855, 'loss/train': 1.6959941387176514} 11/07/2021 00:02:30 - INFO - __main__ - Step 19857: {'lr': 0.0004822560988624131, 'samples': 3812544, 'steps': 19856, 'loss/train': 1.8978437185287476} 11/07/2021 00:02:30 - INFO - __main__ - Step 19858: {'lr': 0.0004822541352166312, 'samples': 3812736, 'steps': 19857, 'loss/train': 1.0776863098144531} 11/07/2021 00:02:31 - INFO - __main__ - Step 19859: {'lr': 0.0004822521714661993, 'samples': 3812928, 'steps': 19858, 'loss/train': 1.6047289371490479} 11/07/2021 00:02:32 - INFO - __main__ - Step 19860: {'lr': 0.0004822502076111181, 'samples': 3813120, 'steps': 19859, 'loss/train': 1.677196741104126} 11/07/2021 00:02:32 - INFO - __main__ - Step 19861: {'lr': 0.0004822482436513885, 'samples': 3813312, 'steps': 19860, 'loss/train': 1.6622592210769653} 11/07/2021 00:02:33 - INFO - __main__ - Step 19862: {'lr': 0.0004822462795870115, 'samples': 3813504, 'steps': 19861, 'loss/train': 1.220088243484497} 11/07/2021 00:02:33 - INFO - __main__ - Step 19863: {'lr': 0.00048224431541798784, 'samples': 3813696, 'steps': 19862, 'loss/train': 1.8354872465133667} 11/07/2021 00:02:33 - INFO - __main__ - Step 19864: {'lr': 0.00048224235114431856, 'samples': 3813888, 'steps': 19863, 'loss/train': 1.6193705797195435} 11/07/2021 00:02:34 - INFO - __main__ - Step 19865: {'lr': 0.0004822403867660044, 'samples': 3814080, 'steps': 19864, 'loss/train': 1.5870299339294434} 11/07/2021 00:02:35 - INFO - __main__ - Step 19866: {'lr': 0.0004822384222830463, 'samples': 3814272, 'steps': 19865, 'loss/train': 0.993768036365509} 11/07/2021 00:02:35 - INFO - __main__ - Step 19867: {'lr': 0.0004822364576954452, 'samples': 3814464, 'steps': 19866, 'loss/train': 2.282621383666992} 11/07/2021 00:02:35 - INFO - __main__ - Step 19868: {'lr': 0.0004822344930032019, 'samples': 3814656, 'steps': 19867, 'loss/train': 1.6406512260437012} 11/07/2021 00:02:36 - INFO - __main__ - Step 19869: {'lr': 0.00048223252820631736, 'samples': 3814848, 'steps': 19868, 'loss/train': 1.61362886428833} 11/07/2021 00:02:37 - INFO - __main__ - Step 19870: {'lr': 0.00048223056330479235, 'samples': 3815040, 'steps': 19869, 'loss/train': 1.9277762174606323} 11/07/2021 00:02:38 - INFO - __main__ - Step 19871: {'lr': 0.00048222859829862784, 'samples': 3815232, 'steps': 19870, 'loss/train': 1.9069395065307617} 11/07/2021 00:02:38 - INFO - __main__ - Step 19872: {'lr': 0.0004822266331878248, 'samples': 3815424, 'steps': 19871, 'loss/train': 1.3378348350524902} 11/07/2021 00:02:38 - INFO - __main__ - Step 19873: {'lr': 0.00048222466797238396, 'samples': 3815616, 'steps': 19872, 'loss/train': 2.248401403427124} 11/07/2021 00:02:39 - INFO - __main__ - Step 19874: {'lr': 0.00048222270265230627, 'samples': 3815808, 'steps': 19873, 'loss/train': 2.2236526012420654} 11/07/2021 00:02:39 - INFO - __main__ - Step 19875: {'lr': 0.0004822207372275926, 'samples': 3816000, 'steps': 19874, 'loss/train': 1.6553248167037964} 11/07/2021 00:02:40 - INFO - __main__ - Step 19876: {'lr': 0.0004822187716982439, 'samples': 3816192, 'steps': 19875, 'loss/train': 1.5618613958358765} 11/07/2021 00:02:40 - INFO - __main__ - Step 19877: {'lr': 0.000482216806064261, 'samples': 3816384, 'steps': 19876, 'loss/train': 1.9362163543701172} 11/07/2021 00:02:41 - INFO - __main__ - Step 19878: {'lr': 0.0004822148403256447, 'samples': 3816576, 'steps': 19877, 'loss/train': 1.5398274660110474} 11/07/2021 00:02:41 - INFO - __main__ - Step 19879: {'lr': 0.00048221287448239604, 'samples': 3816768, 'steps': 19878, 'loss/train': 1.5209888219833374} 11/07/2021 00:02:42 - INFO - __main__ - Step 19880: {'lr': 0.00048221090853451586, 'samples': 3816960, 'steps': 19879, 'loss/train': 2.007349729537964} 11/07/2021 00:02:42 - INFO - __main__ - Step 19881: {'lr': 0.000482208942482005, 'samples': 3817152, 'steps': 19880, 'loss/train': 1.9798191785812378} 11/07/2021 00:02:43 - INFO - __main__ - Step 19882: {'lr': 0.00048220697632486443, 'samples': 3817344, 'steps': 19881, 'loss/train': 1.5395383834838867} 11/07/2021 00:02:43 - INFO - __main__ - Step 19883: {'lr': 0.0004822050100630949, 'samples': 3817536, 'steps': 19882, 'loss/train': 1.7728437185287476} 11/07/2021 00:02:44 - INFO - __main__ - Step 19884: {'lr': 0.0004822030436966974, 'samples': 3817728, 'steps': 19883, 'loss/train': 1.6646522283554077} 11/07/2021 00:02:44 - INFO - __main__ - Step 19885: {'lr': 0.0004822010772256728, 'samples': 3817920, 'steps': 19884, 'loss/train': 1.8511887788772583} 11/07/2021 00:02:44 - INFO - __main__ - Step 19886: {'lr': 0.00048219911065002196, 'samples': 3818112, 'steps': 19885, 'loss/train': 1.541710615158081} 11/07/2021 00:02:45 - INFO - __main__ - Step 19887: {'lr': 0.00048219714396974587, 'samples': 3818304, 'steps': 19886, 'loss/train': 1.7361443042755127} 11/07/2021 00:02:46 - INFO - __main__ - Step 19888: {'lr': 0.0004821951771848452, 'samples': 3818496, 'steps': 19887, 'loss/train': 1.6225528717041016} 11/07/2021 00:02:46 - INFO - __main__ - Step 19889: {'lr': 0.00048219321029532104, 'samples': 3818688, 'steps': 19888, 'loss/train': 1.6663713455200195} 11/07/2021 00:02:46 - INFO - __main__ - Step 19890: {'lr': 0.0004821912433011742, 'samples': 3818880, 'steps': 19889, 'loss/train': 1.6275155544281006} 11/07/2021 00:02:47 - INFO - __main__ - Step 19891: {'lr': 0.00048218927620240557, 'samples': 3819072, 'steps': 19890, 'loss/train': 2.8923377990722656} 11/07/2021 00:02:47 - INFO - __main__ - Step 19892: {'lr': 0.00048218730899901596, 'samples': 3819264, 'steps': 19891, 'loss/train': 1.6975051164627075} 11/07/2021 00:02:48 - INFO - __main__ - Step 19893: {'lr': 0.0004821853416910065, 'samples': 3819456, 'steps': 19892, 'loss/train': 1.6983906030654907} 11/07/2021 00:02:48 - INFO - __main__ - Step 19894: {'lr': 0.0004821833742783778, 'samples': 3819648, 'steps': 19893, 'loss/train': 1.6201729774475098} 11/07/2021 00:02:49 - INFO - __main__ - Step 19895: {'lr': 0.0004821814067611308, 'samples': 3819840, 'steps': 19894, 'loss/train': 1.6576566696166992} 11/07/2021 00:02:49 - INFO - __main__ - Step 19896: {'lr': 0.00048217943913926646, 'samples': 3820032, 'steps': 19895, 'loss/train': 1.4981663227081299} 11/07/2021 00:02:50 - INFO - __main__ - Step 19897: {'lr': 0.00048217747141278574, 'samples': 3820224, 'steps': 19896, 'loss/train': 1.8732210397720337} 11/07/2021 00:02:50 - INFO - __main__ - Step 19898: {'lr': 0.00048217550358168937, 'samples': 3820416, 'steps': 19897, 'loss/train': 1.464651346206665} 11/07/2021 00:02:51 - INFO - __main__ - Step 19899: {'lr': 0.00048217353564597833, 'samples': 3820608, 'steps': 19898, 'loss/train': 1.3284916877746582} 11/07/2021 00:02:51 - INFO - __main__ - Step 19900: {'lr': 0.0004821715676056534, 'samples': 3820800, 'steps': 19899, 'loss/train': 1.760026216506958} 11/07/2021 00:02:52 - INFO - __main__ - Step 19901: {'lr': 0.0004821695994607156, 'samples': 3820992, 'steps': 19900, 'loss/train': 0.7101365923881531} 11/07/2021 00:02:52 - INFO - __main__ - Step 19902: {'lr': 0.0004821676312111658, 'samples': 3821184, 'steps': 19901, 'loss/train': 1.8503962755203247} 11/07/2021 00:02:53 - INFO - __main__ - Step 19903: {'lr': 0.0004821656628570048, 'samples': 3821376, 'steps': 19902, 'loss/train': 2.2400448322296143} 11/07/2021 00:02:53 - INFO - __main__ - Step 19904: {'lr': 0.00048216369439823355, 'samples': 3821568, 'steps': 19903, 'loss/train': 1.7700556516647339} 11/07/2021 00:02:54 - INFO - __main__ - Step 19905: {'lr': 0.0004821617258348529, 'samples': 3821760, 'steps': 19904, 'loss/train': 1.802931785583496} 11/07/2021 00:02:54 - INFO - __main__ - Step 19906: {'lr': 0.0004821597571668638, 'samples': 3821952, 'steps': 19905, 'loss/train': 1.5791219472885132} 11/07/2021 00:02:54 - INFO - __main__ - Step 19907: {'lr': 0.00048215778839426706, 'samples': 3822144, 'steps': 19906, 'loss/train': 1.5165849924087524} 11/07/2021 00:02:55 - INFO - __main__ - Step 19908: {'lr': 0.0004821558195170636, 'samples': 3822336, 'steps': 19907, 'loss/train': 1.3953006267547607} 11/07/2021 00:02:56 - INFO - __main__ - Step 19909: {'lr': 0.00048215385053525434, 'samples': 3822528, 'steps': 19908, 'loss/train': 2.0647053718566895} 11/07/2021 00:02:56 - INFO - __main__ - Step 19910: {'lr': 0.00048215188144884013, 'samples': 3822720, 'steps': 19909, 'loss/train': 1.1711742877960205} 11/07/2021 00:02:57 - INFO - __main__ - Step 19911: {'lr': 0.0004821499122578218, 'samples': 3822912, 'steps': 19910, 'loss/train': 1.7796618938446045} 11/07/2021 00:02:57 - INFO - __main__ - Step 19912: {'lr': 0.00048214794296220045, 'samples': 3823104, 'steps': 19911, 'loss/train': 1.2514482736587524} 11/07/2021 00:02:58 - INFO - __main__ - Step 19913: {'lr': 0.00048214597356197665, 'samples': 3823296, 'steps': 19912, 'loss/train': 1.8508306741714478} 11/07/2021 00:02:58 - INFO - __main__ - Step 19914: {'lr': 0.00048214400405715153, 'samples': 3823488, 'steps': 19913, 'loss/train': 1.087483525276184} 11/07/2021 00:02:59 - INFO - __main__ - Step 19915: {'lr': 0.000482142034447726, 'samples': 3823680, 'steps': 19914, 'loss/train': 2.030345916748047} 11/07/2021 00:02:59 - INFO - __main__ - Step 19916: {'lr': 0.0004821400647337007, 'samples': 3823872, 'steps': 19915, 'loss/train': 1.1114486455917358} 11/07/2021 00:02:59 - INFO - __main__ - Step 19917: {'lr': 0.0004821380949150768, 'samples': 3824064, 'steps': 19916, 'loss/train': 1.6150808334350586} 11/07/2021 00:03:01 - INFO - __main__ - Step 19918: {'lr': 0.0004821361249918549, 'samples': 3824256, 'steps': 19917, 'loss/train': 1.4313181638717651} 11/07/2021 00:03:01 - INFO - __main__ - Step 19919: {'lr': 0.0004821341549640361, 'samples': 3824448, 'steps': 19918, 'loss/train': 1.7815539836883545} 11/07/2021 00:03:01 - INFO - __main__ - Step 19920: {'lr': 0.00048213218483162133, 'samples': 3824640, 'steps': 19919, 'loss/train': 0.6617085337638855} 11/07/2021 00:03:02 - INFO - __main__ - Step 19921: {'lr': 0.0004821302145946113, 'samples': 3824832, 'steps': 19920, 'loss/train': 0.7101051807403564} 11/07/2021 00:03:02 - INFO - __main__ - Step 19922: {'lr': 0.00048212824425300694, 'samples': 3825024, 'steps': 19921, 'loss/train': 1.7854225635528564} 11/07/2021 00:03:02 - INFO - __main__ - Step 19923: {'lr': 0.0004821262738068093, 'samples': 3825216, 'steps': 19922, 'loss/train': 1.6492276191711426} 11/07/2021 00:03:03 - INFO - __main__ - Step 19924: {'lr': 0.00048212430325601905, 'samples': 3825408, 'steps': 19923, 'loss/train': 1.4564239978790283} 11/07/2021 00:03:04 - INFO - __main__ - Step 19925: {'lr': 0.0004821223326006372, 'samples': 3825600, 'steps': 19924, 'loss/train': 1.9753514528274536} 11/07/2021 00:03:04 - INFO - __main__ - Step 19926: {'lr': 0.0004821203618406645, 'samples': 3825792, 'steps': 19925, 'loss/train': 0.9585258364677429} 11/07/2021 00:03:05 - INFO - __main__ - Step 19927: {'lr': 0.0004821183909761021, 'samples': 3825984, 'steps': 19926, 'loss/train': 1.797566294670105} 11/07/2021 00:03:05 - INFO - __main__ - Step 19928: {'lr': 0.00048211642000695065, 'samples': 3826176, 'steps': 19927, 'loss/train': 1.8800297975540161} 11/07/2021 00:03:06 - INFO - __main__ - Step 19929: {'lr': 0.0004821144489332112, 'samples': 3826368, 'steps': 19928, 'loss/train': 2.163250207901001} 11/07/2021 00:03:06 - INFO - __main__ - Step 19930: {'lr': 0.0004821124777548845, 'samples': 3826560, 'steps': 19929, 'loss/train': 1.244161605834961} 11/07/2021 00:03:07 - INFO - __main__ - Step 19931: {'lr': 0.0004821105064719715, 'samples': 3826752, 'steps': 19930, 'loss/train': 1.3511130809783936} 11/07/2021 00:03:07 - INFO - __main__ - Step 19932: {'lr': 0.0004821085350844731, 'samples': 3826944, 'steps': 19931, 'loss/train': 1.8413258790969849} 11/07/2021 00:03:07 - INFO - __main__ - Step 19933: {'lr': 0.0004821065635923902, 'samples': 3827136, 'steps': 19932, 'loss/train': 1.7809979915618896} 11/07/2021 00:03:08 - INFO - __main__ - Step 19934: {'lr': 0.0004821045919957237, 'samples': 3827328, 'steps': 19933, 'loss/train': 1.867737054824829} 11/07/2021 00:03:09 - INFO - __main__ - Step 19935: {'lr': 0.00048210262029447425, 'samples': 3827520, 'steps': 19934, 'loss/train': 2.0068836212158203} 11/07/2021 00:03:09 - INFO - __main__ - Step 19936: {'lr': 0.0004821006484886431, 'samples': 3827712, 'steps': 19935, 'loss/train': 0.7683764100074768} 11/07/2021 00:03:09 - INFO - __main__ - Step 19937: {'lr': 0.000482098676578231, 'samples': 3827904, 'steps': 19936, 'loss/train': 1.7007976770401} 11/07/2021 00:03:10 - INFO - __main__ - Step 19938: {'lr': 0.0004820967045632388, 'samples': 3828096, 'steps': 19937, 'loss/train': 1.0475733280181885} 11/07/2021 00:03:11 - INFO - __main__ - Step 19939: {'lr': 0.00048209473244366737, 'samples': 3828288, 'steps': 19938, 'loss/train': 1.78171706199646} 11/07/2021 00:03:11 - INFO - __main__ - Step 19940: {'lr': 0.00048209276021951765, 'samples': 3828480, 'steps': 19939, 'loss/train': 2.0384621620178223} 11/07/2021 00:03:11 - INFO - __main__ - Step 19941: {'lr': 0.00048209078789079055, 'samples': 3828672, 'steps': 19940, 'loss/train': 1.5990064144134521} 11/07/2021 00:03:12 - INFO - __main__ - Step 19942: {'lr': 0.00048208881545748684, 'samples': 3828864, 'steps': 19941, 'loss/train': 1.893418312072754} 11/07/2021 00:03:12 - INFO - __main__ - Step 19943: {'lr': 0.00048208684291960755, 'samples': 3829056, 'steps': 19942, 'loss/train': 2.1508586406707764} 11/07/2021 00:03:13 - INFO - __main__ - Step 19944: {'lr': 0.0004820848702771535, 'samples': 3829248, 'steps': 19943, 'loss/train': 1.315595269203186} 11/07/2021 00:03:14 - INFO - __main__ - Step 19945: {'lr': 0.0004820828975301256, 'samples': 3829440, 'steps': 19944, 'loss/train': 1.8021361827850342} 11/07/2021 00:03:14 - INFO - __main__ - Step 19946: {'lr': 0.0004820809246785247, 'samples': 3829632, 'steps': 19945, 'loss/train': 1.77168607711792} 11/07/2021 00:03:14 - INFO - __main__ - Step 19947: {'lr': 0.00048207895172235174, 'samples': 3829824, 'steps': 19946, 'loss/train': 0.8945348858833313} 11/07/2021 00:03:15 - INFO - __main__ - Step 19948: {'lr': 0.00048207697866160755, 'samples': 3830016, 'steps': 19947, 'loss/train': 1.6645221710205078} 11/07/2021 00:03:16 - INFO - __main__ - Step 19949: {'lr': 0.0004820750054962931, 'samples': 3830208, 'steps': 19948, 'loss/train': 1.7571378946304321} 11/07/2021 00:03:16 - INFO - __main__ - Step 19950: {'lr': 0.00048207303222640917, 'samples': 3830400, 'steps': 19949, 'loss/train': 1.541943073272705} 11/07/2021 00:03:16 - INFO - __main__ - Step 19951: {'lr': 0.00048207105885195677, 'samples': 3830592, 'steps': 19950, 'loss/train': 1.7315165996551514} 11/07/2021 00:03:17 - INFO - __main__ - Step 19952: {'lr': 0.0004820690853729367, 'samples': 3830784, 'steps': 19951, 'loss/train': 1.8238463401794434} 11/07/2021 00:03:17 - INFO - __main__ - Step 19953: {'lr': 0.00048206711178934994, 'samples': 3830976, 'steps': 19952, 'loss/train': 1.4717843532562256} 11/07/2021 00:03:18 - INFO - __main__ - Step 19954: {'lr': 0.00048206513810119725, 'samples': 3831168, 'steps': 19953, 'loss/train': 1.7660622596740723} 11/07/2021 00:03:18 - INFO - __main__ - Step 19955: {'lr': 0.0004820631643084796, 'samples': 3831360, 'steps': 19954, 'loss/train': 1.2286633253097534} 11/07/2021 00:03:19 - INFO - __main__ - Step 19956: {'lr': 0.00048206119041119787, 'samples': 3831552, 'steps': 19955, 'loss/train': 1.611129879951477} 11/07/2021 00:03:19 - INFO - __main__ - Step 19957: {'lr': 0.000482059216409353, 'samples': 3831744, 'steps': 19956, 'loss/train': 1.5581938028335571} 11/07/2021 00:03:20 - INFO - __main__ - Step 19958: {'lr': 0.0004820572423029458, 'samples': 3831936, 'steps': 19957, 'loss/train': 1.72555410861969} 11/07/2021 00:03:20 - INFO - __main__ - Step 19959: {'lr': 0.00048205526809197717, 'samples': 3832128, 'steps': 19958, 'loss/train': 1.7237474918365479} 11/07/2021 00:03:21 - INFO - __main__ - Step 19960: {'lr': 0.000482053293776448, 'samples': 3832320, 'steps': 19959, 'loss/train': 2.041546106338501} 11/07/2021 00:03:21 - INFO - __main__ - Step 19961: {'lr': 0.0004820513193563593, 'samples': 3832512, 'steps': 19960, 'loss/train': 1.0926424264907837} 11/07/2021 00:03:22 - INFO - __main__ - Step 19962: {'lr': 0.00048204934483171176, 'samples': 3832704, 'steps': 19961, 'loss/train': 1.6191062927246094} 11/07/2021 00:03:22 - INFO - __main__ - Step 19963: {'lr': 0.0004820473702025064, 'samples': 3832896, 'steps': 19962, 'loss/train': 1.6823042631149292} 11/07/2021 00:03:22 - INFO - __main__ - Step 19964: {'lr': 0.000482045395468744, 'samples': 3833088, 'steps': 19963, 'loss/train': 1.7179694175720215} 11/07/2021 00:03:23 - INFO - __main__ - Step 19965: {'lr': 0.0004820434206304256, 'samples': 3833280, 'steps': 19964, 'loss/train': 1.7687020301818848} 11/07/2021 00:03:24 - INFO - __main__ - Step 19966: {'lr': 0.000482041445687552, 'samples': 3833472, 'steps': 19965, 'loss/train': 1.8509191274642944} 11/07/2021 00:03:24 - INFO - __main__ - Step 19967: {'lr': 0.0004820394706401242, 'samples': 3833664, 'steps': 19966, 'loss/train': 1.80912184715271} 11/07/2021 00:03:24 - INFO - __main__ - Step 19968: {'lr': 0.0004820374954881429, 'samples': 3833856, 'steps': 19967, 'loss/train': 1.3538326025009155} 11/07/2021 00:03:25 - INFO - __main__ - Step 19969: {'lr': 0.000482035520231609, 'samples': 3834048, 'steps': 19968, 'loss/train': 1.9789313077926636} 11/07/2021 00:03:26 - INFO - __main__ - Step 19970: {'lr': 0.00048203354487052363, 'samples': 3834240, 'steps': 19969, 'loss/train': 1.5672111511230469} 11/07/2021 00:03:26 - INFO - __main__ - Step 19971: {'lr': 0.00048203156940488745, 'samples': 3834432, 'steps': 19970, 'loss/train': 1.6324018239974976} 11/07/2021 00:03:26 - INFO - __main__ - Step 19972: {'lr': 0.00048202959383470144, 'samples': 3834624, 'steps': 19971, 'loss/train': 1.9827386140823364} 11/07/2021 00:03:27 - INFO - __main__ - Step 19973: {'lr': 0.00048202761815996646, 'samples': 3834816, 'steps': 19972, 'loss/train': 2.020437717437744} 11/07/2021 00:03:27 - INFO - __main__ - Step 19974: {'lr': 0.0004820256423806835, 'samples': 3835008, 'steps': 19973, 'loss/train': 1.4316328763961792} 11/07/2021 00:03:28 - INFO - __main__ - Step 19975: {'lr': 0.00048202366649685325, 'samples': 3835200, 'steps': 19974, 'loss/train': 1.8737499713897705} 11/07/2021 00:03:28 - INFO - __main__ - Step 19976: {'lr': 0.0004820216905084768, 'samples': 3835392, 'steps': 19975, 'loss/train': 1.99074125289917} 11/07/2021 00:03:29 - INFO - __main__ - Step 19977: {'lr': 0.00048201971441555485, 'samples': 3835584, 'steps': 19976, 'loss/train': 1.502482295036316} 11/07/2021 00:03:29 - INFO - __main__ - Step 19978: {'lr': 0.0004820177382180885, 'samples': 3835776, 'steps': 19977, 'loss/train': 1.506314992904663} 11/07/2021 00:03:29 - INFO - __main__ - Step 19979: {'lr': 0.00048201576191607843, 'samples': 3835968, 'steps': 19978, 'loss/train': 1.5382750034332275} 11/07/2021 00:03:31 - INFO - __main__ - Step 19980: {'lr': 0.00048201378550952575, 'samples': 3836160, 'steps': 19979, 'loss/train': 1.3840868473052979} 11/07/2021 00:03:31 - INFO - __main__ - Step 19981: {'lr': 0.0004820118089984312, 'samples': 3836352, 'steps': 19980, 'loss/train': 1.589767336845398} 11/07/2021 00:03:31 - INFO - __main__ - Step 19982: {'lr': 0.0004820098323827957, 'samples': 3836544, 'steps': 19981, 'loss/train': 1.283545732498169} 11/07/2021 00:03:32 - INFO - __main__ - Step 19983: {'lr': 0.0004820078556626202, 'samples': 3836736, 'steps': 19982, 'loss/train': 1.711501955986023} 11/07/2021 00:03:32 - INFO - __main__ - Step 19984: {'lr': 0.0004820058788379055, 'samples': 3836928, 'steps': 19983, 'loss/train': 1.5402511358261108} 11/07/2021 00:03:34 - INFO - __main__ - Step 19985: {'lr': 0.0004820039019086525, 'samples': 3837120, 'steps': 19984, 'loss/train': 1.3763577938079834} 11/07/2021 00:03:34 - INFO - __main__ - Step 19986: {'lr': 0.00048200192487486216, 'samples': 3837312, 'steps': 19985, 'loss/train': 1.419226884841919} 11/07/2021 00:03:34 - INFO - __main__ - Step 19987: {'lr': 0.00048199994773653535, 'samples': 3837504, 'steps': 19986, 'loss/train': 1.899288535118103} 11/07/2021 00:03:35 - INFO - __main__ - Step 19988: {'lr': 0.0004819979704936729, 'samples': 3837696, 'steps': 19987, 'loss/train': 1.6395899057388306} 11/07/2021 00:03:35 - INFO - __main__ - Step 19989: {'lr': 0.00048199599314627576, 'samples': 3837888, 'steps': 19988, 'loss/train': 0.23624520003795624} 11/07/2021 00:03:35 - INFO - __main__ - Step 19990: {'lr': 0.00048199401569434477, 'samples': 3838080, 'steps': 19989, 'loss/train': 1.949768304824829} 11/07/2021 00:03:37 - INFO - __main__ - Step 19991: {'lr': 0.00048199203813788086, 'samples': 3838272, 'steps': 19990, 'loss/train': 1.4753718376159668} 11/07/2021 00:03:37 - INFO - __main__ - Step 19992: {'lr': 0.00048199006047688496, 'samples': 3838464, 'steps': 19991, 'loss/train': 1.7044295072555542} 11/07/2021 00:03:37 - INFO - __main__ - Step 19993: {'lr': 0.0004819880827113579, 'samples': 3838656, 'steps': 19992, 'loss/train': 1.3625941276550293} 11/07/2021 00:03:38 - INFO - __main__ - Step 19994: {'lr': 0.0004819861048413006, 'samples': 3838848, 'steps': 19993, 'loss/train': 2.013521671295166} 11/07/2021 00:03:38 - INFO - __main__ - Step 19995: {'lr': 0.00048198412686671394, 'samples': 3839040, 'steps': 19994, 'loss/train': 1.5377622842788696} 11/07/2021 00:03:39 - INFO - __main__ - Step 19996: {'lr': 0.0004819821487875988, 'samples': 3839232, 'steps': 19995, 'loss/train': 1.0868034362792969} 11/07/2021 00:03:40 - INFO - __main__ - Step 19997: {'lr': 0.0004819801706039561, 'samples': 3839424, 'steps': 19996, 'loss/train': 1.610630989074707} 11/07/2021 00:03:40 - INFO - __main__ - Step 19998: {'lr': 0.0004819781923157867, 'samples': 3839616, 'steps': 19997, 'loss/train': 1.0310406684875488} 11/07/2021 00:03:40 - INFO - __main__ - Step 19999: {'lr': 0.00048197621392309154, 'samples': 3839808, 'steps': 19998, 'loss/train': 1.1806281805038452} 11/07/2021 00:03:41 - INFO - __main__ - Step 20000: {'lr': 0.00048197423542587143, 'samples': 3840000, 'steps': 19999, 'loss/train': 1.3565977811813354} 11/07/2021 00:03:41 - INFO - __main__ - Step 20001: {'lr': 0.0004819722568241274, 'samples': 3840192, 'steps': 20000, 'loss/train': 1.302907109260559} 11/07/2021 00:03:42 - INFO - __main__ - Step 20002: {'lr': 0.0004819702781178601, 'samples': 3840384, 'steps': 20001, 'loss/train': 1.4258530139923096} 11/07/2021 00:03:42 - INFO - __main__ - Step 20003: {'lr': 0.00048196829930707066, 'samples': 3840576, 'steps': 20002, 'loss/train': 1.59110689163208} 11/07/2021 00:03:43 - INFO - __main__ - Step 20004: {'lr': 0.0004819663203917599, 'samples': 3840768, 'steps': 20003, 'loss/train': 1.8877140283584595} 11/07/2021 00:03:43 - INFO - __main__ - Step 20005: {'lr': 0.0004819643413719287, 'samples': 3840960, 'steps': 20004, 'loss/train': 1.347116470336914} 11/07/2021 00:03:44 - INFO - __main__ - Step 20006: {'lr': 0.0004819623622475779, 'samples': 3841152, 'steps': 20005, 'loss/train': 1.6380776166915894} 11/07/2021 00:03:45 - INFO - __main__ - Step 20007: {'lr': 0.00048196038301870847, 'samples': 3841344, 'steps': 20006, 'loss/train': 1.8486162424087524} 11/07/2021 00:03:45 - INFO - __main__ - Step 20008: {'lr': 0.0004819584036853212, 'samples': 3841536, 'steps': 20007, 'loss/train': 1.7656793594360352} 11/07/2021 00:03:45 - INFO - __main__ - Step 20009: {'lr': 0.00048195642424741716, 'samples': 3841728, 'steps': 20008, 'loss/train': 1.735512137413025} 11/07/2021 00:03:46 - INFO - __main__ - Step 20010: {'lr': 0.00048195444470499704, 'samples': 3841920, 'steps': 20009, 'loss/train': 2.8203601837158203} 11/07/2021 00:03:46 - INFO - __main__ - Step 20011: {'lr': 0.0004819524650580619, 'samples': 3842112, 'steps': 20010, 'loss/train': 1.5159910917282104} 11/07/2021 00:03:47 - INFO - __main__ - Step 20012: {'lr': 0.0004819504853066126, 'samples': 3842304, 'steps': 20011, 'loss/train': 1.7866848707199097} 11/07/2021 00:03:47 - INFO - __main__ - Step 20013: {'lr': 0.0004819485054506498, 'samples': 3842496, 'steps': 20012, 'loss/train': 1.8437927961349487} 11/07/2021 00:03:48 - INFO - __main__ - Step 20014: {'lr': 0.00048194652549017484, 'samples': 3842688, 'steps': 20013, 'loss/train': 0.3831263780593872} 11/07/2021 00:03:48 - INFO - __main__ - Step 20015: {'lr': 0.0004819445454251882, 'samples': 3842880, 'steps': 20014, 'loss/train': 1.9873749017715454} 11/07/2021 00:03:48 - INFO - __main__ - Step 20016: {'lr': 0.0004819425652556909, 'samples': 3843072, 'steps': 20015, 'loss/train': 1.6326098442077637} 11/07/2021 00:03:49 - INFO - __main__ - Step 20017: {'lr': 0.0004819405849816839, 'samples': 3843264, 'steps': 20016, 'loss/train': 1.6164615154266357} 11/07/2021 00:03:50 - INFO - __main__ - Step 20018: {'lr': 0.00048193860460316805, 'samples': 3843456, 'steps': 20017, 'loss/train': 1.9409981966018677} 11/07/2021 00:03:50 - INFO - __main__ - Step 20019: {'lr': 0.00048193662412014427, 'samples': 3843648, 'steps': 20018, 'loss/train': 1.6340709924697876} 11/07/2021 00:03:51 - INFO - __main__ - Step 20020: {'lr': 0.0004819346435326134, 'samples': 3843840, 'steps': 20019, 'loss/train': 1.7512751817703247} 11/07/2021 00:03:51 - INFO - __main__ - Step 20021: {'lr': 0.00048193266284057634, 'samples': 3844032, 'steps': 20020, 'loss/train': 1.0968644618988037} 11/07/2021 00:03:51 - INFO - __main__ - Step 20022: {'lr': 0.0004819306820440341, 'samples': 3844224, 'steps': 20021, 'loss/train': 1.8102567195892334} 11/07/2021 00:03:53 - INFO - __main__ - Step 20023: {'lr': 0.0004819287011429874, 'samples': 3844416, 'steps': 20022, 'loss/train': 1.518404483795166} 11/07/2021 00:03:53 - INFO - __main__ - Step 20024: {'lr': 0.0004819267201374372, 'samples': 3844608, 'steps': 20023, 'loss/train': 1.6262191534042358} 11/07/2021 00:03:53 - INFO - __main__ - Step 20025: {'lr': 0.0004819247390273844, 'samples': 3844800, 'steps': 20024, 'loss/train': 1.5133861303329468} 11/07/2021 00:03:54 - INFO - __main__ - Step 20026: {'lr': 0.00048192275781282993, 'samples': 3844992, 'steps': 20025, 'loss/train': 1.825657606124878} 11/07/2021 00:03:54 - INFO - __main__ - Step 20027: {'lr': 0.00048192077649377455, 'samples': 3845184, 'steps': 20026, 'loss/train': 1.623038411140442} 11/07/2021 00:03:55 - INFO - __main__ - Step 20028: {'lr': 0.0004819187950702193, 'samples': 3845376, 'steps': 20027, 'loss/train': 1.3020976781845093} 11/07/2021 00:03:55 - INFO - __main__ - Step 20029: {'lr': 0.00048191681354216504, 'samples': 3845568, 'steps': 20028, 'loss/train': 1.8787384033203125} 11/07/2021 00:03:56 - INFO - __main__ - Step 20030: {'lr': 0.0004819148319096126, 'samples': 3845760, 'steps': 20029, 'loss/train': 1.937517523765564} 11/07/2021 00:03:56 - INFO - __main__ - Step 20031: {'lr': 0.00048191285017256297, 'samples': 3845952, 'steps': 20030, 'loss/train': 1.222166657447815} 11/07/2021 00:03:56 - INFO - __main__ - Step 20032: {'lr': 0.00048191086833101695, 'samples': 3846144, 'steps': 20031, 'loss/train': 1.8381415605545044} 11/07/2021 00:03:57 - INFO - __main__ - Step 20033: {'lr': 0.00048190888638497553, 'samples': 3846336, 'steps': 20032, 'loss/train': 1.8794498443603516} 11/07/2021 00:03:58 - INFO - __main__ - Step 20034: {'lr': 0.00048190690433443946, 'samples': 3846528, 'steps': 20033, 'loss/train': 1.9543792009353638} 11/07/2021 00:03:58 - INFO - __main__ - Step 20035: {'lr': 0.0004819049221794097, 'samples': 3846720, 'steps': 20034, 'loss/train': 1.4590505361557007} 11/07/2021 00:03:59 - INFO - __main__ - Step 20036: {'lr': 0.0004819029399198873, 'samples': 3846912, 'steps': 20035, 'loss/train': 1.4175198078155518} 11/07/2021 00:03:59 - INFO - __main__ - Step 20037: {'lr': 0.0004819009575558729, 'samples': 3847104, 'steps': 20036, 'loss/train': 1.7073240280151367} 11/07/2021 00:03:59 - INFO - __main__ - Step 20038: {'lr': 0.0004818989750873676, 'samples': 3847296, 'steps': 20037, 'loss/train': 1.8141921758651733} 11/07/2021 00:04:00 - INFO - __main__ - Step 20039: {'lr': 0.00048189699251437206, 'samples': 3847488, 'steps': 20038, 'loss/train': 1.413412094116211} 11/07/2021 00:04:01 - INFO - __main__ - Step 20040: {'lr': 0.0004818950098368874, 'samples': 3847680, 'steps': 20039, 'loss/train': 1.2684391736984253} 11/07/2021 00:04:01 - INFO - __main__ - Step 20041: {'lr': 0.00048189302705491446, 'samples': 3847872, 'steps': 20040, 'loss/train': 1.8604671955108643} 11/07/2021 00:04:01 - INFO - __main__ - Step 20042: {'lr': 0.000481891044168454, 'samples': 3848064, 'steps': 20041, 'loss/train': 1.7178666591644287} 11/07/2021 00:04:02 - INFO - __main__ - Step 20043: {'lr': 0.00048188906117750706, 'samples': 3848256, 'steps': 20042, 'loss/train': 1.5165631771087646} 11/07/2021 00:04:03 - INFO - __main__ - Step 20044: {'lr': 0.00048188707808207457, 'samples': 3848448, 'steps': 20043, 'loss/train': 2.249117374420166} 11/07/2021 00:04:03 - INFO - __main__ - Step 20045: {'lr': 0.00048188509488215724, 'samples': 3848640, 'steps': 20044, 'loss/train': 1.760151743888855} 11/07/2021 00:04:03 - INFO - __main__ - Step 20046: {'lr': 0.0004818831115777561, 'samples': 3848832, 'steps': 20045, 'loss/train': 1.6248598098754883} 11/07/2021 00:04:04 - INFO - __main__ - Step 20047: {'lr': 0.00048188112816887203, 'samples': 3849024, 'steps': 20046, 'loss/train': 1.315185785293579} 11/07/2021 00:04:04 - INFO - __main__ - Step 20048: {'lr': 0.0004818791446555059, 'samples': 3849216, 'steps': 20047, 'loss/train': 1.6985441446304321} 11/07/2021 00:04:05 - INFO - __main__ - Step 20049: {'lr': 0.00048187716103765854, 'samples': 3849408, 'steps': 20048, 'loss/train': 1.3543537855148315} 11/07/2021 00:04:06 - INFO - __main__ - Step 20050: {'lr': 0.0004818751773153309, 'samples': 3849600, 'steps': 20049, 'loss/train': 1.623940348625183} 11/07/2021 00:04:06 - INFO - __main__ - Step 20051: {'lr': 0.000481873193488524, 'samples': 3849792, 'steps': 20050, 'loss/train': 1.5104410648345947} 11/07/2021 00:04:06 - INFO - __main__ - Step 20052: {'lr': 0.0004818712095572385, 'samples': 3849984, 'steps': 20051, 'loss/train': 1.4376572370529175} 11/07/2021 00:04:07 - INFO - __main__ - Step 20053: {'lr': 0.0004818692255214755, 'samples': 3850176, 'steps': 20052, 'loss/train': 1.616486668586731} 11/07/2021 00:04:08 - INFO - __main__ - Step 20054: {'lr': 0.00048186724138123577, 'samples': 3850368, 'steps': 20053, 'loss/train': 1.556071162223816} 11/07/2021 00:04:08 - INFO - __main__ - Step 20055: {'lr': 0.00048186525713652024, 'samples': 3850560, 'steps': 20054, 'loss/train': 2.0656604766845703} 11/07/2021 00:04:08 - INFO - __main__ - Step 20056: {'lr': 0.0004818632727873298, 'samples': 3850752, 'steps': 20055, 'loss/train': 1.7116798162460327} 11/07/2021 00:04:09 - INFO - __main__ - Step 20057: {'lr': 0.00048186128833366536, 'samples': 3850944, 'steps': 20056, 'loss/train': 1.7523837089538574} 11/07/2021 00:04:09 - INFO - __main__ - Step 20058: {'lr': 0.0004818593037755278, 'samples': 3851136, 'steps': 20057, 'loss/train': 1.833410620689392} 11/07/2021 00:04:09 - INFO - __main__ - Step 20059: {'lr': 0.000481857319112918, 'samples': 3851328, 'steps': 20058, 'loss/train': 1.7501016855239868} 11/07/2021 00:04:10 - INFO - __main__ - Step 20060: {'lr': 0.0004818553343458368, 'samples': 3851520, 'steps': 20059, 'loss/train': 1.7878509759902954} 11/07/2021 00:04:11 - INFO - __main__ - Step 20061: {'lr': 0.00048185334947428525, 'samples': 3851712, 'steps': 20060, 'loss/train': 3.4654691219329834} 11/07/2021 00:04:11 - INFO - __main__ - Step 20062: {'lr': 0.0004818513644982642, 'samples': 3851904, 'steps': 20061, 'loss/train': 1.6801989078521729} 11/07/2021 00:04:11 - INFO - __main__ - Step 20063: {'lr': 0.0004818493794177744, 'samples': 3852096, 'steps': 20062, 'loss/train': 2.7569499015808105} 11/07/2021 00:04:12 - INFO - __main__ - Step 20064: {'lr': 0.00048184739423281695, 'samples': 3852288, 'steps': 20063, 'loss/train': 1.657804250717163} 11/07/2021 00:04:13 - INFO - __main__ - Step 20065: {'lr': 0.00048184540894339256, 'samples': 3852480, 'steps': 20064, 'loss/train': 1.6313239336013794} 11/07/2021 00:04:13 - INFO - __main__ - Step 20066: {'lr': 0.00048184342354950225, 'samples': 3852672, 'steps': 20065, 'loss/train': 1.8359761238098145} 11/07/2021 00:04:14 - INFO - __main__ - Step 20067: {'lr': 0.00048184143805114684, 'samples': 3852864, 'steps': 20066, 'loss/train': 1.2439062595367432} 11/07/2021 00:04:14 - INFO - __main__ - Step 20068: {'lr': 0.00048183945244832725, 'samples': 3853056, 'steps': 20067, 'loss/train': 2.3377492427825928} 11/07/2021 00:04:14 - INFO - __main__ - Step 20069: {'lr': 0.00048183746674104446, 'samples': 3853248, 'steps': 20068, 'loss/train': 1.7448499202728271} 11/07/2021 00:04:15 - INFO - __main__ - Step 20070: {'lr': 0.00048183548092929916, 'samples': 3853440, 'steps': 20069, 'loss/train': 1.6729365587234497} 11/07/2021 00:04:16 - INFO - __main__ - Step 20071: {'lr': 0.0004818334950130925, 'samples': 3853632, 'steps': 20070, 'loss/train': 1.792672038078308} 11/07/2021 00:04:16 - INFO - __main__ - Step 20072: {'lr': 0.00048183150899242514, 'samples': 3853824, 'steps': 20071, 'loss/train': 2.116844892501831} 11/07/2021 00:04:16 - INFO - __main__ - Step 20073: {'lr': 0.0004818295228672981, 'samples': 3854016, 'steps': 20072, 'loss/train': 2.090991258621216} 11/07/2021 00:04:17 - INFO - __main__ - Step 20074: {'lr': 0.0004818275366377123, 'samples': 3854208, 'steps': 20073, 'loss/train': 1.6128263473510742} 11/07/2021 00:04:18 - INFO - __main__ - Step 20075: {'lr': 0.00048182555030366854, 'samples': 3854400, 'steps': 20074, 'loss/train': 1.7799201011657715} 11/07/2021 00:04:18 - INFO - __main__ - Step 20076: {'lr': 0.0004818235638651678, 'samples': 3854592, 'steps': 20075, 'loss/train': 1.5875736474990845} 11/07/2021 00:04:18 - INFO - __main__ - Step 20077: {'lr': 0.0004818215773222109, 'samples': 3854784, 'steps': 20076, 'loss/train': 0.957347571849823} 11/07/2021 00:04:19 - INFO - __main__ - Step 20078: {'lr': 0.0004818195906747988, 'samples': 3854976, 'steps': 20077, 'loss/train': 1.4278732538223267} 11/07/2021 00:04:19 - INFO - __main__ - Step 20079: {'lr': 0.0004818176039229324, 'samples': 3855168, 'steps': 20078, 'loss/train': 1.445941686630249} 11/07/2021 00:04:20 - INFO - __main__ - Step 20080: {'lr': 0.0004818156170666125, 'samples': 3855360, 'steps': 20079, 'loss/train': 1.3033205270767212} 11/07/2021 00:04:21 - INFO - __main__ - Step 20081: {'lr': 0.0004818136301058401, 'samples': 3855552, 'steps': 20080, 'loss/train': 1.856467604637146} 11/07/2021 00:04:21 - INFO - __main__ - Step 20082: {'lr': 0.0004818116430406161, 'samples': 3855744, 'steps': 20081, 'loss/train': 1.8351445198059082} 11/07/2021 00:04:21 - INFO - __main__ - Step 20083: {'lr': 0.00048180965587094125, 'samples': 3855936, 'steps': 20082, 'loss/train': 1.6008447408676147} 11/07/2021 00:04:22 - INFO - __main__ - Step 20084: {'lr': 0.00048180766859681664, 'samples': 3856128, 'steps': 20083, 'loss/train': 1.4347810745239258} 11/07/2021 00:04:23 - INFO - __main__ - Step 20085: {'lr': 0.000481805681218243, 'samples': 3856320, 'steps': 20084, 'loss/train': 1.5482618808746338} 11/07/2021 00:04:23 - INFO - __main__ - Step 20086: {'lr': 0.0004818036937352214, 'samples': 3856512, 'steps': 20085, 'loss/train': 1.7869243621826172} 11/07/2021 00:04:23 - INFO - __main__ - Step 20087: {'lr': 0.0004818017061477525, 'samples': 3856704, 'steps': 20086, 'loss/train': 1.2508975267410278} 11/07/2021 00:04:24 - INFO - __main__ - Step 20088: {'lr': 0.00048179971845583734, 'samples': 3856896, 'steps': 20087, 'loss/train': 1.7061606645584106} 11/07/2021 00:04:24 - INFO - __main__ - Step 20089: {'lr': 0.00048179773065947683, 'samples': 3857088, 'steps': 20088, 'loss/train': 1.7644474506378174} 11/07/2021 00:04:24 - INFO - __main__ - Step 20090: {'lr': 0.0004817957427586719, 'samples': 3857280, 'steps': 20089, 'loss/train': 1.3199182748794556} 11/07/2021 00:04:25 - INFO - __main__ - Step 20091: {'lr': 0.00048179375475342333, 'samples': 3857472, 'steps': 20090, 'loss/train': 1.5608344078063965} 11/07/2021 00:04:26 - INFO - __main__ - Step 20092: {'lr': 0.00048179176664373214, 'samples': 3857664, 'steps': 20091, 'loss/train': 1.464991569519043} 11/07/2021 00:04:26 - INFO - __main__ - Step 20093: {'lr': 0.0004817897784295991, 'samples': 3857856, 'steps': 20092, 'loss/train': 2.1888020038604736} 11/07/2021 00:04:26 - INFO - __main__ - Step 20094: {'lr': 0.0004817877901110251, 'samples': 3858048, 'steps': 20093, 'loss/train': 2.046032428741455} 11/07/2021 00:04:27 - INFO - __main__ - Step 20095: {'lr': 0.0004817858016880112, 'samples': 3858240, 'steps': 20094, 'loss/train': 2.0262649059295654} 11/07/2021 00:04:28 - INFO - __main__ - Step 20096: {'lr': 0.0004817838131605582, 'samples': 3858432, 'steps': 20095, 'loss/train': 1.6248929500579834} 11/07/2021 00:04:28 - INFO - __main__ - Step 20097: {'lr': 0.00048178182452866694, 'samples': 3858624, 'steps': 20096, 'loss/train': 1.839397668838501} 11/07/2021 00:04:28 - INFO - __main__ - Step 20098: {'lr': 0.0004817798357923384, 'samples': 3858816, 'steps': 20097, 'loss/train': 1.6831201314926147} 11/07/2021 00:04:29 - INFO - __main__ - Step 20099: {'lr': 0.00048177784695157335, 'samples': 3859008, 'steps': 20098, 'loss/train': 1.7522791624069214} 11/07/2021 00:04:29 - INFO - __main__ - Step 20100: {'lr': 0.00048177585800637286, 'samples': 3859200, 'steps': 20099, 'loss/train': 1.5155513286590576} 11/07/2021 00:04:30 - INFO - __main__ - Step 20101: {'lr': 0.00048177386895673774, 'samples': 3859392, 'steps': 20100, 'loss/train': 1.6983426809310913} 11/07/2021 00:04:30 - INFO - __main__ - Step 20102: {'lr': 0.0004817718798026689, 'samples': 3859584, 'steps': 20101, 'loss/train': 1.6446847915649414} 11/07/2021 00:04:31 - INFO - __main__ - Step 20103: {'lr': 0.0004817698905441672, 'samples': 3859776, 'steps': 20102, 'loss/train': 1.7305409908294678} 11/07/2021 00:04:31 - INFO - __main__ - Step 20104: {'lr': 0.0004817679011812336, 'samples': 3859968, 'steps': 20103, 'loss/train': 1.7872097492218018} 11/07/2021 00:04:32 - INFO - __main__ - Step 20105: {'lr': 0.00048176591171386884, 'samples': 3860160, 'steps': 20104, 'loss/train': 1.3828495740890503} 11/07/2021 00:04:33 - INFO - __main__ - Step 20106: {'lr': 0.0004817639221420741, 'samples': 3860352, 'steps': 20105, 'loss/train': 2.1160809993743896} 11/07/2021 00:04:33 - INFO - __main__ - Step 20107: {'lr': 0.00048176193246585, 'samples': 3860544, 'steps': 20106, 'loss/train': 1.92433500289917} 11/07/2021 00:04:33 - INFO - __main__ - Step 20108: {'lr': 0.00048175994268519765, 'samples': 3860736, 'steps': 20107, 'loss/train': 1.871787428855896} 11/07/2021 00:04:34 - INFO - __main__ - Step 20109: {'lr': 0.00048175795280011775, 'samples': 3860928, 'steps': 20108, 'loss/train': 1.485747218132019} 11/07/2021 00:04:34 - INFO - __main__ - Step 20110: {'lr': 0.00048175596281061135, 'samples': 3861120, 'steps': 20109, 'loss/train': 1.0940872430801392} 11/07/2021 00:04:35 - INFO - __main__ - Step 20111: {'lr': 0.00048175397271667925, 'samples': 3861312, 'steps': 20110, 'loss/train': 0.7007030844688416} 11/07/2021 00:04:35 - INFO - __main__ - Step 20112: {'lr': 0.00048175198251832244, 'samples': 3861504, 'steps': 20111, 'loss/train': 1.479366660118103} 11/07/2021 00:04:36 - INFO - __main__ - Step 20113: {'lr': 0.00048174999221554173, 'samples': 3861696, 'steps': 20112, 'loss/train': 1.4841296672821045} 11/07/2021 00:04:36 - INFO - __main__ - Step 20114: {'lr': 0.000481748001808338, 'samples': 3861888, 'steps': 20113, 'loss/train': 1.2486543655395508} 11/07/2021 00:04:36 - INFO - __main__ - Step 20115: {'lr': 0.00048174601129671223, 'samples': 3862080, 'steps': 20114, 'loss/train': 0.9452338218688965} 11/07/2021 00:04:37 - INFO - __main__ - Step 20116: {'lr': 0.00048174402068066534, 'samples': 3862272, 'steps': 20115, 'loss/train': 1.4266661405563354} 11/07/2021 00:04:38 - INFO - __main__ - Step 20117: {'lr': 0.0004817420299601981, 'samples': 3862464, 'steps': 20116, 'loss/train': 2.441077947616577} 11/07/2021 00:04:38 - INFO - __main__ - Step 20118: {'lr': 0.0004817400391353115, 'samples': 3862656, 'steps': 20117, 'loss/train': 1.3544243574142456} 11/07/2021 00:04:39 - INFO - __main__ - Step 20119: {'lr': 0.00048173804820600646, 'samples': 3862848, 'steps': 20118, 'loss/train': 1.8580822944641113} 11/07/2021 00:04:39 - INFO - __main__ - Step 20120: {'lr': 0.0004817360571722838, 'samples': 3863040, 'steps': 20119, 'loss/train': 1.5528236627578735} 11/07/2021 00:04:40 - INFO - __main__ - Step 20121: {'lr': 0.00048173406603414445, 'samples': 3863232, 'steps': 20120, 'loss/train': 1.140297293663025} 11/07/2021 00:04:40 - INFO - __main__ - Step 20122: {'lr': 0.00048173207479158933, 'samples': 3863424, 'steps': 20121, 'loss/train': 0.44841068983078003} 11/07/2021 00:04:41 - INFO - __main__ - Step 20123: {'lr': 0.0004817300834446192, 'samples': 3863616, 'steps': 20122, 'loss/train': 1.983655571937561} 11/07/2021 00:04:41 - INFO - __main__ - Step 20124: {'lr': 0.0004817280919932352, 'samples': 3863808, 'steps': 20123, 'loss/train': 1.480665683746338} 11/07/2021 00:04:42 - INFO - __main__ - Step 20125: {'lr': 0.000481726100437438, 'samples': 3864000, 'steps': 20124, 'loss/train': 1.859569787979126} 11/07/2021 00:04:42 - INFO - __main__ - Step 20126: {'lr': 0.00048172410877722865, 'samples': 3864192, 'steps': 20125, 'loss/train': 1.8540468215942383} 11/07/2021 00:04:43 - INFO - __main__ - Step 20127: {'lr': 0.00048172211701260807, 'samples': 3864384, 'steps': 20126, 'loss/train': 1.706778645515442} 11/07/2021 00:04:43 - INFO - __main__ - Step 20128: {'lr': 0.0004817201251435769, 'samples': 3864576, 'steps': 20127, 'loss/train': 1.863878846168518} 11/07/2021 00:04:44 - INFO - __main__ - Step 20129: {'lr': 0.00048171813317013633, 'samples': 3864768, 'steps': 20128, 'loss/train': 1.8926697969436646} 11/07/2021 00:04:44 - INFO - __main__ - Step 20130: {'lr': 0.00048171614109228714, 'samples': 3864960, 'steps': 20129, 'loss/train': 1.369523048400879} 11/07/2021 00:04:44 - INFO - __main__ - Step 20131: {'lr': 0.0004817141489100302, 'samples': 3865152, 'steps': 20130, 'loss/train': 1.67985999584198} 11/07/2021 00:04:45 - INFO - __main__ - Step 20132: {'lr': 0.0004817121566233665, 'samples': 3865344, 'steps': 20131, 'loss/train': 1.905426025390625} 11/07/2021 00:04:46 - INFO - __main__ - Step 20133: {'lr': 0.0004817101642322968, 'samples': 3865536, 'steps': 20132, 'loss/train': 1.1768156290054321} 11/07/2021 00:04:46 - INFO - __main__ - Step 20134: {'lr': 0.00048170817173682215, 'samples': 3865728, 'steps': 20133, 'loss/train': 1.8267666101455688} 11/07/2021 00:04:47 - INFO - __main__ - Step 20135: {'lr': 0.00048170617913694333, 'samples': 3865920, 'steps': 20134, 'loss/train': 1.8322908878326416} 11/07/2021 00:04:47 - INFO - __main__ - Step 20136: {'lr': 0.00048170418643266125, 'samples': 3866112, 'steps': 20135, 'loss/train': 1.625266671180725} 11/07/2021 00:04:47 - INFO - __main__ - Step 20137: {'lr': 0.00048170219362397685, 'samples': 3866304, 'steps': 20136, 'loss/train': 1.84674870967865} 11/07/2021 00:04:48 - INFO - __main__ - Step 20138: {'lr': 0.00048170020071089105, 'samples': 3866496, 'steps': 20137, 'loss/train': 1.6670939922332764} 11/07/2021 00:04:49 - INFO - __main__ - Step 20139: {'lr': 0.00048169820769340476, 'samples': 3866688, 'steps': 20138, 'loss/train': 1.7214689254760742} 11/07/2021 00:04:49 - INFO - __main__ - Step 20140: {'lr': 0.0004816962145715188, 'samples': 3866880, 'steps': 20139, 'loss/train': 1.1451594829559326} 11/07/2021 00:04:49 - INFO - __main__ - Step 20141: {'lr': 0.00048169422134523404, 'samples': 3867072, 'steps': 20140, 'loss/train': 1.5689804553985596} 11/07/2021 00:04:50 - INFO - __main__ - Step 20142: {'lr': 0.0004816922280145515, 'samples': 3867264, 'steps': 20141, 'loss/train': 1.038034439086914} 11/07/2021 00:04:51 - INFO - __main__ - Step 20143: {'lr': 0.00048169023457947195, 'samples': 3867456, 'steps': 20142, 'loss/train': 2.1106817722320557} 11/07/2021 00:04:51 - INFO - __main__ - Step 20144: {'lr': 0.0004816882410399964, 'samples': 3867648, 'steps': 20143, 'loss/train': 0.2228638380765915} 11/07/2021 00:04:51 - INFO - __main__ - Step 20145: {'lr': 0.00048168624739612577, 'samples': 3867840, 'steps': 20144, 'loss/train': 1.7859293222427368} 11/07/2021 00:04:52 - INFO - __main__ - Step 20146: {'lr': 0.0004816842536478608, 'samples': 3868032, 'steps': 20145, 'loss/train': 2.007317066192627} 11/07/2021 00:04:52 - INFO - __main__ - Step 20147: {'lr': 0.00048168225979520254, 'samples': 3868224, 'steps': 20146, 'loss/train': 1.6304503679275513} 11/07/2021 00:04:53 - INFO - __main__ - Step 20148: {'lr': 0.0004816802658381518, 'samples': 3868416, 'steps': 20147, 'loss/train': 1.0293740034103394} 11/07/2021 00:04:54 - INFO - __main__ - Step 20149: {'lr': 0.00048167827177670946, 'samples': 3868608, 'steps': 20148, 'loss/train': 1.9682854413986206} 11/07/2021 00:04:54 - INFO - __main__ - Step 20150: {'lr': 0.0004816762776108765, 'samples': 3868800, 'steps': 20149, 'loss/train': 1.8390581607818604} 11/07/2021 00:04:54 - INFO - __main__ - Step 20151: {'lr': 0.0004816742833406538, 'samples': 3868992, 'steps': 20150, 'loss/train': 2.0192196369171143} 11/07/2021 00:04:55 - INFO - __main__ - Step 20152: {'lr': 0.0004816722889660423, 'samples': 3869184, 'steps': 20151, 'loss/train': 1.632528305053711} 11/07/2021 00:04:56 - INFO - __main__ - Step 20153: {'lr': 0.00048167029448704273, 'samples': 3869376, 'steps': 20152, 'loss/train': 1.5875515937805176} 11/07/2021 00:04:56 - INFO - __main__ - Step 20154: {'lr': 0.00048166829990365615, 'samples': 3869568, 'steps': 20153, 'loss/train': 2.2908775806427} 11/07/2021 00:04:56 - INFO - __main__ - Step 20155: {'lr': 0.0004816663052158834, 'samples': 3869760, 'steps': 20154, 'loss/train': 2.725844621658325} 11/07/2021 00:04:57 - INFO - __main__ - Step 20156: {'lr': 0.0004816643104237254, 'samples': 3869952, 'steps': 20155, 'loss/train': 1.3609468936920166} 11/07/2021 00:04:57 - INFO - __main__ - Step 20157: {'lr': 0.00048166231552718305, 'samples': 3870144, 'steps': 20156, 'loss/train': 1.328758716583252} 11/07/2021 00:04:57 - INFO - __main__ - Step 20158: {'lr': 0.0004816603205262572, 'samples': 3870336, 'steps': 20157, 'loss/train': 1.8653146028518677} 11/07/2021 00:04:58 - INFO - __main__ - Step 20159: {'lr': 0.0004816583254209488, 'samples': 3870528, 'steps': 20158, 'loss/train': 1.5926765203475952} 11/07/2021 00:04:59 - INFO - __main__ - Step 20160: {'lr': 0.00048165633021125874, 'samples': 3870720, 'steps': 20159, 'loss/train': 1.8898913860321045} 11/07/2021 00:04:59 - INFO - __main__ - Step 20161: {'lr': 0.0004816543348971879, 'samples': 3870912, 'steps': 20160, 'loss/train': 1.354554295539856} 11/07/2021 00:05:00 - INFO - __main__ - Step 20162: {'lr': 0.0004816523394787372, 'samples': 3871104, 'steps': 20161, 'loss/train': 1.504123568534851} 11/07/2021 00:05:00 - INFO - __main__ - Step 20163: {'lr': 0.00048165034395590756, 'samples': 3871296, 'steps': 20162, 'loss/train': 1.4880510568618774} 11/07/2021 00:05:01 - INFO - __main__ - Step 20164: {'lr': 0.0004816483483286998, 'samples': 3871488, 'steps': 20163, 'loss/train': 1.560563087463379} 11/07/2021 00:05:01 - INFO - __main__ - Step 20165: {'lr': 0.0004816463525971149, 'samples': 3871680, 'steps': 20164, 'loss/train': 1.6374289989471436} 11/07/2021 00:05:01 - INFO - __main__ - Step 20166: {'lr': 0.0004816443567611537, 'samples': 3871872, 'steps': 20165, 'loss/train': 1.8701469898223877} 11/07/2021 00:05:02 - INFO - __main__ - Step 20167: {'lr': 0.00048164236082081713, 'samples': 3872064, 'steps': 20166, 'loss/train': 1.5199638605117798} 11/07/2021 00:05:02 - INFO - __main__ - Step 20168: {'lr': 0.00048164036477610616, 'samples': 3872256, 'steps': 20167, 'loss/train': 1.2595298290252686} 11/07/2021 00:05:03 - INFO - __main__ - Step 20169: {'lr': 0.00048163836862702154, 'samples': 3872448, 'steps': 20168, 'loss/train': 0.5841023921966553} 11/07/2021 00:05:04 - INFO - __main__ - Step 20170: {'lr': 0.0004816363723735643, 'samples': 3872640, 'steps': 20169, 'loss/train': 1.7465553283691406} 11/07/2021 00:05:04 - INFO - __main__ - Step 20171: {'lr': 0.00048163437601573525, 'samples': 3872832, 'steps': 20170, 'loss/train': 1.8244073390960693} 11/07/2021 00:05:04 - INFO - __main__ - Step 20172: {'lr': 0.00048163237955353526, 'samples': 3873024, 'steps': 20171, 'loss/train': 1.614901065826416} 11/07/2021 00:05:05 - INFO - __main__ - Step 20173: {'lr': 0.00048163038298696537, 'samples': 3873216, 'steps': 20172, 'loss/train': 1.7259612083435059} 11/07/2021 00:05:06 - INFO - __main__ - Step 20174: {'lr': 0.00048162838631602643, 'samples': 3873408, 'steps': 20173, 'loss/train': 1.643215298652649} 11/07/2021 00:05:06 - INFO - __main__ - Step 20175: {'lr': 0.00048162638954071926, 'samples': 3873600, 'steps': 20174, 'loss/train': 1.6356228590011597} 11/07/2021 00:05:07 - INFO - __main__ - Step 20176: {'lr': 0.0004816243926610448, 'samples': 3873792, 'steps': 20175, 'loss/train': 1.7864456176757812} 11/07/2021 00:05:07 - INFO - __main__ - Step 20177: {'lr': 0.000481622395677004, 'samples': 3873984, 'steps': 20176, 'loss/train': 1.7050282955169678} 11/07/2021 00:05:07 - INFO - __main__ - Step 20178: {'lr': 0.0004816203985885977, 'samples': 3874176, 'steps': 20177, 'loss/train': 1.7557024955749512} 11/07/2021 00:05:08 - INFO - __main__ - Step 20179: {'lr': 0.0004816184013958268, 'samples': 3874368, 'steps': 20178, 'loss/train': 1.1449915170669556} 11/07/2021 00:05:09 - INFO - __main__ - Step 20180: {'lr': 0.0004816164040986923, 'samples': 3874560, 'steps': 20179, 'loss/train': 1.9531196355819702} 11/07/2021 00:05:09 - INFO - __main__ - Step 20181: {'lr': 0.00048161440669719496, 'samples': 3874752, 'steps': 20180, 'loss/train': 1.1801620721817017} 11/07/2021 00:05:09 - INFO - __main__ - Step 20182: {'lr': 0.00048161240919133573, 'samples': 3874944, 'steps': 20181, 'loss/train': 1.5637428760528564} 11/07/2021 00:05:10 - INFO - __main__ - Step 20183: {'lr': 0.00048161041158111564, 'samples': 3875136, 'steps': 20182, 'loss/train': 1.7784135341644287} 11/07/2021 00:05:11 - INFO - __main__ - Step 20184: {'lr': 0.0004816084138665353, 'samples': 3875328, 'steps': 20183, 'loss/train': 1.9407933950424194} 11/07/2021 00:05:11 - INFO - __main__ - Step 20185: {'lr': 0.00048160641604759593, 'samples': 3875520, 'steps': 20184, 'loss/train': 1.4560564756393433} 11/07/2021 00:05:12 - INFO - __main__ - Step 20186: {'lr': 0.0004816044181242982, 'samples': 3875712, 'steps': 20185, 'loss/train': 0.9393236041069031} 11/07/2021 00:05:12 - INFO - __main__ - Step 20187: {'lr': 0.0004816024200966431, 'samples': 3875904, 'steps': 20186, 'loss/train': 1.409131646156311} 11/07/2021 00:05:12 - INFO - __main__ - Step 20188: {'lr': 0.00048160042196463153, 'samples': 3876096, 'steps': 20187, 'loss/train': 1.8766820430755615} 11/07/2021 00:05:13 - INFO - __main__ - Step 20189: {'lr': 0.00048159842372826446, 'samples': 3876288, 'steps': 20188, 'loss/train': 1.3698480129241943} 11/07/2021 00:05:14 - INFO - __main__ - Step 20190: {'lr': 0.0004815964253875426, 'samples': 3876480, 'steps': 20189, 'loss/train': 1.751710295677185} 11/07/2021 00:05:14 - INFO - __main__ - Step 20191: {'lr': 0.000481594426942467, 'samples': 3876672, 'steps': 20190, 'loss/train': 1.5956690311431885} 11/07/2021 00:05:14 - INFO - __main__ - Step 20192: {'lr': 0.0004815924283930385, 'samples': 3876864, 'steps': 20191, 'loss/train': 0.9469591975212097} 11/07/2021 00:05:15 - INFO - __main__ - Step 20193: {'lr': 0.0004815904297392582, 'samples': 3877056, 'steps': 20192, 'loss/train': 1.4563199281692505} 11/07/2021 00:05:15 - INFO - __main__ - Step 20194: {'lr': 0.00048158843098112657, 'samples': 3877248, 'steps': 20193, 'loss/train': 1.7004506587982178} 11/07/2021 00:05:16 - INFO - __main__ - Step 20195: {'lr': 0.00048158643211864495, 'samples': 3877440, 'steps': 20194, 'loss/train': 1.642877221107483} 11/07/2021 00:05:16 - INFO - __main__ - Step 20196: {'lr': 0.000481584433151814, 'samples': 3877632, 'steps': 20195, 'loss/train': 1.9405843019485474} 11/07/2021 00:05:17 - INFO - __main__ - Step 20197: {'lr': 0.00048158243408063465, 'samples': 3877824, 'steps': 20196, 'loss/train': 1.7496905326843262} 11/07/2021 00:05:17 - INFO - __main__ - Step 20198: {'lr': 0.0004815804349051078, 'samples': 3878016, 'steps': 20197, 'loss/train': 0.8492428660392761} 11/07/2021 00:05:17 - INFO - __main__ - Step 20199: {'lr': 0.0004815784356252344, 'samples': 3878208, 'steps': 20198, 'loss/train': 1.2568089962005615} 11/07/2021 00:05:18 - INFO - __main__ - Step 20200: {'lr': 0.0004815764362410154, 'samples': 3878400, 'steps': 20199, 'loss/train': 1.2597923278808594} 11/07/2021 00:05:19 - INFO - __main__ - Step 20201: {'lr': 0.0004815744367524516, 'samples': 3878592, 'steps': 20200, 'loss/train': 1.7262325286865234} 11/07/2021 00:05:19 - INFO - __main__ - Step 20202: {'lr': 0.0004815724371595439, 'samples': 3878784, 'steps': 20201, 'loss/train': 1.7526211738586426} 11/07/2021 00:05:19 - INFO - __main__ - Step 20203: {'lr': 0.00048157043746229324, 'samples': 3878976, 'steps': 20202, 'loss/train': 1.2760602235794067} 11/07/2021 00:05:20 - INFO - __main__ - Step 20204: {'lr': 0.0004815684376607006, 'samples': 3879168, 'steps': 20203, 'loss/train': 1.6638855934143066} 11/07/2021 00:05:21 - INFO - __main__ - Step 20205: {'lr': 0.0004815664377547667, 'samples': 3879360, 'steps': 20204, 'loss/train': 1.7330559492111206} 11/07/2021 00:05:22 - INFO - __main__ - Step 20206: {'lr': 0.00048156443774449254, 'samples': 3879552, 'steps': 20205, 'loss/train': 1.6828453540802002} 11/07/2021 00:05:22 - INFO - __main__ - Step 20207: {'lr': 0.00048156243762987905, 'samples': 3879744, 'steps': 20206, 'loss/train': 1.5695830583572388} 11/07/2021 00:05:22 - INFO - __main__ - Step 20208: {'lr': 0.00048156043741092705, 'samples': 3879936, 'steps': 20207, 'loss/train': 1.6807202100753784} 11/07/2021 00:05:23 - INFO - __main__ - Step 20209: {'lr': 0.00048155843708763755, 'samples': 3880128, 'steps': 20208, 'loss/train': 1.7855095863342285} 11/07/2021 00:05:23 - INFO - __main__ - Step 20210: {'lr': 0.0004815564366600114, 'samples': 3880320, 'steps': 20209, 'loss/train': 1.778202772140503} 11/07/2021 00:05:23 - INFO - __main__ - Step 20211: {'lr': 0.0004815544361280494, 'samples': 3880512, 'steps': 20210, 'loss/train': 1.3515102863311768} 11/07/2021 00:05:25 - INFO - __main__ - Step 20212: {'lr': 0.00048155243549175263, 'samples': 3880704, 'steps': 20211, 'loss/train': 1.462814211845398} 11/07/2021 00:05:25 - INFO - __main__ - Step 20213: {'lr': 0.00048155043475112184, 'samples': 3880896, 'steps': 20212, 'loss/train': 1.8552340269088745} 11/07/2021 00:05:25 - INFO - __main__ - Step 20214: {'lr': 0.0004815484339061581, 'samples': 3881088, 'steps': 20213, 'loss/train': 1.4173270463943481} 11/07/2021 00:05:26 - INFO - __main__ - Step 20215: {'lr': 0.0004815464329568621, 'samples': 3881280, 'steps': 20214, 'loss/train': 1.4815483093261719} 11/07/2021 00:05:26 - INFO - __main__ - Step 20216: {'lr': 0.00048154443190323495, 'samples': 3881472, 'steps': 20215, 'loss/train': 1.7627679109573364} 11/07/2021 00:05:27 - INFO - __main__ - Step 20217: {'lr': 0.0004815424307452774, 'samples': 3881664, 'steps': 20216, 'loss/train': 1.7265102863311768} 11/07/2021 00:05:28 - INFO - __main__ - Step 20218: {'lr': 0.0004815404294829904, 'samples': 3881856, 'steps': 20217, 'loss/train': 1.392214059829712} 11/07/2021 00:05:28 - INFO - __main__ - Step 20219: {'lr': 0.0004815384281163748, 'samples': 3882048, 'steps': 20218, 'loss/train': 1.5504920482635498} 11/07/2021 00:05:28 - INFO - __main__ - Step 20220: {'lr': 0.0004815364266454316, 'samples': 3882240, 'steps': 20219, 'loss/train': 1.3972241878509521} 11/07/2021 00:05:29 - INFO - __main__ - Step 20221: {'lr': 0.00048153442507016173, 'samples': 3882432, 'steps': 20220, 'loss/train': 2.0695390701293945} 11/07/2021 00:05:29 - INFO - __main__ - Step 20222: {'lr': 0.00048153242339056594, 'samples': 3882624, 'steps': 20221, 'loss/train': 0.68231600522995} 11/07/2021 00:05:30 - INFO - __main__ - Step 20223: {'lr': 0.0004815304216066453, 'samples': 3882816, 'steps': 20222, 'loss/train': 1.10834538936615} 11/07/2021 00:05:30 - INFO - __main__ - Step 20224: {'lr': 0.0004815284197184005, 'samples': 3883008, 'steps': 20223, 'loss/train': 0.35353773832321167} 11/07/2021 00:05:31 - INFO - __main__ - Step 20225: {'lr': 0.0004815264177258326, 'samples': 3883200, 'steps': 20224, 'loss/train': 2.266817569732666} 11/07/2021 00:05:31 - INFO - __main__ - Step 20226: {'lr': 0.00048152441562894255, 'samples': 3883392, 'steps': 20225, 'loss/train': 1.4499667882919312} 11/07/2021 00:05:31 - INFO - __main__ - Step 20227: {'lr': 0.0004815224134277311, 'samples': 3883584, 'steps': 20226, 'loss/train': 0.39159658551216125} 11/07/2021 00:05:32 - INFO - __main__ - Step 20228: {'lr': 0.00048152041112219926, 'samples': 3883776, 'steps': 20227, 'loss/train': 1.5912150144577026} 11/07/2021 00:05:33 - INFO - __main__ - Step 20229: {'lr': 0.0004815184087123479, 'samples': 3883968, 'steps': 20228, 'loss/train': 1.2535486221313477} 11/07/2021 00:05:33 - INFO - __main__ - Step 20230: {'lr': 0.0004815164061981778, 'samples': 3884160, 'steps': 20229, 'loss/train': 1.611649990081787} 11/07/2021 00:05:33 - INFO - __main__ - Step 20231: {'lr': 0.0004815144035796901, 'samples': 3884352, 'steps': 20230, 'loss/train': 1.4114503860473633} 11/07/2021 00:05:34 - INFO - __main__ - Step 20232: {'lr': 0.0004815124008568856, 'samples': 3884544, 'steps': 20231, 'loss/train': 1.8829342126846313} 11/07/2021 00:05:35 - INFO - __main__ - Step 20233: {'lr': 0.00048151039802976517, 'samples': 3884736, 'steps': 20232, 'loss/train': 1.1377381086349487} 11/07/2021 00:05:35 - INFO - __main__ - Step 20234: {'lr': 0.00048150839509832966, 'samples': 3884928, 'steps': 20233, 'loss/train': 1.054691195487976} 11/07/2021 00:05:36 - INFO - __main__ - Step 20235: {'lr': 0.0004815063920625801, 'samples': 3885120, 'steps': 20234, 'loss/train': 1.5479214191436768} 11/07/2021 00:05:36 - INFO - __main__ - Step 20236: {'lr': 0.00048150438892251724, 'samples': 3885312, 'steps': 20235, 'loss/train': 1.5953601598739624} 11/07/2021 00:05:36 - INFO - __main__ - Step 20237: {'lr': 0.00048150238567814217, 'samples': 3885504, 'steps': 20236, 'loss/train': 1.5126464366912842} 11/07/2021 00:05:37 - INFO - __main__ - Step 20238: {'lr': 0.0004815003823294557, 'samples': 3885696, 'steps': 20237, 'loss/train': 1.6836274862289429} 11/07/2021 00:05:38 - INFO - __main__ - Step 20239: {'lr': 0.0004814983788764587, 'samples': 3885888, 'steps': 20238, 'loss/train': 1.861586332321167} 11/07/2021 00:05:38 - INFO - __main__ - Step 20240: {'lr': 0.00048149637531915215, 'samples': 3886080, 'steps': 20239, 'loss/train': 1.6984490156173706} 11/07/2021 00:05:38 - INFO - __main__ - Step 20241: {'lr': 0.00048149437165753684, 'samples': 3886272, 'steps': 20240, 'loss/train': 1.4807697534561157} 11/07/2021 00:05:39 - INFO - __main__ - Step 20242: {'lr': 0.00048149236789161374, 'samples': 3886464, 'steps': 20241, 'loss/train': 1.7156288623809814} 11/07/2021 00:05:40 - INFO - __main__ - Step 20243: {'lr': 0.0004814903640213838, 'samples': 3886656, 'steps': 20242, 'loss/train': 1.6998640298843384} 11/07/2021 00:05:40 - INFO - __main__ - Step 20244: {'lr': 0.0004814883600468478, 'samples': 3886848, 'steps': 20243, 'loss/train': 1.2742701768875122} 11/07/2021 00:05:40 - INFO - __main__ - Step 20245: {'lr': 0.0004814863559680068, 'samples': 3887040, 'steps': 20244, 'loss/train': 1.1318578720092773} 11/07/2021 00:05:41 - INFO - __main__ - Step 20246: {'lr': 0.00048148435178486156, 'samples': 3887232, 'steps': 20245, 'loss/train': 0.7671478986740112} 11/07/2021 00:05:41 - INFO - __main__ - Step 20247: {'lr': 0.00048148234749741304, 'samples': 3887424, 'steps': 20246, 'loss/train': 1.5640817880630493} 11/07/2021 00:05:42 - INFO - __main__ - Step 20248: {'lr': 0.0004814803431056622, 'samples': 3887616, 'steps': 20247, 'loss/train': 1.6258333921432495} 11/07/2021 00:05:43 - INFO - __main__ - Step 20249: {'lr': 0.0004814783386096099, 'samples': 3887808, 'steps': 20248, 'loss/train': 1.5049912929534912} 11/07/2021 00:05:43 - INFO - __main__ - Step 20250: {'lr': 0.00048147633400925693, 'samples': 3888000, 'steps': 20249, 'loss/train': 1.4869909286499023} 11/07/2021 00:05:43 - INFO - __main__ - Step 20251: {'lr': 0.00048147432930460433, 'samples': 3888192, 'steps': 20250, 'loss/train': 1.4749869108200073} 11/07/2021 00:05:44 - INFO - __main__ - Step 20252: {'lr': 0.00048147232449565305, 'samples': 3888384, 'steps': 20251, 'loss/train': 1.609937310218811} 11/07/2021 00:05:44 - INFO - __main__ - Step 20253: {'lr': 0.00048147031958240384, 'samples': 3888576, 'steps': 20252, 'loss/train': 1.487269639968872} 11/07/2021 00:05:45 - INFO - __main__ - Step 20254: {'lr': 0.00048146831456485776, 'samples': 3888768, 'steps': 20253, 'loss/train': 1.6627097129821777} 11/07/2021 00:05:45 - INFO - __main__ - Step 20255: {'lr': 0.0004814663094430155, 'samples': 3888960, 'steps': 20254, 'loss/train': 1.5854759216308594} 11/07/2021 00:05:46 - INFO - __main__ - Step 20256: {'lr': 0.00048146430421687817, 'samples': 3889152, 'steps': 20255, 'loss/train': 1.851196527481079} 11/07/2021 00:05:46 - INFO - __main__ - Step 20257: {'lr': 0.00048146229888644656, 'samples': 3889344, 'steps': 20256, 'loss/train': 1.4912571907043457} 11/07/2021 00:05:47 - INFO - __main__ - Step 20258: {'lr': 0.00048146029345172165, 'samples': 3889536, 'steps': 20257, 'loss/train': 1.307698369026184} 11/07/2021 00:05:47 - INFO - __main__ - Step 20259: {'lr': 0.0004814582879127043, 'samples': 3889728, 'steps': 20258, 'loss/train': 1.6989030838012695} 11/07/2021 00:05:48 - INFO - __main__ - Step 20260: {'lr': 0.0004814562822693954, 'samples': 3889920, 'steps': 20259, 'loss/train': 1.2388862371444702} 11/07/2021 00:05:48 - INFO - __main__ - Step 20261: {'lr': 0.00048145427652179583, 'samples': 3890112, 'steps': 20260, 'loss/train': 2.0702147483825684} 11/07/2021 00:05:49 - INFO - __main__ - Step 20262: {'lr': 0.0004814522706699066, 'samples': 3890304, 'steps': 20261, 'loss/train': 1.5232195854187012} 11/07/2021 00:05:49 - INFO - __main__ - Step 20263: {'lr': 0.00048145026471372855, 'samples': 3890496, 'steps': 20262, 'loss/train': 1.651239275932312} 11/07/2021 00:05:50 - INFO - __main__ - Step 20264: {'lr': 0.0004814482586532626, 'samples': 3890688, 'steps': 20263, 'loss/train': 1.808660626411438} 11/07/2021 00:05:50 - INFO - __main__ - Step 20265: {'lr': 0.00048144625248850955, 'samples': 3890880, 'steps': 20264, 'loss/train': 1.8271234035491943} 11/07/2021 00:05:51 - INFO - __main__ - Step 20266: {'lr': 0.0004814442462194704, 'samples': 3891072, 'steps': 20265, 'loss/train': 1.453490972518921} 11/07/2021 00:05:51 - INFO - __main__ - Step 20267: {'lr': 0.0004814422398461461, 'samples': 3891264, 'steps': 20266, 'loss/train': 0.8812512159347534} 11/07/2021 00:05:51 - INFO - __main__ - Step 20268: {'lr': 0.00048144023336853746, 'samples': 3891456, 'steps': 20267, 'loss/train': 1.5316873788833618} 11/07/2021 00:05:52 - INFO - __main__ - Step 20269: {'lr': 0.00048143822678664545, 'samples': 3891648, 'steps': 20268, 'loss/train': 1.776229739189148} 11/07/2021 00:05:53 - INFO - __main__ - Step 20270: {'lr': 0.00048143622010047096, 'samples': 3891840, 'steps': 20269, 'loss/train': 1.6957920789718628} 11/07/2021 00:05:53 - INFO - __main__ - Step 20271: {'lr': 0.0004814342133100149, 'samples': 3892032, 'steps': 20270, 'loss/train': 1.6003910303115845} 11/07/2021 00:05:54 - INFO - __main__ - Step 20272: {'lr': 0.00048143220641527805, 'samples': 3892224, 'steps': 20271, 'loss/train': 0.8694155812263489} 11/07/2021 00:05:54 - INFO - __main__ - Step 20273: {'lr': 0.0004814301994162615, 'samples': 3892416, 'steps': 20272, 'loss/train': 1.7320939302444458} 11/07/2021 00:05:55 - INFO - __main__ - Step 20274: {'lr': 0.000481428192312966, 'samples': 3892608, 'steps': 20273, 'loss/train': 1.8822886943817139} 11/07/2021 00:05:55 - INFO - __main__ - Step 20275: {'lr': 0.0004814261851053926, 'samples': 3892800, 'steps': 20274, 'loss/train': 1.6996090412139893} 11/07/2021 00:05:56 - INFO - __main__ - Step 20276: {'lr': 0.00048142417779354214, 'samples': 3892992, 'steps': 20275, 'loss/train': 1.6192679405212402} 11/07/2021 00:05:56 - INFO - __main__ - Step 20277: {'lr': 0.0004814221703774155, 'samples': 3893184, 'steps': 20276, 'loss/train': 1.9482049942016602} 11/07/2021 00:05:56 - INFO - __main__ - Step 20278: {'lr': 0.00048142016285701356, 'samples': 3893376, 'steps': 20277, 'loss/train': 1.8611611127853394} 11/07/2021 00:05:57 - INFO - __main__ - Step 20279: {'lr': 0.00048141815523233735, 'samples': 3893568, 'steps': 20278, 'loss/train': 1.4806917905807495} 11/07/2021 00:05:58 - INFO - __main__ - Step 20280: {'lr': 0.00048141614750338757, 'samples': 3893760, 'steps': 20279, 'loss/train': 1.6425491571426392} 11/07/2021 00:05:58 - INFO - __main__ - Step 20281: {'lr': 0.00048141413967016535, 'samples': 3893952, 'steps': 20280, 'loss/train': 1.5973680019378662} 11/07/2021 00:05:58 - INFO - __main__ - Step 20282: {'lr': 0.00048141213173267145, 'samples': 3894144, 'steps': 20281, 'loss/train': 2.633230447769165} 11/07/2021 00:05:59 - INFO - __main__ - Step 20283: {'lr': 0.0004814101236909068, 'samples': 3894336, 'steps': 20282, 'loss/train': 1.511309266090393} 11/07/2021 00:05:59 - INFO - __main__ - Step 20284: {'lr': 0.00048140811554487234, 'samples': 3894528, 'steps': 20283, 'loss/train': 0.8016172647476196} 11/07/2021 00:06:00 - INFO - __main__ - Step 20285: {'lr': 0.000481406107294569, 'samples': 3894720, 'steps': 20284, 'loss/train': 1.877029538154602} 11/07/2021 00:06:01 - INFO - __main__ - Step 20286: {'lr': 0.0004814040989399975, 'samples': 3894912, 'steps': 20285, 'loss/train': 0.7532491087913513} 11/07/2021 00:06:01 - INFO - __main__ - Step 20287: {'lr': 0.000481402090481159, 'samples': 3895104, 'steps': 20286, 'loss/train': 1.5717297792434692} 11/07/2021 00:06:01 - INFO - __main__ - Step 20288: {'lr': 0.0004814000819180543, 'samples': 3895296, 'steps': 20287, 'loss/train': 1.7239612340927124} 11/07/2021 00:06:02 - INFO - __main__ - Step 20289: {'lr': 0.00048139807325068423, 'samples': 3895488, 'steps': 20288, 'loss/train': 0.5814390778541565} 11/07/2021 00:06:03 - INFO - __main__ - Step 20290: {'lr': 0.0004813960644790498, 'samples': 3895680, 'steps': 20289, 'loss/train': 1.498609185218811} 11/07/2021 00:06:03 - INFO - __main__ - Step 20291: {'lr': 0.00048139405560315186, 'samples': 3895872, 'steps': 20290, 'loss/train': 1.5184834003448486} 11/07/2021 00:06:03 - INFO - __main__ - Step 20292: {'lr': 0.0004813920466229913, 'samples': 3896064, 'steps': 20291, 'loss/train': 1.1429948806762695} 11/07/2021 00:06:04 - INFO - __main__ - Step 20293: {'lr': 0.0004813900375385691, 'samples': 3896256, 'steps': 20292, 'loss/train': 2.788299083709717} 11/07/2021 00:06:04 - INFO - __main__ - Step 20294: {'lr': 0.0004813880283498861, 'samples': 3896448, 'steps': 20293, 'loss/train': 1.6631759405136108} 11/07/2021 00:06:05 - INFO - __main__ - Step 20295: {'lr': 0.00048138601905694324, 'samples': 3896640, 'steps': 20294, 'loss/train': 1.9560562372207642} 11/07/2021 00:06:06 - INFO - __main__ - Step 20296: {'lr': 0.0004813840096597414, 'samples': 3896832, 'steps': 20295, 'loss/train': 1.8317298889160156} 11/07/2021 00:06:06 - INFO - __main__ - Step 20297: {'lr': 0.00048138200015828146, 'samples': 3897024, 'steps': 20296, 'loss/train': 1.5180134773254395} 11/07/2021 00:06:06 - INFO - __main__ - Step 20298: {'lr': 0.00048137999055256444, 'samples': 3897216, 'steps': 20297, 'loss/train': 1.6339629888534546} 11/07/2021 00:06:07 - INFO - __main__ - Step 20299: {'lr': 0.0004813779808425911, 'samples': 3897408, 'steps': 20298, 'loss/train': 1.5810672044754028} 11/07/2021 00:06:07 - INFO - __main__ - Step 20300: {'lr': 0.0004813759710283624, 'samples': 3897600, 'steps': 20299, 'loss/train': 1.6098214387893677} 11/07/2021 00:06:08 - INFO - __main__ - Step 20301: {'lr': 0.0004813739611098793, 'samples': 3897792, 'steps': 20300, 'loss/train': 1.575744867324829} 11/07/2021 00:06:09 - INFO - __main__ - Step 20302: {'lr': 0.00048137195108714266, 'samples': 3897984, 'steps': 20301, 'loss/train': 0.8211456537246704} 11/07/2021 00:06:09 - INFO - __main__ - Step 20303: {'lr': 0.00048136994096015343, 'samples': 3898176, 'steps': 20302, 'loss/train': 1.674750566482544} 11/07/2021 00:06:09 - INFO - __main__ - Step 20304: {'lr': 0.00048136793072891236, 'samples': 3898368, 'steps': 20303, 'loss/train': 0.8324387669563293} 11/07/2021 00:06:10 - INFO - __main__ - Step 20305: {'lr': 0.00048136592039342053, 'samples': 3898560, 'steps': 20304, 'loss/train': 1.7422436475753784} 11/07/2021 00:06:11 - INFO - __main__ - Step 20306: {'lr': 0.0004813639099536789, 'samples': 3898752, 'steps': 20305, 'loss/train': 1.2161121368408203} 11/07/2021 00:06:11 - INFO - __main__ - Step 20307: {'lr': 0.0004813618994096881, 'samples': 3898944, 'steps': 20306, 'loss/train': 1.429854393005371} 11/07/2021 00:06:11 - INFO - __main__ - Step 20308: {'lr': 0.0004813598887614492, 'samples': 3899136, 'steps': 20307, 'loss/train': 1.7629268169403076} 11/07/2021 00:06:12 - INFO - __main__ - Step 20309: {'lr': 0.0004813578780089632, 'samples': 3899328, 'steps': 20308, 'loss/train': 1.690520167350769} 11/07/2021 00:06:12 - INFO - __main__ - Step 20310: {'lr': 0.00048135586715223087, 'samples': 3899520, 'steps': 20309, 'loss/train': 1.9133003950119019} 11/07/2021 00:06:13 - INFO - __main__ - Step 20311: {'lr': 0.00048135385619125316, 'samples': 3899712, 'steps': 20310, 'loss/train': 1.6012578010559082} 11/07/2021 00:06:13 - INFO - __main__ - Step 20312: {'lr': 0.00048135184512603093, 'samples': 3899904, 'steps': 20311, 'loss/train': 0.8476859927177429} 11/07/2021 00:06:14 - INFO - __main__ - Step 20313: {'lr': 0.00048134983395656516, 'samples': 3900096, 'steps': 20312, 'loss/train': 1.9039126634597778} 11/07/2021 00:06:14 - INFO - __main__ - Step 20314: {'lr': 0.00048134782268285676, 'samples': 3900288, 'steps': 20313, 'loss/train': 1.62480890750885} 11/07/2021 00:06:15 - INFO - __main__ - Step 20315: {'lr': 0.00048134581130490655, 'samples': 3900480, 'steps': 20314, 'loss/train': 1.6092077493667603} 11/07/2021 00:06:15 - INFO - __main__ - Step 20316: {'lr': 0.0004813437998227155, 'samples': 3900672, 'steps': 20315, 'loss/train': 0.9180329442024231} 11/07/2021 00:06:16 - INFO - __main__ - Step 20317: {'lr': 0.00048134178823628455, 'samples': 3900864, 'steps': 20316, 'loss/train': 1.6326484680175781} 11/07/2021 00:06:16 - INFO - __main__ - Step 20318: {'lr': 0.0004813397765456145, 'samples': 3901056, 'steps': 20317, 'loss/train': 1.1808871030807495} 11/07/2021 00:06:17 - INFO - __main__ - Step 20319: {'lr': 0.00048133776475070637, 'samples': 3901248, 'steps': 20318, 'loss/train': 1.7565277814865112} 11/07/2021 00:06:17 - INFO - __main__ - Step 20320: {'lr': 0.00048133575285156093, 'samples': 3901440, 'steps': 20319, 'loss/train': 1.70387864112854} 11/07/2021 00:06:18 - INFO - __main__ - Step 20321: {'lr': 0.00048133374084817927, 'samples': 3901632, 'steps': 20320, 'loss/train': 0.9791955947875977} 11/07/2021 00:06:18 - INFO - __main__ - Step 20322: {'lr': 0.00048133172874056213, 'samples': 3901824, 'steps': 20321, 'loss/train': 1.9139925241470337} 11/07/2021 00:06:19 - INFO - __main__ - Step 20323: {'lr': 0.0004813297165287105, 'samples': 3902016, 'steps': 20322, 'loss/train': 1.494025707244873} 11/07/2021 00:06:19 - INFO - __main__ - Step 20324: {'lr': 0.00048132770421262526, 'samples': 3902208, 'steps': 20323, 'loss/train': 1.3976880311965942} 11/07/2021 00:06:19 - INFO - __main__ - Step 20325: {'lr': 0.00048132569179230736, 'samples': 3902400, 'steps': 20324, 'loss/train': 1.5085182189941406} 11/07/2021 00:06:20 - INFO - __main__ - Step 20326: {'lr': 0.0004813236792677577, 'samples': 3902592, 'steps': 20325, 'loss/train': 1.7275755405426025} 11/07/2021 00:06:21 - INFO - __main__ - Step 20327: {'lr': 0.00048132166663897703, 'samples': 3902784, 'steps': 20326, 'loss/train': 1.1090145111083984} 11/07/2021 00:06:21 - INFO - __main__ - Step 20328: {'lr': 0.0004813196539059665, 'samples': 3902976, 'steps': 20327, 'loss/train': 1.493940830230713} 11/07/2021 00:06:22 - INFO - __main__ - Step 20329: {'lr': 0.0004813176410687269, 'samples': 3903168, 'steps': 20328, 'loss/train': 1.5074502229690552} 11/07/2021 00:06:22 - INFO - __main__ - Step 20330: {'lr': 0.00048131562812725904, 'samples': 3903360, 'steps': 20329, 'loss/train': 1.7746409177780151} 11/07/2021 00:06:22 - INFO - __main__ - Step 20331: {'lr': 0.000481313615081564, 'samples': 3903552, 'steps': 20330, 'loss/train': 5.725223064422607} 11/07/2021 00:06:23 - INFO - __main__ - Step 20332: {'lr': 0.00048131160193164266, 'samples': 3903744, 'steps': 20331, 'loss/train': 1.2115626335144043} 11/07/2021 00:06:24 - INFO - __main__ - Step 20333: {'lr': 0.0004813095886774958, 'samples': 3903936, 'steps': 20332, 'loss/train': 1.7731631994247437} 11/07/2021 00:06:24 - INFO - __main__ - Step 20334: {'lr': 0.00048130757531912447, 'samples': 3904128, 'steps': 20333, 'loss/train': 1.7672775983810425} 11/07/2021 00:06:24 - INFO - __main__ - Step 20335: {'lr': 0.00048130556185652947, 'samples': 3904320, 'steps': 20334, 'loss/train': 1.9235178232192993} 11/07/2021 00:06:25 - INFO - __main__ - Step 20336: {'lr': 0.0004813035482897118, 'samples': 3904512, 'steps': 20335, 'loss/train': 1.494145393371582} 11/07/2021 00:06:25 - INFO - __main__ - Step 20337: {'lr': 0.00048130153461867225, 'samples': 3904704, 'steps': 20336, 'loss/train': 1.8234186172485352} 11/07/2021 00:06:26 - INFO - __main__ - Step 20338: {'lr': 0.0004812995208434119, 'samples': 3904896, 'steps': 20337, 'loss/train': 1.926507592201233} 11/07/2021 00:06:27 - INFO - __main__ - Step 20339: {'lr': 0.00048129750696393144, 'samples': 3905088, 'steps': 20338, 'loss/train': 1.6730066537857056} 11/07/2021 00:06:27 - INFO - __main__ - Step 20340: {'lr': 0.00048129549298023196, 'samples': 3905280, 'steps': 20339, 'loss/train': 1.868593692779541} 11/07/2021 00:06:27 - INFO - __main__ - Step 20341: {'lr': 0.0004812934788923143, 'samples': 3905472, 'steps': 20340, 'loss/train': 1.4159256219863892} 11/07/2021 00:06:28 - INFO - __main__ - Step 20342: {'lr': 0.00048129146470017933, 'samples': 3905664, 'steps': 20341, 'loss/train': 1.5150226354599} 11/07/2021 00:06:29 - INFO - __main__ - Step 20343: {'lr': 0.000481289450403828, 'samples': 3905856, 'steps': 20342, 'loss/train': 1.8831218481063843} 11/07/2021 00:06:29 - INFO - __main__ - Step 20344: {'lr': 0.0004812874360032613, 'samples': 3906048, 'steps': 20343, 'loss/train': 1.6416627168655396} 11/07/2021 00:06:29 - INFO - __main__ - Step 20345: {'lr': 0.0004812854214984799, 'samples': 3906240, 'steps': 20344, 'loss/train': 1.546629548072815} 11/07/2021 00:06:30 - INFO - __main__ - Step 20346: {'lr': 0.000481283406889485, 'samples': 3906432, 'steps': 20345, 'loss/train': 1.5926127433776855} 11/07/2021 00:06:30 - INFO - __main__ - Step 20347: {'lr': 0.00048128139217627725, 'samples': 3906624, 'steps': 20346, 'loss/train': 1.4639637470245361} 11/07/2021 00:06:31 - INFO - __main__ - Step 20348: {'lr': 0.00048127937735885774, 'samples': 3906816, 'steps': 20347, 'loss/train': 1.5859371423721313} 11/07/2021 00:06:32 - INFO - __main__ - Step 20349: {'lr': 0.0004812773624372273, 'samples': 3907008, 'steps': 20348, 'loss/train': 1.5963897705078125} 11/07/2021 00:06:32 - INFO - __main__ - Step 20350: {'lr': 0.0004812753474113869, 'samples': 3907200, 'steps': 20349, 'loss/train': 1.499479055404663} 11/07/2021 00:06:32 - INFO - __main__ - Step 20351: {'lr': 0.0004812733322813373, 'samples': 3907392, 'steps': 20350, 'loss/train': 1.5133908987045288} 11/07/2021 00:06:33 - INFO - __main__ - Step 20352: {'lr': 0.00048127131704707953, 'samples': 3907584, 'steps': 20351, 'loss/train': 1.4553083181381226} 11/07/2021 00:06:34 - INFO - __main__ - Step 20353: {'lr': 0.0004812693017086145, 'samples': 3907776, 'steps': 20352, 'loss/train': 1.3714314699172974} 11/07/2021 00:06:34 - INFO - __main__ - Step 20354: {'lr': 0.00048126728626594315, 'samples': 3907968, 'steps': 20353, 'loss/train': 1.7511000633239746} 11/07/2021 00:06:34 - INFO - __main__ - Step 20355: {'lr': 0.00048126527071906623, 'samples': 3908160, 'steps': 20354, 'loss/train': 1.3997986316680908} 11/07/2021 00:06:35 - INFO - __main__ - Step 20356: {'lr': 0.0004812632550679848, 'samples': 3908352, 'steps': 20355, 'loss/train': 1.9811984300613403} 11/07/2021 00:06:35 - INFO - __main__ - Step 20357: {'lr': 0.00048126123931269973, 'samples': 3908544, 'steps': 20356, 'loss/train': 1.576959252357483} 11/07/2021 00:06:36 - INFO - __main__ - Step 20358: {'lr': 0.0004812592234532118, 'samples': 3908736, 'steps': 20357, 'loss/train': 1.697954535484314} 11/07/2021 00:06:36 - INFO - __main__ - Step 20359: {'lr': 0.00048125720748952216, 'samples': 3908928, 'steps': 20358, 'loss/train': 2.012244939804077} 11/07/2021 00:06:37 - INFO - __main__ - Step 20360: {'lr': 0.00048125519142163157, 'samples': 3909120, 'steps': 20359, 'loss/train': 1.5944174528121948} 11/07/2021 00:06:37 - INFO - __main__ - Step 20361: {'lr': 0.0004812531752495409, 'samples': 3909312, 'steps': 20360, 'loss/train': 1.4070159196853638} 11/07/2021 00:06:37 - INFO - __main__ - Step 20362: {'lr': 0.00048125115897325115, 'samples': 3909504, 'steps': 20361, 'loss/train': 0.829056441783905} 11/07/2021 00:06:39 - INFO - __main__ - Step 20363: {'lr': 0.0004812491425927632, 'samples': 3909696, 'steps': 20362, 'loss/train': 0.8168201446533203} 11/07/2021 00:06:39 - INFO - __main__ - Step 20364: {'lr': 0.000481247126108078, 'samples': 3909888, 'steps': 20363, 'loss/train': 1.7574135065078735} 11/07/2021 00:06:39 - INFO - __main__ - Step 20365: {'lr': 0.00048124510951919633, 'samples': 3910080, 'steps': 20364, 'loss/train': 1.841825008392334} 11/07/2021 00:06:40 - INFO - __main__ - Step 20366: {'lr': 0.0004812430928261192, 'samples': 3910272, 'steps': 20365, 'loss/train': 1.7087429761886597} 11/07/2021 00:06:40 - INFO - __main__ - Step 20367: {'lr': 0.00048124107602884753, 'samples': 3910464, 'steps': 20366, 'loss/train': 1.9521450996398926} 11/07/2021 00:06:40 - INFO - __main__ - Step 20368: {'lr': 0.0004812390591273822, 'samples': 3910656, 'steps': 20367, 'loss/train': 1.9141671657562256} 11/07/2021 00:06:41 - INFO - __main__ - Step 20369: {'lr': 0.00048123704212172416, 'samples': 3910848, 'steps': 20368, 'loss/train': 1.6557412147521973} 11/07/2021 00:06:42 - INFO - __main__ - Step 20370: {'lr': 0.0004812350250118742, 'samples': 3911040, 'steps': 20369, 'loss/train': 2.031385660171509} 11/07/2021 00:06:42 - INFO - __main__ - Step 20371: {'lr': 0.0004812330077978333, 'samples': 3911232, 'steps': 20370, 'loss/train': 1.6474828720092773} 11/07/2021 00:06:42 - INFO - __main__ - Step 20372: {'lr': 0.0004812309904796024, 'samples': 3911424, 'steps': 20371, 'loss/train': 1.5013188123703003} 11/07/2021 00:06:43 - INFO - __main__ - Step 20373: {'lr': 0.0004812289730571824, 'samples': 3911616, 'steps': 20372, 'loss/train': 1.3218475580215454} 11/07/2021 00:06:44 - INFO - __main__ - Step 20374: {'lr': 0.00048122695553057417, 'samples': 3911808, 'steps': 20373, 'loss/train': 1.8798691034317017} 11/07/2021 00:06:44 - INFO - __main__ - Step 20375: {'lr': 0.00048122493789977866, 'samples': 3912000, 'steps': 20374, 'loss/train': 1.5554605722427368} 11/07/2021 00:06:45 - INFO - __main__ - Step 20376: {'lr': 0.00048122292016479674, 'samples': 3912192, 'steps': 20375, 'loss/train': 1.2647597789764404} 11/07/2021 00:06:45 - INFO - __main__ - Step 20377: {'lr': 0.0004812209023256294, 'samples': 3912384, 'steps': 20376, 'loss/train': 5.788717746734619} 11/07/2021 00:06:45 - INFO - __main__ - Step 20378: {'lr': 0.0004812188843822775, 'samples': 3912576, 'steps': 20377, 'loss/train': 1.56723153591156} 11/07/2021 00:06:46 - INFO - __main__ - Step 20379: {'lr': 0.0004812168663347418, 'samples': 3912768, 'steps': 20378, 'loss/train': 1.824082374572754} 11/07/2021 00:06:47 - INFO - __main__ - Step 20380: {'lr': 0.00048121484818302343, 'samples': 3912960, 'steps': 20379, 'loss/train': 1.4097694158554077} 11/07/2021 00:06:47 - INFO - __main__ - Step 20381: {'lr': 0.00048121282992712324, 'samples': 3913152, 'steps': 20380, 'loss/train': 1.7064259052276611} 11/07/2021 00:06:47 - INFO - __main__ - Step 20382: {'lr': 0.00048121081156704207, 'samples': 3913344, 'steps': 20381, 'loss/train': 1.8313816785812378} 11/07/2021 00:06:48 - INFO - __main__ - Step 20383: {'lr': 0.00048120879310278094, 'samples': 3913536, 'steps': 20382, 'loss/train': 0.8189948797225952} 11/07/2021 00:06:48 - INFO - __main__ - Step 20384: {'lr': 0.00048120677453434066, 'samples': 3913728, 'steps': 20383, 'loss/train': 1.7617720365524292} 11/07/2021 00:06:49 - INFO - __main__ - Step 20385: {'lr': 0.00048120475586172217, 'samples': 3913920, 'steps': 20384, 'loss/train': 1.839327335357666} 11/07/2021 00:06:49 - INFO - __main__ - Step 20386: {'lr': 0.00048120273708492637, 'samples': 3914112, 'steps': 20385, 'loss/train': 1.4274979829788208} 11/07/2021 00:06:50 - INFO - __main__ - Step 20387: {'lr': 0.0004812007182039542, 'samples': 3914304, 'steps': 20386, 'loss/train': 1.763704538345337} 11/07/2021 00:06:50 - INFO - __main__ - Step 20388: {'lr': 0.00048119869921880656, 'samples': 3914496, 'steps': 20387, 'loss/train': 1.6406468152999878} 11/07/2021 00:06:50 - INFO - __main__ - Step 20389: {'lr': 0.00048119668012948434, 'samples': 3914688, 'steps': 20388, 'loss/train': 1.7437411546707153} 11/07/2021 00:06:51 - INFO - __main__ - Step 20390: {'lr': 0.0004811946609359885, 'samples': 3914880, 'steps': 20389, 'loss/train': 1.9098658561706543} 11/07/2021 00:06:52 - INFO - __main__ - Step 20391: {'lr': 0.00048119264163831987, 'samples': 3915072, 'steps': 20390, 'loss/train': 1.7999954223632812} 11/07/2021 00:06:52 - INFO - __main__ - Step 20392: {'lr': 0.0004811906222364794, 'samples': 3915264, 'steps': 20391, 'loss/train': 1.9051964282989502} 11/07/2021 00:06:53 - INFO - __main__ - Step 20393: {'lr': 0.00048118860273046804, 'samples': 3915456, 'steps': 20392, 'loss/train': 1.446048378944397} 11/07/2021 00:06:53 - INFO - __main__ - Step 20394: {'lr': 0.00048118658312028663, 'samples': 3915648, 'steps': 20393, 'loss/train': 1.7522889375686646} 11/07/2021 00:06:53 - INFO - __main__ - Step 20395: {'lr': 0.0004811845634059361, 'samples': 3915840, 'steps': 20394, 'loss/train': 1.8213437795639038} 11/07/2021 00:06:54 - INFO - __main__ - Step 20396: {'lr': 0.0004811825435874174, 'samples': 3916032, 'steps': 20395, 'loss/train': 1.985606074333191} 11/07/2021 00:06:55 - INFO - __main__ - Step 20397: {'lr': 0.0004811805236647314, 'samples': 3916224, 'steps': 20396, 'loss/train': 1.8085836172103882} 11/07/2021 00:06:55 - INFO - __main__ - Step 20398: {'lr': 0.0004811785036378791, 'samples': 3916416, 'steps': 20397, 'loss/train': 1.7104543447494507} 11/07/2021 00:06:55 - INFO - __main__ - Step 20399: {'lr': 0.0004811764835068613, 'samples': 3916608, 'steps': 20398, 'loss/train': 1.8539533615112305} 11/07/2021 00:06:56 - INFO - __main__ - Step 20400: {'lr': 0.0004811744632716789, 'samples': 3916800, 'steps': 20399, 'loss/train': 1.566263198852539} 11/07/2021 00:06:57 - INFO - __main__ - Step 20401: {'lr': 0.0004811724429323329, 'samples': 3916992, 'steps': 20400, 'loss/train': 1.7747917175292969} 11/07/2021 00:06:57 - INFO - __main__ - Step 20402: {'lr': 0.0004811704224888241, 'samples': 3917184, 'steps': 20401, 'loss/train': 1.5107859373092651} 11/07/2021 00:06:58 - INFO - __main__ - Step 20403: {'lr': 0.0004811684019411535, 'samples': 3917376, 'steps': 20402, 'loss/train': 1.6205443143844604} 11/07/2021 00:06:58 - INFO - __main__ - Step 20404: {'lr': 0.000481166381289322, 'samples': 3917568, 'steps': 20403, 'loss/train': 1.3343929052352905} 11/07/2021 00:06:58 - INFO - __main__ - Step 20405: {'lr': 0.0004811643605333305, 'samples': 3917760, 'steps': 20404, 'loss/train': 1.218215823173523} 11/07/2021 00:07:00 - INFO - __main__ - Step 20406: {'lr': 0.0004811623396731799, 'samples': 3917952, 'steps': 20405, 'loss/train': 0.7047050595283508} 11/07/2021 00:07:00 - INFO - __main__ - Step 20407: {'lr': 0.0004811603187088711, 'samples': 3918144, 'steps': 20406, 'loss/train': 1.5910859107971191} 11/07/2021 00:07:00 - INFO - __main__ - Step 20408: {'lr': 0.00048115829764040503, 'samples': 3918336, 'steps': 20407, 'loss/train': 1.57695734500885} 11/07/2021 00:07:01 - INFO - __main__ - Step 20409: {'lr': 0.0004811562764677826, 'samples': 3918528, 'steps': 20408, 'loss/train': 2.5446133613586426} 11/07/2021 00:07:01 - INFO - __main__ - Step 20410: {'lr': 0.00048115425519100474, 'samples': 3918720, 'steps': 20409, 'loss/train': 1.1176563501358032} 11/07/2021 00:07:01 - INFO - __main__ - Step 20411: {'lr': 0.0004811522338100723, 'samples': 3918912, 'steps': 20410, 'loss/train': 1.3174539804458618} 11/07/2021 00:07:02 - INFO - __main__ - Step 20412: {'lr': 0.0004811502123249862, 'samples': 3919104, 'steps': 20411, 'loss/train': 1.4090980291366577} 11/07/2021 00:07:03 - INFO - __main__ - Step 20413: {'lr': 0.0004811481907357475, 'samples': 3919296, 'steps': 20412, 'loss/train': 2.012705087661743} 11/07/2021 00:07:03 - INFO - __main__ - Step 20414: {'lr': 0.000481146169042357, 'samples': 3919488, 'steps': 20413, 'loss/train': 1.4001191854476929} 11/07/2021 00:07:03 - INFO - __main__ - Step 20415: {'lr': 0.0004811441472448155, 'samples': 3919680, 'steps': 20414, 'loss/train': 1.7796525955200195} 11/07/2021 00:07:04 - INFO - __main__ - Step 20416: {'lr': 0.000481142125343124, 'samples': 3919872, 'steps': 20415, 'loss/train': 1.5453555583953857} 11/07/2021 00:07:05 - INFO - __main__ - Step 20417: {'lr': 0.0004811401033372835, 'samples': 3920064, 'steps': 20416, 'loss/train': 1.4924184083938599} 11/07/2021 00:07:05 - INFO - __main__ - Step 20418: {'lr': 0.0004811380812272948, 'samples': 3920256, 'steps': 20417, 'loss/train': 1.5377154350280762} 11/07/2021 00:07:06 - INFO - __main__ - Step 20419: {'lr': 0.0004811360590131589, 'samples': 3920448, 'steps': 20418, 'loss/train': 1.6782273054122925} 11/07/2021 00:07:06 - INFO - __main__ - Step 20420: {'lr': 0.00048113403669487655, 'samples': 3920640, 'steps': 20419, 'loss/train': 1.8343051671981812} 11/07/2021 00:07:06 - INFO - __main__ - Step 20421: {'lr': 0.0004811320142724489, 'samples': 3920832, 'steps': 20420, 'loss/train': 1.8262782096862793} 11/07/2021 00:07:07 - INFO - __main__ - Step 20422: {'lr': 0.0004811299917458766, 'samples': 3921024, 'steps': 20421, 'loss/train': 1.108807921409607} 11/07/2021 00:07:08 - INFO - __main__ - Step 20423: {'lr': 0.00048112796911516076, 'samples': 3921216, 'steps': 20422, 'loss/train': 1.0290982723236084} 11/07/2021 00:07:08 - INFO - __main__ - Step 20424: {'lr': 0.00048112594638030225, 'samples': 3921408, 'steps': 20423, 'loss/train': 1.9398106336593628} 11/07/2021 00:07:08 - INFO - __main__ - Step 20425: {'lr': 0.00048112392354130194, 'samples': 3921600, 'steps': 20424, 'loss/train': 1.6747199296951294} 11/07/2021 00:07:09 - INFO - __main__ - Step 20426: {'lr': 0.00048112190059816076, 'samples': 3921792, 'steps': 20425, 'loss/train': 1.918034315109253} 11/07/2021 00:07:10 - INFO - __main__ - Step 20427: {'lr': 0.0004811198775508796, 'samples': 3921984, 'steps': 20426, 'loss/train': 1.8439780473709106} 11/07/2021 00:07:10 - INFO - __main__ - Step 20428: {'lr': 0.0004811178543994593, 'samples': 3922176, 'steps': 20427, 'loss/train': 1.639901876449585} 11/07/2021 00:07:10 - INFO - __main__ - Step 20429: {'lr': 0.000481115831143901, 'samples': 3922368, 'steps': 20428, 'loss/train': 1.5237764120101929} 11/07/2021 00:07:11 - INFO - __main__ - Step 20430: {'lr': 0.00048111380778420544, 'samples': 3922560, 'steps': 20429, 'loss/train': 1.6115946769714355} 11/07/2021 00:07:11 - INFO - __main__ - Step 20431: {'lr': 0.0004811117843203735, 'samples': 3922752, 'steps': 20430, 'loss/train': 1.7255394458770752} 11/07/2021 00:07:12 - INFO - __main__ - Step 20432: {'lr': 0.00048110976075240624, 'samples': 3922944, 'steps': 20431, 'loss/train': 1.728033185005188} 11/07/2021 00:07:13 - INFO - __main__ - Step 20433: {'lr': 0.00048110773708030444, 'samples': 3923136, 'steps': 20432, 'loss/train': 1.7761973142623901} 11/07/2021 00:07:13 - INFO - __main__ - Step 20434: {'lr': 0.00048110571330406903, 'samples': 3923328, 'steps': 20433, 'loss/train': 1.1125842332839966} 11/07/2021 00:07:13 - INFO - __main__ - Step 20435: {'lr': 0.0004811036894237011, 'samples': 3923520, 'steps': 20434, 'loss/train': 1.707023024559021} 11/07/2021 00:07:14 - INFO - __main__ - Step 20436: {'lr': 0.00048110166543920125, 'samples': 3923712, 'steps': 20435, 'loss/train': 1.7297285795211792} 11/07/2021 00:07:14 - INFO - __main__ - Step 20437: {'lr': 0.0004810996413505706, 'samples': 3923904, 'steps': 20436, 'loss/train': 1.4574823379516602} 11/07/2021 00:07:15 - INFO - __main__ - Step 20438: {'lr': 0.0004810976171578101, 'samples': 3924096, 'steps': 20437, 'loss/train': 1.896390676498413} 11/07/2021 00:07:15 - INFO - __main__ - Step 20439: {'lr': 0.00048109559286092047, 'samples': 3924288, 'steps': 20438, 'loss/train': 1.5400142669677734} 11/07/2021 00:07:16 - INFO - __main__ - Step 20440: {'lr': 0.0004810935684599028, 'samples': 3924480, 'steps': 20439, 'loss/train': 2.090247631072998} 11/07/2021 00:07:16 - INFO - __main__ - Step 20441: {'lr': 0.00048109154395475787, 'samples': 3924672, 'steps': 20440, 'loss/train': 1.6449775695800781} 11/07/2021 00:07:16 - INFO - __main__ - Step 20442: {'lr': 0.00048108951934548673, 'samples': 3924864, 'steps': 20441, 'loss/train': 1.5495688915252686} 11/07/2021 00:07:17 - INFO - __main__ - Step 20443: {'lr': 0.0004810874946320901, 'samples': 3925056, 'steps': 20442, 'loss/train': 1.2552711963653564} 11/07/2021 00:07:18 - INFO - __main__ - Step 20444: {'lr': 0.00048108546981456916, 'samples': 3925248, 'steps': 20443, 'loss/train': 1.7868911027908325} 11/07/2021 00:07:18 - INFO - __main__ - Step 20445: {'lr': 0.0004810834448929246, 'samples': 3925440, 'steps': 20444, 'loss/train': 1.2708492279052734} 11/07/2021 00:07:18 - INFO - __main__ - Step 20446: {'lr': 0.0004810814198671574, 'samples': 3925632, 'steps': 20445, 'loss/train': 1.5799506902694702} 11/07/2021 00:07:19 - INFO - __main__ - Step 20447: {'lr': 0.00048107939473726846, 'samples': 3925824, 'steps': 20446, 'loss/train': 1.6609207391738892} 11/07/2021 00:07:20 - INFO - __main__ - Step 20448: {'lr': 0.0004810773695032588, 'samples': 3926016, 'steps': 20447, 'loss/train': 1.4672976732254028} 11/07/2021 00:07:20 - INFO - __main__ - Step 20449: {'lr': 0.00048107534416512915, 'samples': 3926208, 'steps': 20448, 'loss/train': 1.3852245807647705} 11/07/2021 00:07:20 - INFO - __main__ - Step 20450: {'lr': 0.00048107331872288055, 'samples': 3926400, 'steps': 20449, 'loss/train': 1.5605368614196777} 11/07/2021 00:07:21 - INFO - __main__ - Step 20451: {'lr': 0.0004810712931765139, 'samples': 3926592, 'steps': 20450, 'loss/train': 1.519484519958496} 11/07/2021 00:07:21 - INFO - __main__ - Step 20452: {'lr': 0.00048106926752603007, 'samples': 3926784, 'steps': 20451, 'loss/train': 1.3224151134490967} 11/07/2021 00:07:22 - INFO - __main__ - Step 20453: {'lr': 0.00048106724177143, 'samples': 3926976, 'steps': 20452, 'loss/train': 1.81765615940094} 11/07/2021 00:07:23 - INFO - __main__ - Step 20454: {'lr': 0.00048106521591271455, 'samples': 3927168, 'steps': 20453, 'loss/train': 1.5886616706848145} 11/07/2021 00:07:23 - INFO - __main__ - Step 20455: {'lr': 0.00048106318994988476, 'samples': 3927360, 'steps': 20454, 'loss/train': 1.4082697629928589} 11/07/2021 00:07:23 - INFO - __main__ - Step 20456: {'lr': 0.0004810611638829414, 'samples': 3927552, 'steps': 20455, 'loss/train': 1.9213775396347046} 11/07/2021 00:07:24 - INFO - __main__ - Step 20457: {'lr': 0.00048105913771188545, 'samples': 3927744, 'steps': 20456, 'loss/train': 1.3209149837493896} 11/07/2021 00:07:24 - INFO - __main__ - Step 20458: {'lr': 0.00048105711143671783, 'samples': 3927936, 'steps': 20457, 'loss/train': 1.615772008895874} 11/07/2021 00:07:25 - INFO - __main__ - Step 20459: {'lr': 0.0004810550850574394, 'samples': 3928128, 'steps': 20458, 'loss/train': 1.7410712242126465} 11/07/2021 00:07:25 - INFO - __main__ - Step 20460: {'lr': 0.0004810530585740512, 'samples': 3928320, 'steps': 20459, 'loss/train': 1.7318370342254639} 11/07/2021 00:07:26 - INFO - __main__ - Step 20461: {'lr': 0.00048105103198655406, 'samples': 3928512, 'steps': 20460, 'loss/train': 1.6753515005111694} 11/07/2021 00:07:26 - INFO - __main__ - Step 20462: {'lr': 0.0004810490052949488, 'samples': 3928704, 'steps': 20461, 'loss/train': 1.2645617723464966} 11/07/2021 00:07:26 - INFO - __main__ - Step 20463: {'lr': 0.0004810469784992365, 'samples': 3928896, 'steps': 20462, 'loss/train': 1.3526378870010376} 11/07/2021 00:07:27 - INFO - __main__ - Step 20464: {'lr': 0.00048104495159941794, 'samples': 3929088, 'steps': 20463, 'loss/train': 1.643991470336914} 11/07/2021 00:07:28 - INFO - __main__ - Step 20465: {'lr': 0.00048104292459549413, 'samples': 3929280, 'steps': 20464, 'loss/train': 1.367877721786499} 11/07/2021 00:07:28 - INFO - __main__ - Step 20466: {'lr': 0.0004810408974874659, 'samples': 3929472, 'steps': 20465, 'loss/train': 1.9926568269729614} 11/07/2021 00:07:28 - INFO - __main__ - Step 20467: {'lr': 0.0004810388702753342, 'samples': 3929664, 'steps': 20466, 'loss/train': 1.3941420316696167} 11/07/2021 00:07:29 - INFO - __main__ - Step 20468: {'lr': 0.0004810368429591, 'samples': 3929856, 'steps': 20467, 'loss/train': 1.4060722589492798} 11/07/2021 00:07:30 - INFO - __main__ - Step 20469: {'lr': 0.00048103481553876415, 'samples': 3930048, 'steps': 20468, 'loss/train': 1.519386887550354} 11/07/2021 00:07:30 - INFO - __main__ - Step 20470: {'lr': 0.0004810327880143276, 'samples': 3930240, 'steps': 20469, 'loss/train': 1.8691269159317017} 11/07/2021 00:07:31 - INFO - __main__ - Step 20471: {'lr': 0.00048103076038579125, 'samples': 3930432, 'steps': 20470, 'loss/train': 1.5998656749725342} 11/07/2021 00:07:31 - INFO - __main__ - Step 20472: {'lr': 0.00048102873265315596, 'samples': 3930624, 'steps': 20471, 'loss/train': 1.6641353368759155} 11/07/2021 00:07:31 - INFO - __main__ - Step 20473: {'lr': 0.0004810267048164227, 'samples': 3930816, 'steps': 20472, 'loss/train': 1.4810760021209717} 11/07/2021 00:07:32 - INFO - __main__ - Step 20474: {'lr': 0.0004810246768755924, 'samples': 3931008, 'steps': 20473, 'loss/train': 1.4060907363891602} 11/07/2021 00:07:33 - INFO - __main__ - Step 20475: {'lr': 0.0004810226488306659, 'samples': 3931200, 'steps': 20474, 'loss/train': 1.388344407081604} 11/07/2021 00:07:33 - INFO - __main__ - Step 20476: {'lr': 0.00048102062068164413, 'samples': 3931392, 'steps': 20475, 'loss/train': 1.8781498670578003} 11/07/2021 00:07:33 - INFO - __main__ - Step 20477: {'lr': 0.0004810185924285281, 'samples': 3931584, 'steps': 20476, 'loss/train': 1.7342883348464966} 11/07/2021 00:07:34 - INFO - __main__ - Step 20478: {'lr': 0.00048101656407131864, 'samples': 3931776, 'steps': 20477, 'loss/train': 1.5581198930740356} 11/07/2021 00:07:34 - INFO - __main__ - Step 20479: {'lr': 0.00048101453561001667, 'samples': 3931968, 'steps': 20478, 'loss/train': 2.0187954902648926} 11/07/2021 00:07:35 - INFO - __main__ - Step 20480: {'lr': 0.00048101250704462315, 'samples': 3932160, 'steps': 20479, 'loss/train': 1.8168493509292603} 11/07/2021 00:07:36 - INFO - __main__ - Step 20481: {'lr': 0.0004810104783751389, 'samples': 3932352, 'steps': 20480, 'loss/train': 0.9651317000389099} 11/07/2021 00:07:36 - INFO - __main__ - Step 20482: {'lr': 0.00048100844960156496, 'samples': 3932544, 'steps': 20481, 'loss/train': 1.6677086353302002} 11/07/2021 00:07:36 - INFO - __main__ - Step 20483: {'lr': 0.0004810064207239021, 'samples': 3932736, 'steps': 20482, 'loss/train': 1.6259307861328125} 11/07/2021 00:07:37 - INFO - __main__ - Step 20484: {'lr': 0.0004810043917421514, 'samples': 3932928, 'steps': 20483, 'loss/train': 1.52870512008667} 11/07/2021 00:07:38 - INFO - __main__ - Step 20485: {'lr': 0.0004810023626563136, 'samples': 3933120, 'steps': 20484, 'loss/train': 2.194108009338379} 11/07/2021 00:07:38 - INFO - __main__ - Step 20486: {'lr': 0.0004810003334663898, 'samples': 3933312, 'steps': 20485, 'loss/train': 1.8448121547698975} 11/07/2021 00:07:38 - INFO - __main__ - Step 20487: {'lr': 0.0004809983041723807, 'samples': 3933504, 'steps': 20486, 'loss/train': 1.5697786808013916} 11/07/2021 00:07:39 - INFO - __main__ - Step 20488: {'lr': 0.00048099627477428744, 'samples': 3933696, 'steps': 20487, 'loss/train': 1.9467196464538574} 11/07/2021 00:07:39 - INFO - __main__ - Step 20489: {'lr': 0.0004809942452721107, 'samples': 3933888, 'steps': 20488, 'loss/train': 1.9454448223114014} 11/07/2021 00:07:40 - INFO - __main__ - Step 20490: {'lr': 0.0004809922156658516, 'samples': 3934080, 'steps': 20489, 'loss/train': 1.799720048904419} 11/07/2021 00:07:40 - INFO - __main__ - Step 20491: {'lr': 0.00048099018595551096, 'samples': 3934272, 'steps': 20490, 'loss/train': 1.9494208097457886} 11/07/2021 00:07:41 - INFO - __main__ - Step 20492: {'lr': 0.0004809881561410897, 'samples': 3934464, 'steps': 20491, 'loss/train': 1.471197485923767} 11/07/2021 00:07:41 - INFO - __main__ - Step 20493: {'lr': 0.00048098612622258873, 'samples': 3934656, 'steps': 20492, 'loss/train': 1.7744096517562866} 11/07/2021 00:07:42 - INFO - __main__ - Step 20494: {'lr': 0.00048098409620000906, 'samples': 3934848, 'steps': 20493, 'loss/train': 1.5430742502212524} 11/07/2021 00:07:43 - INFO - __main__ - Step 20495: {'lr': 0.00048098206607335135, 'samples': 3935040, 'steps': 20494, 'loss/train': 2.459300994873047} 11/07/2021 00:07:43 - INFO - __main__ - Step 20496: {'lr': 0.00048098003584261684, 'samples': 3935232, 'steps': 20495, 'loss/train': 1.3832452297210693} 11/07/2021 00:07:43 - INFO - __main__ - Step 20497: {'lr': 0.00048097800550780625, 'samples': 3935424, 'steps': 20496, 'loss/train': 1.6852710247039795} 11/07/2021 00:07:44 - INFO - __main__ - Step 20498: {'lr': 0.0004809759750689205, 'samples': 3935616, 'steps': 20497, 'loss/train': 1.6425219774246216} 11/07/2021 00:07:44 - INFO - __main__ - Step 20499: {'lr': 0.00048097394452596053, 'samples': 3935808, 'steps': 20498, 'loss/train': 1.1338953971862793} 11/07/2021 00:07:45 - INFO - __main__ - Step 20500: {'lr': 0.0004809719138789273, 'samples': 3936000, 'steps': 20499, 'loss/train': 1.6991946697235107} 11/07/2021 00:07:45 - INFO - __main__ - Step 20501: {'lr': 0.0004809698831278217, 'samples': 3936192, 'steps': 20500, 'loss/train': 1.4078713655471802} 11/07/2021 00:07:46 - INFO - __main__ - Step 20502: {'lr': 0.0004809678522726446, 'samples': 3936384, 'steps': 20501, 'loss/train': 1.7715500593185425} 11/07/2021 00:07:46 - INFO - __main__ - Step 20503: {'lr': 0.000480965821313397, 'samples': 3936576, 'steps': 20502, 'loss/train': 1.9850037097930908} 11/07/2021 00:07:46 - INFO - __main__ - Step 20504: {'lr': 0.0004809637902500797, 'samples': 3936768, 'steps': 20503, 'loss/train': 1.8379428386688232} 11/07/2021 00:07:47 - INFO - __main__ - Step 20505: {'lr': 0.00048096175908269375, 'samples': 3936960, 'steps': 20504, 'loss/train': 1.2110165357589722} 11/07/2021 00:07:48 - INFO - __main__ - Step 20506: {'lr': 0.00048095972781124, 'samples': 3937152, 'steps': 20505, 'loss/train': 1.5643447637557983} 11/07/2021 00:07:48 - INFO - __main__ - Step 20507: {'lr': 0.00048095769643571927, 'samples': 3937344, 'steps': 20506, 'loss/train': 1.2366441488265991} 11/07/2021 00:07:49 - INFO - __main__ - Step 20508: {'lr': 0.0004809556649561326, 'samples': 3937536, 'steps': 20507, 'loss/train': 1.807586669921875} 11/07/2021 00:07:49 - INFO - __main__ - Step 20509: {'lr': 0.0004809536333724809, 'samples': 3937728, 'steps': 20508, 'loss/train': 1.3580818176269531} 11/07/2021 00:07:49 - INFO - __main__ - Step 20510: {'lr': 0.000480951601684765, 'samples': 3937920, 'steps': 20509, 'loss/train': 1.1185882091522217} 11/07/2021 00:07:50 - INFO - __main__ - Step 20511: {'lr': 0.00048094956989298593, 'samples': 3938112, 'steps': 20510, 'loss/train': 0.8402307629585266} 11/07/2021 00:07:51 - INFO - __main__ - Step 20512: {'lr': 0.0004809475379971445, 'samples': 3938304, 'steps': 20511, 'loss/train': 1.676823377609253} 11/07/2021 00:07:51 - INFO - __main__ - Step 20513: {'lr': 0.00048094550599724176, 'samples': 3938496, 'steps': 20512, 'loss/train': 1.8695015907287598} 11/07/2021 00:07:52 - INFO - __main__ - Step 20514: {'lr': 0.0004809434738932785, 'samples': 3938688, 'steps': 20513, 'loss/train': 2.3277268409729004} 11/07/2021 00:07:52 - INFO - __main__ - Step 20515: {'lr': 0.0004809414416852557, 'samples': 3938880, 'steps': 20514, 'loss/train': 1.4669691324234009} 11/07/2021 00:07:53 - INFO - __main__ - Step 20516: {'lr': 0.00048093940937317414, 'samples': 3939072, 'steps': 20515, 'loss/train': 0.4304567873477936} 11/07/2021 00:07:53 - INFO - __main__ - Step 20517: {'lr': 0.00048093737695703494, 'samples': 3939264, 'steps': 20516, 'loss/train': 1.61251962184906} 11/07/2021 00:07:54 - INFO - __main__ - Step 20518: {'lr': 0.0004809353444368389, 'samples': 3939456, 'steps': 20517, 'loss/train': 1.7847530841827393} 11/07/2021 00:07:54 - INFO - __main__ - Step 20519: {'lr': 0.00048093331181258694, 'samples': 3939648, 'steps': 20518, 'loss/train': 1.8373972177505493} 11/07/2021 00:07:54 - INFO - __main__ - Step 20520: {'lr': 0.00048093127908428, 'samples': 3939840, 'steps': 20519, 'loss/train': 1.905431866645813} 11/07/2021 00:07:55 - INFO - __main__ - Step 20521: {'lr': 0.00048092924625191903, 'samples': 3940032, 'steps': 20520, 'loss/train': 1.9084038734436035} 11/07/2021 00:07:56 - INFO - __main__ - Step 20522: {'lr': 0.0004809272133155048, 'samples': 3940224, 'steps': 20521, 'loss/train': 1.740844964981079} 11/07/2021 00:07:56 - INFO - __main__ - Step 20523: {'lr': 0.00048092518027503844, 'samples': 3940416, 'steps': 20522, 'loss/train': 0.3389599025249481} 11/07/2021 00:07:56 - INFO - __main__ - Step 20524: {'lr': 0.0004809231471305208, 'samples': 3940608, 'steps': 20523, 'loss/train': 1.7005736827850342} 11/07/2021 00:07:57 - INFO - __main__ - Step 20525: {'lr': 0.0004809211138819526, 'samples': 3940800, 'steps': 20524, 'loss/train': 1.737630009651184} 11/07/2021 00:07:57 - INFO - __main__ - Step 20526: {'lr': 0.000480919080529335, 'samples': 3940992, 'steps': 20525, 'loss/train': 1.6942681074142456} 11/07/2021 00:07:58 - INFO - __main__ - Step 20527: {'lr': 0.0004809170470726688, 'samples': 3941184, 'steps': 20526, 'loss/train': 1.7566242218017578} 11/07/2021 00:07:58 - INFO - __main__ - Step 20528: {'lr': 0.00048091501351195495, 'samples': 3941376, 'steps': 20527, 'loss/train': 1.3537952899932861} 11/07/2021 00:07:59 - INFO - __main__ - Step 20529: {'lr': 0.00048091297984719433, 'samples': 3941568, 'steps': 20528, 'loss/train': 1.87349271774292} 11/07/2021 00:07:59 - INFO - __main__ - Step 20530: {'lr': 0.0004809109460783879, 'samples': 3941760, 'steps': 20529, 'loss/train': 1.070335030555725} 11/07/2021 00:07:59 - INFO - __main__ - Step 20531: {'lr': 0.0004809089122055366, 'samples': 3941952, 'steps': 20530, 'loss/train': 1.6043052673339844} 11/07/2021 00:08:00 - INFO - __main__ - Step 20532: {'lr': 0.00048090687822864125, 'samples': 3942144, 'steps': 20531, 'loss/train': 1.5132311582565308} 11/07/2021 00:08:01 - INFO - __main__ - Step 20533: {'lr': 0.00048090484414770284, 'samples': 3942336, 'steps': 20532, 'loss/train': 1.4935758113861084} 11/07/2021 00:08:01 - INFO - __main__ - Step 20534: {'lr': 0.00048090280996272234, 'samples': 3942528, 'steps': 20533, 'loss/train': 0.8592392802238464} 11/07/2021 00:08:01 - INFO - __main__ - Step 20535: {'lr': 0.0004809007756737005, 'samples': 3942720, 'steps': 20534, 'loss/train': 1.2228641510009766} 11/07/2021 00:08:02 - INFO - __main__ - Step 20536: {'lr': 0.0004808987412806384, 'samples': 3942912, 'steps': 20535, 'loss/train': 1.644826889038086} 11/07/2021 00:08:03 - INFO - __main__ - Step 20537: {'lr': 0.0004808967067835369, 'samples': 3943104, 'steps': 20536, 'loss/train': 1.395026683807373} 11/07/2021 00:08:03 - INFO - __main__ - Step 20538: {'lr': 0.00048089467218239687, 'samples': 3943296, 'steps': 20537, 'loss/train': 1.3494728803634644} 11/07/2021 00:08:04 - INFO - __main__ - Step 20539: {'lr': 0.00048089263747721925, 'samples': 3943488, 'steps': 20538, 'loss/train': 1.3986570835113525} 11/07/2021 00:08:04 - INFO - __main__ - Step 20540: {'lr': 0.000480890602668005, 'samples': 3943680, 'steps': 20539, 'loss/train': 1.3758116960525513} 11/07/2021 00:08:04 - INFO - __main__ - Step 20541: {'lr': 0.000480888567754755, 'samples': 3943872, 'steps': 20540, 'loss/train': 2.1255879402160645} 11/07/2021 00:08:05 - INFO - __main__ - Step 20542: {'lr': 0.0004808865327374701, 'samples': 3944064, 'steps': 20541, 'loss/train': 1.351326584815979} 11/07/2021 00:08:06 - INFO - __main__ - Step 20543: {'lr': 0.0004808844976161514, 'samples': 3944256, 'steps': 20542, 'loss/train': 1.5466766357421875} 11/07/2021 00:08:06 - INFO - __main__ - Step 20544: {'lr': 0.0004808824623907997, 'samples': 3944448, 'steps': 20543, 'loss/train': 0.4308020770549774} 11/07/2021 00:08:07 - INFO - __main__ - Step 20545: {'lr': 0.0004808804270614159, 'samples': 3944640, 'steps': 20544, 'loss/train': 1.7366575002670288} 11/07/2021 00:08:07 - INFO - __main__ - Step 20546: {'lr': 0.0004808783916280008, 'samples': 3944832, 'steps': 20545, 'loss/train': 1.2901078462600708} 11/07/2021 00:08:08 - INFO - __main__ - Step 20547: {'lr': 0.0004808763560905557, 'samples': 3945024, 'steps': 20546, 'loss/train': 1.660652995109558} 11/07/2021 00:08:08 - INFO - __main__ - Step 20548: {'lr': 0.0004808743204490811, 'samples': 3945216, 'steps': 20547, 'loss/train': 1.7535828351974487} 11/07/2021 00:08:09 - INFO - __main__ - Step 20549: {'lr': 0.00048087228470357823, 'samples': 3945408, 'steps': 20548, 'loss/train': 1.7939064502716064} 11/07/2021 00:08:09 - INFO - __main__ - Step 20550: {'lr': 0.00048087024885404777, 'samples': 3945600, 'steps': 20549, 'loss/train': 2.644047975540161} 11/07/2021 00:08:09 - INFO - __main__ - Step 20551: {'lr': 0.00048086821290049077, 'samples': 3945792, 'steps': 20550, 'loss/train': 0.19849666953086853} 11/07/2021 00:08:10 - INFO - __main__ - Step 20552: {'lr': 0.00048086617684290814, 'samples': 3945984, 'steps': 20551, 'loss/train': 1.6721930503845215} 11/07/2021 00:08:11 - INFO - __main__ - Step 20553: {'lr': 0.00048086414068130077, 'samples': 3946176, 'steps': 20552, 'loss/train': 1.5216528177261353} 11/07/2021 00:08:11 - INFO - __main__ - Step 20554: {'lr': 0.00048086210441566956, 'samples': 3946368, 'steps': 20553, 'loss/train': 1.0089731216430664} 11/07/2021 00:08:11 - INFO - __main__ - Step 20555: {'lr': 0.00048086006804601544, 'samples': 3946560, 'steps': 20554, 'loss/train': 1.431415319442749} 11/07/2021 00:08:12 - INFO - __main__ - Step 20556: {'lr': 0.00048085803157233933, 'samples': 3946752, 'steps': 20555, 'loss/train': 1.5693162679672241} 11/07/2021 00:08:12 - INFO - __main__ - Step 20557: {'lr': 0.00048085599499464216, 'samples': 3946944, 'steps': 20556, 'loss/train': 1.8088922500610352} 11/07/2021 00:08:13 - INFO - __main__ - Step 20558: {'lr': 0.0004808539583129249, 'samples': 3947136, 'steps': 20557, 'loss/train': 1.1080645322799683} 11/07/2021 00:08:14 - INFO - __main__ - Step 20559: {'lr': 0.0004808519215271884, 'samples': 3947328, 'steps': 20558, 'loss/train': 1.701973795890808} 11/07/2021 00:08:14 - INFO - __main__ - Step 20560: {'lr': 0.0004808498846374335, 'samples': 3947520, 'steps': 20559, 'loss/train': 1.7977246046066284} 11/07/2021 00:08:14 - INFO - __main__ - Step 20561: {'lr': 0.0004808478476436612, 'samples': 3947712, 'steps': 20560, 'loss/train': 0.45990151166915894} 11/07/2021 00:08:15 - INFO - __main__ - Step 20562: {'lr': 0.00048084581054587253, 'samples': 3947904, 'steps': 20561, 'loss/train': 1.6324790716171265} 11/07/2021 00:08:16 - INFO - __main__ - Step 20563: {'lr': 0.0004808437733440682, 'samples': 3948096, 'steps': 20562, 'loss/train': 1.7324868440628052} 11/07/2021 00:08:16 - INFO - __main__ - Step 20564: {'lr': 0.0004808417360382493, 'samples': 3948288, 'steps': 20563, 'loss/train': 1.4390432834625244} 11/07/2021 00:08:17 - INFO - __main__ - Step 20565: {'lr': 0.00048083969862841667, 'samples': 3948480, 'steps': 20564, 'loss/train': 1.9008427858352661} 11/07/2021 00:08:17 - INFO - __main__ - Step 20566: {'lr': 0.00048083766111457115, 'samples': 3948672, 'steps': 20565, 'loss/train': 1.5727438926696777} 11/07/2021 00:08:17 - INFO - __main__ - Step 20567: {'lr': 0.0004808356234967138, 'samples': 3948864, 'steps': 20566, 'loss/train': 1.1518347263336182} 11/07/2021 00:08:18 - INFO - __main__ - Step 20568: {'lr': 0.00048083358577484547, 'samples': 3949056, 'steps': 20567, 'loss/train': 0.17537906765937805} 11/07/2021 00:08:19 - INFO - __main__ - Step 20569: {'lr': 0.0004808315479489671, 'samples': 3949248, 'steps': 20568, 'loss/train': 1.7102299928665161} 11/07/2021 00:08:19 - INFO - __main__ - Step 20570: {'lr': 0.00048082951001907965, 'samples': 3949440, 'steps': 20569, 'loss/train': 1.6897586584091187} 11/07/2021 00:08:20 - INFO - __main__ - Step 20571: {'lr': 0.0004808274719851839, 'samples': 3949632, 'steps': 20570, 'loss/train': 1.6184511184692383} 11/07/2021 00:08:20 - INFO - __main__ - Step 20572: {'lr': 0.0004808254338472809, 'samples': 3949824, 'steps': 20571, 'loss/train': 1.9622421264648438} 11/07/2021 00:08:20 - INFO - __main__ - Step 20573: {'lr': 0.00048082339560537145, 'samples': 3950016, 'steps': 20572, 'loss/train': 1.6746459007263184} 11/07/2021 00:08:21 - INFO - __main__ - Step 20574: {'lr': 0.00048082135725945665, 'samples': 3950208, 'steps': 20573, 'loss/train': 2.0031142234802246} 11/07/2021 00:08:22 - INFO - __main__ - Step 20575: {'lr': 0.0004808193188095372, 'samples': 3950400, 'steps': 20574, 'loss/train': 1.841524362564087} 11/07/2021 00:08:22 - INFO - __main__ - Step 20576: {'lr': 0.0004808172802556142, 'samples': 3950592, 'steps': 20575, 'loss/train': 0.7797104716300964} 11/07/2021 00:08:22 - INFO - __main__ - Step 20577: {'lr': 0.0004808152415976885, 'samples': 3950784, 'steps': 20576, 'loss/train': 1.443601369857788} 11/07/2021 00:08:23 - INFO - __main__ - Step 20578: {'lr': 0.000480813202835761, 'samples': 3950976, 'steps': 20577, 'loss/train': 1.2981877326965332} 11/07/2021 00:08:24 - INFO - __main__ - Step 20579: {'lr': 0.0004808111639698326, 'samples': 3951168, 'steps': 20578, 'loss/train': 1.9003407955169678} 11/07/2021 00:08:24 - INFO - __main__ - Step 20580: {'lr': 0.0004808091249999043, 'samples': 3951360, 'steps': 20579, 'loss/train': 2.1749467849731445} 11/07/2021 00:08:24 - INFO - __main__ - Step 20581: {'lr': 0.0004808070859259769, 'samples': 3951552, 'steps': 20580, 'loss/train': 1.2194863557815552} 11/07/2021 00:08:25 - INFO - __main__ - Step 20582: {'lr': 0.0004808050467480515, 'samples': 3951744, 'steps': 20581, 'loss/train': 1.2447943687438965} 11/07/2021 00:08:25 - INFO - __main__ - Step 20583: {'lr': 0.0004808030074661288, 'samples': 3951936, 'steps': 20582, 'loss/train': 1.7001240253448486} 11/07/2021 00:08:26 - INFO - __main__ - Step 20584: {'lr': 0.0004808009680802099, 'samples': 3952128, 'steps': 20583, 'loss/train': 1.4224902391433716} 11/07/2021 00:08:26 - INFO - __main__ - Step 20585: {'lr': 0.00048079892859029564, 'samples': 3952320, 'steps': 20584, 'loss/train': 1.3647888898849487} 11/07/2021 00:08:27 - INFO - __main__ - Step 20586: {'lr': 0.00048079688899638684, 'samples': 3952512, 'steps': 20585, 'loss/train': 1.5216392278671265} 11/07/2021 00:08:27 - INFO - __main__ - Step 20587: {'lr': 0.0004807948492984846, 'samples': 3952704, 'steps': 20586, 'loss/train': 2.091362476348877} 11/07/2021 00:08:27 - INFO - __main__ - Step 20588: {'lr': 0.0004807928094965898, 'samples': 3952896, 'steps': 20587, 'loss/train': 1.597800612449646} 11/07/2021 00:08:28 - INFO - __main__ - Step 20589: {'lr': 0.0004807907695907032, 'samples': 3953088, 'steps': 20588, 'loss/train': 1.6435033082962036} 11/07/2021 00:08:29 - INFO - __main__ - Step 20590: {'lr': 0.000480788729580826, 'samples': 3953280, 'steps': 20589, 'loss/train': 1.4887282848358154} 11/07/2021 00:08:29 - INFO - __main__ - Step 20591: {'lr': 0.00048078668946695887, 'samples': 3953472, 'steps': 20590, 'loss/train': 1.240483283996582} 11/07/2021 00:08:30 - INFO - __main__ - Step 20592: {'lr': 0.0004807846492491028, 'samples': 3953664, 'steps': 20591, 'loss/train': 1.6103730201721191} 11/07/2021 00:08:30 - INFO - __main__ - Step 20593: {'lr': 0.0004807826089272588, 'samples': 3953856, 'steps': 20592, 'loss/train': 1.8579837083816528} 11/07/2021 00:08:31 - INFO - __main__ - Step 20594: {'lr': 0.0004807805685014277, 'samples': 3954048, 'steps': 20593, 'loss/train': 1.8013089895248413} 11/07/2021 00:08:31 - INFO - __main__ - Step 20595: {'lr': 0.00048077852797161034, 'samples': 3954240, 'steps': 20594, 'loss/train': 1.3747129440307617} 11/07/2021 00:08:32 - INFO - __main__ - Step 20596: {'lr': 0.0004807764873378079, 'samples': 3954432, 'steps': 20595, 'loss/train': 1.2700172662734985} 11/07/2021 00:08:32 - INFO - __main__ - Step 20597: {'lr': 0.000480774446600021, 'samples': 3954624, 'steps': 20596, 'loss/train': 1.3797485828399658} 11/07/2021 00:08:32 - INFO - __main__ - Step 20598: {'lr': 0.00048077240575825075, 'samples': 3954816, 'steps': 20597, 'loss/train': 2.085232973098755} 11/07/2021 00:08:33 - INFO - __main__ - Step 20599: {'lr': 0.000480770364812498, 'samples': 3955008, 'steps': 20598, 'loss/train': 1.6940776109695435} 11/07/2021 00:08:34 - INFO - __main__ - Step 20600: {'lr': 0.0004807683237627637, 'samples': 3955200, 'steps': 20599, 'loss/train': 1.5124833583831787} 11/07/2021 00:08:34 - INFO - __main__ - Step 20601: {'lr': 0.0004807662826090488, 'samples': 3955392, 'steps': 20600, 'loss/train': 1.4219224452972412} 11/07/2021 00:08:34 - INFO - __main__ - Step 20602: {'lr': 0.00048076424135135406, 'samples': 3955584, 'steps': 20601, 'loss/train': 1.5715713500976562} 11/07/2021 00:08:35 - INFO - __main__ - Step 20603: {'lr': 0.00048076219998968055, 'samples': 3955776, 'steps': 20602, 'loss/train': 2.141200304031372} 11/07/2021 00:08:35 - INFO - __main__ - Step 20604: {'lr': 0.0004807601585240292, 'samples': 3955968, 'steps': 20603, 'loss/train': 1.7138773202896118} 11/07/2021 00:08:36 - INFO - __main__ - Step 20605: {'lr': 0.0004807581169544009, 'samples': 3956160, 'steps': 20604, 'loss/train': 1.7838529348373413} 11/07/2021 00:08:37 - INFO - __main__ - Step 20606: {'lr': 0.00048075607528079645, 'samples': 3956352, 'steps': 20605, 'loss/train': 1.4636403322219849} 11/07/2021 00:08:37 - INFO - __main__ - Step 20607: {'lr': 0.0004807540335032169, 'samples': 3956544, 'steps': 20606, 'loss/train': 1.51893949508667} 11/07/2021 00:08:37 - INFO - __main__ - Step 20608: {'lr': 0.0004807519916216633, 'samples': 3956736, 'steps': 20607, 'loss/train': 1.2464053630828857} 11/07/2021 00:08:38 - INFO - __main__ - Step 20609: {'lr': 0.0004807499496361362, 'samples': 3956928, 'steps': 20608, 'loss/train': 1.454918384552002} 11/07/2021 00:08:39 - INFO - __main__ - Step 20610: {'lr': 0.00048074790754663686, 'samples': 3957120, 'steps': 20609, 'loss/train': 1.3798999786376953} 11/07/2021 00:08:39 - INFO - __main__ - Step 20611: {'lr': 0.000480745865353166, 'samples': 3957312, 'steps': 20610, 'loss/train': 1.5794548988342285} 11/07/2021 00:08:39 - INFO - __main__ - Step 20612: {'lr': 0.0004807438230557247, 'samples': 3957504, 'steps': 20611, 'loss/train': 2.0480737686157227} 11/07/2021 00:08:40 - INFO - __main__ - Step 20613: {'lr': 0.00048074178065431373, 'samples': 3957696, 'steps': 20612, 'loss/train': 0.7402926087379456} 11/07/2021 00:08:40 - INFO - __main__ - Step 20614: {'lr': 0.0004807397381489341, 'samples': 3957888, 'steps': 20613, 'loss/train': 1.6169627904891968} 11/07/2021 00:08:41 - INFO - __main__ - Step 20615: {'lr': 0.00048073769553958666, 'samples': 3958080, 'steps': 20614, 'loss/train': 1.4762638807296753} 11/07/2021 00:08:42 - INFO - __main__ - Step 20616: {'lr': 0.00048073565282627246, 'samples': 3958272, 'steps': 20615, 'loss/train': 1.4194600582122803} 11/07/2021 00:08:42 - INFO - __main__ - Step 20617: {'lr': 0.0004807336100089923, 'samples': 3958464, 'steps': 20616, 'loss/train': 1.2621079683303833} 11/07/2021 00:08:42 - INFO - __main__ - Step 20618: {'lr': 0.0004807315670877471, 'samples': 3958656, 'steps': 20617, 'loss/train': 1.8260729312896729} 11/07/2021 00:08:43 - INFO - __main__ - Step 20619: {'lr': 0.00048072952406253783, 'samples': 3958848, 'steps': 20618, 'loss/train': 1.0546283721923828} 11/07/2021 00:08:44 - INFO - __main__ - Step 20620: {'lr': 0.00048072748093336536, 'samples': 3959040, 'steps': 20619, 'loss/train': 1.9476903676986694} 11/07/2021 00:08:44 - INFO - __main__ - Step 20621: {'lr': 0.00048072543770023076, 'samples': 3959232, 'steps': 20620, 'loss/train': 1.710111141204834} 11/07/2021 00:08:44 - INFO - __main__ - Step 20622: {'lr': 0.0004807233943631347, 'samples': 3959424, 'steps': 20621, 'loss/train': 1.7692378759384155} 11/07/2021 00:08:45 - INFO - __main__ - Step 20623: {'lr': 0.0004807213509220784, 'samples': 3959616, 'steps': 20622, 'loss/train': 1.9633917808532715} 11/07/2021 00:08:45 - INFO - __main__ - Step 20624: {'lr': 0.0004807193073770625, 'samples': 3959808, 'steps': 20623, 'loss/train': 1.632491111755371} 11/07/2021 00:08:46 - INFO - __main__ - Step 20625: {'lr': 0.0004807172637280881, 'samples': 3960000, 'steps': 20624, 'loss/train': 1.7137901782989502} 11/07/2021 00:08:46 - INFO - __main__ - Step 20626: {'lr': 0.000480715219975156, 'samples': 3960192, 'steps': 20625, 'loss/train': 1.7485941648483276} 11/07/2021 00:08:47 - INFO - __main__ - Step 20627: {'lr': 0.0004807131761182672, 'samples': 3960384, 'steps': 20626, 'loss/train': 1.6671442985534668} 11/07/2021 00:08:47 - INFO - __main__ - Step 20628: {'lr': 0.00048071113215742263, 'samples': 3960576, 'steps': 20627, 'loss/train': 1.8338042497634888} 11/07/2021 00:08:47 - INFO - __main__ - Step 20629: {'lr': 0.00048070908809262316, 'samples': 3960768, 'steps': 20628, 'loss/train': 1.5052207708358765} 11/07/2021 00:08:48 - INFO - __main__ - Step 20630: {'lr': 0.0004807070439238698, 'samples': 3960960, 'steps': 20629, 'loss/train': 1.7445805072784424} 11/07/2021 00:08:49 - INFO - __main__ - Step 20631: {'lr': 0.0004807049996511633, 'samples': 3961152, 'steps': 20630, 'loss/train': 1.856859564781189} 11/07/2021 00:08:49 - INFO - __main__ - Step 20632: {'lr': 0.00048070295527450474, 'samples': 3961344, 'steps': 20631, 'loss/train': 1.6593570709228516} 11/07/2021 00:08:50 - INFO - __main__ - Step 20633: {'lr': 0.000480700910793895, 'samples': 3961536, 'steps': 20632, 'loss/train': 1.9622411727905273} 11/07/2021 00:08:50 - INFO - __main__ - Step 20634: {'lr': 0.000480698866209335, 'samples': 3961728, 'steps': 20633, 'loss/train': 1.853137731552124} 11/07/2021 00:08:50 - INFO - __main__ - Step 20635: {'lr': 0.0004806968215208256, 'samples': 3961920, 'steps': 20634, 'loss/train': 1.5925242900848389} 11/07/2021 00:08:51 - INFO - __main__ - Step 20636: {'lr': 0.0004806947767283678, 'samples': 3962112, 'steps': 20635, 'loss/train': 1.4025685787200928} 11/07/2021 00:08:52 - INFO - __main__ - Step 20637: {'lr': 0.0004806927318319625, 'samples': 3962304, 'steps': 20636, 'loss/train': 1.8534218072891235} 11/07/2021 00:08:52 - INFO - __main__ - Step 20638: {'lr': 0.0004806906868316106, 'samples': 3962496, 'steps': 20637, 'loss/train': 2.070551633834839} 11/07/2021 00:08:52 - INFO - __main__ - Step 20639: {'lr': 0.000480688641727313, 'samples': 3962688, 'steps': 20638, 'loss/train': 1.5866397619247437} 11/07/2021 00:08:53 - INFO - __main__ - Step 20640: {'lr': 0.00048068659651907076, 'samples': 3962880, 'steps': 20639, 'loss/train': 1.4923686981201172} 11/07/2021 00:08:54 - INFO - __main__ - Step 20641: {'lr': 0.0004806845512068846, 'samples': 3963072, 'steps': 20640, 'loss/train': 1.3124408721923828} 11/07/2021 00:08:55 - INFO - __main__ - Step 20642: {'lr': 0.00048068250579075554, 'samples': 3963264, 'steps': 20641, 'loss/train': 1.4300978183746338} 11/07/2021 00:08:55 - INFO - __main__ - Step 20643: {'lr': 0.00048068046027068456, 'samples': 3963456, 'steps': 20642, 'loss/train': 0.792670726776123} 11/07/2021 00:08:55 - INFO - __main__ - Step 20644: {'lr': 0.0004806784146466726, 'samples': 3963648, 'steps': 20643, 'loss/train': 1.530687928199768} 11/07/2021 00:08:56 - INFO - __main__ - Step 20645: {'lr': 0.00048067636891872036, 'samples': 3963840, 'steps': 20644, 'loss/train': 1.8204725980758667} 11/07/2021 00:08:56 - INFO - __main__ - Step 20646: {'lr': 0.00048067432308682894, 'samples': 3964032, 'steps': 20645, 'loss/train': 1.7493035793304443} 11/07/2021 00:08:58 - INFO - __main__ - Step 20647: {'lr': 0.0004806722771509993, 'samples': 3964224, 'steps': 20646, 'loss/train': 1.4086054563522339} 11/07/2021 00:08:58 - INFO - __main__ - Step 20648: {'lr': 0.0004806702311112322, 'samples': 3964416, 'steps': 20647, 'loss/train': 1.8562865257263184} 11/07/2021 00:08:58 - INFO - __main__ - Step 20649: {'lr': 0.0004806681849675287, 'samples': 3964608, 'steps': 20648, 'loss/train': 0.9947164058685303} 11/07/2021 00:08:59 - INFO - __main__ - Step 20650: {'lr': 0.00048066613871988967, 'samples': 3964800, 'steps': 20649, 'loss/train': 0.8895652890205383} 11/07/2021 00:08:59 - INFO - __main__ - Step 20651: {'lr': 0.00048066409236831607, 'samples': 3964992, 'steps': 20650, 'loss/train': 1.5003318786621094} 11/07/2021 00:08:59 - INFO - __main__ - Step 20652: {'lr': 0.0004806620459128087, 'samples': 3965184, 'steps': 20651, 'loss/train': 2.3419785499572754} 11/07/2021 00:09:00 - INFO - __main__ - Step 20653: {'lr': 0.0004806599993533687, 'samples': 3965376, 'steps': 20652, 'loss/train': 1.5649617910385132} 11/07/2021 00:09:01 - INFO - __main__ - Step 20654: {'lr': 0.00048065795268999677, 'samples': 3965568, 'steps': 20653, 'loss/train': 1.3647540807724} 11/07/2021 00:09:01 - INFO - __main__ - Step 20655: {'lr': 0.00048065590592269393, 'samples': 3965760, 'steps': 20654, 'loss/train': 1.6221338510513306} 11/07/2021 00:09:01 - INFO - __main__ - Step 20656: {'lr': 0.00048065385905146114, 'samples': 3965952, 'steps': 20655, 'loss/train': 1.5131564140319824} 11/07/2021 00:09:02 - INFO - __main__ - Step 20657: {'lr': 0.0004806518120762993, 'samples': 3966144, 'steps': 20656, 'loss/train': 1.148862361907959} 11/07/2021 00:09:02 - INFO - __main__ - Step 20658: {'lr': 0.00048064976499720923, 'samples': 3966336, 'steps': 20657, 'loss/train': 1.2469278573989868} 11/07/2021 00:09:03 - INFO - __main__ - Step 20659: {'lr': 0.000480647717814192, 'samples': 3966528, 'steps': 20658, 'loss/train': 1.5720820426940918} 11/07/2021 00:09:03 - INFO - __main__ - Step 20660: {'lr': 0.0004806456705272484, 'samples': 3966720, 'steps': 20659, 'loss/train': 1.0221384763717651} 11/07/2021 00:09:04 - INFO - __main__ - Step 20661: {'lr': 0.0004806436231363795, 'samples': 3966912, 'steps': 20660, 'loss/train': 1.6063361167907715} 11/07/2021 00:09:04 - INFO - __main__ - Step 20662: {'lr': 0.00048064157564158607, 'samples': 3967104, 'steps': 20661, 'loss/train': 1.519577980041504} 11/07/2021 00:09:04 - INFO - __main__ - Step 20663: {'lr': 0.00048063952804286913, 'samples': 3967296, 'steps': 20662, 'loss/train': 1.8714137077331543} 11/07/2021 00:09:06 - INFO - __main__ - Step 20664: {'lr': 0.0004806374803402296, 'samples': 3967488, 'steps': 20663, 'loss/train': 1.5183724164962769} 11/07/2021 00:09:06 - INFO - __main__ - Step 20665: {'lr': 0.00048063543253366837, 'samples': 3967680, 'steps': 20664, 'loss/train': 0.3621142506599426} 11/07/2021 00:09:06 - INFO - __main__ - Step 20666: {'lr': 0.0004806333846231864, 'samples': 3967872, 'steps': 20665, 'loss/train': 1.2749292850494385} 11/07/2021 00:09:07 - INFO - __main__ - Step 20667: {'lr': 0.00048063133660878455, 'samples': 3968064, 'steps': 20666, 'loss/train': 2.1483118534088135} 11/07/2021 00:09:07 - INFO - __main__ - Step 20668: {'lr': 0.00048062928849046377, 'samples': 3968256, 'steps': 20667, 'loss/train': 1.5145294666290283} 11/07/2021 00:09:08 - INFO - __main__ - Step 20669: {'lr': 0.00048062724026822504, 'samples': 3968448, 'steps': 20668, 'loss/train': 1.9260640144348145} 11/07/2021 00:09:09 - INFO - __main__ - Step 20670: {'lr': 0.00048062519194206916, 'samples': 3968640, 'steps': 20669, 'loss/train': 1.9533888101577759} 11/07/2021 00:09:09 - INFO - __main__ - Step 20671: {'lr': 0.0004806231435119972, 'samples': 3968832, 'steps': 20670, 'loss/train': 1.6754319667816162} 11/07/2021 00:09:09 - INFO - __main__ - Step 20672: {'lr': 0.00048062109497800997, 'samples': 3969024, 'steps': 20671, 'loss/train': 1.9477202892303467} 11/07/2021 00:09:10 - INFO - __main__ - Step 20673: {'lr': 0.00048061904634010845, 'samples': 3969216, 'steps': 20672, 'loss/train': 1.5543469190597534} 11/07/2021 00:09:11 - INFO - __main__ - Step 20674: {'lr': 0.0004806169975982935, 'samples': 3969408, 'steps': 20673, 'loss/train': 1.644513726234436} 11/07/2021 00:09:11 - INFO - __main__ - Step 20675: {'lr': 0.0004806149487525662, 'samples': 3969600, 'steps': 20674, 'loss/train': 2.085134983062744} 11/07/2021 00:09:11 - INFO - __main__ - Step 20676: {'lr': 0.0004806128998029272, 'samples': 3969792, 'steps': 20675, 'loss/train': 1.3123760223388672} 11/07/2021 00:09:12 - INFO - __main__ - Step 20677: {'lr': 0.0004806108507493777, 'samples': 3969984, 'steps': 20676, 'loss/train': 2.046645402908325} 11/07/2021 00:09:12 - INFO - __main__ - Step 20678: {'lr': 0.0004806088015919185, 'samples': 3970176, 'steps': 20677, 'loss/train': 1.390297293663025} 11/07/2021 00:09:13 - INFO - __main__ - Step 20679: {'lr': 0.0004806067523305505, 'samples': 3970368, 'steps': 20678, 'loss/train': 1.6803712844848633} 11/07/2021 00:09:13 - INFO - __main__ - Step 20680: {'lr': 0.0004806047029652747, 'samples': 3970560, 'steps': 20679, 'loss/train': 1.983790636062622} 11/07/2021 00:09:14 - INFO - __main__ - Step 20681: {'lr': 0.00048060265349609193, 'samples': 3970752, 'steps': 20680, 'loss/train': 2.1428070068359375} 11/07/2021 00:09:14 - INFO - __main__ - Step 20682: {'lr': 0.0004806006039230032, 'samples': 3970944, 'steps': 20681, 'loss/train': 1.3993123769760132} 11/07/2021 00:09:14 - INFO - __main__ - Step 20683: {'lr': 0.0004805985542460094, 'samples': 3971136, 'steps': 20682, 'loss/train': 1.8802090883255005} 11/07/2021 00:09:15 - INFO - __main__ - Step 20684: {'lr': 0.00048059650446511136, 'samples': 3971328, 'steps': 20683, 'loss/train': 1.7126437425613403} 11/07/2021 00:09:16 - INFO - __main__ - Step 20685: {'lr': 0.00048059445458031023, 'samples': 3971520, 'steps': 20684, 'loss/train': 1.373525619506836} 11/07/2021 00:09:16 - INFO - __main__ - Step 20686: {'lr': 0.0004805924045916067, 'samples': 3971712, 'steps': 20685, 'loss/train': 1.4744396209716797} 11/07/2021 00:09:17 - INFO - __main__ - Step 20687: {'lr': 0.00048059035449900185, 'samples': 3971904, 'steps': 20686, 'loss/train': 1.5865269899368286} 11/07/2021 00:09:17 - INFO - __main__ - Step 20688: {'lr': 0.0004805883043024965, 'samples': 3972096, 'steps': 20687, 'loss/train': 1.5301727056503296} 11/07/2021 00:09:17 - INFO - __main__ - Step 20689: {'lr': 0.0004805862540020917, 'samples': 3972288, 'steps': 20688, 'loss/train': 1.0273187160491943} 11/07/2021 00:09:18 - INFO - __main__ - Step 20690: {'lr': 0.0004805842035977882, 'samples': 3972480, 'steps': 20689, 'loss/train': 2.583864212036133} 11/07/2021 00:09:19 - INFO - __main__ - Step 20691: {'lr': 0.00048058215308958703, 'samples': 3972672, 'steps': 20690, 'loss/train': 1.1605116128921509} 11/07/2021 00:09:19 - INFO - __main__ - Step 20692: {'lr': 0.00048058010247748904, 'samples': 3972864, 'steps': 20691, 'loss/train': 1.5896633863449097} 11/07/2021 00:09:19 - INFO - __main__ - Step 20693: {'lr': 0.0004805780517614954, 'samples': 3973056, 'steps': 20692, 'loss/train': 1.8313792943954468} 11/07/2021 00:09:20 - INFO - __main__ - Step 20694: {'lr': 0.0004805760009416067, 'samples': 3973248, 'steps': 20693, 'loss/train': 1.637069582939148} 11/07/2021 00:09:21 - INFO - __main__ - Step 20695: {'lr': 0.000480573950017824, 'samples': 3973440, 'steps': 20694, 'loss/train': 1.3542802333831787} 11/07/2021 00:09:21 - INFO - __main__ - Step 20696: {'lr': 0.0004805718989901483, 'samples': 3973632, 'steps': 20695, 'loss/train': 1.9154294729232788} 11/07/2021 00:09:21 - INFO - __main__ - Step 20697: {'lr': 0.00048056984785858046, 'samples': 3973824, 'steps': 20696, 'loss/train': 1.9138669967651367} 11/07/2021 00:09:22 - INFO - __main__ - Step 20698: {'lr': 0.0004805677966231214, 'samples': 3974016, 'steps': 20697, 'loss/train': 1.203387975692749} 11/07/2021 00:09:22 - INFO - __main__ - Step 20699: {'lr': 0.00048056574528377205, 'samples': 3974208, 'steps': 20698, 'loss/train': 1.5315396785736084} 11/07/2021 00:09:23 - INFO - __main__ - Step 20700: {'lr': 0.00048056369384053335, 'samples': 3974400, 'steps': 20699, 'loss/train': 1.4767616987228394} 11/07/2021 00:09:24 - INFO - __main__ - Step 20701: {'lr': 0.00048056164229340613, 'samples': 3974592, 'steps': 20700, 'loss/train': 1.3749315738677979} 11/07/2021 00:09:24 - INFO - __main__ - Step 20702: {'lr': 0.0004805595906423914, 'samples': 3974784, 'steps': 20701, 'loss/train': 1.688059687614441} 11/07/2021 00:09:24 - INFO - __main__ - Step 20703: {'lr': 0.00048055753888749013, 'samples': 3974976, 'steps': 20702, 'loss/train': 1.1523799896240234} 11/07/2021 00:09:25 - INFO - __main__ - Step 20704: {'lr': 0.0004805554870287032, 'samples': 3975168, 'steps': 20703, 'loss/train': 1.2559438943862915} 11/07/2021 00:09:26 - INFO - __main__ - Step 20705: {'lr': 0.0004805534350660315, 'samples': 3975360, 'steps': 20704, 'loss/train': 1.656295657157898} 11/07/2021 00:09:26 - INFO - __main__ - Step 20706: {'lr': 0.000480551382999476, 'samples': 3975552, 'steps': 20705, 'loss/train': 0.858877420425415} 11/07/2021 00:09:26 - INFO - __main__ - Step 20707: {'lr': 0.00048054933082903754, 'samples': 3975744, 'steps': 20706, 'loss/train': 1.4430663585662842} 11/07/2021 00:09:27 - INFO - __main__ - Step 20708: {'lr': 0.00048054727855471717, 'samples': 3975936, 'steps': 20707, 'loss/train': 1.785535216331482} 11/07/2021 00:09:27 - INFO - __main__ - Step 20709: {'lr': 0.00048054522617651575, 'samples': 3976128, 'steps': 20708, 'loss/train': 1.5893622636795044} 11/07/2021 00:09:27 - INFO - __main__ - Step 20710: {'lr': 0.0004805431736944342, 'samples': 3976320, 'steps': 20709, 'loss/train': 2.038050889968872} 11/07/2021 00:09:28 - INFO - __main__ - Step 20711: {'lr': 0.0004805411211084735, 'samples': 3976512, 'steps': 20710, 'loss/train': 1.4637943506240845} 11/07/2021 00:09:29 - INFO - __main__ - Step 20712: {'lr': 0.0004805390684186344, 'samples': 3976704, 'steps': 20711, 'loss/train': 1.8713645935058594} 11/07/2021 00:09:29 - INFO - __main__ - Step 20713: {'lr': 0.00048053701562491804, 'samples': 3976896, 'steps': 20712, 'loss/train': 1.541002869606018} 11/07/2021 00:09:30 - INFO - __main__ - Step 20714: {'lr': 0.0004805349627273253, 'samples': 3977088, 'steps': 20713, 'loss/train': 1.3053054809570312} 11/07/2021 00:09:30 - INFO - __main__ - Step 20715: {'lr': 0.00048053290972585697, 'samples': 3977280, 'steps': 20714, 'loss/train': 1.6590816974639893} 11/07/2021 00:09:31 - INFO - __main__ - Step 20716: {'lr': 0.0004805308566205141, 'samples': 3977472, 'steps': 20715, 'loss/train': 1.4192510843276978} 11/07/2021 00:09:31 - INFO - __main__ - Step 20717: {'lr': 0.00048052880341129764, 'samples': 3977664, 'steps': 20716, 'loss/train': 2.1568589210510254} 11/07/2021 00:09:32 - INFO - __main__ - Step 20718: {'lr': 0.00048052675009820837, 'samples': 3977856, 'steps': 20717, 'loss/train': 1.3425061702728271} 11/07/2021 00:09:32 - INFO - __main__ - Step 20719: {'lr': 0.0004805246966812474, 'samples': 3978048, 'steps': 20718, 'loss/train': 1.3434855937957764} 11/07/2021 00:09:32 - INFO - __main__ - Step 20720: {'lr': 0.0004805226431604155, 'samples': 3978240, 'steps': 20719, 'loss/train': 1.4448708295822144} 11/07/2021 00:09:33 - INFO - __main__ - Step 20721: {'lr': 0.00048052058953571366, 'samples': 3978432, 'steps': 20720, 'loss/train': 1.672785997390747} 11/07/2021 00:09:34 - INFO - __main__ - Step 20722: {'lr': 0.0004805185358071428, 'samples': 3978624, 'steps': 20721, 'loss/train': 1.3219795227050781} 11/07/2021 00:09:34 - INFO - __main__ - Step 20723: {'lr': 0.0004805164819747038, 'samples': 3978816, 'steps': 20722, 'loss/train': 1.7978225946426392} 11/07/2021 00:09:34 - INFO - __main__ - Step 20724: {'lr': 0.0004805144280383977, 'samples': 3979008, 'steps': 20723, 'loss/train': 1.7678934335708618} 11/07/2021 00:09:35 - INFO - __main__ - Step 20725: {'lr': 0.00048051237399822534, 'samples': 3979200, 'steps': 20724, 'loss/train': 1.6825202703475952} 11/07/2021 00:09:36 - INFO - __main__ - Step 20726: {'lr': 0.00048051031985418764, 'samples': 3979392, 'steps': 20725, 'loss/train': 2.1667885780334473} 11/07/2021 00:09:36 - INFO - __main__ - Step 20727: {'lr': 0.0004805082656062856, 'samples': 3979584, 'steps': 20726, 'loss/train': 1.395729422569275} 11/07/2021 00:09:36 - INFO - __main__ - Step 20728: {'lr': 0.00048050621125451996, 'samples': 3979776, 'steps': 20727, 'loss/train': 1.5101127624511719} 11/07/2021 00:09:37 - INFO - __main__ - Step 20729: {'lr': 0.00048050415679889194, 'samples': 3979968, 'steps': 20728, 'loss/train': 1.6429022550582886} 11/07/2021 00:09:37 - INFO - __main__ - Step 20730: {'lr': 0.0004805021022394022, 'samples': 3980160, 'steps': 20729, 'loss/train': 1.715214729309082} 11/07/2021 00:09:38 - INFO - __main__ - Step 20731: {'lr': 0.0004805000475760518, 'samples': 3980352, 'steps': 20730, 'loss/train': 1.8093258142471313} 11/07/2021 00:09:38 - INFO - __main__ - Step 20732: {'lr': 0.0004804979928088417, 'samples': 3980544, 'steps': 20731, 'loss/train': 1.727259874343872} 11/07/2021 00:09:39 - INFO - __main__ - Step 20733: {'lr': 0.0004804959379377727, 'samples': 3980736, 'steps': 20732, 'loss/train': 1.4324378967285156} 11/07/2021 00:09:39 - INFO - __main__ - Step 20734: {'lr': 0.00048049388296284576, 'samples': 3980928, 'steps': 20733, 'loss/train': 1.0420160293579102} 11/07/2021 00:09:40 - INFO - __main__ - Step 20735: {'lr': 0.00048049182788406186, 'samples': 3981120, 'steps': 20734, 'loss/train': 1.1756582260131836} 11/07/2021 00:09:41 - INFO - __main__ - Step 20736: {'lr': 0.0004804897727014219, 'samples': 3981312, 'steps': 20735, 'loss/train': 1.6517256498336792} 11/07/2021 00:09:41 - INFO - __main__ - Step 20737: {'lr': 0.0004804877174149268, 'samples': 3981504, 'steps': 20736, 'loss/train': 1.1860313415527344} 11/07/2021 00:09:41 - INFO - __main__ - Step 20738: {'lr': 0.00048048566202457747, 'samples': 3981696, 'steps': 20737, 'loss/train': 1.743064045906067} 11/07/2021 00:09:42 - INFO - __main__ - Step 20739: {'lr': 0.00048048360653037494, 'samples': 3981888, 'steps': 20738, 'loss/train': 1.2434656620025635} 11/07/2021 00:09:42 - INFO - __main__ - Step 20740: {'lr': 0.00048048155093231994, 'samples': 3982080, 'steps': 20739, 'loss/train': 1.5692754983901978} 11/07/2021 00:09:42 - INFO - __main__ - Step 20741: {'lr': 0.00048047949523041355, 'samples': 3982272, 'steps': 20740, 'loss/train': 1.2839480638504028} 11/07/2021 00:09:44 - INFO - __main__ - Step 20742: {'lr': 0.0004804774394246567, 'samples': 3982464, 'steps': 20741, 'loss/train': 0.8025193214416504} 11/07/2021 00:09:44 - INFO - __main__ - Step 20743: {'lr': 0.0004804753835150503, 'samples': 3982656, 'steps': 20742, 'loss/train': 1.786361813545227} 11/07/2021 00:09:44 - INFO - __main__ - Step 20744: {'lr': 0.0004804733275015951, 'samples': 3982848, 'steps': 20743, 'loss/train': 1.8952442407608032} 11/07/2021 00:09:45 - INFO - __main__ - Step 20745: {'lr': 0.0004804712713842923, 'samples': 3983040, 'steps': 20744, 'loss/train': 1.9803398847579956} 11/07/2021 00:09:46 - INFO - __main__ - Step 20746: {'lr': 0.0004804692151631427, 'samples': 3983232, 'steps': 20745, 'loss/train': 1.8971871137619019} 11/07/2021 00:09:46 - INFO - __main__ - Step 20747: {'lr': 0.00048046715883814716, 'samples': 3983424, 'steps': 20746, 'loss/train': 1.5037178993225098} 11/07/2021 00:09:46 - INFO - __main__ - Step 20748: {'lr': 0.00048046510240930674, 'samples': 3983616, 'steps': 20747, 'loss/train': 1.7305831909179688} 11/07/2021 00:09:47 - INFO - __main__ - Step 20749: {'lr': 0.00048046304587662225, 'samples': 3983808, 'steps': 20748, 'loss/train': 1.7680881023406982} 11/07/2021 00:09:47 - INFO - __main__ - Step 20750: {'lr': 0.00048046098924009467, 'samples': 3984000, 'steps': 20749, 'loss/train': 1.5476644039154053} 11/07/2021 00:09:48 - INFO - __main__ - Step 20751: {'lr': 0.00048045893249972497, 'samples': 3984192, 'steps': 20750, 'loss/train': 1.4620814323425293} 11/07/2021 00:09:48 - INFO - __main__ - Step 20752: {'lr': 0.000480456875655514, 'samples': 3984384, 'steps': 20751, 'loss/train': 1.5789313316345215} 11/07/2021 00:09:49 - INFO - __main__ - Step 20753: {'lr': 0.0004804548187074628, 'samples': 3984576, 'steps': 20752, 'loss/train': 1.754978060722351} 11/07/2021 00:09:49 - INFO - __main__ - Step 20754: {'lr': 0.0004804527616555721, 'samples': 3984768, 'steps': 20753, 'loss/train': 1.7650110721588135} 11/07/2021 00:09:50 - INFO - __main__ - Step 20755: {'lr': 0.00048045070449984295, 'samples': 3984960, 'steps': 20754, 'loss/train': 1.4159955978393555} 11/07/2021 00:09:51 - INFO - __main__ - Step 20756: {'lr': 0.0004804486472402763, 'samples': 3985152, 'steps': 20755, 'loss/train': 1.448042392730713} 11/07/2021 00:09:51 - INFO - __main__ - Step 20757: {'lr': 0.0004804465898768731, 'samples': 3985344, 'steps': 20756, 'loss/train': 1.3452638387680054} 11/07/2021 00:09:51 - INFO - __main__ - Step 20758: {'lr': 0.00048044453240963413, 'samples': 3985536, 'steps': 20757, 'loss/train': 1.4904249906539917} 11/07/2021 00:09:52 - INFO - __main__ - Step 20759: {'lr': 0.00048044247483856043, 'samples': 3985728, 'steps': 20758, 'loss/train': 1.6131950616836548} 11/07/2021 00:09:52 - INFO - __main__ - Step 20760: {'lr': 0.00048044041716365296, 'samples': 3985920, 'steps': 20759, 'loss/train': 1.4273293018341064} 11/07/2021 00:09:52 - INFO - __main__ - Step 20761: {'lr': 0.00048043835938491253, 'samples': 3986112, 'steps': 20760, 'loss/train': 1.341493010520935} 11/07/2021 00:09:54 - INFO - __main__ - Step 20762: {'lr': 0.0004804363015023402, 'samples': 3986304, 'steps': 20761, 'loss/train': 1.4203829765319824} 11/07/2021 00:09:54 - INFO - __main__ - Step 20763: {'lr': 0.00048043424351593676, 'samples': 3986496, 'steps': 20762, 'loss/train': 1.2885338068008423} 11/07/2021 00:09:54 - INFO - __main__ - Step 20764: {'lr': 0.0004804321854257032, 'samples': 3986688, 'steps': 20763, 'loss/train': 1.9818228483200073} 11/07/2021 00:09:55 - INFO - __main__ - Step 20765: {'lr': 0.0004804301272316405, 'samples': 3986880, 'steps': 20764, 'loss/train': 1.5577560663223267} 11/07/2021 00:09:55 - INFO - __main__ - Step 20766: {'lr': 0.0004804280689337496, 'samples': 3987072, 'steps': 20765, 'loss/train': 1.503300428390503} 11/07/2021 00:09:56 - INFO - __main__ - Step 20767: {'lr': 0.00048042601053203125, 'samples': 3987264, 'steps': 20766, 'loss/train': 1.59586763381958} 11/07/2021 00:09:56 - INFO - __main__ - Step 20768: {'lr': 0.00048042395202648646, 'samples': 3987456, 'steps': 20767, 'loss/train': 1.7341300249099731} 11/07/2021 00:09:57 - INFO - __main__ - Step 20769: {'lr': 0.00048042189341711636, 'samples': 3987648, 'steps': 20768, 'loss/train': 1.8474223613739014} 11/07/2021 00:09:57 - INFO - __main__ - Step 20770: {'lr': 0.0004804198347039216, 'samples': 3987840, 'steps': 20769, 'loss/train': 1.3138035535812378} 11/07/2021 00:09:57 - INFO - __main__ - Step 20771: {'lr': 0.0004804177758869032, 'samples': 3988032, 'steps': 20770, 'loss/train': 1.6257972717285156} 11/07/2021 00:09:58 - INFO - __main__ - Step 20772: {'lr': 0.0004804157169660622, 'samples': 3988224, 'steps': 20771, 'loss/train': 1.58247971534729} 11/07/2021 00:09:59 - INFO - __main__ - Step 20773: {'lr': 0.00048041365794139934, 'samples': 3988416, 'steps': 20772, 'loss/train': 2.3430113792419434} 11/07/2021 00:09:59 - INFO - __main__ - Step 20774: {'lr': 0.00048041159881291574, 'samples': 3988608, 'steps': 20773, 'loss/train': 1.5301451683044434} 11/07/2021 00:09:59 - INFO - __main__ - Step 20775: {'lr': 0.0004804095395806122, 'samples': 3988800, 'steps': 20774, 'loss/train': 1.7506057024002075} 11/07/2021 00:10:00 - INFO - __main__ - Step 20776: {'lr': 0.00048040748024448954, 'samples': 3988992, 'steps': 20775, 'loss/train': 1.5352082252502441} 11/07/2021 00:10:01 - INFO - __main__ - Step 20777: {'lr': 0.00048040542080454897, 'samples': 3989184, 'steps': 20776, 'loss/train': 1.5161960124969482} 11/07/2021 00:10:01 - INFO - __main__ - Step 20778: {'lr': 0.0004804033612607912, 'samples': 3989376, 'steps': 20777, 'loss/train': 1.5611916780471802} 11/07/2021 00:10:02 - INFO - __main__ - Step 20779: {'lr': 0.00048040130161321724, 'samples': 3989568, 'steps': 20778, 'loss/train': 1.8434571027755737} 11/07/2021 00:10:02 - INFO - __main__ - Step 20780: {'lr': 0.0004803992418618281, 'samples': 3989760, 'steps': 20779, 'loss/train': 1.56538724899292} 11/07/2021 00:10:02 - INFO - __main__ - Step 20781: {'lr': 0.00048039718200662454, 'samples': 3989952, 'steps': 20780, 'loss/train': 1.558919906616211} 11/07/2021 00:10:03 - INFO - __main__ - Step 20782: {'lr': 0.0004803951220476076, 'samples': 3990144, 'steps': 20781, 'loss/train': 2.473111391067505} 11/07/2021 00:10:04 - INFO - __main__ - Step 20783: {'lr': 0.00048039306198477817, 'samples': 3990336, 'steps': 20782, 'loss/train': 1.4748272895812988} 11/07/2021 00:10:04 - INFO - __main__ - Step 20784: {'lr': 0.0004803910018181371, 'samples': 3990528, 'steps': 20783, 'loss/train': 1.1002256870269775} 11/07/2021 00:10:04 - INFO - __main__ - Step 20785: {'lr': 0.0004803889415476855, 'samples': 3990720, 'steps': 20784, 'loss/train': 1.3398923873901367} 11/07/2021 00:10:05 - INFO - __main__ - Step 20786: {'lr': 0.0004803868811734242, 'samples': 3990912, 'steps': 20785, 'loss/train': 5.52646541595459} 11/07/2021 00:10:05 - INFO - __main__ - Step 20787: {'lr': 0.00048038482069535406, 'samples': 3991104, 'steps': 20786, 'loss/train': 1.664506435394287} 11/07/2021 00:10:06 - INFO - __main__ - Step 20788: {'lr': 0.000480382760113476, 'samples': 3991296, 'steps': 20787, 'loss/train': 1.9385948181152344} 11/07/2021 00:10:07 - INFO - __main__ - Step 20789: {'lr': 0.00048038069942779116, 'samples': 3991488, 'steps': 20788, 'loss/train': 1.4435739517211914} 11/07/2021 00:10:07 - INFO - __main__ - Step 20790: {'lr': 0.00048037863863830034, 'samples': 3991680, 'steps': 20789, 'loss/train': 1.7639094591140747} 11/07/2021 00:10:07 - INFO - __main__ - Step 20791: {'lr': 0.0004803765777450044, 'samples': 3991872, 'steps': 20790, 'loss/train': 2.179063081741333} 11/07/2021 00:10:08 - INFO - __main__ - Step 20792: {'lr': 0.00048037451674790433, 'samples': 3992064, 'steps': 20791, 'loss/train': 1.5259931087493896} 11/07/2021 00:10:09 - INFO - __main__ - Step 20793: {'lr': 0.0004803724556470011, 'samples': 3992256, 'steps': 20792, 'loss/train': 0.41106319427490234} 11/07/2021 00:10:09 - INFO - __main__ - Step 20794: {'lr': 0.0004803703944422956, 'samples': 3992448, 'steps': 20793, 'loss/train': 1.5696848630905151} 11/07/2021 00:10:10 - INFO - __main__ - Step 20795: {'lr': 0.0004803683331337887, 'samples': 3992640, 'steps': 20794, 'loss/train': 1.5306452512741089} 11/07/2021 00:10:10 - INFO - __main__ - Step 20796: {'lr': 0.0004803662717214814, 'samples': 3992832, 'steps': 20795, 'loss/train': 1.5071040391921997} 11/07/2021 00:10:10 - INFO - __main__ - Step 20797: {'lr': 0.00048036421020537464, 'samples': 3993024, 'steps': 20796, 'loss/train': 1.7013245820999146} 11/07/2021 00:10:11 - INFO - __main__ - Step 20798: {'lr': 0.0004803621485854693, 'samples': 3993216, 'steps': 20797, 'loss/train': 0.595490574836731} 11/07/2021 00:10:12 - INFO - __main__ - Step 20799: {'lr': 0.00048036008686176636, 'samples': 3993408, 'steps': 20798, 'loss/train': 1.1961723566055298} 11/07/2021 00:10:12 - INFO - __main__ - Step 20800: {'lr': 0.0004803580250342666, 'samples': 3993600, 'steps': 20799, 'loss/train': 1.7574987411499023} 11/07/2021 00:10:12 - INFO - __main__ - Step 20801: {'lr': 0.00048035596310297125, 'samples': 3993792, 'steps': 20800, 'loss/train': 1.2915315628051758} 11/07/2021 00:10:13 - INFO - __main__ - Step 20802: {'lr': 0.0004803539010678809, 'samples': 3993984, 'steps': 20801, 'loss/train': 1.912282943725586} 11/07/2021 00:10:13 - INFO - __main__ - Step 20803: {'lr': 0.00048035183892899676, 'samples': 3994176, 'steps': 20802, 'loss/train': 1.7721303701400757} 11/07/2021 00:10:14 - INFO - __main__ - Step 20804: {'lr': 0.0004803497766863195, 'samples': 3994368, 'steps': 20803, 'loss/train': 1.472104787826538} 11/07/2021 00:10:14 - INFO - __main__ - Step 20805: {'lr': 0.00048034771433985035, 'samples': 3994560, 'steps': 20804, 'loss/train': 1.4602023363113403} 11/07/2021 00:10:15 - INFO - __main__ - Step 20806: {'lr': 0.00048034565188959, 'samples': 3994752, 'steps': 20805, 'loss/train': 1.452834963798523} 11/07/2021 00:10:15 - INFO - __main__ - Step 20807: {'lr': 0.0004803435893355394, 'samples': 3994944, 'steps': 20806, 'loss/train': 1.882066011428833} 11/07/2021 00:10:15 - INFO - __main__ - Step 20808: {'lr': 0.00048034152667769957, 'samples': 3995136, 'steps': 20807, 'loss/train': 1.5793373584747314} 11/07/2021 00:10:17 - INFO - __main__ - Step 20809: {'lr': 0.0004803394639160714, 'samples': 3995328, 'steps': 20808, 'loss/train': 1.8108179569244385} 11/07/2021 00:10:17 - INFO - __main__ - Step 20810: {'lr': 0.00048033740105065585, 'samples': 3995520, 'steps': 20809, 'loss/train': 2.055591106414795} 11/07/2021 00:10:17 - INFO - __main__ - Step 20811: {'lr': 0.0004803353380814538, 'samples': 3995712, 'steps': 20810, 'loss/train': 1.3634858131408691} 11/07/2021 00:10:18 - INFO - __main__ - Step 20812: {'lr': 0.00048033327500846625, 'samples': 3995904, 'steps': 20811, 'loss/train': 1.299759864807129} 11/07/2021 00:10:18 - INFO - __main__ - Step 20813: {'lr': 0.000480331211831694, 'samples': 3996096, 'steps': 20812, 'loss/train': 1.4120014905929565} 11/07/2021 00:10:19 - INFO - __main__ - Step 20814: {'lr': 0.00048032914855113807, 'samples': 3996288, 'steps': 20813, 'loss/train': 1.361594796180725} 11/07/2021 00:10:19 - INFO - __main__ - Step 20815: {'lr': 0.00048032708516679946, 'samples': 3996480, 'steps': 20814, 'loss/train': 1.8684601783752441} 11/07/2021 00:10:20 - INFO - __main__ - Step 20816: {'lr': 0.00048032502167867896, 'samples': 3996672, 'steps': 20815, 'loss/train': 1.841715693473816} 11/07/2021 00:10:20 - INFO - __main__ - Step 20817: {'lr': 0.0004803229580867775, 'samples': 3996864, 'steps': 20816, 'loss/train': 1.7938896417617798} 11/07/2021 00:10:20 - INFO - __main__ - Step 20818: {'lr': 0.0004803208943910962, 'samples': 3997056, 'steps': 20817, 'loss/train': 1.2337151765823364} 11/07/2021 00:10:21 - INFO - __main__ - Step 20819: {'lr': 0.00048031883059163576, 'samples': 3997248, 'steps': 20818, 'loss/train': 1.5640259981155396} 11/07/2021 00:10:22 - INFO - __main__ - Step 20820: {'lr': 0.00048031676668839723, 'samples': 3997440, 'steps': 20819, 'loss/train': 1.9230550527572632} 11/07/2021 00:10:22 - INFO - __main__ - Step 20821: {'lr': 0.00048031470268138153, 'samples': 3997632, 'steps': 20820, 'loss/train': 1.2242671251296997} 11/07/2021 00:10:22 - INFO - __main__ - Step 20822: {'lr': 0.00048031263857058957, 'samples': 3997824, 'steps': 20821, 'loss/train': 1.852646827697754} 11/07/2021 00:10:23 - INFO - __main__ - Step 20823: {'lr': 0.00048031057435602234, 'samples': 3998016, 'steps': 20822, 'loss/train': 1.4730157852172852} 11/07/2021 00:10:24 - INFO - __main__ - Step 20824: {'lr': 0.0004803085100376807, 'samples': 3998208, 'steps': 20823, 'loss/train': 1.5855153799057007} 11/07/2021 00:10:24 - INFO - __main__ - Step 20825: {'lr': 0.00048030644561556556, 'samples': 3998400, 'steps': 20824, 'loss/train': 1.704028844833374} 11/07/2021 00:10:25 - INFO - __main__ - Step 20826: {'lr': 0.0004803043810896779, 'samples': 3998592, 'steps': 20825, 'loss/train': 1.5183931589126587} 11/07/2021 00:10:25 - INFO - __main__ - Step 20827: {'lr': 0.00048030231646001867, 'samples': 3998784, 'steps': 20826, 'loss/train': 1.6759287118911743} 11/07/2021 00:10:25 - INFO - __main__ - Step 20828: {'lr': 0.0004803002517265887, 'samples': 3998976, 'steps': 20827, 'loss/train': 0.2678111791610718} 11/07/2021 00:10:27 - INFO - __main__ - Step 20829: {'lr': 0.0004802981868893891, 'samples': 3999168, 'steps': 20828, 'loss/train': 1.3716883659362793} 11/07/2021 00:10:27 - INFO - __main__ - Step 20830: {'lr': 0.00048029612194842056, 'samples': 3999360, 'steps': 20829, 'loss/train': 1.400437593460083} 11/07/2021 00:10:27 - INFO - __main__ - Step 20831: {'lr': 0.0004802940569036842, 'samples': 3999552, 'steps': 20830, 'loss/train': 2.0138940811157227} 11/07/2021 00:10:28 - INFO - __main__ - Step 20832: {'lr': 0.0004802919917551809, 'samples': 3999744, 'steps': 20831, 'loss/train': 1.691053867340088} 11/07/2021 00:10:28 - INFO - __main__ - Step 20833: {'lr': 0.00048028992650291156, 'samples': 3999936, 'steps': 20832, 'loss/train': 1.3104307651519775} 11/07/2021 00:10:28 - INFO - __main__ - Step 20834: {'lr': 0.00048028786114687715, 'samples': 4000128, 'steps': 20833, 'loss/train': 1.7613739967346191} 11/07/2021 00:10:29 - INFO - __main__ - Step 20835: {'lr': 0.0004802857956870786, 'samples': 4000320, 'steps': 20834, 'loss/train': 1.5970216989517212} 11/07/2021 00:10:30 - INFO - __main__ - Step 20836: {'lr': 0.00048028373012351684, 'samples': 4000512, 'steps': 20835, 'loss/train': 1.327458143234253} 11/07/2021 00:10:30 - INFO - __main__ - Step 20837: {'lr': 0.00048028166445619275, 'samples': 4000704, 'steps': 20836, 'loss/train': 1.427208423614502} 11/07/2021 00:10:31 - INFO - __main__ - Step 20838: {'lr': 0.0004802795986851073, 'samples': 4000896, 'steps': 20837, 'loss/train': 1.2698725461959839} 11/07/2021 00:10:31 - INFO - __main__ - Step 20839: {'lr': 0.00048027753281026144, 'samples': 4001088, 'steps': 20838, 'loss/train': 1.6905814409255981} 11/07/2021 00:10:32 - INFO - __main__ - Step 20840: {'lr': 0.000480275466831656, 'samples': 4001280, 'steps': 20839, 'loss/train': 1.8319936990737915} 11/07/2021 00:10:32 - INFO - __main__ - Step 20841: {'lr': 0.00048027340074929207, 'samples': 4001472, 'steps': 20840, 'loss/train': 1.751729965209961} 11/07/2021 00:10:33 - INFO - __main__ - Step 20842: {'lr': 0.0004802713345631705, 'samples': 4001664, 'steps': 20841, 'loss/train': 1.7456939220428467} 11/07/2021 00:10:33 - INFO - __main__ - Step 20843: {'lr': 0.0004802692682732922, 'samples': 4001856, 'steps': 20842, 'loss/train': 1.7693142890930176} 11/07/2021 00:10:33 - INFO - __main__ - Step 20844: {'lr': 0.0004802672018796581, 'samples': 4002048, 'steps': 20843, 'loss/train': 1.407902479171753} 11/07/2021 00:10:34 - INFO - __main__ - Step 20845: {'lr': 0.0004802651353822691, 'samples': 4002240, 'steps': 20844, 'loss/train': 1.528710126876831} 11/07/2021 00:10:35 - INFO - __main__ - Step 20846: {'lr': 0.0004802630687811263, 'samples': 4002432, 'steps': 20845, 'loss/train': 1.6070642471313477} 11/07/2021 00:10:35 - INFO - __main__ - Step 20847: {'lr': 0.00048026100207623047, 'samples': 4002624, 'steps': 20846, 'loss/train': 1.699964165687561} 11/07/2021 00:10:35 - INFO - __main__ - Step 20848: {'lr': 0.0004802589352675826, 'samples': 4002816, 'steps': 20847, 'loss/train': 1.9966727495193481} 11/07/2021 00:10:36 - INFO - __main__ - Step 20849: {'lr': 0.0004802568683551836, 'samples': 4003008, 'steps': 20848, 'loss/train': 1.0162174701690674} 11/07/2021 00:10:37 - INFO - __main__ - Step 20850: {'lr': 0.0004802548013390343, 'samples': 4003200, 'steps': 20849, 'loss/train': 1.5560613870620728} 11/07/2021 00:10:37 - INFO - __main__ - Step 20851: {'lr': 0.00048025273421913587, 'samples': 4003392, 'steps': 20850, 'loss/train': 1.6622282266616821} 11/07/2021 00:10:37 - INFO - __main__ - Step 20852: {'lr': 0.0004802506669954891, 'samples': 4003584, 'steps': 20851, 'loss/train': 1.6507529020309448} 11/07/2021 00:10:38 - INFO - __main__ - Step 20853: {'lr': 0.00048024859966809487, 'samples': 4003776, 'steps': 20852, 'loss/train': 1.8268312215805054} 11/07/2021 00:10:38 - INFO - __main__ - Step 20854: {'lr': 0.00048024653223695425, 'samples': 4003968, 'steps': 20853, 'loss/train': 1.3219914436340332} 11/07/2021 00:10:39 - INFO - __main__ - Step 20855: {'lr': 0.00048024446470206806, 'samples': 4004160, 'steps': 20854, 'loss/train': 1.3745572566986084} 11/07/2021 00:10:40 - INFO - __main__ - Step 20856: {'lr': 0.0004802423970634373, 'samples': 4004352, 'steps': 20855, 'loss/train': 1.3092901706695557} 11/07/2021 00:10:40 - INFO - __main__ - Step 20857: {'lr': 0.00048024032932106277, 'samples': 4004544, 'steps': 20856, 'loss/train': 2.9461312294006348} 11/07/2021 00:10:40 - INFO - __main__ - Step 20858: {'lr': 0.00048023826147494556, 'samples': 4004736, 'steps': 20857, 'loss/train': 1.3187404870986938} 11/07/2021 00:10:41 - INFO - __main__ - Step 20859: {'lr': 0.0004802361935250865, 'samples': 4004928, 'steps': 20858, 'loss/train': 1.6424322128295898} 11/07/2021 00:10:42 - INFO - __main__ - Step 20860: {'lr': 0.0004802341254714867, 'samples': 4005120, 'steps': 20859, 'loss/train': 1.394176959991455} 11/07/2021 00:10:42 - INFO - __main__ - Step 20861: {'lr': 0.00048023205731414684, 'samples': 4005312, 'steps': 20860, 'loss/train': 1.5472851991653442} 11/07/2021 00:10:42 - INFO - __main__ - Step 20862: {'lr': 0.00048022998905306795, 'samples': 4005504, 'steps': 20861, 'loss/train': 1.6439249515533447} 11/07/2021 00:10:43 - INFO - __main__ - Step 20863: {'lr': 0.00048022792068825107, 'samples': 4005696, 'steps': 20862, 'loss/train': 1.2037049531936646} 11/07/2021 00:10:43 - INFO - __main__ - Step 20864: {'lr': 0.00048022585221969697, 'samples': 4005888, 'steps': 20863, 'loss/train': 1.481030821800232} 11/07/2021 00:10:43 - INFO - __main__ - Step 20865: {'lr': 0.00048022378364740673, 'samples': 4006080, 'steps': 20864, 'loss/train': 1.4156023263931274} 11/07/2021 00:10:44 - INFO - __main__ - Step 20866: {'lr': 0.0004802217149713811, 'samples': 4006272, 'steps': 20865, 'loss/train': 1.7416797876358032} 11/07/2021 00:10:45 - INFO - __main__ - Step 20867: {'lr': 0.0004802196461916212, 'samples': 4006464, 'steps': 20866, 'loss/train': 1.5621509552001953} 11/07/2021 00:10:45 - INFO - __main__ - Step 20868: {'lr': 0.0004802175773081278, 'samples': 4006656, 'steps': 20867, 'loss/train': 1.5445585250854492} 11/07/2021 00:10:46 - INFO - __main__ - Step 20869: {'lr': 0.000480215508320902, 'samples': 4006848, 'steps': 20868, 'loss/train': 1.958810806274414} 11/07/2021 00:10:46 - INFO - __main__ - Step 20870: {'lr': 0.0004802134392299446, 'samples': 4007040, 'steps': 20869, 'loss/train': 1.8098005056381226} 11/07/2021 00:10:47 - INFO - __main__ - Step 20871: {'lr': 0.0004802113700352566, 'samples': 4007232, 'steps': 20870, 'loss/train': 1.7087562084197998} 11/07/2021 00:10:47 - INFO - __main__ - Step 20872: {'lr': 0.00048020930073683886, 'samples': 4007424, 'steps': 20871, 'loss/train': 1.5298601388931274} 11/07/2021 00:10:48 - INFO - __main__ - Step 20873: {'lr': 0.0004802072313346924, 'samples': 4007616, 'steps': 20872, 'loss/train': 1.7708653211593628} 11/07/2021 00:10:48 - INFO - __main__ - Step 20874: {'lr': 0.00048020516182881813, 'samples': 4007808, 'steps': 20873, 'loss/train': 1.6466927528381348} 11/07/2021 00:10:48 - INFO - __main__ - Step 20875: {'lr': 0.00048020309221921686, 'samples': 4008000, 'steps': 20874, 'loss/train': 1.5985970497131348} 11/07/2021 00:10:49 - INFO - __main__ - Step 20876: {'lr': 0.00048020102250588976, 'samples': 4008192, 'steps': 20875, 'loss/train': 1.2552753686904907} 11/07/2021 00:10:50 - INFO - __main__ - Step 20877: {'lr': 0.00048019895268883764, 'samples': 4008384, 'steps': 20876, 'loss/train': 1.6351234912872314} 11/07/2021 00:10:50 - INFO - __main__ - Step 20878: {'lr': 0.0004801968827680613, 'samples': 4008576, 'steps': 20877, 'loss/train': 1.4687561988830566} 11/07/2021 00:10:50 - INFO - __main__ - Step 20879: {'lr': 0.00048019481274356194, 'samples': 4008768, 'steps': 20878, 'loss/train': 2.0849993228912354} 11/07/2021 00:10:51 - INFO - __main__ - Step 20880: {'lr': 0.0004801927426153402, 'samples': 4008960, 'steps': 20879, 'loss/train': 1.5084853172302246} 11/07/2021 00:10:52 - INFO - __main__ - Step 20881: {'lr': 0.00048019067238339725, 'samples': 4009152, 'steps': 20880, 'loss/train': 1.1716647148132324} 11/07/2021 00:10:52 - INFO - __main__ - Step 20882: {'lr': 0.000480188602047734, 'samples': 4009344, 'steps': 20881, 'loss/train': 1.4270188808441162} 11/07/2021 00:10:52 - INFO - __main__ - Step 20883: {'lr': 0.0004801865316083512, 'samples': 4009536, 'steps': 20882, 'loss/train': 1.583962082862854} 11/07/2021 00:10:53 - INFO - __main__ - Step 20884: {'lr': 0.0004801844610652499, 'samples': 4009728, 'steps': 20883, 'loss/train': 1.8874273300170898} 11/07/2021 00:10:53 - INFO - __main__ - Step 20885: {'lr': 0.0004801823904184311, 'samples': 4009920, 'steps': 20884, 'loss/train': 1.2241159677505493} 11/07/2021 00:10:53 - INFO - __main__ - Step 20886: {'lr': 0.00048018031966789564, 'samples': 4010112, 'steps': 20885, 'loss/train': 1.395066738128662} 11/07/2021 00:10:54 - INFO - __main__ - Step 20887: {'lr': 0.0004801782488136445, 'samples': 4010304, 'steps': 20886, 'loss/train': 1.531816840171814} 11/07/2021 00:10:55 - INFO - __main__ - Step 20888: {'lr': 0.00048017617785567855, 'samples': 4010496, 'steps': 20887, 'loss/train': 1.872235655784607} 11/07/2021 00:10:55 - INFO - __main__ - Step 20889: {'lr': 0.00048017410679399876, 'samples': 4010688, 'steps': 20888, 'loss/train': 1.6539621353149414} 11/07/2021 00:10:56 - INFO - __main__ - Step 20890: {'lr': 0.00048017203562860614, 'samples': 4010880, 'steps': 20889, 'loss/train': 1.7974549531936646} 11/07/2021 00:10:56 - INFO - __main__ - Step 20891: {'lr': 0.0004801699643595015, 'samples': 4011072, 'steps': 20890, 'loss/train': 1.764176845550537} 11/07/2021 00:10:57 - INFO - __main__ - Step 20892: {'lr': 0.00048016789298668583, 'samples': 4011264, 'steps': 20891, 'loss/train': 1.7799986600875854} 11/07/2021 00:10:57 - INFO - __main__ - Step 20893: {'lr': 0.0004801658215101601, 'samples': 4011456, 'steps': 20892, 'loss/train': 1.6527446508407593} 11/07/2021 00:10:58 - INFO - __main__ - Step 20894: {'lr': 0.00048016374992992516, 'samples': 4011648, 'steps': 20893, 'loss/train': 1.2430540323257446} 11/07/2021 00:10:58 - INFO - __main__ - Step 20895: {'lr': 0.000480161678245982, 'samples': 4011840, 'steps': 20894, 'loss/train': 1.2979753017425537} 11/07/2021 00:10:58 - INFO - __main__ - Step 20896: {'lr': 0.0004801596064583315, 'samples': 4012032, 'steps': 20895, 'loss/train': 1.6988540887832642} 11/07/2021 00:11:00 - INFO - __main__ - Step 20897: {'lr': 0.00048015753456697466, 'samples': 4012224, 'steps': 20896, 'loss/train': 1.5440951585769653} 11/07/2021 00:11:00 - INFO - __main__ - Step 20898: {'lr': 0.00048015546257191243, 'samples': 4012416, 'steps': 20897, 'loss/train': 2.084376096725464} 11/07/2021 00:11:00 - INFO - __main__ - Step 20899: {'lr': 0.00048015339047314566, 'samples': 4012608, 'steps': 20898, 'loss/train': 1.7466663122177124} 11/07/2021 00:11:01 - INFO - __main__ - Step 20900: {'lr': 0.00048015131827067534, 'samples': 4012800, 'steps': 20899, 'loss/train': 1.352649450302124} 11/07/2021 00:11:01 - INFO - __main__ - Step 20901: {'lr': 0.0004801492459645024, 'samples': 4012992, 'steps': 20900, 'loss/train': 1.8301000595092773} 11/07/2021 00:11:02 - INFO - __main__ - Step 20902: {'lr': 0.0004801471735546277, 'samples': 4013184, 'steps': 20901, 'loss/train': 1.3566601276397705} 11/07/2021 00:11:02 - INFO - __main__ - Step 20903: {'lr': 0.0004801451010410522, 'samples': 4013376, 'steps': 20902, 'loss/train': 1.7443701028823853} 11/07/2021 00:11:03 - INFO - __main__ - Step 20904: {'lr': 0.000480143028423777, 'samples': 4013568, 'steps': 20903, 'loss/train': 1.2707422971725464} 11/07/2021 00:11:03 - INFO - __main__ - Step 20905: {'lr': 0.0004801409557028028, 'samples': 4013760, 'steps': 20904, 'loss/train': 0.893004834651947} 11/07/2021 00:11:03 - INFO - __main__ - Step 20906: {'lr': 0.0004801388828781307, 'samples': 4013952, 'steps': 20905, 'loss/train': 1.4361201524734497} 11/07/2021 00:11:04 - INFO - __main__ - Step 20907: {'lr': 0.00048013680994976154, 'samples': 4014144, 'steps': 20906, 'loss/train': 1.4615181684494019} 11/07/2021 00:11:05 - INFO - __main__ - Step 20908: {'lr': 0.0004801347369176963, 'samples': 4014336, 'steps': 20907, 'loss/train': 1.5402387380599976} 11/07/2021 00:11:05 - INFO - __main__ - Step 20909: {'lr': 0.00048013266378193586, 'samples': 4014528, 'steps': 20908, 'loss/train': 1.783090591430664} 11/07/2021 00:11:05 - INFO - __main__ - Step 20910: {'lr': 0.00048013059054248134, 'samples': 4014720, 'steps': 20909, 'loss/train': 1.7908695936203003} 11/07/2021 00:11:06 - INFO - __main__ - Step 20911: {'lr': 0.00048012851719933335, 'samples': 4014912, 'steps': 20910, 'loss/train': 1.4883944988250732} 11/07/2021 00:11:06 - INFO - __main__ - Step 20912: {'lr': 0.000480126443752493, 'samples': 4015104, 'steps': 20911, 'loss/train': 1.4716330766677856} 11/07/2021 00:11:07 - INFO - __main__ - Step 20913: {'lr': 0.0004801243702019614, 'samples': 4015296, 'steps': 20912, 'loss/train': 1.7692360877990723} 11/07/2021 00:11:08 - INFO - __main__ - Step 20914: {'lr': 0.00048012229654773915, 'samples': 4015488, 'steps': 20913, 'loss/train': 1.800770878791809} 11/07/2021 00:11:08 - INFO - __main__ - Step 20915: {'lr': 0.0004801202227898274, 'samples': 4015680, 'steps': 20914, 'loss/train': 1.5604429244995117} 11/07/2021 00:11:08 - INFO - __main__ - Step 20916: {'lr': 0.00048011814892822704, 'samples': 4015872, 'steps': 20915, 'loss/train': 1.069279670715332} 11/07/2021 00:11:09 - INFO - __main__ - Step 20917: {'lr': 0.00048011607496293896, 'samples': 4016064, 'steps': 20916, 'loss/train': 1.8001171350479126} 11/07/2021 00:11:10 - INFO - __main__ - Step 20918: {'lr': 0.0004801140008939642, 'samples': 4016256, 'steps': 20917, 'loss/train': 1.5695562362670898} 11/07/2021 00:11:10 - INFO - __main__ - Step 20919: {'lr': 0.00048011192672130356, 'samples': 4016448, 'steps': 20918, 'loss/train': 1.5356251001358032} 11/07/2021 00:11:10 - INFO - __main__ - Step 20920: {'lr': 0.000480109852444958, 'samples': 4016640, 'steps': 20919, 'loss/train': 1.2524460554122925} 11/07/2021 00:11:11 - INFO - __main__ - Step 20921: {'lr': 0.0004801077780649286, 'samples': 4016832, 'steps': 20920, 'loss/train': 1.808414101600647} 11/07/2021 00:11:11 - INFO - __main__ - Step 20922: {'lr': 0.00048010570358121606, 'samples': 4017024, 'steps': 20921, 'loss/train': 1.0232669115066528} 11/07/2021 00:11:12 - INFO - __main__ - Step 20923: {'lr': 0.0004801036289938215, 'samples': 4017216, 'steps': 20922, 'loss/train': 1.1512370109558105} 11/07/2021 00:11:13 - INFO - __main__ - Step 20924: {'lr': 0.0004801015543027458, 'samples': 4017408, 'steps': 20923, 'loss/train': 1.5613585710525513} 11/07/2021 00:11:13 - INFO - __main__ - Step 20925: {'lr': 0.0004800994795079899, 'samples': 4017600, 'steps': 20924, 'loss/train': 1.8445370197296143} 11/07/2021 00:11:13 - INFO - __main__ - Step 20926: {'lr': 0.00048009740460955465, 'samples': 4017792, 'steps': 20925, 'loss/train': 1.7685003280639648} 11/07/2021 00:11:14 - INFO - __main__ - Step 20927: {'lr': 0.00048009532960744116, 'samples': 4017984, 'steps': 20926, 'loss/train': 1.086643099784851} 11/07/2021 00:11:15 - INFO - __main__ - Step 20928: {'lr': 0.0004800932545016502, 'samples': 4018176, 'steps': 20927, 'loss/train': 1.2196599245071411} 11/07/2021 00:11:15 - INFO - __main__ - Step 20929: {'lr': 0.0004800911792921828, 'samples': 4018368, 'steps': 20928, 'loss/train': 1.2705469131469727} 11/07/2021 00:11:15 - INFO - __main__ - Step 20930: {'lr': 0.0004800891039790399, 'samples': 4018560, 'steps': 20929, 'loss/train': 1.3246406316757202} 11/07/2021 00:11:16 - INFO - __main__ - Step 20931: {'lr': 0.00048008702856222233, 'samples': 4018752, 'steps': 20930, 'loss/train': 1.7419018745422363} 11/07/2021 00:11:16 - INFO - __main__ - Step 20932: {'lr': 0.0004800849530417312, 'samples': 4018944, 'steps': 20931, 'loss/train': 1.572576880455017} 11/07/2021 00:11:17 - INFO - __main__ - Step 20933: {'lr': 0.00048008287741756715, 'samples': 4019136, 'steps': 20932, 'loss/train': 1.5679242610931396} 11/07/2021 00:11:17 - INFO - __main__ - Step 20934: {'lr': 0.00048008080168973144, 'samples': 4019328, 'steps': 20933, 'loss/train': 1.1735849380493164} 11/07/2021 00:11:18 - INFO - __main__ - Step 20935: {'lr': 0.00048007872585822486, 'samples': 4019520, 'steps': 20934, 'loss/train': 1.741011619567871} 11/07/2021 00:11:18 - INFO - __main__ - Step 20936: {'lr': 0.00048007664992304834, 'samples': 4019712, 'steps': 20935, 'loss/train': 1.6827812194824219} 11/07/2021 00:11:18 - INFO - __main__ - Step 20937: {'lr': 0.0004800745738842029, 'samples': 4019904, 'steps': 20936, 'loss/train': 1.646528720855713} 11/07/2021 00:11:20 - INFO - __main__ - Step 20938: {'lr': 0.0004800724977416894, 'samples': 4020096, 'steps': 20937, 'loss/train': 2.168165445327759} 11/07/2021 00:11:20 - INFO - __main__ - Step 20939: {'lr': 0.00048007042149550866, 'samples': 4020288, 'steps': 20938, 'loss/train': 1.931329369544983} 11/07/2021 00:11:20 - INFO - __main__ - Step 20940: {'lr': 0.00048006834514566183, 'samples': 4020480, 'steps': 20939, 'loss/train': 1.0786707401275635} 11/07/2021 00:11:21 - INFO - __main__ - Step 20941: {'lr': 0.00048006626869214977, 'samples': 4020672, 'steps': 20940, 'loss/train': 1.6631247997283936} 11/07/2021 00:11:21 - INFO - __main__ - Step 20942: {'lr': 0.00048006419213497334, 'samples': 4020864, 'steps': 20941, 'loss/train': 1.8388252258300781} 11/07/2021 00:11:21 - INFO - __main__ - Step 20943: {'lr': 0.0004800621154741335, 'samples': 4021056, 'steps': 20942, 'loss/train': 3.2117514610290527} 11/07/2021 00:11:22 - INFO - __main__ - Step 20944: {'lr': 0.00048006003870963135, 'samples': 4021248, 'steps': 20943, 'loss/train': 1.8050594329833984} 11/07/2021 00:11:23 - INFO - __main__ - Step 20945: {'lr': 0.0004800579618414676, 'samples': 4021440, 'steps': 20944, 'loss/train': 1.1371890306472778} 11/07/2021 00:11:23 - INFO - __main__ - Step 20946: {'lr': 0.0004800558848696433, 'samples': 4021632, 'steps': 20945, 'loss/train': 1.728974461555481} 11/07/2021 00:11:23 - INFO - __main__ - Step 20947: {'lr': 0.0004800538077941594, 'samples': 4021824, 'steps': 20946, 'loss/train': 1.4030572175979614} 11/07/2021 00:11:24 - INFO - __main__ - Step 20948: {'lr': 0.00048005173061501673, 'samples': 4022016, 'steps': 20947, 'loss/train': 1.3526893854141235} 11/07/2021 00:11:25 - INFO - __main__ - Step 20949: {'lr': 0.0004800496533322164, 'samples': 4022208, 'steps': 20948, 'loss/train': 1.645173192024231} 11/07/2021 00:11:25 - INFO - __main__ - Step 20950: {'lr': 0.00048004757594575923, 'samples': 4022400, 'steps': 20949, 'loss/train': 1.7988004684448242} 11/07/2021 00:11:25 - INFO - __main__ - Step 20951: {'lr': 0.0004800454984556461, 'samples': 4022592, 'steps': 20950, 'loss/train': 1.3141534328460693} 11/07/2021 00:11:26 - INFO - __main__ - Step 20952: {'lr': 0.00048004342086187805, 'samples': 4022784, 'steps': 20951, 'loss/train': 1.546395182609558} 11/07/2021 00:11:26 - INFO - __main__ - Step 20953: {'lr': 0.000480041343164456, 'samples': 4022976, 'steps': 20952, 'loss/train': 1.7359364032745361} 11/07/2021 00:11:27 - INFO - __main__ - Step 20954: {'lr': 0.0004800392653633808, 'samples': 4023168, 'steps': 20953, 'loss/train': 1.3687946796417236} 11/07/2021 00:11:27 - INFO - __main__ - Step 20955: {'lr': 0.0004800371874586535, 'samples': 4023360, 'steps': 20954, 'loss/train': 1.8569425344467163} 11/07/2021 00:11:28 - INFO - __main__ - Step 20956: {'lr': 0.0004800351094502751, 'samples': 4023552, 'steps': 20955, 'loss/train': 1.5773091316223145} 11/07/2021 00:11:28 - INFO - __main__ - Step 20957: {'lr': 0.00048003303133824633, 'samples': 4023744, 'steps': 20956, 'loss/train': 1.7824643850326538} 11/07/2021 00:11:29 - INFO - __main__ - Step 20958: {'lr': 0.0004800309531225683, 'samples': 4023936, 'steps': 20957, 'loss/train': 1.6135205030441284} 11/07/2021 00:11:29 - INFO - __main__ - Step 20959: {'lr': 0.00048002887480324175, 'samples': 4024128, 'steps': 20958, 'loss/train': 1.5606435537338257} 11/07/2021 00:11:30 - INFO - __main__ - Step 20960: {'lr': 0.0004800267963802678, 'samples': 4024320, 'steps': 20959, 'loss/train': 1.4907593727111816} 11/07/2021 00:11:30 - INFO - __main__ - Step 20961: {'lr': 0.0004800247178536473, 'samples': 4024512, 'steps': 20960, 'loss/train': 1.7517757415771484} 11/07/2021 00:11:30 - INFO - __main__ - Step 20962: {'lr': 0.0004800226392233813, 'samples': 4024704, 'steps': 20961, 'loss/train': 1.6995420455932617} 11/07/2021 00:11:31 - INFO - __main__ - Step 20963: {'lr': 0.00048002056048947054, 'samples': 4024896, 'steps': 20962, 'loss/train': 1.690382957458496} 11/07/2021 00:11:32 - INFO - __main__ - Step 20964: {'lr': 0.0004800184816519161, 'samples': 4025088, 'steps': 20963, 'loss/train': 1.702330470085144} 11/07/2021 00:11:32 - INFO - __main__ - Step 20965: {'lr': 0.0004800164027107189, 'samples': 4025280, 'steps': 20964, 'loss/train': 1.4207866191864014} 11/07/2021 00:11:33 - INFO - __main__ - Step 20966: {'lr': 0.0004800143236658798, 'samples': 4025472, 'steps': 20965, 'loss/train': 1.6319961547851562} 11/07/2021 00:11:33 - INFO - __main__ - Step 20967: {'lr': 0.0004800122445173999, 'samples': 4025664, 'steps': 20966, 'loss/train': 1.5580840110778809} 11/07/2021 00:11:33 - INFO - __main__ - Step 20968: {'lr': 0.00048001016526528, 'samples': 4025856, 'steps': 20967, 'loss/train': 1.623712182044983} 11/07/2021 00:11:34 - INFO - __main__ - Step 20969: {'lr': 0.00048000808590952106, 'samples': 4026048, 'steps': 20968, 'loss/train': 1.501164197921753} 11/07/2021 00:11:35 - INFO - __main__ - Step 20970: {'lr': 0.0004800060064501239, 'samples': 4026240, 'steps': 20969, 'loss/train': 1.4817878007888794} 11/07/2021 00:11:35 - INFO - __main__ - Step 20971: {'lr': 0.00048000392688708976, 'samples': 4026432, 'steps': 20970, 'loss/train': 1.5886551141738892} 11/07/2021 00:11:35 - INFO - __main__ - Step 20972: {'lr': 0.00048000184722041934, 'samples': 4026624, 'steps': 20971, 'loss/train': 1.4988945722579956} 11/07/2021 00:11:36 - INFO - __main__ - Step 20973: {'lr': 0.00047999976745011366, 'samples': 4026816, 'steps': 20972, 'loss/train': 1.794215202331543} 11/07/2021 00:11:37 - INFO - __main__ - Step 20974: {'lr': 0.0004799976875761736, 'samples': 4027008, 'steps': 20973, 'loss/train': 1.4776352643966675} 11/07/2021 00:11:37 - INFO - __main__ - Step 20975: {'lr': 0.00047999560759860006, 'samples': 4027200, 'steps': 20974, 'loss/train': 1.5870134830474854} 11/07/2021 00:11:37 - INFO - __main__ - Step 20976: {'lr': 0.00047999352751739414, 'samples': 4027392, 'steps': 20975, 'loss/train': 1.6789779663085938} 11/07/2021 00:11:38 - INFO - __main__ - Step 20977: {'lr': 0.0004799914473325567, 'samples': 4027584, 'steps': 20976, 'loss/train': 1.4126200675964355} 11/07/2021 00:11:38 - INFO - __main__ - Step 20978: {'lr': 0.00047998936704408865, 'samples': 4027776, 'steps': 20977, 'loss/train': 1.6767990589141846} 11/07/2021 00:11:39 - INFO - __main__ - Step 20979: {'lr': 0.00047998728665199085, 'samples': 4027968, 'steps': 20978, 'loss/train': 1.7134089469909668} 11/07/2021 00:11:40 - INFO - __main__ - Step 20980: {'lr': 0.00047998520615626447, 'samples': 4028160, 'steps': 20979, 'loss/train': 1.2816898822784424} 11/07/2021 00:11:40 - INFO - __main__ - Step 20981: {'lr': 0.0004799831255569102, 'samples': 4028352, 'steps': 20980, 'loss/train': 1.301430106163025} 11/07/2021 00:11:40 - INFO - __main__ - Step 20982: {'lr': 0.00047998104485392915, 'samples': 4028544, 'steps': 20981, 'loss/train': 1.5153762102127075} 11/07/2021 00:11:41 - INFO - __main__ - Step 20983: {'lr': 0.0004799789640473221, 'samples': 4028736, 'steps': 20982, 'loss/train': 0.35239240527153015} 11/07/2021 00:11:41 - INFO - __main__ - Step 20984: {'lr': 0.0004799768831370902, 'samples': 4028928, 'steps': 20983, 'loss/train': 1.8679004907608032} 11/07/2021 00:11:42 - INFO - __main__ - Step 20985: {'lr': 0.0004799748021232342, 'samples': 4029120, 'steps': 20984, 'loss/train': 1.7212649583816528} 11/07/2021 00:11:42 - INFO - __main__ - Step 20986: {'lr': 0.00047997272100575505, 'samples': 4029312, 'steps': 20985, 'loss/train': 1.5796945095062256} 11/07/2021 00:11:43 - INFO - __main__ - Step 20987: {'lr': 0.00047997063978465383, 'samples': 4029504, 'steps': 20986, 'loss/train': 1.249659538269043} 11/07/2021 00:11:43 - INFO - __main__ - Step 20988: {'lr': 0.0004799685584599313, 'samples': 4029696, 'steps': 20987, 'loss/train': 1.1187705993652344} 11/07/2021 00:11:43 - INFO - __main__ - Step 20989: {'lr': 0.00047996647703158857, 'samples': 4029888, 'steps': 20988, 'loss/train': 1.584458827972412} 11/07/2021 00:11:45 - INFO - __main__ - Step 20990: {'lr': 0.00047996439549962647, 'samples': 4030080, 'steps': 20989, 'loss/train': 1.601413607597351} 11/07/2021 00:11:45 - INFO - __main__ - Step 20991: {'lr': 0.00047996231386404593, 'samples': 4030272, 'steps': 20990, 'loss/train': 1.417210578918457} 11/07/2021 00:11:45 - INFO - __main__ - Step 20992: {'lr': 0.00047996023212484797, 'samples': 4030464, 'steps': 20991, 'loss/train': 1.6309233903884888} 11/07/2021 00:11:46 - INFO - __main__ - Step 20993: {'lr': 0.00047995815028203346, 'samples': 4030656, 'steps': 20992, 'loss/train': 0.9504069685935974} 11/07/2021 00:11:46 - INFO - __main__ - Step 20994: {'lr': 0.00047995606833560337, 'samples': 4030848, 'steps': 20993, 'loss/train': 1.437409520149231} 11/07/2021 00:11:47 - INFO - __main__ - Step 20995: {'lr': 0.0004799539862855585, 'samples': 4031040, 'steps': 20994, 'loss/train': 1.4952812194824219} 11/07/2021 00:11:47 - INFO - __main__ - Step 20996: {'lr': 0.00047995190413190004, 'samples': 4031232, 'steps': 20995, 'loss/train': 1.3429986238479614} 11/07/2021 00:11:48 - INFO - __main__ - Step 20997: {'lr': 0.00047994982187462876, 'samples': 4031424, 'steps': 20996, 'loss/train': 1.1650660037994385} 11/07/2021 00:11:48 - INFO - __main__ - Step 20998: {'lr': 0.0004799477395137457, 'samples': 4031616, 'steps': 20997, 'loss/train': 1.3438471555709839} 11/07/2021 00:11:49 - INFO - __main__ - Step 20999: {'lr': 0.00047994565704925166, 'samples': 4031808, 'steps': 20998, 'loss/train': 0.818544328212738} 11/07/2021 00:11:50 - INFO - __main__ - Step 21000: {'lr': 0.0004799435744811477, 'samples': 4032000, 'steps': 20999, 'loss/train': 1.3654391765594482} 11/07/2021 00:11:50 - INFO - __main__ - Step 21001: {'lr': 0.0004799414918094347, 'samples': 4032192, 'steps': 21000, 'loss/train': 1.828909993171692} 11/07/2021 00:11:50 - INFO - __main__ - Step 21002: {'lr': 0.0004799394090341136, 'samples': 4032384, 'steps': 21001, 'loss/train': 1.7335715293884277} 11/07/2021 00:11:51 - INFO - __main__ - Step 21003: {'lr': 0.0004799373261551854, 'samples': 4032576, 'steps': 21002, 'loss/train': 1.3147293329238892} 11/07/2021 00:11:51 - INFO - __main__ - Step 21004: {'lr': 0.0004799352431726509, 'samples': 4032768, 'steps': 21003, 'loss/train': 1.6731096506118774} 11/07/2021 00:11:53 - INFO - __main__ - Step 21005: {'lr': 0.0004799331600865112, 'samples': 4032960, 'steps': 21004, 'loss/train': 1.3603119850158691} 11/07/2021 00:11:53 - INFO - __main__ - Step 21006: {'lr': 0.0004799310768967671, 'samples': 4033152, 'steps': 21005, 'loss/train': 1.5929734706878662} 11/07/2021 00:11:53 - INFO - __main__ - Step 21007: {'lr': 0.00047992899360341966, 'samples': 4033344, 'steps': 21006, 'loss/train': 1.6748112440109253} 11/07/2021 00:11:54 - INFO - __main__ - Step 21008: {'lr': 0.0004799269102064698, 'samples': 4033536, 'steps': 21007, 'loss/train': 1.620564579963684} 11/07/2021 00:11:54 - INFO - __main__ - Step 21009: {'lr': 0.0004799248267059183, 'samples': 4033728, 'steps': 21008, 'loss/train': 1.672453761100769} 11/07/2021 00:11:54 - INFO - __main__ - Step 21010: {'lr': 0.0004799227431017663, 'samples': 4033920, 'steps': 21009, 'loss/train': 1.5249580144882202} 11/07/2021 00:11:56 - INFO - __main__ - Step 21011: {'lr': 0.0004799206593940147, 'samples': 4034112, 'steps': 21010, 'loss/train': 4.485203742980957} 11/07/2021 00:11:56 - INFO - __main__ - Step 21012: {'lr': 0.0004799185755826644, 'samples': 4034304, 'steps': 21011, 'loss/train': 1.062135934829712} 11/07/2021 00:11:56 - INFO - __main__ - Step 21013: {'lr': 0.00047991649166771624, 'samples': 4034496, 'steps': 21012, 'loss/train': 1.410986065864563} 11/07/2021 00:11:57 - INFO - __main__ - Step 21014: {'lr': 0.00047991440764917127, 'samples': 4034688, 'steps': 21013, 'loss/train': 1.6308702230453491} 11/07/2021 00:11:57 - INFO - __main__ - Step 21015: {'lr': 0.0004799123235270305, 'samples': 4034880, 'steps': 21014, 'loss/train': 2.6511857509613037} 11/07/2021 00:11:58 - INFO - __main__ - Step 21016: {'lr': 0.0004799102393012947, 'samples': 4035072, 'steps': 21015, 'loss/train': 2.724604845046997} 11/07/2021 00:11:58 - INFO - __main__ - Step 21017: {'lr': 0.0004799081549719649, 'samples': 4035264, 'steps': 21016, 'loss/train': 1.653235673904419} 11/07/2021 00:11:59 - INFO - __main__ - Step 21018: {'lr': 0.0004799060705390421, 'samples': 4035456, 'steps': 21017, 'loss/train': 1.4333916902542114} 11/07/2021 00:11:59 - INFO - __main__ - Step 21019: {'lr': 0.00047990398600252713, 'samples': 4035648, 'steps': 21018, 'loss/train': 1.2776241302490234} 11/07/2021 00:12:00 - INFO - __main__ - Step 21020: {'lr': 0.00047990190136242103, 'samples': 4035840, 'steps': 21019, 'loss/train': 2.232898473739624} 11/07/2021 00:12:00 - INFO - __main__ - Step 21021: {'lr': 0.0004798998166187246, 'samples': 4036032, 'steps': 21020, 'loss/train': 1.4598809480667114} 11/07/2021 00:12:00 - INFO - __main__ - Step 21022: {'lr': 0.0004798977317714389, 'samples': 4036224, 'steps': 21021, 'loss/train': 1.3510398864746094} 11/07/2021 00:12:01 - INFO - __main__ - Step 21023: {'lr': 0.00047989564682056487, 'samples': 4036416, 'steps': 21022, 'loss/train': 2.0689451694488525} 11/07/2021 00:12:02 - INFO - __main__ - Step 21024: {'lr': 0.0004798935617661033, 'samples': 4036608, 'steps': 21023, 'loss/train': 1.3049253225326538} 11/07/2021 00:12:02 - INFO - __main__ - Step 21025: {'lr': 0.0004798914766080553, 'samples': 4036800, 'steps': 21024, 'loss/train': 2.5650393962860107} 11/07/2021 00:12:02 - INFO - __main__ - Step 21026: {'lr': 0.00047988939134642174, 'samples': 4036992, 'steps': 21025, 'loss/train': 1.9371496438980103} 11/07/2021 00:12:03 - INFO - __main__ - Step 21027: {'lr': 0.00047988730598120356, 'samples': 4037184, 'steps': 21026, 'loss/train': 1.7897228002548218} 11/07/2021 00:12:04 - INFO - __main__ - Step 21028: {'lr': 0.00047988522051240173, 'samples': 4037376, 'steps': 21027, 'loss/train': 1.6327519416809082} 11/07/2021 00:12:04 - INFO - __main__ - Step 21029: {'lr': 0.0004798831349400172, 'samples': 4037568, 'steps': 21028, 'loss/train': 4.53317403793335} 11/07/2021 00:12:05 - INFO - __main__ - Step 21030: {'lr': 0.0004798810492640508, 'samples': 4037760, 'steps': 21029, 'loss/train': 1.88721764087677} 11/07/2021 00:12:05 - INFO - __main__ - Step 21031: {'lr': 0.00047987896348450354, 'samples': 4037952, 'steps': 21030, 'loss/train': 1.8528162240982056} 11/07/2021 00:12:05 - INFO - __main__ - Step 21032: {'lr': 0.00047987687760137646, 'samples': 4038144, 'steps': 21031, 'loss/train': 0.6381519436836243} 11/07/2021 00:12:07 - INFO - __main__ - Step 21033: {'lr': 0.00047987479161467033, 'samples': 4038336, 'steps': 21032, 'loss/train': 1.6966015100479126} 11/07/2021 00:12:07 - INFO - __main__ - Step 21034: {'lr': 0.0004798727055243862, 'samples': 4038528, 'steps': 21033, 'loss/train': 1.0211305618286133} 11/07/2021 00:12:07 - INFO - __main__ - Step 21035: {'lr': 0.000479870619330525, 'samples': 4038720, 'steps': 21034, 'loss/train': 1.7424684762954712} 11/07/2021 00:12:08 - INFO - __main__ - Step 21036: {'lr': 0.0004798685330330876, 'samples': 4038912, 'steps': 21035, 'loss/train': 2.236529588699341} 11/07/2021 00:12:08 - INFO - __main__ - Step 21037: {'lr': 0.000479866446632075, 'samples': 4039104, 'steps': 21036, 'loss/train': 1.7629296779632568} 11/07/2021 00:12:09 - INFO - __main__ - Step 21038: {'lr': 0.00047986436012748815, 'samples': 4039296, 'steps': 21037, 'loss/train': 1.6419411897659302} 11/07/2021 00:12:09 - INFO - __main__ - Step 21039: {'lr': 0.00047986227351932785, 'samples': 4039488, 'steps': 21038, 'loss/train': 1.955509901046753} 11/07/2021 00:12:10 - INFO - __main__ - Step 21040: {'lr': 0.00047986018680759525, 'samples': 4039680, 'steps': 21039, 'loss/train': 1.8619760274887085} 11/07/2021 00:12:10 - INFO - __main__ - Step 21041: {'lr': 0.00047985809999229125, 'samples': 4039872, 'steps': 21040, 'loss/train': 1.6903468370437622} 11/07/2021 00:12:10 - INFO - __main__ - Step 21042: {'lr': 0.00047985601307341667, 'samples': 4040064, 'steps': 21041, 'loss/train': 1.6033693552017212} 11/07/2021 00:12:11 - INFO - __main__ - Step 21043: {'lr': 0.0004798539260509725, 'samples': 4040256, 'steps': 21042, 'loss/train': 1.5110589265823364} 11/07/2021 00:12:12 - INFO - __main__ - Step 21044: {'lr': 0.00047985183892495977, 'samples': 4040448, 'steps': 21043, 'loss/train': 1.7737886905670166} 11/07/2021 00:12:12 - INFO - __main__ - Step 21045: {'lr': 0.00047984975169537925, 'samples': 4040640, 'steps': 21044, 'loss/train': 2.1395554542541504} 11/07/2021 00:12:12 - INFO - __main__ - Step 21046: {'lr': 0.00047984766436223205, 'samples': 4040832, 'steps': 21045, 'loss/train': 1.7177172899246216} 11/07/2021 00:12:13 - INFO - __main__ - Step 21047: {'lr': 0.000479845576925519, 'samples': 4041024, 'steps': 21046, 'loss/train': 1.8919692039489746} 11/07/2021 00:12:13 - INFO - __main__ - Step 21048: {'lr': 0.00047984348938524113, 'samples': 4041216, 'steps': 21047, 'loss/train': 1.8316309452056885} 11/07/2021 00:12:14 - INFO - __main__ - Step 21049: {'lr': 0.00047984140174139926, 'samples': 4041408, 'steps': 21048, 'loss/train': 1.637800693511963} 11/07/2021 00:12:14 - INFO - __main__ - Step 21050: {'lr': 0.0004798393139939945, 'samples': 4041600, 'steps': 21049, 'loss/train': 1.1018575429916382} 11/07/2021 00:12:15 - INFO - __main__ - Step 21051: {'lr': 0.0004798372261430276, 'samples': 4041792, 'steps': 21050, 'loss/train': 1.6706420183181763} 11/07/2021 00:12:15 - INFO - __main__ - Step 21052: {'lr': 0.00047983513818849967, 'samples': 4041984, 'steps': 21051, 'loss/train': 1.7306580543518066} 11/07/2021 00:12:15 - INFO - __main__ - Step 21053: {'lr': 0.0004798330501304115, 'samples': 4042176, 'steps': 21052, 'loss/train': 1.6159660816192627} 11/07/2021 00:12:16 - INFO - __main__ - Step 21054: {'lr': 0.00047983096196876413, 'samples': 4042368, 'steps': 21053, 'loss/train': 1.8145487308502197} 11/07/2021 00:12:17 - INFO - __main__ - Step 21055: {'lr': 0.00047982887370355846, 'samples': 4042560, 'steps': 21054, 'loss/train': 0.9376941323280334} 11/07/2021 00:12:17 - INFO - __main__ - Step 21056: {'lr': 0.0004798267853347955, 'samples': 4042752, 'steps': 21055, 'loss/train': 1.15646493434906} 11/07/2021 00:12:17 - INFO - __main__ - Step 21057: {'lr': 0.0004798246968624761, 'samples': 4042944, 'steps': 21056, 'loss/train': 1.7298862934112549} 11/07/2021 00:12:18 - INFO - __main__ - Step 21058: {'lr': 0.00047982260828660124, 'samples': 4043136, 'steps': 21057, 'loss/train': 1.785570502281189} 11/07/2021 00:12:19 - INFO - __main__ - Step 21059: {'lr': 0.0004798205196071719, 'samples': 4043328, 'steps': 21058, 'loss/train': 1.3368200063705444} 11/07/2021 00:12:19 - INFO - __main__ - Step 21060: {'lr': 0.00047981843082418884, 'samples': 4043520, 'steps': 21059, 'loss/train': 1.938208818435669} 11/07/2021 00:12:20 - INFO - __main__ - Step 21061: {'lr': 0.0004798163419376533, 'samples': 4043712, 'steps': 21060, 'loss/train': 1.3998974561691284} 11/07/2021 00:12:20 - INFO - __main__ - Step 21062: {'lr': 0.00047981425294756595, 'samples': 4043904, 'steps': 21061, 'loss/train': 1.4993808269500732} 11/07/2021 00:12:20 - INFO - __main__ - Step 21063: {'lr': 0.00047981216385392796, 'samples': 4044096, 'steps': 21062, 'loss/train': 1.275914192199707} 11/07/2021 00:12:21 - INFO - __main__ - Step 21064: {'lr': 0.0004798100746567401, 'samples': 4044288, 'steps': 21063, 'loss/train': 1.8225252628326416} 11/07/2021 00:12:22 - INFO - __main__ - Step 21065: {'lr': 0.00047980798535600334, 'samples': 4044480, 'steps': 21064, 'loss/train': 2.1295764446258545} 11/07/2021 00:12:22 - INFO - __main__ - Step 21066: {'lr': 0.00047980589595171866, 'samples': 4044672, 'steps': 21065, 'loss/train': 1.8311939239501953} 11/07/2021 00:12:22 - INFO - __main__ - Step 21067: {'lr': 0.000479803806443887, 'samples': 4044864, 'steps': 21066, 'loss/train': 1.5671026706695557} 11/07/2021 00:12:23 - INFO - __main__ - Step 21068: {'lr': 0.0004798017168325093, 'samples': 4045056, 'steps': 21067, 'loss/train': 1.1478503942489624} 11/07/2021 00:12:24 - INFO - __main__ - Step 21069: {'lr': 0.0004797996271175865, 'samples': 4045248, 'steps': 21068, 'loss/train': 1.5180654525756836} 11/07/2021 00:12:24 - INFO - __main__ - Step 21070: {'lr': 0.00047979753729911944, 'samples': 4045440, 'steps': 21069, 'loss/train': 1.7661678791046143} 11/07/2021 00:12:24 - INFO - __main__ - Step 21071: {'lr': 0.00047979544737710925, 'samples': 4045632, 'steps': 21070, 'loss/train': 1.5353820323944092} 11/07/2021 00:12:25 - INFO - __main__ - Step 21072: {'lr': 0.00047979335735155677, 'samples': 4045824, 'steps': 21071, 'loss/train': 1.630466341972351} 11/07/2021 00:12:25 - INFO - __main__ - Step 21073: {'lr': 0.00047979126722246294, 'samples': 4046016, 'steps': 21072, 'loss/train': 1.4065769910812378} 11/07/2021 00:12:26 - INFO - __main__ - Step 21074: {'lr': 0.0004797891769898287, 'samples': 4046208, 'steps': 21073, 'loss/train': 1.6578621864318848} 11/07/2021 00:12:26 - INFO - __main__ - Step 21075: {'lr': 0.00047978708665365503, 'samples': 4046400, 'steps': 21074, 'loss/train': 1.570343017578125} 11/07/2021 00:12:27 - INFO - __main__ - Step 21076: {'lr': 0.0004797849962139428, 'samples': 4046592, 'steps': 21075, 'loss/train': 1.786704421043396} 11/07/2021 00:12:27 - INFO - __main__ - Step 21077: {'lr': 0.00047978290567069306, 'samples': 4046784, 'steps': 21076, 'loss/train': 2.6458792686462402} 11/07/2021 00:12:28 - INFO - __main__ - Step 21078: {'lr': 0.00047978081502390656, 'samples': 4046976, 'steps': 21077, 'loss/train': 0.9769932627677917} 11/07/2021 00:12:29 - INFO - __main__ - Step 21079: {'lr': 0.0004797787242735845, 'samples': 4047168, 'steps': 21078, 'loss/train': 1.5711239576339722} 11/07/2021 00:12:29 - INFO - __main__ - Step 21080: {'lr': 0.00047977663341972765, 'samples': 4047360, 'steps': 21079, 'loss/train': 2.6277008056640625} 11/07/2021 00:12:29 - INFO - __main__ - Step 21081: {'lr': 0.00047977454246233696, 'samples': 4047552, 'steps': 21080, 'loss/train': 1.6991426944732666} 11/07/2021 00:12:30 - INFO - __main__ - Step 21082: {'lr': 0.00047977245140141354, 'samples': 4047744, 'steps': 21081, 'loss/train': 2.114461660385132} 11/07/2021 00:12:30 - INFO - __main__ - Step 21083: {'lr': 0.00047977036023695807, 'samples': 4047936, 'steps': 21082, 'loss/train': 2.2551801204681396} 11/07/2021 00:12:30 - INFO - __main__ - Step 21084: {'lr': 0.00047976826896897165, 'samples': 4048128, 'steps': 21083, 'loss/train': 1.5625381469726562} 11/07/2021 00:12:31 - INFO - __main__ - Step 21085: {'lr': 0.0004797661775974552, 'samples': 4048320, 'steps': 21084, 'loss/train': 1.0512539148330688} 11/07/2021 00:12:32 - INFO - __main__ - Step 21086: {'lr': 0.00047976408612240964, 'samples': 4048512, 'steps': 21085, 'loss/train': 1.6367626190185547} 11/07/2021 00:12:32 - INFO - __main__ - Step 21087: {'lr': 0.00047976199454383595, 'samples': 4048704, 'steps': 21086, 'loss/train': 2.107619285583496} 11/07/2021 00:12:32 - INFO - __main__ - Step 21088: {'lr': 0.00047975990286173504, 'samples': 4048896, 'steps': 21087, 'loss/train': 1.7704498767852783} 11/07/2021 00:12:33 - INFO - __main__ - Step 21089: {'lr': 0.00047975781107610784, 'samples': 4049088, 'steps': 21088, 'loss/train': 1.409651279449463} 11/07/2021 00:12:34 - INFO - __main__ - Step 21090: {'lr': 0.0004797557191869554, 'samples': 4049280, 'steps': 21089, 'loss/train': 1.722057819366455} 11/07/2021 00:12:34 - INFO - __main__ - Step 21091: {'lr': 0.0004797536271942785, 'samples': 4049472, 'steps': 21090, 'loss/train': 0.3714536726474762} 11/07/2021 00:12:35 - INFO - __main__ - Step 21092: {'lr': 0.00047975153509807815, 'samples': 4049664, 'steps': 21091, 'loss/train': 1.37582266330719} 11/07/2021 00:12:35 - INFO - __main__ - Step 21093: {'lr': 0.0004797494428983553, 'samples': 4049856, 'steps': 21092, 'loss/train': 2.382824182510376} 11/07/2021 00:12:35 - INFO - __main__ - Step 21094: {'lr': 0.000479747350595111, 'samples': 4050048, 'steps': 21093, 'loss/train': 1.4000967741012573} 11/07/2021 00:12:36 - INFO - __main__ - Step 21095: {'lr': 0.00047974525818834604, 'samples': 4050240, 'steps': 21094, 'loss/train': 1.4624602794647217} 11/07/2021 00:12:37 - INFO - __main__ - Step 21096: {'lr': 0.0004797431656780613, 'samples': 4050432, 'steps': 21095, 'loss/train': 1.8797059059143066} 11/07/2021 00:12:37 - INFO - __main__ - Step 21097: {'lr': 0.000479741073064258, 'samples': 4050624, 'steps': 21096, 'loss/train': 1.8052655458450317} 11/07/2021 00:12:37 - INFO - __main__ - Step 21098: {'lr': 0.0004797389803469369, 'samples': 4050816, 'steps': 21097, 'loss/train': 1.7806404829025269} 11/07/2021 00:12:38 - INFO - __main__ - Step 21099: {'lr': 0.0004797368875260988, 'samples': 4051008, 'steps': 21098, 'loss/train': 1.817021131515503} 11/07/2021 00:12:39 - INFO - __main__ - Step 21100: {'lr': 0.00047973479460174497, 'samples': 4051200, 'steps': 21099, 'loss/train': 1.438368320465088} 11/07/2021 00:12:39 - INFO - __main__ - Step 21101: {'lr': 0.00047973270157387605, 'samples': 4051392, 'steps': 21100, 'loss/train': 1.0588150024414062} 11/07/2021 00:12:39 - INFO - __main__ - Step 21102: {'lr': 0.0004797306084424932, 'samples': 4051584, 'steps': 21101, 'loss/train': 1.283268690109253} 11/07/2021 00:12:40 - INFO - __main__ - Step 21103: {'lr': 0.0004797285152075973, 'samples': 4051776, 'steps': 21102, 'loss/train': 1.7230778932571411} 11/07/2021 00:12:40 - INFO - __main__ - Step 21104: {'lr': 0.00047972642186918925, 'samples': 4051968, 'steps': 21103, 'loss/train': 1.8024054765701294} 11/07/2021 00:12:41 - INFO - __main__ - Step 21105: {'lr': 0.00047972432842727003, 'samples': 4052160, 'steps': 21104, 'loss/train': 1.590198278427124} 11/07/2021 00:12:41 - INFO - __main__ - Step 21106: {'lr': 0.0004797222348818405, 'samples': 4052352, 'steps': 21105, 'loss/train': 1.9167371988296509} 11/07/2021 00:12:42 - INFO - __main__ - Step 21107: {'lr': 0.00047972014123290183, 'samples': 4052544, 'steps': 21106, 'loss/train': 1.5365687608718872} 11/07/2021 00:12:42 - INFO - __main__ - Step 21108: {'lr': 0.00047971804748045464, 'samples': 4052736, 'steps': 21107, 'loss/train': 1.8305330276489258} 11/07/2021 00:12:43 - INFO - __main__ - Step 21109: {'lr': 0.00047971595362450014, 'samples': 4052928, 'steps': 21108, 'loss/train': 1.5538760423660278} 11/07/2021 00:12:43 - INFO - __main__ - Step 21110: {'lr': 0.00047971385966503923, 'samples': 4053120, 'steps': 21109, 'loss/train': 1.6489356756210327} 11/07/2021 00:12:44 - INFO - __main__ - Step 21111: {'lr': 0.0004797117656020727, 'samples': 4053312, 'steps': 21110, 'loss/train': 1.8342883586883545} 11/07/2021 00:12:44 - INFO - __main__ - Step 21112: {'lr': 0.0004797096714356016, 'samples': 4053504, 'steps': 21111, 'loss/train': 1.6683157682418823} 11/07/2021 00:12:45 - INFO - __main__ - Step 21113: {'lr': 0.0004797075771656269, 'samples': 4053696, 'steps': 21112, 'loss/train': 1.3306695222854614} 11/07/2021 00:12:45 - INFO - __main__ - Step 21114: {'lr': 0.0004797054827921495, 'samples': 4053888, 'steps': 21113, 'loss/train': 0.9768213629722595} 11/07/2021 00:12:45 - INFO - __main__ - Step 21115: {'lr': 0.0004797033883151703, 'samples': 4054080, 'steps': 21114, 'loss/train': 1.3049585819244385} 11/07/2021 00:12:46 - INFO - __main__ - Step 21116: {'lr': 0.0004797012937346904, 'samples': 4054272, 'steps': 21115, 'loss/train': 0.9799783229827881} 11/07/2021 00:12:47 - INFO - __main__ - Step 21117: {'lr': 0.0004796991990507106, 'samples': 4054464, 'steps': 21116, 'loss/train': 1.0931177139282227} 11/07/2021 00:12:47 - INFO - __main__ - Step 21118: {'lr': 0.00047969710426323185, 'samples': 4054656, 'steps': 21117, 'loss/train': 1.5874521732330322} 11/07/2021 00:12:47 - INFO - __main__ - Step 21119: {'lr': 0.0004796950093722552, 'samples': 4054848, 'steps': 21118, 'loss/train': 1.7003940343856812} 11/07/2021 00:12:48 - INFO - __main__ - Step 21120: {'lr': 0.00047969291437778143, 'samples': 4055040, 'steps': 21119, 'loss/train': 1.7281981706619263} 11/07/2021 00:12:49 - INFO - __main__ - Step 21121: {'lr': 0.00047969081927981165, 'samples': 4055232, 'steps': 21120, 'loss/train': 1.3604600429534912} 11/07/2021 00:12:49 - INFO - __main__ - Step 21122: {'lr': 0.0004796887240783467, 'samples': 4055424, 'steps': 21121, 'loss/train': 0.33510664105415344} 11/07/2021 00:12:50 - INFO - __main__ - Step 21123: {'lr': 0.0004796866287733875, 'samples': 4055616, 'steps': 21122, 'loss/train': 1.2340506315231323} 11/07/2021 00:12:50 - INFO - __main__ - Step 21124: {'lr': 0.0004796845333649352, 'samples': 4055808, 'steps': 21123, 'loss/train': 1.552304744720459} 11/07/2021 00:12:51 - INFO - __main__ - Step 21125: {'lr': 0.00047968243785299046, 'samples': 4056000, 'steps': 21124, 'loss/train': 1.78187894821167} 11/07/2021 00:12:51 - INFO - __main__ - Step 21126: {'lr': 0.0004796803422375544, 'samples': 4056192, 'steps': 21125, 'loss/train': 1.5677958726882935} 11/07/2021 00:12:52 - INFO - __main__ - Step 21127: {'lr': 0.0004796782465186279, 'samples': 4056384, 'steps': 21126, 'loss/train': 0.20881569385528564} 11/07/2021 00:12:52 - INFO - __main__ - Step 21128: {'lr': 0.00047967615069621197, 'samples': 4056576, 'steps': 21127, 'loss/train': 1.6353904008865356} 11/07/2021 00:12:53 - INFO - __main__ - Step 21129: {'lr': 0.0004796740547703075, 'samples': 4056768, 'steps': 21128, 'loss/train': 1.5348048210144043} 11/07/2021 00:12:53 - INFO - __main__ - Step 21130: {'lr': 0.00047967195874091547, 'samples': 4056960, 'steps': 21129, 'loss/train': 1.6365200281143188} 11/07/2021 00:12:53 - INFO - __main__ - Step 21131: {'lr': 0.00047966986260803676, 'samples': 4057152, 'steps': 21130, 'loss/train': 1.6884819269180298} 11/07/2021 00:12:54 - INFO - __main__ - Step 21132: {'lr': 0.0004796677663716723, 'samples': 4057344, 'steps': 21131, 'loss/train': 1.2197633981704712} 11/07/2021 00:12:55 - INFO - __main__ - Step 21133: {'lr': 0.00047966567003182315, 'samples': 4057536, 'steps': 21132, 'loss/train': 1.3943464756011963} 11/07/2021 00:12:55 - INFO - __main__ - Step 21134: {'lr': 0.0004796635735884902, 'samples': 4057728, 'steps': 21133, 'loss/train': 1.5279340744018555} 11/07/2021 00:12:56 - INFO - __main__ - Step 21135: {'lr': 0.0004796614770416744, 'samples': 4057920, 'steps': 21134, 'loss/train': 1.308184027671814} 11/07/2021 00:12:56 - INFO - __main__ - Step 21136: {'lr': 0.00047965938039137666, 'samples': 4058112, 'steps': 21135, 'loss/train': 1.3822206258773804} 11/07/2021 00:12:57 - INFO - __main__ - Step 21137: {'lr': 0.000479657283637598, 'samples': 4058304, 'steps': 21136, 'loss/train': 1.382985234260559} 11/07/2021 00:12:57 - INFO - __main__ - Step 21138: {'lr': 0.00047965518678033924, 'samples': 4058496, 'steps': 21137, 'loss/train': 1.4995598793029785} 11/07/2021 00:12:57 - INFO - __main__ - Step 21139: {'lr': 0.00047965308981960143, 'samples': 4058688, 'steps': 21138, 'loss/train': 1.426271915435791} 11/07/2021 00:12:58 - INFO - __main__ - Step 21140: {'lr': 0.0004796509927553854, 'samples': 4058880, 'steps': 21139, 'loss/train': 1.2639167308807373} 11/07/2021 00:12:58 - INFO - __main__ - Step 21141: {'lr': 0.00047964889558769233, 'samples': 4059072, 'steps': 21140, 'loss/train': 1.5699936151504517} 11/07/2021 00:12:59 - INFO - __main__ - Step 21142: {'lr': 0.00047964679831652294, 'samples': 4059264, 'steps': 21141, 'loss/train': 1.2318005561828613} 11/07/2021 00:13:00 - INFO - __main__ - Step 21143: {'lr': 0.00047964470094187815, 'samples': 4059456, 'steps': 21142, 'loss/train': 0.17193424701690674} 11/07/2021 00:13:00 - INFO - __main__ - Step 21144: {'lr': 0.0004796426034637591, 'samples': 4059648, 'steps': 21143, 'loss/train': 1.6715084314346313} 11/07/2021 00:13:00 - INFO - __main__ - Step 21145: {'lr': 0.0004796405058821666, 'samples': 4059840, 'steps': 21144, 'loss/train': 1.1046626567840576} 11/07/2021 00:13:01 - INFO - __main__ - Step 21146: {'lr': 0.0004796384081971017, 'samples': 4060032, 'steps': 21145, 'loss/train': 1.7059005498886108} 11/07/2021 00:13:01 - INFO - __main__ - Step 21147: {'lr': 0.0004796363104085652, 'samples': 4060224, 'steps': 21146, 'loss/train': 1.8541324138641357} 11/07/2021 00:13:02 - INFO - __main__ - Step 21148: {'lr': 0.00047963421251655817, 'samples': 4060416, 'steps': 21147, 'loss/train': 1.7760694026947021} 11/07/2021 00:13:02 - INFO - __main__ - Step 21149: {'lr': 0.00047963211452108144, 'samples': 4060608, 'steps': 21148, 'loss/train': 1.298509955406189} 11/07/2021 00:13:03 - INFO - __main__ - Step 21150: {'lr': 0.0004796300164221361, 'samples': 4060800, 'steps': 21149, 'loss/train': 1.3676725625991821} 11/07/2021 00:13:03 - INFO - __main__ - Step 21151: {'lr': 0.00047962791821972296, 'samples': 4060992, 'steps': 21150, 'loss/train': 1.5314453840255737} 11/07/2021 00:13:03 - INFO - __main__ - Step 21152: {'lr': 0.00047962581991384305, 'samples': 4061184, 'steps': 21151, 'loss/train': 1.3802883625030518} 11/07/2021 00:13:05 - INFO - __main__ - Step 21153: {'lr': 0.0004796237215044973, 'samples': 4061376, 'steps': 21152, 'loss/train': 0.8671078085899353} 11/07/2021 00:13:05 - INFO - __main__ - Step 21154: {'lr': 0.0004796216229916867, 'samples': 4061568, 'steps': 21153, 'loss/train': 2.082179307937622} 11/07/2021 00:13:05 - INFO - __main__ - Step 21155: {'lr': 0.000479619524375412, 'samples': 4061760, 'steps': 21154, 'loss/train': 1.5132710933685303} 11/07/2021 00:13:06 - INFO - __main__ - Step 21156: {'lr': 0.0004796174256556744, 'samples': 4061952, 'steps': 21155, 'loss/train': 1.778990626335144} 11/07/2021 00:13:06 - INFO - __main__ - Step 21157: {'lr': 0.0004796153268324747, 'samples': 4062144, 'steps': 21156, 'loss/train': 2.0939102172851562} 11/07/2021 00:13:07 - INFO - __main__ - Step 21158: {'lr': 0.00047961322790581384, 'samples': 4062336, 'steps': 21157, 'loss/train': 1.0755528211593628} 11/07/2021 00:13:07 - INFO - __main__ - Step 21159: {'lr': 0.00047961112887569285, 'samples': 4062528, 'steps': 21158, 'loss/train': 1.004090666770935} 11/07/2021 00:13:08 - INFO - __main__ - Step 21160: {'lr': 0.0004796090297421126, 'samples': 4062720, 'steps': 21159, 'loss/train': 1.5249578952789307} 11/07/2021 00:13:08 - INFO - __main__ - Step 21161: {'lr': 0.0004796069305050741, 'samples': 4062912, 'steps': 21160, 'loss/train': 1.7508039474487305} 11/07/2021 00:13:08 - INFO - __main__ - Step 21162: {'lr': 0.0004796048311645782, 'samples': 4063104, 'steps': 21161, 'loss/train': 1.584632396697998} 11/07/2021 00:13:09 - INFO - __main__ - Step 21163: {'lr': 0.00047960273172062596, 'samples': 4063296, 'steps': 21162, 'loss/train': 0.974990725517273} 11/07/2021 00:13:10 - INFO - __main__ - Step 21164: {'lr': 0.00047960063217321824, 'samples': 4063488, 'steps': 21163, 'loss/train': 0.8087447285652161} 11/07/2021 00:13:10 - INFO - __main__ - Step 21165: {'lr': 0.0004795985325223561, 'samples': 4063680, 'steps': 21164, 'loss/train': 0.8941444158554077} 11/07/2021 00:13:10 - INFO - __main__ - Step 21166: {'lr': 0.00047959643276804026, 'samples': 4063872, 'steps': 21165, 'loss/train': 2.0346102714538574} 11/07/2021 00:13:11 - INFO - __main__ - Step 21167: {'lr': 0.0004795943329102719, 'samples': 4064064, 'steps': 21166, 'loss/train': 1.8948001861572266} 11/07/2021 00:13:11 - INFO - __main__ - Step 21168: {'lr': 0.00047959223294905185, 'samples': 4064256, 'steps': 21167, 'loss/train': 1.3930824995040894} 11/07/2021 00:13:12 - INFO - __main__ - Step 21169: {'lr': 0.00047959013288438113, 'samples': 4064448, 'steps': 21168, 'loss/train': 2.1027698516845703} 11/07/2021 00:13:13 - INFO - __main__ - Step 21170: {'lr': 0.0004795880327162606, 'samples': 4064640, 'steps': 21169, 'loss/train': 1.8361570835113525} 11/07/2021 00:13:13 - INFO - __main__ - Step 21171: {'lr': 0.0004795859324446912, 'samples': 4064832, 'steps': 21170, 'loss/train': 0.1968582719564438} 11/07/2021 00:13:13 - INFO - __main__ - Step 21172: {'lr': 0.000479583832069674, 'samples': 4065024, 'steps': 21171, 'loss/train': 1.953822374343872} 11/07/2021 00:13:14 - INFO - __main__ - Step 21173: {'lr': 0.00047958173159120984, 'samples': 4065216, 'steps': 21172, 'loss/train': 1.3294357061386108} 11/07/2021 00:13:15 - INFO - __main__ - Step 21174: {'lr': 0.0004795796310092997, 'samples': 4065408, 'steps': 21173, 'loss/train': 1.368573546409607} 11/07/2021 00:13:15 - INFO - __main__ - Step 21175: {'lr': 0.00047957753032394445, 'samples': 4065600, 'steps': 21174, 'loss/train': 1.3646577596664429} 11/07/2021 00:13:15 - INFO - __main__ - Step 21176: {'lr': 0.00047957542953514523, 'samples': 4065792, 'steps': 21175, 'loss/train': 1.5818638801574707} 11/07/2021 00:13:16 - INFO - __main__ - Step 21177: {'lr': 0.00047957332864290283, 'samples': 4065984, 'steps': 21176, 'loss/train': 1.4539034366607666} 11/07/2021 00:13:16 - INFO - __main__ - Step 21178: {'lr': 0.00047957122764721817, 'samples': 4066176, 'steps': 21177, 'loss/train': 1.3946402072906494} 11/07/2021 00:13:17 - INFO - __main__ - Step 21179: {'lr': 0.00047956912654809227, 'samples': 4066368, 'steps': 21178, 'loss/train': 1.720708966255188} 11/07/2021 00:13:18 - INFO - __main__ - Step 21180: {'lr': 0.0004795670253455261, 'samples': 4066560, 'steps': 21179, 'loss/train': 1.664499044418335} 11/07/2021 00:13:18 - INFO - __main__ - Step 21181: {'lr': 0.00047956492403952055, 'samples': 4066752, 'steps': 21180, 'loss/train': 1.7864108085632324} 11/07/2021 00:13:18 - INFO - __main__ - Step 21182: {'lr': 0.00047956282263007663, 'samples': 4066944, 'steps': 21181, 'loss/train': 2.069216728210449} 11/07/2021 00:13:19 - INFO - __main__ - Step 21183: {'lr': 0.00047956072111719517, 'samples': 4067136, 'steps': 21182, 'loss/train': 2.2219934463500977} 11/07/2021 00:13:20 - INFO - __main__ - Step 21184: {'lr': 0.00047955861950087724, 'samples': 4067328, 'steps': 21183, 'loss/train': 1.7120684385299683} 11/07/2021 00:13:20 - INFO - __main__ - Step 21185: {'lr': 0.00047955651778112376, 'samples': 4067520, 'steps': 21184, 'loss/train': 1.7454752922058105} 11/07/2021 00:13:20 - INFO - __main__ - Step 21186: {'lr': 0.00047955441595793556, 'samples': 4067712, 'steps': 21185, 'loss/train': 1.556066632270813} 11/07/2021 00:13:21 - INFO - __main__ - Step 21187: {'lr': 0.0004795523140313138, 'samples': 4067904, 'steps': 21186, 'loss/train': 1.2820478677749634} 11/07/2021 00:13:21 - INFO - __main__ - Step 21188: {'lr': 0.00047955021200125924, 'samples': 4068096, 'steps': 21187, 'loss/train': 1.2648670673370361} 11/07/2021 00:13:22 - INFO - __main__ - Step 21189: {'lr': 0.0004795481098677729, 'samples': 4068288, 'steps': 21188, 'loss/train': 1.7958683967590332} 11/07/2021 00:13:22 - INFO - __main__ - Step 21190: {'lr': 0.00047954600763085577, 'samples': 4068480, 'steps': 21189, 'loss/train': 1.792773723602295} 11/07/2021 00:13:23 - INFO - __main__ - Step 21191: {'lr': 0.0004795439052905087, 'samples': 4068672, 'steps': 21190, 'loss/train': 1.736327052116394} 11/07/2021 00:13:23 - INFO - __main__ - Step 21192: {'lr': 0.0004795418028467327, 'samples': 4068864, 'steps': 21191, 'loss/train': 1.8518633842468262} 11/07/2021 00:13:23 - INFO - __main__ - Step 21193: {'lr': 0.0004795397002995288, 'samples': 4069056, 'steps': 21192, 'loss/train': 1.6165372133255005} 11/07/2021 00:13:24 - INFO - __main__ - Step 21194: {'lr': 0.0004795375976488977, 'samples': 4069248, 'steps': 21193, 'loss/train': 2.0459771156311035} 11/07/2021 00:13:25 - INFO - __main__ - Step 21195: {'lr': 0.00047953549489484056, 'samples': 4069440, 'steps': 21194, 'loss/train': 1.7422749996185303} 11/07/2021 00:13:25 - INFO - __main__ - Step 21196: {'lr': 0.0004795333920373583, 'samples': 4069632, 'steps': 21195, 'loss/train': 1.8609914779663086} 11/07/2021 00:13:25 - INFO - __main__ - Step 21197: {'lr': 0.00047953128907645185, 'samples': 4069824, 'steps': 21196, 'loss/train': 1.623853087425232} 11/07/2021 00:13:26 - INFO - __main__ - Step 21198: {'lr': 0.000479529186012122, 'samples': 4070016, 'steps': 21197, 'loss/train': 1.546166181564331} 11/07/2021 00:13:26 - INFO - __main__ - Step 21199: {'lr': 0.00047952708284437, 'samples': 4070208, 'steps': 21198, 'loss/train': 1.3570958375930786} 11/07/2021 00:13:27 - INFO - __main__ - Step 21200: {'lr': 0.0004795249795731966, 'samples': 4070400, 'steps': 21199, 'loss/train': 1.755671739578247} 11/07/2021 00:13:28 - INFO - __main__ - Step 21201: {'lr': 0.00047952287619860273, 'samples': 4070592, 'steps': 21200, 'loss/train': 1.8419883251190186} 11/07/2021 00:13:28 - INFO - __main__ - Step 21202: {'lr': 0.0004795207727205895, 'samples': 4070784, 'steps': 21201, 'loss/train': 2.0788772106170654} 11/07/2021 00:13:28 - INFO - __main__ - Step 21203: {'lr': 0.00047951866913915767, 'samples': 4070976, 'steps': 21202, 'loss/train': 1.6653965711593628} 11/07/2021 00:13:29 - INFO - __main__ - Step 21204: {'lr': 0.0004795165654543082, 'samples': 4071168, 'steps': 21203, 'loss/train': 2.297816514968872} 11/07/2021 00:13:30 - INFO - __main__ - Step 21205: {'lr': 0.0004795144616660422, 'samples': 4071360, 'steps': 21204, 'loss/train': 1.7352765798568726} 11/07/2021 00:13:30 - INFO - __main__ - Step 21206: {'lr': 0.0004795123577743605, 'samples': 4071552, 'steps': 21205, 'loss/train': 0.8612052798271179} 11/07/2021 00:13:30 - INFO - __main__ - Step 21207: {'lr': 0.0004795102537792641, 'samples': 4071744, 'steps': 21206, 'loss/train': 1.7479983568191528} 11/07/2021 00:13:31 - INFO - __main__ - Step 21208: {'lr': 0.000479508149680754, 'samples': 4071936, 'steps': 21207, 'loss/train': 1.8053940534591675} 11/07/2021 00:13:31 - INFO - __main__ - Step 21209: {'lr': 0.0004795060454788309, 'samples': 4072128, 'steps': 21208, 'loss/train': 1.6106544733047485} 11/07/2021 00:13:32 - INFO - __main__ - Step 21210: {'lr': 0.000479503941173496, 'samples': 4072320, 'steps': 21209, 'loss/train': 2.015735387802124} 11/07/2021 00:13:33 - INFO - __main__ - Step 21211: {'lr': 0.0004795018367647501, 'samples': 4072512, 'steps': 21210, 'loss/train': 1.5669276714324951} 11/07/2021 00:13:33 - INFO - __main__ - Step 21212: {'lr': 0.0004794997322525944, 'samples': 4072704, 'steps': 21211, 'loss/train': 0.42870834469795227} 11/07/2021 00:13:33 - INFO - __main__ - Step 21213: {'lr': 0.0004794976276370295, 'samples': 4072896, 'steps': 21212, 'loss/train': 1.104694128036499} 11/07/2021 00:13:34 - INFO - __main__ - Step 21214: {'lr': 0.00047949552291805654, 'samples': 4073088, 'steps': 21213, 'loss/train': 1.143908977508545} 11/07/2021 00:13:35 - INFO - __main__ - Step 21215: {'lr': 0.0004794934180956764, 'samples': 4073280, 'steps': 21214, 'loss/train': 1.7211107015609741} 11/07/2021 00:13:35 - INFO - __main__ - Step 21216: {'lr': 0.00047949131316989016, 'samples': 4073472, 'steps': 21215, 'loss/train': 1.2839972972869873} 11/07/2021 00:13:35 - INFO - __main__ - Step 21217: {'lr': 0.0004794892081406986, 'samples': 4073664, 'steps': 21216, 'loss/train': 1.9193248748779297} 11/07/2021 00:13:36 - INFO - __main__ - Step 21218: {'lr': 0.00047948710300810276, 'samples': 4073856, 'steps': 21217, 'loss/train': 1.1737147569656372} 11/07/2021 00:13:36 - INFO - __main__ - Step 21219: {'lr': 0.0004794849977721036, 'samples': 4074048, 'steps': 21218, 'loss/train': 1.3072047233581543} 11/07/2021 00:13:37 - INFO - __main__ - Step 21220: {'lr': 0.00047948289243270205, 'samples': 4074240, 'steps': 21219, 'loss/train': 2.3479273319244385} 11/07/2021 00:13:38 - INFO - __main__ - Step 21221: {'lr': 0.000479480786989899, 'samples': 4074432, 'steps': 21220, 'loss/train': 1.474902868270874} 11/07/2021 00:13:38 - INFO - __main__ - Step 21222: {'lr': 0.0004794786814436955, 'samples': 4074624, 'steps': 21221, 'loss/train': 0.7210199236869812} 11/07/2021 00:13:38 - INFO - __main__ - Step 21223: {'lr': 0.0004794765757940924, 'samples': 4074816, 'steps': 21222, 'loss/train': 1.7698161602020264} 11/07/2021 00:13:39 - INFO - __main__ - Step 21224: {'lr': 0.00047947447004109066, 'samples': 4075008, 'steps': 21223, 'loss/train': 1.5735344886779785} 11/07/2021 00:13:40 - INFO - __main__ - Step 21225: {'lr': 0.0004794723641846914, 'samples': 4075200, 'steps': 21224, 'loss/train': 2.287526845932007} 11/07/2021 00:13:40 - INFO - __main__ - Step 21226: {'lr': 0.0004794702582248953, 'samples': 4075392, 'steps': 21225, 'loss/train': 1.3349794149398804} 11/07/2021 00:13:40 - INFO - __main__ - Step 21227: {'lr': 0.0004794681521617035, 'samples': 4075584, 'steps': 21226, 'loss/train': 1.7850278615951538} 11/07/2021 00:13:41 - INFO - __main__ - Step 21228: {'lr': 0.0004794660459951169, 'samples': 4075776, 'steps': 21227, 'loss/train': 1.253070592880249} 11/07/2021 00:13:41 - INFO - __main__ - Step 21229: {'lr': 0.0004794639397251365, 'samples': 4075968, 'steps': 21228, 'loss/train': 1.8737459182739258} 11/07/2021 00:13:42 - INFO - __main__ - Step 21230: {'lr': 0.00047946183335176307, 'samples': 4076160, 'steps': 21229, 'loss/train': 1.4230912923812866} 11/07/2021 00:13:42 - INFO - __main__ - Step 21231: {'lr': 0.00047945972687499775, 'samples': 4076352, 'steps': 21230, 'loss/train': 1.8137074708938599} 11/07/2021 00:13:43 - INFO - __main__ - Step 21232: {'lr': 0.0004794576202948414, 'samples': 4076544, 'steps': 21231, 'loss/train': 1.383471131324768} 11/07/2021 00:13:43 - INFO - __main__ - Step 21233: {'lr': 0.000479455513611295, 'samples': 4076736, 'steps': 21232, 'loss/train': 1.8926209211349487} 11/07/2021 00:13:43 - INFO - __main__ - Step 21234: {'lr': 0.00047945340682435943, 'samples': 4076928, 'steps': 21233, 'loss/train': 1.588423252105713} 11/07/2021 00:13:44 - INFO - __main__ - Step 21235: {'lr': 0.00047945129993403577, 'samples': 4077120, 'steps': 21234, 'loss/train': 1.1220974922180176} 11/07/2021 00:13:45 - INFO - __main__ - Step 21236: {'lr': 0.00047944919294032486, 'samples': 4077312, 'steps': 21235, 'loss/train': 1.501308560371399} 11/07/2021 00:13:45 - INFO - __main__ - Step 21237: {'lr': 0.00047944708584322763, 'samples': 4077504, 'steps': 21236, 'loss/train': 1.2461358308792114} 11/07/2021 00:13:45 - INFO - __main__ - Step 21238: {'lr': 0.00047944497864274517, 'samples': 4077696, 'steps': 21237, 'loss/train': 2.0483508110046387} 11/07/2021 00:13:46 - INFO - __main__ - Step 21239: {'lr': 0.00047944287133887834, 'samples': 4077888, 'steps': 21238, 'loss/train': 1.5501058101654053} 11/07/2021 00:13:47 - INFO - __main__ - Step 21240: {'lr': 0.00047944076393162806, 'samples': 4078080, 'steps': 21239, 'loss/train': 1.4833170175552368} 11/07/2021 00:13:47 - INFO - __main__ - Step 21241: {'lr': 0.00047943865642099525, 'samples': 4078272, 'steps': 21240, 'loss/train': 1.667752981185913} 11/07/2021 00:13:48 - INFO - __main__ - Step 21242: {'lr': 0.00047943654880698106, 'samples': 4078464, 'steps': 21241, 'loss/train': 2.3272855281829834} 11/07/2021 00:13:48 - INFO - __main__ - Step 21243: {'lr': 0.00047943444108958623, 'samples': 4078656, 'steps': 21242, 'loss/train': 3.5779306888580322} 11/07/2021 00:13:48 - INFO - __main__ - Step 21244: {'lr': 0.00047943233326881176, 'samples': 4078848, 'steps': 21243, 'loss/train': 1.0798635482788086} 11/07/2021 00:13:49 - INFO - __main__ - Step 21245: {'lr': 0.00047943022534465866, 'samples': 4079040, 'steps': 21244, 'loss/train': 1.2242178916931152} 11/07/2021 00:13:50 - INFO - __main__ - Step 21246: {'lr': 0.00047942811731712775, 'samples': 4079232, 'steps': 21245, 'loss/train': 1.6346696615219116} 11/07/2021 00:13:50 - INFO - __main__ - Step 21247: {'lr': 0.0004794260091862202, 'samples': 4079424, 'steps': 21246, 'loss/train': 1.1817291975021362} 11/07/2021 00:13:51 - INFO - __main__ - Step 21248: {'lr': 0.0004794239009519368, 'samples': 4079616, 'steps': 21247, 'loss/train': 1.7059050798416138} 11/07/2021 00:13:51 - INFO - __main__ - Step 21249: {'lr': 0.00047942179261427847, 'samples': 4079808, 'steps': 21248, 'loss/train': 1.453378438949585} 11/07/2021 00:13:51 - INFO - __main__ - Step 21250: {'lr': 0.0004794196841732463, 'samples': 4080000, 'steps': 21249, 'loss/train': 2.1280789375305176} 11/07/2021 00:13:52 - INFO - __main__ - Step 21251: {'lr': 0.0004794175756288411, 'samples': 4080192, 'steps': 21250, 'loss/train': 1.54837965965271} 11/07/2021 00:13:53 - INFO - __main__ - Step 21252: {'lr': 0.00047941546698106386, 'samples': 4080384, 'steps': 21251, 'loss/train': 1.0373538732528687} 11/07/2021 00:13:53 - INFO - __main__ - Step 21253: {'lr': 0.0004794133582299156, 'samples': 4080576, 'steps': 21252, 'loss/train': 1.5038514137268066} 11/07/2021 00:13:53 - INFO - __main__ - Step 21254: {'lr': 0.0004794112493753972, 'samples': 4080768, 'steps': 21253, 'loss/train': 1.8955644369125366} 11/07/2021 00:13:54 - INFO - __main__ - Step 21255: {'lr': 0.0004794091404175097, 'samples': 4080960, 'steps': 21254, 'loss/train': 1.3596187829971313} 11/07/2021 00:13:55 - INFO - __main__ - Step 21256: {'lr': 0.00047940703135625386, 'samples': 4081152, 'steps': 21255, 'loss/train': 1.5795459747314453} 11/07/2021 00:13:55 - INFO - __main__ - Step 21257: {'lr': 0.0004794049221916308, 'samples': 4081344, 'steps': 21256, 'loss/train': 1.0606127977371216} 11/07/2021 00:13:55 - INFO - __main__ - Step 21258: {'lr': 0.00047940281292364146, 'samples': 4081536, 'steps': 21257, 'loss/train': 1.7316327095031738} 11/07/2021 00:13:56 - INFO - __main__ - Step 21259: {'lr': 0.0004794007035522867, 'samples': 4081728, 'steps': 21258, 'loss/train': 1.6890673637390137} 11/07/2021 00:13:56 - INFO - __main__ - Step 21260: {'lr': 0.0004793985940775676, 'samples': 4081920, 'steps': 21259, 'loss/train': 1.8833643198013306} 11/07/2021 00:13:57 - INFO - __main__ - Step 21261: {'lr': 0.0004793964844994849, 'samples': 4082112, 'steps': 21260, 'loss/train': 1.4741991758346558} 11/07/2021 00:13:58 - INFO - __main__ - Step 21262: {'lr': 0.00047939437481803984, 'samples': 4082304, 'steps': 21261, 'loss/train': 1.6706597805023193} 11/07/2021 00:13:58 - INFO - __main__ - Step 21263: {'lr': 0.00047939226503323313, 'samples': 4082496, 'steps': 21262, 'loss/train': 2.2536189556121826} 11/07/2021 00:13:59 - INFO - __main__ - Step 21264: {'lr': 0.0004793901551450658, 'samples': 4082688, 'steps': 21263, 'loss/train': 1.888841152191162} 11/07/2021 00:13:59 - INFO - __main__ - Step 21265: {'lr': 0.00047938804515353887, 'samples': 4082880, 'steps': 21264, 'loss/train': 0.34845373034477234} 11/07/2021 00:14:00 - INFO - __main__ - Step 21266: {'lr': 0.00047938593505865315, 'samples': 4083072, 'steps': 21265, 'loss/train': 1.6031692028045654} 11/07/2021 00:14:00 - INFO - __main__ - Step 21267: {'lr': 0.00047938382486040963, 'samples': 4083264, 'steps': 21266, 'loss/train': 0.9460209012031555} 11/07/2021 00:14:01 - INFO - __main__ - Step 21268: {'lr': 0.0004793817145588094, 'samples': 4083456, 'steps': 21267, 'loss/train': 1.3523656129837036} 11/07/2021 00:14:01 - INFO - __main__ - Step 21269: {'lr': 0.0004793796041538533, 'samples': 4083648, 'steps': 21268, 'loss/train': 1.5146702527999878} 11/07/2021 00:14:01 - INFO - __main__ - Step 21270: {'lr': 0.00047937749364554226, 'samples': 4083840, 'steps': 21269, 'loss/train': 0.8026184439659119} 11/07/2021 00:14:02 - INFO - __main__ - Step 21271: {'lr': 0.0004793753830338773, 'samples': 4084032, 'steps': 21270, 'loss/train': 1.6856791973114014} 11/07/2021 00:14:03 - INFO - __main__ - Step 21272: {'lr': 0.00047937327231885925, 'samples': 4084224, 'steps': 21271, 'loss/train': 1.488318920135498} 11/07/2021 00:14:03 - INFO - __main__ - Step 21273: {'lr': 0.0004793711615004892, 'samples': 4084416, 'steps': 21272, 'loss/train': 1.5594409704208374} 11/07/2021 00:14:03 - INFO - __main__ - Step 21274: {'lr': 0.000479369050578768, 'samples': 4084608, 'steps': 21273, 'loss/train': 0.9363613128662109} 11/07/2021 00:14:04 - INFO - __main__ - Step 21275: {'lr': 0.0004793669395536967, 'samples': 4084800, 'steps': 21274, 'loss/train': 1.5633370876312256} 11/07/2021 00:14:04 - INFO - __main__ - Step 21276: {'lr': 0.00047936482842527616, 'samples': 4084992, 'steps': 21275, 'loss/train': 1.7852537631988525} 11/07/2021 00:14:05 - INFO - __main__ - Step 21277: {'lr': 0.00047936271719350743, 'samples': 4085184, 'steps': 21276, 'loss/train': 1.7984212636947632} 11/07/2021 00:14:05 - INFO - __main__ - Step 21278: {'lr': 0.0004793606058583913, 'samples': 4085376, 'steps': 21277, 'loss/train': 1.8766491413116455} 11/07/2021 00:14:06 - INFO - __main__ - Step 21279: {'lr': 0.00047935849441992887, 'samples': 4085568, 'steps': 21278, 'loss/train': 1.998377799987793} 11/07/2021 00:14:06 - INFO - __main__ - Step 21280: {'lr': 0.00047935638287812104, 'samples': 4085760, 'steps': 21279, 'loss/train': 1.6133617162704468} 11/07/2021 00:14:07 - INFO - __main__ - Step 21281: {'lr': 0.00047935427123296884, 'samples': 4085952, 'steps': 21280, 'loss/train': 1.5871071815490723} 11/07/2021 00:14:08 - INFO - __main__ - Step 21282: {'lr': 0.000479352159484473, 'samples': 4086144, 'steps': 21281, 'loss/train': 1.3508565425872803} 11/07/2021 00:14:08 - INFO - __main__ - Step 21283: {'lr': 0.0004793500476326347, 'samples': 4086336, 'steps': 21282, 'loss/train': 1.6303493976593018} 11/07/2021 00:14:08 - INFO - __main__ - Step 21284: {'lr': 0.0004793479356774548, 'samples': 4086528, 'steps': 21283, 'loss/train': 1.6954418420791626} 11/07/2021 00:14:09 - INFO - __main__ - Step 21285: {'lr': 0.00047934582361893423, 'samples': 4086720, 'steps': 21284, 'loss/train': 1.3281868696212769} 11/07/2021 00:14:09 - INFO - __main__ - Step 21286: {'lr': 0.000479343711457074, 'samples': 4086912, 'steps': 21285, 'loss/train': 1.7398327589035034} 11/07/2021 00:14:10 - INFO - __main__ - Step 21287: {'lr': 0.00047934159919187504, 'samples': 4087104, 'steps': 21286, 'loss/train': 1.656062126159668} 11/07/2021 00:14:10 - INFO - __main__ - Step 21288: {'lr': 0.0004793394868233383, 'samples': 4087296, 'steps': 21287, 'loss/train': 1.9397815465927124} 11/07/2021 00:14:11 - INFO - __main__ - Step 21289: {'lr': 0.0004793373743514647, 'samples': 4087488, 'steps': 21288, 'loss/train': 1.4470610618591309} 11/07/2021 00:14:11 - INFO - __main__ - Step 21290: {'lr': 0.0004793352617762552, 'samples': 4087680, 'steps': 21289, 'loss/train': 1.4866148233413696} 11/07/2021 00:14:11 - INFO - __main__ - Step 21291: {'lr': 0.0004793331490977108, 'samples': 4087872, 'steps': 21290, 'loss/train': 1.5853736400604248} 11/07/2021 00:14:12 - INFO - __main__ - Step 21292: {'lr': 0.0004793310363158324, 'samples': 4088064, 'steps': 21291, 'loss/train': 1.5216046571731567} 11/07/2021 00:14:13 - INFO - __main__ - Step 21293: {'lr': 0.00047932892343062103, 'samples': 4088256, 'steps': 21292, 'loss/train': 1.7325321435928345} 11/07/2021 00:14:13 - INFO - __main__ - Step 21294: {'lr': 0.00047932681044207757, 'samples': 4088448, 'steps': 21293, 'loss/train': 1.4280503988265991} 11/07/2021 00:14:13 - INFO - __main__ - Step 21295: {'lr': 0.0004793246973502029, 'samples': 4088640, 'steps': 21294, 'loss/train': 1.394337773323059} 11/07/2021 00:14:14 - INFO - __main__ - Step 21296: {'lr': 0.0004793225841549982, 'samples': 4088832, 'steps': 21295, 'loss/train': 1.7424063682556152} 11/07/2021 00:14:15 - INFO - __main__ - Step 21297: {'lr': 0.00047932047085646416, 'samples': 4089024, 'steps': 21296, 'loss/train': 1.4018278121948242} 11/07/2021 00:14:15 - INFO - __main__ - Step 21298: {'lr': 0.0004793183574546019, 'samples': 4089216, 'steps': 21297, 'loss/train': 1.655775547027588} 11/07/2021 00:14:16 - INFO - __main__ - Step 21299: {'lr': 0.0004793162439494123, 'samples': 4089408, 'steps': 21298, 'loss/train': 1.1595053672790527} 11/07/2021 00:14:16 - INFO - __main__ - Step 21300: {'lr': 0.00047931413034089644, 'samples': 4089600, 'steps': 21299, 'loss/train': 0.8736221194267273} 11/07/2021 00:14:16 - INFO - __main__ - Step 21301: {'lr': 0.00047931201662905503, 'samples': 4089792, 'steps': 21300, 'loss/train': 1.8875302076339722} 11/07/2021 00:14:17 - INFO - __main__ - Step 21302: {'lr': 0.00047930990281388927, 'samples': 4089984, 'steps': 21301, 'loss/train': 1.8271974325180054} 11/07/2021 00:14:18 - INFO - __main__ - Step 21303: {'lr': 0.00047930778889539996, 'samples': 4090176, 'steps': 21302, 'loss/train': 1.5348069667816162} 11/07/2021 00:14:18 - INFO - __main__ - Step 21304: {'lr': 0.00047930567487358813, 'samples': 4090368, 'steps': 21303, 'loss/train': 0.8241930603981018} 11/07/2021 00:14:18 - INFO - __main__ - Step 21305: {'lr': 0.00047930356074845466, 'samples': 4090560, 'steps': 21304, 'loss/train': 1.793365240097046} 11/07/2021 00:14:19 - INFO - __main__ - Step 21306: {'lr': 0.0004793014465200005, 'samples': 4090752, 'steps': 21305, 'loss/train': 1.727817177772522} 11/07/2021 00:14:19 - INFO - __main__ - Step 21307: {'lr': 0.0004792993321882267, 'samples': 4090944, 'steps': 21306, 'loss/train': 1.5948116779327393} 11/07/2021 00:14:20 - INFO - __main__ - Step 21308: {'lr': 0.0004792972177531342, 'samples': 4091136, 'steps': 21307, 'loss/train': 1.57756507396698} 11/07/2021 00:14:20 - INFO - __main__ - Step 21309: {'lr': 0.0004792951032147239, 'samples': 4091328, 'steps': 21308, 'loss/train': 1.8153151273727417} 11/07/2021 00:14:21 - INFO - __main__ - Step 21310: {'lr': 0.00047929298857299677, 'samples': 4091520, 'steps': 21309, 'loss/train': 1.4541163444519043} 11/07/2021 00:14:21 - INFO - __main__ - Step 21311: {'lr': 0.00047929087382795374, 'samples': 4091712, 'steps': 21310, 'loss/train': 1.1384114027023315} 11/07/2021 00:14:21 - INFO - __main__ - Step 21312: {'lr': 0.0004792887589795957, 'samples': 4091904, 'steps': 21311, 'loss/train': 1.6404975652694702} 11/07/2021 00:14:23 - INFO - __main__ - Step 21313: {'lr': 0.00047928664402792376, 'samples': 4092096, 'steps': 21312, 'loss/train': 1.0429054498672485} 11/07/2021 00:14:23 - INFO - __main__ - Step 21314: {'lr': 0.0004792845289729388, 'samples': 4092288, 'steps': 21313, 'loss/train': 1.4683235883712769} 11/07/2021 00:14:23 - INFO - __main__ - Step 21315: {'lr': 0.00047928241381464177, 'samples': 4092480, 'steps': 21314, 'loss/train': 2.05292010307312} 11/07/2021 00:14:24 - INFO - __main__ - Step 21316: {'lr': 0.0004792802985530337, 'samples': 4092672, 'steps': 21315, 'loss/train': 1.7252967357635498} 11/07/2021 00:14:24 - INFO - __main__ - Step 21317: {'lr': 0.0004792781831881153, 'samples': 4092864, 'steps': 21316, 'loss/train': 1.5299564599990845} 11/07/2021 00:14:25 - INFO - __main__ - Step 21318: {'lr': 0.0004792760677198878, 'samples': 4093056, 'steps': 21317, 'loss/train': 0.8147514462471008} 11/07/2021 00:14:25 - INFO - __main__ - Step 21319: {'lr': 0.00047927395214835203, 'samples': 4093248, 'steps': 21318, 'loss/train': 1.686421275138855} 11/07/2021 00:14:26 - INFO - __main__ - Step 21320: {'lr': 0.0004792718364735089, 'samples': 4093440, 'steps': 21319, 'loss/train': 1.8813480138778687} 11/07/2021 00:14:26 - INFO - __main__ - Step 21321: {'lr': 0.00047926972069535945, 'samples': 4093632, 'steps': 21320, 'loss/train': 1.4405698776245117} 11/07/2021 00:14:26 - INFO - __main__ - Step 21322: {'lr': 0.00047926760481390465, 'samples': 4093824, 'steps': 21321, 'loss/train': 1.2762727737426758} 11/07/2021 00:14:28 - INFO - __main__ - Step 21323: {'lr': 0.00047926548882914533, 'samples': 4094016, 'steps': 21322, 'loss/train': 2.274826765060425} 11/07/2021 00:14:28 - INFO - __main__ - Step 21324: {'lr': 0.0004792633727410826, 'samples': 4094208, 'steps': 21323, 'loss/train': 1.5988800525665283} 11/07/2021 00:14:28 - INFO - __main__ - Step 21325: {'lr': 0.0004792612565497172, 'samples': 4094400, 'steps': 21324, 'loss/train': 1.2898582220077515} 11/07/2021 00:14:29 - INFO - __main__ - Step 21326: {'lr': 0.00047925914025505036, 'samples': 4094592, 'steps': 21325, 'loss/train': 1.2253859043121338} 11/07/2021 00:14:29 - INFO - __main__ - Step 21327: {'lr': 0.0004792570238570828, 'samples': 4094784, 'steps': 21326, 'loss/train': 1.2003458738327026} 11/07/2021 00:14:30 - INFO - __main__ - Step 21328: {'lr': 0.00047925490735581557, 'samples': 4094976, 'steps': 21327, 'loss/train': 1.1544514894485474} 11/07/2021 00:14:30 - INFO - __main__ - Step 21329: {'lr': 0.00047925279075124963, 'samples': 4095168, 'steps': 21328, 'loss/train': 1.454081654548645} 11/07/2021 00:14:31 - INFO - __main__ - Step 21330: {'lr': 0.00047925067404338596, 'samples': 4095360, 'steps': 21329, 'loss/train': 1.619686484336853} 11/07/2021 00:14:31 - INFO - __main__ - Step 21331: {'lr': 0.00047924855723222536, 'samples': 4095552, 'steps': 21330, 'loss/train': 1.378673791885376} 11/07/2021 00:14:31 - INFO - __main__ - Step 21332: {'lr': 0.000479246440317769, 'samples': 4095744, 'steps': 21331, 'loss/train': 1.2609995603561401} 11/07/2021 00:14:32 - INFO - __main__ - Step 21333: {'lr': 0.00047924432330001776, 'samples': 4095936, 'steps': 21332, 'loss/train': 1.5046299695968628} 11/07/2021 00:14:33 - INFO - __main__ - Step 21334: {'lr': 0.0004792422061789725, 'samples': 4096128, 'steps': 21333, 'loss/train': 1.7911027669906616} 11/07/2021 00:14:33 - INFO - __main__ - Step 21335: {'lr': 0.0004792400889546342, 'samples': 4096320, 'steps': 21334, 'loss/train': 1.649451494216919} 11/07/2021 00:14:33 - INFO - __main__ - Step 21336: {'lr': 0.00047923797162700393, 'samples': 4096512, 'steps': 21335, 'loss/train': 1.574715495109558} 11/07/2021 00:14:34 - INFO - __main__ - Step 21337: {'lr': 0.0004792358541960826, 'samples': 4096704, 'steps': 21336, 'loss/train': 2.1345772743225098} 11/07/2021 00:14:34 - INFO - __main__ - Step 21338: {'lr': 0.000479233736661871, 'samples': 4096896, 'steps': 21337, 'loss/train': 1.7612472772598267} 11/07/2021 00:14:36 - INFO - __main__ - Step 21339: {'lr': 0.0004792316190243703, 'samples': 4097088, 'steps': 21338, 'loss/train': 1.8308136463165283} 11/07/2021 00:14:36 - INFO - __main__ - Step 21340: {'lr': 0.0004792295012835814, 'samples': 4097280, 'steps': 21339, 'loss/train': 1.3093798160552979} 11/07/2021 00:14:36 - INFO - __main__ - Step 21341: {'lr': 0.0004792273834395052, 'samples': 4097472, 'steps': 21340, 'loss/train': 1.1870094537734985} 11/07/2021 00:14:37 - INFO - __main__ - Step 21342: {'lr': 0.0004792252654921426, 'samples': 4097664, 'steps': 21341, 'loss/train': 0.9535714983940125} 11/07/2021 00:14:37 - INFO - __main__ - Step 21343: {'lr': 0.00047922314744149475, 'samples': 4097856, 'steps': 21342, 'loss/train': 0.45689645409584045} 11/07/2021 00:14:38 - INFO - __main__ - Step 21344: {'lr': 0.0004792210292875624, 'samples': 4098048, 'steps': 21343, 'loss/train': 1.4933922290802002} 11/07/2021 00:14:38 - INFO - __main__ - Step 21345: {'lr': 0.00047921891103034665, 'samples': 4098240, 'steps': 21344, 'loss/train': 1.3660441637039185} 11/07/2021 00:14:39 - INFO - __main__ - Step 21346: {'lr': 0.0004792167926698483, 'samples': 4098432, 'steps': 21345, 'loss/train': 1.77027428150177} 11/07/2021 00:14:39 - INFO - __main__ - Step 21347: {'lr': 0.0004792146742060685, 'samples': 4098624, 'steps': 21346, 'loss/train': 1.3939626216888428} 11/07/2021 00:14:39 - INFO - __main__ - Step 21348: {'lr': 0.00047921255563900813, 'samples': 4098816, 'steps': 21347, 'loss/train': 1.9357357025146484} 11/07/2021 00:14:40 - INFO - __main__ - Step 21349: {'lr': 0.000479210436968668, 'samples': 4099008, 'steps': 21348, 'loss/train': 0.2855474650859833} 11/07/2021 00:14:41 - INFO - __main__ - Step 21350: {'lr': 0.0004792083181950493, 'samples': 4099200, 'steps': 21349, 'loss/train': 1.2464492321014404} 11/07/2021 00:14:41 - INFO - __main__ - Step 21351: {'lr': 0.0004792061993181528, 'samples': 4099392, 'steps': 21350, 'loss/train': 1.0355896949768066} 11/07/2021 00:14:41 - INFO - __main__ - Step 21352: {'lr': 0.00047920408033797954, 'samples': 4099584, 'steps': 21351, 'loss/train': 1.2491384744644165} 11/07/2021 00:14:42 - INFO - __main__ - Step 21353: {'lr': 0.0004792019612545304, 'samples': 4099776, 'steps': 21352, 'loss/train': 1.8609702587127686} 11/07/2021 00:14:43 - INFO - __main__ - Step 21354: {'lr': 0.00047919984206780647, 'samples': 4099968, 'steps': 21353, 'loss/train': 1.1716221570968628} 11/07/2021 00:14:43 - INFO - __main__ - Step 21355: {'lr': 0.0004791977227778086, 'samples': 4100160, 'steps': 21354, 'loss/train': 1.2437918186187744} 11/07/2021 00:14:44 - INFO - __main__ - Step 21356: {'lr': 0.00047919560338453783, 'samples': 4100352, 'steps': 21355, 'loss/train': 1.356000304222107} 11/07/2021 00:14:44 - INFO - __main__ - Step 21357: {'lr': 0.000479193483887995, 'samples': 4100544, 'steps': 21356, 'loss/train': 1.198959469795227} 11/07/2021 00:14:44 - INFO - __main__ - Step 21358: {'lr': 0.0004791913642881811, 'samples': 4100736, 'steps': 21357, 'loss/train': 1.6075674295425415} 11/07/2021 00:14:45 - INFO - __main__ - Step 21359: {'lr': 0.00047918924458509717, 'samples': 4100928, 'steps': 21358, 'loss/train': 1.3278419971466064} 11/07/2021 00:14:46 - INFO - __main__ - Step 21360: {'lr': 0.00047918712477874404, 'samples': 4101120, 'steps': 21359, 'loss/train': 1.41783607006073} 11/07/2021 00:14:46 - INFO - __main__ - Step 21361: {'lr': 0.00047918500486912276, 'samples': 4101312, 'steps': 21360, 'loss/train': 1.095545768737793} 11/07/2021 00:14:46 - INFO - __main__ - Step 21362: {'lr': 0.00047918288485623427, 'samples': 4101504, 'steps': 21361, 'loss/train': 1.4473345279693604} 11/07/2021 00:14:47 - INFO - __main__ - Step 21363: {'lr': 0.0004791807647400795, 'samples': 4101696, 'steps': 21362, 'loss/train': 1.668547511100769} 11/07/2021 00:14:48 - INFO - __main__ - Step 21364: {'lr': 0.0004791786445206594, 'samples': 4101888, 'steps': 21363, 'loss/train': 1.4369771480560303} 11/07/2021 00:14:48 - INFO - __main__ - Step 21365: {'lr': 0.00047917652419797495, 'samples': 4102080, 'steps': 21364, 'loss/train': 1.4824670553207397} 11/07/2021 00:14:48 - INFO - __main__ - Step 21366: {'lr': 0.0004791744037720271, 'samples': 4102272, 'steps': 21365, 'loss/train': 1.810196042060852} 11/07/2021 00:14:49 - INFO - __main__ - Step 21367: {'lr': 0.00047917228324281683, 'samples': 4102464, 'steps': 21366, 'loss/train': 1.1198419332504272} 11/07/2021 00:14:49 - INFO - __main__ - Step 21368: {'lr': 0.00047917016261034496, 'samples': 4102656, 'steps': 21367, 'loss/train': 1.3379572629928589} 11/07/2021 00:14:50 - INFO - __main__ - Step 21369: {'lr': 0.0004791680418746126, 'samples': 4102848, 'steps': 21368, 'loss/train': 1.3369263410568237} 11/07/2021 00:14:51 - INFO - __main__ - Step 21370: {'lr': 0.00047916592103562075, 'samples': 4103040, 'steps': 21369, 'loss/train': 3.3667120933532715} 11/07/2021 00:14:51 - INFO - __main__ - Step 21371: {'lr': 0.00047916380009337014, 'samples': 4103232, 'steps': 21370, 'loss/train': 1.5507954359054565} 11/07/2021 00:14:51 - INFO - __main__ - Step 21372: {'lr': 0.0004791616790478619, 'samples': 4103424, 'steps': 21371, 'loss/train': 0.9956333041191101} 11/07/2021 00:14:52 - INFO - __main__ - Step 21373: {'lr': 0.000479159557899097, 'samples': 4103616, 'steps': 21372, 'loss/train': 1.4630414247512817} 11/07/2021 00:14:52 - INFO - __main__ - Step 21374: {'lr': 0.00047915743664707626, 'samples': 4103808, 'steps': 21373, 'loss/train': 0.219575896859169} 11/07/2021 00:14:53 - INFO - __main__ - Step 21375: {'lr': 0.0004791553152918008, 'samples': 4104000, 'steps': 21374, 'loss/train': 1.7526909112930298} 11/07/2021 00:14:53 - INFO - __main__ - Step 21376: {'lr': 0.0004791531938332714, 'samples': 4104192, 'steps': 21375, 'loss/train': 1.4546867609024048} 11/07/2021 00:14:54 - INFO - __main__ - Step 21377: {'lr': 0.0004791510722714891, 'samples': 4104384, 'steps': 21376, 'loss/train': 1.7402167320251465} 11/07/2021 00:14:54 - INFO - __main__ - Step 21378: {'lr': 0.000479148950606455, 'samples': 4104576, 'steps': 21377, 'loss/train': 1.7246084213256836} 11/07/2021 00:14:54 - INFO - __main__ - Step 21379: {'lr': 0.00047914682883816977, 'samples': 4104768, 'steps': 21378, 'loss/train': 1.3976600170135498} 11/07/2021 00:14:55 - INFO - __main__ - Step 21380: {'lr': 0.00047914470696663457, 'samples': 4104960, 'steps': 21379, 'loss/train': 0.43213319778442383} 11/07/2021 00:14:56 - INFO - __main__ - Step 21381: {'lr': 0.00047914258499185037, 'samples': 4105152, 'steps': 21380, 'loss/train': 1.7519570589065552} 11/07/2021 00:14:56 - INFO - __main__ - Step 21382: {'lr': 0.000479140462913818, 'samples': 4105344, 'steps': 21381, 'loss/train': 1.511462926864624} 11/07/2021 00:14:57 - INFO - __main__ - Step 21383: {'lr': 0.0004791383407325384, 'samples': 4105536, 'steps': 21382, 'loss/train': 1.6584688425064087} 11/07/2021 00:14:57 - INFO - __main__ - Step 21384: {'lr': 0.0004791362184480127, 'samples': 4105728, 'steps': 21383, 'loss/train': 1.2212923765182495} 11/07/2021 00:14:58 - INFO - __main__ - Step 21385: {'lr': 0.0004791340960602417, 'samples': 4105920, 'steps': 21384, 'loss/train': 1.8661881685256958} 11/07/2021 00:14:58 - INFO - __main__ - Step 21386: {'lr': 0.0004791319735692264, 'samples': 4106112, 'steps': 21385, 'loss/train': 1.2798576354980469} 11/07/2021 00:14:59 - INFO - __main__ - Step 21387: {'lr': 0.00047912985097496786, 'samples': 4106304, 'steps': 21386, 'loss/train': 1.0371434688568115} 11/07/2021 00:14:59 - INFO - __main__ - Step 21388: {'lr': 0.00047912772827746685, 'samples': 4106496, 'steps': 21387, 'loss/train': 1.1820610761642456} 11/07/2021 00:14:59 - INFO - __main__ - Step 21389: {'lr': 0.00047912560547672453, 'samples': 4106688, 'steps': 21388, 'loss/train': 1.3891115188598633} 11/07/2021 00:15:00 - INFO - __main__ - Step 21390: {'lr': 0.0004791234825727416, 'samples': 4106880, 'steps': 21389, 'loss/train': 1.801900863647461} 11/07/2021 00:15:01 - INFO - __main__ - Step 21391: {'lr': 0.0004791213595655193, 'samples': 4107072, 'steps': 21390, 'loss/train': 1.3491114377975464} 11/07/2021 00:15:01 - INFO - __main__ - Step 21392: {'lr': 0.0004791192364550584, 'samples': 4107264, 'steps': 21391, 'loss/train': 0.3875841498374939} 11/07/2021 00:15:01 - INFO - __main__ - Step 21393: {'lr': 0.00047911711324135985, 'samples': 4107456, 'steps': 21392, 'loss/train': 0.8895827531814575} 11/07/2021 00:15:02 - INFO - __main__ - Step 21394: {'lr': 0.00047911498992442476, 'samples': 4107648, 'steps': 21393, 'loss/train': 1.6767473220825195} 11/07/2021 00:15:02 - INFO - __main__ - Step 21395: {'lr': 0.0004791128665042539, 'samples': 4107840, 'steps': 21394, 'loss/train': 1.5300005674362183} 11/07/2021 00:15:03 - INFO - __main__ - Step 21396: {'lr': 0.0004791107429808484, 'samples': 4108032, 'steps': 21395, 'loss/train': 1.3103961944580078} 11/07/2021 00:15:03 - INFO - __main__ - Step 21397: {'lr': 0.00047910861935420915, 'samples': 4108224, 'steps': 21396, 'loss/train': 1.617367148399353} 11/07/2021 00:15:04 - INFO - __main__ - Step 21398: {'lr': 0.00047910649562433696, 'samples': 4108416, 'steps': 21397, 'loss/train': 1.543279767036438} 11/07/2021 00:15:04 - INFO - __main__ - Step 21399: {'lr': 0.000479104371791233, 'samples': 4108608, 'steps': 21398, 'loss/train': 1.3778434991836548} 11/07/2021 00:15:04 - INFO - __main__ - Step 21400: {'lr': 0.0004791022478548982, 'samples': 4108800, 'steps': 21399, 'loss/train': 1.259809136390686} 11/07/2021 00:15:06 - INFO - __main__ - Step 21401: {'lr': 0.0004791001238153334, 'samples': 4108992, 'steps': 21400, 'loss/train': 1.7310465574264526} 11/07/2021 00:15:06 - INFO - __main__ - Step 21402: {'lr': 0.00047909799967253957, 'samples': 4109184, 'steps': 21401, 'loss/train': 1.3898957967758179} 11/07/2021 00:15:06 - INFO - __main__ - Step 21403: {'lr': 0.00047909587542651776, 'samples': 4109376, 'steps': 21402, 'loss/train': 2.3288679122924805} 11/07/2021 00:15:07 - INFO - __main__ - Step 21404: {'lr': 0.00047909375107726894, 'samples': 4109568, 'steps': 21403, 'loss/train': 1.4478750228881836} 11/07/2021 00:15:07 - INFO - __main__ - Step 21405: {'lr': 0.000479091626624794, 'samples': 4109760, 'steps': 21404, 'loss/train': 1.4665449857711792} 11/07/2021 00:15:08 - INFO - __main__ - Step 21406: {'lr': 0.00047908950206909385, 'samples': 4109952, 'steps': 21405, 'loss/train': 2.423110008239746} 11/07/2021 00:15:08 - INFO - __main__ - Step 21407: {'lr': 0.0004790873774101695, 'samples': 4110144, 'steps': 21406, 'loss/train': 1.829443097114563} 11/07/2021 00:15:09 - INFO - __main__ - Step 21408: {'lr': 0.00047908525264802194, 'samples': 4110336, 'steps': 21407, 'loss/train': 1.2481427192687988} 11/07/2021 00:15:09 - INFO - __main__ - Step 21409: {'lr': 0.00047908312778265213, 'samples': 4110528, 'steps': 21408, 'loss/train': 1.5026124715805054} 11/07/2021 00:15:09 - INFO - __main__ - Step 21410: {'lr': 0.00047908100281406096, 'samples': 4110720, 'steps': 21409, 'loss/train': 1.617781162261963} 11/07/2021 00:15:10 - INFO - __main__ - Step 21411: {'lr': 0.00047907887774224946, 'samples': 4110912, 'steps': 21410, 'loss/train': 1.7944114208221436} 11/07/2021 00:15:11 - INFO - __main__ - Step 21412: {'lr': 0.0004790767525672185, 'samples': 4111104, 'steps': 21411, 'loss/train': 1.4158649444580078} 11/07/2021 00:15:11 - INFO - __main__ - Step 21413: {'lr': 0.0004790746272889691, 'samples': 4111296, 'steps': 21412, 'loss/train': 1.090224027633667} 11/07/2021 00:15:11 - INFO - __main__ - Step 21414: {'lr': 0.00047907250190750225, 'samples': 4111488, 'steps': 21413, 'loss/train': 1.7329736948013306} 11/07/2021 00:15:12 - INFO - __main__ - Step 21415: {'lr': 0.0004790703764228188, 'samples': 4111680, 'steps': 21414, 'loss/train': 1.5330586433410645} 11/07/2021 00:15:13 - INFO - __main__ - Step 21416: {'lr': 0.0004790682508349198, 'samples': 4111872, 'steps': 21415, 'loss/train': 1.7020710706710815} 11/07/2021 00:15:13 - INFO - __main__ - Step 21417: {'lr': 0.00047906612514380623, 'samples': 4112064, 'steps': 21416, 'loss/train': 2.212350368499756} 11/07/2021 00:15:14 - INFO - __main__ - Step 21418: {'lr': 0.000479063999349479, 'samples': 4112256, 'steps': 21417, 'loss/train': 0.7235398292541504} 11/07/2021 00:15:14 - INFO - __main__ - Step 21419: {'lr': 0.00047906187345193895, 'samples': 4112448, 'steps': 21418, 'loss/train': 1.5190248489379883} 11/07/2021 00:15:14 - INFO - __main__ - Step 21420: {'lr': 0.0004790597474511873, 'samples': 4112640, 'steps': 21419, 'loss/train': 1.4230188131332397} 11/07/2021 00:15:15 - INFO - __main__ - Step 21421: {'lr': 0.0004790576213472248, 'samples': 4112832, 'steps': 21420, 'loss/train': 1.2656906843185425} 11/07/2021 00:15:16 - INFO - __main__ - Step 21422: {'lr': 0.0004790554951400524, 'samples': 4113024, 'steps': 21421, 'loss/train': 1.5593222379684448} 11/07/2021 00:15:16 - INFO - __main__ - Step 21423: {'lr': 0.0004790533688296712, 'samples': 4113216, 'steps': 21422, 'loss/train': 1.5328755378723145} 11/07/2021 00:15:16 - INFO - __main__ - Step 21424: {'lr': 0.0004790512424160821, 'samples': 4113408, 'steps': 21423, 'loss/train': 1.8626660108566284} 11/07/2021 00:15:17 - INFO - __main__ - Step 21425: {'lr': 0.00047904911589928605, 'samples': 4113600, 'steps': 21424, 'loss/train': 1.336646556854248} 11/07/2021 00:15:17 - INFO - __main__ - Step 21426: {'lr': 0.00047904698927928404, 'samples': 4113792, 'steps': 21425, 'loss/train': 1.711372971534729} 11/07/2021 00:15:18 - INFO - __main__ - Step 21427: {'lr': 0.0004790448625560769, 'samples': 4113984, 'steps': 21426, 'loss/train': 1.2243375778198242} 11/07/2021 00:15:18 - INFO - __main__ - Step 21428: {'lr': 0.0004790427357296657, 'samples': 4114176, 'steps': 21427, 'loss/train': 1.5050300359725952} 11/07/2021 00:15:19 - INFO - __main__ - Step 21429: {'lr': 0.0004790406088000514, 'samples': 4114368, 'steps': 21428, 'loss/train': 1.5563852787017822} 11/07/2021 00:15:19 - INFO - __main__ - Step 21430: {'lr': 0.00047903848176723493, 'samples': 4114560, 'steps': 21429, 'loss/train': 1.2160688638687134} 11/07/2021 00:15:19 - INFO - __main__ - Step 21431: {'lr': 0.0004790363546312172, 'samples': 4114752, 'steps': 21430, 'loss/train': 1.644734263420105} 11/07/2021 00:15:20 - INFO - __main__ - Step 21432: {'lr': 0.0004790342273919993, 'samples': 4114944, 'steps': 21431, 'loss/train': 1.6484546661376953} 11/07/2021 00:15:21 - INFO - __main__ - Step 21433: {'lr': 0.00047903210004958207, 'samples': 4115136, 'steps': 21432, 'loss/train': 1.6944466829299927} 11/07/2021 00:15:21 - INFO - __main__ - Step 21434: {'lr': 0.0004790299726039665, 'samples': 4115328, 'steps': 21433, 'loss/train': 1.1725130081176758} 11/07/2021 00:15:22 - INFO - __main__ - Step 21435: {'lr': 0.0004790278450551536, 'samples': 4115520, 'steps': 21434, 'loss/train': 1.5247802734375} 11/07/2021 00:15:22 - INFO - __main__ - Step 21436: {'lr': 0.00047902571740314427, 'samples': 4115712, 'steps': 21435, 'loss/train': 1.878816843032837} 11/07/2021 00:15:23 - INFO - __main__ - Step 21437: {'lr': 0.00047902358964793944, 'samples': 4115904, 'steps': 21436, 'loss/train': 1.6808935403823853} 11/07/2021 00:15:23 - INFO - __main__ - Step 21438: {'lr': 0.0004790214617895402, 'samples': 4116096, 'steps': 21437, 'loss/train': 1.717866063117981} 11/07/2021 00:15:24 - INFO - __main__ - Step 21439: {'lr': 0.0004790193338279474, 'samples': 4116288, 'steps': 21438, 'loss/train': 1.757002353668213} 11/07/2021 00:15:24 - INFO - __main__ - Step 21440: {'lr': 0.000479017205763162, 'samples': 4116480, 'steps': 21439, 'loss/train': 1.3909929990768433} 11/07/2021 00:15:25 - INFO - __main__ - Step 21441: {'lr': 0.000479015077595185, 'samples': 4116672, 'steps': 21440, 'loss/train': 1.5615991353988647} 11/07/2021 00:15:25 - INFO - __main__ - Step 21442: {'lr': 0.0004790129493240173, 'samples': 4116864, 'steps': 21441, 'loss/train': 0.7827515006065369} 11/07/2021 00:15:26 - INFO - __main__ - Step 21443: {'lr': 0.0004790108209496599, 'samples': 4117056, 'steps': 21442, 'loss/train': 1.5710242986679077} 11/07/2021 00:15:26 - INFO - __main__ - Step 21444: {'lr': 0.00047900869247211384, 'samples': 4117248, 'steps': 21443, 'loss/train': 1.5302128791809082} 11/07/2021 00:15:27 - INFO - __main__ - Step 21445: {'lr': 0.0004790065638913799, 'samples': 4117440, 'steps': 21444, 'loss/train': 1.9915907382965088} 11/07/2021 00:15:27 - INFO - __main__ - Step 21446: {'lr': 0.00047900443520745915, 'samples': 4117632, 'steps': 21445, 'loss/train': 1.787115216255188} 11/07/2021 00:15:27 - INFO - __main__ - Step 21447: {'lr': 0.0004790023064203526, 'samples': 4117824, 'steps': 21446, 'loss/train': 1.482800006866455} 11/07/2021 00:15:28 - INFO - __main__ - Step 21448: {'lr': 0.00047900017753006106, 'samples': 4118016, 'steps': 21447, 'loss/train': 1.878015398979187} 11/07/2021 00:15:29 - INFO - __main__ - Step 21449: {'lr': 0.0004789980485365857, 'samples': 4118208, 'steps': 21448, 'loss/train': 1.9219799041748047} 11/07/2021 00:15:29 - INFO - __main__ - Step 21450: {'lr': 0.00047899591943992726, 'samples': 4118400, 'steps': 21449, 'loss/train': 1.6092162132263184} 11/07/2021 00:15:29 - INFO - __main__ - Step 21451: {'lr': 0.0004789937902400868, 'samples': 4118592, 'steps': 21450, 'loss/train': 2.2746288776397705} 11/07/2021 00:15:30 - INFO - __main__ - Step 21452: {'lr': 0.00047899166093706523, 'samples': 4118784, 'steps': 21451, 'loss/train': 1.3884146213531494} 11/07/2021 00:15:31 - INFO - __main__ - Step 21453: {'lr': 0.0004789895315308636, 'samples': 4118976, 'steps': 21452, 'loss/train': 1.2079041004180908} 11/07/2021 00:15:31 - INFO - __main__ - Step 21454: {'lr': 0.00047898740202148284, 'samples': 4119168, 'steps': 21453, 'loss/train': 0.21061697602272034} 11/07/2021 00:15:31 - INFO - __main__ - Step 21455: {'lr': 0.0004789852724089239, 'samples': 4119360, 'steps': 21454, 'loss/train': 1.5838730335235596} 11/07/2021 00:15:32 - INFO - __main__ - Step 21456: {'lr': 0.00047898314269318766, 'samples': 4119552, 'steps': 21455, 'loss/train': 1.8635731935501099} 11/07/2021 00:15:32 - INFO - __main__ - Step 21457: {'lr': 0.00047898101287427523, 'samples': 4119744, 'steps': 21456, 'loss/train': 1.6409664154052734} 11/07/2021 00:15:33 - INFO - __main__ - Step 21458: {'lr': 0.0004789788829521874, 'samples': 4119936, 'steps': 21457, 'loss/train': 1.7607507705688477} 11/07/2021 00:15:34 - INFO - __main__ - Step 21459: {'lr': 0.0004789767529269253, 'samples': 4120128, 'steps': 21458, 'loss/train': 0.8947522640228271} 11/07/2021 00:15:34 - INFO - __main__ - Step 21460: {'lr': 0.0004789746227984897, 'samples': 4120320, 'steps': 21459, 'loss/train': 1.4214286804199219} 11/07/2021 00:15:34 - INFO - __main__ - Step 21461: {'lr': 0.0004789724925668818, 'samples': 4120512, 'steps': 21460, 'loss/train': 1.5798488855361938} 11/07/2021 00:15:35 - INFO - __main__ - Step 21462: {'lr': 0.00047897036223210234, 'samples': 4120704, 'steps': 21461, 'loss/train': 1.2147839069366455} 11/07/2021 00:15:36 - INFO - __main__ - Step 21463: {'lr': 0.00047896823179415237, 'samples': 4120896, 'steps': 21462, 'loss/train': 1.45625638961792} 11/07/2021 00:15:36 - INFO - __main__ - Step 21464: {'lr': 0.0004789661012530329, 'samples': 4121088, 'steps': 21463, 'loss/train': 1.694710612297058} 11/07/2021 00:15:37 - INFO - __main__ - Step 21465: {'lr': 0.00047896397060874485, 'samples': 4121280, 'steps': 21464, 'loss/train': 1.63399076461792} 11/07/2021 00:15:37 - INFO - __main__ - Step 21466: {'lr': 0.0004789618398612891, 'samples': 4121472, 'steps': 21465, 'loss/train': 2.6186089515686035} 11/07/2021 00:15:37 - INFO - __main__ - Step 21467: {'lr': 0.0004789597090106667, 'samples': 4121664, 'steps': 21466, 'loss/train': 1.5475844144821167} 11/07/2021 00:15:38 - INFO - __main__ - Step 21468: {'lr': 0.00047895757805687864, 'samples': 4121856, 'steps': 21467, 'loss/train': 1.74507737159729} 11/07/2021 00:15:39 - INFO - __main__ - Step 21469: {'lr': 0.0004789554469999258, 'samples': 4122048, 'steps': 21468, 'loss/train': 1.5908174514770508} 11/07/2021 00:15:39 - INFO - __main__ - Step 21470: {'lr': 0.0004789533158398091, 'samples': 4122240, 'steps': 21469, 'loss/train': 1.933680772781372} 11/07/2021 00:15:39 - INFO - __main__ - Step 21471: {'lr': 0.00047895118457652965, 'samples': 4122432, 'steps': 21470, 'loss/train': 1.2519850730895996} 11/07/2021 00:15:40 - INFO - __main__ - Step 21472: {'lr': 0.0004789490532100883, 'samples': 4122624, 'steps': 21471, 'loss/train': 3.244593858718872} 11/07/2021 00:15:40 - INFO - __main__ - Step 21473: {'lr': 0.000478946921740486, 'samples': 4122816, 'steps': 21472, 'loss/train': 1.6074554920196533} 11/07/2021 00:15:41 - INFO - __main__ - Step 21474: {'lr': 0.0004789447901677238, 'samples': 4123008, 'steps': 21473, 'loss/train': 0.4565056264400482} 11/07/2021 00:15:41 - INFO - __main__ - Step 21475: {'lr': 0.00047894265849180264, 'samples': 4123200, 'steps': 21474, 'loss/train': 0.8751310110092163} 11/07/2021 00:15:42 - INFO - __main__ - Step 21476: {'lr': 0.00047894052671272337, 'samples': 4123392, 'steps': 21475, 'loss/train': 1.7021253108978271} 11/07/2021 00:15:42 - INFO - __main__ - Step 21477: {'lr': 0.0004789383948304871, 'samples': 4123584, 'steps': 21476, 'loss/train': 1.6824284791946411} 11/07/2021 00:15:42 - INFO - __main__ - Step 21478: {'lr': 0.00047893626284509466, 'samples': 4123776, 'steps': 21477, 'loss/train': 1.7401764392852783} 11/07/2021 00:15:43 - INFO - __main__ - Step 21479: {'lr': 0.0004789341307565471, 'samples': 4123968, 'steps': 21478, 'loss/train': 1.7683900594711304} 11/07/2021 00:15:44 - INFO - __main__ - Step 21480: {'lr': 0.0004789319985648454, 'samples': 4124160, 'steps': 21479, 'loss/train': 1.4364922046661377} 11/07/2021 00:15:44 - INFO - __main__ - Step 21481: {'lr': 0.0004789298662699905, 'samples': 4124352, 'steps': 21480, 'loss/train': 1.1518300771713257} 11/07/2021 00:15:44 - INFO - __main__ - Step 21482: {'lr': 0.0004789277338719832, 'samples': 4124544, 'steps': 21481, 'loss/train': 1.8052656650543213} 11/07/2021 00:15:45 - INFO - __main__ - Step 21483: {'lr': 0.0004789256013708246, 'samples': 4124736, 'steps': 21482, 'loss/train': 1.718207597732544} 11/07/2021 00:15:46 - INFO - __main__ - Step 21484: {'lr': 0.0004789234687665158, 'samples': 4124928, 'steps': 21483, 'loss/train': 1.4353058338165283} 11/07/2021 00:15:46 - INFO - __main__ - Step 21485: {'lr': 0.0004789213360590575, 'samples': 4125120, 'steps': 21484, 'loss/train': 1.7839090824127197} 11/07/2021 00:15:47 - INFO - __main__ - Step 21486: {'lr': 0.00047891920324845085, 'samples': 4125312, 'steps': 21485, 'loss/train': 1.8160103559494019} 11/07/2021 00:15:47 - INFO - __main__ - Step 21487: {'lr': 0.00047891707033469665, 'samples': 4125504, 'steps': 21486, 'loss/train': 1.8308603763580322} 11/07/2021 00:15:47 - INFO - __main__ - Step 21488: {'lr': 0.00047891493731779607, 'samples': 4125696, 'steps': 21487, 'loss/train': 1.8766554594039917} 11/07/2021 00:15:48 - INFO - __main__ - Step 21489: {'lr': 0.00047891280419774985, 'samples': 4125888, 'steps': 21488, 'loss/train': 1.0840020179748535} 11/07/2021 00:15:49 - INFO - __main__ - Step 21490: {'lr': 0.0004789106709745591, 'samples': 4126080, 'steps': 21489, 'loss/train': 1.5869641304016113} 11/07/2021 00:15:49 - INFO - __main__ - Step 21491: {'lr': 0.0004789085376482247, 'samples': 4126272, 'steps': 21490, 'loss/train': 1.71488356590271} 11/07/2021 00:15:49 - INFO - __main__ - Step 21492: {'lr': 0.00047890640421874775, 'samples': 4126464, 'steps': 21491, 'loss/train': 1.5513404607772827} 11/07/2021 00:15:50 - INFO - __main__ - Step 21493: {'lr': 0.000478904270686129, 'samples': 4126656, 'steps': 21492, 'loss/train': 0.9947566390037537} 11/07/2021 00:15:51 - INFO - __main__ - Step 21494: {'lr': 0.00047890213705036955, 'samples': 4126848, 'steps': 21493, 'loss/train': 2.104623794555664} 11/07/2021 00:15:51 - INFO - __main__ - Step 21495: {'lr': 0.00047890000331147033, 'samples': 4127040, 'steps': 21494, 'loss/train': 1.1416540145874023} 11/07/2021 00:15:51 - INFO - __main__ - Step 21496: {'lr': 0.0004788978694694323, 'samples': 4127232, 'steps': 21495, 'loss/train': 1.2822664976119995} 11/07/2021 00:15:52 - INFO - __main__ - Step 21497: {'lr': 0.0004788957355242564, 'samples': 4127424, 'steps': 21496, 'loss/train': 1.5804579257965088} 11/07/2021 00:15:52 - INFO - __main__ - Step 21498: {'lr': 0.00047889360147594363, 'samples': 4127616, 'steps': 21497, 'loss/train': 1.338382363319397} 11/07/2021 00:15:53 - INFO - __main__ - Step 21499: {'lr': 0.00047889146732449497, 'samples': 4127808, 'steps': 21498, 'loss/train': 1.7818214893341064} 11/07/2021 00:15:53 - INFO - __main__ - Step 21500: {'lr': 0.00047888933306991136, 'samples': 4128000, 'steps': 21499, 'loss/train': 2.0857651233673096} 11/07/2021 00:15:54 - INFO - __main__ - Step 21501: {'lr': 0.00047888719871219367, 'samples': 4128192, 'steps': 21500, 'loss/train': 1.147006630897522} 11/07/2021 00:15:54 - INFO - __main__ - Step 21502: {'lr': 0.00047888506425134293, 'samples': 4128384, 'steps': 21501, 'loss/train': 1.8140604496002197} 11/07/2021 00:15:54 - INFO - __main__ - Step 21503: {'lr': 0.0004788829296873601, 'samples': 4128576, 'steps': 21502, 'loss/train': 1.6200863122940063} 11/07/2021 00:15:56 - INFO - __main__ - Step 21504: {'lr': 0.0004788807950202463, 'samples': 4128768, 'steps': 21503, 'loss/train': 1.7840808629989624} 11/07/2021 00:15:56 - INFO - __main__ - Step 21505: {'lr': 0.00047887866025000226, 'samples': 4128960, 'steps': 21504, 'loss/train': 1.6177959442138672} 11/07/2021 00:15:56 - INFO - __main__ - Step 21506: {'lr': 0.000478876525376629, 'samples': 4129152, 'steps': 21505, 'loss/train': 0.20047274231910706} 11/07/2021 00:15:57 - INFO - __main__ - Step 21507: {'lr': 0.00047887439040012755, 'samples': 4129344, 'steps': 21506, 'loss/train': 1.625873327255249} 11/07/2021 00:15:57 - INFO - __main__ - Step 21508: {'lr': 0.0004788722553204988, 'samples': 4129536, 'steps': 21507, 'loss/train': 2.040313720703125} 11/07/2021 00:15:57 - INFO - __main__ - Step 21509: {'lr': 0.0004788701201377438, 'samples': 4129728, 'steps': 21508, 'loss/train': 1.7073478698730469} 11/07/2021 00:15:58 - INFO - __main__ - Step 21510: {'lr': 0.0004788679848518633, 'samples': 4129920, 'steps': 21509, 'loss/train': 2.4196596145629883} 11/07/2021 00:15:59 - INFO - __main__ - Step 21511: {'lr': 0.0004788658494628586, 'samples': 4130112, 'steps': 21510, 'loss/train': 1.1434111595153809} 11/07/2021 00:15:59 - INFO - __main__ - Step 21512: {'lr': 0.0004788637139707304, 'samples': 4130304, 'steps': 21511, 'loss/train': 1.6110708713531494} 11/07/2021 00:15:59 - INFO - __main__ - Step 21513: {'lr': 0.00047886157837547975, 'samples': 4130496, 'steps': 21512, 'loss/train': 1.1308202743530273} 11/07/2021 00:16:00 - INFO - __main__ - Step 21514: {'lr': 0.0004788594426771076, 'samples': 4130688, 'steps': 21513, 'loss/train': 1.610917329788208} 11/07/2021 00:16:01 - INFO - __main__ - Step 21515: {'lr': 0.0004788573068756149, 'samples': 4130880, 'steps': 21514, 'loss/train': 1.7047597169876099} 11/07/2021 00:16:01 - INFO - __main__ - Step 21516: {'lr': 0.0004788551709710027, 'samples': 4131072, 'steps': 21515, 'loss/train': 1.753533124923706} 11/07/2021 00:16:01 - INFO - __main__ - Step 21517: {'lr': 0.0004788530349632718, 'samples': 4131264, 'steps': 21516, 'loss/train': 1.1571195125579834} 11/07/2021 00:16:02 - INFO - __main__ - Step 21518: {'lr': 0.00047885089885242333, 'samples': 4131456, 'steps': 21517, 'loss/train': 1.3207316398620605} 11/07/2021 00:16:02 - INFO - __main__ - Step 21519: {'lr': 0.0004788487626384581, 'samples': 4131648, 'steps': 21518, 'loss/train': 1.6291325092315674} 11/07/2021 00:16:03 - INFO - __main__ - Step 21520: {'lr': 0.0004788466263213772, 'samples': 4131840, 'steps': 21519, 'loss/train': 1.5563868284225464} 11/07/2021 00:16:03 - INFO - __main__ - Step 21521: {'lr': 0.00047884448990118155, 'samples': 4132032, 'steps': 21520, 'loss/train': 1.9897916316986084} 11/07/2021 00:16:04 - INFO - __main__ - Step 21522: {'lr': 0.0004788423533778721, 'samples': 4132224, 'steps': 21521, 'loss/train': 2.186905860900879} 11/07/2021 00:16:04 - INFO - __main__ - Step 21523: {'lr': 0.00047884021675144987, 'samples': 4132416, 'steps': 21522, 'loss/train': 1.3714509010314941} 11/07/2021 00:16:04 - INFO - __main__ - Step 21524: {'lr': 0.0004788380800219156, 'samples': 4132608, 'steps': 21523, 'loss/train': 1.524144172668457} 11/07/2021 00:16:06 - INFO - __main__ - Step 21525: {'lr': 0.0004788359431892706, 'samples': 4132800, 'steps': 21524, 'loss/train': 0.893682062625885} 11/07/2021 00:16:06 - INFO - __main__ - Step 21526: {'lr': 0.00047883380625351557, 'samples': 4132992, 'steps': 21525, 'loss/train': 1.6024296283721924} 11/07/2021 00:16:06 - INFO - __main__ - Step 21527: {'lr': 0.00047883166921465156, 'samples': 4133184, 'steps': 21526, 'loss/train': 1.9296703338623047} 11/07/2021 00:16:07 - INFO - __main__ - Step 21528: {'lr': 0.00047882953207267954, 'samples': 4133376, 'steps': 21527, 'loss/train': 1.8911947011947632} 11/07/2021 00:16:07 - INFO - __main__ - Step 21529: {'lr': 0.00047882739482760044, 'samples': 4133568, 'steps': 21528, 'loss/train': 1.9235213994979858} 11/07/2021 00:16:08 - INFO - __main__ - Step 21530: {'lr': 0.0004788252574794153, 'samples': 4133760, 'steps': 21529, 'loss/train': 1.90152108669281} 11/07/2021 00:16:08 - INFO - __main__ - Step 21531: {'lr': 0.000478823120028125, 'samples': 4133952, 'steps': 21530, 'loss/train': 1.815036654472351} 11/07/2021 00:16:09 - INFO - __main__ - Step 21532: {'lr': 0.0004788209824737305, 'samples': 4134144, 'steps': 21531, 'loss/train': 1.7011241912841797} 11/07/2021 00:16:09 - INFO - __main__ - Step 21533: {'lr': 0.00047881884481623286, 'samples': 4134336, 'steps': 21532, 'loss/train': 1.5776305198669434} 11/07/2021 00:16:10 - INFO - __main__ - Step 21534: {'lr': 0.000478816707055633, 'samples': 4134528, 'steps': 21533, 'loss/train': 1.2953623533248901} 11/07/2021 00:16:11 - INFO - __main__ - Step 21535: {'lr': 0.0004788145691919318, 'samples': 4134720, 'steps': 21534, 'loss/train': 1.5415441989898682} 11/07/2021 00:16:11 - INFO - __main__ - Step 21536: {'lr': 0.0004788124312251303, 'samples': 4134912, 'steps': 21535, 'loss/train': 1.6723524332046509} 11/07/2021 00:16:11 - INFO - __main__ - Step 21537: {'lr': 0.0004788102931552294, 'samples': 4135104, 'steps': 21536, 'loss/train': 2.1559436321258545} 11/07/2021 00:16:12 - INFO - __main__ - Step 21538: {'lr': 0.0004788081549822302, 'samples': 4135296, 'steps': 21537, 'loss/train': 2.0337588787078857} 11/07/2021 00:16:12 - INFO - __main__ - Step 21539: {'lr': 0.0004788060167061335, 'samples': 4135488, 'steps': 21538, 'loss/train': 2.108020782470703} 11/07/2021 00:16:13 - INFO - __main__ - Step 21540: {'lr': 0.0004788038783269404, 'samples': 4135680, 'steps': 21539, 'loss/train': 1.685656189918518} 11/07/2021 00:16:13 - INFO - __main__ - Step 21541: {'lr': 0.00047880173984465174, 'samples': 4135872, 'steps': 21540, 'loss/train': 1.431290626525879} 11/07/2021 00:16:14 - INFO - __main__ - Step 21542: {'lr': 0.0004787996012592686, 'samples': 4136064, 'steps': 21541, 'loss/train': 1.244078278541565} 11/07/2021 00:16:14 - INFO - __main__ - Step 21543: {'lr': 0.0004787974625707919, 'samples': 4136256, 'steps': 21542, 'loss/train': 1.6926815509796143} 11/07/2021 00:16:14 - INFO - __main__ - Step 21544: {'lr': 0.0004787953237792225, 'samples': 4136448, 'steps': 21543, 'loss/train': 1.9875998497009277} 11/07/2021 00:16:15 - INFO - __main__ - Step 21545: {'lr': 0.0004787931848845616, 'samples': 4136640, 'steps': 21544, 'loss/train': 1.8045618534088135} 11/07/2021 00:16:16 - INFO - __main__ - Step 21546: {'lr': 0.00047879104588680987, 'samples': 4136832, 'steps': 21545, 'loss/train': 1.706560492515564} 11/07/2021 00:16:16 - INFO - __main__ - Step 21547: {'lr': 0.00047878890678596854, 'samples': 4137024, 'steps': 21546, 'loss/train': 1.979650616645813} 11/07/2021 00:16:16 - INFO - __main__ - Step 21548: {'lr': 0.00047878676758203844, 'samples': 4137216, 'steps': 21547, 'loss/train': 1.6284128427505493} 11/07/2021 00:16:17 - INFO - __main__ - Step 21549: {'lr': 0.00047878462827502055, 'samples': 4137408, 'steps': 21548, 'loss/train': 1.7756991386413574} 11/07/2021 00:16:17 - INFO - __main__ - Step 21550: {'lr': 0.0004787824888649158, 'samples': 4137600, 'steps': 21549, 'loss/train': 1.996254324913025} 11/07/2021 00:16:18 - INFO - __main__ - Step 21551: {'lr': 0.0004787803493517252, 'samples': 4137792, 'steps': 21550, 'loss/train': 1.7516309022903442} 11/07/2021 00:16:18 - INFO - __main__ - Step 21552: {'lr': 0.0004787782097354497, 'samples': 4137984, 'steps': 21551, 'loss/train': 1.415780782699585} 11/07/2021 00:16:19 - INFO - __main__ - Step 21553: {'lr': 0.00047877607001609035, 'samples': 4138176, 'steps': 21552, 'loss/train': 1.233115553855896} 11/07/2021 00:16:19 - INFO - __main__ - Step 21554: {'lr': 0.00047877393019364796, 'samples': 4138368, 'steps': 21553, 'loss/train': 1.7765933275222778} 11/07/2021 00:16:19 - INFO - __main__ - Step 21555: {'lr': 0.0004787717902681236, 'samples': 4138560, 'steps': 21554, 'loss/train': 1.6584759950637817} 11/07/2021 00:16:21 - INFO - __main__ - Step 21556: {'lr': 0.00047876965023951814, 'samples': 4138752, 'steps': 21555, 'loss/train': 1.7891385555267334} 11/07/2021 00:16:21 - INFO - __main__ - Step 21557: {'lr': 0.00047876751010783266, 'samples': 4138944, 'steps': 21556, 'loss/train': 1.8446043729782104} 11/07/2021 00:16:21 - INFO - __main__ - Step 21558: {'lr': 0.0004787653698730681, 'samples': 4139136, 'steps': 21557, 'loss/train': 1.8145853281021118} 11/07/2021 00:16:22 - INFO - __main__ - Step 21559: {'lr': 0.00047876322953522535, 'samples': 4139328, 'steps': 21558, 'loss/train': 1.5460792779922485} 11/07/2021 00:16:22 - INFO - __main__ - Step 21560: {'lr': 0.00047876108909430536, 'samples': 4139520, 'steps': 21559, 'loss/train': 1.2707014083862305} 11/07/2021 00:16:23 - INFO - __main__ - Step 21561: {'lr': 0.00047875894855030923, 'samples': 4139712, 'steps': 21560, 'loss/train': 1.3773596286773682} 11/07/2021 00:16:23 - INFO - __main__ - Step 21562: {'lr': 0.00047875680790323785, 'samples': 4139904, 'steps': 21561, 'loss/train': 1.5871561765670776} 11/07/2021 00:16:24 - INFO - __main__ - Step 21563: {'lr': 0.0004787546671530921, 'samples': 4140096, 'steps': 21562, 'loss/train': 1.8902066946029663} 11/07/2021 00:16:24 - INFO - __main__ - Step 21564: {'lr': 0.0004787525262998731, 'samples': 4140288, 'steps': 21563, 'loss/train': 1.7794413566589355} 11/07/2021 00:16:24 - INFO - __main__ - Step 21565: {'lr': 0.0004787503853435817, 'samples': 4140480, 'steps': 21564, 'loss/train': 1.6013624668121338} 11/07/2021 00:16:26 - INFO - __main__ - Step 21566: {'lr': 0.00047874824428421897, 'samples': 4140672, 'steps': 21565, 'loss/train': 1.7130855321884155} 11/07/2021 00:16:26 - INFO - __main__ - Step 21567: {'lr': 0.0004787461031217858, 'samples': 4140864, 'steps': 21566, 'loss/train': 1.4191340208053589} 11/07/2021 00:16:26 - INFO - __main__ - Step 21568: {'lr': 0.0004787439618562831, 'samples': 4141056, 'steps': 21567, 'loss/train': 0.9844514727592468} 11/07/2021 00:16:27 - INFO - __main__ - Step 21569: {'lr': 0.000478741820487712, 'samples': 4141248, 'steps': 21568, 'loss/train': 1.362804889678955} 11/07/2021 00:16:27 - INFO - __main__ - Step 21570: {'lr': 0.0004787396790160733, 'samples': 4141440, 'steps': 21569, 'loss/train': 1.464184284210205} 11/07/2021 00:16:28 - INFO - __main__ - Step 21571: {'lr': 0.00047873753744136807, 'samples': 4141632, 'steps': 21570, 'loss/train': 1.4976603984832764} 11/07/2021 00:16:29 - INFO - __main__ - Step 21572: {'lr': 0.0004787353957635971, 'samples': 4141824, 'steps': 21571, 'loss/train': 1.813546895980835} 11/07/2021 00:16:29 - INFO - __main__ - Step 21573: {'lr': 0.0004787332539827617, 'samples': 4142016, 'steps': 21572, 'loss/train': 1.614606261253357} 11/07/2021 00:16:29 - INFO - __main__ - Step 21574: {'lr': 0.00047873111209886245, 'samples': 4142208, 'steps': 21573, 'loss/train': 1.6597931385040283} 11/07/2021 00:16:30 - INFO - __main__ - Step 21575: {'lr': 0.00047872897011190063, 'samples': 4142400, 'steps': 21574, 'loss/train': 0.855787456035614} 11/07/2021 00:16:30 - INFO - __main__ - Step 21576: {'lr': 0.00047872682802187693, 'samples': 4142592, 'steps': 21575, 'loss/train': 1.3936561346054077} 11/07/2021 00:16:31 - INFO - __main__ - Step 21577: {'lr': 0.0004787246858287926, 'samples': 4142784, 'steps': 21576, 'loss/train': 2.4061551094055176} 11/07/2021 00:16:31 - INFO - __main__ - Step 21578: {'lr': 0.0004787225435326483, 'samples': 4142976, 'steps': 21577, 'loss/train': 1.734849452972412} 11/07/2021 00:16:32 - INFO - __main__ - Step 21579: {'lr': 0.0004787204011334453, 'samples': 4143168, 'steps': 21578, 'loss/train': 1.6250495910644531} 11/07/2021 00:16:32 - INFO - __main__ - Step 21580: {'lr': 0.0004787182586311843, 'samples': 4143360, 'steps': 21579, 'loss/train': 1.4872287511825562} 11/07/2021 00:16:32 - INFO - __main__ - Step 21581: {'lr': 0.0004787161160258664, 'samples': 4143552, 'steps': 21580, 'loss/train': 1.6663235425949097} 11/07/2021 00:16:33 - INFO - __main__ - Step 21582: {'lr': 0.00047871397331749254, 'samples': 4143744, 'steps': 21581, 'loss/train': 0.5923960208892822} 11/07/2021 00:16:34 - INFO - __main__ - Step 21583: {'lr': 0.00047871183050606376, 'samples': 4143936, 'steps': 21582, 'loss/train': 1.2236347198486328} 11/07/2021 00:16:34 - INFO - __main__ - Step 21584: {'lr': 0.00047870968759158096, 'samples': 4144128, 'steps': 21583, 'loss/train': 1.7751884460449219} 11/07/2021 00:16:34 - INFO - __main__ - Step 21585: {'lr': 0.000478707544574045, 'samples': 4144320, 'steps': 21584, 'loss/train': 1.1830782890319824} 11/07/2021 00:16:35 - INFO - __main__ - Step 21586: {'lr': 0.000478705401453457, 'samples': 4144512, 'steps': 21585, 'loss/train': 0.7636067867279053} 11/07/2021 00:16:36 - INFO - __main__ - Step 21587: {'lr': 0.000478703258229818, 'samples': 4144704, 'steps': 21586, 'loss/train': 0.18369045853614807} 11/07/2021 00:16:36 - INFO - __main__ - Step 21588: {'lr': 0.0004787011149031287, 'samples': 4144896, 'steps': 21587, 'loss/train': 1.3815462589263916} 11/07/2021 00:16:36 - INFO - __main__ - Step 21589: {'lr': 0.0004786989714733902, 'samples': 4145088, 'steps': 21588, 'loss/train': 0.8793774843215942} 11/07/2021 00:16:37 - INFO - __main__ - Step 21590: {'lr': 0.0004786968279406035, 'samples': 4145280, 'steps': 21589, 'loss/train': 1.5908639430999756} 11/07/2021 00:16:37 - INFO - __main__ - Step 21591: {'lr': 0.0004786946843047696, 'samples': 4145472, 'steps': 21590, 'loss/train': 1.7688531875610352} 11/07/2021 00:16:38 - INFO - __main__ - Step 21592: {'lr': 0.00047869254056588927, 'samples': 4145664, 'steps': 21591, 'loss/train': 1.64926016330719} 11/07/2021 00:16:39 - INFO - __main__ - Step 21593: {'lr': 0.0004786903967239637, 'samples': 4145856, 'steps': 21592, 'loss/train': 1.653084397315979} 11/07/2021 00:16:39 - INFO - __main__ - Step 21594: {'lr': 0.0004786882527789938, 'samples': 4146048, 'steps': 21593, 'loss/train': 2.0507147312164307} 11/07/2021 00:16:39 - INFO - __main__ - Step 21595: {'lr': 0.00047868610873098047, 'samples': 4146240, 'steps': 21594, 'loss/train': 1.6787335872650146} 11/07/2021 00:16:40 - INFO - __main__ - Step 21596: {'lr': 0.0004786839645799247, 'samples': 4146432, 'steps': 21595, 'loss/train': 1.8469722270965576} 11/07/2021 00:16:41 - INFO - __main__ - Step 21597: {'lr': 0.00047868182032582746, 'samples': 4146624, 'steps': 21596, 'loss/train': 1.7321794033050537} 11/07/2021 00:16:41 - INFO - __main__ - Step 21598: {'lr': 0.00047867967596868974, 'samples': 4146816, 'steps': 21597, 'loss/train': 1.0293926000595093} 11/07/2021 00:16:41 - INFO - __main__ - Step 21599: {'lr': 0.00047867753150851244, 'samples': 4147008, 'steps': 21598, 'loss/train': 1.51420259475708} 11/07/2021 00:16:42 - INFO - __main__ - Step 21600: {'lr': 0.0004786753869452966, 'samples': 4147200, 'steps': 21599, 'loss/train': 1.0068912506103516} 11/07/2021 00:16:42 - INFO - __main__ - Step 21601: {'lr': 0.00047867324227904317, 'samples': 4147392, 'steps': 21600, 'loss/train': 1.7972041368484497} 11/07/2021 00:16:43 - INFO - __main__ - Step 21602: {'lr': 0.0004786710975097531, 'samples': 4147584, 'steps': 21601, 'loss/train': 1.9969254732131958} 11/07/2021 00:16:43 - INFO - __main__ - Step 21603: {'lr': 0.0004786689526374274, 'samples': 4147776, 'steps': 21602, 'loss/train': 1.5673604011535645} 11/07/2021 00:16:44 - INFO - __main__ - Step 21604: {'lr': 0.00047866680766206693, 'samples': 4147968, 'steps': 21603, 'loss/train': 1.7802320718765259} 11/07/2021 00:16:44 - INFO - __main__ - Step 21605: {'lr': 0.0004786646625836727, 'samples': 4148160, 'steps': 21604, 'loss/train': 1.153442144393921} 11/07/2021 00:16:44 - INFO - __main__ - Step 21606: {'lr': 0.0004786625174022458, 'samples': 4148352, 'steps': 21605, 'loss/train': 1.1935527324676514} 11/07/2021 00:16:45 - INFO - __main__ - Step 21607: {'lr': 0.00047866037211778705, 'samples': 4148544, 'steps': 21606, 'loss/train': 0.29927515983581543} 11/07/2021 00:16:46 - INFO - __main__ - Step 21608: {'lr': 0.0004786582267302975, 'samples': 4148736, 'steps': 21607, 'loss/train': 0.8351354002952576} 11/07/2021 00:16:47 - INFO - __main__ - Step 21609: {'lr': 0.000478656081239778, 'samples': 4148928, 'steps': 21608, 'loss/train': 1.8804010152816772} 11/07/2021 00:16:47 - INFO - __main__ - Step 21610: {'lr': 0.0004786539356462297, 'samples': 4149120, 'steps': 21609, 'loss/train': 1.4697155952453613} 11/07/2021 00:16:47 - INFO - __main__ - Step 21611: {'lr': 0.0004786517899496534, 'samples': 4149312, 'steps': 21610, 'loss/train': 1.9372228384017944} 11/07/2021 00:16:48 - INFO - __main__ - Step 21612: {'lr': 0.0004786496441500502, 'samples': 4149504, 'steps': 21611, 'loss/train': 1.3799189329147339} 11/07/2021 00:16:49 - INFO - __main__ - Step 21613: {'lr': 0.00047864749824742093, 'samples': 4149696, 'steps': 21612, 'loss/train': 1.8028512001037598} 11/07/2021 00:16:49 - INFO - __main__ - Step 21614: {'lr': 0.00047864535224176666, 'samples': 4149888, 'steps': 21613, 'loss/train': 1.2140483856201172} 11/07/2021 00:16:49 - INFO - __main__ - Step 21615: {'lr': 0.0004786432061330882, 'samples': 4150080, 'steps': 21614, 'loss/train': 1.3646620512008667} 11/07/2021 00:16:50 - INFO - __main__ - Step 21616: {'lr': 0.0004786410599213868, 'samples': 4150272, 'steps': 21615, 'loss/train': 1.2908267974853516} 11/07/2021 00:16:50 - INFO - __main__ - Step 21617: {'lr': 0.00047863891360666323, 'samples': 4150464, 'steps': 21616, 'loss/train': 1.5464023351669312} 11/07/2021 00:16:51 - INFO - __main__ - Step 21618: {'lr': 0.00047863676718891846, 'samples': 4150656, 'steps': 21617, 'loss/train': 1.6251708269119263} 11/07/2021 00:16:51 - INFO - __main__ - Step 21619: {'lr': 0.0004786346206681535, 'samples': 4150848, 'steps': 21618, 'loss/train': 1.4503111839294434} 11/07/2021 00:16:52 - INFO - __main__ - Step 21620: {'lr': 0.0004786324740443693, 'samples': 4151040, 'steps': 21619, 'loss/train': 1.2549147605895996} 11/07/2021 00:16:52 - INFO - __main__ - Step 21621: {'lr': 0.00047863032731756684, 'samples': 4151232, 'steps': 21620, 'loss/train': 1.467907428741455} 11/07/2021 00:16:53 - INFO - __main__ - Step 21622: {'lr': 0.0004786281804877471, 'samples': 4151424, 'steps': 21621, 'loss/train': 1.2028931379318237} 11/07/2021 00:16:54 - INFO - __main__ - Step 21623: {'lr': 0.00047862603355491103, 'samples': 4151616, 'steps': 21622, 'loss/train': 1.0954643487930298} 11/07/2021 00:16:54 - INFO - __main__ - Step 21624: {'lr': 0.0004786238865190595, 'samples': 4151808, 'steps': 21623, 'loss/train': 1.7945252656936646} 11/07/2021 00:16:54 - INFO - __main__ - Step 21625: {'lr': 0.0004786217393801937, 'samples': 4152000, 'steps': 21624, 'loss/train': 1.5487359762191772} 11/07/2021 00:16:55 - INFO - __main__ - Step 21626: {'lr': 0.00047861959213831446, 'samples': 4152192, 'steps': 21625, 'loss/train': 1.6223113536834717} 11/07/2021 00:16:55 - INFO - __main__ - Step 21627: {'lr': 0.0004786174447934227, 'samples': 4152384, 'steps': 21626, 'loss/train': 1.4051604270935059} 11/07/2021 00:16:56 - INFO - __main__ - Step 21628: {'lr': 0.0004786152973455195, 'samples': 4152576, 'steps': 21627, 'loss/train': 1.345228672027588} 11/07/2021 00:16:56 - INFO - __main__ - Step 21629: {'lr': 0.0004786131497946058, 'samples': 4152768, 'steps': 21628, 'loss/train': 0.9680488705635071} 11/07/2021 00:16:57 - INFO - __main__ - Step 21630: {'lr': 0.0004786110021406824, 'samples': 4152960, 'steps': 21629, 'loss/train': 1.176362156867981} 11/07/2021 00:16:57 - INFO - __main__ - Step 21631: {'lr': 0.0004786088543837506, 'samples': 4153152, 'steps': 21630, 'loss/train': 1.5705360174179077} 11/07/2021 00:16:58 - INFO - __main__ - Step 21632: {'lr': 0.00047860670652381105, 'samples': 4153344, 'steps': 21631, 'loss/train': 1.987221598625183} 11/07/2021 00:16:58 - INFO - __main__ - Step 21633: {'lr': 0.00047860455856086487, 'samples': 4153536, 'steps': 21632, 'loss/train': 0.9786133170127869} 11/07/2021 00:16:59 - INFO - __main__ - Step 21634: {'lr': 0.00047860241049491303, 'samples': 4153728, 'steps': 21633, 'loss/train': 1.5743129253387451} 11/07/2021 00:16:59 - INFO - __main__ - Step 21635: {'lr': 0.00047860026232595645, 'samples': 4153920, 'steps': 21634, 'loss/train': 1.7790687084197998} 11/07/2021 00:17:00 - INFO - __main__ - Step 21636: {'lr': 0.0004785981140539961, 'samples': 4154112, 'steps': 21635, 'loss/train': 1.8462978601455688} 11/07/2021 00:17:00 - INFO - __main__ - Step 21637: {'lr': 0.000478595965679033, 'samples': 4154304, 'steps': 21636, 'loss/train': 1.3746877908706665} 11/07/2021 00:17:00 - INFO - __main__ - Step 21638: {'lr': 0.0004785938172010681, 'samples': 4154496, 'steps': 21637, 'loss/train': 1.551901936531067} 11/07/2021 00:17:01 - INFO - __main__ - Step 21639: {'lr': 0.0004785916686201023, 'samples': 4154688, 'steps': 21638, 'loss/train': 2.32190203666687} 11/07/2021 00:17:02 - INFO - __main__ - Step 21640: {'lr': 0.00047858951993613665, 'samples': 4154880, 'steps': 21639, 'loss/train': 1.568753957748413} 11/07/2021 00:17:02 - INFO - __main__ - Step 21641: {'lr': 0.0004785873711491721, 'samples': 4155072, 'steps': 21640, 'loss/train': 2.010477304458618} 11/07/2021 00:17:02 - INFO - __main__ - Step 21642: {'lr': 0.00047858522225920964, 'samples': 4155264, 'steps': 21641, 'loss/train': 0.8399590253829956} 11/07/2021 00:17:03 - INFO - __main__ - Step 21643: {'lr': 0.00047858307326625014, 'samples': 4155456, 'steps': 21642, 'loss/train': 1.774276614189148} 11/07/2021 00:17:04 - INFO - __main__ - Step 21644: {'lr': 0.00047858092417029464, 'samples': 4155648, 'steps': 21643, 'loss/train': 1.7628326416015625} 11/07/2021 00:17:04 - INFO - __main__ - Step 21645: {'lr': 0.00047857877497134416, 'samples': 4155840, 'steps': 21644, 'loss/train': 1.2147215604782104} 11/07/2021 00:17:04 - INFO - __main__ - Step 21646: {'lr': 0.0004785766256693995, 'samples': 4156032, 'steps': 21645, 'loss/train': 1.5541294813156128} 11/07/2021 00:17:05 - INFO - __main__ - Step 21647: {'lr': 0.0004785744762644619, 'samples': 4156224, 'steps': 21646, 'loss/train': 1.1660066843032837} 11/07/2021 00:17:05 - INFO - __main__ - Step 21648: {'lr': 0.00047857232675653207, 'samples': 4156416, 'steps': 21647, 'loss/train': 2.1722161769866943} 11/07/2021 00:17:06 - INFO - __main__ - Step 21649: {'lr': 0.00047857017714561105, 'samples': 4156608, 'steps': 21648, 'loss/train': 1.5793182849884033} 11/07/2021 00:17:07 - INFO - __main__ - Step 21650: {'lr': 0.00047856802743169994, 'samples': 4156800, 'steps': 21649, 'loss/train': 1.588982343673706} 11/07/2021 00:17:07 - INFO - __main__ - Step 21651: {'lr': 0.00047856587761479954, 'samples': 4156992, 'steps': 21650, 'loss/train': 1.7212626934051514} 11/07/2021 00:17:07 - INFO - __main__ - Step 21652: {'lr': 0.00047856372769491083, 'samples': 4157184, 'steps': 21651, 'loss/train': 1.6415342092514038} 11/07/2021 00:17:08 - INFO - __main__ - Step 21653: {'lr': 0.0004785615776720349, 'samples': 4157376, 'steps': 21652, 'loss/train': 1.7163211107254028} 11/07/2021 00:17:09 - INFO - __main__ - Step 21654: {'lr': 0.0004785594275461726, 'samples': 4157568, 'steps': 21653, 'loss/train': 1.6288830041885376} 11/07/2021 00:17:09 - INFO - __main__ - Step 21655: {'lr': 0.00047855727731732503, 'samples': 4157760, 'steps': 21654, 'loss/train': 1.8189113140106201} 11/07/2021 00:17:09 - INFO - __main__ - Step 21656: {'lr': 0.00047855512698549295, 'samples': 4157952, 'steps': 21655, 'loss/train': 1.7791290283203125} 11/07/2021 00:17:10 - INFO - __main__ - Step 21657: {'lr': 0.00047855297655067754, 'samples': 4158144, 'steps': 21656, 'loss/train': 1.4705885648727417} 11/07/2021 00:17:10 - INFO - __main__ - Step 21658: {'lr': 0.0004785508260128797, 'samples': 4158336, 'steps': 21657, 'loss/train': 1.4176886081695557} 11/07/2021 00:17:11 - INFO - __main__ - Step 21659: {'lr': 0.00047854867537210034, 'samples': 4158528, 'steps': 21658, 'loss/train': 1.7595988512039185} 11/07/2021 00:17:12 - INFO - __main__ - Step 21660: {'lr': 0.00047854652462834055, 'samples': 4158720, 'steps': 21659, 'loss/train': 1.7756456136703491} 11/07/2021 00:17:12 - INFO - __main__ - Step 21661: {'lr': 0.0004785443737816012, 'samples': 4158912, 'steps': 21660, 'loss/train': 5.061169147491455} 11/07/2021 00:17:12 - INFO - __main__ - Step 21662: {'lr': 0.0004785422228318832, 'samples': 4159104, 'steps': 21661, 'loss/train': 4.935312747955322} 11/07/2021 00:17:13 - INFO - __main__ - Step 21663: {'lr': 0.0004785400717791877, 'samples': 4159296, 'steps': 21662, 'loss/train': 2.0939254760742188} 11/07/2021 00:17:13 - INFO - __main__ - Step 21664: {'lr': 0.0004785379206235155, 'samples': 4159488, 'steps': 21663, 'loss/train': 1.599906086921692} 11/07/2021 00:17:15 - INFO - __main__ - Step 21665: {'lr': 0.00047853576936486764, 'samples': 4159680, 'steps': 21664, 'loss/train': 1.3704638481140137} 11/07/2021 00:17:15 - INFO - __main__ - Step 21666: {'lr': 0.00047853361800324516, 'samples': 4159872, 'steps': 21665, 'loss/train': 1.3747199773788452} 11/07/2021 00:17:15 - INFO - __main__ - Step 21667: {'lr': 0.0004785314665386489, 'samples': 4160064, 'steps': 21666, 'loss/train': 1.5270380973815918} 11/07/2021 00:17:16 - INFO - __main__ - Step 21668: {'lr': 0.00047852931497107987, 'samples': 4160256, 'steps': 21667, 'loss/train': 0.6752262115478516} 11/07/2021 00:17:16 - INFO - __main__ - Step 21669: {'lr': 0.0004785271633005391, 'samples': 4160448, 'steps': 21668, 'loss/train': 1.6318258047103882} 11/07/2021 00:17:16 - INFO - __main__ - Step 21670: {'lr': 0.0004785250115270275, 'samples': 4160640, 'steps': 21669, 'loss/train': 1.8416980504989624} 11/07/2021 00:17:17 - INFO - __main__ - Step 21671: {'lr': 0.00047852285965054606, 'samples': 4160832, 'steps': 21670, 'loss/train': 1.454700231552124} 11/07/2021 00:17:18 - INFO - __main__ - Step 21672: {'lr': 0.00047852070767109573, 'samples': 4161024, 'steps': 21671, 'loss/train': 1.5401418209075928} 11/07/2021 00:17:18 - INFO - __main__ - Step 21673: {'lr': 0.00047851855558867754, 'samples': 4161216, 'steps': 21672, 'loss/train': 1.6531730890274048} 11/07/2021 00:17:19 - INFO - __main__ - Step 21674: {'lr': 0.0004785164034032924, 'samples': 4161408, 'steps': 21673, 'loss/train': 1.682608962059021} 11/07/2021 00:17:19 - INFO - __main__ - Step 21675: {'lr': 0.0004785142511149412, 'samples': 4161600, 'steps': 21674, 'loss/train': 1.6360929012298584} 11/07/2021 00:17:20 - INFO - __main__ - Step 21676: {'lr': 0.0004785120987236251, 'samples': 4161792, 'steps': 21675, 'loss/train': 1.5555413961410522} 11/07/2021 00:17:20 - INFO - __main__ - Step 21677: {'lr': 0.00047850994622934494, 'samples': 4161984, 'steps': 21676, 'loss/train': 1.3743476867675781} 11/07/2021 00:17:21 - INFO - __main__ - Step 21678: {'lr': 0.0004785077936321018, 'samples': 4162176, 'steps': 21677, 'loss/train': 1.7006787061691284} 11/07/2021 00:17:21 - INFO - __main__ - Step 21679: {'lr': 0.00047850564093189653, 'samples': 4162368, 'steps': 21678, 'loss/train': 1.9067273139953613} 11/07/2021 00:17:21 - INFO - __main__ - Step 21680: {'lr': 0.0004785034881287301, 'samples': 4162560, 'steps': 21679, 'loss/train': 1.681504487991333} 11/07/2021 00:17:22 - INFO - __main__ - Step 21681: {'lr': 0.0004785013352226035, 'samples': 4162752, 'steps': 21680, 'loss/train': 1.96555495262146} 11/07/2021 00:17:23 - INFO - __main__ - Step 21682: {'lr': 0.00047849918221351783, 'samples': 4162944, 'steps': 21681, 'loss/train': 1.4842780828475952} 11/07/2021 00:17:23 - INFO - __main__ - Step 21683: {'lr': 0.0004784970291014739, 'samples': 4163136, 'steps': 21682, 'loss/train': 1.840103268623352} 11/07/2021 00:17:23 - INFO - __main__ - Step 21684: {'lr': 0.0004784948758864727, 'samples': 4163328, 'steps': 21683, 'loss/train': 1.3767606019973755} 11/07/2021 00:17:24 - INFO - __main__ - Step 21685: {'lr': 0.0004784927225685153, 'samples': 4163520, 'steps': 21684, 'loss/train': 1.9550838470458984} 11/07/2021 00:17:24 - INFO - __main__ - Step 21686: {'lr': 0.00047849056914760256, 'samples': 4163712, 'steps': 21685, 'loss/train': 1.6681972742080688} 11/07/2021 00:17:25 - INFO - __main__ - Step 21687: {'lr': 0.00047848841562373557, 'samples': 4163904, 'steps': 21686, 'loss/train': 1.4995859861373901} 11/07/2021 00:17:25 - INFO - __main__ - Step 21688: {'lr': 0.00047848626199691513, 'samples': 4164096, 'steps': 21687, 'loss/train': 1.1082913875579834} 11/07/2021 00:17:26 - INFO - __main__ - Step 21689: {'lr': 0.00047848410826714237, 'samples': 4164288, 'steps': 21688, 'loss/train': 1.973834753036499} 11/07/2021 00:17:26 - INFO - __main__ - Step 21690: {'lr': 0.00047848195443441817, 'samples': 4164480, 'steps': 21689, 'loss/train': 1.8756773471832275} 11/07/2021 00:17:26 - INFO - __main__ - Step 21691: {'lr': 0.0004784798004987435, 'samples': 4164672, 'steps': 21690, 'loss/train': 1.5911768674850464} 11/07/2021 00:17:28 - INFO - __main__ - Step 21692: {'lr': 0.00047847764646011937, 'samples': 4164864, 'steps': 21691, 'loss/train': 2.253713846206665} 11/07/2021 00:17:28 - INFO - __main__ - Step 21693: {'lr': 0.0004784754923185468, 'samples': 4165056, 'steps': 21692, 'loss/train': 1.468682050704956} 11/07/2021 00:17:28 - INFO - __main__ - Step 21694: {'lr': 0.00047847333807402666, 'samples': 4165248, 'steps': 21693, 'loss/train': 1.3054816722869873} 11/07/2021 00:17:29 - INFO - __main__ - Step 21695: {'lr': 0.00047847118372655996, 'samples': 4165440, 'steps': 21694, 'loss/train': 1.7357770204544067} 11/07/2021 00:17:29 - INFO - __main__ - Step 21696: {'lr': 0.00047846902927614767, 'samples': 4165632, 'steps': 21695, 'loss/train': 1.421078085899353} 11/07/2021 00:17:31 - INFO - __main__ - Step 21697: {'lr': 0.0004784668747227907, 'samples': 4165824, 'steps': 21696, 'loss/train': 2.197175979614258} 11/07/2021 00:17:31 - INFO - __main__ - Step 21698: {'lr': 0.00047846472006649016, 'samples': 4166016, 'steps': 21697, 'loss/train': 1.1708250045776367} 11/07/2021 00:17:31 - INFO - __main__ - Step 21699: {'lr': 0.0004784625653072469, 'samples': 4166208, 'steps': 21698, 'loss/train': 1.3094415664672852} 11/07/2021 00:17:32 - INFO - __main__ - Step 21700: {'lr': 0.00047846041044506194, 'samples': 4166400, 'steps': 21699, 'loss/train': 1.742722988128662} 11/07/2021 00:17:32 - INFO - __main__ - Step 21701: {'lr': 0.00047845825547993627, 'samples': 4166592, 'steps': 21700, 'loss/train': 1.8332834243774414} 11/07/2021 00:17:32 - INFO - __main__ - Step 21702: {'lr': 0.0004784561004118708, 'samples': 4166784, 'steps': 21701, 'loss/train': 0.8396803736686707} 11/07/2021 00:17:34 - INFO - __main__ - Step 21703: {'lr': 0.0004784539452408666, 'samples': 4166976, 'steps': 21702, 'loss/train': 1.7695062160491943} 11/07/2021 00:17:34 - INFO - __main__ - Step 21704: {'lr': 0.0004784517899669245, 'samples': 4167168, 'steps': 21703, 'loss/train': 1.3221811056137085} 11/07/2021 00:17:34 - INFO - __main__ - Step 21705: {'lr': 0.00047844963459004565, 'samples': 4167360, 'steps': 21704, 'loss/train': 1.2405515909194946} 11/07/2021 00:17:35 - INFO - __main__ - Step 21706: {'lr': 0.00047844747911023077, 'samples': 4167552, 'steps': 21705, 'loss/train': 1.389700174331665} 11/07/2021 00:17:35 - INFO - __main__ - Step 21707: {'lr': 0.00047844532352748115, 'samples': 4167744, 'steps': 21706, 'loss/train': 1.4688421487808228} 11/07/2021 00:17:35 - INFO - __main__ - Step 21708: {'lr': 0.0004784431678417975, 'samples': 4167936, 'steps': 21707, 'loss/train': 2.301962375640869} 11/07/2021 00:17:36 - INFO - __main__ - Step 21709: {'lr': 0.00047844101205318085, 'samples': 4168128, 'steps': 21708, 'loss/train': 1.7107940912246704} 11/07/2021 00:17:37 - INFO - __main__ - Step 21710: {'lr': 0.0004784388561616323, 'samples': 4168320, 'steps': 21709, 'loss/train': 1.7536522150039673} 11/07/2021 00:17:37 - INFO - __main__ - Step 21711: {'lr': 0.0004784367001671526, 'samples': 4168512, 'steps': 21710, 'loss/train': 1.3904337882995605} 11/07/2021 00:17:37 - INFO - __main__ - Step 21712: {'lr': 0.00047843454406974295, 'samples': 4168704, 'steps': 21711, 'loss/train': 0.41166508197784424} 11/07/2021 00:17:38 - INFO - __main__ - Step 21713: {'lr': 0.00047843238786940423, 'samples': 4168896, 'steps': 21712, 'loss/train': 1.6754117012023926} 11/07/2021 00:17:39 - INFO - __main__ - Step 21714: {'lr': 0.0004784302315661373, 'samples': 4169088, 'steps': 21713, 'loss/train': 1.7005354166030884} 11/07/2021 00:17:39 - INFO - __main__ - Step 21715: {'lr': 0.00047842807515994335, 'samples': 4169280, 'steps': 21714, 'loss/train': 1.8570494651794434} 11/07/2021 00:17:39 - INFO - __main__ - Step 21716: {'lr': 0.00047842591865082315, 'samples': 4169472, 'steps': 21715, 'loss/train': 1.401794672012329} 11/07/2021 00:17:40 - INFO - __main__ - Step 21717: {'lr': 0.0004784237620387778, 'samples': 4169664, 'steps': 21716, 'loss/train': 1.5829322338104248} 11/07/2021 00:17:40 - INFO - __main__ - Step 21718: {'lr': 0.0004784216053238082, 'samples': 4169856, 'steps': 21717, 'loss/train': 1.6291459798812866} 11/07/2021 00:17:41 - INFO - __main__ - Step 21719: {'lr': 0.00047841944850591535, 'samples': 4170048, 'steps': 21718, 'loss/train': 1.6986477375030518} 11/07/2021 00:17:42 - INFO - __main__ - Step 21720: {'lr': 0.0004784172915851003, 'samples': 4170240, 'steps': 21719, 'loss/train': 1.7120718955993652} 11/07/2021 00:17:42 - INFO - __main__ - Step 21721: {'lr': 0.00047841513456136383, 'samples': 4170432, 'steps': 21720, 'loss/train': 1.5299396514892578} 11/07/2021 00:17:42 - INFO - __main__ - Step 21722: {'lr': 0.000478412977434707, 'samples': 4170624, 'steps': 21721, 'loss/train': 1.4899734258651733} 11/07/2021 00:17:43 - INFO - __main__ - Step 21723: {'lr': 0.00047841082020513094, 'samples': 4170816, 'steps': 21722, 'loss/train': 1.4843244552612305} 11/07/2021 00:17:44 - INFO - __main__ - Step 21724: {'lr': 0.0004784086628726364, 'samples': 4171008, 'steps': 21723, 'loss/train': 1.7268753051757812} 11/07/2021 00:17:44 - INFO - __main__ - Step 21725: {'lr': 0.0004784065054372245, 'samples': 4171200, 'steps': 21724, 'loss/train': 1.1709247827529907} 11/07/2021 00:17:44 - INFO - __main__ - Step 21726: {'lr': 0.0004784043478988961, 'samples': 4171392, 'steps': 21725, 'loss/train': 1.5650849342346191} 11/07/2021 00:17:45 - INFO - __main__ - Step 21727: {'lr': 0.00047840219025765225, 'samples': 4171584, 'steps': 21726, 'loss/train': 1.7945908308029175} 11/07/2021 00:17:45 - INFO - __main__ - Step 21728: {'lr': 0.0004784000325134939, 'samples': 4171776, 'steps': 21727, 'loss/train': 1.760206937789917} 11/07/2021 00:17:46 - INFO - __main__ - Step 21729: {'lr': 0.00047839787466642206, 'samples': 4171968, 'steps': 21728, 'loss/train': 1.9748557806015015} 11/07/2021 00:17:46 - INFO - __main__ - Step 21730: {'lr': 0.00047839571671643756, 'samples': 4172160, 'steps': 21729, 'loss/train': 1.7250733375549316} 11/07/2021 00:17:47 - INFO - __main__ - Step 21731: {'lr': 0.0004783935586635415, 'samples': 4172352, 'steps': 21730, 'loss/train': 1.6004855632781982} 11/07/2021 00:17:47 - INFO - __main__ - Step 21732: {'lr': 0.0004783914005077349, 'samples': 4172544, 'steps': 21731, 'loss/train': 1.0612422227859497} 11/07/2021 00:17:48 - INFO - __main__ - Step 21733: {'lr': 0.0004783892422490186, 'samples': 4172736, 'steps': 21732, 'loss/train': 1.5149544477462769} 11/07/2021 00:17:49 - INFO - __main__ - Step 21734: {'lr': 0.00047838708388739365, 'samples': 4172928, 'steps': 21733, 'loss/train': 1.3210654258728027} 11/07/2021 00:17:49 - INFO - __main__ - Step 21735: {'lr': 0.000478384925422861, 'samples': 4173120, 'steps': 21734, 'loss/train': 1.7746955156326294} 11/07/2021 00:17:49 - INFO - __main__ - Step 21736: {'lr': 0.00047838276685542157, 'samples': 4173312, 'steps': 21735, 'loss/train': 1.6839711666107178} 11/07/2021 00:17:50 - INFO - __main__ - Step 21737: {'lr': 0.0004783806081850765, 'samples': 4173504, 'steps': 21736, 'loss/train': 1.643261432647705} 11/07/2021 00:17:50 - INFO - __main__ - Step 21738: {'lr': 0.0004783784494118266, 'samples': 4173696, 'steps': 21737, 'loss/train': 1.1372442245483398} 11/07/2021 00:17:51 - INFO - __main__ - Step 21739: {'lr': 0.00047837629053567286, 'samples': 4173888, 'steps': 21738, 'loss/train': 1.4777683019638062} 11/07/2021 00:17:52 - INFO - __main__ - Step 21740: {'lr': 0.00047837413155661635, 'samples': 4174080, 'steps': 21739, 'loss/train': 1.196178674697876} 11/07/2021 00:17:52 - INFO - __main__ - Step 21741: {'lr': 0.000478371972474658, 'samples': 4174272, 'steps': 21740, 'loss/train': 1.6837353706359863} 11/07/2021 00:17:52 - INFO - __main__ - Step 21742: {'lr': 0.00047836981328979865, 'samples': 4174464, 'steps': 21741, 'loss/train': 1.6639713048934937} 11/07/2021 00:17:53 - INFO - __main__ - Step 21743: {'lr': 0.00047836765400203953, 'samples': 4174656, 'steps': 21742, 'loss/train': 2.0440940856933594} 11/07/2021 00:17:53 - INFO - __main__ - Step 21744: {'lr': 0.00047836549461138133, 'samples': 4174848, 'steps': 21743, 'loss/train': 5.400808334350586} 11/07/2021 00:17:54 - INFO - __main__ - Step 21745: {'lr': 0.00047836333511782524, 'samples': 4175040, 'steps': 21744, 'loss/train': 5.30963659286499} 11/07/2021 00:17:55 - INFO - __main__ - Step 21746: {'lr': 0.00047836117552137213, 'samples': 4175232, 'steps': 21745, 'loss/train': 1.938073992729187} 11/07/2021 00:17:55 - INFO - __main__ - Step 21747: {'lr': 0.00047835901582202303, 'samples': 4175424, 'steps': 21746, 'loss/train': 1.732753872871399} 11/07/2021 00:17:55 - INFO - __main__ - Step 21748: {'lr': 0.00047835685601977886, 'samples': 4175616, 'steps': 21747, 'loss/train': 1.6363345384597778} 11/07/2021 00:17:56 - INFO - __main__ - Step 21749: {'lr': 0.00047835469611464055, 'samples': 4175808, 'steps': 21748, 'loss/train': 1.984601616859436} 11/07/2021 00:17:56 - INFO - __main__ - Step 21750: {'lr': 0.0004783525361066092, 'samples': 4176000, 'steps': 21749, 'loss/train': 1.448114275932312} 11/07/2021 00:17:57 - INFO - __main__ - Step 21751: {'lr': 0.00047835037599568576, 'samples': 4176192, 'steps': 21750, 'loss/train': 1.7460412979125977} 11/07/2021 00:17:57 - INFO - __main__ - Step 21752: {'lr': 0.0004783482157818711, 'samples': 4176384, 'steps': 21751, 'loss/train': 1.7143123149871826} 11/07/2021 00:17:58 - INFO - __main__ - Step 21753: {'lr': 0.0004783460554651663, 'samples': 4176576, 'steps': 21752, 'loss/train': 1.7651176452636719} 11/07/2021 00:17:58 - INFO - __main__ - Step 21754: {'lr': 0.0004783438950455723, 'samples': 4176768, 'steps': 21753, 'loss/train': 2.0488734245300293} 11/07/2021 00:17:59 - INFO - __main__ - Step 21755: {'lr': 0.00047834173452309005, 'samples': 4176960, 'steps': 21754, 'loss/train': 1.682035207748413} 11/07/2021 00:17:59 - INFO - __main__ - Step 21756: {'lr': 0.00047833957389772046, 'samples': 4177152, 'steps': 21755, 'loss/train': 1.6110718250274658} 11/07/2021 00:18:00 - INFO - __main__ - Step 21757: {'lr': 0.0004783374131694647, 'samples': 4177344, 'steps': 21756, 'loss/train': 1.5911036729812622} 11/07/2021 00:18:00 - INFO - __main__ - Step 21758: {'lr': 0.00047833525233832356, 'samples': 4177536, 'steps': 21757, 'loss/train': 1.5307949781417847} 11/07/2021 00:18:00 - INFO - __main__ - Step 21759: {'lr': 0.00047833309140429803, 'samples': 4177728, 'steps': 21758, 'loss/train': 1.8668280839920044} 11/07/2021 00:18:01 - INFO - __main__ - Step 21760: {'lr': 0.0004783309303673892, 'samples': 4177920, 'steps': 21759, 'loss/train': 1.4296553134918213} 11/07/2021 00:18:02 - INFO - __main__ - Step 21761: {'lr': 0.00047832876922759805, 'samples': 4178112, 'steps': 21760, 'loss/train': 1.4576103687286377} 11/07/2021 00:18:02 - INFO - __main__ - Step 21762: {'lr': 0.0004783266079849253, 'samples': 4178304, 'steps': 21761, 'loss/train': 1.7006242275238037} 11/07/2021 00:18:02 - INFO - __main__ - Step 21763: {'lr': 0.00047832444663937227, 'samples': 4178496, 'steps': 21762, 'loss/train': 1.7622902393341064} 11/07/2021 00:18:03 - INFO - __main__ - Step 21764: {'lr': 0.0004783222851909397, 'samples': 4178688, 'steps': 21763, 'loss/train': 1.790881872177124} 11/07/2021 00:18:03 - INFO - __main__ - Step 21765: {'lr': 0.0004783201236396286, 'samples': 4178880, 'steps': 21764, 'loss/train': 1.3149633407592773} 11/07/2021 00:18:04 - INFO - __main__ - Step 21766: {'lr': 0.00047831796198544, 'samples': 4179072, 'steps': 21765, 'loss/train': 1.403822422027588} 11/07/2021 00:18:04 - INFO - __main__ - Step 21767: {'lr': 0.0004783158002283749, 'samples': 4179264, 'steps': 21766, 'loss/train': 1.7273672819137573} 11/07/2021 00:18:05 - INFO - __main__ - Step 21768: {'lr': 0.0004783136383684342, 'samples': 4179456, 'steps': 21767, 'loss/train': 1.5977365970611572} 11/07/2021 00:18:05 - INFO - __main__ - Step 21769: {'lr': 0.0004783114764056188, 'samples': 4179648, 'steps': 21768, 'loss/train': 1.2280904054641724} 11/07/2021 00:18:05 - INFO - __main__ - Step 21770: {'lr': 0.00047830931433992985, 'samples': 4179840, 'steps': 21769, 'loss/train': 0.20790725946426392} 11/07/2021 00:18:07 - INFO - __main__ - Step 21771: {'lr': 0.00047830715217136825, 'samples': 4180032, 'steps': 21770, 'loss/train': 1.8650132417678833} 11/07/2021 00:18:07 - INFO - __main__ - Step 21772: {'lr': 0.000478304989899935, 'samples': 4180224, 'steps': 21771, 'loss/train': 1.264001488685608} 11/07/2021 00:18:07 - INFO - __main__ - Step 21773: {'lr': 0.00047830282752563103, 'samples': 4180416, 'steps': 21772, 'loss/train': 1.614505410194397} 11/07/2021 00:18:08 - INFO - __main__ - Step 21774: {'lr': 0.00047830066504845725, 'samples': 4180608, 'steps': 21773, 'loss/train': 1.6235742568969727} 11/07/2021 00:18:08 - INFO - __main__ - Step 21775: {'lr': 0.0004782985024684148, 'samples': 4180800, 'steps': 21774, 'loss/train': 1.3455910682678223} 11/07/2021 00:18:09 - INFO - __main__ - Step 21776: {'lr': 0.0004782963397855046, 'samples': 4180992, 'steps': 21775, 'loss/train': 1.3944709300994873} 11/07/2021 00:18:09 - INFO - __main__ - Step 21777: {'lr': 0.00047829417699972747, 'samples': 4181184, 'steps': 21776, 'loss/train': 1.57528555393219} 11/07/2021 00:18:10 - INFO - __main__ - Step 21778: {'lr': 0.0004782920141110846, 'samples': 4181376, 'steps': 21777, 'loss/train': 1.8353033065795898} 11/07/2021 00:18:10 - INFO - __main__ - Step 21779: {'lr': 0.0004782898511195768, 'samples': 4181568, 'steps': 21778, 'loss/train': 1.5532902479171753} 11/07/2021 00:18:10 - INFO - __main__ - Step 21780: {'lr': 0.00047828768802520515, 'samples': 4181760, 'steps': 21779, 'loss/train': 1.8849406242370605} 11/07/2021 00:18:11 - INFO - __main__ - Step 21781: {'lr': 0.0004782855248279706, 'samples': 4181952, 'steps': 21780, 'loss/train': 2.070675849914551} 11/07/2021 00:18:12 - INFO - __main__ - Step 21782: {'lr': 0.0004782833615278741, 'samples': 4182144, 'steps': 21781, 'loss/train': 1.6351349353790283} 11/07/2021 00:18:12 - INFO - __main__ - Step 21783: {'lr': 0.00047828119812491664, 'samples': 4182336, 'steps': 21782, 'loss/train': 1.5823655128479004} 11/07/2021 00:18:12 - INFO - __main__ - Step 21784: {'lr': 0.0004782790346190993, 'samples': 4182528, 'steps': 21783, 'loss/train': 1.455492377281189} 11/07/2021 00:18:13 - INFO - __main__ - Step 21785: {'lr': 0.00047827687101042283, 'samples': 4182720, 'steps': 21784, 'loss/train': 1.162865400314331} 11/07/2021 00:18:14 - INFO - __main__ - Step 21786: {'lr': 0.00047827470729888834, 'samples': 4182912, 'steps': 21785, 'loss/train': 1.6407850980758667} 11/07/2021 00:18:14 - INFO - __main__ - Step 21787: {'lr': 0.0004782725434844968, 'samples': 4183104, 'steps': 21786, 'loss/train': 1.9232147932052612} 11/07/2021 00:18:15 - INFO - __main__ - Step 21788: {'lr': 0.00047827037956724915, 'samples': 4183296, 'steps': 21787, 'loss/train': 1.6318410634994507} 11/07/2021 00:18:15 - INFO - __main__ - Step 21789: {'lr': 0.00047826821554714644, 'samples': 4183488, 'steps': 21788, 'loss/train': 1.4995194673538208} 11/07/2021 00:18:15 - INFO - __main__ - Step 21790: {'lr': 0.00047826605142418954, 'samples': 4183680, 'steps': 21789, 'loss/train': 1.456229329109192} 11/07/2021 00:18:17 - INFO - __main__ - Step 21791: {'lr': 0.0004782638871983795, 'samples': 4183872, 'steps': 21790, 'loss/train': 1.5134155750274658} 11/07/2021 00:18:17 - INFO - __main__ - Step 21792: {'lr': 0.0004782617228697173, 'samples': 4184064, 'steps': 21791, 'loss/train': 1.4779579639434814} 11/07/2021 00:18:17 - INFO - __main__ - Step 21793: {'lr': 0.0004782595584382039, 'samples': 4184256, 'steps': 21792, 'loss/train': 2.105910301208496} 11/07/2021 00:18:18 - INFO - __main__ - Step 21794: {'lr': 0.0004782573939038402, 'samples': 4184448, 'steps': 21793, 'loss/train': 1.5484576225280762} 11/07/2021 00:18:18 - INFO - __main__ - Step 21795: {'lr': 0.0004782552292666273, 'samples': 4184640, 'steps': 21794, 'loss/train': 0.4943452775478363} 11/07/2021 00:18:19 - INFO - __main__ - Step 21796: {'lr': 0.0004782530645265661, 'samples': 4184832, 'steps': 21795, 'loss/train': 1.8683432340621948} 11/07/2021 00:18:19 - INFO - __main__ - Step 21797: {'lr': 0.0004782508996836576, 'samples': 4185024, 'steps': 21796, 'loss/train': 1.6919738054275513} 11/07/2021 00:18:20 - INFO - __main__ - Step 21798: {'lr': 0.00047824873473790275, 'samples': 4185216, 'steps': 21797, 'loss/train': 1.65096914768219} 11/07/2021 00:18:20 - INFO - __main__ - Step 21799: {'lr': 0.0004782465696893025, 'samples': 4185408, 'steps': 21798, 'loss/train': 0.8809558749198914} 11/07/2021 00:18:20 - INFO - __main__ - Step 21800: {'lr': 0.0004782444045378579, 'samples': 4185600, 'steps': 21799, 'loss/train': 1.672647476196289} 11/07/2021 00:18:21 - INFO - __main__ - Step 21801: {'lr': 0.00047824223928356993, 'samples': 4185792, 'steps': 21800, 'loss/train': 1.561408281326294} 11/07/2021 00:18:22 - INFO - __main__ - Step 21802: {'lr': 0.0004782400739264395, 'samples': 4185984, 'steps': 21801, 'loss/train': 1.8260188102722168} 11/07/2021 00:18:22 - INFO - __main__ - Step 21803: {'lr': 0.00047823790846646764, 'samples': 4186176, 'steps': 21802, 'loss/train': 1.5481027364730835} 11/07/2021 00:18:23 - INFO - __main__ - Step 21804: {'lr': 0.0004782357429036553, 'samples': 4186368, 'steps': 21803, 'loss/train': 1.6464818716049194} 11/07/2021 00:18:23 - INFO - __main__ - Step 21805: {'lr': 0.00047823357723800344, 'samples': 4186560, 'steps': 21804, 'loss/train': 1.4024767875671387} 11/07/2021 00:18:23 - INFO - __main__ - Step 21806: {'lr': 0.000478231411469513, 'samples': 4186752, 'steps': 21805, 'loss/train': 1.3001028299331665} 11/07/2021 00:18:24 - INFO - __main__ - Step 21807: {'lr': 0.000478229245598185, 'samples': 4186944, 'steps': 21806, 'loss/train': 1.666137933731079} 11/07/2021 00:18:25 - INFO - __main__ - Step 21808: {'lr': 0.00047822707962402055, 'samples': 4187136, 'steps': 21807, 'loss/train': 1.8843587636947632} 11/07/2021 00:18:25 - INFO - __main__ - Step 21809: {'lr': 0.00047822491354702044, 'samples': 4187328, 'steps': 21808, 'loss/train': 1.8649508953094482} 11/07/2021 00:18:25 - INFO - __main__ - Step 21810: {'lr': 0.0004782227473671857, 'samples': 4187520, 'steps': 21809, 'loss/train': 1.851442575454712} 11/07/2021 00:18:26 - INFO - __main__ - Step 21811: {'lr': 0.00047822058108451727, 'samples': 4187712, 'steps': 21810, 'loss/train': 1.5848554372787476} 11/07/2021 00:18:27 - INFO - __main__ - Step 21812: {'lr': 0.0004782184146990162, 'samples': 4187904, 'steps': 21811, 'loss/train': 0.5069980025291443} 11/07/2021 00:18:27 - INFO - __main__ - Step 21813: {'lr': 0.00047821624821068346, 'samples': 4188096, 'steps': 21812, 'loss/train': 1.5223522186279297} 11/07/2021 00:18:28 - INFO - __main__ - Step 21814: {'lr': 0.00047821408161952, 'samples': 4188288, 'steps': 21813, 'loss/train': 1.8222025632858276} 11/07/2021 00:18:28 - INFO - __main__ - Step 21815: {'lr': 0.00047821191492552676, 'samples': 4188480, 'steps': 21814, 'loss/train': 1.5835494995117188} 11/07/2021 00:18:28 - INFO - __main__ - Step 21816: {'lr': 0.00047820974812870477, 'samples': 4188672, 'steps': 21815, 'loss/train': 1.3906757831573486} 11/07/2021 00:18:29 - INFO - __main__ - Step 21817: {'lr': 0.00047820758122905493, 'samples': 4188864, 'steps': 21816, 'loss/train': 0.9313063621520996} 11/07/2021 00:18:29 - INFO - __main__ - Step 21818: {'lr': 0.0004782054142265784, 'samples': 4189056, 'steps': 21817, 'loss/train': 1.363304853439331} 11/07/2021 00:18:30 - INFO - __main__ - Step 21819: {'lr': 0.00047820324712127593, 'samples': 4189248, 'steps': 21818, 'loss/train': 1.2986811399459839} 11/07/2021 00:18:30 - INFO - __main__ - Step 21820: {'lr': 0.0004782010799131487, 'samples': 4189440, 'steps': 21819, 'loss/train': 1.870723843574524} 11/07/2021 00:18:31 - INFO - __main__ - Step 21821: {'lr': 0.0004781989126021975, 'samples': 4189632, 'steps': 21820, 'loss/train': 1.8027942180633545} 11/07/2021 00:18:31 - INFO - __main__ - Step 21822: {'lr': 0.00047819674518842335, 'samples': 4189824, 'steps': 21821, 'loss/train': 2.168245792388916} 11/07/2021 00:18:32 - INFO - __main__ - Step 21823: {'lr': 0.00047819457767182735, 'samples': 4190016, 'steps': 21822, 'loss/train': 1.800366997718811} 11/07/2021 00:18:32 - INFO - __main__ - Step 21824: {'lr': 0.0004781924100524104, 'samples': 4190208, 'steps': 21823, 'loss/train': 1.727982997894287} 11/07/2021 00:18:33 - INFO - __main__ - Step 21825: {'lr': 0.00047819024233017337, 'samples': 4190400, 'steps': 21824, 'loss/train': 1.5247220993041992} 11/07/2021 00:18:33 - INFO - __main__ - Step 21826: {'lr': 0.00047818807450511746, 'samples': 4190592, 'steps': 21825, 'loss/train': 1.1852495670318604} 11/07/2021 00:18:34 - INFO - __main__ - Step 21827: {'lr': 0.00047818590657724345, 'samples': 4190784, 'steps': 21826, 'loss/train': 1.8680647611618042} 11/07/2021 00:18:34 - INFO - __main__ - Step 21828: {'lr': 0.0004781837385465524, 'samples': 4190976, 'steps': 21827, 'loss/train': 1.8202399015426636} 11/07/2021 00:18:35 - INFO - __main__ - Step 21829: {'lr': 0.00047818157041304535, 'samples': 4191168, 'steps': 21828, 'loss/train': 1.9058685302734375} 11/07/2021 00:18:35 - INFO - __main__ - Step 21830: {'lr': 0.00047817940217672315, 'samples': 4191360, 'steps': 21829, 'loss/train': 1.6217126846313477} 11/07/2021 00:18:35 - INFO - __main__ - Step 21831: {'lr': 0.0004781772338375868, 'samples': 4191552, 'steps': 21830, 'loss/train': 1.5916210412979126} 11/07/2021 00:18:36 - INFO - __main__ - Step 21832: {'lr': 0.0004781750653956374, 'samples': 4191744, 'steps': 21831, 'loss/train': 1.7649997472763062} 11/07/2021 00:18:37 - INFO - __main__ - Step 21833: {'lr': 0.00047817289685087575, 'samples': 4191936, 'steps': 21832, 'loss/train': 1.5090216398239136} 11/07/2021 00:18:37 - INFO - __main__ - Step 21834: {'lr': 0.00047817072820330287, 'samples': 4192128, 'steps': 21833, 'loss/train': 0.29298678040504456} 11/07/2021 00:18:37 - INFO - __main__ - Step 21835: {'lr': 0.0004781685594529199, 'samples': 4192320, 'steps': 21834, 'loss/train': 1.3522447347640991} 11/07/2021 00:18:38 - INFO - __main__ - Step 21836: {'lr': 0.00047816639059972767, 'samples': 4192512, 'steps': 21835, 'loss/train': 1.8226150274276733} 11/07/2021 00:18:39 - INFO - __main__ - Step 21837: {'lr': 0.00047816422164372713, 'samples': 4192704, 'steps': 21836, 'loss/train': 1.258782982826233} 11/07/2021 00:18:39 - INFO - __main__ - Step 21838: {'lr': 0.00047816205258491935, 'samples': 4192896, 'steps': 21837, 'loss/train': 1.8745602369308472} 11/07/2021 00:18:40 - INFO - __main__ - Step 21839: {'lr': 0.0004781598834233053, 'samples': 4193088, 'steps': 21838, 'loss/train': 2.18178653717041} 11/07/2021 00:18:40 - INFO - __main__ - Step 21840: {'lr': 0.0004781577141588859, 'samples': 4193280, 'steps': 21839, 'loss/train': 2.1624045372009277} 11/07/2021 00:18:40 - INFO - __main__ - Step 21841: {'lr': 0.0004781555447916621, 'samples': 4193472, 'steps': 21840, 'loss/train': 1.661039113998413} 11/07/2021 00:18:41 - INFO - __main__ - Step 21842: {'lr': 0.000478153375321635, 'samples': 4193664, 'steps': 21841, 'loss/train': 1.5614854097366333} 11/07/2021 00:18:42 - INFO - __main__ - Step 21843: {'lr': 0.0004781512057488055, 'samples': 4193856, 'steps': 21842, 'loss/train': 1.4372495412826538} 11/07/2021 00:18:42 - INFO - __main__ - Step 21844: {'lr': 0.00047814903607317454, 'samples': 4194048, 'steps': 21843, 'loss/train': 1.4852203130722046} 11/07/2021 00:18:42 - INFO - __main__ - Step 21845: {'lr': 0.00047814686629474323, 'samples': 4194240, 'steps': 21844, 'loss/train': 1.8648353815078735} 11/07/2021 00:18:43 - INFO - __main__ - Step 21846: {'lr': 0.00047814469641351237, 'samples': 4194432, 'steps': 21845, 'loss/train': 1.8701627254486084} 11/07/2021 00:18:43 - INFO - __main__ - Step 21847: {'lr': 0.0004781425264294831, 'samples': 4194624, 'steps': 21846, 'loss/train': 0.418106347322464} 11/07/2021 00:18:44 - INFO - __main__ - Step 21848: {'lr': 0.0004781403563426563, 'samples': 4194816, 'steps': 21847, 'loss/train': 0.14467741549015045} 11/07/2021 00:18:44 - INFO - __main__ - Step 21849: {'lr': 0.00047813818615303295, 'samples': 4195008, 'steps': 21848, 'loss/train': 1.6116589307785034} 11/07/2021 00:18:45 - INFO - __main__ - Step 21850: {'lr': 0.00047813601586061414, 'samples': 4195200, 'steps': 21849, 'loss/train': 1.225780725479126} 11/07/2021 00:18:45 - INFO - __main__ - Step 21851: {'lr': 0.0004781338454654007, 'samples': 4195392, 'steps': 21850, 'loss/train': 1.4589381217956543} 11/07/2021 00:18:45 - INFO - __main__ - Step 21852: {'lr': 0.00047813167496739363, 'samples': 4195584, 'steps': 21851, 'loss/train': 1.6310547590255737} 11/07/2021 00:18:47 - INFO - __main__ - Step 21853: {'lr': 0.00047812950436659405, 'samples': 4195776, 'steps': 21852, 'loss/train': 1.6144341230392456} 11/07/2021 00:18:47 - INFO - __main__ - Step 21854: {'lr': 0.0004781273336630028, 'samples': 4195968, 'steps': 21853, 'loss/train': 1.7650673389434814} 11/07/2021 00:18:47 - INFO - __main__ - Step 21855: {'lr': 0.00047812516285662086, 'samples': 4196160, 'steps': 21854, 'loss/train': 1.2556949853897095} 11/07/2021 00:18:48 - INFO - __main__ - Step 21856: {'lr': 0.00047812299194744924, 'samples': 4196352, 'steps': 21855, 'loss/train': 1.4601771831512451} 11/07/2021 00:18:48 - INFO - __main__ - Step 21857: {'lr': 0.0004781208209354889, 'samples': 4196544, 'steps': 21856, 'loss/train': 1.820651888847351} 11/07/2021 00:18:49 - INFO - __main__ - Step 21858: {'lr': 0.00047811864982074087, 'samples': 4196736, 'steps': 21857, 'loss/train': 1.4750192165374756} 11/07/2021 00:18:49 - INFO - __main__ - Step 21859: {'lr': 0.0004781164786032061, 'samples': 4196928, 'steps': 21858, 'loss/train': 0.9147158265113831} 11/07/2021 00:18:50 - INFO - __main__ - Step 21860: {'lr': 0.0004781143072828856, 'samples': 4197120, 'steps': 21859, 'loss/train': 1.3358899354934692} 11/07/2021 00:18:50 - INFO - __main__ - Step 21861: {'lr': 0.00047811213585978023, 'samples': 4197312, 'steps': 21860, 'loss/train': 1.5927478075027466} 11/07/2021 00:18:50 - INFO - __main__ - Step 21862: {'lr': 0.0004781099643338911, 'samples': 4197504, 'steps': 21861, 'loss/train': 1.585418701171875} 11/07/2021 00:18:51 - INFO - __main__ - Step 21863: {'lr': 0.00047810779270521914, 'samples': 4197696, 'steps': 21862, 'loss/train': 1.556623101234436} 11/07/2021 00:18:52 - INFO - __main__ - Step 21864: {'lr': 0.0004781056209737653, 'samples': 4197888, 'steps': 21863, 'loss/train': 1.2449383735656738} 11/07/2021 00:18:52 - INFO - __main__ - Step 21865: {'lr': 0.00047810344913953065, 'samples': 4198080, 'steps': 21864, 'loss/train': 1.4618579149246216} 11/07/2021 00:18:52 - INFO - __main__ - Step 21866: {'lr': 0.0004781012772025161, 'samples': 4198272, 'steps': 21865, 'loss/train': 1.6904577016830444} 11/07/2021 00:18:53 - INFO - __main__ - Step 21867: {'lr': 0.0004780991051627226, 'samples': 4198464, 'steps': 21866, 'loss/train': 1.2188726663589478} 11/07/2021 00:18:53 - INFO - __main__ - Step 21868: {'lr': 0.0004780969330201511, 'samples': 4198656, 'steps': 21867, 'loss/train': 2.0451176166534424} 11/07/2021 00:18:54 - INFO - __main__ - Step 21869: {'lr': 0.0004780947607748027, 'samples': 4198848, 'steps': 21868, 'loss/train': 1.7905349731445312} 11/07/2021 00:18:55 - INFO - __main__ - Step 21870: {'lr': 0.00047809258842667837, 'samples': 4199040, 'steps': 21869, 'loss/train': 1.1601496934890747} 11/07/2021 00:18:55 - INFO - __main__ - Step 21871: {'lr': 0.000478090415975779, 'samples': 4199232, 'steps': 21870, 'loss/train': 1.764669418334961} 11/07/2021 00:18:55 - INFO - __main__ - Step 21872: {'lr': 0.00047808824342210565, 'samples': 4199424, 'steps': 21871, 'loss/train': 2.10385799407959} 11/07/2021 00:18:56 - INFO - __main__ - Step 21873: {'lr': 0.0004780860707656592, 'samples': 4199616, 'steps': 21872, 'loss/train': 1.9181289672851562} 11/07/2021 00:18:57 - INFO - __main__ - Step 21874: {'lr': 0.0004780838980064407, 'samples': 4199808, 'steps': 21873, 'loss/train': 1.509691596031189} 11/07/2021 00:18:57 - INFO - __main__ - Step 21875: {'lr': 0.00047808172514445115, 'samples': 4200000, 'steps': 21874, 'loss/train': 1.487251877784729} 11/07/2021 00:18:57 - INFO - __main__ - Step 21876: {'lr': 0.0004780795521796914, 'samples': 4200192, 'steps': 21875, 'loss/train': 1.7136597633361816} 11/07/2021 00:18:58 - INFO - __main__ - Step 21877: {'lr': 0.0004780773791121626, 'samples': 4200384, 'steps': 21876, 'loss/train': 1.5490812063217163} 11/07/2021 00:18:58 - INFO - __main__ - Step 21878: {'lr': 0.0004780752059418656, 'samples': 4200576, 'steps': 21877, 'loss/train': 1.2752032279968262} 11/07/2021 00:18:59 - INFO - __main__ - Step 21879: {'lr': 0.0004780730326688015, 'samples': 4200768, 'steps': 21878, 'loss/train': 5.443384170532227} 11/07/2021 00:19:00 - INFO - __main__ - Step 21880: {'lr': 0.0004780708592929712, 'samples': 4200960, 'steps': 21879, 'loss/train': 1.6154052019119263} 11/07/2021 00:19:00 - INFO - __main__ - Step 21881: {'lr': 0.0004780686858143756, 'samples': 4201152, 'steps': 21880, 'loss/train': 1.7930195331573486} 11/07/2021 00:19:00 - INFO - __main__ - Step 21882: {'lr': 0.0004780665122330159, 'samples': 4201344, 'steps': 21881, 'loss/train': 1.518471360206604} 11/07/2021 00:19:01 - INFO - __main__ - Step 21883: {'lr': 0.00047806433854889285, 'samples': 4201536, 'steps': 21882, 'loss/train': 0.8893343210220337} 11/07/2021 00:19:01 - INFO - __main__ - Step 21884: {'lr': 0.0004780621647620076, 'samples': 4201728, 'steps': 21883, 'loss/train': 0.4408738613128662} 11/07/2021 00:19:02 - INFO - __main__ - Step 21885: {'lr': 0.00047805999087236097, 'samples': 4201920, 'steps': 21884, 'loss/train': 2.2122480869293213} 11/07/2021 00:19:02 - INFO - __main__ - Step 21886: {'lr': 0.0004780578168799541, 'samples': 4202112, 'steps': 21885, 'loss/train': 0.9583873748779297} 11/07/2021 00:19:03 - INFO - __main__ - Step 21887: {'lr': 0.00047805564278478787, 'samples': 4202304, 'steps': 21886, 'loss/train': 1.2693642377853394} 11/07/2021 00:19:03 - INFO - __main__ - Step 21888: {'lr': 0.00047805346858686325, 'samples': 4202496, 'steps': 21887, 'loss/train': 2.000314474105835} 11/07/2021 00:19:03 - INFO - __main__ - Step 21889: {'lr': 0.0004780512942861813, 'samples': 4202688, 'steps': 21888, 'loss/train': 1.3067866563796997} 11/07/2021 00:19:04 - INFO - __main__ - Step 21890: {'lr': 0.00047804911988274303, 'samples': 4202880, 'steps': 21889, 'loss/train': 1.678052544593811} 11/07/2021 00:19:05 - INFO - __main__ - Step 21891: {'lr': 0.00047804694537654927, 'samples': 4203072, 'steps': 21890, 'loss/train': 1.5158631801605225} 11/07/2021 00:19:05 - INFO - __main__ - Step 21892: {'lr': 0.00047804477076760106, 'samples': 4203264, 'steps': 21891, 'loss/train': 1.5687859058380127} 11/07/2021 00:19:06 - INFO - __main__ - Step 21893: {'lr': 0.0004780425960558994, 'samples': 4203456, 'steps': 21892, 'loss/train': 1.5470932722091675} 11/07/2021 00:19:06 - INFO - __main__ - Step 21894: {'lr': 0.00047804042124144526, 'samples': 4203648, 'steps': 21893, 'loss/train': 1.7719433307647705} 11/07/2021 00:19:07 - INFO - __main__ - Step 21895: {'lr': 0.00047803824632423967, 'samples': 4203840, 'steps': 21894, 'loss/train': 1.6650912761688232} 11/07/2021 00:19:07 - INFO - __main__ - Step 21896: {'lr': 0.0004780360713042835, 'samples': 4204032, 'steps': 21895, 'loss/train': 1.5138376951217651} 11/07/2021 00:19:08 - INFO - __main__ - Step 21897: {'lr': 0.0004780338961815779, 'samples': 4204224, 'steps': 21896, 'loss/train': 2.1411073207855225} 11/07/2021 00:19:08 - INFO - __main__ - Step 21898: {'lr': 0.00047803172095612365, 'samples': 4204416, 'steps': 21897, 'loss/train': 1.6470144987106323} 11/07/2021 00:19:08 - INFO - __main__ - Step 21899: {'lr': 0.00047802954562792185, 'samples': 4204608, 'steps': 21898, 'loss/train': 1.3572453260421753} 11/07/2021 00:19:09 - INFO - __main__ - Step 21900: {'lr': 0.0004780273701969734, 'samples': 4204800, 'steps': 21899, 'loss/train': 1.491538405418396} 11/07/2021 00:19:10 - INFO - __main__ - Step 21901: {'lr': 0.00047802519466327945, 'samples': 4204992, 'steps': 21900, 'loss/train': 2.107398509979248} 11/07/2021 00:19:10 - INFO - __main__ - Step 21902: {'lr': 0.00047802301902684076, 'samples': 4205184, 'steps': 21901, 'loss/train': 1.5457367897033691} 11/07/2021 00:19:10 - INFO - __main__ - Step 21903: {'lr': 0.0004780208432876585, 'samples': 4205376, 'steps': 21902, 'loss/train': 1.2147682905197144} 11/07/2021 00:19:11 - INFO - __main__ - Step 21904: {'lr': 0.00047801866744573353, 'samples': 4205568, 'steps': 21903, 'loss/train': 1.2499436140060425} 11/07/2021 00:19:12 - INFO - __main__ - Step 21905: {'lr': 0.00047801649150106684, 'samples': 4205760, 'steps': 21904, 'loss/train': 1.9765163660049438} 11/07/2021 00:19:12 - INFO - __main__ - Step 21906: {'lr': 0.00047801431545365947, 'samples': 4205952, 'steps': 21905, 'loss/train': 1.6405918598175049} 11/07/2021 00:19:13 - INFO - __main__ - Step 21907: {'lr': 0.0004780121393035124, 'samples': 4206144, 'steps': 21906, 'loss/train': 1.492367148399353} 11/07/2021 00:19:13 - INFO - __main__ - Step 21908: {'lr': 0.0004780099630506265, 'samples': 4206336, 'steps': 21907, 'loss/train': 1.6051231622695923} 11/07/2021 00:19:13 - INFO - __main__ - Step 21909: {'lr': 0.0004780077866950029, 'samples': 4206528, 'steps': 21908, 'loss/train': 5.938828945159912} 11/07/2021 00:19:14 - INFO - __main__ - Step 21910: {'lr': 0.00047800561023664246, 'samples': 4206720, 'steps': 21909, 'loss/train': 1.5359915494918823} 11/07/2021 00:19:15 - INFO - __main__ - Step 21911: {'lr': 0.0004780034336755462, 'samples': 4206912, 'steps': 21910, 'loss/train': 2.409712314605713} 11/07/2021 00:19:15 - INFO - __main__ - Step 21912: {'lr': 0.00047800125701171517, 'samples': 4207104, 'steps': 21911, 'loss/train': 1.6394217014312744} 11/07/2021 00:19:15 - INFO - __main__ - Step 21913: {'lr': 0.00047799908024515026, 'samples': 4207296, 'steps': 21912, 'loss/train': 1.683571457862854} 11/07/2021 00:19:16 - INFO - __main__ - Step 21914: {'lr': 0.0004779969033758525, 'samples': 4207488, 'steps': 21913, 'loss/train': 1.252291202545166} 11/07/2021 00:19:16 - INFO - __main__ - Step 21915: {'lr': 0.00047799472640382287, 'samples': 4207680, 'steps': 21914, 'loss/train': 1.8091762065887451} 11/07/2021 00:19:17 - INFO - __main__ - Step 21916: {'lr': 0.0004779925493290623, 'samples': 4207872, 'steps': 21915, 'loss/train': 1.6336859464645386} 11/07/2021 00:19:17 - INFO - __main__ - Step 21917: {'lr': 0.00047799037215157184, 'samples': 4208064, 'steps': 21916, 'loss/train': 1.4064269065856934} 11/07/2021 00:19:18 - INFO - __main__ - Step 21918: {'lr': 0.0004779881948713524, 'samples': 4208256, 'steps': 21917, 'loss/train': 1.3182247877120972} 11/07/2021 00:19:18 - INFO - __main__ - Step 21919: {'lr': 0.000477986017488405, 'samples': 4208448, 'steps': 21918, 'loss/train': 1.4684267044067383} 11/07/2021 00:19:18 - INFO - __main__ - Step 21920: {'lr': 0.00047798384000273053, 'samples': 4208640, 'steps': 21919, 'loss/train': 0.6165792346000671} 11/07/2021 00:19:20 - INFO - __main__ - Step 21921: {'lr': 0.0004779816624143302, 'samples': 4208832, 'steps': 21920, 'loss/train': 1.593778371810913} 11/07/2021 00:19:20 - INFO - __main__ - Step 21922: {'lr': 0.0004779794847232048, 'samples': 4209024, 'steps': 21921, 'loss/train': 0.12900973856449127} 11/07/2021 00:19:20 - INFO - __main__ - Step 21923: {'lr': 0.0004779773069293554, 'samples': 4209216, 'steps': 21922, 'loss/train': 1.0907976627349854} 11/07/2021 00:19:21 - INFO - __main__ - Step 21924: {'lr': 0.00047797512903278283, 'samples': 4209408, 'steps': 21923, 'loss/train': 1.5280829668045044} 11/07/2021 00:19:21 - INFO - __main__ - Step 21925: {'lr': 0.0004779729510334883, 'samples': 4209600, 'steps': 21924, 'loss/train': 1.7411478757858276} 11/07/2021 00:19:22 - INFO - __main__ - Step 21926: {'lr': 0.0004779707729314726, 'samples': 4209792, 'steps': 21925, 'loss/train': 1.3452696800231934} 11/07/2021 00:19:22 - INFO - __main__ - Step 21927: {'lr': 0.0004779685947267369, 'samples': 4209984, 'steps': 21926, 'loss/train': 1.8387231826782227} 11/07/2021 00:19:23 - INFO - __main__ - Step 21928: {'lr': 0.00047796641641928195, 'samples': 4210176, 'steps': 21927, 'loss/train': 1.7874184846878052} 11/07/2021 00:19:23 - INFO - __main__ - Step 21929: {'lr': 0.00047796423800910894, 'samples': 4210368, 'steps': 21928, 'loss/train': 1.862120270729065} 11/07/2021 00:19:23 - INFO - __main__ - Step 21930: {'lr': 0.00047796205949621873, 'samples': 4210560, 'steps': 21929, 'loss/train': 0.7199652791023254} 11/07/2021 00:19:24 - INFO - __main__ - Step 21931: {'lr': 0.00047795988088061224, 'samples': 4210752, 'steps': 21930, 'loss/train': 1.5282775163650513} 11/07/2021 00:19:25 - INFO - __main__ - Step 21932: {'lr': 0.00047795770216229065, 'samples': 4210944, 'steps': 21931, 'loss/train': 1.4960988759994507} 11/07/2021 00:19:25 - INFO - __main__ - Step 21933: {'lr': 0.0004779555233412548, 'samples': 4211136, 'steps': 21932, 'loss/train': 1.548005223274231} 11/07/2021 00:19:25 - INFO - __main__ - Step 21934: {'lr': 0.0004779533444175058, 'samples': 4211328, 'steps': 21933, 'loss/train': 1.737036108970642} 11/07/2021 00:19:26 - INFO - __main__ - Step 21935: {'lr': 0.00047795116539104445, 'samples': 4211520, 'steps': 21934, 'loss/train': 0.9867969751358032} 11/07/2021 00:19:27 - INFO - __main__ - Step 21936: {'lr': 0.0004779489862618718, 'samples': 4211712, 'steps': 21935, 'loss/train': 1.5135012865066528} 11/07/2021 00:19:27 - INFO - __main__ - Step 21937: {'lr': 0.00047794680702998893, 'samples': 4211904, 'steps': 21936, 'loss/train': 1.431929588317871} 11/07/2021 00:19:28 - INFO - __main__ - Step 21938: {'lr': 0.0004779446276953967, 'samples': 4212096, 'steps': 21937, 'loss/train': 1.4647678136825562} 11/07/2021 00:19:28 - INFO - __main__ - Step 21939: {'lr': 0.00047794244825809614, 'samples': 4212288, 'steps': 21938, 'loss/train': 1.3799632787704468} 11/07/2021 00:19:28 - INFO - __main__ - Step 21940: {'lr': 0.0004779402687180882, 'samples': 4212480, 'steps': 21939, 'loss/train': 1.5208247900009155} 11/07/2021 00:19:29 - INFO - __main__ - Step 21941: {'lr': 0.00047793808907537394, 'samples': 4212672, 'steps': 21940, 'loss/train': 0.8766080141067505} 11/07/2021 00:19:29 - INFO - __main__ - Step 21942: {'lr': 0.0004779359093299543, 'samples': 4212864, 'steps': 21941, 'loss/train': 1.755677342414856} 11/07/2021 00:19:30 - INFO - __main__ - Step 21943: {'lr': 0.00047793372948183024, 'samples': 4213056, 'steps': 21942, 'loss/train': 1.6062308549880981} 11/07/2021 00:19:30 - INFO - __main__ - Step 21944: {'lr': 0.0004779315495310027, 'samples': 4213248, 'steps': 21943, 'loss/train': 1.1306722164154053} 11/07/2021 00:19:31 - INFO - __main__ - Step 21945: {'lr': 0.00047792936947747285, 'samples': 4213440, 'steps': 21944, 'loss/train': 1.6972370147705078} 11/07/2021 00:19:32 - INFO - __main__ - Step 21946: {'lr': 0.00047792718932124147, 'samples': 4213632, 'steps': 21945, 'loss/train': 1.8011847734451294} 11/07/2021 00:19:32 - INFO - __main__ - Step 21947: {'lr': 0.00047792500906230963, 'samples': 4213824, 'steps': 21946, 'loss/train': 1.8816180229187012} 11/07/2021 00:19:32 - INFO - __main__ - Step 21948: {'lr': 0.00047792282870067827, 'samples': 4214016, 'steps': 21947, 'loss/train': 1.5840351581573486} 11/07/2021 00:19:33 - INFO - __main__ - Step 21949: {'lr': 0.0004779206482363484, 'samples': 4214208, 'steps': 21948, 'loss/train': 1.393857479095459} 11/07/2021 00:19:33 - INFO - __main__ - Step 21950: {'lr': 0.000477918467669321, 'samples': 4214400, 'steps': 21949, 'loss/train': 1.2116628885269165} 11/07/2021 00:19:33 - INFO - __main__ - Step 21951: {'lr': 0.0004779162869995971, 'samples': 4214592, 'steps': 21950, 'loss/train': 1.45814049243927} 11/07/2021 00:19:34 - INFO - __main__ - Step 21952: {'lr': 0.00047791410622717757, 'samples': 4214784, 'steps': 21951, 'loss/train': 1.6132930517196655} 11/07/2021 00:19:35 - INFO - __main__ - Step 21953: {'lr': 0.0004779119253520635, 'samples': 4214976, 'steps': 21952, 'loss/train': 0.9097754955291748} 11/07/2021 00:19:35 - INFO - __main__ - Step 21954: {'lr': 0.0004779097443742558, 'samples': 4215168, 'steps': 21953, 'loss/train': 1.3216646909713745} 11/07/2021 00:19:35 - INFO - __main__ - Step 21955: {'lr': 0.0004779075632937556, 'samples': 4215360, 'steps': 21954, 'loss/train': 1.2813609838485718} 11/07/2021 00:19:36 - INFO - __main__ - Step 21956: {'lr': 0.00047790538211056366, 'samples': 4215552, 'steps': 21955, 'loss/train': 1.931445837020874} 11/07/2021 00:19:37 - INFO - __main__ - Step 21957: {'lr': 0.00047790320082468106, 'samples': 4215744, 'steps': 21956, 'loss/train': 1.242431879043579} 11/07/2021 00:19:37 - INFO - __main__ - Step 21958: {'lr': 0.00047790101943610884, 'samples': 4215936, 'steps': 21957, 'loss/train': 1.5784425735473633} 11/07/2021 00:19:37 - INFO - __main__ - Step 21959: {'lr': 0.000477898837944848, 'samples': 4216128, 'steps': 21958, 'loss/train': 1.6640866994857788} 11/07/2021 00:19:38 - INFO - __main__ - Step 21960: {'lr': 0.0004778966563508994, 'samples': 4216320, 'steps': 21959, 'loss/train': 1.7045584917068481} 11/07/2021 00:19:38 - INFO - __main__ - Step 21961: {'lr': 0.00047789447465426406, 'samples': 4216512, 'steps': 21960, 'loss/train': 1.3044613599777222} 11/07/2021 00:19:39 - INFO - __main__ - Step 21962: {'lr': 0.000477892292854943, 'samples': 4216704, 'steps': 21961, 'loss/train': 0.78977370262146} 11/07/2021 00:19:40 - INFO - __main__ - Step 21963: {'lr': 0.00047789011095293723, 'samples': 4216896, 'steps': 21962, 'loss/train': 0.40516743063926697} 11/07/2021 00:19:40 - INFO - __main__ - Step 21964: {'lr': 0.0004778879289482476, 'samples': 4217088, 'steps': 21963, 'loss/train': 1.4810577630996704} 11/07/2021 00:19:40 - INFO - __main__ - Step 21965: {'lr': 0.00047788574684087527, 'samples': 4217280, 'steps': 21964, 'loss/train': 1.5655182600021362} 11/07/2021 00:19:41 - INFO - __main__ - Step 21966: {'lr': 0.0004778835646308211, 'samples': 4217472, 'steps': 21965, 'loss/train': 1.6509648561477661} 11/07/2021 00:19:42 - INFO - __main__ - Step 21967: {'lr': 0.0004778813823180861, 'samples': 4217664, 'steps': 21966, 'loss/train': 1.6567872762680054} 11/07/2021 00:19:42 - INFO - __main__ - Step 21968: {'lr': 0.0004778791999026713, 'samples': 4217856, 'steps': 21967, 'loss/train': 1.728704810142517} 11/07/2021 00:19:42 - INFO - __main__ - Step 21969: {'lr': 0.0004778770173845777, 'samples': 4218048, 'steps': 21968, 'loss/train': 1.5943408012390137} 11/07/2021 00:19:43 - INFO - __main__ - Step 21970: {'lr': 0.00047787483476380613, 'samples': 4218240, 'steps': 21969, 'loss/train': 1.6115553379058838} 11/07/2021 00:19:43 - INFO - __main__ - Step 21971: {'lr': 0.0004778726520403577, 'samples': 4218432, 'steps': 21970, 'loss/train': 2.4777019023895264} 11/07/2021 00:19:44 - INFO - __main__ - Step 21972: {'lr': 0.00047787046921423336, 'samples': 4218624, 'steps': 21971, 'loss/train': 1.0873228311538696} 11/07/2021 00:19:45 - INFO - __main__ - Step 21973: {'lr': 0.00047786828628543416, 'samples': 4218816, 'steps': 21972, 'loss/train': 1.61286199092865} 11/07/2021 00:19:45 - INFO - __main__ - Step 21974: {'lr': 0.00047786610325396096, 'samples': 4219008, 'steps': 21973, 'loss/train': 1.5641791820526123} 11/07/2021 00:19:45 - INFO - __main__ - Step 21975: {'lr': 0.0004778639201198149, 'samples': 4219200, 'steps': 21974, 'loss/train': 1.5207573175430298} 11/07/2021 00:19:46 - INFO - __main__ - Step 21976: {'lr': 0.00047786173688299684, 'samples': 4219392, 'steps': 21975, 'loss/train': 2.1882331371307373} 11/07/2021 00:19:46 - INFO - __main__ - Step 21977: {'lr': 0.00047785955354350776, 'samples': 4219584, 'steps': 21976, 'loss/train': 1.2902092933654785} 11/07/2021 00:19:47 - INFO - __main__ - Step 21978: {'lr': 0.00047785737010134865, 'samples': 4219776, 'steps': 21977, 'loss/train': 1.2863267660140991} 11/07/2021 00:19:47 - INFO - __main__ - Step 21979: {'lr': 0.0004778551865565206, 'samples': 4219968, 'steps': 21978, 'loss/train': 1.791292667388916} 11/07/2021 00:19:48 - INFO - __main__ - Step 21980: {'lr': 0.00047785300290902446, 'samples': 4220160, 'steps': 21979, 'loss/train': 1.2760604619979858} 11/07/2021 00:19:48 - INFO - __main__ - Step 21981: {'lr': 0.0004778508191588613, 'samples': 4220352, 'steps': 21980, 'loss/train': 1.6339510679244995} 11/07/2021 00:19:48 - INFO - __main__ - Step 21982: {'lr': 0.00047784863530603213, 'samples': 4220544, 'steps': 21981, 'loss/train': 1.3932080268859863} 11/07/2021 00:19:49 - INFO - __main__ - Step 21983: {'lr': 0.0004778464513505378, 'samples': 4220736, 'steps': 21982, 'loss/train': 1.6483728885650635} 11/07/2021 00:19:50 - INFO - __main__ - Step 21984: {'lr': 0.0004778442672923794, 'samples': 4220928, 'steps': 21983, 'loss/train': 1.459287166595459} 11/07/2021 00:19:50 - INFO - __main__ - Step 21985: {'lr': 0.0004778420831315579, 'samples': 4221120, 'steps': 21984, 'loss/train': 1.1991043090820312} 11/07/2021 00:19:50 - INFO - __main__ - Step 21986: {'lr': 0.0004778398988680743, 'samples': 4221312, 'steps': 21985, 'loss/train': 1.7007657289505005} 11/07/2021 00:19:51 - INFO - __main__ - Step 21987: {'lr': 0.00047783771450192946, 'samples': 4221504, 'steps': 21986, 'loss/train': 1.4460031986236572} 11/07/2021 00:19:52 - INFO - __main__ - Step 21988: {'lr': 0.00047783553003312456, 'samples': 4221696, 'steps': 21987, 'loss/train': 1.7444876432418823} 11/07/2021 00:19:52 - INFO - __main__ - Step 21989: {'lr': 0.00047783334546166046, 'samples': 4221888, 'steps': 21988, 'loss/train': 1.6182039976119995} 11/07/2021 00:19:52 - INFO - __main__ - Step 21990: {'lr': 0.0004778311607875382, 'samples': 4222080, 'steps': 21989, 'loss/train': 1.2163879871368408} 11/07/2021 00:19:53 - INFO - __main__ - Step 21991: {'lr': 0.0004778289760107587, 'samples': 4222272, 'steps': 21990, 'loss/train': 1.4696317911148071} 11/07/2021 00:19:53 - INFO - __main__ - Step 21992: {'lr': 0.00047782679113132293, 'samples': 4222464, 'steps': 21991, 'loss/train': 1.7996553182601929} 11/07/2021 00:19:54 - INFO - __main__ - Step 21993: {'lr': 0.00047782460614923195, 'samples': 4222656, 'steps': 21992, 'loss/train': 1.3140223026275635} 11/07/2021 00:19:55 - INFO - __main__ - Step 21994: {'lr': 0.00047782242106448675, 'samples': 4222848, 'steps': 21993, 'loss/train': 1.7597932815551758} 11/07/2021 00:19:55 - INFO - __main__ - Step 21995: {'lr': 0.00047782023587708826, 'samples': 4223040, 'steps': 21994, 'loss/train': 1.2252593040466309} 11/07/2021 00:19:55 - INFO - __main__ - Step 21996: {'lr': 0.0004778180505870375, 'samples': 4223232, 'steps': 21995, 'loss/train': 1.802703619003296} 11/07/2021 00:19:56 - INFO - __main__ - Step 21997: {'lr': 0.0004778158651943355, 'samples': 4223424, 'steps': 21996, 'loss/train': 1.5296212434768677} 11/07/2021 00:19:57 - INFO - __main__ - Step 21998: {'lr': 0.0004778136796989831, 'samples': 4223616, 'steps': 21997, 'loss/train': 1.3799338340759277} 11/07/2021 00:19:57 - INFO - __main__ - Step 21999: {'lr': 0.0004778114941009814, 'samples': 4223808, 'steps': 21998, 'loss/train': 1.5359055995941162} 11/07/2021 00:19:58 - INFO - __main__ - Step 22000: {'lr': 0.0004778093084003313, 'samples': 4224000, 'steps': 21999, 'loss/train': 1.4692944288253784} 11/07/2021 00:19:58 - INFO - __main__ - Step 22001: {'lr': 0.00047780712259703394, 'samples': 4224192, 'steps': 22000, 'loss/train': 1.5872262716293335} 11/07/2021 00:19:58 - INFO - __main__ - Step 22002: {'lr': 0.00047780493669109017, 'samples': 4224384, 'steps': 22001, 'loss/train': 1.0057997703552246} 11/07/2021 00:19:59 - INFO - __main__ - Step 22003: {'lr': 0.000477802750682501, 'samples': 4224576, 'steps': 22002, 'loss/train': 1.5224837064743042} 11/07/2021 00:20:00 - INFO - __main__ - Step 22004: {'lr': 0.0004778005645712674, 'samples': 4224768, 'steps': 22003, 'loss/train': 1.6188691854476929} 11/07/2021 00:20:00 - INFO - __main__ - Step 22005: {'lr': 0.00047779837835739043, 'samples': 4224960, 'steps': 22004, 'loss/train': 1.5473153591156006} 11/07/2021 00:20:00 - INFO - __main__ - Step 22006: {'lr': 0.000477796192040871, 'samples': 4225152, 'steps': 22005, 'loss/train': 1.9420137405395508} 11/07/2021 00:20:01 - INFO - __main__ - Step 22007: {'lr': 0.00047779400562171016, 'samples': 4225344, 'steps': 22006, 'loss/train': 1.241431474685669} 11/07/2021 00:20:01 - INFO - __main__ - Step 22008: {'lr': 0.00047779181909990876, 'samples': 4225536, 'steps': 22007, 'loss/train': 1.4020662307739258} 11/07/2021 00:20:02 - INFO - __main__ - Step 22009: {'lr': 0.000477789632475468, 'samples': 4225728, 'steps': 22008, 'loss/train': 1.392614722251892} 11/07/2021 00:20:02 - INFO - __main__ - Step 22010: {'lr': 0.00047778744574838864, 'samples': 4225920, 'steps': 22009, 'loss/train': 1.4451684951782227} 11/07/2021 00:20:03 - INFO - __main__ - Step 22011: {'lr': 0.00047778525891867187, 'samples': 4226112, 'steps': 22010, 'loss/train': 1.806498408317566} 11/07/2021 00:20:03 - INFO - __main__ - Step 22012: {'lr': 0.00047778307198631856, 'samples': 4226304, 'steps': 22011, 'loss/train': 1.545744776725769} 11/07/2021 00:20:03 - INFO - __main__ - Step 22013: {'lr': 0.00047778088495132963, 'samples': 4226496, 'steps': 22012, 'loss/train': 2.2597596645355225} 11/07/2021 00:20:04 - INFO - __main__ - Step 22014: {'lr': 0.0004777786978137062, 'samples': 4226688, 'steps': 22013, 'loss/train': 2.303285598754883} 11/07/2021 00:20:05 - INFO - __main__ - Step 22015: {'lr': 0.00047777651057344915, 'samples': 4226880, 'steps': 22014, 'loss/train': 1.7788605690002441} 11/07/2021 00:20:05 - INFO - __main__ - Step 22016: {'lr': 0.0004777743232305596, 'samples': 4227072, 'steps': 22015, 'loss/train': 1.471137285232544} 11/07/2021 00:20:06 - INFO - __main__ - Step 22017: {'lr': 0.00047777213578503844, 'samples': 4227264, 'steps': 22016, 'loss/train': 1.7673450708389282} 11/07/2021 00:20:06 - INFO - __main__ - Step 22018: {'lr': 0.0004777699482368867, 'samples': 4227456, 'steps': 22017, 'loss/train': 1.6683404445648193} 11/07/2021 00:20:07 - INFO - __main__ - Step 22019: {'lr': 0.00047776776058610525, 'samples': 4227648, 'steps': 22018, 'loss/train': 1.5695788860321045} 11/07/2021 00:20:07 - INFO - __main__ - Step 22020: {'lr': 0.0004777655728326952, 'samples': 4227840, 'steps': 22019, 'loss/train': 1.6337982416152954} 11/07/2021 00:20:08 - INFO - __main__ - Step 22021: {'lr': 0.0004777633849766575, 'samples': 4228032, 'steps': 22020, 'loss/train': 1.395537257194519} 11/07/2021 00:20:08 - INFO - __main__ - Step 22022: {'lr': 0.00047776119701799317, 'samples': 4228224, 'steps': 22021, 'loss/train': 1.1082690954208374} 11/07/2021 00:20:08 - INFO - __main__ - Step 22023: {'lr': 0.0004777590089567031, 'samples': 4228416, 'steps': 22022, 'loss/train': 1.4885226488113403} 11/07/2021 00:20:09 - INFO - __main__ - Step 22024: {'lr': 0.00047775682079278836, 'samples': 4228608, 'steps': 22023, 'loss/train': 2.176821231842041} 11/07/2021 00:20:10 - INFO - __main__ - Step 22025: {'lr': 0.0004777546325262499, 'samples': 4228800, 'steps': 22024, 'loss/train': 1.4476275444030762} 11/07/2021 00:20:10 - INFO - __main__ - Step 22026: {'lr': 0.00047775244415708873, 'samples': 4228992, 'steps': 22025, 'loss/train': 1.2312343120574951} 11/07/2021 00:20:10 - INFO - __main__ - Step 22027: {'lr': 0.0004777502556853058, 'samples': 4229184, 'steps': 22026, 'loss/train': 0.7627279162406921} 11/07/2021 00:20:11 - INFO - __main__ - Step 22028: {'lr': 0.00047774806711090213, 'samples': 4229376, 'steps': 22027, 'loss/train': 1.5808088779449463} 11/07/2021 00:20:12 - INFO - __main__ - Step 22029: {'lr': 0.0004777458784338787, 'samples': 4229568, 'steps': 22028, 'loss/train': 1.5712063312530518} 11/07/2021 00:20:12 - INFO - __main__ - Step 22030: {'lr': 0.00047774368965423653, 'samples': 4229760, 'steps': 22029, 'loss/train': 2.2965681552886963} 11/07/2021 00:20:12 - INFO - __main__ - Step 22031: {'lr': 0.0004777415007719765, 'samples': 4229952, 'steps': 22030, 'loss/train': 1.8073399066925049} 11/07/2021 00:20:13 - INFO - __main__ - Step 22032: {'lr': 0.00047773931178709975, 'samples': 4230144, 'steps': 22031, 'loss/train': 1.4591764211654663} 11/07/2021 00:20:13 - INFO - __main__ - Step 22033: {'lr': 0.00047773712269960714, 'samples': 4230336, 'steps': 22032, 'loss/train': 1.5156131982803345} 11/07/2021 00:20:14 - INFO - __main__ - Step 22034: {'lr': 0.00047773493350949963, 'samples': 4230528, 'steps': 22033, 'loss/train': 2.4903831481933594} 11/07/2021 00:20:15 - INFO - __main__ - Step 22035: {'lr': 0.00047773274421677834, 'samples': 4230720, 'steps': 22034, 'loss/train': 1.9549790620803833} 11/07/2021 00:20:15 - INFO - __main__ - Step 22036: {'lr': 0.0004777305548214442, 'samples': 4230912, 'steps': 22035, 'loss/train': 1.8083499670028687} 11/07/2021 00:20:15 - INFO - __main__ - Step 22037: {'lr': 0.0004777283653234982, 'samples': 4231104, 'steps': 22036, 'loss/train': 1.5485557317733765} 11/07/2021 00:20:16 - INFO - __main__ - Step 22038: {'lr': 0.00047772617572294123, 'samples': 4231296, 'steps': 22037, 'loss/train': 1.876652717590332} 11/07/2021 00:20:16 - INFO - __main__ - Step 22039: {'lr': 0.0004777239860197744, 'samples': 4231488, 'steps': 22038, 'loss/train': 1.5153416395187378} 11/07/2021 00:20:17 - INFO - __main__ - Step 22040: {'lr': 0.0004777217962139987, 'samples': 4231680, 'steps': 22039, 'loss/train': 1.3404922485351562} 11/07/2021 00:20:17 - INFO - __main__ - Step 22041: {'lr': 0.000477719606305615, 'samples': 4231872, 'steps': 22040, 'loss/train': 1.777071475982666} 11/07/2021 00:20:18 - INFO - __main__ - Step 22042: {'lr': 0.0004777174162946244, 'samples': 4232064, 'steps': 22041, 'loss/train': 1.7243245840072632} 11/07/2021 00:20:18 - INFO - __main__ - Step 22043: {'lr': 0.0004777152261810279, 'samples': 4232256, 'steps': 22042, 'loss/train': 1.7015565633773804} 11/07/2021 00:20:18 - INFO - __main__ - Step 22044: {'lr': 0.0004777130359648263, 'samples': 4232448, 'steps': 22043, 'loss/train': 0.8857391476631165} 11/07/2021 00:20:20 - INFO - __main__ - Step 22045: {'lr': 0.0004777108456460208, 'samples': 4232640, 'steps': 22044, 'loss/train': 1.6511143445968628} 11/07/2021 00:20:20 - INFO - __main__ - Step 22046: {'lr': 0.00047770865522461233, 'samples': 4232832, 'steps': 22045, 'loss/train': 2.004149913787842} 11/07/2021 00:20:20 - INFO - __main__ - Step 22047: {'lr': 0.0004777064647006018, 'samples': 4233024, 'steps': 22046, 'loss/train': 1.051743507385254} 11/07/2021 00:20:21 - INFO - __main__ - Step 22048: {'lr': 0.0004777042740739903, 'samples': 4233216, 'steps': 22047, 'loss/train': 0.2091219127178192} 11/07/2021 00:20:21 - INFO - __main__ - Step 22049: {'lr': 0.0004777020833447787, 'samples': 4233408, 'steps': 22048, 'loss/train': 1.2629278898239136} 11/07/2021 00:20:22 - INFO - __main__ - Step 22050: {'lr': 0.0004776998925129681, 'samples': 4233600, 'steps': 22049, 'loss/train': 1.63240647315979} 11/07/2021 00:20:23 - INFO - __main__ - Step 22051: {'lr': 0.0004776977015785595, 'samples': 4233792, 'steps': 22050, 'loss/train': 1.2668564319610596} 11/07/2021 00:20:23 - INFO - __main__ - Step 22052: {'lr': 0.0004776955105415537, 'samples': 4233984, 'steps': 22051, 'loss/train': 0.7824997305870056} 11/07/2021 00:20:23 - INFO - __main__ - Step 22053: {'lr': 0.00047769331940195194, 'samples': 4234176, 'steps': 22052, 'loss/train': 1.4100098609924316} 11/07/2021 00:20:24 - INFO - __main__ - Step 22054: {'lr': 0.00047769112815975503, 'samples': 4234368, 'steps': 22053, 'loss/train': 1.5036020278930664} 11/07/2021 00:20:25 - INFO - __main__ - Step 22055: {'lr': 0.00047768893681496397, 'samples': 4234560, 'steps': 22054, 'loss/train': 1.7130056619644165} 11/07/2021 00:20:25 - INFO - __main__ - Step 22056: {'lr': 0.00047768674536757984, 'samples': 4234752, 'steps': 22055, 'loss/train': 1.361931562423706} 11/07/2021 00:20:25 - INFO - __main__ - Step 22057: {'lr': 0.00047768455381760357, 'samples': 4234944, 'steps': 22056, 'loss/train': 1.6314910650253296} 11/07/2021 00:20:26 - INFO - __main__ - Step 22058: {'lr': 0.00047768236216503613, 'samples': 4235136, 'steps': 22057, 'loss/train': 1.7460706233978271} 11/07/2021 00:20:26 - INFO - __main__ - Step 22059: {'lr': 0.00047768017040987856, 'samples': 4235328, 'steps': 22058, 'loss/train': 1.787089228630066} 11/07/2021 00:20:26 - INFO - __main__ - Step 22060: {'lr': 0.0004776779785521318, 'samples': 4235520, 'steps': 22059, 'loss/train': 0.6601639986038208} 11/07/2021 00:20:27 - INFO - __main__ - Step 22061: {'lr': 0.0004776757865917969, 'samples': 4235712, 'steps': 22060, 'loss/train': 1.0703409910202026} 11/07/2021 00:20:28 - INFO - __main__ - Step 22062: {'lr': 0.0004776735945288747, 'samples': 4235904, 'steps': 22061, 'loss/train': 1.8439667224884033} 11/07/2021 00:20:28 - INFO - __main__ - Step 22063: {'lr': 0.00047767140236336635, 'samples': 4236096, 'steps': 22062, 'loss/train': 1.5141043663024902} 11/07/2021 00:20:28 - INFO - __main__ - Step 22064: {'lr': 0.00047766921009527284, 'samples': 4236288, 'steps': 22063, 'loss/train': 1.7506041526794434} 11/07/2021 00:20:29 - INFO - __main__ - Step 22065: {'lr': 0.00047766701772459505, 'samples': 4236480, 'steps': 22064, 'loss/train': 1.5079345703125} 11/07/2021 00:20:30 - INFO - __main__ - Step 22066: {'lr': 0.00047766482525133405, 'samples': 4236672, 'steps': 22065, 'loss/train': 1.629859209060669} 11/07/2021 00:20:30 - INFO - __main__ - Step 22067: {'lr': 0.00047766263267549073, 'samples': 4236864, 'steps': 22066, 'loss/train': 1.2172280550003052} 11/07/2021 00:20:31 - INFO - __main__ - Step 22068: {'lr': 0.0004776604399970661, 'samples': 4237056, 'steps': 22067, 'loss/train': 1.7213703393936157} 11/07/2021 00:20:31 - INFO - __main__ - Step 22069: {'lr': 0.0004776582472160613, 'samples': 4237248, 'steps': 22068, 'loss/train': 1.041486382484436} 11/07/2021 00:20:32 - INFO - __main__ - Step 22070: {'lr': 0.0004776560543324772, 'samples': 4237440, 'steps': 22069, 'loss/train': 1.5691708326339722} 11/07/2021 00:20:33 - INFO - __main__ - Step 22071: {'lr': 0.0004776538613463147, 'samples': 4237632, 'steps': 22070, 'loss/train': 0.24791944026947021} 11/07/2021 00:20:33 - INFO - __main__ - Step 22072: {'lr': 0.00047765166825757487, 'samples': 4237824, 'steps': 22071, 'loss/train': 1.8586312532424927} 11/07/2021 00:20:33 - INFO - __main__ - Step 22073: {'lr': 0.00047764947506625887, 'samples': 4238016, 'steps': 22072, 'loss/train': 1.8590911626815796} 11/07/2021 00:20:34 - INFO - __main__ - Step 22074: {'lr': 0.00047764728177236736, 'samples': 4238208, 'steps': 22073, 'loss/train': 1.7774806022644043} 11/07/2021 00:20:34 - INFO - __main__ - Step 22075: {'lr': 0.0004776450883759016, 'samples': 4238400, 'steps': 22074, 'loss/train': 1.0848828554153442} 11/07/2021 00:20:35 - INFO - __main__ - Step 22076: {'lr': 0.0004776428948768625, 'samples': 4238592, 'steps': 22075, 'loss/train': 1.9253149032592773} 11/07/2021 00:20:35 - INFO - __main__ - Step 22077: {'lr': 0.00047764070127525096, 'samples': 4238784, 'steps': 22076, 'loss/train': 1.2464487552642822} 11/07/2021 00:20:36 - INFO - __main__ - Step 22078: {'lr': 0.00047763850757106803, 'samples': 4238976, 'steps': 22077, 'loss/train': 1.8914134502410889} 11/07/2021 00:20:36 - INFO - __main__ - Step 22079: {'lr': 0.0004776363137643147, 'samples': 4239168, 'steps': 22078, 'loss/train': 1.6272341012954712} 11/07/2021 00:20:37 - INFO - __main__ - Step 22080: {'lr': 0.000477634119854992, 'samples': 4239360, 'steps': 22079, 'loss/train': 1.0225515365600586} 11/07/2021 00:20:37 - INFO - __main__ - Step 22081: {'lr': 0.00047763192584310087, 'samples': 4239552, 'steps': 22080, 'loss/train': 1.4733610153198242} 11/07/2021 00:20:38 - INFO - __main__ - Step 22082: {'lr': 0.0004776297317286423, 'samples': 4239744, 'steps': 22081, 'loss/train': 1.6918233633041382} 11/07/2021 00:20:38 - INFO - __main__ - Step 22083: {'lr': 0.00047762753751161725, 'samples': 4239936, 'steps': 22082, 'loss/train': 1.4075119495391846} 11/07/2021 00:20:39 - INFO - __main__ - Step 22084: {'lr': 0.0004776253431920268, 'samples': 4240128, 'steps': 22083, 'loss/train': 1.3235876560211182} 11/07/2021 00:20:39 - INFO - __main__ - Step 22085: {'lr': 0.00047762314876987185, 'samples': 4240320, 'steps': 22084, 'loss/train': 1.7621089220046997} 11/07/2021 00:20:39 - INFO - __main__ - Step 22086: {'lr': 0.0004776209542451534, 'samples': 4240512, 'steps': 22085, 'loss/train': 1.2438160181045532} 11/07/2021 00:20:40 - INFO - __main__ - Step 22087: {'lr': 0.0004776187596178725, 'samples': 4240704, 'steps': 22086, 'loss/train': 1.4515008926391602} 11/07/2021 00:20:41 - INFO - __main__ - Step 22088: {'lr': 0.00047761656488803006, 'samples': 4240896, 'steps': 22087, 'loss/train': 1.8013110160827637} 11/07/2021 00:20:41 - INFO - __main__ - Step 22089: {'lr': 0.00047761437005562716, 'samples': 4241088, 'steps': 22088, 'loss/train': 1.7310346364974976} 11/07/2021 00:20:41 - INFO - __main__ - Step 22090: {'lr': 0.00047761217512066475, 'samples': 4241280, 'steps': 22089, 'loss/train': 1.422913908958435} 11/07/2021 00:20:42 - INFO - __main__ - Step 22091: {'lr': 0.0004776099800831437, 'samples': 4241472, 'steps': 22090, 'loss/train': 0.4168950617313385} 11/07/2021 00:20:43 - INFO - __main__ - Step 22092: {'lr': 0.0004776077849430652, 'samples': 4241664, 'steps': 22091, 'loss/train': 1.9819186925888062} 11/07/2021 00:20:43 - INFO - __main__ - Step 22093: {'lr': 0.0004776055897004301, 'samples': 4241856, 'steps': 22092, 'loss/train': 0.8312280178070068} 11/07/2021 00:20:44 - INFO - __main__ - Step 22094: {'lr': 0.0004776033943552395, 'samples': 4242048, 'steps': 22093, 'loss/train': 1.5974825620651245} 11/07/2021 00:20:44 - INFO - __main__ - Step 22095: {'lr': 0.0004776011989074943, 'samples': 4242240, 'steps': 22094, 'loss/train': 1.7879666090011597} 11/07/2021 00:20:44 - INFO - __main__ - Step 22096: {'lr': 0.00047759900335719543, 'samples': 4242432, 'steps': 22095, 'loss/train': 1.5695561170578003} 11/07/2021 00:20:45 - INFO - __main__ - Step 22097: {'lr': 0.00047759680770434405, 'samples': 4242624, 'steps': 22096, 'loss/train': 1.3009862899780273} 11/07/2021 00:20:46 - INFO - __main__ - Step 22098: {'lr': 0.00047759461194894103, 'samples': 4242816, 'steps': 22097, 'loss/train': 1.4749153852462769} 11/07/2021 00:20:46 - INFO - __main__ - Step 22099: {'lr': 0.00047759241609098734, 'samples': 4243008, 'steps': 22098, 'loss/train': 1.5157296657562256} 11/07/2021 00:20:46 - INFO - __main__ - Step 22100: {'lr': 0.00047759022013048417, 'samples': 4243200, 'steps': 22099, 'loss/train': 1.2801927328109741} 11/07/2021 00:20:47 - INFO - __main__ - Step 22101: {'lr': 0.00047758802406743217, 'samples': 4243392, 'steps': 22100, 'loss/train': 1.891735553741455} 11/07/2021 00:20:48 - INFO - __main__ - Step 22102: {'lr': 0.0004775858279018326, 'samples': 4243584, 'steps': 22101, 'loss/train': 1.7186956405639648} 11/07/2021 00:20:48 - INFO - __main__ - Step 22103: {'lr': 0.0004775836316336864, 'samples': 4243776, 'steps': 22102, 'loss/train': 1.5562268495559692} 11/07/2021 00:20:49 - INFO - __main__ - Step 22104: {'lr': 0.00047758143526299446, 'samples': 4243968, 'steps': 22103, 'loss/train': 1.8075950145721436} 11/07/2021 00:20:49 - INFO - __main__ - Step 22105: {'lr': 0.0004775792387897579, 'samples': 4244160, 'steps': 22104, 'loss/train': 0.24321593344211578} 11/07/2021 00:20:49 - INFO - __main__ - Step 22106: {'lr': 0.0004775770422139776, 'samples': 4244352, 'steps': 22105, 'loss/train': 1.5821597576141357} 11/07/2021 00:20:50 - INFO - __main__ - Step 22107: {'lr': 0.00047757484553565465, 'samples': 4244544, 'steps': 22106, 'loss/train': 3.4636902809143066} 11/07/2021 00:20:51 - INFO - __main__ - Step 22108: {'lr': 0.00047757264875478996, 'samples': 4244736, 'steps': 22107, 'loss/train': 1.1725666522979736} 11/07/2021 00:20:51 - INFO - __main__ - Step 22109: {'lr': 0.0004775704518713845, 'samples': 4244928, 'steps': 22108, 'loss/train': 1.034935712814331} 11/07/2021 00:20:52 - INFO - __main__ - Step 22110: {'lr': 0.0004775682548854394, 'samples': 4245120, 'steps': 22109, 'loss/train': 1.4282327890396118} 11/07/2021 00:20:52 - INFO - __main__ - Step 22111: {'lr': 0.0004775660577969555, 'samples': 4245312, 'steps': 22110, 'loss/train': 0.2226470410823822} 11/07/2021 00:20:52 - INFO - __main__ - Step 22112: {'lr': 0.0004775638606059338, 'samples': 4245504, 'steps': 22111, 'loss/train': 1.1546566486358643} 11/07/2021 00:20:53 - INFO - __main__ - Step 22113: {'lr': 0.00047756166331237545, 'samples': 4245696, 'steps': 22112, 'loss/train': 1.7638152837753296} 11/07/2021 00:20:54 - INFO - __main__ - Step 22114: {'lr': 0.00047755946591628126, 'samples': 4245888, 'steps': 22113, 'loss/train': 1.632278561592102} 11/07/2021 00:20:54 - INFO - __main__ - Step 22115: {'lr': 0.00047755726841765224, 'samples': 4246080, 'steps': 22114, 'loss/train': 1.7050201892852783} 11/07/2021 00:20:54 - INFO - __main__ - Step 22116: {'lr': 0.0004775550708164895, 'samples': 4246272, 'steps': 22115, 'loss/train': 1.5752651691436768} 11/07/2021 00:20:55 - INFO - __main__ - Step 22117: {'lr': 0.00047755287311279394, 'samples': 4246464, 'steps': 22116, 'loss/train': 1.260236144065857} 11/07/2021 00:20:56 - INFO - __main__ - Step 22118: {'lr': 0.00047755067530656656, 'samples': 4246656, 'steps': 22117, 'loss/train': 1.4997432231903076} 11/07/2021 00:20:56 - INFO - __main__ - Step 22119: {'lr': 0.00047754847739780835, 'samples': 4246848, 'steps': 22118, 'loss/train': 1.6824851036071777} 11/07/2021 00:20:57 - INFO - __main__ - Step 22120: {'lr': 0.0004775462793865203, 'samples': 4247040, 'steps': 22119, 'loss/train': 1.5758624076843262} 11/07/2021 00:20:57 - INFO - __main__ - Step 22121: {'lr': 0.00047754408127270346, 'samples': 4247232, 'steps': 22120, 'loss/train': 1.820260763168335} 11/07/2021 00:20:57 - INFO - __main__ - Step 22122: {'lr': 0.0004775418830563587, 'samples': 4247424, 'steps': 22121, 'loss/train': 1.8502987623214722} 11/07/2021 00:20:58 - INFO - __main__ - Step 22123: {'lr': 0.0004775396847374871, 'samples': 4247616, 'steps': 22122, 'loss/train': 1.7947843074798584} 11/07/2021 00:20:59 - INFO - __main__ - Step 22124: {'lr': 0.0004775374863160896, 'samples': 4247808, 'steps': 22123, 'loss/train': 1.7967413663864136} 11/07/2021 00:20:59 - INFO - __main__ - Step 22125: {'lr': 0.0004775352877921673, 'samples': 4248000, 'steps': 22124, 'loss/train': 1.6300139427185059} 11/07/2021 00:20:59 - INFO - __main__ - Step 22126: {'lr': 0.000477533089165721, 'samples': 4248192, 'steps': 22125, 'loss/train': 0.9215333461761475} 11/07/2021 00:21:00 - INFO - __main__ - Step 22127: {'lr': 0.0004775308904367519, 'samples': 4248384, 'steps': 22126, 'loss/train': 1.7427397966384888} 11/07/2021 00:21:00 - INFO - __main__ - Step 22128: {'lr': 0.0004775286916052609, 'samples': 4248576, 'steps': 22127, 'loss/train': 1.6542869806289673} 11/07/2021 00:21:01 - INFO - __main__ - Step 22129: {'lr': 0.00047752649267124894, 'samples': 4248768, 'steps': 22128, 'loss/train': 1.5671731233596802} 11/07/2021 00:21:01 - INFO - __main__ - Step 22130: {'lr': 0.0004775242936347171, 'samples': 4248960, 'steps': 22129, 'loss/train': 1.698813796043396} 11/07/2021 00:21:02 - INFO - __main__ - Step 22131: {'lr': 0.0004775220944956662, 'samples': 4249152, 'steps': 22130, 'loss/train': 0.2281908243894577} 11/07/2021 00:21:02 - INFO - __main__ - Step 22132: {'lr': 0.00047751989525409745, 'samples': 4249344, 'steps': 22131, 'loss/train': 1.9468735456466675} 11/07/2021 00:21:02 - INFO - __main__ - Step 22133: {'lr': 0.0004775176959100117, 'samples': 4249536, 'steps': 22132, 'loss/train': 1.6748098134994507} 11/07/2021 00:21:04 - INFO - __main__ - Step 22134: {'lr': 0.00047751549646341007, 'samples': 4249728, 'steps': 22133, 'loss/train': 1.442635178565979} 11/07/2021 00:21:04 - INFO - __main__ - Step 22135: {'lr': 0.0004775132969142934, 'samples': 4249920, 'steps': 22134, 'loss/train': 1.737351417541504} 11/07/2021 00:21:04 - INFO - __main__ - Step 22136: {'lr': 0.00047751109726266273, 'samples': 4250112, 'steps': 22135, 'loss/train': 1.0580017566680908} 11/07/2021 00:21:05 - INFO - __main__ - Step 22137: {'lr': 0.00047750889750851913, 'samples': 4250304, 'steps': 22136, 'loss/train': 0.43822312355041504} 11/07/2021 00:21:05 - INFO - __main__ - Step 22138: {'lr': 0.0004775066976518635, 'samples': 4250496, 'steps': 22137, 'loss/train': 1.7369829416275024} 11/07/2021 00:21:06 - INFO - __main__ - Step 22139: {'lr': 0.00047750449769269686, 'samples': 4250688, 'steps': 22138, 'loss/train': 1.1845433712005615} 11/07/2021 00:21:06 - INFO - __main__ - Step 22140: {'lr': 0.0004775022976310203, 'samples': 4250880, 'steps': 22139, 'loss/train': 1.6305336952209473} 11/07/2021 00:21:07 - INFO - __main__ - Step 22141: {'lr': 0.0004775000974668345, 'samples': 4251072, 'steps': 22140, 'loss/train': 1.4394451379776} 11/07/2021 00:21:07 - INFO - __main__ - Step 22142: {'lr': 0.00047749789720014085, 'samples': 4251264, 'steps': 22141, 'loss/train': 1.3838160037994385} 11/07/2021 00:21:07 - INFO - __main__ - Step 22143: {'lr': 0.00047749569683094015, 'samples': 4251456, 'steps': 22142, 'loss/train': 1.30197012424469} 11/07/2021 00:21:08 - INFO - __main__ - Step 22144: {'lr': 0.00047749349635923334, 'samples': 4251648, 'steps': 22143, 'loss/train': 1.6408462524414062} 11/07/2021 00:21:09 - INFO - __main__ - Step 22145: {'lr': 0.0004774912957850215, 'samples': 4251840, 'steps': 22144, 'loss/train': 0.2188931703567505} 11/07/2021 00:21:09 - INFO - __main__ - Step 22146: {'lr': 0.0004774890951083055, 'samples': 4252032, 'steps': 22145, 'loss/train': 2.0885589122772217} 11/07/2021 00:21:09 - INFO - __main__ - Step 22147: {'lr': 0.00047748689432908654, 'samples': 4252224, 'steps': 22146, 'loss/train': 1.8011268377304077} 11/07/2021 00:21:10 - INFO - __main__ - Step 22148: {'lr': 0.00047748469344736547, 'samples': 4252416, 'steps': 22147, 'loss/train': 2.0035457611083984} 11/07/2021 00:21:10 - INFO - __main__ - Step 22149: {'lr': 0.00047748249246314323, 'samples': 4252608, 'steps': 22148, 'loss/train': 1.6972697973251343} 11/07/2021 00:21:11 - INFO - __main__ - Step 22150: {'lr': 0.000477480291376421, 'samples': 4252800, 'steps': 22149, 'loss/train': 1.4715518951416016} 11/07/2021 00:21:12 - INFO - __main__ - Step 22151: {'lr': 0.0004774780901871996, 'samples': 4252992, 'steps': 22150, 'loss/train': 1.3390496969223022} 11/07/2021 00:21:12 - INFO - __main__ - Step 22152: {'lr': 0.0004774758888954801, 'samples': 4253184, 'steps': 22151, 'loss/train': 1.2156263589859009} 11/07/2021 00:21:12 - INFO - __main__ - Step 22153: {'lr': 0.00047747368750126345, 'samples': 4253376, 'steps': 22152, 'loss/train': 1.6150118112564087} 11/07/2021 00:21:13 - INFO - __main__ - Step 22154: {'lr': 0.0004774714860045507, 'samples': 4253568, 'steps': 22153, 'loss/train': 1.8437851667404175} 11/07/2021 00:21:14 - INFO - __main__ - Step 22155: {'lr': 0.0004774692844053428, 'samples': 4253760, 'steps': 22154, 'loss/train': 1.99979567527771} 11/07/2021 00:21:14 - INFO - __main__ - Step 22156: {'lr': 0.00047746708270364073, 'samples': 4253952, 'steps': 22155, 'loss/train': 1.418305516242981} 11/07/2021 00:21:14 - INFO - __main__ - Step 22157: {'lr': 0.0004774648808994455, 'samples': 4254144, 'steps': 22156, 'loss/train': 1.7593319416046143} 11/07/2021 00:21:15 - INFO - __main__ - Step 22158: {'lr': 0.0004774626789927582, 'samples': 4254336, 'steps': 22157, 'loss/train': 1.7074940204620361} 11/07/2021 00:21:15 - INFO - __main__ - Step 22159: {'lr': 0.0004774604769835796, 'samples': 4254528, 'steps': 22158, 'loss/train': 1.7265609502792358} 11/07/2021 00:21:16 - INFO - __main__ - Step 22160: {'lr': 0.00047745827487191087, 'samples': 4254720, 'steps': 22159, 'loss/train': 1.7802273035049438} 11/07/2021 00:21:17 - INFO - __main__ - Step 22161: {'lr': 0.00047745607265775293, 'samples': 4254912, 'steps': 22160, 'loss/train': 1.9270914793014526} 11/07/2021 00:21:17 - INFO - __main__ - Step 22162: {'lr': 0.0004774538703411069, 'samples': 4255104, 'steps': 22161, 'loss/train': 1.74415922164917} 11/07/2021 00:21:17 - INFO - __main__ - Step 22163: {'lr': 0.00047745166792197353, 'samples': 4255296, 'steps': 22162, 'loss/train': 1.3474476337432861} 11/07/2021 00:21:18 - INFO - __main__ - Step 22164: {'lr': 0.000477449465400354, 'samples': 4255488, 'steps': 22163, 'loss/train': 1.540534496307373} 11/07/2021 00:21:19 - INFO - __main__ - Step 22165: {'lr': 0.00047744726277624926, 'samples': 4255680, 'steps': 22164, 'loss/train': 1.7479281425476074} 11/07/2021 00:21:19 - INFO - __main__ - Step 22166: {'lr': 0.00047744506004966024, 'samples': 4255872, 'steps': 22165, 'loss/train': 1.7287943363189697} 11/07/2021 00:21:19 - INFO - __main__ - Step 22167: {'lr': 0.00047744285722058804, 'samples': 4256064, 'steps': 22166, 'loss/train': 1.6895420551300049} 11/07/2021 00:21:20 - INFO - __main__ - Step 22168: {'lr': 0.0004774406542890336, 'samples': 4256256, 'steps': 22167, 'loss/train': 1.5301826000213623} 11/07/2021 00:21:20 - INFO - __main__ - Step 22169: {'lr': 0.0004774384512549979, 'samples': 4256448, 'steps': 22168, 'loss/train': 1.6606671810150146} 11/07/2021 00:21:20 - INFO - __main__ - Step 22170: {'lr': 0.00047743624811848195, 'samples': 4256640, 'steps': 22169, 'loss/train': 1.5106760263442993} 11/07/2021 00:21:21 - INFO - __main__ - Step 22171: {'lr': 0.00047743404487948673, 'samples': 4256832, 'steps': 22170, 'loss/train': 2.1511363983154297} 11/07/2021 00:21:22 - INFO - __main__ - Step 22172: {'lr': 0.0004774318415380132, 'samples': 4257024, 'steps': 22171, 'loss/train': 1.799678087234497} 11/07/2021 00:21:22 - INFO - __main__ - Step 22173: {'lr': 0.0004774296380940625, 'samples': 4257216, 'steps': 22172, 'loss/train': 1.9480088949203491} 11/07/2021 00:21:23 - INFO - __main__ - Step 22174: {'lr': 0.0004774274345476354, 'samples': 4257408, 'steps': 22173, 'loss/train': 1.6413216590881348} 11/07/2021 00:21:23 - INFO - __main__ - Step 22175: {'lr': 0.00047742523089873304, 'samples': 4257600, 'steps': 22174, 'loss/train': 1.6591426134109497} 11/07/2021 00:21:24 - INFO - __main__ - Step 22176: {'lr': 0.0004774230271473564, 'samples': 4257792, 'steps': 22175, 'loss/train': 1.1572656631469727} 11/07/2021 00:21:24 - INFO - __main__ - Step 22177: {'lr': 0.00047742082329350644, 'samples': 4257984, 'steps': 22176, 'loss/train': 1.5521352291107178} 11/07/2021 00:21:25 - INFO - __main__ - Step 22178: {'lr': 0.0004774186193371841, 'samples': 4258176, 'steps': 22177, 'loss/train': 0.964850664138794} 11/07/2021 00:21:25 - INFO - __main__ - Step 22179: {'lr': 0.00047741641527839054, 'samples': 4258368, 'steps': 22178, 'loss/train': 1.0296980142593384} 11/07/2021 00:21:25 - INFO - __main__ - Step 22180: {'lr': 0.00047741421111712666, 'samples': 4258560, 'steps': 22179, 'loss/train': 1.7389425039291382} 11/07/2021 00:21:26 - INFO - __main__ - Step 22181: {'lr': 0.00047741200685339337, 'samples': 4258752, 'steps': 22180, 'loss/train': 1.7903101444244385} 11/07/2021 00:21:27 - INFO - __main__ - Step 22182: {'lr': 0.0004774098024871918, 'samples': 4258944, 'steps': 22181, 'loss/train': 1.5867414474487305} 11/07/2021 00:21:27 - INFO - __main__ - Step 22183: {'lr': 0.00047740759801852284, 'samples': 4259136, 'steps': 22182, 'loss/train': 1.6548216342926025} 11/07/2021 00:21:27 - INFO - __main__ - Step 22184: {'lr': 0.00047740539344738754, 'samples': 4259328, 'steps': 22183, 'loss/train': 1.6218554973602295} 11/07/2021 00:21:28 - INFO - __main__ - Step 22185: {'lr': 0.00047740318877378685, 'samples': 4259520, 'steps': 22184, 'loss/train': 1.144287347793579} 11/07/2021 00:21:28 - INFO - __main__ - Step 22186: {'lr': 0.00047740098399772185, 'samples': 4259712, 'steps': 22185, 'loss/train': 0.4743185043334961} 11/07/2021 00:21:29 - INFO - __main__ - Step 22187: {'lr': 0.0004773987791191935, 'samples': 4259904, 'steps': 22186, 'loss/train': 1.6326481103897095} 11/07/2021 00:21:29 - INFO - __main__ - Step 22188: {'lr': 0.0004773965741382027, 'samples': 4260096, 'steps': 22187, 'loss/train': 2.2415566444396973} 11/07/2021 00:21:30 - INFO - __main__ - Step 22189: {'lr': 0.00047739436905475054, 'samples': 4260288, 'steps': 22188, 'loss/train': 0.9503546357154846} 11/07/2021 00:21:30 - INFO - __main__ - Step 22190: {'lr': 0.00047739216386883797, 'samples': 4260480, 'steps': 22189, 'loss/train': 1.4760106801986694} 11/07/2021 00:21:31 - INFO - __main__ - Step 22191: {'lr': 0.000477389958580466, 'samples': 4260672, 'steps': 22190, 'loss/train': 1.2762712240219116} 11/07/2021 00:21:32 - INFO - __main__ - Step 22192: {'lr': 0.0004773877531896356, 'samples': 4260864, 'steps': 22191, 'loss/train': 1.569229006767273} 11/07/2021 00:21:32 - INFO - __main__ - Step 22193: {'lr': 0.00047738554769634784, 'samples': 4261056, 'steps': 22192, 'loss/train': 1.164962887763977} 11/07/2021 00:21:32 - INFO - __main__ - Step 22194: {'lr': 0.00047738334210060366, 'samples': 4261248, 'steps': 22193, 'loss/train': 1.5791643857955933} 11/07/2021 00:21:33 - INFO - __main__ - Step 22195: {'lr': 0.000477381136402404, 'samples': 4261440, 'steps': 22194, 'loss/train': 1.462497591972351} 11/07/2021 00:21:33 - INFO - __main__ - Step 22196: {'lr': 0.00047737893060175, 'samples': 4261632, 'steps': 22195, 'loss/train': 1.5324187278747559} 11/07/2021 00:21:34 - INFO - __main__ - Step 22197: {'lr': 0.00047737672469864246, 'samples': 4261824, 'steps': 22196, 'loss/train': 2.1199514865875244} 11/07/2021 00:21:35 - INFO - __main__ - Step 22198: {'lr': 0.0004773745186930825, 'samples': 4262016, 'steps': 22197, 'loss/train': 1.5264184474945068} 11/07/2021 00:21:35 - INFO - __main__ - Step 22199: {'lr': 0.00047737231258507116, 'samples': 4262208, 'steps': 22198, 'loss/train': 1.4578293561935425} 11/07/2021 00:21:35 - INFO - __main__ - Step 22200: {'lr': 0.00047737010637460934, 'samples': 4262400, 'steps': 22199, 'loss/train': 1.3843269348144531} 11/07/2021 00:21:36 - INFO - __main__ - Step 22201: {'lr': 0.00047736790006169794, 'samples': 4262592, 'steps': 22200, 'loss/train': 1.5156464576721191} 11/07/2021 00:21:36 - INFO - __main__ - Step 22202: {'lr': 0.00047736569364633817, 'samples': 4262784, 'steps': 22201, 'loss/train': 2.108410358428955} 11/07/2021 00:21:37 - INFO - __main__ - Step 22203: {'lr': 0.00047736348712853094, 'samples': 4262976, 'steps': 22202, 'loss/train': 1.5299713611602783} 11/07/2021 00:21:37 - INFO - __main__ - Step 22204: {'lr': 0.0004773612805082772, 'samples': 4263168, 'steps': 22203, 'loss/train': 1.304317831993103} 11/07/2021 00:21:38 - INFO - __main__ - Step 22205: {'lr': 0.000477359073785578, 'samples': 4263360, 'steps': 22204, 'loss/train': 1.8367857933044434} 11/07/2021 00:21:38 - INFO - __main__ - Step 22206: {'lr': 0.00047735686696043434, 'samples': 4263552, 'steps': 22205, 'loss/train': 0.7247116565704346} 11/07/2021 00:21:38 - INFO - __main__ - Step 22207: {'lr': 0.0004773546600328471, 'samples': 4263744, 'steps': 22206, 'loss/train': 1.6162457466125488} 11/07/2021 00:21:39 - INFO - __main__ - Step 22208: {'lr': 0.00047735245300281745, 'samples': 4263936, 'steps': 22207, 'loss/train': 1.498029351234436} 11/07/2021 00:21:40 - INFO - __main__ - Step 22209: {'lr': 0.00047735024587034625, 'samples': 4264128, 'steps': 22208, 'loss/train': 2.190403699874878} 11/07/2021 00:21:40 - INFO - __main__ - Step 22210: {'lr': 0.00047734803863543453, 'samples': 4264320, 'steps': 22209, 'loss/train': 1.4946469068527222} 11/07/2021 00:21:40 - INFO - __main__ - Step 22211: {'lr': 0.00047734583129808327, 'samples': 4264512, 'steps': 22210, 'loss/train': 1.5364080667495728} 11/07/2021 00:21:41 - INFO - __main__ - Step 22212: {'lr': 0.00047734362385829356, 'samples': 4264704, 'steps': 22211, 'loss/train': 2.015793561935425} 11/07/2021 00:21:42 - INFO - __main__ - Step 22213: {'lr': 0.0004773414163160662, 'samples': 4264896, 'steps': 22212, 'loss/train': 1.358203649520874} 11/07/2021 00:21:42 - INFO - __main__ - Step 22214: {'lr': 0.00047733920867140244, 'samples': 4265088, 'steps': 22213, 'loss/train': 0.5409196615219116} 11/07/2021 00:21:43 - INFO - __main__ - Step 22215: {'lr': 0.00047733700092430305, 'samples': 4265280, 'steps': 22214, 'loss/train': 1.655433177947998} 11/07/2021 00:21:43 - INFO - __main__ - Step 22216: {'lr': 0.0004773347930747691, 'samples': 4265472, 'steps': 22215, 'loss/train': 1.6212674379348755} 11/07/2021 00:21:43 - INFO - __main__ - Step 22217: {'lr': 0.0004773325851228017, 'samples': 4265664, 'steps': 22216, 'loss/train': 1.560600757598877} 11/07/2021 00:21:44 - INFO - __main__ - Step 22218: {'lr': 0.00047733037706840166, 'samples': 4265856, 'steps': 22217, 'loss/train': 1.4888997077941895} 11/07/2021 00:21:45 - INFO - __main__ - Step 22219: {'lr': 0.0004773281689115701, 'samples': 4266048, 'steps': 22218, 'loss/train': 1.8526577949523926} 11/07/2021 00:21:45 - INFO - __main__ - Step 22220: {'lr': 0.000477325960652308, 'samples': 4266240, 'steps': 22219, 'loss/train': 1.9920580387115479} 11/07/2021 00:21:45 - INFO - __main__ - Step 22221: {'lr': 0.0004773237522906163, 'samples': 4266432, 'steps': 22220, 'loss/train': 1.6304432153701782} 11/07/2021 00:21:46 - INFO - __main__ - Step 22222: {'lr': 0.000477321543826496, 'samples': 4266624, 'steps': 22221, 'loss/train': 1.4741657972335815} 11/07/2021 00:21:46 - INFO - __main__ - Step 22223: {'lr': 0.00047731933525994814, 'samples': 4266816, 'steps': 22222, 'loss/train': 0.9544246196746826} 11/07/2021 00:21:47 - INFO - __main__ - Step 22224: {'lr': 0.0004773171265909737, 'samples': 4267008, 'steps': 22223, 'loss/train': 1.4136073589324951} 11/07/2021 00:21:48 - INFO - __main__ - Step 22225: {'lr': 0.00047731491781957366, 'samples': 4267200, 'steps': 22224, 'loss/train': 1.588415503501892} 11/07/2021 00:21:48 - INFO - __main__ - Step 22226: {'lr': 0.0004773127089457491, 'samples': 4267392, 'steps': 22225, 'loss/train': 1.265699863433838} 11/07/2021 00:21:48 - INFO - __main__ - Step 22227: {'lr': 0.0004773104999695008, 'samples': 4267584, 'steps': 22226, 'loss/train': 1.5060862302780151} 11/07/2021 00:21:49 - INFO - __main__ - Step 22228: {'lr': 0.00047730829089082994, 'samples': 4267776, 'steps': 22227, 'loss/train': 1.5712088346481323} 11/07/2021 00:21:50 - INFO - __main__ - Step 22229: {'lr': 0.00047730608170973754, 'samples': 4267968, 'steps': 22228, 'loss/train': 1.1024203300476074} 11/07/2021 00:21:50 - INFO - __main__ - Step 22230: {'lr': 0.00047730387242622446, 'samples': 4268160, 'steps': 22229, 'loss/train': 1.0003269910812378} 11/07/2021 00:21:51 - INFO - __main__ - Step 22231: {'lr': 0.00047730166304029185, 'samples': 4268352, 'steps': 22230, 'loss/train': 1.664233684539795} 11/07/2021 00:21:51 - INFO - __main__ - Step 22232: {'lr': 0.0004772994535519405, 'samples': 4268544, 'steps': 22231, 'loss/train': 1.1563791036605835} 11/07/2021 00:21:51 - INFO - __main__ - Step 22233: {'lr': 0.0004772972439611716, 'samples': 4268736, 'steps': 22232, 'loss/train': 1.4765862226486206} 11/07/2021 00:21:52 - INFO - __main__ - Step 22234: {'lr': 0.00047729503426798605, 'samples': 4268928, 'steps': 22233, 'loss/train': 1.6917579174041748} 11/07/2021 00:21:53 - INFO - __main__ - Step 22235: {'lr': 0.0004772928244723849, 'samples': 4269120, 'steps': 22234, 'loss/train': 1.5444846153259277} 11/07/2021 00:21:53 - INFO - __main__ - Step 22236: {'lr': 0.00047729061457436905, 'samples': 4269312, 'steps': 22235, 'loss/train': 1.9563812017440796} 11/07/2021 00:21:53 - INFO - __main__ - Step 22237: {'lr': 0.0004772884045739396, 'samples': 4269504, 'steps': 22236, 'loss/train': 1.596703052520752} 11/07/2021 00:21:54 - INFO - __main__ - Step 22238: {'lr': 0.0004772861944710974, 'samples': 4269696, 'steps': 22237, 'loss/train': 0.26425468921661377} 11/07/2021 00:21:55 - INFO - __main__ - Step 22239: {'lr': 0.00047728398426584375, 'samples': 4269888, 'steps': 22238, 'loss/train': 1.3304921388626099} 11/07/2021 00:21:55 - INFO - __main__ - Step 22240: {'lr': 0.0004772817739581793, 'samples': 4270080, 'steps': 22239, 'loss/train': 1.5881335735321045} 11/07/2021 00:21:55 - INFO - __main__ - Step 22241: {'lr': 0.0004772795635481052, 'samples': 4270272, 'steps': 22240, 'loss/train': 1.3964968919754028} 11/07/2021 00:21:56 - INFO - __main__ - Step 22242: {'lr': 0.00047727735303562246, 'samples': 4270464, 'steps': 22241, 'loss/train': 1.6961921453475952} 11/07/2021 00:21:56 - INFO - __main__ - Step 22243: {'lr': 0.000477275142420732, 'samples': 4270656, 'steps': 22242, 'loss/train': 0.41618475317955017} 11/07/2021 00:21:57 - INFO - __main__ - Step 22244: {'lr': 0.000477272931703435, 'samples': 4270848, 'steps': 22243, 'loss/train': 1.7850244045257568} 11/07/2021 00:21:58 - INFO - __main__ - Step 22245: {'lr': 0.0004772707208837322, 'samples': 4271040, 'steps': 22244, 'loss/train': 1.7638630867004395} 11/07/2021 00:21:58 - INFO - __main__ - Step 22246: {'lr': 0.0004772685099616247, 'samples': 4271232, 'steps': 22245, 'loss/train': 1.2982248067855835} 11/07/2021 00:21:58 - INFO - __main__ - Step 22247: {'lr': 0.0004772662989371136, 'samples': 4271424, 'steps': 22246, 'loss/train': 1.7262928485870361} 11/07/2021 00:21:59 - INFO - __main__ - Step 22248: {'lr': 0.0004772640878101998, 'samples': 4271616, 'steps': 22247, 'loss/train': 1.7095729112625122} 11/07/2021 00:22:00 - INFO - __main__ - Step 22249: {'lr': 0.00047726187658088425, 'samples': 4271808, 'steps': 22248, 'loss/train': 1.4458047151565552} 11/07/2021 00:22:00 - INFO - __main__ - Step 22250: {'lr': 0.0004772596652491681, 'samples': 4272000, 'steps': 22249, 'loss/train': 1.6028023958206177} 11/07/2021 00:22:00 - INFO - __main__ - Step 22251: {'lr': 0.0004772574538150522, 'samples': 4272192, 'steps': 22250, 'loss/train': 2.14910888671875} 11/07/2021 00:22:01 - INFO - __main__ - Step 22252: {'lr': 0.0004772552422785376, 'samples': 4272384, 'steps': 22251, 'loss/train': 1.52297043800354} 11/07/2021 00:22:01 - INFO - __main__ - Step 22253: {'lr': 0.00047725303063962535, 'samples': 4272576, 'steps': 22252, 'loss/train': 1.625970721244812} 11/07/2021 00:22:02 - INFO - __main__ - Step 22254: {'lr': 0.00047725081889831626, 'samples': 4272768, 'steps': 22253, 'loss/train': 0.8719094395637512} 11/07/2021 00:22:03 - INFO - __main__ - Step 22255: {'lr': 0.0004772486070546116, 'samples': 4272960, 'steps': 22254, 'loss/train': 1.5029946565628052} 11/07/2021 00:22:03 - INFO - __main__ - Step 22256: {'lr': 0.0004772463951085121, 'samples': 4273152, 'steps': 22255, 'loss/train': 1.6506491899490356} 11/07/2021 00:22:03 - INFO - __main__ - Step 22257: {'lr': 0.00047724418306001895, 'samples': 4273344, 'steps': 22256, 'loss/train': 1.6396294832229614} 11/07/2021 00:22:04 - INFO - __main__ - Step 22258: {'lr': 0.0004772419709091331, 'samples': 4273536, 'steps': 22257, 'loss/train': 1.0318959951400757} 11/07/2021 00:22:04 - INFO - __main__ - Step 22259: {'lr': 0.00047723975865585544, 'samples': 4273728, 'steps': 22258, 'loss/train': 0.9595012068748474} 11/07/2021 00:22:05 - INFO - __main__ - Step 22260: {'lr': 0.00047723754630018715, 'samples': 4273920, 'steps': 22259, 'loss/train': 1.636866569519043} 11/07/2021 00:22:06 - INFO - __main__ - Step 22261: {'lr': 0.000477235333842129, 'samples': 4274112, 'steps': 22260, 'loss/train': 1.8469388484954834} 11/07/2021 00:22:06 - INFO - __main__ - Step 22262: {'lr': 0.00047723312128168226, 'samples': 4274304, 'steps': 22261, 'loss/train': 1.572001576423645} 11/07/2021 00:22:06 - INFO - __main__ - Step 22263: {'lr': 0.00047723090861884773, 'samples': 4274496, 'steps': 22262, 'loss/train': 1.3939660787582397} 11/07/2021 00:22:07 - INFO - __main__ - Step 22264: {'lr': 0.00047722869585362646, 'samples': 4274688, 'steps': 22263, 'loss/train': 1.5644506216049194} 11/07/2021 00:22:08 - INFO - __main__ - Step 22265: {'lr': 0.0004772264829860194, 'samples': 4274880, 'steps': 22264, 'loss/train': 1.638916254043579} 11/07/2021 00:22:08 - INFO - __main__ - Step 22266: {'lr': 0.00047722427001602765, 'samples': 4275072, 'steps': 22265, 'loss/train': 1.7917464971542358} 11/07/2021 00:22:08 - INFO - __main__ - Step 22267: {'lr': 0.0004772220569436521, 'samples': 4275264, 'steps': 22266, 'loss/train': 1.3653619289398193} 11/07/2021 00:22:09 - INFO - __main__ - Step 22268: {'lr': 0.0004772198437688938, 'samples': 4275456, 'steps': 22267, 'loss/train': 1.468030571937561} 11/07/2021 00:22:09 - INFO - __main__ - Step 22269: {'lr': 0.0004772176304917538, 'samples': 4275648, 'steps': 22268, 'loss/train': 1.6017370223999023} 11/07/2021 00:22:10 - INFO - __main__ - Step 22270: {'lr': 0.00047721541711223306, 'samples': 4275840, 'steps': 22269, 'loss/train': 1.4185051918029785} 11/07/2021 00:22:10 - INFO - __main__ - Step 22271: {'lr': 0.00047721320363033247, 'samples': 4276032, 'steps': 22270, 'loss/train': 1.8277918100357056} 11/07/2021 00:22:11 - INFO - __main__ - Step 22272: {'lr': 0.00047721099004605316, 'samples': 4276224, 'steps': 22271, 'loss/train': 1.816880702972412} 11/07/2021 00:22:11 - INFO - __main__ - Step 22273: {'lr': 0.00047720877635939606, 'samples': 4276416, 'steps': 22272, 'loss/train': 1.9079005718231201} 11/07/2021 00:22:12 - INFO - __main__ - Step 22274: {'lr': 0.0004772065625703622, 'samples': 4276608, 'steps': 22273, 'loss/train': 1.729659080505371} 11/07/2021 00:22:12 - INFO - __main__ - Step 22275: {'lr': 0.0004772043486789526, 'samples': 4276800, 'steps': 22274, 'loss/train': 1.4082915782928467} 11/07/2021 00:22:13 - INFO - __main__ - Step 22276: {'lr': 0.0004772021346851682, 'samples': 4276992, 'steps': 22275, 'loss/train': 1.8357642889022827} 11/07/2021 00:22:13 - INFO - __main__ - Step 22277: {'lr': 0.00047719992058901006, 'samples': 4277184, 'steps': 22276, 'loss/train': 1.6172411441802979} 11/07/2021 00:22:13 - INFO - __main__ - Step 22278: {'lr': 0.0004771977063904791, 'samples': 4277376, 'steps': 22277, 'loss/train': 1.0844411849975586} 11/07/2021 00:22:14 - INFO - __main__ - Step 22279: {'lr': 0.00047719549208957636, 'samples': 4277568, 'steps': 22278, 'loss/train': 1.8847882747650146} 11/07/2021 00:22:14 - INFO - __main__ - Step 22280: {'lr': 0.0004771932776863028, 'samples': 4277760, 'steps': 22279, 'loss/train': 1.7806832790374756} 11/07/2021 00:22:15 - INFO - __main__ - Step 22281: {'lr': 0.0004771910631806595, 'samples': 4277952, 'steps': 22280, 'loss/train': 1.5112051963806152} 11/07/2021 00:22:16 - INFO - __main__ - Step 22282: {'lr': 0.00047718884857264745, 'samples': 4278144, 'steps': 22281, 'loss/train': 1.638104796409607} 11/07/2021 00:22:16 - INFO - __main__ - Step 22283: {'lr': 0.0004771866338622676, 'samples': 4278336, 'steps': 22282, 'loss/train': 1.4743871688842773} 11/07/2021 00:22:16 - INFO - __main__ - Step 22284: {'lr': 0.0004771844190495209, 'samples': 4278528, 'steps': 22283, 'loss/train': 1.577344298362732} 11/07/2021 00:22:17 - INFO - __main__ - Step 22285: {'lr': 0.0004771822041344085, 'samples': 4278720, 'steps': 22284, 'loss/train': 1.610754132270813} 11/07/2021 00:22:18 - INFO - __main__ - Step 22286: {'lr': 0.0004771799891169312, 'samples': 4278912, 'steps': 22285, 'loss/train': 1.7802586555480957} 11/07/2021 00:22:18 - INFO - __main__ - Step 22287: {'lr': 0.0004771777739970902, 'samples': 4279104, 'steps': 22286, 'loss/train': 1.3665558099746704} 11/07/2021 00:22:18 - INFO - __main__ - Step 22288: {'lr': 0.0004771755587748863, 'samples': 4279296, 'steps': 22287, 'loss/train': 1.852914571762085} 11/07/2021 00:22:19 - INFO - __main__ - Step 22289: {'lr': 0.00047717334345032065, 'samples': 4279488, 'steps': 22288, 'loss/train': 1.0421020984649658} 11/07/2021 00:22:19 - INFO - __main__ - Step 22290: {'lr': 0.0004771711280233942, 'samples': 4279680, 'steps': 22289, 'loss/train': 1.3669109344482422} 11/07/2021 00:22:20 - INFO - __main__ - Step 22291: {'lr': 0.000477168912494108, 'samples': 4279872, 'steps': 22290, 'loss/train': 1.376916527748108} 11/07/2021 00:22:21 - INFO - __main__ - Step 22292: {'lr': 0.00047716669686246287, 'samples': 4280064, 'steps': 22291, 'loss/train': 0.9953264594078064} 11/07/2021 00:22:21 - INFO - __main__ - Step 22293: {'lr': 0.00047716448112846, 'samples': 4280256, 'steps': 22292, 'loss/train': 1.9123733043670654} 11/07/2021 00:22:21 - INFO - __main__ - Step 22294: {'lr': 0.00047716226529210035, 'samples': 4280448, 'steps': 22293, 'loss/train': 1.8882105350494385} 11/07/2021 00:22:22 - INFO - __main__ - Step 22295: {'lr': 0.00047716004935338484, 'samples': 4280640, 'steps': 22294, 'loss/train': 1.7229362726211548} 11/07/2021 00:22:22 - INFO - __main__ - Step 22296: {'lr': 0.0004771578333123145, 'samples': 4280832, 'steps': 22295, 'loss/train': 0.9274198412895203} 11/07/2021 00:22:23 - INFO - __main__ - Step 22297: {'lr': 0.00047715561716889037, 'samples': 4281024, 'steps': 22296, 'loss/train': 1.4598338603973389} 11/07/2021 00:22:23 - INFO - __main__ - Step 22298: {'lr': 0.0004771534009231134, 'samples': 4281216, 'steps': 22297, 'loss/train': 1.5091946125030518} 11/07/2021 00:22:24 - INFO - __main__ - Step 22299: {'lr': 0.00047715118457498473, 'samples': 4281408, 'steps': 22298, 'loss/train': 1.5219142436981201} 11/07/2021 00:22:24 - INFO - __main__ - Step 22300: {'lr': 0.00047714896812450514, 'samples': 4281600, 'steps': 22299, 'loss/train': 1.509248971939087} 11/07/2021 00:22:25 - INFO - __main__ - Step 22301: {'lr': 0.00047714675157167573, 'samples': 4281792, 'steps': 22300, 'loss/train': 1.7630727291107178} 11/07/2021 00:22:25 - INFO - __main__ - Step 22302: {'lr': 0.00047714453491649753, 'samples': 4281984, 'steps': 22301, 'loss/train': 1.8778845071792603} 11/07/2021 00:22:26 - INFO - __main__ - Step 22303: {'lr': 0.00047714231815897145, 'samples': 4282176, 'steps': 22302, 'loss/train': 1.7316973209381104} 11/07/2021 00:22:26 - INFO - __main__ - Step 22304: {'lr': 0.0004771401012990986, 'samples': 4282368, 'steps': 22303, 'loss/train': 1.1782406568527222} 11/07/2021 00:22:27 - INFO - __main__ - Step 22305: {'lr': 0.0004771378843368799, 'samples': 4282560, 'steps': 22304, 'loss/train': 1.4086192846298218} 11/07/2021 00:22:27 - INFO - __main__ - Step 22306: {'lr': 0.0004771356672723164, 'samples': 4282752, 'steps': 22305, 'loss/train': 1.1016020774841309} 11/07/2021 00:22:28 - INFO - __main__ - Step 22307: {'lr': 0.0004771334501054091, 'samples': 4282944, 'steps': 22306, 'loss/train': 1.1659605503082275} 11/07/2021 00:22:28 - INFO - __main__ - Step 22308: {'lr': 0.0004771312328361589, 'samples': 4283136, 'steps': 22307, 'loss/train': 1.5280841588974} 11/07/2021 00:22:29 - INFO - __main__ - Step 22309: {'lr': 0.0004771290154645669, 'samples': 4283328, 'steps': 22308, 'loss/train': 1.160269021987915} 11/07/2021 00:22:29 - INFO - __main__ - Step 22310: {'lr': 0.0004771267979906341, 'samples': 4283520, 'steps': 22309, 'loss/train': 1.9466007947921753} 11/07/2021 00:22:29 - INFO - __main__ - Step 22311: {'lr': 0.0004771245804143615, 'samples': 4283712, 'steps': 22310, 'loss/train': 1.7830595970153809} 11/07/2021 00:22:30 - INFO - __main__ - Step 22312: {'lr': 0.00047712236273574993, 'samples': 4283904, 'steps': 22311, 'loss/train': 1.239105463027954} 11/07/2021 00:22:31 - INFO - __main__ - Step 22313: {'lr': 0.0004771201449548006, 'samples': 4284096, 'steps': 22312, 'loss/train': 1.8768774271011353} 11/07/2021 00:22:31 - INFO - __main__ - Step 22314: {'lr': 0.0004771179270715145, 'samples': 4284288, 'steps': 22313, 'loss/train': 0.8931715488433838} 11/07/2021 00:22:32 - INFO - __main__ - Step 22315: {'lr': 0.0004771157090858925, 'samples': 4284480, 'steps': 22314, 'loss/train': 1.2766802310943604} 11/07/2021 00:22:32 - INFO - __main__ - Step 22316: {'lr': 0.00047711349099793565, 'samples': 4284672, 'steps': 22315, 'loss/train': 1.6245396137237549} 11/07/2021 00:22:33 - INFO - __main__ - Step 22317: {'lr': 0.00047711127280764497, 'samples': 4284864, 'steps': 22316, 'loss/train': 0.9728535413742065} 11/07/2021 00:22:33 - INFO - __main__ - Step 22318: {'lr': 0.0004771090545150215, 'samples': 4285056, 'steps': 22317, 'loss/train': 2.4875779151916504} 11/07/2021 00:22:34 - INFO - __main__ - Step 22319: {'lr': 0.00047710683612006623, 'samples': 4285248, 'steps': 22318, 'loss/train': 1.8442782163619995} 11/07/2021 00:22:34 - INFO - __main__ - Step 22320: {'lr': 0.00047710461762278, 'samples': 4285440, 'steps': 22319, 'loss/train': 1.5267730951309204} 11/07/2021 00:22:34 - INFO - __main__ - Step 22321: {'lr': 0.00047710239902316404, 'samples': 4285632, 'steps': 22320, 'loss/train': 1.1445330381393433} 11/07/2021 00:22:35 - INFO - __main__ - Step 22322: {'lr': 0.0004771001803212192, 'samples': 4285824, 'steps': 22321, 'loss/train': 1.614621877670288} 11/07/2021 00:22:36 - INFO - __main__ - Step 22323: {'lr': 0.0004770979615169466, 'samples': 4286016, 'steps': 22322, 'loss/train': 1.7761956453323364} 11/07/2021 00:22:36 - INFO - __main__ - Step 22324: {'lr': 0.00047709574261034705, 'samples': 4286208, 'steps': 22323, 'loss/train': 1.923969030380249} 11/07/2021 00:22:36 - INFO - __main__ - Step 22325: {'lr': 0.0004770935236014217, 'samples': 4286400, 'steps': 22324, 'loss/train': 1.6623705625534058} 11/07/2021 00:22:37 - INFO - __main__ - Step 22326: {'lr': 0.00047709130449017154, 'samples': 4286592, 'steps': 22325, 'loss/train': 1.6310282945632935} 11/07/2021 00:22:37 - INFO - __main__ - Step 22327: {'lr': 0.0004770890852765975, 'samples': 4286784, 'steps': 22326, 'loss/train': 1.615903377532959} 11/07/2021 00:22:38 - INFO - __main__ - Step 22328: {'lr': 0.00047708686596070065, 'samples': 4286976, 'steps': 22327, 'loss/train': 1.5300124883651733} 11/07/2021 00:22:38 - INFO - __main__ - Step 22329: {'lr': 0.00047708464654248195, 'samples': 4287168, 'steps': 22328, 'loss/train': 1.8082314729690552} 11/07/2021 00:22:39 - INFO - __main__ - Step 22330: {'lr': 0.0004770824270219424, 'samples': 4287360, 'steps': 22329, 'loss/train': 1.3073426485061646} 11/07/2021 00:22:39 - INFO - __main__ - Step 22331: {'lr': 0.0004770802073990831, 'samples': 4287552, 'steps': 22330, 'loss/train': 1.7682299613952637} 11/07/2021 00:22:40 - INFO - __main__ - Step 22332: {'lr': 0.00047707798767390486, 'samples': 4287744, 'steps': 22331, 'loss/train': 1.6708950996398926} 11/07/2021 00:22:41 - INFO - __main__ - Step 22333: {'lr': 0.00047707576784640883, 'samples': 4287936, 'steps': 22332, 'loss/train': 1.608047604560852} 11/07/2021 00:22:41 - INFO - __main__ - Step 22334: {'lr': 0.00047707354791659594, 'samples': 4288128, 'steps': 22333, 'loss/train': 2.013827085494995} 11/07/2021 00:22:41 - INFO - __main__ - Step 22335: {'lr': 0.0004770713278844672, 'samples': 4288320, 'steps': 22334, 'loss/train': 1.0852205753326416} 11/07/2021 00:22:42 - INFO - __main__ - Step 22336: {'lr': 0.00047706910775002363, 'samples': 4288512, 'steps': 22335, 'loss/train': 1.5838872194290161} 11/07/2021 00:22:42 - INFO - __main__ - Step 22337: {'lr': 0.0004770668875132663, 'samples': 4288704, 'steps': 22336, 'loss/train': 1.68880033493042} 11/07/2021 00:22:43 - INFO - __main__ - Step 22338: {'lr': 0.00047706466717419607, 'samples': 4288896, 'steps': 22337, 'loss/train': 1.2144722938537598} 11/07/2021 00:22:43 - INFO - __main__ - Step 22339: {'lr': 0.000477062446732814, 'samples': 4289088, 'steps': 22338, 'loss/train': 1.7872453927993774} 11/07/2021 00:22:44 - INFO - __main__ - Step 22340: {'lr': 0.0004770602261891211, 'samples': 4289280, 'steps': 22339, 'loss/train': 1.386384129524231} 11/07/2021 00:22:44 - INFO - __main__ - Step 22341: {'lr': 0.00047705800554311836, 'samples': 4289472, 'steps': 22340, 'loss/train': 1.812485694885254} 11/07/2021 00:22:45 - INFO - __main__ - Step 22342: {'lr': 0.0004770557847948068, 'samples': 4289664, 'steps': 22341, 'loss/train': 1.3352690935134888} 11/07/2021 00:22:45 - INFO - __main__ - Step 22343: {'lr': 0.0004770535639441874, 'samples': 4289856, 'steps': 22342, 'loss/train': 1.8325512409210205} 11/07/2021 00:22:46 - INFO - __main__ - Step 22344: {'lr': 0.0004770513429912612, 'samples': 4290048, 'steps': 22343, 'loss/train': 1.429988145828247} 11/07/2021 00:22:46 - INFO - __main__ - Step 22345: {'lr': 0.0004770491219360291, 'samples': 4290240, 'steps': 22344, 'loss/train': 1.3341028690338135} 11/07/2021 00:22:47 - INFO - __main__ - Step 22346: {'lr': 0.00047704690077849223, 'samples': 4290432, 'steps': 22345, 'loss/train': 1.5357767343521118} 11/07/2021 00:22:47 - INFO - __main__ - Step 22347: {'lr': 0.0004770446795186515, 'samples': 4290624, 'steps': 22346, 'loss/train': 1.02168869972229} 11/07/2021 00:22:48 - INFO - __main__ - Step 22348: {'lr': 0.0004770424581565079, 'samples': 4290816, 'steps': 22347, 'loss/train': 2.0495290756225586} 11/07/2021 00:22:48 - INFO - __main__ - Step 22349: {'lr': 0.0004770402366920625, 'samples': 4291008, 'steps': 22348, 'loss/train': 1.4317470788955688} 11/07/2021 00:22:49 - INFO - __main__ - Step 22350: {'lr': 0.00047703801512531636, 'samples': 4291200, 'steps': 22349, 'loss/train': 1.5442404747009277} 11/07/2021 00:22:49 - INFO - __main__ - Step 22351: {'lr': 0.00047703579345627036, 'samples': 4291392, 'steps': 22350, 'loss/train': 1.2991282939910889} 11/07/2021 00:22:49 - INFO - __main__ - Step 22352: {'lr': 0.00047703357168492544, 'samples': 4291584, 'steps': 22351, 'loss/train': 1.0072227716445923} 11/07/2021 00:22:50 - INFO - __main__ - Step 22353: {'lr': 0.0004770313498112828, 'samples': 4291776, 'steps': 22352, 'loss/train': 1.7183367013931274} 11/07/2021 00:22:51 - INFO - __main__ - Step 22354: {'lr': 0.0004770291278353433, 'samples': 4291968, 'steps': 22353, 'loss/train': 1.5203161239624023} 11/07/2021 00:22:51 - INFO - __main__ - Step 22355: {'lr': 0.00047702690575710796, 'samples': 4292160, 'steps': 22354, 'loss/train': 1.6376055479049683} 11/07/2021 00:22:51 - INFO - __main__ - Step 22356: {'lr': 0.0004770246835765778, 'samples': 4292352, 'steps': 22355, 'loss/train': 1.6381512880325317} 11/07/2021 00:22:52 - INFO - __main__ - Step 22357: {'lr': 0.0004770224612937538, 'samples': 4292544, 'steps': 22356, 'loss/train': 1.4302995204925537} 11/07/2021 00:22:52 - INFO - __main__ - Step 22358: {'lr': 0.0004770202389086371, 'samples': 4292736, 'steps': 22357, 'loss/train': 1.8325588703155518} 11/07/2021 00:22:53 - INFO - __main__ - Step 22359: {'lr': 0.0004770180164212284, 'samples': 4292928, 'steps': 22358, 'loss/train': 2.1267189979553223} 11/07/2021 00:22:54 - INFO - __main__ - Step 22360: {'lr': 0.00047701579383152906, 'samples': 4293120, 'steps': 22359, 'loss/train': 1.712369680404663} 11/07/2021 00:22:54 - INFO - __main__ - Step 22361: {'lr': 0.0004770135711395398, 'samples': 4293312, 'steps': 22360, 'loss/train': 1.9101619720458984} 11/07/2021 00:22:54 - INFO - __main__ - Step 22362: {'lr': 0.0004770113483452618, 'samples': 4293504, 'steps': 22361, 'loss/train': 1.594948172569275} 11/07/2021 00:22:55 - INFO - __main__ - Step 22363: {'lr': 0.00047700912544869595, 'samples': 4293696, 'steps': 22362, 'loss/train': 0.9005606174468994} 11/07/2021 00:22:56 - INFO - __main__ - Step 22364: {'lr': 0.0004770069024498433, 'samples': 4293888, 'steps': 22363, 'loss/train': 1.134440541267395} 11/07/2021 00:22:56 - INFO - __main__ - Step 22365: {'lr': 0.00047700467934870484, 'samples': 4294080, 'steps': 22364, 'loss/train': 1.7258731126785278} 11/07/2021 00:22:56 - INFO - __main__ - Step 22366: {'lr': 0.0004770024561452816, 'samples': 4294272, 'steps': 22365, 'loss/train': 1.5210816860198975} 11/07/2021 00:22:57 - INFO - __main__ - Step 22367: {'lr': 0.0004770002328395745, 'samples': 4294464, 'steps': 22366, 'loss/train': 1.8009976148605347} 11/07/2021 00:22:57 - INFO - __main__ - Step 22368: {'lr': 0.00047699800943158454, 'samples': 4294656, 'steps': 22367, 'loss/train': 2.5974385738372803} 11/07/2021 00:22:58 - INFO - __main__ - Step 22369: {'lr': 0.0004769957859213129, 'samples': 4294848, 'steps': 22368, 'loss/train': 1.525080680847168} 11/07/2021 00:22:59 - INFO - __main__ - Step 22370: {'lr': 0.00047699356230876047, 'samples': 4295040, 'steps': 22369, 'loss/train': 1.390128254890442} 11/07/2021 00:22:59 - INFO - __main__ - Step 22371: {'lr': 0.0004769913385939282, 'samples': 4295232, 'steps': 22370, 'loss/train': 1.6059892177581787} 11/07/2021 00:22:59 - INFO - __main__ - Step 22372: {'lr': 0.0004769891147768171, 'samples': 4295424, 'steps': 22371, 'loss/train': 1.6354572772979736} 11/07/2021 00:23:00 - INFO - __main__ - Step 22373: {'lr': 0.00047698689085742823, 'samples': 4295616, 'steps': 22372, 'loss/train': 1.6986100673675537} 11/07/2021 00:23:01 - INFO - __main__ - Step 22374: {'lr': 0.00047698466683576256, 'samples': 4295808, 'steps': 22373, 'loss/train': 1.620169997215271} 11/07/2021 00:23:01 - INFO - __main__ - Step 22375: {'lr': 0.0004769824427118211, 'samples': 4296000, 'steps': 22374, 'loss/train': 1.9812889099121094} 11/07/2021 00:23:01 - INFO - __main__ - Step 22376: {'lr': 0.00047698021848560494, 'samples': 4296192, 'steps': 22375, 'loss/train': 1.9996026754379272} 11/07/2021 00:23:02 - INFO - __main__ - Step 22377: {'lr': 0.0004769779941571149, 'samples': 4296384, 'steps': 22376, 'loss/train': 3.221825361251831} 11/07/2021 00:23:02 - INFO - __main__ - Step 22378: {'lr': 0.00047697576972635213, 'samples': 4296576, 'steps': 22377, 'loss/train': 1.418237328529358} 11/07/2021 00:23:02 - INFO - __main__ - Step 22379: {'lr': 0.0004769735451933176, 'samples': 4296768, 'steps': 22378, 'loss/train': 1.2897299528121948} 11/07/2021 00:23:03 - INFO - __main__ - Step 22380: {'lr': 0.0004769713205580122, 'samples': 4296960, 'steps': 22379, 'loss/train': 1.8298044204711914} 11/07/2021 00:23:04 - INFO - __main__ - Step 22381: {'lr': 0.0004769690958204371, 'samples': 4297152, 'steps': 22380, 'loss/train': 1.7279726266860962} 11/07/2021 00:23:04 - INFO - __main__ - Step 22382: {'lr': 0.0004769668709805932, 'samples': 4297344, 'steps': 22381, 'loss/train': 1.6349552869796753} 11/07/2021 00:23:04 - INFO - __main__ - Step 22383: {'lr': 0.0004769646460384816, 'samples': 4297536, 'steps': 22382, 'loss/train': 1.3800374269485474} 11/07/2021 00:23:05 - INFO - __main__ - Step 22384: {'lr': 0.00047696242099410307, 'samples': 4297728, 'steps': 22383, 'loss/train': 1.1722553968429565} 11/07/2021 00:23:06 - INFO - __main__ - Step 22385: {'lr': 0.00047696019584745887, 'samples': 4297920, 'steps': 22384, 'loss/train': 1.6306506395339966} 11/07/2021 00:23:06 - INFO - __main__ - Step 22386: {'lr': 0.00047695797059854996, 'samples': 4298112, 'steps': 22385, 'loss/train': 1.1783595085144043} 11/07/2021 00:23:06 - INFO - __main__ - Step 22387: {'lr': 0.0004769557452473772, 'samples': 4298304, 'steps': 22386, 'loss/train': 1.5198588371276855} 11/07/2021 00:23:07 - INFO - __main__ - Step 22388: {'lr': 0.00047695351979394173, 'samples': 4298496, 'steps': 22387, 'loss/train': 1.2966810464859009} 11/07/2021 00:23:07 - INFO - __main__ - Step 22389: {'lr': 0.00047695129423824454, 'samples': 4298688, 'steps': 22388, 'loss/train': 1.4684990644454956} 11/07/2021 00:23:08 - INFO - __main__ - Step 22390: {'lr': 0.0004769490685802865, 'samples': 4298880, 'steps': 22389, 'loss/train': 1.5988306999206543} 11/07/2021 00:23:09 - INFO - __main__ - Step 22391: {'lr': 0.00047694684282006885, 'samples': 4299072, 'steps': 22390, 'loss/train': 1.4806081056594849} 11/07/2021 00:23:09 - INFO - __main__ - Step 22392: {'lr': 0.00047694461695759236, 'samples': 4299264, 'steps': 22391, 'loss/train': 1.8315117359161377} 11/07/2021 00:23:09 - INFO - __main__ - Step 22393: {'lr': 0.00047694239099285815, 'samples': 4299456, 'steps': 22392, 'loss/train': 1.7604342699050903} 11/07/2021 00:23:10 - INFO - __main__ - Step 22394: {'lr': 0.00047694016492586715, 'samples': 4299648, 'steps': 22393, 'loss/train': 1.3254797458648682} 11/07/2021 00:23:11 - INFO - __main__ - Step 22395: {'lr': 0.0004769379387566205, 'samples': 4299840, 'steps': 22394, 'loss/train': 1.8817399740219116} 11/07/2021 00:23:11 - INFO - __main__ - Step 22396: {'lr': 0.000476935712485119, 'samples': 4300032, 'steps': 22395, 'loss/train': 1.6139127016067505} 11/07/2021 00:23:12 - INFO - __main__ - Step 22397: {'lr': 0.0004769334861113639, 'samples': 4300224, 'steps': 22396, 'loss/train': 1.4538699388504028} 11/07/2021 00:23:12 - INFO - __main__ - Step 22398: {'lr': 0.000476931259635356, 'samples': 4300416, 'steps': 22397, 'loss/train': 1.6450276374816895} 11/07/2021 00:23:12 - INFO - __main__ - Step 22399: {'lr': 0.00047692903305709646, 'samples': 4300608, 'steps': 22398, 'loss/train': 1.532269835472107} 11/07/2021 00:23:13 - INFO - __main__ - Step 22400: {'lr': 0.0004769268063765861, 'samples': 4300800, 'steps': 22399, 'loss/train': 1.9369391202926636} 11/07/2021 00:23:14 - INFO - __main__ - Step 22401: {'lr': 0.00047692457959382605, 'samples': 4300992, 'steps': 22400, 'loss/train': 1.5004935264587402} 11/07/2021 00:23:14 - INFO - __main__ - Step 22402: {'lr': 0.0004769223527088173, 'samples': 4301184, 'steps': 22401, 'loss/train': 1.646162509918213} 11/07/2021 00:23:14 - INFO - __main__ - Step 22403: {'lr': 0.00047692012572156086, 'samples': 4301376, 'steps': 22402, 'loss/train': 1.83078134059906} 11/07/2021 00:23:15 - INFO - __main__ - Step 22404: {'lr': 0.00047691789863205764, 'samples': 4301568, 'steps': 22403, 'loss/train': 2.1965112686157227} 11/07/2021 00:23:15 - INFO - __main__ - Step 22405: {'lr': 0.0004769156714403088, 'samples': 4301760, 'steps': 22404, 'loss/train': 1.1386685371398926} 11/07/2021 00:23:16 - INFO - __main__ - Step 22406: {'lr': 0.0004769134441463152, 'samples': 4301952, 'steps': 22405, 'loss/train': 1.8415265083312988} 11/07/2021 00:23:16 - INFO - __main__ - Step 22407: {'lr': 0.0004769112167500779, 'samples': 4302144, 'steps': 22406, 'loss/train': 1.3537362813949585} 11/07/2021 00:23:17 - INFO - __main__ - Step 22408: {'lr': 0.00047690898925159796, 'samples': 4302336, 'steps': 22407, 'loss/train': 1.3526972532272339} 11/07/2021 00:23:17 - INFO - __main__ - Step 22409: {'lr': 0.0004769067616508763, 'samples': 4302528, 'steps': 22408, 'loss/train': 2.1145200729370117} 11/07/2021 00:23:17 - INFO - __main__ - Step 22410: {'lr': 0.00047690453394791393, 'samples': 4302720, 'steps': 22409, 'loss/train': 1.4619641304016113} 11/07/2021 00:23:18 - INFO - __main__ - Step 22411: {'lr': 0.0004769023061427119, 'samples': 4302912, 'steps': 22410, 'loss/train': 0.3332456052303314} 11/07/2021 00:23:19 - INFO - __main__ - Step 22412: {'lr': 0.0004769000782352713, 'samples': 4303104, 'steps': 22411, 'loss/train': 1.1623334884643555} 11/07/2021 00:23:19 - INFO - __main__ - Step 22413: {'lr': 0.00047689785022559284, 'samples': 4303296, 'steps': 22412, 'loss/train': 1.511031150817871} 11/07/2021 00:23:19 - INFO - __main__ - Step 22414: {'lr': 0.0004768956221136778, 'samples': 4303488, 'steps': 22413, 'loss/train': 1.2305755615234375} 11/07/2021 00:23:20 - INFO - __main__ - Step 22415: {'lr': 0.00047689339389952713, 'samples': 4303680, 'steps': 22414, 'loss/train': 1.4260704517364502} 11/07/2021 00:23:21 - INFO - __main__ - Step 22416: {'lr': 0.0004768911655831417, 'samples': 4303872, 'steps': 22415, 'loss/train': 2.0966074466705322} 11/07/2021 00:23:21 - INFO - __main__ - Step 22417: {'lr': 0.0004768889371645227, 'samples': 4304064, 'steps': 22416, 'loss/train': 1.5773353576660156} 11/07/2021 00:23:22 - INFO - __main__ - Step 22418: {'lr': 0.000476886708643671, 'samples': 4304256, 'steps': 22417, 'loss/train': 1.4869263172149658} 11/07/2021 00:23:22 - INFO - __main__ - Step 22419: {'lr': 0.0004768844800205877, 'samples': 4304448, 'steps': 22418, 'loss/train': 1.8562376499176025} 11/07/2021 00:23:22 - INFO - __main__ - Step 22420: {'lr': 0.0004768822512952737, 'samples': 4304640, 'steps': 22419, 'loss/train': 1.4217239618301392} 11/07/2021 00:23:23 - INFO - __main__ - Step 22421: {'lr': 0.0004768800224677301, 'samples': 4304832, 'steps': 22420, 'loss/train': 1.4588857889175415} 11/07/2021 00:23:24 - INFO - __main__ - Step 22422: {'lr': 0.0004768777935379578, 'samples': 4305024, 'steps': 22421, 'loss/train': 1.9696581363677979} 11/07/2021 00:23:24 - INFO - __main__ - Step 22423: {'lr': 0.0004768755645059579, 'samples': 4305216, 'steps': 22422, 'loss/train': 1.1590843200683594} 11/07/2021 00:23:24 - INFO - __main__ - Step 22424: {'lr': 0.00047687333537173136, 'samples': 4305408, 'steps': 22423, 'loss/train': 1.8532379865646362} 11/07/2021 00:23:25 - INFO - __main__ - Step 22425: {'lr': 0.00047687110613527924, 'samples': 4305600, 'steps': 22424, 'loss/train': 2.046144485473633} 11/07/2021 00:23:26 - INFO - __main__ - Step 22426: {'lr': 0.00047686887679660253, 'samples': 4305792, 'steps': 22425, 'loss/train': 1.6717745065689087} 11/07/2021 00:23:26 - INFO - __main__ - Step 22427: {'lr': 0.0004768666473557021, 'samples': 4305984, 'steps': 22426, 'loss/train': 1.4736900329589844} 11/07/2021 00:23:26 - INFO - __main__ - Step 22428: {'lr': 0.0004768644178125791, 'samples': 4306176, 'steps': 22427, 'loss/train': 2.280658483505249} 11/07/2021 00:23:27 - INFO - __main__ - Step 22429: {'lr': 0.0004768621881672345, 'samples': 4306368, 'steps': 22428, 'loss/train': 1.1368001699447632} 11/07/2021 00:23:27 - INFO - __main__ - Step 22430: {'lr': 0.00047685995841966936, 'samples': 4306560, 'steps': 22429, 'loss/train': 1.5032750368118286} 11/07/2021 00:23:27 - INFO - __main__ - Step 22431: {'lr': 0.0004768577285698845, 'samples': 4306752, 'steps': 22430, 'loss/train': 1.7891005277633667} 11/07/2021 00:23:28 - INFO - __main__ - Step 22432: {'lr': 0.00047685549861788113, 'samples': 4306944, 'steps': 22431, 'loss/train': 1.7036324739456177} 11/07/2021 00:23:29 - INFO - __main__ - Step 22433: {'lr': 0.0004768532685636602, 'samples': 4307136, 'steps': 22432, 'loss/train': 2.235828161239624} 11/07/2021 00:23:29 - INFO - __main__ - Step 22434: {'lr': 0.0004768510384072226, 'samples': 4307328, 'steps': 22433, 'loss/train': 1.9438023567199707} 11/07/2021 00:23:29 - INFO - __main__ - Step 22435: {'lr': 0.0004768488081485695, 'samples': 4307520, 'steps': 22434, 'loss/train': 0.8506284952163696} 11/07/2021 00:23:30 - INFO - __main__ - Step 22436: {'lr': 0.0004768465777877018, 'samples': 4307712, 'steps': 22435, 'loss/train': 1.2240818738937378} 11/07/2021 00:23:31 - INFO - __main__ - Step 22437: {'lr': 0.0004768443473246205, 'samples': 4307904, 'steps': 22436, 'loss/train': 1.3975337743759155} 11/07/2021 00:23:31 - INFO - __main__ - Step 22438: {'lr': 0.00047684211675932665, 'samples': 4308096, 'steps': 22437, 'loss/train': 2.0789754390716553} 11/07/2021 00:23:32 - INFO - __main__ - Step 22439: {'lr': 0.0004768398860918213, 'samples': 4308288, 'steps': 22438, 'loss/train': 1.650249719619751} 11/07/2021 00:23:32 - INFO - __main__ - Step 22440: {'lr': 0.0004768376553221053, 'samples': 4308480, 'steps': 22439, 'loss/train': 1.447034239768982} 11/07/2021 00:23:32 - INFO - __main__ - Step 22441: {'lr': 0.0004768354244501798, 'samples': 4308672, 'steps': 22440, 'loss/train': 1.8617455959320068} 11/07/2021 00:23:33 - INFO - __main__ - Step 22442: {'lr': 0.0004768331934760458, 'samples': 4308864, 'steps': 22441, 'loss/train': 1.5123012065887451} 11/07/2021 00:23:34 - INFO - __main__ - Step 22443: {'lr': 0.00047683096239970423, 'samples': 4309056, 'steps': 22442, 'loss/train': 1.7382694482803345} 11/07/2021 00:23:34 - INFO - __main__ - Step 22444: {'lr': 0.0004768287312211561, 'samples': 4309248, 'steps': 22443, 'loss/train': 1.417901635169983} 11/07/2021 00:23:34 - INFO - __main__ - Step 22445: {'lr': 0.0004768264999404025, 'samples': 4309440, 'steps': 22444, 'loss/train': 1.5879945755004883} 11/07/2021 00:23:35 - INFO - __main__ - Step 22446: {'lr': 0.00047682426855744434, 'samples': 4309632, 'steps': 22445, 'loss/train': 1.4141817092895508} 11/07/2021 00:23:36 - INFO - __main__ - Step 22447: {'lr': 0.00047682203707228264, 'samples': 4309824, 'steps': 22446, 'loss/train': 1.5829259157180786} 11/07/2021 00:23:36 - INFO - __main__ - Step 22448: {'lr': 0.00047681980548491853, 'samples': 4310016, 'steps': 22447, 'loss/train': 1.159238338470459} 11/07/2021 00:23:36 - INFO - __main__ - Step 22449: {'lr': 0.00047681757379535285, 'samples': 4310208, 'steps': 22448, 'loss/train': 1.684480905532837} 11/07/2021 00:23:37 - INFO - __main__ - Step 22450: {'lr': 0.00047681534200358665, 'samples': 4310400, 'steps': 22449, 'loss/train': 1.75339674949646} 11/07/2021 00:23:37 - INFO - __main__ - Step 22451: {'lr': 0.000476813110109621, 'samples': 4310592, 'steps': 22450, 'loss/train': 1.9512276649475098} 11/07/2021 00:23:38 - INFO - __main__ - Step 22452: {'lr': 0.0004768108781134568, 'samples': 4310784, 'steps': 22451, 'loss/train': 1.0854402780532837} 11/07/2021 00:23:38 - INFO - __main__ - Step 22453: {'lr': 0.0004768086460150952, 'samples': 4310976, 'steps': 22452, 'loss/train': 1.3101853132247925} 11/07/2021 00:23:39 - INFO - __main__ - Step 22454: {'lr': 0.00047680641381453703, 'samples': 4311168, 'steps': 22453, 'loss/train': 1.4491902589797974} 11/07/2021 00:23:39 - INFO - __main__ - Step 22455: {'lr': 0.0004768041815117835, 'samples': 4311360, 'steps': 22454, 'loss/train': 1.6310304403305054} 11/07/2021 00:23:40 - INFO - __main__ - Step 22456: {'lr': 0.00047680194910683545, 'samples': 4311552, 'steps': 22455, 'loss/train': 1.4553643465042114} 11/07/2021 00:23:41 - INFO - __main__ - Step 22457: {'lr': 0.0004767997165996939, 'samples': 4311744, 'steps': 22456, 'loss/train': 1.2861876487731934} 11/07/2021 00:23:41 - INFO - __main__ - Step 22458: {'lr': 0.00047679748399035994, 'samples': 4311936, 'steps': 22457, 'loss/train': 1.3340020179748535} 11/07/2021 00:23:41 - INFO - __main__ - Step 22459: {'lr': 0.00047679525127883456, 'samples': 4312128, 'steps': 22458, 'loss/train': 1.6435513496398926} 11/07/2021 00:23:42 - INFO - __main__ - Step 22460: {'lr': 0.0004767930184651187, 'samples': 4312320, 'steps': 22459, 'loss/train': 1.8718630075454712} 11/07/2021 00:23:42 - INFO - __main__ - Step 22461: {'lr': 0.0004767907855492134, 'samples': 4312512, 'steps': 22460, 'loss/train': 2.1382293701171875} 11/07/2021 00:23:43 - INFO - __main__ - Step 22462: {'lr': 0.0004767885525311197, 'samples': 4312704, 'steps': 22461, 'loss/train': 1.6162583827972412} 11/07/2021 00:23:44 - INFO - __main__ - Step 22463: {'lr': 0.0004767863194108386, 'samples': 4312896, 'steps': 22462, 'loss/train': 1.6668907403945923} 11/07/2021 00:23:44 - INFO - __main__ - Step 22464: {'lr': 0.000476784086188371, 'samples': 4313088, 'steps': 22463, 'loss/train': 0.9273306727409363} 11/07/2021 00:23:44 - INFO - __main__ - Step 22465: {'lr': 0.00047678185286371803, 'samples': 4313280, 'steps': 22464, 'loss/train': 1.323159098625183} 11/07/2021 00:23:45 - INFO - __main__ - Step 22466: {'lr': 0.0004767796194368807, 'samples': 4313472, 'steps': 22465, 'loss/train': 1.5647008419036865} 11/07/2021 00:23:45 - INFO - __main__ - Step 22467: {'lr': 0.00047677738590786, 'samples': 4313664, 'steps': 22466, 'loss/train': 0.24449460208415985} 11/07/2021 00:23:46 - INFO - __main__ - Step 22468: {'lr': 0.0004767751522766568, 'samples': 4313856, 'steps': 22467, 'loss/train': 1.7678805589675903} 11/07/2021 00:23:46 - INFO - __main__ - Step 22469: {'lr': 0.00047677291854327224, 'samples': 4314048, 'steps': 22468, 'loss/train': 1.3814095258712769} 11/07/2021 00:23:47 - INFO - __main__ - Step 22470: {'lr': 0.00047677068470770737, 'samples': 4314240, 'steps': 22469, 'loss/train': 1.6994068622589111} 11/07/2021 00:23:47 - INFO - __main__ - Step 22471: {'lr': 0.00047676845076996305, 'samples': 4314432, 'steps': 22470, 'loss/train': 2.423990488052368} 11/07/2021 00:23:47 - INFO - __main__ - Step 22472: {'lr': 0.0004767662167300404, 'samples': 4314624, 'steps': 22471, 'loss/train': 1.3077157735824585} 11/07/2021 00:23:48 - INFO - __main__ - Step 22473: {'lr': 0.0004767639825879404, 'samples': 4314816, 'steps': 22472, 'loss/train': 1.881967544555664} 11/07/2021 00:23:49 - INFO - __main__ - Step 22474: {'lr': 0.000476761748343664, 'samples': 4315008, 'steps': 22473, 'loss/train': 1.6649399995803833} 11/07/2021 00:23:49 - INFO - __main__ - Step 22475: {'lr': 0.00047675951399721235, 'samples': 4315200, 'steps': 22474, 'loss/train': 1.5792977809906006} 11/07/2021 00:23:49 - INFO - __main__ - Step 22476: {'lr': 0.0004767572795485863, 'samples': 4315392, 'steps': 22475, 'loss/train': 1.1965254545211792} 11/07/2021 00:23:50 - INFO - __main__ - Step 22477: {'lr': 0.00047675504499778695, 'samples': 4315584, 'steps': 22476, 'loss/train': 1.6232327222824097} 11/07/2021 00:23:51 - INFO - __main__ - Step 22478: {'lr': 0.0004767528103448152, 'samples': 4315776, 'steps': 22477, 'loss/train': 1.3932678699493408} 11/07/2021 00:23:51 - INFO - __main__ - Step 22479: {'lr': 0.00047675057558967224, 'samples': 4315968, 'steps': 22478, 'loss/train': 1.8550174236297607} 11/07/2021 00:23:52 - INFO - __main__ - Step 22480: {'lr': 0.0004767483407323589, 'samples': 4316160, 'steps': 22479, 'loss/train': 1.5465221405029297} 11/07/2021 00:23:52 - INFO - __main__ - Step 22481: {'lr': 0.00047674610577287625, 'samples': 4316352, 'steps': 22480, 'loss/train': 1.599327802658081} 11/07/2021 00:23:52 - INFO - __main__ - Step 22482: {'lr': 0.00047674387071122536, 'samples': 4316544, 'steps': 22481, 'loss/train': 1.6746630668640137} 11/07/2021 00:23:53 - INFO - __main__ - Step 22483: {'lr': 0.0004767416355474071, 'samples': 4316736, 'steps': 22482, 'loss/train': 1.7197669744491577} 11/07/2021 00:23:54 - INFO - __main__ - Step 22484: {'lr': 0.00047673940028142265, 'samples': 4316928, 'steps': 22483, 'loss/train': 1.0817619562149048} 11/07/2021 00:23:54 - INFO - __main__ - Step 22485: {'lr': 0.0004767371649132729, 'samples': 4317120, 'steps': 22484, 'loss/train': 1.4430047273635864} 11/07/2021 00:23:54 - INFO - __main__ - Step 22486: {'lr': 0.00047673492944295883, 'samples': 4317312, 'steps': 22485, 'loss/train': 1.4995204210281372} 11/07/2021 00:23:55 - INFO - __main__ - Step 22487: {'lr': 0.0004767326938704816, 'samples': 4317504, 'steps': 22486, 'loss/train': 1.2819911241531372} 11/07/2021 00:23:55 - INFO - __main__ - Step 22488: {'lr': 0.00047673045819584197, 'samples': 4317696, 'steps': 22487, 'loss/train': 1.8140474557876587} 11/07/2021 00:23:56 - INFO - __main__ - Step 22489: {'lr': 0.0004767282224190412, 'samples': 4317888, 'steps': 22488, 'loss/train': 1.7236080169677734} 11/07/2021 00:23:56 - INFO - __main__ - Step 22490: {'lr': 0.00047672598654008015, 'samples': 4318080, 'steps': 22489, 'loss/train': 1.273484468460083} 11/07/2021 00:23:57 - INFO - __main__ - Step 22491: {'lr': 0.0004767237505589599, 'samples': 4318272, 'steps': 22490, 'loss/train': 1.61056649684906} 11/07/2021 00:23:57 - INFO - __main__ - Step 22492: {'lr': 0.0004767215144756814, 'samples': 4318464, 'steps': 22491, 'loss/train': 2.750136137008667} 11/07/2021 00:23:57 - INFO - __main__ - Step 22493: {'lr': 0.0004767192782902457, 'samples': 4318656, 'steps': 22492, 'loss/train': 1.574295163154602} 11/07/2021 00:23:58 - INFO - __main__ - Step 22494: {'lr': 0.0004767170420026538, 'samples': 4318848, 'steps': 22493, 'loss/train': 1.4278780221939087} 11/07/2021 00:23:59 - INFO - __main__ - Step 22495: {'lr': 0.0004767148056129067, 'samples': 4319040, 'steps': 22494, 'loss/train': 1.9771298170089722} 11/07/2021 00:23:59 - INFO - __main__ - Step 22496: {'lr': 0.0004767125691210054, 'samples': 4319232, 'steps': 22495, 'loss/train': 1.6492091417312622} 11/07/2021 00:23:59 - INFO - __main__ - Step 22497: {'lr': 0.00047671033252695083, 'samples': 4319424, 'steps': 22496, 'loss/train': 1.8200469017028809} 11/07/2021 00:24:00 - INFO - __main__ - Step 22498: {'lr': 0.0004767080958307442, 'samples': 4319616, 'steps': 22497, 'loss/train': 2.2042977809906006} 11/07/2021 00:24:00 - INFO - __main__ - Step 22499: {'lr': 0.0004767058590323864, 'samples': 4319808, 'steps': 22498, 'loss/train': 1.707726001739502} 11/07/2021 00:24:01 - INFO - __main__ - Step 22500: {'lr': 0.00047670362213187833, 'samples': 4320000, 'steps': 22499, 'loss/train': 1.839163899421692} 11/07/2021 00:24:02 - INFO - __main__ - Step 22501: {'lr': 0.0004767013851292212, 'samples': 4320192, 'steps': 22500, 'loss/train': 1.605943202972412} 11/07/2021 00:24:02 - INFO - __main__ - Step 22502: {'lr': 0.0004766991480244159, 'samples': 4320384, 'steps': 22501, 'loss/train': 1.4277918338775635} 11/07/2021 00:24:02 - INFO - __main__ - Step 22503: {'lr': 0.0004766969108174635, 'samples': 4320576, 'steps': 22502, 'loss/train': 2.018317699432373} 11/07/2021 00:24:03 - INFO - __main__ - Step 22504: {'lr': 0.0004766946735083649, 'samples': 4320768, 'steps': 22503, 'loss/train': 1.634273648262024} 11/07/2021 00:24:04 - INFO - __main__ - Step 22505: {'lr': 0.0004766924360971212, 'samples': 4320960, 'steps': 22504, 'loss/train': 1.7070815563201904} 11/07/2021 00:24:04 - INFO - __main__ - Step 22506: {'lr': 0.00047669019858373343, 'samples': 4321152, 'steps': 22505, 'loss/train': 0.18323098123073578} 11/07/2021 00:24:04 - INFO - __main__ - Step 22507: {'lr': 0.00047668796096820247, 'samples': 4321344, 'steps': 22506, 'loss/train': 1.6749025583267212} 11/07/2021 00:24:05 - INFO - __main__ - Step 22508: {'lr': 0.00047668572325052953, 'samples': 4321536, 'steps': 22507, 'loss/train': 1.3067169189453125} 11/07/2021 00:24:05 - INFO - __main__ - Step 22509: {'lr': 0.00047668348543071536, 'samples': 4321728, 'steps': 22508, 'loss/train': 2.1704883575439453} 11/07/2021 00:24:06 - INFO - __main__ - Step 22510: {'lr': 0.00047668124750876117, 'samples': 4321920, 'steps': 22509, 'loss/train': 0.7687938213348389} 11/07/2021 00:24:07 - INFO - __main__ - Step 22511: {'lr': 0.0004766790094846679, 'samples': 4322112, 'steps': 22510, 'loss/train': 1.2642887830734253} 11/07/2021 00:24:07 - INFO - __main__ - Step 22512: {'lr': 0.0004766767713584367, 'samples': 4322304, 'steps': 22511, 'loss/train': 1.0644493103027344} 11/07/2021 00:24:07 - INFO - __main__ - Step 22513: {'lr': 0.00047667453313006826, 'samples': 4322496, 'steps': 22512, 'loss/train': 1.282457947731018} 11/07/2021 00:24:08 - INFO - __main__ - Step 22514: {'lr': 0.00047667229479956386, 'samples': 4322688, 'steps': 22513, 'loss/train': 1.6759504079818726} 11/07/2021 00:24:09 - INFO - __main__ - Step 22515: {'lr': 0.0004766700563669244, 'samples': 4322880, 'steps': 22514, 'loss/train': 1.3310102224349976} 11/07/2021 00:24:09 - INFO - __main__ - Step 22516: {'lr': 0.0004766678178321509, 'samples': 4323072, 'steps': 22515, 'loss/train': 1.6661882400512695} 11/07/2021 00:24:09 - INFO - __main__ - Step 22517: {'lr': 0.0004766655791952444, 'samples': 4323264, 'steps': 22516, 'loss/train': 1.630807876586914} 11/07/2021 00:24:10 - INFO - __main__ - Step 22518: {'lr': 0.0004766633404562059, 'samples': 4323456, 'steps': 22517, 'loss/train': 1.046881079673767} 11/07/2021 00:24:10 - INFO - __main__ - Step 22519: {'lr': 0.0004766611016150364, 'samples': 4323648, 'steps': 22518, 'loss/train': 1.5486303567886353} 11/07/2021 00:24:11 - INFO - __main__ - Step 22520: {'lr': 0.00047665886267173686, 'samples': 4323840, 'steps': 22519, 'loss/train': 1.411068320274353} 11/07/2021 00:24:11 - INFO - __main__ - Step 22521: {'lr': 0.00047665662362630836, 'samples': 4324032, 'steps': 22520, 'loss/train': 1.875470519065857} 11/07/2021 00:24:12 - INFO - __main__ - Step 22522: {'lr': 0.00047665438447875186, 'samples': 4324224, 'steps': 22521, 'loss/train': 1.4796826839447021} 11/07/2021 00:24:12 - INFO - __main__ - Step 22523: {'lr': 0.0004766521452290684, 'samples': 4324416, 'steps': 22522, 'loss/train': 1.4865388870239258} 11/07/2021 00:24:12 - INFO - __main__ - Step 22524: {'lr': 0.00047664990587725905, 'samples': 4324608, 'steps': 22523, 'loss/train': 2.097989320755005} 11/07/2021 00:24:13 - INFO - __main__ - Step 22525: {'lr': 0.0004766476664233247, 'samples': 4324800, 'steps': 22524, 'loss/train': 1.7296922206878662} 11/07/2021 00:24:14 - INFO - __main__ - Step 22526: {'lr': 0.0004766454268672664, 'samples': 4324992, 'steps': 22525, 'loss/train': 1.8197253942489624} 11/07/2021 00:24:14 - INFO - __main__ - Step 22527: {'lr': 0.00047664318720908516, 'samples': 4325184, 'steps': 22526, 'loss/train': 1.9192566871643066} 11/07/2021 00:24:15 - INFO - __main__ - Step 22528: {'lr': 0.000476640947448782, 'samples': 4325376, 'steps': 22527, 'loss/train': 1.9041056632995605} 11/07/2021 00:24:15 - INFO - __main__ - Step 22529: {'lr': 0.000476638707586358, 'samples': 4325568, 'steps': 22528, 'loss/train': 1.7667334079742432} 11/07/2021 00:24:15 - INFO - __main__ - Step 22530: {'lr': 0.000476636467621814, 'samples': 4325760, 'steps': 22529, 'loss/train': 1.750614881515503} 11/07/2021 00:24:16 - INFO - __main__ - Step 22531: {'lr': 0.00047663422755515113, 'samples': 4325952, 'steps': 22530, 'loss/train': 1.5936520099639893} 11/07/2021 00:24:17 - INFO - __main__ - Step 22532: {'lr': 0.00047663198738637035, 'samples': 4326144, 'steps': 22531, 'loss/train': 1.118178367614746} 11/07/2021 00:24:17 - INFO - __main__ - Step 22533: {'lr': 0.00047662974711547274, 'samples': 4326336, 'steps': 22532, 'loss/train': 1.724767804145813} 11/07/2021 00:24:17 - INFO - __main__ - Step 22534: {'lr': 0.0004766275067424593, 'samples': 4326528, 'steps': 22533, 'loss/train': 2.0930192470550537} 11/07/2021 00:24:18 - INFO - __main__ - Step 22535: {'lr': 0.0004766252662673309, 'samples': 4326720, 'steps': 22534, 'loss/train': 1.5219659805297852} 11/07/2021 00:24:19 - INFO - __main__ - Step 22536: {'lr': 0.0004766230256900887, 'samples': 4326912, 'steps': 22535, 'loss/train': 2.0084965229034424} 11/07/2021 00:24:19 - INFO - __main__ - Step 22537: {'lr': 0.0004766207850107337, 'samples': 4327104, 'steps': 22536, 'loss/train': 2.0028364658355713} 11/07/2021 00:24:19 - INFO - __main__ - Step 22538: {'lr': 0.00047661854422926674, 'samples': 4327296, 'steps': 22537, 'loss/train': 1.4535870552062988} 11/07/2021 00:24:20 - INFO - __main__ - Step 22539: {'lr': 0.0004766163033456891, 'samples': 4327488, 'steps': 22538, 'loss/train': 1.8392276763916016} 11/07/2021 00:24:20 - INFO - __main__ - Step 22540: {'lr': 0.0004766140623600016, 'samples': 4327680, 'steps': 22539, 'loss/train': 1.4498977661132812} 11/07/2021 00:24:21 - INFO - __main__ - Step 22541: {'lr': 0.0004766118212722053, 'samples': 4327872, 'steps': 22540, 'loss/train': 1.1951019763946533} 11/07/2021 00:24:22 - INFO - __main__ - Step 22542: {'lr': 0.0004766095800823013, 'samples': 4328064, 'steps': 22541, 'loss/train': 1.4123594760894775} 11/07/2021 00:24:22 - INFO - __main__ - Step 22543: {'lr': 0.0004766073387902904, 'samples': 4328256, 'steps': 22542, 'loss/train': 1.3626255989074707} 11/07/2021 00:24:22 - INFO - __main__ - Step 22544: {'lr': 0.00047660509739617376, 'samples': 4328448, 'steps': 22543, 'loss/train': 1.6323630809783936} 11/07/2021 00:24:23 - INFO - __main__ - Step 22545: {'lr': 0.00047660285589995233, 'samples': 4328640, 'steps': 22544, 'loss/train': 1.4572019577026367} 11/07/2021 00:24:24 - INFO - __main__ - Step 22546: {'lr': 0.0004766006143016272, 'samples': 4328832, 'steps': 22545, 'loss/train': 1.5938409566879272} 11/07/2021 00:24:24 - INFO - __main__ - Step 22547: {'lr': 0.0004765983726011993, 'samples': 4329024, 'steps': 22546, 'loss/train': 1.2399224042892456} 11/07/2021 00:24:24 - INFO - __main__ - Step 22548: {'lr': 0.0004765961307986697, 'samples': 4329216, 'steps': 22547, 'loss/train': 1.8739992380142212} 11/07/2021 00:24:25 - INFO - __main__ - Step 22549: {'lr': 0.0004765938888940393, 'samples': 4329408, 'steps': 22548, 'loss/train': 1.5949848890304565} 11/07/2021 00:24:25 - INFO - __main__ - Step 22550: {'lr': 0.00047659164688730935, 'samples': 4329600, 'steps': 22549, 'loss/train': 1.728060007095337} 11/07/2021 00:24:26 - INFO - __main__ - Step 22551: {'lr': 0.00047658940477848056, 'samples': 4329792, 'steps': 22550, 'loss/train': 1.4984592199325562} 11/07/2021 00:24:26 - INFO - __main__ - Step 22552: {'lr': 0.00047658716256755414, 'samples': 4329984, 'steps': 22551, 'loss/train': 1.639998435974121} 11/07/2021 00:24:27 - INFO - __main__ - Step 22553: {'lr': 0.00047658492025453106, 'samples': 4330176, 'steps': 22552, 'loss/train': 1.548398494720459} 11/07/2021 00:24:27 - INFO - __main__ - Step 22554: {'lr': 0.00047658267783941223, 'samples': 4330368, 'steps': 22553, 'loss/train': 1.8560858964920044} 11/07/2021 00:24:27 - INFO - __main__ - Step 22555: {'lr': 0.0004765804353221988, 'samples': 4330560, 'steps': 22554, 'loss/train': 1.5847618579864502} 11/07/2021 00:24:29 - INFO - __main__ - Step 22556: {'lr': 0.0004765781927028917, 'samples': 4330752, 'steps': 22555, 'loss/train': 1.6546162366867065} 11/07/2021 00:24:29 - INFO - __main__ - Step 22557: {'lr': 0.000476575949981492, 'samples': 4330944, 'steps': 22556, 'loss/train': 1.2733675241470337} 11/07/2021 00:24:30 - INFO - __main__ - Step 22558: {'lr': 0.00047657370715800066, 'samples': 4331136, 'steps': 22557, 'loss/train': 1.3643406629562378} 11/07/2021 00:24:30 - INFO - __main__ - Step 22559: {'lr': 0.0004765714642324187, 'samples': 4331328, 'steps': 22558, 'loss/train': 1.1664093732833862} 11/07/2021 00:24:30 - INFO - __main__ - Step 22560: {'lr': 0.0004765692212047471, 'samples': 4331520, 'steps': 22559, 'loss/train': 1.8109169006347656} 11/07/2021 00:24:31 - INFO - __main__ - Step 22561: {'lr': 0.00047656697807498693, 'samples': 4331712, 'steps': 22560, 'loss/train': 0.23743750154972076} 11/07/2021 00:24:32 - INFO - __main__ - Step 22562: {'lr': 0.0004765647348431392, 'samples': 4331904, 'steps': 22561, 'loss/train': 1.1988426446914673} 11/07/2021 00:24:32 - INFO - __main__ - Step 22563: {'lr': 0.00047656249150920485, 'samples': 4332096, 'steps': 22562, 'loss/train': 1.4141286611557007} 11/07/2021 00:24:32 - INFO - __main__ - Step 22564: {'lr': 0.000476560248073185, 'samples': 4332288, 'steps': 22563, 'loss/train': 2.05336856842041} 11/07/2021 00:24:33 - INFO - __main__ - Step 22565: {'lr': 0.0004765580045350805, 'samples': 4332480, 'steps': 22564, 'loss/train': 1.3348045349121094} 11/07/2021 00:24:33 - INFO - __main__ - Step 22566: {'lr': 0.00047655576089489254, 'samples': 4332672, 'steps': 22565, 'loss/train': 1.850450038909912} 11/07/2021 00:24:34 - INFO - __main__ - Step 22567: {'lr': 0.00047655351715262205, 'samples': 4332864, 'steps': 22566, 'loss/train': 2.1595187187194824} 11/07/2021 00:24:34 - INFO - __main__ - Step 22568: {'lr': 0.00047655127330827, 'samples': 4333056, 'steps': 22567, 'loss/train': 1.5340650081634521} 11/07/2021 00:24:35 - INFO - __main__ - Step 22569: {'lr': 0.00047654902936183745, 'samples': 4333248, 'steps': 22568, 'loss/train': 1.7308131456375122} 11/07/2021 00:24:35 - INFO - __main__ - Step 22570: {'lr': 0.00047654678531332544, 'samples': 4333440, 'steps': 22569, 'loss/train': 1.0381836891174316} 11/07/2021 00:24:36 - INFO - __main__ - Step 22571: {'lr': 0.00047654454116273493, 'samples': 4333632, 'steps': 22570, 'loss/train': 1.6245771646499634} 11/07/2021 00:24:37 - INFO - __main__ - Step 22572: {'lr': 0.0004765422969100669, 'samples': 4333824, 'steps': 22571, 'loss/train': 1.6836577653884888} 11/07/2021 00:24:37 - INFO - __main__ - Step 22573: {'lr': 0.00047654005255532247, 'samples': 4334016, 'steps': 22572, 'loss/train': 1.4743589162826538} 11/07/2021 00:24:38 - INFO - __main__ - Step 22574: {'lr': 0.0004765378080985026, 'samples': 4334208, 'steps': 22573, 'loss/train': 1.1934410333633423} 11/07/2021 00:24:38 - INFO - __main__ - Step 22575: {'lr': 0.00047653556353960825, 'samples': 4334400, 'steps': 22574, 'loss/train': 1.6637202501296997} 11/07/2021 00:24:38 - INFO - __main__ - Step 22576: {'lr': 0.0004765333188786404, 'samples': 4334592, 'steps': 22575, 'loss/train': 1.433510661125183} 11/07/2021 00:24:40 - INFO - __main__ - Step 22577: {'lr': 0.00047653107411560025, 'samples': 4334784, 'steps': 22576, 'loss/train': 0.8190040588378906} 11/07/2021 00:24:40 - INFO - __main__ - Step 22578: {'lr': 0.00047652882925048863, 'samples': 4334976, 'steps': 22577, 'loss/train': 1.0744819641113281} 11/07/2021 00:24:40 - INFO - __main__ - Step 22579: {'lr': 0.00047652658428330664, 'samples': 4335168, 'steps': 22578, 'loss/train': 1.8014850616455078} 11/07/2021 00:24:41 - INFO - __main__ - Step 22580: {'lr': 0.00047652433921405526, 'samples': 4335360, 'steps': 22579, 'loss/train': 1.555201530456543} 11/07/2021 00:24:41 - INFO - __main__ - Step 22581: {'lr': 0.0004765220940427355, 'samples': 4335552, 'steps': 22580, 'loss/train': 1.776862621307373} 11/07/2021 00:24:41 - INFO - __main__ - Step 22582: {'lr': 0.0004765198487693484, 'samples': 4335744, 'steps': 22581, 'loss/train': 1.9247335195541382} 11/07/2021 00:24:42 - INFO - __main__ - Step 22583: {'lr': 0.00047651760339389494, 'samples': 4335936, 'steps': 22582, 'loss/train': 1.5864671468734741} 11/07/2021 00:24:43 - INFO - __main__ - Step 22584: {'lr': 0.0004765153579163761, 'samples': 4336128, 'steps': 22583, 'loss/train': 1.8406521081924438} 11/07/2021 00:24:43 - INFO - __main__ - Step 22585: {'lr': 0.000476513112336793, 'samples': 4336320, 'steps': 22584, 'loss/train': 1.4075356721878052} 11/07/2021 00:24:43 - INFO - __main__ - Step 22586: {'lr': 0.00047651086665514655, 'samples': 4336512, 'steps': 22585, 'loss/train': 1.6485368013381958} 11/07/2021 00:24:44 - INFO - __main__ - Step 22587: {'lr': 0.00047650862087143787, 'samples': 4336704, 'steps': 22586, 'loss/train': 1.588104248046875} 11/07/2021 00:24:44 - INFO - __main__ - Step 22588: {'lr': 0.0004765063749856678, 'samples': 4336896, 'steps': 22587, 'loss/train': 1.528891682624817} 11/07/2021 00:24:45 - INFO - __main__ - Step 22589: {'lr': 0.00047650412899783747, 'samples': 4337088, 'steps': 22588, 'loss/train': 1.6844433546066284} 11/07/2021 00:24:45 - INFO - __main__ - Step 22590: {'lr': 0.0004765018829079479, 'samples': 4337280, 'steps': 22589, 'loss/train': 2.1266448497772217} 11/07/2021 00:24:46 - INFO - __main__ - Step 22591: {'lr': 0.0004764996367160001, 'samples': 4337472, 'steps': 22590, 'loss/train': 1.5239567756652832} 11/07/2021 00:24:46 - INFO - __main__ - Step 22592: {'lr': 0.000476497390421995, 'samples': 4337664, 'steps': 22591, 'loss/train': 1.8610912561416626} 11/07/2021 00:24:47 - INFO - __main__ - Step 22593: {'lr': 0.00047649514402593377, 'samples': 4337856, 'steps': 22592, 'loss/train': 1.4386012554168701} 11/07/2021 00:24:48 - INFO - __main__ - Step 22594: {'lr': 0.0004764928975278172, 'samples': 4338048, 'steps': 22593, 'loss/train': 1.2887240648269653} 11/07/2021 00:24:48 - INFO - __main__ - Step 22595: {'lr': 0.0004764906509276465, 'samples': 4338240, 'steps': 22594, 'loss/train': 1.515057921409607} 11/07/2021 00:24:48 - INFO - __main__ - Step 22596: {'lr': 0.0004764884042254226, 'samples': 4338432, 'steps': 22595, 'loss/train': 1.320374846458435} 11/07/2021 00:24:49 - INFO - __main__ - Step 22597: {'lr': 0.0004764861574211465, 'samples': 4338624, 'steps': 22596, 'loss/train': 1.4135661125183105} 11/07/2021 00:24:49 - INFO - __main__ - Step 22598: {'lr': 0.0004764839105148193, 'samples': 4338816, 'steps': 22597, 'loss/train': 1.6835685968399048} 11/07/2021 00:24:49 - INFO - __main__ - Step 22599: {'lr': 0.00047648166350644185, 'samples': 4339008, 'steps': 22598, 'loss/train': 1.2688534259796143} 11/07/2021 00:24:51 - INFO - __main__ - Step 22600: {'lr': 0.00047647941639601535, 'samples': 4339200, 'steps': 22599, 'loss/train': 2.066715955734253} 11/07/2021 00:24:51 - INFO - __main__ - Step 22601: {'lr': 0.00047647716918354066, 'samples': 4339392, 'steps': 22600, 'loss/train': 5.759670734405518} 11/07/2021 00:24:51 - INFO - __main__ - Step 22602: {'lr': 0.00047647492186901884, 'samples': 4339584, 'steps': 22601, 'loss/train': 1.9553258419036865} 11/07/2021 00:24:52 - INFO - __main__ - Step 22603: {'lr': 0.0004764726744524509, 'samples': 4339776, 'steps': 22602, 'loss/train': 1.3782130479812622} 11/07/2021 00:24:52 - INFO - __main__ - Step 22604: {'lr': 0.0004764704269338379, 'samples': 4339968, 'steps': 22603, 'loss/train': 1.4786680936813354} 11/07/2021 00:24:52 - INFO - __main__ - Step 22605: {'lr': 0.00047646817931318086, 'samples': 4340160, 'steps': 22604, 'loss/train': 1.6348986625671387} 11/07/2021 00:24:53 - INFO - __main__ - Step 22606: {'lr': 0.0004764659315904807, 'samples': 4340352, 'steps': 22605, 'loss/train': 1.906373143196106} 11/07/2021 00:24:54 - INFO - __main__ - Step 22607: {'lr': 0.0004764636837657385, 'samples': 4340544, 'steps': 22606, 'loss/train': 1.5434366464614868} 11/07/2021 00:24:54 - INFO - __main__ - Step 22608: {'lr': 0.0004764614358389553, 'samples': 4340736, 'steps': 22607, 'loss/train': 1.7191797494888306} 11/07/2021 00:24:55 - INFO - __main__ - Step 22609: {'lr': 0.00047645918781013196, 'samples': 4340928, 'steps': 22608, 'loss/train': 1.4511476755142212} 11/07/2021 00:24:55 - INFO - __main__ - Step 22610: {'lr': 0.0004764569396792697, 'samples': 4341120, 'steps': 22609, 'loss/train': 1.2064729928970337} 11/07/2021 00:24:56 - INFO - __main__ - Step 22611: {'lr': 0.0004764546914463694, 'samples': 4341312, 'steps': 22610, 'loss/train': 1.7000502347946167} 11/07/2021 00:24:56 - INFO - __main__ - Step 22612: {'lr': 0.0004764524431114321, 'samples': 4341504, 'steps': 22611, 'loss/train': 1.9246582984924316} 11/07/2021 00:24:57 - INFO - __main__ - Step 22613: {'lr': 0.0004764501946744589, 'samples': 4341696, 'steps': 22612, 'loss/train': 1.507423996925354} 11/07/2021 00:24:57 - INFO - __main__ - Step 22614: {'lr': 0.00047644794613545065, 'samples': 4341888, 'steps': 22613, 'loss/train': 1.790145993232727} 11/07/2021 00:24:57 - INFO - __main__ - Step 22615: {'lr': 0.00047644569749440846, 'samples': 4342080, 'steps': 22614, 'loss/train': 1.3438689708709717} 11/07/2021 00:24:58 - INFO - __main__ - Step 22616: {'lr': 0.0004764434487513334, 'samples': 4342272, 'steps': 22615, 'loss/train': 1.449084997177124} 11/07/2021 00:24:59 - INFO - __main__ - Step 22617: {'lr': 0.00047644119990622637, 'samples': 4342464, 'steps': 22616, 'loss/train': 1.7146893739700317} 11/07/2021 00:24:59 - INFO - __main__ - Step 22618: {'lr': 0.0004764389509590884, 'samples': 4342656, 'steps': 22617, 'loss/train': 1.4914770126342773} 11/07/2021 00:24:59 - INFO - __main__ - Step 22619: {'lr': 0.0004764367019099206, 'samples': 4342848, 'steps': 22618, 'loss/train': 1.383577823638916} 11/07/2021 00:25:00 - INFO - __main__ - Step 22620: {'lr': 0.0004764344527587239, 'samples': 4343040, 'steps': 22619, 'loss/train': 1.3901386260986328} 11/07/2021 00:25:01 - INFO - __main__ - Step 22621: {'lr': 0.00047643220350549934, 'samples': 4343232, 'steps': 22620, 'loss/train': 1.5086541175842285} 11/07/2021 00:25:01 - INFO - __main__ - Step 22622: {'lr': 0.0004764299541502478, 'samples': 4343424, 'steps': 22621, 'loss/train': 1.4755960702896118} 11/07/2021 00:25:01 - INFO - __main__ - Step 22623: {'lr': 0.0004764277046929706, 'samples': 4343616, 'steps': 22622, 'loss/train': 1.5495747327804565} 11/07/2021 00:25:02 - INFO - __main__ - Step 22624: {'lr': 0.00047642545513366843, 'samples': 4343808, 'steps': 22623, 'loss/train': 2.366577625274658} 11/07/2021 00:25:02 - INFO - __main__ - Step 22625: {'lr': 0.0004764232054723425, 'samples': 4344000, 'steps': 22624, 'loss/train': 1.5127291679382324} 11/07/2021 00:25:03 - INFO - __main__ - Step 22626: {'lr': 0.0004764209557089938, 'samples': 4344192, 'steps': 22625, 'loss/train': 1.3345847129821777} 11/07/2021 00:25:03 - INFO - __main__ - Step 22627: {'lr': 0.00047641870584362323, 'samples': 4344384, 'steps': 22626, 'loss/train': 1.7284650802612305} 11/07/2021 00:25:04 - INFO - __main__ - Step 22628: {'lr': 0.00047641645587623196, 'samples': 4344576, 'steps': 22627, 'loss/train': 1.660504698753357} 11/07/2021 00:25:04 - INFO - __main__ - Step 22629: {'lr': 0.0004764142058068209, 'samples': 4344768, 'steps': 22628, 'loss/train': 1.7857352495193481} 11/07/2021 00:25:05 - INFO - __main__ - Step 22630: {'lr': 0.00047641195563539107, 'samples': 4344960, 'steps': 22629, 'loss/train': 1.4394036531448364} 11/07/2021 00:25:05 - INFO - __main__ - Step 22631: {'lr': 0.0004764097053619435, 'samples': 4345152, 'steps': 22630, 'loss/train': 1.6884974241256714} 11/07/2021 00:25:06 - INFO - __main__ - Step 22632: {'lr': 0.00047640745498647925, 'samples': 4345344, 'steps': 22631, 'loss/train': 1.6962106227874756} 11/07/2021 00:25:06 - INFO - __main__ - Step 22633: {'lr': 0.00047640520450899926, 'samples': 4345536, 'steps': 22632, 'loss/train': 1.340884804725647} 11/07/2021 00:25:07 - INFO - __main__ - Step 22634: {'lr': 0.0004764029539295046, 'samples': 4345728, 'steps': 22633, 'loss/train': 1.3287487030029297} 11/07/2021 00:25:07 - INFO - __main__ - Step 22635: {'lr': 0.0004764007032479963, 'samples': 4345920, 'steps': 22634, 'loss/train': 1.8320151567459106} 11/07/2021 00:25:07 - INFO - __main__ - Step 22636: {'lr': 0.00047639845246447534, 'samples': 4346112, 'steps': 22635, 'loss/train': 1.7329380512237549} 11/07/2021 00:25:08 - INFO - __main__ - Step 22637: {'lr': 0.00047639620157894264, 'samples': 4346304, 'steps': 22636, 'loss/train': 1.4954795837402344} 11/07/2021 00:25:09 - INFO - __main__ - Step 22638: {'lr': 0.00047639395059139936, 'samples': 4346496, 'steps': 22637, 'loss/train': 1.446747064590454} 11/07/2021 00:25:09 - INFO - __main__ - Step 22639: {'lr': 0.0004763916995018465, 'samples': 4346688, 'steps': 22638, 'loss/train': 1.718592643737793} 11/07/2021 00:25:09 - INFO - __main__ - Step 22640: {'lr': 0.00047638944831028497, 'samples': 4346880, 'steps': 22639, 'loss/train': 1.7990508079528809} 11/07/2021 00:25:10 - INFO - __main__ - Step 22641: {'lr': 0.00047638719701671587, 'samples': 4347072, 'steps': 22640, 'loss/train': 4.697803497314453} 11/07/2021 00:25:11 - INFO - __main__ - Step 22642: {'lr': 0.00047638494562114015, 'samples': 4347264, 'steps': 22641, 'loss/train': 1.645540714263916} 11/07/2021 00:25:11 - INFO - __main__ - Step 22643: {'lr': 0.0004763826941235589, 'samples': 4347456, 'steps': 22642, 'loss/train': 1.7661207914352417} 11/07/2021 00:25:11 - INFO - __main__ - Step 22644: {'lr': 0.00047638044252397313, 'samples': 4347648, 'steps': 22643, 'loss/train': 1.580713152885437} 11/07/2021 00:25:12 - INFO - __main__ - Step 22645: {'lr': 0.0004763781908223838, 'samples': 4347840, 'steps': 22644, 'loss/train': 1.46024489402771} 11/07/2021 00:25:12 - INFO - __main__ - Step 22646: {'lr': 0.00047637593901879194, 'samples': 4348032, 'steps': 22645, 'loss/train': 1.8002101182937622} 11/07/2021 00:25:13 - INFO - __main__ - Step 22647: {'lr': 0.00047637368711319863, 'samples': 4348224, 'steps': 22646, 'loss/train': 2.0005905628204346} 11/07/2021 00:25:14 - INFO - __main__ - Step 22648: {'lr': 0.00047637143510560477, 'samples': 4348416, 'steps': 22647, 'loss/train': 1.71867835521698} 11/07/2021 00:25:14 - INFO - __main__ - Step 22649: {'lr': 0.0004763691829960114, 'samples': 4348608, 'steps': 22648, 'loss/train': 1.8255470991134644} 11/07/2021 00:25:14 - INFO - __main__ - Step 22650: {'lr': 0.00047636693078441963, 'samples': 4348800, 'steps': 22649, 'loss/train': 1.0353654623031616} 11/07/2021 00:25:15 - INFO - __main__ - Step 22651: {'lr': 0.0004763646784708304, 'samples': 4348992, 'steps': 22650, 'loss/train': 1.4039242267608643} 11/07/2021 00:25:15 - INFO - __main__ - Step 22652: {'lr': 0.00047636242605524477, 'samples': 4349184, 'steps': 22651, 'loss/train': 1.3490198850631714} 11/07/2021 00:25:16 - INFO - __main__ - Step 22653: {'lr': 0.0004763601735376637, 'samples': 4349376, 'steps': 22652, 'loss/train': 2.0028791427612305} 11/07/2021 00:25:16 - INFO - __main__ - Step 22654: {'lr': 0.0004763579209180882, 'samples': 4349568, 'steps': 22653, 'loss/train': 1.5570244789123535} 11/07/2021 00:25:17 - INFO - __main__ - Step 22655: {'lr': 0.00047635566819651936, 'samples': 4349760, 'steps': 22654, 'loss/train': 1.7072210311889648} 11/07/2021 00:25:17 - INFO - __main__ - Step 22656: {'lr': 0.00047635341537295814, 'samples': 4349952, 'steps': 22655, 'loss/train': 1.369418740272522} 11/07/2021 00:25:17 - INFO - __main__ - Step 22657: {'lr': 0.0004763511624474055, 'samples': 4350144, 'steps': 22656, 'loss/train': 2.147608518600464} 11/07/2021 00:25:18 - INFO - __main__ - Step 22658: {'lr': 0.00047634890941986263, 'samples': 4350336, 'steps': 22657, 'loss/train': 1.8280770778656006} 11/07/2021 00:25:19 - INFO - __main__ - Step 22659: {'lr': 0.00047634665629033035, 'samples': 4350528, 'steps': 22658, 'loss/train': 1.259914755821228} 11/07/2021 00:25:19 - INFO - __main__ - Step 22660: {'lr': 0.00047634440305880976, 'samples': 4350720, 'steps': 22659, 'loss/train': 1.7895315885543823} 11/07/2021 00:25:19 - INFO - __main__ - Step 22661: {'lr': 0.0004763421497253019, 'samples': 4350912, 'steps': 22660, 'loss/train': 1.1604344844818115} 11/07/2021 00:25:20 - INFO - __main__ - Step 22662: {'lr': 0.0004763398962898078, 'samples': 4351104, 'steps': 22661, 'loss/train': 1.401200771331787} 11/07/2021 00:25:21 - INFO - __main__ - Step 22663: {'lr': 0.0004763376427523284, 'samples': 4351296, 'steps': 22662, 'loss/train': 1.4624043703079224} 11/07/2021 00:25:21 - INFO - __main__ - Step 22664: {'lr': 0.0004763353891128648, 'samples': 4351488, 'steps': 22663, 'loss/train': 0.9483494758605957} 11/07/2021 00:25:22 - INFO - __main__ - Step 22665: {'lr': 0.00047633313537141786, 'samples': 4351680, 'steps': 22664, 'loss/train': 1.6869838237762451} 11/07/2021 00:25:22 - INFO - __main__ - Step 22666: {'lr': 0.00047633088152798875, 'samples': 4351872, 'steps': 22665, 'loss/train': 1.3631956577301025} 11/07/2021 00:25:22 - INFO - __main__ - Step 22667: {'lr': 0.00047632862758257845, 'samples': 4352064, 'steps': 22666, 'loss/train': 1.072783350944519} 11/07/2021 00:25:23 - INFO - __main__ - Step 22668: {'lr': 0.0004763263735351879, 'samples': 4352256, 'steps': 22667, 'loss/train': 1.6640249490737915} 11/07/2021 00:25:24 - INFO - __main__ - Step 22669: {'lr': 0.0004763241193858183, 'samples': 4352448, 'steps': 22668, 'loss/train': 1.0642592906951904} 11/07/2021 00:25:24 - INFO - __main__ - Step 22670: {'lr': 0.00047632186513447045, 'samples': 4352640, 'steps': 22669, 'loss/train': 1.4311997890472412} 11/07/2021 00:25:24 - INFO - __main__ - Step 22671: {'lr': 0.0004763196107811455, 'samples': 4352832, 'steps': 22670, 'loss/train': 1.3761380910873413} 11/07/2021 00:25:25 - INFO - __main__ - Step 22672: {'lr': 0.0004763173563258444, 'samples': 4353024, 'steps': 22671, 'loss/train': 1.5977585315704346} 11/07/2021 00:25:25 - INFO - __main__ - Step 22673: {'lr': 0.0004763151017685682, 'samples': 4353216, 'steps': 22672, 'loss/train': 1.4709689617156982} 11/07/2021 00:25:26 - INFO - __main__ - Step 22674: {'lr': 0.0004763128471093179, 'samples': 4353408, 'steps': 22673, 'loss/train': 1.834922432899475} 11/07/2021 00:25:27 - INFO - __main__ - Step 22675: {'lr': 0.0004763105923480946, 'samples': 4353600, 'steps': 22674, 'loss/train': 1.8651885986328125} 11/07/2021 00:25:27 - INFO - __main__ - Step 22676: {'lr': 0.0004763083374848991, 'samples': 4353792, 'steps': 22675, 'loss/train': 1.3292526006698608} 11/07/2021 00:25:27 - INFO - __main__ - Step 22677: {'lr': 0.00047630608251973265, 'samples': 4353984, 'steps': 22676, 'loss/train': 2.0414798259735107} 11/07/2021 00:25:28 - INFO - __main__ - Step 22678: {'lr': 0.00047630382745259616, 'samples': 4354176, 'steps': 22677, 'loss/train': 1.8651529550552368} 11/07/2021 00:25:29 - INFO - __main__ - Step 22679: {'lr': 0.0004763015722834907, 'samples': 4354368, 'steps': 22678, 'loss/train': 1.882495403289795} 11/07/2021 00:25:29 - INFO - __main__ - Step 22680: {'lr': 0.00047629931701241715, 'samples': 4354560, 'steps': 22679, 'loss/train': 1.380403757095337} 11/07/2021 00:25:29 - INFO - __main__ - Step 22681: {'lr': 0.0004762970616393767, 'samples': 4354752, 'steps': 22680, 'loss/train': 1.7098302841186523} 11/07/2021 00:25:30 - INFO - __main__ - Step 22682: {'lr': 0.0004762948061643702, 'samples': 4354944, 'steps': 22681, 'loss/train': 1.2838383913040161} 11/07/2021 00:25:30 - INFO - __main__ - Step 22683: {'lr': 0.0004762925505873988, 'samples': 4355136, 'steps': 22682, 'loss/train': 1.9014365673065186} 11/07/2021 00:25:31 - INFO - __main__ - Step 22684: {'lr': 0.00047629029490846346, 'samples': 4355328, 'steps': 22683, 'loss/train': 1.598739743232727} 11/07/2021 00:25:31 - INFO - __main__ - Step 22685: {'lr': 0.00047628803912756523, 'samples': 4355520, 'steps': 22684, 'loss/train': 1.638590931892395} 11/07/2021 00:25:32 - INFO - __main__ - Step 22686: {'lr': 0.00047628578324470505, 'samples': 4355712, 'steps': 22685, 'loss/train': 1.5135688781738281} 11/07/2021 00:25:32 - INFO - __main__ - Step 22687: {'lr': 0.00047628352725988406, 'samples': 4355904, 'steps': 22686, 'loss/train': 2.5237491130828857} 11/07/2021 00:25:32 - INFO - __main__ - Step 22688: {'lr': 0.0004762812711731032, 'samples': 4356096, 'steps': 22687, 'loss/train': 1.1969538927078247} 11/07/2021 00:25:34 - INFO - __main__ - Step 22689: {'lr': 0.00047627901498436344, 'samples': 4356288, 'steps': 22688, 'loss/train': 1.1763588190078735} 11/07/2021 00:25:34 - INFO - __main__ - Step 22690: {'lr': 0.0004762767586936658, 'samples': 4356480, 'steps': 22689, 'loss/train': 1.466115117073059} 11/07/2021 00:25:34 - INFO - __main__ - Step 22691: {'lr': 0.00047627450230101144, 'samples': 4356672, 'steps': 22690, 'loss/train': 1.8381071090698242} 11/07/2021 00:25:35 - INFO - __main__ - Step 22692: {'lr': 0.0004762722458064013, 'samples': 4356864, 'steps': 22691, 'loss/train': 2.0725739002227783} 11/07/2021 00:25:35 - INFO - __main__ - Step 22693: {'lr': 0.0004762699892098363, 'samples': 4357056, 'steps': 22692, 'loss/train': 1.396986961364746} 11/07/2021 00:25:36 - INFO - __main__ - Step 22694: {'lr': 0.0004762677325113176, 'samples': 4357248, 'steps': 22693, 'loss/train': 1.1241483688354492} 11/07/2021 00:25:37 - INFO - __main__ - Step 22695: {'lr': 0.0004762654757108461, 'samples': 4357440, 'steps': 22694, 'loss/train': 1.4990808963775635} 11/07/2021 00:25:37 - INFO - __main__ - Step 22696: {'lr': 0.00047626321880842287, 'samples': 4357632, 'steps': 22695, 'loss/train': 1.609912395477295} 11/07/2021 00:25:37 - INFO - __main__ - Step 22697: {'lr': 0.00047626096180404895, 'samples': 4357824, 'steps': 22696, 'loss/train': 0.8824257850646973} 11/07/2021 00:25:38 - INFO - __main__ - Step 22698: {'lr': 0.0004762587046977253, 'samples': 4358016, 'steps': 22697, 'loss/train': 1.386871337890625} 11/07/2021 00:25:38 - INFO - __main__ - Step 22699: {'lr': 0.000476256447489453, 'samples': 4358208, 'steps': 22698, 'loss/train': 1.5603753328323364} 11/07/2021 00:25:39 - INFO - __main__ - Step 22700: {'lr': 0.000476254190179233, 'samples': 4358400, 'steps': 22699, 'loss/train': 1.339954137802124} 11/07/2021 00:25:40 - INFO - __main__ - Step 22701: {'lr': 0.0004762519327670664, 'samples': 4358592, 'steps': 22700, 'loss/train': 1.212175965309143} 11/07/2021 00:25:40 - INFO - __main__ - Step 22702: {'lr': 0.0004762496752529541, 'samples': 4358784, 'steps': 22701, 'loss/train': 1.3022215366363525} 11/07/2021 00:25:40 - INFO - __main__ - Step 22703: {'lr': 0.0004762474176368973, 'samples': 4358976, 'steps': 22702, 'loss/train': 1.2095423936843872} 11/07/2021 00:25:41 - INFO - __main__ - Step 22704: {'lr': 0.00047624515991889684, 'samples': 4359168, 'steps': 22703, 'loss/train': 1.5306414365768433} 11/07/2021 00:25:42 - INFO - __main__ - Step 22705: {'lr': 0.00047624290209895384, 'samples': 4359360, 'steps': 22704, 'loss/train': 1.1710671186447144} 11/07/2021 00:25:42 - INFO - __main__ - Step 22706: {'lr': 0.00047624064417706917, 'samples': 4359552, 'steps': 22705, 'loss/train': 1.3341710567474365} 11/07/2021 00:25:42 - INFO - __main__ - Step 22707: {'lr': 0.00047623838615324407, 'samples': 4359744, 'steps': 22706, 'loss/train': 1.8324825763702393} 11/07/2021 00:25:43 - INFO - __main__ - Step 22708: {'lr': 0.0004762361280274794, 'samples': 4359936, 'steps': 22707, 'loss/train': 1.2356606721878052} 11/07/2021 00:25:43 - INFO - __main__ - Step 22709: {'lr': 0.0004762338697997762, 'samples': 4360128, 'steps': 22708, 'loss/train': 1.3746553659439087} 11/07/2021 00:25:44 - INFO - __main__ - Step 22710: {'lr': 0.00047623161147013557, 'samples': 4360320, 'steps': 22709, 'loss/train': 1.6129733324050903} 11/07/2021 00:25:44 - INFO - __main__ - Step 22711: {'lr': 0.0004762293530385584, 'samples': 4360512, 'steps': 22710, 'loss/train': 1.0353015661239624} 11/07/2021 00:25:45 - INFO - __main__ - Step 22712: {'lr': 0.0004762270945050458, 'samples': 4360704, 'steps': 22711, 'loss/train': 1.751045823097229} 11/07/2021 00:25:45 - INFO - __main__ - Step 22713: {'lr': 0.00047622483586959877, 'samples': 4360896, 'steps': 22712, 'loss/train': 2.044782876968384} 11/07/2021 00:25:45 - INFO - __main__ - Step 22714: {'lr': 0.00047622257713221826, 'samples': 4361088, 'steps': 22713, 'loss/train': 1.5603699684143066} 11/07/2021 00:25:47 - INFO - __main__ - Step 22715: {'lr': 0.00047622031829290545, 'samples': 4361280, 'steps': 22714, 'loss/train': 1.6232585906982422} 11/07/2021 00:25:47 - INFO - __main__ - Step 22716: {'lr': 0.0004762180593516612, 'samples': 4361472, 'steps': 22715, 'loss/train': 1.375087857246399} 11/07/2021 00:25:47 - INFO - __main__ - Step 22717: {'lr': 0.0004762158003084867, 'samples': 4361664, 'steps': 22716, 'loss/train': 1.8091025352478027} 11/07/2021 00:25:48 - INFO - __main__ - Step 22718: {'lr': 0.0004762135411633827, 'samples': 4361856, 'steps': 22717, 'loss/train': 1.6967964172363281} 11/07/2021 00:25:48 - INFO - __main__ - Step 22719: {'lr': 0.0004762112819163504, 'samples': 4362048, 'steps': 22718, 'loss/train': 1.4949711561203003} 11/07/2021 00:25:48 - INFO - __main__ - Step 22720: {'lr': 0.0004762090225673908, 'samples': 4362240, 'steps': 22719, 'loss/train': 1.857780933380127} 11/07/2021 00:25:49 - INFO - __main__ - Step 22721: {'lr': 0.0004762067631165049, 'samples': 4362432, 'steps': 22720, 'loss/train': 1.3588804006576538} 11/07/2021 00:25:50 - INFO - __main__ - Step 22722: {'lr': 0.0004762045035636937, 'samples': 4362624, 'steps': 22721, 'loss/train': 1.1537671089172363} 11/07/2021 00:25:50 - INFO - __main__ - Step 22723: {'lr': 0.0004762022439089583, 'samples': 4362816, 'steps': 22722, 'loss/train': 1.5845859050750732} 11/07/2021 00:25:50 - INFO - __main__ - Step 22724: {'lr': 0.0004761999841522996, 'samples': 4363008, 'steps': 22723, 'loss/train': 1.457655668258667} 11/07/2021 00:25:51 - INFO - __main__ - Step 22725: {'lr': 0.0004761977242937188, 'samples': 4363200, 'steps': 22724, 'loss/train': 1.7326648235321045} 11/07/2021 00:25:52 - INFO - __main__ - Step 22726: {'lr': 0.00047619546433321663, 'samples': 4363392, 'steps': 22725, 'loss/train': 2.0075771808624268} 11/07/2021 00:25:52 - INFO - __main__ - Step 22727: {'lr': 0.00047619320427079437, 'samples': 4363584, 'steps': 22726, 'loss/train': 1.7012308835983276} 11/07/2021 00:25:52 - INFO - __main__ - Step 22728: {'lr': 0.00047619094410645293, 'samples': 4363776, 'steps': 22727, 'loss/train': 1.6955498456954956} 11/07/2021 00:25:53 - INFO - __main__ - Step 22729: {'lr': 0.0004761886838401933, 'samples': 4363968, 'steps': 22728, 'loss/train': 1.3284543752670288} 11/07/2021 00:25:53 - INFO - __main__ - Step 22730: {'lr': 0.0004761864234720166, 'samples': 4364160, 'steps': 22729, 'loss/train': 1.4040530920028687} 11/07/2021 00:25:54 - INFO - __main__ - Step 22731: {'lr': 0.00047618416300192375, 'samples': 4364352, 'steps': 22730, 'loss/train': 1.7706865072250366} 11/07/2021 00:25:55 - INFO - __main__ - Step 22732: {'lr': 0.0004761819024299158, 'samples': 4364544, 'steps': 22731, 'loss/train': 1.3994696140289307} 11/07/2021 00:25:55 - INFO - __main__ - Step 22733: {'lr': 0.0004761796417559938, 'samples': 4364736, 'steps': 22732, 'loss/train': 1.0253995656967163} 11/07/2021 00:25:55 - INFO - __main__ - Step 22734: {'lr': 0.0004761773809801587, 'samples': 4364928, 'steps': 22733, 'loss/train': 1.6148645877838135} 11/07/2021 00:25:56 - INFO - __main__ - Step 22735: {'lr': 0.0004761751201024116, 'samples': 4365120, 'steps': 22734, 'loss/train': 1.7968497276306152} 11/07/2021 00:25:56 - INFO - __main__ - Step 22736: {'lr': 0.0004761728591227535, 'samples': 4365312, 'steps': 22735, 'loss/train': 1.335252285003662} 11/07/2021 00:25:57 - INFO - __main__ - Step 22737: {'lr': 0.00047617059804118536, 'samples': 4365504, 'steps': 22736, 'loss/train': 1.6151665449142456} 11/07/2021 00:25:57 - INFO - __main__ - Step 22738: {'lr': 0.0004761683368577083, 'samples': 4365696, 'steps': 22737, 'loss/train': 1.8413608074188232} 11/07/2021 00:25:58 - INFO - __main__ - Step 22739: {'lr': 0.0004761660755723232, 'samples': 4365888, 'steps': 22738, 'loss/train': 1.8213224411010742} 11/07/2021 00:25:58 - INFO - __main__ - Step 22740: {'lr': 0.0004761638141850312, 'samples': 4366080, 'steps': 22739, 'loss/train': 0.6306785345077515} 11/07/2021 00:25:58 - INFO - __main__ - Step 22741: {'lr': 0.0004761615526958333, 'samples': 4366272, 'steps': 22740, 'loss/train': 2.021019697189331} 11/07/2021 00:26:00 - INFO - __main__ - Step 22742: {'lr': 0.0004761592911047304, 'samples': 4366464, 'steps': 22741, 'loss/train': 1.3894141912460327} 11/07/2021 00:26:00 - INFO - __main__ - Step 22743: {'lr': 0.00047615702941172366, 'samples': 4366656, 'steps': 22742, 'loss/train': 1.7450761795043945} 11/07/2021 00:26:00 - INFO - __main__ - Step 22744: {'lr': 0.0004761547676168141, 'samples': 4366848, 'steps': 22743, 'loss/train': 1.6096402406692505} 11/07/2021 00:26:01 - INFO - __main__ - Step 22745: {'lr': 0.0004761525057200027, 'samples': 4367040, 'steps': 22744, 'loss/train': 1.6301637887954712} 11/07/2021 00:26:01 - INFO - __main__ - Step 22746: {'lr': 0.00047615024372129033, 'samples': 4367232, 'steps': 22745, 'loss/train': 1.208552360534668} 11/07/2021 00:26:01 - INFO - __main__ - Step 22747: {'lr': 0.0004761479816206783, 'samples': 4367424, 'steps': 22746, 'loss/train': 1.8603065013885498} 11/07/2021 00:26:02 - INFO - __main__ - Step 22748: {'lr': 0.00047614571941816743, 'samples': 4367616, 'steps': 22747, 'loss/train': 1.9403351545333862} 11/07/2021 00:26:03 - INFO - __main__ - Step 22749: {'lr': 0.00047614345711375874, 'samples': 4367808, 'steps': 22748, 'loss/train': 1.3546077013015747} 11/07/2021 00:26:03 - INFO - __main__ - Step 22750: {'lr': 0.0004761411947074533, 'samples': 4368000, 'steps': 22749, 'loss/train': 1.303208351135254} 11/07/2021 00:26:03 - INFO - __main__ - Step 22751: {'lr': 0.00047613893219925217, 'samples': 4368192, 'steps': 22750, 'loss/train': 1.6966766119003296} 11/07/2021 00:26:04 - INFO - __main__ - Step 22752: {'lr': 0.00047613666958915636, 'samples': 4368384, 'steps': 22751, 'loss/train': 1.6795576810836792} 11/07/2021 00:26:05 - INFO - __main__ - Step 22753: {'lr': 0.0004761344068771668, 'samples': 4368576, 'steps': 22752, 'loss/train': 1.901644229888916} 11/07/2021 00:26:05 - INFO - __main__ - Step 22754: {'lr': 0.0004761321440632846, 'samples': 4368768, 'steps': 22753, 'loss/train': 1.7572957277297974} 11/07/2021 00:26:05 - INFO - __main__ - Step 22755: {'lr': 0.00047612988114751074, 'samples': 4368960, 'steps': 22754, 'loss/train': 1.728715181350708} 11/07/2021 00:26:06 - INFO - __main__ - Step 22756: {'lr': 0.00047612761812984626, 'samples': 4369152, 'steps': 22755, 'loss/train': 1.803318738937378} 11/07/2021 00:26:06 - INFO - __main__ - Step 22757: {'lr': 0.00047612535501029215, 'samples': 4369344, 'steps': 22756, 'loss/train': 1.694913625717163} 11/07/2021 00:26:07 - INFO - __main__ - Step 22758: {'lr': 0.0004761230917888494, 'samples': 4369536, 'steps': 22757, 'loss/train': 1.8466224670410156} 11/07/2021 00:26:08 - INFO - __main__ - Step 22759: {'lr': 0.00047612082846551913, 'samples': 4369728, 'steps': 22758, 'loss/train': 1.1649069786071777} 11/07/2021 00:26:08 - INFO - __main__ - Step 22760: {'lr': 0.0004761185650403023, 'samples': 4369920, 'steps': 22759, 'loss/train': 1.9606319665908813} 11/07/2021 00:26:08 - INFO - __main__ - Step 22761: {'lr': 0.0004761163015131999, 'samples': 4370112, 'steps': 22760, 'loss/train': 1.2533822059631348} 11/07/2021 00:26:09 - INFO - __main__ - Step 22762: {'lr': 0.00047611403788421305, 'samples': 4370304, 'steps': 22761, 'loss/train': 1.934218168258667} 11/07/2021 00:26:10 - INFO - __main__ - Step 22763: {'lr': 0.0004761117741533426, 'samples': 4370496, 'steps': 22762, 'loss/train': 1.782967209815979} 11/07/2021 00:26:10 - INFO - __main__ - Step 22764: {'lr': 0.0004761095103205897, 'samples': 4370688, 'steps': 22763, 'loss/train': 1.5865823030471802} 11/07/2021 00:26:10 - INFO - __main__ - Step 22765: {'lr': 0.00047610724638595545, 'samples': 4370880, 'steps': 22764, 'loss/train': 1.7549775838851929} 11/07/2021 00:26:11 - INFO - __main__ - Step 22766: {'lr': 0.00047610498234944065, 'samples': 4371072, 'steps': 22765, 'loss/train': 1.639030933380127} 11/07/2021 00:26:11 - INFO - __main__ - Step 22767: {'lr': 0.00047610271821104647, 'samples': 4371264, 'steps': 22766, 'loss/train': 1.582705020904541} 11/07/2021 00:26:12 - INFO - __main__ - Step 22768: {'lr': 0.0004761004539707739, 'samples': 4371456, 'steps': 22767, 'loss/train': 1.0878734588623047} 11/07/2021 00:26:13 - INFO - __main__ - Step 22769: {'lr': 0.00047609818962862394, 'samples': 4371648, 'steps': 22768, 'loss/train': 1.2499237060546875} 11/07/2021 00:26:13 - INFO - __main__ - Step 22770: {'lr': 0.00047609592518459766, 'samples': 4371840, 'steps': 22769, 'loss/train': 1.562828540802002} 11/07/2021 00:26:13 - INFO - __main__ - Step 22771: {'lr': 0.00047609366063869595, 'samples': 4372032, 'steps': 22770, 'loss/train': 1.1859840154647827} 11/07/2021 00:26:14 - INFO - __main__ - Step 22772: {'lr': 0.00047609139599092006, 'samples': 4372224, 'steps': 22771, 'loss/train': 1.7285878658294678} 11/07/2021 00:26:14 - INFO - __main__ - Step 22773: {'lr': 0.0004760891312412708, 'samples': 4372416, 'steps': 22772, 'loss/train': 1.5550551414489746} 11/07/2021 00:26:15 - INFO - __main__ - Step 22774: {'lr': 0.0004760868663897493, 'samples': 4372608, 'steps': 22773, 'loss/train': 1.689923882484436} 11/07/2021 00:26:15 - INFO - __main__ - Step 22775: {'lr': 0.0004760846014363565, 'samples': 4372800, 'steps': 22774, 'loss/train': 1.6872327327728271} 11/07/2021 00:26:16 - INFO - __main__ - Step 22776: {'lr': 0.0004760823363810935, 'samples': 4372992, 'steps': 22775, 'loss/train': 1.6919316053390503} 11/07/2021 00:26:16 - INFO - __main__ - Step 22777: {'lr': 0.0004760800712239612, 'samples': 4373184, 'steps': 22776, 'loss/train': 1.7831788063049316} 11/07/2021 00:26:17 - INFO - __main__ - Step 22778: {'lr': 0.0004760778059649609, 'samples': 4373376, 'steps': 22777, 'loss/train': 1.4779175519943237} 11/07/2021 00:26:17 - INFO - __main__ - Step 22779: {'lr': 0.0004760755406040933, 'samples': 4373568, 'steps': 22778, 'loss/train': 1.4688366651535034} 11/07/2021 00:26:18 - INFO - __main__ - Step 22780: {'lr': 0.00047607327514135955, 'samples': 4373760, 'steps': 22779, 'loss/train': 1.6470462083816528} 11/07/2021 00:26:18 - INFO - __main__ - Step 22781: {'lr': 0.00047607100957676067, 'samples': 4373952, 'steps': 22780, 'loss/train': 1.4926844835281372} 11/07/2021 00:26:18 - INFO - __main__ - Step 22782: {'lr': 0.0004760687439102977, 'samples': 4374144, 'steps': 22781, 'loss/train': 1.4573156833648682} 11/07/2021 00:26:19 - INFO - __main__ - Step 22783: {'lr': 0.0004760664781419717, 'samples': 4374336, 'steps': 22782, 'loss/train': 1.8104127645492554} 11/07/2021 00:26:20 - INFO - __main__ - Step 22784: {'lr': 0.00047606421227178354, 'samples': 4374528, 'steps': 22783, 'loss/train': 1.9405661821365356} 11/07/2021 00:26:20 - INFO - __main__ - Step 22785: {'lr': 0.0004760619462997343, 'samples': 4374720, 'steps': 22784, 'loss/train': 1.883963942527771} 11/07/2021 00:26:20 - INFO - __main__ - Step 22786: {'lr': 0.00047605968022582513, 'samples': 4374912, 'steps': 22785, 'loss/train': 1.4331687688827515} 11/07/2021 00:26:21 - INFO - __main__ - Step 22787: {'lr': 0.000476057414050057, 'samples': 4375104, 'steps': 22786, 'loss/train': 1.257065773010254} 11/07/2021 00:26:21 - INFO - __main__ - Step 22788: {'lr': 0.00047605514777243076, 'samples': 4375296, 'steps': 22787, 'loss/train': 1.6153793334960938} 11/07/2021 00:26:22 - INFO - __main__ - Step 22789: {'lr': 0.0004760528813929476, 'samples': 4375488, 'steps': 22788, 'loss/train': 2.338101387023926} 11/07/2021 00:26:23 - INFO - __main__ - Step 22790: {'lr': 0.0004760506149116085, 'samples': 4375680, 'steps': 22789, 'loss/train': 2.3699982166290283} 11/07/2021 00:26:23 - INFO - __main__ - Step 22791: {'lr': 0.0004760483483284145, 'samples': 4375872, 'steps': 22790, 'loss/train': 1.6395772695541382} 11/07/2021 00:26:23 - INFO - __main__ - Step 22792: {'lr': 0.0004760460816433666, 'samples': 4376064, 'steps': 22791, 'loss/train': 1.8664559125900269} 11/07/2021 00:26:24 - INFO - __main__ - Step 22793: {'lr': 0.0004760438148564659, 'samples': 4376256, 'steps': 22792, 'loss/train': 1.484313726425171} 11/07/2021 00:26:24 - INFO - __main__ - Step 22794: {'lr': 0.00047604154796771327, 'samples': 4376448, 'steps': 22793, 'loss/train': 1.7671095132827759} 11/07/2021 00:26:25 - INFO - __main__ - Step 22795: {'lr': 0.0004760392809771098, 'samples': 4376640, 'steps': 22794, 'loss/train': 1.3843249082565308} 11/07/2021 00:26:25 - INFO - __main__ - Step 22796: {'lr': 0.00047603701388465646, 'samples': 4376832, 'steps': 22795, 'loss/train': 1.696913242340088} 11/07/2021 00:26:26 - INFO - __main__ - Step 22797: {'lr': 0.0004760347466903544, 'samples': 4377024, 'steps': 22796, 'loss/train': 1.2284456491470337} 11/07/2021 00:26:26 - INFO - __main__ - Step 22798: {'lr': 0.0004760324793942046, 'samples': 4377216, 'steps': 22797, 'loss/train': 1.663573145866394} 11/07/2021 00:26:26 - INFO - __main__ - Step 22799: {'lr': 0.000476030211996208, 'samples': 4377408, 'steps': 22798, 'loss/train': 1.5763083696365356} 11/07/2021 00:26:27 - INFO - __main__ - Step 22800: {'lr': 0.0004760279444963657, 'samples': 4377600, 'steps': 22799, 'loss/train': 1.9171547889709473} 11/07/2021 00:26:28 - INFO - __main__ - Step 22801: {'lr': 0.0004760256768946787, 'samples': 4377792, 'steps': 22800, 'loss/train': 1.7655497789382935} 11/07/2021 00:26:28 - INFO - __main__ - Step 22802: {'lr': 0.00047602340919114793, 'samples': 4377984, 'steps': 22801, 'loss/train': 1.8714632987976074} 11/07/2021 00:26:28 - INFO - __main__ - Step 22803: {'lr': 0.00047602114138577464, 'samples': 4378176, 'steps': 22802, 'loss/train': 1.8071902990341187} 11/07/2021 00:26:29 - INFO - __main__ - Step 22804: {'lr': 0.00047601887347855965, 'samples': 4378368, 'steps': 22803, 'loss/train': 1.7482866048812866} 11/07/2021 00:26:30 - INFO - __main__ - Step 22805: {'lr': 0.00047601660546950396, 'samples': 4378560, 'steps': 22804, 'loss/train': 1.632975697517395} 11/07/2021 00:26:30 - INFO - __main__ - Step 22806: {'lr': 0.0004760143373586088, 'samples': 4378752, 'steps': 22805, 'loss/train': 1.297473430633545} 11/07/2021 00:26:30 - INFO - __main__ - Step 22807: {'lr': 0.000476012069145875, 'samples': 4378944, 'steps': 22806, 'loss/train': 1.1406476497650146} 11/07/2021 00:26:31 - INFO - __main__ - Step 22808: {'lr': 0.00047600980083130367, 'samples': 4379136, 'steps': 22807, 'loss/train': 1.5739290714263916} 11/07/2021 00:26:31 - INFO - __main__ - Step 22809: {'lr': 0.0004760075324148959, 'samples': 4379328, 'steps': 22808, 'loss/train': 1.4321844577789307} 11/07/2021 00:26:32 - INFO - __main__ - Step 22810: {'lr': 0.00047600526389665246, 'samples': 4379520, 'steps': 22809, 'loss/train': 2.083242893218994} 11/07/2021 00:26:33 - INFO - __main__ - Step 22811: {'lr': 0.00047600299527657464, 'samples': 4379712, 'steps': 22810, 'loss/train': 1.2935341596603394} 11/07/2021 00:26:33 - INFO - __main__ - Step 22812: {'lr': 0.0004760007265546633, 'samples': 4379904, 'steps': 22811, 'loss/train': 1.2957768440246582} 11/07/2021 00:26:33 - INFO - __main__ - Step 22813: {'lr': 0.00047599845773091957, 'samples': 4380096, 'steps': 22812, 'loss/train': 1.912548303604126} 11/07/2021 00:26:34 - INFO - __main__ - Step 22814: {'lr': 0.0004759961888053444, 'samples': 4380288, 'steps': 22813, 'loss/train': 1.8325461149215698} 11/07/2021 00:26:34 - INFO - __main__ - Step 22815: {'lr': 0.00047599391977793884, 'samples': 4380480, 'steps': 22814, 'loss/train': 2.203472375869751} 11/07/2021 00:26:35 - INFO - __main__ - Step 22816: {'lr': 0.00047599165064870385, 'samples': 4380672, 'steps': 22815, 'loss/train': 1.6262705326080322} 11/07/2021 00:26:36 - INFO - __main__ - Step 22817: {'lr': 0.0004759893814176406, 'samples': 4380864, 'steps': 22816, 'loss/train': 1.609465479850769} 11/07/2021 00:26:36 - INFO - __main__ - Step 22818: {'lr': 0.00047598711208475, 'samples': 4381056, 'steps': 22817, 'loss/train': 1.697596549987793} 11/07/2021 00:26:36 - INFO - __main__ - Step 22819: {'lr': 0.00047598484265003307, 'samples': 4381248, 'steps': 22818, 'loss/train': 1.49398934841156} 11/07/2021 00:26:37 - INFO - __main__ - Step 22820: {'lr': 0.00047598257311349087, 'samples': 4381440, 'steps': 22819, 'loss/train': 1.8012315034866333} 11/07/2021 00:26:38 - INFO - __main__ - Step 22821: {'lr': 0.0004759803034751244, 'samples': 4381632, 'steps': 22820, 'loss/train': 1.7763862609863281} 11/07/2021 00:26:38 - INFO - __main__ - Step 22822: {'lr': 0.0004759780337349347, 'samples': 4381824, 'steps': 22821, 'loss/train': 1.1558592319488525} 11/07/2021 00:26:38 - INFO - __main__ - Step 22823: {'lr': 0.0004759757638929227, 'samples': 4382016, 'steps': 22822, 'loss/train': 1.7096298933029175} 11/07/2021 00:26:39 - INFO - __main__ - Step 22824: {'lr': 0.00047597349394908967, 'samples': 4382208, 'steps': 22823, 'loss/train': 1.0129907131195068} 11/07/2021 00:26:39 - INFO - __main__ - Step 22825: {'lr': 0.0004759712239034364, 'samples': 4382400, 'steps': 22824, 'loss/train': 2.0725107192993164} 11/07/2021 00:26:40 - INFO - __main__ - Step 22826: {'lr': 0.0004759689537559639, 'samples': 4382592, 'steps': 22825, 'loss/train': 0.44292402267456055} 11/07/2021 00:26:41 - INFO - __main__ - Step 22827: {'lr': 0.0004759666835066734, 'samples': 4382784, 'steps': 22826, 'loss/train': 1.713152527809143} 11/07/2021 00:26:41 - INFO - __main__ - Step 22828: {'lr': 0.00047596441315556575, 'samples': 4382976, 'steps': 22827, 'loss/train': 0.9748886227607727} 11/07/2021 00:26:41 - INFO - __main__ - Step 22829: {'lr': 0.00047596214270264204, 'samples': 4383168, 'steps': 22828, 'loss/train': 1.3580565452575684} 11/07/2021 00:26:42 - INFO - __main__ - Step 22830: {'lr': 0.00047595987214790324, 'samples': 4383360, 'steps': 22829, 'loss/train': 1.7348219156265259} 11/07/2021 00:26:43 - INFO - __main__ - Step 22831: {'lr': 0.0004759576014913505, 'samples': 4383552, 'steps': 22830, 'loss/train': 1.497948408126831} 11/07/2021 00:26:43 - INFO - __main__ - Step 22832: {'lr': 0.0004759553307329846, 'samples': 4383744, 'steps': 22831, 'loss/train': 1.397659182548523} 11/07/2021 00:26:43 - INFO - __main__ - Step 22833: {'lr': 0.0004759530598728068, 'samples': 4383936, 'steps': 22832, 'loss/train': 1.3267203569412231} 11/07/2021 00:26:44 - INFO - __main__ - Step 22834: {'lr': 0.000475950788910818, 'samples': 4384128, 'steps': 22833, 'loss/train': 1.3071024417877197} 11/07/2021 00:26:44 - INFO - __main__ - Step 22835: {'lr': 0.0004759485178470193, 'samples': 4384320, 'steps': 22834, 'loss/train': 1.684517502784729} 11/07/2021 00:26:45 - INFO - __main__ - Step 22836: {'lr': 0.0004759462466814117, 'samples': 4384512, 'steps': 22835, 'loss/train': 0.7335377335548401} 11/07/2021 00:26:45 - INFO - __main__ - Step 22837: {'lr': 0.0004759439754139962, 'samples': 4384704, 'steps': 22836, 'loss/train': 1.4387019872665405} 11/07/2021 00:26:46 - INFO - __main__ - Step 22838: {'lr': 0.0004759417040447738, 'samples': 4384896, 'steps': 22837, 'loss/train': 1.5750333070755005} 11/07/2021 00:26:46 - INFO - __main__ - Step 22839: {'lr': 0.00047593943257374563, 'samples': 4385088, 'steps': 22838, 'loss/train': 1.6899166107177734} 11/07/2021 00:26:47 - INFO - __main__ - Step 22840: {'lr': 0.00047593716100091253, 'samples': 4385280, 'steps': 22839, 'loss/train': 1.795365571975708} 11/07/2021 00:26:47 - INFO - __main__ - Step 22841: {'lr': 0.00047593488932627567, 'samples': 4385472, 'steps': 22840, 'loss/train': 1.5965057611465454} 11/07/2021 00:26:48 - INFO - __main__ - Step 22842: {'lr': 0.00047593261754983607, 'samples': 4385664, 'steps': 22841, 'loss/train': 1.7863192558288574} 11/07/2021 00:26:48 - INFO - __main__ - Step 22843: {'lr': 0.00047593034567159465, 'samples': 4385856, 'steps': 22842, 'loss/train': 2.4589531421661377} 11/07/2021 00:26:49 - INFO - __main__ - Step 22844: {'lr': 0.00047592807369155256, 'samples': 4386048, 'steps': 22843, 'loss/train': 1.5386347770690918} 11/07/2021 00:26:49 - INFO - __main__ - Step 22845: {'lr': 0.0004759258016097108, 'samples': 4386240, 'steps': 22844, 'loss/train': 1.98453950881958} 11/07/2021 00:26:49 - INFO - __main__ - Step 22846: {'lr': 0.0004759235294260703, 'samples': 4386432, 'steps': 22845, 'loss/train': 1.771543264389038} 11/07/2021 00:26:50 - INFO - __main__ - Step 22847: {'lr': 0.0004759212571406321, 'samples': 4386624, 'steps': 22846, 'loss/train': 1.4666595458984375} 11/07/2021 00:26:51 - INFO - __main__ - Step 22848: {'lr': 0.00047591898475339735, 'samples': 4386816, 'steps': 22847, 'loss/train': 1.6236677169799805} 11/07/2021 00:26:51 - INFO - __main__ - Step 22849: {'lr': 0.00047591671226436695, 'samples': 4387008, 'steps': 22848, 'loss/train': 1.4926680326461792} 11/07/2021 00:26:51 - INFO - __main__ - Step 22850: {'lr': 0.00047591443967354196, 'samples': 4387200, 'steps': 22849, 'loss/train': 1.6243668794631958} 11/07/2021 00:26:52 - INFO - __main__ - Step 22851: {'lr': 0.00047591216698092344, 'samples': 4387392, 'steps': 22850, 'loss/train': 1.8379485607147217} 11/07/2021 00:26:53 - INFO - __main__ - Step 22852: {'lr': 0.00047590989418651243, 'samples': 4387584, 'steps': 22851, 'loss/train': 1.4109525680541992} 11/07/2021 00:26:53 - INFO - __main__ - Step 22853: {'lr': 0.00047590762129030986, 'samples': 4387776, 'steps': 22852, 'loss/train': 0.5099748969078064} 11/07/2021 00:26:53 - INFO - __main__ - Step 22854: {'lr': 0.00047590534829231675, 'samples': 4387968, 'steps': 22853, 'loss/train': 1.7714526653289795} 11/07/2021 00:26:54 - INFO - __main__ - Step 22855: {'lr': 0.00047590307519253423, 'samples': 4388160, 'steps': 22854, 'loss/train': 1.9432487487792969} 11/07/2021 00:26:54 - INFO - __main__ - Step 22856: {'lr': 0.00047590080199096324, 'samples': 4388352, 'steps': 22855, 'loss/train': 0.7025504112243652} 11/07/2021 00:26:55 - INFO - __main__ - Step 22857: {'lr': 0.00047589852868760486, 'samples': 4388544, 'steps': 22856, 'loss/train': 1.6349374055862427} 11/07/2021 00:26:55 - INFO - __main__ - Step 22858: {'lr': 0.00047589625528246006, 'samples': 4388736, 'steps': 22857, 'loss/train': 1.535549283027649} 11/07/2021 00:26:56 - INFO - __main__ - Step 22859: {'lr': 0.0004758939817755299, 'samples': 4388928, 'steps': 22858, 'loss/train': 1.5929142236709595} 11/07/2021 00:26:56 - INFO - __main__ - Step 22860: {'lr': 0.0004758917081668155, 'samples': 4389120, 'steps': 22859, 'loss/train': 1.5244239568710327} 11/07/2021 00:26:56 - INFO - __main__ - Step 22861: {'lr': 0.00047588943445631767, 'samples': 4389312, 'steps': 22860, 'loss/train': 1.5270912647247314} 11/07/2021 00:26:58 - INFO - __main__ - Step 22862: {'lr': 0.0004758871606440376, 'samples': 4389504, 'steps': 22861, 'loss/train': 1.3322381973266602} 11/07/2021 00:26:58 - INFO - __main__ - Step 22863: {'lr': 0.0004758848867299762, 'samples': 4389696, 'steps': 22862, 'loss/train': 1.7966527938842773} 11/07/2021 00:26:58 - INFO - __main__ - Step 22864: {'lr': 0.0004758826127141346, 'samples': 4389888, 'steps': 22863, 'loss/train': 1.5517232418060303} 11/07/2021 00:26:59 - INFO - __main__ - Step 22865: {'lr': 0.00047588033859651376, 'samples': 4390080, 'steps': 22864, 'loss/train': 1.1566221714019775} 11/07/2021 00:26:59 - INFO - __main__ - Step 22866: {'lr': 0.00047587806437711475, 'samples': 4390272, 'steps': 22865, 'loss/train': 1.6851732730865479} 11/07/2021 00:27:00 - INFO - __main__ - Step 22867: {'lr': 0.0004758757900559385, 'samples': 4390464, 'steps': 22866, 'loss/train': 0.689471960067749} 11/07/2021 00:27:00 - INFO - __main__ - Step 22868: {'lr': 0.0004758735156329862, 'samples': 4390656, 'steps': 22867, 'loss/train': 1.991492748260498} 11/07/2021 00:27:01 - INFO - __main__ - Step 22869: {'lr': 0.00047587124110825874, 'samples': 4390848, 'steps': 22868, 'loss/train': 1.6657304763793945} 11/07/2021 00:27:01 - INFO - __main__ - Step 22870: {'lr': 0.00047586896648175715, 'samples': 4391040, 'steps': 22869, 'loss/train': 1.4751023054122925} 11/07/2021 00:27:01 - INFO - __main__ - Step 22871: {'lr': 0.00047586669175348254, 'samples': 4391232, 'steps': 22870, 'loss/train': 1.5636752843856812} 11/07/2021 00:27:02 - INFO - __main__ - Step 22872: {'lr': 0.0004758644169234359, 'samples': 4391424, 'steps': 22871, 'loss/train': 1.8853099346160889} 11/07/2021 00:27:03 - INFO - __main__ - Step 22873: {'lr': 0.00047586214199161814, 'samples': 4391616, 'steps': 22872, 'loss/train': 1.800837755203247} 11/07/2021 00:27:03 - INFO - __main__ - Step 22874: {'lr': 0.00047585986695803046, 'samples': 4391808, 'steps': 22873, 'loss/train': 1.3251843452453613} 11/07/2021 00:27:04 - INFO - __main__ - Step 22875: {'lr': 0.0004758575918226738, 'samples': 4392000, 'steps': 22874, 'loss/train': 2.003358840942383} 11/07/2021 00:27:04 - INFO - __main__ - Step 22876: {'lr': 0.0004758553165855492, 'samples': 4392192, 'steps': 22875, 'loss/train': 1.644388198852539} 11/07/2021 00:27:04 - INFO - __main__ - Step 22877: {'lr': 0.00047585304124665766, 'samples': 4392384, 'steps': 22876, 'loss/train': 1.8481217622756958} 11/07/2021 00:27:05 - INFO - __main__ - Step 22878: {'lr': 0.0004758507658060003, 'samples': 4392576, 'steps': 22877, 'loss/train': 1.3295446634292603} 11/07/2021 00:27:06 - INFO - __main__ - Step 22879: {'lr': 0.00047584849026357796, 'samples': 4392768, 'steps': 22878, 'loss/train': 1.9172389507293701} 11/07/2021 00:27:06 - INFO - __main__ - Step 22880: {'lr': 0.0004758462146193918, 'samples': 4392960, 'steps': 22879, 'loss/train': 1.363627314567566} 11/07/2021 00:27:06 - INFO - __main__ - Step 22881: {'lr': 0.00047584393887344285, 'samples': 4393152, 'steps': 22880, 'loss/train': 1.7900865077972412} 11/07/2021 00:27:07 - INFO - __main__ - Step 22882: {'lr': 0.00047584166302573204, 'samples': 4393344, 'steps': 22881, 'loss/train': 1.6964235305786133} 11/07/2021 00:27:08 - INFO - __main__ - Step 22883: {'lr': 0.0004758393870762606, 'samples': 4393536, 'steps': 22882, 'loss/train': 1.4693646430969238} 11/07/2021 00:27:08 - INFO - __main__ - Step 22884: {'lr': 0.00047583711102502934, 'samples': 4393728, 'steps': 22883, 'loss/train': 1.5916904211044312} 11/07/2021 00:27:08 - INFO - __main__ - Step 22885: {'lr': 0.0004758348348720393, 'samples': 4393920, 'steps': 22884, 'loss/train': 1.60740327835083} 11/07/2021 00:27:09 - INFO - __main__ - Step 22886: {'lr': 0.00047583255861729167, 'samples': 4394112, 'steps': 22885, 'loss/train': 1.265206217765808} 11/07/2021 00:27:09 - INFO - __main__ - Step 22887: {'lr': 0.00047583028226078734, 'samples': 4394304, 'steps': 22886, 'loss/train': 1.0524832010269165} 11/07/2021 00:27:10 - INFO - __main__ - Step 22888: {'lr': 0.0004758280058025274, 'samples': 4394496, 'steps': 22887, 'loss/train': 0.9584725499153137} 11/07/2021 00:27:10 - INFO - __main__ - Step 22889: {'lr': 0.00047582572924251276, 'samples': 4394688, 'steps': 22888, 'loss/train': 1.4555813074111938} 11/07/2021 00:27:11 - INFO - __main__ - Step 22890: {'lr': 0.00047582345258074453, 'samples': 4394880, 'steps': 22889, 'loss/train': 1.5769598484039307} 11/07/2021 00:27:11 - INFO - __main__ - Step 22891: {'lr': 0.0004758211758172238, 'samples': 4395072, 'steps': 22890, 'loss/train': 1.4620798826217651} 11/07/2021 00:27:11 - INFO - __main__ - Step 22892: {'lr': 0.00047581889895195154, 'samples': 4395264, 'steps': 22891, 'loss/train': 1.7565606832504272} 11/07/2021 00:27:13 - INFO - __main__ - Step 22893: {'lr': 0.00047581662198492873, 'samples': 4395456, 'steps': 22892, 'loss/train': 1.5614871978759766} 11/07/2021 00:27:13 - INFO - __main__ - Step 22894: {'lr': 0.0004758143449161565, 'samples': 4395648, 'steps': 22893, 'loss/train': 0.7110083103179932} 11/07/2021 00:27:13 - INFO - __main__ - Step 22895: {'lr': 0.00047581206774563575, 'samples': 4395840, 'steps': 22894, 'loss/train': 1.7186449766159058} 11/07/2021 00:27:14 - INFO - __main__ - Step 22896: {'lr': 0.0004758097904733676, 'samples': 4396032, 'steps': 22895, 'loss/train': 1.64580500125885} 11/07/2021 00:27:14 - INFO - __main__ - Step 22897: {'lr': 0.000475807513099353, 'samples': 4396224, 'steps': 22896, 'loss/train': 1.555309534072876} 11/07/2021 00:27:14 - INFO - __main__ - Step 22898: {'lr': 0.000475805235623593, 'samples': 4396416, 'steps': 22897, 'loss/train': 1.5211753845214844} 11/07/2021 00:27:15 - INFO - __main__ - Step 22899: {'lr': 0.0004758029580460887, 'samples': 4396608, 'steps': 22898, 'loss/train': 1.6425952911376953} 11/07/2021 00:27:16 - INFO - __main__ - Step 22900: {'lr': 0.0004758006803668411, 'samples': 4396800, 'steps': 22899, 'loss/train': 1.7011586427688599} 11/07/2021 00:27:16 - INFO - __main__ - Step 22901: {'lr': 0.0004757984025858511, 'samples': 4396992, 'steps': 22900, 'loss/train': 1.5620101690292358} 11/07/2021 00:27:16 - INFO - __main__ - Step 22902: {'lr': 0.0004757961247031199, 'samples': 4397184, 'steps': 22901, 'loss/train': 1.7332442998886108} 11/07/2021 00:27:17 - INFO - __main__ - Step 22903: {'lr': 0.00047579384671864845, 'samples': 4397376, 'steps': 22902, 'loss/train': 1.6374011039733887} 11/07/2021 00:27:18 - INFO - __main__ - Step 22904: {'lr': 0.0004757915686324377, 'samples': 4397568, 'steps': 22903, 'loss/train': 1.6759214401245117} 11/07/2021 00:27:18 - INFO - __main__ - Step 22905: {'lr': 0.00047578929044448883, 'samples': 4397760, 'steps': 22904, 'loss/train': 1.2218265533447266} 11/07/2021 00:27:18 - INFO - __main__ - Step 22906: {'lr': 0.0004757870121548028, 'samples': 4397952, 'steps': 22905, 'loss/train': 1.6508798599243164} 11/07/2021 00:27:19 - INFO - __main__ - Step 22907: {'lr': 0.0004757847337633806, 'samples': 4398144, 'steps': 22906, 'loss/train': 1.720141053199768} 11/07/2021 00:27:19 - INFO - __main__ - Step 22908: {'lr': 0.0004757824552702232, 'samples': 4398336, 'steps': 22907, 'loss/train': 0.8543339371681213} 11/07/2021 00:27:20 - INFO - __main__ - Step 22909: {'lr': 0.0004757801766753318, 'samples': 4398528, 'steps': 22908, 'loss/train': 1.0013171434402466} 11/07/2021 00:27:20 - INFO - __main__ - Step 22910: {'lr': 0.00047577789797870743, 'samples': 4398720, 'steps': 22909, 'loss/train': 1.8550013303756714} 11/07/2021 00:27:21 - INFO - __main__ - Step 22911: {'lr': 0.0004757756191803508, 'samples': 4398912, 'steps': 22910, 'loss/train': 0.7047666907310486} 11/07/2021 00:27:21 - INFO - __main__ - Step 22912: {'lr': 0.0004757733402802633, 'samples': 4399104, 'steps': 22911, 'loss/train': 1.7484979629516602} 11/07/2021 00:27:22 - INFO - __main__ - Step 22913: {'lr': 0.0004757710612784458, 'samples': 4399296, 'steps': 22912, 'loss/train': 1.82514488697052} 11/07/2021 00:27:23 - INFO - __main__ - Step 22914: {'lr': 0.0004757687821748994, 'samples': 4399488, 'steps': 22913, 'loss/train': 1.594335675239563} 11/07/2021 00:27:23 - INFO - __main__ - Step 22915: {'lr': 0.00047576650296962496, 'samples': 4399680, 'steps': 22914, 'loss/train': 1.4763436317443848} 11/07/2021 00:27:23 - INFO - __main__ - Step 22916: {'lr': 0.0004757642236626237, 'samples': 4399872, 'steps': 22915, 'loss/train': 1.4962226152420044} 11/07/2021 00:27:24 - INFO - __main__ - Step 22917: {'lr': 0.00047576194425389654, 'samples': 4400064, 'steps': 22916, 'loss/train': 1.6657871007919312} 11/07/2021 00:27:24 - INFO - __main__ - Step 22918: {'lr': 0.00047575966474344445, 'samples': 4400256, 'steps': 22917, 'loss/train': 1.158781886100769} 11/07/2021 00:27:25 - INFO - __main__ - Step 22919: {'lr': 0.00047575738513126867, 'samples': 4400448, 'steps': 22918, 'loss/train': 1.493218183517456} 11/07/2021 00:27:25 - INFO - __main__ - Step 22920: {'lr': 0.00047575510541737, 'samples': 4400640, 'steps': 22919, 'loss/train': 1.7854337692260742} 11/07/2021 00:27:26 - INFO - __main__ - Step 22921: {'lr': 0.0004757528256017496, 'samples': 4400832, 'steps': 22920, 'loss/train': 1.5265958309173584} 11/07/2021 00:27:26 - INFO - __main__ - Step 22922: {'lr': 0.00047575054568440846, 'samples': 4401024, 'steps': 22921, 'loss/train': 1.4380706548690796} 11/07/2021 00:27:26 - INFO - __main__ - Step 22923: {'lr': 0.00047574826566534764, 'samples': 4401216, 'steps': 22922, 'loss/train': 1.1648343801498413} 11/07/2021 00:27:27 - INFO - __main__ - Step 22924: {'lr': 0.0004757459855445681, 'samples': 4401408, 'steps': 22923, 'loss/train': 1.4443479776382446} 11/07/2021 00:27:28 - INFO - __main__ - Step 22925: {'lr': 0.0004757437053220709, 'samples': 4401600, 'steps': 22924, 'loss/train': 1.3270888328552246} 11/07/2021 00:27:28 - INFO - __main__ - Step 22926: {'lr': 0.0004757414249978571, 'samples': 4401792, 'steps': 22925, 'loss/train': 1.6503046751022339} 11/07/2021 00:27:29 - INFO - __main__ - Step 22927: {'lr': 0.0004757391445719277, 'samples': 4401984, 'steps': 22926, 'loss/train': 1.4672082662582397} 11/07/2021 00:27:29 - INFO - __main__ - Step 22928: {'lr': 0.00047573686404428365, 'samples': 4402176, 'steps': 22927, 'loss/train': 1.015838384628296} 11/07/2021 00:27:29 - INFO - __main__ - Step 22929: {'lr': 0.0004757345834149261, 'samples': 4402368, 'steps': 22928, 'loss/train': 1.3925256729125977} 11/07/2021 00:27:30 - INFO - __main__ - Step 22930: {'lr': 0.00047573230268385604, 'samples': 4402560, 'steps': 22929, 'loss/train': 1.4334114789962769} 11/07/2021 00:27:31 - INFO - __main__ - Step 22931: {'lr': 0.0004757300218510745, 'samples': 4402752, 'steps': 22930, 'loss/train': 1.7703146934509277} 11/07/2021 00:27:31 - INFO - __main__ - Step 22932: {'lr': 0.00047572774091658243, 'samples': 4402944, 'steps': 22931, 'loss/train': 1.6377800703048706} 11/07/2021 00:27:31 - INFO - __main__ - Step 22933: {'lr': 0.000475725459880381, 'samples': 4403136, 'steps': 22932, 'loss/train': 1.176165223121643} 11/07/2021 00:27:32 - INFO - __main__ - Step 22934: {'lr': 0.00047572317874247107, 'samples': 4403328, 'steps': 22933, 'loss/train': 1.5558907985687256} 11/07/2021 00:27:33 - INFO - __main__ - Step 22935: {'lr': 0.00047572089750285383, 'samples': 4403520, 'steps': 22934, 'loss/train': 2.0823795795440674} 11/07/2021 00:27:33 - INFO - __main__ - Step 22936: {'lr': 0.00047571861616153025, 'samples': 4403712, 'steps': 22935, 'loss/train': 0.2161761075258255} 11/07/2021 00:27:33 - INFO - __main__ - Step 22937: {'lr': 0.0004757163347185013, 'samples': 4403904, 'steps': 22936, 'loss/train': 1.6220515966415405} 11/07/2021 00:27:34 - INFO - __main__ - Step 22938: {'lr': 0.00047571405317376803, 'samples': 4404096, 'steps': 22937, 'loss/train': 1.4291313886642456} 11/07/2021 00:27:34 - INFO - __main__ - Step 22939: {'lr': 0.0004757117715273316, 'samples': 4404288, 'steps': 22938, 'loss/train': 1.6622034311294556} 11/07/2021 00:27:35 - INFO - __main__ - Step 22940: {'lr': 0.00047570948977919284, 'samples': 4404480, 'steps': 22939, 'loss/train': 1.4840724468231201} 11/07/2021 00:27:35 - INFO - __main__ - Step 22941: {'lr': 0.00047570720792935284, 'samples': 4404672, 'steps': 22940, 'loss/train': 1.6443581581115723} 11/07/2021 00:27:36 - INFO - __main__ - Step 22942: {'lr': 0.00047570492597781274, 'samples': 4404864, 'steps': 22941, 'loss/train': 1.1462692022323608} 11/07/2021 00:27:36 - INFO - __main__ - Step 22943: {'lr': 0.0004757026439245735, 'samples': 4405056, 'steps': 22942, 'loss/train': 1.5172832012176514} 11/07/2021 00:27:37 - INFO - __main__ - Step 22944: {'lr': 0.0004757003617696361, 'samples': 4405248, 'steps': 22943, 'loss/train': 1.5041968822479248} 11/07/2021 00:27:37 - INFO - __main__ - Step 22945: {'lr': 0.0004756980795130015, 'samples': 4405440, 'steps': 22944, 'loss/train': 1.511906623840332} 11/07/2021 00:27:38 - INFO - __main__ - Step 22946: {'lr': 0.00047569579715467093, 'samples': 4405632, 'steps': 22945, 'loss/train': 2.17930269241333} 11/07/2021 00:27:38 - INFO - __main__ - Step 22947: {'lr': 0.00047569351469464526, 'samples': 4405824, 'steps': 22946, 'loss/train': 1.2511581182479858} 11/07/2021 00:27:39 - INFO - __main__ - Step 22948: {'lr': 0.0004756912321329256, 'samples': 4406016, 'steps': 22947, 'loss/train': 1.4924403429031372} 11/07/2021 00:27:39 - INFO - __main__ - Step 22949: {'lr': 0.000475688949469513, 'samples': 4406208, 'steps': 22948, 'loss/train': 1.0683891773223877} 11/07/2021 00:27:41 - INFO - __main__ - Step 22950: {'lr': 0.0004756866667044084, 'samples': 4406400, 'steps': 22949, 'loss/train': 1.3111467361450195} 11/07/2021 00:27:41 - INFO - __main__ - Step 22951: {'lr': 0.0004756843838376128, 'samples': 4406592, 'steps': 22950, 'loss/train': 1.0856906175613403} 11/07/2021 00:27:41 - INFO - __main__ - Step 22952: {'lr': 0.0004756821008691274, 'samples': 4406784, 'steps': 22951, 'loss/train': 1.7769157886505127} 11/07/2021 00:27:42 - INFO - __main__ - Step 22953: {'lr': 0.0004756798177989531, 'samples': 4406976, 'steps': 22952, 'loss/train': 1.7058240175247192} 11/07/2021 00:27:42 - INFO - __main__ - Step 22954: {'lr': 0.00047567753462709095, 'samples': 4407168, 'steps': 22953, 'loss/train': 1.7961945533752441} 11/07/2021 00:27:42 - INFO - __main__ - Step 22955: {'lr': 0.00047567525135354193, 'samples': 4407360, 'steps': 22954, 'loss/train': 1.5496257543563843} 11/07/2021 00:27:43 - INFO - __main__ - Step 22956: {'lr': 0.00047567296797830727, 'samples': 4407552, 'steps': 22955, 'loss/train': 1.9372279644012451} 11/07/2021 00:27:44 - INFO - __main__ - Step 22957: {'lr': 0.00047567068450138773, 'samples': 4407744, 'steps': 22956, 'loss/train': 1.5832316875457764} 11/07/2021 00:27:44 - INFO - __main__ - Step 22958: {'lr': 0.0004756684009227845, 'samples': 4407936, 'steps': 22957, 'loss/train': 1.7091788053512573} 11/07/2021 00:27:44 - INFO - __main__ - Step 22959: {'lr': 0.0004756661172424986, 'samples': 4408128, 'steps': 22958, 'loss/train': 1.709111213684082} 11/07/2021 00:27:45 - INFO - __main__ - Step 22960: {'lr': 0.000475663833460531, 'samples': 4408320, 'steps': 22959, 'loss/train': 1.6512173414230347} 11/07/2021 00:27:45 - INFO - __main__ - Step 22961: {'lr': 0.00047566154957688275, 'samples': 4408512, 'steps': 22960, 'loss/train': 1.5914360284805298} 11/07/2021 00:27:46 - INFO - __main__ - Step 22962: {'lr': 0.0004756592655915549, 'samples': 4408704, 'steps': 22961, 'loss/train': 1.5957335233688354} 11/07/2021 00:27:47 - INFO - __main__ - Step 22963: {'lr': 0.00047565698150454845, 'samples': 4408896, 'steps': 22962, 'loss/train': 1.1223114728927612} 11/07/2021 00:27:47 - INFO - __main__ - Step 22964: {'lr': 0.0004756546973158644, 'samples': 4409088, 'steps': 22963, 'loss/train': 4.726345062255859} 11/07/2021 00:27:47 - INFO - __main__ - Step 22965: {'lr': 0.00047565241302550395, 'samples': 4409280, 'steps': 22964, 'loss/train': 0.8015714883804321} 11/07/2021 00:27:48 - INFO - __main__ - Step 22966: {'lr': 0.0004756501286334679, 'samples': 4409472, 'steps': 22965, 'loss/train': 1.7294597625732422} 11/07/2021 00:27:48 - INFO - __main__ - Step 22967: {'lr': 0.0004756478441397575, 'samples': 4409664, 'steps': 22966, 'loss/train': 1.8136224746704102} 11/07/2021 00:27:49 - INFO - __main__ - Step 22968: {'lr': 0.0004756455595443735, 'samples': 4409856, 'steps': 22967, 'loss/train': 1.3230981826782227} 11/07/2021 00:27:49 - INFO - __main__ - Step 22969: {'lr': 0.00047564327484731725, 'samples': 4410048, 'steps': 22968, 'loss/train': 2.6526639461517334} 11/07/2021 00:27:50 - INFO - __main__ - Step 22970: {'lr': 0.0004756409900485895, 'samples': 4410240, 'steps': 22969, 'loss/train': 1.852148413658142} 11/07/2021 00:27:50 - INFO - __main__ - Step 22971: {'lr': 0.00047563870514819154, 'samples': 4410432, 'steps': 22970, 'loss/train': 0.8799536824226379} 11/07/2021 00:27:50 - INFO - __main__ - Step 22972: {'lr': 0.0004756364201461241, 'samples': 4410624, 'steps': 22971, 'loss/train': 1.9788076877593994} 11/07/2021 00:27:51 - INFO - __main__ - Step 22973: {'lr': 0.00047563413504238847, 'samples': 4410816, 'steps': 22972, 'loss/train': 1.9070225954055786} 11/07/2021 00:27:52 - INFO - __main__ - Step 22974: {'lr': 0.0004756318498369855, 'samples': 4411008, 'steps': 22973, 'loss/train': 1.0210386514663696} 11/07/2021 00:27:52 - INFO - __main__ - Step 22975: {'lr': 0.0004756295645299164, 'samples': 4411200, 'steps': 22974, 'loss/train': 1.6513278484344482} 11/07/2021 00:27:52 - INFO - __main__ - Step 22976: {'lr': 0.00047562727912118206, 'samples': 4411392, 'steps': 22975, 'loss/train': 1.6678000688552856} 11/07/2021 00:27:53 - INFO - __main__ - Step 22977: {'lr': 0.00047562499361078356, 'samples': 4411584, 'steps': 22976, 'loss/train': 1.668380618095398} 11/07/2021 00:27:54 - INFO - __main__ - Step 22978: {'lr': 0.00047562270799872186, 'samples': 4411776, 'steps': 22977, 'loss/train': 2.0028698444366455} 11/07/2021 00:27:54 - INFO - __main__ - Step 22979: {'lr': 0.00047562042228499815, 'samples': 4411968, 'steps': 22978, 'loss/train': 1.7901252508163452} 11/07/2021 00:27:55 - INFO - __main__ - Step 22980: {'lr': 0.00047561813646961325, 'samples': 4412160, 'steps': 22979, 'loss/train': 1.4079407453536987} 11/07/2021 00:27:55 - INFO - __main__ - Step 22981: {'lr': 0.0004756158505525684, 'samples': 4412352, 'steps': 22980, 'loss/train': 1.1839544773101807} 11/07/2021 00:27:55 - INFO - __main__ - Step 22982: {'lr': 0.0004756135645338644, 'samples': 4412544, 'steps': 22981, 'loss/train': 2.2274835109710693} 11/07/2021 00:27:56 - INFO - __main__ - Step 22983: {'lr': 0.00047561127841350256, 'samples': 4412736, 'steps': 22982, 'loss/train': 1.5764931440353394} 11/07/2021 00:27:57 - INFO - __main__ - Step 22984: {'lr': 0.0004756089921914837, 'samples': 4412928, 'steps': 22983, 'loss/train': 1.4414703845977783} 11/07/2021 00:27:57 - INFO - __main__ - Step 22985: {'lr': 0.00047560670586780886, 'samples': 4413120, 'steps': 22984, 'loss/train': 1.398228645324707} 11/07/2021 00:27:57 - INFO - __main__ - Step 22986: {'lr': 0.0004756044194424792, 'samples': 4413312, 'steps': 22985, 'loss/train': 1.8721942901611328} 11/07/2021 00:27:58 - INFO - __main__ - Step 22987: {'lr': 0.0004756021329154956, 'samples': 4413504, 'steps': 22986, 'loss/train': 1.572609305381775} 11/07/2021 00:27:59 - INFO - __main__ - Step 22988: {'lr': 0.0004755998462868592, 'samples': 4413696, 'steps': 22987, 'loss/train': 1.766256332397461} 11/07/2021 00:27:59 - INFO - __main__ - Step 22989: {'lr': 0.00047559755955657097, 'samples': 4413888, 'steps': 22988, 'loss/train': 1.8802845478057861} 11/07/2021 00:27:59 - INFO - __main__ - Step 22990: {'lr': 0.000475595272724632, 'samples': 4414080, 'steps': 22989, 'loss/train': 1.8309953212738037} 11/07/2021 00:28:00 - INFO - __main__ - Step 22991: {'lr': 0.00047559298579104325, 'samples': 4414272, 'steps': 22990, 'loss/train': 1.2643400430679321} 11/07/2021 00:28:00 - INFO - __main__ - Step 22992: {'lr': 0.00047559069875580573, 'samples': 4414464, 'steps': 22991, 'loss/train': 1.68605637550354} 11/07/2021 00:28:01 - INFO - __main__ - Step 22993: {'lr': 0.00047558841161892063, 'samples': 4414656, 'steps': 22992, 'loss/train': 1.7791119813919067} 11/07/2021 00:28:01 - INFO - __main__ - Step 22994: {'lr': 0.00047558612438038887, 'samples': 4414848, 'steps': 22993, 'loss/train': 1.7085627317428589} 11/07/2021 00:28:02 - INFO - __main__ - Step 22995: {'lr': 0.00047558383704021136, 'samples': 4415040, 'steps': 22994, 'loss/train': 0.6871132850646973} 11/07/2021 00:28:02 - INFO - __main__ - Step 22996: {'lr': 0.00047558154959838935, 'samples': 4415232, 'steps': 22995, 'loss/train': 2.3702433109283447} 11/07/2021 00:28:03 - INFO - __main__ - Step 22997: {'lr': 0.0004755792620549237, 'samples': 4415424, 'steps': 22996, 'loss/train': 1.5503305196762085} 11/07/2021 00:28:03 - INFO - __main__ - Step 22998: {'lr': 0.0004755769744098156, 'samples': 4415616, 'steps': 22997, 'loss/train': 1.6456409692764282} 11/07/2021 00:28:03 - INFO - __main__ - Step 22999: {'lr': 0.00047557468666306596, 'samples': 4415808, 'steps': 22998, 'loss/train': 1.2864354848861694} 11/07/2021 00:28:04 - INFO - __main__ - Step 23000: {'lr': 0.00047557239881467584, 'samples': 4416000, 'steps': 22999, 'loss/train': 1.4137681722640991} 11/07/2021 00:28:05 - INFO - __main__ - Step 23001: {'lr': 0.0004755701108646463, 'samples': 4416192, 'steps': 23000, 'loss/train': 1.6158024072647095} 11/07/2021 00:28:05 - INFO - __main__ - Step 23002: {'lr': 0.0004755678228129784, 'samples': 4416384, 'steps': 23001, 'loss/train': 0.7091558575630188} 11/07/2021 00:28:05 - INFO - __main__ - Step 23003: {'lr': 0.000475565534659673, 'samples': 4416576, 'steps': 23002, 'loss/train': 1.7442548274993896} 11/07/2021 00:28:06 - INFO - __main__ - Step 23004: {'lr': 0.00047556324640473134, 'samples': 4416768, 'steps': 23003, 'loss/train': 1.8332017660140991} 11/07/2021 00:28:07 - INFO - __main__ - Step 23005: {'lr': 0.0004755609580481543, 'samples': 4416960, 'steps': 23004, 'loss/train': 1.8451040983200073} 11/07/2021 00:28:07 - INFO - __main__ - Step 23006: {'lr': 0.00047555866958994296, 'samples': 4417152, 'steps': 23005, 'loss/train': 1.5475908517837524} 11/07/2021 00:28:07 - INFO - __main__ - Step 23007: {'lr': 0.00047555638103009845, 'samples': 4417344, 'steps': 23006, 'loss/train': 1.035109519958496} 11/07/2021 00:28:08 - INFO - __main__ - Step 23008: {'lr': 0.0004755540923686217, 'samples': 4417536, 'steps': 23007, 'loss/train': 1.5133399963378906} 11/07/2021 00:28:08 - INFO - __main__ - Step 23009: {'lr': 0.0004755518036055137, 'samples': 4417728, 'steps': 23008, 'loss/train': 1.625259280204773} 11/07/2021 00:28:09 - INFO - __main__ - Step 23010: {'lr': 0.0004755495147407756, 'samples': 4417920, 'steps': 23009, 'loss/train': 1.5169017314910889} 11/07/2021 00:28:09 - INFO - __main__ - Step 23011: {'lr': 0.00047554722577440833, 'samples': 4418112, 'steps': 23010, 'loss/train': 1.5681982040405273} 11/07/2021 00:28:10 - INFO - __main__ - Step 23012: {'lr': 0.00047554493670641296, 'samples': 4418304, 'steps': 23011, 'loss/train': 0.36354389786720276} 11/07/2021 00:28:10 - INFO - __main__ - Step 23013: {'lr': 0.0004755426475367905, 'samples': 4418496, 'steps': 23012, 'loss/train': 1.3803929090499878} 11/07/2021 00:28:11 - INFO - __main__ - Step 23014: {'lr': 0.00047554035826554206, 'samples': 4418688, 'steps': 23013, 'loss/train': 1.3115521669387817} 11/07/2021 00:28:12 - INFO - __main__ - Step 23015: {'lr': 0.0004755380688926686, 'samples': 4418880, 'steps': 23014, 'loss/train': 1.7398924827575684} 11/07/2021 00:28:12 - INFO - __main__ - Step 23016: {'lr': 0.00047553577941817114, 'samples': 4419072, 'steps': 23015, 'loss/train': 1.7361851930618286} 11/07/2021 00:28:12 - INFO - __main__ - Step 23017: {'lr': 0.0004755334898420507, 'samples': 4419264, 'steps': 23016, 'loss/train': 1.8983994722366333} 11/07/2021 00:28:13 - INFO - __main__ - Step 23018: {'lr': 0.00047553120016430837, 'samples': 4419456, 'steps': 23017, 'loss/train': 1.3933513164520264} 11/07/2021 00:28:13 - INFO - __main__ - Step 23019: {'lr': 0.0004755289103849453, 'samples': 4419648, 'steps': 23018, 'loss/train': 1.5074931383132935} 11/07/2021 00:28:14 - INFO - __main__ - Step 23020: {'lr': 0.0004755266205039622, 'samples': 4419840, 'steps': 23019, 'loss/train': 1.4028873443603516} 11/07/2021 00:28:14 - INFO - __main__ - Step 23021: {'lr': 0.00047552433052136034, 'samples': 4420032, 'steps': 23020, 'loss/train': 1.6080424785614014} 11/07/2021 00:28:15 - INFO - __main__ - Step 23022: {'lr': 0.00047552204043714076, 'samples': 4420224, 'steps': 23021, 'loss/train': 1.7918477058410645} 11/07/2021 00:28:15 - INFO - __main__ - Step 23023: {'lr': 0.0004755197502513043, 'samples': 4420416, 'steps': 23022, 'loss/train': 1.549143671989441} 11/07/2021 00:28:15 - INFO - __main__ - Step 23024: {'lr': 0.00047551745996385233, 'samples': 4420608, 'steps': 23023, 'loss/train': 1.5438201427459717} 11/07/2021 00:28:16 - INFO - __main__ - Step 23025: {'lr': 0.00047551516957478545, 'samples': 4420800, 'steps': 23024, 'loss/train': 1.6900150775909424} 11/07/2021 00:28:17 - INFO - __main__ - Step 23026: {'lr': 0.0004755128790841051, 'samples': 4420992, 'steps': 23025, 'loss/train': 1.6069121360778809} 11/07/2021 00:28:17 - INFO - __main__ - Step 23027: {'lr': 0.000475510588491812, 'samples': 4421184, 'steps': 23026, 'loss/train': 2.0397226810455322} 11/07/2021 00:28:18 - INFO - __main__ - Step 23028: {'lr': 0.00047550829779790735, 'samples': 4421376, 'steps': 23027, 'loss/train': 1.342384934425354} 11/07/2021 00:28:18 - INFO - __main__ - Step 23029: {'lr': 0.0004755060070023921, 'samples': 4421568, 'steps': 23028, 'loss/train': 0.19474640488624573} 11/07/2021 00:28:18 - INFO - __main__ - Step 23030: {'lr': 0.0004755037161052674, 'samples': 4421760, 'steps': 23029, 'loss/train': 1.7647830247879028} 11/07/2021 00:28:19 - INFO - __main__ - Step 23031: {'lr': 0.00047550142510653415, 'samples': 4421952, 'steps': 23030, 'loss/train': 2.4951038360595703} 11/07/2021 00:28:20 - INFO - __main__ - Step 23032: {'lr': 0.0004754991340061935, 'samples': 4422144, 'steps': 23031, 'loss/train': 0.8009241223335266} 11/07/2021 00:28:20 - INFO - __main__ - Step 23033: {'lr': 0.0004754968428042463, 'samples': 4422336, 'steps': 23032, 'loss/train': 1.189576268196106} 11/07/2021 00:28:20 - INFO - __main__ - Step 23034: {'lr': 0.0004754945515006938, 'samples': 4422528, 'steps': 23033, 'loss/train': 1.348042368888855} 11/07/2021 00:28:21 - INFO - __main__ - Step 23035: {'lr': 0.0004754922600955369, 'samples': 4422720, 'steps': 23034, 'loss/train': 2.0349338054656982} 11/07/2021 00:28:22 - INFO - __main__ - Step 23036: {'lr': 0.0004754899685887767, 'samples': 4422912, 'steps': 23035, 'loss/train': 1.6004425287246704} 11/07/2021 00:28:22 - INFO - __main__ - Step 23037: {'lr': 0.0004754876769804142, 'samples': 4423104, 'steps': 23036, 'loss/train': 1.6580665111541748} 11/07/2021 00:28:23 - INFO - __main__ - Step 23038: {'lr': 0.00047548538527045035, 'samples': 4423296, 'steps': 23037, 'loss/train': 1.2863857746124268} 11/07/2021 00:28:23 - INFO - __main__ - Step 23039: {'lr': 0.00047548309345888637, 'samples': 4423488, 'steps': 23038, 'loss/train': 0.8549371957778931} 11/07/2021 00:28:23 - INFO - __main__ - Step 23040: {'lr': 0.00047548080154572315, 'samples': 4423680, 'steps': 23039, 'loss/train': 1.5918457508087158} 11/07/2021 00:28:24 - INFO - __main__ - Step 23041: {'lr': 0.00047547850953096174, 'samples': 4423872, 'steps': 23040, 'loss/train': 1.4698606729507446} 11/07/2021 00:28:24 - INFO - __main__ - Step 23042: {'lr': 0.0004754762174146032, 'samples': 4424064, 'steps': 23041, 'loss/train': 1.841143012046814} 11/07/2021 00:28:25 - INFO - __main__ - Step 23043: {'lr': 0.00047547392519664853, 'samples': 4424256, 'steps': 23042, 'loss/train': 1.5102839469909668} 11/07/2021 00:28:26 - INFO - __main__ - Step 23044: {'lr': 0.0004754716328770988, 'samples': 4424448, 'steps': 23043, 'loss/train': 2.0320229530334473} 11/07/2021 00:28:26 - INFO - __main__ - Step 23045: {'lr': 0.00047546934045595516, 'samples': 4424640, 'steps': 23044, 'loss/train': 1.711830973625183} 11/07/2021 00:28:26 - INFO - __main__ - Step 23046: {'lr': 0.00047546704793321835, 'samples': 4424832, 'steps': 23045, 'loss/train': 1.5525362491607666} 11/07/2021 00:28:27 - INFO - __main__ - Step 23047: {'lr': 0.0004754647553088896, 'samples': 4425024, 'steps': 23046, 'loss/train': 1.7210502624511719} 11/07/2021 00:28:28 - INFO - __main__ - Step 23048: {'lr': 0.00047546246258297, 'samples': 4425216, 'steps': 23047, 'loss/train': 2.0603466033935547} 11/07/2021 00:28:28 - INFO - __main__ - Step 23049: {'lr': 0.00047546016975546037, 'samples': 4425408, 'steps': 23048, 'loss/train': 1.2670799493789673} 11/07/2021 00:28:28 - INFO - __main__ - Step 23050: {'lr': 0.00047545787682636194, 'samples': 4425600, 'steps': 23049, 'loss/train': 1.3632519245147705} 11/07/2021 00:28:29 - INFO - __main__ - Step 23051: {'lr': 0.00047545558379567565, 'samples': 4425792, 'steps': 23050, 'loss/train': 1.7283782958984375} 11/07/2021 00:28:29 - INFO - __main__ - Step 23052: {'lr': 0.00047545329066340256, 'samples': 4425984, 'steps': 23051, 'loss/train': 1.7790948152542114} 11/07/2021 00:28:30 - INFO - __main__ - Step 23053: {'lr': 0.00047545099742954367, 'samples': 4426176, 'steps': 23052, 'loss/train': 1.2009434700012207} 11/07/2021 00:28:31 - INFO - __main__ - Step 23054: {'lr': 0.0004754487040941001, 'samples': 4426368, 'steps': 23053, 'loss/train': 1.2071412801742554} 11/07/2021 00:28:31 - INFO - __main__ - Step 23055: {'lr': 0.0004754464106570727, 'samples': 4426560, 'steps': 23054, 'loss/train': 1.412814974784851} 11/07/2021 00:28:31 - INFO - __main__ - Step 23056: {'lr': 0.00047544411711846277, 'samples': 4426752, 'steps': 23055, 'loss/train': 1.5205636024475098} 11/07/2021 00:28:32 - INFO - __main__ - Step 23057: {'lr': 0.00047544182347827114, 'samples': 4426944, 'steps': 23056, 'loss/train': 1.6638517379760742} 11/07/2021 00:28:33 - INFO - __main__ - Step 23058: {'lr': 0.0004754395297364989, 'samples': 4427136, 'steps': 23057, 'loss/train': 1.645391821861267} 11/07/2021 00:28:33 - INFO - __main__ - Step 23059: {'lr': 0.0004754372358931471, 'samples': 4427328, 'steps': 23058, 'loss/train': 1.6849995851516724} 11/07/2021 00:28:33 - INFO - __main__ - Step 23060: {'lr': 0.00047543494194821675, 'samples': 4427520, 'steps': 23059, 'loss/train': 1.4331011772155762} 11/07/2021 00:28:34 - INFO - __main__ - Step 23061: {'lr': 0.00047543264790170887, 'samples': 4427712, 'steps': 23060, 'loss/train': 2.1720664501190186} 11/07/2021 00:28:34 - INFO - __main__ - Step 23062: {'lr': 0.00047543035375362453, 'samples': 4427904, 'steps': 23061, 'loss/train': 1.5246973037719727} 11/07/2021 00:28:34 - INFO - __main__ - Step 23063: {'lr': 0.00047542805950396476, 'samples': 4428096, 'steps': 23062, 'loss/train': 1.4607219696044922} 11/07/2021 00:28:35 - INFO - __main__ - Step 23064: {'lr': 0.00047542576515273064, 'samples': 4428288, 'steps': 23063, 'loss/train': 1.9518522024154663} 11/07/2021 00:28:36 - INFO - __main__ - Step 23065: {'lr': 0.0004754234706999231, 'samples': 4428480, 'steps': 23064, 'loss/train': 1.777788758277893} 11/07/2021 00:28:36 - INFO - __main__ - Step 23066: {'lr': 0.0004754211761455432, 'samples': 4428672, 'steps': 23065, 'loss/train': 1.6558383703231812} 11/07/2021 00:28:36 - INFO - __main__ - Step 23067: {'lr': 0.000475418881489592, 'samples': 4428864, 'steps': 23066, 'loss/train': 1.926710605621338} 11/07/2021 00:28:37 - INFO - __main__ - Step 23068: {'lr': 0.0004754165867320706, 'samples': 4429056, 'steps': 23067, 'loss/train': 1.791853904724121} 11/07/2021 00:28:38 - INFO - __main__ - Step 23069: {'lr': 0.00047541429187297984, 'samples': 4429248, 'steps': 23068, 'loss/train': 1.4585204124450684} 11/07/2021 00:28:38 - INFO - __main__ - Step 23070: {'lr': 0.00047541199691232094, 'samples': 4429440, 'steps': 23069, 'loss/train': 1.8373855352401733} 11/07/2021 00:28:39 - INFO - __main__ - Step 23071: {'lr': 0.0004754097018500949, 'samples': 4429632, 'steps': 23070, 'loss/train': 1.821460247039795} 11/07/2021 00:28:39 - INFO - __main__ - Step 23072: {'lr': 0.0004754074066863027, 'samples': 4429824, 'steps': 23071, 'loss/train': 1.2372655868530273} 11/07/2021 00:28:39 - INFO - __main__ - Step 23073: {'lr': 0.0004754051114209454, 'samples': 4430016, 'steps': 23072, 'loss/train': 1.8181707859039307} 11/07/2021 00:28:40 - INFO - __main__ - Step 23074: {'lr': 0.0004754028160540241, 'samples': 4430208, 'steps': 23073, 'loss/train': 1.5179206132888794} 11/07/2021 00:28:41 - INFO - __main__ - Step 23075: {'lr': 0.0004754005205855397, 'samples': 4430400, 'steps': 23074, 'loss/train': 1.5440561771392822} 11/07/2021 00:28:41 - INFO - __main__ - Step 23076: {'lr': 0.0004753982250154933, 'samples': 4430592, 'steps': 23075, 'loss/train': 1.4330766201019287} 11/07/2021 00:28:41 - INFO - __main__ - Step 23077: {'lr': 0.00047539592934388596, 'samples': 4430784, 'steps': 23076, 'loss/train': 1.4417822360992432} 11/07/2021 00:28:42 - INFO - __main__ - Step 23078: {'lr': 0.0004753936335707187, 'samples': 4430976, 'steps': 23077, 'loss/train': 1.5473705530166626} 11/07/2021 00:28:43 - INFO - __main__ - Step 23079: {'lr': 0.0004753913376959925, 'samples': 4431168, 'steps': 23078, 'loss/train': 1.3038986921310425} 11/07/2021 00:28:43 - INFO - __main__ - Step 23080: {'lr': 0.00047538904171970847, 'samples': 4431360, 'steps': 23079, 'loss/train': 1.5698622465133667} 11/07/2021 00:28:43 - INFO - __main__ - Step 23081: {'lr': 0.0004753867456418677, 'samples': 4431552, 'steps': 23080, 'loss/train': 1.5879743099212646} 11/07/2021 00:28:44 - INFO - __main__ - Step 23082: {'lr': 0.000475384449462471, 'samples': 4431744, 'steps': 23081, 'loss/train': 1.3453717231750488} 11/07/2021 00:28:44 - INFO - __main__ - Step 23083: {'lr': 0.00047538215318151955, 'samples': 4431936, 'steps': 23082, 'loss/train': 1.4228205680847168} 11/07/2021 00:28:45 - INFO - __main__ - Step 23084: {'lr': 0.0004753798567990145, 'samples': 4432128, 'steps': 23083, 'loss/train': 1.6919718980789185} 11/07/2021 00:28:46 - INFO - __main__ - Step 23085: {'lr': 0.00047537756031495673, 'samples': 4432320, 'steps': 23084, 'loss/train': 1.6395727396011353} 11/07/2021 00:28:46 - INFO - __main__ - Step 23086: {'lr': 0.0004753752637293473, 'samples': 4432512, 'steps': 23085, 'loss/train': 1.7249788045883179} 11/07/2021 00:28:46 - INFO - __main__ - Step 23087: {'lr': 0.0004753729670421871, 'samples': 4432704, 'steps': 23086, 'loss/train': 1.9568339586257935} 11/07/2021 00:28:47 - INFO - __main__ - Step 23088: {'lr': 0.0004753706702534775, 'samples': 4432896, 'steps': 23087, 'loss/train': 1.488980770111084} 11/07/2021 00:28:48 - INFO - __main__ - Step 23089: {'lr': 0.0004753683733632193, 'samples': 4433088, 'steps': 23088, 'loss/train': 1.579067349433899} 11/07/2021 00:28:48 - INFO - __main__ - Step 23090: {'lr': 0.0004753660763714136, 'samples': 4433280, 'steps': 23089, 'loss/train': 2.105590581893921} 11/07/2021 00:28:48 - INFO - __main__ - Step 23091: {'lr': 0.00047536377927806143, 'samples': 4433472, 'steps': 23090, 'loss/train': 1.8617920875549316} 11/07/2021 00:28:49 - INFO - __main__ - Step 23092: {'lr': 0.0004753614820831638, 'samples': 4433664, 'steps': 23091, 'loss/train': 1.3575159311294556} 11/07/2021 00:28:49 - INFO - __main__ - Step 23093: {'lr': 0.0004753591847867218, 'samples': 4433856, 'steps': 23092, 'loss/train': 1.422724723815918} 11/07/2021 00:28:50 - INFO - __main__ - Step 23094: {'lr': 0.0004753568873887364, 'samples': 4434048, 'steps': 23093, 'loss/train': 1.7011767625808716} 11/07/2021 00:28:50 - INFO - __main__ - Step 23095: {'lr': 0.00047535458988920865, 'samples': 4434240, 'steps': 23094, 'loss/train': 1.6001523733139038} 11/07/2021 00:28:51 - INFO - __main__ - Step 23096: {'lr': 0.0004753522922881396, 'samples': 4434432, 'steps': 23095, 'loss/train': 1.5050978660583496} 11/07/2021 00:28:51 - INFO - __main__ - Step 23097: {'lr': 0.00047534999458553027, 'samples': 4434624, 'steps': 23096, 'loss/train': 1.705203652381897} 11/07/2021 00:28:51 - INFO - __main__ - Step 23098: {'lr': 0.00047534769678138177, 'samples': 4434816, 'steps': 23097, 'loss/train': 1.208935260772705} 11/07/2021 00:28:52 - INFO - __main__ - Step 23099: {'lr': 0.00047534539887569507, 'samples': 4435008, 'steps': 23098, 'loss/train': 1.8251349925994873} 11/07/2021 00:28:53 - INFO - __main__ - Step 23100: {'lr': 0.00047534310086847116, 'samples': 4435200, 'steps': 23099, 'loss/train': 1.7464089393615723} 11/07/2021 00:28:53 - INFO - __main__ - Step 23101: {'lr': 0.0004753408027597111, 'samples': 4435392, 'steps': 23100, 'loss/train': 1.768896460533142} 11/07/2021 00:28:54 - INFO - __main__ - Step 23102: {'lr': 0.0004753385045494161, 'samples': 4435584, 'steps': 23101, 'loss/train': 1.7746583223342896} 11/07/2021 00:28:54 - INFO - __main__ - Step 23103: {'lr': 0.0004753362062375869, 'samples': 4435776, 'steps': 23102, 'loss/train': 1.792876124382019} 11/07/2021 00:28:54 - INFO - __main__ - Step 23104: {'lr': 0.0004753339078242247, 'samples': 4435968, 'steps': 23103, 'loss/train': 1.5717073678970337} 11/07/2021 00:28:55 - INFO - __main__ - Step 23105: {'lr': 0.00047533160930933054, 'samples': 4436160, 'steps': 23104, 'loss/train': 1.265807032585144} 11/07/2021 00:28:56 - INFO - __main__ - Step 23106: {'lr': 0.00047532931069290546, 'samples': 4436352, 'steps': 23105, 'loss/train': 1.633703351020813} 11/07/2021 00:28:56 - INFO - __main__ - Step 23107: {'lr': 0.00047532701197495043, 'samples': 4436544, 'steps': 23106, 'loss/train': 1.6882529258728027} 11/07/2021 00:28:56 - INFO - __main__ - Step 23108: {'lr': 0.00047532471315546654, 'samples': 4436736, 'steps': 23107, 'loss/train': 1.3140676021575928} 11/07/2021 00:28:57 - INFO - __main__ - Step 23109: {'lr': 0.00047532241423445487, 'samples': 4436928, 'steps': 23108, 'loss/train': 1.6697815656661987} 11/07/2021 00:28:58 - INFO - __main__ - Step 23110: {'lr': 0.00047532011521191634, 'samples': 4437120, 'steps': 23109, 'loss/train': 1.6013033390045166} 11/07/2021 00:28:58 - INFO - __main__ - Step 23111: {'lr': 0.00047531781608785203, 'samples': 4437312, 'steps': 23110, 'loss/train': 1.1691371202468872} 11/07/2021 00:28:58 - INFO - __main__ - Step 23112: {'lr': 0.00047531551686226303, 'samples': 4437504, 'steps': 23111, 'loss/train': 1.359718918800354} 11/07/2021 00:28:59 - INFO - __main__ - Step 23113: {'lr': 0.00047531321753515026, 'samples': 4437696, 'steps': 23112, 'loss/train': 1.4464131593704224} 11/07/2021 00:28:59 - INFO - __main__ - Step 23114: {'lr': 0.0004753109181065149, 'samples': 4437888, 'steps': 23113, 'loss/train': 2.013803482055664} 11/07/2021 00:29:00 - INFO - __main__ - Step 23115: {'lr': 0.00047530861857635786, 'samples': 4438080, 'steps': 23114, 'loss/train': 2.0893020629882812} 11/07/2021 00:29:00 - INFO - __main__ - Step 23116: {'lr': 0.00047530631894468034, 'samples': 4438272, 'steps': 23115, 'loss/train': 1.3252930641174316} 11/07/2021 00:29:01 - INFO - __main__ - Step 23117: {'lr': 0.0004753040192114831, 'samples': 4438464, 'steps': 23116, 'loss/train': 1.6963107585906982} 11/07/2021 00:29:01 - INFO - __main__ - Step 23118: {'lr': 0.00047530171937676754, 'samples': 4438656, 'steps': 23117, 'loss/train': 1.0882844924926758} 11/07/2021 00:29:01 - INFO - __main__ - Step 23119: {'lr': 0.0004752994194405344, 'samples': 4438848, 'steps': 23118, 'loss/train': 1.5661946535110474} 11/07/2021 00:29:02 - INFO - __main__ - Step 23120: {'lr': 0.0004752971194027848, 'samples': 4439040, 'steps': 23119, 'loss/train': 1.7195779085159302} 11/07/2021 00:29:03 - INFO - __main__ - Step 23121: {'lr': 0.0004752948192635198, 'samples': 4439232, 'steps': 23120, 'loss/train': 0.6856600046157837} 11/07/2021 00:29:03 - INFO - __main__ - Step 23122: {'lr': 0.0004752925190227405, 'samples': 4439424, 'steps': 23121, 'loss/train': 0.4748719036579132} 11/07/2021 00:29:04 - INFO - __main__ - Step 23123: {'lr': 0.0004752902186804478, 'samples': 4439616, 'steps': 23122, 'loss/train': 1.5372296571731567} 11/07/2021 00:29:04 - INFO - __main__ - Step 23124: {'lr': 0.0004752879182366429, 'samples': 4439808, 'steps': 23123, 'loss/train': 1.7153961658477783} 11/07/2021 00:29:04 - INFO - __main__ - Step 23125: {'lr': 0.0004752856176913266, 'samples': 4440000, 'steps': 23124, 'loss/train': 1.8751126527786255} 11/07/2021 00:29:06 - INFO - __main__ - Step 23126: {'lr': 0.0004752833170445001, 'samples': 4440192, 'steps': 23125, 'loss/train': 1.5678542852401733} 11/07/2021 00:29:06 - INFO - __main__ - Step 23127: {'lr': 0.0004752810162961645, 'samples': 4440384, 'steps': 23126, 'loss/train': 1.930989146232605} 11/07/2021 00:29:06 - INFO - __main__ - Step 23128: {'lr': 0.0004752787154463207, 'samples': 4440576, 'steps': 23127, 'loss/train': 1.4092589616775513} 11/07/2021 00:29:07 - INFO - __main__ - Step 23129: {'lr': 0.0004752764144949698, 'samples': 4440768, 'steps': 23128, 'loss/train': 1.5948010683059692} 11/07/2021 00:29:07 - INFO - __main__ - Step 23130: {'lr': 0.0004752741134421128, 'samples': 4440960, 'steps': 23129, 'loss/train': 1.9338881969451904} 11/07/2021 00:29:08 - INFO - __main__ - Step 23131: {'lr': 0.00047527181228775077, 'samples': 4441152, 'steps': 23130, 'loss/train': 1.63901948928833} 11/07/2021 00:29:08 - INFO - __main__ - Step 23132: {'lr': 0.0004752695110318848, 'samples': 4441344, 'steps': 23131, 'loss/train': 1.393159031867981} 11/07/2021 00:29:09 - INFO - __main__ - Step 23133: {'lr': 0.00047526720967451573, 'samples': 4441536, 'steps': 23132, 'loss/train': 1.695439100265503} 11/07/2021 00:29:09 - INFO - __main__ - Step 23134: {'lr': 0.0004752649082156448, 'samples': 4441728, 'steps': 23133, 'loss/train': 1.4486114978790283} 11/07/2021 00:29:09 - INFO - __main__ - Step 23135: {'lr': 0.00047526260665527306, 'samples': 4441920, 'steps': 23134, 'loss/train': 1.8586546182632446} 11/07/2021 00:29:11 - INFO - __main__ - Step 23136: {'lr': 0.0004752603049934014, 'samples': 4442112, 'steps': 23135, 'loss/train': 1.6190085411071777} 11/07/2021 00:29:11 - INFO - __main__ - Step 23137: {'lr': 0.0004752580032300309, 'samples': 4442304, 'steps': 23136, 'loss/train': 1.5549322366714478} 11/07/2021 00:29:11 - INFO - __main__ - Step 23138: {'lr': 0.0004752557013651626, 'samples': 4442496, 'steps': 23137, 'loss/train': 1.221572995185852} 11/07/2021 00:29:12 - INFO - __main__ - Step 23139: {'lr': 0.00047525339939879764, 'samples': 4442688, 'steps': 23138, 'loss/train': 1.486706018447876} 11/07/2021 00:29:12 - INFO - __main__ - Step 23140: {'lr': 0.0004752510973309369, 'samples': 4442880, 'steps': 23139, 'loss/train': 1.5068085193634033} 11/07/2021 00:29:13 - INFO - __main__ - Step 23141: {'lr': 0.00047524879516158155, 'samples': 4443072, 'steps': 23140, 'loss/train': 0.5440205931663513} 11/07/2021 00:29:13 - INFO - __main__ - Step 23142: {'lr': 0.00047524649289073254, 'samples': 4443264, 'steps': 23141, 'loss/train': 1.9470374584197998} 11/07/2021 00:29:14 - INFO - __main__ - Step 23143: {'lr': 0.00047524419051839093, 'samples': 4443456, 'steps': 23142, 'loss/train': 1.6084150075912476} 11/07/2021 00:29:14 - INFO - __main__ - Step 23144: {'lr': 0.00047524188804455776, 'samples': 4443648, 'steps': 23143, 'loss/train': 1.1105417013168335} 11/07/2021 00:29:14 - INFO - __main__ - Step 23145: {'lr': 0.0004752395854692341, 'samples': 4443840, 'steps': 23144, 'loss/train': 1.6613856554031372} 11/07/2021 00:29:16 - INFO - __main__ - Step 23146: {'lr': 0.0004752372827924209, 'samples': 4444032, 'steps': 23145, 'loss/train': 1.884938359260559} 11/07/2021 00:29:16 - INFO - __main__ - Step 23147: {'lr': 0.0004752349800141193, 'samples': 4444224, 'steps': 23146, 'loss/train': 1.2518686056137085} 11/07/2021 00:29:16 - INFO - __main__ - Step 23148: {'lr': 0.0004752326771343303, 'samples': 4444416, 'steps': 23147, 'loss/train': 2.7502472400665283} 11/07/2021 00:29:17 - INFO - __main__ - Step 23149: {'lr': 0.00047523037415305494, 'samples': 4444608, 'steps': 23148, 'loss/train': 1.4677647352218628} 11/07/2021 00:29:17 - INFO - __main__ - Step 23150: {'lr': 0.0004752280710702942, 'samples': 4444800, 'steps': 23149, 'loss/train': 1.722581148147583} 11/07/2021 00:29:18 - INFO - __main__ - Step 23151: {'lr': 0.0004752257678860492, 'samples': 4444992, 'steps': 23150, 'loss/train': 1.3895703554153442} 11/07/2021 00:29:18 - INFO - __main__ - Step 23152: {'lr': 0.00047522346460032093, 'samples': 4445184, 'steps': 23151, 'loss/train': 1.1857327222824097} 11/07/2021 00:29:19 - INFO - __main__ - Step 23153: {'lr': 0.0004752211612131104, 'samples': 4445376, 'steps': 23152, 'loss/train': 0.6896765828132629} 11/07/2021 00:29:19 - INFO - __main__ - Step 23154: {'lr': 0.00047521885772441874, 'samples': 4445568, 'steps': 23153, 'loss/train': 1.9018213748931885} 11/07/2021 00:29:19 - INFO - __main__ - Step 23155: {'lr': 0.00047521655413424705, 'samples': 4445760, 'steps': 23154, 'loss/train': 2.1501245498657227} 11/07/2021 00:29:20 - INFO - __main__ - Step 23156: {'lr': 0.0004752142504425961, 'samples': 4445952, 'steps': 23155, 'loss/train': 1.5948129892349243} 11/07/2021 00:29:21 - INFO - __main__ - Step 23157: {'lr': 0.0004752119466494671, 'samples': 4446144, 'steps': 23156, 'loss/train': 1.9892706871032715} 11/07/2021 00:29:21 - INFO - __main__ - Step 23158: {'lr': 0.0004752096427548611, 'samples': 4446336, 'steps': 23157, 'loss/train': 1.5363296270370483} 11/07/2021 00:29:21 - INFO - __main__ - Step 23159: {'lr': 0.00047520733875877906, 'samples': 4446528, 'steps': 23158, 'loss/train': 1.5002020597457886} 11/07/2021 00:29:22 - INFO - __main__ - Step 23160: {'lr': 0.00047520503466122216, 'samples': 4446720, 'steps': 23159, 'loss/train': 1.7415108680725098} 11/07/2021 00:29:22 - INFO - __main__ - Step 23161: {'lr': 0.0004752027304621913, 'samples': 4446912, 'steps': 23160, 'loss/train': 0.3823089003562927} 11/07/2021 00:29:23 - INFO - __main__ - Step 23162: {'lr': 0.0004752004261616876, 'samples': 4447104, 'steps': 23161, 'loss/train': 1.2418301105499268} 11/07/2021 00:29:23 - INFO - __main__ - Step 23163: {'lr': 0.000475198121759712, 'samples': 4447296, 'steps': 23162, 'loss/train': 1.2378976345062256} 11/07/2021 00:29:24 - INFO - __main__ - Step 23164: {'lr': 0.0004751958172562656, 'samples': 4447488, 'steps': 23163, 'loss/train': 1.4196279048919678} 11/07/2021 00:29:24 - INFO - __main__ - Step 23165: {'lr': 0.00047519351265134954, 'samples': 4447680, 'steps': 23164, 'loss/train': 1.934503197669983} 11/07/2021 00:29:24 - INFO - __main__ - Step 23166: {'lr': 0.00047519120794496466, 'samples': 4447872, 'steps': 23165, 'loss/train': 1.7241228818893433} 11/07/2021 00:29:26 - INFO - __main__ - Step 23167: {'lr': 0.00047518890313711217, 'samples': 4448064, 'steps': 23166, 'loss/train': 1.5477992296218872} 11/07/2021 00:29:26 - INFO - __main__ - Step 23168: {'lr': 0.000475186598227793, 'samples': 4448256, 'steps': 23167, 'loss/train': 1.3837860822677612} 11/07/2021 00:29:26 - INFO - __main__ - Step 23169: {'lr': 0.0004751842932170082, 'samples': 4448448, 'steps': 23168, 'loss/train': 1.5913500785827637} 11/07/2021 00:29:27 - INFO - __main__ - Step 23170: {'lr': 0.00047518198810475885, 'samples': 4448640, 'steps': 23169, 'loss/train': 1.5390900373458862} 11/07/2021 00:29:27 - INFO - __main__ - Step 23171: {'lr': 0.00047517968289104596, 'samples': 4448832, 'steps': 23170, 'loss/train': 1.9195278882980347} 11/07/2021 00:29:28 - INFO - __main__ - Step 23172: {'lr': 0.0004751773775758706, 'samples': 4449024, 'steps': 23171, 'loss/train': 1.7877540588378906} 11/07/2021 00:29:28 - INFO - __main__ - Step 23173: {'lr': 0.00047517507215923376, 'samples': 4449216, 'steps': 23172, 'loss/train': 1.3253355026245117} 11/07/2021 00:29:29 - INFO - __main__ - Step 23174: {'lr': 0.00047517276664113653, 'samples': 4449408, 'steps': 23173, 'loss/train': 1.5295782089233398} 11/07/2021 00:29:29 - INFO - __main__ - Step 23175: {'lr': 0.0004751704610215799, 'samples': 4449600, 'steps': 23174, 'loss/train': 0.9073389172554016} 11/07/2021 00:29:29 - INFO - __main__ - Step 23176: {'lr': 0.000475168155300565, 'samples': 4449792, 'steps': 23175, 'loss/train': 1.260116696357727} 11/07/2021 00:29:30 - INFO - __main__ - Step 23177: {'lr': 0.00047516584947809274, 'samples': 4449984, 'steps': 23176, 'loss/train': 1.680338978767395} 11/07/2021 00:29:31 - INFO - __main__ - Step 23178: {'lr': 0.00047516354355416426, 'samples': 4450176, 'steps': 23177, 'loss/train': 1.5092757940292358} 11/07/2021 00:29:31 - INFO - __main__ - Step 23179: {'lr': 0.00047516123752878054, 'samples': 4450368, 'steps': 23178, 'loss/train': 1.9057352542877197} 11/07/2021 00:29:31 - INFO - __main__ - Step 23180: {'lr': 0.00047515893140194265, 'samples': 4450560, 'steps': 23179, 'loss/train': 1.6779406070709229} 11/07/2021 00:29:32 - INFO - __main__ - Step 23181: {'lr': 0.0004751566251736516, 'samples': 4450752, 'steps': 23180, 'loss/train': 1.547135591506958} 11/07/2021 00:29:33 - INFO - __main__ - Step 23182: {'lr': 0.00047515431884390845, 'samples': 4450944, 'steps': 23181, 'loss/train': 1.668994665145874} 11/07/2021 00:29:33 - INFO - __main__ - Step 23183: {'lr': 0.00047515201241271426, 'samples': 4451136, 'steps': 23182, 'loss/train': 2.094433307647705} 11/07/2021 00:29:34 - INFO - __main__ - Step 23184: {'lr': 0.00047514970588007007, 'samples': 4451328, 'steps': 23183, 'loss/train': 1.9435114860534668} 11/07/2021 00:29:34 - INFO - __main__ - Step 23185: {'lr': 0.0004751473992459768, 'samples': 4451520, 'steps': 23184, 'loss/train': 1.3332525491714478} 11/07/2021 00:29:34 - INFO - __main__ - Step 23186: {'lr': 0.0004751450925104357, 'samples': 4451712, 'steps': 23185, 'loss/train': 1.8284626007080078} 11/07/2021 00:29:35 - INFO - __main__ - Step 23187: {'lr': 0.00047514278567344765, 'samples': 4451904, 'steps': 23186, 'loss/train': 1.9439247846603394} 11/07/2021 00:29:36 - INFO - __main__ - Step 23188: {'lr': 0.00047514047873501374, 'samples': 4452096, 'steps': 23187, 'loss/train': 1.6083027124404907} 11/07/2021 00:29:36 - INFO - __main__ - Step 23189: {'lr': 0.000475138171695135, 'samples': 4452288, 'steps': 23188, 'loss/train': 2.0748753547668457} 11/07/2021 00:29:36 - INFO - __main__ - Step 23190: {'lr': 0.00047513586455381245, 'samples': 4452480, 'steps': 23189, 'loss/train': 1.4966256618499756} 11/07/2021 00:29:37 - INFO - __main__ - Step 23191: {'lr': 0.00047513355731104717, 'samples': 4452672, 'steps': 23190, 'loss/train': 1.6222933530807495} 11/07/2021 00:29:37 - INFO - __main__ - Step 23192: {'lr': 0.0004751312499668402, 'samples': 4452864, 'steps': 23191, 'loss/train': 0.9162195324897766} 11/07/2021 00:29:38 - INFO - __main__ - Step 23193: {'lr': 0.00047512894252119256, 'samples': 4453056, 'steps': 23192, 'loss/train': 1.50593101978302} 11/07/2021 00:29:38 - INFO - __main__ - Step 23194: {'lr': 0.0004751266349741053, 'samples': 4453248, 'steps': 23193, 'loss/train': 1.4306085109710693} 11/07/2021 00:29:39 - INFO - __main__ - Step 23195: {'lr': 0.0004751243273255794, 'samples': 4453440, 'steps': 23194, 'loss/train': 1.7216869592666626} 11/07/2021 00:29:39 - INFO - __main__ - Step 23196: {'lr': 0.000475122019575616, 'samples': 4453632, 'steps': 23195, 'loss/train': 1.568893313407898} 11/07/2021 00:29:39 - INFO - __main__ - Step 23197: {'lr': 0.0004751197117242161, 'samples': 4453824, 'steps': 23196, 'loss/train': 1.283979892730713} 11/07/2021 00:29:40 - INFO - __main__ - Step 23198: {'lr': 0.0004751174037713807, 'samples': 4454016, 'steps': 23197, 'loss/train': 1.5490138530731201} 11/07/2021 00:29:41 - INFO - __main__ - Step 23199: {'lr': 0.00047511509571711085, 'samples': 4454208, 'steps': 23198, 'loss/train': 1.2735344171524048} 11/07/2021 00:29:41 - INFO - __main__ - Step 23200: {'lr': 0.00047511278756140766, 'samples': 4454400, 'steps': 23199, 'loss/train': 1.056983232498169} 11/07/2021 00:29:41 - INFO - __main__ - Step 23201: {'lr': 0.00047511047930427216, 'samples': 4454592, 'steps': 23200, 'loss/train': 1.5129168033599854} 11/07/2021 00:29:42 - INFO - __main__ - Step 23202: {'lr': 0.00047510817094570526, 'samples': 4454784, 'steps': 23201, 'loss/train': 1.2528656721115112} 11/07/2021 00:29:43 - INFO - __main__ - Step 23203: {'lr': 0.00047510586248570815, 'samples': 4454976, 'steps': 23202, 'loss/train': 1.2748398780822754} 11/07/2021 00:29:43 - INFO - __main__ - Step 23204: {'lr': 0.00047510355392428176, 'samples': 4455168, 'steps': 23203, 'loss/train': 1.3079149723052979} 11/07/2021 00:29:44 - INFO - __main__ - Step 23205: {'lr': 0.00047510124526142723, 'samples': 4455360, 'steps': 23204, 'loss/train': 1.3610676527023315} 11/07/2021 00:29:44 - INFO - __main__ - Step 23206: {'lr': 0.00047509893649714554, 'samples': 4455552, 'steps': 23205, 'loss/train': 1.3533624410629272} 11/07/2021 00:29:44 - INFO - __main__ - Step 23207: {'lr': 0.00047509662763143775, 'samples': 4455744, 'steps': 23206, 'loss/train': 1.393203854560852} 11/07/2021 00:29:45 - INFO - __main__ - Step 23208: {'lr': 0.00047509431866430487, 'samples': 4455936, 'steps': 23207, 'loss/train': 1.4738633632659912} 11/07/2021 00:29:46 - INFO - __main__ - Step 23209: {'lr': 0.000475092009595748, 'samples': 4456128, 'steps': 23208, 'loss/train': 0.9779757857322693} 11/07/2021 00:29:46 - INFO - __main__ - Step 23210: {'lr': 0.0004750897004257681, 'samples': 4456320, 'steps': 23209, 'loss/train': 0.19317372143268585} 11/07/2021 00:29:46 - INFO - __main__ - Step 23211: {'lr': 0.0004750873911543663, 'samples': 4456512, 'steps': 23210, 'loss/train': 1.6986215114593506} 11/07/2021 00:29:47 - INFO - __main__ - Step 23212: {'lr': 0.00047508508178154354, 'samples': 4456704, 'steps': 23211, 'loss/train': 1.6802526712417603} 11/07/2021 00:29:48 - INFO - __main__ - Step 23213: {'lr': 0.00047508277230730095, 'samples': 4456896, 'steps': 23212, 'loss/train': 1.0024389028549194} 11/07/2021 00:29:48 - INFO - __main__ - Step 23214: {'lr': 0.00047508046273163953, 'samples': 4457088, 'steps': 23213, 'loss/train': 1.4807589054107666} 11/07/2021 00:29:49 - INFO - __main__ - Step 23215: {'lr': 0.0004750781530545603, 'samples': 4457280, 'steps': 23214, 'loss/train': 1.3570733070373535} 11/07/2021 00:29:49 - INFO - __main__ - Step 23216: {'lr': 0.0004750758432760644, 'samples': 4457472, 'steps': 23215, 'loss/train': 1.3327617645263672} 11/07/2021 00:29:49 - INFO - __main__ - Step 23217: {'lr': 0.0004750735333961527, 'samples': 4457664, 'steps': 23216, 'loss/train': 1.9167391061782837} 11/07/2021 00:29:50 - INFO - __main__ - Step 23218: {'lr': 0.00047507122341482644, 'samples': 4457856, 'steps': 23217, 'loss/train': 1.5629518032073975} 11/07/2021 00:29:51 - INFO - __main__ - Step 23219: {'lr': 0.00047506891333208654, 'samples': 4458048, 'steps': 23218, 'loss/train': 1.445533275604248} 11/07/2021 00:29:51 - INFO - __main__ - Step 23220: {'lr': 0.000475066603147934, 'samples': 4458240, 'steps': 23219, 'loss/train': 1.4702428579330444} 11/07/2021 00:29:51 - INFO - __main__ - Step 23221: {'lr': 0.00047506429286236997, 'samples': 4458432, 'steps': 23220, 'loss/train': 0.974677562713623} 11/07/2021 00:29:52 - INFO - __main__ - Step 23222: {'lr': 0.00047506198247539546, 'samples': 4458624, 'steps': 23221, 'loss/train': 2.327030658721924} 11/07/2021 00:29:52 - INFO - __main__ - Step 23223: {'lr': 0.0004750596719870114, 'samples': 4458816, 'steps': 23222, 'loss/train': 1.695296287536621} 11/07/2021 00:29:53 - INFO - __main__ - Step 23224: {'lr': 0.000475057361397219, 'samples': 4459008, 'steps': 23223, 'loss/train': 1.4242885112762451} 11/07/2021 00:29:53 - INFO - __main__ - Step 23225: {'lr': 0.0004750550507060192, 'samples': 4459200, 'steps': 23224, 'loss/train': 1.8439444303512573} 11/07/2021 00:29:54 - INFO - __main__ - Step 23226: {'lr': 0.0004750527399134131, 'samples': 4459392, 'steps': 23225, 'loss/train': 1.62975013256073} 11/07/2021 00:29:54 - INFO - __main__ - Step 23227: {'lr': 0.00047505042901940163, 'samples': 4459584, 'steps': 23226, 'loss/train': 0.7999213933944702} 11/07/2021 00:29:55 - INFO - __main__ - Step 23228: {'lr': 0.00047504811802398603, 'samples': 4459776, 'steps': 23227, 'loss/train': 0.9568268656730652} 11/07/2021 00:29:56 - INFO - __main__ - Step 23229: {'lr': 0.0004750458069271671, 'samples': 4459968, 'steps': 23228, 'loss/train': 1.195821762084961} 11/07/2021 00:29:56 - INFO - __main__ - Step 23230: {'lr': 0.0004750434957289461, 'samples': 4460160, 'steps': 23229, 'loss/train': 1.61564040184021} 11/07/2021 00:29:56 - INFO - __main__ - Step 23231: {'lr': 0.0004750411844293239, 'samples': 4460352, 'steps': 23230, 'loss/train': 1.6031367778778076} 11/07/2021 00:29:57 - INFO - __main__ - Step 23232: {'lr': 0.0004750388730283016, 'samples': 4460544, 'steps': 23231, 'loss/train': 1.7405587434768677} 11/07/2021 00:29:57 - INFO - __main__ - Step 23233: {'lr': 0.0004750365615258804, 'samples': 4460736, 'steps': 23232, 'loss/train': 1.6886457204818726} 11/07/2021 00:29:58 - INFO - __main__ - Step 23234: {'lr': 0.00047503424992206107, 'samples': 4460928, 'steps': 23233, 'loss/train': 1.7403072118759155} 11/07/2021 00:29:59 - INFO - __main__ - Step 23235: {'lr': 0.00047503193821684476, 'samples': 4461120, 'steps': 23234, 'loss/train': 1.0748543739318848} 11/07/2021 00:29:59 - INFO - __main__ - Step 23236: {'lr': 0.0004750296264102326, 'samples': 4461312, 'steps': 23235, 'loss/train': 0.7899654507637024} 11/07/2021 00:29:59 - INFO - __main__ - Step 23237: {'lr': 0.0004750273145022256, 'samples': 4461504, 'steps': 23236, 'loss/train': 1.9232375621795654} 11/07/2021 00:30:00 - INFO - __main__ - Step 23238: {'lr': 0.00047502500249282464, 'samples': 4461696, 'steps': 23237, 'loss/train': 2.210071563720703} 11/07/2021 00:30:00 - INFO - __main__ - Step 23239: {'lr': 0.000475022690382031, 'samples': 4461888, 'steps': 23238, 'loss/train': 1.4127061367034912} 11/07/2021 00:30:01 - INFO - __main__ - Step 23240: {'lr': 0.0004750203781698456, 'samples': 4462080, 'steps': 23239, 'loss/train': 1.924795150756836} 11/07/2021 00:30:02 - INFO - __main__ - Step 23241: {'lr': 0.0004750180658562694, 'samples': 4462272, 'steps': 23240, 'loss/train': 1.7388290166854858} 11/07/2021 00:30:02 - INFO - __main__ - Step 23242: {'lr': 0.00047501575344130356, 'samples': 4462464, 'steps': 23241, 'loss/train': 1.7781851291656494} 11/07/2021 00:30:02 - INFO - __main__ - Step 23243: {'lr': 0.00047501344092494915, 'samples': 4462656, 'steps': 23242, 'loss/train': 1.8841161727905273} 11/07/2021 00:30:03 - INFO - __main__ - Step 23244: {'lr': 0.0004750111283072071, 'samples': 4462848, 'steps': 23243, 'loss/train': 1.3987021446228027} 11/07/2021 00:30:04 - INFO - __main__ - Step 23245: {'lr': 0.00047500881558807854, 'samples': 4463040, 'steps': 23244, 'loss/train': 1.4467869997024536} 11/07/2021 00:30:04 - INFO - __main__ - Step 23246: {'lr': 0.00047500650276756455, 'samples': 4463232, 'steps': 23245, 'loss/train': 1.4589742422103882} 11/07/2021 00:30:04 - INFO - __main__ - Step 23247: {'lr': 0.00047500418984566594, 'samples': 4463424, 'steps': 23246, 'loss/train': 1.8420617580413818} 11/07/2021 00:30:05 - INFO - __main__ - Step 23248: {'lr': 0.000475001876822384, 'samples': 4463616, 'steps': 23247, 'loss/train': 1.821009635925293} 11/07/2021 00:30:05 - INFO - __main__ - Step 23249: {'lr': 0.00047499956369771967, 'samples': 4463808, 'steps': 23248, 'loss/train': 1.7371834516525269} 11/07/2021 00:30:06 - INFO - __main__ - Step 23250: {'lr': 0.00047499725047167406, 'samples': 4464000, 'steps': 23249, 'loss/train': 1.7576009035110474} 11/07/2021 00:30:06 - INFO - __main__ - Step 23251: {'lr': 0.0004749949371442481, 'samples': 4464192, 'steps': 23250, 'loss/train': 1.6941187381744385} 11/07/2021 00:30:07 - INFO - __main__ - Step 23252: {'lr': 0.00047499262371544294, 'samples': 4464384, 'steps': 23251, 'loss/train': 1.533231496810913} 11/07/2021 00:30:07 - INFO - __main__ - Step 23253: {'lr': 0.00047499031018525953, 'samples': 4464576, 'steps': 23252, 'loss/train': 0.6648880839347839} 11/07/2021 00:30:07 - INFO - __main__ - Step 23254: {'lr': 0.00047498799655369895, 'samples': 4464768, 'steps': 23253, 'loss/train': 1.6462962627410889} 11/07/2021 00:30:08 - INFO - __main__ - Step 23255: {'lr': 0.0004749856828207623, 'samples': 4464960, 'steps': 23254, 'loss/train': 2.133352041244507} 11/07/2021 00:30:09 - INFO - __main__ - Step 23256: {'lr': 0.00047498336898645055, 'samples': 4465152, 'steps': 23255, 'loss/train': 1.3252581357955933} 11/07/2021 00:30:09 - INFO - __main__ - Step 23257: {'lr': 0.00047498105505076475, 'samples': 4465344, 'steps': 23256, 'loss/train': 1.2903780937194824} 11/07/2021 00:30:09 - INFO - __main__ - Step 23258: {'lr': 0.000474978741013706, 'samples': 4465536, 'steps': 23257, 'loss/train': 1.5443459749221802} 11/07/2021 00:30:10 - INFO - __main__ - Step 23259: {'lr': 0.0004749764268752753, 'samples': 4465728, 'steps': 23258, 'loss/train': 1.795452356338501} 11/07/2021 00:30:10 - INFO - __main__ - Step 23260: {'lr': 0.0004749741126354736, 'samples': 4465920, 'steps': 23259, 'loss/train': 1.6947624683380127} 11/07/2021 00:30:11 - INFO - __main__ - Step 23261: {'lr': 0.00047497179829430217, 'samples': 4466112, 'steps': 23260, 'loss/train': 1.524401307106018} 11/07/2021 00:30:11 - INFO - __main__ - Step 23262: {'lr': 0.0004749694838517619, 'samples': 4466304, 'steps': 23261, 'loss/train': 1.6715247631072998} 11/07/2021 00:30:12 - INFO - __main__ - Step 23263: {'lr': 0.0004749671693078538, 'samples': 4466496, 'steps': 23262, 'loss/train': 1.1825659275054932} 11/07/2021 00:30:12 - INFO - __main__ - Step 23264: {'lr': 0.00047496485466257896, 'samples': 4466688, 'steps': 23263, 'loss/train': 1.2815076112747192} 11/07/2021 00:30:13 - INFO - __main__ - Step 23265: {'lr': 0.0004749625399159384, 'samples': 4466880, 'steps': 23264, 'loss/train': 0.34565478563308716} 11/07/2021 00:30:14 - INFO - __main__ - Step 23266: {'lr': 0.0004749602250679332, 'samples': 4467072, 'steps': 23265, 'loss/train': 1.8178528547286987} 11/07/2021 00:30:14 - INFO - __main__ - Step 23267: {'lr': 0.00047495791011856447, 'samples': 4467264, 'steps': 23266, 'loss/train': 1.5307931900024414} 11/07/2021 00:30:14 - INFO - __main__ - Step 23268: {'lr': 0.00047495559506783317, 'samples': 4467456, 'steps': 23267, 'loss/train': 1.597183346748352} 11/07/2021 00:30:15 - INFO - __main__ - Step 23269: {'lr': 0.00047495327991574034, 'samples': 4467648, 'steps': 23268, 'loss/train': 1.6375641822814941} 11/07/2021 00:30:15 - INFO - __main__ - Step 23270: {'lr': 0.0004749509646622869, 'samples': 4467840, 'steps': 23269, 'loss/train': 2.025968551635742} 11/07/2021 00:30:16 - INFO - __main__ - Step 23271: {'lr': 0.00047494864930747415, 'samples': 4468032, 'steps': 23270, 'loss/train': 5.877805233001709} 11/07/2021 00:30:17 - INFO - __main__ - Step 23272: {'lr': 0.000474946333851303, 'samples': 4468224, 'steps': 23271, 'loss/train': 0.9324080348014832} 11/07/2021 00:30:17 - INFO - __main__ - Step 23273: {'lr': 0.0004749440182937745, 'samples': 4468416, 'steps': 23272, 'loss/train': 1.8106902837753296} 11/07/2021 00:30:17 - INFO - __main__ - Step 23274: {'lr': 0.0004749417026348897, 'samples': 4468608, 'steps': 23273, 'loss/train': 1.6376235485076904} 11/07/2021 00:30:18 - INFO - __main__ - Step 23275: {'lr': 0.0004749393868746497, 'samples': 4468800, 'steps': 23274, 'loss/train': 1.6495087146759033} 11/07/2021 00:30:18 - INFO - __main__ - Step 23276: {'lr': 0.0004749370710130554, 'samples': 4468992, 'steps': 23275, 'loss/train': 2.2123794555664062} 11/07/2021 00:30:19 - INFO - __main__ - Step 23277: {'lr': 0.00047493475505010793, 'samples': 4469184, 'steps': 23276, 'loss/train': 0.2711905539035797} 11/07/2021 00:30:19 - INFO - __main__ - Step 23278: {'lr': 0.0004749324389858083, 'samples': 4469376, 'steps': 23277, 'loss/train': 1.7561227083206177} 11/07/2021 00:30:20 - INFO - __main__ - Step 23279: {'lr': 0.00047493012282015767, 'samples': 4469568, 'steps': 23278, 'loss/train': 1.7865687608718872} 11/07/2021 00:30:20 - INFO - __main__ - Step 23280: {'lr': 0.00047492780655315693, 'samples': 4469760, 'steps': 23279, 'loss/train': 1.2692395448684692} 11/07/2021 00:30:20 - INFO - __main__ - Step 23281: {'lr': 0.00047492549018480725, 'samples': 4469952, 'steps': 23280, 'loss/train': 1.1253389120101929} 11/07/2021 00:30:21 - INFO - __main__ - Step 23282: {'lr': 0.00047492317371510955, 'samples': 4470144, 'steps': 23281, 'loss/train': 1.7119258642196655} 11/07/2021 00:30:22 - INFO - __main__ - Step 23283: {'lr': 0.00047492085714406497, 'samples': 4470336, 'steps': 23282, 'loss/train': 1.870627760887146} 11/07/2021 00:30:22 - INFO - __main__ - Step 23284: {'lr': 0.00047491854047167453, 'samples': 4470528, 'steps': 23283, 'loss/train': 1.5494314432144165} 11/07/2021 00:30:23 - INFO - __main__ - Step 23285: {'lr': 0.0004749162236979393, 'samples': 4470720, 'steps': 23284, 'loss/train': 1.2017252445220947} 11/07/2021 00:30:23 - INFO - __main__ - Step 23286: {'lr': 0.0004749139068228602, 'samples': 4470912, 'steps': 23285, 'loss/train': 1.548548698425293} 11/07/2021 00:30:24 - INFO - __main__ - Step 23287: {'lr': 0.00047491158984643846, 'samples': 4471104, 'steps': 23286, 'loss/train': 1.3301843404769897} 11/07/2021 00:30:24 - INFO - __main__ - Step 23288: {'lr': 0.0004749092727686749, 'samples': 4471296, 'steps': 23287, 'loss/train': 1.8255505561828613} 11/07/2021 00:30:25 - INFO - __main__ - Step 23289: {'lr': 0.00047490695558957083, 'samples': 4471488, 'steps': 23288, 'loss/train': 1.5257481336593628} 11/07/2021 00:30:25 - INFO - __main__ - Step 23290: {'lr': 0.00047490463830912713, 'samples': 4471680, 'steps': 23289, 'loss/train': 1.5626459121704102} 11/07/2021 00:30:26 - INFO - __main__ - Step 23291: {'lr': 0.0004749023209273448, 'samples': 4471872, 'steps': 23290, 'loss/train': 1.669747233390808} 11/07/2021 00:30:26 - INFO - __main__ - Step 23292: {'lr': 0.000474900003444225, 'samples': 4472064, 'steps': 23291, 'loss/train': 1.540729284286499} 11/07/2021 00:30:27 - INFO - __main__ - Step 23293: {'lr': 0.0004748976858597687, 'samples': 4472256, 'steps': 23292, 'loss/train': 1.4468985795974731} 11/07/2021 00:30:27 - INFO - __main__ - Step 23294: {'lr': 0.00047489536817397706, 'samples': 4472448, 'steps': 23293, 'loss/train': 1.6595637798309326} 11/07/2021 00:30:28 - INFO - __main__ - Step 23295: {'lr': 0.00047489305038685094, 'samples': 4472640, 'steps': 23294, 'loss/train': 1.4057589769363403} 11/07/2021 00:30:28 - INFO - __main__ - Step 23296: {'lr': 0.00047489073249839153, 'samples': 4472832, 'steps': 23295, 'loss/train': 1.657401442527771} 11/07/2021 00:30:29 - INFO - __main__ - Step 23297: {'lr': 0.0004748884145085998, 'samples': 4473024, 'steps': 23296, 'loss/train': 1.5072712898254395} 11/07/2021 00:30:29 - INFO - __main__ - Step 23298: {'lr': 0.0004748860964174768, 'samples': 4473216, 'steps': 23297, 'loss/train': 1.3847980499267578} 11/07/2021 00:30:30 - INFO - __main__ - Step 23299: {'lr': 0.00047488377822502365, 'samples': 4473408, 'steps': 23298, 'loss/train': 1.3205068111419678} 11/07/2021 00:30:30 - INFO - __main__ - Step 23300: {'lr': 0.00047488145993124134, 'samples': 4473600, 'steps': 23299, 'loss/train': 0.8308882713317871} 11/07/2021 00:30:30 - INFO - __main__ - Step 23301: {'lr': 0.0004748791415361309, 'samples': 4473792, 'steps': 23300, 'loss/train': 1.611672043800354} 11/07/2021 00:30:31 - INFO - __main__ - Step 23302: {'lr': 0.00047487682303969336, 'samples': 4473984, 'steps': 23301, 'loss/train': 2.2568047046661377} 11/07/2021 00:30:32 - INFO - __main__ - Step 23303: {'lr': 0.0004748745044419298, 'samples': 4474176, 'steps': 23302, 'loss/train': 1.8289871215820312} 11/07/2021 00:30:32 - INFO - __main__ - Step 23304: {'lr': 0.0004748721857428413, 'samples': 4474368, 'steps': 23303, 'loss/train': 0.8408457040786743} 11/07/2021 00:30:32 - INFO - __main__ - Step 23305: {'lr': 0.00047486986694242887, 'samples': 4474560, 'steps': 23304, 'loss/train': 1.4229025840759277} 11/07/2021 00:30:33 - INFO - __main__ - Step 23306: {'lr': 0.0004748675480406934, 'samples': 4474752, 'steps': 23305, 'loss/train': 1.8904900550842285} 11/07/2021 00:30:33 - INFO - __main__ - Step 23307: {'lr': 0.0004748652290376363, 'samples': 4474944, 'steps': 23306, 'loss/train': 1.0955767631530762} 11/07/2021 00:30:34 - INFO - __main__ - Step 23308: {'lr': 0.00047486290993325824, 'samples': 4475136, 'steps': 23307, 'loss/train': 1.1587820053100586} 11/07/2021 00:30:35 - INFO - __main__ - Step 23309: {'lr': 0.00047486059072756047, 'samples': 4475328, 'steps': 23308, 'loss/train': 1.7340482473373413} 11/07/2021 00:30:35 - INFO - __main__ - Step 23310: {'lr': 0.00047485827142054407, 'samples': 4475520, 'steps': 23309, 'loss/train': 1.6572470664978027} 11/07/2021 00:30:35 - INFO - __main__ - Step 23311: {'lr': 0.0004748559520122099, 'samples': 4475712, 'steps': 23310, 'loss/train': 1.7451789379119873} 11/07/2021 00:30:36 - INFO - __main__ - Step 23312: {'lr': 0.0004748536325025591, 'samples': 4475904, 'steps': 23311, 'loss/train': 0.9648639559745789} 11/07/2021 00:30:37 - INFO - __main__ - Step 23313: {'lr': 0.0004748513128915928, 'samples': 4476096, 'steps': 23312, 'loss/train': 1.480389952659607} 11/07/2021 00:30:37 - INFO - __main__ - Step 23314: {'lr': 0.0004748489931793119, 'samples': 4476288, 'steps': 23313, 'loss/train': 1.9649654626846313} 11/07/2021 00:30:37 - INFO - __main__ - Step 23315: {'lr': 0.00047484667336571753, 'samples': 4476480, 'steps': 23314, 'loss/train': 1.320034384727478} 11/07/2021 00:30:38 - INFO - __main__ - Step 23316: {'lr': 0.0004748443534508107, 'samples': 4476672, 'steps': 23315, 'loss/train': 1.7430577278137207} 11/07/2021 00:30:38 - INFO - __main__ - Step 23317: {'lr': 0.00047484203343459256, 'samples': 4476864, 'steps': 23316, 'loss/train': 1.815623164176941} 11/07/2021 00:30:39 - INFO - __main__ - Step 23318: {'lr': 0.000474839713317064, 'samples': 4477056, 'steps': 23317, 'loss/train': 1.7453724145889282} 11/07/2021 00:30:39 - INFO - __main__ - Step 23319: {'lr': 0.00047483739309822615, 'samples': 4477248, 'steps': 23318, 'loss/train': 1.70332932472229} 11/07/2021 00:30:40 - INFO - __main__ - Step 23320: {'lr': 0.00047483507277808, 'samples': 4477440, 'steps': 23319, 'loss/train': 1.5589721202850342} 11/07/2021 00:30:40 - INFO - __main__ - Step 23321: {'lr': 0.0004748327523566267, 'samples': 4477632, 'steps': 23320, 'loss/train': 1.1599167585372925} 11/07/2021 00:30:40 - INFO - __main__ - Step 23322: {'lr': 0.0004748304318338672, 'samples': 4477824, 'steps': 23321, 'loss/train': 1.7838388681411743} 11/07/2021 00:30:42 - INFO - __main__ - Step 23323: {'lr': 0.00047482811120980254, 'samples': 4478016, 'steps': 23322, 'loss/train': 1.4691747426986694} 11/07/2021 00:30:42 - INFO - __main__ - Step 23324: {'lr': 0.0004748257904844339, 'samples': 4478208, 'steps': 23323, 'loss/train': 1.624101996421814} 11/07/2021 00:30:42 - INFO - __main__ - Step 23325: {'lr': 0.00047482346965776215, 'samples': 4478400, 'steps': 23324, 'loss/train': 1.6079250574111938} 11/07/2021 00:30:43 - INFO - __main__ - Step 23326: {'lr': 0.0004748211487297884, 'samples': 4478592, 'steps': 23325, 'loss/train': 1.8467061519622803} 11/07/2021 00:30:43 - INFO - __main__ - Step 23327: {'lr': 0.00047481882770051377, 'samples': 4478784, 'steps': 23326, 'loss/train': 2.046962261199951} 11/07/2021 00:30:44 - INFO - __main__ - Step 23328: {'lr': 0.00047481650656993924, 'samples': 4478976, 'steps': 23327, 'loss/train': 0.17459586262702942} 11/07/2021 00:30:44 - INFO - __main__ - Step 23329: {'lr': 0.00047481418533806586, 'samples': 4479168, 'steps': 23328, 'loss/train': 1.3363194465637207} 11/07/2021 00:30:45 - INFO - __main__ - Step 23330: {'lr': 0.0004748118640048946, 'samples': 4479360, 'steps': 23329, 'loss/train': 1.5212841033935547} 11/07/2021 00:30:45 - INFO - __main__ - Step 23331: {'lr': 0.00047480954257042666, 'samples': 4479552, 'steps': 23330, 'loss/train': 1.509041428565979} 11/07/2021 00:30:46 - INFO - __main__ - Step 23332: {'lr': 0.000474807221034663, 'samples': 4479744, 'steps': 23331, 'loss/train': 1.1403354406356812} 11/07/2021 00:30:46 - INFO - __main__ - Step 23333: {'lr': 0.0004748048993976046, 'samples': 4479936, 'steps': 23332, 'loss/train': 1.8300435543060303} 11/07/2021 00:30:47 - INFO - __main__ - Step 23334: {'lr': 0.0004748025776592527, 'samples': 4480128, 'steps': 23333, 'loss/train': 1.7769771814346313} 11/07/2021 00:30:47 - INFO - __main__ - Step 23335: {'lr': 0.00047480025581960817, 'samples': 4480320, 'steps': 23334, 'loss/train': 1.5390480756759644} 11/07/2021 00:30:48 - INFO - __main__ - Step 23336: {'lr': 0.0004747979338786721, 'samples': 4480512, 'steps': 23335, 'loss/train': 1.405137062072754} 11/07/2021 00:30:48 - INFO - __main__ - Step 23337: {'lr': 0.00047479561183644557, 'samples': 4480704, 'steps': 23336, 'loss/train': 1.7070698738098145} 11/07/2021 00:30:48 - INFO - __main__ - Step 23338: {'lr': 0.00047479328969292963, 'samples': 4480896, 'steps': 23337, 'loss/train': 1.2690377235412598} 11/07/2021 00:30:49 - INFO - __main__ - Step 23339: {'lr': 0.0004747909674481253, 'samples': 4481088, 'steps': 23338, 'loss/train': 1.9424190521240234} 11/07/2021 00:30:50 - INFO - __main__ - Step 23340: {'lr': 0.00047478864510203355, 'samples': 4481280, 'steps': 23339, 'loss/train': 1.096649169921875} 11/07/2021 00:30:50 - INFO - __main__ - Step 23341: {'lr': 0.0004747863226546556, 'samples': 4481472, 'steps': 23340, 'loss/train': 1.366512417793274} 11/07/2021 00:30:50 - INFO - __main__ - Step 23342: {'lr': 0.0004747840001059923, 'samples': 4481664, 'steps': 23341, 'loss/train': 1.7543566226959229} 11/07/2021 00:30:51 - INFO - __main__ - Step 23343: {'lr': 0.00047478167745604495, 'samples': 4481856, 'steps': 23342, 'loss/train': 1.4305206537246704} 11/07/2021 00:30:52 - INFO - __main__ - Step 23344: {'lr': 0.00047477935470481434, 'samples': 4482048, 'steps': 23343, 'loss/train': 1.8772815465927124} 11/07/2021 00:30:52 - INFO - __main__ - Step 23345: {'lr': 0.00047477703185230157, 'samples': 4482240, 'steps': 23344, 'loss/train': 1.6674097776412964} 11/07/2021 00:30:52 - INFO - __main__ - Step 23346: {'lr': 0.00047477470889850784, 'samples': 4482432, 'steps': 23345, 'loss/train': 1.761983871459961} 11/07/2021 00:30:53 - INFO - __main__ - Step 23347: {'lr': 0.00047477238584343407, 'samples': 4482624, 'steps': 23346, 'loss/train': 1.2408764362335205} 11/07/2021 00:30:53 - INFO - __main__ - Step 23348: {'lr': 0.00047477006268708134, 'samples': 4482816, 'steps': 23347, 'loss/train': 1.8868632316589355} 11/07/2021 00:30:54 - INFO - __main__ - Step 23349: {'lr': 0.00047476773942945063, 'samples': 4483008, 'steps': 23348, 'loss/train': 1.5643318891525269} 11/07/2021 00:30:55 - INFO - __main__ - Step 23350: {'lr': 0.00047476541607054313, 'samples': 4483200, 'steps': 23349, 'loss/train': 0.34093764424324036} 11/07/2021 00:30:55 - INFO - __main__ - Step 23351: {'lr': 0.0004747630926103597, 'samples': 4483392, 'steps': 23350, 'loss/train': 1.2700945138931274} 11/07/2021 00:30:55 - INFO - __main__ - Step 23352: {'lr': 0.0004747607690489015, 'samples': 4483584, 'steps': 23351, 'loss/train': 1.5466938018798828} 11/07/2021 00:30:56 - INFO - __main__ - Step 23353: {'lr': 0.00047475844538616966, 'samples': 4483776, 'steps': 23352, 'loss/train': 1.6921113729476929} 11/07/2021 00:30:57 - INFO - __main__ - Step 23354: {'lr': 0.0004747561216221651, 'samples': 4483968, 'steps': 23353, 'loss/train': 2.1484296321868896} 11/07/2021 00:30:57 - INFO - __main__ - Step 23355: {'lr': 0.0004747537977568889, 'samples': 4484160, 'steps': 23354, 'loss/train': 0.8833921551704407} 11/07/2021 00:30:57 - INFO - __main__ - Step 23356: {'lr': 0.00047475147379034206, 'samples': 4484352, 'steps': 23355, 'loss/train': 1.4659409523010254} 11/07/2021 00:30:58 - INFO - __main__ - Step 23357: {'lr': 0.0004747491497225257, 'samples': 4484544, 'steps': 23356, 'loss/train': 1.5523254871368408} 11/07/2021 00:30:58 - INFO - __main__ - Step 23358: {'lr': 0.00047474682555344083, 'samples': 4484736, 'steps': 23357, 'loss/train': 1.6649246215820312} 11/07/2021 00:30:59 - INFO - __main__ - Step 23359: {'lr': 0.00047474450128308853, 'samples': 4484928, 'steps': 23358, 'loss/train': 1.7693043947219849} 11/07/2021 00:30:59 - INFO - __main__ - Step 23360: {'lr': 0.0004747421769114698, 'samples': 4485120, 'steps': 23359, 'loss/train': 1.51509428024292} 11/07/2021 00:31:00 - INFO - __main__ - Step 23361: {'lr': 0.00047473985243858577, 'samples': 4485312, 'steps': 23360, 'loss/train': 1.0619897842407227} 11/07/2021 00:31:00 - INFO - __main__ - Step 23362: {'lr': 0.00047473752786443736, 'samples': 4485504, 'steps': 23361, 'loss/train': 1.8989733457565308} 11/07/2021 00:31:00 - INFO - __main__ - Step 23363: {'lr': 0.0004747352031890257, 'samples': 4485696, 'steps': 23362, 'loss/train': 1.4395792484283447} 11/07/2021 00:31:02 - INFO - __main__ - Step 23364: {'lr': 0.0004747328784123519, 'samples': 4485888, 'steps': 23363, 'loss/train': 1.523278832435608} 11/07/2021 00:31:02 - INFO - __main__ - Step 23365: {'lr': 0.00047473055353441685, 'samples': 4486080, 'steps': 23364, 'loss/train': 1.8186616897583008} 11/07/2021 00:31:02 - INFO - __main__ - Step 23366: {'lr': 0.0004747282285552217, 'samples': 4486272, 'steps': 23365, 'loss/train': 1.4175355434417725} 11/07/2021 00:31:03 - INFO - __main__ - Step 23367: {'lr': 0.0004747259034747675, 'samples': 4486464, 'steps': 23366, 'loss/train': 1.718819499015808} 11/07/2021 00:31:03 - INFO - __main__ - Step 23368: {'lr': 0.00047472357829305524, 'samples': 4486656, 'steps': 23367, 'loss/train': 1.7335102558135986} 11/07/2021 00:31:04 - INFO - __main__ - Step 23369: {'lr': 0.0004747212530100861, 'samples': 4486848, 'steps': 23368, 'loss/train': 1.7119697332382202} 11/07/2021 00:31:04 - INFO - __main__ - Step 23370: {'lr': 0.0004747189276258609, 'samples': 4487040, 'steps': 23369, 'loss/train': 1.813303828239441} 11/07/2021 00:31:05 - INFO - __main__ - Step 23371: {'lr': 0.0004747166021403809, 'samples': 4487232, 'steps': 23370, 'loss/train': 1.7188760042190552} 11/07/2021 00:31:05 - INFO - __main__ - Step 23372: {'lr': 0.000474714276553647, 'samples': 4487424, 'steps': 23371, 'loss/train': 1.8126929998397827} 11/07/2021 00:31:05 - INFO - __main__ - Step 23373: {'lr': 0.00047471195086566035, 'samples': 4487616, 'steps': 23372, 'loss/train': 1.1047699451446533} 11/07/2021 00:31:06 - INFO - __main__ - Step 23374: {'lr': 0.000474709625076422, 'samples': 4487808, 'steps': 23373, 'loss/train': 2.256373167037964} 11/07/2021 00:31:07 - INFO - __main__ - Step 23375: {'lr': 0.0004747072991859329, 'samples': 4488000, 'steps': 23374, 'loss/train': 1.7977204322814941} 11/07/2021 00:31:07 - INFO - __main__ - Step 23376: {'lr': 0.0004747049731941942, 'samples': 4488192, 'steps': 23375, 'loss/train': 0.995120108127594} 11/07/2021 00:31:07 - INFO - __main__ - Step 23377: {'lr': 0.0004747026471012069, 'samples': 4488384, 'steps': 23376, 'loss/train': 1.240659236907959} 11/07/2021 00:31:08 - INFO - __main__ - Step 23378: {'lr': 0.000474700320906972, 'samples': 4488576, 'steps': 23377, 'loss/train': 1.879565715789795} 11/07/2021 00:31:08 - INFO - __main__ - Step 23379: {'lr': 0.0004746979946114907, 'samples': 4488768, 'steps': 23378, 'loss/train': 1.5867724418640137} 11/07/2021 00:31:09 - INFO - __main__ - Step 23380: {'lr': 0.000474695668214764, 'samples': 4488960, 'steps': 23379, 'loss/train': 1.5567258596420288} 11/07/2021 00:31:09 - INFO - __main__ - Step 23381: {'lr': 0.00047469334171679266, 'samples': 4489152, 'steps': 23380, 'loss/train': 1.4264450073242188} 11/07/2021 00:31:10 - INFO - __main__ - Step 23382: {'lr': 0.00047469101511757815, 'samples': 4489344, 'steps': 23381, 'loss/train': 1.7234845161437988} 11/07/2021 00:31:10 - INFO - __main__ - Step 23383: {'lr': 0.00047468868841712134, 'samples': 4489536, 'steps': 23382, 'loss/train': 1.4425300359725952} 11/07/2021 00:31:10 - INFO - __main__ - Step 23384: {'lr': 0.00047468636161542325, 'samples': 4489728, 'steps': 23383, 'loss/train': 1.4994603395462036} 11/07/2021 00:31:11 - INFO - __main__ - Step 23385: {'lr': 0.0004746840347124849, 'samples': 4489920, 'steps': 23384, 'loss/train': 1.5308146476745605} 11/07/2021 00:31:12 - INFO - __main__ - Step 23386: {'lr': 0.0004746817077083074, 'samples': 4490112, 'steps': 23385, 'loss/train': 1.2547379732131958} 11/07/2021 00:31:12 - INFO - __main__ - Step 23387: {'lr': 0.00047467938060289185, 'samples': 4490304, 'steps': 23386, 'loss/train': 1.469014048576355} 11/07/2021 00:31:13 - INFO - __main__ - Step 23388: {'lr': 0.0004746770533962391, 'samples': 4490496, 'steps': 23387, 'loss/train': 1.6473876237869263} 11/07/2021 00:31:13 - INFO - __main__ - Step 23389: {'lr': 0.0004746747260883505, 'samples': 4490688, 'steps': 23388, 'loss/train': 1.6516025066375732} 11/07/2021 00:31:14 - INFO - __main__ - Step 23390: {'lr': 0.0004746723986792268, 'samples': 4490880, 'steps': 23389, 'loss/train': 1.4926345348358154} 11/07/2021 00:31:14 - INFO - __main__ - Step 23391: {'lr': 0.0004746700711688693, 'samples': 4491072, 'steps': 23390, 'loss/train': 1.5572574138641357} 11/07/2021 00:31:15 - INFO - __main__ - Step 23392: {'lr': 0.0004746677435572789, 'samples': 4491264, 'steps': 23391, 'loss/train': 1.267289161682129} 11/07/2021 00:31:15 - INFO - __main__ - Step 23393: {'lr': 0.00047466541584445667, 'samples': 4491456, 'steps': 23392, 'loss/train': 1.4724624156951904} 11/07/2021 00:31:15 - INFO - __main__ - Step 23394: {'lr': 0.0004746630880304037, 'samples': 4491648, 'steps': 23393, 'loss/train': 1.650766134262085} 11/07/2021 00:31:16 - INFO - __main__ - Step 23395: {'lr': 0.0004746607601151209, 'samples': 4491840, 'steps': 23394, 'loss/train': 1.074090838432312} 11/07/2021 00:31:17 - INFO - __main__ - Step 23396: {'lr': 0.0004746584320986096, 'samples': 4492032, 'steps': 23395, 'loss/train': 1.498843789100647} 11/07/2021 00:31:17 - INFO - __main__ - Step 23397: {'lr': 0.0004746561039808706, 'samples': 4492224, 'steps': 23396, 'loss/train': 1.0642423629760742} 11/07/2021 00:31:17 - INFO - __main__ - Step 23398: {'lr': 0.0004746537757619049, 'samples': 4492416, 'steps': 23397, 'loss/train': 1.7150928974151611} 11/07/2021 00:31:18 - INFO - __main__ - Step 23399: {'lr': 0.00047465144744171387, 'samples': 4492608, 'steps': 23398, 'loss/train': 1.2741963863372803} 11/07/2021 00:31:18 - INFO - __main__ - Step 23400: {'lr': 0.0004746491190202983, 'samples': 4492800, 'steps': 23399, 'loss/train': 1.5653659105300903} 11/07/2021 00:31:19 - INFO - __main__ - Step 23401: {'lr': 0.00047464679049765926, 'samples': 4492992, 'steps': 23400, 'loss/train': 1.7427978515625} 11/07/2021 00:31:20 - INFO - __main__ - Step 23402: {'lr': 0.00047464446187379787, 'samples': 4493184, 'steps': 23401, 'loss/train': 1.8643656969070435} 11/07/2021 00:31:20 - INFO - __main__ - Step 23403: {'lr': 0.00047464213314871514, 'samples': 4493376, 'steps': 23402, 'loss/train': 1.5454072952270508} 11/07/2021 00:31:20 - INFO - __main__ - Step 23404: {'lr': 0.0004746398043224122, 'samples': 4493568, 'steps': 23403, 'loss/train': 1.82225501537323} 11/07/2021 00:31:21 - INFO - __main__ - Step 23405: {'lr': 0.0004746374753948899, 'samples': 4493760, 'steps': 23404, 'loss/train': 1.66446852684021} 11/07/2021 00:31:22 - INFO - __main__ - Step 23406: {'lr': 0.00047463514636614945, 'samples': 4493952, 'steps': 23405, 'loss/train': 0.7836999893188477} 11/07/2021 00:31:22 - INFO - __main__ - Step 23407: {'lr': 0.00047463281723619203, 'samples': 4494144, 'steps': 23406, 'loss/train': 1.2050637006759644} 11/07/2021 00:31:22 - INFO - __main__ - Step 23408: {'lr': 0.00047463048800501837, 'samples': 4494336, 'steps': 23407, 'loss/train': 1.3217005729675293} 11/07/2021 00:31:23 - INFO - __main__ - Step 23409: {'lr': 0.00047462815867262967, 'samples': 4494528, 'steps': 23408, 'loss/train': 1.623984456062317} 11/07/2021 00:31:23 - INFO - __main__ - Step 23410: {'lr': 0.0004746258292390271, 'samples': 4494720, 'steps': 23409, 'loss/train': 1.389218807220459} 11/07/2021 00:31:24 - INFO - __main__ - Step 23411: {'lr': 0.00047462349970421147, 'samples': 4494912, 'steps': 23410, 'loss/train': 1.4058815240859985} 11/07/2021 00:31:25 - INFO - __main__ - Step 23412: {'lr': 0.0004746211700681841, 'samples': 4495104, 'steps': 23411, 'loss/train': 1.352455973625183} 11/07/2021 00:31:25 - INFO - __main__ - Step 23413: {'lr': 0.0004746188403309457, 'samples': 4495296, 'steps': 23412, 'loss/train': 1.3618943691253662} 11/07/2021 00:31:25 - INFO - __main__ - Step 23414: {'lr': 0.00047461651049249764, 'samples': 4495488, 'steps': 23413, 'loss/train': 1.349570393562317} 11/07/2021 00:31:26 - INFO - __main__ - Step 23415: {'lr': 0.0004746141805528409, 'samples': 4495680, 'steps': 23414, 'loss/train': 1.1598076820373535} 11/07/2021 00:31:27 - INFO - __main__ - Step 23416: {'lr': 0.00047461185051197644, 'samples': 4495872, 'steps': 23415, 'loss/train': 0.9544047117233276} 11/07/2021 00:31:27 - INFO - __main__ - Step 23417: {'lr': 0.0004746095203699053, 'samples': 4496064, 'steps': 23416, 'loss/train': 1.0990289449691772} 11/07/2021 00:31:28 - INFO - __main__ - Step 23418: {'lr': 0.00047460719012662857, 'samples': 4496256, 'steps': 23417, 'loss/train': 1.5603042840957642} 11/07/2021 00:31:28 - INFO - __main__ - Step 23419: {'lr': 0.00047460485978214733, 'samples': 4496448, 'steps': 23418, 'loss/train': 1.791231393814087} 11/07/2021 00:31:28 - INFO - __main__ - Step 23420: {'lr': 0.00047460252933646265, 'samples': 4496640, 'steps': 23419, 'loss/train': 1.5584287643432617} 11/07/2021 00:31:29 - INFO - __main__ - Step 23421: {'lr': 0.0004746001987895755, 'samples': 4496832, 'steps': 23420, 'loss/train': 0.7658491134643555} 11/07/2021 00:31:30 - INFO - __main__ - Step 23422: {'lr': 0.00047459786814148697, 'samples': 4497024, 'steps': 23421, 'loss/train': 0.8147704005241394} 11/07/2021 00:31:30 - INFO - __main__ - Step 23423: {'lr': 0.0004745955373921981, 'samples': 4497216, 'steps': 23422, 'loss/train': 1.684801697731018} 11/07/2021 00:31:30 - INFO - __main__ - Step 23424: {'lr': 0.0004745932065417099, 'samples': 4497408, 'steps': 23423, 'loss/train': 0.9494284987449646} 11/07/2021 00:31:31 - INFO - __main__ - Step 23425: {'lr': 0.00047459087559002355, 'samples': 4497600, 'steps': 23424, 'loss/train': 0.885067343711853} 11/07/2021 00:31:31 - INFO - __main__ - Step 23426: {'lr': 0.00047458854453713995, 'samples': 4497792, 'steps': 23425, 'loss/train': 1.3821614980697632} 11/07/2021 00:31:32 - INFO - __main__ - Step 23427: {'lr': 0.0004745862133830603, 'samples': 4497984, 'steps': 23426, 'loss/train': 1.4860104322433472} 11/07/2021 00:31:33 - INFO - __main__ - Step 23428: {'lr': 0.00047458388212778547, 'samples': 4498176, 'steps': 23427, 'loss/train': 1.527248740196228} 11/07/2021 00:31:33 - INFO - __main__ - Step 23429: {'lr': 0.00047458155077131664, 'samples': 4498368, 'steps': 23428, 'loss/train': 1.6233726739883423} 11/07/2021 00:31:33 - INFO - __main__ - Step 23430: {'lr': 0.0004745792193136549, 'samples': 4498560, 'steps': 23429, 'loss/train': 1.2394837141036987} 11/07/2021 00:31:34 - INFO - __main__ - Step 23431: {'lr': 0.00047457688775480114, 'samples': 4498752, 'steps': 23430, 'loss/train': 1.7909239530563354} 11/07/2021 00:31:35 - INFO - __main__ - Step 23432: {'lr': 0.0004745745560947565, 'samples': 4498944, 'steps': 23431, 'loss/train': 1.5196155309677124} 11/07/2021 00:31:35 - INFO - __main__ - Step 23433: {'lr': 0.0004745722243335221, 'samples': 4499136, 'steps': 23432, 'loss/train': 1.977839708328247} 11/07/2021 00:31:35 - INFO - __main__ - Step 23434: {'lr': 0.0004745698924710988, 'samples': 4499328, 'steps': 23433, 'loss/train': 1.6531399488449097} 11/07/2021 00:31:36 - INFO - __main__ - Step 23435: {'lr': 0.00047456756050748793, 'samples': 4499520, 'steps': 23434, 'loss/train': 1.1196846961975098} 11/07/2021 00:31:36 - INFO - __main__ - Step 23436: {'lr': 0.0004745652284426903, 'samples': 4499712, 'steps': 23435, 'loss/train': 1.790719747543335} 11/07/2021 00:31:36 - INFO - __main__ - Step 23437: {'lr': 0.00047456289627670703, 'samples': 4499904, 'steps': 23436, 'loss/train': 1.2567437887191772} 11/07/2021 00:31:37 - INFO - __main__ - Step 23438: {'lr': 0.0004745605640095392, 'samples': 4500096, 'steps': 23437, 'loss/train': 1.2304730415344238} 11/07/2021 00:31:38 - INFO - __main__ - Step 23439: {'lr': 0.00047455823164118787, 'samples': 4500288, 'steps': 23438, 'loss/train': 2.25645112991333} 11/07/2021 00:31:38 - INFO - __main__ - Step 23440: {'lr': 0.00047455589917165406, 'samples': 4500480, 'steps': 23439, 'loss/train': 1.7855865955352783} 11/07/2021 00:31:38 - INFO - __main__ - Step 23441: {'lr': 0.00047455356660093886, 'samples': 4500672, 'steps': 23440, 'loss/train': 1.5271779298782349} 11/07/2021 00:31:39 - INFO - __main__ - Step 23442: {'lr': 0.0004745512339290432, 'samples': 4500864, 'steps': 23441, 'loss/train': 1.948983073234558} 11/07/2021 00:31:40 - INFO - __main__ - Step 23443: {'lr': 0.00047454890115596824, 'samples': 4501056, 'steps': 23442, 'loss/train': 1.409691333770752} 11/07/2021 00:31:40 - INFO - __main__ - Step 23444: {'lr': 0.00047454656828171504, 'samples': 4501248, 'steps': 23443, 'loss/train': 1.1951667070388794} 11/07/2021 00:31:40 - INFO - __main__ - Step 23445: {'lr': 0.0004745442353062846, 'samples': 4501440, 'steps': 23444, 'loss/train': 1.2143166065216064} 11/07/2021 00:31:41 - INFO - __main__ - Step 23446: {'lr': 0.000474541902229678, 'samples': 4501632, 'steps': 23445, 'loss/train': 1.2001657485961914} 11/07/2021 00:31:41 - INFO - __main__ - Step 23447: {'lr': 0.0004745395690518963, 'samples': 4501824, 'steps': 23446, 'loss/train': 1.7846274375915527} 11/07/2021 00:31:42 - INFO - __main__ - Step 23448: {'lr': 0.0004745372357729405, 'samples': 4502016, 'steps': 23447, 'loss/train': 1.5660029649734497} 11/07/2021 00:31:43 - INFO - __main__ - Step 23449: {'lr': 0.0004745349023928117, 'samples': 4502208, 'steps': 23448, 'loss/train': 1.749601125717163} 11/07/2021 00:31:43 - INFO - __main__ - Step 23450: {'lr': 0.000474532568911511, 'samples': 4502400, 'steps': 23449, 'loss/train': 1.29769766330719} 11/07/2021 00:31:43 - INFO - __main__ - Step 23451: {'lr': 0.00047453023532903927, 'samples': 4502592, 'steps': 23450, 'loss/train': 1.027752161026001} 11/07/2021 00:31:44 - INFO - __main__ - Step 23452: {'lr': 0.00047452790164539775, 'samples': 4502784, 'steps': 23451, 'loss/train': 1.5923694372177124} 11/07/2021 00:31:45 - INFO - __main__ - Step 23453: {'lr': 0.00047452556786058744, 'samples': 4502976, 'steps': 23452, 'loss/train': 1.835474967956543} 11/07/2021 00:31:45 - INFO - __main__ - Step 23454: {'lr': 0.0004745232339746094, 'samples': 4503168, 'steps': 23453, 'loss/train': 1.6376953125} 11/07/2021 00:31:45 - INFO - __main__ - Step 23455: {'lr': 0.00047452089998746463, 'samples': 4503360, 'steps': 23454, 'loss/train': 1.2960706949234009} 11/07/2021 00:31:46 - INFO - __main__ - Step 23456: {'lr': 0.0004745185658991541, 'samples': 4503552, 'steps': 23455, 'loss/train': 1.8943711519241333} 11/07/2021 00:31:46 - INFO - __main__ - Step 23457: {'lr': 0.0004745162317096791, 'samples': 4503744, 'steps': 23456, 'loss/train': 1.8373547792434692} 11/07/2021 00:31:47 - INFO - __main__ - Step 23458: {'lr': 0.0004745138974190405, 'samples': 4503936, 'steps': 23457, 'loss/train': 1.8252520561218262} 11/07/2021 00:31:47 - INFO - __main__ - Step 23459: {'lr': 0.0004745115630272394, 'samples': 4504128, 'steps': 23458, 'loss/train': 1.5691437721252441} 11/07/2021 00:31:48 - INFO - __main__ - Step 23460: {'lr': 0.00047450922853427686, 'samples': 4504320, 'steps': 23459, 'loss/train': 1.3738198280334473} 11/07/2021 00:31:48 - INFO - __main__ - Step 23461: {'lr': 0.0004745068939401539, 'samples': 4504512, 'steps': 23460, 'loss/train': 1.3941560983657837} 11/07/2021 00:31:49 - INFO - __main__ - Step 23462: {'lr': 0.0004745045592448717, 'samples': 4504704, 'steps': 23461, 'loss/train': 1.6935898065567017} 11/07/2021 00:31:50 - INFO - __main__ - Step 23463: {'lr': 0.00047450222444843105, 'samples': 4504896, 'steps': 23462, 'loss/train': 1.527276873588562} 11/07/2021 00:31:50 - INFO - __main__ - Step 23464: {'lr': 0.0004744998895508333, 'samples': 4505088, 'steps': 23463, 'loss/train': 1.3515911102294922} 11/07/2021 00:31:50 - INFO - __main__ - Step 23465: {'lr': 0.0004744975545520793, 'samples': 4505280, 'steps': 23464, 'loss/train': 1.4795407056808472} 11/07/2021 00:31:51 - INFO - __main__ - Step 23466: {'lr': 0.00047449521945217016, 'samples': 4505472, 'steps': 23465, 'loss/train': 1.0448362827301025} 11/07/2021 00:31:51 - INFO - __main__ - Step 23467: {'lr': 0.00047449288425110693, 'samples': 4505664, 'steps': 23466, 'loss/train': 1.4558610916137695} 11/07/2021 00:31:52 - INFO - __main__ - Step 23468: {'lr': 0.00047449054894889073, 'samples': 4505856, 'steps': 23467, 'loss/train': 1.4664283990859985} 11/07/2021 00:31:53 - INFO - __main__ - Step 23469: {'lr': 0.00047448821354552253, 'samples': 4506048, 'steps': 23468, 'loss/train': 1.1906158924102783} 11/07/2021 00:31:53 - INFO - __main__ - Step 23470: {'lr': 0.0004744858780410034, 'samples': 4506240, 'steps': 23469, 'loss/train': 0.4644726514816284} 11/07/2021 00:31:53 - INFO - __main__ - Step 23471: {'lr': 0.0004744835424353344, 'samples': 4506432, 'steps': 23470, 'loss/train': 1.616047739982605} 11/07/2021 00:31:54 - INFO - __main__ - Step 23472: {'lr': 0.00047448120672851653, 'samples': 4506624, 'steps': 23471, 'loss/train': 1.0819038152694702} 11/07/2021 00:31:54 - INFO - __main__ - Step 23473: {'lr': 0.0004744788709205509, 'samples': 4506816, 'steps': 23472, 'loss/train': 1.925014853477478} 11/07/2021 00:31:55 - INFO - __main__ - Step 23474: {'lr': 0.0004744765350114386, 'samples': 4507008, 'steps': 23473, 'loss/train': 1.4738006591796875} 11/07/2021 00:31:56 - INFO - __main__ - Step 23475: {'lr': 0.00047447419900118067, 'samples': 4507200, 'steps': 23474, 'loss/train': 1.3570666313171387} 11/07/2021 00:31:56 - INFO - __main__ - Step 23476: {'lr': 0.00047447186288977804, 'samples': 4507392, 'steps': 23475, 'loss/train': 2.686858892440796} 11/07/2021 00:31:56 - INFO - __main__ - Step 23477: {'lr': 0.0004744695266772319, 'samples': 4507584, 'steps': 23476, 'loss/train': 1.099247694015503} 11/07/2021 00:31:57 - INFO - __main__ - Step 23478: {'lr': 0.00047446719036354324, 'samples': 4507776, 'steps': 23477, 'loss/train': 1.62332022190094} 11/07/2021 00:31:58 - INFO - __main__ - Step 23479: {'lr': 0.0004744648539487132, 'samples': 4507968, 'steps': 23478, 'loss/train': 1.5295865535736084} 11/07/2021 00:31:58 - INFO - __main__ - Step 23480: {'lr': 0.00047446251743274263, 'samples': 4508160, 'steps': 23479, 'loss/train': 1.2681844234466553} 11/07/2021 00:31:58 - INFO - __main__ - Step 23481: {'lr': 0.0004744601808156328, 'samples': 4508352, 'steps': 23480, 'loss/train': 1.326259970664978} 11/07/2021 00:31:59 - INFO - __main__ - Step 23482: {'lr': 0.00047445784409738467, 'samples': 4508544, 'steps': 23481, 'loss/train': 1.8529151678085327} 11/07/2021 00:31:59 - INFO - __main__ - Step 23483: {'lr': 0.0004744555072779993, 'samples': 4508736, 'steps': 23482, 'loss/train': 1.6085487604141235} 11/07/2021 00:31:59 - INFO - __main__ - Step 23484: {'lr': 0.0004744531703574777, 'samples': 4508928, 'steps': 23483, 'loss/train': 1.3744274377822876} 11/07/2021 00:32:01 - INFO - __main__ - Step 23485: {'lr': 0.00047445083333582104, 'samples': 4509120, 'steps': 23484, 'loss/train': 1.4718713760375977} 11/07/2021 00:32:01 - INFO - __main__ - Step 23486: {'lr': 0.00047444849621303023, 'samples': 4509312, 'steps': 23485, 'loss/train': 1.6028051376342773} 11/07/2021 00:32:01 - INFO - __main__ - Step 23487: {'lr': 0.00047444615898910644, 'samples': 4509504, 'steps': 23486, 'loss/train': 1.021246314048767} 11/07/2021 00:32:02 - INFO - __main__ - Step 23488: {'lr': 0.00047444382166405067, 'samples': 4509696, 'steps': 23487, 'loss/train': 5.782830238342285} 11/07/2021 00:32:02 - INFO - __main__ - Step 23489: {'lr': 0.0004744414842378639, 'samples': 4509888, 'steps': 23488, 'loss/train': 1.5995277166366577} 11/07/2021 00:32:03 - INFO - __main__ - Step 23490: {'lr': 0.0004744391467105473, 'samples': 4510080, 'steps': 23489, 'loss/train': 0.973374605178833} 11/07/2021 00:32:03 - INFO - __main__ - Step 23491: {'lr': 0.00047443680908210194, 'samples': 4510272, 'steps': 23490, 'loss/train': 2.7277424335479736} 11/07/2021 00:32:04 - INFO - __main__ - Step 23492: {'lr': 0.00047443447135252876, 'samples': 4510464, 'steps': 23491, 'loss/train': 1.391707181930542} 11/07/2021 00:32:04 - INFO - __main__ - Step 23493: {'lr': 0.0004744321335218289, 'samples': 4510656, 'steps': 23492, 'loss/train': 1.6771727800369263} 11/07/2021 00:32:04 - INFO - __main__ - Step 23494: {'lr': 0.0004744297955900034, 'samples': 4510848, 'steps': 23493, 'loss/train': 1.001266360282898} 11/07/2021 00:32:05 - INFO - __main__ - Step 23495: {'lr': 0.00047442745755705326, 'samples': 4511040, 'steps': 23494, 'loss/train': 1.4900996685028076} 11/07/2021 00:32:06 - INFO - __main__ - Step 23496: {'lr': 0.00047442511942297953, 'samples': 4511232, 'steps': 23495, 'loss/train': 1.4952882528305054} 11/07/2021 00:32:06 - INFO - __main__ - Step 23497: {'lr': 0.00047442278118778336, 'samples': 4511424, 'steps': 23496, 'loss/train': 1.5954856872558594} 11/07/2021 00:32:06 - INFO - __main__ - Step 23498: {'lr': 0.0004744204428514658, 'samples': 4511616, 'steps': 23497, 'loss/train': 1.7899006605148315} 11/07/2021 00:32:07 - INFO - __main__ - Step 23499: {'lr': 0.00047441810441402777, 'samples': 4511808, 'steps': 23498, 'loss/train': 0.658106803894043} 11/07/2021 00:32:07 - INFO - __main__ - Step 23500: {'lr': 0.0004744157658754704, 'samples': 4512000, 'steps': 23499, 'loss/train': 1.4799785614013672} 11/07/2021 00:32:08 - INFO - __main__ - Step 23501: {'lr': 0.0004744134272357948, 'samples': 4512192, 'steps': 23500, 'loss/train': 1.6532243490219116} 11/07/2021 00:32:09 - INFO - __main__ - Step 23502: {'lr': 0.0004744110884950019, 'samples': 4512384, 'steps': 23501, 'loss/train': 2.228314161300659} 11/07/2021 00:32:09 - INFO - __main__ - Step 23503: {'lr': 0.00047440874965309286, 'samples': 4512576, 'steps': 23502, 'loss/train': 1.303419589996338} 11/07/2021 00:32:09 - INFO - __main__ - Step 23504: {'lr': 0.00047440641071006874, 'samples': 4512768, 'steps': 23503, 'loss/train': 1.6008127927780151} 11/07/2021 00:32:10 - INFO - __main__ - Step 23505: {'lr': 0.00047440407166593056, 'samples': 4512960, 'steps': 23504, 'loss/train': 1.4751851558685303} 11/07/2021 00:32:11 - INFO - __main__ - Step 23506: {'lr': 0.0004744017325206793, 'samples': 4513152, 'steps': 23505, 'loss/train': 5.9636006355285645} 11/07/2021 00:32:11 - INFO - __main__ - Step 23507: {'lr': 0.00047439939327431613, 'samples': 4513344, 'steps': 23506, 'loss/train': 2.0052008628845215} 11/07/2021 00:32:11 - INFO - __main__ - Step 23508: {'lr': 0.0004743970539268421, 'samples': 4513536, 'steps': 23507, 'loss/train': 1.7036689519882202} 11/07/2021 00:32:12 - INFO - __main__ - Step 23509: {'lr': 0.00047439471447825813, 'samples': 4513728, 'steps': 23508, 'loss/train': 2.1218676567077637} 11/07/2021 00:32:12 - INFO - __main__ - Step 23510: {'lr': 0.00047439237492856543, 'samples': 4513920, 'steps': 23509, 'loss/train': 1.58950674533844} 11/07/2021 00:32:12 - INFO - __main__ - Step 23511: {'lr': 0.0004743900352777649, 'samples': 4514112, 'steps': 23510, 'loss/train': 1.7900227308273315} 11/07/2021 00:32:13 - INFO - __main__ - Step 23512: {'lr': 0.0004743876955258578, 'samples': 4514304, 'steps': 23511, 'loss/train': 1.8968209028244019} 11/07/2021 00:32:14 - INFO - __main__ - Step 23513: {'lr': 0.00047438535567284504, 'samples': 4514496, 'steps': 23512, 'loss/train': 1.653573751449585} 11/07/2021 00:32:14 - INFO - __main__ - Step 23514: {'lr': 0.00047438301571872763, 'samples': 4514688, 'steps': 23513, 'loss/train': 1.7201824188232422} 11/07/2021 00:32:14 - INFO - __main__ - Step 23515: {'lr': 0.00047438067566350675, 'samples': 4514880, 'steps': 23514, 'loss/train': 1.5156289339065552} 11/07/2021 00:32:15 - INFO - __main__ - Step 23516: {'lr': 0.00047437833550718336, 'samples': 4515072, 'steps': 23515, 'loss/train': 1.3097409009933472} 11/07/2021 00:32:16 - INFO - __main__ - Step 23517: {'lr': 0.0004743759952497586, 'samples': 4515264, 'steps': 23516, 'loss/train': 1.3028639554977417} 11/07/2021 00:32:16 - INFO - __main__ - Step 23518: {'lr': 0.0004743736548912334, 'samples': 4515456, 'steps': 23517, 'loss/train': 1.5353991985321045} 11/07/2021 00:32:17 - INFO - __main__ - Step 23519: {'lr': 0.00047437131443160897, 'samples': 4515648, 'steps': 23518, 'loss/train': 1.5825148820877075} 11/07/2021 00:32:17 - INFO - __main__ - Step 23520: {'lr': 0.0004743689738708863, 'samples': 4515840, 'steps': 23519, 'loss/train': 0.6383938193321228} 11/07/2021 00:32:17 - INFO - __main__ - Step 23521: {'lr': 0.0004743666332090664, 'samples': 4516032, 'steps': 23520, 'loss/train': 1.2356423139572144} 11/07/2021 00:32:18 - INFO - __main__ - Step 23522: {'lr': 0.00047436429244615037, 'samples': 4516224, 'steps': 23521, 'loss/train': 1.6078827381134033} 11/07/2021 00:32:19 - INFO - __main__ - Step 23523: {'lr': 0.0004743619515821392, 'samples': 4516416, 'steps': 23522, 'loss/train': 1.2691593170166016} 11/07/2021 00:32:19 - INFO - __main__ - Step 23524: {'lr': 0.00047435961061703403, 'samples': 4516608, 'steps': 23523, 'loss/train': 1.7472574710845947} 11/07/2021 00:32:19 - INFO - __main__ - Step 23525: {'lr': 0.00047435726955083593, 'samples': 4516800, 'steps': 23524, 'loss/train': 1.865501880645752} 11/07/2021 00:32:20 - INFO - __main__ - Step 23526: {'lr': 0.0004743549283835459, 'samples': 4516992, 'steps': 23525, 'loss/train': 1.6130586862564087} 11/07/2021 00:32:21 - INFO - __main__ - Step 23527: {'lr': 0.00047435258711516496, 'samples': 4517184, 'steps': 23526, 'loss/train': 1.3543508052825928} 11/07/2021 00:32:21 - INFO - __main__ - Step 23528: {'lr': 0.0004743502457456942, 'samples': 4517376, 'steps': 23527, 'loss/train': 1.4319701194763184} 11/07/2021 00:32:21 - INFO - __main__ - Step 23529: {'lr': 0.0004743479042751347, 'samples': 4517568, 'steps': 23528, 'loss/train': 2.1460018157958984} 11/07/2021 00:32:22 - INFO - __main__ - Step 23530: {'lr': 0.0004743455627034875, 'samples': 4517760, 'steps': 23529, 'loss/train': 1.602501392364502} 11/07/2021 00:32:22 - INFO - __main__ - Step 23531: {'lr': 0.0004743432210307536, 'samples': 4517952, 'steps': 23530, 'loss/train': 1.7073982954025269} 11/07/2021 00:32:23 - INFO - __main__ - Step 23532: {'lr': 0.00047434087925693415, 'samples': 4518144, 'steps': 23531, 'loss/train': 1.697928547859192} 11/07/2021 00:32:23 - INFO - __main__ - Step 23533: {'lr': 0.00047433853738203013, 'samples': 4518336, 'steps': 23532, 'loss/train': 1.2396883964538574} 11/07/2021 00:32:24 - INFO - __main__ - Step 23534: {'lr': 0.00047433619540604264, 'samples': 4518528, 'steps': 23533, 'loss/train': 2.2235891819000244} 11/07/2021 00:32:24 - INFO - __main__ - Step 23535: {'lr': 0.0004743338533289728, 'samples': 4518720, 'steps': 23534, 'loss/train': 1.1533797979354858} 11/07/2021 00:32:25 - INFO - __main__ - Step 23536: {'lr': 0.0004743315111508215, 'samples': 4518912, 'steps': 23535, 'loss/train': 1.5415807962417603} 11/07/2021 00:32:25 - INFO - __main__ - Step 23537: {'lr': 0.00047432916887158995, 'samples': 4519104, 'steps': 23536, 'loss/train': 1.1145853996276855} 11/07/2021 00:32:26 - INFO - __main__ - Step 23538: {'lr': 0.00047432682649127913, 'samples': 4519296, 'steps': 23537, 'loss/train': 1.335025668144226} 11/07/2021 00:32:26 - INFO - __main__ - Step 23539: {'lr': 0.00047432448400989004, 'samples': 4519488, 'steps': 23538, 'loss/train': 1.3478022813796997} 11/07/2021 00:32:27 - INFO - __main__ - Step 23540: {'lr': 0.0004743221414274238, 'samples': 4519680, 'steps': 23539, 'loss/train': 1.494612693786621} 11/07/2021 00:32:27 - INFO - __main__ - Step 23541: {'lr': 0.00047431979874388154, 'samples': 4519872, 'steps': 23540, 'loss/train': 1.2911465167999268} 11/07/2021 00:32:28 - INFO - __main__ - Step 23542: {'lr': 0.0004743174559592642, 'samples': 4520064, 'steps': 23541, 'loss/train': 1.6779345273971558} 11/07/2021 00:32:28 - INFO - __main__ - Step 23543: {'lr': 0.0004743151130735729, 'samples': 4520256, 'steps': 23542, 'loss/train': 0.7486419677734375} 11/07/2021 00:32:29 - INFO - __main__ - Step 23544: {'lr': 0.0004743127700868086, 'samples': 4520448, 'steps': 23543, 'loss/train': 1.1932936906814575} 11/07/2021 00:32:29 - INFO - __main__ - Step 23545: {'lr': 0.00047431042699897245, 'samples': 4520640, 'steps': 23544, 'loss/train': 1.5103429555892944} 11/07/2021 00:32:29 - INFO - __main__ - Step 23546: {'lr': 0.0004743080838100655, 'samples': 4520832, 'steps': 23545, 'loss/train': 1.6619280576705933} 11/07/2021 00:32:30 - INFO - __main__ - Step 23547: {'lr': 0.0004743057405200888, 'samples': 4521024, 'steps': 23546, 'loss/train': 0.13306494057178497} 11/07/2021 00:32:31 - INFO - __main__ - Step 23548: {'lr': 0.0004743033971290434, 'samples': 4521216, 'steps': 23547, 'loss/train': 1.6675859689712524} 11/07/2021 00:32:31 - INFO - __main__ - Step 23549: {'lr': 0.00047430105363693034, 'samples': 4521408, 'steps': 23548, 'loss/train': 1.4825546741485596} 11/07/2021 00:32:32 - INFO - __main__ - Step 23550: {'lr': 0.0004742987100437507, 'samples': 4521600, 'steps': 23549, 'loss/train': 1.565748691558838} 11/07/2021 00:32:32 - INFO - __main__ - Step 23551: {'lr': 0.00047429636634950545, 'samples': 4521792, 'steps': 23550, 'loss/train': 1.7220901250839233} 11/07/2021 00:32:33 - INFO - __main__ - Step 23552: {'lr': 0.0004742940225541958, 'samples': 4521984, 'steps': 23551, 'loss/train': 1.357005000114441} 11/07/2021 00:32:33 - INFO - __main__ - Step 23553: {'lr': 0.0004742916786578227, 'samples': 4522176, 'steps': 23552, 'loss/train': 1.6558088064193726} 11/07/2021 00:32:34 - INFO - __main__ - Step 23554: {'lr': 0.00047428933466038726, 'samples': 4522368, 'steps': 23553, 'loss/train': 1.8793753385543823} 11/07/2021 00:32:34 - INFO - __main__ - Step 23555: {'lr': 0.00047428699056189047, 'samples': 4522560, 'steps': 23554, 'loss/train': 1.76802396774292} 11/07/2021 00:32:34 - INFO - __main__ - Step 23556: {'lr': 0.0004742846463623334, 'samples': 4522752, 'steps': 23555, 'loss/train': 1.8358832597732544} 11/07/2021 00:32:35 - INFO - __main__ - Step 23557: {'lr': 0.0004742823020617172, 'samples': 4522944, 'steps': 23556, 'loss/train': 1.4426796436309814} 11/07/2021 00:32:36 - INFO - __main__ - Step 23558: {'lr': 0.0004742799576600427, 'samples': 4523136, 'steps': 23557, 'loss/train': 1.814741611480713} 11/07/2021 00:32:36 - INFO - __main__ - Step 23559: {'lr': 0.00047427761315731133, 'samples': 4523328, 'steps': 23558, 'loss/train': 6.1855244636535645} 11/07/2021 00:32:36 - INFO - __main__ - Step 23560: {'lr': 0.0004742752685535238, 'samples': 4523520, 'steps': 23559, 'loss/train': 1.8517687320709229} 11/07/2021 00:32:37 - INFO - __main__ - Step 23561: {'lr': 0.00047427292384868134, 'samples': 4523712, 'steps': 23560, 'loss/train': 1.7739310264587402} 11/07/2021 00:32:37 - INFO - __main__ - Step 23562: {'lr': 0.0004742705790427849, 'samples': 4523904, 'steps': 23561, 'loss/train': 1.4885367155075073} 11/07/2021 00:32:38 - INFO - __main__ - Step 23563: {'lr': 0.00047426823413583563, 'samples': 4524096, 'steps': 23562, 'loss/train': 1.743016242980957} 11/07/2021 00:32:39 - INFO - __main__ - Step 23564: {'lr': 0.0004742658891278346, 'samples': 4524288, 'steps': 23563, 'loss/train': 1.435360074043274} 11/07/2021 00:32:39 - INFO - __main__ - Step 23565: {'lr': 0.0004742635440187828, 'samples': 4524480, 'steps': 23564, 'loss/train': 1.9791362285614014} 11/07/2021 00:32:39 - INFO - __main__ - Step 23566: {'lr': 0.00047426119880868123, 'samples': 4524672, 'steps': 23565, 'loss/train': 1.8396615982055664} 11/07/2021 00:32:40 - INFO - __main__ - Step 23567: {'lr': 0.00047425885349753114, 'samples': 4524864, 'steps': 23566, 'loss/train': 2.3755545616149902} 11/07/2021 00:32:41 - INFO - __main__ - Step 23568: {'lr': 0.0004742565080853334, 'samples': 4525056, 'steps': 23567, 'loss/train': 1.3045393228530884} 11/07/2021 00:32:41 - INFO - __main__ - Step 23569: {'lr': 0.00047425416257208916, 'samples': 4525248, 'steps': 23568, 'loss/train': 1.8492480516433716} 11/07/2021 00:32:41 - INFO - __main__ - Step 23570: {'lr': 0.0004742518169577994, 'samples': 4525440, 'steps': 23569, 'loss/train': 0.9899062514305115} 11/07/2021 00:32:42 - INFO - __main__ - Step 23571: {'lr': 0.0004742494712424653, 'samples': 4525632, 'steps': 23570, 'loss/train': 1.6598799228668213} 11/07/2021 00:32:42 - INFO - __main__ - Step 23572: {'lr': 0.0004742471254260878, 'samples': 4525824, 'steps': 23571, 'loss/train': 1.4144138097763062} 11/07/2021 00:32:43 - INFO - __main__ - Step 23573: {'lr': 0.0004742447795086681, 'samples': 4526016, 'steps': 23572, 'loss/train': 1.53965163230896} 11/07/2021 00:32:43 - INFO - __main__ - Step 23574: {'lr': 0.00047424243349020705, 'samples': 4526208, 'steps': 23573, 'loss/train': 1.7604814767837524} 11/07/2021 00:32:44 - INFO - __main__ - Step 23575: {'lr': 0.0004742400873707059, 'samples': 4526400, 'steps': 23574, 'loss/train': 1.8213621377944946} 11/07/2021 00:32:44 - INFO - __main__ - Step 23576: {'lr': 0.0004742377411501656, 'samples': 4526592, 'steps': 23575, 'loss/train': 1.6603103876113892} 11/07/2021 00:32:44 - INFO - __main__ - Step 23577: {'lr': 0.00047423539482858724, 'samples': 4526784, 'steps': 23576, 'loss/train': 1.6569072008132935} 11/07/2021 00:32:45 - INFO - __main__ - Step 23578: {'lr': 0.0004742330484059718, 'samples': 4526976, 'steps': 23577, 'loss/train': 1.5407042503356934} 11/07/2021 00:32:46 - INFO - __main__ - Step 23579: {'lr': 0.0004742307018823205, 'samples': 4527168, 'steps': 23578, 'loss/train': 1.5448529720306396} 11/07/2021 00:32:46 - INFO - __main__ - Step 23580: {'lr': 0.0004742283552576343, 'samples': 4527360, 'steps': 23579, 'loss/train': 1.6284503936767578} 11/07/2021 00:32:47 - INFO - __main__ - Step 23581: {'lr': 0.0004742260085319142, 'samples': 4527552, 'steps': 23580, 'loss/train': 1.5191917419433594} 11/07/2021 00:32:47 - INFO - __main__ - Step 23582: {'lr': 0.0004742236617051614, 'samples': 4527744, 'steps': 23581, 'loss/train': 1.9894447326660156} 11/07/2021 00:32:48 - INFO - __main__ - Step 23583: {'lr': 0.00047422131477737684, 'samples': 4527936, 'steps': 23582, 'loss/train': 1.6390395164489746} 11/07/2021 00:32:48 - INFO - __main__ - Step 23584: {'lr': 0.00047421896774856156, 'samples': 4528128, 'steps': 23583, 'loss/train': 1.4163792133331299} 11/07/2021 00:32:49 - INFO - __main__ - Step 23585: {'lr': 0.00047421662061871675, 'samples': 4528320, 'steps': 23584, 'loss/train': 1.1335351467132568} 11/07/2021 00:32:49 - INFO - __main__ - Step 23586: {'lr': 0.0004742142733878433, 'samples': 4528512, 'steps': 23585, 'loss/train': 1.5170916318893433} 11/07/2021 00:32:49 - INFO - __main__ - Step 23587: {'lr': 0.0004742119260559424, 'samples': 4528704, 'steps': 23586, 'loss/train': 2.1219773292541504} 11/07/2021 00:32:50 - INFO - __main__ - Step 23588: {'lr': 0.0004742095786230152, 'samples': 4528896, 'steps': 23587, 'loss/train': 1.6570745706558228} 11/07/2021 00:32:51 - INFO - __main__ - Step 23589: {'lr': 0.00047420723108906247, 'samples': 4529088, 'steps': 23588, 'loss/train': 0.7063665986061096} 11/07/2021 00:32:51 - INFO - __main__ - Step 23590: {'lr': 0.0004742048834540855, 'samples': 4529280, 'steps': 23589, 'loss/train': 1.5621718168258667} 11/07/2021 00:32:51 - INFO - __main__ - Step 23591: {'lr': 0.0004742025357180852, 'samples': 4529472, 'steps': 23590, 'loss/train': 1.342724084854126} 11/07/2021 00:32:52 - INFO - __main__ - Step 23592: {'lr': 0.00047420018788106274, 'samples': 4529664, 'steps': 23591, 'loss/train': 1.7082022428512573} 11/07/2021 00:32:53 - INFO - __main__ - Step 23593: {'lr': 0.00047419783994301915, 'samples': 4529856, 'steps': 23592, 'loss/train': 0.6484775543212891} 11/07/2021 00:32:53 - INFO - __main__ - Step 23594: {'lr': 0.0004741954919039554, 'samples': 4530048, 'steps': 23593, 'loss/train': 2.074641466140747} 11/07/2021 00:32:53 - INFO - __main__ - Step 23595: {'lr': 0.0004741931437638727, 'samples': 4530240, 'steps': 23594, 'loss/train': 1.5737087726593018} 11/07/2021 00:32:54 - INFO - __main__ - Step 23596: {'lr': 0.000474190795522772, 'samples': 4530432, 'steps': 23595, 'loss/train': 1.5619804859161377} 11/07/2021 00:32:54 - INFO - __main__ - Step 23597: {'lr': 0.00047418844718065433, 'samples': 4530624, 'steps': 23596, 'loss/train': 1.6500996351242065} 11/07/2021 00:32:54 - INFO - __main__ - Step 23598: {'lr': 0.0004741860987375209, 'samples': 4530816, 'steps': 23597, 'loss/train': 1.5560089349746704} 11/07/2021 00:32:56 - INFO - __main__ - Step 23599: {'lr': 0.00047418375019337263, 'samples': 4531008, 'steps': 23598, 'loss/train': 1.3333616256713867} 11/07/2021 00:32:56 - INFO - __main__ - Step 23600: {'lr': 0.00047418140154821065, 'samples': 4531200, 'steps': 23599, 'loss/train': 1.4364854097366333} 11/07/2021 00:32:56 - INFO - __main__ - Step 23601: {'lr': 0.00047417905280203594, 'samples': 4531392, 'steps': 23600, 'loss/train': 1.3260753154754639} 11/07/2021 00:32:57 - INFO - __main__ - Step 23602: {'lr': 0.00047417670395484963, 'samples': 4531584, 'steps': 23601, 'loss/train': 1.3692469596862793} 11/07/2021 00:32:57 - INFO - __main__ - Step 23603: {'lr': 0.0004741743550066527, 'samples': 4531776, 'steps': 23602, 'loss/train': 1.2754849195480347} 11/07/2021 00:32:58 - INFO - __main__ - Step 23604: {'lr': 0.00047417200595744637, 'samples': 4531968, 'steps': 23603, 'loss/train': 1.112447738647461} 11/07/2021 00:32:58 - INFO - __main__ - Step 23605: {'lr': 0.0004741696568072316, 'samples': 4532160, 'steps': 23604, 'loss/train': 1.3442082405090332} 11/07/2021 00:32:59 - INFO - __main__ - Step 23606: {'lr': 0.00047416730755600936, 'samples': 4532352, 'steps': 23605, 'loss/train': 1.3705646991729736} 11/07/2021 00:32:59 - INFO - __main__ - Step 23607: {'lr': 0.0004741649582037808, 'samples': 4532544, 'steps': 23606, 'loss/train': 1.8504846096038818} 11/07/2021 00:32:59 - INFO - __main__ - Step 23608: {'lr': 0.000474162608750547, 'samples': 4532736, 'steps': 23607, 'loss/train': 2.0150325298309326} 11/07/2021 00:33:00 - INFO - __main__ - Step 23609: {'lr': 0.000474160259196309, 'samples': 4532928, 'steps': 23608, 'loss/train': 1.2820730209350586} 11/07/2021 00:33:01 - INFO - __main__ - Step 23610: {'lr': 0.0004741579095410678, 'samples': 4533120, 'steps': 23609, 'loss/train': 1.6252810955047607} 11/07/2021 00:33:01 - INFO - __main__ - Step 23611: {'lr': 0.0004741555597848245, 'samples': 4533312, 'steps': 23610, 'loss/train': 1.7831580638885498} 11/07/2021 00:33:01 - INFO - __main__ - Step 23612: {'lr': 0.00047415320992758025, 'samples': 4533504, 'steps': 23611, 'loss/train': 1.6855032444000244} 11/07/2021 00:33:02 - INFO - __main__ - Step 23613: {'lr': 0.00047415085996933593, 'samples': 4533696, 'steps': 23612, 'loss/train': 1.319211721420288} 11/07/2021 00:33:03 - INFO - __main__ - Step 23614: {'lr': 0.00047414850991009275, 'samples': 4533888, 'steps': 23613, 'loss/train': 1.65987229347229} 11/07/2021 00:33:03 - INFO - __main__ - Step 23615: {'lr': 0.00047414615974985164, 'samples': 4534080, 'steps': 23614, 'loss/train': 0.9914699792861938} 11/07/2021 00:33:04 - INFO - __main__ - Step 23616: {'lr': 0.0004741438094886138, 'samples': 4534272, 'steps': 23615, 'loss/train': 1.6350009441375732} 11/07/2021 00:33:04 - INFO - __main__ - Step 23617: {'lr': 0.00047414145912638017, 'samples': 4534464, 'steps': 23616, 'loss/train': 2.2170348167419434} 11/07/2021 00:33:04 - INFO - __main__ - Step 23618: {'lr': 0.00047413910866315193, 'samples': 4534656, 'steps': 23617, 'loss/train': 1.1351796388626099} 11/07/2021 00:33:05 - INFO - __main__ - Step 23619: {'lr': 0.00047413675809893, 'samples': 4534848, 'steps': 23618, 'loss/train': 1.7873129844665527} 11/07/2021 00:33:06 - INFO - __main__ - Step 23620: {'lr': 0.0004741344074337155, 'samples': 4535040, 'steps': 23619, 'loss/train': 1.7315502166748047} 11/07/2021 00:33:06 - INFO - __main__ - Step 23621: {'lr': 0.00047413205666750955, 'samples': 4535232, 'steps': 23620, 'loss/train': 1.3562512397766113} 11/07/2021 00:33:06 - INFO - __main__ - Step 23622: {'lr': 0.0004741297058003131, 'samples': 4535424, 'steps': 23621, 'loss/train': 1.4748079776763916} 11/07/2021 00:33:07 - INFO - __main__ - Step 23623: {'lr': 0.00047412735483212725, 'samples': 4535616, 'steps': 23622, 'loss/train': 1.5654685497283936} 11/07/2021 00:33:08 - INFO - __main__ - Step 23624: {'lr': 0.0004741250037629531, 'samples': 4535808, 'steps': 23623, 'loss/train': 1.7665811777114868} 11/07/2021 00:33:08 - INFO - __main__ - Step 23625: {'lr': 0.00047412265259279176, 'samples': 4536000, 'steps': 23624, 'loss/train': 1.4605038166046143} 11/07/2021 00:33:08 - INFO - __main__ - Step 23626: {'lr': 0.0004741203013216441, 'samples': 4536192, 'steps': 23625, 'loss/train': 1.5320707559585571} 11/07/2021 00:33:09 - INFO - __main__ - Step 23627: {'lr': 0.0004741179499495113, 'samples': 4536384, 'steps': 23626, 'loss/train': 1.793100118637085} 11/07/2021 00:33:09 - INFO - __main__ - Step 23628: {'lr': 0.00047411559847639447, 'samples': 4536576, 'steps': 23627, 'loss/train': 2.0532431602478027} 11/07/2021 00:33:10 - INFO - __main__ - Step 23629: {'lr': 0.0004741132469022946, 'samples': 4536768, 'steps': 23628, 'loss/train': 1.0437623262405396} 11/07/2021 00:33:10 - INFO - __main__ - Step 23630: {'lr': 0.00047411089522721275, 'samples': 4536960, 'steps': 23629, 'loss/train': 2.1174731254577637} 11/07/2021 00:33:11 - INFO - __main__ - Step 23631: {'lr': 0.00047410854345114996, 'samples': 4537152, 'steps': 23630, 'loss/train': 0.6693663597106934} 11/07/2021 00:33:11 - INFO - __main__ - Step 23632: {'lr': 0.0004741061915741073, 'samples': 4537344, 'steps': 23631, 'loss/train': 1.8703787326812744} 11/07/2021 00:33:11 - INFO - __main__ - Step 23633: {'lr': 0.0004741038395960859, 'samples': 4537536, 'steps': 23632, 'loss/train': 1.3280048370361328} 11/07/2021 00:33:12 - INFO - __main__ - Step 23634: {'lr': 0.0004741014875170867, 'samples': 4537728, 'steps': 23633, 'loss/train': 1.6490123271942139} 11/07/2021 00:33:13 - INFO - __main__ - Step 23635: {'lr': 0.0004740991353371109, 'samples': 4537920, 'steps': 23634, 'loss/train': 1.6307944059371948} 11/07/2021 00:33:13 - INFO - __main__ - Step 23636: {'lr': 0.0004740967830561595, 'samples': 4538112, 'steps': 23635, 'loss/train': 1.108677864074707} 11/07/2021 00:33:14 - INFO - __main__ - Step 23637: {'lr': 0.0004740944306742335, 'samples': 4538304, 'steps': 23636, 'loss/train': 1.346545934677124} 11/07/2021 00:33:14 - INFO - __main__ - Step 23638: {'lr': 0.00047409207819133406, 'samples': 4538496, 'steps': 23637, 'loss/train': 1.454825520515442} 11/07/2021 00:33:14 - INFO - __main__ - Step 23639: {'lr': 0.0004740897256074621, 'samples': 4538688, 'steps': 23638, 'loss/train': 1.580127239227295} 11/07/2021 00:33:15 - INFO - __main__ - Step 23640: {'lr': 0.00047408737292261883, 'samples': 4538880, 'steps': 23639, 'loss/train': 0.9592517614364624} 11/07/2021 00:33:16 - INFO - __main__ - Step 23641: {'lr': 0.0004740850201368052, 'samples': 4539072, 'steps': 23640, 'loss/train': 1.6052862405776978} 11/07/2021 00:33:16 - INFO - __main__ - Step 23642: {'lr': 0.00047408266725002234, 'samples': 4539264, 'steps': 23641, 'loss/train': 1.3032270669937134} 11/07/2021 00:33:16 - INFO - __main__ - Step 23643: {'lr': 0.00047408031426227136, 'samples': 4539456, 'steps': 23642, 'loss/train': 1.8782343864440918} 11/07/2021 00:33:17 - INFO - __main__ - Step 23644: {'lr': 0.0004740779611735532, 'samples': 4539648, 'steps': 23643, 'loss/train': 1.4260307550430298} 11/07/2021 00:33:18 - INFO - __main__ - Step 23645: {'lr': 0.00047407560798386894, 'samples': 4539840, 'steps': 23644, 'loss/train': 1.8145923614501953} 11/07/2021 00:33:18 - INFO - __main__ - Step 23646: {'lr': 0.00047407325469321973, 'samples': 4540032, 'steps': 23645, 'loss/train': 1.4783227443695068} 11/07/2021 00:33:18 - INFO - __main__ - Step 23647: {'lr': 0.0004740709013016065, 'samples': 4540224, 'steps': 23646, 'loss/train': 1.334385633468628} 11/07/2021 00:33:19 - INFO - __main__ - Step 23648: {'lr': 0.0004740685478090304, 'samples': 4540416, 'steps': 23647, 'loss/train': 1.4337353706359863} 11/07/2021 00:33:19 - INFO - __main__ - Step 23649: {'lr': 0.00047406619421549247, 'samples': 4540608, 'steps': 23648, 'loss/train': 1.5111690759658813} 11/07/2021 00:33:20 - INFO - __main__ - Step 23650: {'lr': 0.0004740638405209938, 'samples': 4540800, 'steps': 23649, 'loss/train': 0.9457454681396484} 11/07/2021 00:33:21 - INFO - __main__ - Step 23651: {'lr': 0.0004740614867255353, 'samples': 4540992, 'steps': 23650, 'loss/train': 1.8303914070129395} 11/07/2021 00:33:21 - INFO - __main__ - Step 23652: {'lr': 0.0004740591328291183, 'samples': 4541184, 'steps': 23651, 'loss/train': 1.987184762954712} 11/07/2021 00:33:21 - INFO - __main__ - Step 23653: {'lr': 0.0004740567788317437, 'samples': 4541376, 'steps': 23652, 'loss/train': 0.5793235301971436} 11/07/2021 00:33:22 - INFO - __main__ - Step 23654: {'lr': 0.00047405442473341246, 'samples': 4541568, 'steps': 23653, 'loss/train': 1.494931936264038} 11/07/2021 00:33:23 - INFO - __main__ - Step 23655: {'lr': 0.0004740520705341259, 'samples': 4541760, 'steps': 23654, 'loss/train': 1.5534117221832275} 11/07/2021 00:33:23 - INFO - __main__ - Step 23656: {'lr': 0.0004740497162338848, 'samples': 4541952, 'steps': 23655, 'loss/train': 1.853484869003296} 11/07/2021 00:33:23 - INFO - __main__ - Step 23657: {'lr': 0.00047404736183269045, 'samples': 4542144, 'steps': 23656, 'loss/train': 1.5425233840942383} 11/07/2021 00:33:24 - INFO - __main__ - Step 23658: {'lr': 0.0004740450073305438, 'samples': 4542336, 'steps': 23657, 'loss/train': 0.7227396368980408} 11/07/2021 00:33:24 - INFO - __main__ - Step 23659: {'lr': 0.00047404265272744586, 'samples': 4542528, 'steps': 23658, 'loss/train': 1.440316081047058} 11/07/2021 00:33:25 - INFO - __main__ - Step 23660: {'lr': 0.0004740402980233978, 'samples': 4542720, 'steps': 23659, 'loss/train': 2.089398145675659} 11/07/2021 00:33:26 - INFO - __main__ - Step 23661: {'lr': 0.00047403794321840064, 'samples': 4542912, 'steps': 23660, 'loss/train': 1.4879977703094482} 11/07/2021 00:33:26 - INFO - __main__ - Step 23662: {'lr': 0.0004740355883124555, 'samples': 4543104, 'steps': 23661, 'loss/train': 1.8466298580169678} 11/07/2021 00:33:26 - INFO - __main__ - Step 23663: {'lr': 0.0004740332333055633, 'samples': 4543296, 'steps': 23662, 'loss/train': 1.6448774337768555} 11/07/2021 00:33:27 - INFO - __main__ - Step 23664: {'lr': 0.00047403087819772517, 'samples': 4543488, 'steps': 23663, 'loss/train': 1.5351874828338623} 11/07/2021 00:33:27 - INFO - __main__ - Step 23665: {'lr': 0.0004740285229889423, 'samples': 4543680, 'steps': 23664, 'loss/train': 1.558956503868103} 11/07/2021 00:33:29 - INFO - __main__ - Step 23666: {'lr': 0.0004740261676792155, 'samples': 4543872, 'steps': 23665, 'loss/train': 0.8988344669342041} 11/07/2021 00:33:29 - INFO - __main__ - Step 23667: {'lr': 0.00047402381226854606, 'samples': 4544064, 'steps': 23666, 'loss/train': 1.6064356565475464} 11/07/2021 00:33:29 - INFO - __main__ - Step 23668: {'lr': 0.0004740214567569349, 'samples': 4544256, 'steps': 23667, 'loss/train': 0.2895216941833496} 11/07/2021 00:33:30 - INFO - __main__ - Step 23669: {'lr': 0.00047401910114438313, 'samples': 4544448, 'steps': 23668, 'loss/train': 1.3126306533813477} 11/07/2021 00:33:30 - INFO - __main__ - Step 23670: {'lr': 0.0004740167454308918, 'samples': 4544640, 'steps': 23669, 'loss/train': 1.4516186714172363} 11/07/2021 00:33:31 - INFO - __main__ - Step 23671: {'lr': 0.00047401438961646206, 'samples': 4544832, 'steps': 23670, 'loss/train': 1.4971883296966553} 11/07/2021 00:33:31 - INFO - __main__ - Step 23672: {'lr': 0.0004740120337010948, 'samples': 4545024, 'steps': 23671, 'loss/train': 1.5388046503067017} 11/07/2021 00:33:32 - INFO - __main__ - Step 23673: {'lr': 0.0004740096776847912, 'samples': 4545216, 'steps': 23672, 'loss/train': 1.6670520305633545} 11/07/2021 00:33:32 - INFO - __main__ - Step 23674: {'lr': 0.0004740073215675523, 'samples': 4545408, 'steps': 23673, 'loss/train': 2.9824087619781494} 11/07/2021 00:33:33 - INFO - __main__ - Step 23675: {'lr': 0.00047400496534937914, 'samples': 4545600, 'steps': 23674, 'loss/train': 1.6415568590164185} 11/07/2021 00:33:33 - INFO - __main__ - Step 23676: {'lr': 0.00047400260903027283, 'samples': 4545792, 'steps': 23675, 'loss/train': 1.2974762916564941} 11/07/2021 00:33:34 - INFO - __main__ - Step 23677: {'lr': 0.0004740002526102344, 'samples': 4545984, 'steps': 23676, 'loss/train': 1.6875091791152954} 11/07/2021 00:33:34 - INFO - __main__ - Step 23678: {'lr': 0.0004739978960892649, 'samples': 4546176, 'steps': 23677, 'loss/train': 1.305275797843933} 11/07/2021 00:33:35 - INFO - __main__ - Step 23679: {'lr': 0.0004739955394673654, 'samples': 4546368, 'steps': 23678, 'loss/train': 1.652904748916626} 11/07/2021 00:33:35 - INFO - __main__ - Step 23680: {'lr': 0.000473993182744537, 'samples': 4546560, 'steps': 23679, 'loss/train': 1.9933096170425415} 11/07/2021 00:33:35 - INFO - __main__ - Step 23681: {'lr': 0.0004739908259207807, 'samples': 4546752, 'steps': 23680, 'loss/train': 1.8885129690170288} 11/07/2021 00:33:36 - INFO - __main__ - Step 23682: {'lr': 0.00047398846899609755, 'samples': 4546944, 'steps': 23681, 'loss/train': 1.5400047302246094} 11/07/2021 00:33:37 - INFO - __main__ - Step 23683: {'lr': 0.0004739861119704887, 'samples': 4547136, 'steps': 23682, 'loss/train': 0.4810228943824768} 11/07/2021 00:33:37 - INFO - __main__ - Step 23684: {'lr': 0.00047398375484395517, 'samples': 4547328, 'steps': 23683, 'loss/train': 0.52688068151474} 11/07/2021 00:33:38 - INFO - __main__ - Step 23685: {'lr': 0.00047398139761649794, 'samples': 4547520, 'steps': 23684, 'loss/train': 1.4102141857147217} 11/07/2021 00:33:38 - INFO - __main__ - Step 23686: {'lr': 0.00047397904028811824, 'samples': 4547712, 'steps': 23685, 'loss/train': 0.9328953623771667} 11/07/2021 00:33:38 - INFO - __main__ - Step 23687: {'lr': 0.000473976682858817, 'samples': 4547904, 'steps': 23686, 'loss/train': 1.6522045135498047} 11/07/2021 00:33:39 - INFO - __main__ - Step 23688: {'lr': 0.00047397432532859533, 'samples': 4548096, 'steps': 23687, 'loss/train': 1.5263078212738037} 11/07/2021 00:33:40 - INFO - __main__ - Step 23689: {'lr': 0.00047397196769745435, 'samples': 4548288, 'steps': 23688, 'loss/train': 2.3755886554718018} 11/07/2021 00:33:40 - INFO - __main__ - Step 23690: {'lr': 0.00047396960996539495, 'samples': 4548480, 'steps': 23689, 'loss/train': 1.8563801050186157} 11/07/2021 00:33:40 - INFO - __main__ - Step 23691: {'lr': 0.00047396725213241835, 'samples': 4548672, 'steps': 23690, 'loss/train': 1.7212902307510376} 11/07/2021 00:33:41 - INFO - __main__ - Step 23692: {'lr': 0.0004739648941985256, 'samples': 4548864, 'steps': 23691, 'loss/train': 1.688942551612854} 11/07/2021 00:33:42 - INFO - __main__ - Step 23693: {'lr': 0.00047396253616371767, 'samples': 4549056, 'steps': 23692, 'loss/train': 1.251467227935791} 11/07/2021 00:33:42 - INFO - __main__ - Step 23694: {'lr': 0.00047396017802799566, 'samples': 4549248, 'steps': 23693, 'loss/train': 1.5889075994491577} 11/07/2021 00:33:42 - INFO - __main__ - Step 23695: {'lr': 0.0004739578197913607, 'samples': 4549440, 'steps': 23694, 'loss/train': 1.5907773971557617} 11/07/2021 00:33:43 - INFO - __main__ - Step 23696: {'lr': 0.00047395546145381377, 'samples': 4549632, 'steps': 23695, 'loss/train': 1.313461184501648} 11/07/2021 00:33:43 - INFO - __main__ - Step 23697: {'lr': 0.000473953103015356, 'samples': 4549824, 'steps': 23696, 'loss/train': 1.1808024644851685} 11/07/2021 00:33:44 - INFO - __main__ - Step 23698: {'lr': 0.0004739507444759884, 'samples': 4550016, 'steps': 23697, 'loss/train': 2.071528196334839} 11/07/2021 00:33:45 - INFO - __main__ - Step 23699: {'lr': 0.0004739483858357121, 'samples': 4550208, 'steps': 23698, 'loss/train': 1.835963249206543} 11/07/2021 00:33:45 - INFO - __main__ - Step 23700: {'lr': 0.00047394602709452806, 'samples': 4550400, 'steps': 23699, 'loss/train': 0.18768851459026337} 11/07/2021 00:33:45 - INFO - __main__ - Step 23701: {'lr': 0.0004739436682524373, 'samples': 4550592, 'steps': 23700, 'loss/train': 1.2520087957382202} 11/07/2021 00:33:46 - INFO - __main__ - Step 23702: {'lr': 0.00047394130930944115, 'samples': 4550784, 'steps': 23701, 'loss/train': 1.1662386655807495} 11/07/2021 00:33:47 - INFO - __main__ - Step 23703: {'lr': 0.0004739389502655404, 'samples': 4550976, 'steps': 23702, 'loss/train': 1.2932790517807007} 11/07/2021 00:33:47 - INFO - __main__ - Step 23704: {'lr': 0.0004739365911207363, 'samples': 4551168, 'steps': 23703, 'loss/train': 0.9172234535217285} 11/07/2021 00:33:48 - INFO - __main__ - Step 23705: {'lr': 0.0004739342318750297, 'samples': 4551360, 'steps': 23704, 'loss/train': 1.4475271701812744} 11/07/2021 00:33:48 - INFO - __main__ - Step 23706: {'lr': 0.00047393187252842183, 'samples': 4551552, 'steps': 23705, 'loss/train': 1.6185001134872437} 11/07/2021 00:33:48 - INFO - __main__ - Step 23707: {'lr': 0.0004739295130809138, 'samples': 4551744, 'steps': 23706, 'loss/train': 1.2477144002914429} 11/07/2021 00:33:49 - INFO - __main__ - Step 23708: {'lr': 0.0004739271535325065, 'samples': 4551936, 'steps': 23707, 'loss/train': 0.19320560991764069} 11/07/2021 00:33:50 - INFO - __main__ - Step 23709: {'lr': 0.00047392479388320106, 'samples': 4552128, 'steps': 23708, 'loss/train': 1.6384211778640747} 11/07/2021 00:33:50 - INFO - __main__ - Step 23710: {'lr': 0.0004739224341329987, 'samples': 4552320, 'steps': 23709, 'loss/train': 1.2958784103393555} 11/07/2021 00:33:50 - INFO - __main__ - Step 23711: {'lr': 0.0004739200742819002, 'samples': 4552512, 'steps': 23710, 'loss/train': 1.7707363367080688} 11/07/2021 00:33:51 - INFO - __main__ - Step 23712: {'lr': 0.0004739177143299068, 'samples': 4552704, 'steps': 23711, 'loss/train': 1.6255106925964355} 11/07/2021 00:33:52 - INFO - __main__ - Step 23713: {'lr': 0.00047391535427701966, 'samples': 4552896, 'steps': 23712, 'loss/train': 1.129692554473877} 11/07/2021 00:33:52 - INFO - __main__ - Step 23714: {'lr': 0.0004739129941232396, 'samples': 4553088, 'steps': 23713, 'loss/train': 1.1649988889694214} 11/07/2021 00:33:52 - INFO - __main__ - Step 23715: {'lr': 0.0004739106338685678, 'samples': 4553280, 'steps': 23714, 'loss/train': 1.1176279783248901} 11/07/2021 00:33:53 - INFO - __main__ - Step 23716: {'lr': 0.00047390827351300537, 'samples': 4553472, 'steps': 23715, 'loss/train': 1.1965147256851196} 11/07/2021 00:33:53 - INFO - __main__ - Step 23717: {'lr': 0.00047390591305655327, 'samples': 4553664, 'steps': 23716, 'loss/train': 1.4697896242141724} 11/07/2021 00:33:54 - INFO - __main__ - Step 23718: {'lr': 0.0004739035524992127, 'samples': 4553856, 'steps': 23717, 'loss/train': 1.6495895385742188} 11/07/2021 00:33:55 - INFO - __main__ - Step 23719: {'lr': 0.00047390119184098455, 'samples': 4554048, 'steps': 23718, 'loss/train': 1.7443530559539795} 11/07/2021 00:33:55 - INFO - __main__ - Step 23720: {'lr': 0.00047389883108187004, 'samples': 4554240, 'steps': 23719, 'loss/train': 1.670081615447998} 11/07/2021 00:33:55 - INFO - __main__ - Step 23721: {'lr': 0.00047389647022187014, 'samples': 4554432, 'steps': 23720, 'loss/train': 1.5686086416244507} 11/07/2021 00:33:56 - INFO - __main__ - Step 23722: {'lr': 0.000473894109260986, 'samples': 4554624, 'steps': 23721, 'loss/train': 1.5855025053024292} 11/07/2021 00:33:57 - INFO - __main__ - Step 23723: {'lr': 0.00047389174819921856, 'samples': 4554816, 'steps': 23722, 'loss/train': 1.3018687963485718} 11/07/2021 00:33:57 - INFO - __main__ - Step 23724: {'lr': 0.000473889387036569, 'samples': 4555008, 'steps': 23723, 'loss/train': 1.0580642223358154} 11/07/2021 00:33:57 - INFO - __main__ - Step 23725: {'lr': 0.0004738870257730383, 'samples': 4555200, 'steps': 23724, 'loss/train': 1.2072584629058838} 11/07/2021 00:33:58 - INFO - __main__ - Step 23726: {'lr': 0.00047388466440862755, 'samples': 4555392, 'steps': 23725, 'loss/train': 1.599613070487976} 11/07/2021 00:33:58 - INFO - __main__ - Step 23727: {'lr': 0.0004738823029433379, 'samples': 4555584, 'steps': 23726, 'loss/train': 1.6899038553237915} 11/07/2021 00:33:59 - INFO - __main__ - Step 23728: {'lr': 0.0004738799413771703, 'samples': 4555776, 'steps': 23727, 'loss/train': 1.2398737668991089} 11/07/2021 00:33:59 - INFO - __main__ - Step 23729: {'lr': 0.0004738775797101258, 'samples': 4555968, 'steps': 23728, 'loss/train': 1.8582526445388794} 11/07/2021 00:34:00 - INFO - __main__ - Step 23730: {'lr': 0.0004738752179422056, 'samples': 4556160, 'steps': 23729, 'loss/train': 1.6827188730239868} 11/07/2021 00:34:00 - INFO - __main__ - Step 23731: {'lr': 0.00047387285607341064, 'samples': 4556352, 'steps': 23730, 'loss/train': 1.5931893587112427} 11/07/2021 00:34:00 - INFO - __main__ - Step 23732: {'lr': 0.00047387049410374207, 'samples': 4556544, 'steps': 23731, 'loss/train': 1.499029517173767} 11/07/2021 00:34:01 - INFO - __main__ - Step 23733: {'lr': 0.00047386813203320084, 'samples': 4556736, 'steps': 23732, 'loss/train': 1.8736869096755981} 11/07/2021 00:34:02 - INFO - __main__ - Step 23734: {'lr': 0.0004738657698617881, 'samples': 4556928, 'steps': 23733, 'loss/train': 1.6546939611434937} 11/07/2021 00:34:02 - INFO - __main__ - Step 23735: {'lr': 0.00047386340758950494, 'samples': 4557120, 'steps': 23734, 'loss/train': 1.7338857650756836} 11/07/2021 00:34:02 - INFO - __main__ - Step 23736: {'lr': 0.0004738610452163523, 'samples': 4557312, 'steps': 23735, 'loss/train': 0.9500973224639893} 11/07/2021 00:34:03 - INFO - __main__ - Step 23737: {'lr': 0.00047385868274233144, 'samples': 4557504, 'steps': 23736, 'loss/train': 1.6614608764648438} 11/07/2021 00:34:03 - INFO - __main__ - Step 23738: {'lr': 0.0004738563201674432, 'samples': 4557696, 'steps': 23737, 'loss/train': 1.8918787240982056} 11/07/2021 00:34:04 - INFO - __main__ - Step 23739: {'lr': 0.00047385395749168885, 'samples': 4557888, 'steps': 23738, 'loss/train': 0.8991847038269043} 11/07/2021 00:34:04 - INFO - __main__ - Step 23740: {'lr': 0.00047385159471506936, 'samples': 4558080, 'steps': 23739, 'loss/train': 1.4461966753005981} 11/07/2021 00:34:05 - INFO - __main__ - Step 23741: {'lr': 0.00047384923183758573, 'samples': 4558272, 'steps': 23740, 'loss/train': 1.3142222166061401} 11/07/2021 00:34:05 - INFO - __main__ - Step 23742: {'lr': 0.0004738468688592391, 'samples': 4558464, 'steps': 23741, 'loss/train': 1.3879821300506592} 11/07/2021 00:34:05 - INFO - __main__ - Step 23743: {'lr': 0.00047384450578003055, 'samples': 4558656, 'steps': 23742, 'loss/train': 1.8437845706939697} 11/07/2021 00:34:07 - INFO - __main__ - Step 23744: {'lr': 0.00047384214259996117, 'samples': 4558848, 'steps': 23743, 'loss/train': 1.3957382440567017} 11/07/2021 00:34:07 - INFO - __main__ - Step 23745: {'lr': 0.0004738397793190319, 'samples': 4559040, 'steps': 23744, 'loss/train': 1.3842023611068726} 11/07/2021 00:34:07 - INFO - __main__ - Step 23746: {'lr': 0.00047383741593724386, 'samples': 4559232, 'steps': 23745, 'loss/train': 1.6281683444976807} 11/07/2021 00:34:08 - INFO - __main__ - Step 23747: {'lr': 0.0004738350524545982, 'samples': 4559424, 'steps': 23746, 'loss/train': 1.2148529291152954} 11/07/2021 00:34:08 - INFO - __main__ - Step 23748: {'lr': 0.0004738326888710959, 'samples': 4559616, 'steps': 23747, 'loss/train': 1.2071878910064697} 11/07/2021 00:34:09 - INFO - __main__ - Step 23749: {'lr': 0.000473830325186738, 'samples': 4559808, 'steps': 23748, 'loss/train': 1.7111480236053467} 11/07/2021 00:34:09 - INFO - __main__ - Step 23750: {'lr': 0.0004738279614015257, 'samples': 4560000, 'steps': 23749, 'loss/train': 1.5095964670181274} 11/07/2021 00:34:10 - INFO - __main__ - Step 23751: {'lr': 0.0004738255975154599, 'samples': 4560192, 'steps': 23750, 'loss/train': 1.6067538261413574} 11/07/2021 00:34:10 - INFO - __main__ - Step 23752: {'lr': 0.0004738232335285417, 'samples': 4560384, 'steps': 23751, 'loss/train': 1.4667108058929443} 11/07/2021 00:34:10 - INFO - __main__ - Step 23753: {'lr': 0.0004738208694407723, 'samples': 4560576, 'steps': 23752, 'loss/train': 1.8321415185928345} 11/07/2021 00:34:11 - INFO - __main__ - Step 23754: {'lr': 0.00047381850525215265, 'samples': 4560768, 'steps': 23753, 'loss/train': 1.9358234405517578} 11/07/2021 00:34:12 - INFO - __main__ - Step 23755: {'lr': 0.0004738161409626838, 'samples': 4560960, 'steps': 23754, 'loss/train': 1.4177026748657227} 11/07/2021 00:34:13 - INFO - __main__ - Step 23756: {'lr': 0.0004738137765723669, 'samples': 4561152, 'steps': 23755, 'loss/train': 1.2879034280776978} 11/07/2021 00:34:13 - INFO - __main__ - Step 23757: {'lr': 0.0004738114120812029, 'samples': 4561344, 'steps': 23756, 'loss/train': 1.8649377822875977} 11/07/2021 00:34:13 - INFO - __main__ - Step 23758: {'lr': 0.000473809047489193, 'samples': 4561536, 'steps': 23757, 'loss/train': 1.4992769956588745} 11/07/2021 00:34:14 - INFO - __main__ - Step 23759: {'lr': 0.00047380668279633814, 'samples': 4561728, 'steps': 23758, 'loss/train': 1.7122327089309692} 11/07/2021 00:34:15 - INFO - __main__ - Step 23760: {'lr': 0.00047380431800263945, 'samples': 4561920, 'steps': 23759, 'loss/train': 1.4459832906723022} 11/07/2021 00:34:15 - INFO - __main__ - Step 23761: {'lr': 0.000473801953108098, 'samples': 4562112, 'steps': 23760, 'loss/train': 2.1364073753356934} 11/07/2021 00:34:15 - INFO - __main__ - Step 23762: {'lr': 0.0004737995881127149, 'samples': 4562304, 'steps': 23761, 'loss/train': 0.9302211403846741} 11/07/2021 00:34:16 - INFO - __main__ - Step 23763: {'lr': 0.0004737972230164911, 'samples': 4562496, 'steps': 23762, 'loss/train': 1.7650277614593506} 11/07/2021 00:34:16 - INFO - __main__ - Step 23764: {'lr': 0.0004737948578194278, 'samples': 4562688, 'steps': 23763, 'loss/train': 1.2138371467590332} 11/07/2021 00:34:17 - INFO - __main__ - Step 23765: {'lr': 0.00047379249252152585, 'samples': 4562880, 'steps': 23764, 'loss/train': 1.6768308877944946} 11/07/2021 00:34:17 - INFO - __main__ - Step 23766: {'lr': 0.00047379012712278656, 'samples': 4563072, 'steps': 23765, 'loss/train': 1.929826259613037} 11/07/2021 00:34:18 - INFO - __main__ - Step 23767: {'lr': 0.0004737877616232108, 'samples': 4563264, 'steps': 23766, 'loss/train': 1.3135716915130615} 11/07/2021 00:34:18 - INFO - __main__ - Step 23768: {'lr': 0.0004737853960227998, 'samples': 4563456, 'steps': 23767, 'loss/train': 1.617723822593689} 11/07/2021 00:34:18 - INFO - __main__ - Step 23769: {'lr': 0.00047378303032155454, 'samples': 4563648, 'steps': 23768, 'loss/train': 1.3294570446014404} 11/07/2021 00:34:20 - INFO - __main__ - Step 23770: {'lr': 0.0004737806645194761, 'samples': 4563840, 'steps': 23769, 'loss/train': 0.6377082467079163} 11/07/2021 00:34:20 - INFO - __main__ - Step 23771: {'lr': 0.00047377829861656556, 'samples': 4564032, 'steps': 23770, 'loss/train': 0.9689741730690002} 11/07/2021 00:34:20 - INFO - __main__ - Step 23772: {'lr': 0.000473775932612824, 'samples': 4564224, 'steps': 23771, 'loss/train': 1.4321142435073853} 11/07/2021 00:34:21 - INFO - __main__ - Step 23773: {'lr': 0.00047377356650825245, 'samples': 4564416, 'steps': 23772, 'loss/train': 1.3861149549484253} 11/07/2021 00:34:21 - INFO - __main__ - Step 23774: {'lr': 0.00047377120030285194, 'samples': 4564608, 'steps': 23773, 'loss/train': 1.1893134117126465} 11/07/2021 00:34:22 - INFO - __main__ - Step 23775: {'lr': 0.0004737688339966235, 'samples': 4564800, 'steps': 23774, 'loss/train': 1.7470072507858276} 11/07/2021 00:34:22 - INFO - __main__ - Step 23776: {'lr': 0.00047376646758956844, 'samples': 4564992, 'steps': 23775, 'loss/train': 1.2687187194824219} 11/07/2021 00:34:23 - INFO - __main__ - Step 23777: {'lr': 0.00047376410108168756, 'samples': 4565184, 'steps': 23776, 'loss/train': 1.6561030149459839} 11/07/2021 00:34:23 - INFO - __main__ - Step 23778: {'lr': 0.0004737617344729821, 'samples': 4565376, 'steps': 23777, 'loss/train': 1.5290687084197998} 11/07/2021 00:34:23 - INFO - __main__ - Step 23779: {'lr': 0.00047375936776345297, 'samples': 4565568, 'steps': 23778, 'loss/train': 1.043493390083313} 11/07/2021 00:34:24 - INFO - __main__ - Step 23780: {'lr': 0.00047375700095310136, 'samples': 4565760, 'steps': 23779, 'loss/train': 1.7410577535629272} 11/07/2021 00:34:25 - INFO - __main__ - Step 23781: {'lr': 0.0004737546340419283, 'samples': 4565952, 'steps': 23780, 'loss/train': 1.6509655714035034} 11/07/2021 00:34:25 - INFO - __main__ - Step 23782: {'lr': 0.0004737522670299349, 'samples': 4566144, 'steps': 23781, 'loss/train': 1.9898253679275513} 11/07/2021 00:34:25 - INFO - __main__ - Step 23783: {'lr': 0.00047374989991712214, 'samples': 4566336, 'steps': 23782, 'loss/train': 1.4818731546401978} 11/07/2021 00:34:26 - INFO - __main__ - Step 23784: {'lr': 0.00047374753270349113, 'samples': 4566528, 'steps': 23783, 'loss/train': 1.6137040853500366} 11/07/2021 00:34:27 - INFO - __main__ - Step 23785: {'lr': 0.00047374516538904287, 'samples': 4566720, 'steps': 23784, 'loss/train': 1.2234424352645874} 11/07/2021 00:34:27 - INFO - __main__ - Step 23786: {'lr': 0.0004737427979737786, 'samples': 4566912, 'steps': 23785, 'loss/train': 1.9149585962295532} 11/07/2021 00:34:28 - INFO - __main__ - Step 23787: {'lr': 0.0004737404304576992, 'samples': 4567104, 'steps': 23786, 'loss/train': 1.2945975065231323} 11/07/2021 00:34:28 - INFO - __main__ - Step 23788: {'lr': 0.0004737380628408059, 'samples': 4567296, 'steps': 23787, 'loss/train': 1.7185180187225342} 11/07/2021 00:34:28 - INFO - __main__ - Step 23789: {'lr': 0.00047373569512309963, 'samples': 4567488, 'steps': 23788, 'loss/train': 1.5824964046478271} 11/07/2021 00:34:30 - INFO - __main__ - Step 23790: {'lr': 0.0004737333273045815, 'samples': 4567680, 'steps': 23789, 'loss/train': 0.2642037570476532} 11/07/2021 00:34:30 - INFO - __main__ - Step 23791: {'lr': 0.00047373095938525256, 'samples': 4567872, 'steps': 23790, 'loss/train': 1.5035150051116943} 11/07/2021 00:34:30 - INFO - __main__ - Step 23792: {'lr': 0.0004737285913651139, 'samples': 4568064, 'steps': 23791, 'loss/train': 1.7068278789520264} 11/07/2021 00:34:31 - INFO - __main__ - Step 23793: {'lr': 0.0004737262232441667, 'samples': 4568256, 'steps': 23792, 'loss/train': 0.1296090930700302} 11/07/2021 00:34:31 - INFO - __main__ - Step 23794: {'lr': 0.00047372385502241176, 'samples': 4568448, 'steps': 23793, 'loss/train': 1.8227049112319946} 11/07/2021 00:34:31 - INFO - __main__ - Step 23795: {'lr': 0.0004737214866998504, 'samples': 4568640, 'steps': 23794, 'loss/train': 1.3710237741470337} 11/07/2021 00:34:32 - INFO - __main__ - Step 23796: {'lr': 0.0004737191182764836, 'samples': 4568832, 'steps': 23795, 'loss/train': 1.4633216857910156} 11/07/2021 00:34:33 - INFO - __main__ - Step 23797: {'lr': 0.0004737167497523124, 'samples': 4569024, 'steps': 23796, 'loss/train': 2.038269281387329} 11/07/2021 00:34:33 - INFO - __main__ - Step 23798: {'lr': 0.0004737143811273379, 'samples': 4569216, 'steps': 23797, 'loss/train': 1.8234018087387085} 11/07/2021 00:34:33 - INFO - __main__ - Step 23799: {'lr': 0.0004737120124015611, 'samples': 4569408, 'steps': 23798, 'loss/train': 1.5387424230575562} 11/07/2021 00:34:34 - INFO - __main__ - Step 23800: {'lr': 0.00047370964357498313, 'samples': 4569600, 'steps': 23799, 'loss/train': 1.482106328010559} 11/07/2021 00:34:35 - INFO - __main__ - Step 23801: {'lr': 0.0004737072746476051, 'samples': 4569792, 'steps': 23800, 'loss/train': 1.0635290145874023} 11/07/2021 00:34:35 - INFO - __main__ - Step 23802: {'lr': 0.00047370490561942795, 'samples': 4569984, 'steps': 23801, 'loss/train': 1.1702690124511719} 11/07/2021 00:34:36 - INFO - __main__ - Step 23803: {'lr': 0.00047370253649045286, 'samples': 4570176, 'steps': 23802, 'loss/train': 1.4020882844924927} 11/07/2021 00:34:36 - INFO - __main__ - Step 23804: {'lr': 0.00047370016726068086, 'samples': 4570368, 'steps': 23803, 'loss/train': 2.384243965148926} 11/07/2021 00:34:36 - INFO - __main__ - Step 23805: {'lr': 0.000473697797930113, 'samples': 4570560, 'steps': 23804, 'loss/train': 1.7264128923416138} 11/07/2021 00:34:37 - INFO - __main__ - Step 23806: {'lr': 0.00047369542849875037, 'samples': 4570752, 'steps': 23805, 'loss/train': 1.715357780456543} 11/07/2021 00:34:38 - INFO - __main__ - Step 23807: {'lr': 0.0004736930589665941, 'samples': 4570944, 'steps': 23806, 'loss/train': 1.3189862966537476} 11/07/2021 00:34:38 - INFO - __main__ - Step 23808: {'lr': 0.0004736906893336451, 'samples': 4571136, 'steps': 23807, 'loss/train': 1.9343082904815674} 11/07/2021 00:34:38 - INFO - __main__ - Step 23809: {'lr': 0.00047368831959990453, 'samples': 4571328, 'steps': 23808, 'loss/train': 1.5713011026382446} 11/07/2021 00:34:39 - INFO - __main__ - Step 23810: {'lr': 0.0004736859497653735, 'samples': 4571520, 'steps': 23809, 'loss/train': 0.9398396611213684} 11/07/2021 00:34:39 - INFO - __main__ - Step 23811: {'lr': 0.0004736835798300531, 'samples': 4571712, 'steps': 23810, 'loss/train': 1.7095924615859985} 11/07/2021 00:34:40 - INFO - __main__ - Step 23812: {'lr': 0.00047368120979394415, 'samples': 4571904, 'steps': 23811, 'loss/train': 2.351248264312744} 11/07/2021 00:34:40 - INFO - __main__ - Step 23813: {'lr': 0.000473678839657048, 'samples': 4572096, 'steps': 23812, 'loss/train': 1.5410054922103882} 11/07/2021 00:34:41 - INFO - __main__ - Step 23814: {'lr': 0.0004736764694193656, 'samples': 4572288, 'steps': 23813, 'loss/train': 0.9509553909301758} 11/07/2021 00:34:41 - INFO - __main__ - Step 23815: {'lr': 0.0004736740990808981, 'samples': 4572480, 'steps': 23814, 'loss/train': 1.646188497543335} 11/07/2021 00:34:42 - INFO - __main__ - Step 23816: {'lr': 0.0004736717286416464, 'samples': 4572672, 'steps': 23815, 'loss/train': 1.1700364351272583} 11/07/2021 00:34:43 - INFO - __main__ - Step 23817: {'lr': 0.0004736693581016117, 'samples': 4572864, 'steps': 23816, 'loss/train': 1.0271397829055786} 11/07/2021 00:34:43 - INFO - __main__ - Step 23818: {'lr': 0.00047366698746079507, 'samples': 4573056, 'steps': 23817, 'loss/train': 1.5787421464920044} 11/07/2021 00:34:43 - INFO - __main__ - Step 23819: {'lr': 0.0004736646167191975, 'samples': 4573248, 'steps': 23818, 'loss/train': 1.5957248210906982} 11/07/2021 00:34:44 - INFO - __main__ - Step 23820: {'lr': 0.00047366224587682017, 'samples': 4573440, 'steps': 23819, 'loss/train': 0.7363741993904114} 11/07/2021 00:34:44 - INFO - __main__ - Step 23821: {'lr': 0.000473659874933664, 'samples': 4573632, 'steps': 23820, 'loss/train': 2.0048911571502686} 11/07/2021 00:34:45 - INFO - __main__ - Step 23822: {'lr': 0.0004736575038897303, 'samples': 4573824, 'steps': 23821, 'loss/train': 1.1328240633010864} 11/07/2021 00:34:45 - INFO - __main__ - Step 23823: {'lr': 0.0004736551327450198, 'samples': 4574016, 'steps': 23822, 'loss/train': 1.430214285850525} 11/07/2021 00:34:46 - INFO - __main__ - Step 23824: {'lr': 0.00047365276149953387, 'samples': 4574208, 'steps': 23823, 'loss/train': 1.3258752822875977} 11/07/2021 00:34:46 - INFO - __main__ - Step 23825: {'lr': 0.0004736503901532734, 'samples': 4574400, 'steps': 23824, 'loss/train': 2.036221504211426} 11/07/2021 00:34:47 - INFO - __main__ - Step 23826: {'lr': 0.00047364801870623954, 'samples': 4574592, 'steps': 23825, 'loss/train': 1.6998757123947144} 11/07/2021 00:34:48 - INFO - __main__ - Step 23827: {'lr': 0.00047364564715843326, 'samples': 4574784, 'steps': 23826, 'loss/train': 1.7433280944824219} 11/07/2021 00:34:48 - INFO - __main__ - Step 23828: {'lr': 0.00047364327550985575, 'samples': 4574976, 'steps': 23827, 'loss/train': 1.8414788246154785} 11/07/2021 00:34:48 - INFO - __main__ - Step 23829: {'lr': 0.00047364090376050805, 'samples': 4575168, 'steps': 23828, 'loss/train': 1.4711476564407349} 11/07/2021 00:34:49 - INFO - __main__ - Step 23830: {'lr': 0.0004736385319103912, 'samples': 4575360, 'steps': 23829, 'loss/train': 0.9364742636680603} 11/07/2021 00:34:49 - INFO - __main__ - Step 23831: {'lr': 0.00047363615995950624, 'samples': 4575552, 'steps': 23830, 'loss/train': 1.699023723602295} 11/07/2021 00:34:50 - INFO - __main__ - Step 23832: {'lr': 0.0004736337879078544, 'samples': 4575744, 'steps': 23831, 'loss/train': 1.367179036140442} 11/07/2021 00:34:50 - INFO - __main__ - Step 23833: {'lr': 0.0004736314157554365, 'samples': 4575936, 'steps': 23832, 'loss/train': 2.1259796619415283} 11/07/2021 00:34:51 - INFO - __main__ - Step 23834: {'lr': 0.00047362904350225376, 'samples': 4576128, 'steps': 23833, 'loss/train': 0.8249647617340088} 11/07/2021 00:34:51 - INFO - __main__ - Step 23835: {'lr': 0.0004736266711483073, 'samples': 4576320, 'steps': 23834, 'loss/train': 1.670304536819458} 11/07/2021 00:34:52 - INFO - __main__ - Step 23836: {'lr': 0.00047362429869359803, 'samples': 4576512, 'steps': 23835, 'loss/train': 1.1619794368743896} 11/07/2021 00:34:53 - INFO - __main__ - Step 23837: {'lr': 0.0004736219261381271, 'samples': 4576704, 'steps': 23836, 'loss/train': 1.7294654846191406} 11/07/2021 00:34:53 - INFO - __main__ - Step 23838: {'lr': 0.0004736195534818956, 'samples': 4576896, 'steps': 23837, 'loss/train': 1.4316507577896118} 11/07/2021 00:34:53 - INFO - __main__ - Step 23839: {'lr': 0.00047361718072490457, 'samples': 4577088, 'steps': 23838, 'loss/train': 1.2505590915679932} 11/07/2021 00:34:54 - INFO - __main__ - Step 23840: {'lr': 0.00047361480786715514, 'samples': 4577280, 'steps': 23839, 'loss/train': 1.3018585443496704} 11/07/2021 00:34:54 - INFO - __main__ - Step 23841: {'lr': 0.00047361243490864826, 'samples': 4577472, 'steps': 23840, 'loss/train': 0.14377540349960327} 11/07/2021 00:34:54 - INFO - __main__ - Step 23842: {'lr': 0.00047361006184938517, 'samples': 4577664, 'steps': 23841, 'loss/train': 4.133652687072754} 11/07/2021 00:34:55 - INFO - __main__ - Step 23843: {'lr': 0.00047360768868936673, 'samples': 4577856, 'steps': 23842, 'loss/train': 1.427064299583435} 11/07/2021 00:34:56 - INFO - __main__ - Step 23844: {'lr': 0.00047360531542859415, 'samples': 4578048, 'steps': 23843, 'loss/train': 1.4833950996398926} 11/07/2021 00:34:56 - INFO - __main__ - Step 23845: {'lr': 0.00047360294206706845, 'samples': 4578240, 'steps': 23844, 'loss/train': 1.6829880475997925} 11/07/2021 00:34:56 - INFO - __main__ - Step 23846: {'lr': 0.0004736005686047907, 'samples': 4578432, 'steps': 23845, 'loss/train': 1.974252462387085} 11/07/2021 00:34:57 - INFO - __main__ - Step 23847: {'lr': 0.000473598195041762, 'samples': 4578624, 'steps': 23846, 'loss/train': 1.9925411939620972} 11/07/2021 00:34:58 - INFO - __main__ - Step 23848: {'lr': 0.0004735958213779835, 'samples': 4578816, 'steps': 23847, 'loss/train': 1.678208351135254} 11/07/2021 00:34:58 - INFO - __main__ - Step 23849: {'lr': 0.0004735934476134561, 'samples': 4579008, 'steps': 23848, 'loss/train': 2.162973165512085} 11/07/2021 00:34:59 - INFO - __main__ - Step 23850: {'lr': 0.0004735910737481809, 'samples': 4579200, 'steps': 23849, 'loss/train': 1.549644112586975} 11/07/2021 00:34:59 - INFO - __main__ - Step 23851: {'lr': 0.0004735886997821591, 'samples': 4579392, 'steps': 23850, 'loss/train': 1.391912579536438} 11/07/2021 00:34:59 - INFO - __main__ - Step 23852: {'lr': 0.00047358632571539163, 'samples': 4579584, 'steps': 23851, 'loss/train': 1.26926851272583} 11/07/2021 00:35:00 - INFO - __main__ - Step 23853: {'lr': 0.0004735839515478796, 'samples': 4579776, 'steps': 23852, 'loss/train': 1.931373119354248} 11/07/2021 00:35:01 - INFO - __main__ - Step 23854: {'lr': 0.0004735815772796241, 'samples': 4579968, 'steps': 23853, 'loss/train': 1.3470486402511597} 11/07/2021 00:35:01 - INFO - __main__ - Step 23855: {'lr': 0.0004735792029106262, 'samples': 4580160, 'steps': 23854, 'loss/train': 1.512963056564331} 11/07/2021 00:35:01 - INFO - __main__ - Step 23856: {'lr': 0.0004735768284408869, 'samples': 4580352, 'steps': 23855, 'loss/train': 1.0526634454727173} 11/07/2021 00:35:02 - INFO - __main__ - Step 23857: {'lr': 0.00047357445387040745, 'samples': 4580544, 'steps': 23856, 'loss/train': 1.524977445602417} 11/07/2021 00:35:03 - INFO - __main__ - Step 23858: {'lr': 0.0004735720791991887, 'samples': 4580736, 'steps': 23857, 'loss/train': 1.551909327507019} 11/07/2021 00:35:03 - INFO - __main__ - Step 23859: {'lr': 0.00047356970442723184, 'samples': 4580928, 'steps': 23858, 'loss/train': 1.7394713163375854} 11/07/2021 00:35:03 - INFO - __main__ - Step 23860: {'lr': 0.00047356732955453794, 'samples': 4581120, 'steps': 23859, 'loss/train': 2.184431791305542} 11/07/2021 00:35:04 - INFO - __main__ - Step 23861: {'lr': 0.00047356495458110806, 'samples': 4581312, 'steps': 23860, 'loss/train': 1.451634168624878} 11/07/2021 00:35:04 - INFO - __main__ - Step 23862: {'lr': 0.00047356257950694326, 'samples': 4581504, 'steps': 23861, 'loss/train': 1.7785993814468384} 11/07/2021 00:35:05 - INFO - __main__ - Step 23863: {'lr': 0.0004735602043320446, 'samples': 4581696, 'steps': 23862, 'loss/train': 1.1716285943984985} 11/07/2021 00:35:06 - INFO - __main__ - Step 23864: {'lr': 0.0004735578290564132, 'samples': 4581888, 'steps': 23863, 'loss/train': 1.9256266355514526} 11/07/2021 00:35:06 - INFO - __main__ - Step 23865: {'lr': 0.00047355545368005003, 'samples': 4582080, 'steps': 23864, 'loss/train': 1.585779070854187} 11/07/2021 00:35:06 - INFO - __main__ - Step 23866: {'lr': 0.00047355307820295625, 'samples': 4582272, 'steps': 23865, 'loss/train': 1.762752890586853} 11/07/2021 00:35:07 - INFO - __main__ - Step 23867: {'lr': 0.00047355070262513287, 'samples': 4582464, 'steps': 23866, 'loss/train': 1.9047404527664185} 11/07/2021 00:35:08 - INFO - __main__ - Step 23868: {'lr': 0.00047354832694658104, 'samples': 4582656, 'steps': 23867, 'loss/train': 2.1007673740386963} 11/07/2021 00:35:08 - INFO - __main__ - Step 23869: {'lr': 0.0004735459511673018, 'samples': 4582848, 'steps': 23868, 'loss/train': 1.7195255756378174} 11/07/2021 00:35:08 - INFO - __main__ - Step 23870: {'lr': 0.0004735435752872962, 'samples': 4583040, 'steps': 23869, 'loss/train': 1.8624155521392822} 11/07/2021 00:35:09 - INFO - __main__ - Step 23871: {'lr': 0.00047354119930656524, 'samples': 4583232, 'steps': 23870, 'loss/train': 1.7606174945831299} 11/07/2021 00:35:09 - INFO - __main__ - Step 23872: {'lr': 0.0004735388232251101, 'samples': 4583424, 'steps': 23871, 'loss/train': 1.514115333557129} 11/07/2021 00:35:10 - INFO - __main__ - Step 23873: {'lr': 0.00047353644704293185, 'samples': 4583616, 'steps': 23872, 'loss/train': 1.6635241508483887} 11/07/2021 00:35:10 - INFO - __main__ - Step 23874: {'lr': 0.0004735340707600315, 'samples': 4583808, 'steps': 23873, 'loss/train': 1.2399150133132935} 11/07/2021 00:35:11 - INFO - __main__ - Step 23875: {'lr': 0.0004735316943764102, 'samples': 4584000, 'steps': 23874, 'loss/train': 1.478752851486206} 11/07/2021 00:35:11 - INFO - __main__ - Step 23876: {'lr': 0.0004735293178920689, 'samples': 4584192, 'steps': 23875, 'loss/train': 1.7473831176757812} 11/07/2021 00:35:11 - INFO - __main__ - Step 23877: {'lr': 0.00047352694130700873, 'samples': 4584384, 'steps': 23876, 'loss/train': 1.5420238971710205} 11/07/2021 00:35:12 - INFO - __main__ - Step 23878: {'lr': 0.00047352456462123086, 'samples': 4584576, 'steps': 23877, 'loss/train': 1.2136842012405396} 11/07/2021 00:35:13 - INFO - __main__ - Step 23879: {'lr': 0.00047352218783473614, 'samples': 4584768, 'steps': 23878, 'loss/train': 1.6738272905349731} 11/07/2021 00:35:13 - INFO - __main__ - Step 23880: {'lr': 0.0004735198109475258, 'samples': 4584960, 'steps': 23879, 'loss/train': 1.5064436197280884} 11/07/2021 00:35:14 - INFO - __main__ - Step 23881: {'lr': 0.000473517433959601, 'samples': 4585152, 'steps': 23880, 'loss/train': 2.9225754737854004} 11/07/2021 00:35:14 - INFO - __main__ - Step 23882: {'lr': 0.00047351505687096257, 'samples': 4585344, 'steps': 23881, 'loss/train': 1.5504294633865356} 11/07/2021 00:35:14 - INFO - __main__ - Step 23883: {'lr': 0.00047351267968161176, 'samples': 4585536, 'steps': 23882, 'loss/train': 1.7049568891525269} 11/07/2021 00:35:15 - INFO - __main__ - Step 23884: {'lr': 0.0004735103023915496, 'samples': 4585728, 'steps': 23883, 'loss/train': 1.40176260471344} 11/07/2021 00:35:16 - INFO - __main__ - Step 23885: {'lr': 0.0004735079250007771, 'samples': 4585920, 'steps': 23884, 'loss/train': 1.1555250883102417} 11/07/2021 00:35:16 - INFO - __main__ - Step 23886: {'lr': 0.00047350554750929543, 'samples': 4586112, 'steps': 23885, 'loss/train': 1.702562689781189} 11/07/2021 00:35:16 - INFO - __main__ - Step 23887: {'lr': 0.0004735031699171055, 'samples': 4586304, 'steps': 23886, 'loss/train': 1.0130150318145752} 11/07/2021 00:35:17 - INFO - __main__ - Step 23888: {'lr': 0.0004735007922242086, 'samples': 4586496, 'steps': 23887, 'loss/train': 2.6597204208374023} 11/07/2021 00:35:18 - INFO - __main__ - Step 23889: {'lr': 0.0004734984144306057, 'samples': 4586688, 'steps': 23888, 'loss/train': 1.794775128364563} 11/07/2021 00:35:18 - INFO - __main__ - Step 23890: {'lr': 0.0004734960365362978, 'samples': 4586880, 'steps': 23889, 'loss/train': 1.3223886489868164} 11/07/2021 00:35:19 - INFO - __main__ - Step 23891: {'lr': 0.0004734936585412861, 'samples': 4587072, 'steps': 23890, 'loss/train': 1.6438193321228027} 11/07/2021 00:35:19 - INFO - __main__ - Step 23892: {'lr': 0.00047349128044557153, 'samples': 4587264, 'steps': 23891, 'loss/train': 1.682618498802185} 11/07/2021 00:35:19 - INFO - __main__ - Step 23893: {'lr': 0.0004734889022491553, 'samples': 4587456, 'steps': 23892, 'loss/train': 1.850300669670105} 11/07/2021 00:35:20 - INFO - __main__ - Step 23894: {'lr': 0.0004734865239520384, 'samples': 4587648, 'steps': 23893, 'loss/train': 1.6257259845733643} 11/07/2021 00:35:21 - INFO - __main__ - Step 23895: {'lr': 0.0004734841455542219, 'samples': 4587840, 'steps': 23894, 'loss/train': 0.8706209659576416} 11/07/2021 00:35:21 - INFO - __main__ - Step 23896: {'lr': 0.0004734817670557069, 'samples': 4588032, 'steps': 23895, 'loss/train': 1.325243353843689} 11/07/2021 00:35:21 - INFO - __main__ - Step 23897: {'lr': 0.00047347938845649447, 'samples': 4588224, 'steps': 23896, 'loss/train': 1.8135654926300049} 11/07/2021 00:35:22 - INFO - __main__ - Step 23898: {'lr': 0.0004734770097565857, 'samples': 4588416, 'steps': 23897, 'loss/train': 1.6204172372817993} 11/07/2021 00:35:23 - INFO - __main__ - Step 23899: {'lr': 0.00047347463095598157, 'samples': 4588608, 'steps': 23898, 'loss/train': 1.4633369445800781} 11/07/2021 00:35:23 - INFO - __main__ - Step 23900: {'lr': 0.00047347225205468323, 'samples': 4588800, 'steps': 23899, 'loss/train': 1.0516488552093506} 11/07/2021 00:35:23 - INFO - __main__ - Step 23901: {'lr': 0.00047346987305269184, 'samples': 4588992, 'steps': 23900, 'loss/train': 1.8633294105529785} 11/07/2021 00:35:24 - INFO - __main__ - Step 23902: {'lr': 0.0004734674939500083, 'samples': 4589184, 'steps': 23901, 'loss/train': 1.2833586931228638} 11/07/2021 00:35:24 - INFO - __main__ - Step 23903: {'lr': 0.0004734651147466338, 'samples': 4589376, 'steps': 23902, 'loss/train': 0.12630239129066467} 11/07/2021 00:35:25 - INFO - __main__ - Step 23904: {'lr': 0.00047346273544256927, 'samples': 4589568, 'steps': 23903, 'loss/train': 1.9274121522903442} 11/07/2021 00:35:26 - INFO - __main__ - Step 23905: {'lr': 0.00047346035603781597, 'samples': 4589760, 'steps': 23904, 'loss/train': 1.7911525964736938} 11/07/2021 00:35:26 - INFO - __main__ - Step 23906: {'lr': 0.00047345797653237486, 'samples': 4589952, 'steps': 23905, 'loss/train': 1.6518858671188354} 11/07/2021 00:35:26 - INFO - __main__ - Step 23907: {'lr': 0.000473455596926247, 'samples': 4590144, 'steps': 23906, 'loss/train': 1.787940502166748} 11/07/2021 00:35:27 - INFO - __main__ - Step 23908: {'lr': 0.0004734532172194335, 'samples': 4590336, 'steps': 23907, 'loss/train': 1.4603474140167236} 11/07/2021 00:35:27 - INFO - __main__ - Step 23909: {'lr': 0.0004734508374119355, 'samples': 4590528, 'steps': 23908, 'loss/train': 1.3073455095291138} 11/07/2021 00:35:28 - INFO - __main__ - Step 23910: {'lr': 0.0004734484575037539, 'samples': 4590720, 'steps': 23909, 'loss/train': 1.4078184366226196} 11/07/2021 00:35:29 - INFO - __main__ - Step 23911: {'lr': 0.00047344607749489, 'samples': 4590912, 'steps': 23910, 'loss/train': 1.5765531063079834} 11/07/2021 00:35:29 - INFO - __main__ - Step 23912: {'lr': 0.00047344369738534466, 'samples': 4591104, 'steps': 23911, 'loss/train': 1.2944846153259277} 11/07/2021 00:35:29 - INFO - __main__ - Step 23913: {'lr': 0.000473441317175119, 'samples': 4591296, 'steps': 23912, 'loss/train': 1.6884845495224} 11/07/2021 00:35:30 - INFO - __main__ - Step 23914: {'lr': 0.0004734389368642142, 'samples': 4591488, 'steps': 23913, 'loss/train': 1.1201519966125488} 11/07/2021 00:35:31 - INFO - __main__ - Step 23915: {'lr': 0.0004734365564526313, 'samples': 4591680, 'steps': 23914, 'loss/train': 1.8650660514831543} 11/07/2021 00:35:31 - INFO - __main__ - Step 23916: {'lr': 0.00047343417594037117, 'samples': 4591872, 'steps': 23915, 'loss/train': 1.3060716390609741} 11/07/2021 00:35:31 - INFO - __main__ - Step 23917: {'lr': 0.00047343179532743516, 'samples': 4592064, 'steps': 23916, 'loss/train': 1.5598535537719727} 11/07/2021 00:35:32 - INFO - __main__ - Step 23918: {'lr': 0.00047342941461382427, 'samples': 4592256, 'steps': 23917, 'loss/train': 1.4983863830566406} 11/07/2021 00:35:32 - INFO - __main__ - Step 23919: {'lr': 0.0004734270337995395, 'samples': 4592448, 'steps': 23918, 'loss/train': 1.1809464693069458} 11/07/2021 00:35:35 - INFO - __main__ - Step 23920: {'lr': 0.0004734246528845819, 'samples': 4592640, 'steps': 23919, 'loss/train': 0.9813411235809326} 11/07/2021 00:35:35 - INFO - __main__ - Step 23921: {'lr': 0.0004734222718689527, 'samples': 4592832, 'steps': 23920, 'loss/train': 2.3573687076568604} 11/07/2021 00:35:35 - INFO - __main__ - Step 23922: {'lr': 0.0004734198907526528, 'samples': 4593024, 'steps': 23921, 'loss/train': 1.912838101387024} 11/07/2021 00:35:36 - INFO - __main__ - Step 23923: {'lr': 0.00047341750953568335, 'samples': 4593216, 'steps': 23922, 'loss/train': 1.816469430923462} 11/07/2021 00:35:36 - INFO - __main__ - Step 23924: {'lr': 0.0004734151282180454, 'samples': 4593408, 'steps': 23923, 'loss/train': 1.8191454410552979} 11/07/2021 00:35:36 - INFO - __main__ - Step 23925: {'lr': 0.0004734127467997401, 'samples': 4593600, 'steps': 23924, 'loss/train': 1.0661303997039795} 11/07/2021 00:35:37 - INFO - __main__ - Step 23926: {'lr': 0.0004734103652807684, 'samples': 4593792, 'steps': 23925, 'loss/train': 1.6279600858688354} 11/07/2021 00:35:38 - INFO - __main__ - Step 23927: {'lr': 0.0004734079836611315, 'samples': 4593984, 'steps': 23926, 'loss/train': 1.4247660636901855} 11/07/2021 00:35:38 - INFO - __main__ - Step 23928: {'lr': 0.0004734056019408304, 'samples': 4594176, 'steps': 23927, 'loss/train': 1.551518201828003} 11/07/2021 00:35:38 - INFO - __main__ - Step 23929: {'lr': 0.00047340322011986614, 'samples': 4594368, 'steps': 23928, 'loss/train': 1.5285148620605469} 11/07/2021 00:35:39 - INFO - __main__ - Step 23930: {'lr': 0.0004734008381982399, 'samples': 4594560, 'steps': 23929, 'loss/train': 1.452463150024414} 11/07/2021 00:35:39 - INFO - __main__ - Step 23931: {'lr': 0.0004733984561759527, 'samples': 4594752, 'steps': 23930, 'loss/train': 1.7026846408843994} 11/07/2021 00:35:39 - INFO - __main__ - Step 23932: {'lr': 0.0004733960740530055, 'samples': 4594944, 'steps': 23931, 'loss/train': 1.6613857746124268} 11/07/2021 00:35:40 - INFO - __main__ - Step 23933: {'lr': 0.0004733936918293995, 'samples': 4595136, 'steps': 23932, 'loss/train': 1.6048064231872559} 11/07/2021 00:35:41 - INFO - __main__ - Step 23934: {'lr': 0.0004733913095051358, 'samples': 4595328, 'steps': 23933, 'loss/train': 1.8170708417892456} 11/07/2021 00:35:41 - INFO - __main__ - Step 23935: {'lr': 0.0004733889270802154, 'samples': 4595520, 'steps': 23934, 'loss/train': 1.6382811069488525} 11/07/2021 00:35:41 - INFO - __main__ - Step 23936: {'lr': 0.00047338654455463935, 'samples': 4595712, 'steps': 23935, 'loss/train': 1.4931180477142334} 11/07/2021 00:35:42 - INFO - __main__ - Step 23937: {'lr': 0.00047338416192840887, 'samples': 4595904, 'steps': 23936, 'loss/train': 1.7660572528839111} 11/07/2021 00:35:43 - INFO - __main__ - Step 23938: {'lr': 0.0004733817792015249, 'samples': 4596096, 'steps': 23937, 'loss/train': 1.0775352716445923} 11/07/2021 00:35:43 - INFO - __main__ - Step 23939: {'lr': 0.00047337939637398855, 'samples': 4596288, 'steps': 23938, 'loss/train': 1.5399011373519897} 11/07/2021 00:35:43 - INFO - __main__ - Step 23940: {'lr': 0.0004733770134458009, 'samples': 4596480, 'steps': 23939, 'loss/train': 1.5492165088653564} 11/07/2021 00:35:44 - INFO - __main__ - Step 23941: {'lr': 0.0004733746304169629, 'samples': 4596672, 'steps': 23940, 'loss/train': 1.623695731163025} 11/07/2021 00:35:44 - INFO - __main__ - Step 23942: {'lr': 0.0004733722472874759, 'samples': 4596864, 'steps': 23941, 'loss/train': 1.8950859308242798} 11/07/2021 00:35:46 - INFO - __main__ - Step 23943: {'lr': 0.0004733698640573407, 'samples': 4597056, 'steps': 23942, 'loss/train': 1.560842752456665} 11/07/2021 00:35:46 - INFO - __main__ - Step 23944: {'lr': 0.0004733674807265585, 'samples': 4597248, 'steps': 23943, 'loss/train': 0.23782792687416077} 11/07/2021 00:35:46 - INFO - __main__ - Step 23945: {'lr': 0.0004733650972951304, 'samples': 4597440, 'steps': 23944, 'loss/train': 1.5143749713897705} 11/07/2021 00:35:47 - INFO - __main__ - Step 23946: {'lr': 0.0004733627137630574, 'samples': 4597632, 'steps': 23945, 'loss/train': 2.1152851581573486} 11/07/2021 00:35:47 - INFO - __main__ - Step 23947: {'lr': 0.00047336033013034063, 'samples': 4597824, 'steps': 23946, 'loss/train': 1.6347516775131226} 11/07/2021 00:35:48 - INFO - __main__ - Step 23948: {'lr': 0.00047335794639698117, 'samples': 4598016, 'steps': 23947, 'loss/train': 1.3954161405563354} 11/07/2021 00:35:48 - INFO - __main__ - Step 23949: {'lr': 0.00047335556256298, 'samples': 4598208, 'steps': 23948, 'loss/train': 1.6793127059936523} 11/07/2021 00:35:49 - INFO - __main__ - Step 23950: {'lr': 0.0004733531786283383, 'samples': 4598400, 'steps': 23949, 'loss/train': 1.897346019744873} 11/07/2021 00:35:49 - INFO - __main__ - Step 23951: {'lr': 0.0004733507945930571, 'samples': 4598592, 'steps': 23950, 'loss/train': 1.4155480861663818} 11/07/2021 00:35:49 - INFO - __main__ - Step 23952: {'lr': 0.0004733484104571375, 'samples': 4598784, 'steps': 23951, 'loss/train': 1.4704738855361938} 11/07/2021 00:35:50 - INFO - __main__ - Step 23953: {'lr': 0.0004733460262205805, 'samples': 4598976, 'steps': 23952, 'loss/train': 1.1455572843551636} 11/07/2021 00:35:51 - INFO - __main__ - Step 23954: {'lr': 0.00047334364188338725, 'samples': 4599168, 'steps': 23953, 'loss/train': 1.7158654928207397} 11/07/2021 00:35:51 - INFO - __main__ - Step 23955: {'lr': 0.0004733412574455588, 'samples': 4599360, 'steps': 23954, 'loss/train': 1.5433906316757202} 11/07/2021 00:35:51 - INFO - __main__ - Step 23956: {'lr': 0.00047333887290709623, 'samples': 4599552, 'steps': 23955, 'loss/train': 1.0119457244873047} 11/07/2021 00:35:52 - INFO - __main__ - Step 23957: {'lr': 0.00047333648826800056, 'samples': 4599744, 'steps': 23956, 'loss/train': 1.6177657842636108} 11/07/2021 00:35:53 - INFO - __main__ - Step 23958: {'lr': 0.000473334103528273, 'samples': 4599936, 'steps': 23957, 'loss/train': 1.7731554508209229} 11/07/2021 00:35:53 - INFO - __main__ - Step 23959: {'lr': 0.00047333171868791453, 'samples': 4600128, 'steps': 23958, 'loss/train': 1.7867491245269775} 11/07/2021 00:35:53 - INFO - __main__ - Step 23960: {'lr': 0.00047332933374692623, 'samples': 4600320, 'steps': 23959, 'loss/train': 1.0234673023223877} 11/07/2021 00:35:54 - INFO - __main__ - Step 23961: {'lr': 0.0004733269487053091, 'samples': 4600512, 'steps': 23960, 'loss/train': 1.6259523630142212} 11/07/2021 00:35:54 - INFO - __main__ - Step 23962: {'lr': 0.0004733245635630644, 'samples': 4600704, 'steps': 23961, 'loss/train': 1.7841717004776} 11/07/2021 00:35:55 - INFO - __main__ - Step 23963: {'lr': 0.000473322178320193, 'samples': 4600896, 'steps': 23962, 'loss/train': 1.3683499097824097} 11/07/2021 00:35:55 - INFO - __main__ - Step 23964: {'lr': 0.0004733197929766961, 'samples': 4601088, 'steps': 23963, 'loss/train': 1.8661178350448608} 11/07/2021 00:35:56 - INFO - __main__ - Step 23965: {'lr': 0.0004733174075325748, 'samples': 4601280, 'steps': 23964, 'loss/train': 0.5968111753463745} 11/07/2021 00:35:56 - INFO - __main__ - Step 23966: {'lr': 0.0004733150219878301, 'samples': 4601472, 'steps': 23965, 'loss/train': 1.1554112434387207} 11/07/2021 00:35:57 - INFO - __main__ - Step 23967: {'lr': 0.00047331263634246314, 'samples': 4601664, 'steps': 23966, 'loss/train': 1.5371090173721313} 11/07/2021 00:35:57 - INFO - __main__ - Step 23968: {'lr': 0.0004733102505964749, 'samples': 4601856, 'steps': 23967, 'loss/train': 1.333052158355713} 11/07/2021 00:35:58 - INFO - __main__ - Step 23969: {'lr': 0.00047330786474986645, 'samples': 4602048, 'steps': 23968, 'loss/train': 1.4831993579864502} 11/07/2021 00:35:58 - INFO - __main__ - Step 23970: {'lr': 0.00047330547880263896, 'samples': 4602240, 'steps': 23969, 'loss/train': 0.468535840511322} 11/07/2021 00:35:59 - INFO - __main__ - Step 23971: {'lr': 0.00047330309275479354, 'samples': 4602432, 'steps': 23970, 'loss/train': 2.043537139892578} 11/07/2021 00:35:59 - INFO - __main__ - Step 23972: {'lr': 0.00047330070660633113, 'samples': 4602624, 'steps': 23971, 'loss/train': 1.6697735786437988} 11/07/2021 00:35:59 - INFO - __main__ - Step 23973: {'lr': 0.00047329832035725286, 'samples': 4602816, 'steps': 23972, 'loss/train': 1.3469334840774536} 11/07/2021 00:36:01 - INFO - __main__ - Step 23974: {'lr': 0.0004732959340075598, 'samples': 4603008, 'steps': 23973, 'loss/train': 1.5206609964370728} 11/07/2021 00:36:01 - INFO - __main__ - Step 23975: {'lr': 0.0004732935475572531, 'samples': 4603200, 'steps': 23974, 'loss/train': 1.446755051612854} 11/07/2021 00:36:01 - INFO - __main__ - Step 23976: {'lr': 0.00047329116100633373, 'samples': 4603392, 'steps': 23975, 'loss/train': 1.2258696556091309} 11/07/2021 00:36:02 - INFO - __main__ - Step 23977: {'lr': 0.0004732887743548028, 'samples': 4603584, 'steps': 23976, 'loss/train': 0.9995880126953125} 11/07/2021 00:36:02 - INFO - __main__ - Step 23978: {'lr': 0.0004732863876026614, 'samples': 4603776, 'steps': 23977, 'loss/train': 1.8663251399993896} 11/07/2021 00:36:03 - INFO - __main__ - Step 23979: {'lr': 0.00047328400074991064, 'samples': 4603968, 'steps': 23978, 'loss/train': 1.5901334285736084} 11/07/2021 00:36:03 - INFO - __main__ - Step 23980: {'lr': 0.00047328161379655155, 'samples': 4604160, 'steps': 23979, 'loss/train': 2.803783416748047} 11/07/2021 00:36:04 - INFO - __main__ - Step 23981: {'lr': 0.00047327922674258516, 'samples': 4604352, 'steps': 23980, 'loss/train': 1.6610171794891357} 11/07/2021 00:36:04 - INFO - __main__ - Step 23982: {'lr': 0.00047327683958801257, 'samples': 4604544, 'steps': 23981, 'loss/train': 1.2914010286331177} 11/07/2021 00:36:04 - INFO - __main__ - Step 23983: {'lr': 0.00047327445233283496, 'samples': 4604736, 'steps': 23982, 'loss/train': 1.675779938697815} 11/07/2021 00:36:05 - INFO - __main__ - Step 23984: {'lr': 0.0004732720649770533, 'samples': 4604928, 'steps': 23983, 'loss/train': 1.0921618938446045} 11/07/2021 00:36:06 - INFO - __main__ - Step 23985: {'lr': 0.00047326967752066876, 'samples': 4605120, 'steps': 23984, 'loss/train': 1.6801031827926636} 11/07/2021 00:36:06 - INFO - __main__ - Step 23986: {'lr': 0.0004732672899636822, 'samples': 4605312, 'steps': 23985, 'loss/train': 1.1205083131790161} 11/07/2021 00:36:06 - INFO - __main__ - Step 23987: {'lr': 0.00047326490230609495, 'samples': 4605504, 'steps': 23986, 'loss/train': 1.521935224533081} 11/07/2021 00:36:07 - INFO - __main__ - Step 23988: {'lr': 0.000473262514547908, 'samples': 4605696, 'steps': 23987, 'loss/train': 1.4000868797302246} 11/07/2021 00:36:07 - INFO - __main__ - Step 23989: {'lr': 0.00047326012668912233, 'samples': 4605888, 'steps': 23988, 'loss/train': 1.2538903951644897} 11/07/2021 00:36:08 - INFO - __main__ - Step 23990: {'lr': 0.0004732577387297391, 'samples': 4606080, 'steps': 23989, 'loss/train': 1.4244943857192993} 11/07/2021 00:36:09 - INFO - __main__ - Step 23991: {'lr': 0.00047325535066975946, 'samples': 4606272, 'steps': 23990, 'loss/train': 1.113530158996582} 11/07/2021 00:36:09 - INFO - __main__ - Step 23992: {'lr': 0.0004732529625091843, 'samples': 4606464, 'steps': 23991, 'loss/train': 0.9550225734710693} 11/07/2021 00:36:09 - INFO - __main__ - Step 23993: {'lr': 0.0004732505742480149, 'samples': 4606656, 'steps': 23992, 'loss/train': 1.4506036043167114} 11/07/2021 00:36:10 - INFO - __main__ - Step 23994: {'lr': 0.00047324818588625214, 'samples': 4606848, 'steps': 23993, 'loss/train': 1.0872936248779297} 11/07/2021 00:36:11 - INFO - __main__ - Step 23995: {'lr': 0.0004732457974238972, 'samples': 4607040, 'steps': 23994, 'loss/train': 1.142677903175354} 11/07/2021 00:36:11 - INFO - __main__ - Step 23996: {'lr': 0.0004732434088609512, 'samples': 4607232, 'steps': 23995, 'loss/train': 1.5826812982559204} 11/07/2021 00:36:11 - INFO - __main__ - Step 23997: {'lr': 0.00047324102019741514, 'samples': 4607424, 'steps': 23996, 'loss/train': 1.9252495765686035} 11/07/2021 00:36:12 - INFO - __main__ - Step 23998: {'lr': 0.00047323863143329016, 'samples': 4607616, 'steps': 23997, 'loss/train': 1.2529178857803345} 11/07/2021 00:36:12 - INFO - __main__ - Step 23999: {'lr': 0.00047323624256857724, 'samples': 4607808, 'steps': 23998, 'loss/train': 1.5744787454605103} 11/07/2021 00:36:13 - INFO - __main__ - Step 24000: {'lr': 0.0004732338536032775, 'samples': 4608000, 'steps': 23999, 'loss/train': 0.7579989433288574} 11/07/2021 00:36:14 - INFO - __main__ - Step 24001: {'lr': 0.0004732314645373921, 'samples': 4608192, 'steps': 24000, 'loss/train': 1.5627518892288208} 11/07/2021 00:36:14 - INFO - __main__ - Step 24002: {'lr': 0.0004732290753709221, 'samples': 4608384, 'steps': 24001, 'loss/train': 1.8776737451553345} 11/07/2021 00:36:14 - INFO - __main__ - Step 24003: {'lr': 0.0004732266861038684, 'samples': 4608576, 'steps': 24002, 'loss/train': 1.3503319025039673} 11/07/2021 00:36:15 - INFO - __main__ - Step 24004: {'lr': 0.0004732242967362322, 'samples': 4608768, 'steps': 24003, 'loss/train': 0.988669753074646} 11/07/2021 00:36:16 - INFO - __main__ - Step 24005: {'lr': 0.00047322190726801464, 'samples': 4608960, 'steps': 24004, 'loss/train': 1.773216962814331} 11/07/2021 00:36:16 - INFO - __main__ - Step 24006: {'lr': 0.0004732195176992167, 'samples': 4609152, 'steps': 24005, 'loss/train': 1.4425705671310425} 11/07/2021 00:36:16 - INFO - __main__ - Step 24007: {'lr': 0.0004732171280298395, 'samples': 4609344, 'steps': 24006, 'loss/train': 1.9128704071044922} 11/07/2021 00:36:17 - INFO - __main__ - Step 24008: {'lr': 0.0004732147382598842, 'samples': 4609536, 'steps': 24007, 'loss/train': 1.3278956413269043} 11/07/2021 00:36:17 - INFO - __main__ - Step 24009: {'lr': 0.00047321234838935164, 'samples': 4609728, 'steps': 24008, 'loss/train': 1.7238233089447021} 11/07/2021 00:36:18 - INFO - __main__ - Step 24010: {'lr': 0.0004732099584182431, 'samples': 4609920, 'steps': 24009, 'loss/train': 1.3841497898101807} 11/07/2021 00:36:18 - INFO - __main__ - Step 24011: {'lr': 0.00047320756834655955, 'samples': 4610112, 'steps': 24010, 'loss/train': 1.14120352268219} 11/07/2021 00:36:19 - INFO - __main__ - Step 24012: {'lr': 0.0004732051781743022, 'samples': 4610304, 'steps': 24011, 'loss/train': 1.303422212600708} 11/07/2021 00:36:19 - INFO - __main__ - Step 24013: {'lr': 0.00047320278790147197, 'samples': 4610496, 'steps': 24012, 'loss/train': 1.3343220949172974} 11/07/2021 00:36:19 - INFO - __main__ - Step 24014: {'lr': 0.00047320039752807, 'samples': 4610688, 'steps': 24013, 'loss/train': 1.3710945844650269} 11/07/2021 00:36:20 - INFO - __main__ - Step 24015: {'lr': 0.0004731980070540974, 'samples': 4610880, 'steps': 24014, 'loss/train': 1.5271550416946411} 11/07/2021 00:36:21 - INFO - __main__ - Step 24016: {'lr': 0.0004731956164795552, 'samples': 4611072, 'steps': 24015, 'loss/train': 1.5088775157928467} 11/07/2021 00:36:21 - INFO - __main__ - Step 24017: {'lr': 0.0004731932258044446, 'samples': 4611264, 'steps': 24016, 'loss/train': 1.6866132020950317} 11/07/2021 00:36:21 - INFO - __main__ - Step 24018: {'lr': 0.00047319083502876647, 'samples': 4611456, 'steps': 24017, 'loss/train': 1.7156354188919067} 11/07/2021 00:36:22 - INFO - __main__ - Step 24019: {'lr': 0.00047318844415252204, 'samples': 4611648, 'steps': 24018, 'loss/train': 1.2008994817733765} 11/07/2021 00:36:23 - INFO - __main__ - Step 24020: {'lr': 0.00047318605317571227, 'samples': 4611840, 'steps': 24019, 'loss/train': 0.9371023774147034} 11/07/2021 00:36:23 - INFO - __main__ - Step 24021: {'lr': 0.0004731836620983384, 'samples': 4612032, 'steps': 24020, 'loss/train': 1.8780555725097656} 11/07/2021 00:36:24 - INFO - __main__ - Step 24022: {'lr': 0.00047318127092040144, 'samples': 4612224, 'steps': 24021, 'loss/train': 1.729790449142456} 11/07/2021 00:36:24 - INFO - __main__ - Step 24023: {'lr': 0.00047317887964190233, 'samples': 4612416, 'steps': 24022, 'loss/train': 1.3617117404937744} 11/07/2021 00:36:24 - INFO - __main__ - Step 24024: {'lr': 0.00047317648826284233, 'samples': 4612608, 'steps': 24023, 'loss/train': 1.4316092729568481} 11/07/2021 00:36:25 - INFO - __main__ - Step 24025: {'lr': 0.0004731740967832224, 'samples': 4612800, 'steps': 24024, 'loss/train': 1.0627192258834839} 11/07/2021 00:36:26 - INFO - __main__ - Step 24026: {'lr': 0.00047317170520304373, 'samples': 4612992, 'steps': 24025, 'loss/train': 1.2190508842468262} 11/07/2021 00:36:26 - INFO - __main__ - Step 24027: {'lr': 0.0004731693135223073, 'samples': 4613184, 'steps': 24026, 'loss/train': 1.520830750465393} 11/07/2021 00:36:26 - INFO - __main__ - Step 24028: {'lr': 0.0004731669217410142, 'samples': 4613376, 'steps': 24027, 'loss/train': 1.6594531536102295} 11/07/2021 00:36:27 - INFO - __main__ - Step 24029: {'lr': 0.0004731645298591656, 'samples': 4613568, 'steps': 24028, 'loss/train': 1.5065810680389404} 11/07/2021 00:36:28 - INFO - __main__ - Step 24030: {'lr': 0.0004731621378767624, 'samples': 4613760, 'steps': 24029, 'loss/train': 1.4267277717590332} 11/07/2021 00:36:28 - INFO - __main__ - Step 24031: {'lr': 0.0004731597457938059, 'samples': 4613952, 'steps': 24030, 'loss/train': 1.7742807865142822} 11/07/2021 00:36:29 - INFO - __main__ - Step 24032: {'lr': 0.000473157353610297, 'samples': 4614144, 'steps': 24031, 'loss/train': 1.6197173595428467} 11/07/2021 00:36:29 - INFO - __main__ - Step 24033: {'lr': 0.0004731549613262368, 'samples': 4614336, 'steps': 24032, 'loss/train': 1.5604989528656006} 11/07/2021 00:36:29 - INFO - __main__ - Step 24034: {'lr': 0.0004731525689416265, 'samples': 4614528, 'steps': 24033, 'loss/train': 2.0325191020965576} 11/07/2021 00:36:30 - INFO - __main__ - Step 24035: {'lr': 0.0004731501764564671, 'samples': 4614720, 'steps': 24034, 'loss/train': 1.6932367086410522} 11/07/2021 00:36:31 - INFO - __main__ - Step 24036: {'lr': 0.00047314778387075963, 'samples': 4614912, 'steps': 24035, 'loss/train': 0.7694287300109863} 11/07/2021 00:36:31 - INFO - __main__ - Step 24037: {'lr': 0.00047314539118450516, 'samples': 4615104, 'steps': 24036, 'loss/train': 1.7597320079803467} 11/07/2021 00:36:31 - INFO - __main__ - Step 24038: {'lr': 0.0004731429983977049, 'samples': 4615296, 'steps': 24037, 'loss/train': 1.971815586090088} 11/07/2021 00:36:32 - INFO - __main__ - Step 24039: {'lr': 0.00047314060551035983, 'samples': 4615488, 'steps': 24038, 'loss/train': 1.4033492803573608} 11/07/2021 00:36:32 - INFO - __main__ - Step 24040: {'lr': 0.00047313821252247104, 'samples': 4615680, 'steps': 24039, 'loss/train': 1.4914271831512451} 11/07/2021 00:36:33 - INFO - __main__ - Step 24041: {'lr': 0.00047313581943403963, 'samples': 4615872, 'steps': 24040, 'loss/train': 1.5378525257110596} 11/07/2021 00:36:33 - INFO - __main__ - Step 24042: {'lr': 0.0004731334262450666, 'samples': 4616064, 'steps': 24041, 'loss/train': 1.4748715162277222} 11/07/2021 00:36:34 - INFO - __main__ - Step 24043: {'lr': 0.00047313103295555317, 'samples': 4616256, 'steps': 24042, 'loss/train': 1.4187289476394653} 11/07/2021 00:36:34 - INFO - __main__ - Step 24044: {'lr': 0.0004731286395655003, 'samples': 4616448, 'steps': 24043, 'loss/train': 1.6412248611450195} 11/07/2021 00:36:34 - INFO - __main__ - Step 24045: {'lr': 0.00047312624607490913, 'samples': 4616640, 'steps': 24044, 'loss/train': 1.356187105178833} 11/07/2021 00:36:35 - INFO - __main__ - Step 24046: {'lr': 0.0004731238524837807, 'samples': 4616832, 'steps': 24045, 'loss/train': 1.3819315433502197} 11/07/2021 00:36:36 - INFO - __main__ - Step 24047: {'lr': 0.00047312145879211607, 'samples': 4617024, 'steps': 24046, 'loss/train': 1.857301115989685} 11/07/2021 00:36:36 - INFO - __main__ - Step 24048: {'lr': 0.0004731190649999164, 'samples': 4617216, 'steps': 24047, 'loss/train': 1.4999340772628784} 11/07/2021 00:36:36 - INFO - __main__ - Step 24049: {'lr': 0.0004731166711071827, 'samples': 4617408, 'steps': 24048, 'loss/train': 1.6041375398635864} 11/07/2021 00:36:37 - INFO - __main__ - Step 24050: {'lr': 0.0004731142771139161, 'samples': 4617600, 'steps': 24049, 'loss/train': 1.6878598928451538} 11/07/2021 00:36:38 - INFO - __main__ - Step 24051: {'lr': 0.00047311188302011766, 'samples': 4617792, 'steps': 24050, 'loss/train': 1.9721226692199707} 11/07/2021 00:36:38 - INFO - __main__ - Step 24052: {'lr': 0.00047310948882578843, 'samples': 4617984, 'steps': 24051, 'loss/train': 1.548930287361145} 11/07/2021 00:36:38 - INFO - __main__ - Step 24053: {'lr': 0.0004731070945309295, 'samples': 4618176, 'steps': 24052, 'loss/train': 1.5730412006378174} 11/07/2021 00:36:39 - INFO - __main__ - Step 24054: {'lr': 0.00047310470013554195, 'samples': 4618368, 'steps': 24053, 'loss/train': 1.6378589868545532} 11/07/2021 00:36:39 - INFO - __main__ - Step 24055: {'lr': 0.0004731023056396269, 'samples': 4618560, 'steps': 24054, 'loss/train': 1.4226047992706299} 11/07/2021 00:36:40 - INFO - __main__ - Step 24056: {'lr': 0.00047309991104318533, 'samples': 4618752, 'steps': 24055, 'loss/train': 1.3241724967956543} 11/07/2021 00:36:41 - INFO - __main__ - Step 24057: {'lr': 0.00047309751634621845, 'samples': 4618944, 'steps': 24056, 'loss/train': 1.4396579265594482} 11/07/2021 00:36:41 - INFO - __main__ - Step 24058: {'lr': 0.0004730951215487272, 'samples': 4619136, 'steps': 24057, 'loss/train': 1.5071772336959839} 11/07/2021 00:36:41 - INFO - __main__ - Step 24059: {'lr': 0.0004730927266507128, 'samples': 4619328, 'steps': 24058, 'loss/train': 1.7581125497817993} 11/07/2021 00:36:42 - INFO - __main__ - Step 24060: {'lr': 0.00047309033165217617, 'samples': 4619520, 'steps': 24059, 'loss/train': 1.6549838781356812} 11/07/2021 00:36:43 - INFO - __main__ - Step 24061: {'lr': 0.00047308793655311855, 'samples': 4619712, 'steps': 24060, 'loss/train': 1.6436907052993774} 11/07/2021 00:36:43 - INFO - __main__ - Step 24062: {'lr': 0.000473085541353541, 'samples': 4619904, 'steps': 24061, 'loss/train': 1.756529688835144} 11/07/2021 00:36:43 - INFO - __main__ - Step 24063: {'lr': 0.00047308314605344447, 'samples': 4620096, 'steps': 24062, 'loss/train': 1.4223600625991821} 11/07/2021 00:36:44 - INFO - __main__ - Step 24064: {'lr': 0.00047308075065283006, 'samples': 4620288, 'steps': 24063, 'loss/train': 1.6613905429840088} 11/07/2021 00:36:44 - INFO - __main__ - Step 24065: {'lr': 0.00047307835515169905, 'samples': 4620480, 'steps': 24064, 'loss/train': 1.8377280235290527} 11/07/2021 00:36:45 - INFO - __main__ - Step 24066: {'lr': 0.00047307595955005226, 'samples': 4620672, 'steps': 24065, 'loss/train': 1.2982749938964844} 11/07/2021 00:36:45 - INFO - __main__ - Step 24067: {'lr': 0.000473073563847891, 'samples': 4620864, 'steps': 24066, 'loss/train': 1.9799203872680664} 11/07/2021 00:36:46 - INFO - __main__ - Step 24068: {'lr': 0.0004730711680452161, 'samples': 4621056, 'steps': 24067, 'loss/train': 1.3927499055862427} 11/07/2021 00:36:46 - INFO - __main__ - Step 24069: {'lr': 0.00047306877214202885, 'samples': 4621248, 'steps': 24068, 'loss/train': 1.6838091611862183} 11/07/2021 00:36:46 - INFO - __main__ - Step 24070: {'lr': 0.00047306637613833024, 'samples': 4621440, 'steps': 24069, 'loss/train': 1.380083441734314} 11/07/2021 00:36:47 - INFO - __main__ - Step 24071: {'lr': 0.00047306398003412137, 'samples': 4621632, 'steps': 24070, 'loss/train': 2.2692599296569824} 11/07/2021 00:36:48 - INFO - __main__ - Step 24072: {'lr': 0.00047306158382940327, 'samples': 4621824, 'steps': 24071, 'loss/train': 1.4450585842132568} 11/07/2021 00:36:48 - INFO - __main__ - Step 24073: {'lr': 0.0004730591875241771, 'samples': 4622016, 'steps': 24072, 'loss/train': 1.9368537664413452} 11/07/2021 00:36:48 - INFO - __main__ - Step 24074: {'lr': 0.0004730567911184439, 'samples': 4622208, 'steps': 24073, 'loss/train': 1.387794852256775} 11/07/2021 00:36:49 - INFO - __main__ - Step 24075: {'lr': 0.00047305439461220477, 'samples': 4622400, 'steps': 24074, 'loss/train': 1.5761562585830688} 11/07/2021 00:36:49 - INFO - __main__ - Step 24076: {'lr': 0.00047305199800546077, 'samples': 4622592, 'steps': 24075, 'loss/train': 1.4723130464553833} 11/07/2021 00:36:50 - INFO - __main__ - Step 24077: {'lr': 0.00047304960129821295, 'samples': 4622784, 'steps': 24076, 'loss/train': 1.2436383962631226} 11/07/2021 00:36:51 - INFO - __main__ - Step 24078: {'lr': 0.00047304720449046247, 'samples': 4622976, 'steps': 24077, 'loss/train': 1.5036218166351318} 11/07/2021 00:36:51 - INFO - __main__ - Step 24079: {'lr': 0.0004730448075822103, 'samples': 4623168, 'steps': 24078, 'loss/train': 1.3758183717727661} 11/07/2021 00:36:52 - INFO - __main__ - Step 24080: {'lr': 0.0004730424105734576, 'samples': 4623360, 'steps': 24079, 'loss/train': 1.9877567291259766} 11/07/2021 00:36:52 - INFO - __main__ - Step 24081: {'lr': 0.00047304001346420543, 'samples': 4623552, 'steps': 24080, 'loss/train': 0.17084410786628723} 11/07/2021 00:36:53 - INFO - __main__ - Step 24082: {'lr': 0.0004730376162544549, 'samples': 4623744, 'steps': 24081, 'loss/train': 1.7236641645431519} 11/07/2021 00:36:53 - INFO - __main__ - Step 24083: {'lr': 0.00047303521894420707, 'samples': 4623936, 'steps': 24082, 'loss/train': 1.7466765642166138} 11/07/2021 00:36:54 - INFO - __main__ - Step 24084: {'lr': 0.00047303282153346297, 'samples': 4624128, 'steps': 24083, 'loss/train': 1.492013931274414} 11/07/2021 00:36:54 - INFO - __main__ - Step 24085: {'lr': 0.00047303042402222373, 'samples': 4624320, 'steps': 24084, 'loss/train': 1.3853580951690674} 11/07/2021 00:36:54 - INFO - __main__ - Step 24086: {'lr': 0.00047302802641049045, 'samples': 4624512, 'steps': 24085, 'loss/train': 1.2335377931594849} 11/07/2021 00:36:56 - INFO - __main__ - Step 24087: {'lr': 0.00047302562869826415, 'samples': 4624704, 'steps': 24086, 'loss/train': 1.3577960729599} 11/07/2021 00:36:56 - INFO - __main__ - Step 24088: {'lr': 0.000473023230885546, 'samples': 4624896, 'steps': 24087, 'loss/train': 1.374523639678955} 11/07/2021 00:36:56 - INFO - __main__ - Step 24089: {'lr': 0.00047302083297233693, 'samples': 4625088, 'steps': 24088, 'loss/train': 0.7093602418899536} 11/07/2021 00:36:57 - INFO - __main__ - Step 24090: {'lr': 0.0004730184349586382, 'samples': 4625280, 'steps': 24089, 'loss/train': 0.1390826255083084} 11/07/2021 00:36:57 - INFO - __main__ - Step 24091: {'lr': 0.0004730160368444507, 'samples': 4625472, 'steps': 24090, 'loss/train': 2.2058796882629395} 11/07/2021 00:36:58 - INFO - __main__ - Step 24092: {'lr': 0.00047301363862977574, 'samples': 4625664, 'steps': 24091, 'loss/train': 1.3440006971359253} 11/07/2021 00:36:58 - INFO - __main__ - Step 24093: {'lr': 0.00047301124031461425, 'samples': 4625856, 'steps': 24092, 'loss/train': 1.485878825187683} 11/07/2021 00:36:59 - INFO - __main__ - Step 24094: {'lr': 0.00047300884189896734, 'samples': 4626048, 'steps': 24093, 'loss/train': 1.568880319595337} 11/07/2021 00:36:59 - INFO - __main__ - Step 24095: {'lr': 0.00047300644338283597, 'samples': 4626240, 'steps': 24094, 'loss/train': 1.0832172632217407} 11/07/2021 00:36:59 - INFO - __main__ - Step 24096: {'lr': 0.00047300404476622145, 'samples': 4626432, 'steps': 24095, 'loss/train': 2.009178638458252} 11/07/2021 00:37:00 - INFO - __main__ - Step 24097: {'lr': 0.0004730016460491247, 'samples': 4626624, 'steps': 24096, 'loss/train': 1.7144906520843506} 11/07/2021 00:37:01 - INFO - __main__ - Step 24098: {'lr': 0.00047299924723154686, 'samples': 4626816, 'steps': 24097, 'loss/train': 1.5741411447525024} 11/07/2021 00:37:01 - INFO - __main__ - Step 24099: {'lr': 0.000472996848313489, 'samples': 4627008, 'steps': 24098, 'loss/train': 1.6352746486663818} 11/07/2021 00:37:01 - INFO - __main__ - Step 24100: {'lr': 0.0004729944492949523, 'samples': 4627200, 'steps': 24099, 'loss/train': 1.3924369812011719} 11/07/2021 00:37:02 - INFO - __main__ - Step 24101: {'lr': 0.0004729920501759376, 'samples': 4627392, 'steps': 24100, 'loss/train': 1.5177526473999023} 11/07/2021 00:37:03 - INFO - __main__ - Step 24102: {'lr': 0.0004729896509564462, 'samples': 4627584, 'steps': 24101, 'loss/train': 1.8052873611450195} 11/07/2021 00:37:03 - INFO - __main__ - Step 24103: {'lr': 0.00047298725163647903, 'samples': 4627776, 'steps': 24102, 'loss/train': 1.3988844156265259} 11/07/2021 00:37:03 - INFO - __main__ - Step 24104: {'lr': 0.00047298485221603735, 'samples': 4627968, 'steps': 24103, 'loss/train': 1.4751757383346558} 11/07/2021 00:37:04 - INFO - __main__ - Step 24105: {'lr': 0.0004729824526951221, 'samples': 4628160, 'steps': 24104, 'loss/train': 1.4389196634292603} 11/07/2021 00:37:04 - INFO - __main__ - Step 24106: {'lr': 0.0004729800530737344, 'samples': 4628352, 'steps': 24105, 'loss/train': 1.3307538032531738} 11/07/2021 00:37:05 - INFO - __main__ - Step 24107: {'lr': 0.0004729776533518753, 'samples': 4628544, 'steps': 24106, 'loss/train': 1.9035096168518066} 11/07/2021 00:37:06 - INFO - __main__ - Step 24108: {'lr': 0.00047297525352954587, 'samples': 4628736, 'steps': 24107, 'loss/train': 1.3840570449829102} 11/07/2021 00:37:06 - INFO - __main__ - Step 24109: {'lr': 0.00047297285360674724, 'samples': 4628928, 'steps': 24108, 'loss/train': 1.2842854261398315} 11/07/2021 00:37:06 - INFO - __main__ - Step 24110: {'lr': 0.0004729704535834806, 'samples': 4629120, 'steps': 24109, 'loss/train': 1.7016844749450684} 11/07/2021 00:37:07 - INFO - __main__ - Step 24111: {'lr': 0.0004729680534597468, 'samples': 4629312, 'steps': 24110, 'loss/train': 1.310340166091919} 11/07/2021 00:37:08 - INFO - __main__ - Step 24112: {'lr': 0.0004729656532355471, 'samples': 4629504, 'steps': 24111, 'loss/train': 1.403632640838623} 11/07/2021 00:37:08 - INFO - __main__ - Step 24113: {'lr': 0.00047296325291088247, 'samples': 4629696, 'steps': 24112, 'loss/train': 1.713728666305542} 11/07/2021 00:37:08 - INFO - __main__ - Step 24114: {'lr': 0.00047296085248575405, 'samples': 4629888, 'steps': 24113, 'loss/train': 2.0353336334228516} 11/07/2021 00:37:09 - INFO - __main__ - Step 24115: {'lr': 0.000472958451960163, 'samples': 4630080, 'steps': 24114, 'loss/train': 1.6103174686431885} 11/07/2021 00:37:09 - INFO - __main__ - Step 24116: {'lr': 0.0004729560513341101, 'samples': 4630272, 'steps': 24115, 'loss/train': 1.0083428621292114} 11/07/2021 00:37:10 - INFO - __main__ - Step 24117: {'lr': 0.0004729536506075969, 'samples': 4630464, 'steps': 24116, 'loss/train': 1.5582096576690674} 11/07/2021 00:37:10 - INFO - __main__ - Step 24118: {'lr': 0.000472951249780624, 'samples': 4630656, 'steps': 24117, 'loss/train': 1.225318193435669} 11/07/2021 00:37:11 - INFO - __main__ - Step 24119: {'lr': 0.0004729488488531928, 'samples': 4630848, 'steps': 24118, 'loss/train': 1.7452466487884521} 11/07/2021 00:37:11 - INFO - __main__ - Step 24120: {'lr': 0.00047294644782530437, 'samples': 4631040, 'steps': 24119, 'loss/train': 1.7983100414276123} 11/07/2021 00:37:11 - INFO - __main__ - Step 24121: {'lr': 0.0004729440466969596, 'samples': 4631232, 'steps': 24120, 'loss/train': 1.7422473430633545} 11/07/2021 00:37:12 - INFO - __main__ - Step 24122: {'lr': 0.00047294164546815977, 'samples': 4631424, 'steps': 24121, 'loss/train': 1.831278920173645} 11/07/2021 00:37:13 - INFO - __main__ - Step 24123: {'lr': 0.0004729392441389058, 'samples': 4631616, 'steps': 24122, 'loss/train': 1.5722295045852661} 11/07/2021 00:37:13 - INFO - __main__ - Step 24124: {'lr': 0.0004729368427091989, 'samples': 4631808, 'steps': 24123, 'loss/train': 1.10862398147583} 11/07/2021 00:37:14 - INFO - __main__ - Step 24125: {'lr': 0.0004729344411790401, 'samples': 4632000, 'steps': 24124, 'loss/train': 1.5833975076675415} 11/07/2021 00:37:14 - INFO - __main__ - Step 24126: {'lr': 0.00047293203954843036, 'samples': 4632192, 'steps': 24125, 'loss/train': 1.5960149765014648} 11/07/2021 00:37:14 - INFO - __main__ - Step 24127: {'lr': 0.000472929637817371, 'samples': 4632384, 'steps': 24126, 'loss/train': 1.4709511995315552} 11/07/2021 00:37:15 - INFO - __main__ - Step 24128: {'lr': 0.00047292723598586295, 'samples': 4632576, 'steps': 24127, 'loss/train': 1.7393834590911865} 11/07/2021 00:37:16 - INFO - __main__ - Step 24129: {'lr': 0.0004729248340539074, 'samples': 4632768, 'steps': 24128, 'loss/train': 1.2329339981079102} 11/07/2021 00:37:16 - INFO - __main__ - Step 24130: {'lr': 0.00047292243202150524, 'samples': 4632960, 'steps': 24129, 'loss/train': 1.3833272457122803} 11/07/2021 00:37:16 - INFO - __main__ - Step 24131: {'lr': 0.00047292002988865773, 'samples': 4633152, 'steps': 24130, 'loss/train': 1.295301914215088} 11/07/2021 00:37:17 - INFO - __main__ - Step 24132: {'lr': 0.0004729176276553659, 'samples': 4633344, 'steps': 24131, 'loss/train': 1.6785240173339844} 11/07/2021 00:37:18 - INFO - __main__ - Step 24133: {'lr': 0.00047291522532163084, 'samples': 4633536, 'steps': 24132, 'loss/train': 0.7390657067298889} 11/07/2021 00:37:18 - INFO - __main__ - Step 24134: {'lr': 0.0004729128228874536, 'samples': 4633728, 'steps': 24133, 'loss/train': 1.5403138399124146} 11/07/2021 00:37:18 - INFO - __main__ - Step 24135: {'lr': 0.0004729104203528353, 'samples': 4633920, 'steps': 24134, 'loss/train': 1.6307657957077026} 11/07/2021 00:37:19 - INFO - __main__ - Step 24136: {'lr': 0.0004729080177177769, 'samples': 4634112, 'steps': 24135, 'loss/train': 1.8441475629806519} 11/07/2021 00:37:19 - INFO - __main__ - Step 24137: {'lr': 0.0004729056149822797, 'samples': 4634304, 'steps': 24136, 'loss/train': 1.516968011856079} 11/07/2021 00:37:20 - INFO - __main__ - Step 24138: {'lr': 0.0004729032121463447, 'samples': 4634496, 'steps': 24137, 'loss/train': 2.146589517593384} 11/07/2021 00:37:20 - INFO - __main__ - Step 24139: {'lr': 0.00047290080920997285, 'samples': 4634688, 'steps': 24138, 'loss/train': 1.5535197257995605} 11/07/2021 00:37:21 - INFO - __main__ - Step 24140: {'lr': 0.0004728984061731654, 'samples': 4634880, 'steps': 24139, 'loss/train': 1.5095983743667603} 11/07/2021 00:37:21 - INFO - __main__ - Step 24141: {'lr': 0.00047289600303592334, 'samples': 4635072, 'steps': 24140, 'loss/train': 1.7222843170166016} 11/07/2021 00:37:22 - INFO - __main__ - Step 24142: {'lr': 0.00047289359979824774, 'samples': 4635264, 'steps': 24141, 'loss/train': 1.3944061994552612} 11/07/2021 00:37:23 - INFO - __main__ - Step 24143: {'lr': 0.0004728911964601398, 'samples': 4635456, 'steps': 24142, 'loss/train': 1.880142092704773} 11/07/2021 00:37:23 - INFO - __main__ - Step 24144: {'lr': 0.00047288879302160046, 'samples': 4635648, 'steps': 24143, 'loss/train': 1.4885319471359253} 11/07/2021 00:37:24 - INFO - __main__ - Step 24145: {'lr': 0.000472886389482631, 'samples': 4635840, 'steps': 24144, 'loss/train': 1.4777766466140747} 11/07/2021 00:37:24 - INFO - __main__ - Step 24146: {'lr': 0.00047288398584323225, 'samples': 4636032, 'steps': 24145, 'loss/train': 1.4003963470458984} 11/07/2021 00:37:24 - INFO - __main__ - Step 24147: {'lr': 0.0004728815821034055, 'samples': 4636224, 'steps': 24146, 'loss/train': 1.7480651140213013} 11/07/2021 00:37:25 - INFO - __main__ - Step 24148: {'lr': 0.00047287917826315163, 'samples': 4636416, 'steps': 24147, 'loss/train': 1.4331068992614746} 11/07/2021 00:37:26 - INFO - __main__ - Step 24149: {'lr': 0.00047287677432247187, 'samples': 4636608, 'steps': 24148, 'loss/train': 0.24684494733810425} 11/07/2021 00:37:26 - INFO - __main__ - Step 24150: {'lr': 0.0004728743702813674, 'samples': 4636800, 'steps': 24149, 'loss/train': 1.3840608596801758} 11/07/2021 00:37:26 - INFO - __main__ - Step 24151: {'lr': 0.00047287196613983906, 'samples': 4636992, 'steps': 24150, 'loss/train': 1.2256970405578613} 11/07/2021 00:37:27 - INFO - __main__ - Step 24152: {'lr': 0.00047286956189788803, 'samples': 4637184, 'steps': 24151, 'loss/train': 1.6570688486099243} 11/07/2021 00:37:27 - INFO - __main__ - Step 24153: {'lr': 0.0004728671575555155, 'samples': 4637376, 'steps': 24152, 'loss/train': 1.3276275396347046} 11/07/2021 00:37:28 - INFO - __main__ - Step 24154: {'lr': 0.00047286475311272244, 'samples': 4637568, 'steps': 24153, 'loss/train': 1.9737012386322021} 11/07/2021 00:37:28 - INFO - __main__ - Step 24155: {'lr': 0.00047286234856950995, 'samples': 4637760, 'steps': 24154, 'loss/train': 1.142815351486206} 11/07/2021 00:37:29 - INFO - __main__ - Step 24156: {'lr': 0.0004728599439258791, 'samples': 4637952, 'steps': 24155, 'loss/train': 1.1040961742401123} 11/07/2021 00:37:29 - INFO - __main__ - Step 24157: {'lr': 0.00047285753918183105, 'samples': 4638144, 'steps': 24156, 'loss/train': 1.6647619009017944} 11/07/2021 00:37:30 - INFO - __main__ - Step 24158: {'lr': 0.0004728551343373668, 'samples': 4638336, 'steps': 24157, 'loss/train': 1.5001345872879028} 11/07/2021 00:37:31 - INFO - __main__ - Step 24159: {'lr': 0.0004728527293924875, 'samples': 4638528, 'steps': 24158, 'loss/train': 1.7175570726394653} 11/07/2021 00:37:31 - INFO - __main__ - Step 24160: {'lr': 0.0004728503243471941, 'samples': 4638720, 'steps': 24159, 'loss/train': 2.3899855613708496} 11/07/2021 00:37:32 - INFO - __main__ - Step 24161: {'lr': 0.00047284791920148786, 'samples': 4638912, 'steps': 24160, 'loss/train': 2.2988929748535156} 11/07/2021 00:37:32 - INFO - __main__ - Step 24162: {'lr': 0.0004728455139553698, 'samples': 4639104, 'steps': 24161, 'loss/train': 1.3547323942184448} 11/07/2021 00:37:32 - INFO - __main__ - Step 24163: {'lr': 0.00047284310860884097, 'samples': 4639296, 'steps': 24162, 'loss/train': 1.4184315204620361} 11/07/2021 00:37:33 - INFO - __main__ - Step 24164: {'lr': 0.0004728407031619025, 'samples': 4639488, 'steps': 24163, 'loss/train': 1.7893747091293335} 11/07/2021 00:37:34 - INFO - __main__ - Step 24165: {'lr': 0.00047283829761455545, 'samples': 4639680, 'steps': 24164, 'loss/train': 1.2855308055877686} 11/07/2021 00:37:34 - INFO - __main__ - Step 24166: {'lr': 0.00047283589196680083, 'samples': 4639872, 'steps': 24165, 'loss/train': 1.3909536600112915} 11/07/2021 00:37:35 - INFO - __main__ - Step 24167: {'lr': 0.00047283348621863987, 'samples': 4640064, 'steps': 24166, 'loss/train': 1.9475990533828735} 11/07/2021 00:37:35 - INFO - __main__ - Step 24168: {'lr': 0.0004728310803700735, 'samples': 4640256, 'steps': 24167, 'loss/train': 1.5293444395065308} 11/07/2021 00:37:35 - INFO - __main__ - Step 24169: {'lr': 0.00047282867442110296, 'samples': 4640448, 'steps': 24168, 'loss/train': 1.4277057647705078} 11/07/2021 00:37:36 - INFO - __main__ - Step 24170: {'lr': 0.0004728262683717292, 'samples': 4640640, 'steps': 24169, 'loss/train': 1.4179643392562866} 11/07/2021 00:37:37 - INFO - __main__ - Step 24171: {'lr': 0.0004728238622219534, 'samples': 4640832, 'steps': 24170, 'loss/train': 1.114132046699524} 11/07/2021 00:37:37 - INFO - __main__ - Step 24172: {'lr': 0.0004728214559717766, 'samples': 4641024, 'steps': 24171, 'loss/train': 0.9853553771972656} 11/07/2021 00:37:37 - INFO - __main__ - Step 24173: {'lr': 0.0004728190496211999, 'samples': 4641216, 'steps': 24172, 'loss/train': 1.9363664388656616} 11/07/2021 00:37:38 - INFO - __main__ - Step 24174: {'lr': 0.0004728166431702243, 'samples': 4641408, 'steps': 24173, 'loss/train': 1.4757331609725952} 11/07/2021 00:37:39 - INFO - __main__ - Step 24175: {'lr': 0.0004728142366188511, 'samples': 4641600, 'steps': 24174, 'loss/train': 1.9317572116851807} 11/07/2021 00:37:39 - INFO - __main__ - Step 24176: {'lr': 0.0004728118299670812, 'samples': 4641792, 'steps': 24175, 'loss/train': 1.3534637689590454} 11/07/2021 00:37:40 - INFO - __main__ - Step 24177: {'lr': 0.0004728094232149156, 'samples': 4641984, 'steps': 24176, 'loss/train': 1.609655499458313} 11/07/2021 00:37:40 - INFO - __main__ - Step 24178: {'lr': 0.0004728070163623557, 'samples': 4642176, 'steps': 24177, 'loss/train': 1.2836956977844238} 11/07/2021 00:37:40 - INFO - __main__ - Step 24179: {'lr': 0.00047280460940940224, 'samples': 4642368, 'steps': 24178, 'loss/train': 2.0768063068389893} 11/07/2021 00:37:41 - INFO - __main__ - Step 24180: {'lr': 0.00047280220235605653, 'samples': 4642560, 'steps': 24179, 'loss/train': 1.5663083791732788} 11/07/2021 00:37:42 - INFO - __main__ - Step 24181: {'lr': 0.00047279979520231956, 'samples': 4642752, 'steps': 24180, 'loss/train': 1.7623138427734375} 11/07/2021 00:37:42 - INFO - __main__ - Step 24182: {'lr': 0.0004727973879481925, 'samples': 4642944, 'steps': 24181, 'loss/train': 1.9315279722213745} 11/07/2021 00:37:42 - INFO - __main__ - Step 24183: {'lr': 0.0004727949805936763, 'samples': 4643136, 'steps': 24182, 'loss/train': 1.6259862184524536} 11/07/2021 00:37:43 - INFO - __main__ - Step 24184: {'lr': 0.00047279257313877216, 'samples': 4643328, 'steps': 24183, 'loss/train': 1.3279659748077393} 11/07/2021 00:37:44 - INFO - __main__ - Step 24185: {'lr': 0.00047279016558348107, 'samples': 4643520, 'steps': 24184, 'loss/train': 1.1774322986602783} 11/07/2021 00:37:45 - INFO - __main__ - Step 24186: {'lr': 0.00047278775792780424, 'samples': 4643712, 'steps': 24185, 'loss/train': 1.7688424587249756} 11/07/2021 00:37:45 - INFO - __main__ - Step 24187: {'lr': 0.00047278535017174266, 'samples': 4643904, 'steps': 24186, 'loss/train': 1.6855511665344238} 11/07/2021 00:37:45 - INFO - __main__ - Step 24188: {'lr': 0.00047278294231529745, 'samples': 4644096, 'steps': 24187, 'loss/train': 1.364418387413025} 11/07/2021 00:37:46 - INFO - __main__ - Step 24189: {'lr': 0.0004727805343584697, 'samples': 4644288, 'steps': 24188, 'loss/train': 1.5109294652938843} 11/07/2021 00:37:47 - INFO - __main__ - Step 24190: {'lr': 0.00047277812630126044, 'samples': 4644480, 'steps': 24189, 'loss/train': 0.2451678216457367} 11/07/2021 00:37:47 - INFO - __main__ - Step 24191: {'lr': 0.0004727757181436708, 'samples': 4644672, 'steps': 24190, 'loss/train': 1.7502609491348267} 11/07/2021 00:37:47 - INFO - __main__ - Step 24192: {'lr': 0.0004727733098857019, 'samples': 4644864, 'steps': 24191, 'loss/train': 1.4970722198486328} 11/07/2021 00:37:48 - INFO - __main__ - Step 24193: {'lr': 0.0004727709015273547, 'samples': 4645056, 'steps': 24192, 'loss/train': 1.2383203506469727} 11/07/2021 00:37:48 - INFO - __main__ - Step 24194: {'lr': 0.00047276849306863045, 'samples': 4645248, 'steps': 24193, 'loss/train': 1.1926645040512085} 11/07/2021 00:37:49 - INFO - __main__ - Step 24195: {'lr': 0.0004727660845095301, 'samples': 4645440, 'steps': 24194, 'loss/train': 0.7111749649047852} 11/07/2021 00:37:49 - INFO - __main__ - Step 24196: {'lr': 0.0004727636758500548, 'samples': 4645632, 'steps': 24195, 'loss/train': 2.057713031768799} 11/07/2021 00:37:50 - INFO - __main__ - Step 24197: {'lr': 0.0004727612670902057, 'samples': 4645824, 'steps': 24196, 'loss/train': 1.836399793624878} 11/07/2021 00:37:50 - INFO - __main__ - Step 24198: {'lr': 0.0004727588582299837, 'samples': 4646016, 'steps': 24197, 'loss/train': 1.88862943649292} 11/07/2021 00:37:50 - INFO - __main__ - Step 24199: {'lr': 0.00047275644926939004, 'samples': 4646208, 'steps': 24198, 'loss/train': 2.0900471210479736} 11/07/2021 00:37:51 - INFO - __main__ - Step 24200: {'lr': 0.0004727540402084258, 'samples': 4646400, 'steps': 24199, 'loss/train': 1.6688822507858276} 11/07/2021 00:37:52 - INFO - __main__ - Step 24201: {'lr': 0.00047275163104709196, 'samples': 4646592, 'steps': 24200, 'loss/train': 1.583453893661499} 11/07/2021 00:37:52 - INFO - __main__ - Step 24202: {'lr': 0.0004727492217853897, 'samples': 4646784, 'steps': 24201, 'loss/train': 1.6207613945007324} 11/07/2021 00:37:53 - INFO - __main__ - Step 24203: {'lr': 0.0004727468124233201, 'samples': 4646976, 'steps': 24202, 'loss/train': 1.7906291484832764} 11/07/2021 00:37:53 - INFO - __main__ - Step 24204: {'lr': 0.0004727444029608842, 'samples': 4647168, 'steps': 24203, 'loss/train': 1.4829760789871216} 11/07/2021 00:37:53 - INFO - __main__ - Step 24205: {'lr': 0.0004727419933980831, 'samples': 4647360, 'steps': 24204, 'loss/train': 1.8701109886169434} 11/07/2021 00:37:54 - INFO - __main__ - Step 24206: {'lr': 0.00047273958373491795, 'samples': 4647552, 'steps': 24205, 'loss/train': 0.45274531841278076} 11/07/2021 00:37:55 - INFO - __main__ - Step 24207: {'lr': 0.0004727371739713897, 'samples': 4647744, 'steps': 24206, 'loss/train': 2.36594820022583} 11/07/2021 00:37:55 - INFO - __main__ - Step 24208: {'lr': 0.0004727347641074996, 'samples': 4647936, 'steps': 24207, 'loss/train': 1.223652958869934} 11/07/2021 00:37:55 - INFO - __main__ - Step 24209: {'lr': 0.0004727323541432486, 'samples': 4648128, 'steps': 24208, 'loss/train': 1.4723137617111206} 11/07/2021 00:37:56 - INFO - __main__ - Step 24210: {'lr': 0.0004727299440786378, 'samples': 4648320, 'steps': 24209, 'loss/train': 1.9725641012191772} 11/07/2021 00:37:57 - INFO - __main__ - Step 24211: {'lr': 0.0004727275339136684, 'samples': 4648512, 'steps': 24210, 'loss/train': 1.6833373308181763} 11/07/2021 00:37:57 - INFO - __main__ - Step 24212: {'lr': 0.0004727251236483414, 'samples': 4648704, 'steps': 24211, 'loss/train': 1.6619133949279785} 11/07/2021 00:37:58 - INFO - __main__ - Step 24213: {'lr': 0.0004727227132826579, 'samples': 4648896, 'steps': 24212, 'loss/train': 1.7764638662338257} 11/07/2021 00:37:58 - INFO - __main__ - Step 24214: {'lr': 0.00047272030281661894, 'samples': 4649088, 'steps': 24213, 'loss/train': 1.306481957435608} 11/07/2021 00:37:58 - INFO - __main__ - Step 24215: {'lr': 0.0004727178922502257, 'samples': 4649280, 'steps': 24214, 'loss/train': 1.9462131261825562} 11/07/2021 00:37:59 - INFO - __main__ - Step 24216: {'lr': 0.00047271548158347917, 'samples': 4649472, 'steps': 24215, 'loss/train': 1.5835012197494507} 11/07/2021 00:38:00 - INFO - __main__ - Step 24217: {'lr': 0.00047271307081638047, 'samples': 4649664, 'steps': 24216, 'loss/train': 0.829211950302124} 11/07/2021 00:38:00 - INFO - __main__ - Step 24218: {'lr': 0.0004727106599489307, 'samples': 4649856, 'steps': 24217, 'loss/train': 1.5512961149215698} 11/07/2021 00:38:00 - INFO - __main__ - Step 24219: {'lr': 0.000472708248981131, 'samples': 4650048, 'steps': 24218, 'loss/train': 1.442055106163025} 11/07/2021 00:38:01 - INFO - __main__ - Step 24220: {'lr': 0.0004727058379129824, 'samples': 4650240, 'steps': 24219, 'loss/train': 1.6307088136672974} 11/07/2021 00:38:02 - INFO - __main__ - Step 24221: {'lr': 0.00047270342674448593, 'samples': 4650432, 'steps': 24220, 'loss/train': 1.9180172681808472} 11/07/2021 00:38:02 - INFO - __main__ - Step 24222: {'lr': 0.0004727010154756427, 'samples': 4650624, 'steps': 24221, 'loss/train': 1.7712818384170532} 11/07/2021 00:38:03 - INFO - __main__ - Step 24223: {'lr': 0.00047269860410645395, 'samples': 4650816, 'steps': 24222, 'loss/train': 1.4384461641311646} 11/07/2021 00:38:03 - INFO - __main__ - Step 24224: {'lr': 0.00047269619263692056, 'samples': 4651008, 'steps': 24223, 'loss/train': 1.6635693311691284} 11/07/2021 00:38:03 - INFO - __main__ - Step 24225: {'lr': 0.0004726937810670437, 'samples': 4651200, 'steps': 24224, 'loss/train': 1.4228426218032837} 11/07/2021 00:38:04 - INFO - __main__ - Step 24226: {'lr': 0.00047269136939682445, 'samples': 4651392, 'steps': 24225, 'loss/train': 1.688468337059021} 11/07/2021 00:38:05 - INFO - __main__ - Step 24227: {'lr': 0.00047268895762626396, 'samples': 4651584, 'steps': 24226, 'loss/train': 1.0655486583709717} 11/07/2021 00:38:05 - INFO - __main__ - Step 24228: {'lr': 0.00047268654575536326, 'samples': 4651776, 'steps': 24227, 'loss/train': 1.910609245300293} 11/07/2021 00:38:05 - INFO - __main__ - Step 24229: {'lr': 0.0004726841337841234, 'samples': 4651968, 'steps': 24228, 'loss/train': 1.3337944746017456} 11/07/2021 00:38:06 - INFO - __main__ - Step 24230: {'lr': 0.00047268172171254554, 'samples': 4652160, 'steps': 24229, 'loss/train': 1.2254079580307007} 11/07/2021 00:38:07 - INFO - __main__ - Step 24231: {'lr': 0.00047267930954063064, 'samples': 4652352, 'steps': 24230, 'loss/train': 1.4541743993759155} 11/07/2021 00:38:07 - INFO - __main__ - Step 24232: {'lr': 0.00047267689726838004, 'samples': 4652544, 'steps': 24231, 'loss/train': 1.456627607345581} 11/07/2021 00:38:07 - INFO - __main__ - Step 24233: {'lr': 0.00047267448489579455, 'samples': 4652736, 'steps': 24232, 'loss/train': 1.5871590375900269} 11/07/2021 00:38:08 - INFO - __main__ - Step 24234: {'lr': 0.00047267207242287536, 'samples': 4652928, 'steps': 24233, 'loss/train': 1.5154452323913574} 11/07/2021 00:38:08 - INFO - __main__ - Step 24235: {'lr': 0.0004726696598496236, 'samples': 4653120, 'steps': 24234, 'loss/train': 1.6475778818130493} 11/07/2021 00:38:09 - INFO - __main__ - Step 24236: {'lr': 0.0004726672471760404, 'samples': 4653312, 'steps': 24235, 'loss/train': 1.373233675956726} 11/07/2021 00:38:09 - INFO - __main__ - Step 24237: {'lr': 0.0004726648344021267, 'samples': 4653504, 'steps': 24236, 'loss/train': 1.330606460571289} 11/07/2021 00:38:10 - INFO - __main__ - Step 24238: {'lr': 0.0004726624215278836, 'samples': 4653696, 'steps': 24237, 'loss/train': 1.8774495124816895} 11/07/2021 00:38:10 - INFO - __main__ - Step 24239: {'lr': 0.0004726600085533124, 'samples': 4653888, 'steps': 24238, 'loss/train': 1.6375892162322998} 11/07/2021 00:38:10 - INFO - __main__ - Step 24240: {'lr': 0.0004726575954784139, 'samples': 4654080, 'steps': 24239, 'loss/train': 1.302065372467041} 11/07/2021 00:38:12 - INFO - __main__ - Step 24241: {'lr': 0.0004726551823031894, 'samples': 4654272, 'steps': 24240, 'loss/train': 1.6037311553955078} 11/07/2021 00:38:12 - INFO - __main__ - Step 24242: {'lr': 0.0004726527690276399, 'samples': 4654464, 'steps': 24241, 'loss/train': 1.7429577112197876} 11/07/2021 00:38:12 - INFO - __main__ - Step 24243: {'lr': 0.0004726503556517665, 'samples': 4654656, 'steps': 24242, 'loss/train': 1.6608726978302002} 11/07/2021 00:38:13 - INFO - __main__ - Step 24244: {'lr': 0.0004726479421755703, 'samples': 4654848, 'steps': 24243, 'loss/train': 1.3861039876937866} 11/07/2021 00:38:13 - INFO - __main__ - Step 24245: {'lr': 0.0004726455285990523, 'samples': 4655040, 'steps': 24244, 'loss/train': 1.6974471807479858} 11/07/2021 00:38:14 - INFO - __main__ - Step 24246: {'lr': 0.00047264311492221375, 'samples': 4655232, 'steps': 24245, 'loss/train': 1.9845205545425415} 11/07/2021 00:38:14 - INFO - __main__ - Step 24247: {'lr': 0.00047264070114505556, 'samples': 4655424, 'steps': 24246, 'loss/train': 1.592573642730713} 11/07/2021 00:38:15 - INFO - __main__ - Step 24248: {'lr': 0.00047263828726757897, 'samples': 4655616, 'steps': 24247, 'loss/train': 1.737914800643921} 11/07/2021 00:38:15 - INFO - __main__ - Step 24249: {'lr': 0.00047263587328978495, 'samples': 4655808, 'steps': 24248, 'loss/train': 0.9708447456359863} 11/07/2021 00:38:15 - INFO - __main__ - Step 24250: {'lr': 0.00047263345921167473, 'samples': 4656000, 'steps': 24249, 'loss/train': 1.1999269723892212} 11/07/2021 00:38:17 - INFO - __main__ - Step 24251: {'lr': 0.00047263104503324926, 'samples': 4656192, 'steps': 24250, 'loss/train': 2.622537851333618} 11/07/2021 00:38:17 - INFO - __main__ - Step 24252: {'lr': 0.00047262863075450966, 'samples': 4656384, 'steps': 24251, 'loss/train': 1.4589124917984009} 11/07/2021 00:38:17 - INFO - __main__ - Step 24253: {'lr': 0.0004726262163754571, 'samples': 4656576, 'steps': 24252, 'loss/train': 1.5837563276290894} 11/07/2021 00:38:18 - INFO - __main__ - Step 24254: {'lr': 0.00047262380189609253, 'samples': 4656768, 'steps': 24253, 'loss/train': 1.8602099418640137} 11/07/2021 00:38:18 - INFO - __main__ - Step 24255: {'lr': 0.0004726213873164171, 'samples': 4656960, 'steps': 24254, 'loss/train': 1.4900504350662231} 11/07/2021 00:38:18 - INFO - __main__ - Step 24256: {'lr': 0.00047261897263643196, 'samples': 4657152, 'steps': 24255, 'loss/train': 2.08562970161438} 11/07/2021 00:38:19 - INFO - __main__ - Step 24257: {'lr': 0.0004726165578561381, 'samples': 4657344, 'steps': 24256, 'loss/train': 1.4003256559371948} 11/07/2021 00:38:20 - INFO - __main__ - Step 24258: {'lr': 0.0004726141429755367, 'samples': 4657536, 'steps': 24257, 'loss/train': 0.9711540937423706} 11/07/2021 00:38:20 - INFO - __main__ - Step 24259: {'lr': 0.0004726117279946288, 'samples': 4657728, 'steps': 24258, 'loss/train': 1.806656837463379} 11/07/2021 00:38:20 - INFO - __main__ - Step 24260: {'lr': 0.0004726093129134155, 'samples': 4657920, 'steps': 24259, 'loss/train': 1.3886107206344604} 11/07/2021 00:38:21 - INFO - __main__ - Step 24261: {'lr': 0.0004726068977318978, 'samples': 4658112, 'steps': 24260, 'loss/train': 1.43341064453125} 11/07/2021 00:38:22 - INFO - __main__ - Step 24262: {'lr': 0.0004726044824500769, 'samples': 4658304, 'steps': 24261, 'loss/train': 1.469969391822815} 11/07/2021 00:38:22 - INFO - __main__ - Step 24263: {'lr': 0.0004726020670679538, 'samples': 4658496, 'steps': 24262, 'loss/train': 1.5351600646972656} 11/07/2021 00:38:22 - INFO - __main__ - Step 24264: {'lr': 0.00047259965158552976, 'samples': 4658688, 'steps': 24263, 'loss/train': 1.5944650173187256} 11/07/2021 00:38:23 - INFO - __main__ - Step 24265: {'lr': 0.00047259723600280573, 'samples': 4658880, 'steps': 24264, 'loss/train': 1.7254376411437988} 11/07/2021 00:38:23 - INFO - __main__ - Step 24266: {'lr': 0.0004725948203197828, 'samples': 4659072, 'steps': 24265, 'loss/train': 1.1550707817077637} 11/07/2021 00:38:24 - INFO - __main__ - Step 24267: {'lr': 0.0004725924045364621, 'samples': 4659264, 'steps': 24266, 'loss/train': 1.764291524887085} 11/07/2021 00:38:24 - INFO - __main__ - Step 24268: {'lr': 0.00047258998865284463, 'samples': 4659456, 'steps': 24267, 'loss/train': 1.3322783708572388} 11/07/2021 00:38:25 - INFO - __main__ - Step 24269: {'lr': 0.0004725875726689316, 'samples': 4659648, 'steps': 24268, 'loss/train': 1.2680037021636963} 11/07/2021 00:38:25 - INFO - __main__ - Step 24270: {'lr': 0.000472585156584724, 'samples': 4659840, 'steps': 24269, 'loss/train': 1.5329806804656982} 11/07/2021 00:38:26 - INFO - __main__ - Step 24271: {'lr': 0.00047258274040022305, 'samples': 4660032, 'steps': 24270, 'loss/train': 1.2585711479187012} 11/07/2021 00:38:28 - INFO - __main__ - Step 24272: {'lr': 0.0004725803241154297, 'samples': 4660224, 'steps': 24271, 'loss/train': 1.7784254550933838} 11/07/2021 00:38:28 - INFO - __main__ - Step 24273: {'lr': 0.0004725779077303451, 'samples': 4660416, 'steps': 24272, 'loss/train': 1.6816685199737549} 11/07/2021 00:38:28 - INFO - __main__ - Step 24274: {'lr': 0.0004725754912449703, 'samples': 4660608, 'steps': 24273, 'loss/train': 1.7606521844863892} 11/07/2021 00:38:29 - INFO - __main__ - Step 24275: {'lr': 0.0004725730746593064, 'samples': 4660800, 'steps': 24274, 'loss/train': 1.7844398021697998} 11/07/2021 00:38:29 - INFO - __main__ - Step 24276: {'lr': 0.0004725706579733546, 'samples': 4660992, 'steps': 24275, 'loss/train': 1.0492531061172485} 11/07/2021 00:38:29 - INFO - __main__ - Step 24277: {'lr': 0.00047256824118711583, 'samples': 4661184, 'steps': 24276, 'loss/train': 1.2214933633804321} 11/07/2021 00:38:30 - INFO - __main__ - Step 24278: {'lr': 0.00047256582430059126, 'samples': 4661376, 'steps': 24277, 'loss/train': 1.601622462272644} 11/07/2021 00:38:31 - INFO - __main__ - Step 24279: {'lr': 0.00047256340731378194, 'samples': 4661568, 'steps': 24278, 'loss/train': 1.6877448558807373} 11/07/2021 00:38:31 - INFO - __main__ - Step 24280: {'lr': 0.00047256099022668896, 'samples': 4661760, 'steps': 24279, 'loss/train': 1.3454322814941406} 11/07/2021 00:38:32 - INFO - __main__ - Step 24281: {'lr': 0.00047255857303931347, 'samples': 4661952, 'steps': 24280, 'loss/train': 1.8590461015701294} 11/07/2021 00:38:32 - INFO - __main__ - Step 24282: {'lr': 0.00047255615575165653, 'samples': 4662144, 'steps': 24281, 'loss/train': 1.8316879272460938} 11/07/2021 00:38:32 - INFO - __main__ - Step 24283: {'lr': 0.0004725537383637193, 'samples': 4662336, 'steps': 24282, 'loss/train': 1.5933130979537964} 11/07/2021 00:38:33 - INFO - __main__ - Step 24284: {'lr': 0.0004725513208755027, 'samples': 4662528, 'steps': 24283, 'loss/train': 1.4665334224700928} 11/07/2021 00:38:34 - INFO - __main__ - Step 24285: {'lr': 0.0004725489032870079, 'samples': 4662720, 'steps': 24284, 'loss/train': 1.6883810758590698} 11/07/2021 00:38:34 - INFO - __main__ - Step 24286: {'lr': 0.000472546485598236, 'samples': 4662912, 'steps': 24285, 'loss/train': 1.6891326904296875} 11/07/2021 00:38:34 - INFO - __main__ - Step 24287: {'lr': 0.0004725440678091881, 'samples': 4663104, 'steps': 24286, 'loss/train': 1.5643234252929688} 11/07/2021 00:38:35 - INFO - __main__ - Step 24288: {'lr': 0.00047254164991986525, 'samples': 4663296, 'steps': 24287, 'loss/train': 1.4116101264953613} 11/07/2021 00:38:36 - INFO - __main__ - Step 24289: {'lr': 0.0004725392319302686, 'samples': 4663488, 'steps': 24288, 'loss/train': 1.825372576713562} 11/07/2021 00:38:36 - INFO - __main__ - Step 24290: {'lr': 0.0004725368138403992, 'samples': 4663680, 'steps': 24289, 'loss/train': 1.5913466215133667} 11/07/2021 00:38:36 - INFO - __main__ - Step 24291: {'lr': 0.00047253439565025815, 'samples': 4663872, 'steps': 24290, 'loss/train': 1.3589271306991577} 11/07/2021 00:38:37 - INFO - __main__ - Step 24292: {'lr': 0.00047253197735984653, 'samples': 4664064, 'steps': 24291, 'loss/train': 1.9609973430633545} 11/07/2021 00:38:37 - INFO - __main__ - Step 24293: {'lr': 0.00047252955896916546, 'samples': 4664256, 'steps': 24292, 'loss/train': 1.183845043182373} 11/07/2021 00:38:38 - INFO - __main__ - Step 24294: {'lr': 0.000472527140478216, 'samples': 4664448, 'steps': 24293, 'loss/train': 2.545880079269409} 11/07/2021 00:38:38 - INFO - __main__ - Step 24295: {'lr': 0.00047252472188699917, 'samples': 4664640, 'steps': 24294, 'loss/train': 1.872978925704956} 11/07/2021 00:38:39 - INFO - __main__ - Step 24296: {'lr': 0.0004725223031955162, 'samples': 4664832, 'steps': 24295, 'loss/train': 1.5187020301818848} 11/07/2021 00:38:39 - INFO - __main__ - Step 24297: {'lr': 0.0004725198844037681, 'samples': 4665024, 'steps': 24296, 'loss/train': 1.612744927406311} 11/07/2021 00:38:39 - INFO - __main__ - Step 24298: {'lr': 0.00047251746551175603, 'samples': 4665216, 'steps': 24297, 'loss/train': 1.7143771648406982} 11/07/2021 00:38:40 - INFO - __main__ - Step 24299: {'lr': 0.000472515046519481, 'samples': 4665408, 'steps': 24298, 'loss/train': 1.7912428379058838} 11/07/2021 00:38:41 - INFO - __main__ - Step 24300: {'lr': 0.000472512627426944, 'samples': 4665600, 'steps': 24299, 'loss/train': 1.251631736755371} 11/07/2021 00:38:41 - INFO - __main__ - Step 24301: {'lr': 0.0004725102082341464, 'samples': 4665792, 'steps': 24300, 'loss/train': 1.328009843826294} 11/07/2021 00:38:42 - INFO - __main__ - Step 24302: {'lr': 0.00047250778894108905, 'samples': 4665984, 'steps': 24301, 'loss/train': 0.9645561575889587} 11/07/2021 00:38:42 - INFO - __main__ - Step 24303: {'lr': 0.0004725053695477731, 'samples': 4666176, 'steps': 24302, 'loss/train': 1.8826731443405151} 11/07/2021 00:38:42 - INFO - __main__ - Step 24304: {'lr': 0.0004725029500541997, 'samples': 4666368, 'steps': 24303, 'loss/train': 1.9238576889038086} 11/07/2021 00:38:43 - INFO - __main__ - Step 24305: {'lr': 0.00047250053046036996, 'samples': 4666560, 'steps': 24304, 'loss/train': 1.5398913621902466} 11/07/2021 00:38:44 - INFO - __main__ - Step 24306: {'lr': 0.00047249811076628483, 'samples': 4666752, 'steps': 24305, 'loss/train': 2.0788919925689697} 11/07/2021 00:38:44 - INFO - __main__ - Step 24307: {'lr': 0.00047249569097194554, 'samples': 4666944, 'steps': 24306, 'loss/train': 1.6412384510040283} 11/07/2021 00:38:44 - INFO - __main__ - Step 24308: {'lr': 0.0004724932710773531, 'samples': 4667136, 'steps': 24307, 'loss/train': 1.1404309272766113} 11/07/2021 00:38:45 - INFO - __main__ - Step 24309: {'lr': 0.00047249085108250867, 'samples': 4667328, 'steps': 24308, 'loss/train': 1.5579737424850464} 11/07/2021 00:38:46 - INFO - __main__ - Step 24310: {'lr': 0.0004724884309874132, 'samples': 4667520, 'steps': 24309, 'loss/train': 1.768725872039795} 11/07/2021 00:38:46 - INFO - __main__ - Step 24311: {'lr': 0.00047248601079206797, 'samples': 4667712, 'steps': 24310, 'loss/train': 1.9995254278182983} 11/07/2021 00:38:47 - INFO - __main__ - Step 24312: {'lr': 0.0004724835904964739, 'samples': 4667904, 'steps': 24311, 'loss/train': 1.5800747871398926} 11/07/2021 00:38:47 - INFO - __main__ - Step 24313: {'lr': 0.0004724811701006322, 'samples': 4668096, 'steps': 24312, 'loss/train': 0.16031067073345184} 11/07/2021 00:38:48 - INFO - __main__ - Step 24314: {'lr': 0.00047247874960454394, 'samples': 4668288, 'steps': 24313, 'loss/train': 1.0861700773239136} 11/07/2021 00:38:48 - INFO - __main__ - Step 24315: {'lr': 0.0004724763290082102, 'samples': 4668480, 'steps': 24314, 'loss/train': 0.15731477737426758} 11/07/2021 00:38:49 - INFO - __main__ - Step 24316: {'lr': 0.000472473908311632, 'samples': 4668672, 'steps': 24315, 'loss/train': 0.9294226169586182} 11/07/2021 00:38:49 - INFO - __main__ - Step 24317: {'lr': 0.0004724714875148105, 'samples': 4668864, 'steps': 24316, 'loss/train': 1.470827579498291} 11/07/2021 00:38:50 - INFO - __main__ - Step 24318: {'lr': 0.0004724690666177468, 'samples': 4669056, 'steps': 24317, 'loss/train': 1.5593488216400146} 11/07/2021 00:38:50 - INFO - __main__ - Step 24319: {'lr': 0.00047246664562044193, 'samples': 4669248, 'steps': 24318, 'loss/train': 1.581626296043396} 11/07/2021 00:38:51 - INFO - __main__ - Step 24320: {'lr': 0.0004724642245228971, 'samples': 4669440, 'steps': 24319, 'loss/train': 1.706702709197998} 11/07/2021 00:38:51 - INFO - __main__ - Step 24321: {'lr': 0.0004724618033251133, 'samples': 4669632, 'steps': 24320, 'loss/train': 1.5025871992111206} 11/07/2021 00:38:52 - INFO - __main__ - Step 24322: {'lr': 0.0004724593820270916, 'samples': 4669824, 'steps': 24321, 'loss/train': 1.7909736633300781} 11/07/2021 00:38:52 - INFO - __main__ - Step 24323: {'lr': 0.00047245696062883316, 'samples': 4670016, 'steps': 24322, 'loss/train': 2.027930736541748} 11/07/2021 00:38:52 - INFO - __main__ - Step 24324: {'lr': 0.0004724545391303391, 'samples': 4670208, 'steps': 24323, 'loss/train': 1.6163299083709717} 11/07/2021 00:38:53 - INFO - __main__ - Step 24325: {'lr': 0.0004724521175316103, 'samples': 4670400, 'steps': 24324, 'loss/train': 1.5050382614135742} 11/07/2021 00:38:54 - INFO - __main__ - Step 24326: {'lr': 0.0004724496958326482, 'samples': 4670592, 'steps': 24325, 'loss/train': 0.8827059268951416} 11/07/2021 00:38:54 - INFO - __main__ - Step 24327: {'lr': 0.00047244727403345356, 'samples': 4670784, 'steps': 24326, 'loss/train': 1.293052315711975} 11/07/2021 00:38:55 - INFO - __main__ - Step 24328: {'lr': 0.00047244485213402765, 'samples': 4670976, 'steps': 24327, 'loss/train': 1.496334195137024} 11/07/2021 00:38:55 - INFO - __main__ - Step 24329: {'lr': 0.0004724424301343716, 'samples': 4671168, 'steps': 24328, 'loss/train': 1.7034502029418945} 11/07/2021 00:38:56 - INFO - __main__ - Step 24330: {'lr': 0.00047244000803448635, 'samples': 4671360, 'steps': 24329, 'loss/train': 1.240768551826477} 11/07/2021 00:38:56 - INFO - __main__ - Step 24331: {'lr': 0.000472437585834373, 'samples': 4671552, 'steps': 24330, 'loss/train': 1.2295843362808228} 11/07/2021 00:38:57 - INFO - __main__ - Step 24332: {'lr': 0.00047243516353403283, 'samples': 4671744, 'steps': 24331, 'loss/train': 1.6633962392807007} 11/07/2021 00:38:57 - INFO - __main__ - Step 24333: {'lr': 0.0004724327411334668, 'samples': 4671936, 'steps': 24332, 'loss/train': 0.921309232711792} 11/07/2021 00:38:57 - INFO - __main__ - Step 24334: {'lr': 0.00047243031863267594, 'samples': 4672128, 'steps': 24333, 'loss/train': 1.7888855934143066} 11/07/2021 00:38:58 - INFO - __main__ - Step 24335: {'lr': 0.0004724278960316615, 'samples': 4672320, 'steps': 24334, 'loss/train': 0.4477072060108185} 11/07/2021 00:38:59 - INFO - __main__ - Step 24336: {'lr': 0.00047242547333042434, 'samples': 4672512, 'steps': 24335, 'loss/train': 1.5591822862625122} 11/07/2021 00:38:59 - INFO - __main__ - Step 24337: {'lr': 0.0004724230505289658, 'samples': 4672704, 'steps': 24336, 'loss/train': 2.1904563903808594} 11/07/2021 00:38:59 - INFO - __main__ - Step 24338: {'lr': 0.0004724206276272868, 'samples': 4672896, 'steps': 24337, 'loss/train': 1.5123300552368164} 11/07/2021 00:39:00 - INFO - __main__ - Step 24339: {'lr': 0.0004724182046253885, 'samples': 4673088, 'steps': 24338, 'loss/train': 1.6610440015792847} 11/07/2021 00:39:01 - INFO - __main__ - Step 24340: {'lr': 0.0004724157815232721, 'samples': 4673280, 'steps': 24339, 'loss/train': 1.870625615119934} 11/07/2021 00:39:01 - INFO - __main__ - Step 24341: {'lr': 0.00047241335832093844, 'samples': 4673472, 'steps': 24340, 'loss/train': 1.2395048141479492} 11/07/2021 00:39:01 - INFO - __main__ - Step 24342: {'lr': 0.00047241093501838887, 'samples': 4673664, 'steps': 24341, 'loss/train': 1.3462673425674438} 11/07/2021 00:39:02 - INFO - __main__ - Step 24343: {'lr': 0.00047240851161562433, 'samples': 4673856, 'steps': 24342, 'loss/train': 1.980881929397583} 11/07/2021 00:39:02 - INFO - __main__ - Step 24344: {'lr': 0.00047240608811264595, 'samples': 4674048, 'steps': 24343, 'loss/train': 1.5272406339645386} 11/07/2021 00:39:03 - INFO - __main__ - Step 24345: {'lr': 0.0004724036645094548, 'samples': 4674240, 'steps': 24344, 'loss/train': 1.0640920400619507} 11/07/2021 00:39:03 - INFO - __main__ - Step 24346: {'lr': 0.00047240124080605197, 'samples': 4674432, 'steps': 24345, 'loss/train': 1.4234856367111206} 11/07/2021 00:39:04 - INFO - __main__ - Step 24347: {'lr': 0.0004723988170024386, 'samples': 4674624, 'steps': 24346, 'loss/train': 1.3413307666778564} 11/07/2021 00:39:04 - INFO - __main__ - Step 24348: {'lr': 0.0004723963930986157, 'samples': 4674816, 'steps': 24347, 'loss/train': 1.1005454063415527} 11/07/2021 00:39:05 - INFO - __main__ - Step 24349: {'lr': 0.0004723939690945845, 'samples': 4675008, 'steps': 24348, 'loss/train': 1.8610029220581055} 11/07/2021 00:39:05 - INFO - __main__ - Step 24350: {'lr': 0.000472391544990346, 'samples': 4675200, 'steps': 24349, 'loss/train': 1.0002992153167725} 11/07/2021 00:39:06 - INFO - __main__ - Step 24351: {'lr': 0.0004723891207859012, 'samples': 4675392, 'steps': 24350, 'loss/train': 1.4413059949874878} 11/07/2021 00:39:06 - INFO - __main__ - Step 24352: {'lr': 0.00047238669648125146, 'samples': 4675584, 'steps': 24351, 'loss/train': 1.5541919469833374} 11/07/2021 00:39:07 - INFO - __main__ - Step 24353: {'lr': 0.00047238427207639755, 'samples': 4675776, 'steps': 24352, 'loss/train': 1.7751007080078125} 11/07/2021 00:39:07 - INFO - __main__ - Step 24354: {'lr': 0.0004723818475713408, 'samples': 4675968, 'steps': 24353, 'loss/train': 1.778316617012024} 11/07/2021 00:39:07 - INFO - __main__ - Step 24355: {'lr': 0.00047237942296608223, 'samples': 4676160, 'steps': 24354, 'loss/train': 1.353247046470642} 11/07/2021 00:39:08 - INFO - __main__ - Step 24356: {'lr': 0.00047237699826062286, 'samples': 4676352, 'steps': 24355, 'loss/train': 1.6153926849365234} 11/07/2021 00:39:09 - INFO - __main__ - Step 24357: {'lr': 0.0004723745734549639, 'samples': 4676544, 'steps': 24356, 'loss/train': 1.786283016204834} 11/07/2021 00:39:09 - INFO - __main__ - Step 24358: {'lr': 0.0004723721485491064, 'samples': 4676736, 'steps': 24357, 'loss/train': 1.4685715436935425} 11/07/2021 00:39:09 - INFO - __main__ - Step 24359: {'lr': 0.0004723697235430514, 'samples': 4676928, 'steps': 24358, 'loss/train': 1.738546371459961} 11/07/2021 00:39:10 - INFO - __main__ - Step 24360: {'lr': 0.0004723672984368, 'samples': 4677120, 'steps': 24359, 'loss/train': 1.305877447128296} 11/07/2021 00:39:11 - INFO - __main__ - Step 24361: {'lr': 0.00047236487323035344, 'samples': 4677312, 'steps': 24360, 'loss/train': 1.785787582397461} 11/07/2021 00:39:11 - INFO - __main__ - Step 24362: {'lr': 0.00047236244792371265, 'samples': 4677504, 'steps': 24361, 'loss/train': 1.4389066696166992} 11/07/2021 00:39:12 - INFO - __main__ - Step 24363: {'lr': 0.0004723600225168787, 'samples': 4677696, 'steps': 24362, 'loss/train': 1.5341063737869263} 11/07/2021 00:39:12 - INFO - __main__ - Step 24364: {'lr': 0.0004723575970098528, 'samples': 4677888, 'steps': 24363, 'loss/train': 1.7367303371429443} 11/07/2021 00:39:12 - INFO - __main__ - Step 24365: {'lr': 0.00047235517140263605, 'samples': 4678080, 'steps': 24364, 'loss/train': 1.672031283378601} 11/07/2021 00:39:13 - INFO - __main__ - Step 24366: {'lr': 0.00047235274569522946, 'samples': 4678272, 'steps': 24365, 'loss/train': 1.890123724937439} 11/07/2021 00:39:14 - INFO - __main__ - Step 24367: {'lr': 0.0004723503198876341, 'samples': 4678464, 'steps': 24366, 'loss/train': 1.6104552745819092} 11/07/2021 00:39:14 - INFO - __main__ - Step 24368: {'lr': 0.0004723478939798512, 'samples': 4678656, 'steps': 24367, 'loss/train': 0.4229798913002014} 11/07/2021 00:39:14 - INFO - __main__ - Step 24369: {'lr': 0.0004723454679718817, 'samples': 4678848, 'steps': 24368, 'loss/train': 2.6394357681274414} 11/07/2021 00:39:15 - INFO - __main__ - Step 24370: {'lr': 0.00047234304186372685, 'samples': 4679040, 'steps': 24369, 'loss/train': 1.6752501726150513} 11/07/2021 00:39:15 - INFO - __main__ - Step 24371: {'lr': 0.00047234061565538753, 'samples': 4679232, 'steps': 24370, 'loss/train': 1.5303529500961304} 11/07/2021 00:39:16 - INFO - __main__ - Step 24372: {'lr': 0.0004723381893468651, 'samples': 4679424, 'steps': 24371, 'loss/train': 1.88096022605896} 11/07/2021 00:39:16 - INFO - __main__ - Step 24373: {'lr': 0.00047233576293816045, 'samples': 4679616, 'steps': 24372, 'loss/train': 1.6431019306182861} 11/07/2021 00:39:17 - INFO - __main__ - Step 24374: {'lr': 0.00047233333642927465, 'samples': 4679808, 'steps': 24373, 'loss/train': 0.9329357743263245} 11/07/2021 00:39:17 - INFO - __main__ - Step 24375: {'lr': 0.000472330909820209, 'samples': 4680000, 'steps': 24374, 'loss/train': 1.458146095275879} 11/07/2021 00:39:18 - INFO - __main__ - Step 24376: {'lr': 0.0004723284831109644, 'samples': 4680192, 'steps': 24375, 'loss/train': 1.1037852764129639} 11/07/2021 00:39:19 - INFO - __main__ - Step 24377: {'lr': 0.0004723260563015421, 'samples': 4680384, 'steps': 24376, 'loss/train': 1.6638442277908325} 11/07/2021 00:39:19 - INFO - __main__ - Step 24378: {'lr': 0.00047232362939194305, 'samples': 4680576, 'steps': 24377, 'loss/train': 0.7501077055931091} 11/07/2021 00:39:19 - INFO - __main__ - Step 24379: {'lr': 0.0004723212023821684, 'samples': 4680768, 'steps': 24378, 'loss/train': 1.5712947845458984} 11/07/2021 00:39:20 - INFO - __main__ - Step 24380: {'lr': 0.0004723187752722193, 'samples': 4680960, 'steps': 24379, 'loss/train': 1.0134202241897583} 11/07/2021 00:39:20 - INFO - __main__ - Step 24381: {'lr': 0.00047231634806209675, 'samples': 4681152, 'steps': 24380, 'loss/train': 1.6158535480499268} 11/07/2021 00:39:21 - INFO - __main__ - Step 24382: {'lr': 0.0004723139207518019, 'samples': 4681344, 'steps': 24381, 'loss/train': 1.6911224126815796} 11/07/2021 00:39:21 - INFO - __main__ - Step 24383: {'lr': 0.00047231149334133577, 'samples': 4681536, 'steps': 24382, 'loss/train': 1.647817850112915} 11/07/2021 00:39:22 - INFO - __main__ - Step 24384: {'lr': 0.00047230906583069953, 'samples': 4681728, 'steps': 24383, 'loss/train': 1.6826653480529785} 11/07/2021 00:39:22 - INFO - __main__ - Step 24385: {'lr': 0.0004723066382198943, 'samples': 4681920, 'steps': 24384, 'loss/train': 1.359289288520813} 11/07/2021 00:39:22 - INFO - __main__ - Step 24386: {'lr': 0.00047230421050892116, 'samples': 4682112, 'steps': 24385, 'loss/train': 1.6030157804489136} 11/07/2021 00:39:23 - INFO - __main__ - Step 24387: {'lr': 0.00047230178269778105, 'samples': 4682304, 'steps': 24386, 'loss/train': 1.5881558656692505} 11/07/2021 00:39:24 - INFO - __main__ - Step 24388: {'lr': 0.00047229935478647524, 'samples': 4682496, 'steps': 24387, 'loss/train': 1.7504736185073853} 11/07/2021 00:39:24 - INFO - __main__ - Step 24389: {'lr': 0.0004722969267750048, 'samples': 4682688, 'steps': 24388, 'loss/train': 1.652161717414856} 11/07/2021 00:39:24 - INFO - __main__ - Step 24390: {'lr': 0.0004722944986633708, 'samples': 4682880, 'steps': 24389, 'loss/train': 5.803641319274902} 11/07/2021 00:39:25 - INFO - __main__ - Step 24391: {'lr': 0.0004722920704515743, 'samples': 4683072, 'steps': 24390, 'loss/train': 1.418088674545288} 11/07/2021 00:39:26 - INFO - __main__ - Step 24392: {'lr': 0.00047228964213961647, 'samples': 4683264, 'steps': 24391, 'loss/train': 1.6268666982650757} 11/07/2021 00:39:26 - INFO - __main__ - Step 24393: {'lr': 0.00047228721372749826, 'samples': 4683456, 'steps': 24392, 'loss/train': 1.702316164970398} 11/07/2021 00:39:27 - INFO - __main__ - Step 24394: {'lr': 0.000472284785215221, 'samples': 4683648, 'steps': 24393, 'loss/train': 1.6218605041503906} 11/07/2021 00:39:27 - INFO - __main__ - Step 24395: {'lr': 0.0004722823566027855, 'samples': 4683840, 'steps': 24394, 'loss/train': 1.5702743530273438} 11/07/2021 00:39:27 - INFO - __main__ - Step 24396: {'lr': 0.00047227992789019316, 'samples': 4684032, 'steps': 24395, 'loss/train': 1.2061114311218262} 11/07/2021 00:39:28 - INFO - __main__ - Step 24397: {'lr': 0.0004722774990774448, 'samples': 4684224, 'steps': 24396, 'loss/train': 1.7952961921691895} 11/07/2021 00:39:29 - INFO - __main__ - Step 24398: {'lr': 0.00047227507016454163, 'samples': 4684416, 'steps': 24397, 'loss/train': 1.4876207113265991} 11/07/2021 00:39:29 - INFO - __main__ - Step 24399: {'lr': 0.00047227264115148475, 'samples': 4684608, 'steps': 24398, 'loss/train': 1.480579137802124} 11/07/2021 00:39:29 - INFO - __main__ - Step 24400: {'lr': 0.00047227021203827523, 'samples': 4684800, 'steps': 24399, 'loss/train': 1.2277770042419434} 11/07/2021 00:39:30 - INFO - __main__ - Step 24401: {'lr': 0.0004722677828249142, 'samples': 4684992, 'steps': 24400, 'loss/train': 1.7811293601989746} 11/07/2021 00:39:31 - INFO - __main__ - Step 24402: {'lr': 0.0004722653535114028, 'samples': 4685184, 'steps': 24401, 'loss/train': 1.2663511037826538} 11/07/2021 00:39:31 - INFO - __main__ - Step 24403: {'lr': 0.00047226292409774205, 'samples': 4685376, 'steps': 24402, 'loss/train': 1.9359652996063232} 11/07/2021 00:39:31 - INFO - __main__ - Step 24404: {'lr': 0.00047226049458393306, 'samples': 4685568, 'steps': 24403, 'loss/train': 1.5348832607269287} 11/07/2021 00:39:32 - INFO - __main__ - Step 24405: {'lr': 0.0004722580649699768, 'samples': 4685760, 'steps': 24404, 'loss/train': 1.8864802122116089} 11/07/2021 00:39:32 - INFO - __main__ - Step 24406: {'lr': 0.00047225563525587463, 'samples': 4685952, 'steps': 24405, 'loss/train': 1.5111305713653564} 11/07/2021 00:39:32 - INFO - __main__ - Step 24407: {'lr': 0.0004722532054416274, 'samples': 4686144, 'steps': 24406, 'loss/train': 1.830599308013916} 11/07/2021 00:39:33 - INFO - __main__ - Step 24408: {'lr': 0.0004722507755272364, 'samples': 4686336, 'steps': 24407, 'loss/train': 2.113673210144043} 11/07/2021 00:39:34 - INFO - __main__ - Step 24409: {'lr': 0.0004722483455127026, 'samples': 4686528, 'steps': 24408, 'loss/train': 2.154162645339966} 11/07/2021 00:39:34 - INFO - __main__ - Step 24410: {'lr': 0.000472245915398027, 'samples': 4686720, 'steps': 24409, 'loss/train': 1.6176613569259644} 11/07/2021 00:39:35 - INFO - __main__ - Step 24411: {'lr': 0.0004722434851832109, 'samples': 4686912, 'steps': 24410, 'loss/train': 1.844570279121399} 11/07/2021 00:39:35 - INFO - __main__ - Step 24412: {'lr': 0.00047224105486825543, 'samples': 4687104, 'steps': 24411, 'loss/train': 1.7264853715896606} 11/07/2021 00:39:36 - INFO - __main__ - Step 24413: {'lr': 0.0004722386244531615, 'samples': 4687296, 'steps': 24412, 'loss/train': 1.0778053998947144} 11/07/2021 00:39:37 - INFO - __main__ - Step 24414: {'lr': 0.0004722361939379302, 'samples': 4687488, 'steps': 24413, 'loss/train': 1.4467507600784302} 11/07/2021 00:39:37 - INFO - __main__ - Step 24415: {'lr': 0.0004722337633225627, 'samples': 4687680, 'steps': 24414, 'loss/train': 0.16399438679218292} 11/07/2021 00:39:37 - INFO - __main__ - Step 24416: {'lr': 0.0004722313326070602, 'samples': 4687872, 'steps': 24415, 'loss/train': 1.982115626335144} 11/07/2021 00:39:38 - INFO - __main__ - Step 24417: {'lr': 0.00047222890179142365, 'samples': 4688064, 'steps': 24416, 'loss/train': 1.489362359046936} 11/07/2021 00:39:39 - INFO - __main__ - Step 24418: {'lr': 0.00047222647087565413, 'samples': 4688256, 'steps': 24417, 'loss/train': 1.4437440633773804} 11/07/2021 00:39:39 - INFO - __main__ - Step 24419: {'lr': 0.0004722240398597528, 'samples': 4688448, 'steps': 24418, 'loss/train': 1.2767196893692017} 11/07/2021 00:39:39 - INFO - __main__ - Step 24420: {'lr': 0.0004722216087437208, 'samples': 4688640, 'steps': 24419, 'loss/train': 1.2910867929458618} 11/07/2021 00:39:40 - INFO - __main__ - Step 24421: {'lr': 0.0004722191775275592, 'samples': 4688832, 'steps': 24420, 'loss/train': 1.8720062971115112} 11/07/2021 00:39:40 - INFO - __main__ - Step 24422: {'lr': 0.00047221674621126896, 'samples': 4689024, 'steps': 24421, 'loss/train': 1.386762022972107} 11/07/2021 00:39:41 - INFO - __main__ - Step 24423: {'lr': 0.0004722143147948513, 'samples': 4689216, 'steps': 24422, 'loss/train': 0.547491192817688} 11/07/2021 00:39:42 - INFO - __main__ - Step 24424: {'lr': 0.0004722118832783074, 'samples': 4689408, 'steps': 24423, 'loss/train': 1.6926751136779785} 11/07/2021 00:39:42 - INFO - __main__ - Step 24425: {'lr': 0.0004722094516616382, 'samples': 4689600, 'steps': 24424, 'loss/train': 1.9092614650726318} 11/07/2021 00:39:42 - INFO - __main__ - Step 24426: {'lr': 0.0004722070199448448, 'samples': 4689792, 'steps': 24425, 'loss/train': 1.6068289279937744} 11/07/2021 00:39:43 - INFO - __main__ - Step 24427: {'lr': 0.00047220458812792846, 'samples': 4689984, 'steps': 24426, 'loss/train': 0.16781136393547058} 11/07/2021 00:39:44 - INFO - __main__ - Step 24428: {'lr': 0.00047220215621089005, 'samples': 4690176, 'steps': 24427, 'loss/train': 0.6731496453285217} 11/07/2021 00:39:44 - INFO - __main__ - Step 24429: {'lr': 0.00047219972419373083, 'samples': 4690368, 'steps': 24428, 'loss/train': 1.3694257736206055} 11/07/2021 00:39:45 - INFO - __main__ - Step 24430: {'lr': 0.00047219729207645183, 'samples': 4690560, 'steps': 24429, 'loss/train': 0.8720505833625793} 11/07/2021 00:39:45 - INFO - __main__ - Step 24431: {'lr': 0.0004721948598590542, 'samples': 4690752, 'steps': 24430, 'loss/train': 1.5635464191436768} 11/07/2021 00:39:46 - INFO - __main__ - Step 24432: {'lr': 0.0004721924275415389, 'samples': 4690944, 'steps': 24431, 'loss/train': 0.15982936322689056} 11/07/2021 00:39:46 - INFO - __main__ - Step 24433: {'lr': 0.0004721899951239072, 'samples': 4691136, 'steps': 24432, 'loss/train': 1.6448273658752441} 11/07/2021 00:39:47 - INFO - __main__ - Step 24434: {'lr': 0.0004721875626061601, 'samples': 4691328, 'steps': 24433, 'loss/train': 1.455288290977478} 11/07/2021 00:39:47 - INFO - __main__ - Step 24435: {'lr': 0.00047218512998829874, 'samples': 4691520, 'steps': 24434, 'loss/train': 0.9792460799217224} 11/07/2021 00:39:48 - INFO - __main__ - Step 24436: {'lr': 0.00047218269727032413, 'samples': 4691712, 'steps': 24435, 'loss/train': 1.4665985107421875} 11/07/2021 00:39:48 - INFO - __main__ - Step 24437: {'lr': 0.00047218026445223745, 'samples': 4691904, 'steps': 24436, 'loss/train': 1.6033726930618286} 11/07/2021 00:39:48 - INFO - __main__ - Step 24438: {'lr': 0.0004721778315340398, 'samples': 4692096, 'steps': 24437, 'loss/train': 1.4006061553955078} 11/07/2021 00:39:49 - INFO - __main__ - Step 24439: {'lr': 0.0004721753985157322, 'samples': 4692288, 'steps': 24438, 'loss/train': 1.8944119215011597} 11/07/2021 00:39:50 - INFO - __main__ - Step 24440: {'lr': 0.0004721729653973158, 'samples': 4692480, 'steps': 24439, 'loss/train': 1.4278517961502075} 11/07/2021 00:39:50 - INFO - __main__ - Step 24441: {'lr': 0.0004721705321787917, 'samples': 4692672, 'steps': 24440, 'loss/train': 1.145422101020813} 11/07/2021 00:39:50 - INFO - __main__ - Step 24442: {'lr': 0.00047216809886016097, 'samples': 4692864, 'steps': 24441, 'loss/train': 1.344766616821289} 11/07/2021 00:39:51 - INFO - __main__ - Step 24443: {'lr': 0.0004721656654414248, 'samples': 4693056, 'steps': 24442, 'loss/train': 2.641737461090088} 11/07/2021 00:39:52 - INFO - __main__ - Step 24444: {'lr': 0.00047216323192258416, 'samples': 4693248, 'steps': 24443, 'loss/train': 1.642939567565918} 11/07/2021 00:39:52 - INFO - __main__ - Step 24445: {'lr': 0.0004721607983036401, 'samples': 4693440, 'steps': 24444, 'loss/train': 1.9018218517303467} 11/07/2021 00:39:53 - INFO - __main__ - Step 24446: {'lr': 0.00047215836458459393, 'samples': 4693632, 'steps': 24445, 'loss/train': 1.1468150615692139} 11/07/2021 00:39:53 - INFO - __main__ - Step 24447: {'lr': 0.00047215593076544663, 'samples': 4693824, 'steps': 24446, 'loss/train': 1.8059026002883911} 11/07/2021 00:39:53 - INFO - __main__ - Step 24448: {'lr': 0.0004721534968461992, 'samples': 4694016, 'steps': 24447, 'loss/train': 1.7327210903167725} 11/07/2021 00:39:54 - INFO - __main__ - Step 24449: {'lr': 0.00047215106282685296, 'samples': 4694208, 'steps': 24448, 'loss/train': 1.154772162437439} 11/07/2021 00:39:55 - INFO - __main__ - Step 24450: {'lr': 0.0004721486287074088, 'samples': 4694400, 'steps': 24449, 'loss/train': 1.3429368734359741} 11/07/2021 00:39:55 - INFO - __main__ - Step 24451: {'lr': 0.0004721461944878679, 'samples': 4694592, 'steps': 24450, 'loss/train': 1.561524510383606} 11/07/2021 00:39:55 - INFO - __main__ - Step 24452: {'lr': 0.00047214376016823143, 'samples': 4694784, 'steps': 24451, 'loss/train': 1.277727484703064} 11/07/2021 00:39:56 - INFO - __main__ - Step 24453: {'lr': 0.0004721413257485003, 'samples': 4694976, 'steps': 24452, 'loss/train': 1.6690208911895752} 11/07/2021 00:39:57 - INFO - __main__ - Step 24454: {'lr': 0.0004721388912286758, 'samples': 4695168, 'steps': 24453, 'loss/train': 1.5438530445098877} 11/07/2021 00:39:57 - INFO - __main__ - Step 24455: {'lr': 0.0004721364566087589, 'samples': 4695360, 'steps': 24454, 'loss/train': 1.6763454675674438} 11/07/2021 00:39:57 - INFO - __main__ - Step 24456: {'lr': 0.00047213402188875077, 'samples': 4695552, 'steps': 24455, 'loss/train': 1.7971174716949463} 11/07/2021 00:39:58 - INFO - __main__ - Step 24457: {'lr': 0.00047213158706865246, 'samples': 4695744, 'steps': 24456, 'loss/train': 1.4079968929290771} 11/07/2021 00:39:58 - INFO - __main__ - Step 24458: {'lr': 0.000472129152148465, 'samples': 4695936, 'steps': 24457, 'loss/train': 1.0568734407424927} 11/07/2021 00:39:59 - INFO - __main__ - Step 24459: {'lr': 0.0004721267171281897, 'samples': 4696128, 'steps': 24458, 'loss/train': 1.2152099609375} 11/07/2021 00:39:59 - INFO - __main__ - Step 24460: {'lr': 0.00047212428200782744, 'samples': 4696320, 'steps': 24459, 'loss/train': 1.297367811203003} 11/07/2021 00:40:00 - INFO - __main__ - Step 24461: {'lr': 0.00047212184678737946, 'samples': 4696512, 'steps': 24460, 'loss/train': 1.7175503969192505} 11/07/2021 00:40:00 - INFO - __main__ - Step 24462: {'lr': 0.00047211941146684677, 'samples': 4696704, 'steps': 24461, 'loss/train': 1.3215069770812988} 11/07/2021 00:40:00 - INFO - __main__ - Step 24463: {'lr': 0.00047211697604623056, 'samples': 4696896, 'steps': 24462, 'loss/train': 1.438876748085022} 11/07/2021 00:40:02 - INFO - __main__ - Step 24464: {'lr': 0.0004721145405255318, 'samples': 4697088, 'steps': 24463, 'loss/train': 1.4503346681594849} 11/07/2021 00:40:02 - INFO - __main__ - Step 24465: {'lr': 0.00047211210490475167, 'samples': 4697280, 'steps': 24464, 'loss/train': 1.847247838973999} 11/07/2021 00:40:02 - INFO - __main__ - Step 24466: {'lr': 0.0004721096691838913, 'samples': 4697472, 'steps': 24465, 'loss/train': 0.15198959410190582} 11/07/2021 00:40:03 - INFO - __main__ - Step 24467: {'lr': 0.00047210723336295167, 'samples': 4697664, 'steps': 24466, 'loss/train': 1.6651211977005005} 11/07/2021 00:40:03 - INFO - __main__ - Step 24468: {'lr': 0.00047210479744193404, 'samples': 4697856, 'steps': 24467, 'loss/train': 1.6929770708084106} 11/07/2021 00:40:04 - INFO - __main__ - Step 24469: {'lr': 0.0004721023614208393, 'samples': 4698048, 'steps': 24468, 'loss/train': 1.1923024654388428} 11/07/2021 00:40:05 - INFO - __main__ - Step 24470: {'lr': 0.0004720999252996687, 'samples': 4698240, 'steps': 24469, 'loss/train': 1.750717282295227} 11/07/2021 00:40:05 - INFO - __main__ - Step 24471: {'lr': 0.00047209748907842337, 'samples': 4698432, 'steps': 24470, 'loss/train': 1.480593204498291} 11/07/2021 00:40:05 - INFO - __main__ - Step 24472: {'lr': 0.0004720950527571043, 'samples': 4698624, 'steps': 24471, 'loss/train': 2.0582070350646973} 11/07/2021 00:40:06 - INFO - __main__ - Step 24473: {'lr': 0.0004720926163357126, 'samples': 4698816, 'steps': 24472, 'loss/train': 1.538037896156311} 11/07/2021 00:40:07 - INFO - __main__ - Step 24474: {'lr': 0.0004720901798142494, 'samples': 4699008, 'steps': 24473, 'loss/train': 1.019331932067871} 11/07/2021 00:40:07 - INFO - __main__ - Step 24475: {'lr': 0.00047208774319271586, 'samples': 4699200, 'steps': 24474, 'loss/train': 1.8132907152175903} 11/07/2021 00:40:07 - INFO - __main__ - Step 24476: {'lr': 0.00047208530647111294, 'samples': 4699392, 'steps': 24475, 'loss/train': 1.8565229177474976} 11/07/2021 00:40:08 - INFO - __main__ - Step 24477: {'lr': 0.0004720828696494418, 'samples': 4699584, 'steps': 24476, 'loss/train': 1.696797251701355} 11/07/2021 00:40:08 - INFO - __main__ - Step 24478: {'lr': 0.00047208043272770354, 'samples': 4699776, 'steps': 24477, 'loss/train': 0.24086450040340424} 11/07/2021 00:40:09 - INFO - __main__ - Step 24479: {'lr': 0.0004720779957058993, 'samples': 4699968, 'steps': 24478, 'loss/train': 1.553345799446106} 11/07/2021 00:40:10 - INFO - __main__ - Step 24480: {'lr': 0.0004720755585840302, 'samples': 4700160, 'steps': 24479, 'loss/train': 1.356393814086914} 11/07/2021 00:40:10 - INFO - __main__ - Step 24481: {'lr': 0.0004720731213620972, 'samples': 4700352, 'steps': 24480, 'loss/train': 1.7137320041656494} 11/07/2021 00:40:10 - INFO - __main__ - Step 24482: {'lr': 0.00047207068404010147, 'samples': 4700544, 'steps': 24481, 'loss/train': 1.1787151098251343} 11/07/2021 00:40:11 - INFO - __main__ - Step 24483: {'lr': 0.00047206824661804415, 'samples': 4700736, 'steps': 24482, 'loss/train': 1.1076496839523315} 11/07/2021 00:40:11 - INFO - __main__ - Step 24484: {'lr': 0.0004720658090959263, 'samples': 4700928, 'steps': 24483, 'loss/train': 1.8427814245224} 11/07/2021 00:40:12 - INFO - __main__ - Step 24485: {'lr': 0.000472063371473749, 'samples': 4701120, 'steps': 24484, 'loss/train': 1.2564271688461304} 11/07/2021 00:40:12 - INFO - __main__ - Step 24486: {'lr': 0.0004720609337515134, 'samples': 4701312, 'steps': 24485, 'loss/train': 0.3722918629646301} 11/07/2021 00:40:13 - INFO - __main__ - Step 24487: {'lr': 0.00047205849592922057, 'samples': 4701504, 'steps': 24486, 'loss/train': 1.6480177640914917} 11/07/2021 00:40:13 - INFO - __main__ - Step 24488: {'lr': 0.00047205605800687154, 'samples': 4701696, 'steps': 24487, 'loss/train': 0.6376531720161438} 11/07/2021 00:40:13 - INFO - __main__ - Step 24489: {'lr': 0.0004720536199844676, 'samples': 4701888, 'steps': 24488, 'loss/train': 1.692022442817688} 11/07/2021 00:40:14 - INFO - __main__ - Step 24490: {'lr': 0.00047205118186200963, 'samples': 4702080, 'steps': 24489, 'loss/train': 1.5425223112106323} 11/07/2021 00:40:15 - INFO - __main__ - Step 24491: {'lr': 0.00047204874363949886, 'samples': 4702272, 'steps': 24490, 'loss/train': 1.1524288654327393} 11/07/2021 00:40:15 - INFO - __main__ - Step 24492: {'lr': 0.00047204630531693634, 'samples': 4702464, 'steps': 24491, 'loss/train': 2.0011699199676514} 11/07/2021 00:40:15 - INFO - __main__ - Step 24493: {'lr': 0.0004720438668943232, 'samples': 4702656, 'steps': 24492, 'loss/train': 1.3901020288467407} 11/07/2021 00:40:16 - INFO - __main__ - Step 24494: {'lr': 0.0004720414283716605, 'samples': 4702848, 'steps': 24493, 'loss/train': 1.6455399990081787} 11/07/2021 00:40:17 - INFO - __main__ - Step 24495: {'lr': 0.00047203898974894934, 'samples': 4703040, 'steps': 24494, 'loss/train': 1.5917654037475586} 11/07/2021 00:40:17 - INFO - __main__ - Step 24496: {'lr': 0.0004720365510261909, 'samples': 4703232, 'steps': 24495, 'loss/train': 1.9656726121902466} 11/07/2021 00:40:18 - INFO - __main__ - Step 24497: {'lr': 0.00047203411220338615, 'samples': 4703424, 'steps': 24496, 'loss/train': 1.6324363946914673} 11/07/2021 00:40:18 - INFO - __main__ - Step 24498: {'lr': 0.00047203167328053634, 'samples': 4703616, 'steps': 24497, 'loss/train': 1.4276381731033325} 11/07/2021 00:40:18 - INFO - __main__ - Step 24499: {'lr': 0.0004720292342576423, 'samples': 4703808, 'steps': 24498, 'loss/train': 1.9737954139709473} 11/07/2021 00:40:19 - INFO - __main__ - Step 24500: {'lr': 0.0004720267951347055, 'samples': 4704000, 'steps': 24499, 'loss/train': 1.656864881515503} 11/07/2021 00:40:20 - INFO - __main__ - Step 24501: {'lr': 0.00047202435591172677, 'samples': 4704192, 'steps': 24500, 'loss/train': 1.9080573320388794} 11/07/2021 00:40:20 - INFO - __main__ - Step 24502: {'lr': 0.00047202191658870737, 'samples': 4704384, 'steps': 24501, 'loss/train': 1.2758409976959229} 11/07/2021 00:40:20 - INFO - __main__ - Step 24503: {'lr': 0.00047201947716564826, 'samples': 4704576, 'steps': 24502, 'loss/train': 1.6238764524459839} 11/07/2021 00:40:21 - INFO - __main__ - Step 24504: {'lr': 0.00047201703764255057, 'samples': 4704768, 'steps': 24503, 'loss/train': 1.4966262578964233} 11/07/2021 00:40:22 - INFO - __main__ - Step 24505: {'lr': 0.0004720145980194155, 'samples': 4704960, 'steps': 24504, 'loss/train': 2.0411877632141113} 11/07/2021 00:40:22 - INFO - __main__ - Step 24506: {'lr': 0.000472012158296244, 'samples': 4705152, 'steps': 24505, 'loss/train': 1.6829310655593872} 11/07/2021 00:40:22 - INFO - __main__ - Step 24507: {'lr': 0.0004720097184730373, 'samples': 4705344, 'steps': 24506, 'loss/train': 2.1145436763763428} 11/07/2021 00:40:23 - INFO - __main__ - Step 24508: {'lr': 0.00047200727854979644, 'samples': 4705536, 'steps': 24507, 'loss/train': 1.7645108699798584} 11/07/2021 00:40:23 - INFO - __main__ - Step 24509: {'lr': 0.00047200483852652257, 'samples': 4705728, 'steps': 24508, 'loss/train': 1.630983591079712} 11/07/2021 00:40:24 - INFO - __main__ - Step 24510: {'lr': 0.0004720023984032167, 'samples': 4705920, 'steps': 24509, 'loss/train': 1.414451003074646} 11/07/2021 00:40:25 - INFO - __main__ - Step 24511: {'lr': 0.00047199995817987997, 'samples': 4706112, 'steps': 24510, 'loss/train': 1.2542831897735596} 11/07/2021 00:40:25 - INFO - __main__ - Step 24512: {'lr': 0.00047199751785651346, 'samples': 4706304, 'steps': 24511, 'loss/train': 1.7242752313613892} 11/07/2021 00:40:25 - INFO - __main__ - Step 24513: {'lr': 0.0004719950774331183, 'samples': 4706496, 'steps': 24512, 'loss/train': 1.3297889232635498} 11/07/2021 00:40:26 - INFO - __main__ - Step 24514: {'lr': 0.00047199263690969563, 'samples': 4706688, 'steps': 24513, 'loss/train': 1.5910593271255493} 11/07/2021 00:40:27 - INFO - __main__ - Step 24515: {'lr': 0.00047199019628624647, 'samples': 4706880, 'steps': 24514, 'loss/train': 1.3865641355514526} 11/07/2021 00:40:27 - INFO - __main__ - Step 24516: {'lr': 0.00047198775556277195, 'samples': 4707072, 'steps': 24515, 'loss/train': 1.639611005783081} 11/07/2021 00:40:27 - INFO - __main__ - Step 24517: {'lr': 0.0004719853147392732, 'samples': 4707264, 'steps': 24516, 'loss/train': 2.0972113609313965} 11/07/2021 00:40:28 - INFO - __main__ - Step 24518: {'lr': 0.0004719828738157512, 'samples': 4707456, 'steps': 24517, 'loss/train': 1.7587006092071533} 11/07/2021 00:40:28 - INFO - __main__ - Step 24519: {'lr': 0.0004719804327922073, 'samples': 4707648, 'steps': 24518, 'loss/train': 0.929109513759613} 11/07/2021 00:40:29 - INFO - __main__ - Step 24520: {'lr': 0.00047197799166864233, 'samples': 4707840, 'steps': 24519, 'loss/train': 1.3555177450180054} 11/07/2021 00:40:29 - INFO - __main__ - Step 24521: {'lr': 0.00047197555044505756, 'samples': 4708032, 'steps': 24520, 'loss/train': 1.371085286140442} 11/07/2021 00:40:30 - INFO - __main__ - Step 24522: {'lr': 0.000471973109121454, 'samples': 4708224, 'steps': 24521, 'loss/train': 1.789271354675293} 11/07/2021 00:40:30 - INFO - __main__ - Step 24523: {'lr': 0.00047197066769783284, 'samples': 4708416, 'steps': 24522, 'loss/train': 2.5680861473083496} 11/07/2021 00:40:30 - INFO - __main__ - Step 24524: {'lr': 0.000471968226174195, 'samples': 4708608, 'steps': 24523, 'loss/train': 1.28214430809021} 11/07/2021 00:40:31 - INFO - __main__ - Step 24525: {'lr': 0.00047196578455054175, 'samples': 4708800, 'steps': 24524, 'loss/train': 1.6914494037628174} 11/07/2021 00:40:32 - INFO - __main__ - Step 24526: {'lr': 0.00047196334282687414, 'samples': 4708992, 'steps': 24525, 'loss/train': 1.08967125415802} 11/07/2021 00:40:32 - INFO - __main__ - Step 24527: {'lr': 0.00047196090100319333, 'samples': 4709184, 'steps': 24526, 'loss/train': 2.022808313369751} 11/07/2021 00:40:32 - INFO - __main__ - Step 24528: {'lr': 0.00047195845907950035, 'samples': 4709376, 'steps': 24527, 'loss/train': 1.8660812377929688} 11/07/2021 00:40:33 - INFO - __main__ - Step 24529: {'lr': 0.0004719560170557963, 'samples': 4709568, 'steps': 24528, 'loss/train': 0.7722143530845642} 11/07/2021 00:40:34 - INFO - __main__ - Step 24530: {'lr': 0.0004719535749320823, 'samples': 4709760, 'steps': 24529, 'loss/train': 1.6197178363800049} 11/07/2021 00:40:34 - INFO - __main__ - Step 24531: {'lr': 0.0004719511327083594, 'samples': 4709952, 'steps': 24530, 'loss/train': 1.1182323694229126} 11/07/2021 00:40:35 - INFO - __main__ - Step 24532: {'lr': 0.0004719486903846288, 'samples': 4710144, 'steps': 24531, 'loss/train': 1.2211271524429321} 11/07/2021 00:40:35 - INFO - __main__ - Step 24533: {'lr': 0.0004719462479608915, 'samples': 4710336, 'steps': 24532, 'loss/train': 1.649878978729248} 11/07/2021 00:40:35 - INFO - __main__ - Step 24534: {'lr': 0.0004719438054371487, 'samples': 4710528, 'steps': 24533, 'loss/train': 1.3666858673095703} 11/07/2021 00:40:36 - INFO - __main__ - Step 24535: {'lr': 0.00047194136281340137, 'samples': 4710720, 'steps': 24534, 'loss/train': 1.4466700553894043} 11/07/2021 00:40:37 - INFO - __main__ - Step 24536: {'lr': 0.00047193892008965077, 'samples': 4710912, 'steps': 24535, 'loss/train': 0.9865069389343262} 11/07/2021 00:40:37 - INFO - __main__ - Step 24537: {'lr': 0.0004719364772658978, 'samples': 4711104, 'steps': 24536, 'loss/train': 1.7701717615127563} 11/07/2021 00:40:38 - INFO - __main__ - Step 24538: {'lr': 0.00047193403434214385, 'samples': 4711296, 'steps': 24537, 'loss/train': 1.671497106552124} 11/07/2021 00:40:38 - INFO - __main__ - Step 24539: {'lr': 0.0004719315913183897, 'samples': 4711488, 'steps': 24538, 'loss/train': 1.300622582435608} 11/07/2021 00:40:38 - INFO - __main__ - Step 24540: {'lr': 0.0004719291481946367, 'samples': 4711680, 'steps': 24539, 'loss/train': 1.2521655559539795} 11/07/2021 00:40:39 - INFO - __main__ - Step 24541: {'lr': 0.00047192670497088577, 'samples': 4711872, 'steps': 24540, 'loss/train': 1.7195566892623901} 11/07/2021 00:40:40 - INFO - __main__ - Step 24542: {'lr': 0.0004719242616471381, 'samples': 4712064, 'steps': 24541, 'loss/train': 1.094425916671753} 11/07/2021 00:40:40 - INFO - __main__ - Step 24543: {'lr': 0.00047192181822339484, 'samples': 4712256, 'steps': 24542, 'loss/train': 1.9918973445892334} 11/07/2021 00:40:40 - INFO - __main__ - Step 24544: {'lr': 0.000471919374699657, 'samples': 4712448, 'steps': 24543, 'loss/train': 1.5135279893875122} 11/07/2021 00:40:41 - INFO - __main__ - Step 24545: {'lr': 0.0004719169310759257, 'samples': 4712640, 'steps': 24544, 'loss/train': 1.1496442556381226} 11/07/2021 00:40:42 - INFO - __main__ - Step 24546: {'lr': 0.0004719144873522021, 'samples': 4712832, 'steps': 24545, 'loss/train': 1.596967339515686} 11/07/2021 00:40:42 - INFO - __main__ - Step 24547: {'lr': 0.0004719120435284872, 'samples': 4713024, 'steps': 24546, 'loss/train': 1.982956886291504} 11/07/2021 00:40:42 - INFO - __main__ - Step 24548: {'lr': 0.0004719095996047822, 'samples': 4713216, 'steps': 24547, 'loss/train': 1.156897783279419} 11/07/2021 00:40:43 - INFO - __main__ - Step 24549: {'lr': 0.0004719071555810881, 'samples': 4713408, 'steps': 24548, 'loss/train': 1.8267337083816528} 11/07/2021 00:40:43 - INFO - __main__ - Step 24550: {'lr': 0.00047190471145740616, 'samples': 4713600, 'steps': 24549, 'loss/train': 1.2680888175964355} 11/07/2021 00:40:43 - INFO - __main__ - Step 24551: {'lr': 0.0004719022672337373, 'samples': 4713792, 'steps': 24550, 'loss/train': 1.3638585805892944} 11/07/2021 00:40:44 - INFO - __main__ - Step 24552: {'lr': 0.0004718998229100827, 'samples': 4713984, 'steps': 24551, 'loss/train': 1.6720409393310547} 11/07/2021 00:40:45 - INFO - __main__ - Step 24553: {'lr': 0.00047189737848644356, 'samples': 4714176, 'steps': 24552, 'loss/train': 1.2578575611114502} 11/07/2021 00:40:45 - INFO - __main__ - Step 24554: {'lr': 0.0004718949339628208, 'samples': 4714368, 'steps': 24553, 'loss/train': 0.9185448288917542} 11/07/2021 00:40:46 - INFO - __main__ - Step 24555: {'lr': 0.0004718924893392156, 'samples': 4714560, 'steps': 24554, 'loss/train': 1.4076365232467651} 11/07/2021 00:40:46 - INFO - __main__ - Step 24556: {'lr': 0.0004718900446156291, 'samples': 4714752, 'steps': 24555, 'loss/train': 1.4986615180969238} 11/07/2021 00:40:47 - INFO - __main__ - Step 24557: {'lr': 0.00047188759979206236, 'samples': 4714944, 'steps': 24556, 'loss/train': 0.7693179845809937} 11/07/2021 00:40:47 - INFO - __main__ - Step 24558: {'lr': 0.00047188515486851646, 'samples': 4715136, 'steps': 24557, 'loss/train': 1.5965371131896973} 11/07/2021 00:40:48 - INFO - __main__ - Step 24559: {'lr': 0.0004718827098449926, 'samples': 4715328, 'steps': 24558, 'loss/train': 1.1753389835357666} 11/07/2021 00:40:48 - INFO - __main__ - Step 24560: {'lr': 0.00047188026472149184, 'samples': 4715520, 'steps': 24559, 'loss/train': 1.8036824464797974} 11/07/2021 00:40:48 - INFO - __main__ - Step 24561: {'lr': 0.0004718778194980151, 'samples': 4715712, 'steps': 24560, 'loss/train': 1.3673409223556519} 11/07/2021 00:40:50 - INFO - __main__ - Step 24562: {'lr': 0.00047187537417456375, 'samples': 4715904, 'steps': 24561, 'loss/train': 1.8237707614898682} 11/07/2021 00:40:50 - INFO - __main__ - Step 24563: {'lr': 0.00047187292875113874, 'samples': 4716096, 'steps': 24562, 'loss/train': 1.5906968116760254} 11/07/2021 00:40:50 - INFO - __main__ - Step 24564: {'lr': 0.0004718704832277413, 'samples': 4716288, 'steps': 24563, 'loss/train': 1.953343152999878} 11/07/2021 00:40:51 - INFO - __main__ - Step 24565: {'lr': 0.0004718680376043724, 'samples': 4716480, 'steps': 24564, 'loss/train': 0.17302846908569336} 11/07/2021 00:40:51 - INFO - __main__ - Step 24566: {'lr': 0.00047186559188103314, 'samples': 4716672, 'steps': 24565, 'loss/train': 1.684582233428955} 11/07/2021 00:40:51 - INFO - __main__ - Step 24567: {'lr': 0.00047186314605772466, 'samples': 4716864, 'steps': 24566, 'loss/train': 1.2550259828567505} 11/07/2021 00:40:52 - INFO - __main__ - Step 24568: {'lr': 0.00047186070013444814, 'samples': 4717056, 'steps': 24567, 'loss/train': 1.7285406589508057} 11/07/2021 00:40:53 - INFO - __main__ - Step 24569: {'lr': 0.00047185825411120454, 'samples': 4717248, 'steps': 24568, 'loss/train': 1.6954476833343506} 11/07/2021 00:40:53 - INFO - __main__ - Step 24570: {'lr': 0.0004718558079879951, 'samples': 4717440, 'steps': 24569, 'loss/train': 1.5529531240463257} 11/07/2021 00:40:53 - INFO - __main__ - Step 24571: {'lr': 0.00047185336176482084, 'samples': 4717632, 'steps': 24570, 'loss/train': 1.6657012701034546} 11/07/2021 00:40:54 - INFO - __main__ - Step 24572: {'lr': 0.00047185091544168286, 'samples': 4717824, 'steps': 24571, 'loss/train': 1.475118637084961} 11/07/2021 00:40:55 - INFO - __main__ - Step 24573: {'lr': 0.00047184846901858225, 'samples': 4718016, 'steps': 24572, 'loss/train': 1.6777819395065308} 11/07/2021 00:40:55 - INFO - __main__ - Step 24574: {'lr': 0.0004718460224955202, 'samples': 4718208, 'steps': 24573, 'loss/train': 1.7281380891799927} 11/07/2021 00:40:55 - INFO - __main__ - Step 24575: {'lr': 0.0004718435758724977, 'samples': 4718400, 'steps': 24574, 'loss/train': 1.850234031677246} 11/07/2021 00:40:56 - INFO - __main__ - Step 24576: {'lr': 0.000471841129149516, 'samples': 4718592, 'steps': 24575, 'loss/train': 1.2701635360717773} 11/07/2021 00:40:56 - INFO - __main__ - Step 24577: {'lr': 0.000471838682326576, 'samples': 4718784, 'steps': 24576, 'loss/train': 1.5703866481781006} 11/07/2021 00:40:57 - INFO - __main__ - Step 24578: {'lr': 0.000471836235403679, 'samples': 4718976, 'steps': 24577, 'loss/train': 2.2552669048309326} 11/07/2021 00:40:58 - INFO - __main__ - Step 24579: {'lr': 0.000471833788380826, 'samples': 4719168, 'steps': 24578, 'loss/train': 1.2086560726165771} 11/07/2021 00:40:58 - INFO - __main__ - Step 24580: {'lr': 0.0004718313412580181, 'samples': 4719360, 'steps': 24579, 'loss/train': 1.2085537910461426} 11/07/2021 00:40:58 - INFO - __main__ - Step 24581: {'lr': 0.0004718288940352564, 'samples': 4719552, 'steps': 24580, 'loss/train': 1.4074021577835083} 11/07/2021 00:40:59 - INFO - __main__ - Step 24582: {'lr': 0.00047182644671254207, 'samples': 4719744, 'steps': 24581, 'loss/train': 1.8190374374389648} 11/07/2021 00:41:00 - INFO - __main__ - Step 24583: {'lr': 0.0004718239992898761, 'samples': 4719936, 'steps': 24582, 'loss/train': 1.3924469947814941} 11/07/2021 00:41:00 - INFO - __main__ - Step 24584: {'lr': 0.00047182155176725974, 'samples': 4720128, 'steps': 24583, 'loss/train': 1.4869482517242432} 11/07/2021 00:41:00 - INFO - __main__ - Step 24585: {'lr': 0.00047181910414469396, 'samples': 4720320, 'steps': 24584, 'loss/train': 1.1537373065948486} 11/07/2021 00:41:01 - INFO - __main__ - Step 24586: {'lr': 0.0004718166564221799, 'samples': 4720512, 'steps': 24585, 'loss/train': 1.821776032447815} 11/07/2021 00:41:01 - INFO - __main__ - Step 24587: {'lr': 0.0004718142085997187, 'samples': 4720704, 'steps': 24586, 'loss/train': 1.1884390115737915} 11/07/2021 00:41:02 - INFO - __main__ - Step 24588: {'lr': 0.0004718117606773115, 'samples': 4720896, 'steps': 24587, 'loss/train': 3.4263834953308105} 11/07/2021 00:41:03 - INFO - __main__ - Step 24589: {'lr': 0.0004718093126549592, 'samples': 4721088, 'steps': 24588, 'loss/train': 1.226837396621704} 11/07/2021 00:41:03 - INFO - __main__ - Step 24590: {'lr': 0.0004718068645326632, 'samples': 4721280, 'steps': 24589, 'loss/train': 0.8642450571060181} 11/07/2021 00:41:03 - INFO - __main__ - Step 24591: {'lr': 0.0004718044163104244, 'samples': 4721472, 'steps': 24590, 'loss/train': 1.049943447113037} 11/07/2021 00:41:04 - INFO - __main__ - Step 24592: {'lr': 0.0004718019679882439, 'samples': 4721664, 'steps': 24591, 'loss/train': 1.2182432413101196} 11/07/2021 00:41:04 - INFO - __main__ - Step 24593: {'lr': 0.0004717995195661229, 'samples': 4721856, 'steps': 24592, 'loss/train': 3.1901888847351074} 11/07/2021 00:41:05 - INFO - __main__ - Step 24594: {'lr': 0.00047179707104406243, 'samples': 4722048, 'steps': 24593, 'loss/train': 1.1043592691421509} 11/07/2021 00:41:05 - INFO - __main__ - Step 24595: {'lr': 0.0004717946224220637, 'samples': 4722240, 'steps': 24594, 'loss/train': 1.4727208614349365} 11/07/2021 00:41:06 - INFO - __main__ - Step 24596: {'lr': 0.0004717921737001276, 'samples': 4722432, 'steps': 24595, 'loss/train': 1.3377717733383179} 11/07/2021 00:41:06 - INFO - __main__ - Step 24597: {'lr': 0.0004717897248782555, 'samples': 4722624, 'steps': 24596, 'loss/train': 1.4173022508621216} 11/07/2021 00:41:06 - INFO - __main__ - Step 24598: {'lr': 0.0004717872759564483, 'samples': 4722816, 'steps': 24597, 'loss/train': 1.6326979398727417} 11/07/2021 00:41:07 - INFO - __main__ - Step 24599: {'lr': 0.00047178482693470723, 'samples': 4723008, 'steps': 24598, 'loss/train': 1.0094153881072998} 11/07/2021 00:41:08 - INFO - __main__ - Step 24600: {'lr': 0.0004717823778130333, 'samples': 4723200, 'steps': 24599, 'loss/train': 1.2238973379135132} 11/07/2021 00:41:08 - INFO - __main__ - Step 24601: {'lr': 0.0004717799285914276, 'samples': 4723392, 'steps': 24600, 'loss/train': 1.5726311206817627} 11/07/2021 00:41:09 - INFO - __main__ - Step 24602: {'lr': 0.00047177747926989134, 'samples': 4723584, 'steps': 24601, 'loss/train': 1.7290757894515991} 11/07/2021 00:41:09 - INFO - __main__ - Step 24603: {'lr': 0.00047177502984842556, 'samples': 4723776, 'steps': 24602, 'loss/train': 1.8465595245361328} 11/07/2021 00:41:10 - INFO - __main__ - Step 24604: {'lr': 0.0004717725803270314, 'samples': 4723968, 'steps': 24603, 'loss/train': 1.3708162307739258} 11/07/2021 00:41:11 - INFO - __main__ - Step 24605: {'lr': 0.00047177013070570997, 'samples': 4724160, 'steps': 24604, 'loss/train': 1.8157235383987427} 11/07/2021 00:41:11 - INFO - __main__ - Step 24606: {'lr': 0.00047176768098446234, 'samples': 4724352, 'steps': 24605, 'loss/train': 1.3547577857971191} 11/07/2021 00:41:11 - INFO - __main__ - Step 24607: {'lr': 0.0004717652311632895, 'samples': 4724544, 'steps': 24606, 'loss/train': 1.55780029296875} 11/07/2021 00:41:12 - INFO - __main__ - Step 24608: {'lr': 0.00047176278124219276, 'samples': 4724736, 'steps': 24607, 'loss/train': 1.9747414588928223} 11/07/2021 00:41:12 - INFO - __main__ - Step 24609: {'lr': 0.0004717603312211731, 'samples': 4724928, 'steps': 24608, 'loss/train': 1.699101209640503} 11/07/2021 00:41:13 - INFO - __main__ - Step 24610: {'lr': 0.0004717578811002317, 'samples': 4725120, 'steps': 24609, 'loss/train': 1.3912477493286133} 11/07/2021 00:41:13 - INFO - __main__ - Step 24611: {'lr': 0.00047175543087936954, 'samples': 4725312, 'steps': 24610, 'loss/train': 1.4214597940444946} 11/07/2021 00:41:14 - INFO - __main__ - Step 24612: {'lr': 0.0004717529805585879, 'samples': 4725504, 'steps': 24611, 'loss/train': 1.5167274475097656} 11/07/2021 00:41:14 - INFO - __main__ - Step 24613: {'lr': 0.0004717505301378877, 'samples': 4725696, 'steps': 24612, 'loss/train': 1.8070693016052246} 11/07/2021 00:41:14 - INFO - __main__ - Step 24614: {'lr': 0.0004717480796172702, 'samples': 4725888, 'steps': 24613, 'loss/train': 1.4449204206466675} 11/07/2021 00:41:16 - INFO - __main__ - Step 24615: {'lr': 0.00047174562899673645, 'samples': 4726080, 'steps': 24614, 'loss/train': 1.1564642190933228} 11/07/2021 00:41:16 - INFO - __main__ - Step 24616: {'lr': 0.0004717431782762875, 'samples': 4726272, 'steps': 24615, 'loss/train': 1.9751918315887451} 11/07/2021 00:41:16 - INFO - __main__ - Step 24617: {'lr': 0.0004717407274559245, 'samples': 4726464, 'steps': 24616, 'loss/train': 1.8290555477142334} 11/07/2021 00:41:17 - INFO - __main__ - Step 24618: {'lr': 0.0004717382765356485, 'samples': 4726656, 'steps': 24617, 'loss/train': 1.8763755559921265} 11/07/2021 00:41:17 - INFO - __main__ - Step 24619: {'lr': 0.0004717358255154607, 'samples': 4726848, 'steps': 24618, 'loss/train': 0.9470406770706177} 11/07/2021 00:41:18 - INFO - __main__ - Step 24620: {'lr': 0.0004717333743953622, 'samples': 4727040, 'steps': 24619, 'loss/train': 0.10981487482786179} 11/07/2021 00:41:18 - INFO - __main__ - Step 24621: {'lr': 0.00047173092317535404, 'samples': 4727232, 'steps': 24620, 'loss/train': 1.7464439868927002} 11/07/2021 00:41:19 - INFO - __main__ - Step 24622: {'lr': 0.0004717284718554373, 'samples': 4727424, 'steps': 24621, 'loss/train': 1.660263180732727} 11/07/2021 00:41:19 - INFO - __main__ - Step 24623: {'lr': 0.00047172602043561317, 'samples': 4727616, 'steps': 24622, 'loss/train': 1.3084211349487305} 11/07/2021 00:41:19 - INFO - __main__ - Step 24624: {'lr': 0.00047172356891588273, 'samples': 4727808, 'steps': 24623, 'loss/train': 2.0479350090026855} 11/07/2021 00:41:21 - INFO - __main__ - Step 24625: {'lr': 0.0004717211172962471, 'samples': 4728000, 'steps': 24624, 'loss/train': 1.5201829671859741} 11/07/2021 00:41:21 - INFO - __main__ - Step 24626: {'lr': 0.0004717186655767073, 'samples': 4728192, 'steps': 24625, 'loss/train': 1.1362905502319336} 11/07/2021 00:41:21 - INFO - __main__ - Step 24627: {'lr': 0.0004717162137572645, 'samples': 4728384, 'steps': 24626, 'loss/train': 1.5421643257141113} 11/07/2021 00:41:22 - INFO - __main__ - Step 24628: {'lr': 0.0004717137618379198, 'samples': 4728576, 'steps': 24627, 'loss/train': 1.5219807624816895} 11/07/2021 00:41:22 - INFO - __main__ - Step 24629: {'lr': 0.0004717113098186743, 'samples': 4728768, 'steps': 24628, 'loss/train': 1.4583017826080322} 11/07/2021 00:41:22 - INFO - __main__ - Step 24630: {'lr': 0.00047170885769952907, 'samples': 4728960, 'steps': 24629, 'loss/train': 1.637890338897705} 11/07/2021 00:41:24 - INFO - __main__ - Step 24631: {'lr': 0.00047170640548048525, 'samples': 4729152, 'steps': 24630, 'loss/train': 1.4171907901763916} 11/07/2021 00:41:24 - INFO - __main__ - Step 24632: {'lr': 0.000471703953161544, 'samples': 4729344, 'steps': 24631, 'loss/train': 1.9250282049179077} 11/07/2021 00:41:24 - INFO - __main__ - Step 24633: {'lr': 0.00047170150074270635, 'samples': 4729536, 'steps': 24632, 'loss/train': 1.8162815570831299} 11/07/2021 00:41:25 - INFO - __main__ - Step 24634: {'lr': 0.0004716990482239735, 'samples': 4729728, 'steps': 24633, 'loss/train': 1.6207184791564941} 11/07/2021 00:41:25 - INFO - __main__ - Step 24635: {'lr': 0.0004716965956053463, 'samples': 4729920, 'steps': 24634, 'loss/train': 1.1402711868286133} 11/07/2021 00:41:26 - INFO - __main__ - Step 24636: {'lr': 0.00047169414288682616, 'samples': 4730112, 'steps': 24635, 'loss/train': 1.40772545337677} 11/07/2021 00:41:26 - INFO - __main__ - Step 24637: {'lr': 0.0004716916900684141, 'samples': 4730304, 'steps': 24636, 'loss/train': 1.7439367771148682} 11/07/2021 00:41:27 - INFO - __main__ - Step 24638: {'lr': 0.00047168923715011103, 'samples': 4730496, 'steps': 24637, 'loss/train': 1.7972429990768433} 11/07/2021 00:41:27 - INFO - __main__ - Step 24639: {'lr': 0.00047168678413191833, 'samples': 4730688, 'steps': 24638, 'loss/train': 1.9872496128082275} 11/07/2021 00:41:27 - INFO - __main__ - Step 24640: {'lr': 0.00047168433101383694, 'samples': 4730880, 'steps': 24639, 'loss/train': 1.5553479194641113} 11/07/2021 00:41:28 - INFO - __main__ - Step 24641: {'lr': 0.000471681877795868, 'samples': 4731072, 'steps': 24640, 'loss/train': 1.7604968547821045} 11/07/2021 00:41:29 - INFO - __main__ - Step 24642: {'lr': 0.0004716794244780127, 'samples': 4731264, 'steps': 24641, 'loss/train': 1.5986493825912476} 11/07/2021 00:41:29 - INFO - __main__ - Step 24643: {'lr': 0.0004716769710602721, 'samples': 4731456, 'steps': 24642, 'loss/train': 1.5960829257965088} 11/07/2021 00:41:29 - INFO - __main__ - Step 24644: {'lr': 0.00047167451754264714, 'samples': 4731648, 'steps': 24643, 'loss/train': 1.3092007637023926} 11/07/2021 00:41:30 - INFO - __main__ - Step 24645: {'lr': 0.0004716720639251392, 'samples': 4731840, 'steps': 24644, 'loss/train': 1.645534873008728} 11/07/2021 00:41:31 - INFO - __main__ - Step 24646: {'lr': 0.0004716696102077491, 'samples': 4732032, 'steps': 24645, 'loss/train': 0.7927688956260681} 11/07/2021 00:41:31 - INFO - __main__ - Step 24647: {'lr': 0.0004716671563904782, 'samples': 4732224, 'steps': 24646, 'loss/train': 1.4662938117980957} 11/07/2021 00:41:32 - INFO - __main__ - Step 24648: {'lr': 0.0004716647024733275, 'samples': 4732416, 'steps': 24647, 'loss/train': 1.9069128036499023} 11/07/2021 00:41:32 - INFO - __main__ - Step 24649: {'lr': 0.00047166224845629804, 'samples': 4732608, 'steps': 24648, 'loss/train': 1.5420812368392944} 11/07/2021 00:41:32 - INFO - __main__ - Step 24650: {'lr': 0.000471659794339391, 'samples': 4732800, 'steps': 24649, 'loss/train': 1.5934165716171265} 11/07/2021 00:41:33 - INFO - __main__ - Step 24651: {'lr': 0.00047165734012260754, 'samples': 4732992, 'steps': 24650, 'loss/train': 1.610710859298706} 11/07/2021 00:41:34 - INFO - __main__ - Step 24652: {'lr': 0.0004716548858059486, 'samples': 4733184, 'steps': 24651, 'loss/train': 1.2758663892745972} 11/07/2021 00:41:34 - INFO - __main__ - Step 24653: {'lr': 0.0004716524313894155, 'samples': 4733376, 'steps': 24652, 'loss/train': 1.113743782043457} 11/07/2021 00:41:34 - INFO - __main__ - Step 24654: {'lr': 0.0004716499768730092, 'samples': 4733568, 'steps': 24653, 'loss/train': 1.840376853942871} 11/07/2021 00:41:35 - INFO - __main__ - Step 24655: {'lr': 0.0004716475222567308, 'samples': 4733760, 'steps': 24654, 'loss/train': 1.646635890007019} 11/07/2021 00:41:36 - INFO - __main__ - Step 24656: {'lr': 0.0004716450675405815, 'samples': 4733952, 'steps': 24655, 'loss/train': 1.5526431798934937} 11/07/2021 00:41:36 - INFO - __main__ - Step 24657: {'lr': 0.0004716426127245623, 'samples': 4734144, 'steps': 24656, 'loss/train': 1.4510862827301025} 11/07/2021 00:41:36 - INFO - __main__ - Step 24658: {'lr': 0.00047164015780867444, 'samples': 4734336, 'steps': 24657, 'loss/train': 1.7362557649612427} 11/07/2021 00:41:37 - INFO - __main__ - Step 24659: {'lr': 0.0004716377027929189, 'samples': 4734528, 'steps': 24658, 'loss/train': 1.2027533054351807} 11/07/2021 00:41:37 - INFO - __main__ - Step 24660: {'lr': 0.00047163524767729684, 'samples': 4734720, 'steps': 24659, 'loss/train': 1.1476722955703735} 11/07/2021 00:41:38 - INFO - __main__ - Step 24661: {'lr': 0.0004716327924618093, 'samples': 4734912, 'steps': 24660, 'loss/train': 1.578438639640808} 11/07/2021 00:41:38 - INFO - __main__ - Step 24662: {'lr': 0.0004716303371464575, 'samples': 4735104, 'steps': 24661, 'loss/train': 2.1853394508361816} 11/07/2021 00:41:39 - INFO - __main__ - Step 24663: {'lr': 0.0004716278817312425, 'samples': 4735296, 'steps': 24662, 'loss/train': 1.361633062362671} 11/07/2021 00:41:39 - INFO - __main__ - Step 24664: {'lr': 0.0004716254262161653, 'samples': 4735488, 'steps': 24663, 'loss/train': 1.9795877933502197} 11/07/2021 00:41:39 - INFO - __main__ - Step 24665: {'lr': 0.00047162297060122726, 'samples': 4735680, 'steps': 24664, 'loss/train': 1.582615613937378} 11/07/2021 00:41:40 - INFO - __main__ - Step 24666: {'lr': 0.0004716205148864292, 'samples': 4735872, 'steps': 24665, 'loss/train': 1.3622562885284424} 11/07/2021 00:41:41 - INFO - __main__ - Step 24667: {'lr': 0.0004716180590717724, 'samples': 4736064, 'steps': 24666, 'loss/train': 0.957697868347168} 11/07/2021 00:41:41 - INFO - __main__ - Step 24668: {'lr': 0.0004716156031572579, 'samples': 4736256, 'steps': 24667, 'loss/train': 1.5222654342651367} 11/07/2021 00:41:41 - INFO - __main__ - Step 24669: {'lr': 0.00047161314714288697, 'samples': 4736448, 'steps': 24668, 'loss/train': 1.4493942260742188} 11/07/2021 00:41:42 - INFO - __main__ - Step 24670: {'lr': 0.00047161069102866037, 'samples': 4736640, 'steps': 24669, 'loss/train': 1.5919777154922485} 11/07/2021 00:41:43 - INFO - __main__ - Step 24671: {'lr': 0.00047160823481457955, 'samples': 4736832, 'steps': 24670, 'loss/train': 1.7107733488082886} 11/07/2021 00:41:43 - INFO - __main__ - Step 24672: {'lr': 0.0004716057785006454, 'samples': 4737024, 'steps': 24671, 'loss/train': 1.3636122941970825} 11/07/2021 00:41:44 - INFO - __main__ - Step 24673: {'lr': 0.00047160332208685915, 'samples': 4737216, 'steps': 24672, 'loss/train': 3.1674327850341797} 11/07/2021 00:41:44 - INFO - __main__ - Step 24674: {'lr': 0.00047160086557322185, 'samples': 4737408, 'steps': 24673, 'loss/train': 1.5195907354354858} 11/07/2021 00:41:44 - INFO - __main__ - Step 24675: {'lr': 0.0004715984089597346, 'samples': 4737600, 'steps': 24674, 'loss/train': 0.8802253007888794} 11/07/2021 00:41:45 - INFO - __main__ - Step 24676: {'lr': 0.00047159595224639854, 'samples': 4737792, 'steps': 24675, 'loss/train': 1.9900333881378174} 11/07/2021 00:41:46 - INFO - __main__ - Step 24677: {'lr': 0.00047159349543321477, 'samples': 4737984, 'steps': 24676, 'loss/train': 1.7431213855743408} 11/07/2021 00:41:46 - INFO - __main__ - Step 24678: {'lr': 0.00047159103852018443, 'samples': 4738176, 'steps': 24677, 'loss/train': 1.8001621961593628} 11/07/2021 00:41:46 - INFO - __main__ - Step 24679: {'lr': 0.00047158858150730856, 'samples': 4738368, 'steps': 24678, 'loss/train': 1.2686253786087036} 11/07/2021 00:41:47 - INFO - __main__ - Step 24680: {'lr': 0.00047158612439458824, 'samples': 4738560, 'steps': 24679, 'loss/train': 1.2404592037200928} 11/07/2021 00:41:48 - INFO - __main__ - Step 24681: {'lr': 0.00047158366718202466, 'samples': 4738752, 'steps': 24680, 'loss/train': 1.5559760332107544} 11/07/2021 00:41:48 - INFO - __main__ - Step 24682: {'lr': 0.00047158120986961897, 'samples': 4738944, 'steps': 24681, 'loss/train': 1.5051125288009644} 11/07/2021 00:41:49 - INFO - __main__ - Step 24683: {'lr': 0.00047157875245737213, 'samples': 4739136, 'steps': 24682, 'loss/train': 1.294028401374817} 11/07/2021 00:41:49 - INFO - __main__ - Step 24684: {'lr': 0.0004715762949452853, 'samples': 4739328, 'steps': 24683, 'loss/train': 0.12313494086265564} 11/07/2021 00:41:49 - INFO - __main__ - Step 24685: {'lr': 0.0004715738373333597, 'samples': 4739520, 'steps': 24684, 'loss/train': 1.85573410987854} 11/07/2021 00:41:50 - INFO - __main__ - Step 24686: {'lr': 0.00047157137962159626, 'samples': 4739712, 'steps': 24685, 'loss/train': 1.5269348621368408} 11/07/2021 00:41:51 - INFO - __main__ - Step 24687: {'lr': 0.00047156892180999624, 'samples': 4739904, 'steps': 24686, 'loss/train': 1.6515963077545166} 11/07/2021 00:41:51 - INFO - __main__ - Step 24688: {'lr': 0.0004715664638985606, 'samples': 4740096, 'steps': 24687, 'loss/train': 1.681512713432312} 11/07/2021 00:41:51 - INFO - __main__ - Step 24689: {'lr': 0.00047156400588729066, 'samples': 4740288, 'steps': 24688, 'loss/train': 1.5862340927124023} 11/07/2021 00:41:52 - INFO - __main__ - Step 24690: {'lr': 0.0004715615477761873, 'samples': 4740480, 'steps': 24689, 'loss/train': 1.5726406574249268} 11/07/2021 00:41:52 - INFO - __main__ - Step 24691: {'lr': 0.00047155908956525173, 'samples': 4740672, 'steps': 24690, 'loss/train': 1.6381641626358032} 11/07/2021 00:41:53 - INFO - __main__ - Step 24692: {'lr': 0.00047155663125448514, 'samples': 4740864, 'steps': 24691, 'loss/train': 1.716679334640503} 11/07/2021 00:41:53 - INFO - __main__ - Step 24693: {'lr': 0.00047155417284388846, 'samples': 4741056, 'steps': 24692, 'loss/train': 1.490833044052124} 11/07/2021 00:41:54 - INFO - __main__ - Step 24694: {'lr': 0.0004715517143334629, 'samples': 4741248, 'steps': 24693, 'loss/train': 0.8225095272064209} 11/07/2021 00:41:54 - INFO - __main__ - Step 24695: {'lr': 0.00047154925572320957, 'samples': 4741440, 'steps': 24694, 'loss/train': 1.3117071390151978} 11/07/2021 00:41:55 - INFO - __main__ - Step 24696: {'lr': 0.00047154679701312953, 'samples': 4741632, 'steps': 24695, 'loss/train': 1.989949107170105} 11/07/2021 00:41:55 - INFO - __main__ - Step 24697: {'lr': 0.00047154433820322395, 'samples': 4741824, 'steps': 24696, 'loss/train': 1.3134007453918457} 11/07/2021 00:41:56 - INFO - __main__ - Step 24698: {'lr': 0.0004715418792934939, 'samples': 4742016, 'steps': 24697, 'loss/train': 0.13016371428966522} 11/07/2021 00:41:56 - INFO - __main__ - Step 24699: {'lr': 0.00047153942028394056, 'samples': 4742208, 'steps': 24698, 'loss/train': 1.5684603452682495} 11/07/2021 00:41:57 - INFO - __main__ - Step 24700: {'lr': 0.0004715369611745649, 'samples': 4742400, 'steps': 24699, 'loss/train': 1.6034104824066162} 11/07/2021 00:41:57 - INFO - __main__ - Step 24701: {'lr': 0.00047153450196536816, 'samples': 4742592, 'steps': 24700, 'loss/train': 1.284940242767334} 11/07/2021 00:41:57 - INFO - __main__ - Step 24702: {'lr': 0.00047153204265635136, 'samples': 4742784, 'steps': 24701, 'loss/train': 1.3234397172927856} 11/07/2021 00:41:59 - INFO - __main__ - Step 24703: {'lr': 0.0004715295832475156, 'samples': 4742976, 'steps': 24702, 'loss/train': 1.516232967376709} 11/07/2021 00:41:59 - INFO - __main__ - Step 24704: {'lr': 0.0004715271237388621, 'samples': 4743168, 'steps': 24703, 'loss/train': 1.805282711982727} 11/07/2021 00:41:59 - INFO - __main__ - Step 24705: {'lr': 0.00047152466413039187, 'samples': 4743360, 'steps': 24704, 'loss/train': 2.048011541366577} 11/07/2021 00:42:00 - INFO - __main__ - Step 24706: {'lr': 0.000471522204422106, 'samples': 4743552, 'steps': 24705, 'loss/train': 1.764803409576416} 11/07/2021 00:42:00 - INFO - __main__ - Step 24707: {'lr': 0.0004715197446140057, 'samples': 4743744, 'steps': 24706, 'loss/train': 0.7026177644729614} 11/07/2021 00:42:01 - INFO - __main__ - Step 24708: {'lr': 0.000471517284706092, 'samples': 4743936, 'steps': 24707, 'loss/train': 1.6084614992141724} 11/07/2021 00:42:01 - INFO - __main__ - Step 24709: {'lr': 0.0004715148246983661, 'samples': 4744128, 'steps': 24708, 'loss/train': 1.4565328359603882} 11/07/2021 00:42:02 - INFO - __main__ - Step 24710: {'lr': 0.000471512364590829, 'samples': 4744320, 'steps': 24709, 'loss/train': 1.3672226667404175} 11/07/2021 00:42:02 - INFO - __main__ - Step 24711: {'lr': 0.0004715099043834818, 'samples': 4744512, 'steps': 24710, 'loss/train': 1.6812454462051392} 11/07/2021 00:42:02 - INFO - __main__ - Step 24712: {'lr': 0.00047150744407632565, 'samples': 4744704, 'steps': 24711, 'loss/train': 1.1659122705459595} 11/07/2021 00:42:03 - INFO - __main__ - Step 24713: {'lr': 0.00047150498366936165, 'samples': 4744896, 'steps': 24712, 'loss/train': 1.7485889196395874} 11/07/2021 00:42:04 - INFO - __main__ - Step 24714: {'lr': 0.000471502523162591, 'samples': 4745088, 'steps': 24713, 'loss/train': 1.2086975574493408} 11/07/2021 00:42:04 - INFO - __main__ - Step 24715: {'lr': 0.00047150006255601475, 'samples': 4745280, 'steps': 24714, 'loss/train': 1.3149487972259521} 11/07/2021 00:42:04 - INFO - __main__ - Step 24716: {'lr': 0.00047149760184963385, 'samples': 4745472, 'steps': 24715, 'loss/train': 1.8487763404846191} 11/07/2021 00:42:05 - INFO - __main__ - Step 24717: {'lr': 0.0004714951410434497, 'samples': 4745664, 'steps': 24716, 'loss/train': 1.807113528251648} 11/07/2021 00:42:06 - INFO - __main__ - Step 24718: {'lr': 0.00047149268013746317, 'samples': 4745856, 'steps': 24717, 'loss/train': 1.892370343208313} 11/07/2021 00:42:06 - INFO - __main__ - Step 24719: {'lr': 0.00047149021913167545, 'samples': 4746048, 'steps': 24718, 'loss/train': 1.6956562995910645} 11/07/2021 00:42:06 - INFO - __main__ - Step 24720: {'lr': 0.0004714877580260877, 'samples': 4746240, 'steps': 24719, 'loss/train': 1.3320220708847046} 11/07/2021 00:42:07 - INFO - __main__ - Step 24721: {'lr': 0.00047148529682070094, 'samples': 4746432, 'steps': 24720, 'loss/train': 0.9560302495956421} 11/07/2021 00:42:07 - INFO - __main__ - Step 24722: {'lr': 0.00047148283551551643, 'samples': 4746624, 'steps': 24721, 'loss/train': 1.4561172723770142} 11/07/2021 00:42:08 - INFO - __main__ - Step 24723: {'lr': 0.000471480374110535, 'samples': 4746816, 'steps': 24722, 'loss/train': 1.6286498308181763} 11/07/2021 00:42:09 - INFO - __main__ - Step 24724: {'lr': 0.00047147791260575804, 'samples': 4747008, 'steps': 24723, 'loss/train': 1.5712597370147705} 11/07/2021 00:42:09 - INFO - __main__ - Step 24725: {'lr': 0.0004714754510011866, 'samples': 4747200, 'steps': 24724, 'loss/train': 1.409942865371704} 11/07/2021 00:42:09 - INFO - __main__ - Step 24726: {'lr': 0.0004714729892968216, 'samples': 4747392, 'steps': 24725, 'loss/train': 1.7029197216033936} 11/07/2021 00:42:10 - INFO - __main__ - Step 24727: {'lr': 0.0004714705274926644, 'samples': 4747584, 'steps': 24726, 'loss/train': 1.8137012720108032} 11/07/2021 00:42:11 - INFO - __main__ - Step 24728: {'lr': 0.00047146806558871594, 'samples': 4747776, 'steps': 24727, 'loss/train': 1.5736908912658691} 11/07/2021 00:42:11 - INFO - __main__ - Step 24729: {'lr': 0.0004714656035849774, 'samples': 4747968, 'steps': 24728, 'loss/train': 1.6693710088729858} 11/07/2021 00:42:11 - INFO - __main__ - Step 24730: {'lr': 0.00047146314148144986, 'samples': 4748160, 'steps': 24729, 'loss/train': 2.116982936859131} 11/07/2021 00:42:12 - INFO - __main__ - Step 24731: {'lr': 0.00047146067927813454, 'samples': 4748352, 'steps': 24730, 'loss/train': 1.2648699283599854} 11/07/2021 00:42:12 - INFO - __main__ - Step 24732: {'lr': 0.00047145821697503235, 'samples': 4748544, 'steps': 24731, 'loss/train': 1.6949464082717896} 11/07/2021 00:42:12 - INFO - __main__ - Step 24733: {'lr': 0.00047145575457214453, 'samples': 4748736, 'steps': 24732, 'loss/train': 1.6670206785202026} 11/07/2021 00:42:13 - INFO - __main__ - Step 24734: {'lr': 0.00047145329206947216, 'samples': 4748928, 'steps': 24733, 'loss/train': 1.1943016052246094} 11/07/2021 00:42:14 - INFO - __main__ - Step 24735: {'lr': 0.0004714508294670164, 'samples': 4749120, 'steps': 24734, 'loss/train': 1.3442723751068115} 11/07/2021 00:42:14 - INFO - __main__ - Step 24736: {'lr': 0.00047144836676477823, 'samples': 4749312, 'steps': 24735, 'loss/train': 1.4667487144470215} 11/07/2021 00:42:14 - INFO - __main__ - Step 24737: {'lr': 0.00047144590396275895, 'samples': 4749504, 'steps': 24736, 'loss/train': 1.4250119924545288} 11/07/2021 00:42:15 - INFO - __main__ - Step 24738: {'lr': 0.0004714434410609595, 'samples': 4749696, 'steps': 24737, 'loss/train': 1.235796332359314} 11/07/2021 00:42:16 - INFO - __main__ - Step 24739: {'lr': 0.00047144097805938104, 'samples': 4749888, 'steps': 24738, 'loss/train': 1.1943650245666504} 11/07/2021 00:42:16 - INFO - __main__ - Step 24740: {'lr': 0.0004714385149580247, 'samples': 4750080, 'steps': 24739, 'loss/train': 1.6702942848205566} 11/07/2021 00:42:17 - INFO - __main__ - Step 24741: {'lr': 0.0004714360517568916, 'samples': 4750272, 'steps': 24740, 'loss/train': 1.8867982625961304} 11/07/2021 00:42:17 - INFO - __main__ - Step 24742: {'lr': 0.00047143358845598283, 'samples': 4750464, 'steps': 24741, 'loss/train': 1.4710402488708496} 11/07/2021 00:42:17 - INFO - __main__ - Step 24743: {'lr': 0.0004714311250552995, 'samples': 4750656, 'steps': 24742, 'loss/train': 1.4312278032302856} 11/07/2021 00:42:18 - INFO - __main__ - Step 24744: {'lr': 0.0004714286615548427, 'samples': 4750848, 'steps': 24743, 'loss/train': 1.7704944610595703} 11/07/2021 00:42:19 - INFO - __main__ - Step 24745: {'lr': 0.00047142619795461363, 'samples': 4751040, 'steps': 24744, 'loss/train': 1.7733451128005981} 11/07/2021 00:42:19 - INFO - __main__ - Step 24746: {'lr': 0.0004714237342546133, 'samples': 4751232, 'steps': 24745, 'loss/train': 1.678723692893982} 11/07/2021 00:42:19 - INFO - __main__ - Step 24747: {'lr': 0.0004714212704548428, 'samples': 4751424, 'steps': 24746, 'loss/train': 1.2840992212295532} 11/07/2021 00:42:20 - INFO - __main__ - Step 24748: {'lr': 0.0004714188065553033, 'samples': 4751616, 'steps': 24747, 'loss/train': 1.23397696018219} 11/07/2021 00:42:21 - INFO - __main__ - Step 24749: {'lr': 0.000471416342555996, 'samples': 4751808, 'steps': 24748, 'loss/train': 1.9683762788772583} 11/07/2021 00:42:21 - INFO - __main__ - Step 24750: {'lr': 0.00047141387845692174, 'samples': 4752000, 'steps': 24749, 'loss/train': 1.8132466077804565} 11/07/2021 00:42:21 - INFO - __main__ - Step 24751: {'lr': 0.0004714114142580819, 'samples': 4752192, 'steps': 24750, 'loss/train': 1.5070856809616089} 11/07/2021 00:42:22 - INFO - __main__ - Step 24752: {'lr': 0.00047140894995947755, 'samples': 4752384, 'steps': 24751, 'loss/train': 1.3774813413619995} 11/07/2021 00:42:22 - INFO - __main__ - Step 24753: {'lr': 0.00047140648556110966, 'samples': 4752576, 'steps': 24752, 'loss/train': 1.7825654745101929} 11/07/2021 00:42:23 - INFO - __main__ - Step 24754: {'lr': 0.00047140402106297946, 'samples': 4752768, 'steps': 24753, 'loss/train': 1.9710283279418945} 11/07/2021 00:42:24 - INFO - __main__ - Step 24755: {'lr': 0.000471401556465088, 'samples': 4752960, 'steps': 24754, 'loss/train': 1.391543984413147} 11/07/2021 00:42:24 - INFO - __main__ - Step 24756: {'lr': 0.00047139909176743643, 'samples': 4753152, 'steps': 24755, 'loss/train': 1.6651687622070312} 11/07/2021 00:42:24 - INFO - __main__ - Step 24757: {'lr': 0.0004713966269700259, 'samples': 4753344, 'steps': 24756, 'loss/train': 1.1757967472076416} 11/07/2021 00:42:25 - INFO - __main__ - Step 24758: {'lr': 0.0004713941620728574, 'samples': 4753536, 'steps': 24757, 'loss/train': 1.1249995231628418} 11/07/2021 00:42:25 - INFO - __main__ - Step 24759: {'lr': 0.0004713916970759321, 'samples': 4753728, 'steps': 24758, 'loss/train': 1.7377405166625977} 11/07/2021 00:42:26 - INFO - __main__ - Step 24760: {'lr': 0.0004713892319792512, 'samples': 4753920, 'steps': 24759, 'loss/train': 2.043972969055176} 11/07/2021 00:42:26 - INFO - __main__ - Step 24761: {'lr': 0.00047138676678281564, 'samples': 4754112, 'steps': 24760, 'loss/train': 1.4563531875610352} 11/07/2021 00:42:27 - INFO - __main__ - Step 24762: {'lr': 0.00047138430148662666, 'samples': 4754304, 'steps': 24761, 'loss/train': 1.994339942932129} 11/07/2021 00:42:27 - INFO - __main__ - Step 24763: {'lr': 0.0004713818360906853, 'samples': 4754496, 'steps': 24762, 'loss/train': 1.8650447130203247} 11/07/2021 00:42:27 - INFO - __main__ - Step 24764: {'lr': 0.0004713793705949927, 'samples': 4754688, 'steps': 24763, 'loss/train': 0.8770702481269836} 11/07/2021 00:42:29 - INFO - __main__ - Step 24765: {'lr': 0.00047137690499955, 'samples': 4754880, 'steps': 24764, 'loss/train': 1.3111677169799805} 11/07/2021 00:42:29 - INFO - __main__ - Step 24766: {'lr': 0.0004713744393043583, 'samples': 4755072, 'steps': 24765, 'loss/train': 1.0300610065460205} 11/07/2021 00:42:30 - INFO - __main__ - Step 24767: {'lr': 0.00047137197350941864, 'samples': 4755264, 'steps': 24766, 'loss/train': 1.0892693996429443} 11/07/2021 00:42:30 - INFO - __main__ - Step 24768: {'lr': 0.0004713695076147322, 'samples': 4755456, 'steps': 24767, 'loss/train': 0.11573341488838196} 11/07/2021 00:42:30 - INFO - __main__ - Step 24769: {'lr': 0.0004713670416203001, 'samples': 4755648, 'steps': 24768, 'loss/train': 1.62954580783844} 11/07/2021 00:42:31 - INFO - __main__ - Step 24770: {'lr': 0.00047136457552612344, 'samples': 4755840, 'steps': 24769, 'loss/train': 0.083526112139225} 11/07/2021 00:42:32 - INFO - __main__ - Step 24771: {'lr': 0.00047136210933220325, 'samples': 4756032, 'steps': 24770, 'loss/train': 1.3785045146942139} 11/07/2021 00:42:32 - INFO - __main__ - Step 24772: {'lr': 0.0004713596430385408, 'samples': 4756224, 'steps': 24771, 'loss/train': 2.175171375274658} 11/07/2021 00:42:32 - INFO - __main__ - Step 24773: {'lr': 0.00047135717664513704, 'samples': 4756416, 'steps': 24772, 'loss/train': 1.426689863204956} 11/07/2021 00:42:33 - INFO - __main__ - Step 24774: {'lr': 0.00047135471015199315, 'samples': 4756608, 'steps': 24773, 'loss/train': 1.5172810554504395} 11/07/2021 00:42:33 - INFO - __main__ - Step 24775: {'lr': 0.00047135224355911035, 'samples': 4756800, 'steps': 24774, 'loss/train': 1.5251126289367676} 11/07/2021 00:42:34 - INFO - __main__ - Step 24776: {'lr': 0.0004713497768664895, 'samples': 4756992, 'steps': 24775, 'loss/train': 1.8335155248641968} 11/07/2021 00:42:34 - INFO - __main__ - Step 24777: {'lr': 0.00047134731007413195, 'samples': 4757184, 'steps': 24776, 'loss/train': 1.5030866861343384} 11/07/2021 00:42:35 - INFO - __main__ - Step 24778: {'lr': 0.0004713448431820387, 'samples': 4757376, 'steps': 24777, 'loss/train': 1.5333009958267212} 11/07/2021 00:42:35 - INFO - __main__ - Step 24779: {'lr': 0.00047134237619021085, 'samples': 4757568, 'steps': 24778, 'loss/train': 1.6005456447601318} 11/07/2021 00:42:36 - INFO - __main__ - Step 24780: {'lr': 0.00047133990909864953, 'samples': 4757760, 'steps': 24779, 'loss/train': 1.6143572330474854} 11/07/2021 00:42:36 - INFO - __main__ - Step 24781: {'lr': 0.0004713374419073559, 'samples': 4757952, 'steps': 24780, 'loss/train': 1.168927550315857} 11/07/2021 00:42:37 - INFO - __main__ - Step 24782: {'lr': 0.000471334974616331, 'samples': 4758144, 'steps': 24781, 'loss/train': 1.3281073570251465} 11/07/2021 00:42:37 - INFO - __main__ - Step 24783: {'lr': 0.0004713325072255761, 'samples': 4758336, 'steps': 24782, 'loss/train': 1.2812906503677368} 11/07/2021 00:42:38 - INFO - __main__ - Step 24784: {'lr': 0.000471330039735092, 'samples': 4758528, 'steps': 24783, 'loss/train': 1.1857270002365112} 11/07/2021 00:42:38 - INFO - __main__ - Step 24785: {'lr': 0.0004713275721448801, 'samples': 4758720, 'steps': 24784, 'loss/train': 1.8095072507858276} 11/07/2021 00:42:39 - INFO - __main__ - Step 24786: {'lr': 0.0004713251044549414, 'samples': 4758912, 'steps': 24785, 'loss/train': 1.5033193826675415} 11/07/2021 00:42:39 - INFO - __main__ - Step 24787: {'lr': 0.000471322636665277, 'samples': 4759104, 'steps': 24786, 'loss/train': 1.3253549337387085} 11/07/2021 00:42:40 - INFO - __main__ - Step 24788: {'lr': 0.0004713201687758881, 'samples': 4759296, 'steps': 24787, 'loss/train': 1.9118258953094482} 11/07/2021 00:42:40 - INFO - __main__ - Step 24789: {'lr': 0.00047131770078677574, 'samples': 4759488, 'steps': 24788, 'loss/train': 1.806757926940918} 11/07/2021 00:42:40 - INFO - __main__ - Step 24790: {'lr': 0.000471315232697941, 'samples': 4759680, 'steps': 24789, 'loss/train': 1.7934147119522095} 11/07/2021 00:42:41 - INFO - __main__ - Step 24791: {'lr': 0.000471312764509385, 'samples': 4759872, 'steps': 24790, 'loss/train': 1.5426182746887207} 11/07/2021 00:42:42 - INFO - __main__ - Step 24792: {'lr': 0.0004713102962211089, 'samples': 4760064, 'steps': 24791, 'loss/train': 1.2751450538635254} 11/07/2021 00:42:42 - INFO - __main__ - Step 24793: {'lr': 0.0004713078278331138, 'samples': 4760256, 'steps': 24792, 'loss/train': 1.8063607215881348} 11/07/2021 00:42:42 - INFO - __main__ - Step 24794: {'lr': 0.00047130535934540086, 'samples': 4760448, 'steps': 24793, 'loss/train': 1.7444671392440796} 11/07/2021 00:42:43 - INFO - __main__ - Step 24795: {'lr': 0.00047130289075797107, 'samples': 4760640, 'steps': 24794, 'loss/train': 1.5532677173614502} 11/07/2021 00:42:44 - INFO - __main__ - Step 24796: {'lr': 0.0004713004220708257, 'samples': 4760832, 'steps': 24795, 'loss/train': 1.6229361295700073} 11/07/2021 00:42:44 - INFO - __main__ - Step 24797: {'lr': 0.0004712979532839656, 'samples': 4761024, 'steps': 24796, 'loss/train': 1.5406346321105957} 11/07/2021 00:42:44 - INFO - __main__ - Step 24798: {'lr': 0.00047129548439739225, 'samples': 4761216, 'steps': 24797, 'loss/train': 1.52739417552948} 11/07/2021 00:42:45 - INFO - __main__ - Step 24799: {'lr': 0.0004712930154111065, 'samples': 4761408, 'steps': 24798, 'loss/train': 1.6242315769195557} 11/07/2021 00:42:45 - INFO - __main__ - Step 24800: {'lr': 0.00047129054632510947, 'samples': 4761600, 'steps': 24799, 'loss/train': 0.9558914303779602} 11/07/2021 00:42:46 - INFO - __main__ - Step 24801: {'lr': 0.00047128807713940244, 'samples': 4761792, 'steps': 24800, 'loss/train': 1.9542592763900757} 11/07/2021 00:42:46 - INFO - __main__ - Step 24802: {'lr': 0.00047128560785398633, 'samples': 4761984, 'steps': 24801, 'loss/train': 2.2053301334381104} 11/07/2021 00:42:47 - INFO - __main__ - Step 24803: {'lr': 0.0004712831384688624, 'samples': 4762176, 'steps': 24802, 'loss/train': 1.2068068981170654} 11/07/2021 00:42:47 - INFO - __main__ - Step 24804: {'lr': 0.00047128066898403166, 'samples': 4762368, 'steps': 24803, 'loss/train': 1.2261611223220825} 11/07/2021 00:42:47 - INFO - __main__ - Step 24805: {'lr': 0.00047127819939949534, 'samples': 4762560, 'steps': 24804, 'loss/train': 1.5846295356750488} 11/07/2021 00:42:49 - INFO - __main__ - Step 24806: {'lr': 0.00047127572971525437, 'samples': 4762752, 'steps': 24805, 'loss/train': 1.7221107482910156} 11/07/2021 00:42:49 - INFO - __main__ - Step 24807: {'lr': 0.00047127325993131006, 'samples': 4762944, 'steps': 24806, 'loss/train': 1.6136387586593628} 11/07/2021 00:42:49 - INFO - __main__ - Step 24808: {'lr': 0.0004712707900476634, 'samples': 4763136, 'steps': 24807, 'loss/train': 1.6903235912322998} 11/07/2021 00:42:50 - INFO - __main__ - Step 24809: {'lr': 0.00047126832006431555, 'samples': 4763328, 'steps': 24808, 'loss/train': 1.5415527820587158} 11/07/2021 00:42:50 - INFO - __main__ - Step 24810: {'lr': 0.00047126584998126756, 'samples': 4763520, 'steps': 24809, 'loss/train': 1.7238624095916748} 11/07/2021 00:42:51 - INFO - __main__ - Step 24811: {'lr': 0.0004712633797985206, 'samples': 4763712, 'steps': 24810, 'loss/train': 2.1266679763793945} 11/07/2021 00:42:51 - INFO - __main__ - Step 24812: {'lr': 0.0004712609095160758, 'samples': 4763904, 'steps': 24811, 'loss/train': 1.6983754634857178} 11/07/2021 00:42:52 - INFO - __main__ - Step 24813: {'lr': 0.0004712584391339343, 'samples': 4764096, 'steps': 24812, 'loss/train': 1.1013861894607544} 11/07/2021 00:42:52 - INFO - __main__ - Step 24814: {'lr': 0.0004712559686520971, 'samples': 4764288, 'steps': 24813, 'loss/train': 0.8070735335350037} 11/07/2021 00:42:52 - INFO - __main__ - Step 24815: {'lr': 0.0004712534980705654, 'samples': 4764480, 'steps': 24814, 'loss/train': 1.4523907899856567} 11/07/2021 00:42:53 - INFO - __main__ - Step 24816: {'lr': 0.0004712510273893402, 'samples': 4764672, 'steps': 24815, 'loss/train': 2.0529558658599854} 11/07/2021 00:42:54 - INFO - __main__ - Step 24817: {'lr': 0.00047124855660842283, 'samples': 4764864, 'steps': 24816, 'loss/train': 1.5841693878173828} 11/07/2021 00:42:54 - INFO - __main__ - Step 24818: {'lr': 0.00047124608572781426, 'samples': 4765056, 'steps': 24817, 'loss/train': 0.8408837914466858} 11/07/2021 00:42:54 - INFO - __main__ - Step 24819: {'lr': 0.0004712436147475155, 'samples': 4765248, 'steps': 24818, 'loss/train': 1.3041859865188599} 11/07/2021 00:42:55 - INFO - __main__ - Step 24820: {'lr': 0.0004712411436675279, 'samples': 4765440, 'steps': 24819, 'loss/train': 2.031338691711426} 11/07/2021 00:42:56 - INFO - __main__ - Step 24821: {'lr': 0.0004712386724878524, 'samples': 4765632, 'steps': 24820, 'loss/train': 1.229081392288208} 11/07/2021 00:42:56 - INFO - __main__ - Step 24822: {'lr': 0.0004712362012084902, 'samples': 4765824, 'steps': 24821, 'loss/train': 1.6977263689041138} 11/07/2021 00:42:57 - INFO - __main__ - Step 24823: {'lr': 0.00047123372982944237, 'samples': 4766016, 'steps': 24822, 'loss/train': 1.9030191898345947} 11/07/2021 00:42:57 - INFO - __main__ - Step 24824: {'lr': 0.00047123125835071004, 'samples': 4766208, 'steps': 24823, 'loss/train': 1.301546573638916} 11/07/2021 00:42:57 - INFO - __main__ - Step 24825: {'lr': 0.00047122878677229426, 'samples': 4766400, 'steps': 24824, 'loss/train': 2.0315120220184326} 11/07/2021 00:42:58 - INFO - __main__ - Step 24826: {'lr': 0.0004712263150941962, 'samples': 4766592, 'steps': 24825, 'loss/train': 1.5814260244369507} 11/07/2021 00:42:59 - INFO - __main__ - Step 24827: {'lr': 0.0004712238433164171, 'samples': 4766784, 'steps': 24826, 'loss/train': 1.3355937004089355} 11/07/2021 00:42:59 - INFO - __main__ - Step 24828: {'lr': 0.00047122137143895785, 'samples': 4766976, 'steps': 24827, 'loss/train': 1.3185474872589111} 11/07/2021 00:42:59 - INFO - __main__ - Step 24829: {'lr': 0.0004712188994618197, 'samples': 4767168, 'steps': 24828, 'loss/train': 1.5112864971160889} 11/07/2021 00:43:00 - INFO - __main__ - Step 24830: {'lr': 0.0004712164273850037, 'samples': 4767360, 'steps': 24829, 'loss/train': 1.3218666315078735} 11/07/2021 00:43:01 - INFO - __main__ - Step 24831: {'lr': 0.00047121395520851103, 'samples': 4767552, 'steps': 24830, 'loss/train': 0.8458890914916992} 11/07/2021 00:43:01 - INFO - __main__ - Step 24832: {'lr': 0.00047121148293234274, 'samples': 4767744, 'steps': 24831, 'loss/train': 1.7730728387832642} 11/07/2021 00:43:02 - INFO - __main__ - Step 24833: {'lr': 0.00047120901055649995, 'samples': 4767936, 'steps': 24832, 'loss/train': 1.6675069332122803} 11/07/2021 00:43:02 - INFO - __main__ - Step 24834: {'lr': 0.0004712065380809838, 'samples': 4768128, 'steps': 24833, 'loss/train': 1.794461965560913} 11/07/2021 00:43:02 - INFO - __main__ - Step 24835: {'lr': 0.0004712040655057954, 'samples': 4768320, 'steps': 24834, 'loss/train': 1.578720211982727} 11/07/2021 00:43:03 - INFO - __main__ - Step 24836: {'lr': 0.0004712015928309359, 'samples': 4768512, 'steps': 24835, 'loss/train': 1.6604273319244385} 11/07/2021 00:43:04 - INFO - __main__ - Step 24837: {'lr': 0.0004711991200564064, 'samples': 4768704, 'steps': 24836, 'loss/train': 1.5459864139556885} 11/07/2021 00:43:04 - INFO - __main__ - Step 24838: {'lr': 0.0004711966471822079, 'samples': 4768896, 'steps': 24837, 'loss/train': 1.748180627822876} 11/07/2021 00:43:04 - INFO - __main__ - Step 24839: {'lr': 0.00047119417420834163, 'samples': 4769088, 'steps': 24838, 'loss/train': 1.8245811462402344} 11/07/2021 00:43:05 - INFO - __main__ - Step 24840: {'lr': 0.00047119170113480867, 'samples': 4769280, 'steps': 24839, 'loss/train': 1.0669902563095093} 11/07/2021 00:43:05 - INFO - __main__ - Step 24841: {'lr': 0.00047118922796161026, 'samples': 4769472, 'steps': 24840, 'loss/train': 0.9569848775863647} 11/07/2021 00:43:06 - INFO - __main__ - Step 24842: {'lr': 0.00047118675468874727, 'samples': 4769664, 'steps': 24841, 'loss/train': 1.7010862827301025} 11/07/2021 00:43:06 - INFO - __main__ - Step 24843: {'lr': 0.00047118428131622095, 'samples': 4769856, 'steps': 24842, 'loss/train': 1.2079625129699707} 11/07/2021 00:43:07 - INFO - __main__ - Step 24844: {'lr': 0.00047118180784403243, 'samples': 4770048, 'steps': 24843, 'loss/train': 1.1464524269104004} 11/07/2021 00:43:07 - INFO - __main__ - Step 24845: {'lr': 0.0004711793342721828, 'samples': 4770240, 'steps': 24844, 'loss/train': 1.9089672565460205} 11/07/2021 00:43:07 - INFO - __main__ - Step 24846: {'lr': 0.00047117686060067315, 'samples': 4770432, 'steps': 24845, 'loss/train': 1.6078968048095703} 11/07/2021 00:43:08 - INFO - __main__ - Step 24847: {'lr': 0.00047117438682950467, 'samples': 4770624, 'steps': 24846, 'loss/train': 1.0903383493423462} 11/07/2021 00:43:09 - INFO - __main__ - Step 24848: {'lr': 0.0004711719129586784, 'samples': 4770816, 'steps': 24847, 'loss/train': 1.6833618879318237} 11/07/2021 00:43:09 - INFO - __main__ - Step 24849: {'lr': 0.0004711694389881955, 'samples': 4771008, 'steps': 24848, 'loss/train': 1.607546329498291} 11/07/2021 00:43:09 - INFO - __main__ - Step 24850: {'lr': 0.000471166964918057, 'samples': 4771200, 'steps': 24849, 'loss/train': 1.469663381576538} 11/07/2021 00:43:10 - INFO - __main__ - Step 24851: {'lr': 0.0004711644907482641, 'samples': 4771392, 'steps': 24850, 'loss/train': 1.3836008310317993} 11/07/2021 00:43:11 - INFO - __main__ - Step 24852: {'lr': 0.00047116201647881794, 'samples': 4771584, 'steps': 24851, 'loss/train': 1.6112840175628662} 11/07/2021 00:43:11 - INFO - __main__ - Step 24853: {'lr': 0.00047115954210971955, 'samples': 4771776, 'steps': 24852, 'loss/train': 1.1933653354644775} 11/07/2021 00:43:12 - INFO - __main__ - Step 24854: {'lr': 0.0004711570676409701, 'samples': 4771968, 'steps': 24853, 'loss/train': 0.9327288866043091} 11/07/2021 00:43:12 - INFO - __main__ - Step 24855: {'lr': 0.0004711545930725707, 'samples': 4772160, 'steps': 24854, 'loss/train': 1.3270524740219116} 11/07/2021 00:43:12 - INFO - __main__ - Step 24856: {'lr': 0.0004711521184045224, 'samples': 4772352, 'steps': 24855, 'loss/train': 1.958956241607666} 11/07/2021 00:43:13 - INFO - __main__ - Step 24857: {'lr': 0.0004711496436368264, 'samples': 4772544, 'steps': 24856, 'loss/train': 1.4110667705535889} 11/07/2021 00:43:14 - INFO - __main__ - Step 24858: {'lr': 0.00047114716876948384, 'samples': 4772736, 'steps': 24857, 'loss/train': 2.1115291118621826} 11/07/2021 00:43:14 - INFO - __main__ - Step 24859: {'lr': 0.0004711446938024957, 'samples': 4772928, 'steps': 24858, 'loss/train': 1.4857505559921265} 11/07/2021 00:43:14 - INFO - __main__ - Step 24860: {'lr': 0.00047114221873586316, 'samples': 4773120, 'steps': 24859, 'loss/train': 1.7145774364471436} 11/07/2021 00:43:15 - INFO - __main__ - Step 24861: {'lr': 0.00047113974356958744, 'samples': 4773312, 'steps': 24860, 'loss/train': 1.6529386043548584} 11/07/2021 00:43:16 - INFO - __main__ - Step 24862: {'lr': 0.0004711372683036695, 'samples': 4773504, 'steps': 24861, 'loss/train': 1.7494276762008667} 11/07/2021 00:43:16 - INFO - __main__ - Step 24863: {'lr': 0.0004711347929381105, 'samples': 4773696, 'steps': 24862, 'loss/train': 1.3992499113082886} 11/07/2021 00:43:16 - INFO - __main__ - Step 24864: {'lr': 0.00047113231747291165, 'samples': 4773888, 'steps': 24863, 'loss/train': 1.5058045387268066} 11/07/2021 00:43:17 - INFO - __main__ - Step 24865: {'lr': 0.0004711298419080739, 'samples': 4774080, 'steps': 24864, 'loss/train': 1.6152267456054688} 11/07/2021 00:43:17 - INFO - __main__ - Step 24866: {'lr': 0.00047112736624359855, 'samples': 4774272, 'steps': 24865, 'loss/train': 1.489564061164856} 11/07/2021 00:43:18 - INFO - __main__ - Step 24867: {'lr': 0.00047112489047948655, 'samples': 4774464, 'steps': 24866, 'loss/train': 1.7998241186141968} 11/07/2021 00:43:18 - INFO - __main__ - Step 24868: {'lr': 0.00047112241461573913, 'samples': 4774656, 'steps': 24867, 'loss/train': 1.6361345052719116} 11/07/2021 00:43:19 - INFO - __main__ - Step 24869: {'lr': 0.0004711199386523573, 'samples': 4774848, 'steps': 24868, 'loss/train': 1.5118496417999268} 11/07/2021 00:43:19 - INFO - __main__ - Step 24870: {'lr': 0.0004711174625893423, 'samples': 4775040, 'steps': 24869, 'loss/train': 1.597435474395752} 11/07/2021 00:43:20 - INFO - __main__ - Step 24871: {'lr': 0.00047111498642669517, 'samples': 4775232, 'steps': 24870, 'loss/train': 2.201490640640259} 11/07/2021 00:43:20 - INFO - __main__ - Step 24872: {'lr': 0.00047111251016441704, 'samples': 4775424, 'steps': 24871, 'loss/train': 1.4889650344848633} 11/07/2021 00:43:21 - INFO - __main__ - Step 24873: {'lr': 0.0004711100338025089, 'samples': 4775616, 'steps': 24872, 'loss/train': 1.5616157054901123} 11/07/2021 00:43:21 - INFO - __main__ - Step 24874: {'lr': 0.00047110755734097216, 'samples': 4775808, 'steps': 24873, 'loss/train': 1.6636919975280762} 11/07/2021 00:43:22 - INFO - __main__ - Step 24875: {'lr': 0.00047110508077980774, 'samples': 4776000, 'steps': 24874, 'loss/train': 1.9187812805175781} 11/07/2021 00:43:22 - INFO - __main__ - Step 24876: {'lr': 0.00047110260411901674, 'samples': 4776192, 'steps': 24875, 'loss/train': 1.4029637575149536} 11/07/2021 00:43:22 - INFO - __main__ - Step 24877: {'lr': 0.0004711001273586003, 'samples': 4776384, 'steps': 24876, 'loss/train': 1.7008123397827148} 11/07/2021 00:43:23 - INFO - __main__ - Step 24878: {'lr': 0.0004710976504985596, 'samples': 4776576, 'steps': 24877, 'loss/train': 1.076745629310608} 11/07/2021 00:43:24 - INFO - __main__ - Step 24879: {'lr': 0.00047109517353889575, 'samples': 4776768, 'steps': 24878, 'loss/train': 1.3435444831848145} 11/07/2021 00:43:24 - INFO - __main__ - Step 24880: {'lr': 0.0004710926964796097, 'samples': 4776960, 'steps': 24879, 'loss/train': 1.8936501741409302} 11/07/2021 00:43:25 - INFO - __main__ - Step 24881: {'lr': 0.00047109021932070284, 'samples': 4777152, 'steps': 24880, 'loss/train': 1.713632345199585} 11/07/2021 00:43:25 - INFO - __main__ - Step 24882: {'lr': 0.00047108774206217605, 'samples': 4777344, 'steps': 24881, 'loss/train': 1.4235851764678955} 11/07/2021 00:43:26 - INFO - __main__ - Step 24883: {'lr': 0.00047108526470403055, 'samples': 4777536, 'steps': 24882, 'loss/train': 1.850601077079773} 11/07/2021 00:43:26 - INFO - __main__ - Step 24884: {'lr': 0.0004710827872462674, 'samples': 4777728, 'steps': 24883, 'loss/train': 1.433193564414978} 11/07/2021 00:43:27 - INFO - __main__ - Step 24885: {'lr': 0.00047108030968888784, 'samples': 4777920, 'steps': 24884, 'loss/train': 1.4146473407745361} 11/07/2021 00:43:27 - INFO - __main__ - Step 24886: {'lr': 0.00047107783203189285, 'samples': 4778112, 'steps': 24885, 'loss/train': 1.2156516313552856} 11/07/2021 00:43:27 - INFO - __main__ - Step 24887: {'lr': 0.0004710753542752836, 'samples': 4778304, 'steps': 24886, 'loss/train': 1.2330988645553589} 11/07/2021 00:43:29 - INFO - __main__ - Step 24888: {'lr': 0.0004710728764190612, 'samples': 4778496, 'steps': 24887, 'loss/train': 1.2248584032058716} 11/07/2021 00:43:29 - INFO - __main__ - Step 24889: {'lr': 0.0004710703984632268, 'samples': 4778688, 'steps': 24888, 'loss/train': 1.356774926185608} 11/07/2021 00:43:29 - INFO - __main__ - Step 24890: {'lr': 0.0004710679204077815, 'samples': 4778880, 'steps': 24889, 'loss/train': 1.610066294670105} 11/07/2021 00:43:30 - INFO - __main__ - Step 24891: {'lr': 0.0004710654422527264, 'samples': 4779072, 'steps': 24890, 'loss/train': 1.2098134756088257} 11/07/2021 00:43:30 - INFO - __main__ - Step 24892: {'lr': 0.0004710629639980626, 'samples': 4779264, 'steps': 24891, 'loss/train': 1.6932626962661743} 11/07/2021 00:43:30 - INFO - __main__ - Step 24893: {'lr': 0.0004710604856437912, 'samples': 4779456, 'steps': 24892, 'loss/train': 1.8135350942611694} 11/07/2021 00:43:31 - INFO - __main__ - Step 24894: {'lr': 0.00047105800718991343, 'samples': 4779648, 'steps': 24893, 'loss/train': 2.8847551345825195} 11/07/2021 00:43:32 - INFO - __main__ - Step 24895: {'lr': 0.0004710555286364303, 'samples': 4779840, 'steps': 24894, 'loss/train': 2.1138551235198975} 11/07/2021 00:43:32 - INFO - __main__ - Step 24896: {'lr': 0.000471053049983343, 'samples': 4780032, 'steps': 24895, 'loss/train': 0.3590337634086609} 11/07/2021 00:43:32 - INFO - __main__ - Step 24897: {'lr': 0.0004710505712306526, 'samples': 4780224, 'steps': 24896, 'loss/train': 1.4603281021118164} 11/07/2021 00:43:33 - INFO - __main__ - Step 24898: {'lr': 0.00047104809237836023, 'samples': 4780416, 'steps': 24897, 'loss/train': 1.2135181427001953} 11/07/2021 00:43:34 - INFO - __main__ - Step 24899: {'lr': 0.0004710456134264669, 'samples': 4780608, 'steps': 24898, 'loss/train': 1.645835518836975} 11/07/2021 00:43:34 - INFO - __main__ - Step 24900: {'lr': 0.0004710431343749739, 'samples': 4780800, 'steps': 24899, 'loss/train': 1.6387699842453003} 11/07/2021 00:43:35 - INFO - __main__ - Step 24901: {'lr': 0.0004710406552238823, 'samples': 4780992, 'steps': 24900, 'loss/train': 1.8654963970184326} 11/07/2021 00:43:35 - INFO - __main__ - Step 24902: {'lr': 0.0004710381759731932, 'samples': 4781184, 'steps': 24901, 'loss/train': 1.5616538524627686} 11/07/2021 00:43:35 - INFO - __main__ - Step 24903: {'lr': 0.0004710356966229077, 'samples': 4781376, 'steps': 24902, 'loss/train': 0.9458338022232056} 11/07/2021 00:43:36 - INFO - __main__ - Step 24904: {'lr': 0.00047103321717302684, 'samples': 4781568, 'steps': 24903, 'loss/train': 1.0792574882507324} 11/07/2021 00:43:37 - INFO - __main__ - Step 24905: {'lr': 0.00047103073762355186, 'samples': 4781760, 'steps': 24904, 'loss/train': 1.4467488527297974} 11/07/2021 00:43:37 - INFO - __main__ - Step 24906: {'lr': 0.0004710282579744839, 'samples': 4781952, 'steps': 24905, 'loss/train': 2.028386116027832} 11/07/2021 00:43:38 - INFO - __main__ - Step 24907: {'lr': 0.000471025778225824, 'samples': 4782144, 'steps': 24906, 'loss/train': 1.469319462776184} 11/07/2021 00:43:38 - INFO - __main__ - Step 24908: {'lr': 0.0004710232983775733, 'samples': 4782336, 'steps': 24907, 'loss/train': 1.626294493675232} 11/07/2021 00:43:39 - INFO - __main__ - Step 24909: {'lr': 0.0004710208184297329, 'samples': 4782528, 'steps': 24908, 'loss/train': 0.9432609677314758} 11/07/2021 00:43:39 - INFO - __main__ - Step 24910: {'lr': 0.0004710183383823039, 'samples': 4782720, 'steps': 24909, 'loss/train': 1.1606712341308594} 11/07/2021 00:43:40 - INFO - __main__ - Step 24911: {'lr': 0.00047101585823528745, 'samples': 4782912, 'steps': 24910, 'loss/train': 1.397504210472107} 11/07/2021 00:43:40 - INFO - __main__ - Step 24912: {'lr': 0.0004710133779886847, 'samples': 4783104, 'steps': 24911, 'loss/train': 1.5968561172485352} 11/07/2021 00:43:40 - INFO - __main__ - Step 24913: {'lr': 0.00047101089764249674, 'samples': 4783296, 'steps': 24912, 'loss/train': 1.6054977178573608} 11/07/2021 00:43:42 - INFO - __main__ - Step 24914: {'lr': 0.0004710084171967246, 'samples': 4783488, 'steps': 24913, 'loss/train': 1.6846996545791626} 11/07/2021 00:43:42 - INFO - __main__ - Step 24915: {'lr': 0.00047100593665136946, 'samples': 4783680, 'steps': 24914, 'loss/train': 1.5156437158584595} 11/07/2021 00:43:42 - INFO - __main__ - Step 24916: {'lr': 0.0004710034560064326, 'samples': 4783872, 'steps': 24915, 'loss/train': 1.3427435159683228} 11/07/2021 00:43:43 - INFO - __main__ - Step 24917: {'lr': 0.00047100097526191486, 'samples': 4784064, 'steps': 24916, 'loss/train': 0.7655185461044312} 11/07/2021 00:43:43 - INFO - __main__ - Step 24918: {'lr': 0.0004709984944178176, 'samples': 4784256, 'steps': 24917, 'loss/train': 1.5302414894104004} 11/07/2021 00:43:43 - INFO - __main__ - Step 24919: {'lr': 0.0004709960134741418, 'samples': 4784448, 'steps': 24918, 'loss/train': 1.8170706033706665} 11/07/2021 00:43:44 - INFO - __main__ - Step 24920: {'lr': 0.00047099353243088856, 'samples': 4784640, 'steps': 24919, 'loss/train': 1.6895228624343872} 11/07/2021 00:43:45 - INFO - __main__ - Step 24921: {'lr': 0.00047099105128805906, 'samples': 4784832, 'steps': 24920, 'loss/train': 1.7223882675170898} 11/07/2021 00:43:45 - INFO - __main__ - Step 24922: {'lr': 0.00047098857004565444, 'samples': 4785024, 'steps': 24921, 'loss/train': 2.0453617572784424} 11/07/2021 00:43:45 - INFO - __main__ - Step 24923: {'lr': 0.00047098608870367576, 'samples': 4785216, 'steps': 24922, 'loss/train': 1.4374672174453735} 11/07/2021 00:43:46 - INFO - __main__ - Step 24924: {'lr': 0.00047098360726212406, 'samples': 4785408, 'steps': 24923, 'loss/train': 1.454687237739563} 11/07/2021 00:43:47 - INFO - __main__ - Step 24925: {'lr': 0.0004709811257210007, 'samples': 4785600, 'steps': 24924, 'loss/train': 1.4822044372558594} 11/07/2021 00:43:47 - INFO - __main__ - Step 24926: {'lr': 0.0004709786440803066, 'samples': 4785792, 'steps': 24925, 'loss/train': 2.3867578506469727} 11/07/2021 00:43:48 - INFO - __main__ - Step 24927: {'lr': 0.00047097616234004295, 'samples': 4785984, 'steps': 24926, 'loss/train': 4.5228352546691895} 11/07/2021 00:43:48 - INFO - __main__ - Step 24928: {'lr': 0.00047097368050021083, 'samples': 4786176, 'steps': 24927, 'loss/train': 2.299553394317627} 11/07/2021 00:43:48 - INFO - __main__ - Step 24929: {'lr': 0.0004709711985608114, 'samples': 4786368, 'steps': 24928, 'loss/train': 1.8573694229125977} 11/07/2021 00:43:49 - INFO - __main__ - Step 24930: {'lr': 0.0004709687165218457, 'samples': 4786560, 'steps': 24929, 'loss/train': 5.637241363525391} 11/07/2021 00:43:50 - INFO - __main__ - Step 24931: {'lr': 0.00047096623438331497, 'samples': 4786752, 'steps': 24930, 'loss/train': 0.7219381332397461} 11/07/2021 00:43:50 - INFO - __main__ - Step 24932: {'lr': 0.00047096375214522026, 'samples': 4786944, 'steps': 24931, 'loss/train': 1.7461594343185425} 11/07/2021 00:43:50 - INFO - __main__ - Step 24933: {'lr': 0.0004709612698075627, 'samples': 4787136, 'steps': 24932, 'loss/train': 2.014444351196289} 11/07/2021 00:43:51 - INFO - __main__ - Step 24934: {'lr': 0.00047095878737034335, 'samples': 4787328, 'steps': 24933, 'loss/train': 1.3692693710327148} 11/07/2021 00:43:51 - INFO - __main__ - Step 24935: {'lr': 0.00047095630483356336, 'samples': 4787520, 'steps': 24934, 'loss/train': 1.5486135482788086} 11/07/2021 00:43:52 - INFO - __main__ - Step 24936: {'lr': 0.00047095382219722396, 'samples': 4787712, 'steps': 24935, 'loss/train': 1.3866608142852783} 11/07/2021 00:43:52 - INFO - __main__ - Step 24937: {'lr': 0.0004709513394613261, 'samples': 4787904, 'steps': 24936, 'loss/train': 1.516735315322876} 11/07/2021 00:43:53 - INFO - __main__ - Step 24938: {'lr': 0.00047094885662587104, 'samples': 4788096, 'steps': 24937, 'loss/train': 1.4386154413223267} 11/07/2021 00:43:53 - INFO - __main__ - Step 24939: {'lr': 0.0004709463736908598, 'samples': 4788288, 'steps': 24938, 'loss/train': 2.102384090423584} 11/07/2021 00:43:53 - INFO - __main__ - Step 24940: {'lr': 0.0004709438906562935, 'samples': 4788480, 'steps': 24939, 'loss/train': 1.9339085817337036} 11/07/2021 00:43:55 - INFO - __main__ - Step 24941: {'lr': 0.0004709414075221734, 'samples': 4788672, 'steps': 24940, 'loss/train': 1.0070801973342896} 11/07/2021 00:43:55 - INFO - __main__ - Step 24942: {'lr': 0.0004709389242885004, 'samples': 4788864, 'steps': 24941, 'loss/train': 1.6432303190231323} 11/07/2021 00:43:55 - INFO - __main__ - Step 24943: {'lr': 0.00047093644095527574, 'samples': 4789056, 'steps': 24942, 'loss/train': 1.5137380361557007} 11/07/2021 00:43:56 - INFO - __main__ - Step 24944: {'lr': 0.00047093395752250056, 'samples': 4789248, 'steps': 24943, 'loss/train': 1.3559476137161255} 11/07/2021 00:43:56 - INFO - __main__ - Step 24945: {'lr': 0.000470931473990176, 'samples': 4789440, 'steps': 24944, 'loss/train': 0.49037522077560425} 11/07/2021 00:43:58 - INFO - __main__ - Step 24946: {'lr': 0.00047092899035830303, 'samples': 4789632, 'steps': 24945, 'loss/train': 1.1880199909210205} 11/07/2021 00:43:58 - INFO - __main__ - Step 24947: {'lr': 0.00047092650662688295, 'samples': 4789824, 'steps': 24946, 'loss/train': 2.1021833419799805} 11/07/2021 00:43:58 - INFO - __main__ - Step 24948: {'lr': 0.00047092402279591674, 'samples': 4790016, 'steps': 24947, 'loss/train': 1.6405055522918701} 11/07/2021 00:43:59 - INFO - __main__ - Step 24949: {'lr': 0.00047092153886540554, 'samples': 4790208, 'steps': 24948, 'loss/train': 0.7883797883987427} 11/07/2021 00:43:59 - INFO - __main__ - Step 24950: {'lr': 0.0004709190548353506, 'samples': 4790400, 'steps': 24949, 'loss/train': 1.515425205230713} 11/07/2021 00:43:59 - INFO - __main__ - Step 24951: {'lr': 0.0004709165707057529, 'samples': 4790592, 'steps': 24950, 'loss/train': 1.6825127601623535} 11/07/2021 00:44:00 - INFO - __main__ - Step 24952: {'lr': 0.0004709140864766136, 'samples': 4790784, 'steps': 24951, 'loss/train': 1.4859381914138794} 11/07/2021 00:44:01 - INFO - __main__ - Step 24953: {'lr': 0.0004709116021479338, 'samples': 4790976, 'steps': 24952, 'loss/train': 1.4513448476791382} 11/07/2021 00:44:01 - INFO - __main__ - Step 24954: {'lr': 0.00047090911771971466, 'samples': 4791168, 'steps': 24953, 'loss/train': 1.3714603185653687} 11/07/2021 00:44:02 - INFO - __main__ - Step 24955: {'lr': 0.0004709066331919573, 'samples': 4791360, 'steps': 24954, 'loss/train': 1.8338035345077515} 11/07/2021 00:44:02 - INFO - __main__ - Step 24956: {'lr': 0.0004709041485646628, 'samples': 4791552, 'steps': 24955, 'loss/train': 1.6190221309661865} 11/07/2021 00:44:02 - INFO - __main__ - Step 24957: {'lr': 0.0004709016638378323, 'samples': 4791744, 'steps': 24956, 'loss/train': 1.393989086151123} 11/07/2021 00:44:03 - INFO - __main__ - Step 24958: {'lr': 0.00047089917901146694, 'samples': 4791936, 'steps': 24957, 'loss/train': 1.3725248575210571} 11/07/2021 00:44:04 - INFO - __main__ - Step 24959: {'lr': 0.0004708966940855678, 'samples': 4792128, 'steps': 24958, 'loss/train': 1.7774754762649536} 11/07/2021 00:44:04 - INFO - __main__ - Step 24960: {'lr': 0.00047089420906013603, 'samples': 4792320, 'steps': 24959, 'loss/train': 1.6581379175186157} 11/07/2021 00:44:04 - INFO - __main__ - Step 24961: {'lr': 0.0004708917239351727, 'samples': 4792512, 'steps': 24960, 'loss/train': 1.5697776079177856} 11/07/2021 00:44:05 - INFO - __main__ - Step 24962: {'lr': 0.000470889238710679, 'samples': 4792704, 'steps': 24961, 'loss/train': 1.5028849840164185} 11/07/2021 00:44:06 - INFO - __main__ - Step 24963: {'lr': 0.00047088675338665596, 'samples': 4792896, 'steps': 24962, 'loss/train': 1.3876596689224243} 11/07/2021 00:44:06 - INFO - __main__ - Step 24964: {'lr': 0.00047088426796310486, 'samples': 4793088, 'steps': 24963, 'loss/train': 1.650161862373352} 11/07/2021 00:44:06 - INFO - __main__ - Step 24965: {'lr': 0.00047088178244002665, 'samples': 4793280, 'steps': 24964, 'loss/train': 1.6179430484771729} 11/07/2021 00:44:07 - INFO - __main__ - Step 24966: {'lr': 0.00047087929681742253, 'samples': 4793472, 'steps': 24965, 'loss/train': 1.5869165658950806} 11/07/2021 00:44:07 - INFO - __main__ - Step 24967: {'lr': 0.00047087681109529364, 'samples': 4793664, 'steps': 24966, 'loss/train': 2.0685853958129883} 11/07/2021 00:44:08 - INFO - __main__ - Step 24968: {'lr': 0.00047087432527364106, 'samples': 4793856, 'steps': 24967, 'loss/train': 1.5191277265548706} 11/07/2021 00:44:08 - INFO - __main__ - Step 24969: {'lr': 0.0004708718393524659, 'samples': 4794048, 'steps': 24968, 'loss/train': 1.5035487413406372} 11/07/2021 00:44:09 - INFO - __main__ - Step 24970: {'lr': 0.0004708693533317693, 'samples': 4794240, 'steps': 24969, 'loss/train': 1.6295180320739746} 11/07/2021 00:44:09 - INFO - __main__ - Step 24971: {'lr': 0.00047086686721155237, 'samples': 4794432, 'steps': 24970, 'loss/train': 1.0224506855010986} 11/07/2021 00:44:10 - INFO - __main__ - Step 24972: {'lr': 0.00047086438099181615, 'samples': 4794624, 'steps': 24971, 'loss/train': 1.4066888093948364} 11/07/2021 00:44:10 - INFO - __main__ - Step 24973: {'lr': 0.00047086189467256194, 'samples': 4794816, 'steps': 24972, 'loss/train': 1.5826265811920166} 11/07/2021 00:44:11 - INFO - __main__ - Step 24974: {'lr': 0.0004708594082537908, 'samples': 4795008, 'steps': 24973, 'loss/train': 1.7132618427276611} 11/07/2021 00:44:11 - INFO - __main__ - Step 24975: {'lr': 0.00047085692173550375, 'samples': 4795200, 'steps': 24974, 'loss/train': 0.9648758769035339} 11/07/2021 00:44:12 - INFO - __main__ - Step 24976: {'lr': 0.00047085443511770206, 'samples': 4795392, 'steps': 24975, 'loss/train': 1.7411245107650757} 11/07/2021 00:44:12 - INFO - __main__ - Step 24977: {'lr': 0.0004708519484003867, 'samples': 4795584, 'steps': 24976, 'loss/train': 1.4710562229156494} 11/07/2021 00:44:12 - INFO - __main__ - Step 24978: {'lr': 0.0004708494615835589, 'samples': 4795776, 'steps': 24977, 'loss/train': 1.6182292699813843} 11/07/2021 00:44:13 - INFO - __main__ - Step 24979: {'lr': 0.00047084697466721973, 'samples': 4795968, 'steps': 24978, 'loss/train': 1.6603829860687256} 11/07/2021 00:44:14 - INFO - __main__ - Step 24980: {'lr': 0.0004708444876513703, 'samples': 4796160, 'steps': 24979, 'loss/train': 1.8253313302993774} 11/07/2021 00:44:14 - INFO - __main__ - Step 24981: {'lr': 0.0004708420005360118, 'samples': 4796352, 'steps': 24980, 'loss/train': 1.5156065225601196} 11/07/2021 00:44:14 - INFO - __main__ - Step 24982: {'lr': 0.0004708395133211452, 'samples': 4796544, 'steps': 24981, 'loss/train': 1.6060330867767334} 11/07/2021 00:44:15 - INFO - __main__ - Step 24983: {'lr': 0.0004708370260067718, 'samples': 4796736, 'steps': 24982, 'loss/train': 1.74071204662323} 11/07/2021 00:44:16 - INFO - __main__ - Step 24984: {'lr': 0.00047083453859289267, 'samples': 4796928, 'steps': 24983, 'loss/train': 1.570011019706726} 11/07/2021 00:44:16 - INFO - __main__ - Step 24985: {'lr': 0.00047083205107950886, 'samples': 4797120, 'steps': 24984, 'loss/train': 2.2700695991516113} 11/07/2021 00:44:16 - INFO - __main__ - Step 24986: {'lr': 0.00047082956346662153, 'samples': 4797312, 'steps': 24985, 'loss/train': 2.238264799118042} 11/07/2021 00:44:17 - INFO - __main__ - Step 24987: {'lr': 0.00047082707575423177, 'samples': 4797504, 'steps': 24986, 'loss/train': 1.502872109413147} 11/07/2021 00:44:17 - INFO - __main__ - Step 24988: {'lr': 0.00047082458794234087, 'samples': 4797696, 'steps': 24987, 'loss/train': 1.5819875001907349} 11/07/2021 00:44:18 - INFO - __main__ - Step 24989: {'lr': 0.0004708221000309497, 'samples': 4797888, 'steps': 24988, 'loss/train': 1.5078585147857666} 11/07/2021 00:44:18 - INFO - __main__ - Step 24990: {'lr': 0.0004708196120200595, 'samples': 4798080, 'steps': 24989, 'loss/train': 1.8042079210281372} 11/07/2021 00:44:19 - INFO - __main__ - Step 24991: {'lr': 0.0004708171239096715, 'samples': 4798272, 'steps': 24990, 'loss/train': 3.638201951980591} 11/07/2021 00:44:19 - INFO - __main__ - Step 24992: {'lr': 0.00047081463569978655, 'samples': 4798464, 'steps': 24991, 'loss/train': 1.200833797454834} 11/07/2021 00:44:20 - INFO - __main__ - Step 24993: {'lr': 0.00047081214739040606, 'samples': 4798656, 'steps': 24992, 'loss/train': 1.1243364810943604} 11/07/2021 00:44:21 - INFO - __main__ - Step 24994: {'lr': 0.000470809658981531, 'samples': 4798848, 'steps': 24993, 'loss/train': 1.4025392532348633} 11/07/2021 00:44:21 - INFO - __main__ - Step 24995: {'lr': 0.00047080717047316245, 'samples': 4799040, 'steps': 24994, 'loss/train': 1.8197518587112427} 11/07/2021 00:44:21 - INFO - __main__ - Step 24996: {'lr': 0.0004708046818653017, 'samples': 4799232, 'steps': 24995, 'loss/train': 1.973433256149292} 11/07/2021 00:44:22 - INFO - __main__ - Step 24997: {'lr': 0.0004708021931579497, 'samples': 4799424, 'steps': 24996, 'loss/train': 1.774387001991272} 11/07/2021 00:44:22 - INFO - __main__ - Step 24998: {'lr': 0.00047079970435110765, 'samples': 4799616, 'steps': 24997, 'loss/train': 1.645456314086914} 11/07/2021 00:44:24 - INFO - __main__ - Step 24999: {'lr': 0.0004707972154447766, 'samples': 4799808, 'steps': 24998, 'loss/train': 1.5165650844573975} 11/07/2021 00:44:24 - INFO - __main__ - Step 25000: {'lr': 0.00047079472643895784, 'samples': 4800000, 'steps': 24999, 'loss/train': 1.743245005607605} 11/07/2021 00:44:24 - INFO - __main__ - Step 25001: {'lr': 0.00047079223733365234, 'samples': 4800192, 'steps': 25000, 'loss/train': 2.0800983905792236} 11/07/2021 00:44:25 - INFO - __main__ - Step 25002: {'lr': 0.0004707897481288612, 'samples': 4800384, 'steps': 25001, 'loss/train': 1.7442258596420288} 11/07/2021 00:44:25 - INFO - __main__ - Step 25003: {'lr': 0.00047078725882458575, 'samples': 4800576, 'steps': 25002, 'loss/train': 1.497467279434204} 11/07/2021 00:44:25 - INFO - __main__ - Step 25004: {'lr': 0.0004707847694208269, 'samples': 4800768, 'steps': 25003, 'loss/train': 2.7911336421966553} 11/07/2021 00:44:26 - INFO - __main__ - Step 25005: {'lr': 0.0004707822799175858, 'samples': 4800960, 'steps': 25004, 'loss/train': 2.8122925758361816} 11/07/2021 00:44:27 - INFO - __main__ - Step 25006: {'lr': 0.00047077979031486363, 'samples': 4801152, 'steps': 25005, 'loss/train': 1.652961015701294} 11/07/2021 00:44:27 - INFO - __main__ - Step 25007: {'lr': 0.0004707773006126615, 'samples': 4801344, 'steps': 25006, 'loss/train': 1.2835978269577026} 11/07/2021 00:44:28 - INFO - __main__ - Step 25008: {'lr': 0.0004707748108109805, 'samples': 4801536, 'steps': 25007, 'loss/train': 1.211061716079712} 11/07/2021 00:44:28 - INFO - __main__ - Step 25009: {'lr': 0.0004707723209098218, 'samples': 4801728, 'steps': 25008, 'loss/train': 1.343076229095459} 11/07/2021 00:44:28 - INFO - __main__ - Step 25010: {'lr': 0.0004707698309091865, 'samples': 4801920, 'steps': 25009, 'loss/train': 1.59666109085083} 11/07/2021 00:44:29 - INFO - __main__ - Step 25011: {'lr': 0.00047076734080907576, 'samples': 4802112, 'steps': 25010, 'loss/train': 2.856903314590454} 11/07/2021 00:44:30 - INFO - __main__ - Step 25012: {'lr': 0.0004707648506094906, 'samples': 4802304, 'steps': 25011, 'loss/train': 1.2666581869125366} 11/07/2021 00:44:30 - INFO - __main__ - Step 25013: {'lr': 0.0004707623603104322, 'samples': 4802496, 'steps': 25012, 'loss/train': 1.4336587190628052} 11/07/2021 00:44:30 - INFO - __main__ - Step 25014: {'lr': 0.0004707598699119018, 'samples': 4802688, 'steps': 25013, 'loss/train': 1.6630597114562988} 11/07/2021 00:44:31 - INFO - __main__ - Step 25015: {'lr': 0.0004707573794139003, 'samples': 4802880, 'steps': 25014, 'loss/train': 1.6455538272857666} 11/07/2021 00:44:31 - INFO - __main__ - Step 25016: {'lr': 0.0004707548888164289, 'samples': 4803072, 'steps': 25015, 'loss/train': 1.7798850536346436} 11/07/2021 00:44:32 - INFO - __main__ - Step 25017: {'lr': 0.0004707523981194889, 'samples': 4803264, 'steps': 25016, 'loss/train': 1.1420173645019531} 11/07/2021 00:44:32 - INFO - __main__ - Step 25018: {'lr': 0.00047074990732308116, 'samples': 4803456, 'steps': 25017, 'loss/train': 1.7825181484222412} 11/07/2021 00:44:33 - INFO - __main__ - Step 25019: {'lr': 0.00047074741642720694, 'samples': 4803648, 'steps': 25018, 'loss/train': 1.617344856262207} 11/07/2021 00:44:33 - INFO - __main__ - Step 25020: {'lr': 0.0004707449254318673, 'samples': 4803840, 'steps': 25019, 'loss/train': 1.0126532316207886} 11/07/2021 00:44:34 - INFO - __main__ - Step 25021: {'lr': 0.0004707424343370635, 'samples': 4804032, 'steps': 25020, 'loss/train': 1.3363791704177856} 11/07/2021 00:44:35 - INFO - __main__ - Step 25022: {'lr': 0.00047073994314279647, 'samples': 4804224, 'steps': 25021, 'loss/train': 1.4168652296066284} 11/07/2021 00:44:35 - INFO - __main__ - Step 25023: {'lr': 0.0004707374518490675, 'samples': 4804416, 'steps': 25022, 'loss/train': 1.7937602996826172} 11/07/2021 00:44:35 - INFO - __main__ - Step 25024: {'lr': 0.0004707349604558776, 'samples': 4804608, 'steps': 25023, 'loss/train': 0.8730416893959045} 11/07/2021 00:44:36 - INFO - __main__ - Step 25025: {'lr': 0.00047073246896322797, 'samples': 4804800, 'steps': 25024, 'loss/train': 1.6327455043792725} 11/07/2021 00:44:36 - INFO - __main__ - Step 25026: {'lr': 0.00047072997737111966, 'samples': 4804992, 'steps': 25025, 'loss/train': 1.2765804529190063} 11/07/2021 00:44:37 - INFO - __main__ - Step 25027: {'lr': 0.0004707274856795538, 'samples': 4805184, 'steps': 25026, 'loss/train': 1.6824760437011719} 11/07/2021 00:44:37 - INFO - __main__ - Step 25028: {'lr': 0.00047072499388853164, 'samples': 4805376, 'steps': 25027, 'loss/train': 1.6089210510253906} 11/07/2021 00:44:38 - INFO - __main__ - Step 25029: {'lr': 0.0004707225019980541, 'samples': 4805568, 'steps': 25028, 'loss/train': 1.705112338066101} 11/07/2021 00:44:38 - INFO - __main__ - Step 25030: {'lr': 0.00047072001000812247, 'samples': 4805760, 'steps': 25029, 'loss/train': 2.295302391052246} 11/07/2021 00:44:38 - INFO - __main__ - Step 25031: {'lr': 0.00047071751791873774, 'samples': 4805952, 'steps': 25030, 'loss/train': 1.5089826583862305} 11/07/2021 00:44:39 - INFO - __main__ - Step 25032: {'lr': 0.0004707150257299012, 'samples': 4806144, 'steps': 25031, 'loss/train': 1.7053147554397583} 11/07/2021 00:44:40 - INFO - __main__ - Step 25033: {'lr': 0.0004707125334416138, 'samples': 4806336, 'steps': 25032, 'loss/train': 1.095575213432312} 11/07/2021 00:44:40 - INFO - __main__ - Step 25034: {'lr': 0.00047071004105387677, 'samples': 4806528, 'steps': 25033, 'loss/train': 1.6223604679107666} 11/07/2021 00:44:40 - INFO - __main__ - Step 25035: {'lr': 0.00047070754856669115, 'samples': 4806720, 'steps': 25034, 'loss/train': 1.5200324058532715} 11/07/2021 00:44:41 - INFO - __main__ - Step 25036: {'lr': 0.0004707050559800582, 'samples': 4806912, 'steps': 25035, 'loss/train': 1.3381532430648804} 11/07/2021 00:44:42 - INFO - __main__ - Step 25037: {'lr': 0.00047070256329397893, 'samples': 4807104, 'steps': 25036, 'loss/train': 1.9821909666061401} 11/07/2021 00:44:42 - INFO - __main__ - Step 25038: {'lr': 0.0004707000705084545, 'samples': 4807296, 'steps': 25037, 'loss/train': 0.9232311248779297} 11/07/2021 00:44:43 - INFO - __main__ - Step 25039: {'lr': 0.000470697577623486, 'samples': 4807488, 'steps': 25038, 'loss/train': 1.5496188402175903} 11/07/2021 00:44:43 - INFO - __main__ - Step 25040: {'lr': 0.0004706950846390746, 'samples': 4807680, 'steps': 25039, 'loss/train': 1.9020907878875732} 11/07/2021 00:44:43 - INFO - __main__ - Step 25041: {'lr': 0.00047069259155522135, 'samples': 4807872, 'steps': 25040, 'loss/train': 1.5948377847671509} 11/07/2021 00:44:44 - INFO - __main__ - Step 25042: {'lr': 0.0004706900983719274, 'samples': 4808064, 'steps': 25041, 'loss/train': 1.7538772821426392} 11/07/2021 00:44:45 - INFO - __main__ - Step 25043: {'lr': 0.000470687605089194, 'samples': 4808256, 'steps': 25042, 'loss/train': 1.5355638265609741} 11/07/2021 00:44:45 - INFO - __main__ - Step 25044: {'lr': 0.0004706851117070221, 'samples': 4808448, 'steps': 25043, 'loss/train': 1.604849100112915} 11/07/2021 00:44:45 - INFO - __main__ - Step 25045: {'lr': 0.0004706826182254129, 'samples': 4808640, 'steps': 25044, 'loss/train': 1.7193337678909302} 11/07/2021 00:44:46 - INFO - __main__ - Step 25046: {'lr': 0.0004706801246443676, 'samples': 4808832, 'steps': 25045, 'loss/train': 1.6934117078781128} 11/07/2021 00:44:46 - INFO - __main__ - Step 25047: {'lr': 0.00047067763096388717, 'samples': 4809024, 'steps': 25046, 'loss/train': 1.499121069908142} 11/07/2021 00:44:47 - INFO - __main__ - Step 25048: {'lr': 0.00047067513718397283, 'samples': 4809216, 'steps': 25047, 'loss/train': 1.7033010721206665} 11/07/2021 00:44:47 - INFO - __main__ - Step 25049: {'lr': 0.0004706726433046256, 'samples': 4809408, 'steps': 25048, 'loss/train': 1.6697512865066528} 11/07/2021 00:44:48 - INFO - __main__ - Step 25050: {'lr': 0.00047067014932584674, 'samples': 4809600, 'steps': 25049, 'loss/train': 2.169337272644043} 11/07/2021 00:44:48 - INFO - __main__ - Step 25051: {'lr': 0.0004706676552476373, 'samples': 4809792, 'steps': 25050, 'loss/train': 1.871291995048523} 11/07/2021 00:44:48 - INFO - __main__ - Step 25052: {'lr': 0.0004706651610699985, 'samples': 4809984, 'steps': 25051, 'loss/train': 1.3978254795074463} 11/07/2021 00:44:49 - INFO - __main__ - Step 25053: {'lr': 0.00047066266679293125, 'samples': 4810176, 'steps': 25052, 'loss/train': 1.6824945211410522} 11/07/2021 00:44:50 - INFO - __main__ - Step 25054: {'lr': 0.0004706601724164369, 'samples': 4810368, 'steps': 25053, 'loss/train': 4.329379558563232} 11/07/2021 00:44:50 - INFO - __main__ - Step 25055: {'lr': 0.0004706576779405165, 'samples': 4810560, 'steps': 25054, 'loss/train': 1.64674973487854} 11/07/2021 00:44:50 - INFO - __main__ - Step 25056: {'lr': 0.0004706551833651711, 'samples': 4810752, 'steps': 25055, 'loss/train': 1.7119991779327393} 11/07/2021 00:44:51 - INFO - __main__ - Step 25057: {'lr': 0.0004706526886904019, 'samples': 4810944, 'steps': 25056, 'loss/train': 1.7684483528137207} 11/07/2021 00:44:51 - INFO - __main__ - Step 25058: {'lr': 0.00047065019391621, 'samples': 4811136, 'steps': 25057, 'loss/train': 2.1801724433898926} 11/07/2021 00:44:52 - INFO - __main__ - Step 25059: {'lr': 0.0004706476990425965, 'samples': 4811328, 'steps': 25058, 'loss/train': 1.6123212575912476} 11/07/2021 00:44:52 - INFO - __main__ - Step 25060: {'lr': 0.0004706452040695626, 'samples': 4811520, 'steps': 25059, 'loss/train': 1.2272355556488037} 11/07/2021 00:44:53 - INFO - __main__ - Step 25061: {'lr': 0.0004706427089971093, 'samples': 4811712, 'steps': 25060, 'loss/train': 1.624558448791504} 11/07/2021 00:44:53 - INFO - __main__ - Step 25062: {'lr': 0.0004706402138252379, 'samples': 4811904, 'steps': 25061, 'loss/train': 1.537904143333435} 11/07/2021 00:44:54 - INFO - __main__ - Step 25063: {'lr': 0.00047063771855394935, 'samples': 4812096, 'steps': 25062, 'loss/train': 1.5233983993530273} 11/07/2021 00:44:55 - INFO - __main__ - Step 25064: {'lr': 0.00047063522318324484, 'samples': 4812288, 'steps': 25063, 'loss/train': 1.688103199005127} 11/07/2021 00:44:55 - INFO - __main__ - Step 25065: {'lr': 0.00047063272771312556, 'samples': 4812480, 'steps': 25064, 'loss/train': 2.031040668487549} 11/07/2021 00:44:55 - INFO - __main__ - Step 25066: {'lr': 0.0004706302321435926, 'samples': 4812672, 'steps': 25065, 'loss/train': 1.943956971168518} 11/07/2021 00:44:56 - INFO - __main__ - Step 25067: {'lr': 0.00047062773647464694, 'samples': 4812864, 'steps': 25066, 'loss/train': 2.0383780002593994} 11/07/2021 00:44:56 - INFO - __main__ - Step 25068: {'lr': 0.00047062524070628993, 'samples': 4813056, 'steps': 25067, 'loss/train': 0.8462532162666321} 11/07/2021 00:44:57 - INFO - __main__ - Step 25069: {'lr': 0.00047062274483852253, 'samples': 4813248, 'steps': 25068, 'loss/train': 1.6836276054382324} 11/07/2021 00:44:57 - INFO - __main__ - Step 25070: {'lr': 0.000470620248871346, 'samples': 4813440, 'steps': 25069, 'loss/train': 2.15901517868042} 11/07/2021 00:44:58 - INFO - __main__ - Step 25071: {'lr': 0.00047061775280476134, 'samples': 4813632, 'steps': 25070, 'loss/train': 2.193253993988037} 11/07/2021 00:44:58 - INFO - __main__ - Step 25072: {'lr': 0.0004706152566387697, 'samples': 4813824, 'steps': 25071, 'loss/train': 1.4540574550628662} 11/07/2021 00:44:58 - INFO - __main__ - Step 25073: {'lr': 0.0004706127603733723, 'samples': 4814016, 'steps': 25072, 'loss/train': 1.4969120025634766} 11/07/2021 00:44:59 - INFO - __main__ - Step 25074: {'lr': 0.00047061026400857015, 'samples': 4814208, 'steps': 25073, 'loss/train': 1.4810075759887695} 11/07/2021 00:45:00 - INFO - __main__ - Step 25075: {'lr': 0.0004706077675443644, 'samples': 4814400, 'steps': 25074, 'loss/train': 1.5527960062026978} 11/07/2021 00:45:00 - INFO - __main__ - Step 25076: {'lr': 0.00047060527098075625, 'samples': 4814592, 'steps': 25075, 'loss/train': 0.34231331944465637} 11/07/2021 00:45:00 - INFO - __main__ - Step 25077: {'lr': 0.0004706027743177467, 'samples': 4814784, 'steps': 25076, 'loss/train': 1.1344234943389893} 11/07/2021 00:45:01 - INFO - __main__ - Step 25078: {'lr': 0.000470600277555337, 'samples': 4814976, 'steps': 25077, 'loss/train': 1.5538222789764404} 11/07/2021 00:45:01 - INFO - __main__ - Step 25079: {'lr': 0.0004705977806935282, 'samples': 4815168, 'steps': 25078, 'loss/train': 1.3463051319122314} 11/07/2021 00:45:02 - INFO - __main__ - Step 25080: {'lr': 0.00047059528373232147, 'samples': 4815360, 'steps': 25079, 'loss/train': 1.8053596019744873} 11/07/2021 00:45:03 - INFO - __main__ - Step 25081: {'lr': 0.0004705927866717179, 'samples': 4815552, 'steps': 25080, 'loss/train': 0.673611044883728} 11/07/2021 00:45:03 - INFO - __main__ - Step 25082: {'lr': 0.0004705902895117186, 'samples': 4815744, 'steps': 25081, 'loss/train': 2.0037245750427246} 11/07/2021 00:45:03 - INFO - __main__ - Step 25083: {'lr': 0.00047058779225232474, 'samples': 4815936, 'steps': 25082, 'loss/train': 1.377502679824829} 11/07/2021 00:45:04 - INFO - __main__ - Step 25084: {'lr': 0.0004705852948935374, 'samples': 4816128, 'steps': 25083, 'loss/train': 2.0240447521209717} 11/07/2021 00:45:05 - INFO - __main__ - Step 25085: {'lr': 0.00047058279743535775, 'samples': 4816320, 'steps': 25084, 'loss/train': 1.436004877090454} 11/07/2021 00:45:05 - INFO - __main__ - Step 25086: {'lr': 0.0004705802998777869, 'samples': 4816512, 'steps': 25085, 'loss/train': 1.718674898147583} 11/07/2021 00:45:05 - INFO - __main__ - Step 25087: {'lr': 0.0004705778022208259, 'samples': 4816704, 'steps': 25086, 'loss/train': 1.7859622240066528} 11/07/2021 00:45:06 - INFO - __main__ - Step 25088: {'lr': 0.000470575304464476, 'samples': 4816896, 'steps': 25087, 'loss/train': 1.8729404211044312} 11/07/2021 00:45:06 - INFO - __main__ - Step 25089: {'lr': 0.00047057280660873835, 'samples': 4817088, 'steps': 25088, 'loss/train': 1.590794563293457} 11/07/2021 00:45:07 - INFO - __main__ - Step 25090: {'lr': 0.00047057030865361397, 'samples': 4817280, 'steps': 25089, 'loss/train': 1.306519627571106} 11/07/2021 00:45:07 - INFO - __main__ - Step 25091: {'lr': 0.0004705678105991039, 'samples': 4817472, 'steps': 25090, 'loss/train': 1.7953321933746338} 11/07/2021 00:45:08 - INFO - __main__ - Step 25092: {'lr': 0.00047056531244520945, 'samples': 4817664, 'steps': 25091, 'loss/train': 1.5451993942260742} 11/07/2021 00:45:08 - INFO - __main__ - Step 25093: {'lr': 0.0004705628141919317, 'samples': 4817856, 'steps': 25092, 'loss/train': 1.4345879554748535} 11/07/2021 00:45:09 - INFO - __main__ - Step 25094: {'lr': 0.00047056031583927175, 'samples': 4818048, 'steps': 25093, 'loss/train': 1.4095795154571533} 11/07/2021 00:45:10 - INFO - __main__ - Step 25095: {'lr': 0.00047055781738723063, 'samples': 4818240, 'steps': 25094, 'loss/train': 1.395472526550293} 11/07/2021 00:45:10 - INFO - __main__ - Step 25096: {'lr': 0.0004705553188358096, 'samples': 4818432, 'steps': 25095, 'loss/train': 1.569098711013794} 11/07/2021 00:45:10 - INFO - __main__ - Step 25097: {'lr': 0.00047055282018500976, 'samples': 4818624, 'steps': 25096, 'loss/train': 1.5912185907363892} 11/07/2021 00:45:11 - INFO - __main__ - Step 25098: {'lr': 0.0004705503214348323, 'samples': 4818816, 'steps': 25097, 'loss/train': 1.6429988145828247} 11/07/2021 00:45:11 - INFO - __main__ - Step 25099: {'lr': 0.0004705478225852782, 'samples': 4819008, 'steps': 25098, 'loss/train': 1.2610902786254883} 11/07/2021 00:45:12 - INFO - __main__ - Step 25100: {'lr': 0.0004705453236363486, 'samples': 4819200, 'steps': 25099, 'loss/train': 1.843684434890747} 11/07/2021 00:45:12 - INFO - __main__ - Step 25101: {'lr': 0.00047054282458804477, 'samples': 4819392, 'steps': 25100, 'loss/train': 1.653101921081543} 11/07/2021 00:45:13 - INFO - __main__ - Step 25102: {'lr': 0.0004705403254403677, 'samples': 4819584, 'steps': 25101, 'loss/train': 1.8327016830444336} 11/07/2021 00:45:13 - INFO - __main__ - Step 25103: {'lr': 0.0004705378261933186, 'samples': 4819776, 'steps': 25102, 'loss/train': 1.6882308721542358} 11/07/2021 00:45:13 - INFO - __main__ - Step 25104: {'lr': 0.0004705353268468985, 'samples': 4819968, 'steps': 25103, 'loss/train': 1.575581431388855} 11/07/2021 00:45:14 - INFO - __main__ - Step 25105: {'lr': 0.00047053282740110863, 'samples': 4820160, 'steps': 25104, 'loss/train': 1.2449451684951782} 11/07/2021 00:45:15 - INFO - __main__ - Step 25106: {'lr': 0.00047053032785595005, 'samples': 4820352, 'steps': 25105, 'loss/train': 1.3807636499404907} 11/07/2021 00:45:15 - INFO - __main__ - Step 25107: {'lr': 0.0004705278282114239, 'samples': 4820544, 'steps': 25106, 'loss/train': 1.588875651359558} 11/07/2021 00:45:15 - INFO - __main__ - Step 25108: {'lr': 0.0004705253284675314, 'samples': 4820736, 'steps': 25107, 'loss/train': 1.394099473953247} 11/07/2021 00:45:16 - INFO - __main__ - Step 25109: {'lr': 0.00047052282862427355, 'samples': 4820928, 'steps': 25108, 'loss/train': 1.948533058166504} 11/07/2021 00:45:16 - INFO - __main__ - Step 25110: {'lr': 0.0004705203286816514, 'samples': 4821120, 'steps': 25109, 'loss/train': 1.3867086172103882} 11/07/2021 00:45:17 - INFO - __main__ - Step 25111: {'lr': 0.0004705178286396663, 'samples': 4821312, 'steps': 25110, 'loss/train': 1.4365869760513306} 11/07/2021 00:45:17 - INFO - __main__ - Step 25112: {'lr': 0.0004705153284983192, 'samples': 4821504, 'steps': 25111, 'loss/train': 1.374193787574768} 11/07/2021 00:45:18 - INFO - __main__ - Step 25113: {'lr': 0.00047051282825761145, 'samples': 4821696, 'steps': 25112, 'loss/train': 1.4067364931106567} 11/07/2021 00:45:18 - INFO - __main__ - Step 25114: {'lr': 0.0004705103279175439, 'samples': 4821888, 'steps': 25113, 'loss/train': 1.631029725074768} 11/07/2021 00:45:19 - INFO - __main__ - Step 25115: {'lr': 0.0004705078274781178, 'samples': 4822080, 'steps': 25114, 'loss/train': 1.4061105251312256} 11/07/2021 00:45:20 - INFO - __main__ - Step 25116: {'lr': 0.0004705053269393343, 'samples': 4822272, 'steps': 25115, 'loss/train': 1.379422664642334} 11/07/2021 00:45:20 - INFO - __main__ - Step 25117: {'lr': 0.00047050282630119444, 'samples': 4822464, 'steps': 25116, 'loss/train': 1.5515828132629395} 11/07/2021 00:45:20 - INFO - __main__ - Step 25118: {'lr': 0.0004705003255636995, 'samples': 4822656, 'steps': 25117, 'loss/train': 1.743276596069336} 11/07/2021 00:45:21 - INFO - __main__ - Step 25119: {'lr': 0.0004704978247268505, 'samples': 4822848, 'steps': 25118, 'loss/train': 1.158705472946167} 11/07/2021 00:45:21 - INFO - __main__ - Step 25120: {'lr': 0.0004704953237906485, 'samples': 4823040, 'steps': 25119, 'loss/train': 1.197891354560852} 11/07/2021 00:45:22 - INFO - __main__ - Step 25121: {'lr': 0.0004704928227550949, 'samples': 4823232, 'steps': 25120, 'loss/train': 1.9623627662658691} 11/07/2021 00:45:22 - INFO - __main__ - Step 25122: {'lr': 0.00047049032162019044, 'samples': 4823424, 'steps': 25121, 'loss/train': 1.642743468284607} 11/07/2021 00:45:23 - INFO - __main__ - Step 25123: {'lr': 0.0004704878203859365, 'samples': 4823616, 'steps': 25122, 'loss/train': 1.1424055099487305} 11/07/2021 00:45:23 - INFO - __main__ - Step 25124: {'lr': 0.0004704853190523342, 'samples': 4823808, 'steps': 25123, 'loss/train': 0.8902316093444824} 11/07/2021 00:45:23 - INFO - __main__ - Step 25125: {'lr': 0.00047048281761938456, 'samples': 4824000, 'steps': 25124, 'loss/train': 1.7656041383743286} 11/07/2021 00:45:24 - INFO - __main__ - Step 25126: {'lr': 0.00047048031608708875, 'samples': 4824192, 'steps': 25125, 'loss/train': 1.437725305557251} 11/07/2021 00:45:25 - INFO - __main__ - Step 25127: {'lr': 0.000470477814455448, 'samples': 4824384, 'steps': 25126, 'loss/train': 1.609181523323059} 11/07/2021 00:45:25 - INFO - __main__ - Step 25128: {'lr': 0.0004704753127244633, 'samples': 4824576, 'steps': 25127, 'loss/train': 1.2707256078720093} 11/07/2021 00:45:25 - INFO - __main__ - Step 25129: {'lr': 0.0004704728108941358, 'samples': 4824768, 'steps': 25128, 'loss/train': 1.8579035997390747} 11/07/2021 00:45:26 - INFO - __main__ - Step 25130: {'lr': 0.00047047030896446665, 'samples': 4824960, 'steps': 25129, 'loss/train': 1.7406295537948608} 11/07/2021 00:45:27 - INFO - __main__ - Step 25131: {'lr': 0.000470467806935457, 'samples': 4825152, 'steps': 25130, 'loss/train': 1.0406829118728638} 11/07/2021 00:45:27 - INFO - __main__ - Step 25132: {'lr': 0.000470465304807108, 'samples': 4825344, 'steps': 25131, 'loss/train': 1.3713059425354004} 11/07/2021 00:45:28 - INFO - __main__ - Step 25133: {'lr': 0.00047046280257942067, 'samples': 4825536, 'steps': 25132, 'loss/train': 1.7930819988250732} 11/07/2021 00:45:28 - INFO - __main__ - Step 25134: {'lr': 0.0004704603002523962, 'samples': 4825728, 'steps': 25133, 'loss/train': 2.26041579246521} 11/07/2021 00:45:28 - INFO - __main__ - Step 25135: {'lr': 0.00047045779782603584, 'samples': 4825920, 'steps': 25134, 'loss/train': 1.8254454135894775} 11/07/2021 00:45:29 - INFO - __main__ - Step 25136: {'lr': 0.0004704552953003405, 'samples': 4826112, 'steps': 25135, 'loss/train': 1.814273715019226} 11/07/2021 00:45:30 - INFO - __main__ - Step 25137: {'lr': 0.0004704527926753114, 'samples': 4826304, 'steps': 25136, 'loss/train': 1.6575398445129395} 11/07/2021 00:45:30 - INFO - __main__ - Step 25138: {'lr': 0.00047045028995094967, 'samples': 4826496, 'steps': 25137, 'loss/train': 1.343518853187561} 11/07/2021 00:45:30 - INFO - __main__ - Step 25139: {'lr': 0.0004704477871272564, 'samples': 4826688, 'steps': 25138, 'loss/train': 1.4930167198181152} 11/07/2021 00:45:31 - INFO - __main__ - Step 25140: {'lr': 0.0004704452842042329, 'samples': 4826880, 'steps': 25139, 'loss/train': 1.235864520072937} 11/07/2021 00:45:32 - INFO - __main__ - Step 25141: {'lr': 0.00047044278118188004, 'samples': 4827072, 'steps': 25140, 'loss/train': 1.7771896123886108} 11/07/2021 00:45:32 - INFO - __main__ - Step 25142: {'lr': 0.00047044027806019914, 'samples': 4827264, 'steps': 25141, 'loss/train': 0.8156598210334778} 11/07/2021 00:45:32 - INFO - __main__ - Step 25143: {'lr': 0.0004704377748391912, 'samples': 4827456, 'steps': 25142, 'loss/train': 1.9012995958328247} 11/07/2021 00:45:33 - INFO - __main__ - Step 25144: {'lr': 0.0004704352715188574, 'samples': 4827648, 'steps': 25143, 'loss/train': 1.625672698020935} 11/07/2021 00:45:33 - INFO - __main__ - Step 25145: {'lr': 0.0004704327680991989, 'samples': 4827840, 'steps': 25144, 'loss/train': 1.3845484256744385} 11/07/2021 00:45:34 - INFO - __main__ - Step 25146: {'lr': 0.00047043026458021677, 'samples': 4828032, 'steps': 25145, 'loss/train': 1.3643563985824585} 11/07/2021 00:45:34 - INFO - __main__ - Step 25147: {'lr': 0.0004704277609619122, 'samples': 4828224, 'steps': 25146, 'loss/train': 0.5338404178619385} 11/07/2021 00:45:35 - INFO - __main__ - Step 25148: {'lr': 0.0004704252572442862, 'samples': 4828416, 'steps': 25147, 'loss/train': 1.316985845565796} 11/07/2021 00:45:35 - INFO - __main__ - Step 25149: {'lr': 0.00047042275342734006, 'samples': 4828608, 'steps': 25148, 'loss/train': 1.718420147895813} 11/07/2021 00:45:35 - INFO - __main__ - Step 25150: {'lr': 0.0004704202495110748, 'samples': 4828800, 'steps': 25149, 'loss/train': 1.6761091947555542} 11/07/2021 00:45:36 - INFO - __main__ - Step 25151: {'lr': 0.00047041774549549156, 'samples': 4828992, 'steps': 25150, 'loss/train': 1.5337203741073608} 11/07/2021 00:45:37 - INFO - __main__ - Step 25152: {'lr': 0.00047041524138059153, 'samples': 4829184, 'steps': 25151, 'loss/train': 0.9986521601676941} 11/07/2021 00:45:37 - INFO - __main__ - Step 25153: {'lr': 0.00047041273716637576, 'samples': 4829376, 'steps': 25152, 'loss/train': 2.1026737689971924} 11/07/2021 00:45:37 - INFO - __main__ - Step 25154: {'lr': 0.00047041023285284545, 'samples': 4829568, 'steps': 25153, 'loss/train': 1.5758517980575562} 11/07/2021 00:45:38 - INFO - __main__ - Step 25155: {'lr': 0.0004704077284400017, 'samples': 4829760, 'steps': 25154, 'loss/train': 1.3530430793762207} 11/07/2021 00:45:39 - INFO - __main__ - Step 25156: {'lr': 0.0004704052239278456, 'samples': 4829952, 'steps': 25155, 'loss/train': 1.5704340934753418} 11/07/2021 00:45:39 - INFO - __main__ - Step 25157: {'lr': 0.00047040271931637824, 'samples': 4830144, 'steps': 25156, 'loss/train': 1.4760314226150513} 11/07/2021 00:45:40 - INFO - __main__ - Step 25158: {'lr': 0.0004704002146056009, 'samples': 4830336, 'steps': 25157, 'loss/train': 1.5786726474761963} 11/07/2021 00:45:40 - INFO - __main__ - Step 25159: {'lr': 0.0004703977097955146, 'samples': 4830528, 'steps': 25158, 'loss/train': 1.7212551832199097} 11/07/2021 00:45:40 - INFO - __main__ - Step 25160: {'lr': 0.0004703952048861204, 'samples': 4830720, 'steps': 25159, 'loss/train': 1.3970686197280884} 11/07/2021 00:45:41 - INFO - __main__ - Step 25161: {'lr': 0.00047039269987741967, 'samples': 4830912, 'steps': 25160, 'loss/train': 1.5743192434310913} 11/07/2021 00:45:42 - INFO - __main__ - Step 25162: {'lr': 0.0004703901947694134, 'samples': 4831104, 'steps': 25161, 'loss/train': 1.6837447881698608} 11/07/2021 00:45:42 - INFO - __main__ - Step 25163: {'lr': 0.0004703876895621025, 'samples': 4831296, 'steps': 25162, 'loss/train': 1.656884789466858} 11/07/2021 00:45:42 - INFO - __main__ - Step 25164: {'lr': 0.0004703851842554885, 'samples': 4831488, 'steps': 25163, 'loss/train': 1.3900341987609863} 11/07/2021 00:45:43 - INFO - __main__ - Step 25165: {'lr': 0.0004703826788495723, 'samples': 4831680, 'steps': 25164, 'loss/train': 1.7015156745910645} 11/07/2021 00:45:43 - INFO - __main__ - Step 25166: {'lr': 0.00047038017334435504, 'samples': 4831872, 'steps': 25165, 'loss/train': 1.4276436567306519} 11/07/2021 00:45:44 - INFO - __main__ - Step 25167: {'lr': 0.00047037766773983794, 'samples': 4832064, 'steps': 25166, 'loss/train': 1.9233454465866089} 11/07/2021 00:45:44 - INFO - __main__ - Step 25168: {'lr': 0.00047037516203602195, 'samples': 4832256, 'steps': 25167, 'loss/train': 0.7028411626815796} 11/07/2021 00:45:45 - INFO - __main__ - Step 25169: {'lr': 0.0004703726562329084, 'samples': 4832448, 'steps': 25168, 'loss/train': 1.5089735984802246} 11/07/2021 00:45:45 - INFO - __main__ - Step 25170: {'lr': 0.0004703701503304983, 'samples': 4832640, 'steps': 25169, 'loss/train': 1.633947491645813} 11/07/2021 00:45:45 - INFO - __main__ - Step 25171: {'lr': 0.0004703676443287928, 'samples': 4832832, 'steps': 25170, 'loss/train': 1.2644140720367432} 11/07/2021 00:45:47 - INFO - __main__ - Step 25172: {'lr': 0.000470365138227793, 'samples': 4833024, 'steps': 25171, 'loss/train': 1.6214765310287476} 11/07/2021 00:45:47 - INFO - __main__ - Step 25173: {'lr': 0.0004703626320275002, 'samples': 4833216, 'steps': 25172, 'loss/train': 1.574013352394104} 11/07/2021 00:45:47 - INFO - __main__ - Step 25174: {'lr': 0.0004703601257279153, 'samples': 4833408, 'steps': 25173, 'loss/train': 1.7189385890960693} 11/07/2021 00:45:48 - INFO - __main__ - Step 25175: {'lr': 0.0004703576193290395, 'samples': 4833600, 'steps': 25174, 'loss/train': 0.7705118060112} 11/07/2021 00:45:48 - INFO - __main__ - Step 25176: {'lr': 0.0004703551128308741, 'samples': 4833792, 'steps': 25175, 'loss/train': 1.8714972734451294} 11/07/2021 00:45:49 - INFO - __main__ - Step 25177: {'lr': 0.00047035260623341996, 'samples': 4833984, 'steps': 25176, 'loss/train': 2.0388734340667725} 11/07/2021 00:45:49 - INFO - __main__ - Step 25178: {'lr': 0.0004703500995366784, 'samples': 4834176, 'steps': 25177, 'loss/train': 1.363921046257019} 11/07/2021 00:45:50 - INFO - __main__ - Step 25179: {'lr': 0.00047034759274065043, 'samples': 4834368, 'steps': 25178, 'loss/train': 1.8861095905303955} 11/07/2021 00:45:50 - INFO - __main__ - Step 25180: {'lr': 0.00047034508584533724, 'samples': 4834560, 'steps': 25179, 'loss/train': 1.4527485370635986} 11/07/2021 00:45:50 - INFO - __main__ - Step 25181: {'lr': 0.00047034257885074, 'samples': 4834752, 'steps': 25180, 'loss/train': 1.6563856601715088} 11/07/2021 00:45:51 - INFO - __main__ - Step 25182: {'lr': 0.00047034007175685976, 'samples': 4834944, 'steps': 25181, 'loss/train': 1.1298303604125977} 11/07/2021 00:45:52 - INFO - __main__ - Step 25183: {'lr': 0.0004703375645636977, 'samples': 4835136, 'steps': 25182, 'loss/train': 1.9646518230438232} 11/07/2021 00:45:52 - INFO - __main__ - Step 25184: {'lr': 0.0004703350572712549, 'samples': 4835328, 'steps': 25183, 'loss/train': 1.3798952102661133} 11/07/2021 00:45:52 - INFO - __main__ - Step 25185: {'lr': 0.00047033254987953254, 'samples': 4835520, 'steps': 25184, 'loss/train': 2.0689175128936768} 11/07/2021 00:45:53 - INFO - __main__ - Step 25186: {'lr': 0.0004703300423885318, 'samples': 4835712, 'steps': 25185, 'loss/train': 1.8879923820495605} 11/07/2021 00:45:54 - INFO - __main__ - Step 25187: {'lr': 0.0004703275347982536, 'samples': 4835904, 'steps': 25186, 'loss/train': 0.9834343194961548} 11/07/2021 00:45:54 - INFO - __main__ - Step 25188: {'lr': 0.00047032502710869935, 'samples': 4836096, 'steps': 25187, 'loss/train': 1.7398415803909302} 11/07/2021 00:45:55 - INFO - __main__ - Step 25189: {'lr': 0.00047032251931987, 'samples': 4836288, 'steps': 25188, 'loss/train': 1.4668735265731812} 11/07/2021 00:45:55 - INFO - __main__ - Step 25190: {'lr': 0.0004703200114317667, 'samples': 4836480, 'steps': 25189, 'loss/train': 1.608988881111145} 11/07/2021 00:45:55 - INFO - __main__ - Step 25191: {'lr': 0.0004703175034443906, 'samples': 4836672, 'steps': 25190, 'loss/train': 1.460955262184143} 11/07/2021 00:45:56 - INFO - __main__ - Step 25192: {'lr': 0.00047031499535774284, 'samples': 4836864, 'steps': 25191, 'loss/train': 1.9606585502624512} 11/07/2021 00:45:57 - INFO - __main__ - Step 25193: {'lr': 0.00047031248717182455, 'samples': 4837056, 'steps': 25192, 'loss/train': 1.4068056344985962} 11/07/2021 00:45:57 - INFO - __main__ - Step 25194: {'lr': 0.00047030997888663687, 'samples': 4837248, 'steps': 25193, 'loss/train': 1.0019291639328003} 11/07/2021 00:45:57 - INFO - __main__ - Step 25195: {'lr': 0.00047030747050218094, 'samples': 4837440, 'steps': 25194, 'loss/train': 1.4449735879898071} 11/07/2021 00:45:58 - INFO - __main__ - Step 25196: {'lr': 0.0004703049620184578, 'samples': 4837632, 'steps': 25195, 'loss/train': 1.705001711845398} 11/07/2021 00:45:58 - INFO - __main__ - Step 25197: {'lr': 0.0004703024534354686, 'samples': 4837824, 'steps': 25196, 'loss/train': 2.5124318599700928} 11/07/2021 00:45:59 - INFO - __main__ - Step 25198: {'lr': 0.0004702999447532146, 'samples': 4838016, 'steps': 25197, 'loss/train': 2.303138494491577} 11/07/2021 00:45:59 - INFO - __main__ - Step 25199: {'lr': 0.00047029743597169684, 'samples': 4838208, 'steps': 25198, 'loss/train': 1.051732063293457} 11/07/2021 00:46:00 - INFO - __main__ - Step 25200: {'lr': 0.0004702949270909164, 'samples': 4838400, 'steps': 25199, 'loss/train': 1.731921672821045} 11/07/2021 00:46:00 - INFO - __main__ - Step 25201: {'lr': 0.0004702924181108745, 'samples': 4838592, 'steps': 25200, 'loss/train': 1.6654285192489624} 11/07/2021 00:46:00 - INFO - __main__ - Step 25202: {'lr': 0.00047028990903157233, 'samples': 4838784, 'steps': 25201, 'loss/train': 1.3601429462432861} 11/07/2021 00:46:02 - INFO - __main__ - Step 25203: {'lr': 0.0004702873998530108, 'samples': 4838976, 'steps': 25202, 'loss/train': 1.6603739261627197} 11/07/2021 00:46:02 - INFO - __main__ - Step 25204: {'lr': 0.0004702848905751912, 'samples': 4839168, 'steps': 25203, 'loss/train': 1.4916635751724243} 11/07/2021 00:46:02 - INFO - __main__ - Step 25205: {'lr': 0.0004702823811981146, 'samples': 4839360, 'steps': 25204, 'loss/train': 1.0334322452545166} 11/07/2021 00:46:03 - INFO - __main__ - Step 25206: {'lr': 0.0004702798717217822, 'samples': 4839552, 'steps': 25205, 'loss/train': 1.5200536251068115} 11/07/2021 00:46:03 - INFO - __main__ - Step 25207: {'lr': 0.0004702773621461951, 'samples': 4839744, 'steps': 25206, 'loss/train': 1.2788139581680298} 11/07/2021 00:46:04 - INFO - __main__ - Step 25208: {'lr': 0.0004702748524713544, 'samples': 4839936, 'steps': 25207, 'loss/train': 1.8211277723312378} 11/07/2021 00:46:04 - INFO - __main__ - Step 25209: {'lr': 0.00047027234269726123, 'samples': 4840128, 'steps': 25208, 'loss/train': 1.4257100820541382} 11/07/2021 00:46:05 - INFO - __main__ - Step 25210: {'lr': 0.0004702698328239167, 'samples': 4840320, 'steps': 25209, 'loss/train': 1.4680659770965576} 11/07/2021 00:46:05 - INFO - __main__ - Step 25211: {'lr': 0.0004702673228513221, 'samples': 4840512, 'steps': 25210, 'loss/train': 0.9686245322227478} 11/07/2021 00:46:05 - INFO - __main__ - Step 25212: {'lr': 0.00047026481277947835, 'samples': 4840704, 'steps': 25211, 'loss/train': 1.9399867057800293} 11/07/2021 00:46:06 - INFO - __main__ - Step 25213: {'lr': 0.0004702623026083867, 'samples': 4840896, 'steps': 25212, 'loss/train': 1.4516819715499878} 11/07/2021 00:46:07 - INFO - __main__ - Step 25214: {'lr': 0.00047025979233804825, 'samples': 4841088, 'steps': 25213, 'loss/train': 1.4941320419311523} 11/07/2021 00:46:07 - INFO - __main__ - Step 25215: {'lr': 0.00047025728196846417, 'samples': 4841280, 'steps': 25214, 'loss/train': 1.2123286724090576} 11/07/2021 00:46:07 - INFO - __main__ - Step 25216: {'lr': 0.0004702547714996355, 'samples': 4841472, 'steps': 25215, 'loss/train': 1.6218972206115723} 11/07/2021 00:46:08 - INFO - __main__ - Step 25217: {'lr': 0.00047025226093156346, 'samples': 4841664, 'steps': 25216, 'loss/train': 1.8326835632324219} 11/07/2021 00:46:09 - INFO - __main__ - Step 25218: {'lr': 0.0004702497502642492, 'samples': 4841856, 'steps': 25217, 'loss/train': 1.7213815450668335} 11/07/2021 00:46:09 - INFO - __main__ - Step 25219: {'lr': 0.0004702472394976938, 'samples': 4842048, 'steps': 25218, 'loss/train': 1.4464116096496582} 11/07/2021 00:46:09 - INFO - __main__ - Step 25220: {'lr': 0.0004702447286318983, 'samples': 4842240, 'steps': 25219, 'loss/train': 1.173579454421997} 11/07/2021 00:46:10 - INFO - __main__ - Step 25221: {'lr': 0.0004702422176668639, 'samples': 4842432, 'steps': 25220, 'loss/train': 0.8121039867401123} 11/07/2021 00:46:10 - INFO - __main__ - Step 25222: {'lr': 0.00047023970660259193, 'samples': 4842624, 'steps': 25221, 'loss/train': 1.5761325359344482} 11/07/2021 00:46:11 - INFO - __main__ - Step 25223: {'lr': 0.0004702371954390832, 'samples': 4842816, 'steps': 25222, 'loss/train': 1.708838939666748} 11/07/2021 00:46:12 - INFO - __main__ - Step 25224: {'lr': 0.00047023468417633905, 'samples': 4843008, 'steps': 25223, 'loss/train': 1.512524962425232} 11/07/2021 00:46:12 - INFO - __main__ - Step 25225: {'lr': 0.0004702321728143605, 'samples': 4843200, 'steps': 25224, 'loss/train': 1.0887199640274048} 11/07/2021 00:46:12 - INFO - __main__ - Step 25226: {'lr': 0.0004702296613531488, 'samples': 4843392, 'steps': 25225, 'loss/train': 1.1451746225357056} 11/07/2021 00:46:13 - INFO - __main__ - Step 25227: {'lr': 0.00047022714979270497, 'samples': 4843584, 'steps': 25226, 'loss/train': 1.372851848602295} 11/07/2021 00:46:14 - INFO - __main__ - Step 25228: {'lr': 0.0004702246381330302, 'samples': 4843776, 'steps': 25227, 'loss/train': 1.5111043453216553} 11/07/2021 00:46:14 - INFO - __main__ - Step 25229: {'lr': 0.00047022212637412553, 'samples': 4843968, 'steps': 25228, 'loss/train': 1.1899921894073486} 11/07/2021 00:46:14 - INFO - __main__ - Step 25230: {'lr': 0.00047021961451599226, 'samples': 4844160, 'steps': 25229, 'loss/train': 1.3683167695999146} 11/07/2021 00:46:15 - INFO - __main__ - Step 25231: {'lr': 0.00047021710255863144, 'samples': 4844352, 'steps': 25230, 'loss/train': 1.5361984968185425} 11/07/2021 00:46:15 - INFO - __main__ - Step 25232: {'lr': 0.0004702145905020442, 'samples': 4844544, 'steps': 25231, 'loss/train': 1.7946422100067139} 11/07/2021 00:46:15 - INFO - __main__ - Step 25233: {'lr': 0.0004702120783462316, 'samples': 4844736, 'steps': 25232, 'loss/train': 1.4184598922729492} 11/07/2021 00:46:16 - INFO - __main__ - Step 25234: {'lr': 0.00047020956609119483, 'samples': 4844928, 'steps': 25233, 'loss/train': 3.4014549255371094} 11/07/2021 00:46:17 - INFO - __main__ - Step 25235: {'lr': 0.0004702070537369351, 'samples': 4845120, 'steps': 25234, 'loss/train': 1.3262993097305298} 11/07/2021 00:46:17 - INFO - __main__ - Step 25236: {'lr': 0.00047020454128345333, 'samples': 4845312, 'steps': 25235, 'loss/train': 1.4103611707687378} 11/07/2021 00:46:17 - INFO - __main__ - Step 25237: {'lr': 0.00047020202873075093, 'samples': 4845504, 'steps': 25236, 'loss/train': 1.9136335849761963} 11/07/2021 00:46:18 - INFO - __main__ - Step 25238: {'lr': 0.00047019951607882884, 'samples': 4845696, 'steps': 25237, 'loss/train': 1.7273340225219727} 11/07/2021 00:46:19 - INFO - __main__ - Step 25239: {'lr': 0.0004701970033276882, 'samples': 4845888, 'steps': 25238, 'loss/train': 1.530830979347229} 11/07/2021 00:46:19 - INFO - __main__ - Step 25240: {'lr': 0.0004701944904773303, 'samples': 4846080, 'steps': 25239, 'loss/train': 1.5841394662857056} 11/07/2021 00:46:20 - INFO - __main__ - Step 25241: {'lr': 0.0004701919775277561, 'samples': 4846272, 'steps': 25240, 'loss/train': 1.6559784412384033} 11/07/2021 00:46:20 - INFO - __main__ - Step 25242: {'lr': 0.0004701894644789668, 'samples': 4846464, 'steps': 25241, 'loss/train': 1.7496247291564941} 11/07/2021 00:46:20 - INFO - __main__ - Step 25243: {'lr': 0.0004701869513309635, 'samples': 4846656, 'steps': 25242, 'loss/train': 1.0005935430526733} 11/07/2021 00:46:21 - INFO - __main__ - Step 25244: {'lr': 0.0004701844380837474, 'samples': 4846848, 'steps': 25243, 'loss/train': 1.9548184871673584} 11/07/2021 00:46:22 - INFO - __main__ - Step 25245: {'lr': 0.00047018192473731956, 'samples': 4847040, 'steps': 25244, 'loss/train': 1.5735375881195068} 11/07/2021 00:46:22 - INFO - __main__ - Step 25246: {'lr': 0.0004701794112916812, 'samples': 4847232, 'steps': 25245, 'loss/train': 0.7068080306053162} 11/07/2021 00:46:22 - INFO - __main__ - Step 25247: {'lr': 0.00047017689774683325, 'samples': 4847424, 'steps': 25246, 'loss/train': 2.1233181953430176} 11/07/2021 00:46:23 - INFO - __main__ - Step 25248: {'lr': 0.0004701743841027771, 'samples': 4847616, 'steps': 25247, 'loss/train': 1.7109966278076172} 11/07/2021 00:46:24 - INFO - __main__ - Step 25249: {'lr': 0.0004701718703595138, 'samples': 4847808, 'steps': 25248, 'loss/train': 1.3317124843597412} 11/07/2021 00:46:24 - INFO - __main__ - Step 25250: {'lr': 0.0004701693565170444, 'samples': 4848000, 'steps': 25249, 'loss/train': 1.6570696830749512} 11/07/2021 00:46:24 - INFO - __main__ - Step 25251: {'lr': 0.0004701668425753701, 'samples': 4848192, 'steps': 25250, 'loss/train': 1.3250230550765991} 11/07/2021 00:46:25 - INFO - __main__ - Step 25252: {'lr': 0.000470164328534492, 'samples': 4848384, 'steps': 25251, 'loss/train': 1.9276931285858154} 11/07/2021 00:46:25 - INFO - __main__ - Step 25253: {'lr': 0.00047016181439441126, 'samples': 4848576, 'steps': 25252, 'loss/train': 2.1577646732330322} 11/07/2021 00:46:26 - INFO - __main__ - Step 25254: {'lr': 0.000470159300155129, 'samples': 4848768, 'steps': 25253, 'loss/train': 1.3344687223434448} 11/07/2021 00:46:27 - INFO - __main__ - Step 25255: {'lr': 0.00047015678581664635, 'samples': 4848960, 'steps': 25254, 'loss/train': 1.5740892887115479} 11/07/2021 00:46:27 - INFO - __main__ - Step 25256: {'lr': 0.00047015427137896446, 'samples': 4849152, 'steps': 25255, 'loss/train': 1.5407794713974} 11/07/2021 00:46:27 - INFO - __main__ - Step 25257: {'lr': 0.0004701517568420844, 'samples': 4849344, 'steps': 25256, 'loss/train': 1.5174570083618164} 11/07/2021 00:46:28 - INFO - __main__ - Step 25258: {'lr': 0.0004701492422060074, 'samples': 4849536, 'steps': 25257, 'loss/train': 1.896033525466919} 11/07/2021 00:46:28 - INFO - __main__ - Step 25259: {'lr': 0.0004701467274707346, 'samples': 4849728, 'steps': 25258, 'loss/train': 1.2070672512054443} 11/07/2021 00:46:29 - INFO - __main__ - Step 25260: {'lr': 0.0004701442126362671, 'samples': 4849920, 'steps': 25259, 'loss/train': 1.9618362188339233} 11/07/2021 00:46:29 - INFO - __main__ - Step 25261: {'lr': 0.0004701416977026059, 'samples': 4850112, 'steps': 25260, 'loss/train': 1.8493741750717163} 11/07/2021 00:46:30 - INFO - __main__ - Step 25262: {'lr': 0.0004701391826697523, 'samples': 4850304, 'steps': 25261, 'loss/train': 2.0049753189086914} 11/07/2021 00:46:30 - INFO - __main__ - Step 25263: {'lr': 0.00047013666753770736, 'samples': 4850496, 'steps': 25262, 'loss/train': 1.62056303024292} 11/07/2021 00:46:30 - INFO - __main__ - Step 25264: {'lr': 0.00047013415230647227, 'samples': 4850688, 'steps': 25263, 'loss/train': 0.6673987507820129} 11/07/2021 00:46:32 - INFO - __main__ - Step 25265: {'lr': 0.0004701316369760481, 'samples': 4850880, 'steps': 25264, 'loss/train': 1.8424373865127563} 11/07/2021 00:46:32 - INFO - __main__ - Step 25266: {'lr': 0.00047012912154643607, 'samples': 4851072, 'steps': 25265, 'loss/train': 1.5502065420150757} 11/07/2021 00:46:32 - INFO - __main__ - Step 25267: {'lr': 0.0004701266060176372, 'samples': 4851264, 'steps': 25266, 'loss/train': 2.0707805156707764} 11/07/2021 00:46:33 - INFO - __main__ - Step 25268: {'lr': 0.00047012409038965267, 'samples': 4851456, 'steps': 25267, 'loss/train': 1.219706416130066} 11/07/2021 00:46:33 - INFO - __main__ - Step 25269: {'lr': 0.0004701215746624836, 'samples': 4851648, 'steps': 25268, 'loss/train': 1.712836503982544} 11/07/2021 00:46:34 - INFO - __main__ - Step 25270: {'lr': 0.0004701190588361312, 'samples': 4851840, 'steps': 25269, 'loss/train': 1.7841097116470337} 11/07/2021 00:46:34 - INFO - __main__ - Step 25271: {'lr': 0.0004701165429105966, 'samples': 4852032, 'steps': 25270, 'loss/train': 1.5034170150756836} 11/07/2021 00:46:35 - INFO - __main__ - Step 25272: {'lr': 0.0004701140268858808, 'samples': 4852224, 'steps': 25271, 'loss/train': 3.784604787826538} 11/07/2021 00:46:35 - INFO - __main__ - Step 25273: {'lr': 0.000470111510761985, 'samples': 4852416, 'steps': 25272, 'loss/train': 1.329630970954895} 11/07/2021 00:46:35 - INFO - __main__ - Step 25274: {'lr': 0.0004701089945389104, 'samples': 4852608, 'steps': 25273, 'loss/train': 0.9505113959312439} 11/07/2021 00:46:36 - INFO - __main__ - Step 25275: {'lr': 0.00047010647821665803, 'samples': 4852800, 'steps': 25274, 'loss/train': 1.4187085628509521} 11/07/2021 00:46:37 - INFO - __main__ - Step 25276: {'lr': 0.0004701039617952291, 'samples': 4852992, 'steps': 25275, 'loss/train': 0.9424089193344116} 11/07/2021 00:46:37 - INFO - __main__ - Step 25277: {'lr': 0.00047010144527462474, 'samples': 4853184, 'steps': 25276, 'loss/train': 2.3901946544647217} 11/07/2021 00:46:37 - INFO - __main__ - Step 25278: {'lr': 0.00047009892865484607, 'samples': 4853376, 'steps': 25277, 'loss/train': 1.3440847396850586} 11/07/2021 00:46:38 - INFO - __main__ - Step 25279: {'lr': 0.00047009641193589423, 'samples': 4853568, 'steps': 25278, 'loss/train': 1.3197660446166992} 11/07/2021 00:46:38 - INFO - __main__ - Step 25280: {'lr': 0.00047009389511777036, 'samples': 4853760, 'steps': 25279, 'loss/train': 1.6327497959136963} 11/07/2021 00:46:39 - INFO - __main__ - Step 25281: {'lr': 0.0004700913782004755, 'samples': 4853952, 'steps': 25280, 'loss/train': 1.8389418125152588} 11/07/2021 00:46:40 - INFO - __main__ - Step 25282: {'lr': 0.00047008886118401084, 'samples': 4854144, 'steps': 25281, 'loss/train': 1.55793297290802} 11/07/2021 00:46:40 - INFO - __main__ - Step 25283: {'lr': 0.0004700863440683776, 'samples': 4854336, 'steps': 25282, 'loss/train': 1.6414594650268555} 11/07/2021 00:46:40 - INFO - __main__ - Step 25284: {'lr': 0.00047008382685357686, 'samples': 4854528, 'steps': 25283, 'loss/train': 1.131192684173584} 11/07/2021 00:46:41 - INFO - __main__ - Step 25285: {'lr': 0.0004700813095396098, 'samples': 4854720, 'steps': 25284, 'loss/train': 1.6616283655166626} 11/07/2021 00:46:42 - INFO - __main__ - Step 25286: {'lr': 0.00047007879212647744, 'samples': 4854912, 'steps': 25285, 'loss/train': 1.4370410442352295} 11/07/2021 00:46:42 - INFO - __main__ - Step 25287: {'lr': 0.0004700762746141809, 'samples': 4855104, 'steps': 25286, 'loss/train': 1.078500509262085} 11/07/2021 00:46:42 - INFO - __main__ - Step 25288: {'lr': 0.0004700737570027214, 'samples': 4855296, 'steps': 25287, 'loss/train': 1.5888121128082275} 11/07/2021 00:46:43 - INFO - __main__ - Step 25289: {'lr': 0.00047007123929210015, 'samples': 4855488, 'steps': 25288, 'loss/train': 1.627748966217041} 11/07/2021 00:46:43 - INFO - __main__ - Step 25290: {'lr': 0.00047006872148231814, 'samples': 4855680, 'steps': 25289, 'loss/train': 1.8425540924072266} 11/07/2021 00:46:44 - INFO - __main__ - Step 25291: {'lr': 0.0004700662035733766, 'samples': 4855872, 'steps': 25290, 'loss/train': 1.4199053049087524} 11/07/2021 00:46:44 - INFO - __main__ - Step 25292: {'lr': 0.0004700636855652766, 'samples': 4856064, 'steps': 25291, 'loss/train': 1.9728237390518188} 11/07/2021 00:46:45 - INFO - __main__ - Step 25293: {'lr': 0.0004700611674580193, 'samples': 4856256, 'steps': 25292, 'loss/train': 1.901063323020935} 11/07/2021 00:46:45 - INFO - __main__ - Step 25294: {'lr': 0.0004700586492516058, 'samples': 4856448, 'steps': 25293, 'loss/train': 1.5009207725524902} 11/07/2021 00:46:45 - INFO - __main__ - Step 25295: {'lr': 0.00047005613094603727, 'samples': 4856640, 'steps': 25294, 'loss/train': 1.6654140949249268} 11/07/2021 00:46:46 - INFO - __main__ - Step 25296: {'lr': 0.0004700536125413149, 'samples': 4856832, 'steps': 25295, 'loss/train': 1.5214890241622925} 11/07/2021 00:46:47 - INFO - __main__ - Step 25297: {'lr': 0.00047005109403743976, 'samples': 4857024, 'steps': 25296, 'loss/train': 1.6599291563034058} 11/07/2021 00:46:47 - INFO - __main__ - Step 25298: {'lr': 0.00047004857543441294, 'samples': 4857216, 'steps': 25297, 'loss/train': 1.448982834815979} 11/07/2021 00:46:47 - INFO - __main__ - Step 25299: {'lr': 0.00047004605673223567, 'samples': 4857408, 'steps': 25298, 'loss/train': 1.1666847467422485} 11/07/2021 00:46:48 - INFO - __main__ - Step 25300: {'lr': 0.00047004353793090903, 'samples': 4857600, 'steps': 25299, 'loss/train': 1.232759714126587} 11/07/2021 00:46:48 - INFO - __main__ - Step 25301: {'lr': 0.00047004101903043416, 'samples': 4857792, 'steps': 25300, 'loss/train': 0.6032378673553467} 11/07/2021 00:46:49 - INFO - __main__ - Step 25302: {'lr': 0.00047003850003081215, 'samples': 4857984, 'steps': 25301, 'loss/train': 1.9542654752731323} 11/07/2021 00:46:50 - INFO - __main__ - Step 25303: {'lr': 0.0004700359809320443, 'samples': 4858176, 'steps': 25302, 'loss/train': 1.7449018955230713} 11/07/2021 00:46:50 - INFO - __main__ - Step 25304: {'lr': 0.0004700334617341316, 'samples': 4858368, 'steps': 25303, 'loss/train': 1.4991850852966309} 11/07/2021 00:46:50 - INFO - __main__ - Step 25305: {'lr': 0.0004700309424370752, 'samples': 4858560, 'steps': 25304, 'loss/train': 1.3455928564071655} 11/07/2021 00:46:51 - INFO - __main__ - Step 25306: {'lr': 0.00047002842304087625, 'samples': 4858752, 'steps': 25305, 'loss/train': 0.9700114130973816} 11/07/2021 00:46:52 - INFO - __main__ - Step 25307: {'lr': 0.00047002590354553586, 'samples': 4858944, 'steps': 25306, 'loss/train': 0.09950239956378937} 11/07/2021 00:46:52 - INFO - __main__ - Step 25308: {'lr': 0.00047002338395105527, 'samples': 4859136, 'steps': 25307, 'loss/train': 2.488790988922119} 11/07/2021 00:46:53 - INFO - __main__ - Step 25309: {'lr': 0.00047002086425743545, 'samples': 4859328, 'steps': 25308, 'loss/train': 1.6912132501602173} 11/07/2021 00:46:53 - INFO - __main__ - Step 25310: {'lr': 0.0004700183444646776, 'samples': 4859520, 'steps': 25309, 'loss/train': 1.379109263420105} 11/07/2021 00:46:53 - INFO - __main__ - Step 25311: {'lr': 0.000470015824572783, 'samples': 4859712, 'steps': 25310, 'loss/train': 1.4917999505996704} 11/07/2021 00:46:54 - INFO - __main__ - Step 25312: {'lr': 0.00047001330458175264, 'samples': 4859904, 'steps': 25311, 'loss/train': 1.5213574171066284} 11/07/2021 00:46:55 - INFO - __main__ - Step 25313: {'lr': 0.0004700107844915876, 'samples': 4860096, 'steps': 25312, 'loss/train': 0.6688195466995239} 11/07/2021 00:46:55 - INFO - __main__ - Step 25314: {'lr': 0.00047000826430228915, 'samples': 4860288, 'steps': 25313, 'loss/train': 1.4643245935440063} 11/07/2021 00:46:55 - INFO - __main__ - Step 25315: {'lr': 0.00047000574401385835, 'samples': 4860480, 'steps': 25314, 'loss/train': 1.6544361114501953} 11/07/2021 00:46:56 - INFO - __main__ - Step 25316: {'lr': 0.0004700032236262964, 'samples': 4860672, 'steps': 25315, 'loss/train': 1.5297194719314575} 11/07/2021 00:46:57 - INFO - __main__ - Step 25317: {'lr': 0.00047000070313960436, 'samples': 4860864, 'steps': 25316, 'loss/train': 1.2998043298721313} 11/07/2021 00:46:57 - INFO - __main__ - Step 25318: {'lr': 0.00046999818255378335, 'samples': 4861056, 'steps': 25317, 'loss/train': 1.546887755393982} 11/07/2021 00:46:57 - INFO - __main__ - Step 25319: {'lr': 0.00046999566186883466, 'samples': 4861248, 'steps': 25318, 'loss/train': 1.403463363647461} 11/07/2021 00:46:58 - INFO - __main__ - Step 25320: {'lr': 0.0004699931410847592, 'samples': 4861440, 'steps': 25319, 'loss/train': 1.6991873979568481} 11/07/2021 00:46:58 - INFO - __main__ - Step 25321: {'lr': 0.00046999062020155834, 'samples': 4861632, 'steps': 25320, 'loss/train': 1.4992635250091553} 11/07/2021 00:46:59 - INFO - __main__ - Step 25322: {'lr': 0.00046998809921923305, 'samples': 4861824, 'steps': 25321, 'loss/train': 1.5245031118392944} 11/07/2021 00:46:59 - INFO - __main__ - Step 25323: {'lr': 0.0004699855781377845, 'samples': 4862016, 'steps': 25322, 'loss/train': 0.9190696477890015} 11/07/2021 00:47:00 - INFO - __main__ - Step 25324: {'lr': 0.0004699830569572139, 'samples': 4862208, 'steps': 25323, 'loss/train': 1.7949877977371216} 11/07/2021 00:47:00 - INFO - __main__ - Step 25325: {'lr': 0.00046998053567752225, 'samples': 4862400, 'steps': 25324, 'loss/train': 1.261948585510254} 11/07/2021 00:47:01 - INFO - __main__ - Step 25326: {'lr': 0.0004699780142987108, 'samples': 4862592, 'steps': 25325, 'loss/train': 1.4420384168624878} 11/07/2021 00:47:01 - INFO - __main__ - Step 25327: {'lr': 0.0004699754928207807, 'samples': 4862784, 'steps': 25326, 'loss/train': 1.365442156791687} 11/07/2021 00:47:02 - INFO - __main__ - Step 25328: {'lr': 0.00046997297124373293, 'samples': 4862976, 'steps': 25327, 'loss/train': 1.9386059045791626} 11/07/2021 00:47:02 - INFO - __main__ - Step 25329: {'lr': 0.00046997044956756883, 'samples': 4863168, 'steps': 25328, 'loss/train': 1.3372117280960083} 11/07/2021 00:47:03 - INFO - __main__ - Step 25330: {'lr': 0.00046996792779228935, 'samples': 4863360, 'steps': 25329, 'loss/train': 1.326066255569458} 11/07/2021 00:47:03 - INFO - __main__ - Step 25331: {'lr': 0.00046996540591789584, 'samples': 4863552, 'steps': 25330, 'loss/train': 1.7588785886764526} 11/07/2021 00:47:03 - INFO - __main__ - Step 25332: {'lr': 0.00046996288394438924, 'samples': 4863744, 'steps': 25331, 'loss/train': 1.963739275932312} 11/07/2021 00:47:06 - INFO - __main__ - Step 25333: {'lr': 0.00046996036187177073, 'samples': 4863936, 'steps': 25332, 'loss/train': 1.4010385274887085} 11/07/2021 00:47:06 - INFO - __main__ - Step 25334: {'lr': 0.0004699578397000415, 'samples': 4864128, 'steps': 25333, 'loss/train': 1.7472310066223145} 11/07/2021 00:47:06 - INFO - __main__ - Step 25335: {'lr': 0.00046995531742920264, 'samples': 4864320, 'steps': 25334, 'loss/train': 1.6207789182662964} 11/07/2021 00:47:07 - INFO - __main__ - Step 25336: {'lr': 0.00046995279505925535, 'samples': 4864512, 'steps': 25335, 'loss/train': 1.2272355556488037} 11/07/2021 00:47:07 - INFO - __main__ - Step 25337: {'lr': 0.00046995027259020075, 'samples': 4864704, 'steps': 25336, 'loss/train': 1.8280538320541382} 11/07/2021 00:47:07 - INFO - __main__ - Step 25338: {'lr': 0.00046994775002203994, 'samples': 4864896, 'steps': 25337, 'loss/train': 1.7909228801727295} 11/07/2021 00:47:08 - INFO - __main__ - Step 25339: {'lr': 0.000469945227354774, 'samples': 4865088, 'steps': 25338, 'loss/train': 0.8646075129508972} 11/07/2021 00:47:08 - INFO - __main__ - Step 25340: {'lr': 0.00046994270458840416, 'samples': 4865280, 'steps': 25339, 'loss/train': 1.7977173328399658} 11/07/2021 00:47:09 - INFO - __main__ - Step 25341: {'lr': 0.0004699401817229316, 'samples': 4865472, 'steps': 25340, 'loss/train': 0.9084984064102173} 11/07/2021 00:47:09 - INFO - __main__ - Step 25342: {'lr': 0.0004699376587583573, 'samples': 4865664, 'steps': 25341, 'loss/train': 1.1019365787506104} 11/07/2021 00:47:10 - INFO - __main__ - Step 25343: {'lr': 0.0004699351356946825, 'samples': 4865856, 'steps': 25342, 'loss/train': 1.4869623184204102} 11/07/2021 00:47:10 - INFO - __main__ - Step 25344: {'lr': 0.00046993261253190833, 'samples': 4866048, 'steps': 25343, 'loss/train': 1.684993863105774} 11/07/2021 00:47:10 - INFO - __main__ - Step 25345: {'lr': 0.000469930089270036, 'samples': 4866240, 'steps': 25344, 'loss/train': 1.3853352069854736} 11/07/2021 00:47:12 - INFO - __main__ - Step 25346: {'lr': 0.0004699275659090665, 'samples': 4866432, 'steps': 25345, 'loss/train': 1.6964845657348633} 11/07/2021 00:47:12 - INFO - __main__ - Step 25347: {'lr': 0.000469925042449001, 'samples': 4866624, 'steps': 25346, 'loss/train': 1.242707371711731} 11/07/2021 00:47:12 - INFO - __main__ - Step 25348: {'lr': 0.0004699225188898407, 'samples': 4866816, 'steps': 25347, 'loss/train': 1.0095255374908447} 11/07/2021 00:47:13 - INFO - __main__ - Step 25349: {'lr': 0.00046991999523158666, 'samples': 4867008, 'steps': 25348, 'loss/train': 1.6084877252578735} 11/07/2021 00:47:13 - INFO - __main__ - Step 25350: {'lr': 0.0004699174714742401, 'samples': 4867200, 'steps': 25349, 'loss/train': 1.6201637983322144} 11/07/2021 00:47:14 - INFO - __main__ - Step 25351: {'lr': 0.0004699149476178022, 'samples': 4867392, 'steps': 25350, 'loss/train': 1.87870192527771} 11/07/2021 00:47:14 - INFO - __main__ - Step 25352: {'lr': 0.00046991242366227395, 'samples': 4867584, 'steps': 25351, 'loss/train': 0.11543884128332138} 11/07/2021 00:47:15 - INFO - __main__ - Step 25353: {'lr': 0.0004699098996076565, 'samples': 4867776, 'steps': 25352, 'loss/train': 1.4073505401611328} 11/07/2021 00:47:15 - INFO - __main__ - Step 25354: {'lr': 0.0004699073754539511, 'samples': 4867968, 'steps': 25353, 'loss/train': 1.281815767288208} 11/07/2021 00:47:15 - INFO - __main__ - Step 25355: {'lr': 0.0004699048512011588, 'samples': 4868160, 'steps': 25354, 'loss/train': 2.1738197803497314} 11/07/2021 00:47:17 - INFO - __main__ - Step 25356: {'lr': 0.0004699023268492808, 'samples': 4868352, 'steps': 25355, 'loss/train': 1.910334587097168} 11/07/2021 00:47:17 - INFO - __main__ - Step 25357: {'lr': 0.0004698998023983182, 'samples': 4868544, 'steps': 25356, 'loss/train': 1.4859049320220947} 11/07/2021 00:47:17 - INFO - __main__ - Step 25358: {'lr': 0.0004698972778482722, 'samples': 4868736, 'steps': 25357, 'loss/train': 1.4127469062805176} 11/07/2021 00:47:18 - INFO - __main__ - Step 25359: {'lr': 0.0004698947531991438, 'samples': 4868928, 'steps': 25358, 'loss/train': 1.5658401250839233} 11/07/2021 00:47:18 - INFO - __main__ - Step 25360: {'lr': 0.0004698922284509342, 'samples': 4869120, 'steps': 25359, 'loss/train': 1.7960599660873413} 11/07/2021 00:47:19 - INFO - __main__ - Step 25361: {'lr': 0.00046988970360364456, 'samples': 4869312, 'steps': 25360, 'loss/train': 1.8280835151672363} 11/07/2021 00:47:19 - INFO - __main__ - Step 25362: {'lr': 0.0004698871786572761, 'samples': 4869504, 'steps': 25361, 'loss/train': 1.5560333728790283} 11/07/2021 00:47:20 - INFO - __main__ - Step 25363: {'lr': 0.0004698846536118298, 'samples': 4869696, 'steps': 25362, 'loss/train': 1.4659595489501953} 11/07/2021 00:47:20 - INFO - __main__ - Step 25364: {'lr': 0.00046988212846730686, 'samples': 4869888, 'steps': 25363, 'loss/train': 1.324087142944336} 11/07/2021 00:47:21 - INFO - __main__ - Step 25365: {'lr': 0.0004698796032237085, 'samples': 4870080, 'steps': 25364, 'loss/train': 1.577052354812622} 11/07/2021 00:47:22 - INFO - __main__ - Step 25366: {'lr': 0.0004698770778810357, 'samples': 4870272, 'steps': 25365, 'loss/train': 1.8698196411132812} 11/07/2021 00:47:22 - INFO - __main__ - Step 25367: {'lr': 0.00046987455243928974, 'samples': 4870464, 'steps': 25366, 'loss/train': 1.6207579374313354} 11/07/2021 00:47:22 - INFO - __main__ - Step 25368: {'lr': 0.00046987202689847165, 'samples': 4870656, 'steps': 25367, 'loss/train': 1.9728107452392578} 11/07/2021 00:47:23 - INFO - __main__ - Step 25369: {'lr': 0.00046986950125858264, 'samples': 4870848, 'steps': 25368, 'loss/train': 1.1386518478393555} 11/07/2021 00:47:23 - INFO - __main__ - Step 25370: {'lr': 0.0004698669755196239, 'samples': 4871040, 'steps': 25369, 'loss/train': 1.3435550928115845} 11/07/2021 00:47:24 - INFO - __main__ - Step 25371: {'lr': 0.0004698644496815964, 'samples': 4871232, 'steps': 25370, 'loss/train': 1.5909816026687622} 11/07/2021 00:47:24 - INFO - __main__ - Step 25372: {'lr': 0.0004698619237445013, 'samples': 4871424, 'steps': 25371, 'loss/train': 1.7607204914093018} 11/07/2021 00:47:25 - INFO - __main__ - Step 25373: {'lr': 0.00046985939770834, 'samples': 4871616, 'steps': 25372, 'loss/train': 1.651863932609558} 11/07/2021 00:47:25 - INFO - __main__ - Step 25374: {'lr': 0.0004698568715731133, 'samples': 4871808, 'steps': 25373, 'loss/train': 1.8538455963134766} 11/07/2021 00:47:25 - INFO - __main__ - Step 25375: {'lr': 0.00046985434533882255, 'samples': 4872000, 'steps': 25374, 'loss/train': 1.5987008810043335} 11/07/2021 00:47:27 - INFO - __main__ - Step 25376: {'lr': 0.00046985181900546883, 'samples': 4872192, 'steps': 25375, 'loss/train': 1.637808084487915} 11/07/2021 00:47:27 - INFO - __main__ - Step 25377: {'lr': 0.0004698492925730532, 'samples': 4872384, 'steps': 25376, 'loss/train': 1.5053311586380005} 11/07/2021 00:47:27 - INFO - __main__ - Step 25378: {'lr': 0.00046984676604157696, 'samples': 4872576, 'steps': 25377, 'loss/train': 1.6534550189971924} 11/07/2021 00:47:28 - INFO - __main__ - Step 25379: {'lr': 0.0004698442394110411, 'samples': 4872768, 'steps': 25378, 'loss/train': 1.3638150691986084} 11/07/2021 00:47:28 - INFO - __main__ - Step 25380: {'lr': 0.0004698417126814468, 'samples': 4872960, 'steps': 25379, 'loss/train': 1.1215922832489014} 11/07/2021 00:47:28 - INFO - __main__ - Step 25381: {'lr': 0.0004698391858527953, 'samples': 4873152, 'steps': 25380, 'loss/train': 1.7826118469238281} 11/07/2021 00:47:29 - INFO - __main__ - Step 25382: {'lr': 0.0004698366589250876, 'samples': 4873344, 'steps': 25381, 'loss/train': 1.8529301881790161} 11/07/2021 00:47:30 - INFO - __main__ - Step 25383: {'lr': 0.0004698341318983249, 'samples': 4873536, 'steps': 25382, 'loss/train': 0.7573006749153137} 11/07/2021 00:47:30 - INFO - __main__ - Step 25384: {'lr': 0.00046983160477250837, 'samples': 4873728, 'steps': 25383, 'loss/train': 1.7131630182266235} 11/07/2021 00:47:30 - INFO - __main__ - Step 25385: {'lr': 0.00046982907754763905, 'samples': 4873920, 'steps': 25384, 'loss/train': 1.6886578798294067} 11/07/2021 00:47:31 - INFO - __main__ - Step 25386: {'lr': 0.0004698265502237182, 'samples': 4874112, 'steps': 25385, 'loss/train': 1.656398892402649} 11/07/2021 00:47:32 - INFO - __main__ - Step 25387: {'lr': 0.0004698240228007469, 'samples': 4874304, 'steps': 25386, 'loss/train': 1.6194496154785156} 11/07/2021 00:47:32 - INFO - __main__ - Step 25388: {'lr': 0.0004698214952787262, 'samples': 4874496, 'steps': 25387, 'loss/train': 1.8319562673568726} 11/07/2021 00:47:33 - INFO - __main__ - Step 25389: {'lr': 0.0004698189676576574, 'samples': 4874688, 'steps': 25388, 'loss/train': 0.9109418988227844} 11/07/2021 00:47:33 - INFO - __main__ - Step 25390: {'lr': 0.00046981643993754155, 'samples': 4874880, 'steps': 25389, 'loss/train': 1.946299433708191} 11/07/2021 00:47:33 - INFO - __main__ - Step 25391: {'lr': 0.0004698139121183798, 'samples': 4875072, 'steps': 25390, 'loss/train': 1.7080029249191284} 11/07/2021 00:47:34 - INFO - __main__ - Step 25392: {'lr': 0.00046981138420017335, 'samples': 4875264, 'steps': 25391, 'loss/train': 1.9860248565673828} 11/07/2021 00:47:35 - INFO - __main__ - Step 25393: {'lr': 0.00046980885618292317, 'samples': 4875456, 'steps': 25392, 'loss/train': 1.5345799922943115} 11/07/2021 00:47:35 - INFO - __main__ - Step 25394: {'lr': 0.0004698063280666306, 'samples': 4875648, 'steps': 25393, 'loss/train': 1.693505883216858} 11/07/2021 00:47:35 - INFO - __main__ - Step 25395: {'lr': 0.0004698037998512966, 'samples': 4875840, 'steps': 25394, 'loss/train': 1.2428244352340698} 11/07/2021 00:47:36 - INFO - __main__ - Step 25396: {'lr': 0.00046980127153692256, 'samples': 4876032, 'steps': 25395, 'loss/train': 1.3942155838012695} 11/07/2021 00:47:36 - INFO - __main__ - Step 25397: {'lr': 0.00046979874312350935, 'samples': 4876224, 'steps': 25396, 'loss/train': 1.598301887512207} 11/07/2021 00:47:37 - INFO - __main__ - Step 25398: {'lr': 0.00046979621461105817, 'samples': 4876416, 'steps': 25397, 'loss/train': 1.7649084329605103} 11/07/2021 00:47:37 - INFO - __main__ - Step 25399: {'lr': 0.0004697936859995703, 'samples': 4876608, 'steps': 25398, 'loss/train': 1.5303013324737549} 11/07/2021 00:47:38 - INFO - __main__ - Step 25400: {'lr': 0.00046979115728904675, 'samples': 4876800, 'steps': 25399, 'loss/train': 1.4458239078521729} 11/07/2021 00:47:38 - INFO - __main__ - Step 25401: {'lr': 0.0004697886284794887, 'samples': 4876992, 'steps': 25400, 'loss/train': 1.4247881174087524} 11/07/2021 00:47:38 - INFO - __main__ - Step 25402: {'lr': 0.00046978609957089724, 'samples': 4877184, 'steps': 25401, 'loss/train': 1.2821444272994995} 11/07/2021 00:47:40 - INFO - __main__ - Step 25403: {'lr': 0.0004697835705632736, 'samples': 4877376, 'steps': 25402, 'loss/train': 1.5100736618041992} 11/07/2021 00:47:40 - INFO - __main__ - Step 25404: {'lr': 0.00046978104145661885, 'samples': 4877568, 'steps': 25403, 'loss/train': 1.4307785034179688} 11/07/2021 00:47:40 - INFO - __main__ - Step 25405: {'lr': 0.00046977851225093423, 'samples': 4877760, 'steps': 25404, 'loss/train': 2.60892915725708} 11/07/2021 00:47:41 - INFO - __main__ - Step 25406: {'lr': 0.0004697759829462207, 'samples': 4877952, 'steps': 25405, 'loss/train': 1.7201303243637085} 11/07/2021 00:47:41 - INFO - __main__ - Step 25407: {'lr': 0.0004697734535424796, 'samples': 4878144, 'steps': 25406, 'loss/train': 1.4767811298370361} 11/07/2021 00:47:42 - INFO - __main__ - Step 25408: {'lr': 0.0004697709240397119, 'samples': 4878336, 'steps': 25407, 'loss/train': 1.5929255485534668} 11/07/2021 00:47:42 - INFO - __main__ - Step 25409: {'lr': 0.00046976839443791887, 'samples': 4878528, 'steps': 25408, 'loss/train': 1.598000168800354} 11/07/2021 00:47:43 - INFO - __main__ - Step 25410: {'lr': 0.00046976586473710156, 'samples': 4878720, 'steps': 25409, 'loss/train': 1.6824252605438232} 11/07/2021 00:47:43 - INFO - __main__ - Step 25411: {'lr': 0.0004697633349372611, 'samples': 4878912, 'steps': 25410, 'loss/train': 1.4303624629974365} 11/07/2021 00:47:43 - INFO - __main__ - Step 25412: {'lr': 0.00046976080503839874, 'samples': 4879104, 'steps': 25411, 'loss/train': 1.5920426845550537} 11/07/2021 00:47:44 - INFO - __main__ - Step 25413: {'lr': 0.0004697582750405155, 'samples': 4879296, 'steps': 25412, 'loss/train': 1.718862771987915} 11/07/2021 00:47:45 - INFO - __main__ - Step 25414: {'lr': 0.00046975574494361263, 'samples': 4879488, 'steps': 25413, 'loss/train': 1.545186996459961} 11/07/2021 00:47:45 - INFO - __main__ - Step 25415: {'lr': 0.00046975321474769115, 'samples': 4879680, 'steps': 25414, 'loss/train': 1.6828172206878662} 11/07/2021 00:47:45 - INFO - __main__ - Step 25416: {'lr': 0.0004697506844527523, 'samples': 4879872, 'steps': 25415, 'loss/train': 1.047653079032898} 11/07/2021 00:47:46 - INFO - __main__ - Step 25417: {'lr': 0.0004697481540587972, 'samples': 4880064, 'steps': 25416, 'loss/train': 1.7983813285827637} 11/07/2021 00:47:47 - INFO - __main__ - Step 25418: {'lr': 0.00046974562356582694, 'samples': 4880256, 'steps': 25417, 'loss/train': 1.5987099409103394} 11/07/2021 00:47:47 - INFO - __main__ - Step 25419: {'lr': 0.0004697430929738427, 'samples': 4880448, 'steps': 25418, 'loss/train': 1.3499351739883423} 11/07/2021 00:47:48 - INFO - __main__ - Step 25420: {'lr': 0.0004697405622828456, 'samples': 4880640, 'steps': 25419, 'loss/train': 1.1702313423156738} 11/07/2021 00:47:48 - INFO - __main__ - Step 25421: {'lr': 0.00046973803149283686, 'samples': 4880832, 'steps': 25420, 'loss/train': 1.7239376306533813} 11/07/2021 00:47:48 - INFO - __main__ - Step 25422: {'lr': 0.0004697355006038175, 'samples': 4881024, 'steps': 25421, 'loss/train': 1.0425162315368652} 11/07/2021 00:47:49 - INFO - __main__ - Step 25423: {'lr': 0.0004697329696157887, 'samples': 4881216, 'steps': 25422, 'loss/train': 0.6917325854301453} 11/07/2021 00:47:50 - INFO - __main__ - Step 25424: {'lr': 0.00046973043852875163, 'samples': 4881408, 'steps': 25423, 'loss/train': 1.6703858375549316} 11/07/2021 00:47:50 - INFO - __main__ - Step 25425: {'lr': 0.00046972790734270745, 'samples': 4881600, 'steps': 25424, 'loss/train': 1.1467617750167847} 11/07/2021 00:47:50 - INFO - __main__ - Step 25426: {'lr': 0.0004697253760576572, 'samples': 4881792, 'steps': 25425, 'loss/train': 1.9418889284133911} 11/07/2021 00:47:51 - INFO - __main__ - Step 25427: {'lr': 0.00046972284467360217, 'samples': 4881984, 'steps': 25426, 'loss/train': 1.6337294578552246} 11/07/2021 00:47:52 - INFO - __main__ - Step 25428: {'lr': 0.0004697203131905433, 'samples': 4882176, 'steps': 25427, 'loss/train': 1.8798770904541016} 11/07/2021 00:47:52 - INFO - __main__ - Step 25429: {'lr': 0.00046971778160848196, 'samples': 4882368, 'steps': 25428, 'loss/train': 1.0612508058547974} 11/07/2021 00:47:52 - INFO - __main__ - Step 25430: {'lr': 0.0004697152499274191, 'samples': 4882560, 'steps': 25429, 'loss/train': 1.4720540046691895} 11/07/2021 00:47:53 - INFO - __main__ - Step 25431: {'lr': 0.00046971271814735593, 'samples': 4882752, 'steps': 25430, 'loss/train': 1.638587236404419} 11/07/2021 00:47:53 - INFO - __main__ - Step 25432: {'lr': 0.0004697101862682936, 'samples': 4882944, 'steps': 25431, 'loss/train': 1.127353310585022} 11/07/2021 00:47:54 - INFO - __main__ - Step 25433: {'lr': 0.00046970765429023336, 'samples': 4883136, 'steps': 25432, 'loss/train': 1.723750352859497} 11/07/2021 00:47:54 - INFO - __main__ - Step 25434: {'lr': 0.00046970512221317616, 'samples': 4883328, 'steps': 25433, 'loss/train': 1.5742734670639038} 11/07/2021 00:47:55 - INFO - __main__ - Step 25435: {'lr': 0.00046970259003712323, 'samples': 4883520, 'steps': 25434, 'loss/train': 1.4120583534240723} 11/07/2021 00:47:55 - INFO - __main__ - Step 25436: {'lr': 0.00046970005776207575, 'samples': 4883712, 'steps': 25435, 'loss/train': 1.7156976461410522} 11/07/2021 00:47:55 - INFO - __main__ - Step 25437: {'lr': 0.00046969752538803477, 'samples': 4883904, 'steps': 25436, 'loss/train': 1.1509177684783936} 11/07/2021 00:47:57 - INFO - __main__ - Step 25438: {'lr': 0.0004696949929150015, 'samples': 4884096, 'steps': 25437, 'loss/train': 1.466771125793457} 11/07/2021 00:47:57 - INFO - __main__ - Step 25439: {'lr': 0.00046969246034297697, 'samples': 4884288, 'steps': 25438, 'loss/train': 1.768174171447754} 11/07/2021 00:47:57 - INFO - __main__ - Step 25440: {'lr': 0.0004696899276719625, 'samples': 4884480, 'steps': 25439, 'loss/train': 1.7499266862869263} 11/07/2021 00:47:58 - INFO - __main__ - Step 25441: {'lr': 0.0004696873949019591, 'samples': 4884672, 'steps': 25440, 'loss/train': 1.9094892740249634} 11/07/2021 00:47:58 - INFO - __main__ - Step 25442: {'lr': 0.000469684862032968, 'samples': 4884864, 'steps': 25441, 'loss/train': 1.8627663850784302} 11/07/2021 00:47:58 - INFO - __main__ - Step 25443: {'lr': 0.0004696823290649902, 'samples': 4885056, 'steps': 25442, 'loss/train': 1.0269948244094849} 11/07/2021 00:48:00 - INFO - __main__ - Step 25444: {'lr': 0.000469679795998027, 'samples': 4885248, 'steps': 25443, 'loss/train': 2.533825635910034} 11/07/2021 00:48:00 - INFO - __main__ - Step 25445: {'lr': 0.00046967726283207945, 'samples': 4885440, 'steps': 25444, 'loss/train': 0.38005244731903076} 11/07/2021 00:48:00 - INFO - __main__ - Step 25446: {'lr': 0.0004696747295671487, 'samples': 4885632, 'steps': 25445, 'loss/train': 1.8956398963928223} 11/07/2021 00:48:01 - INFO - __main__ - Step 25447: {'lr': 0.000469672196203236, 'samples': 4885824, 'steps': 25446, 'loss/train': 1.2691547870635986} 11/07/2021 00:48:01 - INFO - __main__ - Step 25448: {'lr': 0.0004696696627403423, 'samples': 4886016, 'steps': 25447, 'loss/train': 1.8741952180862427} 11/07/2021 00:48:01 - INFO - __main__ - Step 25449: {'lr': 0.00046966712917846887, 'samples': 4886208, 'steps': 25448, 'loss/train': 1.662883996963501} 11/07/2021 00:48:02 - INFO - __main__ - Step 25450: {'lr': 0.00046966459551761684, 'samples': 4886400, 'steps': 25449, 'loss/train': 0.8091161847114563} 11/07/2021 00:48:03 - INFO - __main__ - Step 25451: {'lr': 0.00046966206175778723, 'samples': 4886592, 'steps': 25450, 'loss/train': 1.5671682357788086} 11/07/2021 00:48:03 - INFO - __main__ - Step 25452: {'lr': 0.0004696595278989814, 'samples': 4886784, 'steps': 25451, 'loss/train': 1.5331019163131714} 11/07/2021 00:48:03 - INFO - __main__ - Step 25453: {'lr': 0.00046965699394120033, 'samples': 4886976, 'steps': 25452, 'loss/train': 1.3922358751296997} 11/07/2021 00:48:04 - INFO - __main__ - Step 25454: {'lr': 0.0004696544598844452, 'samples': 4887168, 'steps': 25453, 'loss/train': 1.3637266159057617} 11/07/2021 00:48:05 - INFO - __main__ - Step 25455: {'lr': 0.00046965192572871723, 'samples': 4887360, 'steps': 25454, 'loss/train': 2.091482400894165} 11/07/2021 00:48:05 - INFO - __main__ - Step 25456: {'lr': 0.0004696493914740174, 'samples': 4887552, 'steps': 25455, 'loss/train': 1.694266676902771} 11/07/2021 00:48:05 - INFO - __main__ - Step 25457: {'lr': 0.00046964685712034697, 'samples': 4887744, 'steps': 25456, 'loss/train': 1.2359060049057007} 11/07/2021 00:48:06 - INFO - __main__ - Step 25458: {'lr': 0.00046964432266770713, 'samples': 4887936, 'steps': 25457, 'loss/train': 2.6210789680480957} 11/07/2021 00:48:06 - INFO - __main__ - Step 25459: {'lr': 0.0004696417881160989, 'samples': 4888128, 'steps': 25458, 'loss/train': 1.7765331268310547} 11/07/2021 00:48:07 - INFO - __main__ - Step 25460: {'lr': 0.0004696392534655234, 'samples': 4888320, 'steps': 25459, 'loss/train': 1.8273237943649292} 11/07/2021 00:48:07 - INFO - __main__ - Step 25461: {'lr': 0.0004696367187159819, 'samples': 4888512, 'steps': 25460, 'loss/train': 0.9927650094032288} 11/07/2021 00:48:08 - INFO - __main__ - Step 25462: {'lr': 0.00046963418386747547, 'samples': 4888704, 'steps': 25461, 'loss/train': 0.6545084118843079} 11/07/2021 00:48:08 - INFO - __main__ - Step 25463: {'lr': 0.0004696316489200053, 'samples': 4888896, 'steps': 25462, 'loss/train': 1.4440584182739258} 11/07/2021 00:48:08 - INFO - __main__ - Step 25464: {'lr': 0.00046962911387357246, 'samples': 4889088, 'steps': 25463, 'loss/train': 1.741220474243164} 11/07/2021 00:48:09 - INFO - __main__ - Step 25465: {'lr': 0.0004696265787281782, 'samples': 4889280, 'steps': 25464, 'loss/train': 1.5621095895767212} 11/07/2021 00:48:10 - INFO - __main__ - Step 25466: {'lr': 0.0004696240434838235, 'samples': 4889472, 'steps': 25465, 'loss/train': 1.0889923572540283} 11/07/2021 00:48:10 - INFO - __main__ - Step 25467: {'lr': 0.00046962150814050963, 'samples': 4889664, 'steps': 25466, 'loss/train': 1.4495553970336914} 11/07/2021 00:48:10 - INFO - __main__ - Step 25468: {'lr': 0.0004696189726982377, 'samples': 4889856, 'steps': 25467, 'loss/train': 1.2441531419754028} 11/07/2021 00:48:11 - INFO - __main__ - Step 25469: {'lr': 0.00046961643715700885, 'samples': 4890048, 'steps': 25468, 'loss/train': 1.3702870607376099} 11/07/2021 00:48:12 - INFO - __main__ - Step 25470: {'lr': 0.00046961390151682426, 'samples': 4890240, 'steps': 25469, 'loss/train': 1.9373600482940674} 11/07/2021 00:48:12 - INFO - __main__ - Step 25471: {'lr': 0.000469611365777685, 'samples': 4890432, 'steps': 25470, 'loss/train': 1.69800865650177} 11/07/2021 00:48:13 - INFO - __main__ - Step 25472: {'lr': 0.0004696088299395922, 'samples': 4890624, 'steps': 25471, 'loss/train': 0.9789637923240662} 11/07/2021 00:48:13 - INFO - __main__ - Step 25473: {'lr': 0.0004696062940025471, 'samples': 4890816, 'steps': 25472, 'loss/train': 1.6684023141860962} 11/07/2021 00:48:13 - INFO - __main__ - Step 25474: {'lr': 0.0004696037579665509, 'samples': 4891008, 'steps': 25473, 'loss/train': 1.4424618482589722} 11/07/2021 00:48:14 - INFO - __main__ - Step 25475: {'lr': 0.00046960122183160446, 'samples': 4891200, 'steps': 25474, 'loss/train': 1.417409062385559} 11/07/2021 00:48:15 - INFO - __main__ - Step 25476: {'lr': 0.00046959868559770914, 'samples': 4891392, 'steps': 25475, 'loss/train': 1.8746399879455566} 11/07/2021 00:48:15 - INFO - __main__ - Step 25477: {'lr': 0.00046959614926486606, 'samples': 4891584, 'steps': 25476, 'loss/train': 1.3747764825820923} 11/07/2021 00:48:16 - INFO - __main__ - Step 25478: {'lr': 0.00046959361283307636, 'samples': 4891776, 'steps': 25477, 'loss/train': 2.681973457336426} 11/07/2021 00:48:16 - INFO - __main__ - Step 25479: {'lr': 0.0004695910763023412, 'samples': 4891968, 'steps': 25478, 'loss/train': 1.392314076423645} 11/07/2021 00:48:16 - INFO - __main__ - Step 25480: {'lr': 0.0004695885396726616, 'samples': 4892160, 'steps': 25479, 'loss/train': 1.7377015352249146} 11/07/2021 00:48:17 - INFO - __main__ - Step 25481: {'lr': 0.00046958600294403887, 'samples': 4892352, 'steps': 25480, 'loss/train': 1.588399887084961} 11/07/2021 00:48:18 - INFO - __main__ - Step 25482: {'lr': 0.000469583466116474, 'samples': 4892544, 'steps': 25481, 'loss/train': 1.237380862236023} 11/07/2021 00:48:18 - INFO - __main__ - Step 25483: {'lr': 0.00046958092918996823, 'samples': 4892736, 'steps': 25482, 'loss/train': 1.4674667119979858} 11/07/2021 00:48:18 - INFO - __main__ - Step 25484: {'lr': 0.0004695783921645227, 'samples': 4892928, 'steps': 25483, 'loss/train': 2.004992961883545} 11/07/2021 00:48:19 - INFO - __main__ - Step 25485: {'lr': 0.00046957585504013853, 'samples': 4893120, 'steps': 25484, 'loss/train': 1.6140029430389404} 11/07/2021 00:48:20 - INFO - __main__ - Step 25486: {'lr': 0.0004695733178168169, 'samples': 4893312, 'steps': 25485, 'loss/train': 2.1163148880004883} 11/07/2021 00:48:20 - INFO - __main__ - Step 25487: {'lr': 0.00046957078049455895, 'samples': 4893504, 'steps': 25486, 'loss/train': 1.0973618030548096} 11/07/2021 00:48:21 - INFO - __main__ - Step 25488: {'lr': 0.00046956824307336565, 'samples': 4893696, 'steps': 25487, 'loss/train': 1.2262341976165771} 11/07/2021 00:48:21 - INFO - __main__ - Step 25489: {'lr': 0.0004695657055532384, 'samples': 4893888, 'steps': 25488, 'loss/train': 1.464012622833252} 11/07/2021 00:48:21 - INFO - __main__ - Step 25490: {'lr': 0.0004695631679341782, 'samples': 4894080, 'steps': 25489, 'loss/train': 1.6566625833511353} 11/07/2021 00:48:23 - INFO - __main__ - Step 25491: {'lr': 0.0004695606302161862, 'samples': 4894272, 'steps': 25490, 'loss/train': 1.4544919729232788} 11/07/2021 00:48:24 - INFO - __main__ - Step 25492: {'lr': 0.0004695580923992636, 'samples': 4894464, 'steps': 25491, 'loss/train': 5.552804946899414} 11/07/2021 00:48:24 - INFO - __main__ - Step 25493: {'lr': 0.0004695555544834116, 'samples': 4894656, 'steps': 25492, 'loss/train': 5.530755043029785} 11/07/2021 00:48:24 - INFO - __main__ - Step 25494: {'lr': 0.00046955301646863114, 'samples': 4894848, 'steps': 25493, 'loss/train': 5.656627178192139} 11/07/2021 00:48:25 - INFO - __main__ - Step 25495: {'lr': 0.0004695504783549235, 'samples': 4895040, 'steps': 25494, 'loss/train': 5.666793346405029} 11/07/2021 00:48:25 - INFO - __main__ - Step 25496: {'lr': 0.0004695479401422898, 'samples': 4895232, 'steps': 25495, 'loss/train': 1.751136302947998} 11/07/2021 00:48:25 - INFO - __main__ - Step 25497: {'lr': 0.0004695454018307312, 'samples': 4895424, 'steps': 25496, 'loss/train': 1.942580223083496} 11/07/2021 00:48:26 - INFO - __main__ - Step 25498: {'lr': 0.0004695428634202488, 'samples': 4895616, 'steps': 25497, 'loss/train': 1.7638825178146362} 11/07/2021 00:48:27 - INFO - __main__ - Step 25499: {'lr': 0.0004695403249108438, 'samples': 4895808, 'steps': 25498, 'loss/train': 1.5841854810714722} 11/07/2021 00:48:27 - INFO - __main__ - Step 25500: {'lr': 0.0004695377863025173, 'samples': 4896000, 'steps': 25499, 'loss/train': 1.8827564716339111} 11/07/2021 00:48:28 - INFO - __main__ - Step 25501: {'lr': 0.00046953524759527055, 'samples': 4896192, 'steps': 25500, 'loss/train': 1.5366376638412476} 11/07/2021 00:48:28 - INFO - __main__ - Step 25502: {'lr': 0.0004695327087891045, 'samples': 4896384, 'steps': 25501, 'loss/train': 1.039491891860962} 11/07/2021 00:48:28 - INFO - __main__ - Step 25503: {'lr': 0.00046953016988402044, 'samples': 4896576, 'steps': 25502, 'loss/train': 1.6493101119995117} 11/07/2021 00:48:29 - INFO - __main__ - Step 25504: {'lr': 0.0004695276308800194, 'samples': 4896768, 'steps': 25503, 'loss/train': 1.5515714883804321} 11/07/2021 00:48:30 - INFO - __main__ - Step 25505: {'lr': 0.00046952509177710267, 'samples': 4896960, 'steps': 25504, 'loss/train': 1.435947299003601} 11/07/2021 00:48:30 - INFO - __main__ - Step 25506: {'lr': 0.00046952255257527134, 'samples': 4897152, 'steps': 25505, 'loss/train': 1.8032622337341309} 11/07/2021 00:48:31 - INFO - __main__ - Step 25507: {'lr': 0.0004695200132745265, 'samples': 4897344, 'steps': 25506, 'loss/train': 1.7287282943725586} 11/07/2021 00:48:31 - INFO - __main__ - Step 25508: {'lr': 0.00046951747387486933, 'samples': 4897536, 'steps': 25507, 'loss/train': 1.765174388885498} 11/07/2021 00:48:32 - INFO - __main__ - Step 25509: {'lr': 0.00046951493437630097, 'samples': 4897728, 'steps': 25508, 'loss/train': 1.709006667137146} 11/07/2021 00:48:33 - INFO - __main__ - Step 25510: {'lr': 0.0004695123947788226, 'samples': 4897920, 'steps': 25509, 'loss/train': 1.191457748413086} 11/07/2021 00:48:33 - INFO - __main__ - Step 25511: {'lr': 0.0004695098550824353, 'samples': 4898112, 'steps': 25510, 'loss/train': 2.3190207481384277} 11/07/2021 00:48:33 - INFO - __main__ - Step 25512: {'lr': 0.0004695073152871403, 'samples': 4898304, 'steps': 25511, 'loss/train': 1.6699031591415405} 11/07/2021 00:48:34 - INFO - __main__ - Step 25513: {'lr': 0.00046950477539293864, 'samples': 4898496, 'steps': 25512, 'loss/train': 0.8998978734016418} 11/07/2021 00:48:35 - INFO - __main__ - Step 25514: {'lr': 0.0004695022353998315, 'samples': 4898688, 'steps': 25513, 'loss/train': 0.20698074996471405} 11/07/2021 00:48:35 - INFO - __main__ - Step 25515: {'lr': 0.0004694996953078201, 'samples': 4898880, 'steps': 25514, 'loss/train': 1.1759369373321533} 11/07/2021 00:48:35 - INFO - __main__ - Step 25516: {'lr': 0.0004694971551169055, 'samples': 4899072, 'steps': 25515, 'loss/train': 2.215197801589966} 11/07/2021 00:48:36 - INFO - __main__ - Step 25517: {'lr': 0.00046949461482708875, 'samples': 4899264, 'steps': 25516, 'loss/train': 1.4564876556396484} 11/07/2021 00:48:36 - INFO - __main__ - Step 25518: {'lr': 0.0004694920744383713, 'samples': 4899456, 'steps': 25517, 'loss/train': 1.4380667209625244} 11/07/2021 00:48:37 - INFO - __main__ - Step 25519: {'lr': 0.000469489533950754, 'samples': 4899648, 'steps': 25518, 'loss/train': 2.123898506164551} 11/07/2021 00:48:37 - INFO - __main__ - Step 25520: {'lr': 0.00046948699336423817, 'samples': 4899840, 'steps': 25519, 'loss/train': 1.817137598991394} 11/07/2021 00:48:38 - INFO - __main__ - Step 25521: {'lr': 0.0004694844526788248, 'samples': 4900032, 'steps': 25520, 'loss/train': 1.683441162109375} 11/07/2021 00:48:38 - INFO - __main__ - Step 25522: {'lr': 0.0004694819118945152, 'samples': 4900224, 'steps': 25521, 'loss/train': 1.1400400400161743} 11/07/2021 00:48:39 - INFO - __main__ - Step 25523: {'lr': 0.00046947937101131046, 'samples': 4900416, 'steps': 25522, 'loss/train': 2.1464781761169434} 11/07/2021 00:48:39 - INFO - __main__ - Step 25524: {'lr': 0.0004694768300292116, 'samples': 4900608, 'steps': 25523, 'loss/train': 6.091759204864502} 11/07/2021 00:48:40 - INFO - __main__ - Step 25525: {'lr': 0.0004694742889482199, 'samples': 4900800, 'steps': 25524, 'loss/train': 1.5668319463729858} 11/07/2021 00:48:40 - INFO - __main__ - Step 25526: {'lr': 0.0004694717477683365, 'samples': 4900992, 'steps': 25525, 'loss/train': 1.4446377754211426} 11/07/2021 00:48:41 - INFO - __main__ - Step 25527: {'lr': 0.0004694692064895625, 'samples': 4901184, 'steps': 25526, 'loss/train': 0.9974812865257263} 11/07/2021 00:48:41 - INFO - __main__ - Step 25528: {'lr': 0.0004694666651118991, 'samples': 4901376, 'steps': 25527, 'loss/train': 1.7450157403945923} 11/07/2021 00:48:41 - INFO - __main__ - Step 25529: {'lr': 0.00046946412363534735, 'samples': 4901568, 'steps': 25528, 'loss/train': 1.9022449254989624} 11/07/2021 00:48:43 - INFO - __main__ - Step 25530: {'lr': 0.0004694615820599085, 'samples': 4901760, 'steps': 25529, 'loss/train': 1.3847417831420898} 11/07/2021 00:48:43 - INFO - __main__ - Step 25531: {'lr': 0.00046945904038558364, 'samples': 4901952, 'steps': 25530, 'loss/train': 1.3199286460876465} 11/07/2021 00:48:43 - INFO - __main__ - Step 25532: {'lr': 0.00046945649861237387, 'samples': 4902144, 'steps': 25531, 'loss/train': 1.8600738048553467} 11/07/2021 00:48:44 - INFO - __main__ - Step 25533: {'lr': 0.00046945395674028047, 'samples': 4902336, 'steps': 25532, 'loss/train': 1.9562991857528687} 11/07/2021 00:48:44 - INFO - __main__ - Step 25534: {'lr': 0.0004694514147693044, 'samples': 4902528, 'steps': 25533, 'loss/train': 1.8586628437042236} 11/07/2021 00:48:45 - INFO - __main__ - Step 25535: {'lr': 0.000469448872699447, 'samples': 4902720, 'steps': 25534, 'loss/train': 1.3385511636734009} 11/07/2021 00:48:45 - INFO - __main__ - Step 25536: {'lr': 0.0004694463305307093, 'samples': 4902912, 'steps': 25535, 'loss/train': 1.7087907791137695} 11/07/2021 00:48:46 - INFO - __main__ - Step 25537: {'lr': 0.00046944378826309244, 'samples': 4903104, 'steps': 25536, 'loss/train': 1.7909225225448608} 11/07/2021 00:48:46 - INFO - __main__ - Step 25538: {'lr': 0.00046944124589659765, 'samples': 4903296, 'steps': 25537, 'loss/train': 1.7152092456817627} 11/07/2021 00:48:47 - INFO - __main__ - Step 25539: {'lr': 0.00046943870343122595, 'samples': 4903488, 'steps': 25538, 'loss/train': 1.7453871965408325} 11/07/2021 00:48:47 - INFO - __main__ - Step 25540: {'lr': 0.0004694361608669786, 'samples': 4903680, 'steps': 25539, 'loss/train': 1.4773492813110352} 11/07/2021 00:48:47 - INFO - __main__ - Step 25541: {'lr': 0.0004694336182038567, 'samples': 4903872, 'steps': 25540, 'loss/train': 1.7164955139160156} 11/07/2021 00:48:48 - INFO - __main__ - Step 25542: {'lr': 0.00046943107544186144, 'samples': 4904064, 'steps': 25541, 'loss/train': 1.448752760887146} 11/07/2021 00:48:49 - INFO - __main__ - Step 25543: {'lr': 0.0004694285325809938, 'samples': 4904256, 'steps': 25542, 'loss/train': 1.6738860607147217} 11/07/2021 00:48:49 - INFO - __main__ - Step 25544: {'lr': 0.00046942598962125515, 'samples': 4904448, 'steps': 25543, 'loss/train': 1.511159062385559} 11/07/2021 00:48:49 - INFO - __main__ - Step 25545: {'lr': 0.00046942344656264657, 'samples': 4904640, 'steps': 25544, 'loss/train': 1.6937353610992432} 11/07/2021 00:48:50 - INFO - __main__ - Step 25546: {'lr': 0.0004694209034051691, 'samples': 4904832, 'steps': 25545, 'loss/train': 1.914302945137024} 11/07/2021 00:48:51 - INFO - __main__ - Step 25547: {'lr': 0.00046941836014882394, 'samples': 4905024, 'steps': 25546, 'loss/train': 1.5163538455963135} 11/07/2021 00:48:51 - INFO - __main__ - Step 25548: {'lr': 0.00046941581679361234, 'samples': 4905216, 'steps': 25547, 'loss/train': 1.1521172523498535} 11/07/2021 00:48:51 - INFO - __main__ - Step 25549: {'lr': 0.00046941327333953526, 'samples': 4905408, 'steps': 25548, 'loss/train': 1.5841848850250244} 11/07/2021 00:48:52 - INFO - __main__ - Step 25550: {'lr': 0.00046941072978659397, 'samples': 4905600, 'steps': 25549, 'loss/train': 1.1602411270141602} 11/07/2021 00:48:52 - INFO - __main__ - Step 25551: {'lr': 0.00046940818613478964, 'samples': 4905792, 'steps': 25550, 'loss/train': 1.572580337524414} 11/07/2021 00:48:53 - INFO - __main__ - Step 25552: {'lr': 0.0004694056423841233, 'samples': 4905984, 'steps': 25551, 'loss/train': 1.8355903625488281} 11/07/2021 00:48:54 - INFO - __main__ - Step 25553: {'lr': 0.00046940309853459625, 'samples': 4906176, 'steps': 25552, 'loss/train': 1.4142526388168335} 11/07/2021 00:48:54 - INFO - __main__ - Step 25554: {'lr': 0.00046940055458620945, 'samples': 4906368, 'steps': 25553, 'loss/train': 1.5380394458770752} 11/07/2021 00:48:54 - INFO - __main__ - Step 25555: {'lr': 0.0004693980105389642, 'samples': 4906560, 'steps': 25554, 'loss/train': 1.888519287109375} 11/07/2021 00:48:55 - INFO - __main__ - Step 25556: {'lr': 0.00046939546639286156, 'samples': 4906752, 'steps': 25555, 'loss/train': 1.2919965982437134} 11/07/2021 00:48:56 - INFO - __main__ - Step 25557: {'lr': 0.00046939292214790275, 'samples': 4906944, 'steps': 25556, 'loss/train': 1.5348103046417236} 11/07/2021 00:48:56 - INFO - __main__ - Step 25558: {'lr': 0.0004693903778040889, 'samples': 4907136, 'steps': 25557, 'loss/train': 1.0258276462554932} 11/07/2021 00:48:56 - INFO - __main__ - Step 25559: {'lr': 0.0004693878333614211, 'samples': 4907328, 'steps': 25558, 'loss/train': 1.624510407447815} 11/07/2021 00:48:57 - INFO - __main__ - Step 25560: {'lr': 0.0004693852888199005, 'samples': 4907520, 'steps': 25559, 'loss/train': 1.7044670581817627} 11/07/2021 00:48:57 - INFO - __main__ - Step 25561: {'lr': 0.0004693827441795283, 'samples': 4907712, 'steps': 25560, 'loss/train': 1.3589155673980713} 11/07/2021 00:48:57 - INFO - __main__ - Step 25562: {'lr': 0.00046938019944030556, 'samples': 4907904, 'steps': 25561, 'loss/train': 1.0206115245819092} 11/07/2021 00:48:58 - INFO - __main__ - Step 25563: {'lr': 0.00046937765460223357, 'samples': 4908096, 'steps': 25562, 'loss/train': 1.426313877105713} 11/07/2021 00:48:59 - INFO - __main__ - Step 25564: {'lr': 0.0004693751096653134, 'samples': 4908288, 'steps': 25563, 'loss/train': 1.781064510345459} 11/07/2021 00:48:59 - INFO - __main__ - Step 25565: {'lr': 0.00046937256462954615, 'samples': 4908480, 'steps': 25564, 'loss/train': 0.6571682691574097} 11/07/2021 00:48:59 - INFO - __main__ - Step 25566: {'lr': 0.00046937001949493294, 'samples': 4908672, 'steps': 25565, 'loss/train': 1.5839568376541138} 11/07/2021 00:49:00 - INFO - __main__ - Step 25567: {'lr': 0.0004693674742614751, 'samples': 4908864, 'steps': 25566, 'loss/train': 1.3333218097686768} 11/07/2021 00:49:01 - INFO - __main__ - Step 25568: {'lr': 0.0004693649289291736, 'samples': 4909056, 'steps': 25567, 'loss/train': 1.342308759689331} 11/07/2021 00:49:01 - INFO - __main__ - Step 25569: {'lr': 0.0004693623834980297, 'samples': 4909248, 'steps': 25568, 'loss/train': 1.3374139070510864} 11/07/2021 00:49:02 - INFO - __main__ - Step 25570: {'lr': 0.00046935983796804443, 'samples': 4909440, 'steps': 25569, 'loss/train': 0.830653727054596} 11/07/2021 00:49:02 - INFO - __main__ - Step 25571: {'lr': 0.000469357292339219, 'samples': 4909632, 'steps': 25570, 'loss/train': 1.6451069116592407} 11/07/2021 00:49:02 - INFO - __main__ - Step 25572: {'lr': 0.00046935474661155465, 'samples': 4909824, 'steps': 25571, 'loss/train': 1.5277132987976074} 11/07/2021 00:49:03 - INFO - __main__ - Step 25573: {'lr': 0.00046935220078505235, 'samples': 4910016, 'steps': 25572, 'loss/train': 1.750598430633545} 11/07/2021 00:49:04 - INFO - __main__ - Step 25574: {'lr': 0.00046934965485971337, 'samples': 4910208, 'steps': 25573, 'loss/train': 0.6289024353027344} 11/07/2021 00:49:04 - INFO - __main__ - Step 25575: {'lr': 0.00046934710883553884, 'samples': 4910400, 'steps': 25574, 'loss/train': 1.674824595451355} 11/07/2021 00:49:04 - INFO - __main__ - Step 25576: {'lr': 0.00046934456271252985, 'samples': 4910592, 'steps': 25575, 'loss/train': 1.5637032985687256} 11/07/2021 00:49:05 - INFO - __main__ - Step 25577: {'lr': 0.0004693420164906876, 'samples': 4910784, 'steps': 25576, 'loss/train': 1.707536220550537} 11/07/2021 00:49:06 - INFO - __main__ - Step 25578: {'lr': 0.0004693394701700132, 'samples': 4910976, 'steps': 25577, 'loss/train': 1.2549412250518799} 11/07/2021 00:49:06 - INFO - __main__ - Step 25579: {'lr': 0.00046933692375050783, 'samples': 4911168, 'steps': 25578, 'loss/train': 0.6779226064682007} 11/07/2021 00:49:06 - INFO - __main__ - Step 25580: {'lr': 0.00046933437723217265, 'samples': 4911360, 'steps': 25579, 'loss/train': 1.3821730613708496} 11/07/2021 00:49:07 - INFO - __main__ - Step 25581: {'lr': 0.0004693318306150087, 'samples': 4911552, 'steps': 25580, 'loss/train': 2.004680633544922} 11/07/2021 00:49:07 - INFO - __main__ - Step 25582: {'lr': 0.0004693292838990173, 'samples': 4911744, 'steps': 25581, 'loss/train': 1.2198615074157715} 11/07/2021 00:49:08 - INFO - __main__ - Step 25583: {'lr': 0.0004693267370841995, 'samples': 4911936, 'steps': 25582, 'loss/train': 1.6979387998580933} 11/07/2021 00:49:09 - INFO - __main__ - Step 25584: {'lr': 0.00046932419017055646, 'samples': 4912128, 'steps': 25583, 'loss/train': 1.5728169679641724} 11/07/2021 00:49:09 - INFO - __main__ - Step 25585: {'lr': 0.0004693216431580893, 'samples': 4912320, 'steps': 25584, 'loss/train': 1.439206838607788} 11/07/2021 00:49:09 - INFO - __main__ - Step 25586: {'lr': 0.00046931909604679925, 'samples': 4912512, 'steps': 25585, 'loss/train': 1.6354206800460815} 11/07/2021 00:49:10 - INFO - __main__ - Step 25587: {'lr': 0.0004693165488366873, 'samples': 4912704, 'steps': 25586, 'loss/train': 1.040775179862976} 11/07/2021 00:49:11 - INFO - __main__ - Step 25588: {'lr': 0.00046931400152775473, 'samples': 4912896, 'steps': 25587, 'loss/train': 1.4883759021759033} 11/07/2021 00:49:11 - INFO - __main__ - Step 25589: {'lr': 0.00046931145412000265, 'samples': 4913088, 'steps': 25588, 'loss/train': 1.6937187910079956} 11/07/2021 00:49:11 - INFO - __main__ - Step 25590: {'lr': 0.00046930890661343226, 'samples': 4913280, 'steps': 25589, 'loss/train': 1.5540019273757935} 11/07/2021 00:49:12 - INFO - __main__ - Step 25591: {'lr': 0.00046930635900804466, 'samples': 4913472, 'steps': 25590, 'loss/train': 1.1948552131652832} 11/07/2021 00:49:12 - INFO - __main__ - Step 25592: {'lr': 0.0004693038113038409, 'samples': 4913664, 'steps': 25591, 'loss/train': 1.5493038892745972} 11/07/2021 00:49:13 - INFO - __main__ - Step 25593: {'lr': 0.0004693012635008224, 'samples': 4913856, 'steps': 25592, 'loss/train': 1.657433271408081} 11/07/2021 00:49:13 - INFO - __main__ - Step 25594: {'lr': 0.00046929871559898994, 'samples': 4914048, 'steps': 25593, 'loss/train': 1.1808425188064575} 11/07/2021 00:49:14 - INFO - __main__ - Step 25595: {'lr': 0.00046929616759834505, 'samples': 4914240, 'steps': 25594, 'loss/train': 1.7247182130813599} 11/07/2021 00:49:14 - INFO - __main__ - Step 25596: {'lr': 0.00046929361949888857, 'samples': 4914432, 'steps': 25595, 'loss/train': 1.9493794441223145} 11/07/2021 00:49:14 - INFO - __main__ - Step 25597: {'lr': 0.00046929107130062176, 'samples': 4914624, 'steps': 25596, 'loss/train': 1.5374634265899658} 11/07/2021 00:49:15 - INFO - __main__ - Step 25598: {'lr': 0.00046928852300354585, 'samples': 4914816, 'steps': 25597, 'loss/train': 1.6057541370391846} 11/07/2021 00:49:16 - INFO - __main__ - Step 25599: {'lr': 0.0004692859746076619, 'samples': 4915008, 'steps': 25598, 'loss/train': 1.3104592561721802} 11/07/2021 00:49:16 - INFO - __main__ - Step 25600: {'lr': 0.00046928342611297105, 'samples': 4915200, 'steps': 25599, 'loss/train': 1.26352059841156} 11/07/2021 00:49:16 - INFO - __main__ - Step 25601: {'lr': 0.00046928087751947444, 'samples': 4915392, 'steps': 25600, 'loss/train': 1.4352083206176758} 11/07/2021 00:49:17 - INFO - __main__ - Step 25602: {'lr': 0.00046927832882717323, 'samples': 4915584, 'steps': 25601, 'loss/train': 1.254616618156433} 11/07/2021 00:49:18 - INFO - __main__ - Step 25603: {'lr': 0.0004692757800360687, 'samples': 4915776, 'steps': 25602, 'loss/train': 1.7988314628601074} 11/07/2021 00:49:18 - INFO - __main__ - Step 25604: {'lr': 0.0004692732311461618, 'samples': 4915968, 'steps': 25603, 'loss/train': 1.255089521408081} 11/07/2021 00:49:18 - INFO - __main__ - Step 25605: {'lr': 0.0004692706821574538, 'samples': 4916160, 'steps': 25604, 'loss/train': 1.0325216054916382} 11/07/2021 00:49:19 - INFO - __main__ - Step 25606: {'lr': 0.00046926813306994586, 'samples': 4916352, 'steps': 25605, 'loss/train': 1.6360634565353394} 11/07/2021 00:49:19 - INFO - __main__ - Step 25607: {'lr': 0.00046926558388363904, 'samples': 4916544, 'steps': 25606, 'loss/train': 1.79670250415802} 11/07/2021 00:49:20 - INFO - __main__ - Step 25608: {'lr': 0.00046926303459853447, 'samples': 4916736, 'steps': 25607, 'loss/train': 1.8682706356048584} 11/07/2021 00:49:21 - INFO - __main__ - Step 25609: {'lr': 0.00046926048521463344, 'samples': 4916928, 'steps': 25608, 'loss/train': 1.3411468267440796} 11/07/2021 00:49:21 - INFO - __main__ - Step 25610: {'lr': 0.000469257935731937, 'samples': 4917120, 'steps': 25609, 'loss/train': 1.0871062278747559} 11/07/2021 00:49:21 - INFO - __main__ - Step 25611: {'lr': 0.0004692553861504463, 'samples': 4917312, 'steps': 25610, 'loss/train': 1.6056123971939087} 11/07/2021 00:49:22 - INFO - __main__ - Step 25612: {'lr': 0.00046925283647016253, 'samples': 4917504, 'steps': 25611, 'loss/train': 1.4586424827575684} 11/07/2021 00:49:23 - INFO - __main__ - Step 25613: {'lr': 0.0004692502866910868, 'samples': 4917696, 'steps': 25612, 'loss/train': 1.538058876991272} 11/07/2021 00:49:23 - INFO - __main__ - Step 25614: {'lr': 0.0004692477368132203, 'samples': 4917888, 'steps': 25613, 'loss/train': 1.575411319732666} 11/07/2021 00:49:23 - INFO - __main__ - Step 25615: {'lr': 0.0004692451868365641, 'samples': 4918080, 'steps': 25614, 'loss/train': 1.7118977308273315} 11/07/2021 00:49:24 - INFO - __main__ - Step 25616: {'lr': 0.00046924263676111945, 'samples': 4918272, 'steps': 25615, 'loss/train': 2.131321907043457} 11/07/2021 00:49:24 - INFO - __main__ - Step 25617: {'lr': 0.00046924008658688745, 'samples': 4918464, 'steps': 25616, 'loss/train': 2.023993968963623} 11/07/2021 00:49:25 - INFO - __main__ - Step 25618: {'lr': 0.00046923753631386924, 'samples': 4918656, 'steps': 25617, 'loss/train': 1.8225526809692383} 11/07/2021 00:49:25 - INFO - __main__ - Step 25619: {'lr': 0.0004692349859420659, 'samples': 4918848, 'steps': 25618, 'loss/train': 1.5940064191818237} 11/07/2021 00:49:26 - INFO - __main__ - Step 25620: {'lr': 0.00046923243547147874, 'samples': 4919040, 'steps': 25619, 'loss/train': 1.45718252658844} 11/07/2021 00:49:26 - INFO - __main__ - Step 25621: {'lr': 0.0004692298849021088, 'samples': 4919232, 'steps': 25620, 'loss/train': 1.6011896133422852} 11/07/2021 00:49:26 - INFO - __main__ - Step 25622: {'lr': 0.00046922733423395736, 'samples': 4919424, 'steps': 25621, 'loss/train': 1.3006465435028076} 11/07/2021 00:49:27 - INFO - __main__ - Step 25623: {'lr': 0.0004692247834670253, 'samples': 4919616, 'steps': 25622, 'loss/train': 1.3544636964797974} 11/07/2021 00:49:28 - INFO - __main__ - Step 25624: {'lr': 0.000469222232601314, 'samples': 4919808, 'steps': 25623, 'loss/train': 1.1881569623947144} 11/07/2021 00:49:28 - INFO - __main__ - Step 25625: {'lr': 0.0004692196816368246, 'samples': 4920000, 'steps': 25624, 'loss/train': 1.6739518642425537} 11/07/2021 00:49:28 - INFO - __main__ - Step 25626: {'lr': 0.00046921713057355817, 'samples': 4920192, 'steps': 25625, 'loss/train': 1.2167147397994995} 11/07/2021 00:49:29 - INFO - __main__ - Step 25627: {'lr': 0.0004692145794115159, 'samples': 4920384, 'steps': 25626, 'loss/train': 1.683274269104004} 11/07/2021 00:49:29 - INFO - __main__ - Step 25628: {'lr': 0.00046921202815069883, 'samples': 4920576, 'steps': 25627, 'loss/train': 1.4491691589355469} 11/07/2021 00:49:30 - INFO - __main__ - Step 25629: {'lr': 0.00046920947679110833, 'samples': 4920768, 'steps': 25628, 'loss/train': 1.2186248302459717} 11/07/2021 00:49:31 - INFO - __main__ - Step 25630: {'lr': 0.00046920692533274533, 'samples': 4920960, 'steps': 25629, 'loss/train': 1.3265304565429688} 11/07/2021 00:49:31 - INFO - __main__ - Step 25631: {'lr': 0.0004692043737756111, 'samples': 4921152, 'steps': 25630, 'loss/train': 1.4819444417953491} 11/07/2021 00:49:31 - INFO - __main__ - Step 25632: {'lr': 0.00046920182211970677, 'samples': 4921344, 'steps': 25631, 'loss/train': 0.8713647127151489} 11/07/2021 00:49:32 - INFO - __main__ - Step 25633: {'lr': 0.00046919927036503353, 'samples': 4921536, 'steps': 25632, 'loss/train': 1.6610690355300903} 11/07/2021 00:49:33 - INFO - __main__ - Step 25634: {'lr': 0.0004691967185115924, 'samples': 4921728, 'steps': 25633, 'loss/train': 1.2283509969711304} 11/07/2021 00:49:33 - INFO - __main__ - Step 25635: {'lr': 0.00046919416655938465, 'samples': 4921920, 'steps': 25634, 'loss/train': 1.736107349395752} 11/07/2021 00:49:34 - INFO - __main__ - Step 25636: {'lr': 0.0004691916145084113, 'samples': 4922112, 'steps': 25635, 'loss/train': 1.667926549911499} 11/07/2021 00:49:34 - INFO - __main__ - Step 25637: {'lr': 0.0004691890623586737, 'samples': 4922304, 'steps': 25636, 'loss/train': 0.921523928642273} 11/07/2021 00:49:35 - INFO - __main__ - Step 25638: {'lr': 0.00046918651011017287, 'samples': 4922496, 'steps': 25637, 'loss/train': 1.4885398149490356} 11/07/2021 00:49:35 - INFO - __main__ - Step 25639: {'lr': 0.00046918395776290997, 'samples': 4922688, 'steps': 25638, 'loss/train': 1.7190399169921875} 11/07/2021 00:49:36 - INFO - __main__ - Step 25640: {'lr': 0.0004691814053168861, 'samples': 4922880, 'steps': 25639, 'loss/train': 1.4409922361373901} 11/07/2021 00:49:36 - INFO - __main__ - Step 25641: {'lr': 0.0004691788527721026, 'samples': 4923072, 'steps': 25640, 'loss/train': 1.4608443975448608} 11/07/2021 00:49:36 - INFO - __main__ - Step 25642: {'lr': 0.0004691763001285604, 'samples': 4923264, 'steps': 25641, 'loss/train': 1.7773683071136475} 11/07/2021 00:49:37 - INFO - __main__ - Step 25643: {'lr': 0.0004691737473862607, 'samples': 4923456, 'steps': 25642, 'loss/train': 1.7889008522033691} 11/07/2021 00:49:38 - INFO - __main__ - Step 25644: {'lr': 0.00046917119454520487, 'samples': 4923648, 'steps': 25643, 'loss/train': 1.1047998666763306} 11/07/2021 00:49:38 - INFO - __main__ - Step 25645: {'lr': 0.00046916864160539376, 'samples': 4923840, 'steps': 25644, 'loss/train': 1.7732115983963013} 11/07/2021 00:49:38 - INFO - __main__ - Step 25646: {'lr': 0.00046916608856682865, 'samples': 4924032, 'steps': 25645, 'loss/train': 1.751808524131775} 11/07/2021 00:49:39 - INFO - __main__ - Step 25647: {'lr': 0.0004691635354295106, 'samples': 4924224, 'steps': 25646, 'loss/train': 1.6011745929718018} 11/07/2021 00:49:39 - INFO - __main__ - Step 25648: {'lr': 0.00046916098219344093, 'samples': 4924416, 'steps': 25647, 'loss/train': 1.3450523614883423} 11/07/2021 00:49:40 - INFO - __main__ - Step 25649: {'lr': 0.0004691584288586207, 'samples': 4924608, 'steps': 25648, 'loss/train': 0.7353626489639282} 11/07/2021 00:49:41 - INFO - __main__ - Step 25650: {'lr': 0.0004691558754250511, 'samples': 4924800, 'steps': 25649, 'loss/train': 0.6195133328437805} 11/07/2021 00:49:41 - INFO - __main__ - Step 25651: {'lr': 0.0004691533218927332, 'samples': 4924992, 'steps': 25650, 'loss/train': 1.8022180795669556} 11/07/2021 00:49:41 - INFO - __main__ - Step 25652: {'lr': 0.00046915076826166814, 'samples': 4925184, 'steps': 25651, 'loss/train': 1.5118467807769775} 11/07/2021 00:49:42 - INFO - __main__ - Step 25653: {'lr': 0.0004691482145318572, 'samples': 4925376, 'steps': 25652, 'loss/train': 1.5533722639083862} 11/07/2021 00:49:43 - INFO - __main__ - Step 25654: {'lr': 0.00046914566070330144, 'samples': 4925568, 'steps': 25653, 'loss/train': 2.113215446472168} 11/07/2021 00:49:43 - INFO - __main__ - Step 25655: {'lr': 0.00046914310677600204, 'samples': 4925760, 'steps': 25654, 'loss/train': 1.3403749465942383} 11/07/2021 00:49:43 - INFO - __main__ - Step 25656: {'lr': 0.00046914055274996017, 'samples': 4925952, 'steps': 25655, 'loss/train': 1.0155887603759766} 11/07/2021 00:49:44 - INFO - __main__ - Step 25657: {'lr': 0.00046913799862517686, 'samples': 4926144, 'steps': 25656, 'loss/train': 0.8465630412101746} 11/07/2021 00:49:44 - INFO - __main__ - Step 25658: {'lr': 0.0004691354444016534, 'samples': 4926336, 'steps': 25657, 'loss/train': 1.2985042333602905} 11/07/2021 00:49:45 - INFO - __main__ - Step 25659: {'lr': 0.00046913289007939087, 'samples': 4926528, 'steps': 25658, 'loss/train': 1.632158875465393} 11/07/2021 00:49:45 - INFO - __main__ - Step 25660: {'lr': 0.00046913033565839046, 'samples': 4926720, 'steps': 25659, 'loss/train': 1.7725186347961426} 11/07/2021 00:49:46 - INFO - __main__ - Step 25661: {'lr': 0.0004691277811386533, 'samples': 4926912, 'steps': 25660, 'loss/train': 1.2652547359466553} 11/07/2021 00:49:46 - INFO - __main__ - Step 25662: {'lr': 0.0004691252265201805, 'samples': 4927104, 'steps': 25661, 'loss/train': 1.4861279726028442} 11/07/2021 00:49:46 - INFO - __main__ - Step 25663: {'lr': 0.00046912267180297337, 'samples': 4927296, 'steps': 25662, 'loss/train': 1.463240385055542} 11/07/2021 00:49:47 - INFO - __main__ - Step 25664: {'lr': 0.0004691201169870328, 'samples': 4927488, 'steps': 25663, 'loss/train': 1.071416974067688} 11/07/2021 00:49:48 - INFO - __main__ - Step 25665: {'lr': 0.00046911756207236024, 'samples': 4927680, 'steps': 25664, 'loss/train': 0.5899576544761658} 11/07/2021 00:49:48 - INFO - __main__ - Step 25666: {'lr': 0.0004691150070589566, 'samples': 4927872, 'steps': 25665, 'loss/train': 1.5438692569732666} 11/07/2021 00:49:49 - INFO - __main__ - Step 25667: {'lr': 0.00046911245194682306, 'samples': 4928064, 'steps': 25666, 'loss/train': 0.9179422855377197} 11/07/2021 00:49:49 - INFO - __main__ - Step 25668: {'lr': 0.00046910989673596093, 'samples': 4928256, 'steps': 25667, 'loss/train': 1.6727534532546997} 11/07/2021 00:49:49 - INFO - __main__ - Step 25669: {'lr': 0.00046910734142637124, 'samples': 4928448, 'steps': 25668, 'loss/train': 1.525063157081604} 11/07/2021 00:49:50 - INFO - __main__ - Step 25670: {'lr': 0.00046910478601805514, 'samples': 4928640, 'steps': 25669, 'loss/train': 5.881829738616943} 11/07/2021 00:49:51 - INFO - __main__ - Step 25671: {'lr': 0.0004691022305110138, 'samples': 4928832, 'steps': 25670, 'loss/train': 1.5334299802780151} 11/07/2021 00:49:51 - INFO - __main__ - Step 25672: {'lr': 0.0004690996749052484, 'samples': 4929024, 'steps': 25671, 'loss/train': 1.8652445077896118} 11/07/2021 00:49:51 - INFO - __main__ - Step 25673: {'lr': 0.00046909711920076, 'samples': 4929216, 'steps': 25672, 'loss/train': 1.9792180061340332} 11/07/2021 00:49:52 - INFO - __main__ - Step 25674: {'lr': 0.0004690945633975499, 'samples': 4929408, 'steps': 25673, 'loss/train': 0.8824723362922668} 11/07/2021 00:49:52 - INFO - __main__ - Step 25675: {'lr': 0.00046909200749561914, 'samples': 4929600, 'steps': 25674, 'loss/train': 1.751390814781189} 11/07/2021 00:49:53 - INFO - __main__ - Step 25676: {'lr': 0.00046908945149496897, 'samples': 4929792, 'steps': 25675, 'loss/train': 1.4224282503128052} 11/07/2021 00:49:53 - INFO - __main__ - Step 25677: {'lr': 0.00046908689539560034, 'samples': 4929984, 'steps': 25676, 'loss/train': 1.8100460767745972} 11/07/2021 00:49:54 - INFO - __main__ - Step 25678: {'lr': 0.0004690843391975146, 'samples': 4930176, 'steps': 25677, 'loss/train': 1.1137361526489258} 11/07/2021 00:49:54 - INFO - __main__ - Step 25679: {'lr': 0.0004690817829007129, 'samples': 4930368, 'steps': 25678, 'loss/train': 1.2293751239776611} 11/07/2021 00:49:55 - INFO - __main__ - Step 25680: {'lr': 0.00046907922650519623, 'samples': 4930560, 'steps': 25679, 'loss/train': 1.0705848932266235} 11/07/2021 00:49:56 - INFO - __main__ - Step 25681: {'lr': 0.0004690766700109659, 'samples': 4930752, 'steps': 25680, 'loss/train': 1.33608877658844} 11/07/2021 00:49:56 - INFO - __main__ - Step 25682: {'lr': 0.00046907411341802295, 'samples': 4930944, 'steps': 25681, 'loss/train': 1.132555603981018} 11/07/2021 00:49:56 - INFO - __main__ - Step 25683: {'lr': 0.0004690715567263687, 'samples': 4931136, 'steps': 25682, 'loss/train': 1.588343858718872} 11/07/2021 00:49:57 - INFO - __main__ - Step 25684: {'lr': 0.00046906899993600406, 'samples': 4931328, 'steps': 25683, 'loss/train': 1.7066028118133545} 11/07/2021 00:49:57 - INFO - __main__ - Step 25685: {'lr': 0.00046906644304693033, 'samples': 4931520, 'steps': 25684, 'loss/train': 1.7568970918655396} 11/07/2021 00:49:58 - INFO - __main__ - Step 25686: {'lr': 0.0004690638860591487, 'samples': 4931712, 'steps': 25685, 'loss/train': 1.4653843641281128} 11/07/2021 00:49:58 - INFO - __main__ - Step 25687: {'lr': 0.00046906132897266026, 'samples': 4931904, 'steps': 25686, 'loss/train': 1.3618601560592651} 11/07/2021 00:49:59 - INFO - __main__ - Step 25688: {'lr': 0.00046905877178746614, 'samples': 4932096, 'steps': 25687, 'loss/train': 1.6840893030166626} 11/07/2021 00:49:59 - INFO - __main__ - Step 25689: {'lr': 0.0004690562145035675, 'samples': 4932288, 'steps': 25688, 'loss/train': 1.1401145458221436} 11/07/2021 00:49:59 - INFO - __main__ - Step 25690: {'lr': 0.00046905365712096553, 'samples': 4932480, 'steps': 25689, 'loss/train': 2.9032070636749268} 11/07/2021 00:50:00 - INFO - __main__ - Step 25691: {'lr': 0.0004690510996396614, 'samples': 4932672, 'steps': 25690, 'loss/train': 2.0789008140563965} 11/07/2021 00:50:01 - INFO - __main__ - Step 25692: {'lr': 0.0004690485420596561, 'samples': 4932864, 'steps': 25691, 'loss/train': 1.3529332876205444} 11/07/2021 00:50:01 - INFO - __main__ - Step 25693: {'lr': 0.000469045984380951, 'samples': 4933056, 'steps': 25692, 'loss/train': 1.932854413986206} 11/07/2021 00:50:02 - INFO - __main__ - Step 25694: {'lr': 0.0004690434266035471, 'samples': 4933248, 'steps': 25693, 'loss/train': 1.516493320465088} 11/07/2021 00:50:02 - INFO - __main__ - Step 25695: {'lr': 0.00046904086872744577, 'samples': 4933440, 'steps': 25694, 'loss/train': 1.5998808145523071} 11/07/2021 00:50:02 - INFO - __main__ - Step 25696: {'lr': 0.0004690383107526479, 'samples': 4933632, 'steps': 25695, 'loss/train': 1.5363796949386597} 11/07/2021 00:50:03 - INFO - __main__ - Step 25697: {'lr': 0.0004690357526791547, 'samples': 4933824, 'steps': 25696, 'loss/train': 1.045938491821289} 11/07/2021 00:50:04 - INFO - __main__ - Step 25698: {'lr': 0.00046903319450696744, 'samples': 4934016, 'steps': 25697, 'loss/train': 1.6437040567398071} 11/07/2021 00:50:04 - INFO - __main__ - Step 25699: {'lr': 0.00046903063623608714, 'samples': 4934208, 'steps': 25698, 'loss/train': 1.6086574792861938} 11/07/2021 00:50:04 - INFO - __main__ - Step 25700: {'lr': 0.00046902807786651507, 'samples': 4934400, 'steps': 25699, 'loss/train': 1.112980604171753} 11/07/2021 00:50:05 - INFO - __main__ - Step 25701: {'lr': 0.00046902551939825236, 'samples': 4934592, 'steps': 25700, 'loss/train': 1.9607683420181274} 11/07/2021 00:50:06 - INFO - __main__ - Step 25702: {'lr': 0.00046902296083130003, 'samples': 4934784, 'steps': 25701, 'loss/train': 1.5503727197647095} 11/07/2021 00:50:06 - INFO - __main__ - Step 25703: {'lr': 0.00046902040216565945, 'samples': 4934976, 'steps': 25702, 'loss/train': 2.105358362197876} 11/07/2021 00:50:06 - INFO - __main__ - Step 25704: {'lr': 0.0004690178434013316, 'samples': 4935168, 'steps': 25703, 'loss/train': 1.0405181646347046} 11/07/2021 00:50:07 - INFO - __main__ - Step 25705: {'lr': 0.00046901528453831764, 'samples': 4935360, 'steps': 25704, 'loss/train': 1.5190848112106323} 11/07/2021 00:50:07 - INFO - __main__ - Step 25706: {'lr': 0.0004690127255766188, 'samples': 4935552, 'steps': 25705, 'loss/train': 1.511635661125183} 11/07/2021 00:50:08 - INFO - __main__ - Step 25707: {'lr': 0.0004690101665162362, 'samples': 4935744, 'steps': 25706, 'loss/train': 1.6795202493667603} 11/07/2021 00:50:09 - INFO - __main__ - Step 25708: {'lr': 0.00046900760735717103, 'samples': 4935936, 'steps': 25707, 'loss/train': 1.828741192817688} 11/07/2021 00:50:09 - INFO - __main__ - Step 25709: {'lr': 0.00046900504809942433, 'samples': 4936128, 'steps': 25708, 'loss/train': 0.857280969619751} 11/07/2021 00:50:09 - INFO - __main__ - Step 25710: {'lr': 0.00046900248874299746, 'samples': 4936320, 'steps': 25709, 'loss/train': 1.425267219543457} 11/07/2021 00:50:10 - INFO - __main__ - Step 25711: {'lr': 0.0004689999292878914, 'samples': 4936512, 'steps': 25710, 'loss/train': 0.9910033345222473} 11/07/2021 00:50:11 - INFO - __main__ - Step 25712: {'lr': 0.00046899736973410734, 'samples': 4936704, 'steps': 25711, 'loss/train': 1.9840439558029175} 11/07/2021 00:50:11 - INFO - __main__ - Step 25713: {'lr': 0.0004689948100816465, 'samples': 4936896, 'steps': 25712, 'loss/train': 1.0799232721328735} 11/07/2021 00:50:11 - INFO - __main__ - Step 25714: {'lr': 0.00046899225033050985, 'samples': 4937088, 'steps': 25713, 'loss/train': 1.4242122173309326} 11/07/2021 00:50:12 - INFO - __main__ - Step 25715: {'lr': 0.0004689896904806987, 'samples': 4937280, 'steps': 25714, 'loss/train': 1.2976279258728027} 11/07/2021 00:50:12 - INFO - __main__ - Step 25716: {'lr': 0.0004689871305322143, 'samples': 4937472, 'steps': 25715, 'loss/train': 1.7055093050003052} 11/07/2021 00:50:13 - INFO - __main__ - Step 25717: {'lr': 0.0004689845704850576, 'samples': 4937664, 'steps': 25716, 'loss/train': 1.5551862716674805} 11/07/2021 00:50:13 - INFO - __main__ - Step 25718: {'lr': 0.0004689820103392298, 'samples': 4937856, 'steps': 25717, 'loss/train': 1.6573007106781006} 11/07/2021 00:50:14 - INFO - __main__ - Step 25719: {'lr': 0.0004689794500947321, 'samples': 4938048, 'steps': 25718, 'loss/train': 1.3254516124725342} 11/07/2021 00:50:14 - INFO - __main__ - Step 25720: {'lr': 0.0004689768897515657, 'samples': 4938240, 'steps': 25719, 'loss/train': 1.4422892332077026} 11/07/2021 00:50:15 - INFO - __main__ - Step 25721: {'lr': 0.0004689743293097316, 'samples': 4938432, 'steps': 25720, 'loss/train': 1.3288673162460327} 11/07/2021 00:50:16 - INFO - __main__ - Step 25722: {'lr': 0.0004689717687692311, 'samples': 4938624, 'steps': 25721, 'loss/train': 1.748436689376831} 11/07/2021 00:50:16 - INFO - __main__ - Step 25723: {'lr': 0.0004689692081300653, 'samples': 4938816, 'steps': 25722, 'loss/train': 1.3790818452835083} 11/07/2021 00:50:16 - INFO - __main__ - Step 25724: {'lr': 0.0004689666473922354, 'samples': 4939008, 'steps': 25723, 'loss/train': 1.511187195777893} 11/07/2021 00:50:17 - INFO - __main__ - Step 25725: {'lr': 0.0004689640865557424, 'samples': 4939200, 'steps': 25724, 'loss/train': 1.4920438528060913} 11/07/2021 00:50:17 - INFO - __main__ - Step 25726: {'lr': 0.0004689615256205876, 'samples': 4939392, 'steps': 25725, 'loss/train': 1.3425120115280151} 11/07/2021 00:50:17 - INFO - __main__ - Step 25727: {'lr': 0.0004689589645867721, 'samples': 4939584, 'steps': 25726, 'loss/train': 1.7583677768707275} 11/07/2021 00:50:18 - INFO - __main__ - Step 25728: {'lr': 0.0004689564034542971, 'samples': 4939776, 'steps': 25727, 'loss/train': 1.1632046699523926} 11/07/2021 00:50:19 - INFO - __main__ - Step 25729: {'lr': 0.00046895384222316375, 'samples': 4939968, 'steps': 25728, 'loss/train': 1.4341073036193848} 11/07/2021 00:50:19 - INFO - __main__ - Step 25730: {'lr': 0.0004689512808933731, 'samples': 4940160, 'steps': 25729, 'loss/train': 1.5172381401062012} 11/07/2021 00:50:19 - INFO - __main__ - Step 25731: {'lr': 0.0004689487194649265, 'samples': 4940352, 'steps': 25730, 'loss/train': 1.843948245048523} 11/07/2021 00:50:20 - INFO - __main__ - Step 25732: {'lr': 0.0004689461579378249, 'samples': 4940544, 'steps': 25731, 'loss/train': 1.2686885595321655} 11/07/2021 00:50:21 - INFO - __main__ - Step 25733: {'lr': 0.0004689435963120696, 'samples': 4940736, 'steps': 25732, 'loss/train': 0.15897658467292786} 11/07/2021 00:50:21 - INFO - __main__ - Step 25734: {'lr': 0.00046894103458766163, 'samples': 4940928, 'steps': 25733, 'loss/train': 1.826562523841858} 11/07/2021 00:50:22 - INFO - __main__ - Step 25735: {'lr': 0.0004689384727646022, 'samples': 4941120, 'steps': 25734, 'loss/train': 1.2627410888671875} 11/07/2021 00:50:22 - INFO - __main__ - Step 25736: {'lr': 0.00046893591084289256, 'samples': 4941312, 'steps': 25735, 'loss/train': 1.6012457609176636} 11/07/2021 00:50:22 - INFO - __main__ - Step 25737: {'lr': 0.0004689333488225337, 'samples': 4941504, 'steps': 25736, 'loss/train': 1.6334772109985352} 11/07/2021 00:50:23 - INFO - __main__ - Step 25738: {'lr': 0.00046893078670352686, 'samples': 4941696, 'steps': 25737, 'loss/train': 0.9714683294296265} 11/07/2021 00:50:24 - INFO - __main__ - Step 25739: {'lr': 0.0004689282244858732, 'samples': 4941888, 'steps': 25738, 'loss/train': 1.7057424783706665} 11/07/2021 00:50:24 - INFO - __main__ - Step 25740: {'lr': 0.00046892566216957387, 'samples': 4942080, 'steps': 25739, 'loss/train': 1.7884376049041748} 11/07/2021 00:50:24 - INFO - __main__ - Step 25741: {'lr': 0.00046892309975463, 'samples': 4942272, 'steps': 25740, 'loss/train': 1.3078798055648804} 11/07/2021 00:50:25 - INFO - __main__ - Step 25742: {'lr': 0.0004689205372410427, 'samples': 4942464, 'steps': 25741, 'loss/train': 0.8204142451286316} 11/07/2021 00:50:25 - INFO - __main__ - Step 25743: {'lr': 0.00046891797462881327, 'samples': 4942656, 'steps': 25742, 'loss/train': 1.6207472085952759} 11/07/2021 00:50:27 - INFO - __main__ - Step 25744: {'lr': 0.0004689154119179427, 'samples': 4942848, 'steps': 25743, 'loss/train': 0.8706039786338806} 11/07/2021 00:50:27 - INFO - __main__ - Step 25745: {'lr': 0.00046891284910843237, 'samples': 4943040, 'steps': 25744, 'loss/train': 1.7266850471496582} 11/07/2021 00:50:27 - INFO - __main__ - Step 25746: {'lr': 0.0004689102862002832, 'samples': 4943232, 'steps': 25745, 'loss/train': 0.45676106214523315} 11/07/2021 00:50:28 - INFO - __main__ - Step 25747: {'lr': 0.00046890772319349637, 'samples': 4943424, 'steps': 25746, 'loss/train': 0.28906792402267456} 11/07/2021 00:50:28 - INFO - __main__ - Step 25748: {'lr': 0.00046890516008807315, 'samples': 4943616, 'steps': 25747, 'loss/train': 1.7977261543273926} 11/07/2021 00:50:30 - INFO - __main__ - Step 25749: {'lr': 0.0004689025968840147, 'samples': 4943808, 'steps': 25748, 'loss/train': 1.4963880777359009} 11/07/2021 00:50:30 - INFO - __main__ - Step 25750: {'lr': 0.00046890003358132204, 'samples': 4944000, 'steps': 25749, 'loss/train': 1.0647135972976685} 11/07/2021 00:50:30 - INFO - __main__ - Step 25751: {'lr': 0.0004688974701799964, 'samples': 4944192, 'steps': 25750, 'loss/train': 1.5756062269210815} 11/07/2021 00:50:31 - INFO - __main__ - Step 25752: {'lr': 0.00046889490668003896, 'samples': 4944384, 'steps': 25751, 'loss/train': 1.8758376836776733} 11/07/2021 00:50:31 - INFO - __main__ - Step 25753: {'lr': 0.0004688923430814509, 'samples': 4944576, 'steps': 25752, 'loss/train': 1.2992342710494995} 11/07/2021 00:50:31 - INFO - __main__ - Step 25754: {'lr': 0.00046888977938423326, 'samples': 4944768, 'steps': 25753, 'loss/train': 1.5431861877441406} 11/07/2021 00:50:33 - INFO - __main__ - Step 25755: {'lr': 0.00046888721558838734, 'samples': 4944960, 'steps': 25754, 'loss/train': 1.9197421073913574} 11/07/2021 00:50:33 - INFO - __main__ - Step 25756: {'lr': 0.00046888465169391414, 'samples': 4945152, 'steps': 25755, 'loss/train': 1.1083108186721802} 11/07/2021 00:50:33 - INFO - __main__ - Step 25757: {'lr': 0.00046888208770081493, 'samples': 4945344, 'steps': 25756, 'loss/train': 1.2790623903274536} 11/07/2021 00:50:34 - INFO - __main__ - Step 25758: {'lr': 0.0004688795236090908, 'samples': 4945536, 'steps': 25757, 'loss/train': 1.6210299730300903} 11/07/2021 00:50:34 - INFO - __main__ - Step 25759: {'lr': 0.000468876959418743, 'samples': 4945728, 'steps': 25758, 'loss/train': 1.7421590089797974} 11/07/2021 00:50:35 - INFO - __main__ - Step 25760: {'lr': 0.0004688743951297726, 'samples': 4945920, 'steps': 25759, 'loss/train': 1.5871471166610718} 11/07/2021 00:50:35 - INFO - __main__ - Step 25761: {'lr': 0.0004688718307421807, 'samples': 4946112, 'steps': 25760, 'loss/train': 1.4662115573883057} 11/07/2021 00:50:36 - INFO - __main__ - Step 25762: {'lr': 0.0004688692662559686, 'samples': 4946304, 'steps': 25761, 'loss/train': 1.6176438331604004} 11/07/2021 00:50:36 - INFO - __main__ - Step 25763: {'lr': 0.00046886670167113734, 'samples': 4946496, 'steps': 25762, 'loss/train': 1.8013571500778198} 11/07/2021 00:50:36 - INFO - __main__ - Step 25764: {'lr': 0.00046886413698768816, 'samples': 4946688, 'steps': 25763, 'loss/train': 1.3539690971374512} 11/07/2021 00:50:38 - INFO - __main__ - Step 25765: {'lr': 0.0004688615722056222, 'samples': 4946880, 'steps': 25764, 'loss/train': 1.3331290483474731} 11/07/2021 00:50:38 - INFO - __main__ - Step 25766: {'lr': 0.00046885900732494053, 'samples': 4947072, 'steps': 25765, 'loss/train': 1.4751242399215698} 11/07/2021 00:50:38 - INFO - __main__ - Step 25767: {'lr': 0.0004688564423456444, 'samples': 4947264, 'steps': 25766, 'loss/train': 1.634786605834961} 11/07/2021 00:50:39 - INFO - __main__ - Step 25768: {'lr': 0.00046885387726773494, 'samples': 4947456, 'steps': 25767, 'loss/train': 0.9453241229057312} 11/07/2021 00:50:39 - INFO - __main__ - Step 25769: {'lr': 0.0004688513120912133, 'samples': 4947648, 'steps': 25768, 'loss/train': 2.193246364593506} 11/07/2021 00:50:40 - INFO - __main__ - Step 25770: {'lr': 0.0004688487468160806, 'samples': 4947840, 'steps': 25769, 'loss/train': 1.9358593225479126} 11/07/2021 00:50:40 - INFO - __main__ - Step 25771: {'lr': 0.000468846181442338, 'samples': 4948032, 'steps': 25770, 'loss/train': 1.1964941024780273} 11/07/2021 00:50:41 - INFO - __main__ - Step 25772: {'lr': 0.0004688436159699868, 'samples': 4948224, 'steps': 25771, 'loss/train': 1.4133274555206299} 11/07/2021 00:50:41 - INFO - __main__ - Step 25773: {'lr': 0.000468841050399028, 'samples': 4948416, 'steps': 25772, 'loss/train': 1.092087984085083} 11/07/2021 00:50:41 - INFO - __main__ - Step 25774: {'lr': 0.0004688384847294628, 'samples': 4948608, 'steps': 25773, 'loss/train': 2.0307538509368896} 11/07/2021 00:50:42 - INFO - __main__ - Step 25775: {'lr': 0.0004688359189612923, 'samples': 4948800, 'steps': 25774, 'loss/train': 1.6210196018218994} 11/07/2021 00:50:43 - INFO - __main__ - Step 25776: {'lr': 0.0004688333530945178, 'samples': 4948992, 'steps': 25775, 'loss/train': 1.1951122283935547} 11/07/2021 00:50:43 - INFO - __main__ - Step 25777: {'lr': 0.0004688307871291403, 'samples': 4949184, 'steps': 25776, 'loss/train': 1.7040596008300781} 11/07/2021 00:50:44 - INFO - __main__ - Step 25778: {'lr': 0.0004688282210651611, 'samples': 4949376, 'steps': 25777, 'loss/train': 1.745054841041565} 11/07/2021 00:50:44 - INFO - __main__ - Step 25779: {'lr': 0.00046882565490258125, 'samples': 4949568, 'steps': 25778, 'loss/train': 1.9911561012268066} 11/07/2021 00:50:44 - INFO - __main__ - Step 25780: {'lr': 0.0004688230886414019, 'samples': 4949760, 'steps': 25779, 'loss/train': 1.8546158075332642} 11/07/2021 00:50:45 - INFO - __main__ - Step 25781: {'lr': 0.0004688205222816242, 'samples': 4949952, 'steps': 25780, 'loss/train': 1.5687676668167114} 11/07/2021 00:50:46 - INFO - __main__ - Step 25782: {'lr': 0.00046881795582324944, 'samples': 4950144, 'steps': 25781, 'loss/train': 1.6519067287445068} 11/07/2021 00:50:46 - INFO - __main__ - Step 25783: {'lr': 0.00046881538926627864, 'samples': 4950336, 'steps': 25782, 'loss/train': 1.1861686706542969} 11/07/2021 00:50:46 - INFO - __main__ - Step 25784: {'lr': 0.000468812822610713, 'samples': 4950528, 'steps': 25783, 'loss/train': 1.4683573246002197} 11/07/2021 00:50:47 - INFO - __main__ - Step 25785: {'lr': 0.00046881025585655367, 'samples': 4950720, 'steps': 25784, 'loss/train': 1.6115069389343262} 11/07/2021 00:50:48 - INFO - __main__ - Step 25786: {'lr': 0.0004688076890038019, 'samples': 4950912, 'steps': 25785, 'loss/train': 1.5832531452178955} 11/07/2021 00:50:48 - INFO - __main__ - Step 25787: {'lr': 0.00046880512205245867, 'samples': 4951104, 'steps': 25786, 'loss/train': 1.3586066961288452} 11/07/2021 00:50:48 - INFO - __main__ - Step 25788: {'lr': 0.00046880255500252526, 'samples': 4951296, 'steps': 25787, 'loss/train': 1.8434799909591675} 11/07/2021 00:50:49 - INFO - __main__ - Step 25789: {'lr': 0.0004687999878540028, 'samples': 4951488, 'steps': 25788, 'loss/train': 1.8325769901275635} 11/07/2021 00:50:49 - INFO - __main__ - Step 25790: {'lr': 0.00046879742060689243, 'samples': 4951680, 'steps': 25789, 'loss/train': 1.702770709991455} 11/07/2021 00:50:50 - INFO - __main__ - Step 25791: {'lr': 0.0004687948532611953, 'samples': 4951872, 'steps': 25790, 'loss/train': 1.678737998008728} 11/07/2021 00:50:51 - INFO - __main__ - Step 25792: {'lr': 0.0004687922858169126, 'samples': 4952064, 'steps': 25791, 'loss/train': 1.7778791189193726} 11/07/2021 00:50:51 - INFO - __main__ - Step 25793: {'lr': 0.0004687897182740455, 'samples': 4952256, 'steps': 25792, 'loss/train': 1.9369837045669556} 11/07/2021 00:50:51 - INFO - __main__ - Step 25794: {'lr': 0.0004687871506325951, 'samples': 4952448, 'steps': 25793, 'loss/train': 1.0146952867507935} 11/07/2021 00:50:52 - INFO - __main__ - Step 25795: {'lr': 0.00046878458289256264, 'samples': 4952640, 'steps': 25794, 'loss/train': 1.5751017332077026} 11/07/2021 00:50:53 - INFO - __main__ - Step 25796: {'lr': 0.00046878201505394913, 'samples': 4952832, 'steps': 25795, 'loss/train': 1.5337727069854736} 11/07/2021 00:50:53 - INFO - __main__ - Step 25797: {'lr': 0.0004687794471167559, 'samples': 4953024, 'steps': 25796, 'loss/train': 1.672663927078247} 11/07/2021 00:50:53 - INFO - __main__ - Step 25798: {'lr': 0.00046877687908098396, 'samples': 4953216, 'steps': 25797, 'loss/train': 1.6757265329360962} 11/07/2021 00:50:54 - INFO - __main__ - Step 25799: {'lr': 0.0004687743109466346, 'samples': 4953408, 'steps': 25798, 'loss/train': 1.7293604612350464} 11/07/2021 00:50:54 - INFO - __main__ - Step 25800: {'lr': 0.00046877174271370894, 'samples': 4953600, 'steps': 25799, 'loss/train': 1.3573501110076904} 11/07/2021 00:50:55 - INFO - __main__ - Step 25801: {'lr': 0.000468769174382208, 'samples': 4953792, 'steps': 25800, 'loss/train': 1.5040295124053955} 11/07/2021 00:50:56 - INFO - __main__ - Step 25802: {'lr': 0.0004687666059521331, 'samples': 4953984, 'steps': 25801, 'loss/train': 1.2513548135757446} 11/07/2021 00:50:56 - INFO - __main__ - Step 25803: {'lr': 0.0004687640374234854, 'samples': 4954176, 'steps': 25802, 'loss/train': 0.8608061075210571} 11/07/2021 00:50:56 - INFO - __main__ - Step 25804: {'lr': 0.0004687614687962659, 'samples': 4954368, 'steps': 25803, 'loss/train': 1.5000717639923096} 11/07/2021 00:50:57 - INFO - __main__ - Step 25805: {'lr': 0.0004687589000704759, 'samples': 4954560, 'steps': 25804, 'loss/train': 1.4666575193405151} 11/07/2021 00:50:57 - INFO - __main__ - Step 25806: {'lr': 0.0004687563312461165, 'samples': 4954752, 'steps': 25805, 'loss/train': 1.379432201385498} 11/07/2021 00:50:58 - INFO - __main__ - Step 25807: {'lr': 0.00046875376232318887, 'samples': 4954944, 'steps': 25806, 'loss/train': 0.6750864386558533} 11/07/2021 00:50:59 - INFO - __main__ - Step 25808: {'lr': 0.00046875119330169426, 'samples': 4955136, 'steps': 25807, 'loss/train': 1.6305607557296753} 11/07/2021 00:50:59 - INFO - __main__ - Step 25809: {'lr': 0.00046874862418163363, 'samples': 4955328, 'steps': 25808, 'loss/train': 1.6389553546905518} 11/07/2021 00:50:59 - INFO - __main__ - Step 25810: {'lr': 0.00046874605496300824, 'samples': 4955520, 'steps': 25809, 'loss/train': 1.5285234451293945} 11/07/2021 00:51:00 - INFO - __main__ - Step 25811: {'lr': 0.00046874348564581933, 'samples': 4955712, 'steps': 25810, 'loss/train': 1.624723196029663} 11/07/2021 00:51:00 - INFO - __main__ - Step 25812: {'lr': 0.00046874091623006793, 'samples': 4955904, 'steps': 25811, 'loss/train': 1.0928027629852295} 11/07/2021 00:51:01 - INFO - __main__ - Step 25813: {'lr': 0.0004687383467157553, 'samples': 4956096, 'steps': 25812, 'loss/train': 1.514067530632019} 11/07/2021 00:51:01 - INFO - __main__ - Step 25814: {'lr': 0.0004687357771028825, 'samples': 4956288, 'steps': 25813, 'loss/train': 0.9820759296417236} 11/07/2021 00:51:02 - INFO - __main__ - Step 25815: {'lr': 0.00046873320739145073, 'samples': 4956480, 'steps': 25814, 'loss/train': 1.932958722114563} 11/07/2021 00:51:02 - INFO - __main__ - Step 25816: {'lr': 0.0004687306375814612, 'samples': 4956672, 'steps': 25815, 'loss/train': 1.3385789394378662} 11/07/2021 00:51:02 - INFO - __main__ - Step 25817: {'lr': 0.000468728067672915, 'samples': 4956864, 'steps': 25816, 'loss/train': 1.031846284866333} 11/07/2021 00:51:03 - INFO - __main__ - Step 25818: {'lr': 0.00046872549766581326, 'samples': 4957056, 'steps': 25817, 'loss/train': 1.293784737586975} 11/07/2021 00:51:04 - INFO - __main__ - Step 25819: {'lr': 0.00046872292756015724, 'samples': 4957248, 'steps': 25818, 'loss/train': 1.3724759817123413} 11/07/2021 00:51:04 - INFO - __main__ - Step 25820: {'lr': 0.000468720357355948, 'samples': 4957440, 'steps': 25819, 'loss/train': 1.1009554862976074} 11/07/2021 00:51:04 - INFO - __main__ - Step 25821: {'lr': 0.00046871778705318673, 'samples': 4957632, 'steps': 25820, 'loss/train': 1.3064281940460205} 11/07/2021 00:51:05 - INFO - __main__ - Step 25822: {'lr': 0.0004687152166518747, 'samples': 4957824, 'steps': 25821, 'loss/train': 1.665042757987976} 11/07/2021 00:51:06 - INFO - __main__ - Step 25823: {'lr': 0.0004687126461520128, 'samples': 4958016, 'steps': 25822, 'loss/train': 1.5745177268981934} 11/07/2021 00:51:06 - INFO - __main__ - Step 25824: {'lr': 0.0004687100755536025, 'samples': 4958208, 'steps': 25823, 'loss/train': 1.9364532232284546} 11/07/2021 00:51:07 - INFO - __main__ - Step 25825: {'lr': 0.00046870750485664484, 'samples': 4958400, 'steps': 25824, 'loss/train': 1.2087626457214355} 11/07/2021 00:51:07 - INFO - __main__ - Step 25826: {'lr': 0.00046870493406114084, 'samples': 4958592, 'steps': 25825, 'loss/train': 1.2004915475845337} 11/07/2021 00:51:07 - INFO - __main__ - Step 25827: {'lr': 0.0004687023631670918, 'samples': 4958784, 'steps': 25826, 'loss/train': 1.7887715101242065} 11/07/2021 00:51:08 - INFO - __main__ - Step 25828: {'lr': 0.0004686997921744989, 'samples': 4958976, 'steps': 25827, 'loss/train': 1.3411016464233398} 11/07/2021 00:51:09 - INFO - __main__ - Step 25829: {'lr': 0.0004686972210833632, 'samples': 4959168, 'steps': 25828, 'loss/train': 1.493360996246338} 11/07/2021 00:51:09 - INFO - __main__ - Step 25830: {'lr': 0.0004686946498936859, 'samples': 4959360, 'steps': 25829, 'loss/train': 1.9060460329055786} 11/07/2021 00:51:09 - INFO - __main__ - Step 25831: {'lr': 0.00046869207860546826, 'samples': 4959552, 'steps': 25830, 'loss/train': 1.3489595651626587} 11/07/2021 00:51:10 - INFO - __main__ - Step 25832: {'lr': 0.00046868950721871126, 'samples': 4959744, 'steps': 25831, 'loss/train': 1.4942917823791504} 11/07/2021 00:51:10 - INFO - __main__ - Step 25833: {'lr': 0.00046868693573341616, 'samples': 4959936, 'steps': 25832, 'loss/train': 1.5614725351333618} 11/07/2021 00:51:11 - INFO - __main__ - Step 25834: {'lr': 0.00046868436414958405, 'samples': 4960128, 'steps': 25833, 'loss/train': 1.7506455183029175} 11/07/2021 00:51:12 - INFO - __main__ - Step 25835: {'lr': 0.00046868179246721623, 'samples': 4960320, 'steps': 25834, 'loss/train': 1.282240390777588} 11/07/2021 00:51:12 - INFO - __main__ - Step 25836: {'lr': 0.00046867922068631374, 'samples': 4960512, 'steps': 25835, 'loss/train': 1.5849071741104126} 11/07/2021 00:51:12 - INFO - __main__ - Step 25837: {'lr': 0.00046867664880687775, 'samples': 4960704, 'steps': 25836, 'loss/train': 2.374248743057251} 11/07/2021 00:51:13 - INFO - __main__ - Step 25838: {'lr': 0.00046867407682890937, 'samples': 4960896, 'steps': 25837, 'loss/train': 1.8533856868743896} 11/07/2021 00:51:14 - INFO - __main__ - Step 25839: {'lr': 0.00046867150475240994, 'samples': 4961088, 'steps': 25838, 'loss/train': 1.5554496049880981} 11/07/2021 00:51:14 - INFO - __main__ - Step 25840: {'lr': 0.0004686689325773805, 'samples': 4961280, 'steps': 25839, 'loss/train': 1.4998165369033813} 11/07/2021 00:51:14 - INFO - __main__ - Step 25841: {'lr': 0.00046866636030382217, 'samples': 4961472, 'steps': 25840, 'loss/train': 1.7674634456634521} 11/07/2021 00:51:15 - INFO - __main__ - Step 25842: {'lr': 0.00046866378793173616, 'samples': 4961664, 'steps': 25841, 'loss/train': 1.5353429317474365} 11/07/2021 00:51:15 - INFO - __main__ - Step 25843: {'lr': 0.0004686612154611236, 'samples': 4961856, 'steps': 25842, 'loss/train': 2.041708469390869} 11/07/2021 00:51:16 - INFO - __main__ - Step 25844: {'lr': 0.0004686586428919857, 'samples': 4962048, 'steps': 25843, 'loss/train': 1.6728260517120361} 11/07/2021 00:51:16 - INFO - __main__ - Step 25845: {'lr': 0.00046865607022432356, 'samples': 4962240, 'steps': 25844, 'loss/train': 1.5199110507965088} 11/07/2021 00:51:17 - INFO - __main__ - Step 25846: {'lr': 0.00046865349745813835, 'samples': 4962432, 'steps': 25845, 'loss/train': 0.546825110912323} 11/07/2021 00:51:17 - INFO - __main__ - Step 25847: {'lr': 0.00046865092459343126, 'samples': 4962624, 'steps': 25846, 'loss/train': 1.6953691244125366} 11/07/2021 00:51:17 - INFO - __main__ - Step 25848: {'lr': 0.00046864835163020353, 'samples': 4962816, 'steps': 25847, 'loss/train': 1.7198363542556763} 11/07/2021 00:51:19 - INFO - __main__ - Step 25849: {'lr': 0.00046864577856845613, 'samples': 4963008, 'steps': 25848, 'loss/train': 1.6675834655761719} 11/07/2021 00:51:19 - INFO - __main__ - Step 25850: {'lr': 0.0004686432054081904, 'samples': 4963200, 'steps': 25849, 'loss/train': 1.765515685081482} 11/07/2021 00:51:19 - INFO - __main__ - Step 25851: {'lr': 0.00046864063214940735, 'samples': 4963392, 'steps': 25850, 'loss/train': 1.7096325159072876} 11/07/2021 00:51:20 - INFO - __main__ - Step 25852: {'lr': 0.0004686380587921082, 'samples': 4963584, 'steps': 25851, 'loss/train': 1.2849268913269043} 11/07/2021 00:51:20 - INFO - __main__ - Step 25853: {'lr': 0.00046863548533629406, 'samples': 4963776, 'steps': 25852, 'loss/train': 1.6374523639678955} 11/07/2021 00:51:21 - INFO - __main__ - Step 25854: {'lr': 0.00046863291178196625, 'samples': 4963968, 'steps': 25853, 'loss/train': 1.1156255006790161} 11/07/2021 00:51:21 - INFO - __main__ - Step 25855: {'lr': 0.0004686303381291258, 'samples': 4964160, 'steps': 25854, 'loss/train': 0.7119954228401184} 11/07/2021 00:51:22 - INFO - __main__ - Step 25856: {'lr': 0.00046862776437777386, 'samples': 4964352, 'steps': 25855, 'loss/train': 1.733963131904602} 11/07/2021 00:51:22 - INFO - __main__ - Step 25857: {'lr': 0.00046862519052791166, 'samples': 4964544, 'steps': 25856, 'loss/train': 1.4181243181228638} 11/07/2021 00:51:22 - INFO - __main__ - Step 25858: {'lr': 0.00046862261657954033, 'samples': 4964736, 'steps': 25857, 'loss/train': 1.3477070331573486} 11/07/2021 00:51:23 - INFO - __main__ - Step 25859: {'lr': 0.000468620042532661, 'samples': 4964928, 'steps': 25858, 'loss/train': 1.6740336418151855} 11/07/2021 00:51:24 - INFO - __main__ - Step 25860: {'lr': 0.0004686174683872748, 'samples': 4965120, 'steps': 25859, 'loss/train': 1.1245931386947632} 11/07/2021 00:51:24 - INFO - __main__ - Step 25861: {'lr': 0.00046861489414338304, 'samples': 4965312, 'steps': 25860, 'loss/train': 1.3914529085159302} 11/07/2021 00:51:24 - INFO - __main__ - Step 25862: {'lr': 0.0004686123198009867, 'samples': 4965504, 'steps': 25861, 'loss/train': 1.3693437576293945} 11/07/2021 00:51:25 - INFO - __main__ - Step 25863: {'lr': 0.00046860974536008706, 'samples': 4965696, 'steps': 25862, 'loss/train': 3.1253714561462402} 11/07/2021 00:51:25 - INFO - __main__ - Step 25864: {'lr': 0.0004686071708206853, 'samples': 4965888, 'steps': 25863, 'loss/train': 1.4004058837890625} 11/07/2021 00:51:26 - INFO - __main__ - Step 25865: {'lr': 0.0004686045961827824, 'samples': 4966080, 'steps': 25864, 'loss/train': 1.5016511678695679} 11/07/2021 00:51:26 - INFO - __main__ - Step 25866: {'lr': 0.00046860202144637976, 'samples': 4966272, 'steps': 25865, 'loss/train': 1.7874298095703125} 11/07/2021 00:51:27 - INFO - __main__ - Step 25867: {'lr': 0.00046859944661147837, 'samples': 4966464, 'steps': 25866, 'loss/train': 1.6364786624908447} 11/07/2021 00:51:27 - INFO - __main__ - Step 25868: {'lr': 0.00046859687167807943, 'samples': 4966656, 'steps': 25867, 'loss/train': 1.4097360372543335} 11/07/2021 00:51:28 - INFO - __main__ - Step 25869: {'lr': 0.0004685942966461841, 'samples': 4966848, 'steps': 25868, 'loss/train': 1.6922928094863892} 11/07/2021 00:51:29 - INFO - __main__ - Step 25870: {'lr': 0.00046859172151579354, 'samples': 4967040, 'steps': 25869, 'loss/train': 1.5949517488479614} 11/07/2021 00:51:29 - INFO - __main__ - Step 25871: {'lr': 0.00046858914628690896, 'samples': 4967232, 'steps': 25870, 'loss/train': 1.5514227151870728} 11/07/2021 00:51:29 - INFO - __main__ - Step 25872: {'lr': 0.0004685865709595315, 'samples': 4967424, 'steps': 25871, 'loss/train': 1.4934883117675781} 11/07/2021 00:51:30 - INFO - __main__ - Step 25873: {'lr': 0.00046858399553366224, 'samples': 4967616, 'steps': 25872, 'loss/train': 1.4775656461715698} 11/07/2021 00:51:30 - INFO - __main__ - Step 25874: {'lr': 0.0004685814200093025, 'samples': 4967808, 'steps': 25873, 'loss/train': 1.5905988216400146} 11/07/2021 00:51:31 - INFO - __main__ - Step 25875: {'lr': 0.00046857884438645327, 'samples': 4968000, 'steps': 25874, 'loss/train': 1.4419405460357666} 11/07/2021 00:51:31 - INFO - __main__ - Step 25876: {'lr': 0.0004685762686651158, 'samples': 4968192, 'steps': 25875, 'loss/train': 1.5453418493270874} 11/07/2021 00:51:32 - INFO - __main__ - Step 25877: {'lr': 0.0004685736928452913, 'samples': 4968384, 'steps': 25876, 'loss/train': 1.4005645513534546} 11/07/2021 00:51:32 - INFO - __main__ - Step 25878: {'lr': 0.00046857111692698083, 'samples': 4968576, 'steps': 25877, 'loss/train': 1.641775131225586} 11/07/2021 00:51:32 - INFO - __main__ - Step 25879: {'lr': 0.0004685685409101855, 'samples': 4968768, 'steps': 25878, 'loss/train': 1.3810776472091675} 11/07/2021 00:51:33 - INFO - __main__ - Step 25880: {'lr': 0.00046856596479490667, 'samples': 4968960, 'steps': 25879, 'loss/train': 1.6162208318710327} 11/07/2021 00:51:34 - INFO - __main__ - Step 25881: {'lr': 0.0004685633885811453, 'samples': 4969152, 'steps': 25880, 'loss/train': 1.7117780447006226} 11/07/2021 00:51:34 - INFO - __main__ - Step 25882: {'lr': 0.0004685608122689027, 'samples': 4969344, 'steps': 25881, 'loss/train': 1.4712426662445068} 11/07/2021 00:51:34 - INFO - __main__ - Step 25883: {'lr': 0.00046855823585818004, 'samples': 4969536, 'steps': 25882, 'loss/train': 0.21515725553035736} 11/07/2021 00:51:35 - INFO - __main__ - Step 25884: {'lr': 0.0004685556593489783, 'samples': 4969728, 'steps': 25883, 'loss/train': 1.5416388511657715} 11/07/2021 00:51:36 - INFO - __main__ - Step 25885: {'lr': 0.0004685530827412988, 'samples': 4969920, 'steps': 25884, 'loss/train': 1.395815372467041} 11/07/2021 00:51:36 - INFO - __main__ - Step 25886: {'lr': 0.0004685505060351426, 'samples': 4970112, 'steps': 25885, 'loss/train': 1.784235954284668} 11/07/2021 00:51:37 - INFO - __main__ - Step 25887: {'lr': 0.00046854792923051094, 'samples': 4970304, 'steps': 25886, 'loss/train': 0.9251062870025635} 11/07/2021 00:51:37 - INFO - __main__ - Step 25888: {'lr': 0.00046854535232740505, 'samples': 4970496, 'steps': 25887, 'loss/train': 1.6409744024276733} 11/07/2021 00:51:37 - INFO - __main__ - Step 25889: {'lr': 0.00046854277532582585, 'samples': 4970688, 'steps': 25888, 'loss/train': 1.5012282133102417} 11/07/2021 00:51:38 - INFO - __main__ - Step 25890: {'lr': 0.0004685401982257747, 'samples': 4970880, 'steps': 25889, 'loss/train': 1.4797322750091553} 11/07/2021 00:51:39 - INFO - __main__ - Step 25891: {'lr': 0.0004685376210272527, 'samples': 4971072, 'steps': 25890, 'loss/train': 1.2445049285888672} 11/07/2021 00:51:39 - INFO - __main__ - Step 25892: {'lr': 0.00046853504373026107, 'samples': 4971264, 'steps': 25891, 'loss/train': 1.618612289428711} 11/07/2021 00:51:40 - INFO - __main__ - Step 25893: {'lr': 0.00046853246633480087, 'samples': 4971456, 'steps': 25892, 'loss/train': 0.987562358379364} 11/07/2021 00:51:40 - INFO - __main__ - Step 25894: {'lr': 0.0004685298888408733, 'samples': 4971648, 'steps': 25893, 'loss/train': 1.8932851552963257} 11/07/2021 00:51:40 - INFO - __main__ - Step 25895: {'lr': 0.0004685273112484796, 'samples': 4971840, 'steps': 25894, 'loss/train': 1.9858580827713013} 11/07/2021 00:51:41 - INFO - __main__ - Step 25896: {'lr': 0.0004685247335576209, 'samples': 4972032, 'steps': 25895, 'loss/train': 0.9556924104690552} 11/07/2021 00:51:42 - INFO - __main__ - Step 25897: {'lr': 0.00046852215576829824, 'samples': 4972224, 'steps': 25896, 'loss/train': 1.6164839267730713} 11/07/2021 00:51:42 - INFO - __main__ - Step 25898: {'lr': 0.0004685195778805129, 'samples': 4972416, 'steps': 25897, 'loss/train': 1.8304446935653687} 11/07/2021 00:51:42 - INFO - __main__ - Step 25899: {'lr': 0.000468516999894266, 'samples': 4972608, 'steps': 25898, 'loss/train': 1.8032854795455933} 11/07/2021 00:51:43 - INFO - __main__ - Step 25900: {'lr': 0.0004685144218095587, 'samples': 4972800, 'steps': 25899, 'loss/train': 1.3517341613769531} 11/07/2021 00:51:44 - INFO - __main__ - Step 25901: {'lr': 0.00046851184362639223, 'samples': 4972992, 'steps': 25900, 'loss/train': 1.5504690408706665} 11/07/2021 00:51:44 - INFO - __main__ - Step 25902: {'lr': 0.0004685092653447676, 'samples': 4973184, 'steps': 25901, 'loss/train': 1.3224358558654785} 11/07/2021 00:51:45 - INFO - __main__ - Step 25903: {'lr': 0.00046850668696468614, 'samples': 4973376, 'steps': 25902, 'loss/train': 1.6287102699279785} 11/07/2021 00:51:45 - INFO - __main__ - Step 25904: {'lr': 0.0004685041084861489, 'samples': 4973568, 'steps': 25903, 'loss/train': 1.6833339929580688} 11/07/2021 00:51:45 - INFO - __main__ - Step 25905: {'lr': 0.00046850152990915705, 'samples': 4973760, 'steps': 25904, 'loss/train': 1.308502435684204} 11/07/2021 00:51:46 - INFO - __main__ - Step 25906: {'lr': 0.0004684989512337119, 'samples': 4973952, 'steps': 25905, 'loss/train': 1.5645153522491455} 11/07/2021 00:51:47 - INFO - __main__ - Step 25907: {'lr': 0.00046849637245981434, 'samples': 4974144, 'steps': 25906, 'loss/train': 0.9965031147003174} 11/07/2021 00:51:47 - INFO - __main__ - Step 25908: {'lr': 0.0004684937935874658, 'samples': 4974336, 'steps': 25907, 'loss/train': 1.4623723030090332} 11/07/2021 00:51:47 - INFO - __main__ - Step 25909: {'lr': 0.00046849121461666734, 'samples': 4974528, 'steps': 25908, 'loss/train': 1.144979476928711} 11/07/2021 00:51:48 - INFO - __main__ - Step 25910: {'lr': 0.00046848863554742006, 'samples': 4974720, 'steps': 25909, 'loss/train': 1.27719247341156} 11/07/2021 00:51:49 - INFO - __main__ - Step 25911: {'lr': 0.0004684860563797252, 'samples': 4974912, 'steps': 25910, 'loss/train': 1.6980345249176025} 11/07/2021 00:51:49 - INFO - __main__ - Step 25912: {'lr': 0.00046848347711358384, 'samples': 4975104, 'steps': 25911, 'loss/train': 1.4265433549880981} 11/07/2021 00:51:49 - INFO - __main__ - Step 25913: {'lr': 0.0004684808977489973, 'samples': 4975296, 'steps': 25912, 'loss/train': 1.625329613685608} 11/07/2021 00:51:50 - INFO - __main__ - Step 25914: {'lr': 0.00046847831828596647, 'samples': 4975488, 'steps': 25913, 'loss/train': 1.4921817779541016} 11/07/2021 00:51:50 - INFO - __main__ - Step 25915: {'lr': 0.0004684757387244928, 'samples': 4975680, 'steps': 25914, 'loss/train': 1.9082525968551636} 11/07/2021 00:51:51 - INFO - __main__ - Step 25916: {'lr': 0.00046847315906457733, 'samples': 4975872, 'steps': 25915, 'loss/train': 1.6177970170974731} 11/07/2021 00:51:52 - INFO - __main__ - Step 25917: {'lr': 0.0004684705793062212, 'samples': 4976064, 'steps': 25916, 'loss/train': 1.7028489112854004} 11/07/2021 00:51:52 - INFO - __main__ - Step 25918: {'lr': 0.00046846799944942564, 'samples': 4976256, 'steps': 25917, 'loss/train': 0.4418361485004425} 11/07/2021 00:51:52 - INFO - __main__ - Step 25919: {'lr': 0.00046846541949419177, 'samples': 4976448, 'steps': 25918, 'loss/train': 1.2632676362991333} 11/07/2021 00:51:53 - INFO - __main__ - Step 25920: {'lr': 0.00046846283944052073, 'samples': 4976640, 'steps': 25919, 'loss/train': 1.417981743812561} 11/07/2021 00:51:54 - INFO - __main__ - Step 25921: {'lr': 0.0004684602592884136, 'samples': 4976832, 'steps': 25920, 'loss/train': 2.107151985168457} 11/07/2021 00:51:54 - INFO - __main__ - Step 25922: {'lr': 0.0004684576790378718, 'samples': 4977024, 'steps': 25921, 'loss/train': 1.2924543619155884} 11/07/2021 00:51:54 - INFO - __main__ - Step 25923: {'lr': 0.00046845509868889625, 'samples': 4977216, 'steps': 25922, 'loss/train': 1.6383317708969116} 11/07/2021 00:51:55 - INFO - __main__ - Step 25924: {'lr': 0.00046845251824148825, 'samples': 4977408, 'steps': 25923, 'loss/train': 1.748632788658142} 11/07/2021 00:51:55 - INFO - __main__ - Step 25925: {'lr': 0.0004684499376956489, 'samples': 4977600, 'steps': 25924, 'loss/train': 1.5455869436264038} 11/07/2021 00:51:55 - INFO - __main__ - Step 25926: {'lr': 0.00046844735705137944, 'samples': 4977792, 'steps': 25925, 'loss/train': 1.6856175661087036} 11/07/2021 00:51:56 - INFO - __main__ - Step 25927: {'lr': 0.0004684447763086809, 'samples': 4977984, 'steps': 25926, 'loss/train': 1.0863152742385864} 11/07/2021 00:51:57 - INFO - __main__ - Step 25928: {'lr': 0.00046844219546755454, 'samples': 4978176, 'steps': 25927, 'loss/train': 1.6790918111801147} 11/07/2021 00:51:57 - INFO - __main__ - Step 25929: {'lr': 0.0004684396145280014, 'samples': 4978368, 'steps': 25928, 'loss/train': 1.7366427183151245} 11/07/2021 00:51:57 - INFO - __main__ - Step 25930: {'lr': 0.00046843703349002286, 'samples': 4978560, 'steps': 25929, 'loss/train': 1.4150608777999878} 11/07/2021 00:51:58 - INFO - __main__ - Step 25931: {'lr': 0.00046843445235361994, 'samples': 4978752, 'steps': 25930, 'loss/train': 1.5314158201217651} 11/07/2021 00:51:59 - INFO - __main__ - Step 25932: {'lr': 0.0004684318711187938, 'samples': 4978944, 'steps': 25931, 'loss/train': 1.862462043762207} 11/07/2021 00:51:59 - INFO - __main__ - Step 25933: {'lr': 0.0004684292897855457, 'samples': 4979136, 'steps': 25932, 'loss/train': 2.1105966567993164} 11/07/2021 00:52:00 - INFO - __main__ - Step 25934: {'lr': 0.00046842670835387667, 'samples': 4979328, 'steps': 25933, 'loss/train': 1.3371798992156982} 11/07/2021 00:52:00 - INFO - __main__ - Step 25935: {'lr': 0.00046842412682378796, 'samples': 4979520, 'steps': 25934, 'loss/train': 1.817678451538086} 11/07/2021 00:52:00 - INFO - __main__ - Step 25936: {'lr': 0.0004684215451952807, 'samples': 4979712, 'steps': 25935, 'loss/train': 1.6571016311645508} 11/07/2021 00:52:01 - INFO - __main__ - Step 25937: {'lr': 0.000468418963468356, 'samples': 4979904, 'steps': 25936, 'loss/train': 1.8304779529571533} 11/07/2021 00:52:02 - INFO - __main__ - Step 25938: {'lr': 0.0004684163816430152, 'samples': 4980096, 'steps': 25937, 'loss/train': 1.5367423295974731} 11/07/2021 00:52:02 - INFO - __main__ - Step 25939: {'lr': 0.00046841379971925923, 'samples': 4980288, 'steps': 25938, 'loss/train': 1.5942084789276123} 11/07/2021 00:52:02 - INFO - __main__ - Step 25940: {'lr': 0.0004684112176970895, 'samples': 4980480, 'steps': 25939, 'loss/train': 1.5396287441253662} 11/07/2021 00:52:03 - INFO - __main__ - Step 25941: {'lr': 0.0004684086355765069, 'samples': 4980672, 'steps': 25940, 'loss/train': 1.4727085828781128} 11/07/2021 00:52:04 - INFO - __main__ - Step 25942: {'lr': 0.00046840605335751284, 'samples': 4980864, 'steps': 25941, 'loss/train': 1.663288950920105} 11/07/2021 00:52:04 - INFO - __main__ - Step 25943: {'lr': 0.0004684034710401084, 'samples': 4981056, 'steps': 25942, 'loss/train': 1.538399577140808} 11/07/2021 00:52:05 - INFO - __main__ - Step 25944: {'lr': 0.00046840088862429465, 'samples': 4981248, 'steps': 25943, 'loss/train': 1.3046379089355469} 11/07/2021 00:52:05 - INFO - __main__ - Step 25945: {'lr': 0.00046839830611007297, 'samples': 4981440, 'steps': 25944, 'loss/train': 1.7420538663864136} 11/07/2021 00:52:05 - INFO - __main__ - Step 25946: {'lr': 0.00046839572349744417, 'samples': 4981632, 'steps': 25945, 'loss/train': 1.4014198780059814} 11/07/2021 00:52:06 - INFO - __main__ - Step 25947: {'lr': 0.0004683931407864098, 'samples': 4981824, 'steps': 25946, 'loss/train': 1.6909465789794922} 11/07/2021 00:52:07 - INFO - __main__ - Step 25948: {'lr': 0.0004683905579769708, 'samples': 4982016, 'steps': 25947, 'loss/train': 1.726481556892395} 11/07/2021 00:52:07 - INFO - __main__ - Step 25949: {'lr': 0.0004683879750691283, 'samples': 4982208, 'steps': 25948, 'loss/train': 1.3670543432235718} 11/07/2021 00:52:07 - INFO - __main__ - Step 25950: {'lr': 0.00046838539206288366, 'samples': 4982400, 'steps': 25949, 'loss/train': 1.9250874519348145} 11/07/2021 00:52:08 - INFO - __main__ - Step 25951: {'lr': 0.00046838280895823795, 'samples': 4982592, 'steps': 25950, 'loss/train': 1.8457273244857788} 11/07/2021 00:52:08 - INFO - __main__ - Step 25952: {'lr': 0.0004683802257551922, 'samples': 4982784, 'steps': 25951, 'loss/train': 1.3912980556488037} 11/07/2021 00:52:09 - INFO - __main__ - Step 25953: {'lr': 0.00046837764245374777, 'samples': 4982976, 'steps': 25952, 'loss/train': 1.2061102390289307} 11/07/2021 00:52:10 - INFO - __main__ - Step 25954: {'lr': 0.0004683750590539057, 'samples': 4983168, 'steps': 25953, 'loss/train': 1.2821147441864014} 11/07/2021 00:52:10 - INFO - __main__ - Step 25955: {'lr': 0.00046837247555566727, 'samples': 4983360, 'steps': 25954, 'loss/train': 1.5056250095367432} 11/07/2021 00:52:10 - INFO - __main__ - Step 25956: {'lr': 0.00046836989195903344, 'samples': 4983552, 'steps': 25955, 'loss/train': 1.6103827953338623} 11/07/2021 00:52:11 - INFO - __main__ - Step 25957: {'lr': 0.00046836730826400565, 'samples': 4983744, 'steps': 25956, 'loss/train': 2.7427055835723877} 11/07/2021 00:52:12 - INFO - __main__ - Step 25958: {'lr': 0.00046836472447058485, 'samples': 4983936, 'steps': 25957, 'loss/train': 1.5038542747497559} 11/07/2021 00:52:12 - INFO - __main__ - Step 25959: {'lr': 0.0004683621405787723, 'samples': 4984128, 'steps': 25958, 'loss/train': 1.4676628112792969} 11/07/2021 00:52:12 - INFO - __main__ - Step 25960: {'lr': 0.0004683595565885691, 'samples': 4984320, 'steps': 25959, 'loss/train': 1.1955684423446655} 11/07/2021 00:52:13 - INFO - __main__ - Step 25961: {'lr': 0.0004683569724999765, 'samples': 4984512, 'steps': 25960, 'loss/train': 1.4902448654174805} 11/07/2021 00:52:13 - INFO - __main__ - Step 25962: {'lr': 0.0004683543883129956, 'samples': 4984704, 'steps': 25961, 'loss/train': 1.6452752351760864} 11/07/2021 00:52:14 - INFO - __main__ - Step 25963: {'lr': 0.00046835180402762756, 'samples': 4984896, 'steps': 25962, 'loss/train': 1.4151829481124878} 11/07/2021 00:52:14 - INFO - __main__ - Step 25964: {'lr': 0.00046834921964387363, 'samples': 4985088, 'steps': 25963, 'loss/train': 1.4597492218017578} 11/07/2021 00:52:15 - INFO - __main__ - Step 25965: {'lr': 0.0004683466351617348, 'samples': 4985280, 'steps': 25964, 'loss/train': 1.5860645771026611} 11/07/2021 00:52:15 - INFO - __main__ - Step 25966: {'lr': 0.00046834405058121244, 'samples': 4985472, 'steps': 25965, 'loss/train': 1.5850639343261719} 11/07/2021 00:52:15 - INFO - __main__ - Step 25967: {'lr': 0.0004683414659023076, 'samples': 4985664, 'steps': 25966, 'loss/train': 1.1708070039749146} 11/07/2021 00:52:16 - INFO - __main__ - Step 25968: {'lr': 0.0004683388811250214, 'samples': 4985856, 'steps': 25967, 'loss/train': 1.4193698167800903} 11/07/2021 00:52:17 - INFO - __main__ - Step 25969: {'lr': 0.0004683362962493552, 'samples': 4986048, 'steps': 25968, 'loss/train': 1.1845221519470215} 11/07/2021 00:52:17 - INFO - __main__ - Step 25970: {'lr': 0.00046833371127530995, 'samples': 4986240, 'steps': 25969, 'loss/train': 1.4896739721298218} 11/07/2021 00:52:18 - INFO - __main__ - Step 25971: {'lr': 0.00046833112620288684, 'samples': 4986432, 'steps': 25970, 'loss/train': 1.51420259475708} 11/07/2021 00:52:18 - INFO - __main__ - Step 25972: {'lr': 0.0004683285410320872, 'samples': 4986624, 'steps': 25971, 'loss/train': 1.2518377304077148} 11/07/2021 00:52:19 - INFO - __main__ - Step 25973: {'lr': 0.000468325955762912, 'samples': 4986816, 'steps': 25972, 'loss/train': 1.53132164478302} 11/07/2021 00:52:19 - INFO - __main__ - Step 25974: {'lr': 0.0004683233703953626, 'samples': 4987008, 'steps': 25973, 'loss/train': 1.5884449481964111} 11/07/2021 00:52:20 - INFO - __main__ - Step 25975: {'lr': 0.00046832078492944, 'samples': 4987200, 'steps': 25974, 'loss/train': 1.120357871055603} 11/07/2021 00:52:20 - INFO - __main__ - Step 25976: {'lr': 0.0004683181993651454, 'samples': 4987392, 'steps': 25975, 'loss/train': 1.7608742713928223} 11/07/2021 00:52:20 - INFO - __main__ - Step 25977: {'lr': 0.0004683156137024801, 'samples': 4987584, 'steps': 25976, 'loss/train': 1.5439300537109375} 11/07/2021 00:52:21 - INFO - __main__ - Step 25978: {'lr': 0.00046831302794144504, 'samples': 4987776, 'steps': 25977, 'loss/train': 1.1016311645507812} 11/07/2021 00:52:22 - INFO - __main__ - Step 25979: {'lr': 0.00046831044208204154, 'samples': 4987968, 'steps': 25978, 'loss/train': 1.5977383852005005} 11/07/2021 00:52:22 - INFO - __main__ - Step 25980: {'lr': 0.0004683078561242707, 'samples': 4988160, 'steps': 25979, 'loss/train': 1.0856884717941284} 11/07/2021 00:52:22 - INFO - __main__ - Step 25981: {'lr': 0.00046830527006813373, 'samples': 4988352, 'steps': 25980, 'loss/train': 1.28452730178833} 11/07/2021 00:52:23 - INFO - __main__ - Step 25982: {'lr': 0.00046830268391363176, 'samples': 4988544, 'steps': 25981, 'loss/train': 1.5783252716064453} 11/07/2021 00:52:24 - INFO - __main__ - Step 25983: {'lr': 0.0004683000976607659, 'samples': 4988736, 'steps': 25982, 'loss/train': 2.0148348808288574} 11/07/2021 00:52:24 - INFO - __main__ - Step 25984: {'lr': 0.00046829751130953747, 'samples': 4988928, 'steps': 25983, 'loss/train': 2.1376888751983643} 11/07/2021 00:52:25 - INFO - __main__ - Step 25985: {'lr': 0.0004682949248599476, 'samples': 4989120, 'steps': 25984, 'loss/train': 1.2700146436691284} 11/07/2021 00:52:25 - INFO - __main__ - Step 25986: {'lr': 0.0004682923383119973, 'samples': 4989312, 'steps': 25985, 'loss/train': 1.7473276853561401} 11/07/2021 00:52:25 - INFO - __main__ - Step 25987: {'lr': 0.0004682897516656879, 'samples': 4989504, 'steps': 25986, 'loss/train': 1.577156662940979} 11/07/2021 00:52:26 - INFO - __main__ - Step 25988: {'lr': 0.00046828716492102043, 'samples': 4989696, 'steps': 25987, 'loss/train': 1.371836543083191} 11/07/2021 00:52:27 - INFO - __main__ - Step 25989: {'lr': 0.0004682845780779962, 'samples': 4989888, 'steps': 25988, 'loss/train': 1.2244542837142944} 11/07/2021 00:52:27 - INFO - __main__ - Step 25990: {'lr': 0.00046828199113661627, 'samples': 4990080, 'steps': 25989, 'loss/train': 1.1802819967269897} 11/07/2021 00:52:27 - INFO - __main__ - Step 25991: {'lr': 0.0004682794040968819, 'samples': 4990272, 'steps': 25990, 'loss/train': 1.1837917566299438} 11/07/2021 00:52:28 - INFO - __main__ - Step 25992: {'lr': 0.0004682768169587942, 'samples': 4990464, 'steps': 25991, 'loss/train': 1.503637671470642} 11/07/2021 00:52:28 - INFO - __main__ - Step 25993: {'lr': 0.0004682742297223543, 'samples': 4990656, 'steps': 25992, 'loss/train': 1.3802827596664429} 11/07/2021 00:52:29 - INFO - __main__ - Step 25994: {'lr': 0.00046827164238756337, 'samples': 4990848, 'steps': 25993, 'loss/train': 1.4477992057800293} 11/07/2021 00:52:30 - INFO - __main__ - Step 25995: {'lr': 0.00046826905495442263, 'samples': 4991040, 'steps': 25994, 'loss/train': 1.8412169218063354} 11/07/2021 00:52:30 - INFO - __main__ - Step 25996: {'lr': 0.00046826646742293326, 'samples': 4991232, 'steps': 25995, 'loss/train': 1.4182357788085938} 11/07/2021 00:52:30 - INFO - __main__ - Step 25997: {'lr': 0.00046826387979309635, 'samples': 4991424, 'steps': 25996, 'loss/train': 1.7624053955078125} 11/07/2021 00:52:31 - INFO - __main__ - Step 25998: {'lr': 0.0004682612920649131, 'samples': 4991616, 'steps': 25997, 'loss/train': 1.870514988899231} 11/07/2021 00:52:32 - INFO - __main__ - Step 25999: {'lr': 0.00046825870423838466, 'samples': 4991808, 'steps': 25998, 'loss/train': 1.5395135879516602} 11/07/2021 00:52:32 - INFO - __main__ - Step 26000: {'lr': 0.00046825611631351227, 'samples': 4992000, 'steps': 25999, 'loss/train': 1.233712911605835} 11/07/2021 00:52:32 - INFO - __main__ - Step 26001: {'lr': 0.00046825352829029705, 'samples': 4992192, 'steps': 26000, 'loss/train': 1.740976095199585} 11/07/2021 00:52:33 - INFO - __main__ - Step 26002: {'lr': 0.00046825094016874014, 'samples': 4992384, 'steps': 26001, 'loss/train': 1.8134573698043823} 11/07/2021 00:52:33 - INFO - __main__ - Step 26003: {'lr': 0.00046824835194884273, 'samples': 4992576, 'steps': 26002, 'loss/train': 1.9249521493911743} 11/07/2021 00:52:34 - INFO - __main__ - Step 26004: {'lr': 0.0004682457636306059, 'samples': 4992768, 'steps': 26003, 'loss/train': 1.6932024955749512} 11/07/2021 00:52:35 - INFO - __main__ - Step 26005: {'lr': 0.000468243175214031, 'samples': 4992960, 'steps': 26004, 'loss/train': 1.567561149597168} 11/07/2021 00:52:35 - INFO - __main__ - Step 26006: {'lr': 0.00046824058669911906, 'samples': 4993152, 'steps': 26005, 'loss/train': 1.3454017639160156} 11/07/2021 00:52:35 - INFO - __main__ - Step 26007: {'lr': 0.00046823799808587126, 'samples': 4993344, 'steps': 26006, 'loss/train': 2.0492119789123535} 11/07/2021 00:52:36 - INFO - __main__ - Step 26008: {'lr': 0.00046823540937428876, 'samples': 4993536, 'steps': 26007, 'loss/train': 1.4472533464431763} 11/07/2021 00:52:37 - INFO - __main__ - Step 26009: {'lr': 0.0004682328205643728, 'samples': 4993728, 'steps': 26008, 'loss/train': 2.2082765102386475} 11/07/2021 00:52:37 - INFO - __main__ - Step 26010: {'lr': 0.00046823023165612455, 'samples': 4993920, 'steps': 26009, 'loss/train': 1.7340260744094849} 11/07/2021 00:52:37 - INFO - __main__ - Step 26011: {'lr': 0.000468227642649545, 'samples': 4994112, 'steps': 26010, 'loss/train': 1.229740858078003} 11/07/2021 00:52:38 - INFO - __main__ - Step 26012: {'lr': 0.00046822505354463553, 'samples': 4994304, 'steps': 26011, 'loss/train': 1.5559407472610474} 11/07/2021 00:52:38 - INFO - __main__ - Step 26013: {'lr': 0.0004682224643413972, 'samples': 4994496, 'steps': 26012, 'loss/train': 1.1459075212478638} 11/07/2021 00:52:39 - INFO - __main__ - Step 26014: {'lr': 0.0004682198750398312, 'samples': 4994688, 'steps': 26013, 'loss/train': 1.929360032081604} 11/07/2021 00:52:39 - INFO - __main__ - Step 26015: {'lr': 0.00046821728563993867, 'samples': 4994880, 'steps': 26014, 'loss/train': 1.8622995615005493} 11/07/2021 00:52:40 - INFO - __main__ - Step 26016: {'lr': 0.0004682146961417208, 'samples': 4995072, 'steps': 26015, 'loss/train': 1.5445281267166138} 11/07/2021 00:52:40 - INFO - __main__ - Step 26017: {'lr': 0.00046821210654517874, 'samples': 4995264, 'steps': 26016, 'loss/train': 1.3797636032104492} 11/07/2021 00:52:40 - INFO - __main__ - Step 26018: {'lr': 0.0004682095168503137, 'samples': 4995456, 'steps': 26017, 'loss/train': 1.3731129169464111} 11/07/2021 00:52:41 - INFO - __main__ - Step 26019: {'lr': 0.00046820692705712685, 'samples': 4995648, 'steps': 26018, 'loss/train': 2.186852216720581} 11/07/2021 00:52:42 - INFO - __main__ - Step 26020: {'lr': 0.00046820433716561927, 'samples': 4995840, 'steps': 26019, 'loss/train': 1.5869444608688354} 11/07/2021 00:52:42 - INFO - __main__ - Step 26021: {'lr': 0.0004682017471757922, 'samples': 4996032, 'steps': 26020, 'loss/train': 1.7409318685531616} 11/07/2021 00:52:42 - INFO - __main__ - Step 26022: {'lr': 0.0004681991570876468, 'samples': 4996224, 'steps': 26021, 'loss/train': 1.7557164430618286} 11/07/2021 00:52:43 - INFO - __main__ - Step 26023: {'lr': 0.00046819656690118424, 'samples': 4996416, 'steps': 26022, 'loss/train': 1.6353843212127686} 11/07/2021 00:52:44 - INFO - __main__ - Step 26024: {'lr': 0.00046819397661640563, 'samples': 4996608, 'steps': 26023, 'loss/train': 1.4602898359298706} 11/07/2021 00:52:44 - INFO - __main__ - Step 26025: {'lr': 0.0004681913862333122, 'samples': 4996800, 'steps': 26024, 'loss/train': 1.1732585430145264} 11/07/2021 00:52:45 - INFO - __main__ - Step 26026: {'lr': 0.0004681887957519051, 'samples': 4996992, 'steps': 26025, 'loss/train': 1.5690240859985352} 11/07/2021 00:52:45 - INFO - __main__ - Step 26027: {'lr': 0.00046818620517218544, 'samples': 4997184, 'steps': 26026, 'loss/train': 1.5888382196426392} 11/07/2021 00:52:45 - INFO - __main__ - Step 26028: {'lr': 0.00046818361449415456, 'samples': 4997376, 'steps': 26027, 'loss/train': 1.7676751613616943} 11/07/2021 00:52:46 - INFO - __main__ - Step 26029: {'lr': 0.00046818102371781343, 'samples': 4997568, 'steps': 26028, 'loss/train': 1.2550809383392334} 11/07/2021 00:52:47 - INFO - __main__ - Step 26030: {'lr': 0.0004681784328431633, 'samples': 4997760, 'steps': 26029, 'loss/train': 1.797165036201477} 11/07/2021 00:52:47 - INFO - __main__ - Step 26031: {'lr': 0.0004681758418702054, 'samples': 4997952, 'steps': 26030, 'loss/train': 1.6159076690673828} 11/07/2021 00:52:47 - INFO - __main__ - Step 26032: {'lr': 0.0004681732507989408, 'samples': 4998144, 'steps': 26031, 'loss/train': 1.2013065814971924} 11/07/2021 00:52:48 - INFO - __main__ - Step 26033: {'lr': 0.00046817065962937067, 'samples': 4998336, 'steps': 26032, 'loss/train': 1.5733553171157837} 11/07/2021 00:52:48 - INFO - __main__ - Step 26034: {'lr': 0.00046816806836149624, 'samples': 4998528, 'steps': 26033, 'loss/train': 1.91973078250885} 11/07/2021 00:52:49 - INFO - __main__ - Step 26035: {'lr': 0.00046816547699531866, 'samples': 4998720, 'steps': 26034, 'loss/train': 1.1797672510147095} 11/07/2021 00:52:49 - INFO - __main__ - Step 26036: {'lr': 0.000468162885530839, 'samples': 4998912, 'steps': 26035, 'loss/train': 2.0054972171783447} 11/07/2021 00:52:50 - INFO - __main__ - Step 26037: {'lr': 0.00046816029396805857, 'samples': 4999104, 'steps': 26036, 'loss/train': 1.3194879293441772} 11/07/2021 00:52:50 - INFO - __main__ - Step 26038: {'lr': 0.00046815770230697844, 'samples': 4999296, 'steps': 26037, 'loss/train': 1.9048802852630615} 11/07/2021 00:52:50 - INFO - __main__ - Step 26039: {'lr': 0.0004681551105475999, 'samples': 4999488, 'steps': 26038, 'loss/train': 1.6333225965499878} 11/07/2021 00:52:51 - INFO - __main__ - Step 26040: {'lr': 0.0004681525186899239, 'samples': 4999680, 'steps': 26039, 'loss/train': 0.944560170173645} 11/07/2021 00:52:52 - INFO - __main__ - Step 26041: {'lr': 0.00046814992673395185, 'samples': 4999872, 'steps': 26040, 'loss/train': 2.0325982570648193} 11/07/2021 00:52:52 - INFO - __main__ - Step 26042: {'lr': 0.0004681473346796848, 'samples': 5000064, 'steps': 26041, 'loss/train': 1.6044882535934448} 11/07/2021 00:52:52 - INFO - __main__ - Step 26043: {'lr': 0.0004681447425271239, 'samples': 5000256, 'steps': 26042, 'loss/train': 1.1876591444015503} 11/07/2021 00:52:53 - INFO - __main__ - Step 26044: {'lr': 0.0004681421502762704, 'samples': 5000448, 'steps': 26043, 'loss/train': 1.3685396909713745} 11/07/2021 00:52:54 - INFO - __main__ - Step 26045: {'lr': 0.0004681395579271253, 'samples': 5000640, 'steps': 26044, 'loss/train': 1.4207525253295898} 11/07/2021 00:52:54 - INFO - __main__ - Step 26046: {'lr': 0.00046813696547969, 'samples': 5000832, 'steps': 26045, 'loss/train': 1.6542054414749146} 11/07/2021 00:52:55 - INFO - __main__ - Step 26047: {'lr': 0.00046813437293396543, 'samples': 5001024, 'steps': 26046, 'loss/train': 1.1289676427841187} 11/07/2021 00:52:55 - INFO - __main__ - Step 26048: {'lr': 0.000468131780289953, 'samples': 5001216, 'steps': 26047, 'loss/train': 1.707362174987793} 11/07/2021 00:52:55 - INFO - __main__ - Step 26049: {'lr': 0.00046812918754765364, 'samples': 5001408, 'steps': 26048, 'loss/train': 1.5160295963287354} 11/07/2021 00:52:57 - INFO - __main__ - Step 26050: {'lr': 0.00046812659470706877, 'samples': 5001600, 'steps': 26049, 'loss/train': 1.486424207687378} 11/07/2021 00:52:57 - INFO - __main__ - Step 26051: {'lr': 0.0004681240017681993, 'samples': 5001792, 'steps': 26050, 'loss/train': 1.995141863822937} 11/07/2021 00:52:57 - INFO - __main__ - Step 26052: {'lr': 0.00046812140873104657, 'samples': 5001984, 'steps': 26051, 'loss/train': 0.976646363735199} 11/07/2021 00:52:58 - INFO - __main__ - Step 26053: {'lr': 0.00046811881559561167, 'samples': 5002176, 'steps': 26052, 'loss/train': 1.8532328605651855} 11/07/2021 00:52:58 - INFO - __main__ - Step 26054: {'lr': 0.00046811622236189585, 'samples': 5002368, 'steps': 26053, 'loss/train': 1.7580597400665283} 11/07/2021 00:52:58 - INFO - __main__ - Step 26055: {'lr': 0.0004681136290299002, 'samples': 5002560, 'steps': 26054, 'loss/train': 1.3783228397369385} 11/07/2021 00:52:59 - INFO - __main__ - Step 26056: {'lr': 0.00046811103559962585, 'samples': 5002752, 'steps': 26055, 'loss/train': 1.3633968830108643} 11/07/2021 00:53:00 - INFO - __main__ - Step 26057: {'lr': 0.00046810844207107415, 'samples': 5002944, 'steps': 26056, 'loss/train': 1.6410704851150513} 11/07/2021 00:53:00 - INFO - __main__ - Step 26058: {'lr': 0.0004681058484442461, 'samples': 5003136, 'steps': 26057, 'loss/train': 1.8240699768066406} 11/07/2021 00:53:00 - INFO - __main__ - Step 26059: {'lr': 0.00046810325471914295, 'samples': 5003328, 'steps': 26058, 'loss/train': 1.9498460292816162} 11/07/2021 00:53:01 - INFO - __main__ - Step 26060: {'lr': 0.00046810066089576573, 'samples': 5003520, 'steps': 26059, 'loss/train': 1.6209688186645508} 11/07/2021 00:53:01 - INFO - __main__ - Step 26061: {'lr': 0.00046809806697411583, 'samples': 5003712, 'steps': 26060, 'loss/train': 1.3615363836288452} 11/07/2021 00:53:02 - INFO - __main__ - Step 26062: {'lr': 0.0004680954729541942, 'samples': 5003904, 'steps': 26061, 'loss/train': 1.435072898864746} 11/07/2021 00:53:03 - INFO - __main__ - Step 26063: {'lr': 0.00046809287883600227, 'samples': 5004096, 'steps': 26062, 'loss/train': 1.6992723941802979} 11/07/2021 00:53:03 - INFO - __main__ - Step 26064: {'lr': 0.00046809028461954093, 'samples': 5004288, 'steps': 26063, 'loss/train': 1.3957271575927734} 11/07/2021 00:53:03 - INFO - __main__ - Step 26065: {'lr': 0.00046808769030481153, 'samples': 5004480, 'steps': 26064, 'loss/train': 1.38559091091156} 11/07/2021 00:53:04 - INFO - __main__ - Step 26066: {'lr': 0.00046808509589181513, 'samples': 5004672, 'steps': 26065, 'loss/train': 1.3515117168426514} 11/07/2021 00:53:05 - INFO - __main__ - Step 26067: {'lr': 0.00046808250138055305, 'samples': 5004864, 'steps': 26066, 'loss/train': 1.3374156951904297} 11/07/2021 00:53:05 - INFO - __main__ - Step 26068: {'lr': 0.0004680799067710263, 'samples': 5005056, 'steps': 26067, 'loss/train': 1.1858316659927368} 11/07/2021 00:53:05 - INFO - __main__ - Step 26069: {'lr': 0.00046807731206323605, 'samples': 5005248, 'steps': 26068, 'loss/train': 1.3988279104232788} 11/07/2021 00:53:06 - INFO - __main__ - Step 26070: {'lr': 0.00046807471725718357, 'samples': 5005440, 'steps': 26069, 'loss/train': 1.8223246335983276} 11/07/2021 00:53:06 - INFO - __main__ - Step 26071: {'lr': 0.00046807212235287, 'samples': 5005632, 'steps': 26070, 'loss/train': 1.149131417274475} 11/07/2021 00:53:07 - INFO - __main__ - Step 26072: {'lr': 0.0004680695273502965, 'samples': 5005824, 'steps': 26071, 'loss/train': 1.7531110048294067} 11/07/2021 00:53:08 - INFO - __main__ - Step 26073: {'lr': 0.00046806693224946426, 'samples': 5006016, 'steps': 26072, 'loss/train': 1.817716121673584} 11/07/2021 00:53:08 - INFO - __main__ - Step 26074: {'lr': 0.00046806433705037445, 'samples': 5006208, 'steps': 26073, 'loss/train': 1.7162048816680908} 11/07/2021 00:53:08 - INFO - __main__ - Step 26075: {'lr': 0.00046806174175302806, 'samples': 5006400, 'steps': 26074, 'loss/train': 0.9646897912025452} 11/07/2021 00:53:09 - INFO - __main__ - Step 26076: {'lr': 0.00046805914635742656, 'samples': 5006592, 'steps': 26075, 'loss/train': 1.3818391561508179} 11/07/2021 00:53:10 - INFO - __main__ - Step 26077: {'lr': 0.0004680565508635709, 'samples': 5006784, 'steps': 26076, 'loss/train': 1.4534307718276978} 11/07/2021 00:53:10 - INFO - __main__ - Step 26078: {'lr': 0.00046805395527146237, 'samples': 5006976, 'steps': 26077, 'loss/train': 1.2013325691223145} 11/07/2021 00:53:10 - INFO - __main__ - Step 26079: {'lr': 0.0004680513595811021, 'samples': 5007168, 'steps': 26078, 'loss/train': 1.4902340173721313} 11/07/2021 00:53:11 - INFO - __main__ - Step 26080: {'lr': 0.0004680487637924912, 'samples': 5007360, 'steps': 26079, 'loss/train': 1.373033046722412} 11/07/2021 00:53:11 - INFO - __main__ - Step 26081: {'lr': 0.0004680461679056309, 'samples': 5007552, 'steps': 26080, 'loss/train': 1.8461296558380127} 11/07/2021 00:53:12 - INFO - __main__ - Step 26082: {'lr': 0.00046804357192052246, 'samples': 5007744, 'steps': 26081, 'loss/train': 1.5331897735595703} 11/07/2021 00:53:13 - INFO - __main__ - Step 26083: {'lr': 0.00046804097583716685, 'samples': 5007936, 'steps': 26082, 'loss/train': 1.252344012260437} 11/07/2021 00:53:13 - INFO - __main__ - Step 26084: {'lr': 0.0004680383796555654, 'samples': 5008128, 'steps': 26083, 'loss/train': 1.4637391567230225} 11/07/2021 00:53:13 - INFO - __main__ - Step 26085: {'lr': 0.00046803578337571917, 'samples': 5008320, 'steps': 26084, 'loss/train': 1.5039665699005127} 11/07/2021 00:53:14 - INFO - __main__ - Step 26086: {'lr': 0.00046803318699762937, 'samples': 5008512, 'steps': 26085, 'loss/train': 1.2997361421585083} 11/07/2021 00:53:15 - INFO - __main__ - Step 26087: {'lr': 0.0004680305905212972, 'samples': 5008704, 'steps': 26086, 'loss/train': 1.739014983177185} 11/07/2021 00:53:15 - INFO - __main__ - Step 26088: {'lr': 0.0004680279939467238, 'samples': 5008896, 'steps': 26087, 'loss/train': 1.3671000003814697} 11/07/2021 00:53:16 - INFO - __main__ - Step 26089: {'lr': 0.00046802539727391033, 'samples': 5009088, 'steps': 26088, 'loss/train': 1.5393263101577759} 11/07/2021 00:53:16 - INFO - __main__ - Step 26090: {'lr': 0.0004680228005028581, 'samples': 5009280, 'steps': 26089, 'loss/train': 1.33014714717865} 11/07/2021 00:53:16 - INFO - __main__ - Step 26091: {'lr': 0.000468020203633568, 'samples': 5009472, 'steps': 26090, 'loss/train': 1.4916388988494873} 11/07/2021 00:53:17 - INFO - __main__ - Step 26092: {'lr': 0.0004680176066660415, 'samples': 5009664, 'steps': 26091, 'loss/train': 1.7128859758377075} 11/07/2021 00:53:18 - INFO - __main__ - Step 26093: {'lr': 0.00046801500960027957, 'samples': 5009856, 'steps': 26092, 'loss/train': 1.4452115297317505} 11/07/2021 00:53:18 - INFO - __main__ - Step 26094: {'lr': 0.00046801241243628344, 'samples': 5010048, 'steps': 26093, 'loss/train': 1.9708287715911865} 11/07/2021 00:53:18 - INFO - __main__ - Step 26095: {'lr': 0.00046800981517405426, 'samples': 5010240, 'steps': 26094, 'loss/train': 1.3643174171447754} 11/07/2021 00:53:19 - INFO - __main__ - Step 26096: {'lr': 0.0004680072178135932, 'samples': 5010432, 'steps': 26095, 'loss/train': 2.488478899002075} 11/07/2021 00:53:19 - INFO - __main__ - Step 26097: {'lr': 0.00046800462035490156, 'samples': 5010624, 'steps': 26096, 'loss/train': 1.4790648221969604} 11/07/2021 00:53:20 - INFO - __main__ - Step 26098: {'lr': 0.0004680020227979803, 'samples': 5010816, 'steps': 26097, 'loss/train': 1.0440701246261597} 11/07/2021 00:53:20 - INFO - __main__ - Step 26099: {'lr': 0.0004679994251428308, 'samples': 5011008, 'steps': 26098, 'loss/train': 1.7082839012145996} 11/07/2021 00:53:21 - INFO - __main__ - Step 26100: {'lr': 0.00046799682738945397, 'samples': 5011200, 'steps': 26099, 'loss/train': 1.8536376953125} 11/07/2021 00:53:21 - INFO - __main__ - Step 26101: {'lr': 0.00046799422953785124, 'samples': 5011392, 'steps': 26100, 'loss/train': 1.5862019062042236} 11/07/2021 00:53:21 - INFO - __main__ - Step 26102: {'lr': 0.00046799163158802365, 'samples': 5011584, 'steps': 26101, 'loss/train': 1.9192478656768799} 11/07/2021 00:53:22 - INFO - __main__ - Step 26103: {'lr': 0.00046798903353997243, 'samples': 5011776, 'steps': 26102, 'loss/train': 1.8214565515518188} 11/07/2021 00:53:23 - INFO - __main__ - Step 26104: {'lr': 0.0004679864353936987, 'samples': 5011968, 'steps': 26103, 'loss/train': 2.0594255924224854} 11/07/2021 00:53:23 - INFO - __main__ - Step 26105: {'lr': 0.0004679838371492036, 'samples': 5012160, 'steps': 26104, 'loss/train': 1.3771926164627075} 11/07/2021 00:53:23 - INFO - __main__ - Step 26106: {'lr': 0.00046798123880648833, 'samples': 5012352, 'steps': 26105, 'loss/train': 1.7747282981872559} 11/07/2021 00:53:24 - INFO - __main__ - Step 26107: {'lr': 0.0004679786403655542, 'samples': 5012544, 'steps': 26106, 'loss/train': 1.5721908807754517} 11/07/2021 00:53:25 - INFO - __main__ - Step 26108: {'lr': 0.0004679760418264021, 'samples': 5012736, 'steps': 26107, 'loss/train': 1.555148720741272} 11/07/2021 00:53:25 - INFO - __main__ - Step 26109: {'lr': 0.00046797344318903343, 'samples': 5012928, 'steps': 26108, 'loss/train': 1.344144582748413} 11/07/2021 00:53:26 - INFO - __main__ - Step 26110: {'lr': 0.0004679708444534493, 'samples': 5013120, 'steps': 26109, 'loss/train': 0.920870304107666} 11/07/2021 00:53:26 - INFO - __main__ - Step 26111: {'lr': 0.0004679682456196509, 'samples': 5013312, 'steps': 26110, 'loss/train': 1.5601238012313843} 11/07/2021 00:53:26 - INFO - __main__ - Step 26112: {'lr': 0.0004679656466876393, 'samples': 5013504, 'steps': 26111, 'loss/train': 1.429107904434204} 11/07/2021 00:53:27 - INFO - __main__ - Step 26113: {'lr': 0.00046796304765741583, 'samples': 5013696, 'steps': 26112, 'loss/train': 1.5923376083374023} 11/07/2021 00:53:28 - INFO - __main__ - Step 26114: {'lr': 0.00046796044852898144, 'samples': 5013888, 'steps': 26113, 'loss/train': 1.8528300523757935} 11/07/2021 00:53:28 - INFO - __main__ - Step 26115: {'lr': 0.0004679578493023375, 'samples': 5014080, 'steps': 26114, 'loss/train': 1.4153774976730347} 11/07/2021 00:53:28 - INFO - __main__ - Step 26116: {'lr': 0.00046795524997748515, 'samples': 5014272, 'steps': 26115, 'loss/train': 1.7058981657028198} 11/07/2021 00:53:29 - INFO - __main__ - Step 26117: {'lr': 0.0004679526505544256, 'samples': 5014464, 'steps': 26116, 'loss/train': 1.5847781896591187} 11/07/2021 00:53:30 - INFO - __main__ - Step 26118: {'lr': 0.0004679500510331598, 'samples': 5014656, 'steps': 26117, 'loss/train': 1.441420078277588} 11/07/2021 00:53:30 - INFO - __main__ - Step 26119: {'lr': 0.00046794745141368917, 'samples': 5014848, 'steps': 26118, 'loss/train': 1.183732032775879} 11/07/2021 00:53:30 - INFO - __main__ - Step 26120: {'lr': 0.00046794485169601474, 'samples': 5015040, 'steps': 26119, 'loss/train': 1.5869964361190796} 11/07/2021 00:53:31 - INFO - __main__ - Step 26121: {'lr': 0.00046794225188013773, 'samples': 5015232, 'steps': 26120, 'loss/train': 1.3868657350540161} 11/07/2021 00:53:31 - INFO - __main__ - Step 26122: {'lr': 0.00046793965196605927, 'samples': 5015424, 'steps': 26121, 'loss/train': 1.588295817375183} 11/07/2021 00:53:31 - INFO - __main__ - Step 26123: {'lr': 0.00046793705195378066, 'samples': 5015616, 'steps': 26122, 'loss/train': 2.0373282432556152} 11/07/2021 00:53:32 - INFO - __main__ - Step 26124: {'lr': 0.0004679344518433029, 'samples': 5015808, 'steps': 26123, 'loss/train': 1.2457910776138306} 11/07/2021 00:53:33 - INFO - __main__ - Step 26125: {'lr': 0.0004679318516346273, 'samples': 5016000, 'steps': 26124, 'loss/train': 1.1270509958267212} 11/07/2021 00:53:33 - INFO - __main__ - Step 26126: {'lr': 0.0004679292513277549, 'samples': 5016192, 'steps': 26125, 'loss/train': 1.675414800643921} 11/07/2021 00:53:33 - INFO - __main__ - Step 26127: {'lr': 0.0004679266509226869, 'samples': 5016384, 'steps': 26126, 'loss/train': 1.6026920080184937} 11/07/2021 00:53:34 - INFO - __main__ - Step 26128: {'lr': 0.0004679240504194246, 'samples': 5016576, 'steps': 26127, 'loss/train': 1.3696702718734741} 11/07/2021 00:53:35 - INFO - __main__ - Step 26129: {'lr': 0.00046792144981796905, 'samples': 5016768, 'steps': 26128, 'loss/train': 1.309404969215393} 11/07/2021 00:53:35 - INFO - __main__ - Step 26130: {'lr': 0.0004679188491183215, 'samples': 5016960, 'steps': 26129, 'loss/train': 1.17288339138031} 11/07/2021 00:53:36 - INFO - __main__ - Step 26131: {'lr': 0.00046791624832048307, 'samples': 5017152, 'steps': 26130, 'loss/train': 1.457753300666809} 11/07/2021 00:53:36 - INFO - __main__ - Step 26132: {'lr': 0.0004679136474244549, 'samples': 5017344, 'steps': 26131, 'loss/train': 1.6909582614898682} 11/07/2021 00:53:36 - INFO - __main__ - Step 26133: {'lr': 0.00046791104643023823, 'samples': 5017536, 'steps': 26132, 'loss/train': 1.6915613412857056} 11/07/2021 00:53:37 - INFO - __main__ - Step 26134: {'lr': 0.0004679084453378342, 'samples': 5017728, 'steps': 26133, 'loss/train': 1.97373366355896} 11/07/2021 00:53:38 - INFO - __main__ - Step 26135: {'lr': 0.00046790584414724404, 'samples': 5017920, 'steps': 26134, 'loss/train': 1.0392847061157227} 11/07/2021 00:53:38 - INFO - __main__ - Step 26136: {'lr': 0.0004679032428584687, 'samples': 5018112, 'steps': 26135, 'loss/train': 1.4240553379058838} 11/07/2021 00:53:38 - INFO - __main__ - Step 26137: {'lr': 0.0004679006414715097, 'samples': 5018304, 'steps': 26136, 'loss/train': 1.4850119352340698} 11/07/2021 00:53:39 - INFO - __main__ - Step 26138: {'lr': 0.00046789803998636796, 'samples': 5018496, 'steps': 26137, 'loss/train': 1.529703974723816} 11/07/2021 00:53:40 - INFO - __main__ - Step 26139: {'lr': 0.0004678954384030448, 'samples': 5018688, 'steps': 26138, 'loss/train': 1.807644248008728} 11/07/2021 00:53:40 - INFO - __main__ - Step 26140: {'lr': 0.00046789283672154125, 'samples': 5018880, 'steps': 26139, 'loss/train': 1.7455646991729736} 11/07/2021 00:53:40 - INFO - __main__ - Step 26141: {'lr': 0.00046789023494185855, 'samples': 5019072, 'steps': 26140, 'loss/train': 1.525965929031372} 11/07/2021 00:53:41 - INFO - __main__ - Step 26142: {'lr': 0.0004678876330639978, 'samples': 5019264, 'steps': 26141, 'loss/train': 1.9528864622116089} 11/07/2021 00:53:41 - INFO - __main__ - Step 26143: {'lr': 0.0004678850310879604, 'samples': 5019456, 'steps': 26142, 'loss/train': 1.6074074506759644} 11/07/2021 00:53:42 - INFO - __main__ - Step 26144: {'lr': 0.0004678824290137473, 'samples': 5019648, 'steps': 26143, 'loss/train': 1.5852299928665161} 11/07/2021 00:53:43 - INFO - __main__ - Step 26145: {'lr': 0.0004678798268413597, 'samples': 5019840, 'steps': 26144, 'loss/train': 0.9158662557601929} 11/07/2021 00:53:43 - INFO - __main__ - Step 26146: {'lr': 0.00046787722457079887, 'samples': 5020032, 'steps': 26145, 'loss/train': 1.5754239559173584} 11/07/2021 00:53:43 - INFO - __main__ - Step 26147: {'lr': 0.00046787462220206587, 'samples': 5020224, 'steps': 26146, 'loss/train': 1.568524956703186} 11/07/2021 00:53:44 - INFO - __main__ - Step 26148: {'lr': 0.00046787201973516195, 'samples': 5020416, 'steps': 26147, 'loss/train': 1.5438616275787354} 11/07/2021 00:53:44 - INFO - __main__ - Step 26149: {'lr': 0.00046786941717008823, 'samples': 5020608, 'steps': 26148, 'loss/train': 1.8527145385742188} 11/07/2021 00:53:45 - INFO - __main__ - Step 26150: {'lr': 0.00046786681450684597, 'samples': 5020800, 'steps': 26149, 'loss/train': 1.8514100313186646} 11/07/2021 00:53:45 - INFO - __main__ - Step 26151: {'lr': 0.00046786421174543625, 'samples': 5020992, 'steps': 26150, 'loss/train': 0.8867548704147339} 11/07/2021 00:53:46 - INFO - __main__ - Step 26152: {'lr': 0.0004678616088858603, 'samples': 5021184, 'steps': 26151, 'loss/train': 1.5728057622909546} 11/07/2021 00:53:46 - INFO - __main__ - Step 26153: {'lr': 0.0004678590059281193, 'samples': 5021376, 'steps': 26152, 'loss/train': 1.5114586353302002} 11/07/2021 00:53:46 - INFO - __main__ - Step 26154: {'lr': 0.0004678564028722143, 'samples': 5021568, 'steps': 26153, 'loss/train': 1.5125452280044556} 11/07/2021 00:53:47 - INFO - __main__ - Step 26155: {'lr': 0.0004678537997181467, 'samples': 5021760, 'steps': 26154, 'loss/train': 1.6505911350250244} 11/07/2021 00:53:48 - INFO - __main__ - Step 26156: {'lr': 0.00046785119646591746, 'samples': 5021952, 'steps': 26155, 'loss/train': 1.7003118991851807} 11/07/2021 00:53:48 - INFO - __main__ - Step 26157: {'lr': 0.0004678485931155278, 'samples': 5022144, 'steps': 26156, 'loss/train': 1.5242116451263428} 11/07/2021 00:53:49 - INFO - __main__ - Step 26158: {'lr': 0.000467845989666979, 'samples': 5022336, 'steps': 26157, 'loss/train': 1.6533890962600708} 11/07/2021 00:53:49 - INFO - __main__ - Step 26159: {'lr': 0.0004678433861202721, 'samples': 5022528, 'steps': 26158, 'loss/train': 1.1330708265304565} 11/07/2021 00:53:50 - INFO - __main__ - Step 26160: {'lr': 0.0004678407824754083, 'samples': 5022720, 'steps': 26159, 'loss/train': 1.8086222410202026} 11/07/2021 00:53:50 - INFO - __main__ - Step 26161: {'lr': 0.00046783817873238885, 'samples': 5022912, 'steps': 26160, 'loss/train': 1.5170518159866333} 11/07/2021 00:53:51 - INFO - __main__ - Step 26162: {'lr': 0.0004678355748912149, 'samples': 5023104, 'steps': 26161, 'loss/train': 1.6570253372192383} 11/07/2021 00:53:51 - INFO - __main__ - Step 26163: {'lr': 0.0004678329709518876, 'samples': 5023296, 'steps': 26162, 'loss/train': 1.50641930103302} 11/07/2021 00:53:51 - INFO - __main__ - Step 26164: {'lr': 0.0004678303669144081, 'samples': 5023488, 'steps': 26163, 'loss/train': 1.6530953645706177} 11/07/2021 00:53:52 - INFO - __main__ - Step 26165: {'lr': 0.0004678277627787776, 'samples': 5023680, 'steps': 26164, 'loss/train': 0.966736912727356} 11/07/2021 00:53:53 - INFO - __main__ - Step 26166: {'lr': 0.0004678251585449973, 'samples': 5023872, 'steps': 26165, 'loss/train': 1.3423575162887573} 11/07/2021 00:53:53 - INFO - __main__ - Step 26167: {'lr': 0.0004678225542130683, 'samples': 5024064, 'steps': 26166, 'loss/train': 1.4492461681365967} 11/07/2021 00:53:53 - INFO - __main__ - Step 26168: {'lr': 0.0004678199497829919, 'samples': 5024256, 'steps': 26167, 'loss/train': 1.3655349016189575} 11/07/2021 00:53:54 - INFO - __main__ - Step 26169: {'lr': 0.0004678173452547691, 'samples': 5024448, 'steps': 26168, 'loss/train': 2.049821615219116} 11/07/2021 00:53:55 - INFO - __main__ - Step 26170: {'lr': 0.00046781474062840126, 'samples': 5024640, 'steps': 26169, 'loss/train': 1.5945308208465576} 11/07/2021 00:53:55 - INFO - __main__ - Step 26171: {'lr': 0.0004678121359038894, 'samples': 5024832, 'steps': 26170, 'loss/train': 2.121659994125366} 11/07/2021 00:53:56 - INFO - __main__ - Step 26172: {'lr': 0.0004678095310812347, 'samples': 5025024, 'steps': 26171, 'loss/train': 1.6644989252090454} 11/07/2021 00:53:56 - INFO - __main__ - Step 26173: {'lr': 0.0004678069261604384, 'samples': 5025216, 'steps': 26172, 'loss/train': 1.843034029006958} 11/07/2021 00:53:56 - INFO - __main__ - Step 26174: {'lr': 0.00046780432114150173, 'samples': 5025408, 'steps': 26173, 'loss/train': 1.1142686605453491} 11/07/2021 00:53:57 - INFO - __main__ - Step 26175: {'lr': 0.0004678017160244258, 'samples': 5025600, 'steps': 26174, 'loss/train': 1.3084547519683838} 11/07/2021 00:53:58 - INFO - __main__ - Step 26176: {'lr': 0.00046779911080921166, 'samples': 5025792, 'steps': 26175, 'loss/train': 1.2665361166000366} 11/07/2021 00:53:58 - INFO - __main__ - Step 26177: {'lr': 0.00046779650549586075, 'samples': 5025984, 'steps': 26176, 'loss/train': 1.6010990142822266} 11/07/2021 00:53:59 - INFO - __main__ - Step 26178: {'lr': 0.000467793900084374, 'samples': 5026176, 'steps': 26177, 'loss/train': 1.8408987522125244} 11/07/2021 00:53:59 - INFO - __main__ - Step 26179: {'lr': 0.0004677912945747527, 'samples': 5026368, 'steps': 26178, 'loss/train': 1.700212001800537} 11/07/2021 00:53:59 - INFO - __main__ - Step 26180: {'lr': 0.000467788688966998, 'samples': 5026560, 'steps': 26179, 'loss/train': 1.9452217817306519} 11/07/2021 00:54:00 - INFO - __main__ - Step 26181: {'lr': 0.00046778608326111104, 'samples': 5026752, 'steps': 26180, 'loss/train': 0.5576347708702087} 11/07/2021 00:54:01 - INFO - __main__ - Step 26182: {'lr': 0.00046778347745709317, 'samples': 5026944, 'steps': 26181, 'loss/train': 1.705079197883606} 11/07/2021 00:54:01 - INFO - __main__ - Step 26183: {'lr': 0.0004677808715549453, 'samples': 5027136, 'steps': 26182, 'loss/train': 1.4773807525634766} 11/07/2021 00:54:02 - INFO - __main__ - Step 26184: {'lr': 0.0004677782655546687, 'samples': 5027328, 'steps': 26183, 'loss/train': 1.2588014602661133} 11/07/2021 00:54:02 - INFO - __main__ - Step 26185: {'lr': 0.00046777565945626463, 'samples': 5027520, 'steps': 26184, 'loss/train': 1.3694013357162476} 11/07/2021 00:54:02 - INFO - __main__ - Step 26186: {'lr': 0.0004677730532597343, 'samples': 5027712, 'steps': 26185, 'loss/train': 0.18336506187915802} 11/07/2021 00:54:03 - INFO - __main__ - Step 26187: {'lr': 0.00046777044696507867, 'samples': 5027904, 'steps': 26186, 'loss/train': 1.2982537746429443} 11/07/2021 00:54:04 - INFO - __main__ - Step 26188: {'lr': 0.00046776784057229906, 'samples': 5028096, 'steps': 26187, 'loss/train': 1.6988948583602905} 11/07/2021 00:54:04 - INFO - __main__ - Step 26189: {'lr': 0.00046776523408139666, 'samples': 5028288, 'steps': 26188, 'loss/train': 1.8552799224853516} 11/07/2021 00:54:04 - INFO - __main__ - Step 26190: {'lr': 0.0004677626274923726, 'samples': 5028480, 'steps': 26189, 'loss/train': 1.4913320541381836} 11/07/2021 00:54:05 - INFO - __main__ - Step 26191: {'lr': 0.000467760020805228, 'samples': 5028672, 'steps': 26190, 'loss/train': 1.6391502618789673} 11/07/2021 00:54:06 - INFO - __main__ - Step 26192: {'lr': 0.0004677574140199642, 'samples': 5028864, 'steps': 26191, 'loss/train': 1.589673638343811} 11/07/2021 00:54:06 - INFO - __main__ - Step 26193: {'lr': 0.00046775480713658215, 'samples': 5029056, 'steps': 26192, 'loss/train': 1.6726784706115723} 11/07/2021 00:54:06 - INFO - __main__ - Step 26194: {'lr': 0.00046775220015508325, 'samples': 5029248, 'steps': 26193, 'loss/train': 1.3490551710128784} 11/07/2021 00:54:07 - INFO - __main__ - Step 26195: {'lr': 0.0004677495930754685, 'samples': 5029440, 'steps': 26194, 'loss/train': 1.2924178838729858} 11/07/2021 00:54:07 - INFO - __main__ - Step 26196: {'lr': 0.0004677469858977391, 'samples': 5029632, 'steps': 26195, 'loss/train': 2.0205130577087402} 11/07/2021 00:54:08 - INFO - __main__ - Step 26197: {'lr': 0.00046774437862189634, 'samples': 5029824, 'steps': 26196, 'loss/train': 1.8864357471466064} 11/07/2021 00:54:09 - INFO - __main__ - Step 26198: {'lr': 0.00046774177124794136, 'samples': 5030016, 'steps': 26197, 'loss/train': 1.3467926979064941} 11/07/2021 00:54:09 - INFO - __main__ - Step 26199: {'lr': 0.00046773916377587524, 'samples': 5030208, 'steps': 26198, 'loss/train': 1.4463247060775757} 11/07/2021 00:54:09 - INFO - __main__ - Step 26200: {'lr': 0.00046773655620569924, 'samples': 5030400, 'steps': 26199, 'loss/train': 1.1757603883743286} 11/07/2021 00:54:10 - INFO - __main__ - Step 26201: {'lr': 0.0004677339485374145, 'samples': 5030592, 'steps': 26200, 'loss/train': 1.4346240758895874} 11/07/2021 00:54:11 - INFO - __main__ - Step 26202: {'lr': 0.00046773134077102217, 'samples': 5030784, 'steps': 26201, 'loss/train': 1.732772707939148} 11/07/2021 00:54:11 - INFO - __main__ - Step 26203: {'lr': 0.00046772873290652344, 'samples': 5030976, 'steps': 26202, 'loss/train': 1.3843315839767456} 11/07/2021 00:54:11 - INFO - __main__ - Step 26204: {'lr': 0.0004677261249439196, 'samples': 5031168, 'steps': 26203, 'loss/train': 1.8565138578414917} 11/07/2021 00:54:12 - INFO - __main__ - Step 26205: {'lr': 0.0004677235168832117, 'samples': 5031360, 'steps': 26204, 'loss/train': 1.6033073663711548} 11/07/2021 00:54:12 - INFO - __main__ - Step 26206: {'lr': 0.0004677209087244009, 'samples': 5031552, 'steps': 26205, 'loss/train': 1.8159793615341187} 11/07/2021 00:54:13 - INFO - __main__ - Step 26207: {'lr': 0.0004677183004674884, 'samples': 5031744, 'steps': 26206, 'loss/train': 1.5248264074325562} 11/07/2021 00:54:13 - INFO - __main__ - Step 26208: {'lr': 0.00046771569211247546, 'samples': 5031936, 'steps': 26207, 'loss/train': 1.6858869791030884} 11/07/2021 00:54:14 - INFO - __main__ - Step 26209: {'lr': 0.00046771308365936315, 'samples': 5032128, 'steps': 26208, 'loss/train': 1.5982725620269775} 11/07/2021 00:54:14 - INFO - __main__ - Step 26210: {'lr': 0.00046771047510815267, 'samples': 5032320, 'steps': 26209, 'loss/train': 1.8095290660858154} 11/07/2021 00:54:14 - INFO - __main__ - Step 26211: {'lr': 0.0004677078664588452, 'samples': 5032512, 'steps': 26210, 'loss/train': 1.5817720890045166} 11/07/2021 00:54:16 - INFO - __main__ - Step 26212: {'lr': 0.000467705257711442, 'samples': 5032704, 'steps': 26211, 'loss/train': 1.801270842552185} 11/07/2021 00:54:16 - INFO - __main__ - Step 26213: {'lr': 0.0004677026488659441, 'samples': 5032896, 'steps': 26212, 'loss/train': 1.433125376701355} 11/07/2021 00:54:16 - INFO - __main__ - Step 26214: {'lr': 0.0004677000399223528, 'samples': 5033088, 'steps': 26213, 'loss/train': 1.7132267951965332} 11/07/2021 00:54:17 - INFO - __main__ - Step 26215: {'lr': 0.0004676974308806692, 'samples': 5033280, 'steps': 26214, 'loss/train': 1.7332468032836914} 11/07/2021 00:54:17 - INFO - __main__ - Step 26216: {'lr': 0.00046769482174089446, 'samples': 5033472, 'steps': 26215, 'loss/train': 1.8117945194244385} 11/07/2021 00:54:17 - INFO - __main__ - Step 26217: {'lr': 0.00046769221250302984, 'samples': 5033664, 'steps': 26216, 'loss/train': 1.13893723487854} 11/07/2021 00:54:18 - INFO - __main__ - Step 26218: {'lr': 0.0004676896031670764, 'samples': 5033856, 'steps': 26217, 'loss/train': 1.3056902885437012} 11/07/2021 00:54:19 - INFO - __main__ - Step 26219: {'lr': 0.00046768699373303546, 'samples': 5034048, 'steps': 26218, 'loss/train': 2.4533538818359375} 11/07/2021 00:54:19 - INFO - __main__ - Step 26220: {'lr': 0.00046768438420090807, 'samples': 5034240, 'steps': 26219, 'loss/train': 1.8676079511642456} 11/07/2021 00:54:19 - INFO - __main__ - Step 26221: {'lr': 0.0004676817745706955, 'samples': 5034432, 'steps': 26220, 'loss/train': 1.638798475265503} 11/07/2021 00:54:20 - INFO - __main__ - Step 26222: {'lr': 0.0004676791648423989, 'samples': 5034624, 'steps': 26221, 'loss/train': 0.4636261761188507} 11/07/2021 00:54:21 - INFO - __main__ - Step 26223: {'lr': 0.00046767655501601935, 'samples': 5034816, 'steps': 26222, 'loss/train': 1.136460542678833} 11/07/2021 00:54:21 - INFO - __main__ - Step 26224: {'lr': 0.0004676739450915581, 'samples': 5035008, 'steps': 26223, 'loss/train': 1.7114912271499634} 11/07/2021 00:54:22 - INFO - __main__ - Step 26225: {'lr': 0.0004676713350690164, 'samples': 5035200, 'steps': 26224, 'loss/train': 1.770675778388977} 11/07/2021 00:54:22 - INFO - __main__ - Step 26226: {'lr': 0.0004676687249483953, 'samples': 5035392, 'steps': 26225, 'loss/train': 1.5932910442352295} 11/07/2021 00:54:22 - INFO - __main__ - Step 26227: {'lr': 0.0004676661147296961, 'samples': 5035584, 'steps': 26226, 'loss/train': 1.6483561992645264} 11/07/2021 00:54:23 - INFO - __main__ - Step 26228: {'lr': 0.00046766350441291985, 'samples': 5035776, 'steps': 26227, 'loss/train': 1.5081048011779785} 11/07/2021 00:54:24 - INFO - __main__ - Step 26229: {'lr': 0.00046766089399806775, 'samples': 5035968, 'steps': 26228, 'loss/train': 1.5878859758377075} 11/07/2021 00:54:24 - INFO - __main__ - Step 26230: {'lr': 0.0004676582834851411, 'samples': 5036160, 'steps': 26229, 'loss/train': 1.6839022636413574} 11/07/2021 00:54:24 - INFO - __main__ - Step 26231: {'lr': 0.0004676556728741409, 'samples': 5036352, 'steps': 26230, 'loss/train': 1.9797955751419067} 11/07/2021 00:54:25 - INFO - __main__ - Step 26232: {'lr': 0.0004676530621650685, 'samples': 5036544, 'steps': 26231, 'loss/train': 1.775161862373352} 11/07/2021 00:54:25 - INFO - __main__ - Step 26233: {'lr': 0.00046765045135792495, 'samples': 5036736, 'steps': 26232, 'loss/train': 1.5758485794067383} 11/07/2021 00:54:26 - INFO - __main__ - Step 26234: {'lr': 0.00046764784045271146, 'samples': 5036928, 'steps': 26233, 'loss/train': 1.1304795742034912} 11/07/2021 00:54:26 - INFO - __main__ - Step 26235: {'lr': 0.0004676452294494292, 'samples': 5037120, 'steps': 26234, 'loss/train': 1.6500743627548218} 11/07/2021 00:54:27 - INFO - __main__ - Step 26236: {'lr': 0.00046764261834807944, 'samples': 5037312, 'steps': 26235, 'loss/train': 1.3043909072875977} 11/07/2021 00:54:27 - INFO - __main__ - Step 26237: {'lr': 0.0004676400071486632, 'samples': 5037504, 'steps': 26236, 'loss/train': 1.1312769651412964} 11/07/2021 00:54:27 - INFO - __main__ - Step 26238: {'lr': 0.0004676373958511817, 'samples': 5037696, 'steps': 26237, 'loss/train': 1.7899938821792603} 11/07/2021 00:54:28 - INFO - __main__ - Step 26239: {'lr': 0.00046763478445563617, 'samples': 5037888, 'steps': 26238, 'loss/train': 1.2193043231964111} 11/07/2021 00:54:29 - INFO - __main__ - Step 26240: {'lr': 0.0004676321729620278, 'samples': 5038080, 'steps': 26239, 'loss/train': 1.5249736309051514} 11/07/2021 00:54:29 - INFO - __main__ - Step 26241: {'lr': 0.0004676295613703577, 'samples': 5038272, 'steps': 26240, 'loss/train': 1.710008978843689} 11/07/2021 00:54:30 - INFO - __main__ - Step 26242: {'lr': 0.00046762694968062706, 'samples': 5038464, 'steps': 26241, 'loss/train': 1.24952232837677} 11/07/2021 00:54:30 - INFO - __main__ - Step 26243: {'lr': 0.0004676243378928371, 'samples': 5038656, 'steps': 26242, 'loss/train': 2.338625431060791} 11/07/2021 00:54:31 - INFO - __main__ - Step 26244: {'lr': 0.000467621726006989, 'samples': 5038848, 'steps': 26243, 'loss/train': 1.184955358505249} 11/07/2021 00:54:31 - INFO - __main__ - Step 26245: {'lr': 0.0004676191140230839, 'samples': 5039040, 'steps': 26244, 'loss/train': 1.1319618225097656} 11/07/2021 00:54:32 - INFO - __main__ - Step 26246: {'lr': 0.0004676165019411229, 'samples': 5039232, 'steps': 26245, 'loss/train': 1.7131967544555664} 11/07/2021 00:54:32 - INFO - __main__ - Step 26247: {'lr': 0.00046761388976110737, 'samples': 5039424, 'steps': 26246, 'loss/train': 1.0271192789077759} 11/07/2021 00:54:32 - INFO - __main__ - Step 26248: {'lr': 0.00046761127748303833, 'samples': 5039616, 'steps': 26247, 'loss/train': 1.5839369297027588} 11/07/2021 00:54:33 - INFO - __main__ - Step 26249: {'lr': 0.000467608665106917, 'samples': 5039808, 'steps': 26248, 'loss/train': 1.4229629039764404} 11/07/2021 00:54:34 - INFO - __main__ - Step 26250: {'lr': 0.0004676060526327446, 'samples': 5040000, 'steps': 26249, 'loss/train': 1.781359314918518} 11/07/2021 00:54:34 - INFO - __main__ - Step 26251: {'lr': 0.00046760344006052223, 'samples': 5040192, 'steps': 26250, 'loss/train': 1.8070279359817505} 11/07/2021 00:54:34 - INFO - __main__ - Step 26252: {'lr': 0.00046760082739025113, 'samples': 5040384, 'steps': 26251, 'loss/train': 1.6231582164764404} 11/07/2021 00:54:35 - INFO - __main__ - Step 26253: {'lr': 0.0004675982146219324, 'samples': 5040576, 'steps': 26252, 'loss/train': 1.6634784936904907} 11/07/2021 00:54:36 - INFO - __main__ - Step 26254: {'lr': 0.00046759560175556737, 'samples': 5040768, 'steps': 26253, 'loss/train': 1.4208816289901733} 11/07/2021 00:54:36 - INFO - __main__ - Step 26255: {'lr': 0.0004675929887911571, 'samples': 5040960, 'steps': 26254, 'loss/train': 1.548449993133545} 11/07/2021 00:54:36 - INFO - __main__ - Step 26256: {'lr': 0.0004675903757287027, 'samples': 5041152, 'steps': 26255, 'loss/train': 1.503303050994873} 11/07/2021 00:54:37 - INFO - __main__ - Step 26257: {'lr': 0.0004675877625682055, 'samples': 5041344, 'steps': 26256, 'loss/train': 1.5762418508529663} 11/07/2021 00:54:37 - INFO - __main__ - Step 26258: {'lr': 0.00046758514930966664, 'samples': 5041536, 'steps': 26257, 'loss/train': 1.1812611818313599} 11/07/2021 00:54:38 - INFO - __main__ - Step 26259: {'lr': 0.0004675825359530872, 'samples': 5041728, 'steps': 26258, 'loss/train': 1.4623467922210693} 11/07/2021 00:54:38 - INFO - __main__ - Step 26260: {'lr': 0.0004675799224984685, 'samples': 5041920, 'steps': 26259, 'loss/train': 0.653400182723999} 11/07/2021 00:54:39 - INFO - __main__ - Step 26261: {'lr': 0.00046757730894581164, 'samples': 5042112, 'steps': 26260, 'loss/train': 1.5049389600753784} 11/07/2021 00:54:39 - INFO - __main__ - Step 26262: {'lr': 0.00046757469529511777, 'samples': 5042304, 'steps': 26261, 'loss/train': 1.2047209739685059} 11/07/2021 00:54:40 - INFO - __main__ - Step 26263: {'lr': 0.0004675720815463881, 'samples': 5042496, 'steps': 26262, 'loss/train': 1.4288592338562012} 11/07/2021 00:54:41 - INFO - __main__ - Step 26264: {'lr': 0.00046756946769962375, 'samples': 5042688, 'steps': 26263, 'loss/train': 1.3080549240112305} 11/07/2021 00:54:41 - INFO - __main__ - Step 26265: {'lr': 0.000467566853754826, 'samples': 5042880, 'steps': 26264, 'loss/train': 1.5261808633804321} 11/07/2021 00:54:41 - INFO - __main__ - Step 26266: {'lr': 0.00046756423971199603, 'samples': 5043072, 'steps': 26265, 'loss/train': 1.3504849672317505} 11/07/2021 00:54:42 - INFO - __main__ - Step 26267: {'lr': 0.0004675616255711349, 'samples': 5043264, 'steps': 26266, 'loss/train': 1.8957056999206543} 11/07/2021 00:54:42 - INFO - __main__ - Step 26268: {'lr': 0.0004675590113322439, 'samples': 5043456, 'steps': 26267, 'loss/train': 2.033751964569092} 11/07/2021 00:54:43 - INFO - __main__ - Step 26269: {'lr': 0.00046755639699532414, 'samples': 5043648, 'steps': 26268, 'loss/train': 1.4890309572219849} 11/07/2021 00:54:43 - INFO - __main__ - Step 26270: {'lr': 0.00046755378256037685, 'samples': 5043840, 'steps': 26269, 'loss/train': 1.3406038284301758} 11/07/2021 00:54:44 - INFO - __main__ - Step 26271: {'lr': 0.00046755116802740316, 'samples': 5044032, 'steps': 26270, 'loss/train': 1.4244365692138672} 11/07/2021 00:54:44 - INFO - __main__ - Step 26272: {'lr': 0.00046754855339640436, 'samples': 5044224, 'steps': 26271, 'loss/train': 1.261167287826538} 11/07/2021 00:54:44 - INFO - __main__ - Step 26273: {'lr': 0.00046754593866738144, 'samples': 5044416, 'steps': 26272, 'loss/train': 1.2437100410461426} 11/07/2021 00:54:45 - INFO - __main__ - Step 26274: {'lr': 0.0004675433238403357, 'samples': 5044608, 'steps': 26273, 'loss/train': 1.7398239374160767} 11/07/2021 00:54:46 - INFO - __main__ - Step 26275: {'lr': 0.0004675407089152683, 'samples': 5044800, 'steps': 26274, 'loss/train': 1.3467254638671875} 11/07/2021 00:54:46 - INFO - __main__ - Step 26276: {'lr': 0.00046753809389218036, 'samples': 5044992, 'steps': 26275, 'loss/train': 1.356188178062439} 11/07/2021 00:54:46 - INFO - __main__ - Step 26277: {'lr': 0.0004675354787710732, 'samples': 5045184, 'steps': 26276, 'loss/train': 1.1773614883422852} 11/07/2021 00:54:47 - INFO - __main__ - Step 26278: {'lr': 0.0004675328635519479, 'samples': 5045376, 'steps': 26277, 'loss/train': 1.1977851390838623} 11/07/2021 00:54:47 - INFO - __main__ - Step 26279: {'lr': 0.0004675302482348056, 'samples': 5045568, 'steps': 26278, 'loss/train': 1.286010980606079} 11/07/2021 00:54:48 - INFO - __main__ - Step 26280: {'lr': 0.00046752763281964757, 'samples': 5045760, 'steps': 26279, 'loss/train': 1.3089194297790527} 11/07/2021 00:54:49 - INFO - __main__ - Step 26281: {'lr': 0.0004675250173064749, 'samples': 5045952, 'steps': 26280, 'loss/train': 1.6992191076278687} 11/07/2021 00:54:49 - INFO - __main__ - Step 26282: {'lr': 0.0004675224016952888, 'samples': 5046144, 'steps': 26281, 'loss/train': 1.4516578912734985} 11/07/2021 00:54:49 - INFO - __main__ - Step 26283: {'lr': 0.00046751978598609056, 'samples': 5046336, 'steps': 26282, 'loss/train': 1.43919837474823} 11/07/2021 00:54:50 - INFO - __main__ - Step 26284: {'lr': 0.00046751717017888116, 'samples': 5046528, 'steps': 26283, 'loss/train': 1.5895462036132812} 11/07/2021 00:54:51 - INFO - __main__ - Step 26285: {'lr': 0.00046751455427366194, 'samples': 5046720, 'steps': 26284, 'loss/train': 0.6971876621246338} 11/07/2021 00:54:51 - INFO - __main__ - Step 26286: {'lr': 0.00046751193827043405, 'samples': 5046912, 'steps': 26285, 'loss/train': 1.4004305601119995} 11/07/2021 00:54:51 - INFO - __main__ - Step 26287: {'lr': 0.0004675093221691985, 'samples': 5047104, 'steps': 26286, 'loss/train': 1.6197659969329834} 11/07/2021 00:54:52 - INFO - __main__ - Step 26288: {'lr': 0.0004675067059699567, 'samples': 5047296, 'steps': 26287, 'loss/train': 1.8886771202087402} 11/07/2021 00:54:52 - INFO - __main__ - Step 26289: {'lr': 0.00046750408967270973, 'samples': 5047488, 'steps': 26288, 'loss/train': 1.566393256187439} 11/07/2021 00:54:53 - INFO - __main__ - Step 26290: {'lr': 0.0004675014732774588, 'samples': 5047680, 'steps': 26289, 'loss/train': 1.660553216934204} 11/07/2021 00:54:54 - INFO - __main__ - Step 26291: {'lr': 0.000467498856784205, 'samples': 5047872, 'steps': 26290, 'loss/train': 1.9005926847457886} 11/07/2021 00:54:54 - INFO - __main__ - Step 26292: {'lr': 0.0004674962401929496, 'samples': 5048064, 'steps': 26291, 'loss/train': 0.9089964628219604} 11/07/2021 00:54:54 - INFO - __main__ - Step 26293: {'lr': 0.0004674936235036938, 'samples': 5048256, 'steps': 26292, 'loss/train': 1.5952370166778564} 11/07/2021 00:54:55 - INFO - __main__ - Step 26294: {'lr': 0.00046749100671643866, 'samples': 5048448, 'steps': 26293, 'loss/train': 1.6620838642120361} 11/07/2021 00:54:56 - INFO - __main__ - Step 26295: {'lr': 0.00046748838983118546, 'samples': 5048640, 'steps': 26294, 'loss/train': 1.4640512466430664} 11/07/2021 00:54:56 - INFO - __main__ - Step 26296: {'lr': 0.00046748577284793535, 'samples': 5048832, 'steps': 26295, 'loss/train': 1.491481065750122} 11/07/2021 00:54:56 - INFO - __main__ - Step 26297: {'lr': 0.00046748315576668946, 'samples': 5049024, 'steps': 26296, 'loss/train': 1.6581768989562988} 11/07/2021 00:54:57 - INFO - __main__ - Step 26298: {'lr': 0.0004674805385874491, 'samples': 5049216, 'steps': 26297, 'loss/train': 1.7091991901397705} 11/07/2021 00:54:57 - INFO - __main__ - Step 26299: {'lr': 0.0004674779213102153, 'samples': 5049408, 'steps': 26298, 'loss/train': 0.7105774283409119} 11/07/2021 00:54:58 - INFO - __main__ - Step 26300: {'lr': 0.00046747530393498934, 'samples': 5049600, 'steps': 26299, 'loss/train': 0.5618196129798889} 11/07/2021 00:54:58 - INFO - __main__ - Step 26301: {'lr': 0.0004674726864617723, 'samples': 5049792, 'steps': 26300, 'loss/train': 1.4105170965194702} 11/07/2021 00:54:59 - INFO - __main__ - Step 26302: {'lr': 0.00046747006889056556, 'samples': 5049984, 'steps': 26301, 'loss/train': 1.1258419752120972} 11/07/2021 00:54:59 - INFO - __main__ - Step 26303: {'lr': 0.00046746745122137, 'samples': 5050176, 'steps': 26302, 'loss/train': 1.513561725616455} 11/07/2021 00:55:00 - INFO - __main__ - Step 26304: {'lr': 0.000467464833454187, 'samples': 5050368, 'steps': 26303, 'loss/train': 1.7200157642364502} 11/07/2021 00:55:01 - INFO - __main__ - Step 26305: {'lr': 0.0004674622155890178, 'samples': 5050560, 'steps': 26304, 'loss/train': 1.6631202697753906} 11/07/2021 00:55:01 - INFO - __main__ - Step 26306: {'lr': 0.00046745959762586344, 'samples': 5050752, 'steps': 26305, 'loss/train': 1.888162612915039} 11/07/2021 00:55:01 - INFO - __main__ - Step 26307: {'lr': 0.0004674569795647251, 'samples': 5050944, 'steps': 26306, 'loss/train': 1.8183954954147339} 11/07/2021 00:55:02 - INFO - __main__ - Step 26308: {'lr': 0.00046745436140560397, 'samples': 5051136, 'steps': 26307, 'loss/train': 1.4250584840774536} 11/07/2021 00:55:02 - INFO - __main__ - Step 26309: {'lr': 0.00046745174314850136, 'samples': 5051328, 'steps': 26308, 'loss/train': 0.5985563397407532} 11/07/2021 00:55:03 - INFO - __main__ - Step 26310: {'lr': 0.00046744912479341826, 'samples': 5051520, 'steps': 26309, 'loss/train': 1.5755923986434937} 11/07/2021 00:55:03 - INFO - __main__ - Step 26311: {'lr': 0.00046744650634035603, 'samples': 5051712, 'steps': 26310, 'loss/train': 1.6963415145874023} 11/07/2021 00:55:04 - INFO - __main__ - Step 26312: {'lr': 0.0004674438877893157, 'samples': 5051904, 'steps': 26311, 'loss/train': 1.540200114250183} 11/07/2021 00:55:04 - INFO - __main__ - Step 26313: {'lr': 0.0004674412691402985, 'samples': 5052096, 'steps': 26312, 'loss/train': 1.536624789237976} 11/07/2021 00:55:04 - INFO - __main__ - Step 26314: {'lr': 0.00046743865039330565, 'samples': 5052288, 'steps': 26313, 'loss/train': 1.3715239763259888} 11/07/2021 00:55:05 - INFO - __main__ - Step 26315: {'lr': 0.00046743603154833827, 'samples': 5052480, 'steps': 26314, 'loss/train': 1.5983755588531494} 11/07/2021 00:55:06 - INFO - __main__ - Step 26316: {'lr': 0.00046743341260539756, 'samples': 5052672, 'steps': 26315, 'loss/train': 1.4672774076461792} 11/07/2021 00:55:06 - INFO - __main__ - Step 26317: {'lr': 0.00046743079356448476, 'samples': 5052864, 'steps': 26316, 'loss/train': 1.2478556632995605} 11/07/2021 00:55:06 - INFO - __main__ - Step 26318: {'lr': 0.000467428174425601, 'samples': 5053056, 'steps': 26317, 'loss/train': 1.39298415184021} 11/07/2021 00:55:07 - INFO - __main__ - Step 26319: {'lr': 0.0004674255551887474, 'samples': 5053248, 'steps': 26318, 'loss/train': 1.4729957580566406} 11/07/2021 00:55:07 - INFO - __main__ - Step 26320: {'lr': 0.0004674229358539253, 'samples': 5053440, 'steps': 26319, 'loss/train': 1.4828534126281738} 11/07/2021 00:55:09 - INFO - __main__ - Step 26321: {'lr': 0.0004674203164211357, 'samples': 5053632, 'steps': 26320, 'loss/train': 1.4319690465927124} 11/07/2021 00:55:09 - INFO - __main__ - Step 26322: {'lr': 0.00046741769689037985, 'samples': 5053824, 'steps': 26321, 'loss/train': 1.4296718835830688} 11/07/2021 00:55:09 - INFO - __main__ - Step 26323: {'lr': 0.0004674150772616589, 'samples': 5054016, 'steps': 26322, 'loss/train': 1.5283063650131226} 11/07/2021 00:55:10 - INFO - __main__ - Step 26324: {'lr': 0.0004674124575349742, 'samples': 5054208, 'steps': 26323, 'loss/train': 1.8238298892974854} 11/07/2021 00:55:10 - INFO - __main__ - Step 26325: {'lr': 0.00046740983771032674, 'samples': 5054400, 'steps': 26324, 'loss/train': 1.7715691328048706} 11/07/2021 00:55:11 - INFO - __main__ - Step 26326: {'lr': 0.0004674072177877178, 'samples': 5054592, 'steps': 26325, 'loss/train': 1.4993889331817627} 11/07/2021 00:55:12 - INFO - __main__ - Step 26327: {'lr': 0.0004674045977671484, 'samples': 5054784, 'steps': 26326, 'loss/train': 1.3500524759292603} 11/07/2021 00:55:12 - INFO - __main__ - Step 26328: {'lr': 0.00046740197764862, 'samples': 5054976, 'steps': 26327, 'loss/train': 1.5933117866516113} 11/07/2021 00:55:12 - INFO - __main__ - Step 26329: {'lr': 0.00046739935743213344, 'samples': 5055168, 'steps': 26328, 'loss/train': 1.496039628982544} 11/07/2021 00:55:13 - INFO - __main__ - Step 26330: {'lr': 0.00046739673711769026, 'samples': 5055360, 'steps': 26329, 'loss/train': 2.607337474822998} 11/07/2021 00:55:13 - INFO - __main__ - Step 26331: {'lr': 0.0004673941167052914, 'samples': 5055552, 'steps': 26330, 'loss/train': 1.002419114112854} 11/07/2021 00:55:13 - INFO - __main__ - Step 26332: {'lr': 0.0004673914961949381, 'samples': 5055744, 'steps': 26331, 'loss/train': 1.1333625316619873} 11/07/2021 00:55:14 - INFO - __main__ - Step 26333: {'lr': 0.0004673888755866316, 'samples': 5055936, 'steps': 26332, 'loss/train': 1.761634111404419} 11/07/2021 00:55:15 - INFO - __main__ - Step 26334: {'lr': 0.0004673862548803729, 'samples': 5056128, 'steps': 26333, 'loss/train': 1.6359180212020874} 11/07/2021 00:55:15 - INFO - __main__ - Step 26335: {'lr': 0.0004673836340761634, 'samples': 5056320, 'steps': 26334, 'loss/train': 1.7028871774673462} 11/07/2021 00:55:15 - INFO - __main__ - Step 26336: {'lr': 0.00046738101317400415, 'samples': 5056512, 'steps': 26335, 'loss/train': 1.6568629741668701} 11/07/2021 00:55:16 - INFO - __main__ - Step 26337: {'lr': 0.00046737839217389645, 'samples': 5056704, 'steps': 26336, 'loss/train': 1.5187630653381348} 11/07/2021 00:55:17 - INFO - __main__ - Step 26338: {'lr': 0.0004673757710758413, 'samples': 5056896, 'steps': 26337, 'loss/train': 1.3679804801940918} 11/07/2021 00:55:17 - INFO - __main__ - Step 26339: {'lr': 0.00046737314987984, 'samples': 5057088, 'steps': 26338, 'loss/train': 1.8160271644592285} 11/07/2021 00:55:17 - INFO - __main__ - Step 26340: {'lr': 0.0004673705285858938, 'samples': 5057280, 'steps': 26339, 'loss/train': 1.5020647048950195} 11/07/2021 00:55:18 - INFO - __main__ - Step 26341: {'lr': 0.00046736790719400373, 'samples': 5057472, 'steps': 26340, 'loss/train': 1.307187557220459} 11/07/2021 00:55:18 - INFO - __main__ - Step 26342: {'lr': 0.000467365285704171, 'samples': 5057664, 'steps': 26341, 'loss/train': 1.7790440320968628} 11/07/2021 00:55:19 - INFO - __main__ - Step 26343: {'lr': 0.00046736266411639694, 'samples': 5057856, 'steps': 26342, 'loss/train': 1.3135005235671997} 11/07/2021 00:55:20 - INFO - __main__ - Step 26344: {'lr': 0.00046736004243068255, 'samples': 5058048, 'steps': 26343, 'loss/train': 0.7145043611526489} 11/07/2021 00:55:20 - INFO - __main__ - Step 26345: {'lr': 0.00046735742064702904, 'samples': 5058240, 'steps': 26344, 'loss/train': 1.4895724058151245} 11/07/2021 00:55:20 - INFO - __main__ - Step 26346: {'lr': 0.00046735479876543765, 'samples': 5058432, 'steps': 26345, 'loss/train': 1.4524718523025513} 11/07/2021 00:55:21 - INFO - __main__ - Step 26347: {'lr': 0.00046735217678590957, 'samples': 5058624, 'steps': 26346, 'loss/train': 0.29512056708335876} 11/07/2021 00:55:22 - INFO - __main__ - Step 26348: {'lr': 0.00046734955470844594, 'samples': 5058816, 'steps': 26347, 'loss/train': 1.5119823217391968} 11/07/2021 00:55:22 - INFO - __main__ - Step 26349: {'lr': 0.00046734693253304795, 'samples': 5059008, 'steps': 26348, 'loss/train': 1.7592660188674927} 11/07/2021 00:55:23 - INFO - __main__ - Step 26350: {'lr': 0.0004673443102597168, 'samples': 5059200, 'steps': 26349, 'loss/train': 1.5248686075210571} 11/07/2021 00:55:23 - INFO - __main__ - Step 26351: {'lr': 0.00046734168788845363, 'samples': 5059392, 'steps': 26350, 'loss/train': 1.859383463859558} 11/07/2021 00:55:23 - INFO - __main__ - Step 26352: {'lr': 0.00046733906541925963, 'samples': 5059584, 'steps': 26351, 'loss/train': 1.4118645191192627} 11/07/2021 00:55:24 - INFO - __main__ - Step 26353: {'lr': 0.00046733644285213604, 'samples': 5059776, 'steps': 26352, 'loss/train': 1.3316917419433594} 11/07/2021 00:55:25 - INFO - __main__ - Step 26354: {'lr': 0.00046733382018708405, 'samples': 5059968, 'steps': 26353, 'loss/train': 1.6919605731964111} 11/07/2021 00:55:25 - INFO - __main__ - Step 26355: {'lr': 0.00046733119742410476, 'samples': 5060160, 'steps': 26354, 'loss/train': 1.5889085531234741} 11/07/2021 00:55:25 - INFO - __main__ - Step 26356: {'lr': 0.0004673285745631993, 'samples': 5060352, 'steps': 26355, 'loss/train': 1.95510995388031} 11/07/2021 00:55:26 - INFO - __main__ - Step 26357: {'lr': 0.000467325951604369, 'samples': 5060544, 'steps': 26356, 'loss/train': 2.011507034301758} 11/07/2021 00:55:26 - INFO - __main__ - Step 26358: {'lr': 0.00046732332854761507, 'samples': 5060736, 'steps': 26357, 'loss/train': 1.6759419441223145} 11/07/2021 00:55:27 - INFO - __main__ - Step 26359: {'lr': 0.00046732070539293847, 'samples': 5060928, 'steps': 26358, 'loss/train': 1.350864052772522} 11/07/2021 00:55:28 - INFO - __main__ - Step 26360: {'lr': 0.0004673180821403405, 'samples': 5061120, 'steps': 26359, 'loss/train': 1.483938455581665} 11/07/2021 00:55:28 - INFO - __main__ - Step 26361: {'lr': 0.00046731545878982253, 'samples': 5061312, 'steps': 26360, 'loss/train': 2.304304361343384} 11/07/2021 00:55:28 - INFO - __main__ - Step 26362: {'lr': 0.0004673128353413854, 'samples': 5061504, 'steps': 26361, 'loss/train': 1.9996891021728516} 11/07/2021 00:55:29 - INFO - __main__ - Step 26363: {'lr': 0.00046731021179503054, 'samples': 5061696, 'steps': 26362, 'loss/train': 1.1960369348526} 11/07/2021 00:55:30 - INFO - __main__ - Step 26364: {'lr': 0.00046730758815075903, 'samples': 5061888, 'steps': 26363, 'loss/train': 1.8923099040985107} 11/07/2021 00:55:30 - INFO - __main__ - Step 26365: {'lr': 0.0004673049644085721, 'samples': 5062080, 'steps': 26364, 'loss/train': 1.914736032485962} 11/07/2021 00:55:31 - INFO - __main__ - Step 26366: {'lr': 0.00046730234056847084, 'samples': 5062272, 'steps': 26365, 'loss/train': 1.7014739513397217} 11/07/2021 00:55:31 - INFO - __main__ - Step 26367: {'lr': 0.00046729971663045654, 'samples': 5062464, 'steps': 26366, 'loss/train': 1.6129825115203857} 11/07/2021 00:55:31 - INFO - __main__ - Step 26368: {'lr': 0.00046729709259453033, 'samples': 5062656, 'steps': 26367, 'loss/train': 0.9432440996170044} 11/07/2021 00:55:32 - INFO - __main__ - Step 26369: {'lr': 0.0004672944684606934, 'samples': 5062848, 'steps': 26368, 'loss/train': 1.4223899841308594} 11/07/2021 00:55:33 - INFO - __main__ - Step 26370: {'lr': 0.000467291844228947, 'samples': 5063040, 'steps': 26369, 'loss/train': 1.867058277130127} 11/07/2021 00:55:33 - INFO - __main__ - Step 26371: {'lr': 0.00046728921989929215, 'samples': 5063232, 'steps': 26370, 'loss/train': 1.4381300210952759} 11/07/2021 00:55:33 - INFO - __main__ - Step 26372: {'lr': 0.0004672865954717301, 'samples': 5063424, 'steps': 26371, 'loss/train': 1.4920525550842285} 11/07/2021 00:55:34 - INFO - __main__ - Step 26373: {'lr': 0.00046728397094626217, 'samples': 5063616, 'steps': 26372, 'loss/train': 1.6166425943374634} 11/07/2021 00:55:34 - INFO - __main__ - Step 26374: {'lr': 0.0004672813463228894, 'samples': 5063808, 'steps': 26373, 'loss/train': 1.3579533100128174} 11/07/2021 00:55:35 - INFO - __main__ - Step 26375: {'lr': 0.00046727872160161305, 'samples': 5064000, 'steps': 26374, 'loss/train': 1.3808456659317017} 11/07/2021 00:55:35 - INFO - __main__ - Step 26376: {'lr': 0.0004672760967824342, 'samples': 5064192, 'steps': 26375, 'loss/train': 1.6732211112976074} 11/07/2021 00:55:36 - INFO - __main__ - Step 26377: {'lr': 0.0004672734718653541, 'samples': 5064384, 'steps': 26376, 'loss/train': 2.095519781112671} 11/07/2021 00:55:36 - INFO - __main__ - Step 26378: {'lr': 0.00046727084685037394, 'samples': 5064576, 'steps': 26377, 'loss/train': 1.7713539600372314} 11/07/2021 00:55:37 - INFO - __main__ - Step 26379: {'lr': 0.00046726822173749497, 'samples': 5064768, 'steps': 26378, 'loss/train': 1.264957070350647} 11/07/2021 00:55:38 - INFO - __main__ - Step 26380: {'lr': 0.0004672655965267182, 'samples': 5064960, 'steps': 26379, 'loss/train': 1.3401517868041992} 11/07/2021 00:55:38 - INFO - __main__ - Step 26381: {'lr': 0.0004672629712180448, 'samples': 5065152, 'steps': 26380, 'loss/train': 5.758746147155762} 11/07/2021 00:55:38 - INFO - __main__ - Step 26382: {'lr': 0.00046726034581147624, 'samples': 5065344, 'steps': 26381, 'loss/train': 1.748800277709961} 11/07/2021 00:55:39 - INFO - __main__ - Step 26383: {'lr': 0.0004672577203070135, 'samples': 5065536, 'steps': 26382, 'loss/train': 1.6716623306274414} 11/07/2021 00:55:39 - INFO - __main__ - Step 26384: {'lr': 0.0004672550947046577, 'samples': 5065728, 'steps': 26383, 'loss/train': 1.507440209388733} 11/07/2021 00:55:39 - INFO - __main__ - Step 26385: {'lr': 0.0004672524690044102, 'samples': 5065920, 'steps': 26384, 'loss/train': 1.4670366048812866} 11/07/2021 00:55:40 - INFO - __main__ - Step 26386: {'lr': 0.000467249843206272, 'samples': 5066112, 'steps': 26385, 'loss/train': 1.9021941423416138} 11/07/2021 00:55:41 - INFO - __main__ - Step 26387: {'lr': 0.00046724721731024446, 'samples': 5066304, 'steps': 26386, 'loss/train': 2.118704080581665} 11/07/2021 00:55:41 - INFO - __main__ - Step 26388: {'lr': 0.00046724459131632854, 'samples': 5066496, 'steps': 26387, 'loss/train': 1.3493355512619019} 11/07/2021 00:55:42 - INFO - __main__ - Step 26389: {'lr': 0.00046724196522452565, 'samples': 5066688, 'steps': 26388, 'loss/train': 0.40509405732154846} 11/07/2021 00:55:42 - INFO - __main__ - Step 26390: {'lr': 0.00046723933903483687, 'samples': 5066880, 'steps': 26389, 'loss/train': 1.8364577293395996} 11/07/2021 00:55:43 - INFO - __main__ - Step 26391: {'lr': 0.00046723671274726344, 'samples': 5067072, 'steps': 26390, 'loss/train': 1.6346997022628784} 11/07/2021 00:55:43 - INFO - __main__ - Step 26392: {'lr': 0.00046723408636180645, 'samples': 5067264, 'steps': 26391, 'loss/train': 1.1927516460418701} 11/07/2021 00:55:43 - INFO - __main__ - Step 26393: {'lr': 0.00046723145987846715, 'samples': 5067456, 'steps': 26392, 'loss/train': 1.2922899723052979} 11/07/2021 00:55:44 - INFO - __main__ - Step 26394: {'lr': 0.00046722883329724667, 'samples': 5067648, 'steps': 26393, 'loss/train': 1.5836986303329468} 11/07/2021 00:55:44 - INFO - __main__ - Step 26395: {'lr': 0.0004672262066181463, 'samples': 5067840, 'steps': 26394, 'loss/train': 1.281071662902832} 11/07/2021 00:55:45 - INFO - __main__ - Step 26396: {'lr': 0.00046722357984116717, 'samples': 5068032, 'steps': 26395, 'loss/train': 1.5925194025039673} 11/07/2021 00:55:46 - INFO - __main__ - Step 26397: {'lr': 0.0004672209529663103, 'samples': 5068224, 'steps': 26396, 'loss/train': 1.9649863243103027} 11/07/2021 00:55:46 - INFO - __main__ - Step 26398: {'lr': 0.00046721832599357717, 'samples': 5068416, 'steps': 26397, 'loss/train': 1.9178727865219116} 11/07/2021 00:55:46 - INFO - __main__ - Step 26399: {'lr': 0.00046721569892296875, 'samples': 5068608, 'steps': 26398, 'loss/train': 1.6494786739349365} 11/07/2021 00:55:47 - INFO - __main__ - Step 26400: {'lr': 0.00046721307175448626, 'samples': 5068800, 'steps': 26399, 'loss/train': 1.4572476148605347} 11/07/2021 00:55:47 - INFO - __main__ - Step 26401: {'lr': 0.000467210444488131, 'samples': 5068992, 'steps': 26400, 'loss/train': 1.1953966617584229} 11/07/2021 00:55:48 - INFO - __main__ - Step 26402: {'lr': 0.000467207817123904, 'samples': 5069184, 'steps': 26401, 'loss/train': 1.564107894897461} 11/07/2021 00:55:49 - INFO - __main__ - Step 26403: {'lr': 0.0004672051896618065, 'samples': 5069376, 'steps': 26402, 'loss/train': 1.5417752265930176} 11/07/2021 00:55:49 - INFO - __main__ - Step 26404: {'lr': 0.0004672025621018397, 'samples': 5069568, 'steps': 26403, 'loss/train': 1.4741133451461792} 11/07/2021 00:55:49 - INFO - __main__ - Step 26405: {'lr': 0.00046719993444400477, 'samples': 5069760, 'steps': 26404, 'loss/train': 1.48698890209198} 11/07/2021 00:55:50 - INFO - __main__ - Step 26406: {'lr': 0.00046719730668830293, 'samples': 5069952, 'steps': 26405, 'loss/train': 1.4376373291015625} 11/07/2021 00:55:51 - INFO - __main__ - Step 26407: {'lr': 0.0004671946788347353, 'samples': 5070144, 'steps': 26406, 'loss/train': 1.4625463485717773} 11/07/2021 00:55:51 - INFO - __main__ - Step 26408: {'lr': 0.00046719205088330317, 'samples': 5070336, 'steps': 26407, 'loss/train': 1.9800376892089844} 11/07/2021 00:55:51 - INFO - __main__ - Step 26409: {'lr': 0.0004671894228340076, 'samples': 5070528, 'steps': 26408, 'loss/train': 1.3407139778137207} 11/07/2021 00:55:52 - INFO - __main__ - Step 26410: {'lr': 0.0004671867946868499, 'samples': 5070720, 'steps': 26409, 'loss/train': 1.9088515043258667} 11/07/2021 00:55:52 - INFO - __main__ - Step 26411: {'lr': 0.000467184166441831, 'samples': 5070912, 'steps': 26410, 'loss/train': 2.3126955032348633} 11/07/2021 00:55:53 - INFO - __main__ - Step 26412: {'lr': 0.0004671815380989525, 'samples': 5071104, 'steps': 26411, 'loss/train': 1.4580514430999756} 11/07/2021 00:55:53 - INFO - __main__ - Step 26413: {'lr': 0.0004671789096582152, 'samples': 5071296, 'steps': 26412, 'loss/train': 1.5600773096084595} 11/07/2021 00:55:54 - INFO - __main__ - Step 26414: {'lr': 0.00046717628111962045, 'samples': 5071488, 'steps': 26413, 'loss/train': 1.7384356260299683} 11/07/2021 00:55:54 - INFO - __main__ - Step 26415: {'lr': 0.00046717365248316947, 'samples': 5071680, 'steps': 26414, 'loss/train': 1.0957757234573364} 11/07/2021 00:55:54 - INFO - __main__ - Step 26416: {'lr': 0.00046717102374886334, 'samples': 5071872, 'steps': 26415, 'loss/train': 0.904839813709259} 11/07/2021 00:55:55 - INFO - __main__ - Step 26417: {'lr': 0.0004671683949167033, 'samples': 5072064, 'steps': 26416, 'loss/train': 1.6241687536239624} 11/07/2021 00:55:56 - INFO - __main__ - Step 26418: {'lr': 0.0004671657659866906, 'samples': 5072256, 'steps': 26417, 'loss/train': 1.8980976343154907} 11/07/2021 00:55:56 - INFO - __main__ - Step 26419: {'lr': 0.00046716313695882626, 'samples': 5072448, 'steps': 26418, 'loss/train': 1.2110810279846191} 11/07/2021 00:55:56 - INFO - __main__ - Step 26420: {'lr': 0.00046716050783311166, 'samples': 5072640, 'steps': 26419, 'loss/train': 1.319366693496704} 11/07/2021 00:55:57 - INFO - __main__ - Step 26421: {'lr': 0.00046715787860954785, 'samples': 5072832, 'steps': 26420, 'loss/train': 1.9053850173950195} 11/07/2021 00:55:58 - INFO - __main__ - Step 26422: {'lr': 0.000467155249288136, 'samples': 5073024, 'steps': 26421, 'loss/train': 1.293556809425354} 11/07/2021 00:55:58 - INFO - __main__ - Step 26423: {'lr': 0.00046715261986887734, 'samples': 5073216, 'steps': 26422, 'loss/train': 1.701919674873352} 11/07/2021 00:55:58 - INFO - __main__ - Step 26424: {'lr': 0.0004671499903517732, 'samples': 5073408, 'steps': 26423, 'loss/train': 1.5245121717453003} 11/07/2021 00:55:59 - INFO - __main__ - Step 26425: {'lr': 0.00046714736073682453, 'samples': 5073600, 'steps': 26424, 'loss/train': 1.1771432161331177} 11/07/2021 00:55:59 - INFO - __main__ - Step 26426: {'lr': 0.00046714473102403255, 'samples': 5073792, 'steps': 26425, 'loss/train': 1.4140645265579224} 11/07/2021 00:56:00 - INFO - __main__ - Step 26427: {'lr': 0.0004671421012133986, 'samples': 5073984, 'steps': 26426, 'loss/train': 1.5135844945907593} 11/07/2021 00:56:01 - INFO - __main__ - Step 26428: {'lr': 0.00046713947130492373, 'samples': 5074176, 'steps': 26427, 'loss/train': 1.628362774848938} 11/07/2021 00:56:01 - INFO - __main__ - Step 26429: {'lr': 0.0004671368412986091, 'samples': 5074368, 'steps': 26428, 'loss/train': 1.2044190168380737} 11/07/2021 00:56:01 - INFO - __main__ - Step 26430: {'lr': 0.0004671342111944561, 'samples': 5074560, 'steps': 26429, 'loss/train': 1.845758318901062} 11/07/2021 00:56:02 - INFO - __main__ - Step 26431: {'lr': 0.00046713158099246564, 'samples': 5074752, 'steps': 26430, 'loss/train': 1.8170115947723389} 11/07/2021 00:56:02 - INFO - __main__ - Step 26432: {'lr': 0.00046712895069263917, 'samples': 5074944, 'steps': 26431, 'loss/train': 1.7663159370422363} 11/07/2021 00:56:03 - INFO - __main__ - Step 26433: {'lr': 0.00046712632029497766, 'samples': 5075136, 'steps': 26432, 'loss/train': 1.277019739151001} 11/07/2021 00:56:03 - INFO - __main__ - Step 26434: {'lr': 0.0004671236897994824, 'samples': 5075328, 'steps': 26433, 'loss/train': 1.7203088998794556} 11/07/2021 00:56:04 - INFO - __main__ - Step 26435: {'lr': 0.00046712105920615455, 'samples': 5075520, 'steps': 26434, 'loss/train': 1.7292240858078003} 11/07/2021 00:56:04 - INFO - __main__ - Step 26436: {'lr': 0.00046711842851499533, 'samples': 5075712, 'steps': 26435, 'loss/train': 1.3361963033676147} 11/07/2021 00:56:04 - INFO - __main__ - Step 26437: {'lr': 0.0004671157977260059, 'samples': 5075904, 'steps': 26436, 'loss/train': 1.5535763502120972} 11/07/2021 00:56:06 - INFO - __main__ - Step 26438: {'lr': 0.0004671131668391874, 'samples': 5076096, 'steps': 26437, 'loss/train': 1.8279119729995728} 11/07/2021 00:56:06 - INFO - __main__ - Step 26439: {'lr': 0.00046711053585454104, 'samples': 5076288, 'steps': 26438, 'loss/train': 1.0926793813705444} 11/07/2021 00:56:06 - INFO - __main__ - Step 26440: {'lr': 0.0004671079047720681, 'samples': 5076480, 'steps': 26439, 'loss/train': 4.682829856872559} 11/07/2021 00:56:07 - INFO - __main__ - Step 26441: {'lr': 0.00046710527359176957, 'samples': 5076672, 'steps': 26440, 'loss/train': 1.4131914377212524} 11/07/2021 00:56:07 - INFO - __main__ - Step 26442: {'lr': 0.0004671026423136469, 'samples': 5076864, 'steps': 26441, 'loss/train': 1.6354575157165527} 11/07/2021 00:56:08 - INFO - __main__ - Step 26443: {'lr': 0.00046710001093770107, 'samples': 5077056, 'steps': 26442, 'loss/train': 1.4998884201049805} 11/07/2021 00:56:08 - INFO - __main__ - Step 26444: {'lr': 0.0004670973794639333, 'samples': 5077248, 'steps': 26443, 'loss/train': 1.4020780324935913} 11/07/2021 00:56:09 - INFO - __main__ - Step 26445: {'lr': 0.0004670947478923447, 'samples': 5077440, 'steps': 26444, 'loss/train': 1.4827768802642822} 11/07/2021 00:56:09 - INFO - __main__ - Step 26446: {'lr': 0.00046709211622293677, 'samples': 5077632, 'steps': 26445, 'loss/train': 1.532038927078247} 11/07/2021 00:56:10 - INFO - __main__ - Step 26447: {'lr': 0.00046708948445571037, 'samples': 5077824, 'steps': 26446, 'loss/train': 1.4269554615020752} 11/07/2021 00:56:10 - INFO - __main__ - Step 26448: {'lr': 0.0004670868525906668, 'samples': 5078016, 'steps': 26447, 'loss/train': 2.0321922302246094} 11/07/2021 00:56:11 - INFO - __main__ - Step 26449: {'lr': 0.00046708422062780725, 'samples': 5078208, 'steps': 26448, 'loss/train': 1.5487173795700073} 11/07/2021 00:56:11 - INFO - __main__ - Step 26450: {'lr': 0.0004670815885671329, 'samples': 5078400, 'steps': 26449, 'loss/train': 1.6052219867706299} 11/07/2021 00:56:12 - INFO - __main__ - Step 26451: {'lr': 0.00046707895640864494, 'samples': 5078592, 'steps': 26450, 'loss/train': 1.3936687707901} 11/07/2021 00:56:12 - INFO - __main__ - Step 26452: {'lr': 0.0004670763241523446, 'samples': 5078784, 'steps': 26451, 'loss/train': 1.4854100942611694} 11/07/2021 00:56:13 - INFO - __main__ - Step 26453: {'lr': 0.00046707369179823294, 'samples': 5078976, 'steps': 26452, 'loss/train': 1.4796305894851685} 11/07/2021 00:56:13 - INFO - __main__ - Step 26454: {'lr': 0.00046707105934631123, 'samples': 5079168, 'steps': 26453, 'loss/train': 1.150570034980774} 11/07/2021 00:56:14 - INFO - __main__ - Step 26455: {'lr': 0.00046706842679658067, 'samples': 5079360, 'steps': 26454, 'loss/train': 1.8077322244644165} 11/07/2021 00:56:14 - INFO - __main__ - Step 26456: {'lr': 0.0004670657941490425, 'samples': 5079552, 'steps': 26455, 'loss/train': 1.230172038078308} 11/07/2021 00:56:14 - INFO - __main__ - Step 26457: {'lr': 0.00046706316140369774, 'samples': 5079744, 'steps': 26456, 'loss/train': 1.8720364570617676} 11/07/2021 00:56:15 - INFO - __main__ - Step 26458: {'lr': 0.0004670605285605477, 'samples': 5079936, 'steps': 26457, 'loss/train': 1.4917019605636597} 11/07/2021 00:56:16 - INFO - __main__ - Step 26459: {'lr': 0.0004670578956195935, 'samples': 5080128, 'steps': 26458, 'loss/train': 1.4443360567092896} 11/07/2021 00:56:16 - INFO - __main__ - Step 26460: {'lr': 0.00046705526258083643, 'samples': 5080320, 'steps': 26459, 'loss/train': 1.5752904415130615} 11/07/2021 00:56:16 - INFO - __main__ - Step 26461: {'lr': 0.0004670526294442775, 'samples': 5080512, 'steps': 26460, 'loss/train': 1.234046459197998} 11/07/2021 00:56:17 - INFO - __main__ - Step 26462: {'lr': 0.0004670499962099181, 'samples': 5080704, 'steps': 26461, 'loss/train': 1.3155179023742676} 11/07/2021 00:56:17 - INFO - __main__ - Step 26463: {'lr': 0.0004670473628777593, 'samples': 5080896, 'steps': 26462, 'loss/train': 1.976694107055664} 11/07/2021 00:56:18 - INFO - __main__ - Step 26464: {'lr': 0.0004670447294478023, 'samples': 5081088, 'steps': 26463, 'loss/train': 1.5602601766586304} 11/07/2021 00:56:18 - INFO - __main__ - Step 26465: {'lr': 0.0004670420959200483, 'samples': 5081280, 'steps': 26464, 'loss/train': 1.194366693496704} 11/07/2021 00:56:19 - INFO - __main__ - Step 26466: {'lr': 0.00046703946229449846, 'samples': 5081472, 'steps': 26465, 'loss/train': 1.8380380868911743} 11/07/2021 00:56:19 - INFO - __main__ - Step 26467: {'lr': 0.00046703682857115406, 'samples': 5081664, 'steps': 26466, 'loss/train': 1.755829095840454} 11/07/2021 00:56:19 - INFO - __main__ - Step 26468: {'lr': 0.0004670341947500161, 'samples': 5081856, 'steps': 26467, 'loss/train': 0.9049766063690186} 11/07/2021 00:56:21 - INFO - __main__ - Step 26469: {'lr': 0.00046703156083108597, 'samples': 5082048, 'steps': 26468, 'loss/train': 1.4993257522583008} 11/07/2021 00:56:21 - INFO - __main__ - Step 26470: {'lr': 0.0004670289268143647, 'samples': 5082240, 'steps': 26469, 'loss/train': 1.6964185237884521} 11/07/2021 00:56:21 - INFO - __main__ - Step 26471: {'lr': 0.0004670262926998536, 'samples': 5082432, 'steps': 26470, 'loss/train': 1.3337154388427734} 11/07/2021 00:56:22 - INFO - __main__ - Step 26472: {'lr': 0.00046702365848755377, 'samples': 5082624, 'steps': 26471, 'loss/train': 1.2978936433792114} 11/07/2021 00:56:22 - INFO - __main__ - Step 26473: {'lr': 0.0004670210241774664, 'samples': 5082816, 'steps': 26472, 'loss/train': 1.5517992973327637} 11/07/2021 00:56:23 - INFO - __main__ - Step 26474: {'lr': 0.0004670183897695928, 'samples': 5083008, 'steps': 26473, 'loss/train': 1.5308436155319214} 11/07/2021 00:56:23 - INFO - __main__ - Step 26475: {'lr': 0.00046701575526393395, 'samples': 5083200, 'steps': 26474, 'loss/train': 1.3590461015701294} 11/07/2021 00:56:24 - INFO - __main__ - Step 26476: {'lr': 0.00046701312066049126, 'samples': 5083392, 'steps': 26475, 'loss/train': 1.4693495035171509} 11/07/2021 00:56:24 - INFO - __main__ - Step 26477: {'lr': 0.00046701048595926574, 'samples': 5083584, 'steps': 26476, 'loss/train': 1.6932660341262817} 11/07/2021 00:56:24 - INFO - __main__ - Step 26478: {'lr': 0.00046700785116025867, 'samples': 5083776, 'steps': 26477, 'loss/train': 1.4177215099334717} 11/07/2021 00:56:25 - INFO - __main__ - Step 26479: {'lr': 0.0004670052162634712, 'samples': 5083968, 'steps': 26478, 'loss/train': 2.0390894412994385} 11/07/2021 00:56:26 - INFO - __main__ - Step 26480: {'lr': 0.0004670025812689045, 'samples': 5084160, 'steps': 26479, 'loss/train': 1.4703865051269531} 11/07/2021 00:56:27 - INFO - __main__ - Step 26481: {'lr': 0.00046699994617655985, 'samples': 5084352, 'steps': 26480, 'loss/train': 1.6275564432144165} 11/07/2021 00:56:27 - INFO - __main__ - Step 26482: {'lr': 0.0004669973109864383, 'samples': 5084544, 'steps': 26481, 'loss/train': 0.5352272987365723} 11/07/2021 00:56:27 - INFO - __main__ - Step 26483: {'lr': 0.00046699467569854115, 'samples': 5084736, 'steps': 26482, 'loss/train': 1.8470903635025024} 11/07/2021 00:56:28 - INFO - __main__ - Step 26484: {'lr': 0.0004669920403128696, 'samples': 5084928, 'steps': 26483, 'loss/train': 1.5987228155136108} 11/07/2021 00:56:29 - INFO - __main__ - Step 26485: {'lr': 0.00046698940482942466, 'samples': 5085120, 'steps': 26484, 'loss/train': 1.9088078737258911} 11/07/2021 00:56:29 - INFO - __main__ - Step 26486: {'lr': 0.0004669867692482077, 'samples': 5085312, 'steps': 26485, 'loss/train': 1.6015602350234985} 11/07/2021 00:56:30 - INFO - __main__ - Step 26487: {'lr': 0.00046698413356921985, 'samples': 5085504, 'steps': 26486, 'loss/train': 1.708426594734192} 11/07/2021 00:56:30 - INFO - __main__ - Step 26488: {'lr': 0.00046698149779246235, 'samples': 5085696, 'steps': 26487, 'loss/train': 1.7259870767593384} 11/07/2021 00:56:30 - INFO - __main__ - Step 26489: {'lr': 0.0004669788619179363, 'samples': 5085888, 'steps': 26488, 'loss/train': 1.945603609085083} 11/07/2021 00:56:32 - INFO - __main__ - Step 26490: {'lr': 0.0004669762259456429, 'samples': 5086080, 'steps': 26489, 'loss/train': 0.9242888689041138} 11/07/2021 00:56:32 - INFO - __main__ - Step 26491: {'lr': 0.00046697358987558336, 'samples': 5086272, 'steps': 26490, 'loss/train': 2.0681726932525635} 11/07/2021 00:56:32 - INFO - __main__ - Step 26492: {'lr': 0.0004669709537077589, 'samples': 5086464, 'steps': 26491, 'loss/train': 1.5688045024871826} 11/07/2021 00:56:33 - INFO - __main__ - Step 26493: {'lr': 0.00046696831744217065, 'samples': 5086656, 'steps': 26492, 'loss/train': 1.3535728454589844} 11/07/2021 00:56:33 - INFO - __main__ - Step 26494: {'lr': 0.0004669656810788199, 'samples': 5086848, 'steps': 26493, 'loss/train': 1.0657094717025757} 11/07/2021 00:56:33 - INFO - __main__ - Step 26495: {'lr': 0.0004669630446177077, 'samples': 5087040, 'steps': 26494, 'loss/train': 1.4741947650909424} 11/07/2021 00:56:34 - INFO - __main__ - Step 26496: {'lr': 0.0004669604080588352, 'samples': 5087232, 'steps': 26495, 'loss/train': 1.8306576013565063} 11/07/2021 00:56:35 - INFO - __main__ - Step 26497: {'lr': 0.0004669577714022039, 'samples': 5087424, 'steps': 26496, 'loss/train': 1.7917108535766602} 11/07/2021 00:56:35 - INFO - __main__ - Step 26498: {'lr': 0.00046695513464781456, 'samples': 5087616, 'steps': 26497, 'loss/train': 1.7983497381210327} 11/07/2021 00:56:36 - INFO - __main__ - Step 26499: {'lr': 0.00046695249779566875, 'samples': 5087808, 'steps': 26498, 'loss/train': 1.3506072759628296} 11/07/2021 00:56:36 - INFO - __main__ - Step 26500: {'lr': 0.0004669498608457674, 'samples': 5088000, 'steps': 26499, 'loss/train': 1.5733281373977661} 11/07/2021 00:56:36 - INFO - __main__ - Step 26501: {'lr': 0.0004669472237981118, 'samples': 5088192, 'steps': 26500, 'loss/train': 1.33139169216156} 11/07/2021 00:56:37 - INFO - __main__ - Step 26502: {'lr': 0.00046694458665270315, 'samples': 5088384, 'steps': 26501, 'loss/train': 1.685796856880188} 11/07/2021 00:56:38 - INFO - __main__ - Step 26503: {'lr': 0.0004669419494095426, 'samples': 5088576, 'steps': 26502, 'loss/train': 1.5072300434112549} 11/07/2021 00:56:38 - INFO - __main__ - Step 26504: {'lr': 0.0004669393120686314, 'samples': 5088768, 'steps': 26503, 'loss/train': 1.6246660947799683} 11/07/2021 00:56:38 - INFO - __main__ - Step 26505: {'lr': 0.0004669366746299707, 'samples': 5088960, 'steps': 26504, 'loss/train': 1.5311013460159302} 11/07/2021 00:56:39 - INFO - __main__ - Step 26506: {'lr': 0.00046693403709356163, 'samples': 5089152, 'steps': 26505, 'loss/train': 1.2966697216033936} 11/07/2021 00:56:40 - INFO - __main__ - Step 26507: {'lr': 0.00046693139945940546, 'samples': 5089344, 'steps': 26506, 'loss/train': 1.573506236076355} 11/07/2021 00:56:40 - INFO - __main__ - Step 26508: {'lr': 0.0004669287617275033, 'samples': 5089536, 'steps': 26507, 'loss/train': 1.7121988534927368} 11/07/2021 00:56:41 - INFO - __main__ - Step 26509: {'lr': 0.0004669261238978564, 'samples': 5089728, 'steps': 26508, 'loss/train': 1.489428997039795} 11/07/2021 00:56:41 - INFO - __main__ - Step 26510: {'lr': 0.00046692348597046596, 'samples': 5089920, 'steps': 26509, 'loss/train': 1.5976169109344482} 11/07/2021 00:56:41 - INFO - __main__ - Step 26511: {'lr': 0.0004669208479453332, 'samples': 5090112, 'steps': 26510, 'loss/train': 1.6850656270980835} 11/07/2021 00:56:42 - INFO - __main__ - Step 26512: {'lr': 0.00046691820982245913, 'samples': 5090304, 'steps': 26511, 'loss/train': 1.530301809310913} 11/07/2021 00:56:43 - INFO - __main__ - Step 26513: {'lr': 0.00046691557160184516, 'samples': 5090496, 'steps': 26512, 'loss/train': 1.611854910850525} 11/07/2021 00:56:43 - INFO - __main__ - Step 26514: {'lr': 0.0004669129332834923, 'samples': 5090688, 'steps': 26513, 'loss/train': 1.6495177745819092} 11/07/2021 00:56:43 - INFO - __main__ - Step 26515: {'lr': 0.0004669102948674019, 'samples': 5090880, 'steps': 26514, 'loss/train': 1.213331699371338} 11/07/2021 00:56:44 - INFO - __main__ - Step 26516: {'lr': 0.000466907656353575, 'samples': 5091072, 'steps': 26515, 'loss/train': 1.4162719249725342} 11/07/2021 00:56:44 - INFO - __main__ - Step 26517: {'lr': 0.0004669050177420129, 'samples': 5091264, 'steps': 26516, 'loss/train': 1.6753724813461304} 11/07/2021 00:56:45 - INFO - __main__ - Step 26518: {'lr': 0.0004669023790327168, 'samples': 5091456, 'steps': 26517, 'loss/train': 1.3420944213867188} 11/07/2021 00:56:45 - INFO - __main__ - Step 26519: {'lr': 0.0004668997402256877, 'samples': 5091648, 'steps': 26518, 'loss/train': 1.114307165145874} 11/07/2021 00:56:46 - INFO - __main__ - Step 26520: {'lr': 0.00046689710132092704, 'samples': 5091840, 'steps': 26519, 'loss/train': 1.227500557899475} 11/07/2021 00:56:46 - INFO - __main__ - Step 26521: {'lr': 0.00046689446231843585, 'samples': 5092032, 'steps': 26520, 'loss/train': 5.975055694580078} 11/07/2021 00:56:47 - INFO - __main__ - Step 26522: {'lr': 0.0004668918232182153, 'samples': 5092224, 'steps': 26521, 'loss/train': 1.521012306213379} 11/07/2021 00:56:47 - INFO - __main__ - Step 26523: {'lr': 0.0004668891840202668, 'samples': 5092416, 'steps': 26522, 'loss/train': 1.4542280435562134} 11/07/2021 00:56:48 - INFO - __main__ - Step 26524: {'lr': 0.00046688654472459124, 'samples': 5092608, 'steps': 26523, 'loss/train': 1.7532317638397217} 11/07/2021 00:56:48 - INFO - __main__ - Step 26525: {'lr': 0.00046688390533119003, 'samples': 5092800, 'steps': 26524, 'loss/train': 1.5033717155456543} 11/07/2021 00:56:49 - INFO - __main__ - Step 26526: {'lr': 0.00046688126584006425, 'samples': 5092992, 'steps': 26525, 'loss/train': 1.1880830526351929} 11/07/2021 00:56:49 - INFO - __main__ - Step 26527: {'lr': 0.00046687862625121505, 'samples': 5093184, 'steps': 26526, 'loss/train': 1.5129117965698242} 11/07/2021 00:56:49 - INFO - __main__ - Step 26528: {'lr': 0.0004668759865646438, 'samples': 5093376, 'steps': 26527, 'loss/train': 1.808837652206421} 11/07/2021 00:56:50 - INFO - __main__ - Step 26529: {'lr': 0.00046687334678035153, 'samples': 5093568, 'steps': 26528, 'loss/train': 1.3349472284317017} 11/07/2021 00:56:51 - INFO - __main__ - Step 26530: {'lr': 0.00046687070689833943, 'samples': 5093760, 'steps': 26529, 'loss/train': 1.699949026107788} 11/07/2021 00:56:51 - INFO - __main__ - Step 26531: {'lr': 0.00046686806691860884, 'samples': 5093952, 'steps': 26530, 'loss/train': 1.6412250995635986} 11/07/2021 00:56:51 - INFO - __main__ - Step 26532: {'lr': 0.00046686542684116073, 'samples': 5094144, 'steps': 26531, 'loss/train': 1.5748305320739746} 11/07/2021 00:56:52 - INFO - __main__ - Step 26533: {'lr': 0.00046686278666599647, 'samples': 5094336, 'steps': 26532, 'loss/train': 1.6863652467727661} 11/07/2021 00:56:53 - INFO - __main__ - Step 26534: {'lr': 0.0004668601463931172, 'samples': 5094528, 'steps': 26533, 'loss/train': 1.5847936868667603} 11/07/2021 00:56:53 - INFO - __main__ - Step 26535: {'lr': 0.00046685750602252406, 'samples': 5094720, 'steps': 26534, 'loss/train': 1.4715627431869507} 11/07/2021 00:56:53 - INFO - __main__ - Step 26536: {'lr': 0.0004668548655542183, 'samples': 5094912, 'steps': 26535, 'loss/train': 1.1541719436645508} 11/07/2021 00:56:54 - INFO - __main__ - Step 26537: {'lr': 0.000466852224988201, 'samples': 5095104, 'steps': 26536, 'loss/train': 1.6187621355056763} 11/07/2021 00:56:54 - INFO - __main__ - Step 26538: {'lr': 0.00046684958432447355, 'samples': 5095296, 'steps': 26537, 'loss/train': 1.0727238655090332} 11/07/2021 00:56:55 - INFO - __main__ - Step 26539: {'lr': 0.00046684694356303693, 'samples': 5095488, 'steps': 26538, 'loss/train': 1.1763319969177246} 11/07/2021 00:56:56 - INFO - __main__ - Step 26540: {'lr': 0.0004668443027038925, 'samples': 5095680, 'steps': 26539, 'loss/train': 2.1044039726257324} 11/07/2021 00:56:56 - INFO - __main__ - Step 26541: {'lr': 0.00046684166174704134, 'samples': 5095872, 'steps': 26540, 'loss/train': 2.556725025177002} 11/07/2021 00:56:56 - INFO - __main__ - Step 26542: {'lr': 0.00046683902069248465, 'samples': 5096064, 'steps': 26541, 'loss/train': 1.3768486976623535} 11/07/2021 00:56:57 - INFO - __main__ - Step 26543: {'lr': 0.0004668363795402237, 'samples': 5096256, 'steps': 26542, 'loss/train': 1.3912994861602783} 11/07/2021 00:56:58 - INFO - __main__ - Step 26544: {'lr': 0.00046683373829025954, 'samples': 5096448, 'steps': 26543, 'loss/train': 1.4116778373718262} 11/07/2021 00:56:58 - INFO - __main__ - Step 26545: {'lr': 0.0004668310969425935, 'samples': 5096640, 'steps': 26544, 'loss/train': 1.3927422761917114} 11/07/2021 00:56:58 - INFO - __main__ - Step 26546: {'lr': 0.00046682845549722677, 'samples': 5096832, 'steps': 26545, 'loss/train': 1.552696943283081} 11/07/2021 00:56:59 - INFO - __main__ - Step 26547: {'lr': 0.0004668258139541604, 'samples': 5097024, 'steps': 26546, 'loss/train': 1.2917490005493164} 11/07/2021 00:56:59 - INFO - __main__ - Step 26548: {'lr': 0.00046682317231339565, 'samples': 5097216, 'steps': 26547, 'loss/train': 5.877769470214844} 11/07/2021 00:56:59 - INFO - __main__ - Step 26549: {'lr': 0.00046682053057493377, 'samples': 5097408, 'steps': 26548, 'loss/train': 0.9490289688110352} 11/07/2021 00:57:01 - INFO - __main__ - Step 26550: {'lr': 0.00046681788873877595, 'samples': 5097600, 'steps': 26549, 'loss/train': 1.548933982849121} 11/07/2021 00:57:01 - INFO - __main__ - Step 26551: {'lr': 0.00046681524680492327, 'samples': 5097792, 'steps': 26550, 'loss/train': 1.545667052268982} 11/07/2021 00:57:01 - INFO - __main__ - Step 26552: {'lr': 0.00046681260477337693, 'samples': 5097984, 'steps': 26551, 'loss/train': 1.4085031747817993} 11/07/2021 00:57:02 - INFO - __main__ - Step 26553: {'lr': 0.0004668099626441383, 'samples': 5098176, 'steps': 26552, 'loss/train': 1.656846523284912} 11/07/2021 00:57:02 - INFO - __main__ - Step 26554: {'lr': 0.00046680732041720836, 'samples': 5098368, 'steps': 26553, 'loss/train': 1.8511707782745361} 11/07/2021 00:57:03 - INFO - __main__ - Step 26555: {'lr': 0.0004668046780925884, 'samples': 5098560, 'steps': 26554, 'loss/train': 1.1346396207809448} 11/07/2021 00:57:03 - INFO - __main__ - Step 26556: {'lr': 0.0004668020356702796, 'samples': 5098752, 'steps': 26555, 'loss/train': 1.6512269973754883} 11/07/2021 00:57:04 - INFO - __main__ - Step 26557: {'lr': 0.0004667993931502832, 'samples': 5098944, 'steps': 26556, 'loss/train': 1.944300889968872} 11/07/2021 00:57:04 - INFO - __main__ - Step 26558: {'lr': 0.00046679675053260027, 'samples': 5099136, 'steps': 26557, 'loss/train': 1.6029387712478638} 11/07/2021 00:57:04 - INFO - __main__ - Step 26559: {'lr': 0.00046679410781723206, 'samples': 5099328, 'steps': 26558, 'loss/train': 1.526893973350525} 11/07/2021 00:57:05 - INFO - __main__ - Step 26560: {'lr': 0.0004667914650041799, 'samples': 5099520, 'steps': 26559, 'loss/train': 1.6373034715652466} 11/07/2021 00:57:06 - INFO - __main__ - Step 26561: {'lr': 0.00046678882209344474, 'samples': 5099712, 'steps': 26560, 'loss/train': 1.3498308658599854} 11/07/2021 00:57:06 - INFO - __main__ - Step 26562: {'lr': 0.00046678617908502785, 'samples': 5099904, 'steps': 26561, 'loss/train': 1.223779559135437} 11/07/2021 00:57:06 - INFO - __main__ - Step 26563: {'lr': 0.00046678353597893053, 'samples': 5100096, 'steps': 26562, 'loss/train': 1.7865734100341797} 11/07/2021 00:57:07 - INFO - __main__ - Step 26564: {'lr': 0.0004667808927751539, 'samples': 5100288, 'steps': 26563, 'loss/train': 1.7990397214889526} 11/07/2021 00:57:08 - INFO - __main__ - Step 26565: {'lr': 0.00046677824947369907, 'samples': 5100480, 'steps': 26564, 'loss/train': 1.9247980117797852} 11/07/2021 00:57:08 - INFO - __main__ - Step 26566: {'lr': 0.0004667756060745674, 'samples': 5100672, 'steps': 26565, 'loss/train': 1.6731843948364258} 11/07/2021 00:57:08 - INFO - __main__ - Step 26567: {'lr': 0.0004667729625777599, 'samples': 5100864, 'steps': 26566, 'loss/train': 1.8321985006332397} 11/07/2021 00:57:09 - INFO - __main__ - Step 26568: {'lr': 0.0004667703189832779, 'samples': 5101056, 'steps': 26567, 'loss/train': 1.3584908246994019} 11/07/2021 00:57:09 - INFO - __main__ - Step 26569: {'lr': 0.00046676767529112254, 'samples': 5101248, 'steps': 26568, 'loss/train': 1.4127012491226196} 11/07/2021 00:57:10 - INFO - __main__ - Step 26570: {'lr': 0.000466765031501295, 'samples': 5101440, 'steps': 26569, 'loss/train': 0.9467480182647705} 11/07/2021 00:57:10 - INFO - __main__ - Step 26571: {'lr': 0.0004667623876137965, 'samples': 5101632, 'steps': 26570, 'loss/train': 0.8069031238555908} 11/07/2021 00:57:11 - INFO - __main__ - Step 26572: {'lr': 0.00046675974362862815, 'samples': 5101824, 'steps': 26571, 'loss/train': 1.582007884979248} 11/07/2021 00:57:11 - INFO - __main__ - Step 26573: {'lr': 0.00046675709954579125, 'samples': 5102016, 'steps': 26572, 'loss/train': 1.6229231357574463} 11/07/2021 00:57:12 - INFO - __main__ - Step 26574: {'lr': 0.0004667544553652869, 'samples': 5102208, 'steps': 26573, 'loss/train': 1.5536285638809204} 11/07/2021 00:57:12 - INFO - __main__ - Step 26575: {'lr': 0.0004667518110871164, 'samples': 5102400, 'steps': 26574, 'loss/train': 1.0139391422271729} 11/07/2021 00:57:13 - INFO - __main__ - Step 26576: {'lr': 0.0004667491667112809, 'samples': 5102592, 'steps': 26575, 'loss/train': 1.362045168876648} 11/07/2021 00:57:13 - INFO - __main__ - Step 26577: {'lr': 0.0004667465222377815, 'samples': 5102784, 'steps': 26576, 'loss/train': 1.0441974401474} 11/07/2021 00:57:14 - INFO - __main__ - Step 26578: {'lr': 0.0004667438776666195, 'samples': 5102976, 'steps': 26577, 'loss/train': 1.6765360832214355} 11/07/2021 00:57:14 - INFO - __main__ - Step 26579: {'lr': 0.00046674123299779603, 'samples': 5103168, 'steps': 26578, 'loss/train': 0.872469961643219} 11/07/2021 00:57:14 - INFO - __main__ - Step 26580: {'lr': 0.0004667385882313123, 'samples': 5103360, 'steps': 26579, 'loss/train': 0.9930628538131714} 11/07/2021 00:57:15 - INFO - __main__ - Step 26581: {'lr': 0.0004667359433671695, 'samples': 5103552, 'steps': 26580, 'loss/train': 1.233246922492981} 11/07/2021 00:57:16 - INFO - __main__ - Step 26582: {'lr': 0.0004667332984053689, 'samples': 5103744, 'steps': 26581, 'loss/train': 1.1279759407043457} 11/07/2021 00:57:16 - INFO - __main__ - Step 26583: {'lr': 0.00046673065334591155, 'samples': 5103936, 'steps': 26582, 'loss/train': 1.609163522720337} 11/07/2021 00:57:16 - INFO - __main__ - Step 26584: {'lr': 0.00046672800818879873, 'samples': 5104128, 'steps': 26583, 'loss/train': 1.5642935037612915} 11/07/2021 00:57:17 - INFO - __main__ - Step 26585: {'lr': 0.0004667253629340316, 'samples': 5104320, 'steps': 26584, 'loss/train': 1.6174025535583496} 11/07/2021 00:57:18 - INFO - __main__ - Step 26586: {'lr': 0.0004667227175816114, 'samples': 5104512, 'steps': 26585, 'loss/train': 2.0503993034362793} 11/07/2021 00:57:18 - INFO - __main__ - Step 26587: {'lr': 0.0004667200721315393, 'samples': 5104704, 'steps': 26586, 'loss/train': 1.585839033126831} 11/07/2021 00:57:18 - INFO - __main__ - Step 26588: {'lr': 0.00046671742658381646, 'samples': 5104896, 'steps': 26587, 'loss/train': 1.4367401599884033} 11/07/2021 00:57:19 - INFO - __main__ - Step 26589: {'lr': 0.000466714780938444, 'samples': 5105088, 'steps': 26588, 'loss/train': 1.3386892080307007} 11/07/2021 00:57:19 - INFO - __main__ - Step 26590: {'lr': 0.0004667121351954233, 'samples': 5105280, 'steps': 26589, 'loss/train': 1.768083930015564} 11/07/2021 00:57:20 - INFO - __main__ - Step 26591: {'lr': 0.00046670948935475544, 'samples': 5105472, 'steps': 26590, 'loss/train': 1.9811300039291382} 11/07/2021 00:57:21 - INFO - __main__ - Step 26592: {'lr': 0.00046670684341644167, 'samples': 5105664, 'steps': 26591, 'loss/train': 0.7801468968391418} 11/07/2021 00:57:21 - INFO - __main__ - Step 26593: {'lr': 0.0004667041973804831, 'samples': 5105856, 'steps': 26592, 'loss/train': 0.811613142490387} 11/07/2021 00:57:22 - INFO - __main__ - Step 26594: {'lr': 0.00046670155124688096, 'samples': 5106048, 'steps': 26593, 'loss/train': 1.5572509765625} 11/07/2021 00:57:22 - INFO - __main__ - Step 26595: {'lr': 0.00046669890501563636, 'samples': 5106240, 'steps': 26594, 'loss/train': 1.732027292251587} 11/07/2021 00:57:22 - INFO - __main__ - Step 26596: {'lr': 0.0004666962586867507, 'samples': 5106432, 'steps': 26595, 'loss/train': 1.4996187686920166} 11/07/2021 00:57:23 - INFO - __main__ - Step 26597: {'lr': 0.000466693612260225, 'samples': 5106624, 'steps': 26596, 'loss/train': 1.6806252002716064} 11/07/2021 00:57:23 - INFO - __main__ - Step 26598: {'lr': 0.00046669096573606053, 'samples': 5106816, 'steps': 26597, 'loss/train': 1.095277190208435} 11/07/2021 00:57:24 - INFO - __main__ - Step 26599: {'lr': 0.00046668831911425844, 'samples': 5107008, 'steps': 26598, 'loss/train': 1.8940539360046387} 11/07/2021 00:57:24 - INFO - __main__ - Step 26600: {'lr': 0.00046668567239481994, 'samples': 5107200, 'steps': 26599, 'loss/train': 1.3034520149230957} 11/07/2021 00:57:25 - INFO - __main__ - Step 26601: {'lr': 0.0004666830255777462, 'samples': 5107392, 'steps': 26600, 'loss/train': 1.082856297492981} 11/07/2021 00:57:26 - INFO - __main__ - Step 26602: {'lr': 0.00046668037866303845, 'samples': 5107584, 'steps': 26601, 'loss/train': 1.5188623666763306} 11/07/2021 00:57:26 - INFO - __main__ - Step 26603: {'lr': 0.0004666777316506979, 'samples': 5107776, 'steps': 26602, 'loss/train': 1.660784363746643} 11/07/2021 00:57:26 - INFO - __main__ - Step 26604: {'lr': 0.00046667508454072566, 'samples': 5107968, 'steps': 26603, 'loss/train': 1.7365378141403198} 11/07/2021 00:57:27 - INFO - __main__ - Step 26605: {'lr': 0.00046667243733312296, 'samples': 5108160, 'steps': 26604, 'loss/train': 1.6284083127975464} 11/07/2021 00:57:27 - INFO - __main__ - Step 26606: {'lr': 0.000466669790027891, 'samples': 5108352, 'steps': 26605, 'loss/train': 1.1745686531066895} 11/07/2021 00:57:28 - INFO - __main__ - Step 26607: {'lr': 0.00046666714262503107, 'samples': 5108544, 'steps': 26606, 'loss/train': 1.6178178787231445} 11/07/2021 00:57:29 - INFO - __main__ - Step 26608: {'lr': 0.00046666449512454416, 'samples': 5108736, 'steps': 26607, 'loss/train': 1.1699672937393188} 11/07/2021 00:57:29 - INFO - __main__ - Step 26609: {'lr': 0.0004666618475264316, 'samples': 5108928, 'steps': 26608, 'loss/train': 1.7242342233657837} 11/07/2021 00:57:29 - INFO - __main__ - Step 26610: {'lr': 0.0004666591998306946, 'samples': 5109120, 'steps': 26609, 'loss/train': 1.758998155593872} 11/07/2021 00:57:30 - INFO - __main__ - Step 26611: {'lr': 0.0004666565520373343, 'samples': 5109312, 'steps': 26610, 'loss/train': 1.7067108154296875} 11/07/2021 00:57:31 - INFO - __main__ - Step 26612: {'lr': 0.00046665390414635184, 'samples': 5109504, 'steps': 26611, 'loss/train': 1.829398274421692} 11/07/2021 00:57:31 - INFO - __main__ - Step 26613: {'lr': 0.0004666512561577485, 'samples': 5109696, 'steps': 26612, 'loss/train': 1.667002558708191} 11/07/2021 00:57:31 - INFO - __main__ - Step 26614: {'lr': 0.0004666486080715255, 'samples': 5109888, 'steps': 26613, 'loss/train': 1.830881118774414} 11/07/2021 00:57:32 - INFO - __main__ - Step 26615: {'lr': 0.0004666459598876839, 'samples': 5110080, 'steps': 26614, 'loss/train': 0.904382050037384} 11/07/2021 00:57:32 - INFO - __main__ - Step 26616: {'lr': 0.000466643311606225, 'samples': 5110272, 'steps': 26615, 'loss/train': 1.795561671257019} 11/07/2021 00:57:33 - INFO - __main__ - Step 26617: {'lr': 0.00046664066322715006, 'samples': 5110464, 'steps': 26616, 'loss/train': 1.5816446542739868} 11/07/2021 00:57:34 - INFO - __main__ - Step 26618: {'lr': 0.00046663801475046004, 'samples': 5110656, 'steps': 26617, 'loss/train': 1.3576115369796753} 11/07/2021 00:57:34 - INFO - __main__ - Step 26619: {'lr': 0.0004666353661761563, 'samples': 5110848, 'steps': 26618, 'loss/train': 1.2115764617919922} 11/07/2021 00:57:34 - INFO - __main__ - Step 26620: {'lr': 0.0004666327175042401, 'samples': 5111040, 'steps': 26619, 'loss/train': 1.1487702131271362} 11/07/2021 00:57:35 - INFO - __main__ - Step 26621: {'lr': 0.00046663006873471247, 'samples': 5111232, 'steps': 26620, 'loss/train': 1.5510127544403076} 11/07/2021 00:57:35 - INFO - __main__ - Step 26622: {'lr': 0.00046662741986757463, 'samples': 5111424, 'steps': 26621, 'loss/train': 1.8322904109954834} 11/07/2021 00:57:36 - INFO - __main__ - Step 26623: {'lr': 0.0004666247709028279, 'samples': 5111616, 'steps': 26622, 'loss/train': 1.5032947063446045} 11/07/2021 00:57:36 - INFO - __main__ - Step 26624: {'lr': 0.00046662212184047334, 'samples': 5111808, 'steps': 26623, 'loss/train': 1.4322452545166016} 11/07/2021 00:57:37 - INFO - __main__ - Step 26625: {'lr': 0.0004666194726805122, 'samples': 5112000, 'steps': 26624, 'loss/train': 1.82273268699646} 11/07/2021 00:57:37 - INFO - __main__ - Step 26626: {'lr': 0.0004666168234229457, 'samples': 5112192, 'steps': 26625, 'loss/train': 0.9876610040664673} 11/07/2021 00:57:37 - INFO - __main__ - Step 26627: {'lr': 0.000466614174067775, 'samples': 5112384, 'steps': 26626, 'loss/train': 1.672943353652954} 11/07/2021 00:57:38 - INFO - __main__ - Step 26628: {'lr': 0.00046661152461500126, 'samples': 5112576, 'steps': 26627, 'loss/train': 1.9646693468093872} 11/07/2021 00:57:39 - INFO - __main__ - Step 26629: {'lr': 0.0004666088750646257, 'samples': 5112768, 'steps': 26628, 'loss/train': 1.5872939825057983} 11/07/2021 00:57:39 - INFO - __main__ - Step 26630: {'lr': 0.0004666062254166496, 'samples': 5112960, 'steps': 26629, 'loss/train': 1.6622138023376465} 11/07/2021 00:57:39 - INFO - __main__ - Step 26631: {'lr': 0.000466603575671074, 'samples': 5113152, 'steps': 26630, 'loss/train': 1.4871768951416016} 11/07/2021 00:57:40 - INFO - __main__ - Step 26632: {'lr': 0.00046660092582790025, 'samples': 5113344, 'steps': 26631, 'loss/train': 1.6067029237747192} 11/07/2021 00:57:41 - INFO - __main__ - Step 26633: {'lr': 0.0004665982758871294, 'samples': 5113536, 'steps': 26632, 'loss/train': 1.1848700046539307} 11/07/2021 00:57:41 - INFO - __main__ - Step 26634: {'lr': 0.0004665956258487627, 'samples': 5113728, 'steps': 26633, 'loss/train': 1.102095365524292} 11/07/2021 00:57:42 - INFO - __main__ - Step 26635: {'lr': 0.0004665929757128014, 'samples': 5113920, 'steps': 26634, 'loss/train': 1.3685040473937988} 11/07/2021 00:57:42 - INFO - __main__ - Step 26636: {'lr': 0.0004665903254792466, 'samples': 5114112, 'steps': 26635, 'loss/train': 2.0076229572296143} 11/07/2021 00:57:42 - INFO - __main__ - Step 26637: {'lr': 0.0004665876751480996, 'samples': 5114304, 'steps': 26636, 'loss/train': 1.8105500936508179} 11/07/2021 00:57:43 - INFO - __main__ - Step 26638: {'lr': 0.0004665850247193615, 'samples': 5114496, 'steps': 26637, 'loss/train': 1.4529505968093872} 11/07/2021 00:57:44 - INFO - __main__ - Step 26639: {'lr': 0.0004665823741930335, 'samples': 5114688, 'steps': 26638, 'loss/train': 1.054864764213562} 11/07/2021 00:57:44 - INFO - __main__ - Step 26640: {'lr': 0.00046657972356911696, 'samples': 5114880, 'steps': 26639, 'loss/train': 1.6424245834350586} 11/07/2021 00:57:44 - INFO - __main__ - Step 26641: {'lr': 0.00046657707284761274, 'samples': 5115072, 'steps': 26640, 'loss/train': 1.2736396789550781} 11/07/2021 00:57:45 - INFO - __main__ - Step 26642: {'lr': 0.0004665744220285224, 'samples': 5115264, 'steps': 26641, 'loss/train': 0.37159693241119385} 11/07/2021 00:57:45 - INFO - __main__ - Step 26643: {'lr': 0.0004665717711118469, 'samples': 5115456, 'steps': 26642, 'loss/train': 1.5933653116226196} 11/07/2021 00:57:46 - INFO - __main__ - Step 26644: {'lr': 0.00046656912009758743, 'samples': 5115648, 'steps': 26643, 'loss/train': 1.0128573179244995} 11/07/2021 00:57:46 - INFO - __main__ - Step 26645: {'lr': 0.0004665664689857454, 'samples': 5115840, 'steps': 26644, 'loss/train': 1.5813493728637695} 11/07/2021 00:57:47 - INFO - __main__ - Step 26646: {'lr': 0.00046656381777632173, 'samples': 5116032, 'steps': 26645, 'loss/train': 1.6319818496704102} 11/07/2021 00:57:47 - INFO - __main__ - Step 26647: {'lr': 0.0004665611664693178, 'samples': 5116224, 'steps': 26646, 'loss/train': 1.7234327793121338} 11/07/2021 00:57:48 - INFO - __main__ - Step 26648: {'lr': 0.0004665585150647348, 'samples': 5116416, 'steps': 26647, 'loss/train': 1.7393739223480225} 11/07/2021 00:57:49 - INFO - __main__ - Step 26649: {'lr': 0.0004665558635625738, 'samples': 5116608, 'steps': 26648, 'loss/train': 1.6047013998031616} 11/07/2021 00:57:49 - INFO - __main__ - Step 26650: {'lr': 0.00046655321196283604, 'samples': 5116800, 'steps': 26649, 'loss/train': 1.8685024976730347} 11/07/2021 00:57:49 - INFO - __main__ - Step 26651: {'lr': 0.00046655056026552287, 'samples': 5116992, 'steps': 26650, 'loss/train': 0.18996243178844452} 11/07/2021 00:57:50 - INFO - __main__ - Step 26652: {'lr': 0.0004665479084706353, 'samples': 5117184, 'steps': 26651, 'loss/train': 1.8232330083847046} 11/07/2021 00:57:50 - INFO - __main__ - Step 26653: {'lr': 0.00046654525657817457, 'samples': 5117376, 'steps': 26652, 'loss/train': 1.292604923248291} 11/07/2021 00:57:51 - INFO - __main__ - Step 26654: {'lr': 0.0004665426045881419, 'samples': 5117568, 'steps': 26653, 'loss/train': 1.6663436889648438} 11/07/2021 00:57:51 - INFO - __main__ - Step 26655: {'lr': 0.00046653995250053843, 'samples': 5117760, 'steps': 26654, 'loss/train': 1.220912218093872} 11/07/2021 00:57:52 - INFO - __main__ - Step 26656: {'lr': 0.00046653730031536545, 'samples': 5117952, 'steps': 26655, 'loss/train': 1.6533703804016113} 11/07/2021 00:57:52 - INFO - __main__ - Step 26657: {'lr': 0.0004665346480326241, 'samples': 5118144, 'steps': 26656, 'loss/train': 1.6312495470046997} 11/07/2021 00:57:53 - INFO - __main__ - Step 26658: {'lr': 0.00046653199565231554, 'samples': 5118336, 'steps': 26657, 'loss/train': 1.40569007396698} 11/07/2021 00:57:54 - INFO - __main__ - Step 26659: {'lr': 0.00046652934317444104, 'samples': 5118528, 'steps': 26658, 'loss/train': 1.6105440855026245} 11/07/2021 00:57:54 - INFO - __main__ - Step 26660: {'lr': 0.00046652669059900174, 'samples': 5118720, 'steps': 26659, 'loss/train': 1.2899856567382812} 11/07/2021 00:57:54 - INFO - __main__ - Step 26661: {'lr': 0.0004665240379259989, 'samples': 5118912, 'steps': 26660, 'loss/train': 1.2735904455184937} 11/07/2021 00:57:55 - INFO - __main__ - Step 26662: {'lr': 0.00046652138515543366, 'samples': 5119104, 'steps': 26661, 'loss/train': 1.6175771951675415} 11/07/2021 00:57:55 - INFO - __main__ - Step 26663: {'lr': 0.00046651873228730715, 'samples': 5119296, 'steps': 26662, 'loss/train': 1.0031540393829346} 11/07/2021 00:57:56 - INFO - __main__ - Step 26664: {'lr': 0.0004665160793216207, 'samples': 5119488, 'steps': 26663, 'loss/train': 1.4231845140457153} 11/07/2021 00:57:57 - INFO - __main__ - Step 26665: {'lr': 0.00046651342625837544, 'samples': 5119680, 'steps': 26664, 'loss/train': 1.0525844097137451} 11/07/2021 00:57:57 - INFO - __main__ - Step 26666: {'lr': 0.00046651077309757256, 'samples': 5119872, 'steps': 26665, 'loss/train': 0.7705839276313782} 11/07/2021 00:57:57 - INFO - __main__ - Step 26667: {'lr': 0.0004665081198392133, 'samples': 5120064, 'steps': 26666, 'loss/train': 1.1075023412704468} 11/07/2021 00:57:58 - INFO - __main__ - Step 26668: {'lr': 0.0004665054664832988, 'samples': 5120256, 'steps': 26667, 'loss/train': 1.461146354675293} 11/07/2021 00:57:58 - INFO - __main__ - Step 26669: {'lr': 0.00046650281302983024, 'samples': 5120448, 'steps': 26668, 'loss/train': 1.7595139741897583} 11/07/2021 00:57:59 - INFO - __main__ - Step 26670: {'lr': 0.00046650015947880886, 'samples': 5120640, 'steps': 26669, 'loss/train': 1.0714360475540161} 11/07/2021 00:58:00 - INFO - __main__ - Step 26671: {'lr': 0.00046649750583023595, 'samples': 5120832, 'steps': 26670, 'loss/train': 1.328011155128479} 11/07/2021 00:58:00 - INFO - __main__ - Step 26672: {'lr': 0.00046649485208411244, 'samples': 5121024, 'steps': 26671, 'loss/train': 1.5091999769210815} 11/07/2021 00:58:00 - INFO - __main__ - Step 26673: {'lr': 0.00046649219824043984, 'samples': 5121216, 'steps': 26672, 'loss/train': 1.622520089149475} 11/07/2021 00:58:01 - INFO - __main__ - Step 26674: {'lr': 0.00046648954429921914, 'samples': 5121408, 'steps': 26673, 'loss/train': 1.4281224012374878} 11/07/2021 00:58:02 - INFO - __main__ - Step 26675: {'lr': 0.00046648689026045157, 'samples': 5121600, 'steps': 26674, 'loss/train': 1.4314370155334473} 11/07/2021 00:58:02 - INFO - __main__ - Step 26676: {'lr': 0.0004664842361241384, 'samples': 5121792, 'steps': 26675, 'loss/train': 1.8390313386917114} 11/07/2021 00:58:02 - INFO - __main__ - Step 26677: {'lr': 0.00046648158189028073, 'samples': 5121984, 'steps': 26676, 'loss/train': 1.6365265846252441} 11/07/2021 00:58:03 - INFO - __main__ - Step 26678: {'lr': 0.0004664789275588798, 'samples': 5122176, 'steps': 26677, 'loss/train': 1.651261806488037} 11/07/2021 00:58:03 - INFO - __main__ - Step 26679: {'lr': 0.0004664762731299368, 'samples': 5122368, 'steps': 26678, 'loss/train': 1.6249920129776} 11/07/2021 00:58:04 - INFO - __main__ - Step 26680: {'lr': 0.00046647361860345293, 'samples': 5122560, 'steps': 26679, 'loss/train': 2.140415668487549} 11/07/2021 00:58:04 - INFO - __main__ - Step 26681: {'lr': 0.00046647096397942945, 'samples': 5122752, 'steps': 26680, 'loss/train': 2.295044183731079} 11/07/2021 00:58:05 - INFO - __main__ - Step 26682: {'lr': 0.0004664683092578674, 'samples': 5122944, 'steps': 26681, 'loss/train': 1.655160665512085} 11/07/2021 00:58:05 - INFO - __main__ - Step 26683: {'lr': 0.00046646565443876815, 'samples': 5123136, 'steps': 26682, 'loss/train': 1.6432113647460938} 11/07/2021 00:58:05 - INFO - __main__ - Step 26684: {'lr': 0.00046646299952213277, 'samples': 5123328, 'steps': 26683, 'loss/train': 1.8974426984786987} 11/07/2021 00:58:06 - INFO - __main__ - Step 26685: {'lr': 0.00046646034450796255, 'samples': 5123520, 'steps': 26684, 'loss/train': 1.5197091102600098} 11/07/2021 00:58:07 - INFO - __main__ - Step 26686: {'lr': 0.0004664576893962586, 'samples': 5123712, 'steps': 26685, 'loss/train': 1.4399789571762085} 11/07/2021 00:58:07 - INFO - __main__ - Step 26687: {'lr': 0.0004664550341870222, 'samples': 5123904, 'steps': 26686, 'loss/train': 1.472825050354004} 11/07/2021 00:58:07 - INFO - __main__ - Step 26688: {'lr': 0.00046645237888025444, 'samples': 5124096, 'steps': 26687, 'loss/train': 1.816226601600647} 11/07/2021 00:58:08 - INFO - __main__ - Step 26689: {'lr': 0.0004664497234759566, 'samples': 5124288, 'steps': 26688, 'loss/train': 1.6909376382827759} 11/07/2021 00:58:09 - INFO - __main__ - Step 26690: {'lr': 0.00046644706797412984, 'samples': 5124480, 'steps': 26689, 'loss/train': 1.7982065677642822} 11/07/2021 00:58:09 - INFO - __main__ - Step 26691: {'lr': 0.00046644441237477544, 'samples': 5124672, 'steps': 26690, 'loss/train': 0.9981279373168945} 11/07/2021 00:58:10 - INFO - __main__ - Step 26692: {'lr': 0.00046644175667789444, 'samples': 5124864, 'steps': 26691, 'loss/train': 1.3450186252593994} 11/07/2021 00:58:10 - INFO - __main__ - Step 26693: {'lr': 0.00046643910088348817, 'samples': 5125056, 'steps': 26692, 'loss/train': 1.7120623588562012} 11/07/2021 00:58:10 - INFO - __main__ - Step 26694: {'lr': 0.0004664364449915578, 'samples': 5125248, 'steps': 26693, 'loss/train': 1.2893990278244019} 11/07/2021 00:58:11 - INFO - __main__ - Step 26695: {'lr': 0.0004664337890021044, 'samples': 5125440, 'steps': 26694, 'loss/train': 1.5590112209320068} 11/07/2021 00:58:12 - INFO - __main__ - Step 26696: {'lr': 0.0004664311329151294, 'samples': 5125632, 'steps': 26695, 'loss/train': 1.4270391464233398} 11/07/2021 00:58:12 - INFO - __main__ - Step 26697: {'lr': 0.0004664284767306338, 'samples': 5125824, 'steps': 26696, 'loss/train': 1.45754075050354} 11/07/2021 00:58:12 - INFO - __main__ - Step 26698: {'lr': 0.0004664258204486189, 'samples': 5126016, 'steps': 26697, 'loss/train': 1.4459261894226074} 11/07/2021 00:58:13 - INFO - __main__ - Step 26699: {'lr': 0.0004664231640690859, 'samples': 5126208, 'steps': 26698, 'loss/train': 1.7300431728363037} 11/07/2021 00:58:13 - INFO - __main__ - Step 26700: {'lr': 0.0004664205075920359, 'samples': 5126400, 'steps': 26699, 'loss/train': 2.156492233276367} 11/07/2021 00:58:14 - INFO - __main__ - Step 26701: {'lr': 0.0004664178510174702, 'samples': 5126592, 'steps': 26700, 'loss/train': 1.6679294109344482} 11/07/2021 00:58:14 - INFO - __main__ - Step 26702: {'lr': 0.0004664151943453899, 'samples': 5126784, 'steps': 26701, 'loss/train': 1.5192807912826538} 11/07/2021 00:58:15 - INFO - __main__ - Step 26703: {'lr': 0.0004664125375757963, 'samples': 5126976, 'steps': 26702, 'loss/train': 1.510108470916748} 11/07/2021 00:58:15 - INFO - __main__ - Step 26704: {'lr': 0.00046640988070869053, 'samples': 5127168, 'steps': 26703, 'loss/train': 1.4754691123962402} 11/07/2021 00:58:16 - INFO - __main__ - Step 26705: {'lr': 0.00046640722374407384, 'samples': 5127360, 'steps': 26704, 'loss/train': 1.5549184083938599} 11/07/2021 00:58:17 - INFO - __main__ - Step 26706: {'lr': 0.00046640456668194737, 'samples': 5127552, 'steps': 26705, 'loss/train': 1.417669653892517} 11/07/2021 00:58:17 - INFO - __main__ - Step 26707: {'lr': 0.0004664019095223123, 'samples': 5127744, 'steps': 26706, 'loss/train': 1.262236475944519} 11/07/2021 00:58:17 - INFO - __main__ - Step 26708: {'lr': 0.00046639925226517, 'samples': 5127936, 'steps': 26707, 'loss/train': 1.6880942583084106} 11/07/2021 00:58:18 - INFO - __main__ - Step 26709: {'lr': 0.0004663965949105214, 'samples': 5128128, 'steps': 26708, 'loss/train': 1.3780642747879028} 11/07/2021 00:58:18 - INFO - __main__ - Step 26710: {'lr': 0.0004663939374583679, 'samples': 5128320, 'steps': 26709, 'loss/train': 1.3778291940689087} 11/07/2021 00:58:19 - INFO - __main__ - Step 26711: {'lr': 0.00046639127990871055, 'samples': 5128512, 'steps': 26710, 'loss/train': 1.5793744325637817} 11/07/2021 00:58:19 - INFO - __main__ - Step 26712: {'lr': 0.00046638862226155075, 'samples': 5128704, 'steps': 26711, 'loss/train': 1.086185336112976} 11/07/2021 00:58:20 - INFO - __main__ - Step 26713: {'lr': 0.0004663859645168895, 'samples': 5128896, 'steps': 26712, 'loss/train': 1.713158369064331} 11/07/2021 00:58:20 - INFO - __main__ - Step 26714: {'lr': 0.00046638330667472805, 'samples': 5129088, 'steps': 26713, 'loss/train': 1.4040732383728027} 11/07/2021 00:58:21 - INFO - __main__ - Step 26715: {'lr': 0.0004663806487350677, 'samples': 5129280, 'steps': 26714, 'loss/train': 1.3665399551391602} 11/07/2021 00:58:21 - INFO - __main__ - Step 26716: {'lr': 0.00046637799069790953, 'samples': 5129472, 'steps': 26715, 'loss/train': 1.311948299407959} 11/07/2021 00:58:22 - INFO - __main__ - Step 26717: {'lr': 0.00046637533256325476, 'samples': 5129664, 'steps': 26716, 'loss/train': 1.5316460132598877} 11/07/2021 00:58:22 - INFO - __main__ - Step 26718: {'lr': 0.0004663726743311046, 'samples': 5129856, 'steps': 26717, 'loss/train': 1.1193568706512451} 11/07/2021 00:58:23 - INFO - __main__ - Step 26719: {'lr': 0.00046637001600146027, 'samples': 5130048, 'steps': 26718, 'loss/train': 1.4479613304138184} 11/07/2021 00:58:23 - INFO - __main__ - Step 26720: {'lr': 0.000466367357574323, 'samples': 5130240, 'steps': 26719, 'loss/train': 1.812766671180725} 11/07/2021 00:58:24 - INFO - __main__ - Step 26721: {'lr': 0.00046636469904969387, 'samples': 5130432, 'steps': 26720, 'loss/train': 1.200969934463501} 11/07/2021 00:58:24 - INFO - __main__ - Step 26722: {'lr': 0.0004663620404275741, 'samples': 5130624, 'steps': 26721, 'loss/train': 1.617827296257019} 11/07/2021 00:58:25 - INFO - __main__ - Step 26723: {'lr': 0.00046635938170796505, 'samples': 5130816, 'steps': 26722, 'loss/train': 1.5450035333633423} 11/07/2021 00:58:25 - INFO - __main__ - Step 26724: {'lr': 0.00046635672289086774, 'samples': 5131008, 'steps': 26723, 'loss/train': 1.504841923713684} 11/07/2021 00:58:25 - INFO - __main__ - Step 26725: {'lr': 0.00046635406397628346, 'samples': 5131200, 'steps': 26724, 'loss/train': 1.0579419136047363} 11/07/2021 00:58:26 - INFO - __main__ - Step 26726: {'lr': 0.00046635140496421336, 'samples': 5131392, 'steps': 26725, 'loss/train': 1.883946418762207} 11/07/2021 00:58:27 - INFO - __main__ - Step 26727: {'lr': 0.0004663487458546586, 'samples': 5131584, 'steps': 26726, 'loss/train': 2.128173351287842} 11/07/2021 00:58:27 - INFO - __main__ - Step 26728: {'lr': 0.0004663460866476205, 'samples': 5131776, 'steps': 26727, 'loss/train': 1.4596672058105469} 11/07/2021 00:58:27 - INFO - __main__ - Step 26729: {'lr': 0.00046634342734310023, 'samples': 5131968, 'steps': 26728, 'loss/train': 1.3028556108474731} 11/07/2021 00:58:28 - INFO - __main__ - Step 26730: {'lr': 0.0004663407679410988, 'samples': 5132160, 'steps': 26729, 'loss/train': 1.107367992401123} 11/07/2021 00:58:29 - INFO - __main__ - Step 26731: {'lr': 0.0004663381084416177, 'samples': 5132352, 'steps': 26730, 'loss/train': 0.9006083607673645} 11/07/2021 00:58:29 - INFO - __main__ - Step 26732: {'lr': 0.00046633544884465796, 'samples': 5132544, 'steps': 26731, 'loss/train': 1.004862904548645} 11/07/2021 00:58:30 - INFO - __main__ - Step 26733: {'lr': 0.0004663327891502208, 'samples': 5132736, 'steps': 26732, 'loss/train': 1.9089033603668213} 11/07/2021 00:58:30 - INFO - __main__ - Step 26734: {'lr': 0.0004663301293583073, 'samples': 5132928, 'steps': 26733, 'loss/train': 1.0407803058624268} 11/07/2021 00:58:30 - INFO - __main__ - Step 26735: {'lr': 0.000466327469468919, 'samples': 5133120, 'steps': 26734, 'loss/train': 1.420594573020935} 11/07/2021 00:58:31 - INFO - __main__ - Step 26736: {'lr': 0.0004663248094820567, 'samples': 5133312, 'steps': 26735, 'loss/train': 1.2276805639266968} 11/07/2021 00:58:32 - INFO - __main__ - Step 26737: {'lr': 0.00046632214939772187, 'samples': 5133504, 'steps': 26736, 'loss/train': 1.0551986694335938} 11/07/2021 00:58:32 - INFO - __main__ - Step 26738: {'lr': 0.0004663194892159156, 'samples': 5133696, 'steps': 26737, 'loss/train': 1.1808747053146362} 11/07/2021 00:58:32 - INFO - __main__ - Step 26739: {'lr': 0.0004663168289366391, 'samples': 5133888, 'steps': 26738, 'loss/train': 1.6448155641555786} 11/07/2021 00:58:33 - INFO - __main__ - Step 26740: {'lr': 0.0004663141685598936, 'samples': 5134080, 'steps': 26739, 'loss/train': 1.2650645971298218} 11/07/2021 00:58:34 - INFO - __main__ - Step 26741: {'lr': 0.00046631150808568026, 'samples': 5134272, 'steps': 26740, 'loss/train': 1.4323738813400269} 11/07/2021 00:58:34 - INFO - __main__ - Step 26742: {'lr': 0.00046630884751400024, 'samples': 5134464, 'steps': 26741, 'loss/train': 1.3132368326187134} 11/07/2021 00:58:34 - INFO - __main__ - Step 26743: {'lr': 0.0004663061868448548, 'samples': 5134656, 'steps': 26742, 'loss/train': 1.777282476425171} 11/07/2021 00:58:35 - INFO - __main__ - Step 26744: {'lr': 0.0004663035260782452, 'samples': 5134848, 'steps': 26743, 'loss/train': 1.5186762809753418} 11/07/2021 00:58:35 - INFO - __main__ - Step 26745: {'lr': 0.0004663008652141726, 'samples': 5135040, 'steps': 26744, 'loss/train': 1.3067575693130493} 11/07/2021 00:58:36 - INFO - __main__ - Step 26746: {'lr': 0.00046629820425263805, 'samples': 5135232, 'steps': 26745, 'loss/train': 1.5075962543487549} 11/07/2021 00:58:37 - INFO - __main__ - Step 26747: {'lr': 0.00046629554319364293, 'samples': 5135424, 'steps': 26746, 'loss/train': 1.64328932762146} 11/07/2021 00:58:37 - INFO - __main__ - Step 26748: {'lr': 0.00046629288203718834, 'samples': 5135616, 'steps': 26747, 'loss/train': 1.1373062133789062} 11/07/2021 00:58:37 - INFO - __main__ - Step 26749: {'lr': 0.00046629022078327557, 'samples': 5135808, 'steps': 26748, 'loss/train': 1.623582124710083} 11/07/2021 00:58:38 - INFO - __main__ - Step 26750: {'lr': 0.0004662875594319057, 'samples': 5136000, 'steps': 26749, 'loss/train': 0.9610598683357239} 11/07/2021 00:58:38 - INFO - __main__ - Step 26751: {'lr': 0.00046628489798308006, 'samples': 5136192, 'steps': 26750, 'loss/train': 1.5221467018127441} 11/07/2021 00:58:39 - INFO - __main__ - Step 26752: {'lr': 0.0004662822364367997, 'samples': 5136384, 'steps': 26751, 'loss/train': 1.4284276962280273} 11/07/2021 00:58:39 - INFO - __main__ - Step 26753: {'lr': 0.000466279574793066, 'samples': 5136576, 'steps': 26752, 'loss/train': 1.580201268196106} 11/07/2021 00:58:40 - INFO - __main__ - Step 26754: {'lr': 0.00046627691305188004, 'samples': 5136768, 'steps': 26753, 'loss/train': 1.4526443481445312} 11/07/2021 00:58:40 - INFO - __main__ - Step 26755: {'lr': 0.00046627425121324294, 'samples': 5136960, 'steps': 26754, 'loss/train': 0.7486782670021057} 11/07/2021 00:58:40 - INFO - __main__ - Step 26756: {'lr': 0.0004662715892771561, 'samples': 5137152, 'steps': 26755, 'loss/train': 1.6319177150726318} 11/07/2021 00:58:42 - INFO - __main__ - Step 26757: {'lr': 0.0004662689272436206, 'samples': 5137344, 'steps': 26756, 'loss/train': 1.4941084384918213} 11/07/2021 00:58:42 - INFO - __main__ - Step 26758: {'lr': 0.00046626626511263764, 'samples': 5137536, 'steps': 26757, 'loss/train': 1.7254610061645508} 11/07/2021 00:58:42 - INFO - __main__ - Step 26759: {'lr': 0.00046626360288420845, 'samples': 5137728, 'steps': 26758, 'loss/train': 1.5891088247299194} 11/07/2021 00:58:43 - INFO - __main__ - Step 26760: {'lr': 0.00046626094055833426, 'samples': 5137920, 'steps': 26759, 'loss/train': 1.6341724395751953} 11/07/2021 00:58:43 - INFO - __main__ - Step 26761: {'lr': 0.0004662582781350161, 'samples': 5138112, 'steps': 26760, 'loss/train': 1.5685557126998901} 11/07/2021 00:58:44 - INFO - __main__ - Step 26762: {'lr': 0.00046625561561425543, 'samples': 5138304, 'steps': 26761, 'loss/train': 0.4045870006084442} 11/07/2021 00:58:44 - INFO - __main__ - Step 26763: {'lr': 0.00046625295299605323, 'samples': 5138496, 'steps': 26762, 'loss/train': 1.108884334564209} 11/07/2021 00:58:45 - INFO - __main__ - Step 26764: {'lr': 0.0004662502902804109, 'samples': 5138688, 'steps': 26763, 'loss/train': 1.5114766359329224} 11/07/2021 00:58:45 - INFO - __main__ - Step 26765: {'lr': 0.0004662476274673294, 'samples': 5138880, 'steps': 26764, 'loss/train': 0.9535731077194214} 11/07/2021 00:58:46 - INFO - __main__ - Step 26766: {'lr': 0.00046624496455681006, 'samples': 5139072, 'steps': 26765, 'loss/train': 0.9992877244949341} 11/07/2021 00:58:46 - INFO - __main__ - Step 26767: {'lr': 0.00046624230154885415, 'samples': 5139264, 'steps': 26766, 'loss/train': 0.9134907722473145} 11/07/2021 00:58:47 - INFO - __main__ - Step 26768: {'lr': 0.0004662396384434627, 'samples': 5139456, 'steps': 26767, 'loss/train': 1.1533561944961548} 11/07/2021 00:58:47 - INFO - __main__ - Step 26769: {'lr': 0.00046623697524063713, 'samples': 5139648, 'steps': 26768, 'loss/train': 1.5940784215927124} 11/07/2021 00:58:48 - INFO - __main__ - Step 26770: {'lr': 0.00046623431194037847, 'samples': 5139840, 'steps': 26769, 'loss/train': 1.6589995622634888} 11/07/2021 00:58:48 - INFO - __main__ - Step 26771: {'lr': 0.000466231648542688, 'samples': 5140032, 'steps': 26770, 'loss/train': 1.7405768632888794} 11/07/2021 00:58:48 - INFO - __main__ - Step 26772: {'lr': 0.0004662289850475668, 'samples': 5140224, 'steps': 26771, 'loss/train': 1.5431126356124878} 11/07/2021 00:58:49 - INFO - __main__ - Step 26773: {'lr': 0.0004662263214550162, 'samples': 5140416, 'steps': 26772, 'loss/train': 1.5235966444015503} 11/07/2021 00:58:50 - INFO - __main__ - Step 26774: {'lr': 0.00046622365776503735, 'samples': 5140608, 'steps': 26773, 'loss/train': 1.3490772247314453} 11/07/2021 00:58:50 - INFO - __main__ - Step 26775: {'lr': 0.0004662209939776315, 'samples': 5140800, 'steps': 26774, 'loss/train': 1.6184896230697632} 11/07/2021 00:58:50 - INFO - __main__ - Step 26776: {'lr': 0.0004662183300927997, 'samples': 5140992, 'steps': 26775, 'loss/train': 1.5121411085128784} 11/07/2021 00:58:51 - INFO - __main__ - Step 26777: {'lr': 0.0004662156661105433, 'samples': 5141184, 'steps': 26776, 'loss/train': 1.5256009101867676} 11/07/2021 00:58:52 - INFO - __main__ - Step 26778: {'lr': 0.0004662130020308635, 'samples': 5141376, 'steps': 26777, 'loss/train': 1.8537344932556152} 11/07/2021 00:58:52 - INFO - __main__ - Step 26779: {'lr': 0.00046621033785376146, 'samples': 5141568, 'steps': 26778, 'loss/train': 1.7795383930206299} 11/07/2021 00:58:53 - INFO - __main__ - Step 26780: {'lr': 0.00046620767357923834, 'samples': 5141760, 'steps': 26779, 'loss/train': 1.3969024419784546} 11/07/2021 00:58:53 - INFO - __main__ - Step 26781: {'lr': 0.0004662050092072954, 'samples': 5141952, 'steps': 26780, 'loss/train': 1.4966504573822021} 11/07/2021 00:58:53 - INFO - __main__ - Step 26782: {'lr': 0.0004662023447379338, 'samples': 5142144, 'steps': 26781, 'loss/train': 1.615160584449768} 11/07/2021 00:58:54 - INFO - __main__ - Step 26783: {'lr': 0.0004661996801711548, 'samples': 5142336, 'steps': 26782, 'loss/train': 1.3632062673568726} 11/07/2021 00:58:55 - INFO - __main__ - Step 26784: {'lr': 0.0004661970155069595, 'samples': 5142528, 'steps': 26783, 'loss/train': 1.2376118898391724} 11/07/2021 00:58:55 - INFO - __main__ - Step 26785: {'lr': 0.00046619435074534923, 'samples': 5142720, 'steps': 26784, 'loss/train': 1.4335567951202393} 11/07/2021 00:58:55 - INFO - __main__ - Step 26786: {'lr': 0.0004661916858863251, 'samples': 5142912, 'steps': 26785, 'loss/train': 1.4138081073760986} 11/07/2021 00:58:56 - INFO - __main__ - Step 26787: {'lr': 0.00046618902092988824, 'samples': 5143104, 'steps': 26786, 'loss/train': 1.8235607147216797} 11/07/2021 00:58:57 - INFO - __main__ - Step 26788: {'lr': 0.00046618635587604006, 'samples': 5143296, 'steps': 26787, 'loss/train': 1.5318324565887451} 11/07/2021 00:58:57 - INFO - __main__ - Step 26789: {'lr': 0.00046618369072478163, 'samples': 5143488, 'steps': 26788, 'loss/train': 1.4976593255996704} 11/07/2021 00:58:58 - INFO - __main__ - Step 26790: {'lr': 0.0004661810254761141, 'samples': 5143680, 'steps': 26789, 'loss/train': 1.76413893699646} 11/07/2021 00:58:58 - INFO - __main__ - Step 26791: {'lr': 0.0004661783601300388, 'samples': 5143872, 'steps': 26790, 'loss/train': 1.2355625629425049} 11/07/2021 00:58:58 - INFO - __main__ - Step 26792: {'lr': 0.00046617569468655686, 'samples': 5144064, 'steps': 26791, 'loss/train': 1.8549610376358032} 11/07/2021 00:58:59 - INFO - __main__ - Step 26793: {'lr': 0.00046617302914566945, 'samples': 5144256, 'steps': 26792, 'loss/train': 1.5732007026672363} 11/07/2021 00:59:00 - INFO - __main__ - Step 26794: {'lr': 0.00046617036350737786, 'samples': 5144448, 'steps': 26793, 'loss/train': 1.8792085647583008} 11/07/2021 00:59:00 - INFO - __main__ - Step 26795: {'lr': 0.0004661676977716832, 'samples': 5144640, 'steps': 26794, 'loss/train': 0.6580197215080261} 11/07/2021 00:59:00 - INFO - __main__ - Step 26796: {'lr': 0.0004661650319385867, 'samples': 5144832, 'steps': 26795, 'loss/train': 1.1191657781600952} 11/07/2021 00:59:01 - INFO - __main__ - Step 26797: {'lr': 0.0004661623660080896, 'samples': 5145024, 'steps': 26796, 'loss/train': 1.5374164581298828} 11/07/2021 00:59:01 - INFO - __main__ - Step 26798: {'lr': 0.000466159699980193, 'samples': 5145216, 'steps': 26797, 'loss/train': 1.153203010559082} 11/07/2021 00:59:02 - INFO - __main__ - Step 26799: {'lr': 0.0004661570338548983, 'samples': 5145408, 'steps': 26798, 'loss/train': 1.4095265865325928} 11/07/2021 00:59:02 - INFO - __main__ - Step 26800: {'lr': 0.00046615436763220645, 'samples': 5145600, 'steps': 26799, 'loss/train': 1.2437288761138916} 11/07/2021 00:59:03 - INFO - __main__ - Step 26801: {'lr': 0.0004661517013121189, 'samples': 5145792, 'steps': 26800, 'loss/train': 1.4183251857757568} 11/07/2021 00:59:03 - INFO - __main__ - Step 26802: {'lr': 0.00046614903489463667, 'samples': 5145984, 'steps': 26801, 'loss/train': 1.6035794019699097} 11/07/2021 00:59:04 - INFO - __main__ - Step 26803: {'lr': 0.000466146368379761, 'samples': 5146176, 'steps': 26802, 'loss/train': 1.5024195909500122} 11/07/2021 00:59:05 - INFO - __main__ - Step 26804: {'lr': 0.0004661437017674931, 'samples': 5146368, 'steps': 26803, 'loss/train': 1.229723572731018} 11/07/2021 00:59:05 - INFO - __main__ - Step 26805: {'lr': 0.00046614103505783423, 'samples': 5146560, 'steps': 26804, 'loss/train': 1.6293288469314575} 11/07/2021 00:59:05 - INFO - __main__ - Step 26806: {'lr': 0.0004661383682507856, 'samples': 5146752, 'steps': 26805, 'loss/train': 1.8361610174179077} 11/07/2021 00:59:06 - INFO - __main__ - Step 26807: {'lr': 0.00046613570134634825, 'samples': 5146944, 'steps': 26806, 'loss/train': 1.4449939727783203} 11/07/2021 00:59:06 - INFO - __main__ - Step 26808: {'lr': 0.00046613303434452346, 'samples': 5147136, 'steps': 26807, 'loss/train': 1.5649182796478271} 11/07/2021 00:59:07 - INFO - __main__ - Step 26809: {'lr': 0.00046613036724531254, 'samples': 5147328, 'steps': 26808, 'loss/train': 1.4508755207061768} 11/07/2021 00:59:07 - INFO - __main__ - Step 26810: {'lr': 0.00046612770004871663, 'samples': 5147520, 'steps': 26809, 'loss/train': 0.7488903403282166} 11/07/2021 00:59:08 - INFO - __main__ - Step 26811: {'lr': 0.00046612503275473687, 'samples': 5147712, 'steps': 26810, 'loss/train': 1.3369272947311401} 11/07/2021 00:59:08 - INFO - __main__ - Step 26812: {'lr': 0.00046612236536337456, 'samples': 5147904, 'steps': 26811, 'loss/train': 1.7601577043533325} 11/07/2021 00:59:08 - INFO - __main__ - Step 26813: {'lr': 0.00046611969787463083, 'samples': 5148096, 'steps': 26812, 'loss/train': 1.4340001344680786} 11/07/2021 00:59:10 - INFO - __main__ - Step 26814: {'lr': 0.00046611703028850683, 'samples': 5148288, 'steps': 26813, 'loss/train': 1.94196617603302} 11/07/2021 00:59:10 - INFO - __main__ - Step 26815: {'lr': 0.00046611436260500386, 'samples': 5148480, 'steps': 26814, 'loss/train': 1.50175142288208} 11/07/2021 00:59:10 - INFO - __main__ - Step 26816: {'lr': 0.00046611169482412305, 'samples': 5148672, 'steps': 26815, 'loss/train': 1.3230130672454834} 11/07/2021 00:59:11 - INFO - __main__ - Step 26817: {'lr': 0.00046610902694586576, 'samples': 5148864, 'steps': 26816, 'loss/train': 1.762872338294983} 11/07/2021 00:59:11 - INFO - __main__ - Step 26818: {'lr': 0.00046610635897023303, 'samples': 5149056, 'steps': 26817, 'loss/train': 0.9505210518836975} 11/07/2021 00:59:11 - INFO - __main__ - Step 26819: {'lr': 0.0004661036908972261, 'samples': 5149248, 'steps': 26818, 'loss/train': 1.371229648590088} 11/07/2021 00:59:12 - INFO - __main__ - Step 26820: {'lr': 0.0004661010227268462, 'samples': 5149440, 'steps': 26819, 'loss/train': 1.170758605003357} 11/07/2021 00:59:13 - INFO - __main__ - Step 26821: {'lr': 0.0004660983544590944, 'samples': 5149632, 'steps': 26820, 'loss/train': 1.5897732973098755} 11/07/2021 00:59:13 - INFO - __main__ - Step 26822: {'lr': 0.0004660956860939722, 'samples': 5149824, 'steps': 26821, 'loss/train': 1.6550143957138062} 11/07/2021 00:59:13 - INFO - __main__ - Step 26823: {'lr': 0.0004660930176314805, 'samples': 5150016, 'steps': 26822, 'loss/train': 1.2598298788070679} 11/07/2021 00:59:14 - INFO - __main__ - Step 26824: {'lr': 0.0004660903490716206, 'samples': 5150208, 'steps': 26823, 'loss/train': 1.2444219589233398} 11/07/2021 00:59:15 - INFO - __main__ - Step 26825: {'lr': 0.0004660876804143938, 'samples': 5150400, 'steps': 26824, 'loss/train': 1.2991782426834106} 11/07/2021 00:59:15 - INFO - __main__ - Step 26826: {'lr': 0.0004660850116598012, 'samples': 5150592, 'steps': 26825, 'loss/train': 2.0060222148895264} 11/07/2021 00:59:16 - INFO - __main__ - Step 26827: {'lr': 0.00046608234280784406, 'samples': 5150784, 'steps': 26826, 'loss/train': 1.3262913227081299} 11/07/2021 00:59:16 - INFO - __main__ - Step 26828: {'lr': 0.0004660796738585235, 'samples': 5150976, 'steps': 26827, 'loss/train': 1.8722556829452515} 11/07/2021 00:59:16 - INFO - __main__ - Step 26829: {'lr': 0.0004660770048118408, 'samples': 5151168, 'steps': 26828, 'loss/train': 1.2325032949447632} 11/07/2021 00:59:18 - INFO - __main__ - Step 26830: {'lr': 0.00046607433566779713, 'samples': 5151360, 'steps': 26829, 'loss/train': 0.7690796256065369} 11/07/2021 00:59:18 - INFO - __main__ - Step 26831: {'lr': 0.00046607166642639365, 'samples': 5151552, 'steps': 26830, 'loss/train': 1.4862192869186401} 11/07/2021 00:59:18 - INFO - __main__ - Step 26832: {'lr': 0.00046606899708763174, 'samples': 5151744, 'steps': 26831, 'loss/train': 1.6997482776641846} 11/07/2021 00:59:19 - INFO - __main__ - Step 26833: {'lr': 0.0004660663276515124, 'samples': 5151936, 'steps': 26832, 'loss/train': 1.7106437683105469} 11/07/2021 00:59:19 - INFO - __main__ - Step 26834: {'lr': 0.00046606365811803686, 'samples': 5152128, 'steps': 26833, 'loss/train': 1.5306406021118164} 11/07/2021 00:59:20 - INFO - __main__ - Step 26835: {'lr': 0.0004660609884872064, 'samples': 5152320, 'steps': 26834, 'loss/train': 1.4699666500091553} 11/07/2021 00:59:20 - INFO - __main__ - Step 26836: {'lr': 0.00046605831875902215, 'samples': 5152512, 'steps': 26835, 'loss/train': 1.2721798419952393} 11/07/2021 00:59:21 - INFO - __main__ - Step 26837: {'lr': 0.00046605564893348545, 'samples': 5152704, 'steps': 26836, 'loss/train': 1.8770490884780884} 11/07/2021 00:59:21 - INFO - __main__ - Step 26838: {'lr': 0.0004660529790105974, 'samples': 5152896, 'steps': 26837, 'loss/train': 1.1704834699630737} 11/07/2021 00:59:21 - INFO - __main__ - Step 26839: {'lr': 0.00046605030899035915, 'samples': 5153088, 'steps': 26838, 'loss/train': 1.6779916286468506} 11/07/2021 00:59:23 - INFO - __main__ - Step 26840: {'lr': 0.000466047638872772, 'samples': 5153280, 'steps': 26839, 'loss/train': 1.5715537071228027} 11/07/2021 00:59:23 - INFO - __main__ - Step 26841: {'lr': 0.0004660449686578371, 'samples': 5153472, 'steps': 26840, 'loss/train': 1.7219822406768799} 11/07/2021 00:59:23 - INFO - __main__ - Step 26842: {'lr': 0.0004660422983455557, 'samples': 5153664, 'steps': 26841, 'loss/train': 1.8314694166183472} 11/07/2021 00:59:24 - INFO - __main__ - Step 26843: {'lr': 0.0004660396279359289, 'samples': 5153856, 'steps': 26842, 'loss/train': 1.2206279039382935} 11/07/2021 00:59:24 - INFO - __main__ - Step 26844: {'lr': 0.000466036957428958, 'samples': 5154048, 'steps': 26843, 'loss/train': 1.7041361331939697} 11/07/2021 00:59:25 - INFO - __main__ - Step 26845: {'lr': 0.0004660342868246442, 'samples': 5154240, 'steps': 26844, 'loss/train': 2.0032145977020264} 11/07/2021 00:59:26 - INFO - __main__ - Step 26846: {'lr': 0.0004660316161229887, 'samples': 5154432, 'steps': 26845, 'loss/train': 0.9417108297348022} 11/07/2021 00:59:26 - INFO - __main__ - Step 26847: {'lr': 0.00046602894532399275, 'samples': 5154624, 'steps': 26846, 'loss/train': 1.3233318328857422} 11/07/2021 00:59:26 - INFO - __main__ - Step 26848: {'lr': 0.00046602627442765744, 'samples': 5154816, 'steps': 26847, 'loss/train': 1.4452942609786987} 11/07/2021 00:59:27 - INFO - __main__ - Step 26849: {'lr': 0.00046602360343398397, 'samples': 5155008, 'steps': 26848, 'loss/train': 1.0663906335830688} 11/07/2021 00:59:27 - INFO - __main__ - Step 26850: {'lr': 0.0004660209323429736, 'samples': 5155200, 'steps': 26849, 'loss/train': 1.239379644393921} 11/07/2021 00:59:27 - INFO - __main__ - Step 26851: {'lr': 0.0004660182611546276, 'samples': 5155392, 'steps': 26850, 'loss/train': 1.7812410593032837} 11/07/2021 00:59:28 - INFO - __main__ - Step 26852: {'lr': 0.0004660155898689471, 'samples': 5155584, 'steps': 26851, 'loss/train': 1.6919243335723877} 11/07/2021 00:59:29 - INFO - __main__ - Step 26853: {'lr': 0.0004660129184859332, 'samples': 5155776, 'steps': 26852, 'loss/train': 2.3535852432250977} 11/07/2021 00:59:29 - INFO - __main__ - Step 26854: {'lr': 0.00046601024700558736, 'samples': 5155968, 'steps': 26853, 'loss/train': 1.173360824584961} 11/07/2021 00:59:30 - INFO - __main__ - Step 26855: {'lr': 0.0004660075754279105, 'samples': 5156160, 'steps': 26854, 'loss/train': 1.432096242904663} 11/07/2021 00:59:30 - INFO - __main__ - Step 26856: {'lr': 0.00046600490375290406, 'samples': 5156352, 'steps': 26855, 'loss/train': 1.6817970275878906} 11/07/2021 00:59:31 - INFO - __main__ - Step 26857: {'lr': 0.0004660022319805691, 'samples': 5156544, 'steps': 26856, 'loss/train': 1.6581238508224487} 11/07/2021 00:59:31 - INFO - __main__ - Step 26858: {'lr': 0.0004659995601109069, 'samples': 5156736, 'steps': 26857, 'loss/train': 1.334643006324768} 11/07/2021 00:59:32 - INFO - __main__ - Step 26859: {'lr': 0.0004659968881439186, 'samples': 5156928, 'steps': 26858, 'loss/train': 1.651904582977295} 11/07/2021 00:59:32 - INFO - __main__ - Step 26860: {'lr': 0.00046599421607960545, 'samples': 5157120, 'steps': 26859, 'loss/train': 1.7464357614517212} 11/07/2021 00:59:32 - INFO - __main__ - Step 26861: {'lr': 0.0004659915439179686, 'samples': 5157312, 'steps': 26860, 'loss/train': 1.5475162267684937} 11/07/2021 00:59:33 - INFO - __main__ - Step 26862: {'lr': 0.0004659888716590094, 'samples': 5157504, 'steps': 26861, 'loss/train': 1.158503770828247} 11/07/2021 00:59:34 - INFO - __main__ - Step 26863: {'lr': 0.00046598619930272883, 'samples': 5157696, 'steps': 26862, 'loss/train': 1.7206814289093018} 11/07/2021 00:59:34 - INFO - __main__ - Step 26864: {'lr': 0.00046598352684912824, 'samples': 5157888, 'steps': 26863, 'loss/train': 1.4895248413085938} 11/07/2021 00:59:34 - INFO - __main__ - Step 26865: {'lr': 0.0004659808542982088, 'samples': 5158080, 'steps': 26864, 'loss/train': 1.326369047164917} 11/07/2021 00:59:35 - INFO - __main__ - Step 26866: {'lr': 0.0004659781816499718, 'samples': 5158272, 'steps': 26865, 'loss/train': 1.5209534168243408} 11/07/2021 00:59:36 - INFO - __main__ - Step 26867: {'lr': 0.0004659755089044183, 'samples': 5158464, 'steps': 26866, 'loss/train': 1.605056643486023} 11/07/2021 00:59:36 - INFO - __main__ - Step 26868: {'lr': 0.00046597283606154957, 'samples': 5158656, 'steps': 26867, 'loss/train': 0.6222965121269226} 11/07/2021 00:59:37 - INFO - __main__ - Step 26869: {'lr': 0.0004659701631213668, 'samples': 5158848, 'steps': 26868, 'loss/train': 1.559769868850708} 11/07/2021 00:59:37 - INFO - __main__ - Step 26870: {'lr': 0.00046596749008387124, 'samples': 5159040, 'steps': 26869, 'loss/train': 1.3670252561569214} 11/07/2021 00:59:37 - INFO - __main__ - Step 26871: {'lr': 0.00046596481694906403, 'samples': 5159232, 'steps': 26870, 'loss/train': 0.9740362763404846} 11/07/2021 00:59:38 - INFO - __main__ - Step 26872: {'lr': 0.00046596214371694643, 'samples': 5159424, 'steps': 26871, 'loss/train': 1.776827335357666} 11/07/2021 00:59:39 - INFO - __main__ - Step 26873: {'lr': 0.00046595947038751963, 'samples': 5159616, 'steps': 26872, 'loss/train': 1.6072895526885986} 11/07/2021 00:59:39 - INFO - __main__ - Step 26874: {'lr': 0.00046595679696078476, 'samples': 5159808, 'steps': 26873, 'loss/train': 1.5575486421585083} 11/07/2021 00:59:39 - INFO - __main__ - Step 26875: {'lr': 0.00046595412343674317, 'samples': 5160000, 'steps': 26874, 'loss/train': 1.6973767280578613} 11/07/2021 00:59:40 - INFO - __main__ - Step 26876: {'lr': 0.00046595144981539596, 'samples': 5160192, 'steps': 26875, 'loss/train': 1.2691295146942139} 11/07/2021 00:59:41 - INFO - __main__ - Step 26877: {'lr': 0.00046594877609674437, 'samples': 5160384, 'steps': 26876, 'loss/train': 1.1658782958984375} 11/07/2021 00:59:41 - INFO - __main__ - Step 26878: {'lr': 0.00046594610228078954, 'samples': 5160576, 'steps': 26877, 'loss/train': 1.6623400449752808} 11/07/2021 00:59:41 - INFO - __main__ - Step 26879: {'lr': 0.00046594342836753276, 'samples': 5160768, 'steps': 26878, 'loss/train': 2.1451001167297363} 11/07/2021 00:59:42 - INFO - __main__ - Step 26880: {'lr': 0.0004659407543569752, 'samples': 5160960, 'steps': 26879, 'loss/train': 1.7946914434432983} 11/07/2021 00:59:42 - INFO - __main__ - Step 26881: {'lr': 0.0004659380802491181, 'samples': 5161152, 'steps': 26880, 'loss/train': 1.3722161054611206} 11/07/2021 00:59:43 - INFO - __main__ - Step 26882: {'lr': 0.00046593540604396256, 'samples': 5161344, 'steps': 26881, 'loss/train': 0.8809364438056946} 11/07/2021 00:59:44 - INFO - __main__ - Step 26883: {'lr': 0.00046593273174150995, 'samples': 5161536, 'steps': 26882, 'loss/train': 1.5647692680358887} 11/07/2021 00:59:44 - INFO - __main__ - Step 26884: {'lr': 0.0004659300573417613, 'samples': 5161728, 'steps': 26883, 'loss/train': 1.522611141204834} 11/07/2021 00:59:44 - INFO - __main__ - Step 26885: {'lr': 0.00046592738284471794, 'samples': 5161920, 'steps': 26884, 'loss/train': 1.2077406644821167} 11/07/2021 00:59:45 - INFO - __main__ - Step 26886: {'lr': 0.000465924708250381, 'samples': 5162112, 'steps': 26885, 'loss/train': 1.7725774049758911} 11/07/2021 00:59:45 - INFO - __main__ - Step 26887: {'lr': 0.00046592203355875177, 'samples': 5162304, 'steps': 26886, 'loss/train': 1.3362911939620972} 11/07/2021 00:59:46 - INFO - __main__ - Step 26888: {'lr': 0.00046591935876983136, 'samples': 5162496, 'steps': 26887, 'loss/train': 1.8308025598526} 11/07/2021 00:59:46 - INFO - __main__ - Step 26889: {'lr': 0.0004659166838836211, 'samples': 5162688, 'steps': 26888, 'loss/train': 1.39945650100708} 11/07/2021 00:59:47 - INFO - __main__ - Step 26890: {'lr': 0.000465914008900122, 'samples': 5162880, 'steps': 26889, 'loss/train': 1.7456254959106445} 11/07/2021 00:59:47 - INFO - __main__ - Step 26891: {'lr': 0.00046591133381933546, 'samples': 5163072, 'steps': 26890, 'loss/train': 1.739495038986206} 11/07/2021 00:59:47 - INFO - __main__ - Step 26892: {'lr': 0.0004659086586412626, 'samples': 5163264, 'steps': 26891, 'loss/train': 1.6727256774902344} 11/07/2021 00:59:49 - INFO - __main__ - Step 26893: {'lr': 0.0004659059833659046, 'samples': 5163456, 'steps': 26892, 'loss/train': 1.331292748451233} 11/07/2021 00:59:49 - INFO - __main__ - Step 26894: {'lr': 0.0004659033079932627, 'samples': 5163648, 'steps': 26893, 'loss/train': 1.4414929151535034} 11/07/2021 00:59:49 - INFO - __main__ - Step 26895: {'lr': 0.00046590063252333806, 'samples': 5163840, 'steps': 26894, 'loss/train': 5.165720462799072} 11/07/2021 00:59:50 - INFO - __main__ - Step 26896: {'lr': 0.000465897956956132, 'samples': 5164032, 'steps': 26895, 'loss/train': 1.4558935165405273} 11/07/2021 00:59:50 - INFO - __main__ - Step 26897: {'lr': 0.0004658952812916456, 'samples': 5164224, 'steps': 26896, 'loss/train': 1.7362563610076904} 11/07/2021 00:59:51 - INFO - __main__ - Step 26898: {'lr': 0.0004658926055298802, 'samples': 5164416, 'steps': 26897, 'loss/train': 1.4366354942321777} 11/07/2021 00:59:52 - INFO - __main__ - Step 26899: {'lr': 0.0004658899296708369, 'samples': 5164608, 'steps': 26898, 'loss/train': 1.7091032266616821} 11/07/2021 00:59:52 - INFO - __main__ - Step 26900: {'lr': 0.00046588725371451685, 'samples': 5164800, 'steps': 26899, 'loss/train': 1.7321528196334839} 11/07/2021 00:59:52 - INFO - __main__ - Step 26901: {'lr': 0.00046588457766092134, 'samples': 5164992, 'steps': 26900, 'loss/train': 1.428074836730957} 11/07/2021 00:59:53 - INFO - __main__ - Step 26902: {'lr': 0.00046588190151005163, 'samples': 5165184, 'steps': 26901, 'loss/train': 1.7397880554199219} 11/07/2021 00:59:54 - INFO - __main__ - Step 26903: {'lr': 0.00046587922526190883, 'samples': 5165376, 'steps': 26902, 'loss/train': 1.7550833225250244} 11/07/2021 00:59:54 - INFO - __main__ - Step 26904: {'lr': 0.00046587654891649423, 'samples': 5165568, 'steps': 26903, 'loss/train': 1.6489547491073608} 11/07/2021 00:59:54 - INFO - __main__ - Step 26905: {'lr': 0.00046587387247380897, 'samples': 5165760, 'steps': 26904, 'loss/train': 1.046838641166687} 11/07/2021 00:59:55 - INFO - __main__ - Step 26906: {'lr': 0.00046587119593385424, 'samples': 5165952, 'steps': 26905, 'loss/train': 1.820117473602295} 11/07/2021 00:59:55 - INFO - __main__ - Step 26907: {'lr': 0.00046586851929663134, 'samples': 5166144, 'steps': 26906, 'loss/train': 1.7989552021026611} 11/07/2021 00:59:56 - INFO - __main__ - Step 26908: {'lr': 0.00046586584256214135, 'samples': 5166336, 'steps': 26907, 'loss/train': 1.5433944463729858} 11/07/2021 00:59:56 - INFO - __main__ - Step 26909: {'lr': 0.0004658631657303856, 'samples': 5166528, 'steps': 26908, 'loss/train': 2.116696834564209} 11/07/2021 00:59:57 - INFO - __main__ - Step 26910: {'lr': 0.0004658604888013652, 'samples': 5166720, 'steps': 26909, 'loss/train': 1.8567917346954346} 11/07/2021 00:59:57 - INFO - __main__ - Step 26911: {'lr': 0.00046585781177508137, 'samples': 5166912, 'steps': 26910, 'loss/train': 1.115211009979248} 11/07/2021 00:59:58 - INFO - __main__ - Step 26912: {'lr': 0.0004658551346515354, 'samples': 5167104, 'steps': 26911, 'loss/train': 1.6359246969223022} 11/07/2021 00:59:59 - INFO - __main__ - Step 26913: {'lr': 0.00046585245743072833, 'samples': 5167296, 'steps': 26912, 'loss/train': 2.238630533218384} 11/07/2021 00:59:59 - INFO - __main__ - Step 26914: {'lr': 0.0004658497801126616, 'samples': 5167488, 'steps': 26913, 'loss/train': 1.3893851041793823} 11/07/2021 00:59:59 - INFO - __main__ - Step 26915: {'lr': 0.00046584710269733623, 'samples': 5167680, 'steps': 26914, 'loss/train': 1.373899221420288} 11/07/2021 01:00:00 - INFO - __main__ - Step 26916: {'lr': 0.00046584442518475354, 'samples': 5167872, 'steps': 26915, 'loss/train': 1.1140812635421753} 11/07/2021 01:00:00 - INFO - __main__ - Step 26917: {'lr': 0.0004658417475749146, 'samples': 5168064, 'steps': 26916, 'loss/train': 1.7710485458374023} 11/07/2021 01:00:01 - INFO - __main__ - Step 26918: {'lr': 0.00046583906986782074, 'samples': 5168256, 'steps': 26917, 'loss/train': 1.4844192266464233} 11/07/2021 01:00:01 - INFO - __main__ - Step 26919: {'lr': 0.0004658363920634732, 'samples': 5168448, 'steps': 26918, 'loss/train': 1.3179831504821777} 11/07/2021 01:00:02 - INFO - __main__ - Step 26920: {'lr': 0.000465833714161873, 'samples': 5168640, 'steps': 26919, 'loss/train': 1.7123818397521973} 11/07/2021 01:00:02 - INFO - __main__ - Step 26921: {'lr': 0.00046583103616302146, 'samples': 5168832, 'steps': 26920, 'loss/train': 1.5609678030014038} 11/07/2021 01:00:02 - INFO - __main__ - Step 26922: {'lr': 0.0004658283580669198, 'samples': 5169024, 'steps': 26921, 'loss/train': 1.2969560623168945} 11/07/2021 01:00:03 - INFO - __main__ - Step 26923: {'lr': 0.0004658256798735693, 'samples': 5169216, 'steps': 26922, 'loss/train': 1.5124496221542358} 11/07/2021 01:00:04 - INFO - __main__ - Step 26924: {'lr': 0.000465823001582971, 'samples': 5169408, 'steps': 26923, 'loss/train': 1.7385716438293457} 11/07/2021 01:00:04 - INFO - __main__ - Step 26925: {'lr': 0.00046582032319512624, 'samples': 5169600, 'steps': 26924, 'loss/train': 1.0073723793029785} 11/07/2021 01:00:05 - INFO - __main__ - Step 26926: {'lr': 0.00046581764471003605, 'samples': 5169792, 'steps': 26925, 'loss/train': 1.363101840019226} 11/07/2021 01:00:05 - INFO - __main__ - Step 26927: {'lr': 0.0004658149661277019, 'samples': 5169984, 'steps': 26926, 'loss/train': 1.4179913997650146} 11/07/2021 01:00:05 - INFO - __main__ - Step 26928: {'lr': 0.0004658122874481248, 'samples': 5170176, 'steps': 26927, 'loss/train': 1.6892448663711548} 11/07/2021 01:00:07 - INFO - __main__ - Step 26929: {'lr': 0.000465809608671306, 'samples': 5170368, 'steps': 26928, 'loss/train': 1.7115468978881836} 11/07/2021 01:00:08 - INFO - __main__ - Step 26930: {'lr': 0.0004658069297972467, 'samples': 5170560, 'steps': 26929, 'loss/train': 2.0289013385772705} 11/07/2021 01:00:08 - INFO - __main__ - Step 26931: {'lr': 0.00046580425082594823, 'samples': 5170752, 'steps': 26930, 'loss/train': 2.270247220993042} 11/07/2021 01:00:08 - INFO - __main__ - Step 26932: {'lr': 0.00046580157175741155, 'samples': 5170944, 'steps': 26931, 'loss/train': 2.732706308364868} 11/07/2021 01:00:09 - INFO - __main__ - Step 26933: {'lr': 0.0004657988925916381, 'samples': 5171136, 'steps': 26932, 'loss/train': 1.691531777381897} 11/07/2021 01:00:09 - INFO - __main__ - Step 26934: {'lr': 0.000465796213328629, 'samples': 5171328, 'steps': 26933, 'loss/train': 1.6522505283355713} 11/07/2021 01:00:10 - INFO - __main__ - Step 26935: {'lr': 0.00046579353396838545, 'samples': 5171520, 'steps': 26934, 'loss/train': 2.0361757278442383} 11/07/2021 01:00:10 - INFO - __main__ - Step 26936: {'lr': 0.00046579085451090864, 'samples': 5171712, 'steps': 26935, 'loss/train': 1.6468123197555542} 11/07/2021 01:00:11 - INFO - __main__ - Step 26937: {'lr': 0.00046578817495619983, 'samples': 5171904, 'steps': 26936, 'loss/train': 1.5967553853988647} 11/07/2021 01:00:11 - INFO - __main__ - Step 26938: {'lr': 0.0004657854953042602, 'samples': 5172096, 'steps': 26937, 'loss/train': 2.384519577026367} 11/07/2021 01:00:12 - INFO - __main__ - Step 26939: {'lr': 0.00046578281555509094, 'samples': 5172288, 'steps': 26938, 'loss/train': 1.3857241868972778} 11/07/2021 01:00:12 - INFO - __main__ - Step 26940: {'lr': 0.00046578013570869325, 'samples': 5172480, 'steps': 26939, 'loss/train': 1.522921085357666} 11/07/2021 01:00:13 - INFO - __main__ - Step 26941: {'lr': 0.00046577745576506844, 'samples': 5172672, 'steps': 26940, 'loss/train': 2.2995705604553223} 11/07/2021 01:00:13 - INFO - __main__ - Step 26942: {'lr': 0.00046577477572421757, 'samples': 5172864, 'steps': 26941, 'loss/train': 1.364117980003357} 11/07/2021 01:00:14 - INFO - __main__ - Step 26943: {'lr': 0.0004657720955861419, 'samples': 5173056, 'steps': 26942, 'loss/train': 1.8198291063308716} 11/07/2021 01:00:14 - INFO - __main__ - Step 26944: {'lr': 0.00046576941535084274, 'samples': 5173248, 'steps': 26943, 'loss/train': 1.622612714767456} 11/07/2021 01:00:14 - INFO - __main__ - Step 26945: {'lr': 0.0004657667350183211, 'samples': 5173440, 'steps': 26944, 'loss/train': 1.3657492399215698} 11/07/2021 01:00:15 - INFO - __main__ - Step 26946: {'lr': 0.00046576405458857836, 'samples': 5173632, 'steps': 26945, 'loss/train': 1.5600732564926147} 11/07/2021 01:00:16 - INFO - __main__ - Step 26947: {'lr': 0.0004657613740616157, 'samples': 5173824, 'steps': 26946, 'loss/train': 1.2829210758209229} 11/07/2021 01:00:16 - INFO - __main__ - Step 26948: {'lr': 0.0004657586934374342, 'samples': 5174016, 'steps': 26947, 'loss/train': 1.333815336227417} 11/07/2021 01:00:17 - INFO - __main__ - Step 26949: {'lr': 0.0004657560127160352, 'samples': 5174208, 'steps': 26948, 'loss/train': 1.4350104331970215} 11/07/2021 01:00:17 - INFO - __main__ - Step 26950: {'lr': 0.00046575333189741993, 'samples': 5174400, 'steps': 26949, 'loss/train': 1.6693968772888184} 11/07/2021 01:00:18 - INFO - __main__ - Step 26951: {'lr': 0.00046575065098158945, 'samples': 5174592, 'steps': 26950, 'loss/train': 1.3539212942123413} 11/07/2021 01:00:18 - INFO - __main__ - Step 26952: {'lr': 0.0004657479699685451, 'samples': 5174784, 'steps': 26951, 'loss/train': 1.4676611423492432} 11/07/2021 01:00:19 - INFO - __main__ - Step 26953: {'lr': 0.00046574528885828803, 'samples': 5174976, 'steps': 26952, 'loss/train': 1.1450750827789307} 11/07/2021 01:00:19 - INFO - __main__ - Step 26954: {'lr': 0.0004657426076508195, 'samples': 5175168, 'steps': 26953, 'loss/train': 1.5569909811019897} 11/07/2021 01:00:19 - INFO - __main__ - Step 26955: {'lr': 0.00046573992634614064, 'samples': 5175360, 'steps': 26954, 'loss/train': 1.4932307004928589} 11/07/2021 01:00:20 - INFO - __main__ - Step 26956: {'lr': 0.00046573724494425274, 'samples': 5175552, 'steps': 26955, 'loss/train': 1.704939842224121} 11/07/2021 01:00:21 - INFO - __main__ - Step 26957: {'lr': 0.00046573456344515694, 'samples': 5175744, 'steps': 26956, 'loss/train': 2.1936867237091064} 11/07/2021 01:00:21 - INFO - __main__ - Step 26958: {'lr': 0.00046573188184885445, 'samples': 5175936, 'steps': 26957, 'loss/train': 1.7155307531356812} 11/07/2021 01:00:21 - INFO - __main__ - Step 26959: {'lr': 0.0004657292001553465, 'samples': 5176128, 'steps': 26958, 'loss/train': 0.9837502837181091} 11/07/2021 01:00:22 - INFO - __main__ - Step 26960: {'lr': 0.0004657265183646344, 'samples': 5176320, 'steps': 26959, 'loss/train': 0.9638455510139465} 11/07/2021 01:00:24 - INFO - __main__ - Step 26961: {'lr': 0.00046572383647671913, 'samples': 5176512, 'steps': 26960, 'loss/train': 1.653584361076355} 11/07/2021 01:00:24 - INFO - __main__ - Step 26962: {'lr': 0.0004657211544916021, 'samples': 5176704, 'steps': 26961, 'loss/train': 1.24321711063385} 11/07/2021 01:00:25 - INFO - __main__ - Step 26963: {'lr': 0.00046571847240928444, 'samples': 5176896, 'steps': 26962, 'loss/train': 1.4641504287719727} 11/07/2021 01:00:25 - INFO - __main__ - Step 26964: {'lr': 0.0004657157902297674, 'samples': 5177088, 'steps': 26963, 'loss/train': 1.522215723991394} 11/07/2021 01:00:25 - INFO - __main__ - Step 26965: {'lr': 0.00046571310795305213, 'samples': 5177280, 'steps': 26964, 'loss/train': 1.6530050039291382} 11/07/2021 01:00:26 - INFO - __main__ - Step 26966: {'lr': 0.0004657104255791398, 'samples': 5177472, 'steps': 26965, 'loss/train': 1.2860393524169922} 11/07/2021 01:00:26 - INFO - __main__ - Step 26967: {'lr': 0.0004657077431080317, 'samples': 5177664, 'steps': 26966, 'loss/train': 1.0747127532958984} 11/07/2021 01:00:26 - INFO - __main__ - Step 26968: {'lr': 0.00046570506053972906, 'samples': 5177856, 'steps': 26967, 'loss/train': 1.3991793394088745} 11/07/2021 01:00:27 - INFO - __main__ - Step 26969: {'lr': 0.000465702377874233, 'samples': 5178048, 'steps': 26968, 'loss/train': 1.2089049816131592} 11/07/2021 01:00:28 - INFO - __main__ - Step 26970: {'lr': 0.00046569969511154485, 'samples': 5178240, 'steps': 26969, 'loss/train': 1.4461259841918945} 11/07/2021 01:00:28 - INFO - __main__ - Step 26971: {'lr': 0.0004656970122516657, 'samples': 5178432, 'steps': 26970, 'loss/train': 1.7595458030700684} 11/07/2021 01:00:28 - INFO - __main__ - Step 26972: {'lr': 0.0004656943292945968, 'samples': 5178624, 'steps': 26971, 'loss/train': 1.6467506885528564} 11/07/2021 01:00:29 - INFO - __main__ - Step 26973: {'lr': 0.0004656916462403394, 'samples': 5178816, 'steps': 26972, 'loss/train': 1.8223869800567627} 11/07/2021 01:00:30 - INFO - __main__ - Step 26974: {'lr': 0.0004656889630888946, 'samples': 5179008, 'steps': 26973, 'loss/train': 1.9120339155197144} 11/07/2021 01:00:30 - INFO - __main__ - Step 26975: {'lr': 0.0004656862798402638, 'samples': 5179200, 'steps': 26974, 'loss/train': 1.7358126640319824} 11/07/2021 01:00:30 - INFO - __main__ - Step 26976: {'lr': 0.00046568359649444796, 'samples': 5179392, 'steps': 26975, 'loss/train': 1.541935920715332} 11/07/2021 01:00:31 - INFO - __main__ - Step 26977: {'lr': 0.0004656809130514485, 'samples': 5179584, 'steps': 26976, 'loss/train': 1.5258926153182983} 11/07/2021 01:00:31 - INFO - __main__ - Step 26978: {'lr': 0.00046567822951126646, 'samples': 5179776, 'steps': 26977, 'loss/train': 0.8509805798530579} 11/07/2021 01:00:32 - INFO - __main__ - Step 26979: {'lr': 0.00046567554587390324, 'samples': 5179968, 'steps': 26978, 'loss/train': 1.5014252662658691} 11/07/2021 01:00:33 - INFO - __main__ - Step 26980: {'lr': 0.00046567286213935994, 'samples': 5180160, 'steps': 26979, 'loss/train': 1.6096993684768677} 11/07/2021 01:00:33 - INFO - __main__ - Step 26981: {'lr': 0.00046567017830763776, 'samples': 5180352, 'steps': 26980, 'loss/train': 1.5932661294937134} 11/07/2021 01:00:33 - INFO - __main__ - Step 26982: {'lr': 0.0004656674943787379, 'samples': 5180544, 'steps': 26981, 'loss/train': 1.6337497234344482} 11/07/2021 01:00:34 - INFO - __main__ - Step 26983: {'lr': 0.0004656648103526616, 'samples': 5180736, 'steps': 26982, 'loss/train': 1.1553945541381836} 11/07/2021 01:00:34 - INFO - __main__ - Step 26984: {'lr': 0.00046566212622941005, 'samples': 5180928, 'steps': 26983, 'loss/train': 1.3024195432662964} 11/07/2021 01:00:35 - INFO - __main__ - Step 26985: {'lr': 0.00046565944200898453, 'samples': 5181120, 'steps': 26984, 'loss/train': 1.115247130393982} 11/07/2021 01:00:35 - INFO - __main__ - Step 26986: {'lr': 0.00046565675769138614, 'samples': 5181312, 'steps': 26985, 'loss/train': 1.7781128883361816} 11/07/2021 01:00:36 - INFO - __main__ - Step 26987: {'lr': 0.00046565407327661614, 'samples': 5181504, 'steps': 26986, 'loss/train': 1.3278738260269165} 11/07/2021 01:00:36 - INFO - __main__ - Step 26988: {'lr': 0.0004656513887646758, 'samples': 5181696, 'steps': 26987, 'loss/train': 1.3004204034805298} 11/07/2021 01:00:37 - INFO - __main__ - Step 26989: {'lr': 0.00046564870415556625, 'samples': 5181888, 'steps': 26988, 'loss/train': 1.5969914197921753} 11/07/2021 01:00:38 - INFO - __main__ - Step 26990: {'lr': 0.0004656460194492887, 'samples': 5182080, 'steps': 26989, 'loss/train': 1.6689649820327759} 11/07/2021 01:00:38 - INFO - __main__ - Step 26991: {'lr': 0.0004656433346458444, 'samples': 5182272, 'steps': 26990, 'loss/train': 1.328795075416565} 11/07/2021 01:00:38 - INFO - __main__ - Step 26992: {'lr': 0.0004656406497452345, 'samples': 5182464, 'steps': 26991, 'loss/train': 1.5303890705108643} 11/07/2021 01:00:39 - INFO - __main__ - Step 26993: {'lr': 0.0004656379647474603, 'samples': 5182656, 'steps': 26992, 'loss/train': 1.785988211631775} 11/07/2021 01:00:39 - INFO - __main__ - Step 26994: {'lr': 0.0004656352796525229, 'samples': 5182848, 'steps': 26993, 'loss/train': 1.4376311302185059} 11/07/2021 01:00:39 - INFO - __main__ - Step 26995: {'lr': 0.0004656325944604236, 'samples': 5183040, 'steps': 26994, 'loss/train': 1.5736900568008423} 11/07/2021 01:00:40 - INFO - __main__ - Step 26996: {'lr': 0.00046562990917116366, 'samples': 5183232, 'steps': 26995, 'loss/train': 1.916595697402954} 11/07/2021 01:00:41 - INFO - __main__ - Step 26997: {'lr': 0.0004656272237847441, 'samples': 5183424, 'steps': 26996, 'loss/train': 1.01767098903656} 11/07/2021 01:00:41 - INFO - __main__ - Step 26998: {'lr': 0.0004656245383011663, 'samples': 5183616, 'steps': 26997, 'loss/train': 1.6805261373519897} 11/07/2021 01:00:42 - INFO - __main__ - Step 26999: {'lr': 0.00046562185272043137, 'samples': 5183808, 'steps': 26998, 'loss/train': 1.7131564617156982} 11/07/2021 01:00:42 - INFO - __main__ - Step 27000: {'lr': 0.00046561916704254057, 'samples': 5184000, 'steps': 26999, 'loss/train': 1.855010986328125} 11/07/2021 01:00:43 - INFO - __main__ - Step 27001: {'lr': 0.0004656164812674951, 'samples': 5184192, 'steps': 27000, 'loss/train': 1.456249475479126} 11/07/2021 01:00:43 - INFO - __main__ - Step 27002: {'lr': 0.00046561379539529626, 'samples': 5184384, 'steps': 27001, 'loss/train': 0.16935046017169952} 11/07/2021 01:00:44 - INFO - __main__ - Step 27003: {'lr': 0.0004656111094259451, 'samples': 5184576, 'steps': 27002, 'loss/train': 1.3302737474441528} 11/07/2021 01:00:44 - INFO - __main__ - Step 27004: {'lr': 0.0004656084233594429, 'samples': 5184768, 'steps': 27003, 'loss/train': 1.6964117288589478} 11/07/2021 01:00:44 - INFO - __main__ - Step 27005: {'lr': 0.0004656057371957908, 'samples': 5184960, 'steps': 27004, 'loss/train': 1.0182842016220093} 11/07/2021 01:00:45 - INFO - __main__ - Step 27006: {'lr': 0.00046560305093499015, 'samples': 5185152, 'steps': 27005, 'loss/train': 1.4617400169372559} 11/07/2021 01:00:46 - INFO - __main__ - Step 27007: {'lr': 0.00046560036457704215, 'samples': 5185344, 'steps': 27006, 'loss/train': 1.2601394653320312} 11/07/2021 01:00:46 - INFO - __main__ - Step 27008: {'lr': 0.00046559767812194786, 'samples': 5185536, 'steps': 27007, 'loss/train': 1.5308825969696045} 11/07/2021 01:00:46 - INFO - __main__ - Step 27009: {'lr': 0.0004655949915697086, 'samples': 5185728, 'steps': 27008, 'loss/train': 1.6513859033584595} 11/07/2021 01:00:47 - INFO - __main__ - Step 27010: {'lr': 0.0004655923049203256, 'samples': 5185920, 'steps': 27009, 'loss/train': 1.715781807899475} 11/07/2021 01:00:48 - INFO - __main__ - Step 27011: {'lr': 0.00046558961817380005, 'samples': 5186112, 'steps': 27010, 'loss/train': 1.478920578956604} 11/07/2021 01:00:48 - INFO - __main__ - Step 27012: {'lr': 0.00046558693133013306, 'samples': 5186304, 'steps': 27011, 'loss/train': 1.7847849130630493} 11/07/2021 01:00:49 - INFO - __main__ - Step 27013: {'lr': 0.000465584244389326, 'samples': 5186496, 'steps': 27012, 'loss/train': 1.8046079874038696} 11/07/2021 01:00:49 - INFO - __main__ - Step 27014: {'lr': 0.00046558155735137996, 'samples': 5186688, 'steps': 27013, 'loss/train': 1.7372362613677979} 11/07/2021 01:00:49 - INFO - __main__ - Step 27015: {'lr': 0.00046557887021629623, 'samples': 5186880, 'steps': 27014, 'loss/train': 1.6177726984024048} 11/07/2021 01:00:50 - INFO - __main__ - Step 27016: {'lr': 0.000465576182984076, 'samples': 5187072, 'steps': 27015, 'loss/train': 1.7490988969802856} 11/07/2021 01:00:51 - INFO - __main__ - Step 27017: {'lr': 0.0004655734956547204, 'samples': 5187264, 'steps': 27016, 'loss/train': 1.8980671167373657} 11/07/2021 01:00:51 - INFO - __main__ - Step 27018: {'lr': 0.00046557080822823076, 'samples': 5187456, 'steps': 27017, 'loss/train': 1.4750101566314697} 11/07/2021 01:00:52 - INFO - __main__ - Step 27019: {'lr': 0.0004655681207046083, 'samples': 5187648, 'steps': 27018, 'loss/train': 1.6902590990066528} 11/07/2021 01:00:52 - INFO - __main__ - Step 27020: {'lr': 0.0004655654330838541, 'samples': 5187840, 'steps': 27019, 'loss/train': 0.45184364914894104} 11/07/2021 01:00:53 - INFO - __main__ - Step 27021: {'lr': 0.00046556274536596945, 'samples': 5188032, 'steps': 27020, 'loss/train': 1.976624846458435} 11/07/2021 01:00:53 - INFO - __main__ - Step 27022: {'lr': 0.00046556005755095555, 'samples': 5188224, 'steps': 27021, 'loss/train': 1.4296730756759644} 11/07/2021 01:00:54 - INFO - __main__ - Step 27023: {'lr': 0.00046555736963881355, 'samples': 5188416, 'steps': 27022, 'loss/train': 1.521439552307129} 11/07/2021 01:00:54 - INFO - __main__ - Step 27024: {'lr': 0.0004655546816295448, 'samples': 5188608, 'steps': 27023, 'loss/train': 1.745589256286621} 11/07/2021 01:00:54 - INFO - __main__ - Step 27025: {'lr': 0.0004655519935231505, 'samples': 5188800, 'steps': 27024, 'loss/train': 1.2226965427398682} 11/07/2021 01:00:55 - INFO - __main__ - Step 27026: {'lr': 0.00046554930531963166, 'samples': 5188992, 'steps': 27025, 'loss/train': 1.0397146940231323} 11/07/2021 01:00:56 - INFO - __main__ - Step 27027: {'lr': 0.0004655466170189897, 'samples': 5189184, 'steps': 27026, 'loss/train': 1.48588228225708} 11/07/2021 01:00:56 - INFO - __main__ - Step 27028: {'lr': 0.0004655439286212257, 'samples': 5189376, 'steps': 27027, 'loss/train': 2.311471939086914} 11/07/2021 01:00:57 - INFO - __main__ - Step 27029: {'lr': 0.00046554124012634105, 'samples': 5189568, 'steps': 27028, 'loss/train': 1.452401876449585} 11/07/2021 01:00:57 - INFO - __main__ - Step 27030: {'lr': 0.0004655385515343368, 'samples': 5189760, 'steps': 27029, 'loss/train': 2.314892530441284} 11/07/2021 01:00:57 - INFO - __main__ - Step 27031: {'lr': 0.0004655358628452142, 'samples': 5189952, 'steps': 27030, 'loss/train': 1.920503854751587} 11/07/2021 01:00:58 - INFO - __main__ - Step 27032: {'lr': 0.00046553317405897444, 'samples': 5190144, 'steps': 27031, 'loss/train': 1.8143914937973022} 11/07/2021 01:00:59 - INFO - __main__ - Step 27033: {'lr': 0.0004655304851756188, 'samples': 5190336, 'steps': 27032, 'loss/train': 1.876479983329773} 11/07/2021 01:00:59 - INFO - __main__ - Step 27034: {'lr': 0.0004655277961951484, 'samples': 5190528, 'steps': 27033, 'loss/train': 1.343095302581787} 11/07/2021 01:00:59 - INFO - __main__ - Step 27035: {'lr': 0.00046552510711756444, 'samples': 5190720, 'steps': 27034, 'loss/train': 1.8001316785812378} 11/07/2021 01:01:00 - INFO - __main__ - Step 27036: {'lr': 0.0004655224179428683, 'samples': 5190912, 'steps': 27035, 'loss/train': 1.749971628189087} 11/07/2021 01:01:01 - INFO - __main__ - Step 27037: {'lr': 0.00046551972867106106, 'samples': 5191104, 'steps': 27036, 'loss/train': 1.6416561603546143} 11/07/2021 01:01:01 - INFO - __main__ - Step 27038: {'lr': 0.00046551703930214393, 'samples': 5191296, 'steps': 27037, 'loss/train': 1.3201777935028076} 11/07/2021 01:01:02 - INFO - __main__ - Step 27039: {'lr': 0.00046551434983611823, 'samples': 5191488, 'steps': 27038, 'loss/train': 1.6423300504684448} 11/07/2021 01:01:02 - INFO - __main__ - Step 27040: {'lr': 0.00046551166027298505, 'samples': 5191680, 'steps': 27039, 'loss/train': 2.7235004901885986} 11/07/2021 01:01:02 - INFO - __main__ - Step 27041: {'lr': 0.0004655089706127456, 'samples': 5191872, 'steps': 27040, 'loss/train': 1.454121470451355} 11/07/2021 01:01:03 - INFO - __main__ - Step 27042: {'lr': 0.00046550628085540114, 'samples': 5192064, 'steps': 27041, 'loss/train': 1.6096910238265991} 11/07/2021 01:01:04 - INFO - __main__ - Step 27043: {'lr': 0.0004655035910009529, 'samples': 5192256, 'steps': 27042, 'loss/train': 1.7329154014587402} 11/07/2021 01:01:04 - INFO - __main__ - Step 27044: {'lr': 0.00046550090104940207, 'samples': 5192448, 'steps': 27043, 'loss/train': 1.8347967863082886} 11/07/2021 01:01:04 - INFO - __main__ - Step 27045: {'lr': 0.00046549821100074987, 'samples': 5192640, 'steps': 27044, 'loss/train': 1.4639747142791748} 11/07/2021 01:01:05 - INFO - __main__ - Step 27046: {'lr': 0.0004654955208549975, 'samples': 5192832, 'steps': 27045, 'loss/train': 1.7632167339324951} 11/07/2021 01:01:05 - INFO - __main__ - Step 27047: {'lr': 0.0004654928306121461, 'samples': 5193024, 'steps': 27046, 'loss/train': 1.101479172706604} 11/07/2021 01:01:07 - INFO - __main__ - Step 27048: {'lr': 0.000465490140272197, 'samples': 5193216, 'steps': 27047, 'loss/train': 1.7649903297424316} 11/07/2021 01:01:07 - INFO - __main__ - Step 27049: {'lr': 0.00046548744983515133, 'samples': 5193408, 'steps': 27048, 'loss/train': 1.3125827312469482} 11/07/2021 01:01:07 - INFO - __main__ - Step 27050: {'lr': 0.0004654847593010104, 'samples': 5193600, 'steps': 27049, 'loss/train': 1.8120845556259155} 11/07/2021 01:01:08 - INFO - __main__ - Step 27051: {'lr': 0.0004654820686697754, 'samples': 5193792, 'steps': 27050, 'loss/train': 0.23133371770381927} 11/07/2021 01:01:08 - INFO - __main__ - Step 27052: {'lr': 0.00046547937794144743, 'samples': 5193984, 'steps': 27051, 'loss/train': 1.9630906581878662} 11/07/2021 01:01:09 - INFO - __main__ - Step 27053: {'lr': 0.00046547668711602774, 'samples': 5194176, 'steps': 27052, 'loss/train': 1.4378554821014404} 11/07/2021 01:01:09 - INFO - __main__ - Step 27054: {'lr': 0.0004654739961935177, 'samples': 5194368, 'steps': 27053, 'loss/train': 1.6807277202606201} 11/07/2021 01:01:10 - INFO - __main__ - Step 27055: {'lr': 0.0004654713051739183, 'samples': 5194560, 'steps': 27054, 'loss/train': 1.4902753829956055} 11/07/2021 01:01:10 - INFO - __main__ - Step 27056: {'lr': 0.000465468614057231, 'samples': 5194752, 'steps': 27055, 'loss/train': 1.6699409484863281} 11/07/2021 01:01:10 - INFO - __main__ - Step 27057: {'lr': 0.0004654659228434567, 'samples': 5194944, 'steps': 27056, 'loss/train': 1.7816848754882812} 11/07/2021 01:01:11 - INFO - __main__ - Step 27058: {'lr': 0.00046546323153259686, 'samples': 5195136, 'steps': 27057, 'loss/train': 1.823567509651184} 11/07/2021 01:01:12 - INFO - __main__ - Step 27059: {'lr': 0.00046546054012465253, 'samples': 5195328, 'steps': 27058, 'loss/train': 1.432388186454773} 11/07/2021 01:01:12 - INFO - __main__ - Step 27060: {'lr': 0.00046545784861962516, 'samples': 5195520, 'steps': 27059, 'loss/train': 1.312885046005249} 11/07/2021 01:01:12 - INFO - __main__ - Step 27061: {'lr': 0.00046545515701751567, 'samples': 5195712, 'steps': 27060, 'loss/train': 1.9596468210220337} 11/07/2021 01:01:13 - INFO - __main__ - Step 27062: {'lr': 0.00046545246531832547, 'samples': 5195904, 'steps': 27061, 'loss/train': 1.8299909830093384} 11/07/2021 01:01:14 - INFO - __main__ - Step 27063: {'lr': 0.0004654497735220557, 'samples': 5196096, 'steps': 27062, 'loss/train': 1.0981870889663696} 11/07/2021 01:01:14 - INFO - __main__ - Step 27064: {'lr': 0.0004654470816287076, 'samples': 5196288, 'steps': 27063, 'loss/train': 1.9853992462158203} 11/07/2021 01:01:15 - INFO - __main__ - Step 27065: {'lr': 0.0004654443896382824, 'samples': 5196480, 'steps': 27064, 'loss/train': 2.3117482662200928} 11/07/2021 01:01:15 - INFO - __main__ - Step 27066: {'lr': 0.0004654416975507812, 'samples': 5196672, 'steps': 27065, 'loss/train': 1.6154303550720215} 11/07/2021 01:01:15 - INFO - __main__ - Step 27067: {'lr': 0.0004654390053662053, 'samples': 5196864, 'steps': 27066, 'loss/train': 1.7417590618133545} 11/07/2021 01:01:16 - INFO - __main__ - Step 27068: {'lr': 0.000465436313084556, 'samples': 5197056, 'steps': 27067, 'loss/train': 1.621261477470398} 11/07/2021 01:01:17 - INFO - __main__ - Step 27069: {'lr': 0.0004654336207058344, 'samples': 5197248, 'steps': 27068, 'loss/train': 1.588318943977356} 11/07/2021 01:01:17 - INFO - __main__ - Step 27070: {'lr': 0.0004654309282300416, 'samples': 5197440, 'steps': 27069, 'loss/train': 1.6834807395935059} 11/07/2021 01:01:17 - INFO - __main__ - Step 27071: {'lr': 0.00046542823565717914, 'samples': 5197632, 'steps': 27070, 'loss/train': 1.6977858543395996} 11/07/2021 01:01:18 - INFO - __main__ - Step 27072: {'lr': 0.00046542554298724793, 'samples': 5197824, 'steps': 27071, 'loss/train': 1.3400862216949463} 11/07/2021 01:01:19 - INFO - __main__ - Step 27073: {'lr': 0.00046542285022024935, 'samples': 5198016, 'steps': 27072, 'loss/train': 1.7301197052001953} 11/07/2021 01:01:19 - INFO - __main__ - Step 27074: {'lr': 0.0004654201573561845, 'samples': 5198208, 'steps': 27073, 'loss/train': 1.090531587600708} 11/07/2021 01:01:20 - INFO - __main__ - Step 27075: {'lr': 0.00046541746439505467, 'samples': 5198400, 'steps': 27074, 'loss/train': 1.289291501045227} 11/07/2021 01:01:20 - INFO - __main__ - Step 27076: {'lr': 0.00046541477133686107, 'samples': 5198592, 'steps': 27075, 'loss/train': 1.4234788417816162} 11/07/2021 01:01:20 - INFO - __main__ - Step 27077: {'lr': 0.0004654120781816049, 'samples': 5198784, 'steps': 27076, 'loss/train': 1.8242205381393433} 11/07/2021 01:01:21 - INFO - __main__ - Step 27078: {'lr': 0.00046540938492928735, 'samples': 5198976, 'steps': 27077, 'loss/train': 2.2250328063964844} 11/07/2021 01:01:22 - INFO - __main__ - Step 27079: {'lr': 0.0004654066915799097, 'samples': 5199168, 'steps': 27078, 'loss/train': 1.3047109842300415} 11/07/2021 01:01:22 - INFO - __main__ - Step 27080: {'lr': 0.000465403998133473, 'samples': 5199360, 'steps': 27079, 'loss/train': 1.8516128063201904} 11/07/2021 01:01:22 - INFO - __main__ - Step 27081: {'lr': 0.0004654013045899788, 'samples': 5199552, 'steps': 27080, 'loss/train': 1.7745814323425293} 11/07/2021 01:01:23 - INFO - __main__ - Step 27082: {'lr': 0.00046539861094942794, 'samples': 5199744, 'steps': 27081, 'loss/train': 1.7650301456451416} 11/07/2021 01:01:24 - INFO - __main__ - Step 27083: {'lr': 0.00046539591721182175, 'samples': 5199936, 'steps': 27082, 'loss/train': 1.4061222076416016} 11/07/2021 01:01:24 - INFO - __main__ - Step 27084: {'lr': 0.00046539322337716153, 'samples': 5200128, 'steps': 27083, 'loss/train': 2.0047829151153564} 11/07/2021 01:01:24 - INFO - __main__ - Step 27085: {'lr': 0.00046539052944544846, 'samples': 5200320, 'steps': 27084, 'loss/train': 1.4529653787612915} 11/07/2021 01:01:25 - INFO - __main__ - Step 27086: {'lr': 0.0004653878354166838, 'samples': 5200512, 'steps': 27085, 'loss/train': 1.6772856712341309} 11/07/2021 01:01:25 - INFO - __main__ - Step 27087: {'lr': 0.0004653851412908686, 'samples': 5200704, 'steps': 27086, 'loss/train': 1.6235452890396118} 11/07/2021 01:01:25 - INFO - __main__ - Step 27088: {'lr': 0.0004653824470680043, 'samples': 5200896, 'steps': 27087, 'loss/train': 1.2486436367034912} 11/07/2021 01:01:27 - INFO - __main__ - Step 27089: {'lr': 0.00046537975274809186, 'samples': 5201088, 'steps': 27088, 'loss/train': 1.7086710929870605} 11/07/2021 01:01:27 - INFO - __main__ - Step 27090: {'lr': 0.0004653770583311327, 'samples': 5201280, 'steps': 27089, 'loss/train': 1.279076099395752} 11/07/2021 01:01:27 - INFO - __main__ - Step 27091: {'lr': 0.00046537436381712796, 'samples': 5201472, 'steps': 27090, 'loss/train': 0.8488227725028992} 11/07/2021 01:01:28 - INFO - __main__ - Step 27092: {'lr': 0.00046537166920607886, 'samples': 5201664, 'steps': 27091, 'loss/train': 2.424959897994995} 11/07/2021 01:01:28 - INFO - __main__ - Step 27093: {'lr': 0.00046536897449798656, 'samples': 5201856, 'steps': 27092, 'loss/train': 1.0911108255386353} 11/07/2021 01:01:29 - INFO - __main__ - Step 27094: {'lr': 0.00046536627969285236, 'samples': 5202048, 'steps': 27093, 'loss/train': 1.772316813468933} 11/07/2021 01:01:29 - INFO - __main__ - Step 27095: {'lr': 0.0004653635847906774, 'samples': 5202240, 'steps': 27094, 'loss/train': 0.8457456231117249} 11/07/2021 01:01:30 - INFO - __main__ - Step 27096: {'lr': 0.000465360889791463, 'samples': 5202432, 'steps': 27095, 'loss/train': 1.7497080564498901} 11/07/2021 01:01:30 - INFO - __main__ - Step 27097: {'lr': 0.0004653581946952103, 'samples': 5202624, 'steps': 27096, 'loss/train': 1.5715991258621216} 11/07/2021 01:01:30 - INFO - __main__ - Step 27098: {'lr': 0.0004653554995019205, 'samples': 5202816, 'steps': 27097, 'loss/train': 1.6218191385269165} 11/07/2021 01:01:31 - INFO - __main__ - Step 27099: {'lr': 0.0004653528042115948, 'samples': 5203008, 'steps': 27098, 'loss/train': 1.8767164945602417} 11/07/2021 01:01:32 - INFO - __main__ - Step 27100: {'lr': 0.0004653501088242345, 'samples': 5203200, 'steps': 27099, 'loss/train': 1.2835679054260254} 11/07/2021 01:01:32 - INFO - __main__ - Step 27101: {'lr': 0.0004653474133398408, 'samples': 5203392, 'steps': 27100, 'loss/train': 1.3385190963745117} 11/07/2021 01:01:32 - INFO - __main__ - Step 27102: {'lr': 0.00046534471775841474, 'samples': 5203584, 'steps': 27101, 'loss/train': 1.365417718887329} 11/07/2021 01:01:33 - INFO - __main__ - Step 27103: {'lr': 0.0004653420220799578, 'samples': 5203776, 'steps': 27102, 'loss/train': 1.6177514791488647} 11/07/2021 01:01:34 - INFO - __main__ - Step 27104: {'lr': 0.000465339326304471, 'samples': 5203968, 'steps': 27103, 'loss/train': 1.4537159204483032} 11/07/2021 01:01:34 - INFO - __main__ - Step 27105: {'lr': 0.0004653366304319556, 'samples': 5204160, 'steps': 27104, 'loss/train': 1.086722493171692} 11/07/2021 01:01:35 - INFO - __main__ - Step 27106: {'lr': 0.0004653339344624129, 'samples': 5204352, 'steps': 27105, 'loss/train': 1.5146363973617554} 11/07/2021 01:01:35 - INFO - __main__ - Step 27107: {'lr': 0.00046533123839584406, 'samples': 5204544, 'steps': 27106, 'loss/train': 1.4945223331451416} 11/07/2021 01:01:35 - INFO - __main__ - Step 27108: {'lr': 0.0004653285422322503, 'samples': 5204736, 'steps': 27107, 'loss/train': 1.6637580394744873} 11/07/2021 01:01:36 - INFO - __main__ - Step 27109: {'lr': 0.00046532584597163275, 'samples': 5204928, 'steps': 27108, 'loss/train': 1.4501129388809204} 11/07/2021 01:01:37 - INFO - __main__ - Step 27110: {'lr': 0.0004653231496139927, 'samples': 5205120, 'steps': 27109, 'loss/train': 1.5620125532150269} 11/07/2021 01:01:37 - INFO - __main__ - Step 27111: {'lr': 0.0004653204531593315, 'samples': 5205312, 'steps': 27110, 'loss/train': 1.3244616985321045} 11/07/2021 01:01:38 - INFO - __main__ - Step 27112: {'lr': 0.0004653177566076501, 'samples': 5205504, 'steps': 27111, 'loss/train': 1.5549123287200928} 11/07/2021 01:01:38 - INFO - __main__ - Step 27113: {'lr': 0.0004653150599589498, 'samples': 5205696, 'steps': 27112, 'loss/train': 1.4433401823043823} 11/07/2021 01:01:38 - INFO - __main__ - Step 27114: {'lr': 0.0004653123632132319, 'samples': 5205888, 'steps': 27113, 'loss/train': 1.7523740530014038} 11/07/2021 01:01:39 - INFO - __main__ - Step 27115: {'lr': 0.0004653096663704976, 'samples': 5206080, 'steps': 27114, 'loss/train': 1.6249898672103882} 11/07/2021 01:01:40 - INFO - __main__ - Step 27116: {'lr': 0.0004653069694307481, 'samples': 5206272, 'steps': 27115, 'loss/train': 1.8098299503326416} 11/07/2021 01:01:40 - INFO - __main__ - Step 27117: {'lr': 0.00046530427239398453, 'samples': 5206464, 'steps': 27116, 'loss/train': 0.9234550595283508} 11/07/2021 01:01:40 - INFO - __main__ - Step 27118: {'lr': 0.0004653015752602082, 'samples': 5206656, 'steps': 27117, 'loss/train': 1.5943477153778076} 11/07/2021 01:01:41 - INFO - __main__ - Step 27119: {'lr': 0.0004652988780294204, 'samples': 5206848, 'steps': 27118, 'loss/train': 1.3826004266738892} 11/07/2021 01:01:42 - INFO - __main__ - Step 27120: {'lr': 0.00046529618070162215, 'samples': 5207040, 'steps': 27119, 'loss/train': 1.474747896194458} 11/07/2021 01:01:42 - INFO - __main__ - Step 27121: {'lr': 0.00046529348327681476, 'samples': 5207232, 'steps': 27120, 'loss/train': 1.324153184890747} 11/07/2021 01:01:42 - INFO - __main__ - Step 27122: {'lr': 0.0004652907857549995, 'samples': 5207424, 'steps': 27121, 'loss/train': 1.472080945968628} 11/07/2021 01:01:43 - INFO - __main__ - Step 27123: {'lr': 0.0004652880881361775, 'samples': 5207616, 'steps': 27122, 'loss/train': 1.215911626815796} 11/07/2021 01:01:43 - INFO - __main__ - Step 27124: {'lr': 0.00046528539042035, 'samples': 5207808, 'steps': 27123, 'loss/train': 1.5094414949417114} 11/07/2021 01:01:44 - INFO - __main__ - Step 27125: {'lr': 0.0004652826926075183, 'samples': 5208000, 'steps': 27124, 'loss/train': 1.5033190250396729} 11/07/2021 01:01:45 - INFO - __main__ - Step 27126: {'lr': 0.00046527999469768346, 'samples': 5208192, 'steps': 27125, 'loss/train': 1.8493744134902954} 11/07/2021 01:01:45 - INFO - __main__ - Step 27127: {'lr': 0.0004652772966908468, 'samples': 5208384, 'steps': 27126, 'loss/train': 1.2037136554718018} 11/07/2021 01:01:45 - INFO - __main__ - Step 27128: {'lr': 0.0004652745985870095, 'samples': 5208576, 'steps': 27127, 'loss/train': 1.5834203958511353} 11/07/2021 01:01:46 - INFO - __main__ - Step 27129: {'lr': 0.0004652719003861728, 'samples': 5208768, 'steps': 27128, 'loss/train': 1.11555016040802} 11/07/2021 01:01:46 - INFO - __main__ - Step 27130: {'lr': 0.0004652692020883379, 'samples': 5208960, 'steps': 27129, 'loss/train': 1.6084431409835815} 11/07/2021 01:01:47 - INFO - __main__ - Step 27131: {'lr': 0.00046526650369350605, 'samples': 5209152, 'steps': 27130, 'loss/train': 1.3682796955108643} 11/07/2021 01:01:47 - INFO - __main__ - Step 27132: {'lr': 0.0004652638052016784, 'samples': 5209344, 'steps': 27131, 'loss/train': 1.5609694719314575} 11/07/2021 01:01:48 - INFO - __main__ - Step 27133: {'lr': 0.00046526110661285615, 'samples': 5209536, 'steps': 27132, 'loss/train': 1.7686269283294678} 11/07/2021 01:01:48 - INFO - __main__ - Step 27134: {'lr': 0.00046525840792704064, 'samples': 5209728, 'steps': 27133, 'loss/train': 1.5374583005905151} 11/07/2021 01:01:49 - INFO - __main__ - Step 27135: {'lr': 0.000465255709144233, 'samples': 5209920, 'steps': 27134, 'loss/train': 1.4260269403457642} 11/07/2021 01:01:49 - INFO - __main__ - Step 27136: {'lr': 0.00046525301026443443, 'samples': 5210112, 'steps': 27135, 'loss/train': 1.6430789232254028} 11/07/2021 01:01:50 - INFO - __main__ - Step 27137: {'lr': 0.0004652503112876463, 'samples': 5210304, 'steps': 27136, 'loss/train': 1.476821780204773} 11/07/2021 01:01:50 - INFO - __main__ - Step 27138: {'lr': 0.00046524761221386956, 'samples': 5210496, 'steps': 27137, 'loss/train': 1.0227559804916382} 11/07/2021 01:01:51 - INFO - __main__ - Step 27139: {'lr': 0.0004652449130431056, 'samples': 5210688, 'steps': 27138, 'loss/train': 1.1180068254470825} 11/07/2021 01:01:51 - INFO - __main__ - Step 27140: {'lr': 0.00046524221377535564, 'samples': 5210880, 'steps': 27139, 'loss/train': 1.7611134052276611} 11/07/2021 01:01:52 - INFO - __main__ - Step 27141: {'lr': 0.00046523951441062087, 'samples': 5211072, 'steps': 27140, 'loss/train': 1.591782569885254} 11/07/2021 01:01:52 - INFO - __main__ - Step 27142: {'lr': 0.0004652368149489024, 'samples': 5211264, 'steps': 27141, 'loss/train': 1.1209391355514526} 11/07/2021 01:01:53 - INFO - __main__ - Step 27143: {'lr': 0.0004652341153902016, 'samples': 5211456, 'steps': 27142, 'loss/train': 1.741696834564209} 11/07/2021 01:01:53 - INFO - __main__ - Step 27144: {'lr': 0.00046523141573451965, 'samples': 5211648, 'steps': 27143, 'loss/train': 1.4735972881317139} 11/07/2021 01:01:53 - INFO - __main__ - Step 27145: {'lr': 0.0004652287159818577, 'samples': 5211840, 'steps': 27144, 'loss/train': 1.5447921752929688} 11/07/2021 01:01:54 - INFO - __main__ - Step 27146: {'lr': 0.00046522601613221704, 'samples': 5212032, 'steps': 27145, 'loss/train': 1.7040530443191528} 11/07/2021 01:01:55 - INFO - __main__ - Step 27147: {'lr': 0.0004652233161855989, 'samples': 5212224, 'steps': 27146, 'loss/train': 1.7633503675460815} 11/07/2021 01:01:55 - INFO - __main__ - Step 27148: {'lr': 0.0004652206161420044, 'samples': 5212416, 'steps': 27147, 'loss/train': 1.4151040315628052} 11/07/2021 01:01:55 - INFO - __main__ - Step 27149: {'lr': 0.00046521791600143483, 'samples': 5212608, 'steps': 27148, 'loss/train': 1.2859762907028198} 11/07/2021 01:01:56 - INFO - __main__ - Step 27150: {'lr': 0.00046521521576389134, 'samples': 5212800, 'steps': 27149, 'loss/train': 1.3248544931411743} 11/07/2021 01:01:57 - INFO - __main__ - Step 27151: {'lr': 0.00046521251542937524, 'samples': 5212992, 'steps': 27150, 'loss/train': 1.7026891708374023} 11/07/2021 01:01:57 - INFO - __main__ - Step 27152: {'lr': 0.0004652098149978877, 'samples': 5213184, 'steps': 27151, 'loss/train': 1.7094151973724365} 11/07/2021 01:01:57 - INFO - __main__ - Step 27153: {'lr': 0.00046520711446943, 'samples': 5213376, 'steps': 27152, 'loss/train': 1.3547008037567139} 11/07/2021 01:01:58 - INFO - __main__ - Step 27154: {'lr': 0.0004652044138440032, 'samples': 5213568, 'steps': 27153, 'loss/train': 1.6012340784072876} 11/07/2021 01:01:58 - INFO - __main__ - Step 27155: {'lr': 0.00046520171312160863, 'samples': 5213760, 'steps': 27154, 'loss/train': 1.2679134607315063} 11/07/2021 01:01:59 - INFO - __main__ - Step 27156: {'lr': 0.00046519901230224756, 'samples': 5213952, 'steps': 27155, 'loss/train': 1.0411334037780762} 11/07/2021 01:02:00 - INFO - __main__ - Step 27157: {'lr': 0.000465196311385921, 'samples': 5214144, 'steps': 27156, 'loss/train': 0.4594906270503998} 11/07/2021 01:02:00 - INFO - __main__ - Step 27158: {'lr': 0.0004651936103726304, 'samples': 5214336, 'steps': 27157, 'loss/train': 1.5192071199417114} 11/07/2021 01:02:00 - INFO - __main__ - Step 27159: {'lr': 0.0004651909092623769, 'samples': 5214528, 'steps': 27158, 'loss/train': 1.6818640232086182} 11/07/2021 01:02:01 - INFO - __main__ - Step 27160: {'lr': 0.00046518820805516165, 'samples': 5214720, 'steps': 27159, 'loss/train': 1.5658918619155884} 11/07/2021 01:02:02 - INFO - __main__ - Step 27161: {'lr': 0.0004651855067509859, 'samples': 5214912, 'steps': 27160, 'loss/train': 1.4818499088287354} 11/07/2021 01:02:02 - INFO - __main__ - Step 27162: {'lr': 0.0004651828053498509, 'samples': 5215104, 'steps': 27161, 'loss/train': 1.4773788452148438} 11/07/2021 01:02:02 - INFO - __main__ - Step 27163: {'lr': 0.0004651801038517579, 'samples': 5215296, 'steps': 27162, 'loss/train': 1.1611590385437012} 11/07/2021 01:02:03 - INFO - __main__ - Step 27164: {'lr': 0.000465177402256708, 'samples': 5215488, 'steps': 27163, 'loss/train': 1.544758677482605} 11/07/2021 01:02:03 - INFO - __main__ - Step 27165: {'lr': 0.00046517470056470244, 'samples': 5215680, 'steps': 27164, 'loss/train': 1.5025895833969116} 11/07/2021 01:02:05 - INFO - __main__ - Step 27166: {'lr': 0.00046517199877574257, 'samples': 5215872, 'steps': 27165, 'loss/train': 1.3345242738723755} 11/07/2021 01:02:05 - INFO - __main__ - Step 27167: {'lr': 0.0004651692968898295, 'samples': 5216064, 'steps': 27166, 'loss/train': 1.942888617515564} 11/07/2021 01:02:05 - INFO - __main__ - Step 27168: {'lr': 0.00046516659490696444, 'samples': 5216256, 'steps': 27167, 'loss/train': 1.3459722995758057} 11/07/2021 01:02:06 - INFO - __main__ - Step 27169: {'lr': 0.0004651638928271487, 'samples': 5216448, 'steps': 27168, 'loss/train': 1.6680691242218018} 11/07/2021 01:02:06 - INFO - __main__ - Step 27170: {'lr': 0.00046516119065038335, 'samples': 5216640, 'steps': 27169, 'loss/train': 0.6846105456352234} 11/07/2021 01:02:07 - INFO - __main__ - Step 27171: {'lr': 0.00046515848837666975, 'samples': 5216832, 'steps': 27170, 'loss/train': 0.5701700448989868} 11/07/2021 01:02:08 - INFO - __main__ - Step 27172: {'lr': 0.00046515578600600895, 'samples': 5217024, 'steps': 27171, 'loss/train': 1.4467335939407349} 11/07/2021 01:02:08 - INFO - __main__ - Step 27173: {'lr': 0.0004651530835384024, 'samples': 5217216, 'steps': 27172, 'loss/train': 1.3392056226730347} 11/07/2021 01:02:08 - INFO - __main__ - Step 27174: {'lr': 0.0004651503809738511, 'samples': 5217408, 'steps': 27173, 'loss/train': 1.2066261768341064} 11/07/2021 01:02:09 - INFO - __main__ - Step 27175: {'lr': 0.0004651476783123564, 'samples': 5217600, 'steps': 27174, 'loss/train': 1.503208875656128} 11/07/2021 01:02:09 - INFO - __main__ - Step 27176: {'lr': 0.00046514497555391946, 'samples': 5217792, 'steps': 27175, 'loss/train': 1.1453332901000977} 11/07/2021 01:02:10 - INFO - __main__ - Step 27177: {'lr': 0.0004651422726985415, 'samples': 5217984, 'steps': 27176, 'loss/train': 1.8215045928955078} 11/07/2021 01:02:10 - INFO - __main__ - Step 27178: {'lr': 0.00046513956974622377, 'samples': 5218176, 'steps': 27177, 'loss/train': 2.074622631072998} 11/07/2021 01:02:11 - INFO - __main__ - Step 27179: {'lr': 0.00046513686669696756, 'samples': 5218368, 'steps': 27178, 'loss/train': 1.3883646726608276} 11/07/2021 01:02:11 - INFO - __main__ - Step 27180: {'lr': 0.00046513416355077386, 'samples': 5218560, 'steps': 27179, 'loss/train': 1.4552028179168701} 11/07/2021 01:02:12 - INFO - __main__ - Step 27181: {'lr': 0.0004651314603076441, 'samples': 5218752, 'steps': 27180, 'loss/train': 1.4661346673965454} 11/07/2021 01:02:13 - INFO - __main__ - Step 27182: {'lr': 0.00046512875696757937, 'samples': 5218944, 'steps': 27181, 'loss/train': 1.8287869691848755} 11/07/2021 01:02:13 - INFO - __main__ - Step 27183: {'lr': 0.00046512605353058096, 'samples': 5219136, 'steps': 27182, 'loss/train': 0.7925441265106201} 11/07/2021 01:02:13 - INFO - __main__ - Step 27184: {'lr': 0.00046512334999665006, 'samples': 5219328, 'steps': 27183, 'loss/train': 1.6642957925796509} 11/07/2021 01:02:14 - INFO - __main__ - Step 27185: {'lr': 0.000465120646365788, 'samples': 5219520, 'steps': 27184, 'loss/train': 1.2242282629013062} 11/07/2021 01:02:14 - INFO - __main__ - Step 27186: {'lr': 0.0004651179426379958, 'samples': 5219712, 'steps': 27185, 'loss/train': 1.9081281423568726} 11/07/2021 01:02:15 - INFO - __main__ - Step 27187: {'lr': 0.00046511523881327476, 'samples': 5219904, 'steps': 27186, 'loss/train': 1.7007331848144531} 11/07/2021 01:02:16 - INFO - __main__ - Step 27188: {'lr': 0.00046511253489162616, 'samples': 5220096, 'steps': 27187, 'loss/train': 1.0498052835464478} 11/07/2021 01:02:16 - INFO - __main__ - Step 27189: {'lr': 0.00046510983087305114, 'samples': 5220288, 'steps': 27188, 'loss/train': 1.450149655342102} 11/07/2021 01:02:16 - INFO - __main__ - Step 27190: {'lr': 0.00046510712675755094, 'samples': 5220480, 'steps': 27189, 'loss/train': 1.5899168252944946} 11/07/2021 01:02:17 - INFO - __main__ - Step 27191: {'lr': 0.00046510442254512686, 'samples': 5220672, 'steps': 27190, 'loss/train': 1.399792194366455} 11/07/2021 01:02:18 - INFO - __main__ - Step 27192: {'lr': 0.00046510171823578, 'samples': 5220864, 'steps': 27191, 'loss/train': 1.7889716625213623} 11/07/2021 01:02:18 - INFO - __main__ - Step 27193: {'lr': 0.0004650990138295116, 'samples': 5221056, 'steps': 27192, 'loss/train': 1.8554221391677856} 11/07/2021 01:02:18 - INFO - __main__ - Step 27194: {'lr': 0.00046509630932632293, 'samples': 5221248, 'steps': 27193, 'loss/train': 1.738891839981079} 11/07/2021 01:02:19 - INFO - __main__ - Step 27195: {'lr': 0.0004650936047262152, 'samples': 5221440, 'steps': 27194, 'loss/train': 1.9472607374191284} 11/07/2021 01:02:19 - INFO - __main__ - Step 27196: {'lr': 0.0004650909000291895, 'samples': 5221632, 'steps': 27195, 'loss/train': 1.299528956413269} 11/07/2021 01:02:19 - INFO - __main__ - Step 27197: {'lr': 0.00046508819523524724, 'samples': 5221824, 'steps': 27196, 'loss/train': 1.4237334728240967} 11/07/2021 01:02:21 - INFO - __main__ - Step 27198: {'lr': 0.0004650854903443896, 'samples': 5222016, 'steps': 27197, 'loss/train': 2.340219497680664} 11/07/2021 01:02:21 - INFO - __main__ - Step 27199: {'lr': 0.00046508278535661775, 'samples': 5222208, 'steps': 27198, 'loss/train': 1.6890065670013428} 11/07/2021 01:02:22 - INFO - __main__ - Step 27200: {'lr': 0.00046508008027193286, 'samples': 5222400, 'steps': 27199, 'loss/train': 1.5595864057540894} 11/07/2021 01:02:22 - INFO - __main__ - Step 27201: {'lr': 0.0004650773750903363, 'samples': 5222592, 'steps': 27200, 'loss/train': 2.2038509845733643} 11/07/2021 01:02:22 - INFO - __main__ - Step 27202: {'lr': 0.0004650746698118291, 'samples': 5222784, 'steps': 27201, 'loss/train': 1.0606508255004883} 11/07/2021 01:02:23 - INFO - __main__ - Step 27203: {'lr': 0.0004650719644364126, 'samples': 5222976, 'steps': 27202, 'loss/train': 1.355365514755249} 11/07/2021 01:02:24 - INFO - __main__ - Step 27204: {'lr': 0.000465069258964088, 'samples': 5223168, 'steps': 27203, 'loss/train': 1.6440300941467285} 11/07/2021 01:02:24 - INFO - __main__ - Step 27205: {'lr': 0.0004650665533948565, 'samples': 5223360, 'steps': 27204, 'loss/train': 1.5045417547225952} 11/07/2021 01:02:24 - INFO - __main__ - Step 27206: {'lr': 0.00046506384772871935, 'samples': 5223552, 'steps': 27205, 'loss/train': 1.5792165994644165} 11/07/2021 01:02:25 - INFO - __main__ - Step 27207: {'lr': 0.0004650611419656777, 'samples': 5223744, 'steps': 27206, 'loss/train': 1.613562822341919} 11/07/2021 01:02:26 - INFO - __main__ - Step 27208: {'lr': 0.0004650584361057328, 'samples': 5223936, 'steps': 27207, 'loss/train': 1.467961311340332} 11/07/2021 01:02:26 - INFO - __main__ - Step 27209: {'lr': 0.00046505573014888604, 'samples': 5224128, 'steps': 27208, 'loss/train': 1.6682863235473633} 11/07/2021 01:02:26 - INFO - __main__ - Step 27210: {'lr': 0.0004650530240951383, 'samples': 5224320, 'steps': 27209, 'loss/train': 1.2404732704162598} 11/07/2021 01:02:27 - INFO - __main__ - Step 27211: {'lr': 0.0004650503179444911, 'samples': 5224512, 'steps': 27210, 'loss/train': 1.53238046169281} 11/07/2021 01:02:27 - INFO - __main__ - Step 27212: {'lr': 0.00046504761169694555, 'samples': 5224704, 'steps': 27211, 'loss/train': 1.7771190404891968} 11/07/2021 01:02:28 - INFO - __main__ - Step 27213: {'lr': 0.0004650449053525028, 'samples': 5224896, 'steps': 27212, 'loss/train': 1.3833065032958984} 11/07/2021 01:02:29 - INFO - __main__ - Step 27214: {'lr': 0.00046504219891116416, 'samples': 5225088, 'steps': 27213, 'loss/train': 1.3884916305541992} 11/07/2021 01:02:29 - INFO - __main__ - Step 27215: {'lr': 0.0004650394923729309, 'samples': 5225280, 'steps': 27214, 'loss/train': 1.773545742034912} 11/07/2021 01:02:29 - INFO - __main__ - Step 27216: {'lr': 0.00046503678573780403, 'samples': 5225472, 'steps': 27215, 'loss/train': 1.5457409620285034} 11/07/2021 01:02:30 - INFO - __main__ - Step 27217: {'lr': 0.000465034079005785, 'samples': 5225664, 'steps': 27216, 'loss/train': 1.5216282606124878} 11/07/2021 01:02:31 - INFO - __main__ - Step 27218: {'lr': 0.00046503137217687485, 'samples': 5225856, 'steps': 27217, 'loss/train': 1.4964834451675415} 11/07/2021 01:02:31 - INFO - __main__ - Step 27219: {'lr': 0.0004650286652510749, 'samples': 5226048, 'steps': 27218, 'loss/train': 1.5795546770095825} 11/07/2021 01:02:31 - INFO - __main__ - Step 27220: {'lr': 0.0004650259582283864, 'samples': 5226240, 'steps': 27219, 'loss/train': 1.4796018600463867} 11/07/2021 01:02:32 - INFO - __main__ - Step 27221: {'lr': 0.0004650232511088105, 'samples': 5226432, 'steps': 27220, 'loss/train': 1.6209487915039062} 11/07/2021 01:02:32 - INFO - __main__ - Step 27222: {'lr': 0.00046502054389234844, 'samples': 5226624, 'steps': 27221, 'loss/train': 0.906596302986145} 11/07/2021 01:02:33 - INFO - __main__ - Step 27223: {'lr': 0.0004650178365790014, 'samples': 5226816, 'steps': 27222, 'loss/train': 1.618649959564209} 11/07/2021 01:02:33 - INFO - __main__ - Step 27224: {'lr': 0.0004650151291687707, 'samples': 5227008, 'steps': 27223, 'loss/train': 1.4684704542160034} 11/07/2021 01:02:34 - INFO - __main__ - Step 27225: {'lr': 0.00046501242166165747, 'samples': 5227200, 'steps': 27224, 'loss/train': 1.6474354267120361} 11/07/2021 01:02:34 - INFO - __main__ - Step 27226: {'lr': 0.000465009714057663, 'samples': 5227392, 'steps': 27225, 'loss/train': 1.7976675033569336} 11/07/2021 01:02:34 - INFO - __main__ - Step 27227: {'lr': 0.00046500700635678844, 'samples': 5227584, 'steps': 27226, 'loss/train': 1.836778163909912} 11/07/2021 01:02:36 - INFO - __main__ - Step 27228: {'lr': 0.000465004298559035, 'samples': 5227776, 'steps': 27227, 'loss/train': 1.5999199151992798} 11/07/2021 01:02:36 - INFO - __main__ - Step 27229: {'lr': 0.00046500159066440404, 'samples': 5227968, 'steps': 27228, 'loss/train': 1.2133694887161255} 11/07/2021 01:02:36 - INFO - __main__ - Step 27230: {'lr': 0.0004649988826728966, 'samples': 5228160, 'steps': 27229, 'loss/train': 1.2805842161178589} 11/07/2021 01:02:37 - INFO - __main__ - Step 27231: {'lr': 0.000464996174584514, 'samples': 5228352, 'steps': 27230, 'loss/train': 1.5810779333114624} 11/07/2021 01:02:37 - INFO - __main__ - Step 27232: {'lr': 0.00046499346639925746, 'samples': 5228544, 'steps': 27231, 'loss/train': 1.931288242340088} 11/07/2021 01:02:38 - INFO - __main__ - Step 27233: {'lr': 0.0004649907581171282, 'samples': 5228736, 'steps': 27232, 'loss/train': 1.3784176111221313} 11/07/2021 01:02:38 - INFO - __main__ - Step 27234: {'lr': 0.00046498804973812735, 'samples': 5228928, 'steps': 27233, 'loss/train': 5.008947372436523} 11/07/2021 01:02:39 - INFO - __main__ - Step 27235: {'lr': 0.00046498534126225625, 'samples': 5229120, 'steps': 27234, 'loss/train': 1.4662317037582397} 11/07/2021 01:02:39 - INFO - __main__ - Step 27236: {'lr': 0.0004649826326895161, 'samples': 5229312, 'steps': 27235, 'loss/train': 1.7126282453536987} 11/07/2021 01:02:39 - INFO - __main__ - Step 27237: {'lr': 0.0004649799240199081, 'samples': 5229504, 'steps': 27236, 'loss/train': 1.5118376016616821} 11/07/2021 01:02:41 - INFO - __main__ - Step 27238: {'lr': 0.0004649772152534334, 'samples': 5229696, 'steps': 27237, 'loss/train': 2.3526248931884766} 11/07/2021 01:02:41 - INFO - __main__ - Step 27239: {'lr': 0.0004649745063900933, 'samples': 5229888, 'steps': 27238, 'loss/train': 1.6515519618988037} 11/07/2021 01:02:41 - INFO - __main__ - Step 27240: {'lr': 0.000464971797429889, 'samples': 5230080, 'steps': 27239, 'loss/train': 1.2007354497909546} 11/07/2021 01:02:42 - INFO - __main__ - Step 27241: {'lr': 0.00046496908837282173, 'samples': 5230272, 'steps': 27240, 'loss/train': 1.9977402687072754} 11/07/2021 01:02:42 - INFO - __main__ - Step 27242: {'lr': 0.00046496637921889276, 'samples': 5230464, 'steps': 27241, 'loss/train': 1.0261619091033936} 11/07/2021 01:02:42 - INFO - __main__ - Step 27243: {'lr': 0.0004649636699681031, 'samples': 5230656, 'steps': 27242, 'loss/train': 1.638994812965393} 11/07/2021 01:02:43 - INFO - __main__ - Step 27244: {'lr': 0.00046496096062045427, 'samples': 5230848, 'steps': 27243, 'loss/train': 1.7429476976394653} 11/07/2021 01:02:44 - INFO - __main__ - Step 27245: {'lr': 0.00046495825117594735, 'samples': 5231040, 'steps': 27244, 'loss/train': 1.555149793624878} 11/07/2021 01:02:44 - INFO - __main__ - Step 27246: {'lr': 0.0004649555416345835, 'samples': 5231232, 'steps': 27245, 'loss/train': 1.261429786682129} 11/07/2021 01:02:44 - INFO - __main__ - Step 27247: {'lr': 0.0004649528319963641, 'samples': 5231424, 'steps': 27246, 'loss/train': 1.4237511157989502} 11/07/2021 01:02:45 - INFO - __main__ - Step 27248: {'lr': 0.0004649501222612901, 'samples': 5231616, 'steps': 27247, 'loss/train': 2.1382250785827637} 11/07/2021 01:02:46 - INFO - __main__ - Step 27249: {'lr': 0.000464947412429363, 'samples': 5231808, 'steps': 27248, 'loss/train': 1.8834290504455566} 11/07/2021 01:02:46 - INFO - __main__ - Step 27250: {'lr': 0.000464944702500584, 'samples': 5232000, 'steps': 27249, 'loss/train': 1.60459303855896} 11/07/2021 01:02:47 - INFO - __main__ - Step 27251: {'lr': 0.0004649419924749541, 'samples': 5232192, 'steps': 27250, 'loss/train': 1.6707024574279785} 11/07/2021 01:02:47 - INFO - __main__ - Step 27252: {'lr': 0.0004649392823524746, 'samples': 5232384, 'steps': 27251, 'loss/train': 2.083890676498413} 11/07/2021 01:02:47 - INFO - __main__ - Step 27253: {'lr': 0.0004649365721331469, 'samples': 5232576, 'steps': 27252, 'loss/train': 1.1771005392074585} 11/07/2021 01:02:48 - INFO - __main__ - Step 27254: {'lr': 0.00046493386181697206, 'samples': 5232768, 'steps': 27253, 'loss/train': 1.486518144607544} 11/07/2021 01:02:49 - INFO - __main__ - Step 27255: {'lr': 0.00046493115140395136, 'samples': 5232960, 'steps': 27254, 'loss/train': 1.6178797483444214} 11/07/2021 01:02:49 - INFO - __main__ - Step 27256: {'lr': 0.000464928440894086, 'samples': 5233152, 'steps': 27255, 'loss/train': 1.117812156677246} 11/07/2021 01:02:49 - INFO - __main__ - Step 27257: {'lr': 0.00046492573028737716, 'samples': 5233344, 'steps': 27256, 'loss/train': 1.468658208847046} 11/07/2021 01:02:50 - INFO - __main__ - Step 27258: {'lr': 0.0004649230195838261, 'samples': 5233536, 'steps': 27257, 'loss/train': 1.4425030946731567} 11/07/2021 01:02:51 - INFO - __main__ - Step 27259: {'lr': 0.00046492030878343406, 'samples': 5233728, 'steps': 27258, 'loss/train': 1.1143912076950073} 11/07/2021 01:02:51 - INFO - __main__ - Step 27260: {'lr': 0.00046491759788620227, 'samples': 5233920, 'steps': 27259, 'loss/train': 1.2509312629699707} 11/07/2021 01:02:51 - INFO - __main__ - Step 27261: {'lr': 0.0004649148868921319, 'samples': 5234112, 'steps': 27260, 'loss/train': 1.6943997144699097} 11/07/2021 01:02:52 - INFO - __main__ - Step 27262: {'lr': 0.00046491217580122427, 'samples': 5234304, 'steps': 27261, 'loss/train': 1.703986644744873} 11/07/2021 01:02:52 - INFO - __main__ - Step 27263: {'lr': 0.00046490946461348045, 'samples': 5234496, 'steps': 27262, 'loss/train': 1.753912329673767} 11/07/2021 01:02:53 - INFO - __main__ - Step 27264: {'lr': 0.00046490675332890177, 'samples': 5234688, 'steps': 27263, 'loss/train': 1.8319761753082275} 11/07/2021 01:02:54 - INFO - __main__ - Step 27265: {'lr': 0.00046490404194748935, 'samples': 5234880, 'steps': 27264, 'loss/train': 1.6805005073547363} 11/07/2021 01:02:54 - INFO - __main__ - Step 27266: {'lr': 0.00046490133046924457, 'samples': 5235072, 'steps': 27265, 'loss/train': 1.169919729232788} 11/07/2021 01:02:54 - INFO - __main__ - Step 27267: {'lr': 0.0004648986188941685, 'samples': 5235264, 'steps': 27266, 'loss/train': 1.7877448797225952} 11/07/2021 01:02:55 - INFO - __main__ - Step 27268: {'lr': 0.0004648959072222625, 'samples': 5235456, 'steps': 27267, 'loss/train': 2.4874370098114014} 11/07/2021 01:02:56 - INFO - __main__ - Step 27269: {'lr': 0.0004648931954535277, 'samples': 5235648, 'steps': 27268, 'loss/train': 1.5646990537643433} 11/07/2021 01:02:56 - INFO - __main__ - Step 27270: {'lr': 0.0004648904835879654, 'samples': 5235840, 'steps': 27269, 'loss/train': 1.285988450050354} 11/07/2021 01:02:56 - INFO - __main__ - Step 27271: {'lr': 0.0004648877716255766, 'samples': 5236032, 'steps': 27270, 'loss/train': 1.7725143432617188} 11/07/2021 01:02:57 - INFO - __main__ - Step 27272: {'lr': 0.00046488505956636286, 'samples': 5236224, 'steps': 27271, 'loss/train': 1.777441143989563} 11/07/2021 01:02:57 - INFO - __main__ - Step 27273: {'lr': 0.0004648823474103251, 'samples': 5236416, 'steps': 27272, 'loss/train': 1.489535927772522} 11/07/2021 01:02:58 - INFO - __main__ - Step 27274: {'lr': 0.0004648796351574648, 'samples': 5236608, 'steps': 27273, 'loss/train': 1.0099929571151733} 11/07/2021 01:02:58 - INFO - __main__ - Step 27275: {'lr': 0.0004648769228077829, 'samples': 5236800, 'steps': 27274, 'loss/train': 1.5548713207244873} 11/07/2021 01:02:59 - INFO - __main__ - Step 27276: {'lr': 0.00046487421036128085, 'samples': 5236992, 'steps': 27275, 'loss/train': 1.2483229637145996} 11/07/2021 01:02:59 - INFO - __main__ - Step 27277: {'lr': 0.00046487149781795976, 'samples': 5237184, 'steps': 27276, 'loss/train': 1.5984280109405518} 11/07/2021 01:02:59 - INFO - __main__ - Step 27278: {'lr': 0.00046486878517782094, 'samples': 5237376, 'steps': 27277, 'loss/train': 1.1107038259506226} 11/07/2021 01:03:01 - INFO - __main__ - Step 27279: {'lr': 0.0004648660724408656, 'samples': 5237568, 'steps': 27278, 'loss/train': 1.5515042543411255} 11/07/2021 01:03:01 - INFO - __main__ - Step 27280: {'lr': 0.00046486335960709485, 'samples': 5237760, 'steps': 27279, 'loss/train': 1.8898242712020874} 11/07/2021 01:03:01 - INFO - __main__ - Step 27281: {'lr': 0.00046486064667651, 'samples': 5237952, 'steps': 27280, 'loss/train': 1.912441372871399} 11/07/2021 01:03:02 - INFO - __main__ - Step 27282: {'lr': 0.0004648579336491123, 'samples': 5238144, 'steps': 27281, 'loss/train': 1.492627739906311} 11/07/2021 01:03:02 - INFO - __main__ - Step 27283: {'lr': 0.0004648552205249029, 'samples': 5238336, 'steps': 27282, 'loss/train': 1.3607357740402222} 11/07/2021 01:03:02 - INFO - __main__ - Step 27284: {'lr': 0.000464852507303883, 'samples': 5238528, 'steps': 27283, 'loss/train': 1.754043698310852} 11/07/2021 01:03:03 - INFO - __main__ - Step 27285: {'lr': 0.0004648497939860539, 'samples': 5238720, 'steps': 27284, 'loss/train': 1.7543745040893555} 11/07/2021 01:03:04 - INFO - __main__ - Step 27286: {'lr': 0.0004648470805714169, 'samples': 5238912, 'steps': 27285, 'loss/train': 1.300349235534668} 11/07/2021 01:03:04 - INFO - __main__ - Step 27287: {'lr': 0.00046484436705997303, 'samples': 5239104, 'steps': 27286, 'loss/train': 1.2074387073516846} 11/07/2021 01:03:04 - INFO - __main__ - Step 27288: {'lr': 0.0004648416534517236, 'samples': 5239296, 'steps': 27287, 'loss/train': 1.1679052114486694} 11/07/2021 01:03:05 - INFO - __main__ - Step 27289: {'lr': 0.00046483893974666983, 'samples': 5239488, 'steps': 27288, 'loss/train': 1.7679553031921387} 11/07/2021 01:03:06 - INFO - __main__ - Step 27290: {'lr': 0.000464836225944813, 'samples': 5239680, 'steps': 27289, 'loss/train': 0.6809923648834229} 11/07/2021 01:03:06 - INFO - __main__ - Step 27291: {'lr': 0.00046483351204615423, 'samples': 5239872, 'steps': 27290, 'loss/train': 1.628966212272644} 11/07/2021 01:03:07 - INFO - __main__ - Step 27292: {'lr': 0.0004648307980506948, 'samples': 5240064, 'steps': 27291, 'loss/train': 1.4171942472457886} 11/07/2021 01:03:07 - INFO - __main__ - Step 27293: {'lr': 0.00046482808395843594, 'samples': 5240256, 'steps': 27292, 'loss/train': 2.384295701980591} 11/07/2021 01:03:07 - INFO - __main__ - Step 27294: {'lr': 0.0004648253697693789, 'samples': 5240448, 'steps': 27293, 'loss/train': 1.6184483766555786} 11/07/2021 01:03:08 - INFO - __main__ - Step 27295: {'lr': 0.0004648226554835248, 'samples': 5240640, 'steps': 27294, 'loss/train': 1.4949777126312256} 11/07/2021 01:03:09 - INFO - __main__ - Step 27296: {'lr': 0.000464819941100875, 'samples': 5240832, 'steps': 27295, 'loss/train': 1.562004566192627} 11/07/2021 01:03:09 - INFO - __main__ - Step 27297: {'lr': 0.00046481722662143057, 'samples': 5241024, 'steps': 27296, 'loss/train': 1.9358444213867188} 11/07/2021 01:03:09 - INFO - __main__ - Step 27298: {'lr': 0.0004648145120451929, 'samples': 5241216, 'steps': 27297, 'loss/train': 1.275517463684082} 11/07/2021 01:03:10 - INFO - __main__ - Step 27299: {'lr': 0.000464811797372163, 'samples': 5241408, 'steps': 27298, 'loss/train': 1.4358162879943848} 11/07/2021 01:03:11 - INFO - __main__ - Step 27300: {'lr': 0.00046480908260234234, 'samples': 5241600, 'steps': 27299, 'loss/train': 1.7115789651870728} 11/07/2021 01:03:12 - INFO - __main__ - Step 27301: {'lr': 0.0004648063677357319, 'samples': 5241792, 'steps': 27300, 'loss/train': 2.180518865585327} 11/07/2021 01:03:12 - INFO - __main__ - Step 27302: {'lr': 0.00046480365277233316, 'samples': 5241984, 'steps': 27301, 'loss/train': 2.0012290477752686} 11/07/2021 01:03:12 - INFO - __main__ - Step 27303: {'lr': 0.00046480093771214716, 'samples': 5242176, 'steps': 27302, 'loss/train': 1.8879752159118652} 11/07/2021 01:03:13 - INFO - __main__ - Step 27304: {'lr': 0.0004647982225551751, 'samples': 5242368, 'steps': 27303, 'loss/train': 0.4549408257007599} 11/07/2021 01:03:14 - INFO - __main__ - Step 27305: {'lr': 0.0004647955073014184, 'samples': 5242560, 'steps': 27304, 'loss/train': 2.0891189575195312} 11/07/2021 01:03:14 - INFO - __main__ - Step 27306: {'lr': 0.00046479279195087804, 'samples': 5242752, 'steps': 27305, 'loss/train': 1.5999822616577148} 11/07/2021 01:03:15 - INFO - __main__ - Step 27307: {'lr': 0.0004647900765035554, 'samples': 5242944, 'steps': 27306, 'loss/train': 1.7527434825897217} 11/07/2021 01:03:15 - INFO - __main__ - Step 27308: {'lr': 0.0004647873609594517, 'samples': 5243136, 'steps': 27307, 'loss/train': 1.2198915481567383} 11/07/2021 01:03:15 - INFO - __main__ - Step 27309: {'lr': 0.0004647846453185681, 'samples': 5243328, 'steps': 27308, 'loss/train': 1.5550477504730225} 11/07/2021 01:03:16 - INFO - __main__ - Step 27310: {'lr': 0.0004647819295809059, 'samples': 5243520, 'steps': 27309, 'loss/train': 1.791429877281189} 11/07/2021 01:03:17 - INFO - __main__ - Step 27311: {'lr': 0.00046477921374646624, 'samples': 5243712, 'steps': 27310, 'loss/train': 1.506710410118103} 11/07/2021 01:03:17 - INFO - __main__ - Step 27312: {'lr': 0.0004647764978152503, 'samples': 5243904, 'steps': 27311, 'loss/train': 1.6585369110107422} 11/07/2021 01:03:17 - INFO - __main__ - Step 27313: {'lr': 0.0004647737817872595, 'samples': 5244096, 'steps': 27312, 'loss/train': 1.6855876445770264} 11/07/2021 01:03:18 - INFO - __main__ - Step 27314: {'lr': 0.0004647710656624949, 'samples': 5244288, 'steps': 27313, 'loss/train': 1.1921801567077637} 11/07/2021 01:03:18 - INFO - __main__ - Step 27315: {'lr': 0.0004647683494409578, 'samples': 5244480, 'steps': 27314, 'loss/train': 1.5598256587982178} 11/07/2021 01:03:19 - INFO - __main__ - Step 27316: {'lr': 0.0004647656331226494, 'samples': 5244672, 'steps': 27315, 'loss/train': 1.3678679466247559} 11/07/2021 01:03:19 - INFO - __main__ - Step 27317: {'lr': 0.0004647629167075709, 'samples': 5244864, 'steps': 27316, 'loss/train': 1.58489191532135} 11/07/2021 01:03:20 - INFO - __main__ - Step 27318: {'lr': 0.00046476020019572354, 'samples': 5245056, 'steps': 27317, 'loss/train': 1.6443164348602295} 11/07/2021 01:03:20 - INFO - __main__ - Step 27319: {'lr': 0.00046475748358710856, 'samples': 5245248, 'steps': 27318, 'loss/train': 1.410498023033142} 11/07/2021 01:03:20 - INFO - __main__ - Step 27320: {'lr': 0.0004647547668817271, 'samples': 5245440, 'steps': 27319, 'loss/train': 1.5386722087860107} 11/07/2021 01:03:22 - INFO - __main__ - Step 27321: {'lr': 0.00046475205007958054, 'samples': 5245632, 'steps': 27320, 'loss/train': 1.3879882097244263} 11/07/2021 01:03:22 - INFO - __main__ - Step 27322: {'lr': 0.00046474933318067004, 'samples': 5245824, 'steps': 27321, 'loss/train': 1.2655384540557861} 11/07/2021 01:03:22 - INFO - __main__ - Step 27323: {'lr': 0.0004647466161849968, 'samples': 5246016, 'steps': 27322, 'loss/train': 1.598747730255127} 11/07/2021 01:03:23 - INFO - __main__ - Step 27324: {'lr': 0.000464743899092562, 'samples': 5246208, 'steps': 27323, 'loss/train': 1.4427452087402344} 11/07/2021 01:03:23 - INFO - __main__ - Step 27325: {'lr': 0.0004647411819033669, 'samples': 5246400, 'steps': 27324, 'loss/train': 1.6169787645339966} 11/07/2021 01:03:24 - INFO - __main__ - Step 27326: {'lr': 0.00046473846461741276, 'samples': 5246592, 'steps': 27325, 'loss/train': 1.5075080394744873} 11/07/2021 01:03:24 - INFO - __main__ - Step 27327: {'lr': 0.0004647357472347008, 'samples': 5246784, 'steps': 27326, 'loss/train': 1.5084218978881836} 11/07/2021 01:03:25 - INFO - __main__ - Step 27328: {'lr': 0.00046473302975523224, 'samples': 5246976, 'steps': 27327, 'loss/train': 1.3807927370071411} 11/07/2021 01:03:25 - INFO - __main__ - Step 27329: {'lr': 0.0004647303121790082, 'samples': 5247168, 'steps': 27328, 'loss/train': 1.8661872148513794} 11/07/2021 01:03:25 - INFO - __main__ - Step 27330: {'lr': 0.0004647275945060301, 'samples': 5247360, 'steps': 27329, 'loss/train': 1.6493861675262451} 11/07/2021 01:03:27 - INFO - __main__ - Step 27331: {'lr': 0.000464724876736299, 'samples': 5247552, 'steps': 27330, 'loss/train': 1.5648044347763062} 11/07/2021 01:03:27 - INFO - __main__ - Step 27332: {'lr': 0.00046472215886981616, 'samples': 5247744, 'steps': 27331, 'loss/train': 0.851533055305481} 11/07/2021 01:03:27 - INFO - __main__ - Step 27333: {'lr': 0.00046471944090658294, 'samples': 5247936, 'steps': 27332, 'loss/train': 1.6356053352355957} 11/07/2021 01:03:28 - INFO - __main__ - Step 27334: {'lr': 0.0004647167228466004, 'samples': 5248128, 'steps': 27333, 'loss/train': 1.8776708841323853} 11/07/2021 01:03:28 - INFO - __main__ - Step 27335: {'lr': 0.0004647140046898697, 'samples': 5248320, 'steps': 27334, 'loss/train': 1.3127461671829224} 11/07/2021 01:03:29 - INFO - __main__ - Step 27336: {'lr': 0.0004647112864363923, 'samples': 5248512, 'steps': 27335, 'loss/train': 1.7084875106811523} 11/07/2021 01:03:30 - INFO - __main__ - Step 27337: {'lr': 0.00046470856808616934, 'samples': 5248704, 'steps': 27336, 'loss/train': 1.421209454536438} 11/07/2021 01:03:30 - INFO - __main__ - Step 27338: {'lr': 0.0004647058496392019, 'samples': 5248896, 'steps': 27337, 'loss/train': 1.2902867794036865} 11/07/2021 01:03:30 - INFO - __main__ - Step 27339: {'lr': 0.0004647031310954914, 'samples': 5249088, 'steps': 27338, 'loss/train': 1.4306479692459106} 11/07/2021 01:03:31 - INFO - __main__ - Step 27340: {'lr': 0.00046470041245503895, 'samples': 5249280, 'steps': 27339, 'loss/train': 1.445697546005249} 11/07/2021 01:03:31 - INFO - __main__ - Step 27341: {'lr': 0.0004646976937178459, 'samples': 5249472, 'steps': 27340, 'loss/train': 1.2455345392227173} 11/07/2021 01:03:32 - INFO - __main__ - Step 27342: {'lr': 0.0004646949748839132, 'samples': 5249664, 'steps': 27341, 'loss/train': 1.1632879972457886} 11/07/2021 01:03:33 - INFO - __main__ - Step 27343: {'lr': 0.0004646922559532424, 'samples': 5249856, 'steps': 27342, 'loss/train': 1.8211750984191895} 11/07/2021 01:03:33 - INFO - __main__ - Step 27344: {'lr': 0.0004646895369258345, 'samples': 5250048, 'steps': 27343, 'loss/train': 1.6821507215499878} 11/07/2021 01:03:33 - INFO - __main__ - Step 27345: {'lr': 0.00046468681780169086, 'samples': 5250240, 'steps': 27344, 'loss/train': 1.6495689153671265} 11/07/2021 01:03:34 - INFO - __main__ - Step 27346: {'lr': 0.0004646840985808126, 'samples': 5250432, 'steps': 27345, 'loss/train': 1.566925287246704} 11/07/2021 01:03:35 - INFO - __main__ - Step 27347: {'lr': 0.0004646813792632011, 'samples': 5250624, 'steps': 27346, 'loss/train': 1.3244365453720093} 11/07/2021 01:03:35 - INFO - __main__ - Step 27348: {'lr': 0.00046467865984885736, 'samples': 5250816, 'steps': 27347, 'loss/train': 1.6940046548843384} 11/07/2021 01:03:35 - INFO - __main__ - Step 27349: {'lr': 0.0004646759403377828, 'samples': 5251008, 'steps': 27348, 'loss/train': 1.6910820007324219} 11/07/2021 01:03:36 - INFO - __main__ - Step 27350: {'lr': 0.00046467322072997865, 'samples': 5251200, 'steps': 27349, 'loss/train': 1.679405689239502} 11/07/2021 01:03:36 - INFO - __main__ - Step 27351: {'lr': 0.00046467050102544594, 'samples': 5251392, 'steps': 27350, 'loss/train': 1.8179465532302856} 11/07/2021 01:03:37 - INFO - __main__ - Step 27352: {'lr': 0.0004646677812241861, 'samples': 5251584, 'steps': 27351, 'loss/train': 1.789126992225647} 11/07/2021 01:03:37 - INFO - __main__ - Step 27353: {'lr': 0.0004646650613262001, 'samples': 5251776, 'steps': 27352, 'loss/train': 1.3540021181106567} 11/07/2021 01:03:38 - INFO - __main__ - Step 27354: {'lr': 0.00046466234133148957, 'samples': 5251968, 'steps': 27353, 'loss/train': 1.5730952024459839} 11/07/2021 01:03:38 - INFO - __main__ - Step 27355: {'lr': 0.00046465962124005535, 'samples': 5252160, 'steps': 27354, 'loss/train': 1.4978750944137573} 11/07/2021 01:03:38 - INFO - __main__ - Step 27356: {'lr': 0.0004646569010518988, 'samples': 5252352, 'steps': 27355, 'loss/train': 1.8019400835037231} 11/07/2021 01:03:40 - INFO - __main__ - Step 27357: {'lr': 0.00046465418076702125, 'samples': 5252544, 'steps': 27356, 'loss/train': 1.7005836963653564} 11/07/2021 01:03:40 - INFO - __main__ - Step 27358: {'lr': 0.00046465146038542375, 'samples': 5252736, 'steps': 27357, 'loss/train': 1.577629566192627} 11/07/2021 01:03:40 - INFO - __main__ - Step 27359: {'lr': 0.0004646487399071077, 'samples': 5252928, 'steps': 27358, 'loss/train': 1.6958972215652466} 11/07/2021 01:03:41 - INFO - __main__ - Step 27360: {'lr': 0.00046464601933207417, 'samples': 5253120, 'steps': 27359, 'loss/train': 1.4246464967727661} 11/07/2021 01:03:41 - INFO - __main__ - Step 27361: {'lr': 0.0004646432986603245, 'samples': 5253312, 'steps': 27360, 'loss/train': 1.8367726802825928} 11/07/2021 01:03:41 - INFO - __main__ - Step 27362: {'lr': 0.00046464057789185985, 'samples': 5253504, 'steps': 27361, 'loss/train': 1.778257966041565} 11/07/2021 01:03:42 - INFO - __main__ - Step 27363: {'lr': 0.00046463785702668156, 'samples': 5253696, 'steps': 27362, 'loss/train': 1.8035478591918945} 11/07/2021 01:03:43 - INFO - __main__ - Step 27364: {'lr': 0.0004646351360647907, 'samples': 5253888, 'steps': 27363, 'loss/train': 1.9287103414535522} 11/07/2021 01:03:43 - INFO - __main__ - Step 27365: {'lr': 0.00046463241500618846, 'samples': 5254080, 'steps': 27364, 'loss/train': 1.322129249572754} 11/07/2021 01:03:43 - INFO - __main__ - Step 27366: {'lr': 0.00046462969385087626, 'samples': 5254272, 'steps': 27365, 'loss/train': 1.7644007205963135} 11/07/2021 01:03:44 - INFO - __main__ - Step 27367: {'lr': 0.00046462697259885523, 'samples': 5254464, 'steps': 27366, 'loss/train': 2.6626901626586914} 11/07/2021 01:03:45 - INFO - __main__ - Step 27368: {'lr': 0.0004646242512501266, 'samples': 5254656, 'steps': 27367, 'loss/train': 1.7474702596664429} 11/07/2021 01:03:45 - INFO - __main__ - Step 27369: {'lr': 0.0004646215298046916, 'samples': 5254848, 'steps': 27368, 'loss/train': 1.823857307434082} 11/07/2021 01:03:45 - INFO - __main__ - Step 27370: {'lr': 0.00046461880826255143, 'samples': 5255040, 'steps': 27369, 'loss/train': 1.2764437198638916} 11/07/2021 01:03:46 - INFO - __main__ - Step 27371: {'lr': 0.00046461608662370734, 'samples': 5255232, 'steps': 27370, 'loss/train': 1.5031490325927734} 11/07/2021 01:03:46 - INFO - __main__ - Step 27372: {'lr': 0.0004646133648881606, 'samples': 5255424, 'steps': 27371, 'loss/train': 1.596226692199707} 11/07/2021 01:03:47 - INFO - __main__ - Step 27373: {'lr': 0.00046461064305591235, 'samples': 5255616, 'steps': 27372, 'loss/train': 1.7974703311920166} 11/07/2021 01:03:48 - INFO - __main__ - Step 27374: {'lr': 0.00046460792112696384, 'samples': 5255808, 'steps': 27373, 'loss/train': 1.4550420045852661} 11/07/2021 01:03:48 - INFO - __main__ - Step 27375: {'lr': 0.0004646051991013163, 'samples': 5256000, 'steps': 27374, 'loss/train': 2.2443268299102783} 11/07/2021 01:03:48 - INFO - __main__ - Step 27376: {'lr': 0.000464602476978971, 'samples': 5256192, 'steps': 27375, 'loss/train': 1.6068828105926514} 11/07/2021 01:03:49 - INFO - __main__ - Step 27377: {'lr': 0.00046459975475992914, 'samples': 5256384, 'steps': 27376, 'loss/train': 1.4636454582214355} 11/07/2021 01:03:50 - INFO - __main__ - Step 27378: {'lr': 0.00046459703244419194, 'samples': 5256576, 'steps': 27377, 'loss/train': 1.7761300802230835} 11/07/2021 01:03:50 - INFO - __main__ - Step 27379: {'lr': 0.0004645943100317606, 'samples': 5256768, 'steps': 27378, 'loss/train': 1.193964958190918} 11/07/2021 01:03:50 - INFO - __main__ - Step 27380: {'lr': 0.00046459158752263643, 'samples': 5256960, 'steps': 27379, 'loss/train': 1.663011908531189} 11/07/2021 01:03:51 - INFO - __main__ - Step 27381: {'lr': 0.0004645888649168205, 'samples': 5257152, 'steps': 27380, 'loss/train': 1.6378246545791626} 11/07/2021 01:03:51 - INFO - __main__ - Step 27382: {'lr': 0.0004645861422143143, 'samples': 5257344, 'steps': 27381, 'loss/train': 2.5152504444122314} 11/07/2021 01:03:52 - INFO - __main__ - Step 27383: {'lr': 0.0004645834194151187, 'samples': 5257536, 'steps': 27382, 'loss/train': 1.734678030014038} 11/07/2021 01:03:53 - INFO - __main__ - Step 27384: {'lr': 0.0004645806965192353, 'samples': 5257728, 'steps': 27383, 'loss/train': 0.6871659159660339} 11/07/2021 01:03:53 - INFO - __main__ - Step 27385: {'lr': 0.000464577973526665, 'samples': 5257920, 'steps': 27384, 'loss/train': 1.138061761856079} 11/07/2021 01:03:53 - INFO - __main__ - Step 27386: {'lr': 0.00046457525043740926, 'samples': 5258112, 'steps': 27385, 'loss/train': 1.6762138605117798} 11/07/2021 01:03:54 - INFO - __main__ - Step 27387: {'lr': 0.0004645725272514693, 'samples': 5258304, 'steps': 27386, 'loss/train': 1.5222851037979126} 11/07/2021 01:03:55 - INFO - __main__ - Step 27388: {'lr': 0.0004645698039688461, 'samples': 5258496, 'steps': 27387, 'loss/train': 1.952163815498352} 11/07/2021 01:03:55 - INFO - __main__ - Step 27389: {'lr': 0.00046456708058954116, 'samples': 5258688, 'steps': 27388, 'loss/train': 1.1741615533828735} 11/07/2021 01:03:55 - INFO - __main__ - Step 27390: {'lr': 0.0004645643571135556, 'samples': 5258880, 'steps': 27389, 'loss/train': 1.7031134366989136} 11/07/2021 01:03:56 - INFO - __main__ - Step 27391: {'lr': 0.00046456163354089065, 'samples': 5259072, 'steps': 27390, 'loss/train': 1.7008723020553589} 11/07/2021 01:03:56 - INFO - __main__ - Step 27392: {'lr': 0.00046455890987154747, 'samples': 5259264, 'steps': 27391, 'loss/train': 1.5040371417999268} 11/07/2021 01:03:56 - INFO - __main__ - Step 27393: {'lr': 0.0004645561861055274, 'samples': 5259456, 'steps': 27392, 'loss/train': 0.9468610882759094} 11/07/2021 01:03:58 - INFO - __main__ - Step 27394: {'lr': 0.00046455346224283167, 'samples': 5259648, 'steps': 27393, 'loss/train': 1.3570585250854492} 11/07/2021 01:03:58 - INFO - __main__ - Step 27395: {'lr': 0.00046455073828346137, 'samples': 5259840, 'steps': 27394, 'loss/train': 1.878167748451233} 11/07/2021 01:03:58 - INFO - __main__ - Step 27396: {'lr': 0.0004645480142274179, 'samples': 5260032, 'steps': 27395, 'loss/train': 1.4309208393096924} 11/07/2021 01:03:59 - INFO - __main__ - Step 27397: {'lr': 0.0004645452900747024, 'samples': 5260224, 'steps': 27396, 'loss/train': 0.8551251888275146} 11/07/2021 01:03:59 - INFO - __main__ - Step 27398: {'lr': 0.00046454256582531604, 'samples': 5260416, 'steps': 27397, 'loss/train': 1.6647512912750244} 11/07/2021 01:04:00 - INFO - __main__ - Step 27399: {'lr': 0.0004645398414792602, 'samples': 5260608, 'steps': 27398, 'loss/train': 1.1795614957809448} 11/07/2021 01:04:00 - INFO - __main__ - Step 27400: {'lr': 0.000464537117036536, 'samples': 5260800, 'steps': 27399, 'loss/train': 1.4849789142608643} 11/07/2021 01:04:01 - INFO - __main__ - Step 27401: {'lr': 0.00046453439249714466, 'samples': 5260992, 'steps': 27400, 'loss/train': 1.4565329551696777} 11/07/2021 01:04:01 - INFO - __main__ - Step 27402: {'lr': 0.00046453166786108736, 'samples': 5261184, 'steps': 27401, 'loss/train': 1.5868231058120728} 11/07/2021 01:04:01 - INFO - __main__ - Step 27403: {'lr': 0.00046452894312836547, 'samples': 5261376, 'steps': 27402, 'loss/train': 1.570541501045227} 11/07/2021 01:04:03 - INFO - __main__ - Step 27404: {'lr': 0.0004645262182989802, 'samples': 5261568, 'steps': 27403, 'loss/train': 1.6559795141220093} 11/07/2021 01:04:03 - INFO - __main__ - Step 27405: {'lr': 0.0004645234933729327, 'samples': 5261760, 'steps': 27404, 'loss/train': 4.920253753662109} 11/07/2021 01:04:03 - INFO - __main__ - Step 27406: {'lr': 0.00046452076835022416, 'samples': 5261952, 'steps': 27405, 'loss/train': 2.2183635234832764} 11/07/2021 01:04:04 - INFO - __main__ - Step 27407: {'lr': 0.0004645180432308559, 'samples': 5262144, 'steps': 27406, 'loss/train': 1.607749104499817} 11/07/2021 01:04:04 - INFO - __main__ - Step 27408: {'lr': 0.00046451531801482913, 'samples': 5262336, 'steps': 27407, 'loss/train': 1.6804707050323486} 11/07/2021 01:04:04 - INFO - __main__ - Step 27409: {'lr': 0.00046451259270214505, 'samples': 5262528, 'steps': 27408, 'loss/train': 1.7157375812530518} 11/07/2021 01:04:05 - INFO - __main__ - Step 27410: {'lr': 0.00046450986729280495, 'samples': 5262720, 'steps': 27409, 'loss/train': 0.6833308339118958} 11/07/2021 01:04:06 - INFO - __main__ - Step 27411: {'lr': 0.00046450714178680996, 'samples': 5262912, 'steps': 27410, 'loss/train': 1.1192928552627563} 11/07/2021 01:04:06 - INFO - __main__ - Step 27412: {'lr': 0.0004645044161841614, 'samples': 5263104, 'steps': 27411, 'loss/train': 1.5120153427124023} 11/07/2021 01:04:07 - INFO - __main__ - Step 27413: {'lr': 0.00046450169048486045, 'samples': 5263296, 'steps': 27412, 'loss/train': 1.5773380994796753} 11/07/2021 01:04:07 - INFO - __main__ - Step 27414: {'lr': 0.0004644989646889084, 'samples': 5263488, 'steps': 27413, 'loss/train': 1.3461500406265259} 11/07/2021 01:04:08 - INFO - __main__ - Step 27415: {'lr': 0.0004644962387963063, 'samples': 5263680, 'steps': 27414, 'loss/train': 1.4467198848724365} 11/07/2021 01:04:08 - INFO - __main__ - Step 27416: {'lr': 0.0004644935128070556, 'samples': 5263872, 'steps': 27415, 'loss/train': 0.9525635242462158} 11/07/2021 01:04:09 - INFO - __main__ - Step 27417: {'lr': 0.0004644907867211574, 'samples': 5264064, 'steps': 27416, 'loss/train': 2.4026198387145996} 11/07/2021 01:04:09 - INFO - __main__ - Step 27418: {'lr': 0.000464488060538613, 'samples': 5264256, 'steps': 27417, 'loss/train': 1.6511754989624023} 11/07/2021 01:04:09 - INFO - __main__ - Step 27419: {'lr': 0.0004644853342594235, 'samples': 5264448, 'steps': 27418, 'loss/train': 1.9703359603881836} 11/07/2021 01:04:10 - INFO - __main__ - Step 27420: {'lr': 0.0004644826078835903, 'samples': 5264640, 'steps': 27419, 'loss/train': 1.7109949588775635} 11/07/2021 01:04:11 - INFO - __main__ - Step 27421: {'lr': 0.00046447988141111457, 'samples': 5264832, 'steps': 27420, 'loss/train': 1.3898118734359741} 11/07/2021 01:04:11 - INFO - __main__ - Step 27422: {'lr': 0.0004644771548419975, 'samples': 5265024, 'steps': 27421, 'loss/train': 1.4866039752960205} 11/07/2021 01:04:11 - INFO - __main__ - Step 27423: {'lr': 0.0004644744281762403, 'samples': 5265216, 'steps': 27422, 'loss/train': 1.669671893119812} 11/07/2021 01:04:12 - INFO - __main__ - Step 27424: {'lr': 0.0004644717014138442, 'samples': 5265408, 'steps': 27423, 'loss/train': 2.594538450241089} 11/07/2021 01:04:13 - INFO - __main__ - Step 27425: {'lr': 0.0004644689745548105, 'samples': 5265600, 'steps': 27424, 'loss/train': 1.887316346168518} 11/07/2021 01:04:13 - INFO - __main__ - Step 27426: {'lr': 0.00046446624759914043, 'samples': 5265792, 'steps': 27425, 'loss/train': 1.6005641222000122} 11/07/2021 01:04:14 - INFO - __main__ - Step 27427: {'lr': 0.0004644635205468351, 'samples': 5265984, 'steps': 27426, 'loss/train': 1.8438071012496948} 11/07/2021 01:04:14 - INFO - __main__ - Step 27428: {'lr': 0.00046446079339789587, 'samples': 5266176, 'steps': 27427, 'loss/train': 1.7589665651321411} 11/07/2021 01:04:14 - INFO - __main__ - Step 27429: {'lr': 0.0004644580661523239, 'samples': 5266368, 'steps': 27428, 'loss/train': 1.9420253038406372} 11/07/2021 01:04:15 - INFO - __main__ - Step 27430: {'lr': 0.00046445533881012043, 'samples': 5266560, 'steps': 27429, 'loss/train': 1.7142422199249268} 11/07/2021 01:04:16 - INFO - __main__ - Step 27431: {'lr': 0.0004644526113712867, 'samples': 5266752, 'steps': 27430, 'loss/train': 1.6149872541427612} 11/07/2021 01:04:16 - INFO - __main__ - Step 27432: {'lr': 0.00046444988383582394, 'samples': 5266944, 'steps': 27431, 'loss/train': 1.6525791883468628} 11/07/2021 01:04:16 - INFO - __main__ - Step 27433: {'lr': 0.0004644471562037333, 'samples': 5267136, 'steps': 27432, 'loss/train': 1.556830644607544} 11/07/2021 01:04:17 - INFO - __main__ - Step 27434: {'lr': 0.0004644444284750162, 'samples': 5267328, 'steps': 27433, 'loss/train': 1.8048747777938843} 11/07/2021 01:04:17 - INFO - __main__ - Step 27435: {'lr': 0.0004644417006496737, 'samples': 5267520, 'steps': 27434, 'loss/train': 1.553612470626831} 11/07/2021 01:04:18 - INFO - __main__ - Step 27436: {'lr': 0.0004644389727277071, 'samples': 5267712, 'steps': 27435, 'loss/train': 1.6055339574813843} 11/07/2021 01:04:19 - INFO - __main__ - Step 27437: {'lr': 0.00046443624470911754, 'samples': 5267904, 'steps': 27436, 'loss/train': 1.7883402109146118} 11/07/2021 01:04:19 - INFO - __main__ - Step 27438: {'lr': 0.00046443351659390637, 'samples': 5268096, 'steps': 27437, 'loss/train': 1.3733422756195068} 11/07/2021 01:04:19 - INFO - __main__ - Step 27439: {'lr': 0.00046443078838207474, 'samples': 5268288, 'steps': 27438, 'loss/train': 1.651326298713684} 11/07/2021 01:04:20 - INFO - __main__ - Step 27440: {'lr': 0.00046442806007362394, 'samples': 5268480, 'steps': 27439, 'loss/train': 1.7419207096099854} 11/07/2021 01:04:21 - INFO - __main__ - Step 27441: {'lr': 0.00046442533166855517, 'samples': 5268672, 'steps': 27440, 'loss/train': 1.782727837562561} 11/07/2021 01:04:21 - INFO - __main__ - Step 27442: {'lr': 0.00046442260316686957, 'samples': 5268864, 'steps': 27441, 'loss/train': 1.3582515716552734} 11/07/2021 01:04:22 - INFO - __main__ - Step 27443: {'lr': 0.0004644198745685685, 'samples': 5269056, 'steps': 27442, 'loss/train': 1.7487573623657227} 11/07/2021 01:04:22 - INFO - __main__ - Step 27444: {'lr': 0.00046441714587365317, 'samples': 5269248, 'steps': 27443, 'loss/train': 1.5707968473434448} 11/07/2021 01:04:22 - INFO - __main__ - Step 27445: {'lr': 0.00046441441708212477, 'samples': 5269440, 'steps': 27444, 'loss/train': 1.8958215713500977} 11/07/2021 01:04:23 - INFO - __main__ - Step 27446: {'lr': 0.00046441168819398457, 'samples': 5269632, 'steps': 27445, 'loss/train': 1.5987136363983154} 11/07/2021 01:04:24 - INFO - __main__ - Step 27447: {'lr': 0.0004644089592092338, 'samples': 5269824, 'steps': 27446, 'loss/train': 1.5505801439285278} 11/07/2021 01:04:24 - INFO - __main__ - Step 27448: {'lr': 0.0004644062301278735, 'samples': 5270016, 'steps': 27447, 'loss/train': 1.6309155225753784} 11/07/2021 01:04:24 - INFO - __main__ - Step 27449: {'lr': 0.0004644035009499052, 'samples': 5270208, 'steps': 27448, 'loss/train': 1.9587770700454712} 11/07/2021 01:04:25 - INFO - __main__ - Step 27450: {'lr': 0.0004644007716753299, 'samples': 5270400, 'steps': 27449, 'loss/train': 2.0377914905548096} 11/07/2021 01:04:26 - INFO - __main__ - Step 27451: {'lr': 0.00046439804230414904, 'samples': 5270592, 'steps': 27450, 'loss/train': 1.4115687608718872} 11/07/2021 01:04:26 - INFO - __main__ - Step 27452: {'lr': 0.0004643953128363637, 'samples': 5270784, 'steps': 27451, 'loss/train': 1.5786956548690796} 11/07/2021 01:04:27 - INFO - __main__ - Step 27453: {'lr': 0.0004643925832719751, 'samples': 5270976, 'steps': 27452, 'loss/train': 1.5803961753845215} 11/07/2021 01:04:27 - INFO - __main__ - Step 27454: {'lr': 0.0004643898536109845, 'samples': 5271168, 'steps': 27453, 'loss/train': 1.38846755027771} 11/07/2021 01:04:27 - INFO - __main__ - Step 27455: {'lr': 0.0004643871238533931, 'samples': 5271360, 'steps': 27454, 'loss/train': 1.9401226043701172} 11/07/2021 01:04:28 - INFO - __main__ - Step 27456: {'lr': 0.0004643843939992022, 'samples': 5271552, 'steps': 27455, 'loss/train': 1.8082693815231323} 11/07/2021 01:04:29 - INFO - __main__ - Step 27457: {'lr': 0.0004643816640484131, 'samples': 5271744, 'steps': 27456, 'loss/train': 1.6021026372909546} 11/07/2021 01:04:29 - INFO - __main__ - Step 27458: {'lr': 0.0004643789340010268, 'samples': 5271936, 'steps': 27457, 'loss/train': 1.9755364656448364} 11/07/2021 01:04:29 - INFO - __main__ - Step 27459: {'lr': 0.00046437620385704476, 'samples': 5272128, 'steps': 27458, 'loss/train': 0.6707550287246704} 11/07/2021 01:04:30 - INFO - __main__ - Step 27460: {'lr': 0.0004643734736164681, 'samples': 5272320, 'steps': 27459, 'loss/train': 2.275068998336792} 11/07/2021 01:04:30 - INFO - __main__ - Step 27461: {'lr': 0.00046437074327929795, 'samples': 5272512, 'steps': 27460, 'loss/train': 1.1870478391647339} 11/07/2021 01:04:31 - INFO - __main__ - Step 27462: {'lr': 0.0004643680128455358, 'samples': 5272704, 'steps': 27461, 'loss/train': 1.5905396938323975} 11/07/2021 01:04:31 - INFO - __main__ - Step 27463: {'lr': 0.00046436528231518263, 'samples': 5272896, 'steps': 27462, 'loss/train': 1.551306128501892} 11/07/2021 01:04:32 - INFO - __main__ - Step 27464: {'lr': 0.0004643625516882398, 'samples': 5273088, 'steps': 27463, 'loss/train': 1.5059658288955688} 11/07/2021 01:04:32 - INFO - __main__ - Step 27465: {'lr': 0.0004643598209647085, 'samples': 5273280, 'steps': 27464, 'loss/train': 1.6166335344314575} 11/07/2021 01:04:32 - INFO - __main__ - Step 27466: {'lr': 0.00046435709014459, 'samples': 5273472, 'steps': 27465, 'loss/train': 1.4816384315490723} 11/07/2021 01:04:34 - INFO - __main__ - Step 27467: {'lr': 0.0004643543592278855, 'samples': 5273664, 'steps': 27466, 'loss/train': 0.9950994253158569} 11/07/2021 01:04:34 - INFO - __main__ - Step 27468: {'lr': 0.0004643516282145962, 'samples': 5273856, 'steps': 27467, 'loss/train': 2.083305835723877} 11/07/2021 01:04:34 - INFO - __main__ - Step 27469: {'lr': 0.0004643488971047234, 'samples': 5274048, 'steps': 27468, 'loss/train': 1.0330928564071655} 11/07/2021 01:04:35 - INFO - __main__ - Step 27470: {'lr': 0.0004643461658982683, 'samples': 5274240, 'steps': 27469, 'loss/train': 1.721252202987671} 11/07/2021 01:04:35 - INFO - __main__ - Step 27471: {'lr': 0.00046434343459523207, 'samples': 5274432, 'steps': 27470, 'loss/train': 1.5718274116516113} 11/07/2021 01:04:36 - INFO - __main__ - Step 27472: {'lr': 0.00046434070319561604, 'samples': 5274624, 'steps': 27471, 'loss/train': 1.4991551637649536} 11/07/2021 01:04:36 - INFO - __main__ - Step 27473: {'lr': 0.0004643379716994214, 'samples': 5274816, 'steps': 27472, 'loss/train': 1.754794955253601} 11/07/2021 01:04:37 - INFO - __main__ - Step 27474: {'lr': 0.0004643352401066494, 'samples': 5275008, 'steps': 27473, 'loss/train': 1.6142194271087646} 11/07/2021 01:04:37 - INFO - __main__ - Step 27475: {'lr': 0.00046433250841730123, 'samples': 5275200, 'steps': 27474, 'loss/train': 1.5695093870162964} 11/07/2021 01:04:37 - INFO - __main__ - Step 27476: {'lr': 0.0004643297766313781, 'samples': 5275392, 'steps': 27475, 'loss/train': 0.8923225402832031} 11/07/2021 01:04:39 - INFO - __main__ - Step 27477: {'lr': 0.0004643270447488813, 'samples': 5275584, 'steps': 27476, 'loss/train': 1.443546175956726} 11/07/2021 01:04:39 - INFO - __main__ - Step 27478: {'lr': 0.000464324312769812, 'samples': 5275776, 'steps': 27477, 'loss/train': 1.7197577953338623} 11/07/2021 01:04:39 - INFO - __main__ - Step 27479: {'lr': 0.0004643215806941716, 'samples': 5275968, 'steps': 27478, 'loss/train': 1.650207281112671} 11/07/2021 01:04:40 - INFO - __main__ - Step 27480: {'lr': 0.00046431884852196105, 'samples': 5276160, 'steps': 27479, 'loss/train': 1.3830845355987549} 11/07/2021 01:04:40 - INFO - __main__ - Step 27481: {'lr': 0.0004643161162531818, 'samples': 5276352, 'steps': 27480, 'loss/train': 1.2445502281188965} 11/07/2021 01:04:41 - INFO - __main__ - Step 27482: {'lr': 0.00046431338388783504, 'samples': 5276544, 'steps': 27481, 'loss/train': 0.7495806813240051} 11/07/2021 01:04:41 - INFO - __main__ - Step 27483: {'lr': 0.000464310651425922, 'samples': 5276736, 'steps': 27482, 'loss/train': 1.5686403512954712} 11/07/2021 01:04:42 - INFO - __main__ - Step 27484: {'lr': 0.00046430791886744384, 'samples': 5276928, 'steps': 27483, 'loss/train': 1.2439922094345093} 11/07/2021 01:04:42 - INFO - __main__ - Step 27485: {'lr': 0.0004643051862124018, 'samples': 5277120, 'steps': 27484, 'loss/train': 1.5504546165466309} 11/07/2021 01:04:42 - INFO - __main__ - Step 27486: {'lr': 0.0004643024534607973, 'samples': 5277312, 'steps': 27485, 'loss/train': 1.6598097085952759} 11/07/2021 01:04:43 - INFO - __main__ - Step 27487: {'lr': 0.00046429972061263125, 'samples': 5277504, 'steps': 27486, 'loss/train': 1.5114681720733643} 11/07/2021 01:04:44 - INFO - __main__ - Step 27488: {'lr': 0.0004642969876679051, 'samples': 5277696, 'steps': 27487, 'loss/train': 1.344405174255371} 11/07/2021 01:04:44 - INFO - __main__ - Step 27489: {'lr': 0.00046429425462662, 'samples': 5277888, 'steps': 27488, 'loss/train': 1.8302743434906006} 11/07/2021 01:04:44 - INFO - __main__ - Step 27490: {'lr': 0.00046429152148877727, 'samples': 5278080, 'steps': 27489, 'loss/train': 1.5790754556655884} 11/07/2021 01:04:45 - INFO - __main__ - Step 27491: {'lr': 0.00046428878825437815, 'samples': 5278272, 'steps': 27490, 'loss/train': 1.4722696542739868} 11/07/2021 01:04:45 - INFO - __main__ - Step 27492: {'lr': 0.00046428605492342367, 'samples': 5278464, 'steps': 27491, 'loss/train': 1.368982195854187} 11/07/2021 01:04:46 - INFO - __main__ - Step 27493: {'lr': 0.00046428332149591535, 'samples': 5278656, 'steps': 27492, 'loss/train': 1.5615782737731934} 11/07/2021 01:04:47 - INFO - __main__ - Step 27494: {'lr': 0.0004642805879718541, 'samples': 5278848, 'steps': 27493, 'loss/train': 0.2677178680896759} 11/07/2021 01:04:47 - INFO - __main__ - Step 27495: {'lr': 0.00046427785435124147, 'samples': 5279040, 'steps': 27494, 'loss/train': 1.5332293510437012} 11/07/2021 01:04:47 - INFO - __main__ - Step 27496: {'lr': 0.0004642751206340785, 'samples': 5279232, 'steps': 27495, 'loss/train': 1.7823522090911865} 11/07/2021 01:04:48 - INFO - __main__ - Step 27497: {'lr': 0.00046427238682036643, 'samples': 5279424, 'steps': 27496, 'loss/train': 1.2205898761749268} 11/07/2021 01:04:49 - INFO - __main__ - Step 27498: {'lr': 0.0004642696529101066, 'samples': 5279616, 'steps': 27497, 'loss/train': 1.7394107580184937} 11/07/2021 01:04:49 - INFO - __main__ - Step 27499: {'lr': 0.0004642669189033001, 'samples': 5279808, 'steps': 27498, 'loss/train': 1.6603978872299194} 11/07/2021 01:04:49 - INFO - __main__ - Step 27500: {'lr': 0.0004642641847999483, 'samples': 5280000, 'steps': 27499, 'loss/train': 1.5572772026062012} 11/07/2021 01:04:50 - INFO - __main__ - Step 27501: {'lr': 0.0004642614506000523, 'samples': 5280192, 'steps': 27500, 'loss/train': 1.5926954746246338} 11/07/2021 01:04:50 - INFO - __main__ - Step 27502: {'lr': 0.00046425871630361343, 'samples': 5280384, 'steps': 27501, 'loss/train': 1.7171363830566406} 11/07/2021 01:04:51 - INFO - __main__ - Step 27503: {'lr': 0.0004642559819106329, 'samples': 5280576, 'steps': 27502, 'loss/train': 1.238904595375061} 11/07/2021 01:04:51 - INFO - __main__ - Step 27504: {'lr': 0.0004642532474211119, 'samples': 5280768, 'steps': 27503, 'loss/train': 1.410136103630066} 11/07/2021 01:04:52 - INFO - __main__ - Step 27505: {'lr': 0.0004642505128350517, 'samples': 5280960, 'steps': 27504, 'loss/train': 1.3256465196609497} 11/07/2021 01:04:52 - INFO - __main__ - Step 27506: {'lr': 0.00046424777815245354, 'samples': 5281152, 'steps': 27505, 'loss/train': 3.3891711235046387} 11/07/2021 01:04:52 - INFO - __main__ - Step 27507: {'lr': 0.0004642450433733186, 'samples': 5281344, 'steps': 27506, 'loss/train': 1.6674591302871704} 11/07/2021 01:04:54 - INFO - __main__ - Step 27508: {'lr': 0.0004642423084976482, 'samples': 5281536, 'steps': 27507, 'loss/train': 1.3353351354599} 11/07/2021 01:04:54 - INFO - __main__ - Step 27509: {'lr': 0.0004642395735254435, 'samples': 5281728, 'steps': 27508, 'loss/train': 1.587765097618103} 11/07/2021 01:04:54 - INFO - __main__ - Step 27510: {'lr': 0.0004642368384567058, 'samples': 5281920, 'steps': 27509, 'loss/train': 1.5925922393798828} 11/07/2021 01:04:55 - INFO - __main__ - Step 27511: {'lr': 0.0004642341032914362, 'samples': 5282112, 'steps': 27510, 'loss/train': 1.5339746475219727} 11/07/2021 01:04:55 - INFO - __main__ - Step 27512: {'lr': 0.00046423136802963607, 'samples': 5282304, 'steps': 27511, 'loss/train': 1.8380541801452637} 11/07/2021 01:04:56 - INFO - __main__ - Step 27513: {'lr': 0.0004642286326713065, 'samples': 5282496, 'steps': 27512, 'loss/train': 1.510982871055603} 11/07/2021 01:04:56 - INFO - __main__ - Step 27514: {'lr': 0.000464225897216449, 'samples': 5282688, 'steps': 27513, 'loss/train': 1.748958945274353} 11/07/2021 01:04:57 - INFO - __main__ - Step 27515: {'lr': 0.0004642231616650645, 'samples': 5282880, 'steps': 27514, 'loss/train': 1.3821334838867188} 11/07/2021 01:04:57 - INFO - __main__ - Step 27516: {'lr': 0.00046422042601715433, 'samples': 5283072, 'steps': 27515, 'loss/train': 1.5916210412979126} 11/07/2021 01:04:57 - INFO - __main__ - Step 27517: {'lr': 0.00046421769027271974, 'samples': 5283264, 'steps': 27516, 'loss/train': 1.6284141540527344} 11/07/2021 01:04:58 - INFO - __main__ - Step 27518: {'lr': 0.00046421495443176204, 'samples': 5283456, 'steps': 27517, 'loss/train': 1.2993590831756592} 11/07/2021 01:04:59 - INFO - __main__ - Step 27519: {'lr': 0.0004642122184942824, 'samples': 5283648, 'steps': 27518, 'loss/train': 1.648287296295166} 11/07/2021 01:04:59 - INFO - __main__ - Step 27520: {'lr': 0.00046420948246028194, 'samples': 5283840, 'steps': 27519, 'loss/train': 1.5649181604385376} 11/07/2021 01:05:00 - INFO - __main__ - Step 27521: {'lr': 0.000464206746329762, 'samples': 5284032, 'steps': 27520, 'loss/train': 1.5230170488357544} 11/07/2021 01:05:00 - INFO - __main__ - Step 27522: {'lr': 0.00046420401010272385, 'samples': 5284224, 'steps': 27521, 'loss/train': 1.478598952293396} 11/07/2021 01:05:00 - INFO - __main__ - Step 27523: {'lr': 0.00046420127377916863, 'samples': 5284416, 'steps': 27522, 'loss/train': 1.6656675338745117} 11/07/2021 01:05:01 - INFO - __main__ - Step 27524: {'lr': 0.0004641985373590977, 'samples': 5284608, 'steps': 27523, 'loss/train': 0.9977850914001465} 11/07/2021 01:05:02 - INFO - __main__ - Step 27525: {'lr': 0.00046419580084251224, 'samples': 5284800, 'steps': 27524, 'loss/train': 1.8102953433990479} 11/07/2021 01:05:02 - INFO - __main__ - Step 27526: {'lr': 0.0004641930642294133, 'samples': 5284992, 'steps': 27525, 'loss/train': 1.6369602680206299} 11/07/2021 01:05:02 - INFO - __main__ - Step 27527: {'lr': 0.0004641903275198024, 'samples': 5285184, 'steps': 27526, 'loss/train': 0.6573544144630432} 11/07/2021 01:05:03 - INFO - __main__ - Step 27528: {'lr': 0.0004641875907136806, 'samples': 5285376, 'steps': 27527, 'loss/train': 1.3041669130325317} 11/07/2021 01:05:04 - INFO - __main__ - Step 27529: {'lr': 0.0004641848538110492, 'samples': 5285568, 'steps': 27528, 'loss/train': 1.6319307088851929} 11/07/2021 01:05:04 - INFO - __main__ - Step 27530: {'lr': 0.00046418211681190937, 'samples': 5285760, 'steps': 27529, 'loss/train': 1.5995657444000244} 11/07/2021 01:05:05 - INFO - __main__ - Step 27531: {'lr': 0.00046417937971626245, 'samples': 5285952, 'steps': 27530, 'loss/train': 1.3074729442596436} 11/07/2021 01:05:05 - INFO - __main__ - Step 27532: {'lr': 0.0004641766425241095, 'samples': 5286144, 'steps': 27531, 'loss/train': 1.555513858795166} 11/07/2021 01:05:05 - INFO - __main__ - Step 27533: {'lr': 0.000464173905235452, 'samples': 5286336, 'steps': 27532, 'loss/train': 1.5921692848205566} 11/07/2021 01:05:06 - INFO - __main__ - Step 27534: {'lr': 0.0004641711678502909, 'samples': 5286528, 'steps': 27533, 'loss/train': 1.369351863861084} 11/07/2021 01:05:07 - INFO - __main__ - Step 27535: {'lr': 0.00046416843036862766, 'samples': 5286720, 'steps': 27534, 'loss/train': 1.361032485961914} 11/07/2021 01:05:07 - INFO - __main__ - Step 27536: {'lr': 0.0004641656927904634, 'samples': 5286912, 'steps': 27535, 'loss/train': 1.411777377128601} 11/07/2021 01:05:07 - INFO - __main__ - Step 27537: {'lr': 0.00046416295511579944, 'samples': 5287104, 'steps': 27536, 'loss/train': 2.0430092811584473} 11/07/2021 01:05:08 - INFO - __main__ - Step 27538: {'lr': 0.0004641602173446369, 'samples': 5287296, 'steps': 27537, 'loss/train': 1.117910623550415} 11/07/2021 01:05:09 - INFO - __main__ - Step 27539: {'lr': 0.00046415747947697704, 'samples': 5287488, 'steps': 27538, 'loss/train': 1.8439534902572632} 11/07/2021 01:05:09 - INFO - __main__ - Step 27540: {'lr': 0.00046415474151282124, 'samples': 5287680, 'steps': 27539, 'loss/train': 2.1135165691375732} 11/07/2021 01:05:10 - INFO - __main__ - Step 27541: {'lr': 0.0004641520034521705, 'samples': 5287872, 'steps': 27540, 'loss/train': 1.5234469175338745} 11/07/2021 01:05:10 - INFO - __main__ - Step 27542: {'lr': 0.0004641492652950262, 'samples': 5288064, 'steps': 27541, 'loss/train': 1.8389887809753418} 11/07/2021 01:05:10 - INFO - __main__ - Step 27543: {'lr': 0.0004641465270413896, 'samples': 5288256, 'steps': 27542, 'loss/train': 0.9904304146766663} 11/07/2021 01:05:12 - INFO - __main__ - Step 27544: {'lr': 0.00046414378869126185, 'samples': 5288448, 'steps': 27543, 'loss/train': 0.8239241242408752} 11/07/2021 01:05:12 - INFO - __main__ - Step 27545: {'lr': 0.0004641410502446442, 'samples': 5288640, 'steps': 27544, 'loss/train': 1.446048378944397} 11/07/2021 01:05:12 - INFO - __main__ - Step 27546: {'lr': 0.00046413831170153785, 'samples': 5288832, 'steps': 27545, 'loss/train': 1.452160358428955} 11/07/2021 01:05:13 - INFO - __main__ - Step 27547: {'lr': 0.0004641355730619442, 'samples': 5289024, 'steps': 27546, 'loss/train': 1.589703917503357} 11/07/2021 01:05:13 - INFO - __main__ - Step 27548: {'lr': 0.0004641328343258643, 'samples': 5289216, 'steps': 27547, 'loss/train': 1.2328104972839355} 11/07/2021 01:05:13 - INFO - __main__ - Step 27549: {'lr': 0.00046413009549329946, 'samples': 5289408, 'steps': 27548, 'loss/train': 1.5615376234054565} 11/07/2021 01:05:15 - INFO - __main__ - Step 27550: {'lr': 0.0004641273565642509, 'samples': 5289600, 'steps': 27549, 'loss/train': 1.6135731935501099} 11/07/2021 01:05:15 - INFO - __main__ - Step 27551: {'lr': 0.0004641246175387198, 'samples': 5289792, 'steps': 27550, 'loss/train': 1.5954854488372803} 11/07/2021 01:05:15 - INFO - __main__ - Step 27552: {'lr': 0.0004641218784167075, 'samples': 5289984, 'steps': 27551, 'loss/train': 1.8309478759765625} 11/07/2021 01:05:16 - INFO - __main__ - Step 27553: {'lr': 0.0004641191391982152, 'samples': 5290176, 'steps': 27552, 'loss/train': 1.3916101455688477} 11/07/2021 01:05:16 - INFO - __main__ - Step 27554: {'lr': 0.00046411639988324407, 'samples': 5290368, 'steps': 27553, 'loss/train': 1.8725727796554565} 11/07/2021 01:05:17 - INFO - __main__ - Step 27555: {'lr': 0.00046411366047179547, 'samples': 5290560, 'steps': 27554, 'loss/train': 1.6179206371307373} 11/07/2021 01:05:17 - INFO - __main__ - Step 27556: {'lr': 0.00046411092096387054, 'samples': 5290752, 'steps': 27555, 'loss/train': 1.7249908447265625} 11/07/2021 01:05:18 - INFO - __main__ - Step 27557: {'lr': 0.0004641081813594705, 'samples': 5290944, 'steps': 27556, 'loss/train': 1.6517274379730225} 11/07/2021 01:05:18 - INFO - __main__ - Step 27558: {'lr': 0.0004641054416585966, 'samples': 5291136, 'steps': 27557, 'loss/train': 1.313826084136963} 11/07/2021 01:05:18 - INFO - __main__ - Step 27559: {'lr': 0.00046410270186125014, 'samples': 5291328, 'steps': 27558, 'loss/train': 1.542672038078308} 11/07/2021 01:05:21 - INFO - __main__ - Step 27560: {'lr': 0.0004640999619674323, 'samples': 5291520, 'steps': 27559, 'loss/train': 1.4132981300354004} 11/07/2021 01:05:21 - INFO - __main__ - Step 27561: {'lr': 0.0004640972219771443, 'samples': 5291712, 'steps': 27560, 'loss/train': 1.1761044263839722} 11/07/2021 01:05:22 - INFO - __main__ - Step 27562: {'lr': 0.00046409448189038737, 'samples': 5291904, 'steps': 27561, 'loss/train': 0.558796226978302} 11/07/2021 01:05:22 - INFO - __main__ - Step 27563: {'lr': 0.00046409174170716284, 'samples': 5292096, 'steps': 27562, 'loss/train': 1.4939899444580078} 11/07/2021 01:05:22 - INFO - __main__ - Step 27564: {'lr': 0.0004640890014274718, 'samples': 5292288, 'steps': 27563, 'loss/train': 1.3283414840698242} 11/07/2021 01:05:23 - INFO - __main__ - Step 27565: {'lr': 0.0004640862610513156, 'samples': 5292480, 'steps': 27564, 'loss/train': 2.123770236968994} 11/07/2021 01:05:23 - INFO - __main__ - Step 27566: {'lr': 0.00046408352057869545, 'samples': 5292672, 'steps': 27565, 'loss/train': 1.8661320209503174} 11/07/2021 01:05:23 - INFO - __main__ - Step 27567: {'lr': 0.0004640807800096126, 'samples': 5292864, 'steps': 27566, 'loss/train': 1.7928009033203125} 11/07/2021 01:05:24 - INFO - __main__ - Step 27568: {'lr': 0.0004640780393440682, 'samples': 5293056, 'steps': 27567, 'loss/train': 1.7955752611160278} 11/07/2021 01:05:25 - INFO - __main__ - Step 27569: {'lr': 0.0004640752985820635, 'samples': 5293248, 'steps': 27568, 'loss/train': 1.5294089317321777} 11/07/2021 01:05:25 - INFO - __main__ - Step 27570: {'lr': 0.0004640725577235998, 'samples': 5293440, 'steps': 27569, 'loss/train': 1.3322672843933105} 11/07/2021 01:05:25 - INFO - __main__ - Step 27571: {'lr': 0.00046406981676867836, 'samples': 5293632, 'steps': 27570, 'loss/train': 1.7561861276626587} 11/07/2021 01:05:26 - INFO - __main__ - Step 27572: {'lr': 0.00046406707571730035, 'samples': 5293824, 'steps': 27571, 'loss/train': 2.1232874393463135} 11/07/2021 01:05:26 - INFO - __main__ - Step 27573: {'lr': 0.000464064334569467, 'samples': 5294016, 'steps': 27572, 'loss/train': 1.5122051239013672} 11/07/2021 01:05:27 - INFO - __main__ - Step 27574: {'lr': 0.00046406159332517956, 'samples': 5294208, 'steps': 27573, 'loss/train': 0.921796977519989} 11/07/2021 01:05:28 - INFO - __main__ - Step 27575: {'lr': 0.00046405885198443926, 'samples': 5294400, 'steps': 27574, 'loss/train': 1.4629522562026978} 11/07/2021 01:05:28 - INFO - __main__ - Step 27576: {'lr': 0.00046405611054724737, 'samples': 5294592, 'steps': 27575, 'loss/train': 1.4078749418258667} 11/07/2021 01:05:28 - INFO - __main__ - Step 27577: {'lr': 0.00046405336901360507, 'samples': 5294784, 'steps': 27576, 'loss/train': 1.6363303661346436} 11/07/2021 01:05:29 - INFO - __main__ - Step 27578: {'lr': 0.00046405062738351366, 'samples': 5294976, 'steps': 27577, 'loss/train': 1.8103362321853638} 11/07/2021 01:05:30 - INFO - __main__ - Step 27579: {'lr': 0.00046404788565697434, 'samples': 5295168, 'steps': 27578, 'loss/train': 1.8461319208145142} 11/07/2021 01:05:30 - INFO - __main__ - Step 27580: {'lr': 0.00046404514383398835, 'samples': 5295360, 'steps': 27579, 'loss/train': 1.24917471408844} 11/07/2021 01:05:30 - INFO - __main__ - Step 27581: {'lr': 0.0004640424019145568, 'samples': 5295552, 'steps': 27580, 'loss/train': 1.6586353778839111} 11/07/2021 01:05:31 - INFO - __main__ - Step 27582: {'lr': 0.00046403965989868124, 'samples': 5295744, 'steps': 27581, 'loss/train': 1.3456676006317139} 11/07/2021 01:05:31 - INFO - __main__ - Step 27583: {'lr': 0.0004640369177863626, 'samples': 5295936, 'steps': 27582, 'loss/train': 1.9105370044708252} 11/07/2021 01:05:32 - INFO - __main__ - Step 27584: {'lr': 0.00046403417557760226, 'samples': 5296128, 'steps': 27583, 'loss/train': 1.5361580848693848} 11/07/2021 01:05:32 - INFO - __main__ - Step 27585: {'lr': 0.00046403143327240136, 'samples': 5296320, 'steps': 27584, 'loss/train': 1.2893943786621094} 11/07/2021 01:05:33 - INFO - __main__ - Step 27586: {'lr': 0.00046402869087076127, 'samples': 5296512, 'steps': 27585, 'loss/train': 1.3101916313171387} 11/07/2021 01:05:33 - INFO - __main__ - Step 27587: {'lr': 0.00046402594837268314, 'samples': 5296704, 'steps': 27586, 'loss/train': 1.1525654792785645} 11/07/2021 01:05:33 - INFO - __main__ - Step 27588: {'lr': 0.0004640232057781682, 'samples': 5296896, 'steps': 27587, 'loss/train': 1.3633612394332886} 11/07/2021 01:05:35 - INFO - __main__ - Step 27589: {'lr': 0.00046402046308721776, 'samples': 5297088, 'steps': 27588, 'loss/train': 1.827514886856079} 11/07/2021 01:05:35 - INFO - __main__ - Step 27590: {'lr': 0.0004640177202998329, 'samples': 5297280, 'steps': 27589, 'loss/train': 1.6969636678695679} 11/07/2021 01:05:35 - INFO - __main__ - Step 27591: {'lr': 0.00046401497741601505, 'samples': 5297472, 'steps': 27590, 'loss/train': 1.3765476942062378} 11/07/2021 01:05:36 - INFO - __main__ - Step 27592: {'lr': 0.00046401223443576537, 'samples': 5297664, 'steps': 27591, 'loss/train': 1.5166794061660767} 11/07/2021 01:05:36 - INFO - __main__ - Step 27593: {'lr': 0.00046400949135908497, 'samples': 5297856, 'steps': 27592, 'loss/train': 1.599624514579773} 11/07/2021 01:05:37 - INFO - __main__ - Step 27594: {'lr': 0.0004640067481859753, 'samples': 5298048, 'steps': 27593, 'loss/train': 1.871532917022705} 11/07/2021 01:05:37 - INFO - __main__ - Step 27595: {'lr': 0.00046400400491643744, 'samples': 5298240, 'steps': 27594, 'loss/train': 1.5986907482147217} 11/07/2021 01:05:38 - INFO - __main__ - Step 27596: {'lr': 0.00046400126155047265, 'samples': 5298432, 'steps': 27595, 'loss/train': 0.5922433733940125} 11/07/2021 01:05:38 - INFO - __main__ - Step 27597: {'lr': 0.0004639985180880822, 'samples': 5298624, 'steps': 27596, 'loss/train': 1.390427827835083} 11/07/2021 01:05:38 - INFO - __main__ - Step 27598: {'lr': 0.0004639957745292674, 'samples': 5298816, 'steps': 27597, 'loss/train': 1.2177232503890991} 11/07/2021 01:05:40 - INFO - __main__ - Step 27599: {'lr': 0.00046399303087402935, 'samples': 5299008, 'steps': 27598, 'loss/train': 1.1444631814956665} 11/07/2021 01:05:40 - INFO - __main__ - Step 27600: {'lr': 0.00046399028712236935, 'samples': 5299200, 'steps': 27599, 'loss/train': 0.8689119815826416} 11/07/2021 01:05:40 - INFO - __main__ - Step 27601: {'lr': 0.0004639875432742886, 'samples': 5299392, 'steps': 27600, 'loss/train': 1.5241239070892334} 11/07/2021 01:05:41 - INFO - __main__ - Step 27602: {'lr': 0.0004639847993297884, 'samples': 5299584, 'steps': 27601, 'loss/train': 1.5791829824447632} 11/07/2021 01:05:41 - INFO - __main__ - Step 27603: {'lr': 0.00046398205528886994, 'samples': 5299776, 'steps': 27602, 'loss/train': 0.9915977120399475} 11/07/2021 01:05:42 - INFO - __main__ - Step 27604: {'lr': 0.00046397931115153444, 'samples': 5299968, 'steps': 27603, 'loss/train': 1.067671775817871} 11/07/2021 01:05:42 - INFO - __main__ - Step 27605: {'lr': 0.0004639765669177833, 'samples': 5300160, 'steps': 27604, 'loss/train': 1.4505391120910645} 11/07/2021 01:05:43 - INFO - __main__ - Step 27606: {'lr': 0.00046397382258761744, 'samples': 5300352, 'steps': 27605, 'loss/train': 1.1855945587158203} 11/07/2021 01:05:43 - INFO - __main__ - Step 27607: {'lr': 0.0004639710781610384, 'samples': 5300544, 'steps': 27606, 'loss/train': 0.813217043876648} 11/07/2021 01:05:43 - INFO - __main__ - Step 27608: {'lr': 0.00046396833363804724, 'samples': 5300736, 'steps': 27607, 'loss/train': 1.563719391822815} 11/07/2021 01:05:44 - INFO - __main__ - Step 27609: {'lr': 0.00046396558901864527, 'samples': 5300928, 'steps': 27608, 'loss/train': 0.7621616125106812} 11/07/2021 01:05:45 - INFO - __main__ - Step 27610: {'lr': 0.0004639628443028337, 'samples': 5301120, 'steps': 27609, 'loss/train': 1.638516902923584} 11/07/2021 01:05:45 - INFO - __main__ - Step 27611: {'lr': 0.0004639600994906138, 'samples': 5301312, 'steps': 27610, 'loss/train': 1.7831859588623047} 11/07/2021 01:05:46 - INFO - __main__ - Step 27612: {'lr': 0.00046395735458198674, 'samples': 5301504, 'steps': 27611, 'loss/train': 2.2419371604919434} 11/07/2021 01:05:46 - INFO - __main__ - Step 27613: {'lr': 0.0004639546095769538, 'samples': 5301696, 'steps': 27612, 'loss/train': 1.0362420082092285} 11/07/2021 01:05:46 - INFO - __main__ - Step 27614: {'lr': 0.00046395186447551617, 'samples': 5301888, 'steps': 27613, 'loss/train': 1.3505686521530151} 11/07/2021 01:05:47 - INFO - __main__ - Step 27615: {'lr': 0.00046394911927767526, 'samples': 5302080, 'steps': 27614, 'loss/train': 1.3175309896469116} 11/07/2021 01:05:48 - INFO - __main__ - Step 27616: {'lr': 0.0004639463739834321, 'samples': 5302272, 'steps': 27615, 'loss/train': 1.468790054321289} 11/07/2021 01:05:48 - INFO - __main__ - Step 27617: {'lr': 0.00046394362859278793, 'samples': 5302464, 'steps': 27616, 'loss/train': 1.662192702293396} 11/07/2021 01:05:48 - INFO - __main__ - Step 27618: {'lr': 0.00046394088310574416, 'samples': 5302656, 'steps': 27617, 'loss/train': 1.2508586645126343} 11/07/2021 01:05:49 - INFO - __main__ - Step 27619: {'lr': 0.000463938137522302, 'samples': 5302848, 'steps': 27618, 'loss/train': 1.403387188911438} 11/07/2021 01:05:50 - INFO - __main__ - Step 27620: {'lr': 0.00046393539184246246, 'samples': 5303040, 'steps': 27619, 'loss/train': 1.3286373615264893} 11/07/2021 01:05:50 - INFO - __main__ - Step 27621: {'lr': 0.000463932646066227, 'samples': 5303232, 'steps': 27620, 'loss/train': 1.2126948833465576} 11/07/2021 01:05:50 - INFO - __main__ - Step 27622: {'lr': 0.0004639299001935968, 'samples': 5303424, 'steps': 27621, 'loss/train': 1.6715281009674072} 11/07/2021 01:05:51 - INFO - __main__ - Step 27623: {'lr': 0.0004639271542245731, 'samples': 5303616, 'steps': 27622, 'loss/train': 1.7359400987625122} 11/07/2021 01:05:51 - INFO - __main__ - Step 27624: {'lr': 0.000463924408159157, 'samples': 5303808, 'steps': 27623, 'loss/train': 1.3238176107406616} 11/07/2021 01:05:52 - INFO - __main__ - Step 27625: {'lr': 0.00046392166199735, 'samples': 5304000, 'steps': 27624, 'loss/train': 1.7469561100006104} 11/07/2021 01:05:53 - INFO - __main__ - Step 27626: {'lr': 0.00046391891573915325, 'samples': 5304192, 'steps': 27625, 'loss/train': 1.4644496440887451} 11/07/2021 01:05:53 - INFO - __main__ - Step 27627: {'lr': 0.0004639161693845678, 'samples': 5304384, 'steps': 27626, 'loss/train': 1.8536758422851562} 11/07/2021 01:05:53 - INFO - __main__ - Step 27628: {'lr': 0.0004639134229335951, 'samples': 5304576, 'steps': 27627, 'loss/train': 1.0943701267242432} 11/07/2021 01:05:54 - INFO - __main__ - Step 27629: {'lr': 0.0004639106763862363, 'samples': 5304768, 'steps': 27628, 'loss/train': 1.6058448553085327} 11/07/2021 01:05:55 - INFO - __main__ - Step 27630: {'lr': 0.00046390792974249263, 'samples': 5304960, 'steps': 27629, 'loss/train': 1.5895884037017822} 11/07/2021 01:05:55 - INFO - __main__ - Step 27631: {'lr': 0.00046390518300236535, 'samples': 5305152, 'steps': 27630, 'loss/train': 1.5727344751358032} 11/07/2021 01:05:55 - INFO - __main__ - Step 27632: {'lr': 0.0004639024361658557, 'samples': 5305344, 'steps': 27631, 'loss/train': 1.2182259559631348} 11/07/2021 01:05:56 - INFO - __main__ - Step 27633: {'lr': 0.00046389968923296496, 'samples': 5305536, 'steps': 27632, 'loss/train': 2.014322519302368} 11/07/2021 01:05:56 - INFO - __main__ - Step 27634: {'lr': 0.0004638969422036943, 'samples': 5305728, 'steps': 27633, 'loss/train': 1.4085277318954468} 11/07/2021 01:05:57 - INFO - __main__ - Step 27635: {'lr': 0.00046389419507804493, 'samples': 5305920, 'steps': 27634, 'loss/train': 1.580593466758728} 11/07/2021 01:05:57 - INFO - __main__ - Step 27636: {'lr': 0.00046389144785601813, 'samples': 5306112, 'steps': 27635, 'loss/train': 1.948710560798645} 11/07/2021 01:05:58 - INFO - __main__ - Step 27637: {'lr': 0.0004638887005376152, 'samples': 5306304, 'steps': 27636, 'loss/train': 1.4450467824935913} 11/07/2021 01:05:58 - INFO - __main__ - Step 27638: {'lr': 0.0004638859531228373, 'samples': 5306496, 'steps': 27637, 'loss/train': 1.8315891027450562} 11/07/2021 01:05:58 - INFO - __main__ - Step 27639: {'lr': 0.00046388320561168567, 'samples': 5306688, 'steps': 27638, 'loss/train': 1.0632545948028564} 11/07/2021 01:06:00 - INFO - __main__ - Step 27640: {'lr': 0.00046388045800416157, 'samples': 5306880, 'steps': 27639, 'loss/train': 1.5780152082443237} 11/07/2021 01:06:00 - INFO - __main__ - Step 27641: {'lr': 0.00046387771030026627, 'samples': 5307072, 'steps': 27640, 'loss/train': 1.3077924251556396} 11/07/2021 01:06:00 - INFO - __main__ - Step 27642: {'lr': 0.00046387496250000095, 'samples': 5307264, 'steps': 27641, 'loss/train': 1.3584179878234863} 11/07/2021 01:06:01 - INFO - __main__ - Step 27643: {'lr': 0.0004638722146033669, 'samples': 5307456, 'steps': 27642, 'loss/train': 1.375866174697876} 11/07/2021 01:06:01 - INFO - __main__ - Step 27644: {'lr': 0.0004638694666103653, 'samples': 5307648, 'steps': 27643, 'loss/train': 1.599300742149353} 11/07/2021 01:06:02 - INFO - __main__ - Step 27645: {'lr': 0.00046386671852099743, 'samples': 5307840, 'steps': 27644, 'loss/train': 1.7007393836975098} 11/07/2021 01:06:02 - INFO - __main__ - Step 27646: {'lr': 0.0004638639703352645, 'samples': 5308032, 'steps': 27645, 'loss/train': 1.2627149820327759} 11/07/2021 01:06:03 - INFO - __main__ - Step 27647: {'lr': 0.00046386122205316783, 'samples': 5308224, 'steps': 27646, 'loss/train': 1.877655029296875} 11/07/2021 01:06:03 - INFO - __main__ - Step 27648: {'lr': 0.0004638584736747085, 'samples': 5308416, 'steps': 27647, 'loss/train': 1.1750167608261108} 11/07/2021 01:06:03 - INFO - __main__ - Step 27649: {'lr': 0.00046385572519988793, 'samples': 5308608, 'steps': 27648, 'loss/train': 1.7251285314559937} 11/07/2021 01:06:04 - INFO - __main__ - Step 27650: {'lr': 0.00046385297662870716, 'samples': 5308800, 'steps': 27649, 'loss/train': 1.1966801881790161} 11/07/2021 01:06:05 - INFO - __main__ - Step 27651: {'lr': 0.00046385022796116766, 'samples': 5308992, 'steps': 27650, 'loss/train': 1.1399863958358765} 11/07/2021 01:06:05 - INFO - __main__ - Step 27652: {'lr': 0.0004638474791972705, 'samples': 5309184, 'steps': 27651, 'loss/train': 1.7064568996429443} 11/07/2021 01:06:06 - INFO - __main__ - Step 27653: {'lr': 0.000463844730337017, 'samples': 5309376, 'steps': 27652, 'loss/train': 1.5111110210418701} 11/07/2021 01:06:06 - INFO - __main__ - Step 27654: {'lr': 0.00046384198138040825, 'samples': 5309568, 'steps': 27653, 'loss/train': 1.664940595626831} 11/07/2021 01:06:06 - INFO - __main__ - Step 27655: {'lr': 0.00046383923232744565, 'samples': 5309760, 'steps': 27654, 'loss/train': 1.659050464630127} 11/07/2021 01:06:07 - INFO - __main__ - Step 27656: {'lr': 0.00046383648317813045, 'samples': 5309952, 'steps': 27655, 'loss/train': 1.68919837474823} 11/07/2021 01:06:08 - INFO - __main__ - Step 27657: {'lr': 0.0004638337339324638, 'samples': 5310144, 'steps': 27656, 'loss/train': 1.3775867223739624} 11/07/2021 01:06:08 - INFO - __main__ - Step 27658: {'lr': 0.00046383098459044697, 'samples': 5310336, 'steps': 27657, 'loss/train': 1.3585413694381714} 11/07/2021 01:06:08 - INFO - __main__ - Step 27659: {'lr': 0.0004638282351520812, 'samples': 5310528, 'steps': 27658, 'loss/train': 0.984664261341095} 11/07/2021 01:06:09 - INFO - __main__ - Step 27660: {'lr': 0.00046382548561736773, 'samples': 5310720, 'steps': 27659, 'loss/train': 1.7749240398406982} 11/07/2021 01:06:10 - INFO - __main__ - Step 27661: {'lr': 0.0004638227359863078, 'samples': 5310912, 'steps': 27660, 'loss/train': 1.3609564304351807} 11/07/2021 01:06:10 - INFO - __main__ - Step 27662: {'lr': 0.0004638199862589026, 'samples': 5311104, 'steps': 27661, 'loss/train': 1.6411045789718628} 11/07/2021 01:06:10 - INFO - __main__ - Step 27663: {'lr': 0.0004638172364351535, 'samples': 5311296, 'steps': 27662, 'loss/train': 1.0792185068130493} 11/07/2021 01:06:11 - INFO - __main__ - Step 27664: {'lr': 0.00046381448651506153, 'samples': 5311488, 'steps': 27663, 'loss/train': 1.6429427862167358} 11/07/2021 01:06:11 - INFO - __main__ - Step 27665: {'lr': 0.00046381173649862815, 'samples': 5311680, 'steps': 27664, 'loss/train': 1.6976220607757568} 11/07/2021 01:06:12 - INFO - __main__ - Step 27666: {'lr': 0.00046380898638585447, 'samples': 5311872, 'steps': 27665, 'loss/train': 1.4528379440307617} 11/07/2021 01:06:13 - INFO - __main__ - Step 27667: {'lr': 0.0004638062361767418, 'samples': 5312064, 'steps': 27666, 'loss/train': 1.2322639226913452} 11/07/2021 01:06:13 - INFO - __main__ - Step 27668: {'lr': 0.00046380348587129127, 'samples': 5312256, 'steps': 27667, 'loss/train': 1.549231767654419} 11/07/2021 01:06:13 - INFO - __main__ - Step 27669: {'lr': 0.0004638007354695042, 'samples': 5312448, 'steps': 27668, 'loss/train': 1.3693461418151855} 11/07/2021 01:06:14 - INFO - __main__ - Step 27670: {'lr': 0.0004637979849713818, 'samples': 5312640, 'steps': 27669, 'loss/train': 0.9689860343933105} 11/07/2021 01:06:15 - INFO - __main__ - Step 27671: {'lr': 0.0004637952343769254, 'samples': 5312832, 'steps': 27670, 'loss/train': 1.2298322916030884} 11/07/2021 01:06:15 - INFO - __main__ - Step 27672: {'lr': 0.00046379248368613615, 'samples': 5313024, 'steps': 27671, 'loss/train': 1.229887843132019} 11/07/2021 01:06:15 - INFO - __main__ - Step 27673: {'lr': 0.0004637897328990153, 'samples': 5313216, 'steps': 27672, 'loss/train': 1.0177441835403442} 11/07/2021 01:06:16 - INFO - __main__ - Step 27674: {'lr': 0.000463786982015564, 'samples': 5313408, 'steps': 27673, 'loss/train': 1.730981707572937} 11/07/2021 01:06:16 - INFO - __main__ - Step 27675: {'lr': 0.00046378423103578373, 'samples': 5313600, 'steps': 27674, 'loss/train': 1.2419869899749756} 11/07/2021 01:06:17 - INFO - __main__ - Step 27676: {'lr': 0.0004637814799596755, 'samples': 5313792, 'steps': 27675, 'loss/train': 2.1071531772613525} 11/07/2021 01:06:18 - INFO - __main__ - Step 27677: {'lr': 0.00046377872878724066, 'samples': 5313984, 'steps': 27676, 'loss/train': 1.697929859161377} 11/07/2021 01:06:18 - INFO - __main__ - Step 27678: {'lr': 0.0004637759775184804, 'samples': 5314176, 'steps': 27677, 'loss/train': 1.393756628036499} 11/07/2021 01:06:18 - INFO - __main__ - Step 27679: {'lr': 0.000463773226153396, 'samples': 5314368, 'steps': 27678, 'loss/train': 0.9993812441825867} 11/07/2021 01:06:19 - INFO - __main__ - Step 27680: {'lr': 0.00046377047469198875, 'samples': 5314560, 'steps': 27679, 'loss/train': 1.5201777219772339} 11/07/2021 01:06:20 - INFO - __main__ - Step 27681: {'lr': 0.00046376772313425974, 'samples': 5314752, 'steps': 27680, 'loss/train': 1.6973458528518677} 11/07/2021 01:06:20 - INFO - __main__ - Step 27682: {'lr': 0.0004637649714802102, 'samples': 5314944, 'steps': 27681, 'loss/train': 1.1721813678741455} 11/07/2021 01:06:20 - INFO - __main__ - Step 27683: {'lr': 0.0004637622197298417, 'samples': 5315136, 'steps': 27682, 'loss/train': 1.7788022756576538} 11/07/2021 01:06:21 - INFO - __main__ - Step 27684: {'lr': 0.000463759467883155, 'samples': 5315328, 'steps': 27683, 'loss/train': 1.6087898015975952} 11/07/2021 01:06:21 - INFO - __main__ - Step 27685: {'lr': 0.0004637567159401518, 'samples': 5315520, 'steps': 27684, 'loss/train': 1.1776622533798218} 11/07/2021 01:06:23 - INFO - __main__ - Step 27686: {'lr': 0.00046375396390083303, 'samples': 5315712, 'steps': 27685, 'loss/train': 1.6208345890045166} 11/07/2021 01:06:23 - INFO - __main__ - Step 27687: {'lr': 0.0004637512117652, 'samples': 5315904, 'steps': 27686, 'loss/train': 1.5504517555236816} 11/07/2021 01:06:23 - INFO - __main__ - Step 27688: {'lr': 0.00046374845953325394, 'samples': 5316096, 'steps': 27687, 'loss/train': 1.558203935623169} 11/07/2021 01:06:24 - INFO - __main__ - Step 27689: {'lr': 0.0004637457072049962, 'samples': 5316288, 'steps': 27688, 'loss/train': 2.274839162826538} 11/07/2021 01:06:24 - INFO - __main__ - Step 27690: {'lr': 0.0004637429547804279, 'samples': 5316480, 'steps': 27689, 'loss/train': 2.026301383972168} 11/07/2021 01:06:24 - INFO - __main__ - Step 27691: {'lr': 0.0004637402022595503, 'samples': 5316672, 'steps': 27690, 'loss/train': 1.0505366325378418} 11/07/2021 01:06:25 - INFO - __main__ - Step 27692: {'lr': 0.0004637374496423647, 'samples': 5316864, 'steps': 27691, 'loss/train': 1.7772705554962158} 11/07/2021 01:06:26 - INFO - __main__ - Step 27693: {'lr': 0.0004637346969288723, 'samples': 5317056, 'steps': 27692, 'loss/train': 0.7610371708869934} 11/07/2021 01:06:26 - INFO - __main__ - Step 27694: {'lr': 0.0004637319441190743, 'samples': 5317248, 'steps': 27693, 'loss/train': 0.8985799551010132} 11/07/2021 01:06:26 - INFO - __main__ - Step 27695: {'lr': 0.00046372919121297207, 'samples': 5317440, 'steps': 27694, 'loss/train': 1.4711743593215942} 11/07/2021 01:06:27 - INFO - __main__ - Step 27696: {'lr': 0.0004637264382105667, 'samples': 5317632, 'steps': 27695, 'loss/train': 1.6593880653381348} 11/07/2021 01:06:28 - INFO - __main__ - Step 27697: {'lr': 0.00046372368511185953, 'samples': 5317824, 'steps': 27696, 'loss/train': 1.6002044677734375} 11/07/2021 01:06:28 - INFO - __main__ - Step 27698: {'lr': 0.0004637209319168517, 'samples': 5318016, 'steps': 27697, 'loss/train': 1.400031566619873} 11/07/2021 01:06:29 - INFO - __main__ - Step 27699: {'lr': 0.0004637181786255446, 'samples': 5318208, 'steps': 27698, 'loss/train': 1.4328306913375854} 11/07/2021 01:06:29 - INFO - __main__ - Step 27700: {'lr': 0.0004637154252379394, 'samples': 5318400, 'steps': 27699, 'loss/train': 1.6177538633346558} 11/07/2021 01:06:29 - INFO - __main__ - Step 27701: {'lr': 0.00046371267175403724, 'samples': 5318592, 'steps': 27700, 'loss/train': 1.7811652421951294} 11/07/2021 01:06:31 - INFO - __main__ - Step 27702: {'lr': 0.0004637099181738395, 'samples': 5318784, 'steps': 27701, 'loss/train': 1.770586371421814} 11/07/2021 01:06:32 - INFO - __main__ - Step 27703: {'lr': 0.00046370716449734733, 'samples': 5318976, 'steps': 27702, 'loss/train': 1.5121160745620728} 11/07/2021 01:06:32 - INFO - __main__ - Step 27704: {'lr': 0.00046370441072456206, 'samples': 5319168, 'steps': 27703, 'loss/train': 1.7021570205688477} 11/07/2021 01:06:32 - INFO - __main__ - Step 27705: {'lr': 0.00046370165685548484, 'samples': 5319360, 'steps': 27704, 'loss/train': 1.472056269645691} 11/07/2021 01:06:33 - INFO - __main__ - Step 27706: {'lr': 0.00046369890289011696, 'samples': 5319552, 'steps': 27705, 'loss/train': 1.6031389236450195} 11/07/2021 01:06:33 - INFO - __main__ - Step 27707: {'lr': 0.0004636961488284597, 'samples': 5319744, 'steps': 27706, 'loss/train': 1.5327539443969727} 11/07/2021 01:06:33 - INFO - __main__ - Step 27708: {'lr': 0.0004636933946705142, 'samples': 5319936, 'steps': 27707, 'loss/train': 0.7630791664123535} 11/07/2021 01:06:34 - INFO - __main__ - Step 27709: {'lr': 0.00046369064041628175, 'samples': 5320128, 'steps': 27708, 'loss/train': 0.810263991355896} 11/07/2021 01:06:35 - INFO - __main__ - Step 27710: {'lr': 0.00046368788606576363, 'samples': 5320320, 'steps': 27709, 'loss/train': 1.462875247001648} 11/07/2021 01:06:35 - INFO - __main__ - Step 27711: {'lr': 0.00046368513161896104, 'samples': 5320512, 'steps': 27710, 'loss/train': 1.803904414176941} 11/07/2021 01:06:36 - INFO - __main__ - Step 27712: {'lr': 0.0004636823770758752, 'samples': 5320704, 'steps': 27711, 'loss/train': 2.0148212909698486} 11/07/2021 01:06:36 - INFO - __main__ - Step 27713: {'lr': 0.0004636796224365074, 'samples': 5320896, 'steps': 27712, 'loss/train': 1.991600751876831} 11/07/2021 01:06:37 - INFO - __main__ - Step 27714: {'lr': 0.0004636768677008588, 'samples': 5321088, 'steps': 27713, 'loss/train': 1.4234155416488647} 11/07/2021 01:06:37 - INFO - __main__ - Step 27715: {'lr': 0.0004636741128689308, 'samples': 5321280, 'steps': 27714, 'loss/train': 2.0229625701904297} 11/07/2021 01:06:38 - INFO - __main__ - Step 27716: {'lr': 0.00046367135794072445, 'samples': 5321472, 'steps': 27715, 'loss/train': 1.2980085611343384} 11/07/2021 01:06:38 - INFO - __main__ - Step 27717: {'lr': 0.0004636686029162411, 'samples': 5321664, 'steps': 27716, 'loss/train': 1.2549235820770264} 11/07/2021 01:06:38 - INFO - __main__ - Step 27718: {'lr': 0.000463665847795482, 'samples': 5321856, 'steps': 27717, 'loss/train': 1.4861565828323364} 11/07/2021 01:06:39 - INFO - __main__ - Step 27719: {'lr': 0.0004636630925784484, 'samples': 5322048, 'steps': 27718, 'loss/train': 1.936221957206726} 11/07/2021 01:06:40 - INFO - __main__ - Step 27720: {'lr': 0.0004636603372651415, 'samples': 5322240, 'steps': 27719, 'loss/train': 1.259960412979126} 11/07/2021 01:06:40 - INFO - __main__ - Step 27721: {'lr': 0.0004636575818555625, 'samples': 5322432, 'steps': 27720, 'loss/train': 1.568810224533081} 11/07/2021 01:06:40 - INFO - __main__ - Step 27722: {'lr': 0.00046365482634971275, 'samples': 5322624, 'steps': 27721, 'loss/train': 1.6244637966156006} 11/07/2021 01:06:41 - INFO - __main__ - Step 27723: {'lr': 0.00046365207074759344, 'samples': 5322816, 'steps': 27722, 'loss/train': 1.82841157913208} 11/07/2021 01:06:41 - INFO - __main__ - Step 27724: {'lr': 0.0004636493150492057, 'samples': 5323008, 'steps': 27723, 'loss/train': 0.8045539855957031} 11/07/2021 01:06:42 - INFO - __main__ - Step 27725: {'lr': 0.00046364655925455094, 'samples': 5323200, 'steps': 27724, 'loss/train': 1.3680577278137207} 11/07/2021 01:06:43 - INFO - __main__ - Step 27726: {'lr': 0.0004636438033636303, 'samples': 5323392, 'steps': 27725, 'loss/train': 1.5489109754562378} 11/07/2021 01:06:43 - INFO - __main__ - Step 27727: {'lr': 0.00046364104737644515, 'samples': 5323584, 'steps': 27726, 'loss/train': 1.6654754877090454} 11/07/2021 01:06:43 - INFO - __main__ - Step 27728: {'lr': 0.00046363829129299655, 'samples': 5323776, 'steps': 27727, 'loss/train': 1.5989880561828613} 11/07/2021 01:06:44 - INFO - __main__ - Step 27729: {'lr': 0.0004636355351132859, 'samples': 5323968, 'steps': 27728, 'loss/train': 1.1280689239501953} 11/07/2021 01:06:45 - INFO - __main__ - Step 27730: {'lr': 0.00046363277883731437, 'samples': 5324160, 'steps': 27729, 'loss/train': 1.599190354347229} 11/07/2021 01:06:45 - INFO - __main__ - Step 27731: {'lr': 0.0004636300224650831, 'samples': 5324352, 'steps': 27730, 'loss/train': 1.2962560653686523} 11/07/2021 01:06:45 - INFO - __main__ - Step 27732: {'lr': 0.00046362726599659355, 'samples': 5324544, 'steps': 27731, 'loss/train': 1.2981113195419312} 11/07/2021 01:06:46 - INFO - __main__ - Step 27733: {'lr': 0.0004636245094318468, 'samples': 5324736, 'steps': 27732, 'loss/train': 1.5841000080108643} 11/07/2021 01:06:46 - INFO - __main__ - Step 27734: {'lr': 0.0004636217527708442, 'samples': 5324928, 'steps': 27733, 'loss/train': 1.590748906135559} 11/07/2021 01:06:47 - INFO - __main__ - Step 27735: {'lr': 0.0004636189960135869, 'samples': 5325120, 'steps': 27734, 'loss/train': 1.6335985660552979} 11/07/2021 01:06:48 - INFO - __main__ - Step 27736: {'lr': 0.0004636162391600761, 'samples': 5325312, 'steps': 27735, 'loss/train': 1.3171145915985107} 11/07/2021 01:06:48 - INFO - __main__ - Step 27737: {'lr': 0.00046361348221031316, 'samples': 5325504, 'steps': 27736, 'loss/train': 1.492238998413086} 11/07/2021 01:06:48 - INFO - __main__ - Step 27738: {'lr': 0.00046361072516429936, 'samples': 5325696, 'steps': 27737, 'loss/train': 1.0480579137802124} 11/07/2021 01:06:49 - INFO - __main__ - Step 27739: {'lr': 0.0004636079680220358, 'samples': 5325888, 'steps': 27738, 'loss/train': 1.7593157291412354} 11/07/2021 01:06:50 - INFO - __main__ - Step 27740: {'lr': 0.0004636052107835238, 'samples': 5326080, 'steps': 27739, 'loss/train': 1.6487047672271729} 11/07/2021 01:06:50 - INFO - __main__ - Step 27741: {'lr': 0.0004636024534487646, 'samples': 5326272, 'steps': 27740, 'loss/train': 1.5217450857162476} 11/07/2021 01:06:51 - INFO - __main__ - Step 27742: {'lr': 0.0004635996960177594, 'samples': 5326464, 'steps': 27741, 'loss/train': 1.5599013566970825} 11/07/2021 01:06:51 - INFO - __main__ - Step 27743: {'lr': 0.0004635969384905095, 'samples': 5326656, 'steps': 27742, 'loss/train': 1.571807622909546} 11/07/2021 01:06:51 - INFO - __main__ - Step 27744: {'lr': 0.0004635941808670161, 'samples': 5326848, 'steps': 27743, 'loss/train': 1.3286199569702148} 11/07/2021 01:06:52 - INFO - __main__ - Step 27745: {'lr': 0.00046359142314728047, 'samples': 5327040, 'steps': 27744, 'loss/train': 1.714626431465149} 11/07/2021 01:06:53 - INFO - __main__ - Step 27746: {'lr': 0.00046358866533130385, 'samples': 5327232, 'steps': 27745, 'loss/train': 1.262636423110962} 11/07/2021 01:06:53 - INFO - __main__ - Step 27747: {'lr': 0.00046358590741908744, 'samples': 5327424, 'steps': 27746, 'loss/train': 1.909021258354187} 11/07/2021 01:06:53 - INFO - __main__ - Step 27748: {'lr': 0.0004635831494106325, 'samples': 5327616, 'steps': 27747, 'loss/train': 1.4999247789382935} 11/07/2021 01:06:54 - INFO - __main__ - Step 27749: {'lr': 0.0004635803913059404, 'samples': 5327808, 'steps': 27748, 'loss/train': 1.1166845560073853} 11/07/2021 01:06:54 - INFO - __main__ - Step 27750: {'lr': 0.00046357763310501216, 'samples': 5328000, 'steps': 27749, 'loss/train': 1.756516456604004} 11/07/2021 01:06:55 - INFO - __main__ - Step 27751: {'lr': 0.0004635748748078492, 'samples': 5328192, 'steps': 27750, 'loss/train': 1.2605432271957397} 11/07/2021 01:06:55 - INFO - __main__ - Step 27752: {'lr': 0.0004635721164144526, 'samples': 5328384, 'steps': 27751, 'loss/train': 1.455724835395813} 11/07/2021 01:06:56 - INFO - __main__ - Step 27753: {'lr': 0.0004635693579248238, 'samples': 5328576, 'steps': 27752, 'loss/train': 1.5267298221588135} 11/07/2021 01:06:56 - INFO - __main__ - Step 27754: {'lr': 0.00046356659933896393, 'samples': 5328768, 'steps': 27753, 'loss/train': 1.2926586866378784} 11/07/2021 01:06:56 - INFO - __main__ - Step 27755: {'lr': 0.0004635638406568742, 'samples': 5328960, 'steps': 27754, 'loss/train': 1.6973003149032593} 11/07/2021 01:06:57 - INFO - __main__ - Step 27756: {'lr': 0.00046356108187855594, 'samples': 5329152, 'steps': 27755, 'loss/train': 1.769636869430542} 11/07/2021 01:06:58 - INFO - __main__ - Step 27757: {'lr': 0.00046355832300401035, 'samples': 5329344, 'steps': 27756, 'loss/train': 1.4013556241989136} 11/07/2021 01:06:58 - INFO - __main__ - Step 27758: {'lr': 0.0004635555640332386, 'samples': 5329536, 'steps': 27757, 'loss/train': 1.6594982147216797} 11/07/2021 01:06:59 - INFO - __main__ - Step 27759: {'lr': 0.0004635528049662421, 'samples': 5329728, 'steps': 27758, 'loss/train': 1.3016235828399658} 11/07/2021 01:06:59 - INFO - __main__ - Step 27760: {'lr': 0.000463550045803022, 'samples': 5329920, 'steps': 27759, 'loss/train': 1.2639565467834473} 11/07/2021 01:07:00 - INFO - __main__ - Step 27761: {'lr': 0.00046354728654357947, 'samples': 5330112, 'steps': 27760, 'loss/train': 2.085947275161743} 11/07/2021 01:07:01 - INFO - __main__ - Step 27762: {'lr': 0.00046354452718791586, 'samples': 5330304, 'steps': 27761, 'loss/train': 0.21334145963191986} 11/07/2021 01:07:01 - INFO - __main__ - Step 27763: {'lr': 0.0004635417677360324, 'samples': 5330496, 'steps': 27762, 'loss/train': 1.1936448812484741} 11/07/2021 01:07:01 - INFO - __main__ - Step 27764: {'lr': 0.0004635390081879303, 'samples': 5330688, 'steps': 27763, 'loss/train': 1.293779730796814} 11/07/2021 01:07:02 - INFO - __main__ - Step 27765: {'lr': 0.0004635362485436109, 'samples': 5330880, 'steps': 27764, 'loss/train': 1.78763747215271} 11/07/2021 01:07:03 - INFO - __main__ - Step 27766: {'lr': 0.00046353348880307524, 'samples': 5331072, 'steps': 27765, 'loss/train': 1.6645318269729614} 11/07/2021 01:07:03 - INFO - __main__ - Step 27767: {'lr': 0.0004635307289663248, 'samples': 5331264, 'steps': 27766, 'loss/train': 0.7043894529342651} 11/07/2021 01:07:03 - INFO - __main__ - Step 27768: {'lr': 0.0004635279690333606, 'samples': 5331456, 'steps': 27767, 'loss/train': 1.8142160177230835} 11/07/2021 01:07:04 - INFO - __main__ - Step 27769: {'lr': 0.00046352520900418403, 'samples': 5331648, 'steps': 27768, 'loss/train': 1.401672124862671} 11/07/2021 01:07:04 - INFO - __main__ - Step 27770: {'lr': 0.00046352244887879623, 'samples': 5331840, 'steps': 27769, 'loss/train': 1.6772185564041138} 11/07/2021 01:07:05 - INFO - __main__ - Step 27771: {'lr': 0.0004635196886571986, 'samples': 5332032, 'steps': 27770, 'loss/train': 1.560492753982544} 11/07/2021 01:07:06 - INFO - __main__ - Step 27772: {'lr': 0.0004635169283393923, 'samples': 5332224, 'steps': 27771, 'loss/train': 1.3133739233016968} 11/07/2021 01:07:06 - INFO - __main__ - Step 27773: {'lr': 0.0004635141679253785, 'samples': 5332416, 'steps': 27772, 'loss/train': 1.5891863107681274} 11/07/2021 01:07:06 - INFO - __main__ - Step 27774: {'lr': 0.0004635114074151586, 'samples': 5332608, 'steps': 27773, 'loss/train': 1.5065265893936157} 11/07/2021 01:07:07 - INFO - __main__ - Step 27775: {'lr': 0.00046350864680873375, 'samples': 5332800, 'steps': 27774, 'loss/train': 1.1675381660461426} 11/07/2021 01:07:08 - INFO - __main__ - Step 27776: {'lr': 0.0004635058861061051, 'samples': 5332992, 'steps': 27775, 'loss/train': 1.5662585496902466} 11/07/2021 01:07:08 - INFO - __main__ - Step 27777: {'lr': 0.00046350312530727403, 'samples': 5333184, 'steps': 27776, 'loss/train': 1.0655912160873413} 11/07/2021 01:07:08 - INFO - __main__ - Step 27778: {'lr': 0.00046350036441224175, 'samples': 5333376, 'steps': 27777, 'loss/train': 1.738042950630188} 11/07/2021 01:07:09 - INFO - __main__ - Step 27779: {'lr': 0.00046349760342100955, 'samples': 5333568, 'steps': 27778, 'loss/train': 1.7702325582504272} 11/07/2021 01:07:09 - INFO - __main__ - Step 27780: {'lr': 0.00046349484233357854, 'samples': 5333760, 'steps': 27779, 'loss/train': 1.5026328563690186} 11/07/2021 01:07:09 - INFO - __main__ - Step 27781: {'lr': 0.0004634920811499501, 'samples': 5333952, 'steps': 27780, 'loss/train': 1.67792546749115} 11/07/2021 01:07:11 - INFO - __main__ - Step 27782: {'lr': 0.00046348931987012543, 'samples': 5334144, 'steps': 27781, 'loss/train': 0.21696703135967255} 11/07/2021 01:07:11 - INFO - __main__ - Step 27783: {'lr': 0.00046348655849410577, 'samples': 5334336, 'steps': 27782, 'loss/train': 1.6288264989852905} 11/07/2021 01:07:11 - INFO - __main__ - Step 27784: {'lr': 0.0004634837970218924, 'samples': 5334528, 'steps': 27783, 'loss/train': 0.30237165093421936} 11/07/2021 01:07:12 - INFO - __main__ - Step 27785: {'lr': 0.0004634810354534864, 'samples': 5334720, 'steps': 27784, 'loss/train': 1.7129555940628052} 11/07/2021 01:07:12 - INFO - __main__ - Step 27786: {'lr': 0.0004634782737888892, 'samples': 5334912, 'steps': 27785, 'loss/train': 0.9289341568946838} 11/07/2021 01:07:13 - INFO - __main__ - Step 27787: {'lr': 0.000463475512028102, 'samples': 5335104, 'steps': 27786, 'loss/train': 1.6294125318527222} 11/07/2021 01:07:14 - INFO - __main__ - Step 27788: {'lr': 0.000463472750171126, 'samples': 5335296, 'steps': 27787, 'loss/train': 1.5533576011657715} 11/07/2021 01:07:14 - INFO - __main__ - Step 27789: {'lr': 0.0004634699882179625, 'samples': 5335488, 'steps': 27788, 'loss/train': 1.5945382118225098} 11/07/2021 01:07:14 - INFO - __main__ - Step 27790: {'lr': 0.0004634672261686127, 'samples': 5335680, 'steps': 27789, 'loss/train': 1.3665266036987305} 11/07/2021 01:07:15 - INFO - __main__ - Step 27791: {'lr': 0.0004634644640230779, 'samples': 5335872, 'steps': 27790, 'loss/train': 1.4442942142486572} 11/07/2021 01:07:16 - INFO - __main__ - Step 27792: {'lr': 0.0004634617017813593, 'samples': 5336064, 'steps': 27791, 'loss/train': 1.4849623441696167} 11/07/2021 01:07:16 - INFO - __main__ - Step 27793: {'lr': 0.00046345893944345806, 'samples': 5336256, 'steps': 27792, 'loss/train': 1.4545289278030396} 11/07/2021 01:07:16 - INFO - __main__ - Step 27794: {'lr': 0.00046345617700937564, 'samples': 5336448, 'steps': 27793, 'loss/train': 1.5662834644317627} 11/07/2021 01:07:17 - INFO - __main__ - Step 27795: {'lr': 0.0004634534144791131, 'samples': 5336640, 'steps': 27794, 'loss/train': 1.4726775884628296} 11/07/2021 01:07:17 - INFO - __main__ - Step 27796: {'lr': 0.0004634506518526718, 'samples': 5336832, 'steps': 27795, 'loss/train': 1.4754371643066406} 11/07/2021 01:07:18 - INFO - __main__ - Step 27797: {'lr': 0.00046344788913005286, 'samples': 5337024, 'steps': 27796, 'loss/train': 1.5549851655960083} 11/07/2021 01:07:19 - INFO - __main__ - Step 27798: {'lr': 0.00046344512631125756, 'samples': 5337216, 'steps': 27797, 'loss/train': 1.6476571559906006} 11/07/2021 01:07:19 - INFO - __main__ - Step 27799: {'lr': 0.00046344236339628724, 'samples': 5337408, 'steps': 27798, 'loss/train': 1.7478976249694824} 11/07/2021 01:07:19 - INFO - __main__ - Step 27800: {'lr': 0.0004634396003851431, 'samples': 5337600, 'steps': 27799, 'loss/train': 1.513837456703186} 11/07/2021 01:07:20 - INFO - __main__ - Step 27801: {'lr': 0.00046343683727782635, 'samples': 5337792, 'steps': 27800, 'loss/train': 0.918851375579834} 11/07/2021 01:07:21 - INFO - __main__ - Step 27802: {'lr': 0.0004634340740743382, 'samples': 5337984, 'steps': 27801, 'loss/train': 1.074940800666809} 11/07/2021 01:07:21 - INFO - __main__ - Step 27803: {'lr': 0.00046343131077468, 'samples': 5338176, 'steps': 27802, 'loss/train': 1.383703589439392} 11/07/2021 01:07:22 - INFO - __main__ - Step 27804: {'lr': 0.00046342854737885296, 'samples': 5338368, 'steps': 27803, 'loss/train': 0.3601319193840027} 11/07/2021 01:07:22 - INFO - __main__ - Step 27805: {'lr': 0.00046342578388685837, 'samples': 5338560, 'steps': 27804, 'loss/train': 1.6942323446273804} 11/07/2021 01:07:22 - INFO - __main__ - Step 27806: {'lr': 0.0004634230202986973, 'samples': 5338752, 'steps': 27805, 'loss/train': 1.3652087450027466} 11/07/2021 01:07:23 - INFO - __main__ - Step 27807: {'lr': 0.0004634202566143712, 'samples': 5338944, 'steps': 27806, 'loss/train': 1.6079967021942139} 11/07/2021 01:07:23 - INFO - __main__ - Step 27808: {'lr': 0.00046341749283388117, 'samples': 5339136, 'steps': 27807, 'loss/train': 1.3260587453842163} 11/07/2021 01:07:24 - INFO - __main__ - Step 27809: {'lr': 0.0004634147289572285, 'samples': 5339328, 'steps': 27808, 'loss/train': 1.6334463357925415} 11/07/2021 01:07:24 - INFO - __main__ - Step 27810: {'lr': 0.00046341196498441453, 'samples': 5339520, 'steps': 27809, 'loss/train': 1.769137978553772} 11/07/2021 01:07:25 - INFO - __main__ - Step 27811: {'lr': 0.0004634092009154403, 'samples': 5339712, 'steps': 27810, 'loss/train': 1.2536112070083618} 11/07/2021 01:07:25 - INFO - __main__ - Step 27812: {'lr': 0.0004634064367503072, 'samples': 5339904, 'steps': 27811, 'loss/train': 1.5238810777664185} 11/07/2021 01:07:26 - INFO - __main__ - Step 27813: {'lr': 0.00046340367248901655, 'samples': 5340096, 'steps': 27812, 'loss/train': 1.3737224340438843} 11/07/2021 01:07:27 - INFO - __main__ - Step 27814: {'lr': 0.00046340090813156944, 'samples': 5340288, 'steps': 27813, 'loss/train': 1.1866050958633423} 11/07/2021 01:07:27 - INFO - __main__ - Step 27815: {'lr': 0.00046339814367796716, 'samples': 5340480, 'steps': 27814, 'loss/train': 1.5412652492523193} 11/07/2021 01:07:28 - INFO - __main__ - Step 27816: {'lr': 0.00046339537912821094, 'samples': 5340672, 'steps': 27815, 'loss/train': 1.8829264640808105} 11/07/2021 01:07:28 - INFO - __main__ - Step 27817: {'lr': 0.0004633926144823022, 'samples': 5340864, 'steps': 27816, 'loss/train': 1.5201081037521362} 11/07/2021 01:07:28 - INFO - __main__ - Step 27818: {'lr': 0.0004633898497402419, 'samples': 5341056, 'steps': 27817, 'loss/train': 0.9563632607460022} 11/07/2021 01:07:29 - INFO - __main__ - Step 27819: {'lr': 0.0004633870849020314, 'samples': 5341248, 'steps': 27818, 'loss/train': 1.2424542903900146} 11/07/2021 01:07:30 - INFO - __main__ - Step 27820: {'lr': 0.00046338431996767205, 'samples': 5341440, 'steps': 27819, 'loss/train': 1.699521780014038} 11/07/2021 01:07:30 - INFO - __main__ - Step 27821: {'lr': 0.00046338155493716503, 'samples': 5341632, 'steps': 27820, 'loss/train': 1.3380119800567627} 11/07/2021 01:07:30 - INFO - __main__ - Step 27822: {'lr': 0.0004633787898105115, 'samples': 5341824, 'steps': 27821, 'loss/train': 2.1813955307006836} 11/07/2021 01:07:31 - INFO - __main__ - Step 27823: {'lr': 0.0004633760245877129, 'samples': 5342016, 'steps': 27822, 'loss/train': 1.960642695426941} 11/07/2021 01:07:31 - INFO - __main__ - Step 27824: {'lr': 0.0004633732592687703, 'samples': 5342208, 'steps': 27823, 'loss/train': 1.445009469985962} 11/07/2021 01:07:32 - INFO - __main__ - Step 27825: {'lr': 0.00046337049385368495, 'samples': 5342400, 'steps': 27824, 'loss/train': 1.9721317291259766} 11/07/2021 01:07:33 - INFO - __main__ - Step 27826: {'lr': 0.00046336772834245824, 'samples': 5342592, 'steps': 27825, 'loss/train': 1.5805824995040894} 11/07/2021 01:07:33 - INFO - __main__ - Step 27827: {'lr': 0.0004633649627350912, 'samples': 5342784, 'steps': 27826, 'loss/train': 1.6834359169006348} 11/07/2021 01:07:33 - INFO - __main__ - Step 27828: {'lr': 0.00046336219703158526, 'samples': 5342976, 'steps': 27827, 'loss/train': 1.6386125087738037} 11/07/2021 01:07:34 - INFO - __main__ - Step 27829: {'lr': 0.00046335943123194164, 'samples': 5343168, 'steps': 27828, 'loss/train': 1.5754843950271606} 11/07/2021 01:07:35 - INFO - __main__ - Step 27830: {'lr': 0.0004633566653361615, 'samples': 5343360, 'steps': 27829, 'loss/train': 1.437250018119812} 11/07/2021 01:07:35 - INFO - __main__ - Step 27831: {'lr': 0.0004633538993442462, 'samples': 5343552, 'steps': 27830, 'loss/train': 1.7498289346694946} 11/07/2021 01:07:35 - INFO - __main__ - Step 27832: {'lr': 0.00046335113325619685, 'samples': 5343744, 'steps': 27831, 'loss/train': 1.5722873210906982} 11/07/2021 01:07:36 - INFO - __main__ - Step 27833: {'lr': 0.00046334836707201486, 'samples': 5343936, 'steps': 27832, 'loss/train': 1.515210509300232} 11/07/2021 01:07:36 - INFO - __main__ - Step 27834: {'lr': 0.0004633456007917013, 'samples': 5344128, 'steps': 27833, 'loss/train': 1.1740343570709229} 11/07/2021 01:07:37 - INFO - __main__ - Step 27835: {'lr': 0.0004633428344152576, 'samples': 5344320, 'steps': 27834, 'loss/train': 1.6871955394744873} 11/07/2021 01:07:38 - INFO - __main__ - Step 27836: {'lr': 0.0004633400679426848, 'samples': 5344512, 'steps': 27835, 'loss/train': 1.7944591045379639} 11/07/2021 01:07:38 - INFO - __main__ - Step 27837: {'lr': 0.00046333730137398433, 'samples': 5344704, 'steps': 27836, 'loss/train': 1.3786191940307617} 11/07/2021 01:07:38 - INFO - __main__ - Step 27838: {'lr': 0.00046333453470915736, 'samples': 5344896, 'steps': 27837, 'loss/train': 1.5384540557861328} 11/07/2021 01:07:39 - INFO - __main__ - Step 27839: {'lr': 0.0004633317679482051, 'samples': 5345088, 'steps': 27838, 'loss/train': 1.5437536239624023} 11/07/2021 01:07:39 - INFO - __main__ - Step 27840: {'lr': 0.00046332900109112893, 'samples': 5345280, 'steps': 27839, 'loss/train': 1.3671104907989502} 11/07/2021 01:07:40 - INFO - __main__ - Step 27841: {'lr': 0.0004633262341379299, 'samples': 5345472, 'steps': 27840, 'loss/train': 1.2937971353530884} 11/07/2021 01:07:41 - INFO - __main__ - Step 27842: {'lr': 0.0004633234670886094, 'samples': 5345664, 'steps': 27841, 'loss/train': 1.469732642173767} 11/07/2021 01:07:41 - INFO - __main__ - Step 27843: {'lr': 0.0004633206999431686, 'samples': 5345856, 'steps': 27842, 'loss/train': 1.695931077003479} 11/07/2021 01:07:41 - INFO - __main__ - Step 27844: {'lr': 0.00046331793270160885, 'samples': 5346048, 'steps': 27843, 'loss/train': 1.369606614112854} 11/07/2021 01:07:42 - INFO - __main__ - Step 27845: {'lr': 0.0004633151653639314, 'samples': 5346240, 'steps': 27844, 'loss/train': 1.5834770202636719} 11/07/2021 01:07:43 - INFO - __main__ - Step 27846: {'lr': 0.00046331239793013726, 'samples': 5346432, 'steps': 27845, 'loss/train': 1.5611268281936646} 11/07/2021 01:07:43 - INFO - __main__ - Step 27847: {'lr': 0.0004633096304002279, 'samples': 5346624, 'steps': 27846, 'loss/train': 1.500084638595581} 11/07/2021 01:07:43 - INFO - __main__ - Step 27848: {'lr': 0.00046330686277420454, 'samples': 5346816, 'steps': 27847, 'loss/train': 2.1239893436431885} 11/07/2021 01:07:44 - INFO - __main__ - Step 27849: {'lr': 0.00046330409505206837, 'samples': 5347008, 'steps': 27848, 'loss/train': 1.8391716480255127} 11/07/2021 01:07:45 - INFO - __main__ - Step 27850: {'lr': 0.00046330132723382066, 'samples': 5347200, 'steps': 27849, 'loss/train': 1.156402349472046} 11/07/2021 01:07:45 - INFO - __main__ - Step 27851: {'lr': 0.0004632985593194627, 'samples': 5347392, 'steps': 27850, 'loss/train': 1.3428056240081787} 11/07/2021 01:07:46 - INFO - __main__ - Step 27852: {'lr': 0.00046329579130899567, 'samples': 5347584, 'steps': 27851, 'loss/train': 1.7555490732192993} 11/07/2021 01:07:46 - INFO - __main__ - Step 27853: {'lr': 0.0004632930232024209, 'samples': 5347776, 'steps': 27852, 'loss/train': 1.594679832458496} 11/07/2021 01:07:46 - INFO - __main__ - Step 27854: {'lr': 0.0004632902549997395, 'samples': 5347968, 'steps': 27853, 'loss/train': 1.824711799621582} 11/07/2021 01:07:47 - INFO - __main__ - Step 27855: {'lr': 0.00046328748670095287, 'samples': 5348160, 'steps': 27854, 'loss/train': 1.6588963270187378} 11/07/2021 01:07:48 - INFO - __main__ - Step 27856: {'lr': 0.0004632847183060622, 'samples': 5348352, 'steps': 27855, 'loss/train': 1.4669275283813477} 11/07/2021 01:07:48 - INFO - __main__ - Step 27857: {'lr': 0.0004632819498150688, 'samples': 5348544, 'steps': 27856, 'loss/train': 1.2879362106323242} 11/07/2021 01:07:48 - INFO - __main__ - Step 27858: {'lr': 0.00046327918122797363, 'samples': 5348736, 'steps': 27857, 'loss/train': 1.6480536460876465} 11/07/2021 01:07:49 - INFO - __main__ - Step 27859: {'lr': 0.00046327641254477833, 'samples': 5348928, 'steps': 27858, 'loss/train': 1.6864488124847412} 11/07/2021 01:07:49 - INFO - __main__ - Step 27860: {'lr': 0.00046327364376548384, 'samples': 5349120, 'steps': 27859, 'loss/train': 1.5656856298446655} 11/07/2021 01:07:50 - INFO - __main__ - Step 27861: {'lr': 0.0004632708748900917, 'samples': 5349312, 'steps': 27860, 'loss/train': 0.713880717754364} 11/07/2021 01:07:50 - INFO - __main__ - Step 27862: {'lr': 0.00046326810591860285, 'samples': 5349504, 'steps': 27861, 'loss/train': 1.2501379251480103} 11/07/2021 01:07:51 - INFO - __main__ - Step 27863: {'lr': 0.0004632653368510187, 'samples': 5349696, 'steps': 27862, 'loss/train': 1.0524699687957764} 11/07/2021 01:07:51 - INFO - __main__ - Step 27864: {'lr': 0.00046326256768734053, 'samples': 5349888, 'steps': 27863, 'loss/train': 1.8845356702804565} 11/07/2021 01:07:51 - INFO - __main__ - Step 27865: {'lr': 0.0004632597984275695, 'samples': 5350080, 'steps': 27864, 'loss/train': 1.9486255645751953} 11/07/2021 01:07:53 - INFO - __main__ - Step 27866: {'lr': 0.00046325702907170697, 'samples': 5350272, 'steps': 27865, 'loss/train': 1.428919792175293} 11/07/2021 01:07:53 - INFO - __main__ - Step 27867: {'lr': 0.000463254259619754, 'samples': 5350464, 'steps': 27866, 'loss/train': 1.1179231405258179} 11/07/2021 01:07:53 - INFO - __main__ - Step 27868: {'lr': 0.000463251490071712, 'samples': 5350656, 'steps': 27867, 'loss/train': 0.8844492435455322} 11/07/2021 01:07:54 - INFO - __main__ - Step 27869: {'lr': 0.0004632487204275822, 'samples': 5350848, 'steps': 27868, 'loss/train': 2.0191144943237305} 11/07/2021 01:07:54 - INFO - __main__ - Step 27870: {'lr': 0.0004632459506873658, 'samples': 5351040, 'steps': 27869, 'loss/train': 1.6613689661026} 11/07/2021 01:07:54 - INFO - __main__ - Step 27871: {'lr': 0.0004632431808510641, 'samples': 5351232, 'steps': 27870, 'loss/train': 1.1953953504562378} 11/07/2021 01:07:55 - INFO - __main__ - Step 27872: {'lr': 0.0004632404109186782, 'samples': 5351424, 'steps': 27871, 'loss/train': 1.4809138774871826} 11/07/2021 01:07:56 - INFO - __main__ - Step 27873: {'lr': 0.0004632376408902096, 'samples': 5351616, 'steps': 27872, 'loss/train': 1.6379996538162231} 11/07/2021 01:07:56 - INFO - __main__ - Step 27874: {'lr': 0.0004632348707656593, 'samples': 5351808, 'steps': 27873, 'loss/train': 0.8251739740371704} 11/07/2021 01:07:56 - INFO - __main__ - Step 27875: {'lr': 0.00046323210054502874, 'samples': 5352000, 'steps': 27874, 'loss/train': 1.391992211341858} 11/07/2021 01:07:57 - INFO - __main__ - Step 27876: {'lr': 0.00046322933022831903, 'samples': 5352192, 'steps': 27875, 'loss/train': 1.0245487689971924} 11/07/2021 01:07:58 - INFO - __main__ - Step 27877: {'lr': 0.0004632265598155315, 'samples': 5352384, 'steps': 27876, 'loss/train': 1.4199742078781128} 11/07/2021 01:07:58 - INFO - __main__ - Step 27878: {'lr': 0.00046322378930666736, 'samples': 5352576, 'steps': 27877, 'loss/train': 2.1536476612091064} 11/07/2021 01:07:59 - INFO - __main__ - Step 27879: {'lr': 0.0004632210187017278, 'samples': 5352768, 'steps': 27878, 'loss/train': 1.9955917596817017} 11/07/2021 01:07:59 - INFO - __main__ - Step 27880: {'lr': 0.00046321824800071425, 'samples': 5352960, 'steps': 27879, 'loss/train': 1.3227732181549072} 11/07/2021 01:07:59 - INFO - __main__ - Step 27881: {'lr': 0.0004632154772036279, 'samples': 5353152, 'steps': 27880, 'loss/train': 1.6037013530731201} 11/07/2021 01:08:00 - INFO - __main__ - Step 27882: {'lr': 0.0004632127063104698, 'samples': 5353344, 'steps': 27881, 'loss/train': 1.2945456504821777} 11/07/2021 01:08:01 - INFO - __main__ - Step 27883: {'lr': 0.00046320993532124137, 'samples': 5353536, 'steps': 27882, 'loss/train': 1.5950944423675537} 11/07/2021 01:08:01 - INFO - __main__ - Step 27884: {'lr': 0.0004632071642359439, 'samples': 5353728, 'steps': 27883, 'loss/train': 1.419105052947998} 11/07/2021 01:08:01 - INFO - __main__ - Step 27885: {'lr': 0.0004632043930545785, 'samples': 5353920, 'steps': 27884, 'loss/train': 1.512109398841858} 11/07/2021 01:08:02 - INFO - __main__ - Step 27886: {'lr': 0.00046320162177714653, 'samples': 5354112, 'steps': 27885, 'loss/train': 1.366862416267395} 11/07/2021 01:08:03 - INFO - __main__ - Step 27887: {'lr': 0.00046319885040364925, 'samples': 5354304, 'steps': 27886, 'loss/train': 1.5487957000732422} 11/07/2021 01:08:04 - INFO - __main__ - Step 27888: {'lr': 0.00046319607893408776, 'samples': 5354496, 'steps': 27887, 'loss/train': 2.327791452407837} 11/07/2021 01:08:04 - INFO - __main__ - Step 27889: {'lr': 0.0004631933073684635, 'samples': 5354688, 'steps': 27888, 'loss/train': 1.4284238815307617} 11/07/2021 01:08:04 - INFO - __main__ - Step 27890: {'lr': 0.00046319053570677754, 'samples': 5354880, 'steps': 27889, 'loss/train': 1.7574843168258667} 11/07/2021 01:08:05 - INFO - __main__ - Step 27891: {'lr': 0.0004631877639490313, 'samples': 5355072, 'steps': 27890, 'loss/train': 1.6959110498428345} 11/07/2021 01:08:05 - INFO - __main__ - Step 27892: {'lr': 0.0004631849920952259, 'samples': 5355264, 'steps': 27891, 'loss/train': 1.7205824851989746} 11/07/2021 01:08:06 - INFO - __main__ - Step 27893: {'lr': 0.0004631822201453626, 'samples': 5355456, 'steps': 27892, 'loss/train': 1.7389143705368042} 11/07/2021 01:08:06 - INFO - __main__ - Step 27894: {'lr': 0.0004631794480994427, 'samples': 5355648, 'steps': 27893, 'loss/train': 1.2902930974960327} 11/07/2021 01:08:07 - INFO - __main__ - Step 27895: {'lr': 0.0004631766759574675, 'samples': 5355840, 'steps': 27894, 'loss/train': 1.7765417098999023} 11/07/2021 01:08:07 - INFO - __main__ - Step 27896: {'lr': 0.0004631739037194381, 'samples': 5356032, 'steps': 27895, 'loss/train': 1.2097101211547852} 11/07/2021 01:08:08 - INFO - __main__ - Step 27897: {'lr': 0.00046317113138535584, 'samples': 5356224, 'steps': 27896, 'loss/train': 1.501318097114563} 11/07/2021 01:08:08 - INFO - __main__ - Step 27898: {'lr': 0.0004631683589552219, 'samples': 5356416, 'steps': 27897, 'loss/train': 1.8397634029388428} 11/07/2021 01:08:09 - INFO - __main__ - Step 27899: {'lr': 0.00046316558642903774, 'samples': 5356608, 'steps': 27898, 'loss/train': 1.658473014831543} 11/07/2021 01:08:10 - INFO - __main__ - Step 27900: {'lr': 0.0004631628138068043, 'samples': 5356800, 'steps': 27899, 'loss/train': 1.2211940288543701} 11/07/2021 01:08:10 - INFO - __main__ - Step 27901: {'lr': 0.00046316004108852305, 'samples': 5356992, 'steps': 27900, 'loss/train': 1.6784851551055908} 11/07/2021 01:08:10 - INFO - __main__ - Step 27902: {'lr': 0.0004631572682741952, 'samples': 5357184, 'steps': 27901, 'loss/train': 1.5503257513046265} 11/07/2021 01:08:11 - INFO - __main__ - Step 27903: {'lr': 0.0004631544953638219, 'samples': 5357376, 'steps': 27902, 'loss/train': 2.5688607692718506} 11/07/2021 01:08:12 - INFO - __main__ - Step 27904: {'lr': 0.00046315172235740455, 'samples': 5357568, 'steps': 27903, 'loss/train': 1.378450632095337} 11/07/2021 01:08:12 - INFO - __main__ - Step 27905: {'lr': 0.0004631489492549443, 'samples': 5357760, 'steps': 27904, 'loss/train': 1.9023025035858154} 11/07/2021 01:08:12 - INFO - __main__ - Step 27906: {'lr': 0.00046314617605644243, 'samples': 5357952, 'steps': 27905, 'loss/train': 1.4637824296951294} 11/07/2021 01:08:13 - INFO - __main__ - Step 27907: {'lr': 0.0004631434027619001, 'samples': 5358144, 'steps': 27906, 'loss/train': 0.5341514945030212} 11/07/2021 01:08:13 - INFO - __main__ - Step 27908: {'lr': 0.0004631406293713188, 'samples': 5358336, 'steps': 27907, 'loss/train': 0.705164909362793} 11/07/2021 01:08:14 - INFO - __main__ - Step 27909: {'lr': 0.0004631378558846995, 'samples': 5358528, 'steps': 27908, 'loss/train': 1.9466551542282104} 11/07/2021 01:08:15 - INFO - __main__ - Step 27910: {'lr': 0.00046313508230204364, 'samples': 5358720, 'steps': 27909, 'loss/train': 1.1565824747085571} 11/07/2021 01:08:15 - INFO - __main__ - Step 27911: {'lr': 0.00046313230862335235, 'samples': 5358912, 'steps': 27910, 'loss/train': 1.4948753118515015} 11/07/2021 01:08:15 - INFO - __main__ - Step 27912: {'lr': 0.000463129534848627, 'samples': 5359104, 'steps': 27911, 'loss/train': 1.0003376007080078} 11/07/2021 01:08:16 - INFO - __main__ - Step 27913: {'lr': 0.0004631267609778687, 'samples': 5359296, 'steps': 27912, 'loss/train': 1.6802488565444946} 11/07/2021 01:08:17 - INFO - __main__ - Step 27914: {'lr': 0.0004631239870110788, 'samples': 5359488, 'steps': 27913, 'loss/train': 1.7102736234664917} 11/07/2021 01:08:17 - INFO - __main__ - Step 27915: {'lr': 0.00046312121294825846, 'samples': 5359680, 'steps': 27914, 'loss/train': 1.6894041299819946} 11/07/2021 01:08:17 - INFO - __main__ - Step 27916: {'lr': 0.00046311843878940904, 'samples': 5359872, 'steps': 27915, 'loss/train': 1.746033787727356} 11/07/2021 01:08:18 - INFO - __main__ - Step 27917: {'lr': 0.0004631156645345318, 'samples': 5360064, 'steps': 27916, 'loss/train': 1.992263674736023} 11/07/2021 01:08:18 - INFO - __main__ - Step 27918: {'lr': 0.0004631128901836278, 'samples': 5360256, 'steps': 27917, 'loss/train': 1.7350739240646362} 11/07/2021 01:08:19 - INFO - __main__ - Step 27919: {'lr': 0.0004631101157366985, 'samples': 5360448, 'steps': 27918, 'loss/train': 1.9519680738449097} 11/07/2021 01:08:19 - INFO - __main__ - Step 27920: {'lr': 0.0004631073411937451, 'samples': 5360640, 'steps': 27919, 'loss/train': 1.4760541915893555} 11/07/2021 01:08:20 - INFO - __main__ - Step 27921: {'lr': 0.00046310456655476875, 'samples': 5360832, 'steps': 27920, 'loss/train': 0.7511192560195923} 11/07/2021 01:08:20 - INFO - __main__ - Step 27922: {'lr': 0.0004631017918197709, 'samples': 5361024, 'steps': 27921, 'loss/train': 1.594227910041809} 11/07/2021 01:08:20 - INFO - __main__ - Step 27923: {'lr': 0.00046309901698875244, 'samples': 5361216, 'steps': 27922, 'loss/train': 1.0119025707244873} 11/07/2021 01:08:21 - INFO - __main__ - Step 27924: {'lr': 0.00046309624206171505, 'samples': 5361408, 'steps': 27923, 'loss/train': 1.7466933727264404} 11/07/2021 01:08:22 - INFO - __main__ - Step 27925: {'lr': 0.00046309346703865973, 'samples': 5361600, 'steps': 27924, 'loss/train': 1.556564211845398} 11/07/2021 01:08:22 - INFO - __main__ - Step 27926: {'lr': 0.00046309069191958775, 'samples': 5361792, 'steps': 27925, 'loss/train': 1.549757957458496} 11/07/2021 01:08:23 - INFO - __main__ - Step 27927: {'lr': 0.00046308791670450033, 'samples': 5361984, 'steps': 27926, 'loss/train': 1.4141809940338135} 11/07/2021 01:08:23 - INFO - __main__ - Step 27928: {'lr': 0.00046308514139339896, 'samples': 5362176, 'steps': 27927, 'loss/train': 1.1684983968734741} 11/07/2021 01:08:24 - INFO - __main__ - Step 27929: {'lr': 0.0004630823659862846, 'samples': 5362368, 'steps': 27928, 'loss/train': 1.5739498138427734} 11/07/2021 01:08:24 - INFO - __main__ - Step 27930: {'lr': 0.0004630795904831586, 'samples': 5362560, 'steps': 27929, 'loss/train': 1.2718456983566284} 11/07/2021 01:08:25 - INFO - __main__ - Step 27931: {'lr': 0.0004630768148840223, 'samples': 5362752, 'steps': 27930, 'loss/train': 1.2361918687820435} 11/07/2021 01:08:25 - INFO - __main__ - Step 27932: {'lr': 0.0004630740391888768, 'samples': 5362944, 'steps': 27931, 'loss/train': 1.6679351329803467} 11/07/2021 01:08:25 - INFO - __main__ - Step 27933: {'lr': 0.0004630712633977234, 'samples': 5363136, 'steps': 27932, 'loss/train': 1.3536804914474487} 11/07/2021 01:08:26 - INFO - __main__ - Step 27934: {'lr': 0.00046306848751056346, 'samples': 5363328, 'steps': 27933, 'loss/train': 1.3461530208587646} 11/07/2021 01:08:27 - INFO - __main__ - Step 27935: {'lr': 0.0004630657115273981, 'samples': 5363520, 'steps': 27934, 'loss/train': 1.6294420957565308} 11/07/2021 01:08:27 - INFO - __main__ - Step 27936: {'lr': 0.0004630629354482286, 'samples': 5363712, 'steps': 27935, 'loss/train': 1.4199550151824951} 11/07/2021 01:08:27 - INFO - __main__ - Step 27937: {'lr': 0.00046306015927305633, 'samples': 5363904, 'steps': 27936, 'loss/train': 1.5213775634765625} 11/07/2021 01:08:28 - INFO - __main__ - Step 27938: {'lr': 0.0004630573830018824, 'samples': 5364096, 'steps': 27937, 'loss/train': 1.4308301210403442} 11/07/2021 01:08:28 - INFO - __main__ - Step 27939: {'lr': 0.00046305460663470803, 'samples': 5364288, 'steps': 27938, 'loss/train': 1.7389413118362427} 11/07/2021 01:08:29 - INFO - __main__ - Step 27940: {'lr': 0.0004630518301715346, 'samples': 5364480, 'steps': 27939, 'loss/train': 1.478661060333252} 11/07/2021 01:08:29 - INFO - __main__ - Step 27941: {'lr': 0.00046304905361236335, 'samples': 5364672, 'steps': 27940, 'loss/train': 1.7717012166976929} 11/07/2021 01:08:30 - INFO - __main__ - Step 27942: {'lr': 0.00046304627695719535, 'samples': 5364864, 'steps': 27941, 'loss/train': 1.5955733060836792} 11/07/2021 01:08:30 - INFO - __main__ - Step 27943: {'lr': 0.0004630435002060321, 'samples': 5365056, 'steps': 27942, 'loss/train': 1.272430419921875} 11/07/2021 01:08:30 - INFO - __main__ - Step 27944: {'lr': 0.0004630407233588747, 'samples': 5365248, 'steps': 27943, 'loss/train': 1.54817533493042} 11/07/2021 01:08:31 - INFO - __main__ - Step 27945: {'lr': 0.00046303794641572445, 'samples': 5365440, 'steps': 27944, 'loss/train': 1.4762077331542969} 11/07/2021 01:08:32 - INFO - __main__ - Step 27946: {'lr': 0.0004630351693765825, 'samples': 5365632, 'steps': 27945, 'loss/train': 1.6068906784057617} 11/07/2021 01:08:32 - INFO - __main__ - Step 27947: {'lr': 0.0004630323922414503, 'samples': 5365824, 'steps': 27946, 'loss/train': 1.3695404529571533} 11/07/2021 01:08:32 - INFO - __main__ - Step 27948: {'lr': 0.00046302961501032896, 'samples': 5366016, 'steps': 27947, 'loss/train': 1.3957011699676514} 11/07/2021 01:08:33 - INFO - __main__ - Step 27949: {'lr': 0.00046302683768321973, 'samples': 5366208, 'steps': 27948, 'loss/train': 1.190190076828003} 11/07/2021 01:08:34 - INFO - __main__ - Step 27950: {'lr': 0.00046302406026012396, 'samples': 5366400, 'steps': 27949, 'loss/train': 1.5490968227386475} 11/07/2021 01:08:34 - INFO - __main__ - Step 27951: {'lr': 0.0004630212827410428, 'samples': 5366592, 'steps': 27950, 'loss/train': 1.3641514778137207} 11/07/2021 01:08:35 - INFO - __main__ - Step 27952: {'lr': 0.00046301850512597755, 'samples': 5366784, 'steps': 27951, 'loss/train': 1.511609435081482} 11/07/2021 01:08:35 - INFO - __main__ - Step 27953: {'lr': 0.0004630157274149294, 'samples': 5366976, 'steps': 27952, 'loss/train': 1.105630874633789} 11/07/2021 01:08:35 - INFO - __main__ - Step 27954: {'lr': 0.0004630129496078997, 'samples': 5367168, 'steps': 27953, 'loss/train': 1.6662789583206177} 11/07/2021 01:08:36 - INFO - __main__ - Step 27955: {'lr': 0.00046301017170488965, 'samples': 5367360, 'steps': 27954, 'loss/train': 1.4889657497406006} 11/07/2021 01:08:37 - INFO - __main__ - Step 27956: {'lr': 0.0004630073937059005, 'samples': 5367552, 'steps': 27955, 'loss/train': 0.7831249833106995} 11/07/2021 01:08:37 - INFO - __main__ - Step 27957: {'lr': 0.0004630046156109334, 'samples': 5367744, 'steps': 27956, 'loss/train': 1.9498980045318604} 11/07/2021 01:08:37 - INFO - __main__ - Step 27958: {'lr': 0.0004630018374199899, 'samples': 5367936, 'steps': 27957, 'loss/train': 1.4369566440582275} 11/07/2021 01:08:38 - INFO - __main__ - Step 27959: {'lr': 0.00046299905913307096, 'samples': 5368128, 'steps': 27958, 'loss/train': 1.1461580991744995} 11/07/2021 01:08:39 - INFO - __main__ - Step 27960: {'lr': 0.00046299628075017785, 'samples': 5368320, 'steps': 27959, 'loss/train': 1.9767727851867676} 11/07/2021 01:08:39 - INFO - __main__ - Step 27961: {'lr': 0.000462993502271312, 'samples': 5368512, 'steps': 27960, 'loss/train': 1.7098110914230347} 11/07/2021 01:08:40 - INFO - __main__ - Step 27962: {'lr': 0.00046299072369647453, 'samples': 5368704, 'steps': 27961, 'loss/train': 1.5643975734710693} 11/07/2021 01:08:40 - INFO - __main__ - Step 27963: {'lr': 0.00046298794502566676, 'samples': 5368896, 'steps': 27962, 'loss/train': 1.5460844039916992} 11/07/2021 01:08:40 - INFO - __main__ - Step 27964: {'lr': 0.0004629851662588899, 'samples': 5369088, 'steps': 27963, 'loss/train': 2.0571935176849365} 11/07/2021 01:08:41 - INFO - __main__ - Step 27965: {'lr': 0.00046298238739614524, 'samples': 5369280, 'steps': 27964, 'loss/train': 1.5529454946517944} 11/07/2021 01:08:42 - INFO - __main__ - Step 27966: {'lr': 0.0004629796084374339, 'samples': 5369472, 'steps': 27965, 'loss/train': 1.4575004577636719} 11/07/2021 01:08:42 - INFO - __main__ - Step 27967: {'lr': 0.00046297682938275733, 'samples': 5369664, 'steps': 27966, 'loss/train': 1.5414485931396484} 11/07/2021 01:08:42 - INFO - __main__ - Step 27968: {'lr': 0.0004629740502321167, 'samples': 5369856, 'steps': 27967, 'loss/train': 1.659104585647583} 11/07/2021 01:08:43 - INFO - __main__ - Step 27969: {'lr': 0.00046297127098551317, 'samples': 5370048, 'steps': 27968, 'loss/train': 0.7998484373092651} 11/07/2021 01:08:43 - INFO - __main__ - Step 27970: {'lr': 0.00046296849164294816, 'samples': 5370240, 'steps': 27969, 'loss/train': 1.6854795217514038} 11/07/2021 01:08:44 - INFO - __main__ - Step 27971: {'lr': 0.00046296571220442274, 'samples': 5370432, 'steps': 27970, 'loss/train': 1.4134701490402222} 11/07/2021 01:08:45 - INFO - __main__ - Step 27972: {'lr': 0.00046296293266993833, 'samples': 5370624, 'steps': 27971, 'loss/train': 1.8665457963943481} 11/07/2021 01:08:45 - INFO - __main__ - Step 27973: {'lr': 0.00046296015303949606, 'samples': 5370816, 'steps': 27972, 'loss/train': 1.2247053384780884} 11/07/2021 01:08:45 - INFO - __main__ - Step 27974: {'lr': 0.0004629573733130973, 'samples': 5371008, 'steps': 27973, 'loss/train': 1.5885862112045288} 11/07/2021 01:08:46 - INFO - __main__ - Step 27975: {'lr': 0.00046295459349074316, 'samples': 5371200, 'steps': 27974, 'loss/train': 1.613280177116394} 11/07/2021 01:08:47 - INFO - __main__ - Step 27976: {'lr': 0.000462951813572435, 'samples': 5371392, 'steps': 27975, 'loss/train': 1.2622509002685547} 11/07/2021 01:08:47 - INFO - __main__ - Step 27977: {'lr': 0.00046294903355817397, 'samples': 5371584, 'steps': 27976, 'loss/train': 1.672298789024353} 11/07/2021 01:08:47 - INFO - __main__ - Step 27978: {'lr': 0.0004629462534479615, 'samples': 5371776, 'steps': 27977, 'loss/train': 1.6013282537460327} 11/07/2021 01:08:48 - INFO - __main__ - Step 27979: {'lr': 0.0004629434732417986, 'samples': 5371968, 'steps': 27978, 'loss/train': 1.2018623352050781} 11/07/2021 01:08:48 - INFO - __main__ - Step 27980: {'lr': 0.0004629406929396868, 'samples': 5372160, 'steps': 27979, 'loss/train': 1.450639247894287} 11/07/2021 01:08:49 - INFO - __main__ - Step 27981: {'lr': 0.00046293791254162713, 'samples': 5372352, 'steps': 27980, 'loss/train': 1.7362723350524902} 11/07/2021 01:08:49 - INFO - __main__ - Step 27982: {'lr': 0.0004629351320476209, 'samples': 5372544, 'steps': 27981, 'loss/train': 1.3867053985595703} 11/07/2021 01:08:50 - INFO - __main__ - Step 27983: {'lr': 0.00046293235145766955, 'samples': 5372736, 'steps': 27982, 'loss/train': 1.4130687713623047} 11/07/2021 01:08:50 - INFO - __main__ - Step 27984: {'lr': 0.000462929570771774, 'samples': 5372928, 'steps': 27983, 'loss/train': 0.9213356375694275} 11/07/2021 01:08:50 - INFO - __main__ - Step 27985: {'lr': 0.0004629267899899358, 'samples': 5373120, 'steps': 27984, 'loss/train': 0.8098600506782532} 11/07/2021 01:08:51 - INFO - __main__ - Step 27986: {'lr': 0.00046292400911215594, 'samples': 5373312, 'steps': 27985, 'loss/train': 1.0836659669876099} 11/07/2021 01:08:52 - INFO - __main__ - Step 27987: {'lr': 0.00046292122813843586, 'samples': 5373504, 'steps': 27986, 'loss/train': 1.5532180070877075} 11/07/2021 01:08:52 - INFO - __main__ - Step 27988: {'lr': 0.00046291844706877674, 'samples': 5373696, 'steps': 27987, 'loss/train': 1.2126308679580688} 11/07/2021 01:08:52 - INFO - __main__ - Step 27989: {'lr': 0.0004629156659031799, 'samples': 5373888, 'steps': 27988, 'loss/train': 1.552217721939087} 11/07/2021 01:08:53 - INFO - __main__ - Step 27990: {'lr': 0.0004629128846416465, 'samples': 5374080, 'steps': 27989, 'loss/train': 1.5919543504714966} 11/07/2021 01:08:54 - INFO - __main__ - Step 27991: {'lr': 0.00046291010328417784, 'samples': 5374272, 'steps': 27990, 'loss/train': 1.7490825653076172} 11/07/2021 01:08:54 - INFO - __main__ - Step 27992: {'lr': 0.0004629073218307752, 'samples': 5374464, 'steps': 27991, 'loss/train': 1.7721277475357056} 11/07/2021 01:08:55 - INFO - __main__ - Step 27993: {'lr': 0.0004629045402814398, 'samples': 5374656, 'steps': 27992, 'loss/train': 1.260655403137207} 11/07/2021 01:08:55 - INFO - __main__ - Step 27994: {'lr': 0.0004629017586361729, 'samples': 5374848, 'steps': 27993, 'loss/train': 1.638946533203125} 11/07/2021 01:08:55 - INFO - __main__ - Step 27995: {'lr': 0.0004628989768949757, 'samples': 5375040, 'steps': 27994, 'loss/train': 1.3483721017837524} 11/07/2021 01:08:56 - INFO - __main__ - Step 27996: {'lr': 0.0004628961950578496, 'samples': 5375232, 'steps': 27995, 'loss/train': 1.5386420488357544} 11/07/2021 01:08:57 - INFO - __main__ - Step 27997: {'lr': 0.00046289341312479574, 'samples': 5375424, 'steps': 27996, 'loss/train': 1.6511589288711548} 11/07/2021 01:08:57 - INFO - __main__ - Step 27998: {'lr': 0.0004628906310958153, 'samples': 5375616, 'steps': 27997, 'loss/train': 1.5828850269317627} 11/07/2021 01:08:57 - INFO - __main__ - Step 27999: {'lr': 0.00046288784897090973, 'samples': 5375808, 'steps': 27998, 'loss/train': 1.1882143020629883} 11/07/2021 01:08:58 - INFO - __main__ - Step 28000: {'lr': 0.00046288506675008014, 'samples': 5376000, 'steps': 27999, 'loss/train': 1.262174367904663} 11/07/2021 01:08:59 - INFO - __main__ - Step 28001: {'lr': 0.0004628822844333278, 'samples': 5376192, 'steps': 28000, 'loss/train': 1.8064054250717163} 11/07/2021 01:08:59 - INFO - __main__ - Step 28002: {'lr': 0.0004628795020206541, 'samples': 5376384, 'steps': 28001, 'loss/train': 2.0902321338653564} 11/07/2021 01:09:00 - INFO - __main__ - Step 28003: {'lr': 0.00046287671951206004, 'samples': 5376576, 'steps': 28002, 'loss/train': 1.5915676355361938} 11/07/2021 01:09:00 - INFO - __main__ - Step 28004: {'lr': 0.0004628739369075471, 'samples': 5376768, 'steps': 28003, 'loss/train': 1.1522947549819946} 11/07/2021 01:09:00 - INFO - __main__ - Step 28005: {'lr': 0.00046287115420711643, 'samples': 5376960, 'steps': 28004, 'loss/train': 1.939139723777771} 11/07/2021 01:09:01 - INFO - __main__ - Step 28006: {'lr': 0.00046286837141076934, 'samples': 5377152, 'steps': 28005, 'loss/train': 1.4172292947769165} 11/07/2021 01:09:02 - INFO - __main__ - Step 28007: {'lr': 0.0004628655885185069, 'samples': 5377344, 'steps': 28006, 'loss/train': 0.7986924648284912} 11/07/2021 01:09:02 - INFO - __main__ - Step 28008: {'lr': 0.00046286280553033067, 'samples': 5377536, 'steps': 28007, 'loss/train': 1.7829539775848389} 11/07/2021 01:09:02 - INFO - __main__ - Step 28009: {'lr': 0.0004628600224462417, 'samples': 5377728, 'steps': 28008, 'loss/train': 2.08845853805542} 11/07/2021 01:09:03 - INFO - __main__ - Step 28010: {'lr': 0.00046285723926624126, 'samples': 5377920, 'steps': 28009, 'loss/train': 0.8777673244476318} 11/07/2021 01:09:03 - INFO - __main__ - Step 28011: {'lr': 0.00046285445599033063, 'samples': 5378112, 'steps': 28010, 'loss/train': 1.582600712776184} 11/07/2021 01:09:04 - INFO - __main__ - Step 28012: {'lr': 0.00046285167261851114, 'samples': 5378304, 'steps': 28011, 'loss/train': 1.723099708557129} 11/07/2021 01:09:04 - INFO - __main__ - Step 28013: {'lr': 0.00046284888915078384, 'samples': 5378496, 'steps': 28012, 'loss/train': 1.5498576164245605} 11/07/2021 01:09:05 - INFO - __main__ - Step 28014: {'lr': 0.00046284610558715024, 'samples': 5378688, 'steps': 28013, 'loss/train': 1.2753721475601196} 11/07/2021 01:09:05 - INFO - __main__ - Step 28015: {'lr': 0.00046284332192761136, 'samples': 5378880, 'steps': 28014, 'loss/train': 1.680116891860962} 11/07/2021 01:09:05 - INFO - __main__ - Step 28016: {'lr': 0.0004628405381721686, 'samples': 5379072, 'steps': 28015, 'loss/train': 1.4995726346969604} 11/07/2021 01:09:07 - INFO - __main__ - Step 28017: {'lr': 0.00046283775432082327, 'samples': 5379264, 'steps': 28016, 'loss/train': 1.733219027519226} 11/07/2021 01:09:07 - INFO - __main__ - Step 28018: {'lr': 0.0004628349703735765, 'samples': 5379456, 'steps': 28017, 'loss/train': 1.6560436487197876} 11/07/2021 01:09:07 - INFO - __main__ - Step 28019: {'lr': 0.0004628321863304295, 'samples': 5379648, 'steps': 28018, 'loss/train': 1.7639132738113403} 11/07/2021 01:09:08 - INFO - __main__ - Step 28020: {'lr': 0.00046282940219138366, 'samples': 5379840, 'steps': 28019, 'loss/train': 1.337292194366455} 11/07/2021 01:09:08 - INFO - __main__ - Step 28021: {'lr': 0.0004628266179564401, 'samples': 5380032, 'steps': 28020, 'loss/train': 0.2998325824737549} 11/07/2021 01:09:08 - INFO - __main__ - Step 28022: {'lr': 0.0004628238336256002, 'samples': 5380224, 'steps': 28021, 'loss/train': 1.5005648136138916} 11/07/2021 01:09:09 - INFO - __main__ - Step 28023: {'lr': 0.0004628210491988652, 'samples': 5380416, 'steps': 28022, 'loss/train': 1.5982273817062378} 11/07/2021 01:09:10 - INFO - __main__ - Step 28024: {'lr': 0.0004628182646762363, 'samples': 5380608, 'steps': 28023, 'loss/train': 1.552241325378418} 11/07/2021 01:09:10 - INFO - __main__ - Step 28025: {'lr': 0.00046281548005771476, 'samples': 5380800, 'steps': 28024, 'loss/train': 1.9095693826675415} 11/07/2021 01:09:10 - INFO - __main__ - Step 28026: {'lr': 0.0004628126953433018, 'samples': 5380992, 'steps': 28025, 'loss/train': 1.3487874269485474} 11/07/2021 01:09:11 - INFO - __main__ - Step 28027: {'lr': 0.00046280991053299883, 'samples': 5381184, 'steps': 28026, 'loss/train': 1.3562543392181396} 11/07/2021 01:09:12 - INFO - __main__ - Step 28028: {'lr': 0.00046280712562680695, 'samples': 5381376, 'steps': 28027, 'loss/train': 1.626530408859253} 11/07/2021 01:09:12 - INFO - __main__ - Step 28029: {'lr': 0.0004628043406247274, 'samples': 5381568, 'steps': 28028, 'loss/train': 2.292478084564209} 11/07/2021 01:09:12 - INFO - __main__ - Step 28030: {'lr': 0.0004628015555267616, 'samples': 5381760, 'steps': 28029, 'loss/train': 1.692482352256775} 11/07/2021 01:09:13 - INFO - __main__ - Step 28031: {'lr': 0.00046279877033291063, 'samples': 5381952, 'steps': 28030, 'loss/train': 1.5280697345733643} 11/07/2021 01:09:13 - INFO - __main__ - Step 28032: {'lr': 0.0004627959850431759, 'samples': 5382144, 'steps': 28031, 'loss/train': 2.019195318222046} 11/07/2021 01:09:14 - INFO - __main__ - Step 28033: {'lr': 0.0004627931996575585, 'samples': 5382336, 'steps': 28032, 'loss/train': 1.5592269897460938} 11/07/2021 01:09:15 - INFO - __main__ - Step 28034: {'lr': 0.0004627904141760598, 'samples': 5382528, 'steps': 28033, 'loss/train': 1.290827751159668} 11/07/2021 01:09:15 - INFO - __main__ - Step 28035: {'lr': 0.000462787628598681, 'samples': 5382720, 'steps': 28034, 'loss/train': 1.722243070602417} 11/07/2021 01:09:15 - INFO - __main__ - Step 28036: {'lr': 0.00046278484292542346, 'samples': 5382912, 'steps': 28035, 'loss/train': 1.1731336116790771} 11/07/2021 01:09:16 - INFO - __main__ - Step 28037: {'lr': 0.0004627820571562883, 'samples': 5383104, 'steps': 28036, 'loss/train': 1.524124026298523} 11/07/2021 01:09:17 - INFO - __main__ - Step 28038: {'lr': 0.0004627792712912768, 'samples': 5383296, 'steps': 28037, 'loss/train': 2.1107864379882812} 11/07/2021 01:09:17 - INFO - __main__ - Step 28039: {'lr': 0.0004627764853303902, 'samples': 5383488, 'steps': 28038, 'loss/train': 1.5921052694320679} 11/07/2021 01:09:17 - INFO - __main__ - Step 28040: {'lr': 0.00046277369927362987, 'samples': 5383680, 'steps': 28039, 'loss/train': 1.7127749919891357} 11/07/2021 01:09:18 - INFO - __main__ - Step 28041: {'lr': 0.00046277091312099704, 'samples': 5383872, 'steps': 28040, 'loss/train': 1.6613887548446655} 11/07/2021 01:09:18 - INFO - __main__ - Step 28042: {'lr': 0.00046276812687249283, 'samples': 5384064, 'steps': 28041, 'loss/train': 1.9463579654693604} 11/07/2021 01:09:19 - INFO - __main__ - Step 28043: {'lr': 0.00046276534052811863, 'samples': 5384256, 'steps': 28042, 'loss/train': 1.6028937101364136} 11/07/2021 01:09:20 - INFO - __main__ - Step 28044: {'lr': 0.00046276255408787565, 'samples': 5384448, 'steps': 28043, 'loss/train': 1.499482274055481} 11/07/2021 01:09:20 - INFO - __main__ - Step 28045: {'lr': 0.0004627597675517652, 'samples': 5384640, 'steps': 28044, 'loss/train': 0.9538659453392029} 11/07/2021 01:09:20 - INFO - __main__ - Step 28046: {'lr': 0.00046275698091978836, 'samples': 5384832, 'steps': 28045, 'loss/train': 1.4438320398330688} 11/07/2021 01:09:21 - INFO - __main__ - Step 28047: {'lr': 0.0004627541941919466, 'samples': 5385024, 'steps': 28046, 'loss/train': 1.389336109161377} 11/07/2021 01:09:22 - INFO - __main__ - Step 28048: {'lr': 0.00046275140736824104, 'samples': 5385216, 'steps': 28047, 'loss/train': 1.7338414192199707} 11/07/2021 01:09:22 - INFO - __main__ - Step 28049: {'lr': 0.000462748620448673, 'samples': 5385408, 'steps': 28048, 'loss/train': 1.335105061531067} 11/07/2021 01:09:22 - INFO - __main__ - Step 28050: {'lr': 0.0004627458334332437, 'samples': 5385600, 'steps': 28049, 'loss/train': 1.6704505681991577} 11/07/2021 01:09:23 - INFO - __main__ - Step 28051: {'lr': 0.0004627430463219544, 'samples': 5385792, 'steps': 28050, 'loss/train': 1.591986060142517} 11/07/2021 01:09:23 - INFO - __main__ - Step 28052: {'lr': 0.0004627402591148064, 'samples': 5385984, 'steps': 28051, 'loss/train': 1.5422168970108032} 11/07/2021 01:09:23 - INFO - __main__ - Step 28053: {'lr': 0.0004627374718118009, 'samples': 5386176, 'steps': 28052, 'loss/train': 1.6419172286987305} 11/07/2021 01:09:24 - INFO - __main__ - Step 28054: {'lr': 0.0004627346844129392, 'samples': 5386368, 'steps': 28053, 'loss/train': 1.4023685455322266} 11/07/2021 01:09:25 - INFO - __main__ - Step 28055: {'lr': 0.0004627318969182225, 'samples': 5386560, 'steps': 28054, 'loss/train': 1.5047255754470825} 11/07/2021 01:09:25 - INFO - __main__ - Step 28056: {'lr': 0.0004627291093276521, 'samples': 5386752, 'steps': 28055, 'loss/train': 1.5556131601333618} 11/07/2021 01:09:25 - INFO - __main__ - Step 28057: {'lr': 0.0004627263216412292, 'samples': 5386944, 'steps': 28056, 'loss/train': 2.0487048625946045} 11/07/2021 01:09:26 - INFO - __main__ - Step 28058: {'lr': 0.00046272353385895515, 'samples': 5387136, 'steps': 28057, 'loss/train': 1.6594773530960083} 11/07/2021 01:09:27 - INFO - __main__ - Step 28059: {'lr': 0.0004627207459808312, 'samples': 5387328, 'steps': 28058, 'loss/train': 1.4510393142700195} 11/07/2021 01:09:27 - INFO - __main__ - Step 28060: {'lr': 0.00046271795800685854, 'samples': 5387520, 'steps': 28059, 'loss/train': 1.6694095134735107} 11/07/2021 01:09:27 - INFO - __main__ - Step 28061: {'lr': 0.00046271516993703844, 'samples': 5387712, 'steps': 28060, 'loss/train': 1.4473903179168701} 11/07/2021 01:09:28 - INFO - __main__ - Step 28062: {'lr': 0.00046271238177137216, 'samples': 5387904, 'steps': 28061, 'loss/train': 1.3745687007904053} 11/07/2021 01:09:28 - INFO - __main__ - Step 28063: {'lr': 0.00046270959350986095, 'samples': 5388096, 'steps': 28062, 'loss/train': 1.5512062311172485} 11/07/2021 01:09:29 - INFO - __main__ - Step 28064: {'lr': 0.0004627068051525061, 'samples': 5388288, 'steps': 28063, 'loss/train': 1.3083319664001465} 11/07/2021 01:09:29 - INFO - __main__ - Step 28065: {'lr': 0.00046270401669930885, 'samples': 5388480, 'steps': 28064, 'loss/train': 1.5867365598678589} 11/07/2021 01:09:30 - INFO - __main__ - Step 28066: {'lr': 0.0004627012281502704, 'samples': 5388672, 'steps': 28065, 'loss/train': 1.640199065208435} 11/07/2021 01:09:30 - INFO - __main__ - Step 28067: {'lr': 0.00046269843950539214, 'samples': 5388864, 'steps': 28066, 'loss/train': 1.7950187921524048} 11/07/2021 01:09:31 - INFO - __main__ - Step 28068: {'lr': 0.00046269565076467517, 'samples': 5389056, 'steps': 28067, 'loss/train': 1.6963131427764893} 11/07/2021 01:09:32 - INFO - __main__ - Step 28069: {'lr': 0.0004626928619281209, 'samples': 5389248, 'steps': 28068, 'loss/train': 1.6827151775360107} 11/07/2021 01:09:32 - INFO - __main__ - Step 28070: {'lr': 0.0004626900729957305, 'samples': 5389440, 'steps': 28069, 'loss/train': 1.433807611465454} 11/07/2021 01:09:32 - INFO - __main__ - Step 28071: {'lr': 0.00046268728396750515, 'samples': 5389632, 'steps': 28070, 'loss/train': 1.8343700170516968} 11/07/2021 01:09:33 - INFO - __main__ - Step 28072: {'lr': 0.0004626844948434462, 'samples': 5389824, 'steps': 28071, 'loss/train': 1.6992357969284058} 11/07/2021 01:09:33 - INFO - __main__ - Step 28073: {'lr': 0.00046268170562355497, 'samples': 5390016, 'steps': 28072, 'loss/train': 1.352782964706421} 11/07/2021 01:09:34 - INFO - __main__ - Step 28074: {'lr': 0.0004626789163078327, 'samples': 5390208, 'steps': 28073, 'loss/train': 1.7138376235961914} 11/07/2021 01:09:35 - INFO - __main__ - Step 28075: {'lr': 0.00046267612689628046, 'samples': 5390400, 'steps': 28074, 'loss/train': 1.884217381477356} 11/07/2021 01:09:35 - INFO - __main__ - Step 28076: {'lr': 0.00046267333738889973, 'samples': 5390592, 'steps': 28075, 'loss/train': 0.8912597894668579} 11/07/2021 01:09:35 - INFO - __main__ - Step 28077: {'lr': 0.00046267054778569163, 'samples': 5390784, 'steps': 28076, 'loss/train': 1.3834558725357056} 11/07/2021 01:09:36 - INFO - __main__ - Step 28078: {'lr': 0.0004626677580866574, 'samples': 5390976, 'steps': 28077, 'loss/train': 1.657981514930725} 11/07/2021 01:09:37 - INFO - __main__ - Step 28079: {'lr': 0.00046266496829179847, 'samples': 5391168, 'steps': 28078, 'loss/train': 1.483892560005188} 11/07/2021 01:09:37 - INFO - __main__ - Step 28080: {'lr': 0.0004626621784011159, 'samples': 5391360, 'steps': 28079, 'loss/train': 1.7254626750946045} 11/07/2021 01:09:37 - INFO - __main__ - Step 28081: {'lr': 0.0004626593884146111, 'samples': 5391552, 'steps': 28080, 'loss/train': 1.7378684282302856} 11/07/2021 01:09:38 - INFO - __main__ - Step 28082: {'lr': 0.00046265659833228523, 'samples': 5391744, 'steps': 28081, 'loss/train': 1.5451849699020386} 11/07/2021 01:09:38 - INFO - __main__ - Step 28083: {'lr': 0.0004626538081541396, 'samples': 5391936, 'steps': 28082, 'loss/train': 1.3562519550323486} 11/07/2021 01:09:38 - INFO - __main__ - Step 28084: {'lr': 0.00046265101788017543, 'samples': 5392128, 'steps': 28083, 'loss/train': 1.4944548606872559} 11/07/2021 01:09:39 - INFO - __main__ - Step 28085: {'lr': 0.00046264822751039406, 'samples': 5392320, 'steps': 28084, 'loss/train': 1.656470775604248} 11/07/2021 01:09:40 - INFO - __main__ - Step 28086: {'lr': 0.00046264543704479654, 'samples': 5392512, 'steps': 28085, 'loss/train': 1.6204808950424194} 11/07/2021 01:09:40 - INFO - __main__ - Step 28087: {'lr': 0.0004626426464833844, 'samples': 5392704, 'steps': 28086, 'loss/train': 1.1907124519348145} 11/07/2021 01:09:40 - INFO - __main__ - Step 28088: {'lr': 0.0004626398558261586, 'samples': 5392896, 'steps': 28087, 'loss/train': 1.611730694770813} 11/07/2021 01:09:41 - INFO - __main__ - Step 28089: {'lr': 0.00046263706507312073, 'samples': 5393088, 'steps': 28088, 'loss/train': 1.4733778238296509} 11/07/2021 01:09:42 - INFO - __main__ - Step 28090: {'lr': 0.00046263427422427183, 'samples': 5393280, 'steps': 28089, 'loss/train': 1.566117525100708} 11/07/2021 01:09:42 - INFO - __main__ - Step 28091: {'lr': 0.00046263148327961324, 'samples': 5393472, 'steps': 28090, 'loss/train': 1.3886053562164307} 11/07/2021 01:09:43 - INFO - __main__ - Step 28092: {'lr': 0.00046262869223914613, 'samples': 5393664, 'steps': 28091, 'loss/train': 1.5615581274032593} 11/07/2021 01:09:43 - INFO - __main__ - Step 28093: {'lr': 0.00046262590110287183, 'samples': 5393856, 'steps': 28092, 'loss/train': 1.5337178707122803} 11/07/2021 01:09:43 - INFO - __main__ - Step 28094: {'lr': 0.00046262310987079156, 'samples': 5394048, 'steps': 28093, 'loss/train': 1.2868162393569946} 11/07/2021 01:09:44 - INFO - __main__ - Step 28095: {'lr': 0.0004626203185429066, 'samples': 5394240, 'steps': 28094, 'loss/train': 1.0722566843032837} 11/07/2021 01:09:45 - INFO - __main__ - Step 28096: {'lr': 0.00046261752711921825, 'samples': 5394432, 'steps': 28095, 'loss/train': 1.422263503074646} 11/07/2021 01:09:45 - INFO - __main__ - Step 28097: {'lr': 0.00046261473559972764, 'samples': 5394624, 'steps': 28096, 'loss/train': 1.1242746114730835} 11/07/2021 01:09:45 - INFO - __main__ - Step 28098: {'lr': 0.00046261194398443617, 'samples': 5394816, 'steps': 28097, 'loss/train': 1.7802011966705322} 11/07/2021 01:09:46 - INFO - __main__ - Step 28099: {'lr': 0.00046260915227334503, 'samples': 5395008, 'steps': 28098, 'loss/train': 1.442736029624939} 11/07/2021 01:09:47 - INFO - __main__ - Step 28100: {'lr': 0.0004626063604664555, 'samples': 5395200, 'steps': 28099, 'loss/train': 1.458739995956421} 11/07/2021 01:09:47 - INFO - __main__ - Step 28101: {'lr': 0.00046260356856376884, 'samples': 5395392, 'steps': 28100, 'loss/train': 1.3972065448760986} 11/07/2021 01:09:47 - INFO - __main__ - Step 28102: {'lr': 0.0004626007765652862, 'samples': 5395584, 'steps': 28101, 'loss/train': 1.6531623601913452} 11/07/2021 01:09:48 - INFO - __main__ - Step 28103: {'lr': 0.00046259798447100903, 'samples': 5395776, 'steps': 28102, 'loss/train': 0.6677843332290649} 11/07/2021 01:09:48 - INFO - __main__ - Step 28104: {'lr': 0.0004625951922809385, 'samples': 5395968, 'steps': 28103, 'loss/train': 1.4591470956802368} 11/07/2021 01:09:49 - INFO - __main__ - Step 28105: {'lr': 0.0004625923999950758, 'samples': 5396160, 'steps': 28104, 'loss/train': 1.812696933746338} 11/07/2021 01:09:50 - INFO - __main__ - Step 28106: {'lr': 0.0004625896076134222, 'samples': 5396352, 'steps': 28105, 'loss/train': 1.6228306293487549} 11/07/2021 01:09:50 - INFO - __main__ - Step 28107: {'lr': 0.00046258681513597913, 'samples': 5396544, 'steps': 28106, 'loss/train': 1.282670497894287} 11/07/2021 01:09:50 - INFO - __main__ - Step 28108: {'lr': 0.0004625840225627476, 'samples': 5396736, 'steps': 28107, 'loss/train': 1.8279814720153809} 11/07/2021 01:09:51 - INFO - __main__ - Step 28109: {'lr': 0.0004625812298937291, 'samples': 5396928, 'steps': 28108, 'loss/train': 1.9043680429458618} 11/07/2021 01:09:51 - INFO - __main__ - Step 28110: {'lr': 0.0004625784371289247, 'samples': 5397120, 'steps': 28109, 'loss/train': 1.7655878067016602} 11/07/2021 01:09:52 - INFO - __main__ - Step 28111: {'lr': 0.00046257564426833574, 'samples': 5397312, 'steps': 28110, 'loss/train': 0.7204713225364685} 11/07/2021 01:09:52 - INFO - __main__ - Step 28112: {'lr': 0.0004625728513119635, 'samples': 5397504, 'steps': 28111, 'loss/train': 1.454695463180542} 11/07/2021 01:09:53 - INFO - __main__ - Step 28113: {'lr': 0.0004625700582598092, 'samples': 5397696, 'steps': 28112, 'loss/train': 0.9713383316993713} 11/07/2021 01:09:53 - INFO - __main__ - Step 28114: {'lr': 0.00046256726511187407, 'samples': 5397888, 'steps': 28113, 'loss/train': 1.6995383501052856} 11/07/2021 01:09:53 - INFO - __main__ - Step 28115: {'lr': 0.0004625644718681595, 'samples': 5398080, 'steps': 28114, 'loss/train': 0.9354874491691589} 11/07/2021 01:09:54 - INFO - __main__ - Step 28116: {'lr': 0.0004625616785286666, 'samples': 5398272, 'steps': 28115, 'loss/train': 1.810640573501587} 11/07/2021 01:09:55 - INFO - __main__ - Step 28117: {'lr': 0.0004625588850933967, 'samples': 5398464, 'steps': 28116, 'loss/train': 1.472623348236084} 11/07/2021 01:09:55 - INFO - __main__ - Step 28118: {'lr': 0.00046255609156235105, 'samples': 5398656, 'steps': 28117, 'loss/train': 1.1488120555877686} 11/07/2021 01:09:55 - INFO - __main__ - Step 28119: {'lr': 0.0004625532979355309, 'samples': 5398848, 'steps': 28118, 'loss/train': 1.590772271156311} 11/07/2021 01:09:56 - INFO - __main__ - Step 28120: {'lr': 0.00046255050421293756, 'samples': 5399040, 'steps': 28119, 'loss/train': 1.3995095491409302} 11/07/2021 01:09:57 - INFO - __main__ - Step 28121: {'lr': 0.0004625477103945722, 'samples': 5399232, 'steps': 28120, 'loss/train': 0.9050951600074768} 11/07/2021 01:09:57 - INFO - __main__ - Step 28122: {'lr': 0.00046254491648043604, 'samples': 5399424, 'steps': 28121, 'loss/train': 0.5893778800964355} 11/07/2021 01:09:57 - INFO - __main__ - Step 28123: {'lr': 0.00046254212247053055, 'samples': 5399616, 'steps': 28122, 'loss/train': 1.2392082214355469} 11/07/2021 01:09:58 - INFO - __main__ - Step 28124: {'lr': 0.0004625393283648568, 'samples': 5399808, 'steps': 28123, 'loss/train': 1.4818974733352661} 11/07/2021 01:09:58 - INFO - __main__ - Step 28125: {'lr': 0.0004625365341634161, 'samples': 5400000, 'steps': 28124, 'loss/train': 1.5307754278182983} 11/07/2021 01:09:59 - INFO - __main__ - Step 28126: {'lr': 0.00046253373986620985, 'samples': 5400192, 'steps': 28125, 'loss/train': 1.4790600538253784} 11/07/2021 01:10:00 - INFO - __main__ - Step 28127: {'lr': 0.00046253094547323904, 'samples': 5400384, 'steps': 28126, 'loss/train': 1.7845100164413452} 11/07/2021 01:10:00 - INFO - __main__ - Step 28128: {'lr': 0.0004625281509845051, 'samples': 5400576, 'steps': 28127, 'loss/train': 2.040771961212158} 11/07/2021 01:10:00 - INFO - __main__ - Step 28129: {'lr': 0.0004625253564000092, 'samples': 5400768, 'steps': 28128, 'loss/train': 2.1188430786132812} 11/07/2021 01:10:01 - INFO - __main__ - Step 28130: {'lr': 0.00046252256171975273, 'samples': 5400960, 'steps': 28129, 'loss/train': 1.036460518836975} 11/07/2021 01:10:02 - INFO - __main__ - Step 28131: {'lr': 0.0004625197669437368, 'samples': 5401152, 'steps': 28130, 'loss/train': 1.5945188999176025} 11/07/2021 01:10:02 - INFO - __main__ - Step 28132: {'lr': 0.0004625169720719628, 'samples': 5401344, 'steps': 28131, 'loss/train': 1.317168951034546} 11/07/2021 01:10:02 - INFO - __main__ - Step 28133: {'lr': 0.0004625141771044319, 'samples': 5401536, 'steps': 28132, 'loss/train': 2.130425453186035} 11/07/2021 01:10:03 - INFO - __main__ - Step 28134: {'lr': 0.0004625113820411454, 'samples': 5401728, 'steps': 28133, 'loss/train': 2.2005603313446045} 11/07/2021 01:10:03 - INFO - __main__ - Step 28135: {'lr': 0.0004625085868821046, 'samples': 5401920, 'steps': 28134, 'loss/train': 2.0499491691589355} 11/07/2021 01:10:04 - INFO - __main__ - Step 28136: {'lr': 0.0004625057916273107, 'samples': 5402112, 'steps': 28135, 'loss/train': 1.384974479675293} 11/07/2021 01:10:04 - INFO - __main__ - Step 28137: {'lr': 0.00046250299627676486, 'samples': 5402304, 'steps': 28136, 'loss/train': 1.0415135622024536} 11/07/2021 01:10:05 - INFO - __main__ - Step 28138: {'lr': 0.0004625002008304685, 'samples': 5402496, 'steps': 28137, 'loss/train': 1.3801599740982056} 11/07/2021 01:10:05 - INFO - __main__ - Step 28139: {'lr': 0.00046249740528842286, 'samples': 5402688, 'steps': 28138, 'loss/train': 1.7484960556030273} 11/07/2021 01:10:06 - INFO - __main__ - Step 28140: {'lr': 0.00046249460965062917, 'samples': 5402880, 'steps': 28139, 'loss/train': 0.8357280492782593} 11/07/2021 01:10:06 - INFO - __main__ - Step 28141: {'lr': 0.0004624918139170887, 'samples': 5403072, 'steps': 28140, 'loss/train': 1.7102713584899902} 11/07/2021 01:10:07 - INFO - __main__ - Step 28142: {'lr': 0.0004624890180878027, 'samples': 5403264, 'steps': 28141, 'loss/train': 1.3664212226867676} 11/07/2021 01:10:07 - INFO - __main__ - Step 28143: {'lr': 0.00046248622216277235, 'samples': 5403456, 'steps': 28142, 'loss/train': 0.8420030474662781} 11/07/2021 01:10:08 - INFO - __main__ - Step 28144: {'lr': 0.0004624834261419991, 'samples': 5403648, 'steps': 28143, 'loss/train': 1.5578577518463135} 11/07/2021 01:10:08 - INFO - __main__ - Step 28145: {'lr': 0.000462480630025484, 'samples': 5403840, 'steps': 28144, 'loss/train': 1.7505838871002197} 11/07/2021 01:10:08 - INFO - __main__ - Step 28146: {'lr': 0.0004624778338132285, 'samples': 5404032, 'steps': 28145, 'loss/train': 1.77066171169281} 11/07/2021 01:10:09 - INFO - __main__ - Step 28147: {'lr': 0.0004624750375052337, 'samples': 5404224, 'steps': 28146, 'loss/train': 1.6351187229156494} 11/07/2021 01:10:10 - INFO - __main__ - Step 28148: {'lr': 0.0004624722411015009, 'samples': 5404416, 'steps': 28147, 'loss/train': 1.363378643989563} 11/07/2021 01:10:10 - INFO - __main__ - Step 28149: {'lr': 0.0004624694446020314, 'samples': 5404608, 'steps': 28148, 'loss/train': 1.5831588506698608} 11/07/2021 01:10:10 - INFO - __main__ - Step 28150: {'lr': 0.0004624666480068265, 'samples': 5404800, 'steps': 28149, 'loss/train': 1.492389440536499} 11/07/2021 01:10:11 - INFO - __main__ - Step 28151: {'lr': 0.0004624638513158874, 'samples': 5404992, 'steps': 28150, 'loss/train': 1.449171543121338} 11/07/2021 01:10:12 - INFO - __main__ - Step 28152: {'lr': 0.0004624610545292154, 'samples': 5405184, 'steps': 28151, 'loss/train': 1.6068949699401855} 11/07/2021 01:10:12 - INFO - __main__ - Step 28153: {'lr': 0.00046245825764681166, 'samples': 5405376, 'steps': 28152, 'loss/train': 1.152055263519287} 11/07/2021 01:10:13 - INFO - __main__ - Step 28154: {'lr': 0.0004624554606686775, 'samples': 5405568, 'steps': 28153, 'loss/train': 1.5159696340560913} 11/07/2021 01:10:13 - INFO - __main__ - Step 28155: {'lr': 0.0004624526635948142, 'samples': 5405760, 'steps': 28154, 'loss/train': 1.4149354696273804} 11/07/2021 01:10:13 - INFO - __main__ - Step 28156: {'lr': 0.000462449866425223, 'samples': 5405952, 'steps': 28155, 'loss/train': 1.441688060760498} 11/07/2021 01:10:14 - INFO - __main__ - Step 28157: {'lr': 0.0004624470691599052, 'samples': 5406144, 'steps': 28156, 'loss/train': 1.7866506576538086} 11/07/2021 01:10:15 - INFO - __main__ - Step 28158: {'lr': 0.00046244427179886207, 'samples': 5406336, 'steps': 28157, 'loss/train': 1.7607122659683228} 11/07/2021 01:10:15 - INFO - __main__ - Step 28159: {'lr': 0.0004624414743420947, 'samples': 5406528, 'steps': 28158, 'loss/train': 0.31811484694480896} 11/07/2021 01:10:15 - INFO - __main__ - Step 28160: {'lr': 0.00046243867678960463, 'samples': 5406720, 'steps': 28159, 'loss/train': 1.0300776958465576} 11/07/2021 01:10:16 - INFO - __main__ - Step 28161: {'lr': 0.00046243587914139285, 'samples': 5406912, 'steps': 28160, 'loss/train': 1.6217018365859985} 11/07/2021 01:10:16 - INFO - __main__ - Step 28162: {'lr': 0.00046243308139746076, 'samples': 5407104, 'steps': 28161, 'loss/train': 1.4089314937591553} 11/07/2021 01:10:17 - INFO - __main__ - Step 28163: {'lr': 0.00046243028355780967, 'samples': 5407296, 'steps': 28162, 'loss/train': 1.966225028038025} 11/07/2021 01:10:18 - INFO - __main__ - Step 28164: {'lr': 0.00046242748562244076, 'samples': 5407488, 'steps': 28163, 'loss/train': 1.695816159248352} 11/07/2021 01:10:18 - INFO - __main__ - Step 28165: {'lr': 0.00046242468759135523, 'samples': 5407680, 'steps': 28164, 'loss/train': 0.20627062022686005} 11/07/2021 01:10:18 - INFO - __main__ - Step 28166: {'lr': 0.00046242188946455444, 'samples': 5407872, 'steps': 28165, 'loss/train': 1.6223536729812622} 11/07/2021 01:10:19 - INFO - __main__ - Step 28167: {'lr': 0.0004624190912420397, 'samples': 5408064, 'steps': 28166, 'loss/train': 1.5699890851974487} 11/07/2021 01:10:20 - INFO - __main__ - Step 28168: {'lr': 0.0004624162929238121, 'samples': 5408256, 'steps': 28167, 'loss/train': 1.6369304656982422} 11/07/2021 01:10:20 - INFO - __main__ - Step 28169: {'lr': 0.000462413494509873, 'samples': 5408448, 'steps': 28168, 'loss/train': 1.537266731262207} 11/07/2021 01:10:20 - INFO - __main__ - Step 28170: {'lr': 0.0004624106960002237, 'samples': 5408640, 'steps': 28169, 'loss/train': 1.0919755697250366} 11/07/2021 01:10:21 - INFO - __main__ - Step 28171: {'lr': 0.0004624078973948654, 'samples': 5408832, 'steps': 28170, 'loss/train': 1.5566425323486328} 11/07/2021 01:10:21 - INFO - __main__ - Step 28172: {'lr': 0.00046240509869379943, 'samples': 5409024, 'steps': 28171, 'loss/train': 1.5323331356048584} 11/07/2021 01:10:22 - INFO - __main__ - Step 28173: {'lr': 0.00046240229989702697, 'samples': 5409216, 'steps': 28172, 'loss/train': 1.5322365760803223} 11/07/2021 01:10:22 - INFO - __main__ - Step 28174: {'lr': 0.0004623995010045493, 'samples': 5409408, 'steps': 28173, 'loss/train': 0.9593502879142761} 11/07/2021 01:10:23 - INFO - __main__ - Step 28175: {'lr': 0.0004623967020163677, 'samples': 5409600, 'steps': 28174, 'loss/train': 1.6781811714172363} 11/07/2021 01:10:23 - INFO - __main__ - Step 28176: {'lr': 0.0004623939029324834, 'samples': 5409792, 'steps': 28175, 'loss/train': 1.395211935043335} 11/07/2021 01:10:23 - INFO - __main__ - Step 28177: {'lr': 0.0004623911037528977, 'samples': 5409984, 'steps': 28176, 'loss/train': 1.3277342319488525} 11/07/2021 01:10:25 - INFO - __main__ - Step 28178: {'lr': 0.00046238830447761184, 'samples': 5410176, 'steps': 28177, 'loss/train': 1.2456005811691284} 11/07/2021 01:10:25 - INFO - __main__ - Step 28179: {'lr': 0.0004623855051066271, 'samples': 5410368, 'steps': 28178, 'loss/train': 1.7877765893936157} 11/07/2021 01:10:25 - INFO - __main__ - Step 28180: {'lr': 0.00046238270563994465, 'samples': 5410560, 'steps': 28179, 'loss/train': 1.342936635017395} 11/07/2021 01:10:26 - INFO - __main__ - Step 28181: {'lr': 0.00046237990607756596, 'samples': 5410752, 'steps': 28180, 'loss/train': 1.268366813659668} 11/07/2021 01:10:26 - INFO - __main__ - Step 28182: {'lr': 0.0004623771064194921, 'samples': 5410944, 'steps': 28181, 'loss/train': 1.347800612449646} 11/07/2021 01:10:27 - INFO - __main__ - Step 28183: {'lr': 0.0004623743066657244, 'samples': 5411136, 'steps': 28182, 'loss/train': 0.17466305196285248} 11/07/2021 01:10:27 - INFO - __main__ - Step 28184: {'lr': 0.00046237150681626414, 'samples': 5411328, 'steps': 28183, 'loss/train': 1.1365277767181396} 11/07/2021 01:10:28 - INFO - __main__ - Step 28185: {'lr': 0.00046236870687111254, 'samples': 5411520, 'steps': 28184, 'loss/train': 0.9038922786712646} 11/07/2021 01:10:28 - INFO - __main__ - Step 28186: {'lr': 0.0004623659068302708, 'samples': 5411712, 'steps': 28185, 'loss/train': 1.4337131977081299} 11/07/2021 01:10:28 - INFO - __main__ - Step 28187: {'lr': 0.00046236310669374035, 'samples': 5411904, 'steps': 28186, 'loss/train': 2.480987071990967} 11/07/2021 01:10:30 - INFO - __main__ - Step 28188: {'lr': 0.0004623603064615223, 'samples': 5412096, 'steps': 28187, 'loss/train': 1.4874110221862793} 11/07/2021 01:10:30 - INFO - __main__ - Step 28189: {'lr': 0.000462357506133618, 'samples': 5412288, 'steps': 28188, 'loss/train': 2.2280004024505615} 11/07/2021 01:10:30 - INFO - __main__ - Step 28190: {'lr': 0.00046235470571002877, 'samples': 5412480, 'steps': 28189, 'loss/train': 1.6419894695281982} 11/07/2021 01:10:31 - INFO - __main__ - Step 28191: {'lr': 0.00046235190519075564, 'samples': 5412672, 'steps': 28190, 'loss/train': 1.4072515964508057} 11/07/2021 01:10:31 - INFO - __main__ - Step 28192: {'lr': 0.00046234910457580014, 'samples': 5412864, 'steps': 28191, 'loss/train': 1.2326654195785522} 11/07/2021 01:10:31 - INFO - __main__ - Step 28193: {'lr': 0.0004623463038651633, 'samples': 5413056, 'steps': 28192, 'loss/train': 5.861023902893066} 11/07/2021 01:10:32 - INFO - __main__ - Step 28194: {'lr': 0.0004623435030588466, 'samples': 5413248, 'steps': 28193, 'loss/train': 5.822354316711426} 11/07/2021 01:10:33 - INFO - __main__ - Step 28195: {'lr': 0.00046234070215685116, 'samples': 5413440, 'steps': 28194, 'loss/train': 0.5922368168830872} 11/07/2021 01:10:33 - INFO - __main__ - Step 28196: {'lr': 0.0004623379011591782, 'samples': 5413632, 'steps': 28195, 'loss/train': 1.4715194702148438} 11/07/2021 01:10:33 - INFO - __main__ - Step 28197: {'lr': 0.00046233510006582913, 'samples': 5413824, 'steps': 28196, 'loss/train': 1.698293924331665} 11/07/2021 01:10:34 - INFO - __main__ - Step 28198: {'lr': 0.00046233229887680517, 'samples': 5414016, 'steps': 28197, 'loss/train': 1.3359768390655518} 11/07/2021 01:10:34 - INFO - __main__ - Step 28199: {'lr': 0.00046232949759210753, 'samples': 5414208, 'steps': 28198, 'loss/train': 1.5249158143997192} 11/07/2021 01:10:35 - INFO - __main__ - Step 28200: {'lr': 0.00046232669621173745, 'samples': 5414400, 'steps': 28199, 'loss/train': 1.369057297706604} 11/07/2021 01:10:36 - INFO - __main__ - Step 28201: {'lr': 0.00046232389473569623, 'samples': 5414592, 'steps': 28200, 'loss/train': 1.5750991106033325} 11/07/2021 01:10:36 - INFO - __main__ - Step 28202: {'lr': 0.0004623210931639852, 'samples': 5414784, 'steps': 28201, 'loss/train': 1.4961779117584229} 11/07/2021 01:10:36 - INFO - __main__ - Step 28203: {'lr': 0.00046231829149660553, 'samples': 5414976, 'steps': 28202, 'loss/train': 1.7517107725143433} 11/07/2021 01:10:37 - INFO - __main__ - Step 28204: {'lr': 0.00046231548973355854, 'samples': 5415168, 'steps': 28203, 'loss/train': 1.505367398262024} 11/07/2021 01:10:38 - INFO - __main__ - Step 28205: {'lr': 0.00046231268787484545, 'samples': 5415360, 'steps': 28204, 'loss/train': 1.7063173055648804} 11/07/2021 01:10:38 - INFO - __main__ - Step 28206: {'lr': 0.0004623098859204675, 'samples': 5415552, 'steps': 28205, 'loss/train': 1.0347256660461426} 11/07/2021 01:10:39 - INFO - __main__ - Step 28207: {'lr': 0.00046230708387042603, 'samples': 5415744, 'steps': 28206, 'loss/train': 1.5646445751190186} 11/07/2021 01:10:39 - INFO - __main__ - Step 28208: {'lr': 0.0004623042817247223, 'samples': 5415936, 'steps': 28207, 'loss/train': 1.6940830945968628} 11/07/2021 01:10:39 - INFO - __main__ - Step 28209: {'lr': 0.00046230147948335746, 'samples': 5416128, 'steps': 28208, 'loss/train': 0.8263731598854065} 11/07/2021 01:10:40 - INFO - __main__ - Step 28210: {'lr': 0.0004622986771463329, 'samples': 5416320, 'steps': 28209, 'loss/train': 0.4169537127017975} 11/07/2021 01:10:41 - INFO - __main__ - Step 28211: {'lr': 0.0004622958747136498, 'samples': 5416512, 'steps': 28210, 'loss/train': 1.35165536403656} 11/07/2021 01:10:41 - INFO - __main__ - Step 28212: {'lr': 0.00046229307218530945, 'samples': 5416704, 'steps': 28211, 'loss/train': 1.235717535018921} 11/07/2021 01:10:41 - INFO - __main__ - Step 28213: {'lr': 0.0004622902695613131, 'samples': 5416896, 'steps': 28212, 'loss/train': 0.730085551738739} 11/07/2021 01:10:42 - INFO - __main__ - Step 28214: {'lr': 0.00046228746684166214, 'samples': 5417088, 'steps': 28213, 'loss/train': 1.573953628540039} 11/07/2021 01:10:43 - INFO - __main__ - Step 28215: {'lr': 0.00046228466402635764, 'samples': 5417280, 'steps': 28214, 'loss/train': 1.9090080261230469} 11/07/2021 01:10:43 - INFO - __main__ - Step 28216: {'lr': 0.0004622818611154009, 'samples': 5417472, 'steps': 28215, 'loss/train': 0.8026754856109619} 11/07/2021 01:10:43 - INFO - __main__ - Step 28217: {'lr': 0.00046227905810879334, 'samples': 5417664, 'steps': 28216, 'loss/train': 1.769519329071045} 11/07/2021 01:10:44 - INFO - __main__ - Step 28218: {'lr': 0.0004622762550065361, 'samples': 5417856, 'steps': 28217, 'loss/train': 1.2289551496505737} 11/07/2021 01:10:44 - INFO - __main__ - Step 28219: {'lr': 0.0004622734518086304, 'samples': 5418048, 'steps': 28218, 'loss/train': 1.9097330570220947} 11/07/2021 01:10:45 - INFO - __main__ - Step 28220: {'lr': 0.0004622706485150776, 'samples': 5418240, 'steps': 28219, 'loss/train': 1.358104944229126} 11/07/2021 01:10:46 - INFO - __main__ - Step 28221: {'lr': 0.0004622678451258788, 'samples': 5418432, 'steps': 28220, 'loss/train': 1.5007022619247437} 11/07/2021 01:10:46 - INFO - __main__ - Step 28222: {'lr': 0.00046226504164103557, 'samples': 5418624, 'steps': 28221, 'loss/train': 1.2808752059936523} 11/07/2021 01:10:46 - INFO - __main__ - Step 28223: {'lr': 0.0004622622380605489, 'samples': 5418816, 'steps': 28222, 'loss/train': 1.0220810174942017} 11/07/2021 01:10:47 - INFO - __main__ - Step 28224: {'lr': 0.0004622594343844201, 'samples': 5419008, 'steps': 28223, 'loss/train': 1.1426414251327515} 11/07/2021 01:10:47 - INFO - __main__ - Step 28225: {'lr': 0.00046225663061265056, 'samples': 5419200, 'steps': 28224, 'loss/train': 1.3588509559631348} 11/07/2021 01:10:48 - INFO - __main__ - Step 28226: {'lr': 0.0004622538267452414, 'samples': 5419392, 'steps': 28225, 'loss/train': 1.5352381467819214} 11/07/2021 01:10:48 - INFO - __main__ - Step 28227: {'lr': 0.00046225102278219394, 'samples': 5419584, 'steps': 28226, 'loss/train': 1.7459994554519653} 11/07/2021 01:10:49 - INFO - __main__ - Step 28228: {'lr': 0.0004622482187235094, 'samples': 5419776, 'steps': 28227, 'loss/train': 1.6979622840881348} 11/07/2021 01:10:49 - INFO - __main__ - Step 28229: {'lr': 0.00046224541456918916, 'samples': 5419968, 'steps': 28228, 'loss/train': 1.3146947622299194} 11/07/2021 01:10:49 - INFO - __main__ - Step 28230: {'lr': 0.0004622426103192344, 'samples': 5420160, 'steps': 28229, 'loss/train': 1.72724187374115} 11/07/2021 01:10:50 - INFO - __main__ - Step 28231: {'lr': 0.00046223980597364647, 'samples': 5420352, 'steps': 28230, 'loss/train': 1.4171353578567505} 11/07/2021 01:10:51 - INFO - __main__ - Step 28232: {'lr': 0.0004622370015324264, 'samples': 5420544, 'steps': 28231, 'loss/train': 1.5277044773101807} 11/07/2021 01:10:51 - INFO - __main__ - Step 28233: {'lr': 0.0004622341969955757, 'samples': 5420736, 'steps': 28232, 'loss/train': 1.284793734550476} 11/07/2021 01:10:51 - INFO - __main__ - Step 28234: {'lr': 0.00046223139236309553, 'samples': 5420928, 'steps': 28233, 'loss/train': 1.54947030544281} 11/07/2021 01:10:52 - INFO - __main__ - Step 28235: {'lr': 0.0004622285876349872, 'samples': 5421120, 'steps': 28234, 'loss/train': 2.010934591293335} 11/07/2021 01:10:53 - INFO - __main__ - Step 28236: {'lr': 0.00046222578281125194, 'samples': 5421312, 'steps': 28235, 'loss/train': 1.3534091711044312} 11/07/2021 01:10:53 - INFO - __main__ - Step 28237: {'lr': 0.0004622229778918909, 'samples': 5421504, 'steps': 28236, 'loss/train': 1.2735710144042969} 11/07/2021 01:10:53 - INFO - __main__ - Step 28238: {'lr': 0.00046222017287690566, 'samples': 5421696, 'steps': 28237, 'loss/train': 1.203763484954834} 11/07/2021 01:10:54 - INFO - __main__ - Step 28239: {'lr': 0.00046221736776629713, 'samples': 5421888, 'steps': 28238, 'loss/train': 2.086308002471924} 11/07/2021 01:10:54 - INFO - __main__ - Step 28240: {'lr': 0.0004622145625600668, 'samples': 5422080, 'steps': 28239, 'loss/train': 1.3777828216552734} 11/07/2021 01:10:55 - INFO - __main__ - Step 28241: {'lr': 0.00046221175725821585, 'samples': 5422272, 'steps': 28240, 'loss/train': 1.2123701572418213} 11/07/2021 01:10:55 - INFO - __main__ - Step 28242: {'lr': 0.00046220895186074553, 'samples': 5422464, 'steps': 28241, 'loss/train': 1.9083006381988525} 11/07/2021 01:10:56 - INFO - __main__ - Step 28243: {'lr': 0.0004622061463676572, 'samples': 5422656, 'steps': 28242, 'loss/train': 1.196201205253601} 11/07/2021 01:10:56 - INFO - __main__ - Step 28244: {'lr': 0.000462203340778952, 'samples': 5422848, 'steps': 28243, 'loss/train': 1.1842589378356934} 11/07/2021 01:10:57 - INFO - __main__ - Step 28245: {'lr': 0.0004622005350946312, 'samples': 5423040, 'steps': 28244, 'loss/train': 1.2179888486862183} 11/07/2021 01:10:57 - INFO - __main__ - Step 28246: {'lr': 0.00046219772931469617, 'samples': 5423232, 'steps': 28245, 'loss/train': 1.2169467210769653} 11/07/2021 01:10:58 - INFO - __main__ - Step 28247: {'lr': 0.00046219492343914815, 'samples': 5423424, 'steps': 28246, 'loss/train': 1.2924308776855469} 11/07/2021 01:10:58 - INFO - __main__ - Step 28248: {'lr': 0.00046219211746798835, 'samples': 5423616, 'steps': 28247, 'loss/train': 1.3133745193481445} 11/07/2021 01:10:59 - INFO - __main__ - Step 28249: {'lr': 0.000462189311401218, 'samples': 5423808, 'steps': 28248, 'loss/train': 0.5545594096183777} 11/07/2021 01:10:59 - INFO - __main__ - Step 28250: {'lr': 0.0004621865052388385, 'samples': 5424000, 'steps': 28249, 'loss/train': 1.7208218574523926} 11/07/2021 01:11:00 - INFO - __main__ - Step 28251: {'lr': 0.00046218369898085097, 'samples': 5424192, 'steps': 28250, 'loss/train': 1.277891755104065} 11/07/2021 01:11:00 - INFO - __main__ - Step 28252: {'lr': 0.0004621808926272568, 'samples': 5424384, 'steps': 28251, 'loss/train': 1.8041847944259644} 11/07/2021 01:11:01 - INFO - __main__ - Step 28253: {'lr': 0.0004621780861780572, 'samples': 5424576, 'steps': 28252, 'loss/train': 1.4097659587860107} 11/07/2021 01:11:01 - INFO - __main__ - Step 28254: {'lr': 0.00046217527963325335, 'samples': 5424768, 'steps': 28253, 'loss/train': 1.9377212524414062} 11/07/2021 01:11:01 - INFO - __main__ - Step 28255: {'lr': 0.00046217247299284666, 'samples': 5424960, 'steps': 28254, 'loss/train': 2.2206597328186035} 11/07/2021 01:11:02 - INFO - __main__ - Step 28256: {'lr': 0.00046216966625683834, 'samples': 5425152, 'steps': 28255, 'loss/train': 0.9910633563995361} 11/07/2021 01:11:03 - INFO - __main__ - Step 28257: {'lr': 0.00046216685942522957, 'samples': 5425344, 'steps': 28256, 'loss/train': 1.0055487155914307} 11/07/2021 01:11:03 - INFO - __main__ - Step 28258: {'lr': 0.00046216405249802176, 'samples': 5425536, 'steps': 28257, 'loss/train': 1.5653692483901978} 11/07/2021 01:11:04 - INFO - __main__ - Step 28259: {'lr': 0.000462161245475216, 'samples': 5425728, 'steps': 28258, 'loss/train': 2.771857976913452} 11/07/2021 01:11:04 - INFO - __main__ - Step 28260: {'lr': 0.0004621584383568137, 'samples': 5425920, 'steps': 28259, 'loss/train': 0.6507758498191833} 11/07/2021 01:11:04 - INFO - __main__ - Step 28261: {'lr': 0.00046215563114281613, 'samples': 5426112, 'steps': 28260, 'loss/train': 1.4386929273605347} 11/07/2021 01:11:06 - INFO - __main__ - Step 28262: {'lr': 0.0004621528238332245, 'samples': 5426304, 'steps': 28261, 'loss/train': 1.5386488437652588} 11/07/2021 01:11:06 - INFO - __main__ - Step 28263: {'lr': 0.00046215001642804, 'samples': 5426496, 'steps': 28262, 'loss/train': 1.4531859159469604} 11/07/2021 01:11:06 - INFO - __main__ - Step 28264: {'lr': 0.0004621472089272641, 'samples': 5426688, 'steps': 28263, 'loss/train': 1.2668935060501099} 11/07/2021 01:11:07 - INFO - __main__ - Step 28265: {'lr': 0.0004621444013308979, 'samples': 5426880, 'steps': 28264, 'loss/train': 1.032285451889038} 11/07/2021 01:11:07 - INFO - __main__ - Step 28266: {'lr': 0.00046214159363894264, 'samples': 5427072, 'steps': 28265, 'loss/train': 1.93840491771698} 11/07/2021 01:11:08 - INFO - __main__ - Step 28267: {'lr': 0.0004621387858513997, 'samples': 5427264, 'steps': 28266, 'loss/train': 0.8712663650512695} 11/07/2021 01:11:08 - INFO - __main__ - Step 28268: {'lr': 0.0004621359779682703, 'samples': 5427456, 'steps': 28267, 'loss/train': 1.6369661092758179} 11/07/2021 01:11:09 - INFO - __main__ - Step 28269: {'lr': 0.0004621331699895557, 'samples': 5427648, 'steps': 28268, 'loss/train': 1.922584056854248} 11/07/2021 01:11:09 - INFO - __main__ - Step 28270: {'lr': 0.00046213036191525714, 'samples': 5427840, 'steps': 28269, 'loss/train': 1.4946922063827515} 11/07/2021 01:11:09 - INFO - __main__ - Step 28271: {'lr': 0.00046212755374537594, 'samples': 5428032, 'steps': 28270, 'loss/train': 1.5393568277359009} 11/07/2021 01:11:10 - INFO - __main__ - Step 28272: {'lr': 0.0004621247454799133, 'samples': 5428224, 'steps': 28271, 'loss/train': 1.8413314819335938} 11/07/2021 01:11:11 - INFO - __main__ - Step 28273: {'lr': 0.0004621219371188706, 'samples': 5428416, 'steps': 28272, 'loss/train': 1.6785149574279785} 11/07/2021 01:11:11 - INFO - __main__ - Step 28274: {'lr': 0.0004621191286622489, 'samples': 5428608, 'steps': 28273, 'loss/train': 1.8030606508255005} 11/07/2021 01:11:11 - INFO - __main__ - Step 28275: {'lr': 0.00046211632011004973, 'samples': 5428800, 'steps': 28274, 'loss/train': 1.560758113861084} 11/07/2021 01:11:12 - INFO - __main__ - Step 28276: {'lr': 0.0004621135114622742, 'samples': 5428992, 'steps': 28275, 'loss/train': 1.2767295837402344} 11/07/2021 01:11:13 - INFO - __main__ - Step 28277: {'lr': 0.00046211070271892353, 'samples': 5429184, 'steps': 28276, 'loss/train': 1.8365651369094849} 11/07/2021 01:11:13 - INFO - __main__ - Step 28278: {'lr': 0.00046210789387999906, 'samples': 5429376, 'steps': 28277, 'loss/train': 1.224928379058838} 11/07/2021 01:11:13 - INFO - __main__ - Step 28279: {'lr': 0.00046210508494550206, 'samples': 5429568, 'steps': 28278, 'loss/train': 1.620279312133789} 11/07/2021 01:11:14 - INFO - __main__ - Step 28280: {'lr': 0.0004621022759154338, 'samples': 5429760, 'steps': 28279, 'loss/train': 1.5172548294067383} 11/07/2021 01:11:14 - INFO - __main__ - Step 28281: {'lr': 0.0004620994667897955, 'samples': 5429952, 'steps': 28280, 'loss/train': 1.7610883712768555} 11/07/2021 01:11:15 - INFO - __main__ - Step 28282: {'lr': 0.0004620966575685885, 'samples': 5430144, 'steps': 28281, 'loss/train': 1.4005475044250488} 11/07/2021 01:11:16 - INFO - __main__ - Step 28283: {'lr': 0.000462093848251814, 'samples': 5430336, 'steps': 28282, 'loss/train': 1.6726186275482178} 11/07/2021 01:11:16 - INFO - __main__ - Step 28284: {'lr': 0.00046209103883947323, 'samples': 5430528, 'steps': 28283, 'loss/train': 1.5743645429611206} 11/07/2021 01:11:16 - INFO - __main__ - Step 28285: {'lr': 0.00046208822933156756, 'samples': 5430720, 'steps': 28284, 'loss/train': 1.523167371749878} 11/07/2021 01:11:17 - INFO - __main__ - Step 28286: {'lr': 0.00046208541972809824, 'samples': 5430912, 'steps': 28285, 'loss/train': 1.5007579326629639} 11/07/2021 01:11:17 - INFO - __main__ - Step 28287: {'lr': 0.00046208261002906643, 'samples': 5431104, 'steps': 28286, 'loss/train': 1.5673551559448242} 11/07/2021 01:11:18 - INFO - __main__ - Step 28288: {'lr': 0.00046207980023447347, 'samples': 5431296, 'steps': 28287, 'loss/train': 1.0199003219604492} 11/07/2021 01:11:18 - INFO - __main__ - Step 28289: {'lr': 0.0004620769903443207, 'samples': 5431488, 'steps': 28288, 'loss/train': 1.2680659294128418} 11/07/2021 01:11:19 - INFO - __main__ - Step 28290: {'lr': 0.00046207418035860927, 'samples': 5431680, 'steps': 28289, 'loss/train': 1.4011750221252441} 11/07/2021 01:11:19 - INFO - __main__ - Step 28291: {'lr': 0.00046207137027734046, 'samples': 5431872, 'steps': 28290, 'loss/train': 1.3798640966415405} 11/07/2021 01:11:19 - INFO - __main__ - Step 28292: {'lr': 0.00046206856010051555, 'samples': 5432064, 'steps': 28291, 'loss/train': 1.6333099603652954} 11/07/2021 01:11:20 - INFO - __main__ - Step 28293: {'lr': 0.0004620657498281359, 'samples': 5432256, 'steps': 28292, 'loss/train': 1.6269264221191406} 11/07/2021 01:11:21 - INFO - __main__ - Step 28294: {'lr': 0.0004620629394602027, 'samples': 5432448, 'steps': 28293, 'loss/train': 1.6605522632598877} 11/07/2021 01:11:21 - INFO - __main__ - Step 28295: {'lr': 0.00046206012899671715, 'samples': 5432640, 'steps': 28294, 'loss/train': 1.629433035850525} 11/07/2021 01:11:21 - INFO - __main__ - Step 28296: {'lr': 0.00046205731843768056, 'samples': 5432832, 'steps': 28295, 'loss/train': 1.2553709745407104} 11/07/2021 01:11:22 - INFO - __main__ - Step 28297: {'lr': 0.0004620545077830942, 'samples': 5433024, 'steps': 28296, 'loss/train': 1.2244160175323486} 11/07/2021 01:11:23 - INFO - __main__ - Step 28298: {'lr': 0.00046205169703295945, 'samples': 5433216, 'steps': 28297, 'loss/train': 1.3817449808120728} 11/07/2021 01:11:24 - INFO - __main__ - Step 28299: {'lr': 0.00046204888618727743, 'samples': 5433408, 'steps': 28298, 'loss/train': 2.1621477603912354} 11/07/2021 01:11:24 - INFO - __main__ - Step 28300: {'lr': 0.00046204607524604944, 'samples': 5433600, 'steps': 28299, 'loss/train': 1.4274929761886597} 11/07/2021 01:11:24 - INFO - __main__ - Step 28301: {'lr': 0.0004620432642092768, 'samples': 5433792, 'steps': 28300, 'loss/train': 0.30110833048820496} 11/07/2021 01:11:25 - INFO - __main__ - Step 28302: {'lr': 0.00046204045307696065, 'samples': 5433984, 'steps': 28301, 'loss/train': 1.907424807548523} 11/07/2021 01:11:26 - INFO - __main__ - Step 28303: {'lr': 0.0004620376418491024, 'samples': 5434176, 'steps': 28302, 'loss/train': 1.513163447380066} 11/07/2021 01:11:26 - INFO - __main__ - Step 28304: {'lr': 0.0004620348305257033, 'samples': 5434368, 'steps': 28303, 'loss/train': 1.9480259418487549} 11/07/2021 01:11:27 - INFO - __main__ - Step 28305: {'lr': 0.00046203201910676453, 'samples': 5434560, 'steps': 28304, 'loss/train': 1.8017507791519165} 11/07/2021 01:11:27 - INFO - __main__ - Step 28306: {'lr': 0.0004620292075922874, 'samples': 5434752, 'steps': 28305, 'loss/train': 1.4269638061523438} 11/07/2021 01:11:27 - INFO - __main__ - Step 28307: {'lr': 0.0004620263959822732, 'samples': 5434944, 'steps': 28306, 'loss/train': 1.9615912437438965} 11/07/2021 01:11:28 - INFO - __main__ - Step 28308: {'lr': 0.00046202358427672313, 'samples': 5435136, 'steps': 28307, 'loss/train': 1.8395740985870361} 11/07/2021 01:11:29 - INFO - __main__ - Step 28309: {'lr': 0.0004620207724756386, 'samples': 5435328, 'steps': 28308, 'loss/train': 0.8543437719345093} 11/07/2021 01:11:29 - INFO - __main__ - Step 28310: {'lr': 0.0004620179605790207, 'samples': 5435520, 'steps': 28309, 'loss/train': 1.6621850728988647} 11/07/2021 01:11:29 - INFO - __main__ - Step 28311: {'lr': 0.00046201514858687075, 'samples': 5435712, 'steps': 28310, 'loss/train': 1.4606200456619263} 11/07/2021 01:11:30 - INFO - __main__ - Step 28312: {'lr': 0.00046201233649919015, 'samples': 5435904, 'steps': 28311, 'loss/train': 2.1351730823516846} 11/07/2021 01:11:31 - INFO - __main__ - Step 28313: {'lr': 0.00046200952431598, 'samples': 5436096, 'steps': 28312, 'loss/train': 1.5596314668655396} 11/07/2021 01:11:31 - INFO - __main__ - Step 28314: {'lr': 0.00046200671203724166, 'samples': 5436288, 'steps': 28313, 'loss/train': 1.620673418045044} 11/07/2021 01:11:31 - INFO - __main__ - Step 28315: {'lr': 0.00046200389966297633, 'samples': 5436480, 'steps': 28314, 'loss/train': 1.219846248626709} 11/07/2021 01:11:32 - INFO - __main__ - Step 28316: {'lr': 0.00046200108719318537, 'samples': 5436672, 'steps': 28315, 'loss/train': 1.4547991752624512} 11/07/2021 01:11:32 - INFO - __main__ - Step 28317: {'lr': 0.0004619982746278699, 'samples': 5436864, 'steps': 28316, 'loss/train': 1.4756008386611938} 11/07/2021 01:11:33 - INFO - __main__ - Step 28318: {'lr': 0.00046199546196703134, 'samples': 5437056, 'steps': 28317, 'loss/train': 1.5366729497909546} 11/07/2021 01:11:34 - INFO - __main__ - Step 28319: {'lr': 0.0004619926492106709, 'samples': 5437248, 'steps': 28318, 'loss/train': 1.4585894346237183} 11/07/2021 01:11:34 - INFO - __main__ - Step 28320: {'lr': 0.0004619898363587899, 'samples': 5437440, 'steps': 28319, 'loss/train': 1.9013971090316772} 11/07/2021 01:11:34 - INFO - __main__ - Step 28321: {'lr': 0.00046198702341138944, 'samples': 5437632, 'steps': 28320, 'loss/train': 1.7438561916351318} 11/07/2021 01:11:35 - INFO - __main__ - Step 28322: {'lr': 0.00046198421036847093, 'samples': 5437824, 'steps': 28321, 'loss/train': 1.4920424222946167} 11/07/2021 01:11:35 - INFO - __main__ - Step 28323: {'lr': 0.00046198139723003563, 'samples': 5438016, 'steps': 28322, 'loss/train': 0.998529851436615} 11/07/2021 01:11:36 - INFO - __main__ - Step 28324: {'lr': 0.00046197858399608477, 'samples': 5438208, 'steps': 28323, 'loss/train': 1.9227137565612793} 11/07/2021 01:11:36 - INFO - __main__ - Step 28325: {'lr': 0.00046197577066661965, 'samples': 5438400, 'steps': 28324, 'loss/train': 1.6951842308044434} 11/07/2021 01:11:37 - INFO - __main__ - Step 28326: {'lr': 0.0004619729572416415, 'samples': 5438592, 'steps': 28325, 'loss/train': 1.8733631372451782} 11/07/2021 01:11:37 - INFO - __main__ - Step 28327: {'lr': 0.0004619701437211516, 'samples': 5438784, 'steps': 28326, 'loss/train': 1.6878166198730469} 11/07/2021 01:11:37 - INFO - __main__ - Step 28328: {'lr': 0.00046196733010515125, 'samples': 5438976, 'steps': 28327, 'loss/train': 1.9754751920700073} 11/07/2021 01:11:38 - INFO - __main__ - Step 28329: {'lr': 0.0004619645163936417, 'samples': 5439168, 'steps': 28328, 'loss/train': 1.7113882303237915} 11/07/2021 01:11:39 - INFO - __main__ - Step 28330: {'lr': 0.0004619617025866242, 'samples': 5439360, 'steps': 28329, 'loss/train': 0.6419276595115662} 11/07/2021 01:11:39 - INFO - __main__ - Step 28331: {'lr': 0.00046195888868409994, 'samples': 5439552, 'steps': 28330, 'loss/train': 1.6990545988082886} 11/07/2021 01:11:39 - INFO - __main__ - Step 28332: {'lr': 0.0004619560746860704, 'samples': 5439744, 'steps': 28331, 'loss/train': 1.4887057542800903} 11/07/2021 01:11:40 - INFO - __main__ - Step 28333: {'lr': 0.0004619532605925366, 'samples': 5439936, 'steps': 28332, 'loss/train': 1.4601359367370605} 11/07/2021 01:11:41 - INFO - __main__ - Step 28334: {'lr': 0.00046195044640350003, 'samples': 5440128, 'steps': 28333, 'loss/train': 1.3070403337478638} 11/07/2021 01:11:41 - INFO - __main__ - Step 28335: {'lr': 0.00046194763211896187, 'samples': 5440320, 'steps': 28334, 'loss/train': 1.5026382207870483} 11/07/2021 01:11:42 - INFO - __main__ - Step 28336: {'lr': 0.0004619448177389233, 'samples': 5440512, 'steps': 28335, 'loss/train': 1.9780086278915405} 11/07/2021 01:11:42 - INFO - __main__ - Step 28337: {'lr': 0.0004619420032633857, 'samples': 5440704, 'steps': 28336, 'loss/train': 1.5093754529953003} 11/07/2021 01:11:42 - INFO - __main__ - Step 28338: {'lr': 0.0004619391886923503, 'samples': 5440896, 'steps': 28337, 'loss/train': 1.0746419429779053} 11/07/2021 01:11:43 - INFO - __main__ - Step 28339: {'lr': 0.0004619363740258184, 'samples': 5441088, 'steps': 28338, 'loss/train': 1.0466197729110718} 11/07/2021 01:11:44 - INFO - __main__ - Step 28340: {'lr': 0.00046193355926379124, 'samples': 5441280, 'steps': 28339, 'loss/train': 1.5124543905258179} 11/07/2021 01:11:44 - INFO - __main__ - Step 28341: {'lr': 0.00046193074440627, 'samples': 5441472, 'steps': 28340, 'loss/train': 1.259055733680725} 11/07/2021 01:11:44 - INFO - __main__ - Step 28342: {'lr': 0.0004619279294532561, 'samples': 5441664, 'steps': 28341, 'loss/train': 1.285418152809143} 11/07/2021 01:11:45 - INFO - __main__ - Step 28343: {'lr': 0.00046192511440475083, 'samples': 5441856, 'steps': 28342, 'loss/train': 1.717706561088562} 11/07/2021 01:11:46 - INFO - __main__ - Step 28344: {'lr': 0.00046192229926075526, 'samples': 5442048, 'steps': 28343, 'loss/train': 1.809889316558838} 11/07/2021 01:11:46 - INFO - __main__ - Step 28345: {'lr': 0.0004619194840212708, 'samples': 5442240, 'steps': 28344, 'loss/train': 1.3425153493881226} 11/07/2021 01:11:47 - INFO - __main__ - Step 28346: {'lr': 0.0004619166686862987, 'samples': 5442432, 'steps': 28345, 'loss/train': 1.6994646787643433} 11/07/2021 01:11:47 - INFO - __main__ - Step 28347: {'lr': 0.0004619138532558402, 'samples': 5442624, 'steps': 28346, 'loss/train': 1.3344404697418213} 11/07/2021 01:11:47 - INFO - __main__ - Step 28348: {'lr': 0.00046191103772989664, 'samples': 5442816, 'steps': 28347, 'loss/train': 2.1455984115600586} 11/07/2021 01:11:48 - INFO - __main__ - Step 28349: {'lr': 0.00046190822210846917, 'samples': 5443008, 'steps': 28348, 'loss/train': 1.6893006563186646} 11/07/2021 01:11:49 - INFO - __main__ - Step 28350: {'lr': 0.0004619054063915592, 'samples': 5443200, 'steps': 28349, 'loss/train': 2.1755573749542236} 11/07/2021 01:11:49 - INFO - __main__ - Step 28351: {'lr': 0.00046190259057916786, 'samples': 5443392, 'steps': 28350, 'loss/train': 1.812320351600647} 11/07/2021 01:11:49 - INFO - __main__ - Step 28352: {'lr': 0.0004618997746712965, 'samples': 5443584, 'steps': 28351, 'loss/train': 1.5001801252365112} 11/07/2021 01:11:50 - INFO - __main__ - Step 28353: {'lr': 0.00046189695866794635, 'samples': 5443776, 'steps': 28352, 'loss/train': 1.3247753381729126} 11/07/2021 01:11:50 - INFO - __main__ - Step 28354: {'lr': 0.00046189414256911875, 'samples': 5443968, 'steps': 28353, 'loss/train': 1.6866405010223389} 11/07/2021 01:11:51 - INFO - __main__ - Step 28355: {'lr': 0.0004618913263748149, 'samples': 5444160, 'steps': 28354, 'loss/train': 1.6076704263687134} 11/07/2021 01:11:52 - INFO - __main__ - Step 28356: {'lr': 0.0004618885100850361, 'samples': 5444352, 'steps': 28355, 'loss/train': 1.0575443506240845} 11/07/2021 01:11:52 - INFO - __main__ - Step 28357: {'lr': 0.0004618856936997836, 'samples': 5444544, 'steps': 28356, 'loss/train': 1.492514729499817} 11/07/2021 01:11:52 - INFO - __main__ - Step 28358: {'lr': 0.0004618828772190586, 'samples': 5444736, 'steps': 28357, 'loss/train': 0.2902561128139496} 11/07/2021 01:11:53 - INFO - __main__ - Step 28359: {'lr': 0.0004618800606428626, 'samples': 5444928, 'steps': 28358, 'loss/train': 1.3694254159927368} 11/07/2021 01:11:53 - INFO - __main__ - Step 28360: {'lr': 0.00046187724397119657, 'samples': 5445120, 'steps': 28359, 'loss/train': 1.4556535482406616} 11/07/2021 01:11:54 - INFO - __main__ - Step 28361: {'lr': 0.000461874427204062, 'samples': 5445312, 'steps': 28360, 'loss/train': 1.813988447189331} 11/07/2021 01:11:55 - INFO - __main__ - Step 28362: {'lr': 0.00046187161034146, 'samples': 5445504, 'steps': 28361, 'loss/train': 0.8661501407623291} 11/07/2021 01:11:55 - INFO - __main__ - Step 28363: {'lr': 0.00046186879338339207, 'samples': 5445696, 'steps': 28362, 'loss/train': 1.7381328344345093} 11/07/2021 01:11:55 - INFO - __main__ - Step 28364: {'lr': 0.0004618659763298592, 'samples': 5445888, 'steps': 28363, 'loss/train': 1.5734864473342896} 11/07/2021 01:11:56 - INFO - __main__ - Step 28365: {'lr': 0.00046186315918086285, 'samples': 5446080, 'steps': 28364, 'loss/train': 1.5111933946609497} 11/07/2021 01:11:57 - INFO - __main__ - Step 28366: {'lr': 0.0004618603419364042, 'samples': 5446272, 'steps': 28365, 'loss/train': 1.3224831819534302} 11/07/2021 01:11:57 - INFO - __main__ - Step 28367: {'lr': 0.00046185752459648456, 'samples': 5446464, 'steps': 28366, 'loss/train': 1.7753583192825317} 11/07/2021 01:11:57 - INFO - __main__ - Step 28368: {'lr': 0.00046185470716110516, 'samples': 5446656, 'steps': 28367, 'loss/train': 1.8063596487045288} 11/07/2021 01:11:58 - INFO - __main__ - Step 28369: {'lr': 0.00046185188963026734, 'samples': 5446848, 'steps': 28368, 'loss/train': 1.3005485534667969} 11/07/2021 01:11:58 - INFO - __main__ - Step 28370: {'lr': 0.0004618490720039723, 'samples': 5447040, 'steps': 28369, 'loss/train': 1.5186891555786133} 11/07/2021 01:11:59 - INFO - __main__ - Step 28371: {'lr': 0.0004618462542822214, 'samples': 5447232, 'steps': 28370, 'loss/train': 1.5141217708587646} 11/07/2021 01:12:00 - INFO - __main__ - Step 28372: {'lr': 0.0004618434364650158, 'samples': 5447424, 'steps': 28371, 'loss/train': 1.753641963005066} 11/07/2021 01:12:00 - INFO - __main__ - Step 28373: {'lr': 0.00046184061855235683, 'samples': 5447616, 'steps': 28372, 'loss/train': 1.5009593963623047} 11/07/2021 01:12:00 - INFO - __main__ - Step 28374: {'lr': 0.00046183780054424574, 'samples': 5447808, 'steps': 28373, 'loss/train': 1.1196340322494507} 11/07/2021 01:12:01 - INFO - __main__ - Step 28375: {'lr': 0.00046183498244068376, 'samples': 5448000, 'steps': 28374, 'loss/train': 1.3198537826538086} 11/07/2021 01:12:02 - INFO - __main__ - Step 28376: {'lr': 0.00046183216424167226, 'samples': 5448192, 'steps': 28375, 'loss/train': 1.1616865396499634} 11/07/2021 01:12:02 - INFO - __main__ - Step 28377: {'lr': 0.0004618293459472124, 'samples': 5448384, 'steps': 28376, 'loss/train': 1.6450309753417969} 11/07/2021 01:12:02 - INFO - __main__ - Step 28378: {'lr': 0.0004618265275573056, 'samples': 5448576, 'steps': 28377, 'loss/train': 1.5486198663711548} 11/07/2021 01:12:03 - INFO - __main__ - Step 28379: {'lr': 0.00046182370907195294, 'samples': 5448768, 'steps': 28378, 'loss/train': 1.7771023511886597} 11/07/2021 01:12:03 - INFO - __main__ - Step 28380: {'lr': 0.00046182089049115585, 'samples': 5448960, 'steps': 28379, 'loss/train': 1.8674331903457642} 11/07/2021 01:12:03 - INFO - __main__ - Step 28381: {'lr': 0.0004618180718149155, 'samples': 5449152, 'steps': 28380, 'loss/train': 2.1471502780914307} 11/07/2021 01:12:05 - INFO - __main__ - Step 28382: {'lr': 0.00046181525304323325, 'samples': 5449344, 'steps': 28381, 'loss/train': 1.8558167219161987} 11/07/2021 01:12:05 - INFO - __main__ - Step 28383: {'lr': 0.0004618124341761102, 'samples': 5449536, 'steps': 28382, 'loss/train': 1.9591509103775024} 11/07/2021 01:12:05 - INFO - __main__ - Step 28384: {'lr': 0.0004618096152135478, 'samples': 5449728, 'steps': 28383, 'loss/train': 1.861006736755371} 11/07/2021 01:12:06 - INFO - __main__ - Step 28385: {'lr': 0.00046180679615554735, 'samples': 5449920, 'steps': 28384, 'loss/train': 1.9714553356170654} 11/07/2021 01:12:06 - INFO - __main__ - Step 28386: {'lr': 0.00046180397700210985, 'samples': 5450112, 'steps': 28385, 'loss/train': 1.4372633695602417} 11/07/2021 01:12:07 - INFO - __main__ - Step 28387: {'lr': 0.0004618011577532368, 'samples': 5450304, 'steps': 28386, 'loss/train': 1.5484124422073364} 11/07/2021 01:12:07 - INFO - __main__ - Step 28388: {'lr': 0.0004617983384089295, 'samples': 5450496, 'steps': 28387, 'loss/train': 1.984816074371338} 11/07/2021 01:12:08 - INFO - __main__ - Step 28389: {'lr': 0.00046179551896918916, 'samples': 5450688, 'steps': 28388, 'loss/train': 1.4153586626052856} 11/07/2021 01:12:08 - INFO - __main__ - Step 28390: {'lr': 0.00046179269943401693, 'samples': 5450880, 'steps': 28389, 'loss/train': 1.8363102674484253} 11/07/2021 01:12:08 - INFO - __main__ - Step 28391: {'lr': 0.00046178987980341414, 'samples': 5451072, 'steps': 28390, 'loss/train': 1.4534401893615723} 11/07/2021 01:12:10 - INFO - __main__ - Step 28392: {'lr': 0.00046178706007738227, 'samples': 5451264, 'steps': 28391, 'loss/train': 1.542893648147583} 11/07/2021 01:12:10 - INFO - __main__ - Step 28393: {'lr': 0.0004617842402559223, 'samples': 5451456, 'steps': 28392, 'loss/train': 1.431994915008545} 11/07/2021 01:12:10 - INFO - __main__ - Step 28394: {'lr': 0.0004617814203390356, 'samples': 5451648, 'steps': 28393, 'loss/train': 2.0125787258148193} 11/07/2021 01:12:11 - INFO - __main__ - Step 28395: {'lr': 0.0004617786003267235, 'samples': 5451840, 'steps': 28394, 'loss/train': 1.5543104410171509} 11/07/2021 01:12:11 - INFO - __main__ - Step 28396: {'lr': 0.00046177578021898717, 'samples': 5452032, 'steps': 28395, 'loss/train': 1.5467292070388794} 11/07/2021 01:12:12 - INFO - __main__ - Step 28397: {'lr': 0.000461772960015828, 'samples': 5452224, 'steps': 28396, 'loss/train': 1.1211336851119995} 11/07/2021 01:12:12 - INFO - __main__ - Step 28398: {'lr': 0.00046177013971724723, 'samples': 5452416, 'steps': 28397, 'loss/train': 1.3289721012115479} 11/07/2021 01:12:13 - INFO - __main__ - Step 28399: {'lr': 0.00046176731932324604, 'samples': 5452608, 'steps': 28398, 'loss/train': 1.5812649726867676} 11/07/2021 01:12:13 - INFO - __main__ - Step 28400: {'lr': 0.0004617644988338258, 'samples': 5452800, 'steps': 28399, 'loss/train': 1.628312110900879} 11/07/2021 01:12:13 - INFO - __main__ - Step 28401: {'lr': 0.0004617616782489877, 'samples': 5452992, 'steps': 28400, 'loss/train': 1.4269146919250488} 11/07/2021 01:12:14 - INFO - __main__ - Step 28402: {'lr': 0.00046175885756873314, 'samples': 5453184, 'steps': 28401, 'loss/train': 1.520078420639038} 11/07/2021 01:12:15 - INFO - __main__ - Step 28403: {'lr': 0.00046175603679306324, 'samples': 5453376, 'steps': 28402, 'loss/train': 1.4348992109298706} 11/07/2021 01:12:15 - INFO - __main__ - Step 28404: {'lr': 0.0004617532159219794, 'samples': 5453568, 'steps': 28403, 'loss/train': 1.6966314315795898} 11/07/2021 01:12:15 - INFO - __main__ - Step 28405: {'lr': 0.0004617503949554828, 'samples': 5453760, 'steps': 28404, 'loss/train': 1.50644052028656} 11/07/2021 01:12:16 - INFO - __main__ - Step 28406: {'lr': 0.0004617475738935747, 'samples': 5453952, 'steps': 28405, 'loss/train': 1.5906078815460205} 11/07/2021 01:12:17 - INFO - __main__ - Step 28407: {'lr': 0.0004617447527362564, 'samples': 5454144, 'steps': 28406, 'loss/train': 1.3887377977371216} 11/07/2021 01:12:17 - INFO - __main__ - Step 28408: {'lr': 0.00046174193148352914, 'samples': 5454336, 'steps': 28407, 'loss/train': 2.1632323265075684} 11/07/2021 01:12:18 - INFO - __main__ - Step 28409: {'lr': 0.00046173911013539437, 'samples': 5454528, 'steps': 28408, 'loss/train': 1.411903977394104} 11/07/2021 01:12:18 - INFO - __main__ - Step 28410: {'lr': 0.0004617362886918531, 'samples': 5454720, 'steps': 28409, 'loss/train': 1.8714888095855713} 11/07/2021 01:12:19 - INFO - __main__ - Step 28411: {'lr': 0.0004617334671529069, 'samples': 5454912, 'steps': 28410, 'loss/train': 1.2680634260177612} 11/07/2021 01:12:19 - INFO - __main__ - Step 28412: {'lr': 0.0004617306455185567, 'samples': 5455104, 'steps': 28411, 'loss/train': 2.167773962020874} 11/07/2021 01:12:20 - INFO - __main__ - Step 28413: {'lr': 0.00046172782378880404, 'samples': 5455296, 'steps': 28412, 'loss/train': 1.7435532808303833} 11/07/2021 01:12:20 - INFO - __main__ - Step 28414: {'lr': 0.00046172500196364996, 'samples': 5455488, 'steps': 28413, 'loss/train': 0.7721033692359924} 11/07/2021 01:12:21 - INFO - __main__ - Step 28415: {'lr': 0.000461722180043096, 'samples': 5455680, 'steps': 28414, 'loss/train': 1.5514968633651733} 11/07/2021 01:12:21 - INFO - __main__ - Step 28416: {'lr': 0.0004617193580271433, 'samples': 5455872, 'steps': 28415, 'loss/train': 3.148294687271118} 11/07/2021 01:12:21 - INFO - __main__ - Step 28417: {'lr': 0.000461716535915793, 'samples': 5456064, 'steps': 28416, 'loss/train': 1.618602991104126} 11/07/2021 01:12:22 - INFO - __main__ - Step 28418: {'lr': 0.0004617137137090466, 'samples': 5456256, 'steps': 28417, 'loss/train': 1.428109884262085} 11/07/2021 01:12:23 - INFO - __main__ - Step 28419: {'lr': 0.0004617108914069052, 'samples': 5456448, 'steps': 28418, 'loss/train': 1.310810923576355} 11/07/2021 01:12:23 - INFO - __main__ - Step 28420: {'lr': 0.0004617080690093701, 'samples': 5456640, 'steps': 28419, 'loss/train': 1.2667416334152222} 11/07/2021 01:12:23 - INFO - __main__ - Step 28421: {'lr': 0.00046170524651644276, 'samples': 5456832, 'steps': 28420, 'loss/train': 1.5690606832504272} 11/07/2021 01:12:24 - INFO - __main__ - Step 28422: {'lr': 0.00046170242392812425, 'samples': 5457024, 'steps': 28421, 'loss/train': 1.829864501953125} 11/07/2021 01:12:25 - INFO - __main__ - Step 28423: {'lr': 0.0004616996012444158, 'samples': 5457216, 'steps': 28422, 'loss/train': 1.873397707939148} 11/07/2021 01:12:25 - INFO - __main__ - Step 28424: {'lr': 0.00046169677846531884, 'samples': 5457408, 'steps': 28423, 'loss/train': 1.3015440702438354} 11/07/2021 01:12:26 - INFO - __main__ - Step 28425: {'lr': 0.0004616939555908346, 'samples': 5457600, 'steps': 28424, 'loss/train': 0.9528545141220093} 11/07/2021 01:12:26 - INFO - __main__ - Step 28426: {'lr': 0.0004616911326209643, 'samples': 5457792, 'steps': 28425, 'loss/train': 2.169158935546875} 11/07/2021 01:12:26 - INFO - __main__ - Step 28427: {'lr': 0.0004616883095557092, 'samples': 5457984, 'steps': 28426, 'loss/train': 1.5702557563781738} 11/07/2021 01:12:27 - INFO - __main__ - Step 28428: {'lr': 0.0004616854863950707, 'samples': 5458176, 'steps': 28427, 'loss/train': 1.5576798915863037} 11/07/2021 01:12:28 - INFO - __main__ - Step 28429: {'lr': 0.00046168266313904995, 'samples': 5458368, 'steps': 28428, 'loss/train': 1.4445343017578125} 11/07/2021 01:12:28 - INFO - __main__ - Step 28430: {'lr': 0.00046167983978764827, 'samples': 5458560, 'steps': 28429, 'loss/train': 5.771895408630371} 11/07/2021 01:12:28 - INFO - __main__ - Step 28431: {'lr': 0.0004616770163408669, 'samples': 5458752, 'steps': 28430, 'loss/train': 1.5220882892608643} 11/07/2021 01:12:29 - INFO - __main__ - Step 28432: {'lr': 0.00046167419279870715, 'samples': 5458944, 'steps': 28431, 'loss/train': 2.1063215732574463} 11/07/2021 01:12:29 - INFO - __main__ - Step 28433: {'lr': 0.00046167136916117025, 'samples': 5459136, 'steps': 28432, 'loss/train': 1.3773339986801147} 11/07/2021 01:12:30 - INFO - __main__ - Step 28434: {'lr': 0.00046166854542825756, 'samples': 5459328, 'steps': 28433, 'loss/train': 1.733251929283142} 11/07/2021 01:12:31 - INFO - __main__ - Step 28435: {'lr': 0.0004616657215999702, 'samples': 5459520, 'steps': 28434, 'loss/train': 1.6719069480895996} 11/07/2021 01:12:31 - INFO - __main__ - Step 28436: {'lr': 0.0004616628976763096, 'samples': 5459712, 'steps': 28435, 'loss/train': 1.5270695686340332} 11/07/2021 01:12:31 - INFO - __main__ - Step 28437: {'lr': 0.0004616600736572769, 'samples': 5459904, 'steps': 28436, 'loss/train': 1.066251516342163} 11/07/2021 01:12:32 - INFO - __main__ - Step 28438: {'lr': 0.0004616572495428735, 'samples': 5460096, 'steps': 28437, 'loss/train': 1.0775712728500366} 11/07/2021 01:12:32 - INFO - __main__ - Step 28439: {'lr': 0.0004616544253331006, 'samples': 5460288, 'steps': 28438, 'loss/train': 1.4521342515945435} 11/07/2021 01:12:33 - INFO - __main__ - Step 28440: {'lr': 0.00046165160102795943, 'samples': 5460480, 'steps': 28439, 'loss/train': 0.8910706639289856} 11/07/2021 01:12:33 - INFO - __main__ - Step 28441: {'lr': 0.0004616487766274514, 'samples': 5460672, 'steps': 28440, 'loss/train': 1.8507764339447021} 11/07/2021 01:12:34 - INFO - __main__ - Step 28442: {'lr': 0.0004616459521315777, 'samples': 5460864, 'steps': 28441, 'loss/train': 1.7701815366744995} 11/07/2021 01:12:34 - INFO - __main__ - Step 28443: {'lr': 0.0004616431275403395, 'samples': 5461056, 'steps': 28442, 'loss/train': 1.6071275472640991} 11/07/2021 01:12:34 - INFO - __main__ - Step 28444: {'lr': 0.0004616403028537382, 'samples': 5461248, 'steps': 28443, 'loss/train': 1.9984899759292603} 11/07/2021 01:12:36 - INFO - __main__ - Step 28445: {'lr': 0.0004616374780717751, 'samples': 5461440, 'steps': 28444, 'loss/train': 1.3097763061523438} 11/07/2021 01:12:36 - INFO - __main__ - Step 28446: {'lr': 0.0004616346531944514, 'samples': 5461632, 'steps': 28445, 'loss/train': 1.804141640663147} 11/07/2021 01:12:36 - INFO - __main__ - Step 28447: {'lr': 0.00046163182822176835, 'samples': 5461824, 'steps': 28446, 'loss/train': 1.2777339220046997} 11/07/2021 01:12:37 - INFO - __main__ - Step 28448: {'lr': 0.0004616290031537273, 'samples': 5462016, 'steps': 28447, 'loss/train': 1.434464454650879} 11/07/2021 01:12:37 - INFO - __main__ - Step 28449: {'lr': 0.0004616261779903295, 'samples': 5462208, 'steps': 28448, 'loss/train': 2.0488641262054443} 11/07/2021 01:12:38 - INFO - __main__ - Step 28450: {'lr': 0.0004616233527315762, 'samples': 5462400, 'steps': 28449, 'loss/train': 2.1894938945770264} 11/07/2021 01:12:38 - INFO - __main__ - Step 28451: {'lr': 0.0004616205273774686, 'samples': 5462592, 'steps': 28450, 'loss/train': 1.64618718624115} 11/07/2021 01:12:39 - INFO - __main__ - Step 28452: {'lr': 0.00046161770192800817, 'samples': 5462784, 'steps': 28451, 'loss/train': 1.3208832740783691} 11/07/2021 01:12:39 - INFO - __main__ - Step 28453: {'lr': 0.000461614876383196, 'samples': 5462976, 'steps': 28452, 'loss/train': 1.8666967153549194} 11/07/2021 01:12:40 - INFO - __main__ - Step 28454: {'lr': 0.0004616120507430335, 'samples': 5463168, 'steps': 28453, 'loss/train': 1.324458360671997} 11/07/2021 01:12:40 - INFO - __main__ - Step 28455: {'lr': 0.00046160922500752176, 'samples': 5463360, 'steps': 28454, 'loss/train': 1.936042308807373} 11/07/2021 01:12:41 - INFO - __main__ - Step 28456: {'lr': 0.0004616063991766623, 'samples': 5463552, 'steps': 28455, 'loss/train': 1.9124518632888794} 11/07/2021 01:12:41 - INFO - __main__ - Step 28457: {'lr': 0.0004616035732504562, 'samples': 5463744, 'steps': 28456, 'loss/train': 1.5295157432556152} 11/07/2021 01:12:42 - INFO - __main__ - Step 28458: {'lr': 0.0004616007472289048, 'samples': 5463936, 'steps': 28457, 'loss/train': 1.5819324254989624} 11/07/2021 01:12:42 - INFO - __main__ - Step 28459: {'lr': 0.00046159792111200937, 'samples': 5464128, 'steps': 28458, 'loss/train': 1.3726308345794678} 11/07/2021 01:12:43 - INFO - __main__ - Step 28460: {'lr': 0.0004615950948997711, 'samples': 5464320, 'steps': 28459, 'loss/train': 1.8839472532272339} 11/07/2021 01:12:43 - INFO - __main__ - Step 28461: {'lr': 0.0004615922685921915, 'samples': 5464512, 'steps': 28460, 'loss/train': 1.636143445968628} 11/07/2021 01:12:44 - INFO - __main__ - Step 28462: {'lr': 0.0004615894421892716, 'samples': 5464704, 'steps': 28461, 'loss/train': 1.6835225820541382} 11/07/2021 01:12:44 - INFO - __main__ - Step 28463: {'lr': 0.0004615866156910128, 'samples': 5464896, 'steps': 28462, 'loss/train': 1.6727573871612549} 11/07/2021 01:12:44 - INFO - __main__ - Step 28464: {'lr': 0.00046158378909741626, 'samples': 5465088, 'steps': 28463, 'loss/train': 1.4402834177017212} 11/07/2021 01:12:45 - INFO - __main__ - Step 28465: {'lr': 0.00046158096240848343, 'samples': 5465280, 'steps': 28464, 'loss/train': 1.9877482652664185} 11/07/2021 01:12:46 - INFO - __main__ - Step 28466: {'lr': 0.00046157813562421545, 'samples': 5465472, 'steps': 28465, 'loss/train': 1.5668485164642334} 11/07/2021 01:12:46 - INFO - __main__ - Step 28467: {'lr': 0.0004615753087446136, 'samples': 5465664, 'steps': 28466, 'loss/train': 1.9259995222091675} 11/07/2021 01:12:47 - INFO - __main__ - Step 28468: {'lr': 0.00046157248176967915, 'samples': 5465856, 'steps': 28467, 'loss/train': 1.5455939769744873} 11/07/2021 01:12:47 - INFO - __main__ - Step 28469: {'lr': 0.0004615696546994135, 'samples': 5466048, 'steps': 28468, 'loss/train': 1.7399810552597046} 11/07/2021 01:12:48 - INFO - __main__ - Step 28470: {'lr': 0.00046156682753381774, 'samples': 5466240, 'steps': 28469, 'loss/train': 1.6447303295135498} 11/07/2021 01:12:48 - INFO - __main__ - Step 28471: {'lr': 0.0004615640002728932, 'samples': 5466432, 'steps': 28470, 'loss/train': 1.3740919828414917} 11/07/2021 01:12:49 - INFO - __main__ - Step 28472: {'lr': 0.00046156117291664133, 'samples': 5466624, 'steps': 28471, 'loss/train': 1.115888237953186} 11/07/2021 01:12:49 - INFO - __main__ - Step 28473: {'lr': 0.0004615583454650632, 'samples': 5466816, 'steps': 28472, 'loss/train': 1.3618167638778687} 11/07/2021 01:12:49 - INFO - __main__ - Step 28474: {'lr': 0.00046155551791816007, 'samples': 5467008, 'steps': 28473, 'loss/train': 1.1851608753204346} 11/07/2021 01:12:50 - INFO - __main__ - Step 28475: {'lr': 0.00046155269027593337, 'samples': 5467200, 'steps': 28474, 'loss/train': 1.7829506397247314} 11/07/2021 01:12:51 - INFO - __main__ - Step 28476: {'lr': 0.00046154986253838426, 'samples': 5467392, 'steps': 28475, 'loss/train': 1.390133023262024} 11/07/2021 01:12:51 - INFO - __main__ - Step 28477: {'lr': 0.00046154703470551405, 'samples': 5467584, 'steps': 28476, 'loss/train': 2.0066704750061035} 11/07/2021 01:12:51 - INFO - __main__ - Step 28478: {'lr': 0.000461544206777324, 'samples': 5467776, 'steps': 28477, 'loss/train': 1.3123130798339844} 11/07/2021 01:12:52 - INFO - __main__ - Step 28479: {'lr': 0.00046154137875381547, 'samples': 5467968, 'steps': 28478, 'loss/train': 1.2928277254104614} 11/07/2021 01:12:52 - INFO - __main__ - Step 28480: {'lr': 0.00046153855063498964, 'samples': 5468160, 'steps': 28479, 'loss/train': 1.7671639919281006} 11/07/2021 01:12:53 - INFO - __main__ - Step 28481: {'lr': 0.00046153572242084776, 'samples': 5468352, 'steps': 28480, 'loss/train': 0.9786882400512695} 11/07/2021 01:12:53 - INFO - __main__ - Step 28482: {'lr': 0.0004615328941113911, 'samples': 5468544, 'steps': 28481, 'loss/train': 1.719839334487915} 11/07/2021 01:12:54 - INFO - __main__ - Step 28483: {'lr': 0.00046153006570662106, 'samples': 5468736, 'steps': 28482, 'loss/train': 1.1868432760238647} 11/07/2021 01:12:54 - INFO - __main__ - Step 28484: {'lr': 0.0004615272372065388, 'samples': 5468928, 'steps': 28483, 'loss/train': 1.8249025344848633} 11/07/2021 01:12:55 - INFO - __main__ - Step 28485: {'lr': 0.0004615244086111456, 'samples': 5469120, 'steps': 28484, 'loss/train': 1.76587975025177} 11/07/2021 01:12:56 - INFO - __main__ - Step 28486: {'lr': 0.00046152157992044283, 'samples': 5469312, 'steps': 28485, 'loss/train': 1.6965020895004272} 11/07/2021 01:12:56 - INFO - __main__ - Step 28487: {'lr': 0.0004615187511344316, 'samples': 5469504, 'steps': 28486, 'loss/train': 1.5689350366592407} 11/07/2021 01:12:56 - INFO - __main__ - Step 28488: {'lr': 0.00046151592225311347, 'samples': 5469696, 'steps': 28487, 'loss/train': 1.4092538356781006} 11/07/2021 01:12:57 - INFO - __main__ - Step 28489: {'lr': 0.0004615130932764894, 'samples': 5469888, 'steps': 28488, 'loss/train': 1.659226655960083} 11/07/2021 01:12:57 - INFO - __main__ - Step 28490: {'lr': 0.0004615102642045608, 'samples': 5470080, 'steps': 28489, 'loss/train': 1.4775367975234985} 11/07/2021 01:12:58 - INFO - __main__ - Step 28491: {'lr': 0.00046150743503732897, 'samples': 5470272, 'steps': 28490, 'loss/train': 1.2291374206542969} 11/07/2021 01:12:58 - INFO - __main__ - Step 28492: {'lr': 0.0004615046057747951, 'samples': 5470464, 'steps': 28491, 'loss/train': 1.5803325176239014} 11/07/2021 01:12:59 - INFO - __main__ - Step 28493: {'lr': 0.0004615017764169606, 'samples': 5470656, 'steps': 28492, 'loss/train': 1.3385908603668213} 11/07/2021 01:12:59 - INFO - __main__ - Step 28494: {'lr': 0.00046149894696382655, 'samples': 5470848, 'steps': 28493, 'loss/train': 1.8293430805206299} 11/07/2021 01:12:59 - INFO - __main__ - Step 28495: {'lr': 0.00046149611741539445, 'samples': 5471040, 'steps': 28494, 'loss/train': 2.066129684448242} 11/07/2021 01:13:00 - INFO - __main__ - Step 28496: {'lr': 0.00046149328777166543, 'samples': 5471232, 'steps': 28495, 'loss/train': 1.5534108877182007} 11/07/2021 01:13:01 - INFO - __main__ - Step 28497: {'lr': 0.0004614904580326408, 'samples': 5471424, 'steps': 28496, 'loss/train': 1.92973792552948} 11/07/2021 01:13:01 - INFO - __main__ - Step 28498: {'lr': 0.0004614876281983218, 'samples': 5471616, 'steps': 28497, 'loss/train': 1.3637464046478271} 11/07/2021 01:13:01 - INFO - __main__ - Step 28499: {'lr': 0.0004614847982687097, 'samples': 5471808, 'steps': 28498, 'loss/train': 1.5086700916290283} 11/07/2021 01:13:02 - INFO - __main__ - Step 28500: {'lr': 0.0004614819682438059, 'samples': 5472000, 'steps': 28499, 'loss/train': 1.41326105594635} 11/07/2021 01:13:03 - INFO - __main__ - Step 28501: {'lr': 0.00046147913812361155, 'samples': 5472192, 'steps': 28500, 'loss/train': 1.4224140644073486} 11/07/2021 01:13:03 - INFO - __main__ - Step 28502: {'lr': 0.000461476307908128, 'samples': 5472384, 'steps': 28501, 'loss/train': 1.9110409021377563} 11/07/2021 01:13:04 - INFO - __main__ - Step 28503: {'lr': 0.00046147347759735647, 'samples': 5472576, 'steps': 28502, 'loss/train': 2.035921573638916} 11/07/2021 01:13:04 - INFO - __main__ - Step 28504: {'lr': 0.00046147064719129823, 'samples': 5472768, 'steps': 28503, 'loss/train': 1.4053170680999756} 11/07/2021 01:13:04 - INFO - __main__ - Step 28505: {'lr': 0.00046146781668995456, 'samples': 5472960, 'steps': 28504, 'loss/train': 1.5621429681777954} 11/07/2021 01:13:05 - INFO - __main__ - Step 28506: {'lr': 0.0004614649860933268, 'samples': 5473152, 'steps': 28505, 'loss/train': 1.4961036443710327} 11/07/2021 01:13:06 - INFO - __main__ - Step 28507: {'lr': 0.0004614621554014162, 'samples': 5473344, 'steps': 28506, 'loss/train': 1.6253371238708496} 11/07/2021 01:13:06 - INFO - __main__ - Step 28508: {'lr': 0.00046145932461422396, 'samples': 5473536, 'steps': 28507, 'loss/train': 1.7970013618469238} 11/07/2021 01:13:06 - INFO - __main__ - Step 28509: {'lr': 0.00046145649373175145, 'samples': 5473728, 'steps': 28508, 'loss/train': 1.8721892833709717} 11/07/2021 01:13:07 - INFO - __main__ - Step 28510: {'lr': 0.0004614536627539999, 'samples': 5473920, 'steps': 28509, 'loss/train': 1.5840485095977783} 11/07/2021 01:13:08 - INFO - __main__ - Step 28511: {'lr': 0.0004614508316809706, 'samples': 5474112, 'steps': 28510, 'loss/train': 1.2819918394088745} 11/07/2021 01:13:08 - INFO - __main__ - Step 28512: {'lr': 0.00046144800051266477, 'samples': 5474304, 'steps': 28511, 'loss/train': 1.383216142654419} 11/07/2021 01:13:09 - INFO - __main__ - Step 28513: {'lr': 0.00046144516924908377, 'samples': 5474496, 'steps': 28512, 'loss/train': 1.5542588233947754} 11/07/2021 01:13:09 - INFO - __main__ - Step 28514: {'lr': 0.0004614423378902289, 'samples': 5474688, 'steps': 28513, 'loss/train': 1.4837170839309692} 11/07/2021 01:13:09 - INFO - __main__ - Step 28515: {'lr': 0.0004614395064361013, 'samples': 5474880, 'steps': 28514, 'loss/train': 1.7974506616592407} 11/07/2021 01:13:11 - INFO - __main__ - Step 28516: {'lr': 0.00046143667488670226, 'samples': 5475072, 'steps': 28515, 'loss/train': 1.2926692962646484} 11/07/2021 01:13:11 - INFO - __main__ - Step 28517: {'lr': 0.00046143384324203325, 'samples': 5475264, 'steps': 28516, 'loss/train': 1.5892486572265625} 11/07/2021 01:13:12 - INFO - __main__ - Step 28518: {'lr': 0.00046143101150209533, 'samples': 5475456, 'steps': 28517, 'loss/train': 1.1255959272384644} 11/07/2021 01:13:12 - INFO - __main__ - Step 28519: {'lr': 0.0004614281796668899, 'samples': 5475648, 'steps': 28518, 'loss/train': 0.5575433373451233} 11/07/2021 01:13:12 - INFO - __main__ - Step 28520: {'lr': 0.0004614253477364182, 'samples': 5475840, 'steps': 28519, 'loss/train': 1.642098069190979} 11/07/2021 01:13:13 - INFO - __main__ - Step 28521: {'lr': 0.0004614225157106815, 'samples': 5476032, 'steps': 28520, 'loss/train': 1.7778156995773315} 11/07/2021 01:13:14 - INFO - __main__ - Step 28522: {'lr': 0.00046141968358968103, 'samples': 5476224, 'steps': 28521, 'loss/train': 1.1304028034210205} 11/07/2021 01:13:14 - INFO - __main__ - Step 28523: {'lr': 0.00046141685137341814, 'samples': 5476416, 'steps': 28522, 'loss/train': 1.6093932390213013} 11/07/2021 01:13:14 - INFO - __main__ - Step 28524: {'lr': 0.00046141401906189404, 'samples': 5476608, 'steps': 28523, 'loss/train': 1.4824190139770508} 11/07/2021 01:13:15 - INFO - __main__ - Step 28525: {'lr': 0.0004614111866551101, 'samples': 5476800, 'steps': 28524, 'loss/train': 1.7257108688354492} 11/07/2021 01:13:15 - INFO - __main__ - Step 28526: {'lr': 0.0004614083541530675, 'samples': 5476992, 'steps': 28525, 'loss/train': 1.8572736978530884} 11/07/2021 01:13:16 - INFO - __main__ - Step 28527: {'lr': 0.00046140552155576767, 'samples': 5477184, 'steps': 28526, 'loss/train': 1.5072067975997925} 11/07/2021 01:13:17 - INFO - __main__ - Step 28528: {'lr': 0.0004614026888632116, 'samples': 5477376, 'steps': 28527, 'loss/train': 1.8376927375793457} 11/07/2021 01:13:17 - INFO - __main__ - Step 28529: {'lr': 0.00046139985607540087, 'samples': 5477568, 'steps': 28528, 'loss/train': 1.7291768789291382} 11/07/2021 01:13:17 - INFO - __main__ - Step 28530: {'lr': 0.00046139702319233656, 'samples': 5477760, 'steps': 28529, 'loss/train': 1.209162712097168} 11/07/2021 01:13:18 - INFO - __main__ - Step 28531: {'lr': 0.00046139419021402005, 'samples': 5477952, 'steps': 28530, 'loss/train': 0.9393748641014099} 11/07/2021 01:13:18 - INFO - __main__ - Step 28532: {'lr': 0.00046139135714045253, 'samples': 5478144, 'steps': 28531, 'loss/train': 2.0351850986480713} 11/07/2021 01:13:19 - INFO - __main__ - Step 28533: {'lr': 0.00046138852397163547, 'samples': 5478336, 'steps': 28532, 'loss/train': 1.5962907075881958} 11/07/2021 01:13:19 - INFO - __main__ - Step 28534: {'lr': 0.00046138569070756984, 'samples': 5478528, 'steps': 28533, 'loss/train': 1.830586552619934} 11/07/2021 01:13:20 - INFO - __main__ - Step 28535: {'lr': 0.00046138285734825715, 'samples': 5478720, 'steps': 28534, 'loss/train': 1.1147873401641846} 11/07/2021 01:13:20 - INFO - __main__ - Step 28536: {'lr': 0.0004613800238936986, 'samples': 5478912, 'steps': 28535, 'loss/train': 1.8073807954788208} 11/07/2021 01:13:20 - INFO - __main__ - Step 28537: {'lr': 0.0004613771903438955, 'samples': 5479104, 'steps': 28536, 'loss/train': 1.6306997537612915} 11/07/2021 01:13:22 - INFO - __main__ - Step 28538: {'lr': 0.00046137435669884897, 'samples': 5479296, 'steps': 28537, 'loss/train': 1.6193535327911377} 11/07/2021 01:13:23 - INFO - __main__ - Step 28539: {'lr': 0.00046137152295856054, 'samples': 5479488, 'steps': 28538, 'loss/train': 0.4254659116268158} 11/07/2021 01:13:23 - INFO - __main__ - Step 28540: {'lr': 0.0004613686891230313, 'samples': 5479680, 'steps': 28539, 'loss/train': 1.918066382408142} 11/07/2021 01:13:23 - INFO - __main__ - Step 28541: {'lr': 0.0004613658551922627, 'samples': 5479872, 'steps': 28540, 'loss/train': 1.661318063735962} 11/07/2021 01:13:24 - INFO - __main__ - Step 28542: {'lr': 0.0004613630211662558, 'samples': 5480064, 'steps': 28541, 'loss/train': 1.55586576461792} 11/07/2021 01:13:24 - INFO - __main__ - Step 28543: {'lr': 0.00046136018704501203, 'samples': 5480256, 'steps': 28542, 'loss/train': 1.6200159788131714} 11/07/2021 01:13:25 - INFO - __main__ - Step 28544: {'lr': 0.00046135735282853263, 'samples': 5480448, 'steps': 28543, 'loss/train': 1.8351638317108154} 11/07/2021 01:13:25 - INFO - __main__ - Step 28545: {'lr': 0.0004613545185168188, 'samples': 5480640, 'steps': 28544, 'loss/train': 1.0412639379501343} 11/07/2021 01:13:26 - INFO - __main__ - Step 28546: {'lr': 0.0004613516841098719, 'samples': 5480832, 'steps': 28545, 'loss/train': 1.9801928997039795} 11/07/2021 01:13:26 - INFO - __main__ - Step 28547: {'lr': 0.0004613488496076933, 'samples': 5481024, 'steps': 28546, 'loss/train': 1.2984365224838257} 11/07/2021 01:13:26 - INFO - __main__ - Step 28548: {'lr': 0.00046134601501028404, 'samples': 5481216, 'steps': 28547, 'loss/train': 1.278522253036499} 11/07/2021 01:13:28 - INFO - __main__ - Step 28549: {'lr': 0.0004613431803176456, 'samples': 5481408, 'steps': 28548, 'loss/train': 1.4743411540985107} 11/07/2021 01:13:28 - INFO - __main__ - Step 28550: {'lr': 0.00046134034552977924, 'samples': 5481600, 'steps': 28549, 'loss/train': 1.5944483280181885} 11/07/2021 01:13:28 - INFO - __main__ - Step 28551: {'lr': 0.00046133751064668605, 'samples': 5481792, 'steps': 28550, 'loss/train': 1.5976324081420898} 11/07/2021 01:13:29 - INFO - __main__ - Step 28552: {'lr': 0.0004613346756683675, 'samples': 5481984, 'steps': 28551, 'loss/train': 1.5659213066101074} 11/07/2021 01:13:29 - INFO - __main__ - Step 28553: {'lr': 0.0004613318405948248, 'samples': 5482176, 'steps': 28552, 'loss/train': 1.5147918462753296} 11/07/2021 01:13:29 - INFO - __main__ - Step 28554: {'lr': 0.00046132900542605925, 'samples': 5482368, 'steps': 28553, 'loss/train': 2.00808048248291} 11/07/2021 01:13:31 - INFO - __main__ - Step 28555: {'lr': 0.0004613261701620721, 'samples': 5482560, 'steps': 28554, 'loss/train': 2.5371766090393066} 11/07/2021 01:13:31 - INFO - __main__ - Step 28556: {'lr': 0.0004613233348028646, 'samples': 5482752, 'steps': 28555, 'loss/train': 1.2087116241455078} 11/07/2021 01:13:31 - INFO - __main__ - Step 28557: {'lr': 0.0004613204993484381, 'samples': 5482944, 'steps': 28556, 'loss/train': 1.5834543704986572} 11/07/2021 01:13:32 - INFO - __main__ - Step 28558: {'lr': 0.00046131766379879386, 'samples': 5483136, 'steps': 28557, 'loss/train': 1.4561253786087036} 11/07/2021 01:13:32 - INFO - __main__ - Step 28559: {'lr': 0.0004613148281539331, 'samples': 5483328, 'steps': 28558, 'loss/train': 1.8825464248657227} 11/07/2021 01:13:33 - INFO - __main__ - Step 28560: {'lr': 0.00046131199241385726, 'samples': 5483520, 'steps': 28559, 'loss/train': 1.786787986755371} 11/07/2021 01:13:33 - INFO - __main__ - Step 28561: {'lr': 0.0004613091565785673, 'samples': 5483712, 'steps': 28560, 'loss/train': 1.2102888822555542} 11/07/2021 01:13:34 - INFO - __main__ - Step 28562: {'lr': 0.0004613063206480649, 'samples': 5483904, 'steps': 28561, 'loss/train': 2.037170886993408} 11/07/2021 01:13:34 - INFO - __main__ - Step 28563: {'lr': 0.000461303484622351, 'samples': 5484096, 'steps': 28562, 'loss/train': 1.8780871629714966} 11/07/2021 01:13:34 - INFO - __main__ - Step 28564: {'lr': 0.00046130064850142703, 'samples': 5484288, 'steps': 28563, 'loss/train': 1.3572795391082764} 11/07/2021 01:13:35 - INFO - __main__ - Step 28565: {'lr': 0.0004612978122852942, 'samples': 5484480, 'steps': 28564, 'loss/train': 1.3927767276763916} 11/07/2021 01:13:36 - INFO - __main__ - Step 28566: {'lr': 0.000461294975973954, 'samples': 5484672, 'steps': 28565, 'loss/train': 2.1032114028930664} 11/07/2021 01:13:36 - INFO - __main__ - Step 28567: {'lr': 0.0004612921395674074, 'samples': 5484864, 'steps': 28566, 'loss/train': 1.9771486520767212} 11/07/2021 01:13:36 - INFO - __main__ - Step 28568: {'lr': 0.0004612893030656559, 'samples': 5485056, 'steps': 28567, 'loss/train': 1.6528112888336182} 11/07/2021 01:13:37 - INFO - __main__ - Step 28569: {'lr': 0.0004612864664687007, 'samples': 5485248, 'steps': 28568, 'loss/train': 1.4817839860916138} 11/07/2021 01:13:38 - INFO - __main__ - Step 28570: {'lr': 0.0004612836297765429, 'samples': 5485440, 'steps': 28569, 'loss/train': 1.3428547382354736} 11/07/2021 01:13:38 - INFO - __main__ - Step 28571: {'lr': 0.00046128079298918414, 'samples': 5485632, 'steps': 28570, 'loss/train': 1.592185616493225} 11/07/2021 01:13:39 - INFO - __main__ - Step 28572: {'lr': 0.00046127795610662547, 'samples': 5485824, 'steps': 28571, 'loss/train': 2.0489470958709717} 11/07/2021 01:13:39 - INFO - __main__ - Step 28573: {'lr': 0.0004612751191288682, 'samples': 5486016, 'steps': 28572, 'loss/train': 1.5951439142227173} 11/07/2021 01:13:39 - INFO - __main__ - Step 28574: {'lr': 0.00046127228205591366, 'samples': 5486208, 'steps': 28573, 'loss/train': 1.285809874534607} 11/07/2021 01:13:40 - INFO - __main__ - Step 28575: {'lr': 0.0004612694448877631, 'samples': 5486400, 'steps': 28574, 'loss/train': 1.5925551652908325} 11/07/2021 01:13:41 - INFO - __main__ - Step 28576: {'lr': 0.00046126660762441774, 'samples': 5486592, 'steps': 28575, 'loss/train': 0.4271937906742096} 11/07/2021 01:13:41 - INFO - __main__ - Step 28577: {'lr': 0.00046126377026587897, 'samples': 5486784, 'steps': 28576, 'loss/train': 1.7694735527038574} 11/07/2021 01:13:41 - INFO - __main__ - Step 28578: {'lr': 0.0004612609328121479, 'samples': 5486976, 'steps': 28577, 'loss/train': 1.6860110759735107} 11/07/2021 01:13:42 - INFO - __main__ - Step 28579: {'lr': 0.000461258095263226, 'samples': 5487168, 'steps': 28578, 'loss/train': 1.9026232957839966} 11/07/2021 01:13:43 - INFO - __main__ - Step 28580: {'lr': 0.00046125525761911445, 'samples': 5487360, 'steps': 28579, 'loss/train': 1.164626955986023} 11/07/2021 01:13:43 - INFO - __main__ - Step 28581: {'lr': 0.00046125241987981445, 'samples': 5487552, 'steps': 28580, 'loss/train': 1.4468891620635986} 11/07/2021 01:13:43 - INFO - __main__ - Step 28582: {'lr': 0.0004612495820453275, 'samples': 5487744, 'steps': 28581, 'loss/train': 1.3925243616104126} 11/07/2021 01:13:44 - INFO - __main__ - Step 28583: {'lr': 0.0004612467441156547, 'samples': 5487936, 'steps': 28582, 'loss/train': 1.0571708679199219} 11/07/2021 01:13:44 - INFO - __main__ - Step 28584: {'lr': 0.00046124390609079735, 'samples': 5488128, 'steps': 28583, 'loss/train': 1.5288128852844238} 11/07/2021 01:13:45 - INFO - __main__ - Step 28585: {'lr': 0.00046124106797075683, 'samples': 5488320, 'steps': 28584, 'loss/train': 1.6580777168273926} 11/07/2021 01:13:45 - INFO - __main__ - Step 28586: {'lr': 0.00046123822975553425, 'samples': 5488512, 'steps': 28585, 'loss/train': 1.5447169542312622} 11/07/2021 01:13:46 - INFO - __main__ - Step 28587: {'lr': 0.00046123539144513103, 'samples': 5488704, 'steps': 28586, 'loss/train': 1.0325883626937866} 11/07/2021 01:13:46 - INFO - __main__ - Step 28588: {'lr': 0.00046123255303954835, 'samples': 5488896, 'steps': 28587, 'loss/train': 0.9563302397727966} 11/07/2021 01:13:46 - INFO - __main__ - Step 28589: {'lr': 0.0004612297145387876, 'samples': 5489088, 'steps': 28588, 'loss/train': 2.1595003604888916} 11/07/2021 01:13:47 - INFO - __main__ - Step 28590: {'lr': 0.00046122687594285, 'samples': 5489280, 'steps': 28589, 'loss/train': 1.2853444814682007} 11/07/2021 01:13:48 - INFO - __main__ - Step 28591: {'lr': 0.0004612240372517368, 'samples': 5489472, 'steps': 28590, 'loss/train': 1.3485485315322876} 11/07/2021 01:13:48 - INFO - __main__ - Step 28592: {'lr': 0.00046122119846544936, 'samples': 5489664, 'steps': 28591, 'loss/train': 1.4400455951690674} 11/07/2021 01:13:48 - INFO - __main__ - Step 28593: {'lr': 0.00046121835958398883, 'samples': 5489856, 'steps': 28592, 'loss/train': 1.4383652210235596} 11/07/2021 01:13:49 - INFO - __main__ - Step 28594: {'lr': 0.0004612155206073566, 'samples': 5490048, 'steps': 28593, 'loss/train': 1.3600523471832275} 11/07/2021 01:13:49 - INFO - __main__ - Step 28595: {'lr': 0.000461212681535554, 'samples': 5490240, 'steps': 28594, 'loss/train': 1.7095885276794434} 11/07/2021 01:13:50 - INFO - __main__ - Step 28596: {'lr': 0.0004612098423685821, 'samples': 5490432, 'steps': 28595, 'loss/train': 1.767296314239502} 11/07/2021 01:13:51 - INFO - __main__ - Step 28597: {'lr': 0.0004612070031064424, 'samples': 5490624, 'steps': 28596, 'loss/train': 1.2135729789733887} 11/07/2021 01:13:51 - INFO - __main__ - Step 28598: {'lr': 0.000461204163749136, 'samples': 5490816, 'steps': 28597, 'loss/train': 1.7606945037841797} 11/07/2021 01:13:51 - INFO - __main__ - Step 28599: {'lr': 0.0004612013242966643, 'samples': 5491008, 'steps': 28598, 'loss/train': 1.0029528141021729} 11/07/2021 01:13:52 - INFO - __main__ - Step 28600: {'lr': 0.0004611984847490285, 'samples': 5491200, 'steps': 28599, 'loss/train': 1.3767914772033691} 11/07/2021 01:13:53 - INFO - __main__ - Step 28601: {'lr': 0.00046119564510623, 'samples': 5491392, 'steps': 28600, 'loss/train': 1.7137017250061035} 11/07/2021 01:13:53 - INFO - __main__ - Step 28602: {'lr': 0.00046119280536827, 'samples': 5491584, 'steps': 28601, 'loss/train': 1.4423989057540894} 11/07/2021 01:13:54 - INFO - __main__ - Step 28603: {'lr': 0.0004611899655351497, 'samples': 5491776, 'steps': 28602, 'loss/train': 1.636980652809143} 11/07/2021 01:13:54 - INFO - __main__ - Step 28604: {'lr': 0.0004611871256068705, 'samples': 5491968, 'steps': 28603, 'loss/train': 1.7325501441955566} 11/07/2021 01:13:54 - INFO - __main__ - Step 28605: {'lr': 0.0004611842855834336, 'samples': 5492160, 'steps': 28604, 'loss/train': 1.6320688724517822} 11/07/2021 01:13:55 - INFO - __main__ - Step 28606: {'lr': 0.00046118144546484043, 'samples': 5492352, 'steps': 28605, 'loss/train': 1.2296459674835205} 11/07/2021 01:13:56 - INFO - __main__ - Step 28607: {'lr': 0.0004611786052510921, 'samples': 5492544, 'steps': 28606, 'loss/train': 0.447788268327713} 11/07/2021 01:13:56 - INFO - __main__ - Step 28608: {'lr': 0.0004611757649421899, 'samples': 5492736, 'steps': 28607, 'loss/train': 1.6928179264068604} 11/07/2021 01:13:57 - INFO - __main__ - Step 28609: {'lr': 0.0004611729245381352, 'samples': 5492928, 'steps': 28608, 'loss/train': 2.53912615776062} 11/07/2021 01:13:57 - INFO - __main__ - Step 28610: {'lr': 0.00046117008403892925, 'samples': 5493120, 'steps': 28609, 'loss/train': 1.485178828239441} 11/07/2021 01:13:57 - INFO - __main__ - Step 28611: {'lr': 0.0004611672434445733, 'samples': 5493312, 'steps': 28610, 'loss/train': 1.8342007398605347} 11/07/2021 01:13:58 - INFO - __main__ - Step 28612: {'lr': 0.0004611644027550687, 'samples': 5493504, 'steps': 28611, 'loss/train': 4.171876907348633} 11/07/2021 01:13:59 - INFO - __main__ - Step 28613: {'lr': 0.00046116156197041657, 'samples': 5493696, 'steps': 28612, 'loss/train': 1.8307241201400757} 11/07/2021 01:13:59 - INFO - __main__ - Step 28614: {'lr': 0.0004611587210906184, 'samples': 5493888, 'steps': 28613, 'loss/train': 0.7530031204223633} 11/07/2021 01:14:00 - INFO - __main__ - Step 28615: {'lr': 0.0004611558801156753, 'samples': 5494080, 'steps': 28614, 'loss/train': 1.3507496118545532} 11/07/2021 01:14:00 - INFO - __main__ - Step 28616: {'lr': 0.0004611530390455887, 'samples': 5494272, 'steps': 28615, 'loss/train': 1.4227341413497925} 11/07/2021 01:14:01 - INFO - __main__ - Step 28617: {'lr': 0.00046115019788035974, 'samples': 5494464, 'steps': 28616, 'loss/train': 0.9620317816734314} 11/07/2021 01:14:01 - INFO - __main__ - Step 28618: {'lr': 0.00046114735661998975, 'samples': 5494656, 'steps': 28617, 'loss/train': 0.9758543372154236} 11/07/2021 01:14:02 - INFO - __main__ - Step 28619: {'lr': 0.0004611445152644801, 'samples': 5494848, 'steps': 28618, 'loss/train': 0.5890863537788391} 11/07/2021 01:14:02 - INFO - __main__ - Step 28620: {'lr': 0.00046114167381383186, 'samples': 5495040, 'steps': 28619, 'loss/train': 2.1583333015441895} 11/07/2021 01:14:02 - INFO - __main__ - Step 28621: {'lr': 0.0004611388322680465, 'samples': 5495232, 'steps': 28620, 'loss/train': 1.5053393840789795} 11/07/2021 01:14:03 - INFO - __main__ - Step 28622: {'lr': 0.0004611359906271253, 'samples': 5495424, 'steps': 28621, 'loss/train': 1.5294830799102783} 11/07/2021 01:14:04 - INFO - __main__ - Step 28623: {'lr': 0.0004611331488910694, 'samples': 5495616, 'steps': 28622, 'loss/train': 1.6916823387145996} 11/07/2021 01:14:04 - INFO - __main__ - Step 28624: {'lr': 0.00046113030705988026, 'samples': 5495808, 'steps': 28623, 'loss/train': 1.8420982360839844} 11/07/2021 01:14:04 - INFO - __main__ - Step 28625: {'lr': 0.000461127465133559, 'samples': 5496000, 'steps': 28624, 'loss/train': 1.3573448657989502} 11/07/2021 01:14:05 - INFO - __main__ - Step 28626: {'lr': 0.0004611246231121069, 'samples': 5496192, 'steps': 28625, 'loss/train': 1.6356327533721924} 11/07/2021 01:14:05 - INFO - __main__ - Step 28627: {'lr': 0.00046112178099552535, 'samples': 5496384, 'steps': 28626, 'loss/train': 1.0614502429962158} 11/07/2021 01:14:06 - INFO - __main__ - Step 28628: {'lr': 0.0004611189387838156, 'samples': 5496576, 'steps': 28627, 'loss/train': 1.4590609073638916} 11/07/2021 01:14:06 - INFO - __main__ - Step 28629: {'lr': 0.00046111609647697893, 'samples': 5496768, 'steps': 28628, 'loss/train': 1.7812243700027466} 11/07/2021 01:14:07 - INFO - __main__ - Step 28630: {'lr': 0.0004611132540750166, 'samples': 5496960, 'steps': 28629, 'loss/train': 1.6481937170028687} 11/07/2021 01:14:07 - INFO - __main__ - Step 28631: {'lr': 0.00046111041157792987, 'samples': 5497152, 'steps': 28630, 'loss/train': 2.1358513832092285} 11/07/2021 01:14:08 - INFO - __main__ - Step 28632: {'lr': 0.00046110756898572, 'samples': 5497344, 'steps': 28631, 'loss/train': 1.7727172374725342} 11/07/2021 01:14:09 - INFO - __main__ - Step 28633: {'lr': 0.0004611047262983884, 'samples': 5497536, 'steps': 28632, 'loss/train': 1.3694134950637817} 11/07/2021 01:14:09 - INFO - __main__ - Step 28634: {'lr': 0.00046110188351593625, 'samples': 5497728, 'steps': 28633, 'loss/train': 1.2695839405059814} 11/07/2021 01:14:09 - INFO - __main__ - Step 28635: {'lr': 0.0004610990406383648, 'samples': 5497920, 'steps': 28634, 'loss/train': 1.223016619682312} 11/07/2021 01:14:10 - INFO - __main__ - Step 28636: {'lr': 0.00046109619766567547, 'samples': 5498112, 'steps': 28635, 'loss/train': 0.545827329158783} 11/07/2021 01:14:10 - INFO - __main__ - Step 28637: {'lr': 0.0004610933545978694, 'samples': 5498304, 'steps': 28636, 'loss/train': 2.735299587249756} 11/07/2021 01:14:11 - INFO - __main__ - Step 28638: {'lr': 0.0004610905114349478, 'samples': 5498496, 'steps': 28637, 'loss/train': 1.3574023246765137} 11/07/2021 01:14:12 - INFO - __main__ - Step 28639: {'lr': 0.0004610876681769123, 'samples': 5498688, 'steps': 28638, 'loss/train': 1.8435614109039307} 11/07/2021 01:14:12 - INFO - __main__ - Step 28640: {'lr': 0.0004610848248237638, 'samples': 5498880, 'steps': 28639, 'loss/train': 1.2343521118164062} 11/07/2021 01:14:12 - INFO - __main__ - Step 28641: {'lr': 0.00046108198137550377, 'samples': 5499072, 'steps': 28640, 'loss/train': 1.4320803880691528} 11/07/2021 01:14:13 - INFO - __main__ - Step 28642: {'lr': 0.0004610791378321335, 'samples': 5499264, 'steps': 28641, 'loss/train': 1.575751543045044} 11/07/2021 01:14:13 - INFO - __main__ - Step 28643: {'lr': 0.0004610762941936542, 'samples': 5499456, 'steps': 28642, 'loss/train': 1.3416160345077515} 11/07/2021 01:14:14 - INFO - __main__ - Step 28644: {'lr': 0.0004610734504600671, 'samples': 5499648, 'steps': 28643, 'loss/train': 1.591726541519165} 11/07/2021 01:14:14 - INFO - __main__ - Step 28645: {'lr': 0.00046107060663137366, 'samples': 5499840, 'steps': 28644, 'loss/train': 1.678292989730835} 11/07/2021 01:14:15 - INFO - __main__ - Step 28646: {'lr': 0.00046106776270757506, 'samples': 5500032, 'steps': 28645, 'loss/train': 1.7594455480575562} 11/07/2021 01:14:15 - INFO - __main__ - Step 28647: {'lr': 0.0004610649186886725, 'samples': 5500224, 'steps': 28646, 'loss/train': 1.5664775371551514} 11/07/2021 01:14:16 - INFO - __main__ - Step 28648: {'lr': 0.00046106207457466744, 'samples': 5500416, 'steps': 28647, 'loss/train': 1.4586031436920166} 11/07/2021 01:14:17 - INFO - __main__ - Step 28649: {'lr': 0.0004610592303655611, 'samples': 5500608, 'steps': 28648, 'loss/train': 1.7166132926940918} 11/07/2021 01:14:17 - INFO - __main__ - Step 28650: {'lr': 0.0004610563860613546, 'samples': 5500800, 'steps': 28649, 'loss/train': 5.587038516998291} 11/07/2021 01:14:17 - INFO - __main__ - Step 28651: {'lr': 0.00046105354166204937, 'samples': 5500992, 'steps': 28650, 'loss/train': 1.741302251815796} 11/07/2021 01:14:18 - INFO - __main__ - Step 28652: {'lr': 0.00046105069716764676, 'samples': 5501184, 'steps': 28651, 'loss/train': 0.7651360630989075} 11/07/2021 01:14:18 - INFO - __main__ - Step 28653: {'lr': 0.00046104785257814786, 'samples': 5501376, 'steps': 28652, 'loss/train': 1.889872431755066} 11/07/2021 01:14:19 - INFO - __main__ - Step 28654: {'lr': 0.0004610450078935541, 'samples': 5501568, 'steps': 28653, 'loss/train': 1.3259265422821045} 11/07/2021 01:14:20 - INFO - __main__ - Step 28655: {'lr': 0.00046104216311386676, 'samples': 5501760, 'steps': 28654, 'loss/train': 1.460955023765564} 11/07/2021 01:14:20 - INFO - __main__ - Step 28656: {'lr': 0.000461039318239087, 'samples': 5501952, 'steps': 28655, 'loss/train': 1.2348376512527466} 11/07/2021 01:14:21 - INFO - __main__ - Step 28657: {'lr': 0.00046103647326921625, 'samples': 5502144, 'steps': 28656, 'loss/train': 2.395113945007324} 11/07/2021 01:14:21 - INFO - __main__ - Step 28658: {'lr': 0.00046103362820425567, 'samples': 5502336, 'steps': 28657, 'loss/train': 1.311152696609497} 11/07/2021 01:14:21 - INFO - __main__ - Step 28659: {'lr': 0.00046103078304420665, 'samples': 5502528, 'steps': 28658, 'loss/train': 1.4987415075302124} 11/07/2021 01:14:22 - INFO - __main__ - Step 28660: {'lr': 0.0004610279377890704, 'samples': 5502720, 'steps': 28659, 'loss/train': 0.693972647190094} 11/07/2021 01:14:23 - INFO - __main__ - Step 28661: {'lr': 0.00046102509243884813, 'samples': 5502912, 'steps': 28660, 'loss/train': 1.4814105033874512} 11/07/2021 01:14:23 - INFO - __main__ - Step 28662: {'lr': 0.0004610222469935413, 'samples': 5503104, 'steps': 28661, 'loss/train': 1.6567411422729492} 11/07/2021 01:14:23 - INFO - __main__ - Step 28663: {'lr': 0.000461019401453151, 'samples': 5503296, 'steps': 28662, 'loss/train': 2.106566905975342} 11/07/2021 01:14:24 - INFO - __main__ - Step 28664: {'lr': 0.00046101655581767874, 'samples': 5503488, 'steps': 28663, 'loss/train': 0.5619341731071472} 11/07/2021 01:14:25 - INFO - __main__ - Step 28665: {'lr': 0.0004610137100871257, 'samples': 5503680, 'steps': 28664, 'loss/train': 1.7786967754364014} 11/07/2021 01:14:25 - INFO - __main__ - Step 28666: {'lr': 0.00046101086426149297, 'samples': 5503872, 'steps': 28665, 'loss/train': 1.7782741785049438} 11/07/2021 01:14:26 - INFO - __main__ - Step 28667: {'lr': 0.0004610080183407821, 'samples': 5504064, 'steps': 28666, 'loss/train': 1.60888671875} 11/07/2021 01:14:26 - INFO - __main__ - Step 28668: {'lr': 0.0004610051723249943, 'samples': 5504256, 'steps': 28667, 'loss/train': 1.6466768980026245} 11/07/2021 01:14:26 - INFO - __main__ - Step 28669: {'lr': 0.0004610023262141308, 'samples': 5504448, 'steps': 28668, 'loss/train': 2.1476855278015137} 11/07/2021 01:14:27 - INFO - __main__ - Step 28670: {'lr': 0.00046099948000819294, 'samples': 5504640, 'steps': 28669, 'loss/train': 2.24654483795166} 11/07/2021 01:14:28 - INFO - __main__ - Step 28671: {'lr': 0.0004609966337071819, 'samples': 5504832, 'steps': 28670, 'loss/train': 1.561232089996338} 11/07/2021 01:14:28 - INFO - __main__ - Step 28672: {'lr': 0.00046099378731109906, 'samples': 5505024, 'steps': 28671, 'loss/train': 0.2538740634918213} 11/07/2021 01:14:28 - INFO - __main__ - Step 28673: {'lr': 0.00046099094081994565, 'samples': 5505216, 'steps': 28672, 'loss/train': 1.683111310005188} 11/07/2021 01:14:29 - INFO - __main__ - Step 28674: {'lr': 0.000460988094233723, 'samples': 5505408, 'steps': 28673, 'loss/train': 0.622854471206665} 11/07/2021 01:14:29 - INFO - __main__ - Step 28675: {'lr': 0.00046098524755243246, 'samples': 5505600, 'steps': 28674, 'loss/train': 1.3095825910568237} 11/07/2021 01:14:30 - INFO - __main__ - Step 28676: {'lr': 0.0004609824007760751, 'samples': 5505792, 'steps': 28675, 'loss/train': 1.7884206771850586} 11/07/2021 01:14:31 - INFO - __main__ - Step 28677: {'lr': 0.0004609795539046524, 'samples': 5505984, 'steps': 28676, 'loss/train': 0.9840785264968872} 11/07/2021 01:14:31 - INFO - __main__ - Step 28678: {'lr': 0.0004609767069381655, 'samples': 5506176, 'steps': 28677, 'loss/train': 1.2989946603775024} 11/07/2021 01:14:31 - INFO - __main__ - Step 28679: {'lr': 0.00046097385987661576, 'samples': 5506368, 'steps': 28678, 'loss/train': 0.7278133630752563} 11/07/2021 01:14:32 - INFO - __main__ - Step 28680: {'lr': 0.00046097101272000454, 'samples': 5506560, 'steps': 28679, 'loss/train': 2.0719168186187744} 11/07/2021 01:14:33 - INFO - __main__ - Step 28681: {'lr': 0.0004609681654683329, 'samples': 5506752, 'steps': 28680, 'loss/train': 1.3236138820648193} 11/07/2021 01:14:33 - INFO - __main__ - Step 28682: {'lr': 0.0004609653181216024, 'samples': 5506944, 'steps': 28681, 'loss/train': 1.3993245363235474} 11/07/2021 01:14:33 - INFO - __main__ - Step 28683: {'lr': 0.0004609624706798141, 'samples': 5507136, 'steps': 28682, 'loss/train': 1.698938250541687} 11/07/2021 01:14:34 - INFO - __main__ - Step 28684: {'lr': 0.00046095962314296934, 'samples': 5507328, 'steps': 28683, 'loss/train': 1.4753496646881104} 11/07/2021 01:14:34 - INFO - __main__ - Step 28685: {'lr': 0.00046095677551106953, 'samples': 5507520, 'steps': 28684, 'loss/train': 1.476138710975647} 11/07/2021 01:14:35 - INFO - __main__ - Step 28686: {'lr': 0.00046095392778411576, 'samples': 5507712, 'steps': 28685, 'loss/train': 2.1143760681152344} 11/07/2021 01:14:35 - INFO - __main__ - Step 28687: {'lr': 0.0004609510799621095, 'samples': 5507904, 'steps': 28686, 'loss/train': 1.7127150297164917} 11/07/2021 01:14:36 - INFO - __main__ - Step 28688: {'lr': 0.0004609482320450519, 'samples': 5508096, 'steps': 28687, 'loss/train': 1.77858567237854} 11/07/2021 01:14:36 - INFO - __main__ - Step 28689: {'lr': 0.00046094538403294416, 'samples': 5508288, 'steps': 28688, 'loss/train': 1.3900631666183472} 11/07/2021 01:14:36 - INFO - __main__ - Step 28690: {'lr': 0.00046094253592578784, 'samples': 5508480, 'steps': 28689, 'loss/train': 1.9384557008743286} 11/07/2021 01:14:37 - INFO - __main__ - Step 28691: {'lr': 0.000460939687723584, 'samples': 5508672, 'steps': 28690, 'loss/train': 1.296342134475708} 11/07/2021 01:14:38 - INFO - __main__ - Step 28692: {'lr': 0.000460936839426334, 'samples': 5508864, 'steps': 28691, 'loss/train': 2.038013458251953} 11/07/2021 01:14:38 - INFO - __main__ - Step 28693: {'lr': 0.00046093399103403913, 'samples': 5509056, 'steps': 28692, 'loss/train': 1.3611464500427246} 11/07/2021 01:14:39 - INFO - __main__ - Step 28694: {'lr': 0.00046093114254670066, 'samples': 5509248, 'steps': 28693, 'loss/train': 0.881278395652771} 11/07/2021 01:14:39 - INFO - __main__ - Step 28695: {'lr': 0.0004609282939643199, 'samples': 5509440, 'steps': 28694, 'loss/train': 1.3509044647216797} 11/07/2021 01:14:39 - INFO - __main__ - Step 28696: {'lr': 0.00046092544528689806, 'samples': 5509632, 'steps': 28695, 'loss/train': 1.7343835830688477} 11/07/2021 01:14:40 - INFO - __main__ - Step 28697: {'lr': 0.0004609225965144365, 'samples': 5509824, 'steps': 28696, 'loss/train': 1.621491551399231} 11/07/2021 01:14:41 - INFO - __main__ - Step 28698: {'lr': 0.00046091974764693645, 'samples': 5510016, 'steps': 28697, 'loss/train': 1.771262764930725} 11/07/2021 01:14:41 - INFO - __main__ - Step 28699: {'lr': 0.0004609168986843992, 'samples': 5510208, 'steps': 28698, 'loss/train': 1.3424490690231323} 11/07/2021 01:14:41 - INFO - __main__ - Step 28700: {'lr': 0.000460914049626826, 'samples': 5510400, 'steps': 28699, 'loss/train': 1.46780526638031} 11/07/2021 01:14:42 - INFO - __main__ - Step 28701: {'lr': 0.0004609112004742183, 'samples': 5510592, 'steps': 28700, 'loss/train': 1.7891309261322021} 11/07/2021 01:14:43 - INFO - __main__ - Step 28702: {'lr': 0.0004609083512265773, 'samples': 5510784, 'steps': 28701, 'loss/train': 1.23452889919281} 11/07/2021 01:14:43 - INFO - __main__ - Step 28703: {'lr': 0.0004609055018839041, 'samples': 5510976, 'steps': 28702, 'loss/train': 1.4976916313171387} 11/07/2021 01:14:43 - INFO - __main__ - Step 28704: {'lr': 0.0004609026524462002, 'samples': 5511168, 'steps': 28703, 'loss/train': 1.540532112121582} 11/07/2021 01:14:44 - INFO - __main__ - Step 28705: {'lr': 0.00046089980291346685, 'samples': 5511360, 'steps': 28704, 'loss/train': 1.2702617645263672} 11/07/2021 01:14:44 - INFO - __main__ - Step 28706: {'lr': 0.00046089695328570523, 'samples': 5511552, 'steps': 28705, 'loss/train': 1.664182186126709} 11/07/2021 01:14:45 - INFO - __main__ - Step 28707: {'lr': 0.0004608941035629168, 'samples': 5511744, 'steps': 28706, 'loss/train': 1.796590805053711} 11/07/2021 01:14:45 - INFO - __main__ - Step 28708: {'lr': 0.0004608912537451027, 'samples': 5511936, 'steps': 28707, 'loss/train': 1.765599012374878} 11/07/2021 01:14:46 - INFO - __main__ - Step 28709: {'lr': 0.0004608884038322642, 'samples': 5512128, 'steps': 28708, 'loss/train': 1.7580174207687378} 11/07/2021 01:14:46 - INFO - __main__ - Step 28710: {'lr': 0.00046088555382440275, 'samples': 5512320, 'steps': 28709, 'loss/train': 1.446540117263794} 11/07/2021 01:14:46 - INFO - __main__ - Step 28711: {'lr': 0.0004608827037215194, 'samples': 5512512, 'steps': 28710, 'loss/train': 1.4470421075820923} 11/07/2021 01:14:47 - INFO - __main__ - Step 28712: {'lr': 0.0004608798535236156, 'samples': 5512704, 'steps': 28711, 'loss/train': 1.5437874794006348} 11/07/2021 01:14:48 - INFO - __main__ - Step 28713: {'lr': 0.0004608770032306926, 'samples': 5512896, 'steps': 28712, 'loss/train': 1.2525914907455444} 11/07/2021 01:14:48 - INFO - __main__ - Step 28714: {'lr': 0.0004608741528427517, 'samples': 5513088, 'steps': 28713, 'loss/train': 1.7943804264068604} 11/07/2021 01:14:48 - INFO - __main__ - Step 28715: {'lr': 0.0004608713023597941, 'samples': 5513280, 'steps': 28714, 'loss/train': 1.7214324474334717} 11/07/2021 01:14:49 - INFO - __main__ - Step 28716: {'lr': 0.00046086845178182123, 'samples': 5513472, 'steps': 28715, 'loss/train': 1.343458652496338} 11/07/2021 01:14:49 - INFO - __main__ - Step 28717: {'lr': 0.00046086560110883423, 'samples': 5513664, 'steps': 28716, 'loss/train': 1.327499270439148} 11/07/2021 01:14:50 - INFO - __main__ - Step 28718: {'lr': 0.00046086275034083453, 'samples': 5513856, 'steps': 28717, 'loss/train': 1.1367992162704468} 11/07/2021 01:14:51 - INFO - __main__ - Step 28719: {'lr': 0.00046085989947782327, 'samples': 5514048, 'steps': 28718, 'loss/train': 1.0489040613174438} 11/07/2021 01:14:51 - INFO - __main__ - Step 28720: {'lr': 0.00046085704851980174, 'samples': 5514240, 'steps': 28719, 'loss/train': 1.5698318481445312} 11/07/2021 01:14:51 - INFO - __main__ - Step 28721: {'lr': 0.00046085419746677136, 'samples': 5514432, 'steps': 28720, 'loss/train': 1.8643708229064941} 11/07/2021 01:14:52 - INFO - __main__ - Step 28722: {'lr': 0.00046085134631873326, 'samples': 5514624, 'steps': 28721, 'loss/train': 0.3820008933544159} 11/07/2021 01:14:53 - INFO - __main__ - Step 28723: {'lr': 0.0004608484950756888, 'samples': 5514816, 'steps': 28722, 'loss/train': 1.9114362001419067} 11/07/2021 01:14:53 - INFO - __main__ - Step 28724: {'lr': 0.0004608456437376393, 'samples': 5515008, 'steps': 28723, 'loss/train': 1.7168536186218262} 11/07/2021 01:14:54 - INFO - __main__ - Step 28725: {'lr': 0.000460842792304586, 'samples': 5515200, 'steps': 28724, 'loss/train': 1.5014718770980835} 11/07/2021 01:14:54 - INFO - __main__ - Step 28726: {'lr': 0.00046083994077653024, 'samples': 5515392, 'steps': 28725, 'loss/train': 1.781377911567688} 11/07/2021 01:14:54 - INFO - __main__ - Step 28727: {'lr': 0.0004608370891534732, 'samples': 5515584, 'steps': 28726, 'loss/train': 1.7531365156173706} 11/07/2021 01:14:55 - INFO - __main__ - Step 28728: {'lr': 0.0004608342374354162, 'samples': 5515776, 'steps': 28727, 'loss/train': 1.8156870603561401} 11/07/2021 01:14:56 - INFO - __main__ - Step 28729: {'lr': 0.0004608313856223606, 'samples': 5515968, 'steps': 28728, 'loss/train': 1.3125196695327759} 11/07/2021 01:14:56 - INFO - __main__ - Step 28730: {'lr': 0.00046082853371430754, 'samples': 5516160, 'steps': 28729, 'loss/train': 0.5172771215438843} 11/07/2021 01:14:56 - INFO - __main__ - Step 28731: {'lr': 0.0004608256817112585, 'samples': 5516352, 'steps': 28730, 'loss/train': 1.6598970890045166} 11/07/2021 01:14:57 - INFO - __main__ - Step 28732: {'lr': 0.00046082282961321466, 'samples': 5516544, 'steps': 28731, 'loss/train': 1.5800282955169678} 11/07/2021 01:14:57 - INFO - __main__ - Step 28733: {'lr': 0.00046081997742017725, 'samples': 5516736, 'steps': 28732, 'loss/train': 1.9237587451934814} 11/07/2021 01:14:58 - INFO - __main__ - Step 28734: {'lr': 0.00046081712513214757, 'samples': 5516928, 'steps': 28733, 'loss/train': 1.1939243078231812} 11/07/2021 01:14:59 - INFO - __main__ - Step 28735: {'lr': 0.0004608142727491271, 'samples': 5517120, 'steps': 28734, 'loss/train': 1.5787652730941772} 11/07/2021 01:14:59 - INFO - __main__ - Step 28736: {'lr': 0.00046081142027111683, 'samples': 5517312, 'steps': 28735, 'loss/train': 1.1781924962997437} 11/07/2021 01:14:59 - INFO - __main__ - Step 28737: {'lr': 0.0004608085676981182, 'samples': 5517504, 'steps': 28736, 'loss/train': 1.891564965248108} 11/07/2021 01:15:00 - INFO - __main__ - Step 28738: {'lr': 0.0004608057150301326, 'samples': 5517696, 'steps': 28737, 'loss/train': 0.9074145555496216} 11/07/2021 01:15:01 - INFO - __main__ - Step 28739: {'lr': 0.00046080286226716106, 'samples': 5517888, 'steps': 28738, 'loss/train': 1.5400950908660889} 11/07/2021 01:15:01 - INFO - __main__ - Step 28740: {'lr': 0.00046080000940920506, 'samples': 5518080, 'steps': 28739, 'loss/train': 1.1455605030059814} 11/07/2021 01:15:02 - INFO - __main__ - Step 28741: {'lr': 0.00046079715645626584, 'samples': 5518272, 'steps': 28740, 'loss/train': 1.4673417806625366} 11/07/2021 01:15:02 - INFO - __main__ - Step 28742: {'lr': 0.00046079430340834467, 'samples': 5518464, 'steps': 28741, 'loss/train': 1.6762181520462036} 11/07/2021 01:15:03 - INFO - __main__ - Step 28743: {'lr': 0.00046079145026544277, 'samples': 5518656, 'steps': 28742, 'loss/train': 1.453457236289978} 11/07/2021 01:15:04 - INFO - __main__ - Step 28744: {'lr': 0.0004607885970275616, 'samples': 5518848, 'steps': 28743, 'loss/train': 0.2404089868068695} 11/07/2021 01:15:04 - INFO - __main__ - Step 28745: {'lr': 0.0004607857436947023, 'samples': 5519040, 'steps': 28744, 'loss/train': 1.8219144344329834} 11/07/2021 01:15:04 - INFO - __main__ - Step 28746: {'lr': 0.00046078289026686616, 'samples': 5519232, 'steps': 28745, 'loss/train': 1.8101202249526978} 11/07/2021 01:15:05 - INFO - __main__ - Step 28747: {'lr': 0.00046078003674405457, 'samples': 5519424, 'steps': 28746, 'loss/train': 1.357011079788208} 11/07/2021 01:15:05 - INFO - __main__ - Step 28748: {'lr': 0.0004607771831262687, 'samples': 5519616, 'steps': 28747, 'loss/train': 1.8157292604446411} 11/07/2021 01:15:06 - INFO - __main__ - Step 28749: {'lr': 0.00046077432941350993, 'samples': 5519808, 'steps': 28748, 'loss/train': 1.8717877864837646} 11/07/2021 01:15:06 - INFO - __main__ - Step 28750: {'lr': 0.00046077147560577943, 'samples': 5520000, 'steps': 28749, 'loss/train': 1.5922538042068481} 11/07/2021 01:15:07 - INFO - __main__ - Step 28751: {'lr': 0.0004607686217030786, 'samples': 5520192, 'steps': 28750, 'loss/train': 2.1468424797058105} 11/07/2021 01:15:07 - INFO - __main__ - Step 28752: {'lr': 0.00046076576770540865, 'samples': 5520384, 'steps': 28751, 'loss/train': 1.7125825881958008} 11/07/2021 01:15:08 - INFO - __main__ - Step 28753: {'lr': 0.00046076291361277097, 'samples': 5520576, 'steps': 28752, 'loss/train': 1.5815342664718628} 11/07/2021 01:15:08 - INFO - __main__ - Step 28754: {'lr': 0.00046076005942516666, 'samples': 5520768, 'steps': 28753, 'loss/train': 1.7074263095855713} 11/07/2021 01:15:09 - INFO - __main__ - Step 28755: {'lr': 0.0004607572051425972, 'samples': 5520960, 'steps': 28754, 'loss/train': 2.52258563041687} 11/07/2021 01:15:09 - INFO - __main__ - Step 28756: {'lr': 0.00046075435076506376, 'samples': 5521152, 'steps': 28755, 'loss/train': 1.4358375072479248} 11/07/2021 01:15:10 - INFO - __main__ - Step 28757: {'lr': 0.0004607514962925677, 'samples': 5521344, 'steps': 28756, 'loss/train': 1.3483976125717163} 11/07/2021 01:15:10 - INFO - __main__ - Step 28758: {'lr': 0.00046074864172511025, 'samples': 5521536, 'steps': 28757, 'loss/train': 1.3330392837524414} 11/07/2021 01:15:10 - INFO - __main__ - Step 28759: {'lr': 0.0004607457870626928, 'samples': 5521728, 'steps': 28758, 'loss/train': 2.126948118209839} 11/07/2021 01:15:11 - INFO - __main__ - Step 28760: {'lr': 0.0004607429323053164, 'samples': 5521920, 'steps': 28759, 'loss/train': 1.9364488124847412} 11/07/2021 01:15:12 - INFO - __main__ - Step 28761: {'lr': 0.0004607400774529825, 'samples': 5522112, 'steps': 28760, 'loss/train': 1.7834445238113403} 11/07/2021 01:15:12 - INFO - __main__ - Step 28762: {'lr': 0.0004607372225056925, 'samples': 5522304, 'steps': 28761, 'loss/train': 1.589208960533142} 11/07/2021 01:15:12 - INFO - __main__ - Step 28763: {'lr': 0.00046073436746344744, 'samples': 5522496, 'steps': 28762, 'loss/train': 1.5198677778244019} 11/07/2021 01:15:13 - INFO - __main__ - Step 28764: {'lr': 0.0004607315123262488, 'samples': 5522688, 'steps': 28763, 'loss/train': 1.6237186193466187} 11/07/2021 01:15:14 - INFO - __main__ - Step 28765: {'lr': 0.0004607286570940977, 'samples': 5522880, 'steps': 28764, 'loss/train': 0.8403159379959106} 11/07/2021 01:15:14 - INFO - __main__ - Step 28766: {'lr': 0.0004607258017669956, 'samples': 5523072, 'steps': 28765, 'loss/train': 1.419538140296936} 11/07/2021 01:15:15 - INFO - __main__ - Step 28767: {'lr': 0.0004607229463449437, 'samples': 5523264, 'steps': 28766, 'loss/train': 1.5430967807769775} 11/07/2021 01:15:15 - INFO - __main__ - Step 28768: {'lr': 0.00046072009082794333, 'samples': 5523456, 'steps': 28767, 'loss/train': 1.846705436706543} 11/07/2021 01:15:15 - INFO - __main__ - Step 28769: {'lr': 0.00046071723521599563, 'samples': 5523648, 'steps': 28768, 'loss/train': 1.8131253719329834} 11/07/2021 01:15:16 - INFO - __main__ - Step 28770: {'lr': 0.000460714379509102, 'samples': 5523840, 'steps': 28769, 'loss/train': 1.3720142841339111} 11/07/2021 01:15:17 - INFO - __main__ - Step 28771: {'lr': 0.0004607115237072638, 'samples': 5524032, 'steps': 28770, 'loss/train': 1.7073737382888794} 11/07/2021 01:15:17 - INFO - __main__ - Step 28772: {'lr': 0.00046070866781048225, 'samples': 5524224, 'steps': 28771, 'loss/train': 1.6079767942428589} 11/07/2021 01:15:17 - INFO - __main__ - Step 28773: {'lr': 0.0004607058118187586, 'samples': 5524416, 'steps': 28772, 'loss/train': 1.6379915475845337} 11/07/2021 01:15:18 - INFO - __main__ - Step 28774: {'lr': 0.00046070295573209406, 'samples': 5524608, 'steps': 28773, 'loss/train': 1.3397719860076904} 11/07/2021 01:15:18 - INFO - __main__ - Step 28775: {'lr': 0.00046070009955049017, 'samples': 5524800, 'steps': 28774, 'loss/train': 1.4922139644622803} 11/07/2021 01:15:19 - INFO - __main__ - Step 28776: {'lr': 0.000460697243273948, 'samples': 5524992, 'steps': 28775, 'loss/train': 1.7376784086227417} 11/07/2021 01:15:19 - INFO - __main__ - Step 28777: {'lr': 0.0004606943869024689, 'samples': 5525184, 'steps': 28776, 'loss/train': 1.3660744428634644} 11/07/2021 01:15:20 - INFO - __main__ - Step 28778: {'lr': 0.0004606915304360542, 'samples': 5525376, 'steps': 28777, 'loss/train': 1.768597960472107} 11/07/2021 01:15:20 - INFO - __main__ - Step 28779: {'lr': 0.00046068867387470507, 'samples': 5525568, 'steps': 28778, 'loss/train': 1.5304911136627197} 11/07/2021 01:15:20 - INFO - __main__ - Step 28780: {'lr': 0.00046068581721842294, 'samples': 5525760, 'steps': 28779, 'loss/train': 1.5088049173355103} 11/07/2021 01:15:22 - INFO - __main__ - Step 28781: {'lr': 0.00046068296046720904, 'samples': 5525952, 'steps': 28780, 'loss/train': 1.3194479942321777} 11/07/2021 01:15:22 - INFO - __main__ - Step 28782: {'lr': 0.0004606801036210646, 'samples': 5526144, 'steps': 28781, 'loss/train': 1.318917155265808} 11/07/2021 01:15:22 - INFO - __main__ - Step 28783: {'lr': 0.000460677246679991, 'samples': 5526336, 'steps': 28782, 'loss/train': 1.745903491973877} 11/07/2021 01:15:23 - INFO - __main__ - Step 28784: {'lr': 0.00046067438964398944, 'samples': 5526528, 'steps': 28783, 'loss/train': 1.8044811487197876} 11/07/2021 01:15:23 - INFO - __main__ - Step 28785: {'lr': 0.00046067153251306127, 'samples': 5526720, 'steps': 28784, 'loss/train': 1.8561345338821411} 11/07/2021 01:15:23 - INFO - __main__ - Step 28786: {'lr': 0.0004606686752872078, 'samples': 5526912, 'steps': 28785, 'loss/train': 0.1970166563987732} 11/07/2021 01:15:24 - INFO - __main__ - Step 28787: {'lr': 0.0004606658179664302, 'samples': 5527104, 'steps': 28786, 'loss/train': 1.122194528579712} 11/07/2021 01:15:25 - INFO - __main__ - Step 28788: {'lr': 0.00046066296055072986, 'samples': 5527296, 'steps': 28787, 'loss/train': 1.4112536907196045} 11/07/2021 01:15:25 - INFO - __main__ - Step 28789: {'lr': 0.0004606601030401081, 'samples': 5527488, 'steps': 28788, 'loss/train': 1.6186010837554932} 11/07/2021 01:15:26 - INFO - __main__ - Step 28790: {'lr': 0.0004606572454345661, 'samples': 5527680, 'steps': 28789, 'loss/train': 1.6745975017547607} 11/07/2021 01:15:26 - INFO - __main__ - Step 28791: {'lr': 0.0004606543877341052, 'samples': 5527872, 'steps': 28790, 'loss/train': 1.5834710597991943} 11/07/2021 01:15:27 - INFO - __main__ - Step 28792: {'lr': 0.00046065152993872665, 'samples': 5528064, 'steps': 28791, 'loss/train': 1.9523475170135498} 11/07/2021 01:15:27 - INFO - __main__ - Step 28793: {'lr': 0.0004606486720484318, 'samples': 5528256, 'steps': 28792, 'loss/train': 1.4230964183807373} 11/07/2021 01:15:28 - INFO - __main__ - Step 28794: {'lr': 0.0004606458140632219, 'samples': 5528448, 'steps': 28793, 'loss/train': 1.6033148765563965} 11/07/2021 01:15:28 - INFO - __main__ - Step 28795: {'lr': 0.0004606429559830982, 'samples': 5528640, 'steps': 28794, 'loss/train': 1.8930948972702026} 11/07/2021 01:15:28 - INFO - __main__ - Step 28796: {'lr': 0.00046064009780806217, 'samples': 5528832, 'steps': 28795, 'loss/train': 1.2086946964263916} 11/07/2021 01:15:29 - INFO - __main__ - Step 28797: {'lr': 0.0004606372395381149, 'samples': 5529024, 'steps': 28796, 'loss/train': 1.422204852104187} 11/07/2021 01:15:30 - INFO - __main__ - Step 28798: {'lr': 0.0004606343811732577, 'samples': 5529216, 'steps': 28797, 'loss/train': 1.636315941810608} 11/07/2021 01:15:30 - INFO - __main__ - Step 28799: {'lr': 0.0004606315227134919, 'samples': 5529408, 'steps': 28798, 'loss/train': 1.4297635555267334} 11/07/2021 01:15:30 - INFO - __main__ - Step 28800: {'lr': 0.0004606286641588188, 'samples': 5529600, 'steps': 28799, 'loss/train': 1.58151113986969} 11/07/2021 01:15:31 - INFO - __main__ - Step 28801: {'lr': 0.0004606258055092397, 'samples': 5529792, 'steps': 28800, 'loss/train': 1.6272387504577637} 11/07/2021 01:15:32 - INFO - __main__ - Step 28802: {'lr': 0.00046062294676475584, 'samples': 5529984, 'steps': 28801, 'loss/train': 1.5560965538024902} 11/07/2021 01:15:32 - INFO - __main__ - Step 28803: {'lr': 0.0004606200879253685, 'samples': 5530176, 'steps': 28802, 'loss/train': 1.6053823232650757} 11/07/2021 01:15:33 - INFO - __main__ - Step 28804: {'lr': 0.00046061722899107905, 'samples': 5530368, 'steps': 28803, 'loss/train': 0.8932515978813171} 11/07/2021 01:15:33 - INFO - __main__ - Step 28805: {'lr': 0.0004606143699618888, 'samples': 5530560, 'steps': 28804, 'loss/train': 0.1487438976764679} 11/07/2021 01:15:33 - INFO - __main__ - Step 28806: {'lr': 0.00046061151083779886, 'samples': 5530752, 'steps': 28805, 'loss/train': 1.1285860538482666} 11/07/2021 01:15:34 - INFO - __main__ - Step 28807: {'lr': 0.0004606086516188106, 'samples': 5530944, 'steps': 28806, 'loss/train': 0.792855441570282} 11/07/2021 01:15:35 - INFO - __main__ - Step 28808: {'lr': 0.00046060579230492533, 'samples': 5531136, 'steps': 28807, 'loss/train': 1.6321196556091309} 11/07/2021 01:15:35 - INFO - __main__ - Step 28809: {'lr': 0.0004606029328961444, 'samples': 5531328, 'steps': 28808, 'loss/train': 0.44900208711624146} 11/07/2021 01:15:36 - INFO - __main__ - Step 28810: {'lr': 0.000460600073392469, 'samples': 5531520, 'steps': 28809, 'loss/train': 0.22598089277744293} 11/07/2021 01:15:36 - INFO - __main__ - Step 28811: {'lr': 0.00046059721379390053, 'samples': 5531712, 'steps': 28810, 'loss/train': 2.1848089694976807} 11/07/2021 01:15:36 - INFO - __main__ - Step 28812: {'lr': 0.0004605943541004401, 'samples': 5531904, 'steps': 28811, 'loss/train': 2.273839235305786} 11/07/2021 01:15:38 - INFO - __main__ - Step 28813: {'lr': 0.00046059149431208914, 'samples': 5532096, 'steps': 28812, 'loss/train': 1.8862618207931519} 11/07/2021 01:15:38 - INFO - __main__ - Step 28814: {'lr': 0.0004605886344288489, 'samples': 5532288, 'steps': 28813, 'loss/train': 1.0264168977737427} 11/07/2021 01:15:38 - INFO - __main__ - Step 28815: {'lr': 0.0004605857744507207, 'samples': 5532480, 'steps': 28814, 'loss/train': 1.2002111673355103} 11/07/2021 01:15:39 - INFO - __main__ - Step 28816: {'lr': 0.00046058291437770584, 'samples': 5532672, 'steps': 28815, 'loss/train': 1.7769715785980225} 11/07/2021 01:15:39 - INFO - __main__ - Step 28817: {'lr': 0.0004605800542098054, 'samples': 5532864, 'steps': 28816, 'loss/train': 1.6534240245819092} 11/07/2021 01:15:40 - INFO - __main__ - Step 28818: {'lr': 0.00046057719394702103, 'samples': 5533056, 'steps': 28817, 'loss/train': 1.4326366186141968} 11/07/2021 01:15:40 - INFO - __main__ - Step 28819: {'lr': 0.00046057433358935373, 'samples': 5533248, 'steps': 28818, 'loss/train': 1.4733604192733765} 11/07/2021 01:15:41 - INFO - __main__ - Step 28820: {'lr': 0.0004605714731368049, 'samples': 5533440, 'steps': 28819, 'loss/train': 1.6320061683654785} 11/07/2021 01:15:41 - INFO - __main__ - Step 28821: {'lr': 0.0004605686125893758, 'samples': 5533632, 'steps': 28820, 'loss/train': 1.660555124282837} 11/07/2021 01:15:41 - INFO - __main__ - Step 28822: {'lr': 0.00046056575194706773, 'samples': 5533824, 'steps': 28821, 'loss/train': 1.3864659070968628} 11/07/2021 01:15:42 - INFO - __main__ - Step 28823: {'lr': 0.000460562891209882, 'samples': 5534016, 'steps': 28822, 'loss/train': 1.2763863801956177} 11/07/2021 01:15:43 - INFO - __main__ - Step 28824: {'lr': 0.0004605600303778199, 'samples': 5534208, 'steps': 28823, 'loss/train': 1.6745431423187256} 11/07/2021 01:15:43 - INFO - __main__ - Step 28825: {'lr': 0.0004605571694508827, 'samples': 5534400, 'steps': 28824, 'loss/train': 1.4444369077682495} 11/07/2021 01:15:44 - INFO - __main__ - Step 28826: {'lr': 0.0004605543084290716, 'samples': 5534592, 'steps': 28825, 'loss/train': 1.4224810600280762} 11/07/2021 01:15:44 - INFO - __main__ - Step 28827: {'lr': 0.00046055144731238805, 'samples': 5534784, 'steps': 28826, 'loss/train': 1.407976746559143} 11/07/2021 01:15:45 - INFO - __main__ - Step 28828: {'lr': 0.00046054858610083325, 'samples': 5534976, 'steps': 28827, 'loss/train': 3.1830976009368896} 11/07/2021 01:15:45 - INFO - __main__ - Step 28829: {'lr': 0.0004605457247944086, 'samples': 5535168, 'steps': 28828, 'loss/train': 1.439624309539795} 11/07/2021 01:15:46 - INFO - __main__ - Step 28830: {'lr': 0.0004605428633931152, 'samples': 5535360, 'steps': 28829, 'loss/train': 1.4871230125427246} 11/07/2021 01:15:46 - INFO - __main__ - Step 28831: {'lr': 0.00046054000189695444, 'samples': 5535552, 'steps': 28830, 'loss/train': 1.4554721117019653} 11/07/2021 01:15:46 - INFO - __main__ - Step 28832: {'lr': 0.00046053714030592764, 'samples': 5535744, 'steps': 28831, 'loss/train': 1.7599273920059204} 11/07/2021 01:15:47 - INFO - __main__ - Step 28833: {'lr': 0.0004605342786200359, 'samples': 5535936, 'steps': 28832, 'loss/train': 1.0530322790145874} 11/07/2021 01:15:48 - INFO - __main__ - Step 28834: {'lr': 0.0004605314168392809, 'samples': 5536128, 'steps': 28833, 'loss/train': 1.3571157455444336} 11/07/2021 01:15:48 - INFO - __main__ - Step 28835: {'lr': 0.00046052855496366354, 'samples': 5536320, 'steps': 28834, 'loss/train': 1.7880867719650269} 11/07/2021 01:15:48 - INFO - __main__ - Step 28836: {'lr': 0.0004605256929931853, 'samples': 5536512, 'steps': 28835, 'loss/train': 1.3933273553848267} 11/07/2021 01:15:49 - INFO - __main__ - Step 28837: {'lr': 0.0004605228309278474, 'samples': 5536704, 'steps': 28836, 'loss/train': 1.5931357145309448} 11/07/2021 01:15:49 - INFO - __main__ - Step 28838: {'lr': 0.0004605199687676512, 'samples': 5536896, 'steps': 28837, 'loss/train': 1.2500851154327393} 11/07/2021 01:15:50 - INFO - __main__ - Step 28839: {'lr': 0.00046051710651259797, 'samples': 5537088, 'steps': 28838, 'loss/train': 1.5644257068634033} 11/07/2021 01:15:51 - INFO - __main__ - Step 28840: {'lr': 0.00046051424416268896, 'samples': 5537280, 'steps': 28839, 'loss/train': 1.6065336465835571} 11/07/2021 01:15:51 - INFO - __main__ - Step 28841: {'lr': 0.0004605113817179255, 'samples': 5537472, 'steps': 28840, 'loss/train': 1.748841404914856} 11/07/2021 01:15:51 - INFO - __main__ - Step 28842: {'lr': 0.00046050851917830884, 'samples': 5537664, 'steps': 28841, 'loss/train': 1.69772207736969} 11/07/2021 01:15:52 - INFO - __main__ - Step 28843: {'lr': 0.00046050565654384023, 'samples': 5537856, 'steps': 28842, 'loss/train': 1.5625964403152466} 11/07/2021 01:15:53 - INFO - __main__ - Step 28844: {'lr': 0.0004605027938145211, 'samples': 5538048, 'steps': 28843, 'loss/train': 0.6934532523155212} 11/07/2021 01:15:53 - INFO - __main__ - Step 28845: {'lr': 0.0004604999309903526, 'samples': 5538240, 'steps': 28844, 'loss/train': 1.4595178365707397} 11/07/2021 01:15:53 - INFO - __main__ - Step 28846: {'lr': 0.0004604970680713362, 'samples': 5538432, 'steps': 28845, 'loss/train': 1.1132322549819946} 11/07/2021 01:15:54 - INFO - __main__ - Step 28847: {'lr': 0.00046049420505747294, 'samples': 5538624, 'steps': 28846, 'loss/train': 1.3030951023101807} 11/07/2021 01:15:54 - INFO - __main__ - Step 28848: {'lr': 0.0004604913419487643, 'samples': 5538816, 'steps': 28847, 'loss/train': 1.7613273859024048} 11/07/2021 01:15:55 - INFO - __main__ - Step 28849: {'lr': 0.00046048847874521144, 'samples': 5539008, 'steps': 28848, 'loss/train': 1.3294694423675537} 11/07/2021 01:15:56 - INFO - __main__ - Step 28850: {'lr': 0.00046048561544681575, 'samples': 5539200, 'steps': 28849, 'loss/train': 1.4349967241287231} 11/07/2021 01:15:56 - INFO - __main__ - Step 28851: {'lr': 0.00046048275205357855, 'samples': 5539392, 'steps': 28850, 'loss/train': 0.1284380406141281} 11/07/2021 01:15:56 - INFO - __main__ - Step 28852: {'lr': 0.00046047988856550104, 'samples': 5539584, 'steps': 28851, 'loss/train': 1.5595612525939941} 11/07/2021 01:15:57 - INFO - __main__ - Step 28853: {'lr': 0.00046047702498258446, 'samples': 5539776, 'steps': 28852, 'loss/train': 1.5112496614456177} 11/07/2021 01:15:58 - INFO - __main__ - Step 28854: {'lr': 0.00046047416130483033, 'samples': 5539968, 'steps': 28853, 'loss/train': 1.659077525138855} 11/07/2021 01:15:58 - INFO - __main__ - Step 28855: {'lr': 0.00046047129753223973, 'samples': 5540160, 'steps': 28854, 'loss/train': 1.7341511249542236} 11/07/2021 01:15:59 - INFO - __main__ - Step 28856: {'lr': 0.0004604684336648139, 'samples': 5540352, 'steps': 28855, 'loss/train': 1.5935263633728027} 11/07/2021 01:15:59 - INFO - __main__ - Step 28857: {'lr': 0.00046046556970255435, 'samples': 5540544, 'steps': 28856, 'loss/train': 1.6662061214447021} 11/07/2021 01:15:59 - INFO - __main__ - Step 28858: {'lr': 0.0004604627056454622, 'samples': 5540736, 'steps': 28857, 'loss/train': 1.6770565509796143} 11/07/2021 01:16:00 - INFO - __main__ - Step 28859: {'lr': 0.00046045984149353894, 'samples': 5540928, 'steps': 28858, 'loss/train': 1.8274426460266113} 11/07/2021 01:16:01 - INFO - __main__ - Step 28860: {'lr': 0.0004604569772467856, 'samples': 5541120, 'steps': 28859, 'loss/train': 1.324919581413269} 11/07/2021 01:16:01 - INFO - __main__ - Step 28861: {'lr': 0.00046045411290520364, 'samples': 5541312, 'steps': 28860, 'loss/train': 1.090241551399231} 11/07/2021 01:16:01 - INFO - __main__ - Step 28862: {'lr': 0.00046045124846879427, 'samples': 5541504, 'steps': 28861, 'loss/train': 1.5456140041351318} 11/07/2021 01:16:02 - INFO - __main__ - Step 28863: {'lr': 0.00046044838393755885, 'samples': 5541696, 'steps': 28862, 'loss/train': 1.5965884923934937} 11/07/2021 01:16:02 - INFO - __main__ - Step 28864: {'lr': 0.00046044551931149856, 'samples': 5541888, 'steps': 28863, 'loss/train': 1.7188166379928589} 11/07/2021 01:16:03 - INFO - __main__ - Step 28865: {'lr': 0.0004604426545906149, 'samples': 5542080, 'steps': 28864, 'loss/train': 1.2183637619018555} 11/07/2021 01:16:03 - INFO - __main__ - Step 28866: {'lr': 0.0004604397897749089, 'samples': 5542272, 'steps': 28865, 'loss/train': 0.32164669036865234} 11/07/2021 01:16:04 - INFO - __main__ - Step 28867: {'lr': 0.00046043692486438207, 'samples': 5542464, 'steps': 28866, 'loss/train': 1.7163790464401245} 11/07/2021 01:16:04 - INFO - __main__ - Step 28868: {'lr': 0.00046043405985903555, 'samples': 5542656, 'steps': 28867, 'loss/train': 1.8655368089675903} 11/07/2021 01:16:04 - INFO - __main__ - Step 28869: {'lr': 0.00046043119475887073, 'samples': 5542848, 'steps': 28868, 'loss/train': 1.7012056112289429} 11/07/2021 01:16:06 - INFO - __main__ - Step 28870: {'lr': 0.0004604283295638888, 'samples': 5543040, 'steps': 28869, 'loss/train': 3.340463638305664} 11/07/2021 01:16:06 - INFO - __main__ - Step 28871: {'lr': 0.00046042546427409116, 'samples': 5543232, 'steps': 28870, 'loss/train': 1.3275420665740967} 11/07/2021 01:16:06 - INFO - __main__ - Step 28872: {'lr': 0.000460422598889479, 'samples': 5543424, 'steps': 28871, 'loss/train': 1.4772756099700928} 11/07/2021 01:16:07 - INFO - __main__ - Step 28873: {'lr': 0.0004604197334100537, 'samples': 5543616, 'steps': 28872, 'loss/train': 1.8350965976715088} 11/07/2021 01:16:07 - INFO - __main__ - Step 28874: {'lr': 0.0004604168678358166, 'samples': 5543808, 'steps': 28873, 'loss/train': 1.100998044013977} 11/07/2021 01:16:08 - INFO - __main__ - Step 28875: {'lr': 0.00046041400216676874, 'samples': 5544000, 'steps': 28874, 'loss/train': 1.4697104692459106} 11/07/2021 01:16:08 - INFO - __main__ - Step 28876: {'lr': 0.0004604111364029118, 'samples': 5544192, 'steps': 28875, 'loss/train': 1.0781641006469727} 11/07/2021 01:16:09 - INFO - __main__ - Step 28877: {'lr': 0.0004604082705442466, 'samples': 5544384, 'steps': 28876, 'loss/train': 1.4803532361984253} 11/07/2021 01:16:09 - INFO - __main__ - Step 28878: {'lr': 0.00046040540459077483, 'samples': 5544576, 'steps': 28877, 'loss/train': 1.636860966682434} 11/07/2021 01:16:09 - INFO - __main__ - Step 28879: {'lr': 0.0004604025385424976, 'samples': 5544768, 'steps': 28878, 'loss/train': 1.2803477048873901} 11/07/2021 01:16:10 - INFO - __main__ - Step 28880: {'lr': 0.00046039967239941626, 'samples': 5544960, 'steps': 28879, 'loss/train': 1.3108853101730347} 11/07/2021 01:16:11 - INFO - __main__ - Step 28881: {'lr': 0.000460396806161532, 'samples': 5545152, 'steps': 28880, 'loss/train': 1.2313545942306519} 11/07/2021 01:16:11 - INFO - __main__ - Step 28882: {'lr': 0.0004603939398288463, 'samples': 5545344, 'steps': 28881, 'loss/train': 1.4944992065429688} 11/07/2021 01:16:11 - INFO - __main__ - Step 28883: {'lr': 0.00046039107340136023, 'samples': 5545536, 'steps': 28882, 'loss/train': 1.5530414581298828} 11/07/2021 01:16:12 - INFO - __main__ - Step 28884: {'lr': 0.00046038820687907523, 'samples': 5545728, 'steps': 28883, 'loss/train': 1.3977794647216797} 11/07/2021 01:16:13 - INFO - __main__ - Step 28885: {'lr': 0.0004603853402619925, 'samples': 5545920, 'steps': 28884, 'loss/train': 1.6549947261810303} 11/07/2021 01:16:13 - INFO - __main__ - Step 28886: {'lr': 0.00046038247355011347, 'samples': 5546112, 'steps': 28885, 'loss/train': 1.4465737342834473} 11/07/2021 01:16:14 - INFO - __main__ - Step 28887: {'lr': 0.00046037960674343925, 'samples': 5546304, 'steps': 28886, 'loss/train': 1.5918290615081787} 11/07/2021 01:16:14 - INFO - __main__ - Step 28888: {'lr': 0.0004603767398419713, 'samples': 5546496, 'steps': 28887, 'loss/train': 1.016764760017395} 11/07/2021 01:16:14 - INFO - __main__ - Step 28889: {'lr': 0.0004603738728457109, 'samples': 5546688, 'steps': 28888, 'loss/train': 1.5167356729507446} 11/07/2021 01:16:15 - INFO - __main__ - Step 28890: {'lr': 0.0004603710057546592, 'samples': 5546880, 'steps': 28889, 'loss/train': 1.4391789436340332} 11/07/2021 01:16:16 - INFO - __main__ - Step 28891: {'lr': 0.0004603681385688175, 'samples': 5547072, 'steps': 28890, 'loss/train': 1.2934634685516357} 11/07/2021 01:16:16 - INFO - __main__ - Step 28892: {'lr': 0.00046036527128818724, 'samples': 5547264, 'steps': 28891, 'loss/train': 1.4205087423324585} 11/07/2021 01:16:16 - INFO - __main__ - Step 28893: {'lr': 0.0004603624039127696, 'samples': 5547456, 'steps': 28892, 'loss/train': 2.161067008972168} 11/07/2021 01:16:17 - INFO - __main__ - Step 28894: {'lr': 0.00046035953644256596, 'samples': 5547648, 'steps': 28893, 'loss/train': 1.1943391561508179} 11/07/2021 01:16:18 - INFO - __main__ - Step 28895: {'lr': 0.00046035666887757755, 'samples': 5547840, 'steps': 28894, 'loss/train': 1.3796454668045044} 11/07/2021 01:16:18 - INFO - __main__ - Step 28896: {'lr': 0.00046035380121780563, 'samples': 5548032, 'steps': 28895, 'loss/train': 1.6449450254440308} 11/07/2021 01:16:18 - INFO - __main__ - Step 28897: {'lr': 0.0004603509334632515, 'samples': 5548224, 'steps': 28896, 'loss/train': 1.3734986782073975} 11/07/2021 01:16:19 - INFO - __main__ - Step 28898: {'lr': 0.00046034806561391655, 'samples': 5548416, 'steps': 28897, 'loss/train': 1.2445614337921143} 11/07/2021 01:16:19 - INFO - __main__ - Step 28899: {'lr': 0.000460345197669802, 'samples': 5548608, 'steps': 28898, 'loss/train': 1.8080830574035645} 11/07/2021 01:16:20 - INFO - __main__ - Step 28900: {'lr': 0.0004603423296309092, 'samples': 5548800, 'steps': 28899, 'loss/train': 1.0341877937316895} 11/07/2021 01:16:21 - INFO - __main__ - Step 28901: {'lr': 0.0004603394614972393, 'samples': 5548992, 'steps': 28900, 'loss/train': 0.27347704768180847} 11/07/2021 01:16:21 - INFO - __main__ - Step 28902: {'lr': 0.00046033659326879373, 'samples': 5549184, 'steps': 28901, 'loss/train': 1.4428738355636597} 11/07/2021 01:16:21 - INFO - __main__ - Step 28903: {'lr': 0.00046033372494557373, 'samples': 5549376, 'steps': 28902, 'loss/train': 0.5731782913208008} 11/07/2021 01:16:22 - INFO - __main__ - Step 28904: {'lr': 0.00046033085652758053, 'samples': 5549568, 'steps': 28903, 'loss/train': 1.4512176513671875} 11/07/2021 01:16:22 - INFO - __main__ - Step 28905: {'lr': 0.00046032798801481564, 'samples': 5549760, 'steps': 28904, 'loss/train': 1.1815423965454102} 11/07/2021 01:16:23 - INFO - __main__ - Step 28906: {'lr': 0.0004603251194072801, 'samples': 5549952, 'steps': 28905, 'loss/train': 1.6696475744247437} 11/07/2021 01:16:23 - INFO - __main__ - Step 28907: {'lr': 0.0004603222507049754, 'samples': 5550144, 'steps': 28906, 'loss/train': 1.1920045614242554} 11/07/2021 01:16:24 - INFO - __main__ - Step 28908: {'lr': 0.00046031938190790254, 'samples': 5550336, 'steps': 28907, 'loss/train': 1.7647217512130737} 11/07/2021 01:16:24 - INFO - __main__ - Step 28909: {'lr': 0.0004603165130160633, 'samples': 5550528, 'steps': 28908, 'loss/train': 0.8402041792869568} 11/07/2021 01:16:24 - INFO - __main__ - Step 28910: {'lr': 0.0004603136440294584, 'samples': 5550720, 'steps': 28909, 'loss/train': 1.6364444494247437} 11/07/2021 01:16:25 - INFO - __main__ - Step 28911: {'lr': 0.0004603107749480896, 'samples': 5550912, 'steps': 28910, 'loss/train': 2.21443510055542} 11/07/2021 01:16:26 - INFO - __main__ - Step 28912: {'lr': 0.0004603079057719579, 'samples': 5551104, 'steps': 28911, 'loss/train': 1.697066068649292} 11/07/2021 01:16:26 - INFO - __main__ - Step 28913: {'lr': 0.0004603050365010648, 'samples': 5551296, 'steps': 28912, 'loss/train': 1.8834847211837769} 11/07/2021 01:16:26 - INFO - __main__ - Step 28914: {'lr': 0.00046030216713541147, 'samples': 5551488, 'steps': 28913, 'loss/train': 0.9854968786239624} 11/07/2021 01:16:27 - INFO - __main__ - Step 28915: {'lr': 0.00046029929767499924, 'samples': 5551680, 'steps': 28914, 'loss/train': 1.711262822151184} 11/07/2021 01:16:28 - INFO - __main__ - Step 28916: {'lr': 0.0004602964281198293, 'samples': 5551872, 'steps': 28915, 'loss/train': 1.4778679609298706} 11/07/2021 01:16:28 - INFO - __main__ - Step 28917: {'lr': 0.0004602935584699031, 'samples': 5552064, 'steps': 28916, 'loss/train': 1.2499537467956543} 11/07/2021 01:16:29 - INFO - __main__ - Step 28918: {'lr': 0.00046029068872522185, 'samples': 5552256, 'steps': 28917, 'loss/train': 1.4326282739639282} 11/07/2021 01:16:29 - INFO - __main__ - Step 28919: {'lr': 0.0004602878188857869, 'samples': 5552448, 'steps': 28918, 'loss/train': 1.5676058530807495} 11/07/2021 01:16:29 - INFO - __main__ - Step 28920: {'lr': 0.0004602849489515995, 'samples': 5552640, 'steps': 28919, 'loss/train': 1.5974217653274536} 11/07/2021 01:16:30 - INFO - __main__ - Step 28921: {'lr': 0.00046028207892266095, 'samples': 5552832, 'steps': 28920, 'loss/train': 1.2199342250823975} 11/07/2021 01:16:31 - INFO - __main__ - Step 28922: {'lr': 0.00046027920879897243, 'samples': 5553024, 'steps': 28921, 'loss/train': 1.4522511959075928} 11/07/2021 01:16:31 - INFO - __main__ - Step 28923: {'lr': 0.00046027633858053554, 'samples': 5553216, 'steps': 28922, 'loss/train': 2.157140016555786} 11/07/2021 01:16:31 - INFO - __main__ - Step 28924: {'lr': 0.0004602734682673512, 'samples': 5553408, 'steps': 28923, 'loss/train': 1.2557610273361206} 11/07/2021 01:16:32 - INFO - __main__ - Step 28925: {'lr': 0.0004602705978594209, 'samples': 5553600, 'steps': 28924, 'loss/train': 1.552826166152954} 11/07/2021 01:16:33 - INFO - __main__ - Step 28926: {'lr': 0.00046026772735674606, 'samples': 5553792, 'steps': 28925, 'loss/train': 1.3594396114349365} 11/07/2021 01:16:33 - INFO - __main__ - Step 28927: {'lr': 0.00046026485675932765, 'samples': 5553984, 'steps': 28926, 'loss/train': 1.2299970388412476} 11/07/2021 01:16:33 - INFO - __main__ - Step 28928: {'lr': 0.0004602619860671672, 'samples': 5554176, 'steps': 28927, 'loss/train': 1.5360386371612549} 11/07/2021 01:16:34 - INFO - __main__ - Step 28929: {'lr': 0.000460259115280266, 'samples': 5554368, 'steps': 28928, 'loss/train': 1.3246605396270752} 11/07/2021 01:16:34 - INFO - __main__ - Step 28930: {'lr': 0.00046025624439862523, 'samples': 5554560, 'steps': 28929, 'loss/train': 1.4679009914398193} 11/07/2021 01:16:35 - INFO - __main__ - Step 28931: {'lr': 0.0004602533734222463, 'samples': 5554752, 'steps': 28930, 'loss/train': 1.6778373718261719} 11/07/2021 01:16:35 - INFO - __main__ - Step 28932: {'lr': 0.00046025050235113036, 'samples': 5554944, 'steps': 28931, 'loss/train': 1.3062094449996948} 11/07/2021 01:16:36 - INFO - __main__ - Step 28933: {'lr': 0.00046024763118527885, 'samples': 5555136, 'steps': 28932, 'loss/train': 1.3774343729019165} 11/07/2021 01:16:36 - INFO - __main__ - Step 28934: {'lr': 0.00046024475992469295, 'samples': 5555328, 'steps': 28933, 'loss/train': 1.7465287446975708} 11/07/2021 01:16:36 - INFO - __main__ - Step 28935: {'lr': 0.0004602418885693741, 'samples': 5555520, 'steps': 28934, 'loss/train': 1.3427748680114746} 11/07/2021 01:16:38 - INFO - __main__ - Step 28936: {'lr': 0.0004602390171193234, 'samples': 5555712, 'steps': 28935, 'loss/train': 1.4913746118545532} 11/07/2021 01:16:38 - INFO - __main__ - Step 28937: {'lr': 0.0004602361455745423, 'samples': 5555904, 'steps': 28936, 'loss/train': 1.596330165863037} 11/07/2021 01:16:38 - INFO - __main__ - Step 28938: {'lr': 0.000460233273935032, 'samples': 5556096, 'steps': 28937, 'loss/train': 1.6190026998519897} 11/07/2021 01:16:39 - INFO - __main__ - Step 28939: {'lr': 0.00046023040220079383, 'samples': 5556288, 'steps': 28938, 'loss/train': 0.8826161026954651} 11/07/2021 01:16:39 - INFO - __main__ - Step 28940: {'lr': 0.00046022753037182915, 'samples': 5556480, 'steps': 28939, 'loss/train': 1.280436635017395} 11/07/2021 01:16:39 - INFO - __main__ - Step 28941: {'lr': 0.0004602246584481391, 'samples': 5556672, 'steps': 28940, 'loss/train': 1.4741681814193726} 11/07/2021 01:16:41 - INFO - __main__ - Step 28942: {'lr': 0.00046022178642972513, 'samples': 5556864, 'steps': 28941, 'loss/train': 1.4609594345092773} 11/07/2021 01:16:41 - INFO - __main__ - Step 28943: {'lr': 0.00046021891431658845, 'samples': 5557056, 'steps': 28942, 'loss/train': 1.2200900316238403} 11/07/2021 01:16:41 - INFO - __main__ - Step 28944: {'lr': 0.00046021604210873035, 'samples': 5557248, 'steps': 28943, 'loss/train': 1.5990513563156128} 11/07/2021 01:16:42 - INFO - __main__ - Step 28945: {'lr': 0.0004602131698061521, 'samples': 5557440, 'steps': 28944, 'loss/train': 2.0704479217529297} 11/07/2021 01:16:42 - INFO - __main__ - Step 28946: {'lr': 0.0004602102974088551, 'samples': 5557632, 'steps': 28945, 'loss/train': 1.6211740970611572} 11/07/2021 01:16:42 - INFO - __main__ - Step 28947: {'lr': 0.00046020742491684067, 'samples': 5557824, 'steps': 28946, 'loss/train': 1.4822145700454712} 11/07/2021 01:16:45 - INFO - __main__ - Step 28948: {'lr': 0.0004602045523301099, 'samples': 5558016, 'steps': 28947, 'loss/train': 0.5823309421539307} 11/07/2021 01:16:45 - INFO - __main__ - Step 28949: {'lr': 0.0004602016796486642, 'samples': 5558208, 'steps': 28948, 'loss/train': 1.7212554216384888} 11/07/2021 01:16:45 - INFO - __main__ - Step 28950: {'lr': 0.00046019880687250494, 'samples': 5558400, 'steps': 28949, 'loss/train': 1.8272355794906616} 11/07/2021 01:16:46 - INFO - __main__ - Step 28951: {'lr': 0.0004601959340016333, 'samples': 5558592, 'steps': 28950, 'loss/train': 1.1154563426971436} 11/07/2021 01:16:46 - INFO - __main__ - Step 28952: {'lr': 0.0004601930610360506, 'samples': 5558784, 'steps': 28951, 'loss/train': 1.3865123987197876} 11/07/2021 01:16:47 - INFO - __main__ - Step 28953: {'lr': 0.0004601901879757582, 'samples': 5558976, 'steps': 28952, 'loss/train': 1.8884626626968384} 11/07/2021 01:16:47 - INFO - __main__ - Step 28954: {'lr': 0.0004601873148207573, 'samples': 5559168, 'steps': 28953, 'loss/train': 1.8347766399383545} 11/07/2021 01:16:47 - INFO - __main__ - Step 28955: {'lr': 0.00046018444157104924, 'samples': 5559360, 'steps': 28954, 'loss/train': 1.8239595890045166} 11/07/2021 01:16:48 - INFO - __main__ - Step 28956: {'lr': 0.0004601815682266353, 'samples': 5559552, 'steps': 28955, 'loss/train': 1.7957842350006104} 11/07/2021 01:16:49 - INFO - __main__ - Step 28957: {'lr': 0.00046017869478751685, 'samples': 5559744, 'steps': 28956, 'loss/train': 2.1072323322296143} 11/07/2021 01:16:49 - INFO - __main__ - Step 28958: {'lr': 0.00046017582125369505, 'samples': 5559936, 'steps': 28957, 'loss/train': 1.7136385440826416} 11/07/2021 01:16:49 - INFO - __main__ - Step 28959: {'lr': 0.00046017294762517127, 'samples': 5560128, 'steps': 28958, 'loss/train': 1.5811877250671387} 11/07/2021 01:16:50 - INFO - __main__ - Step 28960: {'lr': 0.0004601700739019469, 'samples': 5560320, 'steps': 28959, 'loss/train': 1.454060673713684} 11/07/2021 01:16:51 - INFO - __main__ - Step 28961: {'lr': 0.000460167200084023, 'samples': 5560512, 'steps': 28960, 'loss/train': 1.2183918952941895} 11/07/2021 01:16:51 - INFO - __main__ - Step 28962: {'lr': 0.00046016432617140113, 'samples': 5560704, 'steps': 28961, 'loss/train': 1.2029508352279663} 11/07/2021 01:16:52 - INFO - __main__ - Step 28963: {'lr': 0.0004601614521640824, 'samples': 5560896, 'steps': 28962, 'loss/train': 0.5521527528762817} 11/07/2021 01:16:52 - INFO - __main__ - Step 28964: {'lr': 0.00046015857806206816, 'samples': 5561088, 'steps': 28963, 'loss/train': 1.4131869077682495} 11/07/2021 01:16:52 - INFO - __main__ - Step 28965: {'lr': 0.0004601557038653597, 'samples': 5561280, 'steps': 28964, 'loss/train': 1.6671829223632812} 11/07/2021 01:16:54 - INFO - __main__ - Step 28966: {'lr': 0.0004601528295739583, 'samples': 5561472, 'steps': 28965, 'loss/train': 1.7216248512268066} 11/07/2021 01:16:54 - INFO - __main__ - Step 28967: {'lr': 0.00046014995518786536, 'samples': 5561664, 'steps': 28966, 'loss/train': 1.4837855100631714} 11/07/2021 01:16:54 - INFO - __main__ - Step 28968: {'lr': 0.000460147080707082, 'samples': 5561856, 'steps': 28967, 'loss/train': 1.7088741064071655} 11/07/2021 01:16:55 - INFO - __main__ - Step 28969: {'lr': 0.00046014420613160967, 'samples': 5562048, 'steps': 28968, 'loss/train': 1.698925256729126} 11/07/2021 01:16:55 - INFO - __main__ - Step 28970: {'lr': 0.00046014133146144966, 'samples': 5562240, 'steps': 28969, 'loss/train': 2.0839052200317383} 11/07/2021 01:16:55 - INFO - __main__ - Step 28971: {'lr': 0.0004601384566966031, 'samples': 5562432, 'steps': 28970, 'loss/train': 2.054642915725708} 11/07/2021 01:16:57 - INFO - __main__ - Step 28972: {'lr': 0.0004601355818370714, 'samples': 5562624, 'steps': 28971, 'loss/train': 1.8072031736373901} 11/07/2021 01:16:57 - INFO - __main__ - Step 28973: {'lr': 0.0004601327068828559, 'samples': 5562816, 'steps': 28972, 'loss/train': 1.22524893283844} 11/07/2021 01:16:57 - INFO - __main__ - Step 28974: {'lr': 0.0004601298318339578, 'samples': 5563008, 'steps': 28973, 'loss/train': 1.5835840702056885} 11/07/2021 01:16:58 - INFO - __main__ - Step 28975: {'lr': 0.0004601269566903785, 'samples': 5563200, 'steps': 28974, 'loss/train': 1.7203696966171265} 11/07/2021 01:16:58 - INFO - __main__ - Step 28976: {'lr': 0.0004601240814521192, 'samples': 5563392, 'steps': 28975, 'loss/train': 1.3847191333770752} 11/07/2021 01:16:59 - INFO - __main__ - Step 28977: {'lr': 0.00046012120611918126, 'samples': 5563584, 'steps': 28976, 'loss/train': 1.4361244440078735} 11/07/2021 01:16:59 - INFO - __main__ - Step 28978: {'lr': 0.0004601183306915659, 'samples': 5563776, 'steps': 28977, 'loss/train': 1.4285329580307007} 11/07/2021 01:17:00 - INFO - __main__ - Step 28979: {'lr': 0.0004601154551692745, 'samples': 5563968, 'steps': 28978, 'loss/train': 1.5221688747406006} 11/07/2021 01:17:00 - INFO - __main__ - Step 28980: {'lr': 0.00046011257955230826, 'samples': 5564160, 'steps': 28979, 'loss/train': 1.8839620351791382} 11/07/2021 01:17:00 - INFO - __main__ - Step 28981: {'lr': 0.00046010970384066863, 'samples': 5564352, 'steps': 28980, 'loss/train': 1.4196536540985107} 11/07/2021 01:17:02 - INFO - __main__ - Step 28982: {'lr': 0.00046010682803435674, 'samples': 5564544, 'steps': 28981, 'loss/train': 1.8078101873397827} 11/07/2021 01:17:02 - INFO - __main__ - Step 28983: {'lr': 0.000460103952133374, 'samples': 5564736, 'steps': 28982, 'loss/train': 1.357619285583496} 11/07/2021 01:17:02 - INFO - __main__ - Step 28984: {'lr': 0.00046010107613772154, 'samples': 5564928, 'steps': 28983, 'loss/train': 1.4675019979476929} 11/07/2021 01:17:03 - INFO - __main__ - Step 28985: {'lr': 0.0004600982000474009, 'samples': 5565120, 'steps': 28984, 'loss/train': 1.3713990449905396} 11/07/2021 01:17:03 - INFO - __main__ - Step 28986: {'lr': 0.0004600953238624133, 'samples': 5565312, 'steps': 28985, 'loss/train': 1.625765323638916} 11/07/2021 01:17:04 - INFO - __main__ - Step 28987: {'lr': 0.00046009244758275986, 'samples': 5565504, 'steps': 28986, 'loss/train': 1.4445399045944214} 11/07/2021 01:17:04 - INFO - __main__ - Step 28988: {'lr': 0.0004600895712084421, 'samples': 5565696, 'steps': 28987, 'loss/train': 2.432183265686035} 11/07/2021 01:17:05 - INFO - __main__ - Step 28989: {'lr': 0.0004600866947394611, 'samples': 5565888, 'steps': 28988, 'loss/train': 1.3554515838623047} 11/07/2021 01:17:05 - INFO - __main__ - Step 28990: {'lr': 0.0004600838181758184, 'samples': 5566080, 'steps': 28989, 'loss/train': 0.8822077512741089} 11/07/2021 01:17:05 - INFO - __main__ - Step 28991: {'lr': 0.00046008094151751513, 'samples': 5566272, 'steps': 28990, 'loss/train': 0.9555196166038513} 11/07/2021 01:17:07 - INFO - __main__ - Step 28992: {'lr': 0.0004600780647645526, 'samples': 5566464, 'steps': 28991, 'loss/train': 1.7758179903030396} 11/07/2021 01:17:07 - INFO - __main__ - Step 28993: {'lr': 0.0004600751879169321, 'samples': 5566656, 'steps': 28992, 'loss/train': 1.2778682708740234} 11/07/2021 01:17:07 - INFO - __main__ - Step 28994: {'lr': 0.00046007231097465505, 'samples': 5566848, 'steps': 28993, 'loss/train': 1.7804596424102783} 11/07/2021 01:17:08 - INFO - __main__ - Step 28995: {'lr': 0.00046006943393772274, 'samples': 5567040, 'steps': 28994, 'loss/train': 1.4568381309509277} 11/07/2021 01:17:08 - INFO - __main__ - Step 28996: {'lr': 0.00046006655680613616, 'samples': 5567232, 'steps': 28995, 'loss/train': 1.5894285440444946} 11/07/2021 01:17:08 - INFO - __main__ - Step 28997: {'lr': 0.00046006367957989705, 'samples': 5567424, 'steps': 28996, 'loss/train': 1.3978854417800903} 11/07/2021 01:17:09 - INFO - __main__ - Step 28998: {'lr': 0.0004600608022590064, 'samples': 5567616, 'steps': 28997, 'loss/train': 1.5449564456939697} 11/07/2021 01:17:10 - INFO - __main__ - Step 28999: {'lr': 0.0004600579248434655, 'samples': 5567808, 'steps': 28998, 'loss/train': 1.7216355800628662} 11/07/2021 01:17:10 - INFO - __main__ - Step 29000: {'lr': 0.0004600550473332759, 'samples': 5568000, 'steps': 28999, 'loss/train': 1.7568962574005127} 11/07/2021 01:17:10 - INFO - __main__ - Step 29001: {'lr': 0.0004600521697284386, 'samples': 5568192, 'steps': 29000, 'loss/train': 1.4287441968917847} 11/07/2021 01:17:11 - INFO - __main__ - Step 29002: {'lr': 0.0004600492920289551, 'samples': 5568384, 'steps': 29001, 'loss/train': 2.267775535583496} 11/07/2021 01:17:12 - INFO - __main__ - Step 29003: {'lr': 0.00046004641423482665, 'samples': 5568576, 'steps': 29002, 'loss/train': 1.5189000368118286} 11/07/2021 01:17:12 - INFO - __main__ - Step 29004: {'lr': 0.00046004353634605447, 'samples': 5568768, 'steps': 29003, 'loss/train': 1.6470844745635986} 11/07/2021 01:17:12 - INFO - __main__ - Step 29005: {'lr': 0.00046004065836263995, 'samples': 5568960, 'steps': 29004, 'loss/train': 1.8419655561447144} 11/07/2021 01:17:13 - INFO - __main__ - Step 29006: {'lr': 0.00046003778028458434, 'samples': 5569152, 'steps': 29005, 'loss/train': 1.6046967506408691} 11/07/2021 01:17:13 - INFO - __main__ - Step 29007: {'lr': 0.00046003490211188894, 'samples': 5569344, 'steps': 29006, 'loss/train': 1.4088990688323975} 11/07/2021 01:17:14 - INFO - __main__ - Step 29008: {'lr': 0.00046003202384455505, 'samples': 5569536, 'steps': 29007, 'loss/train': 1.0364364385604858} 11/07/2021 01:17:15 - INFO - __main__ - Step 29009: {'lr': 0.000460029145482584, 'samples': 5569728, 'steps': 29008, 'loss/train': 1.6944390535354614} 11/07/2021 01:17:15 - INFO - __main__ - Step 29010: {'lr': 0.00046002626702597706, 'samples': 5569920, 'steps': 29009, 'loss/train': 1.6687331199645996} 11/07/2021 01:17:15 - INFO - __main__ - Step 29011: {'lr': 0.00046002338847473545, 'samples': 5570112, 'steps': 29010, 'loss/train': 1.7623625993728638} 11/07/2021 01:17:16 - INFO - __main__ - Step 29012: {'lr': 0.0004600205098288606, 'samples': 5570304, 'steps': 29011, 'loss/train': 1.7710977792739868} 11/07/2021 01:17:17 - INFO - __main__ - Step 29013: {'lr': 0.00046001763108835384, 'samples': 5570496, 'steps': 29012, 'loss/train': 1.4041374921798706} 11/07/2021 01:17:17 - INFO - __main__ - Step 29014: {'lr': 0.0004600147522532162, 'samples': 5570688, 'steps': 29013, 'loss/train': 1.6077314615249634} 11/07/2021 01:17:17 - INFO - __main__ - Step 29015: {'lr': 0.0004600118733234493, 'samples': 5570880, 'steps': 29014, 'loss/train': 1.8222843408584595} 11/07/2021 01:17:18 - INFO - __main__ - Step 29016: {'lr': 0.0004600089942990542, 'samples': 5571072, 'steps': 29015, 'loss/train': 1.5851895809173584} 11/07/2021 01:17:18 - INFO - __main__ - Step 29017: {'lr': 0.00046000611518003234, 'samples': 5571264, 'steps': 29016, 'loss/train': 1.1701395511627197} 11/07/2021 01:17:19 - INFO - __main__ - Step 29018: {'lr': 0.00046000323596638495, 'samples': 5571456, 'steps': 29017, 'loss/train': 2.0442075729370117} 11/07/2021 01:17:19 - INFO - __main__ - Step 29019: {'lr': 0.0004600003566581133, 'samples': 5571648, 'steps': 29018, 'loss/train': 1.799600601196289} 11/07/2021 01:17:20 - INFO - __main__ - Step 29020: {'lr': 0.00045999747725521876, 'samples': 5571840, 'steps': 29019, 'loss/train': 1.7369462251663208} 11/07/2021 01:17:20 - INFO - __main__ - Step 29021: {'lr': 0.0004599945977577026, 'samples': 5572032, 'steps': 29020, 'loss/train': 1.4682435989379883} 11/07/2021 01:17:21 - INFO - __main__ - Step 29022: {'lr': 0.0004599917181655661, 'samples': 5572224, 'steps': 29021, 'loss/train': 1.2437880039215088} 11/07/2021 01:17:21 - INFO - __main__ - Step 29023: {'lr': 0.00045998883847881057, 'samples': 5572416, 'steps': 29022, 'loss/train': 1.7971512079238892} 11/07/2021 01:17:22 - INFO - __main__ - Step 29024: {'lr': 0.00045998595869743735, 'samples': 5572608, 'steps': 29023, 'loss/train': 1.206907868385315} 11/07/2021 01:17:22 - INFO - __main__ - Step 29025: {'lr': 0.0004599830788214477, 'samples': 5572800, 'steps': 29024, 'loss/train': 0.9922259449958801} 11/07/2021 01:17:23 - INFO - __main__ - Step 29026: {'lr': 0.0004599801988508429, 'samples': 5572992, 'steps': 29025, 'loss/train': 1.8546106815338135} 11/07/2021 01:17:23 - INFO - __main__ - Step 29027: {'lr': 0.00045997731878562423, 'samples': 5573184, 'steps': 29026, 'loss/train': 1.5244265794754028} 11/07/2021 01:17:24 - INFO - __main__ - Step 29028: {'lr': 0.000459974438625793, 'samples': 5573376, 'steps': 29027, 'loss/train': 2.1775026321411133} 11/07/2021 01:17:24 - INFO - __main__ - Step 29029: {'lr': 0.0004599715583713506, 'samples': 5573568, 'steps': 29028, 'loss/train': 1.9676076173782349} 11/07/2021 01:17:25 - INFO - __main__ - Step 29030: {'lr': 0.00045996867802229824, 'samples': 5573760, 'steps': 29029, 'loss/train': 1.1934735774993896} 11/07/2021 01:17:25 - INFO - __main__ - Step 29031: {'lr': 0.0004599657975786372, 'samples': 5573952, 'steps': 29030, 'loss/train': 1.431510329246521} 11/07/2021 01:17:25 - INFO - __main__ - Step 29032: {'lr': 0.00045996291704036884, 'samples': 5574144, 'steps': 29031, 'loss/train': 1.6111900806427002} 11/07/2021 01:17:27 - INFO - __main__ - Step 29033: {'lr': 0.00045996003640749446, 'samples': 5574336, 'steps': 29032, 'loss/train': 1.6642199754714966} 11/07/2021 01:17:27 - INFO - __main__ - Step 29034: {'lr': 0.0004599571556800153, 'samples': 5574528, 'steps': 29033, 'loss/train': 0.855462372303009} 11/07/2021 01:17:27 - INFO - __main__ - Step 29035: {'lr': 0.00045995427485793263, 'samples': 5574720, 'steps': 29034, 'loss/train': 1.7289525270462036} 11/07/2021 01:17:28 - INFO - __main__ - Step 29036: {'lr': 0.00045995139394124784, 'samples': 5574912, 'steps': 29035, 'loss/train': 1.251185417175293} 11/07/2021 01:17:28 - INFO - __main__ - Step 29037: {'lr': 0.0004599485129299622, 'samples': 5575104, 'steps': 29036, 'loss/train': 1.4728577136993408} 11/07/2021 01:17:29 - INFO - __main__ - Step 29038: {'lr': 0.000459945631824077, 'samples': 5575296, 'steps': 29037, 'loss/train': 1.2936763763427734} 11/07/2021 01:17:29 - INFO - __main__ - Step 29039: {'lr': 0.0004599427506235936, 'samples': 5575488, 'steps': 29038, 'loss/train': 0.8940146565437317} 11/07/2021 01:17:30 - INFO - __main__ - Step 29040: {'lr': 0.0004599398693285132, 'samples': 5575680, 'steps': 29039, 'loss/train': 0.8102036118507385} 11/07/2021 01:17:30 - INFO - __main__ - Step 29041: {'lr': 0.0004599369879388371, 'samples': 5575872, 'steps': 29040, 'loss/train': 0.8322492241859436} 11/07/2021 01:17:30 - INFO - __main__ - Step 29042: {'lr': 0.0004599341064545666, 'samples': 5576064, 'steps': 29041, 'loss/train': 1.413248896598816} 11/07/2021 01:17:31 - INFO - __main__ - Step 29043: {'lr': 0.00045993122487570303, 'samples': 5576256, 'steps': 29042, 'loss/train': 1.6776422262191772} 11/07/2021 01:17:32 - INFO - __main__ - Step 29044: {'lr': 0.00045992834320224773, 'samples': 5576448, 'steps': 29043, 'loss/train': 1.8192821741104126} 11/07/2021 01:17:32 - INFO - __main__ - Step 29045: {'lr': 0.000459925461434202, 'samples': 5576640, 'steps': 29044, 'loss/train': 1.0544776916503906} 11/07/2021 01:17:33 - INFO - __main__ - Step 29046: {'lr': 0.00045992257957156704, 'samples': 5576832, 'steps': 29045, 'loss/train': 1.755774736404419} 11/07/2021 01:17:33 - INFO - __main__ - Step 29047: {'lr': 0.00045991969761434426, 'samples': 5577024, 'steps': 29046, 'loss/train': 1.1235843896865845} 11/07/2021 01:17:33 - INFO - __main__ - Step 29048: {'lr': 0.0004599168155625348, 'samples': 5577216, 'steps': 29047, 'loss/train': 1.1373010873794556} 11/07/2021 01:17:34 - INFO - __main__ - Step 29049: {'lr': 0.00045991393341614017, 'samples': 5577408, 'steps': 29048, 'loss/train': 2.1169443130493164} 11/07/2021 01:17:35 - INFO - __main__ - Step 29050: {'lr': 0.0004599110511751615, 'samples': 5577600, 'steps': 29049, 'loss/train': 1.5747123956680298} 11/07/2021 01:17:35 - INFO - __main__ - Step 29051: {'lr': 0.0004599081688396002, 'samples': 5577792, 'steps': 29050, 'loss/train': 1.0253344774246216} 11/07/2021 01:17:35 - INFO - __main__ - Step 29052: {'lr': 0.0004599052864094575, 'samples': 5577984, 'steps': 29051, 'loss/train': 1.7327574491500854} 11/07/2021 01:17:36 - INFO - __main__ - Step 29053: {'lr': 0.0004599024038847347, 'samples': 5578176, 'steps': 29052, 'loss/train': 1.7626140117645264} 11/07/2021 01:17:37 - INFO - __main__ - Step 29054: {'lr': 0.0004598995212654331, 'samples': 5578368, 'steps': 29053, 'loss/train': 1.4643959999084473} 11/07/2021 01:17:37 - INFO - __main__ - Step 29055: {'lr': 0.0004598966385515541, 'samples': 5578560, 'steps': 29054, 'loss/train': 1.8471664190292358} 11/07/2021 01:17:37 - INFO - __main__ - Step 29056: {'lr': 0.00045989375574309875, 'samples': 5578752, 'steps': 29055, 'loss/train': 2.308751106262207} 11/07/2021 01:17:38 - INFO - __main__ - Step 29057: {'lr': 0.00045989087284006863, 'samples': 5578944, 'steps': 29056, 'loss/train': 1.551805019378662} 11/07/2021 01:17:38 - INFO - __main__ - Step 29058: {'lr': 0.00045988798984246496, 'samples': 5579136, 'steps': 29057, 'loss/train': 1.2563121318817139} 11/07/2021 01:17:39 - INFO - __main__ - Step 29059: {'lr': 0.0004598851067502889, 'samples': 5579328, 'steps': 29058, 'loss/train': 1.0828661918640137} 11/07/2021 01:17:39 - INFO - __main__ - Step 29060: {'lr': 0.00045988222356354186, 'samples': 5579520, 'steps': 29059, 'loss/train': 1.0682960748672485} 11/07/2021 01:17:40 - INFO - __main__ - Step 29061: {'lr': 0.00045987934028222515, 'samples': 5579712, 'steps': 29060, 'loss/train': 1.3602944612503052} 11/07/2021 01:17:40 - INFO - __main__ - Step 29062: {'lr': 0.00045987645690634003, 'samples': 5579904, 'steps': 29061, 'loss/train': 1.555138349533081} 11/07/2021 01:17:41 - INFO - __main__ - Step 29063: {'lr': 0.0004598735734358879, 'samples': 5580096, 'steps': 29062, 'loss/train': 1.8989344835281372} 11/07/2021 01:17:41 - INFO - __main__ - Step 29064: {'lr': 0.0004598706898708699, 'samples': 5580288, 'steps': 29063, 'loss/train': 1.1886340379714966} 11/07/2021 01:17:42 - INFO - __main__ - Step 29065: {'lr': 0.00045986780621128743, 'samples': 5580480, 'steps': 29064, 'loss/train': 1.308013677597046} 11/07/2021 01:17:42 - INFO - __main__ - Step 29066: {'lr': 0.00045986492245714175, 'samples': 5580672, 'steps': 29065, 'loss/train': 1.5125694274902344} 11/07/2021 01:17:43 - INFO - __main__ - Step 29067: {'lr': 0.0004598620386084342, 'samples': 5580864, 'steps': 29066, 'loss/train': 1.6981607675552368} 11/07/2021 01:17:43 - INFO - __main__ - Step 29068: {'lr': 0.00045985915466516605, 'samples': 5581056, 'steps': 29067, 'loss/train': 0.809975802898407} 11/07/2021 01:17:43 - INFO - __main__ - Step 29069: {'lr': 0.0004598562706273386, 'samples': 5581248, 'steps': 29068, 'loss/train': 1.6589301824569702} 11/07/2021 01:17:44 - INFO - __main__ - Step 29070: {'lr': 0.0004598533864949531, 'samples': 5581440, 'steps': 29069, 'loss/train': 1.6808274984359741} 11/07/2021 01:17:45 - INFO - __main__ - Step 29071: {'lr': 0.00045985050226801097, 'samples': 5581632, 'steps': 29070, 'loss/train': 1.6667805910110474} 11/07/2021 01:17:45 - INFO - __main__ - Step 29072: {'lr': 0.0004598476179465134, 'samples': 5581824, 'steps': 29071, 'loss/train': 0.8117663264274597} 11/07/2021 01:17:46 - INFO - __main__ - Step 29073: {'lr': 0.00045984473353046174, 'samples': 5582016, 'steps': 29072, 'loss/train': 1.6126930713653564} 11/07/2021 01:17:46 - INFO - __main__ - Step 29074: {'lr': 0.00045984184901985735, 'samples': 5582208, 'steps': 29073, 'loss/train': 1.5174460411071777} 11/07/2021 01:17:47 - INFO - __main__ - Step 29075: {'lr': 0.00045983896441470143, 'samples': 5582400, 'steps': 29074, 'loss/train': 1.7275384664535522} 11/07/2021 01:17:47 - INFO - __main__ - Step 29076: {'lr': 0.00045983607971499527, 'samples': 5582592, 'steps': 29075, 'loss/train': 1.643984317779541} 11/07/2021 01:17:48 - INFO - __main__ - Step 29077: {'lr': 0.0004598331949207402, 'samples': 5582784, 'steps': 29076, 'loss/train': 1.5335731506347656} 11/07/2021 01:17:48 - INFO - __main__ - Step 29078: {'lr': 0.00045983031003193756, 'samples': 5582976, 'steps': 29077, 'loss/train': 1.7958552837371826} 11/07/2021 01:17:48 - INFO - __main__ - Step 29079: {'lr': 0.0004598274250485886, 'samples': 5583168, 'steps': 29078, 'loss/train': 1.3020613193511963} 11/07/2021 01:17:49 - INFO - __main__ - Step 29080: {'lr': 0.00045982453997069463, 'samples': 5583360, 'steps': 29079, 'loss/train': 1.7859662771224976} 11/07/2021 01:17:50 - INFO - __main__ - Step 29081: {'lr': 0.00045982165479825697, 'samples': 5583552, 'steps': 29080, 'loss/train': 1.3405777215957642} 11/07/2021 01:17:50 - INFO - __main__ - Step 29082: {'lr': 0.000459818769531277, 'samples': 5583744, 'steps': 29081, 'loss/train': 1.7002794742584229} 11/07/2021 01:17:50 - INFO - __main__ - Step 29083: {'lr': 0.00045981588416975583, 'samples': 5583936, 'steps': 29082, 'loss/train': 1.580579161643982} 11/07/2021 01:17:51 - INFO - __main__ - Step 29084: {'lr': 0.00045981299871369484, 'samples': 5584128, 'steps': 29083, 'loss/train': 0.7894611358642578} 11/07/2021 01:17:51 - INFO - __main__ - Step 29085: {'lr': 0.0004598101131630954, 'samples': 5584320, 'steps': 29084, 'loss/train': 1.9372822046279907} 11/07/2021 01:17:52 - INFO - __main__ - Step 29086: {'lr': 0.0004598072275179588, 'samples': 5584512, 'steps': 29085, 'loss/train': 1.543207049369812} 11/07/2021 01:17:52 - INFO - __main__ - Step 29087: {'lr': 0.00045980434177828625, 'samples': 5584704, 'steps': 29086, 'loss/train': 1.4375332593917847} 11/07/2021 01:17:53 - INFO - __main__ - Step 29088: {'lr': 0.00045980145594407907, 'samples': 5584896, 'steps': 29087, 'loss/train': 2.165194511413574} 11/07/2021 01:17:53 - INFO - __main__ - Step 29089: {'lr': 0.00045979857001533867, 'samples': 5585088, 'steps': 29088, 'loss/train': 1.6920965909957886} 11/07/2021 01:17:53 - INFO - __main__ - Step 29090: {'lr': 0.0004597956839920662, 'samples': 5585280, 'steps': 29089, 'loss/train': 0.8672987818717957} 11/07/2021 01:17:54 - INFO - __main__ - Step 29091: {'lr': 0.00045979279787426307, 'samples': 5585472, 'steps': 29090, 'loss/train': 1.85176420211792} 11/07/2021 01:17:55 - INFO - __main__ - Step 29092: {'lr': 0.00045978991166193057, 'samples': 5585664, 'steps': 29091, 'loss/train': 1.672255516052246} 11/07/2021 01:17:55 - INFO - __main__ - Step 29093: {'lr': 0.0004597870253550699, 'samples': 5585856, 'steps': 29092, 'loss/train': 1.3873833417892456} 11/07/2021 01:17:55 - INFO - __main__ - Step 29094: {'lr': 0.0004597841389536825, 'samples': 5586048, 'steps': 29093, 'loss/train': 1.0229157209396362} 11/07/2021 01:17:56 - INFO - __main__ - Step 29095: {'lr': 0.00045978125245776957, 'samples': 5586240, 'steps': 29094, 'loss/train': 1.6391146183013916} 11/07/2021 01:17:57 - INFO - __main__ - Step 29096: {'lr': 0.00045977836586733246, 'samples': 5586432, 'steps': 29095, 'loss/train': 1.6857407093048096} 11/07/2021 01:17:57 - INFO - __main__ - Step 29097: {'lr': 0.00045977547918237243, 'samples': 5586624, 'steps': 29096, 'loss/train': 1.3226008415222168} 11/07/2021 01:17:58 - INFO - __main__ - Step 29098: {'lr': 0.0004597725924028908, 'samples': 5586816, 'steps': 29097, 'loss/train': 1.9238613843917847} 11/07/2021 01:17:58 - INFO - __main__ - Step 29099: {'lr': 0.00045976970552888896, 'samples': 5587008, 'steps': 29098, 'loss/train': 1.1772266626358032} 11/07/2021 01:17:58 - INFO - __main__ - Step 29100: {'lr': 0.00045976681856036805, 'samples': 5587200, 'steps': 29099, 'loss/train': 1.3465580940246582} 11/07/2021 01:17:59 - INFO - __main__ - Step 29101: {'lr': 0.00045976393149732943, 'samples': 5587392, 'steps': 29100, 'loss/train': 1.9874027967453003} 11/07/2021 01:18:00 - INFO - __main__ - Step 29102: {'lr': 0.0004597610443397745, 'samples': 5587584, 'steps': 29101, 'loss/train': 1.2521934509277344} 11/07/2021 01:18:00 - INFO - __main__ - Step 29103: {'lr': 0.0004597581570877044, 'samples': 5587776, 'steps': 29102, 'loss/train': 1.4241652488708496} 11/07/2021 01:18:00 - INFO - __main__ - Step 29104: {'lr': 0.00045975526974112056, 'samples': 5587968, 'steps': 29103, 'loss/train': 2.0824873447418213} 11/07/2021 01:18:01 - INFO - __main__ - Step 29105: {'lr': 0.0004597523823000243, 'samples': 5588160, 'steps': 29104, 'loss/train': 1.5525542497634888} 11/07/2021 01:18:01 - INFO - __main__ - Step 29106: {'lr': 0.0004597494947644167, 'samples': 5588352, 'steps': 29105, 'loss/train': 1.7280526161193848} 11/07/2021 01:18:02 - INFO - __main__ - Step 29107: {'lr': 0.0004597466071342993, 'samples': 5588544, 'steps': 29106, 'loss/train': 0.9949897527694702} 11/07/2021 01:18:03 - INFO - __main__ - Step 29108: {'lr': 0.0004597437194096733, 'samples': 5588736, 'steps': 29107, 'loss/train': 1.6158809661865234} 11/07/2021 01:18:03 - INFO - __main__ - Step 29109: {'lr': 0.00045974083159054, 'samples': 5588928, 'steps': 29108, 'loss/train': 1.43483567237854} 11/07/2021 01:18:03 - INFO - __main__ - Step 29110: {'lr': 0.0004597379436769008, 'samples': 5589120, 'steps': 29109, 'loss/train': 1.5917630195617676} 11/07/2021 01:18:04 - INFO - __main__ - Step 29111: {'lr': 0.00045973505566875684, 'samples': 5589312, 'steps': 29110, 'loss/train': 2.0971126556396484} 11/07/2021 01:18:05 - INFO - __main__ - Step 29112: {'lr': 0.00045973216756610945, 'samples': 5589504, 'steps': 29111, 'loss/train': 1.5469576120376587} 11/07/2021 01:18:05 - INFO - __main__ - Step 29113: {'lr': 0.00045972927936896007, 'samples': 5589696, 'steps': 29112, 'loss/train': 1.5790016651153564} 11/07/2021 01:18:05 - INFO - __main__ - Step 29114: {'lr': 0.0004597263910773099, 'samples': 5589888, 'steps': 29113, 'loss/train': 1.2219035625457764} 11/07/2021 01:18:06 - INFO - __main__ - Step 29115: {'lr': 0.0004597235026911603, 'samples': 5590080, 'steps': 29114, 'loss/train': 1.5206252336502075} 11/07/2021 01:18:06 - INFO - __main__ - Step 29116: {'lr': 0.0004597206142105124, 'samples': 5590272, 'steps': 29115, 'loss/train': 1.6539644002914429} 11/07/2021 01:18:07 - INFO - __main__ - Step 29117: {'lr': 0.0004597177256353677, 'samples': 5590464, 'steps': 29116, 'loss/train': 1.8472036123275757} 11/07/2021 01:18:07 - INFO - __main__ - Step 29118: {'lr': 0.0004597148369657275, 'samples': 5590656, 'steps': 29117, 'loss/train': 1.2005980014801025} 11/07/2021 01:18:08 - INFO - __main__ - Step 29119: {'lr': 0.0004597119482015929, 'samples': 5590848, 'steps': 29118, 'loss/train': 1.5274251699447632} 11/07/2021 01:18:08 - INFO - __main__ - Step 29120: {'lr': 0.00045970905934296537, 'samples': 5591040, 'steps': 29119, 'loss/train': 1.3336411714553833} 11/07/2021 01:18:08 - INFO - __main__ - Step 29121: {'lr': 0.0004597061703898462, 'samples': 5591232, 'steps': 29120, 'loss/train': 1.4341195821762085} 11/07/2021 01:18:10 - INFO - __main__ - Step 29122: {'lr': 0.0004597032813422367, 'samples': 5591424, 'steps': 29121, 'loss/train': 1.2545655965805054} 11/07/2021 01:18:10 - INFO - __main__ - Step 29123: {'lr': 0.00045970039220013804, 'samples': 5591616, 'steps': 29122, 'loss/train': 1.6790838241577148} 11/07/2021 01:18:10 - INFO - __main__ - Step 29124: {'lr': 0.00045969750296355173, 'samples': 5591808, 'steps': 29123, 'loss/train': 1.3595398664474487} 11/07/2021 01:18:11 - INFO - __main__ - Step 29125: {'lr': 0.0004596946136324789, 'samples': 5592000, 'steps': 29124, 'loss/train': 1.5711055994033813} 11/07/2021 01:18:11 - INFO - __main__ - Step 29126: {'lr': 0.0004596917242069209, 'samples': 5592192, 'steps': 29125, 'loss/train': 1.3224055767059326} 11/07/2021 01:18:12 - INFO - __main__ - Step 29127: {'lr': 0.00045968883468687906, 'samples': 5592384, 'steps': 29126, 'loss/train': 1.640031099319458} 11/07/2021 01:18:12 - INFO - __main__ - Step 29128: {'lr': 0.00045968594507235467, 'samples': 5592576, 'steps': 29127, 'loss/train': 1.769572138786316} 11/07/2021 01:18:13 - INFO - __main__ - Step 29129: {'lr': 0.00045968305536334906, 'samples': 5592768, 'steps': 29128, 'loss/train': 1.6369398832321167} 11/07/2021 01:18:13 - INFO - __main__ - Step 29130: {'lr': 0.00045968016555986347, 'samples': 5592960, 'steps': 29129, 'loss/train': 1.5712953805923462} 11/07/2021 01:18:13 - INFO - __main__ - Step 29131: {'lr': 0.0004596772756618992, 'samples': 5593152, 'steps': 29130, 'loss/train': 1.609351634979248} 11/07/2021 01:18:14 - INFO - __main__ - Step 29132: {'lr': 0.0004596743856694576, 'samples': 5593344, 'steps': 29131, 'loss/train': 0.9651497602462769} 11/07/2021 01:18:15 - INFO - __main__ - Step 29133: {'lr': 0.00045967149558254, 'samples': 5593536, 'steps': 29132, 'loss/train': 1.5109275579452515} 11/07/2021 01:18:15 - INFO - __main__ - Step 29134: {'lr': 0.0004596686054011476, 'samples': 5593728, 'steps': 29133, 'loss/train': 1.9324721097946167} 11/07/2021 01:18:15 - INFO - __main__ - Step 29135: {'lr': 0.0004596657151252819, 'samples': 5593920, 'steps': 29134, 'loss/train': 1.8306808471679688} 11/07/2021 01:18:16 - INFO - __main__ - Step 29136: {'lr': 0.0004596628247549439, 'samples': 5594112, 'steps': 29135, 'loss/train': 1.4166126251220703} 11/07/2021 01:18:16 - INFO - __main__ - Step 29137: {'lr': 0.00045965993429013507, 'samples': 5594304, 'steps': 29136, 'loss/train': 1.7979376316070557} 11/07/2021 01:18:17 - INFO - __main__ - Step 29138: {'lr': 0.0004596570437308568, 'samples': 5594496, 'steps': 29137, 'loss/train': 1.7460390329360962} 11/07/2021 01:18:18 - INFO - __main__ - Step 29139: {'lr': 0.0004596541530771103, 'samples': 5594688, 'steps': 29138, 'loss/train': 1.9361435174942017} 11/07/2021 01:18:18 - INFO - __main__ - Step 29140: {'lr': 0.0004596512623288969, 'samples': 5594880, 'steps': 29139, 'loss/train': 1.3820250034332275} 11/07/2021 01:18:18 - INFO - __main__ - Step 29141: {'lr': 0.00045964837148621776, 'samples': 5595072, 'steps': 29140, 'loss/train': 1.4857066869735718} 11/07/2021 01:18:19 - INFO - __main__ - Step 29142: {'lr': 0.00045964548054907434, 'samples': 5595264, 'steps': 29141, 'loss/train': 1.4133851528167725} 11/07/2021 01:18:20 - INFO - __main__ - Step 29143: {'lr': 0.00045964258951746795, 'samples': 5595456, 'steps': 29142, 'loss/train': 2.953083038330078} 11/07/2021 01:18:20 - INFO - __main__ - Step 29144: {'lr': 0.0004596396983913998, 'samples': 5595648, 'steps': 29143, 'loss/train': 1.48224675655365} 11/07/2021 01:18:20 - INFO - __main__ - Step 29145: {'lr': 0.00045963680717087124, 'samples': 5595840, 'steps': 29144, 'loss/train': 1.531014323234558} 11/07/2021 01:18:21 - INFO - __main__ - Step 29146: {'lr': 0.0004596339158558835, 'samples': 5596032, 'steps': 29145, 'loss/train': 1.592858910560608} 11/07/2021 01:18:21 - INFO - __main__ - Step 29147: {'lr': 0.0004596310244464381, 'samples': 5596224, 'steps': 29146, 'loss/train': 0.8829703330993652} 11/07/2021 01:18:22 - INFO - __main__ - Step 29148: {'lr': 0.0004596281329425361, 'samples': 5596416, 'steps': 29147, 'loss/train': 1.290963888168335} 11/07/2021 01:18:23 - INFO - __main__ - Step 29149: {'lr': 0.0004596252413441789, 'samples': 5596608, 'steps': 29148, 'loss/train': 0.9775720834732056} 11/07/2021 01:18:23 - INFO - __main__ - Step 29150: {'lr': 0.00045962234965136783, 'samples': 5596800, 'steps': 29149, 'loss/train': 1.0753238201141357} 11/07/2021 01:18:23 - INFO - __main__ - Step 29151: {'lr': 0.0004596194578641042, 'samples': 5596992, 'steps': 29150, 'loss/train': 1.0315475463867188} 11/07/2021 01:18:24 - INFO - __main__ - Step 29152: {'lr': 0.00045961656598238925, 'samples': 5597184, 'steps': 29151, 'loss/train': 1.5725375413894653} 11/07/2021 01:18:24 - INFO - __main__ - Step 29153: {'lr': 0.00045961367400622436, 'samples': 5597376, 'steps': 29152, 'loss/train': 1.589746356010437} 11/07/2021 01:18:25 - INFO - __main__ - Step 29154: {'lr': 0.00045961078193561066, 'samples': 5597568, 'steps': 29153, 'loss/train': 1.2909806966781616} 11/07/2021 01:18:26 - INFO - __main__ - Step 29155: {'lr': 0.00045960788977054967, 'samples': 5597760, 'steps': 29154, 'loss/train': 1.0973178148269653} 11/07/2021 01:18:26 - INFO - __main__ - Step 29156: {'lr': 0.0004596049975110426, 'samples': 5597952, 'steps': 29155, 'loss/train': 1.581822156906128} 11/07/2021 01:18:26 - INFO - __main__ - Step 29157: {'lr': 0.00045960210515709064, 'samples': 5598144, 'steps': 29156, 'loss/train': 1.6044692993164062} 11/07/2021 01:18:27 - INFO - __main__ - Step 29158: {'lr': 0.0004595992127086953, 'samples': 5598336, 'steps': 29157, 'loss/train': 1.4625189304351807} 11/07/2021 01:18:28 - INFO - __main__ - Step 29159: {'lr': 0.00045959632016585774, 'samples': 5598528, 'steps': 29158, 'loss/train': 1.1689990758895874} 11/07/2021 01:18:28 - INFO - __main__ - Step 29160: {'lr': 0.0004595934275285794, 'samples': 5598720, 'steps': 29159, 'loss/train': 1.17728853225708} 11/07/2021 01:18:28 - INFO - __main__ - Step 29161: {'lr': 0.00045959053479686143, 'samples': 5598912, 'steps': 29160, 'loss/train': 1.8568990230560303} 11/07/2021 01:18:29 - INFO - __main__ - Step 29162: {'lr': 0.0004595876419707052, 'samples': 5599104, 'steps': 29161, 'loss/train': 1.5310627222061157} 11/07/2021 01:18:29 - INFO - __main__ - Step 29163: {'lr': 0.00045958474905011205, 'samples': 5599296, 'steps': 29162, 'loss/train': 1.558485984802246} 11/07/2021 01:18:30 - INFO - __main__ - Step 29164: {'lr': 0.0004595818560350832, 'samples': 5599488, 'steps': 29163, 'loss/train': 1.6781156063079834} 11/07/2021 01:18:30 - INFO - __main__ - Step 29165: {'lr': 0.00045957896292562003, 'samples': 5599680, 'steps': 29164, 'loss/train': 1.1443002223968506} 11/07/2021 01:18:31 - INFO - __main__ - Step 29166: {'lr': 0.0004595760697217238, 'samples': 5599872, 'steps': 29165, 'loss/train': 1.450128436088562} 11/07/2021 01:18:31 - INFO - __main__ - Step 29167: {'lr': 0.0004595731764233958, 'samples': 5600064, 'steps': 29166, 'loss/train': 1.574720859527588} 11/07/2021 01:18:31 - INFO - __main__ - Step 29168: {'lr': 0.0004595702830306374, 'samples': 5600256, 'steps': 29167, 'loss/train': 1.3746927976608276} 11/07/2021 01:18:32 - INFO - __main__ - Step 29169: {'lr': 0.0004595673895434498, 'samples': 5600448, 'steps': 29168, 'loss/train': 1.7352036237716675} 11/07/2021 01:18:33 - INFO - __main__ - Step 29170: {'lr': 0.00045956449596183446, 'samples': 5600640, 'steps': 29169, 'loss/train': 1.508607268333435} 11/07/2021 01:18:33 - INFO - __main__ - Step 29171: {'lr': 0.00045956160228579257, 'samples': 5600832, 'steps': 29170, 'loss/train': 1.5599982738494873} 11/07/2021 01:18:33 - INFO - __main__ - Step 29172: {'lr': 0.00045955870851532545, 'samples': 5601024, 'steps': 29171, 'loss/train': 1.8760854005813599} 11/07/2021 01:18:34 - INFO - __main__ - Step 29173: {'lr': 0.0004595558146504344, 'samples': 5601216, 'steps': 29172, 'loss/train': 1.5260308980941772} 11/07/2021 01:18:35 - INFO - __main__ - Step 29174: {'lr': 0.0004595529206911207, 'samples': 5601408, 'steps': 29173, 'loss/train': 2.0422980785369873} 11/07/2021 01:18:35 - INFO - __main__ - Step 29175: {'lr': 0.00045955002663738574, 'samples': 5601600, 'steps': 29174, 'loss/train': 1.5203852653503418} 11/07/2021 01:18:36 - INFO - __main__ - Step 29176: {'lr': 0.0004595471324892307, 'samples': 5601792, 'steps': 29175, 'loss/train': 1.7010663747787476} 11/07/2021 01:18:36 - INFO - __main__ - Step 29177: {'lr': 0.00045954423824665704, 'samples': 5601984, 'steps': 29176, 'loss/train': 1.3339821100234985} 11/07/2021 01:18:36 - INFO - __main__ - Step 29178: {'lr': 0.00045954134390966593, 'samples': 5602176, 'steps': 29177, 'loss/train': 1.674849033355713} 11/07/2021 01:18:37 - INFO - __main__ - Step 29179: {'lr': 0.00045953844947825876, 'samples': 5602368, 'steps': 29178, 'loss/train': 1.3179696798324585} 11/07/2021 01:18:38 - INFO - __main__ - Step 29180: {'lr': 0.0004595355549524368, 'samples': 5602560, 'steps': 29179, 'loss/train': 1.878377079963684} 11/07/2021 01:18:38 - INFO - __main__ - Step 29181: {'lr': 0.0004595326603322013, 'samples': 5602752, 'steps': 29180, 'loss/train': 1.4170050621032715} 11/07/2021 01:18:38 - INFO - __main__ - Step 29182: {'lr': 0.00045952976561755365, 'samples': 5602944, 'steps': 29181, 'loss/train': 1.6162670850753784} 11/07/2021 01:18:39 - INFO - __main__ - Step 29183: {'lr': 0.00045952687080849517, 'samples': 5603136, 'steps': 29182, 'loss/train': 1.6467374563217163} 11/07/2021 01:18:39 - INFO - __main__ - Step 29184: {'lr': 0.000459523975905027, 'samples': 5603328, 'steps': 29183, 'loss/train': 1.4982033967971802} 11/07/2021 01:18:40 - INFO - __main__ - Step 29185: {'lr': 0.0004595210809071506, 'samples': 5603520, 'steps': 29184, 'loss/train': 1.130165934562683} 11/07/2021 01:18:41 - INFO - __main__ - Step 29186: {'lr': 0.0004595181858148673, 'samples': 5603712, 'steps': 29185, 'loss/train': 1.5109838247299194} 11/07/2021 01:18:41 - INFO - __main__ - Step 29187: {'lr': 0.00045951529062817834, 'samples': 5603904, 'steps': 29186, 'loss/train': 1.0921766757965088} 11/07/2021 01:18:41 - INFO - __main__ - Step 29188: {'lr': 0.00045951239534708496, 'samples': 5604096, 'steps': 29187, 'loss/train': 1.9147355556488037} 11/07/2021 01:18:42 - INFO - __main__ - Step 29189: {'lr': 0.0004595094999715885, 'samples': 5604288, 'steps': 29188, 'loss/train': 1.8540912866592407} 11/07/2021 01:18:43 - INFO - __main__ - Step 29190: {'lr': 0.00045950660450169034, 'samples': 5604480, 'steps': 29189, 'loss/train': 1.7768203020095825} 11/07/2021 01:18:43 - INFO - __main__ - Step 29191: {'lr': 0.0004595037089373918, 'samples': 5604672, 'steps': 29190, 'loss/train': 2.194700241088867} 11/07/2021 01:18:43 - INFO - __main__ - Step 29192: {'lr': 0.000459500813278694, 'samples': 5604864, 'steps': 29191, 'loss/train': 0.8722490072250366} 11/07/2021 01:18:44 - INFO - __main__ - Step 29193: {'lr': 0.0004594979175255984, 'samples': 5605056, 'steps': 29192, 'loss/train': 1.1617752313613892} 11/07/2021 01:18:44 - INFO - __main__ - Step 29194: {'lr': 0.0004594950216781063, 'samples': 5605248, 'steps': 29193, 'loss/train': 1.210199236869812} 11/07/2021 01:18:45 - INFO - __main__ - Step 29195: {'lr': 0.000459492125736219, 'samples': 5605440, 'steps': 29194, 'loss/train': 1.6432205438613892} 11/07/2021 01:18:45 - INFO - __main__ - Step 29196: {'lr': 0.00045948922969993777, 'samples': 5605632, 'steps': 29195, 'loss/train': 0.8898891806602478} 11/07/2021 01:18:46 - INFO - __main__ - Step 29197: {'lr': 0.0004594863335692639, 'samples': 5605824, 'steps': 29196, 'loss/train': 1.2649930715560913} 11/07/2021 01:18:46 - INFO - __main__ - Step 29198: {'lr': 0.00045948343734419873, 'samples': 5606016, 'steps': 29197, 'loss/train': 1.6112602949142456} 11/07/2021 01:18:47 - INFO - __main__ - Step 29199: {'lr': 0.00045948054102474357, 'samples': 5606208, 'steps': 29198, 'loss/train': 1.6902039051055908} 11/07/2021 01:18:48 - INFO - __main__ - Step 29200: {'lr': 0.00045947764461089967, 'samples': 5606400, 'steps': 29199, 'loss/train': 1.480944275856018} 11/07/2021 01:18:48 - INFO - __main__ - Step 29201: {'lr': 0.00045947474810266844, 'samples': 5606592, 'steps': 29200, 'loss/train': 0.9518805146217346} 11/07/2021 01:18:48 - INFO - __main__ - Step 29202: {'lr': 0.00045947185150005106, 'samples': 5606784, 'steps': 29201, 'loss/train': 1.26365065574646} 11/07/2021 01:18:49 - INFO - __main__ - Step 29203: {'lr': 0.0004594689548030489, 'samples': 5606976, 'steps': 29202, 'loss/train': 1.0666319131851196} 11/07/2021 01:18:49 - INFO - __main__ - Step 29204: {'lr': 0.0004594660580116633, 'samples': 5607168, 'steps': 29203, 'loss/train': 0.9265233874320984} 11/07/2021 01:18:49 - INFO - __main__ - Step 29205: {'lr': 0.00045946316112589546, 'samples': 5607360, 'steps': 29204, 'loss/train': 1.244853138923645} 11/07/2021 01:18:51 - INFO - __main__ - Step 29206: {'lr': 0.0004594602641457468, 'samples': 5607552, 'steps': 29205, 'loss/train': 1.705867052078247} 11/07/2021 01:18:51 - INFO - __main__ - Step 29207: {'lr': 0.0004594573670712186, 'samples': 5607744, 'steps': 29206, 'loss/train': 1.5587643384933472} 11/07/2021 01:18:51 - INFO - __main__ - Step 29208: {'lr': 0.0004594544699023121, 'samples': 5607936, 'steps': 29207, 'loss/train': 1.1340076923370361} 11/07/2021 01:18:52 - INFO - __main__ - Step 29209: {'lr': 0.0004594515726390287, 'samples': 5608128, 'steps': 29208, 'loss/train': 1.416571855545044} 11/07/2021 01:18:52 - INFO - __main__ - Step 29210: {'lr': 0.00045944867528136956, 'samples': 5608320, 'steps': 29209, 'loss/train': 1.8951489925384521} 11/07/2021 01:18:53 - INFO - __main__ - Step 29211: {'lr': 0.00045944577782933615, 'samples': 5608512, 'steps': 29210, 'loss/train': 1.3687822818756104} 11/07/2021 01:18:53 - INFO - __main__ - Step 29212: {'lr': 0.0004594428802829297, 'samples': 5608704, 'steps': 29211, 'loss/train': 1.7491780519485474} 11/07/2021 01:18:54 - INFO - __main__ - Step 29213: {'lr': 0.00045943998264215153, 'samples': 5608896, 'steps': 29212, 'loss/train': 1.585396409034729} 11/07/2021 01:18:54 - INFO - __main__ - Step 29214: {'lr': 0.0004594370849070029, 'samples': 5609088, 'steps': 29213, 'loss/train': 1.8933584690093994} 11/07/2021 01:18:54 - INFO - __main__ - Step 29215: {'lr': 0.00045943418707748517, 'samples': 5609280, 'steps': 29214, 'loss/train': 1.7437858581542969} 11/07/2021 01:18:56 - INFO - __main__ - Step 29216: {'lr': 0.00045943128915359966, 'samples': 5609472, 'steps': 29215, 'loss/train': 1.339712381362915} 11/07/2021 01:18:56 - INFO - __main__ - Step 29217: {'lr': 0.0004594283911353476, 'samples': 5609664, 'steps': 29216, 'loss/train': 1.498565673828125} 11/07/2021 01:18:56 - INFO - __main__ - Step 29218: {'lr': 0.0004594254930227303, 'samples': 5609856, 'steps': 29217, 'loss/train': 1.5219602584838867} 11/07/2021 01:18:57 - INFO - __main__ - Step 29219: {'lr': 0.0004594225948157492, 'samples': 5610048, 'steps': 29218, 'loss/train': 1.590041160583496} 11/07/2021 01:18:57 - INFO - __main__ - Step 29220: {'lr': 0.0004594196965144054, 'samples': 5610240, 'steps': 29219, 'loss/train': 1.4370402097702026} 11/07/2021 01:18:58 - INFO - __main__ - Step 29221: {'lr': 0.0004594167981187004, 'samples': 5610432, 'steps': 29220, 'loss/train': 1.6703990697860718} 11/07/2021 01:18:58 - INFO - __main__ - Step 29222: {'lr': 0.00045941389962863546, 'samples': 5610624, 'steps': 29221, 'loss/train': 1.4225343465805054} 11/07/2021 01:18:59 - INFO - __main__ - Step 29223: {'lr': 0.00045941100104421176, 'samples': 5610816, 'steps': 29222, 'loss/train': 0.5798635482788086} 11/07/2021 01:18:59 - INFO - __main__ - Step 29224: {'lr': 0.0004594081023654307, 'samples': 5611008, 'steps': 29223, 'loss/train': 1.542803406715393} 11/07/2021 01:18:59 - INFO - __main__ - Step 29225: {'lr': 0.00045940520359229366, 'samples': 5611200, 'steps': 29224, 'loss/train': 2.027190923690796} 11/07/2021 01:19:00 - INFO - __main__ - Step 29226: {'lr': 0.0004594023047248018, 'samples': 5611392, 'steps': 29225, 'loss/train': 1.4777716398239136} 11/07/2021 01:19:01 - INFO - __main__ - Step 29227: {'lr': 0.0004593994057629565, 'samples': 5611584, 'steps': 29226, 'loss/train': 2.0218591690063477} 11/07/2021 01:19:01 - INFO - __main__ - Step 29228: {'lr': 0.000459396506706759, 'samples': 5611776, 'steps': 29227, 'loss/train': 1.5503898859024048} 11/07/2021 01:19:02 - INFO - __main__ - Step 29229: {'lr': 0.00045939360755621074, 'samples': 5611968, 'steps': 29228, 'loss/train': 1.5109821557998657} 11/07/2021 01:19:02 - INFO - __main__ - Step 29230: {'lr': 0.00045939070831131293, 'samples': 5612160, 'steps': 29229, 'loss/train': 1.4671088457107544} 11/07/2021 01:19:02 - INFO - __main__ - Step 29231: {'lr': 0.00045938780897206686, 'samples': 5612352, 'steps': 29230, 'loss/train': 1.9293100833892822} 11/07/2021 01:19:03 - INFO - __main__ - Step 29232: {'lr': 0.000459384909538474, 'samples': 5612544, 'steps': 29231, 'loss/train': 1.5910476446151733} 11/07/2021 01:19:04 - INFO - __main__ - Step 29233: {'lr': 0.00045938201001053546, 'samples': 5612736, 'steps': 29232, 'loss/train': 1.4884496927261353} 11/07/2021 01:19:04 - INFO - __main__ - Step 29234: {'lr': 0.00045937911038825257, 'samples': 5612928, 'steps': 29233, 'loss/train': 2.0655133724212646} 11/07/2021 01:19:04 - INFO - __main__ - Step 29235: {'lr': 0.00045937621067162674, 'samples': 5613120, 'steps': 29234, 'loss/train': 1.460890293121338} 11/07/2021 01:19:05 - INFO - __main__ - Step 29236: {'lr': 0.0004593733108606592, 'samples': 5613312, 'steps': 29235, 'loss/train': 1.3662402629852295} 11/07/2021 01:19:06 - INFO - __main__ - Step 29237: {'lr': 0.00045937041095535125, 'samples': 5613504, 'steps': 29236, 'loss/train': 2.0695712566375732} 11/07/2021 01:19:06 - INFO - __main__ - Step 29238: {'lr': 0.00045936751095570426, 'samples': 5613696, 'steps': 29237, 'loss/train': 0.9652360081672668} 11/07/2021 01:19:06 - INFO - __main__ - Step 29239: {'lr': 0.0004593646108617195, 'samples': 5613888, 'steps': 29238, 'loss/train': 1.482219934463501} 11/07/2021 01:19:07 - INFO - __main__ - Step 29240: {'lr': 0.00045936171067339826, 'samples': 5614080, 'steps': 29239, 'loss/train': 1.7664074897766113} 11/07/2021 01:19:07 - INFO - __main__ - Step 29241: {'lr': 0.0004593588103907419, 'samples': 5614272, 'steps': 29240, 'loss/train': 2.0414185523986816} 11/07/2021 01:19:09 - INFO - __main__ - Step 29242: {'lr': 0.00045935591001375163, 'samples': 5614464, 'steps': 29241, 'loss/train': 1.3956793546676636} 11/07/2021 01:19:09 - INFO - __main__ - Step 29243: {'lr': 0.0004593530095424289, 'samples': 5614656, 'steps': 29242, 'loss/train': 0.9382502436637878} 11/07/2021 01:19:09 - INFO - __main__ - Step 29244: {'lr': 0.0004593501089767749, 'samples': 5614848, 'steps': 29243, 'loss/train': 0.818658173084259} 11/07/2021 01:19:10 - INFO - __main__ - Step 29245: {'lr': 0.00045934720831679093, 'samples': 5615040, 'steps': 29244, 'loss/train': 0.30459633469581604} 11/07/2021 01:19:10 - INFO - __main__ - Step 29246: {'lr': 0.00045934430756247835, 'samples': 5615232, 'steps': 29245, 'loss/train': 0.5222163796424866} 11/07/2021 01:19:10 - INFO - __main__ - Step 29247: {'lr': 0.0004593414067138385, 'samples': 5615424, 'steps': 29246, 'loss/train': 2.345348596572876} 11/07/2021 01:19:12 - INFO - __main__ - Step 29248: {'lr': 0.0004593385057708726, 'samples': 5615616, 'steps': 29247, 'loss/train': 1.585673451423645} 11/07/2021 01:19:12 - INFO - __main__ - Step 29249: {'lr': 0.00045933560473358206, 'samples': 5615808, 'steps': 29248, 'loss/train': 1.8178879022598267} 11/07/2021 01:19:12 - INFO - __main__ - Step 29250: {'lr': 0.00045933270360196804, 'samples': 5616000, 'steps': 29249, 'loss/train': 0.9683892130851746} 11/07/2021 01:19:13 - INFO - __main__ - Step 29251: {'lr': 0.00045932980237603196, 'samples': 5616192, 'steps': 29250, 'loss/train': 1.3415101766586304} 11/07/2021 01:19:13 - INFO - __main__ - Step 29252: {'lr': 0.0004593269010557751, 'samples': 5616384, 'steps': 29251, 'loss/train': 1.6597208976745605} 11/07/2021 01:19:13 - INFO - __main__ - Step 29253: {'lr': 0.00045932399964119884, 'samples': 5616576, 'steps': 29252, 'loss/train': 1.457929253578186} 11/07/2021 01:19:14 - INFO - __main__ - Step 29254: {'lr': 0.00045932109813230437, 'samples': 5616768, 'steps': 29253, 'loss/train': 1.772652506828308} 11/07/2021 01:19:15 - INFO - __main__ - Step 29255: {'lr': 0.00045931819652909303, 'samples': 5616960, 'steps': 29254, 'loss/train': 1.9733912944793701} 11/07/2021 01:19:15 - INFO - __main__ - Step 29256: {'lr': 0.0004593152948315661, 'samples': 5617152, 'steps': 29255, 'loss/train': 1.6951172351837158} 11/07/2021 01:19:15 - INFO - __main__ - Step 29257: {'lr': 0.000459312393039725, 'samples': 5617344, 'steps': 29256, 'loss/train': 1.5540403127670288} 11/07/2021 01:19:16 - INFO - __main__ - Step 29258: {'lr': 0.0004593094911535709, 'samples': 5617536, 'steps': 29257, 'loss/train': 2.5532915592193604} 11/07/2021 01:19:17 - INFO - __main__ - Step 29259: {'lr': 0.00045930658917310525, 'samples': 5617728, 'steps': 29258, 'loss/train': 1.8263392448425293} 11/07/2021 01:19:17 - INFO - __main__ - Step 29260: {'lr': 0.0004593036870983293, 'samples': 5617920, 'steps': 29259, 'loss/train': 1.6476937532424927} 11/07/2021 01:19:18 - INFO - __main__ - Step 29261: {'lr': 0.0004593007849292442, 'samples': 5618112, 'steps': 29260, 'loss/train': 1.4127036333084106} 11/07/2021 01:19:18 - INFO - __main__ - Step 29262: {'lr': 0.0004592978826658515, 'samples': 5618304, 'steps': 29261, 'loss/train': 1.454675555229187} 11/07/2021 01:19:18 - INFO - __main__ - Step 29263: {'lr': 0.0004592949803081524, 'samples': 5618496, 'steps': 29262, 'loss/train': 1.6450902223587036} 11/07/2021 01:19:19 - INFO - __main__ - Step 29264: {'lr': 0.0004592920778561481, 'samples': 5618688, 'steps': 29263, 'loss/train': 1.128226637840271} 11/07/2021 01:19:20 - INFO - __main__ - Step 29265: {'lr': 0.00045928917530984014, 'samples': 5618880, 'steps': 29264, 'loss/train': 1.5634617805480957} 11/07/2021 01:19:20 - INFO - __main__ - Step 29266: {'lr': 0.00045928627266922974, 'samples': 5619072, 'steps': 29265, 'loss/train': 1.4417423009872437} 11/07/2021 01:19:20 - INFO - __main__ - Step 29267: {'lr': 0.0004592833699343181, 'samples': 5619264, 'steps': 29266, 'loss/train': 1.5430980920791626} 11/07/2021 01:19:21 - INFO - __main__ - Step 29268: {'lr': 0.0004592804671051066, 'samples': 5619456, 'steps': 29267, 'loss/train': 1.0452214479446411} 11/07/2021 01:19:22 - INFO - __main__ - Step 29269: {'lr': 0.0004592775641815966, 'samples': 5619648, 'steps': 29268, 'loss/train': 1.6853535175323486} 11/07/2021 01:19:22 - INFO - __main__ - Step 29270: {'lr': 0.0004592746611637893, 'samples': 5619840, 'steps': 29269, 'loss/train': 1.5771098136901855} 11/07/2021 01:19:23 - INFO - __main__ - Step 29271: {'lr': 0.00045927175805168607, 'samples': 5620032, 'steps': 29270, 'loss/train': 1.4390790462493896} 11/07/2021 01:19:23 - INFO - __main__ - Step 29272: {'lr': 0.00045926885484528823, 'samples': 5620224, 'steps': 29271, 'loss/train': 1.9551517963409424} 11/07/2021 01:19:23 - INFO - __main__ - Step 29273: {'lr': 0.0004592659515445971, 'samples': 5620416, 'steps': 29272, 'loss/train': 1.2658156156539917} 11/07/2021 01:19:24 - INFO - __main__ - Step 29274: {'lr': 0.00045926304814961397, 'samples': 5620608, 'steps': 29273, 'loss/train': 1.2549620866775513} 11/07/2021 01:19:25 - INFO - __main__ - Step 29275: {'lr': 0.00045926014466034004, 'samples': 5620800, 'steps': 29274, 'loss/train': 1.371767282485962} 11/07/2021 01:19:25 - INFO - __main__ - Step 29276: {'lr': 0.0004592572410767768, 'samples': 5620992, 'steps': 29275, 'loss/train': 1.3488487005233765} 11/07/2021 01:19:25 - INFO - __main__ - Step 29277: {'lr': 0.0004592543373989255, 'samples': 5621184, 'steps': 29276, 'loss/train': 1.4339321851730347} 11/07/2021 01:19:26 - INFO - __main__ - Step 29278: {'lr': 0.0004592514336267874, 'samples': 5621376, 'steps': 29277, 'loss/train': 1.396821141242981} 11/07/2021 01:19:26 - INFO - __main__ - Step 29279: {'lr': 0.0004592485297603638, 'samples': 5621568, 'steps': 29278, 'loss/train': 1.555566668510437} 11/07/2021 01:19:27 - INFO - __main__ - Step 29280: {'lr': 0.0004592456257996561, 'samples': 5621760, 'steps': 29279, 'loss/train': 1.6349639892578125} 11/07/2021 01:19:28 - INFO - __main__ - Step 29281: {'lr': 0.0004592427217446655, 'samples': 5621952, 'steps': 29280, 'loss/train': 1.6176657676696777} 11/07/2021 01:19:28 - INFO - __main__ - Step 29282: {'lr': 0.00045923981759539336, 'samples': 5622144, 'steps': 29281, 'loss/train': 1.900167465209961} 11/07/2021 01:19:28 - INFO - __main__ - Step 29283: {'lr': 0.000459236913351841, 'samples': 5622336, 'steps': 29282, 'loss/train': 2.1861953735351562} 11/07/2021 01:19:29 - INFO - __main__ - Step 29284: {'lr': 0.0004592340090140097, 'samples': 5622528, 'steps': 29283, 'loss/train': 1.3462752103805542} 11/07/2021 01:19:30 - INFO - __main__ - Step 29285: {'lr': 0.0004592311045819008, 'samples': 5622720, 'steps': 29284, 'loss/train': 1.4200493097305298} 11/07/2021 01:19:31 - INFO - __main__ - Step 29286: {'lr': 0.00045922820005551556, 'samples': 5622912, 'steps': 29285, 'loss/train': 1.8635931015014648} 11/07/2021 01:19:31 - INFO - __main__ - Step 29287: {'lr': 0.0004592252954348554, 'samples': 5623104, 'steps': 29286, 'loss/train': 1.6215516328811646} 11/07/2021 01:19:31 - INFO - __main__ - Step 29288: {'lr': 0.0004592223907199215, 'samples': 5623296, 'steps': 29287, 'loss/train': 1.471245527267456} 11/07/2021 01:19:32 - INFO - __main__ - Step 29289: {'lr': 0.0004592194859107153, 'samples': 5623488, 'steps': 29288, 'loss/train': 0.9702801704406738} 11/07/2021 01:19:33 - INFO - __main__ - Step 29290: {'lr': 0.0004592165810072379, 'samples': 5623680, 'steps': 29289, 'loss/train': 1.7796401977539062} 11/07/2021 01:19:33 - INFO - __main__ - Step 29291: {'lr': 0.00045921367600949077, 'samples': 5623872, 'steps': 29290, 'loss/train': 1.4581795930862427} 11/07/2021 01:19:33 - INFO - __main__ - Step 29292: {'lr': 0.0004592107709174752, 'samples': 5624064, 'steps': 29291, 'loss/train': 1.3144384622573853} 11/07/2021 01:19:34 - INFO - __main__ - Step 29293: {'lr': 0.0004592078657311925, 'samples': 5624256, 'steps': 29292, 'loss/train': 1.609805703163147} 11/07/2021 01:19:34 - INFO - __main__ - Step 29294: {'lr': 0.000459204960450644, 'samples': 5624448, 'steps': 29293, 'loss/train': 1.7830214500427246} 11/07/2021 01:19:35 - INFO - __main__ - Step 29295: {'lr': 0.0004592020550758309, 'samples': 5624640, 'steps': 29294, 'loss/train': 1.4547706842422485} 11/07/2021 01:19:35 - INFO - __main__ - Step 29296: {'lr': 0.0004591991496067546, 'samples': 5624832, 'steps': 29295, 'loss/train': 2.029094696044922} 11/07/2021 01:19:36 - INFO - __main__ - Step 29297: {'lr': 0.00045919624404341643, 'samples': 5625024, 'steps': 29296, 'loss/train': 0.851370632648468} 11/07/2021 01:19:36 - INFO - __main__ - Step 29298: {'lr': 0.00045919333838581757, 'samples': 5625216, 'steps': 29297, 'loss/train': 1.4973098039627075} 11/07/2021 01:19:37 - INFO - __main__ - Step 29299: {'lr': 0.00045919043263395953, 'samples': 5625408, 'steps': 29298, 'loss/train': 1.7972590923309326} 11/07/2021 01:19:37 - INFO - __main__ - Step 29300: {'lr': 0.00045918752678784344, 'samples': 5625600, 'steps': 29299, 'loss/train': 1.6929970979690552} 11/07/2021 01:19:38 - INFO - __main__ - Step 29301: {'lr': 0.0004591846208474707, 'samples': 5625792, 'steps': 29300, 'loss/train': 1.6178035736083984} 11/07/2021 01:19:38 - INFO - __main__ - Step 29302: {'lr': 0.00045918171481284256, 'samples': 5625984, 'steps': 29301, 'loss/train': 1.3055503368377686} 11/07/2021 01:19:39 - INFO - __main__ - Step 29303: {'lr': 0.0004591788086839604, 'samples': 5626176, 'steps': 29302, 'loss/train': 1.4833933115005493} 11/07/2021 01:19:39 - INFO - __main__ - Step 29304: {'lr': 0.0004591759024608255, 'samples': 5626368, 'steps': 29303, 'loss/train': 1.6076900959014893} 11/07/2021 01:19:39 - INFO - __main__ - Step 29305: {'lr': 0.0004591729961434392, 'samples': 5626560, 'steps': 29304, 'loss/train': 1.6441346406936646} 11/07/2021 01:19:40 - INFO - __main__ - Step 29306: {'lr': 0.00045917008973180273, 'samples': 5626752, 'steps': 29305, 'loss/train': 1.4150656461715698} 11/07/2021 01:19:41 - INFO - __main__ - Step 29307: {'lr': 0.0004591671832259174, 'samples': 5626944, 'steps': 29306, 'loss/train': 1.6857331991195679} 11/07/2021 01:19:41 - INFO - __main__ - Step 29308: {'lr': 0.00045916427662578464, 'samples': 5627136, 'steps': 29307, 'loss/train': 1.3254634141921997} 11/07/2021 01:19:41 - INFO - __main__ - Step 29309: {'lr': 0.00045916136993140574, 'samples': 5627328, 'steps': 29308, 'loss/train': 2.0325093269348145} 11/07/2021 01:19:42 - INFO - __main__ - Step 29310: {'lr': 0.00045915846314278187, 'samples': 5627520, 'steps': 29309, 'loss/train': 1.971750020980835} 11/07/2021 01:19:43 - INFO - __main__ - Step 29311: {'lr': 0.0004591555562599144, 'samples': 5627712, 'steps': 29310, 'loss/train': 1.5656483173370361} 11/07/2021 01:19:43 - INFO - __main__ - Step 29312: {'lr': 0.00045915264928280476, 'samples': 5627904, 'steps': 29311, 'loss/train': 1.7062572240829468} 11/07/2021 01:19:44 - INFO - __main__ - Step 29313: {'lr': 0.00045914974221145403, 'samples': 5628096, 'steps': 29312, 'loss/train': 1.6548060178756714} 11/07/2021 01:19:44 - INFO - __main__ - Step 29314: {'lr': 0.00045914683504586374, 'samples': 5628288, 'steps': 29313, 'loss/train': 1.8358749151229858} 11/07/2021 01:19:44 - INFO - __main__ - Step 29315: {'lr': 0.0004591439277860351, 'samples': 5628480, 'steps': 29314, 'loss/train': 1.781209945678711} 11/07/2021 01:19:45 - INFO - __main__ - Step 29316: {'lr': 0.00045914102043196947, 'samples': 5628672, 'steps': 29315, 'loss/train': 0.8205442428588867} 11/07/2021 01:19:46 - INFO - __main__ - Step 29317: {'lr': 0.00045913811298366804, 'samples': 5628864, 'steps': 29316, 'loss/train': 1.5368036031723022} 11/07/2021 01:19:46 - INFO - __main__ - Step 29318: {'lr': 0.0004591352054411323, 'samples': 5629056, 'steps': 29317, 'loss/train': 1.600978970527649} 11/07/2021 01:19:46 - INFO - __main__ - Step 29319: {'lr': 0.00045913229780436337, 'samples': 5629248, 'steps': 29318, 'loss/train': 0.5395891070365906} 11/07/2021 01:19:47 - INFO - __main__ - Step 29320: {'lr': 0.00045912939007336273, 'samples': 5629440, 'steps': 29319, 'loss/train': 1.9441035985946655} 11/07/2021 01:19:47 - INFO - __main__ - Step 29321: {'lr': 0.0004591264822481316, 'samples': 5629632, 'steps': 29320, 'loss/train': 1.6170451641082764} 11/07/2021 01:19:48 - INFO - __main__ - Step 29322: {'lr': 0.00045912357432867124, 'samples': 5629824, 'steps': 29321, 'loss/train': 0.8419768214225769} 11/07/2021 01:19:49 - INFO - __main__ - Step 29323: {'lr': 0.00045912066631498304, 'samples': 5630016, 'steps': 29322, 'loss/train': 1.2050539255142212} 11/07/2021 01:19:49 - INFO - __main__ - Step 29324: {'lr': 0.00045911775820706835, 'samples': 5630208, 'steps': 29323, 'loss/train': 1.4944329261779785} 11/07/2021 01:19:49 - INFO - __main__ - Step 29325: {'lr': 0.0004591148500049284, 'samples': 5630400, 'steps': 29324, 'loss/train': 1.7582799196243286} 11/07/2021 01:19:50 - INFO - __main__ - Step 29326: {'lr': 0.00045911194170856454, 'samples': 5630592, 'steps': 29325, 'loss/train': 2.302961826324463} 11/07/2021 01:19:51 - INFO - __main__ - Step 29327: {'lr': 0.00045910903331797807, 'samples': 5630784, 'steps': 29326, 'loss/train': 1.354048252105713} 11/07/2021 01:19:51 - INFO - __main__ - Step 29328: {'lr': 0.00045910612483317025, 'samples': 5630976, 'steps': 29327, 'loss/train': 1.7314233779907227} 11/07/2021 01:19:51 - INFO - __main__ - Step 29329: {'lr': 0.00045910321625414245, 'samples': 5631168, 'steps': 29328, 'loss/train': 1.6378663778305054} 11/07/2021 01:19:52 - INFO - __main__ - Step 29330: {'lr': 0.00045910030758089597, 'samples': 5631360, 'steps': 29329, 'loss/train': 1.2831581830978394} 11/07/2021 01:19:52 - INFO - __main__ - Step 29331: {'lr': 0.00045909739881343215, 'samples': 5631552, 'steps': 29330, 'loss/train': 1.3802523612976074} 11/07/2021 01:19:53 - INFO - __main__ - Step 29332: {'lr': 0.00045909448995175224, 'samples': 5631744, 'steps': 29331, 'loss/train': 1.3257133960723877} 11/07/2021 01:19:54 - INFO - __main__ - Step 29333: {'lr': 0.00045909158099585756, 'samples': 5631936, 'steps': 29332, 'loss/train': 1.4882980585098267} 11/07/2021 01:19:54 - INFO - __main__ - Step 29334: {'lr': 0.00045908867194574955, 'samples': 5632128, 'steps': 29333, 'loss/train': 1.4736945629119873} 11/07/2021 01:19:54 - INFO - __main__ - Step 29335: {'lr': 0.00045908576280142925, 'samples': 5632320, 'steps': 29334, 'loss/train': 1.6490302085876465} 11/07/2021 01:19:55 - INFO - __main__ - Step 29336: {'lr': 0.00045908285356289824, 'samples': 5632512, 'steps': 29335, 'loss/train': 1.8033547401428223} 11/07/2021 01:19:56 - INFO - __main__ - Step 29337: {'lr': 0.0004590799442301577, 'samples': 5632704, 'steps': 29336, 'loss/train': 1.8330843448638916} 11/07/2021 01:19:56 - INFO - __main__ - Step 29338: {'lr': 0.00045907703480320894, 'samples': 5632896, 'steps': 29337, 'loss/train': 1.6994603872299194} 11/07/2021 01:19:56 - INFO - __main__ - Step 29339: {'lr': 0.0004590741252820533, 'samples': 5633088, 'steps': 29338, 'loss/train': 1.590700387954712} 11/07/2021 01:19:57 - INFO - __main__ - Step 29340: {'lr': 0.00045907121566669216, 'samples': 5633280, 'steps': 29339, 'loss/train': 1.7707066535949707} 11/07/2021 01:19:57 - INFO - __main__ - Step 29341: {'lr': 0.0004590683059571267, 'samples': 5633472, 'steps': 29340, 'loss/train': 1.6956660747528076} 11/07/2021 01:19:58 - INFO - __main__ - Step 29342: {'lr': 0.0004590653961533582, 'samples': 5633664, 'steps': 29341, 'loss/train': 1.455352544784546} 11/07/2021 01:19:58 - INFO - __main__ - Step 29343: {'lr': 0.00045906248625538816, 'samples': 5633856, 'steps': 29342, 'loss/train': 1.6896904706954956} 11/07/2021 01:19:59 - INFO - __main__ - Step 29344: {'lr': 0.00045905957626321775, 'samples': 5634048, 'steps': 29343, 'loss/train': 1.7443041801452637} 11/07/2021 01:19:59 - INFO - __main__ - Step 29345: {'lr': 0.0004590566661768484, 'samples': 5634240, 'steps': 29344, 'loss/train': 1.7320188283920288} 11/07/2021 01:19:59 - INFO - __main__ - Step 29346: {'lr': 0.00045905375599628127, 'samples': 5634432, 'steps': 29345, 'loss/train': 1.636330246925354} 11/07/2021 01:20:01 - INFO - __main__ - Step 29347: {'lr': 0.00045905084572151774, 'samples': 5634624, 'steps': 29346, 'loss/train': 1.3468916416168213} 11/07/2021 01:20:01 - INFO - __main__ - Step 29348: {'lr': 0.0004590479353525591, 'samples': 5634816, 'steps': 29347, 'loss/train': 1.354005217552185} 11/07/2021 01:20:01 - INFO - __main__ - Step 29349: {'lr': 0.00045904502488940677, 'samples': 5635008, 'steps': 29348, 'loss/train': 1.5239914655685425} 11/07/2021 01:20:02 - INFO - __main__ - Step 29350: {'lr': 0.0004590421143320619, 'samples': 5635200, 'steps': 29349, 'loss/train': 0.6602169871330261} 11/07/2021 01:20:02 - INFO - __main__ - Step 29351: {'lr': 0.0004590392036805259, 'samples': 5635392, 'steps': 29350, 'loss/train': 1.3653275966644287} 11/07/2021 01:20:02 - INFO - __main__ - Step 29352: {'lr': 0.0004590362929348001, 'samples': 5635584, 'steps': 29351, 'loss/train': 1.6804929971694946} 11/07/2021 01:20:03 - INFO - __main__ - Step 29353: {'lr': 0.00045903338209488575, 'samples': 5635776, 'steps': 29352, 'loss/train': 1.9492771625518799} 11/07/2021 01:20:04 - INFO - __main__ - Step 29354: {'lr': 0.0004590304711607842, 'samples': 5635968, 'steps': 29353, 'loss/train': 1.5898215770721436} 11/07/2021 01:20:04 - INFO - __main__ - Step 29355: {'lr': 0.0004590275601324967, 'samples': 5636160, 'steps': 29354, 'loss/train': 2.0531885623931885} 11/07/2021 01:20:05 - INFO - __main__ - Step 29356: {'lr': 0.0004590246490100246, 'samples': 5636352, 'steps': 29355, 'loss/train': 0.7158932089805603} 11/07/2021 01:20:05 - INFO - __main__ - Step 29357: {'lr': 0.00045902173779336925, 'samples': 5636544, 'steps': 29356, 'loss/train': 0.8776659965515137} 11/07/2021 01:20:06 - INFO - __main__ - Step 29358: {'lr': 0.0004590188264825319, 'samples': 5636736, 'steps': 29357, 'loss/train': 1.6090434789657593} 11/07/2021 01:20:06 - INFO - __main__ - Step 29359: {'lr': 0.00045901591507751393, 'samples': 5636928, 'steps': 29358, 'loss/train': 1.2736858129501343} 11/07/2021 01:20:07 - INFO - __main__ - Step 29360: {'lr': 0.00045901300357831666, 'samples': 5637120, 'steps': 29359, 'loss/train': 0.9557590484619141} 11/07/2021 01:20:07 - INFO - __main__ - Step 29361: {'lr': 0.00045901009198494124, 'samples': 5637312, 'steps': 29360, 'loss/train': 1.6712108850479126} 11/07/2021 01:20:07 - INFO - __main__ - Step 29362: {'lr': 0.0004590071802973892, 'samples': 5637504, 'steps': 29361, 'loss/train': 1.7179455757141113} 11/07/2021 01:20:08 - INFO - __main__ - Step 29363: {'lr': 0.0004590042685156617, 'samples': 5637696, 'steps': 29362, 'loss/train': 2.0271809101104736} 11/07/2021 01:20:09 - INFO - __main__ - Step 29364: {'lr': 0.0004590013566397601, 'samples': 5637888, 'steps': 29363, 'loss/train': 1.2667248249053955} 11/07/2021 01:20:09 - INFO - __main__ - Step 29365: {'lr': 0.00045899844466968574, 'samples': 5638080, 'steps': 29364, 'loss/train': 1.6341489553451538} 11/07/2021 01:20:09 - INFO - __main__ - Step 29366: {'lr': 0.00045899553260543986, 'samples': 5638272, 'steps': 29365, 'loss/train': 1.4595669507980347} 11/07/2021 01:20:10 - INFO - __main__ - Step 29367: {'lr': 0.0004589926204470238, 'samples': 5638464, 'steps': 29366, 'loss/train': 1.6928445100784302} 11/07/2021 01:20:11 - INFO - __main__ - Step 29368: {'lr': 0.000458989708194439, 'samples': 5638656, 'steps': 29367, 'loss/train': 1.4349541664123535} 11/07/2021 01:20:11 - INFO - __main__ - Step 29369: {'lr': 0.0004589867958476866, 'samples': 5638848, 'steps': 29368, 'loss/train': 1.667285680770874} 11/07/2021 01:20:11 - INFO - __main__ - Step 29370: {'lr': 0.000458983883406768, 'samples': 5639040, 'steps': 29369, 'loss/train': 1.657423734664917} 11/07/2021 01:20:12 - INFO - __main__ - Step 29371: {'lr': 0.0004589809708716844, 'samples': 5639232, 'steps': 29370, 'loss/train': 1.6851989030838013} 11/07/2021 01:20:12 - INFO - __main__ - Step 29372: {'lr': 0.0004589780582424373, 'samples': 5639424, 'steps': 29371, 'loss/train': 1.1141544580459595} 11/07/2021 01:20:13 - INFO - __main__ - Step 29373: {'lr': 0.00045897514551902785, 'samples': 5639616, 'steps': 29372, 'loss/train': 1.305080771446228} 11/07/2021 01:20:14 - INFO - __main__ - Step 29374: {'lr': 0.0004589722327014575, 'samples': 5639808, 'steps': 29373, 'loss/train': 1.4186768531799316} 11/07/2021 01:20:14 - INFO - __main__ - Step 29375: {'lr': 0.0004589693197897274, 'samples': 5640000, 'steps': 29374, 'loss/train': 1.5759755373001099} 11/07/2021 01:20:15 - INFO - __main__ - Step 29376: {'lr': 0.0004589664067838389, 'samples': 5640192, 'steps': 29375, 'loss/train': 1.7532117366790771} 11/07/2021 01:20:15 - INFO - __main__ - Step 29377: {'lr': 0.00045896349368379356, 'samples': 5640384, 'steps': 29376, 'loss/train': 1.5392422676086426} 11/07/2021 01:20:16 - INFO - __main__ - Step 29378: {'lr': 0.00045896058048959233, 'samples': 5640576, 'steps': 29377, 'loss/train': 0.8304669857025146} 11/07/2021 01:20:16 - INFO - __main__ - Step 29379: {'lr': 0.00045895766720123677, 'samples': 5640768, 'steps': 29378, 'loss/train': 1.7094236612319946} 11/07/2021 01:20:17 - INFO - __main__ - Step 29380: {'lr': 0.0004589547538187281, 'samples': 5640960, 'steps': 29379, 'loss/train': 1.5307104587554932} 11/07/2021 01:20:17 - INFO - __main__ - Step 29381: {'lr': 0.0004589518403420676, 'samples': 5641152, 'steps': 29380, 'loss/train': 1.6963664293289185} 11/07/2021 01:20:17 - INFO - __main__ - Step 29382: {'lr': 0.00045894892677125667, 'samples': 5641344, 'steps': 29381, 'loss/train': 1.6436216831207275} 11/07/2021 01:20:18 - INFO - __main__ - Step 29383: {'lr': 0.0004589460131062965, 'samples': 5641536, 'steps': 29382, 'loss/train': 1.7731062173843384} 11/07/2021 01:20:19 - INFO - __main__ - Step 29384: {'lr': 0.00045894309934718853, 'samples': 5641728, 'steps': 29383, 'loss/train': 1.121886134147644} 11/07/2021 01:20:19 - INFO - __main__ - Step 29385: {'lr': 0.00045894018549393404, 'samples': 5641920, 'steps': 29384, 'loss/train': 1.575050711631775} 11/07/2021 01:20:19 - INFO - __main__ - Step 29386: {'lr': 0.0004589372715465343, 'samples': 5642112, 'steps': 29385, 'loss/train': 1.6603325605392456} 11/07/2021 01:20:20 - INFO - __main__ - Step 29387: {'lr': 0.0004589343575049907, 'samples': 5642304, 'steps': 29386, 'loss/train': 1.5052731037139893} 11/07/2021 01:20:21 - INFO - __main__ - Step 29388: {'lr': 0.0004589314433693044, 'samples': 5642496, 'steps': 29387, 'loss/train': 1.483961820602417} 11/07/2021 01:20:21 - INFO - __main__ - Step 29389: {'lr': 0.0004589285291394769, 'samples': 5642688, 'steps': 29388, 'loss/train': 1.6082696914672852} 11/07/2021 01:20:22 - INFO - __main__ - Step 29390: {'lr': 0.00045892561481550943, 'samples': 5642880, 'steps': 29389, 'loss/train': 0.4304291605949402} 11/07/2021 01:20:22 - INFO - __main__ - Step 29391: {'lr': 0.0004589227003974032, 'samples': 5643072, 'steps': 29390, 'loss/train': 2.2525391578674316} 11/07/2021 01:20:22 - INFO - __main__ - Step 29392: {'lr': 0.00045891978588515975, 'samples': 5643264, 'steps': 29391, 'loss/train': 1.5738767385482788} 11/07/2021 01:20:23 - INFO - __main__ - Step 29393: {'lr': 0.0004589168712787802, 'samples': 5643456, 'steps': 29392, 'loss/train': 0.9495261907577515} 11/07/2021 01:20:24 - INFO - __main__ - Step 29394: {'lr': 0.00045891395657826595, 'samples': 5643648, 'steps': 29393, 'loss/train': 1.761458158493042} 11/07/2021 01:20:24 - INFO - __main__ - Step 29395: {'lr': 0.0004589110417836183, 'samples': 5643840, 'steps': 29394, 'loss/train': 1.4254180192947388} 11/07/2021 01:20:24 - INFO - __main__ - Step 29396: {'lr': 0.0004589081268948386, 'samples': 5644032, 'steps': 29395, 'loss/train': 0.7713015675544739} 11/07/2021 01:20:25 - INFO - __main__ - Step 29397: {'lr': 0.00045890521191192807, 'samples': 5644224, 'steps': 29396, 'loss/train': 1.3440076112747192} 11/07/2021 01:20:26 - INFO - __main__ - Step 29398: {'lr': 0.0004589022968348881, 'samples': 5644416, 'steps': 29397, 'loss/train': 1.3670624494552612} 11/07/2021 01:20:26 - INFO - __main__ - Step 29399: {'lr': 0.0004588993816637199, 'samples': 5644608, 'steps': 29398, 'loss/train': 1.271843433380127} 11/07/2021 01:20:26 - INFO - __main__ - Step 29400: {'lr': 0.00045889646639842496, 'samples': 5644800, 'steps': 29399, 'loss/train': 1.7491928339004517} 11/07/2021 01:20:27 - INFO - __main__ - Step 29401: {'lr': 0.0004588935510390045, 'samples': 5644992, 'steps': 29400, 'loss/train': 1.18025541305542} 11/07/2021 01:20:27 - INFO - __main__ - Step 29402: {'lr': 0.00045889063558545974, 'samples': 5645184, 'steps': 29401, 'loss/train': 1.5742133855819702} 11/07/2021 01:20:28 - INFO - __main__ - Step 29403: {'lr': 0.0004588877200377921, 'samples': 5645376, 'steps': 29402, 'loss/train': 1.396234154701233} 11/07/2021 01:20:29 - INFO - __main__ - Step 29404: {'lr': 0.000458884804396003, 'samples': 5645568, 'steps': 29403, 'loss/train': 1.0069153308868408} 11/07/2021 01:20:29 - INFO - __main__ - Step 29405: {'lr': 0.0004588818886600935, 'samples': 5645760, 'steps': 29404, 'loss/train': 1.5877037048339844} 11/07/2021 01:20:29 - INFO - __main__ - Step 29406: {'lr': 0.00045887897283006506, 'samples': 5645952, 'steps': 29405, 'loss/train': 1.3609683513641357} 11/07/2021 01:20:30 - INFO - __main__ - Step 29407: {'lr': 0.00045887605690591904, 'samples': 5646144, 'steps': 29406, 'loss/train': 1.4616987705230713} 11/07/2021 01:20:30 - INFO - __main__ - Step 29408: {'lr': 0.0004588731408876566, 'samples': 5646336, 'steps': 29407, 'loss/train': 1.570209264755249} 11/07/2021 01:20:31 - INFO - __main__ - Step 29409: {'lr': 0.00045887022477527923, 'samples': 5646528, 'steps': 29408, 'loss/train': 1.9197208881378174} 11/07/2021 01:20:32 - INFO - __main__ - Step 29410: {'lr': 0.0004588673085687881, 'samples': 5646720, 'steps': 29409, 'loss/train': 1.5169775485992432} 11/07/2021 01:20:32 - INFO - __main__ - Step 29411: {'lr': 0.00045886439226818464, 'samples': 5646912, 'steps': 29410, 'loss/train': 1.6643450260162354} 11/07/2021 01:20:32 - INFO - __main__ - Step 29412: {'lr': 0.0004588614758734701, 'samples': 5647104, 'steps': 29411, 'loss/train': 1.5076509714126587} 11/07/2021 01:20:33 - INFO - __main__ - Step 29413: {'lr': 0.0004588585593846458, 'samples': 5647296, 'steps': 29412, 'loss/train': 0.8832883834838867} 11/07/2021 01:20:33 - INFO - __main__ - Step 29414: {'lr': 0.000458855642801713, 'samples': 5647488, 'steps': 29413, 'loss/train': 1.680592656135559} 11/07/2021 01:20:34 - INFO - __main__ - Step 29415: {'lr': 0.00045885272612467313, 'samples': 5647680, 'steps': 29414, 'loss/train': 1.6815725564956665} 11/07/2021 01:20:34 - INFO - __main__ - Step 29416: {'lr': 0.0004588498093535274, 'samples': 5647872, 'steps': 29415, 'loss/train': 1.485547423362732} 11/07/2021 01:20:35 - INFO - __main__ - Step 29417: {'lr': 0.0004588468924882772, 'samples': 5648064, 'steps': 29416, 'loss/train': 1.5461474657058716} 11/07/2021 01:20:35 - INFO - __main__ - Step 29418: {'lr': 0.0004588439755289238, 'samples': 5648256, 'steps': 29417, 'loss/train': 1.3511896133422852} 11/07/2021 01:20:35 - INFO - __main__ - Step 29419: {'lr': 0.00045884105847546853, 'samples': 5648448, 'steps': 29418, 'loss/train': 2.118772506713867} 11/07/2021 01:20:37 - INFO - __main__ - Step 29420: {'lr': 0.00045883814132791274, 'samples': 5648640, 'steps': 29419, 'loss/train': 1.338430643081665} 11/07/2021 01:20:37 - INFO - __main__ - Step 29421: {'lr': 0.0004588352240862577, 'samples': 5648832, 'steps': 29420, 'loss/train': 1.2146153450012207} 11/07/2021 01:20:37 - INFO - __main__ - Step 29422: {'lr': 0.0004588323067505047, 'samples': 5649024, 'steps': 29421, 'loss/train': 1.3759852647781372} 11/07/2021 01:20:38 - INFO - __main__ - Step 29423: {'lr': 0.00045882938932065504, 'samples': 5649216, 'steps': 29422, 'loss/train': 1.1928725242614746} 11/07/2021 01:20:38 - INFO - __main__ - Step 29424: {'lr': 0.0004588264717967101, 'samples': 5649408, 'steps': 29423, 'loss/train': 1.5485185384750366} 11/07/2021 01:20:39 - INFO - __main__ - Step 29425: {'lr': 0.00045882355417867124, 'samples': 5649600, 'steps': 29424, 'loss/train': 1.6709578037261963} 11/07/2021 01:20:39 - INFO - __main__ - Step 29426: {'lr': 0.00045882063646653966, 'samples': 5649792, 'steps': 29425, 'loss/train': 1.5203857421875} 11/07/2021 01:20:40 - INFO - __main__ - Step 29427: {'lr': 0.00045881771866031673, 'samples': 5649984, 'steps': 29426, 'loss/train': 1.921944260597229} 11/07/2021 01:20:40 - INFO - __main__ - Step 29428: {'lr': 0.00045881480076000376, 'samples': 5650176, 'steps': 29427, 'loss/train': 1.3370532989501953} 11/07/2021 01:20:40 - INFO - __main__ - Step 29429: {'lr': 0.00045881188276560204, 'samples': 5650368, 'steps': 29428, 'loss/train': 2.0083775520324707} 11/07/2021 01:20:42 - INFO - __main__ - Step 29430: {'lr': 0.000458808964677113, 'samples': 5650560, 'steps': 29429, 'loss/train': 1.303795576095581} 11/07/2021 01:20:42 - INFO - __main__ - Step 29431: {'lr': 0.00045880604649453774, 'samples': 5650752, 'steps': 29430, 'loss/train': 1.9484096765518188} 11/07/2021 01:20:42 - INFO - __main__ - Step 29432: {'lr': 0.00045880312821787775, 'samples': 5650944, 'steps': 29431, 'loss/train': 1.3311874866485596} 11/07/2021 01:20:43 - INFO - __main__ - Step 29433: {'lr': 0.00045880020984713434, 'samples': 5651136, 'steps': 29432, 'loss/train': 1.860125184059143} 11/07/2021 01:20:43 - INFO - __main__ - Step 29434: {'lr': 0.0004587972913823087, 'samples': 5651328, 'steps': 29433, 'loss/train': 1.5912048816680908} 11/07/2021 01:20:43 - INFO - __main__ - Step 29435: {'lr': 0.00045879437282340225, 'samples': 5651520, 'steps': 29434, 'loss/train': 1.089166522026062} 11/07/2021 01:20:44 - INFO - __main__ - Step 29436: {'lr': 0.00045879145417041623, 'samples': 5651712, 'steps': 29435, 'loss/train': 1.9212347269058228} 11/07/2021 01:20:45 - INFO - __main__ - Step 29437: {'lr': 0.0004587885354233521, 'samples': 5651904, 'steps': 29436, 'loss/train': 1.658786416053772} 11/07/2021 01:20:45 - INFO - __main__ - Step 29438: {'lr': 0.0004587856165822111, 'samples': 5652096, 'steps': 29437, 'loss/train': 1.6033729314804077} 11/07/2021 01:20:45 - INFO - __main__ - Step 29439: {'lr': 0.0004587826976469944, 'samples': 5652288, 'steps': 29438, 'loss/train': 1.6085463762283325} 11/07/2021 01:20:46 - INFO - __main__ - Step 29440: {'lr': 0.0004587797786177035, 'samples': 5652480, 'steps': 29439, 'loss/train': 2.093695640563965} 11/07/2021 01:20:46 - INFO - __main__ - Step 29441: {'lr': 0.0004587768594943396, 'samples': 5652672, 'steps': 29440, 'loss/train': 1.3613848686218262} 11/07/2021 01:20:47 - INFO - __main__ - Step 29442: {'lr': 0.00045877394027690413, 'samples': 5652864, 'steps': 29441, 'loss/train': 1.7916233539581299} 11/07/2021 01:20:47 - INFO - __main__ - Step 29443: {'lr': 0.0004587710209653984, 'samples': 5653056, 'steps': 29442, 'loss/train': 1.3992745876312256} 11/07/2021 01:20:48 - INFO - __main__ - Step 29444: {'lr': 0.0004587681015598235, 'samples': 5653248, 'steps': 29443, 'loss/train': 1.518426775932312} 11/07/2021 01:20:48 - INFO - __main__ - Step 29445: {'lr': 0.00045876518206018103, 'samples': 5653440, 'steps': 29444, 'loss/train': 1.5235440731048584} 11/07/2021 01:20:48 - INFO - __main__ - Step 29446: {'lr': 0.00045876226246647226, 'samples': 5653632, 'steps': 29445, 'loss/train': 1.6063261032104492} 11/07/2021 01:20:50 - INFO - __main__ - Step 29447: {'lr': 0.0004587593427786983, 'samples': 5653824, 'steps': 29446, 'loss/train': 1.4011942148208618} 11/07/2021 01:20:50 - INFO - __main__ - Step 29448: {'lr': 0.0004587564229968606, 'samples': 5654016, 'steps': 29447, 'loss/train': 1.538805603981018} 11/07/2021 01:20:50 - INFO - __main__ - Step 29449: {'lr': 0.00045875350312096053, 'samples': 5654208, 'steps': 29448, 'loss/train': 1.532183289527893} 11/07/2021 01:20:51 - INFO - __main__ - Step 29450: {'lr': 0.0004587505831509994, 'samples': 5654400, 'steps': 29449, 'loss/train': 1.8191397190093994} 11/07/2021 01:20:51 - INFO - __main__ - Step 29451: {'lr': 0.0004587476630869784, 'samples': 5654592, 'steps': 29450, 'loss/train': 1.310355305671692} 11/07/2021 01:20:52 - INFO - __main__ - Step 29452: {'lr': 0.000458744742928899, 'samples': 5654784, 'steps': 29451, 'loss/train': 0.8070980906486511} 11/07/2021 01:20:52 - INFO - __main__ - Step 29453: {'lr': 0.00045874182267676236, 'samples': 5654976, 'steps': 29452, 'loss/train': 2.1116678714752197} 11/07/2021 01:20:53 - INFO - __main__ - Step 29454: {'lr': 0.0004587389023305699, 'samples': 5655168, 'steps': 29453, 'loss/train': 1.662084698677063} 11/07/2021 01:20:53 - INFO - __main__ - Step 29455: {'lr': 0.00045873598189032295, 'samples': 5655360, 'steps': 29454, 'loss/train': 0.8377273082733154} 11/07/2021 01:20:53 - INFO - __main__ - Step 29456: {'lr': 0.00045873306135602276, 'samples': 5655552, 'steps': 29455, 'loss/train': 1.3423774242401123} 11/07/2021 01:20:55 - INFO - __main__ - Step 29457: {'lr': 0.00045873014072767064, 'samples': 5655744, 'steps': 29456, 'loss/train': 1.6420247554779053} 11/07/2021 01:20:55 - INFO - __main__ - Step 29458: {'lr': 0.000458727220005268, 'samples': 5655936, 'steps': 29457, 'loss/train': 1.5810794830322266} 11/07/2021 01:20:55 - INFO - __main__ - Step 29459: {'lr': 0.00045872429918881606, 'samples': 5656128, 'steps': 29458, 'loss/train': 1.1572012901306152} 11/07/2021 01:20:56 - INFO - __main__ - Step 29460: {'lr': 0.00045872137827831616, 'samples': 5656320, 'steps': 29459, 'loss/train': 0.9177290797233582} 11/07/2021 01:20:56 - INFO - __main__ - Step 29461: {'lr': 0.00045871845727376973, 'samples': 5656512, 'steps': 29460, 'loss/train': 1.4853347539901733} 11/07/2021 01:20:57 - INFO - __main__ - Step 29462: {'lr': 0.0004587155361751778, 'samples': 5656704, 'steps': 29461, 'loss/train': 1.3394935131072998} 11/07/2021 01:20:57 - INFO - __main__ - Step 29463: {'lr': 0.000458712614982542, 'samples': 5656896, 'steps': 29462, 'loss/train': 1.614371418952942} 11/07/2021 01:20:58 - INFO - __main__ - Step 29464: {'lr': 0.00045870969369586346, 'samples': 5657088, 'steps': 29463, 'loss/train': 1.7958014011383057} 11/07/2021 01:20:58 - INFO - __main__ - Step 29465: {'lr': 0.00045870677231514356, 'samples': 5657280, 'steps': 29464, 'loss/train': 1.4317177534103394} 11/07/2021 01:20:59 - INFO - __main__ - Step 29466: {'lr': 0.0004587038508403837, 'samples': 5657472, 'steps': 29465, 'loss/train': 1.5790847539901733} 11/07/2021 01:21:00 - INFO - __main__ - Step 29467: {'lr': 0.000458700929271585, 'samples': 5657664, 'steps': 29466, 'loss/train': 1.7337079048156738} 11/07/2021 01:21:00 - INFO - __main__ - Step 29468: {'lr': 0.0004586980076087489, 'samples': 5657856, 'steps': 29467, 'loss/train': 1.7457712888717651} 11/07/2021 01:21:00 - INFO - __main__ - Step 29469: {'lr': 0.0004586950858518767, 'samples': 5658048, 'steps': 29468, 'loss/train': 1.7356659173965454} 11/07/2021 01:21:01 - INFO - __main__ - Step 29470: {'lr': 0.0004586921640009697, 'samples': 5658240, 'steps': 29469, 'loss/train': 1.5608547925949097} 11/07/2021 01:21:01 - INFO - __main__ - Step 29471: {'lr': 0.0004586892420560294, 'samples': 5658432, 'steps': 29470, 'loss/train': 1.1268757581710815} 11/07/2021 01:21:02 - INFO - __main__ - Step 29472: {'lr': 0.0004586863200170567, 'samples': 5658624, 'steps': 29471, 'loss/train': 0.8702432513237} 11/07/2021 01:21:03 - INFO - __main__ - Step 29473: {'lr': 0.00045868339788405333, 'samples': 5658816, 'steps': 29472, 'loss/train': 1.633955955505371} 11/07/2021 01:21:03 - INFO - __main__ - Step 29474: {'lr': 0.0004586804756570204, 'samples': 5659008, 'steps': 29473, 'loss/train': 1.199129343032837} 11/07/2021 01:21:03 - INFO - __main__ - Step 29475: {'lr': 0.0004586775533359592, 'samples': 5659200, 'steps': 29474, 'loss/train': 2.014317750930786} 11/07/2021 01:21:04 - INFO - __main__ - Step 29476: {'lr': 0.00045867463092087116, 'samples': 5659392, 'steps': 29475, 'loss/train': 1.0253338813781738} 11/07/2021 01:21:05 - INFO - __main__ - Step 29477: {'lr': 0.00045867170841175755, 'samples': 5659584, 'steps': 29476, 'loss/train': 0.2625601887702942} 11/07/2021 01:21:05 - INFO - __main__ - Step 29478: {'lr': 0.0004586687858086197, 'samples': 5659776, 'steps': 29477, 'loss/train': 1.3529233932495117} 11/07/2021 01:21:05 - INFO - __main__ - Step 29479: {'lr': 0.0004586658631114589, 'samples': 5659968, 'steps': 29478, 'loss/train': 1.5325437784194946} 11/07/2021 01:21:06 - INFO - __main__ - Step 29480: {'lr': 0.0004586629403202765, 'samples': 5660160, 'steps': 29479, 'loss/train': 1.3387819528579712} 11/07/2021 01:21:06 - INFO - __main__ - Step 29481: {'lr': 0.0004586600174350738, 'samples': 5660352, 'steps': 29480, 'loss/train': 2.0539634227752686} 11/07/2021 01:21:07 - INFO - __main__ - Step 29482: {'lr': 0.0004586570944558521, 'samples': 5660544, 'steps': 29481, 'loss/train': 0.8757557272911072} 11/07/2021 01:21:08 - INFO - __main__ - Step 29483: {'lr': 0.00045865417138261276, 'samples': 5660736, 'steps': 29482, 'loss/train': 1.633339762687683} 11/07/2021 01:21:08 - INFO - __main__ - Step 29484: {'lr': 0.00045865124821535704, 'samples': 5660928, 'steps': 29483, 'loss/train': 1.6316306591033936} 11/07/2021 01:21:08 - INFO - __main__ - Step 29485: {'lr': 0.00045864832495408624, 'samples': 5661120, 'steps': 29484, 'loss/train': 2.1339690685272217} 11/07/2021 01:21:09 - INFO - __main__ - Step 29486: {'lr': 0.0004586454015988019, 'samples': 5661312, 'steps': 29485, 'loss/train': 1.2461416721343994} 11/07/2021 01:21:09 - INFO - __main__ - Step 29487: {'lr': 0.000458642478149505, 'samples': 5661504, 'steps': 29486, 'loss/train': 1.9071553945541382} 11/07/2021 01:21:11 - INFO - __main__ - Step 29488: {'lr': 0.00045863955460619707, 'samples': 5661696, 'steps': 29487, 'loss/train': 1.6638797521591187} 11/07/2021 01:21:12 - INFO - __main__ - Step 29489: {'lr': 0.0004586366309688793, 'samples': 5661888, 'steps': 29488, 'loss/train': 0.8595585823059082} 11/07/2021 01:21:12 - INFO - __main__ - Step 29490: {'lr': 0.00045863370723755315, 'samples': 5662080, 'steps': 29489, 'loss/train': 1.6457905769348145} 11/07/2021 01:21:12 - INFO - __main__ - Step 29491: {'lr': 0.00045863078341221993, 'samples': 5662272, 'steps': 29490, 'loss/train': 1.3534252643585205} 11/07/2021 01:21:13 - INFO - __main__ - Step 29492: {'lr': 0.0004586278594928808, 'samples': 5662464, 'steps': 29491, 'loss/train': 1.8189722299575806} 11/07/2021 01:21:13 - INFO - __main__ - Step 29493: {'lr': 0.0004586249354795372, 'samples': 5662656, 'steps': 29492, 'loss/train': 1.8026632070541382} 11/07/2021 01:21:13 - INFO - __main__ - Step 29494: {'lr': 0.0004586220113721905, 'samples': 5662848, 'steps': 29493, 'loss/train': 1.0064070224761963} 11/07/2021 01:21:14 - INFO - __main__ - Step 29495: {'lr': 0.0004586190871708419, 'samples': 5663040, 'steps': 29494, 'loss/train': 1.061107873916626} 11/07/2021 01:21:15 - INFO - __main__ - Step 29496: {'lr': 0.0004586161628754927, 'samples': 5663232, 'steps': 29495, 'loss/train': 1.4288783073425293} 11/07/2021 01:21:15 - INFO - __main__ - Step 29497: {'lr': 0.0004586132384861443, 'samples': 5663424, 'steps': 29496, 'loss/train': 0.9088807106018066} 11/07/2021 01:21:16 - INFO - __main__ - Step 29498: {'lr': 0.000458610314002798, 'samples': 5663616, 'steps': 29497, 'loss/train': 1.5161337852478027} 11/07/2021 01:21:16 - INFO - __main__ - Step 29499: {'lr': 0.0004586073894254551, 'samples': 5663808, 'steps': 29498, 'loss/train': 2.5317184925079346} 11/07/2021 01:21:16 - INFO - __main__ - Step 29500: {'lr': 0.000458604464754117, 'samples': 5664000, 'steps': 29499, 'loss/train': 1.939312219619751} 11/07/2021 01:21:18 - INFO - __main__ - Step 29501: {'lr': 0.0004586015399887849, 'samples': 5664192, 'steps': 29500, 'loss/train': 1.8874361515045166} 11/07/2021 01:21:18 - INFO - __main__ - Step 29502: {'lr': 0.0004585986151294602, 'samples': 5664384, 'steps': 29501, 'loss/train': 1.2751479148864746} 11/07/2021 01:21:18 - INFO - __main__ - Step 29503: {'lr': 0.0004585956901761441, 'samples': 5664576, 'steps': 29502, 'loss/train': 1.4074900150299072} 11/07/2021 01:21:19 - INFO - __main__ - Step 29504: {'lr': 0.00045859276512883807, 'samples': 5664768, 'steps': 29503, 'loss/train': 1.9444576501846313} 11/07/2021 01:21:19 - INFO - __main__ - Step 29505: {'lr': 0.00045858983998754336, 'samples': 5664960, 'steps': 29504, 'loss/train': 1.9173896312713623} 11/07/2021 01:21:20 - INFO - __main__ - Step 29506: {'lr': 0.0004585869147522612, 'samples': 5665152, 'steps': 29505, 'loss/train': 1.8542300462722778} 11/07/2021 01:21:20 - INFO - __main__ - Step 29507: {'lr': 0.00045858398942299306, 'samples': 5665344, 'steps': 29506, 'loss/train': 1.4023475646972656} 11/07/2021 01:21:21 - INFO - __main__ - Step 29508: {'lr': 0.0004585810639997402, 'samples': 5665536, 'steps': 29507, 'loss/train': 1.542800784111023} 11/07/2021 01:21:21 - INFO - __main__ - Step 29509: {'lr': 0.0004585781384825039, 'samples': 5665728, 'steps': 29508, 'loss/train': 2.0159964561462402} 11/07/2021 01:21:21 - INFO - __main__ - Step 29510: {'lr': 0.00045857521287128556, 'samples': 5665920, 'steps': 29509, 'loss/train': 1.922012448310852} 11/07/2021 01:21:22 - INFO - __main__ - Step 29511: {'lr': 0.0004585722871660864, 'samples': 5666112, 'steps': 29510, 'loss/train': 1.5604584217071533} 11/07/2021 01:21:23 - INFO - __main__ - Step 29512: {'lr': 0.0004585693613669078, 'samples': 5666304, 'steps': 29511, 'loss/train': 2.035801887512207} 11/07/2021 01:21:23 - INFO - __main__ - Step 29513: {'lr': 0.0004585664354737511, 'samples': 5666496, 'steps': 29512, 'loss/train': 1.9828746318817139} 11/07/2021 01:21:23 - INFO - __main__ - Step 29514: {'lr': 0.0004585635094866175, 'samples': 5666688, 'steps': 29513, 'loss/train': 1.7652790546417236} 11/07/2021 01:21:24 - INFO - __main__ - Step 29515: {'lr': 0.0004585605834055084, 'samples': 5666880, 'steps': 29514, 'loss/train': 2.0004334449768066} 11/07/2021 01:21:24 - INFO - __main__ - Step 29516: {'lr': 0.00045855765723042526, 'samples': 5667072, 'steps': 29515, 'loss/train': 1.1060426235198975} 11/07/2021 01:21:26 - INFO - __main__ - Step 29517: {'lr': 0.00045855473096136914, 'samples': 5667264, 'steps': 29516, 'loss/train': 1.4863965511322021} 11/07/2021 01:21:26 - INFO - __main__ - Step 29518: {'lr': 0.00045855180459834153, 'samples': 5667456, 'steps': 29517, 'loss/train': 1.6433055400848389} 11/07/2021 01:21:26 - INFO - __main__ - Step 29519: {'lr': 0.0004585488781413437, 'samples': 5667648, 'steps': 29518, 'loss/train': 1.927058219909668} 11/07/2021 01:21:27 - INFO - __main__ - Step 29520: {'lr': 0.00045854595159037695, 'samples': 5667840, 'steps': 29519, 'loss/train': 1.3411040306091309} 11/07/2021 01:21:27 - INFO - __main__ - Step 29521: {'lr': 0.0004585430249454425, 'samples': 5668032, 'steps': 29520, 'loss/train': 0.4509488046169281} 11/07/2021 01:21:28 - INFO - __main__ - Step 29522: {'lr': 0.000458540098206542, 'samples': 5668224, 'steps': 29521, 'loss/train': 1.7586899995803833} 11/07/2021 01:21:28 - INFO - __main__ - Step 29523: {'lr': 0.00045853717137367634, 'samples': 5668416, 'steps': 29522, 'loss/train': 1.6397231817245483} 11/07/2021 01:21:29 - INFO - __main__ - Step 29524: {'lr': 0.0004585342444468471, 'samples': 5668608, 'steps': 29523, 'loss/train': 1.4895061254501343} 11/07/2021 01:21:29 - INFO - __main__ - Step 29525: {'lr': 0.00045853131742605563, 'samples': 5668800, 'steps': 29524, 'loss/train': 1.8235915899276733} 11/07/2021 01:21:29 - INFO - __main__ - Step 29526: {'lr': 0.0004585283903113031, 'samples': 5668992, 'steps': 29525, 'loss/train': 1.258795142173767} 11/07/2021 01:21:30 - INFO - __main__ - Step 29527: {'lr': 0.00045852546310259093, 'samples': 5669184, 'steps': 29526, 'loss/train': 1.4527385234832764} 11/07/2021 01:21:31 - INFO - __main__ - Step 29528: {'lr': 0.00045852253579992043, 'samples': 5669376, 'steps': 29527, 'loss/train': 1.888344168663025} 11/07/2021 01:21:31 - INFO - __main__ - Step 29529: {'lr': 0.0004585196084032928, 'samples': 5669568, 'steps': 29528, 'loss/train': 1.6944209337234497} 11/07/2021 01:21:32 - INFO - __main__ - Step 29530: {'lr': 0.0004585166809127095, 'samples': 5669760, 'steps': 29529, 'loss/train': 1.6376452445983887} 11/07/2021 01:21:32 - INFO - __main__ - Step 29531: {'lr': 0.0004585137533281718, 'samples': 5669952, 'steps': 29530, 'loss/train': 1.8920257091522217} 11/07/2021 01:21:33 - INFO - __main__ - Step 29532: {'lr': 0.00045851082564968103, 'samples': 5670144, 'steps': 29531, 'loss/train': 1.875730276107788} 11/07/2021 01:21:33 - INFO - __main__ - Step 29533: {'lr': 0.0004585078978772385, 'samples': 5670336, 'steps': 29532, 'loss/train': 1.7644169330596924} 11/07/2021 01:21:34 - INFO - __main__ - Step 29534: {'lr': 0.0004585049700108455, 'samples': 5670528, 'steps': 29533, 'loss/train': 1.3630928993225098} 11/07/2021 01:21:34 - INFO - __main__ - Step 29535: {'lr': 0.00045850204205050344, 'samples': 5670720, 'steps': 29534, 'loss/train': 1.7111839056015015} 11/07/2021 01:21:34 - INFO - __main__ - Step 29536: {'lr': 0.0004584991139962135, 'samples': 5670912, 'steps': 29535, 'loss/train': 1.4443424940109253} 11/07/2021 01:21:35 - INFO - __main__ - Step 29537: {'lr': 0.00045849618584797717, 'samples': 5671104, 'steps': 29536, 'loss/train': 1.442642092704773} 11/07/2021 01:21:36 - INFO - __main__ - Step 29538: {'lr': 0.0004584932576057956, 'samples': 5671296, 'steps': 29537, 'loss/train': 1.4730128049850464} 11/07/2021 01:21:36 - INFO - __main__ - Step 29539: {'lr': 0.00045849032926967016, 'samples': 5671488, 'steps': 29538, 'loss/train': 1.708472728729248} 11/07/2021 01:21:36 - INFO - __main__ - Step 29540: {'lr': 0.0004584874008396023, 'samples': 5671680, 'steps': 29539, 'loss/train': 1.5613409280776978} 11/07/2021 01:21:37 - INFO - __main__ - Step 29541: {'lr': 0.00045848447231559315, 'samples': 5671872, 'steps': 29540, 'loss/train': 1.6380871534347534} 11/07/2021 01:21:38 - INFO - __main__ - Step 29542: {'lr': 0.00045848154369764415, 'samples': 5672064, 'steps': 29541, 'loss/train': 1.8458172082901} 11/07/2021 01:21:38 - INFO - __main__ - Step 29543: {'lr': 0.0004584786149857566, 'samples': 5672256, 'steps': 29542, 'loss/train': 1.5878684520721436} 11/07/2021 01:21:39 - INFO - __main__ - Step 29544: {'lr': 0.00045847568617993174, 'samples': 5672448, 'steps': 29543, 'loss/train': 1.3373163938522339} 11/07/2021 01:21:39 - INFO - __main__ - Step 29545: {'lr': 0.000458472757280171, 'samples': 5672640, 'steps': 29544, 'loss/train': 1.7970337867736816} 11/07/2021 01:21:39 - INFO - __main__ - Step 29546: {'lr': 0.0004584698282864757, 'samples': 5672832, 'steps': 29545, 'loss/train': 1.8570822477340698} 11/07/2021 01:21:40 - INFO - __main__ - Step 29547: {'lr': 0.000458466899198847, 'samples': 5673024, 'steps': 29546, 'loss/train': 1.45450758934021} 11/07/2021 01:21:41 - INFO - __main__ - Step 29548: {'lr': 0.0004584639700172863, 'samples': 5673216, 'steps': 29547, 'loss/train': 1.361185073852539} 11/07/2021 01:21:41 - INFO - __main__ - Step 29549: {'lr': 0.00045846104074179504, 'samples': 5673408, 'steps': 29548, 'loss/train': 1.5456278324127197} 11/07/2021 01:21:41 - INFO - __main__ - Step 29550: {'lr': 0.00045845811137237445, 'samples': 5673600, 'steps': 29549, 'loss/train': 2.0732905864715576} 11/07/2021 01:21:42 - INFO - __main__ - Step 29551: {'lr': 0.0004584551819090259, 'samples': 5673792, 'steps': 29550, 'loss/train': 1.9945400953292847} 11/07/2021 01:21:42 - INFO - __main__ - Step 29552: {'lr': 0.0004584522523517506, 'samples': 5673984, 'steps': 29551, 'loss/train': 1.3577299118041992} 11/07/2021 01:21:43 - INFO - __main__ - Step 29553: {'lr': 0.00045844932270054997, 'samples': 5674176, 'steps': 29552, 'loss/train': 2.1527366638183594} 11/07/2021 01:21:43 - INFO - __main__ - Step 29554: {'lr': 0.00045844639295542525, 'samples': 5674368, 'steps': 29553, 'loss/train': 1.4013590812683105} 11/07/2021 01:21:44 - INFO - __main__ - Step 29555: {'lr': 0.0004584434631163779, 'samples': 5674560, 'steps': 29554, 'loss/train': 1.350577712059021} 11/07/2021 01:21:44 - INFO - __main__ - Step 29556: {'lr': 0.000458440533183409, 'samples': 5674752, 'steps': 29555, 'loss/train': 1.1925235986709595} 11/07/2021 01:21:44 - INFO - __main__ - Step 29557: {'lr': 0.0004584376031565201, 'samples': 5674944, 'steps': 29556, 'loss/train': 1.6832846403121948} 11/07/2021 01:21:46 - INFO - __main__ - Step 29558: {'lr': 0.0004584346730357124, 'samples': 5675136, 'steps': 29557, 'loss/train': 1.5070879459381104} 11/07/2021 01:21:46 - INFO - __main__ - Step 29559: {'lr': 0.0004584317428209872, 'samples': 5675328, 'steps': 29558, 'loss/train': 1.5490034818649292} 11/07/2021 01:21:46 - INFO - __main__ - Step 29560: {'lr': 0.0004584288125123459, 'samples': 5675520, 'steps': 29559, 'loss/train': 1.1577174663543701} 11/07/2021 01:21:47 - INFO - __main__ - Step 29561: {'lr': 0.0004584258821097899, 'samples': 5675712, 'steps': 29560, 'loss/train': 1.5316812992095947} 11/07/2021 01:21:47 - INFO - __main__ - Step 29562: {'lr': 0.0004584229516133203, 'samples': 5675904, 'steps': 29561, 'loss/train': 1.6711317300796509} 11/07/2021 01:21:48 - INFO - __main__ - Step 29563: {'lr': 0.00045842002102293856, 'samples': 5676096, 'steps': 29562, 'loss/train': 1.4156216382980347} 11/07/2021 01:21:48 - INFO - __main__ - Step 29564: {'lr': 0.000458417090338646, 'samples': 5676288, 'steps': 29563, 'loss/train': 1.5477793216705322} 11/07/2021 01:21:49 - INFO - __main__ - Step 29565: {'lr': 0.00045841415956044394, 'samples': 5676480, 'steps': 29564, 'loss/train': 0.47286054491996765} 11/07/2021 01:21:49 - INFO - __main__ - Step 29566: {'lr': 0.0004584112286883336, 'samples': 5676672, 'steps': 29565, 'loss/train': 1.372520089149475} 11/07/2021 01:21:49 - INFO - __main__ - Step 29567: {'lr': 0.0004584082977223164, 'samples': 5676864, 'steps': 29566, 'loss/train': 1.6289653778076172} 11/07/2021 01:21:51 - INFO - __main__ - Step 29568: {'lr': 0.0004584053666623937, 'samples': 5677056, 'steps': 29567, 'loss/train': 1.2834872007369995} 11/07/2021 01:21:51 - INFO - __main__ - Step 29569: {'lr': 0.00045840243550856666, 'samples': 5677248, 'steps': 29568, 'loss/train': 1.9355740547180176} 11/07/2021 01:21:51 - INFO - __main__ - Step 29570: {'lr': 0.00045839950426083677, 'samples': 5677440, 'steps': 29569, 'loss/train': 1.7786345481872559} 11/07/2021 01:21:52 - INFO - __main__ - Step 29571: {'lr': 0.0004583965729192052, 'samples': 5677632, 'steps': 29570, 'loss/train': 1.6350317001342773} 11/07/2021 01:21:52 - INFO - __main__ - Step 29572: {'lr': 0.00045839364148367345, 'samples': 5677824, 'steps': 29571, 'loss/train': 1.5409319400787354} 11/07/2021 01:21:52 - INFO - __main__ - Step 29573: {'lr': 0.00045839070995424273, 'samples': 5678016, 'steps': 29572, 'loss/train': 1.367226004600525} 11/07/2021 01:21:53 - INFO - __main__ - Step 29574: {'lr': 0.00045838777833091425, 'samples': 5678208, 'steps': 29573, 'loss/train': 1.731628656387329} 11/07/2021 01:21:54 - INFO - __main__ - Step 29575: {'lr': 0.00045838484661368963, 'samples': 5678400, 'steps': 29574, 'loss/train': 1.5066232681274414} 11/07/2021 01:21:54 - INFO - __main__ - Step 29576: {'lr': 0.00045838191480256985, 'samples': 5678592, 'steps': 29575, 'loss/train': 1.3395975828170776} 11/07/2021 01:21:54 - INFO - __main__ - Step 29577: {'lr': 0.00045837898289755654, 'samples': 5678784, 'steps': 29576, 'loss/train': 1.3253049850463867} 11/07/2021 01:21:55 - INFO - __main__ - Step 29578: {'lr': 0.0004583760508986508, 'samples': 5678976, 'steps': 29577, 'loss/train': 1.2868000268936157} 11/07/2021 01:21:56 - INFO - __main__ - Step 29579: {'lr': 0.000458373118805854, 'samples': 5679168, 'steps': 29578, 'loss/train': 1.4119806289672852} 11/07/2021 01:21:56 - INFO - __main__ - Step 29580: {'lr': 0.00045837018661916754, 'samples': 5679360, 'steps': 29579, 'loss/train': 1.2361005544662476} 11/07/2021 01:21:56 - INFO - __main__ - Step 29581: {'lr': 0.00045836725433859266, 'samples': 5679552, 'steps': 29580, 'loss/train': 2.188758611679077} 11/07/2021 01:21:57 - INFO - __main__ - Step 29582: {'lr': 0.0004583643219641307, 'samples': 5679744, 'steps': 29581, 'loss/train': 1.2891649007797241} 11/07/2021 01:21:57 - INFO - __main__ - Step 29583: {'lr': 0.00045836138949578297, 'samples': 5679936, 'steps': 29582, 'loss/train': 1.5164471864700317} 11/07/2021 01:21:58 - INFO - __main__ - Step 29584: {'lr': 0.00045835845693355096, 'samples': 5680128, 'steps': 29583, 'loss/train': 1.9012441635131836} 11/07/2021 01:21:59 - INFO - __main__ - Step 29585: {'lr': 0.00045835552427743567, 'samples': 5680320, 'steps': 29584, 'loss/train': 1.8651920557022095} 11/07/2021 01:21:59 - INFO - __main__ - Step 29586: {'lr': 0.00045835259152743866, 'samples': 5680512, 'steps': 29585, 'loss/train': 1.9096503257751465} 11/07/2021 01:21:59 - INFO - __main__ - Step 29587: {'lr': 0.0004583496586835612, 'samples': 5680704, 'steps': 29586, 'loss/train': 1.5053143501281738} 11/07/2021 01:22:00 - INFO - __main__ - Step 29588: {'lr': 0.0004583467257458046, 'samples': 5680896, 'steps': 29587, 'loss/train': 0.6775304079055786} 11/07/2021 01:22:01 - INFO - __main__ - Step 29589: {'lr': 0.00045834379271417013, 'samples': 5681088, 'steps': 29588, 'loss/train': 1.7188389301300049} 11/07/2021 01:22:02 - INFO - __main__ - Step 29590: {'lr': 0.0004583408595886592, 'samples': 5681280, 'steps': 29589, 'loss/train': 1.8896228075027466} 11/07/2021 01:22:02 - INFO - __main__ - Step 29591: {'lr': 0.0004583379263692732, 'samples': 5681472, 'steps': 29590, 'loss/train': 1.986293077468872} 11/07/2021 01:22:02 - INFO - __main__ - Step 29592: {'lr': 0.0004583349930560132, 'samples': 5681664, 'steps': 29591, 'loss/train': 0.1942824423313141} 11/07/2021 01:22:03 - INFO - __main__ - Step 29593: {'lr': 0.0004583320596488807, 'samples': 5681856, 'steps': 29592, 'loss/train': 1.5070799589157104} 11/07/2021 01:22:04 - INFO - __main__ - Step 29594: {'lr': 0.000458329126147877, 'samples': 5682048, 'steps': 29593, 'loss/train': 2.0239007472991943} 11/07/2021 01:22:04 - INFO - __main__ - Step 29595: {'lr': 0.00045832619255300344, 'samples': 5682240, 'steps': 29594, 'loss/train': 1.57216215133667} 11/07/2021 01:22:05 - INFO - __main__ - Step 29596: {'lr': 0.00045832325886426125, 'samples': 5682432, 'steps': 29595, 'loss/train': 1.732596755027771} 11/07/2021 01:22:05 - INFO - __main__ - Step 29597: {'lr': 0.0004583203250816518, 'samples': 5682624, 'steps': 29596, 'loss/train': 1.4190809726715088} 11/07/2021 01:22:05 - INFO - __main__ - Step 29598: {'lr': 0.0004583173912051765, 'samples': 5682816, 'steps': 29597, 'loss/train': 0.721015989780426} 11/07/2021 01:22:06 - INFO - __main__ - Step 29599: {'lr': 0.00045831445723483656, 'samples': 5683008, 'steps': 29598, 'loss/train': 1.7590328454971313} 11/07/2021 01:22:07 - INFO - __main__ - Step 29600: {'lr': 0.0004583115231706334, 'samples': 5683200, 'steps': 29599, 'loss/train': 1.3749176263809204} 11/07/2021 01:22:07 - INFO - __main__ - Step 29601: {'lr': 0.0004583085890125682, 'samples': 5683392, 'steps': 29600, 'loss/train': 1.6116366386413574} 11/07/2021 01:22:07 - INFO - __main__ - Step 29602: {'lr': 0.0004583056547606424, 'samples': 5683584, 'steps': 29601, 'loss/train': 1.6327733993530273} 11/07/2021 01:22:08 - INFO - __main__ - Step 29603: {'lr': 0.0004583027204148573, 'samples': 5683776, 'steps': 29602, 'loss/train': 1.0818742513656616} 11/07/2021 01:22:08 - INFO - __main__ - Step 29604: {'lr': 0.0004582997859752142, 'samples': 5683968, 'steps': 29603, 'loss/train': 1.9879107475280762} 11/07/2021 01:22:09 - INFO - __main__ - Step 29605: {'lr': 0.0004582968514417144, 'samples': 5684160, 'steps': 29604, 'loss/train': 1.4256826639175415} 11/07/2021 01:22:10 - INFO - __main__ - Step 29606: {'lr': 0.00045829391681435926, 'samples': 5684352, 'steps': 29605, 'loss/train': 1.6640363931655884} 11/07/2021 01:22:10 - INFO - __main__ - Step 29607: {'lr': 0.0004582909820931501, 'samples': 5684544, 'steps': 29606, 'loss/train': 1.8146600723266602} 11/07/2021 01:22:10 - INFO - __main__ - Step 29608: {'lr': 0.00045828804727808824, 'samples': 5684736, 'steps': 29607, 'loss/train': 1.7391760349273682} 11/07/2021 01:22:11 - INFO - __main__ - Step 29609: {'lr': 0.000458285112369175, 'samples': 5684928, 'steps': 29608, 'loss/train': 1.596211314201355} 11/07/2021 01:22:12 - INFO - __main__ - Step 29610: {'lr': 0.0004582821773664118, 'samples': 5685120, 'steps': 29609, 'loss/train': 1.5998541116714478} 11/07/2021 01:22:12 - INFO - __main__ - Step 29611: {'lr': 0.0004582792422697997, 'samples': 5685312, 'steps': 29610, 'loss/train': 1.378007173538208} 11/07/2021 01:22:12 - INFO - __main__ - Step 29612: {'lr': 0.0004582763070793403, 'samples': 5685504, 'steps': 29611, 'loss/train': 1.8445258140563965} 11/07/2021 01:22:13 - INFO - __main__ - Step 29613: {'lr': 0.0004582733717950347, 'samples': 5685696, 'steps': 29612, 'loss/train': 1.6518462896347046} 11/07/2021 01:22:13 - INFO - __main__ - Step 29614: {'lr': 0.00045827043641688444, 'samples': 5685888, 'steps': 29613, 'loss/train': 1.6120516061782837} 11/07/2021 01:22:14 - INFO - __main__ - Step 29615: {'lr': 0.00045826750094489065, 'samples': 5686080, 'steps': 29614, 'loss/train': 1.4761862754821777} 11/07/2021 01:22:15 - INFO - __main__ - Step 29616: {'lr': 0.00045826456537905483, 'samples': 5686272, 'steps': 29615, 'loss/train': 1.6258699893951416} 11/07/2021 01:22:15 - INFO - __main__ - Step 29617: {'lr': 0.0004582616297193781, 'samples': 5686464, 'steps': 29616, 'loss/train': 1.48533296585083} 11/07/2021 01:22:15 - INFO - __main__ - Step 29618: {'lr': 0.000458258693965862, 'samples': 5686656, 'steps': 29617, 'loss/train': 1.9197696447372437} 11/07/2021 01:22:16 - INFO - __main__ - Step 29619: {'lr': 0.0004582557581185077, 'samples': 5686848, 'steps': 29618, 'loss/train': 0.8355976343154907} 11/07/2021 01:22:17 - INFO - __main__ - Step 29620: {'lr': 0.00045825282217731655, 'samples': 5687040, 'steps': 29619, 'loss/train': 1.4850707054138184} 11/07/2021 01:22:17 - INFO - __main__ - Step 29621: {'lr': 0.00045824988614228995, 'samples': 5687232, 'steps': 29620, 'loss/train': 0.415968120098114} 11/07/2021 01:22:17 - INFO - __main__ - Step 29622: {'lr': 0.0004582469500134292, 'samples': 5687424, 'steps': 29621, 'loss/train': 1.5455137491226196} 11/07/2021 01:22:18 - INFO - __main__ - Step 29623: {'lr': 0.00045824401379073544, 'samples': 5687616, 'steps': 29622, 'loss/train': 2.0614051818847656} 11/07/2021 01:22:18 - INFO - __main__ - Step 29624: {'lr': 0.0004582410774742103, 'samples': 5687808, 'steps': 29623, 'loss/train': 1.6692601442337036} 11/07/2021 01:22:20 - INFO - __main__ - Step 29625: {'lr': 0.00045823814106385485, 'samples': 5688000, 'steps': 29624, 'loss/train': 1.550163984298706} 11/07/2021 01:22:20 - INFO - __main__ - Step 29626: {'lr': 0.0004582352045596705, 'samples': 5688192, 'steps': 29625, 'loss/train': 1.4578899145126343} 11/07/2021 01:22:20 - INFO - __main__ - Step 29627: {'lr': 0.0004582322679616586, 'samples': 5688384, 'steps': 29626, 'loss/train': 2.6318256855010986} 11/07/2021 01:22:21 - INFO - __main__ - Step 29628: {'lr': 0.0004582293312698205, 'samples': 5688576, 'steps': 29627, 'loss/train': 1.6896731853485107} 11/07/2021 01:22:21 - INFO - __main__ - Step 29629: {'lr': 0.00045822639448415736, 'samples': 5688768, 'steps': 29628, 'loss/train': 1.2518410682678223} 11/07/2021 01:22:21 - INFO - __main__ - Step 29630: {'lr': 0.0004582234576046707, 'samples': 5688960, 'steps': 29629, 'loss/train': 1.666911005973816} 11/07/2021 01:22:23 - INFO - __main__ - Step 29631: {'lr': 0.00045822052063136177, 'samples': 5689152, 'steps': 29630, 'loss/train': 1.9199504852294922} 11/07/2021 01:22:23 - INFO - __main__ - Step 29632: {'lr': 0.0004582175835642319, 'samples': 5689344, 'steps': 29631, 'loss/train': 1.5861963033676147} 11/07/2021 01:22:23 - INFO - __main__ - Step 29633: {'lr': 0.0004582146464032824, 'samples': 5689536, 'steps': 29632, 'loss/train': 1.229762077331543} 11/07/2021 01:22:24 - INFO - __main__ - Step 29634: {'lr': 0.0004582117091485145, 'samples': 5689728, 'steps': 29633, 'loss/train': 1.304877758026123} 11/07/2021 01:22:24 - INFO - __main__ - Step 29635: {'lr': 0.0004582087717999297, 'samples': 5689920, 'steps': 29634, 'loss/train': 1.5370056629180908} 11/07/2021 01:22:24 - INFO - __main__ - Step 29636: {'lr': 0.0004582058343575292, 'samples': 5690112, 'steps': 29635, 'loss/train': 1.7247610092163086} 11/07/2021 01:22:25 - INFO - __main__ - Step 29637: {'lr': 0.00045820289682131437, 'samples': 5690304, 'steps': 29636, 'loss/train': 0.6707466840744019} 11/07/2021 01:22:26 - INFO - __main__ - Step 29638: {'lr': 0.0004581999591912865, 'samples': 5690496, 'steps': 29637, 'loss/train': 1.200347900390625} 11/07/2021 01:22:26 - INFO - __main__ - Step 29639: {'lr': 0.000458197021467447, 'samples': 5690688, 'steps': 29638, 'loss/train': 1.260533094406128} 11/07/2021 01:22:27 - INFO - __main__ - Step 29640: {'lr': 0.00045819408364979714, 'samples': 5690880, 'steps': 29639, 'loss/train': 0.8662011623382568} 11/07/2021 01:22:27 - INFO - __main__ - Step 29641: {'lr': 0.0004581911457383382, 'samples': 5691072, 'steps': 29640, 'loss/train': 1.6379077434539795} 11/07/2021 01:22:28 - INFO - __main__ - Step 29642: {'lr': 0.0004581882077330716, 'samples': 5691264, 'steps': 29641, 'loss/train': 1.6704963445663452} 11/07/2021 01:22:28 - INFO - __main__ - Step 29643: {'lr': 0.0004581852696339985, 'samples': 5691456, 'steps': 29642, 'loss/train': 1.9362915754318237} 11/07/2021 01:22:29 - INFO - __main__ - Step 29644: {'lr': 0.00045818233144112044, 'samples': 5691648, 'steps': 29643, 'loss/train': 1.3519880771636963} 11/07/2021 01:22:29 - INFO - __main__ - Step 29645: {'lr': 0.00045817939315443855, 'samples': 5691840, 'steps': 29644, 'loss/train': 1.4004114866256714} 11/07/2021 01:22:29 - INFO - __main__ - Step 29646: {'lr': 0.0004581764547739543, 'samples': 5692032, 'steps': 29645, 'loss/train': 1.1390637159347534} 11/07/2021 01:22:30 - INFO - __main__ - Step 29647: {'lr': 0.00045817351629966896, 'samples': 5692224, 'steps': 29646, 'loss/train': 1.4488816261291504} 11/07/2021 01:22:31 - INFO - __main__ - Step 29648: {'lr': 0.00045817057773158375, 'samples': 5692416, 'steps': 29647, 'loss/train': 1.579978346824646} 11/07/2021 01:22:31 - INFO - __main__ - Step 29649: {'lr': 0.0004581676390697002, 'samples': 5692608, 'steps': 29648, 'loss/train': 0.9321296811103821} 11/07/2021 01:22:31 - INFO - __main__ - Step 29650: {'lr': 0.00045816470031401945, 'samples': 5692800, 'steps': 29649, 'loss/train': 1.462112307548523} 11/07/2021 01:22:32 - INFO - __main__ - Step 29651: {'lr': 0.00045816176146454296, 'samples': 5692992, 'steps': 29650, 'loss/train': 1.399960994720459} 11/07/2021 01:22:32 - INFO - __main__ - Step 29652: {'lr': 0.00045815882252127197, 'samples': 5693184, 'steps': 29651, 'loss/train': 1.866282343864441} 11/07/2021 01:22:34 - INFO - __main__ - Step 29653: {'lr': 0.0004581558834842078, 'samples': 5693376, 'steps': 29652, 'loss/train': 1.5247520208358765} 11/07/2021 01:22:34 - INFO - __main__ - Step 29654: {'lr': 0.00045815294435335184, 'samples': 5693568, 'steps': 29653, 'loss/train': 1.0651295185089111} 11/07/2021 01:22:34 - INFO - __main__ - Step 29655: {'lr': 0.0004581500051287053, 'samples': 5693760, 'steps': 29654, 'loss/train': 0.4343424439430237} 11/07/2021 01:22:35 - INFO - __main__ - Step 29656: {'lr': 0.00045814706581026967, 'samples': 5693952, 'steps': 29655, 'loss/train': 1.4132171869277954} 11/07/2021 01:22:35 - INFO - __main__ - Step 29657: {'lr': 0.0004581441263980461, 'samples': 5694144, 'steps': 29656, 'loss/train': 1.448478102684021} 11/07/2021 01:22:36 - INFO - __main__ - Step 29658: {'lr': 0.0004581411868920361, 'samples': 5694336, 'steps': 29657, 'loss/train': 0.7318187952041626} 11/07/2021 01:22:36 - INFO - __main__ - Step 29659: {'lr': 0.00045813824729224085, 'samples': 5694528, 'steps': 29658, 'loss/train': 1.9026389122009277} 11/07/2021 01:22:37 - INFO - __main__ - Step 29660: {'lr': 0.0004581353075986617, 'samples': 5694720, 'steps': 29659, 'loss/train': 1.1821388006210327} 11/07/2021 01:22:37 - INFO - __main__ - Step 29661: {'lr': 0.00045813236781129996, 'samples': 5694912, 'steps': 29660, 'loss/train': 1.7136733531951904} 11/07/2021 01:22:37 - INFO - __main__ - Step 29662: {'lr': 0.00045812942793015707, 'samples': 5695104, 'steps': 29661, 'loss/train': 0.9536595940589905} 11/07/2021 01:22:38 - INFO - __main__ - Step 29663: {'lr': 0.0004581264879552342, 'samples': 5695296, 'steps': 29662, 'loss/train': 1.8095760345458984} 11/07/2021 01:22:39 - INFO - __main__ - Step 29664: {'lr': 0.00045812354788653275, 'samples': 5695488, 'steps': 29663, 'loss/train': 2.0689356327056885} 11/07/2021 01:22:39 - INFO - __main__ - Step 29665: {'lr': 0.00045812060772405403, 'samples': 5695680, 'steps': 29664, 'loss/train': 1.4600616693496704} 11/07/2021 01:22:40 - INFO - __main__ - Step 29666: {'lr': 0.0004581176674677995, 'samples': 5695872, 'steps': 29665, 'loss/train': 1.7237344980239868} 11/07/2021 01:22:40 - INFO - __main__ - Step 29667: {'lr': 0.00045811472711777026, 'samples': 5696064, 'steps': 29666, 'loss/train': 1.4242371320724487} 11/07/2021 01:22:41 - INFO - __main__ - Step 29668: {'lr': 0.0004581117866739677, 'samples': 5696256, 'steps': 29667, 'loss/train': 1.5728676319122314} 11/07/2021 01:22:41 - INFO - __main__ - Step 29669: {'lr': 0.00045810884613639325, 'samples': 5696448, 'steps': 29668, 'loss/train': 1.5524414777755737} 11/07/2021 01:22:42 - INFO - __main__ - Step 29670: {'lr': 0.00045810590550504816, 'samples': 5696640, 'steps': 29669, 'loss/train': 1.336570382118225} 11/07/2021 01:22:42 - INFO - __main__ - Step 29671: {'lr': 0.0004581029647799337, 'samples': 5696832, 'steps': 29670, 'loss/train': 1.8532898426055908} 11/07/2021 01:22:42 - INFO - __main__ - Step 29672: {'lr': 0.0004581000239610513, 'samples': 5697024, 'steps': 29671, 'loss/train': 1.6476597785949707} 11/07/2021 01:22:43 - INFO - __main__ - Step 29673: {'lr': 0.0004580970830484023, 'samples': 5697216, 'steps': 29672, 'loss/train': 1.576622486114502} 11/07/2021 01:22:44 - INFO - __main__ - Step 29674: {'lr': 0.00045809414204198785, 'samples': 5697408, 'steps': 29673, 'loss/train': 0.773138701915741} 11/07/2021 01:22:44 - INFO - __main__ - Step 29675: {'lr': 0.00045809120094180946, 'samples': 5697600, 'steps': 29674, 'loss/train': 1.7726764678955078} 11/07/2021 01:22:44 - INFO - __main__ - Step 29676: {'lr': 0.00045808825974786834, 'samples': 5697792, 'steps': 29675, 'loss/train': 1.6256364583969116} 11/07/2021 01:22:45 - INFO - __main__ - Step 29677: {'lr': 0.0004580853184601659, 'samples': 5697984, 'steps': 29676, 'loss/train': 1.227905511856079} 11/07/2021 01:22:46 - INFO - __main__ - Step 29678: {'lr': 0.0004580823770787034, 'samples': 5698176, 'steps': 29677, 'loss/train': 1.5438227653503418} 11/07/2021 01:22:46 - INFO - __main__ - Step 29679: {'lr': 0.0004580794356034822, 'samples': 5698368, 'steps': 29678, 'loss/train': 1.741622805595398} 11/07/2021 01:22:46 - INFO - __main__ - Step 29680: {'lr': 0.0004580764940345036, 'samples': 5698560, 'steps': 29679, 'loss/train': 1.3502264022827148} 11/07/2021 01:22:47 - INFO - __main__ - Step 29681: {'lr': 0.00045807355237176896, 'samples': 5698752, 'steps': 29680, 'loss/train': 1.0882927179336548} 11/07/2021 01:22:47 - INFO - __main__ - Step 29682: {'lr': 0.0004580706106152796, 'samples': 5698944, 'steps': 29681, 'loss/train': 1.5208772420883179} 11/07/2021 01:22:48 - INFO - __main__ - Step 29683: {'lr': 0.00045806766876503683, 'samples': 5699136, 'steps': 29682, 'loss/train': 1.5638811588287354} 11/07/2021 01:22:49 - INFO - __main__ - Step 29684: {'lr': 0.000458064726821042, 'samples': 5699328, 'steps': 29683, 'loss/train': 2.4141056537628174} 11/07/2021 01:22:49 - INFO - __main__ - Step 29685: {'lr': 0.0004580617847832964, 'samples': 5699520, 'steps': 29684, 'loss/train': 1.8928292989730835} 11/07/2021 01:22:49 - INFO - __main__ - Step 29686: {'lr': 0.0004580588426518013, 'samples': 5699712, 'steps': 29685, 'loss/train': 1.3639895915985107} 11/07/2021 01:22:50 - INFO - __main__ - Step 29687: {'lr': 0.0004580559004265582, 'samples': 5699904, 'steps': 29686, 'loss/train': 1.5898321866989136} 11/07/2021 01:22:51 - INFO - __main__ - Step 29688: {'lr': 0.0004580529581075683, 'samples': 5700096, 'steps': 29687, 'loss/train': 1.2457053661346436} 11/07/2021 01:22:51 - INFO - __main__ - Step 29689: {'lr': 0.0004580500156948329, 'samples': 5700288, 'steps': 29688, 'loss/train': 1.5697243213653564} 11/07/2021 01:22:51 - INFO - __main__ - Step 29690: {'lr': 0.0004580470731883534, 'samples': 5700480, 'steps': 29689, 'loss/train': 1.5891913175582886} 11/07/2021 01:22:52 - INFO - __main__ - Step 29691: {'lr': 0.0004580441305881311, 'samples': 5700672, 'steps': 29690, 'loss/train': 1.4484436511993408} 11/07/2021 01:22:52 - INFO - __main__ - Step 29692: {'lr': 0.0004580411878941673, 'samples': 5700864, 'steps': 29691, 'loss/train': 1.649519681930542} 11/07/2021 01:22:53 - INFO - __main__ - Step 29693: {'lr': 0.0004580382451064634, 'samples': 5701056, 'steps': 29692, 'loss/train': 1.3619575500488281} 11/07/2021 01:22:53 - INFO - __main__ - Step 29694: {'lr': 0.00045803530222502065, 'samples': 5701248, 'steps': 29693, 'loss/train': 1.5870784521102905} 11/07/2021 01:22:54 - INFO - __main__ - Step 29695: {'lr': 0.0004580323592498404, 'samples': 5701440, 'steps': 29694, 'loss/train': 1.6245121955871582} 11/07/2021 01:22:54 - INFO - __main__ - Step 29696: {'lr': 0.00045802941618092397, 'samples': 5701632, 'steps': 29695, 'loss/train': 1.578203797340393} 11/07/2021 01:22:55 - INFO - __main__ - Step 29697: {'lr': 0.0004580264730182727, 'samples': 5701824, 'steps': 29696, 'loss/train': 1.8025213479995728} 11/07/2021 01:22:55 - INFO - __main__ - Step 29698: {'lr': 0.000458023529761888, 'samples': 5702016, 'steps': 29697, 'loss/train': 1.5820423364639282} 11/07/2021 01:22:56 - INFO - __main__ - Step 29699: {'lr': 0.00045802058641177104, 'samples': 5702208, 'steps': 29698, 'loss/train': 1.6937873363494873} 11/07/2021 01:22:56 - INFO - __main__ - Step 29700: {'lr': 0.00045801764296792317, 'samples': 5702400, 'steps': 29699, 'loss/train': 1.8489793539047241} 11/07/2021 01:22:57 - INFO - __main__ - Step 29701: {'lr': 0.0004580146994303458, 'samples': 5702592, 'steps': 29700, 'loss/train': 1.885706901550293} 11/07/2021 01:22:57 - INFO - __main__ - Step 29702: {'lr': 0.0004580117557990402, 'samples': 5702784, 'steps': 29701, 'loss/train': 1.9294558763504028} 11/07/2021 01:22:57 - INFO - __main__ - Step 29703: {'lr': 0.0004580088120740077, 'samples': 5702976, 'steps': 29702, 'loss/train': 1.8159981966018677} 11/07/2021 01:22:58 - INFO - __main__ - Step 29704: {'lr': 0.0004580058682552497, 'samples': 5703168, 'steps': 29703, 'loss/train': 1.570683479309082} 11/07/2021 01:22:59 - INFO - __main__ - Step 29705: {'lr': 0.00045800292434276736, 'samples': 5703360, 'steps': 29704, 'loss/train': 1.5014185905456543} 11/07/2021 01:22:59 - INFO - __main__ - Step 29706: {'lr': 0.0004579999803365622, 'samples': 5703552, 'steps': 29705, 'loss/train': 1.7709827423095703} 11/07/2021 01:23:00 - INFO - __main__ - Step 29707: {'lr': 0.00045799703623663546, 'samples': 5703744, 'steps': 29706, 'loss/train': 1.4139878749847412} 11/07/2021 01:23:00 - INFO - __main__ - Step 29708: {'lr': 0.00045799409204298844, 'samples': 5703936, 'steps': 29707, 'loss/train': 1.8164315223693848} 11/07/2021 01:23:00 - INFO - __main__ - Step 29709: {'lr': 0.00045799114775562245, 'samples': 5704128, 'steps': 29708, 'loss/train': 1.409840703010559} 11/07/2021 01:23:01 - INFO - __main__ - Step 29710: {'lr': 0.00045798820337453894, 'samples': 5704320, 'steps': 29709, 'loss/train': 1.6338276863098145} 11/07/2021 01:23:02 - INFO - __main__ - Step 29711: {'lr': 0.00045798525889973905, 'samples': 5704512, 'steps': 29710, 'loss/train': 1.4953669309616089} 11/07/2021 01:23:02 - INFO - __main__ - Step 29712: {'lr': 0.00045798231433122436, 'samples': 5704704, 'steps': 29711, 'loss/train': 1.7392159700393677} 11/07/2021 01:23:02 - INFO - __main__ - Step 29713: {'lr': 0.00045797936966899595, 'samples': 5704896, 'steps': 29712, 'loss/train': 1.2416294813156128} 11/07/2021 01:23:03 - INFO - __main__ - Step 29714: {'lr': 0.00045797642491305523, 'samples': 5705088, 'steps': 29713, 'loss/train': 1.5721782445907593} 11/07/2021 01:23:04 - INFO - __main__ - Step 29715: {'lr': 0.0004579734800634036, 'samples': 5705280, 'steps': 29714, 'loss/train': 1.1889970302581787} 11/07/2021 01:23:04 - INFO - __main__ - Step 29716: {'lr': 0.0004579705351200423, 'samples': 5705472, 'steps': 29715, 'loss/train': 1.799396276473999} 11/07/2021 01:23:04 - INFO - __main__ - Step 29717: {'lr': 0.0004579675900829727, 'samples': 5705664, 'steps': 29716, 'loss/train': 0.6931185126304626} 11/07/2021 01:23:05 - INFO - __main__ - Step 29718: {'lr': 0.00045796464495219614, 'samples': 5705856, 'steps': 29717, 'loss/train': 0.6500502824783325} 11/07/2021 01:23:05 - INFO - __main__ - Step 29719: {'lr': 0.00045796169972771387, 'samples': 5706048, 'steps': 29718, 'loss/train': 1.6350395679473877} 11/07/2021 01:23:05 - INFO - __main__ - Step 29720: {'lr': 0.00045795875440952726, 'samples': 5706240, 'steps': 29719, 'loss/train': 1.9400047063827515} 11/07/2021 01:23:07 - INFO - __main__ - Step 29721: {'lr': 0.00045795580899763767, 'samples': 5706432, 'steps': 29720, 'loss/train': 5.601465225219727} 11/07/2021 01:23:07 - INFO - __main__ - Step 29722: {'lr': 0.00045795286349204633, 'samples': 5706624, 'steps': 29721, 'loss/train': 1.3674595355987549} 11/07/2021 01:23:07 - INFO - __main__ - Step 29723: {'lr': 0.0004579499178927547, 'samples': 5706816, 'steps': 29722, 'loss/train': 1.397915005683899} 11/07/2021 01:23:08 - INFO - __main__ - Step 29724: {'lr': 0.0004579469721997641, 'samples': 5707008, 'steps': 29723, 'loss/train': 1.326294183731079} 11/07/2021 01:23:08 - INFO - __main__ - Step 29725: {'lr': 0.0004579440264130758, 'samples': 5707200, 'steps': 29724, 'loss/train': 1.4774163961410522} 11/07/2021 01:23:09 - INFO - __main__ - Step 29726: {'lr': 0.000457941080532691, 'samples': 5707392, 'steps': 29725, 'loss/train': 1.7879488468170166} 11/07/2021 01:23:09 - INFO - __main__ - Step 29727: {'lr': 0.0004579381345586113, 'samples': 5707584, 'steps': 29726, 'loss/train': 1.5059592723846436} 11/07/2021 01:23:10 - INFO - __main__ - Step 29728: {'lr': 0.0004579351884908378, 'samples': 5707776, 'steps': 29727, 'loss/train': 1.71578848361969} 11/07/2021 01:23:10 - INFO - __main__ - Step 29729: {'lr': 0.00045793224232937193, 'samples': 5707968, 'steps': 29728, 'loss/train': 1.545784831047058} 11/07/2021 01:23:10 - INFO - __main__ - Step 29730: {'lr': 0.0004579292960742151, 'samples': 5708160, 'steps': 29729, 'loss/train': 1.4610202312469482} 11/07/2021 01:23:11 - INFO - __main__ - Step 29731: {'lr': 0.0004579263497253684, 'samples': 5708352, 'steps': 29730, 'loss/train': 1.483440637588501} 11/07/2021 01:23:12 - INFO - __main__ - Step 29732: {'lr': 0.00045792340328283334, 'samples': 5708544, 'steps': 29731, 'loss/train': 1.6959503889083862} 11/07/2021 01:23:12 - INFO - __main__ - Step 29733: {'lr': 0.0004579204567466112, 'samples': 5708736, 'steps': 29732, 'loss/train': 1.0574575662612915} 11/07/2021 01:23:12 - INFO - __main__ - Step 29734: {'lr': 0.0004579175101167033, 'samples': 5708928, 'steps': 29733, 'loss/train': 1.4164676666259766} 11/07/2021 01:23:13 - INFO - __main__ - Step 29735: {'lr': 0.000457914563393111, 'samples': 5709120, 'steps': 29734, 'loss/train': 1.6582592725753784} 11/07/2021 01:23:13 - INFO - __main__ - Step 29736: {'lr': 0.00045791161657583555, 'samples': 5709312, 'steps': 29735, 'loss/train': 1.8787389993667603} 11/07/2021 01:23:14 - INFO - __main__ - Step 29737: {'lr': 0.00045790866966487843, 'samples': 5709504, 'steps': 29736, 'loss/train': 1.2307028770446777} 11/07/2021 01:23:15 - INFO - __main__ - Step 29738: {'lr': 0.0004579057226602408, 'samples': 5709696, 'steps': 29737, 'loss/train': 1.615642786026001} 11/07/2021 01:23:15 - INFO - __main__ - Step 29739: {'lr': 0.00045790277556192414, 'samples': 5709888, 'steps': 29738, 'loss/train': 1.572300910949707} 11/07/2021 01:23:15 - INFO - __main__ - Step 29740: {'lr': 0.0004578998283699296, 'samples': 5710080, 'steps': 29739, 'loss/train': 1.4859967231750488} 11/07/2021 01:23:16 - INFO - __main__ - Step 29741: {'lr': 0.0004578968810842586, 'samples': 5710272, 'steps': 29740, 'loss/train': 1.0473401546478271} 11/07/2021 01:23:17 - INFO - __main__ - Step 29742: {'lr': 0.0004578939337049126, 'samples': 5710464, 'steps': 29741, 'loss/train': 1.2427921295166016} 11/07/2021 01:23:17 - INFO - __main__ - Step 29743: {'lr': 0.0004578909862318927, 'samples': 5710656, 'steps': 29742, 'loss/train': 2.206838369369507} 11/07/2021 01:23:17 - INFO - __main__ - Step 29744: {'lr': 0.00045788803866520037, 'samples': 5710848, 'steps': 29743, 'loss/train': 1.1564019918441772} 11/07/2021 01:23:18 - INFO - __main__ - Step 29745: {'lr': 0.0004578850910048369, 'samples': 5711040, 'steps': 29744, 'loss/train': 0.6552532315254211} 11/07/2021 01:23:18 - INFO - __main__ - Step 29746: {'lr': 0.0004578821432508036, 'samples': 5711232, 'steps': 29745, 'loss/train': 1.3801320791244507} 11/07/2021 01:23:19 - INFO - __main__ - Step 29747: {'lr': 0.00045787919540310175, 'samples': 5711424, 'steps': 29746, 'loss/train': 1.7450164556503296} 11/07/2021 01:23:20 - INFO - __main__ - Step 29748: {'lr': 0.0004578762474617328, 'samples': 5711616, 'steps': 29747, 'loss/train': 1.2243391275405884} 11/07/2021 01:23:20 - INFO - __main__ - Step 29749: {'lr': 0.00045787329942669803, 'samples': 5711808, 'steps': 29748, 'loss/train': 1.2877225875854492} 11/07/2021 01:23:20 - INFO - __main__ - Step 29750: {'lr': 0.0004578703512979988, 'samples': 5712000, 'steps': 29749, 'loss/train': 1.1598126888275146} 11/07/2021 01:23:21 - INFO - __main__ - Step 29751: {'lr': 0.00045786740307563633, 'samples': 5712192, 'steps': 29750, 'loss/train': 1.3428281545639038} 11/07/2021 01:23:22 - INFO - __main__ - Step 29752: {'lr': 0.000457864454759612, 'samples': 5712384, 'steps': 29751, 'loss/train': 1.5730271339416504} 11/07/2021 01:23:22 - INFO - __main__ - Step 29753: {'lr': 0.00045786150634992716, 'samples': 5712576, 'steps': 29752, 'loss/train': 1.583794355392456} 11/07/2021 01:23:22 - INFO - __main__ - Step 29754: {'lr': 0.0004578585578465833, 'samples': 5712768, 'steps': 29753, 'loss/train': 1.4892146587371826} 11/07/2021 01:23:23 - INFO - __main__ - Step 29755: {'lr': 0.00045785560924958135, 'samples': 5712960, 'steps': 29754, 'loss/train': 1.5873689651489258} 11/07/2021 01:23:23 - INFO - __main__ - Step 29756: {'lr': 0.00045785266055892296, 'samples': 5713152, 'steps': 29755, 'loss/train': 1.9628443717956543} 11/07/2021 01:23:23 - INFO - __main__ - Step 29757: {'lr': 0.0004578497117746094, 'samples': 5713344, 'steps': 29756, 'loss/train': 1.3814347982406616} 11/07/2021 01:23:24 - INFO - __main__ - Step 29758: {'lr': 0.00045784676289664194, 'samples': 5713536, 'steps': 29757, 'loss/train': 1.3554935455322266} 11/07/2021 01:23:25 - INFO - __main__ - Step 29759: {'lr': 0.00045784381392502193, 'samples': 5713728, 'steps': 29758, 'loss/train': 1.3909201622009277} 11/07/2021 01:23:25 - INFO - __main__ - Step 29760: {'lr': 0.00045784086485975076, 'samples': 5713920, 'steps': 29759, 'loss/train': 1.44081449508667} 11/07/2021 01:23:26 - INFO - __main__ - Step 29761: {'lr': 0.00045783791570082956, 'samples': 5714112, 'steps': 29760, 'loss/train': 1.479193091392517} 11/07/2021 01:23:26 - INFO - __main__ - Step 29762: {'lr': 0.00045783496644825997, 'samples': 5714304, 'steps': 29761, 'loss/train': 1.6843669414520264} 11/07/2021 01:23:27 - INFO - __main__ - Step 29763: {'lr': 0.000457832017102043, 'samples': 5714496, 'steps': 29762, 'loss/train': 1.9135608673095703} 11/07/2021 01:23:27 - INFO - __main__ - Step 29764: {'lr': 0.00045782906766218026, 'samples': 5714688, 'steps': 29763, 'loss/train': 1.373593807220459} 11/07/2021 01:23:28 - INFO - __main__ - Step 29765: {'lr': 0.00045782611812867285, 'samples': 5714880, 'steps': 29764, 'loss/train': 1.7488089799880981} 11/07/2021 01:23:28 - INFO - __main__ - Step 29766: {'lr': 0.0004578231685015223, 'samples': 5715072, 'steps': 29765, 'loss/train': 1.6487336158752441} 11/07/2021 01:23:28 - INFO - __main__ - Step 29767: {'lr': 0.00045782021878072976, 'samples': 5715264, 'steps': 29766, 'loss/train': 1.114486575126648} 11/07/2021 01:23:30 - INFO - __main__ - Step 29768: {'lr': 0.0004578172689662967, 'samples': 5715456, 'steps': 29767, 'loss/train': 1.9899827241897583} 11/07/2021 01:23:30 - INFO - __main__ - Step 29769: {'lr': 0.0004578143190582243, 'samples': 5715648, 'steps': 29768, 'loss/train': 1.4738233089447021} 11/07/2021 01:23:31 - INFO - __main__ - Step 29770: {'lr': 0.000457811369056514, 'samples': 5715840, 'steps': 29769, 'loss/train': 1.630418300628662} 11/07/2021 01:23:31 - INFO - __main__ - Step 29771: {'lr': 0.0004578084189611671, 'samples': 5716032, 'steps': 29770, 'loss/train': 2.4739456176757812} 11/07/2021 01:23:31 - INFO - __main__ - Step 29772: {'lr': 0.000457805468772185, 'samples': 5716224, 'steps': 29771, 'loss/train': 1.9433209896087646} 11/07/2021 01:23:32 - INFO - __main__ - Step 29773: {'lr': 0.00045780251848956887, 'samples': 5716416, 'steps': 29772, 'loss/train': 1.7320784330368042} 11/07/2021 01:23:33 - INFO - __main__ - Step 29774: {'lr': 0.0004577995681133202, 'samples': 5716608, 'steps': 29773, 'loss/train': 1.7899585962295532} 11/07/2021 01:23:33 - INFO - __main__ - Step 29775: {'lr': 0.00045779661764344025, 'samples': 5716800, 'steps': 29774, 'loss/train': 1.639638900756836} 11/07/2021 01:23:34 - INFO - __main__ - Step 29776: {'lr': 0.0004577936670799303, 'samples': 5716992, 'steps': 29775, 'loss/train': 1.1416207551956177} 11/07/2021 01:23:34 - INFO - __main__ - Step 29777: {'lr': 0.00045779071642279177, 'samples': 5717184, 'steps': 29776, 'loss/train': 1.4320186376571655} 11/07/2021 01:23:34 - INFO - __main__ - Step 29778: {'lr': 0.00045778776567202597, 'samples': 5717376, 'steps': 29777, 'loss/train': 0.19236861169338226} 11/07/2021 01:23:35 - INFO - __main__ - Step 29779: {'lr': 0.0004577848148276341, 'samples': 5717568, 'steps': 29778, 'loss/train': 1.3992671966552734} 11/07/2021 01:23:36 - INFO - __main__ - Step 29780: {'lr': 0.00045778186388961776, 'samples': 5717760, 'steps': 29779, 'loss/train': 0.9883819818496704} 11/07/2021 01:23:36 - INFO - __main__ - Step 29781: {'lr': 0.000457778912857978, 'samples': 5717952, 'steps': 29780, 'loss/train': 1.4643261432647705} 11/07/2021 01:23:36 - INFO - __main__ - Step 29782: {'lr': 0.0004577759617327163, 'samples': 5718144, 'steps': 29781, 'loss/train': 0.6966073513031006} 11/07/2021 01:23:37 - INFO - __main__ - Step 29783: {'lr': 0.000457773010513834, 'samples': 5718336, 'steps': 29782, 'loss/train': 1.7028216123580933} 11/07/2021 01:23:38 - INFO - __main__ - Step 29784: {'lr': 0.0004577700592013323, 'samples': 5718528, 'steps': 29783, 'loss/train': 1.820124864578247} 11/07/2021 01:23:38 - INFO - __main__ - Step 29785: {'lr': 0.0004577671077952127, 'samples': 5718720, 'steps': 29784, 'loss/train': 1.080474853515625} 11/07/2021 01:23:38 - INFO - __main__ - Step 29786: {'lr': 0.0004577641562954764, 'samples': 5718912, 'steps': 29785, 'loss/train': 1.6864469051361084} 11/07/2021 01:23:39 - INFO - __main__ - Step 29787: {'lr': 0.00045776120470212477, 'samples': 5719104, 'steps': 29786, 'loss/train': 1.9222395420074463} 11/07/2021 01:23:39 - INFO - __main__ - Step 29788: {'lr': 0.00045775825301515923, 'samples': 5719296, 'steps': 29787, 'loss/train': 1.5662935972213745} 11/07/2021 01:23:40 - INFO - __main__ - Step 29789: {'lr': 0.00045775530123458096, 'samples': 5719488, 'steps': 29788, 'loss/train': 1.7799813747406006} 11/07/2021 01:23:41 - INFO - __main__ - Step 29790: {'lr': 0.00045775234936039133, 'samples': 5719680, 'steps': 29789, 'loss/train': 1.6136198043823242} 11/07/2021 01:23:41 - INFO - __main__ - Step 29791: {'lr': 0.00045774939739259173, 'samples': 5719872, 'steps': 29790, 'loss/train': 1.7736300230026245} 11/07/2021 01:23:41 - INFO - __main__ - Step 29792: {'lr': 0.0004577464453311835, 'samples': 5720064, 'steps': 29791, 'loss/train': 1.390116572380066} 11/07/2021 01:23:42 - INFO - __main__ - Step 29793: {'lr': 0.00045774349317616786, 'samples': 5720256, 'steps': 29792, 'loss/train': 1.5883327722549438} 11/07/2021 01:23:43 - INFO - __main__ - Step 29794: {'lr': 0.00045774054092754624, 'samples': 5720448, 'steps': 29793, 'loss/train': 1.7770684957504272} 11/07/2021 01:23:43 - INFO - __main__ - Step 29795: {'lr': 0.00045773758858531997, 'samples': 5720640, 'steps': 29794, 'loss/train': 1.6819716691970825} 11/07/2021 01:23:43 - INFO - __main__ - Step 29796: {'lr': 0.0004577346361494903, 'samples': 5720832, 'steps': 29795, 'loss/train': 1.8606592416763306} 11/07/2021 01:23:44 - INFO - __main__ - Step 29797: {'lr': 0.0004577316836200586, 'samples': 5721024, 'steps': 29796, 'loss/train': 1.347123146057129} 11/07/2021 01:23:44 - INFO - __main__ - Step 29798: {'lr': 0.0004577287309970262, 'samples': 5721216, 'steps': 29797, 'loss/train': 1.5793622732162476} 11/07/2021 01:23:46 - INFO - __main__ - Step 29799: {'lr': 0.0004577257782803945, 'samples': 5721408, 'steps': 29798, 'loss/train': 1.1619411706924438} 11/07/2021 01:23:47 - INFO - __main__ - Step 29800: {'lr': 0.00045772282547016475, 'samples': 5721600, 'steps': 29799, 'loss/train': 1.731318712234497} 11/07/2021 01:23:47 - INFO - __main__ - Step 29801: {'lr': 0.0004577198725663383, 'samples': 5721792, 'steps': 29800, 'loss/train': 1.8544946908950806} 11/07/2021 01:23:47 - INFO - __main__ - Step 29802: {'lr': 0.00045771691956891645, 'samples': 5721984, 'steps': 29801, 'loss/train': 1.3345541954040527} 11/07/2021 01:23:48 - INFO - __main__ - Step 29803: {'lr': 0.00045771396647790053, 'samples': 5722176, 'steps': 29802, 'loss/train': 1.8053834438323975} 11/07/2021 01:23:48 - INFO - __main__ - Step 29804: {'lr': 0.00045771101329329195, 'samples': 5722368, 'steps': 29803, 'loss/train': 1.8068797588348389} 11/07/2021 01:23:48 - INFO - __main__ - Step 29805: {'lr': 0.00045770806001509205, 'samples': 5722560, 'steps': 29804, 'loss/train': 1.7853294610977173} 11/07/2021 01:23:49 - INFO - __main__ - Step 29806: {'lr': 0.00045770510664330203, 'samples': 5722752, 'steps': 29805, 'loss/train': 1.2462117671966553} 11/07/2021 01:23:50 - INFO - __main__ - Step 29807: {'lr': 0.0004577021531779233, 'samples': 5722944, 'steps': 29806, 'loss/train': 1.5601024627685547} 11/07/2021 01:23:50 - INFO - __main__ - Step 29808: {'lr': 0.00045769919961895716, 'samples': 5723136, 'steps': 29807, 'loss/train': 1.5851486921310425} 11/07/2021 01:23:50 - INFO - __main__ - Step 29809: {'lr': 0.000457696245966405, 'samples': 5723328, 'steps': 29808, 'loss/train': 1.6994768381118774} 11/07/2021 01:23:51 - INFO - __main__ - Step 29810: {'lr': 0.0004576932922202681, 'samples': 5723520, 'steps': 29809, 'loss/train': 1.0932499170303345} 11/07/2021 01:23:51 - INFO - __main__ - Step 29811: {'lr': 0.00045769033838054783, 'samples': 5723712, 'steps': 29810, 'loss/train': 1.7573410272598267} 11/07/2021 01:23:52 - INFO - __main__ - Step 29812: {'lr': 0.0004576873844472455, 'samples': 5723904, 'steps': 29811, 'loss/train': 1.5087987184524536} 11/07/2021 01:23:52 - INFO - __main__ - Step 29813: {'lr': 0.00045768443042036247, 'samples': 5724096, 'steps': 29812, 'loss/train': 1.4405393600463867} 11/07/2021 01:23:53 - INFO - __main__ - Step 29814: {'lr': 0.0004576814762999, 'samples': 5724288, 'steps': 29813, 'loss/train': 1.6026747226715088} 11/07/2021 01:23:53 - INFO - __main__ - Step 29815: {'lr': 0.00045767852208585945, 'samples': 5724480, 'steps': 29814, 'loss/train': 1.3455612659454346} 11/07/2021 01:23:53 - INFO - __main__ - Step 29816: {'lr': 0.00045767556777824217, 'samples': 5724672, 'steps': 29815, 'loss/train': 1.5217554569244385} 11/07/2021 01:23:55 - INFO - __main__ - Step 29817: {'lr': 0.00045767261337704946, 'samples': 5724864, 'steps': 29816, 'loss/train': 2.1377294063568115} 11/07/2021 01:23:55 - INFO - __main__ - Step 29818: {'lr': 0.00045766965888228273, 'samples': 5725056, 'steps': 29817, 'loss/train': 1.8070579767227173} 11/07/2021 01:23:55 - INFO - __main__ - Step 29819: {'lr': 0.00045766670429394317, 'samples': 5725248, 'steps': 29818, 'loss/train': 1.0186076164245605} 11/07/2021 01:23:56 - INFO - __main__ - Step 29820: {'lr': 0.00045766374961203236, 'samples': 5725440, 'steps': 29819, 'loss/train': 1.401145100593567} 11/07/2021 01:23:56 - INFO - __main__ - Step 29821: {'lr': 0.0004576607948365513, 'samples': 5725632, 'steps': 29820, 'loss/train': 1.6666818857192993} 11/07/2021 01:23:57 - INFO - __main__ - Step 29822: {'lr': 0.0004576578399675015, 'samples': 5725824, 'steps': 29821, 'loss/train': 1.4097083806991577} 11/07/2021 01:23:57 - INFO - __main__ - Step 29823: {'lr': 0.00045765488500488437, 'samples': 5726016, 'steps': 29822, 'loss/train': 1.3067562580108643} 11/07/2021 01:23:58 - INFO - __main__ - Step 29824: {'lr': 0.0004576519299487012, 'samples': 5726208, 'steps': 29823, 'loss/train': 1.5579853057861328} 11/07/2021 01:23:58 - INFO - __main__ - Step 29825: {'lr': 0.00045764897479895315, 'samples': 5726400, 'steps': 29824, 'loss/train': 1.7080674171447754} 11/07/2021 01:23:58 - INFO - __main__ - Step 29826: {'lr': 0.0004576460195556418, 'samples': 5726592, 'steps': 29825, 'loss/train': 1.8079490661621094} 11/07/2021 01:23:59 - INFO - __main__ - Step 29827: {'lr': 0.0004576430642187682, 'samples': 5726784, 'steps': 29826, 'loss/train': 1.7533303499221802} 11/07/2021 01:24:00 - INFO - __main__ - Step 29828: {'lr': 0.00045764010878833396, 'samples': 5726976, 'steps': 29827, 'loss/train': 1.6237348318099976} 11/07/2021 01:24:00 - INFO - __main__ - Step 29829: {'lr': 0.00045763715326434023, 'samples': 5727168, 'steps': 29828, 'loss/train': 1.3736622333526611} 11/07/2021 01:24:00 - INFO - __main__ - Step 29830: {'lr': 0.0004576341976467884, 'samples': 5727360, 'steps': 29829, 'loss/train': 1.267899990081787} 11/07/2021 01:24:01 - INFO - __main__ - Step 29831: {'lr': 0.00045763124193567983, 'samples': 5727552, 'steps': 29830, 'loss/train': 0.6706776022911072} 11/07/2021 01:24:02 - INFO - __main__ - Step 29832: {'lr': 0.0004576282861310158, 'samples': 5727744, 'steps': 29831, 'loss/train': 1.6785303354263306} 11/07/2021 01:24:02 - INFO - __main__ - Step 29833: {'lr': 0.00045762533023279773, 'samples': 5727936, 'steps': 29832, 'loss/train': 1.4475414752960205} 11/07/2021 01:24:02 - INFO - __main__ - Step 29834: {'lr': 0.00045762237424102687, 'samples': 5728128, 'steps': 29833, 'loss/train': 1.6399792432785034} 11/07/2021 01:24:03 - INFO - __main__ - Step 29835: {'lr': 0.0004576194181557045, 'samples': 5728320, 'steps': 29834, 'loss/train': 1.3964948654174805} 11/07/2021 01:24:03 - INFO - __main__ - Step 29836: {'lr': 0.00045761646197683216, 'samples': 5728512, 'steps': 29835, 'loss/train': 2.0420026779174805} 11/07/2021 01:24:04 - INFO - __main__ - Step 29837: {'lr': 0.00045761350570441096, 'samples': 5728704, 'steps': 29836, 'loss/train': 1.8259185552597046} 11/07/2021 01:24:05 - INFO - __main__ - Step 29838: {'lr': 0.0004576105493384423, 'samples': 5728896, 'steps': 29837, 'loss/train': 1.2299456596374512} 11/07/2021 01:24:05 - INFO - __main__ - Step 29839: {'lr': 0.00045760759287892755, 'samples': 5729088, 'steps': 29838, 'loss/train': 0.7691632509231567} 11/07/2021 01:24:05 - INFO - __main__ - Step 29840: {'lr': 0.000457604636325868, 'samples': 5729280, 'steps': 29839, 'loss/train': 1.6238632202148438} 11/07/2021 01:24:06 - INFO - __main__ - Step 29841: {'lr': 0.00045760167967926504, 'samples': 5729472, 'steps': 29840, 'loss/train': 1.3403931856155396} 11/07/2021 01:24:07 - INFO - __main__ - Step 29842: {'lr': 0.00045759872293911995, 'samples': 5729664, 'steps': 29841, 'loss/train': 1.3798387050628662} 11/07/2021 01:24:07 - INFO - __main__ - Step 29843: {'lr': 0.00045759576610543407, 'samples': 5729856, 'steps': 29842, 'loss/train': 1.6725951433181763} 11/07/2021 01:24:07 - INFO - __main__ - Step 29844: {'lr': 0.0004575928091782088, 'samples': 5730048, 'steps': 29843, 'loss/train': 1.3758196830749512} 11/07/2021 01:24:08 - INFO - __main__ - Step 29845: {'lr': 0.00045758985215744536, 'samples': 5730240, 'steps': 29844, 'loss/train': 1.447424292564392} 11/07/2021 01:24:08 - INFO - __main__ - Step 29846: {'lr': 0.0004575868950431452, 'samples': 5730432, 'steps': 29845, 'loss/train': 1.807071328163147} 11/07/2021 01:24:08 - INFO - __main__ - Step 29847: {'lr': 0.0004575839378353095, 'samples': 5730624, 'steps': 29846, 'loss/train': 1.5199185609817505} 11/07/2021 01:24:09 - INFO - __main__ - Step 29848: {'lr': 0.0004575809805339397, 'samples': 5730816, 'steps': 29847, 'loss/train': 1.3373128175735474} 11/07/2021 01:24:10 - INFO - __main__ - Step 29849: {'lr': 0.0004575780231390371, 'samples': 5731008, 'steps': 29848, 'loss/train': 1.7042051553726196} 11/07/2021 01:24:10 - INFO - __main__ - Step 29850: {'lr': 0.0004575750656506031, 'samples': 5731200, 'steps': 29849, 'loss/train': 1.6827337741851807} 11/07/2021 01:24:11 - INFO - __main__ - Step 29851: {'lr': 0.00045757210806863895, 'samples': 5731392, 'steps': 29850, 'loss/train': 1.5924853086471558} 11/07/2021 01:24:11 - INFO - __main__ - Step 29852: {'lr': 0.0004575691503931461, 'samples': 5731584, 'steps': 29851, 'loss/train': 1.6998484134674072} 11/07/2021 01:24:12 - INFO - __main__ - Step 29853: {'lr': 0.00045756619262412565, 'samples': 5731776, 'steps': 29852, 'loss/train': 1.5841456651687622} 11/07/2021 01:24:12 - INFO - __main__ - Step 29854: {'lr': 0.0004575632347615791, 'samples': 5731968, 'steps': 29853, 'loss/train': 1.5362069606781006} 11/07/2021 01:24:13 - INFO - __main__ - Step 29855: {'lr': 0.0004575602768055078, 'samples': 5732160, 'steps': 29854, 'loss/train': 2.408329963684082} 11/07/2021 01:24:13 - INFO - __main__ - Step 29856: {'lr': 0.00045755731875591303, 'samples': 5732352, 'steps': 29855, 'loss/train': 1.5316470861434937} 11/07/2021 01:24:13 - INFO - __main__ - Step 29857: {'lr': 0.0004575543606127961, 'samples': 5732544, 'steps': 29856, 'loss/train': 1.3378548622131348} 11/07/2021 01:24:14 - INFO - __main__ - Step 29858: {'lr': 0.0004575514023761585, 'samples': 5732736, 'steps': 29857, 'loss/train': 1.5171469449996948} 11/07/2021 01:24:15 - INFO - __main__ - Step 29859: {'lr': 0.00045754844404600136, 'samples': 5732928, 'steps': 29858, 'loss/train': 1.53959059715271} 11/07/2021 01:24:15 - INFO - __main__ - Step 29860: {'lr': 0.00045754548562232605, 'samples': 5733120, 'steps': 29859, 'loss/train': 1.0466532707214355} 11/07/2021 01:24:16 - INFO - __main__ - Step 29861: {'lr': 0.00045754252710513397, 'samples': 5733312, 'steps': 29860, 'loss/train': 1.539314866065979} 11/07/2021 01:24:16 - INFO - __main__ - Step 29862: {'lr': 0.00045753956849442647, 'samples': 5733504, 'steps': 29861, 'loss/train': 1.9545060396194458} 11/07/2021 01:24:17 - INFO - __main__ - Step 29863: {'lr': 0.00045753660979020485, 'samples': 5733696, 'steps': 29862, 'loss/train': 1.7656947374343872} 11/07/2021 01:24:17 - INFO - __main__ - Step 29864: {'lr': 0.0004575336509924704, 'samples': 5733888, 'steps': 29863, 'loss/train': 1.5456066131591797} 11/07/2021 01:24:18 - INFO - __main__ - Step 29865: {'lr': 0.0004575306921012245, 'samples': 5734080, 'steps': 29864, 'loss/train': 1.8647871017456055} 11/07/2021 01:24:18 - INFO - __main__ - Step 29866: {'lr': 0.00045752773311646846, 'samples': 5734272, 'steps': 29865, 'loss/train': 1.4891483783721924} 11/07/2021 01:24:18 - INFO - __main__ - Step 29867: {'lr': 0.0004575247740382037, 'samples': 5734464, 'steps': 29866, 'loss/train': 1.5018141269683838} 11/07/2021 01:24:20 - INFO - __main__ - Step 29868: {'lr': 0.0004575218148664314, 'samples': 5734656, 'steps': 29867, 'loss/train': 1.6410999298095703} 11/07/2021 01:24:20 - INFO - __main__ - Step 29869: {'lr': 0.00045751885560115294, 'samples': 5734848, 'steps': 29868, 'loss/train': 1.5205363035202026} 11/07/2021 01:24:20 - INFO - __main__ - Step 29870: {'lr': 0.0004575158962423698, 'samples': 5735040, 'steps': 29869, 'loss/train': 1.8936104774475098} 11/07/2021 01:24:21 - INFO - __main__ - Step 29871: {'lr': 0.0004575129367900831, 'samples': 5735232, 'steps': 29870, 'loss/train': 1.8289682865142822} 11/07/2021 01:24:21 - INFO - __main__ - Step 29872: {'lr': 0.0004575099772442943, 'samples': 5735424, 'steps': 29871, 'loss/train': 1.6737574338912964} 11/07/2021 01:24:21 - INFO - __main__ - Step 29873: {'lr': 0.0004575070176050047, 'samples': 5735616, 'steps': 29872, 'loss/train': 1.4331068992614746} 11/07/2021 01:24:22 - INFO - __main__ - Step 29874: {'lr': 0.00045750405787221566, 'samples': 5735808, 'steps': 29873, 'loss/train': 1.4061284065246582} 11/07/2021 01:24:23 - INFO - __main__ - Step 29875: {'lr': 0.0004575010980459285, 'samples': 5736000, 'steps': 29874, 'loss/train': 1.251508355140686} 11/07/2021 01:24:23 - INFO - __main__ - Step 29876: {'lr': 0.0004574981381261445, 'samples': 5736192, 'steps': 29875, 'loss/train': 1.3016911745071411} 11/07/2021 01:24:23 - INFO - __main__ - Step 29877: {'lr': 0.0004574951781128651, 'samples': 5736384, 'steps': 29876, 'loss/train': 1.335336446762085} 11/07/2021 01:24:24 - INFO - __main__ - Step 29878: {'lr': 0.0004574922180060915, 'samples': 5736576, 'steps': 29877, 'loss/train': 1.3123310804367065} 11/07/2021 01:24:25 - INFO - __main__ - Step 29879: {'lr': 0.0004574892578058252, 'samples': 5736768, 'steps': 29878, 'loss/train': 0.9599051475524902} 11/07/2021 01:24:25 - INFO - __main__ - Step 29880: {'lr': 0.0004574862975120674, 'samples': 5736960, 'steps': 29879, 'loss/train': 1.6392614841461182} 11/07/2021 01:24:25 - INFO - __main__ - Step 29881: {'lr': 0.0004574833371248195, 'samples': 5737152, 'steps': 29880, 'loss/train': 1.8147220611572266} 11/07/2021 01:24:26 - INFO - __main__ - Step 29882: {'lr': 0.00045748037664408275, 'samples': 5737344, 'steps': 29881, 'loss/train': 1.6705900430679321} 11/07/2021 01:24:26 - INFO - __main__ - Step 29883: {'lr': 0.0004574774160698586, 'samples': 5737536, 'steps': 29882, 'loss/train': 1.6094480752944946} 11/07/2021 01:24:27 - INFO - __main__ - Step 29884: {'lr': 0.00045747445540214826, 'samples': 5737728, 'steps': 29883, 'loss/train': 1.6537147760391235} 11/07/2021 01:24:28 - INFO - __main__ - Step 29885: {'lr': 0.00045747149464095324, 'samples': 5737920, 'steps': 29884, 'loss/train': 1.493760347366333} 11/07/2021 01:24:28 - INFO - __main__ - Step 29886: {'lr': 0.00045746853378627467, 'samples': 5738112, 'steps': 29885, 'loss/train': 1.9688208103179932} 11/07/2021 01:24:28 - INFO - __main__ - Step 29887: {'lr': 0.000457465572838114, 'samples': 5738304, 'steps': 29886, 'loss/train': 0.6421449184417725} 11/07/2021 01:24:29 - INFO - __main__ - Step 29888: {'lr': 0.0004574626117964726, 'samples': 5738496, 'steps': 29887, 'loss/train': 2.212495803833008} 11/07/2021 01:24:30 - INFO - __main__ - Step 29889: {'lr': 0.00045745965066135163, 'samples': 5738688, 'steps': 29888, 'loss/train': 0.825211763381958} 11/07/2021 01:24:30 - INFO - __main__ - Step 29890: {'lr': 0.00045745668943275266, 'samples': 5738880, 'steps': 29889, 'loss/train': 1.9920496940612793} 11/07/2021 01:24:30 - INFO - __main__ - Step 29891: {'lr': 0.00045745372811067687, 'samples': 5739072, 'steps': 29890, 'loss/train': 1.0584920644760132} 11/07/2021 01:24:31 - INFO - __main__ - Step 29892: {'lr': 0.00045745076669512566, 'samples': 5739264, 'steps': 29891, 'loss/train': 1.6357108354568481} 11/07/2021 01:24:31 - INFO - __main__ - Step 29893: {'lr': 0.0004574478051861003, 'samples': 5739456, 'steps': 29892, 'loss/train': 1.5066235065460205} 11/07/2021 01:24:32 - INFO - __main__ - Step 29894: {'lr': 0.00045744484358360216, 'samples': 5739648, 'steps': 29893, 'loss/train': 1.6722524166107178} 11/07/2021 01:24:33 - INFO - __main__ - Step 29895: {'lr': 0.0004574418818876326, 'samples': 5739840, 'steps': 29894, 'loss/train': 1.4991633892059326} 11/07/2021 01:24:33 - INFO - __main__ - Step 29896: {'lr': 0.0004574389200981929, 'samples': 5740032, 'steps': 29895, 'loss/train': 1.4193624258041382} 11/07/2021 01:24:33 - INFO - __main__ - Step 29897: {'lr': 0.00045743595821528437, 'samples': 5740224, 'steps': 29896, 'loss/train': 1.590252161026001} 11/07/2021 01:24:34 - INFO - __main__ - Step 29898: {'lr': 0.0004574329962389085, 'samples': 5740416, 'steps': 29897, 'loss/train': 1.6737258434295654} 11/07/2021 01:24:34 - INFO - __main__ - Step 29899: {'lr': 0.0004574300341690665, 'samples': 5740608, 'steps': 29898, 'loss/train': 1.7634446620941162} 11/07/2021 01:24:35 - INFO - __main__ - Step 29900: {'lr': 0.00045742707200575975, 'samples': 5740800, 'steps': 29899, 'loss/train': 1.6548808813095093} 11/07/2021 01:24:35 - INFO - __main__ - Step 29901: {'lr': 0.00045742410974898947, 'samples': 5740992, 'steps': 29900, 'loss/train': 1.6476119756698608} 11/07/2021 01:24:36 - INFO - __main__ - Step 29902: {'lr': 0.0004574211473987571, 'samples': 5741184, 'steps': 29901, 'loss/train': 1.7704423666000366} 11/07/2021 01:24:36 - INFO - __main__ - Step 29903: {'lr': 0.00045741818495506403, 'samples': 5741376, 'steps': 29902, 'loss/train': 1.6585086584091187} 11/07/2021 01:24:36 - INFO - __main__ - Step 29904: {'lr': 0.0004574152224179115, 'samples': 5741568, 'steps': 29903, 'loss/train': 1.849440097808838} 11/07/2021 01:24:37 - INFO - __main__ - Step 29905: {'lr': 0.0004574122597873008, 'samples': 5741760, 'steps': 29904, 'loss/train': 1.5148991346359253} 11/07/2021 01:24:38 - INFO - __main__ - Step 29906: {'lr': 0.0004574092970632335, 'samples': 5741952, 'steps': 29905, 'loss/train': 1.5750218629837036} 11/07/2021 01:24:38 - INFO - __main__ - Step 29907: {'lr': 0.00045740633424571064, 'samples': 5742144, 'steps': 29906, 'loss/train': 1.9244686365127563} 11/07/2021 01:24:39 - INFO - __main__ - Step 29908: {'lr': 0.00045740337133473374, 'samples': 5742336, 'steps': 29907, 'loss/train': 1.775071620941162} 11/07/2021 01:24:39 - INFO - __main__ - Step 29909: {'lr': 0.00045740040833030404, 'samples': 5742528, 'steps': 29908, 'loss/train': 1.9028395414352417} 11/07/2021 01:24:40 - INFO - __main__ - Step 29910: {'lr': 0.00045739744523242294, 'samples': 5742720, 'steps': 29909, 'loss/train': 1.3661624193191528} 11/07/2021 01:24:40 - INFO - __main__ - Step 29911: {'lr': 0.0004573944820410918, 'samples': 5742912, 'steps': 29910, 'loss/train': 1.7625046968460083} 11/07/2021 01:24:41 - INFO - __main__ - Step 29912: {'lr': 0.0004573915187563118, 'samples': 5743104, 'steps': 29911, 'loss/train': 1.7702594995498657} 11/07/2021 01:24:41 - INFO - __main__ - Step 29913: {'lr': 0.00045738855537808443, 'samples': 5743296, 'steps': 29912, 'loss/train': 1.0329153537750244} 11/07/2021 01:24:41 - INFO - __main__ - Step 29914: {'lr': 0.000457385591906411, 'samples': 5743488, 'steps': 29913, 'loss/train': 1.6367841958999634} 11/07/2021 01:24:42 - INFO - __main__ - Step 29915: {'lr': 0.00045738262834129283, 'samples': 5743680, 'steps': 29914, 'loss/train': 1.5828520059585571} 11/07/2021 01:24:43 - INFO - __main__ - Step 29916: {'lr': 0.0004573796646827312, 'samples': 5743872, 'steps': 29915, 'loss/train': 1.8843293190002441} 11/07/2021 01:24:43 - INFO - __main__ - Step 29917: {'lr': 0.0004573767009307276, 'samples': 5744064, 'steps': 29916, 'loss/train': 1.4333949089050293} 11/07/2021 01:24:44 - INFO - __main__ - Step 29918: {'lr': 0.0004573737370852831, 'samples': 5744256, 'steps': 29917, 'loss/train': 1.7710152864456177} 11/07/2021 01:24:44 - INFO - __main__ - Step 29919: {'lr': 0.0004573707731463993, 'samples': 5744448, 'steps': 29918, 'loss/train': 1.1830955743789673} 11/07/2021 01:24:45 - INFO - __main__ - Step 29920: {'lr': 0.00045736780911407736, 'samples': 5744640, 'steps': 29919, 'loss/train': 1.6793465614318848} 11/07/2021 01:24:45 - INFO - __main__ - Step 29921: {'lr': 0.00045736484498831877, 'samples': 5744832, 'steps': 29920, 'loss/train': 1.932478427886963} 11/07/2021 01:24:46 - INFO - __main__ - Step 29922: {'lr': 0.0004573618807691248, 'samples': 5745024, 'steps': 29921, 'loss/train': 1.56174635887146} 11/07/2021 01:24:46 - INFO - __main__ - Step 29923: {'lr': 0.0004573589164564966, 'samples': 5745216, 'steps': 29922, 'loss/train': 1.684158205986023} 11/07/2021 01:24:46 - INFO - __main__ - Step 29924: {'lr': 0.00045735595205043583, 'samples': 5745408, 'steps': 29923, 'loss/train': 1.9968838691711426} 11/07/2021 01:24:47 - INFO - __main__ - Step 29925: {'lr': 0.00045735298755094364, 'samples': 5745600, 'steps': 29924, 'loss/train': 1.8304612636566162} 11/07/2021 01:24:48 - INFO - __main__ - Step 29926: {'lr': 0.00045735002295802137, 'samples': 5745792, 'steps': 29925, 'loss/train': 1.8053874969482422} 11/07/2021 01:24:48 - INFO - __main__ - Step 29927: {'lr': 0.00045734705827167035, 'samples': 5745984, 'steps': 29926, 'loss/train': 1.7693537473678589} 11/07/2021 01:24:48 - INFO - __main__ - Step 29928: {'lr': 0.000457344093491892, 'samples': 5746176, 'steps': 29927, 'loss/train': 1.6845461130142212} 11/07/2021 01:24:49 - INFO - __main__ - Step 29929: {'lr': 0.00045734112861868753, 'samples': 5746368, 'steps': 29928, 'loss/train': 0.90385502576828} 11/07/2021 01:24:49 - INFO - __main__ - Step 29930: {'lr': 0.0004573381636520584, 'samples': 5746560, 'steps': 29929, 'loss/train': 1.354880690574646} 11/07/2021 01:24:50 - INFO - __main__ - Step 29931: {'lr': 0.0004573351985920059, 'samples': 5746752, 'steps': 29930, 'loss/train': 1.7663991451263428} 11/07/2021 01:24:51 - INFO - __main__ - Step 29932: {'lr': 0.0004573322334385314, 'samples': 5746944, 'steps': 29931, 'loss/train': 0.8987229466438293} 11/07/2021 01:24:51 - INFO - __main__ - Step 29933: {'lr': 0.0004573292681916361, 'samples': 5747136, 'steps': 29932, 'loss/train': 1.6049652099609375} 11/07/2021 01:24:51 - INFO - __main__ - Step 29934: {'lr': 0.0004573263028513214, 'samples': 5747328, 'steps': 29933, 'loss/train': 1.4241623878479004} 11/07/2021 01:24:52 - INFO - __main__ - Step 29935: {'lr': 0.0004573233374175888, 'samples': 5747520, 'steps': 29934, 'loss/train': 1.8239710330963135} 11/07/2021 01:24:53 - INFO - __main__ - Step 29936: {'lr': 0.0004573203718904394, 'samples': 5747712, 'steps': 29935, 'loss/train': 1.6157418489456177} 11/07/2021 01:24:53 - INFO - __main__ - Step 29937: {'lr': 0.00045731740626987473, 'samples': 5747904, 'steps': 29936, 'loss/train': 1.3402079343795776} 11/07/2021 01:24:53 - INFO - __main__ - Step 29938: {'lr': 0.00045731444055589597, 'samples': 5748096, 'steps': 29937, 'loss/train': 1.3002804517745972} 11/07/2021 01:24:54 - INFO - __main__ - Step 29939: {'lr': 0.0004573114747485045, 'samples': 5748288, 'steps': 29938, 'loss/train': 1.498435139656067} 11/07/2021 01:24:54 - INFO - __main__ - Step 29940: {'lr': 0.0004573085088477017, 'samples': 5748480, 'steps': 29939, 'loss/train': 1.1412640810012817} 11/07/2021 01:24:55 - INFO - __main__ - Step 29941: {'lr': 0.0004573055428534889, 'samples': 5748672, 'steps': 29940, 'loss/train': 1.5386608839035034} 11/07/2021 01:24:56 - INFO - __main__ - Step 29942: {'lr': 0.00045730257676586747, 'samples': 5748864, 'steps': 29941, 'loss/train': 0.6990121603012085} 11/07/2021 01:24:56 - INFO - __main__ - Step 29943: {'lr': 0.0004572996105848386, 'samples': 5749056, 'steps': 29942, 'loss/train': 1.4737290143966675} 11/07/2021 01:24:56 - INFO - __main__ - Step 29944: {'lr': 0.0004572966443104038, 'samples': 5749248, 'steps': 29943, 'loss/train': 1.4378646612167358} 11/07/2021 01:24:57 - INFO - __main__ - Step 29945: {'lr': 0.00045729367794256434, 'samples': 5749440, 'steps': 29944, 'loss/train': 1.599612832069397} 11/07/2021 01:24:58 - INFO - __main__ - Step 29946: {'lr': 0.0004572907114813215, 'samples': 5749632, 'steps': 29945, 'loss/train': 1.34317946434021} 11/07/2021 01:24:58 - INFO - __main__ - Step 29947: {'lr': 0.0004572877449266767, 'samples': 5749824, 'steps': 29946, 'loss/train': 1.4258098602294922} 11/07/2021 01:24:58 - INFO - __main__ - Step 29948: {'lr': 0.0004572847782786312, 'samples': 5750016, 'steps': 29947, 'loss/train': 1.0848888158798218} 11/07/2021 01:24:59 - INFO - __main__ - Step 29949: {'lr': 0.0004572818115371864, 'samples': 5750208, 'steps': 29948, 'loss/train': 1.4937506914138794} 11/07/2021 01:24:59 - INFO - __main__ - Step 29950: {'lr': 0.0004572788447023436, 'samples': 5750400, 'steps': 29949, 'loss/train': 1.7091032266616821} 11/07/2021 01:25:00 - INFO - __main__ - Step 29951: {'lr': 0.00045727587777410415, 'samples': 5750592, 'steps': 29950, 'loss/train': 1.3422096967697144} 11/07/2021 01:25:00 - INFO - __main__ - Step 29952: {'lr': 0.00045727291075246937, 'samples': 5750784, 'steps': 29951, 'loss/train': 1.6312906742095947} 11/07/2021 01:25:01 - INFO - __main__ - Step 29953: {'lr': 0.0004572699436374407, 'samples': 5750976, 'steps': 29952, 'loss/train': 1.4829158782958984} 11/07/2021 01:25:01 - INFO - __main__ - Step 29954: {'lr': 0.00045726697642901925, 'samples': 5751168, 'steps': 29953, 'loss/train': 1.410493016242981} 11/07/2021 01:25:01 - INFO - __main__ - Step 29955: {'lr': 0.0004572640091272066, 'samples': 5751360, 'steps': 29954, 'loss/train': 1.5101686716079712} 11/07/2021 01:25:02 - INFO - __main__ - Step 29956: {'lr': 0.000457261041732004, 'samples': 5751552, 'steps': 29955, 'loss/train': 1.3401647806167603} 11/07/2021 01:25:03 - INFO - __main__ - Step 29957: {'lr': 0.0004572580742434127, 'samples': 5751744, 'steps': 29956, 'loss/train': 1.9295035600662231} 11/07/2021 01:25:03 - INFO - __main__ - Step 29958: {'lr': 0.00045725510666143424, 'samples': 5751936, 'steps': 29957, 'loss/train': 1.5255932807922363} 11/07/2021 01:25:03 - INFO - __main__ - Step 29959: {'lr': 0.0004572521389860697, 'samples': 5752128, 'steps': 29958, 'loss/train': 1.4836983680725098} 11/07/2021 01:25:04 - INFO - __main__ - Step 29960: {'lr': 0.00045724917121732055, 'samples': 5752320, 'steps': 29959, 'loss/train': 1.8313194513320923} 11/07/2021 01:25:04 - INFO - __main__ - Step 29961: {'lr': 0.0004572462033551882, 'samples': 5752512, 'steps': 29960, 'loss/train': 1.5937235355377197} 11/07/2021 01:25:05 - INFO - __main__ - Step 29962: {'lr': 0.00045724323539967385, 'samples': 5752704, 'steps': 29961, 'loss/train': 1.5976606607437134} 11/07/2021 01:25:06 - INFO - __main__ - Step 29963: {'lr': 0.00045724026735077886, 'samples': 5752896, 'steps': 29962, 'loss/train': 1.7151837348937988} 11/07/2021 01:25:06 - INFO - __main__ - Step 29964: {'lr': 0.00045723729920850464, 'samples': 5753088, 'steps': 29963, 'loss/train': 1.3524528741836548} 11/07/2021 01:25:06 - INFO - __main__ - Step 29965: {'lr': 0.00045723433097285247, 'samples': 5753280, 'steps': 29964, 'loss/train': 1.1049710512161255} 11/07/2021 01:25:07 - INFO - __main__ - Step 29966: {'lr': 0.0004572313626438238, 'samples': 5753472, 'steps': 29965, 'loss/train': 1.2778706550598145} 11/07/2021 01:25:08 - INFO - __main__ - Step 29967: {'lr': 0.00045722839422141984, 'samples': 5753664, 'steps': 29966, 'loss/train': 1.5637139081954956} 11/07/2021 01:25:08 - INFO - __main__ - Step 29968: {'lr': 0.000457225425705642, 'samples': 5753856, 'steps': 29967, 'loss/train': 1.3151350021362305} 11/07/2021 01:25:08 - INFO - __main__ - Step 29969: {'lr': 0.0004572224570964915, 'samples': 5754048, 'steps': 29968, 'loss/train': 1.915236473083496} 11/07/2021 01:25:09 - INFO - __main__ - Step 29970: {'lr': 0.0004572194883939697, 'samples': 5754240, 'steps': 29969, 'loss/train': 1.596436619758606} 11/07/2021 01:25:09 - INFO - __main__ - Step 29971: {'lr': 0.0004572165195980781, 'samples': 5754432, 'steps': 29970, 'loss/train': 1.238713026046753} 11/07/2021 01:25:10 - INFO - __main__ - Step 29972: {'lr': 0.0004572135507088179, 'samples': 5754624, 'steps': 29971, 'loss/train': 1.2251867055892944} 11/07/2021 01:25:11 - INFO - __main__ - Step 29973: {'lr': 0.00045721058172619043, 'samples': 5754816, 'steps': 29972, 'loss/train': 1.2458912134170532} 11/07/2021 01:25:11 - INFO - __main__ - Step 29974: {'lr': 0.0004572076126501972, 'samples': 5755008, 'steps': 29973, 'loss/train': 1.693926215171814} 11/07/2021 01:25:11 - INFO - __main__ - Step 29975: {'lr': 0.00045720464348083937, 'samples': 5755200, 'steps': 29974, 'loss/train': 1.7528555393218994} 11/07/2021 01:25:12 - INFO - __main__ - Step 29976: {'lr': 0.0004572016742181182, 'samples': 5755392, 'steps': 29975, 'loss/train': 1.8526099920272827} 11/07/2021 01:25:12 - INFO - __main__ - Step 29977: {'lr': 0.0004571987048620353, 'samples': 5755584, 'steps': 29976, 'loss/train': 1.5356825590133667} 11/07/2021 01:25:13 - INFO - __main__ - Step 29978: {'lr': 0.0004571957354125918, 'samples': 5755776, 'steps': 29977, 'loss/train': 1.5769151449203491} 11/07/2021 01:25:14 - INFO - __main__ - Step 29979: {'lr': 0.00045719276586978907, 'samples': 5755968, 'steps': 29978, 'loss/train': 1.5161664485931396} 11/07/2021 01:25:14 - INFO - __main__ - Step 29980: {'lr': 0.00045718979623362855, 'samples': 5756160, 'steps': 29979, 'loss/train': 1.3842449188232422} 11/07/2021 01:25:14 - INFO - __main__ - Step 29981: {'lr': 0.00045718682650411146, 'samples': 5756352, 'steps': 29980, 'loss/train': 1.6693494319915771} 11/07/2021 01:25:15 - INFO - __main__ - Step 29982: {'lr': 0.0004571838566812392, 'samples': 5756544, 'steps': 29981, 'loss/train': 1.5294941663742065} 11/07/2021 01:25:16 - INFO - __main__ - Step 29983: {'lr': 0.00045718088676501305, 'samples': 5756736, 'steps': 29982, 'loss/train': 1.6696785688400269} 11/07/2021 01:25:16 - INFO - __main__ - Step 29984: {'lr': 0.0004571779167554344, 'samples': 5756928, 'steps': 29983, 'loss/train': 1.5812838077545166} 11/07/2021 01:25:16 - INFO - __main__ - Step 29985: {'lr': 0.0004571749466525046, 'samples': 5757120, 'steps': 29984, 'loss/train': 1.3214921951293945} 11/07/2021 01:25:17 - INFO - __main__ - Step 29986: {'lr': 0.000457171976456225, 'samples': 5757312, 'steps': 29985, 'loss/train': 1.646202802658081} 11/07/2021 01:25:17 - INFO - __main__ - Step 29987: {'lr': 0.00045716900616659686, 'samples': 5757504, 'steps': 29986, 'loss/train': 2.17270565032959} 11/07/2021 01:25:18 - INFO - __main__ - Step 29988: {'lr': 0.00045716603578362157, 'samples': 5757696, 'steps': 29987, 'loss/train': 1.8010432720184326} 11/07/2021 01:25:19 - INFO - __main__ - Step 29989: {'lr': 0.00045716306530730043, 'samples': 5757888, 'steps': 29988, 'loss/train': 1.2930711507797241} 11/07/2021 01:25:19 - INFO - __main__ - Step 29990: {'lr': 0.00045716009473763486, 'samples': 5758080, 'steps': 29989, 'loss/train': 1.630855679512024} 11/07/2021 01:25:19 - INFO - __main__ - Step 29991: {'lr': 0.0004571571240746262, 'samples': 5758272, 'steps': 29990, 'loss/train': 1.8557555675506592} 11/07/2021 01:25:20 - INFO - __main__ - Step 29992: {'lr': 0.00045715415331827564, 'samples': 5758464, 'steps': 29991, 'loss/train': 1.6086505651474} 11/07/2021 01:25:20 - INFO - __main__ - Step 29993: {'lr': 0.00045715118246858466, 'samples': 5758656, 'steps': 29992, 'loss/train': 1.5766459703445435} 11/07/2021 01:25:21 - INFO - __main__ - Step 29994: {'lr': 0.0004571482115255545, 'samples': 5758848, 'steps': 29993, 'loss/train': 1.404898762702942} 11/07/2021 01:25:21 - INFO - __main__ - Step 29995: {'lr': 0.0004571452404891866, 'samples': 5759040, 'steps': 29994, 'loss/train': 1.7848975658416748} 11/07/2021 01:25:22 - INFO - __main__ - Step 29996: {'lr': 0.0004571422693594822, 'samples': 5759232, 'steps': 29995, 'loss/train': 1.7715831995010376} 11/07/2021 01:25:22 - INFO - __main__ - Step 29997: {'lr': 0.00045713929813644274, 'samples': 5759424, 'steps': 29996, 'loss/train': 1.537301778793335} 11/07/2021 01:25:22 - INFO - __main__ - Step 29998: {'lr': 0.0004571363268200695, 'samples': 5759616, 'steps': 29997, 'loss/train': 1.6564396619796753} 11/07/2021 01:25:23 - INFO - __main__ - Step 29999: {'lr': 0.0004571333554103638, 'samples': 5759808, 'steps': 29998, 'loss/train': 1.6401314735412598} 11/07/2021 01:25:24 - INFO - __main__ - Step 30000: {'lr': 0.0004571303839073271, 'samples': 5760000, 'steps': 29999, 'loss/train': 1.3726354837417603} 11/07/2021 01:25:24 - INFO - __main__ - Evaluating and saving model checkpoint 11/07/2021 01:28:38 - INFO - __main__ - Step 30000: {'loss/eval': 1.5031383037567139, 'perplexity': 4.495776176452637} 11/07/2021 01:28:49 - WARNING - huggingface_hub.repository - Adding files tracked by Git LFS: ['wandb/run-20211106_211610-dtkf2u0m/logs/debug-internal.log']. This may take a bit of time if the files are large. 11/07/2021 01:28:53 - WARNING - huggingface_hub.repository - Several commits (2) will be pushed upstream. 11/07/2021 01:28:53 - WARNING - huggingface_hub.repository - The progress bars may be unreliable. 11/07/2021 01:29:19 - WARNING - huggingface_hub.repository - To https://huggingface.co/lvwerra/codeparrot-small acc8d6f..d425b2d proud-haze-135 -> proud-haze-135 11/07/2021 01:29:21 - INFO - __main__ - Step 30001: {'lr': 0.00045712741231096054, 'samples': 5760192, 'steps': 30000, 'loss/train': 1.1647251844406128} 11/07/2021 01:29:21 - INFO - __main__ - Step 30002: {'lr': 0.0004571244406212656, 'samples': 5760384, 'steps': 30001, 'loss/train': 1.471921682357788} 11/07/2021 01:29:22 - INFO - __main__ - Step 30003: {'lr': 0.00045712146883824357, 'samples': 5760576, 'steps': 30002, 'loss/train': 1.4950001239776611} 11/07/2021 01:29:22 - INFO - __main__ - Step 30004: {'lr': 0.00045711849696189585, 'samples': 5760768, 'steps': 30003, 'loss/train': 1.5751121044158936} 11/07/2021 01:29:23 - INFO - __main__ - Step 30005: {'lr': 0.0004571155249922237, 'samples': 5760960, 'steps': 30004, 'loss/train': 0.9164716601371765} 11/07/2021 01:29:24 - INFO - __main__ - Step 30006: {'lr': 0.00045711255292922847, 'samples': 5761152, 'steps': 30005, 'loss/train': 1.8894248008728027} 11/07/2021 01:29:24 - INFO - __main__ - Step 30007: {'lr': 0.00045710958077291156, 'samples': 5761344, 'steps': 30006, 'loss/train': 1.6442089080810547} 11/07/2021 01:29:24 - INFO - __main__ - Step 30008: {'lr': 0.00045710660852327423, 'samples': 5761536, 'steps': 30007, 'loss/train': 1.578920841217041} 11/07/2021 01:29:25 - INFO - __main__ - Step 30009: {'lr': 0.00045710363618031783, 'samples': 5761728, 'steps': 30008, 'loss/train': 1.5235463380813599} 11/07/2021 01:29:26 - INFO - __main__ - Step 30010: {'lr': 0.0004571006637440438, 'samples': 5761920, 'steps': 30009, 'loss/train': 1.534705400466919} 11/07/2021 01:29:26 - INFO - __main__ - Step 30011: {'lr': 0.00045709769121445335, 'samples': 5762112, 'steps': 30010, 'loss/train': 1.927817463874817} 11/07/2021 01:29:26 - INFO - __main__ - Step 30012: {'lr': 0.00045709471859154793, 'samples': 5762304, 'steps': 30011, 'loss/train': 1.541087031364441} 11/07/2021 01:29:27 - INFO - __main__ - Step 30013: {'lr': 0.0004570917458753288, 'samples': 5762496, 'steps': 30012, 'loss/train': 0.8787887692451477} 11/07/2021 01:29:27 - INFO - __main__ - Step 30014: {'lr': 0.00045708877306579733, 'samples': 5762688, 'steps': 30013, 'loss/train': 1.8259247541427612} 11/07/2021 01:29:28 - INFO - __main__ - Step 30015: {'lr': 0.00045708580016295486, 'samples': 5762880, 'steps': 30014, 'loss/train': 5.611205101013184} 11/07/2021 01:29:28 - INFO - __main__ - Step 30016: {'lr': 0.0004570828271668027, 'samples': 5763072, 'steps': 30015, 'loss/train': 1.2683762311935425} 11/07/2021 01:29:29 - INFO - __main__ - Step 30017: {'lr': 0.0004570798540773422, 'samples': 5763264, 'steps': 30016, 'loss/train': 1.17982816696167} 11/07/2021 01:29:29 - INFO - __main__ - Step 30018: {'lr': 0.0004570768808945748, 'samples': 5763456, 'steps': 30017, 'loss/train': 2.3335750102996826} 11/07/2021 01:29:30 - INFO - __main__ - Step 30019: {'lr': 0.00045707390761850163, 'samples': 5763648, 'steps': 30018, 'loss/train': 1.6228090524673462} 11/07/2021 01:29:30 - INFO - __main__ - Step 30020: {'lr': 0.00045707093424912426, 'samples': 5763840, 'steps': 30019, 'loss/train': 1.74102783203125} 11/07/2021 01:29:31 - INFO - __main__ - Step 30021: {'lr': 0.00045706796078644386, 'samples': 5764032, 'steps': 30020, 'loss/train': 1.6227269172668457} 11/07/2021 01:29:32 - INFO - __main__ - Step 30022: {'lr': 0.00045706498723046185, 'samples': 5764224, 'steps': 30021, 'loss/train': 1.0815889835357666} 11/07/2021 01:29:32 - INFO - __main__ - Step 30023: {'lr': 0.0004570620135811795, 'samples': 5764416, 'steps': 30022, 'loss/train': 1.4398486614227295} 11/07/2021 01:29:32 - INFO - __main__ - Step 30024: {'lr': 0.0004570590398385983, 'samples': 5764608, 'steps': 30023, 'loss/train': 1.8302394151687622} 11/07/2021 01:29:33 - INFO - __main__ - Step 30025: {'lr': 0.0004570560660027194, 'samples': 5764800, 'steps': 30024, 'loss/train': 1.3542307615280151} 11/07/2021 01:29:34 - INFO - __main__ - Step 30026: {'lr': 0.00045705309207354433, 'samples': 5764992, 'steps': 30025, 'loss/train': 1.7640297412872314} 11/07/2021 01:29:34 - INFO - __main__ - Step 30027: {'lr': 0.00045705011805107426, 'samples': 5765184, 'steps': 30026, 'loss/train': 1.3665052652359009} 11/07/2021 01:29:35 - INFO - __main__ - Step 30028: {'lr': 0.00045704714393531064, 'samples': 5765376, 'steps': 30027, 'loss/train': 1.857277750968933} 11/07/2021 01:29:35 - INFO - __main__ - Step 30029: {'lr': 0.00045704416972625474, 'samples': 5765568, 'steps': 30028, 'loss/train': 1.743507981300354} 11/07/2021 01:29:35 - INFO - __main__ - Step 30030: {'lr': 0.000457041195423908, 'samples': 5765760, 'steps': 30029, 'loss/train': 0.26046043634414673} 11/07/2021 01:29:36 - INFO - __main__ - Step 30031: {'lr': 0.0004570382210282716, 'samples': 5765952, 'steps': 30030, 'loss/train': 1.2793536186218262} 11/07/2021 01:29:37 - INFO - __main__ - Step 30032: {'lr': 0.00045703524653934705, 'samples': 5766144, 'steps': 30031, 'loss/train': 1.660413384437561} 11/07/2021 01:29:37 - INFO - __main__ - Step 30033: {'lr': 0.0004570322719571355, 'samples': 5766336, 'steps': 30032, 'loss/train': 1.4386049509048462} 11/07/2021 01:29:37 - INFO - __main__ - Step 30034: {'lr': 0.00045702929728163845, 'samples': 5766528, 'steps': 30033, 'loss/train': 1.5465935468673706} 11/07/2021 01:29:38 - INFO - __main__ - Step 30035: {'lr': 0.00045702632251285727, 'samples': 5766720, 'steps': 30034, 'loss/train': 1.477612853050232} 11/07/2021 01:29:39 - INFO - __main__ - Step 30036: {'lr': 0.0004570233476507931, 'samples': 5766912, 'steps': 30035, 'loss/train': 1.1126904487609863} 11/07/2021 01:29:39 - INFO - __main__ - Step 30037: {'lr': 0.0004570203726954475, 'samples': 5767104, 'steps': 30036, 'loss/train': 1.389814853668213} 11/07/2021 01:29:40 - INFO - __main__ - Step 30038: {'lr': 0.0004570173976468217, 'samples': 5767296, 'steps': 30037, 'loss/train': 1.1946580410003662} 11/07/2021 01:29:40 - INFO - __main__ - Step 30039: {'lr': 0.0004570144225049171, 'samples': 5767488, 'steps': 30038, 'loss/train': 1.7716975212097168} 11/07/2021 01:29:40 - INFO - __main__ - Step 30040: {'lr': 0.00045701144726973487, 'samples': 5767680, 'steps': 30039, 'loss/train': 1.5792721509933472} 11/07/2021 01:29:41 - INFO - __main__ - Step 30041: {'lr': 0.0004570084719412766, 'samples': 5767872, 'steps': 30040, 'loss/train': 1.5281107425689697} 11/07/2021 01:29:42 - INFO - __main__ - Step 30042: {'lr': 0.00045700549651954344, 'samples': 5768064, 'steps': 30041, 'loss/train': 1.7906538248062134} 11/07/2021 01:29:42 - INFO - __main__ - Step 30043: {'lr': 0.0004570025210045368, 'samples': 5768256, 'steps': 30042, 'loss/train': 1.53825044631958} 11/07/2021 01:29:42 - INFO - __main__ - Step 30044: {'lr': 0.00045699954539625803, 'samples': 5768448, 'steps': 30043, 'loss/train': 1.9995921850204468} 11/07/2021 01:29:43 - INFO - __main__ - Step 30045: {'lr': 0.0004569965696947085, 'samples': 5768640, 'steps': 30044, 'loss/train': 1.798302173614502} 11/07/2021 01:29:44 - INFO - __main__ - Step 30046: {'lr': 0.00045699359389988944, 'samples': 5768832, 'steps': 30045, 'loss/train': 1.6981176137924194} 11/07/2021 01:29:44 - INFO - __main__ - Step 30047: {'lr': 0.0004569906180118023, 'samples': 5769024, 'steps': 30046, 'loss/train': 1.1167492866516113} 11/07/2021 01:29:45 - INFO - __main__ - Step 30048: {'lr': 0.0004569876420304484, 'samples': 5769216, 'steps': 30047, 'loss/train': 1.6173962354660034} 11/07/2021 01:29:45 - INFO - __main__ - Step 30049: {'lr': 0.000456984665955829, 'samples': 5769408, 'steps': 30048, 'loss/train': 1.273735523223877} 11/07/2021 01:29:45 - INFO - __main__ - Step 30050: {'lr': 0.00045698168978794553, 'samples': 5769600, 'steps': 30049, 'loss/train': 1.3786624670028687} 11/07/2021 01:29:46 - INFO - __main__ - Step 30051: {'lr': 0.0004569787135267993, 'samples': 5769792, 'steps': 30050, 'loss/train': 0.8889176845550537} 11/07/2021 01:29:47 - INFO - __main__ - Step 30052: {'lr': 0.00045697573717239174, 'samples': 5769984, 'steps': 30051, 'loss/train': 1.804728388786316} 11/07/2021 01:29:47 - INFO - __main__ - Step 30053: {'lr': 0.0004569727607247239, 'samples': 5770176, 'steps': 30052, 'loss/train': 1.58971107006073} 11/07/2021 01:29:47 - INFO - __main__ - Step 30054: {'lr': 0.00045696978418379754, 'samples': 5770368, 'steps': 30053, 'loss/train': 1.5156389474868774} 11/07/2021 01:29:48 - INFO - __main__ - Step 30055: {'lr': 0.0004569668075496137, 'samples': 5770560, 'steps': 30054, 'loss/train': 0.4359086751937866} 11/07/2021 01:29:48 - INFO - __main__ - Step 30056: {'lr': 0.00045696383082217387, 'samples': 5770752, 'steps': 30055, 'loss/train': 1.3137329816818237} 11/07/2021 01:29:49 - INFO - __main__ - Step 30057: {'lr': 0.00045696085400147925, 'samples': 5770944, 'steps': 30056, 'loss/train': 1.8441475629806519} 11/07/2021 01:29:50 - INFO - __main__ - Step 30058: {'lr': 0.00045695787708753126, 'samples': 5771136, 'steps': 30057, 'loss/train': 0.7664819359779358} 11/07/2021 01:29:50 - INFO - __main__ - Step 30059: {'lr': 0.0004569549000803313, 'samples': 5771328, 'steps': 30058, 'loss/train': 1.503903865814209} 11/07/2021 01:29:50 - INFO - __main__ - Step 30060: {'lr': 0.00045695192297988066, 'samples': 5771520, 'steps': 30059, 'loss/train': 1.5076161623001099} 11/07/2021 01:29:51 - INFO - __main__ - Step 30061: {'lr': 0.00045694894578618064, 'samples': 5771712, 'steps': 30060, 'loss/train': 1.1038973331451416} 11/07/2021 01:29:52 - INFO - __main__ - Step 30062: {'lr': 0.00045694596849923263, 'samples': 5771904, 'steps': 30061, 'loss/train': 2.0637073516845703} 11/07/2021 01:29:52 - INFO - __main__ - Step 30063: {'lr': 0.0004569429911190379, 'samples': 5772096, 'steps': 30062, 'loss/train': 1.0061378479003906} 11/07/2021 01:29:52 - INFO - __main__ - Step 30064: {'lr': 0.00045694001364559797, 'samples': 5772288, 'steps': 30063, 'loss/train': 1.5570694208145142} 11/07/2021 01:29:53 - INFO - __main__ - Step 30065: {'lr': 0.00045693703607891403, 'samples': 5772480, 'steps': 30064, 'loss/train': 0.8967161774635315} 11/07/2021 01:29:53 - INFO - __main__ - Step 30066: {'lr': 0.0004569340584189874, 'samples': 5772672, 'steps': 30065, 'loss/train': 1.420305609703064} 11/07/2021 01:29:54 - INFO - __main__ - Step 30067: {'lr': 0.0004569310806658195, 'samples': 5772864, 'steps': 30066, 'loss/train': 0.3159783184528351} 11/07/2021 01:29:55 - INFO - __main__ - Step 30068: {'lr': 0.0004569281028194117, 'samples': 5773056, 'steps': 30067, 'loss/train': 1.5327019691467285} 11/07/2021 01:29:55 - INFO - __main__ - Step 30069: {'lr': 0.0004569251248797652, 'samples': 5773248, 'steps': 30068, 'loss/train': 2.2474756240844727} 11/07/2021 01:29:56 - INFO - __main__ - Step 30070: {'lr': 0.0004569221468468815, 'samples': 5773440, 'steps': 30069, 'loss/train': 2.7942593097686768} 11/07/2021 01:29:56 - INFO - __main__ - Step 30071: {'lr': 0.0004569191687207618, 'samples': 5773632, 'steps': 30070, 'loss/train': 2.573376417160034} 11/07/2021 01:29:56 - INFO - __main__ - Step 30072: {'lr': 0.0004569161905014076, 'samples': 5773824, 'steps': 30071, 'loss/train': 2.012526273727417} 11/07/2021 01:29:57 - INFO - __main__ - Step 30073: {'lr': 0.0004569132121888201, 'samples': 5774016, 'steps': 30072, 'loss/train': 1.594914197921753} 11/07/2021 01:29:58 - INFO - __main__ - Step 30074: {'lr': 0.0004569102337830007, 'samples': 5774208, 'steps': 30073, 'loss/train': 1.4602880477905273} 11/07/2021 01:29:58 - INFO - __main__ - Step 30075: {'lr': 0.00045690725528395077, 'samples': 5774400, 'steps': 30074, 'loss/train': 1.65486741065979} 11/07/2021 01:29:58 - INFO - __main__ - Step 30076: {'lr': 0.0004569042766916717, 'samples': 5774592, 'steps': 30075, 'loss/train': 2.147484540939331} 11/07/2021 01:29:59 - INFO - __main__ - Step 30077: {'lr': 0.0004569012980061646, 'samples': 5774784, 'steps': 30076, 'loss/train': 1.3971514701843262} 11/07/2021 01:30:00 - INFO - __main__ - Step 30078: {'lr': 0.00045689831922743107, 'samples': 5774976, 'steps': 30077, 'loss/train': 1.5956676006317139} 11/07/2021 01:30:00 - INFO - __main__ - Step 30079: {'lr': 0.0004568953403554723, 'samples': 5775168, 'steps': 30078, 'loss/train': 2.0979011058807373} 11/07/2021 01:30:00 - INFO - __main__ - Step 30080: {'lr': 0.0004568923613902897, 'samples': 5775360, 'steps': 30079, 'loss/train': 1.4927865266799927} 11/07/2021 01:30:01 - INFO - __main__ - Step 30081: {'lr': 0.0004568893823318846, 'samples': 5775552, 'steps': 30080, 'loss/train': 1.610865592956543} 11/07/2021 01:30:01 - INFO - __main__ - Step 30082: {'lr': 0.0004568864031802583, 'samples': 5775744, 'steps': 30081, 'loss/train': 1.672869324684143} 11/07/2021 01:30:02 - INFO - __main__ - Step 30083: {'lr': 0.00045688342393541227, 'samples': 5775936, 'steps': 30082, 'loss/train': 1.5688196420669556} 11/07/2021 01:30:03 - INFO - __main__ - Step 30084: {'lr': 0.00045688044459734766, 'samples': 5776128, 'steps': 30083, 'loss/train': 0.9201650023460388} 11/07/2021 01:30:03 - INFO - __main__ - Step 30085: {'lr': 0.000456877465166066, 'samples': 5776320, 'steps': 30084, 'loss/train': 1.4790961742401123} 11/07/2021 01:30:03 - INFO - __main__ - Step 30086: {'lr': 0.0004568744856415685, 'samples': 5776512, 'steps': 30085, 'loss/train': 1.6495736837387085} 11/07/2021 01:30:04 - INFO - __main__ - Step 30087: {'lr': 0.0004568715060238565, 'samples': 5776704, 'steps': 30086, 'loss/train': 1.9113773107528687} 11/07/2021 01:30:05 - INFO - __main__ - Step 30088: {'lr': 0.0004568685263129315, 'samples': 5776896, 'steps': 30087, 'loss/train': 1.5765928030014038} 11/07/2021 01:30:05 - INFO - __main__ - Step 30089: {'lr': 0.00045686554650879464, 'samples': 5777088, 'steps': 30088, 'loss/train': 2.0161173343658447} 11/07/2021 01:30:05 - INFO - __main__ - Step 30090: {'lr': 0.0004568625666114474, 'samples': 5777280, 'steps': 30089, 'loss/train': 1.174398422241211} 11/07/2021 01:30:06 - INFO - __main__ - Step 30091: {'lr': 0.00045685958662089113, 'samples': 5777472, 'steps': 30090, 'loss/train': 2.05692195892334} 11/07/2021 01:30:06 - INFO - __main__ - Step 30092: {'lr': 0.000456856606537127, 'samples': 5777664, 'steps': 30091, 'loss/train': 1.8893474340438843} 11/07/2021 01:30:06 - INFO - __main__ - Step 30093: {'lr': 0.00045685362636015657, 'samples': 5777856, 'steps': 30092, 'loss/train': 1.5812064409255981} 11/07/2021 01:30:07 - INFO - __main__ - Step 30094: {'lr': 0.00045685064608998107, 'samples': 5778048, 'steps': 30093, 'loss/train': 1.4328776597976685} 11/07/2021 01:30:08 - INFO - __main__ - Step 30095: {'lr': 0.00045684766572660185, 'samples': 5778240, 'steps': 30094, 'loss/train': 1.4008127450942993} 11/07/2021 01:30:08 - INFO - __main__ - Step 30096: {'lr': 0.0004568446852700203, 'samples': 5778432, 'steps': 30095, 'loss/train': 1.5155335664749146} 11/07/2021 01:30:08 - INFO - __main__ - Step 30097: {'lr': 0.00045684170472023766, 'samples': 5778624, 'steps': 30096, 'loss/train': 1.069875955581665} 11/07/2021 01:30:09 - INFO - __main__ - Step 30098: {'lr': 0.00045683872407725534, 'samples': 5778816, 'steps': 30097, 'loss/train': 1.413217306137085} 11/07/2021 01:30:10 - INFO - __main__ - Step 30099: {'lr': 0.00045683574334107473, 'samples': 5779008, 'steps': 30098, 'loss/train': 1.6160032749176025} 11/07/2021 01:30:10 - INFO - __main__ - Step 30100: {'lr': 0.00045683276251169713, 'samples': 5779200, 'steps': 30099, 'loss/train': 1.3631938695907593} 11/07/2021 01:30:10 - INFO - __main__ - Step 30101: {'lr': 0.00045682978158912384, 'samples': 5779392, 'steps': 30100, 'loss/train': 1.698129653930664} 11/07/2021 01:30:11 - INFO - __main__ - Step 30102: {'lr': 0.0004568268005733562, 'samples': 5779584, 'steps': 30101, 'loss/train': 1.546492576599121} 11/07/2021 01:30:11 - INFO - __main__ - Step 30103: {'lr': 0.0004568238194643958, 'samples': 5779776, 'steps': 30102, 'loss/train': 2.1197760105133057} 11/07/2021 01:30:12 - INFO - __main__ - Step 30104: {'lr': 0.00045682083826224356, 'samples': 5779968, 'steps': 30103, 'loss/train': 1.2861956357955933} 11/07/2021 01:30:13 - INFO - __main__ - Step 30105: {'lr': 0.00045681785696690113, 'samples': 5780160, 'steps': 30104, 'loss/train': 1.3815172910690308} 11/07/2021 01:30:13 - INFO - __main__ - Step 30106: {'lr': 0.0004568148755783698, 'samples': 5780352, 'steps': 30105, 'loss/train': 1.5989118814468384} 11/07/2021 01:30:13 - INFO - __main__ - Step 30107: {'lr': 0.00045681189409665083, 'samples': 5780544, 'steps': 30106, 'loss/train': 1.6340943574905396} 11/07/2021 01:30:14 - INFO - __main__ - Step 30108: {'lr': 0.00045680891252174557, 'samples': 5780736, 'steps': 30107, 'loss/train': 0.8320098519325256} 11/07/2021 01:30:15 - INFO - __main__ - Step 30109: {'lr': 0.0004568059308536554, 'samples': 5780928, 'steps': 30108, 'loss/train': 2.0117850303649902} 11/07/2021 01:30:15 - INFO - __main__ - Step 30110: {'lr': 0.00045680294909238175, 'samples': 5781120, 'steps': 30109, 'loss/train': 1.3014116287231445} 11/07/2021 01:30:16 - INFO - __main__ - Step 30111: {'lr': 0.00045679996723792585, 'samples': 5781312, 'steps': 30110, 'loss/train': 1.8772363662719727} 11/07/2021 01:30:16 - INFO - __main__ - Step 30112: {'lr': 0.00045679698529028906, 'samples': 5781504, 'steps': 30111, 'loss/train': 1.3997163772583008} 11/07/2021 01:30:16 - INFO - __main__ - Step 30113: {'lr': 0.00045679400324947274, 'samples': 5781696, 'steps': 30112, 'loss/train': 1.5095996856689453} 11/07/2021 01:30:17 - INFO - __main__ - Step 30114: {'lr': 0.00045679102111547825, 'samples': 5781888, 'steps': 30113, 'loss/train': 1.160984754562378} 11/07/2021 01:30:18 - INFO - __main__ - Step 30115: {'lr': 0.00045678803888830687, 'samples': 5782080, 'steps': 30114, 'loss/train': 1.4564838409423828} 11/07/2021 01:30:18 - INFO - __main__ - Step 30116: {'lr': 0.0004567850565679601, 'samples': 5782272, 'steps': 30115, 'loss/train': 1.005316972732544} 11/07/2021 01:30:19 - INFO - __main__ - Step 30117: {'lr': 0.00045678207415443913, 'samples': 5782464, 'steps': 30116, 'loss/train': 1.553978443145752} 11/07/2021 01:30:19 - INFO - __main__ - Step 30118: {'lr': 0.0004567790916477453, 'samples': 5782656, 'steps': 30117, 'loss/train': 1.6825364828109741} 11/07/2021 01:30:19 - INFO - __main__ - Step 30119: {'lr': 0.00045677610904788004, 'samples': 5782848, 'steps': 30118, 'loss/train': 2.8823390007019043} 11/07/2021 01:30:20 - INFO - __main__ - Step 30120: {'lr': 0.00045677312635484466, 'samples': 5783040, 'steps': 30119, 'loss/train': 1.8276495933532715} 11/07/2021 01:30:21 - INFO - __main__ - Step 30121: {'lr': 0.00045677014356864043, 'samples': 5783232, 'steps': 30120, 'loss/train': 1.7994734048843384} 11/07/2021 01:30:21 - INFO - __main__ - Step 30122: {'lr': 0.0004567671606892688, 'samples': 5783424, 'steps': 30121, 'loss/train': 1.3283331394195557} 11/07/2021 01:30:21 - INFO - __main__ - Step 30123: {'lr': 0.00045676417771673116, 'samples': 5783616, 'steps': 30122, 'loss/train': 1.4653687477111816} 11/07/2021 01:30:22 - INFO - __main__ - Step 30124: {'lr': 0.0004567611946510287, 'samples': 5783808, 'steps': 30123, 'loss/train': 1.199395775794983} 11/07/2021 01:30:22 - INFO - __main__ - Step 30125: {'lr': 0.00045675821149216285, 'samples': 5784000, 'steps': 30124, 'loss/train': 1.7691177129745483} 11/07/2021 01:30:24 - INFO - __main__ - Step 30126: {'lr': 0.00045675522824013495, 'samples': 5784192, 'steps': 30125, 'loss/train': 1.8721429109573364} 11/07/2021 01:30:24 - INFO - __main__ - Step 30127: {'lr': 0.00045675224489494633, 'samples': 5784384, 'steps': 30126, 'loss/train': 1.6406543254852295} 11/07/2021 01:30:24 - INFO - __main__ - Step 30128: {'lr': 0.00045674926145659834, 'samples': 5784576, 'steps': 30127, 'loss/train': 2.016087293624878} 11/07/2021 01:30:25 - INFO - __main__ - Step 30129: {'lr': 0.0004567462779250923, 'samples': 5784768, 'steps': 30128, 'loss/train': 1.9647272825241089} 11/07/2021 01:30:25 - INFO - __main__ - Step 30130: {'lr': 0.0004567432943004296, 'samples': 5784960, 'steps': 30129, 'loss/train': 2.230543375015259} 11/07/2021 01:30:26 - INFO - __main__ - Step 30131: {'lr': 0.00045674031058261157, 'samples': 5785152, 'steps': 30130, 'loss/train': 1.6160472631454468} 11/07/2021 01:30:27 - INFO - __main__ - Step 30132: {'lr': 0.0004567373267716395, 'samples': 5785344, 'steps': 30131, 'loss/train': 1.335902452468872} 11/07/2021 01:30:27 - INFO - __main__ - Step 30133: {'lr': 0.0004567343428675148, 'samples': 5785536, 'steps': 30132, 'loss/train': 3.5647571086883545} 11/07/2021 01:30:27 - INFO - __main__ - Step 30134: {'lr': 0.00045673135887023874, 'samples': 5785728, 'steps': 30133, 'loss/train': 1.5163437128067017} 11/07/2021 01:30:28 - INFO - __main__ - Step 30135: {'lr': 0.0004567283747798128, 'samples': 5785920, 'steps': 30134, 'loss/train': 0.4308475852012634} 11/07/2021 01:30:28 - INFO - __main__ - Step 30136: {'lr': 0.0004567253905962383, 'samples': 5786112, 'steps': 30135, 'loss/train': 0.8566787838935852} 11/07/2021 01:30:29 - INFO - __main__ - Step 30137: {'lr': 0.00045672240631951645, 'samples': 5786304, 'steps': 30136, 'loss/train': 2.039283275604248} 11/07/2021 01:30:29 - INFO - __main__ - Step 30138: {'lr': 0.0004567194219496487, 'samples': 5786496, 'steps': 30137, 'loss/train': 2.0596466064453125} 11/07/2021 01:30:30 - INFO - __main__ - Step 30139: {'lr': 0.0004567164374866363, 'samples': 5786688, 'steps': 30138, 'loss/train': 1.325622797012329} 11/07/2021 01:30:30 - INFO - __main__ - Step 30140: {'lr': 0.00045671345293048075, 'samples': 5786880, 'steps': 30139, 'loss/train': 2.0110180377960205} 11/07/2021 01:30:30 - INFO - __main__ - Step 30141: {'lr': 0.00045671046828118324, 'samples': 5787072, 'steps': 30140, 'loss/train': 1.7381306886672974} 11/07/2021 01:30:32 - INFO - __main__ - Step 30142: {'lr': 0.0004567074835387452, 'samples': 5787264, 'steps': 30141, 'loss/train': 1.5954102277755737} 11/07/2021 01:30:32 - INFO - __main__ - Step 30143: {'lr': 0.000456704498703168, 'samples': 5787456, 'steps': 30142, 'loss/train': 1.3785072565078735} 11/07/2021 01:30:32 - INFO - __main__ - Step 30144: {'lr': 0.0004567015137744529, 'samples': 5787648, 'steps': 30143, 'loss/train': 1.4675315618515015} 11/07/2021 01:30:33 - INFO - __main__ - Step 30145: {'lr': 0.00045669852875260134, 'samples': 5787840, 'steps': 30144, 'loss/train': 1.5212641954421997} 11/07/2021 01:30:33 - INFO - __main__ - Step 30146: {'lr': 0.00045669554363761454, 'samples': 5788032, 'steps': 30145, 'loss/train': 1.521730661392212} 11/07/2021 01:30:34 - INFO - __main__ - Step 30147: {'lr': 0.0004566925584294939, 'samples': 5788224, 'steps': 30146, 'loss/train': 1.2389037609100342} 11/07/2021 01:30:34 - INFO - __main__ - Step 30148: {'lr': 0.00045668957312824086, 'samples': 5788416, 'steps': 30147, 'loss/train': 1.8396090269088745} 11/07/2021 01:30:35 - INFO - __main__ - Step 30149: {'lr': 0.00045668658773385663, 'samples': 5788608, 'steps': 30148, 'loss/train': 1.3113459348678589} 11/07/2021 01:30:35 - INFO - __main__ - Step 30150: {'lr': 0.00045668360224634263, 'samples': 5788800, 'steps': 30149, 'loss/train': 1.4173098802566528} 11/07/2021 01:30:36 - INFO - __main__ - Step 30151: {'lr': 0.00045668061666570027, 'samples': 5788992, 'steps': 30150, 'loss/train': 1.859587550163269} 11/07/2021 01:30:36 - INFO - __main__ - Step 30152: {'lr': 0.0004566776309919307, 'samples': 5789184, 'steps': 30151, 'loss/train': 1.4605780839920044} 11/07/2021 01:30:37 - INFO - __main__ - Step 30153: {'lr': 0.0004566746452250354, 'samples': 5789376, 'steps': 30152, 'loss/train': 1.5716723203659058} 11/07/2021 01:30:37 - INFO - __main__ - Step 30154: {'lr': 0.00045667165936501573, 'samples': 5789568, 'steps': 30153, 'loss/train': 1.8349000215530396} 11/07/2021 01:30:38 - INFO - __main__ - Step 30155: {'lr': 0.000456668673411873, 'samples': 5789760, 'steps': 30154, 'loss/train': 1.5699809789657593} 11/07/2021 01:30:38 - INFO - __main__ - Step 30156: {'lr': 0.00045666568736560853, 'samples': 5789952, 'steps': 30155, 'loss/train': 1.7113829851150513} 11/07/2021 01:30:38 - INFO - __main__ - Step 30157: {'lr': 0.0004566627012262238, 'samples': 5790144, 'steps': 30156, 'loss/train': 1.5282222032546997} 11/07/2021 01:30:39 - INFO - __main__ - Step 30158: {'lr': 0.0004566597149937199, 'samples': 5790336, 'steps': 30157, 'loss/train': 1.5575758218765259} 11/07/2021 01:30:40 - INFO - __main__ - Step 30159: {'lr': 0.00045665672866809835, 'samples': 5790528, 'steps': 30158, 'loss/train': 1.7184828519821167} 11/07/2021 01:30:40 - INFO - __main__ - Step 30160: {'lr': 0.0004566537422493605, 'samples': 5790720, 'steps': 30159, 'loss/train': 1.5131640434265137} 11/07/2021 01:30:40 - INFO - __main__ - Step 30161: {'lr': 0.00045665075573750764, 'samples': 5790912, 'steps': 30160, 'loss/train': 1.4869385957717896} 11/07/2021 01:30:41 - INFO - __main__ - Step 30162: {'lr': 0.00045664776913254115, 'samples': 5791104, 'steps': 30161, 'loss/train': 3.2416024208068848} 11/07/2021 01:30:42 - INFO - __main__ - Step 30163: {'lr': 0.0004566447824344624, 'samples': 5791296, 'steps': 30162, 'loss/train': 1.4250823259353638} 11/07/2021 01:30:42 - INFO - __main__ - Step 30164: {'lr': 0.00045664179564327266, 'samples': 5791488, 'steps': 30163, 'loss/train': 1.4880365133285522} 11/07/2021 01:30:42 - INFO - __main__ - Step 30165: {'lr': 0.00045663880875897325, 'samples': 5791680, 'steps': 30164, 'loss/train': 1.4041705131530762} 11/07/2021 01:30:43 - INFO - __main__ - Step 30166: {'lr': 0.00045663582178156564, 'samples': 5791872, 'steps': 30165, 'loss/train': 1.2579805850982666} 11/07/2021 01:30:43 - INFO - __main__ - Step 30167: {'lr': 0.00045663283471105115, 'samples': 5792064, 'steps': 30166, 'loss/train': 1.209857702255249} 11/07/2021 01:30:45 - INFO - __main__ - Step 30168: {'lr': 0.00045662984754743106, 'samples': 5792256, 'steps': 30167, 'loss/train': 1.6755338907241821} 11/07/2021 01:30:45 - INFO - __main__ - Step 30169: {'lr': 0.00045662686029070674, 'samples': 5792448, 'steps': 30168, 'loss/train': 1.4897969961166382} 11/07/2021 01:30:45 - INFO - __main__ - Step 30170: {'lr': 0.0004566238729408796, 'samples': 5792640, 'steps': 30169, 'loss/train': 0.8648298382759094} 11/07/2021 01:30:46 - INFO - __main__ - Step 30171: {'lr': 0.00045662088549795087, 'samples': 5792832, 'steps': 30170, 'loss/train': 1.545324444770813} 11/07/2021 01:30:46 - INFO - __main__ - Step 30172: {'lr': 0.000456617897961922, 'samples': 5793024, 'steps': 30171, 'loss/train': 1.4268949031829834} 11/07/2021 01:30:47 - INFO - __main__ - Step 30173: {'lr': 0.00045661491033279427, 'samples': 5793216, 'steps': 30172, 'loss/train': 1.5952434539794922} 11/07/2021 01:30:48 - INFO - __main__ - Step 30174: {'lr': 0.00045661192261056905, 'samples': 5793408, 'steps': 30173, 'loss/train': 1.5790772438049316} 11/07/2021 01:30:48 - INFO - __main__ - Step 30175: {'lr': 0.00045660893479524767, 'samples': 5793600, 'steps': 30174, 'loss/train': 2.000520706176758} 11/07/2021 01:30:48 - INFO - __main__ - Step 30176: {'lr': 0.00045660594688683154, 'samples': 5793792, 'steps': 30175, 'loss/train': 1.4284133911132812} 11/07/2021 01:30:49 - INFO - __main__ - Step 30177: {'lr': 0.00045660295888532196, 'samples': 5793984, 'steps': 30176, 'loss/train': 1.7523822784423828} 11/07/2021 01:30:50 - INFO - __main__ - Step 30178: {'lr': 0.00045659997079072024, 'samples': 5794176, 'steps': 30177, 'loss/train': 1.5193082094192505} 11/07/2021 01:30:50 - INFO - __main__ - Step 30179: {'lr': 0.00045659698260302773, 'samples': 5794368, 'steps': 30178, 'loss/train': 1.563475251197815} 11/07/2021 01:30:50 - INFO - __main__ - Step 30180: {'lr': 0.00045659399432224583, 'samples': 5794560, 'steps': 30179, 'loss/train': 1.3032660484313965} 11/07/2021 01:30:51 - INFO - __main__ - Step 30181: {'lr': 0.00045659100594837586, 'samples': 5794752, 'steps': 30180, 'loss/train': 1.5408105850219727} 11/07/2021 01:30:51 - INFO - __main__ - Step 30182: {'lr': 0.0004565880174814192, 'samples': 5794944, 'steps': 30181, 'loss/train': 1.7648100852966309} 11/07/2021 01:30:51 - INFO - __main__ - Step 30183: {'lr': 0.0004565850289213772, 'samples': 5795136, 'steps': 30182, 'loss/train': 1.5357639789581299} 11/07/2021 01:30:53 - INFO - __main__ - Step 30184: {'lr': 0.0004565820402682511, 'samples': 5795328, 'steps': 30183, 'loss/train': 1.9818118810653687} 11/07/2021 01:30:53 - INFO - __main__ - Step 30185: {'lr': 0.00045657905152204236, 'samples': 5795520, 'steps': 30184, 'loss/train': 0.6554961204528809} 11/07/2021 01:30:53 - INFO - __main__ - Step 30186: {'lr': 0.0004565760626827523, 'samples': 5795712, 'steps': 30185, 'loss/train': 1.025604248046875} 11/07/2021 01:30:54 - INFO - __main__ - Step 30187: {'lr': 0.00045657307375038226, 'samples': 5795904, 'steps': 30186, 'loss/train': 1.1355433464050293} 11/07/2021 01:30:54 - INFO - __main__ - Step 30188: {'lr': 0.00045657008472493356, 'samples': 5796096, 'steps': 30187, 'loss/train': 1.4717738628387451} 11/07/2021 01:30:55 - INFO - __main__ - Step 30189: {'lr': 0.0004565670956064075, 'samples': 5796288, 'steps': 30188, 'loss/train': 1.4684523344039917} 11/07/2021 01:30:55 - INFO - __main__ - Step 30190: {'lr': 0.00045656410639480563, 'samples': 5796480, 'steps': 30189, 'loss/train': 1.4059257507324219} 11/07/2021 01:30:56 - INFO - __main__ - Step 30191: {'lr': 0.00045656111709012906, 'samples': 5796672, 'steps': 30190, 'loss/train': 1.1448701620101929} 11/07/2021 01:30:56 - INFO - __main__ - Step 30192: {'lr': 0.00045655812769237927, 'samples': 5796864, 'steps': 30191, 'loss/train': 1.4939942359924316} 11/07/2021 01:30:57 - INFO - __main__ - Step 30193: {'lr': 0.00045655513820155755, 'samples': 5797056, 'steps': 30192, 'loss/train': 4.016806602478027} 11/07/2021 01:30:58 - INFO - __main__ - Step 30194: {'lr': 0.00045655214861766525, 'samples': 5797248, 'steps': 30193, 'loss/train': 1.5990046262741089} 11/07/2021 01:30:58 - INFO - __main__ - Step 30195: {'lr': 0.0004565491589407038, 'samples': 5797440, 'steps': 30194, 'loss/train': 1.4188684225082397} 11/07/2021 01:30:59 - INFO - __main__ - Step 30196: {'lr': 0.0004565461691706745, 'samples': 5797632, 'steps': 30195, 'loss/train': 1.6000747680664062} 11/07/2021 01:30:59 - INFO - __main__ - Step 30197: {'lr': 0.0004565431793075786, 'samples': 5797824, 'steps': 30196, 'loss/train': 1.141895055770874} 11/07/2021 01:30:59 - INFO - __main__ - Step 30198: {'lr': 0.0004565401893514176, 'samples': 5798016, 'steps': 30197, 'loss/train': 1.2154690027236938} 11/07/2021 01:31:00 - INFO - __main__ - Step 30199: {'lr': 0.0004565371993021927, 'samples': 5798208, 'steps': 30198, 'loss/train': 1.5073283910751343} 11/07/2021 01:31:01 - INFO - __main__ - Step 30200: {'lr': 0.00045653420915990546, 'samples': 5798400, 'steps': 30199, 'loss/train': 1.5848606824874878} 11/07/2021 01:31:01 - INFO - __main__ - Step 30201: {'lr': 0.000456531218924557, 'samples': 5798592, 'steps': 30200, 'loss/train': 1.862545371055603} 11/07/2021 01:31:01 - INFO - __main__ - Step 30202: {'lr': 0.0004565282285961488, 'samples': 5798784, 'steps': 30201, 'loss/train': 1.546978235244751} 11/07/2021 01:31:02 - INFO - __main__ - Step 30203: {'lr': 0.0004565252381746821, 'samples': 5798976, 'steps': 30202, 'loss/train': 1.5877180099487305} 11/07/2021 01:31:02 - INFO - __main__ - Step 30204: {'lr': 0.0004565222476601584, 'samples': 5799168, 'steps': 30203, 'loss/train': 1.850545883178711} 11/07/2021 01:31:03 - INFO - __main__ - Step 30205: {'lr': 0.0004565192570525789, 'samples': 5799360, 'steps': 30204, 'loss/train': 1.8471728563308716} 11/07/2021 01:31:03 - INFO - __main__ - Step 30206: {'lr': 0.00045651626635194497, 'samples': 5799552, 'steps': 30205, 'loss/train': 1.6950767040252686} 11/07/2021 01:31:04 - INFO - __main__ - Step 30207: {'lr': 0.0004565132755582581, 'samples': 5799744, 'steps': 30206, 'loss/train': 1.3628997802734375} 11/07/2021 01:31:04 - INFO - __main__ - Step 30208: {'lr': 0.0004565102846715195, 'samples': 5799936, 'steps': 30207, 'loss/train': 1.5977096557617188} 11/07/2021 01:31:04 - INFO - __main__ - Step 30209: {'lr': 0.0004565072936917305, 'samples': 5800128, 'steps': 30208, 'loss/train': 1.0420420169830322} 11/07/2021 01:31:06 - INFO - __main__ - Step 30210: {'lr': 0.0004565043026188926, 'samples': 5800320, 'steps': 30209, 'loss/train': 1.3461986780166626} 11/07/2021 01:31:06 - INFO - __main__ - Step 30211: {'lr': 0.000456501311453007, 'samples': 5800512, 'steps': 30210, 'loss/train': 1.7356064319610596} 11/07/2021 01:31:06 - INFO - __main__ - Step 30212: {'lr': 0.00045649832019407504, 'samples': 5800704, 'steps': 30211, 'loss/train': 1.5009082555770874} 11/07/2021 01:31:07 - INFO - __main__ - Step 30213: {'lr': 0.0004564953288420982, 'samples': 5800896, 'steps': 30212, 'loss/train': 1.2765473127365112} 11/07/2021 01:31:07 - INFO - __main__ - Step 30214: {'lr': 0.00045649233739707774, 'samples': 5801088, 'steps': 30213, 'loss/train': 1.5545157194137573} 11/07/2021 01:31:07 - INFO - __main__ - Step 30215: {'lr': 0.00045648934585901496, 'samples': 5801280, 'steps': 30214, 'loss/train': 1.4928683042526245} 11/07/2021 01:31:08 - INFO - __main__ - Step 30216: {'lr': 0.0004564863542279113, 'samples': 5801472, 'steps': 30215, 'loss/train': 1.6348497867584229} 11/07/2021 01:31:09 - INFO - __main__ - Step 30217: {'lr': 0.0004564833625037681, 'samples': 5801664, 'steps': 30216, 'loss/train': 1.4835448265075684} 11/07/2021 01:31:09 - INFO - __main__ - Step 30218: {'lr': 0.00045648037068658667, 'samples': 5801856, 'steps': 30217, 'loss/train': 1.124776005744934} 11/07/2021 01:31:09 - INFO - __main__ - Step 30219: {'lr': 0.00045647737877636834, 'samples': 5802048, 'steps': 30218, 'loss/train': 1.5227097272872925} 11/07/2021 01:31:10 - INFO - __main__ - Step 30220: {'lr': 0.0004564743867731145, 'samples': 5802240, 'steps': 30219, 'loss/train': 1.4747366905212402} 11/07/2021 01:31:11 - INFO - __main__ - Step 30221: {'lr': 0.0004564713946768265, 'samples': 5802432, 'steps': 30220, 'loss/train': 1.6206881999969482} 11/07/2021 01:31:11 - INFO - __main__ - Step 30222: {'lr': 0.0004564684024875057, 'samples': 5802624, 'steps': 30221, 'loss/train': 1.8404051065444946} 11/07/2021 01:31:12 - INFO - __main__ - Step 30223: {'lr': 0.0004564654102051534, 'samples': 5802816, 'steps': 30222, 'loss/train': 0.8817972540855408} 11/07/2021 01:31:12 - INFO - __main__ - Step 30224: {'lr': 0.000456462417829771, 'samples': 5803008, 'steps': 30223, 'loss/train': 1.4281342029571533} 11/07/2021 01:31:12 - INFO - __main__ - Step 30225: {'lr': 0.0004564594253613598, 'samples': 5803200, 'steps': 30224, 'loss/train': 1.5440261363983154} 11/07/2021 01:31:13 - INFO - __main__ - Step 30226: {'lr': 0.0004564564327999211, 'samples': 5803392, 'steps': 30225, 'loss/train': 1.7672604322433472} 11/07/2021 01:31:14 - INFO - __main__ - Step 30227: {'lr': 0.00045645344014545643, 'samples': 5803584, 'steps': 30226, 'loss/train': 1.117114543914795} 11/07/2021 01:31:14 - INFO - __main__ - Step 30228: {'lr': 0.00045645044739796694, 'samples': 5803776, 'steps': 30227, 'loss/train': 1.7497179508209229} 11/07/2021 01:31:14 - INFO - __main__ - Step 30229: {'lr': 0.00045644745455745414, 'samples': 5803968, 'steps': 30228, 'loss/train': 1.7879761457443237} 11/07/2021 01:31:15 - INFO - __main__ - Step 30230: {'lr': 0.0004564444616239193, 'samples': 5804160, 'steps': 30229, 'loss/train': 1.0915133953094482} 11/07/2021 01:31:16 - INFO - __main__ - Step 30231: {'lr': 0.0004564414685973637, 'samples': 5804352, 'steps': 30230, 'loss/train': 1.574084758758545} 11/07/2021 01:31:16 - INFO - __main__ - Step 30232: {'lr': 0.0004564384754777888, 'samples': 5804544, 'steps': 30231, 'loss/train': 1.618294358253479} 11/07/2021 01:31:16 - INFO - __main__ - Step 30233: {'lr': 0.00045643548226519587, 'samples': 5804736, 'steps': 30232, 'loss/train': 1.8467299938201904} 11/07/2021 01:31:17 - INFO - __main__ - Step 30234: {'lr': 0.00045643248895958636, 'samples': 5804928, 'steps': 30233, 'loss/train': 1.3164348602294922} 11/07/2021 01:31:17 - INFO - __main__ - Step 30235: {'lr': 0.00045642949556096146, 'samples': 5805120, 'steps': 30234, 'loss/train': 1.2188383340835571} 11/07/2021 01:31:17 - INFO - __main__ - Step 30236: {'lr': 0.0004564265020693227, 'samples': 5805312, 'steps': 30235, 'loss/train': 1.6616016626358032} 11/07/2021 01:31:18 - INFO - __main__ - Step 30237: {'lr': 0.0004564235084846713, 'samples': 5805504, 'steps': 30236, 'loss/train': 1.7576926946640015} 11/07/2021 01:31:19 - INFO - __main__ - Step 30238: {'lr': 0.00045642051480700873, 'samples': 5805696, 'steps': 30237, 'loss/train': 1.550876259803772} 11/07/2021 01:31:19 - INFO - __main__ - Step 30239: {'lr': 0.0004564175210363362, 'samples': 5805888, 'steps': 30238, 'loss/train': 1.633492350578308} 11/07/2021 01:31:20 - INFO - __main__ - Step 30240: {'lr': 0.00045641452717265507, 'samples': 5806080, 'steps': 30239, 'loss/train': 1.3447606563568115} 11/07/2021 01:31:20 - INFO - __main__ - Step 30241: {'lr': 0.00045641153321596687, 'samples': 5806272, 'steps': 30240, 'loss/train': 1.3774157762527466} 11/07/2021 01:31:21 - INFO - __main__ - Step 30242: {'lr': 0.0004564085391662727, 'samples': 5806464, 'steps': 30241, 'loss/train': 1.6286267042160034} 11/07/2021 01:31:21 - INFO - __main__ - Step 30243: {'lr': 0.00045640554502357413, 'samples': 5806656, 'steps': 30242, 'loss/train': 1.4465882778167725} 11/07/2021 01:31:22 - INFO - __main__ - Step 30244: {'lr': 0.0004564025507878723, 'samples': 5806848, 'steps': 30243, 'loss/train': 1.7390762567520142} 11/07/2021 01:31:22 - INFO - __main__ - Step 30245: {'lr': 0.00045639955645916875, 'samples': 5807040, 'steps': 30244, 'loss/train': 0.9178106188774109} 11/07/2021 01:31:22 - INFO - __main__ - Step 30246: {'lr': 0.0004563965620374647, 'samples': 5807232, 'steps': 30245, 'loss/train': 1.531065583229065} 11/07/2021 01:31:23 - INFO - __main__ - Step 30247: {'lr': 0.0004563935675227615, 'samples': 5807424, 'steps': 30246, 'loss/train': 1.9159185886383057} 11/07/2021 01:31:24 - INFO - __main__ - Step 30248: {'lr': 0.00045639057291506065, 'samples': 5807616, 'steps': 30247, 'loss/train': 1.6641156673431396} 11/07/2021 01:31:24 - INFO - __main__ - Step 30249: {'lr': 0.0004563875782143633, 'samples': 5807808, 'steps': 30248, 'loss/train': 1.6026545763015747} 11/07/2021 01:31:25 - INFO - __main__ - Step 30250: {'lr': 0.000456384583420671, 'samples': 5808000, 'steps': 30249, 'loss/train': 1.2319056987762451} 11/07/2021 01:31:25 - INFO - __main__ - Step 30251: {'lr': 0.0004563815885339849, 'samples': 5808192, 'steps': 30250, 'loss/train': 1.30437171459198} 11/07/2021 01:31:25 - INFO - __main__ - Step 30252: {'lr': 0.00045637859355430647, 'samples': 5808384, 'steps': 30251, 'loss/train': 1.471893548965454} 11/07/2021 01:31:26 - INFO - __main__ - Step 30253: {'lr': 0.000456375598481637, 'samples': 5808576, 'steps': 30252, 'loss/train': 1.7426419258117676} 11/07/2021 01:31:27 - INFO - __main__ - Step 30254: {'lr': 0.00045637260331597793, 'samples': 5808768, 'steps': 30253, 'loss/train': 1.6000956296920776} 11/07/2021 01:31:27 - INFO - __main__ - Step 30255: {'lr': 0.00045636960805733054, 'samples': 5808960, 'steps': 30254, 'loss/train': 1.3028124570846558} 11/07/2021 01:31:27 - INFO - __main__ - Step 30256: {'lr': 0.0004563666127056961, 'samples': 5809152, 'steps': 30255, 'loss/train': 1.526604413986206} 11/07/2021 01:31:28 - INFO - __main__ - Step 30257: {'lr': 0.0004563636172610761, 'samples': 5809344, 'steps': 30256, 'loss/train': 1.6551815271377563} 11/07/2021 01:31:29 - INFO - __main__ - Step 30258: {'lr': 0.00045636062172347186, 'samples': 5809536, 'steps': 30257, 'loss/train': 0.7848110198974609} 11/07/2021 01:31:29 - INFO - __main__ - Step 30259: {'lr': 0.0004563576260928847, 'samples': 5809728, 'steps': 30258, 'loss/train': 1.6645081043243408} 11/07/2021 01:31:30 - INFO - __main__ - Step 30260: {'lr': 0.000456354630369316, 'samples': 5809920, 'steps': 30259, 'loss/train': 1.7983936071395874} 11/07/2021 01:31:30 - INFO - __main__ - Step 30261: {'lr': 0.00045635163455276707, 'samples': 5810112, 'steps': 30260, 'loss/train': 1.5694831609725952} 11/07/2021 01:31:30 - INFO - __main__ - Step 30262: {'lr': 0.0004563486386432393, 'samples': 5810304, 'steps': 30261, 'loss/train': 1.768189549446106} 11/07/2021 01:31:31 - INFO - __main__ - Step 30263: {'lr': 0.00045634564264073396, 'samples': 5810496, 'steps': 30262, 'loss/train': 0.912808358669281} 11/07/2021 01:31:32 - INFO - __main__ - Step 30264: {'lr': 0.0004563426465452525, 'samples': 5810688, 'steps': 30263, 'loss/train': 1.8347886800765991} 11/07/2021 01:31:32 - INFO - __main__ - Step 30265: {'lr': 0.00045633965035679614, 'samples': 5810880, 'steps': 30264, 'loss/train': 1.2841131687164307} 11/07/2021 01:31:33 - INFO - __main__ - Step 30266: {'lr': 0.0004563366540753664, 'samples': 5811072, 'steps': 30265, 'loss/train': 1.7946243286132812} 11/07/2021 01:31:33 - INFO - __main__ - Step 30267: {'lr': 0.00045633365770096456, 'samples': 5811264, 'steps': 30266, 'loss/train': 1.5827088356018066} 11/07/2021 01:31:33 - INFO - __main__ - Step 30268: {'lr': 0.000456330661233592, 'samples': 5811456, 'steps': 30267, 'loss/train': 1.5512784719467163} 11/07/2021 01:31:35 - INFO - __main__ - Step 30269: {'lr': 0.00045632766467324995, 'samples': 5811648, 'steps': 30268, 'loss/train': 1.6750398874282837} 11/07/2021 01:31:35 - INFO - __main__ - Step 30270: {'lr': 0.0004563246680199398, 'samples': 5811840, 'steps': 30269, 'loss/train': 1.5613741874694824} 11/07/2021 01:31:36 - INFO - __main__ - Step 30271: {'lr': 0.000456321671273663, 'samples': 5812032, 'steps': 30270, 'loss/train': 1.44844651222229} 11/07/2021 01:31:36 - INFO - __main__ - Step 30272: {'lr': 0.00045631867443442084, 'samples': 5812224, 'steps': 30271, 'loss/train': 1.8668856620788574} 11/07/2021 01:31:36 - INFO - __main__ - Step 30273: {'lr': 0.00045631567750221465, 'samples': 5812416, 'steps': 30272, 'loss/train': 1.2790777683258057} 11/07/2021 01:31:37 - INFO - __main__ - Step 30274: {'lr': 0.0004563126804770458, 'samples': 5812608, 'steps': 30273, 'loss/train': 1.2653971910476685} 11/07/2021 01:31:38 - INFO - __main__ - Step 30275: {'lr': 0.00045630968335891564, 'samples': 5812800, 'steps': 30274, 'loss/train': 0.49572262167930603} 11/07/2021 01:31:38 - INFO - __main__ - Step 30276: {'lr': 0.00045630668614782553, 'samples': 5812992, 'steps': 30275, 'loss/train': 1.835469365119934} 11/07/2021 01:31:39 - INFO - __main__ - Step 30277: {'lr': 0.0004563036888437768, 'samples': 5813184, 'steps': 30276, 'loss/train': 1.9106202125549316} 11/07/2021 01:31:39 - INFO - __main__ - Step 30278: {'lr': 0.0004563006914467709, 'samples': 5813376, 'steps': 30277, 'loss/train': 1.3554658889770508} 11/07/2021 01:31:39 - INFO - __main__ - Step 30279: {'lr': 0.000456297693956809, 'samples': 5813568, 'steps': 30278, 'loss/train': 1.355484127998352} 11/07/2021 01:31:40 - INFO - __main__ - Step 30280: {'lr': 0.0004562946963738925, 'samples': 5813760, 'steps': 30279, 'loss/train': 1.4761924743652344} 11/07/2021 01:31:41 - INFO - __main__ - Step 30281: {'lr': 0.0004562916986980229, 'samples': 5813952, 'steps': 30280, 'loss/train': 2.2222375869750977} 11/07/2021 01:31:41 - INFO - __main__ - Step 30282: {'lr': 0.0004562887009292014, 'samples': 5814144, 'steps': 30281, 'loss/train': 1.4915366172790527} 11/07/2021 01:31:41 - INFO - __main__ - Step 30283: {'lr': 0.0004562857030674293, 'samples': 5814336, 'steps': 30282, 'loss/train': 1.5470994710922241} 11/07/2021 01:31:42 - INFO - __main__ - Step 30284: {'lr': 0.0004562827051127082, 'samples': 5814528, 'steps': 30283, 'loss/train': 1.2854667901992798} 11/07/2021 01:31:43 - INFO - __main__ - Step 30285: {'lr': 0.0004562797070650392, 'samples': 5814720, 'steps': 30284, 'loss/train': 1.4260727167129517} 11/07/2021 01:31:43 - INFO - __main__ - Step 30286: {'lr': 0.00045627670892442376, 'samples': 5814912, 'steps': 30285, 'loss/train': 1.5417121648788452} 11/07/2021 01:31:43 - INFO - __main__ - Step 30287: {'lr': 0.0004562737106908632, 'samples': 5815104, 'steps': 30286, 'loss/train': 0.8534297943115234} 11/07/2021 01:31:44 - INFO - __main__ - Step 30288: {'lr': 0.00045627071236435896, 'samples': 5815296, 'steps': 30287, 'loss/train': 1.6247920989990234} 11/07/2021 01:31:44 - INFO - __main__ - Step 30289: {'lr': 0.0004562677139449123, 'samples': 5815488, 'steps': 30288, 'loss/train': 1.485723853111267} 11/07/2021 01:31:45 - INFO - __main__ - Step 30290: {'lr': 0.0004562647154325246, 'samples': 5815680, 'steps': 30289, 'loss/train': 1.6737288236618042} 11/07/2021 01:31:45 - INFO - __main__ - Step 30291: {'lr': 0.0004562617168271971, 'samples': 5815872, 'steps': 30290, 'loss/train': 1.262340784072876} 11/07/2021 01:31:46 - INFO - __main__ - Step 30292: {'lr': 0.0004562587181289314, 'samples': 5816064, 'steps': 30291, 'loss/train': 1.1251622438430786} 11/07/2021 01:31:46 - INFO - __main__ - Step 30293: {'lr': 0.00045625571933772857, 'samples': 5816256, 'steps': 30292, 'loss/train': 1.7130277156829834} 11/07/2021 01:31:47 - INFO - __main__ - Step 30294: {'lr': 0.0004562527204535902, 'samples': 5816448, 'steps': 30293, 'loss/train': 1.5328103303909302} 11/07/2021 01:31:47 - INFO - __main__ - Step 30295: {'lr': 0.00045624972147651746, 'samples': 5816640, 'steps': 30294, 'loss/train': 1.6957600116729736} 11/07/2021 01:31:48 - INFO - __main__ - Step 30296: {'lr': 0.00045624672240651183, 'samples': 5816832, 'steps': 30295, 'loss/train': 1.4732813835144043} 11/07/2021 01:31:48 - INFO - __main__ - Step 30297: {'lr': 0.00045624372324357457, 'samples': 5817024, 'steps': 30296, 'loss/train': 1.6591161489486694} 11/07/2021 01:31:49 - INFO - __main__ - Step 30298: {'lr': 0.0004562407239877071, 'samples': 5817216, 'steps': 30297, 'loss/train': 1.7493314743041992} 11/07/2021 01:31:49 - INFO - __main__ - Step 30299: {'lr': 0.0004562377246389108, 'samples': 5817408, 'steps': 30298, 'loss/train': 1.5536787509918213} 11/07/2021 01:31:50 - INFO - __main__ - Step 30300: {'lr': 0.00045623472519718683, 'samples': 5817600, 'steps': 30299, 'loss/train': 1.7294906377792358} 11/07/2021 01:31:50 - INFO - __main__ - Step 30301: {'lr': 0.00045623172566253676, 'samples': 5817792, 'steps': 30300, 'loss/train': 1.6559518575668335} 11/07/2021 01:31:51 - INFO - __main__ - Step 30302: {'lr': 0.00045622872603496184, 'samples': 5817984, 'steps': 30301, 'loss/train': 1.169908881187439} 11/07/2021 01:31:51 - INFO - __main__ - Step 30303: {'lr': 0.0004562257263144635, 'samples': 5818176, 'steps': 30302, 'loss/train': 1.2968664169311523} 11/07/2021 01:31:51 - INFO - __main__ - Step 30304: {'lr': 0.0004562227265010429, 'samples': 5818368, 'steps': 30303, 'loss/train': 1.3897583484649658} 11/07/2021 01:31:52 - INFO - __main__ - Step 30305: {'lr': 0.00045621972659470156, 'samples': 5818560, 'steps': 30304, 'loss/train': 1.796301007270813} 11/07/2021 01:31:53 - INFO - __main__ - Step 30306: {'lr': 0.0004562167265954409, 'samples': 5818752, 'steps': 30305, 'loss/train': 1.7360395193099976} 11/07/2021 01:31:53 - INFO - __main__ - Step 30307: {'lr': 0.000456213726503262, 'samples': 5818944, 'steps': 30306, 'loss/train': 1.4913867712020874} 11/07/2021 01:31:53 - INFO - __main__ - Step 30308: {'lr': 0.0004562107263181665, 'samples': 5819136, 'steps': 30307, 'loss/train': 1.6164582967758179} 11/07/2021 01:31:54 - INFO - __main__ - Step 30309: {'lr': 0.0004562077260401556, 'samples': 5819328, 'steps': 30308, 'loss/train': 1.4213775396347046} 11/07/2021 01:31:54 - INFO - __main__ - Step 30310: {'lr': 0.00045620472566923064, 'samples': 5819520, 'steps': 30309, 'loss/train': 1.0018659830093384} 11/07/2021 01:31:55 - INFO - __main__ - Step 30311: {'lr': 0.0004562017252053931, 'samples': 5819712, 'steps': 30310, 'loss/train': 1.6342417001724243} 11/07/2021 01:31:56 - INFO - __main__ - Step 30312: {'lr': 0.0004561987246486442, 'samples': 5819904, 'steps': 30311, 'loss/train': 1.7264950275421143} 11/07/2021 01:31:56 - INFO - __main__ - Step 30313: {'lr': 0.00045619572399898534, 'samples': 5820096, 'steps': 30312, 'loss/train': 6.909998893737793} 11/07/2021 01:31:56 - INFO - __main__ - Step 30314: {'lr': 0.0004561927232564179, 'samples': 5820288, 'steps': 30313, 'loss/train': 2.314699172973633} 11/07/2021 01:31:57 - INFO - __main__ - Step 30315: {'lr': 0.00045618972242094313, 'samples': 5820480, 'steps': 30314, 'loss/train': 0.9875410199165344} 11/07/2021 01:31:58 - INFO - __main__ - Step 30316: {'lr': 0.00045618672149256244, 'samples': 5820672, 'steps': 30315, 'loss/train': 1.4033745527267456} 11/07/2021 01:31:58 - INFO - __main__ - Step 30317: {'lr': 0.0004561837204712773, 'samples': 5820864, 'steps': 30316, 'loss/train': 0.5098898410797119} 11/07/2021 01:31:59 - INFO - __main__ - Step 30318: {'lr': 0.0004561807193570888, 'samples': 5821056, 'steps': 30317, 'loss/train': 0.8450649976730347} 11/07/2021 01:31:59 - INFO - __main__ - Step 30319: {'lr': 0.0004561777181499986, 'samples': 5821248, 'steps': 30318, 'loss/train': 1.991714358329773} 11/07/2021 01:31:59 - INFO - __main__ - Step 30320: {'lr': 0.00045617471685000785, 'samples': 5821440, 'steps': 30319, 'loss/train': 0.922199547290802} 11/07/2021 01:32:00 - INFO - __main__ - Step 30321: {'lr': 0.00045617171545711793, 'samples': 5821632, 'steps': 30320, 'loss/train': 1.2312003374099731} 11/07/2021 01:32:01 - INFO - __main__ - Step 30322: {'lr': 0.0004561687139713302, 'samples': 5821824, 'steps': 30321, 'loss/train': 1.8956984281539917} 11/07/2021 01:32:01 - INFO - __main__ - Step 30323: {'lr': 0.00045616571239264614, 'samples': 5822016, 'steps': 30322, 'loss/train': 1.717467188835144} 11/07/2021 01:32:02 - INFO - __main__ - Step 30324: {'lr': 0.0004561627107210669, 'samples': 5822208, 'steps': 30323, 'loss/train': 1.844076156616211} 11/07/2021 01:32:02 - INFO - __main__ - Step 30325: {'lr': 0.00045615970895659393, 'samples': 5822400, 'steps': 30324, 'loss/train': 0.860467255115509} 11/07/2021 01:32:02 - INFO - __main__ - Step 30326: {'lr': 0.00045615670709922855, 'samples': 5822592, 'steps': 30325, 'loss/train': 2.927840232849121} 11/07/2021 01:32:03 - INFO - __main__ - Step 30327: {'lr': 0.0004561537051489722, 'samples': 5822784, 'steps': 30326, 'loss/train': 2.943389892578125} 11/07/2021 01:32:04 - INFO - __main__ - Step 30328: {'lr': 0.00045615070310582617, 'samples': 5822976, 'steps': 30327, 'loss/train': 1.597343921661377} 11/07/2021 01:32:04 - INFO - __main__ - Step 30329: {'lr': 0.00045614770096979177, 'samples': 5823168, 'steps': 30328, 'loss/train': 1.5728250741958618} 11/07/2021 01:32:05 - INFO - __main__ - Step 30330: {'lr': 0.0004561446987408704, 'samples': 5823360, 'steps': 30329, 'loss/train': 1.852549433708191} 11/07/2021 01:32:05 - INFO - __main__ - Step 30331: {'lr': 0.00045614169641906344, 'samples': 5823552, 'steps': 30330, 'loss/train': 1.5262786149978638} 11/07/2021 01:32:05 - INFO - __main__ - Step 30332: {'lr': 0.00045613869400437223, 'samples': 5823744, 'steps': 30331, 'loss/train': 1.6622519493103027} 11/07/2021 01:32:06 - INFO - __main__ - Step 30333: {'lr': 0.000456135691496798, 'samples': 5823936, 'steps': 30332, 'loss/train': 1.857367992401123} 11/07/2021 01:32:07 - INFO - __main__ - Step 30334: {'lr': 0.0004561326888963423, 'samples': 5824128, 'steps': 30333, 'loss/train': 1.3868197202682495} 11/07/2021 01:32:07 - INFO - __main__ - Step 30335: {'lr': 0.0004561296862030064, 'samples': 5824320, 'steps': 30334, 'loss/train': 1.6627488136291504} 11/07/2021 01:32:07 - INFO - __main__ - Step 30336: {'lr': 0.00045612668341679164, 'samples': 5824512, 'steps': 30335, 'loss/train': 1.669124722480774} 11/07/2021 01:32:08 - INFO - __main__ - Step 30337: {'lr': 0.0004561236805376994, 'samples': 5824704, 'steps': 30336, 'loss/train': 1.3865309953689575} 11/07/2021 01:32:09 - INFO - __main__ - Step 30338: {'lr': 0.00045612067756573097, 'samples': 5824896, 'steps': 30337, 'loss/train': 1.4449416399002075} 11/07/2021 01:32:09 - INFO - __main__ - Step 30339: {'lr': 0.0004561176745008877, 'samples': 5825088, 'steps': 30338, 'loss/train': 1.609216570854187} 11/07/2021 01:32:10 - INFO - __main__ - Step 30340: {'lr': 0.000456114671343171, 'samples': 5825280, 'steps': 30339, 'loss/train': 1.2884266376495361} 11/07/2021 01:32:10 - INFO - __main__ - Step 30341: {'lr': 0.00045611166809258227, 'samples': 5825472, 'steps': 30340, 'loss/train': 1.75202214717865} 11/07/2021 01:32:10 - INFO - __main__ - Step 30342: {'lr': 0.0004561086647491227, 'samples': 5825664, 'steps': 30341, 'loss/train': 1.4077476263046265} 11/07/2021 01:32:11 - INFO - __main__ - Step 30343: {'lr': 0.00045610566131279386, 'samples': 5825856, 'steps': 30342, 'loss/train': 1.4677209854125977} 11/07/2021 01:32:12 - INFO - __main__ - Step 30344: {'lr': 0.00045610265778359696, 'samples': 5826048, 'steps': 30343, 'loss/train': 0.9540789723396301} 11/07/2021 01:32:12 - INFO - __main__ - Step 30345: {'lr': 0.00045609965416153333, 'samples': 5826240, 'steps': 30344, 'loss/train': 1.9768776893615723} 11/07/2021 01:32:12 - INFO - __main__ - Step 30346: {'lr': 0.0004560966504466044, 'samples': 5826432, 'steps': 30345, 'loss/train': 1.844706416130066} 11/07/2021 01:32:13 - INFO - __main__ - Step 30347: {'lr': 0.00045609364663881153, 'samples': 5826624, 'steps': 30346, 'loss/train': 1.6334888935089111} 11/07/2021 01:32:14 - INFO - __main__ - Step 30348: {'lr': 0.000456090642738156, 'samples': 5826816, 'steps': 30347, 'loss/train': 1.814347505569458} 11/07/2021 01:32:14 - INFO - __main__ - Step 30349: {'lr': 0.00045608763874463925, 'samples': 5827008, 'steps': 30348, 'loss/train': 1.4767755270004272} 11/07/2021 01:32:14 - INFO - __main__ - Step 30350: {'lr': 0.00045608463465826257, 'samples': 5827200, 'steps': 30349, 'loss/train': 1.694061040878296} 11/07/2021 01:32:15 - INFO - __main__ - Step 30351: {'lr': 0.0004560816304790274, 'samples': 5827392, 'steps': 30350, 'loss/train': 1.9288541078567505} 11/07/2021 01:32:15 - INFO - __main__ - Step 30352: {'lr': 0.0004560786262069349, 'samples': 5827584, 'steps': 30351, 'loss/train': 1.5461041927337646} 11/07/2021 01:32:16 - INFO - __main__ - Step 30353: {'lr': 0.00045607562184198666, 'samples': 5827776, 'steps': 30352, 'loss/train': 1.8285548686981201} 11/07/2021 01:32:17 - INFO - __main__ - Step 30354: {'lr': 0.00045607261738418384, 'samples': 5827968, 'steps': 30353, 'loss/train': 1.6511898040771484} 11/07/2021 01:32:17 - INFO - __main__ - Step 30355: {'lr': 0.00045606961283352793, 'samples': 5828160, 'steps': 30354, 'loss/train': 1.591403603553772} 11/07/2021 01:32:17 - INFO - __main__ - Step 30356: {'lr': 0.0004560666081900202, 'samples': 5828352, 'steps': 30355, 'loss/train': 1.5361576080322266} 11/07/2021 01:32:18 - INFO - __main__ - Step 30357: {'lr': 0.00045606360345366203, 'samples': 5828544, 'steps': 30356, 'loss/train': 1.9898298978805542} 11/07/2021 01:32:18 - INFO - __main__ - Step 30358: {'lr': 0.00045606059862445485, 'samples': 5828736, 'steps': 30357, 'loss/train': 0.9508589506149292} 11/07/2021 01:32:20 - INFO - __main__ - Step 30359: {'lr': 0.0004560575937023999, 'samples': 5828928, 'steps': 30358, 'loss/train': 1.9072965383529663} 11/07/2021 01:32:20 - INFO - __main__ - Step 30360: {'lr': 0.0004560545886874986, 'samples': 5829120, 'steps': 30359, 'loss/train': 0.832706093788147} 11/07/2021 01:32:20 - INFO - __main__ - Step 30361: {'lr': 0.00045605158357975225, 'samples': 5829312, 'steps': 30360, 'loss/train': 1.1118340492248535} 11/07/2021 01:32:21 - INFO - __main__ - Step 30362: {'lr': 0.00045604857837916224, 'samples': 5829504, 'steps': 30361, 'loss/train': 1.3667658567428589} 11/07/2021 01:32:21 - INFO - __main__ - Step 30363: {'lr': 0.0004560455730857299, 'samples': 5829696, 'steps': 30362, 'loss/train': 2.0607755184173584} 11/07/2021 01:32:21 - INFO - __main__ - Step 30364: {'lr': 0.0004560425676994566, 'samples': 5829888, 'steps': 30363, 'loss/train': 1.497362732887268} 11/07/2021 01:32:22 - INFO - __main__ - Step 30365: {'lr': 0.00045603956222034384, 'samples': 5830080, 'steps': 30364, 'loss/train': 1.8845192193984985} 11/07/2021 01:32:23 - INFO - __main__ - Step 30366: {'lr': 0.0004560365566483927, 'samples': 5830272, 'steps': 30365, 'loss/train': 1.4809709787368774} 11/07/2021 01:32:23 - INFO - __main__ - Step 30367: {'lr': 0.00045603355098360466, 'samples': 5830464, 'steps': 30366, 'loss/train': 1.458250880241394} 11/07/2021 01:32:23 - INFO - __main__ - Step 30368: {'lr': 0.00045603054522598107, 'samples': 5830656, 'steps': 30367, 'loss/train': 1.8294758796691895} 11/07/2021 01:32:24 - INFO - __main__ - Step 30369: {'lr': 0.0004560275393755233, 'samples': 5830848, 'steps': 30368, 'loss/train': 1.6920772790908813} 11/07/2021 01:32:25 - INFO - __main__ - Step 30370: {'lr': 0.0004560245334322328, 'samples': 5831040, 'steps': 30369, 'loss/train': 1.7748409509658813} 11/07/2021 01:32:25 - INFO - __main__ - Step 30371: {'lr': 0.00045602152739611075, 'samples': 5831232, 'steps': 30370, 'loss/train': 1.7006174325942993} 11/07/2021 01:32:25 - INFO - __main__ - Step 30372: {'lr': 0.0004560185212671586, 'samples': 5831424, 'steps': 30371, 'loss/train': 1.10115647315979} 11/07/2021 01:32:26 - INFO - __main__ - Step 30373: {'lr': 0.00045601551504537765, 'samples': 5831616, 'steps': 30372, 'loss/train': 1.4324897527694702} 11/07/2021 01:32:26 - INFO - __main__ - Step 30374: {'lr': 0.0004560125087307693, 'samples': 5831808, 'steps': 30373, 'loss/train': 1.8265717029571533} 11/07/2021 01:32:27 - INFO - __main__ - Step 30375: {'lr': 0.00045600950232333495, 'samples': 5832000, 'steps': 30374, 'loss/train': 1.6013633012771606} 11/07/2021 01:32:28 - INFO - __main__ - Step 30376: {'lr': 0.00045600649582307586, 'samples': 5832192, 'steps': 30375, 'loss/train': 1.4706649780273438} 11/07/2021 01:32:28 - INFO - __main__ - Step 30377: {'lr': 0.00045600348922999334, 'samples': 5832384, 'steps': 30376, 'loss/train': 0.897438645362854} 11/07/2021 01:32:28 - INFO - __main__ - Step 30378: {'lr': 0.0004560004825440889, 'samples': 5832576, 'steps': 30377, 'loss/train': 1.6653149127960205} 11/07/2021 01:32:29 - INFO - __main__ - Step 30379: {'lr': 0.0004559974757653639, 'samples': 5832768, 'steps': 30378, 'loss/train': 1.7461222410202026} 11/07/2021 01:32:30 - INFO - __main__ - Step 30380: {'lr': 0.0004559944688938195, 'samples': 5832960, 'steps': 30379, 'loss/train': 1.4538600444793701} 11/07/2021 01:32:30 - INFO - __main__ - Step 30381: {'lr': 0.0004559914619294572, 'samples': 5833152, 'steps': 30380, 'loss/train': 1.8406343460083008} 11/07/2021 01:32:30 - INFO - __main__ - Step 30382: {'lr': 0.00045598845487227835, 'samples': 5833344, 'steps': 30381, 'loss/train': 2.0610239505767822} 11/07/2021 01:32:31 - INFO - __main__ - Step 30383: {'lr': 0.0004559854477222842, 'samples': 5833536, 'steps': 30382, 'loss/train': 1.8445994853973389} 11/07/2021 01:32:31 - INFO - __main__ - Step 30384: {'lr': 0.0004559824404794763, 'samples': 5833728, 'steps': 30383, 'loss/train': 1.59585702419281} 11/07/2021 01:32:32 - INFO - __main__ - Step 30385: {'lr': 0.0004559794331438558, 'samples': 5833920, 'steps': 30384, 'loss/train': 1.5763202905654907} 11/07/2021 01:32:32 - INFO - __main__ - Step 30386: {'lr': 0.0004559764257154242, 'samples': 5834112, 'steps': 30385, 'loss/train': 1.6152302026748657} 11/07/2021 01:32:33 - INFO - __main__ - Step 30387: {'lr': 0.0004559734181941828, 'samples': 5834304, 'steps': 30386, 'loss/train': 2.4096007347106934} 11/07/2021 01:32:33 - INFO - __main__ - Step 30388: {'lr': 0.0004559704105801329, 'samples': 5834496, 'steps': 30387, 'loss/train': 1.8948359489440918} 11/07/2021 01:32:34 - INFO - __main__ - Step 30389: {'lr': 0.00045596740287327597, 'samples': 5834688, 'steps': 30388, 'loss/train': 0.19201065599918365} 11/07/2021 01:32:34 - INFO - __main__ - Step 30390: {'lr': 0.0004559643950736133, 'samples': 5834880, 'steps': 30389, 'loss/train': 1.251489520072937} 11/07/2021 01:32:35 - INFO - __main__ - Step 30391: {'lr': 0.00045596138718114626, 'samples': 5835072, 'steps': 30390, 'loss/train': 1.5772721767425537} 11/07/2021 01:32:35 - INFO - __main__ - Step 30392: {'lr': 0.00045595837919587616, 'samples': 5835264, 'steps': 30391, 'loss/train': 1.3363888263702393} 11/07/2021 01:32:36 - INFO - __main__ - Step 30393: {'lr': 0.0004559553711178044, 'samples': 5835456, 'steps': 30392, 'loss/train': 0.6586275696754456} 11/07/2021 01:32:36 - INFO - __main__ - Step 30394: {'lr': 0.00045595236294693236, 'samples': 5835648, 'steps': 30393, 'loss/train': 1.7032047510147095} 11/07/2021 01:32:36 - INFO - __main__ - Step 30395: {'lr': 0.00045594935468326137, 'samples': 5835840, 'steps': 30394, 'loss/train': 1.174822211265564} 11/07/2021 01:32:37 - INFO - __main__ - Step 30396: {'lr': 0.00045594634632679275, 'samples': 5836032, 'steps': 30395, 'loss/train': 1.8963793516159058} 11/07/2021 01:32:38 - INFO - __main__ - Step 30397: {'lr': 0.0004559433378775278, 'samples': 5836224, 'steps': 30396, 'loss/train': 1.6081353425979614} 11/07/2021 01:32:38 - INFO - __main__ - Step 30398: {'lr': 0.00045594032933546813, 'samples': 5836416, 'steps': 30397, 'loss/train': 1.5449819564819336} 11/07/2021 01:32:38 - INFO - __main__ - Step 30399: {'lr': 0.00045593732070061484, 'samples': 5836608, 'steps': 30398, 'loss/train': 1.6387768983840942} 11/07/2021 01:32:39 - INFO - __main__ - Step 30400: {'lr': 0.00045593431197296934, 'samples': 5836800, 'steps': 30399, 'loss/train': 1.495215654373169} 11/07/2021 01:32:40 - INFO - __main__ - Step 30401: {'lr': 0.00045593130315253305, 'samples': 5836992, 'steps': 30400, 'loss/train': 1.1243253946304321} 11/07/2021 01:32:40 - INFO - __main__ - Step 30402: {'lr': 0.0004559282942393073, 'samples': 5837184, 'steps': 30401, 'loss/train': 2.1499686241149902} 11/07/2021 01:32:41 - INFO - __main__ - Step 30403: {'lr': 0.00045592528523329346, 'samples': 5837376, 'steps': 30402, 'loss/train': 1.4585944414138794} 11/07/2021 01:32:41 - INFO - __main__ - Step 30404: {'lr': 0.0004559222761344928, 'samples': 5837568, 'steps': 30403, 'loss/train': 1.3113209009170532} 11/07/2021 01:32:41 - INFO - __main__ - Step 30405: {'lr': 0.0004559192669429068, 'samples': 5837760, 'steps': 30404, 'loss/train': 1.6635924577713013} 11/07/2021 01:32:42 - INFO - __main__ - Step 30406: {'lr': 0.0004559162576585367, 'samples': 5837952, 'steps': 30405, 'loss/train': 1.4389300346374512} 11/07/2021 01:32:43 - INFO - __main__ - Step 30407: {'lr': 0.00045591324828138396, 'samples': 5838144, 'steps': 30406, 'loss/train': 2.16180157661438} 11/07/2021 01:32:43 - INFO - __main__ - Step 30408: {'lr': 0.0004559102388114499, 'samples': 5838336, 'steps': 30407, 'loss/train': 1.4051746129989624} 11/07/2021 01:32:43 - INFO - __main__ - Step 30409: {'lr': 0.00045590722924873585, 'samples': 5838528, 'steps': 30408, 'loss/train': 1.2232269048690796} 11/07/2021 01:32:44 - INFO - __main__ - Step 30410: {'lr': 0.00045590421959324314, 'samples': 5838720, 'steps': 30409, 'loss/train': 1.7550944089889526} 11/07/2021 01:32:44 - INFO - __main__ - Step 30411: {'lr': 0.0004559012098449732, 'samples': 5838912, 'steps': 30410, 'loss/train': 1.7170732021331787} 11/07/2021 01:32:45 - INFO - __main__ - Step 30412: {'lr': 0.00045589820000392736, 'samples': 5839104, 'steps': 30411, 'loss/train': 1.328830361366272} 11/07/2021 01:32:46 - INFO - __main__ - Step 30413: {'lr': 0.00045589519007010695, 'samples': 5839296, 'steps': 30412, 'loss/train': 1.4785791635513306} 11/07/2021 01:32:46 - INFO - __main__ - Step 30414: {'lr': 0.0004558921800435133, 'samples': 5839488, 'steps': 30413, 'loss/train': 1.152627944946289} 11/07/2021 01:32:46 - INFO - __main__ - Step 30415: {'lr': 0.00045588916992414784, 'samples': 5839680, 'steps': 30414, 'loss/train': 0.5767570734024048} 11/07/2021 01:32:47 - INFO - __main__ - Step 30416: {'lr': 0.0004558861597120119, 'samples': 5839872, 'steps': 30415, 'loss/train': 2.3084864616394043} 11/07/2021 01:32:48 - INFO - __main__ - Step 30417: {'lr': 0.00045588314940710683, 'samples': 5840064, 'steps': 30416, 'loss/train': 1.6227964162826538} 11/07/2021 01:32:48 - INFO - __main__ - Step 30418: {'lr': 0.00045588013900943404, 'samples': 5840256, 'steps': 30417, 'loss/train': 0.9265647530555725} 11/07/2021 01:32:48 - INFO - __main__ - Step 30419: {'lr': 0.0004558771285189948, 'samples': 5840448, 'steps': 30418, 'loss/train': 1.315839171409607} 11/07/2021 01:32:49 - INFO - __main__ - Step 30420: {'lr': 0.00045587411793579047, 'samples': 5840640, 'steps': 30419, 'loss/train': 1.4096364974975586} 11/07/2021 01:32:49 - INFO - __main__ - Step 30421: {'lr': 0.0004558711072598225, 'samples': 5840832, 'steps': 30420, 'loss/train': 1.443522572517395} 11/07/2021 01:32:50 - INFO - __main__ - Step 30422: {'lr': 0.0004558680964910922, 'samples': 5841024, 'steps': 30421, 'loss/train': 1.4016804695129395} 11/07/2021 01:32:50 - INFO - __main__ - Step 30423: {'lr': 0.0004558650856296008, 'samples': 5841216, 'steps': 30422, 'loss/train': 1.8984380960464478} 11/07/2021 01:32:51 - INFO - __main__ - Step 30424: {'lr': 0.0004558620746753499, 'samples': 5841408, 'steps': 30423, 'loss/train': 1.569122076034546} 11/07/2021 01:32:51 - INFO - __main__ - Step 30425: {'lr': 0.00045585906362834063, 'samples': 5841600, 'steps': 30424, 'loss/train': 1.7328251600265503} 11/07/2021 01:32:51 - INFO - __main__ - Step 30426: {'lr': 0.00045585605248857456, 'samples': 5841792, 'steps': 30425, 'loss/train': 1.608799695968628} 11/07/2021 01:32:52 - INFO - __main__ - Step 30427: {'lr': 0.00045585304125605276, 'samples': 5841984, 'steps': 30426, 'loss/train': 1.329828143119812} 11/07/2021 01:32:53 - INFO - __main__ - Step 30428: {'lr': 0.0004558500299307768, 'samples': 5842176, 'steps': 30427, 'loss/train': 1.7235779762268066} 11/07/2021 01:32:53 - INFO - __main__ - Step 30429: {'lr': 0.00045584701851274814, 'samples': 5842368, 'steps': 30428, 'loss/train': 1.8039371967315674} 11/07/2021 01:32:53 - INFO - __main__ - Step 30430: {'lr': 0.0004558440070019678, 'samples': 5842560, 'steps': 30429, 'loss/train': 0.8503919839859009} 11/07/2021 01:32:54 - INFO - __main__ - Step 30431: {'lr': 0.0004558409953984375, 'samples': 5842752, 'steps': 30430, 'loss/train': 1.530393362045288} 11/07/2021 01:32:55 - INFO - __main__ - Step 30432: {'lr': 0.00045583798370215837, 'samples': 5842944, 'steps': 30431, 'loss/train': 0.22017337381839752} 11/07/2021 01:32:55 - INFO - __main__ - Step 30433: {'lr': 0.00045583497191313175, 'samples': 5843136, 'steps': 30432, 'loss/train': 1.5525765419006348} 11/07/2021 01:32:56 - INFO - __main__ - Step 30434: {'lr': 0.00045583196003135906, 'samples': 5843328, 'steps': 30433, 'loss/train': 1.5046756267547607} 11/07/2021 01:32:56 - INFO - __main__ - Step 30435: {'lr': 0.0004558289480568417, 'samples': 5843520, 'steps': 30434, 'loss/train': 1.2534716129302979} 11/07/2021 01:32:56 - INFO - __main__ - Step 30436: {'lr': 0.00045582593598958107, 'samples': 5843712, 'steps': 30435, 'loss/train': 1.0785267353057861} 11/07/2021 01:32:57 - INFO - __main__ - Step 30437: {'lr': 0.00045582292382957836, 'samples': 5843904, 'steps': 30436, 'loss/train': 1.7116634845733643} 11/07/2021 01:32:58 - INFO - __main__ - Step 30438: {'lr': 0.000455819911576835, 'samples': 5844096, 'steps': 30437, 'loss/train': 1.413965106010437} 11/07/2021 01:32:58 - INFO - __main__ - Step 30439: {'lr': 0.00045581689923135247, 'samples': 5844288, 'steps': 30438, 'loss/train': 1.6645784378051758} 11/07/2021 01:32:58 - INFO - __main__ - Step 30440: {'lr': 0.00045581388679313194, 'samples': 5844480, 'steps': 30439, 'loss/train': 1.785849928855896} 11/07/2021 01:32:59 - INFO - __main__ - Step 30441: {'lr': 0.0004558108742621748, 'samples': 5844672, 'steps': 30440, 'loss/train': 0.9725473523139954} 11/07/2021 01:32:59 - INFO - __main__ - Step 30442: {'lr': 0.00045580786163848254, 'samples': 5844864, 'steps': 30441, 'loss/train': 1.6476956605911255} 11/07/2021 01:33:00 - INFO - __main__ - Step 30443: {'lr': 0.00045580484892205643, 'samples': 5845056, 'steps': 30442, 'loss/train': 1.4735348224639893} 11/07/2021 01:33:00 - INFO - __main__ - Step 30444: {'lr': 0.0004558018361128978, 'samples': 5845248, 'steps': 30443, 'loss/train': 1.8190598487854004} 11/07/2021 01:33:01 - INFO - __main__ - Step 30445: {'lr': 0.0004557988232110081, 'samples': 5845440, 'steps': 30444, 'loss/train': 1.864393711090088} 11/07/2021 01:33:01 - INFO - __main__ - Step 30446: {'lr': 0.00045579581021638855, 'samples': 5845632, 'steps': 30445, 'loss/train': 0.5898078083992004} 11/07/2021 01:33:01 - INFO - __main__ - Step 30447: {'lr': 0.00045579279712904057, 'samples': 5845824, 'steps': 30446, 'loss/train': 1.3282579183578491} 11/07/2021 01:33:02 - INFO - __main__ - Step 30448: {'lr': 0.00045578978394896565, 'samples': 5846016, 'steps': 30447, 'loss/train': 1.671467661857605} 11/07/2021 01:33:03 - INFO - __main__ - Step 30449: {'lr': 0.00045578677067616494, 'samples': 5846208, 'steps': 30448, 'loss/train': 1.5926649570465088} 11/07/2021 01:33:03 - INFO - __main__ - Step 30450: {'lr': 0.0004557837573106399, 'samples': 5846400, 'steps': 30449, 'loss/train': 1.6386229991912842} 11/07/2021 01:33:04 - INFO - __main__ - Step 30451: {'lr': 0.0004557807438523919, 'samples': 5846592, 'steps': 30450, 'loss/train': 1.629556655883789} 11/07/2021 01:33:04 - INFO - __main__ - Step 30452: {'lr': 0.00045577773030142224, 'samples': 5846784, 'steps': 30451, 'loss/train': 1.3279333114624023} 11/07/2021 01:33:05 - INFO - __main__ - Step 30453: {'lr': 0.0004557747166577323, 'samples': 5846976, 'steps': 30452, 'loss/train': 1.726426362991333} 11/07/2021 01:33:06 - INFO - __main__ - Step 30454: {'lr': 0.0004557717029213234, 'samples': 5847168, 'steps': 30453, 'loss/train': 1.1271165609359741} 11/07/2021 01:33:06 - INFO - __main__ - Step 30455: {'lr': 0.00045576868909219704, 'samples': 5847360, 'steps': 30454, 'loss/train': 1.4849731922149658} 11/07/2021 01:33:06 - INFO - __main__ - Step 30456: {'lr': 0.0004557656751703544, 'samples': 5847552, 'steps': 30455, 'loss/train': 1.2597217559814453} 11/07/2021 01:33:07 - INFO - __main__ - Step 30457: {'lr': 0.000455762661155797, 'samples': 5847744, 'steps': 30456, 'loss/train': 1.6121560335159302} 11/07/2021 01:33:08 - INFO - __main__ - Step 30458: {'lr': 0.0004557596470485261, 'samples': 5847936, 'steps': 30457, 'loss/train': 1.1347659826278687} 11/07/2021 01:33:08 - INFO - __main__ - Step 30459: {'lr': 0.0004557566328485431, 'samples': 5848128, 'steps': 30458, 'loss/train': 0.9545296430587769} 11/07/2021 01:33:08 - INFO - __main__ - Step 30460: {'lr': 0.00045575361855584927, 'samples': 5848320, 'steps': 30459, 'loss/train': 1.9373095035552979} 11/07/2021 01:33:09 - INFO - __main__ - Step 30461: {'lr': 0.00045575060417044614, 'samples': 5848512, 'steps': 30460, 'loss/train': 1.6543169021606445} 11/07/2021 01:33:09 - INFO - __main__ - Step 30462: {'lr': 0.0004557475896923349, 'samples': 5848704, 'steps': 30461, 'loss/train': 1.2315062284469604} 11/07/2021 01:33:10 - INFO - __main__ - Step 30463: {'lr': 0.0004557445751215169, 'samples': 5848896, 'steps': 30462, 'loss/train': 1.3893226385116577} 11/07/2021 01:33:10 - INFO - __main__ - Step 30464: {'lr': 0.00045574156045799367, 'samples': 5849088, 'steps': 30463, 'loss/train': 1.6300371885299683} 11/07/2021 01:33:11 - INFO - __main__ - Step 30465: {'lr': 0.0004557385457017664, 'samples': 5849280, 'steps': 30464, 'loss/train': 1.397165298461914} 11/07/2021 01:33:11 - INFO - __main__ - Step 30466: {'lr': 0.0004557355308528366, 'samples': 5849472, 'steps': 30465, 'loss/train': 1.7070040702819824} 11/07/2021 01:33:12 - INFO - __main__ - Step 30467: {'lr': 0.00045573251591120545, 'samples': 5849664, 'steps': 30466, 'loss/train': 0.650546133518219} 11/07/2021 01:33:12 - INFO - __main__ - Step 30468: {'lr': 0.00045572950087687447, 'samples': 5849856, 'steps': 30467, 'loss/train': 1.5740357637405396} 11/07/2021 01:33:13 - INFO - __main__ - Step 30469: {'lr': 0.0004557264857498449, 'samples': 5850048, 'steps': 30468, 'loss/train': 1.7624471187591553} 11/07/2021 01:33:13 - INFO - __main__ - Step 30470: {'lr': 0.0004557234705301182, 'samples': 5850240, 'steps': 30469, 'loss/train': 1.3435560464859009} 11/07/2021 01:33:14 - INFO - __main__ - Step 30471: {'lr': 0.0004557204552176957, 'samples': 5850432, 'steps': 30470, 'loss/train': 1.2548309564590454} 11/07/2021 01:33:14 - INFO - __main__ - Step 30472: {'lr': 0.0004557174398125786, 'samples': 5850624, 'steps': 30471, 'loss/train': 1.7051408290863037} 11/07/2021 01:33:14 - INFO - __main__ - Step 30473: {'lr': 0.00045571442431476856, 'samples': 5850816, 'steps': 30472, 'loss/train': 0.7577283382415771} 11/07/2021 01:33:15 - INFO - __main__ - Step 30474: {'lr': 0.0004557114087242667, 'samples': 5851008, 'steps': 30473, 'loss/train': 1.2576820850372314} 11/07/2021 01:33:16 - INFO - __main__ - Step 30475: {'lr': 0.0004557083930410745, 'samples': 5851200, 'steps': 30474, 'loss/train': 1.5838309526443481} 11/07/2021 01:33:16 - INFO - __main__ - Step 30476: {'lr': 0.0004557053772651932, 'samples': 5851392, 'steps': 30475, 'loss/train': 1.0980839729309082} 11/07/2021 01:33:16 - INFO - __main__ - Step 30477: {'lr': 0.00045570236139662426, 'samples': 5851584, 'steps': 30476, 'loss/train': 1.8236390352249146} 11/07/2021 01:33:17 - INFO - __main__ - Step 30478: {'lr': 0.000455699345435369, 'samples': 5851776, 'steps': 30477, 'loss/train': 1.789986252784729} 11/07/2021 01:33:18 - INFO - __main__ - Step 30479: {'lr': 0.0004556963293814288, 'samples': 5851968, 'steps': 30478, 'loss/train': 1.3823659420013428} 11/07/2021 01:33:18 - INFO - __main__ - Step 30480: {'lr': 0.000455693313234805, 'samples': 5852160, 'steps': 30479, 'loss/train': 1.5483280420303345} 11/07/2021 01:33:18 - INFO - __main__ - Step 30481: {'lr': 0.000455690296995499, 'samples': 5852352, 'steps': 30480, 'loss/train': 1.7918452024459839} 11/07/2021 01:33:19 - INFO - __main__ - Step 30482: {'lr': 0.00045568728066351205, 'samples': 5852544, 'steps': 30481, 'loss/train': 1.6299755573272705} 11/07/2021 01:33:19 - INFO - __main__ - Step 30483: {'lr': 0.0004556842642388457, 'samples': 5852736, 'steps': 30482, 'loss/train': 1.5032507181167603} 11/07/2021 01:33:20 - INFO - __main__ - Step 30484: {'lr': 0.0004556812477215011, 'samples': 5852928, 'steps': 30483, 'loss/train': 1.8862870931625366} 11/07/2021 01:33:21 - INFO - __main__ - Step 30485: {'lr': 0.0004556782311114798, 'samples': 5853120, 'steps': 30484, 'loss/train': 1.2660750150680542} 11/07/2021 01:33:21 - INFO - __main__ - Step 30486: {'lr': 0.00045567521440878294, 'samples': 5853312, 'steps': 30485, 'loss/train': 1.1814024448394775} 11/07/2021 01:33:22 - INFO - __main__ - Step 30487: {'lr': 0.000455672197613412, 'samples': 5853504, 'steps': 30486, 'loss/train': 1.257530689239502} 11/07/2021 01:33:22 - INFO - __main__ - Step 30488: {'lr': 0.00045566918072536844, 'samples': 5853696, 'steps': 30487, 'loss/train': 1.7170592546463013} 11/07/2021 01:33:23 - INFO - __main__ - Step 30489: {'lr': 0.00045566616374465355, 'samples': 5853888, 'steps': 30488, 'loss/train': 1.4508795738220215} 11/07/2021 01:33:24 - INFO - __main__ - Step 30490: {'lr': 0.0004556631466712686, 'samples': 5854080, 'steps': 30489, 'loss/train': 1.3915959596633911} 11/07/2021 01:33:24 - INFO - __main__ - Step 30491: {'lr': 0.00045566012950521497, 'samples': 5854272, 'steps': 30490, 'loss/train': 1.7133105993270874} 11/07/2021 01:33:24 - INFO - __main__ - Step 30492: {'lr': 0.0004556571122464941, 'samples': 5854464, 'steps': 30491, 'loss/train': 0.9542956948280334} 11/07/2021 01:33:25 - INFO - __main__ - Step 30493: {'lr': 0.0004556540948951073, 'samples': 5854656, 'steps': 30492, 'loss/train': 1.3986448049545288} 11/07/2021 01:33:25 - INFO - __main__ - Step 30494: {'lr': 0.00045565107745105594, 'samples': 5854848, 'steps': 30493, 'loss/train': 1.6358494758605957} 11/07/2021 01:33:26 - INFO - __main__ - Step 30495: {'lr': 0.00045564805991434135, 'samples': 5855040, 'steps': 30494, 'loss/train': 1.6779626607894897} 11/07/2021 01:33:26 - INFO - __main__ - Step 30496: {'lr': 0.00045564504228496494, 'samples': 5855232, 'steps': 30495, 'loss/train': 1.607884168624878} 11/07/2021 01:33:27 - INFO - __main__ - Step 30497: {'lr': 0.0004556420245629281, 'samples': 5855424, 'steps': 30496, 'loss/train': 1.7303855419158936} 11/07/2021 01:33:27 - INFO - __main__ - Step 30498: {'lr': 0.00045563900674823205, 'samples': 5855616, 'steps': 30497, 'loss/train': 1.3164902925491333} 11/07/2021 01:33:27 - INFO - __main__ - Step 30499: {'lr': 0.0004556359888408783, 'samples': 5855808, 'steps': 30498, 'loss/train': 1.3503849506378174} 11/07/2021 01:33:28 - INFO - __main__ - Step 30500: {'lr': 0.00045563297084086807, 'samples': 5856000, 'steps': 30499, 'loss/train': 1.9133379459381104} 11/07/2021 01:33:29 - INFO - __main__ - Step 30501: {'lr': 0.00045562995274820285, 'samples': 5856192, 'steps': 30500, 'loss/train': 1.8207396268844604} 11/07/2021 01:33:29 - INFO - __main__ - Step 30502: {'lr': 0.00045562693456288394, 'samples': 5856384, 'steps': 30501, 'loss/train': 1.9690886735916138} 11/07/2021 01:33:29 - INFO - __main__ - Step 30503: {'lr': 0.00045562391628491274, 'samples': 5856576, 'steps': 30502, 'loss/train': 1.3326011896133423} 11/07/2021 01:33:30 - INFO - __main__ - Step 30504: {'lr': 0.00045562089791429056, 'samples': 5856768, 'steps': 30503, 'loss/train': 1.9938852787017822} 11/07/2021 01:33:31 - INFO - __main__ - Step 30505: {'lr': 0.00045561787945101875, 'samples': 5856960, 'steps': 30504, 'loss/train': 1.5999820232391357} 11/07/2021 01:33:32 - INFO - __main__ - Step 30506: {'lr': 0.0004556148608950987, 'samples': 5857152, 'steps': 30505, 'loss/train': 1.660935401916504} 11/07/2021 01:33:32 - INFO - __main__ - Step 30507: {'lr': 0.0004556118422465319, 'samples': 5857344, 'steps': 30506, 'loss/train': 1.7182539701461792} 11/07/2021 01:33:32 - INFO - __main__ - Step 30508: {'lr': 0.00045560882350531936, 'samples': 5857536, 'steps': 30507, 'loss/train': 3.5188839435577393} 11/07/2021 01:33:33 - INFO - __main__ - Step 30509: {'lr': 0.00045560580467146275, 'samples': 5857728, 'steps': 30508, 'loss/train': 1.6736948490142822} 11/07/2021 01:33:33 - INFO - __main__ - Step 30510: {'lr': 0.00045560278574496334, 'samples': 5857920, 'steps': 30509, 'loss/train': 1.7277591228485107} 11/07/2021 01:33:34 - INFO - __main__ - Step 30511: {'lr': 0.0004555997667258225, 'samples': 5858112, 'steps': 30510, 'loss/train': 1.3974577188491821} 11/07/2021 01:33:34 - INFO - __main__ - Step 30512: {'lr': 0.0004555967476140416, 'samples': 5858304, 'steps': 30511, 'loss/train': 1.6479278802871704} 11/07/2021 01:33:35 - INFO - __main__ - Step 30513: {'lr': 0.00045559372840962186, 'samples': 5858496, 'steps': 30512, 'loss/train': 1.2236391305923462} 11/07/2021 01:33:35 - INFO - __main__ - Step 30514: {'lr': 0.00045559070911256486, 'samples': 5858688, 'steps': 30513, 'loss/train': 1.4619766473770142} 11/07/2021 01:33:35 - INFO - __main__ - Step 30515: {'lr': 0.00045558768972287183, 'samples': 5858880, 'steps': 30514, 'loss/train': 1.701179027557373} 11/07/2021 01:33:36 - INFO - __main__ - Step 30516: {'lr': 0.0004555846702405442, 'samples': 5859072, 'steps': 30515, 'loss/train': 1.1603152751922607} 11/07/2021 01:33:37 - INFO - __main__ - Step 30517: {'lr': 0.0004555816506655832, 'samples': 5859264, 'steps': 30516, 'loss/train': 1.6459802389144897} 11/07/2021 01:33:37 - INFO - __main__ - Step 30518: {'lr': 0.00045557863099799034, 'samples': 5859456, 'steps': 30517, 'loss/train': 1.5410176515579224} 11/07/2021 01:33:38 - INFO - __main__ - Step 30519: {'lr': 0.000455575611237767, 'samples': 5859648, 'steps': 30518, 'loss/train': 2.3247158527374268} 11/07/2021 01:33:38 - INFO - __main__ - Step 30520: {'lr': 0.00045557259138491435, 'samples': 5859840, 'steps': 30519, 'loss/train': 1.3978822231292725} 11/07/2021 01:33:38 - INFO - __main__ - Step 30521: {'lr': 0.0004555695714394339, 'samples': 5860032, 'steps': 30520, 'loss/train': 1.592763066291809} 11/07/2021 01:33:39 - INFO - __main__ - Step 30522: {'lr': 0.00045556655140132696, 'samples': 5860224, 'steps': 30521, 'loss/train': 1.1270097494125366} 11/07/2021 01:33:40 - INFO - __main__ - Step 30523: {'lr': 0.00045556353127059493, 'samples': 5860416, 'steps': 30522, 'loss/train': 1.5414197444915771} 11/07/2021 01:33:40 - INFO - __main__ - Step 30524: {'lr': 0.0004555605110472391, 'samples': 5860608, 'steps': 30523, 'loss/train': 0.3523915410041809} 11/07/2021 01:33:40 - INFO - __main__ - Step 30525: {'lr': 0.0004555574907312609, 'samples': 5860800, 'steps': 30524, 'loss/train': 1.533868670463562} 11/07/2021 01:33:41 - INFO - __main__ - Step 30526: {'lr': 0.00045555447032266167, 'samples': 5860992, 'steps': 30525, 'loss/train': 1.9231667518615723} 11/07/2021 01:33:42 - INFO - __main__ - Step 30527: {'lr': 0.0004555514498214428, 'samples': 5861184, 'steps': 30526, 'loss/train': 0.2879134714603424} 11/07/2021 01:33:42 - INFO - __main__ - Step 30528: {'lr': 0.0004555484292276055, 'samples': 5861376, 'steps': 30527, 'loss/train': 1.9613897800445557} 11/07/2021 01:33:42 - INFO - __main__ - Step 30529: {'lr': 0.0004555454085411514, 'samples': 5861568, 'steps': 30528, 'loss/train': 1.6907329559326172} 11/07/2021 01:33:43 - INFO - __main__ - Step 30530: {'lr': 0.0004555423877620817, 'samples': 5861760, 'steps': 30529, 'loss/train': 1.6974469423294067} 11/07/2021 01:33:43 - INFO - __main__ - Step 30531: {'lr': 0.00045553936689039765, 'samples': 5861952, 'steps': 30530, 'loss/train': 1.763445258140564} 11/07/2021 01:33:44 - INFO - __main__ - Step 30532: {'lr': 0.00045553634592610084, 'samples': 5862144, 'steps': 30531, 'loss/train': 1.4664535522460938} 11/07/2021 01:33:45 - INFO - __main__ - Step 30533: {'lr': 0.00045553332486919246, 'samples': 5862336, 'steps': 30532, 'loss/train': 1.4843565225601196} 11/07/2021 01:33:45 - INFO - __main__ - Step 30534: {'lr': 0.000455530303719674, 'samples': 5862528, 'steps': 30533, 'loss/train': 1.5967292785644531} 11/07/2021 01:33:45 - INFO - __main__ - Step 30535: {'lr': 0.00045552728247754673, 'samples': 5862720, 'steps': 30534, 'loss/train': 1.8246897459030151} 11/07/2021 01:33:46 - INFO - __main__ - Step 30536: {'lr': 0.000455524261142812, 'samples': 5862912, 'steps': 30535, 'loss/train': 5.765089988708496} 11/07/2021 01:33:46 - INFO - __main__ - Step 30537: {'lr': 0.00045552123971547123, 'samples': 5863104, 'steps': 30536, 'loss/train': 0.8816110491752625} 11/07/2021 01:33:47 - INFO - __main__ - Step 30538: {'lr': 0.00045551821819552575, 'samples': 5863296, 'steps': 30537, 'loss/train': 1.4403982162475586} 11/07/2021 01:33:47 - INFO - __main__ - Step 30539: {'lr': 0.0004555151965829769, 'samples': 5863488, 'steps': 30538, 'loss/train': 1.8970357179641724} 11/07/2021 01:33:48 - INFO - __main__ - Step 30540: {'lr': 0.0004555121748778261, 'samples': 5863680, 'steps': 30539, 'loss/train': 1.0221960544586182} 11/07/2021 01:33:48 - INFO - __main__ - Step 30541: {'lr': 0.0004555091530800748, 'samples': 5863872, 'steps': 30540, 'loss/train': 1.4456019401550293} 11/07/2021 01:33:49 - INFO - __main__ - Step 30542: {'lr': 0.0004555061311897241, 'samples': 5864064, 'steps': 30541, 'loss/train': 1.6158738136291504} 11/07/2021 01:33:50 - INFO - __main__ - Step 30543: {'lr': 0.0004555031092067756, 'samples': 5864256, 'steps': 30542, 'loss/train': 2.1358749866485596} 11/07/2021 01:33:50 - INFO - __main__ - Step 30544: {'lr': 0.00045550008713123047, 'samples': 5864448, 'steps': 30543, 'loss/train': 1.3832736015319824} 11/07/2021 01:33:50 - INFO - __main__ - Step 30545: {'lr': 0.00045549706496309027, 'samples': 5864640, 'steps': 30544, 'loss/train': 1.976824164390564} 11/07/2021 01:33:51 - INFO - __main__ - Step 30546: {'lr': 0.0004554940427023562, 'samples': 5864832, 'steps': 30545, 'loss/train': 1.4866725206375122} 11/07/2021 01:33:51 - INFO - __main__ - Step 30547: {'lr': 0.00045549102034902973, 'samples': 5865024, 'steps': 30546, 'loss/train': 1.2114545106887817} 11/07/2021 01:33:53 - INFO - __main__ - Step 30548: {'lr': 0.0004554879979031121, 'samples': 5865216, 'steps': 30547, 'loss/train': 1.4681395292282104} 11/07/2021 01:33:53 - INFO - __main__ - Step 30549: {'lr': 0.00045548497536460487, 'samples': 5865408, 'steps': 30548, 'loss/train': 1.0890514850616455} 11/07/2021 01:33:54 - INFO - __main__ - Step 30550: {'lr': 0.00045548195273350926, 'samples': 5865600, 'steps': 30549, 'loss/train': 1.9650791883468628} 11/07/2021 01:33:54 - INFO - __main__ - Step 30551: {'lr': 0.0004554789300098265, 'samples': 5865792, 'steps': 30550, 'loss/train': 1.7189000844955444} 11/07/2021 01:33:54 - INFO - __main__ - Step 30552: {'lr': 0.00045547590719355823, 'samples': 5865984, 'steps': 30551, 'loss/train': 1.40700364112854} 11/07/2021 01:33:55 - INFO - __main__ - Step 30553: {'lr': 0.00045547288428470574, 'samples': 5866176, 'steps': 30552, 'loss/train': 1.4973241090774536} 11/07/2021 01:33:55 - INFO - __main__ - Step 30554: {'lr': 0.0004554698612832703, 'samples': 5866368, 'steps': 30553, 'loss/train': 2.791121482849121} 11/07/2021 01:33:55 - INFO - __main__ - Step 30555: {'lr': 0.00045546683818925327, 'samples': 5866560, 'steps': 30554, 'loss/train': 2.858717918395996} 11/07/2021 01:33:56 - INFO - __main__ - Step 30556: {'lr': 0.000455463815002656, 'samples': 5866752, 'steps': 30555, 'loss/train': 2.730980634689331} 11/07/2021 01:33:57 - INFO - __main__ - Step 30557: {'lr': 0.00045546079172348, 'samples': 5866944, 'steps': 30556, 'loss/train': 1.6503729820251465} 11/07/2021 01:33:57 - INFO - __main__ - Step 30558: {'lr': 0.00045545776835172647, 'samples': 5867136, 'steps': 30557, 'loss/train': 1.6318061351776123} 11/07/2021 01:33:57 - INFO - __main__ - Step 30559: {'lr': 0.00045545474488739693, 'samples': 5867328, 'steps': 30558, 'loss/train': 1.0276840925216675} 11/07/2021 01:33:58 - INFO - __main__ - Step 30560: {'lr': 0.0004554517213304926, 'samples': 5867520, 'steps': 30559, 'loss/train': 1.5448262691497803} 11/07/2021 01:33:59 - INFO - __main__ - Step 30561: {'lr': 0.00045544869768101486, 'samples': 5867712, 'steps': 30560, 'loss/train': 1.3254529237747192} 11/07/2021 01:33:59 - INFO - __main__ - Step 30562: {'lr': 0.0004554456739389652, 'samples': 5867904, 'steps': 30561, 'loss/train': 2.473137855529785} 11/07/2021 01:34:00 - INFO - __main__ - Step 30563: {'lr': 0.00045544265010434484, 'samples': 5868096, 'steps': 30562, 'loss/train': 1.485152244567871} 11/07/2021 01:34:00 - INFO - __main__ - Step 30564: {'lr': 0.0004554396261771552, 'samples': 5868288, 'steps': 30563, 'loss/train': 1.870257019996643} 11/07/2021 01:34:00 - INFO - __main__ - Step 30565: {'lr': 0.00045543660215739755, 'samples': 5868480, 'steps': 30564, 'loss/train': 1.6671714782714844} 11/07/2021 01:34:02 - INFO - __main__ - Step 30566: {'lr': 0.00045543357804507344, 'samples': 5868672, 'steps': 30565, 'loss/train': 1.806578516960144} 11/07/2021 01:34:02 - INFO - __main__ - Step 30567: {'lr': 0.00045543055384018405, 'samples': 5868864, 'steps': 30566, 'loss/train': 1.5219746828079224} 11/07/2021 01:34:02 - INFO - __main__ - Step 30568: {'lr': 0.0004554275295427309, 'samples': 5869056, 'steps': 30567, 'loss/train': 1.6241127252578735} 11/07/2021 01:34:03 - INFO - __main__ - Step 30569: {'lr': 0.0004554245051527153, 'samples': 5869248, 'steps': 30568, 'loss/train': 4.016051292419434} 11/07/2021 01:34:03 - INFO - __main__ - Step 30570: {'lr': 0.0004554214806701384, 'samples': 5869440, 'steps': 30569, 'loss/train': 1.164947748184204} 11/07/2021 01:34:05 - INFO - __main__ - Step 30571: {'lr': 0.000455418456095002, 'samples': 5869632, 'steps': 30570, 'loss/train': 0.8092489838600159} 11/07/2021 01:34:05 - INFO - __main__ - Step 30572: {'lr': 0.000455415431427307, 'samples': 5869824, 'steps': 30571, 'loss/train': 1.6690469980239868} 11/07/2021 01:34:06 - INFO - __main__ - Step 30573: {'lr': 0.00045541240666705516, 'samples': 5870016, 'steps': 30572, 'loss/train': 2.228196144104004} 11/07/2021 01:34:06 - INFO - __main__ - Step 30574: {'lr': 0.0004554093818142475, 'samples': 5870208, 'steps': 30573, 'loss/train': 1.8303005695343018} 11/07/2021 01:34:06 - INFO - __main__ - Step 30575: {'lr': 0.0004554063568688857, 'samples': 5870400, 'steps': 30574, 'loss/train': 1.8389360904693604} 11/07/2021 01:34:07 - INFO - __main__ - Step 30576: {'lr': 0.0004554033318309708, 'samples': 5870592, 'steps': 30575, 'loss/train': 1.5906447172164917} 11/07/2021 01:34:07 - INFO - __main__ - Step 30577: {'lr': 0.00045540030670050447, 'samples': 5870784, 'steps': 30576, 'loss/train': 1.800966739654541} 11/07/2021 01:34:08 - INFO - __main__ - Step 30578: {'lr': 0.0004553972814774878, 'samples': 5870976, 'steps': 30577, 'loss/train': 1.8366080522537231} 11/07/2021 01:34:08 - INFO - __main__ - Step 30579: {'lr': 0.00045539425616192243, 'samples': 5871168, 'steps': 30578, 'loss/train': 1.4894955158233643} 11/07/2021 01:34:09 - INFO - __main__ - Step 30580: {'lr': 0.0004553912307538095, 'samples': 5871360, 'steps': 30579, 'loss/train': 1.6211987733840942} 11/07/2021 01:34:09 - INFO - __main__ - Step 30581: {'lr': 0.0004553882052531504, 'samples': 5871552, 'steps': 30580, 'loss/train': 1.6969722509384155} 11/07/2021 01:34:10 - INFO - __main__ - Step 30582: {'lr': 0.00045538517965994663, 'samples': 5871744, 'steps': 30581, 'loss/train': 1.7053320407867432} 11/07/2021 01:34:10 - INFO - __main__ - Step 30583: {'lr': 0.0004553821539741994, 'samples': 5871936, 'steps': 30582, 'loss/train': 1.3603167533874512} 11/07/2021 01:34:11 - INFO - __main__ - Step 30584: {'lr': 0.0004553791281959102, 'samples': 5872128, 'steps': 30583, 'loss/train': 1.4426591396331787} 11/07/2021 01:34:12 - INFO - __main__ - Step 30585: {'lr': 0.00045537610232508033, 'samples': 5872320, 'steps': 30584, 'loss/train': 1.5270507335662842} 11/07/2021 01:34:12 - INFO - __main__ - Step 30586: {'lr': 0.0004553730763617111, 'samples': 5872512, 'steps': 30585, 'loss/train': 1.8075188398361206} 11/07/2021 01:34:12 - INFO - __main__ - Step 30587: {'lr': 0.000455370050305804, 'samples': 5872704, 'steps': 30586, 'loss/train': 1.578984022140503} 11/07/2021 01:34:13 - INFO - __main__ - Step 30588: {'lr': 0.0004553670241573603, 'samples': 5872896, 'steps': 30587, 'loss/train': 1.416538119316101} 11/07/2021 01:34:13 - INFO - __main__ - Step 30589: {'lr': 0.00045536399791638133, 'samples': 5873088, 'steps': 30588, 'loss/train': 2.0954222679138184} 11/07/2021 01:34:14 - INFO - __main__ - Step 30590: {'lr': 0.0004553609715828686, 'samples': 5873280, 'steps': 30589, 'loss/train': 1.8816574811935425} 11/07/2021 01:34:14 - INFO - __main__ - Step 30591: {'lr': 0.00045535794515682334, 'samples': 5873472, 'steps': 30590, 'loss/train': 1.5985524654388428} 11/07/2021 01:34:15 - INFO - __main__ - Step 30592: {'lr': 0.00045535491863824695, 'samples': 5873664, 'steps': 30591, 'loss/train': 1.902991771697998} 11/07/2021 01:34:15 - INFO - __main__ - Step 30593: {'lr': 0.0004553518920271408, 'samples': 5873856, 'steps': 30592, 'loss/train': 1.2298734188079834} 11/07/2021 01:34:15 - INFO - __main__ - Step 30594: {'lr': 0.00045534886532350627, 'samples': 5874048, 'steps': 30593, 'loss/train': 1.693086862564087} 11/07/2021 01:34:16 - INFO - __main__ - Step 30595: {'lr': 0.00045534583852734474, 'samples': 5874240, 'steps': 30594, 'loss/train': 1.6848118305206299} 11/07/2021 01:34:17 - INFO - __main__ - Step 30596: {'lr': 0.00045534281163865756, 'samples': 5874432, 'steps': 30595, 'loss/train': 2.0250253677368164} 11/07/2021 01:34:17 - INFO - __main__ - Step 30597: {'lr': 0.000455339784657446, 'samples': 5874624, 'steps': 30596, 'loss/train': 1.2345815896987915} 11/07/2021 01:34:17 - INFO - __main__ - Step 30598: {'lr': 0.0004553367575837115, 'samples': 5874816, 'steps': 30597, 'loss/train': 1.6201162338256836} 11/07/2021 01:34:18 - INFO - __main__ - Step 30599: {'lr': 0.00045533373041745545, 'samples': 5875008, 'steps': 30598, 'loss/train': 1.4778693914413452} 11/07/2021 01:34:19 - INFO - __main__ - Step 30600: {'lr': 0.00045533070315867917, 'samples': 5875200, 'steps': 30599, 'loss/train': 1.1637945175170898} 11/07/2021 01:34:19 - INFO - __main__ - Step 30601: {'lr': 0.0004553276758073841, 'samples': 5875392, 'steps': 30600, 'loss/train': 0.970828115940094} 11/07/2021 01:34:20 - INFO - __main__ - Step 30602: {'lr': 0.00045532464836357155, 'samples': 5875584, 'steps': 30601, 'loss/train': 1.6925225257873535} 11/07/2021 01:34:20 - INFO - __main__ - Step 30603: {'lr': 0.0004553216208272428, 'samples': 5875776, 'steps': 30602, 'loss/train': 1.729225993156433} 11/07/2021 01:34:20 - INFO - __main__ - Step 30604: {'lr': 0.0004553185931983994, 'samples': 5875968, 'steps': 30603, 'loss/train': 1.6972366571426392} 11/07/2021 01:34:21 - INFO - __main__ - Step 30605: {'lr': 0.00045531556547704255, 'samples': 5876160, 'steps': 30604, 'loss/train': 1.6841012239456177} 11/07/2021 01:34:22 - INFO - __main__ - Step 30606: {'lr': 0.00045531253766317373, 'samples': 5876352, 'steps': 30605, 'loss/train': 1.5396567583084106} 11/07/2021 01:34:22 - INFO - __main__ - Step 30607: {'lr': 0.0004553095097567942, 'samples': 5876544, 'steps': 30606, 'loss/train': 1.3511580228805542} 11/07/2021 01:34:22 - INFO - __main__ - Step 30608: {'lr': 0.0004553064817579053, 'samples': 5876736, 'steps': 30607, 'loss/train': 1.6638671159744263} 11/07/2021 01:34:23 - INFO - __main__ - Step 30609: {'lr': 0.0004553034536665086, 'samples': 5876928, 'steps': 30608, 'loss/train': 1.478841781616211} 11/07/2021 01:34:24 - INFO - __main__ - Step 30610: {'lr': 0.0004553004254826053, 'samples': 5877120, 'steps': 30609, 'loss/train': 1.5827250480651855} 11/07/2021 01:34:24 - INFO - __main__ - Step 30611: {'lr': 0.0004552973972061967, 'samples': 5877312, 'steps': 30610, 'loss/train': 2.2319352626800537} 11/07/2021 01:34:24 - INFO - __main__ - Step 30612: {'lr': 0.00045529436883728436, 'samples': 5877504, 'steps': 30611, 'loss/train': 1.6994431018829346} 11/07/2021 01:34:25 - INFO - __main__ - Step 30613: {'lr': 0.0004552913403758695, 'samples': 5877696, 'steps': 30612, 'loss/train': 1.6245088577270508} 11/07/2021 01:34:25 - INFO - __main__ - Step 30614: {'lr': 0.00045528831182195355, 'samples': 5877888, 'steps': 30613, 'loss/train': 1.5530428886413574} 11/07/2021 01:34:26 - INFO - __main__ - Step 30615: {'lr': 0.00045528528317553786, 'samples': 5878080, 'steps': 30614, 'loss/train': 1.5801184177398682} 11/07/2021 01:34:27 - INFO - __main__ - Step 30616: {'lr': 0.0004552822544366238, 'samples': 5878272, 'steps': 30615, 'loss/train': 1.069941759109497} 11/07/2021 01:34:27 - INFO - __main__ - Step 30617: {'lr': 0.00045527922560521274, 'samples': 5878464, 'steps': 30616, 'loss/train': 1.248826265335083} 11/07/2021 01:34:27 - INFO - __main__ - Step 30618: {'lr': 0.0004552761966813059, 'samples': 5878656, 'steps': 30617, 'loss/train': 1.8219928741455078} 11/07/2021 01:34:28 - INFO - __main__ - Step 30619: {'lr': 0.00045527316766490487, 'samples': 5878848, 'steps': 30618, 'loss/train': 1.383254051208496} 11/07/2021 01:34:28 - INFO - __main__ - Step 30620: {'lr': 0.000455270138556011, 'samples': 5879040, 'steps': 30619, 'loss/train': 1.7578457593917847} 11/07/2021 01:34:29 - INFO - __main__ - Step 30621: {'lr': 0.00045526710935462543, 'samples': 5879232, 'steps': 30620, 'loss/train': 1.6432160139083862} 11/07/2021 01:34:29 - INFO - __main__ - Step 30622: {'lr': 0.00045526408006074973, 'samples': 5879424, 'steps': 30621, 'loss/train': 1.3899623155593872} 11/07/2021 01:34:30 - INFO - __main__ - Step 30623: {'lr': 0.00045526105067438525, 'samples': 5879616, 'steps': 30622, 'loss/train': 1.098164439201355} 11/07/2021 01:34:30 - INFO - __main__ - Step 30624: {'lr': 0.00045525802119553323, 'samples': 5879808, 'steps': 30623, 'loss/train': 1.6256321668624878} 11/07/2021 01:34:30 - INFO - __main__ - Step 30625: {'lr': 0.0004552549916241951, 'samples': 5880000, 'steps': 30624, 'loss/train': 1.5688073635101318} 11/07/2021 01:34:32 - INFO - __main__ - Step 30626: {'lr': 0.0004552519619603723, 'samples': 5880192, 'steps': 30625, 'loss/train': 0.8748301863670349} 11/07/2021 01:34:32 - INFO - __main__ - Step 30627: {'lr': 0.00045524893220406617, 'samples': 5880384, 'steps': 30626, 'loss/train': 1.403340220451355} 11/07/2021 01:34:32 - INFO - __main__ - Step 30628: {'lr': 0.00045524590235527796, 'samples': 5880576, 'steps': 30627, 'loss/train': 1.6631159782409668} 11/07/2021 01:34:33 - INFO - __main__ - Step 30629: {'lr': 0.0004552428724140091, 'samples': 5880768, 'steps': 30628, 'loss/train': 0.9583744406700134} 11/07/2021 01:34:33 - INFO - __main__ - Step 30630: {'lr': 0.000455239842380261, 'samples': 5880960, 'steps': 30629, 'loss/train': 0.6214451789855957} 11/07/2021 01:34:35 - INFO - __main__ - Step 30631: {'lr': 0.000455236812254035, 'samples': 5881152, 'steps': 30630, 'loss/train': 1.3606207370758057} 11/07/2021 01:34:36 - INFO - __main__ - Step 30632: {'lr': 0.0004552337820353325, 'samples': 5881344, 'steps': 30631, 'loss/train': 1.9707520008087158} 11/07/2021 01:34:36 - INFO - __main__ - Step 30633: {'lr': 0.00045523075172415476, 'samples': 5881536, 'steps': 30632, 'loss/train': 1.6842832565307617} 11/07/2021 01:34:36 - INFO - __main__ - Step 30634: {'lr': 0.0004552277213205032, 'samples': 5881728, 'steps': 30633, 'loss/train': 1.5775192975997925} 11/07/2021 01:34:37 - INFO - __main__ - Step 30635: {'lr': 0.0004552246908243792, 'samples': 5881920, 'steps': 30634, 'loss/train': 1.81877601146698} 11/07/2021 01:34:37 - INFO - __main__ - Step 30636: {'lr': 0.00045522166023578413, 'samples': 5882112, 'steps': 30635, 'loss/train': 1.7876152992248535} 11/07/2021 01:34:37 - INFO - __main__ - Step 30637: {'lr': 0.0004552186295547194, 'samples': 5882304, 'steps': 30636, 'loss/train': 1.7799782752990723} 11/07/2021 01:34:38 - INFO - __main__ - Step 30638: {'lr': 0.0004552155987811863, 'samples': 5882496, 'steps': 30637, 'loss/train': 1.8900376558303833} 11/07/2021 01:34:39 - INFO - __main__ - Step 30639: {'lr': 0.00045521256791518616, 'samples': 5882688, 'steps': 30638, 'loss/train': 1.7110861539840698} 11/07/2021 01:34:39 - INFO - __main__ - Step 30640: {'lr': 0.0004552095369567205, 'samples': 5882880, 'steps': 30639, 'loss/train': 1.56343674659729} 11/07/2021 01:34:39 - INFO - __main__ - Step 30641: {'lr': 0.00045520650590579056, 'samples': 5883072, 'steps': 30640, 'loss/train': 1.666163444519043} 11/07/2021 01:34:40 - INFO - __main__ - Step 30642: {'lr': 0.00045520347476239763, 'samples': 5883264, 'steps': 30641, 'loss/train': 1.7846308946609497} 11/07/2021 01:34:40 - INFO - __main__ - Step 30643: {'lr': 0.00045520044352654335, 'samples': 5883456, 'steps': 30642, 'loss/train': 1.5713022947311401} 11/07/2021 01:34:41 - INFO - __main__ - Step 30644: {'lr': 0.0004551974121982288, 'samples': 5883648, 'steps': 30643, 'loss/train': 1.5387201309204102} 11/07/2021 01:34:42 - INFO - __main__ - Step 30645: {'lr': 0.00045519438077745543, 'samples': 5883840, 'steps': 30644, 'loss/train': 1.3501056432724} 11/07/2021 01:34:42 - INFO - __main__ - Step 30646: {'lr': 0.0004551913492642248, 'samples': 5884032, 'steps': 30645, 'loss/train': 1.4950666427612305} 11/07/2021 01:34:42 - INFO - __main__ - Step 30647: {'lr': 0.00045518831765853796, 'samples': 5884224, 'steps': 30646, 'loss/train': 1.1170263290405273} 11/07/2021 01:34:43 - INFO - __main__ - Step 30648: {'lr': 0.0004551852859603965, 'samples': 5884416, 'steps': 30647, 'loss/train': 1.549283742904663} 11/07/2021 01:34:43 - INFO - __main__ - Step 30649: {'lr': 0.0004551822541698017, 'samples': 5884608, 'steps': 30648, 'loss/train': 1.6494215726852417} 11/07/2021 01:34:44 - INFO - __main__ - Step 30650: {'lr': 0.0004551792222867549, 'samples': 5884800, 'steps': 30649, 'loss/train': 1.7705411911010742} 11/07/2021 01:34:44 - INFO - __main__ - Step 30651: {'lr': 0.0004551761903112576, 'samples': 5884992, 'steps': 30650, 'loss/train': 1.90091073513031} 11/07/2021 01:34:45 - INFO - __main__ - Step 30652: {'lr': 0.000455173158243311, 'samples': 5885184, 'steps': 30651, 'loss/train': 1.9132969379425049} 11/07/2021 01:34:45 - INFO - __main__ - Step 30653: {'lr': 0.0004551701260829166, 'samples': 5885376, 'steps': 30652, 'loss/train': 1.7156943082809448} 11/07/2021 01:34:46 - INFO - __main__ - Step 30654: {'lr': 0.00045516709383007563, 'samples': 5885568, 'steps': 30653, 'loss/train': 1.2612545490264893} 11/07/2021 01:34:46 - INFO - __main__ - Step 30655: {'lr': 0.0004551640614847896, 'samples': 5885760, 'steps': 30654, 'loss/train': 0.7886247038841248} 11/07/2021 01:34:47 - INFO - __main__ - Step 30656: {'lr': 0.00045516102904705983, 'samples': 5885952, 'steps': 30655, 'loss/train': 1.3656479120254517} 11/07/2021 01:34:47 - INFO - __main__ - Step 30657: {'lr': 0.0004551579965168876, 'samples': 5886144, 'steps': 30656, 'loss/train': 1.431735634803772} 11/07/2021 01:34:47 - INFO - __main__ - Step 30658: {'lr': 0.00045515496389427433, 'samples': 5886336, 'steps': 30657, 'loss/train': 1.4148099422454834} 11/07/2021 01:34:49 - INFO - __main__ - Step 30659: {'lr': 0.0004551519311792215, 'samples': 5886528, 'steps': 30658, 'loss/train': 1.6572784185409546} 11/07/2021 01:34:49 - INFO - __main__ - Step 30660: {'lr': 0.00045514889837173025, 'samples': 5886720, 'steps': 30659, 'loss/train': 1.0424095392227173} 11/07/2021 01:34:49 - INFO - __main__ - Step 30661: {'lr': 0.00045514586547180214, 'samples': 5886912, 'steps': 30660, 'loss/train': 1.1317081451416016} 11/07/2021 01:34:50 - INFO - __main__ - Step 30662: {'lr': 0.0004551428324794385, 'samples': 5887104, 'steps': 30661, 'loss/train': 1.580672025680542} 11/07/2021 01:34:50 - INFO - __main__ - Step 30663: {'lr': 0.00045513979939464056, 'samples': 5887296, 'steps': 30662, 'loss/train': 1.5205079317092896} 11/07/2021 01:34:51 - INFO - __main__ - Step 30664: {'lr': 0.0004551367662174099, 'samples': 5887488, 'steps': 30663, 'loss/train': 1.219908595085144} 11/07/2021 01:34:52 - INFO - __main__ - Step 30665: {'lr': 0.0004551337329477477, 'samples': 5887680, 'steps': 30664, 'loss/train': 1.2398808002471924} 11/07/2021 01:34:52 - INFO - __main__ - Step 30666: {'lr': 0.00045513069958565545, 'samples': 5887872, 'steps': 30665, 'loss/train': 1.4924900531768799} 11/07/2021 01:34:52 - INFO - __main__ - Step 30667: {'lr': 0.00045512766613113457, 'samples': 5888064, 'steps': 30666, 'loss/train': 1.8135350942611694} 11/07/2021 01:34:53 - INFO - __main__ - Step 30668: {'lr': 0.00045512463258418615, 'samples': 5888256, 'steps': 30667, 'loss/train': 1.542351245880127} 11/07/2021 01:34:53 - INFO - __main__ - Step 30669: {'lr': 0.00045512159894481183, 'samples': 5888448, 'steps': 30668, 'loss/train': 1.554823637008667} 11/07/2021 01:34:55 - INFO - __main__ - Step 30670: {'lr': 0.00045511856521301286, 'samples': 5888640, 'steps': 30669, 'loss/train': 1.5602335929870605} 11/07/2021 01:34:55 - INFO - __main__ - Step 30671: {'lr': 0.0004551155313887906, 'samples': 5888832, 'steps': 30670, 'loss/train': 1.5313934087753296} 11/07/2021 01:34:55 - INFO - __main__ - Step 30672: {'lr': 0.0004551124974721465, 'samples': 5889024, 'steps': 30671, 'loss/train': 1.4817216396331787} 11/07/2021 01:34:56 - INFO - __main__ - Step 30673: {'lr': 0.00045510946346308186, 'samples': 5889216, 'steps': 30672, 'loss/train': 1.8890703916549683} 11/07/2021 01:34:56 - INFO - __main__ - Step 30674: {'lr': 0.0004551064293615981, 'samples': 5889408, 'steps': 30673, 'loss/train': 1.8428637981414795} 11/07/2021 01:34:57 - INFO - __main__ - Step 30675: {'lr': 0.00045510339516769647, 'samples': 5889600, 'steps': 30674, 'loss/train': 1.5842695236206055} 11/07/2021 01:34:57 - INFO - __main__ - Step 30676: {'lr': 0.0004551003608813784, 'samples': 5889792, 'steps': 30675, 'loss/train': 1.331461787223816} 11/07/2021 01:34:58 - INFO - __main__ - Step 30677: {'lr': 0.00045509732650264535, 'samples': 5889984, 'steps': 30676, 'loss/train': 1.6517804861068726} 11/07/2021 01:34:58 - INFO - __main__ - Step 30678: {'lr': 0.00045509429203149856, 'samples': 5890176, 'steps': 30677, 'loss/train': 1.6982057094573975} 11/07/2021 01:34:59 - INFO - __main__ - Step 30679: {'lr': 0.00045509125746793946, 'samples': 5890368, 'steps': 30678, 'loss/train': 1.3411773443222046} 11/07/2021 01:34:59 - INFO - __main__ - Step 30680: {'lr': 0.00045508822281196937, 'samples': 5890560, 'steps': 30679, 'loss/train': 1.4626752138137817} 11/07/2021 01:34:59 - INFO - __main__ - Step 30681: {'lr': 0.0004550851880635898, 'samples': 5890752, 'steps': 30680, 'loss/train': 1.4452056884765625} 11/07/2021 01:35:00 - INFO - __main__ - Step 30682: {'lr': 0.0004550821532228019, 'samples': 5890944, 'steps': 30681, 'loss/train': 1.9559125900268555} 11/07/2021 01:35:01 - INFO - __main__ - Step 30683: {'lr': 0.00045507911828960717, 'samples': 5891136, 'steps': 30682, 'loss/train': 1.2644442319869995} 11/07/2021 01:35:01 - INFO - __main__ - Step 30684: {'lr': 0.000455076083264007, 'samples': 5891328, 'steps': 30683, 'loss/train': 1.6457964181900024} 11/07/2021 01:35:01 - INFO - __main__ - Step 30685: {'lr': 0.0004550730481460027, 'samples': 5891520, 'steps': 30684, 'loss/train': 1.6096035242080688} 11/07/2021 01:35:02 - INFO - __main__ - Step 30686: {'lr': 0.0004550700129355956, 'samples': 5891712, 'steps': 30685, 'loss/train': 1.6629770994186401} 11/07/2021 01:35:03 - INFO - __main__ - Step 30687: {'lr': 0.0004550669776327871, 'samples': 5891904, 'steps': 30686, 'loss/train': 1.6891008615493774} 11/07/2021 01:35:03 - INFO - __main__ - Step 30688: {'lr': 0.00045506394223757867, 'samples': 5892096, 'steps': 30687, 'loss/train': 1.3316161632537842} 11/07/2021 01:35:03 - INFO - __main__ - Step 30689: {'lr': 0.00045506090674997157, 'samples': 5892288, 'steps': 30688, 'loss/train': 1.6416163444519043} 11/07/2021 01:35:04 - INFO - __main__ - Step 30690: {'lr': 0.00045505787116996714, 'samples': 5892480, 'steps': 30689, 'loss/train': 1.483043909072876} 11/07/2021 01:35:04 - INFO - __main__ - Step 30691: {'lr': 0.0004550548354975669, 'samples': 5892672, 'steps': 30690, 'loss/train': 1.6424208879470825} 11/07/2021 01:35:05 - INFO - __main__ - Step 30692: {'lr': 0.000455051799732772, 'samples': 5892864, 'steps': 30691, 'loss/train': 1.5616493225097656} 11/07/2021 01:35:05 - INFO - __main__ - Step 30693: {'lr': 0.000455048763875584, 'samples': 5893056, 'steps': 30692, 'loss/train': 1.2560726404190063} 11/07/2021 01:35:06 - INFO - __main__ - Step 30694: {'lr': 0.00045504572792600415, 'samples': 5893248, 'steps': 30693, 'loss/train': 1.7685329914093018} 11/07/2021 01:35:06 - INFO - __main__ - Step 30695: {'lr': 0.00045504269188403386, 'samples': 5893440, 'steps': 30694, 'loss/train': 1.0254260301589966} 11/07/2021 01:35:07 - INFO - __main__ - Step 30696: {'lr': 0.00045503965574967447, 'samples': 5893632, 'steps': 30695, 'loss/train': 1.7200754880905151} 11/07/2021 01:35:08 - INFO - __main__ - Step 30697: {'lr': 0.0004550366195229274, 'samples': 5893824, 'steps': 30696, 'loss/train': 1.283591628074646} 11/07/2021 01:35:08 - INFO - __main__ - Step 30698: {'lr': 0.00045503358320379405, 'samples': 5894016, 'steps': 30697, 'loss/train': 2.061112403869629} 11/07/2021 01:35:08 - INFO - __main__ - Step 30699: {'lr': 0.00045503054679227567, 'samples': 5894208, 'steps': 30698, 'loss/train': 0.9666875004768372} 11/07/2021 01:35:09 - INFO - __main__ - Step 30700: {'lr': 0.00045502751028837367, 'samples': 5894400, 'steps': 30699, 'loss/train': 1.7538293600082397} 11/07/2021 01:35:09 - INFO - __main__ - Step 30701: {'lr': 0.00045502447369208957, 'samples': 5894592, 'steps': 30700, 'loss/train': 0.1268743872642517} 11/07/2021 01:35:10 - INFO - __main__ - Step 30702: {'lr': 0.00045502143700342445, 'samples': 5894784, 'steps': 30701, 'loss/train': 1.3488339185714722} 11/07/2021 01:35:10 - INFO - __main__ - Step 30703: {'lr': 0.0004550184002223799, 'samples': 5894976, 'steps': 30702, 'loss/train': 1.573800802230835} 11/07/2021 01:35:11 - INFO - __main__ - Step 30704: {'lr': 0.0004550153633489572, 'samples': 5895168, 'steps': 30703, 'loss/train': 1.883345127105713} 11/07/2021 01:35:11 - INFO - __main__ - Step 30705: {'lr': 0.0004550123263831578, 'samples': 5895360, 'steps': 30704, 'loss/train': 2.1430163383483887} 11/07/2021 01:35:11 - INFO - __main__ - Step 30706: {'lr': 0.0004550092893249829, 'samples': 5895552, 'steps': 30705, 'loss/train': 1.7087527513504028} 11/07/2021 01:35:12 - INFO - __main__ - Step 30707: {'lr': 0.00045500625217443404, 'samples': 5895744, 'steps': 30706, 'loss/train': 1.6913039684295654} 11/07/2021 01:35:13 - INFO - __main__ - Step 30708: {'lr': 0.0004550032149315125, 'samples': 5895936, 'steps': 30707, 'loss/train': 1.7982276678085327} 11/07/2021 01:35:13 - INFO - __main__ - Step 30709: {'lr': 0.00045500017759621974, 'samples': 5896128, 'steps': 30708, 'loss/train': 1.4455924034118652} 11/07/2021 01:35:13 - INFO - __main__ - Step 30710: {'lr': 0.00045499714016855705, 'samples': 5896320, 'steps': 30709, 'loss/train': 1.656386137008667} 11/07/2021 01:35:14 - INFO - __main__ - Step 30711: {'lr': 0.0004549941026485258, 'samples': 5896512, 'steps': 30710, 'loss/train': 1.3936392068862915} 11/07/2021 01:35:14 - INFO - __main__ - Step 30712: {'lr': 0.00045499106503612733, 'samples': 5896704, 'steps': 30711, 'loss/train': 1.678410530090332} 11/07/2021 01:35:15 - INFO - __main__ - Step 30713: {'lr': 0.00045498802733136306, 'samples': 5896896, 'steps': 30712, 'loss/train': 1.5728613138198853} 11/07/2021 01:35:16 - INFO - __main__ - Step 30714: {'lr': 0.0004549849895342344, 'samples': 5897088, 'steps': 30713, 'loss/train': 1.51631760597229} 11/07/2021 01:35:16 - INFO - __main__ - Step 30715: {'lr': 0.00045498195164474264, 'samples': 5897280, 'steps': 30714, 'loss/train': 1.4547069072723389} 11/07/2021 01:35:16 - INFO - __main__ - Step 30716: {'lr': 0.00045497891366288914, 'samples': 5897472, 'steps': 30715, 'loss/train': 2.198493480682373} 11/07/2021 01:35:17 - INFO - __main__ - Step 30717: {'lr': 0.0004549758755886754, 'samples': 5897664, 'steps': 30716, 'loss/train': 5.7820844650268555} 11/07/2021 01:35:17 - INFO - __main__ - Step 30718: {'lr': 0.00045497283742210263, 'samples': 5897856, 'steps': 30717, 'loss/train': 1.0657594203948975} 11/07/2021 01:35:18 - INFO - __main__ - Step 30719: {'lr': 0.0004549697991631722, 'samples': 5898048, 'steps': 30718, 'loss/train': 1.59716796875} 11/07/2021 01:35:18 - INFO - __main__ - Step 30720: {'lr': 0.0004549667608118856, 'samples': 5898240, 'steps': 30719, 'loss/train': 1.0632942914962769} 11/07/2021 01:35:19 - INFO - __main__ - Step 30721: {'lr': 0.0004549637223682441, 'samples': 5898432, 'steps': 30720, 'loss/train': 1.446715235710144} 11/07/2021 01:35:19 - INFO - __main__ - Step 30722: {'lr': 0.0004549606838322492, 'samples': 5898624, 'steps': 30721, 'loss/train': 1.3796584606170654} 11/07/2021 01:35:19 - INFO - __main__ - Step 30723: {'lr': 0.00045495764520390216, 'samples': 5898816, 'steps': 30722, 'loss/train': 1.4826099872589111} 11/07/2021 01:35:21 - INFO - __main__ - Step 30724: {'lr': 0.0004549546064832043, 'samples': 5899008, 'steps': 30723, 'loss/train': 1.9744141101837158} 11/07/2021 01:35:21 - INFO - __main__ - Step 30725: {'lr': 0.0004549515676701571, 'samples': 5899200, 'steps': 30724, 'loss/train': 1.6435303688049316} 11/07/2021 01:35:21 - INFO - __main__ - Step 30726: {'lr': 0.0004549485287647619, 'samples': 5899392, 'steps': 30725, 'loss/train': 1.5569947957992554} 11/07/2021 01:35:22 - INFO - __main__ - Step 30727: {'lr': 0.00045494548976702, 'samples': 5899584, 'steps': 30726, 'loss/train': 1.3677185773849487} 11/07/2021 01:35:22 - INFO - __main__ - Step 30728: {'lr': 0.0004549424506769329, 'samples': 5899776, 'steps': 30727, 'loss/train': 1.946964144706726} 11/07/2021 01:35:23 - INFO - __main__ - Step 30729: {'lr': 0.00045493941149450185, 'samples': 5899968, 'steps': 30728, 'loss/train': 1.450317621231079} 11/07/2021 01:35:23 - INFO - __main__ - Step 30730: {'lr': 0.00045493637221972826, 'samples': 5900160, 'steps': 30729, 'loss/train': 1.8467463254928589} 11/07/2021 01:35:24 - INFO - __main__ - Step 30731: {'lr': 0.0004549333328526135, 'samples': 5900352, 'steps': 30730, 'loss/train': 1.7263344526290894} 11/07/2021 01:35:24 - INFO - __main__ - Step 30732: {'lr': 0.0004549302933931589, 'samples': 5900544, 'steps': 30731, 'loss/train': 1.9243172407150269} 11/07/2021 01:35:24 - INFO - __main__ - Step 30733: {'lr': 0.000454927253841366, 'samples': 5900736, 'steps': 30732, 'loss/train': 1.7375285625457764} 11/07/2021 01:35:26 - INFO - __main__ - Step 30734: {'lr': 0.00045492421419723595, 'samples': 5900928, 'steps': 30733, 'loss/train': 1.4696018695831299} 11/07/2021 01:35:26 - INFO - __main__ - Step 30735: {'lr': 0.00045492117446077027, 'samples': 5901120, 'steps': 30734, 'loss/train': 0.8575605154037476} 11/07/2021 01:35:26 - INFO - __main__ - Step 30736: {'lr': 0.0004549181346319702, 'samples': 5901312, 'steps': 30735, 'loss/train': 2.1260533332824707} 11/07/2021 01:35:27 - INFO - __main__ - Step 30737: {'lr': 0.00045491509471083717, 'samples': 5901504, 'steps': 30736, 'loss/train': 1.023028016090393} 11/07/2021 01:35:27 - INFO - __main__ - Step 30738: {'lr': 0.00045491205469737263, 'samples': 5901696, 'steps': 30737, 'loss/train': 1.1181199550628662} 11/07/2021 01:35:28 - INFO - __main__ - Step 30739: {'lr': 0.00045490901459157787, 'samples': 5901888, 'steps': 30738, 'loss/train': 0.6728991270065308} 11/07/2021 01:35:28 - INFO - __main__ - Step 30740: {'lr': 0.0004549059743934543, 'samples': 5902080, 'steps': 30739, 'loss/train': 1.761975884437561} 11/07/2021 01:35:29 - INFO - __main__ - Step 30741: {'lr': 0.00045490293410300315, 'samples': 5902272, 'steps': 30740, 'loss/train': 1.6367331743240356} 11/07/2021 01:35:29 - INFO - __main__ - Step 30742: {'lr': 0.000454899893720226, 'samples': 5902464, 'steps': 30741, 'loss/train': 1.7650607824325562} 11/07/2021 01:35:30 - INFO - __main__ - Step 30743: {'lr': 0.000454896853245124, 'samples': 5902656, 'steps': 30742, 'loss/train': 1.5175005197525024} 11/07/2021 01:35:30 - INFO - __main__ - Step 30744: {'lr': 0.00045489381267769873, 'samples': 5902848, 'steps': 30743, 'loss/train': 1.7161122560501099} 11/07/2021 01:35:31 - INFO - __main__ - Step 30745: {'lr': 0.00045489077201795147, 'samples': 5903040, 'steps': 30744, 'loss/train': 1.8334271907806396} 11/07/2021 01:35:31 - INFO - __main__ - Step 30746: {'lr': 0.0004548877312658836, 'samples': 5903232, 'steps': 30745, 'loss/train': 1.4619717597961426} 11/07/2021 01:35:32 - INFO - __main__ - Step 30747: {'lr': 0.0004548846904214964, 'samples': 5903424, 'steps': 30746, 'loss/train': 1.4247400760650635} 11/07/2021 01:35:32 - INFO - __main__ - Step 30748: {'lr': 0.00045488164948479144, 'samples': 5903616, 'steps': 30747, 'loss/train': 2.2171173095703125} 11/07/2021 01:35:32 - INFO - __main__ - Step 30749: {'lr': 0.0004548786084557699, 'samples': 5903808, 'steps': 30748, 'loss/train': 2.2073068618774414} 11/07/2021 01:35:33 - INFO - __main__ - Step 30750: {'lr': 0.00045487556733443327, 'samples': 5904000, 'steps': 30749, 'loss/train': 1.677097201347351} 11/07/2021 01:35:34 - INFO - __main__ - Step 30751: {'lr': 0.0004548725261207828, 'samples': 5904192, 'steps': 30750, 'loss/train': 1.304649829864502} 11/07/2021 01:35:34 - INFO - __main__ - Step 30752: {'lr': 0.0004548694848148199, 'samples': 5904384, 'steps': 30751, 'loss/train': 1.7409619092941284} 11/07/2021 01:35:34 - INFO - __main__ - Step 30753: {'lr': 0.0004548664434165461, 'samples': 5904576, 'steps': 30752, 'loss/train': 1.7309997081756592} 11/07/2021 01:35:35 - INFO - __main__ - Step 30754: {'lr': 0.0004548634019259625, 'samples': 5904768, 'steps': 30753, 'loss/train': 1.2886046171188354} 11/07/2021 01:35:36 - INFO - __main__ - Step 30755: {'lr': 0.0004548603603430708, 'samples': 5904960, 'steps': 30754, 'loss/train': 1.547845482826233} 11/07/2021 01:35:36 - INFO - __main__ - Step 30756: {'lr': 0.00045485731866787206, 'samples': 5905152, 'steps': 30755, 'loss/train': 1.908001184463501} 11/07/2021 01:35:36 - INFO - __main__ - Step 30757: {'lr': 0.00045485427690036774, 'samples': 5905344, 'steps': 30756, 'loss/train': 1.702413558959961} 11/07/2021 01:35:37 - INFO - __main__ - Step 30758: {'lr': 0.0004548512350405593, 'samples': 5905536, 'steps': 30757, 'loss/train': 2.074098587036133} 11/07/2021 01:35:37 - INFO - __main__ - Step 30759: {'lr': 0.00045484819308844806, 'samples': 5905728, 'steps': 30758, 'loss/train': 1.4840266704559326} 11/07/2021 01:35:38 - INFO - __main__ - Step 30760: {'lr': 0.00045484515104403535, 'samples': 5905920, 'steps': 30759, 'loss/train': 1.3921496868133545} 11/07/2021 01:35:38 - INFO - __main__ - Step 30761: {'lr': 0.00045484210890732257, 'samples': 5906112, 'steps': 30760, 'loss/train': 1.3195807933807373} 11/07/2021 01:35:39 - INFO - __main__ - Step 30762: {'lr': 0.0004548390666783111, 'samples': 5906304, 'steps': 30761, 'loss/train': 1.9495140314102173} 11/07/2021 01:35:39 - INFO - __main__ - Step 30763: {'lr': 0.00045483602435700233, 'samples': 5906496, 'steps': 30762, 'loss/train': 1.6819427013397217} 11/07/2021 01:35:40 - INFO - __main__ - Step 30764: {'lr': 0.0004548329819433976, 'samples': 5906688, 'steps': 30763, 'loss/train': 1.9270631074905396} 11/07/2021 01:35:41 - INFO - __main__ - Step 30765: {'lr': 0.00045482993943749835, 'samples': 5906880, 'steps': 30764, 'loss/train': 1.6744219064712524} 11/07/2021 01:35:41 - INFO - __main__ - Step 30766: {'lr': 0.0004548268968393058, 'samples': 5907072, 'steps': 30765, 'loss/train': 1.958350658416748} 11/07/2021 01:35:41 - INFO - __main__ - Step 30767: {'lr': 0.0004548238541488214, 'samples': 5907264, 'steps': 30766, 'loss/train': 1.2977490425109863} 11/07/2021 01:35:42 - INFO - __main__ - Step 30768: {'lr': 0.00045482081136604665, 'samples': 5907456, 'steps': 30767, 'loss/train': 1.6346162557601929} 11/07/2021 01:35:42 - INFO - __main__ - Step 30769: {'lr': 0.0004548177684909827, 'samples': 5907648, 'steps': 30768, 'loss/train': 0.8329635262489319} 11/07/2021 01:35:43 - INFO - __main__ - Step 30770: {'lr': 0.0004548147255236311, 'samples': 5907840, 'steps': 30769, 'loss/train': 1.6094725131988525} 11/07/2021 01:35:43 - INFO - __main__ - Step 30771: {'lr': 0.0004548116824639931, 'samples': 5908032, 'steps': 30770, 'loss/train': 1.5290954113006592} 11/07/2021 01:35:44 - INFO - __main__ - Step 30772: {'lr': 0.00045480863931207004, 'samples': 5908224, 'steps': 30771, 'loss/train': 1.5912102460861206} 11/07/2021 01:35:44 - INFO - __main__ - Step 30773: {'lr': 0.0004548055960678635, 'samples': 5908416, 'steps': 30772, 'loss/train': 2.0271196365356445} 11/07/2021 01:35:44 - INFO - __main__ - Step 30774: {'lr': 0.0004548025527313746, 'samples': 5908608, 'steps': 30773, 'loss/train': 1.5651638507843018} 11/07/2021 01:35:45 - INFO - __main__ - Step 30775: {'lr': 0.00045479950930260495, 'samples': 5908800, 'steps': 30774, 'loss/train': 1.0466399192810059} 11/07/2021 01:35:46 - INFO - __main__ - Step 30776: {'lr': 0.0004547964657815558, 'samples': 5908992, 'steps': 30775, 'loss/train': 1.261448860168457} 11/07/2021 01:35:46 - INFO - __main__ - Step 30777: {'lr': 0.0004547934221682284, 'samples': 5909184, 'steps': 30776, 'loss/train': 1.8580161333084106} 11/07/2021 01:35:46 - INFO - __main__ - Step 30778: {'lr': 0.00045479037846262436, 'samples': 5909376, 'steps': 30777, 'loss/train': 1.7419246435165405} 11/07/2021 01:35:47 - INFO - __main__ - Step 30779: {'lr': 0.00045478733466474487, 'samples': 5909568, 'steps': 30778, 'loss/train': 1.6353005170822144} 11/07/2021 01:35:47 - INFO - __main__ - Step 30780: {'lr': 0.0004547842907745914, 'samples': 5909760, 'steps': 30779, 'loss/train': 1.5967276096343994} 11/07/2021 01:35:48 - INFO - __main__ - Step 30781: {'lr': 0.00045478124679216523, 'samples': 5909952, 'steps': 30780, 'loss/train': 1.739806890487671} 11/07/2021 01:35:49 - INFO - __main__ - Step 30782: {'lr': 0.00045477820271746784, 'samples': 5910144, 'steps': 30781, 'loss/train': 1.5588977336883545} 11/07/2021 01:35:49 - INFO - __main__ - Step 30783: {'lr': 0.00045477515855050056, 'samples': 5910336, 'steps': 30782, 'loss/train': 1.5034235715866089} 11/07/2021 01:35:49 - INFO - __main__ - Step 30784: {'lr': 0.0004547721142912647, 'samples': 5910528, 'steps': 30783, 'loss/train': 1.5956065654754639} 11/07/2021 01:35:50 - INFO - __main__ - Step 30785: {'lr': 0.00045476906993976177, 'samples': 5910720, 'steps': 30784, 'loss/train': 1.606019139289856} 11/07/2021 01:35:51 - INFO - __main__ - Step 30786: {'lr': 0.000454766025495993, 'samples': 5910912, 'steps': 30785, 'loss/train': 1.5922966003417969} 11/07/2021 01:35:51 - INFO - __main__ - Step 30787: {'lr': 0.00045476298095995985, 'samples': 5911104, 'steps': 30786, 'loss/train': 5.82503604888916} 11/07/2021 01:35:51 - INFO - __main__ - Step 30788: {'lr': 0.00045475993633166357, 'samples': 5911296, 'steps': 30787, 'loss/train': 1.2205564975738525} 11/07/2021 01:35:52 - INFO - __main__ - Step 30789: {'lr': 0.00045475689161110565, 'samples': 5911488, 'steps': 30788, 'loss/train': 1.7455384731292725} 11/07/2021 01:35:52 - INFO - __main__ - Step 30790: {'lr': 0.0004547538467982876, 'samples': 5911680, 'steps': 30789, 'loss/train': 0.18178731203079224} 11/07/2021 01:35:53 - INFO - __main__ - Step 30791: {'lr': 0.00045475080189321044, 'samples': 5911872, 'steps': 30790, 'loss/train': 1.3786267042160034} 11/07/2021 01:35:53 - INFO - __main__ - Step 30792: {'lr': 0.00045474775689587576, 'samples': 5912064, 'steps': 30791, 'loss/train': 1.1766566038131714} 11/07/2021 01:35:54 - INFO - __main__ - Step 30793: {'lr': 0.00045474471180628496, 'samples': 5912256, 'steps': 30792, 'loss/train': 1.1489925384521484} 11/07/2021 01:35:54 - INFO - __main__ - Step 30794: {'lr': 0.0004547416666244393, 'samples': 5912448, 'steps': 30793, 'loss/train': 1.3420683145523071} 11/07/2021 01:35:55 - INFO - __main__ - Step 30795: {'lr': 0.00045473862135034026, 'samples': 5912640, 'steps': 30794, 'loss/train': 1.1864869594573975} 11/07/2021 01:35:55 - INFO - __main__ - Step 30796: {'lr': 0.0004547355759839891, 'samples': 5912832, 'steps': 30795, 'loss/train': 1.5019656419754028} 11/07/2021 01:35:56 - INFO - __main__ - Step 30797: {'lr': 0.00045473253052538725, 'samples': 5913024, 'steps': 30796, 'loss/train': 2.0348379611968994} 11/07/2021 01:35:56 - INFO - __main__ - Step 30798: {'lr': 0.00045472948497453613, 'samples': 5913216, 'steps': 30797, 'loss/train': 1.2545139789581299} 11/07/2021 01:35:57 - INFO - __main__ - Step 30799: {'lr': 0.00045472643933143703, 'samples': 5913408, 'steps': 30798, 'loss/train': 1.5450226068496704} 11/07/2021 01:35:57 - INFO - __main__ - Step 30800: {'lr': 0.0004547233935960914, 'samples': 5913600, 'steps': 30799, 'loss/train': 1.9017813205718994} 11/07/2021 01:35:57 - INFO - __main__ - Step 30801: {'lr': 0.00045472034776850045, 'samples': 5913792, 'steps': 30800, 'loss/train': 1.516133189201355} 11/07/2021 01:35:58 - INFO - __main__ - Step 30802: {'lr': 0.0004547173018486658, 'samples': 5913984, 'steps': 30801, 'loss/train': 1.5497409105300903} 11/07/2021 01:35:59 - INFO - __main__ - Step 30803: {'lr': 0.0004547142558365887, 'samples': 5914176, 'steps': 30802, 'loss/train': 1.4988566637039185} 11/07/2021 01:35:59 - INFO - __main__ - Step 30804: {'lr': 0.0004547112097322704, 'samples': 5914368, 'steps': 30803, 'loss/train': 1.5426658391952515} 11/07/2021 01:35:59 - INFO - __main__ - Step 30805: {'lr': 0.00045470816353571244, 'samples': 5914560, 'steps': 30804, 'loss/train': 1.770569920539856} 11/07/2021 01:36:00 - INFO - __main__ - Step 30806: {'lr': 0.00045470511724691613, 'samples': 5914752, 'steps': 30805, 'loss/train': 1.310795783996582} 11/07/2021 01:36:01 - INFO - __main__ - Step 30807: {'lr': 0.0004547020708658829, 'samples': 5914944, 'steps': 30806, 'loss/train': 1.5118852853775024} 11/07/2021 01:36:01 - INFO - __main__ - Step 30808: {'lr': 0.000454699024392614, 'samples': 5915136, 'steps': 30807, 'loss/train': 1.5655699968338013} 11/07/2021 01:36:02 - INFO - __main__ - Step 30809: {'lr': 0.0004546959778271109, 'samples': 5915328, 'steps': 30808, 'loss/train': 1.6117366552352905} 11/07/2021 01:36:02 - INFO - __main__ - Step 30810: {'lr': 0.00045469293116937504, 'samples': 5915520, 'steps': 30809, 'loss/train': 1.5515666007995605} 11/07/2021 01:36:02 - INFO - __main__ - Step 30811: {'lr': 0.0004546898844194076, 'samples': 5915712, 'steps': 30810, 'loss/train': 1.419262409210205} 11/07/2021 01:36:03 - INFO - __main__ - Step 30812: {'lr': 0.00045468683757721005, 'samples': 5915904, 'steps': 30811, 'loss/train': 1.8689558506011963} 11/07/2021 01:36:04 - INFO - __main__ - Step 30813: {'lr': 0.0004546837906427839, 'samples': 5916096, 'steps': 30812, 'loss/train': 1.618916392326355} 11/07/2021 01:36:04 - INFO - __main__ - Step 30814: {'lr': 0.00045468074361613026, 'samples': 5916288, 'steps': 30813, 'loss/train': 1.5968495607376099} 11/07/2021 01:36:04 - INFO - __main__ - Step 30815: {'lr': 0.0004546776964972507, 'samples': 5916480, 'steps': 30814, 'loss/train': 1.4214028120040894} 11/07/2021 01:36:05 - INFO - __main__ - Step 30816: {'lr': 0.00045467464928614657, 'samples': 5916672, 'steps': 30815, 'loss/train': 1.8217263221740723} 11/07/2021 01:36:05 - INFO - __main__ - Step 30817: {'lr': 0.0004546716019828191, 'samples': 5916864, 'steps': 30816, 'loss/train': 1.5973073244094849} 11/07/2021 01:36:06 - INFO - __main__ - Step 30818: {'lr': 0.00045466855458726975, 'samples': 5917056, 'steps': 30817, 'loss/train': 1.2746789455413818} 11/07/2021 01:36:06 - INFO - __main__ - Step 30819: {'lr': 0.0004546655070995, 'samples': 5917248, 'steps': 30818, 'loss/train': 1.660507082939148} 11/07/2021 01:36:07 - INFO - __main__ - Step 30820: {'lr': 0.0004546624595195111, 'samples': 5917440, 'steps': 30819, 'loss/train': 1.6329345703125} 11/07/2021 01:36:07 - INFO - __main__ - Step 30821: {'lr': 0.0004546594118473044, 'samples': 5917632, 'steps': 30820, 'loss/train': 1.6678979396820068} 11/07/2021 01:36:07 - INFO - __main__ - Step 30822: {'lr': 0.0004546563640828814, 'samples': 5917824, 'steps': 30821, 'loss/train': 1.5760250091552734} 11/07/2021 01:36:08 - INFO - __main__ - Step 30823: {'lr': 0.0004546533162262434, 'samples': 5918016, 'steps': 30822, 'loss/train': 1.591220498085022} 11/07/2021 01:36:09 - INFO - __main__ - Step 30824: {'lr': 0.00045465026827739175, 'samples': 5918208, 'steps': 30823, 'loss/train': 1.3347065448760986} 11/07/2021 01:36:09 - INFO - __main__ - Step 30825: {'lr': 0.00045464722023632784, 'samples': 5918400, 'steps': 30824, 'loss/train': 1.0544623136520386} 11/07/2021 01:36:09 - INFO - __main__ - Step 30826: {'lr': 0.00045464417210305303, 'samples': 5918592, 'steps': 30825, 'loss/train': 0.868805468082428} 11/07/2021 01:36:10 - INFO - __main__ - Step 30827: {'lr': 0.0004546411238775687, 'samples': 5918784, 'steps': 30826, 'loss/train': 1.4293575286865234} 11/07/2021 01:36:11 - INFO - __main__ - Step 30828: {'lr': 0.00045463807555987633, 'samples': 5918976, 'steps': 30827, 'loss/train': 1.6750293970108032} 11/07/2021 01:36:11 - INFO - __main__ - Step 30829: {'lr': 0.0004546350271499772, 'samples': 5919168, 'steps': 30828, 'loss/train': 1.5183910131454468} 11/07/2021 01:36:12 - INFO - __main__ - Step 30830: {'lr': 0.0004546319786478726, 'samples': 5919360, 'steps': 30829, 'loss/train': 1.4833358526229858} 11/07/2021 01:36:12 - INFO - __main__ - Step 30831: {'lr': 0.000454628930053564, 'samples': 5919552, 'steps': 30830, 'loss/train': 1.5842455625534058} 11/07/2021 01:36:12 - INFO - __main__ - Step 30832: {'lr': 0.0004546258813670528, 'samples': 5919744, 'steps': 30831, 'loss/train': 1.7990853786468506} 11/07/2021 01:36:13 - INFO - __main__ - Step 30833: {'lr': 0.0004546228325883403, 'samples': 5919936, 'steps': 30832, 'loss/train': 1.3607431650161743} 11/07/2021 01:36:14 - INFO - __main__ - Step 30834: {'lr': 0.00045461978371742794, 'samples': 5920128, 'steps': 30833, 'loss/train': 2.1018025875091553} 11/07/2021 01:36:14 - INFO - __main__ - Step 30835: {'lr': 0.00045461673475431704, 'samples': 5920320, 'steps': 30834, 'loss/train': 1.1725069284439087} 11/07/2021 01:36:14 - INFO - __main__ - Step 30836: {'lr': 0.00045461368569900895, 'samples': 5920512, 'steps': 30835, 'loss/train': 1.8515321016311646} 11/07/2021 01:36:15 - INFO - __main__ - Step 30837: {'lr': 0.0004546106365515052, 'samples': 5920704, 'steps': 30836, 'loss/train': 1.7336543798446655} 11/07/2021 01:36:15 - INFO - __main__ - Step 30838: {'lr': 0.000454607587311807, 'samples': 5920896, 'steps': 30837, 'loss/train': 1.4543534517288208} 11/07/2021 01:36:16 - INFO - __main__ - Step 30839: {'lr': 0.00045460453797991577, 'samples': 5921088, 'steps': 30838, 'loss/train': 1.7454028129577637} 11/07/2021 01:36:17 - INFO - __main__ - Step 30840: {'lr': 0.00045460148855583295, 'samples': 5921280, 'steps': 30839, 'loss/train': 1.305275559425354} 11/07/2021 01:36:17 - INFO - __main__ - Step 30841: {'lr': 0.00045459843903955977, 'samples': 5921472, 'steps': 30840, 'loss/train': 0.245193749666214} 11/07/2021 01:36:17 - INFO - __main__ - Step 30842: {'lr': 0.00045459538943109774, 'samples': 5921664, 'steps': 30841, 'loss/train': 1.5130547285079956} 11/07/2021 01:36:18 - INFO - __main__ - Step 30843: {'lr': 0.0004545923397304482, 'samples': 5921856, 'steps': 30842, 'loss/train': 1.6042557954788208} 11/07/2021 01:36:19 - INFO - __main__ - Step 30844: {'lr': 0.0004545892899376125, 'samples': 5922048, 'steps': 30843, 'loss/train': 1.2996978759765625} 11/07/2021 01:36:19 - INFO - __main__ - Step 30845: {'lr': 0.000454586240052592, 'samples': 5922240, 'steps': 30844, 'loss/train': 1.932795524597168} 11/07/2021 01:36:19 - INFO - __main__ - Step 30846: {'lr': 0.00045458319007538804, 'samples': 5922432, 'steps': 30845, 'loss/train': 1.3812726736068726} 11/07/2021 01:36:20 - INFO - __main__ - Step 30847: {'lr': 0.00045458014000600213, 'samples': 5922624, 'steps': 30846, 'loss/train': 1.7849777936935425} 11/07/2021 01:36:20 - INFO - __main__ - Step 30848: {'lr': 0.00045457708984443556, 'samples': 5922816, 'steps': 30847, 'loss/train': 1.9209630489349365} 11/07/2021 01:36:21 - INFO - __main__ - Step 30849: {'lr': 0.0004545740395906897, 'samples': 5923008, 'steps': 30848, 'loss/train': 1.4414986371994019} 11/07/2021 01:36:22 - INFO - __main__ - Step 30850: {'lr': 0.0004545709892447659, 'samples': 5923200, 'steps': 30849, 'loss/train': 1.7687026262283325} 11/07/2021 01:36:22 - INFO - __main__ - Step 30851: {'lr': 0.00045456793880666556, 'samples': 5923392, 'steps': 30850, 'loss/train': 1.5331692695617676} 11/07/2021 01:36:22 - INFO - __main__ - Step 30852: {'lr': 0.0004545648882763902, 'samples': 5923584, 'steps': 30851, 'loss/train': 1.2778525352478027} 11/07/2021 01:36:23 - INFO - __main__ - Step 30853: {'lr': 0.0004545618376539409, 'samples': 5923776, 'steps': 30852, 'loss/train': 1.6304030418395996} 11/07/2021 01:36:23 - INFO - __main__ - Step 30854: {'lr': 0.0004545587869393193, 'samples': 5923968, 'steps': 30853, 'loss/train': 0.9640048742294312} 11/07/2021 01:36:24 - INFO - __main__ - Step 30855: {'lr': 0.00045455573613252667, 'samples': 5924160, 'steps': 30854, 'loss/train': 1.319207787513733} 11/07/2021 01:36:24 - INFO - __main__ - Step 30856: {'lr': 0.0004545526852335643, 'samples': 5924352, 'steps': 30855, 'loss/train': 3.6997063159942627} 11/07/2021 01:36:25 - INFO - __main__ - Step 30857: {'lr': 0.0004545496342424337, 'samples': 5924544, 'steps': 30856, 'loss/train': 1.8156380653381348} 11/07/2021 01:36:25 - INFO - __main__ - Step 30858: {'lr': 0.00045454658315913617, 'samples': 5924736, 'steps': 30857, 'loss/train': 1.4153854846954346} 11/07/2021 01:36:26 - INFO - __main__ - Step 30859: {'lr': 0.0004545435319836731, 'samples': 5924928, 'steps': 30858, 'loss/train': 1.6114808320999146} 11/07/2021 01:36:27 - INFO - __main__ - Step 30860: {'lr': 0.00045454048071604593, 'samples': 5925120, 'steps': 30859, 'loss/train': 1.5351415872573853} 11/07/2021 01:36:27 - INFO - __main__ - Step 30861: {'lr': 0.0004545374293562559, 'samples': 5925312, 'steps': 30860, 'loss/train': 1.7278437614440918} 11/07/2021 01:36:27 - INFO - __main__ - Step 30862: {'lr': 0.00045453437790430446, 'samples': 5925504, 'steps': 30861, 'loss/train': 1.910088300704956} 11/07/2021 01:36:28 - INFO - __main__ - Step 30863: {'lr': 0.000454531326360193, 'samples': 5925696, 'steps': 30862, 'loss/train': 1.8231418132781982} 11/07/2021 01:36:28 - INFO - __main__ - Step 30864: {'lr': 0.00045452827472392286, 'samples': 5925888, 'steps': 30863, 'loss/train': 1.726006269454956} 11/07/2021 01:36:29 - INFO - __main__ - Step 30865: {'lr': 0.0004545252229954955, 'samples': 5926080, 'steps': 30864, 'loss/train': 1.0793594121932983} 11/07/2021 01:36:29 - INFO - __main__ - Step 30866: {'lr': 0.00045452217117491225, 'samples': 5926272, 'steps': 30865, 'loss/train': 1.897141456604004} 11/07/2021 01:36:30 - INFO - __main__ - Step 30867: {'lr': 0.00045451911926217437, 'samples': 5926464, 'steps': 30866, 'loss/train': 1.3745735883712769} 11/07/2021 01:36:30 - INFO - __main__ - Step 30868: {'lr': 0.00045451606725728337, 'samples': 5926656, 'steps': 30867, 'loss/train': 1.2515405416488647} 11/07/2021 01:36:30 - INFO - __main__ - Step 30869: {'lr': 0.0004545130151602406, 'samples': 5926848, 'steps': 30868, 'loss/train': 2.071918249130249} 11/07/2021 01:36:31 - INFO - __main__ - Step 30870: {'lr': 0.00045450996297104743, 'samples': 5927040, 'steps': 30869, 'loss/train': 1.5846422910690308} 11/07/2021 01:36:32 - INFO - __main__ - Step 30871: {'lr': 0.00045450691068970515, 'samples': 5927232, 'steps': 30870, 'loss/train': 1.7457700967788696} 11/07/2021 01:36:32 - INFO - __main__ - Step 30872: {'lr': 0.00045450385831621534, 'samples': 5927424, 'steps': 30871, 'loss/train': 1.7874201536178589} 11/07/2021 01:36:32 - INFO - __main__ - Step 30873: {'lr': 0.0004545008058505792, 'samples': 5927616, 'steps': 30872, 'loss/train': 3.894684076309204} 11/07/2021 01:36:33 - INFO - __main__ - Step 30874: {'lr': 0.0004544977532927981, 'samples': 5927808, 'steps': 30873, 'loss/train': 1.8667088747024536} 11/07/2021 01:36:33 - INFO - __main__ - Step 30875: {'lr': 0.0004544947006428735, 'samples': 5928000, 'steps': 30874, 'loss/train': 2.0149505138397217} 11/07/2021 01:36:34 - INFO - __main__ - Step 30876: {'lr': 0.00045449164790080675, 'samples': 5928192, 'steps': 30875, 'loss/train': 1.3990708589553833} 11/07/2021 01:36:35 - INFO - __main__ - Step 30877: {'lr': 0.00045448859506659926, 'samples': 5928384, 'steps': 30876, 'loss/train': 1.3811663389205933} 11/07/2021 01:36:35 - INFO - __main__ - Step 30878: {'lr': 0.0004544855421402523, 'samples': 5928576, 'steps': 30877, 'loss/train': 1.2561410665512085} 11/07/2021 01:36:35 - INFO - __main__ - Step 30879: {'lr': 0.00045448248912176726, 'samples': 5928768, 'steps': 30878, 'loss/train': 1.218980312347412} 11/07/2021 01:36:36 - INFO - __main__ - Step 30880: {'lr': 0.00045447943601114563, 'samples': 5928960, 'steps': 30879, 'loss/train': 1.5677345991134644} 11/07/2021 01:36:37 - INFO - __main__ - Step 30881: {'lr': 0.00045447638280838877, 'samples': 5929152, 'steps': 30880, 'loss/train': 2.2396042346954346} 11/07/2021 01:36:37 - INFO - __main__ - Step 30882: {'lr': 0.000454473329513498, 'samples': 5929344, 'steps': 30881, 'loss/train': 1.402809739112854} 11/07/2021 01:36:38 - INFO - __main__ - Step 30883: {'lr': 0.0004544702761264746, 'samples': 5929536, 'steps': 30882, 'loss/train': 1.6992740631103516} 11/07/2021 01:36:38 - INFO - __main__ - Step 30884: {'lr': 0.0004544672226473201, 'samples': 5929728, 'steps': 30883, 'loss/train': 1.5864168405532837} 11/07/2021 01:36:38 - INFO - __main__ - Step 30885: {'lr': 0.00045446416907603585, 'samples': 5929920, 'steps': 30884, 'loss/train': 1.8292831182479858} 11/07/2021 01:36:39 - INFO - __main__ - Step 30886: {'lr': 0.00045446111541262317, 'samples': 5930112, 'steps': 30885, 'loss/train': 2.021177053451538} 11/07/2021 01:36:40 - INFO - __main__ - Step 30887: {'lr': 0.0004544580616570835, 'samples': 5930304, 'steps': 30886, 'loss/train': 1.7107443809509277} 11/07/2021 01:36:40 - INFO - __main__ - Step 30888: {'lr': 0.0004544550078094182, 'samples': 5930496, 'steps': 30887, 'loss/train': 1.7455068826675415} 11/07/2021 01:36:40 - INFO - __main__ - Step 30889: {'lr': 0.00045445195386962855, 'samples': 5930688, 'steps': 30888, 'loss/train': 1.7165838479995728} 11/07/2021 01:36:41 - INFO - __main__ - Step 30890: {'lr': 0.0004544488998377161, 'samples': 5930880, 'steps': 30889, 'loss/train': 1.4680341482162476} 11/07/2021 01:36:41 - INFO - __main__ - Step 30891: {'lr': 0.000454445845713682, 'samples': 5931072, 'steps': 30890, 'loss/train': 1.9559266567230225} 11/07/2021 01:36:42 - INFO - __main__ - Step 30892: {'lr': 0.0004544427914975279, 'samples': 5931264, 'steps': 30891, 'loss/train': 1.3994450569152832} 11/07/2021 01:36:42 - INFO - __main__ - Step 30893: {'lr': 0.0004544397371892549, 'samples': 5931456, 'steps': 30892, 'loss/train': 1.4075242280960083} 11/07/2021 01:36:43 - INFO - __main__ - Step 30894: {'lr': 0.00045443668278886463, 'samples': 5931648, 'steps': 30893, 'loss/train': 1.1928571462631226} 11/07/2021 01:36:43 - INFO - __main__ - Step 30895: {'lr': 0.00045443362829635826, 'samples': 5931840, 'steps': 30894, 'loss/train': 1.7529635429382324} 11/07/2021 01:36:43 - INFO - __main__ - Step 30896: {'lr': 0.00045443057371173727, 'samples': 5932032, 'steps': 30895, 'loss/train': 1.4993561506271362} 11/07/2021 01:36:44 - INFO - __main__ - Step 30897: {'lr': 0.00045442751903500305, 'samples': 5932224, 'steps': 30896, 'loss/train': 1.454590916633606} 11/07/2021 01:36:45 - INFO - __main__ - Step 30898: {'lr': 0.0004544244642661569, 'samples': 5932416, 'steps': 30897, 'loss/train': 0.9046458601951599} 11/07/2021 01:36:45 - INFO - __main__ - Step 30899: {'lr': 0.00045442140940520027, 'samples': 5932608, 'steps': 30898, 'loss/train': 1.2835602760314941} 11/07/2021 01:36:45 - INFO - __main__ - Step 30900: {'lr': 0.0004544183544521345, 'samples': 5932800, 'steps': 30899, 'loss/train': 2.022730827331543} 11/07/2021 01:36:46 - INFO - __main__ - Step 30901: {'lr': 0.00045441529940696104, 'samples': 5932992, 'steps': 30900, 'loss/train': 1.4904640913009644} 11/07/2021 01:36:47 - INFO - __main__ - Step 30902: {'lr': 0.0004544122442696811, 'samples': 5933184, 'steps': 30901, 'loss/train': 1.6012802124023438} 11/07/2021 01:36:47 - INFO - __main__ - Step 30903: {'lr': 0.0004544091890402962, 'samples': 5933376, 'steps': 30902, 'loss/train': 1.2576720714569092} 11/07/2021 01:36:48 - INFO - __main__ - Step 30904: {'lr': 0.0004544061337188077, 'samples': 5933568, 'steps': 30903, 'loss/train': 1.4356153011322021} 11/07/2021 01:36:48 - INFO - __main__ - Step 30905: {'lr': 0.0004544030783052169, 'samples': 5933760, 'steps': 30904, 'loss/train': 1.6024492979049683} 11/07/2021 01:36:48 - INFO - __main__ - Step 30906: {'lr': 0.0004544000227995253, 'samples': 5933952, 'steps': 30905, 'loss/train': 1.9224432706832886} 11/07/2021 01:36:49 - INFO - __main__ - Step 30907: {'lr': 0.00045439696720173405, 'samples': 5934144, 'steps': 30906, 'loss/train': 1.2479407787322998} 11/07/2021 01:36:50 - INFO - __main__ - Step 30908: {'lr': 0.00045439391151184483, 'samples': 5934336, 'steps': 30907, 'loss/train': 1.609938383102417} 11/07/2021 01:36:50 - INFO - __main__ - Step 30909: {'lr': 0.0004543908557298588, 'samples': 5934528, 'steps': 30908, 'loss/train': 1.8101469278335571} 11/07/2021 01:36:50 - INFO - __main__ - Step 30910: {'lr': 0.0004543877998557775, 'samples': 5934720, 'steps': 30909, 'loss/train': 0.7057313919067383} 11/07/2021 01:36:51 - INFO - __main__ - Step 30911: {'lr': 0.00045438474388960205, 'samples': 5934912, 'steps': 30910, 'loss/train': 1.4680485725402832} 11/07/2021 01:36:52 - INFO - __main__ - Step 30912: {'lr': 0.0004543816878313341, 'samples': 5935104, 'steps': 30911, 'loss/train': 1.6297519207000732} 11/07/2021 01:36:52 - INFO - __main__ - Step 30913: {'lr': 0.0004543786316809749, 'samples': 5935296, 'steps': 30912, 'loss/train': 1.6242061853408813} 11/07/2021 01:36:53 - INFO - __main__ - Step 30914: {'lr': 0.0004543755754385258, 'samples': 5935488, 'steps': 30913, 'loss/train': 1.5210531949996948} 11/07/2021 01:36:53 - INFO - __main__ - Step 30915: {'lr': 0.00045437251910398824, 'samples': 5935680, 'steps': 30914, 'loss/train': 1.4314866065979004} 11/07/2021 01:36:53 - INFO - __main__ - Step 30916: {'lr': 0.00045436946267736364, 'samples': 5935872, 'steps': 30915, 'loss/train': 1.382826805114746} 11/07/2021 01:36:54 - INFO - __main__ - Step 30917: {'lr': 0.0004543664061586532, 'samples': 5936064, 'steps': 30916, 'loss/train': 1.454323649406433} 11/07/2021 01:36:55 - INFO - __main__ - Step 30918: {'lr': 0.00045436334954785854, 'samples': 5936256, 'steps': 30917, 'loss/train': 1.7000982761383057} 11/07/2021 01:36:55 - INFO - __main__ - Step 30919: {'lr': 0.0004543602928449808, 'samples': 5936448, 'steps': 30918, 'loss/train': 1.026580810546875} 11/07/2021 01:36:55 - INFO - __main__ - Step 30920: {'lr': 0.00045435723605002156, 'samples': 5936640, 'steps': 30919, 'loss/train': 1.9266397953033447} 11/07/2021 01:36:56 - INFO - __main__ - Step 30921: {'lr': 0.00045435417916298205, 'samples': 5936832, 'steps': 30920, 'loss/train': 1.4937081336975098} 11/07/2021 01:36:57 - INFO - __main__ - Step 30922: {'lr': 0.00045435112218386364, 'samples': 5937024, 'steps': 30921, 'loss/train': 1.2779721021652222} 11/07/2021 01:36:57 - INFO - __main__ - Step 30923: {'lr': 0.00045434806511266784, 'samples': 5937216, 'steps': 30922, 'loss/train': 2.076853036880493} 11/07/2021 01:36:57 - INFO - __main__ - Step 30924: {'lr': 0.0004543450079493959, 'samples': 5937408, 'steps': 30923, 'loss/train': 1.4119458198547363} 11/07/2021 01:36:58 - INFO - __main__ - Step 30925: {'lr': 0.0004543419506940494, 'samples': 5937600, 'steps': 30924, 'loss/train': 1.425700306892395} 11/07/2021 01:36:58 - INFO - __main__ - Step 30926: {'lr': 0.0004543388933466294, 'samples': 5937792, 'steps': 30925, 'loss/train': 1.519874930381775} 11/07/2021 01:36:58 - INFO - __main__ - Step 30927: {'lr': 0.00045433583590713756, 'samples': 5937984, 'steps': 30926, 'loss/train': 1.5032317638397217} 11/07/2021 01:37:00 - INFO - __main__ - Step 30928: {'lr': 0.0004543327783755751, 'samples': 5938176, 'steps': 30927, 'loss/train': 1.0906578302383423} 11/07/2021 01:37:00 - INFO - __main__ - Step 30929: {'lr': 0.0004543297207519434, 'samples': 5938368, 'steps': 30928, 'loss/train': 1.8797330856323242} 11/07/2021 01:37:00 - INFO - __main__ - Step 30930: {'lr': 0.0004543266630362439, 'samples': 5938560, 'steps': 30929, 'loss/train': 1.2183094024658203} 11/07/2021 01:37:01 - INFO - __main__ - Step 30931: {'lr': 0.00045432360522847803, 'samples': 5938752, 'steps': 30930, 'loss/train': 1.430260419845581} 11/07/2021 01:37:01 - INFO - __main__ - Step 30932: {'lr': 0.000454320547328647, 'samples': 5938944, 'steps': 30931, 'loss/train': 1.7848718166351318} 11/07/2021 01:37:02 - INFO - __main__ - Step 30933: {'lr': 0.00045431748933675236, 'samples': 5939136, 'steps': 30932, 'loss/train': 1.704218864440918} 11/07/2021 01:37:02 - INFO - __main__ - Step 30934: {'lr': 0.00045431443125279534, 'samples': 5939328, 'steps': 30933, 'loss/train': 1.6153076887130737} 11/07/2021 01:37:03 - INFO - __main__ - Step 30935: {'lr': 0.00045431137307677753, 'samples': 5939520, 'steps': 30934, 'loss/train': 1.322769045829773} 11/07/2021 01:37:03 - INFO - __main__ - Step 30936: {'lr': 0.00045430831480870005, 'samples': 5939712, 'steps': 30935, 'loss/train': 2.219740390777588} 11/07/2021 01:37:03 - INFO - __main__ - Step 30937: {'lr': 0.0004543052564485644, 'samples': 5939904, 'steps': 30936, 'loss/train': 1.327895164489746} 11/07/2021 01:37:04 - INFO - __main__ - Step 30938: {'lr': 0.00045430219799637197, 'samples': 5940096, 'steps': 30937, 'loss/train': 1.515455722808838} 11/07/2021 01:37:05 - INFO - __main__ - Step 30939: {'lr': 0.0004542991394521241, 'samples': 5940288, 'steps': 30938, 'loss/train': 2.256800413131714} 11/07/2021 01:37:05 - INFO - __main__ - Step 30940: {'lr': 0.00045429608081582216, 'samples': 5940480, 'steps': 30939, 'loss/train': 1.5054608583450317} 11/07/2021 01:37:06 - INFO - __main__ - Step 30941: {'lr': 0.0004542930220874677, 'samples': 5940672, 'steps': 30940, 'loss/train': 1.8650132417678833} 11/07/2021 01:37:06 - INFO - __main__ - Step 30942: {'lr': 0.00045428996326706185, 'samples': 5940864, 'steps': 30941, 'loss/train': 1.4545024633407593} 11/07/2021 01:37:07 - INFO - __main__ - Step 30943: {'lr': 0.0004542869043546061, 'samples': 5941056, 'steps': 30942, 'loss/train': 1.7451108694076538} 11/07/2021 01:37:07 - INFO - __main__ - Step 30944: {'lr': 0.0004542838453501018, 'samples': 5941248, 'steps': 30943, 'loss/train': 1.6728150844573975} 11/07/2021 01:37:08 - INFO - __main__ - Step 30945: {'lr': 0.0004542807862535504, 'samples': 5941440, 'steps': 30944, 'loss/train': 1.211879014968872} 11/07/2021 01:37:08 - INFO - __main__ - Step 30946: {'lr': 0.0004542777270649533, 'samples': 5941632, 'steps': 30945, 'loss/train': 1.518599033355713} 11/07/2021 01:37:08 - INFO - __main__ - Step 30947: {'lr': 0.0004542746677843117, 'samples': 5941824, 'steps': 30946, 'loss/train': 1.4764024019241333} 11/07/2021 01:37:09 - INFO - __main__ - Step 30948: {'lr': 0.0004542716084116271, 'samples': 5942016, 'steps': 30947, 'loss/train': 1.9140671491622925} 11/07/2021 01:37:10 - INFO - __main__ - Step 30949: {'lr': 0.0004542685489469008, 'samples': 5942208, 'steps': 30948, 'loss/train': 2.9213457107543945} 11/07/2021 01:37:10 - INFO - __main__ - Step 30950: {'lr': 0.0004542654893901344, 'samples': 5942400, 'steps': 30949, 'loss/train': 1.50641667842865} 11/07/2021 01:37:10 - INFO - __main__ - Step 30951: {'lr': 0.00045426242974132904, 'samples': 5942592, 'steps': 30950, 'loss/train': 1.8398200273513794} 11/07/2021 01:37:11 - INFO - __main__ - Step 30952: {'lr': 0.0004542593700004862, 'samples': 5942784, 'steps': 30951, 'loss/train': 1.6665246486663818} 11/07/2021 01:37:11 - INFO - __main__ - Step 30953: {'lr': 0.0004542563101676072, 'samples': 5942976, 'steps': 30952, 'loss/train': 1.343689203262329} 11/07/2021 01:37:12 - INFO - __main__ - Step 30954: {'lr': 0.0004542532502426935, 'samples': 5943168, 'steps': 30953, 'loss/train': 1.6398005485534668} 11/07/2021 01:37:12 - INFO - __main__ - Step 30955: {'lr': 0.0004542501902257464, 'samples': 5943360, 'steps': 30954, 'loss/train': 1.0328978300094604} 11/07/2021 01:37:13 - INFO - __main__ - Step 30956: {'lr': 0.0004542471301167673, 'samples': 5943552, 'steps': 30955, 'loss/train': 1.7728264331817627} 11/07/2021 01:37:13 - INFO - __main__ - Step 30957: {'lr': 0.0004542440699157577, 'samples': 5943744, 'steps': 30956, 'loss/train': 1.5043604373931885} 11/07/2021 01:37:13 - INFO - __main__ - Step 30958: {'lr': 0.00045424100962271883, 'samples': 5943936, 'steps': 30957, 'loss/train': 1.5568513870239258} 11/07/2021 01:37:15 - INFO - __main__ - Step 30959: {'lr': 0.00045423794923765204, 'samples': 5944128, 'steps': 30958, 'loss/train': 1.4711169004440308} 11/07/2021 01:37:15 - INFO - __main__ - Step 30960: {'lr': 0.00045423488876055883, 'samples': 5944320, 'steps': 30959, 'loss/train': 1.6682064533233643} 11/07/2021 01:37:15 - INFO - __main__ - Step 30961: {'lr': 0.00045423182819144054, 'samples': 5944512, 'steps': 30960, 'loss/train': 1.5531412363052368} 11/07/2021 01:37:16 - INFO - __main__ - Step 30962: {'lr': 0.00045422876753029853, 'samples': 5944704, 'steps': 30961, 'loss/train': 1.60807466506958} 11/07/2021 01:37:16 - INFO - __main__ - Step 30963: {'lr': 0.0004542257067771342, 'samples': 5944896, 'steps': 30962, 'loss/train': 1.7051243782043457} 11/07/2021 01:37:17 - INFO - __main__ - Step 30964: {'lr': 0.0004542226459319489, 'samples': 5945088, 'steps': 30963, 'loss/train': 1.5874546766281128} 11/07/2021 01:37:17 - INFO - __main__ - Step 30965: {'lr': 0.000454219584994744, 'samples': 5945280, 'steps': 30964, 'loss/train': 2.2392561435699463} 11/07/2021 01:37:18 - INFO - __main__ - Step 30966: {'lr': 0.00045421652396552094, 'samples': 5945472, 'steps': 30965, 'loss/train': 1.6757169961929321} 11/07/2021 01:37:18 - INFO - __main__ - Step 30967: {'lr': 0.0004542134628442811, 'samples': 5945664, 'steps': 30966, 'loss/train': 1.6425938606262207} 11/07/2021 01:37:18 - INFO - __main__ - Step 30968: {'lr': 0.0004542104016310258, 'samples': 5945856, 'steps': 30967, 'loss/train': 1.8963409662246704} 11/07/2021 01:37:19 - INFO - __main__ - Step 30969: {'lr': 0.0004542073403257564, 'samples': 5946048, 'steps': 30968, 'loss/train': 1.6767983436584473} 11/07/2021 01:37:20 - INFO - __main__ - Step 30970: {'lr': 0.0004542042789284744, 'samples': 5946240, 'steps': 30969, 'loss/train': 1.701396107673645} 11/07/2021 01:37:20 - INFO - __main__ - Step 30971: {'lr': 0.0004542012174391811, 'samples': 5946432, 'steps': 30970, 'loss/train': 1.6189563274383545} 11/07/2021 01:37:21 - INFO - __main__ - Step 30972: {'lr': 0.0004541981558578778, 'samples': 5946624, 'steps': 30971, 'loss/train': 1.6633687019348145} 11/07/2021 01:37:21 - INFO - __main__ - Step 30973: {'lr': 0.00045419509418456603, 'samples': 5946816, 'steps': 30972, 'loss/train': 1.5318022966384888} 11/07/2021 01:37:22 - INFO - __main__ - Step 30974: {'lr': 0.00045419203241924705, 'samples': 5947008, 'steps': 30973, 'loss/train': 1.4163742065429688} 11/07/2021 01:37:22 - INFO - __main__ - Step 30975: {'lr': 0.00045418897056192234, 'samples': 5947200, 'steps': 30974, 'loss/train': 1.6730180978775024} 11/07/2021 01:37:23 - INFO - __main__ - Step 30976: {'lr': 0.00045418590861259317, 'samples': 5947392, 'steps': 30975, 'loss/train': 1.4678789377212524} 11/07/2021 01:37:23 - INFO - __main__ - Step 30977: {'lr': 0.0004541828465712611, 'samples': 5947584, 'steps': 30976, 'loss/train': 1.392598271369934} 11/07/2021 01:37:23 - INFO - __main__ - Step 30978: {'lr': 0.0004541797844379273, 'samples': 5947776, 'steps': 30977, 'loss/train': 1.4600822925567627} 11/07/2021 01:37:24 - INFO - __main__ - Step 30979: {'lr': 0.0004541767222125932, 'samples': 5947968, 'steps': 30978, 'loss/train': 1.5391762256622314} 11/07/2021 01:37:25 - INFO - __main__ - Step 30980: {'lr': 0.0004541736598952603, 'samples': 5948160, 'steps': 30979, 'loss/train': 1.474677324295044} 11/07/2021 01:37:25 - INFO - __main__ - Step 30981: {'lr': 0.0004541705974859298, 'samples': 5948352, 'steps': 30980, 'loss/train': 1.150425672531128} 11/07/2021 01:37:25 - INFO - __main__ - Step 30982: {'lr': 0.0004541675349846033, 'samples': 5948544, 'steps': 30981, 'loss/train': 1.3606353998184204} 11/07/2021 01:37:26 - INFO - __main__ - Step 30983: {'lr': 0.000454164472391282, 'samples': 5948736, 'steps': 30982, 'loss/train': 1.8952081203460693} 11/07/2021 01:37:27 - INFO - __main__ - Step 30984: {'lr': 0.00045416140970596736, 'samples': 5948928, 'steps': 30983, 'loss/train': 1.3433048725128174} 11/07/2021 01:37:27 - INFO - __main__ - Step 30985: {'lr': 0.0004541583469286607, 'samples': 5949120, 'steps': 30984, 'loss/train': 1.4420620203018188} 11/07/2021 01:37:27 - INFO - __main__ - Step 30986: {'lr': 0.00045415528405936347, 'samples': 5949312, 'steps': 30985, 'loss/train': 1.4229804277420044} 11/07/2021 01:37:28 - INFO - __main__ - Step 30987: {'lr': 0.000454152221098077, 'samples': 5949504, 'steps': 30986, 'loss/train': 1.2379319667816162} 11/07/2021 01:37:28 - INFO - __main__ - Step 30988: {'lr': 0.0004541491580448027, 'samples': 5949696, 'steps': 30987, 'loss/train': 1.443123698234558} 11/07/2021 01:37:29 - INFO - __main__ - Step 30989: {'lr': 0.00045414609489954195, 'samples': 5949888, 'steps': 30988, 'loss/train': 1.3216699361801147} 11/07/2021 01:37:29 - INFO - __main__ - Step 30990: {'lr': 0.00045414303166229616, 'samples': 5950080, 'steps': 30989, 'loss/train': 1.2806390523910522} 11/07/2021 01:37:30 - INFO - __main__ - Step 30991: {'lr': 0.0004541399683330666, 'samples': 5950272, 'steps': 30990, 'loss/train': 1.38321053981781} 11/07/2021 01:37:30 - INFO - __main__ - Step 30992: {'lr': 0.00045413690491185476, 'samples': 5950464, 'steps': 30991, 'loss/train': 1.6959577798843384} 11/07/2021 01:37:30 - INFO - __main__ - Step 30993: {'lr': 0.00045413384139866196, 'samples': 5950656, 'steps': 30992, 'loss/train': 1.4465599060058594} 11/07/2021 01:37:31 - INFO - __main__ - Step 30994: {'lr': 0.0004541307777934896, 'samples': 5950848, 'steps': 30993, 'loss/train': 1.340911626815796} 11/07/2021 01:37:32 - INFO - __main__ - Step 30995: {'lr': 0.00045412771409633905, 'samples': 5951040, 'steps': 30994, 'loss/train': 1.0903595685958862} 11/07/2021 01:37:32 - INFO - __main__ - Step 30996: {'lr': 0.0004541246503072117, 'samples': 5951232, 'steps': 30995, 'loss/train': 1.2522296905517578} 11/07/2021 01:37:33 - INFO - __main__ - Step 30997: {'lr': 0.000454121586426109, 'samples': 5951424, 'steps': 30996, 'loss/train': 1.7061339616775513} 11/07/2021 01:37:33 - INFO - __main__ - Step 30998: {'lr': 0.0004541185224530322, 'samples': 5951616, 'steps': 30997, 'loss/train': 1.4970251321792603} 11/07/2021 01:37:33 - INFO - __main__ - Step 30999: {'lr': 0.00045411545838798273, 'samples': 5951808, 'steps': 30998, 'loss/train': 1.1896541118621826} 11/07/2021 01:37:34 - INFO - __main__ - Step 31000: {'lr': 0.00045411239423096206, 'samples': 5952000, 'steps': 30999, 'loss/train': 1.9125195741653442} 11/07/2021 01:37:35 - INFO - __main__ - Step 31001: {'lr': 0.0004541093299819714, 'samples': 5952192, 'steps': 31000, 'loss/train': 1.6603349447250366} 11/07/2021 01:37:35 - INFO - __main__ - Step 31002: {'lr': 0.0004541062656410123, 'samples': 5952384, 'steps': 31001, 'loss/train': 5.856283187866211} 11/07/2021 01:37:35 - INFO - __main__ - Step 31003: {'lr': 0.000454103201208086, 'samples': 5952576, 'steps': 31002, 'loss/train': 1.9771652221679688} 11/07/2021 01:37:36 - INFO - __main__ - Step 31004: {'lr': 0.00045410013668319404, 'samples': 5952768, 'steps': 31003, 'loss/train': 0.9490795135498047} 11/07/2021 01:37:37 - INFO - __main__ - Step 31005: {'lr': 0.00045409707206633764, 'samples': 5952960, 'steps': 31004, 'loss/train': 1.821311354637146} 11/07/2021 01:37:37 - INFO - __main__ - Step 31006: {'lr': 0.0004540940073575183, 'samples': 5953152, 'steps': 31005, 'loss/train': 1.4057084321975708} 11/07/2021 01:37:38 - INFO - __main__ - Step 31007: {'lr': 0.00045409094255673734, 'samples': 5953344, 'steps': 31006, 'loss/train': 1.750818133354187} 11/07/2021 01:37:38 - INFO - __main__ - Step 31008: {'lr': 0.00045408787766399605, 'samples': 5953536, 'steps': 31007, 'loss/train': 1.7356077432632446} 11/07/2021 01:37:38 - INFO - __main__ - Step 31009: {'lr': 0.00045408481267929604, 'samples': 5953728, 'steps': 31008, 'loss/train': 3.454223155975342} 11/07/2021 01:37:39 - INFO - __main__ - Step 31010: {'lr': 0.0004540817476026385, 'samples': 5953920, 'steps': 31009, 'loss/train': 1.6089481115341187} 11/07/2021 01:37:40 - INFO - __main__ - Step 31011: {'lr': 0.00045407868243402483, 'samples': 5954112, 'steps': 31010, 'loss/train': 1.8852707147598267} 11/07/2021 01:37:40 - INFO - __main__ - Step 31012: {'lr': 0.0004540756171734565, 'samples': 5954304, 'steps': 31011, 'loss/train': 1.0137343406677246} 11/07/2021 01:37:40 - INFO - __main__ - Step 31013: {'lr': 0.0004540725518209349, 'samples': 5954496, 'steps': 31012, 'loss/train': 1.3237601518630981} 11/07/2021 01:37:41 - INFO - __main__ - Step 31014: {'lr': 0.0004540694863764613, 'samples': 5954688, 'steps': 31013, 'loss/train': 1.5047588348388672} 11/07/2021 01:37:41 - INFO - __main__ - Step 31015: {'lr': 0.0004540664208400371, 'samples': 5954880, 'steps': 31014, 'loss/train': 1.7855403423309326} 11/07/2021 01:37:42 - INFO - __main__ - Step 31016: {'lr': 0.0004540633552116638, 'samples': 5955072, 'steps': 31015, 'loss/train': 1.128483772277832} 11/07/2021 01:37:43 - INFO - __main__ - Step 31017: {'lr': 0.0004540602894913427, 'samples': 5955264, 'steps': 31016, 'loss/train': 1.2099534273147583} 11/07/2021 01:37:43 - INFO - __main__ - Step 31018: {'lr': 0.0004540572236790751, 'samples': 5955456, 'steps': 31017, 'loss/train': 1.6549265384674072} 11/07/2021 01:37:43 - INFO - __main__ - Step 31019: {'lr': 0.0004540541577748625, 'samples': 5955648, 'steps': 31018, 'loss/train': 1.7200446128845215} 11/07/2021 01:37:44 - INFO - __main__ - Step 31020: {'lr': 0.0004540510917787063, 'samples': 5955840, 'steps': 31019, 'loss/train': 1.4873820543289185} 11/07/2021 01:37:45 - INFO - __main__ - Step 31021: {'lr': 0.00045404802569060776, 'samples': 5956032, 'steps': 31020, 'loss/train': 1.7632642984390259} 11/07/2021 01:37:45 - INFO - __main__ - Step 31022: {'lr': 0.00045404495951056835, 'samples': 5956224, 'steps': 31021, 'loss/train': 1.854540228843689} 11/07/2021 01:37:45 - INFO - __main__ - Step 31023: {'lr': 0.00045404189323858946, 'samples': 5956416, 'steps': 31022, 'loss/train': 1.431431770324707} 11/07/2021 01:37:46 - INFO - __main__ - Step 31024: {'lr': 0.0004540388268746724, 'samples': 5956608, 'steps': 31023, 'loss/train': 1.7353435754776} 11/07/2021 01:37:46 - INFO - __main__ - Step 31025: {'lr': 0.0004540357604188186, 'samples': 5956800, 'steps': 31024, 'loss/train': 1.3156670331954956} 11/07/2021 01:37:47 - INFO - __main__ - Step 31026: {'lr': 0.0004540326938710295, 'samples': 5956992, 'steps': 31025, 'loss/train': 1.1969841718673706} 11/07/2021 01:37:47 - INFO - __main__ - Step 31027: {'lr': 0.0004540296272313064, 'samples': 5957184, 'steps': 31026, 'loss/train': 1.8228636980056763} 11/07/2021 01:37:48 - INFO - __main__ - Step 31028: {'lr': 0.00045402656049965055, 'samples': 5957376, 'steps': 31027, 'loss/train': 1.6666202545166016} 11/07/2021 01:37:48 - INFO - __main__ - Step 31029: {'lr': 0.0004540234936760636, 'samples': 5957568, 'steps': 31028, 'loss/train': 1.6700613498687744} 11/07/2021 01:37:48 - INFO - __main__ - Step 31030: {'lr': 0.00045402042676054684, 'samples': 5957760, 'steps': 31029, 'loss/train': 1.8225733041763306} 11/07/2021 01:37:49 - INFO - __main__ - Step 31031: {'lr': 0.0004540173597531015, 'samples': 5957952, 'steps': 31030, 'loss/train': 1.4744887351989746} 11/07/2021 01:37:50 - INFO - __main__ - Step 31032: {'lr': 0.00045401429265372925, 'samples': 5958144, 'steps': 31031, 'loss/train': 1.3476732969284058} 11/07/2021 01:37:50 - INFO - __main__ - Step 31033: {'lr': 0.0004540112254624312, 'samples': 5958336, 'steps': 31032, 'loss/train': 1.4836091995239258} 11/07/2021 01:37:51 - INFO - __main__ - Step 31034: {'lr': 0.0004540081581792089, 'samples': 5958528, 'steps': 31033, 'loss/train': 1.074027180671692} 11/07/2021 01:37:51 - INFO - __main__ - Step 31035: {'lr': 0.0004540050908040636, 'samples': 5958720, 'steps': 31034, 'loss/train': 1.2955214977264404} 11/07/2021 01:37:51 - INFO - __main__ - Step 31036: {'lr': 0.0004540020233369968, 'samples': 5958912, 'steps': 31035, 'loss/train': 1.5886207818984985} 11/07/2021 01:37:52 - INFO - __main__ - Step 31037: {'lr': 0.00045399895577800985, 'samples': 5959104, 'steps': 31036, 'loss/train': 1.8249047994613647} 11/07/2021 01:37:53 - INFO - __main__ - Step 31038: {'lr': 0.00045399588812710415, 'samples': 5959296, 'steps': 31037, 'loss/train': 1.6218334436416626} 11/07/2021 01:37:53 - INFO - __main__ - Step 31039: {'lr': 0.0004539928203842809, 'samples': 5959488, 'steps': 31038, 'loss/train': 1.527317762374878} 11/07/2021 01:37:53 - INFO - __main__ - Step 31040: {'lr': 0.0004539897525495418, 'samples': 5959680, 'steps': 31039, 'loss/train': 1.6444486379623413} 11/07/2021 01:37:54 - INFO - __main__ - Step 31041: {'lr': 0.0004539866846228879, 'samples': 5959872, 'steps': 31040, 'loss/train': 1.0026158094406128} 11/07/2021 01:37:55 - INFO - __main__ - Step 31042: {'lr': 0.0004539836166043209, 'samples': 5960064, 'steps': 31041, 'loss/train': 1.2307610511779785} 11/07/2021 01:37:55 - INFO - __main__ - Step 31043: {'lr': 0.00045398054849384197, 'samples': 5960256, 'steps': 31042, 'loss/train': 2.5408263206481934} 11/07/2021 01:37:55 - INFO - __main__ - Step 31044: {'lr': 0.0004539774802914526, 'samples': 5960448, 'steps': 31043, 'loss/train': 1.5283260345458984} 11/07/2021 01:37:56 - INFO - __main__ - Step 31045: {'lr': 0.00045397441199715406, 'samples': 5960640, 'steps': 31044, 'loss/train': 1.871448040008545} 11/07/2021 01:37:56 - INFO - __main__ - Step 31046: {'lr': 0.0004539713436109478, 'samples': 5960832, 'steps': 31045, 'loss/train': 2.1768293380737305} 11/07/2021 01:37:57 - INFO - __main__ - Step 31047: {'lr': 0.0004539682751328352, 'samples': 5961024, 'steps': 31046, 'loss/train': 1.7534023523330688} 11/07/2021 01:37:57 - INFO - __main__ - Step 31048: {'lr': 0.0004539652065628177, 'samples': 5961216, 'steps': 31047, 'loss/train': 1.5624446868896484} 11/07/2021 01:37:58 - INFO - __main__ - Step 31049: {'lr': 0.00045396213790089657, 'samples': 5961408, 'steps': 31048, 'loss/train': 2.0577123165130615} 11/07/2021 01:37:58 - INFO - __main__ - Step 31050: {'lr': 0.0004539590691470733, 'samples': 5961600, 'steps': 31049, 'loss/train': 1.3571999073028564} 11/07/2021 01:37:59 - INFO - __main__ - Step 31051: {'lr': 0.0004539560003013492, 'samples': 5961792, 'steps': 31050, 'loss/train': 1.3011759519577026} 11/07/2021 01:38:00 - INFO - __main__ - Step 31052: {'lr': 0.0004539529313637256, 'samples': 5961984, 'steps': 31051, 'loss/train': 1.3227177858352661} 11/07/2021 01:38:00 - INFO - __main__ - Step 31053: {'lr': 0.0004539498623342041, 'samples': 5962176, 'steps': 31052, 'loss/train': 1.3051633834838867} 11/07/2021 01:38:00 - INFO - __main__ - Step 31054: {'lr': 0.0004539467932127858, 'samples': 5962368, 'steps': 31053, 'loss/train': 1.5299521684646606} 11/07/2021 01:38:01 - INFO - __main__ - Step 31055: {'lr': 0.00045394372399947225, 'samples': 5962560, 'steps': 31054, 'loss/train': 1.6661772727966309} 11/07/2021 01:38:01 - INFO - __main__ - Step 31056: {'lr': 0.0004539406546942649, 'samples': 5962752, 'steps': 31055, 'loss/train': 0.8315156102180481} 11/07/2021 01:38:02 - INFO - __main__ - Step 31057: {'lr': 0.00045393758529716497, 'samples': 5962944, 'steps': 31056, 'loss/train': 1.5746870040893555} 11/07/2021 01:38:02 - INFO - __main__ - Step 31058: {'lr': 0.0004539345158081739, 'samples': 5963136, 'steps': 31057, 'loss/train': 1.131955862045288} 11/07/2021 01:38:03 - INFO - __main__ - Step 31059: {'lr': 0.0004539314462272931, 'samples': 5963328, 'steps': 31058, 'loss/train': 5.825372695922852} 11/07/2021 01:38:03 - INFO - __main__ - Step 31060: {'lr': 0.0004539283765545239, 'samples': 5963520, 'steps': 31059, 'loss/train': 1.8322217464447021} 11/07/2021 01:38:03 - INFO - __main__ - Step 31061: {'lr': 0.00045392530678986775, 'samples': 5963712, 'steps': 31060, 'loss/train': 1.517366647720337} 11/07/2021 01:38:04 - INFO - __main__ - Step 31062: {'lr': 0.00045392223693332604, 'samples': 5963904, 'steps': 31061, 'loss/train': 1.5481271743774414} 11/07/2021 01:38:05 - INFO - __main__ - Step 31063: {'lr': 0.0004539191669849001, 'samples': 5964096, 'steps': 31062, 'loss/train': 1.5892140865325928} 11/07/2021 01:38:05 - INFO - __main__ - Step 31064: {'lr': 0.0004539160969445913, 'samples': 5964288, 'steps': 31063, 'loss/train': 1.6205822229385376} 11/07/2021 01:38:06 - INFO - __main__ - Step 31065: {'lr': 0.0004539130268124011, 'samples': 5964480, 'steps': 31064, 'loss/train': 1.731955885887146} 11/07/2021 01:38:06 - INFO - __main__ - Step 31066: {'lr': 0.0004539099565883308, 'samples': 5964672, 'steps': 31065, 'loss/train': 1.4995030164718628} 11/07/2021 01:38:06 - INFO - __main__ - Step 31067: {'lr': 0.0004539068862723818, 'samples': 5964864, 'steps': 31066, 'loss/train': 1.1494752168655396} 11/07/2021 01:38:07 - INFO - __main__ - Step 31068: {'lr': 0.0004539038158645555, 'samples': 5965056, 'steps': 31067, 'loss/train': 1.4408982992172241} 11/07/2021 01:38:08 - INFO - __main__ - Step 31069: {'lr': 0.00045390074536485336, 'samples': 5965248, 'steps': 31068, 'loss/train': 1.6689640283584595} 11/07/2021 01:38:08 - INFO - __main__ - Step 31070: {'lr': 0.00045389767477327657, 'samples': 5965440, 'steps': 31069, 'loss/train': 2.307602643966675} 11/07/2021 01:38:08 - INFO - __main__ - Step 31071: {'lr': 0.00045389460408982676, 'samples': 5965632, 'steps': 31070, 'loss/train': 1.3233267068862915} 11/07/2021 01:38:09 - INFO - __main__ - Step 31072: {'lr': 0.0004538915333145052, 'samples': 5965824, 'steps': 31071, 'loss/train': 1.5080645084381104} 11/07/2021 01:38:09 - INFO - __main__ - Step 31073: {'lr': 0.00045388846244731314, 'samples': 5966016, 'steps': 31072, 'loss/train': 1.944614052772522} 11/07/2021 01:38:10 - INFO - __main__ - Step 31074: {'lr': 0.00045388539148825214, 'samples': 5966208, 'steps': 31073, 'loss/train': 1.272490382194519} 11/07/2021 01:38:10 - INFO - __main__ - Step 31075: {'lr': 0.0004538823204373235, 'samples': 5966400, 'steps': 31074, 'loss/train': 2.0401172637939453} 11/07/2021 01:38:11 - INFO - __main__ - Step 31076: {'lr': 0.00045387924929452873, 'samples': 5966592, 'steps': 31075, 'loss/train': 1.6341118812561035} 11/07/2021 01:38:11 - INFO - __main__ - Step 31077: {'lr': 0.000453876178059869, 'samples': 5966784, 'steps': 31076, 'loss/train': 1.3141553401947021} 11/07/2021 01:38:12 - INFO - __main__ - Step 31078: {'lr': 0.0004538731067333459, 'samples': 5966976, 'steps': 31077, 'loss/train': 1.8784071207046509} 11/07/2021 01:38:13 - INFO - __main__ - Step 31079: {'lr': 0.00045387003531496064, 'samples': 5967168, 'steps': 31078, 'loss/train': 1.3279436826705933} 11/07/2021 01:38:13 - INFO - __main__ - Step 31080: {'lr': 0.00045386696380471473, 'samples': 5967360, 'steps': 31079, 'loss/train': 1.4521465301513672} 11/07/2021 01:38:13 - INFO - __main__ - Step 31081: {'lr': 0.0004538638922026095, 'samples': 5967552, 'steps': 31080, 'loss/train': 1.4966466426849365} 11/07/2021 01:38:14 - INFO - __main__ - Step 31082: {'lr': 0.0004538608205086464, 'samples': 5967744, 'steps': 31081, 'loss/train': 1.457392692565918} 11/07/2021 01:38:14 - INFO - __main__ - Step 31083: {'lr': 0.0004538577487228267, 'samples': 5967936, 'steps': 31082, 'loss/train': 1.7701526880264282} 11/07/2021 01:38:15 - INFO - __main__ - Step 31084: {'lr': 0.00045385467684515193, 'samples': 5968128, 'steps': 31083, 'loss/train': 1.5525671243667603} 11/07/2021 01:38:16 - INFO - __main__ - Step 31085: {'lr': 0.0004538516048756233, 'samples': 5968320, 'steps': 31084, 'loss/train': 1.4492534399032593} 11/07/2021 01:38:16 - INFO - __main__ - Step 31086: {'lr': 0.00045384853281424235, 'samples': 5968512, 'steps': 31085, 'loss/train': 1.1388990879058838} 11/07/2021 01:38:16 - INFO - __main__ - Step 31087: {'lr': 0.0004538454606610103, 'samples': 5968704, 'steps': 31086, 'loss/train': 1.311660647392273} 11/07/2021 01:38:17 - INFO - __main__ - Step 31088: {'lr': 0.0004538423884159287, 'samples': 5968896, 'steps': 31087, 'loss/train': 1.760115146636963} 11/07/2021 01:38:18 - INFO - __main__ - Step 31089: {'lr': 0.0004538393160789988, 'samples': 5969088, 'steps': 31088, 'loss/train': 2.212783098220825} 11/07/2021 01:38:18 - INFO - __main__ - Step 31090: {'lr': 0.0004538362436502221, 'samples': 5969280, 'steps': 31089, 'loss/train': 0.6168873310089111} 11/07/2021 01:38:19 - INFO - __main__ - Step 31091: {'lr': 0.00045383317112959997, 'samples': 5969472, 'steps': 31090, 'loss/train': 0.956122636795044} 11/07/2021 01:38:19 - INFO - __main__ - Step 31092: {'lr': 0.0004538300985171337, 'samples': 5969664, 'steps': 31091, 'loss/train': 1.3800127506256104} 11/07/2021 01:38:19 - INFO - __main__ - Step 31093: {'lr': 0.00045382702581282477, 'samples': 5969856, 'steps': 31092, 'loss/train': 1.537808895111084} 11/07/2021 01:38:20 - INFO - __main__ - Step 31094: {'lr': 0.0004538239530166745, 'samples': 5970048, 'steps': 31093, 'loss/train': 1.8231440782546997} 11/07/2021 01:38:21 - INFO - __main__ - Step 31095: {'lr': 0.0004538208801286843, 'samples': 5970240, 'steps': 31094, 'loss/train': 1.6640911102294922} 11/07/2021 01:38:21 - INFO - __main__ - Step 31096: {'lr': 0.0004538178071488556, 'samples': 5970432, 'steps': 31095, 'loss/train': 0.9510982036590576} 11/07/2021 01:38:21 - INFO - __main__ - Step 31097: {'lr': 0.00045381473407718963, 'samples': 5970624, 'steps': 31096, 'loss/train': 2.8618736267089844} 11/07/2021 01:38:22 - INFO - __main__ - Step 31098: {'lr': 0.000453811660913688, 'samples': 5970816, 'steps': 31097, 'loss/train': 1.8852229118347168} 11/07/2021 01:38:22 - INFO - __main__ - Step 31099: {'lr': 0.000453808587658352, 'samples': 5971008, 'steps': 31098, 'loss/train': 1.3138790130615234} 11/07/2021 01:38:23 - INFO - __main__ - Step 31100: {'lr': 0.0004538055143111829, 'samples': 5971200, 'steps': 31099, 'loss/train': 1.3649941682815552} 11/07/2021 01:38:23 - INFO - __main__ - Step 31101: {'lr': 0.00045380244087218224, 'samples': 5971392, 'steps': 31100, 'loss/train': 1.3116509914398193} 11/07/2021 01:38:24 - INFO - __main__ - Step 31102: {'lr': 0.0004537993673413513, 'samples': 5971584, 'steps': 31101, 'loss/train': 1.72334885597229} 11/07/2021 01:38:24 - INFO - __main__ - Step 31103: {'lr': 0.0004537962937186916, 'samples': 5971776, 'steps': 31102, 'loss/train': 1.065061092376709} 11/07/2021 01:38:24 - INFO - __main__ - Step 31104: {'lr': 0.00045379322000420433, 'samples': 5971968, 'steps': 31103, 'loss/train': 1.550279140472412} 11/07/2021 01:38:25 - INFO - __main__ - Step 31105: {'lr': 0.00045379014619789106, 'samples': 5972160, 'steps': 31104, 'loss/train': 1.6130434274673462} 11/07/2021 01:38:26 - INFO - __main__ - Step 31106: {'lr': 0.00045378707229975303, 'samples': 5972352, 'steps': 31105, 'loss/train': 1.2381612062454224} 11/07/2021 01:38:26 - INFO - __main__ - Step 31107: {'lr': 0.0004537839983097917, 'samples': 5972544, 'steps': 31106, 'loss/train': 1.5743529796600342} 11/07/2021 01:38:26 - INFO - __main__ - Step 31108: {'lr': 0.0004537809242280085, 'samples': 5972736, 'steps': 31107, 'loss/train': 1.7774684429168701} 11/07/2021 01:38:27 - INFO - __main__ - Step 31109: {'lr': 0.0004537778500544047, 'samples': 5972928, 'steps': 31108, 'loss/train': 1.7018706798553467} 11/07/2021 01:38:28 - INFO - __main__ - Step 31110: {'lr': 0.0004537747757889817, 'samples': 5973120, 'steps': 31109, 'loss/train': 1.1860202550888062} 11/07/2021 01:38:28 - INFO - __main__ - Step 31111: {'lr': 0.0004537717014317411, 'samples': 5973312, 'steps': 31110, 'loss/train': 1.3271428346633911} 11/07/2021 01:38:28 - INFO - __main__ - Step 31112: {'lr': 0.00045376862698268393, 'samples': 5973504, 'steps': 31111, 'loss/train': 1.6374403238296509} 11/07/2021 01:38:29 - INFO - __main__ - Step 31113: {'lr': 0.0004537655524418119, 'samples': 5973696, 'steps': 31112, 'loss/train': 1.679964303970337} 11/07/2021 01:38:29 - INFO - __main__ - Step 31114: {'lr': 0.00045376247780912616, 'samples': 5973888, 'steps': 31113, 'loss/train': 1.6255829334259033} 11/07/2021 01:38:30 - INFO - __main__ - Step 31115: {'lr': 0.00045375940308462826, 'samples': 5974080, 'steps': 31114, 'loss/train': 1.6915082931518555} 11/07/2021 01:38:31 - INFO - __main__ - Step 31116: {'lr': 0.00045375632826831947, 'samples': 5974272, 'steps': 31115, 'loss/train': 1.3882372379302979} 11/07/2021 01:38:31 - INFO - __main__ - Step 31117: {'lr': 0.00045375325336020124, 'samples': 5974464, 'steps': 31116, 'loss/train': 1.4284358024597168} 11/07/2021 01:38:31 - INFO - __main__ - Step 31118: {'lr': 0.000453750178360275, 'samples': 5974656, 'steps': 31117, 'loss/train': 1.585280418395996} 11/07/2021 01:38:32 - INFO - __main__ - Step 31119: {'lr': 0.00045374710326854194, 'samples': 5974848, 'steps': 31118, 'loss/train': 1.8023872375488281} 11/07/2021 01:38:33 - INFO - __main__ - Step 31120: {'lr': 0.0004537440280850037, 'samples': 5975040, 'steps': 31119, 'loss/train': 1.4168965816497803} 11/07/2021 01:38:33 - INFO - __main__ - Step 31121: {'lr': 0.00045374095280966147, 'samples': 5975232, 'steps': 31120, 'loss/train': 2.086824417114258} 11/07/2021 01:38:33 - INFO - __main__ - Step 31122: {'lr': 0.00045373787744251677, 'samples': 5975424, 'steps': 31121, 'loss/train': 1.2819502353668213} 11/07/2021 01:38:34 - INFO - __main__ - Step 31123: {'lr': 0.0004537348019835709, 'samples': 5975616, 'steps': 31122, 'loss/train': 1.7239537239074707} 11/07/2021 01:38:34 - INFO - __main__ - Step 31124: {'lr': 0.0004537317264328252, 'samples': 5975808, 'steps': 31123, 'loss/train': 1.7349528074264526} 11/07/2021 01:38:35 - INFO - __main__ - Step 31125: {'lr': 0.00045372865079028123, 'samples': 5976000, 'steps': 31124, 'loss/train': 1.5164841413497925} 11/07/2021 01:38:36 - INFO - __main__ - Step 31126: {'lr': 0.00045372557505594024, 'samples': 5976192, 'steps': 31125, 'loss/train': 1.804694652557373} 11/07/2021 01:38:36 - INFO - __main__ - Step 31127: {'lr': 0.0004537224992298037, 'samples': 5976384, 'steps': 31126, 'loss/train': 1.4712467193603516} 11/07/2021 01:38:36 - INFO - __main__ - Step 31128: {'lr': 0.00045371942331187286, 'samples': 5976576, 'steps': 31127, 'loss/train': 1.8973110914230347} 11/07/2021 01:38:37 - INFO - __main__ - Step 31129: {'lr': 0.00045371634730214923, 'samples': 5976768, 'steps': 31128, 'loss/train': 1.736704707145691} 11/07/2021 01:38:37 - INFO - __main__ - Step 31130: {'lr': 0.00045371327120063417, 'samples': 5976960, 'steps': 31129, 'loss/train': 1.9374653100967407} 11/07/2021 01:38:38 - INFO - __main__ - Step 31131: {'lr': 0.00045371019500732904, 'samples': 5977152, 'steps': 31130, 'loss/train': 1.2832658290863037} 11/07/2021 01:38:39 - INFO - __main__ - Step 31132: {'lr': 0.00045370711872223525, 'samples': 5977344, 'steps': 31131, 'loss/train': 1.8077436685562134} 11/07/2021 01:38:39 - INFO - __main__ - Step 31133: {'lr': 0.00045370404234535414, 'samples': 5977536, 'steps': 31132, 'loss/train': 0.6687050461769104} 11/07/2021 01:38:40 - INFO - __main__ - Step 31134: {'lr': 0.00045370096587668714, 'samples': 5977728, 'steps': 31133, 'loss/train': 1.3561971187591553} 11/07/2021 01:38:40 - INFO - __main__ - Step 31135: {'lr': 0.0004536978893162357, 'samples': 5977920, 'steps': 31134, 'loss/train': 1.6032251119613647} 11/07/2021 01:38:40 - INFO - __main__ - Step 31136: {'lr': 0.000453694812664001, 'samples': 5978112, 'steps': 31135, 'loss/train': 1.2463250160217285} 11/07/2021 01:38:42 - INFO - __main__ - Step 31137: {'lr': 0.00045369173591998466, 'samples': 5978304, 'steps': 31136, 'loss/train': 1.5777970552444458} 11/07/2021 01:38:42 - INFO - __main__ - Step 31138: {'lr': 0.00045368865908418794, 'samples': 5978496, 'steps': 31137, 'loss/train': 1.6632001399993896} 11/07/2021 01:38:42 - INFO - __main__ - Step 31139: {'lr': 0.00045368558215661225, 'samples': 5978688, 'steps': 31138, 'loss/train': 1.4597536325454712} 11/07/2021 01:38:43 - INFO - __main__ - Step 31140: {'lr': 0.00045368250513725896, 'samples': 5978880, 'steps': 31139, 'loss/train': 5.873146057128906} 11/07/2021 01:38:43 - INFO - __main__ - Step 31141: {'lr': 0.00045367942802612953, 'samples': 5979072, 'steps': 31140, 'loss/train': 1.5452513694763184} 11/07/2021 01:38:43 - INFO - __main__ - Step 31142: {'lr': 0.0004536763508232252, 'samples': 5979264, 'steps': 31141, 'loss/train': 1.6367824077606201} 11/07/2021 01:38:44 - INFO - __main__ - Step 31143: {'lr': 0.0004536732735285476, 'samples': 5979456, 'steps': 31142, 'loss/train': 1.359701156616211} 11/07/2021 01:38:46 - INFO - __main__ - Step 31144: {'lr': 0.00045367019614209783, 'samples': 5979648, 'steps': 31143, 'loss/train': 1.3247382640838623} 11/07/2021 01:38:46 - INFO - __main__ - Step 31145: {'lr': 0.0004536671186638775, 'samples': 5979840, 'steps': 31144, 'loss/train': 1.2860476970672607} 11/07/2021 01:38:46 - INFO - __main__ - Step 31146: {'lr': 0.0004536640410938879, 'samples': 5980032, 'steps': 31145, 'loss/train': 0.5325157046318054} 11/07/2021 01:38:47 - INFO - __main__ - Step 31147: {'lr': 0.00045366096343213034, 'samples': 5980224, 'steps': 31146, 'loss/train': 1.368627667427063} 11/07/2021 01:38:47 - INFO - __main__ - Step 31148: {'lr': 0.0004536578856786064, 'samples': 5980416, 'steps': 31147, 'loss/train': 1.343773365020752} 11/07/2021 01:38:48 - INFO - __main__ - Step 31149: {'lr': 0.0004536548078333172, 'samples': 5980608, 'steps': 31148, 'loss/train': 1.6496331691741943} 11/07/2021 01:38:48 - INFO - __main__ - Step 31150: {'lr': 0.0004536517298962645, 'samples': 5980800, 'steps': 31149, 'loss/train': 2.274333953857422} 11/07/2021 01:38:49 - INFO - __main__ - Step 31151: {'lr': 0.00045364865186744936, 'samples': 5980992, 'steps': 31150, 'loss/train': 1.941991925239563} 11/07/2021 01:38:49 - INFO - __main__ - Step 31152: {'lr': 0.0004536455737468733, 'samples': 5981184, 'steps': 31151, 'loss/train': 1.501450538635254} 11/07/2021 01:38:49 - INFO - __main__ - Step 31153: {'lr': 0.00045364249553453764, 'samples': 5981376, 'steps': 31152, 'loss/train': 1.7700766324996948} 11/07/2021 01:38:50 - INFO - __main__ - Step 31154: {'lr': 0.00045363941723044386, 'samples': 5981568, 'steps': 31153, 'loss/train': 1.6942732334136963} 11/07/2021 01:38:51 - INFO - __main__ - Step 31155: {'lr': 0.0004536363388345933, 'samples': 5981760, 'steps': 31154, 'loss/train': 1.6029417514801025} 11/07/2021 01:38:51 - INFO - __main__ - Step 31156: {'lr': 0.0004536332603469873, 'samples': 5981952, 'steps': 31155, 'loss/train': 1.394454836845398} 11/07/2021 01:38:51 - INFO - __main__ - Step 31157: {'lr': 0.0004536301817676274, 'samples': 5982144, 'steps': 31156, 'loss/train': 1.6801949739456177} 11/07/2021 01:38:52 - INFO - __main__ - Step 31158: {'lr': 0.0004536271030965148, 'samples': 5982336, 'steps': 31157, 'loss/train': 1.5780308246612549} 11/07/2021 01:38:53 - INFO - __main__ - Step 31159: {'lr': 0.00045362402433365094, 'samples': 5982528, 'steps': 31158, 'loss/train': 1.2975704669952393} 11/07/2021 01:38:53 - INFO - __main__ - Step 31160: {'lr': 0.0004536209454790373, 'samples': 5982720, 'steps': 31159, 'loss/train': 1.505381464958191} 11/07/2021 01:38:53 - INFO - __main__ - Step 31161: {'lr': 0.00045361786653267517, 'samples': 5982912, 'steps': 31160, 'loss/train': 1.8381296396255493} 11/07/2021 01:38:54 - INFO - __main__ - Step 31162: {'lr': 0.00045361478749456595, 'samples': 5983104, 'steps': 31161, 'loss/train': 1.8032037019729614} 11/07/2021 01:38:54 - INFO - __main__ - Step 31163: {'lr': 0.0004536117083647111, 'samples': 5983296, 'steps': 31162, 'loss/train': 1.577376127243042} 11/07/2021 01:38:54 - INFO - __main__ - Step 31164: {'lr': 0.00045360862914311194, 'samples': 5983488, 'steps': 31163, 'loss/train': 1.6728628873825073} 11/07/2021 01:38:55 - INFO - __main__ - Step 31165: {'lr': 0.0004536055498297699, 'samples': 5983680, 'steps': 31164, 'loss/train': 1.7613489627838135} 11/07/2021 01:38:56 - INFO - __main__ - Step 31166: {'lr': 0.00045360247042468635, 'samples': 5983872, 'steps': 31165, 'loss/train': 1.4214516878128052} 11/07/2021 01:38:56 - INFO - __main__ - Step 31167: {'lr': 0.0004535993909278626, 'samples': 5984064, 'steps': 31166, 'loss/train': 1.4152767658233643} 11/07/2021 01:38:56 - INFO - __main__ - Step 31168: {'lr': 0.00045359631133930016, 'samples': 5984256, 'steps': 31167, 'loss/train': 1.7573238611221313} 11/07/2021 01:38:57 - INFO - __main__ - Step 31169: {'lr': 0.0004535932316590003, 'samples': 5984448, 'steps': 31168, 'loss/train': 1.2462897300720215} 11/07/2021 01:38:58 - INFO - __main__ - Step 31170: {'lr': 0.00045359015188696457, 'samples': 5984640, 'steps': 31169, 'loss/train': 1.7859525680541992} 11/07/2021 01:38:58 - INFO - __main__ - Step 31171: {'lr': 0.00045358707202319414, 'samples': 5984832, 'steps': 31170, 'loss/train': 1.3356480598449707} 11/07/2021 01:38:59 - INFO - __main__ - Step 31172: {'lr': 0.0004535839920676906, 'samples': 5985024, 'steps': 31171, 'loss/train': 1.5641802549362183} 11/07/2021 01:38:59 - INFO - __main__ - Step 31173: {'lr': 0.0004535809120204553, 'samples': 5985216, 'steps': 31172, 'loss/train': 2.0797674655914307} 11/07/2021 01:38:59 - INFO - __main__ - Step 31174: {'lr': 0.0004535778318814895, 'samples': 5985408, 'steps': 31173, 'loss/train': 1.4527065753936768} 11/07/2021 01:39:01 - INFO - __main__ - Step 31175: {'lr': 0.0004535747516507947, 'samples': 5985600, 'steps': 31174, 'loss/train': 1.1287404298782349} 11/07/2021 01:39:01 - INFO - __main__ - Step 31176: {'lr': 0.00045357167132837223, 'samples': 5985792, 'steps': 31175, 'loss/train': 1.2972478866577148} 11/07/2021 01:39:01 - INFO - __main__ - Step 31177: {'lr': 0.00045356859091422354, 'samples': 5985984, 'steps': 31176, 'loss/train': 1.5784897804260254} 11/07/2021 01:39:02 - INFO - __main__ - Step 31178: {'lr': 0.00045356551040835, 'samples': 5986176, 'steps': 31177, 'loss/train': 1.3720043897628784} 11/07/2021 01:39:02 - INFO - __main__ - Step 31179: {'lr': 0.0004535624298107529, 'samples': 5986368, 'steps': 31178, 'loss/train': 1.567328929901123} 11/07/2021 01:39:03 - INFO - __main__ - Step 31180: {'lr': 0.00045355934912143383, 'samples': 5986560, 'steps': 31179, 'loss/train': 1.5847002267837524} 11/07/2021 01:39:03 - INFO - __main__ - Step 31181: {'lr': 0.00045355626834039394, 'samples': 5986752, 'steps': 31180, 'loss/train': 1.4634809494018555} 11/07/2021 01:39:04 - INFO - __main__ - Step 31182: {'lr': 0.00045355318746763477, 'samples': 5986944, 'steps': 31181, 'loss/train': 0.7799688577651978} 11/07/2021 01:39:04 - INFO - __main__ - Step 31183: {'lr': 0.0004535501065031577, 'samples': 5987136, 'steps': 31182, 'loss/train': 1.4410059452056885} 11/07/2021 01:39:04 - INFO - __main__ - Step 31184: {'lr': 0.0004535470254469641, 'samples': 5987328, 'steps': 31183, 'loss/train': 1.0951820611953735} 11/07/2021 01:39:05 - INFO - __main__ - Step 31185: {'lr': 0.00045354394429905534, 'samples': 5987520, 'steps': 31184, 'loss/train': 1.872873306274414} 11/07/2021 01:39:06 - INFO - __main__ - Step 31186: {'lr': 0.0004535408630594328, 'samples': 5987712, 'steps': 31185, 'loss/train': 1.5808696746826172} 11/07/2021 01:39:06 - INFO - __main__ - Step 31187: {'lr': 0.0004535377817280979, 'samples': 5987904, 'steps': 31186, 'loss/train': 1.2969399690628052} 11/07/2021 01:39:06 - INFO - __main__ - Step 31188: {'lr': 0.0004535347003050521, 'samples': 5988096, 'steps': 31187, 'loss/train': 1.254632592201233} 11/07/2021 01:39:07 - INFO - __main__ - Step 31189: {'lr': 0.0004535316187902966, 'samples': 5988288, 'steps': 31188, 'loss/train': 1.425153136253357} 11/07/2021 01:39:07 - INFO - __main__ - Step 31190: {'lr': 0.00045352853718383287, 'samples': 5988480, 'steps': 31189, 'loss/train': 1.2327990531921387} 11/07/2021 01:39:08 - INFO - __main__ - Step 31191: {'lr': 0.00045352545548566235, 'samples': 5988672, 'steps': 31190, 'loss/train': 1.3700438737869263} 11/07/2021 01:39:08 - INFO - __main__ - Step 31192: {'lr': 0.00045352237369578643, 'samples': 5988864, 'steps': 31191, 'loss/train': 1.604499340057373} 11/07/2021 01:39:09 - INFO - __main__ - Step 31193: {'lr': 0.00045351929181420647, 'samples': 5989056, 'steps': 31192, 'loss/train': 1.962519645690918} 11/07/2021 01:39:09 - INFO - __main__ - Step 31194: {'lr': 0.0004535162098409238, 'samples': 5989248, 'steps': 31193, 'loss/train': 2.144946575164795} 11/07/2021 01:39:09 - INFO - __main__ - Step 31195: {'lr': 0.00045351312777593995, 'samples': 5989440, 'steps': 31194, 'loss/train': 1.7803518772125244} 11/07/2021 01:39:11 - INFO - __main__ - Step 31196: {'lr': 0.0004535100456192562, 'samples': 5989632, 'steps': 31195, 'loss/train': 1.4107824563980103} 11/07/2021 01:39:11 - INFO - __main__ - Step 31197: {'lr': 0.00045350696337087396, 'samples': 5989824, 'steps': 31196, 'loss/train': 1.4641228914260864} 11/07/2021 01:39:11 - INFO - __main__ - Step 31198: {'lr': 0.0004535038810307946, 'samples': 5990016, 'steps': 31197, 'loss/train': 0.8417909741401672} 11/07/2021 01:39:12 - INFO - __main__ - Step 31199: {'lr': 0.00045350079859901956, 'samples': 5990208, 'steps': 31198, 'loss/train': 1.6086115837097168} 11/07/2021 01:39:12 - INFO - __main__ - Step 31200: {'lr': 0.00045349771607555017, 'samples': 5990400, 'steps': 31199, 'loss/train': 1.4114199876785278} 11/07/2021 01:39:13 - INFO - __main__ - Step 31201: {'lr': 0.0004534946334603879, 'samples': 5990592, 'steps': 31200, 'loss/train': 1.380301833152771} 11/07/2021 01:39:13 - INFO - __main__ - Step 31202: {'lr': 0.000453491550753534, 'samples': 5990784, 'steps': 31201, 'loss/train': 1.8243534564971924} 11/07/2021 01:39:14 - INFO - __main__ - Step 31203: {'lr': 0.00045348846795499, 'samples': 5990976, 'steps': 31202, 'loss/train': 1.5747188329696655} 11/07/2021 01:39:14 - INFO - __main__ - Step 31204: {'lr': 0.0004534853850647572, 'samples': 5991168, 'steps': 31203, 'loss/train': 0.8451942205429077} 11/07/2021 01:39:14 - INFO - __main__ - Step 31205: {'lr': 0.00045348230208283716, 'samples': 5991360, 'steps': 31204, 'loss/train': 1.8767951726913452} 11/07/2021 01:39:15 - INFO - __main__ - Step 31206: {'lr': 0.000453479219009231, 'samples': 5991552, 'steps': 31205, 'loss/train': 1.5493534803390503} 11/07/2021 01:39:16 - INFO - __main__ - Step 31207: {'lr': 0.00045347613584394034, 'samples': 5991744, 'steps': 31206, 'loss/train': 1.2486952543258667} 11/07/2021 01:39:16 - INFO - __main__ - Step 31208: {'lr': 0.0004534730525869664, 'samples': 5991936, 'steps': 31207, 'loss/train': 0.9741658568382263} 11/07/2021 01:39:17 - INFO - __main__ - Step 31209: {'lr': 0.0004534699692383106, 'samples': 5992128, 'steps': 31208, 'loss/train': 2.1556811332702637} 11/07/2021 01:39:17 - INFO - __main__ - Step 31210: {'lr': 0.00045346688579797444, 'samples': 5992320, 'steps': 31209, 'loss/train': 1.6638199090957642} 11/07/2021 01:39:17 - INFO - __main__ - Step 31211: {'lr': 0.0004534638022659592, 'samples': 5992512, 'steps': 31210, 'loss/train': 2.4656479358673096} 11/07/2021 01:39:18 - INFO - __main__ - Step 31212: {'lr': 0.00045346071864226634, 'samples': 5992704, 'steps': 31211, 'loss/train': 1.1677452325820923} 11/07/2021 01:39:19 - INFO - __main__ - Step 31213: {'lr': 0.0004534576349268973, 'samples': 5992896, 'steps': 31212, 'loss/train': 1.6039478778839111} 11/07/2021 01:39:19 - INFO - __main__ - Step 31214: {'lr': 0.00045345455111985326, 'samples': 5993088, 'steps': 31213, 'loss/train': 2.0135111808776855} 11/07/2021 01:39:19 - INFO - __main__ - Step 31215: {'lr': 0.0004534514672211358, 'samples': 5993280, 'steps': 31214, 'loss/train': 1.1419930458068848} 11/07/2021 01:39:20 - INFO - __main__ - Step 31216: {'lr': 0.0004534483832307462, 'samples': 5993472, 'steps': 31215, 'loss/train': 1.2015453577041626} 11/07/2021 01:39:21 - INFO - __main__ - Step 31217: {'lr': 0.00045344529914868593, 'samples': 5993664, 'steps': 31216, 'loss/train': 1.2807214260101318} 11/07/2021 01:39:21 - INFO - __main__ - Step 31218: {'lr': 0.0004534422149749564, 'samples': 5993856, 'steps': 31217, 'loss/train': 1.241632342338562} 11/07/2021 01:39:22 - INFO - __main__ - Step 31219: {'lr': 0.0004534391307095589, 'samples': 5994048, 'steps': 31218, 'loss/train': 1.4084670543670654} 11/07/2021 01:39:22 - INFO - __main__ - Step 31220: {'lr': 0.0004534360463524948, 'samples': 5994240, 'steps': 31219, 'loss/train': 1.3927873373031616} 11/07/2021 01:39:22 - INFO - __main__ - Step 31221: {'lr': 0.00045343296190376566, 'samples': 5994432, 'steps': 31220, 'loss/train': 1.400227665901184} 11/07/2021 01:39:23 - INFO - __main__ - Step 31222: {'lr': 0.0004534298773633727, 'samples': 5994624, 'steps': 31221, 'loss/train': 1.4523824453353882} 11/07/2021 01:39:24 - INFO - __main__ - Step 31223: {'lr': 0.00045342679273131743, 'samples': 5994816, 'steps': 31222, 'loss/train': 1.671177864074707} 11/07/2021 01:39:24 - INFO - __main__ - Step 31224: {'lr': 0.0004534237080076011, 'samples': 5995008, 'steps': 31223, 'loss/train': 1.8985406160354614} 11/07/2021 01:39:24 - INFO - __main__ - Step 31225: {'lr': 0.0004534206231922253, 'samples': 5995200, 'steps': 31224, 'loss/train': 1.1981773376464844} 11/07/2021 01:39:25 - INFO - __main__ - Step 31226: {'lr': 0.0004534175382851913, 'samples': 5995392, 'steps': 31225, 'loss/train': 1.4175541400909424} 11/07/2021 01:39:26 - INFO - __main__ - Step 31227: {'lr': 0.0004534144532865004, 'samples': 5995584, 'steps': 31226, 'loss/train': 1.29689621925354} 11/07/2021 01:39:26 - INFO - __main__ - Step 31228: {'lr': 0.00045341136819615415, 'samples': 5995776, 'steps': 31227, 'loss/train': 1.5480940341949463} 11/07/2021 01:39:26 - INFO - __main__ - Step 31229: {'lr': 0.0004534082830141538, 'samples': 5995968, 'steps': 31228, 'loss/train': 0.43363645672798157} 11/07/2021 01:39:27 - INFO - __main__ - Step 31230: {'lr': 0.00045340519774050093, 'samples': 5996160, 'steps': 31229, 'loss/train': 1.1463749408721924} 11/07/2021 01:39:27 - INFO - __main__ - Step 31231: {'lr': 0.0004534021123751968, 'samples': 5996352, 'steps': 31230, 'loss/train': 1.552856206893921} 11/07/2021 01:39:29 - INFO - __main__ - Step 31232: {'lr': 0.00045339902691824275, 'samples': 5996544, 'steps': 31231, 'loss/train': 1.4515273571014404} 11/07/2021 01:39:29 - INFO - __main__ - Step 31233: {'lr': 0.0004533959413696402, 'samples': 5996736, 'steps': 31232, 'loss/train': 1.795810341835022} 11/07/2021 01:39:29 - INFO - __main__ - Step 31234: {'lr': 0.0004533928557293907, 'samples': 5996928, 'steps': 31233, 'loss/train': 1.6251254081726074} 11/07/2021 01:39:30 - INFO - __main__ - Step 31235: {'lr': 0.00045338976999749546, 'samples': 5997120, 'steps': 31234, 'loss/train': 0.5078571438789368} 11/07/2021 01:39:30 - INFO - __main__ - Step 31236: {'lr': 0.00045338668417395595, 'samples': 5997312, 'steps': 31235, 'loss/train': 0.497263103723526} 11/07/2021 01:39:30 - INFO - __main__ - Step 31237: {'lr': 0.0004533835982587735, 'samples': 5997504, 'steps': 31236, 'loss/train': 1.6334309577941895} 11/07/2021 01:39:32 - INFO - __main__ - Step 31238: {'lr': 0.00045338051225194954, 'samples': 5997696, 'steps': 31237, 'loss/train': 1.2597155570983887} 11/07/2021 01:39:32 - INFO - __main__ - Step 31239: {'lr': 0.0004533774261534855, 'samples': 5997888, 'steps': 31238, 'loss/train': 1.2250351905822754} 11/07/2021 01:39:32 - INFO - __main__ - Step 31240: {'lr': 0.00045337433996338274, 'samples': 5998080, 'steps': 31239, 'loss/train': 1.5766150951385498} 11/07/2021 01:39:33 - INFO - __main__ - Step 31241: {'lr': 0.0004533712536816426, 'samples': 5998272, 'steps': 31240, 'loss/train': 1.227736234664917} 11/07/2021 01:39:33 - INFO - __main__ - Step 31242: {'lr': 0.0004533681673082665, 'samples': 5998464, 'steps': 31241, 'loss/train': 1.553439974784851} 11/07/2021 01:39:34 - INFO - __main__ - Step 31243: {'lr': 0.00045336508084325587, 'samples': 5998656, 'steps': 31242, 'loss/train': 1.0459153652191162} 11/07/2021 01:39:34 - INFO - __main__ - Step 31244: {'lr': 0.0004533619942866121, 'samples': 5998848, 'steps': 31243, 'loss/train': 1.800862193107605} 11/07/2021 01:39:35 - INFO - __main__ - Step 31245: {'lr': 0.00045335890763833646, 'samples': 5999040, 'steps': 31244, 'loss/train': 2.0331690311431885} 11/07/2021 01:39:35 - INFO - __main__ - Step 31246: {'lr': 0.0004533558208984305, 'samples': 5999232, 'steps': 31245, 'loss/train': 1.387972354888916} 11/07/2021 01:39:35 - INFO - __main__ - Step 31247: {'lr': 0.0004533527340668956, 'samples': 5999424, 'steps': 31246, 'loss/train': 1.293097972869873} 11/07/2021 01:39:36 - INFO - __main__ - Step 31248: {'lr': 0.000453349647143733, 'samples': 5999616, 'steps': 31247, 'loss/train': 0.2818108797073364} 11/07/2021 01:39:37 - INFO - __main__ - Step 31249: {'lr': 0.00045334656012894424, 'samples': 5999808, 'steps': 31248, 'loss/train': 1.0691889524459839} 11/07/2021 01:39:37 - INFO - __main__ - Step 31250: {'lr': 0.00045334347302253064, 'samples': 6000000, 'steps': 31249, 'loss/train': 1.497523307800293} 11/07/2021 01:39:37 - INFO - __main__ - Step 31251: {'lr': 0.00045334038582449355, 'samples': 6000192, 'steps': 31250, 'loss/train': 1.2824853658676147} 11/07/2021 01:39:38 - INFO - __main__ - Step 31252: {'lr': 0.0004533372985348345, 'samples': 6000384, 'steps': 31251, 'loss/train': 1.5305395126342773} 11/07/2021 01:39:39 - INFO - __main__ - Step 31253: {'lr': 0.00045333421115355477, 'samples': 6000576, 'steps': 31252, 'loss/train': 1.006943702697754} 11/07/2021 01:39:39 - INFO - __main__ - Step 31254: {'lr': 0.00045333112368065585, 'samples': 6000768, 'steps': 31253, 'loss/train': 1.5808206796646118} 11/07/2021 01:39:40 - INFO - __main__ - Step 31255: {'lr': 0.00045332803611613896, 'samples': 6000960, 'steps': 31254, 'loss/train': 1.7644098997116089} 11/07/2021 01:39:40 - INFO - __main__ - Step 31256: {'lr': 0.00045332494846000564, 'samples': 6001152, 'steps': 31255, 'loss/train': 1.4161990880966187} 11/07/2021 01:39:40 - INFO - __main__ - Step 31257: {'lr': 0.00045332186071225724, 'samples': 6001344, 'steps': 31256, 'loss/train': 1.2591632604599} 11/07/2021 01:39:41 - INFO - __main__ - Step 31258: {'lr': 0.00045331877287289516, 'samples': 6001536, 'steps': 31257, 'loss/train': 1.2635985612869263} 11/07/2021 01:39:42 - INFO - __main__ - Step 31259: {'lr': 0.00045331568494192076, 'samples': 6001728, 'steps': 31258, 'loss/train': 1.3842456340789795} 11/07/2021 01:39:42 - INFO - __main__ - Step 31260: {'lr': 0.00045331259691933545, 'samples': 6001920, 'steps': 31259, 'loss/train': 1.3079363107681274} 11/07/2021 01:39:42 - INFO - __main__ - Step 31261: {'lr': 0.00045330950880514065, 'samples': 6002112, 'steps': 31260, 'loss/train': 1.3240946531295776} 11/07/2021 01:39:43 - INFO - __main__ - Step 31262: {'lr': 0.0004533064205993377, 'samples': 6002304, 'steps': 31261, 'loss/train': 1.4610227346420288} 11/07/2021 01:39:43 - INFO - __main__ - Step 31263: {'lr': 0.000453303332301928, 'samples': 6002496, 'steps': 31262, 'loss/train': 1.0992581844329834} 11/07/2021 01:39:44 - INFO - __main__ - Step 31264: {'lr': 0.00045330024391291294, 'samples': 6002688, 'steps': 31263, 'loss/train': 1.1758882999420166} 11/07/2021 01:39:44 - INFO - __main__ - Step 31265: {'lr': 0.00045329715543229396, 'samples': 6002880, 'steps': 31264, 'loss/train': 1.224266767501831} 11/07/2021 01:39:45 - INFO - __main__ - Step 31266: {'lr': 0.0004532940668600724, 'samples': 6003072, 'steps': 31265, 'loss/train': 0.8703139424324036} 11/07/2021 01:39:45 - INFO - __main__ - Step 31267: {'lr': 0.00045329097819624966, 'samples': 6003264, 'steps': 31266, 'loss/train': 1.3362345695495605} 11/07/2021 01:39:45 - INFO - __main__ - Step 31268: {'lr': 0.00045328788944082717, 'samples': 6003456, 'steps': 31267, 'loss/train': 1.6372350454330444} 11/07/2021 01:39:46 - INFO - __main__ - Step 31269: {'lr': 0.0004532848005938063, 'samples': 6003648, 'steps': 31268, 'loss/train': 1.5215959548950195} 11/07/2021 01:39:47 - INFO - __main__ - Step 31270: {'lr': 0.0004532817116551884, 'samples': 6003840, 'steps': 31269, 'loss/train': 1.169777512550354} 11/07/2021 01:39:47 - INFO - __main__ - Step 31271: {'lr': 0.00045327862262497495, 'samples': 6004032, 'steps': 31270, 'loss/train': 1.9799604415893555} 11/07/2021 01:39:47 - INFO - __main__ - Step 31272: {'lr': 0.00045327553350316726, 'samples': 6004224, 'steps': 31271, 'loss/train': 1.4954971075057983} 11/07/2021 01:39:48 - INFO - __main__ - Step 31273: {'lr': 0.00045327244428976677, 'samples': 6004416, 'steps': 31272, 'loss/train': 1.5546213388442993} 11/07/2021 01:39:49 - INFO - __main__ - Step 31274: {'lr': 0.00045326935498477477, 'samples': 6004608, 'steps': 31273, 'loss/train': 1.14182448387146} 11/07/2021 01:39:49 - INFO - __main__ - Step 31275: {'lr': 0.00045326626558819284, 'samples': 6004800, 'steps': 31274, 'loss/train': 1.727343201637268} 11/07/2021 01:39:49 - INFO - __main__ - Step 31276: {'lr': 0.00045326317610002223, 'samples': 6004992, 'steps': 31275, 'loss/train': 1.546411395072937} 11/07/2021 01:39:50 - INFO - __main__ - Step 31277: {'lr': 0.00045326008652026435, 'samples': 6005184, 'steps': 31276, 'loss/train': 1.3291178941726685} 11/07/2021 01:39:50 - INFO - __main__ - Step 31278: {'lr': 0.00045325699684892065, 'samples': 6005376, 'steps': 31277, 'loss/train': 1.491533875465393} 11/07/2021 01:39:51 - INFO - __main__ - Step 31279: {'lr': 0.00045325390708599245, 'samples': 6005568, 'steps': 31278, 'loss/train': 1.7529364824295044} 11/07/2021 01:39:52 - INFO - __main__ - Step 31280: {'lr': 0.0004532508172314812, 'samples': 6005760, 'steps': 31279, 'loss/train': 2.064553737640381} 11/07/2021 01:39:52 - INFO - __main__ - Step 31281: {'lr': 0.0004532477272853882, 'samples': 6005952, 'steps': 31280, 'loss/train': 1.5577925443649292} 11/07/2021 01:39:52 - INFO - __main__ - Step 31282: {'lr': 0.000453244637247715, 'samples': 6006144, 'steps': 31281, 'loss/train': 0.903304934501648} 11/07/2021 01:39:53 - INFO - __main__ - Step 31283: {'lr': 0.0004532415471184629, 'samples': 6006336, 'steps': 31282, 'loss/train': 1.596295952796936} 11/07/2021 01:39:54 - INFO - __main__ - Step 31284: {'lr': 0.0004532384568976332, 'samples': 6006528, 'steps': 31283, 'loss/train': 1.7317078113555908} 11/07/2021 01:39:54 - INFO - __main__ - Step 31285: {'lr': 0.00045323536658522747, 'samples': 6006720, 'steps': 31284, 'loss/train': 1.4143906831741333} 11/07/2021 01:39:55 - INFO - __main__ - Step 31286: {'lr': 0.00045323227618124695, 'samples': 6006912, 'steps': 31285, 'loss/train': 1.4001790285110474} 11/07/2021 01:39:55 - INFO - __main__ - Step 31287: {'lr': 0.00045322918568569315, 'samples': 6007104, 'steps': 31286, 'loss/train': 1.453641414642334} 11/07/2021 01:39:55 - INFO - __main__ - Step 31288: {'lr': 0.0004532260950985675, 'samples': 6007296, 'steps': 31287, 'loss/train': 1.6401914358139038} 11/07/2021 01:39:56 - INFO - __main__ - Step 31289: {'lr': 0.0004532230044198712, 'samples': 6007488, 'steps': 31288, 'loss/train': 1.590362787246704} 11/07/2021 01:39:57 - INFO - __main__ - Step 31290: {'lr': 0.00045321991364960577, 'samples': 6007680, 'steps': 31289, 'loss/train': 1.0576629638671875} 11/07/2021 01:39:57 - INFO - __main__ - Step 31291: {'lr': 0.00045321682278777253, 'samples': 6007872, 'steps': 31290, 'loss/train': 1.9191278219223022} 11/07/2021 01:39:57 - INFO - __main__ - Step 31292: {'lr': 0.00045321373183437305, 'samples': 6008064, 'steps': 31291, 'loss/train': 1.0246641635894775} 11/07/2021 01:39:58 - INFO - __main__ - Step 31293: {'lr': 0.0004532106407894085, 'samples': 6008256, 'steps': 31292, 'loss/train': 2.011497974395752} 11/07/2021 01:39:59 - INFO - __main__ - Step 31294: {'lr': 0.0004532075496528804, 'samples': 6008448, 'steps': 31293, 'loss/train': 1.4250741004943848} 11/07/2021 01:39:59 - INFO - __main__ - Step 31295: {'lr': 0.0004532044584247901, 'samples': 6008640, 'steps': 31294, 'loss/train': 1.4246947765350342} 11/07/2021 01:39:59 - INFO - __main__ - Step 31296: {'lr': 0.00045320136710513907, 'samples': 6008832, 'steps': 31295, 'loss/train': 1.3835338354110718} 11/07/2021 01:40:00 - INFO - __main__ - Step 31297: {'lr': 0.00045319827569392855, 'samples': 6009024, 'steps': 31296, 'loss/train': 1.537290096282959} 11/07/2021 01:40:00 - INFO - __main__ - Step 31298: {'lr': 0.00045319518419116014, 'samples': 6009216, 'steps': 31297, 'loss/train': 1.195200800895691} 11/07/2021 01:40:00 - INFO - __main__ - Step 31299: {'lr': 0.00045319209259683503, 'samples': 6009408, 'steps': 31298, 'loss/train': 1.696137547492981} 11/07/2021 01:40:02 - INFO - __main__ - Step 31300: {'lr': 0.0004531890009109547, 'samples': 6009600, 'steps': 31299, 'loss/train': 1.2820237874984741} 11/07/2021 01:40:02 - INFO - __main__ - Step 31301: {'lr': 0.0004531859091335205, 'samples': 6009792, 'steps': 31300, 'loss/train': 1.8485389947891235} 11/07/2021 01:40:02 - INFO - __main__ - Step 31302: {'lr': 0.00045318281726453393, 'samples': 6009984, 'steps': 31301, 'loss/train': 2.0493111610412598} 11/07/2021 01:40:03 - INFO - __main__ - Step 31303: {'lr': 0.00045317972530399634, 'samples': 6010176, 'steps': 31302, 'loss/train': 1.545304536819458} 11/07/2021 01:40:03 - INFO - __main__ - Step 31304: {'lr': 0.00045317663325190904, 'samples': 6010368, 'steps': 31303, 'loss/train': 1.0387685298919678} 11/07/2021 01:40:04 - INFO - __main__ - Step 31305: {'lr': 0.00045317354110827344, 'samples': 6010560, 'steps': 31304, 'loss/train': 1.1423134803771973} 11/07/2021 01:40:05 - INFO - __main__ - Step 31306: {'lr': 0.0004531704488730911, 'samples': 6010752, 'steps': 31305, 'loss/train': 1.5734665393829346} 11/07/2021 01:40:05 - INFO - __main__ - Step 31307: {'lr': 0.0004531673565463632, 'samples': 6010944, 'steps': 31306, 'loss/train': 1.2878657579421997} 11/07/2021 01:40:05 - INFO - __main__ - Step 31308: {'lr': 0.0004531642641280913, 'samples': 6011136, 'steps': 31307, 'loss/train': 1.762477159500122} 11/07/2021 01:40:06 - INFO - __main__ - Step 31309: {'lr': 0.0004531611716182767, 'samples': 6011328, 'steps': 31308, 'loss/train': 1.5436822175979614} 11/07/2021 01:40:07 - INFO - __main__ - Step 31310: {'lr': 0.0004531580790169207, 'samples': 6011520, 'steps': 31309, 'loss/train': 1.662217140197754} 11/07/2021 01:40:07 - INFO - __main__ - Step 31311: {'lr': 0.00045315498632402494, 'samples': 6011712, 'steps': 31310, 'loss/train': 1.181226372718811} 11/07/2021 01:40:07 - INFO - __main__ - Step 31312: {'lr': 0.0004531518935395906, 'samples': 6011904, 'steps': 31311, 'loss/train': 1.003279685974121} 11/07/2021 01:40:08 - INFO - __main__ - Step 31313: {'lr': 0.00045314880066361923, 'samples': 6012096, 'steps': 31312, 'loss/train': 1.1623375415802002} 11/07/2021 01:40:08 - INFO - __main__ - Step 31314: {'lr': 0.00045314570769611207, 'samples': 6012288, 'steps': 31313, 'loss/train': 3.8758158683776855} 11/07/2021 01:40:09 - INFO - __main__ - Step 31315: {'lr': 0.00045314261463707064, 'samples': 6012480, 'steps': 31314, 'loss/train': 0.8304257392883301} 11/07/2021 01:40:10 - INFO - __main__ - Step 31316: {'lr': 0.00045313952148649626, 'samples': 6012672, 'steps': 31315, 'loss/train': 1.7472083568572998} 11/07/2021 01:40:10 - INFO - __main__ - Step 31317: {'lr': 0.0004531364282443904, 'samples': 6012864, 'steps': 31316, 'loss/train': 1.6921648979187012} 11/07/2021 01:40:10 - INFO - __main__ - Step 31318: {'lr': 0.00045313333491075433, 'samples': 6013056, 'steps': 31317, 'loss/train': 1.840403437614441} 11/07/2021 01:40:11 - INFO - __main__ - Step 31319: {'lr': 0.0004531302414855895, 'samples': 6013248, 'steps': 31318, 'loss/train': 1.498241662979126} 11/07/2021 01:40:11 - INFO - __main__ - Step 31320: {'lr': 0.0004531271479688974, 'samples': 6013440, 'steps': 31319, 'loss/train': 1.4011257886886597} 11/07/2021 01:40:12 - INFO - __main__ - Step 31321: {'lr': 0.00045312405436067927, 'samples': 6013632, 'steps': 31320, 'loss/train': 1.671433687210083} 11/07/2021 01:40:12 - INFO - __main__ - Step 31322: {'lr': 0.00045312096066093654, 'samples': 6013824, 'steps': 31321, 'loss/train': 1.430420994758606} 11/07/2021 01:40:13 - INFO - __main__ - Step 31323: {'lr': 0.0004531178668696707, 'samples': 6014016, 'steps': 31322, 'loss/train': 1.492526650428772} 11/07/2021 01:40:13 - INFO - __main__ - Step 31324: {'lr': 0.00045311477298688306, 'samples': 6014208, 'steps': 31323, 'loss/train': 1.6847875118255615} 11/07/2021 01:40:14 - INFO - __main__ - Step 31325: {'lr': 0.0004531116790125751, 'samples': 6014400, 'steps': 31324, 'loss/train': 1.22118079662323} 11/07/2021 01:40:15 - INFO - __main__ - Step 31326: {'lr': 0.00045310858494674813, 'samples': 6014592, 'steps': 31325, 'loss/train': 1.5117628574371338} 11/07/2021 01:40:15 - INFO - __main__ - Step 31327: {'lr': 0.00045310549078940356, 'samples': 6014784, 'steps': 31326, 'loss/train': 1.5198588371276855} 11/07/2021 01:40:15 - INFO - __main__ - Step 31328: {'lr': 0.00045310239654054274, 'samples': 6014976, 'steps': 31327, 'loss/train': 1.6896679401397705} 11/07/2021 01:40:16 - INFO - __main__ - Step 31329: {'lr': 0.0004530993022001672, 'samples': 6015168, 'steps': 31328, 'loss/train': 2.6464505195617676} 11/07/2021 01:40:16 - INFO - __main__ - Step 31330: {'lr': 0.00045309620776827817, 'samples': 6015360, 'steps': 31329, 'loss/train': 1.5829106569290161} 11/07/2021 01:40:17 - INFO - __main__ - Step 31331: {'lr': 0.00045309311324487713, 'samples': 6015552, 'steps': 31330, 'loss/train': 1.3637856245040894} 11/07/2021 01:40:17 - INFO - __main__ - Step 31332: {'lr': 0.0004530900186299655, 'samples': 6015744, 'steps': 31331, 'loss/train': 1.5064327716827393} 11/07/2021 01:40:18 - INFO - __main__ - Step 31333: {'lr': 0.0004530869239235446, 'samples': 6015936, 'steps': 31332, 'loss/train': 1.8033262491226196} 11/07/2021 01:40:18 - INFO - __main__ - Step 31334: {'lr': 0.0004530838291256159, 'samples': 6016128, 'steps': 31333, 'loss/train': 1.8291977643966675} 11/07/2021 01:40:18 - INFO - __main__ - Step 31335: {'lr': 0.0004530807342361807, 'samples': 6016320, 'steps': 31334, 'loss/train': 2.119213581085205} 11/07/2021 01:40:19 - INFO - __main__ - Step 31336: {'lr': 0.0004530776392552406, 'samples': 6016512, 'steps': 31335, 'loss/train': 1.0838651657104492} 11/07/2021 01:40:20 - INFO - __main__ - Step 31337: {'lr': 0.0004530745441827967, 'samples': 6016704, 'steps': 31336, 'loss/train': 1.8087306022644043} 11/07/2021 01:40:20 - INFO - __main__ - Step 31338: {'lr': 0.0004530714490188506, 'samples': 6016896, 'steps': 31337, 'loss/train': 1.5076483488082886} 11/07/2021 01:40:21 - INFO - __main__ - Step 31339: {'lr': 0.00045306835376340366, 'samples': 6017088, 'steps': 31338, 'loss/train': 1.6746957302093506} 11/07/2021 01:40:21 - INFO - __main__ - Step 31340: {'lr': 0.00045306525841645723, 'samples': 6017280, 'steps': 31339, 'loss/train': 1.0600357055664062} 11/07/2021 01:40:21 - INFO - __main__ - Step 31341: {'lr': 0.0004530621629780127, 'samples': 6017472, 'steps': 31340, 'loss/train': 1.423075795173645} 11/07/2021 01:40:22 - INFO - __main__ - Step 31342: {'lr': 0.00045305906744807156, 'samples': 6017664, 'steps': 31341, 'loss/train': 1.7189316749572754} 11/07/2021 01:40:23 - INFO - __main__ - Step 31343: {'lr': 0.0004530559718266351, 'samples': 6017856, 'steps': 31342, 'loss/train': 1.5785162448883057} 11/07/2021 01:40:23 - INFO - __main__ - Step 31344: {'lr': 0.0004530528761137047, 'samples': 6018048, 'steps': 31343, 'loss/train': 1.2819000482559204} 11/07/2021 01:40:23 - INFO - __main__ - Step 31345: {'lr': 0.0004530497803092819, 'samples': 6018240, 'steps': 31344, 'loss/train': 1.6384159326553345} 11/07/2021 01:40:24 - INFO - __main__ - Step 31346: {'lr': 0.000453046684413368, 'samples': 6018432, 'steps': 31345, 'loss/train': 1.8902775049209595} 11/07/2021 01:40:25 - INFO - __main__ - Step 31347: {'lr': 0.0004530435884259644, 'samples': 6018624, 'steps': 31346, 'loss/train': 1.60947847366333} 11/07/2021 01:40:25 - INFO - __main__ - Step 31348: {'lr': 0.0004530404923470724, 'samples': 6018816, 'steps': 31347, 'loss/train': 1.114456295967102} 11/07/2021 01:40:25 - INFO - __main__ - Step 31349: {'lr': 0.0004530373961766935, 'samples': 6019008, 'steps': 31348, 'loss/train': 1.5226140022277832} 11/07/2021 01:40:26 - INFO - __main__ - Step 31350: {'lr': 0.00045303429991482914, 'samples': 6019200, 'steps': 31349, 'loss/train': 1.4229016304016113} 11/07/2021 01:40:26 - INFO - __main__ - Step 31351: {'lr': 0.00045303120356148067, 'samples': 6019392, 'steps': 31350, 'loss/train': 1.539341688156128} 11/07/2021 01:40:27 - INFO - __main__ - Step 31352: {'lr': 0.00045302810711664944, 'samples': 6019584, 'steps': 31351, 'loss/train': 1.164172649383545} 11/07/2021 01:40:27 - INFO - __main__ - Step 31353: {'lr': 0.00045302501058033687, 'samples': 6019776, 'steps': 31352, 'loss/train': 1.0640827417373657} 11/07/2021 01:40:28 - INFO - __main__ - Step 31354: {'lr': 0.0004530219139525444, 'samples': 6019968, 'steps': 31353, 'loss/train': 1.6237752437591553} 11/07/2021 01:40:28 - INFO - __main__ - Step 31355: {'lr': 0.0004530188172332733, 'samples': 6020160, 'steps': 31354, 'loss/train': 1.6850532293319702} 11/07/2021 01:40:28 - INFO - __main__ - Step 31356: {'lr': 0.00045301572042252516, 'samples': 6020352, 'steps': 31355, 'loss/train': 0.7566108703613281} 11/07/2021 01:40:30 - INFO - __main__ - Step 31357: {'lr': 0.00045301262352030123, 'samples': 6020544, 'steps': 31356, 'loss/train': 1.6130330562591553} 11/07/2021 01:40:30 - INFO - __main__ - Step 31358: {'lr': 0.00045300952652660296, 'samples': 6020736, 'steps': 31357, 'loss/train': 2.542454481124878} 11/07/2021 01:40:30 - INFO - __main__ - Step 31359: {'lr': 0.0004530064294414317, 'samples': 6020928, 'steps': 31358, 'loss/train': 1.5415786504745483} 11/07/2021 01:40:31 - INFO - __main__ - Step 31360: {'lr': 0.00045300333226478887, 'samples': 6021120, 'steps': 31359, 'loss/train': 1.4599359035491943} 11/07/2021 01:40:31 - INFO - __main__ - Step 31361: {'lr': 0.0004530002349966759, 'samples': 6021312, 'steps': 31360, 'loss/train': 1.840524435043335} 11/07/2021 01:40:31 - INFO - __main__ - Step 31362: {'lr': 0.0004529971376370941, 'samples': 6021504, 'steps': 31361, 'loss/train': 2.0179741382598877} 11/07/2021 01:40:32 - INFO - __main__ - Step 31363: {'lr': 0.00045299404018604494, 'samples': 6021696, 'steps': 31362, 'loss/train': 1.009627103805542} 11/07/2021 01:40:33 - INFO - __main__ - Step 31364: {'lr': 0.00045299094264352987, 'samples': 6021888, 'steps': 31363, 'loss/train': 1.5297051668167114} 11/07/2021 01:40:33 - INFO - __main__ - Step 31365: {'lr': 0.00045298784500955014, 'samples': 6022080, 'steps': 31364, 'loss/train': 1.5274115800857544} 11/07/2021 01:40:33 - INFO - __main__ - Step 31366: {'lr': 0.0004529847472841073, 'samples': 6022272, 'steps': 31365, 'loss/train': 1.376988410949707} 11/07/2021 01:40:34 - INFO - __main__ - Step 31367: {'lr': 0.00045298164946720254, 'samples': 6022464, 'steps': 31366, 'loss/train': 1.6952353715896606} 11/07/2021 01:40:35 - INFO - __main__ - Step 31368: {'lr': 0.0004529785515588375, 'samples': 6022656, 'steps': 31367, 'loss/train': 1.6672512292861938} 11/07/2021 01:40:35 - INFO - __main__ - Step 31369: {'lr': 0.00045297545355901336, 'samples': 6022848, 'steps': 31368, 'loss/train': 1.4154484272003174} 11/07/2021 01:40:35 - INFO - __main__ - Step 31370: {'lr': 0.00045297235546773175, 'samples': 6023040, 'steps': 31369, 'loss/train': 1.7862539291381836} 11/07/2021 01:40:36 - INFO - __main__ - Step 31371: {'lr': 0.0004529692572849938, 'samples': 6023232, 'steps': 31370, 'loss/train': 1.7392520904541016} 11/07/2021 01:40:36 - INFO - __main__ - Step 31372: {'lr': 0.00045296615901080107, 'samples': 6023424, 'steps': 31371, 'loss/train': 1.8784608840942383} 11/07/2021 01:40:37 - INFO - __main__ - Step 31373: {'lr': 0.00045296306064515493, 'samples': 6023616, 'steps': 31372, 'loss/train': 1.1499531269073486} 11/07/2021 01:40:38 - INFO - __main__ - Step 31374: {'lr': 0.0004529599621880567, 'samples': 6023808, 'steps': 31373, 'loss/train': 2.2191970348358154} 11/07/2021 01:40:38 - INFO - __main__ - Step 31375: {'lr': 0.00045295686363950796, 'samples': 6024000, 'steps': 31374, 'loss/train': 1.522124171257019} 11/07/2021 01:40:38 - INFO - __main__ - Step 31376: {'lr': 0.0004529537649995099, 'samples': 6024192, 'steps': 31375, 'loss/train': 1.51448392868042} 11/07/2021 01:40:39 - INFO - __main__ - Step 31377: {'lr': 0.0004529506662680641, 'samples': 6024384, 'steps': 31376, 'loss/train': 1.0709054470062256} 11/07/2021 01:40:40 - INFO - __main__ - Step 31378: {'lr': 0.00045294756744517173, 'samples': 6024576, 'steps': 31377, 'loss/train': 1.801628828048706} 11/07/2021 01:40:40 - INFO - __main__ - Step 31379: {'lr': 0.00045294446853083446, 'samples': 6024768, 'steps': 31378, 'loss/train': 1.7513327598571777} 11/07/2021 01:40:41 - INFO - __main__ - Step 31380: {'lr': 0.00045294136952505346, 'samples': 6024960, 'steps': 31379, 'loss/train': 1.8567157983779907} 11/07/2021 01:40:41 - INFO - __main__ - Step 31381: {'lr': 0.0004529382704278302, 'samples': 6025152, 'steps': 31380, 'loss/train': 1.5834095478057861} 11/07/2021 01:40:41 - INFO - __main__ - Step 31382: {'lr': 0.0004529351712391661, 'samples': 6025344, 'steps': 31381, 'loss/train': 1.2147400379180908} 11/07/2021 01:40:42 - INFO - __main__ - Step 31383: {'lr': 0.0004529320719590626, 'samples': 6025536, 'steps': 31382, 'loss/train': 1.2692679166793823} 11/07/2021 01:40:43 - INFO - __main__ - Step 31384: {'lr': 0.00045292897258752095, 'samples': 6025728, 'steps': 31383, 'loss/train': 5.561633110046387} 11/07/2021 01:40:43 - INFO - __main__ - Step 31385: {'lr': 0.0004529258731245427, 'samples': 6025920, 'steps': 31384, 'loss/train': 0.9220340847969055} 11/07/2021 01:40:43 - INFO - __main__ - Step 31386: {'lr': 0.0004529227735701291, 'samples': 6026112, 'steps': 31385, 'loss/train': 1.5717658996582031} 11/07/2021 01:40:44 - INFO - __main__ - Step 31387: {'lr': 0.00045291967392428175, 'samples': 6026304, 'steps': 31386, 'loss/train': 1.629934310913086} 11/07/2021 01:40:44 - INFO - __main__ - Step 31388: {'lr': 0.0004529165741870018, 'samples': 6026496, 'steps': 31387, 'loss/train': 1.2687602043151855} 11/07/2021 01:40:45 - INFO - __main__ - Step 31389: {'lr': 0.00045291347435829087, 'samples': 6026688, 'steps': 31388, 'loss/train': 1.2830164432525635} 11/07/2021 01:40:45 - INFO - __main__ - Step 31390: {'lr': 0.0004529103744381503, 'samples': 6026880, 'steps': 31389, 'loss/train': 1.6968694925308228} 11/07/2021 01:40:46 - INFO - __main__ - Step 31391: {'lr': 0.0004529072744265813, 'samples': 6027072, 'steps': 31390, 'loss/train': 1.6593273878097534} 11/07/2021 01:40:46 - INFO - __main__ - Step 31392: {'lr': 0.00045290417432358553, 'samples': 6027264, 'steps': 31391, 'loss/train': 1.7551089525222778} 11/07/2021 01:40:46 - INFO - __main__ - Step 31393: {'lr': 0.00045290107412916425, 'samples': 6027456, 'steps': 31392, 'loss/train': 1.7737340927124023} 11/07/2021 01:40:47 - INFO - __main__ - Step 31394: {'lr': 0.0004528979738433189, 'samples': 6027648, 'steps': 31393, 'loss/train': 1.4928796291351318} 11/07/2021 01:40:48 - INFO - __main__ - Step 31395: {'lr': 0.00045289487346605075, 'samples': 6027840, 'steps': 31394, 'loss/train': 1.5345834493637085} 11/07/2021 01:40:48 - INFO - __main__ - Step 31396: {'lr': 0.0004528917729973614, 'samples': 6028032, 'steps': 31395, 'loss/train': 1.5296452045440674} 11/07/2021 01:40:49 - INFO - __main__ - Step 31397: {'lr': 0.00045288867243725207, 'samples': 6028224, 'steps': 31396, 'loss/train': 0.6855195760726929} 11/07/2021 01:40:49 - INFO - __main__ - Step 31398: {'lr': 0.00045288557178572433, 'samples': 6028416, 'steps': 31397, 'loss/train': 1.5773770809173584} 11/07/2021 01:40:50 - INFO - __main__ - Step 31399: {'lr': 0.00045288247104277937, 'samples': 6028608, 'steps': 31398, 'loss/train': 1.0291240215301514} 11/07/2021 01:40:50 - INFO - __main__ - Step 31400: {'lr': 0.0004528793702084187, 'samples': 6028800, 'steps': 31399, 'loss/train': 1.807216763496399} 11/07/2021 01:40:51 - INFO - __main__ - Step 31401: {'lr': 0.0004528762692826439, 'samples': 6028992, 'steps': 31400, 'loss/train': 1.1373130083084106} 11/07/2021 01:40:51 - INFO - __main__ - Step 31402: {'lr': 0.000452873168265456, 'samples': 6029184, 'steps': 31401, 'loss/train': 1.3746041059494019} 11/07/2021 01:40:51 - INFO - __main__ - Step 31403: {'lr': 0.00045287006715685665, 'samples': 6029376, 'steps': 31402, 'loss/train': 1.6347107887268066} 11/07/2021 01:40:52 - INFO - __main__ - Step 31404: {'lr': 0.0004528669659568472, 'samples': 6029568, 'steps': 31403, 'loss/train': 1.9266588687896729} 11/07/2021 01:40:53 - INFO - __main__ - Step 31405: {'lr': 0.00045286386466542896, 'samples': 6029760, 'steps': 31404, 'loss/train': 0.9506607055664062} 11/07/2021 01:40:53 - INFO - __main__ - Step 31406: {'lr': 0.0004528607632826034, 'samples': 6029952, 'steps': 31405, 'loss/train': 1.502123236656189} 11/07/2021 01:40:53 - INFO - __main__ - Step 31407: {'lr': 0.00045285766180837197, 'samples': 6030144, 'steps': 31406, 'loss/train': 1.4770770072937012} 11/07/2021 01:40:54 - INFO - __main__ - Step 31408: {'lr': 0.000452854560242736, 'samples': 6030336, 'steps': 31407, 'loss/train': 1.152550458908081} 11/07/2021 01:40:54 - INFO - __main__ - Step 31409: {'lr': 0.0004528514585856968, 'samples': 6030528, 'steps': 31408, 'loss/train': 1.5592893362045288} 11/07/2021 01:40:55 - INFO - __main__ - Step 31410: {'lr': 0.0004528483568372559, 'samples': 6030720, 'steps': 31409, 'loss/train': 1.3015888929367065} 11/07/2021 01:40:55 - INFO - __main__ - Step 31411: {'lr': 0.00045284525499741474, 'samples': 6030912, 'steps': 31410, 'loss/train': 1.7930604219436646} 11/07/2021 01:40:56 - INFO - __main__ - Step 31412: {'lr': 0.0004528421530661746, 'samples': 6031104, 'steps': 31411, 'loss/train': 1.5233491659164429} 11/07/2021 01:40:56 - INFO - __main__ - Step 31413: {'lr': 0.0004528390510435368, 'samples': 6031296, 'steps': 31412, 'loss/train': 1.2986992597579956} 11/07/2021 01:40:56 - INFO - __main__ - Step 31414: {'lr': 0.0004528359489295031, 'samples': 6031488, 'steps': 31413, 'loss/train': 1.3755381107330322} 11/07/2021 01:40:58 - INFO - __main__ - Step 31415: {'lr': 0.00045283284672407444, 'samples': 6031680, 'steps': 31414, 'loss/train': 1.8881810903549194} 11/07/2021 01:40:58 - INFO - __main__ - Step 31416: {'lr': 0.0004528297444272525, 'samples': 6031872, 'steps': 31415, 'loss/train': 1.3041670322418213} 11/07/2021 01:40:58 - INFO - __main__ - Step 31417: {'lr': 0.0004528266420390386, 'samples': 6032064, 'steps': 31416, 'loss/train': 2.038400650024414} 11/07/2021 01:40:59 - INFO - __main__ - Step 31418: {'lr': 0.00045282353955943417, 'samples': 6032256, 'steps': 31417, 'loss/train': 1.755852222442627} 11/07/2021 01:40:59 - INFO - __main__ - Step 31419: {'lr': 0.00045282043698844054, 'samples': 6032448, 'steps': 31418, 'loss/train': 1.505255103111267} 11/07/2021 01:41:00 - INFO - __main__ - Step 31420: {'lr': 0.0004528173343260592, 'samples': 6032640, 'steps': 31419, 'loss/train': 1.8033087253570557} 11/07/2021 01:41:00 - INFO - __main__ - Step 31421: {'lr': 0.0004528142315722915, 'samples': 6032832, 'steps': 31420, 'loss/train': 1.9411840438842773} 11/07/2021 01:41:01 - INFO - __main__ - Step 31422: {'lr': 0.0004528111287271388, 'samples': 6033024, 'steps': 31421, 'loss/train': 1.262176752090454} 11/07/2021 01:41:01 - INFO - __main__ - Step 31423: {'lr': 0.00045280802579060253, 'samples': 6033216, 'steps': 31422, 'loss/train': 1.6663079261779785} 11/07/2021 01:41:01 - INFO - __main__ - Step 31424: {'lr': 0.00045280492276268414, 'samples': 6033408, 'steps': 31423, 'loss/train': 1.670093059539795} 11/07/2021 01:41:02 - INFO - __main__ - Step 31425: {'lr': 0.0004528018196433849, 'samples': 6033600, 'steps': 31424, 'loss/train': 1.385120153427124} 11/07/2021 01:41:03 - INFO - __main__ - Step 31426: {'lr': 0.0004527987164327063, 'samples': 6033792, 'steps': 31425, 'loss/train': 1.0587403774261475} 11/07/2021 01:41:03 - INFO - __main__ - Step 31427: {'lr': 0.0004527956131306498, 'samples': 6033984, 'steps': 31426, 'loss/train': 1.2909725904464722} 11/07/2021 01:41:03 - INFO - __main__ - Step 31428: {'lr': 0.0004527925097372168, 'samples': 6034176, 'steps': 31427, 'loss/train': 1.6139020919799805} 11/07/2021 01:41:04 - INFO - __main__ - Step 31429: {'lr': 0.0004527894062524084, 'samples': 6034368, 'steps': 31428, 'loss/train': 1.4685879945755005} 11/07/2021 01:41:05 - INFO - __main__ - Step 31430: {'lr': 0.00045278630267622637, 'samples': 6034560, 'steps': 31429, 'loss/train': 1.514011263847351} 11/07/2021 01:41:05 - INFO - __main__ - Step 31431: {'lr': 0.0004527831990086719, 'samples': 6034752, 'steps': 31430, 'loss/train': 1.653131127357483} 11/07/2021 01:41:06 - INFO - __main__ - Step 31432: {'lr': 0.0004527800952497465, 'samples': 6034944, 'steps': 31431, 'loss/train': 1.4159690141677856} 11/07/2021 01:41:06 - INFO - __main__ - Step 31433: {'lr': 0.0004527769913994515, 'samples': 6035136, 'steps': 31432, 'loss/train': 1.1822314262390137} 11/07/2021 01:41:06 - INFO - __main__ - Step 31434: {'lr': 0.00045277388745778836, 'samples': 6035328, 'steps': 31433, 'loss/train': 1.4472757577896118} 11/07/2021 01:41:07 - INFO - __main__ - Step 31435: {'lr': 0.00045277078342475835, 'samples': 6035520, 'steps': 31434, 'loss/train': 1.4460768699645996} 11/07/2021 01:41:08 - INFO - __main__ - Step 31436: {'lr': 0.000452767679300363, 'samples': 6035712, 'steps': 31435, 'loss/train': 0.7411298155784607} 11/07/2021 01:41:08 - INFO - __main__ - Step 31437: {'lr': 0.00045276457508460367, 'samples': 6035904, 'steps': 31436, 'loss/train': 1.0947308540344238} 11/07/2021 01:41:08 - INFO - __main__ - Step 31438: {'lr': 0.00045276147077748176, 'samples': 6036096, 'steps': 31437, 'loss/train': 1.8609302043914795} 11/07/2021 01:41:09 - INFO - __main__ - Step 31439: {'lr': 0.0004527583663789986, 'samples': 6036288, 'steps': 31438, 'loss/train': 0.918387770652771} 11/07/2021 01:41:09 - INFO - __main__ - Step 31440: {'lr': 0.0004527552618891557, 'samples': 6036480, 'steps': 31439, 'loss/train': 1.2426555156707764} 11/07/2021 01:41:10 - INFO - __main__ - Step 31441: {'lr': 0.0004527521573079544, 'samples': 6036672, 'steps': 31440, 'loss/train': 1.590278148651123} 11/07/2021 01:41:11 - INFO - __main__ - Step 31442: {'lr': 0.0004527490526353961, 'samples': 6036864, 'steps': 31441, 'loss/train': 1.5419851541519165} 11/07/2021 01:41:11 - INFO - __main__ - Step 31443: {'lr': 0.0004527459478714822, 'samples': 6037056, 'steps': 31442, 'loss/train': 1.2262630462646484} 11/07/2021 01:41:11 - INFO - __main__ - Step 31444: {'lr': 0.00045274284301621414, 'samples': 6037248, 'steps': 31443, 'loss/train': 1.4954262971878052} 11/07/2021 01:41:12 - INFO - __main__ - Step 31445: {'lr': 0.00045273973806959325, 'samples': 6037440, 'steps': 31444, 'loss/train': 1.7720775604248047} 11/07/2021 01:41:13 - INFO - __main__ - Step 31446: {'lr': 0.00045273663303162096, 'samples': 6037632, 'steps': 31445, 'loss/train': 1.8512814044952393} 11/07/2021 01:41:13 - INFO - __main__ - Step 31447: {'lr': 0.00045273352790229873, 'samples': 6037824, 'steps': 31446, 'loss/train': 1.5024912357330322} 11/07/2021 01:41:13 - INFO - __main__ - Step 31448: {'lr': 0.0004527304226816278, 'samples': 6038016, 'steps': 31447, 'loss/train': 1.6347882747650146} 11/07/2021 01:41:14 - INFO - __main__ - Step 31449: {'lr': 0.0004527273173696097, 'samples': 6038208, 'steps': 31448, 'loss/train': 1.7430769205093384} 11/07/2021 01:41:14 - INFO - __main__ - Step 31450: {'lr': 0.0004527242119662458, 'samples': 6038400, 'steps': 31449, 'loss/train': 2.1880335807800293} 11/07/2021 01:41:14 - INFO - __main__ - Step 31451: {'lr': 0.00045272110647153754, 'samples': 6038592, 'steps': 31450, 'loss/train': 1.2813395261764526} 11/07/2021 01:41:15 - INFO - __main__ - Step 31452: {'lr': 0.00045271800088548625, 'samples': 6038784, 'steps': 31451, 'loss/train': 1.6504584550857544} 11/07/2021 01:41:16 - INFO - __main__ - Step 31453: {'lr': 0.00045271489520809337, 'samples': 6038976, 'steps': 31452, 'loss/train': 1.8255902528762817} 11/07/2021 01:41:16 - INFO - __main__ - Step 31454: {'lr': 0.0004527117894393603, 'samples': 6039168, 'steps': 31453, 'loss/train': 1.3392289876937866} 11/07/2021 01:41:17 - INFO - __main__ - Step 31455: {'lr': 0.0004527086835792884, 'samples': 6039360, 'steps': 31454, 'loss/train': 1.391819715499878} 11/07/2021 01:41:17 - INFO - __main__ - Step 31456: {'lr': 0.0004527055776278791, 'samples': 6039552, 'steps': 31455, 'loss/train': 1.526758074760437} 11/07/2021 01:41:18 - INFO - __main__ - Step 31457: {'lr': 0.00045270247158513377, 'samples': 6039744, 'steps': 31456, 'loss/train': 1.5195149183273315} 11/07/2021 01:41:18 - INFO - __main__ - Step 31458: {'lr': 0.00045269936545105384, 'samples': 6039936, 'steps': 31457, 'loss/train': 1.703020453453064} 11/07/2021 01:41:19 - INFO - __main__ - Step 31459: {'lr': 0.0004526962592256407, 'samples': 6040128, 'steps': 31458, 'loss/train': 1.6962475776672363} 11/07/2021 01:41:19 - INFO - __main__ - Step 31460: {'lr': 0.00045269315290889583, 'samples': 6040320, 'steps': 31459, 'loss/train': 1.5722194910049438} 11/07/2021 01:41:19 - INFO - __main__ - Step 31461: {'lr': 0.00045269004650082045, 'samples': 6040512, 'steps': 31460, 'loss/train': 2.1078343391418457} 11/07/2021 01:41:20 - INFO - __main__ - Step 31462: {'lr': 0.0004526869400014162, 'samples': 6040704, 'steps': 31461, 'loss/train': 1.6594512462615967} 11/07/2021 01:41:21 - INFO - __main__ - Step 31463: {'lr': 0.0004526838334106842, 'samples': 6040896, 'steps': 31462, 'loss/train': 1.9305704832077026} 11/07/2021 01:41:21 - INFO - __main__ - Step 31464: {'lr': 0.000452680726728626, 'samples': 6041088, 'steps': 31463, 'loss/train': 1.2269365787506104} 11/07/2021 01:41:21 - INFO - __main__ - Step 31465: {'lr': 0.00045267761995524314, 'samples': 6041280, 'steps': 31464, 'loss/train': 1.296766996383667} 11/07/2021 01:41:22 - INFO - __main__ - Step 31466: {'lr': 0.00045267451309053677, 'samples': 6041472, 'steps': 31465, 'loss/train': 0.952112078666687} 11/07/2021 01:41:23 - INFO - __main__ - Step 31467: {'lr': 0.0004526714061345084, 'samples': 6041664, 'steps': 31466, 'loss/train': 0.7781227231025696} 11/07/2021 01:41:23 - INFO - __main__ - Step 31468: {'lr': 0.0004526682990871593, 'samples': 6041856, 'steps': 31467, 'loss/train': 1.4286562204360962} 11/07/2021 01:41:23 - INFO - __main__ - Step 31469: {'lr': 0.0004526651919484912, 'samples': 6042048, 'steps': 31468, 'loss/train': 1.330336332321167} 11/07/2021 01:41:24 - INFO - __main__ - Step 31470: {'lr': 0.00045266208471850516, 'samples': 6042240, 'steps': 31469, 'loss/train': 0.9660208821296692} 11/07/2021 01:41:24 - INFO - __main__ - Step 31471: {'lr': 0.00045265897739720277, 'samples': 6042432, 'steps': 31470, 'loss/train': 1.2458423376083374} 11/07/2021 01:41:25 - INFO - __main__ - Step 31472: {'lr': 0.00045265586998458534, 'samples': 6042624, 'steps': 31471, 'loss/train': 1.4908545017242432} 11/07/2021 01:41:25 - INFO - __main__ - Step 31473: {'lr': 0.00045265276248065436, 'samples': 6042816, 'steps': 31472, 'loss/train': 1.8791873455047607} 11/07/2021 01:41:26 - INFO - __main__ - Step 31474: {'lr': 0.0004526496548854111, 'samples': 6043008, 'steps': 31473, 'loss/train': 1.3738685846328735} 11/07/2021 01:41:26 - INFO - __main__ - Step 31475: {'lr': 0.000452646547198857, 'samples': 6043200, 'steps': 31474, 'loss/train': 1.3974919319152832} 11/07/2021 01:41:27 - INFO - __main__ - Step 31476: {'lr': 0.0004526434394209936, 'samples': 6043392, 'steps': 31475, 'loss/train': 1.493938684463501} 11/07/2021 01:41:27 - INFO - __main__ - Step 31477: {'lr': 0.00045264033155182216, 'samples': 6043584, 'steps': 31476, 'loss/train': 1.66843581199646} 11/07/2021 01:41:28 - INFO - __main__ - Step 31478: {'lr': 0.0004526372235913441, 'samples': 6043776, 'steps': 31477, 'loss/train': 1.3735740184783936} 11/07/2021 01:41:28 - INFO - __main__ - Step 31479: {'lr': 0.0004526341155395608, 'samples': 6043968, 'steps': 31478, 'loss/train': 1.2243516445159912} 11/07/2021 01:41:29 - INFO - __main__ - Step 31480: {'lr': 0.00045263100739647373, 'samples': 6044160, 'steps': 31479, 'loss/train': 1.4764176607131958} 11/07/2021 01:41:29 - INFO - __main__ - Step 31481: {'lr': 0.00045262789916208424, 'samples': 6044352, 'steps': 31480, 'loss/train': 1.6852359771728516} 11/07/2021 01:41:30 - INFO - __main__ - Step 31482: {'lr': 0.00045262479083639376, 'samples': 6044544, 'steps': 31481, 'loss/train': 1.6857291460037231} 11/07/2021 01:41:30 - INFO - __main__ - Step 31483: {'lr': 0.0004526216824194037, 'samples': 6044736, 'steps': 31482, 'loss/train': 0.5070812106132507} 11/07/2021 01:41:31 - INFO - __main__ - Step 31484: {'lr': 0.00045261857391111536, 'samples': 6044928, 'steps': 31483, 'loss/train': 0.8546400666236877} 11/07/2021 01:41:31 - INFO - __main__ - Step 31485: {'lr': 0.0004526154653115303, 'samples': 6045120, 'steps': 31484, 'loss/train': 1.9318288564682007} 11/07/2021 01:41:31 - INFO - __main__ - Step 31486: {'lr': 0.0004526123566206498, 'samples': 6045312, 'steps': 31485, 'loss/train': 1.4708870649337769} 11/07/2021 01:41:32 - INFO - __main__ - Step 31487: {'lr': 0.0004526092478384753, 'samples': 6045504, 'steps': 31486, 'loss/train': 1.7455925941467285} 11/07/2021 01:41:33 - INFO - __main__ - Step 31488: {'lr': 0.00045260613896500827, 'samples': 6045696, 'steps': 31487, 'loss/train': 1.8832435607910156} 11/07/2021 01:41:33 - INFO - __main__ - Step 31489: {'lr': 0.00045260303000024994, 'samples': 6045888, 'steps': 31488, 'loss/train': 1.7852832078933716} 11/07/2021 01:41:33 - INFO - __main__ - Step 31490: {'lr': 0.0004525999209442018, 'samples': 6046080, 'steps': 31489, 'loss/train': 1.0340776443481445} 11/07/2021 01:41:34 - INFO - __main__ - Step 31491: {'lr': 0.0004525968117968653, 'samples': 6046272, 'steps': 31490, 'loss/train': 1.484983205795288} 11/07/2021 01:41:34 - INFO - __main__ - Step 31492: {'lr': 0.00045259370255824183, 'samples': 6046464, 'steps': 31491, 'loss/train': 1.6369127035140991} 11/07/2021 01:41:35 - INFO - __main__ - Step 31493: {'lr': 0.0004525905932283327, 'samples': 6046656, 'steps': 31492, 'loss/train': 1.3930445909500122} 11/07/2021 01:41:36 - INFO - __main__ - Step 31494: {'lr': 0.00045258748380713943, 'samples': 6046848, 'steps': 31493, 'loss/train': 1.0716495513916016} 11/07/2021 01:41:36 - INFO - __main__ - Step 31495: {'lr': 0.00045258437429466337, 'samples': 6047040, 'steps': 31494, 'loss/train': 0.8319592475891113} 11/07/2021 01:41:36 - INFO - __main__ - Step 31496: {'lr': 0.0004525812646909059, 'samples': 6047232, 'steps': 31495, 'loss/train': 1.5362430810928345} 11/07/2021 01:41:37 - INFO - __main__ - Step 31497: {'lr': 0.0004525781549958684, 'samples': 6047424, 'steps': 31496, 'loss/train': 0.8949344754219055} 11/07/2021 01:41:38 - INFO - __main__ - Step 31498: {'lr': 0.0004525750452095524, 'samples': 6047616, 'steps': 31497, 'loss/train': 2.0039658546447754} 11/07/2021 01:41:38 - INFO - __main__ - Step 31499: {'lr': 0.00045257193533195916, 'samples': 6047808, 'steps': 31498, 'loss/train': 1.5422204732894897} 11/07/2021 01:41:38 - INFO - __main__ - Step 31500: {'lr': 0.0004525688253630901, 'samples': 6048000, 'steps': 31499, 'loss/train': 2.1831295490264893} 11/07/2021 01:41:39 - INFO - __main__ - Step 31501: {'lr': 0.00045256571530294664, 'samples': 6048192, 'steps': 31500, 'loss/train': 1.1621021032333374} 11/07/2021 01:41:39 - INFO - __main__ - Step 31502: {'lr': 0.0004525626051515302, 'samples': 6048384, 'steps': 31501, 'loss/train': 1.3157600164413452} 11/07/2021 01:41:40 - INFO - __main__ - Step 31503: {'lr': 0.0004525594949088423, 'samples': 6048576, 'steps': 31502, 'loss/train': 0.9412500262260437} 11/07/2021 01:41:40 - INFO - __main__ - Step 31504: {'lr': 0.00045255638457488415, 'samples': 6048768, 'steps': 31503, 'loss/train': 0.45820948481559753} 11/07/2021 01:41:41 - INFO - __main__ - Step 31505: {'lr': 0.0004525532741496572, 'samples': 6048960, 'steps': 31504, 'loss/train': 1.6507850885391235} 11/07/2021 01:41:41 - INFO - __main__ - Step 31506: {'lr': 0.0004525501636331628, 'samples': 6049152, 'steps': 31505, 'loss/train': 1.6992638111114502} 11/07/2021 01:41:41 - INFO - __main__ - Step 31507: {'lr': 0.00045254705302540257, 'samples': 6049344, 'steps': 31506, 'loss/train': 1.7947462797164917} 11/07/2021 01:41:43 - INFO - __main__ - Step 31508: {'lr': 0.00045254394232637765, 'samples': 6049536, 'steps': 31507, 'loss/train': 1.3244881629943848} 11/07/2021 01:41:43 - INFO - __main__ - Step 31509: {'lr': 0.0004525408315360896, 'samples': 6049728, 'steps': 31508, 'loss/train': 2.219317674636841} 11/07/2021 01:41:43 - INFO - __main__ - Step 31510: {'lr': 0.00045253772065453977, 'samples': 6049920, 'steps': 31509, 'loss/train': 1.6535799503326416} 11/07/2021 01:41:44 - INFO - __main__ - Step 31511: {'lr': 0.00045253460968172957, 'samples': 6050112, 'steps': 31510, 'loss/train': 1.5129849910736084} 11/07/2021 01:41:44 - INFO - __main__ - Step 31512: {'lr': 0.0004525314986176604, 'samples': 6050304, 'steps': 31511, 'loss/train': 1.443256139755249} 11/07/2021 01:41:45 - INFO - __main__ - Step 31513: {'lr': 0.0004525283874623336, 'samples': 6050496, 'steps': 31512, 'loss/train': 1.5510832071304321} 11/07/2021 01:41:45 - INFO - __main__ - Step 31514: {'lr': 0.00045252527621575075, 'samples': 6050688, 'steps': 31513, 'loss/train': 1.423369288444519} 11/07/2021 01:41:46 - INFO - __main__ - Step 31515: {'lr': 0.0004525221648779131, 'samples': 6050880, 'steps': 31514, 'loss/train': 0.9893391132354736} 11/07/2021 01:41:46 - INFO - __main__ - Step 31516: {'lr': 0.00045251905344882205, 'samples': 6051072, 'steps': 31515, 'loss/train': 1.3399033546447754} 11/07/2021 01:41:46 - INFO - __main__ - Step 31517: {'lr': 0.000452515941928479, 'samples': 6051264, 'steps': 31516, 'loss/train': 1.4280192852020264} 11/07/2021 01:41:47 - INFO - __main__ - Step 31518: {'lr': 0.0004525128303168855, 'samples': 6051456, 'steps': 31517, 'loss/train': 1.543757677078247} 11/07/2021 01:41:48 - INFO - __main__ - Step 31519: {'lr': 0.00045250971861404276, 'samples': 6051648, 'steps': 31518, 'loss/train': 1.4377233982086182} 11/07/2021 01:41:48 - INFO - __main__ - Step 31520: {'lr': 0.0004525066068199523, 'samples': 6051840, 'steps': 31519, 'loss/train': 1.5889067649841309} 11/07/2021 01:41:48 - INFO - __main__ - Step 31521: {'lr': 0.0004525034949346155, 'samples': 6052032, 'steps': 31520, 'loss/train': 1.63162100315094} 11/07/2021 01:41:49 - INFO - __main__ - Step 31522: {'lr': 0.0004525003829580337, 'samples': 6052224, 'steps': 31521, 'loss/train': 1.91524076461792} 11/07/2021 01:41:49 - INFO - __main__ - Step 31523: {'lr': 0.0004524972708902084, 'samples': 6052416, 'steps': 31522, 'loss/train': 1.4860838651657104} 11/07/2021 01:41:50 - INFO - __main__ - Step 31524: {'lr': 0.0004524941587311409, 'samples': 6052608, 'steps': 31523, 'loss/train': 1.5862531661987305} 11/07/2021 01:41:50 - INFO - __main__ - Step 31525: {'lr': 0.0004524910464808327, 'samples': 6052800, 'steps': 31524, 'loss/train': 1.6125633716583252} 11/07/2021 01:41:51 - INFO - __main__ - Step 31526: {'lr': 0.00045248793413928514, 'samples': 6052992, 'steps': 31525, 'loss/train': 1.2867863178253174} 11/07/2021 01:41:51 - INFO - __main__ - Step 31527: {'lr': 0.0004524848217064997, 'samples': 6053184, 'steps': 31526, 'loss/train': 1.5448814630508423} 11/07/2021 01:41:51 - INFO - __main__ - Step 31528: {'lr': 0.0004524817091824777, 'samples': 6053376, 'steps': 31527, 'loss/train': 1.3339594602584839} 11/07/2021 01:41:52 - INFO - __main__ - Step 31529: {'lr': 0.00045247859656722056, 'samples': 6053568, 'steps': 31528, 'loss/train': 1.3554816246032715} 11/07/2021 01:41:53 - INFO - __main__ - Step 31530: {'lr': 0.0004524754838607297, 'samples': 6053760, 'steps': 31529, 'loss/train': 1.2281231880187988} 11/07/2021 01:41:53 - INFO - __main__ - Step 31531: {'lr': 0.0004524723710630064, 'samples': 6053952, 'steps': 31530, 'loss/train': 1.370402455329895} 11/07/2021 01:41:53 - INFO - __main__ - Step 31532: {'lr': 0.0004524692581740523, 'samples': 6054144, 'steps': 31531, 'loss/train': 1.7626378536224365} 11/07/2021 01:41:54 - INFO - __main__ - Step 31533: {'lr': 0.00045246614519386865, 'samples': 6054336, 'steps': 31532, 'loss/train': 1.6262603998184204} 11/07/2021 01:41:55 - INFO - __main__ - Step 31534: {'lr': 0.0004524630321224569, 'samples': 6054528, 'steps': 31533, 'loss/train': 1.3756903409957886} 11/07/2021 01:41:55 - INFO - __main__ - Step 31535: {'lr': 0.0004524599189598183, 'samples': 6054720, 'steps': 31534, 'loss/train': 2.3842573165893555} 11/07/2021 01:41:56 - INFO - __main__ - Step 31536: {'lr': 0.0004524568057059545, 'samples': 6054912, 'steps': 31535, 'loss/train': 1.2920001745224} 11/07/2021 01:41:56 - INFO - __main__ - Step 31537: {'lr': 0.00045245369236086673, 'samples': 6055104, 'steps': 31536, 'loss/train': 1.7213891744613647} 11/07/2021 01:41:56 - INFO - __main__ - Step 31538: {'lr': 0.00045245057892455653, 'samples': 6055296, 'steps': 31537, 'loss/train': 1.6329360008239746} 11/07/2021 01:41:57 - INFO - __main__ - Step 31539: {'lr': 0.0004524474653970252, 'samples': 6055488, 'steps': 31538, 'loss/train': 1.9639887809753418} 11/07/2021 01:41:58 - INFO - __main__ - Step 31540: {'lr': 0.00045244435177827413, 'samples': 6055680, 'steps': 31539, 'loss/train': 1.7359849214553833} 11/07/2021 01:41:58 - INFO - __main__ - Step 31541: {'lr': 0.00045244123806830486, 'samples': 6055872, 'steps': 31540, 'loss/train': 1.2768898010253906} 11/07/2021 01:41:59 - INFO - __main__ - Step 31542: {'lr': 0.00045243812426711856, 'samples': 6056064, 'steps': 31541, 'loss/train': 1.4651298522949219} 11/07/2021 01:41:59 - INFO - __main__ - Step 31543: {'lr': 0.0004524350103747168, 'samples': 6056256, 'steps': 31542, 'loss/train': 1.398427128791809} 11/07/2021 01:42:00 - INFO - __main__ - Step 31544: {'lr': 0.00045243189639110093, 'samples': 6056448, 'steps': 31543, 'loss/train': 1.791009783744812} 11/07/2021 01:42:01 - INFO - __main__ - Step 31545: {'lr': 0.00045242878231627247, 'samples': 6056640, 'steps': 31544, 'loss/train': 1.4630930423736572} 11/07/2021 01:42:01 - INFO - __main__ - Step 31546: {'lr': 0.0004524256681502327, 'samples': 6056832, 'steps': 31545, 'loss/train': 1.5687968730926514} 11/07/2021 01:42:01 - INFO - __main__ - Step 31547: {'lr': 0.0004524225538929829, 'samples': 6057024, 'steps': 31546, 'loss/train': 1.714564323425293} 11/07/2021 01:42:02 - INFO - __main__ - Step 31548: {'lr': 0.0004524194395445248, 'samples': 6057216, 'steps': 31547, 'loss/train': 0.513561487197876} 11/07/2021 01:42:03 - INFO - __main__ - Step 31549: {'lr': 0.0004524163251048595, 'samples': 6057408, 'steps': 31548, 'loss/train': 1.0790307521820068} 11/07/2021 01:42:03 - INFO - __main__ - Step 31550: {'lr': 0.0004524132105739886, 'samples': 6057600, 'steps': 31549, 'loss/train': 1.5847446918487549} 11/07/2021 01:42:03 - INFO - __main__ - Step 31551: {'lr': 0.0004524100959519134, 'samples': 6057792, 'steps': 31550, 'loss/train': 1.0553903579711914} 11/07/2021 01:42:04 - INFO - __main__ - Step 31552: {'lr': 0.00045240698123863535, 'samples': 6057984, 'steps': 31551, 'loss/train': 1.499010443687439} 11/07/2021 01:42:04 - INFO - __main__ - Step 31553: {'lr': 0.0004524038664341558, 'samples': 6058176, 'steps': 31552, 'loss/train': 1.598149299621582} 11/07/2021 01:42:05 - INFO - __main__ - Step 31554: {'lr': 0.00045240075153847625, 'samples': 6058368, 'steps': 31553, 'loss/train': 1.6579509973526} 11/07/2021 01:42:05 - INFO - __main__ - Step 31555: {'lr': 0.00045239763655159805, 'samples': 6058560, 'steps': 31554, 'loss/train': 1.8098655939102173} 11/07/2021 01:42:06 - INFO - __main__ - Step 31556: {'lr': 0.00045239452147352257, 'samples': 6058752, 'steps': 31555, 'loss/train': 1.4907845258712769} 11/07/2021 01:42:06 - INFO - __main__ - Step 31557: {'lr': 0.0004523914063042512, 'samples': 6058944, 'steps': 31556, 'loss/train': 1.4728018045425415} 11/07/2021 01:42:07 - INFO - __main__ - Step 31558: {'lr': 0.00045238829104378545, 'samples': 6059136, 'steps': 31557, 'loss/train': 1.4942617416381836} 11/07/2021 01:42:07 - INFO - __main__ - Step 31559: {'lr': 0.0004523851756921266, 'samples': 6059328, 'steps': 31558, 'loss/train': 1.0715967416763306} 11/07/2021 01:42:08 - INFO - __main__ - Step 31560: {'lr': 0.00045238206024927614, 'samples': 6059520, 'steps': 31559, 'loss/train': 1.5499707460403442} 11/07/2021 01:42:08 - INFO - __main__ - Step 31561: {'lr': 0.00045237894471523543, 'samples': 6059712, 'steps': 31560, 'loss/train': 1.5776026248931885} 11/07/2021 01:42:09 - INFO - __main__ - Step 31562: {'lr': 0.00045237582909000594, 'samples': 6059904, 'steps': 31561, 'loss/train': 1.4897489547729492} 11/07/2021 01:42:09 - INFO - __main__ - Step 31563: {'lr': 0.00045237271337358897, 'samples': 6060096, 'steps': 31562, 'loss/train': 1.5823599100112915} 11/07/2021 01:42:09 - INFO - __main__ - Step 31564: {'lr': 0.00045236959756598605, 'samples': 6060288, 'steps': 31563, 'loss/train': 1.4825657606124878} 11/07/2021 01:42:10 - INFO - __main__ - Step 31565: {'lr': 0.0004523664816671985, 'samples': 6060480, 'steps': 31564, 'loss/train': 1.520081639289856} 11/07/2021 01:42:11 - INFO - __main__ - Step 31566: {'lr': 0.0004523633656772277, 'samples': 6060672, 'steps': 31565, 'loss/train': 1.5277254581451416} 11/07/2021 01:42:11 - INFO - __main__ - Step 31567: {'lr': 0.00045236024959607505, 'samples': 6060864, 'steps': 31566, 'loss/train': 1.4078350067138672} 11/07/2021 01:42:11 - INFO - __main__ - Step 31568: {'lr': 0.00045235713342374207, 'samples': 6061056, 'steps': 31567, 'loss/train': 1.245788812637329} 11/07/2021 01:42:12 - INFO - __main__ - Step 31569: {'lr': 0.00045235401716023, 'samples': 6061248, 'steps': 31568, 'loss/train': 1.6251945495605469} 11/07/2021 01:42:13 - INFO - __main__ - Step 31570: {'lr': 0.0004523509008055404, 'samples': 6061440, 'steps': 31569, 'loss/train': 0.8039262890815735} 11/07/2021 01:42:13 - INFO - __main__ - Step 31571: {'lr': 0.0004523477843596746, 'samples': 6061632, 'steps': 31570, 'loss/train': 1.3582419157028198} 11/07/2021 01:42:13 - INFO - __main__ - Step 31572: {'lr': 0.00045234466782263403, 'samples': 6061824, 'steps': 31571, 'loss/train': 1.2797014713287354} 11/07/2021 01:42:14 - INFO - __main__ - Step 31573: {'lr': 0.00045234155119442, 'samples': 6062016, 'steps': 31572, 'loss/train': 1.9971174001693726} 11/07/2021 01:42:14 - INFO - __main__ - Step 31574: {'lr': 0.00045233843447503407, 'samples': 6062208, 'steps': 31573, 'loss/train': 1.3037500381469727} 11/07/2021 01:42:15 - INFO - __main__ - Step 31575: {'lr': 0.00045233531766447757, 'samples': 6062400, 'steps': 31574, 'loss/train': 1.8818330764770508} 11/07/2021 01:42:16 - INFO - __main__ - Step 31576: {'lr': 0.00045233220076275186, 'samples': 6062592, 'steps': 31575, 'loss/train': 1.2794921398162842} 11/07/2021 01:42:16 - INFO - __main__ - Step 31577: {'lr': 0.0004523290837698583, 'samples': 6062784, 'steps': 31576, 'loss/train': 1.7762749195098877} 11/07/2021 01:42:17 - INFO - __main__ - Step 31578: {'lr': 0.0004523259666857985, 'samples': 6062976, 'steps': 31577, 'loss/train': 0.7119866013526917} 11/07/2021 01:42:17 - INFO - __main__ - Step 31579: {'lr': 0.00045232284951057366, 'samples': 6063168, 'steps': 31578, 'loss/train': 1.141363263130188} 11/07/2021 01:42:17 - INFO - __main__ - Step 31580: {'lr': 0.00045231973224418533, 'samples': 6063360, 'steps': 31579, 'loss/train': 1.2381643056869507} 11/07/2021 01:42:18 - INFO - __main__ - Step 31581: {'lr': 0.00045231661488663485, 'samples': 6063552, 'steps': 31580, 'loss/train': 1.0016676187515259} 11/07/2021 01:42:19 - INFO - __main__ - Step 31582: {'lr': 0.0004523134974379236, 'samples': 6063744, 'steps': 31581, 'loss/train': 1.7150145769119263} 11/07/2021 01:42:19 - INFO - __main__ - Step 31583: {'lr': 0.000452310379898053, 'samples': 6063936, 'steps': 31582, 'loss/train': 1.178554654121399} 11/07/2021 01:42:19 - INFO - __main__ - Step 31584: {'lr': 0.00045230726226702444, 'samples': 6064128, 'steps': 31583, 'loss/train': 1.5771918296813965} 11/07/2021 01:42:20 - INFO - __main__ - Step 31585: {'lr': 0.0004523041445448394, 'samples': 6064320, 'steps': 31584, 'loss/train': 1.6697421073913574} 11/07/2021 01:42:21 - INFO - __main__ - Step 31586: {'lr': 0.00045230102673149923, 'samples': 6064512, 'steps': 31585, 'loss/train': 1.4931589365005493} 11/07/2021 01:42:21 - INFO - __main__ - Step 31587: {'lr': 0.00045229790882700535, 'samples': 6064704, 'steps': 31586, 'loss/train': 1.292038917541504} 11/07/2021 01:42:21 - INFO - __main__ - Step 31588: {'lr': 0.00045229479083135917, 'samples': 6064896, 'steps': 31587, 'loss/train': 2.3265585899353027} 11/07/2021 01:42:22 - INFO - __main__ - Step 31589: {'lr': 0.000452291672744562, 'samples': 6065088, 'steps': 31588, 'loss/train': 0.961132824420929} 11/07/2021 01:42:22 - INFO - __main__ - Step 31590: {'lr': 0.0004522885545666153, 'samples': 6065280, 'steps': 31589, 'loss/train': 0.4769384562969208} 11/07/2021 01:42:24 - INFO - __main__ - Step 31591: {'lr': 0.0004522854362975206, 'samples': 6065472, 'steps': 31590, 'loss/train': 0.729523241519928} 11/07/2021 01:42:24 - INFO - __main__ - Step 31592: {'lr': 0.00045228231793727924, 'samples': 6065664, 'steps': 31591, 'loss/train': 0.5749496221542358} 11/07/2021 01:42:24 - INFO - __main__ - Step 31593: {'lr': 0.00045227919948589247, 'samples': 6065856, 'steps': 31592, 'loss/train': 1.5208745002746582} 11/07/2021 01:42:25 - INFO - __main__ - Step 31594: {'lr': 0.0004522760809433619, 'samples': 6066048, 'steps': 31593, 'loss/train': 5.449794292449951} 11/07/2021 01:42:25 - INFO - __main__ - Step 31595: {'lr': 0.0004522729623096888, 'samples': 6066240, 'steps': 31594, 'loss/train': 5.459228038787842} 11/07/2021 01:42:25 - INFO - __main__ - Step 31596: {'lr': 0.0004522698435848747, 'samples': 6066432, 'steps': 31595, 'loss/train': 1.2829856872558594} 11/07/2021 01:42:26 - INFO - __main__ - Step 31597: {'lr': 0.0004522667247689208, 'samples': 6066624, 'steps': 31596, 'loss/train': 1.6735295057296753} 11/07/2021 01:42:27 - INFO - __main__ - Step 31598: {'lr': 0.0004522636058618287, 'samples': 6066816, 'steps': 31597, 'loss/train': 0.953147828578949} 11/07/2021 01:42:27 - INFO - __main__ - Step 31599: {'lr': 0.0004522604868635998, 'samples': 6067008, 'steps': 31598, 'loss/train': 1.3888795375823975} 11/07/2021 01:42:27 - INFO - __main__ - Step 31600: {'lr': 0.0004522573677742353, 'samples': 6067200, 'steps': 31599, 'loss/train': 1.8566700220108032} 11/07/2021 01:42:28 - INFO - __main__ - Step 31601: {'lr': 0.0004522542485937369, 'samples': 6067392, 'steps': 31600, 'loss/train': 1.5361170768737793} 11/07/2021 01:42:28 - INFO - __main__ - Step 31602: {'lr': 0.0004522511293221058, 'samples': 6067584, 'steps': 31601, 'loss/train': 1.6162889003753662} 11/07/2021 01:42:29 - INFO - __main__ - Step 31603: {'lr': 0.00045224800995934345, 'samples': 6067776, 'steps': 31602, 'loss/train': 1.435838222503662} 11/07/2021 01:42:30 - INFO - __main__ - Step 31604: {'lr': 0.00045224489050545125, 'samples': 6067968, 'steps': 31603, 'loss/train': 1.3735251426696777} 11/07/2021 01:42:30 - INFO - __main__ - Step 31605: {'lr': 0.0004522417709604306, 'samples': 6068160, 'steps': 31604, 'loss/train': 1.6512362957000732} 11/07/2021 01:42:30 - INFO - __main__ - Step 31606: {'lr': 0.000452238651324283, 'samples': 6068352, 'steps': 31605, 'loss/train': 1.6479780673980713} 11/07/2021 01:42:31 - INFO - __main__ - Step 31607: {'lr': 0.0004522355315970098, 'samples': 6068544, 'steps': 31606, 'loss/train': 1.9048182964324951} 11/07/2021 01:42:32 - INFO - __main__ - Step 31608: {'lr': 0.0004522324117786123, 'samples': 6068736, 'steps': 31607, 'loss/train': 1.587449073791504} 11/07/2021 01:42:32 - INFO - __main__ - Step 31609: {'lr': 0.0004522292918690921, 'samples': 6068928, 'steps': 31608, 'loss/train': 1.3543143272399902} 11/07/2021 01:42:32 - INFO - __main__ - Step 31610: {'lr': 0.0004522261718684504, 'samples': 6069120, 'steps': 31609, 'loss/train': 1.546761393547058} 11/07/2021 01:42:33 - INFO - __main__ - Step 31611: {'lr': 0.00045222305177668875, 'samples': 6069312, 'steps': 31610, 'loss/train': 0.9692068696022034} 11/07/2021 01:42:33 - INFO - __main__ - Step 31612: {'lr': 0.00045221993159380857, 'samples': 6069504, 'steps': 31611, 'loss/train': 1.5525907278060913} 11/07/2021 01:42:34 - INFO - __main__ - Step 31613: {'lr': 0.00045221681131981116, 'samples': 6069696, 'steps': 31612, 'loss/train': 1.702857494354248} 11/07/2021 01:42:34 - INFO - __main__ - Step 31614: {'lr': 0.00045221369095469795, 'samples': 6069888, 'steps': 31613, 'loss/train': 0.9163973927497864} 11/07/2021 01:42:35 - INFO - __main__ - Step 31615: {'lr': 0.00045221057049847044, 'samples': 6070080, 'steps': 31614, 'loss/train': 1.3347078561782837} 11/07/2021 01:42:35 - INFO - __main__ - Step 31616: {'lr': 0.0004522074499511299, 'samples': 6070272, 'steps': 31615, 'loss/train': 1.5862935781478882} 11/07/2021 01:42:35 - INFO - __main__ - Step 31617: {'lr': 0.0004522043293126778, 'samples': 6070464, 'steps': 31616, 'loss/train': 2.0183849334716797} 11/07/2021 01:42:37 - INFO - __main__ - Step 31618: {'lr': 0.00045220120858311557, 'samples': 6070656, 'steps': 31617, 'loss/train': 1.6048452854156494} 11/07/2021 01:42:37 - INFO - __main__ - Step 31619: {'lr': 0.0004521980877624446, 'samples': 6070848, 'steps': 31618, 'loss/train': 1.529024362564087} 11/07/2021 01:42:37 - INFO - __main__ - Step 31620: {'lr': 0.0004521949668506663, 'samples': 6071040, 'steps': 31619, 'loss/train': 1.7623778581619263} 11/07/2021 01:42:38 - INFO - __main__ - Step 31621: {'lr': 0.00045219184584778207, 'samples': 6071232, 'steps': 31620, 'loss/train': 1.584256887435913} 11/07/2021 01:42:38 - INFO - __main__ - Step 31622: {'lr': 0.0004521887247537933, 'samples': 6071424, 'steps': 31621, 'loss/train': 1.4615974426269531} 11/07/2021 01:42:38 - INFO - __main__ - Step 31623: {'lr': 0.00045218560356870144, 'samples': 6071616, 'steps': 31622, 'loss/train': 1.1545277833938599} 11/07/2021 01:42:39 - INFO - __main__ - Step 31624: {'lr': 0.0004521824822925078, 'samples': 6071808, 'steps': 31623, 'loss/train': 1.0066297054290771} 11/07/2021 01:42:40 - INFO - __main__ - Step 31625: {'lr': 0.00045217936092521396, 'samples': 6072000, 'steps': 31624, 'loss/train': 1.429289698600769} 11/07/2021 01:42:40 - INFO - __main__ - Step 31626: {'lr': 0.00045217623946682114, 'samples': 6072192, 'steps': 31625, 'loss/train': 1.3774598836898804} 11/07/2021 01:42:40 - INFO - __main__ - Step 31627: {'lr': 0.00045217311791733084, 'samples': 6072384, 'steps': 31626, 'loss/train': 1.5545233488082886} 11/07/2021 01:42:41 - INFO - __main__ - Step 31628: {'lr': 0.00045216999627674436, 'samples': 6072576, 'steps': 31627, 'loss/train': 1.5675814151763916} 11/07/2021 01:42:42 - INFO - __main__ - Step 31629: {'lr': 0.0004521668745450633, 'samples': 6072768, 'steps': 31628, 'loss/train': 1.2844632863998413} 11/07/2021 01:42:42 - INFO - __main__ - Step 31630: {'lr': 0.00045216375272228907, 'samples': 6072960, 'steps': 31629, 'loss/train': 1.017756700515747} 11/07/2021 01:42:42 - INFO - __main__ - Step 31631: {'lr': 0.00045216063080842287, 'samples': 6073152, 'steps': 31630, 'loss/train': 1.531299114227295} 11/07/2021 01:42:43 - INFO - __main__ - Step 31632: {'lr': 0.00045215750880346617, 'samples': 6073344, 'steps': 31631, 'loss/train': 1.265383243560791} 11/07/2021 01:42:43 - INFO - __main__ - Step 31633: {'lr': 0.00045215438670742045, 'samples': 6073536, 'steps': 31632, 'loss/train': 1.9258782863616943} 11/07/2021 01:42:44 - INFO - __main__ - Step 31634: {'lr': 0.00045215126452028705, 'samples': 6073728, 'steps': 31633, 'loss/train': 1.5267999172210693} 11/07/2021 01:42:44 - INFO - __main__ - Step 31635: {'lr': 0.00045214814224206744, 'samples': 6073920, 'steps': 31634, 'loss/train': 1.5844289064407349} 11/07/2021 01:42:45 - INFO - __main__ - Step 31636: {'lr': 0.00045214501987276304, 'samples': 6074112, 'steps': 31635, 'loss/train': 1.2692776918411255} 11/07/2021 01:42:45 - INFO - __main__ - Step 31637: {'lr': 0.0004521418974123751, 'samples': 6074304, 'steps': 31636, 'loss/train': 1.620078682899475} 11/07/2021 01:42:45 - INFO - __main__ - Step 31638: {'lr': 0.00045213877486090524, 'samples': 6074496, 'steps': 31637, 'loss/train': 2.1654961109161377} 11/07/2021 01:42:47 - INFO - __main__ - Step 31639: {'lr': 0.00045213565221835473, 'samples': 6074688, 'steps': 31638, 'loss/train': 1.277814507484436} 11/07/2021 01:42:48 - INFO - __main__ - Step 31640: {'lr': 0.00045213252948472505, 'samples': 6074880, 'steps': 31639, 'loss/train': 1.407594919204712} 11/07/2021 01:42:48 - INFO - __main__ - Step 31641: {'lr': 0.0004521294066600175, 'samples': 6075072, 'steps': 31640, 'loss/train': 1.788429617881775} 11/07/2021 01:42:48 - INFO - __main__ - Step 31642: {'lr': 0.0004521262837442336, 'samples': 6075264, 'steps': 31641, 'loss/train': 1.7870267629623413} 11/07/2021 01:42:49 - INFO - __main__ - Step 31643: {'lr': 0.0004521231607373747, 'samples': 6075456, 'steps': 31642, 'loss/train': 1.7724577188491821} 11/07/2021 01:42:49 - INFO - __main__ - Step 31644: {'lr': 0.00045212003763944226, 'samples': 6075648, 'steps': 31643, 'loss/train': 1.35056734085083} 11/07/2021 01:42:50 - INFO - __main__ - Step 31645: {'lr': 0.00045211691445043765, 'samples': 6075840, 'steps': 31644, 'loss/train': 1.589632511138916} 11/07/2021 01:42:51 - INFO - __main__ - Step 31646: {'lr': 0.0004521137911703622, 'samples': 6076032, 'steps': 31645, 'loss/train': 1.411072015762329} 11/07/2021 01:42:51 - INFO - __main__ - Step 31647: {'lr': 0.0004521106677992175, 'samples': 6076224, 'steps': 31646, 'loss/train': 1.1863641738891602} 11/07/2021 01:42:51 - INFO - __main__ - Step 31648: {'lr': 0.0004521075443370048, 'samples': 6076416, 'steps': 31647, 'loss/train': 1.4727528095245361} 11/07/2021 01:42:52 - INFO - __main__ - Step 31649: {'lr': 0.0004521044207837256, 'samples': 6076608, 'steps': 31648, 'loss/train': 1.6370353698730469} 11/07/2021 01:42:52 - INFO - __main__ - Step 31650: {'lr': 0.0004521012971393812, 'samples': 6076800, 'steps': 31649, 'loss/train': 1.5181607007980347} 11/07/2021 01:42:52 - INFO - __main__ - Step 31651: {'lr': 0.0004520981734039731, 'samples': 6076992, 'steps': 31650, 'loss/train': 1.3422218561172485} 11/07/2021 01:42:53 - INFO - __main__ - Step 31652: {'lr': 0.0004520950495775027, 'samples': 6077184, 'steps': 31651, 'loss/train': 1.3841309547424316} 11/07/2021 01:42:54 - INFO - __main__ - Step 31653: {'lr': 0.00045209192565997137, 'samples': 6077376, 'steps': 31652, 'loss/train': 1.6002979278564453} 11/07/2021 01:42:54 - INFO - __main__ - Step 31654: {'lr': 0.00045208880165138054, 'samples': 6077568, 'steps': 31653, 'loss/train': 1.987862229347229} 11/07/2021 01:42:54 - INFO - __main__ - Step 31655: {'lr': 0.0004520856775517316, 'samples': 6077760, 'steps': 31654, 'loss/train': 1.7040534019470215} 11/07/2021 01:42:55 - INFO - __main__ - Step 31656: {'lr': 0.00045208255336102597, 'samples': 6077952, 'steps': 31655, 'loss/train': 1.6022289991378784} 11/07/2021 01:42:56 - INFO - __main__ - Step 31657: {'lr': 0.0004520794290792651, 'samples': 6078144, 'steps': 31656, 'loss/train': 1.529978632926941} 11/07/2021 01:42:56 - INFO - __main__ - Step 31658: {'lr': 0.0004520763047064503, 'samples': 6078336, 'steps': 31657, 'loss/train': 1.1939507722854614} 11/07/2021 01:42:57 - INFO - __main__ - Step 31659: {'lr': 0.0004520731802425831, 'samples': 6078528, 'steps': 31658, 'loss/train': 1.6138416528701782} 11/07/2021 01:42:57 - INFO - __main__ - Step 31660: {'lr': 0.0004520700556876648, 'samples': 6078720, 'steps': 31659, 'loss/train': 1.4073563814163208} 11/07/2021 01:42:57 - INFO - __main__ - Step 31661: {'lr': 0.0004520669310416969, 'samples': 6078912, 'steps': 31660, 'loss/train': 1.5274254083633423} 11/07/2021 01:42:59 - INFO - __main__ - Step 31662: {'lr': 0.0004520638063046807, 'samples': 6079104, 'steps': 31661, 'loss/train': 1.4219837188720703} 11/07/2021 01:42:59 - INFO - __main__ - Step 31663: {'lr': 0.0004520606814766177, 'samples': 6079296, 'steps': 31662, 'loss/train': 1.5876414775848389} 11/07/2021 01:42:59 - INFO - __main__ - Step 31664: {'lr': 0.00045205755655750924, 'samples': 6079488, 'steps': 31663, 'loss/train': 1.267156958580017} 11/07/2021 01:43:00 - INFO - __main__ - Step 31665: {'lr': 0.0004520544315473568, 'samples': 6079680, 'steps': 31664, 'loss/train': 1.715897798538208} 11/07/2021 01:43:00 - INFO - __main__ - Step 31666: {'lr': 0.00045205130644616177, 'samples': 6079872, 'steps': 31665, 'loss/train': 1.593644618988037} 11/07/2021 01:43:01 - INFO - __main__ - Step 31667: {'lr': 0.0004520481812539255, 'samples': 6080064, 'steps': 31666, 'loss/train': 1.4776939153671265} 11/07/2021 01:43:01 - INFO - __main__ - Step 31668: {'lr': 0.00045204505597064943, 'samples': 6080256, 'steps': 31667, 'loss/train': 1.5776082277297974} 11/07/2021 01:43:02 - INFO - __main__ - Step 31669: {'lr': 0.00045204193059633505, 'samples': 6080448, 'steps': 31668, 'loss/train': 1.3004459142684937} 11/07/2021 01:43:02 - INFO - __main__ - Step 31670: {'lr': 0.0004520388051309836, 'samples': 6080640, 'steps': 31669, 'loss/train': 1.5337731838226318} 11/07/2021 01:43:03 - INFO - __main__ - Step 31671: {'lr': 0.00045203567957459657, 'samples': 6080832, 'steps': 31670, 'loss/train': 1.2069718837738037} 11/07/2021 01:43:03 - INFO - __main__ - Step 31672: {'lr': 0.00045203255392717545, 'samples': 6081024, 'steps': 31671, 'loss/train': 5.560124397277832} 11/07/2021 01:43:04 - INFO - __main__ - Step 31673: {'lr': 0.00045202942818872157, 'samples': 6081216, 'steps': 31672, 'loss/train': 1.22538161277771} 11/07/2021 01:43:04 - INFO - __main__ - Step 31674: {'lr': 0.0004520263023592363, 'samples': 6081408, 'steps': 31673, 'loss/train': 1.615771770477295} 11/07/2021 01:43:05 - INFO - __main__ - Step 31675: {'lr': 0.00045202317643872113, 'samples': 6081600, 'steps': 31674, 'loss/train': 1.6662970781326294} 11/07/2021 01:43:05 - INFO - __main__ - Step 31676: {'lr': 0.00045202005042717743, 'samples': 6081792, 'steps': 31675, 'loss/train': 1.475319266319275} 11/07/2021 01:43:05 - INFO - __main__ - Step 31677: {'lr': 0.0004520169243246066, 'samples': 6081984, 'steps': 31676, 'loss/train': 1.3714836835861206} 11/07/2021 01:43:06 - INFO - __main__ - Step 31678: {'lr': 0.0004520137981310101, 'samples': 6082176, 'steps': 31677, 'loss/train': 1.3201550245285034} 11/07/2021 01:43:07 - INFO - __main__ - Step 31679: {'lr': 0.0004520106718463893, 'samples': 6082368, 'steps': 31678, 'loss/train': 1.5420702695846558} 11/07/2021 01:43:07 - INFO - __main__ - Step 31680: {'lr': 0.0004520075454707456, 'samples': 6082560, 'steps': 31679, 'loss/train': 1.2182254791259766} 11/07/2021 01:43:07 - INFO - __main__ - Step 31681: {'lr': 0.0004520044190040804, 'samples': 6082752, 'steps': 31680, 'loss/train': 1.6463545560836792} 11/07/2021 01:43:08 - INFO - __main__ - Step 31682: {'lr': 0.0004520012924463951, 'samples': 6082944, 'steps': 31681, 'loss/train': 1.6987247467041016} 11/07/2021 01:43:09 - INFO - __main__ - Step 31683: {'lr': 0.0004519981657976912, 'samples': 6083136, 'steps': 31682, 'loss/train': 1.5654196739196777} 11/07/2021 01:43:09 - INFO - __main__ - Step 31684: {'lr': 0.00045199503905797, 'samples': 6083328, 'steps': 31683, 'loss/train': 1.4091193675994873} 11/07/2021 01:43:10 - INFO - __main__ - Step 31685: {'lr': 0.0004519919122272329, 'samples': 6083520, 'steps': 31684, 'loss/train': 0.2060466855764389} 11/07/2021 01:43:10 - INFO - __main__ - Step 31686: {'lr': 0.00045198878530548146, 'samples': 6083712, 'steps': 31685, 'loss/train': 1.2138254642486572} 11/07/2021 01:43:10 - INFO - __main__ - Step 31687: {'lr': 0.0004519856582927169, 'samples': 6083904, 'steps': 31686, 'loss/train': 0.8990980982780457} 11/07/2021 01:43:11 - INFO - __main__ - Step 31688: {'lr': 0.00045198253118894084, 'samples': 6084096, 'steps': 31687, 'loss/train': 1.5900429487228394} 11/07/2021 01:43:12 - INFO - __main__ - Step 31689: {'lr': 0.0004519794039941545, 'samples': 6084288, 'steps': 31688, 'loss/train': 1.7415271997451782} 11/07/2021 01:43:12 - INFO - __main__ - Step 31690: {'lr': 0.0004519762767083593, 'samples': 6084480, 'steps': 31689, 'loss/train': 1.670418620109558} 11/07/2021 01:43:12 - INFO - __main__ - Step 31691: {'lr': 0.00045197314933155677, 'samples': 6084672, 'steps': 31690, 'loss/train': 1.6502057313919067} 11/07/2021 01:43:13 - INFO - __main__ - Step 31692: {'lr': 0.0004519700218637482, 'samples': 6084864, 'steps': 31691, 'loss/train': 1.5056148767471313} 11/07/2021 01:43:13 - INFO - __main__ - Step 31693: {'lr': 0.00045196689430493516, 'samples': 6085056, 'steps': 31692, 'loss/train': 1.3275924921035767} 11/07/2021 01:43:14 - INFO - __main__ - Step 31694: {'lr': 0.00045196376665511883, 'samples': 6085248, 'steps': 31693, 'loss/train': 1.2099881172180176} 11/07/2021 01:43:14 - INFO - __main__ - Step 31695: {'lr': 0.00045196063891430086, 'samples': 6085440, 'steps': 31694, 'loss/train': 1.4326869249343872} 11/07/2021 01:43:15 - INFO - __main__ - Step 31696: {'lr': 0.0004519575110824825, 'samples': 6085632, 'steps': 31695, 'loss/train': 1.1381953954696655} 11/07/2021 01:43:15 - INFO - __main__ - Step 31697: {'lr': 0.0004519543831596652, 'samples': 6085824, 'steps': 31696, 'loss/train': 1.0026839971542358} 11/07/2021 01:43:15 - INFO - __main__ - Step 31698: {'lr': 0.0004519512551458503, 'samples': 6086016, 'steps': 31697, 'loss/train': 1.536896824836731} 11/07/2021 01:43:17 - INFO - __main__ - Step 31699: {'lr': 0.0004519481270410394, 'samples': 6086208, 'steps': 31698, 'loss/train': 1.6450562477111816} 11/07/2021 01:43:17 - INFO - __main__ - Step 31700: {'lr': 0.00045194499884523376, 'samples': 6086400, 'steps': 31699, 'loss/train': 1.4656116962432861} 11/07/2021 01:43:17 - INFO - __main__ - Step 31701: {'lr': 0.0004519418705584348, 'samples': 6086592, 'steps': 31700, 'loss/train': 1.449439525604248} 11/07/2021 01:43:18 - INFO - __main__ - Step 31702: {'lr': 0.0004519387421806439, 'samples': 6086784, 'steps': 31701, 'loss/train': 1.4537426233291626} 11/07/2021 01:43:18 - INFO - __main__ - Step 31703: {'lr': 0.0004519356137118625, 'samples': 6086976, 'steps': 31702, 'loss/train': 1.1907501220703125} 11/07/2021 01:43:20 - INFO - __main__ - Step 31704: {'lr': 0.00045193248515209216, 'samples': 6087168, 'steps': 31703, 'loss/train': 1.3883063793182373} 11/07/2021 01:43:20 - INFO - __main__ - Step 31705: {'lr': 0.0004519293565013341, 'samples': 6087360, 'steps': 31704, 'loss/train': 1.6694093942642212} 11/07/2021 01:43:20 - INFO - __main__ - Step 31706: {'lr': 0.0004519262277595898, 'samples': 6087552, 'steps': 31705, 'loss/train': 1.9630138874053955} 11/07/2021 01:43:21 - INFO - __main__ - Step 31707: {'lr': 0.0004519230989268606, 'samples': 6087744, 'steps': 31706, 'loss/train': 1.0818159580230713} 11/07/2021 01:43:21 - INFO - __main__ - Step 31708: {'lr': 0.000451919970003148, 'samples': 6087936, 'steps': 31707, 'loss/train': 1.182492971420288} 11/07/2021 01:43:21 - INFO - __main__ - Step 31709: {'lr': 0.0004519168409884534, 'samples': 6088128, 'steps': 31708, 'loss/train': 0.6430357098579407} 11/07/2021 01:43:23 - INFO - __main__ - Step 31710: {'lr': 0.00045191371188277817, 'samples': 6088320, 'steps': 31709, 'loss/train': 1.2160098552703857} 11/07/2021 01:43:23 - INFO - __main__ - Step 31711: {'lr': 0.0004519105826861237, 'samples': 6088512, 'steps': 31710, 'loss/train': 2.0542500019073486} 11/07/2021 01:43:23 - INFO - __main__ - Step 31712: {'lr': 0.0004519074533984915, 'samples': 6088704, 'steps': 31711, 'loss/train': 1.3618507385253906} 11/07/2021 01:43:24 - INFO - __main__ - Step 31713: {'lr': 0.0004519043240198829, 'samples': 6088896, 'steps': 31712, 'loss/train': 1.457452654838562} 11/07/2021 01:43:24 - INFO - __main__ - Step 31714: {'lr': 0.0004519011945502993, 'samples': 6089088, 'steps': 31713, 'loss/train': 0.8872636556625366} 11/07/2021 01:43:26 - INFO - __main__ - Step 31715: {'lr': 0.00045189806498974216, 'samples': 6089280, 'steps': 31714, 'loss/train': 1.2994664907455444} 11/07/2021 01:43:26 - INFO - __main__ - Step 31716: {'lr': 0.00045189493533821285, 'samples': 6089472, 'steps': 31715, 'loss/train': 1.7757633924484253} 11/07/2021 01:43:26 - INFO - __main__ - Step 31717: {'lr': 0.0004518918055957128, 'samples': 6089664, 'steps': 31716, 'loss/train': 2.037893056869507} 11/07/2021 01:43:27 - INFO - __main__ - Step 31718: {'lr': 0.0004518886757622435, 'samples': 6089856, 'steps': 31717, 'loss/train': 1.869469165802002} 11/07/2021 01:43:27 - INFO - __main__ - Step 31719: {'lr': 0.0004518855458378062, 'samples': 6090048, 'steps': 31718, 'loss/train': 1.8262141942977905} 11/07/2021 01:43:27 - INFO - __main__ - Step 31720: {'lr': 0.0004518824158224023, 'samples': 6090240, 'steps': 31719, 'loss/train': 1.5757958889007568} 11/07/2021 01:43:28 - INFO - __main__ - Step 31721: {'lr': 0.00045187928571603343, 'samples': 6090432, 'steps': 31720, 'loss/train': 1.4796817302703857} 11/07/2021 01:43:29 - INFO - __main__ - Step 31722: {'lr': 0.0004518761555187008, 'samples': 6090624, 'steps': 31721, 'loss/train': 1.763049840927124} 11/07/2021 01:43:29 - INFO - __main__ - Step 31723: {'lr': 0.00045187302523040597, 'samples': 6090816, 'steps': 31722, 'loss/train': 1.7150776386260986} 11/07/2021 01:43:29 - INFO - __main__ - Step 31724: {'lr': 0.00045186989485115014, 'samples': 6091008, 'steps': 31723, 'loss/train': 1.8566874265670776} 11/07/2021 01:43:30 - INFO - __main__ - Step 31725: {'lr': 0.000451866764380935, 'samples': 6091200, 'steps': 31724, 'loss/train': 0.8527278900146484} 11/07/2021 01:43:30 - INFO - __main__ - Step 31726: {'lr': 0.0004518636338197617, 'samples': 6091392, 'steps': 31725, 'loss/train': 0.79350346326828} 11/07/2021 01:43:31 - INFO - __main__ - Step 31727: {'lr': 0.00045186050316763186, 'samples': 6091584, 'steps': 31726, 'loss/train': 1.8084723949432373} 11/07/2021 01:43:32 - INFO - __main__ - Step 31728: {'lr': 0.0004518573724245467, 'samples': 6091776, 'steps': 31727, 'loss/train': 1.43443763256073} 11/07/2021 01:43:32 - INFO - __main__ - Step 31729: {'lr': 0.00045185424159050776, 'samples': 6091968, 'steps': 31728, 'loss/train': 0.6599883437156677} 11/07/2021 01:43:32 - INFO - __main__ - Step 31730: {'lr': 0.00045185111066551643, 'samples': 6092160, 'steps': 31729, 'loss/train': 1.3211134672164917} 11/07/2021 01:43:33 - INFO - __main__ - Step 31731: {'lr': 0.0004518479796495741, 'samples': 6092352, 'steps': 31730, 'loss/train': 1.319759488105774} 11/07/2021 01:43:33 - INFO - __main__ - Step 31732: {'lr': 0.00045184484854268216, 'samples': 6092544, 'steps': 31731, 'loss/train': 2.2386746406555176} 11/07/2021 01:43:34 - INFO - __main__ - Step 31733: {'lr': 0.00045184171734484203, 'samples': 6092736, 'steps': 31732, 'loss/train': 1.6354384422302246} 11/07/2021 01:43:34 - INFO - __main__ - Step 31734: {'lr': 0.00045183858605605517, 'samples': 6092928, 'steps': 31733, 'loss/train': 1.3734209537506104} 11/07/2021 01:43:35 - INFO - __main__ - Step 31735: {'lr': 0.00045183545467632295, 'samples': 6093120, 'steps': 31734, 'loss/train': 1.4117457866668701} 11/07/2021 01:43:35 - INFO - __main__ - Step 31736: {'lr': 0.0004518323232056468, 'samples': 6093312, 'steps': 31735, 'loss/train': 1.3299400806427002} 11/07/2021 01:43:35 - INFO - __main__ - Step 31737: {'lr': 0.0004518291916440281, 'samples': 6093504, 'steps': 31736, 'loss/train': 1.7732288837432861} 11/07/2021 01:43:36 - INFO - __main__ - Step 31738: {'lr': 0.0004518260599914683, 'samples': 6093696, 'steps': 31737, 'loss/train': 1.7293723821640015} 11/07/2021 01:43:37 - INFO - __main__ - Step 31739: {'lr': 0.0004518229282479688, 'samples': 6093888, 'steps': 31738, 'loss/train': 1.130365252494812} 11/07/2021 01:43:37 - INFO - __main__ - Step 31740: {'lr': 0.000451819796413531, 'samples': 6094080, 'steps': 31739, 'loss/train': 1.5491985082626343} 11/07/2021 01:43:37 - INFO - __main__ - Step 31741: {'lr': 0.0004518166644881563, 'samples': 6094272, 'steps': 31740, 'loss/train': 1.8045982122421265} 11/07/2021 01:43:38 - INFO - __main__ - Step 31742: {'lr': 0.0004518135324718461, 'samples': 6094464, 'steps': 31741, 'loss/train': 1.6794017553329468} 11/07/2021 01:43:39 - INFO - __main__ - Step 31743: {'lr': 0.00045181040036460185, 'samples': 6094656, 'steps': 31742, 'loss/train': 1.4577114582061768} 11/07/2021 01:43:39 - INFO - __main__ - Step 31744: {'lr': 0.0004518072681664249, 'samples': 6094848, 'steps': 31743, 'loss/train': 1.1413843631744385} 11/07/2021 01:43:40 - INFO - __main__ - Step 31745: {'lr': 0.0004518041358773168, 'samples': 6095040, 'steps': 31744, 'loss/train': 1.9787827730178833} 11/07/2021 01:43:40 - INFO - __main__ - Step 31746: {'lr': 0.0004518010034972788, 'samples': 6095232, 'steps': 31745, 'loss/train': 0.9560813903808594} 11/07/2021 01:43:40 - INFO - __main__ - Step 31747: {'lr': 0.0004517978710263124, 'samples': 6095424, 'steps': 31746, 'loss/train': 2.453749179840088} 11/07/2021 01:43:42 - INFO - __main__ - Step 31748: {'lr': 0.0004517947384644191, 'samples': 6095616, 'steps': 31747, 'loss/train': 1.4488246440887451} 11/07/2021 01:43:42 - INFO - __main__ - Step 31749: {'lr': 0.00045179160581160005, 'samples': 6095808, 'steps': 31748, 'loss/train': 1.95890212059021} 11/07/2021 01:43:42 - INFO - __main__ - Step 31750: {'lr': 0.0004517884730678569, 'samples': 6096000, 'steps': 31749, 'loss/train': 2.0341854095458984} 11/07/2021 01:43:43 - INFO - __main__ - Step 31751: {'lr': 0.00045178534023319097, 'samples': 6096192, 'steps': 31750, 'loss/train': 1.2793896198272705} 11/07/2021 01:43:43 - INFO - __main__ - Step 31752: {'lr': 0.00045178220730760367, 'samples': 6096384, 'steps': 31751, 'loss/train': 1.2757604122161865} 11/07/2021 01:43:43 - INFO - __main__ - Step 31753: {'lr': 0.0004517790742910964, 'samples': 6096576, 'steps': 31752, 'loss/train': 1.3032244443893433} 11/07/2021 01:43:44 - INFO - __main__ - Step 31754: {'lr': 0.0004517759411836706, 'samples': 6096768, 'steps': 31753, 'loss/train': 1.0655202865600586} 11/07/2021 01:43:45 - INFO - __main__ - Step 31755: {'lr': 0.0004517728079853277, 'samples': 6096960, 'steps': 31754, 'loss/train': 1.4300718307495117} 11/07/2021 01:43:45 - INFO - __main__ - Step 31756: {'lr': 0.0004517696746960691, 'samples': 6097152, 'steps': 31755, 'loss/train': 1.6231884956359863} 11/07/2021 01:43:45 - INFO - __main__ - Step 31757: {'lr': 0.00045176654131589617, 'samples': 6097344, 'steps': 31756, 'loss/train': 1.2734031677246094} 11/07/2021 01:43:46 - INFO - __main__ - Step 31758: {'lr': 0.0004517634078448103, 'samples': 6097536, 'steps': 31757, 'loss/train': 1.4724065065383911} 11/07/2021 01:43:46 - INFO - __main__ - Step 31759: {'lr': 0.0004517602742828131, 'samples': 6097728, 'steps': 31758, 'loss/train': 1.2505757808685303} 11/07/2021 01:43:47 - INFO - __main__ - Step 31760: {'lr': 0.0004517571406299057, 'samples': 6097920, 'steps': 31759, 'loss/train': 1.5878269672393799} 11/07/2021 01:43:47 - INFO - __main__ - Step 31761: {'lr': 0.0004517540068860897, 'samples': 6098112, 'steps': 31760, 'loss/train': 1.867502212524414} 11/07/2021 01:43:48 - INFO - __main__ - Step 31762: {'lr': 0.0004517508730513664, 'samples': 6098304, 'steps': 31761, 'loss/train': 1.4676836729049683} 11/07/2021 01:43:48 - INFO - __main__ - Step 31763: {'lr': 0.00045174773912573735, 'samples': 6098496, 'steps': 31762, 'loss/train': 1.7907060384750366} 11/07/2021 01:43:49 - INFO - __main__ - Step 31764: {'lr': 0.00045174460510920386, 'samples': 6098688, 'steps': 31763, 'loss/train': 1.3359169960021973} 11/07/2021 01:43:49 - INFO - __main__ - Step 31765: {'lr': 0.00045174147100176734, 'samples': 6098880, 'steps': 31764, 'loss/train': 1.377150535583496} 11/07/2021 01:43:50 - INFO - __main__ - Step 31766: {'lr': 0.00045173833680342925, 'samples': 6099072, 'steps': 31765, 'loss/train': 1.5649809837341309} 11/07/2021 01:43:50 - INFO - __main__ - Step 31767: {'lr': 0.00045173520251419095, 'samples': 6099264, 'steps': 31766, 'loss/train': 1.931465983390808} 11/07/2021 01:43:51 - INFO - __main__ - Step 31768: {'lr': 0.0004517320681340539, 'samples': 6099456, 'steps': 31767, 'loss/train': 1.7786474227905273} 11/07/2021 01:43:51 - INFO - __main__ - Step 31769: {'lr': 0.0004517289336630195, 'samples': 6099648, 'steps': 31768, 'loss/train': 1.407793641090393} 11/07/2021 01:43:52 - INFO - __main__ - Step 31770: {'lr': 0.0004517257991010891, 'samples': 6099840, 'steps': 31769, 'loss/train': 1.3097972869873047} 11/07/2021 01:43:52 - INFO - __main__ - Step 31771: {'lr': 0.0004517226644482642, 'samples': 6100032, 'steps': 31770, 'loss/train': 1.1694785356521606} 11/07/2021 01:43:53 - INFO - __main__ - Step 31772: {'lr': 0.00045171952970454623, 'samples': 6100224, 'steps': 31771, 'loss/train': 1.6580748558044434} 11/07/2021 01:43:53 - INFO - __main__ - Step 31773: {'lr': 0.0004517163948699365, 'samples': 6100416, 'steps': 31772, 'loss/train': 1.575675129890442} 11/07/2021 01:43:53 - INFO - __main__ - Step 31774: {'lr': 0.00045171325994443644, 'samples': 6100608, 'steps': 31773, 'loss/train': 1.6436625719070435} 11/07/2021 01:43:54 - INFO - __main__ - Step 31775: {'lr': 0.00045171012492804753, 'samples': 6100800, 'steps': 31774, 'loss/train': 0.5877825021743774} 11/07/2021 01:43:55 - INFO - __main__ - Step 31776: {'lr': 0.0004517069898207712, 'samples': 6100992, 'steps': 31775, 'loss/train': 1.739137053489685} 11/07/2021 01:43:55 - INFO - __main__ - Step 31777: {'lr': 0.00045170385462260876, 'samples': 6101184, 'steps': 31776, 'loss/train': 1.6380271911621094} 11/07/2021 01:43:55 - INFO - __main__ - Step 31778: {'lr': 0.0004517007193335617, 'samples': 6101376, 'steps': 31777, 'loss/train': 1.0152171850204468} 11/07/2021 01:43:56 - INFO - __main__ - Step 31779: {'lr': 0.0004516975839536314, 'samples': 6101568, 'steps': 31778, 'loss/train': 1.0910487174987793} 11/07/2021 01:43:57 - INFO - __main__ - Step 31780: {'lr': 0.0004516944484828193, 'samples': 6101760, 'steps': 31779, 'loss/train': 1.5282224416732788} 11/07/2021 01:43:57 - INFO - __main__ - Step 31781: {'lr': 0.0004516913129211268, 'samples': 6101952, 'steps': 31780, 'loss/train': 0.6756600141525269} 11/07/2021 01:43:57 - INFO - __main__ - Step 31782: {'lr': 0.00045168817726855525, 'samples': 6102144, 'steps': 31781, 'loss/train': 0.26307106018066406} 11/07/2021 01:43:58 - INFO - __main__ - Step 31783: {'lr': 0.0004516850415251061, 'samples': 6102336, 'steps': 31782, 'loss/train': 1.4687156677246094} 11/07/2021 01:43:58 - INFO - __main__ - Step 31784: {'lr': 0.0004516819056907809, 'samples': 6102528, 'steps': 31783, 'loss/train': 1.848503589630127} 11/07/2021 01:43:59 - INFO - __main__ - Step 31785: {'lr': 0.0004516787697655809, 'samples': 6102720, 'steps': 31784, 'loss/train': 0.48024705052375793} 11/07/2021 01:44:00 - INFO - __main__ - Step 31786: {'lr': 0.0004516756337495075, 'samples': 6102912, 'steps': 31785, 'loss/train': 1.448751449584961} 11/07/2021 01:44:00 - INFO - __main__ - Step 31787: {'lr': 0.0004516724976425622, 'samples': 6103104, 'steps': 31786, 'loss/train': 1.7140107154846191} 11/07/2021 01:44:00 - INFO - __main__ - Step 31788: {'lr': 0.0004516693614447464, 'samples': 6103296, 'steps': 31787, 'loss/train': 1.5359638929367065} 11/07/2021 01:44:01 - INFO - __main__ - Step 31789: {'lr': 0.0004516662251560615, 'samples': 6103488, 'steps': 31788, 'loss/train': 1.5012446641921997} 11/07/2021 01:44:01 - INFO - __main__ - Step 31790: {'lr': 0.0004516630887765089, 'samples': 6103680, 'steps': 31789, 'loss/train': 1.7036118507385254} 11/07/2021 01:44:02 - INFO - __main__ - Step 31791: {'lr': 0.00045165995230609003, 'samples': 6103872, 'steps': 31790, 'loss/train': 1.1799037456512451} 11/07/2021 01:44:02 - INFO - __main__ - Step 31792: {'lr': 0.0004516568157448063, 'samples': 6104064, 'steps': 31791, 'loss/train': 1.6661583185195923} 11/07/2021 01:44:03 - INFO - __main__ - Step 31793: {'lr': 0.00045165367909265916, 'samples': 6104256, 'steps': 31792, 'loss/train': 0.8608067631721497} 11/07/2021 01:44:03 - INFO - __main__ - Step 31794: {'lr': 0.00045165054234964984, 'samples': 6104448, 'steps': 31793, 'loss/train': 1.4671131372451782} 11/07/2021 01:44:03 - INFO - __main__ - Step 31795: {'lr': 0.0004516474055157801, 'samples': 6104640, 'steps': 31794, 'loss/train': 1.7963557243347168} 11/07/2021 01:44:04 - INFO - __main__ - Step 31796: {'lr': 0.000451644268591051, 'samples': 6104832, 'steps': 31795, 'loss/train': 1.595018982887268} 11/07/2021 01:44:05 - INFO - __main__ - Step 31797: {'lr': 0.00045164113157546414, 'samples': 6105024, 'steps': 31796, 'loss/train': 1.6584199666976929} 11/07/2021 01:44:05 - INFO - __main__ - Step 31798: {'lr': 0.0004516379944690209, 'samples': 6105216, 'steps': 31797, 'loss/train': 1.519771933555603} 11/07/2021 01:44:05 - INFO - __main__ - Step 31799: {'lr': 0.0004516348572717227, 'samples': 6105408, 'steps': 31798, 'loss/train': 1.4903631210327148} 11/07/2021 01:44:06 - INFO - __main__ - Step 31800: {'lr': 0.000451631719983571, 'samples': 6105600, 'steps': 31799, 'loss/train': 1.4289230108261108} 11/07/2021 01:44:07 - INFO - __main__ - Step 31801: {'lr': 0.00045162858260456705, 'samples': 6105792, 'steps': 31800, 'loss/train': 1.5293996334075928} 11/07/2021 01:44:07 - INFO - __main__ - Step 31802: {'lr': 0.0004516254451347125, 'samples': 6105984, 'steps': 31801, 'loss/train': 1.545749306678772} 11/07/2021 01:44:08 - INFO - __main__ - Step 31803: {'lr': 0.0004516223075740085, 'samples': 6106176, 'steps': 31802, 'loss/train': 0.7313265204429626} 11/07/2021 01:44:08 - INFO - __main__ - Step 31804: {'lr': 0.00045161916992245664, 'samples': 6106368, 'steps': 31803, 'loss/train': 1.424360990524292} 11/07/2021 01:44:08 - INFO - __main__ - Step 31805: {'lr': 0.0004516160321800584, 'samples': 6106560, 'steps': 31804, 'loss/train': 1.6591503620147705} 11/07/2021 01:44:09 - INFO - __main__ - Step 31806: {'lr': 0.000451612894346815, 'samples': 6106752, 'steps': 31805, 'loss/train': 1.6420533657073975} 11/07/2021 01:44:10 - INFO - __main__ - Step 31807: {'lr': 0.00045160975642272795, 'samples': 6106944, 'steps': 31806, 'loss/train': 1.7411733865737915} 11/07/2021 01:44:10 - INFO - __main__ - Step 31808: {'lr': 0.0004516066184077986, 'samples': 6107136, 'steps': 31807, 'loss/train': 0.8653108477592468} 11/07/2021 01:44:10 - INFO - __main__ - Step 31809: {'lr': 0.0004516034803020285, 'samples': 6107328, 'steps': 31808, 'loss/train': 1.52175772190094} 11/07/2021 01:44:11 - INFO - __main__ - Step 31810: {'lr': 0.0004516003421054189, 'samples': 6107520, 'steps': 31809, 'loss/train': 1.6941521167755127} 11/07/2021 01:44:12 - INFO - __main__ - Step 31811: {'lr': 0.0004515972038179714, 'samples': 6107712, 'steps': 31810, 'loss/train': 1.3670642375946045} 11/07/2021 01:44:12 - INFO - __main__ - Step 31812: {'lr': 0.0004515940654396872, 'samples': 6107904, 'steps': 31811, 'loss/train': 2.074129581451416} 11/07/2021 01:44:13 - INFO - __main__ - Step 31813: {'lr': 0.00045159092697056794, 'samples': 6108096, 'steps': 31812, 'loss/train': 1.4383933544158936} 11/07/2021 01:44:13 - INFO - __main__ - Step 31814: {'lr': 0.00045158778841061483, 'samples': 6108288, 'steps': 31813, 'loss/train': 1.4272984266281128} 11/07/2021 01:44:13 - INFO - __main__ - Step 31815: {'lr': 0.0004515846497598294, 'samples': 6108480, 'steps': 31814, 'loss/train': 1.7123581171035767} 11/07/2021 01:44:14 - INFO - __main__ - Step 31816: {'lr': 0.000451581511018213, 'samples': 6108672, 'steps': 31815, 'loss/train': 1.681820034980774} 11/07/2021 01:44:15 - INFO - __main__ - Step 31817: {'lr': 0.00045157837218576713, 'samples': 6108864, 'steps': 31816, 'loss/train': 1.696065068244934} 11/07/2021 01:44:15 - INFO - __main__ - Step 31818: {'lr': 0.00045157523326249316, 'samples': 6109056, 'steps': 31817, 'loss/train': 1.6681617498397827} 11/07/2021 01:44:15 - INFO - __main__ - Step 31819: {'lr': 0.00045157209424839253, 'samples': 6109248, 'steps': 31818, 'loss/train': 1.529592752456665} 11/07/2021 01:44:16 - INFO - __main__ - Step 31820: {'lr': 0.0004515689551434665, 'samples': 6109440, 'steps': 31819, 'loss/train': 1.5102119445800781} 11/07/2021 01:44:17 - INFO - __main__ - Step 31821: {'lr': 0.00045156581594771675, 'samples': 6109632, 'steps': 31820, 'loss/train': 1.5752092599868774} 11/07/2021 01:44:17 - INFO - __main__ - Step 31822: {'lr': 0.00045156267666114446, 'samples': 6109824, 'steps': 31821, 'loss/train': 1.095093846321106} 11/07/2021 01:44:17 - INFO - __main__ - Step 31823: {'lr': 0.0004515595372837512, 'samples': 6110016, 'steps': 31822, 'loss/train': 1.6193000078201294} 11/07/2021 01:44:18 - INFO - __main__ - Step 31824: {'lr': 0.00045155639781553825, 'samples': 6110208, 'steps': 31823, 'loss/train': 1.6955679655075073} 11/07/2021 01:44:18 - INFO - __main__ - Step 31825: {'lr': 0.00045155325825650715, 'samples': 6110400, 'steps': 31824, 'loss/train': 1.2122478485107422} 11/07/2021 01:44:18 - INFO - __main__ - Step 31826: {'lr': 0.00045155011860665927, 'samples': 6110592, 'steps': 31825, 'loss/train': 1.2631003856658936} 11/07/2021 01:44:20 - INFO - __main__ - Step 31827: {'lr': 0.00045154697886599606, 'samples': 6110784, 'steps': 31826, 'loss/train': 0.4462384581565857} 11/07/2021 01:44:20 - INFO - __main__ - Step 31828: {'lr': 0.0004515438390345188, 'samples': 6110976, 'steps': 31827, 'loss/train': 1.3006891012191772} 11/07/2021 01:44:20 - INFO - __main__ - Step 31829: {'lr': 0.00045154069911222905, 'samples': 6111168, 'steps': 31828, 'loss/train': 1.6836390495300293} 11/07/2021 01:44:21 - INFO - __main__ - Step 31830: {'lr': 0.0004515375590991281, 'samples': 6111360, 'steps': 31829, 'loss/train': 1.4358208179473877} 11/07/2021 01:44:21 - INFO - __main__ - Step 31831: {'lr': 0.0004515344189952175, 'samples': 6111552, 'steps': 31830, 'loss/train': 1.475838541984558} 11/07/2021 01:44:22 - INFO - __main__ - Step 31832: {'lr': 0.0004515312788004986, 'samples': 6111744, 'steps': 31831, 'loss/train': 1.4673216342926025} 11/07/2021 01:44:22 - INFO - __main__ - Step 31833: {'lr': 0.00045152813851497274, 'samples': 6111936, 'steps': 31832, 'loss/train': 1.525496006011963} 11/07/2021 01:44:23 - INFO - __main__ - Step 31834: {'lr': 0.0004515249981386416, 'samples': 6112128, 'steps': 31833, 'loss/train': 1.4047716856002808} 11/07/2021 01:44:23 - INFO - __main__ - Step 31835: {'lr': 0.0004515218576715062, 'samples': 6112320, 'steps': 31834, 'loss/train': 1.6120771169662476} 11/07/2021 01:44:23 - INFO - __main__ - Step 31836: {'lr': 0.00045151871711356827, 'samples': 6112512, 'steps': 31835, 'loss/train': 0.9932732582092285} 11/07/2021 01:44:25 - INFO - __main__ - Step 31837: {'lr': 0.0004515155764648291, 'samples': 6112704, 'steps': 31836, 'loss/train': 1.6056442260742188} 11/07/2021 01:44:25 - INFO - __main__ - Step 31838: {'lr': 0.0004515124357252901, 'samples': 6112896, 'steps': 31837, 'loss/train': 0.9801680445671082} 11/07/2021 01:44:25 - INFO - __main__ - Step 31839: {'lr': 0.0004515092948949527, 'samples': 6113088, 'steps': 31838, 'loss/train': 1.792125940322876} 11/07/2021 01:44:26 - INFO - __main__ - Step 31840: {'lr': 0.00045150615397381835, 'samples': 6113280, 'steps': 31839, 'loss/train': 1.0138678550720215} 11/07/2021 01:44:26 - INFO - __main__ - Step 31841: {'lr': 0.0004515030129618884, 'samples': 6113472, 'steps': 31840, 'loss/train': 1.8094062805175781} 11/07/2021 01:44:27 - INFO - __main__ - Step 31842: {'lr': 0.0004514998718591643, 'samples': 6113664, 'steps': 31841, 'loss/train': 1.5744199752807617} 11/07/2021 01:44:27 - INFO - __main__ - Step 31843: {'lr': 0.0004514967306656475, 'samples': 6113856, 'steps': 31842, 'loss/train': 1.845720648765564} 11/07/2021 01:44:28 - INFO - __main__ - Step 31844: {'lr': 0.0004514935893813394, 'samples': 6114048, 'steps': 31843, 'loss/train': 1.3678845167160034} 11/07/2021 01:44:28 - INFO - __main__ - Step 31845: {'lr': 0.00045149044800624135, 'samples': 6114240, 'steps': 31844, 'loss/train': 1.5553712844848633} 11/07/2021 01:44:28 - INFO - __main__ - Step 31846: {'lr': 0.0004514873065403549, 'samples': 6114432, 'steps': 31845, 'loss/train': 1.1362268924713135} 11/07/2021 01:44:29 - INFO - __main__ - Step 31847: {'lr': 0.0004514841649836813, 'samples': 6114624, 'steps': 31846, 'loss/train': 1.644763708114624} 11/07/2021 01:44:30 - INFO - __main__ - Step 31848: {'lr': 0.000451481023336222, 'samples': 6114816, 'steps': 31847, 'loss/train': 1.5752571821212769} 11/07/2021 01:44:30 - INFO - __main__ - Step 31849: {'lr': 0.0004514778815979785, 'samples': 6115008, 'steps': 31848, 'loss/train': 1.764130711555481} 11/07/2021 01:44:30 - INFO - __main__ - Step 31850: {'lr': 0.0004514747397689522, 'samples': 6115200, 'steps': 31849, 'loss/train': 1.4945341348648071} 11/07/2021 01:44:31 - INFO - __main__ - Step 31851: {'lr': 0.0004514715978491445, 'samples': 6115392, 'steps': 31850, 'loss/train': 1.451790452003479} 11/07/2021 01:44:31 - INFO - __main__ - Step 31852: {'lr': 0.0004514684558385568, 'samples': 6115584, 'steps': 31851, 'loss/train': 1.7369520664215088} 11/07/2021 01:44:32 - INFO - __main__ - Step 31853: {'lr': 0.0004514653137371905, 'samples': 6115776, 'steps': 31852, 'loss/train': 1.2488102912902832} 11/07/2021 01:44:33 - INFO - __main__ - Step 31854: {'lr': 0.000451462171545047, 'samples': 6115968, 'steps': 31853, 'loss/train': 1.4568167924880981} 11/07/2021 01:44:33 - INFO - __main__ - Step 31855: {'lr': 0.00045145902926212785, 'samples': 6116160, 'steps': 31854, 'loss/train': 2.047454357147217} 11/07/2021 01:44:33 - INFO - __main__ - Step 31856: {'lr': 0.0004514558868884343, 'samples': 6116352, 'steps': 31855, 'loss/train': 1.3141847848892212} 11/07/2021 01:44:34 - INFO - __main__ - Step 31857: {'lr': 0.00045145274442396786, 'samples': 6116544, 'steps': 31856, 'loss/train': 1.307631015777588} 11/07/2021 01:44:35 - INFO - __main__ - Step 31858: {'lr': 0.00045144960186872996, 'samples': 6116736, 'steps': 31857, 'loss/train': 1.6855310201644897} 11/07/2021 01:44:35 - INFO - __main__ - Step 31859: {'lr': 0.0004514464592227219, 'samples': 6116928, 'steps': 31858, 'loss/train': 1.911537528038025} 11/07/2021 01:44:35 - INFO - __main__ - Step 31860: {'lr': 0.0004514433164859453, 'samples': 6117120, 'steps': 31859, 'loss/train': 1.107447862625122} 11/07/2021 01:44:36 - INFO - __main__ - Step 31861: {'lr': 0.0004514401736584013, 'samples': 6117312, 'steps': 31860, 'loss/train': 1.288212537765503} 11/07/2021 01:44:36 - INFO - __main__ - Step 31862: {'lr': 0.0004514370307400916, 'samples': 6117504, 'steps': 31861, 'loss/train': 2.230525016784668} 11/07/2021 01:44:36 - INFO - __main__ - Step 31863: {'lr': 0.00045143388773101733, 'samples': 6117696, 'steps': 31862, 'loss/train': 1.853481411933899} 11/07/2021 01:44:37 - INFO - __main__ - Step 31864: {'lr': 0.0004514307446311802, 'samples': 6117888, 'steps': 31863, 'loss/train': 1.6902889013290405} 11/07/2021 01:44:38 - INFO - __main__ - Step 31865: {'lr': 0.0004514276014405814, 'samples': 6118080, 'steps': 31864, 'loss/train': 0.38764888048171997} 11/07/2021 01:44:38 - INFO - __main__ - Step 31866: {'lr': 0.00045142445815922244, 'samples': 6118272, 'steps': 31865, 'loss/train': 1.5117849111557007} 11/07/2021 01:44:38 - INFO - __main__ - Step 31867: {'lr': 0.0004514213147871047, 'samples': 6118464, 'steps': 31866, 'loss/train': 1.4329771995544434} 11/07/2021 01:44:39 - INFO - __main__ - Step 31868: {'lr': 0.00045141817132422974, 'samples': 6118656, 'steps': 31867, 'loss/train': 1.3428623676300049} 11/07/2021 01:44:40 - INFO - __main__ - Step 31869: {'lr': 0.0004514150277705988, 'samples': 6118848, 'steps': 31868, 'loss/train': 1.7656563520431519} 11/07/2021 01:44:40 - INFO - __main__ - Step 31870: {'lr': 0.0004514118841262133, 'samples': 6119040, 'steps': 31869, 'loss/train': 1.6058647632598877} 11/07/2021 01:44:41 - INFO - __main__ - Step 31871: {'lr': 0.0004514087403910748, 'samples': 6119232, 'steps': 31870, 'loss/train': 1.6545133590698242} 11/07/2021 01:44:41 - INFO - __main__ - Step 31872: {'lr': 0.00045140559656518456, 'samples': 6119424, 'steps': 31871, 'loss/train': 1.609268069267273} 11/07/2021 01:44:41 - INFO - __main__ - Step 31873: {'lr': 0.0004514024526485441, 'samples': 6119616, 'steps': 31872, 'loss/train': 1.8082526922225952} 11/07/2021 01:44:43 - INFO - __main__ - Step 31874: {'lr': 0.0004513993086411548, 'samples': 6119808, 'steps': 31873, 'loss/train': 1.2211987972259521} 11/07/2021 01:44:43 - INFO - __main__ - Step 31875: {'lr': 0.00045139616454301806, 'samples': 6120000, 'steps': 31874, 'loss/train': 0.29446327686309814} 11/07/2021 01:44:43 - INFO - __main__ - Step 31876: {'lr': 0.00045139302035413534, 'samples': 6120192, 'steps': 31875, 'loss/train': 1.5343080759048462} 11/07/2021 01:44:44 - INFO - __main__ - Step 31877: {'lr': 0.00045138987607450803, 'samples': 6120384, 'steps': 31876, 'loss/train': 2.0093131065368652} 11/07/2021 01:44:44 - INFO - __main__ - Step 31878: {'lr': 0.00045138673170413756, 'samples': 6120576, 'steps': 31877, 'loss/train': 1.5735539197921753} 11/07/2021 01:44:45 - INFO - __main__ - Step 31879: {'lr': 0.0004513835872430253, 'samples': 6120768, 'steps': 31878, 'loss/train': 1.7831584215164185} 11/07/2021 01:44:46 - INFO - __main__ - Step 31880: {'lr': 0.0004513804426911727, 'samples': 6120960, 'steps': 31879, 'loss/train': 1.510374665260315} 11/07/2021 01:44:46 - INFO - __main__ - Step 31881: {'lr': 0.00045137729804858124, 'samples': 6121152, 'steps': 31880, 'loss/train': 1.622793436050415} 11/07/2021 01:44:46 - INFO - __main__ - Step 31882: {'lr': 0.00045137415331525225, 'samples': 6121344, 'steps': 31881, 'loss/train': 1.7737555503845215} 11/07/2021 01:44:47 - INFO - __main__ - Step 31883: {'lr': 0.0004513710084911872, 'samples': 6121536, 'steps': 31882, 'loss/train': 1.6020547151565552} 11/07/2021 01:44:48 - INFO - __main__ - Step 31884: {'lr': 0.00045136786357638736, 'samples': 6121728, 'steps': 31883, 'loss/train': 0.3957446217536926} 11/07/2021 01:44:48 - INFO - __main__ - Step 31885: {'lr': 0.00045136471857085435, 'samples': 6121920, 'steps': 31884, 'loss/train': 1.7443677186965942} 11/07/2021 01:44:48 - INFO - __main__ - Step 31886: {'lr': 0.0004513615734745895, 'samples': 6122112, 'steps': 31885, 'loss/train': 1.861700415611267} 11/07/2021 01:44:49 - INFO - __main__ - Step 31887: {'lr': 0.00045135842828759426, 'samples': 6122304, 'steps': 31886, 'loss/train': 1.7172307968139648} 11/07/2021 01:44:49 - INFO - __main__ - Step 31888: {'lr': 0.00045135528300987006, 'samples': 6122496, 'steps': 31887, 'loss/train': 1.388944149017334} 11/07/2021 01:44:50 - INFO - __main__ - Step 31889: {'lr': 0.00045135213764141814, 'samples': 6122688, 'steps': 31888, 'loss/train': 1.505753993988037} 11/07/2021 01:44:50 - INFO - __main__ - Step 31890: {'lr': 0.00045134899218224014, 'samples': 6122880, 'steps': 31889, 'loss/train': 1.5831694602966309} 11/07/2021 01:44:51 - INFO - __main__ - Step 31891: {'lr': 0.0004513458466323374, 'samples': 6123072, 'steps': 31890, 'loss/train': 1.688417673110962} 11/07/2021 01:44:51 - INFO - __main__ - Step 31892: {'lr': 0.0004513427009917113, 'samples': 6123264, 'steps': 31891, 'loss/train': 1.3869106769561768} 11/07/2021 01:44:52 - INFO - __main__ - Step 31893: {'lr': 0.0004513395552603633, 'samples': 6123456, 'steps': 31892, 'loss/train': 1.3275775909423828} 11/07/2021 01:44:52 - INFO - __main__ - Step 31894: {'lr': 0.0004513364094382948, 'samples': 6123648, 'steps': 31893, 'loss/train': 1.7369481325149536} 11/07/2021 01:44:53 - INFO - __main__ - Step 31895: {'lr': 0.00045133326352550724, 'samples': 6123840, 'steps': 31894, 'loss/train': 2.2003068923950195} 11/07/2021 01:44:53 - INFO - __main__ - Step 31896: {'lr': 0.000451330117522002, 'samples': 6124032, 'steps': 31895, 'loss/train': 1.3032506704330444} 11/07/2021 01:44:54 - INFO - __main__ - Step 31897: {'lr': 0.00045132697142778044, 'samples': 6124224, 'steps': 31896, 'loss/train': 1.6254322528839111} 11/07/2021 01:44:54 - INFO - __main__ - Step 31898: {'lr': 0.0004513238252428442, 'samples': 6124416, 'steps': 31897, 'loss/train': 1.728275179862976} 11/07/2021 01:44:54 - INFO - __main__ - Step 31899: {'lr': 0.0004513206789671945, 'samples': 6124608, 'steps': 31898, 'loss/train': 1.701587438583374} 11/07/2021 01:44:55 - INFO - __main__ - Step 31900: {'lr': 0.00045131753260083276, 'samples': 6124800, 'steps': 31899, 'loss/train': 1.1780604124069214} 11/07/2021 01:44:56 - INFO - __main__ - Step 31901: {'lr': 0.0004513143861437605, 'samples': 6124992, 'steps': 31900, 'loss/train': 1.972902536392212} 11/07/2021 01:44:56 - INFO - __main__ - Step 31902: {'lr': 0.00045131123959597905, 'samples': 6125184, 'steps': 31901, 'loss/train': 1.482000708580017} 11/07/2021 01:44:56 - INFO - __main__ - Step 31903: {'lr': 0.0004513080929574899, 'samples': 6125376, 'steps': 31902, 'loss/train': 1.3008524179458618} 11/07/2021 01:44:57 - INFO - __main__ - Step 31904: {'lr': 0.0004513049462282943, 'samples': 6125568, 'steps': 31903, 'loss/train': 1.3399587869644165} 11/07/2021 01:44:58 - INFO - __main__ - Step 31905: {'lr': 0.00045130179940839395, 'samples': 6125760, 'steps': 31904, 'loss/train': 1.515134334564209} 11/07/2021 01:44:58 - INFO - __main__ - Step 31906: {'lr': 0.00045129865249779, 'samples': 6125952, 'steps': 31905, 'loss/train': 1.4602670669555664} 11/07/2021 01:44:59 - INFO - __main__ - Step 31907: {'lr': 0.0004512955054964841, 'samples': 6126144, 'steps': 31906, 'loss/train': 1.7514537572860718} 11/07/2021 01:44:59 - INFO - __main__ - Step 31908: {'lr': 0.0004512923584044775, 'samples': 6126336, 'steps': 31907, 'loss/train': 2.1205618381500244} 11/07/2021 01:44:59 - INFO - __main__ - Step 31909: {'lr': 0.0004512892112217717, 'samples': 6126528, 'steps': 31908, 'loss/train': 1.4181747436523438} 11/07/2021 01:45:00 - INFO - __main__ - Step 31910: {'lr': 0.00045128606394836805, 'samples': 6126720, 'steps': 31909, 'loss/train': 1.6861112117767334} 11/07/2021 01:45:01 - INFO - __main__ - Step 31911: {'lr': 0.00045128291658426796, 'samples': 6126912, 'steps': 31910, 'loss/train': 2.197659492492676} 11/07/2021 01:45:01 - INFO - __main__ - Step 31912: {'lr': 0.00045127976912947296, 'samples': 6127104, 'steps': 31911, 'loss/train': 1.7058151960372925} 11/07/2021 01:45:01 - INFO - __main__ - Step 31913: {'lr': 0.00045127662158398434, 'samples': 6127296, 'steps': 31912, 'loss/train': 2.1316757202148438} 11/07/2021 01:45:02 - INFO - __main__ - Step 31914: {'lr': 0.00045127347394780367, 'samples': 6127488, 'steps': 31913, 'loss/train': 1.6989363431930542} 11/07/2021 01:45:02 - INFO - __main__ - Step 31915: {'lr': 0.00045127032622093225, 'samples': 6127680, 'steps': 31914, 'loss/train': 1.6610476970672607} 11/07/2021 01:45:03 - INFO - __main__ - Step 31916: {'lr': 0.0004512671784033715, 'samples': 6127872, 'steps': 31915, 'loss/train': 1.3292311429977417} 11/07/2021 01:45:03 - INFO - __main__ - Step 31917: {'lr': 0.00045126403049512286, 'samples': 6128064, 'steps': 31916, 'loss/train': 1.2609515190124512} 11/07/2021 01:45:04 - INFO - __main__ - Step 31918: {'lr': 0.0004512608824961878, 'samples': 6128256, 'steps': 31917, 'loss/train': 1.5588932037353516} 11/07/2021 01:45:04 - INFO - __main__ - Step 31919: {'lr': 0.00045125773440656756, 'samples': 6128448, 'steps': 31918, 'loss/train': 1.2627248764038086} 11/07/2021 01:45:05 - INFO - __main__ - Step 31920: {'lr': 0.0004512545862262638, 'samples': 6128640, 'steps': 31919, 'loss/train': 1.905514121055603} 11/07/2021 01:45:06 - INFO - __main__ - Step 31921: {'lr': 0.0004512514379552779, 'samples': 6128832, 'steps': 31920, 'loss/train': 1.595982313156128} 11/07/2021 01:45:06 - INFO - __main__ - Step 31922: {'lr': 0.0004512482895936111, 'samples': 6129024, 'steps': 31921, 'loss/train': 1.3344855308532715} 11/07/2021 01:45:06 - INFO - __main__ - Step 31923: {'lr': 0.00045124514114126493, 'samples': 6129216, 'steps': 31922, 'loss/train': 1.5508592128753662} 11/07/2021 01:45:07 - INFO - __main__ - Step 31924: {'lr': 0.0004512419925982408, 'samples': 6129408, 'steps': 31923, 'loss/train': 1.8285584449768066} 11/07/2021 01:45:07 - INFO - __main__ - Step 31925: {'lr': 0.0004512388439645402, 'samples': 6129600, 'steps': 31924, 'loss/train': 1.5829516649246216} 11/07/2021 01:45:08 - INFO - __main__ - Step 31926: {'lr': 0.00045123569524016446, 'samples': 6129792, 'steps': 31925, 'loss/train': 1.6425074338912964} 11/07/2021 01:45:09 - INFO - __main__ - Step 31927: {'lr': 0.00045123254642511504, 'samples': 6129984, 'steps': 31926, 'loss/train': 1.4603376388549805} 11/07/2021 01:45:09 - INFO - __main__ - Step 31928: {'lr': 0.0004512293975193933, 'samples': 6130176, 'steps': 31927, 'loss/train': 1.7124818563461304} 11/07/2021 01:45:09 - INFO - __main__ - Step 31929: {'lr': 0.0004512262485230007, 'samples': 6130368, 'steps': 31928, 'loss/train': 1.0703651905059814} 11/07/2021 01:45:10 - INFO - __main__ - Step 31930: {'lr': 0.00045122309943593865, 'samples': 6130560, 'steps': 31929, 'loss/train': 1.5126354694366455} 11/07/2021 01:45:11 - INFO - __main__ - Step 31931: {'lr': 0.0004512199502582086, 'samples': 6130752, 'steps': 31930, 'loss/train': 1.3348126411437988} 11/07/2021 01:45:11 - INFO - __main__ - Step 31932: {'lr': 0.00045121680098981186, 'samples': 6130944, 'steps': 31931, 'loss/train': 1.4302009344100952} 11/07/2021 01:45:11 - INFO - __main__ - Step 31933: {'lr': 0.00045121365163075007, 'samples': 6131136, 'steps': 31932, 'loss/train': 1.6032031774520874} 11/07/2021 01:45:12 - INFO - __main__ - Step 31934: {'lr': 0.0004512105021810244, 'samples': 6131328, 'steps': 31933, 'loss/train': 1.6673365831375122} 11/07/2021 01:45:12 - INFO - __main__ - Step 31935: {'lr': 0.0004512073526406365, 'samples': 6131520, 'steps': 31934, 'loss/train': 1.3719717264175415} 11/07/2021 01:45:12 - INFO - __main__ - Step 31936: {'lr': 0.0004512042030095876, 'samples': 6131712, 'steps': 31935, 'loss/train': 1.7193927764892578} 11/07/2021 01:45:13 - INFO - __main__ - Step 31937: {'lr': 0.0004512010532878792, 'samples': 6131904, 'steps': 31936, 'loss/train': 1.2810825109481812} 11/07/2021 01:45:14 - INFO - __main__ - Step 31938: {'lr': 0.0004511979034755127, 'samples': 6132096, 'steps': 31937, 'loss/train': 1.3766365051269531} 11/07/2021 01:45:14 - INFO - __main__ - Step 31939: {'lr': 0.0004511947535724895, 'samples': 6132288, 'steps': 31938, 'loss/train': 1.246138334274292} 11/07/2021 01:45:15 - INFO - __main__ - Step 31940: {'lr': 0.00045119160357881105, 'samples': 6132480, 'steps': 31939, 'loss/train': 1.8165699243545532} 11/07/2021 01:45:15 - INFO - __main__ - Step 31941: {'lr': 0.0004511884534944789, 'samples': 6132672, 'steps': 31940, 'loss/train': 1.4489672183990479} 11/07/2021 01:45:16 - INFO - __main__ - Step 31942: {'lr': 0.0004511853033194942, 'samples': 6132864, 'steps': 31941, 'loss/train': 0.7833683490753174} 11/07/2021 01:45:16 - INFO - __main__ - Step 31943: {'lr': 0.00045118215305385855, 'samples': 6133056, 'steps': 31942, 'loss/train': 1.383625864982605} 11/07/2021 01:45:16 - INFO - __main__ - Step 31944: {'lr': 0.0004511790026975733, 'samples': 6133248, 'steps': 31943, 'loss/train': 1.392366886138916} 11/07/2021 01:45:17 - INFO - __main__ - Step 31945: {'lr': 0.00045117585225063996, 'samples': 6133440, 'steps': 31944, 'loss/train': 1.6650818586349487} 11/07/2021 01:45:17 - INFO - __main__ - Step 31946: {'lr': 0.0004511727017130598, 'samples': 6133632, 'steps': 31945, 'loss/train': 1.8258341550827026} 11/07/2021 01:45:18 - INFO - __main__ - Step 31947: {'lr': 0.00045116955108483436, 'samples': 6133824, 'steps': 31946, 'loss/train': 1.6164908409118652} 11/07/2021 01:45:19 - INFO - __main__ - Step 31948: {'lr': 0.00045116640036596507, 'samples': 6134016, 'steps': 31947, 'loss/train': 1.5087451934814453} 11/07/2021 01:45:19 - INFO - __main__ - Step 31949: {'lr': 0.0004511632495564533, 'samples': 6134208, 'steps': 31948, 'loss/train': 1.5558593273162842} 11/07/2021 01:45:19 - INFO - __main__ - Step 31950: {'lr': 0.00045116009865630034, 'samples': 6134400, 'steps': 31949, 'loss/train': 1.8072712421417236} 11/07/2021 01:45:20 - INFO - __main__ - Step 31951: {'lr': 0.0004511569476655079, 'samples': 6134592, 'steps': 31950, 'loss/train': 1.579148530960083} 11/07/2021 01:45:21 - INFO - __main__ - Step 31952: {'lr': 0.00045115379658407717, 'samples': 6134784, 'steps': 31951, 'loss/train': 0.6126420497894287} 11/07/2021 01:45:21 - INFO - __main__ - Step 31953: {'lr': 0.0004511506454120097, 'samples': 6134976, 'steps': 31952, 'loss/train': 1.2280786037445068} 11/07/2021 01:45:21 - INFO - __main__ - Step 31954: {'lr': 0.00045114749414930676, 'samples': 6135168, 'steps': 31953, 'loss/train': 1.9271167516708374} 11/07/2021 01:45:22 - INFO - __main__ - Step 31955: {'lr': 0.00045114434279596994, 'samples': 6135360, 'steps': 31954, 'loss/train': 1.5438740253448486} 11/07/2021 01:45:22 - INFO - __main__ - Step 31956: {'lr': 0.0004511411913520006, 'samples': 6135552, 'steps': 31955, 'loss/train': 1.3665361404418945} 11/07/2021 01:45:23 - INFO - __main__ - Step 31957: {'lr': 0.0004511380398174001, 'samples': 6135744, 'steps': 31956, 'loss/train': 1.552093505859375} 11/07/2021 01:45:24 - INFO - __main__ - Step 31958: {'lr': 0.00045113488819216983, 'samples': 6135936, 'steps': 31957, 'loss/train': 1.0456637144088745} 11/07/2021 01:45:24 - INFO - __main__ - Step 31959: {'lr': 0.00045113173647631143, 'samples': 6136128, 'steps': 31958, 'loss/train': 0.6969603896141052} 11/07/2021 01:45:24 - INFO - __main__ - Step 31960: {'lr': 0.0004511285846698261, 'samples': 6136320, 'steps': 31959, 'loss/train': 1.286709189414978} 11/07/2021 01:45:25 - INFO - __main__ - Step 31961: {'lr': 0.0004511254327727153, 'samples': 6136512, 'steps': 31960, 'loss/train': 1.1964833736419678} 11/07/2021 01:45:25 - INFO - __main__ - Step 31962: {'lr': 0.00045112228078498053, 'samples': 6136704, 'steps': 31961, 'loss/train': 1.115088701248169} 11/07/2021 01:45:26 - INFO - __main__ - Step 31963: {'lr': 0.0004511191287066232, 'samples': 6136896, 'steps': 31962, 'loss/train': 1.9909965991973877} 11/07/2021 01:45:27 - INFO - __main__ - Step 31964: {'lr': 0.00045111597653764456, 'samples': 6137088, 'steps': 31963, 'loss/train': 1.7869207859039307} 11/07/2021 01:45:27 - INFO - __main__ - Step 31965: {'lr': 0.00045111282427804636, 'samples': 6137280, 'steps': 31964, 'loss/train': 3.042821168899536} 11/07/2021 01:45:27 - INFO - __main__ - Step 31966: {'lr': 0.0004511096719278297, 'samples': 6137472, 'steps': 31965, 'loss/train': 1.2482612133026123} 11/07/2021 01:45:28 - INFO - __main__ - Step 31967: {'lr': 0.0004511065194869961, 'samples': 6137664, 'steps': 31966, 'loss/train': 1.316340446472168} 11/07/2021 01:45:29 - INFO - __main__ - Step 31968: {'lr': 0.00045110336695554707, 'samples': 6137856, 'steps': 31967, 'loss/train': 1.5354576110839844} 11/07/2021 01:45:29 - INFO - __main__ - Step 31969: {'lr': 0.0004511002143334839, 'samples': 6138048, 'steps': 31968, 'loss/train': 5.773683071136475} 11/07/2021 01:45:29 - INFO - __main__ - Step 31970: {'lr': 0.0004510970616208081, 'samples': 6138240, 'steps': 31969, 'loss/train': 1.477264642715454} 11/07/2021 01:45:30 - INFO - __main__ - Step 31971: {'lr': 0.0004510939088175211, 'samples': 6138432, 'steps': 31970, 'loss/train': 1.5126852989196777} 11/07/2021 01:45:30 - INFO - __main__ - Step 31972: {'lr': 0.00045109075592362433, 'samples': 6138624, 'steps': 31971, 'loss/train': 1.9196875095367432} 11/07/2021 01:45:31 - INFO - __main__ - Step 31973: {'lr': 0.0004510876029391191, 'samples': 6138816, 'steps': 31972, 'loss/train': 1.5249228477478027} 11/07/2021 01:45:32 - INFO - __main__ - Step 31974: {'lr': 0.00045108444986400687, 'samples': 6139008, 'steps': 31973, 'loss/train': 1.2626824378967285} 11/07/2021 01:45:32 - INFO - __main__ - Step 31975: {'lr': 0.0004510812966982892, 'samples': 6139200, 'steps': 31974, 'loss/train': 1.2591478824615479} 11/07/2021 01:45:32 - INFO - __main__ - Step 31976: {'lr': 0.0004510781434419673, 'samples': 6139392, 'steps': 31975, 'loss/train': 2.4961934089660645} 11/07/2021 01:45:33 - INFO - __main__ - Step 31977: {'lr': 0.0004510749900950427, 'samples': 6139584, 'steps': 31976, 'loss/train': 1.5184125900268555} 11/07/2021 01:45:33 - INFO - __main__ - Step 31978: {'lr': 0.00045107183665751686, 'samples': 6139776, 'steps': 31977, 'loss/train': 1.482326865196228} 11/07/2021 01:45:34 - INFO - __main__ - Step 31979: {'lr': 0.00045106868312939116, 'samples': 6139968, 'steps': 31978, 'loss/train': 0.6979764103889465} 11/07/2021 01:45:34 - INFO - __main__ - Step 31980: {'lr': 0.0004510655295106669, 'samples': 6140160, 'steps': 31979, 'loss/train': 0.9233061671257019} 11/07/2021 01:45:35 - INFO - __main__ - Step 31981: {'lr': 0.00045106237580134573, 'samples': 6140352, 'steps': 31980, 'loss/train': 1.0737470388412476} 11/07/2021 01:45:35 - INFO - __main__ - Step 31982: {'lr': 0.000451059222001429, 'samples': 6140544, 'steps': 31981, 'loss/train': 1.8268646001815796} 11/07/2021 01:45:35 - INFO - __main__ - Step 31983: {'lr': 0.0004510560681109179, 'samples': 6140736, 'steps': 31982, 'loss/train': 1.8358553647994995} 11/07/2021 01:45:37 - INFO - __main__ - Step 31984: {'lr': 0.0004510529141298142, 'samples': 6140928, 'steps': 31983, 'loss/train': 1.2324832677841187} 11/07/2021 01:45:37 - INFO - __main__ - Step 31985: {'lr': 0.00045104976005811917, 'samples': 6141120, 'steps': 31984, 'loss/train': 1.5840585231781006} 11/07/2021 01:45:37 - INFO - __main__ - Step 31986: {'lr': 0.00045104660589583413, 'samples': 6141312, 'steps': 31985, 'loss/train': 1.898639440536499} 11/07/2021 01:45:38 - INFO - __main__ - Step 31987: {'lr': 0.0004510434516429606, 'samples': 6141504, 'steps': 31986, 'loss/train': 3.6436564922332764} 11/07/2021 01:45:38 - INFO - __main__ - Step 31988: {'lr': 0.0004510402972995, 'samples': 6141696, 'steps': 31987, 'loss/train': 1.653199553489685} 11/07/2021 01:45:39 - INFO - __main__ - Step 31989: {'lr': 0.0004510371428654538, 'samples': 6141888, 'steps': 31988, 'loss/train': 1.620031476020813} 11/07/2021 01:45:39 - INFO - __main__ - Step 31990: {'lr': 0.00045103398834082334, 'samples': 6142080, 'steps': 31989, 'loss/train': 1.748823642730713} 11/07/2021 01:45:40 - INFO - __main__ - Step 31991: {'lr': 0.00045103083372561003, 'samples': 6142272, 'steps': 31990, 'loss/train': 1.3456593751907349} 11/07/2021 01:45:40 - INFO - __main__ - Step 31992: {'lr': 0.0004510276790198153, 'samples': 6142464, 'steps': 31991, 'loss/train': 1.7565659284591675} 11/07/2021 01:45:40 - INFO - __main__ - Step 31993: {'lr': 0.00045102452422344065, 'samples': 6142656, 'steps': 31992, 'loss/train': 1.65485680103302} 11/07/2021 01:45:41 - INFO - __main__ - Step 31994: {'lr': 0.0004510213693364875, 'samples': 6142848, 'steps': 31993, 'loss/train': 1.589545488357544} 11/07/2021 01:45:42 - INFO - __main__ - Step 31995: {'lr': 0.0004510182143589572, 'samples': 6143040, 'steps': 31994, 'loss/train': 1.8118722438812256} 11/07/2021 01:45:42 - INFO - __main__ - Step 31996: {'lr': 0.0004510150592908511, 'samples': 6143232, 'steps': 31995, 'loss/train': 0.27138158679008484} 11/07/2021 01:45:42 - INFO - __main__ - Step 31997: {'lr': 0.00045101190413217085, 'samples': 6143424, 'steps': 31996, 'loss/train': 1.6206790208816528} 11/07/2021 01:45:43 - INFO - __main__ - Step 31998: {'lr': 0.0004510087488829177, 'samples': 6143616, 'steps': 31997, 'loss/train': 1.502052664756775} 11/07/2021 01:45:44 - INFO - __main__ - Step 31999: {'lr': 0.000451005593543093, 'samples': 6143808, 'steps': 31998, 'loss/train': 1.7499158382415771} 11/07/2021 01:45:44 - INFO - __main__ - Step 32000: {'lr': 0.00045100243811269834, 'samples': 6144000, 'steps': 31999, 'loss/train': 1.2587133646011353} 11/07/2021 01:45:45 - INFO - __main__ - Step 32001: {'lr': 0.00045099928259173516, 'samples': 6144192, 'steps': 32000, 'loss/train': 1.6135326623916626} 11/07/2021 01:45:45 - INFO - __main__ - Step 32002: {'lr': 0.0004509961269802048, 'samples': 6144384, 'steps': 32001, 'loss/train': 1.5403943061828613} 11/07/2021 01:45:45 - INFO - __main__ - Step 32003: {'lr': 0.00045099297127810855, 'samples': 6144576, 'steps': 32002, 'loss/train': 0.7216783165931702} 11/07/2021 01:45:46 - INFO - __main__ - Step 32004: {'lr': 0.0004509898154854481, 'samples': 6144768, 'steps': 32003, 'loss/train': 1.133880615234375} 11/07/2021 01:45:47 - INFO - __main__ - Step 32005: {'lr': 0.00045098665960222474, 'samples': 6144960, 'steps': 32004, 'loss/train': 1.5767760276794434} 11/07/2021 01:45:47 - INFO - __main__ - Step 32006: {'lr': 0.00045098350362843975, 'samples': 6145152, 'steps': 32005, 'loss/train': 1.8912142515182495} 11/07/2021 01:45:47 - INFO - __main__ - Step 32007: {'lr': 0.0004509803475640948, 'samples': 6145344, 'steps': 32006, 'loss/train': 1.3936712741851807} 11/07/2021 01:45:48 - INFO - __main__ - Step 32008: {'lr': 0.00045097719140919126, 'samples': 6145536, 'steps': 32007, 'loss/train': 1.929354190826416} 11/07/2021 01:45:48 - INFO - __main__ - Step 32009: {'lr': 0.0004509740351637304, 'samples': 6145728, 'steps': 32008, 'loss/train': 0.8071759939193726} 11/07/2021 01:45:49 - INFO - __main__ - Step 32010: {'lr': 0.0004509708788277138, 'samples': 6145920, 'steps': 32009, 'loss/train': 1.3026527166366577} 11/07/2021 01:45:50 - INFO - __main__ - Step 32011: {'lr': 0.0004509677224011428, 'samples': 6146112, 'steps': 32010, 'loss/train': 1.4915457963943481} 11/07/2021 01:45:50 - INFO - __main__ - Step 32012: {'lr': 0.00045096456588401883, 'samples': 6146304, 'steps': 32011, 'loss/train': 1.9986481666564941} 11/07/2021 01:45:50 - INFO - __main__ - Step 32013: {'lr': 0.0004509614092763434, 'samples': 6146496, 'steps': 32012, 'loss/train': 1.2444761991500854} 11/07/2021 01:45:51 - INFO - __main__ - Step 32014: {'lr': 0.00045095825257811776, 'samples': 6146688, 'steps': 32013, 'loss/train': 1.3719403743743896} 11/07/2021 01:45:52 - INFO - __main__ - Step 32015: {'lr': 0.00045095509578934353, 'samples': 6146880, 'steps': 32014, 'loss/train': 1.5320340394973755} 11/07/2021 01:45:52 - INFO - __main__ - Step 32016: {'lr': 0.00045095193891002194, 'samples': 6147072, 'steps': 32015, 'loss/train': 1.5162948369979858} 11/07/2021 01:45:53 - INFO - __main__ - Step 32017: {'lr': 0.00045094878194015456, 'samples': 6147264, 'steps': 32016, 'loss/train': 1.094167947769165} 11/07/2021 01:45:53 - INFO - __main__ - Step 32018: {'lr': 0.0004509456248797428, 'samples': 6147456, 'steps': 32017, 'loss/train': 2.1765406131744385} 11/07/2021 01:45:53 - INFO - __main__ - Step 32019: {'lr': 0.000450942467728788, 'samples': 6147648, 'steps': 32018, 'loss/train': 1.4130141735076904} 11/07/2021 01:45:54 - INFO - __main__ - Step 32020: {'lr': 0.00045093931048729156, 'samples': 6147840, 'steps': 32019, 'loss/train': 1.719070315361023} 11/07/2021 01:45:55 - INFO - __main__ - Step 32021: {'lr': 0.00045093615315525506, 'samples': 6148032, 'steps': 32020, 'loss/train': 1.7311382293701172} 11/07/2021 01:45:55 - INFO - __main__ - Step 32022: {'lr': 0.00045093299573267977, 'samples': 6148224, 'steps': 32021, 'loss/train': 1.6059623956680298} 11/07/2021 01:45:55 - INFO - __main__ - Step 32023: {'lr': 0.00045092983821956725, 'samples': 6148416, 'steps': 32022, 'loss/train': 1.6733207702636719} 11/07/2021 01:45:56 - INFO - __main__ - Step 32024: {'lr': 0.00045092668061591875, 'samples': 6148608, 'steps': 32023, 'loss/train': 1.3829171657562256} 11/07/2021 01:45:56 - INFO - __main__ - Step 32025: {'lr': 0.00045092352292173585, 'samples': 6148800, 'steps': 32024, 'loss/train': 0.8372914791107178} 11/07/2021 01:45:57 - INFO - __main__ - Step 32026: {'lr': 0.00045092036513701985, 'samples': 6148992, 'steps': 32025, 'loss/train': 1.7354086637496948} 11/07/2021 01:45:57 - INFO - __main__ - Step 32027: {'lr': 0.0004509172072617723, 'samples': 6149184, 'steps': 32026, 'loss/train': 2.015341281890869} 11/07/2021 01:45:58 - INFO - __main__ - Step 32028: {'lr': 0.00045091404929599455, 'samples': 6149376, 'steps': 32027, 'loss/train': 1.555013656616211} 11/07/2021 01:45:58 - INFO - __main__ - Step 32029: {'lr': 0.00045091089123968796, 'samples': 6149568, 'steps': 32028, 'loss/train': 1.7335764169692993} 11/07/2021 01:45:59 - INFO - __main__ - Step 32030: {'lr': 0.0004509077330928541, 'samples': 6149760, 'steps': 32029, 'loss/train': 1.2832915782928467} 11/07/2021 01:45:59 - INFO - __main__ - Step 32031: {'lr': 0.0004509045748554943, 'samples': 6149952, 'steps': 32030, 'loss/train': 1.601660132408142} 11/07/2021 01:46:00 - INFO - __main__ - Step 32032: {'lr': 0.00045090141652760995, 'samples': 6150144, 'steps': 32031, 'loss/train': 1.7143197059631348} 11/07/2021 01:46:00 - INFO - __main__ - Step 32033: {'lr': 0.0004508982581092026, 'samples': 6150336, 'steps': 32032, 'loss/train': 0.9741845726966858} 11/07/2021 01:46:01 - INFO - __main__ - Step 32034: {'lr': 0.00045089509960027354, 'samples': 6150528, 'steps': 32033, 'loss/train': 1.5302150249481201} 11/07/2021 01:46:01 - INFO - __main__ - Step 32035: {'lr': 0.00045089194100082433, 'samples': 6150720, 'steps': 32034, 'loss/train': 1.5801377296447754} 11/07/2021 01:46:02 - INFO - __main__ - Step 32036: {'lr': 0.00045088878231085616, 'samples': 6150912, 'steps': 32035, 'loss/train': 1.6848492622375488} 11/07/2021 01:46:02 - INFO - __main__ - Step 32037: {'lr': 0.00045088562353037077, 'samples': 6151104, 'steps': 32036, 'loss/train': 1.8636360168457031} 11/07/2021 01:46:02 - INFO - __main__ - Step 32038: {'lr': 0.00045088246465936936, 'samples': 6151296, 'steps': 32037, 'loss/train': 1.515488862991333} 11/07/2021 01:46:03 - INFO - __main__ - Step 32039: {'lr': 0.0004508793056978534, 'samples': 6151488, 'steps': 32038, 'loss/train': 1.8291229009628296} 11/07/2021 01:46:03 - INFO - __main__ - Step 32040: {'lr': 0.00045087614664582424, 'samples': 6151680, 'steps': 32039, 'loss/train': 1.4584295749664307} 11/07/2021 01:46:04 - INFO - __main__ - Step 32041: {'lr': 0.0004508729875032834, 'samples': 6151872, 'steps': 32040, 'loss/train': 1.70821213722229} 11/07/2021 01:46:05 - INFO - __main__ - Step 32042: {'lr': 0.0004508698282702324, 'samples': 6152064, 'steps': 32041, 'loss/train': 1.9895410537719727} 11/07/2021 01:46:05 - INFO - __main__ - Step 32043: {'lr': 0.0004508666689466725, 'samples': 6152256, 'steps': 32042, 'loss/train': 1.8032820224761963} 11/07/2021 01:46:05 - INFO - __main__ - Step 32044: {'lr': 0.00045086350953260526, 'samples': 6152448, 'steps': 32043, 'loss/train': 1.8391896486282349} 11/07/2021 01:46:06 - INFO - __main__ - Step 32045: {'lr': 0.0004508603500280319, 'samples': 6152640, 'steps': 32044, 'loss/train': 1.5459450483322144} 11/07/2021 01:46:07 - INFO - __main__ - Step 32046: {'lr': 0.00045085719043295406, 'samples': 6152832, 'steps': 32045, 'loss/train': 1.458228349685669} 11/07/2021 01:46:07 - INFO - __main__ - Step 32047: {'lr': 0.00045085403074737295, 'samples': 6153024, 'steps': 32046, 'loss/train': 1.8259520530700684} 11/07/2021 01:46:07 - INFO - __main__ - Step 32048: {'lr': 0.0004508508709712902, 'samples': 6153216, 'steps': 32047, 'loss/train': 1.78195321559906} 11/07/2021 01:46:08 - INFO - __main__ - Step 32049: {'lr': 0.00045084771110470717, 'samples': 6153408, 'steps': 32048, 'loss/train': 1.6490238904953003} 11/07/2021 01:46:08 - INFO - __main__ - Step 32050: {'lr': 0.00045084455114762525, 'samples': 6153600, 'steps': 32049, 'loss/train': 1.6333950757980347} 11/07/2021 01:46:09 - INFO - __main__ - Step 32051: {'lr': 0.00045084139110004585, 'samples': 6153792, 'steps': 32050, 'loss/train': 1.9807506799697876} 11/07/2021 01:46:10 - INFO - __main__ - Step 32052: {'lr': 0.0004508382309619704, 'samples': 6153984, 'steps': 32051, 'loss/train': 1.0197628736495972} 11/07/2021 01:46:10 - INFO - __main__ - Step 32053: {'lr': 0.0004508350707334004, 'samples': 6154176, 'steps': 32052, 'loss/train': 1.282412052154541} 11/07/2021 01:46:10 - INFO - __main__ - Step 32054: {'lr': 0.00045083191041433713, 'samples': 6154368, 'steps': 32053, 'loss/train': 0.9735081791877747} 11/07/2021 01:46:11 - INFO - __main__ - Step 32055: {'lr': 0.00045082875000478214, 'samples': 6154560, 'steps': 32054, 'loss/train': 1.6473565101623535} 11/07/2021 01:46:12 - INFO - __main__ - Step 32056: {'lr': 0.0004508255895047368, 'samples': 6154752, 'steps': 32055, 'loss/train': 1.4841833114624023} 11/07/2021 01:46:12 - INFO - __main__ - Step 32057: {'lr': 0.0004508224289142026, 'samples': 6154944, 'steps': 32056, 'loss/train': 1.6833237409591675} 11/07/2021 01:46:12 - INFO - __main__ - Step 32058: {'lr': 0.0004508192682331809, 'samples': 6155136, 'steps': 32057, 'loss/train': 1.6849783658981323} 11/07/2021 01:46:13 - INFO - __main__ - Step 32059: {'lr': 0.0004508161074616731, 'samples': 6155328, 'steps': 32058, 'loss/train': 1.8919442892074585} 11/07/2021 01:46:13 - INFO - __main__ - Step 32060: {'lr': 0.0004508129465996806, 'samples': 6155520, 'steps': 32059, 'loss/train': 1.4205477237701416} 11/07/2021 01:46:14 - INFO - __main__ - Step 32061: {'lr': 0.00045080978564720505, 'samples': 6155712, 'steps': 32060, 'loss/train': 1.436179757118225} 11/07/2021 01:46:15 - INFO - __main__ - Step 32062: {'lr': 0.0004508066246042476, 'samples': 6155904, 'steps': 32061, 'loss/train': 1.7141095399856567} 11/07/2021 01:46:15 - INFO - __main__ - Step 32063: {'lr': 0.0004508034634708098, 'samples': 6156096, 'steps': 32062, 'loss/train': 1.0422805547714233} 11/07/2021 01:46:15 - INFO - __main__ - Step 32064: {'lr': 0.0004508003022468931, 'samples': 6156288, 'steps': 32063, 'loss/train': 1.554802417755127} 11/07/2021 01:46:16 - INFO - __main__ - Step 32065: {'lr': 0.00045079714093249887, 'samples': 6156480, 'steps': 32064, 'loss/train': 1.9286205768585205} 11/07/2021 01:46:16 - INFO - __main__ - Step 32066: {'lr': 0.00045079397952762845, 'samples': 6156672, 'steps': 32065, 'loss/train': 1.645257830619812} 11/07/2021 01:46:17 - INFO - __main__ - Step 32067: {'lr': 0.0004507908180322835, 'samples': 6156864, 'steps': 32066, 'loss/train': 1.5263566970825195} 11/07/2021 01:46:17 - INFO - __main__ - Step 32068: {'lr': 0.00045078765644646524, 'samples': 6157056, 'steps': 32067, 'loss/train': 1.6802290678024292} 11/07/2021 01:46:18 - INFO - __main__ - Step 32069: {'lr': 0.00045078449477017516, 'samples': 6157248, 'steps': 32068, 'loss/train': 0.8311300277709961} 11/07/2021 01:46:18 - INFO - __main__ - Step 32070: {'lr': 0.0004507813330034147, 'samples': 6157440, 'steps': 32069, 'loss/train': 1.153349757194519} 11/07/2021 01:46:19 - INFO - __main__ - Step 32071: {'lr': 0.00045077817114618526, 'samples': 6157632, 'steps': 32070, 'loss/train': 1.3929682970046997} 11/07/2021 01:46:19 - INFO - __main__ - Step 32072: {'lr': 0.00045077500919848826, 'samples': 6157824, 'steps': 32071, 'loss/train': 1.6842589378356934} 11/07/2021 01:46:20 - INFO - __main__ - Step 32073: {'lr': 0.00045077184716032516, 'samples': 6158016, 'steps': 32072, 'loss/train': 1.767591118812561} 11/07/2021 01:46:20 - INFO - __main__ - Step 32074: {'lr': 0.0004507686850316973, 'samples': 6158208, 'steps': 32073, 'loss/train': 1.6692560911178589} 11/07/2021 01:46:21 - INFO - __main__ - Step 32075: {'lr': 0.00045076552281260625, 'samples': 6158400, 'steps': 32074, 'loss/train': 1.4434971809387207} 11/07/2021 01:46:21 - INFO - __main__ - Step 32076: {'lr': 0.0004507623605030533, 'samples': 6158592, 'steps': 32075, 'loss/train': 1.3661025762557983} 11/07/2021 01:46:22 - INFO - __main__ - Step 32077: {'lr': 0.00045075919810304, 'samples': 6158784, 'steps': 32076, 'loss/train': 1.3306758403778076} 11/07/2021 01:46:22 - INFO - __main__ - Step 32078: {'lr': 0.0004507560356125676, 'samples': 6158976, 'steps': 32077, 'loss/train': 1.8561049699783325} 11/07/2021 01:46:23 - INFO - __main__ - Step 32079: {'lr': 0.0004507528730316377, 'samples': 6159168, 'steps': 32078, 'loss/train': 1.629477858543396} 11/07/2021 01:46:23 - INFO - __main__ - Step 32080: {'lr': 0.0004507497103602517, 'samples': 6159360, 'steps': 32079, 'loss/train': 1.2134417295455933} 11/07/2021 01:46:23 - INFO - __main__ - Step 32081: {'lr': 0.00045074654759841087, 'samples': 6159552, 'steps': 32080, 'loss/train': 1.6609727144241333} 11/07/2021 01:46:24 - INFO - __main__ - Step 32082: {'lr': 0.00045074338474611683, 'samples': 6159744, 'steps': 32081, 'loss/train': 5.802849769592285} 11/07/2021 01:46:25 - INFO - __main__ - Step 32083: {'lr': 0.00045074022180337085, 'samples': 6159936, 'steps': 32082, 'loss/train': 1.6034067869186401} 11/07/2021 01:46:25 - INFO - __main__ - Step 32084: {'lr': 0.0004507370587701745, 'samples': 6160128, 'steps': 32083, 'loss/train': 1.6937822103500366} 11/07/2021 01:46:25 - INFO - __main__ - Step 32085: {'lr': 0.000450733895646529, 'samples': 6160320, 'steps': 32084, 'loss/train': 1.17350172996521} 11/07/2021 01:46:26 - INFO - __main__ - Step 32086: {'lr': 0.00045073073243243603, 'samples': 6160512, 'steps': 32085, 'loss/train': 1.0529669523239136} 11/07/2021 01:46:26 - INFO - __main__ - Step 32087: {'lr': 0.0004507275691278968, 'samples': 6160704, 'steps': 32086, 'loss/train': 1.3048980236053467} 11/07/2021 01:46:27 - INFO - __main__ - Step 32088: {'lr': 0.00045072440573291293, 'samples': 6160896, 'steps': 32087, 'loss/train': 1.3754926919937134} 11/07/2021 01:46:28 - INFO - __main__ - Step 32089: {'lr': 0.0004507212422474857, 'samples': 6161088, 'steps': 32088, 'loss/train': 1.2092654705047607} 11/07/2021 01:46:28 - INFO - __main__ - Step 32090: {'lr': 0.0004507180786716165, 'samples': 6161280, 'steps': 32089, 'loss/train': 1.558481216430664} 11/07/2021 01:46:28 - INFO - __main__ - Step 32091: {'lr': 0.00045071491500530694, 'samples': 6161472, 'steps': 32090, 'loss/train': 0.4279614984989166} 11/07/2021 01:46:29 - INFO - __main__ - Step 32092: {'lr': 0.0004507117512485582, 'samples': 6161664, 'steps': 32091, 'loss/train': 0.9977620244026184} 11/07/2021 01:46:30 - INFO - __main__ - Step 32093: {'lr': 0.000450708587401372, 'samples': 6161856, 'steps': 32092, 'loss/train': 1.0190815925598145} 11/07/2021 01:46:30 - INFO - __main__ - Step 32094: {'lr': 0.0004507054234637495, 'samples': 6162048, 'steps': 32093, 'loss/train': 1.9626078605651855} 11/07/2021 01:46:31 - INFO - __main__ - Step 32095: {'lr': 0.0004507022594356922, 'samples': 6162240, 'steps': 32094, 'loss/train': 1.7547073364257812} 11/07/2021 01:46:31 - INFO - __main__ - Step 32096: {'lr': 0.00045069909531720166, 'samples': 6162432, 'steps': 32095, 'loss/train': 1.469717025756836} 11/07/2021 01:46:31 - INFO - __main__ - Step 32097: {'lr': 0.0004506959311082792, 'samples': 6162624, 'steps': 32096, 'loss/train': 0.25666314363479614} 11/07/2021 01:46:33 - INFO - __main__ - Step 32098: {'lr': 0.00045069276680892624, 'samples': 6162816, 'steps': 32097, 'loss/train': 1.6708344221115112} 11/07/2021 01:46:33 - INFO - __main__ - Step 32099: {'lr': 0.00045068960241914413, 'samples': 6163008, 'steps': 32098, 'loss/train': 1.4840983152389526} 11/07/2021 01:46:33 - INFO - __main__ - Step 32100: {'lr': 0.00045068643793893447, 'samples': 6163200, 'steps': 32099, 'loss/train': 1.1935482025146484} 11/07/2021 01:46:34 - INFO - __main__ - Step 32101: {'lr': 0.0004506832733682986, 'samples': 6163392, 'steps': 32100, 'loss/train': 1.4337197542190552} 11/07/2021 01:46:34 - INFO - __main__ - Step 32102: {'lr': 0.00045068010870723783, 'samples': 6163584, 'steps': 32101, 'loss/train': 1.115096092224121} 11/07/2021 01:46:34 - INFO - __main__ - Step 32103: {'lr': 0.00045067694395575385, 'samples': 6163776, 'steps': 32102, 'loss/train': 1.8311907052993774} 11/07/2021 01:46:35 - INFO - __main__ - Step 32104: {'lr': 0.0004506737791138479, 'samples': 6163968, 'steps': 32103, 'loss/train': 1.3427801132202148} 11/07/2021 01:46:36 - INFO - __main__ - Step 32105: {'lr': 0.00045067061418152136, 'samples': 6164160, 'steps': 32104, 'loss/train': 1.3438457250595093} 11/07/2021 01:46:36 - INFO - __main__ - Step 32106: {'lr': 0.00045066744915877585, 'samples': 6164352, 'steps': 32105, 'loss/train': 1.156517744064331} 11/07/2021 01:46:36 - INFO - __main__ - Step 32107: {'lr': 0.0004506642840456126, 'samples': 6164544, 'steps': 32106, 'loss/train': 1.7105660438537598} 11/07/2021 01:46:37 - INFO - __main__ - Step 32108: {'lr': 0.00045066111884203315, 'samples': 6164736, 'steps': 32107, 'loss/train': 1.4578814506530762} 11/07/2021 01:46:38 - INFO - __main__ - Step 32109: {'lr': 0.0004506579535480389, 'samples': 6164928, 'steps': 32108, 'loss/train': 1.5972881317138672} 11/07/2021 01:46:38 - INFO - __main__ - Step 32110: {'lr': 0.00045065478816363124, 'samples': 6165120, 'steps': 32109, 'loss/train': 2.4207537174224854} 11/07/2021 01:46:38 - INFO - __main__ - Step 32111: {'lr': 0.00045065162268881164, 'samples': 6165312, 'steps': 32110, 'loss/train': 1.431797981262207} 11/07/2021 01:46:39 - INFO - __main__ - Step 32112: {'lr': 0.0004506484571235816, 'samples': 6165504, 'steps': 32111, 'loss/train': 1.3394352197647095} 11/07/2021 01:46:39 - INFO - __main__ - Step 32113: {'lr': 0.00045064529146794234, 'samples': 6165696, 'steps': 32112, 'loss/train': 2.1921451091766357} 11/07/2021 01:46:40 - INFO - __main__ - Step 32114: {'lr': 0.0004506421257218955, 'samples': 6165888, 'steps': 32113, 'loss/train': 1.3960522413253784} 11/07/2021 01:46:40 - INFO - __main__ - Step 32115: {'lr': 0.00045063895988544235, 'samples': 6166080, 'steps': 32114, 'loss/train': 1.5505790710449219} 11/07/2021 01:46:41 - INFO - __main__ - Step 32116: {'lr': 0.00045063579395858444, 'samples': 6166272, 'steps': 32115, 'loss/train': 1.7635595798492432} 11/07/2021 01:46:41 - INFO - __main__ - Step 32117: {'lr': 0.0004506326279413231, 'samples': 6166464, 'steps': 32116, 'loss/train': 1.816580891609192} 11/07/2021 01:46:41 - INFO - __main__ - Step 32118: {'lr': 0.0004506294618336598, 'samples': 6166656, 'steps': 32117, 'loss/train': 1.704832911491394} 11/07/2021 01:46:43 - INFO - __main__ - Step 32119: {'lr': 0.00045062629563559595, 'samples': 6166848, 'steps': 32118, 'loss/train': 1.7365407943725586} 11/07/2021 01:46:43 - INFO - __main__ - Step 32120: {'lr': 0.00045062312934713303, 'samples': 6167040, 'steps': 32119, 'loss/train': 1.4368287324905396} 11/07/2021 01:46:43 - INFO - __main__ - Step 32121: {'lr': 0.00045061996296827237, 'samples': 6167232, 'steps': 32120, 'loss/train': 1.7316168546676636} 11/07/2021 01:46:44 - INFO - __main__ - Step 32122: {'lr': 0.00045061679649901543, 'samples': 6167424, 'steps': 32121, 'loss/train': 1.6725994348526} 11/07/2021 01:46:44 - INFO - __main__ - Step 32123: {'lr': 0.00045061362993936374, 'samples': 6167616, 'steps': 32122, 'loss/train': 1.4833842515945435} 11/07/2021 01:46:45 - INFO - __main__ - Step 32124: {'lr': 0.0004506104632893185, 'samples': 6167808, 'steps': 32123, 'loss/train': 1.17832612991333} 11/07/2021 01:46:45 - INFO - __main__ - Step 32125: {'lr': 0.00045060729654888143, 'samples': 6168000, 'steps': 32124, 'loss/train': 0.9483261108398438} 11/07/2021 01:46:46 - INFO - __main__ - Step 32126: {'lr': 0.00045060412971805375, 'samples': 6168192, 'steps': 32125, 'loss/train': 1.0364069938659668} 11/07/2021 01:46:46 - INFO - __main__ - Step 32127: {'lr': 0.00045060096279683694, 'samples': 6168384, 'steps': 32126, 'loss/train': 1.4542152881622314} 11/07/2021 01:46:46 - INFO - __main__ - Step 32128: {'lr': 0.0004505977957852325, 'samples': 6168576, 'steps': 32127, 'loss/train': 1.5777549743652344} 11/07/2021 01:46:47 - INFO - __main__ - Step 32129: {'lr': 0.00045059462868324177, 'samples': 6168768, 'steps': 32128, 'loss/train': 1.4561687707901} 11/07/2021 01:46:48 - INFO - __main__ - Step 32130: {'lr': 0.00045059146149086605, 'samples': 6168960, 'steps': 32129, 'loss/train': 1.6465238332748413} 11/07/2021 01:46:48 - INFO - __main__ - Step 32131: {'lr': 0.00045058829420810707, 'samples': 6169152, 'steps': 32130, 'loss/train': 1.500243902206421} 11/07/2021 01:46:48 - INFO - __main__ - Step 32132: {'lr': 0.00045058512683496607, 'samples': 6169344, 'steps': 32131, 'loss/train': 1.505821943283081} 11/07/2021 01:46:49 - INFO - __main__ - Step 32133: {'lr': 0.00045058195937144446, 'samples': 6169536, 'steps': 32132, 'loss/train': 1.2399656772613525} 11/07/2021 01:46:50 - INFO - __main__ - Step 32134: {'lr': 0.00045057879181754375, 'samples': 6169728, 'steps': 32133, 'loss/train': 1.7505378723144531} 11/07/2021 01:46:50 - INFO - __main__ - Step 32135: {'lr': 0.0004505756241732653, 'samples': 6169920, 'steps': 32134, 'loss/train': 1.483353614807129} 11/07/2021 01:46:51 - INFO - __main__ - Step 32136: {'lr': 0.0004505724564386106, 'samples': 6170112, 'steps': 32135, 'loss/train': 1.8550945520401} 11/07/2021 01:46:51 - INFO - __main__ - Step 32137: {'lr': 0.00045056928861358106, 'samples': 6170304, 'steps': 32136, 'loss/train': 1.6168700456619263} 11/07/2021 01:46:51 - INFO - __main__ - Step 32138: {'lr': 0.000450566120698178, 'samples': 6170496, 'steps': 32137, 'loss/train': 1.487418293952942} 11/07/2021 01:46:52 - INFO - __main__ - Step 32139: {'lr': 0.0004505629526924031, 'samples': 6170688, 'steps': 32138, 'loss/train': 1.502426028251648} 11/07/2021 01:46:53 - INFO - __main__ - Step 32140: {'lr': 0.0004505597845962575, 'samples': 6170880, 'steps': 32139, 'loss/train': 1.7972909212112427} 11/07/2021 01:46:53 - INFO - __main__ - Step 32141: {'lr': 0.0004505566164097428, 'samples': 6171072, 'steps': 32140, 'loss/train': 2.0394623279571533} 11/07/2021 01:46:53 - INFO - __main__ - Step 32142: {'lr': 0.0004505534481328604, 'samples': 6171264, 'steps': 32141, 'loss/train': 1.7296563386917114} 11/07/2021 01:46:54 - INFO - __main__ - Step 32143: {'lr': 0.0004505502797656117, 'samples': 6171456, 'steps': 32142, 'loss/train': 1.802706003189087} 11/07/2021 01:46:55 - INFO - __main__ - Step 32144: {'lr': 0.00045054711130799806, 'samples': 6171648, 'steps': 32143, 'loss/train': 1.3463897705078125} 11/07/2021 01:46:55 - INFO - __main__ - Step 32145: {'lr': 0.00045054394276002106, 'samples': 6171840, 'steps': 32144, 'loss/train': 1.841357946395874} 11/07/2021 01:46:55 - INFO - __main__ - Step 32146: {'lr': 0.00045054077412168215, 'samples': 6172032, 'steps': 32145, 'loss/train': 1.4753705263137817} 11/07/2021 01:46:56 - INFO - __main__ - Step 32147: {'lr': 0.0004505376053929825, 'samples': 6172224, 'steps': 32146, 'loss/train': 1.3590627908706665} 11/07/2021 01:46:56 - INFO - __main__ - Step 32148: {'lr': 0.0004505344365739238, 'samples': 6172416, 'steps': 32147, 'loss/train': 1.6588155031204224} 11/07/2021 01:46:57 - INFO - __main__ - Step 32149: {'lr': 0.0004505312676645073, 'samples': 6172608, 'steps': 32148, 'loss/train': 1.8488324880599976} 11/07/2021 01:46:57 - INFO - __main__ - Step 32150: {'lr': 0.00045052809866473454, 'samples': 6172800, 'steps': 32149, 'loss/train': 2.2110037803649902} 11/07/2021 01:46:58 - INFO - __main__ - Step 32151: {'lr': 0.00045052492957460696, 'samples': 6172992, 'steps': 32150, 'loss/train': 1.240863561630249} 11/07/2021 01:46:58 - INFO - __main__ - Step 32152: {'lr': 0.00045052176039412587, 'samples': 6173184, 'steps': 32151, 'loss/train': 1.3574669361114502} 11/07/2021 01:46:58 - INFO - __main__ - Step 32153: {'lr': 0.0004505185911232928, 'samples': 6173376, 'steps': 32152, 'loss/train': 1.5875046253204346} 11/07/2021 01:47:00 - INFO - __main__ - Step 32154: {'lr': 0.00045051542176210914, 'samples': 6173568, 'steps': 32153, 'loss/train': 1.974134922027588} 11/07/2021 01:47:00 - INFO - __main__ - Step 32155: {'lr': 0.0004505122523105764, 'samples': 6173760, 'steps': 32154, 'loss/train': 1.2119091749191284} 11/07/2021 01:47:00 - INFO - __main__ - Step 32156: {'lr': 0.00045050908276869585, 'samples': 6173952, 'steps': 32155, 'loss/train': 2.0083765983581543} 11/07/2021 01:47:01 - INFO - __main__ - Step 32157: {'lr': 0.0004505059131364689, 'samples': 6174144, 'steps': 32156, 'loss/train': 1.6518559455871582} 11/07/2021 01:47:01 - INFO - __main__ - Step 32158: {'lr': 0.00045050274341389726, 'samples': 6174336, 'steps': 32157, 'loss/train': 1.3425853252410889} 11/07/2021 01:47:01 - INFO - __main__ - Step 32159: {'lr': 0.00045049957360098207, 'samples': 6174528, 'steps': 32158, 'loss/train': 5.946632385253906} 11/07/2021 01:47:02 - INFO - __main__ - Step 32160: {'lr': 0.0004504964036977249, 'samples': 6174720, 'steps': 32159, 'loss/train': 1.4935715198516846} 11/07/2021 01:47:03 - INFO - __main__ - Step 32161: {'lr': 0.00045049323370412723, 'samples': 6174912, 'steps': 32160, 'loss/train': 1.4618717432022095} 11/07/2021 01:47:03 - INFO - __main__ - Step 32162: {'lr': 0.0004504900636201903, 'samples': 6175104, 'steps': 32161, 'loss/train': 1.820804476737976} 11/07/2021 01:47:03 - INFO - __main__ - Step 32163: {'lr': 0.00045048689344591566, 'samples': 6175296, 'steps': 32162, 'loss/train': 1.4637774229049683} 11/07/2021 01:47:04 - INFO - __main__ - Step 32164: {'lr': 0.0004504837231813047, 'samples': 6175488, 'steps': 32163, 'loss/train': 1.688586711883545} 11/07/2021 01:47:05 - INFO - __main__ - Step 32165: {'lr': 0.0004504805528263589, 'samples': 6175680, 'steps': 32164, 'loss/train': 1.6876972913742065} 11/07/2021 01:47:05 - INFO - __main__ - Step 32166: {'lr': 0.00045047738238107967, 'samples': 6175872, 'steps': 32165, 'loss/train': 1.708917260169983} 11/07/2021 01:47:05 - INFO - __main__ - Step 32167: {'lr': 0.00045047421184546844, 'samples': 6176064, 'steps': 32166, 'loss/train': 1.567901611328125} 11/07/2021 01:47:06 - INFO - __main__ - Step 32168: {'lr': 0.0004504710412195265, 'samples': 6176256, 'steps': 32167, 'loss/train': 1.5221161842346191} 11/07/2021 01:47:06 - INFO - __main__ - Step 32169: {'lr': 0.00045046787050325555, 'samples': 6176448, 'steps': 32168, 'loss/train': 0.7906864285469055} 11/07/2021 01:47:07 - INFO - __main__ - Step 32170: {'lr': 0.0004504646996966568, 'samples': 6176640, 'steps': 32169, 'loss/train': 1.4437230825424194} 11/07/2021 01:47:07 - INFO - __main__ - Step 32171: {'lr': 0.0004504615287997318, 'samples': 6176832, 'steps': 32170, 'loss/train': 1.4785854816436768} 11/07/2021 01:47:08 - INFO - __main__ - Step 32172: {'lr': 0.00045045835781248184, 'samples': 6177024, 'steps': 32171, 'loss/train': 1.773765206336975} 11/07/2021 01:47:08 - INFO - __main__ - Step 32173: {'lr': 0.0004504551867349085, 'samples': 6177216, 'steps': 32172, 'loss/train': 1.5836315155029297} 11/07/2021 01:47:09 - INFO - __main__ - Step 32174: {'lr': 0.0004504520155670131, 'samples': 6177408, 'steps': 32173, 'loss/train': 1.4165898561477661} 11/07/2021 01:47:09 - INFO - __main__ - Step 32175: {'lr': 0.0004504488443087972, 'samples': 6177600, 'steps': 32174, 'loss/train': 1.5395853519439697} 11/07/2021 01:47:10 - INFO - __main__ - Step 32176: {'lr': 0.00045044567296026206, 'samples': 6177792, 'steps': 32175, 'loss/train': 1.4260025024414062} 11/07/2021 01:47:10 - INFO - __main__ - Step 32177: {'lr': 0.0004504425015214092, 'samples': 6177984, 'steps': 32176, 'loss/train': 1.4530431032180786} 11/07/2021 01:47:11 - INFO - __main__ - Step 32178: {'lr': 0.00045043932999224015, 'samples': 6178176, 'steps': 32177, 'loss/train': 1.5154881477355957} 11/07/2021 01:47:11 - INFO - __main__ - Step 32179: {'lr': 0.00045043615837275607, 'samples': 6178368, 'steps': 32178, 'loss/train': 1.7684738636016846} 11/07/2021 01:47:12 - INFO - __main__ - Step 32180: {'lr': 0.0004504329866629586, 'samples': 6178560, 'steps': 32179, 'loss/train': 1.756452202796936} 11/07/2021 01:47:13 - INFO - __main__ - Step 32181: {'lr': 0.0004504298148628492, 'samples': 6178752, 'steps': 32180, 'loss/train': 0.48151570558547974} 11/07/2021 01:47:13 - INFO - __main__ - Step 32182: {'lr': 0.0004504266429724292, 'samples': 6178944, 'steps': 32181, 'loss/train': 1.5893335342407227} 11/07/2021 01:47:13 - INFO - __main__ - Step 32183: {'lr': 0.0004504234709917, 'samples': 6179136, 'steps': 32182, 'loss/train': 1.4612712860107422} 11/07/2021 01:47:14 - INFO - __main__ - Step 32184: {'lr': 0.00045042029892066306, 'samples': 6179328, 'steps': 32183, 'loss/train': 1.3556455373764038} 11/07/2021 01:47:14 - INFO - __main__ - Step 32185: {'lr': 0.00045041712675931983, 'samples': 6179520, 'steps': 32184, 'loss/train': 1.6092675924301147} 11/07/2021 01:47:15 - INFO - __main__ - Step 32186: {'lr': 0.0004504139545076717, 'samples': 6179712, 'steps': 32185, 'loss/train': 1.374077320098877} 11/07/2021 01:47:15 - INFO - __main__ - Step 32187: {'lr': 0.0004504107821657203, 'samples': 6179904, 'steps': 32186, 'loss/train': 1.4730110168457031} 11/07/2021 01:47:16 - INFO - __main__ - Step 32188: {'lr': 0.00045040760973346673, 'samples': 6180096, 'steps': 32187, 'loss/train': 1.5549176931381226} 11/07/2021 01:47:16 - INFO - __main__ - Step 32189: {'lr': 0.00045040443721091266, 'samples': 6180288, 'steps': 32188, 'loss/train': 1.3511254787445068} 11/07/2021 01:47:16 - INFO - __main__ - Step 32190: {'lr': 0.0004504012645980594, 'samples': 6180480, 'steps': 32189, 'loss/train': 1.49689519405365} 11/07/2021 01:47:17 - INFO - __main__ - Step 32191: {'lr': 0.0004503980918949085, 'samples': 6180672, 'steps': 32190, 'loss/train': 1.4713667631149292} 11/07/2021 01:47:18 - INFO - __main__ - Step 32192: {'lr': 0.00045039491910146124, 'samples': 6180864, 'steps': 32191, 'loss/train': 1.3565356731414795} 11/07/2021 01:47:18 - INFO - __main__ - Step 32193: {'lr': 0.00045039174621771915, 'samples': 6181056, 'steps': 32192, 'loss/train': 2.052945852279663} 11/07/2021 01:47:18 - INFO - __main__ - Step 32194: {'lr': 0.00045038857324368367, 'samples': 6181248, 'steps': 32193, 'loss/train': 1.5134190320968628} 11/07/2021 01:47:19 - INFO - __main__ - Step 32195: {'lr': 0.0004503854001793561, 'samples': 6181440, 'steps': 32194, 'loss/train': 1.8116105794906616} 11/07/2021 01:47:20 - INFO - __main__ - Step 32196: {'lr': 0.00045038222702473797, 'samples': 6181632, 'steps': 32195, 'loss/train': 1.4274109601974487} 11/07/2021 01:47:20 - INFO - __main__ - Step 32197: {'lr': 0.0004503790537798308, 'samples': 6181824, 'steps': 32196, 'loss/train': 1.777016520500183} 11/07/2021 01:47:21 - INFO - __main__ - Step 32198: {'lr': 0.00045037588044463586, 'samples': 6182016, 'steps': 32197, 'loss/train': 1.5776311159133911} 11/07/2021 01:47:21 - INFO - __main__ - Step 32199: {'lr': 0.00045037270701915464, 'samples': 6182208, 'steps': 32198, 'loss/train': 1.589194655418396} 11/07/2021 01:47:21 - INFO - __main__ - Step 32200: {'lr': 0.0004503695335033885, 'samples': 6182400, 'steps': 32199, 'loss/train': 3.5294759273529053} 11/07/2021 01:47:22 - INFO - __main__ - Step 32201: {'lr': 0.00045036635989733904, 'samples': 6182592, 'steps': 32200, 'loss/train': 0.16695256531238556} 11/07/2021 01:47:22 - INFO - __main__ - Step 32202: {'lr': 0.0004503631862010076, 'samples': 6182784, 'steps': 32201, 'loss/train': 1.8757727146148682} 11/07/2021 01:47:23 - INFO - __main__ - Step 32203: {'lr': 0.0004503600124143955, 'samples': 6182976, 'steps': 32202, 'loss/train': 1.8070666790008545} 11/07/2021 01:47:24 - INFO - __main__ - Step 32204: {'lr': 0.0004503568385375043, 'samples': 6183168, 'steps': 32203, 'loss/train': 1.2195488214492798} 11/07/2021 01:47:24 - INFO - __main__ - Step 32205: {'lr': 0.00045035366457033546, 'samples': 6183360, 'steps': 32204, 'loss/train': 1.3440762758255005} 11/07/2021 01:47:24 - INFO - __main__ - Step 32206: {'lr': 0.00045035049051289037, 'samples': 6183552, 'steps': 32205, 'loss/train': 1.389227032661438} 11/07/2021 01:47:25 - INFO - __main__ - Step 32207: {'lr': 0.00045034731636517036, 'samples': 6183744, 'steps': 32206, 'loss/train': 1.765787959098816} 11/07/2021 01:47:26 - INFO - __main__ - Step 32208: {'lr': 0.0004503441421271769, 'samples': 6183936, 'steps': 32207, 'loss/train': 1.577767014503479} 11/07/2021 01:47:26 - INFO - __main__ - Step 32209: {'lr': 0.0004503409677989115, 'samples': 6184128, 'steps': 32208, 'loss/train': 1.5536357164382935} 11/07/2021 01:47:27 - INFO - __main__ - Step 32210: {'lr': 0.00045033779338037565, 'samples': 6184320, 'steps': 32209, 'loss/train': 3.0786328315734863} 11/07/2021 01:47:27 - INFO - __main__ - Step 32211: {'lr': 0.0004503346188715706, 'samples': 6184512, 'steps': 32210, 'loss/train': 1.5683656930923462} 11/07/2021 01:47:27 - INFO - __main__ - Step 32212: {'lr': 0.0004503314442724979, 'samples': 6184704, 'steps': 32211, 'loss/train': 1.1409876346588135} 11/07/2021 01:47:28 - INFO - __main__ - Step 32213: {'lr': 0.0004503282695831589, 'samples': 6184896, 'steps': 32212, 'loss/train': 2.376875400543213} 11/07/2021 01:47:29 - INFO - __main__ - Step 32214: {'lr': 0.0004503250948035551, 'samples': 6185088, 'steps': 32213, 'loss/train': 1.9134149551391602} 11/07/2021 01:47:29 - INFO - __main__ - Step 32215: {'lr': 0.0004503219199336879, 'samples': 6185280, 'steps': 32214, 'loss/train': 0.9256809949874878} 11/07/2021 01:47:30 - INFO - __main__ - Step 32216: {'lr': 0.00045031874497355876, 'samples': 6185472, 'steps': 32215, 'loss/train': 1.794251799583435} 11/07/2021 01:47:30 - INFO - __main__ - Step 32217: {'lr': 0.000450315569923169, 'samples': 6185664, 'steps': 32216, 'loss/train': 1.9647176265716553} 11/07/2021 01:47:30 - INFO - __main__ - Step 32218: {'lr': 0.00045031239478252017, 'samples': 6185856, 'steps': 32217, 'loss/train': 1.6685179471969604} 11/07/2021 01:47:31 - INFO - __main__ - Step 32219: {'lr': 0.00045030921955161373, 'samples': 6186048, 'steps': 32218, 'loss/train': 1.7403217554092407} 11/07/2021 01:47:32 - INFO - __main__ - Step 32220: {'lr': 0.000450306044230451, 'samples': 6186240, 'steps': 32219, 'loss/train': 1.0186405181884766} 11/07/2021 01:47:32 - INFO - __main__ - Step 32221: {'lr': 0.0004503028688190335, 'samples': 6186432, 'steps': 32220, 'loss/train': 1.6435362100601196} 11/07/2021 01:47:32 - INFO - __main__ - Step 32222: {'lr': 0.00045029969331736254, 'samples': 6186624, 'steps': 32221, 'loss/train': 1.9964864253997803} 11/07/2021 01:47:33 - INFO - __main__ - Step 32223: {'lr': 0.00045029651772543965, 'samples': 6186816, 'steps': 32222, 'loss/train': 1.9309375286102295} 11/07/2021 01:47:34 - INFO - __main__ - Step 32224: {'lr': 0.0004502933420432662, 'samples': 6187008, 'steps': 32223, 'loss/train': 1.053402304649353} 11/07/2021 01:47:34 - INFO - __main__ - Step 32225: {'lr': 0.0004502901662708437, 'samples': 6187200, 'steps': 32224, 'loss/train': 1.0154645442962646} 11/07/2021 01:47:34 - INFO - __main__ - Step 32226: {'lr': 0.0004502869904081736, 'samples': 6187392, 'steps': 32225, 'loss/train': 1.5956135988235474} 11/07/2021 01:47:35 - INFO - __main__ - Step 32227: {'lr': 0.00045028381445525725, 'samples': 6187584, 'steps': 32226, 'loss/train': 1.768977403640747} 11/07/2021 01:47:35 - INFO - __main__ - Step 32228: {'lr': 0.0004502806384120961, 'samples': 6187776, 'steps': 32227, 'loss/train': 1.180979609489441} 11/07/2021 01:47:36 - INFO - __main__ - Step 32229: {'lr': 0.0004502774622786915, 'samples': 6187968, 'steps': 32228, 'loss/train': 1.636723518371582} 11/07/2021 01:47:36 - INFO - __main__ - Step 32230: {'lr': 0.00045027428605504507, 'samples': 6188160, 'steps': 32229, 'loss/train': 1.6560455560684204} 11/07/2021 01:47:37 - INFO - __main__ - Step 32231: {'lr': 0.00045027110974115814, 'samples': 6188352, 'steps': 32230, 'loss/train': 1.9419082403182983} 11/07/2021 01:47:37 - INFO - __main__ - Step 32232: {'lr': 0.0004502679333370321, 'samples': 6188544, 'steps': 32231, 'loss/train': 1.2935874462127686} 11/07/2021 01:47:37 - INFO - __main__ - Step 32233: {'lr': 0.0004502647568426684, 'samples': 6188736, 'steps': 32232, 'loss/train': 1.598137378692627} 11/07/2021 01:47:38 - INFO - __main__ - Step 32234: {'lr': 0.0004502615802580685, 'samples': 6188928, 'steps': 32233, 'loss/train': 1.5005950927734375} 11/07/2021 01:47:39 - INFO - __main__ - Step 32235: {'lr': 0.0004502584035832338, 'samples': 6189120, 'steps': 32234, 'loss/train': 1.0328993797302246} 11/07/2021 01:47:39 - INFO - __main__ - Step 32236: {'lr': 0.00045025522681816586, 'samples': 6189312, 'steps': 32235, 'loss/train': 1.4519612789154053} 11/07/2021 01:47:39 - INFO - __main__ - Step 32237: {'lr': 0.0004502520499628659, 'samples': 6189504, 'steps': 32236, 'loss/train': 1.620317816734314} 11/07/2021 01:47:40 - INFO - __main__ - Step 32238: {'lr': 0.00045024887301733555, 'samples': 6189696, 'steps': 32237, 'loss/train': 1.6385005712509155} 11/07/2021 01:47:41 - INFO - __main__ - Step 32239: {'lr': 0.0004502456959815761, 'samples': 6189888, 'steps': 32238, 'loss/train': 1.7109335660934448} 11/07/2021 01:47:41 - INFO - __main__ - Step 32240: {'lr': 0.000450242518855589, 'samples': 6190080, 'steps': 32239, 'loss/train': 0.9207674860954285} 11/07/2021 01:47:42 - INFO - __main__ - Step 32241: {'lr': 0.00045023934163937565, 'samples': 6190272, 'steps': 32240, 'loss/train': 1.6926556825637817} 11/07/2021 01:47:42 - INFO - __main__ - Step 32242: {'lr': 0.00045023616433293763, 'samples': 6190464, 'steps': 32241, 'loss/train': 1.55082106590271} 11/07/2021 01:47:42 - INFO - __main__ - Step 32243: {'lr': 0.00045023298693627626, 'samples': 6190656, 'steps': 32242, 'loss/train': 1.2392164468765259} 11/07/2021 01:47:43 - INFO - __main__ - Step 32244: {'lr': 0.000450229809449393, 'samples': 6190848, 'steps': 32243, 'loss/train': 1.5431742668151855} 11/07/2021 01:47:44 - INFO - __main__ - Step 32245: {'lr': 0.00045022663187228927, 'samples': 6191040, 'steps': 32244, 'loss/train': 1.3102967739105225} 11/07/2021 01:47:44 - INFO - __main__ - Step 32246: {'lr': 0.0004502234542049666, 'samples': 6191232, 'steps': 32245, 'loss/train': 1.9321751594543457} 11/07/2021 01:47:44 - INFO - __main__ - Step 32247: {'lr': 0.00045022027644742624, 'samples': 6191424, 'steps': 32246, 'loss/train': 1.9687552452087402} 11/07/2021 01:47:45 - INFO - __main__ - Step 32248: {'lr': 0.0004502170985996697, 'samples': 6191616, 'steps': 32247, 'loss/train': 1.6017018556594849} 11/07/2021 01:47:45 - INFO - __main__ - Step 32249: {'lr': 0.00045021392066169844, 'samples': 6191808, 'steps': 32248, 'loss/train': 1.8045872449874878} 11/07/2021 01:47:46 - INFO - __main__ - Step 32250: {'lr': 0.0004502107426335139, 'samples': 6192000, 'steps': 32249, 'loss/train': 1.603269338607788} 11/07/2021 01:47:47 - INFO - __main__ - Step 32251: {'lr': 0.0004502075645151175, 'samples': 6192192, 'steps': 32250, 'loss/train': 1.9542632102966309} 11/07/2021 01:47:47 - INFO - __main__ - Step 32252: {'lr': 0.0004502043863065106, 'samples': 6192384, 'steps': 32251, 'loss/train': 1.8218039274215698} 11/07/2021 01:47:47 - INFO - __main__ - Step 32253: {'lr': 0.00045020120800769474, 'samples': 6192576, 'steps': 32252, 'loss/train': 1.427304744720459} 11/07/2021 01:47:48 - INFO - __main__ - Step 32254: {'lr': 0.0004501980296186713, 'samples': 6192768, 'steps': 32253, 'loss/train': 1.3513579368591309} 11/07/2021 01:47:48 - INFO - __main__ - Step 32255: {'lr': 0.0004501948511394417, 'samples': 6192960, 'steps': 32254, 'loss/train': 1.3751741647720337} 11/07/2021 01:47:49 - INFO - __main__ - Step 32256: {'lr': 0.0004501916725700074, 'samples': 6193152, 'steps': 32255, 'loss/train': 1.3976471424102783} 11/07/2021 01:47:50 - INFO - __main__ - Step 32257: {'lr': 0.00045018849391036987, 'samples': 6193344, 'steps': 32256, 'loss/train': 1.300757646560669} 11/07/2021 01:47:50 - INFO - __main__ - Step 32258: {'lr': 0.00045018531516053046, 'samples': 6193536, 'steps': 32257, 'loss/train': 1.3747690916061401} 11/07/2021 01:47:50 - INFO - __main__ - Step 32259: {'lr': 0.0004501821363204906, 'samples': 6193728, 'steps': 32258, 'loss/train': 1.2249864339828491} 11/07/2021 01:47:51 - INFO - __main__ - Step 32260: {'lr': 0.00045017895739025185, 'samples': 6193920, 'steps': 32259, 'loss/train': 0.8205302357673645} 11/07/2021 01:47:52 - INFO - __main__ - Step 32261: {'lr': 0.0004501757783698154, 'samples': 6194112, 'steps': 32260, 'loss/train': 1.5389779806137085} 11/07/2021 01:47:52 - INFO - __main__ - Step 32262: {'lr': 0.00045017259925918295, 'samples': 6194304, 'steps': 32261, 'loss/train': 1.7591694593429565} 11/07/2021 01:47:52 - INFO - __main__ - Step 32263: {'lr': 0.0004501694200583558, 'samples': 6194496, 'steps': 32262, 'loss/train': 1.3185373544692993} 11/07/2021 01:47:53 - INFO - __main__ - Step 32264: {'lr': 0.0004501662407673354, 'samples': 6194688, 'steps': 32263, 'loss/train': 1.338904857635498} 11/07/2021 01:47:53 - INFO - __main__ - Step 32265: {'lr': 0.00045016306138612313, 'samples': 6194880, 'steps': 32264, 'loss/train': 1.3383913040161133} 11/07/2021 01:47:54 - INFO - __main__ - Step 32266: {'lr': 0.0004501598819147205, 'samples': 6195072, 'steps': 32265, 'loss/train': 2.0385541915893555} 11/07/2021 01:47:54 - INFO - __main__ - Step 32267: {'lr': 0.00045015670235312895, 'samples': 6195264, 'steps': 32266, 'loss/train': 1.398764729499817} 11/07/2021 01:47:55 - INFO - __main__ - Step 32268: {'lr': 0.0004501535227013498, 'samples': 6195456, 'steps': 32267, 'loss/train': 1.4296706914901733} 11/07/2021 01:47:55 - INFO - __main__ - Step 32269: {'lr': 0.0004501503429593846, 'samples': 6195648, 'steps': 32268, 'loss/train': 2.022183418273926} 11/07/2021 01:47:55 - INFO - __main__ - Step 32270: {'lr': 0.0004501471631272348, 'samples': 6195840, 'steps': 32269, 'loss/train': 1.373524785041809} 11/07/2021 01:47:57 - INFO - __main__ - Step 32271: {'lr': 0.00045014398320490173, 'samples': 6196032, 'steps': 32270, 'loss/train': 1.0820329189300537} 11/07/2021 01:47:57 - INFO - __main__ - Step 32272: {'lr': 0.00045014080319238686, 'samples': 6196224, 'steps': 32271, 'loss/train': 1.12027907371521} 11/07/2021 01:47:57 - INFO - __main__ - Step 32273: {'lr': 0.00045013762308969164, 'samples': 6196416, 'steps': 32272, 'loss/train': 1.7232547998428345} 11/07/2021 01:47:58 - INFO - __main__ - Step 32274: {'lr': 0.00045013444289681757, 'samples': 6196608, 'steps': 32273, 'loss/train': 1.3769302368164062} 11/07/2021 01:47:58 - INFO - __main__ - Step 32275: {'lr': 0.0004501312626137659, 'samples': 6196800, 'steps': 32274, 'loss/train': 1.300300121307373} 11/07/2021 01:47:59 - INFO - __main__ - Step 32276: {'lr': 0.0004501280822405382, 'samples': 6196992, 'steps': 32275, 'loss/train': 1.6742898225784302} 11/07/2021 01:47:59 - INFO - __main__ - Step 32277: {'lr': 0.00045012490177713586, 'samples': 6197184, 'steps': 32276, 'loss/train': 1.5575422048568726} 11/07/2021 01:48:00 - INFO - __main__ - Step 32278: {'lr': 0.00045012172122356036, 'samples': 6197376, 'steps': 32277, 'loss/train': 1.6483980417251587} 11/07/2021 01:48:00 - INFO - __main__ - Step 32279: {'lr': 0.0004501185405798131, 'samples': 6197568, 'steps': 32278, 'loss/train': 1.6229832172393799} 11/07/2021 01:48:00 - INFO - __main__ - Step 32280: {'lr': 0.00045011535984589544, 'samples': 6197760, 'steps': 32279, 'loss/train': 1.3071985244750977} 11/07/2021 01:48:01 - INFO - __main__ - Step 32281: {'lr': 0.000450112179021809, 'samples': 6197952, 'steps': 32280, 'loss/train': 1.3725543022155762} 11/07/2021 01:48:02 - INFO - __main__ - Step 32282: {'lr': 0.00045010899810755506, 'samples': 6198144, 'steps': 32281, 'loss/train': 1.4773143529891968} 11/07/2021 01:48:02 - INFO - __main__ - Step 32283: {'lr': 0.00045010581710313506, 'samples': 6198336, 'steps': 32282, 'loss/train': 1.4009032249450684} 11/07/2021 01:48:02 - INFO - __main__ - Step 32284: {'lr': 0.0004501026360085505, 'samples': 6198528, 'steps': 32283, 'loss/train': 1.030112624168396} 11/07/2021 01:48:03 - INFO - __main__ - Step 32285: {'lr': 0.0004500994548238028, 'samples': 6198720, 'steps': 32284, 'loss/train': 1.2472800016403198} 11/07/2021 01:48:03 - INFO - __main__ - Step 32286: {'lr': 0.00045009627354889337, 'samples': 6198912, 'steps': 32285, 'loss/train': 1.8431708812713623} 11/07/2021 01:48:04 - INFO - __main__ - Step 32287: {'lr': 0.0004500930921838236, 'samples': 6199104, 'steps': 32286, 'loss/train': 1.1655179262161255} 11/07/2021 01:48:05 - INFO - __main__ - Step 32288: {'lr': 0.000450089910728595, 'samples': 6199296, 'steps': 32287, 'loss/train': 0.7153118252754211} 11/07/2021 01:48:05 - INFO - __main__ - Step 32289: {'lr': 0.0004500867291832089, 'samples': 6199488, 'steps': 32288, 'loss/train': 1.3063770532608032} 11/07/2021 01:48:05 - INFO - __main__ - Step 32290: {'lr': 0.00045008354754766687, 'samples': 6199680, 'steps': 32289, 'loss/train': 1.4782352447509766} 11/07/2021 01:48:06 - INFO - __main__ - Step 32291: {'lr': 0.0004500803658219703, 'samples': 6199872, 'steps': 32290, 'loss/train': 1.7052644491195679} 11/07/2021 01:48:07 - INFO - __main__ - Step 32292: {'lr': 0.0004500771840061206, 'samples': 6200064, 'steps': 32291, 'loss/train': 1.9112358093261719} 11/07/2021 01:48:07 - INFO - __main__ - Step 32293: {'lr': 0.00045007400210011925, 'samples': 6200256, 'steps': 32292, 'loss/train': 2.1962902545928955} 11/07/2021 01:48:07 - INFO - __main__ - Step 32294: {'lr': 0.0004500708201039676, 'samples': 6200448, 'steps': 32293, 'loss/train': 1.5316219329833984} 11/07/2021 01:48:08 - INFO - __main__ - Step 32295: {'lr': 0.0004500676380176671, 'samples': 6200640, 'steps': 32294, 'loss/train': 1.7628265619277954} 11/07/2021 01:48:08 - INFO - __main__ - Step 32296: {'lr': 0.00045006445584121923, 'samples': 6200832, 'steps': 32295, 'loss/train': 1.9976251125335693} 11/07/2021 01:48:09 - INFO - __main__ - Step 32297: {'lr': 0.00045006127357462533, 'samples': 6201024, 'steps': 32296, 'loss/train': 1.5440441370010376} 11/07/2021 01:48:09 - INFO - __main__ - Step 32298: {'lr': 0.000450058091217887, 'samples': 6201216, 'steps': 32297, 'loss/train': 1.672995924949646} 11/07/2021 01:48:10 - INFO - __main__ - Step 32299: {'lr': 0.0004500549087710056, 'samples': 6201408, 'steps': 32298, 'loss/train': 1.8858267068862915} 11/07/2021 01:48:10 - INFO - __main__ - Step 32300: {'lr': 0.0004500517262339825, 'samples': 6201600, 'steps': 32299, 'loss/train': 1.860845685005188} 11/07/2021 01:48:11 - INFO - __main__ - Step 32301: {'lr': 0.0004500485436068191, 'samples': 6201792, 'steps': 32300, 'loss/train': 1.3300939798355103} 11/07/2021 01:48:12 - INFO - __main__ - Step 32302: {'lr': 0.0004500453608895171, 'samples': 6201984, 'steps': 32301, 'loss/train': 1.4846974611282349} 11/07/2021 01:48:12 - INFO - __main__ - Step 32303: {'lr': 0.00045004217808207757, 'samples': 6202176, 'steps': 32302, 'loss/train': 1.9711004495620728} 11/07/2021 01:48:12 - INFO - __main__ - Step 32304: {'lr': 0.0004500389951845022, 'samples': 6202368, 'steps': 32303, 'loss/train': 0.865800678730011} 11/07/2021 01:48:13 - INFO - __main__ - Step 32305: {'lr': 0.00045003581219679235, 'samples': 6202560, 'steps': 32304, 'loss/train': 1.3765199184417725} 11/07/2021 01:48:13 - INFO - __main__ - Step 32306: {'lr': 0.00045003262911894943, 'samples': 6202752, 'steps': 32305, 'loss/train': 1.862096905708313} 11/07/2021 01:48:14 - INFO - __main__ - Step 32307: {'lr': 0.00045002944595097494, 'samples': 6202944, 'steps': 32306, 'loss/train': 1.1746426820755005} 11/07/2021 01:48:14 - INFO - __main__ - Step 32308: {'lr': 0.00045002626269287024, 'samples': 6203136, 'steps': 32307, 'loss/train': 1.4557210206985474} 11/07/2021 01:48:15 - INFO - __main__ - Step 32309: {'lr': 0.00045002307934463673, 'samples': 6203328, 'steps': 32308, 'loss/train': 1.4870140552520752} 11/07/2021 01:48:15 - INFO - __main__ - Step 32310: {'lr': 0.000450019895906276, 'samples': 6203520, 'steps': 32309, 'loss/train': 1.0892274379730225} 11/07/2021 01:48:15 - INFO - __main__ - Step 32311: {'lr': 0.0004500167123777894, 'samples': 6203712, 'steps': 32310, 'loss/train': 1.544129729270935} 11/07/2021 01:48:16 - INFO - __main__ - Step 32312: {'lr': 0.00045001352875917824, 'samples': 6203904, 'steps': 32311, 'loss/train': 1.6808295249938965} 11/07/2021 01:48:17 - INFO - __main__ - Step 32313: {'lr': 0.00045001034505044415, 'samples': 6204096, 'steps': 32312, 'loss/train': 1.686047077178955} 11/07/2021 01:48:17 - INFO - __main__ - Step 32314: {'lr': 0.00045000716125158846, 'samples': 6204288, 'steps': 32313, 'loss/train': 1.6952117681503296} 11/07/2021 01:48:18 - INFO - __main__ - Step 32315: {'lr': 0.0004500039773626127, 'samples': 6204480, 'steps': 32314, 'loss/train': 1.413800835609436} 11/07/2021 01:48:18 - INFO - __main__ - Step 32316: {'lr': 0.00045000079338351805, 'samples': 6204672, 'steps': 32315, 'loss/train': 1.6651508808135986} 11/07/2021 01:48:19 - INFO - __main__ - Step 32317: {'lr': 0.0004499976093143063, 'samples': 6204864, 'steps': 32316, 'loss/train': 1.5840280055999756} 11/07/2021 01:48:19 - INFO - __main__ - Step 32318: {'lr': 0.00044999442515497866, 'samples': 6205056, 'steps': 32317, 'loss/train': 1.7664008140563965} 11/07/2021 01:48:20 - INFO - __main__ - Step 32319: {'lr': 0.0004499912409055367, 'samples': 6205248, 'steps': 32318, 'loss/train': 1.6453135013580322} 11/07/2021 01:48:20 - INFO - __main__ - Step 32320: {'lr': 0.0004499880565659816, 'samples': 6205440, 'steps': 32319, 'loss/train': 1.4619554281234741} 11/07/2021 01:48:20 - INFO - __main__ - Step 32321: {'lr': 0.0004499848721363151, 'samples': 6205632, 'steps': 32320, 'loss/train': 2.0209126472473145} 11/07/2021 01:48:21 - INFO - __main__ - Step 32322: {'lr': 0.0004499816876165385, 'samples': 6205824, 'steps': 32321, 'loss/train': 1.6794594526290894} 11/07/2021 01:48:22 - INFO - __main__ - Step 32323: {'lr': 0.0004499785030066532, 'samples': 6206016, 'steps': 32322, 'loss/train': 1.405039668083191} 11/07/2021 01:48:22 - INFO - __main__ - Step 32324: {'lr': 0.00044997531830666073, 'samples': 6206208, 'steps': 32323, 'loss/train': 0.8761853575706482} 11/07/2021 01:48:23 - INFO - __main__ - Step 32325: {'lr': 0.00044997213351656237, 'samples': 6206400, 'steps': 32324, 'loss/train': 1.3782182931900024} 11/07/2021 01:48:23 - INFO - __main__ - Step 32326: {'lr': 0.00044996894863635965, 'samples': 6206592, 'steps': 32325, 'loss/train': 1.345014214515686} 11/07/2021 01:48:23 - INFO - __main__ - Step 32327: {'lr': 0.00044996576366605415, 'samples': 6206784, 'steps': 32326, 'loss/train': 1.9174363613128662} 11/07/2021 01:48:24 - INFO - __main__ - Step 32328: {'lr': 0.00044996257860564705, 'samples': 6206976, 'steps': 32327, 'loss/train': 1.2590142488479614} 11/07/2021 01:48:25 - INFO - __main__ - Step 32329: {'lr': 0.0004499593934551399, 'samples': 6207168, 'steps': 32328, 'loss/train': 3.155000686645508} 11/07/2021 01:48:25 - INFO - __main__ - Step 32330: {'lr': 0.00044995620821453416, 'samples': 6207360, 'steps': 32329, 'loss/train': 1.9484862089157104} 11/07/2021 01:48:25 - INFO - __main__ - Step 32331: {'lr': 0.00044995302288383123, 'samples': 6207552, 'steps': 32330, 'loss/train': 1.1827287673950195} 11/07/2021 01:48:26 - INFO - __main__ - Step 32332: {'lr': 0.0004499498374630325, 'samples': 6207744, 'steps': 32331, 'loss/train': 0.9975447654724121} 11/07/2021 01:48:26 - INFO - __main__ - Step 32333: {'lr': 0.0004499466519521396, 'samples': 6207936, 'steps': 32332, 'loss/train': 1.891614556312561} 11/07/2021 01:48:27 - INFO - __main__ - Step 32334: {'lr': 0.00044994346635115367, 'samples': 6208128, 'steps': 32333, 'loss/train': 1.2407118082046509} 11/07/2021 01:48:28 - INFO - __main__ - Step 32335: {'lr': 0.00044994028066007636, 'samples': 6208320, 'steps': 32334, 'loss/train': 0.8754911422729492} 11/07/2021 01:48:28 - INFO - __main__ - Step 32336: {'lr': 0.00044993709487890906, 'samples': 6208512, 'steps': 32335, 'loss/train': 1.402904748916626} 11/07/2021 01:48:28 - INFO - __main__ - Step 32337: {'lr': 0.0004499339090076532, 'samples': 6208704, 'steps': 32336, 'loss/train': 2.068150520324707} 11/07/2021 01:48:29 - INFO - __main__ - Step 32338: {'lr': 0.0004499307230463102, 'samples': 6208896, 'steps': 32337, 'loss/train': 1.460711121559143} 11/07/2021 01:48:30 - INFO - __main__ - Step 32339: {'lr': 0.0004499275369948814, 'samples': 6209088, 'steps': 32338, 'loss/train': 1.7848551273345947} 11/07/2021 01:48:30 - INFO - __main__ - Step 32340: {'lr': 0.0004499243508533685, 'samples': 6209280, 'steps': 32339, 'loss/train': 1.4753457307815552} 11/07/2021 01:48:30 - INFO - __main__ - Step 32341: {'lr': 0.0004499211646217727, 'samples': 6209472, 'steps': 32340, 'loss/train': 1.7661265134811401} 11/07/2021 01:48:31 - INFO - __main__ - Step 32342: {'lr': 0.00044991797830009543, 'samples': 6209664, 'steps': 32341, 'loss/train': 2.685377597808838} 11/07/2021 01:48:31 - INFO - __main__ - Step 32343: {'lr': 0.00044991479188833826, 'samples': 6209856, 'steps': 32342, 'loss/train': 1.4869948625564575} 11/07/2021 01:48:32 - INFO - __main__ - Step 32344: {'lr': 0.0004499116053865026, 'samples': 6210048, 'steps': 32343, 'loss/train': 1.4335551261901855} 11/07/2021 01:48:32 - INFO - __main__ - Step 32345: {'lr': 0.0004499084187945899, 'samples': 6210240, 'steps': 32344, 'loss/train': 1.3646057844161987} 11/07/2021 01:48:33 - INFO - __main__ - Step 32346: {'lr': 0.0004499052321126015, 'samples': 6210432, 'steps': 32345, 'loss/train': 1.448341965675354} 11/07/2021 01:48:33 - INFO - __main__ - Step 32347: {'lr': 0.0004499020453405388, 'samples': 6210624, 'steps': 32346, 'loss/train': 1.9387210607528687} 11/07/2021 01:48:33 - INFO - __main__ - Step 32348: {'lr': 0.00044989885847840344, 'samples': 6210816, 'steps': 32347, 'loss/train': 1.3974018096923828} 11/07/2021 01:48:35 - INFO - __main__ - Step 32349: {'lr': 0.0004498956715261967, 'samples': 6211008, 'steps': 32348, 'loss/train': 0.2207542210817337} 11/07/2021 01:48:35 - INFO - __main__ - Step 32350: {'lr': 0.00044989248448392007, 'samples': 6211200, 'steps': 32349, 'loss/train': 1.630966305732727} 11/07/2021 01:48:35 - INFO - __main__ - Step 32351: {'lr': 0.000449889297351575, 'samples': 6211392, 'steps': 32350, 'loss/train': 1.0773181915283203} 11/07/2021 01:48:36 - INFO - __main__ - Step 32352: {'lr': 0.0004498861101291628, 'samples': 6211584, 'steps': 32351, 'loss/train': 1.5037585496902466} 11/07/2021 01:48:36 - INFO - __main__ - Step 32353: {'lr': 0.0004498829228166851, 'samples': 6211776, 'steps': 32352, 'loss/train': 1.4931249618530273} 11/07/2021 01:48:37 - INFO - __main__ - Step 32354: {'lr': 0.0004498797354141432, 'samples': 6211968, 'steps': 32353, 'loss/train': 1.459004521369934} 11/07/2021 01:48:37 - INFO - __main__ - Step 32355: {'lr': 0.00044987654792153853, 'samples': 6212160, 'steps': 32354, 'loss/train': 1.4395281076431274} 11/07/2021 01:48:38 - INFO - __main__ - Step 32356: {'lr': 0.0004498733603388726, 'samples': 6212352, 'steps': 32355, 'loss/train': 1.4898239374160767} 11/07/2021 01:48:38 - INFO - __main__ - Step 32357: {'lr': 0.00044987017266614684, 'samples': 6212544, 'steps': 32356, 'loss/train': 1.539486289024353} 11/07/2021 01:48:38 - INFO - __main__ - Step 32358: {'lr': 0.00044986698490336263, 'samples': 6212736, 'steps': 32357, 'loss/train': 0.8823086023330688} 11/07/2021 01:48:39 - INFO - __main__ - Step 32359: {'lr': 0.0004498637970505215, 'samples': 6212928, 'steps': 32358, 'loss/train': 1.9765815734863281} 11/07/2021 01:48:40 - INFO - __main__ - Step 32360: {'lr': 0.0004498606091076248, 'samples': 6213120, 'steps': 32359, 'loss/train': 1.6369905471801758} 11/07/2021 01:48:40 - INFO - __main__ - Step 32361: {'lr': 0.000449857421074674, 'samples': 6213312, 'steps': 32360, 'loss/train': 1.4293484687805176} 11/07/2021 01:48:40 - INFO - __main__ - Step 32362: {'lr': 0.0004498542329516705, 'samples': 6213504, 'steps': 32361, 'loss/train': 1.5993635654449463} 11/07/2021 01:48:41 - INFO - __main__ - Step 32363: {'lr': 0.00044985104473861583, 'samples': 6213696, 'steps': 32362, 'loss/train': 1.6762796640396118} 11/07/2021 01:48:41 - INFO - __main__ - Step 32364: {'lr': 0.0004498478564355113, 'samples': 6213888, 'steps': 32363, 'loss/train': 1.624647617340088} 11/07/2021 01:48:42 - INFO - __main__ - Step 32365: {'lr': 0.0004498446680423584, 'samples': 6214080, 'steps': 32364, 'loss/train': 1.441513180732727} 11/07/2021 01:48:42 - INFO - __main__ - Step 32366: {'lr': 0.0004498414795591586, 'samples': 6214272, 'steps': 32365, 'loss/train': 1.322672963142395} 11/07/2021 01:48:43 - INFO - __main__ - Step 32367: {'lr': 0.00044983829098591336, 'samples': 6214464, 'steps': 32366, 'loss/train': 0.8546894788742065} 11/07/2021 01:48:43 - INFO - __main__ - Step 32368: {'lr': 0.00044983510232262405, 'samples': 6214656, 'steps': 32367, 'loss/train': 1.2581473588943481} 11/07/2021 01:48:43 - INFO - __main__ - Step 32369: {'lr': 0.0004498319135692921, 'samples': 6214848, 'steps': 32368, 'loss/train': 1.3132758140563965} 11/07/2021 01:48:44 - INFO - __main__ - Step 32370: {'lr': 0.00044982872472591897, 'samples': 6215040, 'steps': 32369, 'loss/train': 1.441627025604248} 11/07/2021 01:48:45 - INFO - __main__ - Step 32371: {'lr': 0.00044982553579250606, 'samples': 6215232, 'steps': 32370, 'loss/train': 1.1300324201583862} 11/07/2021 01:48:45 - INFO - __main__ - Step 32372: {'lr': 0.0004498223467690549, 'samples': 6215424, 'steps': 32371, 'loss/train': 1.257175087928772} 11/07/2021 01:48:45 - INFO - __main__ - Step 32373: {'lr': 0.0004498191576555669, 'samples': 6215616, 'steps': 32372, 'loss/train': 0.9374439120292664} 11/07/2021 01:48:46 - INFO - __main__ - Step 32374: {'lr': 0.00044981596845204344, 'samples': 6215808, 'steps': 32373, 'loss/train': 1.484535813331604} 11/07/2021 01:48:47 - INFO - __main__ - Step 32375: {'lr': 0.00044981277915848595, 'samples': 6216000, 'steps': 32374, 'loss/train': 1.8038363456726074} 11/07/2021 01:48:47 - INFO - __main__ - Step 32376: {'lr': 0.00044980958977489593, 'samples': 6216192, 'steps': 32375, 'loss/train': 1.2944344282150269} 11/07/2021 01:48:48 - INFO - __main__ - Step 32377: {'lr': 0.00044980640030127484, 'samples': 6216384, 'steps': 32376, 'loss/train': 1.5786190032958984} 11/07/2021 01:48:48 - INFO - __main__ - Step 32378: {'lr': 0.00044980321073762405, 'samples': 6216576, 'steps': 32377, 'loss/train': 1.259092926979065} 11/07/2021 01:48:48 - INFO - __main__ - Step 32379: {'lr': 0.00044980002108394496, 'samples': 6216768, 'steps': 32378, 'loss/train': 1.260203242301941} 11/07/2021 01:48:49 - INFO - __main__ - Step 32380: {'lr': 0.0004497968313402391, 'samples': 6216960, 'steps': 32379, 'loss/train': 1.7739508152008057} 11/07/2021 01:48:50 - INFO - __main__ - Step 32381: {'lr': 0.00044979364150650794, 'samples': 6217152, 'steps': 32380, 'loss/train': 1.782575011253357} 11/07/2021 01:48:50 - INFO - __main__ - Step 32382: {'lr': 0.00044979045158275273, 'samples': 6217344, 'steps': 32381, 'loss/train': 1.9300806522369385} 11/07/2021 01:48:51 - INFO - __main__ - Step 32383: {'lr': 0.0004497872615689751, 'samples': 6217536, 'steps': 32382, 'loss/train': 0.8333058953285217} 11/07/2021 01:48:51 - INFO - __main__ - Step 32384: {'lr': 0.00044978407146517634, 'samples': 6217728, 'steps': 32383, 'loss/train': 0.957953155040741} 11/07/2021 01:48:52 - INFO - __main__ - Step 32385: {'lr': 0.0004497808812713581, 'samples': 6217920, 'steps': 32384, 'loss/train': 1.0447723865509033} 11/07/2021 01:48:52 - INFO - __main__ - Step 32386: {'lr': 0.00044977769098752154, 'samples': 6218112, 'steps': 32385, 'loss/train': 1.5469779968261719} 11/07/2021 01:48:53 - INFO - __main__ - Step 32387: {'lr': 0.0004497745006136683, 'samples': 6218304, 'steps': 32386, 'loss/train': 1.6424702405929565} 11/07/2021 01:48:53 - INFO - __main__ - Step 32388: {'lr': 0.00044977131014979974, 'samples': 6218496, 'steps': 32387, 'loss/train': 1.2199071645736694} 11/07/2021 01:48:53 - INFO - __main__ - Step 32389: {'lr': 0.0004497681195959173, 'samples': 6218688, 'steps': 32388, 'loss/train': 1.6586096286773682} 11/07/2021 01:48:54 - INFO - __main__ - Step 32390: {'lr': 0.0004497649289520224, 'samples': 6218880, 'steps': 32389, 'loss/train': 0.7270222902297974} 11/07/2021 01:48:55 - INFO - __main__ - Step 32391: {'lr': 0.00044976173821811654, 'samples': 6219072, 'steps': 32390, 'loss/train': 1.5894689559936523} 11/07/2021 01:48:55 - INFO - __main__ - Step 32392: {'lr': 0.0004497585473942011, 'samples': 6219264, 'steps': 32391, 'loss/train': 1.3609628677368164} 11/07/2021 01:48:55 - INFO - __main__ - Step 32393: {'lr': 0.0004497553564802776, 'samples': 6219456, 'steps': 32392, 'loss/train': 1.3615731000900269} 11/07/2021 01:48:56 - INFO - __main__ - Step 32394: {'lr': 0.0004497521654763474, 'samples': 6219648, 'steps': 32393, 'loss/train': 1.7795207500457764} 11/07/2021 01:48:57 - INFO - __main__ - Step 32395: {'lr': 0.0004497489743824119, 'samples': 6219840, 'steps': 32394, 'loss/train': 1.0525424480438232} 11/07/2021 01:48:57 - INFO - __main__ - Step 32396: {'lr': 0.0004497457831984727, 'samples': 6220032, 'steps': 32395, 'loss/train': 1.5898518562316895} 11/07/2021 01:48:57 - INFO - __main__ - Step 32397: {'lr': 0.00044974259192453103, 'samples': 6220224, 'steps': 32396, 'loss/train': 1.645259141921997} 11/07/2021 01:48:58 - INFO - __main__ - Step 32398: {'lr': 0.0004497394005605885, 'samples': 6220416, 'steps': 32397, 'loss/train': 1.6547985076904297} 11/07/2021 01:48:58 - INFO - __main__ - Step 32399: {'lr': 0.00044973620910664645, 'samples': 6220608, 'steps': 32398, 'loss/train': 1.6538923978805542} 11/07/2021 01:48:59 - INFO - __main__ - Step 32400: {'lr': 0.00044973301756270635, 'samples': 6220800, 'steps': 32399, 'loss/train': 1.4757723808288574} 11/07/2021 01:49:00 - INFO - __main__ - Step 32401: {'lr': 0.0004497298259287696, 'samples': 6220992, 'steps': 32400, 'loss/train': 1.810659646987915} 11/07/2021 01:49:00 - INFO - __main__ - Step 32402: {'lr': 0.00044972663420483774, 'samples': 6221184, 'steps': 32401, 'loss/train': 1.176303505897522} 11/07/2021 01:49:00 - INFO - __main__ - Step 32403: {'lr': 0.00044972344239091206, 'samples': 6221376, 'steps': 32402, 'loss/train': 1.7836946249008179} 11/07/2021 01:49:01 - INFO - __main__ - Step 32404: {'lr': 0.0004497202504869941, 'samples': 6221568, 'steps': 32403, 'loss/train': 1.795836091041565} 11/07/2021 01:49:01 - INFO - __main__ - Step 32405: {'lr': 0.0004497170584930853, 'samples': 6221760, 'steps': 32404, 'loss/train': 2.021099090576172} 11/07/2021 01:49:02 - INFO - __main__ - Step 32406: {'lr': 0.0004497138664091871, 'samples': 6221952, 'steps': 32405, 'loss/train': 1.593692421913147} 11/07/2021 01:49:02 - INFO - __main__ - Step 32407: {'lr': 0.00044971067423530087, 'samples': 6222144, 'steps': 32406, 'loss/train': 1.3890024423599243} 11/07/2021 01:49:03 - INFO - __main__ - Step 32408: {'lr': 0.0004497074819714281, 'samples': 6222336, 'steps': 32407, 'loss/train': 1.8967686891555786} 11/07/2021 01:49:03 - INFO - __main__ - Step 32409: {'lr': 0.00044970428961757026, 'samples': 6222528, 'steps': 32408, 'loss/train': 1.5618425607681274} 11/07/2021 01:49:03 - INFO - __main__ - Step 32410: {'lr': 0.00044970109717372864, 'samples': 6222720, 'steps': 32409, 'loss/train': 1.4434159994125366} 11/07/2021 01:49:04 - INFO - __main__ - Step 32411: {'lr': 0.0004496979046399049, 'samples': 6222912, 'steps': 32410, 'loss/train': 0.6913343071937561} 11/07/2021 01:49:05 - INFO - __main__ - Step 32412: {'lr': 0.00044969471201610037, 'samples': 6223104, 'steps': 32411, 'loss/train': 1.7394918203353882} 11/07/2021 01:49:05 - INFO - __main__ - Step 32413: {'lr': 0.00044969151930231643, 'samples': 6223296, 'steps': 32412, 'loss/train': 1.5709692239761353} 11/07/2021 01:49:06 - INFO - __main__ - Step 32414: {'lr': 0.00044968832649855455, 'samples': 6223488, 'steps': 32413, 'loss/train': 1.6851338148117065} 11/07/2021 01:49:06 - INFO - __main__ - Step 32415: {'lr': 0.00044968513360481624, 'samples': 6223680, 'steps': 32414, 'loss/train': 1.2950356006622314} 11/07/2021 01:49:07 - INFO - __main__ - Step 32416: {'lr': 0.0004496819406211029, 'samples': 6223872, 'steps': 32415, 'loss/train': 1.3438385725021362} 11/07/2021 01:49:07 - INFO - __main__ - Step 32417: {'lr': 0.0004496787475474159, 'samples': 6224064, 'steps': 32416, 'loss/train': 1.436164140701294} 11/07/2021 01:49:08 - INFO - __main__ - Step 32418: {'lr': 0.00044967555438375675, 'samples': 6224256, 'steps': 32417, 'loss/train': 1.7283543348312378} 11/07/2021 01:49:08 - INFO - __main__ - Step 32419: {'lr': 0.0004496723611301269, 'samples': 6224448, 'steps': 32418, 'loss/train': 1.2320383787155151} 11/07/2021 01:49:08 - INFO - __main__ - Step 32420: {'lr': 0.00044966916778652776, 'samples': 6224640, 'steps': 32419, 'loss/train': 1.3301286697387695} 11/07/2021 01:49:09 - INFO - __main__ - Step 32421: {'lr': 0.0004496659743529608, 'samples': 6224832, 'steps': 32420, 'loss/train': 1.5266687870025635} 11/07/2021 01:49:10 - INFO - __main__ - Step 32422: {'lr': 0.00044966278082942746, 'samples': 6225024, 'steps': 32421, 'loss/train': 0.8708489537239075} 11/07/2021 01:49:10 - INFO - __main__ - Step 32423: {'lr': 0.000449659587215929, 'samples': 6225216, 'steps': 32422, 'loss/train': 1.3048748970031738} 11/07/2021 01:49:11 - INFO - __main__ - Step 32424: {'lr': 0.0004496563935124672, 'samples': 6225408, 'steps': 32423, 'loss/train': 0.9543754458427429} 11/07/2021 01:49:11 - INFO - __main__ - Step 32425: {'lr': 0.0004496531997190432, 'samples': 6225600, 'steps': 32424, 'loss/train': 1.8037725687026978} 11/07/2021 01:49:11 - INFO - __main__ - Step 32426: {'lr': 0.0004496500058356586, 'samples': 6225792, 'steps': 32425, 'loss/train': 0.3545287847518921} 11/07/2021 01:49:13 - INFO - __main__ - Step 32427: {'lr': 0.00044964681186231473, 'samples': 6225984, 'steps': 32426, 'loss/train': 0.7700253129005432} 11/07/2021 01:49:13 - INFO - __main__ - Step 32428: {'lr': 0.0004496436177990131, 'samples': 6226176, 'steps': 32427, 'loss/train': 1.4964241981506348} 11/07/2021 01:49:13 - INFO - __main__ - Step 32429: {'lr': 0.0004496404236457552, 'samples': 6226368, 'steps': 32428, 'loss/train': 1.5284323692321777} 11/07/2021 01:49:14 - INFO - __main__ - Step 32430: {'lr': 0.0004496372294025424, 'samples': 6226560, 'steps': 32429, 'loss/train': 0.7127718329429626} 11/07/2021 01:49:14 - INFO - __main__ - Step 32431: {'lr': 0.00044963403506937603, 'samples': 6226752, 'steps': 32430, 'loss/train': 2.3373682498931885} 11/07/2021 01:49:15 - INFO - __main__ - Step 32432: {'lr': 0.00044963084064625775, 'samples': 6226944, 'steps': 32431, 'loss/train': 1.677183747291565} 11/07/2021 01:49:15 - INFO - __main__ - Step 32433: {'lr': 0.00044962764613318886, 'samples': 6227136, 'steps': 32432, 'loss/train': 2.08242130279541} 11/07/2021 01:49:16 - INFO - __main__ - Step 32434: {'lr': 0.00044962445153017087, 'samples': 6227328, 'steps': 32433, 'loss/train': 1.7830533981323242} 11/07/2021 01:49:16 - INFO - __main__ - Step 32435: {'lr': 0.00044962125683720513, 'samples': 6227520, 'steps': 32434, 'loss/train': 1.3276549577713013} 11/07/2021 01:49:16 - INFO - __main__ - Step 32436: {'lr': 0.0004496180620542931, 'samples': 6227712, 'steps': 32435, 'loss/train': 1.2223323583602905} 11/07/2021 01:49:17 - INFO - __main__ - Step 32437: {'lr': 0.00044961486718143634, 'samples': 6227904, 'steps': 32436, 'loss/train': 1.3821481466293335} 11/07/2021 01:49:18 - INFO - __main__ - Step 32438: {'lr': 0.0004496116722186362, 'samples': 6228096, 'steps': 32437, 'loss/train': 1.492997169494629} 11/07/2021 01:49:18 - INFO - __main__ - Step 32439: {'lr': 0.00044960847716589403, 'samples': 6228288, 'steps': 32438, 'loss/train': 1.5349807739257812} 11/07/2021 01:49:18 - INFO - __main__ - Step 32440: {'lr': 0.00044960528202321143, 'samples': 6228480, 'steps': 32439, 'loss/train': 1.3973376750946045} 11/07/2021 01:49:19 - INFO - __main__ - Step 32441: {'lr': 0.0004496020867905898, 'samples': 6228672, 'steps': 32440, 'loss/train': 1.4841481447219849} 11/07/2021 01:49:19 - INFO - __main__ - Step 32442: {'lr': 0.00044959889146803047, 'samples': 6228864, 'steps': 32441, 'loss/train': 1.9865716695785522} 11/07/2021 01:49:20 - INFO - __main__ - Step 32443: {'lr': 0.00044959569605553494, 'samples': 6229056, 'steps': 32442, 'loss/train': 1.5784991979599} 11/07/2021 01:49:21 - INFO - __main__ - Step 32444: {'lr': 0.00044959250055310473, 'samples': 6229248, 'steps': 32443, 'loss/train': 0.6850621700286865} 11/07/2021 01:49:21 - INFO - __main__ - Step 32445: {'lr': 0.00044958930496074125, 'samples': 6229440, 'steps': 32444, 'loss/train': 0.20532159507274628} 11/07/2021 01:49:21 - INFO - __main__ - Step 32446: {'lr': 0.0004495861092784459, 'samples': 6229632, 'steps': 32445, 'loss/train': 1.6461862325668335} 11/07/2021 01:49:22 - INFO - __main__ - Step 32447: {'lr': 0.00044958291350622007, 'samples': 6229824, 'steps': 32446, 'loss/train': 1.5703867673873901} 11/07/2021 01:49:23 - INFO - __main__ - Step 32448: {'lr': 0.0004495797176440653, 'samples': 6230016, 'steps': 32447, 'loss/train': 1.3373690843582153} 11/07/2021 01:49:23 - INFO - __main__ - Step 32449: {'lr': 0.000449576521691983, 'samples': 6230208, 'steps': 32448, 'loss/train': 1.449952244758606} 11/07/2021 01:49:23 - INFO - __main__ - Step 32450: {'lr': 0.00044957332564997453, 'samples': 6230400, 'steps': 32449, 'loss/train': 1.6825079917907715} 11/07/2021 01:49:24 - INFO - __main__ - Step 32451: {'lr': 0.0004495701295180414, 'samples': 6230592, 'steps': 32450, 'loss/train': 1.9212239980697632} 11/07/2021 01:49:24 - INFO - __main__ - Step 32452: {'lr': 0.0004495669332961852, 'samples': 6230784, 'steps': 32451, 'loss/train': 1.3632677793502808} 11/07/2021 01:49:25 - INFO - __main__ - Step 32453: {'lr': 0.0004495637369844071, 'samples': 6230976, 'steps': 32452, 'loss/train': 1.0895562171936035} 11/07/2021 01:49:25 - INFO - __main__ - Step 32454: {'lr': 0.0004495605405827087, 'samples': 6231168, 'steps': 32453, 'loss/train': 1.1377387046813965} 11/07/2021 01:49:26 - INFO - __main__ - Step 32455: {'lr': 0.00044955734409109135, 'samples': 6231360, 'steps': 32454, 'loss/train': 1.732886791229248} 11/07/2021 01:49:26 - INFO - __main__ - Step 32456: {'lr': 0.0004495541475095566, 'samples': 6231552, 'steps': 32455, 'loss/train': 1.8484389781951904} 11/07/2021 01:49:26 - INFO - __main__ - Step 32457: {'lr': 0.0004495509508381058, 'samples': 6231744, 'steps': 32456, 'loss/train': 1.1795073747634888} 11/07/2021 01:49:27 - INFO - __main__ - Step 32458: {'lr': 0.00044954775407674035, 'samples': 6231936, 'steps': 32457, 'loss/train': 1.8034723997116089} 11/07/2021 01:49:28 - INFO - __main__ - Step 32459: {'lr': 0.00044954455722546186, 'samples': 6232128, 'steps': 32458, 'loss/train': 1.6304291486740112} 11/07/2021 01:49:28 - INFO - __main__ - Step 32460: {'lr': 0.0004495413602842716, 'samples': 6232320, 'steps': 32459, 'loss/train': 1.134009838104248} 11/07/2021 01:49:28 - INFO - __main__ - Step 32461: {'lr': 0.00044953816325317116, 'samples': 6232512, 'steps': 32460, 'loss/train': 1.4539257287979126} 11/07/2021 01:49:29 - INFO - __main__ - Step 32462: {'lr': 0.0004495349661321618, 'samples': 6232704, 'steps': 32461, 'loss/train': 1.4682179689407349} 11/07/2021 01:49:30 - INFO - __main__ - Step 32463: {'lr': 0.0004495317689212452, 'samples': 6232896, 'steps': 32462, 'loss/train': 1.2072484493255615} 11/07/2021 01:49:30 - INFO - __main__ - Step 32464: {'lr': 0.0004495285716204226, 'samples': 6233088, 'steps': 32463, 'loss/train': 1.7558974027633667} 11/07/2021 01:49:30 - INFO - __main__ - Step 32465: {'lr': 0.00044952537422969545, 'samples': 6233280, 'steps': 32464, 'loss/train': 1.4803388118743896} 11/07/2021 01:49:31 - INFO - __main__ - Step 32466: {'lr': 0.0004495221767490653, 'samples': 6233472, 'steps': 32465, 'loss/train': 1.3486028909683228} 11/07/2021 01:49:31 - INFO - __main__ - Step 32467: {'lr': 0.00044951897917853355, 'samples': 6233664, 'steps': 32466, 'loss/train': 1.2698243856430054} 11/07/2021 01:49:31 - INFO - __main__ - Step 32468: {'lr': 0.0004495157815181016, 'samples': 6233856, 'steps': 32467, 'loss/train': 1.788853645324707} 11/07/2021 01:49:33 - INFO - __main__ - Step 32469: {'lr': 0.00044951258376777094, 'samples': 6234048, 'steps': 32468, 'loss/train': 0.866815447807312} 11/07/2021 01:49:33 - INFO - __main__ - Step 32470: {'lr': 0.00044950938592754297, 'samples': 6234240, 'steps': 32469, 'loss/train': 1.0849798917770386} 11/07/2021 01:49:33 - INFO - __main__ - Step 32471: {'lr': 0.00044950618799741913, 'samples': 6234432, 'steps': 32470, 'loss/train': 1.2643122673034668} 11/07/2021 01:49:34 - INFO - __main__ - Step 32472: {'lr': 0.0004495029899774009, 'samples': 6234624, 'steps': 32471, 'loss/train': 1.5671745538711548} 11/07/2021 01:49:34 - INFO - __main__ - Step 32473: {'lr': 0.00044949979186748967, 'samples': 6234816, 'steps': 32472, 'loss/train': 1.7395416498184204} 11/07/2021 01:49:35 - INFO - __main__ - Step 32474: {'lr': 0.00044949659366768697, 'samples': 6235008, 'steps': 32473, 'loss/train': 1.6390635967254639} 11/07/2021 01:49:36 - INFO - __main__ - Step 32475: {'lr': 0.00044949339537799415, 'samples': 6235200, 'steps': 32474, 'loss/train': 1.600160002708435} 11/07/2021 01:49:36 - INFO - __main__ - Step 32476: {'lr': 0.0004494901969984127, 'samples': 6235392, 'steps': 32475, 'loss/train': 1.9038304090499878} 11/07/2021 01:49:36 - INFO - __main__ - Step 32477: {'lr': 0.000449486998528944, 'samples': 6235584, 'steps': 32476, 'loss/train': 1.197152853012085} 11/07/2021 01:49:37 - INFO - __main__ - Step 32478: {'lr': 0.00044948379996958963, 'samples': 6235776, 'steps': 32477, 'loss/train': 1.706484317779541} 11/07/2021 01:49:38 - INFO - __main__ - Step 32479: {'lr': 0.00044948060132035087, 'samples': 6235968, 'steps': 32478, 'loss/train': 1.5682185888290405} 11/07/2021 01:49:38 - INFO - __main__ - Step 32480: {'lr': 0.00044947740258122925, 'samples': 6236160, 'steps': 32479, 'loss/train': 1.4988538026809692} 11/07/2021 01:49:38 - INFO - __main__ - Step 32481: {'lr': 0.00044947420375222614, 'samples': 6236352, 'steps': 32480, 'loss/train': 1.2382683753967285} 11/07/2021 01:49:39 - INFO - __main__ - Step 32482: {'lr': 0.00044947100483334315, 'samples': 6236544, 'steps': 32481, 'loss/train': 1.956660509109497} 11/07/2021 01:49:39 - INFO - __main__ - Step 32483: {'lr': 0.0004494678058245815, 'samples': 6236736, 'steps': 32482, 'loss/train': 1.8256279230117798} 11/07/2021 01:49:40 - INFO - __main__ - Step 32484: {'lr': 0.00044946460672594277, 'samples': 6236928, 'steps': 32483, 'loss/train': 1.758445143699646} 11/07/2021 01:49:41 - INFO - __main__ - Step 32485: {'lr': 0.0004494614075374283, 'samples': 6237120, 'steps': 32484, 'loss/train': 1.2483205795288086} 11/07/2021 01:49:41 - INFO - __main__ - Step 32486: {'lr': 0.0004494582082590397, 'samples': 6237312, 'steps': 32485, 'loss/train': 1.7636712789535522} 11/07/2021 01:49:41 - INFO - __main__ - Step 32487: {'lr': 0.0004494550088907783, 'samples': 6237504, 'steps': 32486, 'loss/train': 1.5964031219482422} 11/07/2021 01:49:42 - INFO - __main__ - Step 32488: {'lr': 0.00044945180943264544, 'samples': 6237696, 'steps': 32487, 'loss/train': 1.6114511489868164} 11/07/2021 01:49:43 - INFO - __main__ - Step 32489: {'lr': 0.00044944860988464276, 'samples': 6237888, 'steps': 32488, 'loss/train': 1.499119520187378} 11/07/2021 01:49:43 - INFO - __main__ - Step 32490: {'lr': 0.0004494454102467716, 'samples': 6238080, 'steps': 32489, 'loss/train': 1.1793768405914307} 11/07/2021 01:49:43 - INFO - __main__ - Step 32491: {'lr': 0.00044944221051903345, 'samples': 6238272, 'steps': 32490, 'loss/train': 1.6606731414794922} 11/07/2021 01:49:44 - INFO - __main__ - Step 32492: {'lr': 0.0004494390107014297, 'samples': 6238464, 'steps': 32491, 'loss/train': 1.3472111225128174} 11/07/2021 01:49:44 - INFO - __main__ - Step 32493: {'lr': 0.0004494358107939618, 'samples': 6238656, 'steps': 32492, 'loss/train': 1.0546207427978516} 11/07/2021 01:49:44 - INFO - __main__ - Step 32494: {'lr': 0.0004494326107966311, 'samples': 6238848, 'steps': 32493, 'loss/train': 1.3394577503204346} 11/07/2021 01:49:46 - INFO - __main__ - Step 32495: {'lr': 0.0004494294107094393, 'samples': 6239040, 'steps': 32494, 'loss/train': 1.860493779182434} 11/07/2021 01:49:46 - INFO - __main__ - Step 32496: {'lr': 0.00044942621053238764, 'samples': 6239232, 'steps': 32495, 'loss/train': 1.495239019393921} 11/07/2021 01:49:46 - INFO - __main__ - Step 32497: {'lr': 0.00044942301026547755, 'samples': 6239424, 'steps': 32496, 'loss/train': 1.4385201930999756} 11/07/2021 01:49:47 - INFO - __main__ - Step 32498: {'lr': 0.0004494198099087106, 'samples': 6239616, 'steps': 32497, 'loss/train': 1.2754056453704834} 11/07/2021 01:49:47 - INFO - __main__ - Step 32499: {'lr': 0.00044941660946208806, 'samples': 6239808, 'steps': 32498, 'loss/train': 0.22919543087482452} 11/07/2021 01:49:48 - INFO - __main__ - Step 32500: {'lr': 0.00044941340892561154, 'samples': 6240000, 'steps': 32499, 'loss/train': 1.4570839405059814} 11/07/2021 01:49:48 - INFO - __main__ - Step 32501: {'lr': 0.00044941020829928247, 'samples': 6240192, 'steps': 32500, 'loss/train': 1.5001204013824463} 11/07/2021 01:49:49 - INFO - __main__ - Step 32502: {'lr': 0.00044940700758310214, 'samples': 6240384, 'steps': 32501, 'loss/train': 1.2210010290145874} 11/07/2021 01:49:49 - INFO - __main__ - Step 32503: {'lr': 0.00044940380677707214, 'samples': 6240576, 'steps': 32502, 'loss/train': 0.6593044400215149} 11/07/2021 01:49:49 - INFO - __main__ - Step 32504: {'lr': 0.00044940060588119393, 'samples': 6240768, 'steps': 32503, 'loss/train': 1.254817008972168} 11/07/2021 01:49:50 - INFO - __main__ - Step 32505: {'lr': 0.00044939740489546875, 'samples': 6240960, 'steps': 32504, 'loss/train': 1.3713253736495972} 11/07/2021 01:49:51 - INFO - __main__ - Step 32506: {'lr': 0.0004493942038198983, 'samples': 6241152, 'steps': 32505, 'loss/train': 1.299810528755188} 11/07/2021 01:49:51 - INFO - __main__ - Step 32507: {'lr': 0.0004493910026544838, 'samples': 6241344, 'steps': 32506, 'loss/train': 1.505722999572754} 11/07/2021 01:49:51 - INFO - __main__ - Step 32508: {'lr': 0.0004493878013992268, 'samples': 6241536, 'steps': 32507, 'loss/train': 1.458335518836975} 11/07/2021 01:49:52 - INFO - __main__ - Step 32509: {'lr': 0.0004493846000541287, 'samples': 6241728, 'steps': 32508, 'loss/train': 1.7668204307556152} 11/07/2021 01:49:53 - INFO - __main__ - Step 32510: {'lr': 0.00044938139861919115, 'samples': 6241920, 'steps': 32509, 'loss/train': 1.238909363746643} 11/07/2021 01:49:53 - INFO - __main__ - Step 32511: {'lr': 0.00044937819709441523, 'samples': 6242112, 'steps': 32510, 'loss/train': 1.5238277912139893} 11/07/2021 01:49:53 - INFO - __main__ - Step 32512: {'lr': 0.00044937499547980265, 'samples': 6242304, 'steps': 32511, 'loss/train': 1.2921862602233887} 11/07/2021 01:49:54 - INFO - __main__ - Step 32513: {'lr': 0.00044937179377535475, 'samples': 6242496, 'steps': 32512, 'loss/train': 1.3551661968231201} 11/07/2021 01:49:54 - INFO - __main__ - Step 32514: {'lr': 0.00044936859198107306, 'samples': 6242688, 'steps': 32513, 'loss/train': 1.663925051689148} 11/07/2021 01:49:55 - INFO - __main__ - Step 32515: {'lr': 0.0004493653900969589, 'samples': 6242880, 'steps': 32514, 'loss/train': 1.4537755250930786} 11/07/2021 01:49:55 - INFO - __main__ - Step 32516: {'lr': 0.0004493621881230138, 'samples': 6243072, 'steps': 32515, 'loss/train': 1.0435876846313477} 11/07/2021 01:49:56 - INFO - __main__ - Step 32517: {'lr': 0.00044935898605923916, 'samples': 6243264, 'steps': 32516, 'loss/train': 1.5774314403533936} 11/07/2021 01:49:56 - INFO - __main__ - Step 32518: {'lr': 0.0004493557839056364, 'samples': 6243456, 'steps': 32517, 'loss/train': 1.5218448638916016} 11/07/2021 01:49:56 - INFO - __main__ - Step 32519: {'lr': 0.00044935258166220704, 'samples': 6243648, 'steps': 32518, 'loss/train': 1.6474796533584595} 11/07/2021 01:49:58 - INFO - __main__ - Step 32520: {'lr': 0.00044934937932895246, 'samples': 6243840, 'steps': 32519, 'loss/train': 1.3461508750915527} 11/07/2021 01:49:58 - INFO - __main__ - Step 32521: {'lr': 0.0004493461769058742, 'samples': 6244032, 'steps': 32520, 'loss/train': 1.253962516784668} 11/07/2021 01:49:59 - INFO - __main__ - Step 32522: {'lr': 0.00044934297439297357, 'samples': 6244224, 'steps': 32521, 'loss/train': 0.16634128987789154} 11/07/2021 01:49:59 - INFO - __main__ - Step 32523: {'lr': 0.0004493397717902521, 'samples': 6244416, 'steps': 32522, 'loss/train': 1.3674840927124023} 11/07/2021 01:49:59 - INFO - __main__ - Step 32524: {'lr': 0.00044933656909771117, 'samples': 6244608, 'steps': 32523, 'loss/train': 1.438563585281372} 11/07/2021 01:50:00 - INFO - __main__ - Step 32525: {'lr': 0.00044933336631535224, 'samples': 6244800, 'steps': 32524, 'loss/train': 1.3042861223220825} 11/07/2021 01:50:01 - INFO - __main__ - Step 32526: {'lr': 0.0004493301634431768, 'samples': 6244992, 'steps': 32525, 'loss/train': 1.6236354112625122} 11/07/2021 01:50:01 - INFO - __main__ - Step 32527: {'lr': 0.0004493269604811863, 'samples': 6245184, 'steps': 32526, 'loss/train': 1.4350732564926147} 11/07/2021 01:50:01 - INFO - __main__ - Step 32528: {'lr': 0.000449323757429382, 'samples': 6245376, 'steps': 32527, 'loss/train': 1.4659003019332886} 11/07/2021 01:50:02 - INFO - __main__ - Step 32529: {'lr': 0.00044932055428776566, 'samples': 6245568, 'steps': 32528, 'loss/train': 1.1015076637268066} 11/07/2021 01:50:02 - INFO - __main__ - Step 32530: {'lr': 0.00044931735105633853, 'samples': 6245760, 'steps': 32529, 'loss/train': 1.3505935668945312} 11/07/2021 01:50:03 - INFO - __main__ - Step 32531: {'lr': 0.00044931414773510207, 'samples': 6245952, 'steps': 32530, 'loss/train': 1.467262864112854} 11/07/2021 01:50:04 - INFO - __main__ - Step 32532: {'lr': 0.00044931094432405766, 'samples': 6246144, 'steps': 32531, 'loss/train': 1.387347936630249} 11/07/2021 01:50:04 - INFO - __main__ - Step 32533: {'lr': 0.00044930774082320684, 'samples': 6246336, 'steps': 32532, 'loss/train': 1.7457025051116943} 11/07/2021 01:50:04 - INFO - __main__ - Step 32534: {'lr': 0.00044930453723255107, 'samples': 6246528, 'steps': 32533, 'loss/train': 1.3946459293365479} 11/07/2021 01:50:05 - INFO - __main__ - Step 32535: {'lr': 0.0004493013335520917, 'samples': 6246720, 'steps': 32534, 'loss/train': 1.4471338987350464} 11/07/2021 01:50:06 - INFO - __main__ - Step 32536: {'lr': 0.00044929812978183024, 'samples': 6246912, 'steps': 32535, 'loss/train': 1.768898606300354} 11/07/2021 01:50:06 - INFO - __main__ - Step 32537: {'lr': 0.0004492949259217681, 'samples': 6247104, 'steps': 32536, 'loss/train': 1.1240431070327759} 11/07/2021 01:50:06 - INFO - __main__ - Step 32538: {'lr': 0.00044929172197190684, 'samples': 6247296, 'steps': 32537, 'loss/train': 1.7329168319702148} 11/07/2021 01:50:07 - INFO - __main__ - Step 32539: {'lr': 0.00044928851793224765, 'samples': 6247488, 'steps': 32538, 'loss/train': 1.605699896812439} 11/07/2021 01:50:07 - INFO - __main__ - Step 32540: {'lr': 0.00044928531380279224, 'samples': 6247680, 'steps': 32539, 'loss/train': 1.6163506507873535} 11/07/2021 01:50:09 - INFO - __main__ - Step 32541: {'lr': 0.00044928210958354196, 'samples': 6247872, 'steps': 32540, 'loss/train': 1.6673510074615479} 11/07/2021 01:50:09 - INFO - __main__ - Step 32542: {'lr': 0.0004492789052744982, 'samples': 6248064, 'steps': 32541, 'loss/train': 1.6779664754867554} 11/07/2021 01:50:10 - INFO - __main__ - Step 32543: {'lr': 0.0004492757008756624, 'samples': 6248256, 'steps': 32542, 'loss/train': 1.5406057834625244} 11/07/2021 01:50:10 - INFO - __main__ - Step 32544: {'lr': 0.0004492724963870361, 'samples': 6248448, 'steps': 32543, 'loss/train': 1.3281011581420898} 11/07/2021 01:50:10 - INFO - __main__ - Step 32545: {'lr': 0.00044926929180862064, 'samples': 6248640, 'steps': 32544, 'loss/train': 1.0876872539520264} 11/07/2021 01:50:11 - INFO - __main__ - Step 32546: {'lr': 0.00044926608714041763, 'samples': 6248832, 'steps': 32545, 'loss/train': 1.8412197828292847} 11/07/2021 01:50:11 - INFO - __main__ - Step 32547: {'lr': 0.0004492628823824282, 'samples': 6249024, 'steps': 32546, 'loss/train': 1.2169973850250244} 11/07/2021 01:50:12 - INFO - __main__ - Step 32548: {'lr': 0.0004492596775346541, 'samples': 6249216, 'steps': 32547, 'loss/train': 1.6807281970977783} 11/07/2021 01:50:13 - INFO - __main__ - Step 32549: {'lr': 0.0004492564725970967, 'samples': 6249408, 'steps': 32548, 'loss/train': 1.6871615648269653} 11/07/2021 01:50:13 - INFO - __main__ - Step 32550: {'lr': 0.00044925326756975736, 'samples': 6249600, 'steps': 32549, 'loss/train': 1.5211896896362305} 11/07/2021 01:50:13 - INFO - __main__ - Step 32551: {'lr': 0.00044925006245263757, 'samples': 6249792, 'steps': 32550, 'loss/train': 1.6064367294311523} 11/07/2021 01:50:14 - INFO - __main__ - Step 32552: {'lr': 0.0004492468572457388, 'samples': 6249984, 'steps': 32551, 'loss/train': 1.7926182746887207} 11/07/2021 01:50:14 - INFO - __main__ - Step 32553: {'lr': 0.0004492436519490625, 'samples': 6250176, 'steps': 32552, 'loss/train': 1.2677383422851562} 11/07/2021 01:50:15 - INFO - __main__ - Step 32554: {'lr': 0.00044924044656260997, 'samples': 6250368, 'steps': 32553, 'loss/train': 1.7775638103485107} 11/07/2021 01:50:15 - INFO - __main__ - Step 32555: {'lr': 0.00044923724108638285, 'samples': 6250560, 'steps': 32554, 'loss/train': 1.6647224426269531} 11/07/2021 01:50:16 - INFO - __main__ - Step 32556: {'lr': 0.00044923403552038255, 'samples': 6250752, 'steps': 32555, 'loss/train': 1.6538268327713013} 11/07/2021 01:50:16 - INFO - __main__ - Step 32557: {'lr': 0.0004492308298646104, 'samples': 6250944, 'steps': 32556, 'loss/train': 0.9958004355430603} 11/07/2021 01:50:16 - INFO - __main__ - Step 32558: {'lr': 0.0004492276241190679, 'samples': 6251136, 'steps': 32557, 'loss/train': 1.4049321413040161} 11/07/2021 01:50:17 - INFO - __main__ - Step 32559: {'lr': 0.0004492244182837565, 'samples': 6251328, 'steps': 32558, 'loss/train': 1.5362626314163208} 11/07/2021 01:50:18 - INFO - __main__ - Step 32560: {'lr': 0.00044922121235867776, 'samples': 6251520, 'steps': 32559, 'loss/train': 1.790116548538208} 11/07/2021 01:50:18 - INFO - __main__ - Step 32561: {'lr': 0.00044921800634383294, 'samples': 6251712, 'steps': 32560, 'loss/train': 2.2426087856292725} 11/07/2021 01:50:18 - INFO - __main__ - Step 32562: {'lr': 0.0004492148002392235, 'samples': 6251904, 'steps': 32561, 'loss/train': 1.8030542135238647} 11/07/2021 01:50:19 - INFO - __main__ - Step 32563: {'lr': 0.000449211594044851, 'samples': 6252096, 'steps': 32562, 'loss/train': 1.7514045238494873} 11/07/2021 01:50:20 - INFO - __main__ - Step 32564: {'lr': 0.0004492083877607168, 'samples': 6252288, 'steps': 32563, 'loss/train': 0.4884999394416809} 11/07/2021 01:50:20 - INFO - __main__ - Step 32565: {'lr': 0.00044920518138682244, 'samples': 6252480, 'steps': 32564, 'loss/train': 1.502118468284607} 11/07/2021 01:50:21 - INFO - __main__ - Step 32566: {'lr': 0.00044920197492316925, 'samples': 6252672, 'steps': 32565, 'loss/train': 1.3578180074691772} 11/07/2021 01:50:21 - INFO - __main__ - Step 32567: {'lr': 0.00044919876836975876, 'samples': 6252864, 'steps': 32566, 'loss/train': 1.298283338546753} 11/07/2021 01:50:21 - INFO - __main__ - Step 32568: {'lr': 0.0004491955617265924, 'samples': 6253056, 'steps': 32567, 'loss/train': 1.2520887851715088} 11/07/2021 01:50:22 - INFO - __main__ - Step 32569: {'lr': 0.0004491923549936715, 'samples': 6253248, 'steps': 32568, 'loss/train': 2.1562204360961914} 11/07/2021 01:50:23 - INFO - __main__ - Step 32570: {'lr': 0.0004491891481709977, 'samples': 6253440, 'steps': 32569, 'loss/train': 0.5885365605354309} 11/07/2021 01:50:23 - INFO - __main__ - Step 32571: {'lr': 0.0004491859412585723, 'samples': 6253632, 'steps': 32570, 'loss/train': 1.0534669160842896} 11/07/2021 01:50:23 - INFO - __main__ - Step 32572: {'lr': 0.0004491827342563968, 'samples': 6253824, 'steps': 32571, 'loss/train': 1.350450038909912} 11/07/2021 01:50:24 - INFO - __main__ - Step 32573: {'lr': 0.0004491795271644726, 'samples': 6254016, 'steps': 32572, 'loss/train': 1.6586860418319702} 11/07/2021 01:50:25 - INFO - __main__ - Step 32574: {'lr': 0.0004491763199828012, 'samples': 6254208, 'steps': 32573, 'loss/train': 1.730752944946289} 11/07/2021 01:50:25 - INFO - __main__ - Step 32575: {'lr': 0.00044917311271138393, 'samples': 6254400, 'steps': 32574, 'loss/train': 2.4597697257995605} 11/07/2021 01:50:25 - INFO - __main__ - Step 32576: {'lr': 0.00044916990535022244, 'samples': 6254592, 'steps': 32575, 'loss/train': 1.6362158060073853} 11/07/2021 01:50:26 - INFO - __main__ - Step 32577: {'lr': 0.00044916669789931806, 'samples': 6254784, 'steps': 32576, 'loss/train': 1.377771258354187} 11/07/2021 01:50:26 - INFO - __main__ - Step 32578: {'lr': 0.0004491634903586722, 'samples': 6254976, 'steps': 32577, 'loss/train': 2.027752637863159} 11/07/2021 01:50:27 - INFO - __main__ - Step 32579: {'lr': 0.00044916028272828636, 'samples': 6255168, 'steps': 32578, 'loss/train': 1.014981985092163} 11/07/2021 01:50:28 - INFO - __main__ - Step 32580: {'lr': 0.00044915707500816206, 'samples': 6255360, 'steps': 32579, 'loss/train': 1.317042589187622} 11/07/2021 01:50:28 - INFO - __main__ - Step 32581: {'lr': 0.0004491538671983005, 'samples': 6255552, 'steps': 32580, 'loss/train': 1.6304385662078857} 11/07/2021 01:50:28 - INFO - __main__ - Step 32582: {'lr': 0.00044915065929870335, 'samples': 6255744, 'steps': 32581, 'loss/train': 1.2867671251296997} 11/07/2021 01:50:29 - INFO - __main__ - Step 32583: {'lr': 0.00044914745130937204, 'samples': 6255936, 'steps': 32582, 'loss/train': 1.5918105840682983} 11/07/2021 01:50:29 - INFO - __main__ - Step 32584: {'lr': 0.0004491442432303079, 'samples': 6256128, 'steps': 32583, 'loss/train': 1.6612629890441895} 11/07/2021 01:50:30 - INFO - __main__ - Step 32585: {'lr': 0.0004491410350615124, 'samples': 6256320, 'steps': 32584, 'loss/train': 1.5205086469650269} 11/07/2021 01:50:30 - INFO - __main__ - Step 32586: {'lr': 0.0004491378268029871, 'samples': 6256512, 'steps': 32585, 'loss/train': 1.4549225568771362} 11/07/2021 01:50:31 - INFO - __main__ - Step 32587: {'lr': 0.00044913461845473335, 'samples': 6256704, 'steps': 32586, 'loss/train': 1.4229557514190674} 11/07/2021 01:50:31 - INFO - __main__ - Step 32588: {'lr': 0.0004491314100167526, 'samples': 6256896, 'steps': 32587, 'loss/train': 1.5734999179840088} 11/07/2021 01:50:32 - INFO - __main__ - Step 32589: {'lr': 0.00044912820148904634, 'samples': 6257088, 'steps': 32588, 'loss/train': 1.9351128339767456} 11/07/2021 01:50:33 - INFO - __main__ - Step 32590: {'lr': 0.0004491249928716159, 'samples': 6257280, 'steps': 32589, 'loss/train': 1.3977437019348145} 11/07/2021 01:50:33 - INFO - __main__ - Step 32591: {'lr': 0.0004491217841644629, 'samples': 6257472, 'steps': 32590, 'loss/train': 1.452478051185608} 11/07/2021 01:50:33 - INFO - __main__ - Step 32592: {'lr': 0.0004491185753675886, 'samples': 6257664, 'steps': 32591, 'loss/train': 1.539903163909912} 11/07/2021 01:50:34 - INFO - __main__ - Step 32593: {'lr': 0.0004491153664809947, 'samples': 6257856, 'steps': 32592, 'loss/train': 2.1112542152404785} 11/07/2021 01:50:34 - INFO - __main__ - Step 32594: {'lr': 0.00044911215750468236, 'samples': 6258048, 'steps': 32593, 'loss/train': 0.5317099094390869} 11/07/2021 01:50:35 - INFO - __main__ - Step 32595: {'lr': 0.0004491089484386531, 'samples': 6258240, 'steps': 32594, 'loss/train': 1.503760814666748} 11/07/2021 01:50:36 - INFO - __main__ - Step 32596: {'lr': 0.0004491057392829086, 'samples': 6258432, 'steps': 32595, 'loss/train': 1.5326488018035889} 11/07/2021 01:50:36 - INFO - __main__ - Step 32597: {'lr': 0.00044910253003745007, 'samples': 6258624, 'steps': 32596, 'loss/train': 1.3503015041351318} 11/07/2021 01:50:36 - INFO - __main__ - Step 32598: {'lr': 0.00044909932070227887, 'samples': 6258816, 'steps': 32597, 'loss/train': 2.20670485496521} 11/07/2021 01:50:37 - INFO - __main__ - Step 32599: {'lr': 0.00044909611127739676, 'samples': 6259008, 'steps': 32598, 'loss/train': 1.7348023653030396} 11/07/2021 01:50:38 - INFO - __main__ - Step 32600: {'lr': 0.00044909290176280495, 'samples': 6259200, 'steps': 32599, 'loss/train': 0.8739935159683228} 11/07/2021 01:50:38 - INFO - __main__ - Step 32601: {'lr': 0.00044908969215850495, 'samples': 6259392, 'steps': 32600, 'loss/train': 1.4008625745773315} 11/07/2021 01:50:38 - INFO - __main__ - Step 32602: {'lr': 0.0004490864824644982, 'samples': 6259584, 'steps': 32601, 'loss/train': 1.0510605573654175} 11/07/2021 01:50:39 - INFO - __main__ - Step 32603: {'lr': 0.0004490832726807862, 'samples': 6259776, 'steps': 32602, 'loss/train': 1.8836473226547241} 11/07/2021 01:50:39 - INFO - __main__ - Step 32604: {'lr': 0.0004490800628073703, 'samples': 6259968, 'steps': 32603, 'loss/train': 0.8565096855163574} 11/07/2021 01:50:40 - INFO - __main__ - Step 32605: {'lr': 0.000449076852844252, 'samples': 6260160, 'steps': 32604, 'loss/train': 1.799138069152832} 11/07/2021 01:50:41 - INFO - __main__ - Step 32606: {'lr': 0.0004490736427914327, 'samples': 6260352, 'steps': 32605, 'loss/train': 1.4113820791244507} 11/07/2021 01:50:41 - INFO - __main__ - Step 32607: {'lr': 0.000449070432648914, 'samples': 6260544, 'steps': 32606, 'loss/train': 2.0013928413391113} 11/07/2021 01:50:41 - INFO - __main__ - Step 32608: {'lr': 0.0004490672224166972, 'samples': 6260736, 'steps': 32607, 'loss/train': 1.7255065441131592} 11/07/2021 01:50:42 - INFO - __main__ - Step 32609: {'lr': 0.00044906401209478367, 'samples': 6260928, 'steps': 32608, 'loss/train': 1.8993556499481201} 11/07/2021 01:50:43 - INFO - __main__ - Step 32610: {'lr': 0.00044906080168317507, 'samples': 6261120, 'steps': 32609, 'loss/train': 1.6636123657226562} 11/07/2021 01:50:43 - INFO - __main__ - Step 32611: {'lr': 0.0004490575911818727, 'samples': 6261312, 'steps': 32610, 'loss/train': 1.6317161321640015} 11/07/2021 01:50:43 - INFO - __main__ - Step 32612: {'lr': 0.0004490543805908781, 'samples': 6261504, 'steps': 32611, 'loss/train': 1.4363155364990234} 11/07/2021 01:50:44 - INFO - __main__ - Step 32613: {'lr': 0.00044905116991019264, 'samples': 6261696, 'steps': 32612, 'loss/train': 1.5295449495315552} 11/07/2021 01:50:44 - INFO - __main__ - Step 32614: {'lr': 0.00044904795913981775, 'samples': 6261888, 'steps': 32613, 'loss/train': 1.0661262273788452} 11/07/2021 01:50:45 - INFO - __main__ - Step 32615: {'lr': 0.00044904474827975506, 'samples': 6262080, 'steps': 32614, 'loss/train': 1.8657482862472534} 11/07/2021 01:50:45 - INFO - __main__ - Step 32616: {'lr': 0.00044904153733000575, 'samples': 6262272, 'steps': 32615, 'loss/train': 2.3484842777252197} 11/07/2021 01:50:46 - INFO - __main__ - Step 32617: {'lr': 0.0004490383262905714, 'samples': 6262464, 'steps': 32616, 'loss/train': 1.5342575311660767} 11/07/2021 01:50:46 - INFO - __main__ - Step 32618: {'lr': 0.00044903511516145353, 'samples': 6262656, 'steps': 32617, 'loss/train': 1.6417678594589233} 11/07/2021 01:50:47 - INFO - __main__ - Step 32619: {'lr': 0.0004490319039426535, 'samples': 6262848, 'steps': 32618, 'loss/train': 1.7578563690185547} 11/07/2021 01:50:47 - INFO - __main__ - Step 32620: {'lr': 0.0004490286926341727, 'samples': 6263040, 'steps': 32619, 'loss/train': 1.2401134967803955} 11/07/2021 01:50:48 - INFO - __main__ - Step 32621: {'lr': 0.0004490254812360126, 'samples': 6263232, 'steps': 32620, 'loss/train': 1.5842281579971313} 11/07/2021 01:50:48 - INFO - __main__ - Step 32622: {'lr': 0.0004490222697481748, 'samples': 6263424, 'steps': 32621, 'loss/train': 1.4485617876052856} 11/07/2021 01:50:49 - INFO - __main__ - Step 32623: {'lr': 0.00044901905817066055, 'samples': 6263616, 'steps': 32622, 'loss/train': 1.4813716411590576} 11/07/2021 01:50:49 - INFO - __main__ - Step 32624: {'lr': 0.00044901584650347147, 'samples': 6263808, 'steps': 32623, 'loss/train': 1.515797734260559} 11/07/2021 01:50:49 - INFO - __main__ - Step 32625: {'lr': 0.00044901263474660894, 'samples': 6264000, 'steps': 32624, 'loss/train': 1.7303627729415894} 11/07/2021 01:50:50 - INFO - __main__ - Step 32626: {'lr': 0.0004490094229000743, 'samples': 6264192, 'steps': 32625, 'loss/train': 1.6219604015350342} 11/07/2021 01:50:51 - INFO - __main__ - Step 32627: {'lr': 0.00044900621096386904, 'samples': 6264384, 'steps': 32626, 'loss/train': 1.5284425020217896} 11/07/2021 01:50:51 - INFO - __main__ - Step 32628: {'lr': 0.00044900299893799476, 'samples': 6264576, 'steps': 32627, 'loss/train': 2.063946008682251} 11/07/2021 01:50:51 - INFO - __main__ - Step 32629: {'lr': 0.0004489997868224528, 'samples': 6264768, 'steps': 32628, 'loss/train': 1.539214015007019} 11/07/2021 01:50:52 - INFO - __main__ - Step 32630: {'lr': 0.00044899657461724453, 'samples': 6264960, 'steps': 32629, 'loss/train': 1.2570024728775024} 11/07/2021 01:50:53 - INFO - __main__ - Step 32631: {'lr': 0.00044899336232237156, 'samples': 6265152, 'steps': 32630, 'loss/train': 1.4593011140823364} 11/07/2021 01:50:53 - INFO - __main__ - Step 32632: {'lr': 0.0004489901499378352, 'samples': 6265344, 'steps': 32631, 'loss/train': 1.3911088705062866} 11/07/2021 01:50:54 - INFO - __main__ - Step 32633: {'lr': 0.00044898693746363695, 'samples': 6265536, 'steps': 32632, 'loss/train': 1.2107559442520142} 11/07/2021 01:50:54 - INFO - __main__ - Step 32634: {'lr': 0.00044898372489977825, 'samples': 6265728, 'steps': 32633, 'loss/train': 1.6459823846817017} 11/07/2021 01:50:54 - INFO - __main__ - Step 32635: {'lr': 0.0004489805122462606, 'samples': 6265920, 'steps': 32634, 'loss/train': 1.6577672958374023} 11/07/2021 01:50:55 - INFO - __main__ - Step 32636: {'lr': 0.0004489772995030853, 'samples': 6266112, 'steps': 32635, 'loss/train': 1.554206371307373} 11/07/2021 01:50:56 - INFO - __main__ - Step 32637: {'lr': 0.00044897408667025397, 'samples': 6266304, 'steps': 32636, 'loss/train': 1.4235544204711914} 11/07/2021 01:50:56 - INFO - __main__ - Step 32638: {'lr': 0.000448970873747768, 'samples': 6266496, 'steps': 32637, 'loss/train': 1.7344202995300293} 11/07/2021 01:50:56 - INFO - __main__ - Step 32639: {'lr': 0.0004489676607356288, 'samples': 6266688, 'steps': 32638, 'loss/train': 1.4471172094345093} 11/07/2021 01:50:57 - INFO - __main__ - Step 32640: {'lr': 0.00044896444763383787, 'samples': 6266880, 'steps': 32639, 'loss/train': 1.6442102193832397} 11/07/2021 01:50:58 - INFO - __main__ - Step 32641: {'lr': 0.00044896123444239654, 'samples': 6267072, 'steps': 32640, 'loss/train': 1.3261736631393433} 11/07/2021 01:50:58 - INFO - __main__ - Step 32642: {'lr': 0.00044895802116130644, 'samples': 6267264, 'steps': 32641, 'loss/train': 1.5865647792816162} 11/07/2021 01:50:59 - INFO - __main__ - Step 32643: {'lr': 0.0004489548077905689, 'samples': 6267456, 'steps': 32642, 'loss/train': 1.6438711881637573} 11/07/2021 01:50:59 - INFO - __main__ - Step 32644: {'lr': 0.0004489515943301854, 'samples': 6267648, 'steps': 32643, 'loss/train': 1.586187720298767} 11/07/2021 01:50:59 - INFO - __main__ - Step 32645: {'lr': 0.0004489483807801574, 'samples': 6267840, 'steps': 32644, 'loss/train': 2.010782241821289} 11/07/2021 01:51:00 - INFO - __main__ - Step 32646: {'lr': 0.00044894516714048626, 'samples': 6268032, 'steps': 32645, 'loss/train': 0.6327466368675232} 11/07/2021 01:51:01 - INFO - __main__ - Step 32647: {'lr': 0.0004489419534111736, 'samples': 6268224, 'steps': 32646, 'loss/train': 1.2490745782852173} 11/07/2021 01:51:01 - INFO - __main__ - Step 32648: {'lr': 0.0004489387395922207, 'samples': 6268416, 'steps': 32647, 'loss/train': 1.1977260112762451} 11/07/2021 01:51:02 - INFO - __main__ - Step 32649: {'lr': 0.00044893552568362903, 'samples': 6268608, 'steps': 32648, 'loss/train': 1.0430372953414917} 11/07/2021 01:51:02 - INFO - __main__ - Step 32650: {'lr': 0.0004489323116854002, 'samples': 6268800, 'steps': 32649, 'loss/train': 2.163680076599121} 11/07/2021 01:51:02 - INFO - __main__ - Step 32651: {'lr': 0.00044892909759753545, 'samples': 6268992, 'steps': 32650, 'loss/train': 1.574050784111023} 11/07/2021 01:51:03 - INFO - __main__ - Step 32652: {'lr': 0.00044892588342003637, 'samples': 6269184, 'steps': 32651, 'loss/train': 1.681310772895813} 11/07/2021 01:51:04 - INFO - __main__ - Step 32653: {'lr': 0.00044892266915290435, 'samples': 6269376, 'steps': 32652, 'loss/train': 1.147005319595337} 11/07/2021 01:51:04 - INFO - __main__ - Step 32654: {'lr': 0.00044891945479614084, 'samples': 6269568, 'steps': 32653, 'loss/train': 1.4096343517303467} 11/07/2021 01:51:04 - INFO - __main__ - Step 32655: {'lr': 0.00044891624034974726, 'samples': 6269760, 'steps': 32654, 'loss/train': 0.13647647202014923} 11/07/2021 01:51:05 - INFO - __main__ - Step 32656: {'lr': 0.00044891302581372513, 'samples': 6269952, 'steps': 32655, 'loss/train': 1.183340072631836} 11/07/2021 01:51:06 - INFO - __main__ - Step 32657: {'lr': 0.00044890981118807585, 'samples': 6270144, 'steps': 32656, 'loss/train': 2.038282871246338} 11/07/2021 01:51:06 - INFO - __main__ - Step 32658: {'lr': 0.00044890659647280084, 'samples': 6270336, 'steps': 32657, 'loss/train': 1.4143182039260864} 11/07/2021 01:51:07 - INFO - __main__ - Step 32659: {'lr': 0.0004489033816679016, 'samples': 6270528, 'steps': 32658, 'loss/train': 1.9537559747695923} 11/07/2021 01:51:07 - INFO - __main__ - Step 32660: {'lr': 0.0004489001667733796, 'samples': 6270720, 'steps': 32659, 'loss/train': 1.4231013059616089} 11/07/2021 01:51:07 - INFO - __main__ - Step 32661: {'lr': 0.0004488969517892363, 'samples': 6270912, 'steps': 32660, 'loss/train': 0.7900300025939941} 11/07/2021 01:51:08 - INFO - __main__ - Step 32662: {'lr': 0.000448893736715473, 'samples': 6271104, 'steps': 32661, 'loss/train': 0.13605500757694244} 11/07/2021 01:51:09 - INFO - __main__ - Step 32663: {'lr': 0.0004488905215520913, 'samples': 6271296, 'steps': 32662, 'loss/train': 1.8541309833526611} 11/07/2021 01:51:09 - INFO - __main__ - Step 32664: {'lr': 0.00044888730629909256, 'samples': 6271488, 'steps': 32663, 'loss/train': 1.7157195806503296} 11/07/2021 01:51:09 - INFO - __main__ - Step 32665: {'lr': 0.00044888409095647833, 'samples': 6271680, 'steps': 32664, 'loss/train': 1.3088997602462769} 11/07/2021 01:51:10 - INFO - __main__ - Step 32666: {'lr': 0.00044888087552424997, 'samples': 6271872, 'steps': 32665, 'loss/train': 2.039842128753662} 11/07/2021 01:51:11 - INFO - __main__ - Step 32667: {'lr': 0.00044887766000240893, 'samples': 6272064, 'steps': 32666, 'loss/train': 1.5708863735198975} 11/07/2021 01:51:11 - INFO - __main__ - Step 32668: {'lr': 0.0004488744443909567, 'samples': 6272256, 'steps': 32667, 'loss/train': 0.8644485473632812} 11/07/2021 01:51:12 - INFO - __main__ - Step 32669: {'lr': 0.0004488712286898947, 'samples': 6272448, 'steps': 32668, 'loss/train': 1.4932048320770264} 11/07/2021 01:51:12 - INFO - __main__ - Step 32670: {'lr': 0.0004488680128992244, 'samples': 6272640, 'steps': 32669, 'loss/train': 1.103938341140747} 11/07/2021 01:51:12 - INFO - __main__ - Step 32671: {'lr': 0.00044886479701894736, 'samples': 6272832, 'steps': 32670, 'loss/train': 1.7463362216949463} 11/07/2021 01:51:13 - INFO - __main__ - Step 32672: {'lr': 0.00044886158104906476, 'samples': 6273024, 'steps': 32671, 'loss/train': 1.4635449647903442} 11/07/2021 01:51:14 - INFO - __main__ - Step 32673: {'lr': 0.0004488583649895782, 'samples': 6273216, 'steps': 32672, 'loss/train': 1.4198908805847168} 11/07/2021 01:51:14 - INFO - __main__ - Step 32674: {'lr': 0.00044885514884048926, 'samples': 6273408, 'steps': 32673, 'loss/train': 2.2054474353790283} 11/07/2021 01:51:14 - INFO - __main__ - Step 32675: {'lr': 0.0004488519326017991, 'samples': 6273600, 'steps': 32674, 'loss/train': 0.6003201603889465} 11/07/2021 01:51:15 - INFO - __main__ - Step 32676: {'lr': 0.0004488487162735094, 'samples': 6273792, 'steps': 32675, 'loss/train': 1.483665943145752} 11/07/2021 01:51:16 - INFO - __main__ - Step 32677: {'lr': 0.00044884549985562165, 'samples': 6273984, 'steps': 32676, 'loss/train': 1.7669689655303955} 11/07/2021 01:51:16 - INFO - __main__ - Step 32678: {'lr': 0.000448842283348137, 'samples': 6274176, 'steps': 32677, 'loss/train': 1.4703480005264282} 11/07/2021 01:51:16 - INFO - __main__ - Step 32679: {'lr': 0.0004488390667510572, 'samples': 6274368, 'steps': 32678, 'loss/train': 1.728654146194458} 11/07/2021 01:51:17 - INFO - __main__ - Step 32680: {'lr': 0.00044883585006438354, 'samples': 6274560, 'steps': 32679, 'loss/train': 2.1558849811553955} 11/07/2021 01:51:17 - INFO - __main__ - Step 32681: {'lr': 0.0004488326332881175, 'samples': 6274752, 'steps': 32680, 'loss/train': 1.354612946510315} 11/07/2021 01:51:17 - INFO - __main__ - Step 32682: {'lr': 0.0004488294164222606, 'samples': 6274944, 'steps': 32681, 'loss/train': 1.248321771621704} 11/07/2021 01:51:18 - INFO - __main__ - Step 32683: {'lr': 0.0004488261994668142, 'samples': 6275136, 'steps': 32682, 'loss/train': 1.4758888483047485} 11/07/2021 01:51:19 - INFO - __main__ - Step 32684: {'lr': 0.00044882298242177976, 'samples': 6275328, 'steps': 32683, 'loss/train': 1.4852855205535889} 11/07/2021 01:51:19 - INFO - __main__ - Step 32685: {'lr': 0.00044881976528715877, 'samples': 6275520, 'steps': 32684, 'loss/train': 1.3000860214233398} 11/07/2021 01:51:20 - INFO - __main__ - Step 32686: {'lr': 0.0004488165480629527, 'samples': 6275712, 'steps': 32685, 'loss/train': 1.4111723899841309} 11/07/2021 01:51:20 - INFO - __main__ - Step 32687: {'lr': 0.00044881333074916287, 'samples': 6275904, 'steps': 32686, 'loss/train': 1.4297772645950317} 11/07/2021 01:51:21 - INFO - __main__ - Step 32688: {'lr': 0.00044881011334579093, 'samples': 6276096, 'steps': 32687, 'loss/train': 1.4350974559783936} 11/07/2021 01:51:21 - INFO - __main__ - Step 32689: {'lr': 0.0004488068958528382, 'samples': 6276288, 'steps': 32688, 'loss/train': 1.2064836025238037} 11/07/2021 01:51:22 - INFO - __main__ - Step 32690: {'lr': 0.0004488036782703061, 'samples': 6276480, 'steps': 32689, 'loss/train': 1.6409106254577637} 11/07/2021 01:51:22 - INFO - __main__ - Step 32691: {'lr': 0.00044880046059819615, 'samples': 6276672, 'steps': 32690, 'loss/train': 1.3003720045089722} 11/07/2021 01:51:22 - INFO - __main__ - Step 32692: {'lr': 0.00044879724283650976, 'samples': 6276864, 'steps': 32691, 'loss/train': 1.6657252311706543} 11/07/2021 01:51:23 - INFO - __main__ - Step 32693: {'lr': 0.0004487940249852484, 'samples': 6277056, 'steps': 32692, 'loss/train': 1.2542271614074707} 11/07/2021 01:51:24 - INFO - __main__ - Step 32694: {'lr': 0.0004487908070444136, 'samples': 6277248, 'steps': 32693, 'loss/train': 1.806897521018982} 11/07/2021 01:51:24 - INFO - __main__ - Step 32695: {'lr': 0.00044878758901400665, 'samples': 6277440, 'steps': 32694, 'loss/train': 1.5347445011138916} 11/07/2021 01:51:24 - INFO - __main__ - Step 32696: {'lr': 0.00044878437089402906, 'samples': 6277632, 'steps': 32695, 'loss/train': 2.1197705268859863} 11/07/2021 01:51:25 - INFO - __main__ - Step 32697: {'lr': 0.0004487811526844824, 'samples': 6277824, 'steps': 32696, 'loss/train': 1.1654471158981323} 11/07/2021 01:51:26 - INFO - __main__ - Step 32698: {'lr': 0.0004487779343853679, 'samples': 6278016, 'steps': 32697, 'loss/train': 1.547286868095398} 11/07/2021 01:51:26 - INFO - __main__ - Step 32699: {'lr': 0.00044877471599668716, 'samples': 6278208, 'steps': 32698, 'loss/train': 0.7873335480690002} 11/07/2021 01:51:27 - INFO - __main__ - Step 32700: {'lr': 0.00044877149751844164, 'samples': 6278400, 'steps': 32699, 'loss/train': 2.0370588302612305} 11/07/2021 01:51:27 - INFO - __main__ - Step 32701: {'lr': 0.00044876827895063277, 'samples': 6278592, 'steps': 32700, 'loss/train': 1.2202073335647583} 11/07/2021 01:51:27 - INFO - __main__ - Step 32702: {'lr': 0.0004487650602932619, 'samples': 6278784, 'steps': 32701, 'loss/train': 1.208509087562561} 11/07/2021 01:51:28 - INFO - __main__ - Step 32703: {'lr': 0.00044876184154633066, 'samples': 6278976, 'steps': 32702, 'loss/train': 1.2279237508773804} 11/07/2021 01:51:29 - INFO - __main__ - Step 32704: {'lr': 0.00044875862270984035, 'samples': 6279168, 'steps': 32703, 'loss/train': 1.2749691009521484} 11/07/2021 01:51:29 - INFO - __main__ - Step 32705: {'lr': 0.0004487554037837925, 'samples': 6279360, 'steps': 32704, 'loss/train': 1.5158764123916626} 11/07/2021 01:51:29 - INFO - __main__ - Step 32706: {'lr': 0.00044875218476818845, 'samples': 6279552, 'steps': 32705, 'loss/train': 0.7631714940071106} 11/07/2021 01:51:30 - INFO - __main__ - Step 32707: {'lr': 0.0004487489656630298, 'samples': 6279744, 'steps': 32706, 'loss/train': 1.2436619997024536} 11/07/2021 01:51:31 - INFO - __main__ - Step 32708: {'lr': 0.00044874574646831794, 'samples': 6279936, 'steps': 32707, 'loss/train': 1.5233253240585327} 11/07/2021 01:51:31 - INFO - __main__ - Step 32709: {'lr': 0.0004487425271840543, 'samples': 6280128, 'steps': 32708, 'loss/train': 1.2317440509796143} 11/07/2021 01:51:32 - INFO - __main__ - Step 32710: {'lr': 0.0004487393078102403, 'samples': 6280320, 'steps': 32709, 'loss/train': 1.1704007387161255} 11/07/2021 01:51:32 - INFO - __main__ - Step 32711: {'lr': 0.00044873608834687754, 'samples': 6280512, 'steps': 32710, 'loss/train': 0.9771917462348938} 11/07/2021 01:51:32 - INFO - __main__ - Step 32712: {'lr': 0.00044873286879396724, 'samples': 6280704, 'steps': 32711, 'loss/train': 1.4926505088806152} 11/07/2021 01:51:33 - INFO - __main__ - Step 32713: {'lr': 0.00044872964915151106, 'samples': 6280896, 'steps': 32712, 'loss/train': 1.4966946840286255} 11/07/2021 01:51:33 - INFO - __main__ - Step 32714: {'lr': 0.00044872642941951035, 'samples': 6281088, 'steps': 32713, 'loss/train': 0.7444011569023132} 11/07/2021 01:51:34 - INFO - __main__ - Step 32715: {'lr': 0.0004487232095979666, 'samples': 6281280, 'steps': 32714, 'loss/train': 0.12462414056062698} 11/07/2021 01:51:35 - INFO - __main__ - Step 32716: {'lr': 0.0004487199896868812, 'samples': 6281472, 'steps': 32715, 'loss/train': 1.3743922710418701} 11/07/2021 01:51:35 - INFO - __main__ - Step 32717: {'lr': 0.00044871676968625564, 'samples': 6281664, 'steps': 32716, 'loss/train': 0.969983696937561} 11/07/2021 01:51:35 - INFO - __main__ - Step 32718: {'lr': 0.00044871354959609135, 'samples': 6281856, 'steps': 32717, 'loss/train': 0.7198702096939087} 11/07/2021 01:51:36 - INFO - __main__ - Step 32719: {'lr': 0.00044871032941638984, 'samples': 6282048, 'steps': 32718, 'loss/train': 1.4743660688400269} 11/07/2021 01:51:37 - INFO - __main__ - Step 32720: {'lr': 0.00044870710914715254, 'samples': 6282240, 'steps': 32719, 'loss/train': 1.7300770282745361} 11/07/2021 01:51:37 - INFO - __main__ - Step 32721: {'lr': 0.00044870388878838084, 'samples': 6282432, 'steps': 32720, 'loss/train': 1.1336641311645508} 11/07/2021 01:51:37 - INFO - __main__ - Step 32722: {'lr': 0.00044870066834007627, 'samples': 6282624, 'steps': 32721, 'loss/train': 2.130664587020874} 11/07/2021 01:51:38 - INFO - __main__ - Step 32723: {'lr': 0.0004486974478022402, 'samples': 6282816, 'steps': 32722, 'loss/train': 1.418270230293274} 11/07/2021 01:51:38 - INFO - __main__ - Step 32724: {'lr': 0.0004486942271748742, 'samples': 6283008, 'steps': 32723, 'loss/train': 1.6737797260284424} 11/07/2021 01:51:39 - INFO - __main__ - Step 32725: {'lr': 0.0004486910064579796, 'samples': 6283200, 'steps': 32724, 'loss/train': 1.8791981935501099} 11/07/2021 01:51:40 - INFO - __main__ - Step 32726: {'lr': 0.00044868778565155783, 'samples': 6283392, 'steps': 32725, 'loss/train': 1.7475714683532715} 11/07/2021 01:51:40 - INFO - __main__ - Step 32727: {'lr': 0.00044868456475561047, 'samples': 6283584, 'steps': 32726, 'loss/train': 1.5137977600097656} 11/07/2021 01:51:40 - INFO - __main__ - Step 32728: {'lr': 0.0004486813437701389, 'samples': 6283776, 'steps': 32727, 'loss/train': 1.4692602157592773} 11/07/2021 01:51:41 - INFO - __main__ - Step 32729: {'lr': 0.0004486781226951446, 'samples': 6283968, 'steps': 32728, 'loss/train': 1.7525389194488525} 11/07/2021 01:51:41 - INFO - __main__ - Step 32730: {'lr': 0.000448674901530629, 'samples': 6284160, 'steps': 32729, 'loss/train': 2.4053380489349365} 11/07/2021 01:51:42 - INFO - __main__ - Step 32731: {'lr': 0.00044867168027659356, 'samples': 6284352, 'steps': 32730, 'loss/train': 0.664936900138855} 11/07/2021 01:51:42 - INFO - __main__ - Step 32732: {'lr': 0.00044866845893303973, 'samples': 6284544, 'steps': 32731, 'loss/train': 1.8049708604812622} 11/07/2021 01:51:43 - INFO - __main__ - Step 32733: {'lr': 0.00044866523749996897, 'samples': 6284736, 'steps': 32732, 'loss/train': 1.127227783203125} 11/07/2021 01:51:43 - INFO - __main__ - Step 32734: {'lr': 0.0004486620159773827, 'samples': 6284928, 'steps': 32733, 'loss/train': 1.4827308654785156} 11/07/2021 01:51:43 - INFO - __main__ - Step 32735: {'lr': 0.0004486587943652823, 'samples': 6285120, 'steps': 32734, 'loss/train': 0.731379508972168} 11/07/2021 01:51:44 - INFO - __main__ - Step 32736: {'lr': 0.00044865557266366953, 'samples': 6285312, 'steps': 32735, 'loss/train': 1.535205602645874} 11/07/2021 01:51:45 - INFO - __main__ - Step 32737: {'lr': 0.0004486523508725454, 'samples': 6285504, 'steps': 32736, 'loss/train': 0.8805704712867737} 11/07/2021 01:51:45 - INFO - __main__ - Step 32738: {'lr': 0.00044864912899191174, 'samples': 6285696, 'steps': 32737, 'loss/train': 1.415027141571045} 11/07/2021 01:51:45 - INFO - __main__ - Step 32739: {'lr': 0.00044864590702176977, 'samples': 6285888, 'steps': 32738, 'loss/train': 2.4802517890930176} 11/07/2021 01:51:46 - INFO - __main__ - Step 32740: {'lr': 0.000448642684962121, 'samples': 6286080, 'steps': 32739, 'loss/train': 0.7023144364356995} 11/07/2021 01:51:47 - INFO - __main__ - Step 32741: {'lr': 0.000448639462812967, 'samples': 6286272, 'steps': 32740, 'loss/train': 1.2147858142852783} 11/07/2021 01:51:47 - INFO - __main__ - Step 32742: {'lr': 0.0004486362405743091, 'samples': 6286464, 'steps': 32741, 'loss/train': 1.6830997467041016} 11/07/2021 01:51:47 - INFO - __main__ - Step 32743: {'lr': 0.0004486330182461487, 'samples': 6286656, 'steps': 32742, 'loss/train': 1.9115580320358276} 11/07/2021 01:51:48 - INFO - __main__ - Step 32744: {'lr': 0.0004486297958284874, 'samples': 6286848, 'steps': 32743, 'loss/train': 1.754269003868103} 11/07/2021 01:51:48 - INFO - __main__ - Step 32745: {'lr': 0.0004486265733213265, 'samples': 6287040, 'steps': 32744, 'loss/train': 1.3951165676116943} 11/07/2021 01:51:49 - INFO - __main__ - Step 32746: {'lr': 0.00044862335072466767, 'samples': 6287232, 'steps': 32745, 'loss/train': 1.4054536819458008} 11/07/2021 01:51:49 - INFO - __main__ - Step 32747: {'lr': 0.00044862012803851203, 'samples': 6287424, 'steps': 32746, 'loss/train': 1.815017819404602} 11/07/2021 01:51:50 - INFO - __main__ - Step 32748: {'lr': 0.00044861690526286135, 'samples': 6287616, 'steps': 32747, 'loss/train': 1.6497738361358643} 11/07/2021 01:51:50 - INFO - __main__ - Step 32749: {'lr': 0.00044861368239771694, 'samples': 6287808, 'steps': 32748, 'loss/train': 1.5400633811950684} 11/07/2021 01:51:51 - INFO - __main__ - Step 32750: {'lr': 0.00044861045944308026, 'samples': 6288000, 'steps': 32749, 'loss/train': 2.035322427749634} 11/07/2021 01:51:52 - INFO - __main__ - Step 32751: {'lr': 0.0004486072363989528, 'samples': 6288192, 'steps': 32750, 'loss/train': 1.2623913288116455} 11/07/2021 01:51:52 - INFO - __main__ - Step 32752: {'lr': 0.00044860401326533595, 'samples': 6288384, 'steps': 32751, 'loss/train': 1.6374726295471191} 11/07/2021 01:51:52 - INFO - __main__ - Step 32753: {'lr': 0.0004486007900422312, 'samples': 6288576, 'steps': 32752, 'loss/train': 1.5325310230255127} 11/07/2021 01:51:53 - INFO - __main__ - Step 32754: {'lr': 0.00044859756672964, 'samples': 6288768, 'steps': 32753, 'loss/train': 1.6418508291244507} 11/07/2021 01:51:53 - INFO - __main__ - Step 32755: {'lr': 0.00044859434332756383, 'samples': 6288960, 'steps': 32754, 'loss/train': 1.3992834091186523} 11/07/2021 01:51:53 - INFO - __main__ - Step 32756: {'lr': 0.0004485911198360041, 'samples': 6289152, 'steps': 32755, 'loss/train': 1.0894807577133179} 11/07/2021 01:51:55 - INFO - __main__ - Step 32757: {'lr': 0.0004485878962549622, 'samples': 6289344, 'steps': 32756, 'loss/train': 1.551389217376709} 11/07/2021 01:51:55 - INFO - __main__ - Step 32758: {'lr': 0.0004485846725844398, 'samples': 6289536, 'steps': 32757, 'loss/train': 1.5550835132598877} 11/07/2021 01:51:55 - INFO - __main__ - Step 32759: {'lr': 0.0004485814488244381, 'samples': 6289728, 'steps': 32758, 'loss/train': 1.1718662977218628} 11/07/2021 01:51:56 - INFO - __main__ - Step 32760: {'lr': 0.0004485782249749587, 'samples': 6289920, 'steps': 32759, 'loss/train': 1.6369692087173462} 11/07/2021 01:51:57 - INFO - __main__ - Step 32761: {'lr': 0.00044857500103600304, 'samples': 6290112, 'steps': 32760, 'loss/train': 1.2651540040969849} 11/07/2021 01:51:57 - INFO - __main__ - Step 32762: {'lr': 0.00044857177700757247, 'samples': 6290304, 'steps': 32761, 'loss/train': 1.0267809629440308} 11/07/2021 01:51:58 - INFO - __main__ - Step 32763: {'lr': 0.00044856855288966856, 'samples': 6290496, 'steps': 32762, 'loss/train': 1.6318790912628174} 11/07/2021 01:51:58 - INFO - __main__ - Step 32764: {'lr': 0.0004485653286822927, 'samples': 6290688, 'steps': 32763, 'loss/train': 1.508261799812317} 11/07/2021 01:51:58 - INFO - __main__ - Step 32765: {'lr': 0.0004485621043854465, 'samples': 6290880, 'steps': 32764, 'loss/train': 1.1478630304336548} 11/07/2021 01:52:00 - INFO - __main__ - Step 32766: {'lr': 0.0004485588799991311, 'samples': 6291072, 'steps': 32765, 'loss/train': 1.610480785369873} 11/07/2021 01:52:00 - INFO - __main__ - Step 32767: {'lr': 0.0004485556555233483, 'samples': 6291264, 'steps': 32766, 'loss/train': 1.4900367259979248} 11/07/2021 01:52:01 - INFO - __main__ - Step 32768: {'lr': 0.0004485524309580993, 'samples': 6291456, 'steps': 32767, 'loss/train': 1.6736993789672852} 11/07/2021 01:52:01 - INFO - __main__ - Step 32769: {'lr': 0.0004485492063033856, 'samples': 6291648, 'steps': 32768, 'loss/train': 1.2844321727752686} 11/07/2021 01:52:01 - INFO - __main__ - Step 32770: {'lr': 0.0004485459815592087, 'samples': 6291840, 'steps': 32769, 'loss/train': 0.6486344933509827} 11/07/2021 01:52:02 - INFO - __main__ - Step 32771: {'lr': 0.0004485427567255701, 'samples': 6292032, 'steps': 32770, 'loss/train': 1.6293174028396606} 11/07/2021 01:52:03 - INFO - __main__ - Step 32772: {'lr': 0.0004485395318024712, 'samples': 6292224, 'steps': 32771, 'loss/train': 0.7933187484741211} 11/07/2021 01:52:03 - INFO - __main__ - Step 32773: {'lr': 0.00044853630678991344, 'samples': 6292416, 'steps': 32772, 'loss/train': 1.568111777305603} 11/07/2021 01:52:04 - INFO - __main__ - Step 32774: {'lr': 0.00044853308168789824, 'samples': 6292608, 'steps': 32773, 'loss/train': 1.1893032789230347} 11/07/2021 01:52:04 - INFO - __main__ - Step 32775: {'lr': 0.00044852985649642714, 'samples': 6292800, 'steps': 32774, 'loss/train': 1.2858831882476807} 11/07/2021 01:52:05 - INFO - __main__ - Step 32776: {'lr': 0.0004485266312155015, 'samples': 6292992, 'steps': 32775, 'loss/train': 0.5672725439071655} 11/07/2021 01:52:05 - INFO - __main__ - Step 32777: {'lr': 0.00044852340584512285, 'samples': 6293184, 'steps': 32776, 'loss/train': 1.5529122352600098} 11/07/2021 01:52:07 - INFO - __main__ - Step 32778: {'lr': 0.00044852018038529264, 'samples': 6293376, 'steps': 32777, 'loss/train': 0.8248746395111084} 11/07/2021 01:52:07 - INFO - __main__ - Step 32779: {'lr': 0.00044851695483601227, 'samples': 6293568, 'steps': 32778, 'loss/train': 1.5078567266464233} 11/07/2021 01:52:07 - INFO - __main__ - Step 32780: {'lr': 0.0004485137291972833, 'samples': 6293760, 'steps': 32779, 'loss/train': 0.17733517289161682} 11/07/2021 01:52:08 - INFO - __main__ - Step 32781: {'lr': 0.00044851050346910706, 'samples': 6293952, 'steps': 32780, 'loss/train': 1.8087482452392578} 11/07/2021 01:52:08 - INFO - __main__ - Step 32782: {'lr': 0.00044850727765148504, 'samples': 6294144, 'steps': 32781, 'loss/train': 1.6935482025146484} 11/07/2021 01:52:08 - INFO - __main__ - Step 32783: {'lr': 0.00044850405174441866, 'samples': 6294336, 'steps': 32782, 'loss/train': 1.7380541563034058} 11/07/2021 01:52:09 - INFO - __main__ - Step 32784: {'lr': 0.00044850082574790945, 'samples': 6294528, 'steps': 32783, 'loss/train': 1.4098385572433472} 11/07/2021 01:52:10 - INFO - __main__ - Step 32785: {'lr': 0.0004484975996619589, 'samples': 6294720, 'steps': 32784, 'loss/train': 1.4865195751190186} 11/07/2021 01:52:10 - INFO - __main__ - Step 32786: {'lr': 0.0004484943734865683, 'samples': 6294912, 'steps': 32785, 'loss/train': 1.7889684438705444} 11/07/2021 01:52:10 - INFO - __main__ - Step 32787: {'lr': 0.0004484911472217392, 'samples': 6295104, 'steps': 32786, 'loss/train': 1.7738747596740723} 11/07/2021 01:52:11 - INFO - __main__ - Step 32788: {'lr': 0.0004484879208674731, 'samples': 6295296, 'steps': 32787, 'loss/train': 1.524864673614502} 11/07/2021 01:52:12 - INFO - __main__ - Step 32789: {'lr': 0.0004484846944237714, 'samples': 6295488, 'steps': 32788, 'loss/train': 1.3773404359817505} 11/07/2021 01:52:12 - INFO - __main__ - Step 32790: {'lr': 0.0004484814678906355, 'samples': 6295680, 'steps': 32789, 'loss/train': 1.2628083229064941} 11/07/2021 01:52:12 - INFO - __main__ - Step 32791: {'lr': 0.00044847824126806703, 'samples': 6295872, 'steps': 32790, 'loss/train': 1.7810370922088623} 11/07/2021 01:52:13 - INFO - __main__ - Step 32792: {'lr': 0.0004484750145560672, 'samples': 6296064, 'steps': 32791, 'loss/train': 1.3144395351409912} 11/07/2021 01:52:13 - INFO - __main__ - Step 32793: {'lr': 0.0004484717877546377, 'samples': 6296256, 'steps': 32792, 'loss/train': 1.530842900276184} 11/07/2021 01:52:14 - INFO - __main__ - Step 32794: {'lr': 0.0004484685608637798, 'samples': 6296448, 'steps': 32793, 'loss/train': 2.359969139099121} 11/07/2021 01:52:14 - INFO - __main__ - Step 32795: {'lr': 0.00044846533388349507, 'samples': 6296640, 'steps': 32794, 'loss/train': 1.4508795738220215} 11/07/2021 01:52:15 - INFO - __main__ - Step 32796: {'lr': 0.00044846210681378487, 'samples': 6296832, 'steps': 32795, 'loss/train': 1.4554624557495117} 11/07/2021 01:52:15 - INFO - __main__ - Step 32797: {'lr': 0.00044845887965465076, 'samples': 6297024, 'steps': 32796, 'loss/train': 0.6192265748977661} 11/07/2021 01:52:16 - INFO - __main__ - Step 32798: {'lr': 0.0004484556524060941, 'samples': 6297216, 'steps': 32797, 'loss/train': 1.7106328010559082} 11/07/2021 01:52:16 - INFO - __main__ - Step 32799: {'lr': 0.00044845242506811646, 'samples': 6297408, 'steps': 32798, 'loss/train': 1.029484510421753} 11/07/2021 01:52:17 - INFO - __main__ - Step 32800: {'lr': 0.0004484491976407192, 'samples': 6297600, 'steps': 32799, 'loss/train': 0.9896617531776428} 11/07/2021 01:52:17 - INFO - __main__ - Step 32801: {'lr': 0.00044844597012390374, 'samples': 6297792, 'steps': 32800, 'loss/train': 1.7723320722579956} 11/07/2021 01:52:18 - INFO - __main__ - Step 32802: {'lr': 0.0004484427425176716, 'samples': 6297984, 'steps': 32801, 'loss/train': 1.3181513547897339} 11/07/2021 01:52:18 - INFO - __main__ - Step 32803: {'lr': 0.0004484395148220243, 'samples': 6298176, 'steps': 32802, 'loss/train': 1.4404723644256592} 11/07/2021 01:52:18 - INFO - __main__ - Step 32804: {'lr': 0.000448436287036963, 'samples': 6298368, 'steps': 32803, 'loss/train': 1.3333238363265991} 11/07/2021 01:52:19 - INFO - __main__ - Step 32805: {'lr': 0.0004484330591624896, 'samples': 6298560, 'steps': 32804, 'loss/train': 1.694399118423462} 11/07/2021 01:52:20 - INFO - __main__ - Step 32806: {'lr': 0.00044842983119860525, 'samples': 6298752, 'steps': 32805, 'loss/train': 1.9107273817062378} 11/07/2021 01:52:20 - INFO - __main__ - Step 32807: {'lr': 0.00044842660314531145, 'samples': 6298944, 'steps': 32806, 'loss/train': 1.6565542221069336} 11/07/2021 01:52:20 - INFO - __main__ - Step 32808: {'lr': 0.0004484233750026098, 'samples': 6299136, 'steps': 32807, 'loss/train': 1.9830169677734375} 11/07/2021 01:52:21 - INFO - __main__ - Step 32809: {'lr': 0.00044842014677050145, 'samples': 6299328, 'steps': 32808, 'loss/train': 1.630691409111023} 11/07/2021 01:52:22 - INFO - __main__ - Step 32810: {'lr': 0.0004484169184489882, 'samples': 6299520, 'steps': 32809, 'loss/train': 1.3876402378082275} 11/07/2021 01:52:22 - INFO - __main__ - Step 32811: {'lr': 0.0004484136900380713, 'samples': 6299712, 'steps': 32810, 'loss/train': 1.570265531539917} 11/07/2021 01:52:22 - INFO - __main__ - Step 32812: {'lr': 0.00044841046153775224, 'samples': 6299904, 'steps': 32811, 'loss/train': 1.195267915725708} 11/07/2021 01:52:23 - INFO - __main__ - Step 32813: {'lr': 0.0004484072329480325, 'samples': 6300096, 'steps': 32812, 'loss/train': 1.3839094638824463} 11/07/2021 01:52:23 - INFO - __main__ - Step 32814: {'lr': 0.00044840400426891347, 'samples': 6300288, 'steps': 32813, 'loss/train': 1.3134123086929321} 11/07/2021 01:52:24 - INFO - __main__ - Step 32815: {'lr': 0.00044840077550039676, 'samples': 6300480, 'steps': 32814, 'loss/train': 1.7066586017608643} 11/07/2021 01:52:24 - INFO - __main__ - Step 32816: {'lr': 0.0004483975466424837, 'samples': 6300672, 'steps': 32815, 'loss/train': 1.567929983139038} 11/07/2021 01:52:25 - INFO - __main__ - Step 32817: {'lr': 0.0004483943176951757, 'samples': 6300864, 'steps': 32816, 'loss/train': 1.2354013919830322} 11/07/2021 01:52:25 - INFO - __main__ - Step 32818: {'lr': 0.0004483910886584743, 'samples': 6301056, 'steps': 32817, 'loss/train': 1.8153525590896606} 11/07/2021 01:52:26 - INFO - __main__ - Step 32819: {'lr': 0.00044838785953238094, 'samples': 6301248, 'steps': 32818, 'loss/train': 1.7477885484695435} 11/07/2021 01:52:27 - INFO - __main__ - Step 32820: {'lr': 0.0004483846303168971, 'samples': 6301440, 'steps': 32819, 'loss/train': 1.4585903882980347} 11/07/2021 01:52:27 - INFO - __main__ - Step 32821: {'lr': 0.0004483814010120242, 'samples': 6301632, 'steps': 32820, 'loss/train': 1.7392491102218628} 11/07/2021 01:52:27 - INFO - __main__ - Step 32822: {'lr': 0.00044837817161776366, 'samples': 6301824, 'steps': 32821, 'loss/train': 1.7412809133529663} 11/07/2021 01:52:28 - INFO - __main__ - Step 32823: {'lr': 0.000448374942134117, 'samples': 6302016, 'steps': 32822, 'loss/train': 1.758912205696106} 11/07/2021 01:52:28 - INFO - __main__ - Step 32824: {'lr': 0.0004483717125610857, 'samples': 6302208, 'steps': 32823, 'loss/train': 1.3502815961837769} 11/07/2021 01:52:29 - INFO - __main__ - Step 32825: {'lr': 0.0004483684828986712, 'samples': 6302400, 'steps': 32824, 'loss/train': 1.465355396270752} 11/07/2021 01:52:29 - INFO - __main__ - Step 32826: {'lr': 0.00044836525314687477, 'samples': 6302592, 'steps': 32825, 'loss/train': 1.514260172843933} 11/07/2021 01:52:30 - INFO - __main__ - Step 32827: {'lr': 0.0004483620233056981, 'samples': 6302784, 'steps': 32826, 'loss/train': 1.2928588390350342} 11/07/2021 01:52:30 - INFO - __main__ - Step 32828: {'lr': 0.00044835879337514254, 'samples': 6302976, 'steps': 32827, 'loss/train': 1.0733113288879395} 11/07/2021 01:52:30 - INFO - __main__ - Step 32829: {'lr': 0.0004483555633552096, 'samples': 6303168, 'steps': 32828, 'loss/train': 1.7348523139953613} 11/07/2021 01:52:32 - INFO - __main__ - Step 32830: {'lr': 0.00044835233324590077, 'samples': 6303360, 'steps': 32829, 'loss/train': 1.7591265439987183} 11/07/2021 01:52:32 - INFO - __main__ - Step 32831: {'lr': 0.0004483491030472173, 'samples': 6303552, 'steps': 32830, 'loss/train': 0.8813719749450684} 11/07/2021 01:52:33 - INFO - __main__ - Step 32832: {'lr': 0.00044834587275916084, 'samples': 6303744, 'steps': 32831, 'loss/train': 1.1142882108688354} 11/07/2021 01:52:33 - INFO - __main__ - Step 32833: {'lr': 0.00044834264238173283, 'samples': 6303936, 'steps': 32832, 'loss/train': 0.9702965617179871} 11/07/2021 01:52:33 - INFO - __main__ - Step 32834: {'lr': 0.00044833941191493463, 'samples': 6304128, 'steps': 32833, 'loss/train': 1.464808464050293} 11/07/2021 01:52:34 - INFO - __main__ - Step 32835: {'lr': 0.0004483361813587678, 'samples': 6304320, 'steps': 32834, 'loss/train': 1.6141979694366455} 11/07/2021 01:52:35 - INFO - __main__ - Step 32836: {'lr': 0.0004483329507132337, 'samples': 6304512, 'steps': 32835, 'loss/train': 0.1967136561870575} 11/07/2021 01:52:35 - INFO - __main__ - Step 32837: {'lr': 0.0004483297199783338, 'samples': 6304704, 'steps': 32836, 'loss/train': 1.5581127405166626} 11/07/2021 01:52:36 - INFO - __main__ - Step 32838: {'lr': 0.0004483264891540697, 'samples': 6304896, 'steps': 32837, 'loss/train': 1.515840768814087} 11/07/2021 01:52:36 - INFO - __main__ - Step 32839: {'lr': 0.00044832325824044274, 'samples': 6305088, 'steps': 32838, 'loss/train': 1.810701847076416} 11/07/2021 01:52:36 - INFO - __main__ - Step 32840: {'lr': 0.0004483200272374543, 'samples': 6305280, 'steps': 32839, 'loss/train': 1.9597874879837036} 11/07/2021 01:52:37 - INFO - __main__ - Step 32841: {'lr': 0.0004483167961451059, 'samples': 6305472, 'steps': 32840, 'loss/train': 1.7444734573364258} 11/07/2021 01:52:38 - INFO - __main__ - Step 32842: {'lr': 0.00044831356496339913, 'samples': 6305664, 'steps': 32841, 'loss/train': 1.6750328540802002} 11/07/2021 01:52:38 - INFO - __main__ - Step 32843: {'lr': 0.0004483103336923352, 'samples': 6305856, 'steps': 32842, 'loss/train': 1.8195252418518066} 11/07/2021 01:52:38 - INFO - __main__ - Step 32844: {'lr': 0.00044830710233191573, 'samples': 6306048, 'steps': 32843, 'loss/train': 1.419494867324829} 11/07/2021 01:52:39 - INFO - __main__ - Step 32845: {'lr': 0.0004483038708821422, 'samples': 6306240, 'steps': 32844, 'loss/train': 1.6472922563552856} 11/07/2021 01:52:40 - INFO - __main__ - Step 32846: {'lr': 0.00044830063934301603, 'samples': 6306432, 'steps': 32845, 'loss/train': 1.400617241859436} 11/07/2021 01:52:40 - INFO - __main__ - Step 32847: {'lr': 0.0004482974077145385, 'samples': 6306624, 'steps': 32846, 'loss/train': 1.5048298835754395} 11/07/2021 01:52:40 - INFO - __main__ - Step 32848: {'lr': 0.0004482941759967113, 'samples': 6306816, 'steps': 32847, 'loss/train': 1.3856581449508667} 11/07/2021 01:52:41 - INFO - __main__ - Step 32849: {'lr': 0.00044829094418953586, 'samples': 6307008, 'steps': 32848, 'loss/train': 1.4409213066101074} 11/07/2021 01:52:41 - INFO - __main__ - Step 32850: {'lr': 0.00044828771229301354, 'samples': 6307200, 'steps': 32849, 'loss/train': 1.7861766815185547} 11/07/2021 01:52:43 - INFO - __main__ - Step 32851: {'lr': 0.0004482844803071458, 'samples': 6307392, 'steps': 32850, 'loss/train': 1.4633113145828247} 11/07/2021 01:52:43 - INFO - __main__ - Step 32852: {'lr': 0.00044828124823193417, 'samples': 6307584, 'steps': 32851, 'loss/train': 1.0925400257110596} 11/07/2021 01:52:44 - INFO - __main__ - Step 32853: {'lr': 0.00044827801606738004, 'samples': 6307776, 'steps': 32852, 'loss/train': 0.734474778175354} 11/07/2021 01:52:44 - INFO - __main__ - Step 32854: {'lr': 0.00044827478381348495, 'samples': 6307968, 'steps': 32853, 'loss/train': 0.8628458976745605} 11/07/2021 01:52:44 - INFO - __main__ - Step 32855: {'lr': 0.00044827155147025025, 'samples': 6308160, 'steps': 32854, 'loss/train': 1.8457977771759033} 11/07/2021 01:52:45 - INFO - __main__ - Step 32856: {'lr': 0.00044826831903767745, 'samples': 6308352, 'steps': 32855, 'loss/train': 1.9462559223175049} 11/07/2021 01:52:45 - INFO - __main__ - Step 32857: {'lr': 0.000448265086515768, 'samples': 6308544, 'steps': 32856, 'loss/train': 1.4201304912567139} 11/07/2021 01:52:45 - INFO - __main__ - Step 32858: {'lr': 0.0004482618539045234, 'samples': 6308736, 'steps': 32857, 'loss/train': 1.3535361289978027} 11/07/2021 01:52:46 - INFO - __main__ - Step 32859: {'lr': 0.00044825862120394504, 'samples': 6308928, 'steps': 32858, 'loss/train': 2.0859124660491943} 11/07/2021 01:52:47 - INFO - __main__ - Step 32860: {'lr': 0.00044825538841403444, 'samples': 6309120, 'steps': 32859, 'loss/train': 1.2962948083877563} 11/07/2021 01:52:47 - INFO - __main__ - Step 32861: {'lr': 0.000448252155534793, 'samples': 6309312, 'steps': 32860, 'loss/train': 1.482293963432312} 11/07/2021 01:52:47 - INFO - __main__ - Step 32862: {'lr': 0.0004482489225662222, 'samples': 6309504, 'steps': 32861, 'loss/train': 1.0501551628112793} 11/07/2021 01:52:48 - INFO - __main__ - Step 32863: {'lr': 0.00044824568950832343, 'samples': 6309696, 'steps': 32862, 'loss/train': 1.8620959520339966} 11/07/2021 01:52:49 - INFO - __main__ - Step 32864: {'lr': 0.0004482424563610983, 'samples': 6309888, 'steps': 32863, 'loss/train': 1.2694894075393677} 11/07/2021 01:52:49 - INFO - __main__ - Step 32865: {'lr': 0.00044823922312454815, 'samples': 6310080, 'steps': 32864, 'loss/train': 1.4919681549072266} 11/07/2021 01:52:50 - INFO - __main__ - Step 32866: {'lr': 0.00044823598979867445, 'samples': 6310272, 'steps': 32865, 'loss/train': 1.699884295463562} 11/07/2021 01:52:50 - INFO - __main__ - Step 32867: {'lr': 0.0004482327563834787, 'samples': 6310464, 'steps': 32866, 'loss/train': 2.1392643451690674} 11/07/2021 01:52:50 - INFO - __main__ - Step 32868: {'lr': 0.00044822952287896237, 'samples': 6310656, 'steps': 32867, 'loss/train': 4.73948860168457} 11/07/2021 01:52:51 - INFO - __main__ - Step 32869: {'lr': 0.00044822628928512675, 'samples': 6310848, 'steps': 32868, 'loss/train': 1.5720044374465942} 11/07/2021 01:52:52 - INFO - __main__ - Step 32870: {'lr': 0.0004482230556019735, 'samples': 6311040, 'steps': 32869, 'loss/train': 1.441838026046753} 11/07/2021 01:52:52 - INFO - __main__ - Step 32871: {'lr': 0.00044821982182950405, 'samples': 6311232, 'steps': 32870, 'loss/train': 1.2322503328323364} 11/07/2021 01:52:52 - INFO - __main__ - Step 32872: {'lr': 0.0004482165879677197, 'samples': 6311424, 'steps': 32871, 'loss/train': 1.16120183467865} 11/07/2021 01:52:53 - INFO - __main__ - Step 32873: {'lr': 0.0004482133540166221, 'samples': 6311616, 'steps': 32872, 'loss/train': 1.548761248588562} 11/07/2021 01:52:53 - INFO - __main__ - Step 32874: {'lr': 0.00044821011997621255, 'samples': 6311808, 'steps': 32873, 'loss/train': 1.5385053157806396} 11/07/2021 01:52:54 - INFO - __main__ - Step 32875: {'lr': 0.0004482068858464926, 'samples': 6312000, 'steps': 32874, 'loss/train': 1.227051019668579} 11/07/2021 01:52:54 - INFO - __main__ - Step 32876: {'lr': 0.00044820365162746373, 'samples': 6312192, 'steps': 32875, 'loss/train': 1.523007869720459} 11/07/2021 01:52:55 - INFO - __main__ - Step 32877: {'lr': 0.00044820041731912733, 'samples': 6312384, 'steps': 32876, 'loss/train': 1.4473198652267456} 11/07/2021 01:52:55 - INFO - __main__ - Step 32878: {'lr': 0.0004481971829214848, 'samples': 6312576, 'steps': 32877, 'loss/train': 1.064133644104004} 11/07/2021 01:52:55 - INFO - __main__ - Step 32879: {'lr': 0.0004481939484345378, 'samples': 6312768, 'steps': 32878, 'loss/train': 1.7819557189941406} 11/07/2021 01:52:56 - INFO - __main__ - Step 32880: {'lr': 0.0004481907138582876, 'samples': 6312960, 'steps': 32879, 'loss/train': 1.6354236602783203} 11/07/2021 01:52:57 - INFO - __main__ - Step 32881: {'lr': 0.00044818747919273574, 'samples': 6313152, 'steps': 32880, 'loss/train': 1.5709049701690674} 11/07/2021 01:52:57 - INFO - __main__ - Step 32882: {'lr': 0.0004481842444378837, 'samples': 6313344, 'steps': 32881, 'loss/train': 1.5239031314849854} 11/07/2021 01:52:58 - INFO - __main__ - Step 32883: {'lr': 0.0004481810095937329, 'samples': 6313536, 'steps': 32882, 'loss/train': 1.4392449855804443} 11/07/2021 01:52:58 - INFO - __main__ - Step 32884: {'lr': 0.00044817777466028467, 'samples': 6313728, 'steps': 32883, 'loss/train': 0.982414960861206} 11/07/2021 01:52:59 - INFO - __main__ - Step 32885: {'lr': 0.0004481745396375407, 'samples': 6313920, 'steps': 32884, 'loss/train': 1.0925793647766113} 11/07/2021 01:52:59 - INFO - __main__ - Step 32886: {'lr': 0.0004481713045255023, 'samples': 6314112, 'steps': 32885, 'loss/train': 1.6663484573364258} 11/07/2021 01:53:00 - INFO - __main__ - Step 32887: {'lr': 0.000448168069324171, 'samples': 6314304, 'steps': 32886, 'loss/train': 1.6085951328277588} 11/07/2021 01:53:00 - INFO - __main__ - Step 32888: {'lr': 0.0004481648340335482, 'samples': 6314496, 'steps': 32887, 'loss/train': 1.4408528804779053} 11/07/2021 01:53:00 - INFO - __main__ - Step 32889: {'lr': 0.0004481615986536354, 'samples': 6314688, 'steps': 32888, 'loss/train': 2.108576536178589} 11/07/2021 01:53:01 - INFO - __main__ - Step 32890: {'lr': 0.000448158363184434, 'samples': 6314880, 'steps': 32889, 'loss/train': 1.7130979299545288} 11/07/2021 01:53:02 - INFO - __main__ - Step 32891: {'lr': 0.00044815512762594556, 'samples': 6315072, 'steps': 32890, 'loss/train': 2.199972629547119} 11/07/2021 01:53:02 - INFO - __main__ - Step 32892: {'lr': 0.00044815189197817143, 'samples': 6315264, 'steps': 32891, 'loss/train': 1.4540982246398926} 11/07/2021 01:53:03 - INFO - __main__ - Step 32893: {'lr': 0.0004481486562411131, 'samples': 6315456, 'steps': 32892, 'loss/train': 0.44139307737350464} 11/07/2021 01:53:03 - INFO - __main__ - Step 32894: {'lr': 0.0004481454204147721, 'samples': 6315648, 'steps': 32893, 'loss/train': 1.2219226360321045} 11/07/2021 01:53:04 - INFO - __main__ - Step 32895: {'lr': 0.0004481421844991498, 'samples': 6315840, 'steps': 32894, 'loss/train': 1.4876707792282104} 11/07/2021 01:53:04 - INFO - __main__ - Step 32896: {'lr': 0.00044813894849424777, 'samples': 6316032, 'steps': 32895, 'loss/train': 1.8859457969665527} 11/07/2021 01:53:05 - INFO - __main__ - Step 32897: {'lr': 0.0004481357124000672, 'samples': 6316224, 'steps': 32896, 'loss/train': 1.562057375907898} 11/07/2021 01:53:05 - INFO - __main__ - Step 32898: {'lr': 0.0004481324762166099, 'samples': 6316416, 'steps': 32897, 'loss/train': 2.0499045848846436} 11/07/2021 01:53:05 - INFO - __main__ - Step 32899: {'lr': 0.0004481292399438771, 'samples': 6316608, 'steps': 32898, 'loss/train': 1.692986249923706} 11/07/2021 01:53:06 - INFO - __main__ - Step 32900: {'lr': 0.0004481260035818704, 'samples': 6316800, 'steps': 32899, 'loss/train': 0.8094185590744019} 11/07/2021 01:53:07 - INFO - __main__ - Step 32901: {'lr': 0.00044812276713059106, 'samples': 6316992, 'steps': 32900, 'loss/train': 1.7495591640472412} 11/07/2021 01:53:07 - INFO - __main__ - Step 32902: {'lr': 0.00044811953059004073, 'samples': 6317184, 'steps': 32901, 'loss/train': 1.744451880455017} 11/07/2021 01:53:07 - INFO - __main__ - Step 32903: {'lr': 0.0004481162939602208, 'samples': 6317376, 'steps': 32902, 'loss/train': 0.218343123793602} 11/07/2021 01:53:08 - INFO - __main__ - Step 32904: {'lr': 0.0004481130572411327, 'samples': 6317568, 'steps': 32903, 'loss/train': 1.5772401094436646} 11/07/2021 01:53:08 - INFO - __main__ - Step 32905: {'lr': 0.00044810982043277795, 'samples': 6317760, 'steps': 32904, 'loss/train': 1.4718493223190308} 11/07/2021 01:53:09 - INFO - __main__ - Step 32906: {'lr': 0.0004481065835351579, 'samples': 6317952, 'steps': 32905, 'loss/train': 1.7350021600723267} 11/07/2021 01:53:10 - INFO - __main__ - Step 32907: {'lr': 0.0004481033465482741, 'samples': 6318144, 'steps': 32906, 'loss/train': 1.5972042083740234} 11/07/2021 01:53:10 - INFO - __main__ - Step 32908: {'lr': 0.00044810010947212803, 'samples': 6318336, 'steps': 32907, 'loss/train': 1.625590443611145} 11/07/2021 01:53:10 - INFO - __main__ - Step 32909: {'lr': 0.00044809687230672115, 'samples': 6318528, 'steps': 32908, 'loss/train': 1.201430320739746} 11/07/2021 01:53:11 - INFO - __main__ - Step 32910: {'lr': 0.0004480936350520548, 'samples': 6318720, 'steps': 32909, 'loss/train': 1.5031765699386597} 11/07/2021 01:53:12 - INFO - __main__ - Step 32911: {'lr': 0.0004480903977081305, 'samples': 6318912, 'steps': 32910, 'loss/train': 1.6832504272460938} 11/07/2021 01:53:12 - INFO - __main__ - Step 32912: {'lr': 0.00044808716027494973, 'samples': 6319104, 'steps': 32911, 'loss/train': 1.54212486743927} 11/07/2021 01:53:13 - INFO - __main__ - Step 32913: {'lr': 0.000448083922752514, 'samples': 6319296, 'steps': 32912, 'loss/train': 1.823056936264038} 11/07/2021 01:53:13 - INFO - __main__ - Step 32914: {'lr': 0.00044808068514082467, 'samples': 6319488, 'steps': 32913, 'loss/train': 2.00506854057312} 11/07/2021 01:53:13 - INFO - __main__ - Step 32915: {'lr': 0.0004480774474398832, 'samples': 6319680, 'steps': 32914, 'loss/train': 1.3997207880020142} 11/07/2021 01:53:14 - INFO - __main__ - Step 32916: {'lr': 0.00044807420964969113, 'samples': 6319872, 'steps': 32915, 'loss/train': 1.5196776390075684} 11/07/2021 01:53:15 - INFO - __main__ - Step 32917: {'lr': 0.0004480709717702499, 'samples': 6320064, 'steps': 32916, 'loss/train': 1.6158102750778198} 11/07/2021 01:53:15 - INFO - __main__ - Step 32918: {'lr': 0.000448067733801561, 'samples': 6320256, 'steps': 32917, 'loss/train': 1.8232070207595825} 11/07/2021 01:53:15 - INFO - __main__ - Step 32919: {'lr': 0.00044806449574362575, 'samples': 6320448, 'steps': 32918, 'loss/train': 1.3813356161117554} 11/07/2021 01:53:16 - INFO - __main__ - Step 32920: {'lr': 0.00044806125759644567, 'samples': 6320640, 'steps': 32919, 'loss/train': 1.3492869138717651} 11/07/2021 01:53:17 - INFO - __main__ - Step 32921: {'lr': 0.00044805801936002225, 'samples': 6320832, 'steps': 32920, 'loss/train': 1.7767366170883179} 11/07/2021 01:53:17 - INFO - __main__ - Step 32922: {'lr': 0.00044805478103435707, 'samples': 6321024, 'steps': 32921, 'loss/train': 0.8150038719177246} 11/07/2021 01:53:17 - INFO - __main__ - Step 32923: {'lr': 0.0004480515426194513, 'samples': 6321216, 'steps': 32922, 'loss/train': 1.5820306539535522} 11/07/2021 01:53:18 - INFO - __main__ - Step 32924: {'lr': 0.0004480483041153066, 'samples': 6321408, 'steps': 32923, 'loss/train': 1.350351095199585} 11/07/2021 01:53:18 - INFO - __main__ - Step 32925: {'lr': 0.00044804506552192447, 'samples': 6321600, 'steps': 32924, 'loss/train': 1.5434849262237549} 11/07/2021 01:53:19 - INFO - __main__ - Step 32926: {'lr': 0.0004480418268393062, 'samples': 6321792, 'steps': 32925, 'loss/train': 1.0062862634658813} 11/07/2021 01:53:20 - INFO - __main__ - Step 32927: {'lr': 0.0004480385880674534, 'samples': 6321984, 'steps': 32926, 'loss/train': 1.7393062114715576} 11/07/2021 01:53:20 - INFO - __main__ - Step 32928: {'lr': 0.00044803534920636744, 'samples': 6322176, 'steps': 32927, 'loss/train': 2.093810796737671} 11/07/2021 01:53:20 - INFO - __main__ - Step 32929: {'lr': 0.00044803211025604985, 'samples': 6322368, 'steps': 32928, 'loss/train': 1.494863748550415} 11/07/2021 01:53:21 - INFO - __main__ - Step 32930: {'lr': 0.000448028871216502, 'samples': 6322560, 'steps': 32929, 'loss/train': 1.5068172216415405} 11/07/2021 01:53:22 - INFO - __main__ - Step 32931: {'lr': 0.0004480256320877254, 'samples': 6322752, 'steps': 32930, 'loss/train': 0.7170737385749817} 11/07/2021 01:53:22 - INFO - __main__ - Step 32932: {'lr': 0.00044802239286972147, 'samples': 6322944, 'steps': 32931, 'loss/train': 1.9031884670257568} 11/07/2021 01:53:22 - INFO - __main__ - Step 32933: {'lr': 0.0004480191535624918, 'samples': 6323136, 'steps': 32932, 'loss/train': 1.4989677667617798} 11/07/2021 01:53:23 - INFO - __main__ - Step 32934: {'lr': 0.0004480159141660377, 'samples': 6323328, 'steps': 32933, 'loss/train': 1.6413919925689697} 11/07/2021 01:53:23 - INFO - __main__ - Step 32935: {'lr': 0.00044801267468036064, 'samples': 6323520, 'steps': 32934, 'loss/train': 1.7031339406967163} 11/07/2021 01:53:23 - INFO - __main__ - Step 32936: {'lr': 0.0004480094351054622, 'samples': 6323712, 'steps': 32935, 'loss/train': 1.2702281475067139} 11/07/2021 01:53:24 - INFO - __main__ - Step 32937: {'lr': 0.00044800619544134375, 'samples': 6323904, 'steps': 32936, 'loss/train': 1.7458767890930176} 11/07/2021 01:53:25 - INFO - __main__ - Step 32938: {'lr': 0.00044800295568800673, 'samples': 6324096, 'steps': 32937, 'loss/train': 1.6693315505981445} 11/07/2021 01:53:25 - INFO - __main__ - Step 32939: {'lr': 0.0004479997158454526, 'samples': 6324288, 'steps': 32938, 'loss/train': 1.2809923887252808} 11/07/2021 01:53:25 - INFO - __main__ - Step 32940: {'lr': 0.00044799647591368296, 'samples': 6324480, 'steps': 32939, 'loss/train': 1.6685653924942017} 11/07/2021 01:53:26 - INFO - __main__ - Step 32941: {'lr': 0.00044799323589269914, 'samples': 6324672, 'steps': 32940, 'loss/train': 2.008671760559082} 11/07/2021 01:53:27 - INFO - __main__ - Step 32942: {'lr': 0.00044798999578250255, 'samples': 6324864, 'steps': 32941, 'loss/train': 1.6154990196228027} 11/07/2021 01:53:27 - INFO - __main__ - Step 32943: {'lr': 0.0004479867555830948, 'samples': 6325056, 'steps': 32942, 'loss/train': 1.376146674156189} 11/07/2021 01:53:27 - INFO - __main__ - Step 32944: {'lr': 0.0004479835152944772, 'samples': 6325248, 'steps': 32943, 'loss/train': 1.3978157043457031} 11/07/2021 01:53:28 - INFO - __main__ - Step 32945: {'lr': 0.00044798027491665135, 'samples': 6325440, 'steps': 32944, 'loss/train': 1.762140154838562} 11/07/2021 01:53:28 - INFO - __main__ - Step 32946: {'lr': 0.00044797703444961857, 'samples': 6325632, 'steps': 32945, 'loss/train': 2.225754976272583} 11/07/2021 01:53:29 - INFO - __main__ - Step 32947: {'lr': 0.00044797379389338045, 'samples': 6325824, 'steps': 32946, 'loss/train': 1.8722946643829346} 11/07/2021 01:53:29 - INFO - __main__ - Step 32948: {'lr': 0.0004479705532479384, 'samples': 6326016, 'steps': 32947, 'loss/train': 1.6617766618728638} 11/07/2021 01:53:30 - INFO - __main__ - Step 32949: {'lr': 0.0004479673125132938, 'samples': 6326208, 'steps': 32948, 'loss/train': 1.630037784576416} 11/07/2021 01:53:30 - INFO - __main__ - Step 32950: {'lr': 0.0004479640716894483, 'samples': 6326400, 'steps': 32949, 'loss/train': 1.559309720993042} 11/07/2021 01:53:31 - INFO - __main__ - Step 32951: {'lr': 0.00044796083077640314, 'samples': 6326592, 'steps': 32950, 'loss/train': 1.286965012550354} 11/07/2021 01:53:32 - INFO - __main__ - Step 32952: {'lr': 0.00044795758977416, 'samples': 6326784, 'steps': 32951, 'loss/train': 1.2813061475753784} 11/07/2021 01:53:32 - INFO - __main__ - Step 32953: {'lr': 0.0004479543486827201, 'samples': 6326976, 'steps': 32952, 'loss/train': 1.5169000625610352} 11/07/2021 01:53:32 - INFO - __main__ - Step 32954: {'lr': 0.0004479511075020851, 'samples': 6327168, 'steps': 32953, 'loss/train': 1.5031007528305054} 11/07/2021 01:53:33 - INFO - __main__ - Step 32955: {'lr': 0.00044794786623225636, 'samples': 6327360, 'steps': 32954, 'loss/train': 1.3424718379974365} 11/07/2021 01:53:33 - INFO - __main__ - Step 32956: {'lr': 0.0004479446248732354, 'samples': 6327552, 'steps': 32955, 'loss/train': 0.9912104606628418} 11/07/2021 01:53:34 - INFO - __main__ - Step 32957: {'lr': 0.00044794138342502354, 'samples': 6327744, 'steps': 32956, 'loss/train': 1.6631126403808594} 11/07/2021 01:53:35 - INFO - __main__ - Step 32958: {'lr': 0.0004479381418876225, 'samples': 6327936, 'steps': 32957, 'loss/train': 1.2141307592391968} 11/07/2021 01:53:35 - INFO - __main__ - Step 32959: {'lr': 0.00044793490026103346, 'samples': 6328128, 'steps': 32958, 'loss/train': 1.517661690711975} 11/07/2021 01:53:35 - INFO - __main__ - Step 32960: {'lr': 0.0004479316585452581, 'samples': 6328320, 'steps': 32959, 'loss/train': 1.7940460443496704} 11/07/2021 01:53:36 - INFO - __main__ - Step 32961: {'lr': 0.0004479284167402977, 'samples': 6328512, 'steps': 32960, 'loss/train': 1.6163368225097656} 11/07/2021 01:53:36 - INFO - __main__ - Step 32962: {'lr': 0.00044792517484615384, 'samples': 6328704, 'steps': 32961, 'loss/train': 1.1192171573638916} 11/07/2021 01:53:37 - INFO - __main__ - Step 32963: {'lr': 0.000447921932862828, 'samples': 6328896, 'steps': 32962, 'loss/train': 0.14294178783893585} 11/07/2021 01:53:37 - INFO - __main__ - Step 32964: {'lr': 0.00044791869079032154, 'samples': 6329088, 'steps': 32963, 'loss/train': 1.6653163433074951} 11/07/2021 01:53:38 - INFO - __main__ - Step 32965: {'lr': 0.000447915448628636, 'samples': 6329280, 'steps': 32964, 'loss/train': 1.598552942276001} 11/07/2021 01:53:38 - INFO - __main__ - Step 32966: {'lr': 0.0004479122063777728, 'samples': 6329472, 'steps': 32965, 'loss/train': 1.8037034273147583} 11/07/2021 01:53:38 - INFO - __main__ - Step 32967: {'lr': 0.0004479089640377334, 'samples': 6329664, 'steps': 32966, 'loss/train': 1.4403834342956543} 11/07/2021 01:53:39 - INFO - __main__ - Step 32968: {'lr': 0.00044790572160851926, 'samples': 6329856, 'steps': 32967, 'loss/train': 1.6658954620361328} 11/07/2021 01:53:40 - INFO - __main__ - Step 32969: {'lr': 0.00044790247909013195, 'samples': 6330048, 'steps': 32968, 'loss/train': 1.7885410785675049} 11/07/2021 01:53:40 - INFO - __main__ - Step 32970: {'lr': 0.0004478992364825728, 'samples': 6330240, 'steps': 32969, 'loss/train': 1.591148018836975} 11/07/2021 01:53:41 - INFO - __main__ - Step 32971: {'lr': 0.00044789599378584324, 'samples': 6330432, 'steps': 32970, 'loss/train': 1.7470932006835938} 11/07/2021 01:53:41 - INFO - __main__ - Step 32972: {'lr': 0.0004478927509999449, 'samples': 6330624, 'steps': 32971, 'loss/train': 1.6637678146362305} 11/07/2021 01:53:42 - INFO - __main__ - Step 32973: {'lr': 0.00044788950812487907, 'samples': 6330816, 'steps': 32972, 'loss/train': 1.8130570650100708} 11/07/2021 01:53:42 - INFO - __main__ - Step 32974: {'lr': 0.0004478862651606472, 'samples': 6331008, 'steps': 32973, 'loss/train': 1.705833077430725} 11/07/2021 01:53:43 - INFO - __main__ - Step 32975: {'lr': 0.000447883022107251, 'samples': 6331200, 'steps': 32974, 'loss/train': 1.3867896795272827} 11/07/2021 01:53:43 - INFO - __main__ - Step 32976: {'lr': 0.00044787977896469167, 'samples': 6331392, 'steps': 32975, 'loss/train': 1.3941127061843872} 11/07/2021 01:53:43 - INFO - __main__ - Step 32977: {'lr': 0.0004478765357329708, 'samples': 6331584, 'steps': 32976, 'loss/train': 1.7699060440063477} 11/07/2021 01:53:44 - INFO - __main__ - Step 32978: {'lr': 0.0004478732924120897, 'samples': 6331776, 'steps': 32977, 'loss/train': 1.2456763982772827} 11/07/2021 01:53:45 - INFO - __main__ - Step 32979: {'lr': 0.0004478700490020501, 'samples': 6331968, 'steps': 32978, 'loss/train': 1.3738280534744263} 11/07/2021 01:53:45 - INFO - __main__ - Step 32980: {'lr': 0.0004478668055028533, 'samples': 6332160, 'steps': 32979, 'loss/train': 1.2072436809539795} 11/07/2021 01:53:45 - INFO - __main__ - Step 32981: {'lr': 0.0004478635619145007, 'samples': 6332352, 'steps': 32980, 'loss/train': 1.619855284690857} 11/07/2021 01:53:46 - INFO - __main__ - Step 32982: {'lr': 0.00044786031823699384, 'samples': 6332544, 'steps': 32981, 'loss/train': 1.4322315454483032} 11/07/2021 01:53:46 - INFO - __main__ - Step 32983: {'lr': 0.0004478570744703342, 'samples': 6332736, 'steps': 32982, 'loss/train': 1.6997398138046265} 11/07/2021 01:53:47 - INFO - __main__ - Step 32984: {'lr': 0.00044785383061452324, 'samples': 6332928, 'steps': 32983, 'loss/train': 1.6025248765945435} 11/07/2021 01:53:48 - INFO - __main__ - Step 32985: {'lr': 0.00044785058666956234, 'samples': 6333120, 'steps': 32984, 'loss/train': 1.3251734972000122} 11/07/2021 01:53:48 - INFO - __main__ - Step 32986: {'lr': 0.000447847342635453, 'samples': 6333312, 'steps': 32985, 'loss/train': 2.5228066444396973} 11/07/2021 01:53:48 - INFO - __main__ - Step 32987: {'lr': 0.00044784409851219675, 'samples': 6333504, 'steps': 32986, 'loss/train': 1.1910200119018555} 11/07/2021 01:53:49 - INFO - __main__ - Step 32988: {'lr': 0.00044784085429979504, 'samples': 6333696, 'steps': 32987, 'loss/train': 1.3550323247909546} 11/07/2021 01:53:50 - INFO - __main__ - Step 32989: {'lr': 0.00044783760999824926, 'samples': 6333888, 'steps': 32988, 'loss/train': 1.6246103048324585} 11/07/2021 01:53:50 - INFO - __main__ - Step 32990: {'lr': 0.00044783436560756086, 'samples': 6334080, 'steps': 32989, 'loss/train': 1.7610799074172974} 11/07/2021 01:53:50 - INFO - __main__ - Step 32991: {'lr': 0.00044783112112773137, 'samples': 6334272, 'steps': 32990, 'loss/train': 1.7590878009796143} 11/07/2021 01:53:51 - INFO - __main__ - Step 32992: {'lr': 0.0004478278765587623, 'samples': 6334464, 'steps': 32991, 'loss/train': 1.602809190750122} 11/07/2021 01:53:51 - INFO - __main__ - Step 32993: {'lr': 0.000447824631900655, 'samples': 6334656, 'steps': 32992, 'loss/train': 1.953904151916504} 11/07/2021 01:53:52 - INFO - __main__ - Step 32994: {'lr': 0.00044782138715341094, 'samples': 6334848, 'steps': 32993, 'loss/train': 1.7642078399658203} 11/07/2021 01:53:53 - INFO - __main__ - Step 32995: {'lr': 0.00044781814231703164, 'samples': 6335040, 'steps': 32994, 'loss/train': 1.3278212547302246} 11/07/2021 01:53:53 - INFO - __main__ - Step 32996: {'lr': 0.00044781489739151856, 'samples': 6335232, 'steps': 32995, 'loss/train': 1.373674988746643} 11/07/2021 01:53:53 - INFO - __main__ - Step 32997: {'lr': 0.00044781165237687306, 'samples': 6335424, 'steps': 32996, 'loss/train': 1.150309443473816} 11/07/2021 01:53:54 - INFO - __main__ - Step 32998: {'lr': 0.00044780840727309676, 'samples': 6335616, 'steps': 32997, 'loss/train': 1.4560761451721191} 11/07/2021 01:53:54 - INFO - __main__ - Step 32999: {'lr': 0.000447805162080191, 'samples': 6335808, 'steps': 32998, 'loss/train': 0.7766704559326172} 11/07/2021 01:53:55 - INFO - __main__ - Step 33000: {'lr': 0.0004478019167981573, 'samples': 6336000, 'steps': 32999, 'loss/train': 1.489380955696106} 11/07/2021 01:53:55 - INFO - __main__ - Step 33001: {'lr': 0.00044779867142699713, 'samples': 6336192, 'steps': 33000, 'loss/train': 1.2429890632629395} 11/07/2021 01:53:56 - INFO - __main__ - Step 33002: {'lr': 0.0004477954259667119, 'samples': 6336384, 'steps': 33001, 'loss/train': 1.9934924840927124} 11/07/2021 01:53:56 - INFO - __main__ - Step 33003: {'lr': 0.00044779218041730314, 'samples': 6336576, 'steps': 33002, 'loss/train': 1.16958487033844} 11/07/2021 01:53:56 - INFO - __main__ - Step 33004: {'lr': 0.00044778893477877225, 'samples': 6336768, 'steps': 33003, 'loss/train': 1.4972314834594727} 11/07/2021 01:53:57 - INFO - __main__ - Step 33005: {'lr': 0.0004477856890511207, 'samples': 6336960, 'steps': 33004, 'loss/train': 2.1951231956481934} 11/07/2021 01:53:58 - INFO - __main__ - Step 33006: {'lr': 0.00044778244323435, 'samples': 6337152, 'steps': 33005, 'loss/train': 1.486146092414856} 11/07/2021 01:53:58 - INFO - __main__ - Step 33007: {'lr': 0.0004477791973284616, 'samples': 6337344, 'steps': 33006, 'loss/train': 1.2795722484588623} 11/07/2021 01:53:59 - INFO - __main__ - Step 33008: {'lr': 0.00044777595133345686, 'samples': 6337536, 'steps': 33007, 'loss/train': 1.5722613334655762} 11/07/2021 01:53:59 - INFO - __main__ - Step 33009: {'lr': 0.0004477727052493374, 'samples': 6337728, 'steps': 33008, 'loss/train': 1.4420450925827026} 11/07/2021 01:54:00 - INFO - __main__ - Step 33010: {'lr': 0.0004477694590761046, 'samples': 6337920, 'steps': 33009, 'loss/train': 1.5427229404449463} 11/07/2021 01:54:00 - INFO - __main__ - Step 33011: {'lr': 0.00044776621281375994, 'samples': 6338112, 'steps': 33010, 'loss/train': 1.6263377666473389} 11/07/2021 01:54:01 - INFO - __main__ - Step 33012: {'lr': 0.00044776296646230487, 'samples': 6338304, 'steps': 33011, 'loss/train': 5.8149800300598145} 11/07/2021 01:54:01 - INFO - __main__ - Step 33013: {'lr': 0.00044775972002174085, 'samples': 6338496, 'steps': 33012, 'loss/train': 1.5900511741638184} 11/07/2021 01:54:01 - INFO - __main__ - Step 33014: {'lr': 0.0004477564734920694, 'samples': 6338688, 'steps': 33013, 'loss/train': 1.2043906450271606} 11/07/2021 01:54:02 - INFO - __main__ - Step 33015: {'lr': 0.0004477532268732919, 'samples': 6338880, 'steps': 33014, 'loss/train': 0.8074337244033813} 11/07/2021 01:54:03 - INFO - __main__ - Step 33016: {'lr': 0.00044774998016540977, 'samples': 6339072, 'steps': 33015, 'loss/train': 2.2873222827911377} 11/07/2021 01:54:03 - INFO - __main__ - Step 33017: {'lr': 0.00044774673336842464, 'samples': 6339264, 'steps': 33016, 'loss/train': 1.6069782972335815} 11/07/2021 01:54:03 - INFO - __main__ - Step 33018: {'lr': 0.0004477434864823379, 'samples': 6339456, 'steps': 33017, 'loss/train': 1.5662287473678589} 11/07/2021 01:54:04 - INFO - __main__ - Step 33019: {'lr': 0.00044774023950715095, 'samples': 6339648, 'steps': 33018, 'loss/train': 1.329482078552246} 11/07/2021 01:54:05 - INFO - __main__ - Step 33020: {'lr': 0.0004477369924428653, 'samples': 6339840, 'steps': 33019, 'loss/train': 1.8639891147613525} 11/07/2021 01:54:05 - INFO - __main__ - Step 33021: {'lr': 0.0004477337452894824, 'samples': 6340032, 'steps': 33020, 'loss/train': 1.563912034034729} 11/07/2021 01:54:05 - INFO - __main__ - Step 33022: {'lr': 0.0004477304980470038, 'samples': 6340224, 'steps': 33021, 'loss/train': 1.6074427366256714} 11/07/2021 01:54:06 - INFO - __main__ - Step 33023: {'lr': 0.0004477272507154308, 'samples': 6340416, 'steps': 33022, 'loss/train': 1.5063848495483398} 11/07/2021 01:54:06 - INFO - __main__ - Step 33024: {'lr': 0.00044772400329476505, 'samples': 6340608, 'steps': 33023, 'loss/train': 1.8239989280700684} 11/07/2021 01:54:07 - INFO - __main__ - Step 33025: {'lr': 0.0004477207557850078, 'samples': 6340800, 'steps': 33024, 'loss/train': 1.4097603559494019} 11/07/2021 01:54:08 - INFO - __main__ - Step 33026: {'lr': 0.00044771750818616067, 'samples': 6340992, 'steps': 33025, 'loss/train': 1.4742379188537598} 11/07/2021 01:54:08 - INFO - __main__ - Step 33027: {'lr': 0.0004477142604982251, 'samples': 6341184, 'steps': 33026, 'loss/train': 1.2127858400344849} 11/07/2021 01:54:08 - INFO - __main__ - Step 33028: {'lr': 0.0004477110127212025, 'samples': 6341376, 'steps': 33027, 'loss/train': 1.0178325176239014} 11/07/2021 01:54:09 - INFO - __main__ - Step 33029: {'lr': 0.00044770776485509445, 'samples': 6341568, 'steps': 33028, 'loss/train': 1.7714896202087402} 11/07/2021 01:54:09 - INFO - __main__ - Step 33030: {'lr': 0.00044770451689990227, 'samples': 6341760, 'steps': 33029, 'loss/train': 1.6244250535964966} 11/07/2021 01:54:10 - INFO - __main__ - Step 33031: {'lr': 0.0004477012688556275, 'samples': 6341952, 'steps': 33030, 'loss/train': 1.3644365072250366} 11/07/2021 01:54:10 - INFO - __main__ - Step 33032: {'lr': 0.0004476980207222716, 'samples': 6342144, 'steps': 33031, 'loss/train': 1.3227752447128296} 11/07/2021 01:54:11 - INFO - __main__ - Step 33033: {'lr': 0.00044769477249983596, 'samples': 6342336, 'steps': 33032, 'loss/train': 1.5948199033737183} 11/07/2021 01:54:11 - INFO - __main__ - Step 33034: {'lr': 0.00044769152418832215, 'samples': 6342528, 'steps': 33033, 'loss/train': 1.3293739557266235} 11/07/2021 01:54:11 - INFO - __main__ - Step 33035: {'lr': 0.00044768827578773164, 'samples': 6342720, 'steps': 33034, 'loss/train': 1.208864688873291} 11/07/2021 01:54:13 - INFO - __main__ - Step 33036: {'lr': 0.00044768502729806574, 'samples': 6342912, 'steps': 33035, 'loss/train': 1.484110951423645} 11/07/2021 01:54:13 - INFO - __main__ - Step 33037: {'lr': 0.0004476817787193261, 'samples': 6343104, 'steps': 33036, 'loss/train': 1.348806381225586} 11/07/2021 01:54:13 - INFO - __main__ - Step 33038: {'lr': 0.0004476785300515141, 'samples': 6343296, 'steps': 33037, 'loss/train': 1.033874750137329} 11/07/2021 01:54:14 - INFO - __main__ - Step 33039: {'lr': 0.0004476752812946312, 'samples': 6343488, 'steps': 33038, 'loss/train': 1.3962721824645996} 11/07/2021 01:54:14 - INFO - __main__ - Step 33040: {'lr': 0.0004476720324486788, 'samples': 6343680, 'steps': 33039, 'loss/train': 1.698463797569275} 11/07/2021 01:54:15 - INFO - __main__ - Step 33041: {'lr': 0.0004476687835136585, 'samples': 6343872, 'steps': 33040, 'loss/train': 1.321233868598938} 11/07/2021 01:54:15 - INFO - __main__ - Step 33042: {'lr': 0.0004476655344895717, 'samples': 6344064, 'steps': 33041, 'loss/train': 1.7292245626449585} 11/07/2021 01:54:16 - INFO - __main__ - Step 33043: {'lr': 0.0004476622853764198, 'samples': 6344256, 'steps': 33042, 'loss/train': 1.1548570394515991} 11/07/2021 01:54:16 - INFO - __main__ - Step 33044: {'lr': 0.00044765903617420436, 'samples': 6344448, 'steps': 33043, 'loss/train': 1.1298271417617798} 11/07/2021 01:54:16 - INFO - __main__ - Step 33045: {'lr': 0.00044765578688292686, 'samples': 6344640, 'steps': 33044, 'loss/train': 1.1050087213516235} 11/07/2021 01:54:17 - INFO - __main__ - Step 33046: {'lr': 0.0004476525375025886, 'samples': 6344832, 'steps': 33045, 'loss/train': 1.1851271390914917} 11/07/2021 01:54:18 - INFO - __main__ - Step 33047: {'lr': 0.00044764928803319126, 'samples': 6345024, 'steps': 33046, 'loss/train': 1.1974475383758545} 11/07/2021 01:54:18 - INFO - __main__ - Step 33048: {'lr': 0.00044764603847473615, 'samples': 6345216, 'steps': 33047, 'loss/train': 0.9402886629104614} 11/07/2021 01:54:18 - INFO - __main__ - Step 33049: {'lr': 0.0004476427888272248, 'samples': 6345408, 'steps': 33048, 'loss/train': 1.2218657732009888} 11/07/2021 01:54:19 - INFO - __main__ - Step 33050: {'lr': 0.0004476395390906586, 'samples': 6345600, 'steps': 33049, 'loss/train': 1.2480854988098145} 11/07/2021 01:54:20 - INFO - __main__ - Step 33051: {'lr': 0.0004476362892650392, 'samples': 6345792, 'steps': 33050, 'loss/train': 1.395213007926941} 11/07/2021 01:54:20 - INFO - __main__ - Step 33052: {'lr': 0.0004476330393503678, 'samples': 6345984, 'steps': 33051, 'loss/train': 1.5939335823059082} 11/07/2021 01:54:20 - INFO - __main__ - Step 33053: {'lr': 0.0004476297893466461, 'samples': 6346176, 'steps': 33052, 'loss/train': 1.820644736289978} 11/07/2021 01:54:21 - INFO - __main__ - Step 33054: {'lr': 0.0004476265392538754, 'samples': 6346368, 'steps': 33053, 'loss/train': 1.1525547504425049} 11/07/2021 01:54:21 - INFO - __main__ - Step 33055: {'lr': 0.0004476232890720573, 'samples': 6346560, 'steps': 33054, 'loss/train': 1.3899204730987549} 11/07/2021 01:54:22 - INFO - __main__ - Step 33056: {'lr': 0.0004476200388011932, 'samples': 6346752, 'steps': 33055, 'loss/train': 1.2742310762405396} 11/07/2021 01:54:23 - INFO - __main__ - Step 33057: {'lr': 0.0004476167884412845, 'samples': 6346944, 'steps': 33056, 'loss/train': 1.7061872482299805} 11/07/2021 01:54:23 - INFO - __main__ - Step 33058: {'lr': 0.00044761353799233273, 'samples': 6347136, 'steps': 33057, 'loss/train': 1.2749007940292358} 11/07/2021 01:54:23 - INFO - __main__ - Step 33059: {'lr': 0.00044761028745433934, 'samples': 6347328, 'steps': 33058, 'loss/train': 1.2940367460250854} 11/07/2021 01:54:24 - INFO - __main__ - Step 33060: {'lr': 0.00044760703682730584, 'samples': 6347520, 'steps': 33059, 'loss/train': 3.064441680908203} 11/07/2021 01:54:25 - INFO - __main__ - Step 33061: {'lr': 0.00044760378611123365, 'samples': 6347712, 'steps': 33060, 'loss/train': 1.6477994918823242} 11/07/2021 01:54:25 - INFO - __main__ - Step 33062: {'lr': 0.0004476005353061242, 'samples': 6347904, 'steps': 33061, 'loss/train': 1.6297001838684082} 11/07/2021 01:54:25 - INFO - __main__ - Step 33063: {'lr': 0.00044759728441197904, 'samples': 6348096, 'steps': 33062, 'loss/train': 1.65060555934906} 11/07/2021 01:54:26 - INFO - __main__ - Step 33064: {'lr': 0.0004475940334287996, 'samples': 6348288, 'steps': 33063, 'loss/train': 1.500831961631775} 11/07/2021 01:54:26 - INFO - __main__ - Step 33065: {'lr': 0.0004475907823565873, 'samples': 6348480, 'steps': 33064, 'loss/train': 1.7707630395889282} 11/07/2021 01:54:27 - INFO - __main__ - Step 33066: {'lr': 0.00044758753119534373, 'samples': 6348672, 'steps': 33065, 'loss/train': 1.3939059972763062} 11/07/2021 01:54:28 - INFO - __main__ - Step 33067: {'lr': 0.0004475842799450702, 'samples': 6348864, 'steps': 33066, 'loss/train': 1.124437689781189} 11/07/2021 01:54:28 - INFO - __main__ - Step 33068: {'lr': 0.0004475810286057682, 'samples': 6349056, 'steps': 33067, 'loss/train': 1.4587339162826538} 11/07/2021 01:54:28 - INFO - __main__ - Step 33069: {'lr': 0.0004475777771774393, 'samples': 6349248, 'steps': 33068, 'loss/train': 1.1545443534851074} 11/07/2021 01:54:29 - INFO - __main__ - Step 33070: {'lr': 0.00044757452566008497, 'samples': 6349440, 'steps': 33069, 'loss/train': 0.7420926690101624} 11/07/2021 01:54:29 - INFO - __main__ - Step 33071: {'lr': 0.00044757127405370645, 'samples': 6349632, 'steps': 33070, 'loss/train': 1.7578048706054688} 11/07/2021 01:54:30 - INFO - __main__ - Step 33072: {'lr': 0.00044756802235830544, 'samples': 6349824, 'steps': 33071, 'loss/train': 1.780644416809082} 11/07/2021 01:54:30 - INFO - __main__ - Step 33073: {'lr': 0.00044756477057388336, 'samples': 6350016, 'steps': 33072, 'loss/train': 1.4639098644256592} 11/07/2021 01:54:31 - INFO - __main__ - Step 33074: {'lr': 0.0004475615187004416, 'samples': 6350208, 'steps': 33073, 'loss/train': 1.2951197624206543} 11/07/2021 01:54:31 - INFO - __main__ - Step 33075: {'lr': 0.0004475582667379817, 'samples': 6350400, 'steps': 33074, 'loss/train': 1.5856717824935913} 11/07/2021 01:54:31 - INFO - __main__ - Step 33076: {'lr': 0.0004475550146865051, 'samples': 6350592, 'steps': 33075, 'loss/train': 1.4116233587265015} 11/07/2021 01:54:33 - INFO - __main__ - Step 33077: {'lr': 0.00044755176254601323, 'samples': 6350784, 'steps': 33076, 'loss/train': 0.7812098860740662} 11/07/2021 01:54:33 - INFO - __main__ - Step 33078: {'lr': 0.00044754851031650756, 'samples': 6350976, 'steps': 33077, 'loss/train': 0.7144190073013306} 11/07/2021 01:54:33 - INFO - __main__ - Step 33079: {'lr': 0.0004475452579979896, 'samples': 6351168, 'steps': 33078, 'loss/train': 1.2214468717575073} 11/07/2021 01:54:34 - INFO - __main__ - Step 33080: {'lr': 0.00044754200559046076, 'samples': 6351360, 'steps': 33079, 'loss/train': 1.2222187519073486} 11/07/2021 01:54:34 - INFO - __main__ - Step 33081: {'lr': 0.0004475387530939226, 'samples': 6351552, 'steps': 33080, 'loss/train': 1.365666389465332} 11/07/2021 01:54:35 - INFO - __main__ - Step 33082: {'lr': 0.00044753550050837654, 'samples': 6351744, 'steps': 33081, 'loss/train': 1.3580585718154907} 11/07/2021 01:54:35 - INFO - __main__ - Step 33083: {'lr': 0.00044753224783382394, 'samples': 6351936, 'steps': 33082, 'loss/train': 1.7456175088882446} 11/07/2021 01:54:36 - INFO - __main__ - Step 33084: {'lr': 0.00044752899507026646, 'samples': 6352128, 'steps': 33083, 'loss/train': 1.4929946660995483} 11/07/2021 01:54:36 - INFO - __main__ - Step 33085: {'lr': 0.00044752574221770537, 'samples': 6352320, 'steps': 33084, 'loss/train': 1.0361472368240356} 11/07/2021 01:54:36 - INFO - __main__ - Step 33086: {'lr': 0.0004475224892761423, 'samples': 6352512, 'steps': 33085, 'loss/train': 1.3302377462387085} 11/07/2021 01:54:38 - INFO - __main__ - Step 33087: {'lr': 0.00044751923624557866, 'samples': 6352704, 'steps': 33086, 'loss/train': 1.3933597803115845} 11/07/2021 01:54:38 - INFO - __main__ - Step 33088: {'lr': 0.0004475159831260158, 'samples': 6352896, 'steps': 33087, 'loss/train': 1.3008657693862915} 11/07/2021 01:54:39 - INFO - __main__ - Step 33089: {'lr': 0.00044751272991745537, 'samples': 6353088, 'steps': 33088, 'loss/train': 1.3040766716003418} 11/07/2021 01:54:39 - INFO - __main__ - Step 33090: {'lr': 0.00044750947661989873, 'samples': 6353280, 'steps': 33089, 'loss/train': 1.4220153093338013} 11/07/2021 01:54:39 - INFO - __main__ - Step 33091: {'lr': 0.0004475062232333474, 'samples': 6353472, 'steps': 33090, 'loss/train': 1.1344974040985107} 11/07/2021 01:54:40 - INFO - __main__ - Step 33092: {'lr': 0.00044750296975780277, 'samples': 6353664, 'steps': 33091, 'loss/train': 1.87221097946167} 11/07/2021 01:54:40 - INFO - __main__ - Step 33093: {'lr': 0.00044749971619326633, 'samples': 6353856, 'steps': 33092, 'loss/train': 1.8795900344848633} 11/07/2021 01:54:41 - INFO - __main__ - Step 33094: {'lr': 0.0004474964625397396, 'samples': 6354048, 'steps': 33093, 'loss/train': 1.8172962665557861} 11/07/2021 01:54:41 - INFO - __main__ - Step 33095: {'lr': 0.000447493208797224, 'samples': 6354240, 'steps': 33094, 'loss/train': 1.0788723230361938} 11/07/2021 01:54:42 - INFO - __main__ - Step 33096: {'lr': 0.00044748995496572105, 'samples': 6354432, 'steps': 33095, 'loss/train': 1.5874394178390503} 11/07/2021 01:54:42 - INFO - __main__ - Step 33097: {'lr': 0.0004474867010452321, 'samples': 6354624, 'steps': 33096, 'loss/train': 1.044918179512024} 11/07/2021 01:54:42 - INFO - __main__ - Step 33098: {'lr': 0.0004474834470357587, 'samples': 6354816, 'steps': 33097, 'loss/train': 0.8912402987480164} 11/07/2021 01:54:43 - INFO - __main__ - Step 33099: {'lr': 0.00044748019293730236, 'samples': 6355008, 'steps': 33098, 'loss/train': 1.5409010648727417} 11/07/2021 01:54:44 - INFO - __main__ - Step 33100: {'lr': 0.0004474769387498645, 'samples': 6355200, 'steps': 33099, 'loss/train': 1.6009552478790283} 11/07/2021 01:54:44 - INFO - __main__ - Step 33101: {'lr': 0.0004474736844734465, 'samples': 6355392, 'steps': 33100, 'loss/train': 1.1920632123947144} 11/07/2021 01:54:44 - INFO - __main__ - Step 33102: {'lr': 0.00044747043010805, 'samples': 6355584, 'steps': 33101, 'loss/train': 1.7007176876068115} 11/07/2021 01:54:45 - INFO - __main__ - Step 33103: {'lr': 0.0004474671756536763, 'samples': 6355776, 'steps': 33102, 'loss/train': 1.4003307819366455} 11/07/2021 01:54:46 - INFO - __main__ - Step 33104: {'lr': 0.00044746392111032695, 'samples': 6355968, 'steps': 33103, 'loss/train': 1.2260773181915283} 11/07/2021 01:54:46 - INFO - __main__ - Step 33105: {'lr': 0.00044746066647800343, 'samples': 6356160, 'steps': 33104, 'loss/train': 1.6759220361709595} 11/07/2021 01:54:47 - INFO - __main__ - Step 33106: {'lr': 0.0004474574117567072, 'samples': 6356352, 'steps': 33105, 'loss/train': 1.1456880569458008} 11/07/2021 01:54:47 - INFO - __main__ - Step 33107: {'lr': 0.00044745415694643964, 'samples': 6356544, 'steps': 33106, 'loss/train': 1.1245477199554443} 11/07/2021 01:54:47 - INFO - __main__ - Step 33108: {'lr': 0.0004474509020472023, 'samples': 6356736, 'steps': 33107, 'loss/train': 1.9677975177764893} 11/07/2021 01:54:48 - INFO - __main__ - Step 33109: {'lr': 0.0004474476470589967, 'samples': 6356928, 'steps': 33108, 'loss/train': 1.1387183666229248} 11/07/2021 01:54:49 - INFO - __main__ - Step 33110: {'lr': 0.0004474443919818241, 'samples': 6357120, 'steps': 33109, 'loss/train': 1.4442769289016724} 11/07/2021 01:54:49 - INFO - __main__ - Step 33111: {'lr': 0.0004474411368156862, 'samples': 6357312, 'steps': 33110, 'loss/train': 1.4242744445800781} 11/07/2021 01:54:49 - INFO - __main__ - Step 33112: {'lr': 0.00044743788156058437, 'samples': 6357504, 'steps': 33111, 'loss/train': 1.6928004026412964} 11/07/2021 01:54:50 - INFO - __main__ - Step 33113: {'lr': 0.00044743462621652007, 'samples': 6357696, 'steps': 33112, 'loss/train': 1.145045518875122} 11/07/2021 01:54:51 - INFO - __main__ - Step 33114: {'lr': 0.0004474313707834947, 'samples': 6357888, 'steps': 33113, 'loss/train': 4.484914779663086} 11/07/2021 01:54:51 - INFO - __main__ - Step 33115: {'lr': 0.00044742811526150996, 'samples': 6358080, 'steps': 33114, 'loss/train': 1.396562933921814} 11/07/2021 01:54:51 - INFO - __main__ - Step 33116: {'lr': 0.000447424859650567, 'samples': 6358272, 'steps': 33115, 'loss/train': 1.4449611902236938} 11/07/2021 01:54:52 - INFO - __main__ - Step 33117: {'lr': 0.00044742160395066756, 'samples': 6358464, 'steps': 33116, 'loss/train': 1.3184423446655273} 11/07/2021 01:54:52 - INFO - __main__ - Step 33118: {'lr': 0.0004474183481618129, 'samples': 6358656, 'steps': 33117, 'loss/train': 1.3420227766036987} 11/07/2021 01:54:52 - INFO - __main__ - Step 33119: {'lr': 0.00044741509228400465, 'samples': 6358848, 'steps': 33118, 'loss/train': 5.817997932434082} 11/07/2021 01:54:54 - INFO - __main__ - Step 33120: {'lr': 0.0004474118363172441, 'samples': 6359040, 'steps': 33119, 'loss/train': 1.763843059539795} 11/07/2021 01:54:54 - INFO - __main__ - Step 33121: {'lr': 0.000447408580261533, 'samples': 6359232, 'steps': 33120, 'loss/train': 1.727284550666809} 11/07/2021 01:54:54 - INFO - __main__ - Step 33122: {'lr': 0.0004474053241168725, 'samples': 6359424, 'steps': 33121, 'loss/train': 1.9628920555114746} 11/07/2021 01:54:55 - INFO - __main__ - Step 33123: {'lr': 0.00044740206788326423, 'samples': 6359616, 'steps': 33122, 'loss/train': 1.0621058940887451} 11/07/2021 01:54:55 - INFO - __main__ - Step 33124: {'lr': 0.0004473988115607097, 'samples': 6359808, 'steps': 33123, 'loss/train': 1.2733900547027588} 11/07/2021 01:54:56 - INFO - __main__ - Step 33125: {'lr': 0.00044739555514921025, 'samples': 6360000, 'steps': 33124, 'loss/train': 1.3947138786315918} 11/07/2021 01:54:56 - INFO - __main__ - Step 33126: {'lr': 0.0004473922986487674, 'samples': 6360192, 'steps': 33125, 'loss/train': 1.3780709505081177} 11/07/2021 01:54:57 - INFO - __main__ - Step 33127: {'lr': 0.00044738904205938264, 'samples': 6360384, 'steps': 33126, 'loss/train': 1.715932846069336} 11/07/2021 01:54:57 - INFO - __main__ - Step 33128: {'lr': 0.00044738578538105746, 'samples': 6360576, 'steps': 33127, 'loss/train': 1.3106427192687988} 11/07/2021 01:54:58 - INFO - __main__ - Step 33129: {'lr': 0.0004473825286137933, 'samples': 6360768, 'steps': 33128, 'loss/train': 1.4293633699417114} 11/07/2021 01:54:58 - INFO - __main__ - Step 33130: {'lr': 0.0004473792717575915, 'samples': 6360960, 'steps': 33129, 'loss/train': 1.9495084285736084} 11/07/2021 01:54:59 - INFO - __main__ - Step 33131: {'lr': 0.00044737601481245376, 'samples': 6361152, 'steps': 33130, 'loss/train': 1.7546299695968628} 11/07/2021 01:54:59 - INFO - __main__ - Step 33132: {'lr': 0.00044737275777838136, 'samples': 6361344, 'steps': 33131, 'loss/train': 1.9084562063217163} 11/07/2021 01:55:00 - INFO - __main__ - Step 33133: {'lr': 0.0004473695006553759, 'samples': 6361536, 'steps': 33132, 'loss/train': 1.3284623622894287} 11/07/2021 01:55:00 - INFO - __main__ - Step 33134: {'lr': 0.0004473662434434388, 'samples': 6361728, 'steps': 33133, 'loss/train': 1.762020230293274} 11/07/2021 01:55:00 - INFO - __main__ - Step 33135: {'lr': 0.00044736298614257144, 'samples': 6361920, 'steps': 33134, 'loss/train': 2.226206064224243} 11/07/2021 01:55:01 - INFO - __main__ - Step 33136: {'lr': 0.0004473597287527754, 'samples': 6362112, 'steps': 33135, 'loss/train': 1.3786884546279907} 11/07/2021 01:55:02 - INFO - __main__ - Step 33137: {'lr': 0.00044735647127405216, 'samples': 6362304, 'steps': 33136, 'loss/train': 0.980176568031311} 11/07/2021 01:55:02 - INFO - __main__ - Step 33138: {'lr': 0.00044735321370640316, 'samples': 6362496, 'steps': 33137, 'loss/train': 1.749757170677185} 11/07/2021 01:55:02 - INFO - __main__ - Step 33139: {'lr': 0.00044734995604982973, 'samples': 6362688, 'steps': 33138, 'loss/train': 1.522831916809082} 11/07/2021 01:55:03 - INFO - __main__ - Step 33140: {'lr': 0.0004473466983043335, 'samples': 6362880, 'steps': 33139, 'loss/train': 1.335568904876709} 11/07/2021 01:55:04 - INFO - __main__ - Step 33141: {'lr': 0.0004473434404699159, 'samples': 6363072, 'steps': 33140, 'loss/train': 1.7207343578338623} 11/07/2021 01:55:05 - INFO - __main__ - Step 33142: {'lr': 0.00044734018254657845, 'samples': 6363264, 'steps': 33141, 'loss/train': 1.8754812479019165} 11/07/2021 01:55:05 - INFO - __main__ - Step 33143: {'lr': 0.00044733692453432253, 'samples': 6363456, 'steps': 33142, 'loss/train': 1.7550863027572632} 11/07/2021 01:55:05 - INFO - __main__ - Step 33144: {'lr': 0.00044733366643314956, 'samples': 6363648, 'steps': 33143, 'loss/train': 1.365227222442627} 11/07/2021 01:55:06 - INFO - __main__ - Step 33145: {'lr': 0.00044733040824306117, 'samples': 6363840, 'steps': 33144, 'loss/train': 1.446139931678772} 11/07/2021 01:55:07 - INFO - __main__ - Step 33146: {'lr': 0.00044732714996405866, 'samples': 6364032, 'steps': 33145, 'loss/train': 0.4527989327907562} 11/07/2021 01:55:07 - INFO - __main__ - Step 33147: {'lr': 0.0004473238915961436, 'samples': 6364224, 'steps': 33146, 'loss/train': 1.3928862810134888} 11/07/2021 01:55:08 - INFO - __main__ - Step 33148: {'lr': 0.0004473206331393175, 'samples': 6364416, 'steps': 33147, 'loss/train': 1.408677577972412} 11/07/2021 01:55:08 - INFO - __main__ - Step 33149: {'lr': 0.0004473173745935818, 'samples': 6364608, 'steps': 33148, 'loss/train': 1.4953867197036743} 11/07/2021 01:55:08 - INFO - __main__ - Step 33150: {'lr': 0.00044731411595893785, 'samples': 6364800, 'steps': 33149, 'loss/train': 0.8405914306640625} 11/07/2021 01:55:09 - INFO - __main__ - Step 33151: {'lr': 0.00044731085723538725, 'samples': 6364992, 'steps': 33150, 'loss/train': 1.1372485160827637} 11/07/2021 01:55:10 - INFO - __main__ - Step 33152: {'lr': 0.00044730759842293136, 'samples': 6365184, 'steps': 33151, 'loss/train': 0.7570802569389343} 11/07/2021 01:55:10 - INFO - __main__ - Step 33153: {'lr': 0.0004473043395215718, 'samples': 6365376, 'steps': 33152, 'loss/train': 1.256797432899475} 11/07/2021 01:55:10 - INFO - __main__ - Step 33154: {'lr': 0.00044730108053130986, 'samples': 6365568, 'steps': 33153, 'loss/train': 1.3361729383468628} 11/07/2021 01:55:11 - INFO - __main__ - Step 33155: {'lr': 0.00044729782145214717, 'samples': 6365760, 'steps': 33154, 'loss/train': 1.0276557207107544} 11/07/2021 01:55:12 - INFO - __main__ - Step 33156: {'lr': 0.00044729456228408506, 'samples': 6365952, 'steps': 33155, 'loss/train': 1.8448816537857056} 11/07/2021 01:55:12 - INFO - __main__ - Step 33157: {'lr': 0.00044729130302712504, 'samples': 6366144, 'steps': 33156, 'loss/train': 0.8225948810577393} 11/07/2021 01:55:12 - INFO - __main__ - Step 33158: {'lr': 0.00044728804368126873, 'samples': 6366336, 'steps': 33157, 'loss/train': 1.77656888961792} 11/07/2021 01:55:13 - INFO - __main__ - Step 33159: {'lr': 0.00044728478424651744, 'samples': 6366528, 'steps': 33158, 'loss/train': 0.8608505129814148} 11/07/2021 01:55:13 - INFO - __main__ - Step 33160: {'lr': 0.0004472815247228726, 'samples': 6366720, 'steps': 33159, 'loss/train': 1.0385569334030151} 11/07/2021 01:55:13 - INFO - __main__ - Step 33161: {'lr': 0.00044727826511033577, 'samples': 6366912, 'steps': 33160, 'loss/train': 1.214247226715088} 11/07/2021 01:55:15 - INFO - __main__ - Step 33162: {'lr': 0.0004472750054089084, 'samples': 6367104, 'steps': 33161, 'loss/train': 0.7041410207748413} 11/07/2021 01:55:15 - INFO - __main__ - Step 33163: {'lr': 0.00044727174561859194, 'samples': 6367296, 'steps': 33162, 'loss/train': 1.3713628053665161} 11/07/2021 01:55:15 - INFO - __main__ - Step 33164: {'lr': 0.00044726848573938796, 'samples': 6367488, 'steps': 33163, 'loss/train': 1.7455008029937744} 11/07/2021 01:55:16 - INFO - __main__ - Step 33165: {'lr': 0.0004472652257712978, 'samples': 6367680, 'steps': 33164, 'loss/train': 1.180960774421692} 11/07/2021 01:55:16 - INFO - __main__ - Step 33166: {'lr': 0.0004472619657143229, 'samples': 6367872, 'steps': 33165, 'loss/train': 1.4132847785949707} 11/07/2021 01:55:17 - INFO - __main__ - Step 33167: {'lr': 0.00044725870556846495, 'samples': 6368064, 'steps': 33166, 'loss/train': 1.4858864545822144} 11/07/2021 01:55:17 - INFO - __main__ - Step 33168: {'lr': 0.00044725544533372516, 'samples': 6368256, 'steps': 33167, 'loss/train': 1.4591635465621948} 11/07/2021 01:55:18 - INFO - __main__ - Step 33169: {'lr': 0.00044725218501010514, 'samples': 6368448, 'steps': 33168, 'loss/train': 1.80044686794281} 11/07/2021 01:55:18 - INFO - __main__ - Step 33170: {'lr': 0.0004472489245976063, 'samples': 6368640, 'steps': 33169, 'loss/train': 1.5054649114608765} 11/07/2021 01:55:18 - INFO - __main__ - Step 33171: {'lr': 0.00044724566409623013, 'samples': 6368832, 'steps': 33170, 'loss/train': 1.65577232837677} 11/07/2021 01:55:19 - INFO - __main__ - Step 33172: {'lr': 0.0004472424035059782, 'samples': 6369024, 'steps': 33171, 'loss/train': 1.4333148002624512} 11/07/2021 01:55:20 - INFO - __main__ - Step 33173: {'lr': 0.0004472391428268518, 'samples': 6369216, 'steps': 33172, 'loss/train': 1.6931020021438599} 11/07/2021 01:55:20 - INFO - __main__ - Step 33174: {'lr': 0.00044723588205885254, 'samples': 6369408, 'steps': 33173, 'loss/train': 1.4227957725524902} 11/07/2021 01:55:20 - INFO - __main__ - Step 33175: {'lr': 0.00044723262120198177, 'samples': 6369600, 'steps': 33174, 'loss/train': 1.5991913080215454} 11/07/2021 01:55:21 - INFO - __main__ - Step 33176: {'lr': 0.00044722936025624107, 'samples': 6369792, 'steps': 33175, 'loss/train': 1.453271746635437} 11/07/2021 01:55:21 - INFO - __main__ - Step 33177: {'lr': 0.00044722609922163184, 'samples': 6369984, 'steps': 33176, 'loss/train': 1.264987587928772} 11/07/2021 01:55:22 - INFO - __main__ - Step 33178: {'lr': 0.0004472228380981556, 'samples': 6370176, 'steps': 33177, 'loss/train': 2.117309093475342} 11/07/2021 01:55:23 - INFO - __main__ - Step 33179: {'lr': 0.0004472195768858138, 'samples': 6370368, 'steps': 33178, 'loss/train': 1.6110841035842896} 11/07/2021 01:55:23 - INFO - __main__ - Step 33180: {'lr': 0.0004472163155846078, 'samples': 6370560, 'steps': 33179, 'loss/train': 1.5283021926879883} 11/07/2021 01:55:23 - INFO - __main__ - Step 33181: {'lr': 0.0004472130541945393, 'samples': 6370752, 'steps': 33180, 'loss/train': 1.6061872243881226} 11/07/2021 01:55:24 - INFO - __main__ - Step 33182: {'lr': 0.00044720979271560963, 'samples': 6370944, 'steps': 33181, 'loss/train': 1.4953434467315674} 11/07/2021 01:55:25 - INFO - __main__ - Step 33183: {'lr': 0.00044720653114782024, 'samples': 6371136, 'steps': 33182, 'loss/train': 1.4750101566314697} 11/07/2021 01:55:25 - INFO - __main__ - Step 33184: {'lr': 0.0004472032694911726, 'samples': 6371328, 'steps': 33183, 'loss/train': 1.404825210571289} 11/07/2021 01:55:25 - INFO - __main__ - Step 33185: {'lr': 0.0004472000077456683, 'samples': 6371520, 'steps': 33184, 'loss/train': 1.6009184122085571} 11/07/2021 01:55:26 - INFO - __main__ - Step 33186: {'lr': 0.0004471967459113086, 'samples': 6371712, 'steps': 33185, 'loss/train': 0.953913688659668} 11/07/2021 01:55:26 - INFO - __main__ - Step 33187: {'lr': 0.0004471934839880951, 'samples': 6371904, 'steps': 33186, 'loss/train': 1.4461911916732788} 11/07/2021 01:55:27 - INFO - __main__ - Step 33188: {'lr': 0.00044719022197602933, 'samples': 6372096, 'steps': 33187, 'loss/train': 1.843044400215149} 11/07/2021 01:55:27 - INFO - __main__ - Step 33189: {'lr': 0.0004471869598751127, 'samples': 6372288, 'steps': 33188, 'loss/train': 1.5555542707443237} 11/07/2021 01:55:28 - INFO - __main__ - Step 33190: {'lr': 0.0004471836976853466, 'samples': 6372480, 'steps': 33189, 'loss/train': 1.6003851890563965} 11/07/2021 01:55:28 - INFO - __main__ - Step 33191: {'lr': 0.00044718043540673257, 'samples': 6372672, 'steps': 33190, 'loss/train': 1.7337013483047485} 11/07/2021 01:55:29 - INFO - __main__ - Step 33192: {'lr': 0.0004471771730392722, 'samples': 6372864, 'steps': 33191, 'loss/train': 1.5682861804962158} 11/07/2021 01:55:29 - INFO - __main__ - Step 33193: {'lr': 0.0004471739105829667, 'samples': 6373056, 'steps': 33192, 'loss/train': 1.4911301136016846} 11/07/2021 01:55:30 - INFO - __main__ - Step 33194: {'lr': 0.00044717064803781773, 'samples': 6373248, 'steps': 33193, 'loss/train': 1.4212634563446045} 11/07/2021 01:55:30 - INFO - __main__ - Step 33195: {'lr': 0.00044716738540382674, 'samples': 6373440, 'steps': 33194, 'loss/train': 1.1901922225952148} 11/07/2021 01:55:31 - INFO - __main__ - Step 33196: {'lr': 0.0004471641226809951, 'samples': 6373632, 'steps': 33195, 'loss/train': 1.32235848903656} 11/07/2021 01:55:31 - INFO - __main__ - Step 33197: {'lr': 0.0004471608598693244, 'samples': 6373824, 'steps': 33196, 'loss/train': 1.7602043151855469} 11/07/2021 01:55:32 - INFO - __main__ - Step 33198: {'lr': 0.000447157596968816, 'samples': 6374016, 'steps': 33197, 'loss/train': 1.5535030364990234} 11/07/2021 01:55:32 - INFO - __main__ - Step 33199: {'lr': 0.0004471543339794715, 'samples': 6374208, 'steps': 33198, 'loss/train': 1.2976843118667603} 11/07/2021 01:55:33 - INFO - __main__ - Step 33200: {'lr': 0.00044715107090129223, 'samples': 6374400, 'steps': 33199, 'loss/train': 1.5983165502548218} 11/07/2021 01:55:33 - INFO - __main__ - Step 33201: {'lr': 0.00044714780773427975, 'samples': 6374592, 'steps': 33200, 'loss/train': 1.6424078941345215} 11/07/2021 01:55:33 - INFO - __main__ - Step 33202: {'lr': 0.00044714454447843555, 'samples': 6374784, 'steps': 33201, 'loss/train': 1.4659985303878784} 11/07/2021 01:55:34 - INFO - __main__ - Step 33203: {'lr': 0.0004471412811337611, 'samples': 6374976, 'steps': 33202, 'loss/train': 1.6096456050872803} 11/07/2021 01:55:35 - INFO - __main__ - Step 33204: {'lr': 0.00044713801770025774, 'samples': 6375168, 'steps': 33203, 'loss/train': 1.6545803546905518} 11/07/2021 01:55:35 - INFO - __main__ - Step 33205: {'lr': 0.00044713475417792705, 'samples': 6375360, 'steps': 33204, 'loss/train': 1.561865210533142} 11/07/2021 01:55:36 - INFO - __main__ - Step 33206: {'lr': 0.0004471314905667705, 'samples': 6375552, 'steps': 33205, 'loss/train': 1.2108513116836548} 11/07/2021 01:55:36 - INFO - __main__ - Step 33207: {'lr': 0.00044712822686678955, 'samples': 6375744, 'steps': 33206, 'loss/train': 1.668056607246399} 11/07/2021 01:55:36 - INFO - __main__ - Step 33208: {'lr': 0.00044712496307798566, 'samples': 6375936, 'steps': 33207, 'loss/train': 1.120060682296753} 11/07/2021 01:55:37 - INFO - __main__ - Step 33209: {'lr': 0.0004471216992003603, 'samples': 6376128, 'steps': 33208, 'loss/train': 1.4917442798614502} 11/07/2021 01:55:38 - INFO - __main__ - Step 33210: {'lr': 0.0004471184352339149, 'samples': 6376320, 'steps': 33209, 'loss/train': 1.4822920560836792} 11/07/2021 01:55:38 - INFO - __main__ - Step 33211: {'lr': 0.00044711517117865105, 'samples': 6376512, 'steps': 33210, 'loss/train': 1.4753338098526} 11/07/2021 01:55:38 - INFO - __main__ - Step 33212: {'lr': 0.00044711190703457005, 'samples': 6376704, 'steps': 33211, 'loss/train': 1.2352567911148071} 11/07/2021 01:55:39 - INFO - __main__ - Step 33213: {'lr': 0.00044710864280167353, 'samples': 6376896, 'steps': 33212, 'loss/train': 1.4942405223846436} 11/07/2021 01:55:40 - INFO - __main__ - Step 33214: {'lr': 0.0004471053784799629, 'samples': 6377088, 'steps': 33213, 'loss/train': 1.5369149446487427} 11/07/2021 01:55:40 - INFO - __main__ - Step 33215: {'lr': 0.0004471021140694396, 'samples': 6377280, 'steps': 33214, 'loss/train': 1.7257037162780762} 11/07/2021 01:55:40 - INFO - __main__ - Step 33216: {'lr': 0.0004470988495701052, 'samples': 6377472, 'steps': 33215, 'loss/train': 1.591020107269287} 11/07/2021 01:55:41 - INFO - __main__ - Step 33217: {'lr': 0.00044709558498196104, 'samples': 6377664, 'steps': 33216, 'loss/train': 1.453142523765564} 11/07/2021 01:55:41 - INFO - __main__ - Step 33218: {'lr': 0.00044709232030500865, 'samples': 6377856, 'steps': 33217, 'loss/train': 1.6254675388336182} 11/07/2021 01:55:42 - INFO - __main__ - Step 33219: {'lr': 0.0004470890555392495, 'samples': 6378048, 'steps': 33218, 'loss/train': 1.5756146907806396} 11/07/2021 01:55:42 - INFO - __main__ - Step 33220: {'lr': 0.00044708579068468505, 'samples': 6378240, 'steps': 33219, 'loss/train': 1.5764384269714355} 11/07/2021 01:55:43 - INFO - __main__ - Step 33221: {'lr': 0.0004470825257413168, 'samples': 6378432, 'steps': 33220, 'loss/train': 1.4939641952514648} 11/07/2021 01:55:43 - INFO - __main__ - Step 33222: {'lr': 0.00044707926070914624, 'samples': 6378624, 'steps': 33221, 'loss/train': 1.782503604888916} 11/07/2021 01:55:43 - INFO - __main__ - Step 33223: {'lr': 0.0004470759955881748, 'samples': 6378816, 'steps': 33222, 'loss/train': 1.5841856002807617} 11/07/2021 01:55:44 - INFO - __main__ - Step 33224: {'lr': 0.0004470727303784039, 'samples': 6379008, 'steps': 33223, 'loss/train': 1.5153917074203491} 11/07/2021 01:55:45 - INFO - __main__ - Step 33225: {'lr': 0.00044706946507983513, 'samples': 6379200, 'steps': 33224, 'loss/train': 1.5163484811782837} 11/07/2021 01:55:45 - INFO - __main__ - Step 33226: {'lr': 0.00044706619969246984, 'samples': 6379392, 'steps': 33225, 'loss/train': 1.426256775856018} 11/07/2021 01:55:46 - INFO - __main__ - Step 33227: {'lr': 0.0004470629342163096, 'samples': 6379584, 'steps': 33226, 'loss/train': 1.8674992322921753} 11/07/2021 01:55:46 - INFO - __main__ - Step 33228: {'lr': 0.00044705966865135583, 'samples': 6379776, 'steps': 33227, 'loss/train': 2.1757476329803467} 11/07/2021 01:55:47 - INFO - __main__ - Step 33229: {'lr': 0.00044705640299761004, 'samples': 6379968, 'steps': 33228, 'loss/train': 1.5316298007965088} 11/07/2021 01:55:47 - INFO - __main__ - Step 33230: {'lr': 0.0004470531372550736, 'samples': 6380160, 'steps': 33229, 'loss/train': 1.7049391269683838} 11/07/2021 01:55:48 - INFO - __main__ - Step 33231: {'lr': 0.00044704987142374814, 'samples': 6380352, 'steps': 33230, 'loss/train': 1.018943428993225} 11/07/2021 01:55:48 - INFO - __main__ - Step 33232: {'lr': 0.00044704660550363507, 'samples': 6380544, 'steps': 33231, 'loss/train': 1.9485957622528076} 11/07/2021 01:55:48 - INFO - __main__ - Step 33233: {'lr': 0.00044704333949473576, 'samples': 6380736, 'steps': 33232, 'loss/train': 1.3192261457443237} 11/07/2021 01:55:49 - INFO - __main__ - Step 33234: {'lr': 0.0004470400733970518, 'samples': 6380928, 'steps': 33233, 'loss/train': 1.5830713510513306} 11/07/2021 01:55:50 - INFO - __main__ - Step 33235: {'lr': 0.0004470368072105846, 'samples': 6381120, 'steps': 33234, 'loss/train': 1.4584518671035767} 11/07/2021 01:55:50 - INFO - __main__ - Step 33236: {'lr': 0.00044703354093533564, 'samples': 6381312, 'steps': 33235, 'loss/train': 1.4330943822860718} 11/07/2021 01:55:50 - INFO - __main__ - Step 33237: {'lr': 0.0004470302745713065, 'samples': 6381504, 'steps': 33236, 'loss/train': 1.4370431900024414} 11/07/2021 01:55:51 - INFO - __main__ - Step 33238: {'lr': 0.0004470270081184985, 'samples': 6381696, 'steps': 33237, 'loss/train': 1.5408488512039185} 11/07/2021 01:55:51 - INFO - __main__ - Step 33239: {'lr': 0.00044702374157691316, 'samples': 6381888, 'steps': 33238, 'loss/train': 1.9147197008132935} 11/07/2021 01:55:52 - INFO - __main__ - Step 33240: {'lr': 0.00044702047494655194, 'samples': 6382080, 'steps': 33239, 'loss/train': 1.4002445936203003} 11/07/2021 01:55:53 - INFO - __main__ - Step 33241: {'lr': 0.0004470172082274164, 'samples': 6382272, 'steps': 33240, 'loss/train': 1.276917815208435} 11/07/2021 01:55:53 - INFO - __main__ - Step 33242: {'lr': 0.0004470139414195079, 'samples': 6382464, 'steps': 33241, 'loss/train': 1.0814828872680664} 11/07/2021 01:55:53 - INFO - __main__ - Step 33243: {'lr': 0.00044701067452282796, 'samples': 6382656, 'steps': 33242, 'loss/train': 0.8401081562042236} 11/07/2021 01:55:54 - INFO - __main__ - Step 33244: {'lr': 0.00044700740753737806, 'samples': 6382848, 'steps': 33243, 'loss/train': 1.2903378009796143} 11/07/2021 01:55:55 - INFO - __main__ - Step 33245: {'lr': 0.0004470041404631597, 'samples': 6383040, 'steps': 33244, 'loss/train': 0.9164215922355652} 11/07/2021 01:55:55 - INFO - __main__ - Step 33246: {'lr': 0.0004470008733001742, 'samples': 6383232, 'steps': 33245, 'loss/train': 1.027038335800171} 11/07/2021 01:55:55 - INFO - __main__ - Step 33247: {'lr': 0.0004469976060484233, 'samples': 6383424, 'steps': 33246, 'loss/train': 1.533467411994934} 11/07/2021 01:55:56 - INFO - __main__ - Step 33248: {'lr': 0.00044699433870790817, 'samples': 6383616, 'steps': 33247, 'loss/train': 1.3193182945251465} 11/07/2021 01:55:56 - INFO - __main__ - Step 33249: {'lr': 0.00044699107127863056, 'samples': 6383808, 'steps': 33248, 'loss/train': 1.3086721897125244} 11/07/2021 01:55:58 - INFO - __main__ - Step 33250: {'lr': 0.0004469878037605917, 'samples': 6384000, 'steps': 33249, 'loss/train': 1.1139196157455444} 11/07/2021 01:55:58 - INFO - __main__ - Step 33251: {'lr': 0.0004469845361537933, 'samples': 6384192, 'steps': 33250, 'loss/train': 1.488115668296814} 11/07/2021 01:55:58 - INFO - __main__ - Step 33252: {'lr': 0.0004469812684582366, 'samples': 6384384, 'steps': 33251, 'loss/train': 1.7786945104599} 11/07/2021 01:55:59 - INFO - __main__ - Step 33253: {'lr': 0.00044697800067392327, 'samples': 6384576, 'steps': 33252, 'loss/train': 1.573535680770874} 11/07/2021 01:55:59 - INFO - __main__ - Step 33254: {'lr': 0.00044697473280085455, 'samples': 6384768, 'steps': 33253, 'loss/train': 1.5784077644348145} 11/07/2021 01:55:59 - INFO - __main__ - Step 33255: {'lr': 0.0004469714648390322, 'samples': 6384960, 'steps': 33254, 'loss/train': 1.598545789718628} 11/07/2021 01:56:01 - INFO - __main__ - Step 33256: {'lr': 0.00044696819678845744, 'samples': 6385152, 'steps': 33255, 'loss/train': 1.2595524787902832} 11/07/2021 01:56:01 - INFO - __main__ - Step 33257: {'lr': 0.000446964928649132, 'samples': 6385344, 'steps': 33256, 'loss/train': 1.6089627742767334} 11/07/2021 01:56:01 - INFO - __main__ - Step 33258: {'lr': 0.00044696166042105704, 'samples': 6385536, 'steps': 33257, 'loss/train': 1.3691866397857666} 11/07/2021 01:56:02 - INFO - __main__ - Step 33259: {'lr': 0.0004469583921042343, 'samples': 6385728, 'steps': 33258, 'loss/train': 0.2382746934890747} 11/07/2021 01:56:02 - INFO - __main__ - Step 33260: {'lr': 0.0004469551236986651, 'samples': 6385920, 'steps': 33259, 'loss/train': 1.6525382995605469} 11/07/2021 01:56:03 - INFO - __main__ - Step 33261: {'lr': 0.00044695185520435087, 'samples': 6386112, 'steps': 33260, 'loss/train': 1.2914975881576538} 11/07/2021 01:56:03 - INFO - __main__ - Step 33262: {'lr': 0.00044694858662129333, 'samples': 6386304, 'steps': 33261, 'loss/train': 1.8371024131774902} 11/07/2021 01:56:04 - INFO - __main__ - Step 33263: {'lr': 0.0004469453179494938, 'samples': 6386496, 'steps': 33262, 'loss/train': 1.6654703617095947} 11/07/2021 01:56:04 - INFO - __main__ - Step 33264: {'lr': 0.00044694204918895367, 'samples': 6386688, 'steps': 33263, 'loss/train': 1.9074795246124268} 11/07/2021 01:56:04 - INFO - __main__ - Step 33265: {'lr': 0.0004469387803396745, 'samples': 6386880, 'steps': 33264, 'loss/train': 1.5051642656326294} 11/07/2021 01:56:05 - INFO - __main__ - Step 33266: {'lr': 0.0004469355114016577, 'samples': 6387072, 'steps': 33265, 'loss/train': 1.4132717847824097} 11/07/2021 01:56:06 - INFO - __main__ - Step 33267: {'lr': 0.00044693224237490485, 'samples': 6387264, 'steps': 33266, 'loss/train': 1.890087366104126} 11/07/2021 01:56:06 - INFO - __main__ - Step 33268: {'lr': 0.00044692897325941737, 'samples': 6387456, 'steps': 33267, 'loss/train': 1.05165433883667} 11/07/2021 01:56:06 - INFO - __main__ - Step 33269: {'lr': 0.00044692570405519683, 'samples': 6387648, 'steps': 33268, 'loss/train': 1.416410207748413} 11/07/2021 01:56:07 - INFO - __main__ - Step 33270: {'lr': 0.0004469224347622445, 'samples': 6387840, 'steps': 33269, 'loss/train': 1.4195667505264282} 11/07/2021 01:56:08 - INFO - __main__ - Step 33271: {'lr': 0.000446919165380562, 'samples': 6388032, 'steps': 33270, 'loss/train': 1.4561563730239868} 11/07/2021 01:56:08 - INFO - __main__ - Step 33272: {'lr': 0.0004469158959101507, 'samples': 6388224, 'steps': 33271, 'loss/train': 1.471016764640808} 11/07/2021 01:56:09 - INFO - __main__ - Step 33273: {'lr': 0.00044691262635101223, 'samples': 6388416, 'steps': 33272, 'loss/train': 1.933846354484558} 11/07/2021 01:56:09 - INFO - __main__ - Step 33274: {'lr': 0.0004469093567031479, 'samples': 6388608, 'steps': 33273, 'loss/train': 0.4682588577270508} 11/07/2021 01:56:09 - INFO - __main__ - Step 33275: {'lr': 0.00044690608696655923, 'samples': 6388800, 'steps': 33274, 'loss/train': 0.1448262631893158} 11/07/2021 01:56:10 - INFO - __main__ - Step 33276: {'lr': 0.0004469028171412478, 'samples': 6388992, 'steps': 33275, 'loss/train': 1.8468519449234009} 11/07/2021 01:56:11 - INFO - __main__ - Step 33277: {'lr': 0.00044689954722721494, 'samples': 6389184, 'steps': 33276, 'loss/train': 1.537747859954834} 11/07/2021 01:56:11 - INFO - __main__ - Step 33278: {'lr': 0.0004468962772244622, 'samples': 6389376, 'steps': 33277, 'loss/train': 1.2993748188018799} 11/07/2021 01:56:11 - INFO - __main__ - Step 33279: {'lr': 0.00044689300713299105, 'samples': 6389568, 'steps': 33278, 'loss/train': 2.205153465270996} 11/07/2021 01:56:12 - INFO - __main__ - Step 33280: {'lr': 0.0004468897369528029, 'samples': 6389760, 'steps': 33279, 'loss/train': 1.2932888269424438} 11/07/2021 01:56:12 - INFO - __main__ - Step 33281: {'lr': 0.00044688646668389933, 'samples': 6389952, 'steps': 33280, 'loss/train': 1.5683271884918213} 11/07/2021 01:56:13 - INFO - __main__ - Step 33282: {'lr': 0.0004468831963262817, 'samples': 6390144, 'steps': 33281, 'loss/train': 1.5767319202423096} 11/07/2021 01:56:14 - INFO - __main__ - Step 33283: {'lr': 0.00044687992587995155, 'samples': 6390336, 'steps': 33282, 'loss/train': 1.4285104274749756} 11/07/2021 01:56:14 - INFO - __main__ - Step 33284: {'lr': 0.0004468766553449104, 'samples': 6390528, 'steps': 33283, 'loss/train': 1.5589231252670288} 11/07/2021 01:56:14 - INFO - __main__ - Step 33285: {'lr': 0.00044687338472115964, 'samples': 6390720, 'steps': 33284, 'loss/train': 1.4872294664382935} 11/07/2021 01:56:15 - INFO - __main__ - Step 33286: {'lr': 0.00044687011400870074, 'samples': 6390912, 'steps': 33285, 'loss/train': 0.994925856590271} 11/07/2021 01:56:16 - INFO - __main__ - Step 33287: {'lr': 0.00044686684320753524, 'samples': 6391104, 'steps': 33286, 'loss/train': 1.6664528846740723} 11/07/2021 01:56:16 - INFO - __main__ - Step 33288: {'lr': 0.00044686357231766454, 'samples': 6391296, 'steps': 33287, 'loss/train': 1.7071375846862793} 11/07/2021 01:56:16 - INFO - __main__ - Step 33289: {'lr': 0.00044686030133909017, 'samples': 6391488, 'steps': 33288, 'loss/train': 1.7450517416000366} 11/07/2021 01:56:17 - INFO - __main__ - Step 33290: {'lr': 0.00044685703027181364, 'samples': 6391680, 'steps': 33289, 'loss/train': 1.8223915100097656} 11/07/2021 01:56:17 - INFO - __main__ - Step 33291: {'lr': 0.0004468537591158363, 'samples': 6391872, 'steps': 33290, 'loss/train': 1.4068268537521362} 11/07/2021 01:56:18 - INFO - __main__ - Step 33292: {'lr': 0.0004468504878711597, 'samples': 6392064, 'steps': 33291, 'loss/train': 1.7510511875152588} 11/07/2021 01:56:18 - INFO - __main__ - Step 33293: {'lr': 0.00044684721653778537, 'samples': 6392256, 'steps': 33292, 'loss/train': 1.8125139474868774} 11/07/2021 01:56:19 - INFO - __main__ - Step 33294: {'lr': 0.00044684394511571463, 'samples': 6392448, 'steps': 33293, 'loss/train': 1.2472978830337524} 11/07/2021 01:56:19 - INFO - __main__ - Step 33295: {'lr': 0.00044684067360494905, 'samples': 6392640, 'steps': 33294, 'loss/train': 1.848178744316101} 11/07/2021 01:56:19 - INFO - __main__ - Step 33296: {'lr': 0.00044683740200549015, 'samples': 6392832, 'steps': 33295, 'loss/train': 1.204787254333496} 11/07/2021 01:56:20 - INFO - __main__ - Step 33297: {'lr': 0.00044683413031733945, 'samples': 6393024, 'steps': 33296, 'loss/train': 0.9038345813751221} 11/07/2021 01:56:21 - INFO - __main__ - Step 33298: {'lr': 0.00044683085854049814, 'samples': 6393216, 'steps': 33297, 'loss/train': 1.6067466735839844} 11/07/2021 01:56:21 - INFO - __main__ - Step 33299: {'lr': 0.00044682758667496806, 'samples': 6393408, 'steps': 33298, 'loss/train': 1.0957818031311035} 11/07/2021 01:56:22 - INFO - __main__ - Step 33300: {'lr': 0.00044682431472075035, 'samples': 6393600, 'steps': 33299, 'loss/train': 1.1766695976257324} 11/07/2021 01:56:22 - INFO - __main__ - Step 33301: {'lr': 0.00044682104267784674, 'samples': 6393792, 'steps': 33300, 'loss/train': 1.4583467245101929} 11/07/2021 01:56:23 - INFO - __main__ - Step 33302: {'lr': 0.0004468177705462585, 'samples': 6393984, 'steps': 33301, 'loss/train': 1.3737915754318237} 11/07/2021 01:56:23 - INFO - __main__ - Step 33303: {'lr': 0.0004468144983259873, 'samples': 6394176, 'steps': 33302, 'loss/train': 1.6026164293289185} 11/07/2021 01:56:24 - INFO - __main__ - Step 33304: {'lr': 0.0004468112260170345, 'samples': 6394368, 'steps': 33303, 'loss/train': 1.5704874992370605} 11/07/2021 01:56:24 - INFO - __main__ - Step 33305: {'lr': 0.0004468079536194016, 'samples': 6394560, 'steps': 33304, 'loss/train': 1.5559061765670776} 11/07/2021 01:56:24 - INFO - __main__ - Step 33306: {'lr': 0.00044680468113309006, 'samples': 6394752, 'steps': 33305, 'loss/train': 1.0961544513702393} 11/07/2021 01:56:25 - INFO - __main__ - Step 33307: {'lr': 0.0004468014085581014, 'samples': 6394944, 'steps': 33306, 'loss/train': 2.8458991050720215} 11/07/2021 01:56:26 - INFO - __main__ - Step 33308: {'lr': 0.0004467981358944371, 'samples': 6395136, 'steps': 33307, 'loss/train': 1.6939728260040283} 11/07/2021 01:56:26 - INFO - __main__ - Step 33309: {'lr': 0.0004467948631420985, 'samples': 6395328, 'steps': 33308, 'loss/train': 1.5496588945388794} 11/07/2021 01:56:26 - INFO - __main__ - Step 33310: {'lr': 0.0004467915903010872, 'samples': 6395520, 'steps': 33309, 'loss/train': 1.1974292993545532} 11/07/2021 01:56:27 - INFO - __main__ - Step 33311: {'lr': 0.0004467883173714047, 'samples': 6395712, 'steps': 33310, 'loss/train': 1.6529051065444946} 11/07/2021 01:56:27 - INFO - __main__ - Step 33312: {'lr': 0.0004467850443530523, 'samples': 6395904, 'steps': 33311, 'loss/train': 1.399214267730713} 11/07/2021 01:56:28 - INFO - __main__ - Step 33313: {'lr': 0.0004467817712460317, 'samples': 6396096, 'steps': 33312, 'loss/train': 1.6385629177093506} 11/07/2021 01:56:28 - INFO - __main__ - Step 33314: {'lr': 0.00044677849805034424, 'samples': 6396288, 'steps': 33313, 'loss/train': 1.6142476797103882} 11/07/2021 01:56:29 - INFO - __main__ - Step 33315: {'lr': 0.0004467752247659914, 'samples': 6396480, 'steps': 33314, 'loss/train': 1.3866982460021973} 11/07/2021 01:56:29 - INFO - __main__ - Step 33316: {'lr': 0.00044677195139297476, 'samples': 6396672, 'steps': 33315, 'loss/train': 1.5994012355804443} 11/07/2021 01:56:29 - INFO - __main__ - Step 33317: {'lr': 0.00044676867793129574, 'samples': 6396864, 'steps': 33316, 'loss/train': 1.6208961009979248} 11/07/2021 01:56:30 - INFO - __main__ - Step 33318: {'lr': 0.00044676540438095565, 'samples': 6397056, 'steps': 33317, 'loss/train': 1.0598540306091309} 11/07/2021 01:56:31 - INFO - __main__ - Step 33319: {'lr': 0.0004467621307419562, 'samples': 6397248, 'steps': 33318, 'loss/train': 1.4713943004608154} 11/07/2021 01:56:31 - INFO - __main__ - Step 33320: {'lr': 0.00044675885701429873, 'samples': 6397440, 'steps': 33319, 'loss/train': 1.3172155618667603} 11/07/2021 01:56:32 - INFO - __main__ - Step 33321: {'lr': 0.00044675558319798477, 'samples': 6397632, 'steps': 33320, 'loss/train': 1.5222651958465576} 11/07/2021 01:56:32 - INFO - __main__ - Step 33322: {'lr': 0.00044675230929301575, 'samples': 6397824, 'steps': 33321, 'loss/train': 1.2951338291168213} 11/07/2021 01:56:33 - INFO - __main__ - Step 33323: {'lr': 0.0004467490352993932, 'samples': 6398016, 'steps': 33322, 'loss/train': 1.2071497440338135} 11/07/2021 01:56:33 - INFO - __main__ - Step 33324: {'lr': 0.00044674576121711855, 'samples': 6398208, 'steps': 33323, 'loss/train': 1.645951509475708} 11/07/2021 01:56:34 - INFO - __main__ - Step 33325: {'lr': 0.00044674248704619333, 'samples': 6398400, 'steps': 33324, 'loss/train': 1.4131158590316772} 11/07/2021 01:56:34 - INFO - __main__ - Step 33326: {'lr': 0.000446739212786619, 'samples': 6398592, 'steps': 33325, 'loss/train': 1.915831208229065} 11/07/2021 01:56:34 - INFO - __main__ - Step 33327: {'lr': 0.000446735938438397, 'samples': 6398784, 'steps': 33326, 'loss/train': 0.787378191947937} 11/07/2021 01:56:36 - INFO - __main__ - Step 33328: {'lr': 0.0004467326640015288, 'samples': 6398976, 'steps': 33327, 'loss/train': 1.6845728158950806} 11/07/2021 01:56:36 - INFO - __main__ - Step 33329: {'lr': 0.00044672938947601593, 'samples': 6399168, 'steps': 33328, 'loss/train': 1.1684160232543945} 11/07/2021 01:56:36 - INFO - __main__ - Step 33330: {'lr': 0.00044672611486185976, 'samples': 6399360, 'steps': 33329, 'loss/train': 0.7616485953330994} 11/07/2021 01:56:37 - INFO - __main__ - Step 33331: {'lr': 0.0004467228401590619, 'samples': 6399552, 'steps': 33330, 'loss/train': 1.2289947271347046} 11/07/2021 01:56:37 - INFO - __main__ - Step 33332: {'lr': 0.00044671956536762375, 'samples': 6399744, 'steps': 33331, 'loss/train': 0.993565022945404} 11/07/2021 01:56:38 - INFO - __main__ - Step 33333: {'lr': 0.00044671629048754683, 'samples': 6399936, 'steps': 33332, 'loss/train': 1.4252376556396484} 11/07/2021 01:56:38 - INFO - __main__ - Step 33334: {'lr': 0.00044671301551883253, 'samples': 6400128, 'steps': 33333, 'loss/train': 1.616149663925171} 11/07/2021 01:56:39 - INFO - __main__ - Step 33335: {'lr': 0.0004467097404614824, 'samples': 6400320, 'steps': 33334, 'loss/train': 1.5558617115020752} 11/07/2021 01:56:39 - INFO - __main__ - Step 33336: {'lr': 0.0004467064653154979, 'samples': 6400512, 'steps': 33335, 'loss/train': 1.4129722118377686} 11/07/2021 01:56:40 - INFO - __main__ - Step 33337: {'lr': 0.0004467031900808805, 'samples': 6400704, 'steps': 33336, 'loss/train': 1.6587660312652588} 11/07/2021 01:56:41 - INFO - __main__ - Step 33338: {'lr': 0.00044669991475763173, 'samples': 6400896, 'steps': 33337, 'loss/train': 1.0887442827224731} 11/07/2021 01:56:41 - INFO - __main__ - Step 33339: {'lr': 0.00044669663934575294, 'samples': 6401088, 'steps': 33338, 'loss/train': 1.6212620735168457} 11/07/2021 01:56:41 - INFO - __main__ - Step 33340: {'lr': 0.0004466933638452457, 'samples': 6401280, 'steps': 33339, 'loss/train': 1.008997917175293} 11/07/2021 01:56:42 - INFO - __main__ - Step 33341: {'lr': 0.0004466900882561115, 'samples': 6401472, 'steps': 33340, 'loss/train': 1.5845059156417847} 11/07/2021 01:56:42 - INFO - __main__ - Step 33342: {'lr': 0.00044668681257835173, 'samples': 6401664, 'steps': 33341, 'loss/train': 1.473926305770874} 11/07/2021 01:56:43 - INFO - __main__ - Step 33343: {'lr': 0.00044668353681196794, 'samples': 6401856, 'steps': 33342, 'loss/train': 1.2813016176223755} 11/07/2021 01:56:43 - INFO - __main__ - Step 33344: {'lr': 0.0004466802609569616, 'samples': 6402048, 'steps': 33343, 'loss/train': 1.5135631561279297} 11/07/2021 01:56:44 - INFO - __main__ - Step 33345: {'lr': 0.00044667698501333415, 'samples': 6402240, 'steps': 33344, 'loss/train': 1.1744544506072998} 11/07/2021 01:56:44 - INFO - __main__ - Step 33346: {'lr': 0.0004466737089810871, 'samples': 6402432, 'steps': 33345, 'loss/train': 1.721977710723877} 11/07/2021 01:56:44 - INFO - __main__ - Step 33347: {'lr': 0.00044667043286022193, 'samples': 6402624, 'steps': 33346, 'loss/train': 1.3514186143875122} 11/07/2021 01:56:45 - INFO - __main__ - Step 33348: {'lr': 0.00044666715665074, 'samples': 6402816, 'steps': 33347, 'loss/train': 0.6177819967269897} 11/07/2021 01:56:46 - INFO - __main__ - Step 33349: {'lr': 0.0004466638803526429, 'samples': 6403008, 'steps': 33348, 'loss/train': 1.8638166189193726} 11/07/2021 01:56:46 - INFO - __main__ - Step 33350: {'lr': 0.0004466606039659322, 'samples': 6403200, 'steps': 33349, 'loss/train': 1.4739454984664917} 11/07/2021 01:56:47 - INFO - __main__ - Step 33351: {'lr': 0.0004466573274906092, 'samples': 6403392, 'steps': 33350, 'loss/train': 1.2804734706878662} 11/07/2021 01:56:47 - INFO - __main__ - Step 33352: {'lr': 0.0004466540509266754, 'samples': 6403584, 'steps': 33351, 'loss/train': 1.2557836771011353} 11/07/2021 01:56:48 - INFO - __main__ - Step 33353: {'lr': 0.0004466507742741325, 'samples': 6403776, 'steps': 33352, 'loss/train': 1.5667165517807007} 11/07/2021 01:56:48 - INFO - __main__ - Step 33354: {'lr': 0.0004466474975329816, 'samples': 6403968, 'steps': 33353, 'loss/train': 1.852150797843933} 11/07/2021 01:56:48 - INFO - __main__ - Step 33355: {'lr': 0.0004466442207032244, 'samples': 6404160, 'steps': 33354, 'loss/train': 1.2369035482406616} 11/07/2021 01:56:49 - INFO - __main__ - Step 33356: {'lr': 0.00044664094378486243, 'samples': 6404352, 'steps': 33355, 'loss/train': 1.349674940109253} 11/07/2021 01:56:49 - INFO - __main__ - Step 33357: {'lr': 0.00044663766677789706, 'samples': 6404544, 'steps': 33356, 'loss/train': 2.1358084678649902} 11/07/2021 01:56:50 - INFO - __main__ - Step 33358: {'lr': 0.0004466343896823297, 'samples': 6404736, 'steps': 33357, 'loss/train': 1.5977809429168701} 11/07/2021 01:56:50 - INFO - __main__ - Step 33359: {'lr': 0.000446631112498162, 'samples': 6404928, 'steps': 33358, 'loss/train': 1.3789011240005493} 11/07/2021 01:56:51 - INFO - __main__ - Step 33360: {'lr': 0.0004466278352253954, 'samples': 6405120, 'steps': 33359, 'loss/train': 1.5842740535736084} 11/07/2021 01:56:51 - INFO - __main__ - Step 33361: {'lr': 0.00044662455786403124, 'samples': 6405312, 'steps': 33360, 'loss/train': 1.3825491666793823} 11/07/2021 01:56:52 - INFO - __main__ - Step 33362: {'lr': 0.0004466212804140711, 'samples': 6405504, 'steps': 33361, 'loss/train': 1.7108070850372314} 11/07/2021 01:56:52 - INFO - __main__ - Step 33363: {'lr': 0.00044661800287551653, 'samples': 6405696, 'steps': 33362, 'loss/train': 1.69106924533844} 11/07/2021 01:56:53 - INFO - __main__ - Step 33364: {'lr': 0.00044661472524836886, 'samples': 6405888, 'steps': 33363, 'loss/train': 1.4570802450180054} 11/07/2021 01:56:53 - INFO - __main__ - Step 33365: {'lr': 0.00044661144753262963, 'samples': 6406080, 'steps': 33364, 'loss/train': 1.4948817491531372} 11/07/2021 01:56:54 - INFO - __main__ - Step 33366: {'lr': 0.0004466081697283003, 'samples': 6406272, 'steps': 33365, 'loss/train': 0.9278396368026733} 11/07/2021 01:56:54 - INFO - __main__ - Step 33367: {'lr': 0.00044660489183538237, 'samples': 6406464, 'steps': 33366, 'loss/train': 1.8919562101364136} 11/07/2021 01:56:54 - INFO - __main__ - Step 33368: {'lr': 0.0004466016138538773, 'samples': 6406656, 'steps': 33367, 'loss/train': 1.572937250137329} 11/07/2021 01:56:56 - INFO - __main__ - Step 33369: {'lr': 0.0004465983357837866, 'samples': 6406848, 'steps': 33368, 'loss/train': 2.0625107288360596} 11/07/2021 01:56:56 - INFO - __main__ - Step 33370: {'lr': 0.00044659505762511176, 'samples': 6407040, 'steps': 33369, 'loss/train': 1.6907017230987549} 11/07/2021 01:56:56 - INFO - __main__ - Step 33371: {'lr': 0.00044659177937785417, 'samples': 6407232, 'steps': 33370, 'loss/train': 1.1050999164581299} 11/07/2021 01:56:57 - INFO - __main__ - Step 33372: {'lr': 0.0004465885010420154, 'samples': 6407424, 'steps': 33371, 'loss/train': 1.249595046043396} 11/07/2021 01:56:57 - INFO - __main__ - Step 33373: {'lr': 0.0004465852226175968, 'samples': 6407616, 'steps': 33372, 'loss/train': 0.7163944840431213} 11/07/2021 01:56:59 - INFO - __main__ - Step 33374: {'lr': 0.00044658194410460004, 'samples': 6407808, 'steps': 33373, 'loss/train': 1.4177186489105225} 11/07/2021 01:56:59 - INFO - __main__ - Step 33375: {'lr': 0.0004465786655030264, 'samples': 6408000, 'steps': 33374, 'loss/train': 5.1912384033203125} 11/07/2021 01:56:59 - INFO - __main__ - Step 33376: {'lr': 0.00044657538681287746, 'samples': 6408192, 'steps': 33375, 'loss/train': 4.983016014099121} 11/07/2021 01:57:00 - INFO - __main__ - Step 33377: {'lr': 0.0004465721080341547, 'samples': 6408384, 'steps': 33376, 'loss/train': 4.876257419586182} 11/07/2021 01:57:00 - INFO - __main__ - Step 33378: {'lr': 0.0004465688291668596, 'samples': 6408576, 'steps': 33377, 'loss/train': 1.365555763244629} 11/07/2021 01:57:00 - INFO - __main__ - Step 33379: {'lr': 0.00044656555021099363, 'samples': 6408768, 'steps': 33378, 'loss/train': 1.2451014518737793} 11/07/2021 01:57:01 - INFO - __main__ - Step 33380: {'lr': 0.00044656227116655824, 'samples': 6408960, 'steps': 33379, 'loss/train': 1.671711802482605} 11/07/2021 01:57:02 - INFO - __main__ - Step 33381: {'lr': 0.00044655899203355486, 'samples': 6409152, 'steps': 33380, 'loss/train': 1.3759804964065552} 11/07/2021 01:57:02 - INFO - __main__ - Step 33382: {'lr': 0.0004465557128119852, 'samples': 6409344, 'steps': 33381, 'loss/train': 1.7203022241592407} 11/07/2021 01:57:03 - INFO - __main__ - Step 33383: {'lr': 0.00044655243350185037, 'samples': 6409536, 'steps': 33382, 'loss/train': 1.7072322368621826} 11/07/2021 01:57:03 - INFO - __main__ - Step 33384: {'lr': 0.0004465491541031522, 'samples': 6409728, 'steps': 33383, 'loss/train': 1.633778691291809} 11/07/2021 01:57:03 - INFO - __main__ - Step 33385: {'lr': 0.00044654587461589193, 'samples': 6409920, 'steps': 33384, 'loss/train': 1.545433521270752} 11/07/2021 01:57:04 - INFO - __main__ - Step 33386: {'lr': 0.0004465425950400711, 'samples': 6410112, 'steps': 33385, 'loss/train': 1.7061132192611694} 11/07/2021 01:57:05 - INFO - __main__ - Step 33387: {'lr': 0.00044653931537569125, 'samples': 6410304, 'steps': 33386, 'loss/train': 7.1502532958984375} 11/07/2021 01:57:05 - INFO - __main__ - Step 33388: {'lr': 0.0004465360356227538, 'samples': 6410496, 'steps': 33387, 'loss/train': 1.4677163362503052} 11/07/2021 01:57:05 - INFO - __main__ - Step 33389: {'lr': 0.0004465327557812603, 'samples': 6410688, 'steps': 33388, 'loss/train': 1.8780077695846558} 11/07/2021 01:57:06 - INFO - __main__ - Step 33390: {'lr': 0.0004465294758512121, 'samples': 6410880, 'steps': 33389, 'loss/train': 1.5223056077957153} 11/07/2021 01:57:06 - INFO - __main__ - Step 33391: {'lr': 0.0004465261958326108, 'samples': 6411072, 'steps': 33390, 'loss/train': 2.0901994705200195} 11/07/2021 01:57:07 - INFO - __main__ - Step 33392: {'lr': 0.0004465229157254578, 'samples': 6411264, 'steps': 33391, 'loss/train': 1.741840124130249} 11/07/2021 01:57:07 - INFO - __main__ - Step 33393: {'lr': 0.0004465196355297546, 'samples': 6411456, 'steps': 33392, 'loss/train': 0.1811494380235672} 11/07/2021 01:57:08 - INFO - __main__ - Step 33394: {'lr': 0.0004465163552455027, 'samples': 6411648, 'steps': 33393, 'loss/train': 1.7289592027664185} 11/07/2021 01:57:08 - INFO - __main__ - Step 33395: {'lr': 0.0004465130748727036, 'samples': 6411840, 'steps': 33394, 'loss/train': 1.4100768566131592} 11/07/2021 01:57:09 - INFO - __main__ - Step 33396: {'lr': 0.0004465097944113587, 'samples': 6412032, 'steps': 33395, 'loss/train': 0.8781396746635437} 11/07/2021 01:57:10 - INFO - __main__ - Step 33397: {'lr': 0.00044650651386146954, 'samples': 6412224, 'steps': 33396, 'loss/train': 1.5576465129852295} 11/07/2021 01:57:10 - INFO - __main__ - Step 33398: {'lr': 0.00044650323322303757, 'samples': 6412416, 'steps': 33397, 'loss/train': 1.6613613367080688} 11/07/2021 01:57:11 - INFO - __main__ - Step 33399: {'lr': 0.0004464999524960642, 'samples': 6412608, 'steps': 33398, 'loss/train': 1.6993510723114014} 11/07/2021 01:57:11 - INFO - __main__ - Step 33400: {'lr': 0.0004464966716805511, 'samples': 6412800, 'steps': 33399, 'loss/train': 1.7051008939743042} 11/07/2021 01:57:11 - INFO - __main__ - Step 33401: {'lr': 0.0004464933907764996, 'samples': 6412992, 'steps': 33400, 'loss/train': 1.900296926498413} 11/07/2021 01:57:12 - INFO - __main__ - Step 33402: {'lr': 0.0004464901097839112, 'samples': 6413184, 'steps': 33401, 'loss/train': 0.3372882008552551} 11/07/2021 01:57:13 - INFO - __main__ - Step 33403: {'lr': 0.00044648682870278733, 'samples': 6413376, 'steps': 33402, 'loss/train': 1.684402346611023} 11/07/2021 01:57:13 - INFO - __main__ - Step 33404: {'lr': 0.0004464835475331296, 'samples': 6413568, 'steps': 33403, 'loss/train': 1.8330544233322144} 11/07/2021 01:57:13 - INFO - __main__ - Step 33405: {'lr': 0.0004464802662749394, 'samples': 6413760, 'steps': 33404, 'loss/train': 1.9801530838012695} 11/07/2021 01:57:14 - INFO - __main__ - Step 33406: {'lr': 0.00044647698492821826, 'samples': 6413952, 'steps': 33405, 'loss/train': 1.53224778175354} 11/07/2021 01:57:15 - INFO - __main__ - Step 33407: {'lr': 0.00044647370349296757, 'samples': 6414144, 'steps': 33406, 'loss/train': 1.296061396598816} 11/07/2021 01:57:15 - INFO - __main__ - Step 33408: {'lr': 0.00044647042196918884, 'samples': 6414336, 'steps': 33407, 'loss/train': 1.4821537733078003} 11/07/2021 01:57:16 - INFO - __main__ - Step 33409: {'lr': 0.00044646714035688365, 'samples': 6414528, 'steps': 33408, 'loss/train': 1.12440824508667} 11/07/2021 01:57:16 - INFO - __main__ - Step 33410: {'lr': 0.00044646385865605335, 'samples': 6414720, 'steps': 33409, 'loss/train': 1.8271406888961792} 11/07/2021 01:57:16 - INFO - __main__ - Step 33411: {'lr': 0.0004464605768666995, 'samples': 6414912, 'steps': 33410, 'loss/train': 1.5178637504577637} 11/07/2021 01:57:17 - INFO - __main__ - Step 33412: {'lr': 0.0004464572949888235, 'samples': 6415104, 'steps': 33411, 'loss/train': 1.2276266813278198} 11/07/2021 01:57:18 - INFO - __main__ - Step 33413: {'lr': 0.0004464540130224268, 'samples': 6415296, 'steps': 33412, 'loss/train': 1.6815807819366455} 11/07/2021 01:57:18 - INFO - __main__ - Step 33414: {'lr': 0.0004464507309675111, 'samples': 6415488, 'steps': 33413, 'loss/train': 1.7691391706466675} 11/07/2021 01:57:18 - INFO - __main__ - Step 33415: {'lr': 0.00044644744882407767, 'samples': 6415680, 'steps': 33414, 'loss/train': 2.3698713779449463} 11/07/2021 01:57:19 - INFO - __main__ - Step 33416: {'lr': 0.00044644416659212806, 'samples': 6415872, 'steps': 33415, 'loss/train': 1.6313021183013916} 11/07/2021 01:57:19 - INFO - __main__ - Step 33417: {'lr': 0.00044644088427166375, 'samples': 6416064, 'steps': 33416, 'loss/train': 1.6351815462112427} 11/07/2021 01:57:20 - INFO - __main__ - Step 33418: {'lr': 0.00044643760186268615, 'samples': 6416256, 'steps': 33417, 'loss/train': 1.43039071559906} 11/07/2021 01:57:20 - INFO - __main__ - Step 33419: {'lr': 0.00044643431936519683, 'samples': 6416448, 'steps': 33418, 'loss/train': 1.6958608627319336} 11/07/2021 01:57:21 - INFO - __main__ - Step 33420: {'lr': 0.00044643103677919726, 'samples': 6416640, 'steps': 33419, 'loss/train': 1.557178020477295} 11/07/2021 01:57:21 - INFO - __main__ - Step 33421: {'lr': 0.00044642775410468896, 'samples': 6416832, 'steps': 33420, 'loss/train': 1.6278393268585205} 11/07/2021 01:57:22 - INFO - __main__ - Step 33422: {'lr': 0.00044642447134167316, 'samples': 6417024, 'steps': 33421, 'loss/train': 1.4954309463500977} 11/07/2021 01:57:22 - INFO - __main__ - Step 33423: {'lr': 0.00044642118849015167, 'samples': 6417216, 'steps': 33422, 'loss/train': 1.7011927366256714} 11/07/2021 01:57:23 - INFO - __main__ - Step 33424: {'lr': 0.0004464179055501258, 'samples': 6417408, 'steps': 33423, 'loss/train': 1.4190086126327515} 11/07/2021 01:57:23 - INFO - __main__ - Step 33425: {'lr': 0.00044641462252159705, 'samples': 6417600, 'steps': 33424, 'loss/train': 1.3510175943374634} 11/07/2021 01:57:24 - INFO - __main__ - Step 33426: {'lr': 0.0004464113394045669, 'samples': 6417792, 'steps': 33425, 'loss/train': 2.233720064163208} 11/07/2021 01:57:24 - INFO - __main__ - Step 33427: {'lr': 0.00044640805619903677, 'samples': 6417984, 'steps': 33426, 'loss/train': 1.6520930528640747} 11/07/2021 01:57:25 - INFO - __main__ - Step 33428: {'lr': 0.00044640477290500824, 'samples': 6418176, 'steps': 33427, 'loss/train': 1.526247501373291} 11/07/2021 01:57:25 - INFO - __main__ - Step 33429: {'lr': 0.00044640148952248285, 'samples': 6418368, 'steps': 33428, 'loss/train': 1.2569032907485962} 11/07/2021 01:57:26 - INFO - __main__ - Step 33430: {'lr': 0.00044639820605146184, 'samples': 6418560, 'steps': 33429, 'loss/train': 0.49115127325057983} 11/07/2021 01:57:26 - INFO - __main__ - Step 33431: {'lr': 0.0004463949224919469, 'samples': 6418752, 'steps': 33430, 'loss/train': 1.3409916162490845} 11/07/2021 01:57:26 - INFO - __main__ - Step 33432: {'lr': 0.0004463916388439394, 'samples': 6418944, 'steps': 33431, 'loss/train': 1.5159145593643188} 11/07/2021 01:57:27 - INFO - __main__ - Step 33433: {'lr': 0.00044638835510744094, 'samples': 6419136, 'steps': 33432, 'loss/train': 1.2072536945343018} 11/07/2021 01:57:28 - INFO - __main__ - Step 33434: {'lr': 0.0004463850712824528, 'samples': 6419328, 'steps': 33433, 'loss/train': 1.160642385482788} 11/07/2021 01:57:28 - INFO - __main__ - Step 33435: {'lr': 0.0004463817873689766, 'samples': 6419520, 'steps': 33434, 'loss/train': 1.7231621742248535} 11/07/2021 01:57:28 - INFO - __main__ - Step 33436: {'lr': 0.00044637850336701386, 'samples': 6419712, 'steps': 33435, 'loss/train': 1.5787726640701294} 11/07/2021 01:57:29 - INFO - __main__ - Step 33437: {'lr': 0.000446375219276566, 'samples': 6419904, 'steps': 33436, 'loss/train': 1.4341305494308472} 11/07/2021 01:57:29 - INFO - __main__ - Step 33438: {'lr': 0.0004463719350976344, 'samples': 6420096, 'steps': 33437, 'loss/train': 1.730055570602417} 11/07/2021 01:57:30 - INFO - __main__ - Step 33439: {'lr': 0.0004463686508302207, 'samples': 6420288, 'steps': 33438, 'loss/train': 1.4806171655654907} 11/07/2021 01:57:30 - INFO - __main__ - Step 33440: {'lr': 0.00044636536647432636, 'samples': 6420480, 'steps': 33439, 'loss/train': 1.4073017835617065} 11/07/2021 01:57:31 - INFO - __main__ - Step 33441: {'lr': 0.00044636208202995277, 'samples': 6420672, 'steps': 33440, 'loss/train': 1.4302234649658203} 11/07/2021 01:57:31 - INFO - __main__ - Step 33442: {'lr': 0.0004463587974971014, 'samples': 6420864, 'steps': 33441, 'loss/train': 1.2936042547225952} 11/07/2021 01:57:32 - INFO - __main__ - Step 33443: {'lr': 0.0004463555128757739, 'samples': 6421056, 'steps': 33442, 'loss/train': 1.3029223680496216} 11/07/2021 01:57:33 - INFO - __main__ - Step 33444: {'lr': 0.00044635222816597153, 'samples': 6421248, 'steps': 33443, 'loss/train': 1.8932257890701294} 11/07/2021 01:57:33 - INFO - __main__ - Step 33445: {'lr': 0.0004463489433676959, 'samples': 6421440, 'steps': 33444, 'loss/train': 2.250821352005005} 11/07/2021 01:57:33 - INFO - __main__ - Step 33446: {'lr': 0.00044634565848094854, 'samples': 6421632, 'steps': 33445, 'loss/train': 1.5037392377853394} 11/07/2021 01:57:34 - INFO - __main__ - Step 33447: {'lr': 0.0004463423735057308, 'samples': 6421824, 'steps': 33446, 'loss/train': 1.1936708688735962} 11/07/2021 01:57:34 - INFO - __main__ - Step 33448: {'lr': 0.00044633908844204424, 'samples': 6422016, 'steps': 33447, 'loss/train': 1.7531934976577759} 11/07/2021 01:57:35 - INFO - __main__ - Step 33449: {'lr': 0.0004463358032898903, 'samples': 6422208, 'steps': 33448, 'loss/train': 2.1877307891845703} 11/07/2021 01:57:35 - INFO - __main__ - Step 33450: {'lr': 0.00044633251804927044, 'samples': 6422400, 'steps': 33449, 'loss/train': 1.2530103921890259} 11/07/2021 01:57:36 - INFO - __main__ - Step 33451: {'lr': 0.0004463292327201862, 'samples': 6422592, 'steps': 33450, 'loss/train': 1.3324371576309204} 11/07/2021 01:57:36 - INFO - __main__ - Step 33452: {'lr': 0.0004463259473026391, 'samples': 6422784, 'steps': 33451, 'loss/train': 1.1305065155029297} 11/07/2021 01:57:36 - INFO - __main__ - Step 33453: {'lr': 0.0004463226617966305, 'samples': 6422976, 'steps': 33452, 'loss/train': 1.6182461977005005} 11/07/2021 01:57:37 - INFO - __main__ - Step 33454: {'lr': 0.00044631937620216196, 'samples': 6423168, 'steps': 33453, 'loss/train': 1.6138521432876587} 11/07/2021 01:57:38 - INFO - __main__ - Step 33455: {'lr': 0.00044631609051923494, 'samples': 6423360, 'steps': 33454, 'loss/train': 1.4291763305664062} 11/07/2021 01:57:38 - INFO - __main__ - Step 33456: {'lr': 0.00044631280474785086, 'samples': 6423552, 'steps': 33455, 'loss/train': 1.4215017557144165} 11/07/2021 01:57:38 - INFO - __main__ - Step 33457: {'lr': 0.0004463095188880113, 'samples': 6423744, 'steps': 33456, 'loss/train': 1.511284589767456} 11/07/2021 01:57:39 - INFO - __main__ - Step 33458: {'lr': 0.00044630623293971775, 'samples': 6423936, 'steps': 33457, 'loss/train': 1.4104000329971313} 11/07/2021 01:57:40 - INFO - __main__ - Step 33459: {'lr': 0.0004463029469029716, 'samples': 6424128, 'steps': 33458, 'loss/train': 1.7511613368988037} 11/07/2021 01:57:40 - INFO - __main__ - Step 33460: {'lr': 0.0004462996607777743, 'samples': 6424320, 'steps': 33459, 'loss/train': 1.7468429803848267} 11/07/2021 01:57:41 - INFO - __main__ - Step 33461: {'lr': 0.00044629637456412754, 'samples': 6424512, 'steps': 33460, 'loss/train': 1.6965515613555908} 11/07/2021 01:57:41 - INFO - __main__ - Step 33462: {'lr': 0.0004462930882620325, 'samples': 6424704, 'steps': 33461, 'loss/train': 1.2532994747161865} 11/07/2021 01:57:41 - INFO - __main__ - Step 33463: {'lr': 0.0004462898018714909, 'samples': 6424896, 'steps': 33462, 'loss/train': 1.335274338722229} 11/07/2021 01:57:42 - INFO - __main__ - Step 33464: {'lr': 0.0004462865153925042, 'samples': 6425088, 'steps': 33463, 'loss/train': 1.3503310680389404} 11/07/2021 01:57:43 - INFO - __main__ - Step 33465: {'lr': 0.00044628322882507375, 'samples': 6425280, 'steps': 33464, 'loss/train': 1.1921557188034058} 11/07/2021 01:57:43 - INFO - __main__ - Step 33466: {'lr': 0.0004462799421692012, 'samples': 6425472, 'steps': 33465, 'loss/train': 1.49006986618042} 11/07/2021 01:57:43 - INFO - __main__ - Step 33467: {'lr': 0.0004462766554248878, 'samples': 6425664, 'steps': 33466, 'loss/train': 0.9925422072410583} 11/07/2021 01:57:44 - INFO - __main__ - Step 33468: {'lr': 0.0004462733685921353, 'samples': 6425856, 'steps': 33467, 'loss/train': 0.9103108048439026} 11/07/2021 01:57:44 - INFO - __main__ - Step 33469: {'lr': 0.000446270081670945, 'samples': 6426048, 'steps': 33468, 'loss/train': 0.8118385076522827} 11/07/2021 01:57:45 - INFO - __main__ - Step 33470: {'lr': 0.0004462667946613184, 'samples': 6426240, 'steps': 33469, 'loss/train': 1.6399989128112793} 11/07/2021 01:57:46 - INFO - __main__ - Step 33471: {'lr': 0.00044626350756325707, 'samples': 6426432, 'steps': 33470, 'loss/train': 1.365263819694519} 11/07/2021 01:57:46 - INFO - __main__ - Step 33472: {'lr': 0.0004462602203767624, 'samples': 6426624, 'steps': 33471, 'loss/train': 1.7585831880569458} 11/07/2021 01:57:46 - INFO - __main__ - Step 33473: {'lr': 0.0004462569331018359, 'samples': 6426816, 'steps': 33472, 'loss/train': 1.4998859167099} 11/07/2021 01:57:47 - INFO - __main__ - Step 33474: {'lr': 0.00044625364573847904, 'samples': 6427008, 'steps': 33473, 'loss/train': 0.8817580342292786} 11/07/2021 01:57:48 - INFO - __main__ - Step 33475: {'lr': 0.0004462503582866933, 'samples': 6427200, 'steps': 33474, 'loss/train': 0.19651857018470764} 11/07/2021 01:57:48 - INFO - __main__ - Step 33476: {'lr': 0.00044624707074648017, 'samples': 6427392, 'steps': 33475, 'loss/train': 1.4824800491333008} 11/07/2021 01:57:49 - INFO - __main__ - Step 33477: {'lr': 0.0004462437831178412, 'samples': 6427584, 'steps': 33476, 'loss/train': 0.8712844252586365} 11/07/2021 01:57:49 - INFO - __main__ - Step 33478: {'lr': 0.00044624049540077784, 'samples': 6427776, 'steps': 33477, 'loss/train': 1.6491341590881348} 11/07/2021 01:57:49 - INFO - __main__ - Step 33479: {'lr': 0.0004462372075952914, 'samples': 6427968, 'steps': 33478, 'loss/train': 1.6323633193969727} 11/07/2021 01:57:50 - INFO - __main__ - Step 33480: {'lr': 0.0004462339197013836, 'samples': 6428160, 'steps': 33479, 'loss/train': 1.5793582201004028} 11/07/2021 01:57:51 - INFO - __main__ - Step 33481: {'lr': 0.00044623063171905585, 'samples': 6428352, 'steps': 33480, 'loss/train': 1.7055293321609497} 11/07/2021 01:57:51 - INFO - __main__ - Step 33482: {'lr': 0.0004462273436483095, 'samples': 6428544, 'steps': 33481, 'loss/train': 2.0158817768096924} 11/07/2021 01:57:51 - INFO - __main__ - Step 33483: {'lr': 0.00044622405548914627, 'samples': 6428736, 'steps': 33482, 'loss/train': 1.3248236179351807} 11/07/2021 01:57:52 - INFO - __main__ - Step 33484: {'lr': 0.00044622076724156747, 'samples': 6428928, 'steps': 33483, 'loss/train': 1.8139656782150269} 11/07/2021 01:57:53 - INFO - __main__ - Step 33485: {'lr': 0.00044621747890557454, 'samples': 6429120, 'steps': 33484, 'loss/train': 1.4172933101654053} 11/07/2021 01:57:53 - INFO - __main__ - Step 33486: {'lr': 0.0004462141904811691, 'samples': 6429312, 'steps': 33485, 'loss/train': 1.271095633506775} 11/07/2021 01:57:53 - INFO - __main__ - Step 33487: {'lr': 0.00044621090196835254, 'samples': 6429504, 'steps': 33486, 'loss/train': 1.781255841255188} 11/07/2021 01:57:54 - INFO - __main__ - Step 33488: {'lr': 0.00044620761336712646, 'samples': 6429696, 'steps': 33487, 'loss/train': 1.747527837753296} 11/07/2021 01:57:54 - INFO - __main__ - Step 33489: {'lr': 0.00044620432467749215, 'samples': 6429888, 'steps': 33488, 'loss/train': 2.0125067234039307} 11/07/2021 01:57:55 - INFO - __main__ - Step 33490: {'lr': 0.0004462010358994513, 'samples': 6430080, 'steps': 33489, 'loss/train': 1.6507304906845093} 11/07/2021 01:57:56 - INFO - __main__ - Step 33491: {'lr': 0.0004461977470330052, 'samples': 6430272, 'steps': 33490, 'loss/train': 1.1020450592041016} 11/07/2021 01:57:56 - INFO - __main__ - Step 33492: {'lr': 0.00044619445807815545, 'samples': 6430464, 'steps': 33491, 'loss/train': 1.0469087362289429} 11/07/2021 01:57:56 - INFO - __main__ - Step 33493: {'lr': 0.00044619116903490356, 'samples': 6430656, 'steps': 33492, 'loss/train': 1.407516598701477} 11/07/2021 01:57:57 - INFO - __main__ - Step 33494: {'lr': 0.00044618787990325086, 'samples': 6430848, 'steps': 33493, 'loss/train': 1.9450844526290894} 11/07/2021 01:57:58 - INFO - __main__ - Step 33495: {'lr': 0.000446184590683199, 'samples': 6431040, 'steps': 33494, 'loss/train': 1.4101066589355469} 11/07/2021 01:57:58 - INFO - __main__ - Step 33496: {'lr': 0.00044618130137474935, 'samples': 6431232, 'steps': 33495, 'loss/train': 1.4940515756607056} 11/07/2021 01:57:58 - INFO - __main__ - Step 33497: {'lr': 0.0004461780119779034, 'samples': 6431424, 'steps': 33496, 'loss/train': 1.3933483362197876} 11/07/2021 01:57:59 - INFO - __main__ - Step 33498: {'lr': 0.0004461747224926628, 'samples': 6431616, 'steps': 33497, 'loss/train': 1.6501895189285278} 11/07/2021 01:57:59 - INFO - __main__ - Step 33499: {'lr': 0.0004461714329190288, 'samples': 6431808, 'steps': 33498, 'loss/train': 1.6280629634857178} 11/07/2021 01:58:00 - INFO - __main__ - Step 33500: {'lr': 0.00044616814325700293, 'samples': 6432000, 'steps': 33499, 'loss/train': 1.6826666593551636} 11/07/2021 01:58:00 - INFO - __main__ - Step 33501: {'lr': 0.0004461648535065869, 'samples': 6432192, 'steps': 33500, 'loss/train': 1.7501734495162964} 11/07/2021 01:58:01 - INFO - __main__ - Step 33502: {'lr': 0.0004461615636677818, 'samples': 6432384, 'steps': 33501, 'loss/train': 1.3026123046875} 11/07/2021 01:58:01 - INFO - __main__ - Step 33503: {'lr': 0.0004461582737405895, 'samples': 6432576, 'steps': 33502, 'loss/train': 1.302112340927124} 11/07/2021 01:58:02 - INFO - __main__ - Step 33504: {'lr': 0.00044615498372501116, 'samples': 6432768, 'steps': 33503, 'loss/train': 1.6485215425491333} 11/07/2021 01:58:02 - INFO - __main__ - Step 33505: {'lr': 0.00044615169362104856, 'samples': 6432960, 'steps': 33504, 'loss/train': 0.8213014602661133} 11/07/2021 01:58:03 - INFO - __main__ - Step 33506: {'lr': 0.00044614840342870293, 'samples': 6433152, 'steps': 33505, 'loss/train': 1.8052080869674683} 11/07/2021 01:58:03 - INFO - __main__ - Step 33507: {'lr': 0.0004461451131479759, 'samples': 6433344, 'steps': 33506, 'loss/train': 1.5932680368423462} 11/07/2021 01:58:04 - INFO - __main__ - Step 33508: {'lr': 0.0004461418227788689, 'samples': 6433536, 'steps': 33507, 'loss/train': 1.8350179195404053} 11/07/2021 01:58:04 - INFO - __main__ - Step 33509: {'lr': 0.00044613853232138343, 'samples': 6433728, 'steps': 33508, 'loss/train': 1.463523268699646} 11/07/2021 01:58:04 - INFO - __main__ - Step 33510: {'lr': 0.0004461352417755209, 'samples': 6433920, 'steps': 33509, 'loss/train': 1.4737497568130493} 11/07/2021 01:58:05 - INFO - __main__ - Step 33511: {'lr': 0.0004461319511412829, 'samples': 6434112, 'steps': 33510, 'loss/train': 1.8179429769515991} 11/07/2021 01:58:06 - INFO - __main__ - Step 33512: {'lr': 0.00044612866041867093, 'samples': 6434304, 'steps': 33511, 'loss/train': 1.6133488416671753} 11/07/2021 01:58:06 - INFO - __main__ - Step 33513: {'lr': 0.0004461253696076863, 'samples': 6434496, 'steps': 33512, 'loss/train': 1.5488224029541016} 11/07/2021 01:58:06 - INFO - __main__ - Step 33514: {'lr': 0.00044612207870833073, 'samples': 6434688, 'steps': 33513, 'loss/train': 1.478121042251587} 11/07/2021 01:58:07 - INFO - __main__ - Step 33515: {'lr': 0.0004461187877206055, 'samples': 6434880, 'steps': 33514, 'loss/train': 1.1957716941833496} 11/07/2021 01:58:08 - INFO - __main__ - Step 33516: {'lr': 0.00044611549664451216, 'samples': 6435072, 'steps': 33515, 'loss/train': 1.937234878540039} 11/07/2021 01:58:08 - INFO - __main__ - Step 33517: {'lr': 0.0004461122054800522, 'samples': 6435264, 'steps': 33516, 'loss/train': 1.4780783653259277} 11/07/2021 01:58:09 - INFO - __main__ - Step 33518: {'lr': 0.00044610891422722714, 'samples': 6435456, 'steps': 33517, 'loss/train': 1.3189829587936401} 11/07/2021 01:58:09 - INFO - __main__ - Step 33519: {'lr': 0.00044610562288603846, 'samples': 6435648, 'steps': 33518, 'loss/train': 1.0899434089660645} 11/07/2021 01:58:09 - INFO - __main__ - Step 33520: {'lr': 0.00044610233145648756, 'samples': 6435840, 'steps': 33519, 'loss/train': 1.1641112565994263} 11/07/2021 01:58:10 - INFO - __main__ - Step 33521: {'lr': 0.00044609903993857603, 'samples': 6436032, 'steps': 33520, 'loss/train': 1.6565847396850586} 11/07/2021 01:58:11 - INFO - __main__ - Step 33522: {'lr': 0.0004460957483323052, 'samples': 6436224, 'steps': 33521, 'loss/train': 1.5398041009902954} 11/07/2021 01:58:11 - INFO - __main__ - Step 33523: {'lr': 0.0004460924566376767, 'samples': 6436416, 'steps': 33522, 'loss/train': 1.4112274646759033} 11/07/2021 01:58:11 - INFO - __main__ - Step 33524: {'lr': 0.00044608916485469195, 'samples': 6436608, 'steps': 33523, 'loss/train': 1.9318363666534424} 11/07/2021 01:58:12 - INFO - __main__ - Step 33525: {'lr': 0.0004460858729833525, 'samples': 6436800, 'steps': 33524, 'loss/train': 1.6608656644821167} 11/07/2021 01:58:13 - INFO - __main__ - Step 33526: {'lr': 0.0004460825810236598, 'samples': 6436992, 'steps': 33525, 'loss/train': 1.7003602981567383} 11/07/2021 01:58:13 - INFO - __main__ - Step 33527: {'lr': 0.00044607928897561524, 'samples': 6437184, 'steps': 33526, 'loss/train': 1.3078749179840088} 11/07/2021 01:58:13 - INFO - __main__ - Step 33528: {'lr': 0.0004460759968392204, 'samples': 6437376, 'steps': 33527, 'loss/train': 1.5011610984802246} 11/07/2021 01:58:14 - INFO - __main__ - Step 33529: {'lr': 0.0004460727046144768, 'samples': 6437568, 'steps': 33528, 'loss/train': 0.829114556312561} 11/07/2021 01:58:14 - INFO - __main__ - Step 33530: {'lr': 0.00044606941230138574, 'samples': 6437760, 'steps': 33529, 'loss/train': 1.5233813524246216} 11/07/2021 01:58:15 - INFO - __main__ - Step 33531: {'lr': 0.0004460661198999489, 'samples': 6437952, 'steps': 33530, 'loss/train': 1.2734391689300537} 11/07/2021 01:58:15 - INFO - __main__ - Step 33532: {'lr': 0.0004460628274101677, 'samples': 6438144, 'steps': 33531, 'loss/train': 1.682828664779663} 11/07/2021 01:58:16 - INFO - __main__ - Step 33533: {'lr': 0.0004460595348320436, 'samples': 6438336, 'steps': 33532, 'loss/train': 1.3908790349960327} 11/07/2021 01:58:16 - INFO - __main__ - Step 33534: {'lr': 0.0004460562421655782, 'samples': 6438528, 'steps': 33533, 'loss/train': 1.3196871280670166} 11/07/2021 01:58:16 - INFO - __main__ - Step 33535: {'lr': 0.0004460529494107727, 'samples': 6438720, 'steps': 33534, 'loss/train': 1.2997703552246094} 11/07/2021 01:58:17 - INFO - __main__ - Step 33536: {'lr': 0.00044604965656762884, 'samples': 6438912, 'steps': 33535, 'loss/train': 1.7080373764038086} 11/07/2021 01:58:18 - INFO - __main__ - Step 33537: {'lr': 0.0004460463636361481, 'samples': 6439104, 'steps': 33536, 'loss/train': 1.4093761444091797} 11/07/2021 01:58:18 - INFO - __main__ - Step 33538: {'lr': 0.00044604307061633187, 'samples': 6439296, 'steps': 33537, 'loss/train': 2.498072624206543} 11/07/2021 01:58:18 - INFO - __main__ - Step 33539: {'lr': 0.0004460397775081816, 'samples': 6439488, 'steps': 33538, 'loss/train': 1.409963846206665} 11/07/2021 01:58:19 - INFO - __main__ - Step 33540: {'lr': 0.00044603648431169884, 'samples': 6439680, 'steps': 33539, 'loss/train': 1.581534504890442} 11/07/2021 01:58:19 - INFO - __main__ - Step 33541: {'lr': 0.0004460331910268851, 'samples': 6439872, 'steps': 33540, 'loss/train': 1.930228590965271} 11/07/2021 01:58:20 - INFO - __main__ - Step 33542: {'lr': 0.0004460298976537418, 'samples': 6440064, 'steps': 33541, 'loss/train': 1.6128143072128296} 11/07/2021 01:58:21 - INFO - __main__ - Step 33543: {'lr': 0.00044602660419227046, 'samples': 6440256, 'steps': 33542, 'loss/train': 1.5586212873458862} 11/07/2021 01:58:21 - INFO - __main__ - Step 33544: {'lr': 0.0004460233106424726, 'samples': 6440448, 'steps': 33543, 'loss/train': 1.1883673667907715} 11/07/2021 01:58:21 - INFO - __main__ - Step 33545: {'lr': 0.00044602001700434963, 'samples': 6440640, 'steps': 33544, 'loss/train': 0.7786942720413208} 11/07/2021 01:58:22 - INFO - __main__ - Step 33546: {'lr': 0.00044601672327790304, 'samples': 6440832, 'steps': 33545, 'loss/train': 1.3047401905059814} 11/07/2021 01:58:23 - INFO - __main__ - Step 33547: {'lr': 0.00044601342946313437, 'samples': 6441024, 'steps': 33546, 'loss/train': 1.2048883438110352} 11/07/2021 01:58:23 - INFO - __main__ - Step 33548: {'lr': 0.0004460101355600451, 'samples': 6441216, 'steps': 33547, 'loss/train': 1.9878222942352295} 11/07/2021 01:58:23 - INFO - __main__ - Step 33549: {'lr': 0.0004460068415686366, 'samples': 6441408, 'steps': 33548, 'loss/train': 1.798831582069397} 11/07/2021 01:58:24 - INFO - __main__ - Step 33550: {'lr': 0.0004460035474889105, 'samples': 6441600, 'steps': 33549, 'loss/train': 1.7199984788894653} 11/07/2021 01:58:24 - INFO - __main__ - Step 33551: {'lr': 0.00044600025332086824, 'samples': 6441792, 'steps': 33550, 'loss/train': 1.4604843854904175} 11/07/2021 01:58:25 - INFO - __main__ - Step 33552: {'lr': 0.0004459969590645113, 'samples': 6441984, 'steps': 33551, 'loss/train': 1.351448893547058} 11/07/2021 01:58:25 - INFO - __main__ - Step 33553: {'lr': 0.000445993664719841, 'samples': 6442176, 'steps': 33552, 'loss/train': 1.5944808721542358} 11/07/2021 01:58:26 - INFO - __main__ - Step 33554: {'lr': 0.0004459903702868592, 'samples': 6442368, 'steps': 33553, 'loss/train': 1.6904922723770142} 11/07/2021 01:58:26 - INFO - __main__ - Step 33555: {'lr': 0.00044598707576556706, 'samples': 6442560, 'steps': 33554, 'loss/train': 2.0922768115997314} 11/07/2021 01:58:26 - INFO - __main__ - Step 33556: {'lr': 0.00044598378115596614, 'samples': 6442752, 'steps': 33555, 'loss/train': 1.494410753250122} 11/07/2021 01:58:27 - INFO - __main__ - Step 33557: {'lr': 0.000445980486458058, 'samples': 6442944, 'steps': 33556, 'loss/train': 1.659221887588501} 11/07/2021 01:58:28 - INFO - __main__ - Step 33558: {'lr': 0.0004459771916718441, 'samples': 6443136, 'steps': 33557, 'loss/train': 1.557417392730713} 11/07/2021 01:58:28 - INFO - __main__ - Step 33559: {'lr': 0.0004459738967973258, 'samples': 6443328, 'steps': 33558, 'loss/train': 1.2724696397781372} 11/07/2021 01:58:29 - INFO - __main__ - Step 33560: {'lr': 0.00044597060183450477, 'samples': 6443520, 'steps': 33559, 'loss/train': 1.2706172466278076} 11/07/2021 01:58:29 - INFO - __main__ - Step 33561: {'lr': 0.00044596730678338236, 'samples': 6443712, 'steps': 33560, 'loss/train': 1.454308032989502} 11/07/2021 01:58:30 - INFO - __main__ - Step 33562: {'lr': 0.0004459640116439602, 'samples': 6443904, 'steps': 33561, 'loss/train': 1.3097172975540161} 11/07/2021 01:58:30 - INFO - __main__ - Step 33563: {'lr': 0.0004459607164162396, 'samples': 6444096, 'steps': 33562, 'loss/train': 1.4599028825759888} 11/07/2021 01:58:31 - INFO - __main__ - Step 33564: {'lr': 0.00044595742110022216, 'samples': 6444288, 'steps': 33563, 'loss/train': 1.3603464365005493} 11/07/2021 01:58:31 - INFO - __main__ - Step 33565: {'lr': 0.00044595412569590934, 'samples': 6444480, 'steps': 33564, 'loss/train': 1.0743993520736694} 11/07/2021 01:58:31 - INFO - __main__ - Step 33566: {'lr': 0.0004459508302033025, 'samples': 6444672, 'steps': 33565, 'loss/train': 1.602871060371399} 11/07/2021 01:58:32 - INFO - __main__ - Step 33567: {'lr': 0.00044594753462240335, 'samples': 6444864, 'steps': 33566, 'loss/train': 1.5943396091461182} 11/07/2021 01:58:33 - INFO - __main__ - Step 33568: {'lr': 0.0004459442389532132, 'samples': 6445056, 'steps': 33567, 'loss/train': 1.4100042581558228} 11/07/2021 01:58:33 - INFO - __main__ - Step 33569: {'lr': 0.0004459409431957337, 'samples': 6445248, 'steps': 33568, 'loss/train': 1.6690175533294678} 11/07/2021 01:58:33 - INFO - __main__ - Step 33570: {'lr': 0.00044593764734996615, 'samples': 6445440, 'steps': 33569, 'loss/train': 1.4638475179672241} 11/07/2021 01:58:34 - INFO - __main__ - Step 33571: {'lr': 0.00044593435141591215, 'samples': 6445632, 'steps': 33570, 'loss/train': 1.7800277471542358} 11/07/2021 01:58:35 - INFO - __main__ - Step 33572: {'lr': 0.00044593105539357313, 'samples': 6445824, 'steps': 33571, 'loss/train': 1.390863299369812} 11/07/2021 01:58:35 - INFO - __main__ - Step 33573: {'lr': 0.00044592775928295063, 'samples': 6446016, 'steps': 33572, 'loss/train': 1.5232267379760742} 11/07/2021 01:58:35 - INFO - __main__ - Step 33574: {'lr': 0.0004459244630840461, 'samples': 6446208, 'steps': 33573, 'loss/train': 0.9683477878570557} 11/07/2021 01:58:36 - INFO - __main__ - Step 33575: {'lr': 0.000445921166796861, 'samples': 6446400, 'steps': 33574, 'loss/train': 1.6692181825637817} 11/07/2021 01:58:36 - INFO - __main__ - Step 33576: {'lr': 0.00044591787042139684, 'samples': 6446592, 'steps': 33575, 'loss/train': 1.3814464807510376} 11/07/2021 01:58:37 - INFO - __main__ - Step 33577: {'lr': 0.0004459145739576552, 'samples': 6446784, 'steps': 33576, 'loss/train': 1.4177926778793335} 11/07/2021 01:58:38 - INFO - __main__ - Step 33578: {'lr': 0.0004459112774056374, 'samples': 6446976, 'steps': 33577, 'loss/train': 1.8841018676757812} 11/07/2021 01:58:38 - INFO - __main__ - Step 33579: {'lr': 0.000445907980765345, 'samples': 6447168, 'steps': 33578, 'loss/train': 1.088173508644104} 11/07/2021 01:58:38 - INFO - __main__ - Step 33580: {'lr': 0.00044590468403677954, 'samples': 6447360, 'steps': 33579, 'loss/train': 1.6133272647857666} 11/07/2021 01:58:39 - INFO - __main__ - Step 33581: {'lr': 0.00044590138721994243, 'samples': 6447552, 'steps': 33580, 'loss/train': 1.7381786108016968} 11/07/2021 01:58:40 - INFO - __main__ - Step 33582: {'lr': 0.00044589809031483517, 'samples': 6447744, 'steps': 33581, 'loss/train': 1.8601634502410889} 11/07/2021 01:58:40 - INFO - __main__ - Step 33583: {'lr': 0.0004458947933214592, 'samples': 6447936, 'steps': 33582, 'loss/train': 1.626869559288025} 11/07/2021 01:58:40 - INFO - __main__ - Step 33584: {'lr': 0.0004458914962398162, 'samples': 6448128, 'steps': 33583, 'loss/train': 1.0731712579727173} 11/07/2021 01:58:41 - INFO - __main__ - Step 33585: {'lr': 0.0004458881990699074, 'samples': 6448320, 'steps': 33584, 'loss/train': 1.6439554691314697} 11/07/2021 01:58:41 - INFO - __main__ - Step 33586: {'lr': 0.00044588490181173435, 'samples': 6448512, 'steps': 33585, 'loss/train': 1.6263537406921387} 11/07/2021 01:58:41 - INFO - __main__ - Step 33587: {'lr': 0.0004458816044652987, 'samples': 6448704, 'steps': 33586, 'loss/train': 1.4457780122756958} 11/07/2021 01:58:42 - INFO - __main__ - Step 33588: {'lr': 0.00044587830703060176, 'samples': 6448896, 'steps': 33587, 'loss/train': 1.634812831878662} 11/07/2021 01:58:43 - INFO - __main__ - Step 33589: {'lr': 0.00044587500950764514, 'samples': 6449088, 'steps': 33588, 'loss/train': 1.8523485660552979} 11/07/2021 01:58:43 - INFO - __main__ - Step 33590: {'lr': 0.0004458717118964302, 'samples': 6449280, 'steps': 33589, 'loss/train': 1.581408977508545} 11/07/2021 01:58:43 - INFO - __main__ - Step 33591: {'lr': 0.0004458684141969585, 'samples': 6449472, 'steps': 33590, 'loss/train': 1.5999499559402466} 11/07/2021 01:58:44 - INFO - __main__ - Step 33592: {'lr': 0.0004458651164092315, 'samples': 6449664, 'steps': 33591, 'loss/train': 1.786829948425293} 11/07/2021 01:58:45 - INFO - __main__ - Step 33593: {'lr': 0.00044586181853325076, 'samples': 6449856, 'steps': 33592, 'loss/train': 1.5149943828582764} 11/07/2021 01:58:45 - INFO - __main__ - Step 33594: {'lr': 0.0004458585205690177, 'samples': 6450048, 'steps': 33593, 'loss/train': 1.1522443294525146} 11/07/2021 01:58:46 - INFO - __main__ - Step 33595: {'lr': 0.0004458552225165338, 'samples': 6450240, 'steps': 33594, 'loss/train': 1.3876985311508179} 11/07/2021 01:58:46 - INFO - __main__ - Step 33596: {'lr': 0.00044585192437580044, 'samples': 6450432, 'steps': 33595, 'loss/train': 1.510378360748291} 11/07/2021 01:58:46 - INFO - __main__ - Step 33597: {'lr': 0.0004458486261468194, 'samples': 6450624, 'steps': 33596, 'loss/train': 1.6011792421340942} 11/07/2021 01:58:47 - INFO - __main__ - Step 33598: {'lr': 0.0004458453278295919, 'samples': 6450816, 'steps': 33597, 'loss/train': 1.1178830862045288} 11/07/2021 01:58:48 - INFO - __main__ - Step 33599: {'lr': 0.00044584202942411956, 'samples': 6451008, 'steps': 33598, 'loss/train': 1.7060731649398804} 11/07/2021 01:58:48 - INFO - __main__ - Step 33600: {'lr': 0.00044583873093040376, 'samples': 6451200, 'steps': 33599, 'loss/train': 1.4395594596862793} 11/07/2021 01:58:48 - INFO - __main__ - Step 33601: {'lr': 0.00044583543234844616, 'samples': 6451392, 'steps': 33600, 'loss/train': 1.3613064289093018} 11/07/2021 01:58:49 - INFO - __main__ - Step 33602: {'lr': 0.00044583213367824806, 'samples': 6451584, 'steps': 33601, 'loss/train': 1.7319318056106567} 11/07/2021 01:58:50 - INFO - __main__ - Step 33603: {'lr': 0.00044582883491981097, 'samples': 6451776, 'steps': 33602, 'loss/train': 1.7198821306228638} 11/07/2021 01:58:50 - INFO - __main__ - Step 33604: {'lr': 0.0004458255360731365, 'samples': 6451968, 'steps': 33603, 'loss/train': 1.7537992000579834} 11/07/2021 01:58:51 - INFO - __main__ - Step 33605: {'lr': 0.00044582223713822606, 'samples': 6452160, 'steps': 33604, 'loss/train': 1.4399713277816772} 11/07/2021 01:58:51 - INFO - __main__ - Step 33606: {'lr': 0.0004458189381150811, 'samples': 6452352, 'steps': 33605, 'loss/train': 1.2576117515563965} 11/07/2021 01:58:51 - INFO - __main__ - Step 33607: {'lr': 0.00044581563900370326, 'samples': 6452544, 'steps': 33606, 'loss/train': 1.5778969526290894} 11/07/2021 01:58:52 - INFO - __main__ - Step 33608: {'lr': 0.0004458123398040938, 'samples': 6452736, 'steps': 33607, 'loss/train': 1.6962167024612427} 11/07/2021 01:58:53 - INFO - __main__ - Step 33609: {'lr': 0.0004458090405162544, 'samples': 6452928, 'steps': 33608, 'loss/train': 1.5823737382888794} 11/07/2021 01:58:53 - INFO - __main__ - Step 33610: {'lr': 0.0004458057411401864, 'samples': 6453120, 'steps': 33609, 'loss/train': 1.258210301399231} 11/07/2021 01:58:53 - INFO - __main__ - Step 33611: {'lr': 0.00044580244167589136, 'samples': 6453312, 'steps': 33610, 'loss/train': 1.672492265701294} 11/07/2021 01:58:54 - INFO - __main__ - Step 33612: {'lr': 0.00044579914212337083, 'samples': 6453504, 'steps': 33611, 'loss/train': 1.6796356439590454} 11/07/2021 01:58:54 - INFO - __main__ - Step 33613: {'lr': 0.00044579584248262617, 'samples': 6453696, 'steps': 33612, 'loss/train': 1.7429602146148682} 11/07/2021 01:58:55 - INFO - __main__ - Step 33614: {'lr': 0.0004457925427536589, 'samples': 6453888, 'steps': 33613, 'loss/train': 1.4019567966461182} 11/07/2021 01:58:55 - INFO - __main__ - Step 33615: {'lr': 0.0004457892429364706, 'samples': 6454080, 'steps': 33614, 'loss/train': 1.949160099029541} 11/07/2021 01:58:56 - INFO - __main__ - Step 33616: {'lr': 0.00044578594303106266, 'samples': 6454272, 'steps': 33615, 'loss/train': 1.8838460445404053} 11/07/2021 01:58:56 - INFO - __main__ - Step 33617: {'lr': 0.00044578264303743654, 'samples': 6454464, 'steps': 33616, 'loss/train': 1.7442598342895508} 11/07/2021 01:58:56 - INFO - __main__ - Step 33618: {'lr': 0.00044577934295559387, 'samples': 6454656, 'steps': 33617, 'loss/train': 1.8235018253326416} 11/07/2021 01:58:58 - INFO - __main__ - Step 33619: {'lr': 0.000445776042785536, 'samples': 6454848, 'steps': 33618, 'loss/train': 1.3736298084259033} 11/07/2021 01:58:58 - INFO - __main__ - Step 33620: {'lr': 0.00044577274252726454, 'samples': 6455040, 'steps': 33619, 'loss/train': 1.5522053241729736} 11/07/2021 01:58:58 - INFO - __main__ - Step 33621: {'lr': 0.00044576944218078075, 'samples': 6455232, 'steps': 33620, 'loss/train': 2.243539571762085} 11/07/2021 01:58:59 - INFO - __main__ - Step 33622: {'lr': 0.00044576614174608644, 'samples': 6455424, 'steps': 33621, 'loss/train': 1.4455134868621826} 11/07/2021 01:58:59 - INFO - __main__ - Step 33623: {'lr': 0.0004457628412231828, 'samples': 6455616, 'steps': 33622, 'loss/train': 1.6696631908416748} 11/07/2021 01:59:00 - INFO - __main__ - Step 33624: {'lr': 0.0004457595406120715, 'samples': 6455808, 'steps': 33623, 'loss/train': 0.5843969583511353} 11/07/2021 01:59:00 - INFO - __main__ - Step 33625: {'lr': 0.000445756239912754, 'samples': 6456000, 'steps': 33624, 'loss/train': 1.5091285705566406} 11/07/2021 01:59:01 - INFO - __main__ - Step 33626: {'lr': 0.00044575293912523173, 'samples': 6456192, 'steps': 33625, 'loss/train': 1.6436296701431274} 11/07/2021 01:59:01 - INFO - __main__ - Step 33627: {'lr': 0.0004457496382495062, 'samples': 6456384, 'steps': 33626, 'loss/train': 1.0140951871871948} 11/07/2021 01:59:01 - INFO - __main__ - Step 33628: {'lr': 0.00044574633728557887, 'samples': 6456576, 'steps': 33627, 'loss/train': 1.8059523105621338} 11/07/2021 01:59:02 - INFO - __main__ - Step 33629: {'lr': 0.0004457430362334513, 'samples': 6456768, 'steps': 33628, 'loss/train': 1.9687384366989136} 11/07/2021 01:59:03 - INFO - __main__ - Step 33630: {'lr': 0.00044573973509312494, 'samples': 6456960, 'steps': 33629, 'loss/train': 1.120401382446289} 11/07/2021 01:59:03 - INFO - __main__ - Step 33631: {'lr': 0.00044573643386460127, 'samples': 6457152, 'steps': 33630, 'loss/train': 1.464900255203247} 11/07/2021 01:59:03 - INFO - __main__ - Step 33632: {'lr': 0.00044573313254788176, 'samples': 6457344, 'steps': 33631, 'loss/train': 1.1901419162750244} 11/07/2021 01:59:04 - INFO - __main__ - Step 33633: {'lr': 0.00044572983114296794, 'samples': 6457536, 'steps': 33632, 'loss/train': 2.5087969303131104} 11/07/2021 01:59:04 - INFO - __main__ - Step 33634: {'lr': 0.00044572652964986126, 'samples': 6457728, 'steps': 33633, 'loss/train': 1.0945255756378174} 11/07/2021 01:59:05 - INFO - __main__ - Step 33635: {'lr': 0.0004457232280685633, 'samples': 6457920, 'steps': 33634, 'loss/train': 1.6997337341308594} 11/07/2021 01:59:06 - INFO - __main__ - Step 33636: {'lr': 0.0004457199263990754, 'samples': 6458112, 'steps': 33635, 'loss/train': 1.3791000843048096} 11/07/2021 01:59:06 - INFO - __main__ - Step 33637: {'lr': 0.0004457166246413992, 'samples': 6458304, 'steps': 33636, 'loss/train': 1.040553092956543} 11/07/2021 01:59:06 - INFO - __main__ - Step 33638: {'lr': 0.000445713322795536, 'samples': 6458496, 'steps': 33637, 'loss/train': 1.3512609004974365} 11/07/2021 01:59:07 - INFO - __main__ - Step 33639: {'lr': 0.0004457100208614875, 'samples': 6458688, 'steps': 33638, 'loss/train': 1.6804062128067017} 11/07/2021 01:59:08 - INFO - __main__ - Step 33640: {'lr': 0.00044570671883925497, 'samples': 6458880, 'steps': 33639, 'loss/train': 1.4761654138565063} 11/07/2021 01:59:08 - INFO - __main__ - Step 33641: {'lr': 0.00044570341672884006, 'samples': 6459072, 'steps': 33640, 'loss/train': 1.2570524215698242} 11/07/2021 01:59:08 - INFO - __main__ - Step 33642: {'lr': 0.0004457001145302443, 'samples': 6459264, 'steps': 33641, 'loss/train': 1.4354379177093506} 11/07/2021 01:59:09 - INFO - __main__ - Step 33643: {'lr': 0.00044569681224346897, 'samples': 6459456, 'steps': 33642, 'loss/train': 1.6788442134857178} 11/07/2021 01:59:09 - INFO - __main__ - Step 33644: {'lr': 0.0004456935098685158, 'samples': 6459648, 'steps': 33643, 'loss/train': 1.1437855958938599} 11/07/2021 01:59:10 - INFO - __main__ - Step 33645: {'lr': 0.000445690207405386, 'samples': 6459840, 'steps': 33644, 'loss/train': 1.4899927377700806} 11/07/2021 01:59:10 - INFO - __main__ - Step 33646: {'lr': 0.00044568690485408125, 'samples': 6460032, 'steps': 33645, 'loss/train': 1.0562697649002075} 11/07/2021 01:59:11 - INFO - __main__ - Step 33647: {'lr': 0.0004456836022146031, 'samples': 6460224, 'steps': 33646, 'loss/train': 1.6380075216293335} 11/07/2021 01:59:11 - INFO - __main__ - Step 33648: {'lr': 0.00044568029948695287, 'samples': 6460416, 'steps': 33647, 'loss/train': 1.5932193994522095} 11/07/2021 01:59:12 - INFO - __main__ - Step 33649: {'lr': 0.0004456769966711321, 'samples': 6460608, 'steps': 33648, 'loss/train': 1.6578596830368042} 11/07/2021 01:59:12 - INFO - __main__ - Step 33650: {'lr': 0.00044567369376714226, 'samples': 6460800, 'steps': 33649, 'loss/train': 1.3997435569763184} 11/07/2021 01:59:13 - INFO - __main__ - Step 33651: {'lr': 0.00044567039077498497, 'samples': 6460992, 'steps': 33650, 'loss/train': 1.5730409622192383} 11/07/2021 01:59:13 - INFO - __main__ - Step 33652: {'lr': 0.00044566708769466155, 'samples': 6461184, 'steps': 33651, 'loss/train': 0.9869714379310608} 11/07/2021 01:59:14 - INFO - __main__ - Step 33653: {'lr': 0.00044566378452617363, 'samples': 6461376, 'steps': 33652, 'loss/train': 1.5953766107559204} 11/07/2021 01:59:14 - INFO - __main__ - Step 33654: {'lr': 0.0004456604812695226, 'samples': 6461568, 'steps': 33653, 'loss/train': 1.6929657459259033} 11/07/2021 01:59:15 - INFO - __main__ - Step 33655: {'lr': 0.0004456571779247099, 'samples': 6461760, 'steps': 33654, 'loss/train': 1.1120388507843018} 11/07/2021 01:59:15 - INFO - __main__ - Step 33656: {'lr': 0.0004456538744917372, 'samples': 6461952, 'steps': 33655, 'loss/train': 1.0251704454421997} 11/07/2021 01:59:16 - INFO - __main__ - Step 33657: {'lr': 0.0004456505709706059, 'samples': 6462144, 'steps': 33656, 'loss/train': 1.697401523590088} 11/07/2021 01:59:16 - INFO - __main__ - Step 33658: {'lr': 0.0004456472673613174, 'samples': 6462336, 'steps': 33657, 'loss/train': 1.685717225074768} 11/07/2021 01:59:16 - INFO - __main__ - Step 33659: {'lr': 0.00044564396366387327, 'samples': 6462528, 'steps': 33658, 'loss/train': 1.6014701128005981} 11/07/2021 01:59:17 - INFO - __main__ - Step 33660: {'lr': 0.000445640659878275, 'samples': 6462720, 'steps': 33659, 'loss/train': 1.3411377668380737} 11/07/2021 01:59:18 - INFO - __main__ - Step 33661: {'lr': 0.00044563735600452407, 'samples': 6462912, 'steps': 33660, 'loss/train': 1.6042089462280273} 11/07/2021 01:59:18 - INFO - __main__ - Step 33662: {'lr': 0.000445634052042622, 'samples': 6463104, 'steps': 33661, 'loss/train': 1.6241892576217651} 11/07/2021 01:59:18 - INFO - __main__ - Step 33663: {'lr': 0.00044563074799257015, 'samples': 6463296, 'steps': 33662, 'loss/train': 0.9637057781219482} 11/07/2021 01:59:19 - INFO - __main__ - Step 33664: {'lr': 0.0004456274438543702, 'samples': 6463488, 'steps': 33663, 'loss/train': 1.9259538650512695} 11/07/2021 01:59:20 - INFO - __main__ - Step 33665: {'lr': 0.0004456241396280234, 'samples': 6463680, 'steps': 33664, 'loss/train': 1.3537373542785645} 11/07/2021 01:59:20 - INFO - __main__ - Step 33666: {'lr': 0.00044562083531353154, 'samples': 6463872, 'steps': 33665, 'loss/train': 1.466401219367981} 11/07/2021 01:59:21 - INFO - __main__ - Step 33667: {'lr': 0.00044561753091089585, 'samples': 6464064, 'steps': 33666, 'loss/train': 1.6249371767044067} 11/07/2021 01:59:21 - INFO - __main__ - Step 33668: {'lr': 0.00044561422642011794, 'samples': 6464256, 'steps': 33667, 'loss/train': 1.7363765239715576} 11/07/2021 01:59:21 - INFO - __main__ - Step 33669: {'lr': 0.00044561092184119933, 'samples': 6464448, 'steps': 33668, 'loss/train': 1.6479049921035767} 11/07/2021 01:59:22 - INFO - __main__ - Step 33670: {'lr': 0.00044560761717414143, 'samples': 6464640, 'steps': 33669, 'loss/train': 0.8955526947975159} 11/07/2021 01:59:23 - INFO - __main__ - Step 33671: {'lr': 0.0004456043124189458, 'samples': 6464832, 'steps': 33670, 'loss/train': 1.129746913909912} 11/07/2021 01:59:23 - INFO - __main__ - Step 33672: {'lr': 0.00044560100757561386, 'samples': 6465024, 'steps': 33671, 'loss/train': 1.6933951377868652} 11/07/2021 01:59:23 - INFO - __main__ - Step 33673: {'lr': 0.000445597702644147, 'samples': 6465216, 'steps': 33672, 'loss/train': 1.821217656135559} 11/07/2021 01:59:24 - INFO - __main__ - Step 33674: {'lr': 0.000445594397624547, 'samples': 6465408, 'steps': 33673, 'loss/train': 1.5372188091278076} 11/07/2021 01:59:25 - INFO - __main__ - Step 33675: {'lr': 0.0004455910925168151, 'samples': 6465600, 'steps': 33674, 'loss/train': 1.3831593990325928} 11/07/2021 01:59:25 - INFO - __main__ - Step 33676: {'lr': 0.0004455877873209529, 'samples': 6465792, 'steps': 33675, 'loss/train': 1.8606778383255005} 11/07/2021 01:59:25 - INFO - __main__ - Step 33677: {'lr': 0.00044558448203696184, 'samples': 6465984, 'steps': 33676, 'loss/train': 1.8869925737380981} 11/07/2021 01:59:26 - INFO - __main__ - Step 33678: {'lr': 0.0004455811766648434, 'samples': 6466176, 'steps': 33677, 'loss/train': 1.6165025234222412} 11/07/2021 01:59:26 - INFO - __main__ - Step 33679: {'lr': 0.0004455778712045992, 'samples': 6466368, 'steps': 33678, 'loss/train': 1.620165228843689} 11/07/2021 01:59:27 - INFO - __main__ - Step 33680: {'lr': 0.0004455745656562306, 'samples': 6466560, 'steps': 33679, 'loss/train': 1.4367172718048096} 11/07/2021 01:59:28 - INFO - __main__ - Step 33681: {'lr': 0.000445571260019739, 'samples': 6466752, 'steps': 33680, 'loss/train': 1.590633511543274} 11/07/2021 01:59:28 - INFO - __main__ - Step 33682: {'lr': 0.00044556795429512617, 'samples': 6466944, 'steps': 33681, 'loss/train': 1.524424433708191} 11/07/2021 01:59:28 - INFO - __main__ - Step 33683: {'lr': 0.0004455646484823933, 'samples': 6467136, 'steps': 33682, 'loss/train': 2.1249895095825195} 11/07/2021 01:59:29 - INFO - __main__ - Step 33684: {'lr': 0.00044556134258154215, 'samples': 6467328, 'steps': 33683, 'loss/train': 1.1808171272277832} 11/07/2021 01:59:29 - INFO - __main__ - Step 33685: {'lr': 0.000445558036592574, 'samples': 6467520, 'steps': 33684, 'loss/train': 2.523301362991333} 11/07/2021 01:59:30 - INFO - __main__ - Step 33686: {'lr': 0.0004455547305154904, 'samples': 6467712, 'steps': 33685, 'loss/train': 1.156882643699646} 11/07/2021 01:59:30 - INFO - __main__ - Step 33687: {'lr': 0.00044555142435029284, 'samples': 6467904, 'steps': 33686, 'loss/train': 1.8231897354125977} 11/07/2021 01:59:31 - INFO - __main__ - Step 33688: {'lr': 0.0004455481180969829, 'samples': 6468096, 'steps': 33687, 'loss/train': 1.935068130493164} 11/07/2021 01:59:31 - INFO - __main__ - Step 33689: {'lr': 0.00044554481175556194, 'samples': 6468288, 'steps': 33688, 'loss/train': 1.3995652198791504} 11/07/2021 01:59:31 - INFO - __main__ - Step 33690: {'lr': 0.00044554150532603154, 'samples': 6468480, 'steps': 33689, 'loss/train': 1.949639081954956} 11/07/2021 01:59:32 - INFO - __main__ - Step 33691: {'lr': 0.00044553819880839313, 'samples': 6468672, 'steps': 33690, 'loss/train': 2.097712516784668} 11/07/2021 01:59:33 - INFO - __main__ - Step 33692: {'lr': 0.0004455348922026483, 'samples': 6468864, 'steps': 33691, 'loss/train': 1.9351541996002197} 11/07/2021 01:59:33 - INFO - __main__ - Step 33693: {'lr': 0.00044553158550879833, 'samples': 6469056, 'steps': 33692, 'loss/train': 1.3654865026474} 11/07/2021 01:59:34 - INFO - __main__ - Step 33694: {'lr': 0.00044552827872684493, 'samples': 6469248, 'steps': 33693, 'loss/train': 1.9272459745407104} 11/07/2021 01:59:34 - INFO - __main__ - Step 33695: {'lr': 0.00044552497185678953, 'samples': 6469440, 'steps': 33694, 'loss/train': 1.2972309589385986} 11/07/2021 01:59:35 - INFO - __main__ - Step 33696: {'lr': 0.00044552166489863354, 'samples': 6469632, 'steps': 33695, 'loss/train': 1.9361430406570435} 11/07/2021 01:59:35 - INFO - __main__ - Step 33697: {'lr': 0.0004455183578523785, 'samples': 6469824, 'steps': 33696, 'loss/train': 0.749098002910614} 11/07/2021 01:59:36 - INFO - __main__ - Step 33698: {'lr': 0.00044551505071802587, 'samples': 6470016, 'steps': 33697, 'loss/train': 1.533054232597351} 11/07/2021 01:59:36 - INFO - __main__ - Step 33699: {'lr': 0.00044551174349557733, 'samples': 6470208, 'steps': 33698, 'loss/train': 1.4626320600509644} 11/07/2021 01:59:36 - INFO - __main__ - Step 33700: {'lr': 0.0004455084361850341, 'samples': 6470400, 'steps': 33699, 'loss/train': 1.5492607355117798} 11/07/2021 01:59:37 - INFO - __main__ - Step 33701: {'lr': 0.00044550512878639784, 'samples': 6470592, 'steps': 33700, 'loss/train': 1.0996320247650146} 11/07/2021 01:59:38 - INFO - __main__ - Step 33702: {'lr': 0.0004455018212996699, 'samples': 6470784, 'steps': 33701, 'loss/train': 1.604270577430725} 11/07/2021 01:59:38 - INFO - __main__ - Step 33703: {'lr': 0.0004454985137248519, 'samples': 6470976, 'steps': 33702, 'loss/train': 1.4556504487991333} 11/07/2021 01:59:38 - INFO - __main__ - Step 33704: {'lr': 0.00044549520606194525, 'samples': 6471168, 'steps': 33703, 'loss/train': 1.2125494480133057} 11/07/2021 01:59:39 - INFO - __main__ - Step 33705: {'lr': 0.00044549189831095157, 'samples': 6471360, 'steps': 33704, 'loss/train': 1.5545730590820312} 11/07/2021 01:59:39 - INFO - __main__ - Step 33706: {'lr': 0.0004454885904718722, 'samples': 6471552, 'steps': 33705, 'loss/train': 1.6618608236312866} 11/07/2021 01:59:40 - INFO - __main__ - Step 33707: {'lr': 0.0004454852825447087, 'samples': 6471744, 'steps': 33706, 'loss/train': 1.672659158706665} 11/07/2021 01:59:40 - INFO - __main__ - Step 33708: {'lr': 0.0004454819745294625, 'samples': 6471936, 'steps': 33707, 'loss/train': 1.5245839357376099} 11/07/2021 01:59:41 - INFO - __main__ - Step 33709: {'lr': 0.0004454786664261352, 'samples': 6472128, 'steps': 33708, 'loss/train': 1.432139277458191} 11/07/2021 01:59:41 - INFO - __main__ - Step 33710: {'lr': 0.0004454753582347282, 'samples': 6472320, 'steps': 33709, 'loss/train': 1.639000415802002} 11/07/2021 01:59:42 - INFO - __main__ - Step 33711: {'lr': 0.00044547204995524305, 'samples': 6472512, 'steps': 33710, 'loss/train': 1.8348017930984497} 11/07/2021 01:59:43 - INFO - __main__ - Step 33712: {'lr': 0.00044546874158768115, 'samples': 6472704, 'steps': 33711, 'loss/train': 1.8240238428115845} 11/07/2021 01:59:43 - INFO - __main__ - Step 33713: {'lr': 0.00044546543313204415, 'samples': 6472896, 'steps': 33712, 'loss/train': 1.4014474153518677} 11/07/2021 01:59:43 - INFO - __main__ - Step 33714: {'lr': 0.00044546212458833334, 'samples': 6473088, 'steps': 33713, 'loss/train': 1.8096492290496826} 11/07/2021 01:59:44 - INFO - __main__ - Step 33715: {'lr': 0.00044545881595655035, 'samples': 6473280, 'steps': 33714, 'loss/train': 1.4596387147903442} 11/07/2021 01:59:44 - INFO - __main__ - Step 33716: {'lr': 0.00044545550723669664, 'samples': 6473472, 'steps': 33715, 'loss/train': 0.7976817488670349} 11/07/2021 01:59:45 - INFO - __main__ - Step 33717: {'lr': 0.00044545219842877373, 'samples': 6473664, 'steps': 33716, 'loss/train': 1.4980156421661377} 11/07/2021 01:59:45 - INFO - __main__ - Step 33718: {'lr': 0.000445448889532783, 'samples': 6473856, 'steps': 33717, 'loss/train': 1.0201311111450195} 11/07/2021 01:59:46 - INFO - __main__ - Step 33719: {'lr': 0.0004454455805487261, 'samples': 6474048, 'steps': 33718, 'loss/train': 1.7880730628967285} 11/07/2021 01:59:46 - INFO - __main__ - Step 33720: {'lr': 0.0004454422714766043, 'samples': 6474240, 'steps': 33719, 'loss/train': 1.2125518321990967} 11/07/2021 01:59:46 - INFO - __main__ - Step 33721: {'lr': 0.00044543896231641935, 'samples': 6474432, 'steps': 33720, 'loss/train': 1.5912449359893799} 11/07/2021 01:59:47 - INFO - __main__ - Step 33722: {'lr': 0.00044543565306817256, 'samples': 6474624, 'steps': 33721, 'loss/train': 1.6939294338226318} 11/07/2021 01:59:48 - INFO - __main__ - Step 33723: {'lr': 0.00044543234373186556, 'samples': 6474816, 'steps': 33722, 'loss/train': 1.5587968826293945} 11/07/2021 01:59:48 - INFO - __main__ - Step 33724: {'lr': 0.0004454290343074997, 'samples': 6475008, 'steps': 33723, 'loss/train': 1.2607208490371704} 11/07/2021 01:59:48 - INFO - __main__ - Step 33725: {'lr': 0.00044542572479507655, 'samples': 6475200, 'steps': 33724, 'loss/train': 1.7119576930999756} 11/07/2021 01:59:49 - INFO - __main__ - Step 33726: {'lr': 0.00044542241519459757, 'samples': 6475392, 'steps': 33725, 'loss/train': 1.4209587574005127} 11/07/2021 01:59:50 - INFO - __main__ - Step 33727: {'lr': 0.0004454191055060643, 'samples': 6475584, 'steps': 33726, 'loss/train': 1.4337310791015625} 11/07/2021 01:59:51 - INFO - __main__ - Step 33728: {'lr': 0.00044541579572947814, 'samples': 6475776, 'steps': 33727, 'loss/train': 1.5337797403335571} 11/07/2021 01:59:51 - INFO - __main__ - Step 33729: {'lr': 0.0004454124858648407, 'samples': 6475968, 'steps': 33728, 'loss/train': 0.11811182647943497} 11/07/2021 01:59:51 - INFO - __main__ - Step 33730: {'lr': 0.00044540917591215335, 'samples': 6476160, 'steps': 33729, 'loss/train': 1.6598936319351196} 11/07/2021 01:59:52 - INFO - __main__ - Step 33731: {'lr': 0.0004454058658714177, 'samples': 6476352, 'steps': 33730, 'loss/train': 1.5373272895812988} 11/07/2021 01:59:52 - INFO - __main__ - Step 33732: {'lr': 0.0004454025557426351, 'samples': 6476544, 'steps': 33731, 'loss/train': 0.6578313708305359} 11/07/2021 01:59:53 - INFO - __main__ - Step 33733: {'lr': 0.00044539924552580723, 'samples': 6476736, 'steps': 33732, 'loss/train': 1.5535787343978882} 11/07/2021 01:59:53 - INFO - __main__ - Step 33734: {'lr': 0.0004453959352209354, 'samples': 6476928, 'steps': 33733, 'loss/train': 1.8786280155181885} 11/07/2021 01:59:54 - INFO - __main__ - Step 33735: {'lr': 0.0004453926248280212, 'samples': 6477120, 'steps': 33734, 'loss/train': 1.306776762008667} 11/07/2021 01:59:54 - INFO - __main__ - Step 33736: {'lr': 0.0004453893143470661, 'samples': 6477312, 'steps': 33735, 'loss/train': 1.5552427768707275} 11/07/2021 01:59:54 - INFO - __main__ - Step 33737: {'lr': 0.0004453860037780716, 'samples': 6477504, 'steps': 33736, 'loss/train': 1.4108299016952515} 11/07/2021 01:59:55 - INFO - __main__ - Step 33738: {'lr': 0.00044538269312103916, 'samples': 6477696, 'steps': 33737, 'loss/train': 1.8733634948730469} 11/07/2021 01:59:56 - INFO - __main__ - Step 33739: {'lr': 0.00044537938237597033, 'samples': 6477888, 'steps': 33738, 'loss/train': 0.788608193397522} 11/07/2021 01:59:56 - INFO - __main__ - Step 33740: {'lr': 0.00044537607154286645, 'samples': 6478080, 'steps': 33739, 'loss/train': 1.1113461256027222} 11/07/2021 01:59:57 - INFO - __main__ - Step 33741: {'lr': 0.00044537276062172926, 'samples': 6478272, 'steps': 33740, 'loss/train': 1.4108548164367676} 11/07/2021 01:59:57 - INFO - __main__ - Step 33742: {'lr': 0.0004453694496125601, 'samples': 6478464, 'steps': 33741, 'loss/train': 1.6025670766830444} 11/07/2021 01:59:58 - INFO - __main__ - Step 33743: {'lr': 0.0004453661385153604, 'samples': 6478656, 'steps': 33742, 'loss/train': 1.8474880456924438} 11/07/2021 01:59:58 - INFO - __main__ - Step 33744: {'lr': 0.0004453628273301318, 'samples': 6478848, 'steps': 33743, 'loss/train': 1.5394790172576904} 11/07/2021 01:59:59 - INFO - __main__ - Step 33745: {'lr': 0.0004453595160568757, 'samples': 6479040, 'steps': 33744, 'loss/train': 1.138857126235962} 11/07/2021 01:59:59 - INFO - __main__ - Step 33746: {'lr': 0.0004453562046955937, 'samples': 6479232, 'steps': 33745, 'loss/train': 1.4568958282470703} 11/07/2021 01:59:59 - INFO - __main__ - Step 33747: {'lr': 0.00044535289324628704, 'samples': 6479424, 'steps': 33746, 'loss/train': 1.3100298643112183} 11/07/2021 02:00:00 - INFO - __main__ - Step 33748: {'lr': 0.00044534958170895753, 'samples': 6479616, 'steps': 33747, 'loss/train': 1.7168136835098267} 11/07/2021 02:00:01 - INFO - __main__ - Step 33749: {'lr': 0.0004453462700836064, 'samples': 6479808, 'steps': 33748, 'loss/train': 1.7134358882904053} 11/07/2021 02:00:01 - INFO - __main__ - Step 33750: {'lr': 0.0004453429583702353, 'samples': 6480000, 'steps': 33749, 'loss/train': 1.1795400381088257} 11/07/2021 02:00:01 - INFO - __main__ - Step 33751: {'lr': 0.0004453396465688457, 'samples': 6480192, 'steps': 33750, 'loss/train': 1.4245951175689697} 11/07/2021 02:00:02 - INFO - __main__ - Step 33752: {'lr': 0.00044533633467943906, 'samples': 6480384, 'steps': 33751, 'loss/train': 1.7902945280075073} 11/07/2021 02:00:03 - INFO - __main__ - Step 33753: {'lr': 0.00044533302270201693, 'samples': 6480576, 'steps': 33752, 'loss/train': 1.0339568853378296} 11/07/2021 02:00:03 - INFO - __main__ - Step 33754: {'lr': 0.00044532971063658067, 'samples': 6480768, 'steps': 33753, 'loss/train': 1.6112436056137085} 11/07/2021 02:00:04 - INFO - __main__ - Step 33755: {'lr': 0.00044532639848313187, 'samples': 6480960, 'steps': 33754, 'loss/train': 1.2984110116958618} 11/07/2021 02:00:04 - INFO - __main__ - Step 33756: {'lr': 0.0004453230862416721, 'samples': 6481152, 'steps': 33755, 'loss/train': 1.6372071504592896} 11/07/2021 02:00:04 - INFO - __main__ - Step 33757: {'lr': 0.00044531977391220267, 'samples': 6481344, 'steps': 33756, 'loss/train': 1.6506997346878052} 11/07/2021 02:00:05 - INFO - __main__ - Step 33758: {'lr': 0.00044531646149472516, 'samples': 6481536, 'steps': 33757, 'loss/train': 1.0395686626434326} 11/07/2021 02:00:06 - INFO - __main__ - Step 33759: {'lr': 0.00044531314898924116, 'samples': 6481728, 'steps': 33758, 'loss/train': 1.5042260885238647} 11/07/2021 02:00:06 - INFO - __main__ - Step 33760: {'lr': 0.00044530983639575193, 'samples': 6481920, 'steps': 33759, 'loss/train': 1.6232199668884277} 11/07/2021 02:00:06 - INFO - __main__ - Step 33761: {'lr': 0.00044530652371425916, 'samples': 6482112, 'steps': 33760, 'loss/train': 1.5966721773147583} 11/07/2021 02:00:07 - INFO - __main__ - Step 33762: {'lr': 0.00044530321094476434, 'samples': 6482304, 'steps': 33761, 'loss/train': 1.578473448753357} 11/07/2021 02:00:07 - INFO - __main__ - Step 33763: {'lr': 0.0004452998980872689, 'samples': 6482496, 'steps': 33762, 'loss/train': 1.1778452396392822} 11/07/2021 02:00:08 - INFO - __main__ - Step 33764: {'lr': 0.0004452965851417743, 'samples': 6482688, 'steps': 33763, 'loss/train': 1.709043025970459} 11/07/2021 02:00:09 - INFO - __main__ - Step 33765: {'lr': 0.000445293272108282, 'samples': 6482880, 'steps': 33764, 'loss/train': 1.7417343854904175} 11/07/2021 02:00:09 - INFO - __main__ - Step 33766: {'lr': 0.0004452899589867937, 'samples': 6483072, 'steps': 33765, 'loss/train': 0.9052668213844299} 11/07/2021 02:00:09 - INFO - __main__ - Step 33767: {'lr': 0.00044528664577731073, 'samples': 6483264, 'steps': 33766, 'loss/train': 1.6804091930389404} 11/07/2021 02:00:10 - INFO - __main__ - Step 33768: {'lr': 0.00044528333247983456, 'samples': 6483456, 'steps': 33767, 'loss/train': 2.097458600997925} 11/07/2021 02:00:10 - INFO - __main__ - Step 33769: {'lr': 0.0004452800190943667, 'samples': 6483648, 'steps': 33768, 'loss/train': 2.2681474685668945} 11/07/2021 02:00:11 - INFO - __main__ - Step 33770: {'lr': 0.0004452767056209087, 'samples': 6483840, 'steps': 33769, 'loss/train': 1.4090787172317505} 11/07/2021 02:00:11 - INFO - __main__ - Step 33771: {'lr': 0.0004452733920594621, 'samples': 6484032, 'steps': 33770, 'loss/train': 1.3013017177581787} 11/07/2021 02:00:12 - INFO - __main__ - Step 33772: {'lr': 0.0004452700784100283, 'samples': 6484224, 'steps': 33771, 'loss/train': 1.6307063102722168} 11/07/2021 02:00:12 - INFO - __main__ - Step 33773: {'lr': 0.0004452667646726088, 'samples': 6484416, 'steps': 33772, 'loss/train': 1.6249957084655762} 11/07/2021 02:00:13 - INFO - __main__ - Step 33774: {'lr': 0.0004452634508472051, 'samples': 6484608, 'steps': 33773, 'loss/train': 1.4302394390106201} 11/07/2021 02:00:14 - INFO - __main__ - Step 33775: {'lr': 0.0004452601369338187, 'samples': 6484800, 'steps': 33774, 'loss/train': 1.3013135194778442} 11/07/2021 02:00:14 - INFO - __main__ - Step 33776: {'lr': 0.00044525682293245107, 'samples': 6484992, 'steps': 33775, 'loss/train': 1.478948712348938} 11/07/2021 02:00:14 - INFO - __main__ - Step 33777: {'lr': 0.0004452535088431038, 'samples': 6485184, 'steps': 33776, 'loss/train': 1.4473791122436523} 11/07/2021 02:00:15 - INFO - __main__ - Step 33778: {'lr': 0.00044525019466577824, 'samples': 6485376, 'steps': 33777, 'loss/train': 1.9431716203689575} 11/07/2021 02:00:15 - INFO - __main__ - Step 33779: {'lr': 0.000445246880400476, 'samples': 6485568, 'steps': 33778, 'loss/train': 1.5126373767852783} 11/07/2021 02:00:15 - INFO - __main__ - Step 33780: {'lr': 0.0004452435660471985, 'samples': 6485760, 'steps': 33779, 'loss/train': 1.7080620527267456} 11/07/2021 02:00:16 - INFO - __main__ - Step 33781: {'lr': 0.00044524025160594735, 'samples': 6485952, 'steps': 33780, 'loss/train': 0.19829830527305603} 11/07/2021 02:00:17 - INFO - __main__ - Step 33782: {'lr': 0.00044523693707672384, 'samples': 6486144, 'steps': 33781, 'loss/train': 1.7072633504867554} 11/07/2021 02:00:17 - INFO - __main__ - Step 33783: {'lr': 0.0004452336224595296, 'samples': 6486336, 'steps': 33782, 'loss/train': 1.5358293056488037} 11/07/2021 02:00:17 - INFO - __main__ - Step 33784: {'lr': 0.00044523030775436617, 'samples': 6486528, 'steps': 33783, 'loss/train': 1.2860418558120728} 11/07/2021 02:00:18 - INFO - __main__ - Step 33785: {'lr': 0.00044522699296123495, 'samples': 6486720, 'steps': 33784, 'loss/train': 0.9292582273483276} 11/07/2021 02:00:19 - INFO - __main__ - Step 33786: {'lr': 0.0004452236780801374, 'samples': 6486912, 'steps': 33785, 'loss/train': 1.1000765562057495} 11/07/2021 02:00:19 - INFO - __main__ - Step 33787: {'lr': 0.00044522036311107514, 'samples': 6487104, 'steps': 33786, 'loss/train': 1.5459463596343994} 11/07/2021 02:00:20 - INFO - __main__ - Step 33788: {'lr': 0.0004452170480540496, 'samples': 6487296, 'steps': 33787, 'loss/train': 1.036749243736267} 11/07/2021 02:00:20 - INFO - __main__ - Step 33789: {'lr': 0.0004452137329090622, 'samples': 6487488, 'steps': 33788, 'loss/train': 1.3245213031768799} 11/07/2021 02:00:20 - INFO - __main__ - Step 33790: {'lr': 0.0004452104176761146, 'samples': 6487680, 'steps': 33789, 'loss/train': 1.3617459535598755} 11/07/2021 02:00:21 - INFO - __main__ - Step 33791: {'lr': 0.0004452071023552081, 'samples': 6487872, 'steps': 33790, 'loss/train': 1.1394017934799194} 11/07/2021 02:00:22 - INFO - __main__ - Step 33792: {'lr': 0.0004452037869463443, 'samples': 6488064, 'steps': 33791, 'loss/train': 2.0320191383361816} 11/07/2021 02:00:22 - INFO - __main__ - Step 33793: {'lr': 0.0004452004714495248, 'samples': 6488256, 'steps': 33792, 'loss/train': 1.3852721452713013} 11/07/2021 02:00:22 - INFO - __main__ - Step 33794: {'lr': 0.00044519715586475083, 'samples': 6488448, 'steps': 33793, 'loss/train': 1.6966907978057861} 11/07/2021 02:00:23 - INFO - __main__ - Step 33795: {'lr': 0.0004451938401920241, 'samples': 6488640, 'steps': 33794, 'loss/train': 1.5629920959472656} 11/07/2021 02:00:24 - INFO - __main__ - Step 33796: {'lr': 0.0004451905244313461, 'samples': 6488832, 'steps': 33795, 'loss/train': 1.5933583974838257} 11/07/2021 02:00:24 - INFO - __main__ - Step 33797: {'lr': 0.0004451872085827182, 'samples': 6489024, 'steps': 33796, 'loss/train': 1.5892422199249268} 11/07/2021 02:00:24 - INFO - __main__ - Step 33798: {'lr': 0.000445183892646142, 'samples': 6489216, 'steps': 33797, 'loss/train': 1.499814748764038} 11/07/2021 02:00:25 - INFO - __main__ - Step 33799: {'lr': 0.0004451805766216189, 'samples': 6489408, 'steps': 33798, 'loss/train': 1.4643924236297607} 11/07/2021 02:00:25 - INFO - __main__ - Step 33800: {'lr': 0.00044517726050915044, 'samples': 6489600, 'steps': 33799, 'loss/train': 1.5761510133743286} 11/07/2021 02:00:26 - INFO - __main__ - Step 33801: {'lr': 0.0004451739443087381, 'samples': 6489792, 'steps': 33800, 'loss/train': 1.3621511459350586} 11/07/2021 02:00:26 - INFO - __main__ - Step 33802: {'lr': 0.0004451706280203834, 'samples': 6489984, 'steps': 33801, 'loss/train': 1.673937439918518} 11/07/2021 02:00:27 - INFO - __main__ - Step 33803: {'lr': 0.0004451673116440879, 'samples': 6490176, 'steps': 33802, 'loss/train': 1.839440107345581} 11/07/2021 02:00:27 - INFO - __main__ - Step 33804: {'lr': 0.00044516399517985296, 'samples': 6490368, 'steps': 33803, 'loss/train': 1.7208832502365112} 11/07/2021 02:00:28 - INFO - __main__ - Step 33805: {'lr': 0.00044516067862768015, 'samples': 6490560, 'steps': 33804, 'loss/train': 1.5042489767074585} 11/07/2021 02:00:29 - INFO - __main__ - Step 33806: {'lr': 0.00044515736198757095, 'samples': 6490752, 'steps': 33805, 'loss/train': 2.0620195865631104} 11/07/2021 02:00:29 - INFO - __main__ - Step 33807: {'lr': 0.0004451540452595268, 'samples': 6490944, 'steps': 33806, 'loss/train': 1.3464220762252808} 11/07/2021 02:00:29 - INFO - __main__ - Step 33808: {'lr': 0.0004451507284435494, 'samples': 6491136, 'steps': 33807, 'loss/train': 1.5946115255355835} 11/07/2021 02:00:30 - INFO - __main__ - Step 33809: {'lr': 0.00044514741153964, 'samples': 6491328, 'steps': 33808, 'loss/train': 1.6844371557235718} 11/07/2021 02:00:30 - INFO - __main__ - Step 33810: {'lr': 0.00044514409454780016, 'samples': 6491520, 'steps': 33809, 'loss/train': 1.2887932062149048} 11/07/2021 02:00:30 - INFO - __main__ - Step 33811: {'lr': 0.0004451407774680314, 'samples': 6491712, 'steps': 33810, 'loss/train': 1.2394468784332275} 11/07/2021 02:00:31 - INFO - __main__ - Step 33812: {'lr': 0.0004451374603003353, 'samples': 6491904, 'steps': 33811, 'loss/train': 1.0655345916748047} 11/07/2021 02:00:32 - INFO - __main__ - Step 33813: {'lr': 0.0004451341430447132, 'samples': 6492096, 'steps': 33812, 'loss/train': 1.8083118200302124} 11/07/2021 02:00:32 - INFO - __main__ - Step 33814: {'lr': 0.0004451308257011667, 'samples': 6492288, 'steps': 33813, 'loss/train': 1.1663693189620972} 11/07/2021 02:00:32 - INFO - __main__ - Step 33815: {'lr': 0.00044512750826969724, 'samples': 6492480, 'steps': 33814, 'loss/train': 1.7158915996551514} 11/07/2021 02:00:33 - INFO - __main__ - Step 33816: {'lr': 0.0004451241907503063, 'samples': 6492672, 'steps': 33815, 'loss/train': 1.364975094795227} 11/07/2021 02:00:34 - INFO - __main__ - Step 33817: {'lr': 0.0004451208731429954, 'samples': 6492864, 'steps': 33816, 'loss/train': 1.61083984375} 11/07/2021 02:00:34 - INFO - __main__ - Step 33818: {'lr': 0.00044511755544776615, 'samples': 6493056, 'steps': 33817, 'loss/train': 1.5559664964675903} 11/07/2021 02:00:34 - INFO - __main__ - Step 33819: {'lr': 0.0004451142376646199, 'samples': 6493248, 'steps': 33818, 'loss/train': 1.484433889389038} 11/07/2021 02:00:35 - INFO - __main__ - Step 33820: {'lr': 0.0004451109197935582, 'samples': 6493440, 'steps': 33819, 'loss/train': 1.8341032266616821} 11/07/2021 02:00:35 - INFO - __main__ - Step 33821: {'lr': 0.0004451076018345824, 'samples': 6493632, 'steps': 33820, 'loss/train': 1.5442001819610596} 11/07/2021 02:00:36 - INFO - __main__ - Step 33822: {'lr': 0.0004451042837876943, 'samples': 6493824, 'steps': 33821, 'loss/train': 1.7790117263793945} 11/07/2021 02:00:37 - INFO - __main__ - Step 33823: {'lr': 0.00044510096565289513, 'samples': 6494016, 'steps': 33822, 'loss/train': 1.7140729427337646} 11/07/2021 02:00:37 - INFO - __main__ - Step 33824: {'lr': 0.0004450976474301865, 'samples': 6494208, 'steps': 33823, 'loss/train': 1.504670262336731} 11/07/2021 02:00:37 - INFO - __main__ - Step 33825: {'lr': 0.0004450943291195698, 'samples': 6494400, 'steps': 33824, 'loss/train': 1.624586820602417} 11/07/2021 02:00:38 - INFO - __main__ - Step 33826: {'lr': 0.0004450910107210467, 'samples': 6494592, 'steps': 33825, 'loss/train': 1.1198339462280273} 11/07/2021 02:00:39 - INFO - __main__ - Step 33827: {'lr': 0.00044508769223461863, 'samples': 6494784, 'steps': 33826, 'loss/train': 2.0186984539031982} 11/07/2021 02:00:39 - INFO - __main__ - Step 33828: {'lr': 0.00044508437366028695, 'samples': 6494976, 'steps': 33827, 'loss/train': 1.7827537059783936} 11/07/2021 02:00:39 - INFO - __main__ - Step 33829: {'lr': 0.00044508105499805337, 'samples': 6495168, 'steps': 33828, 'loss/train': 1.2118760347366333} 11/07/2021 02:00:40 - INFO - __main__ - Step 33830: {'lr': 0.0004450777362479192, 'samples': 6495360, 'steps': 33829, 'loss/train': 1.4964271783828735} 11/07/2021 02:00:40 - INFO - __main__ - Step 33831: {'lr': 0.000445074417409886, 'samples': 6495552, 'steps': 33830, 'loss/train': 1.4948548078536987} 11/07/2021 02:00:40 - INFO - __main__ - Step 33832: {'lr': 0.0004450710984839553, 'samples': 6495744, 'steps': 33831, 'loss/train': 1.772908329963684} 11/07/2021 02:00:41 - INFO - __main__ - Step 33833: {'lr': 0.00044506777947012863, 'samples': 6495936, 'steps': 33832, 'loss/train': 2.1931819915771484} 11/07/2021 02:00:42 - INFO - __main__ - Step 33834: {'lr': 0.0004450644603684074, 'samples': 6496128, 'steps': 33833, 'loss/train': 1.6842870712280273} 11/07/2021 02:00:42 - INFO - __main__ - Step 33835: {'lr': 0.0004450611411787931, 'samples': 6496320, 'steps': 33834, 'loss/train': 1.9571033716201782} 11/07/2021 02:00:42 - INFO - __main__ - Step 33836: {'lr': 0.0004450578219012873, 'samples': 6496512, 'steps': 33835, 'loss/train': 1.7628679275512695} 11/07/2021 02:00:43 - INFO - __main__ - Step 33837: {'lr': 0.00044505450253589144, 'samples': 6496704, 'steps': 33836, 'loss/train': 1.7430778741836548} 11/07/2021 02:00:44 - INFO - __main__ - Step 33838: {'lr': 0.00044505118308260693, 'samples': 6496896, 'steps': 33837, 'loss/train': 1.683057427406311} 11/07/2021 02:00:44 - INFO - __main__ - Step 33839: {'lr': 0.0004450478635414355, 'samples': 6497088, 'steps': 33838, 'loss/train': 1.1430109739303589} 11/07/2021 02:00:45 - INFO - __main__ - Step 33840: {'lr': 0.0004450445439123785, 'samples': 6497280, 'steps': 33839, 'loss/train': 1.5153329372406006} 11/07/2021 02:00:45 - INFO - __main__ - Step 33841: {'lr': 0.0004450412241954374, 'samples': 6497472, 'steps': 33840, 'loss/train': 1.6892775297164917} 11/07/2021 02:00:45 - INFO - __main__ - Step 33842: {'lr': 0.00044503790439061374, 'samples': 6497664, 'steps': 33841, 'loss/train': 1.6585344076156616} 11/07/2021 02:00:46 - INFO - __main__ - Step 33843: {'lr': 0.000445034584497909, 'samples': 6497856, 'steps': 33842, 'loss/train': 1.845517635345459} 11/07/2021 02:00:47 - INFO - __main__ - Step 33844: {'lr': 0.00044503126451732474, 'samples': 6498048, 'steps': 33843, 'loss/train': 1.7846750020980835} 11/07/2021 02:00:47 - INFO - __main__ - Step 33845: {'lr': 0.00044502794444886234, 'samples': 6498240, 'steps': 33844, 'loss/train': 1.4909343719482422} 11/07/2021 02:00:47 - INFO - __main__ - Step 33846: {'lr': 0.00044502462429252336, 'samples': 6498432, 'steps': 33845, 'loss/train': 1.6039619445800781} 11/07/2021 02:00:48 - INFO - __main__ - Step 33847: {'lr': 0.0004450213040483093, 'samples': 6498624, 'steps': 33846, 'loss/train': 1.3290834426879883} 11/07/2021 02:00:49 - INFO - __main__ - Step 33848: {'lr': 0.00044501798371622173, 'samples': 6498816, 'steps': 33847, 'loss/train': 1.3662832975387573} 11/07/2021 02:00:49 - INFO - __main__ - Step 33849: {'lr': 0.00044501466329626197, 'samples': 6499008, 'steps': 33848, 'loss/train': 1.15182363986969} 11/07/2021 02:00:49 - INFO - __main__ - Step 33850: {'lr': 0.0004450113427884317, 'samples': 6499200, 'steps': 33849, 'loss/train': 2.1332545280456543} 11/07/2021 02:00:50 - INFO - __main__ - Step 33851: {'lr': 0.00044500802219273224, 'samples': 6499392, 'steps': 33850, 'loss/train': 1.3523143529891968} 11/07/2021 02:00:50 - INFO - __main__ - Step 33852: {'lr': 0.00044500470150916514, 'samples': 6499584, 'steps': 33851, 'loss/train': 1.659804344177246} 11/07/2021 02:00:51 - INFO - __main__ - Step 33853: {'lr': 0.000445001380737732, 'samples': 6499776, 'steps': 33852, 'loss/train': 1.4241199493408203} 11/07/2021 02:00:52 - INFO - __main__ - Step 33854: {'lr': 0.0004449980598784343, 'samples': 6499968, 'steps': 33853, 'loss/train': 1.4666374921798706} 11/07/2021 02:00:52 - INFO - __main__ - Step 33855: {'lr': 0.0004449947389312734, 'samples': 6500160, 'steps': 33854, 'loss/train': 1.3092784881591797} 11/07/2021 02:00:53 - INFO - __main__ - Step 33856: {'lr': 0.00044499141789625086, 'samples': 6500352, 'steps': 33855, 'loss/train': 1.5748056173324585} 11/07/2021 02:00:53 - INFO - __main__ - Step 33857: {'lr': 0.0004449880967733683, 'samples': 6500544, 'steps': 33856, 'loss/train': 0.20113760232925415} 11/07/2021 02:00:54 - INFO - __main__ - Step 33858: {'lr': 0.0004449847755626271, 'samples': 6500736, 'steps': 33857, 'loss/train': 1.6535253524780273} 11/07/2021 02:00:55 - INFO - __main__ - Step 33859: {'lr': 0.0004449814542640287, 'samples': 6500928, 'steps': 33858, 'loss/train': 1.9745224714279175} 11/07/2021 02:00:55 - INFO - __main__ - Step 33860: {'lr': 0.0004449781328775746, 'samples': 6501120, 'steps': 33859, 'loss/train': 1.4337353706359863} 11/07/2021 02:00:55 - INFO - __main__ - Step 33861: {'lr': 0.0004449748114032665, 'samples': 6501312, 'steps': 33860, 'loss/train': 1.499574899673462} 11/07/2021 02:00:56 - INFO - __main__ - Step 33862: {'lr': 0.00044497148984110567, 'samples': 6501504, 'steps': 33861, 'loss/train': 1.399930715560913} 11/07/2021 02:00:56 - INFO - __main__ - Step 33863: {'lr': 0.00044496816819109377, 'samples': 6501696, 'steps': 33862, 'loss/train': 1.5060970783233643} 11/07/2021 02:00:56 - INFO - __main__ - Step 33864: {'lr': 0.0004449648464532322, 'samples': 6501888, 'steps': 33863, 'loss/train': 1.3464213609695435} 11/07/2021 02:00:57 - INFO - __main__ - Step 33865: {'lr': 0.0004449615246275225, 'samples': 6502080, 'steps': 33864, 'loss/train': 1.3262618780136108} 11/07/2021 02:00:58 - INFO - __main__ - Step 33866: {'lr': 0.000444958202713966, 'samples': 6502272, 'steps': 33865, 'loss/train': 1.3003273010253906} 11/07/2021 02:00:58 - INFO - __main__ - Step 33867: {'lr': 0.0004449548807125645, 'samples': 6502464, 'steps': 33866, 'loss/train': 1.1961196660995483} 11/07/2021 02:00:59 - INFO - __main__ - Step 33868: {'lr': 0.0004449515586233193, 'samples': 6502656, 'steps': 33867, 'loss/train': 0.7612836956977844} 11/07/2021 02:00:59 - INFO - __main__ - Step 33869: {'lr': 0.0004449482364462319, 'samples': 6502848, 'steps': 33868, 'loss/train': 1.5784677267074585} 11/07/2021 02:01:00 - INFO - __main__ - Step 33870: {'lr': 0.0004449449141813039, 'samples': 6503040, 'steps': 33869, 'loss/train': 1.3834257125854492} 11/07/2021 02:01:00 - INFO - __main__ - Step 33871: {'lr': 0.00044494159182853667, 'samples': 6503232, 'steps': 33870, 'loss/train': 1.4731563329696655} 11/07/2021 02:01:01 - INFO - __main__ - Step 33872: {'lr': 0.0004449382693879318, 'samples': 6503424, 'steps': 33871, 'loss/train': 1.233699083328247} 11/07/2021 02:01:01 - INFO - __main__ - Step 33873: {'lr': 0.0004449349468594908, 'samples': 6503616, 'steps': 33872, 'loss/train': 1.6775755882263184} 11/07/2021 02:01:01 - INFO - __main__ - Step 33874: {'lr': 0.000444931624243215, 'samples': 6503808, 'steps': 33873, 'loss/train': 1.4026579856872559} 11/07/2021 02:01:02 - INFO - __main__ - Step 33875: {'lr': 0.0004449283015391061, 'samples': 6504000, 'steps': 33874, 'loss/train': 1.7538774013519287} 11/07/2021 02:01:03 - INFO - __main__ - Step 33876: {'lr': 0.0004449249787471655, 'samples': 6504192, 'steps': 33875, 'loss/train': 1.0890296697616577} 11/07/2021 02:01:03 - INFO - __main__ - Step 33877: {'lr': 0.0004449216558673947, 'samples': 6504384, 'steps': 33876, 'loss/train': 1.704645037651062} 11/07/2021 02:01:03 - INFO - __main__ - Step 33878: {'lr': 0.0004449183328997952, 'samples': 6504576, 'steps': 33877, 'loss/train': 1.4584956169128418} 11/07/2021 02:01:04 - INFO - __main__ - Step 33879: {'lr': 0.0004449150098443685, 'samples': 6504768, 'steps': 33878, 'loss/train': 1.571358323097229} 11/07/2021 02:01:05 - INFO - __main__ - Step 33880: {'lr': 0.00044491168670111615, 'samples': 6504960, 'steps': 33879, 'loss/train': 1.1301995515823364} 11/07/2021 02:01:05 - INFO - __main__ - Step 33881: {'lr': 0.0004449083634700396, 'samples': 6505152, 'steps': 33880, 'loss/train': 1.473876714706421} 11/07/2021 02:01:05 - INFO - __main__ - Step 33882: {'lr': 0.00044490504015114033, 'samples': 6505344, 'steps': 33881, 'loss/train': 1.712478518486023} 11/07/2021 02:01:06 - INFO - __main__ - Step 33883: {'lr': 0.0004449017167444198, 'samples': 6505536, 'steps': 33882, 'loss/train': 1.813271164894104} 11/07/2021 02:01:06 - INFO - __main__ - Step 33884: {'lr': 0.0004448983932498797, 'samples': 6505728, 'steps': 33883, 'loss/train': 1.7393323183059692} 11/07/2021 02:01:06 - INFO - __main__ - Step 33885: {'lr': 0.00044489506966752127, 'samples': 6505920, 'steps': 33884, 'loss/train': 1.731095552444458} 11/07/2021 02:01:08 - INFO - __main__ - Step 33886: {'lr': 0.00044489174599734614, 'samples': 6506112, 'steps': 33885, 'loss/train': 1.4670522212982178} 11/07/2021 02:01:08 - INFO - __main__ - Step 33887: {'lr': 0.0004448884222393559, 'samples': 6506304, 'steps': 33886, 'loss/train': 1.4478570222854614} 11/07/2021 02:01:08 - INFO - __main__ - Step 33888: {'lr': 0.00044488509839355183, 'samples': 6506496, 'steps': 33887, 'loss/train': 1.1947063207626343} 11/07/2021 02:01:09 - INFO - __main__ - Step 33889: {'lr': 0.00044488177445993563, 'samples': 6506688, 'steps': 33888, 'loss/train': 1.2288641929626465} 11/07/2021 02:01:09 - INFO - __main__ - Step 33890: {'lr': 0.0004448784504385086, 'samples': 6506880, 'steps': 33889, 'loss/train': 1.3459579944610596} 11/07/2021 02:01:10 - INFO - __main__ - Step 33891: {'lr': 0.0004448751263292724, 'samples': 6507072, 'steps': 33890, 'loss/train': 1.4274922609329224} 11/07/2021 02:01:11 - INFO - __main__ - Step 33892: {'lr': 0.0004448718021322285, 'samples': 6507264, 'steps': 33891, 'loss/train': 1.4666961431503296} 11/07/2021 02:01:11 - INFO - __main__ - Step 33893: {'lr': 0.0004448684778473784, 'samples': 6507456, 'steps': 33892, 'loss/train': 1.6061058044433594} 11/07/2021 02:01:11 - INFO - __main__ - Step 33894: {'lr': 0.0004448651534747235, 'samples': 6507648, 'steps': 33893, 'loss/train': 1.383569359779358} 11/07/2021 02:01:12 - INFO - __main__ - Step 33895: {'lr': 0.0004448618290142654, 'samples': 6507840, 'steps': 33894, 'loss/train': 1.421039342880249} 11/07/2021 02:01:13 - INFO - __main__ - Step 33896: {'lr': 0.0004448585044660055, 'samples': 6508032, 'steps': 33895, 'loss/train': 0.20409615337848663} 11/07/2021 02:01:13 - INFO - __main__ - Step 33897: {'lr': 0.0004448551798299455, 'samples': 6508224, 'steps': 33896, 'loss/train': 1.7236816883087158} 11/07/2021 02:01:13 - INFO - __main__ - Step 33898: {'lr': 0.00044485185510608665, 'samples': 6508416, 'steps': 33897, 'loss/train': 1.4264075756072998} 11/07/2021 02:01:14 - INFO - __main__ - Step 33899: {'lr': 0.0004448485302944306, 'samples': 6508608, 'steps': 33898, 'loss/train': 1.5501539707183838} 11/07/2021 02:01:14 - INFO - __main__ - Step 33900: {'lr': 0.0004448452053949789, 'samples': 6508800, 'steps': 33899, 'loss/train': 1.3028892278671265} 11/07/2021 02:01:14 - INFO - __main__ - Step 33901: {'lr': 0.0004448418804077328, 'samples': 6508992, 'steps': 33900, 'loss/train': 1.5605762004852295} 11/07/2021 02:01:16 - INFO - __main__ - Step 33902: {'lr': 0.000444838555332694, 'samples': 6509184, 'steps': 33901, 'loss/train': 1.3784602880477905} 11/07/2021 02:01:16 - INFO - __main__ - Step 33903: {'lr': 0.000444835230169864, 'samples': 6509376, 'steps': 33902, 'loss/train': 2.122793674468994} 11/07/2021 02:01:16 - INFO - __main__ - Step 33904: {'lr': 0.00044483190491924427, 'samples': 6509568, 'steps': 33903, 'loss/train': 1.8116071224212646} 11/07/2021 02:01:17 - INFO - __main__ - Step 33905: {'lr': 0.0004448285795808362, 'samples': 6509760, 'steps': 33904, 'loss/train': 1.4722168445587158} 11/07/2021 02:01:17 - INFO - __main__ - Step 33906: {'lr': 0.00044482525415464144, 'samples': 6509952, 'steps': 33905, 'loss/train': 1.5740023851394653} 11/07/2021 02:01:18 - INFO - __main__ - Step 33907: {'lr': 0.0004448219286406614, 'samples': 6510144, 'steps': 33906, 'loss/train': 1.7123525142669678} 11/07/2021 02:01:18 - INFO - __main__ - Step 33908: {'lr': 0.00044481860303889766, 'samples': 6510336, 'steps': 33907, 'loss/train': 1.1449620723724365} 11/07/2021 02:01:19 - INFO - __main__ - Step 33909: {'lr': 0.0004448152773493516, 'samples': 6510528, 'steps': 33908, 'loss/train': 1.907379150390625} 11/07/2021 02:01:19 - INFO - __main__ - Step 33910: {'lr': 0.0004448119515720248, 'samples': 6510720, 'steps': 33909, 'loss/train': 1.6828466653823853} 11/07/2021 02:01:19 - INFO - __main__ - Step 33911: {'lr': 0.0004448086257069187, 'samples': 6510912, 'steps': 33910, 'loss/train': 1.4255051612854004} 11/07/2021 02:01:20 - INFO - __main__ - Step 33912: {'lr': 0.00044480529975403496, 'samples': 6511104, 'steps': 33911, 'loss/train': 1.3303273916244507} 11/07/2021 02:01:21 - INFO - __main__ - Step 33913: {'lr': 0.00044480197371337484, 'samples': 6511296, 'steps': 33912, 'loss/train': 0.7251231670379639} 11/07/2021 02:01:21 - INFO - __main__ - Step 33914: {'lr': 0.00044479864758494004, 'samples': 6511488, 'steps': 33913, 'loss/train': 1.377323865890503} 11/07/2021 02:01:22 - INFO - __main__ - Step 33915: {'lr': 0.0004447953213687319, 'samples': 6511680, 'steps': 33914, 'loss/train': 1.4334322214126587} 11/07/2021 02:01:22 - INFO - __main__ - Step 33916: {'lr': 0.00044479199506475205, 'samples': 6511872, 'steps': 33915, 'loss/train': 1.859991192817688} 11/07/2021 02:01:23 - INFO - __main__ - Step 33917: {'lr': 0.0004447886686730019, 'samples': 6512064, 'steps': 33916, 'loss/train': 1.1730103492736816} 11/07/2021 02:01:23 - INFO - __main__ - Step 33918: {'lr': 0.00044478534219348297, 'samples': 6512256, 'steps': 33917, 'loss/train': 1.6439616680145264} 11/07/2021 02:01:24 - INFO - __main__ - Step 33919: {'lr': 0.0004447820156261968, 'samples': 6512448, 'steps': 33918, 'loss/train': 1.3017641305923462} 11/07/2021 02:01:24 - INFO - __main__ - Step 33920: {'lr': 0.0004447786889711449, 'samples': 6512640, 'steps': 33919, 'loss/train': 1.3927388191223145} 11/07/2021 02:01:24 - INFO - __main__ - Step 33921: {'lr': 0.00044477536222832867, 'samples': 6512832, 'steps': 33920, 'loss/train': 1.7710654735565186} 11/07/2021 02:01:25 - INFO - __main__ - Step 33922: {'lr': 0.0004447720353977497, 'samples': 6513024, 'steps': 33921, 'loss/train': 1.3544845581054688} 11/07/2021 02:01:26 - INFO - __main__ - Step 33923: {'lr': 0.0004447687084794094, 'samples': 6513216, 'steps': 33922, 'loss/train': 1.0227186679840088} 11/07/2021 02:01:26 - INFO - __main__ - Step 33924: {'lr': 0.00044476538147330934, 'samples': 6513408, 'steps': 33923, 'loss/train': 7.762441635131836} 11/07/2021 02:01:27 - INFO - __main__ - Step 33925: {'lr': 0.00044476205437945105, 'samples': 6513600, 'steps': 33924, 'loss/train': 1.4574618339538574} 11/07/2021 02:01:27 - INFO - __main__ - Step 33926: {'lr': 0.0004447587271978359, 'samples': 6513792, 'steps': 33925, 'loss/train': 3.2924323081970215} 11/07/2021 02:01:27 - INFO - __main__ - Step 33927: {'lr': 0.0004447553999284656, 'samples': 6513984, 'steps': 33926, 'loss/train': 5.801220417022705} 11/07/2021 02:01:28 - INFO - __main__ - Step 33928: {'lr': 0.00044475207257134143, 'samples': 6514176, 'steps': 33927, 'loss/train': 1.7115229368209839} 11/07/2021 02:01:29 - INFO - __main__ - Step 33929: {'lr': 0.000444748745126465, 'samples': 6514368, 'steps': 33928, 'loss/train': 1.2595962285995483} 11/07/2021 02:01:29 - INFO - __main__ - Step 33930: {'lr': 0.0004447454175938378, 'samples': 6514560, 'steps': 33929, 'loss/train': 1.2194738388061523} 11/07/2021 02:01:29 - INFO - __main__ - Step 33931: {'lr': 0.00044474208997346133, 'samples': 6514752, 'steps': 33930, 'loss/train': 1.2058517932891846} 11/07/2021 02:01:30 - INFO - __main__ - Step 33932: {'lr': 0.00044473876226533703, 'samples': 6514944, 'steps': 33931, 'loss/train': 1.2794572114944458} 11/07/2021 02:01:30 - INFO - __main__ - Step 33933: {'lr': 0.0004447354344694665, 'samples': 6515136, 'steps': 33932, 'loss/train': 1.6853723526000977} 11/07/2021 02:01:31 - INFO - __main__ - Step 33934: {'lr': 0.0004447321065858512, 'samples': 6515328, 'steps': 33933, 'loss/train': 1.336300253868103} 11/07/2021 02:01:31 - INFO - __main__ - Step 33935: {'lr': 0.00044472877861449257, 'samples': 6515520, 'steps': 33934, 'loss/train': 1.6641114950180054} 11/07/2021 02:01:32 - INFO - __main__ - Step 33936: {'lr': 0.00044472545055539213, 'samples': 6515712, 'steps': 33935, 'loss/train': 1.3842114210128784} 11/07/2021 02:01:32 - INFO - __main__ - Step 33937: {'lr': 0.00044472212240855155, 'samples': 6515904, 'steps': 33936, 'loss/train': 1.6173049211502075} 11/07/2021 02:01:33 - INFO - __main__ - Step 33938: {'lr': 0.0004447187941739721, 'samples': 6516096, 'steps': 33937, 'loss/train': 1.6227734088897705} 11/07/2021 02:01:34 - INFO - __main__ - Step 33939: {'lr': 0.00044471546585165536, 'samples': 6516288, 'steps': 33938, 'loss/train': 1.2357654571533203} 11/07/2021 02:01:34 - INFO - __main__ - Step 33940: {'lr': 0.0004447121374416028, 'samples': 6516480, 'steps': 33939, 'loss/train': 1.5931684970855713} 11/07/2021 02:01:34 - INFO - __main__ - Step 33941: {'lr': 0.000444708808943816, 'samples': 6516672, 'steps': 33940, 'loss/train': 1.5928783416748047} 11/07/2021 02:01:35 - INFO - __main__ - Step 33942: {'lr': 0.00044470548035829637, 'samples': 6516864, 'steps': 33941, 'loss/train': 1.4349801540374756} 11/07/2021 02:01:35 - INFO - __main__ - Step 33943: {'lr': 0.00044470215168504554, 'samples': 6517056, 'steps': 33942, 'loss/train': 1.7674825191497803} 11/07/2021 02:01:36 - INFO - __main__ - Step 33944: {'lr': 0.0004446988229240648, 'samples': 6517248, 'steps': 33943, 'loss/train': 1.7986441850662231} 11/07/2021 02:01:36 - INFO - __main__ - Step 33945: {'lr': 0.00044469549407535593, 'samples': 6517440, 'steps': 33944, 'loss/train': 1.3439016342163086} 11/07/2021 02:01:37 - INFO - __main__ - Step 33946: {'lr': 0.0004446921651389202, 'samples': 6517632, 'steps': 33945, 'loss/train': 1.4017547369003296} 11/07/2021 02:01:37 - INFO - __main__ - Step 33947: {'lr': 0.00044468883611475913, 'samples': 6517824, 'steps': 33946, 'loss/train': 1.85745370388031} 11/07/2021 02:01:37 - INFO - __main__ - Step 33948: {'lr': 0.00044468550700287436, 'samples': 6518016, 'steps': 33947, 'loss/train': 0.9758426547050476} 11/07/2021 02:01:38 - INFO - __main__ - Step 33949: {'lr': 0.00044468217780326724, 'samples': 6518208, 'steps': 33948, 'loss/train': 1.5032635927200317} 11/07/2021 02:01:39 - INFO - __main__ - Step 33950: {'lr': 0.0004446788485159393, 'samples': 6518400, 'steps': 33949, 'loss/train': 1.3904517889022827} 11/07/2021 02:01:39 - INFO - __main__ - Step 33951: {'lr': 0.00044467551914089223, 'samples': 6518592, 'steps': 33950, 'loss/train': 2.2070465087890625} 11/07/2021 02:01:39 - INFO - __main__ - Step 33952: {'lr': 0.0004446721896781273, 'samples': 6518784, 'steps': 33951, 'loss/train': 1.8837158679962158} 11/07/2021 02:01:40 - INFO - __main__ - Step 33953: {'lr': 0.00044466886012764603, 'samples': 6518976, 'steps': 33952, 'loss/train': 1.2247635126113892} 11/07/2021 02:01:41 - INFO - __main__ - Step 33954: {'lr': 0.00044466553048944996, 'samples': 6519168, 'steps': 33953, 'loss/train': 1.437774419784546} 11/07/2021 02:01:41 - INFO - __main__ - Step 33955: {'lr': 0.0004446622007635407, 'samples': 6519360, 'steps': 33954, 'loss/train': 1.6104636192321777} 11/07/2021 02:01:42 - INFO - __main__ - Step 33956: {'lr': 0.0004446588709499196, 'samples': 6519552, 'steps': 33955, 'loss/train': 1.6318778991699219} 11/07/2021 02:01:42 - INFO - __main__ - Step 33957: {'lr': 0.00044465554104858817, 'samples': 6519744, 'steps': 33956, 'loss/train': 1.220327377319336} 11/07/2021 02:01:42 - INFO - __main__ - Step 33958: {'lr': 0.0004446522110595481, 'samples': 6519936, 'steps': 33957, 'loss/train': 1.54439377784729} 11/07/2021 02:01:43 - INFO - __main__ - Step 33959: {'lr': 0.00044464888098280067, 'samples': 6520128, 'steps': 33958, 'loss/train': 1.7272281646728516} 11/07/2021 02:01:44 - INFO - __main__ - Step 33960: {'lr': 0.00044464555081834745, 'samples': 6520320, 'steps': 33959, 'loss/train': 1.3974066972732544} 11/07/2021 02:01:44 - INFO - __main__ - Step 33961: {'lr': 0.00044464222056618996, 'samples': 6520512, 'steps': 33960, 'loss/train': 1.8621840476989746} 11/07/2021 02:01:44 - INFO - __main__ - Step 33962: {'lr': 0.00044463889022632963, 'samples': 6520704, 'steps': 33961, 'loss/train': 1.528019666671753} 11/07/2021 02:01:45 - INFO - __main__ - Step 33963: {'lr': 0.0004446355597987681, 'samples': 6520896, 'steps': 33962, 'loss/train': 1.360838770866394} 11/07/2021 02:01:46 - INFO - __main__ - Step 33964: {'lr': 0.00044463222928350677, 'samples': 6521088, 'steps': 33963, 'loss/train': 1.8312397003173828} 11/07/2021 02:01:46 - INFO - __main__ - Step 33965: {'lr': 0.0004446288986805471, 'samples': 6521280, 'steps': 33964, 'loss/train': 1.583130955696106} 11/07/2021 02:01:46 - INFO - __main__ - Step 33966: {'lr': 0.0004446255679898907, 'samples': 6521472, 'steps': 33965, 'loss/train': 1.6411830186843872} 11/07/2021 02:01:47 - INFO - __main__ - Step 33967: {'lr': 0.000444622237211539, 'samples': 6521664, 'steps': 33966, 'loss/train': 1.4181243181228638} 11/07/2021 02:01:47 - INFO - __main__ - Step 33968: {'lr': 0.00044461890634549364, 'samples': 6521856, 'steps': 33967, 'loss/train': 1.8298888206481934} 11/07/2021 02:01:49 - INFO - __main__ - Step 33969: {'lr': 0.00044461557539175587, 'samples': 6522048, 'steps': 33968, 'loss/train': 1.7349551916122437} 11/07/2021 02:01:49 - INFO - __main__ - Step 33970: {'lr': 0.0004446122443503274, 'samples': 6522240, 'steps': 33969, 'loss/train': 1.7679076194763184} 11/07/2021 02:01:49 - INFO - __main__ - Step 33971: {'lr': 0.00044460891322120963, 'samples': 6522432, 'steps': 33970, 'loss/train': 1.5789631605148315} 11/07/2021 02:01:50 - INFO - __main__ - Step 33972: {'lr': 0.000444605582004404, 'samples': 6522624, 'steps': 33971, 'loss/train': 0.24429169297218323} 11/07/2021 02:01:50 - INFO - __main__ - Step 33973: {'lr': 0.0004446022506999122, 'samples': 6522816, 'steps': 33972, 'loss/train': 1.216179370880127} 11/07/2021 02:01:51 - INFO - __main__ - Step 33974: {'lr': 0.0004445989193077356, 'samples': 6523008, 'steps': 33973, 'loss/train': 1.6011016368865967} 11/07/2021 02:01:51 - INFO - __main__ - Step 33975: {'lr': 0.0004445955878278758, 'samples': 6523200, 'steps': 33974, 'loss/train': 2.066823720932007} 11/07/2021 02:01:52 - INFO - __main__ - Step 33976: {'lr': 0.00044459225626033413, 'samples': 6523392, 'steps': 33975, 'loss/train': 1.8326551914215088} 11/07/2021 02:01:52 - INFO - __main__ - Step 33977: {'lr': 0.00044458892460511225, 'samples': 6523584, 'steps': 33976, 'loss/train': 1.4237685203552246} 11/07/2021 02:01:52 - INFO - __main__ - Step 33978: {'lr': 0.0004445855928622116, 'samples': 6523776, 'steps': 33977, 'loss/train': 1.754783272743225} 11/07/2021 02:01:53 - INFO - __main__ - Step 33979: {'lr': 0.00044458226103163365, 'samples': 6523968, 'steps': 33978, 'loss/train': 1.2844159603118896} 11/07/2021 02:01:54 - INFO - __main__ - Step 33980: {'lr': 0.0004445789291133799, 'samples': 6524160, 'steps': 33979, 'loss/train': 1.2192468643188477} 11/07/2021 02:01:54 - INFO - __main__ - Step 33981: {'lr': 0.0004445755971074519, 'samples': 6524352, 'steps': 33980, 'loss/train': 1.436919093132019} 11/07/2021 02:01:54 - INFO - __main__ - Step 33982: {'lr': 0.0004445722650138512, 'samples': 6524544, 'steps': 33981, 'loss/train': 1.6285544633865356} 11/07/2021 02:01:55 - INFO - __main__ - Step 33983: {'lr': 0.00044456893283257925, 'samples': 6524736, 'steps': 33982, 'loss/train': 1.518558144569397} 11/07/2021 02:01:55 - INFO - __main__ - Step 33984: {'lr': 0.00044456560056363746, 'samples': 6524928, 'steps': 33983, 'loss/train': 1.6287636756896973} 11/07/2021 02:01:56 - INFO - __main__ - Step 33985: {'lr': 0.0004445622682070275, 'samples': 6525120, 'steps': 33984, 'loss/train': 1.4861018657684326} 11/07/2021 02:01:56 - INFO - __main__ - Step 33986: {'lr': 0.00044455893576275077, 'samples': 6525312, 'steps': 33985, 'loss/train': 1.591300129890442} 11/07/2021 02:01:57 - INFO - __main__ - Step 33987: {'lr': 0.00044455560323080874, 'samples': 6525504, 'steps': 33986, 'loss/train': 1.9785436391830444} 11/07/2021 02:01:57 - INFO - __main__ - Step 33988: {'lr': 0.00044455227061120296, 'samples': 6525696, 'steps': 33987, 'loss/train': 1.6648945808410645} 11/07/2021 02:01:57 - INFO - __main__ - Step 33989: {'lr': 0.000444548937903935, 'samples': 6525888, 'steps': 33988, 'loss/train': 1.4953155517578125} 11/07/2021 02:01:58 - INFO - __main__ - Step 33990: {'lr': 0.0004445456051090062, 'samples': 6526080, 'steps': 33989, 'loss/train': 1.3475840091705322} 11/07/2021 02:01:59 - INFO - __main__ - Step 33991: {'lr': 0.0004445422722264182, 'samples': 6526272, 'steps': 33990, 'loss/train': 1.6253974437713623} 11/07/2021 02:01:59 - INFO - __main__ - Step 33992: {'lr': 0.0004445389392561724, 'samples': 6526464, 'steps': 33991, 'loss/train': 1.6526176929473877} 11/07/2021 02:01:59 - INFO - __main__ - Step 33993: {'lr': 0.0004445356061982704, 'samples': 6526656, 'steps': 33992, 'loss/train': 1.5785486698150635} 11/07/2021 02:02:00 - INFO - __main__ - Step 33994: {'lr': 0.0004445322730527137, 'samples': 6526848, 'steps': 33993, 'loss/train': 1.6258047819137573} 11/07/2021 02:02:01 - INFO - __main__ - Step 33995: {'lr': 0.0004445289398195037, 'samples': 6527040, 'steps': 33994, 'loss/train': 1.8948335647583008} 11/07/2021 02:02:01 - INFO - __main__ - Step 33996: {'lr': 0.000444525606498642, 'samples': 6527232, 'steps': 33995, 'loss/train': 1.3909330368041992} 11/07/2021 02:02:02 - INFO - __main__ - Step 33997: {'lr': 0.00044452227309013003, 'samples': 6527424, 'steps': 33996, 'loss/train': 1.4994609355926514} 11/07/2021 02:02:02 - INFO - __main__ - Step 33998: {'lr': 0.0004445189395939694, 'samples': 6527616, 'steps': 33997, 'loss/train': 1.4667303562164307} 11/07/2021 02:02:02 - INFO - __main__ - Step 33999: {'lr': 0.0004445156060101614, 'samples': 6527808, 'steps': 33998, 'loss/train': 1.8702677488327026} 11/07/2021 02:02:03 - INFO - __main__ - Step 34000: {'lr': 0.0004445122723387077, 'samples': 6528000, 'steps': 33999, 'loss/train': 1.4197800159454346} 11/07/2021 02:02:04 - INFO - __main__ - Step 34001: {'lr': 0.0004445089385796099, 'samples': 6528192, 'steps': 34000, 'loss/train': 1.6524118185043335} 11/07/2021 02:02:04 - INFO - __main__ - Step 34002: {'lr': 0.0004445056047328693, 'samples': 6528384, 'steps': 34001, 'loss/train': 1.7949652671813965} 11/07/2021 02:02:05 - INFO - __main__ - Step 34003: {'lr': 0.0004445022707984874, 'samples': 6528576, 'steps': 34002, 'loss/train': 0.221288800239563} 11/07/2021 02:02:05 - INFO - __main__ - Step 34004: {'lr': 0.0004444989367764659, 'samples': 6528768, 'steps': 34003, 'loss/train': 1.1219404935836792} 11/07/2021 02:02:06 - INFO - __main__ - Step 34005: {'lr': 0.0004444956026668061, 'samples': 6528960, 'steps': 34004, 'loss/train': 1.2909691333770752} 11/07/2021 02:02:07 - INFO - __main__ - Step 34006: {'lr': 0.00044449226846950964, 'samples': 6529152, 'steps': 34005, 'loss/train': 0.9231574535369873} 11/07/2021 02:02:07 - INFO - __main__ - Step 34007: {'lr': 0.00044448893418457794, 'samples': 6529344, 'steps': 34006, 'loss/train': 2.1114120483398438} 11/07/2021 02:02:07 - INFO - __main__ - Step 34008: {'lr': 0.00044448559981201256, 'samples': 6529536, 'steps': 34007, 'loss/train': 1.8139729499816895} 11/07/2021 02:02:08 - INFO - __main__ - Step 34009: {'lr': 0.00044448226535181485, 'samples': 6529728, 'steps': 34008, 'loss/train': 1.598117470741272} 11/07/2021 02:02:08 - INFO - __main__ - Step 34010: {'lr': 0.0004444789308039865, 'samples': 6529920, 'steps': 34009, 'loss/train': 2.2077717781066895} 11/07/2021 02:02:09 - INFO - __main__ - Step 34011: {'lr': 0.00044447559616852893, 'samples': 6530112, 'steps': 34010, 'loss/train': 1.6955511569976807} 11/07/2021 02:02:09 - INFO - __main__ - Step 34012: {'lr': 0.0004444722614454437, 'samples': 6530304, 'steps': 34011, 'loss/train': 1.916416049003601} 11/07/2021 02:02:10 - INFO - __main__ - Step 34013: {'lr': 0.00044446892663473227, 'samples': 6530496, 'steps': 34012, 'loss/train': 1.7840545177459717} 11/07/2021 02:02:10 - INFO - __main__ - Step 34014: {'lr': 0.0004444655917363961, 'samples': 6530688, 'steps': 34013, 'loss/train': 1.3438079357147217} 11/07/2021 02:02:10 - INFO - __main__ - Step 34015: {'lr': 0.00044446225675043684, 'samples': 6530880, 'steps': 34014, 'loss/train': 2.495532751083374} 11/07/2021 02:02:11 - INFO - __main__ - Step 34016: {'lr': 0.0004444589216768558, 'samples': 6531072, 'steps': 34015, 'loss/train': 1.3623573780059814} 11/07/2021 02:02:12 - INFO - __main__ - Step 34017: {'lr': 0.0004444555865156545, 'samples': 6531264, 'steps': 34016, 'loss/train': 1.2433745861053467} 11/07/2021 02:02:12 - INFO - __main__ - Step 34018: {'lr': 0.0004444522512668346, 'samples': 6531456, 'steps': 34017, 'loss/train': 1.8973829746246338} 11/07/2021 02:02:13 - INFO - __main__ - Step 34019: {'lr': 0.0004444489159303976, 'samples': 6531648, 'steps': 34018, 'loss/train': 1.5133620500564575} 11/07/2021 02:02:13 - INFO - __main__ - Step 34020: {'lr': 0.0004444455805063448, 'samples': 6531840, 'steps': 34019, 'loss/train': 1.4161401987075806} 11/07/2021 02:02:14 - INFO - __main__ - Step 34021: {'lr': 0.00044444224499467784, 'samples': 6532032, 'steps': 34020, 'loss/train': 1.7862776517868042} 11/07/2021 02:02:14 - INFO - __main__ - Step 34022: {'lr': 0.0004444389093953982, 'samples': 6532224, 'steps': 34021, 'loss/train': 1.5086743831634521} 11/07/2021 02:02:15 - INFO - __main__ - Step 34023: {'lr': 0.00044443557370850743, 'samples': 6532416, 'steps': 34022, 'loss/train': 1.3057926893234253} 11/07/2021 02:02:15 - INFO - __main__ - Step 34024: {'lr': 0.00044443223793400695, 'samples': 6532608, 'steps': 34023, 'loss/train': 1.6543852090835571} 11/07/2021 02:02:16 - INFO - __main__ - Step 34025: {'lr': 0.0004444289020718983, 'samples': 6532800, 'steps': 34024, 'loss/train': 1.9857542514801025} 11/07/2021 02:02:17 - INFO - __main__ - Step 34026: {'lr': 0.000444425566122183, 'samples': 6532992, 'steps': 34025, 'loss/train': 0.9507812261581421} 11/07/2021 02:02:17 - INFO - __main__ - Step 34027: {'lr': 0.0004444222300848626, 'samples': 6533184, 'steps': 34026, 'loss/train': 1.056647777557373} 11/07/2021 02:02:17 - INFO - __main__ - Step 34028: {'lr': 0.00044441889395993844, 'samples': 6533376, 'steps': 34027, 'loss/train': 1.6833502054214478} 11/07/2021 02:02:18 - INFO - __main__ - Step 34029: {'lr': 0.00044441555774741215, 'samples': 6533568, 'steps': 34028, 'loss/train': 1.3435801267623901} 11/07/2021 02:02:18 - INFO - __main__ - Step 34030: {'lr': 0.00044441222144728525, 'samples': 6533760, 'steps': 34029, 'loss/train': 1.3523625135421753} 11/07/2021 02:02:19 - INFO - __main__ - Step 34031: {'lr': 0.00044440888505955926, 'samples': 6533952, 'steps': 34030, 'loss/train': 2.1320152282714844} 11/07/2021 02:02:19 - INFO - __main__ - Step 34032: {'lr': 0.00044440554858423553, 'samples': 6534144, 'steps': 34031, 'loss/train': 1.4462076425552368} 11/07/2021 02:02:20 - INFO - __main__ - Step 34033: {'lr': 0.0004444022120213157, 'samples': 6534336, 'steps': 34032, 'loss/train': 1.2795424461364746} 11/07/2021 02:02:20 - INFO - __main__ - Step 34034: {'lr': 0.00044439887537080116, 'samples': 6534528, 'steps': 34033, 'loss/train': 1.4343459606170654} 11/07/2021 02:02:20 - INFO - __main__ - Step 34035: {'lr': 0.00044439553863269356, 'samples': 6534720, 'steps': 34034, 'loss/train': 1.7785248756408691} 11/07/2021 02:02:21 - INFO - __main__ - Step 34036: {'lr': 0.00044439220180699434, 'samples': 6534912, 'steps': 34035, 'loss/train': 1.6420665979385376} 11/07/2021 02:02:22 - INFO - __main__ - Step 34037: {'lr': 0.00044438886489370493, 'samples': 6535104, 'steps': 34036, 'loss/train': 1.4897013902664185} 11/07/2021 02:02:22 - INFO - __main__ - Step 34038: {'lr': 0.00044438552789282694, 'samples': 6535296, 'steps': 34037, 'loss/train': 1.4812804460525513} 11/07/2021 02:02:22 - INFO - __main__ - Step 34039: {'lr': 0.00044438219080436184, 'samples': 6535488, 'steps': 34038, 'loss/train': 1.7232627868652344} 11/07/2021 02:02:23 - INFO - __main__ - Step 34040: {'lr': 0.0004443788536283111, 'samples': 6535680, 'steps': 34039, 'loss/train': 1.4123303890228271} 11/07/2021 02:02:23 - INFO - __main__ - Step 34041: {'lr': 0.0004443755163646762, 'samples': 6535872, 'steps': 34040, 'loss/train': 1.4441375732421875} 11/07/2021 02:02:24 - INFO - __main__ - Step 34042: {'lr': 0.00044437217901345885, 'samples': 6536064, 'steps': 34041, 'loss/train': 1.9585773944854736} 11/07/2021 02:02:24 - INFO - __main__ - Step 34043: {'lr': 0.0004443688415746602, 'samples': 6536256, 'steps': 34042, 'loss/train': 0.7225326895713806} 11/07/2021 02:02:25 - INFO - __main__ - Step 34044: {'lr': 0.00044436550404828207, 'samples': 6536448, 'steps': 34043, 'loss/train': 1.2998487949371338} 11/07/2021 02:02:25 - INFO - __main__ - Step 34045: {'lr': 0.0004443621664343258, 'samples': 6536640, 'steps': 34044, 'loss/train': 1.6461623907089233} 11/07/2021 02:02:25 - INFO - __main__ - Step 34046: {'lr': 0.000444358828732793, 'samples': 6536832, 'steps': 34045, 'loss/train': 1.1608872413635254} 11/07/2021 02:02:27 - INFO - __main__ - Step 34047: {'lr': 0.000444355490943685, 'samples': 6537024, 'steps': 34046, 'loss/train': 1.3167895078659058} 11/07/2021 02:02:27 - INFO - __main__ - Step 34048: {'lr': 0.0004443521530670035, 'samples': 6537216, 'steps': 34047, 'loss/train': 1.7646113634109497} 11/07/2021 02:02:27 - INFO - __main__ - Step 34049: {'lr': 0.00044434881510274995, 'samples': 6537408, 'steps': 34048, 'loss/train': 1.6775469779968262} 11/07/2021 02:02:28 - INFO - __main__ - Step 34050: {'lr': 0.00044434547705092574, 'samples': 6537600, 'steps': 34049, 'loss/train': 1.299673318862915} 11/07/2021 02:02:28 - INFO - __main__ - Step 34051: {'lr': 0.0004443421389115325, 'samples': 6537792, 'steps': 34050, 'loss/train': 1.6322948932647705} 11/07/2021 02:02:29 - INFO - __main__ - Step 34052: {'lr': 0.00044433880068457166, 'samples': 6537984, 'steps': 34051, 'loss/train': 1.6336530447006226} 11/07/2021 02:02:29 - INFO - __main__ - Step 34053: {'lr': 0.0004443354623700447, 'samples': 6538176, 'steps': 34052, 'loss/train': 1.146722435951233} 11/07/2021 02:02:30 - INFO - __main__ - Step 34054: {'lr': 0.0004443321239679533, 'samples': 6538368, 'steps': 34053, 'loss/train': 1.7375221252441406} 11/07/2021 02:02:30 - INFO - __main__ - Step 34055: {'lr': 0.0004443287854782988, 'samples': 6538560, 'steps': 34054, 'loss/train': 1.0673551559448242} 11/07/2021 02:02:30 - INFO - __main__ - Step 34056: {'lr': 0.0004443254469010828, 'samples': 6538752, 'steps': 34055, 'loss/train': 1.014756441116333} 11/07/2021 02:02:31 - INFO - __main__ - Step 34057: {'lr': 0.0004443221082363067, 'samples': 6538944, 'steps': 34056, 'loss/train': 1.288285255432129} 11/07/2021 02:02:32 - INFO - __main__ - Step 34058: {'lr': 0.000444318769483972, 'samples': 6539136, 'steps': 34057, 'loss/train': 1.7834769487380981} 11/07/2021 02:02:32 - INFO - __main__ - Step 34059: {'lr': 0.0004443154306440803, 'samples': 6539328, 'steps': 34058, 'loss/train': 1.7867774963378906} 11/07/2021 02:02:32 - INFO - __main__ - Step 34060: {'lr': 0.00044431209171663313, 'samples': 6539520, 'steps': 34059, 'loss/train': 0.7567717432975769} 11/07/2021 02:02:33 - INFO - __main__ - Step 34061: {'lr': 0.00044430875270163185, 'samples': 6539712, 'steps': 34060, 'loss/train': 1.6711994409561157} 11/07/2021 02:02:33 - INFO - __main__ - Step 34062: {'lr': 0.00044430541359907804, 'samples': 6539904, 'steps': 34061, 'loss/train': 1.6374399662017822} 11/07/2021 02:02:34 - INFO - __main__ - Step 34063: {'lr': 0.0004443020744089733, 'samples': 6540096, 'steps': 34062, 'loss/train': 1.5656423568725586} 11/07/2021 02:02:35 - INFO - __main__ - Step 34064: {'lr': 0.00044429873513131897, 'samples': 6540288, 'steps': 34063, 'loss/train': 3.7890005111694336} 11/07/2021 02:02:35 - INFO - __main__ - Step 34065: {'lr': 0.00044429539576611664, 'samples': 6540480, 'steps': 34064, 'loss/train': 1.0325992107391357} 11/07/2021 02:02:35 - INFO - __main__ - Step 34066: {'lr': 0.0004442920563133678, 'samples': 6540672, 'steps': 34065, 'loss/train': 0.26503893733024597} 11/07/2021 02:02:36 - INFO - __main__ - Step 34067: {'lr': 0.000444288716773074, 'samples': 6540864, 'steps': 34066, 'loss/train': 1.4189012050628662} 11/07/2021 02:02:37 - INFO - __main__ - Step 34068: {'lr': 0.00044428537714523664, 'samples': 6541056, 'steps': 34067, 'loss/train': 1.4169859886169434} 11/07/2021 02:02:37 - INFO - __main__ - Step 34069: {'lr': 0.00044428203742985734, 'samples': 6541248, 'steps': 34068, 'loss/train': 1.668953776359558} 11/07/2021 02:02:37 - INFO - __main__ - Step 34070: {'lr': 0.0004442786976269375, 'samples': 6541440, 'steps': 34069, 'loss/train': 1.685661792755127} 11/07/2021 02:02:38 - INFO - __main__ - Step 34071: {'lr': 0.0004442753577364788, 'samples': 6541632, 'steps': 34070, 'loss/train': 2.0190627574920654} 11/07/2021 02:02:38 - INFO - __main__ - Step 34072: {'lr': 0.00044427201775848246, 'samples': 6541824, 'steps': 34071, 'loss/train': 1.4158926010131836} 11/07/2021 02:02:39 - INFO - __main__ - Step 34073: {'lr': 0.0004442686776929502, 'samples': 6542016, 'steps': 34072, 'loss/train': 1.2631921768188477} 11/07/2021 02:02:39 - INFO - __main__ - Step 34074: {'lr': 0.0004442653375398835, 'samples': 6542208, 'steps': 34073, 'loss/train': 1.4398088455200195} 11/07/2021 02:02:40 - INFO - __main__ - Step 34075: {'lr': 0.0004442619972992838, 'samples': 6542400, 'steps': 34074, 'loss/train': 1.88358473777771} 11/07/2021 02:02:40 - INFO - __main__ - Step 34076: {'lr': 0.00044425865697115266, 'samples': 6542592, 'steps': 34075, 'loss/train': 0.38768935203552246} 11/07/2021 02:02:40 - INFO - __main__ - Step 34077: {'lr': 0.00044425531655549157, 'samples': 6542784, 'steps': 34076, 'loss/train': 0.8294751048088074} 11/07/2021 02:02:42 - INFO - __main__ - Step 34078: {'lr': 0.0004442519760523021, 'samples': 6542976, 'steps': 34077, 'loss/train': 1.6006672382354736} 11/07/2021 02:02:42 - INFO - __main__ - Step 34079: {'lr': 0.00044424863546158554, 'samples': 6543168, 'steps': 34078, 'loss/train': 1.5854127407073975} 11/07/2021 02:02:42 - INFO - __main__ - Step 34080: {'lr': 0.00044424529478334364, 'samples': 6543360, 'steps': 34079, 'loss/train': 1.4523556232452393} 11/07/2021 02:02:43 - INFO - __main__ - Step 34081: {'lr': 0.0004442419540175778, 'samples': 6543552, 'steps': 34080, 'loss/train': 1.5299001932144165} 11/07/2021 02:02:43 - INFO - __main__ - Step 34082: {'lr': 0.0004442386131642895, 'samples': 6543744, 'steps': 34081, 'loss/train': 1.4755926132202148} 11/07/2021 02:02:43 - INFO - __main__ - Step 34083: {'lr': 0.0004442352722234803, 'samples': 6543936, 'steps': 34082, 'loss/train': 1.3150346279144287} 11/07/2021 02:02:44 - INFO - __main__ - Step 34084: {'lr': 0.0004442319311951517, 'samples': 6544128, 'steps': 34083, 'loss/train': 2.093810558319092} 11/07/2021 02:02:45 - INFO - __main__ - Step 34085: {'lr': 0.00044422859007930515, 'samples': 6544320, 'steps': 34084, 'loss/train': 0.9736217260360718} 11/07/2021 02:02:45 - INFO - __main__ - Step 34086: {'lr': 0.00044422524887594223, 'samples': 6544512, 'steps': 34085, 'loss/train': 1.3088901042938232} 11/07/2021 02:02:45 - INFO - __main__ - Step 34087: {'lr': 0.0004442219075850644, 'samples': 6544704, 'steps': 34086, 'loss/train': 2.357985734939575} 11/07/2021 02:02:46 - INFO - __main__ - Step 34088: {'lr': 0.0004442185662066731, 'samples': 6544896, 'steps': 34087, 'loss/train': 1.6649442911148071} 11/07/2021 02:02:47 - INFO - __main__ - Step 34089: {'lr': 0.00044421522474077, 'samples': 6545088, 'steps': 34088, 'loss/train': 1.016621708869934} 11/07/2021 02:02:47 - INFO - __main__ - Step 34090: {'lr': 0.0004442118831873565, 'samples': 6545280, 'steps': 34089, 'loss/train': 2.521681547164917} 11/07/2021 02:02:48 - INFO - __main__ - Step 34091: {'lr': 0.00044420854154643413, 'samples': 6545472, 'steps': 34090, 'loss/train': 1.7413277626037598} 11/07/2021 02:02:48 - INFO - __main__ - Step 34092: {'lr': 0.00044420519981800446, 'samples': 6545664, 'steps': 34091, 'loss/train': 1.9090162515640259} 11/07/2021 02:02:48 - INFO - __main__ - Step 34093: {'lr': 0.0004442018580020688, 'samples': 6545856, 'steps': 34092, 'loss/train': 1.6657209396362305} 11/07/2021 02:02:50 - INFO - __main__ - Step 34094: {'lr': 0.0004441985160986288, 'samples': 6546048, 'steps': 34093, 'loss/train': 1.4866628646850586} 11/07/2021 02:02:50 - INFO - __main__ - Step 34095: {'lr': 0.00044419517410768594, 'samples': 6546240, 'steps': 34094, 'loss/train': 1.7155829668045044} 11/07/2021 02:02:50 - INFO - __main__ - Step 34096: {'lr': 0.0004441918320292418, 'samples': 6546432, 'steps': 34095, 'loss/train': 2.6343283653259277} 11/07/2021 02:02:51 - INFO - __main__ - Step 34097: {'lr': 0.00044418848986329775, 'samples': 6546624, 'steps': 34096, 'loss/train': 1.75544273853302} 11/07/2021 02:02:51 - INFO - __main__ - Step 34098: {'lr': 0.0004441851476098554, 'samples': 6546816, 'steps': 34097, 'loss/train': 1.4818480014801025} 11/07/2021 02:02:52 - INFO - __main__ - Step 34099: {'lr': 0.0004441818052689162, 'samples': 6547008, 'steps': 34098, 'loss/train': 1.7657145261764526} 11/07/2021 02:02:52 - INFO - __main__ - Step 34100: {'lr': 0.0004441784628404817, 'samples': 6547200, 'steps': 34099, 'loss/train': 1.303661823272705} 11/07/2021 02:02:53 - INFO - __main__ - Step 34101: {'lr': 0.0004441751203245533, 'samples': 6547392, 'steps': 34100, 'loss/train': 1.3592103719711304} 11/07/2021 02:02:53 - INFO - __main__ - Step 34102: {'lr': 0.0004441717777211327, 'samples': 6547584, 'steps': 34101, 'loss/train': 1.1900908946990967} 11/07/2021 02:02:53 - INFO - __main__ - Step 34103: {'lr': 0.00044416843503022126, 'samples': 6547776, 'steps': 34102, 'loss/train': 1.4008874893188477} 11/07/2021 02:02:55 - INFO - __main__ - Step 34104: {'lr': 0.00044416509225182044, 'samples': 6547968, 'steps': 34103, 'loss/train': 1.9034819602966309} 11/07/2021 02:02:55 - INFO - __main__ - Step 34105: {'lr': 0.0004441617493859319, 'samples': 6548160, 'steps': 34104, 'loss/train': 1.9238120317459106} 11/07/2021 02:02:55 - INFO - __main__ - Step 34106: {'lr': 0.0004441584064325571, 'samples': 6548352, 'steps': 34105, 'loss/train': 1.6103079319000244} 11/07/2021 02:02:56 - INFO - __main__ - Step 34107: {'lr': 0.0004441550633916975, 'samples': 6548544, 'steps': 34106, 'loss/train': 1.407387375831604} 11/07/2021 02:02:56 - INFO - __main__ - Step 34108: {'lr': 0.0004441517202633546, 'samples': 6548736, 'steps': 34107, 'loss/train': 1.857909083366394} 11/07/2021 02:02:56 - INFO - __main__ - Step 34109: {'lr': 0.0004441483770475299, 'samples': 6548928, 'steps': 34108, 'loss/train': 1.783825397491455} 11/07/2021 02:02:58 - INFO - __main__ - Step 34110: {'lr': 0.000444145033744225, 'samples': 6549120, 'steps': 34109, 'loss/train': 1.9305288791656494} 11/07/2021 02:02:58 - INFO - __main__ - Step 34111: {'lr': 0.0004441416903534413, 'samples': 6549312, 'steps': 34110, 'loss/train': 1.528344988822937} 11/07/2021 02:02:59 - INFO - __main__ - Step 34112: {'lr': 0.00044413834687518034, 'samples': 6549504, 'steps': 34111, 'loss/train': 1.9612399339675903} 11/07/2021 02:02:59 - INFO - __main__ - Step 34113: {'lr': 0.00044413500330944366, 'samples': 6549696, 'steps': 34112, 'loss/train': 1.5819721221923828} 11/07/2021 02:02:59 - INFO - __main__ - Step 34114: {'lr': 0.00044413165965623275, 'samples': 6549888, 'steps': 34113, 'loss/train': 1.7284300327301025} 11/07/2021 02:03:01 - INFO - __main__ - Step 34115: {'lr': 0.00044412831591554916, 'samples': 6550080, 'steps': 34114, 'loss/train': 1.3581510782241821} 11/07/2021 02:03:01 - INFO - __main__ - Step 34116: {'lr': 0.0004441249720873942, 'samples': 6550272, 'steps': 34115, 'loss/train': 1.1771085262298584} 11/07/2021 02:03:01 - INFO - __main__ - Step 34117: {'lr': 0.00044412162817176966, 'samples': 6550464, 'steps': 34116, 'loss/train': 1.4967354536056519} 11/07/2021 02:03:02 - INFO - __main__ - Step 34118: {'lr': 0.00044411828416867684, 'samples': 6550656, 'steps': 34117, 'loss/train': 1.4765671491622925} 11/07/2021 02:03:02 - INFO - __main__ - Step 34119: {'lr': 0.00044411494007811736, 'samples': 6550848, 'steps': 34118, 'loss/train': 3.327427864074707} 11/07/2021 02:03:03 - INFO - __main__ - Step 34120: {'lr': 0.00044411159590009263, 'samples': 6551040, 'steps': 34119, 'loss/train': 5.610230445861816} 11/07/2021 02:03:03 - INFO - __main__ - Step 34121: {'lr': 0.0004441082516346043, 'samples': 6551232, 'steps': 34120, 'loss/train': 5.60709285736084} 11/07/2021 02:03:04 - INFO - __main__ - Step 34122: {'lr': 0.0004441049072816537, 'samples': 6551424, 'steps': 34121, 'loss/train': 1.4392449855804443} 11/07/2021 02:03:04 - INFO - __main__ - Step 34123: {'lr': 0.0004441015628412425, 'samples': 6551616, 'steps': 34122, 'loss/train': 1.6424055099487305} 11/07/2021 02:03:05 - INFO - __main__ - Step 34124: {'lr': 0.0004440982183133721, 'samples': 6551808, 'steps': 34123, 'loss/train': 1.508685827255249} 11/07/2021 02:03:05 - INFO - __main__ - Step 34125: {'lr': 0.00044409487369804395, 'samples': 6552000, 'steps': 34124, 'loss/train': 1.6763535737991333} 11/07/2021 02:03:05 - INFO - __main__ - Step 34126: {'lr': 0.00044409152899525973, 'samples': 6552192, 'steps': 34125, 'loss/train': 1.591783046722412} 11/07/2021 02:03:06 - INFO - __main__ - Step 34127: {'lr': 0.00044408818420502085, 'samples': 6552384, 'steps': 34126, 'loss/train': 1.2297704219818115} 11/07/2021 02:03:07 - INFO - __main__ - Step 34128: {'lr': 0.00044408483932732886, 'samples': 6552576, 'steps': 34127, 'loss/train': 1.5073697566986084} 11/07/2021 02:03:07 - INFO - __main__ - Step 34129: {'lr': 0.00044408149436218523, 'samples': 6552768, 'steps': 34128, 'loss/train': 1.2436527013778687} 11/07/2021 02:03:07 - INFO - __main__ - Step 34130: {'lr': 0.00044407814930959137, 'samples': 6552960, 'steps': 34129, 'loss/train': 1.310021162033081} 11/07/2021 02:03:08 - INFO - __main__ - Step 34131: {'lr': 0.000444074804169549, 'samples': 6553152, 'steps': 34130, 'loss/train': 1.5414221286773682} 11/07/2021 02:03:09 - INFO - __main__ - Step 34132: {'lr': 0.00044407145894205947, 'samples': 6553344, 'steps': 34131, 'loss/train': 1.5979948043823242} 11/07/2021 02:03:09 - INFO - __main__ - Step 34133: {'lr': 0.0004440681136271244, 'samples': 6553536, 'steps': 34132, 'loss/train': 1.8100224733352661} 11/07/2021 02:03:10 - INFO - __main__ - Step 34134: {'lr': 0.0004440647682247452, 'samples': 6553728, 'steps': 34133, 'loss/train': 1.0398575067520142} 11/07/2021 02:03:10 - INFO - __main__ - Step 34135: {'lr': 0.00044406142273492334, 'samples': 6553920, 'steps': 34134, 'loss/train': 1.5329359769821167} 11/07/2021 02:03:11 - INFO - __main__ - Step 34136: {'lr': 0.00044405807715766047, 'samples': 6554112, 'steps': 34135, 'loss/train': 1.1437962055206299} 11/07/2021 02:03:12 - INFO - __main__ - Step 34137: {'lr': 0.00044405473149295804, 'samples': 6554304, 'steps': 34136, 'loss/train': 1.5289555788040161} 11/07/2021 02:03:12 - INFO - __main__ - Step 34138: {'lr': 0.0004440513857408175, 'samples': 6554496, 'steps': 34137, 'loss/train': 1.8025574684143066} 11/07/2021 02:03:12 - INFO - __main__ - Step 34139: {'lr': 0.0004440480399012404, 'samples': 6554688, 'steps': 34138, 'loss/train': 1.7068231105804443} 11/07/2021 02:03:13 - INFO - __main__ - Step 34140: {'lr': 0.00044404469397422823, 'samples': 6554880, 'steps': 34139, 'loss/train': 0.6725926399230957} 11/07/2021 02:03:13 - INFO - __main__ - Step 34141: {'lr': 0.00044404134795978257, 'samples': 6555072, 'steps': 34140, 'loss/train': 1.5433404445648193} 11/07/2021 02:03:14 - INFO - __main__ - Step 34142: {'lr': 0.0004440380018579049, 'samples': 6555264, 'steps': 34141, 'loss/train': 1.7330764532089233} 11/07/2021 02:03:14 - INFO - __main__ - Step 34143: {'lr': 0.00044403465566859656, 'samples': 6555456, 'steps': 34142, 'loss/train': 1.6334328651428223} 11/07/2021 02:03:15 - INFO - __main__ - Step 34144: {'lr': 0.0004440313093918593, 'samples': 6555648, 'steps': 34143, 'loss/train': 1.4875729084014893} 11/07/2021 02:03:15 - INFO - __main__ - Step 34145: {'lr': 0.00044402796302769453, 'samples': 6555840, 'steps': 34144, 'loss/train': 0.7958040833473206} 11/07/2021 02:03:15 - INFO - __main__ - Step 34146: {'lr': 0.0004440246165761037, 'samples': 6556032, 'steps': 34145, 'loss/train': 1.0676820278167725} 11/07/2021 02:03:16 - INFO - __main__ - Step 34147: {'lr': 0.00044402127003708846, 'samples': 6556224, 'steps': 34146, 'loss/train': 1.3615130186080933} 11/07/2021 02:03:17 - INFO - __main__ - Step 34148: {'lr': 0.0004440179234106502, 'samples': 6556416, 'steps': 34147, 'loss/train': 1.5049426555633545} 11/07/2021 02:03:17 - INFO - __main__ - Step 34149: {'lr': 0.00044401457669679043, 'samples': 6556608, 'steps': 34148, 'loss/train': 1.7335518598556519} 11/07/2021 02:03:18 - INFO - __main__ - Step 34150: {'lr': 0.0004440112298955107, 'samples': 6556800, 'steps': 34149, 'loss/train': 2.2905702590942383} 11/07/2021 02:03:18 - INFO - __main__ - Step 34151: {'lr': 0.0004440078830068125, 'samples': 6556992, 'steps': 34150, 'loss/train': 1.9597066640853882} 11/07/2021 02:03:18 - INFO - __main__ - Step 34152: {'lr': 0.00044400453603069727, 'samples': 6557184, 'steps': 34151, 'loss/train': 1.5736281871795654} 11/07/2021 02:03:20 - INFO - __main__ - Step 34153: {'lr': 0.0004440011889671667, 'samples': 6557376, 'steps': 34152, 'loss/train': 1.454236626625061} 11/07/2021 02:03:20 - INFO - __main__ - Step 34154: {'lr': 0.00044399784181622216, 'samples': 6557568, 'steps': 34153, 'loss/train': 1.4838290214538574} 11/07/2021 02:03:20 - INFO - __main__ - Step 34155: {'lr': 0.0004439944945778651, 'samples': 6557760, 'steps': 34154, 'loss/train': 1.7440612316131592} 11/07/2021 02:03:21 - INFO - __main__ - Step 34156: {'lr': 0.0004439911472520972, 'samples': 6557952, 'steps': 34155, 'loss/train': 1.247413992881775} 11/07/2021 02:03:21 - INFO - __main__ - Step 34157: {'lr': 0.0004439877998389199, 'samples': 6558144, 'steps': 34156, 'loss/train': 1.444315791130066} 11/07/2021 02:03:22 - INFO - __main__ - Step 34158: {'lr': 0.0004439844523383346, 'samples': 6558336, 'steps': 34157, 'loss/train': 1.4981380701065063} 11/07/2021 02:03:22 - INFO - __main__ - Step 34159: {'lr': 0.000443981104750343, 'samples': 6558528, 'steps': 34158, 'loss/train': 1.642318606376648} 11/07/2021 02:03:23 - INFO - __main__ - Step 34160: {'lr': 0.0004439777570749465, 'samples': 6558720, 'steps': 34159, 'loss/train': 1.1470081806182861} 11/07/2021 02:03:23 - INFO - __main__ - Step 34161: {'lr': 0.0004439744093121465, 'samples': 6558912, 'steps': 34160, 'loss/train': 1.8299084901809692} 11/07/2021 02:03:23 - INFO - __main__ - Step 34162: {'lr': 0.00044397106146194473, 'samples': 6559104, 'steps': 34161, 'loss/train': 1.927542805671692} 11/07/2021 02:03:24 - INFO - __main__ - Step 34163: {'lr': 0.00044396771352434256, 'samples': 6559296, 'steps': 34162, 'loss/train': 1.4433425664901733} 11/07/2021 02:03:25 - INFO - __main__ - Step 34164: {'lr': 0.00044396436549934155, 'samples': 6559488, 'steps': 34163, 'loss/train': 1.2462340593338013} 11/07/2021 02:03:25 - INFO - __main__ - Step 34165: {'lr': 0.00044396101738694316, 'samples': 6559680, 'steps': 34164, 'loss/train': 1.1547960042953491} 11/07/2021 02:03:25 - INFO - __main__ - Step 34166: {'lr': 0.000443957669187149, 'samples': 6559872, 'steps': 34165, 'loss/train': 0.9203764200210571} 11/07/2021 02:03:26 - INFO - __main__ - Step 34167: {'lr': 0.0004439543208999604, 'samples': 6560064, 'steps': 34166, 'loss/train': 1.015457034111023} 11/07/2021 02:03:26 - INFO - __main__ - Step 34168: {'lr': 0.00044395097252537905, 'samples': 6560256, 'steps': 34167, 'loss/train': 1.355995535850525} 11/07/2021 02:03:27 - INFO - __main__ - Step 34169: {'lr': 0.0004439476240634064, 'samples': 6560448, 'steps': 34168, 'loss/train': 1.298944115638733} 11/07/2021 02:03:28 - INFO - __main__ - Step 34170: {'lr': 0.00044394427551404386, 'samples': 6560640, 'steps': 34169, 'loss/train': 5.923316478729248} 11/07/2021 02:03:28 - INFO - __main__ - Step 34171: {'lr': 0.00044394092687729305, 'samples': 6560832, 'steps': 34170, 'loss/train': 1.089755654335022} 11/07/2021 02:03:28 - INFO - __main__ - Step 34172: {'lr': 0.0004439375781531555, 'samples': 6561024, 'steps': 34171, 'loss/train': 1.5006424188613892} 11/07/2021 02:03:29 - INFO - __main__ - Step 34173: {'lr': 0.00044393422934163265, 'samples': 6561216, 'steps': 34172, 'loss/train': 1.8292887210845947} 11/07/2021 02:03:30 - INFO - __main__ - Step 34174: {'lr': 0.000443930880442726, 'samples': 6561408, 'steps': 34173, 'loss/train': 1.6458317041397095} 11/07/2021 02:03:30 - INFO - __main__ - Step 34175: {'lr': 0.0004439275314564371, 'samples': 6561600, 'steps': 34174, 'loss/train': 1.5969767570495605} 11/07/2021 02:03:30 - INFO - __main__ - Step 34176: {'lr': 0.0004439241823827674, 'samples': 6561792, 'steps': 34175, 'loss/train': 0.9351884722709656} 11/07/2021 02:03:31 - INFO - __main__ - Step 34177: {'lr': 0.0004439208332217186, 'samples': 6561984, 'steps': 34176, 'loss/train': 1.02590012550354} 11/07/2021 02:03:31 - INFO - __main__ - Step 34178: {'lr': 0.00044391748397329194, 'samples': 6562176, 'steps': 34177, 'loss/train': 1.222740888595581} 11/07/2021 02:03:32 - INFO - __main__ - Step 34179: {'lr': 0.0004439141346374891, 'samples': 6562368, 'steps': 34178, 'loss/train': 1.8320636749267578} 11/07/2021 02:03:32 - INFO - __main__ - Step 34180: {'lr': 0.0004439107852143115, 'samples': 6562560, 'steps': 34179, 'loss/train': 1.2978649139404297} 11/07/2021 02:03:33 - INFO - __main__ - Step 34181: {'lr': 0.0004439074357037607, 'samples': 6562752, 'steps': 34180, 'loss/train': 2.211667060852051} 11/07/2021 02:03:33 - INFO - __main__ - Step 34182: {'lr': 0.0004439040861058383, 'samples': 6562944, 'steps': 34181, 'loss/train': 1.599212408065796} 11/07/2021 02:03:33 - INFO - __main__ - Step 34183: {'lr': 0.00044390073642054564, 'samples': 6563136, 'steps': 34182, 'loss/train': 1.436307668685913} 11/07/2021 02:03:35 - INFO - __main__ - Step 34184: {'lr': 0.00044389738664788424, 'samples': 6563328, 'steps': 34183, 'loss/train': 0.9093039631843567} 11/07/2021 02:03:35 - INFO - __main__ - Step 34185: {'lr': 0.00044389403678785576, 'samples': 6563520, 'steps': 34184, 'loss/train': 1.4228278398513794} 11/07/2021 02:03:35 - INFO - __main__ - Step 34186: {'lr': 0.0004438906868404616, 'samples': 6563712, 'steps': 34185, 'loss/train': 1.4958488941192627} 11/07/2021 02:03:36 - INFO - __main__ - Step 34187: {'lr': 0.00044388733680570324, 'samples': 6563904, 'steps': 34186, 'loss/train': 1.353678584098816} 11/07/2021 02:03:36 - INFO - __main__ - Step 34188: {'lr': 0.00044388398668358234, 'samples': 6564096, 'steps': 34187, 'loss/train': 1.4119741916656494} 11/07/2021 02:03:36 - INFO - __main__ - Step 34189: {'lr': 0.00044388063647410016, 'samples': 6564288, 'steps': 34188, 'loss/train': 1.8877049684524536} 11/07/2021 02:03:37 - INFO - __main__ - Step 34190: {'lr': 0.00044387728617725845, 'samples': 6564480, 'steps': 34189, 'loss/train': 1.3518003225326538} 11/07/2021 02:03:38 - INFO - __main__ - Step 34191: {'lr': 0.0004438739357930586, 'samples': 6564672, 'steps': 34190, 'loss/train': 1.495152473449707} 11/07/2021 02:03:38 - INFO - __main__ - Step 34192: {'lr': 0.00044387058532150217, 'samples': 6564864, 'steps': 34191, 'loss/train': 1.3404079675674438} 11/07/2021 02:03:38 - INFO - __main__ - Step 34193: {'lr': 0.0004438672347625907, 'samples': 6565056, 'steps': 34192, 'loss/train': 1.344968557357788} 11/07/2021 02:03:39 - INFO - __main__ - Step 34194: {'lr': 0.0004438638841163255, 'samples': 6565248, 'steps': 34193, 'loss/train': 1.5332821607589722} 11/07/2021 02:03:40 - INFO - __main__ - Step 34195: {'lr': 0.0004438605333827083, 'samples': 6565440, 'steps': 34194, 'loss/train': 1.675861120223999} 11/07/2021 02:03:40 - INFO - __main__ - Step 34196: {'lr': 0.00044385718256174055, 'samples': 6565632, 'steps': 34195, 'loss/train': 1.3681551218032837} 11/07/2021 02:03:40 - INFO - __main__ - Step 34197: {'lr': 0.0004438538316534237, 'samples': 6565824, 'steps': 34196, 'loss/train': 1.3978954553604126} 11/07/2021 02:03:41 - INFO - __main__ - Step 34198: {'lr': 0.0004438504806577594, 'samples': 6566016, 'steps': 34197, 'loss/train': 1.6417779922485352} 11/07/2021 02:03:41 - INFO - __main__ - Step 34199: {'lr': 0.000443847129574749, 'samples': 6566208, 'steps': 34198, 'loss/train': 1.458509922027588} 11/07/2021 02:03:42 - INFO - __main__ - Step 34200: {'lr': 0.0004438437784043941, 'samples': 6566400, 'steps': 34199, 'loss/train': 1.0733407735824585} 11/07/2021 02:03:42 - INFO - __main__ - Step 34201: {'lr': 0.00044384042714669614, 'samples': 6566592, 'steps': 34200, 'loss/train': 1.784488320350647} 11/07/2021 02:03:43 - INFO - __main__ - Step 34202: {'lr': 0.0004438370758016567, 'samples': 6566784, 'steps': 34201, 'loss/train': 1.4622405767440796} 11/07/2021 02:03:43 - INFO - __main__ - Step 34203: {'lr': 0.00044383372436927727, 'samples': 6566976, 'steps': 34202, 'loss/train': 1.6074336767196655} 11/07/2021 02:03:44 - INFO - __main__ - Step 34204: {'lr': 0.00044383037284955937, 'samples': 6567168, 'steps': 34203, 'loss/train': 1.4448004961013794} 11/07/2021 02:03:45 - INFO - __main__ - Step 34205: {'lr': 0.00044382702124250444, 'samples': 6567360, 'steps': 34204, 'loss/train': 1.7972103357315063} 11/07/2021 02:03:45 - INFO - __main__ - Step 34206: {'lr': 0.0004438236695481141, 'samples': 6567552, 'steps': 34205, 'loss/train': 1.9503910541534424} 11/07/2021 02:03:45 - INFO - __main__ - Step 34207: {'lr': 0.00044382031776638974, 'samples': 6567744, 'steps': 34206, 'loss/train': 1.1128954887390137} 11/07/2021 02:03:46 - INFO - __main__ - Step 34208: {'lr': 0.000443816965897333, 'samples': 6567936, 'steps': 34207, 'loss/train': 1.2528254985809326} 11/07/2021 02:03:46 - INFO - __main__ - Step 34209: {'lr': 0.0004438136139409453, 'samples': 6568128, 'steps': 34208, 'loss/train': 1.4778404235839844} 11/07/2021 02:03:47 - INFO - __main__ - Step 34210: {'lr': 0.00044381026189722824, 'samples': 6568320, 'steps': 34209, 'loss/train': 1.6742136478424072} 11/07/2021 02:03:47 - INFO - __main__ - Step 34211: {'lr': 0.0004438069097661832, 'samples': 6568512, 'steps': 34210, 'loss/train': 1.3790103197097778} 11/07/2021 02:03:48 - INFO - __main__ - Step 34212: {'lr': 0.0004438035575478118, 'samples': 6568704, 'steps': 34211, 'loss/train': 0.9726940393447876} 11/07/2021 02:03:48 - INFO - __main__ - Step 34213: {'lr': 0.0004438002052421154, 'samples': 6568896, 'steps': 34212, 'loss/train': 1.7135347127914429} 11/07/2021 02:03:49 - INFO - __main__ - Step 34214: {'lr': 0.00044379685284909575, 'samples': 6569088, 'steps': 34213, 'loss/train': 1.9888619184494019} 11/07/2021 02:03:49 - INFO - __main__ - Step 34215: {'lr': 0.00044379350036875413, 'samples': 6569280, 'steps': 34214, 'loss/train': 1.7909215688705444} 11/07/2021 02:03:50 - INFO - __main__ - Step 34216: {'lr': 0.00044379014780109217, 'samples': 6569472, 'steps': 34215, 'loss/train': 1.947856068611145} 11/07/2021 02:03:50 - INFO - __main__ - Step 34217: {'lr': 0.00044378679514611144, 'samples': 6569664, 'steps': 34216, 'loss/train': 1.699139952659607} 11/07/2021 02:03:51 - INFO - __main__ - Step 34218: {'lr': 0.0004437834424038133, 'samples': 6569856, 'steps': 34217, 'loss/train': 0.8595290780067444} 11/07/2021 02:03:51 - INFO - __main__ - Step 34219: {'lr': 0.00044378008957419936, 'samples': 6570048, 'steps': 34218, 'loss/train': 1.4129565954208374} 11/07/2021 02:03:51 - INFO - __main__ - Step 34220: {'lr': 0.00044377673665727105, 'samples': 6570240, 'steps': 34219, 'loss/train': 1.8418340682983398} 11/07/2021 02:03:52 - INFO - __main__ - Step 34221: {'lr': 0.00044377338365303, 'samples': 6570432, 'steps': 34220, 'loss/train': 1.3927035331726074} 11/07/2021 02:03:53 - INFO - __main__ - Step 34222: {'lr': 0.00044377003056147757, 'samples': 6570624, 'steps': 34221, 'loss/train': 1.8908228874206543} 11/07/2021 02:03:53 - INFO - __main__ - Step 34223: {'lr': 0.00044376667738261545, 'samples': 6570816, 'steps': 34222, 'loss/train': 1.8357990980148315} 11/07/2021 02:03:53 - INFO - __main__ - Step 34224: {'lr': 0.000443763324116445, 'samples': 6571008, 'steps': 34223, 'loss/train': 1.339423418045044} 11/07/2021 02:03:54 - INFO - __main__ - Step 34225: {'lr': 0.00044375997076296774, 'samples': 6571200, 'steps': 34224, 'loss/train': 1.7065150737762451} 11/07/2021 02:03:55 - INFO - __main__ - Step 34226: {'lr': 0.0004437566173221853, 'samples': 6571392, 'steps': 34225, 'loss/train': 1.329896330833435} 11/07/2021 02:03:55 - INFO - __main__ - Step 34227: {'lr': 0.0004437532637940991, 'samples': 6571584, 'steps': 34226, 'loss/train': 1.8014112710952759} 11/07/2021 02:03:55 - INFO - __main__ - Step 34228: {'lr': 0.0004437499101787107, 'samples': 6571776, 'steps': 34227, 'loss/train': 1.0249602794647217} 11/07/2021 02:03:56 - INFO - __main__ - Step 34229: {'lr': 0.00044374655647602153, 'samples': 6571968, 'steps': 34228, 'loss/train': 1.4240094423294067} 11/07/2021 02:03:56 - INFO - __main__ - Step 34230: {'lr': 0.0004437432026860332, 'samples': 6572160, 'steps': 34229, 'loss/train': 1.3589770793914795} 11/07/2021 02:03:57 - INFO - __main__ - Step 34231: {'lr': 0.00044373984880874705, 'samples': 6572352, 'steps': 34230, 'loss/train': 1.9758151769638062} 11/07/2021 02:03:57 - INFO - __main__ - Step 34232: {'lr': 0.0004437364948441649, 'samples': 6572544, 'steps': 34231, 'loss/train': 1.4315061569213867} 11/07/2021 02:03:58 - INFO - __main__ - Step 34233: {'lr': 0.00044373314079228796, 'samples': 6572736, 'steps': 34232, 'loss/train': 1.163349986076355} 11/07/2021 02:03:58 - INFO - __main__ - Step 34234: {'lr': 0.0004437297866531179, 'samples': 6572928, 'steps': 34233, 'loss/train': 1.6355782747268677} 11/07/2021 02:03:59 - INFO - __main__ - Step 34235: {'lr': 0.0004437264324266561, 'samples': 6573120, 'steps': 34234, 'loss/train': 1.6662013530731201} 11/07/2021 02:03:59 - INFO - __main__ - Step 34236: {'lr': 0.00044372307811290425, 'samples': 6573312, 'steps': 34235, 'loss/train': 0.80659419298172} 11/07/2021 02:04:00 - INFO - __main__ - Step 34237: {'lr': 0.00044371972371186374, 'samples': 6573504, 'steps': 34236, 'loss/train': 1.1076829433441162} 11/07/2021 02:04:00 - INFO - __main__ - Step 34238: {'lr': 0.0004437163692235361, 'samples': 6573696, 'steps': 34237, 'loss/train': 1.6724625825881958} 11/07/2021 02:04:00 - INFO - __main__ - Step 34239: {'lr': 0.0004437130146479229, 'samples': 6573888, 'steps': 34238, 'loss/train': 1.646666169166565} 11/07/2021 02:04:01 - INFO - __main__ - Step 34240: {'lr': 0.00044370965998502554, 'samples': 6574080, 'steps': 34239, 'loss/train': 2.2665205001831055} 11/07/2021 02:04:01 - INFO - __main__ - Step 34241: {'lr': 0.0004437063052348457, 'samples': 6574272, 'steps': 34240, 'loss/train': 1.2336525917053223} 11/07/2021 02:04:02 - INFO - __main__ - Step 34242: {'lr': 0.0004437029503973847, 'samples': 6574464, 'steps': 34241, 'loss/train': 1.5797135829925537} 11/07/2021 02:04:03 - INFO - __main__ - Step 34243: {'lr': 0.00044369959547264416, 'samples': 6574656, 'steps': 34242, 'loss/train': 1.3858048915863037} 11/07/2021 02:04:03 - INFO - __main__ - Step 34244: {'lr': 0.0004436962404606255, 'samples': 6574848, 'steps': 34243, 'loss/train': 1.3134781122207642} 11/07/2021 02:04:03 - INFO - __main__ - Step 34245: {'lr': 0.0004436928853613304, 'samples': 6575040, 'steps': 34244, 'loss/train': 1.6575181484222412} 11/07/2021 02:04:04 - INFO - __main__ - Step 34246: {'lr': 0.0004436895301747602, 'samples': 6575232, 'steps': 34245, 'loss/train': 1.971530795097351} 11/07/2021 02:04:05 - INFO - __main__ - Step 34247: {'lr': 0.00044368617490091655, 'samples': 6575424, 'steps': 34246, 'loss/train': 1.436553955078125} 11/07/2021 02:04:05 - INFO - __main__ - Step 34248: {'lr': 0.0004436828195398009, 'samples': 6575616, 'steps': 34247, 'loss/train': 1.2922219038009644} 11/07/2021 02:04:05 - INFO - __main__ - Step 34249: {'lr': 0.0004436794640914148, 'samples': 6575808, 'steps': 34248, 'loss/train': 1.1830744743347168} 11/07/2021 02:04:06 - INFO - __main__ - Step 34250: {'lr': 0.00044367610855575965, 'samples': 6576000, 'steps': 34249, 'loss/train': 1.7076432704925537} 11/07/2021 02:04:06 - INFO - __main__ - Step 34251: {'lr': 0.00044367275293283705, 'samples': 6576192, 'steps': 34250, 'loss/train': 1.1485662460327148} 11/07/2021 02:04:07 - INFO - __main__ - Step 34252: {'lr': 0.00044366939722264843, 'samples': 6576384, 'steps': 34251, 'loss/train': 1.5567431449890137} 11/07/2021 02:04:07 - INFO - __main__ - Step 34253: {'lr': 0.00044366604142519547, 'samples': 6576576, 'steps': 34252, 'loss/train': 1.6270051002502441} 11/07/2021 02:04:08 - INFO - __main__ - Step 34254: {'lr': 0.0004436626855404796, 'samples': 6576768, 'steps': 34253, 'loss/train': 1.0859005451202393} 11/07/2021 02:04:08 - INFO - __main__ - Step 34255: {'lr': 0.0004436593295685022, 'samples': 6576960, 'steps': 34254, 'loss/train': 1.3940390348434448} 11/07/2021 02:04:09 - INFO - __main__ - Step 34256: {'lr': 0.00044365597350926495, 'samples': 6577152, 'steps': 34255, 'loss/train': 1.350616455078125} 11/07/2021 02:04:10 - INFO - __main__ - Step 34257: {'lr': 0.0004436526173627693, 'samples': 6577344, 'steps': 34256, 'loss/train': 1.7524422407150269} 11/07/2021 02:04:10 - INFO - __main__ - Step 34258: {'lr': 0.00044364926112901675, 'samples': 6577536, 'steps': 34257, 'loss/train': 1.117216944694519} 11/07/2021 02:04:10 - INFO - __main__ - Step 34259: {'lr': 0.0004436459048080089, 'samples': 6577728, 'steps': 34258, 'loss/train': 0.9890215992927551} 11/07/2021 02:04:11 - INFO - __main__ - Step 34260: {'lr': 0.00044364254839974717, 'samples': 6577920, 'steps': 34259, 'loss/train': 1.560340404510498} 11/07/2021 02:04:11 - INFO - __main__ - Step 34261: {'lr': 0.0004436391919042331, 'samples': 6578112, 'steps': 34260, 'loss/train': 1.3585392236709595} 11/07/2021 02:04:12 - INFO - __main__ - Step 34262: {'lr': 0.00044363583532146814, 'samples': 6578304, 'steps': 34261, 'loss/train': 1.6851139068603516} 11/07/2021 02:04:12 - INFO - __main__ - Step 34263: {'lr': 0.0004436324786514538, 'samples': 6578496, 'steps': 34262, 'loss/train': 1.8596597909927368} 11/07/2021 02:04:13 - INFO - __main__ - Step 34264: {'lr': 0.0004436291218941918, 'samples': 6578688, 'steps': 34263, 'loss/train': 1.6587039232254028} 11/07/2021 02:04:13 - INFO - __main__ - Step 34265: {'lr': 0.00044362576504968344, 'samples': 6578880, 'steps': 34264, 'loss/train': 0.7693606019020081} 11/07/2021 02:04:13 - INFO - __main__ - Step 34266: {'lr': 0.0004436224081179303, 'samples': 6579072, 'steps': 34265, 'loss/train': 1.7554024457931519} 11/07/2021 02:04:15 - INFO - __main__ - Step 34267: {'lr': 0.00044361905109893397, 'samples': 6579264, 'steps': 34266, 'loss/train': 1.649536371231079} 11/07/2021 02:04:15 - INFO - __main__ - Step 34268: {'lr': 0.00044361569399269574, 'samples': 6579456, 'steps': 34267, 'loss/train': 1.2417728900909424} 11/07/2021 02:04:15 - INFO - __main__ - Step 34269: {'lr': 0.0004436123367992174, 'samples': 6579648, 'steps': 34268, 'loss/train': 1.3689631223678589} 11/07/2021 02:04:16 - INFO - __main__ - Step 34270: {'lr': 0.0004436089795185003, 'samples': 6579840, 'steps': 34269, 'loss/train': 1.7414630651474} 11/07/2021 02:04:16 - INFO - __main__ - Step 34271: {'lr': 0.0004436056221505459, 'samples': 6580032, 'steps': 34270, 'loss/train': 1.6733810901641846} 11/07/2021 02:04:17 - INFO - __main__ - Step 34272: {'lr': 0.00044360226469535583, 'samples': 6580224, 'steps': 34271, 'loss/train': 1.5781022310256958} 11/07/2021 02:04:17 - INFO - __main__ - Step 34273: {'lr': 0.0004435989071529316, 'samples': 6580416, 'steps': 34272, 'loss/train': 1.8803343772888184} 11/07/2021 02:04:18 - INFO - __main__ - Step 34274: {'lr': 0.0004435955495232746, 'samples': 6580608, 'steps': 34273, 'loss/train': 1.4304903745651245} 11/07/2021 02:04:18 - INFO - __main__ - Step 34275: {'lr': 0.00044359219180638656, 'samples': 6580800, 'steps': 34274, 'loss/train': 1.7356127500534058} 11/07/2021 02:04:18 - INFO - __main__ - Step 34276: {'lr': 0.0004435888340022688, 'samples': 6580992, 'steps': 34275, 'loss/train': 1.8282495737075806} 11/07/2021 02:04:19 - INFO - __main__ - Step 34277: {'lr': 0.0004435854761109229, 'samples': 6581184, 'steps': 34276, 'loss/train': 1.6848821640014648} 11/07/2021 02:04:20 - INFO - __main__ - Step 34278: {'lr': 0.00044358211813235046, 'samples': 6581376, 'steps': 34277, 'loss/train': 1.6042040586471558} 11/07/2021 02:04:20 - INFO - __main__ - Step 34279: {'lr': 0.0004435787600665528, 'samples': 6581568, 'steps': 34278, 'loss/train': 2.087636947631836} 11/07/2021 02:04:21 - INFO - __main__ - Step 34280: {'lr': 0.0004435754019135315, 'samples': 6581760, 'steps': 34279, 'loss/train': 1.4953944683074951} 11/07/2021 02:04:21 - INFO - __main__ - Step 34281: {'lr': 0.0004435720436732882, 'samples': 6581952, 'steps': 34280, 'loss/train': 1.3213019371032715} 11/07/2021 02:04:21 - INFO - __main__ - Step 34282: {'lr': 0.0004435686853458243, 'samples': 6582144, 'steps': 34281, 'loss/train': 1.548249363899231} 11/07/2021 02:04:22 - INFO - __main__ - Step 34283: {'lr': 0.0004435653269311414, 'samples': 6582336, 'steps': 34282, 'loss/train': 1.2772735357284546} 11/07/2021 02:04:23 - INFO - __main__ - Step 34284: {'lr': 0.00044356196842924086, 'samples': 6582528, 'steps': 34283, 'loss/train': 0.16299578547477722} 11/07/2021 02:04:23 - INFO - __main__ - Step 34285: {'lr': 0.0004435586098401243, 'samples': 6582720, 'steps': 34284, 'loss/train': 1.3020122051239014} 11/07/2021 02:04:24 - INFO - __main__ - Step 34286: {'lr': 0.00044355525116379326, 'samples': 6582912, 'steps': 34285, 'loss/train': 1.6647453308105469} 11/07/2021 02:04:24 - INFO - __main__ - Step 34287: {'lr': 0.00044355189240024917, 'samples': 6583104, 'steps': 34286, 'loss/train': 1.517380952835083} 11/07/2021 02:04:24 - INFO - __main__ - Step 34288: {'lr': 0.00044354853354949353, 'samples': 6583296, 'steps': 34287, 'loss/train': 1.3190693855285645} 11/07/2021 02:04:25 - INFO - __main__ - Step 34289: {'lr': 0.000443545174611528, 'samples': 6583488, 'steps': 34288, 'loss/train': 1.1464428901672363} 11/07/2021 02:04:25 - INFO - __main__ - Step 34290: {'lr': 0.000443541815586354, 'samples': 6583680, 'steps': 34289, 'loss/train': 1.4929620027542114} 11/07/2021 02:04:26 - INFO - __main__ - Step 34291: {'lr': 0.0004435384564739729, 'samples': 6583872, 'steps': 34290, 'loss/train': 1.5682131052017212} 11/07/2021 02:04:26 - INFO - __main__ - Step 34292: {'lr': 0.00044353509727438657, 'samples': 6584064, 'steps': 34291, 'loss/train': 1.5812877416610718} 11/07/2021 02:04:27 - INFO - __main__ - Step 34293: {'lr': 0.00044353173798759616, 'samples': 6584256, 'steps': 34292, 'loss/train': 1.7356139421463013} 11/07/2021 02:04:28 - INFO - __main__ - Step 34294: {'lr': 0.0004435283786136034, 'samples': 6584448, 'steps': 34293, 'loss/train': 1.6473431587219238} 11/07/2021 02:04:28 - INFO - __main__ - Step 34295: {'lr': 0.0004435250191524097, 'samples': 6584640, 'steps': 34294, 'loss/train': 1.5632716417312622} 11/07/2021 02:04:28 - INFO - __main__ - Step 34296: {'lr': 0.0004435216596040167, 'samples': 6584832, 'steps': 34295, 'loss/train': 1.4862587451934814} 11/07/2021 02:04:29 - INFO - __main__ - Step 34297: {'lr': 0.00044351829996842575, 'samples': 6585024, 'steps': 34296, 'loss/train': 0.8310204148292542} 11/07/2021 02:04:29 - INFO - __main__ - Step 34298: {'lr': 0.00044351494024563845, 'samples': 6585216, 'steps': 34297, 'loss/train': 1.7653529644012451} 11/07/2021 02:04:30 - INFO - __main__ - Step 34299: {'lr': 0.0004435115804356563, 'samples': 6585408, 'steps': 34298, 'loss/train': 0.5662867426872253} 11/07/2021 02:04:30 - INFO - __main__ - Step 34300: {'lr': 0.0004435082205384808, 'samples': 6585600, 'steps': 34299, 'loss/train': 1.818901777267456} 11/07/2021 02:04:31 - INFO - __main__ - Step 34301: {'lr': 0.00044350486055411354, 'samples': 6585792, 'steps': 34300, 'loss/train': 1.3899507522583008} 11/07/2021 02:04:31 - INFO - __main__ - Step 34302: {'lr': 0.000443501500482556, 'samples': 6585984, 'steps': 34301, 'loss/train': 0.19430121779441833} 11/07/2021 02:04:31 - INFO - __main__ - Step 34303: {'lr': 0.0004434981403238096, 'samples': 6586176, 'steps': 34302, 'loss/train': 1.502541184425354} 11/07/2021 02:04:32 - INFO - __main__ - Step 34304: {'lr': 0.0004434947800778759, 'samples': 6586368, 'steps': 34303, 'loss/train': 0.8370174765586853} 11/07/2021 02:04:33 - INFO - __main__ - Step 34305: {'lr': 0.0004434914197447565, 'samples': 6586560, 'steps': 34304, 'loss/train': 1.4450031518936157} 11/07/2021 02:04:33 - INFO - __main__ - Step 34306: {'lr': 0.0004434880593244528, 'samples': 6586752, 'steps': 34305, 'loss/train': 1.5231199264526367} 11/07/2021 02:04:33 - INFO - __main__ - Step 34307: {'lr': 0.0004434846988169664, 'samples': 6586944, 'steps': 34306, 'loss/train': 0.9623751640319824} 11/07/2021 02:04:34 - INFO - __main__ - Step 34308: {'lr': 0.0004434813382222989, 'samples': 6587136, 'steps': 34307, 'loss/train': 1.5782952308654785} 11/07/2021 02:04:34 - INFO - __main__ - Step 34309: {'lr': 0.0004434779775404515, 'samples': 6587328, 'steps': 34308, 'loss/train': 1.2805538177490234} 11/07/2021 02:04:35 - INFO - __main__ - Step 34310: {'lr': 0.000443474616771426, 'samples': 6587520, 'steps': 34309, 'loss/train': 1.5393397808074951} 11/07/2021 02:04:36 - INFO - __main__ - Step 34311: {'lr': 0.00044347125591522377, 'samples': 6587712, 'steps': 34310, 'loss/train': 1.085399866104126} 11/07/2021 02:04:36 - INFO - __main__ - Step 34312: {'lr': 0.00044346789497184643, 'samples': 6587904, 'steps': 34311, 'loss/train': 1.8326754570007324} 11/07/2021 02:04:36 - INFO - __main__ - Step 34313: {'lr': 0.0004434645339412954, 'samples': 6588096, 'steps': 34312, 'loss/train': 1.1530722379684448} 11/07/2021 02:04:37 - INFO - __main__ - Step 34314: {'lr': 0.0004434611728235722, 'samples': 6588288, 'steps': 34313, 'loss/train': 1.2123658657073975} 11/07/2021 02:04:38 - INFO - __main__ - Step 34315: {'lr': 0.0004434578116186785, 'samples': 6588480, 'steps': 34314, 'loss/train': 1.23298978805542} 11/07/2021 02:04:38 - INFO - __main__ - Step 34316: {'lr': 0.00044345445032661565, 'samples': 6588672, 'steps': 34315, 'loss/train': 1.730208158493042} 11/07/2021 02:04:38 - INFO - __main__ - Step 34317: {'lr': 0.0004434510889473852, 'samples': 6588864, 'steps': 34316, 'loss/train': 1.5077552795410156} 11/07/2021 02:04:39 - INFO - __main__ - Step 34318: {'lr': 0.00044344772748098867, 'samples': 6589056, 'steps': 34317, 'loss/train': 1.4679759740829468} 11/07/2021 02:04:39 - INFO - __main__ - Step 34319: {'lr': 0.00044344436592742755, 'samples': 6589248, 'steps': 34318, 'loss/train': 1.4131392240524292} 11/07/2021 02:04:41 - INFO - __main__ - Step 34320: {'lr': 0.0004434410042867034, 'samples': 6589440, 'steps': 34319, 'loss/train': 1.2468388080596924} 11/07/2021 02:04:41 - INFO - __main__ - Step 34321: {'lr': 0.0004434376425588178, 'samples': 6589632, 'steps': 34320, 'loss/train': 1.5157110691070557} 11/07/2021 02:04:41 - INFO - __main__ - Step 34322: {'lr': 0.00044343428074377207, 'samples': 6589824, 'steps': 34321, 'loss/train': 1.6445945501327515} 11/07/2021 02:04:42 - INFO - __main__ - Step 34323: {'lr': 0.0004434309188415679, 'samples': 6590016, 'steps': 34322, 'loss/train': 1.7932441234588623} 11/07/2021 02:04:42 - INFO - __main__ - Step 34324: {'lr': 0.0004434275568522067, 'samples': 6590208, 'steps': 34323, 'loss/train': 1.385819673538208} 11/07/2021 02:04:42 - INFO - __main__ - Step 34325: {'lr': 0.0004434241947756901, 'samples': 6590400, 'steps': 34324, 'loss/train': 1.7647372484207153} 11/07/2021 02:04:43 - INFO - __main__ - Step 34326: {'lr': 0.0004434208326120195, 'samples': 6590592, 'steps': 34325, 'loss/train': 1.27634596824646} 11/07/2021 02:04:44 - INFO - __main__ - Step 34327: {'lr': 0.0004434174703611964, 'samples': 6590784, 'steps': 34326, 'loss/train': 1.4668068885803223} 11/07/2021 02:04:44 - INFO - __main__ - Step 34328: {'lr': 0.00044341410802322247, 'samples': 6590976, 'steps': 34327, 'loss/train': 1.522247076034546} 11/07/2021 02:04:45 - INFO - __main__ - Step 34329: {'lr': 0.00044341074559809903, 'samples': 6591168, 'steps': 34328, 'loss/train': 1.5705645084381104} 11/07/2021 02:04:45 - INFO - __main__ - Step 34330: {'lr': 0.00044340738308582775, 'samples': 6591360, 'steps': 34329, 'loss/train': 1.3702070713043213} 11/07/2021 02:04:46 - INFO - __main__ - Step 34331: {'lr': 0.0004434040204864101, 'samples': 6591552, 'steps': 34330, 'loss/train': 1.5339696407318115} 11/07/2021 02:04:46 - INFO - __main__ - Step 34332: {'lr': 0.00044340065779984757, 'samples': 6591744, 'steps': 34331, 'loss/train': 1.4517829418182373} 11/07/2021 02:04:47 - INFO - __main__ - Step 34333: {'lr': 0.0004433972950261417, 'samples': 6591936, 'steps': 34332, 'loss/train': 1.3155765533447266} 11/07/2021 02:04:47 - INFO - __main__ - Step 34334: {'lr': 0.00044339393216529394, 'samples': 6592128, 'steps': 34333, 'loss/train': 1.4728405475616455} 11/07/2021 02:04:47 - INFO - __main__ - Step 34335: {'lr': 0.00044339056921730593, 'samples': 6592320, 'steps': 34334, 'loss/train': 1.1240208148956299} 11/07/2021 02:04:48 - INFO - __main__ - Step 34336: {'lr': 0.000443387206182179, 'samples': 6592512, 'steps': 34335, 'loss/train': 1.7121858596801758} 11/07/2021 02:04:49 - INFO - __main__ - Step 34337: {'lr': 0.0004433838430599149, 'samples': 6592704, 'steps': 34336, 'loss/train': 1.7511014938354492} 11/07/2021 02:04:49 - INFO - __main__ - Step 34338: {'lr': 0.000443380479850515, 'samples': 6592896, 'steps': 34337, 'loss/train': 1.3754760026931763} 11/07/2021 02:04:49 - INFO - __main__ - Step 34339: {'lr': 0.00044337711655398083, 'samples': 6593088, 'steps': 34338, 'loss/train': 1.6327975988388062} 11/07/2021 02:04:50 - INFO - __main__ - Step 34340: {'lr': 0.00044337375317031393, 'samples': 6593280, 'steps': 34339, 'loss/train': 1.2993462085723877} 11/07/2021 02:04:50 - INFO - __main__ - Step 34341: {'lr': 0.0004433703896995157, 'samples': 6593472, 'steps': 34340, 'loss/train': 1.703192114830017} 11/07/2021 02:04:51 - INFO - __main__ - Step 34342: {'lr': 0.0004433670261415879, 'samples': 6593664, 'steps': 34341, 'loss/train': 1.6066464185714722} 11/07/2021 02:04:51 - INFO - __main__ - Step 34343: {'lr': 0.0004433636624965318, 'samples': 6593856, 'steps': 34342, 'loss/train': 2.237250328063965} 11/07/2021 02:04:52 - INFO - __main__ - Step 34344: {'lr': 0.0004433602987643491, 'samples': 6594048, 'steps': 34343, 'loss/train': 1.402521014213562} 11/07/2021 02:04:52 - INFO - __main__ - Step 34345: {'lr': 0.00044335693494504115, 'samples': 6594240, 'steps': 34344, 'loss/train': 1.711129903793335} 11/07/2021 02:04:52 - INFO - __main__ - Step 34346: {'lr': 0.00044335357103860964, 'samples': 6594432, 'steps': 34345, 'loss/train': 1.3711202144622803} 11/07/2021 02:04:53 - INFO - __main__ - Step 34347: {'lr': 0.0004433502070450559, 'samples': 6594624, 'steps': 34346, 'loss/train': 1.3290324211120605} 11/07/2021 02:04:54 - INFO - __main__ - Step 34348: {'lr': 0.0004433468429643816, 'samples': 6594816, 'steps': 34347, 'loss/train': 1.5208077430725098} 11/07/2021 02:04:54 - INFO - __main__ - Step 34349: {'lr': 0.00044334347879658817, 'samples': 6595008, 'steps': 34348, 'loss/train': 1.363816261291504} 11/07/2021 02:04:55 - INFO - __main__ - Step 34350: {'lr': 0.0004433401145416771, 'samples': 6595200, 'steps': 34349, 'loss/train': 1.1282519102096558} 11/07/2021 02:04:55 - INFO - __main__ - Step 34351: {'lr': 0.00044333675019965, 'samples': 6595392, 'steps': 34350, 'loss/train': 1.5368919372558594} 11/07/2021 02:04:56 - INFO - __main__ - Step 34352: {'lr': 0.00044333338577050844, 'samples': 6595584, 'steps': 34351, 'loss/train': 1.2156026363372803} 11/07/2021 02:04:56 - INFO - __main__ - Step 34353: {'lr': 0.0004433300212542537, 'samples': 6595776, 'steps': 34352, 'loss/train': 1.3872274160385132} 11/07/2021 02:04:57 - INFO - __main__ - Step 34354: {'lr': 0.00044332665665088755, 'samples': 6595968, 'steps': 34353, 'loss/train': 1.441105604171753} 11/07/2021 02:04:57 - INFO - __main__ - Step 34355: {'lr': 0.00044332329196041133, 'samples': 6596160, 'steps': 34354, 'loss/train': 1.3930827379226685} 11/07/2021 02:04:57 - INFO - __main__ - Step 34356: {'lr': 0.0004433199271828267, 'samples': 6596352, 'steps': 34355, 'loss/train': 1.6682108640670776} 11/07/2021 02:04:58 - INFO - __main__ - Step 34357: {'lr': 0.0004433165623181349, 'samples': 6596544, 'steps': 34356, 'loss/train': 1.5918173789978027} 11/07/2021 02:04:59 - INFO - __main__ - Step 34358: {'lr': 0.0004433131973663378, 'samples': 6596736, 'steps': 34357, 'loss/train': 1.7932053804397583} 11/07/2021 02:04:59 - INFO - __main__ - Step 34359: {'lr': 0.0004433098323274367, 'samples': 6596928, 'steps': 34358, 'loss/train': 1.6864224672317505} 11/07/2021 02:04:59 - INFO - __main__ - Step 34360: {'lr': 0.00044330646720143317, 'samples': 6597120, 'steps': 34359, 'loss/train': 1.3417962789535522} 11/07/2021 02:05:00 - INFO - __main__ - Step 34361: {'lr': 0.0004433031019883288, 'samples': 6597312, 'steps': 34360, 'loss/train': 1.8701308965682983} 11/07/2021 02:05:00 - INFO - __main__ - Step 34362: {'lr': 0.00044329973668812497, 'samples': 6597504, 'steps': 34361, 'loss/train': 1.982292652130127} 11/07/2021 02:05:01 - INFO - __main__ - Step 34363: {'lr': 0.00044329637130082324, 'samples': 6597696, 'steps': 34362, 'loss/train': 1.4699153900146484} 11/07/2021 02:05:02 - INFO - __main__ - Step 34364: {'lr': 0.00044329300582642516, 'samples': 6597888, 'steps': 34363, 'loss/train': 1.9153653383255005} 11/07/2021 02:05:02 - INFO - __main__ - Step 34365: {'lr': 0.0004432896402649323, 'samples': 6598080, 'steps': 34364, 'loss/train': 0.23294086754322052} 11/07/2021 02:05:02 - INFO - __main__ - Step 34366: {'lr': 0.0004432862746163461, 'samples': 6598272, 'steps': 34365, 'loss/train': 0.15816542506217957} 11/07/2021 02:05:03 - INFO - __main__ - Step 34367: {'lr': 0.000443282908880668, 'samples': 6598464, 'steps': 34366, 'loss/train': 1.5377881526947021} 11/07/2021 02:05:04 - INFO - __main__ - Step 34368: {'lr': 0.00044327954305789963, 'samples': 6598656, 'steps': 34367, 'loss/train': 1.5757653713226318} 11/07/2021 02:05:04 - INFO - __main__ - Step 34369: {'lr': 0.0004432761771480426, 'samples': 6598848, 'steps': 34368, 'loss/train': 1.7795270681381226} 11/07/2021 02:05:05 - INFO - __main__ - Step 34370: {'lr': 0.0004432728111510982, 'samples': 6599040, 'steps': 34369, 'loss/train': 1.7433745861053467} 11/07/2021 02:05:05 - INFO - __main__ - Step 34371: {'lr': 0.000443269445067068, 'samples': 6599232, 'steps': 34370, 'loss/train': 1.4963665008544922} 11/07/2021 02:05:05 - INFO - __main__ - Step 34372: {'lr': 0.0004432660788959537, 'samples': 6599424, 'steps': 34371, 'loss/train': 1.4771994352340698} 11/07/2021 02:05:06 - INFO - __main__ - Step 34373: {'lr': 0.00044326271263775657, 'samples': 6599616, 'steps': 34372, 'loss/train': 1.2979824542999268} 11/07/2021 02:05:07 - INFO - __main__ - Step 34374: {'lr': 0.0004432593462924783, 'samples': 6599808, 'steps': 34373, 'loss/train': 1.4521307945251465} 11/07/2021 02:05:07 - INFO - __main__ - Step 34375: {'lr': 0.0004432559798601203, 'samples': 6600000, 'steps': 34374, 'loss/train': 2.020451307296753} 11/07/2021 02:05:08 - INFO - __main__ - Step 34376: {'lr': 0.0004432526133406842, 'samples': 6600192, 'steps': 34375, 'loss/train': 1.3123501539230347} 11/07/2021 02:05:08 - INFO - __main__ - Step 34377: {'lr': 0.0004432492467341715, 'samples': 6600384, 'steps': 34376, 'loss/train': 1.956892967224121} 11/07/2021 02:05:09 - INFO - __main__ - Step 34378: {'lr': 0.00044324588004058364, 'samples': 6600576, 'steps': 34377, 'loss/train': 1.793972373008728} 11/07/2021 02:05:09 - INFO - __main__ - Step 34379: {'lr': 0.00044324251325992214, 'samples': 6600768, 'steps': 34378, 'loss/train': 1.768172264099121} 11/07/2021 02:05:10 - INFO - __main__ - Step 34380: {'lr': 0.0004432391463921885, 'samples': 6600960, 'steps': 34379, 'loss/train': 1.3202482461929321} 11/07/2021 02:05:10 - INFO - __main__ - Step 34381: {'lr': 0.00044323577943738437, 'samples': 6601152, 'steps': 34380, 'loss/train': 1.0357273817062378} 11/07/2021 02:05:10 - INFO - __main__ - Step 34382: {'lr': 0.00044323241239551113, 'samples': 6601344, 'steps': 34381, 'loss/train': 1.7776743173599243} 11/07/2021 02:05:11 - INFO - __main__ - Step 34383: {'lr': 0.0004432290452665704, 'samples': 6601536, 'steps': 34382, 'loss/train': 1.7808390855789185} 11/07/2021 02:05:12 - INFO - __main__ - Step 34384: {'lr': 0.00044322567805056356, 'samples': 6601728, 'steps': 34383, 'loss/train': 0.9358270764350891} 11/07/2021 02:05:12 - INFO - __main__ - Step 34385: {'lr': 0.00044322231074749225, 'samples': 6601920, 'steps': 34384, 'loss/train': 1.4120177030563354} 11/07/2021 02:05:12 - INFO - __main__ - Step 34386: {'lr': 0.0004432189433573579, 'samples': 6602112, 'steps': 34385, 'loss/train': 1.6435332298278809} 11/07/2021 02:05:13 - INFO - __main__ - Step 34387: {'lr': 0.00044321557588016214, 'samples': 6602304, 'steps': 34386, 'loss/train': 1.2353405952453613} 11/07/2021 02:05:14 - INFO - __main__ - Step 34388: {'lr': 0.0004432122083159065, 'samples': 6602496, 'steps': 34387, 'loss/train': 1.4600342512130737} 11/07/2021 02:05:14 - INFO - __main__ - Step 34389: {'lr': 0.0004432088406645922, 'samples': 6602688, 'steps': 34388, 'loss/train': 1.603829264640808} 11/07/2021 02:05:15 - INFO - __main__ - Step 34390: {'lr': 0.00044320547292622114, 'samples': 6602880, 'steps': 34389, 'loss/train': 1.6319403648376465} 11/07/2021 02:05:15 - INFO - __main__ - Step 34391: {'lr': 0.0004432021051007946, 'samples': 6603072, 'steps': 34390, 'loss/train': 5.095420837402344} 11/07/2021 02:05:15 - INFO - __main__ - Step 34392: {'lr': 0.00044319873718831425, 'samples': 6603264, 'steps': 34391, 'loss/train': 4.394473552703857} 11/07/2021 02:05:16 - INFO - __main__ - Step 34393: {'lr': 0.00044319536918878156, 'samples': 6603456, 'steps': 34392, 'loss/train': 0.47028297185897827} 11/07/2021 02:05:17 - INFO - __main__ - Step 34394: {'lr': 0.00044319200110219794, 'samples': 6603648, 'steps': 34393, 'loss/train': 1.8763916492462158} 11/07/2021 02:05:17 - INFO - __main__ - Step 34395: {'lr': 0.000443188632928565, 'samples': 6603840, 'steps': 34394, 'loss/train': 0.4508192241191864} 11/07/2021 02:05:17 - INFO - __main__ - Step 34396: {'lr': 0.0004431852646678842, 'samples': 6604032, 'steps': 34395, 'loss/train': 1.4153156280517578} 11/07/2021 02:05:18 - INFO - __main__ - Step 34397: {'lr': 0.00044318189632015716, 'samples': 6604224, 'steps': 34396, 'loss/train': 1.6592044830322266} 11/07/2021 02:05:18 - INFO - __main__ - Step 34398: {'lr': 0.0004431785278853853, 'samples': 6604416, 'steps': 34397, 'loss/train': 0.4165332317352295} 11/07/2021 02:05:19 - INFO - __main__ - Step 34399: {'lr': 0.0004431751593635702, 'samples': 6604608, 'steps': 34398, 'loss/train': 1.0399876832962036} 11/07/2021 02:05:20 - INFO - __main__ - Step 34400: {'lr': 0.00044317179075471335, 'samples': 6604800, 'steps': 34399, 'loss/train': 0.6964524388313293} 11/07/2021 02:05:20 - INFO - __main__ - Step 34401: {'lr': 0.00044316842205881625, 'samples': 6604992, 'steps': 34400, 'loss/train': 1.3911337852478027} 11/07/2021 02:05:20 - INFO - __main__ - Step 34402: {'lr': 0.00044316505327588054, 'samples': 6605184, 'steps': 34401, 'loss/train': 1.3631318807601929} 11/07/2021 02:05:21 - INFO - __main__ - Step 34403: {'lr': 0.00044316168440590757, 'samples': 6605376, 'steps': 34402, 'loss/train': 1.8850730657577515} 11/07/2021 02:05:22 - INFO - __main__ - Step 34404: {'lr': 0.00044315831544889886, 'samples': 6605568, 'steps': 34403, 'loss/train': 1.9254916906356812} 11/07/2021 02:05:22 - INFO - __main__ - Step 34405: {'lr': 0.0004431549464048561, 'samples': 6605760, 'steps': 34404, 'loss/train': 1.6083252429962158} 11/07/2021 02:05:22 - INFO - __main__ - Step 34406: {'lr': 0.0004431515772737806, 'samples': 6605952, 'steps': 34405, 'loss/train': 1.5108321905136108} 11/07/2021 02:05:23 - INFO - __main__ - Step 34407: {'lr': 0.000443148208055674, 'samples': 6606144, 'steps': 34406, 'loss/train': 1.199692726135254} 11/07/2021 02:05:23 - INFO - __main__ - Step 34408: {'lr': 0.0004431448387505379, 'samples': 6606336, 'steps': 34407, 'loss/train': 1.8412938117980957} 11/07/2021 02:05:23 - INFO - __main__ - Step 34409: {'lr': 0.00044314146935837365, 'samples': 6606528, 'steps': 34408, 'loss/train': 1.6678533554077148} 11/07/2021 02:05:24 - INFO - __main__ - Step 34410: {'lr': 0.0004431380998791828, 'samples': 6606720, 'steps': 34409, 'loss/train': 1.5640023946762085} 11/07/2021 02:05:25 - INFO - __main__ - Step 34411: {'lr': 0.0004431347303129669, 'samples': 6606912, 'steps': 34410, 'loss/train': 1.5821495056152344} 11/07/2021 02:05:25 - INFO - __main__ - Step 34412: {'lr': 0.00044313136065972754, 'samples': 6607104, 'steps': 34411, 'loss/train': 2.051985502243042} 11/07/2021 02:05:25 - INFO - __main__ - Step 34413: {'lr': 0.0004431279909194661, 'samples': 6607296, 'steps': 34412, 'loss/train': 2.042417526245117} 11/07/2021 02:05:26 - INFO - __main__ - Step 34414: {'lr': 0.00044312462109218423, 'samples': 6607488, 'steps': 34413, 'loss/train': 0.17461176216602325} 11/07/2021 02:05:27 - INFO - __main__ - Step 34415: {'lr': 0.0004431212511778834, 'samples': 6607680, 'steps': 34414, 'loss/train': 1.5423073768615723} 11/07/2021 02:05:27 - INFO - __main__ - Step 34416: {'lr': 0.000443117881176565, 'samples': 6607872, 'steps': 34415, 'loss/train': 2.077214002609253} 11/07/2021 02:05:28 - INFO - __main__ - Step 34417: {'lr': 0.00044311451108823075, 'samples': 6608064, 'steps': 34416, 'loss/train': 1.7995798587799072} 11/07/2021 02:05:28 - INFO - __main__ - Step 34418: {'lr': 0.00044311114091288205, 'samples': 6608256, 'steps': 34417, 'loss/train': 1.594545841217041} 11/07/2021 02:05:29 - INFO - __main__ - Step 34419: {'lr': 0.0004431077706505205, 'samples': 6608448, 'steps': 34418, 'loss/train': 1.0214651823043823} 11/07/2021 02:05:30 - INFO - __main__ - Step 34420: {'lr': 0.0004431044003011475, 'samples': 6608640, 'steps': 34419, 'loss/train': 1.7008183002471924} 11/07/2021 02:05:30 - INFO - __main__ - Step 34421: {'lr': 0.00044310102986476463, 'samples': 6608832, 'steps': 34420, 'loss/train': 1.6831791400909424} 11/07/2021 02:05:30 - INFO - __main__ - Step 34422: {'lr': 0.0004430976593413735, 'samples': 6609024, 'steps': 34421, 'loss/train': 1.9238362312316895} 11/07/2021 02:05:31 - INFO - __main__ - Step 34423: {'lr': 0.0004430942887309755, 'samples': 6609216, 'steps': 34422, 'loss/train': 1.5944414138793945} 11/07/2021 02:05:31 - INFO - __main__ - Step 34424: {'lr': 0.00044309091803357216, 'samples': 6609408, 'steps': 34423, 'loss/train': 2.4328577518463135} 11/07/2021 02:05:32 - INFO - __main__ - Step 34425: {'lr': 0.0004430875472491651, 'samples': 6609600, 'steps': 34424, 'loss/train': 1.4454017877578735} 11/07/2021 02:05:33 - INFO - __main__ - Step 34426: {'lr': 0.0004430841763777557, 'samples': 6609792, 'steps': 34425, 'loss/train': 0.7977309823036194} 11/07/2021 02:05:33 - INFO - __main__ - Step 34427: {'lr': 0.0004430808054193456, 'samples': 6609984, 'steps': 34426, 'loss/train': 1.1987460851669312} 11/07/2021 02:05:33 - INFO - __main__ - Step 34428: {'lr': 0.00044307743437393623, 'samples': 6610176, 'steps': 34427, 'loss/train': 0.5758892893791199} 11/07/2021 02:05:34 - INFO - __main__ - Step 34429: {'lr': 0.0004430740632415292, 'samples': 6610368, 'steps': 34428, 'loss/train': 1.741318702697754} 11/07/2021 02:05:34 - INFO - __main__ - Step 34430: {'lr': 0.0004430706920221259, 'samples': 6610560, 'steps': 34429, 'loss/train': 1.1640336513519287} 11/07/2021 02:05:35 - INFO - __main__ - Step 34431: {'lr': 0.00044306732071572796, 'samples': 6610752, 'steps': 34430, 'loss/train': 1.7533520460128784} 11/07/2021 02:05:35 - INFO - __main__ - Step 34432: {'lr': 0.00044306394932233694, 'samples': 6610944, 'steps': 34431, 'loss/train': 1.181859016418457} 11/07/2021 02:05:36 - INFO - __main__ - Step 34433: {'lr': 0.0004430605778419542, 'samples': 6611136, 'steps': 34432, 'loss/train': 0.981018602848053} 11/07/2021 02:05:36 - INFO - __main__ - Step 34434: {'lr': 0.00044305720627458136, 'samples': 6611328, 'steps': 34433, 'loss/train': 1.5004545450210571} 11/07/2021 02:05:36 - INFO - __main__ - Step 34435: {'lr': 0.00044305383462022, 'samples': 6611520, 'steps': 34434, 'loss/train': 1.9751158952713013} 11/07/2021 02:05:37 - INFO - __main__ - Step 34436: {'lr': 0.0004430504628788714, 'samples': 6611712, 'steps': 34435, 'loss/train': 1.7379834651947021} 11/07/2021 02:05:38 - INFO - __main__ - Step 34437: {'lr': 0.0004430470910505373, 'samples': 6611904, 'steps': 34436, 'loss/train': 1.6008168458938599} 11/07/2021 02:05:38 - INFO - __main__ - Step 34438: {'lr': 0.00044304371913521926, 'samples': 6612096, 'steps': 34437, 'loss/train': 1.5827112197875977} 11/07/2021 02:05:38 - INFO - __main__ - Step 34439: {'lr': 0.0004430403471329186, 'samples': 6612288, 'steps': 34438, 'loss/train': 1.6009305715560913} 11/07/2021 02:05:39 - INFO - __main__ - Step 34440: {'lr': 0.0004430369750436369, 'samples': 6612480, 'steps': 34439, 'loss/train': 1.3736330270767212} 11/07/2021 02:05:40 - INFO - __main__ - Step 34441: {'lr': 0.0004430336028673758, 'samples': 6612672, 'steps': 34440, 'loss/train': 1.412989854812622} 11/07/2021 02:05:40 - INFO - __main__ - Step 34442: {'lr': 0.00044303023060413677, 'samples': 6612864, 'steps': 34441, 'loss/train': 1.7590703964233398} 11/07/2021 02:05:41 - INFO - __main__ - Step 34443: {'lr': 0.0004430268582539212, 'samples': 6613056, 'steps': 34442, 'loss/train': 0.712694525718689} 11/07/2021 02:05:41 - INFO - __main__ - Step 34444: {'lr': 0.0004430234858167308, 'samples': 6613248, 'steps': 34443, 'loss/train': 1.2908469438552856} 11/07/2021 02:05:41 - INFO - __main__ - Step 34445: {'lr': 0.000443020113292567, 'samples': 6613440, 'steps': 34444, 'loss/train': 1.832324743270874} 11/07/2021 02:05:42 - INFO - __main__ - Step 34446: {'lr': 0.0004430167406814312, 'samples': 6613632, 'steps': 34445, 'loss/train': 1.7984143495559692} 11/07/2021 02:05:43 - INFO - __main__ - Step 34447: {'lr': 0.0004430133679833251, 'samples': 6613824, 'steps': 34446, 'loss/train': 1.3497729301452637} 11/07/2021 02:05:43 - INFO - __main__ - Step 34448: {'lr': 0.00044300999519825016, 'samples': 6614016, 'steps': 34447, 'loss/train': 1.2625563144683838} 11/07/2021 02:05:43 - INFO - __main__ - Step 34449: {'lr': 0.00044300662232620784, 'samples': 6614208, 'steps': 34448, 'loss/train': 1.7782011032104492} 11/07/2021 02:05:44 - INFO - __main__ - Step 34450: {'lr': 0.0004430032493671998, 'samples': 6614400, 'steps': 34449, 'loss/train': 1.4495292901992798} 11/07/2021 02:05:45 - INFO - __main__ - Step 34451: {'lr': 0.0004429998763212274, 'samples': 6614592, 'steps': 34450, 'loss/train': 1.4055038690567017} 11/07/2021 02:05:45 - INFO - __main__ - Step 34452: {'lr': 0.00044299650318829233, 'samples': 6614784, 'steps': 34451, 'loss/train': 1.5259429216384888} 11/07/2021 02:05:45 - INFO - __main__ - Step 34453: {'lr': 0.0004429931299683959, 'samples': 6614976, 'steps': 34452, 'loss/train': 1.4391940832138062} 11/07/2021 02:05:46 - INFO - __main__ - Step 34454: {'lr': 0.0004429897566615398, 'samples': 6615168, 'steps': 34453, 'loss/train': 1.6740025281906128} 11/07/2021 02:05:46 - INFO - __main__ - Step 34455: {'lr': 0.0004429863832677255, 'samples': 6615360, 'steps': 34454, 'loss/train': 1.5832898616790771} 11/07/2021 02:05:47 - INFO - __main__ - Step 34456: {'lr': 0.0004429830097869545, 'samples': 6615552, 'steps': 34455, 'loss/train': 2.2626101970672607} 11/07/2021 02:05:48 - INFO - __main__ - Step 34457: {'lr': 0.0004429796362192283, 'samples': 6615744, 'steps': 34456, 'loss/train': 1.631561279296875} 11/07/2021 02:05:48 - INFO - __main__ - Step 34458: {'lr': 0.0004429762625645485, 'samples': 6615936, 'steps': 34457, 'loss/train': 1.811454176902771} 11/07/2021 02:05:48 - INFO - __main__ - Step 34459: {'lr': 0.0004429728888229166, 'samples': 6616128, 'steps': 34458, 'loss/train': 0.7323458194732666} 11/07/2021 02:05:49 - INFO - __main__ - Step 34460: {'lr': 0.000442969514994334, 'samples': 6616320, 'steps': 34459, 'loss/train': 2.325310707092285} 11/07/2021 02:05:49 - INFO - __main__ - Step 34461: {'lr': 0.0004429661410788024, 'samples': 6616512, 'steps': 34460, 'loss/train': 1.6277323961257935} 11/07/2021 02:05:50 - INFO - __main__ - Step 34462: {'lr': 0.00044296276707632323, 'samples': 6616704, 'steps': 34461, 'loss/train': 1.4860053062438965} 11/07/2021 02:05:50 - INFO - __main__ - Step 34463: {'lr': 0.000442959392986898, 'samples': 6616896, 'steps': 34462, 'loss/train': 1.9971235990524292} 11/07/2021 02:05:51 - INFO - __main__ - Step 34464: {'lr': 0.0004429560188105282, 'samples': 6617088, 'steps': 34463, 'loss/train': 1.9872291088104248} 11/07/2021 02:05:51 - INFO - __main__ - Step 34465: {'lr': 0.00044295264454721544, 'samples': 6617280, 'steps': 34464, 'loss/train': 1.2352428436279297} 11/07/2021 02:05:51 - INFO - __main__ - Step 34466: {'lr': 0.0004429492701969612, 'samples': 6617472, 'steps': 34465, 'loss/train': 1.6460707187652588} 11/07/2021 02:05:52 - INFO - __main__ - Step 34467: {'lr': 0.00044294589575976696, 'samples': 6617664, 'steps': 34466, 'loss/train': 1.4971388578414917} 11/07/2021 02:05:53 - INFO - __main__ - Step 34468: {'lr': 0.00044294252123563434, 'samples': 6617856, 'steps': 34467, 'loss/train': 1.170795202255249} 11/07/2021 02:05:53 - INFO - __main__ - Step 34469: {'lr': 0.00044293914662456475, 'samples': 6618048, 'steps': 34468, 'loss/train': 1.8215768337249756} 11/07/2021 02:05:53 - INFO - __main__ - Step 34470: {'lr': 0.00044293577192655977, 'samples': 6618240, 'steps': 34469, 'loss/train': 1.0702993869781494} 11/07/2021 02:05:54 - INFO - __main__ - Step 34471: {'lr': 0.0004429323971416209, 'samples': 6618432, 'steps': 34470, 'loss/train': 1.573993444442749} 11/07/2021 02:05:55 - INFO - __main__ - Step 34472: {'lr': 0.0004429290222697497, 'samples': 6618624, 'steps': 34471, 'loss/train': 1.3658677339553833} 11/07/2021 02:05:55 - INFO - __main__ - Step 34473: {'lr': 0.0004429256473109476, 'samples': 6618816, 'steps': 34472, 'loss/train': 2.1451096534729004} 11/07/2021 02:05:56 - INFO - __main__ - Step 34474: {'lr': 0.0004429222722652162, 'samples': 6619008, 'steps': 34473, 'loss/train': 1.6938809156417847} 11/07/2021 02:05:56 - INFO - __main__ - Step 34475: {'lr': 0.0004429188971325571, 'samples': 6619200, 'steps': 34474, 'loss/train': 1.140890121459961} 11/07/2021 02:05:56 - INFO - __main__ - Step 34476: {'lr': 0.00044291552191297155, 'samples': 6619392, 'steps': 34475, 'loss/train': 1.288952350616455} 11/07/2021 02:05:57 - INFO - __main__ - Step 34477: {'lr': 0.0004429121466064614, 'samples': 6619584, 'steps': 34476, 'loss/train': 1.6861153841018677} 11/07/2021 02:05:58 - INFO - __main__ - Step 34478: {'lr': 0.0004429087712130279, 'samples': 6619776, 'steps': 34477, 'loss/train': 1.5473966598510742} 11/07/2021 02:05:58 - INFO - __main__ - Step 34479: {'lr': 0.00044290539573267276, 'samples': 6619968, 'steps': 34478, 'loss/train': 1.5820155143737793} 11/07/2021 02:05:58 - INFO - __main__ - Step 34480: {'lr': 0.00044290202016539736, 'samples': 6620160, 'steps': 34479, 'loss/train': 1.4073911905288696} 11/07/2021 02:05:59 - INFO - __main__ - Step 34481: {'lr': 0.0004428986445112033, 'samples': 6620352, 'steps': 34480, 'loss/train': 1.5794472694396973} 11/07/2021 02:06:00 - INFO - __main__ - Step 34482: {'lr': 0.00044289526877009213, 'samples': 6620544, 'steps': 34481, 'loss/train': 0.8621578216552734} 11/07/2021 02:06:00 - INFO - __main__ - Step 34483: {'lr': 0.00044289189294206534, 'samples': 6620736, 'steps': 34482, 'loss/train': 1.5604722499847412} 11/07/2021 02:06:01 - INFO - __main__ - Step 34484: {'lr': 0.0004428885170271244, 'samples': 6620928, 'steps': 34483, 'loss/train': 1.750520944595337} 11/07/2021 02:06:01 - INFO - __main__ - Step 34485: {'lr': 0.0004428851410252709, 'samples': 6621120, 'steps': 34484, 'loss/train': 1.702553629875183} 11/07/2021 02:06:01 - INFO - __main__ - Step 34486: {'lr': 0.0004428817649365063, 'samples': 6621312, 'steps': 34485, 'loss/train': 1.4129055738449097} 11/07/2021 02:06:02 - INFO - __main__ - Step 34487: {'lr': 0.0004428783887608321, 'samples': 6621504, 'steps': 34486, 'loss/train': 1.323349118232727} 11/07/2021 02:06:03 - INFO - __main__ - Step 34488: {'lr': 0.00044287501249824996, 'samples': 6621696, 'steps': 34487, 'loss/train': 1.4780468940734863} 11/07/2021 02:06:03 - INFO - __main__ - Step 34489: {'lr': 0.0004428716361487613, 'samples': 6621888, 'steps': 34488, 'loss/train': 1.4240431785583496} 11/07/2021 02:06:03 - INFO - __main__ - Step 34490: {'lr': 0.0004428682597123677, 'samples': 6622080, 'steps': 34489, 'loss/train': 1.1206878423690796} 11/07/2021 02:06:04 - INFO - __main__ - Step 34491: {'lr': 0.0004428648831890705, 'samples': 6622272, 'steps': 34490, 'loss/train': 1.1265442371368408} 11/07/2021 02:06:04 - INFO - __main__ - Step 34492: {'lr': 0.0004428615065788715, 'samples': 6622464, 'steps': 34491, 'loss/train': 1.5670143365859985} 11/07/2021 02:06:05 - INFO - __main__ - Step 34493: {'lr': 0.00044285812988177197, 'samples': 6622656, 'steps': 34492, 'loss/train': 1.810963749885559} 11/07/2021 02:06:05 - INFO - __main__ - Step 34494: {'lr': 0.0004428547530977736, 'samples': 6622848, 'steps': 34493, 'loss/train': 2.4662539958953857} 11/07/2021 02:06:06 - INFO - __main__ - Step 34495: {'lr': 0.0004428513762268779, 'samples': 6623040, 'steps': 34494, 'loss/train': 1.601396918296814} 11/07/2021 02:06:06 - INFO - __main__ - Step 34496: {'lr': 0.00044284799926908627, 'samples': 6623232, 'steps': 34495, 'loss/train': 1.6252416372299194} 11/07/2021 02:06:06 - INFO - __main__ - Step 34497: {'lr': 0.0004428446222244004, 'samples': 6623424, 'steps': 34496, 'loss/train': 1.8902145624160767} 11/07/2021 02:06:07 - INFO - __main__ - Step 34498: {'lr': 0.0004428412450928216, 'samples': 6623616, 'steps': 34497, 'loss/train': 2.3211231231689453} 11/07/2021 02:06:08 - INFO - __main__ - Step 34499: {'lr': 0.00044283786787435156, 'samples': 6623808, 'steps': 34498, 'loss/train': 1.5495243072509766} 11/07/2021 02:06:08 - INFO - __main__ - Step 34500: {'lr': 0.0004428344905689917, 'samples': 6624000, 'steps': 34499, 'loss/train': 1.4227359294891357} 11/07/2021 02:06:09 - INFO - __main__ - Step 34501: {'lr': 0.0004428311131767437, 'samples': 6624192, 'steps': 34500, 'loss/train': 1.3194031715393066} 11/07/2021 02:06:09 - INFO - __main__ - Step 34502: {'lr': 0.0004428277356976089, 'samples': 6624384, 'steps': 34501, 'loss/train': 1.5118417739868164} 11/07/2021 02:06:10 - INFO - __main__ - Step 34503: {'lr': 0.0004428243581315889, 'samples': 6624576, 'steps': 34502, 'loss/train': 1.6565394401550293} 11/07/2021 02:06:10 - INFO - __main__ - Step 34504: {'lr': 0.0004428209804786853, 'samples': 6624768, 'steps': 34503, 'loss/train': 2.0643577575683594} 11/07/2021 02:06:11 - INFO - __main__ - Step 34505: {'lr': 0.0004428176027388995, 'samples': 6624960, 'steps': 34504, 'loss/train': 1.7344928979873657} 11/07/2021 02:06:11 - INFO - __main__ - Step 34506: {'lr': 0.0004428142249122331, 'samples': 6625152, 'steps': 34505, 'loss/train': 1.3635393381118774} 11/07/2021 02:06:11 - INFO - __main__ - Step 34507: {'lr': 0.00044281084699868747, 'samples': 6625344, 'steps': 34506, 'loss/train': 2.03305983543396} 11/07/2021 02:06:12 - INFO - __main__ - Step 34508: {'lr': 0.0004428074689982643, 'samples': 6625536, 'steps': 34507, 'loss/train': 1.5268868207931519} 11/07/2021 02:06:13 - INFO - __main__ - Step 34509: {'lr': 0.0004428040909109651, 'samples': 6625728, 'steps': 34508, 'loss/train': 1.0226173400878906} 11/07/2021 02:06:13 - INFO - __main__ - Step 34510: {'lr': 0.00044280071273679133, 'samples': 6625920, 'steps': 34509, 'loss/train': 2.0236382484436035} 11/07/2021 02:06:13 - INFO - __main__ - Step 34511: {'lr': 0.00044279733447574456, 'samples': 6626112, 'steps': 34510, 'loss/train': 1.3775317668914795} 11/07/2021 02:06:14 - INFO - __main__ - Step 34512: {'lr': 0.00044279395612782625, 'samples': 6626304, 'steps': 34511, 'loss/train': 1.2630903720855713} 11/07/2021 02:06:14 - INFO - __main__ - Step 34513: {'lr': 0.0004427905776930379, 'samples': 6626496, 'steps': 34512, 'loss/train': 1.267950415611267} 11/07/2021 02:06:15 - INFO - __main__ - Step 34514: {'lr': 0.0004427871991713812, 'samples': 6626688, 'steps': 34513, 'loss/train': 1.7912590503692627} 11/07/2021 02:06:15 - INFO - __main__ - Step 34515: {'lr': 0.0004427838205628575, 'samples': 6626880, 'steps': 34514, 'loss/train': 1.34334397315979} 11/07/2021 02:06:16 - INFO - __main__ - Step 34516: {'lr': 0.0004427804418674684, 'samples': 6627072, 'steps': 34515, 'loss/train': 1.7392035722732544} 11/07/2021 02:06:16 - INFO - __main__ - Step 34517: {'lr': 0.00044277706308521543, 'samples': 6627264, 'steps': 34516, 'loss/train': 1.7281630039215088} 11/07/2021 02:06:16 - INFO - __main__ - Step 34518: {'lr': 0.0004427736842161001, 'samples': 6627456, 'steps': 34517, 'loss/train': 1.7590572834014893} 11/07/2021 02:06:18 - INFO - __main__ - Step 34519: {'lr': 0.00044277030526012386, 'samples': 6627648, 'steps': 34518, 'loss/train': 1.390249252319336} 11/07/2021 02:06:18 - INFO - __main__ - Step 34520: {'lr': 0.0004427669262172883, 'samples': 6627840, 'steps': 34519, 'loss/train': 1.3163725137710571} 11/07/2021 02:06:18 - INFO - __main__ - Step 34521: {'lr': 0.000442763547087595, 'samples': 6628032, 'steps': 34520, 'loss/train': 1.7077306509017944} 11/07/2021 02:06:19 - INFO - __main__ - Step 34522: {'lr': 0.00044276016787104535, 'samples': 6628224, 'steps': 34521, 'loss/train': 1.5454742908477783} 11/07/2021 02:06:19 - INFO - __main__ - Step 34523: {'lr': 0.000442756788567641, 'samples': 6628416, 'steps': 34522, 'loss/train': 1.7005842924118042} 11/07/2021 02:06:20 - INFO - __main__ - Step 34524: {'lr': 0.0004427534091773834, 'samples': 6628608, 'steps': 34523, 'loss/train': 1.4194958209991455} 11/07/2021 02:06:20 - INFO - __main__ - Step 34525: {'lr': 0.00044275002970027403, 'samples': 6628800, 'steps': 34524, 'loss/train': 1.3518060445785522} 11/07/2021 02:06:21 - INFO - __main__ - Step 34526: {'lr': 0.00044274665013631457, 'samples': 6628992, 'steps': 34525, 'loss/train': 1.4356197118759155} 11/07/2021 02:06:21 - INFO - __main__ - Step 34527: {'lr': 0.0004427432704855064, 'samples': 6629184, 'steps': 34526, 'loss/train': 2.385096311569214} 11/07/2021 02:06:21 - INFO - __main__ - Step 34528: {'lr': 0.000442739890747851, 'samples': 6629376, 'steps': 34527, 'loss/train': 1.1072453260421753} 11/07/2021 02:06:22 - INFO - __main__ - Step 34529: {'lr': 0.0004427365109233502, 'samples': 6629568, 'steps': 34528, 'loss/train': 1.5673176050186157} 11/07/2021 02:06:23 - INFO - __main__ - Step 34530: {'lr': 0.00044273313101200507, 'samples': 6629760, 'steps': 34529, 'loss/train': 1.4148625135421753} 11/07/2021 02:06:23 - INFO - __main__ - Step 34531: {'lr': 0.00044272975101381754, 'samples': 6629952, 'steps': 34530, 'loss/train': 1.4686460494995117} 11/07/2021 02:06:23 - INFO - __main__ - Step 34532: {'lr': 0.0004427263709287889, 'samples': 6630144, 'steps': 34531, 'loss/train': 1.4572937488555908} 11/07/2021 02:06:24 - INFO - __main__ - Step 34533: {'lr': 0.00044272299075692067, 'samples': 6630336, 'steps': 34532, 'loss/train': 1.7855355739593506} 11/07/2021 02:06:24 - INFO - __main__ - Step 34534: {'lr': 0.0004427196104982145, 'samples': 6630528, 'steps': 34533, 'loss/train': 1.7922431230545044} 11/07/2021 02:06:25 - INFO - __main__ - Step 34535: {'lr': 0.0004427162301526718, 'samples': 6630720, 'steps': 34534, 'loss/train': 1.4673815965652466} 11/07/2021 02:06:26 - INFO - __main__ - Step 34536: {'lr': 0.0004427128497202941, 'samples': 6630912, 'steps': 34535, 'loss/train': 1.608618974685669} 11/07/2021 02:06:26 - INFO - __main__ - Step 34537: {'lr': 0.00044270946920108305, 'samples': 6631104, 'steps': 34536, 'loss/train': 1.6207798719406128} 11/07/2021 02:06:26 - INFO - __main__ - Step 34538: {'lr': 0.00044270608859504006, 'samples': 6631296, 'steps': 34537, 'loss/train': 0.22153504192829132} 11/07/2021 02:06:27 - INFO - __main__ - Step 34539: {'lr': 0.0004427027079021667, 'samples': 6631488, 'steps': 34538, 'loss/train': 1.4280678033828735} 11/07/2021 02:06:28 - INFO - __main__ - Step 34540: {'lr': 0.0004426993271224645, 'samples': 6631680, 'steps': 34539, 'loss/train': 1.7425754070281982} 11/07/2021 02:06:28 - INFO - __main__ - Step 34541: {'lr': 0.0004426959462559349, 'samples': 6631872, 'steps': 34540, 'loss/train': 1.6339389085769653} 11/07/2021 02:06:28 - INFO - __main__ - Step 34542: {'lr': 0.0004426925653025795, 'samples': 6632064, 'steps': 34541, 'loss/train': 0.9596696496009827} 11/07/2021 02:06:29 - INFO - __main__ - Step 34543: {'lr': 0.0004426891842623998, 'samples': 6632256, 'steps': 34542, 'loss/train': 1.1412307024002075} 11/07/2021 02:06:29 - INFO - __main__ - Step 34544: {'lr': 0.0004426858031353973, 'samples': 6632448, 'steps': 34543, 'loss/train': 1.977781891822815} 11/07/2021 02:06:30 - INFO - __main__ - Step 34545: {'lr': 0.0004426824219215736, 'samples': 6632640, 'steps': 34544, 'loss/train': 1.415607213973999} 11/07/2021 02:06:31 - INFO - __main__ - Step 34546: {'lr': 0.00044267904062093014, 'samples': 6632832, 'steps': 34545, 'loss/train': 1.7016005516052246} 11/07/2021 02:06:31 - INFO - __main__ - Step 34547: {'lr': 0.0004426756592334685, 'samples': 6633024, 'steps': 34546, 'loss/train': 1.861208438873291} 11/07/2021 02:06:31 - INFO - __main__ - Step 34548: {'lr': 0.0004426722777591902, 'samples': 6633216, 'steps': 34547, 'loss/train': 1.6598010063171387} 11/07/2021 02:06:32 - INFO - __main__ - Step 34549: {'lr': 0.00044266889619809665, 'samples': 6633408, 'steps': 34548, 'loss/train': 0.9987548589706421} 11/07/2021 02:06:33 - INFO - __main__ - Step 34550: {'lr': 0.00044266551455018953, 'samples': 6633600, 'steps': 34549, 'loss/train': 1.3395533561706543} 11/07/2021 02:06:33 - INFO - __main__ - Step 34551: {'lr': 0.0004426621328154703, 'samples': 6633792, 'steps': 34550, 'loss/train': 1.5238016843795776} 11/07/2021 02:06:33 - INFO - __main__ - Step 34552: {'lr': 0.0004426587509939405, 'samples': 6633984, 'steps': 34551, 'loss/train': 1.5283972024917603} 11/07/2021 02:06:34 - INFO - __main__ - Step 34553: {'lr': 0.0004426553690856016, 'samples': 6634176, 'steps': 34552, 'loss/train': 0.9244617223739624} 11/07/2021 02:06:34 - INFO - __main__ - Step 34554: {'lr': 0.0004426519870904552, 'samples': 6634368, 'steps': 34553, 'loss/train': 1.0109336376190186} 11/07/2021 02:06:35 - INFO - __main__ - Step 34555: {'lr': 0.0004426486050085028, 'samples': 6634560, 'steps': 34554, 'loss/train': 1.4150766134262085} 11/07/2021 02:06:35 - INFO - __main__ - Step 34556: {'lr': 0.0004426452228397458, 'samples': 6634752, 'steps': 34555, 'loss/train': 1.6579521894454956} 11/07/2021 02:06:36 - INFO - __main__ - Step 34557: {'lr': 0.000442641840584186, 'samples': 6634944, 'steps': 34556, 'loss/train': 1.2919400930404663} 11/07/2021 02:06:36 - INFO - __main__ - Step 34558: {'lr': 0.00044263845824182467, 'samples': 6635136, 'steps': 34557, 'loss/train': 1.589131474494934} 11/07/2021 02:06:37 - INFO - __main__ - Step 34559: {'lr': 0.0004426350758126634, 'samples': 6635328, 'steps': 34558, 'loss/train': 1.3349565267562866} 11/07/2021 02:06:37 - INFO - __main__ - Step 34560: {'lr': 0.0004426316932967038, 'samples': 6635520, 'steps': 34559, 'loss/train': 1.3592138290405273} 11/07/2021 02:06:38 - INFO - __main__ - Step 34561: {'lr': 0.0004426283106939473, 'samples': 6635712, 'steps': 34560, 'loss/train': 0.11542725563049316} 11/07/2021 02:06:38 - INFO - __main__ - Step 34562: {'lr': 0.00044262492800439547, 'samples': 6635904, 'steps': 34561, 'loss/train': 1.0397703647613525} 11/07/2021 02:06:39 - INFO - __main__ - Step 34563: {'lr': 0.00044262154522804986, 'samples': 6636096, 'steps': 34562, 'loss/train': 1.0929821729660034} 11/07/2021 02:06:39 - INFO - __main__ - Step 34564: {'lr': 0.00044261816236491186, 'samples': 6636288, 'steps': 34563, 'loss/train': 1.6028127670288086} 11/07/2021 02:06:39 - INFO - __main__ - Step 34565: {'lr': 0.00044261477941498316, 'samples': 6636480, 'steps': 34564, 'loss/train': 1.8307973146438599} 11/07/2021 02:06:41 - INFO - __main__ - Step 34566: {'lr': 0.0004426113963782652, 'samples': 6636672, 'steps': 34565, 'loss/train': 1.577295184135437} 11/07/2021 02:06:41 - INFO - __main__ - Step 34567: {'lr': 0.00044260801325475953, 'samples': 6636864, 'steps': 34566, 'loss/train': 1.518283724784851} 11/07/2021 02:06:41 - INFO - __main__ - Step 34568: {'lr': 0.0004426046300444676, 'samples': 6637056, 'steps': 34567, 'loss/train': 1.3665200471878052} 11/07/2021 02:06:42 - INFO - __main__ - Step 34569: {'lr': 0.000442601246747391, 'samples': 6637248, 'steps': 34568, 'loss/train': 0.9717766046524048} 11/07/2021 02:06:42 - INFO - __main__ - Step 34570: {'lr': 0.0004425978633635313, 'samples': 6637440, 'steps': 34569, 'loss/train': 1.7114320993423462} 11/07/2021 02:06:43 - INFO - __main__ - Step 34571: {'lr': 0.0004425944798928899, 'samples': 6637632, 'steps': 34570, 'loss/train': 1.2611310482025146} 11/07/2021 02:06:43 - INFO - __main__ - Step 34572: {'lr': 0.0004425910963354685, 'samples': 6637824, 'steps': 34571, 'loss/train': 1.8750895261764526} 11/07/2021 02:06:44 - INFO - __main__ - Step 34573: {'lr': 0.0004425877126912685, 'samples': 6638016, 'steps': 34572, 'loss/train': 1.4090310335159302} 11/07/2021 02:06:44 - INFO - __main__ - Step 34574: {'lr': 0.00044258432896029145, 'samples': 6638208, 'steps': 34573, 'loss/train': 1.5623373985290527} 11/07/2021 02:06:44 - INFO - __main__ - Step 34575: {'lr': 0.00044258094514253876, 'samples': 6638400, 'steps': 34574, 'loss/train': 1.7206676006317139} 11/07/2021 02:06:46 - INFO - __main__ - Step 34576: {'lr': 0.00044257756123801216, 'samples': 6638592, 'steps': 34575, 'loss/train': 1.7571005821228027} 11/07/2021 02:06:46 - INFO - __main__ - Step 34577: {'lr': 0.0004425741772467131, 'samples': 6638784, 'steps': 34576, 'loss/train': 1.5985610485076904} 11/07/2021 02:06:46 - INFO - __main__ - Step 34578: {'lr': 0.0004425707931686431, 'samples': 6638976, 'steps': 34577, 'loss/train': 1.5311017036437988} 11/07/2021 02:06:47 - INFO - __main__ - Step 34579: {'lr': 0.00044256740900380364, 'samples': 6639168, 'steps': 34578, 'loss/train': 1.5247910022735596} 11/07/2021 02:06:47 - INFO - __main__ - Step 34580: {'lr': 0.0004425640247521963, 'samples': 6639360, 'steps': 34579, 'loss/train': 1.1286942958831787} 11/07/2021 02:06:48 - INFO - __main__ - Step 34581: {'lr': 0.00044256064041382255, 'samples': 6639552, 'steps': 34580, 'loss/train': 1.1327813863754272} 11/07/2021 02:06:48 - INFO - __main__ - Step 34582: {'lr': 0.0004425572559886839, 'samples': 6639744, 'steps': 34581, 'loss/train': 1.8219044208526611} 11/07/2021 02:06:49 - INFO - __main__ - Step 34583: {'lr': 0.00044255387147678206, 'samples': 6639936, 'steps': 34582, 'loss/train': 1.4360731840133667} 11/07/2021 02:06:49 - INFO - __main__ - Step 34584: {'lr': 0.0004425504868781183, 'samples': 6640128, 'steps': 34583, 'loss/train': 1.4580005407333374} 11/07/2021 02:06:49 - INFO - __main__ - Step 34585: {'lr': 0.0004425471021926943, 'samples': 6640320, 'steps': 34584, 'loss/train': 1.171671986579895} 11/07/2021 02:06:51 - INFO - __main__ - Step 34586: {'lr': 0.0004425437174205115, 'samples': 6640512, 'steps': 34585, 'loss/train': 1.599912166595459} 11/07/2021 02:06:51 - INFO - __main__ - Step 34587: {'lr': 0.00044254033256157154, 'samples': 6640704, 'steps': 34586, 'loss/train': 1.182337760925293} 11/07/2021 02:06:51 - INFO - __main__ - Step 34588: {'lr': 0.0004425369476158759, 'samples': 6640896, 'steps': 34587, 'loss/train': 1.4919441938400269} 11/07/2021 02:06:52 - INFO - __main__ - Step 34589: {'lr': 0.000442533562583426, 'samples': 6641088, 'steps': 34588, 'loss/train': 1.6986876726150513} 11/07/2021 02:06:52 - INFO - __main__ - Step 34590: {'lr': 0.00044253017746422355, 'samples': 6641280, 'steps': 34589, 'loss/train': 1.2049636840820312} 11/07/2021 02:06:53 - INFO - __main__ - Step 34591: {'lr': 0.00044252679225826984, 'samples': 6641472, 'steps': 34590, 'loss/train': 1.3515605926513672} 11/07/2021 02:06:53 - INFO - __main__ - Step 34592: {'lr': 0.0004425234069655666, 'samples': 6641664, 'steps': 34591, 'loss/train': 1.5792065858840942} 11/07/2021 02:06:54 - INFO - __main__ - Step 34593: {'lr': 0.0004425200215861153, 'samples': 6641856, 'steps': 34592, 'loss/train': 1.6354613304138184} 11/07/2021 02:06:54 - INFO - __main__ - Step 34594: {'lr': 0.00044251663611991743, 'samples': 6642048, 'steps': 34593, 'loss/train': 0.740597128868103} 11/07/2021 02:06:54 - INFO - __main__ - Step 34595: {'lr': 0.0004425132505669745, 'samples': 6642240, 'steps': 34594, 'loss/train': 0.7850571274757385} 11/07/2021 02:06:56 - INFO - __main__ - Step 34596: {'lr': 0.00044250986492728805, 'samples': 6642432, 'steps': 34595, 'loss/train': 1.890939712524414} 11/07/2021 02:06:56 - INFO - __main__ - Step 34597: {'lr': 0.0004425064792008597, 'samples': 6642624, 'steps': 34596, 'loss/train': 2.3666765689849854} 11/07/2021 02:06:56 - INFO - __main__ - Step 34598: {'lr': 0.0004425030933876909, 'samples': 6642816, 'steps': 34597, 'loss/train': 1.675109624862671} 11/07/2021 02:06:57 - INFO - __main__ - Step 34599: {'lr': 0.0004424997074877831, 'samples': 6643008, 'steps': 34598, 'loss/train': 1.7059627771377563} 11/07/2021 02:06:57 - INFO - __main__ - Step 34600: {'lr': 0.00044249632150113806, 'samples': 6643200, 'steps': 34599, 'loss/train': 1.568487286567688} 11/07/2021 02:06:57 - INFO - __main__ - Step 34601: {'lr': 0.000442492935427757, 'samples': 6643392, 'steps': 34600, 'loss/train': 1.5505502223968506} 11/07/2021 02:06:58 - INFO - __main__ - Step 34602: {'lr': 0.00044248954926764164, 'samples': 6643584, 'steps': 34601, 'loss/train': 1.3338634967803955} 11/07/2021 02:06:59 - INFO - __main__ - Step 34603: {'lr': 0.0004424861630207935, 'samples': 6643776, 'steps': 34602, 'loss/train': 1.2154052257537842} 11/07/2021 02:06:59 - INFO - __main__ - Step 34604: {'lr': 0.00044248277668721396, 'samples': 6643968, 'steps': 34603, 'loss/train': 1.4059666395187378} 11/07/2021 02:06:59 - INFO - __main__ - Step 34605: {'lr': 0.00044247939026690475, 'samples': 6644160, 'steps': 34604, 'loss/train': 1.9255297183990479} 11/07/2021 02:07:00 - INFO - __main__ - Step 34606: {'lr': 0.0004424760037598673, 'samples': 6644352, 'steps': 34605, 'loss/train': 1.4503154754638672} 11/07/2021 02:07:01 - INFO - __main__ - Step 34607: {'lr': 0.00044247261716610307, 'samples': 6644544, 'steps': 34606, 'loss/train': 2.0782949924468994} 11/07/2021 02:07:01 - INFO - __main__ - Step 34608: {'lr': 0.0004424692304856136, 'samples': 6644736, 'steps': 34607, 'loss/train': 1.8896739482879639} 11/07/2021 02:07:02 - INFO - __main__ - Step 34609: {'lr': 0.0004424658437184006, 'samples': 6644928, 'steps': 34608, 'loss/train': 1.4949098825454712} 11/07/2021 02:07:02 - INFO - __main__ - Step 34610: {'lr': 0.0004424624568644654, 'samples': 6645120, 'steps': 34609, 'loss/train': 1.5061067342758179} 11/07/2021 02:07:02 - INFO - __main__ - Step 34611: {'lr': 0.00044245906992380955, 'samples': 6645312, 'steps': 34610, 'loss/train': 1.4784544706344604} 11/07/2021 02:07:03 - INFO - __main__ - Step 34612: {'lr': 0.0004424556828964347, 'samples': 6645504, 'steps': 34611, 'loss/train': 1.0631170272827148} 11/07/2021 02:07:04 - INFO - __main__ - Step 34613: {'lr': 0.0004424522957823422, 'samples': 6645696, 'steps': 34612, 'loss/train': 1.161199927330017} 11/07/2021 02:07:04 - INFO - __main__ - Step 34614: {'lr': 0.00044244890858153376, 'samples': 6645888, 'steps': 34613, 'loss/train': 1.1820472478866577} 11/07/2021 02:07:04 - INFO - __main__ - Step 34615: {'lr': 0.00044244552129401075, 'samples': 6646080, 'steps': 34614, 'loss/train': 1.2303963899612427} 11/07/2021 02:07:05 - INFO - __main__ - Step 34616: {'lr': 0.0004424421339197747, 'samples': 6646272, 'steps': 34615, 'loss/train': 1.4271446466445923} 11/07/2021 02:07:06 - INFO - __main__ - Step 34617: {'lr': 0.00044243874645882733, 'samples': 6646464, 'steps': 34616, 'loss/train': 1.3017340898513794} 11/07/2021 02:07:06 - INFO - __main__ - Step 34618: {'lr': 0.0004424353589111699, 'samples': 6646656, 'steps': 34617, 'loss/train': 1.511954426765442} 11/07/2021 02:07:06 - INFO - __main__ - Step 34619: {'lr': 0.0004424319712768041, 'samples': 6646848, 'steps': 34618, 'loss/train': 1.4079127311706543} 11/07/2021 02:07:07 - INFO - __main__ - Step 34620: {'lr': 0.00044242858355573143, 'samples': 6647040, 'steps': 34619, 'loss/train': 1.2601699829101562} 11/07/2021 02:07:07 - INFO - __main__ - Step 34621: {'lr': 0.00044242519574795347, 'samples': 6647232, 'steps': 34620, 'loss/train': 1.5387800931930542} 11/07/2021 02:07:08 - INFO - __main__ - Step 34622: {'lr': 0.00044242180785347164, 'samples': 6647424, 'steps': 34621, 'loss/train': 1.591779112815857} 11/07/2021 02:07:09 - INFO - __main__ - Step 34623: {'lr': 0.00044241841987228747, 'samples': 6647616, 'steps': 34622, 'loss/train': 1.5455999374389648} 11/07/2021 02:07:09 - INFO - __main__ - Step 34624: {'lr': 0.00044241503180440263, 'samples': 6647808, 'steps': 34623, 'loss/train': 1.5934284925460815} 11/07/2021 02:07:09 - INFO - __main__ - Step 34625: {'lr': 0.0004424116436498185, 'samples': 6648000, 'steps': 34624, 'loss/train': 1.0495797395706177} 11/07/2021 02:07:10 - INFO - __main__ - Step 34626: {'lr': 0.0004424082554085366, 'samples': 6648192, 'steps': 34625, 'loss/train': 1.6510077714920044} 11/07/2021 02:07:11 - INFO - __main__ - Step 34627: {'lr': 0.0004424048670805586, 'samples': 6648384, 'steps': 34626, 'loss/train': 1.0533374547958374} 11/07/2021 02:07:11 - INFO - __main__ - Step 34628: {'lr': 0.0004424014786658859, 'samples': 6648576, 'steps': 34627, 'loss/train': 1.5134165287017822} 11/07/2021 02:07:11 - INFO - __main__ - Step 34629: {'lr': 0.00044239809016452, 'samples': 6648768, 'steps': 34628, 'loss/train': 1.4800750017166138} 11/07/2021 02:07:12 - INFO - __main__ - Step 34630: {'lr': 0.00044239470157646254, 'samples': 6648960, 'steps': 34629, 'loss/train': 1.61317777633667} 11/07/2021 02:07:12 - INFO - __main__ - Step 34631: {'lr': 0.000442391312901715, 'samples': 6649152, 'steps': 34630, 'loss/train': 1.6633764505386353} 11/07/2021 02:07:12 - INFO - __main__ - Step 34632: {'lr': 0.0004423879241402788, 'samples': 6649344, 'steps': 34631, 'loss/train': 1.8914424180984497} 11/07/2021 02:07:13 - INFO - __main__ - Step 34633: {'lr': 0.00044238453529215575, 'samples': 6649536, 'steps': 34632, 'loss/train': 1.3253827095031738} 11/07/2021 02:07:14 - INFO - __main__ - Step 34634: {'lr': 0.00044238114635734713, 'samples': 6649728, 'steps': 34633, 'loss/train': 1.6357054710388184} 11/07/2021 02:07:14 - INFO - __main__ - Step 34635: {'lr': 0.0004423777573358545, 'samples': 6649920, 'steps': 34634, 'loss/train': 1.5532491207122803} 11/07/2021 02:07:15 - INFO - __main__ - Step 34636: {'lr': 0.0004423743682276794, 'samples': 6650112, 'steps': 34635, 'loss/train': 1.3229199647903442} 11/07/2021 02:07:15 - INFO - __main__ - Step 34637: {'lr': 0.0004423709790328235, 'samples': 6650304, 'steps': 34636, 'loss/train': 1.3128234148025513} 11/07/2021 02:07:16 - INFO - __main__ - Step 34638: {'lr': 0.0004423675897512881, 'samples': 6650496, 'steps': 34637, 'loss/train': 1.764772653579712} 11/07/2021 02:07:16 - INFO - __main__ - Step 34639: {'lr': 0.0004423642003830748, 'samples': 6650688, 'steps': 34638, 'loss/train': 1.3187963962554932} 11/07/2021 02:07:17 - INFO - __main__ - Step 34640: {'lr': 0.00044236081092818527, 'samples': 6650880, 'steps': 34639, 'loss/train': 1.4747387170791626} 11/07/2021 02:07:17 - INFO - __main__ - Step 34641: {'lr': 0.00044235742138662085, 'samples': 6651072, 'steps': 34640, 'loss/train': 1.697799563407898} 11/07/2021 02:07:17 - INFO - __main__ - Step 34642: {'lr': 0.0004423540317583832, 'samples': 6651264, 'steps': 34641, 'loss/train': 1.5078938007354736} 11/07/2021 02:07:18 - INFO - __main__ - Step 34643: {'lr': 0.00044235064204347377, 'samples': 6651456, 'steps': 34642, 'loss/train': 1.9758487939834595} 11/07/2021 02:07:19 - INFO - __main__ - Step 34644: {'lr': 0.0004423472522418941, 'samples': 6651648, 'steps': 34643, 'loss/train': 1.5313408374786377} 11/07/2021 02:07:19 - INFO - __main__ - Step 34645: {'lr': 0.0004423438623536457, 'samples': 6651840, 'steps': 34644, 'loss/train': 1.3848991394042969} 11/07/2021 02:07:19 - INFO - __main__ - Step 34646: {'lr': 0.0004423404723787301, 'samples': 6652032, 'steps': 34645, 'loss/train': 0.7880754470825195} 11/07/2021 02:07:20 - INFO - __main__ - Step 34647: {'lr': 0.000442337082317149, 'samples': 6652224, 'steps': 34646, 'loss/train': 1.656903624534607} 11/07/2021 02:07:21 - INFO - __main__ - Step 34648: {'lr': 0.0004423336921689036, 'samples': 6652416, 'steps': 34647, 'loss/train': 1.0038855075836182} 11/07/2021 02:07:21 - INFO - __main__ - Step 34649: {'lr': 0.0004423303019339957, 'samples': 6652608, 'steps': 34648, 'loss/train': 1.808814525604248} 11/07/2021 02:07:21 - INFO - __main__ - Step 34650: {'lr': 0.0004423269116124267, 'samples': 6652800, 'steps': 34649, 'loss/train': 1.7514744997024536} 11/07/2021 02:07:22 - INFO - __main__ - Step 34651: {'lr': 0.0004423235212041982, 'samples': 6652992, 'steps': 34650, 'loss/train': 1.5523748397827148} 11/07/2021 02:07:22 - INFO - __main__ - Step 34652: {'lr': 0.00044232013070931165, 'samples': 6653184, 'steps': 34651, 'loss/train': 0.8525782823562622} 11/07/2021 02:07:23 - INFO - __main__ - Step 34653: {'lr': 0.00044231674012776864, 'samples': 6653376, 'steps': 34652, 'loss/train': 1.537770390510559} 11/07/2021 02:07:24 - INFO - __main__ - Step 34654: {'lr': 0.0004423133494595707, 'samples': 6653568, 'steps': 34653, 'loss/train': 1.724595546722412} 11/07/2021 02:07:24 - INFO - __main__ - Step 34655: {'lr': 0.00044230995870471923, 'samples': 6653760, 'steps': 34654, 'loss/train': 2.3284170627593994} 11/07/2021 02:07:24 - INFO - __main__ - Step 34656: {'lr': 0.000442306567863216, 'samples': 6653952, 'steps': 34655, 'loss/train': 1.2631199359893799} 11/07/2021 02:07:25 - INFO - __main__ - Step 34657: {'lr': 0.00044230317693506226, 'samples': 6654144, 'steps': 34656, 'loss/train': 1.8032299280166626} 11/07/2021 02:07:25 - INFO - __main__ - Step 34658: {'lr': 0.00044229978592025975, 'samples': 6654336, 'steps': 34657, 'loss/train': 1.4368984699249268} 11/07/2021 02:07:26 - INFO - __main__ - Step 34659: {'lr': 0.00044229639481881, 'samples': 6654528, 'steps': 34658, 'loss/train': 1.7114256620407104} 11/07/2021 02:07:26 - INFO - __main__ - Step 34660: {'lr': 0.00044229300363071434, 'samples': 6654720, 'steps': 34659, 'loss/train': 1.4131542444229126} 11/07/2021 02:07:27 - INFO - __main__ - Step 34661: {'lr': 0.0004422896123559744, 'samples': 6654912, 'steps': 34660, 'loss/train': 1.4714182615280151} 11/07/2021 02:07:27 - INFO - __main__ - Step 34662: {'lr': 0.00044228622099459183, 'samples': 6655104, 'steps': 34661, 'loss/train': 1.4233191013336182} 11/07/2021 02:07:27 - INFO - __main__ - Step 34663: {'lr': 0.000442282829546568, 'samples': 6655296, 'steps': 34662, 'loss/train': 1.2954325675964355} 11/07/2021 02:07:28 - INFO - __main__ - Step 34664: {'lr': 0.00044227943801190454, 'samples': 6655488, 'steps': 34663, 'loss/train': 1.5540417432785034} 11/07/2021 02:07:29 - INFO - __main__ - Step 34665: {'lr': 0.0004422760463906029, 'samples': 6655680, 'steps': 34664, 'loss/train': 1.7930601835250854} 11/07/2021 02:07:29 - INFO - __main__ - Step 34666: {'lr': 0.00044227265468266464, 'samples': 6655872, 'steps': 34665, 'loss/train': 1.5526427030563354} 11/07/2021 02:07:30 - INFO - __main__ - Step 34667: {'lr': 0.0004422692628880913, 'samples': 6656064, 'steps': 34666, 'loss/train': 1.0020115375518799} 11/07/2021 02:07:30 - INFO - __main__ - Step 34668: {'lr': 0.00044226587100688436, 'samples': 6656256, 'steps': 34667, 'loss/train': 0.9848887324333191} 11/07/2021 02:07:31 - INFO - __main__ - Step 34669: {'lr': 0.0004422624790390454, 'samples': 6656448, 'steps': 34668, 'loss/train': 1.892004370689392} 11/07/2021 02:07:31 - INFO - __main__ - Step 34670: {'lr': 0.000442259086984576, 'samples': 6656640, 'steps': 34669, 'loss/train': 0.9863758683204651} 11/07/2021 02:07:32 - INFO - __main__ - Step 34671: {'lr': 0.00044225569484347753, 'samples': 6656832, 'steps': 34670, 'loss/train': 1.4947410821914673} 11/07/2021 02:07:32 - INFO - __main__ - Step 34672: {'lr': 0.00044225230261575165, 'samples': 6657024, 'steps': 34671, 'loss/train': 1.550666332244873} 11/07/2021 02:07:32 - INFO - __main__ - Step 34673: {'lr': 0.00044224891030139986, 'samples': 6657216, 'steps': 34672, 'loss/train': 1.230271577835083} 11/07/2021 02:07:33 - INFO - __main__ - Step 34674: {'lr': 0.0004422455179004237, 'samples': 6657408, 'steps': 34673, 'loss/train': 1.6041208505630493} 11/07/2021 02:07:34 - INFO - __main__ - Step 34675: {'lr': 0.00044224212541282463, 'samples': 6657600, 'steps': 34674, 'loss/train': 1.1184190511703491} 11/07/2021 02:07:34 - INFO - __main__ - Step 34676: {'lr': 0.0004422387328386042, 'samples': 6657792, 'steps': 34675, 'loss/train': 1.2925069332122803} 11/07/2021 02:07:34 - INFO - __main__ - Step 34677: {'lr': 0.000442235340177764, 'samples': 6657984, 'steps': 34676, 'loss/train': 1.5358885526657104} 11/07/2021 02:07:35 - INFO - __main__ - Step 34678: {'lr': 0.00044223194743030556, 'samples': 6658176, 'steps': 34677, 'loss/train': 1.7155054807662964} 11/07/2021 02:07:36 - INFO - __main__ - Step 34679: {'lr': 0.00044222855459623034, 'samples': 6658368, 'steps': 34678, 'loss/train': 1.3513246774673462} 11/07/2021 02:07:37 - INFO - __main__ - Step 34680: {'lr': 0.00044222516167553985, 'samples': 6658560, 'steps': 34679, 'loss/train': 1.089221715927124} 11/07/2021 02:07:37 - INFO - __main__ - Step 34681: {'lr': 0.0004422217686682357, 'samples': 6658752, 'steps': 34680, 'loss/train': 0.9267388582229614} 11/07/2021 02:07:37 - INFO - __main__ - Step 34682: {'lr': 0.00044221837557431945, 'samples': 6658944, 'steps': 34681, 'loss/train': 1.5414425134658813} 11/07/2021 02:07:38 - INFO - __main__ - Step 34683: {'lr': 0.00044221498239379247, 'samples': 6659136, 'steps': 34682, 'loss/train': 1.783010482788086} 11/07/2021 02:07:38 - INFO - __main__ - Step 34684: {'lr': 0.0004422115891266565, 'samples': 6659328, 'steps': 34683, 'loss/train': 1.790891408920288} 11/07/2021 02:07:38 - INFO - __main__ - Step 34685: {'lr': 0.00044220819577291283, 'samples': 6659520, 'steps': 34684, 'loss/train': 1.6551603078842163} 11/07/2021 02:07:39 - INFO - __main__ - Step 34686: {'lr': 0.00044220480233256315, 'samples': 6659712, 'steps': 34685, 'loss/train': 1.8620059490203857} 11/07/2021 02:07:40 - INFO - __main__ - Step 34687: {'lr': 0.00044220140880560897, 'samples': 6659904, 'steps': 34686, 'loss/train': 1.5913258790969849} 11/07/2021 02:07:40 - INFO - __main__ - Step 34688: {'lr': 0.0004421980151920518, 'samples': 6660096, 'steps': 34687, 'loss/train': 1.3367774486541748} 11/07/2021 02:07:40 - INFO - __main__ - Step 34689: {'lr': 0.00044219462149189313, 'samples': 6660288, 'steps': 34688, 'loss/train': 1.2430310249328613} 11/07/2021 02:07:41 - INFO - __main__ - Step 34690: {'lr': 0.0004421912277051346, 'samples': 6660480, 'steps': 34689, 'loss/train': 1.542733907699585} 11/07/2021 02:07:42 - INFO - __main__ - Step 34691: {'lr': 0.00044218783383177763, 'samples': 6660672, 'steps': 34690, 'loss/train': 2.0949127674102783} 11/07/2021 02:07:42 - INFO - __main__ - Step 34692: {'lr': 0.00044218443987182384, 'samples': 6660864, 'steps': 34691, 'loss/train': 1.4428645372390747} 11/07/2021 02:07:42 - INFO - __main__ - Step 34693: {'lr': 0.0004421810458252746, 'samples': 6661056, 'steps': 34692, 'loss/train': 1.51067054271698} 11/07/2021 02:07:43 - INFO - __main__ - Step 34694: {'lr': 0.00044217765169213166, 'samples': 6661248, 'steps': 34693, 'loss/train': 2.990875244140625} 11/07/2021 02:07:43 - INFO - __main__ - Step 34695: {'lr': 0.00044217425747239636, 'samples': 6661440, 'steps': 34694, 'loss/train': 1.3043237924575806} 11/07/2021 02:07:44 - INFO - __main__ - Step 34696: {'lr': 0.00044217086316607033, 'samples': 6661632, 'steps': 34695, 'loss/train': 0.15015660226345062} 11/07/2021 02:07:45 - INFO - __main__ - Step 34697: {'lr': 0.00044216746877315504, 'samples': 6661824, 'steps': 34696, 'loss/train': 1.921492338180542} 11/07/2021 02:07:45 - INFO - __main__ - Step 34698: {'lr': 0.0004421640742936521, 'samples': 6662016, 'steps': 34697, 'loss/train': 1.4255037307739258} 11/07/2021 02:07:45 - INFO - __main__ - Step 34699: {'lr': 0.000442160679727563, 'samples': 6662208, 'steps': 34698, 'loss/train': 0.9124966859817505} 11/07/2021 02:07:46 - INFO - __main__ - Step 34700: {'lr': 0.0004421572850748893, 'samples': 6662400, 'steps': 34699, 'loss/train': 1.3457306623458862} 11/07/2021 02:07:47 - INFO - __main__ - Step 34701: {'lr': 0.00044215389033563235, 'samples': 6662592, 'steps': 34700, 'loss/train': 0.5043824911117554} 11/07/2021 02:07:47 - INFO - __main__ - Step 34702: {'lr': 0.00044215049550979394, 'samples': 6662784, 'steps': 34701, 'loss/train': 1.148721694946289} 11/07/2021 02:07:47 - INFO - __main__ - Step 34703: {'lr': 0.0004421471005973755, 'samples': 6662976, 'steps': 34702, 'loss/train': 3.545989990234375} 11/07/2021 02:07:48 - INFO - __main__ - Step 34704: {'lr': 0.0004421437055983785, 'samples': 6663168, 'steps': 34703, 'loss/train': 1.1789891719818115} 11/07/2021 02:07:48 - INFO - __main__ - Step 34705: {'lr': 0.0004421403105128045, 'samples': 6663360, 'steps': 34704, 'loss/train': 1.4928643703460693} 11/07/2021 02:07:49 - INFO - __main__ - Step 34706: {'lr': 0.00044213691534065503, 'samples': 6663552, 'steps': 34705, 'loss/train': 1.7695435285568237} 11/07/2021 02:07:49 - INFO - __main__ - Step 34707: {'lr': 0.0004421335200819316, 'samples': 6663744, 'steps': 34706, 'loss/train': 1.4045119285583496} 11/07/2021 02:07:50 - INFO - __main__ - Step 34708: {'lr': 0.00044213012473663584, 'samples': 6663936, 'steps': 34707, 'loss/train': 1.6348592042922974} 11/07/2021 02:07:50 - INFO - __main__ - Step 34709: {'lr': 0.0004421267293047692, 'samples': 6664128, 'steps': 34708, 'loss/train': 1.248964786529541} 11/07/2021 02:07:51 - INFO - __main__ - Step 34710: {'lr': 0.0004421233337863332, 'samples': 6664320, 'steps': 34709, 'loss/train': 1.435050368309021} 11/07/2021 02:07:52 - INFO - __main__ - Step 34711: {'lr': 0.0004421199381813293, 'samples': 6664512, 'steps': 34710, 'loss/train': 0.36825597286224365} 11/07/2021 02:07:52 - INFO - __main__ - Step 34712: {'lr': 0.0004421165424897593, 'samples': 6664704, 'steps': 34711, 'loss/train': 1.7275134325027466} 11/07/2021 02:07:52 - INFO - __main__ - Step 34713: {'lr': 0.00044211314671162446, 'samples': 6664896, 'steps': 34712, 'loss/train': 1.3651692867279053} 11/07/2021 02:07:53 - INFO - __main__ - Step 34714: {'lr': 0.0004421097508469264, 'samples': 6665088, 'steps': 34713, 'loss/train': 1.607322096824646} 11/07/2021 02:07:53 - INFO - __main__ - Step 34715: {'lr': 0.0004421063548956666, 'samples': 6665280, 'steps': 34714, 'loss/train': 1.7779440879821777} 11/07/2021 02:07:53 - INFO - __main__ - Step 34716: {'lr': 0.0004421029588578468, 'samples': 6665472, 'steps': 34715, 'loss/train': 1.2604187726974487} 11/07/2021 02:07:54 - INFO - __main__ - Step 34717: {'lr': 0.00044209956273346816, 'samples': 6665664, 'steps': 34716, 'loss/train': 1.660188913345337} 11/07/2021 02:07:55 - INFO - __main__ - Step 34718: {'lr': 0.0004420961665225326, 'samples': 6665856, 'steps': 34717, 'loss/train': 1.611341118812561} 11/07/2021 02:07:55 - INFO - __main__ - Step 34719: {'lr': 0.0004420927702250414, 'samples': 6666048, 'steps': 34718, 'loss/train': 1.270687222480774} 11/07/2021 02:07:56 - INFO - __main__ - Step 34720: {'lr': 0.00044208937384099614, 'samples': 6666240, 'steps': 34719, 'loss/train': 1.3792513608932495} 11/07/2021 02:07:56 - INFO - __main__ - Step 34721: {'lr': 0.0004420859773703985, 'samples': 6666432, 'steps': 34720, 'loss/train': 1.6422091722488403} 11/07/2021 02:07:57 - INFO - __main__ - Step 34722: {'lr': 0.0004420825808132497, 'samples': 6666624, 'steps': 34721, 'loss/train': 1.6275794506072998} 11/07/2021 02:07:57 - INFO - __main__ - Step 34723: {'lr': 0.0004420791841695515, 'samples': 6666816, 'steps': 34722, 'loss/train': 1.3561608791351318} 11/07/2021 02:07:58 - INFO - __main__ - Step 34724: {'lr': 0.00044207578743930544, 'samples': 6667008, 'steps': 34723, 'loss/train': 1.690434455871582} 11/07/2021 02:07:58 - INFO - __main__ - Step 34725: {'lr': 0.00044207239062251297, 'samples': 6667200, 'steps': 34724, 'loss/train': 1.4305295944213867} 11/07/2021 02:07:58 - INFO - __main__ - Step 34726: {'lr': 0.00044206899371917563, 'samples': 6667392, 'steps': 34725, 'loss/train': 1.0719469785690308} 11/07/2021 02:07:59 - INFO - __main__ - Step 34727: {'lr': 0.00044206559672929505, 'samples': 6667584, 'steps': 34726, 'loss/train': 1.5531504154205322} 11/07/2021 02:08:00 - INFO - __main__ - Step 34728: {'lr': 0.00044206219965287253, 'samples': 6667776, 'steps': 34727, 'loss/train': 1.0178858041763306} 11/07/2021 02:08:00 - INFO - __main__ - Step 34729: {'lr': 0.0004420588024899098, 'samples': 6667968, 'steps': 34728, 'loss/train': 1.555402159690857} 11/07/2021 02:08:00 - INFO - __main__ - Step 34730: {'lr': 0.00044205540524040846, 'samples': 6668160, 'steps': 34729, 'loss/train': 1.5309548377990723} 11/07/2021 02:08:01 - INFO - __main__ - Step 34731: {'lr': 0.0004420520079043698, 'samples': 6668352, 'steps': 34730, 'loss/train': 1.6775578260421753} 11/07/2021 02:08:01 - INFO - __main__ - Step 34732: {'lr': 0.00044204861048179544, 'samples': 6668544, 'steps': 34731, 'loss/train': 1.8591980934143066} 11/07/2021 02:08:02 - INFO - __main__ - Step 34733: {'lr': 0.000442045212972687, 'samples': 6668736, 'steps': 34732, 'loss/train': 1.4219645261764526} 11/07/2021 02:08:02 - INFO - __main__ - Step 34734: {'lr': 0.00044204181537704594, 'samples': 6668928, 'steps': 34733, 'loss/train': 3.3481669425964355} 11/07/2021 02:08:03 - INFO - __main__ - Step 34735: {'lr': 0.0004420384176948738, 'samples': 6669120, 'steps': 34734, 'loss/train': 1.136333703994751} 11/07/2021 02:08:03 - INFO - __main__ - Step 34736: {'lr': 0.0004420350199261721, 'samples': 6669312, 'steps': 34735, 'loss/train': 1.2063430547714233} 11/07/2021 02:08:03 - INFO - __main__ - Step 34737: {'lr': 0.0004420316220709424, 'samples': 6669504, 'steps': 34736, 'loss/train': 1.3695883750915527} 11/07/2021 02:08:05 - INFO - __main__ - Step 34738: {'lr': 0.0004420282241291862, 'samples': 6669696, 'steps': 34737, 'loss/train': 1.309014916419983} 11/07/2021 02:08:05 - INFO - __main__ - Step 34739: {'lr': 0.0004420248261009051, 'samples': 6669888, 'steps': 34738, 'loss/train': 1.4217370748519897} 11/07/2021 02:08:05 - INFO - __main__ - Step 34740: {'lr': 0.0004420214279861005, 'samples': 6670080, 'steps': 34739, 'loss/train': 1.6708537340164185} 11/07/2021 02:08:06 - INFO - __main__ - Step 34741: {'lr': 0.000442018029784774, 'samples': 6670272, 'steps': 34740, 'loss/train': 1.6417057514190674} 11/07/2021 02:08:06 - INFO - __main__ - Step 34742: {'lr': 0.00044201463149692725, 'samples': 6670464, 'steps': 34741, 'loss/train': 0.7682157754898071} 11/07/2021 02:08:07 - INFO - __main__ - Step 34743: {'lr': 0.0004420112331225616, 'samples': 6670656, 'steps': 34742, 'loss/train': 1.6508562564849854} 11/07/2021 02:08:07 - INFO - __main__ - Step 34744: {'lr': 0.0004420078346616786, 'samples': 6670848, 'steps': 34743, 'loss/train': 1.143585205078125} 11/07/2021 02:08:08 - INFO - __main__ - Step 34745: {'lr': 0.00044200443611427985, 'samples': 6671040, 'steps': 34744, 'loss/train': 1.240492343902588} 11/07/2021 02:08:08 - INFO - __main__ - Step 34746: {'lr': 0.000442001037480367, 'samples': 6671232, 'steps': 34745, 'loss/train': 1.3379449844360352} 11/07/2021 02:08:08 - INFO - __main__ - Step 34747: {'lr': 0.0004419976387599413, 'samples': 6671424, 'steps': 34746, 'loss/train': 1.318930983543396} 11/07/2021 02:08:09 - INFO - __main__ - Step 34748: {'lr': 0.0004419942399530045, 'samples': 6671616, 'steps': 34747, 'loss/train': 1.1978917121887207} 11/07/2021 02:08:10 - INFO - __main__ - Step 34749: {'lr': 0.000441990841059558, 'samples': 6671808, 'steps': 34748, 'loss/train': 1.252805471420288} 11/07/2021 02:08:10 - INFO - __main__ - Step 34750: {'lr': 0.0004419874420796034, 'samples': 6672000, 'steps': 34749, 'loss/train': 1.8787426948547363} 11/07/2021 02:08:10 - INFO - __main__ - Step 34751: {'lr': 0.00044198404301314223, 'samples': 6672192, 'steps': 34750, 'loss/train': 1.2014528512954712} 11/07/2021 02:08:11 - INFO - __main__ - Step 34752: {'lr': 0.000441980643860176, 'samples': 6672384, 'steps': 34751, 'loss/train': 1.153993844985962} 11/07/2021 02:08:12 - INFO - __main__ - Step 34753: {'lr': 0.0004419772446207063, 'samples': 6672576, 'steps': 34752, 'loss/train': 1.0246278047561646} 11/07/2021 02:08:12 - INFO - __main__ - Step 34754: {'lr': 0.0004419738452947346, 'samples': 6672768, 'steps': 34753, 'loss/train': 1.601043462753296} 11/07/2021 02:08:13 - INFO - __main__ - Step 34755: {'lr': 0.00044197044588226245, 'samples': 6672960, 'steps': 34754, 'loss/train': 1.3129998445510864} 11/07/2021 02:08:13 - INFO - __main__ - Step 34756: {'lr': 0.00044196704638329134, 'samples': 6673152, 'steps': 34755, 'loss/train': 1.5781275033950806} 11/07/2021 02:08:13 - INFO - __main__ - Step 34757: {'lr': 0.00044196364679782284, 'samples': 6673344, 'steps': 34756, 'loss/train': 1.534688115119934} 11/07/2021 02:08:14 - INFO - __main__ - Step 34758: {'lr': 0.00044196024712585854, 'samples': 6673536, 'steps': 34757, 'loss/train': 1.2186495065689087} 11/07/2021 02:08:15 - INFO - __main__ - Step 34759: {'lr': 0.0004419568473673999, 'samples': 6673728, 'steps': 34758, 'loss/train': 1.7138926982879639} 11/07/2021 02:08:15 - INFO - __main__ - Step 34760: {'lr': 0.00044195344752244844, 'samples': 6673920, 'steps': 34759, 'loss/train': 1.3245975971221924} 11/07/2021 02:08:15 - INFO - __main__ - Step 34761: {'lr': 0.0004419500475910057, 'samples': 6674112, 'steps': 34760, 'loss/train': 2.0108070373535156} 11/07/2021 02:08:16 - INFO - __main__ - Step 34762: {'lr': 0.0004419466475730732, 'samples': 6674304, 'steps': 34761, 'loss/train': 2.0209977626800537} 11/07/2021 02:08:17 - INFO - __main__ - Step 34763: {'lr': 0.00044194324746865265, 'samples': 6674496, 'steps': 34762, 'loss/train': 0.17314231395721436} 11/07/2021 02:08:17 - INFO - __main__ - Step 34764: {'lr': 0.00044193984727774533, 'samples': 6674688, 'steps': 34763, 'loss/train': 1.089399814605713} 11/07/2021 02:08:17 - INFO - __main__ - Step 34765: {'lr': 0.0004419364470003529, 'samples': 6674880, 'steps': 34764, 'loss/train': 1.7820653915405273} 11/07/2021 02:08:18 - INFO - __main__ - Step 34766: {'lr': 0.00044193304663647684, 'samples': 6675072, 'steps': 34765, 'loss/train': 0.5651389360427856} 11/07/2021 02:08:18 - INFO - __main__ - Step 34767: {'lr': 0.00044192964618611875, 'samples': 6675264, 'steps': 34766, 'loss/train': 1.915136694908142} 11/07/2021 02:08:19 - INFO - __main__ - Step 34768: {'lr': 0.0004419262456492801, 'samples': 6675456, 'steps': 34767, 'loss/train': 1.3549784421920776} 11/07/2021 02:08:19 - INFO - __main__ - Step 34769: {'lr': 0.0004419228450259625, 'samples': 6675648, 'steps': 34768, 'loss/train': 0.9835052490234375} 11/07/2021 02:08:20 - INFO - __main__ - Step 34770: {'lr': 0.00044191944431616734, 'samples': 6675840, 'steps': 34769, 'loss/train': 1.5672987699508667} 11/07/2021 02:08:20 - INFO - __main__ - Step 34771: {'lr': 0.0004419160435198963, 'samples': 6676032, 'steps': 34770, 'loss/train': 1.1811498403549194} 11/07/2021 02:08:21 - INFO - __main__ - Step 34772: {'lr': 0.00044191264263715083, 'samples': 6676224, 'steps': 34771, 'loss/train': 1.283642053604126} 11/07/2021 02:08:21 - INFO - __main__ - Step 34773: {'lr': 0.00044190924166793245, 'samples': 6676416, 'steps': 34772, 'loss/train': 1.3295031785964966} 11/07/2021 02:08:22 - INFO - __main__ - Step 34774: {'lr': 0.00044190584061224277, 'samples': 6676608, 'steps': 34773, 'loss/train': 1.20137357711792} 11/07/2021 02:08:22 - INFO - __main__ - Step 34775: {'lr': 0.0004419024394700833, 'samples': 6676800, 'steps': 34774, 'loss/train': 0.9757902026176453} 11/07/2021 02:08:23 - INFO - __main__ - Step 34776: {'lr': 0.0004418990382414555, 'samples': 6676992, 'steps': 34775, 'loss/train': 1.6098065376281738} 11/07/2021 02:08:23 - INFO - __main__ - Step 34777: {'lr': 0.000441895636926361, 'samples': 6677184, 'steps': 34776, 'loss/train': 1.4362106323242188} 11/07/2021 02:08:23 - INFO - __main__ - Step 34778: {'lr': 0.0004418922355248013, 'samples': 6677376, 'steps': 34777, 'loss/train': 1.3258627653121948} 11/07/2021 02:08:24 - INFO - __main__ - Step 34779: {'lr': 0.00044188883403677783, 'samples': 6677568, 'steps': 34778, 'loss/train': 1.1535234451293945} 11/07/2021 02:08:25 - INFO - __main__ - Step 34780: {'lr': 0.0004418854324622923, 'samples': 6677760, 'steps': 34779, 'loss/train': 1.4015830755233765} 11/07/2021 02:08:25 - INFO - __main__ - Step 34781: {'lr': 0.0004418820308013461, 'samples': 6677952, 'steps': 34780, 'loss/train': 1.5423986911773682} 11/07/2021 02:08:25 - INFO - __main__ - Step 34782: {'lr': 0.0004418786290539408, 'samples': 6678144, 'steps': 34781, 'loss/train': 2.191751718521118} 11/07/2021 02:08:26 - INFO - __main__ - Step 34783: {'lr': 0.000441875227220078, 'samples': 6678336, 'steps': 34782, 'loss/train': 1.3227585554122925} 11/07/2021 02:08:27 - INFO - __main__ - Step 34784: {'lr': 0.00044187182529975924, 'samples': 6678528, 'steps': 34783, 'loss/train': 1.5984337329864502} 11/07/2021 02:08:27 - INFO - __main__ - Step 34785: {'lr': 0.00044186842329298594, 'samples': 6678720, 'steps': 34784, 'loss/train': 1.293143391609192} 11/07/2021 02:08:27 - INFO - __main__ - Step 34786: {'lr': 0.0004418650211997596, 'samples': 6678912, 'steps': 34785, 'loss/train': 1.5011814832687378} 11/07/2021 02:08:28 - INFO - __main__ - Step 34787: {'lr': 0.00044186161902008193, 'samples': 6679104, 'steps': 34786, 'loss/train': 0.9370352625846863} 11/07/2021 02:08:28 - INFO - __main__ - Step 34788: {'lr': 0.0004418582167539544, 'samples': 6679296, 'steps': 34787, 'loss/train': 1.3798874616622925} 11/07/2021 02:08:29 - INFO - __main__ - Step 34789: {'lr': 0.00044185481440137846, 'samples': 6679488, 'steps': 34788, 'loss/train': 1.240277886390686} 11/07/2021 02:08:30 - INFO - __main__ - Step 34790: {'lr': 0.0004418514119623557, 'samples': 6679680, 'steps': 34789, 'loss/train': 1.4599852561950684} 11/07/2021 02:08:30 - INFO - __main__ - Step 34791: {'lr': 0.00044184800943688774, 'samples': 6679872, 'steps': 34790, 'loss/train': 1.3775103092193604} 11/07/2021 02:08:30 - INFO - __main__ - Step 34792: {'lr': 0.00044184460682497595, 'samples': 6680064, 'steps': 34791, 'loss/train': 1.8663573265075684} 11/07/2021 02:08:31 - INFO - __main__ - Step 34793: {'lr': 0.00044184120412662196, 'samples': 6680256, 'steps': 34792, 'loss/train': 0.7620434761047363} 11/07/2021 02:08:31 - INFO - __main__ - Step 34794: {'lr': 0.00044183780134182725, 'samples': 6680448, 'steps': 34793, 'loss/train': 1.5115052461624146} 11/07/2021 02:08:32 - INFO - __main__ - Step 34795: {'lr': 0.0004418343984705935, 'samples': 6680640, 'steps': 34794, 'loss/train': 1.2904330492019653} 11/07/2021 02:08:33 - INFO - __main__ - Step 34796: {'lr': 0.000441830995512922, 'samples': 6680832, 'steps': 34795, 'loss/train': 1.414196491241455} 11/07/2021 02:08:33 - INFO - __main__ - Step 34797: {'lr': 0.00044182759246881446, 'samples': 6681024, 'steps': 34796, 'loss/train': 1.9183604717254639} 11/07/2021 02:08:33 - INFO - __main__ - Step 34798: {'lr': 0.0004418241893382724, 'samples': 6681216, 'steps': 34797, 'loss/train': 1.7295217514038086} 11/07/2021 02:08:34 - INFO - __main__ - Step 34799: {'lr': 0.0004418207861212973, 'samples': 6681408, 'steps': 34798, 'loss/train': 1.2074306011199951} 11/07/2021 02:08:35 - INFO - __main__ - Step 34800: {'lr': 0.0004418173828178906, 'samples': 6681600, 'steps': 34799, 'loss/train': 1.9541829824447632} 11/07/2021 02:08:35 - INFO - __main__ - Step 34801: {'lr': 0.0004418139794280541, 'samples': 6681792, 'steps': 34800, 'loss/train': 1.5093656778335571} 11/07/2021 02:08:35 - INFO - __main__ - Step 34802: {'lr': 0.0004418105759517892, 'samples': 6681984, 'steps': 34801, 'loss/train': 0.3188191056251526} 11/07/2021 02:08:36 - INFO - __main__ - Step 34803: {'lr': 0.0004418071723890973, 'samples': 6682176, 'steps': 34802, 'loss/train': 1.9970906972885132} 11/07/2021 02:08:36 - INFO - __main__ - Step 34804: {'lr': 0.0004418037687399801, 'samples': 6682368, 'steps': 34803, 'loss/train': 1.8315550088882446} 11/07/2021 02:08:37 - INFO - __main__ - Step 34805: {'lr': 0.0004418003650044391, 'samples': 6682560, 'steps': 34804, 'loss/train': 1.305469274520874} 11/07/2021 02:08:37 - INFO - __main__ - Step 34806: {'lr': 0.0004417969611824758, 'samples': 6682752, 'steps': 34805, 'loss/train': 1.107684850692749} 11/07/2021 02:08:38 - INFO - __main__ - Step 34807: {'lr': 0.00044179355727409173, 'samples': 6682944, 'steps': 34806, 'loss/train': 1.085576057434082} 11/07/2021 02:08:38 - INFO - __main__ - Step 34808: {'lr': 0.00044179015327928847, 'samples': 6683136, 'steps': 34807, 'loss/train': 1.496474266052246} 11/07/2021 02:08:38 - INFO - __main__ - Step 34809: {'lr': 0.0004417867491980675, 'samples': 6683328, 'steps': 34808, 'loss/train': 1.8663268089294434} 11/07/2021 02:08:40 - INFO - __main__ - Step 34810: {'lr': 0.0004417833450304304, 'samples': 6683520, 'steps': 34809, 'loss/train': 1.9572205543518066} 11/07/2021 02:08:40 - INFO - __main__ - Step 34811: {'lr': 0.0004417799407763786, 'samples': 6683712, 'steps': 34810, 'loss/train': 1.5946073532104492} 11/07/2021 02:08:40 - INFO - __main__ - Step 34812: {'lr': 0.00044177653643591387, 'samples': 6683904, 'steps': 34811, 'loss/train': 1.580494999885559} 11/07/2021 02:08:41 - INFO - __main__ - Step 34813: {'lr': 0.00044177313200903745, 'samples': 6684096, 'steps': 34812, 'loss/train': 0.7876717448234558} 11/07/2021 02:08:41 - INFO - __main__ - Step 34814: {'lr': 0.0004417697274957511, 'samples': 6684288, 'steps': 34813, 'loss/train': 1.619801640510559} 11/07/2021 02:08:42 - INFO - __main__ - Step 34815: {'lr': 0.0004417663228960562, 'samples': 6684480, 'steps': 34814, 'loss/train': 1.3058736324310303} 11/07/2021 02:08:42 - INFO - __main__ - Step 34816: {'lr': 0.0004417629182099545, 'samples': 6684672, 'steps': 34815, 'loss/train': 1.5034188032150269} 11/07/2021 02:08:43 - INFO - __main__ - Step 34817: {'lr': 0.00044175951343744725, 'samples': 6684864, 'steps': 34816, 'loss/train': 1.7466281652450562} 11/07/2021 02:08:43 - INFO - __main__ - Step 34818: {'lr': 0.0004417561085785362, 'samples': 6685056, 'steps': 34817, 'loss/train': 1.3654601573944092} 11/07/2021 02:08:43 - INFO - __main__ - Step 34819: {'lr': 0.0004417527036332227, 'samples': 6685248, 'steps': 34818, 'loss/train': 1.158371090888977} 11/07/2021 02:08:44 - INFO - __main__ - Step 34820: {'lr': 0.0004417492986015085, 'samples': 6685440, 'steps': 34819, 'loss/train': 1.3802626132965088} 11/07/2021 02:08:45 - INFO - __main__ - Step 34821: {'lr': 0.000441745893483395, 'samples': 6685632, 'steps': 34820, 'loss/train': 1.6779146194458008} 11/07/2021 02:08:45 - INFO - __main__ - Step 34822: {'lr': 0.00044174248827888376, 'samples': 6685824, 'steps': 34821, 'loss/train': 1.3448272943496704} 11/07/2021 02:08:45 - INFO - __main__ - Step 34823: {'lr': 0.00044173908298797627, 'samples': 6686016, 'steps': 34822, 'loss/train': 1.4924274682998657} 11/07/2021 02:08:46 - INFO - __main__ - Step 34824: {'lr': 0.0004417356776106741, 'samples': 6686208, 'steps': 34823, 'loss/train': 2.0932040214538574} 11/07/2021 02:08:47 - INFO - __main__ - Step 34825: {'lr': 0.00044173227214697885, 'samples': 6686400, 'steps': 34824, 'loss/train': 1.7518690824508667} 11/07/2021 02:08:47 - INFO - __main__ - Step 34826: {'lr': 0.000441728866596892, 'samples': 6686592, 'steps': 34825, 'loss/train': 1.52781343460083} 11/07/2021 02:08:48 - INFO - __main__ - Step 34827: {'lr': 0.00044172546096041504, 'samples': 6686784, 'steps': 34826, 'loss/train': 0.2420790195465088} 11/07/2021 02:08:48 - INFO - __main__ - Step 34828: {'lr': 0.0004417220552375496, 'samples': 6686976, 'steps': 34827, 'loss/train': 1.4486048221588135} 11/07/2021 02:08:48 - INFO - __main__ - Step 34829: {'lr': 0.00044171864942829707, 'samples': 6687168, 'steps': 34828, 'loss/train': 1.6413379907608032} 11/07/2021 02:08:49 - INFO - __main__ - Step 34830: {'lr': 0.0004417152435326591, 'samples': 6687360, 'steps': 34829, 'loss/train': 1.489789366722107} 11/07/2021 02:08:50 - INFO - __main__ - Step 34831: {'lr': 0.00044171183755063726, 'samples': 6687552, 'steps': 34830, 'loss/train': 1.3714736700057983} 11/07/2021 02:08:50 - INFO - __main__ - Step 34832: {'lr': 0.00044170843148223305, 'samples': 6687744, 'steps': 34831, 'loss/train': 1.9806476831436157} 11/07/2021 02:08:51 - INFO - __main__ - Step 34833: {'lr': 0.0004417050253274479, 'samples': 6687936, 'steps': 34832, 'loss/train': 1.3821278810501099} 11/07/2021 02:08:51 - INFO - __main__ - Step 34834: {'lr': 0.00044170161908628345, 'samples': 6688128, 'steps': 34833, 'loss/train': 1.7906062602996826} 11/07/2021 02:08:51 - INFO - __main__ - Step 34835: {'lr': 0.0004416982127587412, 'samples': 6688320, 'steps': 34834, 'loss/train': 1.3996402025222778} 11/07/2021 02:08:52 - INFO - __main__ - Step 34836: {'lr': 0.00044169480634482274, 'samples': 6688512, 'steps': 34835, 'loss/train': 1.677834391593933} 11/07/2021 02:08:53 - INFO - __main__ - Step 34837: {'lr': 0.0004416913998445294, 'samples': 6688704, 'steps': 34836, 'loss/train': 1.5204591751098633} 11/07/2021 02:08:53 - INFO - __main__ - Step 34838: {'lr': 0.000441687993257863, 'samples': 6688896, 'steps': 34837, 'loss/train': 1.1863371133804321} 11/07/2021 02:08:53 - INFO - __main__ - Step 34839: {'lr': 0.000441684586584825, 'samples': 6689088, 'steps': 34838, 'loss/train': 1.1081938743591309} 11/07/2021 02:08:54 - INFO - __main__ - Step 34840: {'lr': 0.0004416811798254168, 'samples': 6689280, 'steps': 34839, 'loss/train': 1.6745431423187256} 11/07/2021 02:08:55 - INFO - __main__ - Step 34841: {'lr': 0.00044167777297964006, 'samples': 6689472, 'steps': 34840, 'loss/train': 1.4791312217712402} 11/07/2021 02:08:55 - INFO - __main__ - Step 34842: {'lr': 0.0004416743660474962, 'samples': 6689664, 'steps': 34841, 'loss/train': 1.4962350130081177} 11/07/2021 02:08:55 - INFO - __main__ - Step 34843: {'lr': 0.0004416709590289869, 'samples': 6689856, 'steps': 34842, 'loss/train': 1.2518876791000366} 11/07/2021 02:08:56 - INFO - __main__ - Step 34844: {'lr': 0.00044166755192411364, 'samples': 6690048, 'steps': 34843, 'loss/train': 2.0981333255767822} 11/07/2021 02:08:56 - INFO - __main__ - Step 34845: {'lr': 0.00044166414473287784, 'samples': 6690240, 'steps': 34844, 'loss/train': 1.6097639799118042} 11/07/2021 02:08:57 - INFO - __main__ - Step 34846: {'lr': 0.0004416607374552812, 'samples': 6690432, 'steps': 34845, 'loss/train': 1.0811222791671753} 11/07/2021 02:08:58 - INFO - __main__ - Step 34847: {'lr': 0.00044165733009132524, 'samples': 6690624, 'steps': 34846, 'loss/train': 1.3376858234405518} 11/07/2021 02:08:58 - INFO - __main__ - Step 34848: {'lr': 0.00044165392264101136, 'samples': 6690816, 'steps': 34847, 'loss/train': 1.521743655204773} 11/07/2021 02:08:58 - INFO - __main__ - Step 34849: {'lr': 0.0004416505151043412, 'samples': 6691008, 'steps': 34848, 'loss/train': 1.796020269393921} 11/07/2021 02:08:59 - INFO - __main__ - Step 34850: {'lr': 0.0004416471074813163, 'samples': 6691200, 'steps': 34849, 'loss/train': 1.783268690109253} 11/07/2021 02:09:00 - INFO - __main__ - Step 34851: {'lr': 0.0004416436997719382, 'samples': 6691392, 'steps': 34850, 'loss/train': 1.8800702095031738} 11/07/2021 02:09:00 - INFO - __main__ - Step 34852: {'lr': 0.0004416402919762084, 'samples': 6691584, 'steps': 34851, 'loss/train': 1.96084463596344} 11/07/2021 02:09:00 - INFO - __main__ - Step 34853: {'lr': 0.00044163688409412833, 'samples': 6691776, 'steps': 34852, 'loss/train': 1.48081636428833} 11/07/2021 02:09:01 - INFO - __main__ - Step 34854: {'lr': 0.0004416334761256997, 'samples': 6691968, 'steps': 34853, 'loss/train': 1.1761119365692139} 11/07/2021 02:09:01 - INFO - __main__ - Step 34855: {'lr': 0.000441630068070924, 'samples': 6692160, 'steps': 34854, 'loss/train': 1.4678617715835571} 11/07/2021 02:09:01 - INFO - __main__ - Step 34856: {'lr': 0.0004416266599298028, 'samples': 6692352, 'steps': 34855, 'loss/train': 1.2836898565292358} 11/07/2021 02:09:03 - INFO - __main__ - Step 34857: {'lr': 0.00044162325170233745, 'samples': 6692544, 'steps': 34856, 'loss/train': 0.8834148645401001} 11/07/2021 02:09:03 - INFO - __main__ - Step 34858: {'lr': 0.00044161984338852967, 'samples': 6692736, 'steps': 34857, 'loss/train': 1.5354430675506592} 11/07/2021 02:09:03 - INFO - __main__ - Step 34859: {'lr': 0.000441616434988381, 'samples': 6692928, 'steps': 34858, 'loss/train': 1.648168921470642} 11/07/2021 02:09:04 - INFO - __main__ - Step 34860: {'lr': 0.00044161302650189295, 'samples': 6693120, 'steps': 34859, 'loss/train': 1.5468337535858154} 11/07/2021 02:09:04 - INFO - __main__ - Step 34861: {'lr': 0.00044160961792906694, 'samples': 6693312, 'steps': 34860, 'loss/train': 1.5955241918563843} 11/07/2021 02:09:05 - INFO - __main__ - Step 34862: {'lr': 0.00044160620926990456, 'samples': 6693504, 'steps': 34861, 'loss/train': 1.1796307563781738} 11/07/2021 02:09:05 - INFO - __main__ - Step 34863: {'lr': 0.0004416028005244075, 'samples': 6693696, 'steps': 34862, 'loss/train': 1.7557843923568726} 11/07/2021 02:09:06 - INFO - __main__ - Step 34864: {'lr': 0.0004415993916925771, 'samples': 6693888, 'steps': 34863, 'loss/train': 1.9630286693572998} 11/07/2021 02:09:06 - INFO - __main__ - Step 34865: {'lr': 0.000441595982774415, 'samples': 6694080, 'steps': 34864, 'loss/train': 1.3882853984832764} 11/07/2021 02:09:06 - INFO - __main__ - Step 34866: {'lr': 0.00044159257376992267, 'samples': 6694272, 'steps': 34865, 'loss/train': 1.6050423383712769} 11/07/2021 02:09:07 - INFO - __main__ - Step 34867: {'lr': 0.0004415891646791017, 'samples': 6694464, 'steps': 34866, 'loss/train': 1.2810680866241455} 11/07/2021 02:09:08 - INFO - __main__ - Step 34868: {'lr': 0.0004415857555019536, 'samples': 6694656, 'steps': 34867, 'loss/train': 1.513373851776123} 11/07/2021 02:09:08 - INFO - __main__ - Step 34869: {'lr': 0.00044158234623847993, 'samples': 6694848, 'steps': 34868, 'loss/train': 1.4827241897583008} 11/07/2021 02:09:08 - INFO - __main__ - Step 34870: {'lr': 0.00044157893688868223, 'samples': 6695040, 'steps': 34869, 'loss/train': 1.635990023612976} 11/07/2021 02:09:09 - INFO - __main__ - Step 34871: {'lr': 0.00044157552745256203, 'samples': 6695232, 'steps': 34870, 'loss/train': 0.9106978178024292} 11/07/2021 02:09:09 - INFO - __main__ - Step 34872: {'lr': 0.0004415721179301208, 'samples': 6695424, 'steps': 34871, 'loss/train': 1.7310855388641357} 11/07/2021 02:09:10 - INFO - __main__ - Step 34873: {'lr': 0.00044156870832136015, 'samples': 6695616, 'steps': 34872, 'loss/train': 1.2648017406463623} 11/07/2021 02:09:11 - INFO - __main__ - Step 34874: {'lr': 0.00044156529862628157, 'samples': 6695808, 'steps': 34873, 'loss/train': 1.3931591510772705} 11/07/2021 02:09:11 - INFO - __main__ - Step 34875: {'lr': 0.00044156188884488667, 'samples': 6696000, 'steps': 34874, 'loss/train': 1.921125054359436} 11/07/2021 02:09:11 - INFO - __main__ - Step 34876: {'lr': 0.0004415584789771769, 'samples': 6696192, 'steps': 34875, 'loss/train': 1.4276232719421387} 11/07/2021 02:09:12 - INFO - __main__ - Step 34877: {'lr': 0.0004415550690231539, 'samples': 6696384, 'steps': 34876, 'loss/train': 1.5939267873764038} 11/07/2021 02:09:13 - INFO - __main__ - Step 34878: {'lr': 0.0004415516589828191, 'samples': 6696576, 'steps': 34877, 'loss/train': 1.5690876245498657} 11/07/2021 02:09:13 - INFO - __main__ - Step 34879: {'lr': 0.00044154824885617405, 'samples': 6696768, 'steps': 34878, 'loss/train': 1.8448790311813354} 11/07/2021 02:09:13 - INFO - __main__ - Step 34880: {'lr': 0.0004415448386432204, 'samples': 6696960, 'steps': 34879, 'loss/train': 1.4996871948242188} 11/07/2021 02:09:14 - INFO - __main__ - Step 34881: {'lr': 0.00044154142834395947, 'samples': 6697152, 'steps': 34880, 'loss/train': 1.8461229801177979} 11/07/2021 02:09:14 - INFO - __main__ - Step 34882: {'lr': 0.00044153801795839296, 'samples': 6697344, 'steps': 34881, 'loss/train': 1.3578943014144897} 11/07/2021 02:09:15 - INFO - __main__ - Step 34883: {'lr': 0.00044153460748652245, 'samples': 6697536, 'steps': 34882, 'loss/train': 0.4732913076877594} 11/07/2021 02:09:15 - INFO - __main__ - Step 34884: {'lr': 0.00044153119692834944, 'samples': 6697728, 'steps': 34883, 'loss/train': 1.1813597679138184} 11/07/2021 02:09:16 - INFO - __main__ - Step 34885: {'lr': 0.0004415277862838753, 'samples': 6697920, 'steps': 34884, 'loss/train': 1.055881142616272} 11/07/2021 02:09:16 - INFO - __main__ - Step 34886: {'lr': 0.00044152437555310174, 'samples': 6698112, 'steps': 34885, 'loss/train': 1.097977638244629} 11/07/2021 02:09:17 - INFO - __main__ - Step 34887: {'lr': 0.00044152096473603025, 'samples': 6698304, 'steps': 34886, 'loss/train': 1.558925986289978} 11/07/2021 02:09:18 - INFO - __main__ - Step 34888: {'lr': 0.00044151755383266234, 'samples': 6698496, 'steps': 34887, 'loss/train': 1.0544556379318237} 11/07/2021 02:09:18 - INFO - __main__ - Step 34889: {'lr': 0.0004415141428429997, 'samples': 6698688, 'steps': 34888, 'loss/train': 1.5570998191833496} 11/07/2021 02:09:18 - INFO - __main__ - Step 34890: {'lr': 0.0004415107317670436, 'samples': 6698880, 'steps': 34889, 'loss/train': 1.7016712427139282} 11/07/2021 02:09:19 - INFO - __main__ - Step 34891: {'lr': 0.0004415073206047958, 'samples': 6699072, 'steps': 34890, 'loss/train': 1.6461631059646606} 11/07/2021 02:09:19 - INFO - __main__ - Step 34892: {'lr': 0.0004415039093562577, 'samples': 6699264, 'steps': 34891, 'loss/train': 1.7420457601547241} 11/07/2021 02:09:20 - INFO - __main__ - Step 34893: {'lr': 0.00044150049802143095, 'samples': 6699456, 'steps': 34892, 'loss/train': 1.101386547088623} 11/07/2021 02:09:20 - INFO - __main__ - Step 34894: {'lr': 0.00044149708660031704, 'samples': 6699648, 'steps': 34893, 'loss/train': 1.4222402572631836} 11/07/2021 02:09:21 - INFO - __main__ - Step 34895: {'lr': 0.0004414936750929174, 'samples': 6699840, 'steps': 34894, 'loss/train': 1.5587239265441895} 11/07/2021 02:09:21 - INFO - __main__ - Step 34896: {'lr': 0.0004414902634992338, 'samples': 6700032, 'steps': 34895, 'loss/train': 1.662794589996338} 11/07/2021 02:09:21 - INFO - __main__ - Step 34897: {'lr': 0.0004414868518192675, 'samples': 6700224, 'steps': 34896, 'loss/train': 1.5693986415863037} 11/07/2021 02:09:22 - INFO - __main__ - Step 34898: {'lr': 0.0004414834400530203, 'samples': 6700416, 'steps': 34897, 'loss/train': 1.4663927555084229} 11/07/2021 02:09:23 - INFO - __main__ - Step 34899: {'lr': 0.00044148002820049354, 'samples': 6700608, 'steps': 34898, 'loss/train': 1.5058009624481201} 11/07/2021 02:09:23 - INFO - __main__ - Step 34900: {'lr': 0.00044147661626168887, 'samples': 6700800, 'steps': 34899, 'loss/train': 1.8423506021499634} 11/07/2021 02:09:24 - INFO - __main__ - Step 34901: {'lr': 0.0004414732042366078, 'samples': 6700992, 'steps': 34900, 'loss/train': 1.5679491758346558} 11/07/2021 02:09:24 - INFO - __main__ - Step 34902: {'lr': 0.00044146979212525184, 'samples': 6701184, 'steps': 34901, 'loss/train': 1.720201015472412} 11/07/2021 02:09:25 - INFO - __main__ - Step 34903: {'lr': 0.0004414663799276225, 'samples': 6701376, 'steps': 34902, 'loss/train': 1.7476109266281128} 11/07/2021 02:09:25 - INFO - __main__ - Step 34904: {'lr': 0.0004414629676437214, 'samples': 6701568, 'steps': 34903, 'loss/train': 1.56584632396698} 11/07/2021 02:09:26 - INFO - __main__ - Step 34905: {'lr': 0.00044145955527355007, 'samples': 6701760, 'steps': 34904, 'loss/train': 1.7871931791305542} 11/07/2021 02:09:26 - INFO - __main__ - Step 34906: {'lr': 0.00044145614281711, 'samples': 6701952, 'steps': 34905, 'loss/train': 1.3921641111373901} 11/07/2021 02:09:26 - INFO - __main__ - Step 34907: {'lr': 0.00044145273027440275, 'samples': 6702144, 'steps': 34906, 'loss/train': 0.9424101710319519} 11/07/2021 02:09:27 - INFO - __main__ - Step 34908: {'lr': 0.0004414493176454298, 'samples': 6702336, 'steps': 34907, 'loss/train': 1.542037844657898} 11/07/2021 02:09:28 - INFO - __main__ - Step 34909: {'lr': 0.0004414459049301929, 'samples': 6702528, 'steps': 34908, 'loss/train': 2.1305997371673584} 11/07/2021 02:09:28 - INFO - __main__ - Step 34910: {'lr': 0.00044144249212869327, 'samples': 6702720, 'steps': 34909, 'loss/train': 1.6564462184906006} 11/07/2021 02:09:28 - INFO - __main__ - Step 34911: {'lr': 0.0004414390792409326, 'samples': 6702912, 'steps': 34910, 'loss/train': 1.1971884965896606} 11/07/2021 02:09:29 - INFO - __main__ - Step 34912: {'lr': 0.0004414356662669126, 'samples': 6703104, 'steps': 34911, 'loss/train': 1.6676355600357056} 11/07/2021 02:09:29 - INFO - __main__ - Step 34913: {'lr': 0.0004414322532066345, 'samples': 6703296, 'steps': 34912, 'loss/train': 1.4663547277450562} 11/07/2021 02:09:30 - INFO - __main__ - Step 34914: {'lr': 0.0004414288400601, 'samples': 6703488, 'steps': 34913, 'loss/train': 1.660070538520813} 11/07/2021 02:09:30 - INFO - __main__ - Step 34915: {'lr': 0.0004414254268273107, 'samples': 6703680, 'steps': 34914, 'loss/train': 1.2348486185073853} 11/07/2021 02:09:31 - INFO - __main__ - Step 34916: {'lr': 0.0004414220135082679, 'samples': 6703872, 'steps': 34915, 'loss/train': 1.82661771774292} 11/07/2021 02:09:31 - INFO - __main__ - Step 34917: {'lr': 0.0004414186001029734, 'samples': 6704064, 'steps': 34916, 'loss/train': 0.8421643376350403} 11/07/2021 02:09:32 - INFO - __main__ - Step 34918: {'lr': 0.00044141518661142864, 'samples': 6704256, 'steps': 34917, 'loss/train': 1.6239686012268066} 11/07/2021 02:09:32 - INFO - __main__ - Step 34919: {'lr': 0.0004414117730336351, 'samples': 6704448, 'steps': 34918, 'loss/train': 1.4219346046447754} 11/07/2021 02:09:33 - INFO - __main__ - Step 34920: {'lr': 0.0004414083593695944, 'samples': 6704640, 'steps': 34919, 'loss/train': 1.5815865993499756} 11/07/2021 02:09:33 - INFO - __main__ - Step 34921: {'lr': 0.0004414049456193081, 'samples': 6704832, 'steps': 34920, 'loss/train': 1.4162994623184204} 11/07/2021 02:09:34 - INFO - __main__ - Step 34922: {'lr': 0.00044140153178277765, 'samples': 6705024, 'steps': 34921, 'loss/train': 1.4170336723327637} 11/07/2021 02:09:34 - INFO - __main__ - Step 34923: {'lr': 0.0004413981178600046, 'samples': 6705216, 'steps': 34922, 'loss/train': 1.5527942180633545} 11/07/2021 02:09:34 - INFO - __main__ - Step 34924: {'lr': 0.00044139470385099047, 'samples': 6705408, 'steps': 34923, 'loss/train': 1.1438456773757935} 11/07/2021 02:09:35 - INFO - __main__ - Step 34925: {'lr': 0.0004413912897557369, 'samples': 6705600, 'steps': 34924, 'loss/train': 1.6169018745422363} 11/07/2021 02:09:36 - INFO - __main__ - Step 34926: {'lr': 0.0004413878755742454, 'samples': 6705792, 'steps': 34925, 'loss/train': 1.356520414352417} 11/07/2021 02:09:36 - INFO - __main__ - Step 34927: {'lr': 0.00044138446130651736, 'samples': 6705984, 'steps': 34926, 'loss/train': 1.3168401718139648} 11/07/2021 02:09:36 - INFO - __main__ - Step 34928: {'lr': 0.00044138104695255455, 'samples': 6706176, 'steps': 34927, 'loss/train': 1.990412712097168} 11/07/2021 02:09:37 - INFO - __main__ - Step 34929: {'lr': 0.00044137763251235837, 'samples': 6706368, 'steps': 34928, 'loss/train': 1.5698679685592651} 11/07/2021 02:09:38 - INFO - __main__ - Step 34930: {'lr': 0.0004413742179859304, 'samples': 6706560, 'steps': 34929, 'loss/train': 1.8614038228988647} 11/07/2021 02:09:38 - INFO - __main__ - Step 34931: {'lr': 0.00044137080337327205, 'samples': 6706752, 'steps': 34930, 'loss/train': 1.3010913133621216} 11/07/2021 02:09:39 - INFO - __main__ - Step 34932: {'lr': 0.000441367388674385, 'samples': 6706944, 'steps': 34931, 'loss/train': 1.3092995882034302} 11/07/2021 02:09:39 - INFO - __main__ - Step 34933: {'lr': 0.00044136397388927083, 'samples': 6707136, 'steps': 34932, 'loss/train': 2.328650951385498} 11/07/2021 02:09:39 - INFO - __main__ - Step 34934: {'lr': 0.000441360559017931, 'samples': 6707328, 'steps': 34933, 'loss/train': 1.8487857580184937} 11/07/2021 02:09:40 - INFO - __main__ - Step 34935: {'lr': 0.00044135714406036696, 'samples': 6707520, 'steps': 34934, 'loss/train': 1.416915774345398} 11/07/2021 02:09:41 - INFO - __main__ - Step 34936: {'lr': 0.00044135372901658046, 'samples': 6707712, 'steps': 34935, 'loss/train': 1.6198914051055908} 11/07/2021 02:09:41 - INFO - __main__ - Step 34937: {'lr': 0.0004413503138865729, 'samples': 6707904, 'steps': 34936, 'loss/train': 1.4788812398910522} 11/07/2021 02:09:41 - INFO - __main__ - Step 34938: {'lr': 0.00044134689867034583, 'samples': 6708096, 'steps': 34937, 'loss/train': 1.63387131690979} 11/07/2021 02:09:42 - INFO - __main__ - Step 34939: {'lr': 0.00044134348336790074, 'samples': 6708288, 'steps': 34938, 'loss/train': 1.6570875644683838} 11/07/2021 02:09:43 - INFO - __main__ - Step 34940: {'lr': 0.0004413400679792393, 'samples': 6708480, 'steps': 34939, 'loss/train': 1.1660223007202148} 11/07/2021 02:09:43 - INFO - __main__ - Step 34941: {'lr': 0.00044133665250436295, 'samples': 6708672, 'steps': 34940, 'loss/train': 1.474959135055542} 11/07/2021 02:09:44 - INFO - __main__ - Step 34942: {'lr': 0.00044133323694327324, 'samples': 6708864, 'steps': 34941, 'loss/train': 1.4280176162719727} 11/07/2021 02:09:44 - INFO - __main__ - Step 34943: {'lr': 0.0004413298212959718, 'samples': 6709056, 'steps': 34942, 'loss/train': 1.8201326131820679} 11/07/2021 02:09:44 - INFO - __main__ - Step 34944: {'lr': 0.00044132640556246, 'samples': 6709248, 'steps': 34943, 'loss/train': 1.5002176761627197} 11/07/2021 02:09:45 - INFO - __main__ - Step 34945: {'lr': 0.00044132298974273955, 'samples': 6709440, 'steps': 34944, 'loss/train': 1.5995829105377197} 11/07/2021 02:09:46 - INFO - __main__ - Step 34946: {'lr': 0.00044131957383681186, 'samples': 6709632, 'steps': 34945, 'loss/train': 1.2983976602554321} 11/07/2021 02:09:46 - INFO - __main__ - Step 34947: {'lr': 0.0004413161578446785, 'samples': 6709824, 'steps': 34946, 'loss/train': 1.9769909381866455} 11/07/2021 02:09:46 - INFO - __main__ - Step 34948: {'lr': 0.00044131274176634113, 'samples': 6710016, 'steps': 34947, 'loss/train': 1.4638006687164307} 11/07/2021 02:09:47 - INFO - __main__ - Step 34949: {'lr': 0.00044130932560180114, 'samples': 6710208, 'steps': 34948, 'loss/train': 1.0998560190200806} 11/07/2021 02:09:48 - INFO - __main__ - Step 34950: {'lr': 0.0004413059093510601, 'samples': 6710400, 'steps': 34949, 'loss/train': 1.550802230834961} 11/07/2021 02:09:48 - INFO - __main__ - Step 34951: {'lr': 0.00044130249301411957, 'samples': 6710592, 'steps': 34950, 'loss/train': 1.3702425956726074} 11/07/2021 02:09:48 - INFO - __main__ - Step 34952: {'lr': 0.0004412990765909811, 'samples': 6710784, 'steps': 34951, 'loss/train': 1.2934271097183228} 11/07/2021 02:09:49 - INFO - __main__ - Step 34953: {'lr': 0.0004412956600816462, 'samples': 6710976, 'steps': 34952, 'loss/train': 1.6466397047042847} 11/07/2021 02:09:49 - INFO - __main__ - Step 34954: {'lr': 0.00044129224348611644, 'samples': 6711168, 'steps': 34953, 'loss/train': 1.0837256908416748} 11/07/2021 02:09:49 - INFO - __main__ - Step 34955: {'lr': 0.0004412888268043934, 'samples': 6711360, 'steps': 34954, 'loss/train': 1.3979381322860718} 11/07/2021 02:09:50 - INFO - __main__ - Step 34956: {'lr': 0.0004412854100364785, 'samples': 6711552, 'steps': 34955, 'loss/train': 1.2368285655975342} 11/07/2021 02:09:51 - INFO - __main__ - Step 34957: {'lr': 0.0004412819931823734, 'samples': 6711744, 'steps': 34956, 'loss/train': 1.5563851594924927} 11/07/2021 02:09:51 - INFO - __main__ - Step 34958: {'lr': 0.0004412785762420795, 'samples': 6711936, 'steps': 34957, 'loss/train': 1.4423242807388306} 11/07/2021 02:09:51 - INFO - __main__ - Step 34959: {'lr': 0.0004412751592155985, 'samples': 6712128, 'steps': 34958, 'loss/train': 1.4660046100616455} 11/07/2021 02:09:52 - INFO - __main__ - Step 34960: {'lr': 0.00044127174210293186, 'samples': 6712320, 'steps': 34959, 'loss/train': 1.6709436178207397} 11/07/2021 02:09:53 - INFO - __main__ - Step 34961: {'lr': 0.0004412683249040811, 'samples': 6712512, 'steps': 34960, 'loss/train': 5.549959182739258} 11/07/2021 02:09:53 - INFO - __main__ - Step 34962: {'lr': 0.0004412649076190478, 'samples': 6712704, 'steps': 34961, 'loss/train': 1.7965469360351562} 11/07/2021 02:09:54 - INFO - __main__ - Step 34963: {'lr': 0.00044126149024783346, 'samples': 6712896, 'steps': 34962, 'loss/train': 1.778951644897461} 11/07/2021 02:09:54 - INFO - __main__ - Step 34964: {'lr': 0.0004412580727904396, 'samples': 6713088, 'steps': 34963, 'loss/train': 1.7027897834777832} 11/07/2021 02:09:54 - INFO - __main__ - Step 34965: {'lr': 0.0004412546552468679, 'samples': 6713280, 'steps': 34964, 'loss/train': 1.5230058431625366} 11/07/2021 02:09:55 - INFO - __main__ - Step 34966: {'lr': 0.00044125123761711975, 'samples': 6713472, 'steps': 34965, 'loss/train': 1.532887578010559} 11/07/2021 02:09:56 - INFO - __main__ - Step 34967: {'lr': 0.00044124781990119677, 'samples': 6713664, 'steps': 34966, 'loss/train': 1.342242956161499} 11/07/2021 02:09:56 - INFO - __main__ - Step 34968: {'lr': 0.0004412444020991004, 'samples': 6713856, 'steps': 34967, 'loss/train': 1.6180024147033691} 11/07/2021 02:09:56 - INFO - __main__ - Step 34969: {'lr': 0.0004412409842108324, 'samples': 6714048, 'steps': 34968, 'loss/train': 1.4326096773147583} 11/07/2021 02:09:57 - INFO - __main__ - Step 34970: {'lr': 0.0004412375662363941, 'samples': 6714240, 'steps': 34969, 'loss/train': 1.4714924097061157} 11/07/2021 02:09:57 - INFO - __main__ - Step 34971: {'lr': 0.00044123414817578705, 'samples': 6714432, 'steps': 34970, 'loss/train': 1.4865937232971191} 11/07/2021 02:09:58 - INFO - __main__ - Step 34972: {'lr': 0.00044123073002901286, 'samples': 6714624, 'steps': 34971, 'loss/train': 0.8073616027832031} 11/07/2021 02:09:58 - INFO - __main__ - Step 34973: {'lr': 0.0004412273117960731, 'samples': 6714816, 'steps': 34972, 'loss/train': 1.7737014293670654} 11/07/2021 02:09:59 - INFO - __main__ - Step 34974: {'lr': 0.00044122389347696925, 'samples': 6715008, 'steps': 34973, 'loss/train': 2.0137529373168945} 11/07/2021 02:09:59 - INFO - __main__ - Step 34975: {'lr': 0.0004412204750717028, 'samples': 6715200, 'steps': 34974, 'loss/train': 0.3884543478488922} 11/07/2021 02:10:00 - INFO - __main__ - Step 34976: {'lr': 0.00044121705658027545, 'samples': 6715392, 'steps': 34975, 'loss/train': 1.497333288192749} 11/07/2021 02:10:01 - INFO - __main__ - Step 34977: {'lr': 0.00044121363800268853, 'samples': 6715584, 'steps': 34976, 'loss/train': 1.0845884084701538} 11/07/2021 02:10:01 - INFO - __main__ - Step 34978: {'lr': 0.0004412102193389438, 'samples': 6715776, 'steps': 34977, 'loss/train': 1.5789704322814941} 11/07/2021 02:10:01 - INFO - __main__ - Step 34979: {'lr': 0.0004412068005890427, 'samples': 6715968, 'steps': 34978, 'loss/train': 1.7439966201782227} 11/07/2021 02:10:02 - INFO - __main__ - Step 34980: {'lr': 0.0004412033817529867, 'samples': 6716160, 'steps': 34979, 'loss/train': 1.5079094171524048} 11/07/2021 02:10:02 - INFO - __main__ - Step 34981: {'lr': 0.0004411999628307775, 'samples': 6716352, 'steps': 34980, 'loss/train': 1.6859904527664185} 11/07/2021 02:10:03 - INFO - __main__ - Step 34982: {'lr': 0.0004411965438224164, 'samples': 6716544, 'steps': 34981, 'loss/train': 1.6609724760055542} 11/07/2021 02:10:03 - INFO - __main__ - Step 34983: {'lr': 0.0004411931247279052, 'samples': 6716736, 'steps': 34982, 'loss/train': 1.7958378791809082} 11/07/2021 02:10:04 - INFO - __main__ - Step 34984: {'lr': 0.00044118970554724523, 'samples': 6716928, 'steps': 34983, 'loss/train': 1.3812191486358643} 11/07/2021 02:10:04 - INFO - __main__ - Step 34985: {'lr': 0.0004411862862804382, 'samples': 6717120, 'steps': 34984, 'loss/train': 1.4085384607315063} 11/07/2021 02:10:04 - INFO - __main__ - Step 34986: {'lr': 0.0004411828669274856, 'samples': 6717312, 'steps': 34985, 'loss/train': 1.5094749927520752} 11/07/2021 02:10:05 - INFO - __main__ - Step 34987: {'lr': 0.0004411794474883889, 'samples': 6717504, 'steps': 34986, 'loss/train': 1.621686577796936} 11/07/2021 02:10:06 - INFO - __main__ - Step 34988: {'lr': 0.0004411760279631497, 'samples': 6717696, 'steps': 34987, 'loss/train': 1.5280160903930664} 11/07/2021 02:10:06 - INFO - __main__ - Step 34989: {'lr': 0.0004411726083517696, 'samples': 6717888, 'steps': 34988, 'loss/train': 1.2565999031066895} 11/07/2021 02:10:06 - INFO - __main__ - Step 34990: {'lr': 0.00044116918865425004, 'samples': 6718080, 'steps': 34989, 'loss/train': 1.393542766571045} 11/07/2021 02:10:07 - INFO - __main__ - Step 34991: {'lr': 0.00044116576887059255, 'samples': 6718272, 'steps': 34990, 'loss/train': 1.5766887664794922} 11/07/2021 02:10:07 - INFO - __main__ - Step 34992: {'lr': 0.0004411623490007988, 'samples': 6718464, 'steps': 34991, 'loss/train': 1.8799902200698853} 11/07/2021 02:10:08 - INFO - __main__ - Step 34993: {'lr': 0.0004411589290448701, 'samples': 6718656, 'steps': 34992, 'loss/train': 1.3170068264007568} 11/07/2021 02:10:09 - INFO - __main__ - Step 34994: {'lr': 0.0004411555090028082, 'samples': 6718848, 'steps': 34993, 'loss/train': 1.787194848060608} 11/07/2021 02:10:09 - INFO - __main__ - Step 34995: {'lr': 0.00044115208887461464, 'samples': 6719040, 'steps': 34994, 'loss/train': 1.9786200523376465} 11/07/2021 02:10:09 - INFO - __main__ - Step 34996: {'lr': 0.00044114866866029086, 'samples': 6719232, 'steps': 34995, 'loss/train': 1.6246813535690308} 11/07/2021 02:10:10 - INFO - __main__ - Step 34997: {'lr': 0.00044114524835983844, 'samples': 6719424, 'steps': 34996, 'loss/train': 1.6504180431365967} 11/07/2021 02:10:11 - INFO - __main__ - Step 34998: {'lr': 0.00044114182797325884, 'samples': 6719616, 'steps': 34997, 'loss/train': 1.5485693216323853} 11/07/2021 02:10:11 - INFO - __main__ - Step 34999: {'lr': 0.0004411384075005538, 'samples': 6719808, 'steps': 34998, 'loss/train': 1.5711191892623901} 11/07/2021 02:10:11 - INFO - __main__ - Step 35000: {'lr': 0.0004411349869417247, 'samples': 6720000, 'steps': 34999, 'loss/train': 1.5556280612945557} 11/07/2021 02:10:12 - INFO - __main__ - Step 35001: {'lr': 0.00044113156629677313, 'samples': 6720192, 'steps': 35000, 'loss/train': 1.2026489973068237} 11/07/2021 02:10:12 - INFO - __main__ - Step 35002: {'lr': 0.00044112814556570066, 'samples': 6720384, 'steps': 35001, 'loss/train': 1.5875060558319092} 11/07/2021 02:10:13 - INFO - __main__ - Step 35003: {'lr': 0.00044112472474850875, 'samples': 6720576, 'steps': 35002, 'loss/train': 1.0374408960342407} 11/07/2021 02:10:14 - INFO - __main__ - Step 35004: {'lr': 0.000441121303845199, 'samples': 6720768, 'steps': 35003, 'loss/train': 1.64098060131073} 11/07/2021 02:10:14 - INFO - __main__ - Step 35005: {'lr': 0.0004411178828557729, 'samples': 6720960, 'steps': 35004, 'loss/train': 1.4752988815307617} 11/07/2021 02:10:14 - INFO - __main__ - Step 35006: {'lr': 0.00044111446178023205, 'samples': 6721152, 'steps': 35005, 'loss/train': 1.3469408750534058} 11/07/2021 02:10:15 - INFO - __main__ - Step 35007: {'lr': 0.000441111040618578, 'samples': 6721344, 'steps': 35006, 'loss/train': 1.4651458263397217} 11/07/2021 02:10:16 - INFO - __main__ - Step 35008: {'lr': 0.0004411076193708122, 'samples': 6721536, 'steps': 35007, 'loss/train': 1.4971396923065186} 11/07/2021 02:10:16 - INFO - __main__ - Step 35009: {'lr': 0.00044110419803693635, 'samples': 6721728, 'steps': 35008, 'loss/train': 0.8315962553024292} 11/07/2021 02:10:16 - INFO - __main__ - Step 35010: {'lr': 0.00044110077661695194, 'samples': 6721920, 'steps': 35009, 'loss/train': 1.402260661125183} 11/07/2021 02:10:17 - INFO - __main__ - Step 35011: {'lr': 0.00044109735511086036, 'samples': 6722112, 'steps': 35010, 'loss/train': 1.282967448234558} 11/07/2021 02:10:17 - INFO - __main__ - Step 35012: {'lr': 0.00044109393351866324, 'samples': 6722304, 'steps': 35011, 'loss/train': 1.2022982835769653} 11/07/2021 02:10:18 - INFO - __main__ - Step 35013: {'lr': 0.0004410905118403622, 'samples': 6722496, 'steps': 35012, 'loss/train': 1.6865010261535645} 11/07/2021 02:10:18 - INFO - __main__ - Step 35014: {'lr': 0.0004410870900759587, 'samples': 6722688, 'steps': 35013, 'loss/train': 1.4623432159423828} 11/07/2021 02:10:19 - INFO - __main__ - Step 35015: {'lr': 0.0004410836682254543, 'samples': 6722880, 'steps': 35014, 'loss/train': 0.9109211564064026} 11/07/2021 02:10:19 - INFO - __main__ - Step 35016: {'lr': 0.0004410802462888506, 'samples': 6723072, 'steps': 35015, 'loss/train': 0.7895492911338806} 11/07/2021 02:10:19 - INFO - __main__ - Step 35017: {'lr': 0.00044107682426614903, 'samples': 6723264, 'steps': 35016, 'loss/train': 1.300127387046814} 11/07/2021 02:10:20 - INFO - __main__ - Step 35018: {'lr': 0.00044107340215735125, 'samples': 6723456, 'steps': 35017, 'loss/train': 1.3238352537155151} 11/07/2021 02:10:21 - INFO - __main__ - Step 35019: {'lr': 0.00044106997996245866, 'samples': 6723648, 'steps': 35018, 'loss/train': 2.105529546737671} 11/07/2021 02:10:21 - INFO - __main__ - Step 35020: {'lr': 0.000441066557681473, 'samples': 6723840, 'steps': 35019, 'loss/train': 1.2819880247116089} 11/07/2021 02:10:22 - INFO - __main__ - Step 35021: {'lr': 0.00044106313531439565, 'samples': 6724032, 'steps': 35020, 'loss/train': 1.5403814315795898} 11/07/2021 02:10:22 - INFO - __main__ - Step 35022: {'lr': 0.00044105971286122816, 'samples': 6724224, 'steps': 35021, 'loss/train': 1.7440218925476074} 11/07/2021 02:10:22 - INFO - __main__ - Step 35023: {'lr': 0.00044105629032197214, 'samples': 6724416, 'steps': 35022, 'loss/train': 1.8121604919433594} 11/07/2021 02:10:23 - INFO - __main__ - Step 35024: {'lr': 0.0004410528676966291, 'samples': 6724608, 'steps': 35023, 'loss/train': 1.0739877223968506} 11/07/2021 02:10:24 - INFO - __main__ - Step 35025: {'lr': 0.00044104944498520054, 'samples': 6724800, 'steps': 35024, 'loss/train': 1.7789803743362427} 11/07/2021 02:10:24 - INFO - __main__ - Step 35026: {'lr': 0.00044104602218768805, 'samples': 6724992, 'steps': 35025, 'loss/train': 1.231187105178833} 11/07/2021 02:10:24 - INFO - __main__ - Step 35027: {'lr': 0.0004410425993040933, 'samples': 6725184, 'steps': 35026, 'loss/train': 1.0795031785964966} 11/07/2021 02:10:25 - INFO - __main__ - Step 35028: {'lr': 0.0004410391763344176, 'samples': 6725376, 'steps': 35027, 'loss/train': 1.4300146102905273} 11/07/2021 02:10:26 - INFO - __main__ - Step 35029: {'lr': 0.00044103575327866264, 'samples': 6725568, 'steps': 35028, 'loss/train': 1.447784185409546} 11/07/2021 02:10:26 - INFO - __main__ - Step 35030: {'lr': 0.0004410323301368299, 'samples': 6725760, 'steps': 35029, 'loss/train': 1.640128254890442} 11/07/2021 02:10:26 - INFO - __main__ - Step 35031: {'lr': 0.0004410289069089209, 'samples': 6725952, 'steps': 35030, 'loss/train': 1.679473638534546} 11/07/2021 02:10:27 - INFO - __main__ - Step 35032: {'lr': 0.0004410254835949372, 'samples': 6726144, 'steps': 35031, 'loss/train': 2.850522041320801} 11/07/2021 02:10:27 - INFO - __main__ - Step 35033: {'lr': 0.00044102206019488045, 'samples': 6726336, 'steps': 35032, 'loss/train': 1.5364875793457031} 11/07/2021 02:10:28 - INFO - __main__ - Step 35034: {'lr': 0.00044101863670875207, 'samples': 6726528, 'steps': 35033, 'loss/train': 1.8543156385421753} 11/07/2021 02:10:28 - INFO - __main__ - Step 35035: {'lr': 0.0004410152131365536, 'samples': 6726720, 'steps': 35034, 'loss/train': 1.3566513061523438} 11/07/2021 02:10:29 - INFO - __main__ - Step 35036: {'lr': 0.00044101178947828667, 'samples': 6726912, 'steps': 35035, 'loss/train': 1.6695739030838013} 11/07/2021 02:10:29 - INFO - __main__ - Step 35037: {'lr': 0.0004410083657339528, 'samples': 6727104, 'steps': 35036, 'loss/train': 1.3210400342941284} 11/07/2021 02:10:30 - INFO - __main__ - Step 35038: {'lr': 0.00044100494190355347, 'samples': 6727296, 'steps': 35037, 'loss/train': 1.3826854228973389} 11/07/2021 02:10:30 - INFO - __main__ - Step 35039: {'lr': 0.0004410015179870903, 'samples': 6727488, 'steps': 35038, 'loss/train': 1.1722732782363892} 11/07/2021 02:10:31 - INFO - __main__ - Step 35040: {'lr': 0.0004409980939845647, 'samples': 6727680, 'steps': 35039, 'loss/train': 1.5157371759414673} 11/07/2021 02:10:31 - INFO - __main__ - Step 35041: {'lr': 0.00044099466989597837, 'samples': 6727872, 'steps': 35040, 'loss/train': 0.8572977781295776} 11/07/2021 02:10:32 - INFO - __main__ - Step 35042: {'lr': 0.00044099124572133283, 'samples': 6728064, 'steps': 35041, 'loss/train': 1.6489825248718262} 11/07/2021 02:10:32 - INFO - __main__ - Step 35043: {'lr': 0.00044098782146062955, 'samples': 6728256, 'steps': 35042, 'loss/train': 1.5331951379776} 11/07/2021 02:10:33 - INFO - __main__ - Step 35044: {'lr': 0.00044098439711387006, 'samples': 6728448, 'steps': 35043, 'loss/train': 2.036832332611084} 11/07/2021 02:10:33 - INFO - __main__ - Step 35045: {'lr': 0.000440980972681056, 'samples': 6728640, 'steps': 35044, 'loss/train': 1.3515995740890503} 11/07/2021 02:10:34 - INFO - __main__ - Step 35046: {'lr': 0.0004409775481621888, 'samples': 6728832, 'steps': 35045, 'loss/train': 1.6868525743484497} 11/07/2021 02:10:34 - INFO - __main__ - Step 35047: {'lr': 0.0004409741235572701, 'samples': 6729024, 'steps': 35046, 'loss/train': 1.4944453239440918} 11/07/2021 02:10:34 - INFO - __main__ - Step 35048: {'lr': 0.0004409706988663015, 'samples': 6729216, 'steps': 35047, 'loss/train': 1.173051357269287} 11/07/2021 02:10:35 - INFO - __main__ - Step 35049: {'lr': 0.00044096727408928426, 'samples': 6729408, 'steps': 35048, 'loss/train': 0.7876967787742615} 11/07/2021 02:10:36 - INFO - __main__ - Step 35050: {'lr': 0.0004409638492262202, 'samples': 6729600, 'steps': 35049, 'loss/train': 1.5770975351333618} 11/07/2021 02:10:36 - INFO - __main__ - Step 35051: {'lr': 0.0004409604242771108, 'samples': 6729792, 'steps': 35050, 'loss/train': 1.332051157951355} 11/07/2021 02:10:36 - INFO - __main__ - Step 35052: {'lr': 0.0004409569992419576, 'samples': 6729984, 'steps': 35051, 'loss/train': 1.4920984506607056} 11/07/2021 02:10:37 - INFO - __main__ - Step 35053: {'lr': 0.0004409535741207621, 'samples': 6730176, 'steps': 35052, 'loss/train': 1.529261589050293} 11/07/2021 02:10:37 - INFO - __main__ - Step 35054: {'lr': 0.00044095014891352584, 'samples': 6730368, 'steps': 35053, 'loss/train': 1.5102005004882812} 11/07/2021 02:10:38 - INFO - __main__ - Step 35055: {'lr': 0.0004409467236202505, 'samples': 6730560, 'steps': 35054, 'loss/train': 1.2564643621444702} 11/07/2021 02:10:38 - INFO - __main__ - Step 35056: {'lr': 0.0004409432982409374, 'samples': 6730752, 'steps': 35055, 'loss/train': 1.3979930877685547} 11/07/2021 02:10:39 - INFO - __main__ - Step 35057: {'lr': 0.0004409398727755882, 'samples': 6730944, 'steps': 35056, 'loss/train': 1.7569228410720825} 11/07/2021 02:10:39 - INFO - __main__ - Step 35058: {'lr': 0.00044093644722420445, 'samples': 6731136, 'steps': 35057, 'loss/train': 1.5639421939849854} 11/07/2021 02:10:40 - INFO - __main__ - Step 35059: {'lr': 0.00044093302158678766, 'samples': 6731328, 'steps': 35058, 'loss/train': 1.4638268947601318} 11/07/2021 02:10:41 - INFO - __main__ - Step 35060: {'lr': 0.0004409295958633394, 'samples': 6731520, 'steps': 35059, 'loss/train': 1.401228427886963} 11/07/2021 02:10:41 - INFO - __main__ - Step 35061: {'lr': 0.00044092617005386125, 'samples': 6731712, 'steps': 35060, 'loss/train': 1.2886829376220703} 11/07/2021 02:10:41 - INFO - __main__ - Step 35062: {'lr': 0.00044092274415835473, 'samples': 6731904, 'steps': 35061, 'loss/train': 1.3746353387832642} 11/07/2021 02:10:42 - INFO - __main__ - Step 35063: {'lr': 0.0004409193181768213, 'samples': 6732096, 'steps': 35062, 'loss/train': 1.3529815673828125} 11/07/2021 02:10:42 - INFO - __main__ - Step 35064: {'lr': 0.00044091589210926266, 'samples': 6732288, 'steps': 35063, 'loss/train': 1.3183367252349854} 11/07/2021 02:10:43 - INFO - __main__ - Step 35065: {'lr': 0.00044091246595568025, 'samples': 6732480, 'steps': 35064, 'loss/train': 1.1994445323944092} 11/07/2021 02:10:43 - INFO - __main__ - Step 35066: {'lr': 0.00044090903971607555, 'samples': 6732672, 'steps': 35065, 'loss/train': 1.7616498470306396} 11/07/2021 02:10:44 - INFO - __main__ - Step 35067: {'lr': 0.0004409056133904502, 'samples': 6732864, 'steps': 35066, 'loss/train': 1.6132047176361084} 11/07/2021 02:10:44 - INFO - __main__ - Step 35068: {'lr': 0.00044090218697880577, 'samples': 6733056, 'steps': 35067, 'loss/train': 1.7040773630142212} 11/07/2021 02:10:44 - INFO - __main__ - Step 35069: {'lr': 0.0004408987604811437, 'samples': 6733248, 'steps': 35068, 'loss/train': 1.5884809494018555} 11/07/2021 02:10:46 - INFO - __main__ - Step 35070: {'lr': 0.00044089533389746573, 'samples': 6733440, 'steps': 35069, 'loss/train': 1.5156008005142212} 11/07/2021 02:10:46 - INFO - __main__ - Step 35071: {'lr': 0.00044089190722777316, 'samples': 6733632, 'steps': 35070, 'loss/train': 0.1750890463590622} 11/07/2021 02:10:47 - INFO - __main__ - Step 35072: {'lr': 0.00044088848047206763, 'samples': 6733824, 'steps': 35071, 'loss/train': 1.5426185131072998} 11/07/2021 02:10:47 - INFO - __main__ - Step 35073: {'lr': 0.0004408850536303507, 'samples': 6734016, 'steps': 35072, 'loss/train': 1.6482864618301392} 11/07/2021 02:10:47 - INFO - __main__ - Step 35074: {'lr': 0.000440881626702624, 'samples': 6734208, 'steps': 35073, 'loss/train': 1.0828303098678589} 11/07/2021 02:10:48 - INFO - __main__ - Step 35075: {'lr': 0.00044087819968888887, 'samples': 6734400, 'steps': 35074, 'loss/train': 1.6241428852081299} 11/07/2021 02:10:49 - INFO - __main__ - Step 35076: {'lr': 0.00044087477258914696, 'samples': 6734592, 'steps': 35075, 'loss/train': 1.5479053258895874} 11/07/2021 02:10:49 - INFO - __main__ - Step 35077: {'lr': 0.00044087134540339996, 'samples': 6734784, 'steps': 35076, 'loss/train': 1.4886677265167236} 11/07/2021 02:10:50 - INFO - __main__ - Step 35078: {'lr': 0.00044086791813164916, 'samples': 6734976, 'steps': 35077, 'loss/train': 1.4844224452972412} 11/07/2021 02:10:50 - INFO - __main__ - Step 35079: {'lr': 0.00044086449077389636, 'samples': 6735168, 'steps': 35078, 'loss/train': 0.624704897403717} 11/07/2021 02:10:50 - INFO - __main__ - Step 35080: {'lr': 0.0004408610633301428, 'samples': 6735360, 'steps': 35079, 'loss/train': 1.6156529188156128} 11/07/2021 02:10:51 - INFO - __main__ - Step 35081: {'lr': 0.00044085763580039027, 'samples': 6735552, 'steps': 35080, 'loss/train': 2.0500946044921875} 11/07/2021 02:10:52 - INFO - __main__ - Step 35082: {'lr': 0.0004408542081846402, 'samples': 6735744, 'steps': 35081, 'loss/train': 1.3576505184173584} 11/07/2021 02:10:52 - INFO - __main__ - Step 35083: {'lr': 0.0004408507804828942, 'samples': 6735936, 'steps': 35082, 'loss/train': 1.5053588151931763} 11/07/2021 02:10:52 - INFO - __main__ - Step 35084: {'lr': 0.00044084735269515375, 'samples': 6736128, 'steps': 35083, 'loss/train': 1.6152665615081787} 11/07/2021 02:10:53 - INFO - __main__ - Step 35085: {'lr': 0.0004408439248214205, 'samples': 6736320, 'steps': 35084, 'loss/train': 1.7226558923721313} 11/07/2021 02:10:54 - INFO - __main__ - Step 35086: {'lr': 0.00044084049686169584, 'samples': 6736512, 'steps': 35085, 'loss/train': 0.5707530379295349} 11/07/2021 02:10:54 - INFO - __main__ - Step 35087: {'lr': 0.00044083706881598147, 'samples': 6736704, 'steps': 35086, 'loss/train': 1.0183169841766357} 11/07/2021 02:10:54 - INFO - __main__ - Step 35088: {'lr': 0.00044083364068427875, 'samples': 6736896, 'steps': 35087, 'loss/train': 1.3448383808135986} 11/07/2021 02:10:55 - INFO - __main__ - Step 35089: {'lr': 0.0004408302124665894, 'samples': 6737088, 'steps': 35088, 'loss/train': 1.307666540145874} 11/07/2021 02:10:55 - INFO - __main__ - Step 35090: {'lr': 0.00044082678416291495, 'samples': 6737280, 'steps': 35089, 'loss/train': 1.2448477745056152} 11/07/2021 02:10:56 - INFO - __main__ - Step 35091: {'lr': 0.00044082335577325685, 'samples': 6737472, 'steps': 35090, 'loss/train': 1.7214866876602173} 11/07/2021 02:10:56 - INFO - __main__ - Step 35092: {'lr': 0.0004408199272976167, 'samples': 6737664, 'steps': 35091, 'loss/train': 1.1318559646606445} 11/07/2021 02:10:57 - INFO - __main__ - Step 35093: {'lr': 0.00044081649873599604, 'samples': 6737856, 'steps': 35092, 'loss/train': 1.1720187664031982} 11/07/2021 02:10:57 - INFO - __main__ - Step 35094: {'lr': 0.0004408130700883964, 'samples': 6738048, 'steps': 35093, 'loss/train': 1.5962964296340942} 11/07/2021 02:10:57 - INFO - __main__ - Step 35095: {'lr': 0.0004408096413548193, 'samples': 6738240, 'steps': 35094, 'loss/train': 1.6870752573013306} 11/07/2021 02:10:59 - INFO - __main__ - Step 35096: {'lr': 0.00044080621253526637, 'samples': 6738432, 'steps': 35095, 'loss/train': 1.6659575700759888} 11/07/2021 02:10:59 - INFO - __main__ - Step 35097: {'lr': 0.00044080278362973913, 'samples': 6738624, 'steps': 35096, 'loss/train': 0.179875910282135} 11/07/2021 02:10:59 - INFO - __main__ - Step 35098: {'lr': 0.00044079935463823904, 'samples': 6738816, 'steps': 35097, 'loss/train': 1.3444435596466064} 11/07/2021 02:11:00 - INFO - __main__ - Step 35099: {'lr': 0.00044079592556076774, 'samples': 6739008, 'steps': 35098, 'loss/train': 1.744091510772705} 11/07/2021 02:11:00 - INFO - __main__ - Step 35100: {'lr': 0.00044079249639732664, 'samples': 6739200, 'steps': 35099, 'loss/train': 1.3593708276748657} 11/07/2021 02:11:00 - INFO - __main__ - Step 35101: {'lr': 0.00044078906714791757, 'samples': 6739392, 'steps': 35100, 'loss/train': 1.0961785316467285} 11/07/2021 02:11:01 - INFO - __main__ - Step 35102: {'lr': 0.0004407856378125418, 'samples': 6739584, 'steps': 35101, 'loss/train': 0.9805355668067932} 11/07/2021 02:11:02 - INFO - __main__ - Step 35103: {'lr': 0.00044078220839120086, 'samples': 6739776, 'steps': 35102, 'loss/train': 1.060352087020874} 11/07/2021 02:11:02 - INFO - __main__ - Step 35104: {'lr': 0.0004407787788838966, 'samples': 6739968, 'steps': 35103, 'loss/train': 1.8644888401031494} 11/07/2021 02:11:02 - INFO - __main__ - Step 35105: {'lr': 0.00044077534929063024, 'samples': 6740160, 'steps': 35104, 'loss/train': 0.9975321292877197} 11/07/2021 02:11:03 - INFO - __main__ - Step 35106: {'lr': 0.00044077191961140337, 'samples': 6740352, 'steps': 35105, 'loss/train': 1.116125226020813} 11/07/2021 02:11:04 - INFO - __main__ - Step 35107: {'lr': 0.00044076848984621775, 'samples': 6740544, 'steps': 35106, 'loss/train': 1.8047592639923096} 11/07/2021 02:11:04 - INFO - __main__ - Step 35108: {'lr': 0.00044076505999507474, 'samples': 6740736, 'steps': 35107, 'loss/train': 1.368557333946228} 11/07/2021 02:11:04 - INFO - __main__ - Step 35109: {'lr': 0.00044076163005797597, 'samples': 6740928, 'steps': 35108, 'loss/train': 1.1860182285308838} 11/07/2021 02:11:05 - INFO - __main__ - Step 35110: {'lr': 0.00044075820003492295, 'samples': 6741120, 'steps': 35109, 'loss/train': 1.5062674283981323} 11/07/2021 02:11:05 - INFO - __main__ - Step 35111: {'lr': 0.0004407547699259173, 'samples': 6741312, 'steps': 35110, 'loss/train': 0.8496090769767761} 11/07/2021 02:11:07 - INFO - __main__ - Step 35112: {'lr': 0.0004407513397309604, 'samples': 6741504, 'steps': 35111, 'loss/train': 1.6295762062072754} 11/07/2021 02:11:08 - INFO - __main__ - Step 35113: {'lr': 0.0004407479094500539, 'samples': 6741696, 'steps': 35112, 'loss/train': 1.8969480991363525} 11/07/2021 02:11:08 - INFO - __main__ - Step 35114: {'lr': 0.00044074447908319935, 'samples': 6741888, 'steps': 35113, 'loss/train': 1.7611387968063354} 11/07/2021 02:11:08 - INFO - __main__ - Step 35115: {'lr': 0.0004407410486303983, 'samples': 6742080, 'steps': 35114, 'loss/train': 1.7748589515686035} 11/07/2021 02:11:09 - INFO - __main__ - Step 35116: {'lr': 0.0004407376180916522, 'samples': 6742272, 'steps': 35115, 'loss/train': 1.7863101959228516} 11/07/2021 02:11:09 - INFO - __main__ - Step 35117: {'lr': 0.0004407341874669627, 'samples': 6742464, 'steps': 35116, 'loss/train': 1.1374247074127197} 11/07/2021 02:11:09 - INFO - __main__ - Step 35118: {'lr': 0.00044073075675633134, 'samples': 6742656, 'steps': 35117, 'loss/train': 1.0929744243621826} 11/07/2021 02:11:10 - INFO - __main__ - Step 35119: {'lr': 0.0004407273259597597, 'samples': 6742848, 'steps': 35118, 'loss/train': 1.2872812747955322} 11/07/2021 02:11:11 - INFO - __main__ - Step 35120: {'lr': 0.0004407238950772492, 'samples': 6743040, 'steps': 35119, 'loss/train': 1.0824187994003296} 11/07/2021 02:11:11 - INFO - __main__ - Step 35121: {'lr': 0.00044072046410880143, 'samples': 6743232, 'steps': 35120, 'loss/train': 1.271795392036438} 11/07/2021 02:11:12 - INFO - __main__ - Step 35122: {'lr': 0.000440717033054418, 'samples': 6743424, 'steps': 35121, 'loss/train': 1.3499759435653687} 11/07/2021 02:11:12 - INFO - __main__ - Step 35123: {'lr': 0.0004407136019141005, 'samples': 6743616, 'steps': 35122, 'loss/train': 2.010877847671509} 11/07/2021 02:11:12 - INFO - __main__ - Step 35124: {'lr': 0.0004407101706878502, 'samples': 6743808, 'steps': 35123, 'loss/train': 1.6080549955368042} 11/07/2021 02:11:13 - INFO - __main__ - Step 35125: {'lr': 0.000440706739375669, 'samples': 6744000, 'steps': 35124, 'loss/train': 1.2922800779342651} 11/07/2021 02:11:14 - INFO - __main__ - Step 35126: {'lr': 0.00044070330797755825, 'samples': 6744192, 'steps': 35125, 'loss/train': 1.4698541164398193} 11/07/2021 02:11:14 - INFO - __main__ - Step 35127: {'lr': 0.0004406998764935195, 'samples': 6744384, 'steps': 35126, 'loss/train': 1.8241018056869507} 11/07/2021 02:11:14 - INFO - __main__ - Step 35128: {'lr': 0.0004406964449235544, 'samples': 6744576, 'steps': 35127, 'loss/train': 1.5889697074890137} 11/07/2021 02:11:15 - INFO - __main__ - Step 35129: {'lr': 0.00044069301326766434, 'samples': 6744768, 'steps': 35128, 'loss/train': 1.4178204536437988} 11/07/2021 02:11:16 - INFO - __main__ - Step 35130: {'lr': 0.00044068958152585104, 'samples': 6744960, 'steps': 35129, 'loss/train': 1.3953704833984375} 11/07/2021 02:11:16 - INFO - __main__ - Step 35131: {'lr': 0.00044068614969811586, 'samples': 6745152, 'steps': 35130, 'loss/train': 1.991770625114441} 11/07/2021 02:11:16 - INFO - __main__ - Step 35132: {'lr': 0.0004406827177844605, 'samples': 6745344, 'steps': 35131, 'loss/train': 1.2228375673294067} 11/07/2021 02:11:17 - INFO - __main__ - Step 35133: {'lr': 0.00044067928578488645, 'samples': 6745536, 'steps': 35132, 'loss/train': 1.795264720916748} 11/07/2021 02:11:17 - INFO - __main__ - Step 35134: {'lr': 0.0004406758536993952, 'samples': 6745728, 'steps': 35133, 'loss/train': 1.8044919967651367} 11/07/2021 02:11:19 - INFO - __main__ - Step 35135: {'lr': 0.00044067242152798843, 'samples': 6745920, 'steps': 35134, 'loss/train': 1.4043021202087402} 11/07/2021 02:11:19 - INFO - __main__ - Step 35136: {'lr': 0.00044066898927066757, 'samples': 6746112, 'steps': 35135, 'loss/train': 1.4440217018127441} 11/07/2021 02:11:19 - INFO - __main__ - Step 35137: {'lr': 0.0004406655569274342, 'samples': 6746304, 'steps': 35136, 'loss/train': 0.3199705183506012} 11/07/2021 02:11:20 - INFO - __main__ - Step 35138: {'lr': 0.0004406621244982899, 'samples': 6746496, 'steps': 35137, 'loss/train': 1.5187562704086304} 11/07/2021 02:11:20 - INFO - __main__ - Step 35139: {'lr': 0.00044065869198323614, 'samples': 6746688, 'steps': 35138, 'loss/train': 1.7210921049118042} 11/07/2021 02:11:20 - INFO - __main__ - Step 35140: {'lr': 0.0004406552593822746, 'samples': 6746880, 'steps': 35139, 'loss/train': 1.8576350212097168} 11/07/2021 02:11:21 - INFO - __main__ - Step 35141: {'lr': 0.00044065182669540665, 'samples': 6747072, 'steps': 35140, 'loss/train': 1.4928169250488281} 11/07/2021 02:11:22 - INFO - __main__ - Step 35142: {'lr': 0.000440648393922634, 'samples': 6747264, 'steps': 35141, 'loss/train': 0.8861057162284851} 11/07/2021 02:11:22 - INFO - __main__ - Step 35143: {'lr': 0.0004406449610639581, 'samples': 6747456, 'steps': 35142, 'loss/train': 1.1979506015777588} 11/07/2021 02:11:22 - INFO - __main__ - Step 35144: {'lr': 0.0004406415281193805, 'samples': 6747648, 'steps': 35143, 'loss/train': 2.018486738204956} 11/07/2021 02:11:23 - INFO - __main__ - Step 35145: {'lr': 0.0004406380950889027, 'samples': 6747840, 'steps': 35144, 'loss/train': 1.4049015045166016} 11/07/2021 02:11:24 - INFO - __main__ - Step 35146: {'lr': 0.0004406346619725265, 'samples': 6748032, 'steps': 35145, 'loss/train': 1.5860729217529297} 11/07/2021 02:11:24 - INFO - __main__ - Step 35147: {'lr': 0.00044063122877025315, 'samples': 6748224, 'steps': 35146, 'loss/train': 1.0698484182357788} 11/07/2021 02:11:24 - INFO - __main__ - Step 35148: {'lr': 0.0004406277954820843, 'samples': 6748416, 'steps': 35147, 'loss/train': 1.1080349683761597} 11/07/2021 02:11:25 - INFO - __main__ - Step 35149: {'lr': 0.0004406243621080216, 'samples': 6748608, 'steps': 35148, 'loss/train': 1.8648744821548462} 11/07/2021 02:11:25 - INFO - __main__ - Step 35150: {'lr': 0.00044062092864806634, 'samples': 6748800, 'steps': 35149, 'loss/train': 1.0627784729003906} 11/07/2021 02:11:26 - INFO - __main__ - Step 35151: {'lr': 0.00044061749510222037, 'samples': 6748992, 'steps': 35150, 'loss/train': 1.6568477153778076} 11/07/2021 02:11:27 - INFO - __main__ - Step 35152: {'lr': 0.00044061406147048504, 'samples': 6749184, 'steps': 35151, 'loss/train': 1.7818259000778198} 11/07/2021 02:11:27 - INFO - __main__ - Step 35153: {'lr': 0.000440610627752862, 'samples': 6749376, 'steps': 35152, 'loss/train': 1.7012907266616821} 11/07/2021 02:11:27 - INFO - __main__ - Step 35154: {'lr': 0.00044060719394935265, 'samples': 6749568, 'steps': 35153, 'loss/train': 1.8704605102539062} 11/07/2021 02:11:28 - INFO - __main__ - Step 35155: {'lr': 0.0004406037600599588, 'samples': 6749760, 'steps': 35154, 'loss/train': 1.3309500217437744} 11/07/2021 02:11:29 - INFO - __main__ - Step 35156: {'lr': 0.0004406003260846817, 'samples': 6749952, 'steps': 35155, 'loss/train': 1.7605141401290894} 11/07/2021 02:11:29 - INFO - __main__ - Step 35157: {'lr': 0.0004405968920235231, 'samples': 6750144, 'steps': 35156, 'loss/train': 1.6250115633010864} 11/07/2021 02:11:29 - INFO - __main__ - Step 35158: {'lr': 0.0004405934578764845, 'samples': 6750336, 'steps': 35157, 'loss/train': 1.0034600496292114} 11/07/2021 02:11:30 - INFO - __main__ - Step 35159: {'lr': 0.0004405900236435674, 'samples': 6750528, 'steps': 35158, 'loss/train': 0.8062552809715271} 11/07/2021 02:11:30 - INFO - __main__ - Step 35160: {'lr': 0.00044058658932477336, 'samples': 6750720, 'steps': 35159, 'loss/train': 2.0084025859832764} 11/07/2021 02:11:31 - INFO - __main__ - Step 35161: {'lr': 0.0004405831549201039, 'samples': 6750912, 'steps': 35160, 'loss/train': 1.330609917640686} 11/07/2021 02:11:31 - INFO - __main__ - Step 35162: {'lr': 0.0004405797204295607, 'samples': 6751104, 'steps': 35161, 'loss/train': 1.5582258701324463} 11/07/2021 02:11:32 - INFO - __main__ - Step 35163: {'lr': 0.0004405762858531451, 'samples': 6751296, 'steps': 35162, 'loss/train': 1.6793432235717773} 11/07/2021 02:11:32 - INFO - __main__ - Step 35164: {'lr': 0.00044057285119085887, 'samples': 6751488, 'steps': 35163, 'loss/train': 1.3808073997497559} 11/07/2021 02:11:32 - INFO - __main__ - Step 35165: {'lr': 0.0004405694164427035, 'samples': 6751680, 'steps': 35164, 'loss/train': 2.0149216651916504} 11/07/2021 02:11:34 - INFO - __main__ - Step 35166: {'lr': 0.0004405659816086804, 'samples': 6751872, 'steps': 35165, 'loss/train': 0.9017762541770935} 11/07/2021 02:11:34 - INFO - __main__ - Step 35167: {'lr': 0.00044056254668879127, 'samples': 6752064, 'steps': 35166, 'loss/train': 1.2902568578720093} 11/07/2021 02:11:34 - INFO - __main__ - Step 35168: {'lr': 0.00044055911168303753, 'samples': 6752256, 'steps': 35167, 'loss/train': 1.8604564666748047} 11/07/2021 02:11:35 - INFO - __main__ - Step 35169: {'lr': 0.00044055567659142083, 'samples': 6752448, 'steps': 35168, 'loss/train': 0.897270679473877} 11/07/2021 02:11:35 - INFO - __main__ - Step 35170: {'lr': 0.0004405522414139427, 'samples': 6752640, 'steps': 35169, 'loss/train': 1.6258909702301025} 11/07/2021 02:11:36 - INFO - __main__ - Step 35171: {'lr': 0.0004405488061506047, 'samples': 6752832, 'steps': 35170, 'loss/train': 1.4886150360107422} 11/07/2021 02:11:36 - INFO - __main__ - Step 35172: {'lr': 0.0004405453708014082, 'samples': 6753024, 'steps': 35171, 'loss/train': 1.5348491668701172} 11/07/2021 02:11:37 - INFO - __main__ - Step 35173: {'lr': 0.00044054193536635503, 'samples': 6753216, 'steps': 35172, 'loss/train': 1.6122689247131348} 11/07/2021 02:11:37 - INFO - __main__ - Step 35174: {'lr': 0.00044053849984544653, 'samples': 6753408, 'steps': 35173, 'loss/train': 1.3885899782180786} 11/07/2021 02:11:37 - INFO - __main__ - Step 35175: {'lr': 0.0004405350642386844, 'samples': 6753600, 'steps': 35174, 'loss/train': 1.5848331451416016} 11/07/2021 02:11:38 - INFO - __main__ - Step 35176: {'lr': 0.00044053162854607004, 'samples': 6753792, 'steps': 35175, 'loss/train': 1.2173967361450195} 11/07/2021 02:11:39 - INFO - __main__ - Step 35177: {'lr': 0.0004405281927676051, 'samples': 6753984, 'steps': 35176, 'loss/train': 1.600562572479248} 11/07/2021 02:11:39 - INFO - __main__ - Step 35178: {'lr': 0.0004405247569032911, 'samples': 6754176, 'steps': 35177, 'loss/train': 1.6044585704803467} 11/07/2021 02:11:39 - INFO - __main__ - Step 35179: {'lr': 0.00044052132095312956, 'samples': 6754368, 'steps': 35178, 'loss/train': 1.7462291717529297} 11/07/2021 02:11:40 - INFO - __main__ - Step 35180: {'lr': 0.0004405178849171221, 'samples': 6754560, 'steps': 35179, 'loss/train': 1.35542631149292} 11/07/2021 02:11:40 - INFO - __main__ - Step 35181: {'lr': 0.00044051444879527013, 'samples': 6754752, 'steps': 35180, 'loss/train': 1.525868535041809} 11/07/2021 02:11:41 - INFO - __main__ - Step 35182: {'lr': 0.00044051101258757544, 'samples': 6754944, 'steps': 35181, 'loss/train': 1.3119614124298096} 11/07/2021 02:11:42 - INFO - __main__ - Step 35183: {'lr': 0.0004405075762940393, 'samples': 6755136, 'steps': 35182, 'loss/train': 1.6410095691680908} 11/07/2021 02:11:42 - INFO - __main__ - Step 35184: {'lr': 0.00044050413991466344, 'samples': 6755328, 'steps': 35183, 'loss/train': 2.1109349727630615} 11/07/2021 02:11:42 - INFO - __main__ - Step 35185: {'lr': 0.0004405007034494494, 'samples': 6755520, 'steps': 35184, 'loss/train': 1.5869981050491333} 11/07/2021 02:11:43 - INFO - __main__ - Step 35186: {'lr': 0.00044049726689839854, 'samples': 6755712, 'steps': 35185, 'loss/train': 1.2204842567443848} 11/07/2021 02:11:44 - INFO - __main__ - Step 35187: {'lr': 0.0004404938302615126, 'samples': 6755904, 'steps': 35186, 'loss/train': 2.1978719234466553} 11/07/2021 02:11:44 - INFO - __main__ - Step 35188: {'lr': 0.00044049039353879317, 'samples': 6756096, 'steps': 35187, 'loss/train': 1.2825113534927368} 11/07/2021 02:11:44 - INFO - __main__ - Step 35189: {'lr': 0.00044048695673024166, 'samples': 6756288, 'steps': 35188, 'loss/train': 1.7250984907150269} 11/07/2021 02:11:45 - INFO - __main__ - Step 35190: {'lr': 0.00044048351983585966, 'samples': 6756480, 'steps': 35189, 'loss/train': 1.7315864562988281} 11/07/2021 02:11:45 - INFO - __main__ - Step 35191: {'lr': 0.00044048008285564865, 'samples': 6756672, 'steps': 35190, 'loss/train': 1.1441526412963867} 11/07/2021 02:11:46 - INFO - __main__ - Step 35192: {'lr': 0.0004404766457896104, 'samples': 6756864, 'steps': 35191, 'loss/train': 1.5588198900222778} 11/07/2021 02:11:47 - INFO - __main__ - Step 35193: {'lr': 0.0004404732086377462, 'samples': 6757056, 'steps': 35192, 'loss/train': 1.1891738176345825} 11/07/2021 02:11:47 - INFO - __main__ - Step 35194: {'lr': 0.00044046977140005774, 'samples': 6757248, 'steps': 35193, 'loss/train': 1.190617561340332} 11/07/2021 02:11:47 - INFO - __main__ - Step 35195: {'lr': 0.00044046633407654657, 'samples': 6757440, 'steps': 35194, 'loss/train': 1.2330223321914673} 11/07/2021 02:11:48 - INFO - __main__ - Step 35196: {'lr': 0.0004404628966672142, 'samples': 6757632, 'steps': 35195, 'loss/train': 1.1752219200134277} 11/07/2021 02:11:49 - INFO - __main__ - Step 35197: {'lr': 0.0004404594591720622, 'samples': 6757824, 'steps': 35196, 'loss/train': 1.444063663482666} 11/07/2021 02:11:49 - INFO - __main__ - Step 35198: {'lr': 0.00044045602159109207, 'samples': 6758016, 'steps': 35197, 'loss/train': 1.4819884300231934} 11/07/2021 02:11:49 - INFO - __main__ - Step 35199: {'lr': 0.0004404525839243054, 'samples': 6758208, 'steps': 35198, 'loss/train': 1.7841776609420776} 11/07/2021 02:11:50 - INFO - __main__ - Step 35200: {'lr': 0.00044044914617170374, 'samples': 6758400, 'steps': 35199, 'loss/train': 1.6430798768997192} 11/07/2021 02:11:50 - INFO - __main__ - Step 35201: {'lr': 0.00044044570833328865, 'samples': 6758592, 'steps': 35200, 'loss/train': 1.6653860807418823} 11/07/2021 02:11:51 - INFO - __main__ - Step 35202: {'lr': 0.00044044227040906166, 'samples': 6758784, 'steps': 35201, 'loss/train': 1.5937496423721313} 11/07/2021 02:11:51 - INFO - __main__ - Step 35203: {'lr': 0.00044043883239902425, 'samples': 6758976, 'steps': 35202, 'loss/train': 0.7953693270683289} 11/07/2021 02:11:52 - INFO - __main__ - Step 35204: {'lr': 0.00044043539430317814, 'samples': 6759168, 'steps': 35203, 'loss/train': 1.1854610443115234} 11/07/2021 02:11:52 - INFO - __main__ - Step 35205: {'lr': 0.00044043195612152475, 'samples': 6759360, 'steps': 35204, 'loss/train': 1.4423195123672485} 11/07/2021 02:11:52 - INFO - __main__ - Step 35206: {'lr': 0.0004404285178540657, 'samples': 6759552, 'steps': 35205, 'loss/train': 1.2376537322998047} 11/07/2021 02:11:53 - INFO - __main__ - Step 35207: {'lr': 0.0004404250795008024, 'samples': 6759744, 'steps': 35206, 'loss/train': 1.9665284156799316} 11/07/2021 02:11:54 - INFO - __main__ - Step 35208: {'lr': 0.00044042164106173655, 'samples': 6759936, 'steps': 35207, 'loss/train': 1.6511269807815552} 11/07/2021 02:11:54 - INFO - __main__ - Step 35209: {'lr': 0.00044041820253686964, 'samples': 6760128, 'steps': 35208, 'loss/train': 1.4399462938308716} 11/07/2021 02:11:55 - INFO - __main__ - Step 35210: {'lr': 0.0004404147639262032, 'samples': 6760320, 'steps': 35209, 'loss/train': 2.1247313022613525} 11/07/2021 02:11:55 - INFO - __main__ - Step 35211: {'lr': 0.00044041132522973885, 'samples': 6760512, 'steps': 35210, 'loss/train': 1.3795652389526367} 11/07/2021 02:11:55 - INFO - __main__ - Step 35212: {'lr': 0.0004404078864474781, 'samples': 6760704, 'steps': 35211, 'loss/train': 1.4530143737792969} 11/07/2021 02:11:56 - INFO - __main__ - Step 35213: {'lr': 0.00044040444757942245, 'samples': 6760896, 'steps': 35212, 'loss/train': 0.6121195554733276} 11/07/2021 02:11:57 - INFO - __main__ - Step 35214: {'lr': 0.00044040100862557355, 'samples': 6761088, 'steps': 35213, 'loss/train': 1.6529427766799927} 11/07/2021 02:11:57 - INFO - __main__ - Step 35215: {'lr': 0.00044039756958593287, 'samples': 6761280, 'steps': 35214, 'loss/train': 1.8432458639144897} 11/07/2021 02:11:57 - INFO - __main__ - Step 35216: {'lr': 0.000440394130460502, 'samples': 6761472, 'steps': 35215, 'loss/train': 1.6579116582870483} 11/07/2021 02:11:58 - INFO - __main__ - Step 35217: {'lr': 0.00044039069124928245, 'samples': 6761664, 'steps': 35216, 'loss/train': 1.7480180263519287} 11/07/2021 02:11:59 - INFO - __main__ - Step 35218: {'lr': 0.0004403872519522758, 'samples': 6761856, 'steps': 35217, 'loss/train': 1.7644002437591553} 11/07/2021 02:11:59 - INFO - __main__ - Step 35219: {'lr': 0.00044038381256948357, 'samples': 6762048, 'steps': 35218, 'loss/train': 1.3994035720825195} 11/07/2021 02:11:59 - INFO - __main__ - Step 35220: {'lr': 0.00044038037310090736, 'samples': 6762240, 'steps': 35219, 'loss/train': 1.061874270439148} 11/07/2021 02:12:00 - INFO - __main__ - Step 35221: {'lr': 0.00044037693354654863, 'samples': 6762432, 'steps': 35220, 'loss/train': 1.141141653060913} 11/07/2021 02:12:00 - INFO - __main__ - Step 35222: {'lr': 0.0004403734939064091, 'samples': 6762624, 'steps': 35221, 'loss/train': 1.7482527494430542} 11/07/2021 02:12:01 - INFO - __main__ - Step 35223: {'lr': 0.00044037005418049016, 'samples': 6762816, 'steps': 35222, 'loss/train': 2.018164873123169} 11/07/2021 02:12:02 - INFO - __main__ - Step 35224: {'lr': 0.00044036661436879334, 'samples': 6763008, 'steps': 35223, 'loss/train': 1.7777644395828247} 11/07/2021 02:12:02 - INFO - __main__ - Step 35225: {'lr': 0.00044036317447132035, 'samples': 6763200, 'steps': 35224, 'loss/train': 1.500864863395691} 11/07/2021 02:12:02 - INFO - __main__ - Step 35226: {'lr': 0.00044035973448807266, 'samples': 6763392, 'steps': 35225, 'loss/train': 1.6354080438613892} 11/07/2021 02:12:03 - INFO - __main__ - Step 35227: {'lr': 0.00044035629441905173, 'samples': 6763584, 'steps': 35226, 'loss/train': 1.693660020828247} 11/07/2021 02:12:04 - INFO - __main__ - Step 35228: {'lr': 0.0004403528542642592, 'samples': 6763776, 'steps': 35227, 'loss/train': 1.55222487449646} 11/07/2021 02:12:04 - INFO - __main__ - Step 35229: {'lr': 0.00044034941402369666, 'samples': 6763968, 'steps': 35228, 'loss/train': 1.2032617330551147} 11/07/2021 02:12:04 - INFO - __main__ - Step 35230: {'lr': 0.0004403459736973656, 'samples': 6764160, 'steps': 35229, 'loss/train': 1.302371859550476} 11/07/2021 02:12:05 - INFO - __main__ - Step 35231: {'lr': 0.00044034253328526765, 'samples': 6764352, 'steps': 35230, 'loss/train': 1.5945069789886475} 11/07/2021 02:12:05 - INFO - __main__ - Step 35232: {'lr': 0.00044033909278740416, 'samples': 6764544, 'steps': 35231, 'loss/train': 1.67051362991333} 11/07/2021 02:12:06 - INFO - __main__ - Step 35233: {'lr': 0.0004403356522037769, 'samples': 6764736, 'steps': 35232, 'loss/train': 0.9942646622657776} 11/07/2021 02:12:06 - INFO - __main__ - Step 35234: {'lr': 0.00044033221153438727, 'samples': 6764928, 'steps': 35233, 'loss/train': 1.897809386253357} 11/07/2021 02:12:07 - INFO - __main__ - Step 35235: {'lr': 0.00044032877077923696, 'samples': 6765120, 'steps': 35234, 'loss/train': 1.6269075870513916} 11/07/2021 02:12:07 - INFO - __main__ - Step 35236: {'lr': 0.0004403253299383274, 'samples': 6765312, 'steps': 35235, 'loss/train': 1.3000872135162354} 11/07/2021 02:12:07 - INFO - __main__ - Step 35237: {'lr': 0.00044032188901166016, 'samples': 6765504, 'steps': 35236, 'loss/train': 1.3950632810592651} 11/07/2021 02:12:08 - INFO - __main__ - Step 35238: {'lr': 0.0004403184479992368, 'samples': 6765696, 'steps': 35237, 'loss/train': 1.5457737445831299} 11/07/2021 02:12:09 - INFO - __main__ - Step 35239: {'lr': 0.000440315006901059, 'samples': 6765888, 'steps': 35238, 'loss/train': 1.7797479629516602} 11/07/2021 02:12:09 - INFO - __main__ - Step 35240: {'lr': 0.00044031156571712807, 'samples': 6766080, 'steps': 35239, 'loss/train': 1.6283468008041382} 11/07/2021 02:12:10 - INFO - __main__ - Step 35241: {'lr': 0.0004403081244474457, 'samples': 6766272, 'steps': 35240, 'loss/train': 0.35879331827163696} 11/07/2021 02:12:10 - INFO - __main__ - Step 35242: {'lr': 0.00044030468309201354, 'samples': 6766464, 'steps': 35241, 'loss/train': 1.5128042697906494} 11/07/2021 02:12:10 - INFO - __main__ - Step 35243: {'lr': 0.0004403012416508329, 'samples': 6766656, 'steps': 35242, 'loss/train': 1.65208899974823} 11/07/2021 02:12:11 - INFO - __main__ - Step 35244: {'lr': 0.00044029780012390553, 'samples': 6766848, 'steps': 35243, 'loss/train': 1.3009748458862305} 11/07/2021 02:12:12 - INFO - __main__ - Step 35245: {'lr': 0.0004402943585112329, 'samples': 6767040, 'steps': 35244, 'loss/train': 1.2903783321380615} 11/07/2021 02:12:12 - INFO - __main__ - Step 35246: {'lr': 0.0004402909168128165, 'samples': 6767232, 'steps': 35245, 'loss/train': 0.9609237909317017} 11/07/2021 02:12:12 - INFO - __main__ - Step 35247: {'lr': 0.00044028747502865794, 'samples': 6767424, 'steps': 35246, 'loss/train': 1.4407219886779785} 11/07/2021 02:12:13 - INFO - __main__ - Step 35248: {'lr': 0.0004402840331587589, 'samples': 6767616, 'steps': 35247, 'loss/train': 1.1330758333206177} 11/07/2021 02:12:14 - INFO - __main__ - Step 35249: {'lr': 0.0004402805912031207, 'samples': 6767808, 'steps': 35248, 'loss/train': 1.5032768249511719} 11/07/2021 02:12:15 - INFO - __main__ - Step 35250: {'lr': 0.0004402771491617451, 'samples': 6768000, 'steps': 35249, 'loss/train': 1.1299022436141968} 11/07/2021 02:12:15 - INFO - __main__ - Step 35251: {'lr': 0.0004402737070346335, 'samples': 6768192, 'steps': 35250, 'loss/train': 1.785966396331787} 11/07/2021 02:12:15 - INFO - __main__ - Step 35252: {'lr': 0.0004402702648217875, 'samples': 6768384, 'steps': 35251, 'loss/train': 1.8759437799453735} 11/07/2021 02:12:16 - INFO - __main__ - Step 35253: {'lr': 0.00044026682252320864, 'samples': 6768576, 'steps': 35252, 'loss/train': 1.5136964321136475} 11/07/2021 02:12:16 - INFO - __main__ - Step 35254: {'lr': 0.00044026338013889853, 'samples': 6768768, 'steps': 35253, 'loss/train': 1.7734421491622925} 11/07/2021 02:12:17 - INFO - __main__ - Step 35255: {'lr': 0.00044025993766885866, 'samples': 6768960, 'steps': 35254, 'loss/train': 1.382339358329773} 11/07/2021 02:12:17 - INFO - __main__ - Step 35256: {'lr': 0.00044025649511309064, 'samples': 6769152, 'steps': 35255, 'loss/train': 1.1511459350585938} 11/07/2021 02:12:18 - INFO - __main__ - Step 35257: {'lr': 0.00044025305247159585, 'samples': 6769344, 'steps': 35256, 'loss/train': 1.6177397966384888} 11/07/2021 02:12:18 - INFO - __main__ - Step 35258: {'lr': 0.00044024960974437606, 'samples': 6769536, 'steps': 35257, 'loss/train': 1.3453264236450195} 11/07/2021 02:12:18 - INFO - __main__ - Step 35259: {'lr': 0.0004402461669314327, 'samples': 6769728, 'steps': 35258, 'loss/train': 1.6906169652938843} 11/07/2021 02:12:21 - INFO - __main__ - Step 35260: {'lr': 0.0004402427240327674, 'samples': 6769920, 'steps': 35259, 'loss/train': 1.5880019664764404} 11/07/2021 02:12:21 - INFO - __main__ - Step 35261: {'lr': 0.0004402392810483816, 'samples': 6770112, 'steps': 35260, 'loss/train': 1.5076464414596558} 11/07/2021 02:12:21 - INFO - __main__ - Step 35262: {'lr': 0.000440235837978277, 'samples': 6770304, 'steps': 35261, 'loss/train': 1.831540584564209} 11/07/2021 02:12:22 - INFO - __main__ - Step 35263: {'lr': 0.00044023239482245504, 'samples': 6770496, 'steps': 35262, 'loss/train': 2.123952865600586} 11/07/2021 02:12:22 - INFO - __main__ - Step 35264: {'lr': 0.0004402289515809172, 'samples': 6770688, 'steps': 35263, 'loss/train': 1.891985297203064} 11/07/2021 02:12:22 - INFO - __main__ - Step 35265: {'lr': 0.00044022550825366526, 'samples': 6770880, 'steps': 35264, 'loss/train': 1.8454724550247192} 11/07/2021 02:12:23 - INFO - __main__ - Step 35266: {'lr': 0.0004402220648407006, 'samples': 6771072, 'steps': 35265, 'loss/train': 1.8766562938690186} 11/07/2021 02:12:23 - INFO - __main__ - Step 35267: {'lr': 0.00044021862134202485, 'samples': 6771264, 'steps': 35266, 'loss/train': 1.5124690532684326} 11/07/2021 02:12:24 - INFO - __main__ - Step 35268: {'lr': 0.00044021517775763943, 'samples': 6771456, 'steps': 35267, 'loss/train': 1.4166333675384521} 11/07/2021 02:12:25 - INFO - __main__ - Step 35269: {'lr': 0.00044021173408754604, 'samples': 6771648, 'steps': 35268, 'loss/train': 1.3877267837524414} 11/07/2021 02:12:25 - INFO - __main__ - Step 35270: {'lr': 0.00044020829033174615, 'samples': 6771840, 'steps': 35269, 'loss/train': 1.5778212547302246} 11/07/2021 02:12:25 - INFO - __main__ - Step 35271: {'lr': 0.0004402048464902414, 'samples': 6772032, 'steps': 35270, 'loss/train': 1.4013853073120117} 11/07/2021 02:12:26 - INFO - __main__ - Step 35272: {'lr': 0.0004402014025630332, 'samples': 6772224, 'steps': 35271, 'loss/train': 2.010936737060547} 11/07/2021 02:12:27 - INFO - __main__ - Step 35273: {'lr': 0.00044019795855012325, 'samples': 6772416, 'steps': 35272, 'loss/train': 1.4244587421417236} 11/07/2021 02:12:27 - INFO - __main__ - Step 35274: {'lr': 0.00044019451445151305, 'samples': 6772608, 'steps': 35273, 'loss/train': 1.4892654418945312} 11/07/2021 02:12:27 - INFO - __main__ - Step 35275: {'lr': 0.00044019107026720404, 'samples': 6772800, 'steps': 35274, 'loss/train': 1.6502214670181274} 11/07/2021 02:12:28 - INFO - __main__ - Step 35276: {'lr': 0.00044018762599719796, 'samples': 6772992, 'steps': 35275, 'loss/train': 1.509718656539917} 11/07/2021 02:12:28 - INFO - __main__ - Step 35277: {'lr': 0.0004401841816414962, 'samples': 6773184, 'steps': 35276, 'loss/train': 0.5344533920288086} 11/07/2021 02:12:29 - INFO - __main__ - Step 35278: {'lr': 0.0004401807372001004, 'samples': 6773376, 'steps': 35277, 'loss/train': 1.63253915309906} 11/07/2021 02:12:29 - INFO - __main__ - Step 35279: {'lr': 0.0004401772926730122, 'samples': 6773568, 'steps': 35278, 'loss/train': 1.8484654426574707} 11/07/2021 02:12:30 - INFO - __main__ - Step 35280: {'lr': 0.0004401738480602329, 'samples': 6773760, 'steps': 35279, 'loss/train': 1.147681713104248} 11/07/2021 02:12:30 - INFO - __main__ - Step 35281: {'lr': 0.0004401704033617643, 'samples': 6773952, 'steps': 35280, 'loss/train': 1.761243462562561} 11/07/2021 02:12:30 - INFO - __main__ - Step 35282: {'lr': 0.0004401669585776078, 'samples': 6774144, 'steps': 35281, 'loss/train': 1.5396783351898193} 11/07/2021 02:12:31 - INFO - __main__ - Step 35283: {'lr': 0.000440163513707765, 'samples': 6774336, 'steps': 35282, 'loss/train': 1.0905910730361938} 11/07/2021 02:12:32 - INFO - __main__ - Step 35284: {'lr': 0.00044016006875223745, 'samples': 6774528, 'steps': 35283, 'loss/train': 1.369025468826294} 11/07/2021 02:12:32 - INFO - __main__ - Step 35285: {'lr': 0.00044015662371102676, 'samples': 6774720, 'steps': 35284, 'loss/train': 1.5036505460739136} 11/07/2021 02:12:32 - INFO - __main__ - Step 35286: {'lr': 0.0004401531785841344, 'samples': 6774912, 'steps': 35285, 'loss/train': 1.7355401515960693} 11/07/2021 02:12:33 - INFO - __main__ - Step 35287: {'lr': 0.00044014973337156197, 'samples': 6775104, 'steps': 35286, 'loss/train': 1.3399806022644043} 11/07/2021 02:12:34 - INFO - __main__ - Step 35288: {'lr': 0.0004401462880733109, 'samples': 6775296, 'steps': 35287, 'loss/train': 1.9543825387954712} 11/07/2021 02:12:34 - INFO - __main__ - Step 35289: {'lr': 0.000440142842689383, 'samples': 6775488, 'steps': 35288, 'loss/train': 1.6181656122207642} 11/07/2021 02:12:34 - INFO - __main__ - Step 35290: {'lr': 0.00044013939721977957, 'samples': 6775680, 'steps': 35289, 'loss/train': 2.1377604007720947} 11/07/2021 02:12:35 - INFO - __main__ - Step 35291: {'lr': 0.0004401359516645023, 'samples': 6775872, 'steps': 35290, 'loss/train': 1.8301565647125244} 11/07/2021 02:12:35 - INFO - __main__ - Step 35292: {'lr': 0.0004401325060235527, 'samples': 6776064, 'steps': 35291, 'loss/train': 1.776128888130188} 11/07/2021 02:12:36 - INFO - __main__ - Step 35293: {'lr': 0.00044012906029693236, 'samples': 6776256, 'steps': 35292, 'loss/train': 1.6864397525787354} 11/07/2021 02:12:37 - INFO - __main__ - Step 35294: {'lr': 0.0004401256144846427, 'samples': 6776448, 'steps': 35293, 'loss/train': 0.7780770659446716} 11/07/2021 02:12:37 - INFO - __main__ - Step 35295: {'lr': 0.0004401221685866854, 'samples': 6776640, 'steps': 35294, 'loss/train': 1.1424438953399658} 11/07/2021 02:12:37 - INFO - __main__ - Step 35296: {'lr': 0.00044011872260306205, 'samples': 6776832, 'steps': 35295, 'loss/train': 1.2669594287872314} 11/07/2021 02:12:38 - INFO - __main__ - Step 35297: {'lr': 0.00044011527653377416, 'samples': 6777024, 'steps': 35296, 'loss/train': 1.7180006504058838} 11/07/2021 02:12:38 - INFO - __main__ - Step 35298: {'lr': 0.0004401118303788232, 'samples': 6777216, 'steps': 35297, 'loss/train': 1.8391923904418945} 11/07/2021 02:12:39 - INFO - __main__ - Step 35299: {'lr': 0.00044010838413821075, 'samples': 6777408, 'steps': 35298, 'loss/train': 1.658327341079712} 11/07/2021 02:12:39 - INFO - __main__ - Step 35300: {'lr': 0.0004401049378119384, 'samples': 6777600, 'steps': 35299, 'loss/train': 1.3578120470046997} 11/07/2021 02:12:40 - INFO - __main__ - Step 35301: {'lr': 0.0004401014914000078, 'samples': 6777792, 'steps': 35300, 'loss/train': 1.9098882675170898} 11/07/2021 02:12:40 - INFO - __main__ - Step 35302: {'lr': 0.00044009804490242026, 'samples': 6777984, 'steps': 35301, 'loss/train': 0.855135977268219} 11/07/2021 02:12:40 - INFO - __main__ - Step 35303: {'lr': 0.00044009459831917755, 'samples': 6778176, 'steps': 35302, 'loss/train': 1.5483982563018799} 11/07/2021 02:12:42 - INFO - __main__ - Step 35304: {'lr': 0.00044009115165028113, 'samples': 6778368, 'steps': 35303, 'loss/train': 1.7052702903747559} 11/07/2021 02:12:42 - INFO - __main__ - Step 35305: {'lr': 0.0004400877048957326, 'samples': 6778560, 'steps': 35304, 'loss/train': 1.751997470855713} 11/07/2021 02:12:42 - INFO - __main__ - Step 35306: {'lr': 0.00044008425805553347, 'samples': 6778752, 'steps': 35305, 'loss/train': 2.147096872329712} 11/07/2021 02:12:43 - INFO - __main__ - Step 35307: {'lr': 0.00044008081112968537, 'samples': 6778944, 'steps': 35306, 'loss/train': 1.7320812940597534} 11/07/2021 02:12:43 - INFO - __main__ - Step 35308: {'lr': 0.0004400773641181897, 'samples': 6779136, 'steps': 35307, 'loss/train': 1.7000387907028198} 11/07/2021 02:12:44 - INFO - __main__ - Step 35309: {'lr': 0.0004400739170210481, 'samples': 6779328, 'steps': 35308, 'loss/train': 1.2110624313354492} 11/07/2021 02:12:44 - INFO - __main__ - Step 35310: {'lr': 0.00044007046983826213, 'samples': 6779520, 'steps': 35309, 'loss/train': 0.6447055339813232} 11/07/2021 02:12:45 - INFO - __main__ - Step 35311: {'lr': 0.0004400670225698333, 'samples': 6779712, 'steps': 35310, 'loss/train': 0.7492512464523315} 11/07/2021 02:12:45 - INFO - __main__ - Step 35312: {'lr': 0.00044006357521576334, 'samples': 6779904, 'steps': 35311, 'loss/train': 0.9323607087135315} 11/07/2021 02:12:45 - INFO - __main__ - Step 35313: {'lr': 0.0004400601277760536, 'samples': 6780096, 'steps': 35312, 'loss/train': 1.7272684574127197} 11/07/2021 02:12:46 - INFO - __main__ - Step 35314: {'lr': 0.0004400566802507057, 'samples': 6780288, 'steps': 35313, 'loss/train': 1.1753439903259277} 11/07/2021 02:12:47 - INFO - __main__ - Step 35315: {'lr': 0.0004400532326397211, 'samples': 6780480, 'steps': 35314, 'loss/train': 1.8355354070663452} 11/07/2021 02:12:47 - INFO - __main__ - Step 35316: {'lr': 0.00044004978494310154, 'samples': 6780672, 'steps': 35315, 'loss/train': 0.8124325275421143} 11/07/2021 02:12:47 - INFO - __main__ - Step 35317: {'lr': 0.00044004633716084854, 'samples': 6780864, 'steps': 35316, 'loss/train': 1.5831191539764404} 11/07/2021 02:12:48 - INFO - __main__ - Step 35318: {'lr': 0.0004400428892929635, 'samples': 6781056, 'steps': 35317, 'loss/train': 1.7521748542785645} 11/07/2021 02:12:48 - INFO - __main__ - Step 35319: {'lr': 0.00044003944133944804, 'samples': 6781248, 'steps': 35318, 'loss/train': 1.0224285125732422} 11/07/2021 02:12:49 - INFO - __main__ - Step 35320: {'lr': 0.00044003599330030385, 'samples': 6781440, 'steps': 35319, 'loss/train': 1.5260698795318604} 11/07/2021 02:12:50 - INFO - __main__ - Step 35321: {'lr': 0.00044003254517553225, 'samples': 6781632, 'steps': 35320, 'loss/train': 1.7216631174087524} 11/07/2021 02:12:50 - INFO - __main__ - Step 35322: {'lr': 0.000440029096965135, 'samples': 6781824, 'steps': 35321, 'loss/train': 1.5419079065322876} 11/07/2021 02:12:50 - INFO - __main__ - Step 35323: {'lr': 0.0004400256486691135, 'samples': 6782016, 'steps': 35322, 'loss/train': 1.5323067903518677} 11/07/2021 02:12:51 - INFO - __main__ - Step 35324: {'lr': 0.0004400222002874695, 'samples': 6782208, 'steps': 35323, 'loss/train': 1.541028380393982} 11/07/2021 02:12:52 - INFO - __main__ - Step 35325: {'lr': 0.0004400187518202043, 'samples': 6782400, 'steps': 35324, 'loss/train': 1.1801986694335938} 11/07/2021 02:12:53 - INFO - __main__ - Step 35326: {'lr': 0.00044001530326731966, 'samples': 6782592, 'steps': 35325, 'loss/train': 0.6416899561882019} 11/07/2021 02:12:53 - INFO - __main__ - Step 35327: {'lr': 0.00044001185462881707, 'samples': 6782784, 'steps': 35326, 'loss/train': 1.5198839902877808} 11/07/2021 02:12:53 - INFO - __main__ - Step 35328: {'lr': 0.000440008405904698, 'samples': 6782976, 'steps': 35327, 'loss/train': 1.5916683673858643} 11/07/2021 02:12:54 - INFO - __main__ - Step 35329: {'lr': 0.0004400049570949641, 'samples': 6783168, 'steps': 35328, 'loss/train': 1.5998841524124146} 11/07/2021 02:12:55 - INFO - __main__ - Step 35330: {'lr': 0.0004400015081996169, 'samples': 6783360, 'steps': 35329, 'loss/train': 1.3884435892105103} 11/07/2021 02:12:55 - INFO - __main__ - Step 35331: {'lr': 0.000439998059218658, 'samples': 6783552, 'steps': 35330, 'loss/train': 1.563869833946228} 11/07/2021 02:12:55 - INFO - __main__ - Step 35332: {'lr': 0.0004399946101520889, 'samples': 6783744, 'steps': 35331, 'loss/train': 1.4010270833969116} 11/07/2021 02:12:56 - INFO - __main__ - Step 35333: {'lr': 0.0004399911609999111, 'samples': 6783936, 'steps': 35332, 'loss/train': 1.2245500087738037} 11/07/2021 02:12:56 - INFO - __main__ - Step 35334: {'lr': 0.0004399877117621262, 'samples': 6784128, 'steps': 35333, 'loss/train': 0.999417781829834} 11/07/2021 02:12:57 - INFO - __main__ - Step 35335: {'lr': 0.0004399842624387358, 'samples': 6784320, 'steps': 35334, 'loss/train': 2.229874610900879} 11/07/2021 02:12:57 - INFO - __main__ - Step 35336: {'lr': 0.0004399808130297415, 'samples': 6784512, 'steps': 35335, 'loss/train': 1.3630006313323975} 11/07/2021 02:12:58 - INFO - __main__ - Step 35337: {'lr': 0.0004399773635351446, 'samples': 6784704, 'steps': 35336, 'loss/train': 1.609230637550354} 11/07/2021 02:12:58 - INFO - __main__ - Step 35338: {'lr': 0.000439973913954947, 'samples': 6784896, 'steps': 35337, 'loss/train': 1.4650626182556152} 11/07/2021 02:12:58 - INFO - __main__ - Step 35339: {'lr': 0.00043997046428915, 'samples': 6785088, 'steps': 35338, 'loss/train': 1.5246323347091675} 11/07/2021 02:12:59 - INFO - __main__ - Step 35340: {'lr': 0.00043996701453775526, 'samples': 6785280, 'steps': 35339, 'loss/train': 1.4155287742614746} 11/07/2021 02:13:00 - INFO - __main__ - Step 35341: {'lr': 0.0004399635647007643, 'samples': 6785472, 'steps': 35340, 'loss/train': 1.72676420211792} 11/07/2021 02:13:00 - INFO - __main__ - Step 35342: {'lr': 0.00043996011477817875, 'samples': 6785664, 'steps': 35341, 'loss/train': 2.053565740585327} 11/07/2021 02:13:00 - INFO - __main__ - Step 35343: {'lr': 0.0004399566647700001, 'samples': 6785856, 'steps': 35342, 'loss/train': 1.2722868919372559} 11/07/2021 02:13:01 - INFO - __main__ - Step 35344: {'lr': 0.00043995321467622984, 'samples': 6786048, 'steps': 35343, 'loss/train': 1.4926680326461792} 11/07/2021 02:13:02 - INFO - __main__ - Step 35345: {'lr': 0.00043994976449686964, 'samples': 6786240, 'steps': 35344, 'loss/train': 1.2482376098632812} 11/07/2021 02:13:02 - INFO - __main__ - Step 35346: {'lr': 0.000439946314231921, 'samples': 6786432, 'steps': 35345, 'loss/train': 1.3723719120025635} 11/07/2021 02:13:03 - INFO - __main__ - Step 35347: {'lr': 0.00043994286388138545, 'samples': 6786624, 'steps': 35346, 'loss/train': 1.4757342338562012} 11/07/2021 02:13:03 - INFO - __main__ - Step 35348: {'lr': 0.00043993941344526455, 'samples': 6786816, 'steps': 35347, 'loss/train': 1.232430100440979} 11/07/2021 02:13:03 - INFO - __main__ - Step 35349: {'lr': 0.00043993596292356, 'samples': 6787008, 'steps': 35348, 'loss/train': 1.8471217155456543} 11/07/2021 02:13:04 - INFO - __main__ - Step 35350: {'lr': 0.00043993251231627315, 'samples': 6787200, 'steps': 35349, 'loss/train': 1.4239959716796875} 11/07/2021 02:13:05 - INFO - __main__ - Step 35351: {'lr': 0.00043992906162340563, 'samples': 6787392, 'steps': 35350, 'loss/train': 1.6814604997634888} 11/07/2021 02:13:05 - INFO - __main__ - Step 35352: {'lr': 0.00043992561084495906, 'samples': 6787584, 'steps': 35351, 'loss/train': 0.9884973168373108} 11/07/2021 02:13:05 - INFO - __main__ - Step 35353: {'lr': 0.0004399221599809349, 'samples': 6787776, 'steps': 35352, 'loss/train': 1.569356918334961} 11/07/2021 02:13:06 - INFO - __main__ - Step 35354: {'lr': 0.0004399187090313348, 'samples': 6787968, 'steps': 35353, 'loss/train': 1.5267250537872314} 11/07/2021 02:13:06 - INFO - __main__ - Step 35355: {'lr': 0.00043991525799616017, 'samples': 6788160, 'steps': 35354, 'loss/train': 1.7280054092407227} 11/07/2021 02:13:07 - INFO - __main__ - Step 35356: {'lr': 0.0004399118068754127, 'samples': 6788352, 'steps': 35355, 'loss/train': 1.4081023931503296} 11/07/2021 02:13:08 - INFO - __main__ - Step 35357: {'lr': 0.0004399083556690939, 'samples': 6788544, 'steps': 35356, 'loss/train': 1.168870449066162} 11/07/2021 02:13:08 - INFO - __main__ - Step 35358: {'lr': 0.0004399049043772053, 'samples': 6788736, 'steps': 35357, 'loss/train': 1.4735119342803955} 11/07/2021 02:13:08 - INFO - __main__ - Step 35359: {'lr': 0.00043990145299974853, 'samples': 6788928, 'steps': 35358, 'loss/train': 1.1163095235824585} 11/07/2021 02:13:09 - INFO - __main__ - Step 35360: {'lr': 0.0004398980015367251, 'samples': 6789120, 'steps': 35359, 'loss/train': 5.794185638427734} 11/07/2021 02:13:10 - INFO - __main__ - Step 35361: {'lr': 0.00043989454998813655, 'samples': 6789312, 'steps': 35360, 'loss/train': 1.7031978368759155} 11/07/2021 02:13:10 - INFO - __main__ - Step 35362: {'lr': 0.00043989109835398444, 'samples': 6789504, 'steps': 35361, 'loss/train': 1.6699821949005127} 11/07/2021 02:13:10 - INFO - __main__ - Step 35363: {'lr': 0.0004398876466342703, 'samples': 6789696, 'steps': 35362, 'loss/train': 1.4967632293701172} 11/07/2021 02:13:11 - INFO - __main__ - Step 35364: {'lr': 0.0004398841948289958, 'samples': 6789888, 'steps': 35363, 'loss/train': 0.6771053075790405} 11/07/2021 02:13:11 - INFO - __main__ - Step 35365: {'lr': 0.0004398807429381623, 'samples': 6790080, 'steps': 35364, 'loss/train': 1.4188239574432373} 11/07/2021 02:13:11 - INFO - __main__ - Step 35366: {'lr': 0.0004398772909617715, 'samples': 6790272, 'steps': 35365, 'loss/train': 1.3704005479812622} 11/07/2021 02:13:12 - INFO - __main__ - Step 35367: {'lr': 0.00043987383889982495, 'samples': 6790464, 'steps': 35366, 'loss/train': 1.699660301208496} 11/07/2021 02:13:13 - INFO - __main__ - Step 35368: {'lr': 0.00043987038675232415, 'samples': 6790656, 'steps': 35367, 'loss/train': 1.3130336999893188} 11/07/2021 02:13:13 - INFO - __main__ - Step 35369: {'lr': 0.00043986693451927074, 'samples': 6790848, 'steps': 35368, 'loss/train': 0.9575570225715637} 11/07/2021 02:13:13 - INFO - __main__ - Step 35370: {'lr': 0.0004398634822006662, 'samples': 6791040, 'steps': 35369, 'loss/train': 0.9162417650222778} 11/07/2021 02:13:14 - INFO - __main__ - Step 35371: {'lr': 0.0004398600297965121, 'samples': 6791232, 'steps': 35370, 'loss/train': 1.3104331493377686} 11/07/2021 02:13:15 - INFO - __main__ - Step 35372: {'lr': 0.00043985657730680997, 'samples': 6791424, 'steps': 35371, 'loss/train': 1.8338526487350464} 11/07/2021 02:13:15 - INFO - __main__ - Step 35373: {'lr': 0.00043985312473156143, 'samples': 6791616, 'steps': 35372, 'loss/train': 1.758212924003601} 11/07/2021 02:13:16 - INFO - __main__ - Step 35374: {'lr': 0.000439849672070768, 'samples': 6791808, 'steps': 35373, 'loss/train': 1.1626574993133545} 11/07/2021 02:13:16 - INFO - __main__ - Step 35375: {'lr': 0.00043984621932443115, 'samples': 6792000, 'steps': 35374, 'loss/train': 1.2128241062164307} 11/07/2021 02:13:16 - INFO - __main__ - Step 35376: {'lr': 0.0004398427664925526, 'samples': 6792192, 'steps': 35375, 'loss/train': 1.3160113096237183} 11/07/2021 02:13:17 - INFO - __main__ - Step 35377: {'lr': 0.0004398393135751338, 'samples': 6792384, 'steps': 35376, 'loss/train': 0.8111445903778076} 11/07/2021 02:13:18 - INFO - __main__ - Step 35378: {'lr': 0.0004398358605721764, 'samples': 6792576, 'steps': 35377, 'loss/train': 1.995750904083252} 11/07/2021 02:13:18 - INFO - __main__ - Step 35379: {'lr': 0.00043983240748368186, 'samples': 6792768, 'steps': 35378, 'loss/train': 1.58150315284729} 11/07/2021 02:13:18 - INFO - __main__ - Step 35380: {'lr': 0.0004398289543096518, 'samples': 6792960, 'steps': 35379, 'loss/train': 1.2393131256103516} 11/07/2021 02:13:19 - INFO - __main__ - Step 35381: {'lr': 0.0004398255010500877, 'samples': 6793152, 'steps': 35380, 'loss/train': 1.5714269876480103} 11/07/2021 02:13:20 - INFO - __main__ - Step 35382: {'lr': 0.00043982204770499114, 'samples': 6793344, 'steps': 35381, 'loss/train': 1.323848843574524} 11/07/2021 02:13:20 - INFO - __main__ - Step 35383: {'lr': 0.0004398185942743637, 'samples': 6793536, 'steps': 35382, 'loss/train': 1.5941745042800903} 11/07/2021 02:13:21 - INFO - __main__ - Step 35384: {'lr': 0.00043981514075820693, 'samples': 6793728, 'steps': 35383, 'loss/train': 1.4334259033203125} 11/07/2021 02:13:21 - INFO - __main__ - Step 35385: {'lr': 0.0004398116871565224, 'samples': 6793920, 'steps': 35384, 'loss/train': 1.3924572467803955} 11/07/2021 02:13:21 - INFO - __main__ - Step 35386: {'lr': 0.0004398082334693116, 'samples': 6794112, 'steps': 35385, 'loss/train': 2.0170538425445557} 11/07/2021 02:13:23 - INFO - __main__ - Step 35387: {'lr': 0.0004398047796965762, 'samples': 6794304, 'steps': 35386, 'loss/train': 0.7017478942871094} 11/07/2021 02:13:23 - INFO - __main__ - Step 35388: {'lr': 0.0004398013258383177, 'samples': 6794496, 'steps': 35387, 'loss/train': 1.5313912630081177} 11/07/2021 02:13:23 - INFO - __main__ - Step 35389: {'lr': 0.0004397978718945377, 'samples': 6794688, 'steps': 35388, 'loss/train': 1.1975125074386597} 11/07/2021 02:13:24 - INFO - __main__ - Step 35390: {'lr': 0.0004397944178652376, 'samples': 6794880, 'steps': 35389, 'loss/train': 1.7182281017303467} 11/07/2021 02:13:24 - INFO - __main__ - Step 35391: {'lr': 0.0004397909637504191, 'samples': 6795072, 'steps': 35390, 'loss/train': 1.3362889289855957} 11/07/2021 02:13:24 - INFO - __main__ - Step 35392: {'lr': 0.00043978750955008374, 'samples': 6795264, 'steps': 35391, 'loss/train': 1.5932806730270386} 11/07/2021 02:13:26 - INFO - __main__ - Step 35393: {'lr': 0.00043978405526423305, 'samples': 6795456, 'steps': 35392, 'loss/train': 3.2164599895477295} 11/07/2021 02:13:26 - INFO - __main__ - Step 35394: {'lr': 0.0004397806008928686, 'samples': 6795648, 'steps': 35393, 'loss/train': 1.7339038848876953} 11/07/2021 02:13:26 - INFO - __main__ - Step 35395: {'lr': 0.00043977714643599194, 'samples': 6795840, 'steps': 35394, 'loss/train': 1.467126488685608} 11/07/2021 02:13:27 - INFO - __main__ - Step 35396: {'lr': 0.0004397736918936046, 'samples': 6796032, 'steps': 35395, 'loss/train': 1.4847302436828613} 11/07/2021 02:13:27 - INFO - __main__ - Step 35397: {'lr': 0.0004397702372657082, 'samples': 6796224, 'steps': 35396, 'loss/train': 1.2340567111968994} 11/07/2021 02:13:28 - INFO - __main__ - Step 35398: {'lr': 0.00043976678255230417, 'samples': 6796416, 'steps': 35397, 'loss/train': 1.5708730220794678} 11/07/2021 02:13:28 - INFO - __main__ - Step 35399: {'lr': 0.0004397633277533942, 'samples': 6796608, 'steps': 35398, 'loss/train': 1.2345236539840698} 11/07/2021 02:13:29 - INFO - __main__ - Step 35400: {'lr': 0.0004397598728689799, 'samples': 6796800, 'steps': 35399, 'loss/train': 1.1859363317489624} 11/07/2021 02:13:29 - INFO - __main__ - Step 35401: {'lr': 0.0004397564178990626, 'samples': 6796992, 'steps': 35400, 'loss/train': 1.856229305267334} 11/07/2021 02:13:30 - INFO - __main__ - Step 35402: {'lr': 0.0004397529628436441, 'samples': 6797184, 'steps': 35401, 'loss/train': 1.287499189376831} 11/07/2021 02:13:30 - INFO - __main__ - Step 35403: {'lr': 0.0004397495077027258, 'samples': 6797376, 'steps': 35402, 'loss/train': 1.4517358541488647} 11/07/2021 02:13:31 - INFO - __main__ - Step 35404: {'lr': 0.0004397460524763093, 'samples': 6797568, 'steps': 35403, 'loss/train': 0.9907420873641968} 11/07/2021 02:13:31 - INFO - __main__ - Step 35405: {'lr': 0.00043974259716439613, 'samples': 6797760, 'steps': 35404, 'loss/train': 1.5516440868377686} 11/07/2021 02:13:32 - INFO - __main__ - Step 35406: {'lr': 0.0004397391417669878, 'samples': 6797952, 'steps': 35405, 'loss/train': 1.601456880569458} 11/07/2021 02:13:32 - INFO - __main__ - Step 35407: {'lr': 0.0004397356862840861, 'samples': 6798144, 'steps': 35406, 'loss/train': 1.5725170373916626} 11/07/2021 02:13:33 - INFO - __main__ - Step 35408: {'lr': 0.00043973223071569234, 'samples': 6798336, 'steps': 35407, 'loss/train': 1.9015896320343018} 11/07/2021 02:13:33 - INFO - __main__ - Step 35409: {'lr': 0.0004397287750618082, 'samples': 6798528, 'steps': 35408, 'loss/train': 2.284215211868286} 11/07/2021 02:13:34 - INFO - __main__ - Step 35410: {'lr': 0.00043972531932243516, 'samples': 6798720, 'steps': 35409, 'loss/train': 1.706610918045044} 11/07/2021 02:13:34 - INFO - __main__ - Step 35411: {'lr': 0.00043972186349757484, 'samples': 6798912, 'steps': 35410, 'loss/train': 1.4776911735534668} 11/07/2021 02:13:34 - INFO - __main__ - Step 35412: {'lr': 0.0004397184075872288, 'samples': 6799104, 'steps': 35411, 'loss/train': 1.2144941091537476} 11/07/2021 02:13:35 - INFO - __main__ - Step 35413: {'lr': 0.0004397149515913985, 'samples': 6799296, 'steps': 35412, 'loss/train': 5.854411602020264} 11/07/2021 02:13:36 - INFO - __main__ - Step 35414: {'lr': 0.0004397114955100856, 'samples': 6799488, 'steps': 35413, 'loss/train': 1.5621206760406494} 11/07/2021 02:13:36 - INFO - __main__ - Step 35415: {'lr': 0.00043970803934329167, 'samples': 6799680, 'steps': 35414, 'loss/train': 1.4797799587249756} 11/07/2021 02:13:37 - INFO - __main__ - Step 35416: {'lr': 0.00043970458309101825, 'samples': 6799872, 'steps': 35415, 'loss/train': 1.556593656539917} 11/07/2021 02:13:37 - INFO - __main__ - Step 35417: {'lr': 0.0004397011267532668, 'samples': 6800064, 'steps': 35416, 'loss/train': 1.8843176364898682} 11/07/2021 02:13:37 - INFO - __main__ - Step 35418: {'lr': 0.00043969767033003894, 'samples': 6800256, 'steps': 35417, 'loss/train': 1.3072028160095215} 11/07/2021 02:13:38 - INFO - __main__ - Step 35419: {'lr': 0.0004396942138213363, 'samples': 6800448, 'steps': 35418, 'loss/train': 1.5829036235809326} 11/07/2021 02:13:38 - INFO - __main__ - Step 35420: {'lr': 0.00043969075722716033, 'samples': 6800640, 'steps': 35419, 'loss/train': 1.3367642164230347} 11/07/2021 02:13:39 - INFO - __main__ - Step 35421: {'lr': 0.0004396873005475127, 'samples': 6800832, 'steps': 35420, 'loss/train': 1.5796177387237549} 11/07/2021 02:13:39 - INFO - __main__ - Step 35422: {'lr': 0.00043968384378239477, 'samples': 6801024, 'steps': 35421, 'loss/train': 1.3193522691726685} 11/07/2021 02:13:40 - INFO - __main__ - Step 35423: {'lr': 0.00043968038693180834, 'samples': 6801216, 'steps': 35422, 'loss/train': 1.4077221155166626} 11/07/2021 02:13:40 - INFO - __main__ - Step 35424: {'lr': 0.00043967692999575484, 'samples': 6801408, 'steps': 35423, 'loss/train': 1.858082890510559} 11/07/2021 02:13:41 - INFO - __main__ - Step 35425: {'lr': 0.00043967347297423575, 'samples': 6801600, 'steps': 35424, 'loss/train': 1.2148572206497192} 11/07/2021 02:13:41 - INFO - __main__ - Step 35426: {'lr': 0.0004396700158672528, 'samples': 6801792, 'steps': 35425, 'loss/train': 0.840027928352356} 11/07/2021 02:13:42 - INFO - __main__ - Step 35427: {'lr': 0.0004396665586748075, 'samples': 6801984, 'steps': 35426, 'loss/train': 1.7397141456604004} 11/07/2021 02:13:42 - INFO - __main__ - Step 35428: {'lr': 0.0004396631013969013, 'samples': 6802176, 'steps': 35427, 'loss/train': 1.712583065032959} 11/07/2021 02:13:43 - INFO - __main__ - Step 35429: {'lr': 0.0004396596440335359, 'samples': 6802368, 'steps': 35428, 'loss/train': 0.4670521020889282} 11/07/2021 02:13:43 - INFO - __main__ - Step 35430: {'lr': 0.00043965618658471276, 'samples': 6802560, 'steps': 35429, 'loss/train': 1.2149841785430908} 11/07/2021 02:13:44 - INFO - __main__ - Step 35431: {'lr': 0.0004396527290504334, 'samples': 6802752, 'steps': 35430, 'loss/train': 3.315039873123169} 11/07/2021 02:13:44 - INFO - __main__ - Step 35432: {'lr': 0.00043964927143069955, 'samples': 6802944, 'steps': 35431, 'loss/train': 1.2933154106140137} 11/07/2021 02:13:44 - INFO - __main__ - Step 35433: {'lr': 0.0004396458137255126, 'samples': 6803136, 'steps': 35432, 'loss/train': 1.6630334854125977} 11/07/2021 02:13:45 - INFO - __main__ - Step 35434: {'lr': 0.0004396423559348742, 'samples': 6803328, 'steps': 35433, 'loss/train': 1.2026915550231934} 11/07/2021 02:13:46 - INFO - __main__ - Step 35435: {'lr': 0.0004396388980587859, 'samples': 6803520, 'steps': 35434, 'loss/train': 1.7204598188400269} 11/07/2021 02:13:46 - INFO - __main__ - Step 35436: {'lr': 0.0004396354400972492, 'samples': 6803712, 'steps': 35435, 'loss/train': 1.4436103105545044} 11/07/2021 02:13:46 - INFO - __main__ - Step 35437: {'lr': 0.0004396319820502657, 'samples': 6803904, 'steps': 35436, 'loss/train': 1.448534607887268} 11/07/2021 02:13:47 - INFO - __main__ - Step 35438: {'lr': 0.000439628523917837, 'samples': 6804096, 'steps': 35437, 'loss/train': 1.177272915840149} 11/07/2021 02:13:47 - INFO - __main__ - Step 35439: {'lr': 0.0004396250656999646, 'samples': 6804288, 'steps': 35438, 'loss/train': 1.060101866722107} 11/07/2021 02:13:48 - INFO - __main__ - Step 35440: {'lr': 0.00043962160739665, 'samples': 6804480, 'steps': 35439, 'loss/train': 1.3558481931686401} 11/07/2021 02:13:49 - INFO - __main__ - Step 35441: {'lr': 0.0004396181490078949, 'samples': 6804672, 'steps': 35440, 'loss/train': 1.5725367069244385} 11/07/2021 02:13:49 - INFO - __main__ - Step 35442: {'lr': 0.0004396146905337008, 'samples': 6804864, 'steps': 35441, 'loss/train': 1.5690183639526367} 11/07/2021 02:13:49 - INFO - __main__ - Step 35443: {'lr': 0.0004396112319740692, 'samples': 6805056, 'steps': 35442, 'loss/train': 1.7257599830627441} 11/07/2021 02:13:50 - INFO - __main__ - Step 35444: {'lr': 0.0004396077733290017, 'samples': 6805248, 'steps': 35443, 'loss/train': 1.7161669731140137} 11/07/2021 02:13:50 - INFO - __main__ - Step 35445: {'lr': 0.00043960431459849993, 'samples': 6805440, 'steps': 35444, 'loss/train': 2.2584564685821533} 11/07/2021 02:13:51 - INFO - __main__ - Step 35446: {'lr': 0.00043960085578256537, 'samples': 6805632, 'steps': 35445, 'loss/train': 1.974791407585144} 11/07/2021 02:13:51 - INFO - __main__ - Step 35447: {'lr': 0.0004395973968811995, 'samples': 6805824, 'steps': 35446, 'loss/train': 1.463413119316101} 11/07/2021 02:13:52 - INFO - __main__ - Step 35448: {'lr': 0.00043959393789440407, 'samples': 6806016, 'steps': 35447, 'loss/train': 2.0213165283203125} 11/07/2021 02:13:52 - INFO - __main__ - Step 35449: {'lr': 0.0004395904788221805, 'samples': 6806208, 'steps': 35448, 'loss/train': 1.3373453617095947} 11/07/2021 02:13:52 - INFO - __main__ - Step 35450: {'lr': 0.00043958701966453033, 'samples': 6806400, 'steps': 35449, 'loss/train': 1.6128551959991455} 11/07/2021 02:13:53 - INFO - __main__ - Step 35451: {'lr': 0.00043958356042145524, 'samples': 6806592, 'steps': 35450, 'loss/train': 1.284608244895935} 11/07/2021 02:13:54 - INFO - __main__ - Step 35452: {'lr': 0.0004395801010929567, 'samples': 6806784, 'steps': 35451, 'loss/train': 0.7865902185440063} 11/07/2021 02:13:54 - INFO - __main__ - Step 35453: {'lr': 0.0004395766416790363, 'samples': 6806976, 'steps': 35452, 'loss/train': 0.9731584787368774} 11/07/2021 02:13:54 - INFO - __main__ - Step 35454: {'lr': 0.0004395731821796956, 'samples': 6807168, 'steps': 35453, 'loss/train': 1.5661472082138062} 11/07/2021 02:13:55 - INFO - __main__ - Step 35455: {'lr': 0.00043956972259493615, 'samples': 6807360, 'steps': 35454, 'loss/train': 1.5036189556121826} 11/07/2021 02:13:56 - INFO - __main__ - Step 35456: {'lr': 0.0004395662629247595, 'samples': 6807552, 'steps': 35455, 'loss/train': 1.6254386901855469} 11/07/2021 02:13:56 - INFO - __main__ - Step 35457: {'lr': 0.0004395628031691672, 'samples': 6807744, 'steps': 35456, 'loss/train': 1.3540188074111938} 11/07/2021 02:13:57 - INFO - __main__ - Step 35458: {'lr': 0.00043955934332816083, 'samples': 6807936, 'steps': 35457, 'loss/train': 0.7331323623657227} 11/07/2021 02:13:57 - INFO - __main__ - Step 35459: {'lr': 0.00043955588340174195, 'samples': 6808128, 'steps': 35458, 'loss/train': 2.366567373275757} 11/07/2021 02:13:57 - INFO - __main__ - Step 35460: {'lr': 0.00043955242338991217, 'samples': 6808320, 'steps': 35459, 'loss/train': 0.8286648988723755} 11/07/2021 02:13:58 - INFO - __main__ - Step 35461: {'lr': 0.0004395489632926729, 'samples': 6808512, 'steps': 35460, 'loss/train': 1.2756918668746948} 11/07/2021 02:13:59 - INFO - __main__ - Step 35462: {'lr': 0.0004395455031100258, 'samples': 6808704, 'steps': 35461, 'loss/train': 0.8456819653511047} 11/07/2021 02:13:59 - INFO - __main__ - Step 35463: {'lr': 0.0004395420428419725, 'samples': 6808896, 'steps': 35462, 'loss/train': 1.4862419366836548} 11/07/2021 02:14:00 - INFO - __main__ - Step 35464: {'lr': 0.0004395385824885144, 'samples': 6809088, 'steps': 35463, 'loss/train': 1.42411208152771} 11/07/2021 02:14:00 - INFO - __main__ - Step 35465: {'lr': 0.0004395351220496532, 'samples': 6809280, 'steps': 35464, 'loss/train': 1.655928611755371} 11/07/2021 02:14:00 - INFO - __main__ - Step 35466: {'lr': 0.00043953166152539035, 'samples': 6809472, 'steps': 35465, 'loss/train': 1.353151559829712} 11/07/2021 02:14:01 - INFO - __main__ - Step 35467: {'lr': 0.00043952820091572753, 'samples': 6809664, 'steps': 35466, 'loss/train': 1.1998428106307983} 11/07/2021 02:14:02 - INFO - __main__ - Step 35468: {'lr': 0.0004395247402206662, 'samples': 6809856, 'steps': 35467, 'loss/train': 1.4462324380874634} 11/07/2021 02:14:02 - INFO - __main__ - Step 35469: {'lr': 0.0004395212794402079, 'samples': 6810048, 'steps': 35468, 'loss/train': 1.8410362005233765} 11/07/2021 02:14:02 - INFO - __main__ - Step 35470: {'lr': 0.00043951781857435424, 'samples': 6810240, 'steps': 35469, 'loss/train': 1.621268391609192} 11/07/2021 02:14:03 - INFO - __main__ - Step 35471: {'lr': 0.00043951435762310686, 'samples': 6810432, 'steps': 35470, 'loss/train': 1.4175527095794678} 11/07/2021 02:14:04 - INFO - __main__ - Step 35472: {'lr': 0.0004395108965864671, 'samples': 6810624, 'steps': 35471, 'loss/train': 1.7941079139709473} 11/07/2021 02:14:04 - INFO - __main__ - Step 35473: {'lr': 0.00043950743546443676, 'samples': 6810816, 'steps': 35472, 'loss/train': 1.252071738243103} 11/07/2021 02:14:04 - INFO - __main__ - Step 35474: {'lr': 0.0004395039742570173, 'samples': 6811008, 'steps': 35473, 'loss/train': 1.4083306789398193} 11/07/2021 02:14:05 - INFO - __main__ - Step 35475: {'lr': 0.00043950051296421023, 'samples': 6811200, 'steps': 35474, 'loss/train': 1.6213352680206299} 11/07/2021 02:14:05 - INFO - __main__ - Step 35476: {'lr': 0.00043949705158601715, 'samples': 6811392, 'steps': 35475, 'loss/train': 1.4592680931091309} 11/07/2021 02:14:06 - INFO - __main__ - Step 35477: {'lr': 0.00043949359012243963, 'samples': 6811584, 'steps': 35476, 'loss/train': 2.0029690265655518} 11/07/2021 02:14:06 - INFO - __main__ - Step 35478: {'lr': 0.00043949012857347924, 'samples': 6811776, 'steps': 35477, 'loss/train': 1.3724820613861084} 11/07/2021 02:14:07 - INFO - __main__ - Step 35479: {'lr': 0.0004394866669391375, 'samples': 6811968, 'steps': 35478, 'loss/train': 1.372178077697754} 11/07/2021 02:14:07 - INFO - __main__ - Step 35480: {'lr': 0.00043948320521941596, 'samples': 6812160, 'steps': 35479, 'loss/train': 1.7159295082092285} 11/07/2021 02:14:08 - INFO - __main__ - Step 35481: {'lr': 0.00043947974341431627, 'samples': 6812352, 'steps': 35480, 'loss/train': 1.0123029947280884} 11/07/2021 02:14:08 - INFO - __main__ - Step 35482: {'lr': 0.0004394762815238399, 'samples': 6812544, 'steps': 35481, 'loss/train': 1.3081108331680298} 11/07/2021 02:14:09 - INFO - __main__ - Step 35483: {'lr': 0.00043947281954798844, 'samples': 6812736, 'steps': 35482, 'loss/train': 1.5746831893920898} 11/07/2021 02:14:09 - INFO - __main__ - Step 35484: {'lr': 0.0004394693574867635, 'samples': 6812928, 'steps': 35483, 'loss/train': 1.6583791971206665} 11/07/2021 02:14:10 - INFO - __main__ - Step 35485: {'lr': 0.0004394658953401666, 'samples': 6813120, 'steps': 35484, 'loss/train': 1.2911481857299805} 11/07/2021 02:14:10 - INFO - __main__ - Step 35486: {'lr': 0.0004394624331081992, 'samples': 6813312, 'steps': 35485, 'loss/train': 1.4848313331604004} 11/07/2021 02:14:10 - INFO - __main__ - Step 35487: {'lr': 0.00043945897079086295, 'samples': 6813504, 'steps': 35486, 'loss/train': 1.6941444873809814} 11/07/2021 02:14:12 - INFO - __main__ - Step 35488: {'lr': 0.00043945550838815953, 'samples': 6813696, 'steps': 35487, 'loss/train': 1.4050376415252686} 11/07/2021 02:14:12 - INFO - __main__ - Step 35489: {'lr': 0.00043945204590009027, 'samples': 6813888, 'steps': 35488, 'loss/train': 1.667410969734192} 11/07/2021 02:14:12 - INFO - __main__ - Step 35490: {'lr': 0.0004394485833266569, 'samples': 6814080, 'steps': 35489, 'loss/train': 5.882075309753418} 11/07/2021 02:14:13 - INFO - __main__ - Step 35491: {'lr': 0.0004394451206678609, 'samples': 6814272, 'steps': 35490, 'loss/train': 1.7875516414642334} 11/07/2021 02:14:13 - INFO - __main__ - Step 35492: {'lr': 0.00043944165792370385, 'samples': 6814464, 'steps': 35491, 'loss/train': 1.8067296743392944} 11/07/2021 02:14:14 - INFO - __main__ - Step 35493: {'lr': 0.00043943819509418723, 'samples': 6814656, 'steps': 35492, 'loss/train': 0.9062034487724304} 11/07/2021 02:14:14 - INFO - __main__ - Step 35494: {'lr': 0.00043943473217931283, 'samples': 6814848, 'steps': 35493, 'loss/train': 1.8187721967697144} 11/07/2021 02:14:15 - INFO - __main__ - Step 35495: {'lr': 0.0004394312691790821, 'samples': 6815040, 'steps': 35494, 'loss/train': 1.4220510721206665} 11/07/2021 02:14:15 - INFO - __main__ - Step 35496: {'lr': 0.00043942780609349636, 'samples': 6815232, 'steps': 35495, 'loss/train': 1.5365619659423828} 11/07/2021 02:14:16 - INFO - __main__ - Step 35497: {'lr': 0.0004394243429225575, 'samples': 6815424, 'steps': 35496, 'loss/train': 1.6511274576187134} 11/07/2021 02:14:16 - INFO - __main__ - Step 35498: {'lr': 0.0004394208796662669, 'samples': 6815616, 'steps': 35497, 'loss/train': 0.9219018816947937} 11/07/2021 02:14:17 - INFO - __main__ - Step 35499: {'lr': 0.00043941741632462625, 'samples': 6815808, 'steps': 35498, 'loss/train': 1.7441105842590332} 11/07/2021 02:14:17 - INFO - __main__ - Step 35500: {'lr': 0.000439413952897637, 'samples': 6816000, 'steps': 35499, 'loss/train': 1.183741569519043} 11/07/2021 02:14:18 - INFO - __main__ - Step 35501: {'lr': 0.0004394104893853007, 'samples': 6816192, 'steps': 35500, 'loss/train': 0.7865248918533325} 11/07/2021 02:14:18 - INFO - __main__ - Step 35502: {'lr': 0.00043940702578761906, 'samples': 6816384, 'steps': 35501, 'loss/train': 1.31686532497406} 11/07/2021 02:14:19 - INFO - __main__ - Step 35503: {'lr': 0.00043940356210459344, 'samples': 6816576, 'steps': 35502, 'loss/train': 1.5352532863616943} 11/07/2021 02:14:19 - INFO - __main__ - Step 35504: {'lr': 0.0004394000983362255, 'samples': 6816768, 'steps': 35503, 'loss/train': 1.7768218517303467} 11/07/2021 02:14:20 - INFO - __main__ - Step 35505: {'lr': 0.0004393966344825168, 'samples': 6816960, 'steps': 35504, 'loss/train': 2.8831872940063477} 11/07/2021 02:14:20 - INFO - __main__ - Step 35506: {'lr': 0.00043939317054346894, 'samples': 6817152, 'steps': 35505, 'loss/train': 1.364589810371399} 11/07/2021 02:14:20 - INFO - __main__ - Step 35507: {'lr': 0.00043938970651908346, 'samples': 6817344, 'steps': 35506, 'loss/train': 1.6305081844329834} 11/07/2021 02:14:21 - INFO - __main__ - Step 35508: {'lr': 0.0004393862424093619, 'samples': 6817536, 'steps': 35507, 'loss/train': 1.354570746421814} 11/07/2021 02:14:22 - INFO - __main__ - Step 35509: {'lr': 0.0004393827782143057, 'samples': 6817728, 'steps': 35508, 'loss/train': 1.7895361185073853} 11/07/2021 02:14:22 - INFO - __main__ - Step 35510: {'lr': 0.00043937931393391667, 'samples': 6817920, 'steps': 35509, 'loss/train': 2.0235912799835205} 11/07/2021 02:14:22 - INFO - __main__ - Step 35511: {'lr': 0.0004393758495681962, 'samples': 6818112, 'steps': 35510, 'loss/train': 1.6815040111541748} 11/07/2021 02:14:23 - INFO - __main__ - Step 35512: {'lr': 0.0004393723851171459, 'samples': 6818304, 'steps': 35511, 'loss/train': 2.5056517124176025} 11/07/2021 02:14:23 - INFO - __main__ - Step 35513: {'lr': 0.0004393689205807673, 'samples': 6818496, 'steps': 35512, 'loss/train': 0.8651651740074158} 11/07/2021 02:14:24 - INFO - __main__ - Step 35514: {'lr': 0.00043936545595906206, 'samples': 6818688, 'steps': 35513, 'loss/train': 1.3810594081878662} 11/07/2021 02:14:25 - INFO - __main__ - Step 35515: {'lr': 0.00043936199125203156, 'samples': 6818880, 'steps': 35514, 'loss/train': 1.0598888397216797} 11/07/2021 02:14:25 - INFO - __main__ - Step 35516: {'lr': 0.00043935852645967755, 'samples': 6819072, 'steps': 35515, 'loss/train': 1.621749997138977} 11/07/2021 02:14:25 - INFO - __main__ - Step 35517: {'lr': 0.00043935506158200143, 'samples': 6819264, 'steps': 35516, 'loss/train': 1.2624260187149048} 11/07/2021 02:14:26 - INFO - __main__ - Step 35518: {'lr': 0.000439351596619005, 'samples': 6819456, 'steps': 35517, 'loss/train': 1.732316493988037} 11/07/2021 02:14:27 - INFO - __main__ - Step 35519: {'lr': 0.00043934813157068956, 'samples': 6819648, 'steps': 35518, 'loss/train': 0.9514675736427307} 11/07/2021 02:14:27 - INFO - __main__ - Step 35520: {'lr': 0.00043934466643705673, 'samples': 6819840, 'steps': 35519, 'loss/train': 2.08212947845459} 11/07/2021 02:14:27 - INFO - __main__ - Step 35521: {'lr': 0.00043934120121810814, 'samples': 6820032, 'steps': 35520, 'loss/train': 1.5012516975402832} 11/07/2021 02:14:28 - INFO - __main__ - Step 35522: {'lr': 0.0004393377359138454, 'samples': 6820224, 'steps': 35521, 'loss/train': 1.1675945520401} 11/07/2021 02:14:28 - INFO - __main__ - Step 35523: {'lr': 0.00043933427052426986, 'samples': 6820416, 'steps': 35522, 'loss/train': 1.2477281093597412} 11/07/2021 02:14:29 - INFO - __main__ - Step 35524: {'lr': 0.00043933080504938337, 'samples': 6820608, 'steps': 35523, 'loss/train': 1.6102840900421143} 11/07/2021 02:14:29 - INFO - __main__ - Step 35525: {'lr': 0.00043932733948918724, 'samples': 6820800, 'steps': 35524, 'loss/train': 0.9736477136611938} 11/07/2021 02:14:30 - INFO - __main__ - Step 35526: {'lr': 0.0004393238738436832, 'samples': 6820992, 'steps': 35525, 'loss/train': 1.7441880702972412} 11/07/2021 02:14:30 - INFO - __main__ - Step 35527: {'lr': 0.00043932040811287264, 'samples': 6821184, 'steps': 35526, 'loss/train': 1.398417353630066} 11/07/2021 02:14:30 - INFO - __main__ - Step 35528: {'lr': 0.0004393169422967573, 'samples': 6821376, 'steps': 35527, 'loss/train': 1.6070457696914673} 11/07/2021 02:14:31 - INFO - __main__ - Step 35529: {'lr': 0.0004393134763953387, 'samples': 6821568, 'steps': 35528, 'loss/train': 1.5105087757110596} 11/07/2021 02:14:32 - INFO - __main__ - Step 35530: {'lr': 0.00043931001040861835, 'samples': 6821760, 'steps': 35529, 'loss/train': 1.590580701828003} 11/07/2021 02:14:32 - INFO - __main__ - Step 35531: {'lr': 0.00043930654433659775, 'samples': 6821952, 'steps': 35530, 'loss/train': 1.6376655101776123} 11/07/2021 02:14:33 - INFO - __main__ - Step 35532: {'lr': 0.0004393030781792787, 'samples': 6822144, 'steps': 35531, 'loss/train': 1.5782145261764526} 11/07/2021 02:14:33 - INFO - __main__ - Step 35533: {'lr': 0.00043929961193666246, 'samples': 6822336, 'steps': 35532, 'loss/train': 1.3283123970031738} 11/07/2021 02:14:34 - INFO - __main__ - Step 35534: {'lr': 0.0004392961456087508, 'samples': 6822528, 'steps': 35533, 'loss/train': 1.3335016965866089} 11/07/2021 02:14:34 - INFO - __main__ - Step 35535: {'lr': 0.00043929267919554516, 'samples': 6822720, 'steps': 35534, 'loss/train': 0.9739982485771179} 11/07/2021 02:14:35 - INFO - __main__ - Step 35536: {'lr': 0.00043928921269704725, 'samples': 6822912, 'steps': 35535, 'loss/train': 1.3560470342636108} 11/07/2021 02:14:35 - INFO - __main__ - Step 35537: {'lr': 0.00043928574611325845, 'samples': 6823104, 'steps': 35536, 'loss/train': 2.5706093311309814} 11/07/2021 02:14:35 - INFO - __main__ - Step 35538: {'lr': 0.00043928227944418046, 'samples': 6823296, 'steps': 35537, 'loss/train': 0.8618325591087341} 11/07/2021 02:14:36 - INFO - __main__ - Step 35539: {'lr': 0.00043927881268981484, 'samples': 6823488, 'steps': 35538, 'loss/train': 1.1628984212875366} 11/07/2021 02:14:37 - INFO - __main__ - Step 35540: {'lr': 0.00043927534585016305, 'samples': 6823680, 'steps': 35539, 'loss/train': 1.6144016981124878} 11/07/2021 02:14:37 - INFO - __main__ - Step 35541: {'lr': 0.0004392718789252267, 'samples': 6823872, 'steps': 35540, 'loss/train': 1.7553297281265259} 11/07/2021 02:14:37 - INFO - __main__ - Step 35542: {'lr': 0.0004392684119150074, 'samples': 6824064, 'steps': 35541, 'loss/train': 1.8475892543792725} 11/07/2021 02:14:38 - INFO - __main__ - Step 35543: {'lr': 0.0004392649448195066, 'samples': 6824256, 'steps': 35542, 'loss/train': 1.538231372833252} 11/07/2021 02:14:38 - INFO - __main__ - Step 35544: {'lr': 0.000439261477638726, 'samples': 6824448, 'steps': 35543, 'loss/train': 1.5907025337219238} 11/07/2021 02:14:39 - INFO - __main__ - Step 35545: {'lr': 0.0004392580103726671, 'samples': 6824640, 'steps': 35544, 'loss/train': 1.6743935346603394} 11/07/2021 02:14:39 - INFO - __main__ - Step 35546: {'lr': 0.0004392545430213315, 'samples': 6824832, 'steps': 35545, 'loss/train': 1.476622462272644} 11/07/2021 02:14:40 - INFO - __main__ - Step 35547: {'lr': 0.00043925107558472065, 'samples': 6825024, 'steps': 35546, 'loss/train': 1.8071759939193726} 11/07/2021 02:14:40 - INFO - __main__ - Step 35548: {'lr': 0.0004392476080628363, 'samples': 6825216, 'steps': 35547, 'loss/train': 1.7994885444641113} 11/07/2021 02:14:40 - INFO - __main__ - Step 35549: {'lr': 0.00043924414045567973, 'samples': 6825408, 'steps': 35548, 'loss/train': 1.926434874534607} 11/07/2021 02:14:41 - INFO - __main__ - Step 35550: {'lr': 0.00043924067276325274, 'samples': 6825600, 'steps': 35549, 'loss/train': 1.4588193893432617} 11/07/2021 02:14:42 - INFO - __main__ - Step 35551: {'lr': 0.0004392372049855569, 'samples': 6825792, 'steps': 35550, 'loss/train': 1.6992769241333008} 11/07/2021 02:14:42 - INFO - __main__ - Step 35552: {'lr': 0.0004392337371225936, 'samples': 6825984, 'steps': 35551, 'loss/train': 1.381164312362671} 11/07/2021 02:14:43 - INFO - __main__ - Step 35553: {'lr': 0.0004392302691743645, 'samples': 6826176, 'steps': 35552, 'loss/train': 1.5796421766281128} 11/07/2021 02:14:43 - INFO - __main__ - Step 35554: {'lr': 0.0004392268011408712, 'samples': 6826368, 'steps': 35553, 'loss/train': 1.6246516704559326} 11/07/2021 02:14:44 - INFO - __main__ - Step 35555: {'lr': 0.0004392233330221152, 'samples': 6826560, 'steps': 35554, 'loss/train': 0.7908975481987} 11/07/2021 02:14:44 - INFO - __main__ - Step 35556: {'lr': 0.0004392198648180981, 'samples': 6826752, 'steps': 35555, 'loss/train': 1.2291918992996216} 11/07/2021 02:14:45 - INFO - __main__ - Step 35557: {'lr': 0.0004392163965288215, 'samples': 6826944, 'steps': 35556, 'loss/train': 1.5592041015625} 11/07/2021 02:14:45 - INFO - __main__ - Step 35558: {'lr': 0.0004392129281542868, 'samples': 6827136, 'steps': 35557, 'loss/train': 1.6843079328536987} 11/07/2021 02:14:45 - INFO - __main__ - Step 35559: {'lr': 0.00043920945969449577, 'samples': 6827328, 'steps': 35558, 'loss/train': 1.2580102682113647} 11/07/2021 02:14:46 - INFO - __main__ - Step 35560: {'lr': 0.0004392059911494498, 'samples': 6827520, 'steps': 35559, 'loss/train': 1.5198917388916016} 11/07/2021 02:14:47 - INFO - __main__ - Step 35561: {'lr': 0.0004392025225191506, 'samples': 6827712, 'steps': 35560, 'loss/train': 1.4956892728805542} 11/07/2021 02:14:47 - INFO - __main__ - Step 35562: {'lr': 0.0004391990538035996, 'samples': 6827904, 'steps': 35561, 'loss/train': 1.6502302885055542} 11/07/2021 02:14:48 - INFO - __main__ - Step 35563: {'lr': 0.00043919558500279845, 'samples': 6828096, 'steps': 35562, 'loss/train': 1.2379157543182373} 11/07/2021 02:14:48 - INFO - __main__ - Step 35564: {'lr': 0.0004391921161167487, 'samples': 6828288, 'steps': 35563, 'loss/train': 1.503430962562561} 11/07/2021 02:14:49 - INFO - __main__ - Step 35565: {'lr': 0.00043918864714545194, 'samples': 6828480, 'steps': 35564, 'loss/train': 1.0588741302490234} 11/07/2021 02:14:49 - INFO - __main__ - Step 35566: {'lr': 0.00043918517808890964, 'samples': 6828672, 'steps': 35565, 'loss/train': 2.0430285930633545} 11/07/2021 02:14:50 - INFO - __main__ - Step 35567: {'lr': 0.0004391817089471234, 'samples': 6828864, 'steps': 35566, 'loss/train': 0.12924258410930634} 11/07/2021 02:14:50 - INFO - __main__ - Step 35568: {'lr': 0.0004391782397200949, 'samples': 6829056, 'steps': 35567, 'loss/train': 0.8577550053596497} 11/07/2021 02:14:50 - INFO - __main__ - Step 35569: {'lr': 0.0004391747704078255, 'samples': 6829248, 'steps': 35568, 'loss/train': 2.1193783283233643} 11/07/2021 02:14:51 - INFO - __main__ - Step 35570: {'lr': 0.0004391713010103169, 'samples': 6829440, 'steps': 35569, 'loss/train': 1.119254469871521} 11/07/2021 02:14:52 - INFO - __main__ - Step 35571: {'lr': 0.0004391678315275706, 'samples': 6829632, 'steps': 35570, 'loss/train': 1.577763557434082} 11/07/2021 02:14:52 - INFO - __main__ - Step 35572: {'lr': 0.00043916436195958825, 'samples': 6829824, 'steps': 35571, 'loss/train': 1.3878240585327148} 11/07/2021 02:14:52 - INFO - __main__ - Step 35573: {'lr': 0.00043916089230637133, 'samples': 6830016, 'steps': 35572, 'loss/train': 1.3805381059646606} 11/07/2021 02:14:53 - INFO - __main__ - Step 35574: {'lr': 0.0004391574225679215, 'samples': 6830208, 'steps': 35573, 'loss/train': 1.2656093835830688} 11/07/2021 02:14:53 - INFO - __main__ - Step 35575: {'lr': 0.0004391539527442401, 'samples': 6830400, 'steps': 35574, 'loss/train': 1.359189510345459} 11/07/2021 02:14:55 - INFO - __main__ - Step 35576: {'lr': 0.000439150482835329, 'samples': 6830592, 'steps': 35575, 'loss/train': 1.4779083728790283} 11/07/2021 02:14:55 - INFO - __main__ - Step 35577: {'lr': 0.0004391470128411895, 'samples': 6830784, 'steps': 35576, 'loss/train': 1.0927032232284546} 11/07/2021 02:14:55 - INFO - __main__ - Step 35578: {'lr': 0.00043914354276182335, 'samples': 6830976, 'steps': 35577, 'loss/train': 1.5809870958328247} 11/07/2021 02:14:56 - INFO - __main__ - Step 35579: {'lr': 0.00043914007259723196, 'samples': 6831168, 'steps': 35578, 'loss/train': 0.13724112510681152} 11/07/2021 02:14:56 - INFO - __main__ - Step 35580: {'lr': 0.000439136602347417, 'samples': 6831360, 'steps': 35579, 'loss/train': 0.6508349180221558} 11/07/2021 02:14:57 - INFO - __main__ - Step 35581: {'lr': 0.00043913313201238017, 'samples': 6831552, 'steps': 35580, 'loss/train': 1.9046825170516968} 11/07/2021 02:14:57 - INFO - __main__ - Step 35582: {'lr': 0.00043912966159212263, 'samples': 6831744, 'steps': 35581, 'loss/train': 1.856600284576416} 11/07/2021 02:14:58 - INFO - __main__ - Step 35583: {'lr': 0.0004391261910866463, 'samples': 6831936, 'steps': 35582, 'loss/train': 1.4237700700759888} 11/07/2021 02:14:58 - INFO - __main__ - Step 35584: {'lr': 0.0004391227204959526, 'samples': 6832128, 'steps': 35583, 'loss/train': 1.4016250371932983} 11/07/2021 02:14:59 - INFO - __main__ - Step 35585: {'lr': 0.00043911924982004315, 'samples': 6832320, 'steps': 35584, 'loss/train': 1.6603078842163086} 11/07/2021 02:15:00 - INFO - __main__ - Step 35586: {'lr': 0.0004391157790589195, 'samples': 6832512, 'steps': 35585, 'loss/train': 1.489089012145996} 11/07/2021 02:15:00 - INFO - __main__ - Step 35587: {'lr': 0.00043911230821258313, 'samples': 6832704, 'steps': 35586, 'loss/train': 1.623017430305481} 11/07/2021 02:15:00 - INFO - __main__ - Step 35588: {'lr': 0.00043910883728103575, 'samples': 6832896, 'steps': 35587, 'loss/train': 1.4644653797149658} 11/07/2021 02:15:01 - INFO - __main__ - Step 35589: {'lr': 0.0004391053662642788, 'samples': 6833088, 'steps': 35588, 'loss/train': 0.9706271290779114} 11/07/2021 02:15:01 - INFO - __main__ - Step 35590: {'lr': 0.00043910189516231386, 'samples': 6833280, 'steps': 35589, 'loss/train': 1.742411732673645} 11/07/2021 02:15:02 - INFO - __main__ - Step 35591: {'lr': 0.00043909842397514255, 'samples': 6833472, 'steps': 35590, 'loss/train': 1.5278904438018799} 11/07/2021 02:15:02 - INFO - __main__ - Step 35592: {'lr': 0.00043909495270276646, 'samples': 6833664, 'steps': 35591, 'loss/train': 1.4585014581680298} 11/07/2021 02:15:03 - INFO - __main__ - Step 35593: {'lr': 0.00043909148134518703, 'samples': 6833856, 'steps': 35592, 'loss/train': 1.5156004428863525} 11/07/2021 02:15:03 - INFO - __main__ - Step 35594: {'lr': 0.0004390880099024059, 'samples': 6834048, 'steps': 35593, 'loss/train': 0.9020565748214722} 11/07/2021 02:15:03 - INFO - __main__ - Step 35595: {'lr': 0.00043908453837442464, 'samples': 6834240, 'steps': 35594, 'loss/train': 1.130458950996399} 11/07/2021 02:15:04 - INFO - __main__ - Step 35596: {'lr': 0.0004390810667612448, 'samples': 6834432, 'steps': 35595, 'loss/train': 0.2694261968135834} 11/07/2021 02:15:05 - INFO - __main__ - Step 35597: {'lr': 0.00043907759506286797, 'samples': 6834624, 'steps': 35596, 'loss/train': 0.9140653610229492} 11/07/2021 02:15:05 - INFO - __main__ - Step 35598: {'lr': 0.00043907412327929575, 'samples': 6834816, 'steps': 35597, 'loss/train': 1.7793086767196655} 11/07/2021 02:15:06 - INFO - __main__ - Step 35599: {'lr': 0.00043907065141052953, 'samples': 6835008, 'steps': 35598, 'loss/train': 0.11254219710826874} 11/07/2021 02:15:06 - INFO - __main__ - Step 35600: {'lr': 0.00043906717945657104, 'samples': 6835200, 'steps': 35599, 'loss/train': 1.2892582416534424} 11/07/2021 02:15:07 - INFO - __main__ - Step 35601: {'lr': 0.00043906370741742185, 'samples': 6835392, 'steps': 35600, 'loss/train': 1.770271897315979} 11/07/2021 02:15:07 - INFO - __main__ - Step 35602: {'lr': 0.0004390602352930834, 'samples': 6835584, 'steps': 35601, 'loss/train': 6.062001705169678} 11/07/2021 02:15:08 - INFO - __main__ - Step 35603: {'lr': 0.00043905676308355734, 'samples': 6835776, 'steps': 35602, 'loss/train': 1.4519600868225098} 11/07/2021 02:15:08 - INFO - __main__ - Step 35604: {'lr': 0.00043905329078884527, 'samples': 6835968, 'steps': 35603, 'loss/train': 0.758042573928833} 11/07/2021 02:15:08 - INFO - __main__ - Step 35605: {'lr': 0.00043904981840894863, 'samples': 6836160, 'steps': 35604, 'loss/train': 1.488784670829773} 11/07/2021 02:15:09 - INFO - __main__ - Step 35606: {'lr': 0.0004390463459438691, 'samples': 6836352, 'steps': 35605, 'loss/train': 1.6607863903045654} 11/07/2021 02:15:10 - INFO - __main__ - Step 35607: {'lr': 0.0004390428733936082, 'samples': 6836544, 'steps': 35606, 'loss/train': 0.6637961268424988} 11/07/2021 02:15:10 - INFO - __main__ - Step 35608: {'lr': 0.0004390394007581675, 'samples': 6836736, 'steps': 35607, 'loss/train': 1.5490474700927734} 11/07/2021 02:15:10 - INFO - __main__ - Step 35609: {'lr': 0.00043903592803754856, 'samples': 6836928, 'steps': 35608, 'loss/train': 1.330698013305664} 11/07/2021 02:15:11 - INFO - __main__ - Step 35610: {'lr': 0.00043903245523175296, 'samples': 6837120, 'steps': 35609, 'loss/train': 1.4494189023971558} 11/07/2021 02:15:11 - INFO - __main__ - Step 35611: {'lr': 0.00043902898234078223, 'samples': 6837312, 'steps': 35610, 'loss/train': 1.6454213857650757} 11/07/2021 02:15:13 - INFO - __main__ - Step 35612: {'lr': 0.000439025509364638, 'samples': 6837504, 'steps': 35611, 'loss/train': 1.4363117218017578} 11/07/2021 02:15:13 - INFO - __main__ - Step 35613: {'lr': 0.0004390220363033217, 'samples': 6837696, 'steps': 35612, 'loss/train': 1.6490654945373535} 11/07/2021 02:15:13 - INFO - __main__ - Step 35614: {'lr': 0.0004390185631568351, 'samples': 6837888, 'steps': 35613, 'loss/train': 1.2435897588729858} 11/07/2021 02:15:14 - INFO - __main__ - Step 35615: {'lr': 0.00043901508992517956, 'samples': 6838080, 'steps': 35614, 'loss/train': 0.969472348690033} 11/07/2021 02:15:14 - INFO - __main__ - Step 35616: {'lr': 0.0004390116166083568, 'samples': 6838272, 'steps': 35615, 'loss/train': 0.3451453149318695} 11/07/2021 02:15:14 - INFO - __main__ - Step 35617: {'lr': 0.00043900814320636827, 'samples': 6838464, 'steps': 35616, 'loss/train': 1.7469807863235474} 11/07/2021 02:15:15 - INFO - __main__ - Step 35618: {'lr': 0.00043900466971921563, 'samples': 6838656, 'steps': 35617, 'loss/train': 0.8231580853462219} 11/07/2021 02:15:16 - INFO - __main__ - Step 35619: {'lr': 0.00043900119614690043, 'samples': 6838848, 'steps': 35618, 'loss/train': 1.0175918340682983} 11/07/2021 02:15:16 - INFO - __main__ - Step 35620: {'lr': 0.00043899772248942413, 'samples': 6839040, 'steps': 35619, 'loss/train': 1.2768357992172241} 11/07/2021 02:15:16 - INFO - __main__ - Step 35621: {'lr': 0.0004389942487467884, 'samples': 6839232, 'steps': 35620, 'loss/train': 1.4337198734283447} 11/07/2021 02:15:17 - INFO - __main__ - Step 35622: {'lr': 0.00043899077491899485, 'samples': 6839424, 'steps': 35621, 'loss/train': 1.788061261177063} 11/07/2021 02:15:18 - INFO - __main__ - Step 35623: {'lr': 0.0004389873010060449, 'samples': 6839616, 'steps': 35622, 'loss/train': 0.8303018808364868} 11/07/2021 02:15:18 - INFO - __main__ - Step 35624: {'lr': 0.00043898382700794015, 'samples': 6839808, 'steps': 35623, 'loss/train': 1.1721681356430054} 11/07/2021 02:15:18 - INFO - __main__ - Step 35625: {'lr': 0.0004389803529246823, 'samples': 6840000, 'steps': 35624, 'loss/train': 1.319284439086914} 11/07/2021 02:15:19 - INFO - __main__ - Step 35626: {'lr': 0.00043897687875627277, 'samples': 6840192, 'steps': 35625, 'loss/train': 1.1438817977905273} 11/07/2021 02:15:19 - INFO - __main__ - Step 35627: {'lr': 0.00043897340450271317, 'samples': 6840384, 'steps': 35626, 'loss/train': 1.2990198135375977} 11/07/2021 02:15:20 - INFO - __main__ - Step 35628: {'lr': 0.0004389699301640051, 'samples': 6840576, 'steps': 35627, 'loss/train': 1.5525676012039185} 11/07/2021 02:15:21 - INFO - __main__ - Step 35629: {'lr': 0.00043896645574015004, 'samples': 6840768, 'steps': 35628, 'loss/train': 2.9011871814727783} 11/07/2021 02:15:21 - INFO - __main__ - Step 35630: {'lr': 0.00043896298123114965, 'samples': 6840960, 'steps': 35629, 'loss/train': 1.5629982948303223} 11/07/2021 02:15:21 - INFO - __main__ - Step 35631: {'lr': 0.00043895950663700546, 'samples': 6841152, 'steps': 35630, 'loss/train': 1.5904690027236938} 11/07/2021 02:15:22 - INFO - __main__ - Step 35632: {'lr': 0.000438956031957719, 'samples': 6841344, 'steps': 35631, 'loss/train': 1.8988369703292847} 11/07/2021 02:15:23 - INFO - __main__ - Step 35633: {'lr': 0.0004389525571932919, 'samples': 6841536, 'steps': 35632, 'loss/train': 1.0826889276504517} 11/07/2021 02:15:23 - INFO - __main__ - Step 35634: {'lr': 0.00043894908234372564, 'samples': 6841728, 'steps': 35633, 'loss/train': 0.31450581550598145} 11/07/2021 02:15:23 - INFO - __main__ - Step 35635: {'lr': 0.0004389456074090219, 'samples': 6841920, 'steps': 35634, 'loss/train': 1.471784234046936} 11/07/2021 02:15:24 - INFO - __main__ - Step 35636: {'lr': 0.0004389421323891822, 'samples': 6842112, 'steps': 35635, 'loss/train': 1.6761291027069092} 11/07/2021 02:15:24 - INFO - __main__ - Step 35637: {'lr': 0.000438938657284208, 'samples': 6842304, 'steps': 35636, 'loss/train': 1.598026990890503} 11/07/2021 02:15:25 - INFO - __main__ - Step 35638: {'lr': 0.000438935182094101, 'samples': 6842496, 'steps': 35637, 'loss/train': 1.366559386253357} 11/07/2021 02:15:25 - INFO - __main__ - Step 35639: {'lr': 0.0004389317068188628, 'samples': 6842688, 'steps': 35638, 'loss/train': 1.5255507230758667} 11/07/2021 02:15:26 - INFO - __main__ - Step 35640: {'lr': 0.0004389282314584948, 'samples': 6842880, 'steps': 35639, 'loss/train': 1.4930880069732666} 11/07/2021 02:15:26 - INFO - __main__ - Step 35641: {'lr': 0.0004389247560129987, 'samples': 6843072, 'steps': 35640, 'loss/train': 1.627358078956604} 11/07/2021 02:15:26 - INFO - __main__ - Step 35642: {'lr': 0.000438921280482376, 'samples': 6843264, 'steps': 35641, 'loss/train': 1.5437060594558716} 11/07/2021 02:15:28 - INFO - __main__ - Step 35643: {'lr': 0.00043891780486662825, 'samples': 6843456, 'steps': 35642, 'loss/train': 1.588767170906067} 11/07/2021 02:15:28 - INFO - __main__ - Step 35644: {'lr': 0.00043891432916575714, 'samples': 6843648, 'steps': 35643, 'loss/train': 1.012986183166504} 11/07/2021 02:15:28 - INFO - __main__ - Step 35645: {'lr': 0.0004389108533797641, 'samples': 6843840, 'steps': 35644, 'loss/train': 1.3460534811019897} 11/07/2021 02:15:29 - INFO - __main__ - Step 35646: {'lr': 0.00043890737750865074, 'samples': 6844032, 'steps': 35645, 'loss/train': 1.658281683921814} 11/07/2021 02:15:29 - INFO - __main__ - Step 35647: {'lr': 0.0004389039015524186, 'samples': 6844224, 'steps': 35646, 'loss/train': 0.9040864109992981} 11/07/2021 02:15:29 - INFO - __main__ - Step 35648: {'lr': 0.0004389004255110693, 'samples': 6844416, 'steps': 35647, 'loss/train': 1.8229610919952393} 11/07/2021 02:15:30 - INFO - __main__ - Step 35649: {'lr': 0.0004388969493846044, 'samples': 6844608, 'steps': 35648, 'loss/train': 5.77647590637207} 11/07/2021 02:15:31 - INFO - __main__ - Step 35650: {'lr': 0.00043889347317302543, 'samples': 6844800, 'steps': 35649, 'loss/train': 1.7431615591049194} 11/07/2021 02:15:31 - INFO - __main__ - Step 35651: {'lr': 0.000438889996876334, 'samples': 6844992, 'steps': 35650, 'loss/train': 0.9316908717155457} 11/07/2021 02:15:31 - INFO - __main__ - Step 35652: {'lr': 0.00043888652049453163, 'samples': 6845184, 'steps': 35651, 'loss/train': 1.656050443649292} 11/07/2021 02:15:32 - INFO - __main__ - Step 35653: {'lr': 0.0004388830440276199, 'samples': 6845376, 'steps': 35652, 'loss/train': 1.6132490634918213} 11/07/2021 02:15:33 - INFO - __main__ - Step 35654: {'lr': 0.0004388795674756004, 'samples': 6845568, 'steps': 35653, 'loss/train': 1.0542579889297485} 11/07/2021 02:15:33 - INFO - __main__ - Step 35655: {'lr': 0.0004388760908384747, 'samples': 6845760, 'steps': 35654, 'loss/train': 1.361189603805542} 11/07/2021 02:15:33 - INFO - __main__ - Step 35656: {'lr': 0.00043887261411624433, 'samples': 6845952, 'steps': 35655, 'loss/train': 1.5127336978912354} 11/07/2021 02:15:34 - INFO - __main__ - Step 35657: {'lr': 0.00043886913730891087, 'samples': 6846144, 'steps': 35656, 'loss/train': 1.7226002216339111} 11/07/2021 02:15:34 - INFO - __main__ - Step 35658: {'lr': 0.00043886566041647593, 'samples': 6846336, 'steps': 35657, 'loss/train': 1.3042670488357544} 11/07/2021 02:15:35 - INFO - __main__ - Step 35659: {'lr': 0.000438862183438941, 'samples': 6846528, 'steps': 35658, 'loss/train': 1.8118523359298706} 11/07/2021 02:15:36 - INFO - __main__ - Step 35660: {'lr': 0.00043885870637630763, 'samples': 6846720, 'steps': 35659, 'loss/train': 1.3242474794387817} 11/07/2021 02:15:36 - INFO - __main__ - Step 35661: {'lr': 0.00043885522922857757, 'samples': 6846912, 'steps': 35660, 'loss/train': 1.9415103197097778} 11/07/2021 02:15:36 - INFO - __main__ - Step 35662: {'lr': 0.00043885175199575216, 'samples': 6847104, 'steps': 35661, 'loss/train': 1.6594187021255493} 11/07/2021 02:15:37 - INFO - __main__ - Step 35663: {'lr': 0.00043884827467783303, 'samples': 6847296, 'steps': 35662, 'loss/train': 1.6623964309692383} 11/07/2021 02:15:37 - INFO - __main__ - Step 35664: {'lr': 0.00043884479727482193, 'samples': 6847488, 'steps': 35663, 'loss/train': 1.5567348003387451} 11/07/2021 02:15:38 - INFO - __main__ - Step 35665: {'lr': 0.00043884131978672014, 'samples': 6847680, 'steps': 35664, 'loss/train': 1.7647892236709595} 11/07/2021 02:15:38 - INFO - __main__ - Step 35666: {'lr': 0.00043883784221352947, 'samples': 6847872, 'steps': 35665, 'loss/train': 1.313801884651184} 11/07/2021 02:15:39 - INFO - __main__ - Step 35667: {'lr': 0.00043883436455525125, 'samples': 6848064, 'steps': 35666, 'loss/train': 1.6134251356124878} 11/07/2021 02:15:39 - INFO - __main__ - Step 35668: {'lr': 0.0004388308868118873, 'samples': 6848256, 'steps': 35667, 'loss/train': 0.6823694109916687} 11/07/2021 02:15:39 - INFO - __main__ - Step 35669: {'lr': 0.00043882740898343905, 'samples': 6848448, 'steps': 35668, 'loss/train': 1.1683130264282227} 11/07/2021 02:15:41 - INFO - __main__ - Step 35670: {'lr': 0.00043882393106990804, 'samples': 6848640, 'steps': 35669, 'loss/train': 1.3475676774978638} 11/07/2021 02:15:41 - INFO - __main__ - Step 35671: {'lr': 0.0004388204530712959, 'samples': 6848832, 'steps': 35670, 'loss/train': 1.5418709516525269} 11/07/2021 02:15:41 - INFO - __main__ - Step 35672: {'lr': 0.0004388169749876042, 'samples': 6849024, 'steps': 35671, 'loss/train': 1.551112174987793} 11/07/2021 02:15:42 - INFO - __main__ - Step 35673: {'lr': 0.0004388134968188344, 'samples': 6849216, 'steps': 35672, 'loss/train': 1.7342904806137085} 11/07/2021 02:15:42 - INFO - __main__ - Step 35674: {'lr': 0.00043881001856498823, 'samples': 6849408, 'steps': 35673, 'loss/train': 1.8497107028961182} 11/07/2021 02:15:43 - INFO - __main__ - Step 35675: {'lr': 0.0004388065402260672, 'samples': 6849600, 'steps': 35674, 'loss/train': 1.751163363456726} 11/07/2021 02:15:43 - INFO - __main__ - Step 35676: {'lr': 0.0004388030618020729, 'samples': 6849792, 'steps': 35675, 'loss/train': 0.6405913233757019} 11/07/2021 02:15:44 - INFO - __main__ - Step 35677: {'lr': 0.0004387995832930067, 'samples': 6849984, 'steps': 35676, 'loss/train': 1.7262928485870361} 11/07/2021 02:15:44 - INFO - __main__ - Step 35678: {'lr': 0.00043879610469887043, 'samples': 6850176, 'steps': 35677, 'loss/train': 1.0168603658676147} 11/07/2021 02:15:44 - INFO - __main__ - Step 35679: {'lr': 0.00043879262601966544, 'samples': 6850368, 'steps': 35678, 'loss/train': 1.1950746774673462} 11/07/2021 02:15:45 - INFO - __main__ - Step 35680: {'lr': 0.00043878914725539356, 'samples': 6850560, 'steps': 35679, 'loss/train': 1.5614596605300903} 11/07/2021 02:15:46 - INFO - __main__ - Step 35681: {'lr': 0.00043878566840605606, 'samples': 6850752, 'steps': 35680, 'loss/train': 0.8708986639976501} 11/07/2021 02:15:46 - INFO - __main__ - Step 35682: {'lr': 0.0004387821894716547, 'samples': 6850944, 'steps': 35681, 'loss/train': 1.5628100633621216} 11/07/2021 02:15:46 - INFO - __main__ - Step 35683: {'lr': 0.000438778710452191, 'samples': 6851136, 'steps': 35682, 'loss/train': 1.6288344860076904} 11/07/2021 02:15:47 - INFO - __main__ - Step 35684: {'lr': 0.00043877523134766664, 'samples': 6851328, 'steps': 35683, 'loss/train': 1.3384032249450684} 11/07/2021 02:15:48 - INFO - __main__ - Step 35685: {'lr': 0.0004387717521580829, 'samples': 6851520, 'steps': 35684, 'loss/train': 1.6574108600616455} 11/07/2021 02:15:48 - INFO - __main__ - Step 35686: {'lr': 0.00043876827288344156, 'samples': 6851712, 'steps': 35685, 'loss/train': 1.457568645477295} 11/07/2021 02:15:49 - INFO - __main__ - Step 35687: {'lr': 0.00043876479352374423, 'samples': 6851904, 'steps': 35686, 'loss/train': 1.7087618112564087} 11/07/2021 02:15:49 - INFO - __main__ - Step 35688: {'lr': 0.00043876131407899233, 'samples': 6852096, 'steps': 35687, 'loss/train': 0.9426607489585876} 11/07/2021 02:15:49 - INFO - __main__ - Step 35689: {'lr': 0.00043875783454918753, 'samples': 6852288, 'steps': 35688, 'loss/train': 1.389722228050232} 11/07/2021 02:15:50 - INFO - __main__ - Step 35690: {'lr': 0.00043875435493433135, 'samples': 6852480, 'steps': 35689, 'loss/train': 1.626808524131775} 11/07/2021 02:15:51 - INFO - __main__ - Step 35691: {'lr': 0.00043875087523442537, 'samples': 6852672, 'steps': 35690, 'loss/train': 1.179518699645996} 11/07/2021 02:15:51 - INFO - __main__ - Step 35692: {'lr': 0.0004387473954494712, 'samples': 6852864, 'steps': 35691, 'loss/train': 1.2149150371551514} 11/07/2021 02:15:51 - INFO - __main__ - Step 35693: {'lr': 0.00043874391557947027, 'samples': 6853056, 'steps': 35692, 'loss/train': 1.4332332611083984} 11/07/2021 02:15:52 - INFO - __main__ - Step 35694: {'lr': 0.0004387404356244243, 'samples': 6853248, 'steps': 35693, 'loss/train': 1.674772024154663} 11/07/2021 02:15:52 - INFO - __main__ - Step 35695: {'lr': 0.0004387369555843348, 'samples': 6853440, 'steps': 35694, 'loss/train': 1.6376999616622925} 11/07/2021 02:15:53 - INFO - __main__ - Step 35696: {'lr': 0.00043873347545920333, 'samples': 6853632, 'steps': 35695, 'loss/train': 1.5448943376541138} 11/07/2021 02:15:54 - INFO - __main__ - Step 35697: {'lr': 0.00043872999524903147, 'samples': 6853824, 'steps': 35696, 'loss/train': 1.4090485572814941} 11/07/2021 02:15:54 - INFO - __main__ - Step 35698: {'lr': 0.00043872651495382076, 'samples': 6854016, 'steps': 35697, 'loss/train': 1.7205495834350586} 11/07/2021 02:15:54 - INFO - __main__ - Step 35699: {'lr': 0.00043872303457357287, 'samples': 6854208, 'steps': 35698, 'loss/train': 1.7329543828964233} 11/07/2021 02:15:55 - INFO - __main__ - Step 35700: {'lr': 0.0004387195541082892, 'samples': 6854400, 'steps': 35699, 'loss/train': 1.2321515083312988} 11/07/2021 02:15:56 - INFO - __main__ - Step 35701: {'lr': 0.0004387160735579715, 'samples': 6854592, 'steps': 35700, 'loss/train': 1.6390836238861084} 11/07/2021 02:15:56 - INFO - __main__ - Step 35702: {'lr': 0.0004387125929226212, 'samples': 6854784, 'steps': 35701, 'loss/train': 1.5886253118515015} 11/07/2021 02:15:56 - INFO - __main__ - Step 35703: {'lr': 0.00043870911220224, 'samples': 6854976, 'steps': 35702, 'loss/train': 1.6304876804351807} 11/07/2021 02:15:57 - INFO - __main__ - Step 35704: {'lr': 0.0004387056313968293, 'samples': 6855168, 'steps': 35703, 'loss/train': 1.816699743270874} 11/07/2021 02:15:57 - INFO - __main__ - Step 35705: {'lr': 0.00043870215050639073, 'samples': 6855360, 'steps': 35704, 'loss/train': 5.773858070373535} 11/07/2021 02:15:57 - INFO - __main__ - Step 35706: {'lr': 0.00043869866953092593, 'samples': 6855552, 'steps': 35705, 'loss/train': 1.3915510177612305} 11/07/2021 02:15:58 - INFO - __main__ - Step 35707: {'lr': 0.00043869518847043643, 'samples': 6855744, 'steps': 35706, 'loss/train': 1.6798690557479858} 11/07/2021 02:15:59 - INFO - __main__ - Step 35708: {'lr': 0.0004386917073249237, 'samples': 6855936, 'steps': 35707, 'loss/train': 1.8041768074035645} 11/07/2021 02:15:59 - INFO - __main__ - Step 35709: {'lr': 0.00043868822609438953, 'samples': 6856128, 'steps': 35708, 'loss/train': 1.6509307622909546} 11/07/2021 02:15:59 - INFO - __main__ - Step 35710: {'lr': 0.00043868474477883523, 'samples': 6856320, 'steps': 35709, 'loss/train': 1.6956952810287476} 11/07/2021 02:16:00 - INFO - __main__ - Step 35711: {'lr': 0.0004386812633782626, 'samples': 6856512, 'steps': 35710, 'loss/train': 1.3712821006774902} 11/07/2021 02:16:01 - INFO - __main__ - Step 35712: {'lr': 0.00043867778189267306, 'samples': 6856704, 'steps': 35711, 'loss/train': 1.5459964275360107} 11/07/2021 02:16:02 - INFO - __main__ - Step 35713: {'lr': 0.0004386743003220682, 'samples': 6856896, 'steps': 35712, 'loss/train': 2.210019588470459} 11/07/2021 02:16:02 - INFO - __main__ - Step 35714: {'lr': 0.0004386708186664496, 'samples': 6857088, 'steps': 35713, 'loss/train': 2.363863945007324} 11/07/2021 02:16:02 - INFO - __main__ - Step 35715: {'lr': 0.00043866733692581896, 'samples': 6857280, 'steps': 35714, 'loss/train': 1.3447531461715698} 11/07/2021 02:16:03 - INFO - __main__ - Step 35716: {'lr': 0.0004386638551001777, 'samples': 6857472, 'steps': 35715, 'loss/train': 1.3204762935638428} 11/07/2021 02:16:04 - INFO - __main__ - Step 35717: {'lr': 0.00043866037318952735, 'samples': 6857664, 'steps': 35716, 'loss/train': 2.144253730773926} 11/07/2021 02:16:04 - INFO - __main__ - Step 35718: {'lr': 0.0004386568911938695, 'samples': 6857856, 'steps': 35717, 'loss/train': 1.7581815719604492} 11/07/2021 02:16:04 - INFO - __main__ - Step 35719: {'lr': 0.0004386534091132059, 'samples': 6858048, 'steps': 35718, 'loss/train': 1.5662397146224976} 11/07/2021 02:16:05 - INFO - __main__ - Step 35720: {'lr': 0.0004386499269475379, 'samples': 6858240, 'steps': 35719, 'loss/train': 1.9442492723464966} 11/07/2021 02:16:05 - INFO - __main__ - Step 35721: {'lr': 0.00043864644469686717, 'samples': 6858432, 'steps': 35720, 'loss/train': 0.9276940226554871} 11/07/2021 02:16:06 - INFO - __main__ - Step 35722: {'lr': 0.0004386429623611953, 'samples': 6858624, 'steps': 35721, 'loss/train': 1.2161866426467896} 11/07/2021 02:16:07 - INFO - __main__ - Step 35723: {'lr': 0.0004386394799405238, 'samples': 6858816, 'steps': 35722, 'loss/train': 1.693832516670227} 11/07/2021 02:16:07 - INFO - __main__ - Step 35724: {'lr': 0.00043863599743485416, 'samples': 6859008, 'steps': 35723, 'loss/train': 1.8028156757354736} 11/07/2021 02:16:08 - INFO - __main__ - Step 35725: {'lr': 0.0004386325148441882, 'samples': 6859200, 'steps': 35724, 'loss/train': 1.640588402748108} 11/07/2021 02:16:08 - INFO - __main__ - Step 35726: {'lr': 0.00043862903216852723, 'samples': 6859392, 'steps': 35725, 'loss/train': 1.5772079229354858} 11/07/2021 02:16:08 - INFO - __main__ - Step 35727: {'lr': 0.00043862554940787303, 'samples': 6859584, 'steps': 35726, 'loss/train': 2.072113037109375} 11/07/2021 02:16:09 - INFO - __main__ - Step 35728: {'lr': 0.000438622066562227, 'samples': 6859776, 'steps': 35727, 'loss/train': 1.4786450862884521} 11/07/2021 02:16:10 - INFO - __main__ - Step 35729: {'lr': 0.0004386185836315908, 'samples': 6859968, 'steps': 35728, 'loss/train': 1.6002217531204224} 11/07/2021 02:16:10 - INFO - __main__ - Step 35730: {'lr': 0.0004386151006159659, 'samples': 6860160, 'steps': 35729, 'loss/train': 1.2281548976898193} 11/07/2021 02:16:10 - INFO - __main__ - Step 35731: {'lr': 0.00043861161751535406, 'samples': 6860352, 'steps': 35730, 'loss/train': 0.9255541563034058} 11/07/2021 02:16:11 - INFO - __main__ - Step 35732: {'lr': 0.0004386081343297567, 'samples': 6860544, 'steps': 35731, 'loss/train': 1.7340439558029175} 11/07/2021 02:16:11 - INFO - __main__ - Step 35733: {'lr': 0.0004386046510591754, 'samples': 6860736, 'steps': 35732, 'loss/train': 1.8112010955810547} 11/07/2021 02:16:12 - INFO - __main__ - Step 35734: {'lr': 0.0004386011677036118, 'samples': 6860928, 'steps': 35733, 'loss/train': 1.756344199180603} 11/07/2021 02:16:12 - INFO - __main__ - Step 35735: {'lr': 0.00043859768426306737, 'samples': 6861120, 'steps': 35734, 'loss/train': 1.2330529689788818} 11/07/2021 02:16:13 - INFO - __main__ - Step 35736: {'lr': 0.00043859420073754377, 'samples': 6861312, 'steps': 35735, 'loss/train': 1.1958736181259155} 11/07/2021 02:16:13 - INFO - __main__ - Step 35737: {'lr': 0.0004385907171270425, 'samples': 6861504, 'steps': 35736, 'loss/train': 2.0505869388580322} 11/07/2021 02:16:13 - INFO - __main__ - Step 35738: {'lr': 0.00043858723343156514, 'samples': 6861696, 'steps': 35737, 'loss/train': 1.6566646099090576} 11/07/2021 02:16:15 - INFO - __main__ - Step 35739: {'lr': 0.00043858374965111336, 'samples': 6861888, 'steps': 35738, 'loss/train': 1.0120819807052612} 11/07/2021 02:16:15 - INFO - __main__ - Step 35740: {'lr': 0.00043858026578568864, 'samples': 6862080, 'steps': 35739, 'loss/train': 1.7157846689224243} 11/07/2021 02:16:15 - INFO - __main__ - Step 35741: {'lr': 0.00043857678183529256, 'samples': 6862272, 'steps': 35740, 'loss/train': 1.0181148052215576} 11/07/2021 02:16:16 - INFO - __main__ - Step 35742: {'lr': 0.0004385732977999266, 'samples': 6862464, 'steps': 35741, 'loss/train': 1.0279544591903687} 11/07/2021 02:16:16 - INFO - __main__ - Step 35743: {'lr': 0.0004385698136795926, 'samples': 6862656, 'steps': 35742, 'loss/train': 1.773728609085083} 11/07/2021 02:16:17 - INFO - __main__ - Step 35744: {'lr': 0.00043856632947429175, 'samples': 6862848, 'steps': 35743, 'loss/train': 0.8074324727058411} 11/07/2021 02:16:17 - INFO - __main__ - Step 35745: {'lr': 0.00043856284518402594, 'samples': 6863040, 'steps': 35744, 'loss/train': 1.3862853050231934} 11/07/2021 02:16:18 - INFO - __main__ - Step 35746: {'lr': 0.00043855936080879667, 'samples': 6863232, 'steps': 35745, 'loss/train': 1.3445626497268677} 11/07/2021 02:16:18 - INFO - __main__ - Step 35747: {'lr': 0.0004385558763486053, 'samples': 6863424, 'steps': 35746, 'loss/train': 1.5872620344161987} 11/07/2021 02:16:18 - INFO - __main__ - Step 35748: {'lr': 0.00043855239180345376, 'samples': 6863616, 'steps': 35747, 'loss/train': 1.3158793449401855} 11/07/2021 02:16:19 - INFO - __main__ - Step 35749: {'lr': 0.00043854890717334326, 'samples': 6863808, 'steps': 35748, 'loss/train': 1.9186967611312866} 11/07/2021 02:16:20 - INFO - __main__ - Step 35750: {'lr': 0.00043854542245827554, 'samples': 6864000, 'steps': 35749, 'loss/train': 1.4672558307647705} 11/07/2021 02:16:20 - INFO - __main__ - Step 35751: {'lr': 0.00043854193765825223, 'samples': 6864192, 'steps': 35750, 'loss/train': 1.7083171606063843} 11/07/2021 02:16:20 - INFO - __main__ - Step 35752: {'lr': 0.00043853845277327485, 'samples': 6864384, 'steps': 35751, 'loss/train': 1.9616976976394653} 11/07/2021 02:16:21 - INFO - __main__ - Step 35753: {'lr': 0.0004385349678033449, 'samples': 6864576, 'steps': 35752, 'loss/train': 1.361376166343689} 11/07/2021 02:16:21 - INFO - __main__ - Step 35754: {'lr': 0.000438531482748464, 'samples': 6864768, 'steps': 35753, 'loss/train': 1.4319709539413452} 11/07/2021 02:16:22 - INFO - __main__ - Step 35755: {'lr': 0.00043852799760863375, 'samples': 6864960, 'steps': 35754, 'loss/train': 1.3204424381256104} 11/07/2021 02:16:23 - INFO - __main__ - Step 35756: {'lr': 0.0004385245123838557, 'samples': 6865152, 'steps': 35755, 'loss/train': 1.7563824653625488} 11/07/2021 02:16:23 - INFO - __main__ - Step 35757: {'lr': 0.00043852102707413144, 'samples': 6865344, 'steps': 35756, 'loss/train': 1.5977060794830322} 11/07/2021 02:16:23 - INFO - __main__ - Step 35758: {'lr': 0.00043851754167946244, 'samples': 6865536, 'steps': 35757, 'loss/train': 1.4238637685775757} 11/07/2021 02:16:24 - INFO - __main__ - Step 35759: {'lr': 0.00043851405619985037, 'samples': 6865728, 'steps': 35758, 'loss/train': 1.9423731565475464} 11/07/2021 02:16:25 - INFO - __main__ - Step 35760: {'lr': 0.00043851057063529675, 'samples': 6865920, 'steps': 35759, 'loss/train': 1.5438404083251953} 11/07/2021 02:16:25 - INFO - __main__ - Step 35761: {'lr': 0.00043850708498580326, 'samples': 6866112, 'steps': 35760, 'loss/train': 1.65753173828125} 11/07/2021 02:16:25 - INFO - __main__ - Step 35762: {'lr': 0.00043850359925137126, 'samples': 6866304, 'steps': 35761, 'loss/train': 1.8983148336410522} 11/07/2021 02:16:26 - INFO - __main__ - Step 35763: {'lr': 0.0004385001134320026, 'samples': 6866496, 'steps': 35762, 'loss/train': 1.5388737916946411} 11/07/2021 02:16:26 - INFO - __main__ - Step 35764: {'lr': 0.0004384966275276986, 'samples': 6866688, 'steps': 35763, 'loss/train': 1.6825393438339233} 11/07/2021 02:16:27 - INFO - __main__ - Step 35765: {'lr': 0.00043849314153846094, 'samples': 6866880, 'steps': 35764, 'loss/train': 1.8381291627883911} 11/07/2021 02:16:28 - INFO - __main__ - Step 35766: {'lr': 0.0004384896554642912, 'samples': 6867072, 'steps': 35765, 'loss/train': 1.4535595178604126} 11/07/2021 02:16:28 - INFO - __main__ - Step 35767: {'lr': 0.00043848616930519094, 'samples': 6867264, 'steps': 35766, 'loss/train': 1.5976481437683105} 11/07/2021 02:16:28 - INFO - __main__ - Step 35768: {'lr': 0.0004384826830611617, 'samples': 6867456, 'steps': 35767, 'loss/train': 1.8629978895187378} 11/07/2021 02:16:29 - INFO - __main__ - Step 35769: {'lr': 0.00043847919673220504, 'samples': 6867648, 'steps': 35768, 'loss/train': 1.934322714805603} 11/07/2021 02:16:30 - INFO - __main__ - Step 35770: {'lr': 0.00043847571031832257, 'samples': 6867840, 'steps': 35769, 'loss/train': 1.3385542631149292} 11/07/2021 02:16:30 - INFO - __main__ - Step 35771: {'lr': 0.0004384722238195159, 'samples': 6868032, 'steps': 35770, 'loss/train': 0.987594723701477} 11/07/2021 02:16:30 - INFO - __main__ - Step 35772: {'lr': 0.0004384687372357865, 'samples': 6868224, 'steps': 35771, 'loss/train': 1.5806580781936646} 11/07/2021 02:16:31 - INFO - __main__ - Step 35773: {'lr': 0.000438465250567136, 'samples': 6868416, 'steps': 35772, 'loss/train': 1.2526181936264038} 11/07/2021 02:16:31 - INFO - __main__ - Step 35774: {'lr': 0.00043846176381356607, 'samples': 6868608, 'steps': 35773, 'loss/train': 1.3293781280517578} 11/07/2021 02:16:32 - INFO - __main__ - Step 35775: {'lr': 0.000438458276975078, 'samples': 6868800, 'steps': 35774, 'loss/train': 1.3052928447723389} 11/07/2021 02:16:32 - INFO - __main__ - Step 35776: {'lr': 0.0004384547900516737, 'samples': 6868992, 'steps': 35775, 'loss/train': 1.5681229829788208} 11/07/2021 02:16:33 - INFO - __main__ - Step 35777: {'lr': 0.00043845130304335454, 'samples': 6869184, 'steps': 35776, 'loss/train': 1.1610848903656006} 11/07/2021 02:16:33 - INFO - __main__ - Step 35778: {'lr': 0.00043844781595012204, 'samples': 6869376, 'steps': 35777, 'loss/train': 1.4218180179595947} 11/07/2021 02:16:33 - INFO - __main__ - Step 35779: {'lr': 0.0004384443287719779, 'samples': 6869568, 'steps': 35778, 'loss/train': 1.264674186706543} 11/07/2021 02:16:35 - INFO - __main__ - Step 35780: {'lr': 0.0004384408415089237, 'samples': 6869760, 'steps': 35779, 'loss/train': 1.3298534154891968} 11/07/2021 02:16:35 - INFO - __main__ - Step 35781: {'lr': 0.000438437354160961, 'samples': 6869952, 'steps': 35780, 'loss/train': 1.25020170211792} 11/07/2021 02:16:35 - INFO - __main__ - Step 35782: {'lr': 0.00043843386672809127, 'samples': 6870144, 'steps': 35781, 'loss/train': 1.6183899641036987} 11/07/2021 02:16:36 - INFO - __main__ - Step 35783: {'lr': 0.00043843037921031616, 'samples': 6870336, 'steps': 35782, 'loss/train': 1.6619880199432373} 11/07/2021 02:16:36 - INFO - __main__ - Step 35784: {'lr': 0.00043842689160763723, 'samples': 6870528, 'steps': 35783, 'loss/train': 1.8179965019226074} 11/07/2021 02:16:37 - INFO - __main__ - Step 35785: {'lr': 0.00043842340392005605, 'samples': 6870720, 'steps': 35784, 'loss/train': 1.1560782194137573} 11/07/2021 02:16:37 - INFO - __main__ - Step 35786: {'lr': 0.00043841991614757415, 'samples': 6870912, 'steps': 35785, 'loss/train': 1.5421417951583862} 11/07/2021 02:16:38 - INFO - __main__ - Step 35787: {'lr': 0.00043841642829019325, 'samples': 6871104, 'steps': 35786, 'loss/train': 1.424499750137329} 11/07/2021 02:16:38 - INFO - __main__ - Step 35788: {'lr': 0.00043841294034791466, 'samples': 6871296, 'steps': 35787, 'loss/train': 1.5242619514465332} 11/07/2021 02:16:38 - INFO - __main__ - Step 35789: {'lr': 0.0004384094523207403, 'samples': 6871488, 'steps': 35788, 'loss/train': 1.5820269584655762} 11/07/2021 02:16:39 - INFO - __main__ - Step 35790: {'lr': 0.0004384059642086714, 'samples': 6871680, 'steps': 35789, 'loss/train': 1.568547010421753} 11/07/2021 02:16:40 - INFO - __main__ - Step 35791: {'lr': 0.00043840247601170966, 'samples': 6871872, 'steps': 35790, 'loss/train': 1.5818415880203247} 11/07/2021 02:16:40 - INFO - __main__ - Step 35792: {'lr': 0.0004383989877298568, 'samples': 6872064, 'steps': 35791, 'loss/train': 1.6161761283874512} 11/07/2021 02:16:40 - INFO - __main__ - Step 35793: {'lr': 0.0004383954993631142, 'samples': 6872256, 'steps': 35792, 'loss/train': 1.337131381034851} 11/07/2021 02:16:41 - INFO - __main__ - Step 35794: {'lr': 0.0004383920109114835, 'samples': 6872448, 'steps': 35793, 'loss/train': 1.4632744789123535} 11/07/2021 02:16:41 - INFO - __main__ - Step 35795: {'lr': 0.00043838852237496626, 'samples': 6872640, 'steps': 35794, 'loss/train': 1.4462274312973022} 11/07/2021 02:16:42 - INFO - __main__ - Step 35796: {'lr': 0.000438385033753564, 'samples': 6872832, 'steps': 35795, 'loss/train': 1.5358773469924927} 11/07/2021 02:16:42 - INFO - __main__ - Step 35797: {'lr': 0.00043838154504727847, 'samples': 6873024, 'steps': 35796, 'loss/train': 1.5611079931259155} 11/07/2021 02:16:43 - INFO - __main__ - Step 35798: {'lr': 0.00043837805625611105, 'samples': 6873216, 'steps': 35797, 'loss/train': 1.5787655115127563} 11/07/2021 02:16:43 - INFO - __main__ - Step 35799: {'lr': 0.0004383745673800634, 'samples': 6873408, 'steps': 35798, 'loss/train': 2.0828986167907715} 11/07/2021 02:16:43 - INFO - __main__ - Step 35800: {'lr': 0.000438371078419137, 'samples': 6873600, 'steps': 35799, 'loss/train': 1.3921492099761963} 11/07/2021 02:16:44 - INFO - __main__ - Step 35801: {'lr': 0.00043836758937333366, 'samples': 6873792, 'steps': 35800, 'loss/train': 1.8202711343765259} 11/07/2021 02:16:45 - INFO - __main__ - Step 35802: {'lr': 0.0004383641002426547, 'samples': 6873984, 'steps': 35801, 'loss/train': 1.5541462898254395} 11/07/2021 02:16:45 - INFO - __main__ - Step 35803: {'lr': 0.0004383606110271018, 'samples': 6874176, 'steps': 35802, 'loss/train': 1.2755517959594727} 11/07/2021 02:16:45 - INFO - __main__ - Step 35804: {'lr': 0.00043835712172667643, 'samples': 6874368, 'steps': 35803, 'loss/train': 1.0208276510238647} 11/07/2021 02:16:46 - INFO - __main__ - Step 35805: {'lr': 0.00043835363234138037, 'samples': 6874560, 'steps': 35804, 'loss/train': 1.4622864723205566} 11/07/2021 02:16:47 - INFO - __main__ - Step 35806: {'lr': 0.00043835014287121497, 'samples': 6874752, 'steps': 35805, 'loss/train': 0.9984145760536194} 11/07/2021 02:16:47 - INFO - __main__ - Step 35807: {'lr': 0.00043834665331618196, 'samples': 6874944, 'steps': 35806, 'loss/train': 1.4266750812530518} 11/07/2021 02:16:48 - INFO - __main__ - Step 35808: {'lr': 0.00043834316367628287, 'samples': 6875136, 'steps': 35807, 'loss/train': 1.6676244735717773} 11/07/2021 02:16:48 - INFO - __main__ - Step 35809: {'lr': 0.0004383396739515192, 'samples': 6875328, 'steps': 35808, 'loss/train': 1.5520555973052979} 11/07/2021 02:16:48 - INFO - __main__ - Step 35810: {'lr': 0.00043833618414189265, 'samples': 6875520, 'steps': 35809, 'loss/train': 1.4976266622543335} 11/07/2021 02:16:49 - INFO - __main__ - Step 35811: {'lr': 0.0004383326942474046, 'samples': 6875712, 'steps': 35810, 'loss/train': 1.654467225074768} 11/07/2021 02:16:50 - INFO - __main__ - Step 35812: {'lr': 0.0004383292042680569, 'samples': 6875904, 'steps': 35811, 'loss/train': 1.66763436794281} 11/07/2021 02:16:50 - INFO - __main__ - Step 35813: {'lr': 0.0004383257142038509, 'samples': 6876096, 'steps': 35812, 'loss/train': 2.1094698905944824} 11/07/2021 02:16:50 - INFO - __main__ - Step 35814: {'lr': 0.0004383222240547882, 'samples': 6876288, 'steps': 35813, 'loss/train': 0.9822989702224731} 11/07/2021 02:16:51 - INFO - __main__ - Step 35815: {'lr': 0.00043831873382087043, 'samples': 6876480, 'steps': 35814, 'loss/train': 1.5135447978973389} 11/07/2021 02:16:52 - INFO - __main__ - Step 35816: {'lr': 0.0004383152435020992, 'samples': 6876672, 'steps': 35815, 'loss/train': 0.8099708557128906} 11/07/2021 02:16:52 - INFO - __main__ - Step 35817: {'lr': 0.0004383117530984759, 'samples': 6876864, 'steps': 35816, 'loss/train': 1.5529484748840332} 11/07/2021 02:16:52 - INFO - __main__ - Step 35818: {'lr': 0.0004383082626100024, 'samples': 6877056, 'steps': 35817, 'loss/train': 0.5928159356117249} 11/07/2021 02:16:53 - INFO - __main__ - Step 35819: {'lr': 0.00043830477203668, 'samples': 6877248, 'steps': 35818, 'loss/train': 1.6680653095245361} 11/07/2021 02:16:53 - INFO - __main__ - Step 35820: {'lr': 0.0004383012813785104, 'samples': 6877440, 'steps': 35819, 'loss/train': 1.5121403932571411} 11/07/2021 02:16:54 - INFO - __main__ - Step 35821: {'lr': 0.00043829779063549515, 'samples': 6877632, 'steps': 35820, 'loss/train': 1.6605607271194458} 11/07/2021 02:16:54 - INFO - __main__ - Step 35822: {'lr': 0.0004382942998076358, 'samples': 6877824, 'steps': 35821, 'loss/train': 1.255125641822815} 11/07/2021 02:16:55 - INFO - __main__ - Step 35823: {'lr': 0.000438290808894934, 'samples': 6878016, 'steps': 35822, 'loss/train': 1.2337799072265625} 11/07/2021 02:16:55 - INFO - __main__ - Step 35824: {'lr': 0.0004382873178973912, 'samples': 6878208, 'steps': 35823, 'loss/train': 1.0595225095748901} 11/07/2021 02:16:56 - INFO - __main__ - Step 35825: {'lr': 0.00043828382681500907, 'samples': 6878400, 'steps': 35824, 'loss/train': 1.4158706665039062} 11/07/2021 02:16:57 - INFO - __main__ - Step 35826: {'lr': 0.0004382803356477891, 'samples': 6878592, 'steps': 35825, 'loss/train': 0.6019604206085205} 11/07/2021 02:16:57 - INFO - __main__ - Step 35827: {'lr': 0.000438276844395733, 'samples': 6878784, 'steps': 35826, 'loss/train': 1.0249121189117432} 11/07/2021 02:16:57 - INFO - __main__ - Step 35828: {'lr': 0.0004382733530588422, 'samples': 6878976, 'steps': 35827, 'loss/train': 1.403064489364624} 11/07/2021 02:16:58 - INFO - __main__ - Step 35829: {'lr': 0.00043826986163711835, 'samples': 6879168, 'steps': 35828, 'loss/train': 1.2509788274765015} 11/07/2021 02:16:58 - INFO - __main__ - Step 35830: {'lr': 0.000438266370130563, 'samples': 6879360, 'steps': 35829, 'loss/train': 1.7037445306777954} 11/07/2021 02:16:59 - INFO - __main__ - Step 35831: {'lr': 0.0004382628785391778, 'samples': 6879552, 'steps': 35830, 'loss/train': 2.075359582901001} 11/07/2021 02:16:59 - INFO - __main__ - Step 35832: {'lr': 0.00043825938686296417, 'samples': 6879744, 'steps': 35831, 'loss/train': 1.9884377717971802} 11/07/2021 02:17:00 - INFO - __main__ - Step 35833: {'lr': 0.00043825589510192376, 'samples': 6879936, 'steps': 35832, 'loss/train': 1.0667155981063843} 11/07/2021 02:17:00 - INFO - __main__ - Step 35834: {'lr': 0.0004382524032560582, 'samples': 6880128, 'steps': 35833, 'loss/train': 1.533860683441162} 11/07/2021 02:17:00 - INFO - __main__ - Step 35835: {'lr': 0.000438248911325369, 'samples': 6880320, 'steps': 35834, 'loss/train': 1.6826927661895752} 11/07/2021 02:17:01 - INFO - __main__ - Step 35836: {'lr': 0.00043824541930985775, 'samples': 6880512, 'steps': 35835, 'loss/train': 1.4012782573699951} 11/07/2021 02:17:02 - INFO - __main__ - Step 35837: {'lr': 0.0004382419272095259, 'samples': 6880704, 'steps': 35836, 'loss/train': 1.4396758079528809} 11/07/2021 02:17:02 - INFO - __main__ - Step 35838: {'lr': 0.00043823843502437533, 'samples': 6880896, 'steps': 35837, 'loss/train': 1.3039255142211914} 11/07/2021 02:17:02 - INFO - __main__ - Step 35839: {'lr': 0.00043823494275440733, 'samples': 6881088, 'steps': 35838, 'loss/train': 1.4565870761871338} 11/07/2021 02:17:03 - INFO - __main__ - Step 35840: {'lr': 0.0004382314503996236, 'samples': 6881280, 'steps': 35839, 'loss/train': 1.4898042678833008} 11/07/2021 02:17:03 - INFO - __main__ - Step 35841: {'lr': 0.0004382279579600256, 'samples': 6881472, 'steps': 35840, 'loss/train': 1.6548384428024292} 11/07/2021 02:17:04 - INFO - __main__ - Step 35842: {'lr': 0.0004382244654356151, 'samples': 6881664, 'steps': 35841, 'loss/train': 1.4465525150299072} 11/07/2021 02:17:04 - INFO - __main__ - Step 35843: {'lr': 0.0004382209728263935, 'samples': 6881856, 'steps': 35842, 'loss/train': 1.3423986434936523} 11/07/2021 02:17:05 - INFO - __main__ - Step 35844: {'lr': 0.0004382174801323624, 'samples': 6882048, 'steps': 35843, 'loss/train': 0.9695390462875366} 11/07/2021 02:17:05 - INFO - __main__ - Step 35845: {'lr': 0.00043821398735352344, 'samples': 6882240, 'steps': 35844, 'loss/train': 1.79738450050354} 11/07/2021 02:17:05 - INFO - __main__ - Step 35846: {'lr': 0.0004382104944898782, 'samples': 6882432, 'steps': 35845, 'loss/train': 1.3911375999450684} 11/07/2021 02:17:07 - INFO - __main__ - Step 35847: {'lr': 0.00043820700154142825, 'samples': 6882624, 'steps': 35846, 'loss/train': 1.7680792808532715} 11/07/2021 02:17:07 - INFO - __main__ - Step 35848: {'lr': 0.00043820350850817504, 'samples': 6882816, 'steps': 35847, 'loss/train': 1.1441446542739868} 11/07/2021 02:17:07 - INFO - __main__ - Step 35849: {'lr': 0.00043820001539012025, 'samples': 6883008, 'steps': 35848, 'loss/train': 1.174292802810669} 11/07/2021 02:17:08 - INFO - __main__ - Step 35850: {'lr': 0.00043819652218726545, 'samples': 6883200, 'steps': 35849, 'loss/train': 1.4147001504898071} 11/07/2021 02:17:08 - INFO - __main__ - Step 35851: {'lr': 0.0004381930288996122, 'samples': 6883392, 'steps': 35850, 'loss/train': 1.5950745344161987} 11/07/2021 02:17:09 - INFO - __main__ - Step 35852: {'lr': 0.0004381895355271621, 'samples': 6883584, 'steps': 35851, 'loss/train': 1.5812777280807495} 11/07/2021 02:17:09 - INFO - __main__ - Step 35853: {'lr': 0.00043818604206991664, 'samples': 6883776, 'steps': 35852, 'loss/train': 1.6486752033233643} 11/07/2021 02:17:10 - INFO - __main__ - Step 35854: {'lr': 0.0004381825485278775, 'samples': 6883968, 'steps': 35853, 'loss/train': 1.576576828956604} 11/07/2021 02:17:10 - INFO - __main__ - Step 35855: {'lr': 0.00043817905490104613, 'samples': 6884160, 'steps': 35854, 'loss/train': 1.4157003164291382} 11/07/2021 02:17:10 - INFO - __main__ - Step 35856: {'lr': 0.00043817556118942426, 'samples': 6884352, 'steps': 35855, 'loss/train': 1.8426508903503418} 11/07/2021 02:17:11 - INFO - __main__ - Step 35857: {'lr': 0.0004381720673930134, 'samples': 6884544, 'steps': 35856, 'loss/train': 1.4605894088745117} 11/07/2021 02:17:12 - INFO - __main__ - Step 35858: {'lr': 0.00043816857351181503, 'samples': 6884736, 'steps': 35857, 'loss/train': 1.3808422088623047} 11/07/2021 02:17:12 - INFO - __main__ - Step 35859: {'lr': 0.0004381650795458309, 'samples': 6884928, 'steps': 35858, 'loss/train': 1.7283581495285034} 11/07/2021 02:17:12 - INFO - __main__ - Step 35860: {'lr': 0.0004381615854950625, 'samples': 6885120, 'steps': 35859, 'loss/train': 1.409956455230713} 11/07/2021 02:17:13 - INFO - __main__ - Step 35861: {'lr': 0.0004381580913595113, 'samples': 6885312, 'steps': 35860, 'loss/train': 1.3731625080108643} 11/07/2021 02:17:14 - INFO - __main__ - Step 35862: {'lr': 0.000438154597139179, 'samples': 6885504, 'steps': 35861, 'loss/train': 1.197447657585144} 11/07/2021 02:17:14 - INFO - __main__ - Step 35863: {'lr': 0.0004381511028340671, 'samples': 6885696, 'steps': 35862, 'loss/train': 1.5057181119918823} 11/07/2021 02:17:14 - INFO - __main__ - Step 35864: {'lr': 0.0004381476084441773, 'samples': 6885888, 'steps': 35863, 'loss/train': 1.9163185358047485} 11/07/2021 02:17:15 - INFO - __main__ - Step 35865: {'lr': 0.00043814411396951103, 'samples': 6886080, 'steps': 35864, 'loss/train': 1.081902027130127} 11/07/2021 02:17:15 - INFO - __main__ - Step 35866: {'lr': 0.00043814061941007, 'samples': 6886272, 'steps': 35865, 'loss/train': 1.0974922180175781} 11/07/2021 02:17:16 - INFO - __main__ - Step 35867: {'lr': 0.00043813712476585564, 'samples': 6886464, 'steps': 35866, 'loss/train': 1.9265427589416504} 11/07/2021 02:17:16 - INFO - __main__ - Step 35868: {'lr': 0.00043813363003686963, 'samples': 6886656, 'steps': 35867, 'loss/train': 1.4821633100509644} 11/07/2021 02:17:17 - INFO - __main__ - Step 35869: {'lr': 0.00043813013522311353, 'samples': 6886848, 'steps': 35868, 'loss/train': 1.6978538036346436} 11/07/2021 02:17:17 - INFO - __main__ - Step 35870: {'lr': 0.0004381266403245888, 'samples': 6887040, 'steps': 35869, 'loss/train': 1.4966309070587158} 11/07/2021 02:17:17 - INFO - __main__ - Step 35871: {'lr': 0.00043812314534129716, 'samples': 6887232, 'steps': 35870, 'loss/train': 1.0092236995697021} 11/07/2021 02:17:19 - INFO - __main__ - Step 35872: {'lr': 0.0004381196502732402, 'samples': 6887424, 'steps': 35871, 'loss/train': 1.904168963432312} 11/07/2021 02:17:19 - INFO - __main__ - Step 35873: {'lr': 0.00043811615512041934, 'samples': 6887616, 'steps': 35872, 'loss/train': 1.436159610748291} 11/07/2021 02:17:19 - INFO - __main__ - Step 35874: {'lr': 0.00043811265988283625, 'samples': 6887808, 'steps': 35873, 'loss/train': 1.665523648262024} 11/07/2021 02:17:20 - INFO - __main__ - Step 35875: {'lr': 0.00043810916456049257, 'samples': 6888000, 'steps': 35874, 'loss/train': 0.8110805153846741} 11/07/2021 02:17:20 - INFO - __main__ - Step 35876: {'lr': 0.00043810566915338965, 'samples': 6888192, 'steps': 35875, 'loss/train': 1.5055104494094849} 11/07/2021 02:17:21 - INFO - __main__ - Step 35877: {'lr': 0.0004381021736615294, 'samples': 6888384, 'steps': 35876, 'loss/train': 1.3504819869995117} 11/07/2021 02:17:21 - INFO - __main__ - Step 35878: {'lr': 0.0004380986780849131, 'samples': 6888576, 'steps': 35877, 'loss/train': 1.6569801568984985} 11/07/2021 02:17:22 - INFO - __main__ - Step 35879: {'lr': 0.0004380951824235425, 'samples': 6888768, 'steps': 35878, 'loss/train': 1.4905952215194702} 11/07/2021 02:17:22 - INFO - __main__ - Step 35880: {'lr': 0.00043809168667741907, 'samples': 6888960, 'steps': 35879, 'loss/train': 1.5604488849639893} 11/07/2021 02:17:22 - INFO - __main__ - Step 35881: {'lr': 0.0004380881908465445, 'samples': 6889152, 'steps': 35880, 'loss/train': 0.3603174090385437} 11/07/2021 02:17:23 - INFO - __main__ - Step 35882: {'lr': 0.0004380846949309202, 'samples': 6889344, 'steps': 35881, 'loss/train': 1.644689679145813} 11/07/2021 02:17:24 - INFO - __main__ - Step 35883: {'lr': 0.00043808119893054787, 'samples': 6889536, 'steps': 35882, 'loss/train': 1.728338599205017} 11/07/2021 02:17:24 - INFO - __main__ - Step 35884: {'lr': 0.0004380777028454291, 'samples': 6889728, 'steps': 35883, 'loss/train': 1.7343021631240845} 11/07/2021 02:17:25 - INFO - __main__ - Step 35885: {'lr': 0.0004380742066755654, 'samples': 6889920, 'steps': 35884, 'loss/train': 1.5336843729019165} 11/07/2021 02:17:25 - INFO - __main__ - Step 35886: {'lr': 0.0004380707104209583, 'samples': 6890112, 'steps': 35885, 'loss/train': 1.9161925315856934} 11/07/2021 02:17:25 - INFO - __main__ - Step 35887: {'lr': 0.0004380672140816095, 'samples': 6890304, 'steps': 35886, 'loss/train': 1.350689172744751} 11/07/2021 02:17:26 - INFO - __main__ - Step 35888: {'lr': 0.0004380637176575205, 'samples': 6890496, 'steps': 35887, 'loss/train': 1.5569615364074707} 11/07/2021 02:17:27 - INFO - __main__ - Step 35889: {'lr': 0.00043806022114869294, 'samples': 6890688, 'steps': 35888, 'loss/train': 1.4751020669937134} 11/07/2021 02:17:27 - INFO - __main__ - Step 35890: {'lr': 0.0004380567245551282, 'samples': 6890880, 'steps': 35889, 'loss/train': 1.6205087900161743} 11/07/2021 02:17:27 - INFO - __main__ - Step 35891: {'lr': 0.0004380532278768282, 'samples': 6891072, 'steps': 35890, 'loss/train': 1.6306557655334473} 11/07/2021 02:17:28 - INFO - __main__ - Step 35892: {'lr': 0.0004380497311137942, 'samples': 6891264, 'steps': 35891, 'loss/train': 1.3647589683532715} 11/07/2021 02:17:28 - INFO - __main__ - Step 35893: {'lr': 0.00043804623426602784, 'samples': 6891456, 'steps': 35892, 'loss/train': 1.743046760559082} 11/07/2021 02:17:29 - INFO - __main__ - Step 35894: {'lr': 0.00043804273733353085, 'samples': 6891648, 'steps': 35893, 'loss/train': 1.4048981666564941} 11/07/2021 02:17:30 - INFO - __main__ - Step 35895: {'lr': 0.0004380392403163047, 'samples': 6891840, 'steps': 35894, 'loss/train': 1.4988237619400024} 11/07/2021 02:17:30 - INFO - __main__ - Step 35896: {'lr': 0.00043803574321435093, 'samples': 6892032, 'steps': 35895, 'loss/train': 1.2499364614486694} 11/07/2021 02:17:31 - INFO - __main__ - Step 35897: {'lr': 0.00043803224602767115, 'samples': 6892224, 'steps': 35896, 'loss/train': 1.531029224395752} 11/07/2021 02:17:31 - INFO - __main__ - Step 35898: {'lr': 0.000438028748756267, 'samples': 6892416, 'steps': 35897, 'loss/train': 0.14744128286838531} 11/07/2021 02:17:32 - INFO - __main__ - Step 35899: {'lr': 0.00043802525140013994, 'samples': 6892608, 'steps': 35898, 'loss/train': 1.589914083480835} 11/07/2021 02:17:32 - INFO - __main__ - Step 35900: {'lr': 0.00043802175395929156, 'samples': 6892800, 'steps': 35899, 'loss/train': 1.3293200731277466} 11/07/2021 02:17:33 - INFO - __main__ - Step 35901: {'lr': 0.00043801825643372363, 'samples': 6892992, 'steps': 35900, 'loss/train': 1.0679981708526611} 11/07/2021 02:17:33 - INFO - __main__ - Step 35902: {'lr': 0.00043801475882343743, 'samples': 6893184, 'steps': 35901, 'loss/train': 1.5215486288070679} 11/07/2021 02:17:33 - INFO - __main__ - Step 35903: {'lr': 0.0004380112611284347, 'samples': 6893376, 'steps': 35902, 'loss/train': 1.6985673904418945} 11/07/2021 02:17:34 - INFO - __main__ - Step 35904: {'lr': 0.00043800776334871705, 'samples': 6893568, 'steps': 35903, 'loss/train': 1.8185510635375977} 11/07/2021 02:17:35 - INFO - __main__ - Step 35905: {'lr': 0.000438004265484286, 'samples': 6893760, 'steps': 35904, 'loss/train': 1.454746127128601} 11/07/2021 02:17:35 - INFO - __main__ - Step 35906: {'lr': 0.0004380007675351431, 'samples': 6893952, 'steps': 35905, 'loss/train': 0.9168397784233093} 11/07/2021 02:17:35 - INFO - __main__ - Step 35907: {'lr': 0.00043799726950128997, 'samples': 6894144, 'steps': 35906, 'loss/train': 1.206464171409607} 11/07/2021 02:17:36 - INFO - __main__ - Step 35908: {'lr': 0.0004379937713827282, 'samples': 6894336, 'steps': 35907, 'loss/train': 1.4056463241577148} 11/07/2021 02:17:37 - INFO - __main__ - Step 35909: {'lr': 0.0004379902731794593, 'samples': 6894528, 'steps': 35908, 'loss/train': 1.3022297620773315} 11/07/2021 02:17:37 - INFO - __main__ - Step 35910: {'lr': 0.00043798677489148487, 'samples': 6894720, 'steps': 35909, 'loss/train': 1.1841340065002441} 11/07/2021 02:17:37 - INFO - __main__ - Step 35911: {'lr': 0.0004379832765188065, 'samples': 6894912, 'steps': 35910, 'loss/train': 1.4022372961044312} 11/07/2021 02:17:38 - INFO - __main__ - Step 35912: {'lr': 0.00043797977806142585, 'samples': 6895104, 'steps': 35911, 'loss/train': 1.616759181022644} 11/07/2021 02:17:38 - INFO - __main__ - Step 35913: {'lr': 0.0004379762795193443, 'samples': 6895296, 'steps': 35912, 'loss/train': 1.444366216659546} 11/07/2021 02:17:39 - INFO - __main__ - Step 35914: {'lr': 0.0004379727808925636, 'samples': 6895488, 'steps': 35913, 'loss/train': 0.9257416129112244} 11/07/2021 02:17:40 - INFO - __main__ - Step 35915: {'lr': 0.00043796928218108527, 'samples': 6895680, 'steps': 35914, 'loss/train': 1.7744345664978027} 11/07/2021 02:17:40 - INFO - __main__ - Step 35916: {'lr': 0.0004379657833849109, 'samples': 6895872, 'steps': 35915, 'loss/train': 1.4082496166229248} 11/07/2021 02:17:40 - INFO - __main__ - Step 35917: {'lr': 0.000437962284504042, 'samples': 6896064, 'steps': 35916, 'loss/train': 1.4445488452911377} 11/07/2021 02:17:41 - INFO - __main__ - Step 35918: {'lr': 0.00043795878553848025, 'samples': 6896256, 'steps': 35917, 'loss/train': 3.3619754314422607} 11/07/2021 02:17:41 - INFO - __main__ - Step 35919: {'lr': 0.0004379552864882271, 'samples': 6896448, 'steps': 35918, 'loss/train': 1.4456998109817505} 11/07/2021 02:17:42 - INFO - __main__ - Step 35920: {'lr': 0.00043795178735328425, 'samples': 6896640, 'steps': 35919, 'loss/train': 1.248893141746521} 11/07/2021 02:17:42 - INFO - __main__ - Step 35921: {'lr': 0.0004379482881336532, 'samples': 6896832, 'steps': 35920, 'loss/train': 1.5730149745941162} 11/07/2021 02:17:43 - INFO - __main__ - Step 35922: {'lr': 0.0004379447888293355, 'samples': 6897024, 'steps': 35921, 'loss/train': 1.6229360103607178} 11/07/2021 02:17:43 - INFO - __main__ - Step 35923: {'lr': 0.0004379412894403328, 'samples': 6897216, 'steps': 35922, 'loss/train': 1.5293281078338623} 11/07/2021 02:17:43 - INFO - __main__ - Step 35924: {'lr': 0.0004379377899666468, 'samples': 6897408, 'steps': 35923, 'loss/train': 1.3089522123336792} 11/07/2021 02:17:44 - INFO - __main__ - Step 35925: {'lr': 0.0004379342904082788, 'samples': 6897600, 'steps': 35924, 'loss/train': 1.3258305788040161} 11/07/2021 02:17:45 - INFO - __main__ - Step 35926: {'lr': 0.00043793079076523053, 'samples': 6897792, 'steps': 35925, 'loss/train': 0.957828164100647} 11/07/2021 02:17:45 - INFO - __main__ - Step 35927: {'lr': 0.0004379272910375035, 'samples': 6897984, 'steps': 35926, 'loss/train': 1.5545063018798828} 11/07/2021 02:17:45 - INFO - __main__ - Step 35928: {'lr': 0.0004379237912250994, 'samples': 6898176, 'steps': 35927, 'loss/train': 1.6177557706832886} 11/07/2021 02:17:46 - INFO - __main__ - Step 35929: {'lr': 0.0004379202913280197, 'samples': 6898368, 'steps': 35928, 'loss/train': 1.0915831327438354} 11/07/2021 02:17:47 - INFO - __main__ - Step 35930: {'lr': 0.0004379167913462661, 'samples': 6898560, 'steps': 35929, 'loss/train': 1.5333443880081177} 11/07/2021 02:17:47 - INFO - __main__ - Step 35931: {'lr': 0.00043791329127984004, 'samples': 6898752, 'steps': 35930, 'loss/train': 1.651257872581482} 11/07/2021 02:17:48 - INFO - __main__ - Step 35932: {'lr': 0.0004379097911287431, 'samples': 6898944, 'steps': 35931, 'loss/train': 1.124588966369629} 11/07/2021 02:17:48 - INFO - __main__ - Step 35933: {'lr': 0.000437906290892977, 'samples': 6899136, 'steps': 35932, 'loss/train': 1.3447669744491577} 11/07/2021 02:17:48 - INFO - __main__ - Step 35934: {'lr': 0.00043790279057254314, 'samples': 6899328, 'steps': 35933, 'loss/train': 2.0153403282165527} 11/07/2021 02:17:49 - INFO - __main__ - Step 35935: {'lr': 0.00043789929016744324, 'samples': 6899520, 'steps': 35934, 'loss/train': 1.6986846923828125} 11/07/2021 02:17:50 - INFO - __main__ - Step 35936: {'lr': 0.0004378957896776787, 'samples': 6899712, 'steps': 35935, 'loss/train': 1.446560025215149} 11/07/2021 02:17:50 - INFO - __main__ - Step 35937: {'lr': 0.0004378922891032514, 'samples': 6899904, 'steps': 35936, 'loss/train': 1.6637734174728394} 11/07/2021 02:17:50 - INFO - __main__ - Step 35938: {'lr': 0.0004378887884441626, 'samples': 6900096, 'steps': 35937, 'loss/train': 0.8005592823028564} 11/07/2021 02:17:51 - INFO - __main__ - Step 35939: {'lr': 0.000437885287700414, 'samples': 6900288, 'steps': 35938, 'loss/train': 1.0051047801971436} 11/07/2021 02:17:51 - INFO - __main__ - Step 35940: {'lr': 0.0004378817868720073, 'samples': 6900480, 'steps': 35939, 'loss/train': 2.8363940715789795} 11/07/2021 02:17:52 - INFO - __main__ - Step 35941: {'lr': 0.0004378782859589439, 'samples': 6900672, 'steps': 35940, 'loss/train': 1.4257400035858154} 11/07/2021 02:17:52 - INFO - __main__ - Step 35942: {'lr': 0.00043787478496122546, 'samples': 6900864, 'steps': 35941, 'loss/train': 1.3198901414871216} 11/07/2021 02:17:53 - INFO - __main__ - Step 35943: {'lr': 0.0004378712838788536, 'samples': 6901056, 'steps': 35942, 'loss/train': 1.3227993249893188} 11/07/2021 02:17:53 - INFO - __main__ - Step 35944: {'lr': 0.0004378677827118297, 'samples': 6901248, 'steps': 35943, 'loss/train': 1.7186728715896606} 11/07/2021 02:17:53 - INFO - __main__ - Step 35945: {'lr': 0.0004378642814601556, 'samples': 6901440, 'steps': 35944, 'loss/train': 1.5145400762557983} 11/07/2021 02:17:55 - INFO - __main__ - Step 35946: {'lr': 0.0004378607801238327, 'samples': 6901632, 'steps': 35945, 'loss/train': 1.6515378952026367} 11/07/2021 02:17:55 - INFO - __main__ - Step 35947: {'lr': 0.00043785727870286265, 'samples': 6901824, 'steps': 35946, 'loss/train': 2.2473514080047607} 11/07/2021 02:17:55 - INFO - __main__ - Step 35948: {'lr': 0.00043785377719724697, 'samples': 6902016, 'steps': 35947, 'loss/train': 1.0143795013427734} 11/07/2021 02:17:56 - INFO - __main__ - Step 35949: {'lr': 0.0004378502756069873, 'samples': 6902208, 'steps': 35948, 'loss/train': 0.6523028612136841} 11/07/2021 02:17:56 - INFO - __main__ - Step 35950: {'lr': 0.0004378467739320852, 'samples': 6902400, 'steps': 35949, 'loss/train': 1.716147780418396} 11/07/2021 02:17:57 - INFO - __main__ - Step 35951: {'lr': 0.0004378432721725422, 'samples': 6902592, 'steps': 35950, 'loss/train': 1.1859630346298218} 11/07/2021 02:17:57 - INFO - __main__ - Step 35952: {'lr': 0.00043783977032836, 'samples': 6902784, 'steps': 35951, 'loss/train': 1.202818751335144} 11/07/2021 02:17:58 - INFO - __main__ - Step 35953: {'lr': 0.00043783626839954005, 'samples': 6902976, 'steps': 35952, 'loss/train': 1.3891003131866455} 11/07/2021 02:17:58 - INFO - __main__ - Step 35954: {'lr': 0.0004378327663860839, 'samples': 6903168, 'steps': 35953, 'loss/train': 1.661206841468811} 11/07/2021 02:17:58 - INFO - __main__ - Step 35955: {'lr': 0.00043782926428799333, 'samples': 6903360, 'steps': 35954, 'loss/train': 1.4225733280181885} 11/07/2021 02:17:59 - INFO - __main__ - Step 35956: {'lr': 0.0004378257621052698, 'samples': 6903552, 'steps': 35955, 'loss/train': 1.5549224615097046} 11/07/2021 02:18:00 - INFO - __main__ - Step 35957: {'lr': 0.0004378222598379148, 'samples': 6903744, 'steps': 35956, 'loss/train': 1.6702907085418701} 11/07/2021 02:18:00 - INFO - __main__ - Step 35958: {'lr': 0.00043781875748593, 'samples': 6903936, 'steps': 35957, 'loss/train': 1.0661641359329224} 11/07/2021 02:18:00 - INFO - __main__ - Step 35959: {'lr': 0.000437815255049317, 'samples': 6904128, 'steps': 35958, 'loss/train': 1.3309332132339478} 11/07/2021 02:18:01 - INFO - __main__ - Step 35960: {'lr': 0.0004378117525280773, 'samples': 6904320, 'steps': 35959, 'loss/train': 1.4166483879089355} 11/07/2021 02:18:02 - INFO - __main__ - Step 35961: {'lr': 0.00043780824992221257, 'samples': 6904512, 'steps': 35960, 'loss/train': 1.2731068134307861} 11/07/2021 02:18:02 - INFO - __main__ - Step 35962: {'lr': 0.00043780474723172433, 'samples': 6904704, 'steps': 35961, 'loss/train': 1.650513768196106} 11/07/2021 02:18:02 - INFO - __main__ - Step 35963: {'lr': 0.00043780124445661416, 'samples': 6904896, 'steps': 35962, 'loss/train': 1.721779704093933} 11/07/2021 02:18:03 - INFO - __main__ - Step 35964: {'lr': 0.00043779774159688364, 'samples': 6905088, 'steps': 35963, 'loss/train': 1.142958402633667} 11/07/2021 02:18:03 - INFO - __main__ - Step 35965: {'lr': 0.00043779423865253434, 'samples': 6905280, 'steps': 35964, 'loss/train': 1.630504846572876} 11/07/2021 02:18:04 - INFO - __main__ - Step 35966: {'lr': 0.00043779073562356783, 'samples': 6905472, 'steps': 35965, 'loss/train': 1.7916053533554077} 11/07/2021 02:18:04 - INFO - __main__ - Step 35967: {'lr': 0.0004377872325099858, 'samples': 6905664, 'steps': 35966, 'loss/train': 1.4563828706741333} 11/07/2021 02:18:05 - INFO - __main__ - Step 35968: {'lr': 0.00043778372931178974, 'samples': 6905856, 'steps': 35967, 'loss/train': 1.1095385551452637} 11/07/2021 02:18:05 - INFO - __main__ - Step 35969: {'lr': 0.00043778022602898115, 'samples': 6906048, 'steps': 35968, 'loss/train': 1.3237872123718262} 11/07/2021 02:18:05 - INFO - __main__ - Step 35970: {'lr': 0.0004377767226615617, 'samples': 6906240, 'steps': 35969, 'loss/train': 1.250608205795288} 11/07/2021 02:18:07 - INFO - __main__ - Step 35971: {'lr': 0.000437773219209533, 'samples': 6906432, 'steps': 35970, 'loss/train': 1.6383625268936157} 11/07/2021 02:18:07 - INFO - __main__ - Step 35972: {'lr': 0.00043776971567289656, 'samples': 6906624, 'steps': 35971, 'loss/train': 1.3096139430999756} 11/07/2021 02:18:07 - INFO - __main__ - Step 35973: {'lr': 0.00043776621205165404, 'samples': 6906816, 'steps': 35972, 'loss/train': 1.204809546470642} 11/07/2021 02:18:08 - INFO - __main__ - Step 35974: {'lr': 0.0004377627083458069, 'samples': 6907008, 'steps': 35973, 'loss/train': 1.3687340021133423} 11/07/2021 02:18:08 - INFO - __main__ - Step 35975: {'lr': 0.0004377592045553568, 'samples': 6907200, 'steps': 35974, 'loss/train': 1.6243177652359009} 11/07/2021 02:18:09 - INFO - __main__ - Step 35976: {'lr': 0.00043775570068030524, 'samples': 6907392, 'steps': 35975, 'loss/train': 1.8994401693344116} 11/07/2021 02:18:09 - INFO - __main__ - Step 35977: {'lr': 0.0004377521967206539, 'samples': 6907584, 'steps': 35976, 'loss/train': 1.6218942403793335} 11/07/2021 02:18:10 - INFO - __main__ - Step 35978: {'lr': 0.00043774869267640436, 'samples': 6907776, 'steps': 35977, 'loss/train': 1.7045758962631226} 11/07/2021 02:18:10 - INFO - __main__ - Step 35979: {'lr': 0.0004377451885475581, 'samples': 6907968, 'steps': 35978, 'loss/train': 0.8041215538978577} 11/07/2021 02:18:10 - INFO - __main__ - Step 35980: {'lr': 0.0004377416843341168, 'samples': 6908160, 'steps': 35979, 'loss/train': 1.9746583700180054} 11/07/2021 02:18:12 - INFO - __main__ - Step 35981: {'lr': 0.00043773818003608203, 'samples': 6908352, 'steps': 35980, 'loss/train': 1.262722134590149} 11/07/2021 02:18:12 - INFO - __main__ - Step 35982: {'lr': 0.00043773467565345523, 'samples': 6908544, 'steps': 35981, 'loss/train': 1.465383768081665} 11/07/2021 02:18:12 - INFO - __main__ - Step 35983: {'lr': 0.0004377311711862381, 'samples': 6908736, 'steps': 35982, 'loss/train': 1.469982385635376} 11/07/2021 02:18:13 - INFO - __main__ - Step 35984: {'lr': 0.0004377276666344322, 'samples': 6908928, 'steps': 35983, 'loss/train': 1.7502566576004028} 11/07/2021 02:18:13 - INFO - __main__ - Step 35985: {'lr': 0.00043772416199803924, 'samples': 6909120, 'steps': 35984, 'loss/train': 1.5767804384231567} 11/07/2021 02:18:13 - INFO - __main__ - Step 35986: {'lr': 0.00043772065727706053, 'samples': 6909312, 'steps': 35985, 'loss/train': 1.3555277585983276} 11/07/2021 02:18:15 - INFO - __main__ - Step 35987: {'lr': 0.0004377171524714978, 'samples': 6909504, 'steps': 35986, 'loss/train': 1.3033314943313599} 11/07/2021 02:18:15 - INFO - __main__ - Step 35988: {'lr': 0.0004377136475813527, 'samples': 6909696, 'steps': 35987, 'loss/train': 1.681982159614563} 11/07/2021 02:18:16 - INFO - __main__ - Step 35989: {'lr': 0.0004377101426066266, 'samples': 6909888, 'steps': 35988, 'loss/train': 1.3505668640136719} 11/07/2021 02:18:16 - INFO - __main__ - Step 35990: {'lr': 0.0004377066375473213, 'samples': 6910080, 'steps': 35989, 'loss/train': 0.25427624583244324} 11/07/2021 02:18:16 - INFO - __main__ - Step 35991: {'lr': 0.00043770313240343826, 'samples': 6910272, 'steps': 35990, 'loss/train': 1.435550570487976} 11/07/2021 02:18:17 - INFO - __main__ - Step 35992: {'lr': 0.00043769962717497916, 'samples': 6910464, 'steps': 35991, 'loss/train': 0.8674271106719971} 11/07/2021 02:18:18 - INFO - __main__ - Step 35993: {'lr': 0.0004376961218619454, 'samples': 6910656, 'steps': 35992, 'loss/train': 1.3199352025985718} 11/07/2021 02:18:18 - INFO - __main__ - Step 35994: {'lr': 0.00043769261646433867, 'samples': 6910848, 'steps': 35993, 'loss/train': 0.7368350028991699} 11/07/2021 02:18:18 - INFO - __main__ - Step 35995: {'lr': 0.0004376891109821606, 'samples': 6911040, 'steps': 35994, 'loss/train': 1.4026904106140137} 11/07/2021 02:18:19 - INFO - __main__ - Step 35996: {'lr': 0.0004376856054154127, 'samples': 6911232, 'steps': 35995, 'loss/train': 1.77277672290802} 11/07/2021 02:18:20 - INFO - __main__ - Step 35997: {'lr': 0.00043768209976409645, 'samples': 6911424, 'steps': 35996, 'loss/train': 1.7561774253845215} 11/07/2021 02:18:20 - INFO - __main__ - Step 35998: {'lr': 0.0004376785940282137, 'samples': 6911616, 'steps': 35997, 'loss/train': 0.8580551147460938} 11/07/2021 02:18:20 - INFO - __main__ - Step 35999: {'lr': 0.0004376750882077658, 'samples': 6911808, 'steps': 35998, 'loss/train': 1.5717216730117798} 11/07/2021 02:18:21 - INFO - __main__ - Step 36000: {'lr': 0.0004376715823027544, 'samples': 6912000, 'steps': 35999, 'loss/train': 1.4603431224822998} 11/07/2021 02:18:21 - INFO - __main__ - Step 36001: {'lr': 0.0004376680763131811, 'samples': 6912192, 'steps': 36000, 'loss/train': 0.7720442414283752} 11/07/2021 02:18:21 - INFO - __main__ - Step 36002: {'lr': 0.0004376645702390475, 'samples': 6912384, 'steps': 36001, 'loss/train': 1.1306639909744263} 11/07/2021 02:18:23 - INFO - __main__ - Step 36003: {'lr': 0.00043766106408035506, 'samples': 6912576, 'steps': 36002, 'loss/train': 0.8562272787094116} 11/07/2021 02:18:23 - INFO - __main__ - Step 36004: {'lr': 0.0004376575578371055, 'samples': 6912768, 'steps': 36003, 'loss/train': 1.6573060750961304} 11/07/2021 02:18:23 - INFO - __main__ - Step 36005: {'lr': 0.0004376540515093003, 'samples': 6912960, 'steps': 36004, 'loss/train': 1.3827896118164062} 11/07/2021 02:18:24 - INFO - __main__ - Step 36006: {'lr': 0.0004376505450969411, 'samples': 6913152, 'steps': 36005, 'loss/train': 1.5291324853897095} 11/07/2021 02:18:24 - INFO - __main__ - Step 36007: {'lr': 0.0004376470386000294, 'samples': 6913344, 'steps': 36006, 'loss/train': 1.5615692138671875} 11/07/2021 02:18:25 - INFO - __main__ - Step 36008: {'lr': 0.0004376435320185669, 'samples': 6913536, 'steps': 36007, 'loss/train': 1.4178293943405151} 11/07/2021 02:18:26 - INFO - __main__ - Step 36009: {'lr': 0.0004376400253525551, 'samples': 6913728, 'steps': 36008, 'loss/train': 1.5691521167755127} 11/07/2021 02:18:26 - INFO - __main__ - Step 36010: {'lr': 0.0004376365186019956, 'samples': 6913920, 'steps': 36009, 'loss/train': 1.0562870502471924} 11/07/2021 02:18:26 - INFO - __main__ - Step 36011: {'lr': 0.00043763301176689, 'samples': 6914112, 'steps': 36010, 'loss/train': 2.8755180835723877} 11/07/2021 02:18:27 - INFO - __main__ - Step 36012: {'lr': 0.0004376295048472399, 'samples': 6914304, 'steps': 36011, 'loss/train': 1.0411345958709717} 11/07/2021 02:18:28 - INFO - __main__ - Step 36013: {'lr': 0.0004376259978430468, 'samples': 6914496, 'steps': 36012, 'loss/train': 1.9251238107681274} 11/07/2021 02:18:28 - INFO - __main__ - Step 36014: {'lr': 0.0004376224907543123, 'samples': 6914688, 'steps': 36013, 'loss/train': 1.528991937637329} 11/07/2021 02:18:28 - INFO - __main__ - Step 36015: {'lr': 0.00043761898358103804, 'samples': 6914880, 'steps': 36014, 'loss/train': 1.6339377164840698} 11/07/2021 02:18:29 - INFO - __main__ - Step 36016: {'lr': 0.0004376154763232255, 'samples': 6915072, 'steps': 36015, 'loss/train': 1.160670518875122} 11/07/2021 02:18:29 - INFO - __main__ - Step 36017: {'lr': 0.0004376119689808764, 'samples': 6915264, 'steps': 36016, 'loss/train': 1.3139722347259521} 11/07/2021 02:18:29 - INFO - __main__ - Step 36018: {'lr': 0.00043760846155399216, 'samples': 6915456, 'steps': 36017, 'loss/train': 1.5870747566223145} 11/07/2021 02:18:31 - INFO - __main__ - Step 36019: {'lr': 0.0004376049540425745, 'samples': 6915648, 'steps': 36018, 'loss/train': 1.3712278604507446} 11/07/2021 02:18:31 - INFO - __main__ - Step 36020: {'lr': 0.0004376014464466249, 'samples': 6915840, 'steps': 36019, 'loss/train': 1.959718942642212} 11/07/2021 02:18:31 - INFO - __main__ - Step 36021: {'lr': 0.0004375979387661451, 'samples': 6916032, 'steps': 36020, 'loss/train': 1.5496079921722412} 11/07/2021 02:18:32 - INFO - __main__ - Step 36022: {'lr': 0.0004375944310011364, 'samples': 6916224, 'steps': 36021, 'loss/train': 1.8799645900726318} 11/07/2021 02:18:32 - INFO - __main__ - Step 36023: {'lr': 0.00043759092315160064, 'samples': 6916416, 'steps': 36022, 'loss/train': 1.4155391454696655} 11/07/2021 02:18:33 - INFO - __main__ - Step 36024: {'lr': 0.00043758741521753925, 'samples': 6916608, 'steps': 36023, 'loss/train': 1.5093621015548706} 11/07/2021 02:18:33 - INFO - __main__ - Step 36025: {'lr': 0.0004375839071989539, 'samples': 6916800, 'steps': 36024, 'loss/train': 1.1388566493988037} 11/07/2021 02:18:34 - INFO - __main__ - Step 36026: {'lr': 0.00043758039909584613, 'samples': 6916992, 'steps': 36025, 'loss/train': 1.8856382369995117} 11/07/2021 02:18:34 - INFO - __main__ - Step 36027: {'lr': 0.0004375768909082175, 'samples': 6917184, 'steps': 36026, 'loss/train': 1.2582416534423828} 11/07/2021 02:18:35 - INFO - __main__ - Step 36028: {'lr': 0.0004375733826360697, 'samples': 6917376, 'steps': 36027, 'loss/train': 1.3716334104537964} 11/07/2021 02:18:35 - INFO - __main__ - Step 36029: {'lr': 0.0004375698742794042, 'samples': 6917568, 'steps': 36028, 'loss/train': 1.2687499523162842} 11/07/2021 02:18:36 - INFO - __main__ - Step 36030: {'lr': 0.0004375663658382225, 'samples': 6917760, 'steps': 36029, 'loss/train': 1.315381646156311} 11/07/2021 02:18:36 - INFO - __main__ - Step 36031: {'lr': 0.0004375628573125264, 'samples': 6917952, 'steps': 36030, 'loss/train': 1.1570274829864502} 11/07/2021 02:18:37 - INFO - __main__ - Step 36032: {'lr': 0.0004375593487023174, 'samples': 6918144, 'steps': 36031, 'loss/train': 1.5267812013626099} 11/07/2021 02:18:37 - INFO - __main__ - Step 36033: {'lr': 0.00043755584000759696, 'samples': 6918336, 'steps': 36032, 'loss/train': 1.284977912902832} 11/07/2021 02:18:38 - INFO - __main__ - Step 36034: {'lr': 0.0004375523312283668, 'samples': 6918528, 'steps': 36033, 'loss/train': 1.2571640014648438} 11/07/2021 02:18:38 - INFO - __main__ - Step 36035: {'lr': 0.00043754882236462844, 'samples': 6918720, 'steps': 36034, 'loss/train': 1.487125277519226} 11/07/2021 02:18:39 - INFO - __main__ - Step 36036: {'lr': 0.00043754531341638346, 'samples': 6918912, 'steps': 36035, 'loss/train': 2.0117814540863037} 11/07/2021 02:18:39 - INFO - __main__ - Step 36037: {'lr': 0.00043754180438363344, 'samples': 6919104, 'steps': 36036, 'loss/train': 0.9645311832427979} 11/07/2021 02:18:39 - INFO - __main__ - Step 36038: {'lr': 0.00043753829526638, 'samples': 6919296, 'steps': 36037, 'loss/train': 1.0889891386032104} 11/07/2021 02:18:40 - INFO - __main__ - Step 36039: {'lr': 0.0004375347860646247, 'samples': 6919488, 'steps': 36038, 'loss/train': 1.0317258834838867} 11/07/2021 02:18:41 - INFO - __main__ - Step 36040: {'lr': 0.00043753127677836917, 'samples': 6919680, 'steps': 36039, 'loss/train': 0.44519782066345215} 11/07/2021 02:18:41 - INFO - __main__ - Step 36041: {'lr': 0.0004375277674076149, 'samples': 6919872, 'steps': 36040, 'loss/train': 1.6520081758499146} 11/07/2021 02:18:41 - INFO - __main__ - Step 36042: {'lr': 0.0004375242579523635, 'samples': 6920064, 'steps': 36041, 'loss/train': 1.540985107421875} 11/07/2021 02:18:42 - INFO - __main__ - Step 36043: {'lr': 0.0004375207484126166, 'samples': 6920256, 'steps': 36042, 'loss/train': 1.9579271078109741} 11/07/2021 02:18:43 - INFO - __main__ - Step 36044: {'lr': 0.0004375172387883757, 'samples': 6920448, 'steps': 36043, 'loss/train': 1.5872255563735962} 11/07/2021 02:18:43 - INFO - __main__ - Step 36045: {'lr': 0.00043751372907964247, 'samples': 6920640, 'steps': 36044, 'loss/train': 1.1674432754516602} 11/07/2021 02:18:43 - INFO - __main__ - Step 36046: {'lr': 0.00043751021928641845, 'samples': 6920832, 'steps': 36045, 'loss/train': 1.4402272701263428} 11/07/2021 02:18:44 - INFO - __main__ - Step 36047: {'lr': 0.0004375067094087051, 'samples': 6921024, 'steps': 36046, 'loss/train': 1.7606974840164185} 11/07/2021 02:18:44 - INFO - __main__ - Step 36048: {'lr': 0.0004375031994465042, 'samples': 6921216, 'steps': 36047, 'loss/train': 1.1131813526153564} 11/07/2021 02:18:44 - INFO - __main__ - Step 36049: {'lr': 0.00043749968939981734, 'samples': 6921408, 'steps': 36048, 'loss/train': 1.9732738733291626} 11/07/2021 02:18:46 - INFO - __main__ - Step 36050: {'lr': 0.0004374961792686459, 'samples': 6921600, 'steps': 36049, 'loss/train': 1.4669445753097534} 11/07/2021 02:18:46 - INFO - __main__ - Step 36051: {'lr': 0.00043749266905299155, 'samples': 6921792, 'steps': 36050, 'loss/train': 0.8080386519432068} 11/07/2021 02:18:46 - INFO - __main__ - Step 36052: {'lr': 0.000437489158752856, 'samples': 6921984, 'steps': 36051, 'loss/train': 0.20430973172187805} 11/07/2021 02:18:47 - INFO - __main__ - Step 36053: {'lr': 0.00043748564836824065, 'samples': 6922176, 'steps': 36052, 'loss/train': 1.7568432092666626} 11/07/2021 02:18:47 - INFO - __main__ - Step 36054: {'lr': 0.0004374821378991473, 'samples': 6922368, 'steps': 36053, 'loss/train': 1.6545710563659668} 11/07/2021 02:18:48 - INFO - __main__ - Step 36055: {'lr': 0.0004374786273455772, 'samples': 6922560, 'steps': 36054, 'loss/train': 1.3741494417190552} 11/07/2021 02:18:49 - INFO - __main__ - Step 36056: {'lr': 0.0004374751167075322, 'samples': 6922752, 'steps': 36055, 'loss/train': 1.911236047744751} 11/07/2021 02:18:49 - INFO - __main__ - Step 36057: {'lr': 0.0004374716059850138, 'samples': 6922944, 'steps': 36056, 'loss/train': 1.8412857055664062} 11/07/2021 02:18:49 - INFO - __main__ - Step 36058: {'lr': 0.0004374680951780236, 'samples': 6923136, 'steps': 36057, 'loss/train': 1.6093480587005615} 11/07/2021 02:18:50 - INFO - __main__ - Step 36059: {'lr': 0.00043746458428656324, 'samples': 6923328, 'steps': 36058, 'loss/train': 1.5204285383224487} 11/07/2021 02:18:51 - INFO - __main__ - Step 36060: {'lr': 0.00043746107331063414, 'samples': 6923520, 'steps': 36059, 'loss/train': 1.6594641208648682} 11/07/2021 02:18:51 - INFO - __main__ - Step 36061: {'lr': 0.000437457562250238, 'samples': 6923712, 'steps': 36060, 'loss/train': 1.6589689254760742} 11/07/2021 02:18:51 - INFO - __main__ - Step 36062: {'lr': 0.0004374540511053763, 'samples': 6923904, 'steps': 36061, 'loss/train': 1.4177128076553345} 11/07/2021 02:18:52 - INFO - __main__ - Step 36063: {'lr': 0.00043745053987605075, 'samples': 6924096, 'steps': 36062, 'loss/train': 1.710706353187561} 11/07/2021 02:18:52 - INFO - __main__ - Step 36064: {'lr': 0.00043744702856226295, 'samples': 6924288, 'steps': 36063, 'loss/train': 1.4724761247634888} 11/07/2021 02:18:53 - INFO - __main__ - Step 36065: {'lr': 0.0004374435171640144, 'samples': 6924480, 'steps': 36064, 'loss/train': 1.2224045991897583} 11/07/2021 02:18:53 - INFO - __main__ - Step 36066: {'lr': 0.0004374400056813066, 'samples': 6924672, 'steps': 36065, 'loss/train': 1.6701277494430542} 11/07/2021 02:18:54 - INFO - __main__ - Step 36067: {'lr': 0.0004374364941141413, 'samples': 6924864, 'steps': 36066, 'loss/train': 0.6505734920501709} 11/07/2021 02:18:54 - INFO - __main__ - Step 36068: {'lr': 0.00043743298246251994, 'samples': 6925056, 'steps': 36067, 'loss/train': 1.7695902585983276} 11/07/2021 02:18:54 - INFO - __main__ - Step 36069: {'lr': 0.00043742947072644424, 'samples': 6925248, 'steps': 36068, 'loss/train': 1.2349269390106201} 11/07/2021 02:18:55 - INFO - __main__ - Step 36070: {'lr': 0.0004374259589059157, 'samples': 6925440, 'steps': 36069, 'loss/train': 0.949213981628418} 11/07/2021 02:18:56 - INFO - __main__ - Step 36071: {'lr': 0.0004374224470009359, 'samples': 6925632, 'steps': 36070, 'loss/train': 0.6275143027305603} 11/07/2021 02:18:56 - INFO - __main__ - Step 36072: {'lr': 0.00043741893501150644, 'samples': 6925824, 'steps': 36071, 'loss/train': 1.3203625679016113} 11/07/2021 02:18:57 - INFO - __main__ - Step 36073: {'lr': 0.0004374154229376289, 'samples': 6926016, 'steps': 36072, 'loss/train': 1.954107642173767} 11/07/2021 02:18:57 - INFO - __main__ - Step 36074: {'lr': 0.00043741191077930486, 'samples': 6926208, 'steps': 36073, 'loss/train': 1.9093469381332397} 11/07/2021 02:18:58 - INFO - __main__ - Step 36075: {'lr': 0.00043740839853653594, 'samples': 6926400, 'steps': 36074, 'loss/train': 1.2463798522949219} 11/07/2021 02:18:58 - INFO - __main__ - Step 36076: {'lr': 0.0004374048862093236, 'samples': 6926592, 'steps': 36075, 'loss/train': 1.6173049211502075} 11/07/2021 02:18:59 - INFO - __main__ - Step 36077: {'lr': 0.00043740137379766954, 'samples': 6926784, 'steps': 36076, 'loss/train': 1.380768895149231} 11/07/2021 02:18:59 - INFO - __main__ - Step 36078: {'lr': 0.0004373978613015753, 'samples': 6926976, 'steps': 36077, 'loss/train': 1.100445032119751} 11/07/2021 02:18:59 - INFO - __main__ - Step 36079: {'lr': 0.00043739434872104257, 'samples': 6927168, 'steps': 36078, 'loss/train': 1.6051472425460815} 11/07/2021 02:19:00 - INFO - __main__ - Step 36080: {'lr': 0.00043739083605607275, 'samples': 6927360, 'steps': 36079, 'loss/train': 1.165663242340088} 11/07/2021 02:19:01 - INFO - __main__ - Step 36081: {'lr': 0.0004373873233066676, 'samples': 6927552, 'steps': 36080, 'loss/train': 1.3163766860961914} 11/07/2021 02:19:01 - INFO - __main__ - Step 36082: {'lr': 0.00043738381047282856, 'samples': 6927744, 'steps': 36081, 'loss/train': 1.433258056640625} 11/07/2021 02:19:01 - INFO - __main__ - Step 36083: {'lr': 0.00043738029755455724, 'samples': 6927936, 'steps': 36082, 'loss/train': 0.9717826247215271} 11/07/2021 02:19:02 - INFO - __main__ - Step 36084: {'lr': 0.00043737678455185524, 'samples': 6928128, 'steps': 36083, 'loss/train': 1.3838664293289185} 11/07/2021 02:19:02 - INFO - __main__ - Step 36085: {'lr': 0.0004373732714647242, 'samples': 6928320, 'steps': 36084, 'loss/train': 1.2434223890304565} 11/07/2021 02:19:03 - INFO - __main__ - Step 36086: {'lr': 0.0004373697582931657, 'samples': 6928512, 'steps': 36085, 'loss/train': 1.0288443565368652} 11/07/2021 02:19:04 - INFO - __main__ - Step 36087: {'lr': 0.0004373662450371812, 'samples': 6928704, 'steps': 36086, 'loss/train': 1.307567834854126} 11/07/2021 02:19:04 - INFO - __main__ - Step 36088: {'lr': 0.0004373627316967723, 'samples': 6928896, 'steps': 36087, 'loss/train': 1.4782875776290894} 11/07/2021 02:19:04 - INFO - __main__ - Step 36089: {'lr': 0.0004373592182719408, 'samples': 6929088, 'steps': 36088, 'loss/train': 1.0795824527740479} 11/07/2021 02:19:05 - INFO - __main__ - Step 36090: {'lr': 0.00043735570476268804, 'samples': 6929280, 'steps': 36089, 'loss/train': 1.4311720132827759} 11/07/2021 02:19:06 - INFO - __main__ - Step 36091: {'lr': 0.0004373521911690157, 'samples': 6929472, 'steps': 36090, 'loss/train': 1.5301270484924316} 11/07/2021 02:19:06 - INFO - __main__ - Step 36092: {'lr': 0.00043734867749092534, 'samples': 6929664, 'steps': 36091, 'loss/train': 0.7610101103782654} 11/07/2021 02:19:06 - INFO - __main__ - Step 36093: {'lr': 0.0004373451637284186, 'samples': 6929856, 'steps': 36092, 'loss/train': 2.1793367862701416} 11/07/2021 02:19:07 - INFO - __main__ - Step 36094: {'lr': 0.0004373416498814969, 'samples': 6930048, 'steps': 36093, 'loss/train': 1.5620012283325195} 11/07/2021 02:19:07 - INFO - __main__ - Step 36095: {'lr': 0.0004373381359501621, 'samples': 6930240, 'steps': 36094, 'loss/train': 1.59977388381958} 11/07/2021 02:19:08 - INFO - __main__ - Step 36096: {'lr': 0.00043733462193441553, 'samples': 6930432, 'steps': 36095, 'loss/train': 1.613214373588562} 11/07/2021 02:19:08 - INFO - __main__ - Step 36097: {'lr': 0.00043733110783425894, 'samples': 6930624, 'steps': 36096, 'loss/train': 1.4539893865585327} 11/07/2021 02:19:09 - INFO - __main__ - Step 36098: {'lr': 0.00043732759364969374, 'samples': 6930816, 'steps': 36097, 'loss/train': 1.4978737831115723} 11/07/2021 02:19:09 - INFO - __main__ - Step 36099: {'lr': 0.0004373240793807217, 'samples': 6931008, 'steps': 36098, 'loss/train': 2.053466320037842} 11/07/2021 02:19:09 - INFO - __main__ - Step 36100: {'lr': 0.00043732056502734435, 'samples': 6931200, 'steps': 36099, 'loss/train': 1.4732714891433716} 11/07/2021 02:19:11 - INFO - __main__ - Step 36101: {'lr': 0.0004373170505895632, 'samples': 6931392, 'steps': 36100, 'loss/train': 1.4807472229003906} 11/07/2021 02:19:11 - INFO - __main__ - Step 36102: {'lr': 0.0004373135360673799, 'samples': 6931584, 'steps': 36101, 'loss/train': 1.4360374212265015} 11/07/2021 02:19:11 - INFO - __main__ - Step 36103: {'lr': 0.000437310021460796, 'samples': 6931776, 'steps': 36102, 'loss/train': 1.6048221588134766} 11/07/2021 02:19:12 - INFO - __main__ - Step 36104: {'lr': 0.000437306506769813, 'samples': 6931968, 'steps': 36103, 'loss/train': 0.9805557727813721} 11/07/2021 02:19:12 - INFO - __main__ - Step 36105: {'lr': 0.0004373029919944327, 'samples': 6932160, 'steps': 36104, 'loss/train': 0.14362464845180511} 11/07/2021 02:19:13 - INFO - __main__ - Step 36106: {'lr': 0.00043729947713465653, 'samples': 6932352, 'steps': 36105, 'loss/train': 1.5470880270004272} 11/07/2021 02:19:13 - INFO - __main__ - Step 36107: {'lr': 0.00043729596219048607, 'samples': 6932544, 'steps': 36106, 'loss/train': 1.026743769645691} 11/07/2021 02:19:14 - INFO - __main__ - Step 36108: {'lr': 0.000437292447161923, 'samples': 6932736, 'steps': 36107, 'loss/train': 1.5140167474746704} 11/07/2021 02:19:14 - INFO - __main__ - Step 36109: {'lr': 0.0004372889320489688, 'samples': 6932928, 'steps': 36108, 'loss/train': 1.5239927768707275} 11/07/2021 02:19:14 - INFO - __main__ - Step 36110: {'lr': 0.00043728541685162503, 'samples': 6933120, 'steps': 36109, 'loss/train': 1.677078366279602} 11/07/2021 02:19:15 - INFO - __main__ - Step 36111: {'lr': 0.0004372819015698934, 'samples': 6933312, 'steps': 36110, 'loss/train': 1.5815430879592896} 11/07/2021 02:19:16 - INFO - __main__ - Step 36112: {'lr': 0.0004372783862037755, 'samples': 6933504, 'steps': 36111, 'loss/train': 1.2556015253067017} 11/07/2021 02:19:16 - INFO - __main__ - Step 36113: {'lr': 0.00043727487075327285, 'samples': 6933696, 'steps': 36112, 'loss/train': 0.9244870543479919} 11/07/2021 02:19:16 - INFO - __main__ - Step 36114: {'lr': 0.00043727135521838697, 'samples': 6933888, 'steps': 36113, 'loss/train': 1.4923559427261353} 11/07/2021 02:19:17 - INFO - __main__ - Step 36115: {'lr': 0.00043726783959911953, 'samples': 6934080, 'steps': 36114, 'loss/train': 1.527382254600525} 11/07/2021 02:19:18 - INFO - __main__ - Step 36116: {'lr': 0.00043726432389547205, 'samples': 6934272, 'steps': 36115, 'loss/train': 1.7110966444015503} 11/07/2021 02:19:18 - INFO - __main__ - Step 36117: {'lr': 0.00043726080810744616, 'samples': 6934464, 'steps': 36116, 'loss/train': 1.2762972116470337} 11/07/2021 02:19:19 - INFO - __main__ - Step 36118: {'lr': 0.0004372572922350435, 'samples': 6934656, 'steps': 36117, 'loss/train': 1.5502105951309204} 11/07/2021 02:19:19 - INFO - __main__ - Step 36119: {'lr': 0.0004372537762782656, 'samples': 6934848, 'steps': 36118, 'loss/train': 1.138943076133728} 11/07/2021 02:19:19 - INFO - __main__ - Step 36120: {'lr': 0.00043725026023711395, 'samples': 6935040, 'steps': 36119, 'loss/train': 1.5728424787521362} 11/07/2021 02:19:20 - INFO - __main__ - Step 36121: {'lr': 0.0004372467441115903, 'samples': 6935232, 'steps': 36120, 'loss/train': 1.1723154783248901} 11/07/2021 02:19:21 - INFO - __main__ - Step 36122: {'lr': 0.00043724322790169613, 'samples': 6935424, 'steps': 36121, 'loss/train': 1.4476337432861328} 11/07/2021 02:19:21 - INFO - __main__ - Step 36123: {'lr': 0.00043723971160743305, 'samples': 6935616, 'steps': 36122, 'loss/train': 1.4831031560897827} 11/07/2021 02:19:21 - INFO - __main__ - Step 36124: {'lr': 0.00043723619522880266, 'samples': 6935808, 'steps': 36123, 'loss/train': 1.0746320486068726} 11/07/2021 02:19:22 - INFO - __main__ - Step 36125: {'lr': 0.0004372326787658065, 'samples': 6936000, 'steps': 36124, 'loss/train': 0.9541419148445129} 11/07/2021 02:19:22 - INFO - __main__ - Step 36126: {'lr': 0.00043722916221844617, 'samples': 6936192, 'steps': 36125, 'loss/train': 0.38092461228370667} 11/07/2021 02:19:23 - INFO - __main__ - Step 36127: {'lr': 0.0004372256455867233, 'samples': 6936384, 'steps': 36126, 'loss/train': 1.2290235757827759} 11/07/2021 02:19:23 - INFO - __main__ - Step 36128: {'lr': 0.0004372221288706394, 'samples': 6936576, 'steps': 36127, 'loss/train': 1.2454577684402466} 11/07/2021 02:19:24 - INFO - __main__ - Step 36129: {'lr': 0.0004372186120701962, 'samples': 6936768, 'steps': 36128, 'loss/train': 1.9584871530532837} 11/07/2021 02:19:24 - INFO - __main__ - Step 36130: {'lr': 0.00043721509518539507, 'samples': 6936960, 'steps': 36129, 'loss/train': 1.2257826328277588} 11/07/2021 02:19:24 - INFO - __main__ - Step 36131: {'lr': 0.0004372115782162378, 'samples': 6937152, 'steps': 36130, 'loss/train': 1.4203920364379883} 11/07/2021 02:19:25 - INFO - __main__ - Step 36132: {'lr': 0.00043720806116272584, 'samples': 6937344, 'steps': 36131, 'loss/train': 1.0633304119110107} 11/07/2021 02:19:26 - INFO - __main__ - Step 36133: {'lr': 0.00043720454402486076, 'samples': 6937536, 'steps': 36132, 'loss/train': 0.8455348014831543} 11/07/2021 02:19:26 - INFO - __main__ - Step 36134: {'lr': 0.00043720102680264427, 'samples': 6937728, 'steps': 36133, 'loss/train': 1.6558862924575806} 11/07/2021 02:19:27 - INFO - __main__ - Step 36135: {'lr': 0.0004371975094960778, 'samples': 6937920, 'steps': 36134, 'loss/train': 1.6989877223968506} 11/07/2021 02:19:27 - INFO - __main__ - Step 36136: {'lr': 0.0004371939921051632, 'samples': 6938112, 'steps': 36135, 'loss/train': 1.5624903440475464} 11/07/2021 02:19:28 - INFO - __main__ - Step 36137: {'lr': 0.00043719047462990174, 'samples': 6938304, 'steps': 36136, 'loss/train': 1.592354416847229} 11/07/2021 02:19:28 - INFO - __main__ - Step 36138: {'lr': 0.0004371869570702952, 'samples': 6938496, 'steps': 36137, 'loss/train': 1.591860055923462} 11/07/2021 02:19:29 - INFO - __main__ - Step 36139: {'lr': 0.0004371834394263451, 'samples': 6938688, 'steps': 36138, 'loss/train': 1.3959500789642334} 11/07/2021 02:19:29 - INFO - __main__ - Step 36140: {'lr': 0.000437179921698053, 'samples': 6938880, 'steps': 36139, 'loss/train': 1.7648470401763916} 11/07/2021 02:19:29 - INFO - __main__ - Step 36141: {'lr': 0.00043717640388542045, 'samples': 6939072, 'steps': 36140, 'loss/train': 1.1295483112335205} 11/07/2021 02:19:30 - INFO - __main__ - Step 36142: {'lr': 0.00043717288598844916, 'samples': 6939264, 'steps': 36141, 'loss/train': 1.166265606880188} 11/07/2021 02:19:31 - INFO - __main__ - Step 36143: {'lr': 0.0004371693680071407, 'samples': 6939456, 'steps': 36142, 'loss/train': 1.6560958623886108} 11/07/2021 02:19:31 - INFO - __main__ - Step 36144: {'lr': 0.00043716584994149657, 'samples': 6939648, 'steps': 36143, 'loss/train': 1.515106201171875} 11/07/2021 02:19:31 - INFO - __main__ - Step 36145: {'lr': 0.0004371623317915184, 'samples': 6939840, 'steps': 36144, 'loss/train': 1.1427377462387085} 11/07/2021 02:19:32 - INFO - __main__ - Step 36146: {'lr': 0.00043715881355720776, 'samples': 6940032, 'steps': 36145, 'loss/train': 1.328829288482666} 11/07/2021 02:19:33 - INFO - __main__ - Step 36147: {'lr': 0.0004371552952385663, 'samples': 6940224, 'steps': 36146, 'loss/train': 1.6909960508346558} 11/07/2021 02:19:33 - INFO - __main__ - Step 36148: {'lr': 0.00043715177683559546, 'samples': 6940416, 'steps': 36147, 'loss/train': 1.5440813302993774} 11/07/2021 02:19:33 - INFO - __main__ - Step 36149: {'lr': 0.000437148258348297, 'samples': 6940608, 'steps': 36148, 'loss/train': 1.8825254440307617} 11/07/2021 02:19:34 - INFO - __main__ - Step 36150: {'lr': 0.0004371447397766724, 'samples': 6940800, 'steps': 36149, 'loss/train': 1.4469424486160278} 11/07/2021 02:19:34 - INFO - __main__ - Step 36151: {'lr': 0.0004371412211207233, 'samples': 6940992, 'steps': 36150, 'loss/train': 1.580246090888977} 11/07/2021 02:19:35 - INFO - __main__ - Step 36152: {'lr': 0.0004371377023804512, 'samples': 6941184, 'steps': 36151, 'loss/train': 1.47393000125885} 11/07/2021 02:19:35 - INFO - __main__ - Step 36153: {'lr': 0.0004371341835558578, 'samples': 6941376, 'steps': 36152, 'loss/train': 1.0574392080307007} 11/07/2021 02:19:36 - INFO - __main__ - Step 36154: {'lr': 0.0004371306646469445, 'samples': 6941568, 'steps': 36153, 'loss/train': 2.2134885787963867} 11/07/2021 02:19:36 - INFO - __main__ - Step 36155: {'lr': 0.00043712714565371315, 'samples': 6941760, 'steps': 36154, 'loss/train': 1.4620238542556763} 11/07/2021 02:19:37 - INFO - __main__ - Step 36156: {'lr': 0.0004371236265761651, 'samples': 6941952, 'steps': 36155, 'loss/train': 1.5506919622421265} 11/07/2021 02:19:37 - INFO - __main__ - Step 36157: {'lr': 0.0004371201074143021, 'samples': 6942144, 'steps': 36156, 'loss/train': 1.710616946220398} 11/07/2021 02:19:38 - INFO - __main__ - Step 36158: {'lr': 0.0004371165881681256, 'samples': 6942336, 'steps': 36157, 'loss/train': 1.4615522623062134} 11/07/2021 02:19:38 - INFO - __main__ - Step 36159: {'lr': 0.0004371130688376373, 'samples': 6942528, 'steps': 36158, 'loss/train': 1.606250524520874} 11/07/2021 02:19:39 - INFO - __main__ - Step 36160: {'lr': 0.00043710954942283875, 'samples': 6942720, 'steps': 36159, 'loss/train': 1.5076546669006348} 11/07/2021 02:19:39 - INFO - __main__ - Step 36161: {'lr': 0.0004371060299237315, 'samples': 6942912, 'steps': 36160, 'loss/train': 1.523021936416626} 11/07/2021 02:19:39 - INFO - __main__ - Step 36162: {'lr': 0.00043710251034031713, 'samples': 6943104, 'steps': 36161, 'loss/train': 1.4017515182495117} 11/07/2021 02:19:41 - INFO - __main__ - Step 36163: {'lr': 0.0004370989906725973, 'samples': 6943296, 'steps': 36162, 'loss/train': 1.7055500745773315} 11/07/2021 02:19:41 - INFO - __main__ - Step 36164: {'lr': 0.00043709547092057356, 'samples': 6943488, 'steps': 36163, 'loss/train': 1.627466082572937} 11/07/2021 02:19:41 - INFO - __main__ - Step 36165: {'lr': 0.00043709195108424746, 'samples': 6943680, 'steps': 36164, 'loss/train': 1.0193427801132202} 11/07/2021 02:19:42 - INFO - __main__ - Step 36166: {'lr': 0.0004370884311636206, 'samples': 6943872, 'steps': 36165, 'loss/train': 0.8380038142204285} 11/07/2021 02:19:42 - INFO - __main__ - Step 36167: {'lr': 0.0004370849111586946, 'samples': 6944064, 'steps': 36166, 'loss/train': 1.7870904207229614} 11/07/2021 02:19:43 - INFO - __main__ - Step 36168: {'lr': 0.000437081391069471, 'samples': 6944256, 'steps': 36167, 'loss/train': 1.1971633434295654} 11/07/2021 02:19:43 - INFO - __main__ - Step 36169: {'lr': 0.0004370778708959514, 'samples': 6944448, 'steps': 36168, 'loss/train': 1.4381688833236694} 11/07/2021 02:19:44 - INFO - __main__ - Step 36170: {'lr': 0.00043707435063813747, 'samples': 6944640, 'steps': 36169, 'loss/train': 1.353993535041809} 11/07/2021 02:19:44 - INFO - __main__ - Step 36171: {'lr': 0.0004370708302960307, 'samples': 6944832, 'steps': 36170, 'loss/train': 1.1478573083877563} 11/07/2021 02:19:44 - INFO - __main__ - Step 36172: {'lr': 0.00043706730986963274, 'samples': 6945024, 'steps': 36171, 'loss/train': 1.386587142944336} 11/07/2021 02:19:46 - INFO - __main__ - Step 36173: {'lr': 0.0004370637893589451, 'samples': 6945216, 'steps': 36172, 'loss/train': 1.4752625226974487} 11/07/2021 02:19:46 - INFO - __main__ - Step 36174: {'lr': 0.0004370602687639693, 'samples': 6945408, 'steps': 36173, 'loss/train': 1.3060412406921387} 11/07/2021 02:19:46 - INFO - __main__ - Step 36175: {'lr': 0.00043705674808470715, 'samples': 6945600, 'steps': 36174, 'loss/train': 1.583465337753296} 11/07/2021 02:19:47 - INFO - __main__ - Step 36176: {'lr': 0.00043705322732116007, 'samples': 6945792, 'steps': 36175, 'loss/train': 1.362337350845337} 11/07/2021 02:19:47 - INFO - __main__ - Step 36177: {'lr': 0.00043704970647332977, 'samples': 6945984, 'steps': 36176, 'loss/train': 1.276375651359558} 11/07/2021 02:19:48 - INFO - __main__ - Step 36178: {'lr': 0.00043704618554121766, 'samples': 6946176, 'steps': 36177, 'loss/train': 1.6004598140716553} 11/07/2021 02:19:48 - INFO - __main__ - Step 36179: {'lr': 0.0004370426645248254, 'samples': 6946368, 'steps': 36178, 'loss/train': 1.3531804084777832} 11/07/2021 02:19:49 - INFO - __main__ - Step 36180: {'lr': 0.00043703914342415473, 'samples': 6946560, 'steps': 36179, 'loss/train': 1.6111247539520264} 11/07/2021 02:19:49 - INFO - __main__ - Step 36181: {'lr': 0.000437035622239207, 'samples': 6946752, 'steps': 36180, 'loss/train': 0.882485568523407} 11/07/2021 02:19:49 - INFO - __main__ - Step 36182: {'lr': 0.00043703210096998396, 'samples': 6946944, 'steps': 36181, 'loss/train': 1.731668472290039} 11/07/2021 02:19:50 - INFO - __main__ - Step 36183: {'lr': 0.00043702857961648713, 'samples': 6947136, 'steps': 36182, 'loss/train': 1.1887149810791016} 11/07/2021 02:19:51 - INFO - __main__ - Step 36184: {'lr': 0.0004370250581787181, 'samples': 6947328, 'steps': 36183, 'loss/train': 0.8045414686203003} 11/07/2021 02:19:51 - INFO - __main__ - Step 36185: {'lr': 0.00043702153665667846, 'samples': 6947520, 'steps': 36184, 'loss/train': 1.8129420280456543} 11/07/2021 02:19:51 - INFO - __main__ - Step 36186: {'lr': 0.0004370180150503698, 'samples': 6947712, 'steps': 36185, 'loss/train': 1.8427050113677979} 11/07/2021 02:19:52 - INFO - __main__ - Step 36187: {'lr': 0.0004370144933597938, 'samples': 6947904, 'steps': 36186, 'loss/train': 1.2437989711761475} 11/07/2021 02:19:53 - INFO - __main__ - Step 36188: {'lr': 0.00043701097158495186, 'samples': 6948096, 'steps': 36187, 'loss/train': 1.0144716501235962} 11/07/2021 02:19:53 - INFO - __main__ - Step 36189: {'lr': 0.0004370074497258456, 'samples': 6948288, 'steps': 36188, 'loss/train': 1.7620385885238647} 11/07/2021 02:19:53 - INFO - __main__ - Step 36190: {'lr': 0.00043700392778247676, 'samples': 6948480, 'steps': 36189, 'loss/train': 1.395757794380188} 11/07/2021 02:19:54 - INFO - __main__ - Step 36191: {'lr': 0.0004370004057548468, 'samples': 6948672, 'steps': 36190, 'loss/train': 1.7740306854248047} 11/07/2021 02:19:54 - INFO - __main__ - Step 36192: {'lr': 0.0004369968836429574, 'samples': 6948864, 'steps': 36191, 'loss/train': 1.3015068769454956} 11/07/2021 02:19:55 - INFO - __main__ - Step 36193: {'lr': 0.0004369933614468101, 'samples': 6949056, 'steps': 36192, 'loss/train': 1.3714641332626343} 11/07/2021 02:19:55 - INFO - __main__ - Step 36194: {'lr': 0.0004369898391664064, 'samples': 6949248, 'steps': 36193, 'loss/train': 1.4054350852966309} 11/07/2021 02:19:56 - INFO - __main__ - Step 36195: {'lr': 0.000436986316801748, 'samples': 6949440, 'steps': 36194, 'loss/train': 1.5505949258804321} 11/07/2021 02:19:56 - INFO - __main__ - Step 36196: {'lr': 0.00043698279435283637, 'samples': 6949632, 'steps': 36195, 'loss/train': 1.662702202796936} 11/07/2021 02:19:56 - INFO - __main__ - Step 36197: {'lr': 0.0004369792718196733, 'samples': 6949824, 'steps': 36196, 'loss/train': 0.9160897135734558} 11/07/2021 02:19:58 - INFO - __main__ - Step 36198: {'lr': 0.0004369757492022602, 'samples': 6950016, 'steps': 36197, 'loss/train': 1.199594259262085} 11/07/2021 02:19:58 - INFO - __main__ - Step 36199: {'lr': 0.00043697222650059876, 'samples': 6950208, 'steps': 36198, 'loss/train': 1.3858470916748047} 11/07/2021 02:19:58 - INFO - __main__ - Step 36200: {'lr': 0.00043696870371469045, 'samples': 6950400, 'steps': 36199, 'loss/train': 1.6673192977905273} 11/07/2021 02:19:59 - INFO - __main__ - Step 36201: {'lr': 0.000436965180844537, 'samples': 6950592, 'steps': 36200, 'loss/train': 1.7721482515335083} 11/07/2021 02:19:59 - INFO - __main__ - Step 36202: {'lr': 0.00043696165789013986, 'samples': 6950784, 'steps': 36201, 'loss/train': 1.8136097192764282} 11/07/2021 02:20:00 - INFO - __main__ - Step 36203: {'lr': 0.0004369581348515007, 'samples': 6950976, 'steps': 36202, 'loss/train': 1.477937936782837} 11/07/2021 02:20:00 - INFO - __main__ - Step 36204: {'lr': 0.00043695461172862113, 'samples': 6951168, 'steps': 36203, 'loss/train': 1.594497799873352} 11/07/2021 02:20:01 - INFO - __main__ - Step 36205: {'lr': 0.0004369510885215026, 'samples': 6951360, 'steps': 36204, 'loss/train': 1.2825634479522705} 11/07/2021 02:20:01 - INFO - __main__ - Step 36206: {'lr': 0.0004369475652301469, 'samples': 6951552, 'steps': 36205, 'loss/train': 1.2375483512878418} 11/07/2021 02:20:01 - INFO - __main__ - Step 36207: {'lr': 0.0004369440418545555, 'samples': 6951744, 'steps': 36206, 'loss/train': 1.5646448135375977} 11/07/2021 02:20:02 - INFO - __main__ - Step 36208: {'lr': 0.00043694051839472995, 'samples': 6951936, 'steps': 36207, 'loss/train': 1.567756175994873} 11/07/2021 02:20:03 - INFO - __main__ - Step 36209: {'lr': 0.00043693699485067186, 'samples': 6952128, 'steps': 36208, 'loss/train': 1.2388901710510254} 11/07/2021 02:20:03 - INFO - __main__ - Step 36210: {'lr': 0.0004369334712223829, 'samples': 6952320, 'steps': 36209, 'loss/train': 1.9396334886550903} 11/07/2021 02:20:03 - INFO - __main__ - Step 36211: {'lr': 0.0004369299475098646, 'samples': 6952512, 'steps': 36210, 'loss/train': 0.9735457301139832} 11/07/2021 02:20:04 - INFO - __main__ - Step 36212: {'lr': 0.00043692642371311854, 'samples': 6952704, 'steps': 36211, 'loss/train': 1.311751127243042} 11/07/2021 02:20:04 - INFO - __main__ - Step 36213: {'lr': 0.00043692289983214626, 'samples': 6952896, 'steps': 36212, 'loss/train': 1.5022273063659668} 11/07/2021 02:20:05 - INFO - __main__ - Step 36214: {'lr': 0.0004369193758669495, 'samples': 6953088, 'steps': 36213, 'loss/train': 1.4548381567001343} 11/07/2021 02:20:05 - INFO - __main__ - Step 36215: {'lr': 0.0004369158518175297, 'samples': 6953280, 'steps': 36214, 'loss/train': 1.3823559284210205} 11/07/2021 02:20:06 - INFO - __main__ - Step 36216: {'lr': 0.00043691232768388856, 'samples': 6953472, 'steps': 36215, 'loss/train': 1.1766122579574585} 11/07/2021 02:20:06 - INFO - __main__ - Step 36217: {'lr': 0.00043690880346602755, 'samples': 6953664, 'steps': 36216, 'loss/train': 1.4669190645217896} 11/07/2021 02:20:07 - INFO - __main__ - Step 36218: {'lr': 0.0004369052791639483, 'samples': 6953856, 'steps': 36217, 'loss/train': 1.1427199840545654} 11/07/2021 02:20:07 - INFO - __main__ - Step 36219: {'lr': 0.0004369017547776525, 'samples': 6954048, 'steps': 36218, 'loss/train': 1.8163267374038696} 11/07/2021 02:20:08 - INFO - __main__ - Step 36220: {'lr': 0.0004368982303071416, 'samples': 6954240, 'steps': 36219, 'loss/train': 1.1658703088760376} 11/07/2021 02:20:08 - INFO - __main__ - Step 36221: {'lr': 0.0004368947057524173, 'samples': 6954432, 'steps': 36220, 'loss/train': 1.4470137357711792} 11/07/2021 02:20:09 - INFO - __main__ - Step 36222: {'lr': 0.00043689118111348105, 'samples': 6954624, 'steps': 36221, 'loss/train': 1.7597016096115112} 11/07/2021 02:20:09 - INFO - __main__ - Step 36223: {'lr': 0.00043688765639033456, 'samples': 6954816, 'steps': 36222, 'loss/train': 1.475920557975769} 11/07/2021 02:20:10 - INFO - __main__ - Step 36224: {'lr': 0.00043688413158297934, 'samples': 6955008, 'steps': 36223, 'loss/train': 0.4032767713069916} 11/07/2021 02:20:10 - INFO - __main__ - Step 36225: {'lr': 0.00043688060669141705, 'samples': 6955200, 'steps': 36224, 'loss/train': 1.700075387954712} 11/07/2021 02:20:11 - INFO - __main__ - Step 36226: {'lr': 0.00043687708171564923, 'samples': 6955392, 'steps': 36225, 'loss/train': 1.6373037099838257} 11/07/2021 02:20:11 - INFO - __main__ - Step 36227: {'lr': 0.00043687355665567745, 'samples': 6955584, 'steps': 36226, 'loss/train': 1.5465506315231323} 11/07/2021 02:20:11 - INFO - __main__ - Step 36228: {'lr': 0.0004368700315115034, 'samples': 6955776, 'steps': 36227, 'loss/train': 1.9873836040496826} 11/07/2021 02:20:12 - INFO - __main__ - Step 36229: {'lr': 0.00043686650628312854, 'samples': 6955968, 'steps': 36228, 'loss/train': 1.3647935390472412} 11/07/2021 02:20:13 - INFO - __main__ - Step 36230: {'lr': 0.00043686298097055456, 'samples': 6956160, 'steps': 36229, 'loss/train': 1.7766621112823486} 11/07/2021 02:20:13 - INFO - __main__ - Step 36231: {'lr': 0.0004368594555737829, 'samples': 6956352, 'steps': 36230, 'loss/train': 1.8264625072479248} 11/07/2021 02:20:13 - INFO - __main__ - Step 36232: {'lr': 0.0004368559300928153, 'samples': 6956544, 'steps': 36231, 'loss/train': 1.3406790494918823} 11/07/2021 02:20:14 - INFO - __main__ - Step 36233: {'lr': 0.0004368524045276534, 'samples': 6956736, 'steps': 36232, 'loss/train': 0.7710010409355164} 11/07/2021 02:20:15 - INFO - __main__ - Step 36234: {'lr': 0.00043684887887829863, 'samples': 6956928, 'steps': 36233, 'loss/train': 1.5048789978027344} 11/07/2021 02:20:15 - INFO - __main__ - Step 36235: {'lr': 0.0004368453531447526, 'samples': 6957120, 'steps': 36234, 'loss/train': 0.9283801913261414} 11/07/2021 02:20:15 - INFO - __main__ - Step 36236: {'lr': 0.00043684182732701694, 'samples': 6957312, 'steps': 36235, 'loss/train': 1.9875965118408203} 11/07/2021 02:20:16 - INFO - __main__ - Step 36237: {'lr': 0.00043683830142509327, 'samples': 6957504, 'steps': 36236, 'loss/train': 1.540000081062317} 11/07/2021 02:20:16 - INFO - __main__ - Step 36238: {'lr': 0.00043683477543898314, 'samples': 6957696, 'steps': 36237, 'loss/train': 0.8398938775062561} 11/07/2021 02:20:17 - INFO - __main__ - Step 36239: {'lr': 0.0004368312493686881, 'samples': 6957888, 'steps': 36238, 'loss/train': 1.4075531959533691} 11/07/2021 02:20:18 - INFO - __main__ - Step 36240: {'lr': 0.0004368277232142098, 'samples': 6958080, 'steps': 36239, 'loss/train': 1.2510409355163574} 11/07/2021 02:20:18 - INFO - __main__ - Step 36241: {'lr': 0.00043682419697554985, 'samples': 6958272, 'steps': 36240, 'loss/train': 1.9596631526947021} 11/07/2021 02:20:18 - INFO - __main__ - Step 36242: {'lr': 0.0004368206706527098, 'samples': 6958464, 'steps': 36241, 'loss/train': 1.5783356428146362} 11/07/2021 02:20:19 - INFO - __main__ - Step 36243: {'lr': 0.00043681714424569117, 'samples': 6958656, 'steps': 36242, 'loss/train': 0.5691535472869873} 11/07/2021 02:20:19 - INFO - __main__ - Step 36244: {'lr': 0.0004368136177544957, 'samples': 6958848, 'steps': 36243, 'loss/train': 1.6415650844573975} 11/07/2021 02:20:20 - INFO - __main__ - Step 36245: {'lr': 0.00043681009117912484, 'samples': 6959040, 'steps': 36244, 'loss/train': 1.0533183813095093} 11/07/2021 02:20:20 - INFO - __main__ - Step 36246: {'lr': 0.0004368065645195803, 'samples': 6959232, 'steps': 36245, 'loss/train': 0.1379714012145996} 11/07/2021 02:20:21 - INFO - __main__ - Step 36247: {'lr': 0.0004368030377758636, 'samples': 6959424, 'steps': 36246, 'loss/train': 1.1800734996795654} 11/07/2021 02:20:21 - INFO - __main__ - Step 36248: {'lr': 0.0004367995109479763, 'samples': 6959616, 'steps': 36247, 'loss/train': 1.4469062089920044} 11/07/2021 02:20:21 - INFO - __main__ - Step 36249: {'lr': 0.00043679598403592, 'samples': 6959808, 'steps': 36248, 'loss/train': 1.611397385597229} 11/07/2021 02:20:22 - INFO - __main__ - Step 36250: {'lr': 0.00043679245703969627, 'samples': 6960000, 'steps': 36249, 'loss/train': 1.2449549436569214} 11/07/2021 02:20:23 - INFO - __main__ - Step 36251: {'lr': 0.00043678892995930685, 'samples': 6960192, 'steps': 36250, 'loss/train': 1.9472393989562988} 11/07/2021 02:20:23 - INFO - __main__ - Step 36252: {'lr': 0.00043678540279475314, 'samples': 6960384, 'steps': 36251, 'loss/train': 1.4475023746490479} 11/07/2021 02:20:23 - INFO - __main__ - Step 36253: {'lr': 0.0004367818755460369, 'samples': 6960576, 'steps': 36252, 'loss/train': 1.3991891145706177} 11/07/2021 02:20:24 - INFO - __main__ - Step 36254: {'lr': 0.00043677834821315956, 'samples': 6960768, 'steps': 36253, 'loss/train': 0.6604613661766052} 11/07/2021 02:20:25 - INFO - __main__ - Step 36255: {'lr': 0.00043677482079612276, 'samples': 6960960, 'steps': 36254, 'loss/train': 1.4953562021255493} 11/07/2021 02:20:25 - INFO - __main__ - Step 36256: {'lr': 0.00043677129329492814, 'samples': 6961152, 'steps': 36255, 'loss/train': 1.5402852296829224} 11/07/2021 02:20:26 - INFO - __main__ - Step 36257: {'lr': 0.00043676776570957725, 'samples': 6961344, 'steps': 36256, 'loss/train': 1.8290815353393555} 11/07/2021 02:20:26 - INFO - __main__ - Step 36258: {'lr': 0.0004367642380400717, 'samples': 6961536, 'steps': 36257, 'loss/train': 1.542945146560669} 11/07/2021 02:20:26 - INFO - __main__ - Step 36259: {'lr': 0.0004367607102864131, 'samples': 6961728, 'steps': 36258, 'loss/train': 1.9337825775146484} 11/07/2021 02:20:27 - INFO - __main__ - Step 36260: {'lr': 0.00043675718244860296, 'samples': 6961920, 'steps': 36259, 'loss/train': 1.3398491144180298} 11/07/2021 02:20:28 - INFO - __main__ - Step 36261: {'lr': 0.00043675365452664286, 'samples': 6962112, 'steps': 36260, 'loss/train': 1.286232352256775} 11/07/2021 02:20:28 - INFO - __main__ - Step 36262: {'lr': 0.0004367501265205345, 'samples': 6962304, 'steps': 36261, 'loss/train': 1.5498446226119995} 11/07/2021 02:20:28 - INFO - __main__ - Step 36263: {'lr': 0.0004367465984302794, 'samples': 6962496, 'steps': 36262, 'loss/train': 1.3904285430908203} 11/07/2021 02:20:29 - INFO - __main__ - Step 36264: {'lr': 0.0004367430702558792, 'samples': 6962688, 'steps': 36263, 'loss/train': 1.4619240760803223} 11/07/2021 02:20:30 - INFO - __main__ - Step 36265: {'lr': 0.0004367395419973355, 'samples': 6962880, 'steps': 36264, 'loss/train': 1.400206208229065} 11/07/2021 02:20:30 - INFO - __main__ - Step 36266: {'lr': 0.00043673601365464975, 'samples': 6963072, 'steps': 36265, 'loss/train': 1.4053270816802979} 11/07/2021 02:20:31 - INFO - __main__ - Step 36267: {'lr': 0.00043673248522782364, 'samples': 6963264, 'steps': 36266, 'loss/train': 1.2345603704452515} 11/07/2021 02:20:31 - INFO - __main__ - Step 36268: {'lr': 0.0004367289567168588, 'samples': 6963456, 'steps': 36267, 'loss/train': 1.3548353910446167} 11/07/2021 02:20:31 - INFO - __main__ - Step 36269: {'lr': 0.00043672542812175675, 'samples': 6963648, 'steps': 36268, 'loss/train': 1.8345106840133667} 11/07/2021 02:20:32 - INFO - __main__ - Step 36270: {'lr': 0.00043672189944251905, 'samples': 6963840, 'steps': 36269, 'loss/train': 1.6689475774765015} 11/07/2021 02:20:33 - INFO - __main__ - Step 36271: {'lr': 0.0004367183706791474, 'samples': 6964032, 'steps': 36270, 'loss/train': 1.5568740367889404} 11/07/2021 02:20:33 - INFO - __main__ - Step 36272: {'lr': 0.0004367148418316434, 'samples': 6964224, 'steps': 36271, 'loss/train': 1.1501420736312866} 11/07/2021 02:20:33 - INFO - __main__ - Step 36273: {'lr': 0.0004367113129000085, 'samples': 6964416, 'steps': 36272, 'loss/train': 1.8946768045425415} 11/07/2021 02:20:34 - INFO - __main__ - Step 36274: {'lr': 0.00043670778388424434, 'samples': 6964608, 'steps': 36273, 'loss/train': 1.0662518739700317} 11/07/2021 02:20:35 - INFO - __main__ - Step 36275: {'lr': 0.00043670425478435263, 'samples': 6964800, 'steps': 36274, 'loss/train': 1.4347180128097534} 11/07/2021 02:20:35 - INFO - __main__ - Step 36276: {'lr': 0.00043670072560033474, 'samples': 6964992, 'steps': 36275, 'loss/train': 1.8485217094421387} 11/07/2021 02:20:35 - INFO - __main__ - Step 36277: {'lr': 0.00043669719633219247, 'samples': 6965184, 'steps': 36276, 'loss/train': 1.1847352981567383} 11/07/2021 02:20:36 - INFO - __main__ - Step 36278: {'lr': 0.0004366936669799273, 'samples': 6965376, 'steps': 36277, 'loss/train': 1.7239091396331787} 11/07/2021 02:20:36 - INFO - __main__ - Step 36279: {'lr': 0.0004366901375435408, 'samples': 6965568, 'steps': 36278, 'loss/train': 1.6254451274871826} 11/07/2021 02:20:37 - INFO - __main__ - Step 36280: {'lr': 0.0004366866080230347, 'samples': 6965760, 'steps': 36279, 'loss/train': 0.7942847609519958} 11/07/2021 02:20:37 - INFO - __main__ - Step 36281: {'lr': 0.0004366830784184104, 'samples': 6965952, 'steps': 36280, 'loss/train': 1.7162615060806274} 11/07/2021 02:20:38 - INFO - __main__ - Step 36282: {'lr': 0.00043667954872966965, 'samples': 6966144, 'steps': 36281, 'loss/train': 1.5471854209899902} 11/07/2021 02:20:38 - INFO - __main__ - Step 36283: {'lr': 0.000436676018956814, 'samples': 6966336, 'steps': 36282, 'loss/train': 1.441980004310608} 11/07/2021 02:20:38 - INFO - __main__ - Step 36284: {'lr': 0.0004366724890998449, 'samples': 6966528, 'steps': 36283, 'loss/train': 1.545427918434143} 11/07/2021 02:20:40 - INFO - __main__ - Step 36285: {'lr': 0.00043666895915876416, 'samples': 6966720, 'steps': 36284, 'loss/train': 1.0880110263824463} 11/07/2021 02:20:40 - INFO - __main__ - Step 36286: {'lr': 0.0004366654291335732, 'samples': 6966912, 'steps': 36285, 'loss/train': 1.8009814023971558} 11/07/2021 02:20:40 - INFO - __main__ - Step 36287: {'lr': 0.00043666189902427367, 'samples': 6967104, 'steps': 36286, 'loss/train': 1.3985798358917236} 11/07/2021 02:20:41 - INFO - __main__ - Step 36288: {'lr': 0.00043665836883086725, 'samples': 6967296, 'steps': 36287, 'loss/train': 1.3476370573043823} 11/07/2021 02:20:41 - INFO - __main__ - Step 36289: {'lr': 0.0004366548385533554, 'samples': 6967488, 'steps': 36288, 'loss/train': 0.9500797390937805} 11/07/2021 02:20:42 - INFO - __main__ - Step 36290: {'lr': 0.0004366513081917398, 'samples': 6967680, 'steps': 36289, 'loss/train': 1.5582741498947144} 11/07/2021 02:20:42 - INFO - __main__ - Step 36291: {'lr': 0.00043664777774602196, 'samples': 6967872, 'steps': 36290, 'loss/train': 1.4448840618133545} 11/07/2021 02:20:43 - INFO - __main__ - Step 36292: {'lr': 0.00043664424721620354, 'samples': 6968064, 'steps': 36291, 'loss/train': 1.062769889831543} 11/07/2021 02:20:43 - INFO - __main__ - Step 36293: {'lr': 0.00043664071660228605, 'samples': 6968256, 'steps': 36292, 'loss/train': 1.2941185235977173} 11/07/2021 02:20:43 - INFO - __main__ - Step 36294: {'lr': 0.00043663718590427117, 'samples': 6968448, 'steps': 36293, 'loss/train': 1.5335264205932617} 11/07/2021 02:20:44 - INFO - __main__ - Step 36295: {'lr': 0.0004366336551221605, 'samples': 6968640, 'steps': 36294, 'loss/train': 1.4312095642089844} 11/07/2021 02:20:45 - INFO - __main__ - Step 36296: {'lr': 0.0004366301242559555, 'samples': 6968832, 'steps': 36295, 'loss/train': 1.7201085090637207} 11/07/2021 02:20:45 - INFO - __main__ - Step 36297: {'lr': 0.00043662659330565793, 'samples': 6969024, 'steps': 36296, 'loss/train': 1.400758147239685} 11/07/2021 02:20:45 - INFO - __main__ - Step 36298: {'lr': 0.00043662306227126917, 'samples': 6969216, 'steps': 36297, 'loss/train': 1.5006085634231567} 11/07/2021 02:20:46 - INFO - __main__ - Step 36299: {'lr': 0.00043661953115279104, 'samples': 6969408, 'steps': 36298, 'loss/train': 0.569497287273407} 11/07/2021 02:20:46 - INFO - __main__ - Step 36300: {'lr': 0.000436615999950225, 'samples': 6969600, 'steps': 36299, 'loss/train': 0.7550137639045715} 11/07/2021 02:20:47 - INFO - __main__ - Step 36301: {'lr': 0.0004366124686635727, 'samples': 6969792, 'steps': 36300, 'loss/train': 1.804154396057129} 11/07/2021 02:20:47 - INFO - __main__ - Step 36302: {'lr': 0.00043660893729283564, 'samples': 6969984, 'steps': 36301, 'loss/train': 1.6032986640930176} 11/07/2021 02:20:48 - INFO - __main__ - Step 36303: {'lr': 0.0004366054058380155, 'samples': 6970176, 'steps': 36302, 'loss/train': 1.344807505607605} 11/07/2021 02:20:48 - INFO - __main__ - Step 36304: {'lr': 0.0004366018742991139, 'samples': 6970368, 'steps': 36303, 'loss/train': 1.6097887754440308} 11/07/2021 02:20:49 - INFO - __main__ - Step 36305: {'lr': 0.00043659834267613227, 'samples': 6970560, 'steps': 36304, 'loss/train': 1.4944956302642822} 11/07/2021 02:20:50 - INFO - __main__ - Step 36306: {'lr': 0.0004365948109690724, 'samples': 6970752, 'steps': 36305, 'loss/train': 1.6863945722579956} 11/07/2021 02:20:50 - INFO - __main__ - Step 36307: {'lr': 0.0004365912791779357, 'samples': 6970944, 'steps': 36306, 'loss/train': 1.2818877696990967} 11/07/2021 02:20:50 - INFO - __main__ - Step 36308: {'lr': 0.00043658774730272393, 'samples': 6971136, 'steps': 36307, 'loss/train': 1.4839028120040894} 11/07/2021 02:20:51 - INFO - __main__ - Step 36309: {'lr': 0.00043658421534343856, 'samples': 6971328, 'steps': 36308, 'loss/train': 1.585021734237671} 11/07/2021 02:20:51 - INFO - __main__ - Step 36310: {'lr': 0.0004365806833000813, 'samples': 6971520, 'steps': 36309, 'loss/train': 1.2770180702209473} 11/07/2021 02:20:52 - INFO - __main__ - Step 36311: {'lr': 0.0004365771511726535, 'samples': 6971712, 'steps': 36310, 'loss/train': 1.1752785444259644} 11/07/2021 02:20:53 - INFO - __main__ - Step 36312: {'lr': 0.00043657361896115706, 'samples': 6971904, 'steps': 36311, 'loss/train': 1.58674156665802} 11/07/2021 02:20:53 - INFO - __main__ - Step 36313: {'lr': 0.0004365700866655934, 'samples': 6972096, 'steps': 36312, 'loss/train': 1.2428057193756104} 11/07/2021 02:20:53 - INFO - __main__ - Step 36314: {'lr': 0.00043656655428596407, 'samples': 6972288, 'steps': 36313, 'loss/train': 1.4094693660736084} 11/07/2021 02:20:54 - INFO - __main__ - Step 36315: {'lr': 0.0004365630218222708, 'samples': 6972480, 'steps': 36314, 'loss/train': 1.0743906497955322} 11/07/2021 02:20:54 - INFO - __main__ - Step 36316: {'lr': 0.00043655948927451505, 'samples': 6972672, 'steps': 36315, 'loss/train': 0.8647530674934387} 11/07/2021 02:20:55 - INFO - __main__ - Step 36317: {'lr': 0.0004365559566426985, 'samples': 6972864, 'steps': 36316, 'loss/train': 1.3983964920043945} 11/07/2021 02:20:55 - INFO - __main__ - Step 36318: {'lr': 0.0004365524239268227, 'samples': 6973056, 'steps': 36317, 'loss/train': 1.3517965078353882} 11/07/2021 02:20:56 - INFO - __main__ - Step 36319: {'lr': 0.00043654889112688933, 'samples': 6973248, 'steps': 36318, 'loss/train': 1.9730541706085205} 11/07/2021 02:20:56 - INFO - __main__ - Step 36320: {'lr': 0.00043654535824289985, 'samples': 6973440, 'steps': 36319, 'loss/train': 1.7234567403793335} 11/07/2021 02:20:56 - INFO - __main__ - Step 36321: {'lr': 0.0004365418252748559, 'samples': 6973632, 'steps': 36320, 'loss/train': 1.5906174182891846} 11/07/2021 02:20:57 - INFO - __main__ - Step 36322: {'lr': 0.0004365382922227591, 'samples': 6973824, 'steps': 36321, 'loss/train': 1.6755872964859009} 11/07/2021 02:20:58 - INFO - __main__ - Step 36323: {'lr': 0.000436534759086611, 'samples': 6974016, 'steps': 36322, 'loss/train': 1.397365689277649} 11/07/2021 02:20:58 - INFO - __main__ - Step 36324: {'lr': 0.00043653122586641323, 'samples': 6974208, 'steps': 36323, 'loss/train': 1.6728856563568115} 11/07/2021 02:20:58 - INFO - __main__ - Step 36325: {'lr': 0.0004365276925621674, 'samples': 6974400, 'steps': 36324, 'loss/train': 1.2448979616165161} 11/07/2021 02:20:59 - INFO - __main__ - Step 36326: {'lr': 0.0004365241591738751, 'samples': 6974592, 'steps': 36325, 'loss/train': 1.7174886465072632} 11/07/2021 02:20:59 - INFO - __main__ - Step 36327: {'lr': 0.0004365206257015378, 'samples': 6974784, 'steps': 36326, 'loss/train': 1.560450792312622} 11/07/2021 02:21:00 - INFO - __main__ - Step 36328: {'lr': 0.0004365170921451572, 'samples': 6974976, 'steps': 36327, 'loss/train': 1.1287360191345215} 11/07/2021 02:21:01 - INFO - __main__ - Step 36329: {'lr': 0.00043651355850473495, 'samples': 6975168, 'steps': 36328, 'loss/train': 1.4507524967193604} 11/07/2021 02:21:01 - INFO - __main__ - Step 36330: {'lr': 0.0004365100247802725, 'samples': 6975360, 'steps': 36329, 'loss/train': 1.696042776107788} 11/07/2021 02:21:01 - INFO - __main__ - Step 36331: {'lr': 0.0004365064909717715, 'samples': 6975552, 'steps': 36330, 'loss/train': 1.5696899890899658} 11/07/2021 02:21:03 - INFO - __main__ - Step 36332: {'lr': 0.0004365029570792336, 'samples': 6975744, 'steps': 36331, 'loss/train': 1.5845402479171753} 11/07/2021 02:21:03 - INFO - __main__ - Step 36333: {'lr': 0.00043649942310266035, 'samples': 6975936, 'steps': 36332, 'loss/train': 0.9859981536865234} 11/07/2021 02:21:04 - INFO - __main__ - Step 36334: {'lr': 0.00043649588904205326, 'samples': 6976128, 'steps': 36333, 'loss/train': 1.5097932815551758} 11/07/2021 02:21:04 - INFO - __main__ - Step 36335: {'lr': 0.0004364923548974141, 'samples': 6976320, 'steps': 36334, 'loss/train': 2.3358113765716553} 11/07/2021 02:21:04 - INFO - __main__ - Step 36336: {'lr': 0.0004364888206687443, 'samples': 6976512, 'steps': 36335, 'loss/train': 2.372626304626465} 11/07/2021 02:21:05 - INFO - __main__ - Step 36337: {'lr': 0.00043648528635604556, 'samples': 6976704, 'steps': 36336, 'loss/train': 0.7707897424697876} 11/07/2021 02:21:06 - INFO - __main__ - Step 36338: {'lr': 0.00043648175195931937, 'samples': 6976896, 'steps': 36337, 'loss/train': 1.2917875051498413} 11/07/2021 02:21:06 - INFO - __main__ - Step 36339: {'lr': 0.0004364782174785674, 'samples': 6977088, 'steps': 36338, 'loss/train': 1.872575283050537} 11/07/2021 02:21:06 - INFO - __main__ - Step 36340: {'lr': 0.0004364746829137912, 'samples': 6977280, 'steps': 36339, 'loss/train': 1.193190336227417} 11/07/2021 02:21:07 - INFO - __main__ - Step 36341: {'lr': 0.0004364711482649925, 'samples': 6977472, 'steps': 36340, 'loss/train': 1.3252372741699219} 11/07/2021 02:21:07 - INFO - __main__ - Step 36342: {'lr': 0.00043646761353217266, 'samples': 6977664, 'steps': 36341, 'loss/train': 1.2717697620391846} 11/07/2021 02:21:08 - INFO - __main__ - Step 36343: {'lr': 0.0004364640787153334, 'samples': 6977856, 'steps': 36342, 'loss/train': 1.3853132724761963} 11/07/2021 02:21:08 - INFO - __main__ - Step 36344: {'lr': 0.0004364605438144764, 'samples': 6978048, 'steps': 36343, 'loss/train': 1.5299681425094604} 11/07/2021 02:21:09 - INFO - __main__ - Step 36345: {'lr': 0.000436457008829603, 'samples': 6978240, 'steps': 36344, 'loss/train': 1.5941821336746216} 11/07/2021 02:21:09 - INFO - __main__ - Step 36346: {'lr': 0.00043645347376071507, 'samples': 6978432, 'steps': 36345, 'loss/train': 1.7303555011749268} 11/07/2021 02:21:09 - INFO - __main__ - Step 36347: {'lr': 0.0004364499386078141, 'samples': 6978624, 'steps': 36346, 'loss/train': 1.1477150917053223} 11/07/2021 02:21:11 - INFO - __main__ - Step 36348: {'lr': 0.00043644640337090157, 'samples': 6978816, 'steps': 36347, 'loss/train': 1.7781087160110474} 11/07/2021 02:21:11 - INFO - __main__ - Step 36349: {'lr': 0.0004364428680499792, 'samples': 6979008, 'steps': 36348, 'loss/train': 1.4340267181396484} 11/07/2021 02:21:11 - INFO - __main__ - Step 36350: {'lr': 0.0004364393326450486, 'samples': 6979200, 'steps': 36349, 'loss/train': 1.4936268329620361} 11/07/2021 02:21:12 - INFO - __main__ - Step 36351: {'lr': 0.00043643579715611124, 'samples': 6979392, 'steps': 36350, 'loss/train': 1.516542673110962} 11/07/2021 02:21:12 - INFO - __main__ - Step 36352: {'lr': 0.00043643226158316886, 'samples': 6979584, 'steps': 36351, 'loss/train': 3.1933603286743164} 11/07/2021 02:21:12 - INFO - __main__ - Step 36353: {'lr': 0.00043642872592622293, 'samples': 6979776, 'steps': 36352, 'loss/train': 1.3439143896102905} 11/07/2021 02:21:13 - INFO - __main__ - Step 36354: {'lr': 0.0004364251901852751, 'samples': 6979968, 'steps': 36353, 'loss/train': 1.352319359779358} 11/07/2021 02:21:14 - INFO - __main__ - Step 36355: {'lr': 0.000436421654360327, 'samples': 6980160, 'steps': 36354, 'loss/train': 1.4244893789291382} 11/07/2021 02:21:14 - INFO - __main__ - Step 36356: {'lr': 0.00043641811845138016, 'samples': 6980352, 'steps': 36355, 'loss/train': 1.4638200998306274} 11/07/2021 02:21:14 - INFO - __main__ - Step 36357: {'lr': 0.0004364145824584361, 'samples': 6980544, 'steps': 36356, 'loss/train': 1.4082454442977905} 11/07/2021 02:21:15 - INFO - __main__ - Step 36358: {'lr': 0.00043641104638149656, 'samples': 6980736, 'steps': 36357, 'loss/train': 1.9423258304595947} 11/07/2021 02:21:16 - INFO - __main__ - Step 36359: {'lr': 0.00043640751022056316, 'samples': 6980928, 'steps': 36358, 'loss/train': 1.1190898418426514} 11/07/2021 02:21:16 - INFO - __main__ - Step 36360: {'lr': 0.00043640397397563737, 'samples': 6981120, 'steps': 36359, 'loss/train': 1.2740697860717773} 11/07/2021 02:21:16 - INFO - __main__ - Step 36361: {'lr': 0.00043640043764672077, 'samples': 6981312, 'steps': 36360, 'loss/train': 1.6111739873886108} 11/07/2021 02:21:17 - INFO - __main__ - Step 36362: {'lr': 0.00043639690123381503, 'samples': 6981504, 'steps': 36361, 'loss/train': 0.9121442437171936} 11/07/2021 02:21:17 - INFO - __main__ - Step 36363: {'lr': 0.00043639336473692174, 'samples': 6981696, 'steps': 36362, 'loss/train': 1.8936302661895752} 11/07/2021 02:21:18 - INFO - __main__ - Step 36364: {'lr': 0.00043638982815604247, 'samples': 6981888, 'steps': 36363, 'loss/train': 1.2038345336914062} 11/07/2021 02:21:19 - INFO - __main__ - Step 36365: {'lr': 0.00043638629149117883, 'samples': 6982080, 'steps': 36364, 'loss/train': 1.5564985275268555} 11/07/2021 02:21:19 - INFO - __main__ - Step 36366: {'lr': 0.0004363827547423324, 'samples': 6982272, 'steps': 36365, 'loss/train': 4.262648582458496} 11/07/2021 02:21:19 - INFO - __main__ - Step 36367: {'lr': 0.00043637921790950476, 'samples': 6982464, 'steps': 36366, 'loss/train': 1.6036688089370728} 11/07/2021 02:21:20 - INFO - __main__ - Step 36368: {'lr': 0.00043637568099269753, 'samples': 6982656, 'steps': 36367, 'loss/train': 1.0962079763412476} 11/07/2021 02:21:21 - INFO - __main__ - Step 36369: {'lr': 0.00043637214399191234, 'samples': 6982848, 'steps': 36368, 'loss/train': 1.3688468933105469} 11/07/2021 02:21:21 - INFO - __main__ - Step 36370: {'lr': 0.00043636860690715064, 'samples': 6983040, 'steps': 36369, 'loss/train': 1.4799058437347412} 11/07/2021 02:21:21 - INFO - __main__ - Step 36371: {'lr': 0.00043636506973841424, 'samples': 6983232, 'steps': 36370, 'loss/train': 1.8318012952804565} 11/07/2021 02:21:22 - INFO - __main__ - Step 36372: {'lr': 0.00043636153248570453, 'samples': 6983424, 'steps': 36371, 'loss/train': 1.3730608224868774} 11/07/2021 02:21:22 - INFO - __main__ - Step 36373: {'lr': 0.0004363579951490232, 'samples': 6983616, 'steps': 36372, 'loss/train': 1.7554099559783936} 11/07/2021 02:21:22 - INFO - __main__ - Step 36374: {'lr': 0.0004363544577283718, 'samples': 6983808, 'steps': 36373, 'loss/train': 1.4895075559616089} 11/07/2021 02:21:24 - INFO - __main__ - Step 36375: {'lr': 0.0004363509202237521, 'samples': 6984000, 'steps': 36374, 'loss/train': 1.4852935075759888} 11/07/2021 02:21:24 - INFO - __main__ - Step 36376: {'lr': 0.0004363473826351654, 'samples': 6984192, 'steps': 36375, 'loss/train': 1.0466312170028687} 11/07/2021 02:21:24 - INFO - __main__ - Step 36377: {'lr': 0.0004363438449626135, 'samples': 6984384, 'steps': 36376, 'loss/train': 1.2807888984680176} 11/07/2021 02:21:25 - INFO - __main__ - Step 36378: {'lr': 0.000436340307206098, 'samples': 6984576, 'steps': 36377, 'loss/train': 1.4843051433563232} 11/07/2021 02:21:25 - INFO - __main__ - Step 36379: {'lr': 0.00043633676936562026, 'samples': 6984768, 'steps': 36378, 'loss/train': 1.423012375831604} 11/07/2021 02:21:26 - INFO - __main__ - Step 36380: {'lr': 0.0004363332314411822, 'samples': 6984960, 'steps': 36379, 'loss/train': 1.7427172660827637} 11/07/2021 02:21:26 - INFO - __main__ - Step 36381: {'lr': 0.0004363296934327852, 'samples': 6985152, 'steps': 36380, 'loss/train': 1.600832223892212} 11/07/2021 02:21:27 - INFO - __main__ - Step 36382: {'lr': 0.00043632615534043096, 'samples': 6985344, 'steps': 36381, 'loss/train': 2.1246707439422607} 11/07/2021 02:21:27 - INFO - __main__ - Step 36383: {'lr': 0.00043632261716412097, 'samples': 6985536, 'steps': 36382, 'loss/train': 0.906044065952301} 11/07/2021 02:21:27 - INFO - __main__ - Step 36384: {'lr': 0.0004363190789038569, 'samples': 6985728, 'steps': 36383, 'loss/train': 1.3539206981658936} 11/07/2021 02:21:28 - INFO - __main__ - Step 36385: {'lr': 0.0004363155405596404, 'samples': 6985920, 'steps': 36384, 'loss/train': 1.7368046045303345} 11/07/2021 02:21:29 - INFO - __main__ - Step 36386: {'lr': 0.00043631200213147296, 'samples': 6986112, 'steps': 36385, 'loss/train': 1.5509334802627563} 11/07/2021 02:21:29 - INFO - __main__ - Step 36387: {'lr': 0.0004363084636193561, 'samples': 6986304, 'steps': 36386, 'loss/train': 1.6322015523910522} 11/07/2021 02:21:29 - INFO - __main__ - Step 36388: {'lr': 0.0004363049250232917, 'samples': 6986496, 'steps': 36387, 'loss/train': 0.9802109003067017} 11/07/2021 02:21:30 - INFO - __main__ - Step 36389: {'lr': 0.000436301386343281, 'samples': 6986688, 'steps': 36388, 'loss/train': 1.4826627969741821} 11/07/2021 02:21:31 - INFO - __main__ - Step 36390: {'lr': 0.0004362978475793259, 'samples': 6986880, 'steps': 36389, 'loss/train': 1.9039173126220703} 11/07/2021 02:21:31 - INFO - __main__ - Step 36391: {'lr': 0.00043629430873142773, 'samples': 6987072, 'steps': 36390, 'loss/train': 0.39275264739990234} 11/07/2021 02:21:31 - INFO - __main__ - Step 36392: {'lr': 0.00043629076979958837, 'samples': 6987264, 'steps': 36391, 'loss/train': 1.394723653793335} 11/07/2021 02:21:32 - INFO - __main__ - Step 36393: {'lr': 0.00043628723078380916, 'samples': 6987456, 'steps': 36392, 'loss/train': 1.5663731098175049} 11/07/2021 02:21:32 - INFO - __main__ - Step 36394: {'lr': 0.0004362836916840919, 'samples': 6987648, 'steps': 36393, 'loss/train': 1.3121269941329956} 11/07/2021 02:21:33 - INFO - __main__ - Step 36395: {'lr': 0.00043628015250043794, 'samples': 6987840, 'steps': 36394, 'loss/train': 1.107160210609436} 11/07/2021 02:21:33 - INFO - __main__ - Step 36396: {'lr': 0.00043627661323284914, 'samples': 6988032, 'steps': 36395, 'loss/train': 1.2629873752593994} 11/07/2021 02:21:34 - INFO - __main__ - Step 36397: {'lr': 0.00043627307388132693, 'samples': 6988224, 'steps': 36396, 'loss/train': 2.3927032947540283} 11/07/2021 02:21:34 - INFO - __main__ - Step 36398: {'lr': 0.0004362695344458729, 'samples': 6988416, 'steps': 36397, 'loss/train': 1.5395231246948242} 11/07/2021 02:21:35 - INFO - __main__ - Step 36399: {'lr': 0.00043626599492648877, 'samples': 6988608, 'steps': 36398, 'loss/train': 1.4821674823760986} 11/07/2021 02:21:36 - INFO - __main__ - Step 36400: {'lr': 0.000436262455323176, 'samples': 6988800, 'steps': 36399, 'loss/train': 1.1854488849639893} 11/07/2021 02:21:36 - INFO - __main__ - Step 36401: {'lr': 0.0004362589156359363, 'samples': 6988992, 'steps': 36400, 'loss/train': 1.455713152885437} 11/07/2021 02:21:36 - INFO - __main__ - Step 36402: {'lr': 0.00043625537586477114, 'samples': 6989184, 'steps': 36401, 'loss/train': 1.39728581905365} 11/07/2021 02:21:37 - INFO - __main__ - Step 36403: {'lr': 0.00043625183600968224, 'samples': 6989376, 'steps': 36402, 'loss/train': 1.764100193977356} 11/07/2021 02:21:37 - INFO - __main__ - Step 36404: {'lr': 0.00043624829607067105, 'samples': 6989568, 'steps': 36403, 'loss/train': 1.5400242805480957} 11/07/2021 02:21:37 - INFO - __main__ - Step 36405: {'lr': 0.0004362447560477394, 'samples': 6989760, 'steps': 36404, 'loss/train': 1.6310828924179077} 11/07/2021 02:21:38 - INFO - __main__ - Step 36406: {'lr': 0.0004362412159408886, 'samples': 6989952, 'steps': 36405, 'loss/train': 1.5401593446731567} 11/07/2021 02:21:39 - INFO - __main__ - Step 36407: {'lr': 0.0004362376757501205, 'samples': 6990144, 'steps': 36406, 'loss/train': 0.9658600091934204} 11/07/2021 02:21:39 - INFO - __main__ - Step 36408: {'lr': 0.00043623413547543645, 'samples': 6990336, 'steps': 36407, 'loss/train': 1.0938903093338013} 11/07/2021 02:21:39 - INFO - __main__ - Step 36409: {'lr': 0.00043623059511683826, 'samples': 6990528, 'steps': 36408, 'loss/train': 1.7532463073730469} 11/07/2021 02:21:40 - INFO - __main__ - Step 36410: {'lr': 0.0004362270546743274, 'samples': 6990720, 'steps': 36409, 'loss/train': 1.4016672372817993} 11/07/2021 02:21:41 - INFO - __main__ - Step 36411: {'lr': 0.0004362235141479055, 'samples': 6990912, 'steps': 36410, 'loss/train': 1.5435714721679688} 11/07/2021 02:21:41 - INFO - __main__ - Step 36412: {'lr': 0.0004362199735375742, 'samples': 6991104, 'steps': 36411, 'loss/train': 1.1211869716644287} 11/07/2021 02:21:42 - INFO - __main__ - Step 36413: {'lr': 0.000436216432843335, 'samples': 6991296, 'steps': 36412, 'loss/train': 0.736153244972229} 11/07/2021 02:21:42 - INFO - __main__ - Step 36414: {'lr': 0.00043621289206518957, 'samples': 6991488, 'steps': 36413, 'loss/train': 1.6953119039535522} 11/07/2021 02:21:42 - INFO - __main__ - Step 36415: {'lr': 0.00043620935120313955, 'samples': 6991680, 'steps': 36414, 'loss/train': 1.5244629383087158} 11/07/2021 02:21:44 - INFO - __main__ - Step 36416: {'lr': 0.0004362058102571864, 'samples': 6991872, 'steps': 36415, 'loss/train': 1.6046603918075562} 11/07/2021 02:21:44 - INFO - __main__ - Step 36417: {'lr': 0.00043620226922733174, 'samples': 6992064, 'steps': 36416, 'loss/train': 1.2570933103561401} 11/07/2021 02:21:44 - INFO - __main__ - Step 36418: {'lr': 0.0004361987281135773, 'samples': 6992256, 'steps': 36417, 'loss/train': 1.9624028205871582} 11/07/2021 02:21:45 - INFO - __main__ - Step 36419: {'lr': 0.00043619518691592453, 'samples': 6992448, 'steps': 36418, 'loss/train': 0.7464272975921631} 11/07/2021 02:21:45 - INFO - __main__ - Step 36420: {'lr': 0.00043619164563437506, 'samples': 6992640, 'steps': 36419, 'loss/train': 2.0012402534484863} 11/07/2021 02:21:46 - INFO - __main__ - Step 36421: {'lr': 0.0004361881042689306, 'samples': 6992832, 'steps': 36420, 'loss/train': 0.9991865754127502} 11/07/2021 02:21:46 - INFO - __main__ - Step 36422: {'lr': 0.00043618456281959263, 'samples': 6993024, 'steps': 36421, 'loss/train': 1.4010941982269287} 11/07/2021 02:21:47 - INFO - __main__ - Step 36423: {'lr': 0.0004361810212863627, 'samples': 6993216, 'steps': 36422, 'loss/train': 1.5331019163131714} 11/07/2021 02:21:47 - INFO - __main__ - Step 36424: {'lr': 0.0004361774796692425, 'samples': 6993408, 'steps': 36423, 'loss/train': 1.7541043758392334} 11/07/2021 02:21:47 - INFO - __main__ - Step 36425: {'lr': 0.00043617393796823367, 'samples': 6993600, 'steps': 36424, 'loss/train': 1.4145983457565308} 11/07/2021 02:21:48 - INFO - __main__ - Step 36426: {'lr': 0.00043617039618333765, 'samples': 6993792, 'steps': 36425, 'loss/train': 1.6360641717910767} 11/07/2021 02:21:49 - INFO - __main__ - Step 36427: {'lr': 0.00043616685431455615, 'samples': 6993984, 'steps': 36426, 'loss/train': 1.2648183107376099} 11/07/2021 02:21:49 - INFO - __main__ - Step 36428: {'lr': 0.0004361633123618908, 'samples': 6994176, 'steps': 36427, 'loss/train': 1.10480797290802} 11/07/2021 02:21:49 - INFO - __main__ - Step 36429: {'lr': 0.00043615977032534305, 'samples': 6994368, 'steps': 36428, 'loss/train': 1.7303485870361328} 11/07/2021 02:21:50 - INFO - __main__ - Step 36430: {'lr': 0.00043615622820491464, 'samples': 6994560, 'steps': 36429, 'loss/train': 1.4921950101852417} 11/07/2021 02:21:50 - INFO - __main__ - Step 36431: {'lr': 0.00043615268600060705, 'samples': 6994752, 'steps': 36430, 'loss/train': 1.4308103322982788} 11/07/2021 02:21:52 - INFO - __main__ - Step 36432: {'lr': 0.000436149143712422, 'samples': 6994944, 'steps': 36431, 'loss/train': 1.1433682441711426} 11/07/2021 02:21:52 - INFO - __main__ - Step 36433: {'lr': 0.0004361456013403609, 'samples': 6995136, 'steps': 36432, 'loss/train': 1.522065281867981} 11/07/2021 02:21:53 - INFO - __main__ - Step 36434: {'lr': 0.00043614205888442553, 'samples': 6995328, 'steps': 36433, 'loss/train': 1.2452378273010254} 11/07/2021 02:21:53 - INFO - __main__ - Step 36435: {'lr': 0.00043613851634461743, 'samples': 6995520, 'steps': 36434, 'loss/train': 1.5099252462387085} 11/07/2021 02:21:53 - INFO - __main__ - Step 36436: {'lr': 0.00043613497372093827, 'samples': 6995712, 'steps': 36435, 'loss/train': 1.4279507398605347} 11/07/2021 02:21:54 - INFO - __main__ - Step 36437: {'lr': 0.0004361314310133894, 'samples': 6995904, 'steps': 36436, 'loss/train': 1.2324637174606323} 11/07/2021 02:21:54 - INFO - __main__ - Step 36438: {'lr': 0.00043612788822197266, 'samples': 6996096, 'steps': 36437, 'loss/train': 1.9096089601516724} 11/07/2021 02:21:54 - INFO - __main__ - Step 36439: {'lr': 0.0004361243453466896, 'samples': 6996288, 'steps': 36438, 'loss/train': 1.8450305461883545} 11/07/2021 02:21:55 - INFO - __main__ - Step 36440: {'lr': 0.0004361208023875417, 'samples': 6996480, 'steps': 36439, 'loss/train': 1.8311712741851807} 11/07/2021 02:21:56 - INFO - __main__ - Step 36441: {'lr': 0.00043611725934453074, 'samples': 6996672, 'steps': 36440, 'loss/train': 1.368140459060669} 11/07/2021 02:21:56 - INFO - __main__ - Step 36442: {'lr': 0.00043611371621765817, 'samples': 6996864, 'steps': 36441, 'loss/train': 1.5333380699157715} 11/07/2021 02:21:57 - INFO - __main__ - Step 36443: {'lr': 0.0004361101730069256, 'samples': 6997056, 'steps': 36442, 'loss/train': 0.9707183837890625} 11/07/2021 02:21:57 - INFO - __main__ - Step 36444: {'lr': 0.00043610662971233465, 'samples': 6997248, 'steps': 36443, 'loss/train': 1.5515861511230469} 11/07/2021 02:21:58 - INFO - __main__ - Step 36445: {'lr': 0.00043610308633388695, 'samples': 6997440, 'steps': 36444, 'loss/train': 1.4722208976745605} 11/07/2021 02:21:58 - INFO - __main__ - Step 36446: {'lr': 0.0004360995428715841, 'samples': 6997632, 'steps': 36445, 'loss/train': 1.4910660982131958} 11/07/2021 02:21:59 - INFO - __main__ - Step 36447: {'lr': 0.00043609599932542764, 'samples': 6997824, 'steps': 36446, 'loss/train': 1.9633461236953735} 11/07/2021 02:21:59 - INFO - __main__ - Step 36448: {'lr': 0.00043609245569541924, 'samples': 6998016, 'steps': 36447, 'loss/train': 1.5884859561920166} 11/07/2021 02:21:59 - INFO - __main__ - Step 36449: {'lr': 0.00043608891198156037, 'samples': 6998208, 'steps': 36448, 'loss/train': 1.4491236209869385} 11/07/2021 02:22:00 - INFO - __main__ - Step 36450: {'lr': 0.0004360853681838528, 'samples': 6998400, 'steps': 36449, 'loss/train': 1.4041427373886108} 11/07/2021 02:22:01 - INFO - __main__ - Step 36451: {'lr': 0.0004360818243022979, 'samples': 6998592, 'steps': 36450, 'loss/train': 1.247759222984314} 11/07/2021 02:22:01 - INFO - __main__ - Step 36452: {'lr': 0.00043607828033689753, 'samples': 6998784, 'steps': 36451, 'loss/train': 1.145880937576294} 11/07/2021 02:22:01 - INFO - __main__ - Step 36453: {'lr': 0.000436074736287653, 'samples': 6998976, 'steps': 36452, 'loss/train': 1.6394306421279907} 11/07/2021 02:22:02 - INFO - __main__ - Step 36454: {'lr': 0.00043607119215456625, 'samples': 6999168, 'steps': 36453, 'loss/train': 1.3804353475570679} 11/07/2021 02:22:03 - INFO - __main__ - Step 36455: {'lr': 0.00043606764793763865, 'samples': 6999360, 'steps': 36454, 'loss/train': 1.7423021793365479} 11/07/2021 02:22:03 - INFO - __main__ - Step 36456: {'lr': 0.00043606410363687177, 'samples': 6999552, 'steps': 36455, 'loss/train': 1.3185689449310303} 11/07/2021 02:22:04 - INFO - __main__ - Step 36457: {'lr': 0.00043606055925226727, 'samples': 6999744, 'steps': 36456, 'loss/train': 1.5543650388717651} 11/07/2021 02:22:04 - INFO - __main__ - Step 36458: {'lr': 0.0004360570147838269, 'samples': 6999936, 'steps': 36457, 'loss/train': 1.7427715063095093} 11/07/2021 02:22:04 - INFO - __main__ - Step 36459: {'lr': 0.00043605347023155193, 'samples': 7000128, 'steps': 36458, 'loss/train': 1.5395492315292358} 11/07/2021 02:22:05 - INFO - __main__ - Step 36460: {'lr': 0.0004360499255954442, 'samples': 7000320, 'steps': 36459, 'loss/train': 1.176275372505188} 11/07/2021 02:22:06 - INFO - __main__ - Step 36461: {'lr': 0.0004360463808755053, 'samples': 7000512, 'steps': 36460, 'loss/train': 1.3563385009765625} 11/07/2021 02:22:06 - INFO - __main__ - Step 36462: {'lr': 0.00043604283607173673, 'samples': 7000704, 'steps': 36461, 'loss/train': 1.5283113718032837} 11/07/2021 02:22:06 - INFO - __main__ - Step 36463: {'lr': 0.0004360392911841401, 'samples': 7000896, 'steps': 36462, 'loss/train': 1.3267556428909302} 11/07/2021 02:22:07 - INFO - __main__ - Step 36464: {'lr': 0.0004360357462127171, 'samples': 7001088, 'steps': 36463, 'loss/train': 1.5522664785385132} 11/07/2021 02:22:07 - INFO - __main__ - Step 36465: {'lr': 0.0004360322011574692, 'samples': 7001280, 'steps': 36464, 'loss/train': 1.4915170669555664} 11/07/2021 02:22:08 - INFO - __main__ - Step 36466: {'lr': 0.00043602865601839817, 'samples': 7001472, 'steps': 36465, 'loss/train': 1.3296517133712769} 11/07/2021 02:22:08 - INFO - __main__ - Step 36467: {'lr': 0.00043602511079550535, 'samples': 7001664, 'steps': 36466, 'loss/train': 1.2712076902389526} 11/07/2021 02:22:09 - INFO - __main__ - Step 36468: {'lr': 0.0004360215654887926, 'samples': 7001856, 'steps': 36467, 'loss/train': 1.1578062772750854} 11/07/2021 02:22:09 - INFO - __main__ - Step 36469: {'lr': 0.0004360180200982613, 'samples': 7002048, 'steps': 36468, 'loss/train': 1.7998279333114624} 11/07/2021 02:22:09 - INFO - __main__ - Step 36470: {'lr': 0.00043601447462391317, 'samples': 7002240, 'steps': 36469, 'loss/train': 1.8373972177505493} 11/07/2021 02:22:10 - INFO - __main__ - Step 36471: {'lr': 0.00043601092906574986, 'samples': 7002432, 'steps': 36470, 'loss/train': 1.81533944606781} 11/07/2021 02:22:11 - INFO - __main__ - Step 36472: {'lr': 0.0004360073834237729, 'samples': 7002624, 'steps': 36471, 'loss/train': 1.2724525928497314} 11/07/2021 02:22:11 - INFO - __main__ - Step 36473: {'lr': 0.0004360038376979838, 'samples': 7002816, 'steps': 36472, 'loss/train': 1.720353364944458} 11/07/2021 02:22:12 - INFO - __main__ - Step 36474: {'lr': 0.0004360002918883843, 'samples': 7003008, 'steps': 36473, 'loss/train': 1.4903210401535034} 11/07/2021 02:22:12 - INFO - __main__ - Step 36475: {'lr': 0.00043599674599497593, 'samples': 7003200, 'steps': 36474, 'loss/train': 1.5465788841247559} 11/07/2021 02:22:13 - INFO - __main__ - Step 36476: {'lr': 0.00043599320001776025, 'samples': 7003392, 'steps': 36475, 'loss/train': 1.2533528804779053} 11/07/2021 02:22:13 - INFO - __main__ - Step 36477: {'lr': 0.00043598965395673893, 'samples': 7003584, 'steps': 36476, 'loss/train': 1.5747464895248413} 11/07/2021 02:22:14 - INFO - __main__ - Step 36478: {'lr': 0.0004359861078119136, 'samples': 7003776, 'steps': 36477, 'loss/train': 1.5156866312026978} 11/07/2021 02:22:14 - INFO - __main__ - Step 36479: {'lr': 0.00043598256158328575, 'samples': 7003968, 'steps': 36478, 'loss/train': 1.7108465433120728} 11/07/2021 02:22:14 - INFO - __main__ - Step 36480: {'lr': 0.00043597901527085703, 'samples': 7004160, 'steps': 36479, 'loss/train': 0.9954426884651184} 11/07/2021 02:22:15 - INFO - __main__ - Step 36481: {'lr': 0.000435975468874629, 'samples': 7004352, 'steps': 36480, 'loss/train': 1.4760842323303223} 11/07/2021 02:22:16 - INFO - __main__ - Step 36482: {'lr': 0.00043597192239460336, 'samples': 7004544, 'steps': 36481, 'loss/train': 1.4253824949264526} 11/07/2021 02:22:16 - INFO - __main__ - Step 36483: {'lr': 0.00043596837583078165, 'samples': 7004736, 'steps': 36482, 'loss/train': 1.7219206094741821} 11/07/2021 02:22:16 - INFO - __main__ - Step 36484: {'lr': 0.0004359648291831654, 'samples': 7004928, 'steps': 36483, 'loss/train': 1.0681475400924683} 11/07/2021 02:22:17 - INFO - __main__ - Step 36485: {'lr': 0.0004359612824517563, 'samples': 7005120, 'steps': 36484, 'loss/train': 1.6380879878997803} 11/07/2021 02:22:18 - INFO - __main__ - Step 36486: {'lr': 0.0004359577356365559, 'samples': 7005312, 'steps': 36485, 'loss/train': 1.3437206745147705} 11/07/2021 02:22:18 - INFO - __main__ - Step 36487: {'lr': 0.00043595418873756584, 'samples': 7005504, 'steps': 36486, 'loss/train': 1.4168643951416016} 11/07/2021 02:22:18 - INFO - __main__ - Step 36488: {'lr': 0.0004359506417547876, 'samples': 7005696, 'steps': 36487, 'loss/train': 1.3627259731292725} 11/07/2021 02:22:19 - INFO - __main__ - Step 36489: {'lr': 0.000435947094688223, 'samples': 7005888, 'steps': 36488, 'loss/train': 1.4367139339447021} 11/07/2021 02:22:19 - INFO - __main__ - Step 36490: {'lr': 0.0004359435475378735, 'samples': 7006080, 'steps': 36489, 'loss/train': 1.403445839881897} 11/07/2021 02:22:20 - INFO - __main__ - Step 36491: {'lr': 0.0004359400003037406, 'samples': 7006272, 'steps': 36490, 'loss/train': 1.608238697052002} 11/07/2021 02:22:20 - INFO - __main__ - Step 36492: {'lr': 0.0004359364529858261, 'samples': 7006464, 'steps': 36491, 'loss/train': 2.306702136993408} 11/07/2021 02:22:21 - INFO - __main__ - Step 36493: {'lr': 0.00043593290558413143, 'samples': 7006656, 'steps': 36492, 'loss/train': 1.7007659673690796} 11/07/2021 02:22:21 - INFO - __main__ - Step 36494: {'lr': 0.0004359293580986583, 'samples': 7006848, 'steps': 36493, 'loss/train': 1.8951630592346191} 11/07/2021 02:22:21 - INFO - __main__ - Step 36495: {'lr': 0.0004359258105294083, 'samples': 7007040, 'steps': 36494, 'loss/train': 1.4092471599578857} 11/07/2021 02:22:23 - INFO - __main__ - Step 36496: {'lr': 0.0004359222628763829, 'samples': 7007232, 'steps': 36495, 'loss/train': 1.3110671043395996} 11/07/2021 02:22:23 - INFO - __main__ - Step 36497: {'lr': 0.0004359187151395839, 'samples': 7007424, 'steps': 36496, 'loss/train': 1.6679835319519043} 11/07/2021 02:22:24 - INFO - __main__ - Step 36498: {'lr': 0.0004359151673190127, 'samples': 7007616, 'steps': 36497, 'loss/train': 1.349755883216858} 11/07/2021 02:22:24 - INFO - __main__ - Step 36499: {'lr': 0.0004359116194146711, 'samples': 7007808, 'steps': 36498, 'loss/train': 1.5411666631698608} 11/07/2021 02:22:24 - INFO - __main__ - Step 36500: {'lr': 0.0004359080714265605, 'samples': 7008000, 'steps': 36499, 'loss/train': 1.6351027488708496} 11/07/2021 02:22:25 - INFO - __main__ - Step 36501: {'lr': 0.00043590452335468265, 'samples': 7008192, 'steps': 36500, 'loss/train': 0.6577228307723999} 11/07/2021 02:22:26 - INFO - __main__ - Step 36502: {'lr': 0.00043590097519903917, 'samples': 7008384, 'steps': 36501, 'loss/train': 0.4223167896270752} 11/07/2021 02:22:26 - INFO - __main__ - Step 36503: {'lr': 0.0004358974269596314, 'samples': 7008576, 'steps': 36502, 'loss/train': 1.6163660287857056} 11/07/2021 02:22:26 - INFO - __main__ - Step 36504: {'lr': 0.00043589387863646125, 'samples': 7008768, 'steps': 36503, 'loss/train': 1.5610636472702026} 11/07/2021 02:22:27 - INFO - __main__ - Step 36505: {'lr': 0.0004358903302295301, 'samples': 7008960, 'steps': 36504, 'loss/train': 1.8605214357376099} 11/07/2021 02:22:27 - INFO - __main__ - Step 36506: {'lr': 0.0004358867817388397, 'samples': 7009152, 'steps': 36505, 'loss/train': 1.8321130275726318} 11/07/2021 02:22:28 - INFO - __main__ - Step 36507: {'lr': 0.0004358832331643916, 'samples': 7009344, 'steps': 36506, 'loss/train': 1.1233673095703125} 11/07/2021 02:22:29 - INFO - __main__ - Step 36508: {'lr': 0.0004358796845061873, 'samples': 7009536, 'steps': 36507, 'loss/train': 1.2812395095825195} 11/07/2021 02:22:29 - INFO - __main__ - Step 36509: {'lr': 0.00043587613576422855, 'samples': 7009728, 'steps': 36508, 'loss/train': 1.5616999864578247} 11/07/2021 02:22:29 - INFO - __main__ - Step 36510: {'lr': 0.00043587258693851685, 'samples': 7009920, 'steps': 36509, 'loss/train': 1.5353094339370728} 11/07/2021 02:22:30 - INFO - __main__ - Step 36511: {'lr': 0.0004358690380290539, 'samples': 7010112, 'steps': 36510, 'loss/train': 1.4021555185317993} 11/07/2021 02:22:31 - INFO - __main__ - Step 36512: {'lr': 0.00043586548903584113, 'samples': 7010304, 'steps': 36511, 'loss/train': 1.5102307796478271} 11/07/2021 02:22:31 - INFO - __main__ - Step 36513: {'lr': 0.0004358619399588802, 'samples': 7010496, 'steps': 36512, 'loss/train': 1.0852004289627075} 11/07/2021 02:22:31 - INFO - __main__ - Step 36514: {'lr': 0.0004358583907981729, 'samples': 7010688, 'steps': 36513, 'loss/train': 1.7945281267166138} 11/07/2021 02:22:32 - INFO - __main__ - Step 36515: {'lr': 0.0004358548415537206, 'samples': 7010880, 'steps': 36514, 'loss/train': 0.9975671768188477} 11/07/2021 02:22:32 - INFO - __main__ - Step 36516: {'lr': 0.000435851292225525, 'samples': 7011072, 'steps': 36515, 'loss/train': 1.5559824705123901} 11/07/2021 02:22:32 - INFO - __main__ - Step 36517: {'lr': 0.0004358477428135876, 'samples': 7011264, 'steps': 36516, 'loss/train': 1.3623974323272705} 11/07/2021 02:22:34 - INFO - __main__ - Step 36518: {'lr': 0.00043584419331791014, 'samples': 7011456, 'steps': 36517, 'loss/train': 1.2668063640594482} 11/07/2021 02:22:34 - INFO - __main__ - Step 36519: {'lr': 0.0004358406437384942, 'samples': 7011648, 'steps': 36518, 'loss/train': 1.5969531536102295} 11/07/2021 02:22:34 - INFO - __main__ - Step 36520: {'lr': 0.0004358370940753412, 'samples': 7011840, 'steps': 36519, 'loss/train': 0.9871808886528015} 11/07/2021 02:22:35 - INFO - __main__ - Step 36521: {'lr': 0.000435833544328453, 'samples': 7012032, 'steps': 36520, 'loss/train': 1.0032232999801636} 11/07/2021 02:22:35 - INFO - __main__ - Step 36522: {'lr': 0.00043582999449783103, 'samples': 7012224, 'steps': 36521, 'loss/train': 1.8288178443908691} 11/07/2021 02:22:36 - INFO - __main__ - Step 36523: {'lr': 0.0004358264445834769, 'samples': 7012416, 'steps': 36522, 'loss/train': 1.3894726037979126} 11/07/2021 02:22:37 - INFO - __main__ - Step 36524: {'lr': 0.00043582289458539224, 'samples': 7012608, 'steps': 36523, 'loss/train': 1.4437963962554932} 11/07/2021 02:22:37 - INFO - __main__ - Step 36525: {'lr': 0.00043581934450357876, 'samples': 7012800, 'steps': 36524, 'loss/train': 1.240963339805603} 11/07/2021 02:22:37 - INFO - __main__ - Step 36526: {'lr': 0.0004358157943380379, 'samples': 7012992, 'steps': 36525, 'loss/train': 1.2418889999389648} 11/07/2021 02:22:38 - INFO - __main__ - Step 36527: {'lr': 0.00043581224408877116, 'samples': 7013184, 'steps': 36526, 'loss/train': 1.6116684675216675} 11/07/2021 02:22:38 - INFO - __main__ - Step 36528: {'lr': 0.00043580869375578046, 'samples': 7013376, 'steps': 36527, 'loss/train': 1.7534099817276} 11/07/2021 02:22:39 - INFO - __main__ - Step 36529: {'lr': 0.00043580514333906717, 'samples': 7013568, 'steps': 36528, 'loss/train': 1.2966219186782837} 11/07/2021 02:22:39 - INFO - __main__ - Step 36530: {'lr': 0.000435801592838633, 'samples': 7013760, 'steps': 36529, 'loss/train': 1.220832347869873} 11/07/2021 02:22:40 - INFO - __main__ - Step 36531: {'lr': 0.0004357980422544794, 'samples': 7013952, 'steps': 36530, 'loss/train': 1.5813374519348145} 11/07/2021 02:22:40 - INFO - __main__ - Step 36532: {'lr': 0.00043579449158660815, 'samples': 7014144, 'steps': 36531, 'loss/train': 1.3674315214157104} 11/07/2021 02:22:40 - INFO - __main__ - Step 36533: {'lr': 0.0004357909408350208, 'samples': 7014336, 'steps': 36532, 'loss/train': 1.3425029516220093} 11/07/2021 02:22:42 - INFO - __main__ - Step 36534: {'lr': 0.00043578738999971886, 'samples': 7014528, 'steps': 36533, 'loss/train': 1.2998322248458862} 11/07/2021 02:22:42 - INFO - __main__ - Step 36535: {'lr': 0.000435783839080704, 'samples': 7014720, 'steps': 36534, 'loss/train': 1.6563870906829834} 11/07/2021 02:22:42 - INFO - __main__ - Step 36536: {'lr': 0.00043578028807797774, 'samples': 7014912, 'steps': 36535, 'loss/train': 1.437389850616455} 11/07/2021 02:22:43 - INFO - __main__ - Step 36537: {'lr': 0.0004357767369915419, 'samples': 7015104, 'steps': 36536, 'loss/train': 1.6424156427383423} 11/07/2021 02:22:43 - INFO - __main__ - Step 36538: {'lr': 0.0004357731858213978, 'samples': 7015296, 'steps': 36537, 'loss/train': 1.26777982711792} 11/07/2021 02:22:44 - INFO - __main__ - Step 36539: {'lr': 0.0004357696345675472, 'samples': 7015488, 'steps': 36538, 'loss/train': 1.8215261697769165} 11/07/2021 02:22:44 - INFO - __main__ - Step 36540: {'lr': 0.00043576608322999167, 'samples': 7015680, 'steps': 36539, 'loss/train': 1.4973750114440918} 11/07/2021 02:22:45 - INFO - __main__ - Step 36541: {'lr': 0.0004357625318087328, 'samples': 7015872, 'steps': 36540, 'loss/train': 1.465362787246704} 11/07/2021 02:22:45 - INFO - __main__ - Step 36542: {'lr': 0.00043575898030377225, 'samples': 7016064, 'steps': 36541, 'loss/train': 1.8516111373901367} 11/07/2021 02:22:45 - INFO - __main__ - Step 36543: {'lr': 0.00043575542871511155, 'samples': 7016256, 'steps': 36542, 'loss/train': 1.786741852760315} 11/07/2021 02:22:46 - INFO - __main__ - Step 36544: {'lr': 0.00043575187704275234, 'samples': 7016448, 'steps': 36543, 'loss/train': 1.696166753768921} 11/07/2021 02:22:47 - INFO - __main__ - Step 36545: {'lr': 0.0004357483252866961, 'samples': 7016640, 'steps': 36544, 'loss/train': 1.6869455575942993} 11/07/2021 02:22:47 - INFO - __main__ - Step 36546: {'lr': 0.00043574477344694463, 'samples': 7016832, 'steps': 36545, 'loss/train': 1.7611186504364014} 11/07/2021 02:22:47 - INFO - __main__ - Step 36547: {'lr': 0.0004357412215234994, 'samples': 7017024, 'steps': 36546, 'loss/train': 1.2742689847946167} 11/07/2021 02:22:48 - INFO - __main__ - Step 36548: {'lr': 0.00043573766951636206, 'samples': 7017216, 'steps': 36547, 'loss/train': 0.5363247990608215} 11/07/2021 02:22:49 - INFO - __main__ - Step 36549: {'lr': 0.00043573411742553415, 'samples': 7017408, 'steps': 36548, 'loss/train': 1.056071162223816} 11/07/2021 02:22:49 - INFO - __main__ - Step 36550: {'lr': 0.0004357305652510174, 'samples': 7017600, 'steps': 36549, 'loss/train': 1.721631646156311} 11/07/2021 02:22:50 - INFO - __main__ - Step 36551: {'lr': 0.00043572701299281327, 'samples': 7017792, 'steps': 36550, 'loss/train': 1.5491282939910889} 11/07/2021 02:22:50 - INFO - __main__ - Step 36552: {'lr': 0.0004357234606509234, 'samples': 7017984, 'steps': 36551, 'loss/train': 1.3987603187561035} 11/07/2021 02:22:50 - INFO - __main__ - Step 36553: {'lr': 0.00043571990822534936, 'samples': 7018176, 'steps': 36552, 'loss/train': 0.201223224401474} 11/07/2021 02:22:51 - INFO - __main__ - Step 36554: {'lr': 0.00043571635571609287, 'samples': 7018368, 'steps': 36553, 'loss/train': 1.3155345916748047} 11/07/2021 02:22:51 - INFO - __main__ - Step 36555: {'lr': 0.00043571280312315543, 'samples': 7018560, 'steps': 36554, 'loss/train': 0.42579716444015503} 11/07/2021 02:22:52 - INFO - __main__ - Step 36556: {'lr': 0.0004357092504465386, 'samples': 7018752, 'steps': 36555, 'loss/train': 1.812836766242981} 11/07/2021 02:22:52 - INFO - __main__ - Step 36557: {'lr': 0.00043570569768624416, 'samples': 7018944, 'steps': 36556, 'loss/train': 1.2242940664291382} 11/07/2021 02:22:53 - INFO - __main__ - Step 36558: {'lr': 0.00043570214484227353, 'samples': 7019136, 'steps': 36557, 'loss/train': 1.6130986213684082} 11/07/2021 02:22:54 - INFO - __main__ - Step 36559: {'lr': 0.00043569859191462847, 'samples': 7019328, 'steps': 36558, 'loss/train': 1.3331043720245361} 11/07/2021 02:22:54 - INFO - __main__ - Step 36560: {'lr': 0.0004356950389033104, 'samples': 7019520, 'steps': 36559, 'loss/train': 1.8335299491882324} 11/07/2021 02:22:54 - INFO - __main__ - Step 36561: {'lr': 0.0004356914858083211, 'samples': 7019712, 'steps': 36560, 'loss/train': 1.64692223072052} 11/07/2021 02:22:55 - INFO - __main__ - Step 36562: {'lr': 0.00043568793262966195, 'samples': 7019904, 'steps': 36561, 'loss/train': 1.4393301010131836} 11/07/2021 02:22:55 - INFO - __main__ - Step 36563: {'lr': 0.00043568437936733473, 'samples': 7020096, 'steps': 36562, 'loss/train': 1.5390021800994873} 11/07/2021 02:22:56 - INFO - __main__ - Step 36564: {'lr': 0.0004356808260213411, 'samples': 7020288, 'steps': 36563, 'loss/train': 1.780922293663025} 11/07/2021 02:22:57 - INFO - __main__ - Step 36565: {'lr': 0.00043567727259168244, 'samples': 7020480, 'steps': 36564, 'loss/train': 1.7212803363800049} 11/07/2021 02:22:57 - INFO - __main__ - Step 36566: {'lr': 0.0004356737190783605, 'samples': 7020672, 'steps': 36565, 'loss/train': 2.4525327682495117} 11/07/2021 02:22:57 - INFO - __main__ - Step 36567: {'lr': 0.00043567016548137685, 'samples': 7020864, 'steps': 36566, 'loss/train': 1.0905752182006836} 11/07/2021 02:22:58 - INFO - __main__ - Step 36568: {'lr': 0.00043566661180073304, 'samples': 7021056, 'steps': 36567, 'loss/train': 0.7993988394737244} 11/07/2021 02:22:59 - INFO - __main__ - Step 36569: {'lr': 0.00043566305803643073, 'samples': 7021248, 'steps': 36568, 'loss/train': 1.5785763263702393} 11/07/2021 02:22:59 - INFO - __main__ - Step 36570: {'lr': 0.00043565950418847154, 'samples': 7021440, 'steps': 36569, 'loss/train': 1.1388972997665405} 11/07/2021 02:23:00 - INFO - __main__ - Step 36571: {'lr': 0.00043565595025685705, 'samples': 7021632, 'steps': 36570, 'loss/train': 1.6771471500396729} 11/07/2021 02:23:00 - INFO - __main__ - Step 36572: {'lr': 0.0004356523962415889, 'samples': 7021824, 'steps': 36571, 'loss/train': 1.3315123319625854} 11/07/2021 02:23:00 - INFO - __main__ - Step 36573: {'lr': 0.00043564884214266855, 'samples': 7022016, 'steps': 36572, 'loss/train': 1.4760372638702393} 11/07/2021 02:23:01 - INFO - __main__ - Step 36574: {'lr': 0.00043564528796009774, 'samples': 7022208, 'steps': 36573, 'loss/train': 1.5408062934875488} 11/07/2021 02:23:02 - INFO - __main__ - Step 36575: {'lr': 0.00043564173369387807, 'samples': 7022400, 'steps': 36574, 'loss/train': 1.4901825189590454} 11/07/2021 02:23:02 - INFO - __main__ - Step 36576: {'lr': 0.00043563817934401107, 'samples': 7022592, 'steps': 36575, 'loss/train': 1.4674835205078125} 11/07/2021 02:23:03 - INFO - __main__ - Step 36577: {'lr': 0.0004356346249104983, 'samples': 7022784, 'steps': 36576, 'loss/train': 1.30613112449646} 11/07/2021 02:23:03 - INFO - __main__ - Step 36578: {'lr': 0.0004356310703933415, 'samples': 7022976, 'steps': 36577, 'loss/train': 0.35052648186683655} 11/07/2021 02:23:03 - INFO - __main__ - Step 36579: {'lr': 0.00043562751579254215, 'samples': 7023168, 'steps': 36578, 'loss/train': 1.4443248510360718} 11/07/2021 02:23:04 - INFO - __main__ - Step 36580: {'lr': 0.00043562396110810196, 'samples': 7023360, 'steps': 36579, 'loss/train': 2.2824246883392334} 11/07/2021 02:23:05 - INFO - __main__ - Step 36581: {'lr': 0.00043562040634002245, 'samples': 7023552, 'steps': 36580, 'loss/train': 1.9743016958236694} 11/07/2021 02:23:05 - INFO - __main__ - Step 36582: {'lr': 0.0004356168514883053, 'samples': 7023744, 'steps': 36581, 'loss/train': 1.9472196102142334} 11/07/2021 02:23:06 - INFO - __main__ - Step 36583: {'lr': 0.000435613296552952, 'samples': 7023936, 'steps': 36582, 'loss/train': 1.6587493419647217} 11/07/2021 02:23:06 - INFO - __main__ - Step 36584: {'lr': 0.0004356097415339643, 'samples': 7024128, 'steps': 36583, 'loss/train': 1.0660061836242676} 11/07/2021 02:23:07 - INFO - __main__ - Step 36585: {'lr': 0.0004356061864313436, 'samples': 7024320, 'steps': 36584, 'loss/train': 1.33091402053833} 11/07/2021 02:23:07 - INFO - __main__ - Step 36586: {'lr': 0.0004356026312450917, 'samples': 7024512, 'steps': 36585, 'loss/train': 1.5652490854263306} 11/07/2021 02:23:08 - INFO - __main__ - Step 36587: {'lr': 0.00043559907597521007, 'samples': 7024704, 'steps': 36586, 'loss/train': 1.6647975444793701} 11/07/2021 02:23:08 - INFO - __main__ - Step 36588: {'lr': 0.00043559552062170037, 'samples': 7024896, 'steps': 36587, 'loss/train': 1.09634268283844} 11/07/2021 02:23:08 - INFO - __main__ - Step 36589: {'lr': 0.00043559196518456425, 'samples': 7025088, 'steps': 36588, 'loss/train': 0.9539096355438232} 11/07/2021 02:23:10 - INFO - __main__ - Step 36590: {'lr': 0.0004355884096638032, 'samples': 7025280, 'steps': 36589, 'loss/train': 0.8309692740440369} 11/07/2021 02:23:10 - INFO - __main__ - Step 36591: {'lr': 0.0004355848540594188, 'samples': 7025472, 'steps': 36590, 'loss/train': 0.5594035387039185} 11/07/2021 02:23:10 - INFO - __main__ - Step 36592: {'lr': 0.00043558129837141285, 'samples': 7025664, 'steps': 36591, 'loss/train': 1.1925039291381836} 11/07/2021 02:23:11 - INFO - __main__ - Step 36593: {'lr': 0.0004355777425997868, 'samples': 7025856, 'steps': 36592, 'loss/train': 1.5464204549789429} 11/07/2021 02:23:11 - INFO - __main__ - Step 36594: {'lr': 0.0004355741867445423, 'samples': 7026048, 'steps': 36593, 'loss/train': 1.5897024869918823} 11/07/2021 02:23:12 - INFO - __main__ - Step 36595: {'lr': 0.00043557063080568094, 'samples': 7026240, 'steps': 36594, 'loss/train': 0.15254853665828705} 11/07/2021 02:23:12 - INFO - __main__ - Step 36596: {'lr': 0.00043556707478320425, 'samples': 7026432, 'steps': 36595, 'loss/train': 1.6399388313293457} 11/07/2021 02:23:13 - INFO - __main__ - Step 36597: {'lr': 0.000435563518677114, 'samples': 7026624, 'steps': 36596, 'loss/train': 1.5131480693817139} 11/07/2021 02:23:13 - INFO - __main__ - Step 36598: {'lr': 0.00043555996248741157, 'samples': 7026816, 'steps': 36597, 'loss/train': 1.5443997383117676} 11/07/2021 02:23:13 - INFO - __main__ - Step 36599: {'lr': 0.00043555640621409874, 'samples': 7027008, 'steps': 36598, 'loss/train': 1.3245813846588135} 11/07/2021 02:23:14 - INFO - __main__ - Step 36600: {'lr': 0.000435552849857177, 'samples': 7027200, 'steps': 36599, 'loss/train': 1.3773012161254883} 11/07/2021 02:23:15 - INFO - __main__ - Step 36601: {'lr': 0.0004355492934166481, 'samples': 7027392, 'steps': 36600, 'loss/train': 1.1193139553070068} 11/07/2021 02:23:15 - INFO - __main__ - Step 36602: {'lr': 0.00043554573689251355, 'samples': 7027584, 'steps': 36601, 'loss/train': 1.734878420829773} 11/07/2021 02:23:16 - INFO - __main__ - Step 36603: {'lr': 0.00043554218028477493, 'samples': 7027776, 'steps': 36602, 'loss/train': 1.6818993091583252} 11/07/2021 02:23:16 - INFO - __main__ - Step 36604: {'lr': 0.0004355386235934339, 'samples': 7027968, 'steps': 36603, 'loss/train': 1.6259936094284058} 11/07/2021 02:23:16 - INFO - __main__ - Step 36605: {'lr': 0.0004355350668184919, 'samples': 7028160, 'steps': 36604, 'loss/train': 1.6531397104263306} 11/07/2021 02:23:17 - INFO - __main__ - Step 36606: {'lr': 0.0004355315099599508, 'samples': 7028352, 'steps': 36605, 'loss/train': 1.665704369544983} 11/07/2021 02:23:18 - INFO - __main__ - Step 36607: {'lr': 0.000435527953017812, 'samples': 7028544, 'steps': 36606, 'loss/train': 2.162864923477173} 11/07/2021 02:23:18 - INFO - __main__ - Step 36608: {'lr': 0.00043552439599207714, 'samples': 7028736, 'steps': 36607, 'loss/train': 1.3255237340927124} 11/07/2021 02:23:18 - INFO - __main__ - Step 36609: {'lr': 0.00043552083888274794, 'samples': 7028928, 'steps': 36608, 'loss/train': 2.0128681659698486} 11/07/2021 02:23:19 - INFO - __main__ - Step 36610: {'lr': 0.00043551728168982583, 'samples': 7029120, 'steps': 36609, 'loss/train': 1.391836404800415} 11/07/2021 02:23:20 - INFO - __main__ - Step 36611: {'lr': 0.0004355137244133126, 'samples': 7029312, 'steps': 36610, 'loss/train': 0.7241917252540588} 11/07/2021 02:23:20 - INFO - __main__ - Step 36612: {'lr': 0.00043551016705320965, 'samples': 7029504, 'steps': 36611, 'loss/train': 1.4233378171920776} 11/07/2021 02:23:21 - INFO - __main__ - Step 36613: {'lr': 0.00043550660960951874, 'samples': 7029696, 'steps': 36612, 'loss/train': 1.660108208656311} 11/07/2021 02:23:21 - INFO - __main__ - Step 36614: {'lr': 0.0004355030520822414, 'samples': 7029888, 'steps': 36613, 'loss/train': 1.3485804796218872} 11/07/2021 02:23:21 - INFO - __main__ - Step 36615: {'lr': 0.00043549949447137915, 'samples': 7030080, 'steps': 36614, 'loss/train': 1.6928882598876953} 11/07/2021 02:23:22 - INFO - __main__ - Step 36616: {'lr': 0.00043549593677693385, 'samples': 7030272, 'steps': 36615, 'loss/train': 0.6086564064025879} 11/07/2021 02:23:23 - INFO - __main__ - Step 36617: {'lr': 0.0004354923789989068, 'samples': 7030464, 'steps': 36616, 'loss/train': 1.5144529342651367} 11/07/2021 02:23:23 - INFO - __main__ - Step 36618: {'lr': 0.0004354888211372998, 'samples': 7030656, 'steps': 36617, 'loss/train': 1.6198910474777222} 11/07/2021 02:23:24 - INFO - __main__ - Step 36619: {'lr': 0.0004354852631921145, 'samples': 7030848, 'steps': 36618, 'loss/train': 1.5385793447494507} 11/07/2021 02:23:24 - INFO - __main__ - Step 36620: {'lr': 0.0004354817051633523, 'samples': 7031040, 'steps': 36619, 'loss/train': 1.7353585958480835} 11/07/2021 02:23:24 - INFO - __main__ - Step 36621: {'lr': 0.00043547814705101486, 'samples': 7031232, 'steps': 36620, 'loss/train': 1.6816554069519043} 11/07/2021 02:23:25 - INFO - __main__ - Step 36622: {'lr': 0.00043547458885510393, 'samples': 7031424, 'steps': 36621, 'loss/train': 1.4382396936416626} 11/07/2021 02:23:26 - INFO - __main__ - Step 36623: {'lr': 0.00043547103057562097, 'samples': 7031616, 'steps': 36622, 'loss/train': 1.4511727094650269} 11/07/2021 02:23:26 - INFO - __main__ - Step 36624: {'lr': 0.00043546747221256764, 'samples': 7031808, 'steps': 36623, 'loss/train': 1.165474772453308} 11/07/2021 02:23:26 - INFO - __main__ - Step 36625: {'lr': 0.00043546391376594553, 'samples': 7032000, 'steps': 36624, 'loss/train': 0.9305865168571472} 11/07/2021 02:23:27 - INFO - __main__ - Step 36626: {'lr': 0.0004354603552357562, 'samples': 7032192, 'steps': 36625, 'loss/train': 1.2832889556884766} 11/07/2021 02:23:28 - INFO - __main__ - Step 36627: {'lr': 0.0004354567966220013, 'samples': 7032384, 'steps': 36626, 'loss/train': 0.542655348777771} 11/07/2021 02:23:28 - INFO - __main__ - Step 36628: {'lr': 0.0004354532379246825, 'samples': 7032576, 'steps': 36627, 'loss/train': 0.9858495593070984} 11/07/2021 02:23:28 - INFO - __main__ - Step 36629: {'lr': 0.0004354496791438013, 'samples': 7032768, 'steps': 36628, 'loss/train': 1.6054434776306152} 11/07/2021 02:23:29 - INFO - __main__ - Step 36630: {'lr': 0.0004354461202793593, 'samples': 7032960, 'steps': 36629, 'loss/train': 1.5780274868011475} 11/07/2021 02:23:29 - INFO - __main__ - Step 36631: {'lr': 0.00043544256133135815, 'samples': 7033152, 'steps': 36630, 'loss/train': 1.2616537809371948} 11/07/2021 02:23:30 - INFO - __main__ - Step 36632: {'lr': 0.0004354390022997995, 'samples': 7033344, 'steps': 36631, 'loss/train': 1.5695288181304932} 11/07/2021 02:23:31 - INFO - __main__ - Step 36633: {'lr': 0.0004354354431846848, 'samples': 7033536, 'steps': 36632, 'loss/train': 0.17641419172286987} 11/07/2021 02:23:31 - INFO - __main__ - Step 36634: {'lr': 0.00043543188398601586, 'samples': 7033728, 'steps': 36633, 'loss/train': 1.5314275026321411} 11/07/2021 02:23:31 - INFO - __main__ - Step 36635: {'lr': 0.00043542832470379415, 'samples': 7033920, 'steps': 36634, 'loss/train': 0.9428682923316956} 11/07/2021 02:23:32 - INFO - __main__ - Step 36636: {'lr': 0.0004354247653380212, 'samples': 7034112, 'steps': 36635, 'loss/train': 1.2503483295440674} 11/07/2021 02:23:33 - INFO - __main__ - Step 36637: {'lr': 0.00043542120588869885, 'samples': 7034304, 'steps': 36636, 'loss/train': 1.3206701278686523} 11/07/2021 02:23:33 - INFO - __main__ - Step 36638: {'lr': 0.0004354176463558284, 'samples': 7034496, 'steps': 36637, 'loss/train': 1.1053410768508911} 11/07/2021 02:23:33 - INFO - __main__ - Step 36639: {'lr': 0.00043541408673941173, 'samples': 7034688, 'steps': 36638, 'loss/train': 1.5448483228683472} 11/07/2021 02:23:34 - INFO - __main__ - Step 36640: {'lr': 0.00043541052703945034, 'samples': 7034880, 'steps': 36639, 'loss/train': 1.4707204103469849} 11/07/2021 02:23:34 - INFO - __main__ - Step 36641: {'lr': 0.0004354069672559458, 'samples': 7035072, 'steps': 36640, 'loss/train': 0.9098020195960999} 11/07/2021 02:23:35 - INFO - __main__ - Step 36642: {'lr': 0.0004354034073888997, 'samples': 7035264, 'steps': 36641, 'loss/train': 1.4666318893432617} 11/07/2021 02:23:35 - INFO - __main__ - Step 36643: {'lr': 0.00043539984743831375, 'samples': 7035456, 'steps': 36642, 'loss/train': 0.9557470083236694} 11/07/2021 02:23:36 - INFO - __main__ - Step 36644: {'lr': 0.0004353962874041895, 'samples': 7035648, 'steps': 36643, 'loss/train': 1.6082971096038818} 11/07/2021 02:23:36 - INFO - __main__ - Step 36645: {'lr': 0.0004353927272865285, 'samples': 7035840, 'steps': 36644, 'loss/train': 1.5697284936904907} 11/07/2021 02:23:36 - INFO - __main__ - Step 36646: {'lr': 0.0004353891670853324, 'samples': 7036032, 'steps': 36645, 'loss/train': 1.5849113464355469} 11/07/2021 02:23:37 - INFO - __main__ - Step 36647: {'lr': 0.00043538560680060287, 'samples': 7036224, 'steps': 36646, 'loss/train': 1.685613751411438} 11/07/2021 02:23:38 - INFO - __main__ - Step 36648: {'lr': 0.00043538204643234137, 'samples': 7036416, 'steps': 36647, 'loss/train': 2.1813371181488037} 11/07/2021 02:23:38 - INFO - __main__ - Step 36649: {'lr': 0.0004353784859805496, 'samples': 7036608, 'steps': 36648, 'loss/train': 1.4566642045974731} 11/07/2021 02:23:38 - INFO - __main__ - Step 36650: {'lr': 0.00043537492544522917, 'samples': 7036800, 'steps': 36649, 'loss/train': 1.6880543231964111} 11/07/2021 02:23:39 - INFO - __main__ - Step 36651: {'lr': 0.0004353713648263816, 'samples': 7036992, 'steps': 36650, 'loss/train': 1.4583431482315063} 11/07/2021 02:23:40 - INFO - __main__ - Step 36652: {'lr': 0.00043536780412400857, 'samples': 7037184, 'steps': 36651, 'loss/train': 1.5300203561782837} 11/07/2021 02:23:40 - INFO - __main__ - Step 36653: {'lr': 0.0004353642433381117, 'samples': 7037376, 'steps': 36652, 'loss/train': 1.5487596988677979} 11/07/2021 02:23:41 - INFO - __main__ - Step 36654: {'lr': 0.00043536068246869254, 'samples': 7037568, 'steps': 36653, 'loss/train': 5.736063480377197} 11/07/2021 02:23:41 - INFO - __main__ - Step 36655: {'lr': 0.00043535712151575274, 'samples': 7037760, 'steps': 36654, 'loss/train': 1.9262999296188354} 11/07/2021 02:23:41 - INFO - __main__ - Step 36656: {'lr': 0.00043535356047929387, 'samples': 7037952, 'steps': 36655, 'loss/train': 1.262406826019287} 11/07/2021 02:23:42 - INFO - __main__ - Step 36657: {'lr': 0.0004353499993593176, 'samples': 7038144, 'steps': 36656, 'loss/train': 1.1855562925338745} 11/07/2021 02:23:43 - INFO - __main__ - Step 36658: {'lr': 0.0004353464381558254, 'samples': 7038336, 'steps': 36657, 'loss/train': 1.3844404220581055} 11/07/2021 02:23:43 - INFO - __main__ - Step 36659: {'lr': 0.00043534287686881895, 'samples': 7038528, 'steps': 36658, 'loss/train': 1.4972984790802002} 11/07/2021 02:23:43 - INFO - __main__ - Step 36660: {'lr': 0.00043533931549829993, 'samples': 7038720, 'steps': 36659, 'loss/train': 1.7652618885040283} 11/07/2021 02:23:44 - INFO - __main__ - Step 36661: {'lr': 0.00043533575404426986, 'samples': 7038912, 'steps': 36660, 'loss/train': 1.4423415660858154} 11/07/2021 02:23:44 - INFO - __main__ - Step 36662: {'lr': 0.0004353321925067303, 'samples': 7039104, 'steps': 36661, 'loss/train': 1.2279847860336304} 11/07/2021 02:23:45 - INFO - __main__ - Step 36663: {'lr': 0.0004353286308856829, 'samples': 7039296, 'steps': 36662, 'loss/train': 1.6344609260559082} 11/07/2021 02:23:45 - INFO - __main__ - Step 36664: {'lr': 0.00043532506918112933, 'samples': 7039488, 'steps': 36663, 'loss/train': 1.8688920736312866} 11/07/2021 02:23:46 - INFO - __main__ - Step 36665: {'lr': 0.0004353215073930712, 'samples': 7039680, 'steps': 36664, 'loss/train': 1.543150544166565} 11/07/2021 02:23:46 - INFO - __main__ - Step 36666: {'lr': 0.00043531794552150994, 'samples': 7039872, 'steps': 36665, 'loss/train': 1.6412619352340698} 11/07/2021 02:23:47 - INFO - __main__ - Step 36667: {'lr': 0.0004353143835664474, 'samples': 7040064, 'steps': 36666, 'loss/train': 1.6174161434173584} 11/07/2021 02:23:48 - INFO - __main__ - Step 36668: {'lr': 0.00043531082152788495, 'samples': 7040256, 'steps': 36667, 'loss/train': 0.18082042038440704} 11/07/2021 02:23:48 - INFO - __main__ - Step 36669: {'lr': 0.0004353072594058243, 'samples': 7040448, 'steps': 36668, 'loss/train': 1.6925253868103027} 11/07/2021 02:23:48 - INFO - __main__ - Step 36670: {'lr': 0.0004353036972002671, 'samples': 7040640, 'steps': 36669, 'loss/train': 2.6391284465789795} 11/07/2021 02:23:49 - INFO - __main__ - Step 36671: {'lr': 0.00043530013491121497, 'samples': 7040832, 'steps': 36670, 'loss/train': 1.4973509311676025} 11/07/2021 02:23:49 - INFO - __main__ - Step 36672: {'lr': 0.00043529657253866936, 'samples': 7041024, 'steps': 36671, 'loss/train': 1.5227664709091187} 11/07/2021 02:23:50 - INFO - __main__ - Step 36673: {'lr': 0.000435293010082632, 'samples': 7041216, 'steps': 36672, 'loss/train': 1.4857127666473389} 11/07/2021 02:23:50 - INFO - __main__ - Step 36674: {'lr': 0.0004352894475431045, 'samples': 7041408, 'steps': 36673, 'loss/train': 1.3619614839553833} 11/07/2021 02:23:51 - INFO - __main__ - Step 36675: {'lr': 0.0004352858849200885, 'samples': 7041600, 'steps': 36674, 'loss/train': 1.106205940246582} 11/07/2021 02:23:51 - INFO - __main__ - Step 36676: {'lr': 0.0004352823222135854, 'samples': 7041792, 'steps': 36675, 'loss/train': 0.8346518278121948} 11/07/2021 02:23:51 - INFO - __main__ - Step 36677: {'lr': 0.00043527875942359697, 'samples': 7041984, 'steps': 36676, 'loss/train': 1.895818829536438} 11/07/2021 02:23:52 - INFO - __main__ - Step 36678: {'lr': 0.0004352751965501248, 'samples': 7042176, 'steps': 36677, 'loss/train': 1.3899580240249634} 11/07/2021 02:23:53 - INFO - __main__ - Step 36679: {'lr': 0.0004352716335931706, 'samples': 7042368, 'steps': 36678, 'loss/train': 1.309022307395935} 11/07/2021 02:23:53 - INFO - __main__ - Step 36680: {'lr': 0.0004352680705527357, 'samples': 7042560, 'steps': 36679, 'loss/train': 1.7067924737930298} 11/07/2021 02:23:53 - INFO - __main__ - Step 36681: {'lr': 0.00043526450742882193, 'samples': 7042752, 'steps': 36680, 'loss/train': 1.3617024421691895} 11/07/2021 02:23:54 - INFO - __main__ - Step 36682: {'lr': 0.0004352609442214309, 'samples': 7042944, 'steps': 36681, 'loss/train': 1.5160088539123535} 11/07/2021 02:23:54 - INFO - __main__ - Step 36683: {'lr': 0.00043525738093056404, 'samples': 7043136, 'steps': 36682, 'loss/train': 1.2032099962234497} 11/07/2021 02:23:55 - INFO - __main__ - Step 36684: {'lr': 0.0004352538175562231, 'samples': 7043328, 'steps': 36683, 'loss/train': 1.4337596893310547} 11/07/2021 02:23:56 - INFO - __main__ - Step 36685: {'lr': 0.00043525025409840967, 'samples': 7043520, 'steps': 36684, 'loss/train': 1.725263237953186} 11/07/2021 02:23:56 - INFO - __main__ - Step 36686: {'lr': 0.00043524669055712534, 'samples': 7043712, 'steps': 36685, 'loss/train': 1.6312520503997803} 11/07/2021 02:23:56 - INFO - __main__ - Step 36687: {'lr': 0.00043524312693237166, 'samples': 7043904, 'steps': 36686, 'loss/train': 1.4041990041732788} 11/07/2021 02:23:57 - INFO - __main__ - Step 36688: {'lr': 0.0004352395632241504, 'samples': 7044096, 'steps': 36687, 'loss/train': 1.4240286350250244} 11/07/2021 02:23:58 - INFO - __main__ - Step 36689: {'lr': 0.00043523599943246297, 'samples': 7044288, 'steps': 36688, 'loss/train': 1.589342474937439} 11/07/2021 02:23:58 - INFO - __main__ - Step 36690: {'lr': 0.00043523243555731094, 'samples': 7044480, 'steps': 36689, 'loss/train': 1.670009732246399} 11/07/2021 02:23:58 - INFO - __main__ - Step 36691: {'lr': 0.00043522887159869617, 'samples': 7044672, 'steps': 36690, 'loss/train': 1.1319444179534912} 11/07/2021 02:23:59 - INFO - __main__ - Step 36692: {'lr': 0.00043522530755662017, 'samples': 7044864, 'steps': 36691, 'loss/train': 1.1176179647445679} 11/07/2021 02:23:59 - INFO - __main__ - Step 36693: {'lr': 0.00043522174343108445, 'samples': 7045056, 'steps': 36692, 'loss/train': 1.937666416168213} 11/07/2021 02:24:00 - INFO - __main__ - Step 36694: {'lr': 0.00043521817922209064, 'samples': 7045248, 'steps': 36693, 'loss/train': 1.047343134880066} 11/07/2021 02:24:00 - INFO - __main__ - Step 36695: {'lr': 0.00043521461492964037, 'samples': 7045440, 'steps': 36694, 'loss/train': 1.0359920263290405} 11/07/2021 02:24:01 - INFO - __main__ - Step 36696: {'lr': 0.00043521105055373526, 'samples': 7045632, 'steps': 36695, 'loss/train': 1.6470108032226562} 11/07/2021 02:24:01 - INFO - __main__ - Step 36697: {'lr': 0.000435207486094377, 'samples': 7045824, 'steps': 36696, 'loss/train': 1.8380084037780762} 11/07/2021 02:24:02 - INFO - __main__ - Step 36698: {'lr': 0.00043520392155156694, 'samples': 7046016, 'steps': 36697, 'loss/train': 1.2285186052322388} 11/07/2021 02:24:02 - INFO - __main__ - Step 36699: {'lr': 0.000435200356925307, 'samples': 7046208, 'steps': 36698, 'loss/train': 1.6174869537353516} 11/07/2021 02:24:04 - INFO - __main__ - Step 36700: {'lr': 0.0004351967922155986, 'samples': 7046400, 'steps': 36699, 'loss/train': 1.3436497449874878} 11/07/2021 02:24:04 - INFO - __main__ - Step 36701: {'lr': 0.0004351932274224434, 'samples': 7046592, 'steps': 36700, 'loss/train': 1.2693761587142944} 11/07/2021 02:24:04 - INFO - __main__ - Step 36702: {'lr': 0.0004351896625458429, 'samples': 7046784, 'steps': 36701, 'loss/train': 1.9657156467437744} 11/07/2021 02:24:05 - INFO - __main__ - Step 36703: {'lr': 0.0004351860975857989, 'samples': 7046976, 'steps': 36702, 'loss/train': 1.970827579498291} 11/07/2021 02:24:05 - INFO - __main__ - Step 36704: {'lr': 0.00043518253254231276, 'samples': 7047168, 'steps': 36703, 'loss/train': 2.3808977603912354} 11/07/2021 02:24:06 - INFO - __main__ - Step 36705: {'lr': 0.00043517896741538634, 'samples': 7047360, 'steps': 36704, 'loss/train': 1.3965660333633423} 11/07/2021 02:24:07 - INFO - __main__ - Step 36706: {'lr': 0.0004351754022050212, 'samples': 7047552, 'steps': 36705, 'loss/train': 1.7651844024658203} 11/07/2021 02:24:07 - INFO - __main__ - Step 36707: {'lr': 0.00043517183691121875, 'samples': 7047744, 'steps': 36706, 'loss/train': 1.3832669258117676} 11/07/2021 02:24:07 - INFO - __main__ - Step 36708: {'lr': 0.00043516827153398073, 'samples': 7047936, 'steps': 36707, 'loss/train': 1.2296851873397827} 11/07/2021 02:24:08 - INFO - __main__ - Step 36709: {'lr': 0.0004351647060733088, 'samples': 7048128, 'steps': 36708, 'loss/train': 1.5138906240463257} 11/07/2021 02:24:08 - INFO - __main__ - Step 36710: {'lr': 0.00043516114052920453, 'samples': 7048320, 'steps': 36709, 'loss/train': 1.4793736934661865} 11/07/2021 02:24:09 - INFO - __main__ - Step 36711: {'lr': 0.00043515757490166944, 'samples': 7048512, 'steps': 36710, 'loss/train': 1.8552041053771973} 11/07/2021 02:24:09 - INFO - __main__ - Step 36712: {'lr': 0.00043515400919070526, 'samples': 7048704, 'steps': 36711, 'loss/train': 1.8112801313400269} 11/07/2021 02:24:10 - INFO - __main__ - Step 36713: {'lr': 0.0004351504433963135, 'samples': 7048896, 'steps': 36712, 'loss/train': 1.026114583015442} 11/07/2021 02:24:10 - INFO - __main__ - Step 36714: {'lr': 0.0004351468775184959, 'samples': 7049088, 'steps': 36713, 'loss/train': 1.2794389724731445} 11/07/2021 02:24:11 - INFO - __main__ - Step 36715: {'lr': 0.0004351433115572538, 'samples': 7049280, 'steps': 36714, 'loss/train': 1.2215988636016846} 11/07/2021 02:24:11 - INFO - __main__ - Step 36716: {'lr': 0.00043513974551258913, 'samples': 7049472, 'steps': 36715, 'loss/train': 1.294337511062622} 11/07/2021 02:24:12 - INFO - __main__ - Step 36717: {'lr': 0.00043513617938450327, 'samples': 7049664, 'steps': 36716, 'loss/train': 0.6976569294929504} 11/07/2021 02:24:12 - INFO - __main__ - Step 36718: {'lr': 0.00043513261317299797, 'samples': 7049856, 'steps': 36717, 'loss/train': 1.472899317741394} 11/07/2021 02:24:13 - INFO - __main__ - Step 36719: {'lr': 0.00043512904687807475, 'samples': 7050048, 'steps': 36718, 'loss/train': 1.4063820838928223} 11/07/2021 02:24:13 - INFO - __main__ - Step 36720: {'lr': 0.00043512548049973523, 'samples': 7050240, 'steps': 36719, 'loss/train': 1.2921974658966064} 11/07/2021 02:24:13 - INFO - __main__ - Step 36721: {'lr': 0.00043512191403798095, 'samples': 7050432, 'steps': 36720, 'loss/train': 1.3332635164260864} 11/07/2021 02:24:14 - INFO - __main__ - Step 36722: {'lr': 0.0004351183474928137, 'samples': 7050624, 'steps': 36721, 'loss/train': 0.9220923185348511} 11/07/2021 02:24:15 - INFO - __main__ - Step 36723: {'lr': 0.00043511478086423493, 'samples': 7050816, 'steps': 36722, 'loss/train': 1.4664701223373413} 11/07/2021 02:24:15 - INFO - __main__ - Step 36724: {'lr': 0.0004351112141522463, 'samples': 7051008, 'steps': 36723, 'loss/train': 1.647239327430725} 11/07/2021 02:24:15 - INFO - __main__ - Step 36725: {'lr': 0.00043510764735684945, 'samples': 7051200, 'steps': 36724, 'loss/train': 1.5401238203048706} 11/07/2021 02:24:16 - INFO - __main__ - Step 36726: {'lr': 0.0004351040804780459, 'samples': 7051392, 'steps': 36725, 'loss/train': 1.699589490890503} 11/07/2021 02:24:17 - INFO - __main__ - Step 36727: {'lr': 0.00043510051351583733, 'samples': 7051584, 'steps': 36726, 'loss/train': 1.6409080028533936} 11/07/2021 02:24:17 - INFO - __main__ - Step 36728: {'lr': 0.0004350969464702254, 'samples': 7051776, 'steps': 36727, 'loss/train': 1.6988903284072876} 11/07/2021 02:24:17 - INFO - __main__ - Step 36729: {'lr': 0.0004350933793412115, 'samples': 7051968, 'steps': 36728, 'loss/train': 1.8149951696395874} 11/07/2021 02:24:18 - INFO - __main__ - Step 36730: {'lr': 0.00043508981212879737, 'samples': 7052160, 'steps': 36729, 'loss/train': 1.5463894605636597} 11/07/2021 02:24:18 - INFO - __main__ - Step 36731: {'lr': 0.0004350862448329848, 'samples': 7052352, 'steps': 36730, 'loss/train': 1.5224485397338867} 11/07/2021 02:24:19 - INFO - __main__ - Step 36732: {'lr': 0.00043508267745377504, 'samples': 7052544, 'steps': 36731, 'loss/train': 1.1639156341552734} 11/07/2021 02:24:19 - INFO - __main__ - Step 36733: {'lr': 0.00043507910999117003, 'samples': 7052736, 'steps': 36732, 'loss/train': 1.5083235502243042} 11/07/2021 02:24:20 - INFO - __main__ - Step 36734: {'lr': 0.00043507554244517113, 'samples': 7052928, 'steps': 36733, 'loss/train': 1.9087817668914795} 11/07/2021 02:24:20 - INFO - __main__ - Step 36735: {'lr': 0.0004350719748157801, 'samples': 7053120, 'steps': 36734, 'loss/train': 1.727805733680725} 11/07/2021 02:24:21 - INFO - __main__ - Step 36736: {'lr': 0.00043506840710299844, 'samples': 7053312, 'steps': 36735, 'loss/train': 1.4072787761688232} 11/07/2021 02:24:21 - INFO - __main__ - Step 36737: {'lr': 0.00043506483930682785, 'samples': 7053504, 'steps': 36736, 'loss/train': 1.6502940654754639} 11/07/2021 02:24:22 - INFO - __main__ - Step 36738: {'lr': 0.0004350612714272699, 'samples': 7053696, 'steps': 36737, 'loss/train': 1.123324990272522} 11/07/2021 02:24:22 - INFO - __main__ - Step 36739: {'lr': 0.0004350577034643262, 'samples': 7053888, 'steps': 36738, 'loss/train': 1.5874708890914917} 11/07/2021 02:24:23 - INFO - __main__ - Step 36740: {'lr': 0.0004350541354179983, 'samples': 7054080, 'steps': 36739, 'loss/train': 1.4291845560073853} 11/07/2021 02:24:23 - INFO - __main__ - Step 36741: {'lr': 0.00043505056728828794, 'samples': 7054272, 'steps': 36740, 'loss/train': 1.6307123899459839} 11/07/2021 02:24:23 - INFO - __main__ - Step 36742: {'lr': 0.0004350469990751966, 'samples': 7054464, 'steps': 36741, 'loss/train': 1.233171820640564} 11/07/2021 02:24:24 - INFO - __main__ - Step 36743: {'lr': 0.000435043430778726, 'samples': 7054656, 'steps': 36742, 'loss/train': 1.7694103717803955} 11/07/2021 02:24:25 - INFO - __main__ - Step 36744: {'lr': 0.00043503986239887765, 'samples': 7054848, 'steps': 36743, 'loss/train': 1.4569706916809082} 11/07/2021 02:24:25 - INFO - __main__ - Step 36745: {'lr': 0.0004350362939356532, 'samples': 7055040, 'steps': 36744, 'loss/train': 1.8365846872329712} 11/07/2021 02:24:25 - INFO - __main__ - Step 36746: {'lr': 0.00043503272538905423, 'samples': 7055232, 'steps': 36745, 'loss/train': 1.5790698528289795} 11/07/2021 02:24:26 - INFO - __main__ - Step 36747: {'lr': 0.0004350291567590824, 'samples': 7055424, 'steps': 36746, 'loss/train': 1.2959239482879639} 11/07/2021 02:24:27 - INFO - __main__ - Step 36748: {'lr': 0.00043502558804573924, 'samples': 7055616, 'steps': 36747, 'loss/train': 1.3608461618423462} 11/07/2021 02:24:27 - INFO - __main__ - Step 36749: {'lr': 0.0004350220192490264, 'samples': 7055808, 'steps': 36748, 'loss/train': 1.5675292015075684} 11/07/2021 02:24:28 - INFO - __main__ - Step 36750: {'lr': 0.00043501845036894555, 'samples': 7056000, 'steps': 36749, 'loss/train': 1.1520994901657104} 11/07/2021 02:24:28 - INFO - __main__ - Step 36751: {'lr': 0.00043501488140549824, 'samples': 7056192, 'steps': 36750, 'loss/train': 1.3899540901184082} 11/07/2021 02:24:28 - INFO - __main__ - Step 36752: {'lr': 0.000435011312358686, 'samples': 7056384, 'steps': 36751, 'loss/train': 1.8039408922195435} 11/07/2021 02:24:29 - INFO - __main__ - Step 36753: {'lr': 0.0004350077432285106, 'samples': 7056576, 'steps': 36752, 'loss/train': 1.3066352605819702} 11/07/2021 02:24:30 - INFO - __main__ - Step 36754: {'lr': 0.0004350041740149735, 'samples': 7056768, 'steps': 36753, 'loss/train': 1.5671954154968262} 11/07/2021 02:24:30 - INFO - __main__ - Step 36755: {'lr': 0.00043500060471807645, 'samples': 7056960, 'steps': 36754, 'loss/train': 1.3846319913864136} 11/07/2021 02:24:30 - INFO - __main__ - Step 36756: {'lr': 0.000434997035337821, 'samples': 7057152, 'steps': 36755, 'loss/train': 1.269380807876587} 11/07/2021 02:24:31 - INFO - __main__ - Step 36757: {'lr': 0.0004349934658742086, 'samples': 7057344, 'steps': 36756, 'loss/train': 1.081022024154663} 11/07/2021 02:24:31 - INFO - __main__ - Step 36758: {'lr': 0.00043498989632724105, 'samples': 7057536, 'steps': 36757, 'loss/train': 1.3352800607681274} 11/07/2021 02:24:32 - INFO - __main__ - Step 36759: {'lr': 0.00043498632669692, 'samples': 7057728, 'steps': 36758, 'loss/train': 1.6395902633666992} 11/07/2021 02:24:32 - INFO - __main__ - Step 36760: {'lr': 0.0004349827569832469, 'samples': 7057920, 'steps': 36759, 'loss/train': 1.2451893091201782} 11/07/2021 02:24:33 - INFO - __main__ - Step 36761: {'lr': 0.00043497918718622344, 'samples': 7058112, 'steps': 36760, 'loss/train': 1.2479592561721802} 11/07/2021 02:24:33 - INFO - __main__ - Step 36762: {'lr': 0.0004349756173058512, 'samples': 7058304, 'steps': 36761, 'loss/train': 1.8825902938842773} 11/07/2021 02:24:33 - INFO - __main__ - Step 36763: {'lr': 0.0004349720473421318, 'samples': 7058496, 'steps': 36762, 'loss/train': 1.6168584823608398} 11/07/2021 02:24:35 - INFO - __main__ - Step 36764: {'lr': 0.00043496847729506685, 'samples': 7058688, 'steps': 36763, 'loss/train': 0.8299821019172668} 11/07/2021 02:24:35 - INFO - __main__ - Step 36765: {'lr': 0.000434964907164658, 'samples': 7058880, 'steps': 36764, 'loss/train': 1.609496831893921} 11/07/2021 02:24:35 - INFO - __main__ - Step 36766: {'lr': 0.0004349613369509067, 'samples': 7059072, 'steps': 36765, 'loss/train': 1.9354546070098877} 11/07/2021 02:24:36 - INFO - __main__ - Step 36767: {'lr': 0.0004349577666538148, 'samples': 7059264, 'steps': 36766, 'loss/train': 1.339677333831787} 11/07/2021 02:24:36 - INFO - __main__ - Step 36768: {'lr': 0.0004349541962733837, 'samples': 7059456, 'steps': 36767, 'loss/train': 1.247060775756836} 11/07/2021 02:24:37 - INFO - __main__ - Step 36769: {'lr': 0.0004349506258096152, 'samples': 7059648, 'steps': 36768, 'loss/train': 1.4814777374267578} 11/07/2021 02:24:37 - INFO - __main__ - Step 36770: {'lr': 0.00043494705526251064, 'samples': 7059840, 'steps': 36769, 'loss/train': 1.7929636240005493} 11/07/2021 02:24:38 - INFO - __main__ - Step 36771: {'lr': 0.00043494348463207197, 'samples': 7060032, 'steps': 36770, 'loss/train': 1.7439746856689453} 11/07/2021 02:24:38 - INFO - __main__ - Step 36772: {'lr': 0.0004349399139183005, 'samples': 7060224, 'steps': 36771, 'loss/train': 1.70767080783844} 11/07/2021 02:24:38 - INFO - __main__ - Step 36773: {'lr': 0.000434936343121198, 'samples': 7060416, 'steps': 36772, 'loss/train': 1.3965067863464355} 11/07/2021 02:24:40 - INFO - __main__ - Step 36774: {'lr': 0.000434932772240766, 'samples': 7060608, 'steps': 36773, 'loss/train': 1.1901990175247192} 11/07/2021 02:24:40 - INFO - __main__ - Step 36775: {'lr': 0.0004349292012770062, 'samples': 7060800, 'steps': 36774, 'loss/train': 1.4630550146102905} 11/07/2021 02:24:40 - INFO - __main__ - Step 36776: {'lr': 0.00043492563022992013, 'samples': 7060992, 'steps': 36775, 'loss/train': 1.6922273635864258} 11/07/2021 02:24:41 - INFO - __main__ - Step 36777: {'lr': 0.00043492205909950943, 'samples': 7061184, 'steps': 36776, 'loss/train': 1.1019692420959473} 11/07/2021 02:24:41 - INFO - __main__ - Step 36778: {'lr': 0.0004349184878857757, 'samples': 7061376, 'steps': 36777, 'loss/train': 1.772645354270935} 11/07/2021 02:24:41 - INFO - __main__ - Step 36779: {'lr': 0.0004349149165887205, 'samples': 7061568, 'steps': 36778, 'loss/train': 1.589666485786438} 11/07/2021 02:24:42 - INFO - __main__ - Step 36780: {'lr': 0.0004349113452083456, 'samples': 7061760, 'steps': 36779, 'loss/train': 0.10330658406019211} 11/07/2021 02:24:43 - INFO - __main__ - Step 36781: {'lr': 0.00043490777374465244, 'samples': 7061952, 'steps': 36780, 'loss/train': 1.489288091659546} 11/07/2021 02:24:43 - INFO - __main__ - Step 36782: {'lr': 0.0004349042021976427, 'samples': 7062144, 'steps': 36781, 'loss/train': 1.6141446828842163} 11/07/2021 02:24:43 - INFO - __main__ - Step 36783: {'lr': 0.000434900630567318, 'samples': 7062336, 'steps': 36782, 'loss/train': 1.4311100244522095} 11/07/2021 02:24:44 - INFO - __main__ - Step 36784: {'lr': 0.00043489705885367986, 'samples': 7062528, 'steps': 36783, 'loss/train': 1.6908950805664062} 11/07/2021 02:24:45 - INFO - __main__ - Step 36785: {'lr': 0.00043489348705673, 'samples': 7062720, 'steps': 36784, 'loss/train': 1.6144988536834717} 11/07/2021 02:24:46 - INFO - __main__ - Step 36786: {'lr': 0.00043488991517647, 'samples': 7062912, 'steps': 36785, 'loss/train': 1.3355114459991455} 11/07/2021 02:24:46 - INFO - __main__ - Step 36787: {'lr': 0.00043488634321290146, 'samples': 7063104, 'steps': 36786, 'loss/train': 0.8564496636390686} 11/07/2021 02:24:46 - INFO - __main__ - Step 36788: {'lr': 0.000434882771166026, 'samples': 7063296, 'steps': 36787, 'loss/train': 1.6882917881011963} 11/07/2021 02:24:47 - INFO - __main__ - Step 36789: {'lr': 0.00043487919903584515, 'samples': 7063488, 'steps': 36788, 'loss/train': 2.0521750450134277} 11/07/2021 02:24:47 - INFO - __main__ - Step 36790: {'lr': 0.00043487562682236066, 'samples': 7063680, 'steps': 36789, 'loss/train': 1.1685245037078857} 11/07/2021 02:24:48 - INFO - __main__ - Step 36791: {'lr': 0.000434872054525574, 'samples': 7063872, 'steps': 36790, 'loss/train': 0.06848043948411942} 11/07/2021 02:24:48 - INFO - __main__ - Step 36792: {'lr': 0.00043486848214548693, 'samples': 7064064, 'steps': 36791, 'loss/train': 1.5211457014083862} 11/07/2021 02:24:49 - INFO - __main__ - Step 36793: {'lr': 0.0004348649096821009, 'samples': 7064256, 'steps': 36792, 'loss/train': 1.2461212873458862} 11/07/2021 02:24:49 - INFO - __main__ - Step 36794: {'lr': 0.0004348613371354176, 'samples': 7064448, 'steps': 36793, 'loss/train': 1.6461915969848633} 11/07/2021 02:24:49 - INFO - __main__ - Step 36795: {'lr': 0.0004348577645054387, 'samples': 7064640, 'steps': 36794, 'loss/train': 0.5114266276359558} 11/07/2021 02:24:51 - INFO - __main__ - Step 36796: {'lr': 0.0004348541917921657, 'samples': 7064832, 'steps': 36795, 'loss/train': 1.4484294652938843} 11/07/2021 02:24:51 - INFO - __main__ - Step 36797: {'lr': 0.0004348506189956002, 'samples': 7065024, 'steps': 36796, 'loss/train': 0.8689265251159668} 11/07/2021 02:24:51 - INFO - __main__ - Step 36798: {'lr': 0.0004348470461157439, 'samples': 7065216, 'steps': 36797, 'loss/train': 1.5061328411102295} 11/07/2021 02:24:52 - INFO - __main__ - Step 36799: {'lr': 0.0004348434731525984, 'samples': 7065408, 'steps': 36798, 'loss/train': 1.167148232460022} 11/07/2021 02:24:52 - INFO - __main__ - Step 36800: {'lr': 0.00043483990010616524, 'samples': 7065600, 'steps': 36799, 'loss/train': 1.0397213697433472} 11/07/2021 02:24:53 - INFO - __main__ - Step 36801: {'lr': 0.00043483632697644616, 'samples': 7065792, 'steps': 36800, 'loss/train': 1.305525779724121} 11/07/2021 02:24:53 - INFO - __main__ - Step 36802: {'lr': 0.00043483275376344257, 'samples': 7065984, 'steps': 36801, 'loss/train': 1.4266273975372314} 11/07/2021 02:24:54 - INFO - __main__ - Step 36803: {'lr': 0.00043482918046715627, 'samples': 7066176, 'steps': 36802, 'loss/train': 1.41310453414917} 11/07/2021 02:24:54 - INFO - __main__ - Step 36804: {'lr': 0.00043482560708758876, 'samples': 7066368, 'steps': 36803, 'loss/train': 1.4844629764556885} 11/07/2021 02:24:55 - INFO - __main__ - Step 36805: {'lr': 0.0004348220336247417, 'samples': 7066560, 'steps': 36804, 'loss/train': 1.1278146505355835} 11/07/2021 02:24:56 - INFO - __main__ - Step 36806: {'lr': 0.0004348184600786167, 'samples': 7066752, 'steps': 36805, 'loss/train': 1.7667964696884155} 11/07/2021 02:24:56 - INFO - __main__ - Step 36807: {'lr': 0.0004348148864492153, 'samples': 7066944, 'steps': 36806, 'loss/train': 1.5919344425201416} 11/07/2021 02:24:56 - INFO - __main__ - Step 36808: {'lr': 0.00043481131273653926, 'samples': 7067136, 'steps': 36807, 'loss/train': 1.6064618825912476} 11/07/2021 02:24:57 - INFO - __main__ - Step 36809: {'lr': 0.00043480773894059, 'samples': 7067328, 'steps': 36808, 'loss/train': 1.606161117553711} 11/07/2021 02:24:57 - INFO - __main__ - Step 36810: {'lr': 0.0004348041650613692, 'samples': 7067520, 'steps': 36809, 'loss/train': 1.257421851158142} 11/07/2021 02:24:57 - INFO - __main__ - Step 36811: {'lr': 0.0004348005910988786, 'samples': 7067712, 'steps': 36810, 'loss/train': 0.9066978096961975} 11/07/2021 02:24:58 - INFO - __main__ - Step 36812: {'lr': 0.0004347970170531197, 'samples': 7067904, 'steps': 36811, 'loss/train': 0.178208589553833} 11/07/2021 02:24:59 - INFO - __main__ - Step 36813: {'lr': 0.000434793442924094, 'samples': 7068096, 'steps': 36812, 'loss/train': 0.929162859916687} 11/07/2021 02:24:59 - INFO - __main__ - Step 36814: {'lr': 0.0004347898687118033, 'samples': 7068288, 'steps': 36813, 'loss/train': 0.5680878162384033} 11/07/2021 02:25:00 - INFO - __main__ - Step 36815: {'lr': 0.0004347862944162492, 'samples': 7068480, 'steps': 36814, 'loss/train': 1.6141563653945923} 11/07/2021 02:25:00 - INFO - __main__ - Step 36816: {'lr': 0.00043478272003743315, 'samples': 7068672, 'steps': 36815, 'loss/train': 1.3449026346206665} 11/07/2021 02:25:01 - INFO - __main__ - Step 36817: {'lr': 0.0004347791455753569, 'samples': 7068864, 'steps': 36816, 'loss/train': 1.3908735513687134} 11/07/2021 02:25:01 - INFO - __main__ - Step 36818: {'lr': 0.00043477557103002197, 'samples': 7069056, 'steps': 36817, 'loss/train': 1.5589827299118042} 11/07/2021 02:25:02 - INFO - __main__ - Step 36819: {'lr': 0.00043477199640143004, 'samples': 7069248, 'steps': 36818, 'loss/train': 1.72110116481781} 11/07/2021 02:25:02 - INFO - __main__ - Step 36820: {'lr': 0.00043476842168958276, 'samples': 7069440, 'steps': 36819, 'loss/train': 1.308180332183838} 11/07/2021 02:25:02 - INFO - __main__ - Step 36821: {'lr': 0.0004347648468944816, 'samples': 7069632, 'steps': 36820, 'loss/train': 1.819994568824768} 11/07/2021 02:25:04 - INFO - __main__ - Step 36822: {'lr': 0.0004347612720161283, 'samples': 7069824, 'steps': 36821, 'loss/train': 1.3080558776855469} 11/07/2021 02:25:04 - INFO - __main__ - Step 36823: {'lr': 0.00043475769705452437, 'samples': 7070016, 'steps': 36822, 'loss/train': 1.2287598848342896} 11/07/2021 02:25:04 - INFO - __main__ - Step 36824: {'lr': 0.00043475412200967155, 'samples': 7070208, 'steps': 36823, 'loss/train': 1.4287643432617188} 11/07/2021 02:25:05 - INFO - __main__ - Step 36825: {'lr': 0.00043475054688157136, 'samples': 7070400, 'steps': 36824, 'loss/train': 1.386857509613037} 11/07/2021 02:25:05 - INFO - __main__ - Step 36826: {'lr': 0.00043474697167022536, 'samples': 7070592, 'steps': 36825, 'loss/train': 1.217334270477295} 11/07/2021 02:25:06 - INFO - __main__ - Step 36827: {'lr': 0.0004347433963756353, 'samples': 7070784, 'steps': 36826, 'loss/train': 1.668638825416565} 11/07/2021 02:25:06 - INFO - __main__ - Step 36828: {'lr': 0.0004347398209978027, 'samples': 7070976, 'steps': 36827, 'loss/train': 1.6760013103485107} 11/07/2021 02:25:07 - INFO - __main__ - Step 36829: {'lr': 0.0004347362455367292, 'samples': 7071168, 'steps': 36828, 'loss/train': 1.2976011037826538} 11/07/2021 02:25:07 - INFO - __main__ - Step 36830: {'lr': 0.0004347326699924163, 'samples': 7071360, 'steps': 36829, 'loss/train': 1.2785390615463257} 11/07/2021 02:25:07 - INFO - __main__ - Step 36831: {'lr': 0.0004347290943648658, 'samples': 7071552, 'steps': 36830, 'loss/train': 1.8591219186782837} 11/07/2021 02:25:09 - INFO - __main__ - Step 36832: {'lr': 0.00043472551865407917, 'samples': 7071744, 'steps': 36831, 'loss/train': 1.2369974851608276} 11/07/2021 02:25:09 - INFO - __main__ - Step 36833: {'lr': 0.0004347219428600581, 'samples': 7071936, 'steps': 36832, 'loss/train': 1.2083044052124023} 11/07/2021 02:25:09 - INFO - __main__ - Step 36834: {'lr': 0.0004347183669828042, 'samples': 7072128, 'steps': 36833, 'loss/train': 0.12292854487895966} 11/07/2021 02:25:10 - INFO - __main__ - Step 36835: {'lr': 0.00043471479102231904, 'samples': 7072320, 'steps': 36834, 'loss/train': 1.0675302743911743} 11/07/2021 02:25:10 - INFO - __main__ - Step 36836: {'lr': 0.0004347112149786042, 'samples': 7072512, 'steps': 36835, 'loss/train': 1.5071020126342773} 11/07/2021 02:25:11 - INFO - __main__ - Step 36837: {'lr': 0.0004347076388516614, 'samples': 7072704, 'steps': 36836, 'loss/train': 1.4102238416671753} 11/07/2021 02:25:11 - INFO - __main__ - Step 36838: {'lr': 0.00043470406264149215, 'samples': 7072896, 'steps': 36837, 'loss/train': 1.6307144165039062} 11/07/2021 02:25:12 - INFO - __main__ - Step 36839: {'lr': 0.00043470048634809813, 'samples': 7073088, 'steps': 36838, 'loss/train': 1.3599307537078857} 11/07/2021 02:25:12 - INFO - __main__ - Step 36840: {'lr': 0.00043469690997148086, 'samples': 7073280, 'steps': 36839, 'loss/train': 2.168046474456787} 11/07/2021 02:25:12 - INFO - __main__ - Step 36841: {'lr': 0.00043469333351164207, 'samples': 7073472, 'steps': 36840, 'loss/train': 1.3043498992919922} 11/07/2021 02:25:13 - INFO - __main__ - Step 36842: {'lr': 0.0004346897569685833, 'samples': 7073664, 'steps': 36841, 'loss/train': 1.4920456409454346} 11/07/2021 02:25:14 - INFO - __main__ - Step 36843: {'lr': 0.00043468618034230613, 'samples': 7073856, 'steps': 36842, 'loss/train': 1.8202351331710815} 11/07/2021 02:25:14 - INFO - __main__ - Step 36844: {'lr': 0.00043468260363281234, 'samples': 7074048, 'steps': 36843, 'loss/train': 1.4558583498001099} 11/07/2021 02:25:14 - INFO - __main__ - Step 36845: {'lr': 0.0004346790268401033, 'samples': 7074240, 'steps': 36844, 'loss/train': 2.1799943447113037} 11/07/2021 02:25:15 - INFO - __main__ - Step 36846: {'lr': 0.00043467544996418075, 'samples': 7074432, 'steps': 36845, 'loss/train': 1.497530221939087} 11/07/2021 02:25:16 - INFO - __main__ - Step 36847: {'lr': 0.0004346718730050463, 'samples': 7074624, 'steps': 36846, 'loss/train': 2.035320997238159} 11/07/2021 02:25:16 - INFO - __main__ - Step 36848: {'lr': 0.0004346682959627016, 'samples': 7074816, 'steps': 36847, 'loss/train': 1.273651361465454} 11/07/2021 02:25:17 - INFO - __main__ - Step 36849: {'lr': 0.0004346647188371482, 'samples': 7075008, 'steps': 36848, 'loss/train': 1.195306658744812} 11/07/2021 02:25:17 - INFO - __main__ - Step 36850: {'lr': 0.00043466114162838765, 'samples': 7075200, 'steps': 36849, 'loss/train': 1.5177725553512573} 11/07/2021 02:25:17 - INFO - __main__ - Step 36851: {'lr': 0.00043465756433642175, 'samples': 7075392, 'steps': 36850, 'loss/train': 1.799225091934204} 11/07/2021 02:25:18 - INFO - __main__ - Step 36852: {'lr': 0.0004346539869612519, 'samples': 7075584, 'steps': 36851, 'loss/train': 1.2354676723480225} 11/07/2021 02:25:19 - INFO - __main__ - Step 36853: {'lr': 0.0004346504095028799, 'samples': 7075776, 'steps': 36852, 'loss/train': 0.6996008157730103} 11/07/2021 02:25:19 - INFO - __main__ - Step 36854: {'lr': 0.00043464683196130726, 'samples': 7075968, 'steps': 36853, 'loss/train': 1.5860463380813599} 11/07/2021 02:25:19 - INFO - __main__ - Step 36855: {'lr': 0.00043464325433653563, 'samples': 7076160, 'steps': 36854, 'loss/train': 1.1152883768081665} 11/07/2021 02:25:20 - INFO - __main__ - Step 36856: {'lr': 0.0004346396766285665, 'samples': 7076352, 'steps': 36855, 'loss/train': 1.8893579244613647} 11/07/2021 02:25:20 - INFO - __main__ - Step 36857: {'lr': 0.0004346360988374016, 'samples': 7076544, 'steps': 36856, 'loss/train': 1.0569288730621338} 11/07/2021 02:25:21 - INFO - __main__ - Step 36858: {'lr': 0.0004346325209630426, 'samples': 7076736, 'steps': 36857, 'loss/train': 1.5878360271453857} 11/07/2021 02:25:22 - INFO - __main__ - Step 36859: {'lr': 0.00043462894300549097, 'samples': 7076928, 'steps': 36858, 'loss/train': 1.5616737604141235} 11/07/2021 02:25:22 - INFO - __main__ - Step 36860: {'lr': 0.0004346253649647485, 'samples': 7077120, 'steps': 36859, 'loss/train': 1.5420434474945068} 11/07/2021 02:25:22 - INFO - __main__ - Step 36861: {'lr': 0.00043462178684081657, 'samples': 7077312, 'steps': 36860, 'loss/train': 1.1887201070785522} 11/07/2021 02:25:23 - INFO - __main__ - Step 36862: {'lr': 0.00043461820863369697, 'samples': 7077504, 'steps': 36861, 'loss/train': 1.6546112298965454} 11/07/2021 02:25:24 - INFO - __main__ - Step 36863: {'lr': 0.0004346146303433912, 'samples': 7077696, 'steps': 36862, 'loss/train': 1.3106971979141235} 11/07/2021 02:25:24 - INFO - __main__ - Step 36864: {'lr': 0.00043461105196990093, 'samples': 7077888, 'steps': 36863, 'loss/train': 1.595474123954773} 11/07/2021 02:25:24 - INFO - __main__ - Step 36865: {'lr': 0.0004346074735132278, 'samples': 7078080, 'steps': 36864, 'loss/train': 0.924349844455719} 11/07/2021 02:25:25 - INFO - __main__ - Step 36866: {'lr': 0.0004346038949733734, 'samples': 7078272, 'steps': 36865, 'loss/train': 1.4717979431152344} 11/07/2021 02:25:25 - INFO - __main__ - Step 36867: {'lr': 0.0004346003163503393, 'samples': 7078464, 'steps': 36866, 'loss/train': 1.654711365699768} 11/07/2021 02:25:26 - INFO - __main__ - Step 36868: {'lr': 0.00043459673764412713, 'samples': 7078656, 'steps': 36867, 'loss/train': 1.5710206031799316} 11/07/2021 02:25:26 - INFO - __main__ - Step 36869: {'lr': 0.0004345931588547386, 'samples': 7078848, 'steps': 36868, 'loss/train': 1.3123658895492554} 11/07/2021 02:25:27 - INFO - __main__ - Step 36870: {'lr': 0.00043458957998217517, 'samples': 7079040, 'steps': 36869, 'loss/train': 0.5438855290412903} 11/07/2021 02:25:27 - INFO - __main__ - Step 36871: {'lr': 0.0004345860010264385, 'samples': 7079232, 'steps': 36870, 'loss/train': 1.6957143545150757} 11/07/2021 02:25:27 - INFO - __main__ - Step 36872: {'lr': 0.00043458242198753035, 'samples': 7079424, 'steps': 36871, 'loss/train': 1.5708094835281372} 11/07/2021 02:25:28 - INFO - __main__ - Step 36873: {'lr': 0.00043457884286545216, 'samples': 7079616, 'steps': 36872, 'loss/train': 1.5407085418701172} 11/07/2021 02:25:29 - INFO - __main__ - Step 36874: {'lr': 0.0004345752636602055, 'samples': 7079808, 'steps': 36873, 'loss/train': 0.5413639545440674} 11/07/2021 02:25:29 - INFO - __main__ - Step 36875: {'lr': 0.00043457168437179217, 'samples': 7080000, 'steps': 36874, 'loss/train': 1.3550437688827515} 11/07/2021 02:25:30 - INFO - __main__ - Step 36876: {'lr': 0.00043456810500021363, 'samples': 7080192, 'steps': 36875, 'loss/train': 1.5162456035614014} 11/07/2021 02:25:30 - INFO - __main__ - Step 36877: {'lr': 0.00043456452554547153, 'samples': 7080384, 'steps': 36876, 'loss/train': 1.5580357313156128} 11/07/2021 02:25:30 - INFO - __main__ - Step 36878: {'lr': 0.0004345609460075676, 'samples': 7080576, 'steps': 36877, 'loss/train': 1.3629759550094604} 11/07/2021 02:25:31 - INFO - __main__ - Step 36879: {'lr': 0.00043455736638650335, 'samples': 7080768, 'steps': 36878, 'loss/train': 1.707269310951233} 11/07/2021 02:25:32 - INFO - __main__ - Step 36880: {'lr': 0.0004345537866822803, 'samples': 7080960, 'steps': 36879, 'loss/train': 1.4857181310653687} 11/07/2021 02:25:32 - INFO - __main__ - Step 36881: {'lr': 0.0004345502068949002, 'samples': 7081152, 'steps': 36880, 'loss/train': 0.4709186255931854} 11/07/2021 02:25:32 - INFO - __main__ - Step 36882: {'lr': 0.0004345466270243646, 'samples': 7081344, 'steps': 36881, 'loss/train': 1.5124123096466064} 11/07/2021 02:25:33 - INFO - __main__ - Step 36883: {'lr': 0.0004345430470706753, 'samples': 7081536, 'steps': 36882, 'loss/train': 1.4317415952682495} 11/07/2021 02:25:34 - INFO - __main__ - Step 36884: {'lr': 0.00043453946703383354, 'samples': 7081728, 'steps': 36883, 'loss/train': 1.5005035400390625} 11/07/2021 02:25:34 - INFO - __main__ - Step 36885: {'lr': 0.00043453588691384125, 'samples': 7081920, 'steps': 36884, 'loss/train': 1.4188697338104248} 11/07/2021 02:25:34 - INFO - __main__ - Step 36886: {'lr': 0.0004345323067106999, 'samples': 7082112, 'steps': 36885, 'loss/train': 1.356079339981079} 11/07/2021 02:25:35 - INFO - __main__ - Step 36887: {'lr': 0.00043452872642441124, 'samples': 7082304, 'steps': 36886, 'loss/train': 1.3987716436386108} 11/07/2021 02:25:35 - INFO - __main__ - Step 36888: {'lr': 0.0004345251460549766, 'samples': 7082496, 'steps': 36887, 'loss/train': 1.269465684890747} 11/07/2021 02:25:36 - INFO - __main__ - Step 36889: {'lr': 0.0004345215656023979, 'samples': 7082688, 'steps': 36888, 'loss/train': 1.5068018436431885} 11/07/2021 02:25:36 - INFO - __main__ - Step 36890: {'lr': 0.0004345179850666766, 'samples': 7082880, 'steps': 36889, 'loss/train': 0.7879098057746887} 11/07/2021 02:25:37 - INFO - __main__ - Step 36891: {'lr': 0.0004345144044478144, 'samples': 7083072, 'steps': 36890, 'loss/train': 1.3742188215255737} 11/07/2021 02:25:37 - INFO - __main__ - Step 36892: {'lr': 0.0004345108237458128, 'samples': 7083264, 'steps': 36891, 'loss/train': 1.10400390625} 11/07/2021 02:25:38 - INFO - __main__ - Step 36893: {'lr': 0.00043450724296067344, 'samples': 7083456, 'steps': 36892, 'loss/train': 1.1535897254943848} 11/07/2021 02:25:39 - INFO - __main__ - Step 36894: {'lr': 0.00043450366209239803, 'samples': 7083648, 'steps': 36893, 'loss/train': 2.1971640586853027} 11/07/2021 02:25:39 - INFO - __main__ - Step 36895: {'lr': 0.0004345000811409881, 'samples': 7083840, 'steps': 36894, 'loss/train': 1.6039793491363525} 11/07/2021 02:25:39 - INFO - __main__ - Step 36896: {'lr': 0.0004344965001064453, 'samples': 7084032, 'steps': 36895, 'loss/train': 1.6271588802337646} 11/07/2021 02:25:40 - INFO - __main__ - Step 36897: {'lr': 0.0004344929189887712, 'samples': 7084224, 'steps': 36896, 'loss/train': 1.7677448987960815} 11/07/2021 02:25:40 - INFO - __main__ - Step 36898: {'lr': 0.0004344893377879674, 'samples': 7084416, 'steps': 36897, 'loss/train': 1.7011865377426147} 11/07/2021 02:25:40 - INFO - __main__ - Step 36899: {'lr': 0.00043448575650403555, 'samples': 7084608, 'steps': 36898, 'loss/train': 1.2332217693328857} 11/07/2021 02:25:41 - INFO - __main__ - Step 36900: {'lr': 0.00043448217513697727, 'samples': 7084800, 'steps': 36899, 'loss/train': 1.2743849754333496} 11/07/2021 02:25:42 - INFO - __main__ - Step 36901: {'lr': 0.0004344785936867942, 'samples': 7084992, 'steps': 36900, 'loss/train': 1.7653430700302124} 11/07/2021 02:25:42 - INFO - __main__ - Step 36902: {'lr': 0.00043447501215348794, 'samples': 7085184, 'steps': 36901, 'loss/train': 1.8011488914489746} 11/07/2021 02:25:43 - INFO - __main__ - Step 36903: {'lr': 0.00043447143053706007, 'samples': 7085376, 'steps': 36902, 'loss/train': 0.938480794429779} 11/07/2021 02:25:43 - INFO - __main__ - Step 36904: {'lr': 0.00043446784883751223, 'samples': 7085568, 'steps': 36903, 'loss/train': 1.3846172094345093} 11/07/2021 02:25:44 - INFO - __main__ - Step 36905: {'lr': 0.000434464267054846, 'samples': 7085760, 'steps': 36904, 'loss/train': 1.6487951278686523} 11/07/2021 02:25:44 - INFO - __main__ - Step 36906: {'lr': 0.000434460685189063, 'samples': 7085952, 'steps': 36905, 'loss/train': 1.008130431175232} 11/07/2021 02:25:45 - INFO - __main__ - Step 36907: {'lr': 0.0004344571032401649, 'samples': 7086144, 'steps': 36906, 'loss/train': 1.8134862184524536} 11/07/2021 02:25:45 - INFO - __main__ - Step 36908: {'lr': 0.0004344535212081533, 'samples': 7086336, 'steps': 36907, 'loss/train': 1.481315016746521} 11/07/2021 02:25:45 - INFO - __main__ - Step 36909: {'lr': 0.0004344499390930298, 'samples': 7086528, 'steps': 36908, 'loss/train': 1.0674848556518555} 11/07/2021 02:25:46 - INFO - __main__ - Step 36910: {'lr': 0.0004344463568947959, 'samples': 7086720, 'steps': 36909, 'loss/train': 1.404719591140747} 11/07/2021 02:25:47 - INFO - __main__ - Step 36911: {'lr': 0.0004344427746134534, 'samples': 7086912, 'steps': 36910, 'loss/train': 1.7163230180740356} 11/07/2021 02:25:47 - INFO - __main__ - Step 36912: {'lr': 0.0004344391922490037, 'samples': 7087104, 'steps': 36911, 'loss/train': 1.4719501733779907} 11/07/2021 02:25:47 - INFO - __main__ - Step 36913: {'lr': 0.0004344356098014487, 'samples': 7087296, 'steps': 36912, 'loss/train': 1.7014365196228027} 11/07/2021 02:25:48 - INFO - __main__ - Step 36914: {'lr': 0.0004344320272707898, 'samples': 7087488, 'steps': 36913, 'loss/train': 0.6131091713905334} 11/07/2021 02:25:49 - INFO - __main__ - Step 36915: {'lr': 0.0004344284446570287, 'samples': 7087680, 'steps': 36914, 'loss/train': 1.565171480178833} 11/07/2021 02:25:49 - INFO - __main__ - Step 36916: {'lr': 0.00043442486196016697, 'samples': 7087872, 'steps': 36915, 'loss/train': 1.4022310972213745} 11/07/2021 02:25:49 - INFO - __main__ - Step 36917: {'lr': 0.00043442127918020624, 'samples': 7088064, 'steps': 36916, 'loss/train': 1.5596140623092651} 11/07/2021 02:25:50 - INFO - __main__ - Step 36918: {'lr': 0.00043441769631714813, 'samples': 7088256, 'steps': 36917, 'loss/train': 1.1450285911560059} 11/07/2021 02:25:50 - INFO - __main__ - Step 36919: {'lr': 0.0004344141133709943, 'samples': 7088448, 'steps': 36918, 'loss/train': 1.3123325109481812} 11/07/2021 02:25:51 - INFO - __main__ - Step 36920: {'lr': 0.00043441053034174625, 'samples': 7088640, 'steps': 36919, 'loss/train': 1.5261822938919067} 11/07/2021 02:25:52 - INFO - __main__ - Step 36921: {'lr': 0.00043440694722940567, 'samples': 7088832, 'steps': 36920, 'loss/train': 1.166629672050476} 11/07/2021 02:25:52 - INFO - __main__ - Step 36922: {'lr': 0.00043440336403397417, 'samples': 7089024, 'steps': 36921, 'loss/train': 1.3760560750961304} 11/07/2021 02:25:52 - INFO - __main__ - Step 36923: {'lr': 0.00043439978075545337, 'samples': 7089216, 'steps': 36922, 'loss/train': 1.4996906518936157} 11/07/2021 02:25:53 - INFO - __main__ - Step 36924: {'lr': 0.0004343961973938449, 'samples': 7089408, 'steps': 36923, 'loss/train': 1.2339190244674683} 11/07/2021 02:25:54 - INFO - __main__ - Step 36925: {'lr': 0.00043439261394915033, 'samples': 7089600, 'steps': 36924, 'loss/train': 1.7183473110198975} 11/07/2021 02:25:54 - INFO - __main__ - Step 36926: {'lr': 0.0004343890304213713, 'samples': 7089792, 'steps': 36925, 'loss/train': 1.4664045572280884} 11/07/2021 02:25:54 - INFO - __main__ - Step 36927: {'lr': 0.0004343854468105094, 'samples': 7089984, 'steps': 36926, 'loss/train': 1.5949413776397705} 11/07/2021 02:25:55 - INFO - __main__ - Step 36928: {'lr': 0.00043438186311656624, 'samples': 7090176, 'steps': 36927, 'loss/train': 1.6770905256271362} 11/07/2021 02:25:55 - INFO - __main__ - Step 36929: {'lr': 0.0004343782793395435, 'samples': 7090368, 'steps': 36928, 'loss/train': 1.344126582145691} 11/07/2021 02:25:56 - INFO - __main__ - Step 36930: {'lr': 0.00043437469547944277, 'samples': 7090560, 'steps': 36929, 'loss/train': 1.7253801822662354} 11/07/2021 02:25:56 - INFO - __main__ - Step 36931: {'lr': 0.0004343711115362656, 'samples': 7090752, 'steps': 36930, 'loss/train': 1.561821460723877} 11/07/2021 02:25:57 - INFO - __main__ - Step 36932: {'lr': 0.00043436752751001365, 'samples': 7090944, 'steps': 36931, 'loss/train': 1.4372137784957886} 11/07/2021 02:25:57 - INFO - __main__ - Step 36933: {'lr': 0.0004343639434006885, 'samples': 7091136, 'steps': 36932, 'loss/train': 1.7856611013412476} 11/07/2021 02:25:57 - INFO - __main__ - Step 36934: {'lr': 0.00043436035920829186, 'samples': 7091328, 'steps': 36933, 'loss/train': 1.2089636325836182} 11/07/2021 02:25:58 - INFO - __main__ - Step 36935: {'lr': 0.0004343567749328253, 'samples': 7091520, 'steps': 36934, 'loss/train': 1.7776851654052734} 11/07/2021 02:25:59 - INFO - __main__ - Step 36936: {'lr': 0.00043435319057429046, 'samples': 7091712, 'steps': 36935, 'loss/train': 1.6643397808074951} 11/07/2021 02:25:59 - INFO - __main__ - Step 36937: {'lr': 0.0004343496061326888, 'samples': 7091904, 'steps': 36936, 'loss/train': 1.0694328546524048} 11/07/2021 02:26:00 - INFO - __main__ - Step 36938: {'lr': 0.0004343460216080221, 'samples': 7092096, 'steps': 36937, 'loss/train': 1.3263893127441406} 11/07/2021 02:26:00 - INFO - __main__ - Step 36939: {'lr': 0.00043434243700029196, 'samples': 7092288, 'steps': 36938, 'loss/train': 0.12158344686031342} 11/07/2021 02:26:00 - INFO - __main__ - Step 36940: {'lr': 0.0004343388523095, 'samples': 7092480, 'steps': 36939, 'loss/train': 0.7859086394309998} 11/07/2021 02:26:01 - INFO - __main__ - Step 36941: {'lr': 0.00043433526753564766, 'samples': 7092672, 'steps': 36940, 'loss/train': 1.9187930822372437} 11/07/2021 02:26:02 - INFO - __main__ - Step 36942: {'lr': 0.00043433168267873677, 'samples': 7092864, 'steps': 36941, 'loss/train': 1.7053115367889404} 11/07/2021 02:26:02 - INFO - __main__ - Step 36943: {'lr': 0.0004343280977387689, 'samples': 7093056, 'steps': 36942, 'loss/train': 1.142830729484558} 11/07/2021 02:26:02 - INFO - __main__ - Step 36944: {'lr': 0.0004343245127157456, 'samples': 7093248, 'steps': 36943, 'loss/train': 1.1533764600753784} 11/07/2021 02:26:03 - INFO - __main__ - Step 36945: {'lr': 0.0004343209276096686, 'samples': 7093440, 'steps': 36944, 'loss/train': 1.976108431816101} 11/07/2021 02:26:04 - INFO - __main__ - Step 36946: {'lr': 0.00043431734242053933, 'samples': 7093632, 'steps': 36945, 'loss/train': 1.3674890995025635} 11/07/2021 02:26:04 - INFO - __main__ - Step 36947: {'lr': 0.0004343137571483595, 'samples': 7093824, 'steps': 36946, 'loss/train': 1.2369861602783203} 11/07/2021 02:26:05 - INFO - __main__ - Step 36948: {'lr': 0.00043431017179313075, 'samples': 7094016, 'steps': 36947, 'loss/train': 1.1444861888885498} 11/07/2021 02:26:05 - INFO - __main__ - Step 36949: {'lr': 0.0004343065863548548, 'samples': 7094208, 'steps': 36948, 'loss/train': 1.6003363132476807} 11/07/2021 02:26:05 - INFO - __main__ - Step 36950: {'lr': 0.000434303000833533, 'samples': 7094400, 'steps': 36949, 'loss/train': 1.830175757408142} 11/07/2021 02:26:06 - INFO - __main__ - Step 36951: {'lr': 0.00043429941522916715, 'samples': 7094592, 'steps': 36950, 'loss/train': 1.4522465467453003} 11/07/2021 02:26:07 - INFO - __main__ - Step 36952: {'lr': 0.0004342958295417588, 'samples': 7094784, 'steps': 36951, 'loss/train': 1.5480337142944336} 11/07/2021 02:26:07 - INFO - __main__ - Step 36953: {'lr': 0.00043429224377130964, 'samples': 7094976, 'steps': 36952, 'loss/train': 1.239585041999817} 11/07/2021 02:26:07 - INFO - __main__ - Step 36954: {'lr': 0.00043428865791782126, 'samples': 7095168, 'steps': 36953, 'loss/train': 1.5699453353881836} 11/07/2021 02:26:08 - INFO - __main__ - Step 36955: {'lr': 0.0004342850719812952, 'samples': 7095360, 'steps': 36954, 'loss/train': 5.742282390594482} 11/07/2021 02:26:08 - INFO - __main__ - Step 36956: {'lr': 0.00043428148596173316, 'samples': 7095552, 'steps': 36955, 'loss/train': 1.2339860200881958} 11/07/2021 02:26:09 - INFO - __main__ - Step 36957: {'lr': 0.00043427789985913675, 'samples': 7095744, 'steps': 36956, 'loss/train': 1.3179750442504883} 11/07/2021 02:26:10 - INFO - __main__ - Step 36958: {'lr': 0.00043427431367350753, 'samples': 7095936, 'steps': 36957, 'loss/train': 3.1354281902313232} 11/07/2021 02:26:10 - INFO - __main__ - Step 36959: {'lr': 0.0004342707274048472, 'samples': 7096128, 'steps': 36958, 'loss/train': 1.6288416385650635} 11/07/2021 02:26:10 - INFO - __main__ - Step 36960: {'lr': 0.0004342671410531572, 'samples': 7096320, 'steps': 36959, 'loss/train': 1.7127184867858887} 11/07/2021 02:26:11 - INFO - __main__ - Step 36961: {'lr': 0.00043426355461843934, 'samples': 7096512, 'steps': 36960, 'loss/train': 1.260001540184021} 11/07/2021 02:26:12 - INFO - __main__ - Step 36962: {'lr': 0.00043425996810069525, 'samples': 7096704, 'steps': 36961, 'loss/train': 0.8840351104736328} 11/07/2021 02:26:12 - INFO - __main__ - Step 36963: {'lr': 0.0004342563814999264, 'samples': 7096896, 'steps': 36962, 'loss/train': 1.4114840030670166} 11/07/2021 02:26:12 - INFO - __main__ - Step 36964: {'lr': 0.0004342527948161344, 'samples': 7097088, 'steps': 36963, 'loss/train': 1.3243039846420288} 11/07/2021 02:26:13 - INFO - __main__ - Step 36965: {'lr': 0.000434249208049321, 'samples': 7097280, 'steps': 36964, 'loss/train': 1.5644744634628296} 11/07/2021 02:26:13 - INFO - __main__ - Step 36966: {'lr': 0.0004342456211994877, 'samples': 7097472, 'steps': 36965, 'loss/train': 1.495862603187561} 11/07/2021 02:26:14 - INFO - __main__ - Step 36967: {'lr': 0.00043424203426663623, 'samples': 7097664, 'steps': 36966, 'loss/train': 0.9820639491081238} 11/07/2021 02:26:14 - INFO - __main__ - Step 36968: {'lr': 0.0004342384472507681, 'samples': 7097856, 'steps': 36967, 'loss/train': 2.0179545879364014} 11/07/2021 02:26:15 - INFO - __main__ - Step 36969: {'lr': 0.00043423486015188497, 'samples': 7098048, 'steps': 36968, 'loss/train': 1.4496098756790161} 11/07/2021 02:26:15 - INFO - __main__ - Step 36970: {'lr': 0.00043423127296998845, 'samples': 7098240, 'steps': 36969, 'loss/train': 1.783840298652649} 11/07/2021 02:26:15 - INFO - __main__ - Step 36971: {'lr': 0.0004342276857050802, 'samples': 7098432, 'steps': 36970, 'loss/train': 1.8025826215744019} 11/07/2021 02:26:16 - INFO - __main__ - Step 36972: {'lr': 0.00043422409835716175, 'samples': 7098624, 'steps': 36971, 'loss/train': 1.6911512613296509} 11/07/2021 02:26:17 - INFO - __main__ - Step 36973: {'lr': 0.00043422051092623483, 'samples': 7098816, 'steps': 36972, 'loss/train': 1.4299386739730835} 11/07/2021 02:26:17 - INFO - __main__ - Step 36974: {'lr': 0.0004342169234123009, 'samples': 7099008, 'steps': 36973, 'loss/train': 1.6480128765106201} 11/07/2021 02:26:18 - INFO - __main__ - Step 36975: {'lr': 0.0004342133358153617, 'samples': 7099200, 'steps': 36974, 'loss/train': 1.752324104309082} 11/07/2021 02:26:18 - INFO - __main__ - Step 36976: {'lr': 0.0004342097481354189, 'samples': 7099392, 'steps': 36975, 'loss/train': 1.761583685874939} 11/07/2021 02:26:19 - INFO - __main__ - Step 36977: {'lr': 0.00043420616037247395, 'samples': 7099584, 'steps': 36976, 'loss/train': 1.5475198030471802} 11/07/2021 02:26:19 - INFO - __main__ - Step 36978: {'lr': 0.0004342025725265285, 'samples': 7099776, 'steps': 36977, 'loss/train': 1.5415979623794556} 11/07/2021 02:26:20 - INFO - __main__ - Step 36979: {'lr': 0.00043419898459758435, 'samples': 7099968, 'steps': 36978, 'loss/train': 1.387312889099121} 11/07/2021 02:26:20 - INFO - __main__ - Step 36980: {'lr': 0.00043419539658564286, 'samples': 7100160, 'steps': 36979, 'loss/train': 1.094067931175232} 11/07/2021 02:26:20 - INFO - __main__ - Step 36981: {'lr': 0.0004341918084907058, 'samples': 7100352, 'steps': 36980, 'loss/train': 2.066326379776001} 11/07/2021 02:26:21 - INFO - __main__ - Step 36982: {'lr': 0.0004341882203127747, 'samples': 7100544, 'steps': 36981, 'loss/train': 1.1311571598052979} 11/07/2021 02:26:22 - INFO - __main__ - Step 36983: {'lr': 0.00043418463205185134, 'samples': 7100736, 'steps': 36982, 'loss/train': 1.410170555114746} 11/07/2021 02:26:22 - INFO - __main__ - Step 36984: {'lr': 0.0004341810437079372, 'samples': 7100928, 'steps': 36983, 'loss/train': 1.2440638542175293} 11/07/2021 02:26:22 - INFO - __main__ - Step 36985: {'lr': 0.0004341774552810339, 'samples': 7101120, 'steps': 36984, 'loss/train': 0.09527873247861862} 11/07/2021 02:26:23 - INFO - __main__ - Step 36986: {'lr': 0.0004341738667711431, 'samples': 7101312, 'steps': 36985, 'loss/train': 1.8854491710662842} 11/07/2021 02:26:23 - INFO - __main__ - Step 36987: {'lr': 0.0004341702781782664, 'samples': 7101504, 'steps': 36986, 'loss/train': 1.4928325414657593} 11/07/2021 02:26:24 - INFO - __main__ - Step 36988: {'lr': 0.00043416668950240536, 'samples': 7101696, 'steps': 36987, 'loss/train': 1.3889577388763428} 11/07/2021 02:26:25 - INFO - __main__ - Step 36989: {'lr': 0.0004341631007435617, 'samples': 7101888, 'steps': 36988, 'loss/train': 1.1590068340301514} 11/07/2021 02:26:25 - INFO - __main__ - Step 36990: {'lr': 0.00043415951190173697, 'samples': 7102080, 'steps': 36989, 'loss/train': 1.2773730754852295} 11/07/2021 02:26:25 - INFO - __main__ - Step 36991: {'lr': 0.00043415592297693276, 'samples': 7102272, 'steps': 36990, 'loss/train': 1.5589598417282104} 11/07/2021 02:26:26 - INFO - __main__ - Step 36992: {'lr': 0.00043415233396915077, 'samples': 7102464, 'steps': 36991, 'loss/train': 1.318044662475586} 11/07/2021 02:26:27 - INFO - __main__ - Step 36993: {'lr': 0.0004341487448783926, 'samples': 7102656, 'steps': 36992, 'loss/train': 2.0069692134857178} 11/07/2021 02:26:27 - INFO - __main__ - Step 36994: {'lr': 0.00043414515570465987, 'samples': 7102848, 'steps': 36993, 'loss/train': 1.1666438579559326} 11/07/2021 02:26:27 - INFO - __main__ - Step 36995: {'lr': 0.0004341415664479541, 'samples': 7103040, 'steps': 36994, 'loss/train': 1.4736157655715942} 11/07/2021 02:26:28 - INFO - __main__ - Step 36996: {'lr': 0.00043413797710827707, 'samples': 7103232, 'steps': 36995, 'loss/train': 1.1507172584533691} 11/07/2021 02:26:28 - INFO - __main__ - Step 36997: {'lr': 0.00043413438768563026, 'samples': 7103424, 'steps': 36996, 'loss/train': 1.53364896774292} 11/07/2021 02:26:29 - INFO - __main__ - Step 36998: {'lr': 0.0004341307981800153, 'samples': 7103616, 'steps': 36997, 'loss/train': 1.4414423704147339} 11/07/2021 02:26:30 - INFO - __main__ - Step 36999: {'lr': 0.0004341272085914339, 'samples': 7103808, 'steps': 36998, 'loss/train': 1.2519278526306152} 11/07/2021 02:26:30 - INFO - __main__ - Step 37000: {'lr': 0.00043412361891988763, 'samples': 7104000, 'steps': 36999, 'loss/train': 1.7549240589141846} 11/07/2021 02:26:30 - INFO - __main__ - Step 37001: {'lr': 0.0004341200291653781, 'samples': 7104192, 'steps': 37000, 'loss/train': 1.153167486190796} 11/07/2021 02:26:31 - INFO - __main__ - Step 37002: {'lr': 0.00043411643932790686, 'samples': 7104384, 'steps': 37001, 'loss/train': 0.5130859613418579} 11/07/2021 02:26:32 - INFO - __main__ - Step 37003: {'lr': 0.0004341128494074756, 'samples': 7104576, 'steps': 37002, 'loss/train': 0.9205571413040161} 11/07/2021 02:26:32 - INFO - __main__ - Step 37004: {'lr': 0.00043410925940408595, 'samples': 7104768, 'steps': 37003, 'loss/train': 1.8095409870147705} 11/07/2021 02:26:32 - INFO - __main__ - Step 37005: {'lr': 0.00043410566931773953, 'samples': 7104960, 'steps': 37004, 'loss/train': 1.2690837383270264} 11/07/2021 02:26:33 - INFO - __main__ - Step 37006: {'lr': 0.000434102079148438, 'samples': 7105152, 'steps': 37005, 'loss/train': 1.5979087352752686} 11/07/2021 02:26:33 - INFO - __main__ - Step 37007: {'lr': 0.0004340984888961828, 'samples': 7105344, 'steps': 37006, 'loss/train': 1.3557651042938232} 11/07/2021 02:26:33 - INFO - __main__ - Step 37008: {'lr': 0.00043409489856097573, 'samples': 7105536, 'steps': 37007, 'loss/train': 1.4115225076675415} 11/07/2021 02:26:34 - INFO - __main__ - Step 37009: {'lr': 0.0004340913081428183, 'samples': 7105728, 'steps': 37008, 'loss/train': 1.5827487707138062} 11/07/2021 02:26:35 - INFO - __main__ - Step 37010: {'lr': 0.00043408771764171216, 'samples': 7105920, 'steps': 37009, 'loss/train': 1.5879969596862793} 11/07/2021 02:26:35 - INFO - __main__ - Step 37011: {'lr': 0.000434084127057659, 'samples': 7106112, 'steps': 37010, 'loss/train': 1.4903258085250854} 11/07/2021 02:26:35 - INFO - __main__ - Step 37012: {'lr': 0.0004340805363906603, 'samples': 7106304, 'steps': 37011, 'loss/train': 1.3931164741516113} 11/07/2021 02:26:36 - INFO - __main__ - Step 37013: {'lr': 0.00043407694564071773, 'samples': 7106496, 'steps': 37012, 'loss/train': 2.0241074562072754} 11/07/2021 02:26:37 - INFO - __main__ - Step 37014: {'lr': 0.00043407335480783306, 'samples': 7106688, 'steps': 37013, 'loss/train': 1.6712000370025635} 11/07/2021 02:26:37 - INFO - __main__ - Step 37015: {'lr': 0.0004340697638920077, 'samples': 7106880, 'steps': 37014, 'loss/train': 1.5157111883163452} 11/07/2021 02:26:38 - INFO - __main__ - Step 37016: {'lr': 0.0004340661728932433, 'samples': 7107072, 'steps': 37015, 'loss/train': 1.5959793329238892} 11/07/2021 02:26:38 - INFO - __main__ - Step 37017: {'lr': 0.0004340625818115416, 'samples': 7107264, 'steps': 37016, 'loss/train': 1.7012815475463867} 11/07/2021 02:26:38 - INFO - __main__ - Step 37018: {'lr': 0.00043405899064690405, 'samples': 7107456, 'steps': 37017, 'loss/train': 1.6925928592681885} 11/07/2021 02:26:39 - INFO - __main__ - Step 37019: {'lr': 0.0004340553993993325, 'samples': 7107648, 'steps': 37018, 'loss/train': 1.8460217714309692} 11/07/2021 02:26:40 - INFO - __main__ - Step 37020: {'lr': 0.0004340518080688283, 'samples': 7107840, 'steps': 37019, 'loss/train': 1.5365556478500366} 11/07/2021 02:26:40 - INFO - __main__ - Step 37021: {'lr': 0.0004340482166553932, 'samples': 7108032, 'steps': 37020, 'loss/train': 1.8368773460388184} 11/07/2021 02:26:40 - INFO - __main__ - Step 37022: {'lr': 0.0004340446251590289, 'samples': 7108224, 'steps': 37021, 'loss/train': 1.7531707286834717} 11/07/2021 02:26:41 - INFO - __main__ - Step 37023: {'lr': 0.00043404103357973684, 'samples': 7108416, 'steps': 37022, 'loss/train': 1.1846529245376587} 11/07/2021 02:26:42 - INFO - __main__ - Step 37024: {'lr': 0.0004340374419175188, 'samples': 7108608, 'steps': 37023, 'loss/train': 1.5365815162658691} 11/07/2021 02:26:42 - INFO - __main__ - Step 37025: {'lr': 0.0004340338501723763, 'samples': 7108800, 'steps': 37024, 'loss/train': 2.1280877590179443} 11/07/2021 02:26:42 - INFO - __main__ - Step 37026: {'lr': 0.00043403025834431097, 'samples': 7108992, 'steps': 37025, 'loss/train': 1.6921418905258179} 11/07/2021 02:26:43 - INFO - __main__ - Step 37027: {'lr': 0.00043402666643332444, 'samples': 7109184, 'steps': 37026, 'loss/train': 1.3075718879699707} 11/07/2021 02:26:43 - INFO - __main__ - Step 37028: {'lr': 0.00043402307443941835, 'samples': 7109376, 'steps': 37027, 'loss/train': 1.4104558229446411} 11/07/2021 02:26:44 - INFO - __main__ - Step 37029: {'lr': 0.00043401948236259437, 'samples': 7109568, 'steps': 37028, 'loss/train': 1.4770339727401733} 11/07/2021 02:26:45 - INFO - __main__ - Step 37030: {'lr': 0.000434015890202854, 'samples': 7109760, 'steps': 37029, 'loss/train': 1.0834112167358398} 11/07/2021 02:26:45 - INFO - __main__ - Step 37031: {'lr': 0.0004340122979601989, 'samples': 7109952, 'steps': 37030, 'loss/train': 1.4986871480941772} 11/07/2021 02:26:45 - INFO - __main__ - Step 37032: {'lr': 0.0004340087056346307, 'samples': 7110144, 'steps': 37031, 'loss/train': 1.2967861890792847} 11/07/2021 02:26:46 - INFO - __main__ - Step 37033: {'lr': 0.000434005113226151, 'samples': 7110336, 'steps': 37032, 'loss/train': 1.0271788835525513} 11/07/2021 02:26:47 - INFO - __main__ - Step 37034: {'lr': 0.0004340015207347614, 'samples': 7110528, 'steps': 37033, 'loss/train': 1.4559848308563232} 11/07/2021 02:26:47 - INFO - __main__ - Step 37035: {'lr': 0.0004339979281604636, 'samples': 7110720, 'steps': 37034, 'loss/train': 1.8344744443893433} 11/07/2021 02:26:48 - INFO - __main__ - Step 37036: {'lr': 0.00043399433550325917, 'samples': 7110912, 'steps': 37035, 'loss/train': 1.378834843635559} 11/07/2021 02:26:48 - INFO - __main__ - Step 37037: {'lr': 0.00043399074276314974, 'samples': 7111104, 'steps': 37036, 'loss/train': 1.4435096979141235} 11/07/2021 02:26:48 - INFO - __main__ - Step 37038: {'lr': 0.00043398714994013696, 'samples': 7111296, 'steps': 37037, 'loss/train': 1.315832257270813} 11/07/2021 02:26:49 - INFO - __main__ - Step 37039: {'lr': 0.00043398355703422233, 'samples': 7111488, 'steps': 37038, 'loss/train': 1.2000634670257568} 11/07/2021 02:26:50 - INFO - __main__ - Step 37040: {'lr': 0.0004339799640454076, 'samples': 7111680, 'steps': 37039, 'loss/train': 1.1245691776275635} 11/07/2021 02:26:50 - INFO - __main__ - Step 37041: {'lr': 0.00043397637097369434, 'samples': 7111872, 'steps': 37040, 'loss/train': 1.210996150970459} 11/07/2021 02:26:50 - INFO - __main__ - Step 37042: {'lr': 0.0004339727778190842, 'samples': 7112064, 'steps': 37041, 'loss/train': 1.5488569736480713} 11/07/2021 02:26:51 - INFO - __main__ - Step 37043: {'lr': 0.0004339691845815786, 'samples': 7112256, 'steps': 37042, 'loss/train': 1.4970704317092896} 11/07/2021 02:26:51 - INFO - __main__ - Step 37044: {'lr': 0.0004339655912611795, 'samples': 7112448, 'steps': 37043, 'loss/train': 1.3883769512176514} 11/07/2021 02:26:52 - INFO - __main__ - Step 37045: {'lr': 0.00043396199785788824, 'samples': 7112640, 'steps': 37044, 'loss/train': 1.9679926633834839} 11/07/2021 02:26:52 - INFO - __main__ - Step 37046: {'lr': 0.00043395840437170666, 'samples': 7112832, 'steps': 37045, 'loss/train': 1.365040898323059} 11/07/2021 02:26:53 - INFO - __main__ - Step 37047: {'lr': 0.00043395481080263614, 'samples': 7113024, 'steps': 37046, 'loss/train': 1.3028538227081299} 11/07/2021 02:26:53 - INFO - __main__ - Step 37048: {'lr': 0.0004339512171506785, 'samples': 7113216, 'steps': 37047, 'loss/train': 1.3216134309768677} 11/07/2021 02:26:53 - INFO - __main__ - Step 37049: {'lr': 0.0004339476234158352, 'samples': 7113408, 'steps': 37048, 'loss/train': 1.917819619178772} 11/07/2021 02:26:55 - INFO - __main__ - Step 37050: {'lr': 0.00043394402959810795, 'samples': 7113600, 'steps': 37049, 'loss/train': 1.5562846660614014} 11/07/2021 02:26:55 - INFO - __main__ - Step 37051: {'lr': 0.00043394043569749843, 'samples': 7113792, 'steps': 37050, 'loss/train': 1.555881381034851} 11/07/2021 02:26:55 - INFO - __main__ - Step 37052: {'lr': 0.00043393684171400817, 'samples': 7113984, 'steps': 37051, 'loss/train': 1.501509666442871} 11/07/2021 02:26:56 - INFO - __main__ - Step 37053: {'lr': 0.00043393324764763873, 'samples': 7114176, 'steps': 37052, 'loss/train': 1.3671139478683472} 11/07/2021 02:26:56 - INFO - __main__ - Step 37054: {'lr': 0.0004339296534983919, 'samples': 7114368, 'steps': 37053, 'loss/train': 1.194187045097351} 11/07/2021 02:26:58 - INFO - __main__ - Step 37055: {'lr': 0.00043392605926626914, 'samples': 7114560, 'steps': 37054, 'loss/train': 1.2914605140686035} 11/07/2021 02:26:58 - INFO - __main__ - Step 37056: {'lr': 0.0004339224649512722, 'samples': 7114752, 'steps': 37055, 'loss/train': 1.5854653120040894} 11/07/2021 02:26:58 - INFO - __main__ - Step 37057: {'lr': 0.00043391887055340263, 'samples': 7114944, 'steps': 37056, 'loss/train': 1.5734776258468628} 11/07/2021 02:26:59 - INFO - __main__ - Step 37058: {'lr': 0.000433915276072662, 'samples': 7115136, 'steps': 37057, 'loss/train': 1.7411218881607056} 11/07/2021 02:26:59 - INFO - __main__ - Step 37059: {'lr': 0.00043391168150905203, 'samples': 7115328, 'steps': 37058, 'loss/train': 1.6579246520996094} 11/07/2021 02:26:59 - INFO - __main__ - Step 37060: {'lr': 0.0004339080868625743, 'samples': 7115520, 'steps': 37059, 'loss/train': 0.5986313223838806} 11/07/2021 02:27:01 - INFO - __main__ - Step 37061: {'lr': 0.00043390449213323037, 'samples': 7115712, 'steps': 37060, 'loss/train': 0.4319573640823364} 11/07/2021 02:27:01 - INFO - __main__ - Step 37062: {'lr': 0.000433900897321022, 'samples': 7115904, 'steps': 37061, 'loss/train': 0.08170078694820404} 11/07/2021 02:27:01 - INFO - __main__ - Step 37063: {'lr': 0.0004338973024259506, 'samples': 7116096, 'steps': 37062, 'loss/train': 0.7094062566757202} 11/07/2021 02:27:02 - INFO - __main__ - Step 37064: {'lr': 0.00043389370744801806, 'samples': 7116288, 'steps': 37063, 'loss/train': 1.4509012699127197} 11/07/2021 02:27:02 - INFO - __main__ - Step 37065: {'lr': 0.00043389011238722575, 'samples': 7116480, 'steps': 37064, 'loss/train': 1.0019127130508423} 11/07/2021 02:27:03 - INFO - __main__ - Step 37066: {'lr': 0.0004338865172435754, 'samples': 7116672, 'steps': 37065, 'loss/train': 1.5191227197647095} 11/07/2021 02:27:03 - INFO - __main__ - Step 37067: {'lr': 0.00043388292201706867, 'samples': 7116864, 'steps': 37066, 'loss/train': 1.7019362449645996} 11/07/2021 02:27:04 - INFO - __main__ - Step 37068: {'lr': 0.0004338793267077071, 'samples': 7117056, 'steps': 37067, 'loss/train': 1.5938674211502075} 11/07/2021 02:27:04 - INFO - __main__ - Step 37069: {'lr': 0.0004338757313154923, 'samples': 7117248, 'steps': 37068, 'loss/train': 1.4200563430786133} 11/07/2021 02:27:04 - INFO - __main__ - Step 37070: {'lr': 0.000433872135840426, 'samples': 7117440, 'steps': 37069, 'loss/train': 0.9483161568641663} 11/07/2021 02:27:05 - INFO - __main__ - Step 37071: {'lr': 0.00043386854028250977, 'samples': 7117632, 'steps': 37070, 'loss/train': 1.6696423292160034} 11/07/2021 02:27:06 - INFO - __main__ - Step 37072: {'lr': 0.00043386494464174515, 'samples': 7117824, 'steps': 37071, 'loss/train': 2.824124336242676} 11/07/2021 02:27:06 - INFO - __main__ - Step 37073: {'lr': 0.0004338613489181338, 'samples': 7118016, 'steps': 37072, 'loss/train': 1.9422165155410767} 11/07/2021 02:27:06 - INFO - __main__ - Step 37074: {'lr': 0.00043385775311167746, 'samples': 7118208, 'steps': 37073, 'loss/train': 1.103857398033142} 11/07/2021 02:27:07 - INFO - __main__ - Step 37075: {'lr': 0.00043385415722237765, 'samples': 7118400, 'steps': 37074, 'loss/train': 1.910669207572937} 11/07/2021 02:27:08 - INFO - __main__ - Step 37076: {'lr': 0.0004338505612502359, 'samples': 7118592, 'steps': 37075, 'loss/train': 1.5614584684371948} 11/07/2021 02:27:08 - INFO - __main__ - Step 37077: {'lr': 0.000433846965195254, 'samples': 7118784, 'steps': 37076, 'loss/train': 1.5544991493225098} 11/07/2021 02:27:08 - INFO - __main__ - Step 37078: {'lr': 0.00043384336905743343, 'samples': 7118976, 'steps': 37077, 'loss/train': 1.7339357137680054} 11/07/2021 02:27:09 - INFO - __main__ - Step 37079: {'lr': 0.0004338397728367759, 'samples': 7119168, 'steps': 37078, 'loss/train': 1.6730190515518188} 11/07/2021 02:27:09 - INFO - __main__ - Step 37080: {'lr': 0.000433836176533283, 'samples': 7119360, 'steps': 37079, 'loss/train': 1.6230782270431519} 11/07/2021 02:27:10 - INFO - __main__ - Step 37081: {'lr': 0.0004338325801469564, 'samples': 7119552, 'steps': 37080, 'loss/train': 1.6183432340621948} 11/07/2021 02:27:11 - INFO - __main__ - Step 37082: {'lr': 0.00043382898367779767, 'samples': 7119744, 'steps': 37081, 'loss/train': 1.7554967403411865} 11/07/2021 02:27:11 - INFO - __main__ - Step 37083: {'lr': 0.00043382538712580845, 'samples': 7119936, 'steps': 37082, 'loss/train': 1.476054310798645} 11/07/2021 02:27:11 - INFO - __main__ - Step 37084: {'lr': 0.00043382179049099024, 'samples': 7120128, 'steps': 37083, 'loss/train': 1.7822281122207642} 11/07/2021 02:27:12 - INFO - __main__ - Step 37085: {'lr': 0.00043381819377334485, 'samples': 7120320, 'steps': 37084, 'loss/train': 1.2737452983856201} 11/07/2021 02:27:12 - INFO - __main__ - Step 37086: {'lr': 0.00043381459697287383, 'samples': 7120512, 'steps': 37085, 'loss/train': 1.7624574899673462} 11/07/2021 02:27:13 - INFO - __main__ - Step 37087: {'lr': 0.0004338110000895787, 'samples': 7120704, 'steps': 37086, 'loss/train': 1.9725539684295654} 11/07/2021 02:27:13 - INFO - __main__ - Step 37088: {'lr': 0.00043380740312346135, 'samples': 7120896, 'steps': 37087, 'loss/train': 1.3805655241012573} 11/07/2021 02:27:14 - INFO - __main__ - Step 37089: {'lr': 0.00043380380607452307, 'samples': 7121088, 'steps': 37088, 'loss/train': 2.1629951000213623} 11/07/2021 02:27:14 - INFO - __main__ - Step 37090: {'lr': 0.0004338002089427657, 'samples': 7121280, 'steps': 37089, 'loss/train': 1.704150676727295} 11/07/2021 02:27:14 - INFO - __main__ - Step 37091: {'lr': 0.00043379661172819075, 'samples': 7121472, 'steps': 37090, 'loss/train': 1.3050607442855835} 11/07/2021 02:27:15 - INFO - __main__ - Step 37092: {'lr': 0.0004337930144307999, 'samples': 7121664, 'steps': 37091, 'loss/train': 1.029489278793335} 11/07/2021 02:27:16 - INFO - __main__ - Step 37093: {'lr': 0.0004337894170505947, 'samples': 7121856, 'steps': 37092, 'loss/train': 1.5230679512023926} 11/07/2021 02:27:16 - INFO - __main__ - Step 37094: {'lr': 0.0004337858195875769, 'samples': 7122048, 'steps': 37093, 'loss/train': 1.5719635486602783} 11/07/2021 02:27:17 - INFO - __main__ - Step 37095: {'lr': 0.00043378222204174807, 'samples': 7122240, 'steps': 37094, 'loss/train': 1.6674373149871826} 11/07/2021 02:27:17 - INFO - __main__ - Step 37096: {'lr': 0.0004337786244131097, 'samples': 7122432, 'steps': 37095, 'loss/train': 1.4746836423873901} 11/07/2021 02:27:18 - INFO - __main__ - Step 37097: {'lr': 0.00043377502670166357, 'samples': 7122624, 'steps': 37096, 'loss/train': 2.1680285930633545} 11/07/2021 02:27:18 - INFO - __main__ - Step 37098: {'lr': 0.0004337714289074113, 'samples': 7122816, 'steps': 37097, 'loss/train': 1.1618105173110962} 11/07/2021 02:27:19 - INFO - __main__ - Step 37099: {'lr': 0.0004337678310303544, 'samples': 7123008, 'steps': 37098, 'loss/train': 1.6241350173950195} 11/07/2021 02:27:19 - INFO - __main__ - Step 37100: {'lr': 0.00043376423307049455, 'samples': 7123200, 'steps': 37099, 'loss/train': 0.8144013285636902} 11/07/2021 02:27:19 - INFO - __main__ - Step 37101: {'lr': 0.00043376063502783337, 'samples': 7123392, 'steps': 37100, 'loss/train': 1.6171725988388062} 11/07/2021 02:27:20 - INFO - __main__ - Step 37102: {'lr': 0.00043375703690237254, 'samples': 7123584, 'steps': 37101, 'loss/train': 1.6170181035995483} 11/07/2021 02:27:21 - INFO - __main__ - Step 37103: {'lr': 0.0004337534386941135, 'samples': 7123776, 'steps': 37102, 'loss/train': 1.580391764640808} 11/07/2021 02:27:21 - INFO - __main__ - Step 37104: {'lr': 0.00043374984040305816, 'samples': 7123968, 'steps': 37103, 'loss/train': 4.709044456481934} 11/07/2021 02:27:21 - INFO - __main__ - Step 37105: {'lr': 0.00043374624202920786, 'samples': 7124160, 'steps': 37104, 'loss/train': 1.4684909582138062} 11/07/2021 02:27:22 - INFO - __main__ - Step 37106: {'lr': 0.0004337426435725644, 'samples': 7124352, 'steps': 37105, 'loss/train': 1.618173599243164} 11/07/2021 02:27:23 - INFO - __main__ - Step 37107: {'lr': 0.00043373904503312934, 'samples': 7124544, 'steps': 37106, 'loss/train': 1.5031660795211792} 11/07/2021 02:27:23 - INFO - __main__ - Step 37108: {'lr': 0.0004337354464109042, 'samples': 7124736, 'steps': 37107, 'loss/train': 1.3616937398910522} 11/07/2021 02:27:23 - INFO - __main__ - Step 37109: {'lr': 0.0004337318477058908, 'samples': 7124928, 'steps': 37108, 'loss/train': 1.0435408353805542} 11/07/2021 02:27:24 - INFO - __main__ - Step 37110: {'lr': 0.0004337282489180907, 'samples': 7125120, 'steps': 37109, 'loss/train': 1.6722913980484009} 11/07/2021 02:27:24 - INFO - __main__ - Step 37111: {'lr': 0.0004337246500475054, 'samples': 7125312, 'steps': 37110, 'loss/train': 1.277995228767395} 11/07/2021 02:27:24 - INFO - __main__ - Step 37112: {'lr': 0.0004337210510941366, 'samples': 7125504, 'steps': 37111, 'loss/train': 1.1224045753479004} 11/07/2021 02:27:25 - INFO - __main__ - Step 37113: {'lr': 0.000433717452057986, 'samples': 7125696, 'steps': 37112, 'loss/train': 0.9895749092102051} 11/07/2021 02:27:26 - INFO - __main__ - Step 37114: {'lr': 0.00043371385293905517, 'samples': 7125888, 'steps': 37113, 'loss/train': 1.5234973430633545} 11/07/2021 02:27:26 - INFO - __main__ - Step 37115: {'lr': 0.0004337102537373456, 'samples': 7126080, 'steps': 37114, 'loss/train': 1.5487487316131592} 11/07/2021 02:27:27 - INFO - __main__ - Step 37116: {'lr': 0.0004337066544528591, 'samples': 7126272, 'steps': 37115, 'loss/train': 0.6512836217880249} 11/07/2021 02:27:27 - INFO - __main__ - Step 37117: {'lr': 0.00043370305508559723, 'samples': 7126464, 'steps': 37116, 'loss/train': 1.3662883043289185} 11/07/2021 02:27:28 - INFO - __main__ - Step 37118: {'lr': 0.00043369945563556157, 'samples': 7126656, 'steps': 37117, 'loss/train': 1.2095894813537598} 11/07/2021 02:27:28 - INFO - __main__ - Step 37119: {'lr': 0.00043369585610275374, 'samples': 7126848, 'steps': 37118, 'loss/train': 0.9861893057823181} 11/07/2021 02:27:29 - INFO - __main__ - Step 37120: {'lr': 0.0004336922564871755, 'samples': 7127040, 'steps': 37119, 'loss/train': 1.63252592086792} 11/07/2021 02:27:29 - INFO - __main__ - Step 37121: {'lr': 0.00043368865678882824, 'samples': 7127232, 'steps': 37120, 'loss/train': 1.4385783672332764} 11/07/2021 02:27:29 - INFO - __main__ - Step 37122: {'lr': 0.00043368505700771377, 'samples': 7127424, 'steps': 37121, 'loss/train': 1.6075456142425537} 11/07/2021 02:27:30 - INFO - __main__ - Step 37123: {'lr': 0.00043368145714383364, 'samples': 7127616, 'steps': 37122, 'loss/train': 1.1798597574234009} 11/07/2021 02:27:31 - INFO - __main__ - Step 37124: {'lr': 0.00043367785719718947, 'samples': 7127808, 'steps': 37123, 'loss/train': 1.5068614482879639} 11/07/2021 02:27:31 - INFO - __main__ - Step 37125: {'lr': 0.0004336742571677829, 'samples': 7128000, 'steps': 37124, 'loss/train': 1.5282413959503174} 11/07/2021 02:27:31 - INFO - __main__ - Step 37126: {'lr': 0.00043367065705561547, 'samples': 7128192, 'steps': 37125, 'loss/train': 1.5183273553848267} 11/07/2021 02:27:32 - INFO - __main__ - Step 37127: {'lr': 0.00043366705686068895, 'samples': 7128384, 'steps': 37126, 'loss/train': 1.2544848918914795} 11/07/2021 02:27:33 - INFO - __main__ - Step 37128: {'lr': 0.0004336634565830049, 'samples': 7128576, 'steps': 37127, 'loss/train': 1.8652119636535645} 11/07/2021 02:27:33 - INFO - __main__ - Step 37129: {'lr': 0.0004336598562225649, 'samples': 7128768, 'steps': 37128, 'loss/train': 1.3766703605651855} 11/07/2021 02:27:34 - INFO - __main__ - Step 37130: {'lr': 0.00043365625577937065, 'samples': 7128960, 'steps': 37129, 'loss/train': 1.6950541734695435} 11/07/2021 02:27:34 - INFO - __main__ - Step 37131: {'lr': 0.00043365265525342365, 'samples': 7129152, 'steps': 37130, 'loss/train': 1.2819048166275024} 11/07/2021 02:27:34 - INFO - __main__ - Step 37132: {'lr': 0.00043364905464472563, 'samples': 7129344, 'steps': 37131, 'loss/train': 1.628318428993225} 11/07/2021 02:27:35 - INFO - __main__ - Step 37133: {'lr': 0.0004336454539532782, 'samples': 7129536, 'steps': 37132, 'loss/train': 5.755924701690674} 11/07/2021 02:27:36 - INFO - __main__ - Step 37134: {'lr': 0.00043364185317908296, 'samples': 7129728, 'steps': 37133, 'loss/train': 1.2615383863449097} 11/07/2021 02:27:36 - INFO - __main__ - Step 37135: {'lr': 0.0004336382523221415, 'samples': 7129920, 'steps': 37134, 'loss/train': 1.1689530611038208} 11/07/2021 02:27:36 - INFO - __main__ - Step 37136: {'lr': 0.0004336346513824555, 'samples': 7130112, 'steps': 37135, 'loss/train': 1.9664125442504883} 11/07/2021 02:27:37 - INFO - __main__ - Step 37137: {'lr': 0.0004336310503600266, 'samples': 7130304, 'steps': 37136, 'loss/train': 1.0943663120269775} 11/07/2021 02:27:37 - INFO - __main__ - Step 37138: {'lr': 0.0004336274492548563, 'samples': 7130496, 'steps': 37137, 'loss/train': 1.752846598625183} 11/07/2021 02:27:38 - INFO - __main__ - Step 37139: {'lr': 0.0004336238480669463, 'samples': 7130688, 'steps': 37138, 'loss/train': 1.6972535848617554} 11/07/2021 02:27:39 - INFO - __main__ - Step 37140: {'lr': 0.0004336202467962983, 'samples': 7130880, 'steps': 37139, 'loss/train': 1.4095814228057861} 11/07/2021 02:27:39 - INFO - __main__ - Step 37141: {'lr': 0.0004336166454429139, 'samples': 7131072, 'steps': 37140, 'loss/train': 1.2461761236190796} 11/07/2021 02:27:39 - INFO - __main__ - Step 37142: {'lr': 0.0004336130440067946, 'samples': 7131264, 'steps': 37141, 'loss/train': 1.0689070224761963} 11/07/2021 02:27:40 - INFO - __main__ - Step 37143: {'lr': 0.000433609442487942, 'samples': 7131456, 'steps': 37142, 'loss/train': 1.5266462564468384} 11/07/2021 02:27:41 - INFO - __main__ - Step 37144: {'lr': 0.00043360584088635804, 'samples': 7131648, 'steps': 37143, 'loss/train': 1.4167495965957642} 11/07/2021 02:27:41 - INFO - __main__ - Step 37145: {'lr': 0.0004336022392020439, 'samples': 7131840, 'steps': 37144, 'loss/train': 1.4522278308868408} 11/07/2021 02:27:41 - INFO - __main__ - Step 37146: {'lr': 0.0004335986374350015, 'samples': 7132032, 'steps': 37145, 'loss/train': 1.5540838241577148} 11/07/2021 02:27:42 - INFO - __main__ - Step 37147: {'lr': 0.00043359503558523246, 'samples': 7132224, 'steps': 37146, 'loss/train': 1.2634097337722778} 11/07/2021 02:27:42 - INFO - __main__ - Step 37148: {'lr': 0.0004335914336527382, 'samples': 7132416, 'steps': 37147, 'loss/train': 1.746785044670105} 11/07/2021 02:27:43 - INFO - __main__ - Step 37149: {'lr': 0.0004335878316375206, 'samples': 7132608, 'steps': 37148, 'loss/train': 1.4561712741851807} 11/07/2021 02:27:43 - INFO - __main__ - Step 37150: {'lr': 0.0004335842295395811, 'samples': 7132800, 'steps': 37149, 'loss/train': 0.9840590953826904} 11/07/2021 02:27:44 - INFO - __main__ - Step 37151: {'lr': 0.0004335806273589214, 'samples': 7132992, 'steps': 37150, 'loss/train': 1.2678557634353638} 11/07/2021 02:27:44 - INFO - __main__ - Step 37152: {'lr': 0.0004335770250955431, 'samples': 7133184, 'steps': 37151, 'loss/train': 1.4049510955810547} 11/07/2021 02:27:45 - INFO - __main__ - Step 37153: {'lr': 0.0004335734227494478, 'samples': 7133376, 'steps': 37152, 'loss/train': 1.6425676345825195} 11/07/2021 02:27:46 - INFO - __main__ - Step 37154: {'lr': 0.0004335698203206372, 'samples': 7133568, 'steps': 37153, 'loss/train': 1.8100358247756958} 11/07/2021 02:27:46 - INFO - __main__ - Step 37155: {'lr': 0.00043356621780911273, 'samples': 7133760, 'steps': 37154, 'loss/train': 1.6426031589508057} 11/07/2021 02:27:46 - INFO - __main__ - Step 37156: {'lr': 0.0004335626152148763, 'samples': 7133952, 'steps': 37155, 'loss/train': 1.9970622062683105} 11/07/2021 02:27:47 - INFO - __main__ - Step 37157: {'lr': 0.0004335590125379293, 'samples': 7134144, 'steps': 37156, 'loss/train': 1.66996169090271} 11/07/2021 02:27:47 - INFO - __main__ - Step 37158: {'lr': 0.00043355540977827356, 'samples': 7134336, 'steps': 37157, 'loss/train': 1.1085461378097534} 11/07/2021 02:27:48 - INFO - __main__ - Step 37159: {'lr': 0.0004335518069359105, 'samples': 7134528, 'steps': 37158, 'loss/train': 0.21571595966815948} 11/07/2021 02:27:48 - INFO - __main__ - Step 37160: {'lr': 0.0004335482040108418, 'samples': 7134720, 'steps': 37159, 'loss/train': 1.7107292413711548} 11/07/2021 02:27:49 - INFO - __main__ - Step 37161: {'lr': 0.00043354460100306915, 'samples': 7134912, 'steps': 37160, 'loss/train': 1.9382342100143433} 11/07/2021 02:27:49 - INFO - __main__ - Step 37162: {'lr': 0.00043354099791259414, 'samples': 7135104, 'steps': 37161, 'loss/train': 0.9119004011154175} 11/07/2021 02:27:49 - INFO - __main__ - Step 37163: {'lr': 0.00043353739473941846, 'samples': 7135296, 'steps': 37162, 'loss/train': 1.6841670274734497} 11/07/2021 02:27:50 - INFO - __main__ - Step 37164: {'lr': 0.0004335337914835435, 'samples': 7135488, 'steps': 37163, 'loss/train': 1.644423484802246} 11/07/2021 02:27:51 - INFO - __main__ - Step 37165: {'lr': 0.0004335301881449711, 'samples': 7135680, 'steps': 37164, 'loss/train': 1.5664054155349731} 11/07/2021 02:27:51 - INFO - __main__ - Step 37166: {'lr': 0.00043352658472370294, 'samples': 7135872, 'steps': 37165, 'loss/train': 1.2007473707199097} 11/07/2021 02:27:51 - INFO - __main__ - Step 37167: {'lr': 0.00043352298121974043, 'samples': 7136064, 'steps': 37166, 'loss/train': 1.7131773233413696} 11/07/2021 02:27:52 - INFO - __main__ - Step 37168: {'lr': 0.00043351937763308533, 'samples': 7136256, 'steps': 37167, 'loss/train': 1.607792854309082} 11/07/2021 02:27:52 - INFO - __main__ - Step 37169: {'lr': 0.0004335157739637392, 'samples': 7136448, 'steps': 37168, 'loss/train': 1.5750174522399902} 11/07/2021 02:27:53 - INFO - __main__ - Step 37170: {'lr': 0.0004335121702117038, 'samples': 7136640, 'steps': 37169, 'loss/train': 1.431115746498108} 11/07/2021 02:27:54 - INFO - __main__ - Step 37171: {'lr': 0.0004335085663769805, 'samples': 7136832, 'steps': 37170, 'loss/train': 1.4753409624099731} 11/07/2021 02:27:54 - INFO - __main__ - Step 37172: {'lr': 0.00043350496245957116, 'samples': 7137024, 'steps': 37171, 'loss/train': 1.4689682722091675} 11/07/2021 02:27:54 - INFO - __main__ - Step 37173: {'lr': 0.00043350135845947725, 'samples': 7137216, 'steps': 37172, 'loss/train': 0.965884804725647} 11/07/2021 02:27:55 - INFO - __main__ - Step 37174: {'lr': 0.00043349775437670046, 'samples': 7137408, 'steps': 37173, 'loss/train': 0.5817450881004333} 11/07/2021 02:27:56 - INFO - __main__ - Step 37175: {'lr': 0.0004334941502112425, 'samples': 7137600, 'steps': 37174, 'loss/train': 1.1755576133728027} 11/07/2021 02:27:56 - INFO - __main__ - Step 37176: {'lr': 0.0004334905459631049, 'samples': 7137792, 'steps': 37175, 'loss/train': 1.4950543642044067} 11/07/2021 02:27:56 - INFO - __main__ - Step 37177: {'lr': 0.0004334869416322892, 'samples': 7137984, 'steps': 37176, 'loss/train': 1.5821439027786255} 11/07/2021 02:27:57 - INFO - __main__ - Step 37178: {'lr': 0.0004334833372187972, 'samples': 7138176, 'steps': 37177, 'loss/train': 1.3863235712051392} 11/07/2021 02:27:57 - INFO - __main__ - Step 37179: {'lr': 0.0004334797327226304, 'samples': 7138368, 'steps': 37178, 'loss/train': 1.3994529247283936} 11/07/2021 02:27:58 - INFO - __main__ - Step 37180: {'lr': 0.00043347612814379047, 'samples': 7138560, 'steps': 37179, 'loss/train': 1.3833494186401367} 11/07/2021 02:27:59 - INFO - __main__ - Step 37181: {'lr': 0.000433472523482279, 'samples': 7138752, 'steps': 37180, 'loss/train': 0.6981086134910583} 11/07/2021 02:27:59 - INFO - __main__ - Step 37182: {'lr': 0.0004334689187380977, 'samples': 7138944, 'steps': 37181, 'loss/train': 1.9053527116775513} 11/07/2021 02:27:59 - INFO - __main__ - Step 37183: {'lr': 0.0004334653139112481, 'samples': 7139136, 'steps': 37182, 'loss/train': 1.3545410633087158} 11/07/2021 02:28:00 - INFO - __main__ - Step 37184: {'lr': 0.0004334617090017319, 'samples': 7139328, 'steps': 37183, 'loss/train': 1.5792773962020874} 11/07/2021 02:28:00 - INFO - __main__ - Step 37185: {'lr': 0.0004334581040095506, 'samples': 7139520, 'steps': 37184, 'loss/train': 1.7746398448944092} 11/07/2021 02:28:01 - INFO - __main__ - Step 37186: {'lr': 0.00043345449893470594, 'samples': 7139712, 'steps': 37185, 'loss/train': 1.6468799114227295} 11/07/2021 02:28:02 - INFO - __main__ - Step 37187: {'lr': 0.00043345089377719954, 'samples': 7139904, 'steps': 37186, 'loss/train': 1.2727960348129272} 11/07/2021 02:28:02 - INFO - __main__ - Step 37188: {'lr': 0.00043344728853703297, 'samples': 7140096, 'steps': 37187, 'loss/train': 1.5978360176086426} 11/07/2021 02:28:02 - INFO - __main__ - Step 37189: {'lr': 0.0004334436832142079, 'samples': 7140288, 'steps': 37188, 'loss/train': 1.308510661125183} 11/07/2021 02:28:03 - INFO - __main__ - Step 37190: {'lr': 0.000433440077808726, 'samples': 7140480, 'steps': 37189, 'loss/train': 1.7172496318817139} 11/07/2021 02:28:03 - INFO - __main__ - Step 37191: {'lr': 0.00043343647232058877, 'samples': 7140672, 'steps': 37190, 'loss/train': 1.2515482902526855} 11/07/2021 02:28:04 - INFO - __main__ - Step 37192: {'lr': 0.0004334328667497979, 'samples': 7140864, 'steps': 37191, 'loss/train': 0.8863022923469543} 11/07/2021 02:28:04 - INFO - __main__ - Step 37193: {'lr': 0.00043342926109635497, 'samples': 7141056, 'steps': 37192, 'loss/train': 1.1954317092895508} 11/07/2021 02:28:05 - INFO - __main__ - Step 37194: {'lr': 0.0004334256553602617, 'samples': 7141248, 'steps': 37193, 'loss/train': 1.6151297092437744} 11/07/2021 02:28:05 - INFO - __main__ - Step 37195: {'lr': 0.00043342204954151963, 'samples': 7141440, 'steps': 37194, 'loss/train': 1.5363059043884277} 11/07/2021 02:28:05 - INFO - __main__ - Step 37196: {'lr': 0.00043341844364013047, 'samples': 7141632, 'steps': 37195, 'loss/train': 1.5967592000961304} 11/07/2021 02:28:07 - INFO - __main__ - Step 37197: {'lr': 0.00043341483765609566, 'samples': 7141824, 'steps': 37196, 'loss/train': 1.2228840589523315} 11/07/2021 02:28:07 - INFO - __main__ - Step 37198: {'lr': 0.0004334112315894171, 'samples': 7142016, 'steps': 37197, 'loss/train': 1.1674365997314453} 11/07/2021 02:28:07 - INFO - __main__ - Step 37199: {'lr': 0.00043340762544009627, 'samples': 7142208, 'steps': 37198, 'loss/train': 1.2941004037857056} 11/07/2021 02:28:08 - INFO - __main__ - Step 37200: {'lr': 0.0004334040192081347, 'samples': 7142400, 'steps': 37199, 'loss/train': 1.625350832939148} 11/07/2021 02:28:08 - INFO - __main__ - Step 37201: {'lr': 0.00043340041289353416, 'samples': 7142592, 'steps': 37200, 'loss/train': 1.4278935194015503} 11/07/2021 02:28:09 - INFO - __main__ - Step 37202: {'lr': 0.0004333968064962962, 'samples': 7142784, 'steps': 37201, 'loss/train': 1.6567054986953735} 11/07/2021 02:28:09 - INFO - __main__ - Step 37203: {'lr': 0.00043339320001642244, 'samples': 7142976, 'steps': 37202, 'loss/train': 1.620715618133545} 11/07/2021 02:28:10 - INFO - __main__ - Step 37204: {'lr': 0.0004333895934539146, 'samples': 7143168, 'steps': 37203, 'loss/train': 1.299027442932129} 11/07/2021 02:28:10 - INFO - __main__ - Step 37205: {'lr': 0.00043338598680877423, 'samples': 7143360, 'steps': 37204, 'loss/train': 1.6379092931747437} 11/07/2021 02:28:10 - INFO - __main__ - Step 37206: {'lr': 0.00043338238008100297, 'samples': 7143552, 'steps': 37205, 'loss/train': 1.3296124935150146} 11/07/2021 02:28:11 - INFO - __main__ - Step 37207: {'lr': 0.0004333787732706024, 'samples': 7143744, 'steps': 37206, 'loss/train': 1.4551992416381836} 11/07/2021 02:28:12 - INFO - __main__ - Step 37208: {'lr': 0.00043337516637757416, 'samples': 7143936, 'steps': 37207, 'loss/train': 1.1694921255111694} 11/07/2021 02:28:12 - INFO - __main__ - Step 37209: {'lr': 0.00043337155940191996, 'samples': 7144128, 'steps': 37208, 'loss/train': 1.6044751405715942} 11/07/2021 02:28:12 - INFO - __main__ - Step 37210: {'lr': 0.0004333679523436413, 'samples': 7144320, 'steps': 37209, 'loss/train': 1.1683112382888794} 11/07/2021 02:28:13 - INFO - __main__ - Step 37211: {'lr': 0.0004333643452027399, 'samples': 7144512, 'steps': 37210, 'loss/train': 0.2887013554573059} 11/07/2021 02:28:14 - INFO - __main__ - Step 37212: {'lr': 0.00043336073797921743, 'samples': 7144704, 'steps': 37211, 'loss/train': 1.025676965713501} 11/07/2021 02:28:14 - INFO - __main__ - Step 37213: {'lr': 0.0004333571306730754, 'samples': 7144896, 'steps': 37212, 'loss/train': 1.276005744934082} 11/07/2021 02:28:14 - INFO - __main__ - Step 37214: {'lr': 0.00043335352328431544, 'samples': 7145088, 'steps': 37213, 'loss/train': 1.3431577682495117} 11/07/2021 02:28:15 - INFO - __main__ - Step 37215: {'lr': 0.00043334991581293924, 'samples': 7145280, 'steps': 37214, 'loss/train': 1.2101293802261353} 11/07/2021 02:28:15 - INFO - __main__ - Step 37216: {'lr': 0.0004333463082589484, 'samples': 7145472, 'steps': 37215, 'loss/train': 1.5610136985778809} 11/07/2021 02:28:16 - INFO - __main__ - Step 37217: {'lr': 0.0004333427006223445, 'samples': 7145664, 'steps': 37216, 'loss/train': 1.2841063737869263} 11/07/2021 02:28:16 - INFO - __main__ - Step 37218: {'lr': 0.00043333909290312923, 'samples': 7145856, 'steps': 37217, 'loss/train': 1.0707043409347534} 11/07/2021 02:28:17 - INFO - __main__ - Step 37219: {'lr': 0.00043333548510130426, 'samples': 7146048, 'steps': 37218, 'loss/train': 1.5830148458480835} 11/07/2021 02:28:17 - INFO - __main__ - Step 37220: {'lr': 0.00043333187721687104, 'samples': 7146240, 'steps': 37219, 'loss/train': 1.8787128925323486} 11/07/2021 02:28:18 - INFO - __main__ - Step 37221: {'lr': 0.0004333282692498314, 'samples': 7146432, 'steps': 37220, 'loss/train': 2.0772974491119385} 11/07/2021 02:28:19 - INFO - __main__ - Step 37222: {'lr': 0.00043332466120018685, 'samples': 7146624, 'steps': 37221, 'loss/train': 1.8868778944015503} 11/07/2021 02:28:19 - INFO - __main__ - Step 37223: {'lr': 0.000433321053067939, 'samples': 7146816, 'steps': 37222, 'loss/train': 1.276110291481018} 11/07/2021 02:28:19 - INFO - __main__ - Step 37224: {'lr': 0.00043331744485308954, 'samples': 7147008, 'steps': 37223, 'loss/train': 1.0257227420806885} 11/07/2021 02:28:20 - INFO - __main__ - Step 37225: {'lr': 0.00043331383655564003, 'samples': 7147200, 'steps': 37224, 'loss/train': 1.7596057653427124} 11/07/2021 02:28:20 - INFO - __main__ - Step 37226: {'lr': 0.0004333102281755922, 'samples': 7147392, 'steps': 37225, 'loss/train': 1.1045256853103638} 11/07/2021 02:28:20 - INFO - __main__ - Step 37227: {'lr': 0.0004333066197129475, 'samples': 7147584, 'steps': 37226, 'loss/train': 1.3454959392547607} 11/07/2021 02:28:22 - INFO - __main__ - Step 37228: {'lr': 0.00043330301116770777, 'samples': 7147776, 'steps': 37227, 'loss/train': 1.0431512594223022} 11/07/2021 02:28:22 - INFO - __main__ - Step 37229: {'lr': 0.0004332994025398745, 'samples': 7147968, 'steps': 37228, 'loss/train': 1.4733880758285522} 11/07/2021 02:28:22 - INFO - __main__ - Step 37230: {'lr': 0.0004332957938294493, 'samples': 7148160, 'steps': 37229, 'loss/train': 1.5776143074035645} 11/07/2021 02:28:23 - INFO - __main__ - Step 37231: {'lr': 0.0004332921850364339, 'samples': 7148352, 'steps': 37230, 'loss/train': 0.6922283172607422} 11/07/2021 02:28:23 - INFO - __main__ - Step 37232: {'lr': 0.00043328857616082986, 'samples': 7148544, 'steps': 37231, 'loss/train': 1.354921579360962} 11/07/2021 02:28:24 - INFO - __main__ - Step 37233: {'lr': 0.0004332849672026388, 'samples': 7148736, 'steps': 37232, 'loss/train': 1.1492215394973755} 11/07/2021 02:28:24 - INFO - __main__ - Step 37234: {'lr': 0.0004332813581618624, 'samples': 7148928, 'steps': 37233, 'loss/train': 1.270585060119629} 11/07/2021 02:28:25 - INFO - __main__ - Step 37235: {'lr': 0.00043327774903850226, 'samples': 7149120, 'steps': 37234, 'loss/train': 1.6720829010009766} 11/07/2021 02:28:25 - INFO - __main__ - Step 37236: {'lr': 0.0004332741398325599, 'samples': 7149312, 'steps': 37235, 'loss/train': 1.7749427556991577} 11/07/2021 02:28:25 - INFO - __main__ - Step 37237: {'lr': 0.00043327053054403707, 'samples': 7149504, 'steps': 37236, 'loss/train': 1.336981177330017} 11/07/2021 02:28:26 - INFO - __main__ - Step 37238: {'lr': 0.0004332669211729354, 'samples': 7149696, 'steps': 37237, 'loss/train': 1.2597352266311646} 11/07/2021 02:28:27 - INFO - __main__ - Step 37239: {'lr': 0.00043326331171925656, 'samples': 7149888, 'steps': 37238, 'loss/train': 0.9623879194259644} 11/07/2021 02:28:27 - INFO - __main__ - Step 37240: {'lr': 0.000433259702183002, 'samples': 7150080, 'steps': 37239, 'loss/train': 1.4372882843017578} 11/07/2021 02:28:27 - INFO - __main__ - Step 37241: {'lr': 0.0004332560925641734, 'samples': 7150272, 'steps': 37240, 'loss/train': 1.8603291511535645} 11/07/2021 02:28:28 - INFO - __main__ - Step 37242: {'lr': 0.0004332524828627725, 'samples': 7150464, 'steps': 37241, 'loss/train': 1.3960036039352417} 11/07/2021 02:28:29 - INFO - __main__ - Step 37243: {'lr': 0.0004332488730788009, 'samples': 7150656, 'steps': 37242, 'loss/train': 1.51231050491333} 11/07/2021 02:28:29 - INFO - __main__ - Step 37244: {'lr': 0.0004332452632122601, 'samples': 7150848, 'steps': 37243, 'loss/train': 1.967123031616211} 11/07/2021 02:28:29 - INFO - __main__ - Step 37245: {'lr': 0.0004332416532631519, 'samples': 7151040, 'steps': 37244, 'loss/train': 1.2595200538635254} 11/07/2021 02:28:30 - INFO - __main__ - Step 37246: {'lr': 0.00043323804323147777, 'samples': 7151232, 'steps': 37245, 'loss/train': 0.15449711680412292} 11/07/2021 02:28:30 - INFO - __main__ - Step 37247: {'lr': 0.0004332344331172394, 'samples': 7151424, 'steps': 37246, 'loss/train': 1.4898953437805176} 11/07/2021 02:28:31 - INFO - __main__ - Step 37248: {'lr': 0.0004332308229204385, 'samples': 7151616, 'steps': 37247, 'loss/train': 1.775602102279663} 11/07/2021 02:28:32 - INFO - __main__ - Step 37249: {'lr': 0.00043322721264107657, 'samples': 7151808, 'steps': 37248, 'loss/train': 1.1206448078155518} 11/07/2021 02:28:32 - INFO - __main__ - Step 37250: {'lr': 0.00043322360227915526, 'samples': 7152000, 'steps': 37249, 'loss/train': 1.5136470794677734} 11/07/2021 02:28:32 - INFO - __main__ - Step 37251: {'lr': 0.0004332199918346763, 'samples': 7152192, 'steps': 37250, 'loss/train': 1.70261549949646} 11/07/2021 02:28:33 - INFO - __main__ - Step 37252: {'lr': 0.00043321638130764116, 'samples': 7152384, 'steps': 37251, 'loss/train': 1.5859137773513794} 11/07/2021 02:28:33 - INFO - __main__ - Step 37253: {'lr': 0.00043321277069805153, 'samples': 7152576, 'steps': 37252, 'loss/train': 1.2263541221618652} 11/07/2021 02:28:34 - INFO - __main__ - Step 37254: {'lr': 0.0004332091600059091, 'samples': 7152768, 'steps': 37253, 'loss/train': 0.814882755279541} 11/07/2021 02:28:35 - INFO - __main__ - Step 37255: {'lr': 0.00043320554923121545, 'samples': 7152960, 'steps': 37254, 'loss/train': 1.5587509870529175} 11/07/2021 02:28:35 - INFO - __main__ - Step 37256: {'lr': 0.0004332019383739722, 'samples': 7153152, 'steps': 37255, 'loss/train': 1.4879378080368042} 11/07/2021 02:28:35 - INFO - __main__ - Step 37257: {'lr': 0.000433198327434181, 'samples': 7153344, 'steps': 37256, 'loss/train': 1.7939229011535645} 11/07/2021 02:28:36 - INFO - __main__ - Step 37258: {'lr': 0.0004331947164118434, 'samples': 7153536, 'steps': 37257, 'loss/train': 1.3613734245300293} 11/07/2021 02:28:37 - INFO - __main__ - Step 37259: {'lr': 0.00043319110530696116, 'samples': 7153728, 'steps': 37258, 'loss/train': 1.498550534248352} 11/07/2021 02:28:37 - INFO - __main__ - Step 37260: {'lr': 0.00043318749411953584, 'samples': 7153920, 'steps': 37259, 'loss/train': 1.7559316158294678} 11/07/2021 02:28:37 - INFO - __main__ - Step 37261: {'lr': 0.000433183882849569, 'samples': 7154112, 'steps': 37260, 'loss/train': 1.7043496370315552} 11/07/2021 02:28:38 - INFO - __main__ - Step 37262: {'lr': 0.0004331802714970624, 'samples': 7154304, 'steps': 37261, 'loss/train': 1.487119197845459} 11/07/2021 02:28:38 - INFO - __main__ - Step 37263: {'lr': 0.0004331766600620175, 'samples': 7154496, 'steps': 37262, 'loss/train': 1.4115322828292847} 11/07/2021 02:28:39 - INFO - __main__ - Step 37264: {'lr': 0.00043317304854443607, 'samples': 7154688, 'steps': 37263, 'loss/train': 1.5334402322769165} 11/07/2021 02:28:40 - INFO - __main__ - Step 37265: {'lr': 0.0004331694369443197, 'samples': 7154880, 'steps': 37264, 'loss/train': 1.5968416929244995} 11/07/2021 02:28:40 - INFO - __main__ - Step 37266: {'lr': 0.00043316582526167004, 'samples': 7155072, 'steps': 37265, 'loss/train': 1.634678602218628} 11/07/2021 02:28:40 - INFO - __main__ - Step 37267: {'lr': 0.0004331622134964887, 'samples': 7155264, 'steps': 37266, 'loss/train': 1.4934666156768799} 11/07/2021 02:28:41 - INFO - __main__ - Step 37268: {'lr': 0.0004331586016487772, 'samples': 7155456, 'steps': 37267, 'loss/train': 1.3731762170791626} 11/07/2021 02:28:42 - INFO - __main__ - Step 37269: {'lr': 0.00043315498971853726, 'samples': 7155648, 'steps': 37268, 'loss/train': 1.780970573425293} 11/07/2021 02:28:42 - INFO - __main__ - Step 37270: {'lr': 0.0004331513777057706, 'samples': 7155840, 'steps': 37269, 'loss/train': 1.3365075588226318} 11/07/2021 02:28:42 - INFO - __main__ - Step 37271: {'lr': 0.00043314776561047865, 'samples': 7156032, 'steps': 37270, 'loss/train': 1.6340923309326172} 11/07/2021 02:28:43 - INFO - __main__ - Step 37272: {'lr': 0.0004331441534326632, 'samples': 7156224, 'steps': 37271, 'loss/train': 1.6194738149642944} 11/07/2021 02:28:43 - INFO - __main__ - Step 37273: {'lr': 0.0004331405411723258, 'samples': 7156416, 'steps': 37272, 'loss/train': 1.3026750087738037} 11/07/2021 02:28:44 - INFO - __main__ - Step 37274: {'lr': 0.0004331369288294681, 'samples': 7156608, 'steps': 37273, 'loss/train': 1.0709071159362793} 11/07/2021 02:28:44 - INFO - __main__ - Step 37275: {'lr': 0.0004331333164040918, 'samples': 7156800, 'steps': 37274, 'loss/train': 1.5334340333938599} 11/07/2021 02:28:45 - INFO - __main__ - Step 37276: {'lr': 0.0004331297038961984, 'samples': 7156992, 'steps': 37275, 'loss/train': 1.5693700313568115} 11/07/2021 02:28:45 - INFO - __main__ - Step 37277: {'lr': 0.00043312609130578963, 'samples': 7157184, 'steps': 37276, 'loss/train': 1.5914875268936157} 11/07/2021 02:28:45 - INFO - __main__ - Step 37278: {'lr': 0.000433122478632867, 'samples': 7157376, 'steps': 37277, 'loss/train': 0.8798840641975403} 11/07/2021 02:28:46 - INFO - __main__ - Step 37279: {'lr': 0.0004331188658774322, 'samples': 7157568, 'steps': 37278, 'loss/train': 1.3093528747558594} 11/07/2021 02:28:47 - INFO - __main__ - Step 37280: {'lr': 0.00043311525303948685, 'samples': 7157760, 'steps': 37279, 'loss/train': 1.349352478981018} 11/07/2021 02:28:47 - INFO - __main__ - Step 37281: {'lr': 0.0004331116401190327, 'samples': 7157952, 'steps': 37280, 'loss/train': 1.670189380645752} 11/07/2021 02:28:47 - INFO - __main__ - Step 37282: {'lr': 0.0004331080271160712, 'samples': 7158144, 'steps': 37281, 'loss/train': 1.5440136194229126} 11/07/2021 02:28:48 - INFO - __main__ - Step 37283: {'lr': 0.00043310441403060404, 'samples': 7158336, 'steps': 37282, 'loss/train': 1.0048491954803467} 11/07/2021 02:28:48 - INFO - __main__ - Step 37284: {'lr': 0.00043310080086263284, 'samples': 7158528, 'steps': 37283, 'loss/train': 0.912543535232544} 11/07/2021 02:28:49 - INFO - __main__ - Step 37285: {'lr': 0.0004330971876121593, 'samples': 7158720, 'steps': 37284, 'loss/train': 1.51612389087677} 11/07/2021 02:28:50 - INFO - __main__ - Step 37286: {'lr': 0.0004330935742791849, 'samples': 7158912, 'steps': 37285, 'loss/train': 1.4151886701583862} 11/07/2021 02:28:50 - INFO - __main__ - Step 37287: {'lr': 0.00043308996086371146, 'samples': 7159104, 'steps': 37286, 'loss/train': 0.7171698212623596} 11/07/2021 02:28:50 - INFO - __main__ - Step 37288: {'lr': 0.0004330863473657405, 'samples': 7159296, 'steps': 37287, 'loss/train': 1.3389252424240112} 11/07/2021 02:28:51 - INFO - __main__ - Step 37289: {'lr': 0.00043308273378527364, 'samples': 7159488, 'steps': 37288, 'loss/train': 1.61613929271698} 11/07/2021 02:28:52 - INFO - __main__ - Step 37290: {'lr': 0.00043307912012231255, 'samples': 7159680, 'steps': 37289, 'loss/train': 1.9408761262893677} 11/07/2021 02:28:52 - INFO - __main__ - Step 37291: {'lr': 0.0004330755063768588, 'samples': 7159872, 'steps': 37290, 'loss/train': 1.1934666633605957} 11/07/2021 02:28:52 - INFO - __main__ - Step 37292: {'lr': 0.000433071892548914, 'samples': 7160064, 'steps': 37291, 'loss/train': 1.5802807807922363} 11/07/2021 02:28:53 - INFO - __main__ - Step 37293: {'lr': 0.00043306827863847985, 'samples': 7160256, 'steps': 37292, 'loss/train': 1.3688923120498657} 11/07/2021 02:28:53 - INFO - __main__ - Step 37294: {'lr': 0.00043306466464555803, 'samples': 7160448, 'steps': 37293, 'loss/train': 1.7352780103683472} 11/07/2021 02:28:54 - INFO - __main__ - Step 37295: {'lr': 0.0004330610505701501, 'samples': 7160640, 'steps': 37294, 'loss/train': 1.8678953647613525} 11/07/2021 02:28:55 - INFO - __main__ - Step 37296: {'lr': 0.00043305743641225766, 'samples': 7160832, 'steps': 37295, 'loss/train': 1.132670283317566} 11/07/2021 02:28:55 - INFO - __main__ - Step 37297: {'lr': 0.00043305382217188225, 'samples': 7161024, 'steps': 37296, 'loss/train': 1.6592589616775513} 11/07/2021 02:28:55 - INFO - __main__ - Step 37298: {'lr': 0.0004330502078490258, 'samples': 7161216, 'steps': 37297, 'loss/train': 1.2880827188491821} 11/07/2021 02:28:56 - INFO - __main__ - Step 37299: {'lr': 0.0004330465934436896, 'samples': 7161408, 'steps': 37298, 'loss/train': 1.9087271690368652} 11/07/2021 02:28:56 - INFO - __main__ - Step 37300: {'lr': 0.00043304297895587553, 'samples': 7161600, 'steps': 37299, 'loss/train': 1.2395339012145996} 11/07/2021 02:28:57 - INFO - __main__ - Step 37301: {'lr': 0.0004330393643855851, 'samples': 7161792, 'steps': 37300, 'loss/train': 1.3079183101654053} 11/07/2021 02:28:57 - INFO - __main__ - Step 37302: {'lr': 0.0004330357497328199, 'samples': 7161984, 'steps': 37301, 'loss/train': 2.2721056938171387} 11/07/2021 02:28:58 - INFO - __main__ - Step 37303: {'lr': 0.00043303213499758166, 'samples': 7162176, 'steps': 37302, 'loss/train': 1.6880961656570435} 11/07/2021 02:28:58 - INFO - __main__ - Step 37304: {'lr': 0.00043302852017987196, 'samples': 7162368, 'steps': 37303, 'loss/train': 1.1895135641098022} 11/07/2021 02:28:58 - INFO - __main__ - Step 37305: {'lr': 0.0004330249052796924, 'samples': 7162560, 'steps': 37304, 'loss/train': 1.7214696407318115} 11/07/2021 02:28:59 - INFO - __main__ - Step 37306: {'lr': 0.0004330212902970447, 'samples': 7162752, 'steps': 37305, 'loss/train': 1.4084619283676147} 11/07/2021 02:29:00 - INFO - __main__ - Step 37307: {'lr': 0.0004330176752319304, 'samples': 7162944, 'steps': 37306, 'loss/train': 1.3558456897735596} 11/07/2021 02:29:00 - INFO - __main__ - Step 37308: {'lr': 0.0004330140600843512, 'samples': 7163136, 'steps': 37307, 'loss/train': 1.9286340475082397} 11/07/2021 02:29:00 - INFO - __main__ - Step 37309: {'lr': 0.0004330104448543086, 'samples': 7163328, 'steps': 37308, 'loss/train': 1.5556902885437012} 11/07/2021 02:29:01 - INFO - __main__ - Step 37310: {'lr': 0.0004330068295418044, 'samples': 7163520, 'steps': 37309, 'loss/train': 1.5030561685562134} 11/07/2021 02:29:02 - INFO - __main__ - Step 37311: {'lr': 0.0004330032141468401, 'samples': 7163712, 'steps': 37310, 'loss/train': 1.6441051959991455} 11/07/2021 02:29:02 - INFO - __main__ - Step 37312: {'lr': 0.0004329995986694174, 'samples': 7163904, 'steps': 37311, 'loss/train': 1.1074079275131226} 11/07/2021 02:29:02 - INFO - __main__ - Step 37313: {'lr': 0.00043299598310953793, 'samples': 7164096, 'steps': 37312, 'loss/train': 1.6179929971694946} 11/07/2021 02:29:03 - INFO - __main__ - Step 37314: {'lr': 0.0004329923674672032, 'samples': 7164288, 'steps': 37313, 'loss/train': 1.4974441528320312} 11/07/2021 02:29:03 - INFO - __main__ - Step 37315: {'lr': 0.00043298875174241504, 'samples': 7164480, 'steps': 37314, 'loss/train': 0.9005528092384338} 11/07/2021 02:29:04 - INFO - __main__ - Step 37316: {'lr': 0.00043298513593517483, 'samples': 7164672, 'steps': 37315, 'loss/train': 1.9145193099975586} 11/07/2021 02:29:04 - INFO - __main__ - Step 37317: {'lr': 0.0004329815200454845, 'samples': 7164864, 'steps': 37316, 'loss/train': 1.3091903924942017} 11/07/2021 02:29:05 - INFO - __main__ - Step 37318: {'lr': 0.00043297790407334545, 'samples': 7165056, 'steps': 37317, 'loss/train': 1.0597180128097534} 11/07/2021 02:29:05 - INFO - __main__ - Step 37319: {'lr': 0.0004329742880187594, 'samples': 7165248, 'steps': 37318, 'loss/train': 1.1523969173431396} 11/07/2021 02:29:06 - INFO - __main__ - Step 37320: {'lr': 0.0004329706718817279, 'samples': 7165440, 'steps': 37319, 'loss/train': 1.1664011478424072} 11/07/2021 02:29:07 - INFO - __main__ - Step 37321: {'lr': 0.00043296705566225267, 'samples': 7165632, 'steps': 37320, 'loss/train': 1.5477774143218994} 11/07/2021 02:29:07 - INFO - __main__ - Step 37322: {'lr': 0.00043296343936033535, 'samples': 7165824, 'steps': 37321, 'loss/train': 1.6472041606903076} 11/07/2021 02:29:07 - INFO - __main__ - Step 37323: {'lr': 0.0004329598229759775, 'samples': 7166016, 'steps': 37322, 'loss/train': 1.7595402002334595} 11/07/2021 02:29:08 - INFO - __main__ - Step 37324: {'lr': 0.00043295620650918076, 'samples': 7166208, 'steps': 37323, 'loss/train': 1.29658043384552} 11/07/2021 02:29:08 - INFO - __main__ - Step 37325: {'lr': 0.0004329525899599468, 'samples': 7166400, 'steps': 37324, 'loss/train': 1.2506625652313232} 11/07/2021 02:29:09 - INFO - __main__ - Step 37326: {'lr': 0.0004329489733282772, 'samples': 7166592, 'steps': 37325, 'loss/train': 1.4976551532745361} 11/07/2021 02:29:09 - INFO - __main__ - Step 37327: {'lr': 0.0004329453566141737, 'samples': 7166784, 'steps': 37326, 'loss/train': 1.637865424156189} 11/07/2021 02:29:10 - INFO - __main__ - Step 37328: {'lr': 0.00043294173981763776, 'samples': 7166976, 'steps': 37327, 'loss/train': 0.9973399639129639} 11/07/2021 02:29:10 - INFO - __main__ - Step 37329: {'lr': 0.00043293812293867113, 'samples': 7167168, 'steps': 37328, 'loss/train': 0.9981698393821716} 11/07/2021 02:29:10 - INFO - __main__ - Step 37330: {'lr': 0.0004329345059772754, 'samples': 7167360, 'steps': 37329, 'loss/train': 1.8823930025100708} 11/07/2021 02:29:11 - INFO - __main__ - Step 37331: {'lr': 0.0004329308889334522, 'samples': 7167552, 'steps': 37330, 'loss/train': 1.1413021087646484} 11/07/2021 02:29:12 - INFO - __main__ - Step 37332: {'lr': 0.00043292727180720315, 'samples': 7167744, 'steps': 37331, 'loss/train': 1.2384685277938843} 11/07/2021 02:29:12 - INFO - __main__ - Step 37333: {'lr': 0.0004329236545985299, 'samples': 7167936, 'steps': 37332, 'loss/train': 1.3601319789886475} 11/07/2021 02:29:12 - INFO - __main__ - Step 37334: {'lr': 0.000432920037307434, 'samples': 7168128, 'steps': 37333, 'loss/train': 1.5531187057495117} 11/07/2021 02:29:13 - INFO - __main__ - Step 37335: {'lr': 0.00043291641993391727, 'samples': 7168320, 'steps': 37334, 'loss/train': 1.5530239343643188} 11/07/2021 02:29:13 - INFO - __main__ - Step 37336: {'lr': 0.0004329128024779812, 'samples': 7168512, 'steps': 37335, 'loss/train': 1.7532144784927368} 11/07/2021 02:29:14 - INFO - __main__ - Step 37337: {'lr': 0.0004329091849396274, 'samples': 7168704, 'steps': 37336, 'loss/train': 1.3057897090911865} 11/07/2021 02:29:15 - INFO - __main__ - Step 37338: {'lr': 0.00043290556731885756, 'samples': 7168896, 'steps': 37337, 'loss/train': 2.0908994674682617} 11/07/2021 02:29:15 - INFO - __main__ - Step 37339: {'lr': 0.0004329019496156733, 'samples': 7169088, 'steps': 37338, 'loss/train': 1.0564286708831787} 11/07/2021 02:29:15 - INFO - __main__ - Step 37340: {'lr': 0.0004328983318300763, 'samples': 7169280, 'steps': 37339, 'loss/train': 1.6685435771942139} 11/07/2021 02:29:16 - INFO - __main__ - Step 37341: {'lr': 0.00043289471396206803, 'samples': 7169472, 'steps': 37340, 'loss/train': 1.920317530632019} 11/07/2021 02:29:17 - INFO - __main__ - Step 37342: {'lr': 0.0004328910960116503, 'samples': 7169664, 'steps': 37341, 'loss/train': 1.225765347480774} 11/07/2021 02:29:17 - INFO - __main__ - Step 37343: {'lr': 0.00043288747797882467, 'samples': 7169856, 'steps': 37342, 'loss/train': 1.519812822341919} 11/07/2021 02:29:17 - INFO - __main__ - Step 37344: {'lr': 0.00043288385986359266, 'samples': 7170048, 'steps': 37343, 'loss/train': 1.6947556734085083} 11/07/2021 02:29:18 - INFO - __main__ - Step 37345: {'lr': 0.00043288024166595614, 'samples': 7170240, 'steps': 37344, 'loss/train': 1.7099330425262451} 11/07/2021 02:29:18 - INFO - __main__ - Step 37346: {'lr': 0.00043287662338591657, 'samples': 7170432, 'steps': 37345, 'loss/train': 1.2091368436813354} 11/07/2021 02:29:19 - INFO - __main__ - Step 37347: {'lr': 0.0004328730050234756, 'samples': 7170624, 'steps': 37346, 'loss/train': 1.602807641029358} 11/07/2021 02:29:20 - INFO - __main__ - Step 37348: {'lr': 0.00043286938657863483, 'samples': 7170816, 'steps': 37347, 'loss/train': 1.6188071966171265} 11/07/2021 02:29:20 - INFO - __main__ - Step 37349: {'lr': 0.00043286576805139597, 'samples': 7171008, 'steps': 37348, 'loss/train': 1.606528639793396} 11/07/2021 02:29:20 - INFO - __main__ - Step 37350: {'lr': 0.0004328621494417606, 'samples': 7171200, 'steps': 37349, 'loss/train': 1.2447718381881714} 11/07/2021 02:29:21 - INFO - __main__ - Step 37351: {'lr': 0.0004328585307497304, 'samples': 7171392, 'steps': 37350, 'loss/train': 0.8961560726165771} 11/07/2021 02:29:21 - INFO - __main__ - Step 37352: {'lr': 0.00043285491197530694, 'samples': 7171584, 'steps': 37351, 'loss/train': 0.8640336394309998} 11/07/2021 02:29:22 - INFO - __main__ - Step 37353: {'lr': 0.00043285129311849193, 'samples': 7171776, 'steps': 37352, 'loss/train': 1.5787489414215088} 11/07/2021 02:29:22 - INFO - __main__ - Step 37354: {'lr': 0.0004328476741792869, 'samples': 7171968, 'steps': 37353, 'loss/train': 0.7472124695777893} 11/07/2021 02:29:23 - INFO - __main__ - Step 37355: {'lr': 0.00043284405515769356, 'samples': 7172160, 'steps': 37354, 'loss/train': 1.5969117879867554} 11/07/2021 02:29:23 - INFO - __main__ - Step 37356: {'lr': 0.00043284043605371346, 'samples': 7172352, 'steps': 37355, 'loss/train': 1.0829873085021973} 11/07/2021 02:29:23 - INFO - __main__ - Step 37357: {'lr': 0.0004328368168673483, 'samples': 7172544, 'steps': 37356, 'loss/train': 1.5583910942077637} 11/07/2021 02:29:24 - INFO - __main__ - Step 37358: {'lr': 0.00043283319759859974, 'samples': 7172736, 'steps': 37357, 'loss/train': 1.3432074785232544} 11/07/2021 02:29:25 - INFO - __main__ - Step 37359: {'lr': 0.0004328295782474693, 'samples': 7172928, 'steps': 37358, 'loss/train': 1.6358716487884521} 11/07/2021 02:29:25 - INFO - __main__ - Step 37360: {'lr': 0.0004328259588139587, 'samples': 7173120, 'steps': 37359, 'loss/train': 1.6624610424041748} 11/07/2021 02:29:25 - INFO - __main__ - Step 37361: {'lr': 0.0004328223392980696, 'samples': 7173312, 'steps': 37360, 'loss/train': 1.577709436416626} 11/07/2021 02:29:26 - INFO - __main__ - Step 37362: {'lr': 0.00043281871969980346, 'samples': 7173504, 'steps': 37361, 'loss/train': 1.5553466081619263} 11/07/2021 02:29:27 - INFO - __main__ - Step 37363: {'lr': 0.00043281510001916214, 'samples': 7173696, 'steps': 37362, 'loss/train': 1.6953697204589844} 11/07/2021 02:29:27 - INFO - __main__ - Step 37364: {'lr': 0.0004328114802561471, 'samples': 7173888, 'steps': 37363, 'loss/train': 1.4359856843948364} 11/07/2021 02:29:28 - INFO - __main__ - Step 37365: {'lr': 0.00043280786041076006, 'samples': 7174080, 'steps': 37364, 'loss/train': 0.9891050457954407} 11/07/2021 02:29:28 - INFO - __main__ - Step 37366: {'lr': 0.0004328042404830026, 'samples': 7174272, 'steps': 37365, 'loss/train': 1.09203040599823} 11/07/2021 02:29:28 - INFO - __main__ - Step 37367: {'lr': 0.0004328006204728763, 'samples': 7174464, 'steps': 37366, 'loss/train': 1.4810203313827515} 11/07/2021 02:29:29 - INFO - __main__ - Step 37368: {'lr': 0.00043279700038038296, 'samples': 7174656, 'steps': 37367, 'loss/train': 0.7582082748413086} 11/07/2021 02:29:30 - INFO - __main__ - Step 37369: {'lr': 0.0004327933802055241, 'samples': 7174848, 'steps': 37368, 'loss/train': 1.428175449371338} 11/07/2021 02:29:30 - INFO - __main__ - Step 37370: {'lr': 0.0004327897599483013, 'samples': 7175040, 'steps': 37369, 'loss/train': 1.807591438293457} 11/07/2021 02:29:30 - INFO - __main__ - Step 37371: {'lr': 0.00043278613960871624, 'samples': 7175232, 'steps': 37370, 'loss/train': 0.9312697649002075} 11/07/2021 02:29:31 - INFO - __main__ - Step 37372: {'lr': 0.00043278251918677066, 'samples': 7175424, 'steps': 37371, 'loss/train': 1.7605900764465332} 11/07/2021 02:29:32 - INFO - __main__ - Step 37373: {'lr': 0.00043277889868246605, 'samples': 7175616, 'steps': 37372, 'loss/train': 1.78399658203125} 11/07/2021 02:29:32 - INFO - __main__ - Step 37374: {'lr': 0.0004327752780958041, 'samples': 7175808, 'steps': 37373, 'loss/train': 1.1700432300567627} 11/07/2021 02:29:32 - INFO - __main__ - Step 37375: {'lr': 0.0004327716574267864, 'samples': 7176000, 'steps': 37374, 'loss/train': 1.495086908340454} 11/07/2021 02:29:33 - INFO - __main__ - Step 37376: {'lr': 0.00043276803667541465, 'samples': 7176192, 'steps': 37375, 'loss/train': 1.4538503885269165} 11/07/2021 02:29:33 - INFO - __main__ - Step 37377: {'lr': 0.0004327644158416905, 'samples': 7176384, 'steps': 37376, 'loss/train': 1.8409719467163086} 11/07/2021 02:29:34 - INFO - __main__ - Step 37378: {'lr': 0.0004327607949256154, 'samples': 7176576, 'steps': 37377, 'loss/train': 1.5816887617111206} 11/07/2021 02:29:35 - INFO - __main__ - Step 37379: {'lr': 0.00043275717392719115, 'samples': 7176768, 'steps': 37378, 'loss/train': 1.695273995399475} 11/07/2021 02:29:35 - INFO - __main__ - Step 37380: {'lr': 0.0004327535528464194, 'samples': 7176960, 'steps': 37379, 'loss/train': 1.7499052286148071} 11/07/2021 02:29:35 - INFO - __main__ - Step 37381: {'lr': 0.0004327499316833016, 'samples': 7177152, 'steps': 37380, 'loss/train': 1.8813824653625488} 11/07/2021 02:29:36 - INFO - __main__ - Step 37382: {'lr': 0.0004327463104378395, 'samples': 7177344, 'steps': 37381, 'loss/train': 1.160994291305542} 11/07/2021 02:29:36 - INFO - __main__ - Step 37383: {'lr': 0.0004327426891100349, 'samples': 7177536, 'steps': 37382, 'loss/train': 1.6030007600784302} 11/07/2021 02:29:37 - INFO - __main__ - Step 37384: {'lr': 0.0004327390676998891, 'samples': 7177728, 'steps': 37383, 'loss/train': 1.600156307220459} 11/07/2021 02:29:37 - INFO - __main__ - Step 37385: {'lr': 0.000432735446207404, 'samples': 7177920, 'steps': 37384, 'loss/train': 1.6125463247299194} 11/07/2021 02:29:38 - INFO - __main__ - Step 37386: {'lr': 0.0004327318246325811, 'samples': 7178112, 'steps': 37385, 'loss/train': 1.6646647453308105} 11/07/2021 02:29:38 - INFO - __main__ - Step 37387: {'lr': 0.000432728202975422, 'samples': 7178304, 'steps': 37386, 'loss/train': 1.385424256324768} 11/07/2021 02:29:38 - INFO - __main__ - Step 37388: {'lr': 0.0004327245812359285, 'samples': 7178496, 'steps': 37387, 'loss/train': 1.5408601760864258} 11/07/2021 02:29:39 - INFO - __main__ - Step 37389: {'lr': 0.000432720959414102, 'samples': 7178688, 'steps': 37388, 'loss/train': 1.6477293968200684} 11/07/2021 02:29:40 - INFO - __main__ - Step 37390: {'lr': 0.00043271733750994436, 'samples': 7178880, 'steps': 37389, 'loss/train': 1.4274452924728394} 11/07/2021 02:29:40 - INFO - __main__ - Step 37391: {'lr': 0.00043271371552345704, 'samples': 7179072, 'steps': 37390, 'loss/train': 1.5618799924850464} 11/07/2021 02:29:41 - INFO - __main__ - Step 37392: {'lr': 0.00043271009345464175, 'samples': 7179264, 'steps': 37391, 'loss/train': 1.3555269241333008} 11/07/2021 02:29:41 - INFO - __main__ - Step 37393: {'lr': 0.0004327064713035002, 'samples': 7179456, 'steps': 37392, 'loss/train': 1.3795374631881714} 11/07/2021 02:29:42 - INFO - __main__ - Step 37394: {'lr': 0.00043270284907003377, 'samples': 7179648, 'steps': 37393, 'loss/train': 0.3336257338523865} 11/07/2021 02:29:42 - INFO - __main__ - Step 37395: {'lr': 0.0004326992267542443, 'samples': 7179840, 'steps': 37394, 'loss/train': 1.483880639076233} 11/07/2021 02:29:43 - INFO - __main__ - Step 37396: {'lr': 0.0004326956043561335, 'samples': 7180032, 'steps': 37395, 'loss/train': 1.0873039960861206} 11/07/2021 02:29:43 - INFO - __main__ - Step 37397: {'lr': 0.0004326919818757028, 'samples': 7180224, 'steps': 37396, 'loss/train': 0.8401727676391602} 11/07/2021 02:29:43 - INFO - __main__ - Step 37398: {'lr': 0.00043268835931295393, 'samples': 7180416, 'steps': 37397, 'loss/train': 1.5042715072631836} 11/07/2021 02:29:44 - INFO - __main__ - Step 37399: {'lr': 0.00043268473666788844, 'samples': 7180608, 'steps': 37398, 'loss/train': 0.5492748618125916} 11/07/2021 02:29:45 - INFO - __main__ - Step 37400: {'lr': 0.0004326811139405081, 'samples': 7180800, 'steps': 37399, 'loss/train': 1.7564066648483276} 11/07/2021 02:29:45 - INFO - __main__ - Step 37401: {'lr': 0.0004326774911308145, 'samples': 7180992, 'steps': 37400, 'loss/train': 1.5959196090698242} 11/07/2021 02:29:45 - INFO - __main__ - Step 37402: {'lr': 0.00043267386823880904, 'samples': 7181184, 'steps': 37401, 'loss/train': 1.6912891864776611} 11/07/2021 02:29:46 - INFO - __main__ - Step 37403: {'lr': 0.00043267024526449374, 'samples': 7181376, 'steps': 37402, 'loss/train': 1.5138664245605469} 11/07/2021 02:29:46 - INFO - __main__ - Step 37404: {'lr': 0.00043266662220787003, 'samples': 7181568, 'steps': 37403, 'loss/train': 1.2460224628448486} 11/07/2021 02:29:47 - INFO - __main__ - Step 37405: {'lr': 0.0004326629990689395, 'samples': 7181760, 'steps': 37404, 'loss/train': 1.3983253240585327} 11/07/2021 02:29:48 - INFO - __main__ - Step 37406: {'lr': 0.0004326593758477039, 'samples': 7181952, 'steps': 37405, 'loss/train': 1.5294811725616455} 11/07/2021 02:29:48 - INFO - __main__ - Step 37407: {'lr': 0.0004326557525441648, 'samples': 7182144, 'steps': 37406, 'loss/train': 1.2356144189834595} 11/07/2021 02:29:48 - INFO - __main__ - Step 37408: {'lr': 0.00043265212915832374, 'samples': 7182336, 'steps': 37407, 'loss/train': 2.397145986557007} 11/07/2021 02:29:49 - INFO - __main__ - Step 37409: {'lr': 0.00043264850569018254, 'samples': 7182528, 'steps': 37408, 'loss/train': 1.4570412635803223} 11/07/2021 02:29:50 - INFO - __main__ - Step 37410: {'lr': 0.00043264488213974275, 'samples': 7182720, 'steps': 37409, 'loss/train': 1.3757929801940918} 11/07/2021 02:29:50 - INFO - __main__ - Step 37411: {'lr': 0.000432641258507006, 'samples': 7182912, 'steps': 37410, 'loss/train': 1.6848138570785522} 11/07/2021 02:29:51 - INFO - __main__ - Step 37412: {'lr': 0.0004326376347919738, 'samples': 7183104, 'steps': 37411, 'loss/train': 1.6494650840759277} 11/07/2021 02:29:51 - INFO - __main__ - Step 37413: {'lr': 0.00043263401099464805, 'samples': 7183296, 'steps': 37412, 'loss/train': 3.1627933979034424} 11/07/2021 02:29:52 - INFO - __main__ - Step 37414: {'lr': 0.00043263038711503017, 'samples': 7183488, 'steps': 37413, 'loss/train': 1.6756221055984497} 11/07/2021 02:29:52 - INFO - __main__ - Step 37415: {'lr': 0.00043262676315312183, 'samples': 7183680, 'steps': 37414, 'loss/train': 1.3630532026290894} 11/07/2021 02:29:53 - INFO - __main__ - Step 37416: {'lr': 0.0004326231391089247, 'samples': 7183872, 'steps': 37415, 'loss/train': 1.6684272289276123} 11/07/2021 02:29:53 - INFO - __main__ - Step 37417: {'lr': 0.00043261951498244045, 'samples': 7184064, 'steps': 37416, 'loss/train': 1.718359112739563} 11/07/2021 02:29:54 - INFO - __main__ - Step 37418: {'lr': 0.0004326158907736706, 'samples': 7184256, 'steps': 37417, 'loss/train': 1.6083804368972778} 11/07/2021 02:29:54 - INFO - __main__ - Step 37419: {'lr': 0.00043261226648261687, 'samples': 7184448, 'steps': 37418, 'loss/train': 1.5865517854690552} 11/07/2021 02:29:54 - INFO - __main__ - Step 37420: {'lr': 0.0004326086421092809, 'samples': 7184640, 'steps': 37419, 'loss/train': 1.3192373514175415} 11/07/2021 02:29:55 - INFO - __main__ - Step 37421: {'lr': 0.00043260501765366425, 'samples': 7184832, 'steps': 37420, 'loss/train': 1.606441617012024} 11/07/2021 02:29:56 - INFO - __main__ - Step 37422: {'lr': 0.00043260139311576863, 'samples': 7185024, 'steps': 37421, 'loss/train': 2.185137987136841} 11/07/2021 02:29:56 - INFO - __main__ - Step 37423: {'lr': 0.0004325977684955956, 'samples': 7185216, 'steps': 37422, 'loss/train': 1.5154505968093872} 11/07/2021 02:29:56 - INFO - __main__ - Step 37424: {'lr': 0.0004325941437931469, 'samples': 7185408, 'steps': 37423, 'loss/train': 1.1395184993743896} 11/07/2021 02:29:57 - INFO - __main__ - Step 37425: {'lr': 0.0004325905190084241, 'samples': 7185600, 'steps': 37424, 'loss/train': 1.191431999206543} 11/07/2021 02:29:58 - INFO - __main__ - Step 37426: {'lr': 0.00043258689414142875, 'samples': 7185792, 'steps': 37425, 'loss/train': 1.3336917161941528} 11/07/2021 02:29:58 - INFO - __main__ - Step 37427: {'lr': 0.0004325832691921626, 'samples': 7185984, 'steps': 37426, 'loss/train': 1.6650985479354858} 11/07/2021 02:29:59 - INFO - __main__ - Step 37428: {'lr': 0.00043257964416062723, 'samples': 7186176, 'steps': 37427, 'loss/train': 1.8389983177185059} 11/07/2021 02:29:59 - INFO - __main__ - Step 37429: {'lr': 0.0004325760190468243, 'samples': 7186368, 'steps': 37428, 'loss/train': 1.5231657028198242} 11/07/2021 02:29:59 - INFO - __main__ - Step 37430: {'lr': 0.0004325723938507555, 'samples': 7186560, 'steps': 37429, 'loss/train': 1.52488112449646} 11/07/2021 02:30:00 - INFO - __main__ - Step 37431: {'lr': 0.0004325687685724223, 'samples': 7186752, 'steps': 37430, 'loss/train': 1.412951946258545} 11/07/2021 02:30:01 - INFO - __main__ - Step 37432: {'lr': 0.0004325651432118265, 'samples': 7186944, 'steps': 37431, 'loss/train': 1.3657993078231812} 11/07/2021 02:30:01 - INFO - __main__ - Step 37433: {'lr': 0.00043256151776896955, 'samples': 7187136, 'steps': 37432, 'loss/train': 2.3355555534362793} 11/07/2021 02:30:01 - INFO - __main__ - Step 37434: {'lr': 0.0004325578922438533, 'samples': 7187328, 'steps': 37433, 'loss/train': 1.6410709619522095} 11/07/2021 02:30:02 - INFO - __main__ - Step 37435: {'lr': 0.0004325542666364793, 'samples': 7187520, 'steps': 37434, 'loss/train': 1.3707133531570435} 11/07/2021 02:30:02 - INFO - __main__ - Step 37436: {'lr': 0.00043255064094684917, 'samples': 7187712, 'steps': 37435, 'loss/train': 1.3452285528182983} 11/07/2021 02:30:03 - INFO - __main__ - Step 37437: {'lr': 0.0004325470151749644, 'samples': 7187904, 'steps': 37436, 'loss/train': 1.5764644145965576} 11/07/2021 02:30:03 - INFO - __main__ - Step 37438: {'lr': 0.00043254338932082696, 'samples': 7188096, 'steps': 37437, 'loss/train': 1.3807365894317627} 11/07/2021 02:30:04 - INFO - __main__ - Step 37439: {'lr': 0.00043253976338443814, 'samples': 7188288, 'steps': 37438, 'loss/train': 0.7496334314346313} 11/07/2021 02:30:04 - INFO - __main__ - Step 37440: {'lr': 0.00043253613736579975, 'samples': 7188480, 'steps': 37439, 'loss/train': 1.6858636140823364} 11/07/2021 02:30:05 - INFO - __main__ - Step 37441: {'lr': 0.0004325325112649134, 'samples': 7188672, 'steps': 37440, 'loss/train': 1.2810533046722412} 11/07/2021 02:30:06 - INFO - __main__ - Step 37442: {'lr': 0.00043252888508178066, 'samples': 7188864, 'steps': 37441, 'loss/train': 1.7188156843185425} 11/07/2021 02:30:06 - INFO - __main__ - Step 37443: {'lr': 0.0004325252588164033, 'samples': 7189056, 'steps': 37442, 'loss/train': 1.2332797050476074} 11/07/2021 02:30:06 - INFO - __main__ - Step 37444: {'lr': 0.00043252163246878286, 'samples': 7189248, 'steps': 37443, 'loss/train': 1.7089459896087646} 11/07/2021 02:30:07 - INFO - __main__ - Step 37445: {'lr': 0.000432518006038921, 'samples': 7189440, 'steps': 37444, 'loss/train': 1.2559268474578857} 11/07/2021 02:30:07 - INFO - __main__ - Step 37446: {'lr': 0.00043251437952681926, 'samples': 7189632, 'steps': 37445, 'loss/train': 1.6364524364471436} 11/07/2021 02:30:08 - INFO - __main__ - Step 37447: {'lr': 0.0004325107529324795, 'samples': 7189824, 'steps': 37446, 'loss/train': 1.7149345874786377} 11/07/2021 02:30:08 - INFO - __main__ - Step 37448: {'lr': 0.0004325071262559031, 'samples': 7190016, 'steps': 37447, 'loss/train': 1.6025023460388184} 11/07/2021 02:30:09 - INFO - __main__ - Step 37449: {'lr': 0.00043250349949709184, 'samples': 7190208, 'steps': 37448, 'loss/train': 1.4580620527267456} 11/07/2021 02:30:09 - INFO - __main__ - Step 37450: {'lr': 0.0004324998726560473, 'samples': 7190400, 'steps': 37449, 'loss/train': 1.1127818822860718} 11/07/2021 02:30:09 - INFO - __main__ - Step 37451: {'lr': 0.0004324962457327712, 'samples': 7190592, 'steps': 37450, 'loss/train': 1.3189001083374023} 11/07/2021 02:30:10 - INFO - __main__ - Step 37452: {'lr': 0.00043249261872726504, 'samples': 7190784, 'steps': 37451, 'loss/train': 1.1091448068618774} 11/07/2021 02:30:11 - INFO - __main__ - Step 37453: {'lr': 0.0004324889916395305, 'samples': 7190976, 'steps': 37452, 'loss/train': 1.6013160943984985} 11/07/2021 02:30:11 - INFO - __main__ - Step 37454: {'lr': 0.0004324853644695693, 'samples': 7191168, 'steps': 37453, 'loss/train': 1.1790231466293335} 11/07/2021 02:30:11 - INFO - __main__ - Step 37455: {'lr': 0.000432481737217383, 'samples': 7191360, 'steps': 37454, 'loss/train': 1.5752537250518799} 11/07/2021 02:30:12 - INFO - __main__ - Step 37456: {'lr': 0.0004324781098829732, 'samples': 7191552, 'steps': 37455, 'loss/train': 1.2282586097717285} 11/07/2021 02:30:12 - INFO - __main__ - Step 37457: {'lr': 0.0004324744824663417, 'samples': 7191744, 'steps': 37456, 'loss/train': 1.5415738821029663} 11/07/2021 02:30:13 - INFO - __main__ - Step 37458: {'lr': 0.00043247085496748983, 'samples': 7191936, 'steps': 37457, 'loss/train': 0.7288332581520081} 11/07/2021 02:30:14 - INFO - __main__ - Step 37459: {'lr': 0.0004324672273864195, 'samples': 7192128, 'steps': 37458, 'loss/train': 1.6295570135116577} 11/07/2021 02:30:14 - INFO - __main__ - Step 37460: {'lr': 0.00043246359972313233, 'samples': 7192320, 'steps': 37459, 'loss/train': 1.5285189151763916} 11/07/2021 02:30:14 - INFO - __main__ - Step 37461: {'lr': 0.0004324599719776298, 'samples': 7192512, 'steps': 37460, 'loss/train': 1.4358800649642944} 11/07/2021 02:30:15 - INFO - __main__ - Step 37462: {'lr': 0.00043245634414991365, 'samples': 7192704, 'steps': 37461, 'loss/train': 1.258275032043457} 11/07/2021 02:30:16 - INFO - __main__ - Step 37463: {'lr': 0.0004324527162399854, 'samples': 7192896, 'steps': 37462, 'loss/train': 1.5770810842514038} 11/07/2021 02:30:16 - INFO - __main__ - Step 37464: {'lr': 0.0004324490882478469, 'samples': 7193088, 'steps': 37463, 'loss/train': 1.6385400295257568} 11/07/2021 02:30:16 - INFO - __main__ - Step 37465: {'lr': 0.0004324454601734995, 'samples': 7193280, 'steps': 37464, 'loss/train': 1.7759833335876465} 11/07/2021 02:30:17 - INFO - __main__ - Step 37466: {'lr': 0.0004324418320169451, 'samples': 7193472, 'steps': 37465, 'loss/train': 1.5825846195220947} 11/07/2021 02:30:17 - INFO - __main__ - Step 37467: {'lr': 0.00043243820377818524, 'samples': 7193664, 'steps': 37466, 'loss/train': 1.4279179573059082} 11/07/2021 02:30:19 - INFO - __main__ - Step 37468: {'lr': 0.0004324345754572215, 'samples': 7193856, 'steps': 37467, 'loss/train': 0.7902894616127014} 11/07/2021 02:30:19 - INFO - __main__ - Step 37469: {'lr': 0.00043243094705405554, 'samples': 7194048, 'steps': 37468, 'loss/train': 1.3145751953125} 11/07/2021 02:30:19 - INFO - __main__ - Step 37470: {'lr': 0.0004324273185686891, 'samples': 7194240, 'steps': 37469, 'loss/train': 0.8086821436882019} 11/07/2021 02:30:20 - INFO - __main__ - Step 37471: {'lr': 0.00043242369000112365, 'samples': 7194432, 'steps': 37470, 'loss/train': 1.961127519607544} 11/07/2021 02:30:20 - INFO - __main__ - Step 37472: {'lr': 0.00043242006135136093, 'samples': 7194624, 'steps': 37471, 'loss/train': 2.164335250854492} 11/07/2021 02:30:21 - INFO - __main__ - Step 37473: {'lr': 0.00043241643261940246, 'samples': 7194816, 'steps': 37472, 'loss/train': 2.507589340209961} 11/07/2021 02:30:21 - INFO - __main__ - Step 37474: {'lr': 0.00043241280380525003, 'samples': 7195008, 'steps': 37473, 'loss/train': 1.6702580451965332} 11/07/2021 02:30:22 - INFO - __main__ - Step 37475: {'lr': 0.0004324091749089052, 'samples': 7195200, 'steps': 37474, 'loss/train': 1.4887938499450684} 11/07/2021 02:30:22 - INFO - __main__ - Step 37476: {'lr': 0.0004324055459303696, 'samples': 7195392, 'steps': 37475, 'loss/train': 2.0435266494750977} 11/07/2021 02:30:23 - INFO - __main__ - Step 37477: {'lr': 0.00043240191686964494, 'samples': 7195584, 'steps': 37476, 'loss/train': 1.1949541568756104} 11/07/2021 02:30:23 - INFO - __main__ - Step 37478: {'lr': 0.00043239828772673276, 'samples': 7195776, 'steps': 37477, 'loss/train': 1.53923499584198} 11/07/2021 02:30:23 - INFO - __main__ - Step 37479: {'lr': 0.0004323946585016347, 'samples': 7195968, 'steps': 37478, 'loss/train': 1.6027249097824097} 11/07/2021 02:30:24 - INFO - __main__ - Step 37480: {'lr': 0.00043239102919435235, 'samples': 7196160, 'steps': 37479, 'loss/train': 2.0103933811187744} 11/07/2021 02:30:25 - INFO - __main__ - Step 37481: {'lr': 0.0004323873998048875, 'samples': 7196352, 'steps': 37480, 'loss/train': 1.2224072217941284} 11/07/2021 02:30:25 - INFO - __main__ - Step 37482: {'lr': 0.00043238377033324175, 'samples': 7196544, 'steps': 37481, 'loss/train': 1.319748044013977} 11/07/2021 02:30:25 - INFO - __main__ - Step 37483: {'lr': 0.00043238014077941656, 'samples': 7196736, 'steps': 37482, 'loss/train': 1.6039621829986572} 11/07/2021 02:30:26 - INFO - __main__ - Step 37484: {'lr': 0.00043237651114341383, 'samples': 7196928, 'steps': 37483, 'loss/train': 1.4563041925430298} 11/07/2021 02:30:27 - INFO - __main__ - Step 37485: {'lr': 0.00043237288142523503, 'samples': 7197120, 'steps': 37484, 'loss/train': 2.3065671920776367} 11/07/2021 02:30:27 - INFO - __main__ - Step 37486: {'lr': 0.00043236925162488173, 'samples': 7197312, 'steps': 37485, 'loss/train': 1.2834789752960205} 11/07/2021 02:30:27 - INFO - __main__ - Step 37487: {'lr': 0.0004323656217423557, 'samples': 7197504, 'steps': 37486, 'loss/train': 1.5869083404541016} 11/07/2021 02:30:28 - INFO - __main__ - Step 37488: {'lr': 0.00043236199177765856, 'samples': 7197696, 'steps': 37487, 'loss/train': 1.7081737518310547} 11/07/2021 02:30:28 - INFO - __main__ - Step 37489: {'lr': 0.0004323583617307919, 'samples': 7197888, 'steps': 37488, 'loss/train': 1.4253218173980713} 11/07/2021 02:30:29 - INFO - __main__ - Step 37490: {'lr': 0.00043235473160175745, 'samples': 7198080, 'steps': 37489, 'loss/train': 1.9894713163375854} 11/07/2021 02:30:29 - INFO - __main__ - Step 37491: {'lr': 0.0004323511013905567, 'samples': 7198272, 'steps': 37490, 'loss/train': 1.5830626487731934} 11/07/2021 02:30:30 - INFO - __main__ - Step 37492: {'lr': 0.0004323474710971913, 'samples': 7198464, 'steps': 37491, 'loss/train': 1.5441548824310303} 11/07/2021 02:30:30 - INFO - __main__ - Step 37493: {'lr': 0.0004323438407216631, 'samples': 7198656, 'steps': 37492, 'loss/train': 1.6030843257904053} 11/07/2021 02:30:30 - INFO - __main__ - Step 37494: {'lr': 0.0004323402102639734, 'samples': 7198848, 'steps': 37493, 'loss/train': 1.7096853256225586} 11/07/2021 02:30:32 - INFO - __main__ - Step 37495: {'lr': 0.00043233657972412414, 'samples': 7199040, 'steps': 37494, 'loss/train': 1.8373485803604126} 11/07/2021 02:30:32 - INFO - __main__ - Step 37496: {'lr': 0.00043233294910211684, 'samples': 7199232, 'steps': 37495, 'loss/train': 1.551588535308838} 11/07/2021 02:30:32 - INFO - __main__ - Step 37497: {'lr': 0.0004323293183979531, 'samples': 7199424, 'steps': 37496, 'loss/train': 1.4369267225265503} 11/07/2021 02:30:33 - INFO - __main__ - Step 37498: {'lr': 0.0004323256876116345, 'samples': 7199616, 'steps': 37497, 'loss/train': 1.0210753679275513} 11/07/2021 02:30:33 - INFO - __main__ - Step 37499: {'lr': 0.0004323220567431628, 'samples': 7199808, 'steps': 37498, 'loss/train': 1.9081380367279053} 11/07/2021 02:30:34 - INFO - __main__ - Step 37500: {'lr': 0.0004323184257925397, 'samples': 7200000, 'steps': 37499, 'loss/train': 1.5604526996612549} 11/07/2021 02:30:34 - INFO - __main__ - Step 37501: {'lr': 0.0004323147947597667, 'samples': 7200192, 'steps': 37500, 'loss/train': 1.0608760118484497} 11/07/2021 02:30:35 - INFO - __main__ - Step 37502: {'lr': 0.00043231116364484534, 'samples': 7200384, 'steps': 37501, 'loss/train': 1.4155505895614624} 11/07/2021 02:30:35 - INFO - __main__ - Step 37503: {'lr': 0.00043230753244777743, 'samples': 7200576, 'steps': 37502, 'loss/train': 1.5481852293014526} 11/07/2021 02:30:35 - INFO - __main__ - Step 37504: {'lr': 0.00043230390116856467, 'samples': 7200768, 'steps': 37503, 'loss/train': 0.5808529257774353} 11/07/2021 02:30:36 - INFO - __main__ - Step 37505: {'lr': 0.00043230026980720847, 'samples': 7200960, 'steps': 37504, 'loss/train': 1.8550664186477661} 11/07/2021 02:30:37 - INFO - __main__ - Step 37506: {'lr': 0.00043229663836371056, 'samples': 7201152, 'steps': 37505, 'loss/train': 1.284300684928894} 11/07/2021 02:30:37 - INFO - __main__ - Step 37507: {'lr': 0.0004322930068380727, 'samples': 7201344, 'steps': 37506, 'loss/train': 1.435318112373352} 11/07/2021 02:30:37 - INFO - __main__ - Step 37508: {'lr': 0.00043228937523029636, 'samples': 7201536, 'steps': 37507, 'loss/train': 1.5970197916030884} 11/07/2021 02:30:38 - INFO - __main__ - Step 37509: {'lr': 0.00043228574354038326, 'samples': 7201728, 'steps': 37508, 'loss/train': 1.5501351356506348} 11/07/2021 02:30:38 - INFO - __main__ - Step 37510: {'lr': 0.00043228211176833496, 'samples': 7201920, 'steps': 37509, 'loss/train': 1.5573985576629639} 11/07/2021 02:30:39 - INFO - __main__ - Step 37511: {'lr': 0.00043227847991415326, 'samples': 7202112, 'steps': 37510, 'loss/train': 0.7737532258033752} 11/07/2021 02:30:40 - INFO - __main__ - Step 37512: {'lr': 0.00043227484797783965, 'samples': 7202304, 'steps': 37511, 'loss/train': 1.446190595626831} 11/07/2021 02:30:40 - INFO - __main__ - Step 37513: {'lr': 0.0004322712159593958, 'samples': 7202496, 'steps': 37512, 'loss/train': 2.2026243209838867} 11/07/2021 02:30:40 - INFO - __main__ - Step 37514: {'lr': 0.0004322675838588234, 'samples': 7202688, 'steps': 37513, 'loss/train': 1.200168490409851} 11/07/2021 02:30:41 - INFO - __main__ - Step 37515: {'lr': 0.0004322639516761239, 'samples': 7202880, 'steps': 37514, 'loss/train': 1.6389999389648438} 11/07/2021 02:30:42 - INFO - __main__ - Step 37516: {'lr': 0.0004322603194112992, 'samples': 7203072, 'steps': 37515, 'loss/train': 1.6824238300323486} 11/07/2021 02:30:42 - INFO - __main__ - Step 37517: {'lr': 0.00043225668706435073, 'samples': 7203264, 'steps': 37516, 'loss/train': 1.6325851678848267} 11/07/2021 02:30:42 - INFO - __main__ - Step 37518: {'lr': 0.0004322530546352803, 'samples': 7203456, 'steps': 37517, 'loss/train': 1.74091374874115} 11/07/2021 02:30:43 - INFO - __main__ - Step 37519: {'lr': 0.0004322494221240894, 'samples': 7203648, 'steps': 37518, 'loss/train': 1.3719381093978882} 11/07/2021 02:30:43 - INFO - __main__ - Step 37520: {'lr': 0.0004322457895307797, 'samples': 7203840, 'steps': 37519, 'loss/train': 2.352436065673828} 11/07/2021 02:30:44 - INFO - __main__ - Step 37521: {'lr': 0.00043224215685535287, 'samples': 7204032, 'steps': 37520, 'loss/train': 2.0461223125457764} 11/07/2021 02:30:44 - INFO - __main__ - Step 37522: {'lr': 0.0004322385240978106, 'samples': 7204224, 'steps': 37521, 'loss/train': 1.1710293292999268} 11/07/2021 02:30:45 - INFO - __main__ - Step 37523: {'lr': 0.0004322348912581544, 'samples': 7204416, 'steps': 37522, 'loss/train': 1.662919044494629} 11/07/2021 02:30:45 - INFO - __main__ - Step 37524: {'lr': 0.000432231258336386, 'samples': 7204608, 'steps': 37523, 'loss/train': 1.4173078536987305} 11/07/2021 02:30:46 - INFO - __main__ - Step 37525: {'lr': 0.000432227625332507, 'samples': 7204800, 'steps': 37524, 'loss/train': 0.7475195527076721} 11/07/2021 02:30:47 - INFO - __main__ - Step 37526: {'lr': 0.000432223992246519, 'samples': 7204992, 'steps': 37525, 'loss/train': 1.65089750289917} 11/07/2021 02:30:47 - INFO - __main__ - Step 37527: {'lr': 0.0004322203590784237, 'samples': 7205184, 'steps': 37526, 'loss/train': 1.536938190460205} 11/07/2021 02:30:47 - INFO - __main__ - Step 37528: {'lr': 0.0004322167258282228, 'samples': 7205376, 'steps': 37527, 'loss/train': 1.5707414150238037} 11/07/2021 02:30:48 - INFO - __main__ - Step 37529: {'lr': 0.0004322130924959178, 'samples': 7205568, 'steps': 37528, 'loss/train': 1.5095049142837524} 11/07/2021 02:30:48 - INFO - __main__ - Step 37530: {'lr': 0.0004322094590815104, 'samples': 7205760, 'steps': 37529, 'loss/train': 1.9747005701065063} 11/07/2021 02:30:50 - INFO - __main__ - Step 37531: {'lr': 0.00043220582558500223, 'samples': 7205952, 'steps': 37530, 'loss/train': 1.743624210357666} 11/07/2021 02:30:50 - INFO - __main__ - Step 37532: {'lr': 0.00043220219200639485, 'samples': 7206144, 'steps': 37531, 'loss/train': 1.7449406385421753} 11/07/2021 02:30:50 - INFO - __main__ - Step 37533: {'lr': 0.00043219855834569006, 'samples': 7206336, 'steps': 37532, 'loss/train': 2.2813148498535156} 11/07/2021 02:30:51 - INFO - __main__ - Step 37534: {'lr': 0.00043219492460288937, 'samples': 7206528, 'steps': 37533, 'loss/train': 1.6646699905395508} 11/07/2021 02:30:51 - INFO - __main__ - Step 37535: {'lr': 0.00043219129077799447, 'samples': 7206720, 'steps': 37534, 'loss/train': 1.7239679098129272} 11/07/2021 02:30:51 - INFO - __main__ - Step 37536: {'lr': 0.000432187656871007, 'samples': 7206912, 'steps': 37535, 'loss/train': 1.7892603874206543} 11/07/2021 02:30:52 - INFO - __main__ - Step 37537: {'lr': 0.0004321840228819286, 'samples': 7207104, 'steps': 37536, 'loss/train': 1.0783336162567139} 11/07/2021 02:30:53 - INFO - __main__ - Step 37538: {'lr': 0.0004321803888107608, 'samples': 7207296, 'steps': 37537, 'loss/train': 0.732699990272522} 11/07/2021 02:30:53 - INFO - __main__ - Step 37539: {'lr': 0.0004321767546575054, 'samples': 7207488, 'steps': 37538, 'loss/train': 1.7513829469680786} 11/07/2021 02:30:54 - INFO - __main__ - Step 37540: {'lr': 0.000432173120422164, 'samples': 7207680, 'steps': 37539, 'loss/train': 10.400634765625} 11/07/2021 02:30:54 - INFO - __main__ - Step 37541: {'lr': 0.00043216948610473816, 'samples': 7207872, 'steps': 37540, 'loss/train': 1.5212979316711426} 11/07/2021 02:30:54 - INFO - __main__ - Step 37542: {'lr': 0.0004321658517052296, 'samples': 7208064, 'steps': 37541, 'loss/train': 1.4999489784240723} 11/07/2021 02:30:55 - INFO - __main__ - Step 37543: {'lr': 0.00043216221722363983, 'samples': 7208256, 'steps': 37542, 'loss/train': 1.7804269790649414} 11/07/2021 02:30:56 - INFO - __main__ - Step 37544: {'lr': 0.00043215858265997065, 'samples': 7208448, 'steps': 37543, 'loss/train': 1.8416721820831299} 11/07/2021 02:30:56 - INFO - __main__ - Step 37545: {'lr': 0.0004321549480142236, 'samples': 7208640, 'steps': 37544, 'loss/train': 1.489058017730713} 11/07/2021 02:30:56 - INFO - __main__ - Step 37546: {'lr': 0.0004321513132864003, 'samples': 7208832, 'steps': 37545, 'loss/train': 1.4994314908981323} 11/07/2021 02:30:57 - INFO - __main__ - Step 37547: {'lr': 0.0004321476784765025, 'samples': 7209024, 'steps': 37546, 'loss/train': 1.3192144632339478} 11/07/2021 02:30:57 - INFO - __main__ - Step 37548: {'lr': 0.00043214404358453174, 'samples': 7209216, 'steps': 37547, 'loss/train': 1.5115631818771362} 11/07/2021 02:30:58 - INFO - __main__ - Step 37549: {'lr': 0.0004321404086104897, 'samples': 7209408, 'steps': 37548, 'loss/train': 0.9762508869171143} 11/07/2021 02:30:58 - INFO - __main__ - Step 37550: {'lr': 0.00043213677355437795, 'samples': 7209600, 'steps': 37549, 'loss/train': 1.087836742401123} 11/07/2021 02:30:59 - INFO - __main__ - Step 37551: {'lr': 0.0004321331384161983, 'samples': 7209792, 'steps': 37550, 'loss/train': 1.6014374494552612} 11/07/2021 02:30:59 - INFO - __main__ - Step 37552: {'lr': 0.00043212950319595215, 'samples': 7209984, 'steps': 37551, 'loss/train': 1.8291975259780884} 11/07/2021 02:30:59 - INFO - __main__ - Step 37553: {'lr': 0.0004321258678936413, 'samples': 7210176, 'steps': 37552, 'loss/train': 1.8783771991729736} 11/07/2021 02:31:01 - INFO - __main__ - Step 37554: {'lr': 0.00043212223250926727, 'samples': 7210368, 'steps': 37553, 'loss/train': 1.5376328229904175} 11/07/2021 02:31:01 - INFO - __main__ - Step 37555: {'lr': 0.00043211859704283184, 'samples': 7210560, 'steps': 37554, 'loss/train': 1.462335228919983} 11/07/2021 02:31:01 - INFO - __main__ - Step 37556: {'lr': 0.0004321149614943366, 'samples': 7210752, 'steps': 37555, 'loss/train': 1.7638007402420044} 11/07/2021 02:31:02 - INFO - __main__ - Step 37557: {'lr': 0.0004321113258637832, 'samples': 7210944, 'steps': 37556, 'loss/train': 1.3894625902175903} 11/07/2021 02:31:02 - INFO - __main__ - Step 37558: {'lr': 0.0004321076901511731, 'samples': 7211136, 'steps': 37557, 'loss/train': 1.167650818824768} 11/07/2021 02:31:03 - INFO - __main__ - Step 37559: {'lr': 0.0004321040543565082, 'samples': 7211328, 'steps': 37558, 'loss/train': 1.5407660007476807} 11/07/2021 02:31:03 - INFO - __main__ - Step 37560: {'lr': 0.00043210041847979003, 'samples': 7211520, 'steps': 37559, 'loss/train': 1.55744469165802} 11/07/2021 02:31:04 - INFO - __main__ - Step 37561: {'lr': 0.0004320967825210202, 'samples': 7211712, 'steps': 37560, 'loss/train': 1.7859392166137695} 11/07/2021 02:31:04 - INFO - __main__ - Step 37562: {'lr': 0.00043209314648020035, 'samples': 7211904, 'steps': 37561, 'loss/train': 1.5488879680633545} 11/07/2021 02:31:04 - INFO - __main__ - Step 37563: {'lr': 0.0004320895103573321, 'samples': 7212096, 'steps': 37562, 'loss/train': 2.0785441398620605} 11/07/2021 02:31:06 - INFO - __main__ - Step 37564: {'lr': 0.00043208587415241725, 'samples': 7212288, 'steps': 37563, 'loss/train': 2.692457675933838} 11/07/2021 02:31:06 - INFO - __main__ - Step 37565: {'lr': 0.00043208223786545723, 'samples': 7212480, 'steps': 37564, 'loss/train': 1.5505104064941406} 11/07/2021 02:31:06 - INFO - __main__ - Step 37566: {'lr': 0.0004320786014964538, 'samples': 7212672, 'steps': 37565, 'loss/train': 1.716882586479187} 11/07/2021 02:31:07 - INFO - __main__ - Step 37567: {'lr': 0.0004320749650454085, 'samples': 7212864, 'steps': 37566, 'loss/train': 1.477885127067566} 11/07/2021 02:31:07 - INFO - __main__ - Step 37568: {'lr': 0.0004320713285123231, 'samples': 7213056, 'steps': 37567, 'loss/train': 1.2575966119766235} 11/07/2021 02:31:07 - INFO - __main__ - Step 37569: {'lr': 0.0004320676918971991, 'samples': 7213248, 'steps': 37568, 'loss/train': 0.40955543518066406} 11/07/2021 02:31:08 - INFO - __main__ - Step 37570: {'lr': 0.00043206405520003824, 'samples': 7213440, 'steps': 37569, 'loss/train': 1.5961387157440186} 11/07/2021 02:31:09 - INFO - __main__ - Step 37571: {'lr': 0.00043206041842084214, 'samples': 7213632, 'steps': 37570, 'loss/train': 1.733590006828308} 11/07/2021 02:31:09 - INFO - __main__ - Step 37572: {'lr': 0.00043205678155961244, 'samples': 7213824, 'steps': 37571, 'loss/train': 1.8249460458755493} 11/07/2021 02:31:09 - INFO - __main__ - Step 37573: {'lr': 0.0004320531446163507, 'samples': 7214016, 'steps': 37572, 'loss/train': 1.5001862049102783} 11/07/2021 02:31:10 - INFO - __main__ - Step 37574: {'lr': 0.00043204950759105865, 'samples': 7214208, 'steps': 37573, 'loss/train': 0.9354721903800964} 11/07/2021 02:31:11 - INFO - __main__ - Step 37575: {'lr': 0.0004320458704837379, 'samples': 7214400, 'steps': 37574, 'loss/train': 1.4452364444732666} 11/07/2021 02:31:11 - INFO - __main__ - Step 37576: {'lr': 0.00043204223329439015, 'samples': 7214592, 'steps': 37575, 'loss/train': 1.3027896881103516} 11/07/2021 02:31:11 - INFO - __main__ - Step 37577: {'lr': 0.00043203859602301695, 'samples': 7214784, 'steps': 37576, 'loss/train': 1.559409499168396} 11/07/2021 02:31:12 - INFO - __main__ - Step 37578: {'lr': 0.00043203495866961996, 'samples': 7214976, 'steps': 37577, 'loss/train': 1.2092102766036987} 11/07/2021 02:31:12 - INFO - __main__ - Step 37579: {'lr': 0.00043203132123420074, 'samples': 7215168, 'steps': 37578, 'loss/train': 1.8662713766098022} 11/07/2021 02:31:13 - INFO - __main__ - Step 37580: {'lr': 0.00043202768371676113, 'samples': 7215360, 'steps': 37579, 'loss/train': 1.269534707069397} 11/07/2021 02:31:14 - INFO - __main__ - Step 37581: {'lr': 0.0004320240461173026, 'samples': 7215552, 'steps': 37580, 'loss/train': 0.5584875345230103} 11/07/2021 02:31:14 - INFO - __main__ - Step 37582: {'lr': 0.00043202040843582685, 'samples': 7215744, 'steps': 37581, 'loss/train': 1.0845144987106323} 11/07/2021 02:31:14 - INFO - __main__ - Step 37583: {'lr': 0.00043201677067233554, 'samples': 7215936, 'steps': 37582, 'loss/train': 2.1765105724334717} 11/07/2021 02:31:15 - INFO - __main__ - Step 37584: {'lr': 0.00043201313282683024, 'samples': 7216128, 'steps': 37583, 'loss/train': 1.4663903713226318} 11/07/2021 02:31:15 - INFO - __main__ - Step 37585: {'lr': 0.0004320094948993127, 'samples': 7216320, 'steps': 37584, 'loss/train': 1.7186776399612427} 11/07/2021 02:31:16 - INFO - __main__ - Step 37586: {'lr': 0.00043200585688978445, 'samples': 7216512, 'steps': 37585, 'loss/train': 1.0050069093704224} 11/07/2021 02:31:16 - INFO - __main__ - Step 37587: {'lr': 0.00043200221879824706, 'samples': 7216704, 'steps': 37586, 'loss/train': 1.7899430990219116} 11/07/2021 02:31:17 - INFO - __main__ - Step 37588: {'lr': 0.0004319985806247024, 'samples': 7216896, 'steps': 37587, 'loss/train': 1.0412225723266602} 11/07/2021 02:31:17 - INFO - __main__ - Step 37589: {'lr': 0.00043199494236915206, 'samples': 7217088, 'steps': 37588, 'loss/train': 2.0551319122314453} 11/07/2021 02:31:17 - INFO - __main__ - Step 37590: {'lr': 0.0004319913040315975, 'samples': 7217280, 'steps': 37589, 'loss/train': 1.3337748050689697} 11/07/2021 02:31:18 - INFO - __main__ - Step 37591: {'lr': 0.00043198766561204047, 'samples': 7217472, 'steps': 37590, 'loss/train': 1.7535258531570435} 11/07/2021 02:31:19 - INFO - __main__ - Step 37592: {'lr': 0.0004319840271104826, 'samples': 7217664, 'steps': 37591, 'loss/train': 1.6593502759933472} 11/07/2021 02:31:19 - INFO - __main__ - Step 37593: {'lr': 0.0004319803885269256, 'samples': 7217856, 'steps': 37592, 'loss/train': 1.2667961120605469} 11/07/2021 02:31:20 - INFO - __main__ - Step 37594: {'lr': 0.0004319767498613709, 'samples': 7218048, 'steps': 37593, 'loss/train': 1.6652697324752808} 11/07/2021 02:31:20 - INFO - __main__ - Step 37595: {'lr': 0.00043197311111382045, 'samples': 7218240, 'steps': 37594, 'loss/train': 1.6020832061767578} 11/07/2021 02:31:21 - INFO - __main__ - Step 37596: {'lr': 0.00043196947228427564, 'samples': 7218432, 'steps': 37595, 'loss/train': 1.2289795875549316} 11/07/2021 02:31:21 - INFO - __main__ - Step 37597: {'lr': 0.0004319658333727382, 'samples': 7218624, 'steps': 37596, 'loss/train': 2.1435749530792236} 11/07/2021 02:31:22 - INFO - __main__ - Step 37598: {'lr': 0.0004319621943792098, 'samples': 7218816, 'steps': 37597, 'loss/train': 0.6204817295074463} 11/07/2021 02:31:22 - INFO - __main__ - Step 37599: {'lr': 0.000431958555303692, 'samples': 7219008, 'steps': 37598, 'loss/train': 1.4375860691070557} 11/07/2021 02:31:22 - INFO - __main__ - Step 37600: {'lr': 0.00043195491614618655, 'samples': 7219200, 'steps': 37599, 'loss/train': 1.17434561252594} 11/07/2021 02:31:24 - INFO - __main__ - Step 37601: {'lr': 0.00043195127690669486, 'samples': 7219392, 'steps': 37600, 'loss/train': 1.445520281791687} 11/07/2021 02:31:24 - INFO - __main__ - Step 37602: {'lr': 0.00043194763758521896, 'samples': 7219584, 'steps': 37601, 'loss/train': 1.8835210800170898} 11/07/2021 02:31:24 - INFO - __main__ - Step 37603: {'lr': 0.00043194399818176013, 'samples': 7219776, 'steps': 37602, 'loss/train': 0.8516283631324768} 11/07/2021 02:31:25 - INFO - __main__ - Step 37604: {'lr': 0.00043194035869632017, 'samples': 7219968, 'steps': 37603, 'loss/train': 1.772552251815796} 11/07/2021 02:31:25 - INFO - __main__ - Step 37605: {'lr': 0.00043193671912890064, 'samples': 7220160, 'steps': 37604, 'loss/train': 1.627819299697876} 11/07/2021 02:31:25 - INFO - __main__ - Step 37606: {'lr': 0.0004319330794795033, 'samples': 7220352, 'steps': 37605, 'loss/train': 1.452150821685791} 11/07/2021 02:31:27 - INFO - __main__ - Step 37607: {'lr': 0.0004319294397481297, 'samples': 7220544, 'steps': 37606, 'loss/train': 1.6680288314819336} 11/07/2021 02:31:27 - INFO - __main__ - Step 37608: {'lr': 0.0004319257999347815, 'samples': 7220736, 'steps': 37607, 'loss/train': 1.2022203207015991} 11/07/2021 02:31:28 - INFO - __main__ - Step 37609: {'lr': 0.0004319221600394603, 'samples': 7220928, 'steps': 37608, 'loss/train': 1.7221713066101074} 11/07/2021 02:31:28 - INFO - __main__ - Step 37610: {'lr': 0.0004319185200621678, 'samples': 7221120, 'steps': 37609, 'loss/train': 0.6669997572898865} 11/07/2021 02:31:28 - INFO - __main__ - Step 37611: {'lr': 0.0004319148800029057, 'samples': 7221312, 'steps': 37610, 'loss/train': 1.5159608125686646} 11/07/2021 02:31:29 - INFO - __main__ - Step 37612: {'lr': 0.0004319112398616755, 'samples': 7221504, 'steps': 37611, 'loss/train': 1.4701101779937744} 11/07/2021 02:31:30 - INFO - __main__ - Step 37613: {'lr': 0.00043190759963847894, 'samples': 7221696, 'steps': 37612, 'loss/train': 0.991864025592804} 11/07/2021 02:31:30 - INFO - __main__ - Step 37614: {'lr': 0.00043190395933331757, 'samples': 7221888, 'steps': 37613, 'loss/train': 1.380480408668518} 11/07/2021 02:31:30 - INFO - __main__ - Step 37615: {'lr': 0.00043190031894619306, 'samples': 7222080, 'steps': 37614, 'loss/train': 1.7094345092773438} 11/07/2021 02:31:31 - INFO - __main__ - Step 37616: {'lr': 0.0004318966784771071, 'samples': 7222272, 'steps': 37615, 'loss/train': 1.7797966003417969} 11/07/2021 02:31:31 - INFO - __main__ - Step 37617: {'lr': 0.00043189303792606136, 'samples': 7222464, 'steps': 37616, 'loss/train': 1.494810938835144} 11/07/2021 02:31:32 - INFO - __main__ - Step 37618: {'lr': 0.0004318893972930574, 'samples': 7222656, 'steps': 37617, 'loss/train': 1.7728573083877563} 11/07/2021 02:31:33 - INFO - __main__ - Step 37619: {'lr': 0.00043188575657809685, 'samples': 7222848, 'steps': 37618, 'loss/train': 1.844686508178711} 11/07/2021 02:31:33 - INFO - __main__ - Step 37620: {'lr': 0.00043188211578118143, 'samples': 7223040, 'steps': 37619, 'loss/train': 1.6193997859954834} 11/07/2021 02:31:33 - INFO - __main__ - Step 37621: {'lr': 0.0004318784749023127, 'samples': 7223232, 'steps': 37620, 'loss/train': 1.4788497686386108} 11/07/2021 02:31:34 - INFO - __main__ - Step 37622: {'lr': 0.0004318748339414923, 'samples': 7223424, 'steps': 37621, 'loss/train': 1.5991002321243286} 11/07/2021 02:31:35 - INFO - __main__ - Step 37623: {'lr': 0.000431871192898722, 'samples': 7223616, 'steps': 37622, 'loss/train': 1.8423744440078735} 11/07/2021 02:31:35 - INFO - __main__ - Step 37624: {'lr': 0.0004318675517740033, 'samples': 7223808, 'steps': 37623, 'loss/train': 1.4404720067977905} 11/07/2021 02:31:35 - INFO - __main__ - Step 37625: {'lr': 0.0004318639105673379, 'samples': 7224000, 'steps': 37624, 'loss/train': 1.5224372148513794} 11/07/2021 02:31:36 - INFO - __main__ - Step 37626: {'lr': 0.00043186026927872736, 'samples': 7224192, 'steps': 37625, 'loss/train': 1.1585700511932373} 11/07/2021 02:31:36 - INFO - __main__ - Step 37627: {'lr': 0.0004318566279081735, 'samples': 7224384, 'steps': 37626, 'loss/train': 1.5932338237762451} 11/07/2021 02:31:36 - INFO - __main__ - Step 37628: {'lr': 0.0004318529864556777, 'samples': 7224576, 'steps': 37627, 'loss/train': 1.5945461988449097} 11/07/2021 02:31:37 - INFO - __main__ - Step 37629: {'lr': 0.0004318493449212419, 'samples': 7224768, 'steps': 37628, 'loss/train': 1.6314464807510376} 11/07/2021 02:31:38 - INFO - __main__ - Step 37630: {'lr': 0.00043184570330486756, 'samples': 7224960, 'steps': 37629, 'loss/train': 1.088186502456665} 11/07/2021 02:31:38 - INFO - __main__ - Step 37631: {'lr': 0.0004318420616065563, 'samples': 7225152, 'steps': 37630, 'loss/train': 1.3572551012039185} 11/07/2021 02:31:39 - INFO - __main__ - Step 37632: {'lr': 0.0004318384198263099, 'samples': 7225344, 'steps': 37631, 'loss/train': 1.4363213777542114} 11/07/2021 02:31:39 - INFO - __main__ - Step 37633: {'lr': 0.0004318347779641298, 'samples': 7225536, 'steps': 37632, 'loss/train': 1.7372273206710815} 11/07/2021 02:31:40 - INFO - __main__ - Step 37634: {'lr': 0.00043183113602001777, 'samples': 7225728, 'steps': 37633, 'loss/train': 0.5927386283874512} 11/07/2021 02:31:40 - INFO - __main__ - Step 37635: {'lr': 0.0004318274939939755, 'samples': 7225920, 'steps': 37634, 'loss/train': 1.5352866649627686} 11/07/2021 02:31:41 - INFO - __main__ - Step 37636: {'lr': 0.00043182385188600457, 'samples': 7226112, 'steps': 37635, 'loss/train': 1.1105045080184937} 11/07/2021 02:31:41 - INFO - __main__ - Step 37637: {'lr': 0.0004318202096961066, 'samples': 7226304, 'steps': 37636, 'loss/train': 0.7921403646469116} 11/07/2021 02:31:41 - INFO - __main__ - Step 37638: {'lr': 0.0004318165674242832, 'samples': 7226496, 'steps': 37637, 'loss/train': 0.8578131794929504} 11/07/2021 02:31:42 - INFO - __main__ - Step 37639: {'lr': 0.0004318129250705361, 'samples': 7226688, 'steps': 37638, 'loss/train': 1.8374136686325073} 11/07/2021 02:31:43 - INFO - __main__ - Step 37640: {'lr': 0.0004318092826348669, 'samples': 7226880, 'steps': 37639, 'loss/train': 1.7463501691818237} 11/07/2021 02:31:43 - INFO - __main__ - Step 37641: {'lr': 0.0004318056401172772, 'samples': 7227072, 'steps': 37640, 'loss/train': 1.500892996788025} 11/07/2021 02:31:44 - INFO - __main__ - Step 37642: {'lr': 0.0004318019975177688, 'samples': 7227264, 'steps': 37641, 'loss/train': 1.86619234085083} 11/07/2021 02:31:44 - INFO - __main__ - Step 37643: {'lr': 0.0004317983548363431, 'samples': 7227456, 'steps': 37642, 'loss/train': 1.6383424997329712} 11/07/2021 02:31:45 - INFO - __main__ - Step 37644: {'lr': 0.0004317947120730019, 'samples': 7227648, 'steps': 37643, 'loss/train': 1.8383617401123047} 11/07/2021 02:31:45 - INFO - __main__ - Step 37645: {'lr': 0.0004317910692277469, 'samples': 7227840, 'steps': 37644, 'loss/train': 1.0736067295074463} 11/07/2021 02:31:46 - INFO - __main__ - Step 37646: {'lr': 0.0004317874263005795, 'samples': 7228032, 'steps': 37645, 'loss/train': 0.9489991068840027} 11/07/2021 02:31:46 - INFO - __main__ - Step 37647: {'lr': 0.0004317837832915016, 'samples': 7228224, 'steps': 37646, 'loss/train': 1.8266150951385498} 11/07/2021 02:31:46 - INFO - __main__ - Step 37648: {'lr': 0.0004317801402005147, 'samples': 7228416, 'steps': 37647, 'loss/train': 1.5896239280700684} 11/07/2021 02:31:47 - INFO - __main__ - Step 37649: {'lr': 0.00043177649702762043, 'samples': 7228608, 'steps': 37648, 'loss/train': 1.2595068216323853} 11/07/2021 02:31:48 - INFO - __main__ - Step 37650: {'lr': 0.0004317728537728206, 'samples': 7228800, 'steps': 37649, 'loss/train': 1.6371251344680786} 11/07/2021 02:31:48 - INFO - __main__ - Step 37651: {'lr': 0.0004317692104361166, 'samples': 7228992, 'steps': 37650, 'loss/train': 1.5012624263763428} 11/07/2021 02:31:48 - INFO - __main__ - Step 37652: {'lr': 0.0004317655670175102, 'samples': 7229184, 'steps': 37651, 'loss/train': 1.1823043823242188} 11/07/2021 02:31:49 - INFO - __main__ - Step 37653: {'lr': 0.0004317619235170032, 'samples': 7229376, 'steps': 37652, 'loss/train': 1.0218074321746826} 11/07/2021 02:31:49 - INFO - __main__ - Step 37654: {'lr': 0.00043175827993459696, 'samples': 7229568, 'steps': 37653, 'loss/train': 1.5730572938919067} 11/07/2021 02:31:50 - INFO - __main__ - Step 37655: {'lr': 0.0004317546362702932, 'samples': 7229760, 'steps': 37654, 'loss/train': 1.5754374265670776} 11/07/2021 02:31:50 - INFO - __main__ - Step 37656: {'lr': 0.0004317509925240937, 'samples': 7229952, 'steps': 37655, 'loss/train': 1.760396122932434} 11/07/2021 02:31:51 - INFO - __main__ - Step 37657: {'lr': 0.00043174734869599993, 'samples': 7230144, 'steps': 37656, 'loss/train': 1.6367154121398926} 11/07/2021 02:31:51 - INFO - __main__ - Step 37658: {'lr': 0.0004317437047860137, 'samples': 7230336, 'steps': 37657, 'loss/train': 1.5266382694244385} 11/07/2021 02:31:51 - INFO - __main__ - Step 37659: {'lr': 0.0004317400607941364, 'samples': 7230528, 'steps': 37658, 'loss/train': 1.5288509130477905} 11/07/2021 02:31:53 - INFO - __main__ - Step 37660: {'lr': 0.00043173641672037, 'samples': 7230720, 'steps': 37659, 'loss/train': 1.6391620635986328} 11/07/2021 02:31:53 - INFO - __main__ - Step 37661: {'lr': 0.00043173277256471586, 'samples': 7230912, 'steps': 37660, 'loss/train': 1.4939783811569214} 11/07/2021 02:31:53 - INFO - __main__ - Step 37662: {'lr': 0.0004317291283271758, 'samples': 7231104, 'steps': 37661, 'loss/train': 1.5181828737258911} 11/07/2021 02:31:54 - INFO - __main__ - Step 37663: {'lr': 0.0004317254840077514, 'samples': 7231296, 'steps': 37662, 'loss/train': 1.6886277198791504} 11/07/2021 02:31:54 - INFO - __main__ - Step 37664: {'lr': 0.0004317218396064443, 'samples': 7231488, 'steps': 37663, 'loss/train': 1.5676177740097046} 11/07/2021 02:31:55 - INFO - __main__ - Step 37665: {'lr': 0.00043171819512325614, 'samples': 7231680, 'steps': 37664, 'loss/train': 1.3484420776367188} 11/07/2021 02:31:55 - INFO - __main__ - Step 37666: {'lr': 0.00043171455055818854, 'samples': 7231872, 'steps': 37665, 'loss/train': 1.4471410512924194} 11/07/2021 02:31:56 - INFO - __main__ - Step 37667: {'lr': 0.0004317109059112432, 'samples': 7232064, 'steps': 37666, 'loss/train': 1.7877618074417114} 11/07/2021 02:31:56 - INFO - __main__ - Step 37668: {'lr': 0.00043170726118242164, 'samples': 7232256, 'steps': 37667, 'loss/train': 2.937180280685425} 11/07/2021 02:31:56 - INFO - __main__ - Step 37669: {'lr': 0.0004317036163717257, 'samples': 7232448, 'steps': 37668, 'loss/train': 1.400524377822876} 11/07/2021 02:31:57 - INFO - __main__ - Step 37670: {'lr': 0.0004316999714791569, 'samples': 7232640, 'steps': 37669, 'loss/train': 1.571532964706421} 11/07/2021 02:31:58 - INFO - __main__ - Step 37671: {'lr': 0.0004316963265047169, 'samples': 7232832, 'steps': 37670, 'loss/train': 1.8768014907836914} 11/07/2021 02:31:58 - INFO - __main__ - Step 37672: {'lr': 0.00043169268144840726, 'samples': 7233024, 'steps': 37671, 'loss/train': 1.411942481994629} 11/07/2021 02:31:58 - INFO - __main__ - Step 37673: {'lr': 0.0004316890363102298, 'samples': 7233216, 'steps': 37672, 'loss/train': 1.3016736507415771} 11/07/2021 02:31:59 - INFO - __main__ - Step 37674: {'lr': 0.000431685391090186, 'samples': 7233408, 'steps': 37673, 'loss/train': 1.345513939857483} 11/07/2021 02:31:59 - INFO - __main__ - Step 37675: {'lr': 0.00043168174578827755, 'samples': 7233600, 'steps': 37674, 'loss/train': 1.5534780025482178} 11/07/2021 02:32:00 - INFO - __main__ - Step 37676: {'lr': 0.00043167810040450617, 'samples': 7233792, 'steps': 37675, 'loss/train': 1.6502912044525146} 11/07/2021 02:32:00 - INFO - __main__ - Step 37677: {'lr': 0.00043167445493887347, 'samples': 7233984, 'steps': 37676, 'loss/train': 1.8989665508270264} 11/07/2021 02:32:01 - INFO - __main__ - Step 37678: {'lr': 0.000431670809391381, 'samples': 7234176, 'steps': 37677, 'loss/train': 1.4615715742111206} 11/07/2021 02:32:01 - INFO - __main__ - Step 37679: {'lr': 0.00043166716376203047, 'samples': 7234368, 'steps': 37678, 'loss/train': 1.699442744255066} 11/07/2021 02:32:02 - INFO - __main__ - Step 37680: {'lr': 0.0004316635180508235, 'samples': 7234560, 'steps': 37679, 'loss/train': 1.6097488403320312} 11/07/2021 02:32:03 - INFO - __main__ - Step 37681: {'lr': 0.0004316598722577618, 'samples': 7234752, 'steps': 37680, 'loss/train': 1.2849406003952026} 11/07/2021 02:32:03 - INFO - __main__ - Step 37682: {'lr': 0.000431656226382847, 'samples': 7234944, 'steps': 37681, 'loss/train': 1.5217424631118774} 11/07/2021 02:32:03 - INFO - __main__ - Step 37683: {'lr': 0.00043165258042608055, 'samples': 7235136, 'steps': 37682, 'loss/train': 1.7335761785507202} 11/07/2021 02:32:04 - INFO - __main__ - Step 37684: {'lr': 0.0004316489343874644, 'samples': 7235328, 'steps': 37683, 'loss/train': 1.3470348119735718} 11/07/2021 02:32:04 - INFO - __main__ - Step 37685: {'lr': 0.000431645288267, 'samples': 7235520, 'steps': 37684, 'loss/train': 1.5445353984832764} 11/07/2021 02:32:05 - INFO - __main__ - Step 37686: {'lr': 0.00043164164206468904, 'samples': 7235712, 'steps': 37685, 'loss/train': 0.9622970819473267} 11/07/2021 02:32:05 - INFO - __main__ - Step 37687: {'lr': 0.00043163799578053313, 'samples': 7235904, 'steps': 37686, 'loss/train': 1.0782309770584106} 11/07/2021 02:32:06 - INFO - __main__ - Step 37688: {'lr': 0.00043163434941453395, 'samples': 7236096, 'steps': 37687, 'loss/train': 1.3358571529388428} 11/07/2021 02:32:06 - INFO - __main__ - Step 37689: {'lr': 0.00043163070296669317, 'samples': 7236288, 'steps': 37688, 'loss/train': 1.1718648672103882} 11/07/2021 02:32:06 - INFO - __main__ - Step 37690: {'lr': 0.00043162705643701236, 'samples': 7236480, 'steps': 37689, 'loss/train': 1.4089534282684326} 11/07/2021 02:32:07 - INFO - __main__ - Step 37691: {'lr': 0.00043162340982549327, 'samples': 7236672, 'steps': 37690, 'loss/train': 1.8753679990768433} 11/07/2021 02:32:08 - INFO - __main__ - Step 37692: {'lr': 0.00043161976313213735, 'samples': 7236864, 'steps': 37691, 'loss/train': 1.8158597946166992} 11/07/2021 02:32:08 - INFO - __main__ - Step 37693: {'lr': 0.0004316161163569465, 'samples': 7237056, 'steps': 37692, 'loss/train': 1.4391977787017822} 11/07/2021 02:32:08 - INFO - __main__ - Step 37694: {'lr': 0.0004316124694999222, 'samples': 7237248, 'steps': 37693, 'loss/train': 1.347544550895691} 11/07/2021 02:32:09 - INFO - __main__ - Step 37695: {'lr': 0.000431608822561066, 'samples': 7237440, 'steps': 37694, 'loss/train': 1.5738919973373413} 11/07/2021 02:32:10 - INFO - __main__ - Step 37696: {'lr': 0.0004316051755403798, 'samples': 7237632, 'steps': 37695, 'loss/train': 1.808718204498291} 11/07/2021 02:32:10 - INFO - __main__ - Step 37697: {'lr': 0.000431601528437865, 'samples': 7237824, 'steps': 37696, 'loss/train': 1.71670663356781} 11/07/2021 02:32:10 - INFO - __main__ - Step 37698: {'lr': 0.00043159788125352353, 'samples': 7238016, 'steps': 37697, 'loss/train': 1.798108458518982} 11/07/2021 02:32:11 - INFO - __main__ - Step 37699: {'lr': 0.0004315942339873567, 'samples': 7238208, 'steps': 37698, 'loss/train': 1.7028776407241821} 11/07/2021 02:32:11 - INFO - __main__ - Step 37700: {'lr': 0.00043159058663936635, 'samples': 7238400, 'steps': 37699, 'loss/train': 1.9503159523010254} 11/07/2021 02:32:12 - INFO - __main__ - Step 37701: {'lr': 0.0004315869392095542, 'samples': 7238592, 'steps': 37700, 'loss/train': 1.5541750192642212} 11/07/2021 02:32:13 - INFO - __main__ - Step 37702: {'lr': 0.0004315832916979216, 'samples': 7238784, 'steps': 37701, 'loss/train': 1.2538578510284424} 11/07/2021 02:32:13 - INFO - __main__ - Step 37703: {'lr': 0.00043157964410447047, 'samples': 7238976, 'steps': 37702, 'loss/train': 1.6704691648483276} 11/07/2021 02:32:13 - INFO - __main__ - Step 37704: {'lr': 0.0004315759964292023, 'samples': 7239168, 'steps': 37703, 'loss/train': 0.4037812352180481} 11/07/2021 02:32:14 - INFO - __main__ - Step 37705: {'lr': 0.0004315723486721188, 'samples': 7239360, 'steps': 37704, 'loss/train': 0.9939437508583069} 11/07/2021 02:32:14 - INFO - __main__ - Step 37706: {'lr': 0.00043156870083322166, 'samples': 7239552, 'steps': 37705, 'loss/train': 1.3555794954299927} 11/07/2021 02:32:15 - INFO - __main__ - Step 37707: {'lr': 0.00043156505291251234, 'samples': 7239744, 'steps': 37706, 'loss/train': 1.6051470041275024} 11/07/2021 02:32:16 - INFO - __main__ - Step 37708: {'lr': 0.00043156140490999275, 'samples': 7239936, 'steps': 37707, 'loss/train': 1.7810765504837036} 11/07/2021 02:32:16 - INFO - __main__ - Step 37709: {'lr': 0.0004315577568256643, 'samples': 7240128, 'steps': 37708, 'loss/train': 1.9991899728775024} 11/07/2021 02:32:16 - INFO - __main__ - Step 37710: {'lr': 0.0004315541086595288, 'samples': 7240320, 'steps': 37709, 'loss/train': 1.5813043117523193} 11/07/2021 02:32:17 - INFO - __main__ - Step 37711: {'lr': 0.00043155046041158776, 'samples': 7240512, 'steps': 37710, 'loss/train': 1.4850425720214844} 11/07/2021 02:32:18 - INFO - __main__ - Step 37712: {'lr': 0.0004315468120818429, 'samples': 7240704, 'steps': 37711, 'loss/train': 1.7535170316696167} 11/07/2021 02:32:18 - INFO - __main__ - Step 37713: {'lr': 0.0004315431636702959, 'samples': 7240896, 'steps': 37712, 'loss/train': 1.2823959589004517} 11/07/2021 02:32:18 - INFO - __main__ - Step 37714: {'lr': 0.00043153951517694824, 'samples': 7241088, 'steps': 37713, 'loss/train': 1.300570011138916} 11/07/2021 02:32:19 - INFO - __main__ - Step 37715: {'lr': 0.0004315358666018018, 'samples': 7241280, 'steps': 37714, 'loss/train': 1.6309016942977905} 11/07/2021 02:32:19 - INFO - __main__ - Step 37716: {'lr': 0.00043153221794485795, 'samples': 7241472, 'steps': 37715, 'loss/train': 0.8364524245262146} 11/07/2021 02:32:20 - INFO - __main__ - Step 37717: {'lr': 0.0004315285692061186, 'samples': 7241664, 'steps': 37716, 'loss/train': 1.4671432971954346} 11/07/2021 02:32:20 - INFO - __main__ - Step 37718: {'lr': 0.00043152492038558526, 'samples': 7241856, 'steps': 37717, 'loss/train': 1.6024607419967651} 11/07/2021 02:32:21 - INFO - __main__ - Step 37719: {'lr': 0.00043152127148325957, 'samples': 7242048, 'steps': 37718, 'loss/train': 1.3017303943634033} 11/07/2021 02:32:21 - INFO - __main__ - Step 37720: {'lr': 0.00043151762249914324, 'samples': 7242240, 'steps': 37719, 'loss/train': 1.588016152381897} 11/07/2021 02:32:21 - INFO - __main__ - Step 37721: {'lr': 0.00043151397343323784, 'samples': 7242432, 'steps': 37720, 'loss/train': 1.9084659814834595} 11/07/2021 02:32:23 - INFO - __main__ - Step 37722: {'lr': 0.00043151032428554505, 'samples': 7242624, 'steps': 37721, 'loss/train': 1.6454074382781982} 11/07/2021 02:32:23 - INFO - __main__ - Step 37723: {'lr': 0.0004315066750560665, 'samples': 7242816, 'steps': 37722, 'loss/train': 0.9799754023551941} 11/07/2021 02:32:23 - INFO - __main__ - Step 37724: {'lr': 0.0004315030257448038, 'samples': 7243008, 'steps': 37723, 'loss/train': 1.5956308841705322} 11/07/2021 02:32:24 - INFO - __main__ - Step 37725: {'lr': 0.00043149937635175874, 'samples': 7243200, 'steps': 37724, 'loss/train': 1.3453290462493896} 11/07/2021 02:32:24 - INFO - __main__ - Step 37726: {'lr': 0.0004314957268769328, 'samples': 7243392, 'steps': 37725, 'loss/train': 1.399642825126648} 11/07/2021 02:32:24 - INFO - __main__ - Step 37727: {'lr': 0.00043149207732032767, 'samples': 7243584, 'steps': 37726, 'loss/train': 1.6732821464538574} 11/07/2021 02:32:25 - INFO - __main__ - Step 37728: {'lr': 0.00043148842768194503, 'samples': 7243776, 'steps': 37727, 'loss/train': 1.9138143062591553} 11/07/2021 02:32:26 - INFO - __main__ - Step 37729: {'lr': 0.0004314847779617865, 'samples': 7243968, 'steps': 37728, 'loss/train': 1.4180481433868408} 11/07/2021 02:32:26 - INFO - __main__ - Step 37730: {'lr': 0.00043148112815985377, 'samples': 7244160, 'steps': 37729, 'loss/train': 1.4895565509796143} 11/07/2021 02:32:27 - INFO - __main__ - Step 37731: {'lr': 0.0004314774782761484, 'samples': 7244352, 'steps': 37730, 'loss/train': 1.2930500507354736} 11/07/2021 02:32:27 - INFO - __main__ - Step 37732: {'lr': 0.00043147382831067204, 'samples': 7244544, 'steps': 37731, 'loss/train': 1.820050597190857} 11/07/2021 02:32:28 - INFO - __main__ - Step 37733: {'lr': 0.0004314701782634264, 'samples': 7244736, 'steps': 37732, 'loss/train': 1.4728184938430786} 11/07/2021 02:32:28 - INFO - __main__ - Step 37734: {'lr': 0.0004314665281344132, 'samples': 7244928, 'steps': 37733, 'loss/train': 1.8691281080245972} 11/07/2021 02:32:29 - INFO - __main__ - Step 37735: {'lr': 0.0004314628779236339, 'samples': 7245120, 'steps': 37734, 'loss/train': 1.8617697954177856} 11/07/2021 02:32:29 - INFO - __main__ - Step 37736: {'lr': 0.00043145922763109017, 'samples': 7245312, 'steps': 37735, 'loss/train': 0.6751835942268372} 11/07/2021 02:32:29 - INFO - __main__ - Step 37737: {'lr': 0.0004314555772567838, 'samples': 7245504, 'steps': 37736, 'loss/train': 1.458873987197876} 11/07/2021 02:32:30 - INFO - __main__ - Step 37738: {'lr': 0.0004314519268007163, 'samples': 7245696, 'steps': 37737, 'loss/train': 1.4394382238388062} 11/07/2021 02:32:31 - INFO - __main__ - Step 37739: {'lr': 0.00043144827626288943, 'samples': 7245888, 'steps': 37738, 'loss/train': 1.5663368701934814} 11/07/2021 02:32:31 - INFO - __main__ - Step 37740: {'lr': 0.00043144462564330464, 'samples': 7246080, 'steps': 37739, 'loss/train': 2.232424020767212} 11/07/2021 02:32:31 - INFO - __main__ - Step 37741: {'lr': 0.0004314409749419638, 'samples': 7246272, 'steps': 37740, 'loss/train': 1.4186749458312988} 11/07/2021 02:32:32 - INFO - __main__ - Step 37742: {'lr': 0.00043143732415886843, 'samples': 7246464, 'steps': 37741, 'loss/train': 1.548527717590332} 11/07/2021 02:32:32 - INFO - __main__ - Step 37743: {'lr': 0.0004314336732940202, 'samples': 7246656, 'steps': 37742, 'loss/train': 0.8476759195327759} 11/07/2021 02:32:33 - INFO - __main__ - Step 37744: {'lr': 0.0004314300223474208, 'samples': 7246848, 'steps': 37743, 'loss/train': 1.3937143087387085} 11/07/2021 02:32:33 - INFO - __main__ - Step 37745: {'lr': 0.0004314263713190718, 'samples': 7247040, 'steps': 37744, 'loss/train': 0.32582950592041016} 11/07/2021 02:32:34 - INFO - __main__ - Step 37746: {'lr': 0.00043142272020897486, 'samples': 7247232, 'steps': 37745, 'loss/train': 1.4074898958206177} 11/07/2021 02:32:34 - INFO - __main__ - Step 37747: {'lr': 0.0004314190690171317, 'samples': 7247424, 'steps': 37746, 'loss/train': 1.3477909564971924} 11/07/2021 02:32:35 - INFO - __main__ - Step 37748: {'lr': 0.0004314154177435438, 'samples': 7247616, 'steps': 37747, 'loss/train': 1.3964956998825073} 11/07/2021 02:32:36 - INFO - __main__ - Step 37749: {'lr': 0.000431411766388213, 'samples': 7247808, 'steps': 37748, 'loss/train': 1.625677466392517} 11/07/2021 02:32:36 - INFO - __main__ - Step 37750: {'lr': 0.0004314081149511409, 'samples': 7248000, 'steps': 37749, 'loss/train': 1.4735047817230225} 11/07/2021 02:32:36 - INFO - __main__ - Step 37751: {'lr': 0.00043140446343232895, 'samples': 7248192, 'steps': 37750, 'loss/train': 1.6033672094345093} 11/07/2021 02:32:37 - INFO - __main__ - Step 37752: {'lr': 0.000431400811831779, 'samples': 7248384, 'steps': 37751, 'loss/train': 1.698687195777893} 11/07/2021 02:32:37 - INFO - __main__ - Step 37753: {'lr': 0.0004313971601494927, 'samples': 7248576, 'steps': 37752, 'loss/train': 1.3801158666610718} 11/07/2021 02:32:38 - INFO - __main__ - Step 37754: {'lr': 0.0004313935083854716, 'samples': 7248768, 'steps': 37753, 'loss/train': 1.691262125968933} 11/07/2021 02:32:38 - INFO - __main__ - Step 37755: {'lr': 0.0004313898565397174, 'samples': 7248960, 'steps': 37754, 'loss/train': 1.657094955444336} 11/07/2021 02:32:39 - INFO - __main__ - Step 37756: {'lr': 0.00043138620461223175, 'samples': 7249152, 'steps': 37755, 'loss/train': 1.5319207906723022} 11/07/2021 02:32:39 - INFO - __main__ - Step 37757: {'lr': 0.00043138255260301625, 'samples': 7249344, 'steps': 37756, 'loss/train': 1.534110188484192} 11/07/2021 02:32:39 - INFO - __main__ - Step 37758: {'lr': 0.0004313789005120725, 'samples': 7249536, 'steps': 37757, 'loss/train': 1.7224847078323364} 11/07/2021 02:32:40 - INFO - __main__ - Step 37759: {'lr': 0.00043137524833940233, 'samples': 7249728, 'steps': 37758, 'loss/train': 1.6888084411621094} 11/07/2021 02:32:41 - INFO - __main__ - Step 37760: {'lr': 0.0004313715960850072, 'samples': 7249920, 'steps': 37759, 'loss/train': 1.3553645610809326} 11/07/2021 02:32:41 - INFO - __main__ - Step 37761: {'lr': 0.00043136794374888887, 'samples': 7250112, 'steps': 37760, 'loss/train': 1.6526095867156982} 11/07/2021 02:32:41 - INFO - __main__ - Step 37762: {'lr': 0.0004313642913310489, 'samples': 7250304, 'steps': 37761, 'loss/train': 1.6274479627609253} 11/07/2021 02:32:42 - INFO - __main__ - Step 37763: {'lr': 0.00043136063883148905, 'samples': 7250496, 'steps': 37762, 'loss/train': 1.5382124185562134} 11/07/2021 02:32:43 - INFO - __main__ - Step 37764: {'lr': 0.00043135698625021093, 'samples': 7250688, 'steps': 37763, 'loss/train': 1.6963647603988647} 11/07/2021 02:32:43 - INFO - __main__ - Step 37765: {'lr': 0.000431353333587216, 'samples': 7250880, 'steps': 37764, 'loss/train': 1.4143861532211304} 11/07/2021 02:32:44 - INFO - __main__ - Step 37766: {'lr': 0.00043134968084250616, 'samples': 7251072, 'steps': 37765, 'loss/train': 1.3447948694229126} 11/07/2021 02:32:44 - INFO - __main__ - Step 37767: {'lr': 0.00043134602801608293, 'samples': 7251264, 'steps': 37766, 'loss/train': 1.1817660331726074} 11/07/2021 02:32:44 - INFO - __main__ - Step 37768: {'lr': 0.00043134237510794794, 'samples': 7251456, 'steps': 37767, 'loss/train': 0.40962114930152893} 11/07/2021 02:32:45 - INFO - __main__ - Step 37769: {'lr': 0.0004313387221181029, 'samples': 7251648, 'steps': 37768, 'loss/train': 1.8461116552352905} 11/07/2021 02:32:46 - INFO - __main__ - Step 37770: {'lr': 0.0004313350690465495, 'samples': 7251840, 'steps': 37769, 'loss/train': 5.757008075714111} 11/07/2021 02:32:46 - INFO - __main__ - Step 37771: {'lr': 0.00043133141589328923, 'samples': 7252032, 'steps': 37770, 'loss/train': 1.4343360662460327} 11/07/2021 02:32:46 - INFO - __main__ - Step 37772: {'lr': 0.0004313277626583239, 'samples': 7252224, 'steps': 37771, 'loss/train': 1.7761224508285522} 11/07/2021 02:32:47 - INFO - __main__ - Step 37773: {'lr': 0.000431324109341655, 'samples': 7252416, 'steps': 37772, 'loss/train': 1.9840986728668213} 11/07/2021 02:32:47 - INFO - __main__ - Step 37774: {'lr': 0.0004313204559432842, 'samples': 7252608, 'steps': 37773, 'loss/train': 1.5764964818954468} 11/07/2021 02:32:48 - INFO - __main__ - Step 37775: {'lr': 0.0004313168024632133, 'samples': 7252800, 'steps': 37774, 'loss/train': 1.5473870038986206} 11/07/2021 02:32:48 - INFO - __main__ - Step 37776: {'lr': 0.00043131314890144386, 'samples': 7252992, 'steps': 37775, 'loss/train': 1.4710793495178223} 11/07/2021 02:32:49 - INFO - __main__ - Step 37777: {'lr': 0.0004313094952579775, 'samples': 7253184, 'steps': 37776, 'loss/train': 1.7181931734085083} 11/07/2021 02:32:49 - INFO - __main__ - Step 37778: {'lr': 0.0004313058415328158, 'samples': 7253376, 'steps': 37777, 'loss/train': 1.3149781227111816} 11/07/2021 02:32:49 - INFO - __main__ - Step 37779: {'lr': 0.00043130218772596053, 'samples': 7253568, 'steps': 37778, 'loss/train': 1.5012356042861938} 11/07/2021 02:32:50 - INFO - __main__ - Step 37780: {'lr': 0.00043129853383741334, 'samples': 7253760, 'steps': 37779, 'loss/train': 1.810323715209961} 11/07/2021 02:32:51 - INFO - __main__ - Step 37781: {'lr': 0.00043129487986717574, 'samples': 7253952, 'steps': 37780, 'loss/train': 1.4915090799331665} 11/07/2021 02:32:51 - INFO - __main__ - Step 37782: {'lr': 0.00043129122581524957, 'samples': 7254144, 'steps': 37781, 'loss/train': 0.9132309556007385} 11/07/2021 02:32:52 - INFO - __main__ - Step 37783: {'lr': 0.0004312875716816363, 'samples': 7254336, 'steps': 37782, 'loss/train': 1.672573447227478} 11/07/2021 02:32:52 - INFO - __main__ - Step 37784: {'lr': 0.0004312839174663377, 'samples': 7254528, 'steps': 37783, 'loss/train': 1.327996850013733} 11/07/2021 02:32:53 - INFO - __main__ - Step 37785: {'lr': 0.0004312802631693553, 'samples': 7254720, 'steps': 37784, 'loss/train': 1.4877172708511353} 11/07/2021 02:32:53 - INFO - __main__ - Step 37786: {'lr': 0.00043127660879069084, 'samples': 7254912, 'steps': 37785, 'loss/train': 1.5174921751022339} 11/07/2021 02:32:54 - INFO - __main__ - Step 37787: {'lr': 0.00043127295433034594, 'samples': 7255104, 'steps': 37786, 'loss/train': 1.538757562637329} 11/07/2021 02:32:54 - INFO - __main__ - Step 37788: {'lr': 0.00043126929978832217, 'samples': 7255296, 'steps': 37787, 'loss/train': 1.6394915580749512} 11/07/2021 02:32:54 - INFO - __main__ - Step 37789: {'lr': 0.00043126564516462134, 'samples': 7255488, 'steps': 37788, 'loss/train': 1.9435192346572876} 11/07/2021 02:32:55 - INFO - __main__ - Step 37790: {'lr': 0.000431261990459245, 'samples': 7255680, 'steps': 37789, 'loss/train': 2.3344905376434326} 11/07/2021 02:32:56 - INFO - __main__ - Step 37791: {'lr': 0.0004312583356721948, 'samples': 7255872, 'steps': 37790, 'loss/train': 1.6491162776947021} 11/07/2021 02:32:56 - INFO - __main__ - Step 37792: {'lr': 0.0004312546808034724, 'samples': 7256064, 'steps': 37791, 'loss/train': 1.3833657503128052} 11/07/2021 02:32:56 - INFO - __main__ - Step 37793: {'lr': 0.0004312510258530794, 'samples': 7256256, 'steps': 37792, 'loss/train': 1.8957017660140991} 11/07/2021 02:32:57 - INFO - __main__ - Step 37794: {'lr': 0.0004312473708210175, 'samples': 7256448, 'steps': 37793, 'loss/train': 1.3169971704483032} 11/07/2021 02:32:57 - INFO - __main__ - Step 37795: {'lr': 0.0004312437157072884, 'samples': 7256640, 'steps': 37794, 'loss/train': 0.8353274464607239} 11/07/2021 02:32:59 - INFO - __main__ - Step 37796: {'lr': 0.00043124006051189356, 'samples': 7256832, 'steps': 37795, 'loss/train': 1.1799557209014893} 11/07/2021 02:32:59 - INFO - __main__ - Step 37797: {'lr': 0.0004312364052348348, 'samples': 7257024, 'steps': 37796, 'loss/train': 1.508033037185669} 11/07/2021 02:32:59 - INFO - __main__ - Step 37798: {'lr': 0.0004312327498761137, 'samples': 7257216, 'steps': 37797, 'loss/train': 0.404900461435318} 11/07/2021 02:33:00 - INFO - __main__ - Step 37799: {'lr': 0.000431229094435732, 'samples': 7257408, 'steps': 37798, 'loss/train': 1.551729440689087} 11/07/2021 02:33:00 - INFO - __main__ - Step 37800: {'lr': 0.0004312254389136911, 'samples': 7257600, 'steps': 37799, 'loss/train': 1.2227293252944946} 11/07/2021 02:33:01 - INFO - __main__ - Step 37801: {'lr': 0.00043122178330999296, 'samples': 7257792, 'steps': 37800, 'loss/train': 1.5652748346328735} 11/07/2021 02:33:01 - INFO - __main__ - Step 37802: {'lr': 0.0004312181276246391, 'samples': 7257984, 'steps': 37801, 'loss/train': 1.7289127111434937} 11/07/2021 02:33:02 - INFO - __main__ - Step 37803: {'lr': 0.00043121447185763106, 'samples': 7258176, 'steps': 37802, 'loss/train': 1.5276554822921753} 11/07/2021 02:33:02 - INFO - __main__ - Step 37804: {'lr': 0.0004312108160089706, 'samples': 7258368, 'steps': 37803, 'loss/train': 1.2928982973098755} 11/07/2021 02:33:03 - INFO - __main__ - Step 37805: {'lr': 0.00043120716007865933, 'samples': 7258560, 'steps': 37804, 'loss/train': 1.6584603786468506} 11/07/2021 02:33:03 - INFO - __main__ - Step 37806: {'lr': 0.0004312035040666989, 'samples': 7258752, 'steps': 37805, 'loss/train': 0.6836652755737305} 11/07/2021 02:33:04 - INFO - __main__ - Step 37807: {'lr': 0.000431199847973091, 'samples': 7258944, 'steps': 37806, 'loss/train': 1.5036001205444336} 11/07/2021 02:33:04 - INFO - __main__ - Step 37808: {'lr': 0.0004311961917978372, 'samples': 7259136, 'steps': 37807, 'loss/train': 1.5429991483688354} 11/07/2021 02:33:05 - INFO - __main__ - Step 37809: {'lr': 0.0004311925355409393, 'samples': 7259328, 'steps': 37808, 'loss/train': 1.3604049682617188} 11/07/2021 02:33:05 - INFO - __main__ - Step 37810: {'lr': 0.00043118887920239876, 'samples': 7259520, 'steps': 37809, 'loss/train': 1.240416407585144} 11/07/2021 02:33:06 - INFO - __main__ - Step 37811: {'lr': 0.00043118522278221726, 'samples': 7259712, 'steps': 37810, 'loss/train': 1.3365050554275513} 11/07/2021 02:33:06 - INFO - __main__ - Step 37812: {'lr': 0.0004311815662803966, 'samples': 7259904, 'steps': 37811, 'loss/train': 1.4456079006195068} 11/07/2021 02:33:07 - INFO - __main__ - Step 37813: {'lr': 0.00043117790969693826, 'samples': 7260096, 'steps': 37812, 'loss/train': 1.4940829277038574} 11/07/2021 02:33:07 - INFO - __main__ - Step 37814: {'lr': 0.00043117425303184395, 'samples': 7260288, 'steps': 37813, 'loss/train': 1.4381945133209229} 11/07/2021 02:33:07 - INFO - __main__ - Step 37815: {'lr': 0.0004311705962851153, 'samples': 7260480, 'steps': 37814, 'loss/train': 1.5033267736434937} 11/07/2021 02:33:10 - INFO - __main__ - Step 37816: {'lr': 0.000431166939456754, 'samples': 7260672, 'steps': 37815, 'loss/train': 1.637882947921753} 11/07/2021 02:33:10 - INFO - __main__ - Step 37817: {'lr': 0.0004311632825467617, 'samples': 7260864, 'steps': 37816, 'loss/train': 1.7710007429122925} 11/07/2021 02:33:10 - INFO - __main__ - Step 37818: {'lr': 0.00043115962555514, 'samples': 7261056, 'steps': 37817, 'loss/train': 1.43278968334198} 11/07/2021 02:33:11 - INFO - __main__ - Step 37819: {'lr': 0.0004311559684818905, 'samples': 7261248, 'steps': 37818, 'loss/train': 1.7777564525604248} 11/07/2021 02:33:11 - INFO - __main__ - Step 37820: {'lr': 0.000431152311327015, 'samples': 7261440, 'steps': 37819, 'loss/train': 1.7829294204711914} 11/07/2021 02:33:12 - INFO - __main__ - Step 37821: {'lr': 0.00043114865409051505, 'samples': 7261632, 'steps': 37820, 'loss/train': 1.788772463798523} 11/07/2021 02:33:12 - INFO - __main__ - Step 37822: {'lr': 0.0004311449967723923, 'samples': 7261824, 'steps': 37821, 'loss/train': 1.7570000886917114} 11/07/2021 02:33:12 - INFO - __main__ - Step 37823: {'lr': 0.00043114133937264843, 'samples': 7262016, 'steps': 37822, 'loss/train': 1.4681305885314941} 11/07/2021 02:33:13 - INFO - __main__ - Step 37824: {'lr': 0.000431137681891285, 'samples': 7262208, 'steps': 37823, 'loss/train': 1.901205062866211} 11/07/2021 02:33:14 - INFO - __main__ - Step 37825: {'lr': 0.0004311340243283038, 'samples': 7262400, 'steps': 37824, 'loss/train': 2.0343801975250244} 11/07/2021 02:33:14 - INFO - __main__ - Step 37826: {'lr': 0.0004311303666837064, 'samples': 7262592, 'steps': 37825, 'loss/train': 1.4734593629837036} 11/07/2021 02:33:14 - INFO - __main__ - Step 37827: {'lr': 0.0004311267089574944, 'samples': 7262784, 'steps': 37826, 'loss/train': 1.7053385972976685} 11/07/2021 02:33:15 - INFO - __main__ - Step 37828: {'lr': 0.00043112305114966957, 'samples': 7262976, 'steps': 37827, 'loss/train': 0.6940966248512268} 11/07/2021 02:33:16 - INFO - __main__ - Step 37829: {'lr': 0.0004311193932602334, 'samples': 7263168, 'steps': 37828, 'loss/train': 1.3707382678985596} 11/07/2021 02:33:16 - INFO - __main__ - Step 37830: {'lr': 0.0004311157352891877, 'samples': 7263360, 'steps': 37829, 'loss/train': 1.5984339714050293} 11/07/2021 02:33:16 - INFO - __main__ - Step 37831: {'lr': 0.000431112077236534, 'samples': 7263552, 'steps': 37830, 'loss/train': 1.326088547706604} 11/07/2021 02:33:17 - INFO - __main__ - Step 37832: {'lr': 0.0004311084191022741, 'samples': 7263744, 'steps': 37831, 'loss/train': 1.5728517770767212} 11/07/2021 02:33:17 - INFO - __main__ - Step 37833: {'lr': 0.00043110476088640935, 'samples': 7263936, 'steps': 37832, 'loss/train': 1.3312963247299194} 11/07/2021 02:33:18 - INFO - __main__ - Step 37834: {'lr': 0.00043110110258894177, 'samples': 7264128, 'steps': 37833, 'loss/train': 1.6761177778244019} 11/07/2021 02:33:18 - INFO - __main__ - Step 37835: {'lr': 0.00043109744420987274, 'samples': 7264320, 'steps': 37834, 'loss/train': 1.7098302841186523} 11/07/2021 02:33:19 - INFO - __main__ - Step 37836: {'lr': 0.000431093785749204, 'samples': 7264512, 'steps': 37835, 'loss/train': 1.6409319639205933} 11/07/2021 02:33:19 - INFO - __main__ - Step 37837: {'lr': 0.00043109012720693717, 'samples': 7264704, 'steps': 37836, 'loss/train': 1.404793381690979} 11/07/2021 02:33:19 - INFO - __main__ - Step 37838: {'lr': 0.000431086468583074, 'samples': 7264896, 'steps': 37837, 'loss/train': 1.2502204179763794} 11/07/2021 02:33:20 - INFO - __main__ - Step 37839: {'lr': 0.00043108280987761593, 'samples': 7265088, 'steps': 37838, 'loss/train': 2.034834146499634} 11/07/2021 02:33:21 - INFO - __main__ - Step 37840: {'lr': 0.0004310791510905649, 'samples': 7265280, 'steps': 37839, 'loss/train': 1.233948826789856} 11/07/2021 02:33:22 - INFO - __main__ - Step 37841: {'lr': 0.00043107549222192235, 'samples': 7265472, 'steps': 37840, 'loss/train': 0.8641500473022461} 11/07/2021 02:33:22 - INFO - __main__ - Step 37842: {'lr': 0.0004310718332716899, 'samples': 7265664, 'steps': 37841, 'loss/train': 0.9060410261154175} 11/07/2021 02:33:22 - INFO - __main__ - Step 37843: {'lr': 0.00043106817423986933, 'samples': 7265856, 'steps': 37842, 'loss/train': 1.6606569290161133} 11/07/2021 02:33:23 - INFO - __main__ - Step 37844: {'lr': 0.00043106451512646226, 'samples': 7266048, 'steps': 37843, 'loss/train': 1.746648907661438} 11/07/2021 02:33:24 - INFO - __main__ - Step 37845: {'lr': 0.00043106085593147027, 'samples': 7266240, 'steps': 37844, 'loss/train': 2.3888840675354004} 11/07/2021 02:33:24 - INFO - __main__ - Step 37846: {'lr': 0.00043105719665489505, 'samples': 7266432, 'steps': 37845, 'loss/train': 2.0307137966156006} 11/07/2021 02:33:24 - INFO - __main__ - Step 37847: {'lr': 0.0004310535372967383, 'samples': 7266624, 'steps': 37846, 'loss/train': 1.5262460708618164} 11/07/2021 02:33:25 - INFO - __main__ - Step 37848: {'lr': 0.0004310498778570016, 'samples': 7266816, 'steps': 37847, 'loss/train': 1.5267208814620972} 11/07/2021 02:33:25 - INFO - __main__ - Step 37849: {'lr': 0.0004310462183356866, 'samples': 7267008, 'steps': 37848, 'loss/train': 1.6285754442214966} 11/07/2021 02:33:26 - INFO - __main__ - Step 37850: {'lr': 0.00043104255873279497, 'samples': 7267200, 'steps': 37849, 'loss/train': 1.5782309770584106} 11/07/2021 02:33:26 - INFO - __main__ - Step 37851: {'lr': 0.00043103889904832837, 'samples': 7267392, 'steps': 37850, 'loss/train': 1.5631217956542969} 11/07/2021 02:33:27 - INFO - __main__ - Step 37852: {'lr': 0.0004310352392822884, 'samples': 7267584, 'steps': 37851, 'loss/train': 1.2980358600616455} 11/07/2021 02:33:27 - INFO - __main__ - Step 37853: {'lr': 0.00043103157943467674, 'samples': 7267776, 'steps': 37852, 'loss/train': 1.319003939628601} 11/07/2021 02:33:27 - INFO - __main__ - Step 37854: {'lr': 0.00043102791950549513, 'samples': 7267968, 'steps': 37853, 'loss/train': 1.1419496536254883} 11/07/2021 02:33:28 - INFO - __main__ - Step 37855: {'lr': 0.00043102425949474504, 'samples': 7268160, 'steps': 37854, 'loss/train': 0.9001447558403015} 11/07/2021 02:33:29 - INFO - __main__ - Step 37856: {'lr': 0.00043102059940242825, 'samples': 7268352, 'steps': 37855, 'loss/train': 1.6300129890441895} 11/07/2021 02:33:29 - INFO - __main__ - Step 37857: {'lr': 0.0004310169392285464, 'samples': 7268544, 'steps': 37856, 'loss/train': 1.0531177520751953} 11/07/2021 02:33:30 - INFO - __main__ - Step 37858: {'lr': 0.0004310132789731011, 'samples': 7268736, 'steps': 37857, 'loss/train': 0.9698438048362732} 11/07/2021 02:33:30 - INFO - __main__ - Step 37859: {'lr': 0.000431009618636094, 'samples': 7268928, 'steps': 37858, 'loss/train': 2.036407232284546} 11/07/2021 02:33:31 - INFO - __main__ - Step 37860: {'lr': 0.00043100595821752674, 'samples': 7269120, 'steps': 37859, 'loss/train': 1.3307468891143799} 11/07/2021 02:33:31 - INFO - __main__ - Step 37861: {'lr': 0.00043100229771740096, 'samples': 7269312, 'steps': 37860, 'loss/train': 1.7175347805023193} 11/07/2021 02:33:32 - INFO - __main__ - Step 37862: {'lr': 0.0004309986371357184, 'samples': 7269504, 'steps': 37861, 'loss/train': 1.4010646343231201} 11/07/2021 02:33:32 - INFO - __main__ - Step 37863: {'lr': 0.00043099497647248065, 'samples': 7269696, 'steps': 37862, 'loss/train': 1.5942286252975464} 11/07/2021 02:33:32 - INFO - __main__ - Step 37864: {'lr': 0.00043099131572768936, 'samples': 7269888, 'steps': 37863, 'loss/train': 1.4234338998794556} 11/07/2021 02:33:33 - INFO - __main__ - Step 37865: {'lr': 0.00043098765490134607, 'samples': 7270080, 'steps': 37864, 'loss/train': 1.3859913349151611} 11/07/2021 02:33:34 - INFO - __main__ - Step 37866: {'lr': 0.00043098399399345267, 'samples': 7270272, 'steps': 37865, 'loss/train': 1.5256638526916504} 11/07/2021 02:33:34 - INFO - __main__ - Step 37867: {'lr': 0.0004309803330040106, 'samples': 7270464, 'steps': 37866, 'loss/train': 1.1439716815948486} 11/07/2021 02:33:34 - INFO - __main__ - Step 37868: {'lr': 0.0004309766719330216, 'samples': 7270656, 'steps': 37867, 'loss/train': 1.7910076379776} 11/07/2021 02:33:35 - INFO - __main__ - Step 37869: {'lr': 0.00043097301078048736, 'samples': 7270848, 'steps': 37868, 'loss/train': 2.2756893634796143} 11/07/2021 02:33:35 - INFO - __main__ - Step 37870: {'lr': 0.00043096934954640935, 'samples': 7271040, 'steps': 37869, 'loss/train': 0.8272931575775146} 11/07/2021 02:33:36 - INFO - __main__ - Step 37871: {'lr': 0.0004309656882307894, 'samples': 7271232, 'steps': 37870, 'loss/train': 1.467457890510559} 11/07/2021 02:33:36 - INFO - __main__ - Step 37872: {'lr': 0.0004309620268336292, 'samples': 7271424, 'steps': 37871, 'loss/train': 1.2793571949005127} 11/07/2021 02:33:37 - INFO - __main__ - Step 37873: {'lr': 0.0004309583653549302, 'samples': 7271616, 'steps': 37872, 'loss/train': 1.2705497741699219} 11/07/2021 02:33:37 - INFO - __main__ - Step 37874: {'lr': 0.0004309547037946941, 'samples': 7271808, 'steps': 37873, 'loss/train': 1.276893138885498} 11/07/2021 02:33:37 - INFO - __main__ - Step 37875: {'lr': 0.0004309510421529227, 'samples': 7272000, 'steps': 37874, 'loss/train': 1.6498548984527588} 11/07/2021 02:33:39 - INFO - __main__ - Step 37876: {'lr': 0.00043094738042961754, 'samples': 7272192, 'steps': 37875, 'loss/train': 1.8039900064468384} 11/07/2021 02:33:39 - INFO - __main__ - Step 37877: {'lr': 0.0004309437186247803, 'samples': 7272384, 'steps': 37876, 'loss/train': 1.3540936708450317} 11/07/2021 02:33:39 - INFO - __main__ - Step 37878: {'lr': 0.00043094005673841257, 'samples': 7272576, 'steps': 37877, 'loss/train': 1.3922836780548096} 11/07/2021 02:33:40 - INFO - __main__ - Step 37879: {'lr': 0.00043093639477051606, 'samples': 7272768, 'steps': 37878, 'loss/train': 1.6267963647842407} 11/07/2021 02:33:40 - INFO - __main__ - Step 37880: {'lr': 0.0004309327327210923, 'samples': 7272960, 'steps': 37879, 'loss/train': 1.459894061088562} 11/07/2021 02:33:41 - INFO - __main__ - Step 37881: {'lr': 0.00043092907059014325, 'samples': 7273152, 'steps': 37880, 'loss/train': 1.5659189224243164} 11/07/2021 02:33:41 - INFO - __main__ - Step 37882: {'lr': 0.00043092540837767025, 'samples': 7273344, 'steps': 37881, 'loss/train': 1.1091625690460205} 11/07/2021 02:33:42 - INFO - __main__ - Step 37883: {'lr': 0.000430921746083675, 'samples': 7273536, 'steps': 37882, 'loss/train': 1.4476122856140137} 11/07/2021 02:33:42 - INFO - __main__ - Step 37884: {'lr': 0.00043091808370815935, 'samples': 7273728, 'steps': 37883, 'loss/train': 5.744571685791016} 11/07/2021 02:33:42 - INFO - __main__ - Step 37885: {'lr': 0.0004309144212511246, 'samples': 7273920, 'steps': 37884, 'loss/train': 2.258639097213745} 11/07/2021 02:33:43 - INFO - __main__ - Step 37886: {'lr': 0.00043091075871257275, 'samples': 7274112, 'steps': 37885, 'loss/train': 1.553177833557129} 11/07/2021 02:33:44 - INFO - __main__ - Step 37887: {'lr': 0.0004309070960925052, 'samples': 7274304, 'steps': 37886, 'loss/train': 1.8596436977386475} 11/07/2021 02:33:44 - INFO - __main__ - Step 37888: {'lr': 0.0004309034333909238, 'samples': 7274496, 'steps': 37887, 'loss/train': 0.9980459809303284} 11/07/2021 02:33:45 - INFO - __main__ - Step 37889: {'lr': 0.0004308997706078301, 'samples': 7274688, 'steps': 37888, 'loss/train': 1.687778115272522} 11/07/2021 02:33:45 - INFO - __main__ - Step 37890: {'lr': 0.00043089610774322575, 'samples': 7274880, 'steps': 37889, 'loss/train': 1.6858359575271606} 11/07/2021 02:33:45 - INFO - __main__ - Step 37891: {'lr': 0.00043089244479711233, 'samples': 7275072, 'steps': 37890, 'loss/train': 1.6822612285614014} 11/07/2021 02:33:46 - INFO - __main__ - Step 37892: {'lr': 0.00043088878176949163, 'samples': 7275264, 'steps': 37891, 'loss/train': 1.539404273033142} 11/07/2021 02:33:47 - INFO - __main__ - Step 37893: {'lr': 0.0004308851186603652, 'samples': 7275456, 'steps': 37892, 'loss/train': 1.781607985496521} 11/07/2021 02:33:47 - INFO - __main__ - Step 37894: {'lr': 0.0004308814554697348, 'samples': 7275648, 'steps': 37893, 'loss/train': 1.5547536611557007} 11/07/2021 02:33:47 - INFO - __main__ - Step 37895: {'lr': 0.0004308777921976019, 'samples': 7275840, 'steps': 37894, 'loss/train': 1.359387993812561} 11/07/2021 02:33:48 - INFO - __main__ - Step 37896: {'lr': 0.00043087412884396835, 'samples': 7276032, 'steps': 37895, 'loss/train': 1.5210968255996704} 11/07/2021 02:33:49 - INFO - __main__ - Step 37897: {'lr': 0.0004308704654088357, 'samples': 7276224, 'steps': 37896, 'loss/train': 1.327373743057251} 11/07/2021 02:33:50 - INFO - __main__ - Step 37898: {'lr': 0.00043086680189220554, 'samples': 7276416, 'steps': 37897, 'loss/train': 1.487549901008606} 11/07/2021 02:33:50 - INFO - __main__ - Step 37899: {'lr': 0.00043086313829407966, 'samples': 7276608, 'steps': 37898, 'loss/train': 1.7413653135299683} 11/07/2021 02:33:50 - INFO - __main__ - Step 37900: {'lr': 0.0004308594746144596, 'samples': 7276800, 'steps': 37899, 'loss/train': 1.532272219657898} 11/07/2021 02:33:51 - INFO - __main__ - Step 37901: {'lr': 0.0004308558108533471, 'samples': 7276992, 'steps': 37900, 'loss/train': 0.5073288679122925} 11/07/2021 02:33:52 - INFO - __main__ - Step 37902: {'lr': 0.0004308521470107437, 'samples': 7277184, 'steps': 37901, 'loss/train': 0.9130961894989014} 11/07/2021 02:33:52 - INFO - __main__ - Step 37903: {'lr': 0.00043084848308665115, 'samples': 7277376, 'steps': 37902, 'loss/train': 1.5158967971801758} 11/07/2021 02:33:52 - INFO - __main__ - Step 37904: {'lr': 0.00043084481908107103, 'samples': 7277568, 'steps': 37903, 'loss/train': 1.6894845962524414} 11/07/2021 02:33:53 - INFO - __main__ - Step 37905: {'lr': 0.00043084115499400505, 'samples': 7277760, 'steps': 37904, 'loss/train': 1.2803417444229126} 11/07/2021 02:33:53 - INFO - __main__ - Step 37906: {'lr': 0.0004308374908254549, 'samples': 7277952, 'steps': 37905, 'loss/train': 1.5752876996994019} 11/07/2021 02:33:54 - INFO - __main__ - Step 37907: {'lr': 0.000430833826575422, 'samples': 7278144, 'steps': 37906, 'loss/train': 1.7972919940948486} 11/07/2021 02:33:55 - INFO - __main__ - Step 37908: {'lr': 0.0004308301622439083, 'samples': 7278336, 'steps': 37907, 'loss/train': 0.9065421223640442} 11/07/2021 02:33:55 - INFO - __main__ - Step 37909: {'lr': 0.0004308264978309153, 'samples': 7278528, 'steps': 37908, 'loss/train': 1.333460807800293} 11/07/2021 02:33:55 - INFO - __main__ - Step 37910: {'lr': 0.0004308228333364447, 'samples': 7278720, 'steps': 37909, 'loss/train': 1.5436149835586548} 11/07/2021 02:33:56 - INFO - __main__ - Step 37911: {'lr': 0.000430819168760498, 'samples': 7278912, 'steps': 37910, 'loss/train': 1.4273598194122314} 11/07/2021 02:33:56 - INFO - __main__ - Step 37912: {'lr': 0.0004308155041030771, 'samples': 7279104, 'steps': 37911, 'loss/train': 1.3679322004318237} 11/07/2021 02:33:57 - INFO - __main__ - Step 37913: {'lr': 0.00043081183936418343, 'samples': 7279296, 'steps': 37912, 'loss/train': 1.7382436990737915} 11/07/2021 02:33:57 - INFO - __main__ - Step 37914: {'lr': 0.0004308081745438188, 'samples': 7279488, 'steps': 37913, 'loss/train': 1.7539761066436768} 11/07/2021 02:33:58 - INFO - __main__ - Step 37915: {'lr': 0.00043080450964198483, 'samples': 7279680, 'steps': 37914, 'loss/train': 1.1099629402160645} 11/07/2021 02:33:58 - INFO - __main__ - Step 37916: {'lr': 0.00043080084465868307, 'samples': 7279872, 'steps': 37915, 'loss/train': 1.9730298519134521} 11/07/2021 02:33:58 - INFO - __main__ - Step 37917: {'lr': 0.0004307971795939152, 'samples': 7280064, 'steps': 37916, 'loss/train': 1.7470884323120117} 11/07/2021 02:33:59 - INFO - __main__ - Step 37918: {'lr': 0.000430793514447683, 'samples': 7280256, 'steps': 37917, 'loss/train': 1.3940391540527344} 11/07/2021 02:34:00 - INFO - __main__ - Step 37919: {'lr': 0.000430789849219988, 'samples': 7280448, 'steps': 37918, 'loss/train': 1.257074236869812} 11/07/2021 02:34:00 - INFO - __main__ - Step 37920: {'lr': 0.0004307861839108319, 'samples': 7280640, 'steps': 37919, 'loss/train': 1.2887773513793945} 11/07/2021 02:34:01 - INFO - __main__ - Step 37921: {'lr': 0.00043078251852021634, 'samples': 7280832, 'steps': 37920, 'loss/train': 1.6662101745605469} 11/07/2021 02:34:01 - INFO - __main__ - Step 37922: {'lr': 0.0004307788530481429, 'samples': 7281024, 'steps': 37921, 'loss/train': 1.563852071762085} 11/07/2021 02:34:02 - INFO - __main__ - Step 37923: {'lr': 0.00043077518749461336, 'samples': 7281216, 'steps': 37922, 'loss/train': 1.7499343156814575} 11/07/2021 02:34:02 - INFO - __main__ - Step 37924: {'lr': 0.00043077152185962933, 'samples': 7281408, 'steps': 37923, 'loss/train': 1.418026328086853} 11/07/2021 02:34:03 - INFO - __main__ - Step 37925: {'lr': 0.00043076785614319234, 'samples': 7281600, 'steps': 37924, 'loss/train': 1.733856201171875} 11/07/2021 02:34:03 - INFO - __main__ - Step 37926: {'lr': 0.0004307641903453042, 'samples': 7281792, 'steps': 37925, 'loss/train': 1.2359224557876587} 11/07/2021 02:34:03 - INFO - __main__ - Step 37927: {'lr': 0.00043076052446596656, 'samples': 7281984, 'steps': 37926, 'loss/train': 1.7876313924789429} 11/07/2021 02:34:04 - INFO - __main__ - Step 37928: {'lr': 0.000430756858505181, 'samples': 7282176, 'steps': 37927, 'loss/train': 1.8781075477600098} 11/07/2021 02:34:05 - INFO - __main__ - Step 37929: {'lr': 0.00043075319246294914, 'samples': 7282368, 'steps': 37928, 'loss/train': 1.6325830221176147} 11/07/2021 02:34:05 - INFO - __main__ - Step 37930: {'lr': 0.0004307495263392727, 'samples': 7282560, 'steps': 37929, 'loss/train': 1.660241961479187} 11/07/2021 02:34:05 - INFO - __main__ - Step 37931: {'lr': 0.00043074586013415337, 'samples': 7282752, 'steps': 37930, 'loss/train': 1.76328444480896} 11/07/2021 02:34:06 - INFO - __main__ - Step 37932: {'lr': 0.0004307421938475926, 'samples': 7282944, 'steps': 37931, 'loss/train': 1.3810811042785645} 11/07/2021 02:34:06 - INFO - __main__ - Step 37933: {'lr': 0.0004307385274795923, 'samples': 7283136, 'steps': 37932, 'loss/train': 1.5373013019561768} 11/07/2021 02:34:07 - INFO - __main__ - Step 37934: {'lr': 0.000430734861030154, 'samples': 7283328, 'steps': 37933, 'loss/train': 1.6717209815979004} 11/07/2021 02:34:07 - INFO - __main__ - Step 37935: {'lr': 0.0004307311944992793, 'samples': 7283520, 'steps': 37934, 'loss/train': 1.2602779865264893} 11/07/2021 02:34:08 - INFO - __main__ - Step 37936: {'lr': 0.00043072752788697003, 'samples': 7283712, 'steps': 37935, 'loss/train': 1.6329361200332642} 11/07/2021 02:34:08 - INFO - __main__ - Step 37937: {'lr': 0.0004307238611932276, 'samples': 7283904, 'steps': 37936, 'loss/train': 1.5360777378082275} 11/07/2021 02:34:08 - INFO - __main__ - Step 37938: {'lr': 0.00043072019441805386, 'samples': 7284096, 'steps': 37937, 'loss/train': 1.425024390220642} 11/07/2021 02:34:11 - INFO - __main__ - Step 37939: {'lr': 0.00043071652756145035, 'samples': 7284288, 'steps': 37938, 'loss/train': 1.440492033958435} 11/07/2021 02:34:11 - INFO - __main__ - Step 37940: {'lr': 0.0004307128606234188, 'samples': 7284480, 'steps': 37939, 'loss/train': 1.5482230186462402} 11/07/2021 02:34:11 - INFO - __main__ - Step 37941: {'lr': 0.00043070919360396076, 'samples': 7284672, 'steps': 37940, 'loss/train': 1.8059606552124023} 11/07/2021 02:34:12 - INFO - __main__ - Step 37942: {'lr': 0.00043070552650307804, 'samples': 7284864, 'steps': 37941, 'loss/train': 1.8094056844711304} 11/07/2021 02:34:12 - INFO - __main__ - Step 37943: {'lr': 0.0004307018593207721, 'samples': 7285056, 'steps': 37942, 'loss/train': 1.8075382709503174} 11/07/2021 02:34:12 - INFO - __main__ - Step 37944: {'lr': 0.0004306981920570447, 'samples': 7285248, 'steps': 37943, 'loss/train': 1.4249544143676758} 11/07/2021 02:34:13 - INFO - __main__ - Step 37945: {'lr': 0.00043069452471189765, 'samples': 7285440, 'steps': 37944, 'loss/train': 1.518310308456421} 11/07/2021 02:34:14 - INFO - __main__ - Step 37946: {'lr': 0.00043069085728533225, 'samples': 7285632, 'steps': 37945, 'loss/train': 1.5919995307922363} 11/07/2021 02:34:14 - INFO - __main__ - Step 37947: {'lr': 0.0004306871897773504, 'samples': 7285824, 'steps': 37946, 'loss/train': 1.625312328338623} 11/07/2021 02:34:15 - INFO - __main__ - Step 37948: {'lr': 0.0004306835221879537, 'samples': 7286016, 'steps': 37947, 'loss/train': 1.3613947629928589} 11/07/2021 02:34:15 - INFO - __main__ - Step 37949: {'lr': 0.00043067985451714373, 'samples': 7286208, 'steps': 37948, 'loss/train': 1.5056431293487549} 11/07/2021 02:34:15 - INFO - __main__ - Step 37950: {'lr': 0.0004306761867649223, 'samples': 7286400, 'steps': 37949, 'loss/train': 1.7206279039382935} 11/07/2021 02:34:16 - INFO - __main__ - Step 37951: {'lr': 0.0004306725189312909, 'samples': 7286592, 'steps': 37950, 'loss/train': 1.5456422567367554} 11/07/2021 02:34:17 - INFO - __main__ - Step 37952: {'lr': 0.00043066885101625133, 'samples': 7286784, 'steps': 37951, 'loss/train': 1.7271060943603516} 11/07/2021 02:34:17 - INFO - __main__ - Step 37953: {'lr': 0.00043066518301980504, 'samples': 7286976, 'steps': 37952, 'loss/train': 1.534792423248291} 11/07/2021 02:34:17 - INFO - __main__ - Step 37954: {'lr': 0.00043066151494195387, 'samples': 7287168, 'steps': 37953, 'loss/train': 1.1038849353790283} 11/07/2021 02:34:18 - INFO - __main__ - Step 37955: {'lr': 0.00043065784678269944, 'samples': 7287360, 'steps': 37954, 'loss/train': 1.5833653211593628} 11/07/2021 02:34:18 - INFO - __main__ - Step 37956: {'lr': 0.00043065417854204333, 'samples': 7287552, 'steps': 37955, 'loss/train': 1.7715234756469727} 11/07/2021 02:34:19 - INFO - __main__ - Step 37957: {'lr': 0.0004306505102199872, 'samples': 7287744, 'steps': 37956, 'loss/train': 1.5732712745666504} 11/07/2021 02:34:19 - INFO - __main__ - Step 37958: {'lr': 0.0004306468418165328, 'samples': 7287936, 'steps': 37957, 'loss/train': 1.638083815574646} 11/07/2021 02:34:20 - INFO - __main__ - Step 37959: {'lr': 0.0004306431733316817, 'samples': 7288128, 'steps': 37958, 'loss/train': 1.445956826210022} 11/07/2021 02:34:20 - INFO - __main__ - Step 37960: {'lr': 0.00043063950476543563, 'samples': 7288320, 'steps': 37959, 'loss/train': 2.0494678020477295} 11/07/2021 02:34:20 - INFO - __main__ - Step 37961: {'lr': 0.0004306358361177961, 'samples': 7288512, 'steps': 37960, 'loss/train': 1.6257449388504028} 11/07/2021 02:34:22 - INFO - __main__ - Step 37962: {'lr': 0.00043063216738876487, 'samples': 7288704, 'steps': 37961, 'loss/train': 1.5832840204238892} 11/07/2021 02:34:22 - INFO - __main__ - Step 37963: {'lr': 0.0004306284985783436, 'samples': 7288896, 'steps': 37962, 'loss/train': 1.6308826208114624} 11/07/2021 02:34:22 - INFO - __main__ - Step 37964: {'lr': 0.00043062482968653394, 'samples': 7289088, 'steps': 37963, 'loss/train': 1.5397615432739258} 11/07/2021 02:34:23 - INFO - __main__ - Step 37965: {'lr': 0.00043062116071333745, 'samples': 7289280, 'steps': 37964, 'loss/train': 1.1325451135635376} 11/07/2021 02:34:23 - INFO - __main__ - Step 37966: {'lr': 0.0004306174916587559, 'samples': 7289472, 'steps': 37965, 'loss/train': 1.169539451599121} 11/07/2021 02:34:24 - INFO - __main__ - Step 37967: {'lr': 0.0004306138225227909, 'samples': 7289664, 'steps': 37966, 'loss/train': 1.4744054079055786} 11/07/2021 02:34:25 - INFO - __main__ - Step 37968: {'lr': 0.0004306101533054441, 'samples': 7289856, 'steps': 37967, 'loss/train': 1.580784559249878} 11/07/2021 02:34:25 - INFO - __main__ - Step 37969: {'lr': 0.0004306064840067171, 'samples': 7290048, 'steps': 37968, 'loss/train': 1.974003553390503} 11/07/2021 02:34:25 - INFO - __main__ - Step 37970: {'lr': 0.00043060281462661165, 'samples': 7290240, 'steps': 37969, 'loss/train': 0.612262487411499} 11/07/2021 02:34:26 - INFO - __main__ - Step 37971: {'lr': 0.0004305991451651293, 'samples': 7290432, 'steps': 37970, 'loss/train': 1.509476661682129} 11/07/2021 02:34:27 - INFO - __main__ - Step 37972: {'lr': 0.00043059547562227185, 'samples': 7290624, 'steps': 37971, 'loss/train': 1.5636862516403198} 11/07/2021 02:34:27 - INFO - __main__ - Step 37973: {'lr': 0.0004305918059980408, 'samples': 7290816, 'steps': 37972, 'loss/train': 1.6018271446228027} 11/07/2021 02:34:28 - INFO - __main__ - Step 37974: {'lr': 0.00043058813629243787, 'samples': 7291008, 'steps': 37973, 'loss/train': 1.2701356410980225} 11/07/2021 02:34:28 - INFO - __main__ - Step 37975: {'lr': 0.0004305844665054648, 'samples': 7291200, 'steps': 37974, 'loss/train': 1.5594356060028076} 11/07/2021 02:34:28 - INFO - __main__ - Step 37976: {'lr': 0.00043058079663712304, 'samples': 7291392, 'steps': 37975, 'loss/train': 1.4831064939498901} 11/07/2021 02:34:30 - INFO - __main__ - Step 37977: {'lr': 0.00043057712668741443, 'samples': 7291584, 'steps': 37976, 'loss/train': 1.7999645471572876} 11/07/2021 02:34:30 - INFO - __main__ - Step 37978: {'lr': 0.0004305734566563405, 'samples': 7291776, 'steps': 37977, 'loss/train': 1.4681727886199951} 11/07/2021 02:34:31 - INFO - __main__ - Step 37979: {'lr': 0.000430569786543903, 'samples': 7291968, 'steps': 37978, 'loss/train': 1.2198891639709473} 11/07/2021 02:34:31 - INFO - __main__ - Step 37980: {'lr': 0.00043056611635010355, 'samples': 7292160, 'steps': 37979, 'loss/train': 1.8494244813919067} 11/07/2021 02:34:31 - INFO - __main__ - Step 37981: {'lr': 0.00043056244607494375, 'samples': 7292352, 'steps': 37980, 'loss/train': 1.7782717943191528} 11/07/2021 02:34:32 - INFO - __main__ - Step 37982: {'lr': 0.0004305587757184254, 'samples': 7292544, 'steps': 37981, 'loss/train': 1.644851565361023} 11/07/2021 02:34:32 - INFO - __main__ - Step 37983: {'lr': 0.0004305551052805499, 'samples': 7292736, 'steps': 37982, 'loss/train': 1.0468441247940063} 11/07/2021 02:34:34 - INFO - __main__ - Step 37984: {'lr': 0.0004305514347613191, 'samples': 7292928, 'steps': 37983, 'loss/train': 1.4788124561309814} 11/07/2021 02:34:34 - INFO - __main__ - Step 37985: {'lr': 0.0004305477641607347, 'samples': 7293120, 'steps': 37984, 'loss/train': 1.492355227470398} 11/07/2021 02:34:34 - INFO - __main__ - Step 37986: {'lr': 0.0004305440934787982, 'samples': 7293312, 'steps': 37985, 'loss/train': 1.895719289779663} 11/07/2021 02:34:35 - INFO - __main__ - Step 37987: {'lr': 0.0004305404227155113, 'samples': 7293504, 'steps': 37986, 'loss/train': 1.0172924995422363} 11/07/2021 02:34:35 - INFO - __main__ - Step 37988: {'lr': 0.0004305367518708757, 'samples': 7293696, 'steps': 37987, 'loss/train': 1.722000241279602} 11/07/2021 02:34:35 - INFO - __main__ - Step 37989: {'lr': 0.000430533080944893, 'samples': 7293888, 'steps': 37988, 'loss/train': 1.4645875692367554} 11/07/2021 02:34:37 - INFO - __main__ - Step 37990: {'lr': 0.00043052940993756493, 'samples': 7294080, 'steps': 37989, 'loss/train': 1.7487901449203491} 11/07/2021 02:34:37 - INFO - __main__ - Step 37991: {'lr': 0.00043052573884889305, 'samples': 7294272, 'steps': 37990, 'loss/train': 1.0233213901519775} 11/07/2021 02:34:37 - INFO - __main__ - Step 37992: {'lr': 0.00043052206767887907, 'samples': 7294464, 'steps': 37991, 'loss/train': 2.0673677921295166} 11/07/2021 02:34:38 - INFO - __main__ - Step 37993: {'lr': 0.00043051839642752466, 'samples': 7294656, 'steps': 37992, 'loss/train': 1.400230050086975} 11/07/2021 02:34:38 - INFO - __main__ - Step 37994: {'lr': 0.00043051472509483135, 'samples': 7294848, 'steps': 37993, 'loss/train': 1.6917005777359009} 11/07/2021 02:34:39 - INFO - __main__ - Step 37995: {'lr': 0.00043051105368080103, 'samples': 7295040, 'steps': 37994, 'loss/train': 1.5717216730117798} 11/07/2021 02:34:39 - INFO - __main__ - Step 37996: {'lr': 0.00043050738218543505, 'samples': 7295232, 'steps': 37995, 'loss/train': 1.281235694885254} 11/07/2021 02:34:40 - INFO - __main__ - Step 37997: {'lr': 0.00043050371060873537, 'samples': 7295424, 'steps': 37996, 'loss/train': 1.25249445438385} 11/07/2021 02:34:40 - INFO - __main__ - Step 37998: {'lr': 0.00043050003895070345, 'samples': 7295616, 'steps': 37997, 'loss/train': 1.582495927810669} 11/07/2021 02:34:40 - INFO - __main__ - Step 37999: {'lr': 0.000430496367211341, 'samples': 7295808, 'steps': 37998, 'loss/train': 1.7385200262069702} 11/07/2021 02:34:41 - INFO - __main__ - Step 38000: {'lr': 0.00043049269539064967, 'samples': 7296000, 'steps': 37999, 'loss/train': 1.301990270614624} 11/07/2021 02:34:42 - INFO - __main__ - Step 38001: {'lr': 0.0004304890234886311, 'samples': 7296192, 'steps': 38000, 'loss/train': 1.387725830078125} 11/07/2021 02:34:42 - INFO - __main__ - Step 38002: {'lr': 0.000430485351505287, 'samples': 7296384, 'steps': 38001, 'loss/train': 1.957288146018982} 11/07/2021 02:34:42 - INFO - __main__ - Step 38003: {'lr': 0.000430481679440619, 'samples': 7296576, 'steps': 38002, 'loss/train': 0.9746781587600708} 11/07/2021 02:34:43 - INFO - __main__ - Step 38004: {'lr': 0.0004304780072946287, 'samples': 7296768, 'steps': 38003, 'loss/train': 1.7512469291687012} 11/07/2021 02:34:44 - INFO - __main__ - Step 38005: {'lr': 0.00043047433506731783, 'samples': 7296960, 'steps': 38004, 'loss/train': 1.5505845546722412} 11/07/2021 02:34:44 - INFO - __main__ - Step 38006: {'lr': 0.00043047066275868795, 'samples': 7297152, 'steps': 38005, 'loss/train': 1.0482814311981201} 11/07/2021 02:34:45 - INFO - __main__ - Step 38007: {'lr': 0.0004304669903687408, 'samples': 7297344, 'steps': 38006, 'loss/train': 0.8675975799560547} 11/07/2021 02:34:45 - INFO - __main__ - Step 38008: {'lr': 0.000430463317897478, 'samples': 7297536, 'steps': 38007, 'loss/train': 2.2233479022979736} 11/07/2021 02:34:45 - INFO - __main__ - Step 38009: {'lr': 0.0004304596453449012, 'samples': 7297728, 'steps': 38008, 'loss/train': 1.630285382270813} 11/07/2021 02:34:46 - INFO - __main__ - Step 38010: {'lr': 0.0004304559727110121, 'samples': 7297920, 'steps': 38009, 'loss/train': 1.7070844173431396} 11/07/2021 02:34:47 - INFO - __main__ - Step 38011: {'lr': 0.0004304522999958124, 'samples': 7298112, 'steps': 38010, 'loss/train': 2.6189193725585938} 11/07/2021 02:34:47 - INFO - __main__ - Step 38012: {'lr': 0.00043044862719930356, 'samples': 7298304, 'steps': 38011, 'loss/train': 1.5570639371871948} 11/07/2021 02:34:47 - INFO - __main__ - Step 38013: {'lr': 0.0004304449543214874, 'samples': 7298496, 'steps': 38012, 'loss/train': 2.0051374435424805} 11/07/2021 02:34:48 - INFO - __main__ - Step 38014: {'lr': 0.0004304412813623655, 'samples': 7298688, 'steps': 38013, 'loss/train': 1.8789836168289185} 11/07/2021 02:34:48 - INFO - __main__ - Step 38015: {'lr': 0.0004304376083219396, 'samples': 7298880, 'steps': 38014, 'loss/train': 1.4761950969696045} 11/07/2021 02:34:49 - INFO - __main__ - Step 38016: {'lr': 0.00043043393520021125, 'samples': 7299072, 'steps': 38015, 'loss/train': 1.8535746335983276} 11/07/2021 02:34:50 - INFO - __main__ - Step 38017: {'lr': 0.0004304302619971822, 'samples': 7299264, 'steps': 38016, 'loss/train': 1.2162835597991943} 11/07/2021 02:34:50 - INFO - __main__ - Step 38018: {'lr': 0.000430426588712854, 'samples': 7299456, 'steps': 38017, 'loss/train': 1.473557710647583} 11/07/2021 02:34:50 - INFO - __main__ - Step 38019: {'lr': 0.0004304229153472283, 'samples': 7299648, 'steps': 38018, 'loss/train': 1.200123906135559} 11/07/2021 02:34:51 - INFO - __main__ - Step 38020: {'lr': 0.0004304192419003069, 'samples': 7299840, 'steps': 38019, 'loss/train': 1.2828359603881836} 11/07/2021 02:34:52 - INFO - __main__ - Step 38021: {'lr': 0.0004304155683720914, 'samples': 7300032, 'steps': 38020, 'loss/train': 1.4277637004852295} 11/07/2021 02:34:52 - INFO - __main__ - Step 38022: {'lr': 0.0004304118947625835, 'samples': 7300224, 'steps': 38021, 'loss/train': 1.602111577987671} 11/07/2021 02:34:52 - INFO - __main__ - Step 38023: {'lr': 0.00043040822107178465, 'samples': 7300416, 'steps': 38022, 'loss/train': 1.7993630170822144} 11/07/2021 02:34:53 - INFO - __main__ - Step 38024: {'lr': 0.0004304045472996966, 'samples': 7300608, 'steps': 38023, 'loss/train': 1.5964939594268799} 11/07/2021 02:34:53 - INFO - __main__ - Step 38025: {'lr': 0.0004304008734463212, 'samples': 7300800, 'steps': 38024, 'loss/train': 1.7076945304870605} 11/07/2021 02:34:53 - INFO - __main__ - Step 38026: {'lr': 0.00043039719951165986, 'samples': 7300992, 'steps': 38025, 'loss/train': 1.7546823024749756} 11/07/2021 02:34:54 - INFO - __main__ - Step 38027: {'lr': 0.0004303935254957143, 'samples': 7301184, 'steps': 38026, 'loss/train': 1.6069893836975098} 11/07/2021 02:34:55 - INFO - __main__ - Step 38028: {'lr': 0.0004303898513984863, 'samples': 7301376, 'steps': 38027, 'loss/train': 1.5962930917739868} 11/07/2021 02:34:55 - INFO - __main__ - Step 38029: {'lr': 0.0004303861772199773, 'samples': 7301568, 'steps': 38028, 'loss/train': 1.200292706489563} 11/07/2021 02:34:55 - INFO - __main__ - Step 38030: {'lr': 0.00043038250296018916, 'samples': 7301760, 'steps': 38029, 'loss/train': 0.9734283685684204} 11/07/2021 02:34:56 - INFO - __main__ - Step 38031: {'lr': 0.00043037882861912344, 'samples': 7301952, 'steps': 38030, 'loss/train': 1.5647027492523193} 11/07/2021 02:34:57 - INFO - __main__ - Step 38032: {'lr': 0.00043037515419678174, 'samples': 7302144, 'steps': 38031, 'loss/train': 1.3203153610229492} 11/07/2021 02:34:57 - INFO - __main__ - Step 38033: {'lr': 0.0004303714796931658, 'samples': 7302336, 'steps': 38032, 'loss/train': 1.8147969245910645} 11/07/2021 02:34:58 - INFO - __main__ - Step 38034: {'lr': 0.0004303678051082773, 'samples': 7302528, 'steps': 38033, 'loss/train': 2.1749398708343506} 11/07/2021 02:34:58 - INFO - __main__ - Step 38035: {'lr': 0.00043036413044211786, 'samples': 7302720, 'steps': 38034, 'loss/train': 1.9432387351989746} 11/07/2021 02:34:58 - INFO - __main__ - Step 38036: {'lr': 0.0004303604556946891, 'samples': 7302912, 'steps': 38035, 'loss/train': 1.8085533380508423} 11/07/2021 02:34:59 - INFO - __main__ - Step 38037: {'lr': 0.00043035678086599265, 'samples': 7303104, 'steps': 38036, 'loss/train': 1.323421597480774} 11/07/2021 02:35:00 - INFO - __main__ - Step 38038: {'lr': 0.00043035310595603026, 'samples': 7303296, 'steps': 38037, 'loss/train': 1.8228347301483154} 11/07/2021 02:35:00 - INFO - __main__ - Step 38039: {'lr': 0.00043034943096480357, 'samples': 7303488, 'steps': 38038, 'loss/train': 1.2788726091384888} 11/07/2021 02:35:00 - INFO - __main__ - Step 38040: {'lr': 0.0004303457558923142, 'samples': 7303680, 'steps': 38039, 'loss/train': 0.3434649407863617} 11/07/2021 02:35:01 - INFO - __main__ - Step 38041: {'lr': 0.00043034208073856374, 'samples': 7303872, 'steps': 38040, 'loss/train': 1.8999801874160767} 11/07/2021 02:35:02 - INFO - __main__ - Step 38042: {'lr': 0.000430338405503554, 'samples': 7304064, 'steps': 38041, 'loss/train': 0.8197647929191589} 11/07/2021 02:35:02 - INFO - __main__ - Step 38043: {'lr': 0.00043033473018728655, 'samples': 7304256, 'steps': 38042, 'loss/train': 1.171054482460022} 11/07/2021 02:35:02 - INFO - __main__ - Step 38044: {'lr': 0.00043033105478976306, 'samples': 7304448, 'steps': 38043, 'loss/train': 1.650887131690979} 11/07/2021 02:35:03 - INFO - __main__ - Step 38045: {'lr': 0.00043032737931098517, 'samples': 7304640, 'steps': 38044, 'loss/train': 1.5681344270706177} 11/07/2021 02:35:03 - INFO - __main__ - Step 38046: {'lr': 0.0004303237037509545, 'samples': 7304832, 'steps': 38045, 'loss/train': 1.498640537261963} 11/07/2021 02:35:04 - INFO - __main__ - Step 38047: {'lr': 0.0004303200281096727, 'samples': 7305024, 'steps': 38046, 'loss/train': 0.21930217742919922} 11/07/2021 02:35:05 - INFO - __main__ - Step 38048: {'lr': 0.00043031635238714163, 'samples': 7305216, 'steps': 38047, 'loss/train': 1.0719927549362183} 11/07/2021 02:35:05 - INFO - __main__ - Step 38049: {'lr': 0.00043031267658336276, 'samples': 7305408, 'steps': 38048, 'loss/train': 1.2124830484390259} 11/07/2021 02:35:05 - INFO - __main__ - Step 38050: {'lr': 0.00043030900069833774, 'samples': 7305600, 'steps': 38049, 'loss/train': 1.630570888519287} 11/07/2021 02:35:06 - INFO - __main__ - Step 38051: {'lr': 0.0004303053247320683, 'samples': 7305792, 'steps': 38050, 'loss/train': 1.6434534788131714} 11/07/2021 02:35:07 - INFO - __main__ - Step 38052: {'lr': 0.000430301648684556, 'samples': 7305984, 'steps': 38051, 'loss/train': 1.8277837038040161} 11/07/2021 02:35:07 - INFO - __main__ - Step 38053: {'lr': 0.0004302979725558026, 'samples': 7306176, 'steps': 38052, 'loss/train': 1.3193963766098022} 11/07/2021 02:35:07 - INFO - __main__ - Step 38054: {'lr': 0.0004302942963458097, 'samples': 7306368, 'steps': 38053, 'loss/train': 1.8110835552215576} 11/07/2021 02:35:08 - INFO - __main__ - Step 38055: {'lr': 0.00043029062005457897, 'samples': 7306560, 'steps': 38054, 'loss/train': 1.2935056686401367} 11/07/2021 02:35:08 - INFO - __main__ - Step 38056: {'lr': 0.00043028694368211216, 'samples': 7306752, 'steps': 38055, 'loss/train': 1.2038893699645996} 11/07/2021 02:35:08 - INFO - __main__ - Step 38057: {'lr': 0.00043028326722841073, 'samples': 7306944, 'steps': 38056, 'loss/train': 1.8193788528442383} 11/07/2021 02:35:09 - INFO - __main__ - Step 38058: {'lr': 0.00043027959069347644, 'samples': 7307136, 'steps': 38057, 'loss/train': 1.7588491439819336} 11/07/2021 02:35:10 - INFO - __main__ - Step 38059: {'lr': 0.00043027591407731106, 'samples': 7307328, 'steps': 38058, 'loss/train': 1.366492748260498} 11/07/2021 02:35:10 - INFO - __main__ - Step 38060: {'lr': 0.000430272237379916, 'samples': 7307520, 'steps': 38059, 'loss/train': 0.8111729621887207} 11/07/2021 02:35:10 - INFO - __main__ - Step 38061: {'lr': 0.00043026856060129307, 'samples': 7307712, 'steps': 38060, 'loss/train': 1.5397465229034424} 11/07/2021 02:35:11 - INFO - __main__ - Step 38062: {'lr': 0.00043026488374144404, 'samples': 7307904, 'steps': 38061, 'loss/train': 1.6733875274658203} 11/07/2021 02:35:12 - INFO - __main__ - Step 38063: {'lr': 0.00043026120680037026, 'samples': 7308096, 'steps': 38062, 'loss/train': 1.1368083953857422} 11/07/2021 02:35:12 - INFO - __main__ - Step 38064: {'lr': 0.00043025752977807365, 'samples': 7308288, 'steps': 38063, 'loss/train': 1.9798434972763062} 11/07/2021 02:35:12 - INFO - __main__ - Step 38065: {'lr': 0.00043025385267455576, 'samples': 7308480, 'steps': 38064, 'loss/train': 1.3891901969909668} 11/07/2021 02:35:13 - INFO - __main__ - Step 38066: {'lr': 0.0004302501754898183, 'samples': 7308672, 'steps': 38065, 'loss/train': 1.7628917694091797} 11/07/2021 02:35:13 - INFO - __main__ - Step 38067: {'lr': 0.00043024649822386284, 'samples': 7308864, 'steps': 38066, 'loss/train': 1.3898802995681763} 11/07/2021 02:35:14 - INFO - __main__ - Step 38068: {'lr': 0.00043024282087669106, 'samples': 7309056, 'steps': 38067, 'loss/train': 1.7788422107696533} 11/07/2021 02:35:14 - INFO - __main__ - Step 38069: {'lr': 0.0004302391434483048, 'samples': 7309248, 'steps': 38068, 'loss/train': 1.2079501152038574} 11/07/2021 02:35:15 - INFO - __main__ - Step 38070: {'lr': 0.00043023546593870543, 'samples': 7309440, 'steps': 38069, 'loss/train': 0.31639406085014343} 11/07/2021 02:35:15 - INFO - __main__ - Step 38071: {'lr': 0.00043023178834789477, 'samples': 7309632, 'steps': 38070, 'loss/train': 1.4251023530960083} 11/07/2021 02:35:16 - INFO - __main__ - Step 38072: {'lr': 0.0004302281106758745, 'samples': 7309824, 'steps': 38071, 'loss/train': 1.5218110084533691} 11/07/2021 02:35:17 - INFO - __main__ - Step 38073: {'lr': 0.00043022443292264613, 'samples': 7310016, 'steps': 38072, 'loss/train': 1.7101134061813354} 11/07/2021 02:35:17 - INFO - __main__ - Step 38074: {'lr': 0.00043022075508821145, 'samples': 7310208, 'steps': 38073, 'loss/train': 1.4294854402542114} 11/07/2021 02:35:17 - INFO - __main__ - Step 38075: {'lr': 0.0004302170771725721, 'samples': 7310400, 'steps': 38074, 'loss/train': 1.5847716331481934} 11/07/2021 02:35:18 - INFO - __main__ - Step 38076: {'lr': 0.0004302133991757297, 'samples': 7310592, 'steps': 38075, 'loss/train': 1.3922067880630493} 11/07/2021 02:35:18 - INFO - __main__ - Step 38077: {'lr': 0.000430209721097686, 'samples': 7310784, 'steps': 38076, 'loss/train': 1.6040767431259155} 11/07/2021 02:35:19 - INFO - __main__ - Step 38078: {'lr': 0.00043020604293844244, 'samples': 7310976, 'steps': 38077, 'loss/train': 1.1318144798278809} 11/07/2021 02:35:19 - INFO - __main__ - Step 38079: {'lr': 0.0004302023646980009, 'samples': 7311168, 'steps': 38078, 'loss/train': 1.669752597808838} 11/07/2021 02:35:20 - INFO - __main__ - Step 38080: {'lr': 0.00043019868637636294, 'samples': 7311360, 'steps': 38079, 'loss/train': 1.3321876525878906} 11/07/2021 02:35:20 - INFO - __main__ - Step 38081: {'lr': 0.0004301950079735302, 'samples': 7311552, 'steps': 38080, 'loss/train': 1.4218177795410156} 11/07/2021 02:35:20 - INFO - __main__ - Step 38082: {'lr': 0.00043019132948950443, 'samples': 7311744, 'steps': 38081, 'loss/train': 0.9103419780731201} 11/07/2021 02:35:21 - INFO - __main__ - Step 38083: {'lr': 0.0004301876509242872, 'samples': 7311936, 'steps': 38082, 'loss/train': 1.6363718509674072} 11/07/2021 02:35:22 - INFO - __main__ - Step 38084: {'lr': 0.0004301839722778802, 'samples': 7312128, 'steps': 38083, 'loss/train': 1.5699800252914429} 11/07/2021 02:35:22 - INFO - __main__ - Step 38085: {'lr': 0.0004301802935502851, 'samples': 7312320, 'steps': 38084, 'loss/train': 1.228023648262024} 11/07/2021 02:35:22 - INFO - __main__ - Step 38086: {'lr': 0.00043017661474150347, 'samples': 7312512, 'steps': 38085, 'loss/train': 1.5643198490142822} 11/07/2021 02:35:23 - INFO - __main__ - Step 38087: {'lr': 0.0004301729358515371, 'samples': 7312704, 'steps': 38086, 'loss/train': 1.0907460451126099} 11/07/2021 02:35:24 - INFO - __main__ - Step 38088: {'lr': 0.00043016925688038756, 'samples': 7312896, 'steps': 38087, 'loss/train': 1.7797681093215942} 11/07/2021 02:35:24 - INFO - __main__ - Step 38089: {'lr': 0.00043016557782805655, 'samples': 7313088, 'steps': 38088, 'loss/train': 1.50370192527771} 11/07/2021 02:35:25 - INFO - __main__ - Step 38090: {'lr': 0.0004301618986945457, 'samples': 7313280, 'steps': 38089, 'loss/train': 1.3989465236663818} 11/07/2021 02:35:25 - INFO - __main__ - Step 38091: {'lr': 0.0004301582194798567, 'samples': 7313472, 'steps': 38090, 'loss/train': 1.1627254486083984} 11/07/2021 02:35:25 - INFO - __main__ - Step 38092: {'lr': 0.00043015454018399115, 'samples': 7313664, 'steps': 38091, 'loss/train': 1.59376060962677} 11/07/2021 02:35:26 - INFO - __main__ - Step 38093: {'lr': 0.00043015086080695075, 'samples': 7313856, 'steps': 38092, 'loss/train': 1.5860843658447266} 11/07/2021 02:35:27 - INFO - __main__ - Step 38094: {'lr': 0.0004301471813487372, 'samples': 7314048, 'steps': 38093, 'loss/train': 1.6321327686309814} 11/07/2021 02:35:27 - INFO - __main__ - Step 38095: {'lr': 0.00043014350180935207, 'samples': 7314240, 'steps': 38094, 'loss/train': 1.2975819110870361} 11/07/2021 02:35:27 - INFO - __main__ - Step 38096: {'lr': 0.0004301398221887971, 'samples': 7314432, 'steps': 38095, 'loss/train': 2.0996835231781006} 11/07/2021 02:35:28 - INFO - __main__ - Step 38097: {'lr': 0.0004301361424870739, 'samples': 7314624, 'steps': 38096, 'loss/train': 1.677228331565857} 11/07/2021 02:35:29 - INFO - __main__ - Step 38098: {'lr': 0.00043013246270418406, 'samples': 7314816, 'steps': 38097, 'loss/train': 1.5101380348205566} 11/07/2021 02:35:29 - INFO - __main__ - Step 38099: {'lr': 0.00043012878284012936, 'samples': 7315008, 'steps': 38098, 'loss/train': 1.5078647136688232} 11/07/2021 02:35:29 - INFO - __main__ - Step 38100: {'lr': 0.0004301251028949114, 'samples': 7315200, 'steps': 38099, 'loss/train': 1.506402850151062} 11/07/2021 02:35:30 - INFO - __main__ - Step 38101: {'lr': 0.00043012142286853185, 'samples': 7315392, 'steps': 38100, 'loss/train': 1.571759819984436} 11/07/2021 02:35:30 - INFO - __main__ - Step 38102: {'lr': 0.00043011774276099235, 'samples': 7315584, 'steps': 38101, 'loss/train': 1.5209771394729614} 11/07/2021 02:35:31 - INFO - __main__ - Step 38103: {'lr': 0.0004301140625722946, 'samples': 7315776, 'steps': 38102, 'loss/train': 1.6852452754974365} 11/07/2021 02:35:32 - INFO - __main__ - Step 38104: {'lr': 0.0004301103823024403, 'samples': 7315968, 'steps': 38103, 'loss/train': 1.4258034229278564} 11/07/2021 02:35:32 - INFO - __main__ - Step 38105: {'lr': 0.0004301067019514309, 'samples': 7316160, 'steps': 38104, 'loss/train': 1.3200912475585938} 11/07/2021 02:35:32 - INFO - __main__ - Step 38106: {'lr': 0.0004301030215192683, 'samples': 7316352, 'steps': 38105, 'loss/train': 1.657687783241272} 11/07/2021 02:35:33 - INFO - __main__ - Step 38107: {'lr': 0.00043009934100595403, 'samples': 7316544, 'steps': 38106, 'loss/train': 1.5879276990890503} 11/07/2021 02:35:33 - INFO - __main__ - Step 38108: {'lr': 0.00043009566041148973, 'samples': 7316736, 'steps': 38107, 'loss/train': 1.4185930490493774} 11/07/2021 02:35:34 - INFO - __main__ - Step 38109: {'lr': 0.0004300919797358772, 'samples': 7316928, 'steps': 38108, 'loss/train': 1.3224161863327026} 11/07/2021 02:35:34 - INFO - __main__ - Step 38110: {'lr': 0.00043008829897911796, 'samples': 7317120, 'steps': 38109, 'loss/train': 1.5699503421783447} 11/07/2021 02:35:35 - INFO - __main__ - Step 38111: {'lr': 0.0004300846181412137, 'samples': 7317312, 'steps': 38110, 'loss/train': 1.2713605165481567} 11/07/2021 02:35:35 - INFO - __main__ - Step 38112: {'lr': 0.00043008093722216603, 'samples': 7317504, 'steps': 38111, 'loss/train': 1.3977137804031372} 11/07/2021 02:35:36 - INFO - __main__ - Step 38113: {'lr': 0.00043007725622197675, 'samples': 7317696, 'steps': 38112, 'loss/train': 1.6717946529388428} 11/07/2021 02:35:37 - INFO - __main__ - Step 38114: {'lr': 0.0004300735751406474, 'samples': 7317888, 'steps': 38113, 'loss/train': 1.5027621984481812} 11/07/2021 02:35:37 - INFO - __main__ - Step 38115: {'lr': 0.00043006989397817967, 'samples': 7318080, 'steps': 38114, 'loss/train': 1.4531986713409424} 11/07/2021 02:35:37 - INFO - __main__ - Step 38116: {'lr': 0.00043006621273457523, 'samples': 7318272, 'steps': 38115, 'loss/train': 1.7254838943481445} 11/07/2021 02:35:38 - INFO - __main__ - Step 38117: {'lr': 0.0004300625314098358, 'samples': 7318464, 'steps': 38116, 'loss/train': 1.239201307296753} 11/07/2021 02:35:38 - INFO - __main__ - Step 38118: {'lr': 0.0004300588500039629, 'samples': 7318656, 'steps': 38117, 'loss/train': 1.312067985534668} 11/07/2021 02:35:39 - INFO - __main__ - Step 38119: {'lr': 0.0004300551685169583, 'samples': 7318848, 'steps': 38118, 'loss/train': 1.3828651905059814} 11/07/2021 02:35:39 - INFO - __main__ - Step 38120: {'lr': 0.0004300514869488236, 'samples': 7319040, 'steps': 38119, 'loss/train': 1.5329667329788208} 11/07/2021 02:35:40 - INFO - __main__ - Step 38121: {'lr': 0.00043004780529956046, 'samples': 7319232, 'steps': 38120, 'loss/train': 1.3659088611602783} 11/07/2021 02:35:40 - INFO - __main__ - Step 38122: {'lr': 0.00043004412356917055, 'samples': 7319424, 'steps': 38121, 'loss/train': 1.5442653894424438} 11/07/2021 02:35:40 - INFO - __main__ - Step 38123: {'lr': 0.0004300404417576556, 'samples': 7319616, 'steps': 38122, 'loss/train': 1.3420342206954956} 11/07/2021 02:35:41 - INFO - __main__ - Step 38124: {'lr': 0.00043003675986501717, 'samples': 7319808, 'steps': 38123, 'loss/train': 1.5655076503753662} 11/07/2021 02:35:42 - INFO - __main__ - Step 38125: {'lr': 0.00043003307789125694, 'samples': 7320000, 'steps': 38124, 'loss/train': 1.7089120149612427} 11/07/2021 02:35:42 - INFO - __main__ - Step 38126: {'lr': 0.0004300293958363766, 'samples': 7320192, 'steps': 38125, 'loss/train': 0.8477843999862671} 11/07/2021 02:35:43 - INFO - __main__ - Step 38127: {'lr': 0.00043002571370037777, 'samples': 7320384, 'steps': 38126, 'loss/train': 1.3977762460708618} 11/07/2021 02:35:43 - INFO - __main__ - Step 38128: {'lr': 0.00043002203148326213, 'samples': 7320576, 'steps': 38127, 'loss/train': 1.8233729600906372} 11/07/2021 02:35:44 - INFO - __main__ - Step 38129: {'lr': 0.0004300183491850314, 'samples': 7320768, 'steps': 38128, 'loss/train': 1.4460153579711914} 11/07/2021 02:35:44 - INFO - __main__ - Step 38130: {'lr': 0.0004300146668056871, 'samples': 7320960, 'steps': 38129, 'loss/train': 1.8460206985473633} 11/07/2021 02:35:45 - INFO - __main__ - Step 38131: {'lr': 0.00043001098434523107, 'samples': 7321152, 'steps': 38130, 'loss/train': 1.248296856880188} 11/07/2021 02:35:45 - INFO - __main__ - Step 38132: {'lr': 0.0004300073018036648, 'samples': 7321344, 'steps': 38131, 'loss/train': 1.5343341827392578} 11/07/2021 02:35:46 - INFO - __main__ - Step 38133: {'lr': 0.00043000361918099, 'samples': 7321536, 'steps': 38132, 'loss/train': 1.7700618505477905} 11/07/2021 02:35:46 - INFO - __main__ - Step 38134: {'lr': 0.00042999993647720836, 'samples': 7321728, 'steps': 38133, 'loss/train': 2.182451009750366} 11/07/2021 02:35:47 - INFO - __main__ - Step 38135: {'lr': 0.0004299962536923215, 'samples': 7321920, 'steps': 38134, 'loss/train': 1.4863643646240234} 11/07/2021 02:35:47 - INFO - __main__ - Step 38136: {'lr': 0.0004299925708263312, 'samples': 7322112, 'steps': 38135, 'loss/train': 1.7947341203689575} 11/07/2021 02:35:48 - INFO - __main__ - Step 38137: {'lr': 0.00042998888787923895, 'samples': 7322304, 'steps': 38136, 'loss/train': 1.6867958307266235} 11/07/2021 02:35:48 - INFO - __main__ - Step 38138: {'lr': 0.0004299852048510465, 'samples': 7322496, 'steps': 38137, 'loss/train': 1.5821454524993896} 11/07/2021 02:35:48 - INFO - __main__ - Step 38139: {'lr': 0.00042998152174175555, 'samples': 7322688, 'steps': 38138, 'loss/train': 1.2109464406967163} 11/07/2021 02:35:49 - INFO - __main__ - Step 38140: {'lr': 0.0004299778385513676, 'samples': 7322880, 'steps': 38139, 'loss/train': 1.8368099927902222} 11/07/2021 02:35:50 - INFO - __main__ - Step 38141: {'lr': 0.0004299741552798845, 'samples': 7323072, 'steps': 38140, 'loss/train': 1.569381594657898} 11/07/2021 02:35:50 - INFO - __main__ - Step 38142: {'lr': 0.0004299704719273078, 'samples': 7323264, 'steps': 38141, 'loss/train': 1.2834303379058838} 11/07/2021 02:35:50 - INFO - __main__ - Step 38143: {'lr': 0.00042996678849363914, 'samples': 7323456, 'steps': 38142, 'loss/train': 1.2398931980133057} 11/07/2021 02:35:51 - INFO - __main__ - Step 38144: {'lr': 0.00042996310497888025, 'samples': 7323648, 'steps': 38143, 'loss/train': 1.9385180473327637} 11/07/2021 02:35:51 - INFO - __main__ - Step 38145: {'lr': 0.00042995942138303274, 'samples': 7323840, 'steps': 38144, 'loss/train': 0.6801995635032654} 11/07/2021 02:35:52 - INFO - __main__ - Step 38146: {'lr': 0.0004299557377060983, 'samples': 7324032, 'steps': 38145, 'loss/train': 1.591407060623169} 11/07/2021 02:35:52 - INFO - __main__ - Step 38147: {'lr': 0.00042995205394807864, 'samples': 7324224, 'steps': 38146, 'loss/train': 1.4548685550689697} 11/07/2021 02:35:53 - INFO - __main__ - Step 38148: {'lr': 0.00042994837010897524, 'samples': 7324416, 'steps': 38147, 'loss/train': 1.6742019653320312} 11/07/2021 02:35:53 - INFO - __main__ - Step 38149: {'lr': 0.00042994468618879, 'samples': 7324608, 'steps': 38148, 'loss/train': 1.6510372161865234} 11/07/2021 02:35:54 - INFO - __main__ - Step 38150: {'lr': 0.0004299410021875244, 'samples': 7324800, 'steps': 38149, 'loss/train': 1.6389765739440918} 11/07/2021 02:35:55 - INFO - __main__ - Step 38151: {'lr': 0.00042993731810518025, 'samples': 7324992, 'steps': 38150, 'loss/train': 1.482251524925232} 11/07/2021 02:35:55 - INFO - __main__ - Step 38152: {'lr': 0.00042993363394175897, 'samples': 7325184, 'steps': 38151, 'loss/train': 2.3803791999816895} 11/07/2021 02:35:55 - INFO - __main__ - Step 38153: {'lr': 0.0004299299496972625, 'samples': 7325376, 'steps': 38152, 'loss/train': 1.4593158960342407} 11/07/2021 02:35:56 - INFO - __main__ - Step 38154: {'lr': 0.0004299262653716923, 'samples': 7325568, 'steps': 38153, 'loss/train': 1.8404978513717651} 11/07/2021 02:35:56 - INFO - __main__ - Step 38155: {'lr': 0.0004299225809650501, 'samples': 7325760, 'steps': 38154, 'loss/train': 1.2424695491790771} 11/07/2021 02:35:57 - INFO - __main__ - Step 38156: {'lr': 0.0004299188964773376, 'samples': 7325952, 'steps': 38155, 'loss/train': 1.2804923057556152} 11/07/2021 02:35:57 - INFO - __main__ - Step 38157: {'lr': 0.0004299152119085564, 'samples': 7326144, 'steps': 38156, 'loss/train': 1.5574463605880737} 11/07/2021 02:35:58 - INFO - __main__ - Step 38158: {'lr': 0.0004299115272587082, 'samples': 7326336, 'steps': 38157, 'loss/train': 0.36680254340171814} 11/07/2021 02:35:58 - INFO - __main__ - Step 38159: {'lr': 0.0004299078425277947, 'samples': 7326528, 'steps': 38158, 'loss/train': 1.097966194152832} 11/07/2021 02:35:58 - INFO - __main__ - Step 38160: {'lr': 0.00042990415771581734, 'samples': 7326720, 'steps': 38159, 'loss/train': 1.2340035438537598} 11/07/2021 02:35:59 - INFO - __main__ - Step 38161: {'lr': 0.0004299004728227781, 'samples': 7326912, 'steps': 38160, 'loss/train': 1.3330954313278198} 11/07/2021 02:36:00 - INFO - __main__ - Step 38162: {'lr': 0.0004298967878486784, 'samples': 7327104, 'steps': 38161, 'loss/train': 0.4179893136024475} 11/07/2021 02:36:00 - INFO - __main__ - Step 38163: {'lr': 0.00042989310279352, 'samples': 7327296, 'steps': 38162, 'loss/train': 0.3285816013813019} 11/07/2021 02:36:00 - INFO - __main__ - Step 38164: {'lr': 0.0004298894176573046, 'samples': 7327488, 'steps': 38163, 'loss/train': 1.6918214559555054} 11/07/2021 02:36:01 - INFO - __main__ - Step 38165: {'lr': 0.0004298857324400337, 'samples': 7327680, 'steps': 38164, 'loss/train': 1.8268216848373413} 11/07/2021 02:36:02 - INFO - __main__ - Step 38166: {'lr': 0.0004298820471417091, 'samples': 7327872, 'steps': 38165, 'loss/train': 1.4902230501174927} 11/07/2021 02:36:02 - INFO - __main__ - Step 38167: {'lr': 0.00042987836176233246, 'samples': 7328064, 'steps': 38166, 'loss/train': 1.3428443670272827} 11/07/2021 02:36:03 - INFO - __main__ - Step 38168: {'lr': 0.0004298746763019054, 'samples': 7328256, 'steps': 38167, 'loss/train': 0.43336984515190125} 11/07/2021 02:36:03 - INFO - __main__ - Step 38169: {'lr': 0.0004298709907604296, 'samples': 7328448, 'steps': 38168, 'loss/train': 1.229674220085144} 11/07/2021 02:36:03 - INFO - __main__ - Step 38170: {'lr': 0.0004298673051379066, 'samples': 7328640, 'steps': 38169, 'loss/train': 2.1229805946350098} 11/07/2021 02:36:04 - INFO - __main__ - Step 38171: {'lr': 0.0004298636194343383, 'samples': 7328832, 'steps': 38170, 'loss/train': 0.895877480506897} 11/07/2021 02:36:05 - INFO - __main__ - Step 38172: {'lr': 0.0004298599336497262, 'samples': 7329024, 'steps': 38171, 'loss/train': 1.468153476715088} 11/07/2021 02:36:05 - INFO - __main__ - Step 38173: {'lr': 0.00042985624778407196, 'samples': 7329216, 'steps': 38172, 'loss/train': 1.3250070810317993} 11/07/2021 02:36:05 - INFO - __main__ - Step 38174: {'lr': 0.00042985256183737723, 'samples': 7329408, 'steps': 38173, 'loss/train': 0.975612998008728} 11/07/2021 02:36:06 - INFO - __main__ - Step 38175: {'lr': 0.00042984887580964376, 'samples': 7329600, 'steps': 38174, 'loss/train': 1.6117653846740723} 11/07/2021 02:36:06 - INFO - __main__ - Step 38176: {'lr': 0.00042984518970087316, 'samples': 7329792, 'steps': 38175, 'loss/train': 1.356275200843811} 11/07/2021 02:36:07 - INFO - __main__ - Step 38177: {'lr': 0.0004298415035110671, 'samples': 7329984, 'steps': 38176, 'loss/train': 1.7516674995422363} 11/07/2021 02:36:08 - INFO - __main__ - Step 38178: {'lr': 0.00042983781724022723, 'samples': 7330176, 'steps': 38177, 'loss/train': 1.744294285774231} 11/07/2021 02:36:08 - INFO - __main__ - Step 38179: {'lr': 0.0004298341308883552, 'samples': 7330368, 'steps': 38178, 'loss/train': 2.2094287872314453} 11/07/2021 02:36:08 - INFO - __main__ - Step 38180: {'lr': 0.0004298304444554527, 'samples': 7330560, 'steps': 38179, 'loss/train': 1.6997636556625366} 11/07/2021 02:36:09 - INFO - __main__ - Step 38181: {'lr': 0.00042982675794152135, 'samples': 7330752, 'steps': 38180, 'loss/train': 2.0016207695007324} 11/07/2021 02:36:10 - INFO - __main__ - Step 38182: {'lr': 0.0004298230713465629, 'samples': 7330944, 'steps': 38181, 'loss/train': 1.2829039096832275} 11/07/2021 02:36:10 - INFO - __main__ - Step 38183: {'lr': 0.00042981938467057893, 'samples': 7331136, 'steps': 38182, 'loss/train': 0.7408188581466675} 11/07/2021 02:36:11 - INFO - __main__ - Step 38184: {'lr': 0.0004298156979135711, 'samples': 7331328, 'steps': 38183, 'loss/train': 1.2323633432388306} 11/07/2021 02:36:11 - INFO - __main__ - Step 38185: {'lr': 0.000429812011075541, 'samples': 7331520, 'steps': 38184, 'loss/train': 1.148552417755127} 11/07/2021 02:36:11 - INFO - __main__ - Step 38186: {'lr': 0.0004298083241564905, 'samples': 7331712, 'steps': 38185, 'loss/train': 1.3870030641555786} 11/07/2021 02:36:12 - INFO - __main__ - Step 38187: {'lr': 0.00042980463715642115, 'samples': 7331904, 'steps': 38186, 'loss/train': 1.4995348453521729} 11/07/2021 02:36:13 - INFO - __main__ - Step 38188: {'lr': 0.0004298009500753346, 'samples': 7332096, 'steps': 38187, 'loss/train': 1.3584494590759277} 11/07/2021 02:36:13 - INFO - __main__ - Step 38189: {'lr': 0.00042979726291323246, 'samples': 7332288, 'steps': 38188, 'loss/train': 1.6575303077697754} 11/07/2021 02:36:13 - INFO - __main__ - Step 38190: {'lr': 0.00042979357567011643, 'samples': 7332480, 'steps': 38189, 'loss/train': 1.3136870861053467} 11/07/2021 02:36:14 - INFO - __main__ - Step 38191: {'lr': 0.0004297898883459883, 'samples': 7332672, 'steps': 38190, 'loss/train': 1.2578123807907104} 11/07/2021 02:36:15 - INFO - __main__ - Step 38192: {'lr': 0.00042978620094084955, 'samples': 7332864, 'steps': 38191, 'loss/train': 1.4794684648513794} 11/07/2021 02:36:15 - INFO - __main__ - Step 38193: {'lr': 0.00042978251345470185, 'samples': 7333056, 'steps': 38192, 'loss/train': 1.5394213199615479} 11/07/2021 02:36:15 - INFO - __main__ - Step 38194: {'lr': 0.000429778825887547, 'samples': 7333248, 'steps': 38193, 'loss/train': 1.0442190170288086} 11/07/2021 02:36:16 - INFO - __main__ - Step 38195: {'lr': 0.00042977513823938665, 'samples': 7333440, 'steps': 38194, 'loss/train': 1.488396167755127} 11/07/2021 02:36:16 - INFO - __main__ - Step 38196: {'lr': 0.00042977145051022224, 'samples': 7333632, 'steps': 38195, 'loss/train': 1.642302393913269} 11/07/2021 02:36:17 - INFO - __main__ - Step 38197: {'lr': 0.0004297677627000557, 'samples': 7333824, 'steps': 38196, 'loss/train': 1.7927191257476807} 11/07/2021 02:36:17 - INFO - __main__ - Step 38198: {'lr': 0.0004297640748088886, 'samples': 7334016, 'steps': 38197, 'loss/train': 1.6281474828720093} 11/07/2021 02:36:18 - INFO - __main__ - Step 38199: {'lr': 0.0004297603868367225, 'samples': 7334208, 'steps': 38198, 'loss/train': 1.7086904048919678} 11/07/2021 02:36:18 - INFO - __main__ - Step 38200: {'lr': 0.00042975669878355917, 'samples': 7334400, 'steps': 38199, 'loss/train': 2.989762783050537} 11/07/2021 02:36:18 - INFO - __main__ - Step 38201: {'lr': 0.00042975301064940026, 'samples': 7334592, 'steps': 38200, 'loss/train': 1.8463572263717651} 11/07/2021 02:36:19 - INFO - __main__ - Step 38202: {'lr': 0.00042974932243424743, 'samples': 7334784, 'steps': 38201, 'loss/train': 2.0515167713165283} 11/07/2021 02:36:20 - INFO - __main__ - Step 38203: {'lr': 0.0004297456341381023, 'samples': 7334976, 'steps': 38202, 'loss/train': 0.7293980717658997} 11/07/2021 02:36:20 - INFO - __main__ - Step 38204: {'lr': 0.0004297419457609666, 'samples': 7335168, 'steps': 38203, 'loss/train': 1.7791290283203125} 11/07/2021 02:36:21 - INFO - __main__ - Step 38205: {'lr': 0.0004297382573028419, 'samples': 7335360, 'steps': 38204, 'loss/train': 1.5674655437469482} 11/07/2021 02:36:21 - INFO - __main__ - Step 38206: {'lr': 0.0004297345687637299, 'samples': 7335552, 'steps': 38205, 'loss/train': 2.14150333404541} 11/07/2021 02:36:21 - INFO - __main__ - Step 38207: {'lr': 0.00042973088014363237, 'samples': 7335744, 'steps': 38206, 'loss/train': 1.6745388507843018} 11/07/2021 02:36:22 - INFO - __main__ - Step 38208: {'lr': 0.0004297271914425508, 'samples': 7335936, 'steps': 38207, 'loss/train': 1.391538381576538} 11/07/2021 02:36:23 - INFO - __main__ - Step 38209: {'lr': 0.00042972350266048693, 'samples': 7336128, 'steps': 38208, 'loss/train': 0.917203962802887} 11/07/2021 02:36:23 - INFO - __main__ - Step 38210: {'lr': 0.0004297198137974425, 'samples': 7336320, 'steps': 38209, 'loss/train': 1.749819278717041} 11/07/2021 02:36:23 - INFO - __main__ - Step 38211: {'lr': 0.00042971612485341896, 'samples': 7336512, 'steps': 38210, 'loss/train': 0.9586760997772217} 11/07/2021 02:36:24 - INFO - __main__ - Step 38212: {'lr': 0.00042971243582841823, 'samples': 7336704, 'steps': 38211, 'loss/train': 0.8148049116134644} 11/07/2021 02:36:25 - INFO - __main__ - Step 38213: {'lr': 0.0004297087467224418, 'samples': 7336896, 'steps': 38212, 'loss/train': 1.433459758758545} 11/07/2021 02:36:25 - INFO - __main__ - Step 38214: {'lr': 0.0004297050575354914, 'samples': 7337088, 'steps': 38213, 'loss/train': 1.6563775539398193} 11/07/2021 02:36:25 - INFO - __main__ - Step 38215: {'lr': 0.0004297013682675687, 'samples': 7337280, 'steps': 38214, 'loss/train': 1.0042729377746582} 11/07/2021 02:36:26 - INFO - __main__ - Step 38216: {'lr': 0.0004296976789186753, 'samples': 7337472, 'steps': 38215, 'loss/train': 1.534716010093689} 11/07/2021 02:36:26 - INFO - __main__ - Step 38217: {'lr': 0.00042969398948881286, 'samples': 7337664, 'steps': 38216, 'loss/train': 1.5462156534194946} 11/07/2021 02:36:27 - INFO - __main__ - Step 38218: {'lr': 0.00042969029997798314, 'samples': 7337856, 'steps': 38217, 'loss/train': 1.679050326347351} 11/07/2021 02:36:27 - INFO - __main__ - Step 38219: {'lr': 0.00042968661038618775, 'samples': 7338048, 'steps': 38218, 'loss/train': 1.924902319908142} 11/07/2021 02:36:28 - INFO - __main__ - Step 38220: {'lr': 0.0004296829207134283, 'samples': 7338240, 'steps': 38219, 'loss/train': 1.2776870727539062} 11/07/2021 02:36:28 - INFO - __main__ - Step 38221: {'lr': 0.0004296792309597065, 'samples': 7338432, 'steps': 38220, 'loss/train': 1.4061015844345093} 11/07/2021 02:36:28 - INFO - __main__ - Step 38222: {'lr': 0.00042967554112502404, 'samples': 7338624, 'steps': 38221, 'loss/train': 1.2293282747268677} 11/07/2021 02:36:29 - INFO - __main__ - Step 38223: {'lr': 0.00042967185120938256, 'samples': 7338816, 'steps': 38222, 'loss/train': 1.0383968353271484} 11/07/2021 02:36:30 - INFO - __main__ - Step 38224: {'lr': 0.00042966816121278365, 'samples': 7339008, 'steps': 38223, 'loss/train': 1.7293782234191895} 11/07/2021 02:36:30 - INFO - __main__ - Step 38225: {'lr': 0.0004296644711352291, 'samples': 7339200, 'steps': 38224, 'loss/train': 1.6785378456115723} 11/07/2021 02:36:31 - INFO - __main__ - Step 38226: {'lr': 0.0004296607809767205, 'samples': 7339392, 'steps': 38225, 'loss/train': 1.3909289836883545} 11/07/2021 02:36:31 - INFO - __main__ - Step 38227: {'lr': 0.00042965709073725957, 'samples': 7339584, 'steps': 38226, 'loss/train': 1.9048470258712769} 11/07/2021 02:36:31 - INFO - __main__ - Step 38228: {'lr': 0.00042965340041684785, 'samples': 7339776, 'steps': 38227, 'loss/train': 1.051666021347046} 11/07/2021 02:36:32 - INFO - __main__ - Step 38229: {'lr': 0.00042964971001548715, 'samples': 7339968, 'steps': 38228, 'loss/train': 1.3761814832687378} 11/07/2021 02:36:33 - INFO - __main__ - Step 38230: {'lr': 0.00042964601953317895, 'samples': 7340160, 'steps': 38229, 'loss/train': 1.4855128526687622} 11/07/2021 02:36:33 - INFO - __main__ - Step 38231: {'lr': 0.0004296423289699252, 'samples': 7340352, 'steps': 38230, 'loss/train': 0.9149501323699951} 11/07/2021 02:36:33 - INFO - __main__ - Step 38232: {'lr': 0.00042963863832572727, 'samples': 7340544, 'steps': 38231, 'loss/train': 1.554993748664856} 11/07/2021 02:36:34 - INFO - __main__ - Step 38233: {'lr': 0.0004296349476005869, 'samples': 7340736, 'steps': 38232, 'loss/train': 1.3690378665924072} 11/07/2021 02:36:35 - INFO - __main__ - Step 38234: {'lr': 0.0004296312567945059, 'samples': 7340928, 'steps': 38233, 'loss/train': 1.3793355226516724} 11/07/2021 02:36:35 - INFO - __main__ - Step 38235: {'lr': 0.0004296275659074858, 'samples': 7341120, 'steps': 38234, 'loss/train': 1.2817461490631104} 11/07/2021 02:36:36 - INFO - __main__ - Step 38236: {'lr': 0.00042962387493952823, 'samples': 7341312, 'steps': 38235, 'loss/train': 1.375528335571289} 11/07/2021 02:36:36 - INFO - __main__ - Step 38237: {'lr': 0.00042962018389063495, 'samples': 7341504, 'steps': 38236, 'loss/train': 1.0297311544418335} 11/07/2021 02:36:36 - INFO - __main__ - Step 38238: {'lr': 0.0004296164927608076, 'samples': 7341696, 'steps': 38237, 'loss/train': 1.5794353485107422} 11/07/2021 02:36:37 - INFO - __main__ - Step 38239: {'lr': 0.00042961280155004786, 'samples': 7341888, 'steps': 38238, 'loss/train': 0.6085673570632935} 11/07/2021 02:36:38 - INFO - __main__ - Step 38240: {'lr': 0.0004296091102583573, 'samples': 7342080, 'steps': 38239, 'loss/train': 1.249404788017273} 11/07/2021 02:36:38 - INFO - __main__ - Step 38241: {'lr': 0.0004296054188857377, 'samples': 7342272, 'steps': 38240, 'loss/train': 1.118761420249939} 11/07/2021 02:36:38 - INFO - __main__ - Step 38242: {'lr': 0.0004296017274321906, 'samples': 7342464, 'steps': 38241, 'loss/train': 1.4748969078063965} 11/07/2021 02:36:39 - INFO - __main__ - Step 38243: {'lr': 0.0004295980358977178, 'samples': 7342656, 'steps': 38242, 'loss/train': 1.4010951519012451} 11/07/2021 02:36:40 - INFO - __main__ - Step 38244: {'lr': 0.0004295943442823209, 'samples': 7342848, 'steps': 38243, 'loss/train': 1.797529697418213} 11/07/2021 02:36:40 - INFO - __main__ - Step 38245: {'lr': 0.0004295906525860015, 'samples': 7343040, 'steps': 38244, 'loss/train': 1.5135968923568726} 11/07/2021 02:36:40 - INFO - __main__ - Step 38246: {'lr': 0.00042958696080876136, 'samples': 7343232, 'steps': 38245, 'loss/train': 1.836068868637085} 11/07/2021 02:36:41 - INFO - __main__ - Step 38247: {'lr': 0.00042958326895060206, 'samples': 7343424, 'steps': 38246, 'loss/train': 1.1405773162841797} 11/07/2021 02:36:41 - INFO - __main__ - Step 38248: {'lr': 0.0004295795770115254, 'samples': 7343616, 'steps': 38247, 'loss/train': 1.3413417339324951} 11/07/2021 02:36:42 - INFO - __main__ - Step 38249: {'lr': 0.0004295758849915329, 'samples': 7343808, 'steps': 38248, 'loss/train': 1.624894618988037} 11/07/2021 02:36:43 - INFO - __main__ - Step 38250: {'lr': 0.00042957219289062635, 'samples': 7344000, 'steps': 38249, 'loss/train': 1.6614336967468262} 11/07/2021 02:36:43 - INFO - __main__ - Step 38251: {'lr': 0.0004295685007088072, 'samples': 7344192, 'steps': 38250, 'loss/train': 1.712052822113037} 11/07/2021 02:36:43 - INFO - __main__ - Step 38252: {'lr': 0.00042956480844607734, 'samples': 7344384, 'steps': 38251, 'loss/train': 0.6693893074989319} 11/07/2021 02:36:44 - INFO - __main__ - Step 38253: {'lr': 0.00042956111610243833, 'samples': 7344576, 'steps': 38252, 'loss/train': 0.6600829362869263} 11/07/2021 02:36:44 - INFO - __main__ - Step 38254: {'lr': 0.0004295574236778919, 'samples': 7344768, 'steps': 38253, 'loss/train': 1.4803495407104492} 11/07/2021 02:36:45 - INFO - __main__ - Step 38255: {'lr': 0.00042955373117243954, 'samples': 7344960, 'steps': 38254, 'loss/train': 1.4988573789596558} 11/07/2021 02:36:45 - INFO - __main__ - Step 38256: {'lr': 0.0004295500385860832, 'samples': 7345152, 'steps': 38255, 'loss/train': 2.0637753009796143} 11/07/2021 02:36:46 - INFO - __main__ - Step 38257: {'lr': 0.0004295463459188243, 'samples': 7345344, 'steps': 38256, 'loss/train': 0.21861299872398376} 11/07/2021 02:36:46 - INFO - __main__ - Step 38258: {'lr': 0.00042954265317066457, 'samples': 7345536, 'steps': 38257, 'loss/train': 0.9756987690925598} 11/07/2021 02:36:46 - INFO - __main__ - Step 38259: {'lr': 0.0004295389603416057, 'samples': 7345728, 'steps': 38258, 'loss/train': 1.52315092086792} 11/07/2021 02:36:48 - INFO - __main__ - Step 38260: {'lr': 0.0004295352674316494, 'samples': 7345920, 'steps': 38259, 'loss/train': 1.6356688737869263} 11/07/2021 02:36:48 - INFO - __main__ - Step 38261: {'lr': 0.0004295315744407972, 'samples': 7346112, 'steps': 38260, 'loss/train': 1.7461309432983398} 11/07/2021 02:36:48 - INFO - __main__ - Step 38262: {'lr': 0.0004295278813690509, 'samples': 7346304, 'steps': 38261, 'loss/train': 1.6206568479537964} 11/07/2021 02:36:49 - INFO - __main__ - Step 38263: {'lr': 0.0004295241882164121, 'samples': 7346496, 'steps': 38262, 'loss/train': 1.4554109573364258} 11/07/2021 02:36:49 - INFO - __main__ - Step 38264: {'lr': 0.0004295204949828825, 'samples': 7346688, 'steps': 38263, 'loss/train': 1.0673553943634033} 11/07/2021 02:36:50 - INFO - __main__ - Step 38265: {'lr': 0.0004295168016684636, 'samples': 7346880, 'steps': 38264, 'loss/train': 1.2171517610549927} 11/07/2021 02:36:50 - INFO - __main__ - Step 38266: {'lr': 0.0004295131082731574, 'samples': 7347072, 'steps': 38265, 'loss/train': 2.0179131031036377} 11/07/2021 02:36:51 - INFO - __main__ - Step 38267: {'lr': 0.0004295094147969652, 'samples': 7347264, 'steps': 38266, 'loss/train': 1.5524829626083374} 11/07/2021 02:36:51 - INFO - __main__ - Step 38268: {'lr': 0.0004295057212398889, 'samples': 7347456, 'steps': 38267, 'loss/train': 1.5185699462890625} 11/07/2021 02:36:51 - INFO - __main__ - Step 38269: {'lr': 0.00042950202760193003, 'samples': 7347648, 'steps': 38268, 'loss/train': 1.997591495513916} 11/07/2021 02:36:53 - INFO - __main__ - Step 38270: {'lr': 0.0004294983338830904, 'samples': 7347840, 'steps': 38269, 'loss/train': 1.6346243619918823} 11/07/2021 02:36:53 - INFO - __main__ - Step 38271: {'lr': 0.0004294946400833716, 'samples': 7348032, 'steps': 38270, 'loss/train': 1.7416479587554932} 11/07/2021 02:36:53 - INFO - __main__ - Step 38272: {'lr': 0.0004294909462027752, 'samples': 7348224, 'steps': 38271, 'loss/train': 1.3885921239852905} 11/07/2021 02:36:54 - INFO - __main__ - Step 38273: {'lr': 0.000429487252241303, 'samples': 7348416, 'steps': 38272, 'loss/train': 0.7423022985458374} 11/07/2021 02:36:54 - INFO - __main__ - Step 38274: {'lr': 0.00042948355819895655, 'samples': 7348608, 'steps': 38273, 'loss/train': 1.4728059768676758} 11/07/2021 02:36:54 - INFO - __main__ - Step 38275: {'lr': 0.0004294798640757377, 'samples': 7348800, 'steps': 38274, 'loss/train': 0.8224681615829468} 11/07/2021 02:36:55 - INFO - __main__ - Step 38276: {'lr': 0.00042947616987164787, 'samples': 7348992, 'steps': 38275, 'loss/train': 1.8463332653045654} 11/07/2021 02:36:56 - INFO - __main__ - Step 38277: {'lr': 0.00042947247558668887, 'samples': 7349184, 'steps': 38276, 'loss/train': 1.5869121551513672} 11/07/2021 02:36:56 - INFO - __main__ - Step 38278: {'lr': 0.00042946878122086243, 'samples': 7349376, 'steps': 38277, 'loss/train': 1.624953269958496} 11/07/2021 02:36:56 - INFO - __main__ - Step 38279: {'lr': 0.00042946508677417007, 'samples': 7349568, 'steps': 38278, 'loss/train': 1.7056394815444946} 11/07/2021 02:36:57 - INFO - __main__ - Step 38280: {'lr': 0.0004294613922466135, 'samples': 7349760, 'steps': 38279, 'loss/train': 1.591142177581787} 11/07/2021 02:36:58 - INFO - __main__ - Step 38281: {'lr': 0.0004294576976381944, 'samples': 7349952, 'steps': 38280, 'loss/train': 1.4061824083328247} 11/07/2021 02:36:58 - INFO - __main__ - Step 38282: {'lr': 0.00042945400294891445, 'samples': 7350144, 'steps': 38281, 'loss/train': 1.90840482711792} 11/07/2021 02:36:58 - INFO - __main__ - Step 38283: {'lr': 0.0004294503081787753, 'samples': 7350336, 'steps': 38282, 'loss/train': 0.9647262692451477} 11/07/2021 02:36:59 - INFO - __main__ - Step 38284: {'lr': 0.0004294466133277786, 'samples': 7350528, 'steps': 38283, 'loss/train': 1.229430079460144} 11/07/2021 02:36:59 - INFO - __main__ - Step 38285: {'lr': 0.00042944291839592597, 'samples': 7350720, 'steps': 38284, 'loss/train': 0.9978717565536499} 11/07/2021 02:37:01 - INFO - __main__ - Step 38286: {'lr': 0.0004294392233832192, 'samples': 7350912, 'steps': 38285, 'loss/train': 1.2743558883666992} 11/07/2021 02:37:01 - INFO - __main__ - Step 38287: {'lr': 0.0004294355282896599, 'samples': 7351104, 'steps': 38286, 'loss/train': 1.7710016965866089} 11/07/2021 02:37:01 - INFO - __main__ - Step 38288: {'lr': 0.00042943183311524967, 'samples': 7351296, 'steps': 38287, 'loss/train': 1.8275021314620972} 11/07/2021 02:37:02 - INFO - __main__ - Step 38289: {'lr': 0.0004294281378599902, 'samples': 7351488, 'steps': 38288, 'loss/train': 1.90546452999115} 11/07/2021 02:37:02 - INFO - __main__ - Step 38290: {'lr': 0.00042942444252388323, 'samples': 7351680, 'steps': 38289, 'loss/train': 1.8090331554412842} 11/07/2021 02:37:02 - INFO - __main__ - Step 38291: {'lr': 0.0004294207471069304, 'samples': 7351872, 'steps': 38290, 'loss/train': 1.1580256223678589} 11/07/2021 02:37:03 - INFO - __main__ - Step 38292: {'lr': 0.0004294170516091332, 'samples': 7352064, 'steps': 38291, 'loss/train': 1.6080865859985352} 11/07/2021 02:37:04 - INFO - __main__ - Step 38293: {'lr': 0.0004294133560304936, 'samples': 7352256, 'steps': 38292, 'loss/train': 1.8003463745117188} 11/07/2021 02:37:04 - INFO - __main__ - Step 38294: {'lr': 0.00042940966037101314, 'samples': 7352448, 'steps': 38293, 'loss/train': 1.6777453422546387} 11/07/2021 02:37:05 - INFO - __main__ - Step 38295: {'lr': 0.00042940596463069336, 'samples': 7352640, 'steps': 38294, 'loss/train': 0.3316746652126312} 11/07/2021 02:37:05 - INFO - __main__ - Step 38296: {'lr': 0.00042940226880953605, 'samples': 7352832, 'steps': 38295, 'loss/train': 1.732701301574707} 11/07/2021 02:37:05 - INFO - __main__ - Step 38297: {'lr': 0.0004293985729075428, 'samples': 7353024, 'steps': 38296, 'loss/train': 1.5013654232025146} 11/07/2021 02:37:06 - INFO - __main__ - Step 38298: {'lr': 0.00042939487692471534, 'samples': 7353216, 'steps': 38297, 'loss/train': 1.4927855730056763} 11/07/2021 02:37:07 - INFO - __main__ - Step 38299: {'lr': 0.0004293911808610554, 'samples': 7353408, 'steps': 38298, 'loss/train': 1.6887056827545166} 11/07/2021 02:37:07 - INFO - __main__ - Step 38300: {'lr': 0.0004293874847165645, 'samples': 7353600, 'steps': 38299, 'loss/train': 1.4818027019500732} 11/07/2021 02:37:07 - INFO - __main__ - Step 38301: {'lr': 0.0004293837884912444, 'samples': 7353792, 'steps': 38300, 'loss/train': 2.025219678878784} 11/07/2021 02:37:08 - INFO - __main__ - Step 38302: {'lr': 0.00042938009218509667, 'samples': 7353984, 'steps': 38301, 'loss/train': 1.3994265794754028} 11/07/2021 02:37:08 - INFO - __main__ - Step 38303: {'lr': 0.00042937639579812304, 'samples': 7354176, 'steps': 38302, 'loss/train': 1.506459355354309} 11/07/2021 02:37:09 - INFO - __main__ - Step 38304: {'lr': 0.0004293726993303252, 'samples': 7354368, 'steps': 38303, 'loss/train': 1.719734787940979} 11/07/2021 02:37:09 - INFO - __main__ - Step 38305: {'lr': 0.0004293690027817048, 'samples': 7354560, 'steps': 38304, 'loss/train': 1.195610761642456} 11/07/2021 02:37:10 - INFO - __main__ - Step 38306: {'lr': 0.00042936530615226355, 'samples': 7354752, 'steps': 38305, 'loss/train': 1.3250503540039062} 11/07/2021 02:37:10 - INFO - __main__ - Step 38307: {'lr': 0.00042936160944200295, 'samples': 7354944, 'steps': 38306, 'loss/train': 1.7600414752960205} 11/07/2021 02:37:10 - INFO - __main__ - Step 38308: {'lr': 0.00042935791265092483, 'samples': 7355136, 'steps': 38307, 'loss/train': 1.7057290077209473} 11/07/2021 02:37:11 - INFO - __main__ - Step 38309: {'lr': 0.0004293542157790308, 'samples': 7355328, 'steps': 38308, 'loss/train': 1.512961983680725} 11/07/2021 02:37:12 - INFO - __main__ - Step 38310: {'lr': 0.00042935051882632245, 'samples': 7355520, 'steps': 38309, 'loss/train': 1.4231493473052979} 11/07/2021 02:37:12 - INFO - __main__ - Step 38311: {'lr': 0.0004293468217928017, 'samples': 7355712, 'steps': 38310, 'loss/train': 1.3613260984420776} 11/07/2021 02:37:12 - INFO - __main__ - Step 38312: {'lr': 0.0004293431246784699, 'samples': 7355904, 'steps': 38311, 'loss/train': 1.8416643142700195} 11/07/2021 02:37:13 - INFO - __main__ - Step 38313: {'lr': 0.0004293394274833289, 'samples': 7356096, 'steps': 38312, 'loss/train': 1.530188798904419} 11/07/2021 02:37:14 - INFO - __main__ - Step 38314: {'lr': 0.0004293357302073804, 'samples': 7356288, 'steps': 38313, 'loss/train': 1.4533849954605103} 11/07/2021 02:37:14 - INFO - __main__ - Step 38315: {'lr': 0.00042933203285062585, 'samples': 7356480, 'steps': 38314, 'loss/train': 1.5147393941879272} 11/07/2021 02:37:15 - INFO - __main__ - Step 38316: {'lr': 0.00042932833541306704, 'samples': 7356672, 'steps': 38315, 'loss/train': 1.3559811115264893} 11/07/2021 02:37:15 - INFO - __main__ - Step 38317: {'lr': 0.0004293246378947058, 'samples': 7356864, 'steps': 38316, 'loss/train': 1.9374382495880127} 11/07/2021 02:37:15 - INFO - __main__ - Step 38318: {'lr': 0.00042932094029554354, 'samples': 7357056, 'steps': 38317, 'loss/train': 1.3933587074279785} 11/07/2021 02:37:16 - INFO - __main__ - Step 38319: {'lr': 0.00042931724261558205, 'samples': 7357248, 'steps': 38318, 'loss/train': 1.1619360446929932} 11/07/2021 02:37:17 - INFO - __main__ - Step 38320: {'lr': 0.000429313544854823, 'samples': 7357440, 'steps': 38319, 'loss/train': 1.4888416528701782} 11/07/2021 02:37:17 - INFO - __main__ - Step 38321: {'lr': 0.00042930984701326796, 'samples': 7357632, 'steps': 38320, 'loss/train': 1.7033252716064453} 11/07/2021 02:37:17 - INFO - __main__ - Step 38322: {'lr': 0.0004293061490909187, 'samples': 7357824, 'steps': 38321, 'loss/train': 1.3759467601776123} 11/07/2021 02:37:18 - INFO - __main__ - Step 38323: {'lr': 0.0004293024510877769, 'samples': 7358016, 'steps': 38322, 'loss/train': 0.9921587109565735} 11/07/2021 02:37:19 - INFO - __main__ - Step 38324: {'lr': 0.00042929875300384417, 'samples': 7358208, 'steps': 38323, 'loss/train': 1.6296452283859253} 11/07/2021 02:37:19 - INFO - __main__ - Step 38325: {'lr': 0.0004292950548391222, 'samples': 7358400, 'steps': 38324, 'loss/train': 1.064168095588684} 11/07/2021 02:37:19 - INFO - __main__ - Step 38326: {'lr': 0.00042929135659361265, 'samples': 7358592, 'steps': 38325, 'loss/train': 0.5363820791244507} 11/07/2021 02:37:20 - INFO - __main__ - Step 38327: {'lr': 0.0004292876582673171, 'samples': 7358784, 'steps': 38326, 'loss/train': 0.9394339323043823} 11/07/2021 02:37:20 - INFO - __main__ - Step 38328: {'lr': 0.0004292839598602374, 'samples': 7358976, 'steps': 38327, 'loss/train': 0.8856777548789978} 11/07/2021 02:37:21 - INFO - __main__ - Step 38329: {'lr': 0.000429280261372375, 'samples': 7359168, 'steps': 38328, 'loss/train': 1.5098530054092407} 11/07/2021 02:37:22 - INFO - __main__ - Step 38330: {'lr': 0.00042927656280373176, 'samples': 7359360, 'steps': 38329, 'loss/train': 1.4925379753112793} 11/07/2021 02:37:22 - INFO - __main__ - Step 38331: {'lr': 0.00042927286415430933, 'samples': 7359552, 'steps': 38330, 'loss/train': 1.8342890739440918} 11/07/2021 02:37:22 - INFO - __main__ - Step 38332: {'lr': 0.0004292691654241092, 'samples': 7359744, 'steps': 38331, 'loss/train': 1.2873988151550293} 11/07/2021 02:37:23 - INFO - __main__ - Step 38333: {'lr': 0.00042926546661313313, 'samples': 7359936, 'steps': 38332, 'loss/train': 1.7223068475723267} 11/07/2021 02:37:23 - INFO - __main__ - Step 38334: {'lr': 0.00042926176772138295, 'samples': 7360128, 'steps': 38333, 'loss/train': 1.29563307762146} 11/07/2021 02:37:24 - INFO - __main__ - Step 38335: {'lr': 0.0004292580687488601, 'samples': 7360320, 'steps': 38334, 'loss/train': 1.639797568321228} 11/07/2021 02:37:24 - INFO - __main__ - Step 38336: {'lr': 0.0004292543696955663, 'samples': 7360512, 'steps': 38335, 'loss/train': 1.5832456350326538} 11/07/2021 02:37:25 - INFO - __main__ - Step 38337: {'lr': 0.00042925067056150324, 'samples': 7360704, 'steps': 38336, 'loss/train': 1.8844374418258667} 11/07/2021 02:37:25 - INFO - __main__ - Step 38338: {'lr': 0.0004292469713466727, 'samples': 7360896, 'steps': 38337, 'loss/train': 1.8090503215789795} 11/07/2021 02:37:25 - INFO - __main__ - Step 38339: {'lr': 0.00042924327205107616, 'samples': 7361088, 'steps': 38338, 'loss/train': 1.2895766496658325} 11/07/2021 02:37:27 - INFO - __main__ - Step 38340: {'lr': 0.00042923957267471536, 'samples': 7361280, 'steps': 38339, 'loss/train': 1.9198942184448242} 11/07/2021 02:37:27 - INFO - __main__ - Step 38341: {'lr': 0.000429235873217592, 'samples': 7361472, 'steps': 38340, 'loss/train': 1.340233564376831} 11/07/2021 02:37:27 - INFO - __main__ - Step 38342: {'lr': 0.0004292321736797077, 'samples': 7361664, 'steps': 38341, 'loss/train': 1.4563068151474} 11/07/2021 02:37:28 - INFO - __main__ - Step 38343: {'lr': 0.0004292284740610642, 'samples': 7361856, 'steps': 38342, 'loss/train': 1.0337316989898682} 11/07/2021 02:37:28 - INFO - __main__ - Step 38344: {'lr': 0.0004292247743616631, 'samples': 7362048, 'steps': 38343, 'loss/train': 1.9487308263778687} 11/07/2021 02:37:29 - INFO - __main__ - Step 38345: {'lr': 0.00042922107458150604, 'samples': 7362240, 'steps': 38344, 'loss/train': 1.299109697341919} 11/07/2021 02:37:29 - INFO - __main__ - Step 38346: {'lr': 0.00042921737472059474, 'samples': 7362432, 'steps': 38345, 'loss/train': 1.619086742401123} 11/07/2021 02:37:30 - INFO - __main__ - Step 38347: {'lr': 0.0004292136747789309, 'samples': 7362624, 'steps': 38346, 'loss/train': 1.6750733852386475} 11/07/2021 02:37:30 - INFO - __main__ - Step 38348: {'lr': 0.00042920997475651607, 'samples': 7362816, 'steps': 38347, 'loss/train': 1.1470671892166138} 11/07/2021 02:37:30 - INFO - __main__ - Step 38349: {'lr': 0.00042920627465335205, 'samples': 7363008, 'steps': 38348, 'loss/train': 1.635786533355713} 11/07/2021 02:37:31 - INFO - __main__ - Step 38350: {'lr': 0.00042920257446944044, 'samples': 7363200, 'steps': 38349, 'loss/train': 1.5170003175735474} 11/07/2021 02:37:32 - INFO - __main__ - Step 38351: {'lr': 0.0004291988742047829, 'samples': 7363392, 'steps': 38350, 'loss/train': 1.636959433555603} 11/07/2021 02:37:32 - INFO - __main__ - Step 38352: {'lr': 0.0004291951738593811, 'samples': 7363584, 'steps': 38351, 'loss/train': 1.6563864946365356} 11/07/2021 02:37:32 - INFO - __main__ - Step 38353: {'lr': 0.0004291914734332367, 'samples': 7363776, 'steps': 38352, 'loss/train': 2.0324835777282715} 11/07/2021 02:37:33 - INFO - __main__ - Step 38354: {'lr': 0.0004291877729263515, 'samples': 7363968, 'steps': 38353, 'loss/train': 1.2263227701187134} 11/07/2021 02:37:33 - INFO - __main__ - Step 38355: {'lr': 0.0004291840723387269, 'samples': 7364160, 'steps': 38354, 'loss/train': 1.572365164756775} 11/07/2021 02:37:35 - INFO - __main__ - Step 38356: {'lr': 0.0004291803716703648, 'samples': 7364352, 'steps': 38355, 'loss/train': 1.4267158508300781} 11/07/2021 02:37:35 - INFO - __main__ - Step 38357: {'lr': 0.0004291766709212668, 'samples': 7364544, 'steps': 38356, 'loss/train': 1.7175195217132568} 11/07/2021 02:37:35 - INFO - __main__ - Step 38358: {'lr': 0.00042917297009143455, 'samples': 7364736, 'steps': 38357, 'loss/train': 1.390121579170227} 11/07/2021 02:37:36 - INFO - __main__ - Step 38359: {'lr': 0.00042916926918086973, 'samples': 7364928, 'steps': 38358, 'loss/train': 0.26775655150413513} 11/07/2021 02:37:36 - INFO - __main__ - Step 38360: {'lr': 0.000429165568189574, 'samples': 7365120, 'steps': 38359, 'loss/train': 1.4750648736953735} 11/07/2021 02:37:37 - INFO - __main__ - Step 38361: {'lr': 0.000429161867117549, 'samples': 7365312, 'steps': 38360, 'loss/train': 0.9110004305839539} 11/07/2021 02:37:37 - INFO - __main__ - Step 38362: {'lr': 0.0004291581659647965, 'samples': 7365504, 'steps': 38361, 'loss/train': 1.269174575805664} 11/07/2021 02:37:38 - INFO - __main__ - Step 38363: {'lr': 0.00042915446473131805, 'samples': 7365696, 'steps': 38362, 'loss/train': 1.0548052787780762} 11/07/2021 02:37:38 - INFO - __main__ - Step 38364: {'lr': 0.0004291507634171153, 'samples': 7365888, 'steps': 38363, 'loss/train': 1.7208577394485474} 11/07/2021 02:37:38 - INFO - __main__ - Step 38365: {'lr': 0.0004291470620221901, 'samples': 7366080, 'steps': 38364, 'loss/train': 1.6459720134735107} 11/07/2021 02:37:39 - INFO - __main__ - Step 38366: {'lr': 0.0004291433605465439, 'samples': 7366272, 'steps': 38365, 'loss/train': 1.5473575592041016} 11/07/2021 02:37:40 - INFO - __main__ - Step 38367: {'lr': 0.00042913965899017855, 'samples': 7366464, 'steps': 38366, 'loss/train': 1.445980191230774} 11/07/2021 02:37:40 - INFO - __main__ - Step 38368: {'lr': 0.0004291359573530956, 'samples': 7366656, 'steps': 38367, 'loss/train': 0.7899708151817322} 11/07/2021 02:37:40 - INFO - __main__ - Step 38369: {'lr': 0.0004291322556352967, 'samples': 7366848, 'steps': 38368, 'loss/train': 1.4125922918319702} 11/07/2021 02:37:41 - INFO - __main__ - Step 38370: {'lr': 0.00042912855383678365, 'samples': 7367040, 'steps': 38369, 'loss/train': 1.8855928182601929} 11/07/2021 02:37:41 - INFO - __main__ - Step 38371: {'lr': 0.000429124851957558, 'samples': 7367232, 'steps': 38370, 'loss/train': 1.5229068994522095} 11/07/2021 02:37:42 - INFO - __main__ - Step 38372: {'lr': 0.0004291211499976214, 'samples': 7367424, 'steps': 38371, 'loss/train': 1.5820503234863281} 11/07/2021 02:37:43 - INFO - __main__ - Step 38373: {'lr': 0.0004291174479569757, 'samples': 7367616, 'steps': 38372, 'loss/train': 1.3003944158554077} 11/07/2021 02:37:43 - INFO - __main__ - Step 38374: {'lr': 0.00042911374583562233, 'samples': 7367808, 'steps': 38373, 'loss/train': 1.4751864671707153} 11/07/2021 02:37:43 - INFO - __main__ - Step 38375: {'lr': 0.0004291100436335631, 'samples': 7368000, 'steps': 38374, 'loss/train': 1.2032063007354736} 11/07/2021 02:37:44 - INFO - __main__ - Step 38376: {'lr': 0.00042910634135079963, 'samples': 7368192, 'steps': 38375, 'loss/train': 1.6390328407287598} 11/07/2021 02:37:45 - INFO - __main__ - Step 38377: {'lr': 0.00042910263898733364, 'samples': 7368384, 'steps': 38376, 'loss/train': 2.2559876441955566} 11/07/2021 02:37:45 - INFO - __main__ - Step 38378: {'lr': 0.0004290989365431668, 'samples': 7368576, 'steps': 38377, 'loss/train': 0.7892315983772278} 11/07/2021 02:37:45 - INFO - __main__ - Step 38379: {'lr': 0.0004290952340183007, 'samples': 7368768, 'steps': 38378, 'loss/train': 2.178373098373413} 11/07/2021 02:37:46 - INFO - __main__ - Step 38380: {'lr': 0.00042909153141273705, 'samples': 7368960, 'steps': 38379, 'loss/train': 1.791010856628418} 11/07/2021 02:37:46 - INFO - __main__ - Step 38381: {'lr': 0.0004290878287264775, 'samples': 7369152, 'steps': 38380, 'loss/train': 1.7731704711914062} 11/07/2021 02:37:47 - INFO - __main__ - Step 38382: {'lr': 0.0004290841259595237, 'samples': 7369344, 'steps': 38381, 'loss/train': 0.74814772605896} 11/07/2021 02:37:48 - INFO - __main__ - Step 38383: {'lr': 0.00042908042311187744, 'samples': 7369536, 'steps': 38382, 'loss/train': 1.4945881366729736} 11/07/2021 02:37:48 - INFO - __main__ - Step 38384: {'lr': 0.00042907672018354027, 'samples': 7369728, 'steps': 38383, 'loss/train': 1.2648224830627441} 11/07/2021 02:37:48 - INFO - __main__ - Step 38385: {'lr': 0.00042907301717451396, 'samples': 7369920, 'steps': 38384, 'loss/train': 1.0060302019119263} 11/07/2021 02:37:49 - INFO - __main__ - Step 38386: {'lr': 0.0004290693140848, 'samples': 7370112, 'steps': 38385, 'loss/train': 1.5758689641952515} 11/07/2021 02:37:50 - INFO - __main__ - Step 38387: {'lr': 0.0004290656109144003, 'samples': 7370304, 'steps': 38386, 'loss/train': 1.7817904949188232} 11/07/2021 02:37:50 - INFO - __main__ - Step 38388: {'lr': 0.0004290619076633163, 'samples': 7370496, 'steps': 38387, 'loss/train': 1.7534565925598145} 11/07/2021 02:37:51 - INFO - __main__ - Step 38389: {'lr': 0.0004290582043315498, 'samples': 7370688, 'steps': 38388, 'loss/train': 1.8690040111541748} 11/07/2021 02:37:51 - INFO - __main__ - Step 38390: {'lr': 0.0004290545009191024, 'samples': 7370880, 'steps': 38389, 'loss/train': 0.6957410573959351} 11/07/2021 02:37:51 - INFO - __main__ - Step 38391: {'lr': 0.0004290507974259759, 'samples': 7371072, 'steps': 38390, 'loss/train': 1.5398764610290527} 11/07/2021 02:37:52 - INFO - __main__ - Step 38392: {'lr': 0.0004290470938521718, 'samples': 7371264, 'steps': 38391, 'loss/train': 0.7379046678543091} 11/07/2021 02:37:53 - INFO - __main__ - Step 38393: {'lr': 0.0004290433901976918, 'samples': 7371456, 'steps': 38392, 'loss/train': 5.813073635101318} 11/07/2021 02:37:53 - INFO - __main__ - Step 38394: {'lr': 0.0004290396864625377, 'samples': 7371648, 'steps': 38393, 'loss/train': 1.6722701787948608} 11/07/2021 02:37:53 - INFO - __main__ - Step 38395: {'lr': 0.000429035982646711, 'samples': 7371840, 'steps': 38394, 'loss/train': 1.5052942037582397} 11/07/2021 02:37:54 - INFO - __main__ - Step 38396: {'lr': 0.0004290322787502135, 'samples': 7372032, 'steps': 38395, 'loss/train': 1.1739457845687866} 11/07/2021 02:37:54 - INFO - __main__ - Step 38397: {'lr': 0.0004290285747730468, 'samples': 7372224, 'steps': 38396, 'loss/train': 1.9641786813735962} 11/07/2021 02:37:54 - INFO - __main__ - Step 38398: {'lr': 0.00042902487071521257, 'samples': 7372416, 'steps': 38397, 'loss/train': 1.9700204133987427} 11/07/2021 02:37:55 - INFO - __main__ - Step 38399: {'lr': 0.0004290211665767125, 'samples': 7372608, 'steps': 38398, 'loss/train': 1.8752416372299194} 11/07/2021 02:37:56 - INFO - __main__ - Step 38400: {'lr': 0.00042901746235754837, 'samples': 7372800, 'steps': 38399, 'loss/train': 1.5767382383346558} 11/07/2021 02:37:56 - INFO - __main__ - Step 38401: {'lr': 0.0004290137580577216, 'samples': 7372992, 'steps': 38400, 'loss/train': 1.6494174003601074} 11/07/2021 02:37:56 - INFO - __main__ - Step 38402: {'lr': 0.000429010053677234, 'samples': 7373184, 'steps': 38401, 'loss/train': 1.7537641525268555} 11/07/2021 02:37:57 - INFO - __main__ - Step 38403: {'lr': 0.00042900634921608726, 'samples': 7373376, 'steps': 38402, 'loss/train': 1.3137192726135254} 11/07/2021 02:37:58 - INFO - __main__ - Step 38404: {'lr': 0.0004290026446742831, 'samples': 7373568, 'steps': 38403, 'loss/train': 4.322111129760742} 11/07/2021 02:37:58 - INFO - __main__ - Step 38405: {'lr': 0.00042899894005182294, 'samples': 7373760, 'steps': 38404, 'loss/train': 1.4622212648391724} 11/07/2021 02:37:59 - INFO - __main__ - Step 38406: {'lr': 0.0004289952353487088, 'samples': 7373952, 'steps': 38405, 'loss/train': 0.7398356795310974} 11/07/2021 02:37:59 - INFO - __main__ - Step 38407: {'lr': 0.000428991530564942, 'samples': 7374144, 'steps': 38406, 'loss/train': 0.5963699817657471} 11/07/2021 02:37:59 - INFO - __main__ - Step 38408: {'lr': 0.00042898782570052453, 'samples': 7374336, 'steps': 38407, 'loss/train': 1.9118536710739136} 11/07/2021 02:38:00 - INFO - __main__ - Step 38409: {'lr': 0.0004289841207554578, 'samples': 7374528, 'steps': 38408, 'loss/train': 1.7349226474761963} 11/07/2021 02:38:01 - INFO - __main__ - Step 38410: {'lr': 0.00042898041572974363, 'samples': 7374720, 'steps': 38409, 'loss/train': 1.2647113800048828} 11/07/2021 02:38:01 - INFO - __main__ - Step 38411: {'lr': 0.0004289767106233836, 'samples': 7374912, 'steps': 38410, 'loss/train': 1.462265968322754} 11/07/2021 02:38:01 - INFO - __main__ - Step 38412: {'lr': 0.0004289730054363795, 'samples': 7375104, 'steps': 38411, 'loss/train': 1.6180638074874878} 11/07/2021 02:38:02 - INFO - __main__ - Step 38413: {'lr': 0.00042896930016873293, 'samples': 7375296, 'steps': 38412, 'loss/train': 1.8184987306594849} 11/07/2021 02:38:03 - INFO - __main__ - Step 38414: {'lr': 0.0004289655948204455, 'samples': 7375488, 'steps': 38413, 'loss/train': 1.8326774835586548} 11/07/2021 02:38:03 - INFO - __main__ - Step 38415: {'lr': 0.00042896188939151893, 'samples': 7375680, 'steps': 38414, 'loss/train': 0.982987105846405} 11/07/2021 02:38:04 - INFO - __main__ - Step 38416: {'lr': 0.00042895818388195497, 'samples': 7375872, 'steps': 38415, 'loss/train': 1.4191560745239258} 11/07/2021 02:38:04 - INFO - __main__ - Step 38417: {'lr': 0.00042895447829175516, 'samples': 7376064, 'steps': 38416, 'loss/train': 1.3981465101242065} 11/07/2021 02:38:04 - INFO - __main__ - Step 38418: {'lr': 0.00042895077262092117, 'samples': 7376256, 'steps': 38417, 'loss/train': 1.7016116380691528} 11/07/2021 02:38:05 - INFO - __main__ - Step 38419: {'lr': 0.00042894706686945485, 'samples': 7376448, 'steps': 38418, 'loss/train': 0.9155838489532471} 11/07/2021 02:38:06 - INFO - __main__ - Step 38420: {'lr': 0.00042894336103735766, 'samples': 7376640, 'steps': 38419, 'loss/train': 0.9882771372795105} 11/07/2021 02:38:06 - INFO - __main__ - Step 38421: {'lr': 0.0004289396551246313, 'samples': 7376832, 'steps': 38420, 'loss/train': 1.5289490222930908} 11/07/2021 02:38:06 - INFO - __main__ - Step 38422: {'lr': 0.0004289359491312776, 'samples': 7377024, 'steps': 38421, 'loss/train': 1.6016318798065186} 11/07/2021 02:38:07 - INFO - __main__ - Step 38423: {'lr': 0.00042893224305729806, 'samples': 7377216, 'steps': 38422, 'loss/train': 1.8466453552246094} 11/07/2021 02:38:08 - INFO - __main__ - Step 38424: {'lr': 0.0004289285369026944, 'samples': 7377408, 'steps': 38423, 'loss/train': 1.7110973596572876} 11/07/2021 02:38:08 - INFO - __main__ - Step 38425: {'lr': 0.00042892483066746836, 'samples': 7377600, 'steps': 38424, 'loss/train': 1.5883793830871582} 11/07/2021 02:38:09 - INFO - __main__ - Step 38426: {'lr': 0.0004289211243516216, 'samples': 7377792, 'steps': 38425, 'loss/train': 1.6158411502838135} 11/07/2021 02:38:09 - INFO - __main__ - Step 38427: {'lr': 0.0004289174179551556, 'samples': 7377984, 'steps': 38426, 'loss/train': 1.451820969581604} 11/07/2021 02:38:09 - INFO - __main__ - Step 38428: {'lr': 0.0004289137114780722, 'samples': 7378176, 'steps': 38427, 'loss/train': 1.9416730403900146} 11/07/2021 02:38:12 - INFO - __main__ - Step 38429: {'lr': 0.00042891000492037315, 'samples': 7378368, 'steps': 38428, 'loss/train': 1.641010046005249} 11/07/2021 02:38:12 - INFO - __main__ - Step 38430: {'lr': 0.00042890629828205997, 'samples': 7378560, 'steps': 38429, 'loss/train': 1.5851696729660034} 11/07/2021 02:38:12 - INFO - __main__ - Step 38431: {'lr': 0.0004289025915631343, 'samples': 7378752, 'steps': 38430, 'loss/train': 1.72231924533844} 11/07/2021 02:38:13 - INFO - __main__ - Step 38432: {'lr': 0.00042889888476359793, 'samples': 7378944, 'steps': 38431, 'loss/train': 1.4775114059448242} 11/07/2021 02:38:13 - INFO - __main__ - Step 38433: {'lr': 0.0004288951778834525, 'samples': 7379136, 'steps': 38432, 'loss/train': 1.3398957252502441} 11/07/2021 02:38:13 - INFO - __main__ - Step 38434: {'lr': 0.00042889147092269964, 'samples': 7379328, 'steps': 38433, 'loss/train': 2.4124293327331543} 11/07/2021 02:38:14 - INFO - __main__ - Step 38435: {'lr': 0.0004288877638813411, 'samples': 7379520, 'steps': 38434, 'loss/train': 1.8824303150177002} 11/07/2021 02:38:14 - INFO - __main__ - Step 38436: {'lr': 0.00042888405675937843, 'samples': 7379712, 'steps': 38435, 'loss/train': 1.839871883392334} 11/07/2021 02:38:15 - INFO - __main__ - Step 38437: {'lr': 0.00042888034955681337, 'samples': 7379904, 'steps': 38436, 'loss/train': 1.8125609159469604} 11/07/2021 02:38:16 - INFO - __main__ - Step 38438: {'lr': 0.0004288766422736476, 'samples': 7380096, 'steps': 38437, 'loss/train': 2.059551477432251} 11/07/2021 02:38:16 - INFO - __main__ - Step 38439: {'lr': 0.00042887293490988276, 'samples': 7380288, 'steps': 38438, 'loss/train': 1.4829810857772827} 11/07/2021 02:38:16 - INFO - __main__ - Step 38440: {'lr': 0.00042886922746552056, 'samples': 7380480, 'steps': 38439, 'loss/train': 1.6527646780014038} 11/07/2021 02:38:17 - INFO - __main__ - Step 38441: {'lr': 0.0004288655199405626, 'samples': 7380672, 'steps': 38440, 'loss/train': 1.5163254737854004} 11/07/2021 02:38:18 - INFO - __main__ - Step 38442: {'lr': 0.00042886181233501067, 'samples': 7380864, 'steps': 38441, 'loss/train': 1.5011135339736938} 11/07/2021 02:38:18 - INFO - __main__ - Step 38443: {'lr': 0.00042885810464886635, 'samples': 7381056, 'steps': 38442, 'loss/train': 1.5980879068374634} 11/07/2021 02:38:19 - INFO - __main__ - Step 38444: {'lr': 0.0004288543968821312, 'samples': 7381248, 'steps': 38443, 'loss/train': 0.7294881939888} 11/07/2021 02:38:19 - INFO - __main__ - Step 38445: {'lr': 0.00042885068903480717, 'samples': 7381440, 'steps': 38444, 'loss/train': 0.9094494581222534} 11/07/2021 02:38:19 - INFO - __main__ - Step 38446: {'lr': 0.00042884698110689574, 'samples': 7381632, 'steps': 38445, 'loss/train': 1.5150995254516602} 11/07/2021 02:38:20 - INFO - __main__ - Step 38447: {'lr': 0.00042884327309839865, 'samples': 7381824, 'steps': 38446, 'loss/train': 1.807474970817566} 11/07/2021 02:38:21 - INFO - __main__ - Step 38448: {'lr': 0.0004288395650093174, 'samples': 7382016, 'steps': 38447, 'loss/train': 1.0160998106002808} 11/07/2021 02:38:21 - INFO - __main__ - Step 38449: {'lr': 0.000428835856839654, 'samples': 7382208, 'steps': 38448, 'loss/train': 1.399091124534607} 11/07/2021 02:38:21 - INFO - __main__ - Step 38450: {'lr': 0.0004288321485894098, 'samples': 7382400, 'steps': 38449, 'loss/train': 1.4129213094711304} 11/07/2021 02:38:22 - INFO - __main__ - Step 38451: {'lr': 0.0004288284402585866, 'samples': 7382592, 'steps': 38450, 'loss/train': 1.5371888875961304} 11/07/2021 02:38:22 - INFO - __main__ - Step 38452: {'lr': 0.0004288247318471861, 'samples': 7382784, 'steps': 38451, 'loss/train': 1.8664556741714478} 11/07/2021 02:38:23 - INFO - __main__ - Step 38453: {'lr': 0.0004288210233552099, 'samples': 7382976, 'steps': 38452, 'loss/train': 1.9075709581375122} 11/07/2021 02:38:24 - INFO - __main__ - Step 38454: {'lr': 0.00042881731478265975, 'samples': 7383168, 'steps': 38453, 'loss/train': 1.6259357929229736} 11/07/2021 02:38:24 - INFO - __main__ - Step 38455: {'lr': 0.00042881360612953724, 'samples': 7383360, 'steps': 38454, 'loss/train': 1.8561694622039795} 11/07/2021 02:38:24 - INFO - __main__ - Step 38456: {'lr': 0.0004288098973958441, 'samples': 7383552, 'steps': 38455, 'loss/train': 1.4659887552261353} 11/07/2021 02:38:25 - INFO - __main__ - Step 38457: {'lr': 0.000428806188581582, 'samples': 7383744, 'steps': 38456, 'loss/train': 1.8136464357376099} 11/07/2021 02:38:26 - INFO - __main__ - Step 38458: {'lr': 0.00042880247968675255, 'samples': 7383936, 'steps': 38457, 'loss/train': 1.191347360610962} 11/07/2021 02:38:26 - INFO - __main__ - Step 38459: {'lr': 0.00042879877071135746, 'samples': 7384128, 'steps': 38458, 'loss/train': 1.7995673418045044} 11/07/2021 02:38:26 - INFO - __main__ - Step 38460: {'lr': 0.0004287950616553984, 'samples': 7384320, 'steps': 38459, 'loss/train': 2.0367720127105713} 11/07/2021 02:38:27 - INFO - __main__ - Step 38461: {'lr': 0.0004287913525188771, 'samples': 7384512, 'steps': 38460, 'loss/train': 1.733862042427063} 11/07/2021 02:38:27 - INFO - __main__ - Step 38462: {'lr': 0.0004287876433017951, 'samples': 7384704, 'steps': 38461, 'loss/train': 1.1448413133621216} 11/07/2021 02:38:28 - INFO - __main__ - Step 38463: {'lr': 0.0004287839340041542, 'samples': 7384896, 'steps': 38462, 'loss/train': 1.2920781373977661} 11/07/2021 02:38:28 - INFO - __main__ - Step 38464: {'lr': 0.000428780224625956, 'samples': 7385088, 'steps': 38463, 'loss/train': 1.2339569330215454} 11/07/2021 02:38:29 - INFO - __main__ - Step 38465: {'lr': 0.00042877651516720215, 'samples': 7385280, 'steps': 38464, 'loss/train': 2.114656925201416} 11/07/2021 02:38:29 - INFO - __main__ - Step 38466: {'lr': 0.0004287728056278944, 'samples': 7385472, 'steps': 38465, 'loss/train': 1.7644872665405273} 11/07/2021 02:38:29 - INFO - __main__ - Step 38467: {'lr': 0.00042876909600803444, 'samples': 7385664, 'steps': 38466, 'loss/train': 1.7050596475601196} 11/07/2021 02:38:31 - INFO - __main__ - Step 38468: {'lr': 0.00042876538630762386, 'samples': 7385856, 'steps': 38467, 'loss/train': 2.0102365016937256} 11/07/2021 02:38:31 - INFO - __main__ - Step 38469: {'lr': 0.00042876167652666433, 'samples': 7386048, 'steps': 38468, 'loss/train': 1.2179516553878784} 11/07/2021 02:38:31 - INFO - __main__ - Step 38470: {'lr': 0.0004287579666651575, 'samples': 7386240, 'steps': 38469, 'loss/train': 1.756458044052124} 11/07/2021 02:38:32 - INFO - __main__ - Step 38471: {'lr': 0.00042875425672310506, 'samples': 7386432, 'steps': 38470, 'loss/train': 1.6783102750778198} 11/07/2021 02:38:32 - INFO - __main__ - Step 38472: {'lr': 0.00042875054670050885, 'samples': 7386624, 'steps': 38471, 'loss/train': 0.9036006331443787} 11/07/2021 02:38:33 - INFO - __main__ - Step 38473: {'lr': 0.00042874683659737035, 'samples': 7386816, 'steps': 38472, 'loss/train': 0.5149843692779541} 11/07/2021 02:38:33 - INFO - __main__ - Step 38474: {'lr': 0.0004287431264136913, 'samples': 7387008, 'steps': 38473, 'loss/train': 1.8024157285690308} 11/07/2021 02:38:34 - INFO - __main__ - Step 38475: {'lr': 0.0004287394161494733, 'samples': 7387200, 'steps': 38474, 'loss/train': 1.8952447175979614} 11/07/2021 02:38:34 - INFO - __main__ - Step 38476: {'lr': 0.0004287357058047181, 'samples': 7387392, 'steps': 38475, 'loss/train': 1.8905134201049805} 11/07/2021 02:38:34 - INFO - __main__ - Step 38477: {'lr': 0.00042873199537942733, 'samples': 7387584, 'steps': 38476, 'loss/train': 1.6807677745819092} 11/07/2021 02:38:35 - INFO - __main__ - Step 38478: {'lr': 0.0004287282848736027, 'samples': 7387776, 'steps': 38477, 'loss/train': 1.6995487213134766} 11/07/2021 02:38:36 - INFO - __main__ - Step 38479: {'lr': 0.00042872457428724586, 'samples': 7387968, 'steps': 38478, 'loss/train': 1.7590001821517944} 11/07/2021 02:38:36 - INFO - __main__ - Step 38480: {'lr': 0.00042872086362035844, 'samples': 7388160, 'steps': 38479, 'loss/train': 1.6011989116668701} 11/07/2021 02:38:36 - INFO - __main__ - Step 38481: {'lr': 0.00042871715287294223, 'samples': 7388352, 'steps': 38480, 'loss/train': 1.413744330406189} 11/07/2021 02:38:37 - INFO - __main__ - Step 38482: {'lr': 0.00042871344204499886, 'samples': 7388544, 'steps': 38481, 'loss/train': 1.6196192502975464} 11/07/2021 02:38:37 - INFO - __main__ - Step 38483: {'lr': 0.0004287097311365299, 'samples': 7388736, 'steps': 38482, 'loss/train': 1.6136687994003296} 11/07/2021 02:38:38 - INFO - __main__ - Step 38484: {'lr': 0.00042870602014753707, 'samples': 7388928, 'steps': 38483, 'loss/train': 1.3907452821731567} 11/07/2021 02:38:39 - INFO - __main__ - Step 38485: {'lr': 0.0004287023090780221, 'samples': 7389120, 'steps': 38484, 'loss/train': 1.3356413841247559} 11/07/2021 02:38:39 - INFO - __main__ - Step 38486: {'lr': 0.0004286985979279866, 'samples': 7389312, 'steps': 38485, 'loss/train': 1.608561635017395} 11/07/2021 02:38:39 - INFO - __main__ - Step 38487: {'lr': 0.0004286948866974323, 'samples': 7389504, 'steps': 38486, 'loss/train': 1.2108365297317505} 11/07/2021 02:38:40 - INFO - __main__ - Step 38488: {'lr': 0.0004286911753863608, 'samples': 7389696, 'steps': 38487, 'loss/train': 1.6934078931808472} 11/07/2021 02:38:41 - INFO - __main__ - Step 38489: {'lr': 0.0004286874639947739, 'samples': 7389888, 'steps': 38488, 'loss/train': 1.500806212425232} 11/07/2021 02:38:41 - INFO - __main__ - Step 38490: {'lr': 0.0004286837525226731, 'samples': 7390080, 'steps': 38489, 'loss/train': 1.5360748767852783} 11/07/2021 02:38:41 - INFO - __main__ - Step 38491: {'lr': 0.0004286800409700602, 'samples': 7390272, 'steps': 38490, 'loss/train': 1.1713488101959229} 11/07/2021 02:38:42 - INFO - __main__ - Step 38492: {'lr': 0.0004286763293369369, 'samples': 7390464, 'steps': 38491, 'loss/train': 1.2377177476882935} 11/07/2021 02:38:42 - INFO - __main__ - Step 38493: {'lr': 0.00042867261762330466, 'samples': 7390656, 'steps': 38492, 'loss/train': 1.7022583484649658} 11/07/2021 02:38:43 - INFO - __main__ - Step 38494: {'lr': 0.0004286689058291654, 'samples': 7390848, 'steps': 38493, 'loss/train': 1.2819725275039673} 11/07/2021 02:38:43 - INFO - __main__ - Step 38495: {'lr': 0.00042866519395452063, 'samples': 7391040, 'steps': 38494, 'loss/train': 1.3884963989257812} 11/07/2021 02:38:44 - INFO - __main__ - Step 38496: {'lr': 0.00042866148199937216, 'samples': 7391232, 'steps': 38495, 'loss/train': 1.4726425409317017} 11/07/2021 02:38:44 - INFO - __main__ - Step 38497: {'lr': 0.00042865776996372146, 'samples': 7391424, 'steps': 38496, 'loss/train': 1.3454011678695679} 11/07/2021 02:38:44 - INFO - __main__ - Step 38498: {'lr': 0.00042865405784757037, 'samples': 7391616, 'steps': 38497, 'loss/train': 2.008884906768799} 11/07/2021 02:38:45 - INFO - __main__ - Step 38499: {'lr': 0.0004286503456509206, 'samples': 7391808, 'steps': 38498, 'loss/train': 1.5773800611495972} 11/07/2021 02:38:46 - INFO - __main__ - Step 38500: {'lr': 0.0004286466333737737, 'samples': 7392000, 'steps': 38499, 'loss/train': 1.757792353630066} 11/07/2021 02:38:46 - INFO - __main__ - Step 38501: {'lr': 0.00042864292101613133, 'samples': 7392192, 'steps': 38500, 'loss/train': 1.4850726127624512} 11/07/2021 02:38:46 - INFO - __main__ - Step 38502: {'lr': 0.0004286392085779953, 'samples': 7392384, 'steps': 38501, 'loss/train': 1.7341381311416626} 11/07/2021 02:38:47 - INFO - __main__ - Step 38503: {'lr': 0.00042863549605936716, 'samples': 7392576, 'steps': 38502, 'loss/train': 1.6198322772979736} 11/07/2021 02:38:47 - INFO - __main__ - Step 38504: {'lr': 0.00042863178346024856, 'samples': 7392768, 'steps': 38503, 'loss/train': 1.4690289497375488} 11/07/2021 02:38:48 - INFO - __main__ - Step 38505: {'lr': 0.00042862807078064124, 'samples': 7392960, 'steps': 38504, 'loss/train': 2.190840244293213} 11/07/2021 02:38:49 - INFO - __main__ - Step 38506: {'lr': 0.00042862435802054703, 'samples': 7393152, 'steps': 38505, 'loss/train': 0.8740153908729553} 11/07/2021 02:38:49 - INFO - __main__ - Step 38507: {'lr': 0.00042862064517996723, 'samples': 7393344, 'steps': 38506, 'loss/train': 1.7972629070281982} 11/07/2021 02:38:49 - INFO - __main__ - Step 38508: {'lr': 0.00042861693225890385, 'samples': 7393536, 'steps': 38507, 'loss/train': 1.6027082204818726} 11/07/2021 02:38:50 - INFO - __main__ - Step 38509: {'lr': 0.0004286132192573584, 'samples': 7393728, 'steps': 38508, 'loss/train': 1.0021463632583618} 11/07/2021 02:38:51 - INFO - __main__ - Step 38510: {'lr': 0.0004286095061753326, 'samples': 7393920, 'steps': 38509, 'loss/train': 1.9077627658843994} 11/07/2021 02:38:51 - INFO - __main__ - Step 38511: {'lr': 0.0004286057930128281, 'samples': 7394112, 'steps': 38510, 'loss/train': 1.4616092443466187} 11/07/2021 02:38:51 - INFO - __main__ - Step 38512: {'lr': 0.00042860207976984664, 'samples': 7394304, 'steps': 38511, 'loss/train': 1.640619158744812} 11/07/2021 02:38:52 - INFO - __main__ - Step 38513: {'lr': 0.00042859836644638976, 'samples': 7394496, 'steps': 38512, 'loss/train': 3.527266502380371} 11/07/2021 02:38:52 - INFO - __main__ - Step 38514: {'lr': 0.00042859465304245927, 'samples': 7394688, 'steps': 38513, 'loss/train': 1.371895670890808} 11/07/2021 02:38:53 - INFO - __main__ - Step 38515: {'lr': 0.00042859093955805675, 'samples': 7394880, 'steps': 38514, 'loss/train': 1.5881764888763428} 11/07/2021 02:38:53 - INFO - __main__ - Step 38516: {'lr': 0.0004285872259931839, 'samples': 7395072, 'steps': 38515, 'loss/train': 1.1682759523391724} 11/07/2021 02:38:54 - INFO - __main__ - Step 38517: {'lr': 0.00042858351234784244, 'samples': 7395264, 'steps': 38516, 'loss/train': 1.5074559450149536} 11/07/2021 02:38:54 - INFO - __main__ - Step 38518: {'lr': 0.000428579798622034, 'samples': 7395456, 'steps': 38517, 'loss/train': 1.5640182495117188} 11/07/2021 02:38:54 - INFO - __main__ - Step 38519: {'lr': 0.0004285760848157603, 'samples': 7395648, 'steps': 38518, 'loss/train': 2.235804557800293} 11/07/2021 02:38:56 - INFO - __main__ - Step 38520: {'lr': 0.00042857237092902285, 'samples': 7395840, 'steps': 38519, 'loss/train': 1.3733848333358765} 11/07/2021 02:38:56 - INFO - __main__ - Step 38521: {'lr': 0.0004285686569618235, 'samples': 7396032, 'steps': 38520, 'loss/train': 1.7476186752319336} 11/07/2021 02:38:56 - INFO - __main__ - Step 38522: {'lr': 0.0004285649429141639, 'samples': 7396224, 'steps': 38521, 'loss/train': 1.8905295133590698} 11/07/2021 02:38:57 - INFO - __main__ - Step 38523: {'lr': 0.00042856122878604566, 'samples': 7396416, 'steps': 38522, 'loss/train': 1.5531270503997803} 11/07/2021 02:38:57 - INFO - __main__ - Step 38524: {'lr': 0.0004285575145774705, 'samples': 7396608, 'steps': 38523, 'loss/train': 1.5270588397979736} 11/07/2021 02:38:57 - INFO - __main__ - Step 38525: {'lr': 0.00042855380028844004, 'samples': 7396800, 'steps': 38524, 'loss/train': 1.3176720142364502} 11/07/2021 02:38:58 - INFO - __main__ - Step 38526: {'lr': 0.00042855008591895607, 'samples': 7396992, 'steps': 38525, 'loss/train': 1.2649494409561157} 11/07/2021 02:38:59 - INFO - __main__ - Step 38527: {'lr': 0.00042854637146902007, 'samples': 7397184, 'steps': 38526, 'loss/train': 1.1893590688705444} 11/07/2021 02:38:59 - INFO - __main__ - Step 38528: {'lr': 0.00042854265693863394, 'samples': 7397376, 'steps': 38527, 'loss/train': 1.7693283557891846} 11/07/2021 02:38:59 - INFO - __main__ - Step 38529: {'lr': 0.00042853894232779924, 'samples': 7397568, 'steps': 38528, 'loss/train': 1.7133264541625977} 11/07/2021 02:39:00 - INFO - __main__ - Step 38530: {'lr': 0.00042853522763651767, 'samples': 7397760, 'steps': 38529, 'loss/train': 1.9178385734558105} 11/07/2021 02:39:01 - INFO - __main__ - Step 38531: {'lr': 0.00042853151286479074, 'samples': 7397952, 'steps': 38530, 'loss/train': 3.3915510177612305} 11/07/2021 02:39:01 - INFO - __main__ - Step 38532: {'lr': 0.0004285277980126204, 'samples': 7398144, 'steps': 38531, 'loss/train': 1.5660945177078247} 11/07/2021 02:39:01 - INFO - __main__ - Step 38533: {'lr': 0.0004285240830800081, 'samples': 7398336, 'steps': 38532, 'loss/train': 1.3985246419906616} 11/07/2021 02:39:02 - INFO - __main__ - Step 38534: {'lr': 0.00042852036806695565, 'samples': 7398528, 'steps': 38533, 'loss/train': 1.6784566640853882} 11/07/2021 02:39:02 - INFO - __main__ - Step 38535: {'lr': 0.0004285166529734647, 'samples': 7398720, 'steps': 38534, 'loss/train': 1.5484535694122314} 11/07/2021 02:39:03 - INFO - __main__ - Step 38536: {'lr': 0.0004285129377995369, 'samples': 7398912, 'steps': 38535, 'loss/train': 1.6828726530075073} 11/07/2021 02:39:04 - INFO - __main__ - Step 38537: {'lr': 0.0004285092225451739, 'samples': 7399104, 'steps': 38536, 'loss/train': 1.7928122282028198} 11/07/2021 02:39:04 - INFO - __main__ - Step 38538: {'lr': 0.0004285055072103774, 'samples': 7399296, 'steps': 38537, 'loss/train': 1.6644047498703003} 11/07/2021 02:39:04 - INFO - __main__ - Step 38539: {'lr': 0.00042850179179514906, 'samples': 7399488, 'steps': 38538, 'loss/train': 1.8039556741714478} 11/07/2021 02:39:05 - INFO - __main__ - Step 38540: {'lr': 0.00042849807629949057, 'samples': 7399680, 'steps': 38539, 'loss/train': 1.33259117603302} 11/07/2021 02:39:05 - INFO - __main__ - Step 38541: {'lr': 0.0004284943607234036, 'samples': 7399872, 'steps': 38540, 'loss/train': 1.7744859457015991} 11/07/2021 02:39:07 - INFO - __main__ - Step 38542: {'lr': 0.00042849064506688984, 'samples': 7400064, 'steps': 38541, 'loss/train': 1.8936946392059326} 11/07/2021 02:39:07 - INFO - __main__ - Step 38543: {'lr': 0.00042848692932995094, 'samples': 7400256, 'steps': 38542, 'loss/train': 1.3653404712677002} 11/07/2021 02:39:07 - INFO - __main__ - Step 38544: {'lr': 0.0004284832135125886, 'samples': 7400448, 'steps': 38543, 'loss/train': 1.289848804473877} 11/07/2021 02:39:08 - INFO - __main__ - Step 38545: {'lr': 0.0004284794976148044, 'samples': 7400640, 'steps': 38544, 'loss/train': 0.7191889882087708} 11/07/2021 02:39:08 - INFO - __main__ - Step 38546: {'lr': 0.00042847578163660016, 'samples': 7400832, 'steps': 38545, 'loss/train': 1.2714651823043823} 11/07/2021 02:39:09 - INFO - __main__ - Step 38547: {'lr': 0.0004284720655779775, 'samples': 7401024, 'steps': 38546, 'loss/train': 1.4489455223083496} 11/07/2021 02:39:10 - INFO - __main__ - Step 38548: {'lr': 0.00042846834943893806, 'samples': 7401216, 'steps': 38547, 'loss/train': 1.9417572021484375} 11/07/2021 02:39:10 - INFO - __main__ - Step 38549: {'lr': 0.0004284646332194836, 'samples': 7401408, 'steps': 38548, 'loss/train': 1.0289621353149414} 11/07/2021 02:39:10 - INFO - __main__ - Step 38550: {'lr': 0.0004284609169196156, 'samples': 7401600, 'steps': 38549, 'loss/train': 1.6011112928390503} 11/07/2021 02:39:11 - INFO - __main__ - Step 38551: {'lr': 0.000428457200539336, 'samples': 7401792, 'steps': 38550, 'loss/train': 1.1118967533111572} 11/07/2021 02:39:11 - INFO - __main__ - Step 38552: {'lr': 0.0004284534840786463, 'samples': 7401984, 'steps': 38551, 'loss/train': 1.5389808416366577} 11/07/2021 02:39:12 - INFO - __main__ - Step 38553: {'lr': 0.0004284497675375482, 'samples': 7402176, 'steps': 38552, 'loss/train': 1.730039358139038} 11/07/2021 02:39:12 - INFO - __main__ - Step 38554: {'lr': 0.0004284460509160433, 'samples': 7402368, 'steps': 38553, 'loss/train': 1.4535332918167114} 11/07/2021 02:39:13 - INFO - __main__ - Step 38555: {'lr': 0.0004284423342141335, 'samples': 7402560, 'steps': 38554, 'loss/train': 1.4308903217315674} 11/07/2021 02:39:13 - INFO - __main__ - Step 38556: {'lr': 0.0004284386174318202, 'samples': 7402752, 'steps': 38555, 'loss/train': 1.5834574699401855} 11/07/2021 02:39:13 - INFO - __main__ - Step 38557: {'lr': 0.00042843490056910534, 'samples': 7402944, 'steps': 38556, 'loss/train': 1.7708011865615845} 11/07/2021 02:39:14 - INFO - __main__ - Step 38558: {'lr': 0.00042843118362599045, 'samples': 7403136, 'steps': 38557, 'loss/train': 1.0050923824310303} 11/07/2021 02:39:15 - INFO - __main__ - Step 38559: {'lr': 0.0004284274666024772, 'samples': 7403328, 'steps': 38558, 'loss/train': 1.5794517993927002} 11/07/2021 02:39:15 - INFO - __main__ - Step 38560: {'lr': 0.0004284237494985672, 'samples': 7403520, 'steps': 38559, 'loss/train': 1.651281714439392} 11/07/2021 02:39:15 - INFO - __main__ - Step 38561: {'lr': 0.0004284200323142623, 'samples': 7403712, 'steps': 38560, 'loss/train': 2.0764431953430176} 11/07/2021 02:39:16 - INFO - __main__ - Step 38562: {'lr': 0.0004284163150495641, 'samples': 7403904, 'steps': 38561, 'loss/train': 1.2969495058059692} 11/07/2021 02:39:17 - INFO - __main__ - Step 38563: {'lr': 0.00042841259770447427, 'samples': 7404096, 'steps': 38562, 'loss/train': 1.4701931476593018} 11/07/2021 02:39:17 - INFO - __main__ - Step 38564: {'lr': 0.00042840888027899436, 'samples': 7404288, 'steps': 38563, 'loss/train': 1.3531675338745117} 11/07/2021 02:39:18 - INFO - __main__ - Step 38565: {'lr': 0.0004284051627731263, 'samples': 7404480, 'steps': 38564, 'loss/train': 1.5186748504638672} 11/07/2021 02:39:18 - INFO - __main__ - Step 38566: {'lr': 0.0004284014451868716, 'samples': 7404672, 'steps': 38565, 'loss/train': 1.763478398323059} 11/07/2021 02:39:18 - INFO - __main__ - Step 38567: {'lr': 0.0004283977275202319, 'samples': 7404864, 'steps': 38566, 'loss/train': 1.5161564350128174} 11/07/2021 02:39:19 - INFO - __main__ - Step 38568: {'lr': 0.00042839400977320895, 'samples': 7405056, 'steps': 38567, 'loss/train': 1.660630226135254} 11/07/2021 02:39:20 - INFO - __main__ - Step 38569: {'lr': 0.00042839029194580446, 'samples': 7405248, 'steps': 38568, 'loss/train': 1.5722196102142334} 11/07/2021 02:39:20 - INFO - __main__ - Step 38570: {'lr': 0.0004283865740380201, 'samples': 7405440, 'steps': 38569, 'loss/train': 1.3853479623794556} 11/07/2021 02:39:20 - INFO - __main__ - Step 38571: {'lr': 0.0004283828560498574, 'samples': 7405632, 'steps': 38570, 'loss/train': 0.9588892459869385} 11/07/2021 02:39:21 - INFO - __main__ - Step 38572: {'lr': 0.0004283791379813181, 'samples': 7405824, 'steps': 38571, 'loss/train': 1.5066494941711426} 11/07/2021 02:39:22 - INFO - __main__ - Step 38573: {'lr': 0.000428375419832404, 'samples': 7406016, 'steps': 38572, 'loss/train': 1.527634620666504} 11/07/2021 02:39:22 - INFO - __main__ - Step 38574: {'lr': 0.0004283717016031167, 'samples': 7406208, 'steps': 38573, 'loss/train': 1.3006707429885864} 11/07/2021 02:39:22 - INFO - __main__ - Step 38575: {'lr': 0.0004283679832934578, 'samples': 7406400, 'steps': 38574, 'loss/train': 1.6669787168502808} 11/07/2021 02:39:23 - INFO - __main__ - Step 38576: {'lr': 0.0004283642649034291, 'samples': 7406592, 'steps': 38575, 'loss/train': 1.7991132736206055} 11/07/2021 02:39:23 - INFO - __main__ - Step 38577: {'lr': 0.00042836054643303226, 'samples': 7406784, 'steps': 38576, 'loss/train': 2.0059776306152344} 11/07/2021 02:39:24 - INFO - __main__ - Step 38578: {'lr': 0.0004283568278822688, 'samples': 7406976, 'steps': 38577, 'loss/train': 0.8108773827552795} 11/07/2021 02:39:25 - INFO - __main__ - Step 38579: {'lr': 0.0004283531092511405, 'samples': 7407168, 'steps': 38578, 'loss/train': 1.2539422512054443} 11/07/2021 02:39:25 - INFO - __main__ - Step 38580: {'lr': 0.0004283493905396491, 'samples': 7407360, 'steps': 38579, 'loss/train': 0.8251067996025085} 11/07/2021 02:39:25 - INFO - __main__ - Step 38581: {'lr': 0.00042834567174779623, 'samples': 7407552, 'steps': 38580, 'loss/train': 1.5102128982543945} 11/07/2021 02:39:26 - INFO - __main__ - Step 38582: {'lr': 0.00042834195287558356, 'samples': 7407744, 'steps': 38581, 'loss/train': 1.4471834897994995} 11/07/2021 02:39:26 - INFO - __main__ - Step 38583: {'lr': 0.00042833823392301264, 'samples': 7407936, 'steps': 38582, 'loss/train': 0.9997994303703308} 11/07/2021 02:39:27 - INFO - __main__ - Step 38584: {'lr': 0.00042833451489008537, 'samples': 7408128, 'steps': 38583, 'loss/train': 0.7278831005096436} 11/07/2021 02:39:27 - INFO - __main__ - Step 38585: {'lr': 0.00042833079577680327, 'samples': 7408320, 'steps': 38584, 'loss/train': 1.4909403324127197} 11/07/2021 02:39:28 - INFO - __main__ - Step 38586: {'lr': 0.0004283270765831682, 'samples': 7408512, 'steps': 38585, 'loss/train': 1.8982082605361938} 11/07/2021 02:39:28 - INFO - __main__ - Step 38587: {'lr': 0.00042832335730918147, 'samples': 7408704, 'steps': 38586, 'loss/train': 1.4644055366516113} 11/07/2021 02:39:29 - INFO - __main__ - Step 38588: {'lr': 0.0004283196379548451, 'samples': 7408896, 'steps': 38587, 'loss/train': 1.6861553192138672} 11/07/2021 02:39:30 - INFO - __main__ - Step 38589: {'lr': 0.0004283159185201607, 'samples': 7409088, 'steps': 38588, 'loss/train': 1.7528589963912964} 11/07/2021 02:39:30 - INFO - __main__ - Step 38590: {'lr': 0.00042831219900512984, 'samples': 7409280, 'steps': 38589, 'loss/train': 1.5910186767578125} 11/07/2021 02:39:30 - INFO - __main__ - Step 38591: {'lr': 0.0004283084794097543, 'samples': 7409472, 'steps': 38590, 'loss/train': 1.6175768375396729} 11/07/2021 02:39:31 - INFO - __main__ - Step 38592: {'lr': 0.00042830475973403573, 'samples': 7409664, 'steps': 38591, 'loss/train': 0.31639763712882996} 11/07/2021 02:39:31 - INFO - __main__ - Step 38593: {'lr': 0.0004283010399779757, 'samples': 7409856, 'steps': 38592, 'loss/train': 1.6801315546035767} 11/07/2021 02:39:32 - INFO - __main__ - Step 38594: {'lr': 0.000428297320141576, 'samples': 7410048, 'steps': 38593, 'loss/train': 1.7525479793548584} 11/07/2021 02:39:32 - INFO - __main__ - Step 38595: {'lr': 0.0004282936002248383, 'samples': 7410240, 'steps': 38594, 'loss/train': 1.3758429288864136} 11/07/2021 02:39:33 - INFO - __main__ - Step 38596: {'lr': 0.00042828988022776426, 'samples': 7410432, 'steps': 38595, 'loss/train': 1.3587149381637573} 11/07/2021 02:39:33 - INFO - __main__ - Step 38597: {'lr': 0.00042828616015035554, 'samples': 7410624, 'steps': 38596, 'loss/train': 1.565584659576416} 11/07/2021 02:39:33 - INFO - __main__ - Step 38598: {'lr': 0.00042828243999261384, 'samples': 7410816, 'steps': 38597, 'loss/train': 1.705330491065979} 11/07/2021 02:39:34 - INFO - __main__ - Step 38599: {'lr': 0.0004282787197545408, 'samples': 7411008, 'steps': 38598, 'loss/train': 1.0509960651397705} 11/07/2021 02:39:35 - INFO - __main__ - Step 38600: {'lr': 0.00042827499943613815, 'samples': 7411200, 'steps': 38599, 'loss/train': 1.2314541339874268} 11/07/2021 02:39:35 - INFO - __main__ - Step 38601: {'lr': 0.00042827127903740747, 'samples': 7411392, 'steps': 38600, 'loss/train': 1.5458012819290161} 11/07/2021 02:39:36 - INFO - __main__ - Step 38602: {'lr': 0.00042826755855835053, 'samples': 7411584, 'steps': 38601, 'loss/train': 1.6242434978485107} 11/07/2021 02:39:36 - INFO - __main__ - Step 38603: {'lr': 0.00042826383799896906, 'samples': 7411776, 'steps': 38602, 'loss/train': 1.4507431983947754} 11/07/2021 02:39:36 - INFO - __main__ - Step 38604: {'lr': 0.0004282601173592646, 'samples': 7411968, 'steps': 38603, 'loss/train': 1.4450706243515015} 11/07/2021 02:39:37 - INFO - __main__ - Step 38605: {'lr': 0.0004282563966392389, 'samples': 7412160, 'steps': 38604, 'loss/train': 1.1748977899551392} 11/07/2021 02:39:38 - INFO - __main__ - Step 38606: {'lr': 0.00042825267583889354, 'samples': 7412352, 'steps': 38605, 'loss/train': 1.4789282083511353} 11/07/2021 02:39:38 - INFO - __main__ - Step 38607: {'lr': 0.00042824895495823033, 'samples': 7412544, 'steps': 38606, 'loss/train': 1.350877285003662} 11/07/2021 02:39:38 - INFO - __main__ - Step 38608: {'lr': 0.0004282452339972509, 'samples': 7412736, 'steps': 38607, 'loss/train': 0.5122175216674805} 11/07/2021 02:39:39 - INFO - __main__ - Step 38609: {'lr': 0.00042824151295595695, 'samples': 7412928, 'steps': 38608, 'loss/train': 1.9182634353637695} 11/07/2021 02:39:40 - INFO - __main__ - Step 38610: {'lr': 0.0004282377918343501, 'samples': 7413120, 'steps': 38609, 'loss/train': 1.5004831552505493} 11/07/2021 02:39:40 - INFO - __main__ - Step 38611: {'lr': 0.00042823407063243197, 'samples': 7413312, 'steps': 38610, 'loss/train': 1.1725540161132812} 11/07/2021 02:39:40 - INFO - __main__ - Step 38612: {'lr': 0.0004282303493502044, 'samples': 7413504, 'steps': 38611, 'loss/train': 0.6597931981086731} 11/07/2021 02:39:41 - INFO - __main__ - Step 38613: {'lr': 0.000428226627987669, 'samples': 7413696, 'steps': 38612, 'loss/train': 1.3209978342056274} 11/07/2021 02:39:41 - INFO - __main__ - Step 38614: {'lr': 0.0004282229065448273, 'samples': 7413888, 'steps': 38613, 'loss/train': 1.6144988536834717} 11/07/2021 02:39:42 - INFO - __main__ - Step 38615: {'lr': 0.0004282191850216812, 'samples': 7414080, 'steps': 38614, 'loss/train': 1.519603967666626} 11/07/2021 02:39:42 - INFO - __main__ - Step 38616: {'lr': 0.00042821546341823236, 'samples': 7414272, 'steps': 38615, 'loss/train': 1.4626085758209229} 11/07/2021 02:39:43 - INFO - __main__ - Step 38617: {'lr': 0.0004282117417344823, 'samples': 7414464, 'steps': 38616, 'loss/train': 1.5803148746490479} 11/07/2021 02:39:43 - INFO - __main__ - Step 38618: {'lr': 0.00042820801997043277, 'samples': 7414656, 'steps': 38617, 'loss/train': 1.4064488410949707} 11/07/2021 02:39:43 - INFO - __main__ - Step 38619: {'lr': 0.0004282042981260855, 'samples': 7414848, 'steps': 38618, 'loss/train': 1.533360481262207} 11/07/2021 02:39:44 - INFO - __main__ - Step 38620: {'lr': 0.00042820057620144214, 'samples': 7415040, 'steps': 38619, 'loss/train': 1.3826243877410889} 11/07/2021 02:39:45 - INFO - __main__ - Step 38621: {'lr': 0.00042819685419650427, 'samples': 7415232, 'steps': 38620, 'loss/train': 1.0208340883255005} 11/07/2021 02:39:45 - INFO - __main__ - Step 38622: {'lr': 0.0004281931321112737, 'samples': 7415424, 'steps': 38621, 'loss/train': 1.1382410526275635} 11/07/2021 02:39:45 - INFO - __main__ - Step 38623: {'lr': 0.0004281894099457521, 'samples': 7415616, 'steps': 38622, 'loss/train': 1.4776126146316528} 11/07/2021 02:39:46 - INFO - __main__ - Step 38624: {'lr': 0.00042818568769994103, 'samples': 7415808, 'steps': 38623, 'loss/train': 0.15500399470329285} 11/07/2021 02:39:47 - INFO - __main__ - Step 38625: {'lr': 0.00042818196537384225, 'samples': 7416000, 'steps': 38624, 'loss/train': 1.2261273860931396} 11/07/2021 02:39:47 - INFO - __main__ - Step 38626: {'lr': 0.0004281782429674574, 'samples': 7416192, 'steps': 38625, 'loss/train': 1.6735538244247437} 11/07/2021 02:39:48 - INFO - __main__ - Step 38627: {'lr': 0.0004281745204807882, 'samples': 7416384, 'steps': 38626, 'loss/train': 1.849352478981018} 11/07/2021 02:39:48 - INFO - __main__ - Step 38628: {'lr': 0.00042817079791383636, 'samples': 7416576, 'steps': 38627, 'loss/train': 1.77236807346344} 11/07/2021 02:39:48 - INFO - __main__ - Step 38629: {'lr': 0.00042816707526660346, 'samples': 7416768, 'steps': 38628, 'loss/train': 1.4409795999526978} 11/07/2021 02:39:49 - INFO - __main__ - Step 38630: {'lr': 0.00042816335253909125, 'samples': 7416960, 'steps': 38629, 'loss/train': 1.3015356063842773} 11/07/2021 02:39:50 - INFO - __main__ - Step 38631: {'lr': 0.00042815962973130134, 'samples': 7417152, 'steps': 38630, 'loss/train': 1.5178568363189697} 11/07/2021 02:39:50 - INFO - __main__ - Step 38632: {'lr': 0.00042815590684323554, 'samples': 7417344, 'steps': 38631, 'loss/train': 1.4378799200057983} 11/07/2021 02:39:50 - INFO - __main__ - Step 38633: {'lr': 0.00042815218387489535, 'samples': 7417536, 'steps': 38632, 'loss/train': 1.3826674222946167} 11/07/2021 02:39:51 - INFO - __main__ - Step 38634: {'lr': 0.00042814846082628256, 'samples': 7417728, 'steps': 38633, 'loss/train': 1.2611559629440308} 11/07/2021 02:39:51 - INFO - __main__ - Step 38635: {'lr': 0.0004281447376973988, 'samples': 7417920, 'steps': 38634, 'loss/train': 1.0202590227127075} 11/07/2021 02:39:52 - INFO - __main__ - Step 38636: {'lr': 0.00042814101448824583, 'samples': 7418112, 'steps': 38635, 'loss/train': 1.3930331468582153} 11/07/2021 02:39:52 - INFO - __main__ - Step 38637: {'lr': 0.0004281372911988253, 'samples': 7418304, 'steps': 38636, 'loss/train': 1.5592061281204224} 11/07/2021 02:39:53 - INFO - __main__ - Step 38638: {'lr': 0.0004281335678291387, 'samples': 7418496, 'steps': 38637, 'loss/train': 1.780315637588501} 11/07/2021 02:39:53 - INFO - __main__ - Step 38639: {'lr': 0.000428129844379188, 'samples': 7418688, 'steps': 38638, 'loss/train': 1.7167903184890747} 11/07/2021 02:39:53 - INFO - __main__ - Step 38640: {'lr': 0.0004281261208489747, 'samples': 7418880, 'steps': 38639, 'loss/train': 1.198464274406433} 11/07/2021 02:39:54 - INFO - __main__ - Step 38641: {'lr': 0.0004281223972385004, 'samples': 7419072, 'steps': 38640, 'loss/train': 1.5182913541793823} 11/07/2021 02:39:55 - INFO - __main__ - Step 38642: {'lr': 0.00042811867354776705, 'samples': 7419264, 'steps': 38641, 'loss/train': 1.1084949970245361} 11/07/2021 02:39:55 - INFO - __main__ - Step 38643: {'lr': 0.0004281149497767761, 'samples': 7419456, 'steps': 38642, 'loss/train': 1.3935750722885132} 11/07/2021 02:39:56 - INFO - __main__ - Step 38644: {'lr': 0.00042811122592552943, 'samples': 7419648, 'steps': 38643, 'loss/train': 1.4061126708984375} 11/07/2021 02:39:56 - INFO - __main__ - Step 38645: {'lr': 0.0004281075019940285, 'samples': 7419840, 'steps': 38644, 'loss/train': 1.5206317901611328} 11/07/2021 02:39:57 - INFO - __main__ - Step 38646: {'lr': 0.00042810377798227506, 'samples': 7420032, 'steps': 38645, 'loss/train': 1.5615888833999634} 11/07/2021 02:39:57 - INFO - __main__ - Step 38647: {'lr': 0.00042810005389027077, 'samples': 7420224, 'steps': 38646, 'loss/train': 1.8336944580078125} 11/07/2021 02:39:58 - INFO - __main__ - Step 38648: {'lr': 0.0004280963297180174, 'samples': 7420416, 'steps': 38647, 'loss/train': 1.5942424535751343} 11/07/2021 02:39:58 - INFO - __main__ - Step 38649: {'lr': 0.0004280926054655165, 'samples': 7420608, 'steps': 38648, 'loss/train': 1.7997180223464966} 11/07/2021 02:39:58 - INFO - __main__ - Step 38650: {'lr': 0.00042808888113277, 'samples': 7420800, 'steps': 38649, 'loss/train': 1.1809117794036865} 11/07/2021 02:40:00 - INFO - __main__ - Step 38651: {'lr': 0.0004280851567197792, 'samples': 7420992, 'steps': 38650, 'loss/train': 1.705949068069458} 11/07/2021 02:40:00 - INFO - __main__ - Step 38652: {'lr': 0.0004280814322265461, 'samples': 7421184, 'steps': 38651, 'loss/train': 2.073387384414673} 11/07/2021 02:40:00 - INFO - __main__ - Step 38653: {'lr': 0.00042807770765307217, 'samples': 7421376, 'steps': 38652, 'loss/train': 0.7365889549255371} 11/07/2021 02:40:01 - INFO - __main__ - Step 38654: {'lr': 0.00042807398299935927, 'samples': 7421568, 'steps': 38653, 'loss/train': 0.6344184875488281} 11/07/2021 02:40:01 - INFO - __main__ - Step 38655: {'lr': 0.0004280702582654089, 'samples': 7421760, 'steps': 38654, 'loss/train': 2.5534884929656982} 11/07/2021 02:40:01 - INFO - __main__ - Step 38656: {'lr': 0.00042806653345122287, 'samples': 7421952, 'steps': 38655, 'loss/train': 1.5213661193847656} 11/07/2021 02:40:02 - INFO - __main__ - Step 38657: {'lr': 0.0004280628085568028, 'samples': 7422144, 'steps': 38656, 'loss/train': 1.452724814414978} 11/07/2021 02:40:03 - INFO - __main__ - Step 38658: {'lr': 0.0004280590835821503, 'samples': 7422336, 'steps': 38657, 'loss/train': 2.2942631244659424} 11/07/2021 02:40:03 - INFO - __main__ - Step 38659: {'lr': 0.0004280553585272672, 'samples': 7422528, 'steps': 38658, 'loss/train': 1.8796608448028564} 11/07/2021 02:40:03 - INFO - __main__ - Step 38660: {'lr': 0.0004280516333921551, 'samples': 7422720, 'steps': 38659, 'loss/train': 2.2061965465545654} 11/07/2021 02:40:04 - INFO - __main__ - Step 38661: {'lr': 0.00042804790817681574, 'samples': 7422912, 'steps': 38660, 'loss/train': 1.398537039756775} 11/07/2021 02:40:06 - INFO - __main__ - Step 38662: {'lr': 0.0004280441828812506, 'samples': 7423104, 'steps': 38661, 'loss/train': 1.5821424722671509} 11/07/2021 02:40:06 - INFO - __main__ - Step 38663: {'lr': 0.0004280404575054616, 'samples': 7423296, 'steps': 38662, 'loss/train': 2.0686147212982178} 11/07/2021 02:40:06 - INFO - __main__ - Step 38664: {'lr': 0.00042803673204945027, 'samples': 7423488, 'steps': 38663, 'loss/train': 2.4432177543640137} 11/07/2021 02:40:07 - INFO - __main__ - Step 38665: {'lr': 0.0004280330065132184, 'samples': 7423680, 'steps': 38664, 'loss/train': 0.8475606441497803} 11/07/2021 02:40:07 - INFO - __main__ - Step 38666: {'lr': 0.0004280292808967675, 'samples': 7423872, 'steps': 38665, 'loss/train': 0.8151193857192993} 11/07/2021 02:40:07 - INFO - __main__ - Step 38667: {'lr': 0.00042802555520009945, 'samples': 7424064, 'steps': 38666, 'loss/train': 1.1058653593063354} 11/07/2021 02:40:08 - INFO - __main__ - Step 38668: {'lr': 0.00042802182942321576, 'samples': 7424256, 'steps': 38667, 'loss/train': 1.372496485710144} 11/07/2021 02:40:09 - INFO - __main__ - Step 38669: {'lr': 0.0004280181035661182, 'samples': 7424448, 'steps': 38668, 'loss/train': 1.6026809215545654} 11/07/2021 02:40:09 - INFO - __main__ - Step 38670: {'lr': 0.0004280143776288085, 'samples': 7424640, 'steps': 38669, 'loss/train': 1.6136150360107422} 11/07/2021 02:40:09 - INFO - __main__ - Step 38671: {'lr': 0.00042801065161128814, 'samples': 7424832, 'steps': 38670, 'loss/train': 1.3479188680648804} 11/07/2021 02:40:10 - INFO - __main__ - Step 38672: {'lr': 0.000428006925513559, 'samples': 7425024, 'steps': 38671, 'loss/train': 1.8654686212539673} 11/07/2021 02:40:11 - INFO - __main__ - Step 38673: {'lr': 0.0004280031993356227, 'samples': 7425216, 'steps': 38672, 'loss/train': 1.4852941036224365} 11/07/2021 02:40:11 - INFO - __main__ - Step 38674: {'lr': 0.00042799947307748087, 'samples': 7425408, 'steps': 38673, 'loss/train': 1.26423978805542} 11/07/2021 02:40:12 - INFO - __main__ - Step 38675: {'lr': 0.0004279957467391353, 'samples': 7425600, 'steps': 38674, 'loss/train': 1.5272585153579712} 11/07/2021 02:40:12 - INFO - __main__ - Step 38676: {'lr': 0.0004279920203205875, 'samples': 7425792, 'steps': 38675, 'loss/train': 1.5036089420318604} 11/07/2021 02:40:12 - INFO - __main__ - Step 38677: {'lr': 0.0004279882938218393, 'samples': 7425984, 'steps': 38676, 'loss/train': 1.2998476028442383} 11/07/2021 02:40:13 - INFO - __main__ - Step 38678: {'lr': 0.00042798456724289227, 'samples': 7426176, 'steps': 38677, 'loss/train': 1.3719924688339233} 11/07/2021 02:40:14 - INFO - __main__ - Step 38679: {'lr': 0.0004279808405837482, 'samples': 7426368, 'steps': 38678, 'loss/train': 1.4804455041885376} 11/07/2021 02:40:14 - INFO - __main__ - Step 38680: {'lr': 0.00042797711384440863, 'samples': 7426560, 'steps': 38679, 'loss/train': 1.239966630935669} 11/07/2021 02:40:14 - INFO - __main__ - Step 38681: {'lr': 0.0004279733870248754, 'samples': 7426752, 'steps': 38680, 'loss/train': 1.6720571517944336} 11/07/2021 02:40:15 - INFO - __main__ - Step 38682: {'lr': 0.00042796966012515007, 'samples': 7426944, 'steps': 38681, 'loss/train': 1.2801110744476318} 11/07/2021 02:40:15 - INFO - __main__ - Step 38683: {'lr': 0.00042796593314523435, 'samples': 7427136, 'steps': 38682, 'loss/train': 1.1848257780075073} 11/07/2021 02:40:16 - INFO - __main__ - Step 38684: {'lr': 0.0004279622060851299, 'samples': 7427328, 'steps': 38683, 'loss/train': 1.4435261487960815} 11/07/2021 02:40:16 - INFO - __main__ - Step 38685: {'lr': 0.0004279584789448385, 'samples': 7427520, 'steps': 38684, 'loss/train': 1.6397520303726196} 11/07/2021 02:40:17 - INFO - __main__ - Step 38686: {'lr': 0.0004279547517243617, 'samples': 7427712, 'steps': 38685, 'loss/train': 2.541539430618286} 11/07/2021 02:40:17 - INFO - __main__ - Step 38687: {'lr': 0.00042795102442370127, 'samples': 7427904, 'steps': 38686, 'loss/train': 1.6271125078201294} 11/07/2021 02:40:17 - INFO - __main__ - Step 38688: {'lr': 0.0004279472970428588, 'samples': 7428096, 'steps': 38687, 'loss/train': 0.9081865549087524} 11/07/2021 02:40:19 - INFO - __main__ - Step 38689: {'lr': 0.0004279435695818361, 'samples': 7428288, 'steps': 38688, 'loss/train': 1.785889983177185} 11/07/2021 02:40:19 - INFO - __main__ - Step 38690: {'lr': 0.00042793984204063477, 'samples': 7428480, 'steps': 38689, 'loss/train': 1.6187974214553833} 11/07/2021 02:40:19 - INFO - __main__ - Step 38691: {'lr': 0.0004279361144192565, 'samples': 7428672, 'steps': 38690, 'loss/train': 1.9208132028579712} 11/07/2021 02:40:20 - INFO - __main__ - Step 38692: {'lr': 0.00042793238671770285, 'samples': 7428864, 'steps': 38691, 'loss/train': 1.7893801927566528} 11/07/2021 02:40:20 - INFO - __main__ - Step 38693: {'lr': 0.0004279286589359757, 'samples': 7429056, 'steps': 38692, 'loss/train': 1.4303791522979736} 11/07/2021 02:40:21 - INFO - __main__ - Step 38694: {'lr': 0.00042792493107407666, 'samples': 7429248, 'steps': 38693, 'loss/train': 1.673890471458435} 11/07/2021 02:40:21 - INFO - __main__ - Step 38695: {'lr': 0.0004279212031320073, 'samples': 7429440, 'steps': 38694, 'loss/train': 1.5759137868881226} 11/07/2021 02:40:22 - INFO - __main__ - Step 38696: {'lr': 0.00042791747510976955, 'samples': 7429632, 'steps': 38695, 'loss/train': 1.9982209205627441} 11/07/2021 02:40:22 - INFO - __main__ - Step 38697: {'lr': 0.0004279137470073648, 'samples': 7429824, 'steps': 38696, 'loss/train': 1.5088738203048706} 11/07/2021 02:40:22 - INFO - __main__ - Step 38698: {'lr': 0.00042791001882479485, 'samples': 7430016, 'steps': 38697, 'loss/train': 1.8029636144638062} 11/07/2021 02:40:23 - INFO - __main__ - Step 38699: {'lr': 0.0004279062905620614, 'samples': 7430208, 'steps': 38698, 'loss/train': 1.0119118690490723} 11/07/2021 02:40:24 - INFO - __main__ - Step 38700: {'lr': 0.0004279025622191662, 'samples': 7430400, 'steps': 38699, 'loss/train': 1.4782328605651855} 11/07/2021 02:40:24 - INFO - __main__ - Step 38701: {'lr': 0.00042789883379611084, 'samples': 7430592, 'steps': 38700, 'loss/train': 1.4868354797363281} 11/07/2021 02:40:24 - INFO - __main__ - Step 38702: {'lr': 0.000427895105292897, 'samples': 7430784, 'steps': 38701, 'loss/train': 1.3261916637420654} 11/07/2021 02:40:25 - INFO - __main__ - Step 38703: {'lr': 0.00042789137670952627, 'samples': 7430976, 'steps': 38702, 'loss/train': 1.4865633249282837} 11/07/2021 02:40:25 - INFO - __main__ - Step 38704: {'lr': 0.00042788764804600055, 'samples': 7431168, 'steps': 38703, 'loss/train': 1.27956223487854} 11/07/2021 02:40:26 - INFO - __main__ - Step 38705: {'lr': 0.0004278839193023214, 'samples': 7431360, 'steps': 38704, 'loss/train': 1.390254259109497} 11/07/2021 02:40:27 - INFO - __main__ - Step 38706: {'lr': 0.0004278801904784904, 'samples': 7431552, 'steps': 38705, 'loss/train': 1.5037397146224976} 11/07/2021 02:40:27 - INFO - __main__ - Step 38707: {'lr': 0.00042787646157450946, 'samples': 7431744, 'steps': 38706, 'loss/train': 1.4711500406265259} 11/07/2021 02:40:27 - INFO - __main__ - Step 38708: {'lr': 0.00042787273259038, 'samples': 7431936, 'steps': 38707, 'loss/train': 1.1879253387451172} 11/07/2021 02:40:28 - INFO - __main__ - Step 38709: {'lr': 0.00042786900352610393, 'samples': 7432128, 'steps': 38708, 'loss/train': 1.1754286289215088} 11/07/2021 02:40:29 - INFO - __main__ - Step 38710: {'lr': 0.0004278652743816828, 'samples': 7432320, 'steps': 38709, 'loss/train': 1.3880391120910645} 11/07/2021 02:40:29 - INFO - __main__ - Step 38711: {'lr': 0.00042786154515711826, 'samples': 7432512, 'steps': 38710, 'loss/train': 0.7310174703598022} 11/07/2021 02:40:29 - INFO - __main__ - Step 38712: {'lr': 0.0004278578158524121, 'samples': 7432704, 'steps': 38711, 'loss/train': 2.104560136795044} 11/07/2021 02:40:30 - INFO - __main__ - Step 38713: {'lr': 0.00042785408646756594, 'samples': 7432896, 'steps': 38712, 'loss/train': 1.6006711721420288} 11/07/2021 02:40:30 - INFO - __main__ - Step 38714: {'lr': 0.0004278503570025816, 'samples': 7433088, 'steps': 38713, 'loss/train': 1.1154862642288208} 11/07/2021 02:40:31 - INFO - __main__ - Step 38715: {'lr': 0.0004278466274574605, 'samples': 7433280, 'steps': 38714, 'loss/train': 0.30776089429855347} 11/07/2021 02:40:32 - INFO - __main__ - Step 38716: {'lr': 0.0004278428978322044, 'samples': 7433472, 'steps': 38715, 'loss/train': 1.6656759977340698} 11/07/2021 02:40:32 - INFO - __main__ - Step 38717: {'lr': 0.00042783916812681516, 'samples': 7433664, 'steps': 38716, 'loss/train': 2.2830231189727783} 11/07/2021 02:40:32 - INFO - __main__ - Step 38718: {'lr': 0.0004278354383412943, 'samples': 7433856, 'steps': 38717, 'loss/train': 1.4579813480377197} 11/07/2021 02:40:33 - INFO - __main__ - Step 38719: {'lr': 0.0004278317084756435, 'samples': 7434048, 'steps': 38718, 'loss/train': 1.6667070388793945} 11/07/2021 02:40:33 - INFO - __main__ - Step 38720: {'lr': 0.00042782797852986454, 'samples': 7434240, 'steps': 38719, 'loss/train': 1.509294033050537} 11/07/2021 02:40:34 - INFO - __main__ - Step 38721: {'lr': 0.00042782424850395894, 'samples': 7434432, 'steps': 38720, 'loss/train': 1.7709182500839233} 11/07/2021 02:40:34 - INFO - __main__ - Step 38722: {'lr': 0.00042782051839792857, 'samples': 7434624, 'steps': 38721, 'loss/train': 0.7755714654922485} 11/07/2021 02:40:35 - INFO - __main__ - Step 38723: {'lr': 0.000427816788211775, 'samples': 7434816, 'steps': 38722, 'loss/train': 1.6787524223327637} 11/07/2021 02:40:35 - INFO - __main__ - Step 38724: {'lr': 0.00042781305794549994, 'samples': 7435008, 'steps': 38723, 'loss/train': 1.6378754377365112} 11/07/2021 02:40:35 - INFO - __main__ - Step 38725: {'lr': 0.00042780932759910504, 'samples': 7435200, 'steps': 38724, 'loss/train': 1.700057864189148} 11/07/2021 02:40:36 - INFO - __main__ - Step 38726: {'lr': 0.00042780559717259194, 'samples': 7435392, 'steps': 38725, 'loss/train': 1.517086148262024} 11/07/2021 02:40:37 - INFO - __main__ - Step 38727: {'lr': 0.0004278018666659624, 'samples': 7435584, 'steps': 38726, 'loss/train': 1.5830868482589722} 11/07/2021 02:40:37 - INFO - __main__ - Step 38728: {'lr': 0.0004277981360792182, 'samples': 7435776, 'steps': 38727, 'loss/train': 1.8037893772125244} 11/07/2021 02:40:38 - INFO - __main__ - Step 38729: {'lr': 0.0004277944054123608, 'samples': 7435968, 'steps': 38728, 'loss/train': 1.5496574640274048} 11/07/2021 02:40:38 - INFO - __main__ - Step 38730: {'lr': 0.000427790674665392, 'samples': 7436160, 'steps': 38729, 'loss/train': 1.3973135948181152} 11/07/2021 02:40:39 - INFO - __main__ - Step 38731: {'lr': 0.00042778694383831354, 'samples': 7436352, 'steps': 38730, 'loss/train': 1.558529257774353} 11/07/2021 02:40:39 - INFO - __main__ - Step 38732: {'lr': 0.0004277832129311269, 'samples': 7436544, 'steps': 38731, 'loss/train': 1.8024917840957642} 11/07/2021 02:40:40 - INFO - __main__ - Step 38733: {'lr': 0.000427779481943834, 'samples': 7436736, 'steps': 38732, 'loss/train': 1.8078402280807495} 11/07/2021 02:40:40 - INFO - __main__ - Step 38734: {'lr': 0.0004277757508764363, 'samples': 7436928, 'steps': 38733, 'loss/train': 1.2723112106323242} 11/07/2021 02:40:40 - INFO - __main__ - Step 38735: {'lr': 0.00042777201972893564, 'samples': 7437120, 'steps': 38734, 'loss/train': 1.5691598653793335} 11/07/2021 02:40:41 - INFO - __main__ - Step 38736: {'lr': 0.00042776828850133364, 'samples': 7437312, 'steps': 38735, 'loss/train': 1.4892206192016602} 11/07/2021 02:40:42 - INFO - __main__ - Step 38737: {'lr': 0.0004277645571936321, 'samples': 7437504, 'steps': 38736, 'loss/train': 1.7327055931091309} 11/07/2021 02:40:42 - INFO - __main__ - Step 38738: {'lr': 0.0004277608258058324, 'samples': 7437696, 'steps': 38737, 'loss/train': 1.7947583198547363} 11/07/2021 02:40:42 - INFO - __main__ - Step 38739: {'lr': 0.00042775709433793657, 'samples': 7437888, 'steps': 38738, 'loss/train': 1.4090418815612793} 11/07/2021 02:40:43 - INFO - __main__ - Step 38740: {'lr': 0.0004277533627899461, 'samples': 7438080, 'steps': 38739, 'loss/train': 1.3038268089294434} 11/07/2021 02:40:44 - INFO - __main__ - Step 38741: {'lr': 0.00042774963116186274, 'samples': 7438272, 'steps': 38740, 'loss/train': 1.4376336336135864} 11/07/2021 02:40:44 - INFO - __main__ - Step 38742: {'lr': 0.000427745899453688, 'samples': 7438464, 'steps': 38741, 'loss/train': 1.7706702947616577} 11/07/2021 02:40:44 - INFO - __main__ - Step 38743: {'lr': 0.00042774216766542386, 'samples': 7438656, 'steps': 38742, 'loss/train': 1.64524245262146} 11/07/2021 02:40:45 - INFO - __main__ - Step 38744: {'lr': 0.0004277384357970717, 'samples': 7438848, 'steps': 38743, 'loss/train': 1.289526104927063} 11/07/2021 02:40:45 - INFO - __main__ - Step 38745: {'lr': 0.00042773470384863344, 'samples': 7439040, 'steps': 38744, 'loss/train': 1.7782156467437744} 11/07/2021 02:40:45 - INFO - __main__ - Step 38746: {'lr': 0.0004277309718201107, 'samples': 7439232, 'steps': 38745, 'loss/train': 1.1813396215438843} 11/07/2021 02:40:46 - INFO - __main__ - Step 38747: {'lr': 0.000427727239711505, 'samples': 7439424, 'steps': 38746, 'loss/train': 1.3218621015548706} 11/07/2021 02:40:47 - INFO - __main__ - Step 38748: {'lr': 0.00042772350752281823, 'samples': 7439616, 'steps': 38747, 'loss/train': 1.8541260957717896} 11/07/2021 02:40:47 - INFO - __main__ - Step 38749: {'lr': 0.000427719775254052, 'samples': 7439808, 'steps': 38748, 'loss/train': 1.7880030870437622} 11/07/2021 02:40:47 - INFO - __main__ - Step 38750: {'lr': 0.00042771604290520795, 'samples': 7440000, 'steps': 38749, 'loss/train': 1.229604959487915} 11/07/2021 02:40:48 - INFO - __main__ - Step 38751: {'lr': 0.00042771231047628776, 'samples': 7440192, 'steps': 38750, 'loss/train': 1.1183679103851318} 11/07/2021 02:40:49 - INFO - __main__ - Step 38752: {'lr': 0.0004277085779672932, 'samples': 7440384, 'steps': 38751, 'loss/train': 1.9162299633026123} 11/07/2021 02:40:49 - INFO - __main__ - Step 38753: {'lr': 0.0004277048453782259, 'samples': 7440576, 'steps': 38752, 'loss/train': 1.6994130611419678} 11/07/2021 02:40:50 - INFO - __main__ - Step 38754: {'lr': 0.0004277011127090875, 'samples': 7440768, 'steps': 38753, 'loss/train': 2.003278970718384} 11/07/2021 02:40:50 - INFO - __main__ - Step 38755: {'lr': 0.0004276973799598798, 'samples': 7440960, 'steps': 38754, 'loss/train': 1.5014312267303467} 11/07/2021 02:40:50 - INFO - __main__ - Step 38756: {'lr': 0.0004276936471306043, 'samples': 7441152, 'steps': 38755, 'loss/train': 1.0279024839401245} 11/07/2021 02:40:51 - INFO - __main__ - Step 38757: {'lr': 0.00042768991422126285, 'samples': 7441344, 'steps': 38756, 'loss/train': 0.6809089183807373} 11/07/2021 02:40:52 - INFO - __main__ - Step 38758: {'lr': 0.00042768618123185703, 'samples': 7441536, 'steps': 38757, 'loss/train': 1.3736629486083984} 11/07/2021 02:40:52 - INFO - __main__ - Step 38759: {'lr': 0.00042768244816238863, 'samples': 7441728, 'steps': 38758, 'loss/train': 1.0637315511703491} 11/07/2021 02:40:52 - INFO - __main__ - Step 38760: {'lr': 0.00042767871501285916, 'samples': 7441920, 'steps': 38759, 'loss/train': 1.5829507112503052} 11/07/2021 02:40:53 - INFO - __main__ - Step 38761: {'lr': 0.00042767498178327047, 'samples': 7442112, 'steps': 38760, 'loss/train': 1.4992541074752808} 11/07/2021 02:40:54 - INFO - __main__ - Step 38762: {'lr': 0.00042767124847362413, 'samples': 7442304, 'steps': 38761, 'loss/train': 1.995751976966858} 11/07/2021 02:40:54 - INFO - __main__ - Step 38763: {'lr': 0.00042766751508392187, 'samples': 7442496, 'steps': 38762, 'loss/train': 1.8400249481201172} 11/07/2021 02:40:54 - INFO - __main__ - Step 38764: {'lr': 0.00042766378161416543, 'samples': 7442688, 'steps': 38763, 'loss/train': 1.5340784788131714} 11/07/2021 02:40:55 - INFO - __main__ - Step 38765: {'lr': 0.00042766004806435643, 'samples': 7442880, 'steps': 38764, 'loss/train': 1.5389777421951294} 11/07/2021 02:40:55 - INFO - __main__ - Step 38766: {'lr': 0.0004276563144344965, 'samples': 7443072, 'steps': 38765, 'loss/train': 1.4644994735717773} 11/07/2021 02:40:56 - INFO - __main__ - Step 38767: {'lr': 0.00042765258072458733, 'samples': 7443264, 'steps': 38766, 'loss/train': 1.8041868209838867} 11/07/2021 02:40:57 - INFO - __main__ - Step 38768: {'lr': 0.00042764884693463075, 'samples': 7443456, 'steps': 38767, 'loss/train': 1.6728705167770386} 11/07/2021 02:40:57 - INFO - __main__ - Step 38769: {'lr': 0.0004276451130646283, 'samples': 7443648, 'steps': 38768, 'loss/train': 1.5693845748901367} 11/07/2021 02:40:57 - INFO - __main__ - Step 38770: {'lr': 0.0004276413791145817, 'samples': 7443840, 'steps': 38769, 'loss/train': 1.5388051271438599} 11/07/2021 02:40:58 - INFO - __main__ - Step 38771: {'lr': 0.00042763764508449263, 'samples': 7444032, 'steps': 38770, 'loss/train': 1.5940831899642944} 11/07/2021 02:40:59 - INFO - __main__ - Step 38772: {'lr': 0.0004276339109743628, 'samples': 7444224, 'steps': 38771, 'loss/train': 1.9035909175872803} 11/07/2021 02:40:59 - INFO - __main__ - Step 38773: {'lr': 0.0004276301767841939, 'samples': 7444416, 'steps': 38772, 'loss/train': 1.301182746887207} 11/07/2021 02:40:59 - INFO - __main__ - Step 38774: {'lr': 0.00042762644251398755, 'samples': 7444608, 'steps': 38773, 'loss/train': 1.6869004964828491} 11/07/2021 02:41:00 - INFO - __main__ - Step 38775: {'lr': 0.0004276227081637454, 'samples': 7444800, 'steps': 38774, 'loss/train': 1.445910930633545} 11/07/2021 02:41:00 - INFO - __main__ - Step 38776: {'lr': 0.00042761897373346923, 'samples': 7444992, 'steps': 38775, 'loss/train': 1.0125514268875122} 11/07/2021 02:41:01 - INFO - __main__ - Step 38777: {'lr': 0.0004276152392231608, 'samples': 7445184, 'steps': 38776, 'loss/train': 1.3907091617584229} 11/07/2021 02:41:02 - INFO - __main__ - Step 38778: {'lr': 0.00042761150463282164, 'samples': 7445376, 'steps': 38777, 'loss/train': 1.4133721590042114} 11/07/2021 02:41:02 - INFO - __main__ - Step 38779: {'lr': 0.0004276077699624534, 'samples': 7445568, 'steps': 38778, 'loss/train': 1.6160264015197754} 11/07/2021 02:41:02 - INFO - __main__ - Step 38780: {'lr': 0.0004276040352120578, 'samples': 7445760, 'steps': 38779, 'loss/train': 1.2119277715682983} 11/07/2021 02:41:03 - INFO - __main__ - Step 38781: {'lr': 0.0004276003003816367, 'samples': 7445952, 'steps': 38780, 'loss/train': 0.9591224193572998} 11/07/2021 02:41:04 - INFO - __main__ - Step 38782: {'lr': 0.0004275965654711916, 'samples': 7446144, 'steps': 38781, 'loss/train': 1.4609408378601074} 11/07/2021 02:41:04 - INFO - __main__ - Step 38783: {'lr': 0.0004275928304807242, 'samples': 7446336, 'steps': 38782, 'loss/train': 1.4006892442703247} 11/07/2021 02:41:04 - INFO - __main__ - Step 38784: {'lr': 0.0004275890954102362, 'samples': 7446528, 'steps': 38783, 'loss/train': 1.4045475721359253} 11/07/2021 02:41:05 - INFO - __main__ - Step 38785: {'lr': 0.0004275853602597294, 'samples': 7446720, 'steps': 38784, 'loss/train': 0.8429235219955444} 11/07/2021 02:41:05 - INFO - __main__ - Step 38786: {'lr': 0.00042758162502920527, 'samples': 7446912, 'steps': 38785, 'loss/train': 1.095483422279358} 11/07/2021 02:41:05 - INFO - __main__ - Step 38787: {'lr': 0.0004275778897186656, 'samples': 7447104, 'steps': 38786, 'loss/train': 1.2157950401306152} 11/07/2021 02:41:06 - INFO - __main__ - Step 38788: {'lr': 0.0004275741543281121, 'samples': 7447296, 'steps': 38787, 'loss/train': 1.3693673610687256} 11/07/2021 02:41:07 - INFO - __main__ - Step 38789: {'lr': 0.0004275704188575464, 'samples': 7447488, 'steps': 38788, 'loss/train': 1.5449576377868652} 11/07/2021 02:41:07 - INFO - __main__ - Step 38790: {'lr': 0.00042756668330697024, 'samples': 7447680, 'steps': 38789, 'loss/train': 1.5498528480529785} 11/07/2021 02:41:07 - INFO - __main__ - Step 38791: {'lr': 0.00042756294767638527, 'samples': 7447872, 'steps': 38790, 'loss/train': 0.6934214234352112} 11/07/2021 02:41:08 - INFO - __main__ - Step 38792: {'lr': 0.00042755921196579316, 'samples': 7448064, 'steps': 38791, 'loss/train': 1.606022596359253} 11/07/2021 02:41:09 - INFO - __main__ - Step 38793: {'lr': 0.0004275554761751956, 'samples': 7448256, 'steps': 38792, 'loss/train': 1.848496437072754} 11/07/2021 02:41:09 - INFO - __main__ - Step 38794: {'lr': 0.0004275517403045943, 'samples': 7448448, 'steps': 38793, 'loss/train': 0.4533573389053345} 11/07/2021 02:41:10 - INFO - __main__ - Step 38795: {'lr': 0.000427548004353991, 'samples': 7448640, 'steps': 38794, 'loss/train': 1.8403584957122803} 11/07/2021 02:41:10 - INFO - __main__ - Step 38796: {'lr': 0.00042754426832338724, 'samples': 7448832, 'steps': 38795, 'loss/train': 1.3824883699417114} 11/07/2021 02:41:10 - INFO - __main__ - Step 38797: {'lr': 0.00042754053221278476, 'samples': 7449024, 'steps': 38796, 'loss/train': 1.9527474641799927} 11/07/2021 02:41:11 - INFO - __main__ - Step 38798: {'lr': 0.0004275367960221853, 'samples': 7449216, 'steps': 38797, 'loss/train': 1.161024808883667} 11/07/2021 02:41:12 - INFO - __main__ - Step 38799: {'lr': 0.0004275330597515904, 'samples': 7449408, 'steps': 38798, 'loss/train': 1.4690824747085571} 11/07/2021 02:41:12 - INFO - __main__ - Step 38800: {'lr': 0.00042752932340100195, 'samples': 7449600, 'steps': 38799, 'loss/train': 1.634828805923462} 11/07/2021 02:41:12 - INFO - __main__ - Step 38801: {'lr': 0.00042752558697042143, 'samples': 7449792, 'steps': 38800, 'loss/train': 1.6765644550323486} 11/07/2021 02:41:13 - INFO - __main__ - Step 38802: {'lr': 0.0004275218504598507, 'samples': 7449984, 'steps': 38801, 'loss/train': 1.7828283309936523} 11/07/2021 02:41:14 - INFO - __main__ - Step 38803: {'lr': 0.0004275181138692914, 'samples': 7450176, 'steps': 38802, 'loss/train': 1.677453875541687} 11/07/2021 02:41:14 - INFO - __main__ - Step 38804: {'lr': 0.0004275143771987451, 'samples': 7450368, 'steps': 38803, 'loss/train': 1.3535674810409546} 11/07/2021 02:41:14 - INFO - __main__ - Step 38805: {'lr': 0.00042751064044821354, 'samples': 7450560, 'steps': 38804, 'loss/train': 1.4613144397735596} 11/07/2021 02:41:15 - INFO - __main__ - Step 38806: {'lr': 0.0004275069036176985, 'samples': 7450752, 'steps': 38805, 'loss/train': 1.706905722618103} 11/07/2021 02:41:15 - INFO - __main__ - Step 38807: {'lr': 0.0004275031667072015, 'samples': 7450944, 'steps': 38806, 'loss/train': 1.6436902284622192} 11/07/2021 02:41:16 - INFO - __main__ - Step 38808: {'lr': 0.0004274994297167244, 'samples': 7451136, 'steps': 38807, 'loss/train': 1.309851884841919} 11/07/2021 02:41:17 - INFO - __main__ - Step 38809: {'lr': 0.00042749569264626875, 'samples': 7451328, 'steps': 38808, 'loss/train': 1.4383822679519653} 11/07/2021 02:41:17 - INFO - __main__ - Step 38810: {'lr': 0.0004274919554958363, 'samples': 7451520, 'steps': 38809, 'loss/train': 1.6340030431747437} 11/07/2021 02:41:17 - INFO - __main__ - Step 38811: {'lr': 0.00042748821826542875, 'samples': 7451712, 'steps': 38810, 'loss/train': 1.627172827720642} 11/07/2021 02:41:18 - INFO - __main__ - Step 38812: {'lr': 0.00042748448095504765, 'samples': 7451904, 'steps': 38811, 'loss/train': 1.35011625289917} 11/07/2021 02:41:18 - INFO - __main__ - Step 38813: {'lr': 0.0004274807435646948, 'samples': 7452096, 'steps': 38812, 'loss/train': 1.7396658658981323} 11/07/2021 02:41:19 - INFO - __main__ - Step 38814: {'lr': 0.0004274770060943719, 'samples': 7452288, 'steps': 38813, 'loss/train': 5.9485344886779785} 11/07/2021 02:41:19 - INFO - __main__ - Step 38815: {'lr': 0.00042747326854408063, 'samples': 7452480, 'steps': 38814, 'loss/train': 1.7223875522613525} 11/07/2021 02:41:20 - INFO - __main__ - Step 38816: {'lr': 0.00042746953091382254, 'samples': 7452672, 'steps': 38815, 'loss/train': 1.614368200302124} 11/07/2021 02:41:20 - INFO - __main__ - Step 38817: {'lr': 0.00042746579320359956, 'samples': 7452864, 'steps': 38816, 'loss/train': 1.5527185201644897} 11/07/2021 02:41:21 - INFO - __main__ - Step 38818: {'lr': 0.00042746205541341315, 'samples': 7453056, 'steps': 38817, 'loss/train': 1.6299482583999634} 11/07/2021 02:41:22 - INFO - __main__ - Step 38819: {'lr': 0.0004274583175432651, 'samples': 7453248, 'steps': 38818, 'loss/train': 1.170392632484436} 11/07/2021 02:41:22 - INFO - __main__ - Step 38820: {'lr': 0.000427454579593157, 'samples': 7453440, 'steps': 38819, 'loss/train': 1.6622440814971924} 11/07/2021 02:41:23 - INFO - __main__ - Step 38821: {'lr': 0.00042745084156309065, 'samples': 7453632, 'steps': 38820, 'loss/train': 1.4594944715499878} 11/07/2021 02:41:23 - INFO - __main__ - Step 38822: {'lr': 0.00042744710345306774, 'samples': 7453824, 'steps': 38821, 'loss/train': 1.780123233795166} 11/07/2021 02:41:23 - INFO - __main__ - Step 38823: {'lr': 0.00042744336526308986, 'samples': 7454016, 'steps': 38822, 'loss/train': 1.5132795572280884} 11/07/2021 02:41:24 - INFO - __main__ - Step 38824: {'lr': 0.0004274396269931587, 'samples': 7454208, 'steps': 38823, 'loss/train': 1.9288641214370728} 11/07/2021 02:41:25 - INFO - __main__ - Step 38825: {'lr': 0.0004274358886432761, 'samples': 7454400, 'steps': 38824, 'loss/train': 1.3878891468048096} 11/07/2021 02:41:25 - INFO - __main__ - Step 38826: {'lr': 0.0004274321502134435, 'samples': 7454592, 'steps': 38825, 'loss/train': 1.3130568265914917} 11/07/2021 02:41:25 - INFO - __main__ - Step 38827: {'lr': 0.00042742841170366274, 'samples': 7454784, 'steps': 38826, 'loss/train': 1.8139756917953491} 11/07/2021 02:41:26 - INFO - __main__ - Step 38828: {'lr': 0.0004274246731139355, 'samples': 7454976, 'steps': 38827, 'loss/train': 1.597933292388916} 11/07/2021 02:41:26 - INFO - __main__ - Step 38829: {'lr': 0.0004274209344442634, 'samples': 7455168, 'steps': 38828, 'loss/train': 1.1701526641845703} 11/07/2021 02:41:27 - INFO - __main__ - Step 38830: {'lr': 0.00042741719569464834, 'samples': 7455360, 'steps': 38829, 'loss/train': 1.382586121559143} 11/07/2021 02:41:27 - INFO - __main__ - Step 38831: {'lr': 0.0004274134568650916, 'samples': 7455552, 'steps': 38830, 'loss/train': 1.5133731365203857} 11/07/2021 02:41:28 - INFO - __main__ - Step 38832: {'lr': 0.00042740971795559527, 'samples': 7455744, 'steps': 38831, 'loss/train': 1.5364069938659668} 11/07/2021 02:41:28 - INFO - __main__ - Step 38833: {'lr': 0.00042740597896616075, 'samples': 7455936, 'steps': 38832, 'loss/train': 1.4677929878234863} 11/07/2021 02:41:29 - INFO - __main__ - Step 38834: {'lr': 0.00042740223989678984, 'samples': 7456128, 'steps': 38833, 'loss/train': 1.2758821249008179} 11/07/2021 02:41:29 - INFO - __main__ - Step 38835: {'lr': 0.0004273985007474842, 'samples': 7456320, 'steps': 38834, 'loss/train': 1.6179077625274658} 11/07/2021 02:41:30 - INFO - __main__ - Step 38836: {'lr': 0.00042739476151824565, 'samples': 7456512, 'steps': 38835, 'loss/train': 0.5692052245140076} 11/07/2021 02:41:30 - INFO - __main__ - Step 38837: {'lr': 0.00042739102220907567, 'samples': 7456704, 'steps': 38836, 'loss/train': 1.2258596420288086} 11/07/2021 02:41:31 - INFO - __main__ - Step 38838: {'lr': 0.000427387282819976, 'samples': 7456896, 'steps': 38837, 'loss/train': 1.7110847234725952} 11/07/2021 02:41:31 - INFO - __main__ - Step 38839: {'lr': 0.0004273835433509484, 'samples': 7457088, 'steps': 38838, 'loss/train': 1.5911930799484253} 11/07/2021 02:41:31 - INFO - __main__ - Step 38840: {'lr': 0.0004273798038019945, 'samples': 7457280, 'steps': 38839, 'loss/train': 1.5180996656417847} 11/07/2021 02:41:32 - INFO - __main__ - Step 38841: {'lr': 0.000427376064173116, 'samples': 7457472, 'steps': 38840, 'loss/train': 1.6930204629898071} 11/07/2021 02:41:33 - INFO - __main__ - Step 38842: {'lr': 0.0004273723244643146, 'samples': 7457664, 'steps': 38841, 'loss/train': 1.7402244806289673} 11/07/2021 02:41:33 - INFO - __main__ - Step 38843: {'lr': 0.000427368584675592, 'samples': 7457856, 'steps': 38842, 'loss/train': 1.8798352479934692} 11/07/2021 02:41:33 - INFO - __main__ - Step 38844: {'lr': 0.0004273648448069498, 'samples': 7458048, 'steps': 38843, 'loss/train': 1.5837422609329224} 11/07/2021 02:41:34 - INFO - __main__ - Step 38845: {'lr': 0.00042736110485838973, 'samples': 7458240, 'steps': 38844, 'loss/train': 1.194349765777588} 11/07/2021 02:41:35 - INFO - __main__ - Step 38846: {'lr': 0.0004273573648299135, 'samples': 7458432, 'steps': 38845, 'loss/train': 1.6953762769699097} 11/07/2021 02:41:35 - INFO - __main__ - Step 38847: {'lr': 0.0004273536247215227, 'samples': 7458624, 'steps': 38846, 'loss/train': 2.0758681297302246} 11/07/2021 02:41:36 - INFO - __main__ - Step 38848: {'lr': 0.00042734988453321923, 'samples': 7458816, 'steps': 38847, 'loss/train': 1.132045030593872} 11/07/2021 02:41:36 - INFO - __main__ - Step 38849: {'lr': 0.0004273461442650046, 'samples': 7459008, 'steps': 38848, 'loss/train': 1.5473262071609497} 11/07/2021 02:41:36 - INFO - __main__ - Step 38850: {'lr': 0.0004273424039168805, 'samples': 7459200, 'steps': 38849, 'loss/train': 1.733851671218872} 11/07/2021 02:41:37 - INFO - __main__ - Step 38851: {'lr': 0.00042733866348884864, 'samples': 7459392, 'steps': 38850, 'loss/train': 1.4504001140594482} 11/07/2021 02:41:38 - INFO - __main__ - Step 38852: {'lr': 0.0004273349229809108, 'samples': 7459584, 'steps': 38851, 'loss/train': 1.7992230653762817} 11/07/2021 02:41:38 - INFO - __main__ - Step 38853: {'lr': 0.00042733118239306845, 'samples': 7459776, 'steps': 38852, 'loss/train': 1.5601048469543457} 11/07/2021 02:41:38 - INFO - __main__ - Step 38854: {'lr': 0.0004273274417253235, 'samples': 7459968, 'steps': 38853, 'loss/train': 1.4414904117584229} 11/07/2021 02:41:39 - INFO - __main__ - Step 38855: {'lr': 0.00042732370097767756, 'samples': 7460160, 'steps': 38854, 'loss/train': 1.7437254190444946} 11/07/2021 02:41:39 - INFO - __main__ - Step 38856: {'lr': 0.0004273199601501322, 'samples': 7460352, 'steps': 38855, 'loss/train': 1.7385590076446533} 11/07/2021 02:41:40 - INFO - __main__ - Step 38857: {'lr': 0.0004273162192426893, 'samples': 7460544, 'steps': 38856, 'loss/train': 1.4456984996795654} 11/07/2021 02:41:41 - INFO - __main__ - Step 38858: {'lr': 0.00042731247825535037, 'samples': 7460736, 'steps': 38857, 'loss/train': 1.4588637351989746} 11/07/2021 02:41:41 - INFO - __main__ - Step 38859: {'lr': 0.00042730873718811724, 'samples': 7460928, 'steps': 38858, 'loss/train': 1.9591647386550903} 11/07/2021 02:41:41 - INFO - __main__ - Step 38860: {'lr': 0.0004273049960409915, 'samples': 7461120, 'steps': 38859, 'loss/train': 1.8021929264068604} 11/07/2021 02:41:42 - INFO - __main__ - Step 38861: {'lr': 0.00042730125481397487, 'samples': 7461312, 'steps': 38860, 'loss/train': 1.1760722398757935} 11/07/2021 02:41:43 - INFO - __main__ - Step 38862: {'lr': 0.00042729751350706905, 'samples': 7461504, 'steps': 38861, 'loss/train': 1.177544116973877} 11/07/2021 02:41:43 - INFO - __main__ - Step 38863: {'lr': 0.00042729377212027557, 'samples': 7461696, 'steps': 38862, 'loss/train': 1.1901837587356567} 11/07/2021 02:41:44 - INFO - __main__ - Step 38864: {'lr': 0.0004272900306535964, 'samples': 7461888, 'steps': 38863, 'loss/train': 1.6480708122253418} 11/07/2021 02:41:44 - INFO - __main__ - Step 38865: {'lr': 0.00042728628910703305, 'samples': 7462080, 'steps': 38864, 'loss/train': 2.429159164428711} 11/07/2021 02:41:44 - INFO - __main__ - Step 38866: {'lr': 0.0004272825474805872, 'samples': 7462272, 'steps': 38865, 'loss/train': 1.8162816762924194} 11/07/2021 02:41:45 - INFO - __main__ - Step 38867: {'lr': 0.0004272788057742606, 'samples': 7462464, 'steps': 38866, 'loss/train': 1.9516806602478027} 11/07/2021 02:41:46 - INFO - __main__ - Step 38868: {'lr': 0.0004272750639880549, 'samples': 7462656, 'steps': 38867, 'loss/train': 0.7387398481369019} 11/07/2021 02:41:46 - INFO - __main__ - Step 38869: {'lr': 0.0004272713221219718, 'samples': 7462848, 'steps': 38868, 'loss/train': 1.3881351947784424} 11/07/2021 02:41:46 - INFO - __main__ - Step 38870: {'lr': 0.00042726758017601297, 'samples': 7463040, 'steps': 38869, 'loss/train': 1.5700926780700684} 11/07/2021 02:41:47 - INFO - __main__ - Step 38871: {'lr': 0.00042726383815018006, 'samples': 7463232, 'steps': 38870, 'loss/train': 1.3037258386611938} 11/07/2021 02:41:48 - INFO - __main__ - Step 38872: {'lr': 0.00042726009604447484, 'samples': 7463424, 'steps': 38871, 'loss/train': 1.5725196599960327} 11/07/2021 02:41:49 - INFO - __main__ - Step 38873: {'lr': 0.00042725635385889893, 'samples': 7463616, 'steps': 38872, 'loss/train': 1.7052489519119263} 11/07/2021 02:41:49 - INFO - __main__ - Step 38874: {'lr': 0.0004272526115934541, 'samples': 7463808, 'steps': 38873, 'loss/train': 1.9589136838912964} 11/07/2021 02:41:49 - INFO - __main__ - Step 38875: {'lr': 0.0004272488692481419, 'samples': 7464000, 'steps': 38874, 'loss/train': 1.8079824447631836} 11/07/2021 02:41:50 - INFO - __main__ - Step 38876: {'lr': 0.00042724512682296416, 'samples': 7464192, 'steps': 38875, 'loss/train': 1.7487462759017944} 11/07/2021 02:41:50 - INFO - __main__ - Step 38877: {'lr': 0.00042724138431792245, 'samples': 7464384, 'steps': 38876, 'loss/train': 0.17262661457061768} 11/07/2021 02:41:50 - INFO - __main__ - Step 38878: {'lr': 0.0004272376417330186, 'samples': 7464576, 'steps': 38877, 'loss/train': 1.7479337453842163} 11/07/2021 02:41:51 - INFO - __main__ - Step 38879: {'lr': 0.00042723389906825415, 'samples': 7464768, 'steps': 38878, 'loss/train': 1.7263894081115723} 11/07/2021 02:41:52 - INFO - __main__ - Step 38880: {'lr': 0.0004272301563236308, 'samples': 7464960, 'steps': 38879, 'loss/train': 1.2034226655960083} 11/07/2021 02:41:52 - INFO - __main__ - Step 38881: {'lr': 0.0004272264134991503, 'samples': 7465152, 'steps': 38880, 'loss/train': 1.4417294263839722} 11/07/2021 02:41:52 - INFO - __main__ - Step 38882: {'lr': 0.0004272226705948143, 'samples': 7465344, 'steps': 38881, 'loss/train': 0.8323983550071716} 11/07/2021 02:41:53 - INFO - __main__ - Step 38883: {'lr': 0.00042721892761062453, 'samples': 7465536, 'steps': 38882, 'loss/train': 1.609955906867981} 11/07/2021 02:41:54 - INFO - __main__ - Step 38884: {'lr': 0.00042721518454658265, 'samples': 7465728, 'steps': 38883, 'loss/train': 1.783393383026123} 11/07/2021 02:41:54 - INFO - __main__ - Step 38885: {'lr': 0.0004272114414026903, 'samples': 7465920, 'steps': 38884, 'loss/train': 1.819656252861023} 11/07/2021 02:41:54 - INFO - __main__ - Step 38886: {'lr': 0.00042720769817894926, 'samples': 7466112, 'steps': 38885, 'loss/train': 1.1744916439056396} 11/07/2021 02:41:55 - INFO - __main__ - Step 38887: {'lr': 0.00042720395487536115, 'samples': 7466304, 'steps': 38886, 'loss/train': 2.7242424488067627} 11/07/2021 02:41:55 - INFO - __main__ - Step 38888: {'lr': 0.0004272002114919277, 'samples': 7466496, 'steps': 38887, 'loss/train': 0.7330546379089355} 11/07/2021 02:41:56 - INFO - __main__ - Step 38889: {'lr': 0.0004271964680286505, 'samples': 7466688, 'steps': 38888, 'loss/train': 0.8956628441810608} 11/07/2021 02:41:57 - INFO - __main__ - Step 38890: {'lr': 0.00042719272448553137, 'samples': 7466880, 'steps': 38889, 'loss/train': 1.526174783706665} 11/07/2021 02:41:57 - INFO - __main__ - Step 38891: {'lr': 0.00042718898086257183, 'samples': 7467072, 'steps': 38890, 'loss/train': 1.5497913360595703} 11/07/2021 02:41:57 - INFO - __main__ - Step 38892: {'lr': 0.0004271852371597738, 'samples': 7467264, 'steps': 38891, 'loss/train': 1.5525308847427368} 11/07/2021 02:41:58 - INFO - __main__ - Step 38893: {'lr': 0.00042718149337713873, 'samples': 7467456, 'steps': 38892, 'loss/train': 1.6520709991455078} 11/07/2021 02:41:59 - INFO - __main__ - Step 38894: {'lr': 0.0004271777495146685, 'samples': 7467648, 'steps': 38893, 'loss/train': 1.3811542987823486} 11/07/2021 02:41:59 - INFO - __main__ - Step 38895: {'lr': 0.00042717400557236467, 'samples': 7467840, 'steps': 38894, 'loss/train': 1.4010035991668701} 11/07/2021 02:41:59 - INFO - __main__ - Step 38896: {'lr': 0.000427170261550229, 'samples': 7468032, 'steps': 38895, 'loss/train': 1.5982306003570557} 11/07/2021 02:42:00 - INFO - __main__ - Step 38897: {'lr': 0.0004271665174482631, 'samples': 7468224, 'steps': 38896, 'loss/train': 1.3285613059997559} 11/07/2021 02:42:00 - INFO - __main__ - Step 38898: {'lr': 0.0004271627732664687, 'samples': 7468416, 'steps': 38897, 'loss/train': 0.915793776512146} 11/07/2021 02:42:01 - INFO - __main__ - Step 38899: {'lr': 0.0004271590290048475, 'samples': 7468608, 'steps': 38898, 'loss/train': 1.3674812316894531} 11/07/2021 02:42:01 - INFO - __main__ - Step 38900: {'lr': 0.00042715528466340117, 'samples': 7468800, 'steps': 38899, 'loss/train': 1.742058515548706} 11/07/2021 02:42:02 - INFO - __main__ - Step 38901: {'lr': 0.00042715154024213143, 'samples': 7468992, 'steps': 38900, 'loss/train': 1.2003147602081299} 11/07/2021 02:42:02 - INFO - __main__ - Step 38902: {'lr': 0.0004271477957410399, 'samples': 7469184, 'steps': 38901, 'loss/train': 1.4027563333511353} 11/07/2021 02:42:02 - INFO - __main__ - Step 38903: {'lr': 0.00042714405116012834, 'samples': 7469376, 'steps': 38902, 'loss/train': 1.6637219190597534} 11/07/2021 02:42:04 - INFO - __main__ - Step 38904: {'lr': 0.0004271403064993984, 'samples': 7469568, 'steps': 38903, 'loss/train': 1.1860729455947876} 11/07/2021 02:42:04 - INFO - __main__ - Step 38905: {'lr': 0.00042713656175885173, 'samples': 7469760, 'steps': 38904, 'loss/train': 1.5975831747055054} 11/07/2021 02:42:05 - INFO - __main__ - Step 38906: {'lr': 0.00042713281693849015, 'samples': 7469952, 'steps': 38905, 'loss/train': 0.8513796925544739} 11/07/2021 02:42:05 - INFO - __main__ - Step 38907: {'lr': 0.0004271290720383152, 'samples': 7470144, 'steps': 38906, 'loss/train': 1.9151580333709717} 11/07/2021 02:42:05 - INFO - __main__ - Step 38908: {'lr': 0.00042712532705832865, 'samples': 7470336, 'steps': 38907, 'loss/train': 1.497770071029663} 11/07/2021 02:42:06 - INFO - __main__ - Step 38909: {'lr': 0.0004271215819985321, 'samples': 7470528, 'steps': 38908, 'loss/train': 0.600193202495575} 11/07/2021 02:42:07 - INFO - __main__ - Step 38910: {'lr': 0.0004271178368589273, 'samples': 7470720, 'steps': 38909, 'loss/train': 1.7031151056289673} 11/07/2021 02:42:07 - INFO - __main__ - Step 38911: {'lr': 0.000427114091639516, 'samples': 7470912, 'steps': 38910, 'loss/train': 0.9450470209121704} 11/07/2021 02:42:08 - INFO - __main__ - Step 38912: {'lr': 0.0004271103463402998, 'samples': 7471104, 'steps': 38911, 'loss/train': 1.601342797279358} 11/07/2021 02:42:08 - INFO - __main__ - Step 38913: {'lr': 0.0004271066009612804, 'samples': 7471296, 'steps': 38912, 'loss/train': 1.6177656650543213} 11/07/2021 02:42:08 - INFO - __main__ - Step 38914: {'lr': 0.0004271028555024594, 'samples': 7471488, 'steps': 38913, 'loss/train': 1.7592273950576782} 11/07/2021 02:42:10 - INFO - __main__ - Step 38915: {'lr': 0.0004270991099638387, 'samples': 7471680, 'steps': 38914, 'loss/train': 0.23037822544574738} 11/07/2021 02:42:10 - INFO - __main__ - Step 38916: {'lr': 0.0004270953643454199, 'samples': 7471872, 'steps': 38915, 'loss/train': 1.353023886680603} 11/07/2021 02:42:10 - INFO - __main__ - Step 38917: {'lr': 0.0004270916186472046, 'samples': 7472064, 'steps': 38916, 'loss/train': 0.954717218875885} 11/07/2021 02:42:11 - INFO - __main__ - Step 38918: {'lr': 0.0004270878728691946, 'samples': 7472256, 'steps': 38917, 'loss/train': 1.196390151977539} 11/07/2021 02:42:11 - INFO - __main__ - Step 38919: {'lr': 0.00042708412701139147, 'samples': 7472448, 'steps': 38918, 'loss/train': 1.3688315153121948} 11/07/2021 02:42:13 - INFO - __main__ - Step 38920: {'lr': 0.000427080381073797, 'samples': 7472640, 'steps': 38919, 'loss/train': 1.7761547565460205} 11/07/2021 02:42:13 - INFO - __main__ - Step 38921: {'lr': 0.00042707663505641287, 'samples': 7472832, 'steps': 38920, 'loss/train': 1.8193520307540894} 11/07/2021 02:42:13 - INFO - __main__ - Step 38922: {'lr': 0.00042707288895924066, 'samples': 7473024, 'steps': 38921, 'loss/train': 1.3390640020370483} 11/07/2021 02:42:14 - INFO - __main__ - Step 38923: {'lr': 0.0004270691427822823, 'samples': 7473216, 'steps': 38922, 'loss/train': 1.3779325485229492} 11/07/2021 02:42:14 - INFO - __main__ - Step 38924: {'lr': 0.0004270653965255391, 'samples': 7473408, 'steps': 38923, 'loss/train': 1.0960659980773926} 11/07/2021 02:42:14 - INFO - __main__ - Step 38925: {'lr': 0.0004270616501890131, 'samples': 7473600, 'steps': 38924, 'loss/train': 1.1420000791549683} 11/07/2021 02:42:15 - INFO - __main__ - Step 38926: {'lr': 0.0004270579037727058, 'samples': 7473792, 'steps': 38925, 'loss/train': 0.8926109075546265} 11/07/2021 02:42:15 - INFO - __main__ - Step 38927: {'lr': 0.000427054157276619, 'samples': 7473984, 'steps': 38926, 'loss/train': 1.9323151111602783} 11/07/2021 02:42:16 - INFO - __main__ - Step 38928: {'lr': 0.00042705041070075433, 'samples': 7474176, 'steps': 38927, 'loss/train': 0.7841858863830566} 11/07/2021 02:42:17 - INFO - __main__ - Step 38929: {'lr': 0.00042704666404511343, 'samples': 7474368, 'steps': 38928, 'loss/train': 1.7835983037948608} 11/07/2021 02:42:17 - INFO - __main__ - Step 38930: {'lr': 0.000427042917309698, 'samples': 7474560, 'steps': 38929, 'loss/train': 1.7741930484771729} 11/07/2021 02:42:17 - INFO - __main__ - Step 38931: {'lr': 0.00042703917049450983, 'samples': 7474752, 'steps': 38930, 'loss/train': 1.5438940525054932} 11/07/2021 02:42:18 - INFO - __main__ - Step 38932: {'lr': 0.0004270354235995505, 'samples': 7474944, 'steps': 38931, 'loss/train': 1.4355179071426392} 11/07/2021 02:42:19 - INFO - __main__ - Step 38933: {'lr': 0.0004270316766248218, 'samples': 7475136, 'steps': 38932, 'loss/train': 1.6149401664733887} 11/07/2021 02:42:19 - INFO - __main__ - Step 38934: {'lr': 0.0004270279295703253, 'samples': 7475328, 'steps': 38933, 'loss/train': 1.5284372568130493} 11/07/2021 02:42:19 - INFO - __main__ - Step 38935: {'lr': 0.00042702418243606275, 'samples': 7475520, 'steps': 38934, 'loss/train': 1.5661935806274414} 11/07/2021 02:42:20 - INFO - __main__ - Step 38936: {'lr': 0.00042702043522203594, 'samples': 7475712, 'steps': 38935, 'loss/train': 1.6040937900543213} 11/07/2021 02:42:20 - INFO - __main__ - Step 38937: {'lr': 0.00042701668792824633, 'samples': 7475904, 'steps': 38936, 'loss/train': 1.6600172519683838} 11/07/2021 02:42:21 - INFO - __main__ - Step 38938: {'lr': 0.00042701294055469576, 'samples': 7476096, 'steps': 38937, 'loss/train': 1.6687569618225098} 11/07/2021 02:42:21 - INFO - __main__ - Step 38939: {'lr': 0.0004270091931013859, 'samples': 7476288, 'steps': 38938, 'loss/train': 1.7354636192321777} 11/07/2021 02:42:22 - INFO - __main__ - Step 38940: {'lr': 0.00042700544556831846, 'samples': 7476480, 'steps': 38939, 'loss/train': 1.436832308769226} 11/07/2021 02:42:22 - INFO - __main__ - Step 38941: {'lr': 0.00042700169795549504, 'samples': 7476672, 'steps': 38940, 'loss/train': 1.759921669960022} 11/07/2021 02:42:23 - INFO - __main__ - Step 38942: {'lr': 0.00042699795026291743, 'samples': 7476864, 'steps': 38941, 'loss/train': 2.0764222145080566} 11/07/2021 02:42:24 - INFO - __main__ - Step 38943: {'lr': 0.0004269942024905872, 'samples': 7477056, 'steps': 38942, 'loss/train': 1.6727497577667236} 11/07/2021 02:42:24 - INFO - __main__ - Step 38944: {'lr': 0.00042699045463850623, 'samples': 7477248, 'steps': 38943, 'loss/train': 1.5481544733047485} 11/07/2021 02:42:25 - INFO - __main__ - Step 38945: {'lr': 0.000426986706706676, 'samples': 7477440, 'steps': 38944, 'loss/train': 1.027152180671692} 11/07/2021 02:42:25 - INFO - __main__ - Step 38946: {'lr': 0.00042698295869509836, 'samples': 7477632, 'steps': 38945, 'loss/train': 1.5602725744247437} 11/07/2021 02:42:25 - INFO - __main__ - Step 38947: {'lr': 0.0004269792106037749, 'samples': 7477824, 'steps': 38946, 'loss/train': 0.6573772430419922} 11/07/2021 02:42:26 - INFO - __main__ - Step 38948: {'lr': 0.0004269754624327073, 'samples': 7478016, 'steps': 38947, 'loss/train': 2.1020119190216064} 11/07/2021 02:42:27 - INFO - __main__ - Step 38949: {'lr': 0.0004269717141818973, 'samples': 7478208, 'steps': 38948, 'loss/train': 1.3972387313842773} 11/07/2021 02:42:27 - INFO - __main__ - Step 38950: {'lr': 0.0004269679658513466, 'samples': 7478400, 'steps': 38949, 'loss/train': 1.495678186416626} 11/07/2021 02:42:27 - INFO - __main__ - Step 38951: {'lr': 0.00042696421744105686, 'samples': 7478592, 'steps': 38950, 'loss/train': 1.2925618886947632} 11/07/2021 02:42:28 - INFO - __main__ - Step 38952: {'lr': 0.0004269604689510298, 'samples': 7478784, 'steps': 38951, 'loss/train': 2.0521535873413086} 11/07/2021 02:42:28 - INFO - __main__ - Step 38953: {'lr': 0.0004269567203812671, 'samples': 7478976, 'steps': 38952, 'loss/train': 1.5479966402053833} 11/07/2021 02:42:29 - INFO - __main__ - Step 38954: {'lr': 0.00042695297173177033, 'samples': 7479168, 'steps': 38953, 'loss/train': 0.9544076919555664} 11/07/2021 02:42:30 - INFO - __main__ - Step 38955: {'lr': 0.0004269492230025413, 'samples': 7479360, 'steps': 38954, 'loss/train': 1.1891474723815918} 11/07/2021 02:42:30 - INFO - __main__ - Step 38956: {'lr': 0.0004269454741935818, 'samples': 7479552, 'steps': 38955, 'loss/train': 1.60728919506073} 11/07/2021 02:42:30 - INFO - __main__ - Step 38957: {'lr': 0.00042694172530489326, 'samples': 7479744, 'steps': 38956, 'loss/train': 1.171259880065918} 11/07/2021 02:42:31 - INFO - __main__ - Step 38958: {'lr': 0.00042693797633647755, 'samples': 7479936, 'steps': 38957, 'loss/train': 1.6915109157562256} 11/07/2021 02:42:32 - INFO - __main__ - Step 38959: {'lr': 0.00042693422728833644, 'samples': 7480128, 'steps': 38958, 'loss/train': 1.4069545269012451} 11/07/2021 02:42:32 - INFO - __main__ - Step 38960: {'lr': 0.00042693047816047135, 'samples': 7480320, 'steps': 38959, 'loss/train': 1.5176678895950317} 11/07/2021 02:42:32 - INFO - __main__ - Step 38961: {'lr': 0.0004269267289528842, 'samples': 7480512, 'steps': 38960, 'loss/train': 2.151459217071533} 11/07/2021 02:42:33 - INFO - __main__ - Step 38962: {'lr': 0.00042692297966557657, 'samples': 7480704, 'steps': 38961, 'loss/train': 1.3303264379501343} 11/07/2021 02:42:33 - INFO - __main__ - Step 38963: {'lr': 0.0004269192302985502, 'samples': 7480896, 'steps': 38962, 'loss/train': 1.3381013870239258} 11/07/2021 02:42:34 - INFO - __main__ - Step 38964: {'lr': 0.00042691548085180666, 'samples': 7481088, 'steps': 38963, 'loss/train': 1.6442064046859741} 11/07/2021 02:42:34 - INFO - __main__ - Step 38965: {'lr': 0.00042691173132534775, 'samples': 7481280, 'steps': 38964, 'loss/train': 1.4329073429107666} 11/07/2021 02:42:35 - INFO - __main__ - Step 38966: {'lr': 0.0004269079817191752, 'samples': 7481472, 'steps': 38965, 'loss/train': 1.486147403717041} 11/07/2021 02:42:35 - INFO - __main__ - Step 38967: {'lr': 0.00042690423203329067, 'samples': 7481664, 'steps': 38966, 'loss/train': 1.5373287200927734} 11/07/2021 02:42:35 - INFO - __main__ - Step 38968: {'lr': 0.0004269004822676958, 'samples': 7481856, 'steps': 38967, 'loss/train': 1.4226380586624146} 11/07/2021 02:42:37 - INFO - __main__ - Step 38969: {'lr': 0.0004268967324223922, 'samples': 7482048, 'steps': 38968, 'loss/train': 1.3333868980407715} 11/07/2021 02:42:37 - INFO - __main__ - Step 38970: {'lr': 0.00042689298249738185, 'samples': 7482240, 'steps': 38969, 'loss/train': 0.6204865574836731} 11/07/2021 02:42:37 - INFO - __main__ - Step 38971: {'lr': 0.00042688923249266614, 'samples': 7482432, 'steps': 38970, 'loss/train': 1.3735517263412476} 11/07/2021 02:42:38 - INFO - __main__ - Step 38972: {'lr': 0.00042688548240824687, 'samples': 7482624, 'steps': 38971, 'loss/train': 1.390674114227295} 11/07/2021 02:42:38 - INFO - __main__ - Step 38973: {'lr': 0.00042688173224412573, 'samples': 7482816, 'steps': 38972, 'loss/train': 1.1444605588912964} 11/07/2021 02:42:39 - INFO - __main__ - Step 38974: {'lr': 0.00042687798200030446, 'samples': 7483008, 'steps': 38973, 'loss/train': 1.1193890571594238} 11/07/2021 02:42:39 - INFO - __main__ - Step 38975: {'lr': 0.00042687423167678463, 'samples': 7483200, 'steps': 38974, 'loss/train': 1.9068354368209839} 11/07/2021 02:42:40 - INFO - __main__ - Step 38976: {'lr': 0.0004268704812735681, 'samples': 7483392, 'steps': 38975, 'loss/train': 1.6004170179367065} 11/07/2021 02:42:40 - INFO - __main__ - Step 38977: {'lr': 0.00042686673079065637, 'samples': 7483584, 'steps': 38976, 'loss/train': 1.358608603477478} 11/07/2021 02:42:40 - INFO - __main__ - Step 38978: {'lr': 0.00042686298022805126, 'samples': 7483776, 'steps': 38977, 'loss/train': 1.4833357334136963} 11/07/2021 02:42:41 - INFO - __main__ - Step 38979: {'lr': 0.0004268592295857544, 'samples': 7483968, 'steps': 38978, 'loss/train': 1.4220755100250244} 11/07/2021 02:42:42 - INFO - __main__ - Step 38980: {'lr': 0.0004268554788637675, 'samples': 7484160, 'steps': 38979, 'loss/train': 1.1065647602081299} 11/07/2021 02:42:42 - INFO - __main__ - Step 38981: {'lr': 0.0004268517280620923, 'samples': 7484352, 'steps': 38980, 'loss/train': 1.544353723526001} 11/07/2021 02:42:42 - INFO - __main__ - Step 38982: {'lr': 0.0004268479771807303, 'samples': 7484544, 'steps': 38981, 'loss/train': 1.8099292516708374} 11/07/2021 02:42:43 - INFO - __main__ - Step 38983: {'lr': 0.00042684422621968346, 'samples': 7484736, 'steps': 38982, 'loss/train': 2.324009418487549} 11/07/2021 02:42:43 - INFO - __main__ - Step 38984: {'lr': 0.0004268404751789533, 'samples': 7484928, 'steps': 38983, 'loss/train': 1.7545044422149658} 11/07/2021 02:42:44 - INFO - __main__ - Step 38985: {'lr': 0.0004268367240585416, 'samples': 7485120, 'steps': 38984, 'loss/train': 1.7155299186706543} 11/07/2021 02:42:44 - INFO - __main__ - Step 38986: {'lr': 0.0004268329728584499, 'samples': 7485312, 'steps': 38985, 'loss/train': 1.4670684337615967} 11/07/2021 02:42:45 - INFO - __main__ - Step 38987: {'lr': 0.0004268292215786801, 'samples': 7485504, 'steps': 38986, 'loss/train': 1.6708879470825195} 11/07/2021 02:42:45 - INFO - __main__ - Step 38988: {'lr': 0.0004268254702192337, 'samples': 7485696, 'steps': 38987, 'loss/train': 1.7074424028396606} 11/07/2021 02:42:45 - INFO - __main__ - Step 38989: {'lr': 0.00042682171878011255, 'samples': 7485888, 'steps': 38988, 'loss/train': 1.547581672668457} 11/07/2021 02:42:47 - INFO - __main__ - Step 38990: {'lr': 0.00042681796726131815, 'samples': 7486080, 'steps': 38989, 'loss/train': 0.6456459760665894} 11/07/2021 02:42:47 - INFO - __main__ - Step 38991: {'lr': 0.0004268142156628524, 'samples': 7486272, 'steps': 38990, 'loss/train': 1.5809472799301147} 11/07/2021 02:42:47 - INFO - __main__ - Step 38992: {'lr': 0.00042681046398471693, 'samples': 7486464, 'steps': 38991, 'loss/train': 1.6774481534957886} 11/07/2021 02:42:48 - INFO - __main__ - Step 38993: {'lr': 0.00042680671222691325, 'samples': 7486656, 'steps': 38992, 'loss/train': 1.2868596315383911} 11/07/2021 02:42:48 - INFO - __main__ - Step 38994: {'lr': 0.0004268029603894433, 'samples': 7486848, 'steps': 38993, 'loss/train': 1.7931604385375977} 11/07/2021 02:42:49 - INFO - __main__ - Step 38995: {'lr': 0.00042679920847230865, 'samples': 7487040, 'steps': 38994, 'loss/train': 1.3785682916641235} 11/07/2021 02:42:49 - INFO - __main__ - Step 38996: {'lr': 0.000426795456475511, 'samples': 7487232, 'steps': 38995, 'loss/train': 1.4442554712295532} 11/07/2021 02:42:50 - INFO - __main__ - Step 38997: {'lr': 0.00042679170439905204, 'samples': 7487424, 'steps': 38996, 'loss/train': 1.5359501838684082} 11/07/2021 02:42:50 - INFO - __main__ - Step 38998: {'lr': 0.0004267879522429334, 'samples': 7487616, 'steps': 38997, 'loss/train': 1.2817848920822144} 11/07/2021 02:42:50 - INFO - __main__ - Step 38999: {'lr': 0.00042678420000715687, 'samples': 7487808, 'steps': 38998, 'loss/train': 1.5414429903030396} 11/07/2021 02:42:52 - INFO - __main__ - Step 39000: {'lr': 0.0004267804476917242, 'samples': 7488000, 'steps': 38999, 'loss/train': 1.793521761894226} 11/07/2021 02:42:52 - INFO - __main__ - Step 39001: {'lr': 0.00042677669529663686, 'samples': 7488192, 'steps': 39000, 'loss/train': 1.7577154636383057} 11/07/2021 02:42:52 - INFO - __main__ - Step 39002: {'lr': 0.0004267729428218968, 'samples': 7488384, 'steps': 39001, 'loss/train': 1.4497911930084229} 11/07/2021 02:42:53 - INFO - __main__ - Step 39003: {'lr': 0.0004267691902675055, 'samples': 7488576, 'steps': 39002, 'loss/train': 1.7540159225463867} 11/07/2021 02:42:53 - INFO - __main__ - Step 39004: {'lr': 0.0004267654376334647, 'samples': 7488768, 'steps': 39003, 'loss/train': 1.9711253643035889} 11/07/2021 02:42:53 - INFO - __main__ - Step 39005: {'lr': 0.00042676168491977617, 'samples': 7488960, 'steps': 39004, 'loss/train': 1.3152886629104614} 11/07/2021 02:42:55 - INFO - __main__ - Step 39006: {'lr': 0.00042675793212644156, 'samples': 7489152, 'steps': 39005, 'loss/train': 1.3990839719772339} 11/07/2021 02:42:55 - INFO - __main__ - Step 39007: {'lr': 0.00042675417925346255, 'samples': 7489344, 'steps': 39006, 'loss/train': 1.7867127656936646} 11/07/2021 02:42:55 - INFO - __main__ - Step 39008: {'lr': 0.0004267504263008408, 'samples': 7489536, 'steps': 39007, 'loss/train': 1.0118520259857178} 11/07/2021 02:42:56 - INFO - __main__ - Step 39009: {'lr': 0.0004267466732685781, 'samples': 7489728, 'steps': 39008, 'loss/train': 1.6900672912597656} 11/07/2021 02:42:56 - INFO - __main__ - Step 39010: {'lr': 0.000426742920156676, 'samples': 7489920, 'steps': 39009, 'loss/train': 1.5228233337402344} 11/07/2021 02:42:57 - INFO - __main__ - Step 39011: {'lr': 0.00042673916696513625, 'samples': 7490112, 'steps': 39010, 'loss/train': 1.2955700159072876} 11/07/2021 02:42:58 - INFO - __main__ - Step 39012: {'lr': 0.0004267354136939607, 'samples': 7490304, 'steps': 39011, 'loss/train': 1.5123738050460815} 11/07/2021 02:42:58 - INFO - __main__ - Step 39013: {'lr': 0.0004267316603431508, 'samples': 7490496, 'steps': 39012, 'loss/train': 0.2923266589641571} 11/07/2021 02:42:58 - INFO - __main__ - Step 39014: {'lr': 0.00042672790691270835, 'samples': 7490688, 'steps': 39013, 'loss/train': 0.26859769225120544} 11/07/2021 02:42:59 - INFO - __main__ - Step 39015: {'lr': 0.00042672415340263507, 'samples': 7490880, 'steps': 39014, 'loss/train': 1.77121102809906} 11/07/2021 02:42:59 - INFO - __main__ - Step 39016: {'lr': 0.00042672039981293255, 'samples': 7491072, 'steps': 39015, 'loss/train': 1.6089158058166504} 11/07/2021 02:43:00 - INFO - __main__ - Step 39017: {'lr': 0.0004267166461436025, 'samples': 7491264, 'steps': 39016, 'loss/train': 1.4633493423461914} 11/07/2021 02:43:00 - INFO - __main__ - Step 39018: {'lr': 0.0004267128923946468, 'samples': 7491456, 'steps': 39017, 'loss/train': 1.7290396690368652} 11/07/2021 02:43:01 - INFO - __main__ - Step 39019: {'lr': 0.00042670913856606693, 'samples': 7491648, 'steps': 39018, 'loss/train': 1.2950791120529175} 11/07/2021 02:43:01 - INFO - __main__ - Step 39020: {'lr': 0.0004267053846578646, 'samples': 7491840, 'steps': 39019, 'loss/train': 1.799487590789795} 11/07/2021 02:43:01 - INFO - __main__ - Step 39021: {'lr': 0.00042670163067004156, 'samples': 7492032, 'steps': 39020, 'loss/train': 1.414844036102295} 11/07/2021 02:43:02 - INFO - __main__ - Step 39022: {'lr': 0.00042669787660259956, 'samples': 7492224, 'steps': 39021, 'loss/train': 1.309682846069336} 11/07/2021 02:43:03 - INFO - __main__ - Step 39023: {'lr': 0.0004266941224555402, 'samples': 7492416, 'steps': 39022, 'loss/train': 1.3504818677902222} 11/07/2021 02:43:03 - INFO - __main__ - Step 39024: {'lr': 0.0004266903682288652, 'samples': 7492608, 'steps': 39023, 'loss/train': 1.6617196798324585} 11/07/2021 02:43:04 - INFO - __main__ - Step 39025: {'lr': 0.00042668661392257626, 'samples': 7492800, 'steps': 39024, 'loss/train': 0.6336386203765869} 11/07/2021 02:43:04 - INFO - __main__ - Step 39026: {'lr': 0.00042668285953667497, 'samples': 7492992, 'steps': 39025, 'loss/train': 2.098825693130493} 11/07/2021 02:43:05 - INFO - __main__ - Step 39027: {'lr': 0.0004266791050711632, 'samples': 7493184, 'steps': 39026, 'loss/train': 1.292114019393921} 11/07/2021 02:43:05 - INFO - __main__ - Step 39028: {'lr': 0.0004266753505260425, 'samples': 7493376, 'steps': 39027, 'loss/train': 0.9015293717384338} 11/07/2021 02:43:06 - INFO - __main__ - Step 39029: {'lr': 0.00042667159590131467, 'samples': 7493568, 'steps': 39028, 'loss/train': 1.6260732412338257} 11/07/2021 02:43:06 - INFO - __main__ - Step 39030: {'lr': 0.0004266678411969813, 'samples': 7493760, 'steps': 39029, 'loss/train': 1.626287579536438} 11/07/2021 02:43:06 - INFO - __main__ - Step 39031: {'lr': 0.0004266640864130441, 'samples': 7493952, 'steps': 39030, 'loss/train': 2.1639013290405273} 11/07/2021 02:43:07 - INFO - __main__ - Step 39032: {'lr': 0.00042666033154950485, 'samples': 7494144, 'steps': 39031, 'loss/train': 1.570765733718872} 11/07/2021 02:43:08 - INFO - __main__ - Step 39033: {'lr': 0.00042665657660636517, 'samples': 7494336, 'steps': 39032, 'loss/train': 1.9968924522399902} 11/07/2021 02:43:08 - INFO - __main__ - Step 39034: {'lr': 0.0004266528215836267, 'samples': 7494528, 'steps': 39033, 'loss/train': 2.419520854949951} 11/07/2021 02:43:08 - INFO - __main__ - Step 39035: {'lr': 0.0004266490664812913, 'samples': 7494720, 'steps': 39034, 'loss/train': 1.085390567779541} 11/07/2021 02:43:09 - INFO - __main__ - Step 39036: {'lr': 0.00042664531129936044, 'samples': 7494912, 'steps': 39035, 'loss/train': 1.7049858570098877} 11/07/2021 02:43:10 - INFO - __main__ - Step 39037: {'lr': 0.00042664155603783606, 'samples': 7495104, 'steps': 39036, 'loss/train': 1.748128056526184} 11/07/2021 02:43:10 - INFO - __main__ - Step 39038: {'lr': 0.00042663780069671965, 'samples': 7495296, 'steps': 39037, 'loss/train': 1.3688992261886597} 11/07/2021 02:43:11 - INFO - __main__ - Step 39039: {'lr': 0.00042663404527601293, 'samples': 7495488, 'steps': 39038, 'loss/train': 1.2607609033584595} 11/07/2021 02:43:11 - INFO - __main__ - Step 39040: {'lr': 0.00042663028977571774, 'samples': 7495680, 'steps': 39039, 'loss/train': 1.2940502166748047} 11/07/2021 02:43:11 - INFO - __main__ - Step 39041: {'lr': 0.0004266265341958355, 'samples': 7495872, 'steps': 39040, 'loss/train': 1.4089736938476562} 11/07/2021 02:43:12 - INFO - __main__ - Step 39042: {'lr': 0.0004266227785363682, 'samples': 7496064, 'steps': 39041, 'loss/train': 1.0120713710784912} 11/07/2021 02:43:13 - INFO - __main__ - Step 39043: {'lr': 0.0004266190227973174, 'samples': 7496256, 'steps': 39042, 'loss/train': 1.0299112796783447} 11/07/2021 02:43:13 - INFO - __main__ - Step 39044: {'lr': 0.00042661526697868475, 'samples': 7496448, 'steps': 39043, 'loss/train': 1.5314782857894897} 11/07/2021 02:43:13 - INFO - __main__ - Step 39045: {'lr': 0.000426611511080472, 'samples': 7496640, 'steps': 39044, 'loss/train': 1.5129510164260864} 11/07/2021 02:43:14 - INFO - __main__ - Step 39046: {'lr': 0.0004266077551026809, 'samples': 7496832, 'steps': 39045, 'loss/train': 1.6129285097122192} 11/07/2021 02:43:14 - INFO - __main__ - Step 39047: {'lr': 0.000426603999045313, 'samples': 7497024, 'steps': 39046, 'loss/train': 1.57774019241333} 11/07/2021 02:43:15 - INFO - __main__ - Step 39048: {'lr': 0.00042660024290837003, 'samples': 7497216, 'steps': 39047, 'loss/train': 1.5723294019699097} 11/07/2021 02:43:15 - INFO - __main__ - Step 39049: {'lr': 0.00042659648669185376, 'samples': 7497408, 'steps': 39048, 'loss/train': 1.8148497343063354} 11/07/2021 02:43:16 - INFO - __main__ - Step 39050: {'lr': 0.0004265927303957658, 'samples': 7497600, 'steps': 39049, 'loss/train': 1.5007797479629517} 11/07/2021 02:43:16 - INFO - __main__ - Step 39051: {'lr': 0.0004265889740201079, 'samples': 7497792, 'steps': 39050, 'loss/train': 0.9909560680389404} 11/07/2021 02:43:17 - INFO - __main__ - Step 39052: {'lr': 0.0004265852175648818, 'samples': 7497984, 'steps': 39051, 'loss/train': 1.711198091506958} 11/07/2021 02:43:17 - INFO - __main__ - Step 39053: {'lr': 0.00042658146103008904, 'samples': 7498176, 'steps': 39052, 'loss/train': 1.286407470703125} 11/07/2021 02:43:18 - INFO - __main__ - Step 39054: {'lr': 0.0004265777044157314, 'samples': 7498368, 'steps': 39053, 'loss/train': 1.2508012056350708} 11/07/2021 02:43:18 - INFO - __main__ - Step 39055: {'lr': 0.0004265739477218106, 'samples': 7498560, 'steps': 39054, 'loss/train': 1.1808032989501953} 11/07/2021 02:43:19 - INFO - __main__ - Step 39056: {'lr': 0.0004265701909483283, 'samples': 7498752, 'steps': 39055, 'loss/train': 1.7937642335891724} 11/07/2021 02:43:19 - INFO - __main__ - Step 39057: {'lr': 0.0004265664340952862, 'samples': 7498944, 'steps': 39056, 'loss/train': 1.225213646888733} 11/07/2021 02:43:20 - INFO - __main__ - Step 39058: {'lr': 0.00042656267716268596, 'samples': 7499136, 'steps': 39057, 'loss/train': 1.51992928981781} 11/07/2021 02:43:20 - INFO - __main__ - Step 39059: {'lr': 0.00042655892015052945, 'samples': 7499328, 'steps': 39058, 'loss/train': 1.9359447956085205} 11/07/2021 02:43:21 - INFO - __main__ - Step 39060: {'lr': 0.00042655516305881803, 'samples': 7499520, 'steps': 39059, 'loss/train': 1.600701093673706} 11/07/2021 02:43:21 - INFO - __main__ - Step 39061: {'lr': 0.00042655140588755366, 'samples': 7499712, 'steps': 39060, 'loss/train': 1.3078956604003906} 11/07/2021 02:43:21 - INFO - __main__ - Step 39062: {'lr': 0.0004265476486367379, 'samples': 7499904, 'steps': 39061, 'loss/train': 1.5851510763168335} 11/07/2021 02:43:22 - INFO - __main__ - Step 39063: {'lr': 0.00042654389130637255, 'samples': 7500096, 'steps': 39062, 'loss/train': 2.0155324935913086} 11/07/2021 02:43:23 - INFO - __main__ - Step 39064: {'lr': 0.0004265401338964592, 'samples': 7500288, 'steps': 39063, 'loss/train': 1.5076969861984253} 11/07/2021 02:43:23 - INFO - __main__ - Step 39065: {'lr': 0.0004265363764069997, 'samples': 7500480, 'steps': 39064, 'loss/train': 1.4725383520126343} 11/07/2021 02:43:24 - INFO - __main__ - Step 39066: {'lr': 0.0004265326188379955, 'samples': 7500672, 'steps': 39065, 'loss/train': 1.3737947940826416} 11/07/2021 02:43:24 - INFO - __main__ - Step 39067: {'lr': 0.00042652886118944844, 'samples': 7500864, 'steps': 39066, 'loss/train': 1.9032598733901978} 11/07/2021 02:43:25 - INFO - __main__ - Step 39068: {'lr': 0.0004265251034613603, 'samples': 7501056, 'steps': 39067, 'loss/train': 1.7712714672088623} 11/07/2021 02:43:25 - INFO - __main__ - Step 39069: {'lr': 0.0004265213456537326, 'samples': 7501248, 'steps': 39068, 'loss/train': 1.6109968423843384} 11/07/2021 02:43:26 - INFO - __main__ - Step 39070: {'lr': 0.0004265175877665671, 'samples': 7501440, 'steps': 39069, 'loss/train': 1.583343505859375} 11/07/2021 02:43:26 - INFO - __main__ - Step 39071: {'lr': 0.0004265138297998655, 'samples': 7501632, 'steps': 39070, 'loss/train': 1.392993688583374} 11/07/2021 02:43:26 - INFO - __main__ - Step 39072: {'lr': 0.0004265100717536295, 'samples': 7501824, 'steps': 39071, 'loss/train': 1.5252935886383057} 11/07/2021 02:43:27 - INFO - __main__ - Step 39073: {'lr': 0.0004265063136278608, 'samples': 7502016, 'steps': 39072, 'loss/train': 1.0011496543884277} 11/07/2021 02:43:28 - INFO - __main__ - Step 39074: {'lr': 0.00042650255542256107, 'samples': 7502208, 'steps': 39073, 'loss/train': 0.8705465793609619} 11/07/2021 02:43:28 - INFO - __main__ - Step 39075: {'lr': 0.000426498797137732, 'samples': 7502400, 'steps': 39074, 'loss/train': 1.4799444675445557} 11/07/2021 02:43:28 - INFO - __main__ - Step 39076: {'lr': 0.00042649503877337523, 'samples': 7502592, 'steps': 39075, 'loss/train': 1.2689144611358643} 11/07/2021 02:43:29 - INFO - __main__ - Step 39077: {'lr': 0.0004264912803294926, 'samples': 7502784, 'steps': 39076, 'loss/train': 1.2915695905685425} 11/07/2021 02:43:29 - INFO - __main__ - Step 39078: {'lr': 0.0004264875218060857, 'samples': 7502976, 'steps': 39077, 'loss/train': 1.4235657453536987} 11/07/2021 02:43:30 - INFO - __main__ - Step 39079: {'lr': 0.00042648376320315634, 'samples': 7503168, 'steps': 39078, 'loss/train': 1.135210633277893} 11/07/2021 02:43:30 - INFO - __main__ - Step 39080: {'lr': 0.000426480004520706, 'samples': 7503360, 'steps': 39079, 'loss/train': 1.3262708187103271} 11/07/2021 02:43:31 - INFO - __main__ - Step 39081: {'lr': 0.00042647624575873656, 'samples': 7503552, 'steps': 39080, 'loss/train': 1.3059016466140747} 11/07/2021 02:43:31 - INFO - __main__ - Step 39082: {'lr': 0.0004264724869172496, 'samples': 7503744, 'steps': 39081, 'loss/train': 1.7730166912078857} 11/07/2021 02:43:31 - INFO - __main__ - Step 39083: {'lr': 0.00042646872799624694, 'samples': 7503936, 'steps': 39082, 'loss/train': 1.3689271211624146} 11/07/2021 02:43:33 - INFO - __main__ - Step 39084: {'lr': 0.00042646496899573005, 'samples': 7504128, 'steps': 39083, 'loss/train': 1.578107476234436} 11/07/2021 02:43:33 - INFO - __main__ - Step 39085: {'lr': 0.0004264612099157009, 'samples': 7504320, 'steps': 39084, 'loss/train': 1.7110601663589478} 11/07/2021 02:43:33 - INFO - __main__ - Step 39086: {'lr': 0.00042645745075616106, 'samples': 7504512, 'steps': 39085, 'loss/train': 0.18499447405338287} 11/07/2021 02:43:34 - INFO - __main__ - Step 39087: {'lr': 0.0004264536915171121, 'samples': 7504704, 'steps': 39086, 'loss/train': 1.8357080221176147} 11/07/2021 02:43:34 - INFO - __main__ - Step 39088: {'lr': 0.0004264499321985559, 'samples': 7504896, 'steps': 39087, 'loss/train': 1.2061140537261963} 11/07/2021 02:43:35 - INFO - __main__ - Step 39089: {'lr': 0.0004264461728004941, 'samples': 7505088, 'steps': 39088, 'loss/train': 1.141921877861023} 11/07/2021 02:43:35 - INFO - __main__ - Step 39090: {'lr': 0.0004264424133229283, 'samples': 7505280, 'steps': 39089, 'loss/train': 0.20083756744861603} 11/07/2021 02:43:36 - INFO - __main__ - Step 39091: {'lr': 0.0004264386537658603, 'samples': 7505472, 'steps': 39090, 'loss/train': 6.336944103240967} 11/07/2021 02:43:36 - INFO - __main__ - Step 39092: {'lr': 0.0004264348941292919, 'samples': 7505664, 'steps': 39091, 'loss/train': 1.4087821245193481} 11/07/2021 02:43:37 - INFO - __main__ - Step 39093: {'lr': 0.0004264311344132245, 'samples': 7505856, 'steps': 39092, 'loss/train': 1.8912596702575684} 11/07/2021 02:43:38 - INFO - __main__ - Step 39094: {'lr': 0.00042642737461766003, 'samples': 7506048, 'steps': 39093, 'loss/train': 1.4628355503082275} 11/07/2021 02:43:38 - INFO - __main__ - Step 39095: {'lr': 0.0004264236147426, 'samples': 7506240, 'steps': 39094, 'loss/train': 1.4418323040008545} 11/07/2021 02:43:38 - INFO - __main__ - Step 39096: {'lr': 0.0004264198547880464, 'samples': 7506432, 'steps': 39095, 'loss/train': 1.587921380996704} 11/07/2021 02:43:39 - INFO - __main__ - Step 39097: {'lr': 0.00042641609475400054, 'samples': 7506624, 'steps': 39096, 'loss/train': 2.056940793991089} 11/07/2021 02:43:39 - INFO - __main__ - Step 39098: {'lr': 0.0004264123346404644, 'samples': 7506816, 'steps': 39097, 'loss/train': 1.7755131721496582} 11/07/2021 02:43:39 - INFO - __main__ - Step 39099: {'lr': 0.0004264085744474396, 'samples': 7507008, 'steps': 39098, 'loss/train': 1.3965548276901245} 11/07/2021 02:43:40 - INFO - __main__ - Step 39100: {'lr': 0.0004264048141749278, 'samples': 7507200, 'steps': 39099, 'loss/train': 1.769647240638733} 11/07/2021 02:43:41 - INFO - __main__ - Step 39101: {'lr': 0.00042640105382293073, 'samples': 7507392, 'steps': 39100, 'loss/train': 1.4096415042877197} 11/07/2021 02:43:41 - INFO - __main__ - Step 39102: {'lr': 0.00042639729339145004, 'samples': 7507584, 'steps': 39101, 'loss/train': 0.1855313628911972} 11/07/2021 02:43:42 - INFO - __main__ - Step 39103: {'lr': 0.0004263935328804874, 'samples': 7507776, 'steps': 39102, 'loss/train': 1.5442306995391846} 11/07/2021 02:43:42 - INFO - __main__ - Step 39104: {'lr': 0.0004263897722900447, 'samples': 7507968, 'steps': 39103, 'loss/train': 0.9975839853286743} 11/07/2021 02:43:43 - INFO - __main__ - Step 39105: {'lr': 0.0004263860116201234, 'samples': 7508160, 'steps': 39104, 'loss/train': 1.142077088356018} 11/07/2021 02:43:43 - INFO - __main__ - Step 39106: {'lr': 0.00042638225087072523, 'samples': 7508352, 'steps': 39105, 'loss/train': 1.9047843217849731} 11/07/2021 02:43:44 - INFO - __main__ - Step 39107: {'lr': 0.00042637849004185203, 'samples': 7508544, 'steps': 39106, 'loss/train': 2.055018663406372} 11/07/2021 02:43:44 - INFO - __main__ - Step 39108: {'lr': 0.0004263747291335054, 'samples': 7508736, 'steps': 39107, 'loss/train': 1.40550696849823} 11/07/2021 02:43:44 - INFO - __main__ - Step 39109: {'lr': 0.00042637096814568696, 'samples': 7508928, 'steps': 39108, 'loss/train': 1.4031354188919067} 11/07/2021 02:43:45 - INFO - __main__ - Step 39110: {'lr': 0.0004263672070783986, 'samples': 7509120, 'steps': 39109, 'loss/train': 2.0347707271575928} 11/07/2021 02:43:46 - INFO - __main__ - Step 39111: {'lr': 0.0004263634459316418, 'samples': 7509312, 'steps': 39110, 'loss/train': 1.3580670356750488} 11/07/2021 02:43:46 - INFO - __main__ - Step 39112: {'lr': 0.0004263596847054184, 'samples': 7509504, 'steps': 39111, 'loss/train': 1.6178772449493408} 11/07/2021 02:43:46 - INFO - __main__ - Step 39113: {'lr': 0.00042635592339973006, 'samples': 7509696, 'steps': 39112, 'loss/train': 1.6451839208602905} 11/07/2021 02:43:47 - INFO - __main__ - Step 39114: {'lr': 0.00042635216201457836, 'samples': 7509888, 'steps': 39113, 'loss/train': 1.5635451078414917} 11/07/2021 02:43:48 - INFO - __main__ - Step 39115: {'lr': 0.00042634840054996527, 'samples': 7510080, 'steps': 39114, 'loss/train': 1.6748383045196533} 11/07/2021 02:43:48 - INFO - __main__ - Step 39116: {'lr': 0.00042634463900589214, 'samples': 7510272, 'steps': 39115, 'loss/train': 1.7744406461715698} 11/07/2021 02:43:49 - INFO - __main__ - Step 39117: {'lr': 0.0004263408773823609, 'samples': 7510464, 'steps': 39116, 'loss/train': 1.3158316612243652} 11/07/2021 02:43:49 - INFO - __main__ - Step 39118: {'lr': 0.00042633711567937325, 'samples': 7510656, 'steps': 39117, 'loss/train': 1.6775025129318237} 11/07/2021 02:43:49 - INFO - __main__ - Step 39119: {'lr': 0.00042633335389693073, 'samples': 7510848, 'steps': 39118, 'loss/train': 0.40122219920158386} 11/07/2021 02:43:51 - INFO - __main__ - Step 39120: {'lr': 0.0004263295920350352, 'samples': 7511040, 'steps': 39119, 'loss/train': 1.5833914279937744} 11/07/2021 02:43:51 - INFO - __main__ - Step 39121: {'lr': 0.0004263258300936882, 'samples': 7511232, 'steps': 39120, 'loss/train': 1.6524046659469604} 11/07/2021 02:43:51 - INFO - __main__ - Step 39122: {'lr': 0.00042632206807289154, 'samples': 7511424, 'steps': 39121, 'loss/train': 1.775888442993164} 11/07/2021 02:43:52 - INFO - __main__ - Step 39123: {'lr': 0.00042631830597264687, 'samples': 7511616, 'steps': 39122, 'loss/train': 0.7915307879447937} 11/07/2021 02:43:52 - INFO - __main__ - Step 39124: {'lr': 0.0004263145437929559, 'samples': 7511808, 'steps': 39123, 'loss/train': 1.0251669883728027} 11/07/2021 02:43:52 - INFO - __main__ - Step 39125: {'lr': 0.0004263107815338203, 'samples': 7512000, 'steps': 39124, 'loss/train': 1.103756308555603} 11/07/2021 02:43:54 - INFO - __main__ - Step 39126: {'lr': 0.00042630701919524176, 'samples': 7512192, 'steps': 39125, 'loss/train': 1.082054853439331} 11/07/2021 02:43:55 - INFO - __main__ - Step 39127: {'lr': 0.00042630325677722204, 'samples': 7512384, 'steps': 39126, 'loss/train': 1.9423552751541138} 11/07/2021 02:43:55 - INFO - __main__ - Step 39128: {'lr': 0.0004262994942797628, 'samples': 7512576, 'steps': 39127, 'loss/train': 1.4827128648757935} 11/07/2021 02:43:55 - INFO - __main__ - Step 39129: {'lr': 0.0004262957317028657, 'samples': 7512768, 'steps': 39128, 'loss/train': 1.9064688682556152} 11/07/2021 02:43:56 - INFO - __main__ - Step 39130: {'lr': 0.00042629196904653245, 'samples': 7512960, 'steps': 39129, 'loss/train': 1.6398931741714478} 11/07/2021 02:43:56 - INFO - __main__ - Step 39131: {'lr': 0.00042628820631076484, 'samples': 7513152, 'steps': 39130, 'loss/train': 1.8073322772979736} 11/07/2021 02:43:56 - INFO - __main__ - Step 39132: {'lr': 0.0004262844434955644, 'samples': 7513344, 'steps': 39131, 'loss/train': 1.827433705329895} 11/07/2021 02:43:57 - INFO - __main__ - Step 39133: {'lr': 0.00042628068060093294, 'samples': 7513536, 'steps': 39132, 'loss/train': 1.417256474494934} 11/07/2021 02:43:58 - INFO - __main__ - Step 39134: {'lr': 0.0004262769176268722, 'samples': 7513728, 'steps': 39133, 'loss/train': 1.0433682203292847} 11/07/2021 02:43:58 - INFO - __main__ - Step 39135: {'lr': 0.0004262731545733837, 'samples': 7513920, 'steps': 39134, 'loss/train': 0.5978344082832336} 11/07/2021 02:43:59 - INFO - __main__ - Step 39136: {'lr': 0.0004262693914404692, 'samples': 7514112, 'steps': 39135, 'loss/train': 2.1662096977233887} 11/07/2021 02:43:59 - INFO - __main__ - Step 39137: {'lr': 0.0004262656282281305, 'samples': 7514304, 'steps': 39136, 'loss/train': 1.5228852033615112} 11/07/2021 02:43:59 - INFO - __main__ - Step 39138: {'lr': 0.0004262618649363692, 'samples': 7514496, 'steps': 39137, 'loss/train': 1.7849223613739014} 11/07/2021 02:44:00 - INFO - __main__ - Step 39139: {'lr': 0.0004262581015651871, 'samples': 7514688, 'steps': 39138, 'loss/train': 1.654083251953125} 11/07/2021 02:44:01 - INFO - __main__ - Step 39140: {'lr': 0.0004262543381145857, 'samples': 7514880, 'steps': 39139, 'loss/train': 1.332046627998352} 11/07/2021 02:44:01 - INFO - __main__ - Step 39141: {'lr': 0.0004262505745845669, 'samples': 7515072, 'steps': 39140, 'loss/train': 1.7173765897750854} 11/07/2021 02:44:01 - INFO - __main__ - Step 39142: {'lr': 0.0004262468109751323, 'samples': 7515264, 'steps': 39141, 'loss/train': 0.9578734636306763} 11/07/2021 02:44:02 - INFO - __main__ - Step 39143: {'lr': 0.0004262430472862836, 'samples': 7515456, 'steps': 39142, 'loss/train': 1.4384714365005493} 11/07/2021 02:44:03 - INFO - __main__ - Step 39144: {'lr': 0.00042623928351802245, 'samples': 7515648, 'steps': 39143, 'loss/train': 1.6106135845184326} 11/07/2021 02:44:03 - INFO - __main__ - Step 39145: {'lr': 0.00042623551967035066, 'samples': 7515840, 'steps': 39144, 'loss/train': 1.6195812225341797} 11/07/2021 02:44:03 - INFO - __main__ - Step 39146: {'lr': 0.0004262317557432699, 'samples': 7516032, 'steps': 39145, 'loss/train': 1.3861796855926514} 11/07/2021 02:44:04 - INFO - __main__ - Step 39147: {'lr': 0.0004262279917367817, 'samples': 7516224, 'steps': 39146, 'loss/train': 2.5530037879943848} 11/07/2021 02:44:04 - INFO - __main__ - Step 39148: {'lr': 0.00042622422765088805, 'samples': 7516416, 'steps': 39147, 'loss/train': 1.7445322275161743} 11/07/2021 02:44:05 - INFO - __main__ - Step 39149: {'lr': 0.00042622046348559034, 'samples': 7516608, 'steps': 39148, 'loss/train': 1.8676589727401733} 11/07/2021 02:44:06 - INFO - __main__ - Step 39150: {'lr': 0.00042621669924089044, 'samples': 7516800, 'steps': 39149, 'loss/train': 1.1236180067062378} 11/07/2021 02:44:06 - INFO - __main__ - Step 39151: {'lr': 0.00042621293491679007, 'samples': 7516992, 'steps': 39150, 'loss/train': 1.6182490587234497} 11/07/2021 02:44:06 - INFO - __main__ - Step 39152: {'lr': 0.00042620917051329086, 'samples': 7517184, 'steps': 39151, 'loss/train': 1.8522018194198608} 11/07/2021 02:44:07 - INFO - __main__ - Step 39153: {'lr': 0.0004262054060303945, 'samples': 7517376, 'steps': 39152, 'loss/train': 1.569541335105896} 11/07/2021 02:44:08 - INFO - __main__ - Step 39154: {'lr': 0.00042620164146810267, 'samples': 7517568, 'steps': 39153, 'loss/train': 1.5665749311447144} 11/07/2021 02:44:08 - INFO - __main__ - Step 39155: {'lr': 0.0004261978768264172, 'samples': 7517760, 'steps': 39154, 'loss/train': 1.3959105014801025} 11/07/2021 02:44:08 - INFO - __main__ - Step 39156: {'lr': 0.00042619411210533957, 'samples': 7517952, 'steps': 39155, 'loss/train': 1.1868535280227661} 11/07/2021 02:44:09 - INFO - __main__ - Step 39157: {'lr': 0.00042619034730487167, 'samples': 7518144, 'steps': 39156, 'loss/train': 1.2965528964996338} 11/07/2021 02:44:09 - INFO - __main__ - Step 39158: {'lr': 0.00042618658242501507, 'samples': 7518336, 'steps': 39157, 'loss/train': 1.6020418405532837} 11/07/2021 02:44:09 - INFO - __main__ - Step 39159: {'lr': 0.0004261828174657716, 'samples': 7518528, 'steps': 39158, 'loss/train': 1.598738431930542} 11/07/2021 02:44:11 - INFO - __main__ - Step 39160: {'lr': 0.0004261790524271427, 'samples': 7518720, 'steps': 39159, 'loss/train': 0.7763488292694092} 11/07/2021 02:44:11 - INFO - __main__ - Step 39161: {'lr': 0.00042617528730913036, 'samples': 7518912, 'steps': 39160, 'loss/train': 1.378023386001587} 11/07/2021 02:44:11 - INFO - __main__ - Step 39162: {'lr': 0.00042617152211173615, 'samples': 7519104, 'steps': 39161, 'loss/train': 1.0492256879806519} 11/07/2021 02:44:12 - INFO - __main__ - Step 39163: {'lr': 0.0004261677568349618, 'samples': 7519296, 'steps': 39162, 'loss/train': 1.1615722179412842} 11/07/2021 02:44:12 - INFO - __main__ - Step 39164: {'lr': 0.0004261639914788089, 'samples': 7519488, 'steps': 39163, 'loss/train': 1.13607919216156} 11/07/2021 02:44:13 - INFO - __main__ - Step 39165: {'lr': 0.0004261602260432792, 'samples': 7519680, 'steps': 39164, 'loss/train': 1.5264469385147095} 11/07/2021 02:44:13 - INFO - __main__ - Step 39166: {'lr': 0.0004261564605283745, 'samples': 7519872, 'steps': 39165, 'loss/train': 1.8770747184753418} 11/07/2021 02:44:14 - INFO - __main__ - Step 39167: {'lr': 0.0004261526949340965, 'samples': 7520064, 'steps': 39166, 'loss/train': 1.918188452720642} 11/07/2021 02:44:14 - INFO - __main__ - Step 39168: {'lr': 0.0004261489292604467, 'samples': 7520256, 'steps': 39167, 'loss/train': 1.4785051345825195} 11/07/2021 02:44:14 - INFO - __main__ - Step 39169: {'lr': 0.0004261451635074269, 'samples': 7520448, 'steps': 39168, 'loss/train': 1.4458575248718262} 11/07/2021 02:44:15 - INFO - __main__ - Step 39170: {'lr': 0.0004261413976750388, 'samples': 7520640, 'steps': 39169, 'loss/train': 1.223093032836914} 11/07/2021 02:44:16 - INFO - __main__ - Step 39171: {'lr': 0.00042613763176328415, 'samples': 7520832, 'steps': 39170, 'loss/train': 1.6749415397644043} 11/07/2021 02:44:16 - INFO - __main__ - Step 39172: {'lr': 0.00042613386577216455, 'samples': 7521024, 'steps': 39171, 'loss/train': 1.0120102167129517} 11/07/2021 02:44:16 - INFO - __main__ - Step 39173: {'lr': 0.0004261300997016818, 'samples': 7521216, 'steps': 39172, 'loss/train': 1.5103209018707275} 11/07/2021 02:44:17 - INFO - __main__ - Step 39174: {'lr': 0.0004261263335518375, 'samples': 7521408, 'steps': 39173, 'loss/train': 1.765062689781189} 11/07/2021 02:44:18 - INFO - __main__ - Step 39175: {'lr': 0.00042612256732263345, 'samples': 7521600, 'steps': 39174, 'loss/train': 1.3832707405090332} 11/07/2021 02:44:18 - INFO - __main__ - Step 39176: {'lr': 0.0004261188010140712, 'samples': 7521792, 'steps': 39175, 'loss/train': 1.7844562530517578} 11/07/2021 02:44:19 - INFO - __main__ - Step 39177: {'lr': 0.00042611503462615266, 'samples': 7521984, 'steps': 39176, 'loss/train': 1.3058295249938965} 11/07/2021 02:44:19 - INFO - __main__ - Step 39178: {'lr': 0.0004261112681588793, 'samples': 7522176, 'steps': 39177, 'loss/train': 1.85269033908844} 11/07/2021 02:44:19 - INFO - __main__ - Step 39179: {'lr': 0.000426107501612253, 'samples': 7522368, 'steps': 39178, 'loss/train': 0.9109247922897339} 11/07/2021 02:44:20 - INFO - __main__ - Step 39180: {'lr': 0.0004261037349862753, 'samples': 7522560, 'steps': 39179, 'loss/train': 1.7155182361602783} 11/07/2021 02:44:21 - INFO - __main__ - Step 39181: {'lr': 0.000426099968280948, 'samples': 7522752, 'steps': 39180, 'loss/train': 1.9949593544006348} 11/07/2021 02:44:21 - INFO - __main__ - Step 39182: {'lr': 0.00042609620149627284, 'samples': 7522944, 'steps': 39181, 'loss/train': 1.5575292110443115} 11/07/2021 02:44:21 - INFO - __main__ - Step 39183: {'lr': 0.00042609243463225134, 'samples': 7523136, 'steps': 39182, 'loss/train': 1.2065694332122803} 11/07/2021 02:44:22 - INFO - __main__ - Step 39184: {'lr': 0.00042608866768888533, 'samples': 7523328, 'steps': 39183, 'loss/train': 1.510762333869934} 11/07/2021 02:44:22 - INFO - __main__ - Step 39185: {'lr': 0.0004260849006661765, 'samples': 7523520, 'steps': 39184, 'loss/train': 0.8770667314529419} 11/07/2021 02:44:23 - INFO - __main__ - Step 39186: {'lr': 0.0004260811335641266, 'samples': 7523712, 'steps': 39185, 'loss/train': 1.672101616859436} 11/07/2021 02:44:24 - INFO - __main__ - Step 39187: {'lr': 0.0004260773663827372, 'samples': 7523904, 'steps': 39186, 'loss/train': 1.276141881942749} 11/07/2021 02:44:24 - INFO - __main__ - Step 39188: {'lr': 0.00042607359912201004, 'samples': 7524096, 'steps': 39187, 'loss/train': 0.5991111397743225} 11/07/2021 02:44:24 - INFO - __main__ - Step 39189: {'lr': 0.0004260698317819468, 'samples': 7524288, 'steps': 39188, 'loss/train': 1.2625077962875366} 11/07/2021 02:44:25 - INFO - __main__ - Step 39190: {'lr': 0.00042606606436254926, 'samples': 7524480, 'steps': 39189, 'loss/train': 1.1217215061187744} 11/07/2021 02:44:26 - INFO - __main__ - Step 39191: {'lr': 0.000426062296863819, 'samples': 7524672, 'steps': 39190, 'loss/train': 1.4759544134140015} 11/07/2021 02:44:26 - INFO - __main__ - Step 39192: {'lr': 0.00042605852928575796, 'samples': 7524864, 'steps': 39191, 'loss/train': 1.1649805307388306} 11/07/2021 02:44:26 - INFO - __main__ - Step 39193: {'lr': 0.00042605476162836756, 'samples': 7525056, 'steps': 39192, 'loss/train': 0.11307892203330994} 11/07/2021 02:44:27 - INFO - __main__ - Step 39194: {'lr': 0.00042605099389164957, 'samples': 7525248, 'steps': 39193, 'loss/train': 1.0232011079788208} 11/07/2021 02:44:27 - INFO - __main__ - Step 39195: {'lr': 0.00042604722607560575, 'samples': 7525440, 'steps': 39194, 'loss/train': 1.8840378522872925} 11/07/2021 02:44:28 - INFO - __main__ - Step 39196: {'lr': 0.0004260434581802377, 'samples': 7525632, 'steps': 39195, 'loss/train': 1.325415015220642} 11/07/2021 02:44:29 - INFO - __main__ - Step 39197: {'lr': 0.0004260396902055473, 'samples': 7525824, 'steps': 39196, 'loss/train': 1.6330317258834839} 11/07/2021 02:44:29 - INFO - __main__ - Step 39198: {'lr': 0.0004260359221515361, 'samples': 7526016, 'steps': 39197, 'loss/train': 2.978104829788208} 11/07/2021 02:44:29 - INFO - __main__ - Step 39199: {'lr': 0.0004260321540182057, 'samples': 7526208, 'steps': 39198, 'loss/train': 1.3887532949447632} 11/07/2021 02:44:30 - INFO - __main__ - Step 39200: {'lr': 0.00042602838580555814, 'samples': 7526400, 'steps': 39199, 'loss/train': 1.367277979850769} 11/07/2021 02:44:31 - INFO - __main__ - Step 39201: {'lr': 0.0004260246175135948, 'samples': 7526592, 'steps': 39200, 'loss/train': 1.5049934387207031} 11/07/2021 02:44:31 - INFO - __main__ - Step 39202: {'lr': 0.00042602084914231743, 'samples': 7526784, 'steps': 39201, 'loss/train': 2.2483091354370117} 11/07/2021 02:44:31 - INFO - __main__ - Step 39203: {'lr': 0.0004260170806917278, 'samples': 7526976, 'steps': 39202, 'loss/train': 1.7197997570037842} 11/07/2021 02:44:32 - INFO - __main__ - Step 39204: {'lr': 0.0004260133121618276, 'samples': 7527168, 'steps': 39203, 'loss/train': 1.5005837678909302} 11/07/2021 02:44:32 - INFO - __main__ - Step 39205: {'lr': 0.0004260095435526186, 'samples': 7527360, 'steps': 39204, 'loss/train': 1.5221214294433594} 11/07/2021 02:44:32 - INFO - __main__ - Step 39206: {'lr': 0.0004260057748641024, 'samples': 7527552, 'steps': 39205, 'loss/train': 1.5341522693634033} 11/07/2021 02:44:33 - INFO - __main__ - Step 39207: {'lr': 0.00042600200609628063, 'samples': 7527744, 'steps': 39206, 'loss/train': 1.2754578590393066} 11/07/2021 02:44:34 - INFO - __main__ - Step 39208: {'lr': 0.0004259982372491551, 'samples': 7527936, 'steps': 39207, 'loss/train': 1.186719536781311} 11/07/2021 02:44:34 - INFO - __main__ - Step 39209: {'lr': 0.00042599446832272746, 'samples': 7528128, 'steps': 39208, 'loss/train': 1.3470895290374756} 11/07/2021 02:44:34 - INFO - __main__ - Step 39210: {'lr': 0.0004259906993169995, 'samples': 7528320, 'steps': 39209, 'loss/train': 1.8728010654449463} 11/07/2021 02:44:35 - INFO - __main__ - Step 39211: {'lr': 0.00042598693023197283, 'samples': 7528512, 'steps': 39210, 'loss/train': 1.5531646013259888} 11/07/2021 02:44:36 - INFO - __main__ - Step 39212: {'lr': 0.00042598316106764913, 'samples': 7528704, 'steps': 39211, 'loss/train': 1.987898588180542} 11/07/2021 02:44:36 - INFO - __main__ - Step 39213: {'lr': 0.0004259793918240302, 'samples': 7528896, 'steps': 39212, 'loss/train': 1.207327127456665} 11/07/2021 02:44:37 - INFO - __main__ - Step 39214: {'lr': 0.00042597562250111753, 'samples': 7529088, 'steps': 39213, 'loss/train': 1.4415658712387085} 11/07/2021 02:44:37 - INFO - __main__ - Step 39215: {'lr': 0.00042597185309891305, 'samples': 7529280, 'steps': 39214, 'loss/train': 1.9070416688919067} 11/07/2021 02:44:37 - INFO - __main__ - Step 39216: {'lr': 0.0004259680836174184, 'samples': 7529472, 'steps': 39215, 'loss/train': 1.5507830381393433} 11/07/2021 02:44:39 - INFO - __main__ - Step 39217: {'lr': 0.0004259643140566352, 'samples': 7529664, 'steps': 39216, 'loss/train': 0.5437787175178528} 11/07/2021 02:44:40 - INFO - __main__ - Step 39218: {'lr': 0.0004259605444165652, 'samples': 7529856, 'steps': 39217, 'loss/train': 1.7301392555236816} 11/07/2021 02:44:40 - INFO - __main__ - Step 39219: {'lr': 0.0004259567746972101, 'samples': 7530048, 'steps': 39218, 'loss/train': 1.3682011365890503} 11/07/2021 02:44:40 - INFO - __main__ - Step 39220: {'lr': 0.00042595300489857164, 'samples': 7530240, 'steps': 39219, 'loss/train': 0.709819495677948} 11/07/2021 02:44:41 - INFO - __main__ - Step 39221: {'lr': 0.0004259492350206514, 'samples': 7530432, 'steps': 39220, 'loss/train': 0.7055956721305847} 11/07/2021 02:44:41 - INFO - __main__ - Step 39222: {'lr': 0.00042594546506345124, 'samples': 7530624, 'steps': 39221, 'loss/train': 0.7826408743858337} 11/07/2021 02:44:41 - INFO - __main__ - Step 39223: {'lr': 0.00042594169502697265, 'samples': 7530816, 'steps': 39222, 'loss/train': 1.359542727470398} 11/07/2021 02:44:42 - INFO - __main__ - Step 39224: {'lr': 0.00042593792491121753, 'samples': 7531008, 'steps': 39223, 'loss/train': 1.3976107835769653} 11/07/2021 02:44:43 - INFO - __main__ - Step 39225: {'lr': 0.00042593415471618744, 'samples': 7531200, 'steps': 39224, 'loss/train': 1.443656086921692} 11/07/2021 02:44:43 - INFO - __main__ - Step 39226: {'lr': 0.0004259303844418841, 'samples': 7531392, 'steps': 39225, 'loss/train': 1.6289358139038086} 11/07/2021 02:44:43 - INFO - __main__ - Step 39227: {'lr': 0.00042592661408830937, 'samples': 7531584, 'steps': 39226, 'loss/train': 1.7915257215499878} 11/07/2021 02:44:44 - INFO - __main__ - Step 39228: {'lr': 0.00042592284365546474, 'samples': 7531776, 'steps': 39227, 'loss/train': 1.7885856628417969} 11/07/2021 02:44:45 - INFO - __main__ - Step 39229: {'lr': 0.00042591907314335197, 'samples': 7531968, 'steps': 39228, 'loss/train': 1.8358139991760254} 11/07/2021 02:44:45 - INFO - __main__ - Step 39230: {'lr': 0.00042591530255197286, 'samples': 7532160, 'steps': 39229, 'loss/train': 1.45115327835083} 11/07/2021 02:44:45 - INFO - __main__ - Step 39231: {'lr': 0.00042591153188132903, 'samples': 7532352, 'steps': 39230, 'loss/train': 1.6204403638839722} 11/07/2021 02:44:46 - INFO - __main__ - Step 39232: {'lr': 0.00042590776113142216, 'samples': 7532544, 'steps': 39231, 'loss/train': 1.5731817483901978} 11/07/2021 02:44:46 - INFO - __main__ - Step 39233: {'lr': 0.00042590399030225393, 'samples': 7532736, 'steps': 39232, 'loss/train': 1.7579594850540161} 11/07/2021 02:44:48 - INFO - __main__ - Step 39234: {'lr': 0.0004259002193938261, 'samples': 7532928, 'steps': 39233, 'loss/train': 2.0346720218658447} 11/07/2021 02:44:48 - INFO - __main__ - Step 39235: {'lr': 0.0004258964484061403, 'samples': 7533120, 'steps': 39234, 'loss/train': 1.7093135118484497} 11/07/2021 02:44:49 - INFO - __main__ - Step 39236: {'lr': 0.00042589267733919833, 'samples': 7533312, 'steps': 39235, 'loss/train': 1.5987120866775513} 11/07/2021 02:44:49 - INFO - __main__ - Step 39237: {'lr': 0.0004258889061930018, 'samples': 7533504, 'steps': 39236, 'loss/train': 1.6044750213623047} 11/07/2021 02:44:50 - INFO - __main__ - Step 39238: {'lr': 0.0004258851349675524, 'samples': 7533696, 'steps': 39237, 'loss/train': 1.6669312715530396} 11/07/2021 02:44:50 - INFO - __main__ - Step 39239: {'lr': 0.00042588136366285197, 'samples': 7533888, 'steps': 39238, 'loss/train': 1.5280288457870483} 11/07/2021 02:44:50 - INFO - __main__ - Step 39240: {'lr': 0.0004258775922789021, 'samples': 7534080, 'steps': 39239, 'loss/train': 1.1162550449371338} 11/07/2021 02:44:51 - INFO - __main__ - Step 39241: {'lr': 0.0004258738208157045, 'samples': 7534272, 'steps': 39240, 'loss/train': 1.8857530355453491} 11/07/2021 02:44:52 - INFO - __main__ - Step 39242: {'lr': 0.0004258700492732608, 'samples': 7534464, 'steps': 39241, 'loss/train': 1.784043312072754} 11/07/2021 02:44:52 - INFO - __main__ - Step 39243: {'lr': 0.0004258662776515728, 'samples': 7534656, 'steps': 39242, 'loss/train': 1.8259460926055908} 11/07/2021 02:44:52 - INFO - __main__ - Step 39244: {'lr': 0.00042586250595064216, 'samples': 7534848, 'steps': 39243, 'loss/train': 1.4832545518875122} 11/07/2021 02:44:53 - INFO - __main__ - Step 39245: {'lr': 0.0004258587341704706, 'samples': 7535040, 'steps': 39244, 'loss/train': 1.3301485776901245} 11/07/2021 02:44:53 - INFO - __main__ - Step 39246: {'lr': 0.00042585496231105986, 'samples': 7535232, 'steps': 39245, 'loss/train': 0.8189593553543091} 11/07/2021 02:44:54 - INFO - __main__ - Step 39247: {'lr': 0.00042585119037241156, 'samples': 7535424, 'steps': 39246, 'loss/train': 1.3747432231903076} 11/07/2021 02:44:54 - INFO - __main__ - Step 39248: {'lr': 0.00042584741835452743, 'samples': 7535616, 'steps': 39247, 'loss/train': 1.4216407537460327} 11/07/2021 02:44:55 - INFO - __main__ - Step 39249: {'lr': 0.0004258436462574091, 'samples': 7535808, 'steps': 39248, 'loss/train': 1.5498663187026978} 11/07/2021 02:44:55 - INFO - __main__ - Step 39250: {'lr': 0.0004258398740810584, 'samples': 7536000, 'steps': 39249, 'loss/train': 1.7944103479385376} 11/07/2021 02:44:55 - INFO - __main__ - Step 39251: {'lr': 0.00042583610182547694, 'samples': 7536192, 'steps': 39250, 'loss/train': 1.286428451538086} 11/07/2021 02:44:56 - INFO - __main__ - Step 39252: {'lr': 0.0004258323294906665, 'samples': 7536384, 'steps': 39251, 'loss/train': 1.1772104501724243} 11/07/2021 02:44:57 - INFO - __main__ - Step 39253: {'lr': 0.00042582855707662864, 'samples': 7536576, 'steps': 39252, 'loss/train': 1.6651321649551392} 11/07/2021 02:44:57 - INFO - __main__ - Step 39254: {'lr': 0.00042582478458336523, 'samples': 7536768, 'steps': 39253, 'loss/train': 1.6486400365829468} 11/07/2021 02:44:57 - INFO - __main__ - Step 39255: {'lr': 0.00042582101201087786, 'samples': 7536960, 'steps': 39254, 'loss/train': 1.1267701387405396} 11/07/2021 02:44:58 - INFO - __main__ - Step 39256: {'lr': 0.00042581723935916817, 'samples': 7537152, 'steps': 39255, 'loss/train': 1.9030141830444336} 11/07/2021 02:44:59 - INFO - __main__ - Step 39257: {'lr': 0.00042581346662823804, 'samples': 7537344, 'steps': 39256, 'loss/train': 1.0102065801620483} 11/07/2021 02:44:59 - INFO - __main__ - Step 39258: {'lr': 0.00042580969381808906, 'samples': 7537536, 'steps': 39257, 'loss/train': 1.8677732944488525} 11/07/2021 02:45:00 - INFO - __main__ - Step 39259: {'lr': 0.00042580592092872295, 'samples': 7537728, 'steps': 39258, 'loss/train': 1.4082847833633423} 11/07/2021 02:45:00 - INFO - __main__ - Step 39260: {'lr': 0.0004258021479601414, 'samples': 7537920, 'steps': 39259, 'loss/train': 1.3875758647918701} 11/07/2021 02:45:00 - INFO - __main__ - Step 39261: {'lr': 0.0004257983749123461, 'samples': 7538112, 'steps': 39260, 'loss/train': 1.4950883388519287} 11/07/2021 02:45:01 - INFO - __main__ - Step 39262: {'lr': 0.00042579460178533875, 'samples': 7538304, 'steps': 39261, 'loss/train': 1.6318151950836182} 11/07/2021 02:45:02 - INFO - __main__ - Step 39263: {'lr': 0.0004257908285791211, 'samples': 7538496, 'steps': 39262, 'loss/train': 1.4447813034057617} 11/07/2021 02:45:02 - INFO - __main__ - Step 39264: {'lr': 0.00042578705529369476, 'samples': 7538688, 'steps': 39263, 'loss/train': 0.8829726576805115} 11/07/2021 02:45:03 - INFO - __main__ - Step 39265: {'lr': 0.00042578328192906153, 'samples': 7538880, 'steps': 39264, 'loss/train': 1.7114918231964111} 11/07/2021 02:45:03 - INFO - __main__ - Step 39266: {'lr': 0.00042577950848522305, 'samples': 7539072, 'steps': 39265, 'loss/train': 0.7462561130523682} 11/07/2021 02:45:03 - INFO - __main__ - Step 39267: {'lr': 0.0004257757349621811, 'samples': 7539264, 'steps': 39266, 'loss/train': 1.6467303037643433} 11/07/2021 02:45:04 - INFO - __main__ - Step 39268: {'lr': 0.0004257719613599372, 'samples': 7539456, 'steps': 39267, 'loss/train': 1.3126444816589355} 11/07/2021 02:45:05 - INFO - __main__ - Step 39269: {'lr': 0.0004257681876784932, 'samples': 7539648, 'steps': 39268, 'loss/train': 1.579737663269043} 11/07/2021 02:45:05 - INFO - __main__ - Step 39270: {'lr': 0.0004257644139178508, 'samples': 7539840, 'steps': 39269, 'loss/train': 1.1076663732528687} 11/07/2021 02:45:05 - INFO - __main__ - Step 39271: {'lr': 0.0004257606400780117, 'samples': 7540032, 'steps': 39270, 'loss/train': 1.5877642631530762} 11/07/2021 02:45:06 - INFO - __main__ - Step 39272: {'lr': 0.0004257568661589775, 'samples': 7540224, 'steps': 39271, 'loss/train': 1.1899365186691284} 11/07/2021 02:45:07 - INFO - __main__ - Step 39273: {'lr': 0.00042575309216074997, 'samples': 7540416, 'steps': 39272, 'loss/train': 1.7019377946853638} 11/07/2021 02:45:07 - INFO - __main__ - Step 39274: {'lr': 0.00042574931808333095, 'samples': 7540608, 'steps': 39273, 'loss/train': 1.2056797742843628} 11/07/2021 02:45:07 - INFO - __main__ - Step 39275: {'lr': 0.0004257455439267218, 'samples': 7540800, 'steps': 39274, 'loss/train': 1.438101053237915} 11/07/2021 02:45:08 - INFO - __main__ - Step 39276: {'lr': 0.00042574176969092454, 'samples': 7540992, 'steps': 39275, 'loss/train': 1.626620888710022} 11/07/2021 02:45:08 - INFO - __main__ - Step 39277: {'lr': 0.0004257379953759407, 'samples': 7541184, 'steps': 39276, 'loss/train': 1.5642321109771729} 11/07/2021 02:45:09 - INFO - __main__ - Step 39278: {'lr': 0.00042573422098177204, 'samples': 7541376, 'steps': 39277, 'loss/train': 1.5449275970458984} 11/07/2021 02:45:09 - INFO - __main__ - Step 39279: {'lr': 0.0004257304465084203, 'samples': 7541568, 'steps': 39278, 'loss/train': 1.7904508113861084} 11/07/2021 02:45:10 - INFO - __main__ - Step 39280: {'lr': 0.0004257266719558871, 'samples': 7541760, 'steps': 39279, 'loss/train': 1.0784425735473633} 11/07/2021 02:45:10 - INFO - __main__ - Step 39281: {'lr': 0.0004257228973241741, 'samples': 7541952, 'steps': 39280, 'loss/train': 1.1655843257904053} 11/07/2021 02:45:10 - INFO - __main__ - Step 39282: {'lr': 0.00042571912261328315, 'samples': 7542144, 'steps': 39281, 'loss/train': 1.0028207302093506} 11/07/2021 02:45:11 - INFO - __main__ - Step 39283: {'lr': 0.00042571534782321593, 'samples': 7542336, 'steps': 39282, 'loss/train': 1.4070467948913574} 11/07/2021 02:45:12 - INFO - __main__ - Step 39284: {'lr': 0.000425711572953974, 'samples': 7542528, 'steps': 39283, 'loss/train': 1.9504486322402954} 11/07/2021 02:45:12 - INFO - __main__ - Step 39285: {'lr': 0.00042570779800555914, 'samples': 7542720, 'steps': 39284, 'loss/train': 1.4157495498657227} 11/07/2021 02:45:13 - INFO - __main__ - Step 39286: {'lr': 0.00042570402297797304, 'samples': 7542912, 'steps': 39285, 'loss/train': 1.3681554794311523} 11/07/2021 02:45:13 - INFO - __main__ - Step 39287: {'lr': 0.0004257002478712175, 'samples': 7543104, 'steps': 39286, 'loss/train': 1.1086269617080688} 11/07/2021 02:45:13 - INFO - __main__ - Step 39288: {'lr': 0.0004256964726852941, 'samples': 7543296, 'steps': 39287, 'loss/train': 0.5984827280044556} 11/07/2021 02:45:14 - INFO - __main__ - Step 39289: {'lr': 0.0004256926974202046, 'samples': 7543488, 'steps': 39288, 'loss/train': 1.4696334600448608} 11/07/2021 02:45:15 - INFO - __main__ - Step 39290: {'lr': 0.00042568892207595066, 'samples': 7543680, 'steps': 39289, 'loss/train': 1.3621699810028076} 11/07/2021 02:45:15 - INFO - __main__ - Step 39291: {'lr': 0.000425685146652534, 'samples': 7543872, 'steps': 39290, 'loss/train': 1.0785452127456665} 11/07/2021 02:45:16 - INFO - __main__ - Step 39292: {'lr': 0.00042568137114995633, 'samples': 7544064, 'steps': 39291, 'loss/train': 1.7251018285751343} 11/07/2021 02:45:16 - INFO - __main__ - Step 39293: {'lr': 0.00042567759556821937, 'samples': 7544256, 'steps': 39292, 'loss/train': 1.6907563209533691} 11/07/2021 02:45:17 - INFO - __main__ - Step 39294: {'lr': 0.00042567381990732476, 'samples': 7544448, 'steps': 39293, 'loss/train': 1.4064091444015503} 11/07/2021 02:45:17 - INFO - __main__ - Step 39295: {'lr': 0.0004256700441672743, 'samples': 7544640, 'steps': 39294, 'loss/train': 1.557332992553711} 11/07/2021 02:45:18 - INFO - __main__ - Step 39296: {'lr': 0.0004256662683480695, 'samples': 7544832, 'steps': 39295, 'loss/train': 1.9215407371520996} 11/07/2021 02:45:18 - INFO - __main__ - Step 39297: {'lr': 0.00042566249244971235, 'samples': 7545024, 'steps': 39296, 'loss/train': 1.4755570888519287} 11/07/2021 02:45:18 - INFO - __main__ - Step 39298: {'lr': 0.0004256587164722043, 'samples': 7545216, 'steps': 39297, 'loss/train': 0.8738113045692444} 11/07/2021 02:45:19 - INFO - __main__ - Step 39299: {'lr': 0.0004256549404155471, 'samples': 7545408, 'steps': 39298, 'loss/train': 1.4364068508148193} 11/07/2021 02:45:20 - INFO - __main__ - Step 39300: {'lr': 0.0004256511642797426, 'samples': 7545600, 'steps': 39299, 'loss/train': 1.5300726890563965} 11/07/2021 02:45:20 - INFO - __main__ - Step 39301: {'lr': 0.0004256473880647923, 'samples': 7545792, 'steps': 39300, 'loss/train': 0.16604219377040863} 11/07/2021 02:45:20 - INFO - __main__ - Step 39302: {'lr': 0.0004256436117706981, 'samples': 7545984, 'steps': 39301, 'loss/train': 0.8302720189094543} 11/07/2021 02:45:21 - INFO - __main__ - Step 39303: {'lr': 0.0004256398353974615, 'samples': 7546176, 'steps': 39302, 'loss/train': 0.9688646197319031} 11/07/2021 02:45:22 - INFO - __main__ - Step 39304: {'lr': 0.00042563605894508434, 'samples': 7546368, 'steps': 39303, 'loss/train': 1.3211116790771484} 11/07/2021 02:45:22 - INFO - __main__ - Step 39305: {'lr': 0.00042563228241356834, 'samples': 7546560, 'steps': 39304, 'loss/train': 2.016324520111084} 11/07/2021 02:45:22 - INFO - __main__ - Step 39306: {'lr': 0.000425628505802915, 'samples': 7546752, 'steps': 39305, 'loss/train': 0.9035109877586365} 11/07/2021 02:45:23 - INFO - __main__ - Step 39307: {'lr': 0.0004256247291131263, 'samples': 7546944, 'steps': 39306, 'loss/train': 1.6596336364746094} 11/07/2021 02:45:23 - INFO - __main__ - Step 39308: {'lr': 0.00042562095234420375, 'samples': 7547136, 'steps': 39307, 'loss/train': 1.361107349395752} 11/07/2021 02:45:24 - INFO - __main__ - Step 39309: {'lr': 0.00042561717549614907, 'samples': 7547328, 'steps': 39308, 'loss/train': 1.4581472873687744} 11/07/2021 02:45:24 - INFO - __main__ - Step 39310: {'lr': 0.0004256133985689641, 'samples': 7547520, 'steps': 39309, 'loss/train': 1.205946445465088} 11/07/2021 02:45:25 - INFO - __main__ - Step 39311: {'lr': 0.0004256096215626504, 'samples': 7547712, 'steps': 39310, 'loss/train': 1.3667376041412354} 11/07/2021 02:45:25 - INFO - __main__ - Step 39312: {'lr': 0.0004256058444772097, 'samples': 7547904, 'steps': 39311, 'loss/train': 1.536171317100525} 11/07/2021 02:45:25 - INFO - __main__ - Step 39313: {'lr': 0.0004256020673126437, 'samples': 7548096, 'steps': 39312, 'loss/train': 1.3614702224731445} 11/07/2021 02:45:27 - INFO - __main__ - Step 39314: {'lr': 0.0004255982900689541, 'samples': 7548288, 'steps': 39313, 'loss/train': 1.150659441947937} 11/07/2021 02:45:27 - INFO - __main__ - Step 39315: {'lr': 0.0004255945127461427, 'samples': 7548480, 'steps': 39314, 'loss/train': 1.6493338346481323} 11/07/2021 02:45:27 - INFO - __main__ - Step 39316: {'lr': 0.00042559073534421114, 'samples': 7548672, 'steps': 39315, 'loss/train': 1.3953452110290527} 11/07/2021 02:45:28 - INFO - __main__ - Step 39317: {'lr': 0.00042558695786316106, 'samples': 7548864, 'steps': 39316, 'loss/train': 1.6797736883163452} 11/07/2021 02:45:28 - INFO - __main__ - Step 39318: {'lr': 0.00042558318030299415, 'samples': 7549056, 'steps': 39317, 'loss/train': 1.3615410327911377} 11/07/2021 02:45:28 - INFO - __main__ - Step 39319: {'lr': 0.0004255794026637122, 'samples': 7549248, 'steps': 39318, 'loss/train': 3.0827596187591553} 11/07/2021 02:45:29 - INFO - __main__ - Step 39320: {'lr': 0.0004255756249453169, 'samples': 7549440, 'steps': 39319, 'loss/train': 1.328766107559204} 11/07/2021 02:45:30 - INFO - __main__ - Step 39321: {'lr': 0.00042557184714780993, 'samples': 7549632, 'steps': 39320, 'loss/train': 1.630851149559021} 11/07/2021 02:45:30 - INFO - __main__ - Step 39322: {'lr': 0.000425568069271193, 'samples': 7549824, 'steps': 39321, 'loss/train': 1.5530668497085571} 11/07/2021 02:45:30 - INFO - __main__ - Step 39323: {'lr': 0.00042556429131546775, 'samples': 7550016, 'steps': 39322, 'loss/train': 1.649032711982727} 11/07/2021 02:45:31 - INFO - __main__ - Step 39324: {'lr': 0.000425560513280636, 'samples': 7550208, 'steps': 39323, 'loss/train': 1.1001230478286743} 11/07/2021 02:45:32 - INFO - __main__ - Step 39325: {'lr': 0.00042555673516669933, 'samples': 7550400, 'steps': 39324, 'loss/train': 1.2240439653396606} 11/07/2021 02:45:32 - INFO - __main__ - Step 39326: {'lr': 0.0004255529569736596, 'samples': 7550592, 'steps': 39325, 'loss/train': 1.6975648403167725} 11/07/2021 02:45:33 - INFO - __main__ - Step 39327: {'lr': 0.0004255491787015183, 'samples': 7550784, 'steps': 39326, 'loss/train': 1.6072126626968384} 11/07/2021 02:45:33 - INFO - __main__ - Step 39328: {'lr': 0.0004255454003502774, 'samples': 7550976, 'steps': 39327, 'loss/train': 1.4416260719299316} 11/07/2021 02:45:33 - INFO - __main__ - Step 39329: {'lr': 0.0004255416219199384, 'samples': 7551168, 'steps': 39328, 'loss/train': 1.4229910373687744} 11/07/2021 02:45:34 - INFO - __main__ - Step 39330: {'lr': 0.0004255378434105029, 'samples': 7551360, 'steps': 39329, 'loss/train': 0.758528470993042} 11/07/2021 02:45:35 - INFO - __main__ - Step 39331: {'lr': 0.00042553406482197297, 'samples': 7551552, 'steps': 39330, 'loss/train': 2.1144182682037354} 11/07/2021 02:45:35 - INFO - __main__ - Step 39332: {'lr': 0.00042553028615434997, 'samples': 7551744, 'steps': 39331, 'loss/train': 1.465710997581482} 11/07/2021 02:45:35 - INFO - __main__ - Step 39333: {'lr': 0.0004255265074076358, 'samples': 7551936, 'steps': 39332, 'loss/train': 0.8778195381164551} 11/07/2021 02:45:36 - INFO - __main__ - Step 39334: {'lr': 0.00042552272858183203, 'samples': 7552128, 'steps': 39333, 'loss/train': 1.2574025392532349} 11/07/2021 02:45:37 - INFO - __main__ - Step 39335: {'lr': 0.0004255189496769405, 'samples': 7552320, 'steps': 39334, 'loss/train': 1.6069217920303345} 11/07/2021 02:45:38 - INFO - __main__ - Step 39336: {'lr': 0.00042551517069296276, 'samples': 7552512, 'steps': 39335, 'loss/train': 1.5512158870697021} 11/07/2021 02:45:38 - INFO - __main__ - Step 39337: {'lr': 0.00042551139162990065, 'samples': 7552704, 'steps': 39336, 'loss/train': 1.6924312114715576} 11/07/2021 02:45:38 - INFO - __main__ - Step 39338: {'lr': 0.0004255076124877558, 'samples': 7552896, 'steps': 39337, 'loss/train': 1.810476541519165} 11/07/2021 02:45:39 - INFO - __main__ - Step 39339: {'lr': 0.0004255038332665299, 'samples': 7553088, 'steps': 39338, 'loss/train': 1.5786186456680298} 11/07/2021 02:45:39 - INFO - __main__ - Step 39340: {'lr': 0.0004255000539662247, 'samples': 7553280, 'steps': 39339, 'loss/train': 1.684509038925171} 11/07/2021 02:45:40 - INFO - __main__ - Step 39341: {'lr': 0.0004254962745868419, 'samples': 7553472, 'steps': 39340, 'loss/train': 0.9445256590843201} 11/07/2021 02:45:40 - INFO - __main__ - Step 39342: {'lr': 0.00042549249512838325, 'samples': 7553664, 'steps': 39341, 'loss/train': 1.8878004550933838} 11/07/2021 02:45:41 - INFO - __main__ - Step 39343: {'lr': 0.00042548871559085026, 'samples': 7553856, 'steps': 39342, 'loss/train': 1.388020634651184} 11/07/2021 02:45:41 - INFO - __main__ - Step 39344: {'lr': 0.0004254849359742449, 'samples': 7554048, 'steps': 39343, 'loss/train': 1.0768413543701172} 11/07/2021 02:45:41 - INFO - __main__ - Step 39345: {'lr': 0.0004254811562785686, 'samples': 7554240, 'steps': 39344, 'loss/train': 1.4457755088806152} 11/07/2021 02:45:42 - INFO - __main__ - Step 39346: {'lr': 0.00042547737650382324, 'samples': 7554432, 'steps': 39345, 'loss/train': 1.4694263935089111} 11/07/2021 02:45:43 - INFO - __main__ - Step 39347: {'lr': 0.0004254735966500105, 'samples': 7554624, 'steps': 39346, 'loss/train': 1.3930296897888184} 11/07/2021 02:45:43 - INFO - __main__ - Step 39348: {'lr': 0.00042546981671713206, 'samples': 7554816, 'steps': 39347, 'loss/train': 0.8317453265190125} 11/07/2021 02:45:44 - INFO - __main__ - Step 39349: {'lr': 0.0004254660367051896, 'samples': 7555008, 'steps': 39348, 'loss/train': 1.5087776184082031} 11/07/2021 02:45:44 - INFO - __main__ - Step 39350: {'lr': 0.0004254622566141849, 'samples': 7555200, 'steps': 39349, 'loss/train': 1.4444903135299683} 11/07/2021 02:45:44 - INFO - __main__ - Step 39351: {'lr': 0.0004254584764441196, 'samples': 7555392, 'steps': 39350, 'loss/train': 1.4392361640930176} 11/07/2021 02:45:45 - INFO - __main__ - Step 39352: {'lr': 0.00042545469619499545, 'samples': 7555584, 'steps': 39351, 'loss/train': 1.4138058423995972} 11/07/2021 02:45:46 - INFO - __main__ - Step 39353: {'lr': 0.00042545091586681404, 'samples': 7555776, 'steps': 39352, 'loss/train': 1.6890608072280884} 11/07/2021 02:45:46 - INFO - __main__ - Step 39354: {'lr': 0.0004254471354595772, 'samples': 7555968, 'steps': 39353, 'loss/train': 1.4956886768341064} 11/07/2021 02:45:46 - INFO - __main__ - Step 39355: {'lr': 0.0004254433549732866, 'samples': 7556160, 'steps': 39354, 'loss/train': 1.7300609350204468} 11/07/2021 02:45:47 - INFO - __main__ - Step 39356: {'lr': 0.0004254395744079439, 'samples': 7556352, 'steps': 39355, 'loss/train': 1.515649676322937} 11/07/2021 02:45:48 - INFO - __main__ - Step 39357: {'lr': 0.0004254357937635509, 'samples': 7556544, 'steps': 39356, 'loss/train': 1.8868955373764038} 11/07/2021 02:45:48 - INFO - __main__ - Step 39358: {'lr': 0.00042543201304010914, 'samples': 7556736, 'steps': 39357, 'loss/train': 0.9736785292625427} 11/07/2021 02:45:48 - INFO - __main__ - Step 39359: {'lr': 0.0004254282322376205, 'samples': 7556928, 'steps': 39358, 'loss/train': 0.7911121845245361} 11/07/2021 02:45:49 - INFO - __main__ - Step 39360: {'lr': 0.0004254244513560866, 'samples': 7557120, 'steps': 39359, 'loss/train': 1.9326436519622803} 11/07/2021 02:45:49 - INFO - __main__ - Step 39361: {'lr': 0.00042542067039550916, 'samples': 7557312, 'steps': 39360, 'loss/train': 1.9013553857803345} 11/07/2021 02:45:50 - INFO - __main__ - Step 39362: {'lr': 0.00042541688935588984, 'samples': 7557504, 'steps': 39361, 'loss/train': 0.8181159496307373} 11/07/2021 02:45:50 - INFO - __main__ - Step 39363: {'lr': 0.00042541310823723035, 'samples': 7557696, 'steps': 39362, 'loss/train': 1.1350655555725098} 11/07/2021 02:45:51 - INFO - __main__ - Step 39364: {'lr': 0.00042540932703953246, 'samples': 7557888, 'steps': 39363, 'loss/train': 1.395199179649353} 11/07/2021 02:45:51 - INFO - __main__ - Step 39365: {'lr': 0.00042540554576279776, 'samples': 7558080, 'steps': 39364, 'loss/train': 1.6894145011901855} 11/07/2021 02:45:51 - INFO - __main__ - Step 39366: {'lr': 0.0004254017644070282, 'samples': 7558272, 'steps': 39365, 'loss/train': 1.7289868593215942} 11/07/2021 02:45:53 - INFO - __main__ - Step 39367: {'lr': 0.0004253979829722251, 'samples': 7558464, 'steps': 39366, 'loss/train': 1.5356981754302979} 11/07/2021 02:45:53 - INFO - __main__ - Step 39368: {'lr': 0.00042539420145839055, 'samples': 7558656, 'steps': 39367, 'loss/train': 0.863486111164093} 11/07/2021 02:45:53 - INFO - __main__ - Step 39369: {'lr': 0.00042539041986552596, 'samples': 7558848, 'steps': 39368, 'loss/train': 1.0760917663574219} 11/07/2021 02:45:54 - INFO - __main__ - Step 39370: {'lr': 0.00042538663819363323, 'samples': 7559040, 'steps': 39369, 'loss/train': 1.4790570735931396} 11/07/2021 02:45:54 - INFO - __main__ - Step 39371: {'lr': 0.000425382856442714, 'samples': 7559232, 'steps': 39370, 'loss/train': 1.5317871570587158} 11/07/2021 02:45:54 - INFO - __main__ - Step 39372: {'lr': 0.0004253790746127699, 'samples': 7559424, 'steps': 39371, 'loss/train': 1.7790511846542358} 11/07/2021 02:45:55 - INFO - __main__ - Step 39373: {'lr': 0.0004253752927038027, 'samples': 7559616, 'steps': 39372, 'loss/train': 1.676537036895752} 11/07/2021 02:45:56 - INFO - __main__ - Step 39374: {'lr': 0.0004253715107158141, 'samples': 7559808, 'steps': 39373, 'loss/train': 1.566117286682129} 11/07/2021 02:45:56 - INFO - __main__ - Step 39375: {'lr': 0.0004253677286488058, 'samples': 7560000, 'steps': 39374, 'loss/train': 1.004982352256775} 11/07/2021 02:45:56 - INFO - __main__ - Step 39376: {'lr': 0.00042536394650277953, 'samples': 7560192, 'steps': 39375, 'loss/train': 1.3873460292816162} 11/07/2021 02:45:57 - INFO - __main__ - Step 39377: {'lr': 0.000425360164277737, 'samples': 7560384, 'steps': 39376, 'loss/train': 1.6033751964569092} 11/07/2021 02:45:58 - INFO - __main__ - Step 39378: {'lr': 0.00042535638197367984, 'samples': 7560576, 'steps': 39377, 'loss/train': 1.3536235094070435} 11/07/2021 02:45:58 - INFO - __main__ - Step 39379: {'lr': 0.0004253525995906098, 'samples': 7560768, 'steps': 39378, 'loss/train': 1.5641216039657593} 11/07/2021 02:45:58 - INFO - __main__ - Step 39380: {'lr': 0.00042534881712852856, 'samples': 7560960, 'steps': 39379, 'loss/train': 1.5420782566070557} 11/07/2021 02:45:59 - INFO - __main__ - Step 39381: {'lr': 0.0004253450345874379, 'samples': 7561152, 'steps': 39380, 'loss/train': 1.4883129596710205} 11/07/2021 02:45:59 - INFO - __main__ - Step 39382: {'lr': 0.00042534125196733955, 'samples': 7561344, 'steps': 39381, 'loss/train': 1.754917860031128} 11/07/2021 02:46:00 - INFO - __main__ - Step 39383: {'lr': 0.000425337469268235, 'samples': 7561536, 'steps': 39382, 'loss/train': 1.8359870910644531} 11/07/2021 02:46:01 - INFO - __main__ - Step 39384: {'lr': 0.00042533368649012615, 'samples': 7561728, 'steps': 39383, 'loss/train': 1.5123169422149658} 11/07/2021 02:46:01 - INFO - __main__ - Step 39385: {'lr': 0.0004253299036330146, 'samples': 7561920, 'steps': 39384, 'loss/train': 1.6789470911026} 11/07/2021 02:46:01 - INFO - __main__ - Step 39386: {'lr': 0.00042532612069690214, 'samples': 7562112, 'steps': 39385, 'loss/train': 1.331007719039917} 11/07/2021 02:46:02 - INFO - __main__ - Step 39387: {'lr': 0.0004253223376817904, 'samples': 7562304, 'steps': 39386, 'loss/train': 1.7500447034835815} 11/07/2021 02:46:03 - INFO - __main__ - Step 39388: {'lr': 0.0004253185545876812, 'samples': 7562496, 'steps': 39387, 'loss/train': 1.3423337936401367} 11/07/2021 02:46:03 - INFO - __main__ - Step 39389: {'lr': 0.0004253147714145761, 'samples': 7562688, 'steps': 39388, 'loss/train': 1.7974839210510254} 11/07/2021 02:46:03 - INFO - __main__ - Step 39390: {'lr': 0.00042531098816247695, 'samples': 7562880, 'steps': 39389, 'loss/train': 1.8890057802200317} 11/07/2021 02:46:04 - INFO - __main__ - Step 39391: {'lr': 0.00042530720483138524, 'samples': 7563072, 'steps': 39390, 'loss/train': 0.7283228635787964} 11/07/2021 02:46:04 - INFO - __main__ - Step 39392: {'lr': 0.00042530342142130283, 'samples': 7563264, 'steps': 39391, 'loss/train': 1.046333909034729} 11/07/2021 02:46:05 - INFO - __main__ - Step 39393: {'lr': 0.0004252996379322315, 'samples': 7563456, 'steps': 39392, 'loss/train': 1.3759727478027344} 11/07/2021 02:46:06 - INFO - __main__ - Step 39394: {'lr': 0.0004252958543641728, 'samples': 7563648, 'steps': 39393, 'loss/train': 1.343981146812439} 11/07/2021 02:46:06 - INFO - __main__ - Step 39395: {'lr': 0.0004252920707171285, 'samples': 7563840, 'steps': 39394, 'loss/train': 1.5883010625839233} 11/07/2021 02:46:06 - INFO - __main__ - Step 39396: {'lr': 0.00042528828699110033, 'samples': 7564032, 'steps': 39395, 'loss/train': 1.370940089225769} 11/07/2021 02:46:07 - INFO - __main__ - Step 39397: {'lr': 0.0004252845031860899, 'samples': 7564224, 'steps': 39396, 'loss/train': 1.7147272825241089} 11/07/2021 02:46:07 - INFO - __main__ - Step 39398: {'lr': 0.000425280719302099, 'samples': 7564416, 'steps': 39397, 'loss/train': 1.6413315534591675} 11/07/2021 02:46:08 - INFO - __main__ - Step 39399: {'lr': 0.0004252769353391294, 'samples': 7564608, 'steps': 39398, 'loss/train': 0.9206255078315735} 11/07/2021 02:46:08 - INFO - __main__ - Step 39400: {'lr': 0.00042527315129718257, 'samples': 7564800, 'steps': 39399, 'loss/train': 1.7017139196395874} 11/07/2021 02:46:09 - INFO - __main__ - Step 39401: {'lr': 0.00042526936717626046, 'samples': 7564992, 'steps': 39400, 'loss/train': 1.9394193887710571} 11/07/2021 02:46:09 - INFO - __main__ - Step 39402: {'lr': 0.00042526558297636464, 'samples': 7565184, 'steps': 39401, 'loss/train': 1.4708151817321777} 11/07/2021 02:46:09 - INFO - __main__ - Step 39403: {'lr': 0.0004252617986974969, 'samples': 7565376, 'steps': 39402, 'loss/train': 0.9733449816703796} 11/07/2021 02:46:10 - INFO - __main__ - Step 39404: {'lr': 0.00042525801433965883, 'samples': 7565568, 'steps': 39403, 'loss/train': 1.8433624505996704} 11/07/2021 02:46:11 - INFO - __main__ - Step 39405: {'lr': 0.00042525422990285225, 'samples': 7565760, 'steps': 39404, 'loss/train': 1.5334631204605103} 11/07/2021 02:46:11 - INFO - __main__ - Step 39406: {'lr': 0.0004252504453870788, 'samples': 7565952, 'steps': 39405, 'loss/train': 1.3392281532287598} 11/07/2021 02:46:11 - INFO - __main__ - Step 39407: {'lr': 0.0004252466607923402, 'samples': 7566144, 'steps': 39406, 'loss/train': 1.5809663534164429} 11/07/2021 02:46:12 - INFO - __main__ - Step 39408: {'lr': 0.0004252428761186382, 'samples': 7566336, 'steps': 39407, 'loss/train': 2.1655049324035645} 11/07/2021 02:46:13 - INFO - __main__ - Step 39409: {'lr': 0.0004252390913659744, 'samples': 7566528, 'steps': 39408, 'loss/train': 1.7455583810806274} 11/07/2021 02:46:13 - INFO - __main__ - Step 39410: {'lr': 0.0004252353065343506, 'samples': 7566720, 'steps': 39409, 'loss/train': 0.2890855371952057} 11/07/2021 02:46:14 - INFO - __main__ - Step 39411: {'lr': 0.0004252315216237684, 'samples': 7566912, 'steps': 39410, 'loss/train': 1.478803277015686} 11/07/2021 02:46:14 - INFO - __main__ - Step 39412: {'lr': 0.00042522773663422977, 'samples': 7567104, 'steps': 39411, 'loss/train': 1.7798511981964111} 11/07/2021 02:46:14 - INFO - __main__ - Step 39413: {'lr': 0.000425223951565736, 'samples': 7567296, 'steps': 39412, 'loss/train': 1.5644084215164185} 11/07/2021 02:46:16 - INFO - __main__ - Step 39414: {'lr': 0.0004252201664182892, 'samples': 7567488, 'steps': 39413, 'loss/train': 1.9910378456115723} 11/07/2021 02:46:16 - INFO - __main__ - Step 39415: {'lr': 0.0004252163811918909, 'samples': 7567680, 'steps': 39414, 'loss/train': 1.361193060874939} 11/07/2021 02:46:16 - INFO - __main__ - Step 39416: {'lr': 0.00042521259588654264, 'samples': 7567872, 'steps': 39415, 'loss/train': 1.2348101139068604} 11/07/2021 02:46:17 - INFO - __main__ - Step 39417: {'lr': 0.00042520881050224637, 'samples': 7568064, 'steps': 39416, 'loss/train': 1.742704153060913} 11/07/2021 02:46:17 - INFO - __main__ - Step 39418: {'lr': 0.0004252050250390037, 'samples': 7568256, 'steps': 39417, 'loss/train': 1.4445207118988037} 11/07/2021 02:46:17 - INFO - __main__ - Step 39419: {'lr': 0.0004252012394968164, 'samples': 7568448, 'steps': 39418, 'loss/train': 1.140768051147461} 11/07/2021 02:46:18 - INFO - __main__ - Step 39420: {'lr': 0.0004251974538756861, 'samples': 7568640, 'steps': 39419, 'loss/train': 1.236556887626648} 11/07/2021 02:46:19 - INFO - __main__ - Step 39421: {'lr': 0.00042519366817561453, 'samples': 7568832, 'steps': 39420, 'loss/train': 1.6497478485107422} 11/07/2021 02:46:19 - INFO - __main__ - Step 39422: {'lr': 0.0004251898823966034, 'samples': 7569024, 'steps': 39421, 'loss/train': 1.5852000713348389} 11/07/2021 02:46:19 - INFO - __main__ - Step 39423: {'lr': 0.00042518609653865444, 'samples': 7569216, 'steps': 39422, 'loss/train': 1.822353720664978} 11/07/2021 02:46:20 - INFO - __main__ - Step 39424: {'lr': 0.00042518231060176926, 'samples': 7569408, 'steps': 39423, 'loss/train': 1.4493063688278198} 11/07/2021 02:46:21 - INFO - __main__ - Step 39425: {'lr': 0.00042517852458594967, 'samples': 7569600, 'steps': 39424, 'loss/train': 1.5626758337020874} 11/07/2021 02:46:21 - INFO - __main__ - Step 39426: {'lr': 0.00042517473849119734, 'samples': 7569792, 'steps': 39425, 'loss/train': 1.4561035633087158} 11/07/2021 02:46:21 - INFO - __main__ - Step 39427: {'lr': 0.000425170952317514, 'samples': 7569984, 'steps': 39426, 'loss/train': 1.5167341232299805} 11/07/2021 02:46:22 - INFO - __main__ - Step 39428: {'lr': 0.0004251671660649013, 'samples': 7570176, 'steps': 39427, 'loss/train': 1.613086223602295} 11/07/2021 02:46:22 - INFO - __main__ - Step 39429: {'lr': 0.000425163379733361, 'samples': 7570368, 'steps': 39428, 'loss/train': 1.4827110767364502} 11/07/2021 02:46:23 - INFO - __main__ - Step 39430: {'lr': 0.00042515959332289476, 'samples': 7570560, 'steps': 39429, 'loss/train': 1.8132240772247314} 11/07/2021 02:46:24 - INFO - __main__ - Step 39431: {'lr': 0.0004251558068335043, 'samples': 7570752, 'steps': 39430, 'loss/train': 1.5489274263381958} 11/07/2021 02:46:24 - INFO - __main__ - Step 39432: {'lr': 0.00042515202026519136, 'samples': 7570944, 'steps': 39431, 'loss/train': 1.3465017080307007} 11/07/2021 02:46:24 - INFO - __main__ - Step 39433: {'lr': 0.00042514823361795764, 'samples': 7571136, 'steps': 39432, 'loss/train': 0.7617422938346863} 11/07/2021 02:46:25 - INFO - __main__ - Step 39434: {'lr': 0.0004251444468918048, 'samples': 7571328, 'steps': 39433, 'loss/train': 1.3461247682571411} 11/07/2021 02:46:26 - INFO - __main__ - Step 39435: {'lr': 0.0004251406600867346, 'samples': 7571520, 'steps': 39434, 'loss/train': 0.49319687485694885} 11/07/2021 02:46:26 - INFO - __main__ - Step 39436: {'lr': 0.00042513687320274866, 'samples': 7571712, 'steps': 39435, 'loss/train': 1.7188775539398193} 11/07/2021 02:46:26 - INFO - __main__ - Step 39437: {'lr': 0.0004251330862398488, 'samples': 7571904, 'steps': 39436, 'loss/train': 1.560532569885254} 11/07/2021 02:46:27 - INFO - __main__ - Step 39438: {'lr': 0.0004251292991980367, 'samples': 7572096, 'steps': 39437, 'loss/train': 1.5440956354141235} 11/07/2021 02:46:27 - INFO - __main__ - Step 39439: {'lr': 0.000425125512077314, 'samples': 7572288, 'steps': 39438, 'loss/train': 1.5055773258209229} 11/07/2021 02:46:28 - INFO - __main__ - Step 39440: {'lr': 0.00042512172487768244, 'samples': 7572480, 'steps': 39439, 'loss/train': 1.3813639879226685} 11/07/2021 02:46:29 - INFO - __main__ - Step 39441: {'lr': 0.00042511793759914375, 'samples': 7572672, 'steps': 39440, 'loss/train': 1.7512849569320679} 11/07/2021 02:46:29 - INFO - __main__ - Step 39442: {'lr': 0.0004251141502416996, 'samples': 7572864, 'steps': 39441, 'loss/train': 1.831099510192871} 11/07/2021 02:46:29 - INFO - __main__ - Step 39443: {'lr': 0.0004251103628053517, 'samples': 7573056, 'steps': 39442, 'loss/train': 1.3687280416488647} 11/07/2021 02:46:30 - INFO - __main__ - Step 39444: {'lr': 0.0004251065752901018, 'samples': 7573248, 'steps': 39443, 'loss/train': 1.537869930267334} 11/07/2021 02:46:30 - INFO - __main__ - Step 39445: {'lr': 0.0004251027876959516, 'samples': 7573440, 'steps': 39444, 'loss/train': 1.254869818687439} 11/07/2021 02:46:31 - INFO - __main__ - Step 39446: {'lr': 0.0004250990000229028, 'samples': 7573632, 'steps': 39445, 'loss/train': 0.9443804025650024} 11/07/2021 02:46:31 - INFO - __main__ - Step 39447: {'lr': 0.00042509521227095706, 'samples': 7573824, 'steps': 39446, 'loss/train': 2.0799334049224854} 11/07/2021 02:46:32 - INFO - __main__ - Step 39448: {'lr': 0.0004250914244401161, 'samples': 7574016, 'steps': 39447, 'loss/train': 1.4163551330566406} 11/07/2021 02:46:32 - INFO - __main__ - Step 39449: {'lr': 0.00042508763653038167, 'samples': 7574208, 'steps': 39448, 'loss/train': 1.785377025604248} 11/07/2021 02:46:32 - INFO - __main__ - Step 39450: {'lr': 0.0004250838485417554, 'samples': 7574400, 'steps': 39449, 'loss/train': 1.613315463066101} 11/07/2021 02:46:34 - INFO - __main__ - Step 39451: {'lr': 0.00042508006047423916, 'samples': 7574592, 'steps': 39450, 'loss/train': 0.802128255367279} 11/07/2021 02:46:34 - INFO - __main__ - Step 39452: {'lr': 0.0004250762723278344, 'samples': 7574784, 'steps': 39451, 'loss/train': 0.8597303628921509} 11/07/2021 02:46:34 - INFO - __main__ - Step 39453: {'lr': 0.00042507248410254307, 'samples': 7574976, 'steps': 39452, 'loss/train': 1.2439583539962769} 11/07/2021 02:46:35 - INFO - __main__ - Step 39454: {'lr': 0.0004250686957983668, 'samples': 7575168, 'steps': 39453, 'loss/train': 1.4560964107513428} 11/07/2021 02:46:35 - INFO - __main__ - Step 39455: {'lr': 0.00042506490741530724, 'samples': 7575360, 'steps': 39454, 'loss/train': 1.489845633506775} 11/07/2021 02:46:36 - INFO - __main__ - Step 39456: {'lr': 0.00042506111895336616, 'samples': 7575552, 'steps': 39455, 'loss/train': 1.9019747972488403} 11/07/2021 02:46:36 - INFO - __main__ - Step 39457: {'lr': 0.00042505733041254526, 'samples': 7575744, 'steps': 39456, 'loss/train': 1.4447280168533325} 11/07/2021 02:46:37 - INFO - __main__ - Step 39458: {'lr': 0.00042505354179284615, 'samples': 7575936, 'steps': 39457, 'loss/train': 1.6310251951217651} 11/07/2021 02:46:37 - INFO - __main__ - Step 39459: {'lr': 0.00042504975309427064, 'samples': 7576128, 'steps': 39458, 'loss/train': 1.4011722803115845} 11/07/2021 02:46:37 - INFO - __main__ - Step 39460: {'lr': 0.0004250459643168204, 'samples': 7576320, 'steps': 39459, 'loss/train': 1.6884933710098267} 11/07/2021 02:46:38 - INFO - __main__ - Step 39461: {'lr': 0.0004250421754604972, 'samples': 7576512, 'steps': 39460, 'loss/train': 1.4346963167190552} 11/07/2021 02:46:39 - INFO - __main__ - Step 39462: {'lr': 0.0004250383865253027, 'samples': 7576704, 'steps': 39461, 'loss/train': 1.5717755556106567} 11/07/2021 02:46:39 - INFO - __main__ - Step 39463: {'lr': 0.00042503459751123854, 'samples': 7576896, 'steps': 39462, 'loss/train': 1.5766115188598633} 11/07/2021 02:46:39 - INFO - __main__ - Step 39464: {'lr': 0.00042503080841830654, 'samples': 7577088, 'steps': 39463, 'loss/train': 1.5276356935501099} 11/07/2021 02:46:40 - INFO - __main__ - Step 39465: {'lr': 0.0004250270192465083, 'samples': 7577280, 'steps': 39464, 'loss/train': 1.3183841705322266} 11/07/2021 02:46:41 - INFO - __main__ - Step 39466: {'lr': 0.0004250232299958456, 'samples': 7577472, 'steps': 39465, 'loss/train': 1.2871264219284058} 11/07/2021 02:46:41 - INFO - __main__ - Step 39467: {'lr': 0.0004250194406663203, 'samples': 7577664, 'steps': 39466, 'loss/train': 1.6174371242523193} 11/07/2021 02:46:42 - INFO - __main__ - Step 39468: {'lr': 0.00042501565125793375, 'samples': 7577856, 'steps': 39467, 'loss/train': 1.5066871643066406} 11/07/2021 02:46:42 - INFO - __main__ - Step 39469: {'lr': 0.0004250118617706879, 'samples': 7578048, 'steps': 39468, 'loss/train': 1.4779454469680786} 11/07/2021 02:46:42 - INFO - __main__ - Step 39470: {'lr': 0.0004250080722045844, 'samples': 7578240, 'steps': 39469, 'loss/train': 1.5967754125595093} 11/07/2021 02:46:43 - INFO - __main__ - Step 39471: {'lr': 0.000425004282559625, 'samples': 7578432, 'steps': 39470, 'loss/train': 2.041210889816284} 11/07/2021 02:46:44 - INFO - __main__ - Step 39472: {'lr': 0.0004250004928358113, 'samples': 7578624, 'steps': 39471, 'loss/train': 1.4292852878570557} 11/07/2021 02:46:44 - INFO - __main__ - Step 39473: {'lr': 0.0004249967030331451, 'samples': 7578816, 'steps': 39472, 'loss/train': 1.746503472328186} 11/07/2021 02:46:44 - INFO - __main__ - Step 39474: {'lr': 0.0004249929131516281, 'samples': 7579008, 'steps': 39473, 'loss/train': 1.8705283403396606} 11/07/2021 02:46:45 - INFO - __main__ - Step 39475: {'lr': 0.00042498912319126206, 'samples': 7579200, 'steps': 39474, 'loss/train': 1.8053672313690186} 11/07/2021 02:46:45 - INFO - __main__ - Step 39476: {'lr': 0.00042498533315204855, 'samples': 7579392, 'steps': 39475, 'loss/train': 2.0949859619140625} 11/07/2021 02:46:46 - INFO - __main__ - Step 39477: {'lr': 0.0004249815430339894, 'samples': 7579584, 'steps': 39476, 'loss/train': 0.690558135509491} 11/07/2021 02:46:46 - INFO - __main__ - Step 39478: {'lr': 0.0004249777528370862, 'samples': 7579776, 'steps': 39477, 'loss/train': 0.9026269316673279} 11/07/2021 02:46:47 - INFO - __main__ - Step 39479: {'lr': 0.00042497396256134073, 'samples': 7579968, 'steps': 39478, 'loss/train': 1.6150840520858765} 11/07/2021 02:46:47 - INFO - __main__ - Step 39480: {'lr': 0.0004249701722067547, 'samples': 7580160, 'steps': 39479, 'loss/train': 1.5659527778625488} 11/07/2021 02:46:47 - INFO - __main__ - Step 39481: {'lr': 0.0004249663817733298, 'samples': 7580352, 'steps': 39480, 'loss/train': 1.2330738306045532} 11/07/2021 02:46:48 - INFO - __main__ - Step 39482: {'lr': 0.00042496259126106786, 'samples': 7580544, 'steps': 39481, 'loss/train': 1.7414649724960327} 11/07/2021 02:46:49 - INFO - __main__ - Step 39483: {'lr': 0.0004249588006699704, 'samples': 7580736, 'steps': 39482, 'loss/train': 1.373297929763794} 11/07/2021 02:46:49 - INFO - __main__ - Step 39484: {'lr': 0.0004249550100000392, 'samples': 7580928, 'steps': 39483, 'loss/train': 1.433725357055664} 11/07/2021 02:46:50 - INFO - __main__ - Step 39485: {'lr': 0.0004249512192512759, 'samples': 7581120, 'steps': 39484, 'loss/train': 1.6692142486572266} 11/07/2021 02:46:50 - INFO - __main__ - Step 39486: {'lr': 0.0004249474284236824, 'samples': 7581312, 'steps': 39485, 'loss/train': 1.6300110816955566} 11/07/2021 02:46:51 - INFO - __main__ - Step 39487: {'lr': 0.0004249436375172602, 'samples': 7581504, 'steps': 39486, 'loss/train': 1.0745890140533447} 11/07/2021 02:46:51 - INFO - __main__ - Step 39488: {'lr': 0.0004249398465320111, 'samples': 7581696, 'steps': 39487, 'loss/train': 1.595900535583496} 11/07/2021 02:46:52 - INFO - __main__ - Step 39489: {'lr': 0.0004249360554679369, 'samples': 7581888, 'steps': 39488, 'loss/train': 1.4993693828582764} 11/07/2021 02:46:52 - INFO - __main__ - Step 39490: {'lr': 0.00042493226432503917, 'samples': 7582080, 'steps': 39489, 'loss/train': 1.0378392934799194} 11/07/2021 02:46:52 - INFO - __main__ - Step 39491: {'lr': 0.00042492847310331963, 'samples': 7582272, 'steps': 39490, 'loss/train': 1.5508944988250732} 11/07/2021 02:46:53 - INFO - __main__ - Step 39492: {'lr': 0.00042492468180278, 'samples': 7582464, 'steps': 39491, 'loss/train': 1.510115385055542} 11/07/2021 02:46:54 - INFO - __main__ - Step 39493: {'lr': 0.000424920890423422, 'samples': 7582656, 'steps': 39492, 'loss/train': 1.385694980621338} 11/07/2021 02:46:54 - INFO - __main__ - Step 39494: {'lr': 0.0004249170989652474, 'samples': 7582848, 'steps': 39493, 'loss/train': 1.5371301174163818} 11/07/2021 02:46:54 - INFO - __main__ - Step 39495: {'lr': 0.00042491330742825783, 'samples': 7583040, 'steps': 39494, 'loss/train': 1.4411581754684448} 11/07/2021 02:46:55 - INFO - __main__ - Step 39496: {'lr': 0.0004249095158124551, 'samples': 7583232, 'steps': 39495, 'loss/train': 1.6034796237945557} 11/07/2021 02:46:57 - INFO - __main__ - Step 39497: {'lr': 0.0004249057241178407, 'samples': 7583424, 'steps': 39496, 'loss/train': 1.4687601327896118} 11/07/2021 02:46:58 - INFO - __main__ - Step 39498: {'lr': 0.00042490193234441656, 'samples': 7583616, 'steps': 39497, 'loss/train': 1.361867904663086} 11/07/2021 02:46:58 - INFO - __main__ - Step 39499: {'lr': 0.00042489814049218434, 'samples': 7583808, 'steps': 39498, 'loss/train': 1.3539139032363892} 11/07/2021 02:46:58 - INFO - __main__ - Step 39500: {'lr': 0.00042489434856114565, 'samples': 7584000, 'steps': 39499, 'loss/train': 1.4942436218261719} 11/07/2021 02:46:59 - INFO - __main__ - Step 39501: {'lr': 0.00042489055655130226, 'samples': 7584192, 'steps': 39500, 'loss/train': 1.5138282775878906} 11/07/2021 02:46:59 - INFO - __main__ - Step 39502: {'lr': 0.00042488676446265596, 'samples': 7584384, 'steps': 39501, 'loss/train': 1.1894440650939941} 11/07/2021 02:46:59 - INFO - __main__ - Step 39503: {'lr': 0.00042488297229520834, 'samples': 7584576, 'steps': 39502, 'loss/train': 1.7873018980026245} 11/07/2021 02:47:00 - INFO - __main__ - Step 39504: {'lr': 0.00042487918004896117, 'samples': 7584768, 'steps': 39503, 'loss/train': 1.7786004543304443} 11/07/2021 02:47:01 - INFO - __main__ - Step 39505: {'lr': 0.0004248753877239161, 'samples': 7584960, 'steps': 39504, 'loss/train': 1.7644140720367432} 11/07/2021 02:47:01 - INFO - __main__ - Step 39506: {'lr': 0.0004248715953200749, 'samples': 7585152, 'steps': 39505, 'loss/train': 1.6513596773147583} 11/07/2021 02:47:01 - INFO - __main__ - Step 39507: {'lr': 0.00042486780283743927, 'samples': 7585344, 'steps': 39506, 'loss/train': 2.035029172897339} 11/07/2021 02:47:02 - INFO - __main__ - Step 39508: {'lr': 0.00042486401027601084, 'samples': 7585536, 'steps': 39507, 'loss/train': 1.7957890033721924} 11/07/2021 02:47:02 - INFO - __main__ - Step 39509: {'lr': 0.0004248602176357915, 'samples': 7585728, 'steps': 39508, 'loss/train': 1.3580241203308105} 11/07/2021 02:47:03 - INFO - __main__ - Step 39510: {'lr': 0.0004248564249167828, 'samples': 7585920, 'steps': 39509, 'loss/train': 1.7206439971923828} 11/07/2021 02:47:03 - INFO - __main__ - Step 39511: {'lr': 0.00042485263211898647, 'samples': 7586112, 'steps': 39510, 'loss/train': 0.794846773147583} 11/07/2021 02:47:04 - INFO - __main__ - Step 39512: {'lr': 0.00042484883924240427, 'samples': 7586304, 'steps': 39511, 'loss/train': 1.5746791362762451} 11/07/2021 02:47:04 - INFO - __main__ - Step 39513: {'lr': 0.0004248450462870378, 'samples': 7586496, 'steps': 39512, 'loss/train': 1.3140740394592285} 11/07/2021 02:47:04 - INFO - __main__ - Step 39514: {'lr': 0.0004248412532528889, 'samples': 7586688, 'steps': 39513, 'loss/train': 1.3792023658752441} 11/07/2021 02:47:05 - INFO - __main__ - Step 39515: {'lr': 0.00042483746013995924, 'samples': 7586880, 'steps': 39514, 'loss/train': 1.413406491279602} 11/07/2021 02:47:06 - INFO - __main__ - Step 39516: {'lr': 0.00042483366694825054, 'samples': 7587072, 'steps': 39515, 'loss/train': 1.7062214612960815} 11/07/2021 02:47:06 - INFO - __main__ - Step 39517: {'lr': 0.0004248298736777645, 'samples': 7587264, 'steps': 39516, 'loss/train': 1.7617663145065308} 11/07/2021 02:47:06 - INFO - __main__ - Step 39518: {'lr': 0.00042482608032850275, 'samples': 7587456, 'steps': 39517, 'loss/train': 1.4334967136383057} 11/07/2021 02:47:07 - INFO - __main__ - Step 39519: {'lr': 0.0004248222869004671, 'samples': 7587648, 'steps': 39518, 'loss/train': 1.6987298727035522} 11/07/2021 02:47:08 - INFO - __main__ - Step 39520: {'lr': 0.0004248184933936592, 'samples': 7587840, 'steps': 39519, 'loss/train': 1.8820641040802002} 11/07/2021 02:47:08 - INFO - __main__ - Step 39521: {'lr': 0.0004248146998080808, 'samples': 7588032, 'steps': 39520, 'loss/train': 1.3323283195495605} 11/07/2021 02:47:09 - INFO - __main__ - Step 39522: {'lr': 0.00042481090614373364, 'samples': 7588224, 'steps': 39521, 'loss/train': 1.578730583190918} 11/07/2021 02:47:09 - INFO - __main__ - Step 39523: {'lr': 0.00042480711240061933, 'samples': 7588416, 'steps': 39522, 'loss/train': 1.1655219793319702} 11/07/2021 02:47:09 - INFO - __main__ - Step 39524: {'lr': 0.0004248033185787397, 'samples': 7588608, 'steps': 39523, 'loss/train': 0.2873501479625702} 11/07/2021 02:47:10 - INFO - __main__ - Step 39525: {'lr': 0.00042479952467809623, 'samples': 7588800, 'steps': 39524, 'loss/train': 1.5925241708755493} 11/07/2021 02:47:11 - INFO - __main__ - Step 39526: {'lr': 0.00042479573069869095, 'samples': 7588992, 'steps': 39525, 'loss/train': 1.9888464212417603} 11/07/2021 02:47:11 - INFO - __main__ - Step 39527: {'lr': 0.0004247919366405253, 'samples': 7589184, 'steps': 39526, 'loss/train': 1.2977604866027832} 11/07/2021 02:47:11 - INFO - __main__ - Step 39528: {'lr': 0.0004247881425036012, 'samples': 7589376, 'steps': 39527, 'loss/train': 1.3489466905593872} 11/07/2021 02:47:12 - INFO - __main__ - Step 39529: {'lr': 0.00042478434828792025, 'samples': 7589568, 'steps': 39528, 'loss/train': 1.9120523929595947} 11/07/2021 02:47:12 - INFO - __main__ - Step 39530: {'lr': 0.00042478055399348415, 'samples': 7589760, 'steps': 39529, 'loss/train': 1.7055436372756958} 11/07/2021 02:47:13 - INFO - __main__ - Step 39531: {'lr': 0.0004247767596202946, 'samples': 7589952, 'steps': 39530, 'loss/train': 1.397582769393921} 11/07/2021 02:47:14 - INFO - __main__ - Step 39532: {'lr': 0.00042477296516835335, 'samples': 7590144, 'steps': 39531, 'loss/train': 1.6137359142303467} 11/07/2021 02:47:14 - INFO - __main__ - Step 39533: {'lr': 0.00042476917063766207, 'samples': 7590336, 'steps': 39532, 'loss/train': 1.2835636138916016} 11/07/2021 02:47:14 - INFO - __main__ - Step 39534: {'lr': 0.0004247653760282225, 'samples': 7590528, 'steps': 39533, 'loss/train': 1.2379640340805054} 11/07/2021 02:47:15 - INFO - __main__ - Step 39535: {'lr': 0.0004247615813400364, 'samples': 7590720, 'steps': 39534, 'loss/train': 1.3855849504470825} 11/07/2021 02:47:16 - INFO - __main__ - Step 39536: {'lr': 0.0004247577865731055, 'samples': 7590912, 'steps': 39535, 'loss/train': 1.8534126281738281} 11/07/2021 02:47:16 - INFO - __main__ - Step 39537: {'lr': 0.00042475399172743134, 'samples': 7591104, 'steps': 39536, 'loss/train': 1.2669944763183594} 11/07/2021 02:47:16 - INFO - __main__ - Step 39538: {'lr': 0.0004247501968030157, 'samples': 7591296, 'steps': 39537, 'loss/train': 1.1878610849380493} 11/07/2021 02:47:17 - INFO - __main__ - Step 39539: {'lr': 0.00042474640179986035, 'samples': 7591488, 'steps': 39538, 'loss/train': 1.5407204627990723} 11/07/2021 02:47:17 - INFO - __main__ - Step 39540: {'lr': 0.00042474260671796697, 'samples': 7591680, 'steps': 39539, 'loss/train': 1.8082916736602783} 11/07/2021 02:47:18 - INFO - __main__ - Step 39541: {'lr': 0.0004247388115573373, 'samples': 7591872, 'steps': 39540, 'loss/train': 1.6505813598632812} 11/07/2021 02:47:18 - INFO - __main__ - Step 39542: {'lr': 0.00042473501631797294, 'samples': 7592064, 'steps': 39541, 'loss/train': 1.686454176902771} 11/07/2021 02:47:19 - INFO - __main__ - Step 39543: {'lr': 0.0004247312209998758, 'samples': 7592256, 'steps': 39542, 'loss/train': 1.6812511682510376} 11/07/2021 02:47:19 - INFO - __main__ - Step 39544: {'lr': 0.00042472742560304734, 'samples': 7592448, 'steps': 39543, 'loss/train': 1.4964607954025269} 11/07/2021 02:47:19 - INFO - __main__ - Step 39545: {'lr': 0.00042472363012748947, 'samples': 7592640, 'steps': 39544, 'loss/train': 1.4741764068603516} 11/07/2021 02:47:21 - INFO - __main__ - Step 39546: {'lr': 0.00042471983457320384, 'samples': 7592832, 'steps': 39545, 'loss/train': 0.9633825421333313} 11/07/2021 02:47:21 - INFO - __main__ - Step 39547: {'lr': 0.00042471603894019206, 'samples': 7593024, 'steps': 39546, 'loss/train': 1.115648865699768} 11/07/2021 02:47:21 - INFO - __main__ - Step 39548: {'lr': 0.00042471224322845603, 'samples': 7593216, 'steps': 39547, 'loss/train': 1.6646112203598022} 11/07/2021 02:47:22 - INFO - __main__ - Step 39549: {'lr': 0.00042470844743799734, 'samples': 7593408, 'steps': 39548, 'loss/train': 1.172352910041809} 11/07/2021 02:47:22 - INFO - __main__ - Step 39550: {'lr': 0.00042470465156881765, 'samples': 7593600, 'steps': 39549, 'loss/train': 1.7505109310150146} 11/07/2021 02:47:23 - INFO - __main__ - Step 39551: {'lr': 0.00042470085562091887, 'samples': 7593792, 'steps': 39550, 'loss/train': 0.6108179092407227} 11/07/2021 02:47:23 - INFO - __main__ - Step 39552: {'lr': 0.0004246970595943025, 'samples': 7593984, 'steps': 39551, 'loss/train': 1.7491813898086548} 11/07/2021 02:47:24 - INFO - __main__ - Step 39553: {'lr': 0.0004246932634889703, 'samples': 7594176, 'steps': 39552, 'loss/train': 1.6450272798538208} 11/07/2021 02:47:24 - INFO - __main__ - Step 39554: {'lr': 0.00042468946730492404, 'samples': 7594368, 'steps': 39553, 'loss/train': 1.6657792329788208} 11/07/2021 02:47:24 - INFO - __main__ - Step 39555: {'lr': 0.00042468567104216536, 'samples': 7594560, 'steps': 39554, 'loss/train': 1.3390445709228516} 11/07/2021 02:47:25 - INFO - __main__ - Step 39556: {'lr': 0.0004246818747006961, 'samples': 7594752, 'steps': 39555, 'loss/train': 0.4458675682544708} 11/07/2021 02:47:26 - INFO - __main__ - Step 39557: {'lr': 0.00042467807828051787, 'samples': 7594944, 'steps': 39556, 'loss/train': 0.5895477533340454} 11/07/2021 02:47:26 - INFO - __main__ - Step 39558: {'lr': 0.0004246742817816323, 'samples': 7595136, 'steps': 39557, 'loss/train': 1.8787654638290405} 11/07/2021 02:47:27 - INFO - __main__ - Step 39559: {'lr': 0.00042467048520404126, 'samples': 7595328, 'steps': 39558, 'loss/train': 1.1775028705596924} 11/07/2021 02:47:27 - INFO - __main__ - Step 39560: {'lr': 0.00042466668854774636, 'samples': 7595520, 'steps': 39559, 'loss/train': 1.6932798624038696} 11/07/2021 02:47:27 - INFO - __main__ - Step 39561: {'lr': 0.00042466289181274943, 'samples': 7595712, 'steps': 39560, 'loss/train': 2.1152291297912598} 11/07/2021 02:47:28 - INFO - __main__ - Step 39562: {'lr': 0.00042465909499905206, 'samples': 7595904, 'steps': 39561, 'loss/train': 1.601305603981018} 11/07/2021 02:47:29 - INFO - __main__ - Step 39563: {'lr': 0.0004246552981066559, 'samples': 7596096, 'steps': 39562, 'loss/train': 1.5690134763717651} 11/07/2021 02:47:29 - INFO - __main__ - Step 39564: {'lr': 0.0004246515011355629, 'samples': 7596288, 'steps': 39563, 'loss/train': 1.7379307746887207} 11/07/2021 02:47:29 - INFO - __main__ - Step 39565: {'lr': 0.0004246477040857746, 'samples': 7596480, 'steps': 39564, 'loss/train': 1.4712262153625488} 11/07/2021 02:47:30 - INFO - __main__ - Step 39566: {'lr': 0.0004246439069572926, 'samples': 7596672, 'steps': 39565, 'loss/train': 0.4988980293273926} 11/07/2021 02:47:31 - INFO - __main__ - Step 39567: {'lr': 0.00042464010975011893, 'samples': 7596864, 'steps': 39566, 'loss/train': 2.626159429550171} 11/07/2021 02:47:31 - INFO - __main__ - Step 39568: {'lr': 0.00042463631246425504, 'samples': 7597056, 'steps': 39567, 'loss/train': 1.4757068157196045} 11/07/2021 02:47:32 - INFO - __main__ - Step 39569: {'lr': 0.0004246325150997027, 'samples': 7597248, 'steps': 39568, 'loss/train': 1.5218952894210815} 11/07/2021 02:47:32 - INFO - __main__ - Step 39570: {'lr': 0.0004246287176564637, 'samples': 7597440, 'steps': 39569, 'loss/train': 1.8463135957717896} 11/07/2021 02:47:32 - INFO - __main__ - Step 39571: {'lr': 0.0004246249201345397, 'samples': 7597632, 'steps': 39570, 'loss/train': 1.5711888074874878} 11/07/2021 02:47:33 - INFO - __main__ - Step 39572: {'lr': 0.0004246211225339323, 'samples': 7597824, 'steps': 39571, 'loss/train': 1.0583667755126953} 11/07/2021 02:47:34 - INFO - __main__ - Step 39573: {'lr': 0.0004246173248546434, 'samples': 7598016, 'steps': 39572, 'loss/train': 0.8221028447151184} 11/07/2021 02:47:34 - INFO - __main__ - Step 39574: {'lr': 0.0004246135270966747, 'samples': 7598208, 'steps': 39573, 'loss/train': 1.1225409507751465} 11/07/2021 02:47:34 - INFO - __main__ - Step 39575: {'lr': 0.00042460972926002774, 'samples': 7598400, 'steps': 39574, 'loss/train': 1.4795197248458862} 11/07/2021 02:47:35 - INFO - __main__ - Step 39576: {'lr': 0.00042460593134470426, 'samples': 7598592, 'steps': 39575, 'loss/train': 1.1359695196151733} 11/07/2021 02:47:35 - INFO - __main__ - Step 39577: {'lr': 0.0004246021333507062, 'samples': 7598784, 'steps': 39576, 'loss/train': 1.8443375825881958} 11/07/2021 02:47:36 - INFO - __main__ - Step 39578: {'lr': 0.00042459833527803503, 'samples': 7598976, 'steps': 39577, 'loss/train': 1.4954917430877686} 11/07/2021 02:47:36 - INFO - __main__ - Step 39579: {'lr': 0.00042459453712669255, 'samples': 7599168, 'steps': 39578, 'loss/train': 1.6938172578811646} 11/07/2021 02:47:37 - INFO - __main__ - Step 39580: {'lr': 0.0004245907388966804, 'samples': 7599360, 'steps': 39579, 'loss/train': 1.2286467552185059} 11/07/2021 02:47:37 - INFO - __main__ - Step 39581: {'lr': 0.0004245869405880005, 'samples': 7599552, 'steps': 39580, 'loss/train': 1.6558587551116943} 11/07/2021 02:47:37 - INFO - __main__ - Step 39582: {'lr': 0.0004245831422006543, 'samples': 7599744, 'steps': 39581, 'loss/train': 1.961647868156433} 11/07/2021 02:47:39 - INFO - __main__ - Step 39583: {'lr': 0.0004245793437346437, 'samples': 7599936, 'steps': 39582, 'loss/train': 1.524979829788208} 11/07/2021 02:47:39 - INFO - __main__ - Step 39584: {'lr': 0.0004245755451899703, 'samples': 7600128, 'steps': 39583, 'loss/train': 2.1289525032043457} 11/07/2021 02:47:39 - INFO - __main__ - Step 39585: {'lr': 0.0004245717465666359, 'samples': 7600320, 'steps': 39584, 'loss/train': 1.25424063205719} 11/07/2021 02:47:40 - INFO - __main__ - Step 39586: {'lr': 0.0004245679478646421, 'samples': 7600512, 'steps': 39585, 'loss/train': 1.688192367553711} 11/07/2021 02:47:40 - INFO - __main__ - Step 39587: {'lr': 0.00042456414908399075, 'samples': 7600704, 'steps': 39586, 'loss/train': 1.6947143077850342} 11/07/2021 02:47:41 - INFO - __main__ - Step 39588: {'lr': 0.00042456035022468344, 'samples': 7600896, 'steps': 39587, 'loss/train': 1.5494352579116821} 11/07/2021 02:47:41 - INFO - __main__ - Step 39589: {'lr': 0.0004245565512867219, 'samples': 7601088, 'steps': 39588, 'loss/train': 1.519472360610962} 11/07/2021 02:47:42 - INFO - __main__ - Step 39590: {'lr': 0.000424552752270108, 'samples': 7601280, 'steps': 39589, 'loss/train': 1.1904324293136597} 11/07/2021 02:47:42 - INFO - __main__ - Step 39591: {'lr': 0.0004245489531748432, 'samples': 7601472, 'steps': 39590, 'loss/train': 1.3604559898376465} 11/07/2021 02:47:42 - INFO - __main__ - Step 39592: {'lr': 0.00042454515400092944, 'samples': 7601664, 'steps': 39591, 'loss/train': 1.5542364120483398} 11/07/2021 02:47:43 - INFO - __main__ - Step 39593: {'lr': 0.00042454135474836817, 'samples': 7601856, 'steps': 39592, 'loss/train': 1.7233147621154785} 11/07/2021 02:47:44 - INFO - __main__ - Step 39594: {'lr': 0.0004245375554171613, 'samples': 7602048, 'steps': 39593, 'loss/train': 1.5351544618606567} 11/07/2021 02:47:44 - INFO - __main__ - Step 39595: {'lr': 0.00042453375600731057, 'samples': 7602240, 'steps': 39594, 'loss/train': 1.5188515186309814} 11/07/2021 02:47:45 - INFO - __main__ - Step 39596: {'lr': 0.00042452995651881764, 'samples': 7602432, 'steps': 39595, 'loss/train': 1.5693578720092773} 11/07/2021 02:47:45 - INFO - __main__ - Step 39597: {'lr': 0.0004245261569516842, 'samples': 7602624, 'steps': 39596, 'loss/train': 1.073722004890442} 11/07/2021 02:47:46 - INFO - __main__ - Step 39598: {'lr': 0.00042452235730591195, 'samples': 7602816, 'steps': 39597, 'loss/train': 1.8061981201171875} 11/07/2021 02:47:46 - INFO - __main__ - Step 39599: {'lr': 0.00042451855758150254, 'samples': 7603008, 'steps': 39598, 'loss/train': 1.7523164749145508} 11/07/2021 02:47:47 - INFO - __main__ - Step 39600: {'lr': 0.00042451475777845784, 'samples': 7603200, 'steps': 39599, 'loss/train': 1.640657901763916} 11/07/2021 02:47:47 - INFO - __main__ - Step 39601: {'lr': 0.00042451095789677943, 'samples': 7603392, 'steps': 39600, 'loss/train': 0.8249521255493164} 11/07/2021 02:47:47 - INFO - __main__ - Step 39602: {'lr': 0.0004245071579364691, 'samples': 7603584, 'steps': 39601, 'loss/train': 1.4024829864501953} 11/07/2021 02:47:48 - INFO - __main__ - Step 39603: {'lr': 0.0004245033578975286, 'samples': 7603776, 'steps': 39602, 'loss/train': 1.5897529125213623} 11/07/2021 02:47:49 - INFO - __main__ - Step 39604: {'lr': 0.00042449955777995954, 'samples': 7603968, 'steps': 39603, 'loss/train': 1.592136025428772} 11/07/2021 02:47:49 - INFO - __main__ - Step 39605: {'lr': 0.0004244957575837636, 'samples': 7604160, 'steps': 39604, 'loss/train': 1.688389539718628} 11/07/2021 02:47:49 - INFO - __main__ - Step 39606: {'lr': 0.00042449195730894266, 'samples': 7604352, 'steps': 39605, 'loss/train': 1.7507917881011963} 11/07/2021 02:47:50 - INFO - __main__ - Step 39607: {'lr': 0.00042448815695549823, 'samples': 7604544, 'steps': 39606, 'loss/train': 1.3852903842926025} 11/07/2021 02:47:50 - INFO - __main__ - Step 39608: {'lr': 0.00042448435652343223, 'samples': 7604736, 'steps': 39607, 'loss/train': 1.5574475526809692} 11/07/2021 02:47:51 - INFO - __main__ - Step 39609: {'lr': 0.0004244805560127463, 'samples': 7604928, 'steps': 39608, 'loss/train': 1.4441529512405396} 11/07/2021 02:47:51 - INFO - __main__ - Step 39610: {'lr': 0.00042447675542344203, 'samples': 7605120, 'steps': 39609, 'loss/train': 1.3182731866836548} 11/07/2021 02:47:52 - INFO - __main__ - Step 39611: {'lr': 0.0004244729547555213, 'samples': 7605312, 'steps': 39610, 'loss/train': 1.9568254947662354} 11/07/2021 02:47:52 - INFO - __main__ - Step 39612: {'lr': 0.00042446915400898565, 'samples': 7605504, 'steps': 39611, 'loss/train': 1.1934031248092651} 11/07/2021 02:47:52 - INFO - __main__ - Step 39613: {'lr': 0.00042446535318383695, 'samples': 7605696, 'steps': 39612, 'loss/train': 1.4212074279785156} 11/07/2021 02:47:54 - INFO - __main__ - Step 39614: {'lr': 0.00042446155228007687, 'samples': 7605888, 'steps': 39613, 'loss/train': 1.625552773475647} 11/07/2021 02:47:54 - INFO - __main__ - Step 39615: {'lr': 0.0004244577512977071, 'samples': 7606080, 'steps': 39614, 'loss/train': 1.0876522064208984} 11/07/2021 02:47:54 - INFO - __main__ - Step 39616: {'lr': 0.00042445395023672935, 'samples': 7606272, 'steps': 39615, 'loss/train': 1.587868332862854} 11/07/2021 02:47:55 - INFO - __main__ - Step 39617: {'lr': 0.0004244501490971454, 'samples': 7606464, 'steps': 39616, 'loss/train': 0.39124223589897156} 11/07/2021 02:47:55 - INFO - __main__ - Step 39618: {'lr': 0.0004244463478789568, 'samples': 7606656, 'steps': 39617, 'loss/train': 0.7393484711647034} 11/07/2021 02:47:56 - INFO - __main__ - Step 39619: {'lr': 0.0004244425465821654, 'samples': 7606848, 'steps': 39618, 'loss/train': 1.4141795635223389} 11/07/2021 02:47:56 - INFO - __main__ - Step 39620: {'lr': 0.0004244387452067729, 'samples': 7607040, 'steps': 39619, 'loss/train': 1.529541254043579} 11/07/2021 02:47:57 - INFO - __main__ - Step 39621: {'lr': 0.000424434943752781, 'samples': 7607232, 'steps': 39620, 'loss/train': 0.9253093004226685} 11/07/2021 02:47:57 - INFO - __main__ - Step 39622: {'lr': 0.0004244311422201914, 'samples': 7607424, 'steps': 39621, 'loss/train': 0.8912478089332581} 11/07/2021 02:47:57 - INFO - __main__ - Step 39623: {'lr': 0.0004244273406090058, 'samples': 7607616, 'steps': 39622, 'loss/train': 1.4458396434783936} 11/07/2021 02:47:58 - INFO - __main__ - Step 39624: {'lr': 0.000424423538919226, 'samples': 7607808, 'steps': 39623, 'loss/train': 1.4262381792068481} 11/07/2021 02:47:59 - INFO - __main__ - Step 39625: {'lr': 0.0004244197371508536, 'samples': 7608000, 'steps': 39624, 'loss/train': 1.5457366704940796} 11/07/2021 02:47:59 - INFO - __main__ - Step 39626: {'lr': 0.00042441593530389025, 'samples': 7608192, 'steps': 39625, 'loss/train': 1.671734094619751} 11/07/2021 02:48:00 - INFO - __main__ - Step 39627: {'lr': 0.0004244121333783379, 'samples': 7608384, 'steps': 39626, 'loss/train': 1.4473621845245361} 11/07/2021 02:48:00 - INFO - __main__ - Step 39628: {'lr': 0.0004244083313741981, 'samples': 7608576, 'steps': 39627, 'loss/train': 1.5875861644744873} 11/07/2021 02:48:01 - INFO - __main__ - Step 39629: {'lr': 0.0004244045292914726, 'samples': 7608768, 'steps': 39628, 'loss/train': 1.6657663583755493} 11/07/2021 02:48:01 - INFO - __main__ - Step 39630: {'lr': 0.00042440072713016317, 'samples': 7608960, 'steps': 39629, 'loss/train': 2.132669687271118} 11/07/2021 02:48:02 - INFO - __main__ - Step 39631: {'lr': 0.00042439692489027136, 'samples': 7609152, 'steps': 39630, 'loss/train': 1.6087201833724976} 11/07/2021 02:48:02 - INFO - __main__ - Step 39632: {'lr': 0.000424393122571799, 'samples': 7609344, 'steps': 39631, 'loss/train': 1.6041347980499268} 11/07/2021 02:48:02 - INFO - __main__ - Step 39633: {'lr': 0.00042438932017474783, 'samples': 7609536, 'steps': 39632, 'loss/train': 1.414711594581604} 11/07/2021 02:48:03 - INFO - __main__ - Step 39634: {'lr': 0.0004243855176991195, 'samples': 7609728, 'steps': 39633, 'loss/train': 1.6235281229019165} 11/07/2021 02:48:04 - INFO - __main__ - Step 39635: {'lr': 0.0004243817151449158, 'samples': 7609920, 'steps': 39634, 'loss/train': 1.363830327987671} 11/07/2021 02:48:04 - INFO - __main__ - Step 39636: {'lr': 0.0004243779125121383, 'samples': 7610112, 'steps': 39635, 'loss/train': 1.6997758150100708} 11/07/2021 02:48:04 - INFO - __main__ - Step 39637: {'lr': 0.00042437410980078894, 'samples': 7610304, 'steps': 39636, 'loss/train': 1.1887015104293823} 11/07/2021 02:48:05 - INFO - __main__ - Step 39638: {'lr': 0.0004243703070108692, 'samples': 7610496, 'steps': 39637, 'loss/train': 1.7544221878051758} 11/07/2021 02:48:05 - INFO - __main__ - Step 39639: {'lr': 0.00042436650414238086, 'samples': 7610688, 'steps': 39638, 'loss/train': 2.068868398666382} 11/07/2021 02:48:06 - INFO - __main__ - Step 39640: {'lr': 0.0004243627011953257, 'samples': 7610880, 'steps': 39639, 'loss/train': 1.4439142942428589} 11/07/2021 02:48:06 - INFO - __main__ - Step 39641: {'lr': 0.0004243588981697054, 'samples': 7611072, 'steps': 39640, 'loss/train': 1.7286045551300049} 11/07/2021 02:48:07 - INFO - __main__ - Step 39642: {'lr': 0.0004243550950655217, 'samples': 7611264, 'steps': 39641, 'loss/train': 0.8495831489562988} 11/07/2021 02:48:07 - INFO - __main__ - Step 39643: {'lr': 0.00042435129188277625, 'samples': 7611456, 'steps': 39642, 'loss/train': 1.5878719091415405} 11/07/2021 02:48:07 - INFO - __main__ - Step 39644: {'lr': 0.0004243474886214708, 'samples': 7611648, 'steps': 39643, 'loss/train': 1.6149377822875977} 11/07/2021 02:48:09 - INFO - __main__ - Step 39645: {'lr': 0.0004243436852816071, 'samples': 7611840, 'steps': 39644, 'loss/train': 1.3852001428604126} 11/07/2021 02:48:09 - INFO - __main__ - Step 39646: {'lr': 0.0004243398818631868, 'samples': 7612032, 'steps': 39645, 'loss/train': 1.6688364744186401} 11/07/2021 02:48:09 - INFO - __main__ - Step 39647: {'lr': 0.0004243360783662116, 'samples': 7612224, 'steps': 39646, 'loss/train': 2.061065673828125} 11/07/2021 02:48:10 - INFO - __main__ - Step 39648: {'lr': 0.0004243322747906833, 'samples': 7612416, 'steps': 39647, 'loss/train': 0.6897776126861572} 11/07/2021 02:48:10 - INFO - __main__ - Step 39649: {'lr': 0.00042432847113660355, 'samples': 7612608, 'steps': 39648, 'loss/train': 1.325101375579834} 11/07/2021 02:48:11 - INFO - __main__ - Step 39650: {'lr': 0.0004243246674039741, 'samples': 7612800, 'steps': 39649, 'loss/train': 1.4035671949386597} 11/07/2021 02:48:12 - INFO - __main__ - Step 39651: {'lr': 0.00042432086359279667, 'samples': 7612992, 'steps': 39650, 'loss/train': 1.6981890201568604} 11/07/2021 02:48:12 - INFO - __main__ - Step 39652: {'lr': 0.0004243170597030729, 'samples': 7613184, 'steps': 39651, 'loss/train': 1.5480200052261353} 11/07/2021 02:48:12 - INFO - __main__ - Step 39653: {'lr': 0.0004243132557348045, 'samples': 7613376, 'steps': 39652, 'loss/train': 0.45958590507507324} 11/07/2021 02:48:13 - INFO - __main__ - Step 39654: {'lr': 0.00042430945168799326, 'samples': 7613568, 'steps': 39653, 'loss/train': 1.4481576681137085} 11/07/2021 02:48:14 - INFO - __main__ - Step 39655: {'lr': 0.000424305647562641, 'samples': 7613760, 'steps': 39654, 'loss/train': 1.632983684539795} 11/07/2021 02:48:14 - INFO - __main__ - Step 39656: {'lr': 0.00042430184335874924, 'samples': 7613952, 'steps': 39655, 'loss/train': 1.5062471628189087} 11/07/2021 02:48:15 - INFO - __main__ - Step 39657: {'lr': 0.0004242980390763197, 'samples': 7614144, 'steps': 39656, 'loss/train': 1.5032585859298706} 11/07/2021 02:48:15 - INFO - __main__ - Step 39658: {'lr': 0.0004242942347153542, 'samples': 7614336, 'steps': 39657, 'loss/train': 1.1589655876159668} 11/07/2021 02:48:15 - INFO - __main__ - Step 39659: {'lr': 0.00042429043027585435, 'samples': 7614528, 'steps': 39658, 'loss/train': 0.9075229167938232} 11/07/2021 02:48:16 - INFO - __main__ - Step 39660: {'lr': 0.000424286625757822, 'samples': 7614720, 'steps': 39659, 'loss/train': 1.7922099828720093} 11/07/2021 02:48:17 - INFO - __main__ - Step 39661: {'lr': 0.00042428282116125873, 'samples': 7614912, 'steps': 39660, 'loss/train': 1.5930705070495605} 11/07/2021 02:48:17 - INFO - __main__ - Step 39662: {'lr': 0.0004242790164861663, 'samples': 7615104, 'steps': 39661, 'loss/train': 1.6237225532531738} 11/07/2021 02:48:18 - INFO - __main__ - Step 39663: {'lr': 0.0004242752117325465, 'samples': 7615296, 'steps': 39662, 'loss/train': 1.5321625471115112} 11/07/2021 02:48:18 - INFO - __main__ - Step 39664: {'lr': 0.000424271406900401, 'samples': 7615488, 'steps': 39663, 'loss/train': 1.7925151586532593} 11/07/2021 02:48:18 - INFO - __main__ - Step 39665: {'lr': 0.0004242676019897314, 'samples': 7615680, 'steps': 39664, 'loss/train': 1.6339011192321777} 11/07/2021 02:48:19 - INFO - __main__ - Step 39666: {'lr': 0.00042426379700053954, 'samples': 7615872, 'steps': 39665, 'loss/train': 1.840277910232544} 11/07/2021 02:48:20 - INFO - __main__ - Step 39667: {'lr': 0.00042425999193282713, 'samples': 7616064, 'steps': 39666, 'loss/train': 1.262918472290039} 11/07/2021 02:48:20 - INFO - __main__ - Step 39668: {'lr': 0.0004242561867865958, 'samples': 7616256, 'steps': 39667, 'loss/train': 2.535017967224121} 11/07/2021 02:48:20 - INFO - __main__ - Step 39669: {'lr': 0.0004242523815618473, 'samples': 7616448, 'steps': 39668, 'loss/train': 1.4685289859771729} 11/07/2021 02:48:21 - INFO - __main__ - Step 39670: {'lr': 0.0004242485762585835, 'samples': 7616640, 'steps': 39669, 'loss/train': 1.8454334735870361} 11/07/2021 02:48:22 - INFO - __main__ - Step 39671: {'lr': 0.0004242447708768059, 'samples': 7616832, 'steps': 39670, 'loss/train': 0.6174976825714111} 11/07/2021 02:48:22 - INFO - __main__ - Step 39672: {'lr': 0.0004242409654165163, 'samples': 7617024, 'steps': 39671, 'loss/train': 1.0872056484222412} 11/07/2021 02:48:22 - INFO - __main__ - Step 39673: {'lr': 0.00042423715987771637, 'samples': 7617216, 'steps': 39672, 'loss/train': 1.2014278173446655} 11/07/2021 02:48:23 - INFO - __main__ - Step 39674: {'lr': 0.0004242333542604079, 'samples': 7617408, 'steps': 39673, 'loss/train': 1.5013810396194458} 11/07/2021 02:48:23 - INFO - __main__ - Step 39675: {'lr': 0.0004242295485645926, 'samples': 7617600, 'steps': 39674, 'loss/train': 1.7303569316864014} 11/07/2021 02:48:24 - INFO - __main__ - Step 39676: {'lr': 0.0004242257427902721, 'samples': 7617792, 'steps': 39675, 'loss/train': 2.2309410572052} 11/07/2021 02:48:25 - INFO - __main__ - Step 39677: {'lr': 0.00042422193693744827, 'samples': 7617984, 'steps': 39676, 'loss/train': 1.37846040725708} 11/07/2021 02:48:25 - INFO - __main__ - Step 39678: {'lr': 0.0004242181310061226, 'samples': 7618176, 'steps': 39677, 'loss/train': 0.8689545392990112} 11/07/2021 02:48:25 - INFO - __main__ - Step 39679: {'lr': 0.000424214324996297, 'samples': 7618368, 'steps': 39678, 'loss/train': 1.9211291074752808} 11/07/2021 02:48:26 - INFO - __main__ - Step 39680: {'lr': 0.000424210518907973, 'samples': 7618560, 'steps': 39679, 'loss/train': 1.9578702449798584} 11/07/2021 02:48:27 - INFO - __main__ - Step 39681: {'lr': 0.0004242067127411525, 'samples': 7618752, 'steps': 39680, 'loss/train': 1.311366319656372} 11/07/2021 02:48:27 - INFO - __main__ - Step 39682: {'lr': 0.0004242029064958372, 'samples': 7618944, 'steps': 39681, 'loss/train': 1.0614537000656128} 11/07/2021 02:48:27 - INFO - __main__ - Step 39683: {'lr': 0.0004241991001720287, 'samples': 7619136, 'steps': 39682, 'loss/train': 1.565509557723999} 11/07/2021 02:48:28 - INFO - __main__ - Step 39684: {'lr': 0.00042419529376972885, 'samples': 7619328, 'steps': 39683, 'loss/train': 1.3136285543441772} 11/07/2021 02:48:28 - INFO - __main__ - Step 39685: {'lr': 0.0004241914872889392, 'samples': 7619520, 'steps': 39684, 'loss/train': 1.6138375997543335} 11/07/2021 02:48:29 - INFO - __main__ - Step 39686: {'lr': 0.00042418768072966163, 'samples': 7619712, 'steps': 39685, 'loss/train': 1.621649980545044} 11/07/2021 02:48:29 - INFO - __main__ - Step 39687: {'lr': 0.0004241838740918977, 'samples': 7619904, 'steps': 39686, 'loss/train': 1.4917651414871216} 11/07/2021 02:48:30 - INFO - __main__ - Step 39688: {'lr': 0.00042418006737564924, 'samples': 7620096, 'steps': 39687, 'loss/train': 1.7346736192703247} 11/07/2021 02:48:30 - INFO - __main__ - Step 39689: {'lr': 0.0004241762605809179, 'samples': 7620288, 'steps': 39688, 'loss/train': 1.132972240447998} 11/07/2021 02:48:30 - INFO - __main__ - Step 39690: {'lr': 0.00042417245370770547, 'samples': 7620480, 'steps': 39689, 'loss/train': 1.6081323623657227} 11/07/2021 02:48:31 - INFO - __main__ - Step 39691: {'lr': 0.00042416864675601365, 'samples': 7620672, 'steps': 39690, 'loss/train': 1.4430608749389648} 11/07/2021 02:48:32 - INFO - __main__ - Step 39692: {'lr': 0.0004241648397258441, 'samples': 7620864, 'steps': 39691, 'loss/train': 1.5008991956710815} 11/07/2021 02:48:32 - INFO - __main__ - Step 39693: {'lr': 0.0004241610326171985, 'samples': 7621056, 'steps': 39692, 'loss/train': 1.5701279640197754} 11/07/2021 02:48:32 - INFO - __main__ - Step 39694: {'lr': 0.0004241572254300786, 'samples': 7621248, 'steps': 39693, 'loss/train': 0.6049160957336426} 11/07/2021 02:48:33 - INFO - __main__ - Step 39695: {'lr': 0.00042415341816448625, 'samples': 7621440, 'steps': 39694, 'loss/train': 1.5551038980484009} 11/07/2021 02:48:33 - INFO - __main__ - Step 39696: {'lr': 0.000424149610820423, 'samples': 7621632, 'steps': 39695, 'loss/train': 1.4827028512954712} 11/07/2021 02:48:34 - INFO - __main__ - Step 39697: {'lr': 0.00042414580339789065, 'samples': 7621824, 'steps': 39696, 'loss/train': 1.496288537979126} 11/07/2021 02:48:34 - INFO - __main__ - Step 39698: {'lr': 0.00042414199589689084, 'samples': 7622016, 'steps': 39697, 'loss/train': 1.3225457668304443} 11/07/2021 02:48:35 - INFO - __main__ - Step 39699: {'lr': 0.0004241381883174254, 'samples': 7622208, 'steps': 39698, 'loss/train': 1.345666527748108} 11/07/2021 02:48:35 - INFO - __main__ - Step 39700: {'lr': 0.00042413438065949595, 'samples': 7622400, 'steps': 39699, 'loss/train': 1.0294551849365234} 11/07/2021 02:48:35 - INFO - __main__ - Step 39701: {'lr': 0.0004241305729231042, 'samples': 7622592, 'steps': 39700, 'loss/train': 1.5337921380996704} 11/07/2021 02:48:37 - INFO - __main__ - Step 39702: {'lr': 0.00042412676510825197, 'samples': 7622784, 'steps': 39701, 'loss/train': 1.2330243587493896} 11/07/2021 02:48:37 - INFO - __main__ - Step 39703: {'lr': 0.00042412295721494086, 'samples': 7622976, 'steps': 39702, 'loss/train': 1.093245267868042} 11/07/2021 02:48:37 - INFO - __main__ - Step 39704: {'lr': 0.00042411914924317265, 'samples': 7623168, 'steps': 39703, 'loss/train': 1.3550559282302856} 11/07/2021 02:48:38 - INFO - __main__ - Step 39705: {'lr': 0.00042411534119294903, 'samples': 7623360, 'steps': 39704, 'loss/train': 1.4321473836898804} 11/07/2021 02:48:38 - INFO - __main__ - Step 39706: {'lr': 0.0004241115330642717, 'samples': 7623552, 'steps': 39705, 'loss/train': 1.345481514930725} 11/07/2021 02:48:39 - INFO - __main__ - Step 39707: {'lr': 0.0004241077248571424, 'samples': 7623744, 'steps': 39706, 'loss/train': 1.318536639213562} 11/07/2021 02:48:39 - INFO - __main__ - Step 39708: {'lr': 0.0004241039165715629, 'samples': 7623936, 'steps': 39707, 'loss/train': 1.6019287109375} 11/07/2021 02:48:40 - INFO - __main__ - Step 39709: {'lr': 0.00042410010820753485, 'samples': 7624128, 'steps': 39708, 'loss/train': 1.2462016344070435} 11/07/2021 02:48:40 - INFO - __main__ - Step 39710: {'lr': 0.00042409629976505994, 'samples': 7624320, 'steps': 39709, 'loss/train': 1.191588044166565} 11/07/2021 02:48:41 - INFO - __main__ - Step 39711: {'lr': 0.00042409249124414, 'samples': 7624512, 'steps': 39710, 'loss/train': 1.242682933807373} 11/07/2021 02:48:41 - INFO - __main__ - Step 39712: {'lr': 0.00042408868264477657, 'samples': 7624704, 'steps': 39711, 'loss/train': 1.5885920524597168} 11/07/2021 02:48:42 - INFO - __main__ - Step 39713: {'lr': 0.00042408487396697147, 'samples': 7624896, 'steps': 39712, 'loss/train': 1.158028483390808} 11/07/2021 02:48:42 - INFO - __main__ - Step 39714: {'lr': 0.0004240810652107265, 'samples': 7625088, 'steps': 39713, 'loss/train': 0.7398699522018433} 11/07/2021 02:48:43 - INFO - __main__ - Step 39715: {'lr': 0.0004240772563760432, 'samples': 7625280, 'steps': 39714, 'loss/train': 1.3761696815490723} 11/07/2021 02:48:43 - INFO - __main__ - Step 39716: {'lr': 0.00042407344746292345, 'samples': 7625472, 'steps': 39715, 'loss/train': 1.2336206436157227} 11/07/2021 02:48:44 - INFO - __main__ - Step 39717: {'lr': 0.00042406963847136883, 'samples': 7625664, 'steps': 39716, 'loss/train': 1.5653035640716553} 11/07/2021 02:48:44 - INFO - __main__ - Step 39718: {'lr': 0.0004240658294013812, 'samples': 7625856, 'steps': 39717, 'loss/train': 1.227238416671753} 11/07/2021 02:48:45 - INFO - __main__ - Step 39719: {'lr': 0.00042406202025296213, 'samples': 7626048, 'steps': 39718, 'loss/train': 1.6440476179122925} 11/07/2021 02:48:45 - INFO - __main__ - Step 39720: {'lr': 0.00042405821102611336, 'samples': 7626240, 'steps': 39719, 'loss/train': 1.7578482627868652} 11/07/2021 02:48:45 - INFO - __main__ - Step 39721: {'lr': 0.0004240544017208367, 'samples': 7626432, 'steps': 39720, 'loss/train': 1.5157376527786255} 11/07/2021 02:48:46 - INFO - __main__ - Step 39722: {'lr': 0.0004240505923371338, 'samples': 7626624, 'steps': 39721, 'loss/train': 1.4250826835632324} 11/07/2021 02:48:47 - INFO - __main__ - Step 39723: {'lr': 0.0004240467828750064, 'samples': 7626816, 'steps': 39722, 'loss/train': 1.5825248956680298} 11/07/2021 02:48:47 - INFO - __main__ - Step 39724: {'lr': 0.0004240429733344562, 'samples': 7627008, 'steps': 39723, 'loss/train': 2.1459078788757324} 11/07/2021 02:48:47 - INFO - __main__ - Step 39725: {'lr': 0.0004240391637154849, 'samples': 7627200, 'steps': 39724, 'loss/train': 1.8433314561843872} 11/07/2021 02:48:48 - INFO - __main__ - Step 39726: {'lr': 0.0004240353540180942, 'samples': 7627392, 'steps': 39725, 'loss/train': 2.6465625762939453} 11/07/2021 02:48:48 - INFO - __main__ - Step 39727: {'lr': 0.00042403154424228596, 'samples': 7627584, 'steps': 39726, 'loss/train': 0.6849703192710876} 11/07/2021 02:48:49 - INFO - __main__ - Step 39728: {'lr': 0.00042402773438806175, 'samples': 7627776, 'steps': 39727, 'loss/train': 1.4673601388931274} 11/07/2021 02:48:50 - INFO - __main__ - Step 39729: {'lr': 0.00042402392445542333, 'samples': 7627968, 'steps': 39728, 'loss/train': 1.9610875844955444} 11/07/2021 02:48:50 - INFO - __main__ - Step 39730: {'lr': 0.0004240201144443724, 'samples': 7628160, 'steps': 39729, 'loss/train': 1.4347823858261108} 11/07/2021 02:48:50 - INFO - __main__ - Step 39731: {'lr': 0.00042401630435491073, 'samples': 7628352, 'steps': 39730, 'loss/train': 1.7260664701461792} 11/07/2021 02:48:51 - INFO - __main__ - Step 39732: {'lr': 0.00042401249418703996, 'samples': 7628544, 'steps': 39731, 'loss/train': 1.3634346723556519} 11/07/2021 02:48:52 - INFO - __main__ - Step 39733: {'lr': 0.00042400868394076185, 'samples': 7628736, 'steps': 39732, 'loss/train': 1.4103978872299194} 11/07/2021 02:48:52 - INFO - __main__ - Step 39734: {'lr': 0.0004240048736160781, 'samples': 7628928, 'steps': 39733, 'loss/train': 1.412202000617981} 11/07/2021 02:48:52 - INFO - __main__ - Step 39735: {'lr': 0.0004240010632129905, 'samples': 7629120, 'steps': 39734, 'loss/train': 1.230157494544983} 11/07/2021 02:48:53 - INFO - __main__ - Step 39736: {'lr': 0.00042399725273150056, 'samples': 7629312, 'steps': 39735, 'loss/train': 1.8496798276901245} 11/07/2021 02:48:53 - INFO - __main__ - Step 39737: {'lr': 0.0004239934421716103, 'samples': 7629504, 'steps': 39736, 'loss/train': 1.4702967405319214} 11/07/2021 02:48:54 - INFO - __main__ - Step 39738: {'lr': 0.00042398963153332124, 'samples': 7629696, 'steps': 39737, 'loss/train': 2.123060464859009} 11/07/2021 02:48:55 - INFO - __main__ - Step 39739: {'lr': 0.00042398582081663513, 'samples': 7629888, 'steps': 39738, 'loss/train': 1.6703587770462036} 11/07/2021 02:48:55 - INFO - __main__ - Step 39740: {'lr': 0.0004239820100215537, 'samples': 7630080, 'steps': 39739, 'loss/train': 1.3782191276550293} 11/07/2021 02:48:55 - INFO - __main__ - Step 39741: {'lr': 0.00042397819914807855, 'samples': 7630272, 'steps': 39740, 'loss/train': 1.7067298889160156} 11/07/2021 02:48:56 - INFO - __main__ - Step 39742: {'lr': 0.00042397438819621164, 'samples': 7630464, 'steps': 39741, 'loss/train': 1.5283854007720947} 11/07/2021 02:48:56 - INFO - __main__ - Step 39743: {'lr': 0.0004239705771659545, 'samples': 7630656, 'steps': 39742, 'loss/train': 1.3885010480880737} 11/07/2021 02:48:57 - INFO - __main__ - Step 39744: {'lr': 0.000423966766057309, 'samples': 7630848, 'steps': 39743, 'loss/train': 1.6397227048873901} 11/07/2021 02:48:57 - INFO - __main__ - Step 39745: {'lr': 0.00042396295487027666, 'samples': 7631040, 'steps': 39744, 'loss/train': 2.084167003631592} 11/07/2021 02:48:58 - INFO - __main__ - Step 39746: {'lr': 0.0004239591436048593, 'samples': 7631232, 'steps': 39745, 'loss/train': 1.6590286493301392} 11/07/2021 02:48:58 - INFO - __main__ - Step 39747: {'lr': 0.0004239553322610586, 'samples': 7631424, 'steps': 39746, 'loss/train': 1.4515533447265625} 11/07/2021 02:48:58 - INFO - __main__ - Step 39748: {'lr': 0.0004239515208388764, 'samples': 7631616, 'steps': 39747, 'loss/train': 1.706516981124878} 11/07/2021 02:49:00 - INFO - __main__ - Step 39749: {'lr': 0.00042394770933831425, 'samples': 7631808, 'steps': 39748, 'loss/train': 1.4649534225463867} 11/07/2021 02:49:00 - INFO - __main__ - Step 39750: {'lr': 0.00042394389775937403, 'samples': 7632000, 'steps': 39749, 'loss/train': 1.754453182220459} 11/07/2021 02:49:00 - INFO - __main__ - Step 39751: {'lr': 0.0004239400861020574, 'samples': 7632192, 'steps': 39750, 'loss/train': 1.0220494270324707} 11/07/2021 02:49:01 - INFO - __main__ - Step 39752: {'lr': 0.00042393627436636597, 'samples': 7632384, 'steps': 39751, 'loss/train': 1.8266533613204956} 11/07/2021 02:49:01 - INFO - __main__ - Step 39753: {'lr': 0.0004239324625523015, 'samples': 7632576, 'steps': 39752, 'loss/train': 1.4264540672302246} 11/07/2021 02:49:01 - INFO - __main__ - Step 39754: {'lr': 0.00042392865065986573, 'samples': 7632768, 'steps': 39753, 'loss/train': 1.504227638244629} 11/07/2021 02:49:02 - INFO - __main__ - Step 39755: {'lr': 0.00042392483868906053, 'samples': 7632960, 'steps': 39754, 'loss/train': 1.9210301637649536} 11/07/2021 02:49:03 - INFO - __main__ - Step 39756: {'lr': 0.0004239210266398874, 'samples': 7633152, 'steps': 39755, 'loss/train': 1.643656849861145} 11/07/2021 02:49:03 - INFO - __main__ - Step 39757: {'lr': 0.0004239172145123481, 'samples': 7633344, 'steps': 39756, 'loss/train': 1.6792773008346558} 11/07/2021 02:49:03 - INFO - __main__ - Step 39758: {'lr': 0.0004239134023064445, 'samples': 7633536, 'steps': 39757, 'loss/train': 1.3707860708236694} 11/07/2021 02:49:04 - INFO - __main__ - Step 39759: {'lr': 0.0004239095900221781, 'samples': 7633728, 'steps': 39758, 'loss/train': 1.836814284324646} 11/07/2021 02:49:05 - INFO - __main__ - Step 39760: {'lr': 0.00042390577765955077, 'samples': 7633920, 'steps': 39759, 'loss/train': 1.3591474294662476} 11/07/2021 02:49:05 - INFO - __main__ - Step 39761: {'lr': 0.00042390196521856417, 'samples': 7634112, 'steps': 39760, 'loss/train': 1.8055552244186401} 11/07/2021 02:49:06 - INFO - __main__ - Step 39762: {'lr': 0.00042389815269922005, 'samples': 7634304, 'steps': 39761, 'loss/train': 1.6083821058273315} 11/07/2021 02:49:06 - INFO - __main__ - Step 39763: {'lr': 0.0004238943401015201, 'samples': 7634496, 'steps': 39762, 'loss/train': 1.406672716140747} 11/07/2021 02:49:06 - INFO - __main__ - Step 39764: {'lr': 0.0004238905274254661, 'samples': 7634688, 'steps': 39763, 'loss/train': 1.3256386518478394} 11/07/2021 02:49:07 - INFO - __main__ - Step 39765: {'lr': 0.0004238867146710596, 'samples': 7634880, 'steps': 39764, 'loss/train': 1.0616670846939087} 11/07/2021 02:49:08 - INFO - __main__ - Step 39766: {'lr': 0.0004238829018383025, 'samples': 7635072, 'steps': 39765, 'loss/train': 1.067506194114685} 11/07/2021 02:49:08 - INFO - __main__ - Step 39767: {'lr': 0.0004238790889271964, 'samples': 7635264, 'steps': 39766, 'loss/train': 1.3006850481033325} 11/07/2021 02:49:08 - INFO - __main__ - Step 39768: {'lr': 0.0004238752759377431, 'samples': 7635456, 'steps': 39767, 'loss/train': 1.544019103050232} 11/07/2021 02:49:09 - INFO - __main__ - Step 39769: {'lr': 0.0004238714628699443, 'samples': 7635648, 'steps': 39768, 'loss/train': 1.4632283449172974} 11/07/2021 02:49:10 - INFO - __main__ - Step 39770: {'lr': 0.00042386764972380164, 'samples': 7635840, 'steps': 39769, 'loss/train': 1.483447790145874} 11/07/2021 02:49:10 - INFO - __main__ - Step 39771: {'lr': 0.00042386383649931693, 'samples': 7636032, 'steps': 39770, 'loss/train': 1.311075210571289} 11/07/2021 02:49:10 - INFO - __main__ - Step 39772: {'lr': 0.00042386002319649184, 'samples': 7636224, 'steps': 39771, 'loss/train': 1.4680263996124268} 11/07/2021 02:49:11 - INFO - __main__ - Step 39773: {'lr': 0.0004238562098153281, 'samples': 7636416, 'steps': 39772, 'loss/train': 1.5938646793365479} 11/07/2021 02:49:11 - INFO - __main__ - Step 39774: {'lr': 0.0004238523963558275, 'samples': 7636608, 'steps': 39773, 'loss/train': 1.0459965467453003} 11/07/2021 02:49:12 - INFO - __main__ - Step 39775: {'lr': 0.0004238485828179917, 'samples': 7636800, 'steps': 39774, 'loss/train': 1.9813939332962036} 11/07/2021 02:49:12 - INFO - __main__ - Step 39776: {'lr': 0.00042384476920182234, 'samples': 7636992, 'steps': 39775, 'loss/train': 1.984336018562317} 11/07/2021 02:49:13 - INFO - __main__ - Step 39777: {'lr': 0.0004238409555073212, 'samples': 7637184, 'steps': 39776, 'loss/train': 1.0975687503814697} 11/07/2021 02:49:13 - INFO - __main__ - Step 39778: {'lr': 0.00042383714173449007, 'samples': 7637376, 'steps': 39777, 'loss/train': 1.3706766366958618} 11/07/2021 02:49:13 - INFO - __main__ - Step 39779: {'lr': 0.00042383332788333055, 'samples': 7637568, 'steps': 39778, 'loss/train': 1.4491437673568726} 11/07/2021 02:49:14 - INFO - __main__ - Step 39780: {'lr': 0.0004238295139538445, 'samples': 7637760, 'steps': 39779, 'loss/train': 1.1163655519485474} 11/07/2021 02:49:15 - INFO - __main__ - Step 39781: {'lr': 0.0004238256999460335, 'samples': 7637952, 'steps': 39780, 'loss/train': 1.6790413856506348} 11/07/2021 02:49:15 - INFO - __main__ - Step 39782: {'lr': 0.00042382188585989933, 'samples': 7638144, 'steps': 39781, 'loss/train': 1.1418988704681396} 11/07/2021 02:49:16 - INFO - __main__ - Step 39783: {'lr': 0.0004238180716954436, 'samples': 7638336, 'steps': 39782, 'loss/train': 1.5400171279907227} 11/07/2021 02:49:16 - INFO - __main__ - Step 39784: {'lr': 0.0004238142574526683, 'samples': 7638528, 'steps': 39783, 'loss/train': 1.503924012184143} 11/07/2021 02:49:16 - INFO - __main__ - Step 39785: {'lr': 0.0004238104431315749, 'samples': 7638720, 'steps': 39784, 'loss/train': 1.6826781034469604} 11/07/2021 02:49:18 - INFO - __main__ - Step 39786: {'lr': 0.00042380662873216517, 'samples': 7638912, 'steps': 39785, 'loss/train': 1.4812978506088257} 11/07/2021 02:49:18 - INFO - __main__ - Step 39787: {'lr': 0.00042380281425444087, 'samples': 7639104, 'steps': 39786, 'loss/train': 1.9437432289123535} 11/07/2021 02:49:18 - INFO - __main__ - Step 39788: {'lr': 0.0004237989996984037, 'samples': 7639296, 'steps': 39787, 'loss/train': 1.8986597061157227} 11/07/2021 02:49:19 - INFO - __main__ - Step 39789: {'lr': 0.0004237951850640555, 'samples': 7639488, 'steps': 39788, 'loss/train': 1.75411057472229} 11/07/2021 02:49:19 - INFO - __main__ - Step 39790: {'lr': 0.0004237913703513977, 'samples': 7639680, 'steps': 39789, 'loss/train': 2.432239532470703} 11/07/2021 02:49:20 - INFO - __main__ - Step 39791: {'lr': 0.00042378755556043225, 'samples': 7639872, 'steps': 39790, 'loss/train': 2.40262508392334} 11/07/2021 02:49:20 - INFO - __main__ - Step 39792: {'lr': 0.0004237837406911608, 'samples': 7640064, 'steps': 39791, 'loss/train': 1.3880094289779663} 11/07/2021 02:49:21 - INFO - __main__ - Step 39793: {'lr': 0.00042377992574358514, 'samples': 7640256, 'steps': 39792, 'loss/train': 1.0807527303695679} 11/07/2021 02:49:21 - INFO - __main__ - Step 39794: {'lr': 0.0004237761107177068, 'samples': 7640448, 'steps': 39793, 'loss/train': 1.2133203744888306} 11/07/2021 02:49:21 - INFO - __main__ - Step 39795: {'lr': 0.00042377229561352774, 'samples': 7640640, 'steps': 39794, 'loss/train': 0.7437110543251038} 11/07/2021 02:49:22 - INFO - __main__ - Step 39796: {'lr': 0.00042376848043104953, 'samples': 7640832, 'steps': 39795, 'loss/train': 1.7172738313674927} 11/07/2021 02:49:23 - INFO - __main__ - Step 39797: {'lr': 0.00042376466517027387, 'samples': 7641024, 'steps': 39796, 'loss/train': 1.2570029497146606} 11/07/2021 02:49:23 - INFO - __main__ - Step 39798: {'lr': 0.00042376084983120266, 'samples': 7641216, 'steps': 39797, 'loss/train': 1.6518914699554443} 11/07/2021 02:49:24 - INFO - __main__ - Step 39799: {'lr': 0.0004237570344138374, 'samples': 7641408, 'steps': 39798, 'loss/train': 1.1588325500488281} 11/07/2021 02:49:24 - INFO - __main__ - Step 39800: {'lr': 0.00042375321891818, 'samples': 7641600, 'steps': 39799, 'loss/train': 1.3438451290130615} 11/07/2021 02:49:24 - INFO - __main__ - Step 39801: {'lr': 0.00042374940334423194, 'samples': 7641792, 'steps': 39800, 'loss/train': 1.038809895515442} 11/07/2021 02:49:25 - INFO - __main__ - Step 39802: {'lr': 0.00042374558769199517, 'samples': 7641984, 'steps': 39801, 'loss/train': 1.4980303049087524} 11/07/2021 02:49:26 - INFO - __main__ - Step 39803: {'lr': 0.0004237417719614713, 'samples': 7642176, 'steps': 39802, 'loss/train': 1.2092492580413818} 11/07/2021 02:49:26 - INFO - __main__ - Step 39804: {'lr': 0.000423737956152662, 'samples': 7642368, 'steps': 39803, 'loss/train': 1.3670045137405396} 11/07/2021 02:49:26 - INFO - __main__ - Step 39805: {'lr': 0.0004237341402655692, 'samples': 7642560, 'steps': 39804, 'loss/train': 1.4463545083999634} 11/07/2021 02:49:27 - INFO - __main__ - Step 39806: {'lr': 0.00042373032430019443, 'samples': 7642752, 'steps': 39805, 'loss/train': 1.3591455221176147} 11/07/2021 02:49:28 - INFO - __main__ - Step 39807: {'lr': 0.00042372650825653937, 'samples': 7642944, 'steps': 39806, 'loss/train': 1.7041791677474976} 11/07/2021 02:49:28 - INFO - __main__ - Step 39808: {'lr': 0.0004237226921346059, 'samples': 7643136, 'steps': 39807, 'loss/train': 1.2431955337524414} 11/07/2021 02:49:28 - INFO - __main__ - Step 39809: {'lr': 0.0004237188759343956, 'samples': 7643328, 'steps': 39808, 'loss/train': 0.7306289076805115} 11/07/2021 02:49:29 - INFO - __main__ - Step 39810: {'lr': 0.0004237150596559103, 'samples': 7643520, 'steps': 39809, 'loss/train': 1.6761316061019897} 11/07/2021 02:49:29 - INFO - __main__ - Step 39811: {'lr': 0.00042371124329915167, 'samples': 7643712, 'steps': 39810, 'loss/train': 0.20795224606990814} 11/07/2021 02:49:30 - INFO - __main__ - Step 39812: {'lr': 0.0004237074268641215, 'samples': 7643904, 'steps': 39811, 'loss/train': 1.5500051975250244} 11/07/2021 02:49:31 - INFO - __main__ - Step 39813: {'lr': 0.00042370361035082136, 'samples': 7644096, 'steps': 39812, 'loss/train': 1.7766578197479248} 11/07/2021 02:49:31 - INFO - __main__ - Step 39814: {'lr': 0.000423699793759253, 'samples': 7644288, 'steps': 39813, 'loss/train': 1.3118033409118652} 11/07/2021 02:49:31 - INFO - __main__ - Step 39815: {'lr': 0.0004236959770894183, 'samples': 7644480, 'steps': 39814, 'loss/train': 1.6286885738372803} 11/07/2021 02:49:32 - INFO - __main__ - Step 39816: {'lr': 0.00042369216034131887, 'samples': 7644672, 'steps': 39815, 'loss/train': 1.7368838787078857} 11/07/2021 02:49:33 - INFO - __main__ - Step 39817: {'lr': 0.0004236883435149564, 'samples': 7644864, 'steps': 39816, 'loss/train': 1.5768975019454956} 11/07/2021 02:49:33 - INFO - __main__ - Step 39818: {'lr': 0.0004236845266103327, 'samples': 7645056, 'steps': 39817, 'loss/train': 1.579480528831482} 11/07/2021 02:49:33 - INFO - __main__ - Step 39819: {'lr': 0.00042368070962744937, 'samples': 7645248, 'steps': 39818, 'loss/train': 0.9204245805740356} 11/07/2021 02:49:34 - INFO - __main__ - Step 39820: {'lr': 0.0004236768925663082, 'samples': 7645440, 'steps': 39819, 'loss/train': 1.6000114679336548} 11/07/2021 02:49:34 - INFO - __main__ - Step 39821: {'lr': 0.0004236730754269109, 'samples': 7645632, 'steps': 39820, 'loss/train': 1.3265875577926636} 11/07/2021 02:49:35 - INFO - __main__ - Step 39822: {'lr': 0.00042366925820925915, 'samples': 7645824, 'steps': 39821, 'loss/train': 1.3526073694229126} 11/07/2021 02:49:36 - INFO - __main__ - Step 39823: {'lr': 0.0004236654409133548, 'samples': 7646016, 'steps': 39822, 'loss/train': 1.3544975519180298} 11/07/2021 02:49:36 - INFO - __main__ - Step 39824: {'lr': 0.0004236616235391995, 'samples': 7646208, 'steps': 39823, 'loss/train': 1.2097423076629639} 11/07/2021 02:49:36 - INFO - __main__ - Step 39825: {'lr': 0.0004236578060867949, 'samples': 7646400, 'steps': 39824, 'loss/train': 1.4299346208572388} 11/07/2021 02:49:37 - INFO - __main__ - Step 39826: {'lr': 0.0004236539885561427, 'samples': 7646592, 'steps': 39825, 'loss/train': 1.4713175296783447} 11/07/2021 02:49:38 - INFO - __main__ - Step 39827: {'lr': 0.0004236501709472448, 'samples': 7646784, 'steps': 39826, 'loss/train': 0.9976240396499634} 11/07/2021 02:49:38 - INFO - __main__ - Step 39828: {'lr': 0.00042364635326010277, 'samples': 7646976, 'steps': 39827, 'loss/train': 1.8949692249298096} 11/07/2021 02:49:38 - INFO - __main__ - Step 39829: {'lr': 0.0004236425354947183, 'samples': 7647168, 'steps': 39828, 'loss/train': 1.5358874797821045} 11/07/2021 02:49:39 - INFO - __main__ - Step 39830: {'lr': 0.0004236387176510933, 'samples': 7647360, 'steps': 39829, 'loss/train': 1.5187652111053467} 11/07/2021 02:49:39 - INFO - __main__ - Step 39831: {'lr': 0.00042363489972922937, 'samples': 7647552, 'steps': 39830, 'loss/train': 1.2138946056365967} 11/07/2021 02:49:40 - INFO - __main__ - Step 39832: {'lr': 0.00042363108172912824, 'samples': 7647744, 'steps': 39831, 'loss/train': 1.8676568269729614} 11/07/2021 02:49:40 - INFO - __main__ - Step 39833: {'lr': 0.0004236272636507915, 'samples': 7647936, 'steps': 39832, 'loss/train': 1.081512689590454} 11/07/2021 02:49:41 - INFO - __main__ - Step 39834: {'lr': 0.0004236234454942211, 'samples': 7648128, 'steps': 39833, 'loss/train': 1.7128548622131348} 11/07/2021 02:49:41 - INFO - __main__ - Step 39835: {'lr': 0.0004236196272594186, 'samples': 7648320, 'steps': 39834, 'loss/train': 1.5928826332092285} 11/07/2021 02:49:41 - INFO - __main__ - Step 39836: {'lr': 0.00042361580894638586, 'samples': 7648512, 'steps': 39835, 'loss/train': 1.5246357917785645} 11/07/2021 02:49:42 - INFO - __main__ - Step 39837: {'lr': 0.0004236119905551244, 'samples': 7648704, 'steps': 39836, 'loss/train': 1.721874713897705} 11/07/2021 02:49:43 - INFO - __main__ - Step 39838: {'lr': 0.0004236081720856362, 'samples': 7648896, 'steps': 39837, 'loss/train': 1.6592479944229126} 11/07/2021 02:49:43 - INFO - __main__ - Step 39839: {'lr': 0.0004236043535379227, 'samples': 7649088, 'steps': 39838, 'loss/train': 1.5822508335113525} 11/07/2021 02:49:43 - INFO - __main__ - Step 39840: {'lr': 0.0004236005349119858, 'samples': 7649280, 'steps': 39839, 'loss/train': 1.6861557960510254} 11/07/2021 02:49:44 - INFO - __main__ - Step 39841: {'lr': 0.0004235967162078272, 'samples': 7649472, 'steps': 39840, 'loss/train': 0.9433187246322632} 11/07/2021 02:49:44 - INFO - __main__ - Step 39842: {'lr': 0.0004235928974254486, 'samples': 7649664, 'steps': 39841, 'loss/train': 1.473461627960205} 11/07/2021 02:49:45 - INFO - __main__ - Step 39843: {'lr': 0.00042358907856485166, 'samples': 7649856, 'steps': 39842, 'loss/train': 1.4653620719909668} 11/07/2021 02:49:45 - INFO - __main__ - Step 39844: {'lr': 0.0004235852596260382, 'samples': 7650048, 'steps': 39843, 'loss/train': 1.425424575805664} 11/07/2021 02:49:46 - INFO - __main__ - Step 39845: {'lr': 0.0004235814406090099, 'samples': 7650240, 'steps': 39844, 'loss/train': 1.3061177730560303} 11/07/2021 02:49:46 - INFO - __main__ - Step 39846: {'lr': 0.0004235776215137686, 'samples': 7650432, 'steps': 39845, 'loss/train': 0.9308658242225647} 11/07/2021 02:49:47 - INFO - __main__ - Step 39847: {'lr': 0.0004235738023403157, 'samples': 7650624, 'steps': 39846, 'loss/train': 1.448547124862671} 11/07/2021 02:49:47 - INFO - __main__ - Step 39848: {'lr': 0.00042356998308865323, 'samples': 7650816, 'steps': 39847, 'loss/train': 2.0486652851104736} 11/07/2021 02:49:48 - INFO - __main__ - Step 39849: {'lr': 0.00042356616375878274, 'samples': 7651008, 'steps': 39848, 'loss/train': 1.8110203742980957} 11/07/2021 02:49:48 - INFO - __main__ - Step 39850: {'lr': 0.00042356234435070604, 'samples': 7651200, 'steps': 39849, 'loss/train': 1.2810955047607422} 11/07/2021 02:49:49 - INFO - __main__ - Step 39851: {'lr': 0.0004235585248644249, 'samples': 7651392, 'steps': 39850, 'loss/train': 1.4850274324417114} 11/07/2021 02:49:49 - INFO - __main__ - Step 39852: {'lr': 0.0004235547052999409, 'samples': 7651584, 'steps': 39851, 'loss/train': 1.3674862384796143} 11/07/2021 02:49:50 - INFO - __main__ - Step 39853: {'lr': 0.00042355088565725584, 'samples': 7651776, 'steps': 39852, 'loss/train': 1.4045926332473755} 11/07/2021 02:49:50 - INFO - __main__ - Step 39854: {'lr': 0.0004235470659363714, 'samples': 7651968, 'steps': 39853, 'loss/train': 1.8996695280075073} 11/07/2021 02:49:51 - INFO - __main__ - Step 39855: {'lr': 0.0004235432461372894, 'samples': 7652160, 'steps': 39854, 'loss/train': 1.3678069114685059} 11/07/2021 02:49:51 - INFO - __main__ - Step 39856: {'lr': 0.0004235394262600114, 'samples': 7652352, 'steps': 39855, 'loss/train': 1.801565170288086} 11/07/2021 02:49:51 - INFO - __main__ - Step 39857: {'lr': 0.0004235356063045393, 'samples': 7652544, 'steps': 39856, 'loss/train': 0.8615394234657288} 11/07/2021 02:49:52 - INFO - __main__ - Step 39858: {'lr': 0.0004235317862708747, 'samples': 7652736, 'steps': 39857, 'loss/train': 1.7578961849212646} 11/07/2021 02:49:53 - INFO - __main__ - Step 39859: {'lr': 0.00042352796615901937, 'samples': 7652928, 'steps': 39858, 'loss/train': 1.4292079210281372} 11/07/2021 02:49:53 - INFO - __main__ - Step 39860: {'lr': 0.000423524145968975, 'samples': 7653120, 'steps': 39859, 'loss/train': 1.4792885780334473} 11/07/2021 02:49:53 - INFO - __main__ - Step 39861: {'lr': 0.00042352032570074327, 'samples': 7653312, 'steps': 39860, 'loss/train': 1.4000568389892578} 11/07/2021 02:49:54 - INFO - __main__ - Step 39862: {'lr': 0.00042351650535432607, 'samples': 7653504, 'steps': 39861, 'loss/train': 1.2337876558303833} 11/07/2021 02:49:55 - INFO - __main__ - Step 39863: {'lr': 0.00042351268492972494, 'samples': 7653696, 'steps': 39862, 'loss/train': 1.6213442087173462} 11/07/2021 02:49:55 - INFO - __main__ - Step 39864: {'lr': 0.0004235088644269417, 'samples': 7653888, 'steps': 39863, 'loss/train': 1.136330246925354} 11/07/2021 02:49:56 - INFO - __main__ - Step 39865: {'lr': 0.00042350504384597803, 'samples': 7654080, 'steps': 39864, 'loss/train': 1.6284946203231812} 11/07/2021 02:49:56 - INFO - __main__ - Step 39866: {'lr': 0.0004235012231868357, 'samples': 7654272, 'steps': 39865, 'loss/train': 1.7181075811386108} 11/07/2021 02:49:56 - INFO - __main__ - Step 39867: {'lr': 0.0004234974024495163, 'samples': 7654464, 'steps': 39866, 'loss/train': 0.9383137822151184} 11/07/2021 02:49:57 - INFO - __main__ - Step 39868: {'lr': 0.00042349358163402175, 'samples': 7654656, 'steps': 39867, 'loss/train': 2.038564920425415} 11/07/2021 02:49:58 - INFO - __main__ - Step 39869: {'lr': 0.0004234897607403536, 'samples': 7654848, 'steps': 39868, 'loss/train': 1.292812705039978} 11/07/2021 02:49:58 - INFO - __main__ - Step 39870: {'lr': 0.0004234859397685137, 'samples': 7655040, 'steps': 39869, 'loss/train': 1.2042925357818604} 11/07/2021 02:49:58 - INFO - __main__ - Step 39871: {'lr': 0.0004234821187185036, 'samples': 7655232, 'steps': 39870, 'loss/train': 1.6875336170196533} 11/07/2021 02:49:59 - INFO - __main__ - Step 39872: {'lr': 0.0004234782975903253, 'samples': 7655424, 'steps': 39871, 'loss/train': 1.6400891542434692} 11/07/2021 02:49:59 - INFO - __main__ - Step 39873: {'lr': 0.00042347447638398024, 'samples': 7655616, 'steps': 39872, 'loss/train': 1.468469500541687} 11/07/2021 02:50:00 - INFO - __main__ - Step 39874: {'lr': 0.00042347065509947023, 'samples': 7655808, 'steps': 39873, 'loss/train': 1.1016398668289185} 11/07/2021 02:50:00 - INFO - __main__ - Step 39875: {'lr': 0.0004234668337367971, 'samples': 7656000, 'steps': 39874, 'loss/train': 1.3569303750991821} 11/07/2021 02:50:01 - INFO - __main__ - Step 39876: {'lr': 0.0004234630122959625, 'samples': 7656192, 'steps': 39875, 'loss/train': 1.2579153776168823} 11/07/2021 02:50:01 - INFO - __main__ - Step 39877: {'lr': 0.0004234591907769681, 'samples': 7656384, 'steps': 39876, 'loss/train': 1.3981540203094482} 11/07/2021 02:50:01 - INFO - __main__ - Step 39878: {'lr': 0.0004234553691798156, 'samples': 7656576, 'steps': 39877, 'loss/train': 1.766584873199463} 11/07/2021 02:50:03 - INFO - __main__ - Step 39879: {'lr': 0.000423451547504507, 'samples': 7656768, 'steps': 39878, 'loss/train': 1.8074575662612915} 11/07/2021 02:50:03 - INFO - __main__ - Step 39880: {'lr': 0.0004234477257510436, 'samples': 7656960, 'steps': 39879, 'loss/train': 1.5272505283355713} 11/07/2021 02:50:03 - INFO - __main__ - Step 39881: {'lr': 0.00042344390391942745, 'samples': 7657152, 'steps': 39880, 'loss/train': 1.7444243431091309} 11/07/2021 02:50:04 - INFO - __main__ - Step 39882: {'lr': 0.0004234400820096601, 'samples': 7657344, 'steps': 39881, 'loss/train': 1.6279277801513672} 11/07/2021 02:50:04 - INFO - __main__ - Step 39883: {'lr': 0.0004234362600217433, 'samples': 7657536, 'steps': 39882, 'loss/train': 1.5940405130386353} 11/07/2021 02:50:05 - INFO - __main__ - Step 39884: {'lr': 0.0004234324379556789, 'samples': 7657728, 'steps': 39883, 'loss/train': 1.443777322769165} 11/07/2021 02:50:05 - INFO - __main__ - Step 39885: {'lr': 0.0004234286158114684, 'samples': 7657920, 'steps': 39884, 'loss/train': 1.5703567266464233} 11/07/2021 02:50:06 - INFO - __main__ - Step 39886: {'lr': 0.0004234247935891137, 'samples': 7658112, 'steps': 39885, 'loss/train': 1.7871206998825073} 11/07/2021 02:50:06 - INFO - __main__ - Step 39887: {'lr': 0.00042342097128861647, 'samples': 7658304, 'steps': 39886, 'loss/train': 1.869883418083191} 11/07/2021 02:50:06 - INFO - __main__ - Step 39888: {'lr': 0.0004234171489099784, 'samples': 7658496, 'steps': 39887, 'loss/train': 1.3405208587646484} 11/07/2021 02:50:07 - INFO - __main__ - Step 39889: {'lr': 0.00042341332645320126, 'samples': 7658688, 'steps': 39888, 'loss/train': 1.503919243812561} 11/07/2021 02:50:08 - INFO - __main__ - Step 39890: {'lr': 0.0004234095039182867, 'samples': 7658880, 'steps': 39889, 'loss/train': 0.8980141282081604} 11/07/2021 02:50:08 - INFO - __main__ - Step 39891: {'lr': 0.00042340568130523653, 'samples': 7659072, 'steps': 39890, 'loss/train': 1.4018449783325195} 11/07/2021 02:50:08 - INFO - __main__ - Step 39892: {'lr': 0.0004234018586140525, 'samples': 7659264, 'steps': 39891, 'loss/train': 1.499969482421875} 11/07/2021 02:50:09 - INFO - __main__ - Step 39893: {'lr': 0.00042339803584473626, 'samples': 7659456, 'steps': 39892, 'loss/train': 1.4663480520248413} 11/07/2021 02:50:09 - INFO - __main__ - Step 39894: {'lr': 0.0004233942129972894, 'samples': 7659648, 'steps': 39893, 'loss/train': 1.5299068689346313} 11/07/2021 02:50:10 - INFO - __main__ - Step 39895: {'lr': 0.00042339039007171386, 'samples': 7659840, 'steps': 39894, 'loss/train': 2.0362446308135986} 11/07/2021 02:50:11 - INFO - __main__ - Step 39896: {'lr': 0.00042338656706801135, 'samples': 7660032, 'steps': 39895, 'loss/train': 1.2440659999847412} 11/07/2021 02:50:11 - INFO - __main__ - Step 39897: {'lr': 0.00042338274398618346, 'samples': 7660224, 'steps': 39896, 'loss/train': 1.6488834619522095} 11/07/2021 02:50:11 - INFO - __main__ - Step 39898: {'lr': 0.000423378920826232, 'samples': 7660416, 'steps': 39897, 'loss/train': 3.15498423576355} 11/07/2021 02:50:12 - INFO - __main__ - Step 39899: {'lr': 0.0004233750975881587, 'samples': 7660608, 'steps': 39898, 'loss/train': 1.5884058475494385} 11/07/2021 02:50:13 - INFO - __main__ - Step 39900: {'lr': 0.0004233712742719652, 'samples': 7660800, 'steps': 39899, 'loss/train': 1.2452188730239868} 11/07/2021 02:50:13 - INFO - __main__ - Step 39901: {'lr': 0.0004233674508776533, 'samples': 7660992, 'steps': 39900, 'loss/train': 1.4315701723098755} 11/07/2021 02:50:13 - INFO - __main__ - Step 39902: {'lr': 0.00042336362740522473, 'samples': 7661184, 'steps': 39901, 'loss/train': 1.4178969860076904} 11/07/2021 02:50:14 - INFO - __main__ - Step 39903: {'lr': 0.0004233598038546812, 'samples': 7661376, 'steps': 39902, 'loss/train': 1.5171977281570435} 11/07/2021 02:50:14 - INFO - __main__ - Step 39904: {'lr': 0.0004233559802260244, 'samples': 7661568, 'steps': 39903, 'loss/train': 1.2969151735305786} 11/07/2021 02:50:15 - INFO - __main__ - Step 39905: {'lr': 0.000423352156519256, 'samples': 7661760, 'steps': 39904, 'loss/train': 0.9684048891067505} 11/07/2021 02:50:16 - INFO - __main__ - Step 39906: {'lr': 0.0004233483327343779, 'samples': 7661952, 'steps': 39905, 'loss/train': 1.567678451538086} 11/07/2021 02:50:16 - INFO - __main__ - Step 39907: {'lr': 0.0004233445088713916, 'samples': 7662144, 'steps': 39906, 'loss/train': 0.30363214015960693} 11/07/2021 02:50:16 - INFO - __main__ - Step 39908: {'lr': 0.000423340684930299, 'samples': 7662336, 'steps': 39907, 'loss/train': 1.196516752243042} 11/07/2021 02:50:17 - INFO - __main__ - Step 39909: {'lr': 0.0004233368609111018, 'samples': 7662528, 'steps': 39908, 'loss/train': 1.5008447170257568} 11/07/2021 02:50:18 - INFO - __main__ - Step 39910: {'lr': 0.00042333303681380165, 'samples': 7662720, 'steps': 39909, 'loss/train': 1.294662594795227} 11/07/2021 02:50:18 - INFO - __main__ - Step 39911: {'lr': 0.0004233292126384003, 'samples': 7662912, 'steps': 39910, 'loss/train': 1.9347686767578125} 11/07/2021 02:50:18 - INFO - __main__ - Step 39912: {'lr': 0.00042332538838489955, 'samples': 7663104, 'steps': 39911, 'loss/train': 1.5069063901901245} 11/07/2021 02:50:19 - INFO - __main__ - Step 39913: {'lr': 0.0004233215640533009, 'samples': 7663296, 'steps': 39912, 'loss/train': 1.3123886585235596} 11/07/2021 02:50:19 - INFO - __main__ - Step 39914: {'lr': 0.0004233177396436064, 'samples': 7663488, 'steps': 39913, 'loss/train': 1.651315689086914} 11/07/2021 02:50:19 - INFO - __main__ - Step 39915: {'lr': 0.00042331391515581753, 'samples': 7663680, 'steps': 39914, 'loss/train': 1.380843997001648} 11/07/2021 02:50:20 - INFO - __main__ - Step 39916: {'lr': 0.00042331009058993604, 'samples': 7663872, 'steps': 39915, 'loss/train': 0.7182490825653076} 11/07/2021 02:50:21 - INFO - __main__ - Step 39917: {'lr': 0.00042330626594596374, 'samples': 7664064, 'steps': 39916, 'loss/train': 1.447251319885254} 11/07/2021 02:50:21 - INFO - __main__ - Step 39918: {'lr': 0.00042330244122390227, 'samples': 7664256, 'steps': 39917, 'loss/train': 1.3649463653564453} 11/07/2021 02:50:21 - INFO - __main__ - Step 39919: {'lr': 0.00042329861642375347, 'samples': 7664448, 'steps': 39918, 'loss/train': 1.5415047407150269} 11/07/2021 02:50:22 - INFO - __main__ - Step 39920: {'lr': 0.00042329479154551897, 'samples': 7664640, 'steps': 39919, 'loss/train': 1.8297853469848633} 11/07/2021 02:50:23 - INFO - __main__ - Step 39921: {'lr': 0.0004232909665892005, 'samples': 7664832, 'steps': 39920, 'loss/train': 0.19549989700317383} 11/07/2021 02:50:23 - INFO - __main__ - Step 39922: {'lr': 0.00042328714155479973, 'samples': 7665024, 'steps': 39921, 'loss/train': 2.007741689682007} 11/07/2021 02:50:23 - INFO - __main__ - Step 39923: {'lr': 0.0004232833164423185, 'samples': 7665216, 'steps': 39922, 'loss/train': 0.6382672786712646} 11/07/2021 02:50:24 - INFO - __main__ - Step 39924: {'lr': 0.00042327949125175844, 'samples': 7665408, 'steps': 39923, 'loss/train': 1.445731282234192} 11/07/2021 02:50:24 - INFO - __main__ - Step 39925: {'lr': 0.0004232756659831214, 'samples': 7665600, 'steps': 39924, 'loss/train': 2.180142641067505} 11/07/2021 02:50:25 - INFO - __main__ - Step 39926: {'lr': 0.000423271840636409, 'samples': 7665792, 'steps': 39925, 'loss/train': 1.5678489208221436} 11/07/2021 02:50:25 - INFO - __main__ - Step 39927: {'lr': 0.00042326801521162295, 'samples': 7665984, 'steps': 39926, 'loss/train': 1.4961837530136108} 11/07/2021 02:50:26 - INFO - __main__ - Step 39928: {'lr': 0.000423264189708765, 'samples': 7666176, 'steps': 39927, 'loss/train': 1.2989650964736938} 11/07/2021 02:50:26 - INFO - __main__ - Step 39929: {'lr': 0.0004232603641278369, 'samples': 7666368, 'steps': 39928, 'loss/train': 0.9029120802879333} 11/07/2021 02:50:26 - INFO - __main__ - Step 39930: {'lr': 0.00042325653846884037, 'samples': 7666560, 'steps': 39929, 'loss/train': 1.4112350940704346} 11/07/2021 02:50:28 - INFO - __main__ - Step 39931: {'lr': 0.00042325271273177707, 'samples': 7666752, 'steps': 39930, 'loss/train': 1.0451306104660034} 11/07/2021 02:50:28 - INFO - __main__ - Step 39932: {'lr': 0.0004232488869166488, 'samples': 7666944, 'steps': 39931, 'loss/train': 1.7143346071243286} 11/07/2021 02:50:28 - INFO - __main__ - Step 39933: {'lr': 0.0004232450610234573, 'samples': 7667136, 'steps': 39932, 'loss/train': 1.4348359107971191} 11/07/2021 02:50:29 - INFO - __main__ - Step 39934: {'lr': 0.00042324123505220414, 'samples': 7667328, 'steps': 39933, 'loss/train': 1.8266007900238037} 11/07/2021 02:50:29 - INFO - __main__ - Step 39935: {'lr': 0.0004232374090028912, 'samples': 7667520, 'steps': 39934, 'loss/train': 1.6461167335510254} 11/07/2021 02:50:30 - INFO - __main__ - Step 39936: {'lr': 0.00042323358287552017, 'samples': 7667712, 'steps': 39935, 'loss/train': 1.599936842918396} 11/07/2021 02:50:30 - INFO - __main__ - Step 39937: {'lr': 0.0004232297566700928, 'samples': 7667904, 'steps': 39936, 'loss/train': 1.6643856763839722} 11/07/2021 02:50:31 - INFO - __main__ - Step 39938: {'lr': 0.00042322593038661074, 'samples': 7668096, 'steps': 39937, 'loss/train': 1.547898769378662} 11/07/2021 02:50:31 - INFO - __main__ - Step 39939: {'lr': 0.0004232221040250758, 'samples': 7668288, 'steps': 39938, 'loss/train': 1.211232304573059} 11/07/2021 02:50:31 - INFO - __main__ - Step 39940: {'lr': 0.00042321827758548953, 'samples': 7668480, 'steps': 39939, 'loss/train': 1.2328039407730103} 11/07/2021 02:50:32 - INFO - __main__ - Step 39941: {'lr': 0.00042321445106785385, 'samples': 7668672, 'steps': 39940, 'loss/train': 1.5373095273971558} 11/07/2021 02:50:33 - INFO - __main__ - Step 39942: {'lr': 0.0004232106244721704, 'samples': 7668864, 'steps': 39941, 'loss/train': 1.4850554466247559} 11/07/2021 02:50:33 - INFO - __main__ - Step 39943: {'lr': 0.0004232067977984409, 'samples': 7669056, 'steps': 39942, 'loss/train': 1.7178964614868164} 11/07/2021 02:50:33 - INFO - __main__ - Step 39944: {'lr': 0.0004232029710466671, 'samples': 7669248, 'steps': 39943, 'loss/train': 1.740206003189087} 11/07/2021 02:50:34 - INFO - __main__ - Step 39945: {'lr': 0.00042319914421685067, 'samples': 7669440, 'steps': 39944, 'loss/train': 1.894906997680664} 11/07/2021 02:50:35 - INFO - __main__ - Step 39946: {'lr': 0.0004231953173089935, 'samples': 7669632, 'steps': 39945, 'loss/train': 0.6198977828025818} 11/07/2021 02:50:35 - INFO - __main__ - Step 39947: {'lr': 0.00042319149032309713, 'samples': 7669824, 'steps': 39946, 'loss/train': 1.033416986465454} 11/07/2021 02:50:36 - INFO - __main__ - Step 39948: {'lr': 0.00042318766325916336, 'samples': 7670016, 'steps': 39947, 'loss/train': 1.8405121564865112} 11/07/2021 02:50:36 - INFO - __main__ - Step 39949: {'lr': 0.00042318383611719386, 'samples': 7670208, 'steps': 39948, 'loss/train': 0.4601535201072693} 11/07/2021 02:50:36 - INFO - __main__ - Step 39950: {'lr': 0.00042318000889719044, 'samples': 7670400, 'steps': 39949, 'loss/train': 1.8714054822921753} 11/07/2021 02:50:37 - INFO - __main__ - Step 39951: {'lr': 0.0004231761815991547, 'samples': 7670592, 'steps': 39950, 'loss/train': 0.18940521776676178} 11/07/2021 02:50:38 - INFO - __main__ - Step 39952: {'lr': 0.0004231723542230885, 'samples': 7670784, 'steps': 39951, 'loss/train': 1.4010276794433594} 11/07/2021 02:50:38 - INFO - __main__ - Step 39953: {'lr': 0.0004231685267689935, 'samples': 7670976, 'steps': 39952, 'loss/train': 1.1936516761779785} 11/07/2021 02:50:38 - INFO - __main__ - Step 39954: {'lr': 0.0004231646992368715, 'samples': 7671168, 'steps': 39953, 'loss/train': 1.1189135313034058} 11/07/2021 02:50:39 - INFO - __main__ - Step 39955: {'lr': 0.00042316087162672415, 'samples': 7671360, 'steps': 39954, 'loss/train': 1.3866944313049316} 11/07/2021 02:50:40 - INFO - __main__ - Step 39956: {'lr': 0.0004231570439385531, 'samples': 7671552, 'steps': 39955, 'loss/train': 2.2703516483306885} 11/07/2021 02:50:40 - INFO - __main__ - Step 39957: {'lr': 0.0004231532161723602, 'samples': 7671744, 'steps': 39956, 'loss/train': 1.7356328964233398} 11/07/2021 02:50:41 - INFO - __main__ - Step 39958: {'lr': 0.0004231493883281471, 'samples': 7671936, 'steps': 39957, 'loss/train': 1.8447068929672241} 11/07/2021 02:50:41 - INFO - __main__ - Step 39959: {'lr': 0.00042314556040591567, 'samples': 7672128, 'steps': 39958, 'loss/train': 1.6826385259628296} 11/07/2021 02:50:41 - INFO - __main__ - Step 39960: {'lr': 0.0004231417324056674, 'samples': 7672320, 'steps': 39959, 'loss/train': 1.5038126707077026} 11/07/2021 02:50:42 - INFO - __main__ - Step 39961: {'lr': 0.00042313790432740416, 'samples': 7672512, 'steps': 39960, 'loss/train': 1.671659231185913} 11/07/2021 02:50:43 - INFO - __main__ - Step 39962: {'lr': 0.00042313407617112765, 'samples': 7672704, 'steps': 39961, 'loss/train': 1.5262469053268433} 11/07/2021 02:50:43 - INFO - __main__ - Step 39963: {'lr': 0.00042313024793683965, 'samples': 7672896, 'steps': 39962, 'loss/train': 1.3900421857833862} 11/07/2021 02:50:43 - INFO - __main__ - Step 39964: {'lr': 0.0004231264196245418, 'samples': 7673088, 'steps': 39963, 'loss/train': 0.8114240765571594} 11/07/2021 02:50:44 - INFO - __main__ - Step 39965: {'lr': 0.00042312259123423584, 'samples': 7673280, 'steps': 39964, 'loss/train': 1.3081835508346558} 11/07/2021 02:50:44 - INFO - __main__ - Step 39966: {'lr': 0.00042311876276592355, 'samples': 7673472, 'steps': 39965, 'loss/train': 1.5169175863265991} 11/07/2021 02:50:45 - INFO - __main__ - Step 39967: {'lr': 0.00042311493421960656, 'samples': 7673664, 'steps': 39966, 'loss/train': 1.569330096244812} 11/07/2021 02:50:45 - INFO - __main__ - Step 39968: {'lr': 0.0004231111055952867, 'samples': 7673856, 'steps': 39967, 'loss/train': 1.5341840982437134} 11/07/2021 02:50:46 - INFO - __main__ - Step 39969: {'lr': 0.00042310727689296563, 'samples': 7674048, 'steps': 39968, 'loss/train': 0.4142704904079437} 11/07/2021 02:50:46 - INFO - __main__ - Step 39970: {'lr': 0.0004231034481126451, 'samples': 7674240, 'steps': 39969, 'loss/train': 1.8657910823822021} 11/07/2021 02:50:46 - INFO - __main__ - Step 39971: {'lr': 0.0004230996192543268, 'samples': 7674432, 'steps': 39970, 'loss/train': 1.4104971885681152} 11/07/2021 02:50:48 - INFO - __main__ - Step 39972: {'lr': 0.0004230957903180125, 'samples': 7674624, 'steps': 39971, 'loss/train': 0.7122201919555664} 11/07/2021 02:50:48 - INFO - __main__ - Step 39973: {'lr': 0.00042309196130370396, 'samples': 7674816, 'steps': 39972, 'loss/train': 1.4492918252944946} 11/07/2021 02:50:48 - INFO - __main__ - Step 39974: {'lr': 0.00042308813221140275, 'samples': 7675008, 'steps': 39973, 'loss/train': 1.5777790546417236} 11/07/2021 02:50:49 - INFO - __main__ - Step 39975: {'lr': 0.00042308430304111076, 'samples': 7675200, 'steps': 39974, 'loss/train': 1.6017565727233887} 11/07/2021 02:50:49 - INFO - __main__ - Step 39976: {'lr': 0.00042308047379282967, 'samples': 7675392, 'steps': 39975, 'loss/train': 2.0214884281158447} 11/07/2021 02:50:50 - INFO - __main__ - Step 39977: {'lr': 0.00042307664446656116, 'samples': 7675584, 'steps': 39976, 'loss/train': 1.8707165718078613} 11/07/2021 02:50:50 - INFO - __main__ - Step 39978: {'lr': 0.000423072815062307, 'samples': 7675776, 'steps': 39977, 'loss/train': 1.5159759521484375} 11/07/2021 02:50:51 - INFO - __main__ - Step 39979: {'lr': 0.0004230689855800689, 'samples': 7675968, 'steps': 39978, 'loss/train': 1.1753299236297607} 11/07/2021 02:50:51 - INFO - __main__ - Step 39980: {'lr': 0.0004230651560198486, 'samples': 7676160, 'steps': 39979, 'loss/train': 1.4214001893997192} 11/07/2021 02:50:51 - INFO - __main__ - Step 39981: {'lr': 0.0004230613263816478, 'samples': 7676352, 'steps': 39980, 'loss/train': 1.6573978662490845} 11/07/2021 02:50:52 - INFO - __main__ - Step 39982: {'lr': 0.0004230574966654682, 'samples': 7676544, 'steps': 39981, 'loss/train': 1.2650606632232666} 11/07/2021 02:50:53 - INFO - __main__ - Step 39983: {'lr': 0.0004230536668713116, 'samples': 7676736, 'steps': 39982, 'loss/train': 1.456331491470337} 11/07/2021 02:50:53 - INFO - __main__ - Step 39984: {'lr': 0.00042304983699917965, 'samples': 7676928, 'steps': 39983, 'loss/train': 1.0845017433166504} 11/07/2021 02:50:53 - INFO - __main__ - Step 39985: {'lr': 0.00042304600704907416, 'samples': 7677120, 'steps': 39984, 'loss/train': 1.2177273035049438} 11/07/2021 02:50:54 - INFO - __main__ - Step 39986: {'lr': 0.0004230421770209968, 'samples': 7677312, 'steps': 39985, 'loss/train': 1.704351782798767} 11/07/2021 02:50:55 - INFO - __main__ - Step 39987: {'lr': 0.0004230383469149493, 'samples': 7677504, 'steps': 39986, 'loss/train': 1.204689860343933} 11/07/2021 02:50:55 - INFO - __main__ - Step 39988: {'lr': 0.0004230345167309334, 'samples': 7677696, 'steps': 39987, 'loss/train': 1.5267856121063232} 11/07/2021 02:50:55 - INFO - __main__ - Step 39989: {'lr': 0.00042303068646895077, 'samples': 7677888, 'steps': 39988, 'loss/train': 1.614537000656128} 11/07/2021 02:50:56 - INFO - __main__ - Step 39990: {'lr': 0.0004230268561290032, 'samples': 7678080, 'steps': 39989, 'loss/train': 1.267861247062683} 11/07/2021 02:50:56 - INFO - __main__ - Step 39991: {'lr': 0.0004230230257110924, 'samples': 7678272, 'steps': 39990, 'loss/train': 0.7846195101737976} 11/07/2021 02:50:57 - INFO - __main__ - Step 39992: {'lr': 0.00042301919521522014, 'samples': 7678464, 'steps': 39991, 'loss/train': 2.2181107997894287} 11/07/2021 02:50:57 - INFO - __main__ - Step 39993: {'lr': 0.0004230153646413881, 'samples': 7678656, 'steps': 39992, 'loss/train': 1.6313539743423462} 11/07/2021 02:50:58 - INFO - __main__ - Step 39994: {'lr': 0.000423011533989598, 'samples': 7678848, 'steps': 39993, 'loss/train': 1.669006586074829} 11/07/2021 02:50:58 - INFO - __main__ - Step 39995: {'lr': 0.0004230077032598515, 'samples': 7679040, 'steps': 39994, 'loss/train': 2.0103037357330322} 11/07/2021 02:50:59 - INFO - __main__ - Step 39996: {'lr': 0.00042300387245215043, 'samples': 7679232, 'steps': 39995, 'loss/train': 1.4996882677078247} 11/07/2021 02:50:59 - INFO - __main__ - Step 39997: {'lr': 0.00042300004156649654, 'samples': 7679424, 'steps': 39996, 'loss/train': 1.631992220878601} 11/07/2021 02:51:00 - INFO - __main__ - Step 39998: {'lr': 0.0004229962106028914, 'samples': 7679616, 'steps': 39997, 'loss/train': 5.746405601501465} 11/07/2021 02:51:00 - INFO - __main__ - Step 39999: {'lr': 0.0004229923795613369, 'samples': 7679808, 'steps': 39998, 'loss/train': 1.672818899154663} 11/07/2021 02:51:01 - INFO - __main__ - Step 40000: {'lr': 0.00042298854844183476, 'samples': 7680000, 'steps': 39999, 'loss/train': 0.5506977438926697} 11/07/2021 02:51:01 - INFO - __main__ - Step 40001: {'lr': 0.0004229847172443866, 'samples': 7680192, 'steps': 40000, 'loss/train': 1.474384069442749} 11/07/2021 02:51:01 - INFO - __main__ - Step 40002: {'lr': 0.0004229808859689941, 'samples': 7680384, 'steps': 40001, 'loss/train': 0.9800273776054382} 11/07/2021 02:51:02 - INFO - __main__ - Step 40003: {'lr': 0.0004229770546156592, 'samples': 7680576, 'steps': 40002, 'loss/train': 1.1151974201202393} 11/07/2021 02:51:03 - INFO - __main__ - Step 40004: {'lr': 0.00042297322318438345, 'samples': 7680768, 'steps': 40003, 'loss/train': 1.5029839277267456} 11/07/2021 02:51:03 - INFO - __main__ - Step 40005: {'lr': 0.0004229693916751687, 'samples': 7680960, 'steps': 40004, 'loss/train': 1.0324509143829346} 11/07/2021 02:51:04 - INFO - __main__ - Step 40006: {'lr': 0.00042296556008801663, 'samples': 7681152, 'steps': 40005, 'loss/train': 1.5770862102508545} 11/07/2021 02:51:04 - INFO - __main__ - Step 40007: {'lr': 0.0004229617284229289, 'samples': 7681344, 'steps': 40006, 'loss/train': 1.2624397277832031} 11/07/2021 02:51:04 - INFO - __main__ - Step 40008: {'lr': 0.00042295789667990726, 'samples': 7681536, 'steps': 40007, 'loss/train': 1.0099983215332031} 11/07/2021 02:51:05 - INFO - __main__ - Step 40009: {'lr': 0.00042295406485895346, 'samples': 7681728, 'steps': 40008, 'loss/train': 0.8953605890274048} 11/07/2021 02:51:06 - INFO - __main__ - Step 40010: {'lr': 0.0004229502329600692, 'samples': 7681920, 'steps': 40009, 'loss/train': 1.3491777181625366} 11/07/2021 02:51:06 - INFO - __main__ - Step 40011: {'lr': 0.0004229464009832563, 'samples': 7682112, 'steps': 40010, 'loss/train': 1.033262014389038} 11/07/2021 02:51:06 - INFO - __main__ - Step 40012: {'lr': 0.0004229425689285163, 'samples': 7682304, 'steps': 40011, 'loss/train': 1.3582239151000977} 11/07/2021 02:51:07 - INFO - __main__ - Step 40013: {'lr': 0.00042293873679585125, 'samples': 7682496, 'steps': 40012, 'loss/train': 1.4142736196517944} 11/07/2021 02:51:08 - INFO - __main__ - Step 40014: {'lr': 0.00042293490458526257, 'samples': 7682688, 'steps': 40013, 'loss/train': 1.2864668369293213} 11/07/2021 02:51:08 - INFO - __main__ - Step 40015: {'lr': 0.0004229310722967521, 'samples': 7682880, 'steps': 40014, 'loss/train': 1.7777926921844482} 11/07/2021 02:51:09 - INFO - __main__ - Step 40016: {'lr': 0.00042292723993032157, 'samples': 7683072, 'steps': 40015, 'loss/train': 0.7587130069732666} 11/07/2021 02:51:09 - INFO - __main__ - Step 40017: {'lr': 0.0004229234074859726, 'samples': 7683264, 'steps': 40016, 'loss/train': 1.6694526672363281} 11/07/2021 02:51:09 - INFO - __main__ - Step 40018: {'lr': 0.00042291957496370713, 'samples': 7683456, 'steps': 40017, 'loss/train': 1.007763385772705} 11/07/2021 02:51:10 - INFO - __main__ - Step 40019: {'lr': 0.0004229157423635267, 'samples': 7683648, 'steps': 40018, 'loss/train': 1.8507840633392334} 11/07/2021 02:51:11 - INFO - __main__ - Step 40020: {'lr': 0.00042291190968543315, 'samples': 7683840, 'steps': 40019, 'loss/train': 1.412222981452942} 11/07/2021 02:51:11 - INFO - __main__ - Step 40021: {'lr': 0.0004229080769294281, 'samples': 7684032, 'steps': 40020, 'loss/train': 1.9351446628570557} 11/07/2021 02:51:11 - INFO - __main__ - Step 40022: {'lr': 0.00042290424409551343, 'samples': 7684224, 'steps': 40021, 'loss/train': 1.4295390844345093} 11/07/2021 02:51:12 - INFO - __main__ - Step 40023: {'lr': 0.0004229004111836907, 'samples': 7684416, 'steps': 40022, 'loss/train': 1.340185523033142} 11/07/2021 02:51:12 - INFO - __main__ - Step 40024: {'lr': 0.0004228965781939617, 'samples': 7684608, 'steps': 40023, 'loss/train': 1.2513066530227661} 11/07/2021 02:51:13 - INFO - __main__ - Step 40025: {'lr': 0.00042289274512632817, 'samples': 7684800, 'steps': 40024, 'loss/train': 1.5568289756774902} 11/07/2021 02:51:13 - INFO - __main__ - Step 40026: {'lr': 0.00042288891198079194, 'samples': 7684992, 'steps': 40025, 'loss/train': 1.7078965902328491} 11/07/2021 02:51:14 - INFO - __main__ - Step 40027: {'lr': 0.00042288507875735455, 'samples': 7685184, 'steps': 40026, 'loss/train': 1.5051108598709106} 11/07/2021 02:51:14 - INFO - __main__ - Step 40028: {'lr': 0.0004228812454560178, 'samples': 7685376, 'steps': 40027, 'loss/train': 1.5787116289138794} 11/07/2021 02:51:14 - INFO - __main__ - Step 40029: {'lr': 0.0004228774120767835, 'samples': 7685568, 'steps': 40028, 'loss/train': 1.5780994892120361} 11/07/2021 02:51:16 - INFO - __main__ - Step 40030: {'lr': 0.00042287357861965326, 'samples': 7685760, 'steps': 40029, 'loss/train': 1.6224216222763062} 11/07/2021 02:51:17 - INFO - __main__ - Step 40031: {'lr': 0.00042286974508462885, 'samples': 7685952, 'steps': 40030, 'loss/train': 1.374077320098877} 11/07/2021 02:51:17 - INFO - __main__ - Step 40032: {'lr': 0.000422865911471712, 'samples': 7686144, 'steps': 40031, 'loss/train': 1.3748855590820312} 11/07/2021 02:51:17 - INFO - __main__ - Step 40033: {'lr': 0.00042286207778090447, 'samples': 7686336, 'steps': 40032, 'loss/train': 1.6635338068008423} 11/07/2021 02:51:18 - INFO - __main__ - Step 40034: {'lr': 0.00042285824401220787, 'samples': 7686528, 'steps': 40033, 'loss/train': 1.7877252101898193} 11/07/2021 02:51:18 - INFO - __main__ - Step 40035: {'lr': 0.0004228544101656241, 'samples': 7686720, 'steps': 40034, 'loss/train': 2.0744729042053223} 11/07/2021 02:51:18 - INFO - __main__ - Step 40036: {'lr': 0.00042285057624115473, 'samples': 7686912, 'steps': 40035, 'loss/train': 1.5142874717712402} 11/07/2021 02:51:19 - INFO - __main__ - Step 40037: {'lr': 0.0004228467422388016, 'samples': 7687104, 'steps': 40036, 'loss/train': 1.9819977283477783} 11/07/2021 02:51:20 - INFO - __main__ - Step 40038: {'lr': 0.0004228429081585664, 'samples': 7687296, 'steps': 40037, 'loss/train': 1.2760931253433228} 11/07/2021 02:51:20 - INFO - __main__ - Step 40039: {'lr': 0.00042283907400045084, 'samples': 7687488, 'steps': 40038, 'loss/train': 1.119479775428772} 11/07/2021 02:51:20 - INFO - __main__ - Step 40040: {'lr': 0.0004228352397644567, 'samples': 7687680, 'steps': 40039, 'loss/train': 1.8740752935409546} 11/07/2021 02:51:21 - INFO - __main__ - Step 40041: {'lr': 0.0004228314054505856, 'samples': 7687872, 'steps': 40040, 'loss/train': 2.0900299549102783} 11/07/2021 02:51:22 - INFO - __main__ - Step 40042: {'lr': 0.0004228275710588394, 'samples': 7688064, 'steps': 40041, 'loss/train': 1.1090223789215088} 11/07/2021 02:51:22 - INFO - __main__ - Step 40043: {'lr': 0.0004228237365892197, 'samples': 7688256, 'steps': 40042, 'loss/train': 1.7269837856292725} 11/07/2021 02:51:22 - INFO - __main__ - Step 40044: {'lr': 0.00042281990204172837, 'samples': 7688448, 'steps': 40043, 'loss/train': 1.4572423696517944} 11/07/2021 02:51:23 - INFO - __main__ - Step 40045: {'lr': 0.000422816067416367, 'samples': 7688640, 'steps': 40044, 'loss/train': 1.5706082582473755} 11/07/2021 02:51:23 - INFO - __main__ - Step 40046: {'lr': 0.00042281223271313734, 'samples': 7688832, 'steps': 40045, 'loss/train': 1.2158195972442627} 11/07/2021 02:51:24 - INFO - __main__ - Step 40047: {'lr': 0.0004228083979320412, 'samples': 7689024, 'steps': 40046, 'loss/train': 1.4072970151901245} 11/07/2021 02:51:25 - INFO - __main__ - Step 40048: {'lr': 0.00042280456307308034, 'samples': 7689216, 'steps': 40047, 'loss/train': 1.435637354850769} 11/07/2021 02:51:25 - INFO - __main__ - Step 40049: {'lr': 0.0004228007281362563, 'samples': 7689408, 'steps': 40048, 'loss/train': 1.7101787328720093} 11/07/2021 02:51:25 - INFO - __main__ - Step 40050: {'lr': 0.0004227968931215709, 'samples': 7689600, 'steps': 40049, 'loss/train': 1.7233060598373413} 11/07/2021 02:51:26 - INFO - __main__ - Step 40051: {'lr': 0.000422793058029026, 'samples': 7689792, 'steps': 40050, 'loss/train': 1.6261519193649292} 11/07/2021 02:51:26 - INFO - __main__ - Step 40052: {'lr': 0.0004227892228586231, 'samples': 7689984, 'steps': 40051, 'loss/train': 1.8170970678329468} 11/07/2021 02:51:27 - INFO - __main__ - Step 40053: {'lr': 0.0004227853876103641, 'samples': 7690176, 'steps': 40052, 'loss/train': 1.5851227045059204} 11/07/2021 02:51:27 - INFO - __main__ - Step 40054: {'lr': 0.0004227815522842507, 'samples': 7690368, 'steps': 40053, 'loss/train': 1.2808752059936523} 11/07/2021 02:51:28 - INFO - __main__ - Step 40055: {'lr': 0.00042277771688028457, 'samples': 7690560, 'steps': 40054, 'loss/train': 0.8093523979187012} 11/07/2021 02:51:28 - INFO - __main__ - Step 40056: {'lr': 0.0004227738813984675, 'samples': 7690752, 'steps': 40055, 'loss/train': 1.4373022317886353} 11/07/2021 02:51:28 - INFO - __main__ - Step 40057: {'lr': 0.00042277004583880106, 'samples': 7690944, 'steps': 40056, 'loss/train': 1.190751075744629} 11/07/2021 02:51:29 - INFO - __main__ - Step 40058: {'lr': 0.00042276621020128724, 'samples': 7691136, 'steps': 40057, 'loss/train': 1.1999469995498657} 11/07/2021 02:51:30 - INFO - __main__ - Step 40059: {'lr': 0.0004227623744859276, 'samples': 7691328, 'steps': 40058, 'loss/train': 1.6112170219421387} 11/07/2021 02:51:30 - INFO - __main__ - Step 40060: {'lr': 0.0004227585386927239, 'samples': 7691520, 'steps': 40059, 'loss/train': 1.305436372756958} 11/07/2021 02:51:30 - INFO - __main__ - Step 40061: {'lr': 0.0004227547028216778, 'samples': 7691712, 'steps': 40060, 'loss/train': 1.8070495128631592} 11/07/2021 02:51:31 - INFO - __main__ - Step 40062: {'lr': 0.00042275086687279116, 'samples': 7691904, 'steps': 40061, 'loss/train': 1.6936426162719727} 11/07/2021 02:51:32 - INFO - __main__ - Step 40063: {'lr': 0.0004227470308460657, 'samples': 7692096, 'steps': 40062, 'loss/train': 1.5447250604629517} 11/07/2021 02:51:32 - INFO - __main__ - Step 40064: {'lr': 0.000422743194741503, 'samples': 7692288, 'steps': 40063, 'loss/train': 1.3440648317337036} 11/07/2021 02:51:33 - INFO - __main__ - Step 40065: {'lr': 0.00042273935855910487, 'samples': 7692480, 'steps': 40064, 'loss/train': 1.8710362911224365} 11/07/2021 02:51:33 - INFO - __main__ - Step 40066: {'lr': 0.00042273552229887313, 'samples': 7692672, 'steps': 40065, 'loss/train': 1.440974473953247} 11/07/2021 02:51:33 - INFO - __main__ - Step 40067: {'lr': 0.00042273168596080934, 'samples': 7692864, 'steps': 40066, 'loss/train': 1.5752909183502197} 11/07/2021 02:51:34 - INFO - __main__ - Step 40068: {'lr': 0.0004227278495449154, 'samples': 7693056, 'steps': 40067, 'loss/train': 1.2198938131332397} 11/07/2021 02:51:35 - INFO - __main__ - Step 40069: {'lr': 0.0004227240130511929, 'samples': 7693248, 'steps': 40068, 'loss/train': 0.8190735578536987} 11/07/2021 02:51:35 - INFO - __main__ - Step 40070: {'lr': 0.0004227201764796437, 'samples': 7693440, 'steps': 40069, 'loss/train': 1.6285350322723389} 11/07/2021 02:51:35 - INFO - __main__ - Step 40071: {'lr': 0.00042271633983026935, 'samples': 7693632, 'steps': 40070, 'loss/train': 1.406765341758728} 11/07/2021 02:51:36 - INFO - __main__ - Step 40072: {'lr': 0.00042271250310307174, 'samples': 7693824, 'steps': 40071, 'loss/train': 1.2084912061691284} 11/07/2021 02:51:37 - INFO - __main__ - Step 40073: {'lr': 0.0004227086662980525, 'samples': 7694016, 'steps': 40072, 'loss/train': 1.4927854537963867} 11/07/2021 02:51:37 - INFO - __main__ - Step 40074: {'lr': 0.00042270482941521347, 'samples': 7694208, 'steps': 40073, 'loss/train': 1.2841883897781372} 11/07/2021 02:51:37 - INFO - __main__ - Step 40075: {'lr': 0.0004227009924545563, 'samples': 7694400, 'steps': 40074, 'loss/train': 1.3613510131835938} 11/07/2021 02:51:38 - INFO - __main__ - Step 40076: {'lr': 0.00042269715541608265, 'samples': 7694592, 'steps': 40075, 'loss/train': 0.6684977412223816} 11/07/2021 02:51:38 - INFO - __main__ - Step 40077: {'lr': 0.0004226933182997944, 'samples': 7694784, 'steps': 40076, 'loss/train': 1.1968653202056885} 11/07/2021 02:51:39 - INFO - __main__ - Step 40078: {'lr': 0.00042268948110569317, 'samples': 7694976, 'steps': 40077, 'loss/train': 1.1509777307510376} 11/07/2021 02:51:40 - INFO - __main__ - Step 40079: {'lr': 0.00042268564383378073, 'samples': 7695168, 'steps': 40078, 'loss/train': 1.282145380973816} 11/07/2021 02:51:40 - INFO - __main__ - Step 40080: {'lr': 0.00042268180648405884, 'samples': 7695360, 'steps': 40079, 'loss/train': 1.6849303245544434} 11/07/2021 02:51:40 - INFO - __main__ - Step 40081: {'lr': 0.00042267796905652924, 'samples': 7695552, 'steps': 40080, 'loss/train': 1.3607251644134521} 11/07/2021 02:51:41 - INFO - __main__ - Step 40082: {'lr': 0.0004226741315511935, 'samples': 7695744, 'steps': 40081, 'loss/train': 1.2177730798721313} 11/07/2021 02:51:42 - INFO - __main__ - Step 40083: {'lr': 0.00042267029396805345, 'samples': 7695936, 'steps': 40082, 'loss/train': 1.407575011253357} 11/07/2021 02:51:42 - INFO - __main__ - Step 40084: {'lr': 0.0004226664563071109, 'samples': 7696128, 'steps': 40083, 'loss/train': 0.9506791234016418} 11/07/2021 02:51:42 - INFO - __main__ - Step 40085: {'lr': 0.0004226626185683675, 'samples': 7696320, 'steps': 40084, 'loss/train': 1.4657553434371948} 11/07/2021 02:51:43 - INFO - __main__ - Step 40086: {'lr': 0.00042265878075182497, 'samples': 7696512, 'steps': 40085, 'loss/train': 1.5130643844604492} 11/07/2021 02:51:43 - INFO - __main__ - Step 40087: {'lr': 0.0004226549428574851, 'samples': 7696704, 'steps': 40086, 'loss/train': 1.7923390865325928} 11/07/2021 02:51:44 - INFO - __main__ - Step 40088: {'lr': 0.0004226511048853495, 'samples': 7696896, 'steps': 40087, 'loss/train': 1.7920674085617065} 11/07/2021 02:51:44 - INFO - __main__ - Step 40089: {'lr': 0.00042264726683542, 'samples': 7697088, 'steps': 40088, 'loss/train': 1.546301007270813} 11/07/2021 02:51:45 - INFO - __main__ - Step 40090: {'lr': 0.00042264342870769835, 'samples': 7697280, 'steps': 40089, 'loss/train': 1.045566439628601} 11/07/2021 02:51:45 - INFO - __main__ - Step 40091: {'lr': 0.0004226395905021862, 'samples': 7697472, 'steps': 40090, 'loss/train': 1.8828248977661133} 11/07/2021 02:51:45 - INFO - __main__ - Step 40092: {'lr': 0.0004226357522188853, 'samples': 7697664, 'steps': 40091, 'loss/train': 1.5058112144470215} 11/07/2021 02:51:46 - INFO - __main__ - Step 40093: {'lr': 0.0004226319138577974, 'samples': 7697856, 'steps': 40092, 'loss/train': 1.7309236526489258} 11/07/2021 02:51:47 - INFO - __main__ - Step 40094: {'lr': 0.0004226280754189243, 'samples': 7698048, 'steps': 40093, 'loss/train': 1.5191304683685303} 11/07/2021 02:51:47 - INFO - __main__ - Step 40095: {'lr': 0.0004226242369022676, 'samples': 7698240, 'steps': 40094, 'loss/train': 1.419395089149475} 11/07/2021 02:51:48 - INFO - __main__ - Step 40096: {'lr': 0.00042262039830782906, 'samples': 7698432, 'steps': 40095, 'loss/train': 0.8884421586990356} 11/07/2021 02:51:48 - INFO - __main__ - Step 40097: {'lr': 0.00042261655963561043, 'samples': 7698624, 'steps': 40096, 'loss/train': 1.6845612525939941} 11/07/2021 02:51:48 - INFO - __main__ - Step 40098: {'lr': 0.0004226127208856134, 'samples': 7698816, 'steps': 40097, 'loss/train': 1.6713262796401978} 11/07/2021 02:51:49 - INFO - __main__ - Step 40099: {'lr': 0.0004226088820578399, 'samples': 7699008, 'steps': 40098, 'loss/train': 1.7476264238357544} 11/07/2021 02:51:49 - INFO - __main__ - Step 40100: {'lr': 0.00042260504315229136, 'samples': 7699200, 'steps': 40099, 'loss/train': 1.2733943462371826} 11/07/2021 02:51:50 - INFO - __main__ - Step 40101: {'lr': 0.00042260120416896975, 'samples': 7699392, 'steps': 40100, 'loss/train': 1.2876131534576416} 11/07/2021 02:51:50 - INFO - __main__ - Step 40102: {'lr': 0.0004225973651078766, 'samples': 7699584, 'steps': 40101, 'loss/train': 1.56428861618042} 11/07/2021 02:51:51 - INFO - __main__ - Step 40103: {'lr': 0.0004225935259690138, 'samples': 7699776, 'steps': 40102, 'loss/train': 1.4476832151412964} 11/07/2021 02:51:52 - INFO - __main__ - Step 40104: {'lr': 0.00042258968675238295, 'samples': 7699968, 'steps': 40103, 'loss/train': 1.598578929901123} 11/07/2021 02:51:52 - INFO - __main__ - Step 40105: {'lr': 0.00042258584745798595, 'samples': 7700160, 'steps': 40104, 'loss/train': 1.4063727855682373} 11/07/2021 02:51:52 - INFO - __main__ - Step 40106: {'lr': 0.00042258200808582434, 'samples': 7700352, 'steps': 40105, 'loss/train': 1.5410844087600708} 11/07/2021 02:51:53 - INFO - __main__ - Step 40107: {'lr': 0.00042257816863590006, 'samples': 7700544, 'steps': 40106, 'loss/train': 1.3327895402908325} 11/07/2021 02:51:53 - INFO - __main__ - Step 40108: {'lr': 0.0004225743291082146, 'samples': 7700736, 'steps': 40107, 'loss/train': 1.3556232452392578} 11/07/2021 02:51:54 - INFO - __main__ - Step 40109: {'lr': 0.0004225704895027699, 'samples': 7700928, 'steps': 40108, 'loss/train': 1.637282133102417} 11/07/2021 02:51:54 - INFO - __main__ - Step 40110: {'lr': 0.0004225666498195675, 'samples': 7701120, 'steps': 40109, 'loss/train': 1.1232974529266357} 11/07/2021 02:51:55 - INFO - __main__ - Step 40111: {'lr': 0.0004225628100586093, 'samples': 7701312, 'steps': 40110, 'loss/train': 1.66632080078125} 11/07/2021 02:51:55 - INFO - __main__ - Step 40112: {'lr': 0.00042255897021989695, 'samples': 7701504, 'steps': 40111, 'loss/train': 1.9927699565887451} 11/07/2021 02:51:55 - INFO - __main__ - Step 40113: {'lr': 0.0004225551303034322, 'samples': 7701696, 'steps': 40112, 'loss/train': 1.6477985382080078} 11/07/2021 02:51:56 - INFO - __main__ - Step 40114: {'lr': 0.00042255129030921673, 'samples': 7701888, 'steps': 40113, 'loss/train': 1.6133326292037964} 11/07/2021 02:51:57 - INFO - __main__ - Step 40115: {'lr': 0.0004225474502372524, 'samples': 7702080, 'steps': 40114, 'loss/train': 1.1240192651748657} 11/07/2021 02:51:57 - INFO - __main__ - Step 40116: {'lr': 0.00042254361008754076, 'samples': 7702272, 'steps': 40115, 'loss/train': 0.5389991998672485} 11/07/2021 02:51:57 - INFO - __main__ - Step 40117: {'lr': 0.0004225397698600837, 'samples': 7702464, 'steps': 40116, 'loss/train': 1.5630342960357666} 11/07/2021 02:51:58 - INFO - __main__ - Step 40118: {'lr': 0.0004225359295548828, 'samples': 7702656, 'steps': 40117, 'loss/train': 2.005375623703003} 11/07/2021 02:51:58 - INFO - __main__ - Step 40119: {'lr': 0.0004225320891719399, 'samples': 7702848, 'steps': 40118, 'loss/train': 1.270519733428955} 11/07/2021 02:51:59 - INFO - __main__ - Step 40120: {'lr': 0.0004225282487112567, 'samples': 7703040, 'steps': 40119, 'loss/train': 1.010699987411499} 11/07/2021 02:52:00 - INFO - __main__ - Step 40121: {'lr': 0.000422524408172835, 'samples': 7703232, 'steps': 40120, 'loss/train': 1.0963070392608643} 11/07/2021 02:52:00 - INFO - __main__ - Step 40122: {'lr': 0.0004225205675566765, 'samples': 7703424, 'steps': 40121, 'loss/train': 1.7472093105316162} 11/07/2021 02:52:00 - INFO - __main__ - Step 40123: {'lr': 0.00042251672686278275, 'samples': 7703616, 'steps': 40122, 'loss/train': 1.510049819946289} 11/07/2021 02:52:01 - INFO - __main__ - Step 40124: {'lr': 0.0004225128860911557, 'samples': 7703808, 'steps': 40123, 'loss/train': 1.5101922750473022} 11/07/2021 02:52:02 - INFO - __main__ - Step 40125: {'lr': 0.00042250904524179697, 'samples': 7704000, 'steps': 40124, 'loss/train': 1.3982422351837158} 11/07/2021 02:52:02 - INFO - __main__ - Step 40126: {'lr': 0.00042250520431470827, 'samples': 7704192, 'steps': 40125, 'loss/train': 0.7441375851631165} 11/07/2021 02:52:02 - INFO - __main__ - Step 40127: {'lr': 0.00042250136330989154, 'samples': 7704384, 'steps': 40126, 'loss/train': 1.6617443561553955} 11/07/2021 02:52:03 - INFO - __main__ - Step 40128: {'lr': 0.00042249752222734826, 'samples': 7704576, 'steps': 40127, 'loss/train': 2.0717756748199463} 11/07/2021 02:52:03 - INFO - __main__ - Step 40129: {'lr': 0.0004224936810670803, 'samples': 7704768, 'steps': 40128, 'loss/train': 1.5525445938110352} 11/07/2021 02:52:04 - INFO - __main__ - Step 40130: {'lr': 0.0004224898398290893, 'samples': 7704960, 'steps': 40129, 'loss/train': 1.2647972106933594} 11/07/2021 02:52:05 - INFO - __main__ - Step 40131: {'lr': 0.0004224859985133771, 'samples': 7705152, 'steps': 40130, 'loss/train': 1.3948974609375} 11/07/2021 02:52:05 - INFO - __main__ - Step 40132: {'lr': 0.0004224821571199453, 'samples': 7705344, 'steps': 40131, 'loss/train': 1.5660887956619263} 11/07/2021 02:52:05 - INFO - __main__ - Step 40133: {'lr': 0.0004224783156487958, 'samples': 7705536, 'steps': 40132, 'loss/train': 1.4071072340011597} 11/07/2021 02:52:06 - INFO - __main__ - Step 40134: {'lr': 0.0004224744740999302, 'samples': 7705728, 'steps': 40133, 'loss/train': 1.0452232360839844} 11/07/2021 02:52:07 - INFO - __main__ - Step 40135: {'lr': 0.0004224706324733502, 'samples': 7705920, 'steps': 40134, 'loss/train': 1.9273301362991333} 11/07/2021 02:52:07 - INFO - __main__ - Step 40136: {'lr': 0.00042246679076905763, 'samples': 7706112, 'steps': 40135, 'loss/train': 1.8715548515319824} 11/07/2021 02:52:07 - INFO - __main__ - Step 40137: {'lr': 0.00042246294898705416, 'samples': 7706304, 'steps': 40136, 'loss/train': 1.7802073955535889} 11/07/2021 02:52:08 - INFO - __main__ - Step 40138: {'lr': 0.0004224591071273416, 'samples': 7706496, 'steps': 40137, 'loss/train': 1.529614806175232} 11/07/2021 02:52:08 - INFO - __main__ - Step 40139: {'lr': 0.00042245526518992164, 'samples': 7706688, 'steps': 40138, 'loss/train': 1.316225528717041} 11/07/2021 02:52:09 - INFO - __main__ - Step 40140: {'lr': 0.0004224514231747959, 'samples': 7706880, 'steps': 40139, 'loss/train': 1.9425132274627686} 11/07/2021 02:52:09 - INFO - __main__ - Step 40141: {'lr': 0.00042244758108196635, 'samples': 7707072, 'steps': 40140, 'loss/train': 1.6781806945800781} 11/07/2021 02:52:10 - INFO - __main__ - Step 40142: {'lr': 0.00042244373891143453, 'samples': 7707264, 'steps': 40141, 'loss/train': 3.466099739074707} 11/07/2021 02:52:10 - INFO - __main__ - Step 40143: {'lr': 0.00042243989666320217, 'samples': 7707456, 'steps': 40142, 'loss/train': 1.2785730361938477} 11/07/2021 02:52:10 - INFO - __main__ - Step 40144: {'lr': 0.00042243605433727106, 'samples': 7707648, 'steps': 40143, 'loss/train': 1.637807011604309} 11/07/2021 02:52:11 - INFO - __main__ - Step 40145: {'lr': 0.0004224322119336429, 'samples': 7707840, 'steps': 40144, 'loss/train': 1.394010305404663} 11/07/2021 02:52:12 - INFO - __main__ - Step 40146: {'lr': 0.0004224283694523195, 'samples': 7708032, 'steps': 40145, 'loss/train': 1.388095498085022} 11/07/2021 02:52:13 - INFO - __main__ - Step 40147: {'lr': 0.0004224245268933025, 'samples': 7708224, 'steps': 40146, 'loss/train': 1.3383760452270508} 11/07/2021 02:52:13 - INFO - __main__ - Step 40148: {'lr': 0.0004224206842565937, 'samples': 7708416, 'steps': 40147, 'loss/train': 1.5303736925125122} 11/07/2021 02:52:13 - INFO - __main__ - Step 40149: {'lr': 0.0004224168415421948, 'samples': 7708608, 'steps': 40148, 'loss/train': 1.2203428745269775} 11/07/2021 02:52:14 - INFO - __main__ - Step 40150: {'lr': 0.0004224129987501075, 'samples': 7708800, 'steps': 40149, 'loss/train': 1.2419692277908325} 11/07/2021 02:52:15 - INFO - __main__ - Step 40151: {'lr': 0.0004224091558803337, 'samples': 7708992, 'steps': 40150, 'loss/train': 0.2047809660434723} 11/07/2021 02:52:15 - INFO - __main__ - Step 40152: {'lr': 0.0004224053129328748, 'samples': 7709184, 'steps': 40151, 'loss/train': 1.0908914804458618} 11/07/2021 02:52:15 - INFO - __main__ - Step 40153: {'lr': 0.0004224014699077329, 'samples': 7709376, 'steps': 40152, 'loss/train': 2.108187675476074} 11/07/2021 02:52:16 - INFO - __main__ - Step 40154: {'lr': 0.00042239762680490944, 'samples': 7709568, 'steps': 40153, 'loss/train': 1.6479051113128662} 11/07/2021 02:52:16 - INFO - __main__ - Step 40155: {'lr': 0.00042239378362440627, 'samples': 7709760, 'steps': 40154, 'loss/train': 1.5548971891403198} 11/07/2021 02:52:17 - INFO - __main__ - Step 40156: {'lr': 0.0004223899403662251, 'samples': 7709952, 'steps': 40155, 'loss/train': 0.6885210275650024} 11/07/2021 02:52:18 - INFO - __main__ - Step 40157: {'lr': 0.0004223860970303678, 'samples': 7710144, 'steps': 40156, 'loss/train': 1.4032775163650513} 11/07/2021 02:52:18 - INFO - __main__ - Step 40158: {'lr': 0.00042238225361683593, 'samples': 7710336, 'steps': 40157, 'loss/train': 1.1780622005462646} 11/07/2021 02:52:18 - INFO - __main__ - Step 40159: {'lr': 0.00042237841012563126, 'samples': 7710528, 'steps': 40158, 'loss/train': 1.3725476264953613} 11/07/2021 02:52:19 - INFO - __main__ - Step 40160: {'lr': 0.00042237456655675555, 'samples': 7710720, 'steps': 40159, 'loss/train': 1.0629607439041138} 11/07/2021 02:52:20 - INFO - __main__ - Step 40161: {'lr': 0.0004223707229102105, 'samples': 7710912, 'steps': 40160, 'loss/train': 0.9321591258049011} 11/07/2021 02:52:20 - INFO - __main__ - Step 40162: {'lr': 0.0004223668791859979, 'samples': 7711104, 'steps': 40161, 'loss/train': 1.98379385471344} 11/07/2021 02:52:20 - INFO - __main__ - Step 40163: {'lr': 0.00042236303538411934, 'samples': 7711296, 'steps': 40162, 'loss/train': 1.5861923694610596} 11/07/2021 02:52:21 - INFO - __main__ - Step 40164: {'lr': 0.0004223591915045768, 'samples': 7711488, 'steps': 40163, 'loss/train': 1.249192237854004} 11/07/2021 02:52:21 - INFO - __main__ - Step 40165: {'lr': 0.0004223553475473718, 'samples': 7711680, 'steps': 40164, 'loss/train': 1.4634073972702026} 11/07/2021 02:52:21 - INFO - __main__ - Step 40166: {'lr': 0.00042235150351250617, 'samples': 7711872, 'steps': 40165, 'loss/train': 1.8517236709594727} 11/07/2021 02:52:22 - INFO - __main__ - Step 40167: {'lr': 0.00042234765939998156, 'samples': 7712064, 'steps': 40166, 'loss/train': 1.651535153388977} 11/07/2021 02:52:23 - INFO - __main__ - Step 40168: {'lr': 0.00042234381520979983, 'samples': 7712256, 'steps': 40167, 'loss/train': 1.5286800861358643} 11/07/2021 02:52:23 - INFO - __main__ - Step 40169: {'lr': 0.0004223399709419625, 'samples': 7712448, 'steps': 40168, 'loss/train': 2.0702335834503174} 11/07/2021 02:52:23 - INFO - __main__ - Step 40170: {'lr': 0.0004223361265964716, 'samples': 7712640, 'steps': 40169, 'loss/train': 1.5532838106155396} 11/07/2021 02:52:24 - INFO - __main__ - Step 40171: {'lr': 0.0004223322821733286, 'samples': 7712832, 'steps': 40170, 'loss/train': 1.1976006031036377} 11/07/2021 02:52:25 - INFO - __main__ - Step 40172: {'lr': 0.0004223284376725354, 'samples': 7713024, 'steps': 40171, 'loss/train': 1.5690964460372925} 11/07/2021 02:52:25 - INFO - __main__ - Step 40173: {'lr': 0.00042232459309409355, 'samples': 7713216, 'steps': 40172, 'loss/train': 1.2744946479797363} 11/07/2021 02:52:25 - INFO - __main__ - Step 40174: {'lr': 0.00042232074843800494, 'samples': 7713408, 'steps': 40173, 'loss/train': 1.3678518533706665} 11/07/2021 02:52:26 - INFO - __main__ - Step 40175: {'lr': 0.00042231690370427135, 'samples': 7713600, 'steps': 40174, 'loss/train': 1.6584599018096924} 11/07/2021 02:52:26 - INFO - __main__ - Step 40176: {'lr': 0.00042231305889289437, 'samples': 7713792, 'steps': 40175, 'loss/train': 1.1790246963500977} 11/07/2021 02:52:27 - INFO - __main__ - Step 40177: {'lr': 0.00042230921400387576, 'samples': 7713984, 'steps': 40176, 'loss/train': 1.514189600944519} 11/07/2021 02:52:27 - INFO - __main__ - Step 40178: {'lr': 0.0004223053690372173, 'samples': 7714176, 'steps': 40177, 'loss/train': 2.0193939208984375} 11/07/2021 02:52:28 - INFO - __main__ - Step 40179: {'lr': 0.00042230152399292065, 'samples': 7714368, 'steps': 40178, 'loss/train': 1.0819612741470337} 11/07/2021 02:52:28 - INFO - __main__ - Step 40180: {'lr': 0.00042229767887098766, 'samples': 7714560, 'steps': 40179, 'loss/train': 0.5604084730148315} 11/07/2021 02:52:28 - INFO - __main__ - Step 40181: {'lr': 0.00042229383367142, 'samples': 7714752, 'steps': 40180, 'loss/train': 1.6694213151931763} 11/07/2021 02:52:30 - INFO - __main__ - Step 40182: {'lr': 0.0004222899883942194, 'samples': 7714944, 'steps': 40181, 'loss/train': 1.8260314464569092} 11/07/2021 02:52:30 - INFO - __main__ - Step 40183: {'lr': 0.0004222861430393875, 'samples': 7715136, 'steps': 40182, 'loss/train': 1.5716710090637207} 11/07/2021 02:52:30 - INFO - __main__ - Step 40184: {'lr': 0.0004222822976069262, 'samples': 7715328, 'steps': 40183, 'loss/train': 1.618808627128601} 11/07/2021 02:52:31 - INFO - __main__ - Step 40185: {'lr': 0.0004222784520968371, 'samples': 7715520, 'steps': 40184, 'loss/train': 1.6012680530548096} 11/07/2021 02:52:31 - INFO - __main__ - Step 40186: {'lr': 0.0004222746065091221, 'samples': 7715712, 'steps': 40185, 'loss/train': 1.499024510383606} 11/07/2021 02:52:31 - INFO - __main__ - Step 40187: {'lr': 0.0004222707608437827, 'samples': 7715904, 'steps': 40186, 'loss/train': 1.881104826927185} 11/07/2021 02:52:32 - INFO - __main__ - Step 40188: {'lr': 0.00042226691510082083, 'samples': 7716096, 'steps': 40187, 'loss/train': 0.5944708585739136} 11/07/2021 02:52:33 - INFO - __main__ - Step 40189: {'lr': 0.0004222630692802381, 'samples': 7716288, 'steps': 40188, 'loss/train': 1.5961015224456787} 11/07/2021 02:52:33 - INFO - __main__ - Step 40190: {'lr': 0.00042225922338203625, 'samples': 7716480, 'steps': 40189, 'loss/train': 1.6296290159225464} 11/07/2021 02:52:33 - INFO - __main__ - Step 40191: {'lr': 0.00042225537740621713, 'samples': 7716672, 'steps': 40190, 'loss/train': 1.8040450811386108} 11/07/2021 02:52:34 - INFO - __main__ - Step 40192: {'lr': 0.00042225153135278236, 'samples': 7716864, 'steps': 40191, 'loss/train': 0.49315086007118225} 11/07/2021 02:52:35 - INFO - __main__ - Step 40193: {'lr': 0.00042224768522173374, 'samples': 7717056, 'steps': 40192, 'loss/train': 1.218572974205017} 11/07/2021 02:52:35 - INFO - __main__ - Step 40194: {'lr': 0.00042224383901307293, 'samples': 7717248, 'steps': 40193, 'loss/train': 1.7356516122817993} 11/07/2021 02:52:35 - INFO - __main__ - Step 40195: {'lr': 0.0004222399927268018, 'samples': 7717440, 'steps': 40194, 'loss/train': 1.3899880647659302} 11/07/2021 02:52:36 - INFO - __main__ - Step 40196: {'lr': 0.0004222361463629218, 'samples': 7717632, 'steps': 40195, 'loss/train': 1.5748728513717651} 11/07/2021 02:52:36 - INFO - __main__ - Step 40197: {'lr': 0.00042223229992143505, 'samples': 7717824, 'steps': 40196, 'loss/train': 1.9716410636901855} 11/07/2021 02:52:37 - INFO - __main__ - Step 40198: {'lr': 0.00042222845340234293, 'samples': 7718016, 'steps': 40197, 'loss/train': 1.2805557250976562} 11/07/2021 02:52:38 - INFO - __main__ - Step 40199: {'lr': 0.00042222460680564747, 'samples': 7718208, 'steps': 40198, 'loss/train': 1.248995304107666} 11/07/2021 02:52:38 - INFO - __main__ - Step 40200: {'lr': 0.0004222207601313501, 'samples': 7718400, 'steps': 40199, 'loss/train': 1.1626522541046143} 11/07/2021 02:52:39 - INFO - __main__ - Step 40201: {'lr': 0.00042221691337945285, 'samples': 7718592, 'steps': 40200, 'loss/train': 1.5353505611419678} 11/07/2021 02:52:39 - INFO - __main__ - Step 40202: {'lr': 0.0004222130665499573, 'samples': 7718784, 'steps': 40201, 'loss/train': 0.5865015983581543} 11/07/2021 02:52:40 - INFO - __main__ - Step 40203: {'lr': 0.0004222092196428651, 'samples': 7718976, 'steps': 40202, 'loss/train': 1.5902080535888672} 11/07/2021 02:52:40 - INFO - __main__ - Step 40204: {'lr': 0.0004222053726581782, 'samples': 7719168, 'steps': 40203, 'loss/train': 1.7543646097183228} 11/07/2021 02:52:41 - INFO - __main__ - Step 40205: {'lr': 0.0004222015255958981, 'samples': 7719360, 'steps': 40204, 'loss/train': 1.538298487663269} 11/07/2021 02:52:41 - INFO - __main__ - Step 40206: {'lr': 0.0004221976784560267, 'samples': 7719552, 'steps': 40205, 'loss/train': 1.2510653734207153} 11/07/2021 02:52:41 - INFO - __main__ - Step 40207: {'lr': 0.0004221938312385657, 'samples': 7719744, 'steps': 40206, 'loss/train': 1.6789195537567139} 11/07/2021 02:52:42 - INFO - __main__ - Step 40208: {'lr': 0.00042218998394351684, 'samples': 7719936, 'steps': 40207, 'loss/train': 1.4205678701400757} 11/07/2021 02:52:43 - INFO - __main__ - Step 40209: {'lr': 0.0004221861365708818, 'samples': 7720128, 'steps': 40208, 'loss/train': 1.5723936557769775} 11/07/2021 02:52:43 - INFO - __main__ - Step 40210: {'lr': 0.0004221822891206623, 'samples': 7720320, 'steps': 40209, 'loss/train': 1.513080358505249} 11/07/2021 02:52:43 - INFO - __main__ - Step 40211: {'lr': 0.00042217844159286015, 'samples': 7720512, 'steps': 40210, 'loss/train': 1.6374874114990234} 11/07/2021 02:52:44 - INFO - __main__ - Step 40212: {'lr': 0.00042217459398747703, 'samples': 7720704, 'steps': 40211, 'loss/train': 1.9883413314819336} 11/07/2021 02:52:45 - INFO - __main__ - Step 40213: {'lr': 0.0004221707463045148, 'samples': 7720896, 'steps': 40212, 'loss/train': 1.5096373558044434} 11/07/2021 02:52:45 - INFO - __main__ - Step 40214: {'lr': 0.0004221668985439749, 'samples': 7721088, 'steps': 40213, 'loss/train': 0.14097359776496887} 11/07/2021 02:52:46 - INFO - __main__ - Step 40215: {'lr': 0.00042216305070585946, 'samples': 7721280, 'steps': 40214, 'loss/train': 1.5622889995574951} 11/07/2021 02:52:46 - INFO - __main__ - Step 40216: {'lr': 0.00042215920279016993, 'samples': 7721472, 'steps': 40215, 'loss/train': 1.5851691961288452} 11/07/2021 02:52:46 - INFO - __main__ - Step 40217: {'lr': 0.00042215535479690807, 'samples': 7721664, 'steps': 40216, 'loss/train': 1.330405592918396} 11/07/2021 02:52:47 - INFO - __main__ - Step 40218: {'lr': 0.0004221515067260757, 'samples': 7721856, 'steps': 40217, 'loss/train': 1.0161442756652832} 11/07/2021 02:52:48 - INFO - __main__ - Step 40219: {'lr': 0.0004221476585776745, 'samples': 7722048, 'steps': 40218, 'loss/train': 1.7406327724456787} 11/07/2021 02:52:48 - INFO - __main__ - Step 40220: {'lr': 0.00042214381035170624, 'samples': 7722240, 'steps': 40219, 'loss/train': 1.4465067386627197} 11/07/2021 02:52:48 - INFO - __main__ - Step 40221: {'lr': 0.0004221399620481726, 'samples': 7722432, 'steps': 40220, 'loss/train': 0.31174546480178833} 11/07/2021 02:52:49 - INFO - __main__ - Step 40222: {'lr': 0.00042213611366707547, 'samples': 7722624, 'steps': 40221, 'loss/train': 1.4638572931289673} 11/07/2021 02:52:50 - INFO - __main__ - Step 40223: {'lr': 0.0004221322652084163, 'samples': 7722816, 'steps': 40222, 'loss/train': 1.0964359045028687} 11/07/2021 02:52:50 - INFO - __main__ - Step 40224: {'lr': 0.0004221284166721971, 'samples': 7723008, 'steps': 40223, 'loss/train': 1.2773767709732056} 11/07/2021 02:52:51 - INFO - __main__ - Step 40225: {'lr': 0.00042212456805841944, 'samples': 7723200, 'steps': 40224, 'loss/train': 1.7557947635650635} 11/07/2021 02:52:51 - INFO - __main__ - Step 40226: {'lr': 0.00042212071936708506, 'samples': 7723392, 'steps': 40225, 'loss/train': 1.4272174835205078} 11/07/2021 02:52:51 - INFO - __main__ - Step 40227: {'lr': 0.0004221168705981958, 'samples': 7723584, 'steps': 40226, 'loss/train': 1.4838918447494507} 11/07/2021 02:52:52 - INFO - __main__ - Step 40228: {'lr': 0.00042211302175175334, 'samples': 7723776, 'steps': 40227, 'loss/train': 1.2917256355285645} 11/07/2021 02:52:53 - INFO - __main__ - Step 40229: {'lr': 0.0004221091728277595, 'samples': 7723968, 'steps': 40228, 'loss/train': 1.401443600654602} 11/07/2021 02:52:53 - INFO - __main__ - Step 40230: {'lr': 0.0004221053238262158, 'samples': 7724160, 'steps': 40229, 'loss/train': 1.6499395370483398} 11/07/2021 02:52:53 - INFO - __main__ - Step 40231: {'lr': 0.0004221014747471241, 'samples': 7724352, 'steps': 40230, 'loss/train': 1.2536239624023438} 11/07/2021 02:52:54 - INFO - __main__ - Step 40232: {'lr': 0.0004220976255904861, 'samples': 7724544, 'steps': 40231, 'loss/train': 1.606628656387329} 11/07/2021 02:52:55 - INFO - __main__ - Step 40233: {'lr': 0.00042209377635630364, 'samples': 7724736, 'steps': 40232, 'loss/train': 1.4865119457244873} 11/07/2021 02:52:55 - INFO - __main__ - Step 40234: {'lr': 0.00042208992704457837, 'samples': 7724928, 'steps': 40233, 'loss/train': 0.8506203889846802} 11/07/2021 02:52:55 - INFO - __main__ - Step 40235: {'lr': 0.00042208607765531204, 'samples': 7725120, 'steps': 40234, 'loss/train': 1.1438913345336914} 11/07/2021 02:52:56 - INFO - __main__ - Step 40236: {'lr': 0.00042208222818850634, 'samples': 7725312, 'steps': 40235, 'loss/train': 1.2168179750442505} 11/07/2021 02:52:56 - INFO - __main__ - Step 40237: {'lr': 0.0004220783786441631, 'samples': 7725504, 'steps': 40236, 'loss/train': 1.3755687475204468} 11/07/2021 02:52:56 - INFO - __main__ - Step 40238: {'lr': 0.0004220745290222839, 'samples': 7725696, 'steps': 40237, 'loss/train': 1.1750071048736572} 11/07/2021 02:52:57 - INFO - __main__ - Step 40239: {'lr': 0.00042207067932287066, 'samples': 7725888, 'steps': 40238, 'loss/train': 1.6734548807144165} 11/07/2021 02:52:58 - INFO - __main__ - Step 40240: {'lr': 0.00042206682954592503, 'samples': 7726080, 'steps': 40239, 'loss/train': 1.5095839500427246} 11/07/2021 02:52:58 - INFO - __main__ - Step 40241: {'lr': 0.0004220629796914487, 'samples': 7726272, 'steps': 40240, 'loss/train': 1.7802373170852661} 11/07/2021 02:52:58 - INFO - __main__ - Step 40242: {'lr': 0.00042205912975944344, 'samples': 7726464, 'steps': 40241, 'loss/train': 1.1329573392868042} 11/07/2021 02:52:59 - INFO - __main__ - Step 40243: {'lr': 0.00042205527974991096, 'samples': 7726656, 'steps': 40242, 'loss/train': 1.934626817703247} 11/07/2021 02:53:00 - INFO - __main__ - Step 40244: {'lr': 0.00042205142966285315, 'samples': 7726848, 'steps': 40243, 'loss/train': 1.7891731262207031} 11/07/2021 02:53:01 - INFO - __main__ - Step 40245: {'lr': 0.0004220475794982716, 'samples': 7727040, 'steps': 40244, 'loss/train': 1.6130657196044922} 11/07/2021 02:53:01 - INFO - __main__ - Step 40246: {'lr': 0.00042204372925616797, 'samples': 7727232, 'steps': 40245, 'loss/train': 1.7765222787857056} 11/07/2021 02:53:01 - INFO - __main__ - Step 40247: {'lr': 0.0004220398789365441, 'samples': 7727424, 'steps': 40246, 'loss/train': 1.2992706298828125} 11/07/2021 02:53:02 - INFO - __main__ - Step 40248: {'lr': 0.0004220360285394017, 'samples': 7727616, 'steps': 40247, 'loss/train': 1.0947076082229614} 11/07/2021 02:53:02 - INFO - __main__ - Step 40249: {'lr': 0.0004220321780647426, 'samples': 7727808, 'steps': 40248, 'loss/train': 1.3187216520309448} 11/07/2021 02:53:03 - INFO - __main__ - Step 40250: {'lr': 0.00042202832751256846, 'samples': 7728000, 'steps': 40249, 'loss/train': 1.7385437488555908} 11/07/2021 02:53:03 - INFO - __main__ - Step 40251: {'lr': 0.0004220244768828809, 'samples': 7728192, 'steps': 40250, 'loss/train': 1.3357092142105103} 11/07/2021 02:53:04 - INFO - __main__ - Step 40252: {'lr': 0.0004220206261756819, 'samples': 7728384, 'steps': 40251, 'loss/train': 0.9959414005279541} 11/07/2021 02:53:04 - INFO - __main__ - Step 40253: {'lr': 0.00042201677539097294, 'samples': 7728576, 'steps': 40252, 'loss/train': 1.3090541362762451} 11/07/2021 02:53:04 - INFO - __main__ - Step 40254: {'lr': 0.00042201292452875595, 'samples': 7728768, 'steps': 40253, 'loss/train': 1.762420654296875} 11/07/2021 02:53:06 - INFO - __main__ - Step 40255: {'lr': 0.00042200907358903264, 'samples': 7728960, 'steps': 40254, 'loss/train': 1.5239912271499634} 11/07/2021 02:53:06 - INFO - __main__ - Step 40256: {'lr': 0.0004220052225718046, 'samples': 7729152, 'steps': 40255, 'loss/train': 1.4988716840744019} 11/07/2021 02:53:06 - INFO - __main__ - Step 40257: {'lr': 0.0004220013714770737, 'samples': 7729344, 'steps': 40256, 'loss/train': 1.5974842309951782} 11/07/2021 02:53:07 - INFO - __main__ - Step 40258: {'lr': 0.0004219975203048416, 'samples': 7729536, 'steps': 40257, 'loss/train': 1.612775206565857} 11/07/2021 02:53:07 - INFO - __main__ - Step 40259: {'lr': 0.0004219936690551101, 'samples': 7729728, 'steps': 40258, 'loss/train': 1.3531855344772339} 11/07/2021 02:53:07 - INFO - __main__ - Step 40260: {'lr': 0.0004219898177278809, 'samples': 7729920, 'steps': 40259, 'loss/train': 1.6398286819458008} 11/07/2021 02:53:08 - INFO - __main__ - Step 40261: {'lr': 0.00042198596632315576, 'samples': 7730112, 'steps': 40260, 'loss/train': 5.88883113861084} 11/07/2021 02:53:09 - INFO - __main__ - Step 40262: {'lr': 0.0004219821148409364, 'samples': 7730304, 'steps': 40261, 'loss/train': 1.5091698169708252} 11/07/2021 02:53:09 - INFO - __main__ - Step 40263: {'lr': 0.00042197826328122456, 'samples': 7730496, 'steps': 40262, 'loss/train': 1.4398863315582275} 11/07/2021 02:53:09 - INFO - __main__ - Step 40264: {'lr': 0.00042197441164402197, 'samples': 7730688, 'steps': 40263, 'loss/train': 1.578916311264038} 11/07/2021 02:53:10 - INFO - __main__ - Step 40265: {'lr': 0.0004219705599293303, 'samples': 7730880, 'steps': 40264, 'loss/train': 1.0776889324188232} 11/07/2021 02:53:11 - INFO - __main__ - Step 40266: {'lr': 0.00042196670813715137, 'samples': 7731072, 'steps': 40265, 'loss/train': 1.384979248046875} 11/07/2021 02:53:11 - INFO - __main__ - Step 40267: {'lr': 0.0004219628562674869, 'samples': 7731264, 'steps': 40266, 'loss/train': 1.5530970096588135} 11/07/2021 02:53:12 - INFO - __main__ - Step 40268: {'lr': 0.00042195900432033865, 'samples': 7731456, 'steps': 40267, 'loss/train': 2.001399278640747} 11/07/2021 02:53:12 - INFO - __main__ - Step 40269: {'lr': 0.00042195515229570833, 'samples': 7731648, 'steps': 40268, 'loss/train': 1.5870234966278076} 11/07/2021 02:53:12 - INFO - __main__ - Step 40270: {'lr': 0.0004219513001935976, 'samples': 7731840, 'steps': 40269, 'loss/train': 1.714131236076355} 11/07/2021 02:53:13 - INFO - __main__ - Step 40271: {'lr': 0.00042194744801400837, 'samples': 7732032, 'steps': 40270, 'loss/train': 1.4420199394226074} 11/07/2021 02:53:14 - INFO - __main__ - Step 40272: {'lr': 0.0004219435957569422, 'samples': 7732224, 'steps': 40271, 'loss/train': 1.428703784942627} 11/07/2021 02:53:14 - INFO - __main__ - Step 40273: {'lr': 0.0004219397434224009, 'samples': 7732416, 'steps': 40272, 'loss/train': 1.6456485986709595} 11/07/2021 02:53:14 - INFO - __main__ - Step 40274: {'lr': 0.0004219358910103862, 'samples': 7732608, 'steps': 40273, 'loss/train': 1.3778915405273438} 11/07/2021 02:53:15 - INFO - __main__ - Step 40275: {'lr': 0.00042193203852089993, 'samples': 7732800, 'steps': 40274, 'loss/train': 1.2710283994674683} 11/07/2021 02:53:15 - INFO - __main__ - Step 40276: {'lr': 0.00042192818595394367, 'samples': 7732992, 'steps': 40275, 'loss/train': 1.1744085550308228} 11/07/2021 02:53:16 - INFO - __main__ - Step 40277: {'lr': 0.00042192433330951926, 'samples': 7733184, 'steps': 40276, 'loss/train': 1.3941353559494019} 11/07/2021 02:53:16 - INFO - __main__ - Step 40278: {'lr': 0.00042192048058762834, 'samples': 7733376, 'steps': 40277, 'loss/train': 1.1758747100830078} 11/07/2021 02:53:17 - INFO - __main__ - Step 40279: {'lr': 0.00042191662778827275, 'samples': 7733568, 'steps': 40278, 'loss/train': 1.396023154258728} 11/07/2021 02:53:17 - INFO - __main__ - Step 40280: {'lr': 0.0004219127749114541, 'samples': 7733760, 'steps': 40279, 'loss/train': 1.4444804191589355} 11/07/2021 02:53:17 - INFO - __main__ - Step 40281: {'lr': 0.00042190892195717426, 'samples': 7733952, 'steps': 40280, 'loss/train': 1.2106468677520752} 11/07/2021 02:53:19 - INFO - __main__ - Step 40282: {'lr': 0.000421905068925435, 'samples': 7734144, 'steps': 40281, 'loss/train': 1.269433856010437} 11/07/2021 02:53:19 - INFO - __main__ - Step 40283: {'lr': 0.00042190121581623784, 'samples': 7734336, 'steps': 40282, 'loss/train': 1.8498080968856812} 11/07/2021 02:53:19 - INFO - __main__ - Step 40284: {'lr': 0.0004218973626295847, 'samples': 7734528, 'steps': 40283, 'loss/train': 1.6520702838897705} 11/07/2021 02:53:20 - INFO - __main__ - Step 40285: {'lr': 0.0004218935093654772, 'samples': 7734720, 'steps': 40284, 'loss/train': 1.5929603576660156} 11/07/2021 02:53:20 - INFO - __main__ - Step 40286: {'lr': 0.00042188965602391726, 'samples': 7734912, 'steps': 40285, 'loss/train': 1.637909173965454} 11/07/2021 02:53:21 - INFO - __main__ - Step 40287: {'lr': 0.0004218858026049064, 'samples': 7735104, 'steps': 40286, 'loss/train': 1.2915066480636597} 11/07/2021 02:53:21 - INFO - __main__ - Step 40288: {'lr': 0.00042188194910844644, 'samples': 7735296, 'steps': 40287, 'loss/train': 2.1509063243865967} 11/07/2021 02:53:22 - INFO - __main__ - Step 40289: {'lr': 0.0004218780955345392, 'samples': 7735488, 'steps': 40288, 'loss/train': 1.2132989168167114} 11/07/2021 02:53:22 - INFO - __main__ - Step 40290: {'lr': 0.0004218742418831863, 'samples': 7735680, 'steps': 40289, 'loss/train': 1.6381466388702393} 11/07/2021 02:53:22 - INFO - __main__ - Step 40291: {'lr': 0.0004218703881543895, 'samples': 7735872, 'steps': 40290, 'loss/train': 1.260995864868164} 11/07/2021 02:53:23 - INFO - __main__ - Step 40292: {'lr': 0.0004218665343481506, 'samples': 7736064, 'steps': 40291, 'loss/train': 1.5558457374572754} 11/07/2021 02:53:24 - INFO - __main__ - Step 40293: {'lr': 0.00042186268046447124, 'samples': 7736256, 'steps': 40292, 'loss/train': 1.7037135362625122} 11/07/2021 02:53:24 - INFO - __main__ - Step 40294: {'lr': 0.0004218588265033533, 'samples': 7736448, 'steps': 40293, 'loss/train': 0.8515810966491699} 11/07/2021 02:53:25 - INFO - __main__ - Step 40295: {'lr': 0.0004218549724647983, 'samples': 7736640, 'steps': 40294, 'loss/train': 1.9279394149780273} 11/07/2021 02:53:25 - INFO - __main__ - Step 40296: {'lr': 0.0004218511183488082, 'samples': 7736832, 'steps': 40295, 'loss/train': 1.844152569770813} 11/07/2021 02:53:25 - INFO - __main__ - Step 40297: {'lr': 0.00042184726415538457, 'samples': 7737024, 'steps': 40296, 'loss/train': 1.586840271949768} 11/07/2021 02:53:26 - INFO - __main__ - Step 40298: {'lr': 0.00042184340988452924, 'samples': 7737216, 'steps': 40297, 'loss/train': 1.3267289400100708} 11/07/2021 02:53:27 - INFO - __main__ - Step 40299: {'lr': 0.00042183955553624393, 'samples': 7737408, 'steps': 40298, 'loss/train': 1.6323528289794922} 11/07/2021 02:53:27 - INFO - __main__ - Step 40300: {'lr': 0.0004218357011105304, 'samples': 7737600, 'steps': 40299, 'loss/train': 1.4126336574554443} 11/07/2021 02:53:27 - INFO - __main__ - Step 40301: {'lr': 0.00042183184660739027, 'samples': 7737792, 'steps': 40300, 'loss/train': 1.3471697568893433} 11/07/2021 02:53:28 - INFO - __main__ - Step 40302: {'lr': 0.00042182799202682543, 'samples': 7737984, 'steps': 40301, 'loss/train': 1.7396727800369263} 11/07/2021 02:53:29 - INFO - __main__ - Step 40303: {'lr': 0.0004218241373688375, 'samples': 7738176, 'steps': 40302, 'loss/train': 2.6259586811065674} 11/07/2021 02:53:30 - INFO - __main__ - Step 40304: {'lr': 0.0004218202826334283, 'samples': 7738368, 'steps': 40303, 'loss/train': 1.8593873977661133} 11/07/2021 02:53:30 - INFO - __main__ - Step 40305: {'lr': 0.0004218164278205995, 'samples': 7738560, 'steps': 40304, 'loss/train': 1.0830633640289307} 11/07/2021 02:53:30 - INFO - __main__ - Step 40306: {'lr': 0.00042181257293035293, 'samples': 7738752, 'steps': 40305, 'loss/train': 0.8169730305671692} 11/07/2021 02:53:31 - INFO - __main__ - Step 40307: {'lr': 0.00042180871796269025, 'samples': 7738944, 'steps': 40306, 'loss/train': 1.5396815538406372} 11/07/2021 02:53:31 - INFO - __main__ - Step 40308: {'lr': 0.00042180486291761314, 'samples': 7739136, 'steps': 40307, 'loss/train': 1.6105730533599854} 11/07/2021 02:53:31 - INFO - __main__ - Step 40309: {'lr': 0.0004218010077951235, 'samples': 7739328, 'steps': 40308, 'loss/train': 1.5214552879333496} 11/07/2021 02:53:32 - INFO - __main__ - Step 40310: {'lr': 0.00042179715259522293, 'samples': 7739520, 'steps': 40309, 'loss/train': 1.711687445640564} 11/07/2021 02:53:33 - INFO - __main__ - Step 40311: {'lr': 0.00042179329731791324, 'samples': 7739712, 'steps': 40310, 'loss/train': 1.418582558631897} 11/07/2021 02:53:33 - INFO - __main__ - Step 40312: {'lr': 0.0004217894419631961, 'samples': 7739904, 'steps': 40311, 'loss/train': 1.4990530014038086} 11/07/2021 02:53:33 - INFO - __main__ - Step 40313: {'lr': 0.00042178558653107337, 'samples': 7740096, 'steps': 40312, 'loss/train': 1.4654122591018677} 11/07/2021 02:53:34 - INFO - __main__ - Step 40314: {'lr': 0.0004217817310215466, 'samples': 7740288, 'steps': 40313, 'loss/train': 1.5557605028152466} 11/07/2021 02:53:35 - INFO - __main__ - Step 40315: {'lr': 0.00042177787543461767, 'samples': 7740480, 'steps': 40314, 'loss/train': 1.8264100551605225} 11/07/2021 02:53:35 - INFO - __main__ - Step 40316: {'lr': 0.0004217740197702883, 'samples': 7740672, 'steps': 40315, 'loss/train': 1.7862824201583862} 11/07/2021 02:53:35 - INFO - __main__ - Step 40317: {'lr': 0.00042177016402856023, 'samples': 7740864, 'steps': 40316, 'loss/train': 1.6017388105392456} 11/07/2021 02:53:36 - INFO - __main__ - Step 40318: {'lr': 0.00042176630820943515, 'samples': 7741056, 'steps': 40317, 'loss/train': 1.3726780414581299} 11/07/2021 02:53:36 - INFO - __main__ - Step 40319: {'lr': 0.0004217624523129148, 'samples': 7741248, 'steps': 40318, 'loss/train': 1.5588961839675903} 11/07/2021 02:53:37 - INFO - __main__ - Step 40320: {'lr': 0.0004217585963390009, 'samples': 7741440, 'steps': 40319, 'loss/train': 1.0949862003326416} 11/07/2021 02:53:38 - INFO - __main__ - Step 40321: {'lr': 0.00042175474028769534, 'samples': 7741632, 'steps': 40320, 'loss/train': 1.5856151580810547} 11/07/2021 02:53:38 - INFO - __main__ - Step 40322: {'lr': 0.00042175088415899963, 'samples': 7741824, 'steps': 40321, 'loss/train': 1.3406957387924194} 11/07/2021 02:53:38 - INFO - __main__ - Step 40323: {'lr': 0.00042174702795291574, 'samples': 7742016, 'steps': 40322, 'loss/train': 1.3882884979248047} 11/07/2021 02:53:39 - INFO - __main__ - Step 40324: {'lr': 0.0004217431716694452, 'samples': 7742208, 'steps': 40323, 'loss/train': 1.621527910232544} 11/07/2021 02:53:40 - INFO - __main__ - Step 40325: {'lr': 0.00042173931530858986, 'samples': 7742400, 'steps': 40324, 'loss/train': 1.1860873699188232} 11/07/2021 02:53:40 - INFO - __main__ - Step 40326: {'lr': 0.00042173545887035145, 'samples': 7742592, 'steps': 40325, 'loss/train': 1.6575325727462769} 11/07/2021 02:53:40 - INFO - __main__ - Step 40327: {'lr': 0.0004217316023547317, 'samples': 7742784, 'steps': 40326, 'loss/train': 1.705884337425232} 11/07/2021 02:53:41 - INFO - __main__ - Step 40328: {'lr': 0.00042172774576173226, 'samples': 7742976, 'steps': 40327, 'loss/train': 2.0860137939453125} 11/07/2021 02:53:41 - INFO - __main__ - Step 40329: {'lr': 0.00042172388909135505, 'samples': 7743168, 'steps': 40328, 'loss/train': 1.2067232131958008} 11/07/2021 02:53:42 - INFO - __main__ - Step 40330: {'lr': 0.0004217200323436017, 'samples': 7743360, 'steps': 40329, 'loss/train': 1.7795089483261108} 11/07/2021 02:53:42 - INFO - __main__ - Step 40331: {'lr': 0.00042171617551847387, 'samples': 7743552, 'steps': 40330, 'loss/train': 1.6506901979446411} 11/07/2021 02:53:43 - INFO - __main__ - Step 40332: {'lr': 0.0004217123186159735, 'samples': 7743744, 'steps': 40331, 'loss/train': 1.5076559782028198} 11/07/2021 02:53:43 - INFO - __main__ - Step 40333: {'lr': 0.0004217084616361021, 'samples': 7743936, 'steps': 40332, 'loss/train': 1.3635174036026} 11/07/2021 02:53:43 - INFO - __main__ - Step 40334: {'lr': 0.0004217046045788615, 'samples': 7744128, 'steps': 40333, 'loss/train': 1.613263726234436} 11/07/2021 02:53:44 - INFO - __main__ - Step 40335: {'lr': 0.0004217007474442535, 'samples': 7744320, 'steps': 40334, 'loss/train': 1.0538244247436523} 11/07/2021 02:53:45 - INFO - __main__ - Step 40336: {'lr': 0.00042169689023227987, 'samples': 7744512, 'steps': 40335, 'loss/train': 1.7182577848434448} 11/07/2021 02:53:45 - INFO - __main__ - Step 40337: {'lr': 0.00042169303294294216, 'samples': 7744704, 'steps': 40336, 'loss/train': 1.5433964729309082} 11/07/2021 02:53:46 - INFO - __main__ - Step 40338: {'lr': 0.0004216891755762423, 'samples': 7744896, 'steps': 40337, 'loss/train': 1.5233230590820312} 11/07/2021 02:53:46 - INFO - __main__ - Step 40339: {'lr': 0.00042168531813218193, 'samples': 7745088, 'steps': 40338, 'loss/train': 2.014331102371216} 11/07/2021 02:53:46 - INFO - __main__ - Step 40340: {'lr': 0.0004216814606107627, 'samples': 7745280, 'steps': 40339, 'loss/train': 1.6288928985595703} 11/07/2021 02:53:47 - INFO - __main__ - Step 40341: {'lr': 0.00042167760301198656, 'samples': 7745472, 'steps': 40340, 'loss/train': 1.9901844263076782} 11/07/2021 02:53:48 - INFO - __main__ - Step 40342: {'lr': 0.0004216737453358551, 'samples': 7745664, 'steps': 40341, 'loss/train': 1.5221227407455444} 11/07/2021 02:53:48 - INFO - __main__ - Step 40343: {'lr': 0.00042166988758237013, 'samples': 7745856, 'steps': 40342, 'loss/train': 1.735198974609375} 11/07/2021 02:53:48 - INFO - __main__ - Step 40344: {'lr': 0.00042166602975153333, 'samples': 7746048, 'steps': 40343, 'loss/train': 1.4574334621429443} 11/07/2021 02:53:49 - INFO - __main__ - Step 40345: {'lr': 0.0004216621718433465, 'samples': 7746240, 'steps': 40344, 'loss/train': 1.4490758180618286} 11/07/2021 02:53:50 - INFO - __main__ - Step 40346: {'lr': 0.0004216583138578113, 'samples': 7746432, 'steps': 40345, 'loss/train': 1.3788456916809082} 11/07/2021 02:53:50 - INFO - __main__ - Step 40347: {'lr': 0.00042165445579492956, 'samples': 7746624, 'steps': 40346, 'loss/train': 1.7768930196762085} 11/07/2021 02:53:50 - INFO - __main__ - Step 40348: {'lr': 0.00042165059765470294, 'samples': 7746816, 'steps': 40347, 'loss/train': 1.6489914655685425} 11/07/2021 02:53:51 - INFO - __main__ - Step 40349: {'lr': 0.0004216467394371333, 'samples': 7747008, 'steps': 40348, 'loss/train': 1.6425361633300781} 11/07/2021 02:53:51 - INFO - __main__ - Step 40350: {'lr': 0.00042164288114222213, 'samples': 7747200, 'steps': 40349, 'loss/train': 1.606465220451355} 11/07/2021 02:53:52 - INFO - __main__ - Step 40351: {'lr': 0.0004216390227699714, 'samples': 7747392, 'steps': 40350, 'loss/train': 1.3147684335708618} 11/07/2021 02:53:52 - INFO - __main__ - Step 40352: {'lr': 0.0004216351643203828, 'samples': 7747584, 'steps': 40351, 'loss/train': 1.381427526473999} 11/07/2021 02:53:53 - INFO - __main__ - Step 40353: {'lr': 0.000421631305793458, 'samples': 7747776, 'steps': 40352, 'loss/train': 1.214415192604065} 11/07/2021 02:53:53 - INFO - __main__ - Step 40354: {'lr': 0.00042162744718919875, 'samples': 7747968, 'steps': 40353, 'loss/train': 1.4954808950424194} 11/07/2021 02:53:53 - INFO - __main__ - Step 40355: {'lr': 0.0004216235885076069, 'samples': 7748160, 'steps': 40354, 'loss/train': 1.5964220762252808} 11/07/2021 02:53:55 - INFO - __main__ - Step 40356: {'lr': 0.00042161972974868415, 'samples': 7748352, 'steps': 40355, 'loss/train': 1.5978648662567139} 11/07/2021 02:53:55 - INFO - __main__ - Step 40357: {'lr': 0.00042161587091243215, 'samples': 7748544, 'steps': 40356, 'loss/train': 1.434416651725769} 11/07/2021 02:53:55 - INFO - __main__ - Step 40358: {'lr': 0.00042161201199885257, 'samples': 7748736, 'steps': 40357, 'loss/train': 1.6181108951568604} 11/07/2021 02:53:56 - INFO - __main__ - Step 40359: {'lr': 0.0004216081530079474, 'samples': 7748928, 'steps': 40358, 'loss/train': 1.367638111114502} 11/07/2021 02:53:56 - INFO - __main__ - Step 40360: {'lr': 0.0004216042939397182, 'samples': 7749120, 'steps': 40359, 'loss/train': 1.4361519813537598} 11/07/2021 02:53:56 - INFO - __main__ - Step 40361: {'lr': 0.00042160043479416676, 'samples': 7749312, 'steps': 40360, 'loss/train': 5.871668338775635} 11/07/2021 02:53:57 - INFO - __main__ - Step 40362: {'lr': 0.00042159657557129483, 'samples': 7749504, 'steps': 40361, 'loss/train': 1.7653417587280273} 11/07/2021 02:53:58 - INFO - __main__ - Step 40363: {'lr': 0.0004215927162711041, 'samples': 7749696, 'steps': 40362, 'loss/train': 1.5643774271011353} 11/07/2021 02:53:58 - INFO - __main__ - Step 40364: {'lr': 0.00042158885689359637, 'samples': 7749888, 'steps': 40363, 'loss/train': 1.458735704421997} 11/07/2021 02:53:59 - INFO - __main__ - Step 40365: {'lr': 0.0004215849974387733, 'samples': 7750080, 'steps': 40364, 'loss/train': 1.1762034893035889} 11/07/2021 02:53:59 - INFO - __main__ - Step 40366: {'lr': 0.0004215811379066367, 'samples': 7750272, 'steps': 40365, 'loss/train': 1.7351243495941162} 11/07/2021 02:54:00 - INFO - __main__ - Step 40367: {'lr': 0.00042157727829718827, 'samples': 7750464, 'steps': 40366, 'loss/train': 0.9102671146392822} 11/07/2021 02:54:00 - INFO - __main__ - Step 40368: {'lr': 0.00042157341861042986, 'samples': 7750656, 'steps': 40367, 'loss/train': 1.847285270690918} 11/07/2021 02:54:01 - INFO - __main__ - Step 40369: {'lr': 0.00042156955884636307, 'samples': 7750848, 'steps': 40368, 'loss/train': 1.3229519128799438} 11/07/2021 02:54:01 - INFO - __main__ - Step 40370: {'lr': 0.0004215656990049896, 'samples': 7751040, 'steps': 40369, 'loss/train': 1.2186511754989624} 11/07/2021 02:54:01 - INFO - __main__ - Step 40371: {'lr': 0.0004215618390863114, 'samples': 7751232, 'steps': 40370, 'loss/train': 1.1559193134307861} 11/07/2021 02:54:02 - INFO - __main__ - Step 40372: {'lr': 0.00042155797909033, 'samples': 7751424, 'steps': 40371, 'loss/train': 1.7392240762710571} 11/07/2021 02:54:03 - INFO - __main__ - Step 40373: {'lr': 0.00042155411901704723, 'samples': 7751616, 'steps': 40372, 'loss/train': 1.645806074142456} 11/07/2021 02:54:03 - INFO - __main__ - Step 40374: {'lr': 0.0004215502588664648, 'samples': 7751808, 'steps': 40373, 'loss/train': 1.5513337850570679} 11/07/2021 02:54:03 - INFO - __main__ - Step 40375: {'lr': 0.0004215463986385845, 'samples': 7752000, 'steps': 40374, 'loss/train': 1.8371727466583252} 11/07/2021 02:54:04 - INFO - __main__ - Step 40376: {'lr': 0.0004215425383334081, 'samples': 7752192, 'steps': 40375, 'loss/train': 1.8390657901763916} 11/07/2021 02:54:05 - INFO - __main__ - Step 40377: {'lr': 0.00042153867795093714, 'samples': 7752384, 'steps': 40376, 'loss/train': 1.8111679553985596} 11/07/2021 02:54:05 - INFO - __main__ - Step 40378: {'lr': 0.0004215348174911736, 'samples': 7752576, 'steps': 40377, 'loss/train': 1.1135926246643066} 11/07/2021 02:54:05 - INFO - __main__ - Step 40379: {'lr': 0.0004215309569541191, 'samples': 7752768, 'steps': 40378, 'loss/train': 1.1120686531066895} 11/07/2021 02:54:06 - INFO - __main__ - Step 40380: {'lr': 0.00042152709633977545, 'samples': 7752960, 'steps': 40379, 'loss/train': 1.7353676557540894} 11/07/2021 02:54:06 - INFO - __main__ - Step 40381: {'lr': 0.0004215232356481442, 'samples': 7753152, 'steps': 40380, 'loss/train': 1.472200632095337} 11/07/2021 02:54:06 - INFO - __main__ - Step 40382: {'lr': 0.0004215193748792273, 'samples': 7753344, 'steps': 40381, 'loss/train': 1.3253713846206665} 11/07/2021 02:54:08 - INFO - __main__ - Step 40383: {'lr': 0.00042151551403302645, 'samples': 7753536, 'steps': 40382, 'loss/train': 1.6094202995300293} 11/07/2021 02:54:08 - INFO - __main__ - Step 40384: {'lr': 0.00042151165310954335, 'samples': 7753728, 'steps': 40383, 'loss/train': 1.6044669151306152} 11/07/2021 02:54:08 - INFO - __main__ - Step 40385: {'lr': 0.0004215077921087798, 'samples': 7753920, 'steps': 40384, 'loss/train': 0.5478833317756653} 11/07/2021 02:54:09 - INFO - __main__ - Step 40386: {'lr': 0.00042150393103073736, 'samples': 7754112, 'steps': 40385, 'loss/train': 1.0497719049453735} 11/07/2021 02:54:09 - INFO - __main__ - Step 40387: {'lr': 0.00042150006987541795, 'samples': 7754304, 'steps': 40386, 'loss/train': 1.6169170141220093} 11/07/2021 02:54:10 - INFO - __main__ - Step 40388: {'lr': 0.0004214962086428232, 'samples': 7754496, 'steps': 40387, 'loss/train': 1.7807025909423828} 11/07/2021 02:54:10 - INFO - __main__ - Step 40389: {'lr': 0.00042149234733295497, 'samples': 7754688, 'steps': 40388, 'loss/train': 1.7034701108932495} 11/07/2021 02:54:11 - INFO - __main__ - Step 40390: {'lr': 0.00042148848594581503, 'samples': 7754880, 'steps': 40389, 'loss/train': 0.9810364842414856} 11/07/2021 02:54:11 - INFO - __main__ - Step 40391: {'lr': 0.00042148462448140487, 'samples': 7755072, 'steps': 40390, 'loss/train': 1.7027150392532349} 11/07/2021 02:54:11 - INFO - __main__ - Step 40392: {'lr': 0.0004214807629397264, 'samples': 7755264, 'steps': 40391, 'loss/train': 1.563767671585083} 11/07/2021 02:54:12 - INFO - __main__ - Step 40393: {'lr': 0.00042147690132078136, 'samples': 7755456, 'steps': 40392, 'loss/train': 1.670060396194458} 11/07/2021 02:54:13 - INFO - __main__ - Step 40394: {'lr': 0.0004214730396245715, 'samples': 7755648, 'steps': 40393, 'loss/train': 0.9150652289390564} 11/07/2021 02:54:13 - INFO - __main__ - Step 40395: {'lr': 0.0004214691778510985, 'samples': 7755840, 'steps': 40394, 'loss/train': 1.806809425354004} 11/07/2021 02:54:13 - INFO - __main__ - Step 40396: {'lr': 0.0004214653160003642, 'samples': 7756032, 'steps': 40395, 'loss/train': 1.3819338083267212} 11/07/2021 02:54:14 - INFO - __main__ - Step 40397: {'lr': 0.00042146145407237023, 'samples': 7756224, 'steps': 40396, 'loss/train': 1.266759991645813} 11/07/2021 02:54:15 - INFO - __main__ - Step 40398: {'lr': 0.00042145759206711834, 'samples': 7756416, 'steps': 40397, 'loss/train': 1.3659684658050537} 11/07/2021 02:54:15 - INFO - __main__ - Step 40399: {'lr': 0.0004214537299846104, 'samples': 7756608, 'steps': 40398, 'loss/train': 1.5729107856750488} 11/07/2021 02:54:16 - INFO - __main__ - Step 40400: {'lr': 0.00042144986782484796, 'samples': 7756800, 'steps': 40399, 'loss/train': 1.468271255493164} 11/07/2021 02:54:16 - INFO - __main__ - Step 40401: {'lr': 0.00042144600558783284, 'samples': 7756992, 'steps': 40400, 'loss/train': 1.6899981498718262} 11/07/2021 02:54:16 - INFO - __main__ - Step 40402: {'lr': 0.0004214421432735669, 'samples': 7757184, 'steps': 40401, 'loss/train': 1.9053840637207031} 11/07/2021 02:54:17 - INFO - __main__ - Step 40403: {'lr': 0.0004214382808820517, 'samples': 7757376, 'steps': 40402, 'loss/train': 1.874024510383606} 11/07/2021 02:54:18 - INFO - __main__ - Step 40404: {'lr': 0.0004214344184132891, 'samples': 7757568, 'steps': 40403, 'loss/train': 1.5011881589889526} 11/07/2021 02:54:18 - INFO - __main__ - Step 40405: {'lr': 0.0004214305558672808, 'samples': 7757760, 'steps': 40404, 'loss/train': 2.215088367462158} 11/07/2021 02:54:19 - INFO - __main__ - Step 40406: {'lr': 0.0004214266932440285, 'samples': 7757952, 'steps': 40405, 'loss/train': 0.11188347637653351} 11/07/2021 02:54:19 - INFO - __main__ - Step 40407: {'lr': 0.000421422830543534, 'samples': 7758144, 'steps': 40406, 'loss/train': 2.0618247985839844} 11/07/2021 02:54:20 - INFO - __main__ - Step 40408: {'lr': 0.00042141896776579904, 'samples': 7758336, 'steps': 40407, 'loss/train': 1.0536060333251953} 11/07/2021 02:54:20 - INFO - __main__ - Step 40409: {'lr': 0.0004214151049108252, 'samples': 7758528, 'steps': 40408, 'loss/train': 1.743821144104004} 11/07/2021 02:54:21 - INFO - __main__ - Step 40410: {'lr': 0.00042141124197861456, 'samples': 7758720, 'steps': 40409, 'loss/train': 0.9953275322914124} 11/07/2021 02:54:21 - INFO - __main__ - Step 40411: {'lr': 0.0004214073789691686, 'samples': 7758912, 'steps': 40410, 'loss/train': 1.4814211130142212} 11/07/2021 02:54:22 - INFO - __main__ - Step 40412: {'lr': 0.00042140351588248906, 'samples': 7759104, 'steps': 40411, 'loss/train': 1.3941441774368286} 11/07/2021 02:54:22 - INFO - __main__ - Step 40413: {'lr': 0.00042139965271857774, 'samples': 7759296, 'steps': 40412, 'loss/train': 1.7358777523040771} 11/07/2021 02:54:22 - INFO - __main__ - Step 40414: {'lr': 0.0004213957894774364, 'samples': 7759488, 'steps': 40413, 'loss/train': 1.5109068155288696} 11/07/2021 02:54:23 - INFO - __main__ - Step 40415: {'lr': 0.0004213919261590667, 'samples': 7759680, 'steps': 40414, 'loss/train': 0.9453153014183044} 11/07/2021 02:54:24 - INFO - __main__ - Step 40416: {'lr': 0.0004213880627634705, 'samples': 7759872, 'steps': 40415, 'loss/train': 2.697310209274292} 11/07/2021 02:54:24 - INFO - __main__ - Step 40417: {'lr': 0.0004213841992906496, 'samples': 7760064, 'steps': 40416, 'loss/train': 1.6190316677093506} 11/07/2021 02:54:24 - INFO - __main__ - Step 40418: {'lr': 0.0004213803357406055, 'samples': 7760256, 'steps': 40417, 'loss/train': 1.7742410898208618} 11/07/2021 02:54:25 - INFO - __main__ - Step 40419: {'lr': 0.00042137647211334007, 'samples': 7760448, 'steps': 40418, 'loss/train': 1.904790997505188} 11/07/2021 02:54:26 - INFO - __main__ - Step 40420: {'lr': 0.000421372608408855, 'samples': 7760640, 'steps': 40419, 'loss/train': 1.5416607856750488} 11/07/2021 02:54:26 - INFO - __main__ - Step 40421: {'lr': 0.0004213687446271522, 'samples': 7760832, 'steps': 40420, 'loss/train': 0.8516736030578613} 11/07/2021 02:54:27 - INFO - __main__ - Step 40422: {'lr': 0.0004213648807682332, 'samples': 7761024, 'steps': 40421, 'loss/train': 1.5297152996063232} 11/07/2021 02:54:27 - INFO - __main__ - Step 40423: {'lr': 0.00042136101683209993, 'samples': 7761216, 'steps': 40422, 'loss/train': 1.6230645179748535} 11/07/2021 02:54:27 - INFO - __main__ - Step 40424: {'lr': 0.00042135715281875393, 'samples': 7761408, 'steps': 40423, 'loss/train': 1.3259097337722778} 11/07/2021 02:54:28 - INFO - __main__ - Step 40425: {'lr': 0.000421353288728197, 'samples': 7761600, 'steps': 40424, 'loss/train': 1.081849455833435} 11/07/2021 02:54:29 - INFO - __main__ - Step 40426: {'lr': 0.00042134942456043104, 'samples': 7761792, 'steps': 40425, 'loss/train': 1.590003490447998} 11/07/2021 02:54:29 - INFO - __main__ - Step 40427: {'lr': 0.00042134556031545755, 'samples': 7761984, 'steps': 40426, 'loss/train': 1.2831095457077026} 11/07/2021 02:54:29 - INFO - __main__ - Step 40428: {'lr': 0.0004213416959932785, 'samples': 7762176, 'steps': 40427, 'loss/train': 1.867786169052124} 11/07/2021 02:54:30 - INFO - __main__ - Step 40429: {'lr': 0.0004213378315938955, 'samples': 7762368, 'steps': 40428, 'loss/train': 1.2048214673995972} 11/07/2021 02:54:30 - INFO - __main__ - Step 40430: {'lr': 0.0004213339671173103, 'samples': 7762560, 'steps': 40429, 'loss/train': 1.6742734909057617} 11/07/2021 02:54:31 - INFO - __main__ - Step 40431: {'lr': 0.00042133010256352466, 'samples': 7762752, 'steps': 40430, 'loss/train': 1.6833750009536743} 11/07/2021 02:54:32 - INFO - __main__ - Step 40432: {'lr': 0.00042132623793254034, 'samples': 7762944, 'steps': 40431, 'loss/train': 0.7925304770469666} 11/07/2021 02:54:32 - INFO - __main__ - Step 40433: {'lr': 0.0004213223732243591, 'samples': 7763136, 'steps': 40432, 'loss/train': 1.357794165611267} 11/07/2021 02:54:32 - INFO - __main__ - Step 40434: {'lr': 0.00042131850843898255, 'samples': 7763328, 'steps': 40433, 'loss/train': 1.5195480585098267} 11/07/2021 02:54:33 - INFO - __main__ - Step 40435: {'lr': 0.0004213146435764126, 'samples': 7763520, 'steps': 40434, 'loss/train': 0.5984975099563599} 11/07/2021 02:54:33 - INFO - __main__ - Step 40436: {'lr': 0.00042131077863665086, 'samples': 7763712, 'steps': 40435, 'loss/train': 1.5049656629562378} 11/07/2021 02:54:35 - INFO - __main__ - Step 40437: {'lr': 0.00042130691361969914, 'samples': 7763904, 'steps': 40436, 'loss/train': 0.2353384792804718} 11/07/2021 02:54:35 - INFO - __main__ - Step 40438: {'lr': 0.00042130304852555916, 'samples': 7764096, 'steps': 40437, 'loss/train': 1.421054482460022} 11/07/2021 02:54:35 - INFO - __main__ - Step 40439: {'lr': 0.00042129918335423265, 'samples': 7764288, 'steps': 40438, 'loss/train': 1.5775226354599} 11/07/2021 02:54:36 - INFO - __main__ - Step 40440: {'lr': 0.0004212953181057214, 'samples': 7764480, 'steps': 40439, 'loss/train': 1.0920753479003906} 11/07/2021 02:54:36 - INFO - __main__ - Step 40441: {'lr': 0.0004212914527800272, 'samples': 7764672, 'steps': 40440, 'loss/train': 0.48488837480545044} 11/07/2021 02:54:37 - INFO - __main__ - Step 40442: {'lr': 0.0004212875873771516, 'samples': 7764864, 'steps': 40441, 'loss/train': 0.532578706741333} 11/07/2021 02:54:37 - INFO - __main__ - Step 40443: {'lr': 0.0004212837218970965, 'samples': 7765056, 'steps': 40442, 'loss/train': 0.6872163414955139} 11/07/2021 02:54:38 - INFO - __main__ - Step 40444: {'lr': 0.00042127985633986365, 'samples': 7765248, 'steps': 40443, 'loss/train': 1.7978664636611938} 11/07/2021 02:54:38 - INFO - __main__ - Step 40445: {'lr': 0.0004212759907054546, 'samples': 7765440, 'steps': 40444, 'loss/train': 1.7988765239715576} 11/07/2021 02:54:39 - INFO - __main__ - Step 40446: {'lr': 0.00042127212499387136, 'samples': 7765632, 'steps': 40445, 'loss/train': 1.7115402221679688} 11/07/2021 02:54:40 - INFO - __main__ - Step 40447: {'lr': 0.0004212682592051155, 'samples': 7765824, 'steps': 40446, 'loss/train': 1.5423678159713745} 11/07/2021 02:54:40 - INFO - __main__ - Step 40448: {'lr': 0.0004212643933391888, 'samples': 7766016, 'steps': 40447, 'loss/train': 1.4345892667770386} 11/07/2021 02:54:40 - INFO - __main__ - Step 40449: {'lr': 0.000421260527396093, 'samples': 7766208, 'steps': 40448, 'loss/train': 1.547603726387024} 11/07/2021 02:54:41 - INFO - __main__ - Step 40450: {'lr': 0.0004212566613758299, 'samples': 7766400, 'steps': 40449, 'loss/train': 1.6221541166305542} 11/07/2021 02:54:41 - INFO - __main__ - Step 40451: {'lr': 0.00042125279527840124, 'samples': 7766592, 'steps': 40450, 'loss/train': 1.768631100654602} 11/07/2021 02:54:42 - INFO - __main__ - Step 40452: {'lr': 0.0004212489291038085, 'samples': 7766784, 'steps': 40451, 'loss/train': 1.3406453132629395} 11/07/2021 02:54:42 - INFO - __main__ - Step 40453: {'lr': 0.0004212450628520538, 'samples': 7766976, 'steps': 40452, 'loss/train': 1.6043791770935059} 11/07/2021 02:54:43 - INFO - __main__ - Step 40454: {'lr': 0.0004212411965231387, 'samples': 7767168, 'steps': 40453, 'loss/train': 1.3485994338989258} 11/07/2021 02:54:43 - INFO - __main__ - Step 40455: {'lr': 0.0004212373301170649, 'samples': 7767360, 'steps': 40454, 'loss/train': 1.7538222074508667} 11/07/2021 02:54:43 - INFO - __main__ - Step 40456: {'lr': 0.00042123346363383426, 'samples': 7767552, 'steps': 40455, 'loss/train': 2.2991220951080322} 11/07/2021 02:54:44 - INFO - __main__ - Step 40457: {'lr': 0.0004212295970734484, 'samples': 7767744, 'steps': 40456, 'loss/train': 1.1079826354980469} 11/07/2021 02:54:45 - INFO - __main__ - Step 40458: {'lr': 0.00042122573043590925, 'samples': 7767936, 'steps': 40457, 'loss/train': 1.0960724353790283} 11/07/2021 02:54:45 - INFO - __main__ - Step 40459: {'lr': 0.0004212218637212183, 'samples': 7768128, 'steps': 40458, 'loss/train': 1.659537672996521} 11/07/2021 02:54:45 - INFO - __main__ - Step 40460: {'lr': 0.00042121799692937747, 'samples': 7768320, 'steps': 40459, 'loss/train': 1.698103666305542} 11/07/2021 02:54:46 - INFO - __main__ - Step 40461: {'lr': 0.00042121413006038845, 'samples': 7768512, 'steps': 40460, 'loss/train': 1.0684001445770264} 11/07/2021 02:54:47 - INFO - __main__ - Step 40462: {'lr': 0.000421210263114253, 'samples': 7768704, 'steps': 40461, 'loss/train': 1.7173820734024048} 11/07/2021 02:54:47 - INFO - __main__ - Step 40463: {'lr': 0.00042120639609097277, 'samples': 7768896, 'steps': 40462, 'loss/train': 1.9047561883926392} 11/07/2021 02:54:47 - INFO - __main__ - Step 40464: {'lr': 0.0004212025289905497, 'samples': 7769088, 'steps': 40463, 'loss/train': 1.4366101026535034} 11/07/2021 02:54:48 - INFO - __main__ - Step 40465: {'lr': 0.0004211986618129854, 'samples': 7769280, 'steps': 40464, 'loss/train': 1.5336722135543823} 11/07/2021 02:54:48 - INFO - __main__ - Step 40466: {'lr': 0.00042119479455828153, 'samples': 7769472, 'steps': 40465, 'loss/train': 1.2135385274887085} 11/07/2021 02:54:48 - INFO - __main__ - Step 40467: {'lr': 0.00042119092722644, 'samples': 7769664, 'steps': 40466, 'loss/train': 1.1936951875686646} 11/07/2021 02:54:50 - INFO - __main__ - Step 40468: {'lr': 0.0004211870598174624, 'samples': 7769856, 'steps': 40467, 'loss/train': 1.0657728910446167} 11/07/2021 02:54:50 - INFO - __main__ - Step 40469: {'lr': 0.0004211831923313506, 'samples': 7770048, 'steps': 40468, 'loss/train': 1.6648720502853394} 11/07/2021 02:54:51 - INFO - __main__ - Step 40470: {'lr': 0.0004211793247681064, 'samples': 7770240, 'steps': 40469, 'loss/train': 1.7868831157684326} 11/07/2021 02:54:51 - INFO - __main__ - Step 40471: {'lr': 0.0004211754571277313, 'samples': 7770432, 'steps': 40470, 'loss/train': 1.7979527711868286} 11/07/2021 02:54:51 - INFO - __main__ - Step 40472: {'lr': 0.0004211715894102272, 'samples': 7770624, 'steps': 40471, 'loss/train': 1.8728834390640259} 11/07/2021 02:54:52 - INFO - __main__ - Step 40473: {'lr': 0.00042116772161559585, 'samples': 7770816, 'steps': 40472, 'loss/train': 1.4813945293426514} 11/07/2021 02:54:53 - INFO - __main__ - Step 40474: {'lr': 0.0004211638537438389, 'samples': 7771008, 'steps': 40473, 'loss/train': 1.855363368988037} 11/07/2021 02:54:53 - INFO - __main__ - Step 40475: {'lr': 0.0004211599857949583, 'samples': 7771200, 'steps': 40474, 'loss/train': 1.609128713607788} 11/07/2021 02:54:53 - INFO - __main__ - Step 40476: {'lr': 0.00042115611776895556, 'samples': 7771392, 'steps': 40475, 'loss/train': 0.9565662741661072} 11/07/2021 02:54:54 - INFO - __main__ - Step 40477: {'lr': 0.00042115224966583255, 'samples': 7771584, 'steps': 40476, 'loss/train': 1.084002137184143} 11/07/2021 02:54:55 - INFO - __main__ - Step 40478: {'lr': 0.00042114838148559093, 'samples': 7771776, 'steps': 40477, 'loss/train': 1.3256856203079224} 11/07/2021 02:54:55 - INFO - __main__ - Step 40479: {'lr': 0.0004211445132282325, 'samples': 7771968, 'steps': 40478, 'loss/train': 1.4972087144851685} 11/07/2021 02:54:55 - INFO - __main__ - Step 40480: {'lr': 0.000421140644893759, 'samples': 7772160, 'steps': 40479, 'loss/train': 0.7012211680412292} 11/07/2021 02:54:56 - INFO - __main__ - Step 40481: {'lr': 0.0004211367764821722, 'samples': 7772352, 'steps': 40480, 'loss/train': 1.6657418012619019} 11/07/2021 02:54:56 - INFO - __main__ - Step 40482: {'lr': 0.00042113290799347376, 'samples': 7772544, 'steps': 40481, 'loss/train': 1.779977560043335} 11/07/2021 02:54:57 - INFO - __main__ - Step 40483: {'lr': 0.00042112903942766546, 'samples': 7772736, 'steps': 40482, 'loss/train': 2.0749664306640625} 11/07/2021 02:54:58 - INFO - __main__ - Step 40484: {'lr': 0.00042112517078474914, 'samples': 7772928, 'steps': 40483, 'loss/train': 1.6234458684921265} 11/07/2021 02:54:58 - INFO - __main__ - Step 40485: {'lr': 0.0004211213020647264, 'samples': 7773120, 'steps': 40484, 'loss/train': 1.8211755752563477} 11/07/2021 02:54:58 - INFO - __main__ - Step 40486: {'lr': 0.00042111743326759903, 'samples': 7773312, 'steps': 40485, 'loss/train': 1.5641697645187378} 11/07/2021 02:54:59 - INFO - __main__ - Step 40487: {'lr': 0.00042111356439336877, 'samples': 7773504, 'steps': 40486, 'loss/train': 1.8205519914627075} 11/07/2021 02:54:59 - INFO - __main__ - Step 40488: {'lr': 0.0004211096954420375, 'samples': 7773696, 'steps': 40487, 'loss/train': 1.2478127479553223} 11/07/2021 02:55:00 - INFO - __main__ - Step 40489: {'lr': 0.0004211058264136067, 'samples': 7773888, 'steps': 40488, 'loss/train': 1.0244457721710205} 11/07/2021 02:55:00 - INFO - __main__ - Step 40490: {'lr': 0.0004211019573080783, 'samples': 7774080, 'steps': 40489, 'loss/train': 1.2829538583755493} 11/07/2021 02:55:01 - INFO - __main__ - Step 40491: {'lr': 0.00042109808812545405, 'samples': 7774272, 'steps': 40490, 'loss/train': 1.4776816368103027} 11/07/2021 02:55:01 - INFO - __main__ - Step 40492: {'lr': 0.0004210942188657356, 'samples': 7774464, 'steps': 40491, 'loss/train': 1.0204192399978638} 11/07/2021 02:55:01 - INFO - __main__ - Step 40493: {'lr': 0.00042109034952892473, 'samples': 7774656, 'steps': 40492, 'loss/train': 1.4715652465820312} 11/07/2021 02:55:03 - INFO - __main__ - Step 40494: {'lr': 0.00042108648011502314, 'samples': 7774848, 'steps': 40493, 'loss/train': 1.7918028831481934} 11/07/2021 02:55:03 - INFO - __main__ - Step 40495: {'lr': 0.00042108261062403276, 'samples': 7775040, 'steps': 40494, 'loss/train': 1.0174304246902466} 11/07/2021 02:55:03 - INFO - __main__ - Step 40496: {'lr': 0.00042107874105595507, 'samples': 7775232, 'steps': 40495, 'loss/train': 0.7235434055328369} 11/07/2021 02:55:04 - INFO - __main__ - Step 40497: {'lr': 0.00042107487141079206, 'samples': 7775424, 'steps': 40496, 'loss/train': 1.3755961656570435} 11/07/2021 02:55:04 - INFO - __main__ - Step 40498: {'lr': 0.00042107100168854516, 'samples': 7775616, 'steps': 40497, 'loss/train': 1.5656465291976929} 11/07/2021 02:55:04 - INFO - __main__ - Step 40499: {'lr': 0.00042106713188921647, 'samples': 7775808, 'steps': 40498, 'loss/train': 1.7913703918457031} 11/07/2021 02:55:06 - INFO - __main__ - Step 40500: {'lr': 0.00042106326201280756, 'samples': 7776000, 'steps': 40499, 'loss/train': 0.22071810066699982} 11/07/2021 02:55:06 - INFO - __main__ - Step 40501: {'lr': 0.0004210593920593201, 'samples': 7776192, 'steps': 40500, 'loss/train': 0.8095206618309021} 11/07/2021 02:55:07 - INFO - __main__ - Step 40502: {'lr': 0.000421055522028756, 'samples': 7776384, 'steps': 40501, 'loss/train': 0.4188019335269928} 11/07/2021 02:55:07 - INFO - __main__ - Step 40503: {'lr': 0.00042105165192111684, 'samples': 7776576, 'steps': 40502, 'loss/train': 1.5947679281234741} 11/07/2021 02:55:07 - INFO - __main__ - Step 40504: {'lr': 0.00042104778173640453, 'samples': 7776768, 'steps': 40503, 'loss/train': 1.7064250707626343} 11/07/2021 02:55:08 - INFO - __main__ - Step 40505: {'lr': 0.0004210439114746206, 'samples': 7776960, 'steps': 40504, 'loss/train': 1.4955432415008545} 11/07/2021 02:55:09 - INFO - __main__ - Step 40506: {'lr': 0.00042104004113576707, 'samples': 7777152, 'steps': 40505, 'loss/train': 1.7106342315673828} 11/07/2021 02:55:09 - INFO - __main__ - Step 40507: {'lr': 0.00042103617071984544, 'samples': 7777344, 'steps': 40506, 'loss/train': 1.195376992225647} 11/07/2021 02:55:09 - INFO - __main__ - Step 40508: {'lr': 0.00042103230022685765, 'samples': 7777536, 'steps': 40507, 'loss/train': 1.639768123626709} 11/07/2021 02:55:10 - INFO - __main__ - Step 40509: {'lr': 0.0004210284296568052, 'samples': 7777728, 'steps': 40508, 'loss/train': 1.5711511373519897} 11/07/2021 02:55:11 - INFO - __main__ - Step 40510: {'lr': 0.0004210245590096901, 'samples': 7777920, 'steps': 40509, 'loss/train': 1.082285761833191} 11/07/2021 02:55:11 - INFO - __main__ - Step 40511: {'lr': 0.000421020688285514, 'samples': 7778112, 'steps': 40510, 'loss/train': 1.4846168756484985} 11/07/2021 02:55:11 - INFO - __main__ - Step 40512: {'lr': 0.0004210168174842785, 'samples': 7778304, 'steps': 40511, 'loss/train': 1.407073974609375} 11/07/2021 02:55:12 - INFO - __main__ - Step 40513: {'lr': 0.00042101294660598556, 'samples': 7778496, 'steps': 40512, 'loss/train': 1.8460878133773804} 11/07/2021 02:55:12 - INFO - __main__ - Step 40514: {'lr': 0.0004210090756506367, 'samples': 7778688, 'steps': 40513, 'loss/train': 1.498748779296875} 11/07/2021 02:55:13 - INFO - __main__ - Step 40515: {'lr': 0.0004210052046182339, 'samples': 7778880, 'steps': 40514, 'loss/train': 1.8797236680984497} 11/07/2021 02:55:14 - INFO - __main__ - Step 40516: {'lr': 0.0004210013335087787, 'samples': 7779072, 'steps': 40515, 'loss/train': 1.5167568922042847} 11/07/2021 02:55:14 - INFO - __main__ - Step 40517: {'lr': 0.000420997462322273, 'samples': 7779264, 'steps': 40516, 'loss/train': 1.554261326789856} 11/07/2021 02:55:14 - INFO - __main__ - Step 40518: {'lr': 0.00042099359105871856, 'samples': 7779456, 'steps': 40517, 'loss/train': 0.995050311088562} 11/07/2021 02:55:15 - INFO - __main__ - Step 40519: {'lr': 0.00042098971971811695, 'samples': 7779648, 'steps': 40518, 'loss/train': 1.330376148223877} 11/07/2021 02:55:16 - INFO - __main__ - Step 40520: {'lr': 0.00042098584830047004, 'samples': 7779840, 'steps': 40519, 'loss/train': 1.555248498916626} 11/07/2021 02:55:16 - INFO - __main__ - Step 40521: {'lr': 0.00042098197680577956, 'samples': 7780032, 'steps': 40520, 'loss/train': 1.7325122356414795} 11/07/2021 02:55:16 - INFO - __main__ - Step 40522: {'lr': 0.00042097810523404714, 'samples': 7780224, 'steps': 40521, 'loss/train': 1.4902366399765015} 11/07/2021 02:55:17 - INFO - __main__ - Step 40523: {'lr': 0.0004209742335852747, 'samples': 7780416, 'steps': 40522, 'loss/train': 1.586439609527588} 11/07/2021 02:55:17 - INFO - __main__ - Step 40524: {'lr': 0.0004209703618594639, 'samples': 7780608, 'steps': 40523, 'loss/train': 1.4793297052383423} 11/07/2021 02:55:17 - INFO - __main__ - Step 40525: {'lr': 0.00042096649005661654, 'samples': 7780800, 'steps': 40524, 'loss/train': 1.5533337593078613} 11/07/2021 02:55:18 - INFO - __main__ - Step 40526: {'lr': 0.00042096261817673423, 'samples': 7780992, 'steps': 40525, 'loss/train': 1.506975769996643} 11/07/2021 02:55:19 - INFO - __main__ - Step 40527: {'lr': 0.0004209587462198189, 'samples': 7781184, 'steps': 40526, 'loss/train': 1.802403450012207} 11/07/2021 02:55:19 - INFO - __main__ - Step 40528: {'lr': 0.0004209548741858721, 'samples': 7781376, 'steps': 40527, 'loss/train': 1.5555566549301147} 11/07/2021 02:55:19 - INFO - __main__ - Step 40529: {'lr': 0.00042095100207489573, 'samples': 7781568, 'steps': 40528, 'loss/train': 1.3441355228424072} 11/07/2021 02:55:20 - INFO - __main__ - Step 40530: {'lr': 0.0004209471298868914, 'samples': 7781760, 'steps': 40529, 'loss/train': 1.8669350147247314} 11/07/2021 02:55:21 - INFO - __main__ - Step 40531: {'lr': 0.00042094325762186103, 'samples': 7781952, 'steps': 40530, 'loss/train': 1.5020253658294678} 11/07/2021 02:55:21 - INFO - __main__ - Step 40532: {'lr': 0.0004209393852798062, 'samples': 7782144, 'steps': 40531, 'loss/train': 1.7007616758346558} 11/07/2021 02:55:21 - INFO - __main__ - Step 40533: {'lr': 0.00042093551286072887, 'samples': 7782336, 'steps': 40532, 'loss/train': 1.5439091920852661} 11/07/2021 02:55:22 - INFO - __main__ - Step 40534: {'lr': 0.00042093164036463045, 'samples': 7782528, 'steps': 40533, 'loss/train': 1.5337989330291748} 11/07/2021 02:55:22 - INFO - __main__ - Step 40535: {'lr': 0.0004209277677915129, 'samples': 7782720, 'steps': 40534, 'loss/train': 1.3965024948120117} 11/07/2021 02:55:24 - INFO - __main__ - Step 40536: {'lr': 0.000420923895141378, 'samples': 7782912, 'steps': 40535, 'loss/train': 1.4856348037719727} 11/07/2021 02:55:24 - INFO - __main__ - Step 40537: {'lr': 0.0004209200224142274, 'samples': 7783104, 'steps': 40536, 'loss/train': 1.4512770175933838} 11/07/2021 02:55:24 - INFO - __main__ - Step 40538: {'lr': 0.0004209161496100629, 'samples': 7783296, 'steps': 40537, 'loss/train': 0.8187178373336792} 11/07/2021 02:55:25 - INFO - __main__ - Step 40539: {'lr': 0.00042091227672888624, 'samples': 7783488, 'steps': 40538, 'loss/train': 1.7245628833770752} 11/07/2021 02:55:25 - INFO - __main__ - Step 40540: {'lr': 0.00042090840377069906, 'samples': 7783680, 'steps': 40539, 'loss/train': 1.4943673610687256} 11/07/2021 02:55:25 - INFO - __main__ - Step 40541: {'lr': 0.00042090453073550323, 'samples': 7783872, 'steps': 40540, 'loss/train': 1.4821431636810303} 11/07/2021 02:55:26 - INFO - __main__ - Step 40542: {'lr': 0.0004209006576233004, 'samples': 7784064, 'steps': 40541, 'loss/train': 1.2041219472885132} 11/07/2021 02:55:27 - INFO - __main__ - Step 40543: {'lr': 0.0004208967844340925, 'samples': 7784256, 'steps': 40542, 'loss/train': 1.1263279914855957} 11/07/2021 02:55:27 - INFO - __main__ - Step 40544: {'lr': 0.0004208929111678811, 'samples': 7784448, 'steps': 40543, 'loss/train': 1.6190506219863892} 11/07/2021 02:55:28 - INFO - __main__ - Step 40545: {'lr': 0.0004208890378246679, 'samples': 7784640, 'steps': 40544, 'loss/train': 1.5942102670669556} 11/07/2021 02:55:28 - INFO - __main__ - Step 40546: {'lr': 0.00042088516440445486, 'samples': 7784832, 'steps': 40545, 'loss/train': 0.7566236853599548} 11/07/2021 02:55:29 - INFO - __main__ - Step 40547: {'lr': 0.0004208812909072435, 'samples': 7785024, 'steps': 40546, 'loss/train': 1.7903724908828735} 11/07/2021 02:55:29 - INFO - __main__ - Step 40548: {'lr': 0.00042087741733303575, 'samples': 7785216, 'steps': 40547, 'loss/train': 1.1815096139907837} 11/07/2021 02:55:30 - INFO - __main__ - Step 40549: {'lr': 0.00042087354368183316, 'samples': 7785408, 'steps': 40548, 'loss/train': 1.5147377252578735} 11/07/2021 02:55:30 - INFO - __main__ - Step 40550: {'lr': 0.00042086966995363774, 'samples': 7785600, 'steps': 40549, 'loss/train': 1.6385198831558228} 11/07/2021 02:55:30 - INFO - __main__ - Step 40551: {'lr': 0.000420865796148451, 'samples': 7785792, 'steps': 40550, 'loss/train': 1.3999335765838623} 11/07/2021 02:55:31 - INFO - __main__ - Step 40552: {'lr': 0.00042086192226627476, 'samples': 7785984, 'steps': 40551, 'loss/train': 0.5500167608261108} 11/07/2021 02:55:32 - INFO - __main__ - Step 40553: {'lr': 0.00042085804830711084, 'samples': 7786176, 'steps': 40552, 'loss/train': 1.6004263162612915} 11/07/2021 02:55:32 - INFO - __main__ - Step 40554: {'lr': 0.00042085417427096085, 'samples': 7786368, 'steps': 40553, 'loss/train': 1.3719244003295898} 11/07/2021 02:55:32 - INFO - __main__ - Step 40555: {'lr': 0.0004208503001578266, 'samples': 7786560, 'steps': 40554, 'loss/train': 1.30928373336792} 11/07/2021 02:55:33 - INFO - __main__ - Step 40556: {'lr': 0.00042084642596770984, 'samples': 7786752, 'steps': 40555, 'loss/train': 1.6300101280212402} 11/07/2021 02:55:34 - INFO - __main__ - Step 40557: {'lr': 0.0004208425517006124, 'samples': 7786944, 'steps': 40556, 'loss/train': 1.650418758392334} 11/07/2021 02:55:34 - INFO - __main__ - Step 40558: {'lr': 0.0004208386773565359, 'samples': 7787136, 'steps': 40557, 'loss/train': 1.234565258026123} 11/07/2021 02:55:34 - INFO - __main__ - Step 40559: {'lr': 0.0004208348029354821, 'samples': 7787328, 'steps': 40558, 'loss/train': 1.5747935771942139} 11/07/2021 02:55:35 - INFO - __main__ - Step 40560: {'lr': 0.00042083092843745275, 'samples': 7787520, 'steps': 40559, 'loss/train': 1.7562403678894043} 11/07/2021 02:55:35 - INFO - __main__ - Step 40561: {'lr': 0.0004208270538624497, 'samples': 7787712, 'steps': 40560, 'loss/train': 1.1333563327789307} 11/07/2021 02:55:36 - INFO - __main__ - Step 40562: {'lr': 0.00042082317921047455, 'samples': 7787904, 'steps': 40561, 'loss/train': 1.908737063407898} 11/07/2021 02:55:37 - INFO - __main__ - Step 40563: {'lr': 0.0004208193044815291, 'samples': 7788096, 'steps': 40562, 'loss/train': 1.2895712852478027} 11/07/2021 02:55:37 - INFO - __main__ - Step 40564: {'lr': 0.0004208154296756152, 'samples': 7788288, 'steps': 40563, 'loss/train': 1.5736141204833984} 11/07/2021 02:55:37 - INFO - __main__ - Step 40565: {'lr': 0.0004208115547927345, 'samples': 7788480, 'steps': 40564, 'loss/train': 1.2707369327545166} 11/07/2021 02:55:38 - INFO - __main__ - Step 40566: {'lr': 0.0004208076798328886, 'samples': 7788672, 'steps': 40565, 'loss/train': 2.010261058807373} 11/07/2021 02:55:38 - INFO - __main__ - Step 40567: {'lr': 0.00042080380479607947, 'samples': 7788864, 'steps': 40566, 'loss/train': 1.1383821964263916} 11/07/2021 02:55:39 - INFO - __main__ - Step 40568: {'lr': 0.00042079992968230886, 'samples': 7789056, 'steps': 40567, 'loss/train': 1.415386438369751} 11/07/2021 02:55:39 - INFO - __main__ - Step 40569: {'lr': 0.0004207960544915784, 'samples': 7789248, 'steps': 40568, 'loss/train': 1.236395239830017} 11/07/2021 02:55:40 - INFO - __main__ - Step 40570: {'lr': 0.0004207921792238898, 'samples': 7789440, 'steps': 40569, 'loss/train': 1.6369279623031616} 11/07/2021 02:55:40 - INFO - __main__ - Step 40571: {'lr': 0.0004207883038792449, 'samples': 7789632, 'steps': 40570, 'loss/train': 1.3528670072555542} 11/07/2021 02:55:41 - INFO - __main__ - Step 40572: {'lr': 0.0004207844284576455, 'samples': 7789824, 'steps': 40571, 'loss/train': 1.7188644409179688} 11/07/2021 02:55:42 - INFO - __main__ - Step 40573: {'lr': 0.0004207805529590932, 'samples': 7790016, 'steps': 40572, 'loss/train': 1.762956976890564} 11/07/2021 02:55:42 - INFO - __main__ - Step 40574: {'lr': 0.0004207766773835899, 'samples': 7790208, 'steps': 40573, 'loss/train': 1.1353379487991333} 11/07/2021 02:55:43 - INFO - __main__ - Step 40575: {'lr': 0.0004207728017311372, 'samples': 7790400, 'steps': 40574, 'loss/train': 0.41304847598075867} 11/07/2021 02:55:43 - INFO - __main__ - Step 40576: {'lr': 0.0004207689260017369, 'samples': 7790592, 'steps': 40575, 'loss/train': 1.788718581199646} 11/07/2021 02:55:43 - INFO - __main__ - Step 40577: {'lr': 0.0004207650501953908, 'samples': 7790784, 'steps': 40576, 'loss/train': 1.785046100616455} 11/07/2021 02:55:44 - INFO - __main__ - Step 40578: {'lr': 0.0004207611743121006, 'samples': 7790976, 'steps': 40577, 'loss/train': 1.5799282789230347} 11/07/2021 02:55:45 - INFO - __main__ - Step 40579: {'lr': 0.00042075729835186807, 'samples': 7791168, 'steps': 40578, 'loss/train': 1.4899252653121948} 11/07/2021 02:55:45 - INFO - __main__ - Step 40580: {'lr': 0.0004207534223146948, 'samples': 7791360, 'steps': 40579, 'loss/train': 1.3295331001281738} 11/07/2021 02:55:45 - INFO - __main__ - Step 40581: {'lr': 0.0004207495462005828, 'samples': 7791552, 'steps': 40580, 'loss/train': 1.9620747566223145} 11/07/2021 02:55:46 - INFO - __main__ - Step 40582: {'lr': 0.0004207456700095337, 'samples': 7791744, 'steps': 40581, 'loss/train': 1.3816947937011719} 11/07/2021 02:55:47 - INFO - __main__ - Step 40583: {'lr': 0.0004207417937415492, 'samples': 7791936, 'steps': 40582, 'loss/train': 1.445510745048523} 11/07/2021 02:55:47 - INFO - __main__ - Step 40584: {'lr': 0.000420737917396631, 'samples': 7792128, 'steps': 40583, 'loss/train': 1.6489579677581787} 11/07/2021 02:55:48 - INFO - __main__ - Step 40585: {'lr': 0.00042073404097478105, 'samples': 7792320, 'steps': 40584, 'loss/train': 1.4975324869155884} 11/07/2021 02:55:48 - INFO - __main__ - Step 40586: {'lr': 0.000420730164476001, 'samples': 7792512, 'steps': 40585, 'loss/train': 2.274691104888916} 11/07/2021 02:55:48 - INFO - __main__ - Step 40587: {'lr': 0.00042072628790029243, 'samples': 7792704, 'steps': 40586, 'loss/train': 1.5260604619979858} 11/07/2021 02:55:49 - INFO - __main__ - Step 40588: {'lr': 0.0004207224112476573, 'samples': 7792896, 'steps': 40587, 'loss/train': 1.7950011491775513} 11/07/2021 02:55:50 - INFO - __main__ - Step 40589: {'lr': 0.0004207185345180973, 'samples': 7793088, 'steps': 40588, 'loss/train': 1.7205456495285034} 11/07/2021 02:55:50 - INFO - __main__ - Step 40590: {'lr': 0.00042071465771161416, 'samples': 7793280, 'steps': 40589, 'loss/train': 1.4447256326675415} 11/07/2021 02:55:50 - INFO - __main__ - Step 40591: {'lr': 0.0004207107808282097, 'samples': 7793472, 'steps': 40590, 'loss/train': 1.3997273445129395} 11/07/2021 02:55:51 - INFO - __main__ - Step 40592: {'lr': 0.00042070690386788545, 'samples': 7793664, 'steps': 40591, 'loss/train': 1.5835875272750854} 11/07/2021 02:55:52 - INFO - __main__ - Step 40593: {'lr': 0.0004207030268306434, 'samples': 7793856, 'steps': 40592, 'loss/train': 1.679340124130249} 11/07/2021 02:55:52 - INFO - __main__ - Step 40594: {'lr': 0.00042069914971648516, 'samples': 7794048, 'steps': 40593, 'loss/train': 1.6571979522705078} 11/07/2021 02:55:53 - INFO - __main__ - Step 40595: {'lr': 0.0004206952725254125, 'samples': 7794240, 'steps': 40594, 'loss/train': 1.4698810577392578} 11/07/2021 02:55:53 - INFO - __main__ - Step 40596: {'lr': 0.00042069139525742727, 'samples': 7794432, 'steps': 40595, 'loss/train': 1.6492644548416138} 11/07/2021 02:55:53 - INFO - __main__ - Step 40597: {'lr': 0.000420687517912531, 'samples': 7794624, 'steps': 40596, 'loss/train': 1.3679529428482056} 11/07/2021 02:55:54 - INFO - __main__ - Step 40598: {'lr': 0.0004206836404907257, 'samples': 7794816, 'steps': 40597, 'loss/train': 1.3563146591186523} 11/07/2021 02:55:55 - INFO - __main__ - Step 40599: {'lr': 0.0004206797629920129, 'samples': 7795008, 'steps': 40598, 'loss/train': 1.6323126554489136} 11/07/2021 02:55:55 - INFO - __main__ - Step 40600: {'lr': 0.0004206758854163945, 'samples': 7795200, 'steps': 40599, 'loss/train': 1.3663840293884277} 11/07/2021 02:55:55 - INFO - __main__ - Step 40601: {'lr': 0.00042067200776387215, 'samples': 7795392, 'steps': 40600, 'loss/train': 1.5179119110107422} 11/07/2021 02:55:56 - INFO - __main__ - Step 40602: {'lr': 0.0004206681300344476, 'samples': 7795584, 'steps': 40601, 'loss/train': 1.5285123586654663} 11/07/2021 02:55:56 - INFO - __main__ - Step 40603: {'lr': 0.0004206642522281227, 'samples': 7795776, 'steps': 40602, 'loss/train': 1.7061302661895752} 11/07/2021 02:55:57 - INFO - __main__ - Step 40604: {'lr': 0.000420660374344899, 'samples': 7795968, 'steps': 40603, 'loss/train': 1.711515188217163} 11/07/2021 02:55:57 - INFO - __main__ - Step 40605: {'lr': 0.00042065649638477843, 'samples': 7796160, 'steps': 40604, 'loss/train': 1.5275837182998657} 11/07/2021 02:55:58 - INFO - __main__ - Step 40606: {'lr': 0.0004206526183477627, 'samples': 7796352, 'steps': 40605, 'loss/train': 1.5344431400299072} 11/07/2021 02:55:58 - INFO - __main__ - Step 40607: {'lr': 0.0004206487402338535, 'samples': 7796544, 'steps': 40606, 'loss/train': 1.5935360193252563} 11/07/2021 02:55:59 - INFO - __main__ - Step 40608: {'lr': 0.00042064486204305263, 'samples': 7796736, 'steps': 40607, 'loss/train': 1.5178614854812622} 11/07/2021 02:56:00 - INFO - __main__ - Step 40609: {'lr': 0.0004206409837753618, 'samples': 7796928, 'steps': 40608, 'loss/train': 1.832965612411499} 11/07/2021 02:56:00 - INFO - __main__ - Step 40610: {'lr': 0.00042063710543078283, 'samples': 7797120, 'steps': 40609, 'loss/train': 1.4966989755630493} 11/07/2021 02:56:00 - INFO - __main__ - Step 40611: {'lr': 0.00042063322700931733, 'samples': 7797312, 'steps': 40610, 'loss/train': 1.3636682033538818} 11/07/2021 02:56:01 - INFO - __main__ - Step 40612: {'lr': 0.0004206293485109672, 'samples': 7797504, 'steps': 40611, 'loss/train': 2.53975248336792} 11/07/2021 02:56:01 - INFO - __main__ - Step 40613: {'lr': 0.0004206254699357341, 'samples': 7797696, 'steps': 40612, 'loss/train': 1.1263415813446045} 11/07/2021 02:56:02 - INFO - __main__ - Step 40614: {'lr': 0.00042062159128361976, 'samples': 7797888, 'steps': 40613, 'loss/train': 2.1661019325256348} 11/07/2021 02:56:02 - INFO - __main__ - Step 40615: {'lr': 0.000420617712554626, 'samples': 7798080, 'steps': 40614, 'loss/train': 1.3241528272628784} 11/07/2021 02:56:03 - INFO - __main__ - Step 40616: {'lr': 0.0004206138337487545, 'samples': 7798272, 'steps': 40615, 'loss/train': 0.6974542140960693} 11/07/2021 02:56:03 - INFO - __main__ - Step 40617: {'lr': 0.0004206099548660071, 'samples': 7798464, 'steps': 40616, 'loss/train': 1.47562575340271} 11/07/2021 02:56:03 - INFO - __main__ - Step 40618: {'lr': 0.00042060607590638547, 'samples': 7798656, 'steps': 40617, 'loss/train': 1.7578154802322388} 11/07/2021 02:56:04 - INFO - __main__ - Step 40619: {'lr': 0.00042060219686989133, 'samples': 7798848, 'steps': 40618, 'loss/train': 1.2786564826965332} 11/07/2021 02:56:05 - INFO - __main__ - Step 40620: {'lr': 0.00042059831775652644, 'samples': 7799040, 'steps': 40619, 'loss/train': 1.3147251605987549} 11/07/2021 02:56:05 - INFO - __main__ - Step 40621: {'lr': 0.00042059443856629265, 'samples': 7799232, 'steps': 40620, 'loss/train': 1.4205396175384521} 11/07/2021 02:56:06 - INFO - __main__ - Step 40622: {'lr': 0.00042059055929919163, 'samples': 7799424, 'steps': 40621, 'loss/train': 1.4647282361984253} 11/07/2021 02:56:06 - INFO - __main__ - Step 40623: {'lr': 0.00042058667995522513, 'samples': 7799616, 'steps': 40622, 'loss/train': 0.836337149143219} 11/07/2021 02:56:07 - INFO - __main__ - Step 40624: {'lr': 0.0004205828005343949, 'samples': 7799808, 'steps': 40623, 'loss/train': 1.6595656871795654} 11/07/2021 02:56:07 - INFO - __main__ - Step 40625: {'lr': 0.00042057892103670275, 'samples': 7800000, 'steps': 40624, 'loss/train': 1.6615570783615112} 11/07/2021 02:56:08 - INFO - __main__ - Step 40626: {'lr': 0.0004205750414621503, 'samples': 7800192, 'steps': 40625, 'loss/train': 1.3805701732635498} 11/07/2021 02:56:08 - INFO - __main__ - Step 40627: {'lr': 0.0004205711618107394, 'samples': 7800384, 'steps': 40626, 'loss/train': 2.026540517807007} 11/07/2021 02:56:08 - INFO - __main__ - Step 40628: {'lr': 0.00042056728208247175, 'samples': 7800576, 'steps': 40627, 'loss/train': 1.489866852760315} 11/07/2021 02:56:09 - INFO - __main__ - Step 40629: {'lr': 0.0004205634022773491, 'samples': 7800768, 'steps': 40628, 'loss/train': 1.2569215297698975} 11/07/2021 02:56:10 - INFO - __main__ - Step 40630: {'lr': 0.0004205595223953732, 'samples': 7800960, 'steps': 40629, 'loss/train': 1.34797203540802} 11/07/2021 02:56:10 - INFO - __main__ - Step 40631: {'lr': 0.0004205556424365459, 'samples': 7801152, 'steps': 40630, 'loss/train': 1.273876428604126} 11/07/2021 02:56:10 - INFO - __main__ - Step 40632: {'lr': 0.0004205517624008688, 'samples': 7801344, 'steps': 40631, 'loss/train': 1.5526227951049805} 11/07/2021 02:56:11 - INFO - __main__ - Step 40633: {'lr': 0.00042054788228834374, 'samples': 7801536, 'steps': 40632, 'loss/train': 1.3163074254989624} 11/07/2021 02:56:11 - INFO - __main__ - Step 40634: {'lr': 0.0004205440020989724, 'samples': 7801728, 'steps': 40633, 'loss/train': 1.474863886833191} 11/07/2021 02:56:12 - INFO - __main__ - Step 40635: {'lr': 0.0004205401218327565, 'samples': 7801920, 'steps': 40634, 'loss/train': 1.6535274982452393} 11/07/2021 02:56:13 - INFO - __main__ - Step 40636: {'lr': 0.0004205362414896979, 'samples': 7802112, 'steps': 40635, 'loss/train': 1.746715784072876} 11/07/2021 02:56:13 - INFO - __main__ - Step 40637: {'lr': 0.0004205323610697984, 'samples': 7802304, 'steps': 40636, 'loss/train': 1.7244786024093628} 11/07/2021 02:56:13 - INFO - __main__ - Step 40638: {'lr': 0.0004205284805730596, 'samples': 7802496, 'steps': 40637, 'loss/train': 1.8182399272918701} 11/07/2021 02:56:14 - INFO - __main__ - Step 40639: {'lr': 0.00042052459999948323, 'samples': 7802688, 'steps': 40638, 'loss/train': 1.222219467163086} 11/07/2021 02:56:14 - INFO - __main__ - Step 40640: {'lr': 0.00042052071934907116, 'samples': 7802880, 'steps': 40639, 'loss/train': 1.7510050535202026} 11/07/2021 02:56:15 - INFO - __main__ - Step 40641: {'lr': 0.00042051683862182504, 'samples': 7803072, 'steps': 40640, 'loss/train': 1.3503400087356567} 11/07/2021 02:56:15 - INFO - __main__ - Step 40642: {'lr': 0.0004205129578177467, 'samples': 7803264, 'steps': 40641, 'loss/train': 1.3796998262405396} 11/07/2021 02:56:16 - INFO - __main__ - Step 40643: {'lr': 0.0004205090769368379, 'samples': 7803456, 'steps': 40642, 'loss/train': 1.7264069318771362} 11/07/2021 02:56:16 - INFO - __main__ - Step 40644: {'lr': 0.00042050519597910024, 'samples': 7803648, 'steps': 40643, 'loss/train': 1.688590168952942} 11/07/2021 02:56:16 - INFO - __main__ - Step 40645: {'lr': 0.00042050131494453567, 'samples': 7803840, 'steps': 40644, 'loss/train': 1.254162311553955} 11/07/2021 02:56:18 - INFO - __main__ - Step 40646: {'lr': 0.00042049743383314577, 'samples': 7804032, 'steps': 40645, 'loss/train': 1.396903395652771} 11/07/2021 02:56:18 - INFO - __main__ - Step 40647: {'lr': 0.0004204935526449324, 'samples': 7804224, 'steps': 40646, 'loss/train': 1.4807292222976685} 11/07/2021 02:56:18 - INFO - __main__ - Step 40648: {'lr': 0.0004204896713798972, 'samples': 7804416, 'steps': 40647, 'loss/train': 1.357002854347229} 11/07/2021 02:56:19 - INFO - __main__ - Step 40649: {'lr': 0.00042048579003804205, 'samples': 7804608, 'steps': 40648, 'loss/train': 1.4147695302963257} 11/07/2021 02:56:19 - INFO - __main__ - Step 40650: {'lr': 0.00042048190861936866, 'samples': 7804800, 'steps': 40649, 'loss/train': 1.5080392360687256} 11/07/2021 02:56:20 - INFO - __main__ - Step 40651: {'lr': 0.0004204780271238786, 'samples': 7804992, 'steps': 40650, 'loss/train': 1.3220431804656982} 11/07/2021 02:56:20 - INFO - __main__ - Step 40652: {'lr': 0.00042047414555157394, 'samples': 7805184, 'steps': 40651, 'loss/train': 1.7669703960418701} 11/07/2021 02:56:21 - INFO - __main__ - Step 40653: {'lr': 0.0004204702639024562, 'samples': 7805376, 'steps': 40652, 'loss/train': 1.8579809665679932} 11/07/2021 02:56:21 - INFO - __main__ - Step 40654: {'lr': 0.00042046638217652717, 'samples': 7805568, 'steps': 40653, 'loss/train': 1.3786760568618774} 11/07/2021 02:56:21 - INFO - __main__ - Step 40655: {'lr': 0.00042046250037378865, 'samples': 7805760, 'steps': 40654, 'loss/train': 1.4152415990829468} 11/07/2021 02:56:22 - INFO - __main__ - Step 40656: {'lr': 0.0004204586184942423, 'samples': 7805952, 'steps': 40655, 'loss/train': 1.5150220394134521} 11/07/2021 02:56:23 - INFO - __main__ - Step 40657: {'lr': 0.00042045473653789004, 'samples': 7806144, 'steps': 40656, 'loss/train': 1.9052633047103882} 11/07/2021 02:56:23 - INFO - __main__ - Step 40658: {'lr': 0.00042045085450473336, 'samples': 7806336, 'steps': 40657, 'loss/train': 1.6889491081237793} 11/07/2021 02:56:23 - INFO - __main__ - Step 40659: {'lr': 0.00042044697239477423, 'samples': 7806528, 'steps': 40658, 'loss/train': 1.0255661010742188} 11/07/2021 02:56:24 - INFO - __main__ - Step 40660: {'lr': 0.00042044309020801434, 'samples': 7806720, 'steps': 40659, 'loss/train': 1.272469162940979} 11/07/2021 02:56:24 - INFO - __main__ - Step 40661: {'lr': 0.00042043920794445543, 'samples': 7806912, 'steps': 40660, 'loss/train': 1.1650625467300415} 11/07/2021 02:56:25 - INFO - __main__ - Step 40662: {'lr': 0.0004204353256040992, 'samples': 7807104, 'steps': 40661, 'loss/train': 1.1032905578613281} 11/07/2021 02:56:26 - INFO - __main__ - Step 40663: {'lr': 0.0004204314431869475, 'samples': 7807296, 'steps': 40662, 'loss/train': 1.5344452857971191} 11/07/2021 02:56:26 - INFO - __main__ - Step 40664: {'lr': 0.0004204275606930019, 'samples': 7807488, 'steps': 40663, 'loss/train': 1.6054086685180664} 11/07/2021 02:56:26 - INFO - __main__ - Step 40665: {'lr': 0.00042042367812226446, 'samples': 7807680, 'steps': 40664, 'loss/train': 1.3232537508010864} 11/07/2021 02:56:27 - INFO - __main__ - Step 40666: {'lr': 0.00042041979547473665, 'samples': 7807872, 'steps': 40665, 'loss/train': 1.46358060836792} 11/07/2021 02:56:28 - INFO - __main__ - Step 40667: {'lr': 0.0004204159127504202, 'samples': 7808064, 'steps': 40666, 'loss/train': 1.802840232849121} 11/07/2021 02:56:28 - INFO - __main__ - Step 40668: {'lr': 0.0004204120299493171, 'samples': 7808256, 'steps': 40667, 'loss/train': 0.9984766244888306} 11/07/2021 02:56:28 - INFO - __main__ - Step 40669: {'lr': 0.0004204081470714289, 'samples': 7808448, 'steps': 40668, 'loss/train': 1.5101535320281982} 11/07/2021 02:56:29 - INFO - __main__ - Step 40670: {'lr': 0.00042040426411675747, 'samples': 7808640, 'steps': 40669, 'loss/train': 1.0501954555511475} 11/07/2021 02:56:29 - INFO - __main__ - Step 40671: {'lr': 0.0004204003810853045, 'samples': 7808832, 'steps': 40670, 'loss/train': 1.3344838619232178} 11/07/2021 02:56:30 - INFO - __main__ - Step 40672: {'lr': 0.00042039649797707176, 'samples': 7809024, 'steps': 40671, 'loss/train': 1.6589035987854004} 11/07/2021 02:56:31 - INFO - __main__ - Step 40673: {'lr': 0.0004203926147920609, 'samples': 7809216, 'steps': 40672, 'loss/train': 0.6955644488334656} 11/07/2021 02:56:31 - INFO - __main__ - Step 40674: {'lr': 0.0004203887315302739, 'samples': 7809408, 'steps': 40673, 'loss/train': 1.543953776359558} 11/07/2021 02:56:31 - INFO - __main__ - Step 40675: {'lr': 0.0004203848481917122, 'samples': 7809600, 'steps': 40674, 'loss/train': 1.5235605239868164} 11/07/2021 02:56:32 - INFO - __main__ - Step 40676: {'lr': 0.00042038096477637786, 'samples': 7809792, 'steps': 40675, 'loss/train': 1.1400928497314453} 11/07/2021 02:56:33 - INFO - __main__ - Step 40677: {'lr': 0.00042037708128427243, 'samples': 7809984, 'steps': 40676, 'loss/train': 1.9184764623641968} 11/07/2021 02:56:33 - INFO - __main__ - Step 40678: {'lr': 0.00042037319771539775, 'samples': 7810176, 'steps': 40677, 'loss/train': 1.3890955448150635} 11/07/2021 02:56:33 - INFO - __main__ - Step 40679: {'lr': 0.00042036931406975547, 'samples': 7810368, 'steps': 40678, 'loss/train': 0.9071448445320129} 11/07/2021 02:56:34 - INFO - __main__ - Step 40680: {'lr': 0.0004203654303473474, 'samples': 7810560, 'steps': 40679, 'loss/train': 1.413865327835083} 11/07/2021 02:56:34 - INFO - __main__ - Step 40681: {'lr': 0.0004203615465481754, 'samples': 7810752, 'steps': 40680, 'loss/train': 1.401404857635498} 11/07/2021 02:56:35 - INFO - __main__ - Step 40682: {'lr': 0.0004203576626722411, 'samples': 7810944, 'steps': 40681, 'loss/train': 1.5267057418823242} 11/07/2021 02:56:35 - INFO - __main__ - Step 40683: {'lr': 0.00042035377871954614, 'samples': 7811136, 'steps': 40682, 'loss/train': 0.5235815048217773} 11/07/2021 02:56:36 - INFO - __main__ - Step 40684: {'lr': 0.00042034989469009245, 'samples': 7811328, 'steps': 40683, 'loss/train': 1.418968915939331} 11/07/2021 02:56:36 - INFO - __main__ - Step 40685: {'lr': 0.0004203460105838818, 'samples': 7811520, 'steps': 40684, 'loss/train': 1.3702691793441772} 11/07/2021 02:56:36 - INFO - __main__ - Step 40686: {'lr': 0.00042034212640091587, 'samples': 7811712, 'steps': 40685, 'loss/train': 1.2421624660491943} 11/07/2021 02:56:37 - INFO - __main__ - Step 40687: {'lr': 0.00042033824214119633, 'samples': 7811904, 'steps': 40686, 'loss/train': 1.2181388139724731} 11/07/2021 02:56:38 - INFO - __main__ - Step 40688: {'lr': 0.00042033435780472494, 'samples': 7812096, 'steps': 40687, 'loss/train': 1.5045456886291504} 11/07/2021 02:56:38 - INFO - __main__ - Step 40689: {'lr': 0.00042033047339150363, 'samples': 7812288, 'steps': 40688, 'loss/train': 1.4343459606170654} 11/07/2021 02:56:39 - INFO - __main__ - Step 40690: {'lr': 0.00042032658890153404, 'samples': 7812480, 'steps': 40689, 'loss/train': 1.233680009841919} 11/07/2021 02:56:39 - INFO - __main__ - Step 40691: {'lr': 0.0004203227043348179, 'samples': 7812672, 'steps': 40690, 'loss/train': 1.5752936601638794} 11/07/2021 02:56:39 - INFO - __main__ - Step 40692: {'lr': 0.000420318819691357, 'samples': 7812864, 'steps': 40691, 'loss/train': 1.6695582866668701} 11/07/2021 02:56:40 - INFO - __main__ - Step 40693: {'lr': 0.00042031493497115304, 'samples': 7813056, 'steps': 40692, 'loss/train': 1.7513031959533691} 11/07/2021 02:56:41 - INFO - __main__ - Step 40694: {'lr': 0.0004203110501742078, 'samples': 7813248, 'steps': 40693, 'loss/train': 0.9645717740058899} 11/07/2021 02:56:41 - INFO - __main__ - Step 40695: {'lr': 0.00042030716530052297, 'samples': 7813440, 'steps': 40694, 'loss/train': 1.6265499591827393} 11/07/2021 02:56:41 - INFO - __main__ - Step 40696: {'lr': 0.00042030328035010047, 'samples': 7813632, 'steps': 40695, 'loss/train': 1.2691519260406494} 11/07/2021 02:56:42 - INFO - __main__ - Step 40697: {'lr': 0.0004202993953229418, 'samples': 7813824, 'steps': 40696, 'loss/train': 1.6467305421829224} 11/07/2021 02:56:43 - INFO - __main__ - Step 40698: {'lr': 0.000420295510219049, 'samples': 7814016, 'steps': 40697, 'loss/train': 1.8103171586990356} 11/07/2021 02:56:43 - INFO - __main__ - Step 40699: {'lr': 0.00042029162503842357, 'samples': 7814208, 'steps': 40698, 'loss/train': 1.101730465888977} 11/07/2021 02:56:43 - INFO - __main__ - Step 40700: {'lr': 0.0004202877397810674, 'samples': 7814400, 'steps': 40699, 'loss/train': 1.522147536277771} 11/07/2021 02:56:44 - INFO - __main__ - Step 40701: {'lr': 0.0004202838544469822, 'samples': 7814592, 'steps': 40700, 'loss/train': 1.5229068994522095} 11/07/2021 02:56:44 - INFO - __main__ - Step 40702: {'lr': 0.00042027996903616974, 'samples': 7814784, 'steps': 40701, 'loss/train': 1.5482386350631714} 11/07/2021 02:56:45 - INFO - __main__ - Step 40703: {'lr': 0.0004202760835486317, 'samples': 7814976, 'steps': 40702, 'loss/train': 1.3894777297973633} 11/07/2021 02:56:46 - INFO - __main__ - Step 40704: {'lr': 0.00042027219798436996, 'samples': 7815168, 'steps': 40703, 'loss/train': 1.6228009462356567} 11/07/2021 02:56:46 - INFO - __main__ - Step 40705: {'lr': 0.00042026831234338614, 'samples': 7815360, 'steps': 40704, 'loss/train': 1.6805357933044434} 11/07/2021 02:56:46 - INFO - __main__ - Step 40706: {'lr': 0.0004202644266256821, 'samples': 7815552, 'steps': 40705, 'loss/train': 1.679015874862671} 11/07/2021 02:56:47 - INFO - __main__ - Step 40707: {'lr': 0.00042026054083125943, 'samples': 7815744, 'steps': 40706, 'loss/train': 1.0983397960662842} 11/07/2021 02:56:48 - INFO - __main__ - Step 40708: {'lr': 0.0004202566549601201, 'samples': 7815936, 'steps': 40707, 'loss/train': 1.7401659488677979} 11/07/2021 02:56:48 - INFO - __main__ - Step 40709: {'lr': 0.00042025276901226573, 'samples': 7816128, 'steps': 40708, 'loss/train': 1.924376368522644} 11/07/2021 02:56:49 - INFO - __main__ - Step 40710: {'lr': 0.00042024888298769806, 'samples': 7816320, 'steps': 40709, 'loss/train': 1.6605688333511353} 11/07/2021 02:56:49 - INFO - __main__ - Step 40711: {'lr': 0.0004202449968864188, 'samples': 7816512, 'steps': 40710, 'loss/train': 2.121345043182373} 11/07/2021 02:56:49 - INFO - __main__ - Step 40712: {'lr': 0.00042024111070842985, 'samples': 7816704, 'steps': 40711, 'loss/train': 2.026416778564453} 11/07/2021 02:56:51 - INFO - __main__ - Step 40713: {'lr': 0.0004202372244537329, 'samples': 7816896, 'steps': 40712, 'loss/train': 0.983768105506897} 11/07/2021 02:56:51 - INFO - __main__ - Step 40714: {'lr': 0.00042023333812232967, 'samples': 7817088, 'steps': 40713, 'loss/train': 1.6620196104049683} 11/07/2021 02:56:51 - INFO - __main__ - Step 40715: {'lr': 0.0004202294517142219, 'samples': 7817280, 'steps': 40714, 'loss/train': 1.6445621252059937} 11/07/2021 02:56:52 - INFO - __main__ - Step 40716: {'lr': 0.0004202255652294114, 'samples': 7817472, 'steps': 40715, 'loss/train': 1.2422605752944946} 11/07/2021 02:56:52 - INFO - __main__ - Step 40717: {'lr': 0.00042022167866789985, 'samples': 7817664, 'steps': 40716, 'loss/train': 1.9124672412872314} 11/07/2021 02:56:52 - INFO - __main__ - Step 40718: {'lr': 0.00042021779202968903, 'samples': 7817856, 'steps': 40717, 'loss/train': 1.974969744682312} 11/07/2021 02:56:53 - INFO - __main__ - Step 40719: {'lr': 0.0004202139053147808, 'samples': 7818048, 'steps': 40718, 'loss/train': 1.6937755346298218} 11/07/2021 02:56:54 - INFO - __main__ - Step 40720: {'lr': 0.0004202100185231767, 'samples': 7818240, 'steps': 40719, 'loss/train': 1.5049687623977661} 11/07/2021 02:56:54 - INFO - __main__ - Step 40721: {'lr': 0.00042020613165487863, 'samples': 7818432, 'steps': 40720, 'loss/train': 1.3050556182861328} 11/07/2021 02:56:54 - INFO - __main__ - Step 40722: {'lr': 0.0004202022447098883, 'samples': 7818624, 'steps': 40721, 'loss/train': 1.4542369842529297} 11/07/2021 02:56:55 - INFO - __main__ - Step 40723: {'lr': 0.00042019835768820744, 'samples': 7818816, 'steps': 40722, 'loss/train': 1.6474261283874512} 11/07/2021 02:56:56 - INFO - __main__ - Step 40724: {'lr': 0.00042019447058983786, 'samples': 7819008, 'steps': 40723, 'loss/train': 1.053047776222229} 11/07/2021 02:56:56 - INFO - __main__ - Step 40725: {'lr': 0.0004201905834147813, 'samples': 7819200, 'steps': 40724, 'loss/train': 1.6326931715011597} 11/07/2021 02:56:56 - INFO - __main__ - Step 40726: {'lr': 0.0004201866961630395, 'samples': 7819392, 'steps': 40725, 'loss/train': 1.4784409999847412} 11/07/2021 02:56:57 - INFO - __main__ - Step 40727: {'lr': 0.00042018280883461415, 'samples': 7819584, 'steps': 40726, 'loss/train': 1.242191195487976} 11/07/2021 02:56:57 - INFO - __main__ - Step 40728: {'lr': 0.000420178921429507, 'samples': 7819776, 'steps': 40727, 'loss/train': 1.5088322162628174} 11/07/2021 02:56:58 - INFO - __main__ - Step 40729: {'lr': 0.00042017503394771997, 'samples': 7819968, 'steps': 40728, 'loss/train': 1.5620919466018677} 11/07/2021 02:56:59 - INFO - __main__ - Step 40730: {'lr': 0.00042017114638925456, 'samples': 7820160, 'steps': 40729, 'loss/train': 1.665900707244873} 11/07/2021 02:56:59 - INFO - __main__ - Step 40731: {'lr': 0.00042016725875411274, 'samples': 7820352, 'steps': 40730, 'loss/train': 2.2982349395751953} 11/07/2021 02:56:59 - INFO - __main__ - Step 40732: {'lr': 0.0004201633710422962, 'samples': 7820544, 'steps': 40731, 'loss/train': 1.4807544946670532} 11/07/2021 02:57:00 - INFO - __main__ - Step 40733: {'lr': 0.0004201594832538067, 'samples': 7820736, 'steps': 40732, 'loss/train': 1.611283540725708} 11/07/2021 02:57:01 - INFO - __main__ - Step 40734: {'lr': 0.0004201555953886459, 'samples': 7820928, 'steps': 40733, 'loss/train': 1.288476586341858} 11/07/2021 02:57:01 - INFO - __main__ - Step 40735: {'lr': 0.00042015170744681566, 'samples': 7821120, 'steps': 40734, 'loss/train': 1.8149077892303467} 11/07/2021 02:57:01 - INFO - __main__ - Step 40736: {'lr': 0.00042014781942831757, 'samples': 7821312, 'steps': 40735, 'loss/train': 1.7490525245666504} 11/07/2021 02:57:02 - INFO - __main__ - Step 40737: {'lr': 0.00042014393133315366, 'samples': 7821504, 'steps': 40736, 'loss/train': 1.5821160078048706} 11/07/2021 02:57:02 - INFO - __main__ - Step 40738: {'lr': 0.00042014004316132537, 'samples': 7821696, 'steps': 40737, 'loss/train': 1.7117501497268677} 11/07/2021 02:57:03 - INFO - __main__ - Step 40739: {'lr': 0.0004201361549128347, 'samples': 7821888, 'steps': 40738, 'loss/train': 1.4417829513549805} 11/07/2021 02:57:03 - INFO - __main__ - Step 40740: {'lr': 0.00042013226658768333, 'samples': 7822080, 'steps': 40739, 'loss/train': 2.1850247383117676} 11/07/2021 02:57:04 - INFO - __main__ - Step 40741: {'lr': 0.0004201283781858729, 'samples': 7822272, 'steps': 40740, 'loss/train': 1.318172574043274} 11/07/2021 02:57:04 - INFO - __main__ - Step 40742: {'lr': 0.00042012448970740523, 'samples': 7822464, 'steps': 40741, 'loss/train': 1.9657243490219116} 11/07/2021 02:57:04 - INFO - __main__ - Step 40743: {'lr': 0.00042012060115228215, 'samples': 7822656, 'steps': 40742, 'loss/train': 1.322919487953186} 11/07/2021 02:57:05 - INFO - __main__ - Step 40744: {'lr': 0.0004201167125205054, 'samples': 7822848, 'steps': 40743, 'loss/train': 0.939601719379425} 11/07/2021 02:57:06 - INFO - __main__ - Step 40745: {'lr': 0.0004201128238120766, 'samples': 7823040, 'steps': 40744, 'loss/train': 1.352439045906067} 11/07/2021 02:57:06 - INFO - __main__ - Step 40746: {'lr': 0.00042010893502699765, 'samples': 7823232, 'steps': 40745, 'loss/train': 1.2596780061721802} 11/07/2021 02:57:07 - INFO - __main__ - Step 40747: {'lr': 0.0004201050461652702, 'samples': 7823424, 'steps': 40746, 'loss/train': 1.5559792518615723} 11/07/2021 02:57:07 - INFO - __main__ - Step 40748: {'lr': 0.00042010115722689603, 'samples': 7823616, 'steps': 40747, 'loss/train': 1.4758732318878174} 11/07/2021 02:57:08 - INFO - __main__ - Step 40749: {'lr': 0.0004200972682118769, 'samples': 7823808, 'steps': 40748, 'loss/train': 1.8409411907196045} 11/07/2021 02:57:08 - INFO - __main__ - Step 40750: {'lr': 0.0004200933791202146, 'samples': 7824000, 'steps': 40749, 'loss/train': 1.5887305736541748} 11/07/2021 02:57:09 - INFO - __main__ - Step 40751: {'lr': 0.0004200894899519108, 'samples': 7824192, 'steps': 40750, 'loss/train': 1.5835992097854614} 11/07/2021 02:57:09 - INFO - __main__ - Step 40752: {'lr': 0.00042008560070696735, 'samples': 7824384, 'steps': 40751, 'loss/train': 0.8489807844161987} 11/07/2021 02:57:09 - INFO - __main__ - Step 40753: {'lr': 0.000420081711385386, 'samples': 7824576, 'steps': 40752, 'loss/train': 1.7906991243362427} 11/07/2021 02:57:10 - INFO - __main__ - Step 40754: {'lr': 0.00042007782198716836, 'samples': 7824768, 'steps': 40753, 'loss/train': 2.3305821418762207} 11/07/2021 02:57:11 - INFO - __main__ - Step 40755: {'lr': 0.0004200739325123163, 'samples': 7824960, 'steps': 40754, 'loss/train': 1.161139965057373} 11/07/2021 02:57:11 - INFO - __main__ - Step 40756: {'lr': 0.0004200700429608315, 'samples': 7825152, 'steps': 40755, 'loss/train': 1.5971800088882446} 11/07/2021 02:57:11 - INFO - __main__ - Step 40757: {'lr': 0.00042006615333271585, 'samples': 7825344, 'steps': 40756, 'loss/train': 1.609025478363037} 11/07/2021 02:57:12 - INFO - __main__ - Step 40758: {'lr': 0.000420062263627971, 'samples': 7825536, 'steps': 40757, 'loss/train': 1.4358506202697754} 11/07/2021 02:57:12 - INFO - __main__ - Step 40759: {'lr': 0.0004200583738465987, 'samples': 7825728, 'steps': 40758, 'loss/train': 1.945847511291504} 11/07/2021 02:57:13 - INFO - __main__ - Step 40760: {'lr': 0.00042005448398860077, 'samples': 7825920, 'steps': 40759, 'loss/train': 1.662986159324646} 11/07/2021 02:57:13 - INFO - __main__ - Step 40761: {'lr': 0.00042005059405397885, 'samples': 7826112, 'steps': 40760, 'loss/train': 1.3383136987686157} 11/07/2021 02:57:14 - INFO - __main__ - Step 40762: {'lr': 0.00042004670404273474, 'samples': 7826304, 'steps': 40761, 'loss/train': 1.7181071043014526} 11/07/2021 02:57:14 - INFO - __main__ - Step 40763: {'lr': 0.0004200428139548703, 'samples': 7826496, 'steps': 40762, 'loss/train': 0.7669545412063599} 11/07/2021 02:57:14 - INFO - __main__ - Step 40764: {'lr': 0.0004200389237903871, 'samples': 7826688, 'steps': 40763, 'loss/train': 1.6772427558898926} 11/07/2021 02:57:16 - INFO - __main__ - Step 40765: {'lr': 0.000420035033549287, 'samples': 7826880, 'steps': 40764, 'loss/train': 1.4819986820220947} 11/07/2021 02:57:16 - INFO - __main__ - Step 40766: {'lr': 0.0004200311432315718, 'samples': 7827072, 'steps': 40765, 'loss/train': 1.9922943115234375} 11/07/2021 02:57:16 - INFO - __main__ - Step 40767: {'lr': 0.0004200272528372432, 'samples': 7827264, 'steps': 40766, 'loss/train': 1.4818094968795776} 11/07/2021 02:57:17 - INFO - __main__ - Step 40768: {'lr': 0.0004200233623663028, 'samples': 7827456, 'steps': 40767, 'loss/train': 1.082582712173462} 11/07/2021 02:57:17 - INFO - __main__ - Step 40769: {'lr': 0.0004200194718187527, 'samples': 7827648, 'steps': 40768, 'loss/train': 1.1773542165756226} 11/07/2021 02:57:18 - INFO - __main__ - Step 40770: {'lr': 0.0004200155811945943, 'samples': 7827840, 'steps': 40769, 'loss/train': 1.2706178426742554} 11/07/2021 02:57:18 - INFO - __main__ - Step 40771: {'lr': 0.0004200116904938295, 'samples': 7828032, 'steps': 40770, 'loss/train': 1.697472095489502} 11/07/2021 02:57:19 - INFO - __main__ - Step 40772: {'lr': 0.00042000779971646007, 'samples': 7828224, 'steps': 40771, 'loss/train': 1.7203675508499146} 11/07/2021 02:57:19 - INFO - __main__ - Step 40773: {'lr': 0.00042000390886248783, 'samples': 7828416, 'steps': 40772, 'loss/train': 5.857418537139893} 11/07/2021 02:57:19 - INFO - __main__ - Step 40774: {'lr': 0.0004200000179319144, 'samples': 7828608, 'steps': 40773, 'loss/train': 1.426015019416809} 11/07/2021 02:57:20 - INFO - __main__ - Step 40775: {'lr': 0.0004199961269247416, 'samples': 7828800, 'steps': 40774, 'loss/train': 1.3574186563491821} 11/07/2021 02:57:21 - INFO - __main__ - Step 40776: {'lr': 0.0004199922358409711, 'samples': 7828992, 'steps': 40775, 'loss/train': 1.6170494556427002} 11/07/2021 02:57:21 - INFO - __main__ - Step 40777: {'lr': 0.0004199883446806048, 'samples': 7829184, 'steps': 40776, 'loss/train': 1.7628307342529297} 11/07/2021 02:57:21 - INFO - __main__ - Step 40778: {'lr': 0.0004199844534436443, 'samples': 7829376, 'steps': 40777, 'loss/train': 2.341153621673584} 11/07/2021 02:57:22 - INFO - __main__ - Step 40779: {'lr': 0.0004199805621300915, 'samples': 7829568, 'steps': 40778, 'loss/train': 1.5946757793426514} 11/07/2021 02:57:23 - INFO - __main__ - Step 40780: {'lr': 0.0004199766707399481, 'samples': 7829760, 'steps': 40779, 'loss/train': 1.1053686141967773} 11/07/2021 02:57:23 - INFO - __main__ - Step 40781: {'lr': 0.0004199727792732158, 'samples': 7829952, 'steps': 40780, 'loss/train': 0.9021766185760498} 11/07/2021 02:57:23 - INFO - __main__ - Step 40782: {'lr': 0.0004199688877298964, 'samples': 7830144, 'steps': 40781, 'loss/train': 1.256246566772461} 11/07/2021 02:57:24 - INFO - __main__ - Step 40783: {'lr': 0.00041996499610999163, 'samples': 7830336, 'steps': 40782, 'loss/train': 1.5676124095916748} 11/07/2021 02:57:24 - INFO - __main__ - Step 40784: {'lr': 0.00041996110441350323, 'samples': 7830528, 'steps': 40783, 'loss/train': 1.7661354541778564} 11/07/2021 02:57:25 - INFO - __main__ - Step 40785: {'lr': 0.000419957212640433, 'samples': 7830720, 'steps': 40784, 'loss/train': 1.8322479724884033} 11/07/2021 02:57:25 - INFO - __main__ - Step 40786: {'lr': 0.0004199533207907827, 'samples': 7830912, 'steps': 40785, 'loss/train': 1.3264210224151611} 11/07/2021 02:57:26 - INFO - __main__ - Step 40787: {'lr': 0.00041994942886455403, 'samples': 7831104, 'steps': 40786, 'loss/train': 1.4569522142410278} 11/07/2021 02:57:26 - INFO - __main__ - Step 40788: {'lr': 0.00041994553686174876, 'samples': 7831296, 'steps': 40787, 'loss/train': 1.7506399154663086} 11/07/2021 02:57:27 - INFO - __main__ - Step 40789: {'lr': 0.0004199416447823686, 'samples': 7831488, 'steps': 40788, 'loss/train': 1.4973564147949219} 11/07/2021 02:57:28 - INFO - __main__ - Step 40790: {'lr': 0.0004199377526264154, 'samples': 7831680, 'steps': 40789, 'loss/train': 1.3136669397354126} 11/07/2021 02:57:28 - INFO - __main__ - Step 40791: {'lr': 0.00041993386039389095, 'samples': 7831872, 'steps': 40790, 'loss/train': 2.1212332248687744} 11/07/2021 02:57:28 - INFO - __main__ - Step 40792: {'lr': 0.0004199299680847969, 'samples': 7832064, 'steps': 40791, 'loss/train': 0.9397590756416321} 11/07/2021 02:57:29 - INFO - __main__ - Step 40793: {'lr': 0.000419926075699135, 'samples': 7832256, 'steps': 40792, 'loss/train': 1.2800946235656738} 11/07/2021 02:57:29 - INFO - __main__ - Step 40794: {'lr': 0.000419922183236907, 'samples': 7832448, 'steps': 40793, 'loss/train': 1.0686975717544556} 11/07/2021 02:57:30 - INFO - __main__ - Step 40795: {'lr': 0.0004199182906981147, 'samples': 7832640, 'steps': 40794, 'loss/train': 1.7454752922058105} 11/07/2021 02:57:31 - INFO - __main__ - Step 40796: {'lr': 0.00041991439808275986, 'samples': 7832832, 'steps': 40795, 'loss/train': 0.7209337949752808} 11/07/2021 02:57:31 - INFO - __main__ - Step 40797: {'lr': 0.0004199105053908442, 'samples': 7833024, 'steps': 40796, 'loss/train': 1.42599618434906} 11/07/2021 02:57:31 - INFO - __main__ - Step 40798: {'lr': 0.0004199066126223695, 'samples': 7833216, 'steps': 40797, 'loss/train': 1.8595082759857178} 11/07/2021 02:57:32 - INFO - __main__ - Step 40799: {'lr': 0.0004199027197773375, 'samples': 7833408, 'steps': 40798, 'loss/train': 2.1304287910461426} 11/07/2021 02:57:32 - INFO - __main__ - Step 40800: {'lr': 0.00041989882685575, 'samples': 7833600, 'steps': 40799, 'loss/train': 1.2476295232772827} 11/07/2021 02:57:33 - INFO - __main__ - Step 40801: {'lr': 0.0004198949338576086, 'samples': 7833792, 'steps': 40800, 'loss/train': 1.1822339296340942} 11/07/2021 02:57:34 - INFO - __main__ - Step 40802: {'lr': 0.0004198910407829152, 'samples': 7833984, 'steps': 40801, 'loss/train': 1.404305100440979} 11/07/2021 02:57:34 - INFO - __main__ - Step 40803: {'lr': 0.00041988714763167156, 'samples': 7834176, 'steps': 40802, 'loss/train': 1.536059021949768} 11/07/2021 02:57:34 - INFO - __main__ - Step 40804: {'lr': 0.00041988325440387944, 'samples': 7834368, 'steps': 40803, 'loss/train': 1.5439774990081787} 11/07/2021 02:57:35 - INFO - __main__ - Step 40805: {'lr': 0.00041987936109954047, 'samples': 7834560, 'steps': 40804, 'loss/train': 1.814212441444397} 11/07/2021 02:57:36 - INFO - __main__ - Step 40806: {'lr': 0.0004198754677186565, 'samples': 7834752, 'steps': 40805, 'loss/train': 1.5161305665969849} 11/07/2021 02:57:36 - INFO - __main__ - Step 40807: {'lr': 0.0004198715742612292, 'samples': 7834944, 'steps': 40806, 'loss/train': 1.632473111152649} 11/07/2021 02:57:36 - INFO - __main__ - Step 40808: {'lr': 0.0004198676807272605, 'samples': 7835136, 'steps': 40807, 'loss/train': 1.0738561153411865} 11/07/2021 02:57:37 - INFO - __main__ - Step 40809: {'lr': 0.000419863787116752, 'samples': 7835328, 'steps': 40808, 'loss/train': 1.2711148262023926} 11/07/2021 02:57:37 - INFO - __main__ - Step 40810: {'lr': 0.0004198598934297055, 'samples': 7835520, 'steps': 40809, 'loss/train': 1.3566529750823975} 11/07/2021 02:57:37 - INFO - __main__ - Step 40811: {'lr': 0.00041985599966612273, 'samples': 7835712, 'steps': 40810, 'loss/train': 1.2638170719146729} 11/07/2021 02:57:39 - INFO - __main__ - Step 40812: {'lr': 0.0004198521058260055, 'samples': 7835904, 'steps': 40811, 'loss/train': 0.5754212141036987} 11/07/2021 02:57:39 - INFO - __main__ - Step 40813: {'lr': 0.0004198482119093555, 'samples': 7836096, 'steps': 40812, 'loss/train': 1.6761367321014404} 11/07/2021 02:57:40 - INFO - __main__ - Step 40814: {'lr': 0.00041984431791617456, 'samples': 7836288, 'steps': 40813, 'loss/train': 2.0934343338012695} 11/07/2021 02:57:40 - INFO - __main__ - Step 40815: {'lr': 0.0004198404238464644, 'samples': 7836480, 'steps': 40814, 'loss/train': 2.344271183013916} 11/07/2021 02:57:40 - INFO - __main__ - Step 40816: {'lr': 0.0004198365297002267, 'samples': 7836672, 'steps': 40815, 'loss/train': 1.3367879390716553} 11/07/2021 02:57:41 - INFO - __main__ - Step 40817: {'lr': 0.0004198326354774633, 'samples': 7836864, 'steps': 40816, 'loss/train': 2.2786648273468018} 11/07/2021 02:57:42 - INFO - __main__ - Step 40818: {'lr': 0.00041982874117817593, 'samples': 7837056, 'steps': 40817, 'loss/train': 1.5131193399429321} 11/07/2021 02:57:42 - INFO - __main__ - Step 40819: {'lr': 0.00041982484680236636, 'samples': 7837248, 'steps': 40818, 'loss/train': 1.1675461530685425} 11/07/2021 02:57:42 - INFO - __main__ - Step 40820: {'lr': 0.00041982095235003634, 'samples': 7837440, 'steps': 40819, 'loss/train': 1.5366125106811523} 11/07/2021 02:57:43 - INFO - __main__ - Step 40821: {'lr': 0.0004198170578211877, 'samples': 7837632, 'steps': 40820, 'loss/train': 1.6404383182525635} 11/07/2021 02:57:43 - INFO - __main__ - Step 40822: {'lr': 0.000419813163215822, 'samples': 7837824, 'steps': 40821, 'loss/train': 1.3809610605239868} 11/07/2021 02:57:44 - INFO - __main__ - Step 40823: {'lr': 0.0004198092685339411, 'samples': 7838016, 'steps': 40822, 'loss/train': 1.6485894918441772} 11/07/2021 02:57:45 - INFO - __main__ - Step 40824: {'lr': 0.00041980537377554685, 'samples': 7838208, 'steps': 40823, 'loss/train': 1.9748244285583496} 11/07/2021 02:57:45 - INFO - __main__ - Step 40825: {'lr': 0.00041980147894064086, 'samples': 7838400, 'steps': 40824, 'loss/train': 1.0561368465423584} 11/07/2021 02:57:45 - INFO - __main__ - Step 40826: {'lr': 0.00041979758402922496, 'samples': 7838592, 'steps': 40825, 'loss/train': 1.2652628421783447} 11/07/2021 02:57:46 - INFO - __main__ - Step 40827: {'lr': 0.00041979368904130086, 'samples': 7838784, 'steps': 40826, 'loss/train': 1.6909739971160889} 11/07/2021 02:57:47 - INFO - __main__ - Step 40828: {'lr': 0.00041978979397687047, 'samples': 7838976, 'steps': 40827, 'loss/train': 1.191987156867981} 11/07/2021 02:57:47 - INFO - __main__ - Step 40829: {'lr': 0.00041978589883593525, 'samples': 7839168, 'steps': 40828, 'loss/train': 1.1689318418502808} 11/07/2021 02:57:47 - INFO - __main__ - Step 40830: {'lr': 0.0004197820036184972, 'samples': 7839360, 'steps': 40829, 'loss/train': 1.3950546979904175} 11/07/2021 02:57:48 - INFO - __main__ - Step 40831: {'lr': 0.000419778108324558, 'samples': 7839552, 'steps': 40830, 'loss/train': 1.4193148612976074} 11/07/2021 02:57:48 - INFO - __main__ - Step 40832: {'lr': 0.00041977421295411944, 'samples': 7839744, 'steps': 40831, 'loss/train': 1.3167130947113037} 11/07/2021 02:57:49 - INFO - __main__ - Step 40833: {'lr': 0.00041977031750718317, 'samples': 7839936, 'steps': 40832, 'loss/train': 1.5753673315048218} 11/07/2021 02:57:50 - INFO - __main__ - Step 40834: {'lr': 0.000419766421983751, 'samples': 7840128, 'steps': 40833, 'loss/train': 1.5279433727264404} 11/07/2021 02:57:50 - INFO - __main__ - Step 40835: {'lr': 0.00041976252638382483, 'samples': 7840320, 'steps': 40834, 'loss/train': 0.5898969173431396} 11/07/2021 02:57:50 - INFO - __main__ - Step 40836: {'lr': 0.00041975863070740617, 'samples': 7840512, 'steps': 40835, 'loss/train': 1.344542145729065} 11/07/2021 02:57:51 - INFO - __main__ - Step 40837: {'lr': 0.0004197547349544969, 'samples': 7840704, 'steps': 40836, 'loss/train': 0.8119551539421082} 11/07/2021 02:57:52 - INFO - __main__ - Step 40838: {'lr': 0.0004197508391250988, 'samples': 7840896, 'steps': 40837, 'loss/train': 1.9184430837631226} 11/07/2021 02:57:52 - INFO - __main__ - Step 40839: {'lr': 0.0004197469432192136, 'samples': 7841088, 'steps': 40838, 'loss/train': 1.3354907035827637} 11/07/2021 02:57:53 - INFO - __main__ - Step 40840: {'lr': 0.000419743047236843, 'samples': 7841280, 'steps': 40839, 'loss/train': 1.261608600616455} 11/07/2021 02:57:53 - INFO - __main__ - Step 40841: {'lr': 0.00041973915117798883, 'samples': 7841472, 'steps': 40840, 'loss/train': 1.2910435199737549} 11/07/2021 02:57:53 - INFO - __main__ - Step 40842: {'lr': 0.0004197352550426528, 'samples': 7841664, 'steps': 40841, 'loss/train': 1.584807276725769} 11/07/2021 02:57:54 - INFO - __main__ - Step 40843: {'lr': 0.0004197313588308367, 'samples': 7841856, 'steps': 40842, 'loss/train': 1.3579699993133545} 11/07/2021 02:57:55 - INFO - __main__ - Step 40844: {'lr': 0.0004197274625425423, 'samples': 7842048, 'steps': 40843, 'loss/train': 1.606826663017273} 11/07/2021 02:57:55 - INFO - __main__ - Step 40845: {'lr': 0.0004197235661777713, 'samples': 7842240, 'steps': 40844, 'loss/train': 1.7265325784683228} 11/07/2021 02:57:55 - INFO - __main__ - Step 40846: {'lr': 0.00041971966973652545, 'samples': 7842432, 'steps': 40845, 'loss/train': 1.2680137157440186} 11/07/2021 02:57:56 - INFO - __main__ - Step 40847: {'lr': 0.00041971577321880656, 'samples': 7842624, 'steps': 40846, 'loss/train': 1.2878777980804443} 11/07/2021 02:57:56 - INFO - __main__ - Step 40848: {'lr': 0.00041971187662461634, 'samples': 7842816, 'steps': 40847, 'loss/train': 1.6605842113494873} 11/07/2021 02:57:57 - INFO - __main__ - Step 40849: {'lr': 0.0004197079799539566, 'samples': 7843008, 'steps': 40848, 'loss/train': 1.651978611946106} 11/07/2021 02:57:57 - INFO - __main__ - Step 40850: {'lr': 0.0004197040832068291, 'samples': 7843200, 'steps': 40849, 'loss/train': 1.6321369409561157} 11/07/2021 02:57:58 - INFO - __main__ - Step 40851: {'lr': 0.00041970018638323546, 'samples': 7843392, 'steps': 40850, 'loss/train': 1.5288792848587036} 11/07/2021 02:57:58 - INFO - __main__ - Step 40852: {'lr': 0.00041969628948317756, 'samples': 7843584, 'steps': 40851, 'loss/train': 1.5668535232543945} 11/07/2021 02:57:58 - INFO - __main__ - Step 40853: {'lr': 0.00041969239250665716, 'samples': 7843776, 'steps': 40852, 'loss/train': 1.402988076210022} 11/07/2021 02:57:59 - INFO - __main__ - Step 40854: {'lr': 0.000419688495453676, 'samples': 7843968, 'steps': 40853, 'loss/train': 1.280922293663025} 11/07/2021 02:58:00 - INFO - __main__ - Step 40855: {'lr': 0.0004196845983242358, 'samples': 7844160, 'steps': 40854, 'loss/train': 1.6631810665130615} 11/07/2021 02:58:00 - INFO - __main__ - Step 40856: {'lr': 0.0004196807011183383, 'samples': 7844352, 'steps': 40855, 'loss/train': 1.41666841506958} 11/07/2021 02:58:00 - INFO - __main__ - Step 40857: {'lr': 0.00041967680383598536, 'samples': 7844544, 'steps': 40856, 'loss/train': 1.7356514930725098} 11/07/2021 02:58:01 - INFO - __main__ - Step 40858: {'lr': 0.00041967290647717864, 'samples': 7844736, 'steps': 40857, 'loss/train': 1.4704985618591309} 11/07/2021 02:58:02 - INFO - __main__ - Step 40859: {'lr': 0.00041966900904191995, 'samples': 7844928, 'steps': 40858, 'loss/train': 1.802902102470398} 11/07/2021 02:58:02 - INFO - __main__ - Step 40860: {'lr': 0.000419665111530211, 'samples': 7845120, 'steps': 40859, 'loss/train': 1.9410536289215088} 11/07/2021 02:58:03 - INFO - __main__ - Step 40861: {'lr': 0.00041966121394205357, 'samples': 7845312, 'steps': 40860, 'loss/train': 1.276798963546753} 11/07/2021 02:58:03 - INFO - __main__ - Step 40862: {'lr': 0.0004196573162774494, 'samples': 7845504, 'steps': 40861, 'loss/train': 0.9696474671363831} 11/07/2021 02:58:03 - INFO - __main__ - Step 40863: {'lr': 0.0004196534185364003, 'samples': 7845696, 'steps': 40862, 'loss/train': 0.7720293998718262} 11/07/2021 02:58:04 - INFO - __main__ - Step 40864: {'lr': 0.00041964952071890795, 'samples': 7845888, 'steps': 40863, 'loss/train': 1.7502236366271973} 11/07/2021 02:58:05 - INFO - __main__ - Step 40865: {'lr': 0.00041964562282497417, 'samples': 7846080, 'steps': 40864, 'loss/train': 1.2578016519546509} 11/07/2021 02:58:05 - INFO - __main__ - Step 40866: {'lr': 0.0004196417248546006, 'samples': 7846272, 'steps': 40865, 'loss/train': 1.572597622871399} 11/07/2021 02:58:05 - INFO - __main__ - Step 40867: {'lr': 0.0004196378268077893, 'samples': 7846464, 'steps': 40866, 'loss/train': 1.2670254707336426} 11/07/2021 02:58:06 - INFO - __main__ - Step 40868: {'lr': 0.00041963392868454163, 'samples': 7846656, 'steps': 40867, 'loss/train': 1.4458591938018799} 11/07/2021 02:58:06 - INFO - __main__ - Step 40869: {'lr': 0.0004196300304848596, 'samples': 7846848, 'steps': 40868, 'loss/train': 1.4543291330337524} 11/07/2021 02:58:07 - INFO - __main__ - Step 40870: {'lr': 0.00041962613220874486, 'samples': 7847040, 'steps': 40869, 'loss/train': 1.6573225259780884} 11/07/2021 02:58:07 - INFO - __main__ - Step 40871: {'lr': 0.0004196222338561992, 'samples': 7847232, 'steps': 40870, 'loss/train': 1.441749930381775} 11/07/2021 02:58:08 - INFO - __main__ - Step 40872: {'lr': 0.0004196183354272244, 'samples': 7847424, 'steps': 40871, 'loss/train': 1.2333451509475708} 11/07/2021 02:58:08 - INFO - __main__ - Step 40873: {'lr': 0.00041961443692182214, 'samples': 7847616, 'steps': 40872, 'loss/train': 1.369896411895752} 11/07/2021 02:58:08 - INFO - __main__ - Step 40874: {'lr': 0.00041961053833999433, 'samples': 7847808, 'steps': 40873, 'loss/train': 1.374863624572754} 11/07/2021 02:58:10 - INFO - __main__ - Step 40875: {'lr': 0.00041960663968174263, 'samples': 7848000, 'steps': 40874, 'loss/train': 1.2193489074707031} 11/07/2021 02:58:10 - INFO - __main__ - Step 40876: {'lr': 0.0004196027409470687, 'samples': 7848192, 'steps': 40875, 'loss/train': 1.4365462064743042} 11/07/2021 02:58:10 - INFO - __main__ - Step 40877: {'lr': 0.00041959884213597443, 'samples': 7848384, 'steps': 40876, 'loss/train': 1.5156211853027344} 11/07/2021 02:58:11 - INFO - __main__ - Step 40878: {'lr': 0.0004195949432484615, 'samples': 7848576, 'steps': 40877, 'loss/train': 1.7335604429244995} 11/07/2021 02:58:11 - INFO - __main__ - Step 40879: {'lr': 0.00041959104428453175, 'samples': 7848768, 'steps': 40878, 'loss/train': 1.2749711275100708} 11/07/2021 02:58:12 - INFO - __main__ - Step 40880: {'lr': 0.000419587145244187, 'samples': 7848960, 'steps': 40879, 'loss/train': 0.626573920249939} 11/07/2021 02:58:12 - INFO - __main__ - Step 40881: {'lr': 0.0004195832461274288, 'samples': 7849152, 'steps': 40880, 'loss/train': 0.25203990936279297} 11/07/2021 02:58:13 - INFO - __main__ - Step 40882: {'lr': 0.00041957934693425894, 'samples': 7849344, 'steps': 40881, 'loss/train': 1.595252513885498} 11/07/2021 02:58:13 - INFO - __main__ - Step 40883: {'lr': 0.0004195754476646793, 'samples': 7849536, 'steps': 40882, 'loss/train': 1.359736442565918} 11/07/2021 02:58:13 - INFO - __main__ - Step 40884: {'lr': 0.0004195715483186916, 'samples': 7849728, 'steps': 40883, 'loss/train': 1.6086974143981934} 11/07/2021 02:58:15 - INFO - __main__ - Step 40885: {'lr': 0.00041956764889629756, 'samples': 7849920, 'steps': 40884, 'loss/train': 1.9066940546035767} 11/07/2021 02:58:15 - INFO - __main__ - Step 40886: {'lr': 0.000419563749397499, 'samples': 7850112, 'steps': 40885, 'loss/train': 1.6675184965133667} 11/07/2021 02:58:15 - INFO - __main__ - Step 40887: {'lr': 0.00041955984982229756, 'samples': 7850304, 'steps': 40886, 'loss/train': 1.6105003356933594} 11/07/2021 02:58:16 - INFO - __main__ - Step 40888: {'lr': 0.0004195559501706951, 'samples': 7850496, 'steps': 40887, 'loss/train': 1.023625373840332} 11/07/2021 02:58:16 - INFO - __main__ - Step 40889: {'lr': 0.0004195520504426933, 'samples': 7850688, 'steps': 40888, 'loss/train': 1.644714593887329} 11/07/2021 02:58:17 - INFO - __main__ - Step 40890: {'lr': 0.000419548150638294, 'samples': 7850880, 'steps': 40889, 'loss/train': 1.4619386196136475} 11/07/2021 02:58:17 - INFO - __main__ - Step 40891: {'lr': 0.0004195442507574989, 'samples': 7851072, 'steps': 40890, 'loss/train': 1.5527533292770386} 11/07/2021 02:58:18 - INFO - __main__ - Step 40892: {'lr': 0.00041954035080030985, 'samples': 7851264, 'steps': 40891, 'loss/train': 1.0924385786056519} 11/07/2021 02:58:18 - INFO - __main__ - Step 40893: {'lr': 0.0004195364507667284, 'samples': 7851456, 'steps': 40892, 'loss/train': 1.4693257808685303} 11/07/2021 02:58:18 - INFO - __main__ - Step 40894: {'lr': 0.0004195325506567566, 'samples': 7851648, 'steps': 40893, 'loss/train': 1.937795877456665} 11/07/2021 02:58:19 - INFO - __main__ - Step 40895: {'lr': 0.00041952865047039604, 'samples': 7851840, 'steps': 40894, 'loss/train': 1.859720230102539} 11/07/2021 02:58:20 - INFO - __main__ - Step 40896: {'lr': 0.00041952475020764834, 'samples': 7852032, 'steps': 40895, 'loss/train': 1.1508278846740723} 11/07/2021 02:58:20 - INFO - __main__ - Step 40897: {'lr': 0.00041952084986851546, 'samples': 7852224, 'steps': 40896, 'loss/train': 1.3395731449127197} 11/07/2021 02:58:21 - INFO - __main__ - Step 40898: {'lr': 0.0004195169494529991, 'samples': 7852416, 'steps': 40897, 'loss/train': 5.83343505859375} 11/07/2021 02:58:21 - INFO - __main__ - Step 40899: {'lr': 0.0004195130489611011, 'samples': 7852608, 'steps': 40898, 'loss/train': 0.18828418850898743} 11/07/2021 02:58:21 - INFO - __main__ - Step 40900: {'lr': 0.0004195091483928231, 'samples': 7852800, 'steps': 40899, 'loss/train': 1.6984916925430298} 11/07/2021 02:58:22 - INFO - __main__ - Step 40901: {'lr': 0.0004195052477481669, 'samples': 7852992, 'steps': 40900, 'loss/train': 1.210571527481079} 11/07/2021 02:58:23 - INFO - __main__ - Step 40902: {'lr': 0.00041950134702713415, 'samples': 7853184, 'steps': 40901, 'loss/train': 1.6256788969039917} 11/07/2021 02:58:23 - INFO - __main__ - Step 40903: {'lr': 0.0004194974462297268, 'samples': 7853376, 'steps': 40902, 'loss/train': 0.6730626821517944} 11/07/2021 02:58:23 - INFO - __main__ - Step 40904: {'lr': 0.00041949354535594655, 'samples': 7853568, 'steps': 40903, 'loss/train': 1.4739363193511963} 11/07/2021 02:58:24 - INFO - __main__ - Step 40905: {'lr': 0.000419489644405795, 'samples': 7853760, 'steps': 40904, 'loss/train': 1.771188497543335} 11/07/2021 02:58:24 - INFO - __main__ - Step 40906: {'lr': 0.00041948574337927414, 'samples': 7853952, 'steps': 40905, 'loss/train': 1.63949453830719} 11/07/2021 02:58:25 - INFO - __main__ - Step 40907: {'lr': 0.0004194818422763856, 'samples': 7854144, 'steps': 40906, 'loss/train': 1.6062594652175903} 11/07/2021 02:58:26 - INFO - __main__ - Step 40908: {'lr': 0.00041947794109713113, 'samples': 7854336, 'steps': 40907, 'loss/train': 1.2813870906829834} 11/07/2021 02:58:26 - INFO - __main__ - Step 40909: {'lr': 0.0004194740398415125, 'samples': 7854528, 'steps': 40908, 'loss/train': 1.4221503734588623} 11/07/2021 02:58:26 - INFO - __main__ - Step 40910: {'lr': 0.00041947013850953156, 'samples': 7854720, 'steps': 40909, 'loss/train': 2.0788793563842773} 11/07/2021 02:58:27 - INFO - __main__ - Step 40911: {'lr': 0.00041946623710118993, 'samples': 7854912, 'steps': 40910, 'loss/train': 1.5028927326202393} 11/07/2021 02:58:28 - INFO - __main__ - Step 40912: {'lr': 0.0004194623356164894, 'samples': 7855104, 'steps': 40911, 'loss/train': 1.2865480184555054} 11/07/2021 02:58:28 - INFO - __main__ - Step 40913: {'lr': 0.0004194584340554318, 'samples': 7855296, 'steps': 40912, 'loss/train': 1.4281774759292603} 11/07/2021 02:58:28 - INFO - __main__ - Step 40914: {'lr': 0.0004194545324180188, 'samples': 7855488, 'steps': 40913, 'loss/train': 1.617722511291504} 11/07/2021 02:58:29 - INFO - __main__ - Step 40915: {'lr': 0.00041945063070425226, 'samples': 7855680, 'steps': 40914, 'loss/train': 1.6741904020309448} 11/07/2021 02:58:29 - INFO - __main__ - Step 40916: {'lr': 0.0004194467289141339, 'samples': 7855872, 'steps': 40915, 'loss/train': 0.865330696105957} 11/07/2021 02:58:30 - INFO - __main__ - Step 40917: {'lr': 0.00041944282704766534, 'samples': 7856064, 'steps': 40916, 'loss/train': 1.5894625186920166} 11/07/2021 02:58:31 - INFO - __main__ - Step 40918: {'lr': 0.0004194389251048486, 'samples': 7856256, 'steps': 40917, 'loss/train': 1.3960907459259033} 11/07/2021 02:58:31 - INFO - __main__ - Step 40919: {'lr': 0.00041943502308568523, 'samples': 7856448, 'steps': 40918, 'loss/train': 1.5175076723098755} 11/07/2021 02:58:31 - INFO - __main__ - Step 40920: {'lr': 0.000419431120990177, 'samples': 7856640, 'steps': 40919, 'loss/train': 1.536737084388733} 11/07/2021 02:58:32 - INFO - __main__ - Step 40921: {'lr': 0.0004194272188183258, 'samples': 7856832, 'steps': 40920, 'loss/train': 1.4177848100662231} 11/07/2021 02:58:33 - INFO - __main__ - Step 40922: {'lr': 0.0004194233165701333, 'samples': 7857024, 'steps': 40921, 'loss/train': 1.6465486288070679} 11/07/2021 02:58:33 - INFO - __main__ - Step 40923: {'lr': 0.0004194194142456013, 'samples': 7857216, 'steps': 40922, 'loss/train': 1.512831211090088} 11/07/2021 02:58:33 - INFO - __main__ - Step 40924: {'lr': 0.00041941551184473144, 'samples': 7857408, 'steps': 40923, 'loss/train': 1.766401767730713} 11/07/2021 02:58:34 - INFO - __main__ - Step 40925: {'lr': 0.0004194116093675256, 'samples': 7857600, 'steps': 40924, 'loss/train': 1.5521842241287231} 11/07/2021 02:58:34 - INFO - __main__ - Step 40926: {'lr': 0.0004194077068139855, 'samples': 7857792, 'steps': 40925, 'loss/train': 1.638545036315918} 11/07/2021 02:58:35 - INFO - __main__ - Step 40927: {'lr': 0.00041940380418411296, 'samples': 7857984, 'steps': 40926, 'loss/train': 1.4787472486495972} 11/07/2021 02:58:35 - INFO - __main__ - Step 40928: {'lr': 0.00041939990147790956, 'samples': 7858176, 'steps': 40927, 'loss/train': 1.639092206954956} 11/07/2021 02:58:36 - INFO - __main__ - Step 40929: {'lr': 0.00041939599869537724, 'samples': 7858368, 'steps': 40928, 'loss/train': 1.7011237144470215} 11/07/2021 02:58:36 - INFO - __main__ - Step 40930: {'lr': 0.00041939209583651774, 'samples': 7858560, 'steps': 40929, 'loss/train': 1.7363842725753784} 11/07/2021 02:58:36 - INFO - __main__ - Step 40931: {'lr': 0.0004193881929013327, 'samples': 7858752, 'steps': 40930, 'loss/train': 1.7511610984802246} 11/07/2021 02:58:37 - INFO - __main__ - Step 40932: {'lr': 0.00041938428988982403, 'samples': 7858944, 'steps': 40931, 'loss/train': 0.9895086884498596} 11/07/2021 02:58:38 - INFO - __main__ - Step 40933: {'lr': 0.00041938038680199333, 'samples': 7859136, 'steps': 40932, 'loss/train': 1.296011209487915} 11/07/2021 02:58:38 - INFO - __main__ - Step 40934: {'lr': 0.0004193764836378425, 'samples': 7859328, 'steps': 40933, 'loss/train': 1.413246989250183} 11/07/2021 02:58:39 - INFO - __main__ - Step 40935: {'lr': 0.0004193725803973732, 'samples': 7859520, 'steps': 40934, 'loss/train': 1.4412325620651245} 11/07/2021 02:58:39 - INFO - __main__ - Step 40936: {'lr': 0.0004193686770805873, 'samples': 7859712, 'steps': 40935, 'loss/train': 1.5362998247146606} 11/07/2021 02:58:40 - INFO - __main__ - Step 40937: {'lr': 0.00041936477368748645, 'samples': 7859904, 'steps': 40936, 'loss/train': 1.3954333066940308} 11/07/2021 02:58:40 - INFO - __main__ - Step 40938: {'lr': 0.00041936087021807243, 'samples': 7860096, 'steps': 40937, 'loss/train': 1.5923043489456177} 11/07/2021 02:58:41 - INFO - __main__ - Step 40939: {'lr': 0.000419356966672347, 'samples': 7860288, 'steps': 40938, 'loss/train': 1.7164502143859863} 11/07/2021 02:58:41 - INFO - __main__ - Step 40940: {'lr': 0.00041935306305031195, 'samples': 7860480, 'steps': 40939, 'loss/train': 0.6994503140449524} 11/07/2021 02:58:41 - INFO - __main__ - Step 40941: {'lr': 0.000419349159351969, 'samples': 7860672, 'steps': 40940, 'loss/train': 1.786657691001892} 11/07/2021 02:58:42 - INFO - __main__ - Step 40942: {'lr': 0.00041934525557732005, 'samples': 7860864, 'steps': 40941, 'loss/train': 1.44952392578125} 11/07/2021 02:58:43 - INFO - __main__ - Step 40943: {'lr': 0.00041934135172636667, 'samples': 7861056, 'steps': 40942, 'loss/train': 2.088534116744995} 11/07/2021 02:58:43 - INFO - __main__ - Step 40944: {'lr': 0.00041933744779911066, 'samples': 7861248, 'steps': 40943, 'loss/train': 1.5122030973434448} 11/07/2021 02:58:43 - INFO - __main__ - Step 40945: {'lr': 0.00041933354379555376, 'samples': 7861440, 'steps': 40944, 'loss/train': 1.540553331375122} 11/07/2021 02:58:44 - INFO - __main__ - Step 40946: {'lr': 0.00041932963971569786, 'samples': 7861632, 'steps': 40945, 'loss/train': 1.1800307035446167} 11/07/2021 02:58:45 - INFO - __main__ - Step 40947: {'lr': 0.0004193257355595446, 'samples': 7861824, 'steps': 40946, 'loss/train': 1.4031745195388794} 11/07/2021 02:58:45 - INFO - __main__ - Step 40948: {'lr': 0.00041932183132709587, 'samples': 7862016, 'steps': 40947, 'loss/train': 1.0483026504516602} 11/07/2021 02:58:46 - INFO - __main__ - Step 40949: {'lr': 0.00041931792701835325, 'samples': 7862208, 'steps': 40948, 'loss/train': 1.5203033685684204} 11/07/2021 02:58:46 - INFO - __main__ - Step 40950: {'lr': 0.00041931402263331856, 'samples': 7862400, 'steps': 40949, 'loss/train': 1.102845549583435} 11/07/2021 02:58:46 - INFO - __main__ - Step 40951: {'lr': 0.0004193101181719936, 'samples': 7862592, 'steps': 40950, 'loss/train': 1.4966281652450562} 11/07/2021 02:58:47 - INFO - __main__ - Step 40952: {'lr': 0.00041930621363438014, 'samples': 7862784, 'steps': 40951, 'loss/train': 1.6106051206588745} 11/07/2021 02:58:47 - INFO - __main__ - Step 40953: {'lr': 0.0004193023090204799, 'samples': 7862976, 'steps': 40952, 'loss/train': 1.6493892669677734} 11/07/2021 02:58:48 - INFO - __main__ - Step 40954: {'lr': 0.0004192984043302947, 'samples': 7863168, 'steps': 40953, 'loss/train': 1.6793699264526367} 11/07/2021 02:58:48 - INFO - __main__ - Step 40955: {'lr': 0.00041929449956382625, 'samples': 7863360, 'steps': 40954, 'loss/train': 1.2445733547210693} 11/07/2021 02:58:49 - INFO - __main__ - Step 40956: {'lr': 0.0004192905947210762, 'samples': 7863552, 'steps': 40955, 'loss/train': 0.526964545249939} 11/07/2021 02:58:49 - INFO - __main__ - Step 40957: {'lr': 0.00041928668980204653, 'samples': 7863744, 'steps': 40956, 'loss/train': 1.0845738649368286} 11/07/2021 02:58:50 - INFO - __main__ - Step 40958: {'lr': 0.00041928278480673884, 'samples': 7863936, 'steps': 40957, 'loss/train': 1.4329932928085327} 11/07/2021 02:58:50 - INFO - __main__ - Step 40959: {'lr': 0.00041927887973515493, 'samples': 7864128, 'steps': 40958, 'loss/train': 1.483497977256775} 11/07/2021 02:58:51 - INFO - __main__ - Step 40960: {'lr': 0.0004192749745872966, 'samples': 7864320, 'steps': 40959, 'loss/train': 1.2724796533584595} 11/07/2021 02:58:51 - INFO - __main__ - Step 40961: {'lr': 0.00041927106936316563, 'samples': 7864512, 'steps': 40960, 'loss/train': 1.6600308418273926} 11/07/2021 02:58:51 - INFO - __main__ - Step 40962: {'lr': 0.00041926716406276367, 'samples': 7864704, 'steps': 40961, 'loss/train': 1.5604661703109741} 11/07/2021 02:58:52 - INFO - __main__ - Step 40963: {'lr': 0.00041926325868609247, 'samples': 7864896, 'steps': 40962, 'loss/train': 1.5598963499069214} 11/07/2021 02:58:53 - INFO - __main__ - Step 40964: {'lr': 0.0004192593532331539, 'samples': 7865088, 'steps': 40963, 'loss/train': 1.7123584747314453} 11/07/2021 02:58:53 - INFO - __main__ - Step 40965: {'lr': 0.00041925544770394976, 'samples': 7865280, 'steps': 40964, 'loss/train': 1.3562514781951904} 11/07/2021 02:58:53 - INFO - __main__ - Step 40966: {'lr': 0.0004192515420984816, 'samples': 7865472, 'steps': 40965, 'loss/train': 1.4312055110931396} 11/07/2021 02:58:54 - INFO - __main__ - Step 40967: {'lr': 0.0004192476364167514, 'samples': 7865664, 'steps': 40966, 'loss/train': 1.754093885421753} 11/07/2021 02:58:55 - INFO - __main__ - Step 40968: {'lr': 0.0004192437306587608, 'samples': 7865856, 'steps': 40967, 'loss/train': 1.5404969453811646} 11/07/2021 02:58:55 - INFO - __main__ - Step 40969: {'lr': 0.0004192398248245116, 'samples': 7866048, 'steps': 40968, 'loss/train': 1.6052510738372803} 11/07/2021 02:58:56 - INFO - __main__ - Step 40970: {'lr': 0.00041923591891400555, 'samples': 7866240, 'steps': 40969, 'loss/train': 0.7481063604354858} 11/07/2021 02:58:56 - INFO - __main__ - Step 40971: {'lr': 0.00041923201292724436, 'samples': 7866432, 'steps': 40970, 'loss/train': 1.8128414154052734} 11/07/2021 02:58:56 - INFO - __main__ - Step 40972: {'lr': 0.00041922810686422987, 'samples': 7866624, 'steps': 40971, 'loss/train': 1.3442429304122925} 11/07/2021 02:58:57 - INFO - __main__ - Step 40973: {'lr': 0.00041922420072496383, 'samples': 7866816, 'steps': 40972, 'loss/train': 1.2182066440582275} 11/07/2021 02:58:58 - INFO - __main__ - Step 40974: {'lr': 0.00041922029450944785, 'samples': 7867008, 'steps': 40973, 'loss/train': 1.3147344589233398} 11/07/2021 02:58:58 - INFO - __main__ - Step 40975: {'lr': 0.000419216388217684, 'samples': 7867200, 'steps': 40974, 'loss/train': 1.1383329629898071} 11/07/2021 02:58:58 - INFO - __main__ - Step 40976: {'lr': 0.00041921248184967374, 'samples': 7867392, 'steps': 40975, 'loss/train': 1.6389542818069458} 11/07/2021 02:58:59 - INFO - __main__ - Step 40977: {'lr': 0.000419208575405419, 'samples': 7867584, 'steps': 40976, 'loss/train': 1.6150141954421997} 11/07/2021 02:59:00 - INFO - __main__ - Step 40978: {'lr': 0.00041920466888492147, 'samples': 7867776, 'steps': 40977, 'loss/train': 0.9776393175125122} 11/07/2021 02:59:00 - INFO - __main__ - Step 40979: {'lr': 0.00041920076228818293, 'samples': 7867968, 'steps': 40978, 'loss/train': 1.7132076025009155} 11/07/2021 02:59:00 - INFO - __main__ - Step 40980: {'lr': 0.0004191968556152051, 'samples': 7868160, 'steps': 40979, 'loss/train': 0.4408467710018158} 11/07/2021 02:59:01 - INFO - __main__ - Step 40981: {'lr': 0.0004191929488659898, 'samples': 7868352, 'steps': 40980, 'loss/train': 1.6313823461532593} 11/07/2021 02:59:01 - INFO - __main__ - Step 40982: {'lr': 0.00041918904204053874, 'samples': 7868544, 'steps': 40981, 'loss/train': 1.7473126649856567} 11/07/2021 02:59:02 - INFO - __main__ - Step 40983: {'lr': 0.0004191851351388538, 'samples': 7868736, 'steps': 40982, 'loss/train': 1.521274209022522} 11/07/2021 02:59:03 - INFO - __main__ - Step 40984: {'lr': 0.0004191812281609366, 'samples': 7868928, 'steps': 40983, 'loss/train': 1.632369875907898} 11/07/2021 02:59:03 - INFO - __main__ - Step 40985: {'lr': 0.00041917732110678896, 'samples': 7869120, 'steps': 40984, 'loss/train': 2.443267822265625} 11/07/2021 02:59:03 - INFO - __main__ - Step 40986: {'lr': 0.0004191734139764126, 'samples': 7869312, 'steps': 40985, 'loss/train': 1.7018356323242188} 11/07/2021 02:59:04 - INFO - __main__ - Step 40987: {'lr': 0.00041916950676980933, 'samples': 7869504, 'steps': 40986, 'loss/train': 1.4859052896499634} 11/07/2021 02:59:04 - INFO - __main__ - Step 40988: {'lr': 0.0004191655994869809, 'samples': 7869696, 'steps': 40987, 'loss/train': 1.4574897289276123} 11/07/2021 02:59:05 - INFO - __main__ - Step 40989: {'lr': 0.000419161692127929, 'samples': 7869888, 'steps': 40988, 'loss/train': 1.970042109489441} 11/07/2021 02:59:05 - INFO - __main__ - Step 40990: {'lr': 0.00041915778469265555, 'samples': 7870080, 'steps': 40989, 'loss/train': 1.6592310667037964} 11/07/2021 02:59:06 - INFO - __main__ - Step 40991: {'lr': 0.0004191538771811621, 'samples': 7870272, 'steps': 40990, 'loss/train': 1.3842103481292725} 11/07/2021 02:59:06 - INFO - __main__ - Step 40992: {'lr': 0.00041914996959345057, 'samples': 7870464, 'steps': 40991, 'loss/train': 1.3491690158843994} 11/07/2021 02:59:06 - INFO - __main__ - Step 40993: {'lr': 0.0004191460619295227, 'samples': 7870656, 'steps': 40992, 'loss/train': 1.3155477046966553} 11/07/2021 02:59:08 - INFO - __main__ - Step 40994: {'lr': 0.0004191421541893802, 'samples': 7870848, 'steps': 40993, 'loss/train': 1.4192020893096924} 11/07/2021 02:59:08 - INFO - __main__ - Step 40995: {'lr': 0.0004191382463730249, 'samples': 7871040, 'steps': 40994, 'loss/train': 0.810729444026947} 11/07/2021 02:59:08 - INFO - __main__ - Step 40996: {'lr': 0.00041913433848045844, 'samples': 7871232, 'steps': 40995, 'loss/train': 1.2763605117797852} 11/07/2021 02:59:09 - INFO - __main__ - Step 40997: {'lr': 0.00041913043051168276, 'samples': 7871424, 'steps': 40996, 'loss/train': 1.5386114120483398} 11/07/2021 02:59:09 - INFO - __main__ - Step 40998: {'lr': 0.00041912652246669943, 'samples': 7871616, 'steps': 40997, 'loss/train': 1.7605671882629395} 11/07/2021 02:59:10 - INFO - __main__ - Step 40999: {'lr': 0.0004191226143455103, 'samples': 7871808, 'steps': 40998, 'loss/train': 1.474281668663025} 11/07/2021 02:59:10 - INFO - __main__ - Step 41000: {'lr': 0.00041911870614811715, 'samples': 7872000, 'steps': 40999, 'loss/train': 1.4995510578155518} 11/07/2021 02:59:11 - INFO - __main__ - Step 41001: {'lr': 0.00041911479787452177, 'samples': 7872192, 'steps': 41000, 'loss/train': 1.3275129795074463} 11/07/2021 02:59:11 - INFO - __main__ - Step 41002: {'lr': 0.0004191108895247258, 'samples': 7872384, 'steps': 41001, 'loss/train': 1.3515865802764893} 11/07/2021 02:59:11 - INFO - __main__ - Step 41003: {'lr': 0.00041910698109873116, 'samples': 7872576, 'steps': 41002, 'loss/train': 1.233052372932434} 11/07/2021 02:59:12 - INFO - __main__ - Step 41004: {'lr': 0.0004191030725965394, 'samples': 7872768, 'steps': 41003, 'loss/train': 1.0532420873641968} 11/07/2021 02:59:13 - INFO - __main__ - Step 41005: {'lr': 0.00041909916401815245, 'samples': 7872960, 'steps': 41004, 'loss/train': 1.5793228149414062} 11/07/2021 02:59:13 - INFO - __main__ - Step 41006: {'lr': 0.00041909525536357206, 'samples': 7873152, 'steps': 41005, 'loss/train': 1.4701327085494995} 11/07/2021 02:59:14 - INFO - __main__ - Step 41007: {'lr': 0.0004190913466327999, 'samples': 7873344, 'steps': 41006, 'loss/train': 1.8696014881134033} 11/07/2021 02:59:14 - INFO - __main__ - Step 41008: {'lr': 0.00041908743782583793, 'samples': 7873536, 'steps': 41007, 'loss/train': 1.466177225112915} 11/07/2021 02:59:14 - INFO - __main__ - Step 41009: {'lr': 0.00041908352894268766, 'samples': 7873728, 'steps': 41008, 'loss/train': 1.524558663368225} 11/07/2021 02:59:15 - INFO - __main__ - Step 41010: {'lr': 0.00041907961998335094, 'samples': 7873920, 'steps': 41009, 'loss/train': 1.7194336652755737} 11/07/2021 02:59:16 - INFO - __main__ - Step 41011: {'lr': 0.0004190757109478296, 'samples': 7874112, 'steps': 41010, 'loss/train': 1.3648717403411865} 11/07/2021 02:59:16 - INFO - __main__ - Step 41012: {'lr': 0.00041907180183612525, 'samples': 7874304, 'steps': 41011, 'loss/train': 1.446285605430603} 11/07/2021 02:59:16 - INFO - __main__ - Step 41013: {'lr': 0.00041906789264823985, 'samples': 7874496, 'steps': 41012, 'loss/train': 1.3314675092697144} 11/07/2021 02:59:17 - INFO - __main__ - Step 41014: {'lr': 0.00041906398338417504, 'samples': 7874688, 'steps': 41013, 'loss/train': 1.8180811405181885} 11/07/2021 02:59:18 - INFO - __main__ - Step 41015: {'lr': 0.00041906007404393273, 'samples': 7874880, 'steps': 41014, 'loss/train': 1.1559853553771973} 11/07/2021 02:59:18 - INFO - __main__ - Step 41016: {'lr': 0.0004190561646275144, 'samples': 7875072, 'steps': 41015, 'loss/train': 1.9799628257751465} 11/07/2021 02:59:18 - INFO - __main__ - Step 41017: {'lr': 0.0004190522551349221, 'samples': 7875264, 'steps': 41016, 'loss/train': 1.4010202884674072} 11/07/2021 02:59:19 - INFO - __main__ - Step 41018: {'lr': 0.00041904834556615733, 'samples': 7875456, 'steps': 41017, 'loss/train': 1.2678967714309692} 11/07/2021 02:59:19 - INFO - __main__ - Step 41019: {'lr': 0.000419044435921222, 'samples': 7875648, 'steps': 41018, 'loss/train': 1.5244847536087036} 11/07/2021 02:59:20 - INFO - __main__ - Step 41020: {'lr': 0.0004190405262001179, 'samples': 7875840, 'steps': 41019, 'loss/train': 1.3021565675735474} 11/07/2021 02:59:21 - INFO - __main__ - Step 41021: {'lr': 0.00041903661640284675, 'samples': 7876032, 'steps': 41020, 'loss/train': 1.822368860244751} 11/07/2021 02:59:21 - INFO - __main__ - Step 41022: {'lr': 0.0004190327065294104, 'samples': 7876224, 'steps': 41021, 'loss/train': 0.6467123627662659} 11/07/2021 02:59:21 - INFO - __main__ - Step 41023: {'lr': 0.00041902879657981036, 'samples': 7876416, 'steps': 41022, 'loss/train': 1.0762028694152832} 11/07/2021 02:59:22 - INFO - __main__ - Step 41024: {'lr': 0.00041902488655404864, 'samples': 7876608, 'steps': 41023, 'loss/train': 1.6476768255233765} 11/07/2021 02:59:23 - INFO - __main__ - Step 41025: {'lr': 0.0004190209764521269, 'samples': 7876800, 'steps': 41024, 'loss/train': 1.7097328901290894} 11/07/2021 02:59:23 - INFO - __main__ - Step 41026: {'lr': 0.0004190170662740469, 'samples': 7876992, 'steps': 41025, 'loss/train': 1.1358178853988647} 11/07/2021 02:59:23 - INFO - __main__ - Step 41027: {'lr': 0.0004190131560198104, 'samples': 7877184, 'steps': 41026, 'loss/train': 1.4658715724945068} 11/07/2021 02:59:24 - INFO - __main__ - Step 41028: {'lr': 0.00041900924568941925, 'samples': 7877376, 'steps': 41027, 'loss/train': 0.515064537525177} 11/07/2021 02:59:24 - INFO - __main__ - Step 41029: {'lr': 0.0004190053352828751, 'samples': 7877568, 'steps': 41028, 'loss/train': 1.1149355173110962} 11/07/2021 02:59:25 - INFO - __main__ - Step 41030: {'lr': 0.00041900142480017974, 'samples': 7877760, 'steps': 41029, 'loss/train': 0.8842067718505859} 11/07/2021 02:59:25 - INFO - __main__ - Step 41031: {'lr': 0.0004189975142413349, 'samples': 7877952, 'steps': 41030, 'loss/train': 1.2230801582336426} 11/07/2021 02:59:26 - INFO - __main__ - Step 41032: {'lr': 0.00041899360360634247, 'samples': 7878144, 'steps': 41031, 'loss/train': 1.5904314517974854} 11/07/2021 02:59:26 - INFO - __main__ - Step 41033: {'lr': 0.0004189896928952041, 'samples': 7878336, 'steps': 41032, 'loss/train': 1.352460265159607} 11/07/2021 02:59:26 - INFO - __main__ - Step 41034: {'lr': 0.0004189857821079216, 'samples': 7878528, 'steps': 41033, 'loss/train': 1.438441514968872} 11/07/2021 02:59:28 - INFO - __main__ - Step 41035: {'lr': 0.0004189818712444967, 'samples': 7878720, 'steps': 41034, 'loss/train': 1.6369142532348633} 11/07/2021 02:59:28 - INFO - __main__ - Step 41036: {'lr': 0.0004189779603049312, 'samples': 7878912, 'steps': 41035, 'loss/train': 0.795782208442688} 11/07/2021 02:59:28 - INFO - __main__ - Step 41037: {'lr': 0.0004189740492892268, 'samples': 7879104, 'steps': 41036, 'loss/train': 1.4832515716552734} 11/07/2021 02:59:29 - INFO - __main__ - Step 41038: {'lr': 0.0004189701381973853, 'samples': 7879296, 'steps': 41037, 'loss/train': 1.3019779920578003} 11/07/2021 02:59:29 - INFO - __main__ - Step 41039: {'lr': 0.00041896622702940846, 'samples': 7879488, 'steps': 41038, 'loss/train': 1.4662656784057617} 11/07/2021 02:59:29 - INFO - __main__ - Step 41040: {'lr': 0.0004189623157852981, 'samples': 7879680, 'steps': 41039, 'loss/train': 1.8899928331375122} 11/07/2021 02:59:30 - INFO - __main__ - Step 41041: {'lr': 0.0004189584044650559, 'samples': 7879872, 'steps': 41040, 'loss/train': 2.035140037536621} 11/07/2021 02:59:31 - INFO - __main__ - Step 41042: {'lr': 0.0004189544930686837, 'samples': 7880064, 'steps': 41041, 'loss/train': 2.037627696990967} 11/07/2021 02:59:31 - INFO - __main__ - Step 41043: {'lr': 0.0004189505815961831, 'samples': 7880256, 'steps': 41042, 'loss/train': 1.5560941696166992} 11/07/2021 02:59:31 - INFO - __main__ - Step 41044: {'lr': 0.000418946670047556, 'samples': 7880448, 'steps': 41043, 'loss/train': 1.392247200012207} 11/07/2021 02:59:32 - INFO - __main__ - Step 41045: {'lr': 0.0004189427584228042, 'samples': 7880640, 'steps': 41044, 'loss/train': 1.5331697463989258} 11/07/2021 02:59:33 - INFO - __main__ - Step 41046: {'lr': 0.0004189388467219294, 'samples': 7880832, 'steps': 41045, 'loss/train': 1.496413230895996} 11/07/2021 02:59:33 - INFO - __main__ - Step 41047: {'lr': 0.0004189349349449333, 'samples': 7881024, 'steps': 41046, 'loss/train': 1.2940542697906494} 11/07/2021 02:59:34 - INFO - __main__ - Step 41048: {'lr': 0.00041893102309181773, 'samples': 7881216, 'steps': 41047, 'loss/train': 1.5159060955047607} 11/07/2021 02:59:34 - INFO - __main__ - Step 41049: {'lr': 0.00041892711116258454, 'samples': 7881408, 'steps': 41048, 'loss/train': 1.2349627017974854} 11/07/2021 02:59:34 - INFO - __main__ - Step 41050: {'lr': 0.00041892319915723533, 'samples': 7881600, 'steps': 41049, 'loss/train': 1.1747950315475464} 11/07/2021 02:59:35 - INFO - __main__ - Step 41051: {'lr': 0.0004189192870757719, 'samples': 7881792, 'steps': 41050, 'loss/train': 1.7542132139205933} 11/07/2021 02:59:36 - INFO - __main__ - Step 41052: {'lr': 0.0004189153749181961, 'samples': 7881984, 'steps': 41051, 'loss/train': 1.3275436162948608} 11/07/2021 02:59:36 - INFO - __main__ - Step 41053: {'lr': 0.00041891146268450963, 'samples': 7882176, 'steps': 41052, 'loss/train': 1.717761516571045} 11/07/2021 02:59:37 - INFO - __main__ - Step 41054: {'lr': 0.0004189075503747142, 'samples': 7882368, 'steps': 41053, 'loss/train': 1.5952472686767578} 11/07/2021 02:59:37 - INFO - __main__ - Step 41055: {'lr': 0.0004189036379888117, 'samples': 7882560, 'steps': 41054, 'loss/train': 1.4856972694396973} 11/07/2021 02:59:37 - INFO - __main__ - Step 41056: {'lr': 0.00041889972552680387, 'samples': 7882752, 'steps': 41055, 'loss/train': 1.6331814527511597} 11/07/2021 02:59:38 - INFO - __main__ - Step 41057: {'lr': 0.0004188958129886924, 'samples': 7882944, 'steps': 41056, 'loss/train': 1.7809665203094482} 11/07/2021 02:59:39 - INFO - __main__ - Step 41058: {'lr': 0.000418891900374479, 'samples': 7883136, 'steps': 41057, 'loss/train': 1.209208369255066} 11/07/2021 02:59:39 - INFO - __main__ - Step 41059: {'lr': 0.0004188879876841656, 'samples': 7883328, 'steps': 41058, 'loss/train': 2.0920801162719727} 11/07/2021 02:59:39 - INFO - __main__ - Step 41060: {'lr': 0.0004188840749177538, 'samples': 7883520, 'steps': 41059, 'loss/train': 1.6948528289794922} 11/07/2021 02:59:40 - INFO - __main__ - Step 41061: {'lr': 0.0004188801620752455, 'samples': 7883712, 'steps': 41060, 'loss/train': 1.7842382192611694} 11/07/2021 02:59:41 - INFO - __main__ - Step 41062: {'lr': 0.00041887624915664247, 'samples': 7883904, 'steps': 41061, 'loss/train': 0.37048017978668213} 11/07/2021 02:59:41 - INFO - __main__ - Step 41063: {'lr': 0.0004188723361619463, 'samples': 7884096, 'steps': 41062, 'loss/train': 1.9038022756576538} 11/07/2021 02:59:41 - INFO - __main__ - Step 41064: {'lr': 0.0004188684230911589, 'samples': 7884288, 'steps': 41063, 'loss/train': 1.2390437126159668} 11/07/2021 02:59:42 - INFO - __main__ - Step 41065: {'lr': 0.00041886450994428197, 'samples': 7884480, 'steps': 41064, 'loss/train': 1.3526166677474976} 11/07/2021 02:59:42 - INFO - __main__ - Step 41066: {'lr': 0.0004188605967213174, 'samples': 7884672, 'steps': 41065, 'loss/train': 0.6758860945701599} 11/07/2021 02:59:43 - INFO - __main__ - Step 41067: {'lr': 0.0004188566834222667, 'samples': 7884864, 'steps': 41066, 'loss/train': 1.5114113092422485} 11/07/2021 02:59:43 - INFO - __main__ - Step 41068: {'lr': 0.00041885277004713185, 'samples': 7885056, 'steps': 41067, 'loss/train': 1.4266055822372437} 11/07/2021 02:59:44 - INFO - __main__ - Step 41069: {'lr': 0.0004188488565959146, 'samples': 7885248, 'steps': 41068, 'loss/train': 1.9119995832443237} 11/07/2021 02:59:44 - INFO - __main__ - Step 41070: {'lr': 0.0004188449430686166, 'samples': 7885440, 'steps': 41069, 'loss/train': 1.4126471281051636} 11/07/2021 02:59:44 - INFO - __main__ - Step 41071: {'lr': 0.00041884102946523964, 'samples': 7885632, 'steps': 41070, 'loss/train': 1.4124705791473389} 11/07/2021 02:59:45 - INFO - __main__ - Step 41072: {'lr': 0.0004188371157857856, 'samples': 7885824, 'steps': 41071, 'loss/train': 2.075404167175293} 11/07/2021 02:59:46 - INFO - __main__ - Step 41073: {'lr': 0.0004188332020302561, 'samples': 7886016, 'steps': 41072, 'loss/train': 1.1294195652008057} 11/07/2021 02:59:46 - INFO - __main__ - Step 41074: {'lr': 0.000418829288198653, 'samples': 7886208, 'steps': 41073, 'loss/train': 1.5374236106872559} 11/07/2021 02:59:47 - INFO - __main__ - Step 41075: {'lr': 0.00041882537429097804, 'samples': 7886400, 'steps': 41074, 'loss/train': 1.0856437683105469} 11/07/2021 02:59:47 - INFO - __main__ - Step 41076: {'lr': 0.00041882146030723297, 'samples': 7886592, 'steps': 41075, 'loss/train': 1.1252588033676147} 11/07/2021 02:59:48 - INFO - __main__ - Step 41077: {'lr': 0.0004188175462474195, 'samples': 7886784, 'steps': 41076, 'loss/train': 1.6488182544708252} 11/07/2021 02:59:48 - INFO - __main__ - Step 41078: {'lr': 0.0004188136321115395, 'samples': 7886976, 'steps': 41077, 'loss/train': 1.5101019144058228} 11/07/2021 02:59:49 - INFO - __main__ - Step 41079: {'lr': 0.00041880971789959466, 'samples': 7887168, 'steps': 41078, 'loss/train': 1.8681154251098633} 11/07/2021 02:59:49 - INFO - __main__ - Step 41080: {'lr': 0.0004188058036115868, 'samples': 7887360, 'steps': 41079, 'loss/train': 1.7132915258407593} 11/07/2021 02:59:49 - INFO - __main__ - Step 41081: {'lr': 0.0004188018892475176, 'samples': 7887552, 'steps': 41080, 'loss/train': 1.715844988822937} 11/07/2021 02:59:50 - INFO - __main__ - Step 41082: {'lr': 0.0004187979748073889, 'samples': 7887744, 'steps': 41081, 'loss/train': 1.5530753135681152} 11/07/2021 02:59:51 - INFO - __main__ - Step 41083: {'lr': 0.0004187940602912024, 'samples': 7887936, 'steps': 41082, 'loss/train': 1.3540942668914795} 11/07/2021 02:59:51 - INFO - __main__ - Step 41084: {'lr': 0.00041879014569895994, 'samples': 7888128, 'steps': 41083, 'loss/train': 1.4271323680877686} 11/07/2021 02:59:51 - INFO - __main__ - Step 41085: {'lr': 0.0004187862310306633, 'samples': 7888320, 'steps': 41084, 'loss/train': 1.9033116102218628} 11/07/2021 02:59:52 - INFO - __main__ - Step 41086: {'lr': 0.00041878231628631406, 'samples': 7888512, 'steps': 41085, 'loss/train': 2.167079448699951} 11/07/2021 02:59:53 - INFO - __main__ - Step 41087: {'lr': 0.0004187784014659142, 'samples': 7888704, 'steps': 41086, 'loss/train': 0.9366491436958313} 11/07/2021 02:59:53 - INFO - __main__ - Step 41088: {'lr': 0.0004187744865694654, 'samples': 7888896, 'steps': 41087, 'loss/train': 1.4434558153152466} 11/07/2021 02:59:54 - INFO - __main__ - Step 41089: {'lr': 0.0004187705715969694, 'samples': 7889088, 'steps': 41088, 'loss/train': 1.1905782222747803} 11/07/2021 02:59:54 - INFO - __main__ - Step 41090: {'lr': 0.0004187666565484279, 'samples': 7889280, 'steps': 41089, 'loss/train': 1.3745850324630737} 11/07/2021 02:59:54 - INFO - __main__ - Step 41091: {'lr': 0.0004187627414238428, 'samples': 7889472, 'steps': 41090, 'loss/train': 0.6837329864501953} 11/07/2021 02:59:55 - INFO - __main__ - Step 41092: {'lr': 0.0004187588262232159, 'samples': 7889664, 'steps': 41091, 'loss/train': 1.574971079826355} 11/07/2021 02:59:56 - INFO - __main__ - Step 41093: {'lr': 0.00041875491094654885, 'samples': 7889856, 'steps': 41092, 'loss/train': 1.2117096185684204} 11/07/2021 02:59:56 - INFO - __main__ - Step 41094: {'lr': 0.0004187509955938434, 'samples': 7890048, 'steps': 41093, 'loss/train': 1.3760145902633667} 11/07/2021 02:59:56 - INFO - __main__ - Step 41095: {'lr': 0.0004187470801651013, 'samples': 7890240, 'steps': 41094, 'loss/train': 1.254813551902771} 11/07/2021 02:59:57 - INFO - __main__ - Step 41096: {'lr': 0.0004187431646603245, 'samples': 7890432, 'steps': 41095, 'loss/train': 1.0888969898223877} 11/07/2021 02:59:58 - INFO - __main__ - Step 41097: {'lr': 0.0004187392490795146, 'samples': 7890624, 'steps': 41096, 'loss/train': 1.7391693592071533} 11/07/2021 02:59:58 - INFO - __main__ - Step 41098: {'lr': 0.00041873533342267336, 'samples': 7890816, 'steps': 41097, 'loss/train': 1.4721550941467285} 11/07/2021 02:59:58 - INFO - __main__ - Step 41099: {'lr': 0.0004187314176898026, 'samples': 7891008, 'steps': 41098, 'loss/train': 1.741350769996643} 11/07/2021 02:59:59 - INFO - __main__ - Step 41100: {'lr': 0.000418727501880904, 'samples': 7891200, 'steps': 41099, 'loss/train': 2.3582005500793457} 11/07/2021 02:59:59 - INFO - __main__ - Step 41101: {'lr': 0.00041872358599597947, 'samples': 7891392, 'steps': 41100, 'loss/train': 1.6507580280303955} 11/07/2021 02:59:59 - INFO - __main__ - Step 41102: {'lr': 0.00041871967003503073, 'samples': 7891584, 'steps': 41101, 'loss/train': 1.177298903465271} 11/07/2021 03:00:01 - INFO - __main__ - Step 41103: {'lr': 0.00041871575399805947, 'samples': 7891776, 'steps': 41102, 'loss/train': 1.187396764755249} 11/07/2021 03:00:01 - INFO - __main__ - Step 41104: {'lr': 0.0004187118378850674, 'samples': 7891968, 'steps': 41103, 'loss/train': 1.6059626340866089} 11/07/2021 03:00:01 - INFO - __main__ - Step 41105: {'lr': 0.00041870792169605654, 'samples': 7892160, 'steps': 41104, 'loss/train': 2.1208462715148926} 11/07/2021 03:00:02 - INFO - __main__ - Step 41106: {'lr': 0.0004187040054310284, 'samples': 7892352, 'steps': 41105, 'loss/train': 1.490696907043457} 11/07/2021 03:00:02 - INFO - __main__ - Step 41107: {'lr': 0.0004187000890899848, 'samples': 7892544, 'steps': 41106, 'loss/train': 1.494994044303894} 11/07/2021 03:00:03 - INFO - __main__ - Step 41108: {'lr': 0.0004186961726729276, 'samples': 7892736, 'steps': 41107, 'loss/train': 1.3582350015640259} 11/07/2021 03:00:03 - INFO - __main__ - Step 41109: {'lr': 0.0004186922561798585, 'samples': 7892928, 'steps': 41108, 'loss/train': 0.9714041948318481} 11/07/2021 03:00:04 - INFO - __main__ - Step 41110: {'lr': 0.00041868833961077935, 'samples': 7893120, 'steps': 41109, 'loss/train': 1.1687935590744019} 11/07/2021 03:00:04 - INFO - __main__ - Step 41111: {'lr': 0.0004186844229656917, 'samples': 7893312, 'steps': 41110, 'loss/train': 1.5238968133926392} 11/07/2021 03:00:04 - INFO - __main__ - Step 41112: {'lr': 0.0004186805062445975, 'samples': 7893504, 'steps': 41111, 'loss/train': 1.8914436101913452} 11/07/2021 03:00:06 - INFO - __main__ - Step 41113: {'lr': 0.00041867658944749856, 'samples': 7893696, 'steps': 41112, 'loss/train': 1.78902006149292} 11/07/2021 03:00:06 - INFO - __main__ - Step 41114: {'lr': 0.00041867267257439644, 'samples': 7893888, 'steps': 41113, 'loss/train': 1.6058887243270874} 11/07/2021 03:00:06 - INFO - __main__ - Step 41115: {'lr': 0.00041866875562529305, 'samples': 7894080, 'steps': 41114, 'loss/train': 1.6523754596710205} 11/07/2021 03:00:07 - INFO - __main__ - Step 41116: {'lr': 0.0004186648386001901, 'samples': 7894272, 'steps': 41115, 'loss/train': 0.9120059013366699} 11/07/2021 03:00:07 - INFO - __main__ - Step 41117: {'lr': 0.0004186609214990894, 'samples': 7894464, 'steps': 41116, 'loss/train': 1.5206637382507324} 11/07/2021 03:00:07 - INFO - __main__ - Step 41118: {'lr': 0.0004186570043219927, 'samples': 7894656, 'steps': 41117, 'loss/train': 1.4369728565216064} 11/07/2021 03:00:08 - INFO - __main__ - Step 41119: {'lr': 0.0004186530870689017, 'samples': 7894848, 'steps': 41118, 'loss/train': 1.2693451642990112} 11/07/2021 03:00:09 - INFO - __main__ - Step 41120: {'lr': 0.00041864916973981833, 'samples': 7895040, 'steps': 41119, 'loss/train': 1.017479419708252} 11/07/2021 03:00:09 - INFO - __main__ - Step 41121: {'lr': 0.0004186452523347442, 'samples': 7895232, 'steps': 41120, 'loss/train': 1.0667365789413452} 11/07/2021 03:00:09 - INFO - __main__ - Step 41122: {'lr': 0.00041864133485368106, 'samples': 7895424, 'steps': 41121, 'loss/train': 1.0865099430084229} 11/07/2021 03:00:10 - INFO - __main__ - Step 41123: {'lr': 0.0004186374172966308, 'samples': 7895616, 'steps': 41122, 'loss/train': 1.5828053951263428} 11/07/2021 03:00:11 - INFO - __main__ - Step 41124: {'lr': 0.0004186334996635951, 'samples': 7895808, 'steps': 41123, 'loss/train': 1.979206919670105} 11/07/2021 03:00:11 - INFO - __main__ - Step 41125: {'lr': 0.00041862958195457574, 'samples': 7896000, 'steps': 41124, 'loss/train': 0.7947288751602173} 11/07/2021 03:00:12 - INFO - __main__ - Step 41126: {'lr': 0.0004186256641695745, 'samples': 7896192, 'steps': 41125, 'loss/train': 0.6448233723640442} 11/07/2021 03:00:12 - INFO - __main__ - Step 41127: {'lr': 0.00041862174630859315, 'samples': 7896384, 'steps': 41126, 'loss/train': 1.4096827507019043} 11/07/2021 03:00:12 - INFO - __main__ - Step 41128: {'lr': 0.0004186178283716334, 'samples': 7896576, 'steps': 41127, 'loss/train': 1.2233763933181763} 11/07/2021 03:00:13 - INFO - __main__ - Step 41129: {'lr': 0.0004186139103586971, 'samples': 7896768, 'steps': 41128, 'loss/train': 1.5383292436599731} 11/07/2021 03:00:14 - INFO - __main__ - Step 41130: {'lr': 0.00041860999226978605, 'samples': 7896960, 'steps': 41129, 'loss/train': 1.4980281591415405} 11/07/2021 03:00:14 - INFO - __main__ - Step 41131: {'lr': 0.0004186060741049018, 'samples': 7897152, 'steps': 41130, 'loss/train': 0.919456958770752} 11/07/2021 03:00:15 - INFO - __main__ - Step 41132: {'lr': 0.00041860215586404624, 'samples': 7897344, 'steps': 41131, 'loss/train': 1.7707819938659668} 11/07/2021 03:00:15 - INFO - __main__ - Step 41133: {'lr': 0.00041859823754722127, 'samples': 7897536, 'steps': 41132, 'loss/train': 1.076181411743164} 11/07/2021 03:00:16 - INFO - __main__ - Step 41134: {'lr': 0.00041859431915442847, 'samples': 7897728, 'steps': 41133, 'loss/train': 1.4806371927261353} 11/07/2021 03:00:16 - INFO - __main__ - Step 41135: {'lr': 0.0004185904006856697, 'samples': 7897920, 'steps': 41134, 'loss/train': 1.6345583200454712} 11/07/2021 03:00:17 - INFO - __main__ - Step 41136: {'lr': 0.0004185864821409467, 'samples': 7898112, 'steps': 41135, 'loss/train': 1.5924763679504395} 11/07/2021 03:00:17 - INFO - __main__ - Step 41137: {'lr': 0.00041858256352026124, 'samples': 7898304, 'steps': 41136, 'loss/train': 1.5258071422576904} 11/07/2021 03:00:17 - INFO - __main__ - Step 41138: {'lr': 0.0004185786448236151, 'samples': 7898496, 'steps': 41137, 'loss/train': 1.2716859579086304} 11/07/2021 03:00:18 - INFO - __main__ - Step 41139: {'lr': 0.0004185747260510099, 'samples': 7898688, 'steps': 41138, 'loss/train': 1.5812559127807617} 11/07/2021 03:00:19 - INFO - __main__ - Step 41140: {'lr': 0.0004185708072024476, 'samples': 7898880, 'steps': 41139, 'loss/train': 1.661331057548523} 11/07/2021 03:00:19 - INFO - __main__ - Step 41141: {'lr': 0.0004185668882779299, 'samples': 7899072, 'steps': 41140, 'loss/train': 1.2637416124343872} 11/07/2021 03:00:19 - INFO - __main__ - Step 41142: {'lr': 0.00041856296927745857, 'samples': 7899264, 'steps': 41141, 'loss/train': 1.122740626335144} 11/07/2021 03:00:20 - INFO - __main__ - Step 41143: {'lr': 0.00041855905020103543, 'samples': 7899456, 'steps': 41142, 'loss/train': 1.4493659734725952} 11/07/2021 03:00:21 - INFO - __main__ - Step 41144: {'lr': 0.00041855513104866203, 'samples': 7899648, 'steps': 41143, 'loss/train': 1.2647788524627686} 11/07/2021 03:00:21 - INFO - __main__ - Step 41145: {'lr': 0.00041855121182034037, 'samples': 7899840, 'steps': 41144, 'loss/train': 1.7006542682647705} 11/07/2021 03:00:22 - INFO - __main__ - Step 41146: {'lr': 0.00041854729251607214, 'samples': 7900032, 'steps': 41145, 'loss/train': 1.4035475254058838} 11/07/2021 03:00:22 - INFO - __main__ - Step 41147: {'lr': 0.00041854337313585913, 'samples': 7900224, 'steps': 41146, 'loss/train': 1.5544965267181396} 11/07/2021 03:00:22 - INFO - __main__ - Step 41148: {'lr': 0.000418539453679703, 'samples': 7900416, 'steps': 41147, 'loss/train': 1.6288424730300903} 11/07/2021 03:00:23 - INFO - __main__ - Step 41149: {'lr': 0.0004185355341476057, 'samples': 7900608, 'steps': 41148, 'loss/train': 1.534730315208435} 11/07/2021 03:00:24 - INFO - __main__ - Step 41150: {'lr': 0.00041853161453956885, 'samples': 7900800, 'steps': 41149, 'loss/train': 1.6405394077301025} 11/07/2021 03:00:24 - INFO - __main__ - Step 41151: {'lr': 0.0004185276948555942, 'samples': 7900992, 'steps': 41150, 'loss/train': 1.3956571817398071} 11/07/2021 03:00:24 - INFO - __main__ - Step 41152: {'lr': 0.0004185237750956836, 'samples': 7901184, 'steps': 41151, 'loss/train': 1.4736276865005493} 11/07/2021 03:00:25 - INFO - __main__ - Step 41153: {'lr': 0.0004185198552598388, 'samples': 7901376, 'steps': 41152, 'loss/train': 1.2600595951080322} 11/07/2021 03:00:26 - INFO - __main__ - Step 41154: {'lr': 0.00041851593534806154, 'samples': 7901568, 'steps': 41153, 'loss/train': 1.4591710567474365} 11/07/2021 03:00:26 - INFO - __main__ - Step 41155: {'lr': 0.0004185120153603536, 'samples': 7901760, 'steps': 41154, 'loss/train': 1.0266485214233398} 11/07/2021 03:00:27 - INFO - __main__ - Step 41156: {'lr': 0.0004185080952967168, 'samples': 7901952, 'steps': 41155, 'loss/train': 1.5074058771133423} 11/07/2021 03:00:27 - INFO - __main__ - Step 41157: {'lr': 0.00041850417515715277, 'samples': 7902144, 'steps': 41156, 'loss/train': 1.0465952157974243} 11/07/2021 03:00:27 - INFO - __main__ - Step 41158: {'lr': 0.00041850025494166346, 'samples': 7902336, 'steps': 41157, 'loss/train': 1.8012182712554932} 11/07/2021 03:00:28 - INFO - __main__ - Step 41159: {'lr': 0.0004184963346502504, 'samples': 7902528, 'steps': 41158, 'loss/train': 1.931903600692749} 11/07/2021 03:00:29 - INFO - __main__ - Step 41160: {'lr': 0.00041849241428291555, 'samples': 7902720, 'steps': 41159, 'loss/train': 0.7333551049232483} 11/07/2021 03:00:29 - INFO - __main__ - Step 41161: {'lr': 0.00041848849383966063, 'samples': 7902912, 'steps': 41160, 'loss/train': 0.9262370467185974} 11/07/2021 03:00:29 - INFO - __main__ - Step 41162: {'lr': 0.0004184845733204874, 'samples': 7903104, 'steps': 41161, 'loss/train': 1.379118800163269} 11/07/2021 03:00:30 - INFO - __main__ - Step 41163: {'lr': 0.00041848065272539765, 'samples': 7903296, 'steps': 41162, 'loss/train': 1.4716529846191406} 11/07/2021 03:00:31 - INFO - __main__ - Step 41164: {'lr': 0.00041847673205439305, 'samples': 7903488, 'steps': 41163, 'loss/train': 1.436081886291504} 11/07/2021 03:00:31 - INFO - __main__ - Step 41165: {'lr': 0.0004184728113074755, 'samples': 7903680, 'steps': 41164, 'loss/train': 1.4125069379806519} 11/07/2021 03:00:31 - INFO - __main__ - Step 41166: {'lr': 0.00041846889048464665, 'samples': 7903872, 'steps': 41165, 'loss/train': 1.7010443210601807} 11/07/2021 03:00:32 - INFO - __main__ - Step 41167: {'lr': 0.0004184649695859083, 'samples': 7904064, 'steps': 41166, 'loss/train': 1.6647261381149292} 11/07/2021 03:00:32 - INFO - __main__ - Step 41168: {'lr': 0.00041846104861126233, 'samples': 7904256, 'steps': 41167, 'loss/train': 2.12406325340271} 11/07/2021 03:00:33 - INFO - __main__ - Step 41169: {'lr': 0.0004184571275607103, 'samples': 7904448, 'steps': 41168, 'loss/train': 1.7294236421585083} 11/07/2021 03:00:34 - INFO - __main__ - Step 41170: {'lr': 0.0004184532064342542, 'samples': 7904640, 'steps': 41169, 'loss/train': 1.4182112216949463} 11/07/2021 03:00:34 - INFO - __main__ - Step 41171: {'lr': 0.0004184492852318956, 'samples': 7904832, 'steps': 41170, 'loss/train': 1.6887729167938232} 11/07/2021 03:00:34 - INFO - __main__ - Step 41172: {'lr': 0.00041844536395363636, 'samples': 7905024, 'steps': 41171, 'loss/train': 1.4426734447479248} 11/07/2021 03:00:35 - INFO - __main__ - Step 41173: {'lr': 0.00041844144259947825, 'samples': 7905216, 'steps': 41172, 'loss/train': 1.449723720550537} 11/07/2021 03:00:36 - INFO - __main__ - Step 41174: {'lr': 0.000418437521169423, 'samples': 7905408, 'steps': 41173, 'loss/train': 1.7111395597457886} 11/07/2021 03:00:36 - INFO - __main__ - Step 41175: {'lr': 0.0004184335996634725, 'samples': 7905600, 'steps': 41174, 'loss/train': 1.5197077989578247} 11/07/2021 03:00:36 - INFO - __main__ - Step 41176: {'lr': 0.00041842967808162834, 'samples': 7905792, 'steps': 41175, 'loss/train': 1.224771499633789} 11/07/2021 03:00:37 - INFO - __main__ - Step 41177: {'lr': 0.0004184257564238924, 'samples': 7905984, 'steps': 41176, 'loss/train': 1.630832314491272} 11/07/2021 03:00:37 - INFO - __main__ - Step 41178: {'lr': 0.0004184218346902663, 'samples': 7906176, 'steps': 41177, 'loss/train': 1.420072078704834} 11/07/2021 03:00:37 - INFO - __main__ - Step 41179: {'lr': 0.00041841791288075203, 'samples': 7906368, 'steps': 41178, 'loss/train': 1.6199049949645996} 11/07/2021 03:00:39 - INFO - __main__ - Step 41180: {'lr': 0.0004184139909953513, 'samples': 7906560, 'steps': 41179, 'loss/train': 1.5802867412567139} 11/07/2021 03:00:39 - INFO - __main__ - Step 41181: {'lr': 0.0004184100690340657, 'samples': 7906752, 'steps': 41180, 'loss/train': 1.0029767751693726} 11/07/2021 03:00:39 - INFO - __main__ - Step 41182: {'lr': 0.00041840614699689715, 'samples': 7906944, 'steps': 41181, 'loss/train': 1.5073751211166382} 11/07/2021 03:00:40 - INFO - __main__ - Step 41183: {'lr': 0.00041840222488384745, 'samples': 7907136, 'steps': 41182, 'loss/train': 1.5685653686523438} 11/07/2021 03:00:40 - INFO - __main__ - Step 41184: {'lr': 0.00041839830269491823, 'samples': 7907328, 'steps': 41183, 'loss/train': 1.391185998916626} 11/07/2021 03:00:41 - INFO - __main__ - Step 41185: {'lr': 0.0004183943804301114, 'samples': 7907520, 'steps': 41184, 'loss/train': 1.2560378313064575} 11/07/2021 03:00:41 - INFO - __main__ - Step 41186: {'lr': 0.0004183904580894287, 'samples': 7907712, 'steps': 41185, 'loss/train': 1.225590467453003} 11/07/2021 03:00:42 - INFO - __main__ - Step 41187: {'lr': 0.0004183865356728717, 'samples': 7907904, 'steps': 41186, 'loss/train': 1.9919193983078003} 11/07/2021 03:00:42 - INFO - __main__ - Step 41188: {'lr': 0.0004183826131804424, 'samples': 7908096, 'steps': 41187, 'loss/train': 1.4859697818756104} 11/07/2021 03:00:42 - INFO - __main__ - Step 41189: {'lr': 0.0004183786906121425, 'samples': 7908288, 'steps': 41188, 'loss/train': 1.4693942070007324} 11/07/2021 03:00:43 - INFO - __main__ - Step 41190: {'lr': 0.0004183747679679738, 'samples': 7908480, 'steps': 41189, 'loss/train': 1.2072123289108276} 11/07/2021 03:00:44 - INFO - __main__ - Step 41191: {'lr': 0.000418370845247938, 'samples': 7908672, 'steps': 41190, 'loss/train': 1.5702741146087646} 11/07/2021 03:00:44 - INFO - __main__ - Step 41192: {'lr': 0.0004183669224520369, 'samples': 7908864, 'steps': 41191, 'loss/train': 1.5912758111953735} 11/07/2021 03:00:44 - INFO - __main__ - Step 41193: {'lr': 0.00041836299958027226, 'samples': 7909056, 'steps': 41192, 'loss/train': 1.2868040800094604} 11/07/2021 03:00:45 - INFO - __main__ - Step 41194: {'lr': 0.00041835907663264585, 'samples': 7909248, 'steps': 41193, 'loss/train': 2.07917857170105} 11/07/2021 03:00:46 - INFO - __main__ - Step 41195: {'lr': 0.0004183551536091594, 'samples': 7909440, 'steps': 41194, 'loss/train': 1.5447174310684204} 11/07/2021 03:00:46 - INFO - __main__ - Step 41196: {'lr': 0.00041835123050981476, 'samples': 7909632, 'steps': 41195, 'loss/train': 1.3635088205337524} 11/07/2021 03:00:47 - INFO - __main__ - Step 41197: {'lr': 0.00041834730733461366, 'samples': 7909824, 'steps': 41196, 'loss/train': 1.5128867626190186} 11/07/2021 03:00:47 - INFO - __main__ - Step 41198: {'lr': 0.0004183433840835578, 'samples': 7910016, 'steps': 41197, 'loss/train': 1.45900297164917} 11/07/2021 03:00:47 - INFO - __main__ - Step 41199: {'lr': 0.0004183394607566491, 'samples': 7910208, 'steps': 41198, 'loss/train': 0.9792979955673218} 11/07/2021 03:00:48 - INFO - __main__ - Step 41200: {'lr': 0.0004183355373538892, 'samples': 7910400, 'steps': 41199, 'loss/train': 0.5584649443626404} 11/07/2021 03:00:49 - INFO - __main__ - Step 41201: {'lr': 0.00041833161387527985, 'samples': 7910592, 'steps': 41200, 'loss/train': 0.9643514156341553} 11/07/2021 03:00:49 - INFO - __main__ - Step 41202: {'lr': 0.0004183276903208228, 'samples': 7910784, 'steps': 41201, 'loss/train': 1.8792955875396729} 11/07/2021 03:00:50 - INFO - __main__ - Step 41203: {'lr': 0.0004183237666905201, 'samples': 7910976, 'steps': 41202, 'loss/train': 1.084418535232544} 11/07/2021 03:00:50 - INFO - __main__ - Step 41204: {'lr': 0.0004183198429843732, 'samples': 7911168, 'steps': 41203, 'loss/train': 1.5909727811813354} 11/07/2021 03:00:50 - INFO - __main__ - Step 41205: {'lr': 0.00041831591920238396, 'samples': 7911360, 'steps': 41204, 'loss/train': 1.7619580030441284} 11/07/2021 03:00:51 - INFO - __main__ - Step 41206: {'lr': 0.0004183119953445542, 'samples': 7911552, 'steps': 41205, 'loss/train': 1.2859710454940796} 11/07/2021 03:00:52 - INFO - __main__ - Step 41207: {'lr': 0.00041830807141088566, 'samples': 7911744, 'steps': 41206, 'loss/train': 1.5500056743621826} 11/07/2021 03:00:52 - INFO - __main__ - Step 41208: {'lr': 0.0004183041474013801, 'samples': 7911936, 'steps': 41207, 'loss/train': 1.6253983974456787} 11/07/2021 03:00:52 - INFO - __main__ - Step 41209: {'lr': 0.00041830022331603925, 'samples': 7912128, 'steps': 41208, 'loss/train': 1.482258915901184} 11/07/2021 03:00:53 - INFO - __main__ - Step 41210: {'lr': 0.000418296299154865, 'samples': 7912320, 'steps': 41209, 'loss/train': 1.3288849592208862} 11/07/2021 03:00:54 - INFO - __main__ - Step 41211: {'lr': 0.000418292374917859, 'samples': 7912512, 'steps': 41210, 'loss/train': 1.7064313888549805} 11/07/2021 03:00:54 - INFO - __main__ - Step 41212: {'lr': 0.00041828845060502297, 'samples': 7912704, 'steps': 41211, 'loss/train': 1.5797468423843384} 11/07/2021 03:00:55 - INFO - __main__ - Step 41213: {'lr': 0.00041828452621635884, 'samples': 7912896, 'steps': 41212, 'loss/train': 0.6790094375610352} 11/07/2021 03:00:55 - INFO - __main__ - Step 41214: {'lr': 0.0004182806017518682, 'samples': 7913088, 'steps': 41213, 'loss/train': 1.4418489933013916} 11/07/2021 03:00:55 - INFO - __main__ - Step 41215: {'lr': 0.00041827667721155303, 'samples': 7913280, 'steps': 41214, 'loss/train': 2.006834030151367} 11/07/2021 03:00:57 - INFO - __main__ - Step 41216: {'lr': 0.000418272752595415, 'samples': 7913472, 'steps': 41215, 'loss/train': 1.6815943717956543} 11/07/2021 03:00:57 - INFO - __main__ - Step 41217: {'lr': 0.00041826882790345577, 'samples': 7913664, 'steps': 41216, 'loss/train': 1.343355417251587} 11/07/2021 03:00:58 - INFO - __main__ - Step 41218: {'lr': 0.00041826490313567725, 'samples': 7913856, 'steps': 41217, 'loss/train': 1.6229034662246704} 11/07/2021 03:00:58 - INFO - __main__ - Step 41219: {'lr': 0.0004182609782920812, 'samples': 7914048, 'steps': 41218, 'loss/train': 1.9552432298660278} 11/07/2021 03:00:58 - INFO - __main__ - Step 41220: {'lr': 0.0004182570533726693, 'samples': 7914240, 'steps': 41219, 'loss/train': 1.3559013605117798} 11/07/2021 03:00:59 - INFO - __main__ - Step 41221: {'lr': 0.00041825312837744333, 'samples': 7914432, 'steps': 41220, 'loss/train': 1.5077816247940063} 11/07/2021 03:01:00 - INFO - __main__ - Step 41222: {'lr': 0.00041824920330640517, 'samples': 7914624, 'steps': 41221, 'loss/train': 0.24105438590049744} 11/07/2021 03:01:00 - INFO - __main__ - Step 41223: {'lr': 0.0004182452781595565, 'samples': 7914816, 'steps': 41222, 'loss/train': 1.7922337055206299} 11/07/2021 03:01:00 - INFO - __main__ - Step 41224: {'lr': 0.0004182413529368991, 'samples': 7915008, 'steps': 41223, 'loss/train': 1.418586015701294} 11/07/2021 03:01:01 - INFO - __main__ - Step 41225: {'lr': 0.0004182374276384347, 'samples': 7915200, 'steps': 41224, 'loss/train': 1.4558976888656616} 11/07/2021 03:01:01 - INFO - __main__ - Step 41226: {'lr': 0.0004182335022641651, 'samples': 7915392, 'steps': 41225, 'loss/train': 1.558049201965332} 11/07/2021 03:01:02 - INFO - __main__ - Step 41227: {'lr': 0.00041822957681409215, 'samples': 7915584, 'steps': 41226, 'loss/train': 1.8619894981384277} 11/07/2021 03:01:03 - INFO - __main__ - Step 41228: {'lr': 0.00041822565128821757, 'samples': 7915776, 'steps': 41227, 'loss/train': 1.5256032943725586} 11/07/2021 03:01:03 - INFO - __main__ - Step 41229: {'lr': 0.00041822172568654306, 'samples': 7915968, 'steps': 41228, 'loss/train': 1.7781447172164917} 11/07/2021 03:01:03 - INFO - __main__ - Step 41230: {'lr': 0.0004182178000090704, 'samples': 7916160, 'steps': 41229, 'loss/train': 1.4191579818725586} 11/07/2021 03:01:04 - INFO - __main__ - Step 41231: {'lr': 0.0004182138742558015, 'samples': 7916352, 'steps': 41230, 'loss/train': 1.6129021644592285} 11/07/2021 03:01:04 - INFO - __main__ - Step 41232: {'lr': 0.00041820994842673787, 'samples': 7916544, 'steps': 41231, 'loss/train': 1.6607749462127686} 11/07/2021 03:01:05 - INFO - __main__ - Step 41233: {'lr': 0.00041820602252188156, 'samples': 7916736, 'steps': 41232, 'loss/train': 1.9180601835250854} 11/07/2021 03:01:05 - INFO - __main__ - Step 41234: {'lr': 0.00041820209654123416, 'samples': 7916928, 'steps': 41233, 'loss/train': 1.4475653171539307} 11/07/2021 03:01:06 - INFO - __main__ - Step 41235: {'lr': 0.00041819817048479745, 'samples': 7917120, 'steps': 41234, 'loss/train': 4.331748008728027} 11/07/2021 03:01:06 - INFO - __main__ - Step 41236: {'lr': 0.0004181942443525734, 'samples': 7917312, 'steps': 41235, 'loss/train': 1.290693759918213} 11/07/2021 03:01:06 - INFO - __main__ - Step 41237: {'lr': 0.00041819031814456346, 'samples': 7917504, 'steps': 41236, 'loss/train': 1.7088333368301392} 11/07/2021 03:01:08 - INFO - __main__ - Step 41238: {'lr': 0.0004181863918607696, 'samples': 7917696, 'steps': 41237, 'loss/train': 1.9867417812347412} 11/07/2021 03:01:08 - INFO - __main__ - Step 41239: {'lr': 0.00041818246550119354, 'samples': 7917888, 'steps': 41238, 'loss/train': 1.6488137245178223} 11/07/2021 03:01:09 - INFO - __main__ - Step 41240: {'lr': 0.00041817853906583706, 'samples': 7918080, 'steps': 41239, 'loss/train': 1.6615079641342163} 11/07/2021 03:01:09 - INFO - __main__ - Step 41241: {'lr': 0.000418174612554702, 'samples': 7918272, 'steps': 41240, 'loss/train': 1.684520959854126} 11/07/2021 03:01:09 - INFO - __main__ - Step 41242: {'lr': 0.00041817068596778994, 'samples': 7918464, 'steps': 41241, 'loss/train': 1.7856652736663818} 11/07/2021 03:01:10 - INFO - __main__ - Step 41243: {'lr': 0.0004181667593051028, 'samples': 7918656, 'steps': 41242, 'loss/train': 1.7953649759292603} 11/07/2021 03:01:10 - INFO - __main__ - Step 41244: {'lr': 0.0004181628325666424, 'samples': 7918848, 'steps': 41243, 'loss/train': 1.7734460830688477} 11/07/2021 03:01:11 - INFO - __main__ - Step 41245: {'lr': 0.0004181589057524103, 'samples': 7919040, 'steps': 41244, 'loss/train': 1.22751784324646} 11/07/2021 03:01:11 - INFO - __main__ - Step 41246: {'lr': 0.0004181549788624085, 'samples': 7919232, 'steps': 41245, 'loss/train': 1.4137510061264038} 11/07/2021 03:01:12 - INFO - __main__ - Step 41247: {'lr': 0.0004181510518966386, 'samples': 7919424, 'steps': 41246, 'loss/train': 1.210434913635254} 11/07/2021 03:01:12 - INFO - __main__ - Step 41248: {'lr': 0.00041814712485510245, 'samples': 7919616, 'steps': 41247, 'loss/train': 0.9829598665237427} 11/07/2021 03:01:13 - INFO - __main__ - Step 41249: {'lr': 0.0004181431977378017, 'samples': 7919808, 'steps': 41248, 'loss/train': 1.560865879058838} 11/07/2021 03:01:14 - INFO - __main__ - Step 41250: {'lr': 0.00041813927054473835, 'samples': 7920000, 'steps': 41249, 'loss/train': 1.2396860122680664} 11/07/2021 03:01:14 - INFO - __main__ - Step 41251: {'lr': 0.000418135343275914, 'samples': 7920192, 'steps': 41250, 'loss/train': 1.5683566331863403} 11/07/2021 03:01:14 - INFO - __main__ - Step 41252: {'lr': 0.0004181314159313305, 'samples': 7920384, 'steps': 41251, 'loss/train': 1.6124900579452515} 11/07/2021 03:01:15 - INFO - __main__ - Step 41253: {'lr': 0.0004181274885109895, 'samples': 7920576, 'steps': 41252, 'loss/train': 0.991035521030426} 11/07/2021 03:01:15 - INFO - __main__ - Step 41254: {'lr': 0.0004181235610148929, 'samples': 7920768, 'steps': 41253, 'loss/train': 1.6612294912338257} 11/07/2021 03:01:16 - INFO - __main__ - Step 41255: {'lr': 0.0004181196334430424, 'samples': 7920960, 'steps': 41254, 'loss/train': 1.5381064414978027} 11/07/2021 03:01:16 - INFO - __main__ - Step 41256: {'lr': 0.00041811570579543977, 'samples': 7921152, 'steps': 41255, 'loss/train': 1.5637043714523315} 11/07/2021 03:01:17 - INFO - __main__ - Step 41257: {'lr': 0.0004181117780720868, 'samples': 7921344, 'steps': 41256, 'loss/train': 1.1258245706558228} 11/07/2021 03:01:17 - INFO - __main__ - Step 41258: {'lr': 0.00041810785027298524, 'samples': 7921536, 'steps': 41257, 'loss/train': 1.58634352684021} 11/07/2021 03:01:17 - INFO - __main__ - Step 41259: {'lr': 0.00041810392239813695, 'samples': 7921728, 'steps': 41258, 'loss/train': 1.0692466497421265} 11/07/2021 03:01:18 - INFO - __main__ - Step 41260: {'lr': 0.00041809999444754353, 'samples': 7921920, 'steps': 41259, 'loss/train': 2.233379602432251} 11/07/2021 03:01:19 - INFO - __main__ - Step 41261: {'lr': 0.0004180960664212069, 'samples': 7922112, 'steps': 41260, 'loss/train': 1.564186453819275} 11/07/2021 03:01:19 - INFO - __main__ - Step 41262: {'lr': 0.00041809213831912884, 'samples': 7922304, 'steps': 41261, 'loss/train': 1.601365089416504} 11/07/2021 03:01:19 - INFO - __main__ - Step 41263: {'lr': 0.0004180882101413109, 'samples': 7922496, 'steps': 41262, 'loss/train': 1.3063501119613647} 11/07/2021 03:01:20 - INFO - __main__ - Step 41264: {'lr': 0.00041808428188775515, 'samples': 7922688, 'steps': 41263, 'loss/train': 1.4315458536148071} 11/07/2021 03:01:20 - INFO - __main__ - Step 41265: {'lr': 0.0004180803535584632, 'samples': 7922880, 'steps': 41264, 'loss/train': 1.7812724113464355} 11/07/2021 03:01:21 - INFO - __main__ - Step 41266: {'lr': 0.0004180764251534368, 'samples': 7923072, 'steps': 41265, 'loss/train': 1.3440953493118286} 11/07/2021 03:01:22 - INFO - __main__ - Step 41267: {'lr': 0.0004180724966726778, 'samples': 7923264, 'steps': 41266, 'loss/train': 1.538791537284851} 11/07/2021 03:01:22 - INFO - __main__ - Step 41268: {'lr': 0.00041806856811618784, 'samples': 7923456, 'steps': 41267, 'loss/train': 1.7029993534088135} 11/07/2021 03:01:22 - INFO - __main__ - Step 41269: {'lr': 0.00041806463948396876, 'samples': 7923648, 'steps': 41268, 'loss/train': 1.508406639099121} 11/07/2021 03:01:23 - INFO - __main__ - Step 41270: {'lr': 0.0004180607107760225, 'samples': 7923840, 'steps': 41269, 'loss/train': 1.4010753631591797} 11/07/2021 03:01:24 - INFO - __main__ - Step 41271: {'lr': 0.0004180567819923505, 'samples': 7924032, 'steps': 41270, 'loss/train': 1.4275845289230347} 11/07/2021 03:01:24 - INFO - __main__ - Step 41272: {'lr': 0.0004180528531329548, 'samples': 7924224, 'steps': 41271, 'loss/train': 1.6024832725524902} 11/07/2021 03:01:24 - INFO - __main__ - Step 41273: {'lr': 0.00041804892419783715, 'samples': 7924416, 'steps': 41272, 'loss/train': 1.3015625476837158} 11/07/2021 03:01:25 - INFO - __main__ - Step 41274: {'lr': 0.0004180449951869991, 'samples': 7924608, 'steps': 41273, 'loss/train': 1.3946226835250854} 11/07/2021 03:01:25 - INFO - __main__ - Step 41275: {'lr': 0.00041804106610044263, 'samples': 7924800, 'steps': 41274, 'loss/train': 1.509242296218872} 11/07/2021 03:01:26 - INFO - __main__ - Step 41276: {'lr': 0.00041803713693816947, 'samples': 7924992, 'steps': 41275, 'loss/train': 1.2228401899337769} 11/07/2021 03:01:26 - INFO - __main__ - Step 41277: {'lr': 0.0004180332077001814, 'samples': 7925184, 'steps': 41276, 'loss/train': 1.587908148765564} 11/07/2021 03:01:27 - INFO - __main__ - Step 41278: {'lr': 0.0004180292783864801, 'samples': 7925376, 'steps': 41277, 'loss/train': 1.5843009948730469} 11/07/2021 03:01:27 - INFO - __main__ - Step 41279: {'lr': 0.00041802534899706734, 'samples': 7925568, 'steps': 41278, 'loss/train': 1.3490830659866333} 11/07/2021 03:01:27 - INFO - __main__ - Step 41280: {'lr': 0.0004180214195319451, 'samples': 7925760, 'steps': 41279, 'loss/train': 1.0151638984680176} 11/07/2021 03:01:28 - INFO - __main__ - Step 41281: {'lr': 0.00041801748999111487, 'samples': 7925952, 'steps': 41280, 'loss/train': 2.370021343231201} 11/07/2021 03:01:29 - INFO - __main__ - Step 41282: {'lr': 0.0004180135603745786, 'samples': 7926144, 'steps': 41281, 'loss/train': 1.8114560842514038} 11/07/2021 03:01:29 - INFO - __main__ - Step 41283: {'lr': 0.000418009630682338, 'samples': 7926336, 'steps': 41282, 'loss/train': 1.4603055715560913} 11/07/2021 03:01:29 - INFO - __main__ - Step 41284: {'lr': 0.00041800570091439493, 'samples': 7926528, 'steps': 41283, 'loss/train': 1.5822771787643433} 11/07/2021 03:01:30 - INFO - __main__ - Step 41285: {'lr': 0.000418001771070751, 'samples': 7926720, 'steps': 41284, 'loss/train': 1.1700141429901123} 11/07/2021 03:01:31 - INFO - __main__ - Step 41286: {'lr': 0.0004179978411514081, 'samples': 7926912, 'steps': 41285, 'loss/train': 1.5586498975753784} 11/07/2021 03:01:31 - INFO - __main__ - Step 41287: {'lr': 0.000417993911156368, 'samples': 7927104, 'steps': 41286, 'loss/train': 1.3684496879577637} 11/07/2021 03:01:32 - INFO - __main__ - Step 41288: {'lr': 0.00041798998108563234, 'samples': 7927296, 'steps': 41287, 'loss/train': 1.536394476890564} 11/07/2021 03:01:32 - INFO - __main__ - Step 41289: {'lr': 0.00041798605093920307, 'samples': 7927488, 'steps': 41288, 'loss/train': 1.71012544631958} 11/07/2021 03:01:32 - INFO - __main__ - Step 41290: {'lr': 0.00041798212071708185, 'samples': 7927680, 'steps': 41289, 'loss/train': 1.1121327877044678} 11/07/2021 03:01:33 - INFO - __main__ - Step 41291: {'lr': 0.0004179781904192704, 'samples': 7927872, 'steps': 41290, 'loss/train': 1.5258047580718994} 11/07/2021 03:01:34 - INFO - __main__ - Step 41292: {'lr': 0.00041797426004577066, 'samples': 7928064, 'steps': 41291, 'loss/train': 1.2911633253097534} 11/07/2021 03:01:34 - INFO - __main__ - Step 41293: {'lr': 0.00041797032959658433, 'samples': 7928256, 'steps': 41292, 'loss/train': 1.5362772941589355} 11/07/2021 03:01:34 - INFO - __main__ - Step 41294: {'lr': 0.0004179663990717131, 'samples': 7928448, 'steps': 41293, 'loss/train': 1.5473556518554688} 11/07/2021 03:01:35 - INFO - __main__ - Step 41295: {'lr': 0.0004179624684711588, 'samples': 7928640, 'steps': 41294, 'loss/train': 1.390834093093872} 11/07/2021 03:01:35 - INFO - __main__ - Step 41296: {'lr': 0.0004179585377949232, 'samples': 7928832, 'steps': 41295, 'loss/train': 1.8194223642349243} 11/07/2021 03:01:36 - INFO - __main__ - Step 41297: {'lr': 0.0004179546070430082, 'samples': 7929024, 'steps': 41296, 'loss/train': 1.2560080289840698} 11/07/2021 03:01:37 - INFO - __main__ - Step 41298: {'lr': 0.0004179506762154153, 'samples': 7929216, 'steps': 41297, 'loss/train': 1.2087020874023438} 11/07/2021 03:01:37 - INFO - __main__ - Step 41299: {'lr': 0.0004179467453121465, 'samples': 7929408, 'steps': 41298, 'loss/train': 1.3658021688461304} 11/07/2021 03:01:37 - INFO - __main__ - Step 41300: {'lr': 0.0004179428143332035, 'samples': 7929600, 'steps': 41299, 'loss/train': 1.672145128250122} 11/07/2021 03:01:38 - INFO - __main__ - Step 41301: {'lr': 0.000417938883278588, 'samples': 7929792, 'steps': 41300, 'loss/train': 1.7102265357971191} 11/07/2021 03:01:39 - INFO - __main__ - Step 41302: {'lr': 0.0004179349521483018, 'samples': 7929984, 'steps': 41301, 'loss/train': 1.1674959659576416} 11/07/2021 03:01:39 - INFO - __main__ - Step 41303: {'lr': 0.00041793102094234673, 'samples': 7930176, 'steps': 41302, 'loss/train': 1.4597300291061401} 11/07/2021 03:01:39 - INFO - __main__ - Step 41304: {'lr': 0.00041792708966072455, 'samples': 7930368, 'steps': 41303, 'loss/train': 1.5141634941101074} 11/07/2021 03:01:40 - INFO - __main__ - Step 41305: {'lr': 0.0004179231583034371, 'samples': 7930560, 'steps': 41304, 'loss/train': 1.7141952514648438} 11/07/2021 03:01:40 - INFO - __main__ - Step 41306: {'lr': 0.0004179192268704859, 'samples': 7930752, 'steps': 41305, 'loss/train': 1.4723738431930542} 11/07/2021 03:01:41 - INFO - __main__ - Step 41307: {'lr': 0.000417915295361873, 'samples': 7930944, 'steps': 41306, 'loss/train': 1.3933378458023071} 11/07/2021 03:01:42 - INFO - __main__ - Step 41308: {'lr': 0.0004179113637776, 'samples': 7931136, 'steps': 41307, 'loss/train': 1.2595709562301636} 11/07/2021 03:01:42 - INFO - __main__ - Step 41309: {'lr': 0.0004179074321176688, 'samples': 7931328, 'steps': 41308, 'loss/train': 1.5110349655151367} 11/07/2021 03:01:42 - INFO - __main__ - Step 41310: {'lr': 0.000417903500382081, 'samples': 7931520, 'steps': 41309, 'loss/train': 1.3963290452957153} 11/07/2021 03:01:43 - INFO - __main__ - Step 41311: {'lr': 0.00041789956857083853, 'samples': 7931712, 'steps': 41310, 'loss/train': 1.7429158687591553} 11/07/2021 03:01:44 - INFO - __main__ - Step 41312: {'lr': 0.00041789563668394314, 'samples': 7931904, 'steps': 41311, 'loss/train': 1.4432523250579834} 11/07/2021 03:01:44 - INFO - __main__ - Step 41313: {'lr': 0.0004178917047213965, 'samples': 7932096, 'steps': 41312, 'loss/train': 1.3484480381011963} 11/07/2021 03:01:44 - INFO - __main__ - Step 41314: {'lr': 0.00041788777268320055, 'samples': 7932288, 'steps': 41313, 'loss/train': 1.684675693511963} 11/07/2021 03:01:45 - INFO - __main__ - Step 41315: {'lr': 0.00041788384056935693, 'samples': 7932480, 'steps': 41314, 'loss/train': 1.4484103918075562} 11/07/2021 03:01:45 - INFO - __main__ - Step 41316: {'lr': 0.0004178799083798673, 'samples': 7932672, 'steps': 41315, 'loss/train': 1.47584068775177} 11/07/2021 03:01:45 - INFO - __main__ - Step 41317: {'lr': 0.00041787597611473375, 'samples': 7932864, 'steps': 41316, 'loss/train': 1.2993026971817017} 11/07/2021 03:01:46 - INFO - __main__ - Step 41318: {'lr': 0.00041787204377395783, 'samples': 7933056, 'steps': 41317, 'loss/train': 1.1744214296340942} 11/07/2021 03:01:47 - INFO - __main__ - Step 41319: {'lr': 0.0004178681113575413, 'samples': 7933248, 'steps': 41318, 'loss/train': 1.6770493984222412} 11/07/2021 03:01:47 - INFO - __main__ - Step 41320: {'lr': 0.00041786417886548606, 'samples': 7933440, 'steps': 41319, 'loss/train': 1.511395812034607} 11/07/2021 03:01:48 - INFO - __main__ - Step 41321: {'lr': 0.0004178602462977937, 'samples': 7933632, 'steps': 41320, 'loss/train': 1.0146536827087402} 11/07/2021 03:01:48 - INFO - __main__ - Step 41322: {'lr': 0.0004178563136544662, 'samples': 7933824, 'steps': 41321, 'loss/train': 1.5325267314910889} 11/07/2021 03:01:49 - INFO - __main__ - Step 41323: {'lr': 0.0004178523809355053, 'samples': 7934016, 'steps': 41322, 'loss/train': 1.431384801864624} 11/07/2021 03:01:49 - INFO - __main__ - Step 41324: {'lr': 0.00041784844814091263, 'samples': 7934208, 'steps': 41323, 'loss/train': 1.2008589506149292} 11/07/2021 03:01:50 - INFO - __main__ - Step 41325: {'lr': 0.00041784451527069, 'samples': 7934400, 'steps': 41324, 'loss/train': 1.5073442459106445} 11/07/2021 03:01:50 - INFO - __main__ - Step 41326: {'lr': 0.0004178405823248392, 'samples': 7934592, 'steps': 41325, 'loss/train': 1.7020225524902344} 11/07/2021 03:01:50 - INFO - __main__ - Step 41327: {'lr': 0.0004178366493033621, 'samples': 7934784, 'steps': 41326, 'loss/train': 1.369020700454712} 11/07/2021 03:01:52 - INFO - __main__ - Step 41328: {'lr': 0.0004178327162062604, 'samples': 7934976, 'steps': 41327, 'loss/train': 0.5613696575164795} 11/07/2021 03:01:52 - INFO - __main__ - Step 41329: {'lr': 0.00041782878303353577, 'samples': 7935168, 'steps': 41328, 'loss/train': 1.6528732776641846} 11/07/2021 03:01:52 - INFO - __main__ - Step 41330: {'lr': 0.0004178248497851902, 'samples': 7935360, 'steps': 41329, 'loss/train': 1.4104169607162476} 11/07/2021 03:01:53 - INFO - __main__ - Step 41331: {'lr': 0.00041782091646122533, 'samples': 7935552, 'steps': 41330, 'loss/train': 1.3062204122543335} 11/07/2021 03:01:53 - INFO - __main__ - Step 41332: {'lr': 0.00041781698306164283, 'samples': 7935744, 'steps': 41331, 'loss/train': 1.29737389087677} 11/07/2021 03:01:54 - INFO - __main__ - Step 41333: {'lr': 0.0004178130495864447, 'samples': 7935936, 'steps': 41332, 'loss/train': 1.2845706939697266} 11/07/2021 03:01:55 - INFO - __main__ - Step 41334: {'lr': 0.00041780911603563254, 'samples': 7936128, 'steps': 41333, 'loss/train': 1.6035759449005127} 11/07/2021 03:01:55 - INFO - __main__ - Step 41335: {'lr': 0.00041780518240920817, 'samples': 7936320, 'steps': 41334, 'loss/train': 1.7680363655090332} 11/07/2021 03:01:55 - INFO - __main__ - Step 41336: {'lr': 0.0004178012487071734, 'samples': 7936512, 'steps': 41335, 'loss/train': 1.8347069025039673} 11/07/2021 03:01:56 - INFO - __main__ - Step 41337: {'lr': 0.00041779731492953, 'samples': 7936704, 'steps': 41336, 'loss/train': 1.4897397756576538} 11/07/2021 03:01:56 - INFO - __main__ - Step 41338: {'lr': 0.0004177933810762797, 'samples': 7936896, 'steps': 41337, 'loss/train': 1.3531194925308228} 11/07/2021 03:01:57 - INFO - __main__ - Step 41339: {'lr': 0.00041778944714742435, 'samples': 7937088, 'steps': 41338, 'loss/train': 1.9024698734283447} 11/07/2021 03:01:59 - INFO - __main__ - Step 41340: {'lr': 0.00041778551314296556, 'samples': 7937280, 'steps': 41339, 'loss/train': 1.5987845659255981} 11/07/2021 03:01:59 - INFO - __main__ - Step 41341: {'lr': 0.00041778157906290525, 'samples': 7937472, 'steps': 41340, 'loss/train': 1.0938655138015747} 11/07/2021 03:02:00 - INFO - __main__ - Step 41342: {'lr': 0.00041777764490724515, 'samples': 7937664, 'steps': 41341, 'loss/train': 1.311763048171997} 11/07/2021 03:02:00 - INFO - __main__ - Step 41343: {'lr': 0.00041777371067598705, 'samples': 7937856, 'steps': 41342, 'loss/train': 1.4340498447418213} 11/07/2021 03:02:00 - INFO - __main__ - Step 41344: {'lr': 0.00041776977636913274, 'samples': 7938048, 'steps': 41343, 'loss/train': 1.6654342412948608} 11/07/2021 03:02:01 - INFO - __main__ - Step 41345: {'lr': 0.0004177658419866839, 'samples': 7938240, 'steps': 41344, 'loss/train': 1.9490320682525635} 11/07/2021 03:02:01 - INFO - __main__ - Step 41346: {'lr': 0.0004177619075286424, 'samples': 7938432, 'steps': 41345, 'loss/train': 1.9665720462799072} 11/07/2021 03:02:01 - INFO - __main__ - Step 41347: {'lr': 0.00041775797299500997, 'samples': 7938624, 'steps': 41346, 'loss/train': 1.7433401346206665} 11/07/2021 03:02:02 - INFO - __main__ - Step 41348: {'lr': 0.0004177540383857883, 'samples': 7938816, 'steps': 41347, 'loss/train': 1.7861980199813843} 11/07/2021 03:02:03 - INFO - __main__ - Step 41349: {'lr': 0.0004177501037009793, 'samples': 7939008, 'steps': 41348, 'loss/train': 1.8376537561416626} 11/07/2021 03:02:03 - INFO - __main__ - Step 41350: {'lr': 0.0004177461689405847, 'samples': 7939200, 'steps': 41349, 'loss/train': 1.8534979820251465} 11/07/2021 03:02:03 - INFO - __main__ - Step 41351: {'lr': 0.00041774223410460633, 'samples': 7939392, 'steps': 41350, 'loss/train': 1.8638612031936646} 11/07/2021 03:02:04 - INFO - __main__ - Step 41352: {'lr': 0.00041773829919304584, 'samples': 7939584, 'steps': 41351, 'loss/train': 1.4559210538864136} 11/07/2021 03:02:05 - INFO - __main__ - Step 41353: {'lr': 0.000417734364205905, 'samples': 7939776, 'steps': 41352, 'loss/train': 1.5426164865493774} 11/07/2021 03:02:05 - INFO - __main__ - Step 41354: {'lr': 0.0004177304291431857, 'samples': 7939968, 'steps': 41353, 'loss/train': 0.13154828548431396} 11/07/2021 03:02:05 - INFO - __main__ - Step 41355: {'lr': 0.00041772649400488967, 'samples': 7940160, 'steps': 41354, 'loss/train': 1.3842164278030396} 11/07/2021 03:02:06 - INFO - __main__ - Step 41356: {'lr': 0.0004177225587910186, 'samples': 7940352, 'steps': 41355, 'loss/train': 1.2309932708740234} 11/07/2021 03:02:06 - INFO - __main__ - Step 41357: {'lr': 0.0004177186235015744, 'samples': 7940544, 'steps': 41356, 'loss/train': 1.4589163064956665} 11/07/2021 03:02:07 - INFO - __main__ - Step 41358: {'lr': 0.0004177146881365588, 'samples': 7940736, 'steps': 41357, 'loss/train': 1.6069822311401367} 11/07/2021 03:02:07 - INFO - __main__ - Step 41359: {'lr': 0.00041771075269597354, 'samples': 7940928, 'steps': 41358, 'loss/train': 0.9026435017585754} 11/07/2021 03:02:08 - INFO - __main__ - Step 41360: {'lr': 0.0004177068171798204, 'samples': 7941120, 'steps': 41359, 'loss/train': 1.3894383907318115} 11/07/2021 03:02:08 - INFO - __main__ - Step 41361: {'lr': 0.0004177028815881011, 'samples': 7941312, 'steps': 41360, 'loss/train': 1.4144057035446167} 11/07/2021 03:02:08 - INFO - __main__ - Step 41362: {'lr': 0.00041769894592081746, 'samples': 7941504, 'steps': 41361, 'loss/train': 1.4857261180877686} 11/07/2021 03:02:10 - INFO - __main__ - Step 41363: {'lr': 0.0004176950101779713, 'samples': 7941696, 'steps': 41362, 'loss/train': 1.717484474182129} 11/07/2021 03:02:10 - INFO - __main__ - Step 41364: {'lr': 0.00041769107435956444, 'samples': 7941888, 'steps': 41363, 'loss/train': 1.5446809530258179} 11/07/2021 03:02:10 - INFO - __main__ - Step 41365: {'lr': 0.00041768713846559844, 'samples': 7942080, 'steps': 41364, 'loss/train': 1.936703085899353} 11/07/2021 03:02:11 - INFO - __main__ - Step 41366: {'lr': 0.00041768320249607527, 'samples': 7942272, 'steps': 41365, 'loss/train': 0.5804887413978577} 11/07/2021 03:02:11 - INFO - __main__ - Step 41367: {'lr': 0.00041767926645099664, 'samples': 7942464, 'steps': 41366, 'loss/train': 1.1168676614761353} 11/07/2021 03:02:12 - INFO - __main__ - Step 41368: {'lr': 0.00041767533033036425, 'samples': 7942656, 'steps': 41367, 'loss/train': 1.2331725358963013} 11/07/2021 03:02:12 - INFO - __main__ - Step 41369: {'lr': 0.00041767139413418, 'samples': 7942848, 'steps': 41368, 'loss/train': 1.3584792613983154} 11/07/2021 03:02:13 - INFO - __main__ - Step 41370: {'lr': 0.00041766745786244564, 'samples': 7943040, 'steps': 41369, 'loss/train': 1.856892704963684} 11/07/2021 03:02:13 - INFO - __main__ - Step 41371: {'lr': 0.00041766352151516284, 'samples': 7943232, 'steps': 41370, 'loss/train': 1.2535980939865112} 11/07/2021 03:02:13 - INFO - __main__ - Step 41372: {'lr': 0.0004176595850923335, 'samples': 7943424, 'steps': 41371, 'loss/train': 1.2640399932861328} 11/07/2021 03:02:14 - INFO - __main__ - Step 41373: {'lr': 0.0004176556485939593, 'samples': 7943616, 'steps': 41372, 'loss/train': 1.373843789100647} 11/07/2021 03:02:15 - INFO - __main__ - Step 41374: {'lr': 0.00041765171202004205, 'samples': 7943808, 'steps': 41373, 'loss/train': 1.3820745944976807} 11/07/2021 03:02:15 - INFO - __main__ - Step 41375: {'lr': 0.00041764777537058354, 'samples': 7944000, 'steps': 41374, 'loss/train': 1.7117164134979248} 11/07/2021 03:02:16 - INFO - __main__ - Step 41376: {'lr': 0.0004176438386455855, 'samples': 7944192, 'steps': 41375, 'loss/train': 0.868809163570404} 11/07/2021 03:02:16 - INFO - __main__ - Step 41377: {'lr': 0.00041763990184504984, 'samples': 7944384, 'steps': 41376, 'loss/train': 1.7543200254440308} 11/07/2021 03:02:16 - INFO - __main__ - Step 41378: {'lr': 0.00041763596496897817, 'samples': 7944576, 'steps': 41377, 'loss/train': 1.489722728729248} 11/07/2021 03:02:17 - INFO - __main__ - Step 41379: {'lr': 0.00041763202801737225, 'samples': 7944768, 'steps': 41378, 'loss/train': 1.531445860862732} 11/07/2021 03:02:18 - INFO - __main__ - Step 41380: {'lr': 0.00041762809099023403, 'samples': 7944960, 'steps': 41379, 'loss/train': 1.4082839488983154} 11/07/2021 03:02:18 - INFO - __main__ - Step 41381: {'lr': 0.00041762415388756514, 'samples': 7945152, 'steps': 41380, 'loss/train': 1.53441321849823} 11/07/2021 03:02:18 - INFO - __main__ - Step 41382: {'lr': 0.00041762021670936736, 'samples': 7945344, 'steps': 41381, 'loss/train': 1.8188769817352295} 11/07/2021 03:02:19 - INFO - __main__ - Step 41383: {'lr': 0.0004176162794556425, 'samples': 7945536, 'steps': 41382, 'loss/train': 1.631960391998291} 11/07/2021 03:02:20 - INFO - __main__ - Step 41384: {'lr': 0.0004176123421263923, 'samples': 7945728, 'steps': 41383, 'loss/train': 1.130538821220398} 11/07/2021 03:02:20 - INFO - __main__ - Step 41385: {'lr': 0.00041760840472161866, 'samples': 7945920, 'steps': 41384, 'loss/train': 1.5905122756958008} 11/07/2021 03:02:20 - INFO - __main__ - Step 41386: {'lr': 0.0004176044672413232, 'samples': 7946112, 'steps': 41385, 'loss/train': 1.7300176620483398} 11/07/2021 03:02:21 - INFO - __main__ - Step 41387: {'lr': 0.00041760052968550776, 'samples': 7946304, 'steps': 41386, 'loss/train': 1.1613948345184326} 11/07/2021 03:02:21 - INFO - __main__ - Step 41388: {'lr': 0.0004175965920541741, 'samples': 7946496, 'steps': 41387, 'loss/train': 1.0157157182693481} 11/07/2021 03:02:22 - INFO - __main__ - Step 41389: {'lr': 0.00041759265434732404, 'samples': 7946688, 'steps': 41388, 'loss/train': 0.9334609508514404} 11/07/2021 03:02:22 - INFO - __main__ - Step 41390: {'lr': 0.00041758871656495927, 'samples': 7946880, 'steps': 41389, 'loss/train': 0.7453952431678772} 11/07/2021 03:02:23 - INFO - __main__ - Step 41391: {'lr': 0.00041758477870708165, 'samples': 7947072, 'steps': 41390, 'loss/train': 1.6157008409500122} 11/07/2021 03:02:23 - INFO - __main__ - Step 41392: {'lr': 0.0004175808407736929, 'samples': 7947264, 'steps': 41391, 'loss/train': 0.9999271035194397} 11/07/2021 03:02:23 - INFO - __main__ - Step 41393: {'lr': 0.00041757690276479474, 'samples': 7947456, 'steps': 41392, 'loss/train': 1.3824840784072876} 11/07/2021 03:02:25 - INFO - __main__ - Step 41394: {'lr': 0.0004175729646803891, 'samples': 7947648, 'steps': 41393, 'loss/train': 1.6060587167739868} 11/07/2021 03:02:25 - INFO - __main__ - Step 41395: {'lr': 0.00041756902652047767, 'samples': 7947840, 'steps': 41394, 'loss/train': 1.4517266750335693} 11/07/2021 03:02:25 - INFO - __main__ - Step 41396: {'lr': 0.0004175650882850622, 'samples': 7948032, 'steps': 41395, 'loss/train': 0.9659159183502197} 11/07/2021 03:02:26 - INFO - __main__ - Step 41397: {'lr': 0.0004175611499741445, 'samples': 7948224, 'steps': 41396, 'loss/train': 1.5996308326721191} 11/07/2021 03:02:26 - INFO - __main__ - Step 41398: {'lr': 0.00041755721158772633, 'samples': 7948416, 'steps': 41397, 'loss/train': 0.12279503047466278} 11/07/2021 03:02:27 - INFO - __main__ - Step 41399: {'lr': 0.00041755327312580944, 'samples': 7948608, 'steps': 41398, 'loss/train': 1.602047324180603} 11/07/2021 03:02:27 - INFO - __main__ - Step 41400: {'lr': 0.0004175493345883956, 'samples': 7948800, 'steps': 41399, 'loss/train': 1.2394378185272217} 11/07/2021 03:02:28 - INFO - __main__ - Step 41401: {'lr': 0.0004175453959754867, 'samples': 7948992, 'steps': 41400, 'loss/train': 1.2604111433029175} 11/07/2021 03:02:28 - INFO - __main__ - Step 41402: {'lr': 0.00041754145728708434, 'samples': 7949184, 'steps': 41401, 'loss/train': 1.337045669555664} 11/07/2021 03:02:28 - INFO - __main__ - Step 41403: {'lr': 0.0004175375185231904, 'samples': 7949376, 'steps': 41402, 'loss/train': 1.2870851755142212} 11/07/2021 03:02:29 - INFO - __main__ - Step 41404: {'lr': 0.00041753357968380675, 'samples': 7949568, 'steps': 41403, 'loss/train': 1.5985771417617798} 11/07/2021 03:02:30 - INFO - __main__ - Step 41405: {'lr': 0.00041752964076893496, 'samples': 7949760, 'steps': 41404, 'loss/train': 1.6936683654785156} 11/07/2021 03:02:30 - INFO - __main__ - Step 41406: {'lr': 0.00041752570177857695, 'samples': 7949952, 'steps': 41405, 'loss/train': 1.5201998949050903} 11/07/2021 03:02:30 - INFO - __main__ - Step 41407: {'lr': 0.0004175217627127344, 'samples': 7950144, 'steps': 41406, 'loss/train': 1.4150969982147217} 11/07/2021 03:02:31 - INFO - __main__ - Step 41408: {'lr': 0.0004175178235714091, 'samples': 7950336, 'steps': 41407, 'loss/train': 1.580664038658142} 11/07/2021 03:02:31 - INFO - __main__ - Step 41409: {'lr': 0.0004175138843546029, 'samples': 7950528, 'steps': 41408, 'loss/train': 1.626002550125122} 11/07/2021 03:02:32 - INFO - __main__ - Step 41410: {'lr': 0.00041750994506231756, 'samples': 7950720, 'steps': 41409, 'loss/train': 0.6695911884307861} 11/07/2021 03:02:32 - INFO - __main__ - Step 41411: {'lr': 0.00041750600569455474, 'samples': 7950912, 'steps': 41410, 'loss/train': 1.4357165098190308} 11/07/2021 03:02:33 - INFO - __main__ - Step 41412: {'lr': 0.0004175020662513164, 'samples': 7951104, 'steps': 41411, 'loss/train': 1.796668291091919} 11/07/2021 03:02:33 - INFO - __main__ - Step 41413: {'lr': 0.0004174981267326041, 'samples': 7951296, 'steps': 41412, 'loss/train': 1.517921805381775} 11/07/2021 03:02:34 - INFO - __main__ - Step 41414: {'lr': 0.0004174941871384198, 'samples': 7951488, 'steps': 41413, 'loss/train': 0.9669694304466248} 11/07/2021 03:02:35 - INFO - __main__ - Step 41415: {'lr': 0.00041749024746876517, 'samples': 7951680, 'steps': 41414, 'loss/train': 1.9053404331207275} 11/07/2021 03:02:35 - INFO - __main__ - Step 41416: {'lr': 0.00041748630772364204, 'samples': 7951872, 'steps': 41415, 'loss/train': 1.4301363229751587} 11/07/2021 03:02:35 - INFO - __main__ - Step 41417: {'lr': 0.00041748236790305215, 'samples': 7952064, 'steps': 41416, 'loss/train': 1.349250078201294} 11/07/2021 03:02:36 - INFO - __main__ - Step 41418: {'lr': 0.0004174784280069973, 'samples': 7952256, 'steps': 41417, 'loss/train': 1.444665551185608} 11/07/2021 03:02:36 - INFO - __main__ - Step 41419: {'lr': 0.00041747448803547925, 'samples': 7952448, 'steps': 41418, 'loss/train': 1.7027661800384521} 11/07/2021 03:02:37 - INFO - __main__ - Step 41420: {'lr': 0.0004174705479884998, 'samples': 7952640, 'steps': 41419, 'loss/train': 0.8072425723075867} 11/07/2021 03:02:37 - INFO - __main__ - Step 41421: {'lr': 0.0004174666078660607, 'samples': 7952832, 'steps': 41420, 'loss/train': 1.1555086374282837} 11/07/2021 03:02:38 - INFO - __main__ - Step 41422: {'lr': 0.00041746266766816377, 'samples': 7953024, 'steps': 41421, 'loss/train': 1.1109774112701416} 11/07/2021 03:02:38 - INFO - __main__ - Step 41423: {'lr': 0.0004174587273948106, 'samples': 7953216, 'steps': 41422, 'loss/train': 1.418365716934204} 11/07/2021 03:02:38 - INFO - __main__ - Step 41424: {'lr': 0.0004174547870460033, 'samples': 7953408, 'steps': 41423, 'loss/train': 1.2630163431167603} 11/07/2021 03:02:39 - INFO - __main__ - Step 41425: {'lr': 0.0004174508466217434, 'samples': 7953600, 'steps': 41424, 'loss/train': 1.473492980003357} 11/07/2021 03:02:40 - INFO - __main__ - Step 41426: {'lr': 0.00041744690612203263, 'samples': 7953792, 'steps': 41425, 'loss/train': 1.6726351976394653} 11/07/2021 03:02:40 - INFO - __main__ - Step 41427: {'lr': 0.00041744296554687294, 'samples': 7953984, 'steps': 41426, 'loss/train': 1.9256874322891235} 11/07/2021 03:02:41 - INFO - __main__ - Step 41428: {'lr': 0.00041743902489626606, 'samples': 7954176, 'steps': 41427, 'loss/train': 1.5032762289047241} 11/07/2021 03:02:41 - INFO - __main__ - Step 41429: {'lr': 0.0004174350841702137, 'samples': 7954368, 'steps': 41428, 'loss/train': 1.2201040983200073} 11/07/2021 03:02:41 - INFO - __main__ - Step 41430: {'lr': 0.0004174311433687177, 'samples': 7954560, 'steps': 41429, 'loss/train': 1.4208319187164307} 11/07/2021 03:02:42 - INFO - __main__ - Step 41431: {'lr': 0.00041742720249177975, 'samples': 7954752, 'steps': 41430, 'loss/train': 1.0522828102111816} 11/07/2021 03:02:43 - INFO - __main__ - Step 41432: {'lr': 0.0004174232615394018, 'samples': 7954944, 'steps': 41431, 'loss/train': 1.4779268503189087} 11/07/2021 03:02:43 - INFO - __main__ - Step 41433: {'lr': 0.00041741932051158535, 'samples': 7955136, 'steps': 41432, 'loss/train': 1.259121060371399} 11/07/2021 03:02:43 - INFO - __main__ - Step 41434: {'lr': 0.00041741537940833247, 'samples': 7955328, 'steps': 41433, 'loss/train': 1.3860517740249634} 11/07/2021 03:02:44 - INFO - __main__ - Step 41435: {'lr': 0.00041741143822964476, 'samples': 7955520, 'steps': 41434, 'loss/train': 1.6901785135269165} 11/07/2021 03:02:45 - INFO - __main__ - Step 41436: {'lr': 0.00041740749697552406, 'samples': 7955712, 'steps': 41435, 'loss/train': 1.1720250844955444} 11/07/2021 03:02:45 - INFO - __main__ - Step 41437: {'lr': 0.0004174035556459721, 'samples': 7955904, 'steps': 41436, 'loss/train': 0.42491212487220764} 11/07/2021 03:02:45 - INFO - __main__ - Step 41438: {'lr': 0.0004173996142409907, 'samples': 7956096, 'steps': 41437, 'loss/train': 1.5557392835617065} 11/07/2021 03:02:46 - INFO - __main__ - Step 41439: {'lr': 0.0004173956727605816, 'samples': 7956288, 'steps': 41438, 'loss/train': 1.436159610748291} 11/07/2021 03:02:46 - INFO - __main__ - Step 41440: {'lr': 0.00041739173120474663, 'samples': 7956480, 'steps': 41439, 'loss/train': 1.266028642654419} 11/07/2021 03:02:47 - INFO - __main__ - Step 41441: {'lr': 0.00041738778957348745, 'samples': 7956672, 'steps': 41440, 'loss/train': 1.7526899576187134} 11/07/2021 03:02:48 - INFO - __main__ - Step 41442: {'lr': 0.00041738384786680596, 'samples': 7956864, 'steps': 41441, 'loss/train': 1.5865671634674072} 11/07/2021 03:02:48 - INFO - __main__ - Step 41443: {'lr': 0.0004173799060847039, 'samples': 7957056, 'steps': 41442, 'loss/train': 1.5946766138076782} 11/07/2021 03:02:48 - INFO - __main__ - Step 41444: {'lr': 0.00041737596422718306, 'samples': 7957248, 'steps': 41443, 'loss/train': 1.0524725914001465} 11/07/2021 03:02:49 - INFO - __main__ - Step 41445: {'lr': 0.0004173720222942452, 'samples': 7957440, 'steps': 41444, 'loss/train': 1.2988440990447998} 11/07/2021 03:02:50 - INFO - __main__ - Step 41446: {'lr': 0.000417368080285892, 'samples': 7957632, 'steps': 41445, 'loss/train': 1.635378360748291} 11/07/2021 03:02:50 - INFO - __main__ - Step 41447: {'lr': 0.0004173641382021254, 'samples': 7957824, 'steps': 41446, 'loss/train': 1.5069023370742798} 11/07/2021 03:02:50 - INFO - __main__ - Step 41448: {'lr': 0.00041736019604294704, 'samples': 7958016, 'steps': 41447, 'loss/train': 1.1068689823150635} 11/07/2021 03:02:51 - INFO - __main__ - Step 41449: {'lr': 0.00041735625380835884, 'samples': 7958208, 'steps': 41448, 'loss/train': 1.2277780771255493} 11/07/2021 03:02:51 - INFO - __main__ - Step 41450: {'lr': 0.0004173523114983624, 'samples': 7958400, 'steps': 41449, 'loss/train': 1.8505853414535522} 11/07/2021 03:02:51 - INFO - __main__ - Step 41451: {'lr': 0.0004173483691129597, 'samples': 7958592, 'steps': 41450, 'loss/train': 0.6934025883674622} 11/07/2021 03:02:52 - INFO - __main__ - Step 41452: {'lr': 0.00041734442665215235, 'samples': 7958784, 'steps': 41451, 'loss/train': 1.4254666566848755} 11/07/2021 03:02:53 - INFO - __main__ - Step 41453: {'lr': 0.00041734048411594214, 'samples': 7958976, 'steps': 41452, 'loss/train': 1.5704346895217896} 11/07/2021 03:02:53 - INFO - __main__ - Step 41454: {'lr': 0.000417336541504331, 'samples': 7959168, 'steps': 41453, 'loss/train': 1.513946533203125} 11/07/2021 03:02:54 - INFO - __main__ - Step 41455: {'lr': 0.0004173325988173205, 'samples': 7959360, 'steps': 41454, 'loss/train': 1.9185298681259155} 11/07/2021 03:02:54 - INFO - __main__ - Step 41456: {'lr': 0.00041732865605491256, 'samples': 7959552, 'steps': 41455, 'loss/train': 1.6555119752883911} 11/07/2021 03:02:55 - INFO - __main__ - Step 41457: {'lr': 0.00041732471321710886, 'samples': 7959744, 'steps': 41456, 'loss/train': 1.1334835290908813} 11/07/2021 03:02:55 - INFO - __main__ - Step 41458: {'lr': 0.00041732077030391126, 'samples': 7959936, 'steps': 41457, 'loss/train': 1.4219813346862793} 11/07/2021 03:02:56 - INFO - __main__ - Step 41459: {'lr': 0.00041731682731532154, 'samples': 7960128, 'steps': 41458, 'loss/train': 1.6092408895492554} 11/07/2021 03:02:56 - INFO - __main__ - Step 41460: {'lr': 0.0004173128842513414, 'samples': 7960320, 'steps': 41459, 'loss/train': 1.670626163482666} 11/07/2021 03:02:56 - INFO - __main__ - Step 41461: {'lr': 0.00041730894111197266, 'samples': 7960512, 'steps': 41460, 'loss/train': 1.2933282852172852} 11/07/2021 03:02:57 - INFO - __main__ - Step 41462: {'lr': 0.0004173049978972171, 'samples': 7960704, 'steps': 41461, 'loss/train': 1.370222568511963} 11/07/2021 03:02:58 - INFO - __main__ - Step 41463: {'lr': 0.0004173010546070765, 'samples': 7960896, 'steps': 41462, 'loss/train': 1.1095010042190552} 11/07/2021 03:02:58 - INFO - __main__ - Step 41464: {'lr': 0.00041729711124155255, 'samples': 7961088, 'steps': 41463, 'loss/train': 1.4668735265731812} 11/07/2021 03:02:58 - INFO - __main__ - Step 41465: {'lr': 0.0004172931678006472, 'samples': 7961280, 'steps': 41464, 'loss/train': 1.5170831680297852} 11/07/2021 03:02:59 - INFO - __main__ - Step 41466: {'lr': 0.00041728922428436213, 'samples': 7961472, 'steps': 41465, 'loss/train': 1.5716190338134766} 11/07/2021 03:03:00 - INFO - __main__ - Step 41467: {'lr': 0.000417285280692699, 'samples': 7961664, 'steps': 41466, 'loss/train': 1.6805745363235474} 11/07/2021 03:03:00 - INFO - __main__ - Step 41468: {'lr': 0.00041728133702565985, 'samples': 7961856, 'steps': 41467, 'loss/train': 1.1384950876235962} 11/07/2021 03:03:01 - INFO - __main__ - Step 41469: {'lr': 0.0004172773932832462, 'samples': 7962048, 'steps': 41468, 'loss/train': 1.230355978012085} 11/07/2021 03:03:01 - INFO - __main__ - Step 41470: {'lr': 0.00041727344946546, 'samples': 7962240, 'steps': 41469, 'loss/train': 1.650503396987915} 11/07/2021 03:03:02 - INFO - __main__ - Step 41471: {'lr': 0.00041726950557230294, 'samples': 7962432, 'steps': 41470, 'loss/train': 1.2427712678909302} 11/07/2021 03:03:02 - INFO - __main__ - Step 41472: {'lr': 0.0004172655616037768, 'samples': 7962624, 'steps': 41471, 'loss/train': 1.4126157760620117} 11/07/2021 03:03:02 - INFO - __main__ - Step 41473: {'lr': 0.0004172616175598835, 'samples': 7962816, 'steps': 41472, 'loss/train': 1.6682500839233398} 11/07/2021 03:03:03 - INFO - __main__ - Step 41474: {'lr': 0.00041725767344062453, 'samples': 7963008, 'steps': 41473, 'loss/train': 1.1578184366226196} 11/07/2021 03:03:04 - INFO - __main__ - Step 41475: {'lr': 0.00041725372924600193, 'samples': 7963200, 'steps': 41474, 'loss/train': 1.4274133443832397} 11/07/2021 03:03:04 - INFO - __main__ - Step 41476: {'lr': 0.00041724978497601736, 'samples': 7963392, 'steps': 41475, 'loss/train': 1.463484287261963} 11/07/2021 03:03:04 - INFO - __main__ - Step 41477: {'lr': 0.0004172458406306726, 'samples': 7963584, 'steps': 41476, 'loss/train': 1.0533301830291748} 11/07/2021 03:03:05 - INFO - __main__ - Step 41478: {'lr': 0.00041724189620996946, 'samples': 7963776, 'steps': 41477, 'loss/train': 1.0579352378845215} 11/07/2021 03:03:06 - INFO - __main__ - Step 41479: {'lr': 0.0004172379517139097, 'samples': 7963968, 'steps': 41478, 'loss/train': 1.7435880899429321} 11/07/2021 03:03:06 - INFO - __main__ - Step 41480: {'lr': 0.0004172340071424951, 'samples': 7964160, 'steps': 41479, 'loss/train': 1.8347982168197632} 11/07/2021 03:03:07 - INFO - __main__ - Step 41481: {'lr': 0.00041723006249572744, 'samples': 7964352, 'steps': 41480, 'loss/train': 1.4495962858200073} 11/07/2021 03:03:07 - INFO - __main__ - Step 41482: {'lr': 0.00041722611777360844, 'samples': 7964544, 'steps': 41481, 'loss/train': 1.658594012260437} 11/07/2021 03:03:07 - INFO - __main__ - Step 41483: {'lr': 0.00041722217297614, 'samples': 7964736, 'steps': 41482, 'loss/train': 1.278537631034851} 11/07/2021 03:03:09 - INFO - __main__ - Step 41484: {'lr': 0.00041721822810332384, 'samples': 7964928, 'steps': 41483, 'loss/train': 1.0990140438079834} 11/07/2021 03:03:09 - INFO - __main__ - Step 41485: {'lr': 0.00041721428315516176, 'samples': 7965120, 'steps': 41484, 'loss/train': 1.4794907569885254} 11/07/2021 03:03:09 - INFO - __main__ - Step 41486: {'lr': 0.00041721033813165543, 'samples': 7965312, 'steps': 41485, 'loss/train': 1.702915906906128} 11/07/2021 03:03:10 - INFO - __main__ - Step 41487: {'lr': 0.0004172063930328067, 'samples': 7965504, 'steps': 41486, 'loss/train': 1.5428369045257568} 11/07/2021 03:03:10 - INFO - __main__ - Step 41488: {'lr': 0.00041720244785861736, 'samples': 7965696, 'steps': 41487, 'loss/train': 1.109743356704712} 11/07/2021 03:03:12 - INFO - __main__ - Step 41489: {'lr': 0.0004171985026090892, 'samples': 7965888, 'steps': 41488, 'loss/train': 1.1824779510498047} 11/07/2021 03:03:12 - INFO - __main__ - Step 41490: {'lr': 0.00041719455728422394, 'samples': 7966080, 'steps': 41489, 'loss/train': 1.8389651775360107} 11/07/2021 03:03:12 - INFO - __main__ - Step 41491: {'lr': 0.0004171906118840234, 'samples': 7966272, 'steps': 41490, 'loss/train': 1.796555519104004} 11/07/2021 03:03:13 - INFO - __main__ - Step 41492: {'lr': 0.00041718666640848937, 'samples': 7966464, 'steps': 41491, 'loss/train': 1.8231583833694458} 11/07/2021 03:03:13 - INFO - __main__ - Step 41493: {'lr': 0.0004171827208576236, 'samples': 7966656, 'steps': 41492, 'loss/train': 1.3844928741455078} 11/07/2021 03:03:14 - INFO - __main__ - Step 41494: {'lr': 0.00041717877523142786, 'samples': 7966848, 'steps': 41493, 'loss/train': 1.3137469291687012} 11/07/2021 03:03:14 - INFO - __main__ - Step 41495: {'lr': 0.00041717482952990394, 'samples': 7967040, 'steps': 41494, 'loss/train': 1.6464651823043823} 11/07/2021 03:03:15 - INFO - __main__ - Step 41496: {'lr': 0.00041717088375305367, 'samples': 7967232, 'steps': 41495, 'loss/train': 0.9370419979095459} 11/07/2021 03:03:15 - INFO - __main__ - Step 41497: {'lr': 0.0004171669379008787, 'samples': 7967424, 'steps': 41496, 'loss/train': 1.5788377523422241} 11/07/2021 03:03:16 - INFO - __main__ - Step 41498: {'lr': 0.00041716299197338093, 'samples': 7967616, 'steps': 41497, 'loss/train': 1.5179771184921265} 11/07/2021 03:03:16 - INFO - __main__ - Step 41499: {'lr': 0.0004171590459705622, 'samples': 7967808, 'steps': 41498, 'loss/train': 1.1437050104141235} 11/07/2021 03:03:16 - INFO - __main__ - Step 41500: {'lr': 0.0004171550998924241, 'samples': 7968000, 'steps': 41499, 'loss/train': 1.9808309078216553} 11/07/2021 03:03:17 - INFO - __main__ - Step 41501: {'lr': 0.0004171511537389684, 'samples': 7968192, 'steps': 41500, 'loss/train': 1.4925646781921387} 11/07/2021 03:03:18 - INFO - __main__ - Step 41502: {'lr': 0.0004171472075101971, 'samples': 7968384, 'steps': 41501, 'loss/train': 1.3484253883361816} 11/07/2021 03:03:18 - INFO - __main__ - Step 41503: {'lr': 0.0004171432612061117, 'samples': 7968576, 'steps': 41502, 'loss/train': 1.1927528381347656} 11/07/2021 03:03:18 - INFO - __main__ - Step 41504: {'lr': 0.00041713931482671425, 'samples': 7968768, 'steps': 41503, 'loss/train': 1.7438743114471436} 11/07/2021 03:03:19 - INFO - __main__ - Step 41505: {'lr': 0.0004171353683720064, 'samples': 7968960, 'steps': 41504, 'loss/train': 1.5354259014129639} 11/07/2021 03:03:20 - INFO - __main__ - Step 41506: {'lr': 0.00041713142184198994, 'samples': 7969152, 'steps': 41505, 'loss/train': 0.9600526690483093} 11/07/2021 03:03:20 - INFO - __main__ - Step 41507: {'lr': 0.0004171274752366665, 'samples': 7969344, 'steps': 41506, 'loss/train': 1.7088255882263184} 11/07/2021 03:03:20 - INFO - __main__ - Step 41508: {'lr': 0.00041712352855603817, 'samples': 7969536, 'steps': 41507, 'loss/train': 0.9700304269790649} 11/07/2021 03:03:21 - INFO - __main__ - Step 41509: {'lr': 0.00041711958180010644, 'samples': 7969728, 'steps': 41508, 'loss/train': 1.4612095355987549} 11/07/2021 03:03:21 - INFO - __main__ - Step 41510: {'lr': 0.0004171156349688733, 'samples': 7969920, 'steps': 41509, 'loss/train': 1.733356237411499} 11/07/2021 03:03:22 - INFO - __main__ - Step 41511: {'lr': 0.0004171116880623404, 'samples': 7970112, 'steps': 41510, 'loss/train': 0.9728167057037354} 11/07/2021 03:03:23 - INFO - __main__ - Step 41512: {'lr': 0.0004171077410805095, 'samples': 7970304, 'steps': 41511, 'loss/train': 2.3415327072143555} 11/07/2021 03:03:23 - INFO - __main__ - Step 41513: {'lr': 0.0004171037940233825, 'samples': 7970496, 'steps': 41512, 'loss/train': 1.5685441493988037} 11/07/2021 03:03:23 - INFO - __main__ - Step 41514: {'lr': 0.0004170998468909611, 'samples': 7970688, 'steps': 41513, 'loss/train': 0.9442493915557861} 11/07/2021 03:03:24 - INFO - __main__ - Step 41515: {'lr': 0.00041709589968324704, 'samples': 7970880, 'steps': 41514, 'loss/train': 1.246776819229126} 11/07/2021 03:03:24 - INFO - __main__ - Step 41516: {'lr': 0.00041709195240024224, 'samples': 7971072, 'steps': 41515, 'loss/train': 1.509366750717163} 11/07/2021 03:03:25 - INFO - __main__ - Step 41517: {'lr': 0.0004170880050419483, 'samples': 7971264, 'steps': 41516, 'loss/train': 1.6185767650604248} 11/07/2021 03:03:25 - INFO - __main__ - Step 41518: {'lr': 0.0004170840576083671, 'samples': 7971456, 'steps': 41517, 'loss/train': 1.701627254486084} 11/07/2021 03:03:26 - INFO - __main__ - Step 41519: {'lr': 0.00041708011009950044, 'samples': 7971648, 'steps': 41518, 'loss/train': 1.5624347925186157} 11/07/2021 03:03:26 - INFO - __main__ - Step 41520: {'lr': 0.00041707616251535, 'samples': 7971840, 'steps': 41519, 'loss/train': 1.5904089212417603} 11/07/2021 03:03:26 - INFO - __main__ - Step 41521: {'lr': 0.0004170722148559176, 'samples': 7972032, 'steps': 41520, 'loss/train': 1.464963674545288} 11/07/2021 03:03:28 - INFO - __main__ - Step 41522: {'lr': 0.0004170682671212051, 'samples': 7972224, 'steps': 41521, 'loss/train': 1.7140016555786133} 11/07/2021 03:03:28 - INFO - __main__ - Step 41523: {'lr': 0.00041706431931121416, 'samples': 7972416, 'steps': 41522, 'loss/train': 1.342008113861084} 11/07/2021 03:03:28 - INFO - __main__ - Step 41524: {'lr': 0.00041706037142594666, 'samples': 7972608, 'steps': 41523, 'loss/train': 1.2205827236175537} 11/07/2021 03:03:29 - INFO - __main__ - Step 41525: {'lr': 0.00041705642346540436, 'samples': 7972800, 'steps': 41524, 'loss/train': 1.8854976892471313} 11/07/2021 03:03:29 - INFO - __main__ - Step 41526: {'lr': 0.00041705247542958904, 'samples': 7972992, 'steps': 41525, 'loss/train': 1.6413252353668213} 11/07/2021 03:03:30 - INFO - __main__ - Step 41527: {'lr': 0.00041704852731850234, 'samples': 7973184, 'steps': 41526, 'loss/train': 1.44478440284729} 11/07/2021 03:03:30 - INFO - __main__ - Step 41528: {'lr': 0.0004170445791321462, 'samples': 7973376, 'steps': 41527, 'loss/train': 1.8444479703903198} 11/07/2021 03:03:31 - INFO - __main__ - Step 41529: {'lr': 0.00041704063087052236, 'samples': 7973568, 'steps': 41528, 'loss/train': 0.33136051893234253} 11/07/2021 03:03:31 - INFO - __main__ - Step 41530: {'lr': 0.0004170366825336326, 'samples': 7973760, 'steps': 41529, 'loss/train': 1.6385127305984497} 11/07/2021 03:03:31 - INFO - __main__ - Step 41531: {'lr': 0.0004170327341214787, 'samples': 7973952, 'steps': 41530, 'loss/train': 1.5817596912384033} 11/07/2021 03:03:33 - INFO - __main__ - Step 41532: {'lr': 0.00041702878563406237, 'samples': 7974144, 'steps': 41531, 'loss/train': 1.5311561822891235} 11/07/2021 03:03:33 - INFO - __main__ - Step 41533: {'lr': 0.0004170248370713855, 'samples': 7974336, 'steps': 41532, 'loss/train': 1.442463994026184} 11/07/2021 03:03:34 - INFO - __main__ - Step 41534: {'lr': 0.0004170208884334498, 'samples': 7974528, 'steps': 41533, 'loss/train': 1.0978189706802368} 11/07/2021 03:03:34 - INFO - __main__ - Step 41535: {'lr': 0.000417016939720257, 'samples': 7974720, 'steps': 41534, 'loss/train': 1.4123374223709106} 11/07/2021 03:03:34 - INFO - __main__ - Step 41536: {'lr': 0.000417012990931809, 'samples': 7974912, 'steps': 41535, 'loss/train': 0.7976887822151184} 11/07/2021 03:03:35 - INFO - __main__ - Step 41537: {'lr': 0.00041700904206810755, 'samples': 7975104, 'steps': 41536, 'loss/train': 1.848340392112732} 11/07/2021 03:03:36 - INFO - __main__ - Step 41538: {'lr': 0.00041700509312915437, 'samples': 7975296, 'steps': 41537, 'loss/train': 1.4981316328048706} 11/07/2021 03:03:36 - INFO - __main__ - Step 41539: {'lr': 0.0004170011441149513, 'samples': 7975488, 'steps': 41538, 'loss/train': 0.8363898396492004} 11/07/2021 03:03:36 - INFO - __main__ - Step 41540: {'lr': 0.0004169971950255001, 'samples': 7975680, 'steps': 41539, 'loss/train': 1.6645216941833496} 11/07/2021 03:03:37 - INFO - __main__ - Step 41541: {'lr': 0.0004169932458608025, 'samples': 7975872, 'steps': 41540, 'loss/train': 1.6828755140304565} 11/07/2021 03:03:37 - INFO - __main__ - Step 41542: {'lr': 0.00041698929662086035, 'samples': 7976064, 'steps': 41541, 'loss/train': 1.5580079555511475} 11/07/2021 03:03:38 - INFO - __main__ - Step 41543: {'lr': 0.0004169853473056754, 'samples': 7976256, 'steps': 41542, 'loss/train': 1.5427733659744263} 11/07/2021 03:03:38 - INFO - __main__ - Step 41544: {'lr': 0.0004169813979152494, 'samples': 7976448, 'steps': 41543, 'loss/train': 1.0188912153244019} 11/07/2021 03:03:39 - INFO - __main__ - Step 41545: {'lr': 0.0004169774484495841, 'samples': 7976640, 'steps': 41544, 'loss/train': 1.597219467163086} 11/07/2021 03:03:39 - INFO - __main__ - Step 41546: {'lr': 0.00041697349890868146, 'samples': 7976832, 'steps': 41545, 'loss/train': 1.5436819791793823} 11/07/2021 03:03:39 - INFO - __main__ - Step 41547: {'lr': 0.0004169695492925431, 'samples': 7977024, 'steps': 41546, 'loss/train': 1.716619610786438} 11/07/2021 03:03:41 - INFO - __main__ - Step 41548: {'lr': 0.0004169655996011708, 'samples': 7977216, 'steps': 41547, 'loss/train': 1.5983400344848633} 11/07/2021 03:03:41 - INFO - __main__ - Step 41549: {'lr': 0.0004169616498345664, 'samples': 7977408, 'steps': 41548, 'loss/train': 0.22941921651363373} 11/07/2021 03:03:42 - INFO - __main__ - Step 41550: {'lr': 0.0004169576999927317, 'samples': 7977600, 'steps': 41549, 'loss/train': 0.8101643323898315} 11/07/2021 03:03:42 - INFO - __main__ - Step 41551: {'lr': 0.00041695375007566837, 'samples': 7977792, 'steps': 41550, 'loss/train': 1.3644152879714966} 11/07/2021 03:03:42 - INFO - __main__ - Step 41552: {'lr': 0.00041694980008337825, 'samples': 7977984, 'steps': 41551, 'loss/train': 1.5691694021224976} 11/07/2021 03:03:43 - INFO - __main__ - Step 41553: {'lr': 0.0004169458500158632, 'samples': 7978176, 'steps': 41552, 'loss/train': 1.4925529956817627} 11/07/2021 03:03:44 - INFO - __main__ - Step 41554: {'lr': 0.0004169418998731249, 'samples': 7978368, 'steps': 41553, 'loss/train': 1.4446444511413574} 11/07/2021 03:03:44 - INFO - __main__ - Step 41555: {'lr': 0.00041693794965516514, 'samples': 7978560, 'steps': 41554, 'loss/train': 1.3788728713989258} 11/07/2021 03:03:45 - INFO - __main__ - Step 41556: {'lr': 0.0004169339993619857, 'samples': 7978752, 'steps': 41555, 'loss/train': 1.3602726459503174} 11/07/2021 03:03:45 - INFO - __main__ - Step 41557: {'lr': 0.0004169300489935884, 'samples': 7978944, 'steps': 41556, 'loss/train': 1.5837068557739258} 11/07/2021 03:03:45 - INFO - __main__ - Step 41558: {'lr': 0.000416926098549975, 'samples': 7979136, 'steps': 41557, 'loss/train': 1.434810996055603} 11/07/2021 03:03:46 - INFO - __main__ - Step 41559: {'lr': 0.00041692214803114725, 'samples': 7979328, 'steps': 41558, 'loss/train': 1.3605402708053589} 11/07/2021 03:03:47 - INFO - __main__ - Step 41560: {'lr': 0.00041691819743710704, 'samples': 7979520, 'steps': 41559, 'loss/train': 1.6383851766586304} 11/07/2021 03:03:47 - INFO - __main__ - Step 41561: {'lr': 0.00041691424676785593, 'samples': 7979712, 'steps': 41560, 'loss/train': 1.891399621963501} 11/07/2021 03:03:47 - INFO - __main__ - Step 41562: {'lr': 0.00041691029602339595, 'samples': 7979904, 'steps': 41561, 'loss/train': 1.7154065370559692} 11/07/2021 03:03:48 - INFO - __main__ - Step 41563: {'lr': 0.00041690634520372865, 'samples': 7980096, 'steps': 41562, 'loss/train': 1.307283878326416} 11/07/2021 03:03:49 - INFO - __main__ - Step 41564: {'lr': 0.000416902394308856, 'samples': 7980288, 'steps': 41563, 'loss/train': 1.3553440570831299} 11/07/2021 03:03:49 - INFO - __main__ - Step 41565: {'lr': 0.00041689844333877966, 'samples': 7980480, 'steps': 41564, 'loss/train': 1.7948700189590454} 11/07/2021 03:03:49 - INFO - __main__ - Step 41566: {'lr': 0.00041689449229350155, 'samples': 7980672, 'steps': 41565, 'loss/train': 1.0878863334655762} 11/07/2021 03:03:50 - INFO - __main__ - Step 41567: {'lr': 0.00041689054117302333, 'samples': 7980864, 'steps': 41566, 'loss/train': 1.3097561597824097} 11/07/2021 03:03:50 - INFO - __main__ - Step 41568: {'lr': 0.00041688658997734675, 'samples': 7981056, 'steps': 41567, 'loss/train': 1.6336743831634521} 11/07/2021 03:03:51 - INFO - __main__ - Step 41569: {'lr': 0.0004168826387064737, 'samples': 7981248, 'steps': 41568, 'loss/train': 1.0773857831954956} 11/07/2021 03:03:51 - INFO - __main__ - Step 41570: {'lr': 0.00041687868736040593, 'samples': 7981440, 'steps': 41569, 'loss/train': 1.6702024936676025} 11/07/2021 03:03:52 - INFO - __main__ - Step 41571: {'lr': 0.0004168747359391451, 'samples': 7981632, 'steps': 41570, 'loss/train': 1.680422067642212} 11/07/2021 03:03:52 - INFO - __main__ - Step 41572: {'lr': 0.00041687078444269316, 'samples': 7981824, 'steps': 41571, 'loss/train': 1.8877520561218262} 11/07/2021 03:03:53 - INFO - __main__ - Step 41573: {'lr': 0.0004168668328710518, 'samples': 7982016, 'steps': 41572, 'loss/train': 1.2095186710357666} 11/07/2021 03:03:53 - INFO - __main__ - Step 41574: {'lr': 0.0004168628812242228, 'samples': 7982208, 'steps': 41573, 'loss/train': 1.69416344165802} 11/07/2021 03:03:54 - INFO - __main__ - Step 41575: {'lr': 0.00041685892950220804, 'samples': 7982400, 'steps': 41574, 'loss/train': 1.6244293451309204} 11/07/2021 03:03:54 - INFO - __main__ - Step 41576: {'lr': 0.0004168549777050091, 'samples': 7982592, 'steps': 41575, 'loss/train': 0.9274775981903076} 11/07/2021 03:03:55 - INFO - __main__ - Step 41577: {'lr': 0.000416851025832628, 'samples': 7982784, 'steps': 41576, 'loss/train': 0.6884801387786865} 11/07/2021 03:03:55 - INFO - __main__ - Step 41578: {'lr': 0.0004168470738850664, 'samples': 7982976, 'steps': 41577, 'loss/train': 1.5503827333450317} 11/07/2021 03:03:56 - INFO - __main__ - Step 41579: {'lr': 0.00041684312186232597, 'samples': 7983168, 'steps': 41578, 'loss/train': 1.9044495820999146} 11/07/2021 03:03:56 - INFO - __main__ - Step 41580: {'lr': 0.0004168391697644087, 'samples': 7983360, 'steps': 41579, 'loss/train': 2.477400064468384} 11/07/2021 03:03:57 - INFO - __main__ - Step 41581: {'lr': 0.0004168352175913163, 'samples': 7983552, 'steps': 41580, 'loss/train': 1.2631022930145264} 11/07/2021 03:03:57 - INFO - __main__ - Step 41582: {'lr': 0.00041683126534305037, 'samples': 7983744, 'steps': 41581, 'loss/train': 1.6071327924728394} 11/07/2021 03:03:57 - INFO - __main__ - Step 41583: {'lr': 0.000416827313019613, 'samples': 7983936, 'steps': 41582, 'loss/train': 1.5428014993667603} 11/07/2021 03:03:58 - INFO - __main__ - Step 41584: {'lr': 0.0004168233606210058, 'samples': 7984128, 'steps': 41583, 'loss/train': 0.9221970438957214} 11/07/2021 03:03:59 - INFO - __main__ - Step 41585: {'lr': 0.0004168194081472305, 'samples': 7984320, 'steps': 41584, 'loss/train': 1.1452103853225708} 11/07/2021 03:03:59 - INFO - __main__ - Step 41586: {'lr': 0.000416815455598289, 'samples': 7984512, 'steps': 41585, 'loss/train': 1.6904128789901733} 11/07/2021 03:03:59 - INFO - __main__ - Step 41587: {'lr': 0.000416811502974183, 'samples': 7984704, 'steps': 41586, 'loss/train': 1.4233283996582031} 11/07/2021 03:04:00 - INFO - __main__ - Step 41588: {'lr': 0.00041680755027491433, 'samples': 7984896, 'steps': 41587, 'loss/train': 1.4579297304153442} 11/07/2021 03:04:00 - INFO - __main__ - Step 41589: {'lr': 0.0004168035975004847, 'samples': 7985088, 'steps': 41588, 'loss/train': 0.9681318402290344} 11/07/2021 03:04:01 - INFO - __main__ - Step 41590: {'lr': 0.00041679964465089596, 'samples': 7985280, 'steps': 41589, 'loss/train': 1.7440179586410522} 11/07/2021 03:04:02 - INFO - __main__ - Step 41591: {'lr': 0.00041679569172614996, 'samples': 7985472, 'steps': 41590, 'loss/train': 1.1505935192108154} 11/07/2021 03:04:02 - INFO - __main__ - Step 41592: {'lr': 0.0004167917387262483, 'samples': 7985664, 'steps': 41591, 'loss/train': 1.8241571187973022} 11/07/2021 03:04:02 - INFO - __main__ - Step 41593: {'lr': 0.0004167877856511929, 'samples': 7985856, 'steps': 41592, 'loss/train': 1.6857103109359741} 11/07/2021 03:04:03 - INFO - __main__ - Step 41594: {'lr': 0.0004167838325009855, 'samples': 7986048, 'steps': 41593, 'loss/train': 1.7244120836257935} 11/07/2021 03:04:04 - INFO - __main__ - Step 41595: {'lr': 0.0004167798792756279, 'samples': 7986240, 'steps': 41594, 'loss/train': 1.451128602027893} 11/07/2021 03:04:04 - INFO - __main__ - Step 41596: {'lr': 0.0004167759259751218, 'samples': 7986432, 'steps': 41595, 'loss/train': 1.5407642126083374} 11/07/2021 03:04:04 - INFO - __main__ - Step 41597: {'lr': 0.0004167719725994691, 'samples': 7986624, 'steps': 41596, 'loss/train': 1.3572611808776855} 11/07/2021 03:04:05 - INFO - __main__ - Step 41598: {'lr': 0.00041676801914867145, 'samples': 7986816, 'steps': 41597, 'loss/train': 1.030824065208435} 11/07/2021 03:04:05 - INFO - __main__ - Step 41599: {'lr': 0.00041676406562273074, 'samples': 7987008, 'steps': 41598, 'loss/train': 1.2116070985794067} 11/07/2021 03:04:06 - INFO - __main__ - Step 41600: {'lr': 0.00041676011202164875, 'samples': 7987200, 'steps': 41599, 'loss/train': 1.7278271913528442} 11/07/2021 03:04:06 - INFO - __main__ - Step 41601: {'lr': 0.00041675615834542716, 'samples': 7987392, 'steps': 41600, 'loss/train': 1.5282822847366333} 11/07/2021 03:04:07 - INFO - __main__ - Step 41602: {'lr': 0.0004167522045940678, 'samples': 7987584, 'steps': 41601, 'loss/train': 1.5136961936950684} 11/07/2021 03:04:07 - INFO - __main__ - Step 41603: {'lr': 0.0004167482507675726, 'samples': 7987776, 'steps': 41602, 'loss/train': 1.5415982007980347} 11/07/2021 03:04:07 - INFO - __main__ - Step 41604: {'lr': 0.0004167442968659431, 'samples': 7987968, 'steps': 41603, 'loss/train': 1.5877426862716675} 11/07/2021 03:04:09 - INFO - __main__ - Step 41605: {'lr': 0.0004167403428891812, 'samples': 7988160, 'steps': 41604, 'loss/train': 1.3433371782302856} 11/07/2021 03:04:09 - INFO - __main__ - Step 41606: {'lr': 0.00041673638883728877, 'samples': 7988352, 'steps': 41605, 'loss/train': 1.543599247932434} 11/07/2021 03:04:09 - INFO - __main__ - Step 41607: {'lr': 0.00041673243471026746, 'samples': 7988544, 'steps': 41606, 'loss/train': 1.7371529340744019} 11/07/2021 03:04:10 - INFO - __main__ - Step 41608: {'lr': 0.000416728480508119, 'samples': 7988736, 'steps': 41607, 'loss/train': 1.6626496315002441} 11/07/2021 03:04:10 - INFO - __main__ - Step 41609: {'lr': 0.00041672452623084535, 'samples': 7988928, 'steps': 41608, 'loss/train': 1.3573652505874634} 11/07/2021 03:04:10 - INFO - __main__ - Step 41610: {'lr': 0.0004167205718784481, 'samples': 7989120, 'steps': 41609, 'loss/train': 1.6693180799484253} 11/07/2021 03:04:11 - INFO - __main__ - Step 41611: {'lr': 0.0004167166174509293, 'samples': 7989312, 'steps': 41610, 'loss/train': 1.1781405210494995} 11/07/2021 03:04:12 - INFO - __main__ - Step 41612: {'lr': 0.00041671266294829036, 'samples': 7989504, 'steps': 41611, 'loss/train': 1.4301857948303223} 11/07/2021 03:04:12 - INFO - __main__ - Step 41613: {'lr': 0.0004167087083705334, 'samples': 7989696, 'steps': 41612, 'loss/train': 1.623382568359375} 11/07/2021 03:04:12 - INFO - __main__ - Step 41614: {'lr': 0.00041670475371766, 'samples': 7989888, 'steps': 41613, 'loss/train': 1.4963102340698242} 11/07/2021 03:04:13 - INFO - __main__ - Step 41615: {'lr': 0.0004167007989896721, 'samples': 7990080, 'steps': 41614, 'loss/train': 1.2779196500778198} 11/07/2021 03:04:14 - INFO - __main__ - Step 41616: {'lr': 0.0004166968441865714, 'samples': 7990272, 'steps': 41615, 'loss/train': 1.5700368881225586} 11/07/2021 03:04:14 - INFO - __main__ - Step 41617: {'lr': 0.00041669288930835957, 'samples': 7990464, 'steps': 41616, 'loss/train': 1.4026747941970825} 11/07/2021 03:04:14 - INFO - __main__ - Step 41618: {'lr': 0.0004166889343550385, 'samples': 7990656, 'steps': 41617, 'loss/train': 1.3232351541519165} 11/07/2021 03:04:15 - INFO - __main__ - Step 41619: {'lr': 0.00041668497932661005, 'samples': 7990848, 'steps': 41618, 'loss/train': 1.4335687160491943} 11/07/2021 03:04:15 - INFO - __main__ - Step 41620: {'lr': 0.00041668102422307593, 'samples': 7991040, 'steps': 41619, 'loss/train': 1.4519500732421875} 11/07/2021 03:04:16 - INFO - __main__ - Step 41621: {'lr': 0.0004166770690444378, 'samples': 7991232, 'steps': 41620, 'loss/train': 0.8947644829750061} 11/07/2021 03:04:16 - INFO - __main__ - Step 41622: {'lr': 0.0004166731137906976, 'samples': 7991424, 'steps': 41621, 'loss/train': 1.49089777469635} 11/07/2021 03:04:17 - INFO - __main__ - Step 41623: {'lr': 0.0004166691584618572, 'samples': 7991616, 'steps': 41622, 'loss/train': 1.4614523649215698} 11/07/2021 03:04:17 - INFO - __main__ - Step 41624: {'lr': 0.00041666520305791806, 'samples': 7991808, 'steps': 41623, 'loss/train': 1.655706524848938} 11/07/2021 03:04:18 - INFO - __main__ - Step 41625: {'lr': 0.00041666124757888223, 'samples': 7992000, 'steps': 41624, 'loss/train': 1.599887490272522} 11/07/2021 03:04:19 - INFO - __main__ - Step 41626: {'lr': 0.0004166572920247514, 'samples': 7992192, 'steps': 41625, 'loss/train': 0.9554967880249023} 11/07/2021 03:04:19 - INFO - __main__ - Step 41627: {'lr': 0.0004166533363955274, 'samples': 7992384, 'steps': 41626, 'loss/train': 1.4879121780395508} 11/07/2021 03:04:19 - INFO - __main__ - Step 41628: {'lr': 0.00041664938069121195, 'samples': 7992576, 'steps': 41627, 'loss/train': 1.5209451913833618} 11/07/2021 03:04:20 - INFO - __main__ - Step 41629: {'lr': 0.00041664542491180685, 'samples': 7992768, 'steps': 41628, 'loss/train': 1.5639281272888184} 11/07/2021 03:04:20 - INFO - __main__ - Step 41630: {'lr': 0.0004166414690573139, 'samples': 7992960, 'steps': 41629, 'loss/train': 1.2982888221740723} 11/07/2021 03:04:20 - INFO - __main__ - Step 41631: {'lr': 0.0004166375131277349, 'samples': 7993152, 'steps': 41630, 'loss/train': 1.4986904859542847} 11/07/2021 03:04:21 - INFO - __main__ - Step 41632: {'lr': 0.0004166335571230716, 'samples': 7993344, 'steps': 41631, 'loss/train': 1.3777761459350586} 11/07/2021 03:04:22 - INFO - __main__ - Step 41633: {'lr': 0.0004166296010433258, 'samples': 7993536, 'steps': 41632, 'loss/train': 1.5161300897598267} 11/07/2021 03:04:22 - INFO - __main__ - Step 41634: {'lr': 0.00041662564488849927, 'samples': 7993728, 'steps': 41633, 'loss/train': 1.3086680173873901} 11/07/2021 03:04:22 - INFO - __main__ - Step 41635: {'lr': 0.00041662168865859374, 'samples': 7993920, 'steps': 41634, 'loss/train': 1.5390034914016724} 11/07/2021 03:04:23 - INFO - __main__ - Step 41636: {'lr': 0.0004166177323536111, 'samples': 7994112, 'steps': 41635, 'loss/train': 1.047380805015564} 11/07/2021 03:04:24 - INFO - __main__ - Step 41637: {'lr': 0.000416613775973553, 'samples': 7994304, 'steps': 41636, 'loss/train': 1.6869618892669678} 11/07/2021 03:04:24 - INFO - __main__ - Step 41638: {'lr': 0.0004166098195184214, 'samples': 7994496, 'steps': 41637, 'loss/train': 1.61757230758667} 11/07/2021 03:04:24 - INFO - __main__ - Step 41639: {'lr': 0.000416605862988218, 'samples': 7994688, 'steps': 41638, 'loss/train': 0.2610689401626587} 11/07/2021 03:04:25 - INFO - __main__ - Step 41640: {'lr': 0.00041660190638294456, 'samples': 7994880, 'steps': 41639, 'loss/train': 1.7146142721176147} 11/07/2021 03:04:25 - INFO - __main__ - Step 41641: {'lr': 0.0004165979497026028, 'samples': 7995072, 'steps': 41640, 'loss/train': 1.8692586421966553} 11/07/2021 03:04:26 - INFO - __main__ - Step 41642: {'lr': 0.00041659399294719456, 'samples': 7995264, 'steps': 41641, 'loss/train': 1.502307415008545} 11/07/2021 03:04:27 - INFO - __main__ - Step 41643: {'lr': 0.00041659003611672175, 'samples': 7995456, 'steps': 41642, 'loss/train': 1.2738826274871826} 11/07/2021 03:04:27 - INFO - __main__ - Step 41644: {'lr': 0.000416586079211186, 'samples': 7995648, 'steps': 41643, 'loss/train': 1.7889599800109863} 11/07/2021 03:04:27 - INFO - __main__ - Step 41645: {'lr': 0.0004165821222305891, 'samples': 7995840, 'steps': 41644, 'loss/train': 1.6064268350601196} 11/07/2021 03:04:28 - INFO - __main__ - Step 41646: {'lr': 0.00041657816517493284, 'samples': 7996032, 'steps': 41645, 'loss/train': 1.5085879564285278} 11/07/2021 03:04:29 - INFO - __main__ - Step 41647: {'lr': 0.00041657420804421907, 'samples': 7996224, 'steps': 41646, 'loss/train': 1.4858813285827637} 11/07/2021 03:04:29 - INFO - __main__ - Step 41648: {'lr': 0.00041657025083844957, 'samples': 7996416, 'steps': 41647, 'loss/train': 1.5819388628005981} 11/07/2021 03:04:29 - INFO - __main__ - Step 41649: {'lr': 0.00041656629355762607, 'samples': 7996608, 'steps': 41648, 'loss/train': 1.721908450126648} 11/07/2021 03:04:30 - INFO - __main__ - Step 41650: {'lr': 0.00041656233620175035, 'samples': 7996800, 'steps': 41649, 'loss/train': 1.5587491989135742} 11/07/2021 03:04:30 - INFO - __main__ - Step 41651: {'lr': 0.0004165583787708242, 'samples': 7996992, 'steps': 41650, 'loss/train': 1.8320040702819824} 11/07/2021 03:04:31 - INFO - __main__ - Step 41652: {'lr': 0.0004165544212648494, 'samples': 7997184, 'steps': 41651, 'loss/train': 1.6275389194488525} 11/07/2021 03:04:31 - INFO - __main__ - Step 41653: {'lr': 0.0004165504636838278, 'samples': 7997376, 'steps': 41652, 'loss/train': 0.7777548432350159} 11/07/2021 03:04:32 - INFO - __main__ - Step 41654: {'lr': 0.0004165465060277611, 'samples': 7997568, 'steps': 41653, 'loss/train': 2.0229907035827637} 11/07/2021 03:04:32 - INFO - __main__ - Step 41655: {'lr': 0.0004165425482966512, 'samples': 7997760, 'steps': 41654, 'loss/train': 1.7797147035598755} 11/07/2021 03:04:33 - INFO - __main__ - Step 41656: {'lr': 0.00041653859049049964, 'samples': 7997952, 'steps': 41655, 'loss/train': 1.5604771375656128} 11/07/2021 03:04:34 - INFO - __main__ - Step 41657: {'lr': 0.00041653463260930845, 'samples': 7998144, 'steps': 41656, 'loss/train': 1.7182575464248657} 11/07/2021 03:04:34 - INFO - __main__ - Step 41658: {'lr': 0.00041653067465307925, 'samples': 7998336, 'steps': 41657, 'loss/train': 1.416732907295227} 11/07/2021 03:04:34 - INFO - __main__ - Step 41659: {'lr': 0.00041652671662181394, 'samples': 7998528, 'steps': 41658, 'loss/train': 1.7493034601211548} 11/07/2021 03:04:35 - INFO - __main__ - Step 41660: {'lr': 0.00041652275851551435, 'samples': 7998720, 'steps': 41659, 'loss/train': 1.145478367805481} 11/07/2021 03:04:35 - INFO - __main__ - Step 41661: {'lr': 0.0004165188003341821, 'samples': 7998912, 'steps': 41660, 'loss/train': 1.4541614055633545} 11/07/2021 03:04:36 - INFO - __main__ - Step 41662: {'lr': 0.0004165148420778191, 'samples': 7999104, 'steps': 41661, 'loss/train': 1.7806086540222168} 11/07/2021 03:04:36 - INFO - __main__ - Step 41663: {'lr': 0.000416510883746427, 'samples': 7999296, 'steps': 41662, 'loss/train': 1.8331174850463867} 11/07/2021 03:04:37 - INFO - __main__ - Step 41664: {'lr': 0.00041650692534000766, 'samples': 7999488, 'steps': 41663, 'loss/train': 1.617154598236084} 11/07/2021 03:04:37 - INFO - __main__ - Step 41665: {'lr': 0.0004165029668585629, 'samples': 7999680, 'steps': 41664, 'loss/train': 1.5130399465560913} 11/07/2021 03:04:37 - INFO - __main__ - Step 41666: {'lr': 0.00041649900830209455, 'samples': 7999872, 'steps': 41665, 'loss/train': 1.3040863275527954} 11/07/2021 03:04:38 - INFO - __main__ - Step 41667: {'lr': 0.00041649504967060423, 'samples': 8000064, 'steps': 41666, 'loss/train': 1.3093539476394653} 11/07/2021 03:04:39 - INFO - __main__ - Step 41668: {'lr': 0.0004164910909640938, 'samples': 8000256, 'steps': 41667, 'loss/train': 1.47370445728302} 11/07/2021 03:04:39 - INFO - __main__ - Step 41669: {'lr': 0.0004164871321825651, 'samples': 8000448, 'steps': 41668, 'loss/train': 0.9638338685035706} 11/07/2021 03:04:39 - INFO - __main__ - Step 41670: {'lr': 0.0004164831733260198, 'samples': 8000640, 'steps': 41669, 'loss/train': 1.4005441665649414} 11/07/2021 03:04:40 - INFO - __main__ - Step 41671: {'lr': 0.0004164792143944598, 'samples': 8000832, 'steps': 41670, 'loss/train': 1.7547717094421387} 11/07/2021 03:04:40 - INFO - __main__ - Step 41672: {'lr': 0.0004164752553878868, 'samples': 8001024, 'steps': 41671, 'loss/train': 1.6093745231628418} 11/07/2021 03:04:41 - INFO - __main__ - Step 41673: {'lr': 0.00041647129630630265, 'samples': 8001216, 'steps': 41672, 'loss/train': 1.2964626550674438} 11/07/2021 03:04:42 - INFO - __main__ - Step 41674: {'lr': 0.0004164673371497092, 'samples': 8001408, 'steps': 41673, 'loss/train': 1.6003328561782837} 11/07/2021 03:04:42 - INFO - __main__ - Step 41675: {'lr': 0.000416463377918108, 'samples': 8001600, 'steps': 41674, 'loss/train': 0.9917303323745728} 11/07/2021 03:04:42 - INFO - __main__ - Step 41676: {'lr': 0.00041645941861150103, 'samples': 8001792, 'steps': 41675, 'loss/train': 2.396070957183838} 11/07/2021 03:04:43 - INFO - __main__ - Step 41677: {'lr': 0.00041645545922989, 'samples': 8001984, 'steps': 41676, 'loss/train': 1.1119292974472046} 11/07/2021 03:04:44 - INFO - __main__ - Step 41678: {'lr': 0.00041645149977327667, 'samples': 8002176, 'steps': 41677, 'loss/train': 0.8787075877189636} 11/07/2021 03:04:44 - INFO - __main__ - Step 41679: {'lr': 0.0004164475402416629, 'samples': 8002368, 'steps': 41678, 'loss/train': 1.0953854322433472} 11/07/2021 03:04:45 - INFO - __main__ - Step 41680: {'lr': 0.0004164435806350505, 'samples': 8002560, 'steps': 41679, 'loss/train': 1.580024003982544} 11/07/2021 03:04:45 - INFO - __main__ - Step 41681: {'lr': 0.00041643962095344107, 'samples': 8002752, 'steps': 41680, 'loss/train': 1.4981598854064941} 11/07/2021 03:04:45 - INFO - __main__ - Step 41682: {'lr': 0.0004164356611968366, 'samples': 8002944, 'steps': 41681, 'loss/train': 1.466947078704834} 11/07/2021 03:04:46 - INFO - __main__ - Step 41683: {'lr': 0.0004164317013652387, 'samples': 8003136, 'steps': 41682, 'loss/train': 1.6721065044403076} 11/07/2021 03:04:47 - INFO - __main__ - Step 41684: {'lr': 0.00041642774145864934, 'samples': 8003328, 'steps': 41683, 'loss/train': 1.6475099325180054} 11/07/2021 03:04:47 - INFO - __main__ - Step 41685: {'lr': 0.00041642378147707014, 'samples': 8003520, 'steps': 41684, 'loss/train': 1.4430322647094727} 11/07/2021 03:04:47 - INFO - __main__ - Step 41686: {'lr': 0.00041641982142050297, 'samples': 8003712, 'steps': 41685, 'loss/train': 1.5093369483947754} 11/07/2021 03:04:48 - INFO - __main__ - Step 41687: {'lr': 0.00041641586128894967, 'samples': 8003904, 'steps': 41686, 'loss/train': 1.529678225517273} 11/07/2021 03:04:48 - INFO - __main__ - Step 41688: {'lr': 0.0004164119010824119, 'samples': 8004096, 'steps': 41687, 'loss/train': 1.4738942384719849} 11/07/2021 03:04:49 - INFO - __main__ - Step 41689: {'lr': 0.00041640794080089144, 'samples': 8004288, 'steps': 41688, 'loss/train': 1.233488917350769} 11/07/2021 03:04:49 - INFO - __main__ - Step 41690: {'lr': 0.0004164039804443902, 'samples': 8004480, 'steps': 41689, 'loss/train': 1.4913170337677002} 11/07/2021 03:04:50 - INFO - __main__ - Step 41691: {'lr': 0.0004164000200129099, 'samples': 8004672, 'steps': 41690, 'loss/train': 1.5144816637039185} 11/07/2021 03:04:50 - INFO - __main__ - Step 41692: {'lr': 0.0004163960595064522, 'samples': 8004864, 'steps': 41691, 'loss/train': 1.6226403713226318} 11/07/2021 03:04:51 - INFO - __main__ - Step 41693: {'lr': 0.00041639209892501913, 'samples': 8005056, 'steps': 41692, 'loss/train': 1.6253248453140259} 11/07/2021 03:04:52 - INFO - __main__ - Step 41694: {'lr': 0.00041638813826861234, 'samples': 8005248, 'steps': 41693, 'loss/train': 1.784563422203064} 11/07/2021 03:04:52 - INFO - __main__ - Step 41695: {'lr': 0.00041638417753723356, 'samples': 8005440, 'steps': 41694, 'loss/train': 0.6529554724693298} 11/07/2021 03:04:53 - INFO - __main__ - Step 41696: {'lr': 0.00041638021673088464, 'samples': 8005632, 'steps': 41695, 'loss/train': 1.8190524578094482} 11/07/2021 03:04:53 - INFO - __main__ - Step 41697: {'lr': 0.0004163762558495674, 'samples': 8005824, 'steps': 41696, 'loss/train': 1.8762617111206055} 11/07/2021 03:04:53 - INFO - __main__ - Step 41698: {'lr': 0.0004163722948932836, 'samples': 8006016, 'steps': 41697, 'loss/train': 0.7908387184143066} 11/07/2021 03:04:54 - INFO - __main__ - Step 41699: {'lr': 0.000416368333862035, 'samples': 8006208, 'steps': 41698, 'loss/train': 1.673041820526123} 11/07/2021 03:04:54 - INFO - __main__ - Step 41700: {'lr': 0.00041636437275582335, 'samples': 8006400, 'steps': 41699, 'loss/train': 1.4131288528442383} 11/07/2021 03:04:55 - INFO - __main__ - Step 41701: {'lr': 0.00041636041157465056, 'samples': 8006592, 'steps': 41700, 'loss/train': 1.3762298822402954} 11/07/2021 03:04:56 - INFO - __main__ - Step 41702: {'lr': 0.00041635645031851826, 'samples': 8006784, 'steps': 41701, 'loss/train': 1.8060678243637085} 11/07/2021 03:04:56 - INFO - __main__ - Step 41703: {'lr': 0.00041635248898742834, 'samples': 8006976, 'steps': 41702, 'loss/train': 1.4579479694366455} 11/07/2021 03:04:56 - INFO - __main__ - Step 41704: {'lr': 0.00041634852758138253, 'samples': 8007168, 'steps': 41703, 'loss/train': 1.565946340560913} 11/07/2021 03:04:57 - INFO - __main__ - Step 41705: {'lr': 0.0004163445661003827, 'samples': 8007360, 'steps': 41704, 'loss/train': 2.0427615642547607} 11/07/2021 03:04:58 - INFO - __main__ - Step 41706: {'lr': 0.0004163406045444306, 'samples': 8007552, 'steps': 41705, 'loss/train': 1.6200158596038818} 11/07/2021 03:04:58 - INFO - __main__ - Step 41707: {'lr': 0.0004163366429135279, 'samples': 8007744, 'steps': 41706, 'loss/train': 1.5748037099838257} 11/07/2021 03:04:58 - INFO - __main__ - Step 41708: {'lr': 0.00041633268120767653, 'samples': 8007936, 'steps': 41707, 'loss/train': 1.799818992614746} 11/07/2021 03:04:59 - INFO - __main__ - Step 41709: {'lr': 0.00041632871942687814, 'samples': 8008128, 'steps': 41708, 'loss/train': 1.4548799991607666} 11/07/2021 03:04:59 - INFO - __main__ - Step 41710: {'lr': 0.00041632475757113466, 'samples': 8008320, 'steps': 41709, 'loss/train': 1.7368544340133667} 11/07/2021 03:05:00 - INFO - __main__ - Step 41711: {'lr': 0.00041632079564044776, 'samples': 8008512, 'steps': 41710, 'loss/train': 1.3731677532196045} 11/07/2021 03:05:01 - INFO - __main__ - Step 41712: {'lr': 0.0004163168336348194, 'samples': 8008704, 'steps': 41711, 'loss/train': 0.9456028342247009} 11/07/2021 03:05:01 - INFO - __main__ - Step 41713: {'lr': 0.00041631287155425114, 'samples': 8008896, 'steps': 41712, 'loss/train': 1.3551782369613647} 11/07/2021 03:05:01 - INFO - __main__ - Step 41714: {'lr': 0.0004163089093987449, 'samples': 8009088, 'steps': 41713, 'loss/train': 1.3990758657455444} 11/07/2021 03:05:02 - INFO - __main__ - Step 41715: {'lr': 0.00041630494716830244, 'samples': 8009280, 'steps': 41714, 'loss/train': 1.2092492580413818} 11/07/2021 03:05:03 - INFO - __main__ - Step 41716: {'lr': 0.00041630098486292546, 'samples': 8009472, 'steps': 41715, 'loss/train': 1.7054479122161865} 11/07/2021 03:05:03 - INFO - __main__ - Step 41717: {'lr': 0.0004162970224826159, 'samples': 8009664, 'steps': 41716, 'loss/train': 1.0781768560409546} 11/07/2021 03:05:03 - INFO - __main__ - Step 41718: {'lr': 0.0004162930600273754, 'samples': 8009856, 'steps': 41717, 'loss/train': 1.3864885568618774} 11/07/2021 03:05:04 - INFO - __main__ - Step 41719: {'lr': 0.0004162890974972059, 'samples': 8010048, 'steps': 41718, 'loss/train': 1.8126137256622314} 11/07/2021 03:05:04 - INFO - __main__ - Step 41720: {'lr': 0.00041628513489210906, 'samples': 8010240, 'steps': 41719, 'loss/train': 1.5357738733291626} 11/07/2021 03:05:05 - INFO - __main__ - Step 41721: {'lr': 0.0004162811722120867, 'samples': 8010432, 'steps': 41720, 'loss/train': 1.2699007987976074} 11/07/2021 03:05:05 - INFO - __main__ - Step 41722: {'lr': 0.00041627720945714065, 'samples': 8010624, 'steps': 41721, 'loss/train': 1.3326258659362793} 11/07/2021 03:05:06 - INFO - __main__ - Step 41723: {'lr': 0.00041627324662727263, 'samples': 8010816, 'steps': 41722, 'loss/train': 1.481400728225708} 11/07/2021 03:05:06 - INFO - __main__ - Step 41724: {'lr': 0.0004162692837224844, 'samples': 8011008, 'steps': 41723, 'loss/train': 1.7024741172790527} 11/07/2021 03:05:06 - INFO - __main__ - Step 41725: {'lr': 0.00041626532074277785, 'samples': 8011200, 'steps': 41724, 'loss/train': 1.259816288948059} 11/07/2021 03:05:07 - INFO - __main__ - Step 41726: {'lr': 0.00041626135768815467, 'samples': 8011392, 'steps': 41725, 'loss/train': 1.537654161453247} 11/07/2021 03:05:08 - INFO - __main__ - Step 41727: {'lr': 0.0004162573945586168, 'samples': 8011584, 'steps': 41726, 'loss/train': 1.597123146057129} 11/07/2021 03:05:08 - INFO - __main__ - Step 41728: {'lr': 0.0004162534313541658, 'samples': 8011776, 'steps': 41727, 'loss/train': 0.8959583044052124} 11/07/2021 03:05:09 - INFO - __main__ - Step 41729: {'lr': 0.00041624946807480357, 'samples': 8011968, 'steps': 41728, 'loss/train': 1.5581363439559937} 11/07/2021 03:05:09 - INFO - __main__ - Step 41730: {'lr': 0.0004162455047205319, 'samples': 8012160, 'steps': 41729, 'loss/train': 0.8903040885925293} 11/07/2021 03:05:10 - INFO - __main__ - Step 41731: {'lr': 0.0004162415412913526, 'samples': 8012352, 'steps': 41730, 'loss/train': 1.7477757930755615} 11/07/2021 03:05:10 - INFO - __main__ - Step 41732: {'lr': 0.00041623757778726743, 'samples': 8012544, 'steps': 41731, 'loss/train': 1.3590712547302246} 11/07/2021 03:05:10 - INFO - __main__ - Step 41733: {'lr': 0.00041623361420827816, 'samples': 8012736, 'steps': 41732, 'loss/train': 0.7484281659126282} 11/07/2021 03:05:11 - INFO - __main__ - Step 41734: {'lr': 0.0004162296505543867, 'samples': 8012928, 'steps': 41733, 'loss/train': 1.5739892721176147} 11/07/2021 03:05:11 - INFO - __main__ - Step 41735: {'lr': 0.00041622568682559455, 'samples': 8013120, 'steps': 41734, 'loss/train': 1.2763605117797852} 11/07/2021 03:05:12 - INFO - __main__ - Step 41736: {'lr': 0.0004162217230219038, 'samples': 8013312, 'steps': 41735, 'loss/train': 1.4866390228271484} 11/07/2021 03:05:13 - INFO - __main__ - Step 41737: {'lr': 0.00041621775914331595, 'samples': 8013504, 'steps': 41736, 'loss/train': 1.272396445274353} 11/07/2021 03:05:14 - INFO - __main__ - Step 41738: {'lr': 0.00041621379518983306, 'samples': 8013696, 'steps': 41737, 'loss/train': 1.5573691129684448} 11/07/2021 03:05:14 - INFO - __main__ - Step 41739: {'lr': 0.00041620983116145673, 'samples': 8013888, 'steps': 41738, 'loss/train': 1.3005025386810303} 11/07/2021 03:05:14 - INFO - __main__ - Step 41740: {'lr': 0.00041620586705818887, 'samples': 8014080, 'steps': 41739, 'loss/train': 0.7410742044448853} 11/07/2021 03:05:15 - INFO - __main__ - Step 41741: {'lr': 0.00041620190288003126, 'samples': 8014272, 'steps': 41740, 'loss/train': 1.2668712139129639} 11/07/2021 03:05:15 - INFO - __main__ - Step 41742: {'lr': 0.00041619793862698553, 'samples': 8014464, 'steps': 41741, 'loss/train': 1.555660367012024} 11/07/2021 03:05:16 - INFO - __main__ - Step 41743: {'lr': 0.00041619397429905363, 'samples': 8014656, 'steps': 41742, 'loss/train': 0.837822675704956} 11/07/2021 03:05:16 - INFO - __main__ - Step 41744: {'lr': 0.0004161900098962373, 'samples': 8014848, 'steps': 41743, 'loss/train': 1.4115031957626343} 11/07/2021 03:05:17 - INFO - __main__ - Step 41745: {'lr': 0.00041618604541853826, 'samples': 8015040, 'steps': 41744, 'loss/train': 1.5460307598114014} 11/07/2021 03:05:17 - INFO - __main__ - Step 41746: {'lr': 0.00041618208086595843, 'samples': 8015232, 'steps': 41745, 'loss/train': 1.339052677154541} 11/07/2021 03:05:17 - INFO - __main__ - Step 41747: {'lr': 0.0004161781162384994, 'samples': 8015424, 'steps': 41746, 'loss/train': 1.7926512956619263} 11/07/2021 03:05:19 - INFO - __main__ - Step 41748: {'lr': 0.00041617415153616323, 'samples': 8015616, 'steps': 41747, 'loss/train': 1.718548059463501} 11/07/2021 03:05:19 - INFO - __main__ - Step 41749: {'lr': 0.00041617018675895145, 'samples': 8015808, 'steps': 41748, 'loss/train': 1.0688071250915527} 11/07/2021 03:05:19 - INFO - __main__ - Step 41750: {'lr': 0.00041616622190686597, 'samples': 8016000, 'steps': 41749, 'loss/train': 1.0594754219055176} 11/07/2021 03:05:20 - INFO - __main__ - Step 41751: {'lr': 0.0004161622569799086, 'samples': 8016192, 'steps': 41750, 'loss/train': 1.49894118309021} 11/07/2021 03:05:20 - INFO - __main__ - Step 41752: {'lr': 0.00041615829197808095, 'samples': 8016384, 'steps': 41751, 'loss/train': 1.7414077520370483} 11/07/2021 03:05:20 - INFO - __main__ - Step 41753: {'lr': 0.0004161543269013851, 'samples': 8016576, 'steps': 41752, 'loss/train': 2.081005096435547} 11/07/2021 03:05:21 - INFO - __main__ - Step 41754: {'lr': 0.0004161503617498226, 'samples': 8016768, 'steps': 41753, 'loss/train': 1.1592674255371094} 11/07/2021 03:05:22 - INFO - __main__ - Step 41755: {'lr': 0.00041614639652339533, 'samples': 8016960, 'steps': 41754, 'loss/train': 0.987359344959259} 11/07/2021 03:05:22 - INFO - __main__ - Step 41756: {'lr': 0.00041614243122210505, 'samples': 8017152, 'steps': 41755, 'loss/train': 1.9649503231048584} 11/07/2021 03:05:22 - INFO - __main__ - Step 41757: {'lr': 0.0004161384658459535, 'samples': 8017344, 'steps': 41756, 'loss/train': 1.4676735401153564} 11/07/2021 03:05:23 - INFO - __main__ - Step 41758: {'lr': 0.0004161345003949426, 'samples': 8017536, 'steps': 41757, 'loss/train': 1.7614989280700684} 11/07/2021 03:05:24 - INFO - __main__ - Step 41759: {'lr': 0.00041613053486907396, 'samples': 8017728, 'steps': 41758, 'loss/train': 1.616061806678772} 11/07/2021 03:05:24 - INFO - __main__ - Step 41760: {'lr': 0.0004161265692683496, 'samples': 8017920, 'steps': 41759, 'loss/train': 1.723777413368225} 11/07/2021 03:05:25 - INFO - __main__ - Step 41761: {'lr': 0.0004161226035927711, 'samples': 8018112, 'steps': 41760, 'loss/train': 2.0317869186401367} 11/07/2021 03:05:25 - INFO - __main__ - Step 41762: {'lr': 0.0004161186378423403, 'samples': 8018304, 'steps': 41761, 'loss/train': 1.7671819925308228} 11/07/2021 03:05:25 - INFO - __main__ - Step 41763: {'lr': 0.000416114672017059, 'samples': 8018496, 'steps': 41762, 'loss/train': 1.7405064105987549} 11/07/2021 03:05:26 - INFO - __main__ - Step 41764: {'lr': 0.000416110706116929, 'samples': 8018688, 'steps': 41763, 'loss/train': 1.5564242601394653} 11/07/2021 03:05:27 - INFO - __main__ - Step 41765: {'lr': 0.0004161067401419521, 'samples': 8018880, 'steps': 41764, 'loss/train': 1.6028566360473633} 11/07/2021 03:05:27 - INFO - __main__ - Step 41766: {'lr': 0.00041610277409213003, 'samples': 8019072, 'steps': 41765, 'loss/train': 0.15148968994617462} 11/07/2021 03:05:27 - INFO - __main__ - Step 41767: {'lr': 0.00041609880796746463, 'samples': 8019264, 'steps': 41766, 'loss/train': 1.4200246334075928} 11/07/2021 03:05:28 - INFO - __main__ - Step 41768: {'lr': 0.00041609484176795774, 'samples': 8019456, 'steps': 41767, 'loss/train': 1.4033209085464478} 11/07/2021 03:05:29 - INFO - __main__ - Step 41769: {'lr': 0.000416090875493611, 'samples': 8019648, 'steps': 41768, 'loss/train': 1.5000132322311401} 11/07/2021 03:05:29 - INFO - __main__ - Step 41770: {'lr': 0.0004160869091444263, 'samples': 8019840, 'steps': 41769, 'loss/train': 1.6210057735443115} 11/07/2021 03:05:29 - INFO - __main__ - Step 41771: {'lr': 0.0004160829427204054, 'samples': 8020032, 'steps': 41770, 'loss/train': 1.5194448232650757} 11/07/2021 03:05:30 - INFO - __main__ - Step 41772: {'lr': 0.00041607897622155006, 'samples': 8020224, 'steps': 41771, 'loss/train': 1.3142369985580444} 11/07/2021 03:05:30 - INFO - __main__ - Step 41773: {'lr': 0.00041607500964786217, 'samples': 8020416, 'steps': 41772, 'loss/train': 1.448745846748352} 11/07/2021 03:05:31 - INFO - __main__ - Step 41774: {'lr': 0.0004160710429993434, 'samples': 8020608, 'steps': 41773, 'loss/train': 2.627546548843384} 11/07/2021 03:05:31 - INFO - __main__ - Step 41775: {'lr': 0.00041606707627599556, 'samples': 8020800, 'steps': 41774, 'loss/train': 1.5680979490280151} 11/07/2021 03:05:32 - INFO - __main__ - Step 41776: {'lr': 0.00041606310947782046, 'samples': 8020992, 'steps': 41775, 'loss/train': 1.5007060766220093} 11/07/2021 03:05:32 - INFO - __main__ - Step 41777: {'lr': 0.0004160591426048199, 'samples': 8021184, 'steps': 41776, 'loss/train': 1.1363593339920044} 11/07/2021 03:05:33 - INFO - __main__ - Step 41778: {'lr': 0.00041605517565699565, 'samples': 8021376, 'steps': 41777, 'loss/train': 1.7209640741348267} 11/07/2021 03:05:34 - INFO - __main__ - Step 41779: {'lr': 0.00041605120863434945, 'samples': 8021568, 'steps': 41778, 'loss/train': 1.527706503868103} 11/07/2021 03:05:34 - INFO - __main__ - Step 41780: {'lr': 0.0004160472415368832, 'samples': 8021760, 'steps': 41779, 'loss/train': 1.049204707145691} 11/07/2021 03:05:35 - INFO - __main__ - Step 41781: {'lr': 0.00041604327436459864, 'samples': 8021952, 'steps': 41780, 'loss/train': 1.3618234395980835} 11/07/2021 03:05:35 - INFO - __main__ - Step 41782: {'lr': 0.0004160393071174975, 'samples': 8022144, 'steps': 41781, 'loss/train': 1.4717530012130737} 11/07/2021 03:05:35 - INFO - __main__ - Step 41783: {'lr': 0.00041603533979558163, 'samples': 8022336, 'steps': 41782, 'loss/train': 1.55155611038208} 11/07/2021 03:05:36 - INFO - __main__ - Step 41784: {'lr': 0.0004160313723988528, 'samples': 8022528, 'steps': 41783, 'loss/train': 1.434744954109192} 11/07/2021 03:05:36 - INFO - __main__ - Step 41785: {'lr': 0.00041602740492731284, 'samples': 8022720, 'steps': 41784, 'loss/train': 5.639638423919678} 11/07/2021 03:05:37 - INFO - __main__ - Step 41786: {'lr': 0.0004160234373809634, 'samples': 8022912, 'steps': 41785, 'loss/train': 5.51109504699707} 11/07/2021 03:05:37 - INFO - __main__ - Step 41787: {'lr': 0.0004160194697598064, 'samples': 8023104, 'steps': 41786, 'loss/train': 1.7458086013793945} 11/07/2021 03:05:38 - INFO - __main__ - Step 41788: {'lr': 0.0004160155020638436, 'samples': 8023296, 'steps': 41787, 'loss/train': 1.6691575050354004} 11/07/2021 03:05:38 - INFO - __main__ - Step 41789: {'lr': 0.0004160115342930768, 'samples': 8023488, 'steps': 41788, 'loss/train': 1.4780073165893555} 11/07/2021 03:05:38 - INFO - __main__ - Step 41790: {'lr': 0.0004160075664475077, 'samples': 8023680, 'steps': 41789, 'loss/train': 1.584285020828247} 11/07/2021 03:05:39 - INFO - __main__ - Step 41791: {'lr': 0.0004160035985271382, 'samples': 8023872, 'steps': 41790, 'loss/train': 1.6199537515640259} 11/07/2021 03:05:40 - INFO - __main__ - Step 41792: {'lr': 0.00041599963053196997, 'samples': 8024064, 'steps': 41791, 'loss/train': 1.8177478313446045} 11/07/2021 03:05:40 - INFO - __main__ - Step 41793: {'lr': 0.0004159956624620049, 'samples': 8024256, 'steps': 41792, 'loss/train': 1.2697099447250366} 11/07/2021 03:05:41 - INFO - __main__ - Step 41794: {'lr': 0.0004159916943172448, 'samples': 8024448, 'steps': 41793, 'loss/train': 1.3568118810653687} 11/07/2021 03:05:41 - INFO - __main__ - Step 41795: {'lr': 0.0004159877260976914, 'samples': 8024640, 'steps': 41794, 'loss/train': 1.1664894819259644} 11/07/2021 03:05:41 - INFO - __main__ - Step 41796: {'lr': 0.00041598375780334653, 'samples': 8024832, 'steps': 41795, 'loss/train': 1.419095516204834} 11/07/2021 03:05:42 - INFO - __main__ - Step 41797: {'lr': 0.0004159797894342118, 'samples': 8025024, 'steps': 41796, 'loss/train': 1.4277313947677612} 11/07/2021 03:05:43 - INFO - __main__ - Step 41798: {'lr': 0.0004159758209902892, 'samples': 8025216, 'steps': 41797, 'loss/train': 1.4310158491134644} 11/07/2021 03:05:43 - INFO - __main__ - Step 41799: {'lr': 0.00041597185247158053, 'samples': 8025408, 'steps': 41798, 'loss/train': 1.2442349195480347} 11/07/2021 03:05:43 - INFO - __main__ - Step 41800: {'lr': 0.0004159678838780874, 'samples': 8025600, 'steps': 41799, 'loss/train': 1.3497828245162964} 11/07/2021 03:05:44 - INFO - __main__ - Step 41801: {'lr': 0.0004159639152098118, 'samples': 8025792, 'steps': 41800, 'loss/train': 1.1749296188354492} 11/07/2021 03:05:45 - INFO - __main__ - Step 41802: {'lr': 0.00041595994646675537, 'samples': 8025984, 'steps': 41801, 'loss/train': 1.4512723684310913} 11/07/2021 03:05:45 - INFO - __main__ - Step 41803: {'lr': 0.0004159559776489199, 'samples': 8026176, 'steps': 41802, 'loss/train': 1.5854878425598145} 11/07/2021 03:05:45 - INFO - __main__ - Step 41804: {'lr': 0.00041595200875630734, 'samples': 8026368, 'steps': 41803, 'loss/train': 1.5534003973007202} 11/07/2021 03:05:46 - INFO - __main__ - Step 41805: {'lr': 0.00041594803978891925, 'samples': 8026560, 'steps': 41804, 'loss/train': 1.3734766244888306} 11/07/2021 03:05:46 - INFO - __main__ - Step 41806: {'lr': 0.00041594407074675753, 'samples': 8026752, 'steps': 41805, 'loss/train': 1.9073400497436523} 11/07/2021 03:05:47 - INFO - __main__ - Step 41807: {'lr': 0.0004159401016298241, 'samples': 8026944, 'steps': 41806, 'loss/train': 1.948595404624939} 11/07/2021 03:05:48 - INFO - __main__ - Step 41808: {'lr': 0.0004159361324381206, 'samples': 8027136, 'steps': 41807, 'loss/train': 1.5608607530593872} 11/07/2021 03:05:48 - INFO - __main__ - Step 41809: {'lr': 0.0004159321631716487, 'samples': 8027328, 'steps': 41808, 'loss/train': 2.098839521408081} 11/07/2021 03:05:49 - INFO - __main__ - Step 41810: {'lr': 0.00041592819383041047, 'samples': 8027520, 'steps': 41809, 'loss/train': 1.6151118278503418} 11/07/2021 03:05:49 - INFO - __main__ - Step 41811: {'lr': 0.0004159242244144075, 'samples': 8027712, 'steps': 41810, 'loss/train': 1.6768019199371338} 11/07/2021 03:05:50 - INFO - __main__ - Step 41812: {'lr': 0.0004159202549236416, 'samples': 8027904, 'steps': 41811, 'loss/train': 1.2156720161437988} 11/07/2021 03:05:50 - INFO - __main__ - Step 41813: {'lr': 0.00041591628535811464, 'samples': 8028096, 'steps': 41812, 'loss/train': 1.1023814678192139} 11/07/2021 03:05:51 - INFO - __main__ - Step 41814: {'lr': 0.00041591231571782834, 'samples': 8028288, 'steps': 41813, 'loss/train': 1.4737012386322021} 11/07/2021 03:05:51 - INFO - __main__ - Step 41815: {'lr': 0.0004159083460027845, 'samples': 8028480, 'steps': 41814, 'loss/train': 1.6790229082107544} 11/07/2021 03:05:51 - INFO - __main__ - Step 41816: {'lr': 0.000415904376212985, 'samples': 8028672, 'steps': 41815, 'loss/train': 1.4413472414016724} 11/07/2021 03:05:52 - INFO - __main__ - Step 41817: {'lr': 0.00041590040634843144, 'samples': 8028864, 'steps': 41816, 'loss/train': 2.422144889831543} 11/07/2021 03:05:53 - INFO - __main__ - Step 41818: {'lr': 0.00041589643640912576, 'samples': 8029056, 'steps': 41817, 'loss/train': 1.58564031124115} 11/07/2021 03:05:53 - INFO - __main__ - Step 41819: {'lr': 0.0004158924663950697, 'samples': 8029248, 'steps': 41818, 'loss/train': 1.575334072113037} 11/07/2021 03:05:53 - INFO - __main__ - Step 41820: {'lr': 0.00041588849630626513, 'samples': 8029440, 'steps': 41819, 'loss/train': 1.7191028594970703} 11/07/2021 03:05:54 - INFO - __main__ - Step 41821: {'lr': 0.00041588452614271364, 'samples': 8029632, 'steps': 41820, 'loss/train': 1.5590040683746338} 11/07/2021 03:05:54 - INFO - __main__ - Step 41822: {'lr': 0.00041588055590441726, 'samples': 8029824, 'steps': 41821, 'loss/train': 1.3510241508483887} 11/07/2021 03:05:55 - INFO - __main__ - Step 41823: {'lr': 0.0004158765855913776, 'samples': 8030016, 'steps': 41822, 'loss/train': 1.5703097581863403} 11/07/2021 03:05:56 - INFO - __main__ - Step 41824: {'lr': 0.0004158726152035965, 'samples': 8030208, 'steps': 41823, 'loss/train': 1.4530744552612305} 11/07/2021 03:05:56 - INFO - __main__ - Step 41825: {'lr': 0.00041586864474107575, 'samples': 8030400, 'steps': 41824, 'loss/train': 1.1881881952285767} 11/07/2021 03:05:56 - INFO - __main__ - Step 41826: {'lr': 0.0004158646742038172, 'samples': 8030592, 'steps': 41825, 'loss/train': 1.6617271900177002} 11/07/2021 03:05:57 - INFO - __main__ - Step 41827: {'lr': 0.00041586070359182255, 'samples': 8030784, 'steps': 41826, 'loss/train': 1.8090656995773315} 11/07/2021 03:05:58 - INFO - __main__ - Step 41828: {'lr': 0.00041585673290509364, 'samples': 8030976, 'steps': 41827, 'loss/train': 1.0655546188354492} 11/07/2021 03:05:58 - INFO - __main__ - Step 41829: {'lr': 0.0004158527621436322, 'samples': 8031168, 'steps': 41828, 'loss/train': 1.5067369937896729} 11/07/2021 03:05:58 - INFO - __main__ - Step 41830: {'lr': 0.0004158487913074401, 'samples': 8031360, 'steps': 41829, 'loss/train': 1.3611478805541992} 11/07/2021 03:05:59 - INFO - __main__ - Step 41831: {'lr': 0.0004158448203965192, 'samples': 8031552, 'steps': 41830, 'loss/train': 1.7292866706848145} 11/07/2021 03:05:59 - INFO - __main__ - Step 41832: {'lr': 0.000415840849410871, 'samples': 8031744, 'steps': 41831, 'loss/train': 1.2231707572937012} 11/07/2021 03:06:00 - INFO - __main__ - Step 41833: {'lr': 0.0004158368783504975, 'samples': 8031936, 'steps': 41832, 'loss/train': 0.8280168175697327} 11/07/2021 03:06:00 - INFO - __main__ - Step 41834: {'lr': 0.00041583290721540055, 'samples': 8032128, 'steps': 41833, 'loss/train': 1.269150733947754} 11/07/2021 03:06:01 - INFO - __main__ - Step 41835: {'lr': 0.0004158289360055819, 'samples': 8032320, 'steps': 41834, 'loss/train': 0.977594256401062} 11/07/2021 03:06:01 - INFO - __main__ - Step 41836: {'lr': 0.00041582496472104314, 'samples': 8032512, 'steps': 41835, 'loss/train': 2.1492388248443604} 11/07/2021 03:06:01 - INFO - __main__ - Step 41837: {'lr': 0.0004158209933617863, 'samples': 8032704, 'steps': 41836, 'loss/train': 1.4113600254058838} 11/07/2021 03:06:02 - INFO - __main__ - Step 41838: {'lr': 0.00041581702192781305, 'samples': 8032896, 'steps': 41837, 'loss/train': 1.2576713562011719} 11/07/2021 03:06:03 - INFO - __main__ - Step 41839: {'lr': 0.0004158130504191252, 'samples': 8033088, 'steps': 41838, 'loss/train': 1.7074604034423828} 11/07/2021 03:06:03 - INFO - __main__ - Step 41840: {'lr': 0.0004158090788357246, 'samples': 8033280, 'steps': 41839, 'loss/train': 1.8725475072860718} 11/07/2021 03:06:03 - INFO - __main__ - Step 41841: {'lr': 0.0004158051071776129, 'samples': 8033472, 'steps': 41840, 'loss/train': 1.3763138055801392} 11/07/2021 03:06:04 - INFO - __main__ - Step 41842: {'lr': 0.00041580113544479203, 'samples': 8033664, 'steps': 41841, 'loss/train': 1.4608910083770752} 11/07/2021 03:06:05 - INFO - __main__ - Step 41843: {'lr': 0.00041579716363726376, 'samples': 8033856, 'steps': 41842, 'loss/train': 1.999753713607788} 11/07/2021 03:06:05 - INFO - __main__ - Step 41844: {'lr': 0.00041579319175502985, 'samples': 8034048, 'steps': 41843, 'loss/train': 0.8875548243522644} 11/07/2021 03:06:05 - INFO - __main__ - Step 41845: {'lr': 0.000415789219798092, 'samples': 8034240, 'steps': 41844, 'loss/train': 1.5030208826065063} 11/07/2021 03:06:06 - INFO - __main__ - Step 41846: {'lr': 0.00041578524776645216, 'samples': 8034432, 'steps': 41845, 'loss/train': 1.609982967376709} 11/07/2021 03:06:06 - INFO - __main__ - Step 41847: {'lr': 0.00041578127566011203, 'samples': 8034624, 'steps': 41846, 'loss/train': 1.7222378253936768} 11/07/2021 03:06:07 - INFO - __main__ - Step 41848: {'lr': 0.0004157773034790734, 'samples': 8034816, 'steps': 41847, 'loss/train': 1.5789622068405151} 11/07/2021 03:06:08 - INFO - __main__ - Step 41849: {'lr': 0.00041577333122333807, 'samples': 8035008, 'steps': 41848, 'loss/train': 0.21003803610801697} 11/07/2021 03:06:08 - INFO - __main__ - Step 41850: {'lr': 0.00041576935889290777, 'samples': 8035200, 'steps': 41849, 'loss/train': 1.580950140953064} 11/07/2021 03:06:08 - INFO - __main__ - Step 41851: {'lr': 0.0004157653864877845, 'samples': 8035392, 'steps': 41850, 'loss/train': 1.5882028341293335} 11/07/2021 03:06:09 - INFO - __main__ - Step 41852: {'lr': 0.00041576141400796984, 'samples': 8035584, 'steps': 41851, 'loss/train': 1.2623769044876099} 11/07/2021 03:06:09 - INFO - __main__ - Step 41853: {'lr': 0.00041575744145346563, 'samples': 8035776, 'steps': 41852, 'loss/train': 1.2826652526855469} 11/07/2021 03:06:10 - INFO - __main__ - Step 41854: {'lr': 0.00041575346882427366, 'samples': 8035968, 'steps': 41853, 'loss/train': 1.4938825368881226} 11/07/2021 03:06:10 - INFO - __main__ - Step 41855: {'lr': 0.00041574949612039583, 'samples': 8036160, 'steps': 41854, 'loss/train': 1.9276888370513916} 11/07/2021 03:06:11 - INFO - __main__ - Step 41856: {'lr': 0.0004157455233418337, 'samples': 8036352, 'steps': 41855, 'loss/train': 1.6604562997817993} 11/07/2021 03:06:11 - INFO - __main__ - Step 41857: {'lr': 0.0004157415504885893, 'samples': 8036544, 'steps': 41856, 'loss/train': 1.363741159439087} 11/07/2021 03:06:11 - INFO - __main__ - Step 41858: {'lr': 0.00041573757756066423, 'samples': 8036736, 'steps': 41857, 'loss/train': 0.9199839234352112} 11/07/2021 03:06:13 - INFO - __main__ - Step 41859: {'lr': 0.0004157336045580604, 'samples': 8036928, 'steps': 41858, 'loss/train': 1.5329149961471558} 11/07/2021 03:06:13 - INFO - __main__ - Step 41860: {'lr': 0.0004157296314807796, 'samples': 8037120, 'steps': 41859, 'loss/train': 1.2419116497039795} 11/07/2021 03:06:13 - INFO - __main__ - Step 41861: {'lr': 0.0004157256583288235, 'samples': 8037312, 'steps': 41860, 'loss/train': 0.6708611249923706} 11/07/2021 03:06:14 - INFO - __main__ - Step 41862: {'lr': 0.0004157216851021941, 'samples': 8037504, 'steps': 41861, 'loss/train': 2.5971829891204834} 11/07/2021 03:06:14 - INFO - __main__ - Step 41863: {'lr': 0.00041571771180089304, 'samples': 8037696, 'steps': 41862, 'loss/train': 1.778261423110962} 11/07/2021 03:06:15 - INFO - __main__ - Step 41864: {'lr': 0.0004157137384249221, 'samples': 8037888, 'steps': 41863, 'loss/train': 1.4639763832092285} 11/07/2021 03:06:15 - INFO - __main__ - Step 41865: {'lr': 0.00041570976497428303, 'samples': 8038080, 'steps': 41864, 'loss/train': 1.5496078729629517} 11/07/2021 03:06:16 - INFO - __main__ - Step 41866: {'lr': 0.0004157057914489778, 'samples': 8038272, 'steps': 41865, 'loss/train': 1.4821267127990723} 11/07/2021 03:06:16 - INFO - __main__ - Step 41867: {'lr': 0.00041570181784900806, 'samples': 8038464, 'steps': 41866, 'loss/train': 1.4663201570510864} 11/07/2021 03:06:16 - INFO - __main__ - Step 41868: {'lr': 0.0004156978441743756, 'samples': 8038656, 'steps': 41867, 'loss/train': 1.65707528591156} 11/07/2021 03:06:17 - INFO - __main__ - Step 41869: {'lr': 0.00041569387042508235, 'samples': 8038848, 'steps': 41868, 'loss/train': 1.712996244430542} 11/07/2021 03:06:18 - INFO - __main__ - Step 41870: {'lr': 0.0004156898966011299, 'samples': 8039040, 'steps': 41869, 'loss/train': 1.2954154014587402} 11/07/2021 03:06:18 - INFO - __main__ - Step 41871: {'lr': 0.0004156859227025202, 'samples': 8039232, 'steps': 41870, 'loss/train': 1.9032760858535767} 11/07/2021 03:06:18 - INFO - __main__ - Step 41872: {'lr': 0.0004156819487292549, 'samples': 8039424, 'steps': 41871, 'loss/train': 1.3794838190078735} 11/07/2021 03:06:19 - INFO - __main__ - Step 41873: {'lr': 0.00041567797468133595, 'samples': 8039616, 'steps': 41872, 'loss/train': 1.5694235563278198} 11/07/2021 03:06:20 - INFO - __main__ - Step 41874: {'lr': 0.00041567400055876505, 'samples': 8039808, 'steps': 41873, 'loss/train': 2.1152632236480713} 11/07/2021 03:06:20 - INFO - __main__ - Step 41875: {'lr': 0.00041567002636154406, 'samples': 8040000, 'steps': 41874, 'loss/train': 1.4536792039871216} 11/07/2021 03:06:21 - INFO - __main__ - Step 41876: {'lr': 0.0004156660520896746, 'samples': 8040192, 'steps': 41875, 'loss/train': 1.0573084354400635} 11/07/2021 03:06:21 - INFO - __main__ - Step 41877: {'lr': 0.00041566207774315866, 'samples': 8040384, 'steps': 41876, 'loss/train': 1.8460909128189087} 11/07/2021 03:06:21 - INFO - __main__ - Step 41878: {'lr': 0.0004156581033219979, 'samples': 8040576, 'steps': 41877, 'loss/train': 1.110854983329773} 11/07/2021 03:06:22 - INFO - __main__ - Step 41879: {'lr': 0.0004156541288261941, 'samples': 8040768, 'steps': 41878, 'loss/train': 1.5497623682022095} 11/07/2021 03:06:23 - INFO - __main__ - Step 41880: {'lr': 0.00041565015425574917, 'samples': 8040960, 'steps': 41879, 'loss/train': 1.6084986925125122} 11/07/2021 03:06:23 - INFO - __main__ - Step 41881: {'lr': 0.00041564617961066487, 'samples': 8041152, 'steps': 41880, 'loss/train': 1.2137629985809326} 11/07/2021 03:06:23 - INFO - __main__ - Step 41882: {'lr': 0.00041564220489094295, 'samples': 8041344, 'steps': 41881, 'loss/train': 1.3202635049819946} 11/07/2021 03:06:24 - INFO - __main__ - Step 41883: {'lr': 0.00041563823009658514, 'samples': 8041536, 'steps': 41882, 'loss/train': 1.5626225471496582} 11/07/2021 03:06:24 - INFO - __main__ - Step 41884: {'lr': 0.00041563425522759336, 'samples': 8041728, 'steps': 41883, 'loss/train': 1.4322011470794678} 11/07/2021 03:06:25 - INFO - __main__ - Step 41885: {'lr': 0.0004156302802839693, 'samples': 8041920, 'steps': 41884, 'loss/train': 1.412869930267334} 11/07/2021 03:06:25 - INFO - __main__ - Step 41886: {'lr': 0.0004156263052657148, 'samples': 8042112, 'steps': 41885, 'loss/train': 1.515254259109497} 11/07/2021 03:06:26 - INFO - __main__ - Step 41887: {'lr': 0.0004156223301728316, 'samples': 8042304, 'steps': 41886, 'loss/train': 1.881744146347046} 11/07/2021 03:06:26 - INFO - __main__ - Step 41888: {'lr': 0.0004156183550053216, 'samples': 8042496, 'steps': 41887, 'loss/train': 1.3872469663619995} 11/07/2021 03:06:26 - INFO - __main__ - Step 41889: {'lr': 0.0004156143797631866, 'samples': 8042688, 'steps': 41888, 'loss/train': 1.8121181726455688} 11/07/2021 03:06:28 - INFO - __main__ - Step 41890: {'lr': 0.0004156104044464282, 'samples': 8042880, 'steps': 41889, 'loss/train': 1.5599640607833862} 11/07/2021 03:06:28 - INFO - __main__ - Step 41891: {'lr': 0.00041560642905504833, 'samples': 8043072, 'steps': 41890, 'loss/train': 3.547255039215088} 11/07/2021 03:06:28 - INFO - __main__ - Step 41892: {'lr': 0.0004156024535890487, 'samples': 8043264, 'steps': 41891, 'loss/train': 1.33156156539917} 11/07/2021 03:06:29 - INFO - __main__ - Step 41893: {'lr': 0.00041559847804843123, 'samples': 8043456, 'steps': 41892, 'loss/train': 1.6302646398544312} 11/07/2021 03:06:29 - INFO - __main__ - Step 41894: {'lr': 0.0004155945024331976, 'samples': 8043648, 'steps': 41893, 'loss/train': 1.2952686548233032} 11/07/2021 03:06:30 - INFO - __main__ - Step 41895: {'lr': 0.00041559052674334975, 'samples': 8043840, 'steps': 41894, 'loss/train': 1.4727513790130615} 11/07/2021 03:06:30 - INFO - __main__ - Step 41896: {'lr': 0.0004155865509788893, 'samples': 8044032, 'steps': 41895, 'loss/train': 0.39268600940704346} 11/07/2021 03:06:31 - INFO - __main__ - Step 41897: {'lr': 0.00041558257513981805, 'samples': 8044224, 'steps': 41896, 'loss/train': 1.482244849205017} 11/07/2021 03:06:31 - INFO - __main__ - Step 41898: {'lr': 0.00041557859922613795, 'samples': 8044416, 'steps': 41897, 'loss/train': 1.8280117511749268} 11/07/2021 03:06:31 - INFO - __main__ - Step 41899: {'lr': 0.00041557462323785053, 'samples': 8044608, 'steps': 41898, 'loss/train': 1.4932671785354614} 11/07/2021 03:06:32 - INFO - __main__ - Step 41900: {'lr': 0.00041557064717495786, 'samples': 8044800, 'steps': 41899, 'loss/train': 1.7836129665374756} 11/07/2021 03:06:33 - INFO - __main__ - Step 41901: {'lr': 0.00041556667103746157, 'samples': 8044992, 'steps': 41900, 'loss/train': 1.612501859664917} 11/07/2021 03:06:33 - INFO - __main__ - Step 41902: {'lr': 0.00041556269482536355, 'samples': 8045184, 'steps': 41901, 'loss/train': 1.2717280387878418} 11/07/2021 03:06:34 - INFO - __main__ - Step 41903: {'lr': 0.00041555871853866553, 'samples': 8045376, 'steps': 41902, 'loss/train': 1.4974288940429688} 11/07/2021 03:06:34 - INFO - __main__ - Step 41904: {'lr': 0.00041555474217736926, 'samples': 8045568, 'steps': 41903, 'loss/train': 1.5328596830368042} 11/07/2021 03:06:34 - INFO - __main__ - Step 41905: {'lr': 0.0004155507657414766, 'samples': 8045760, 'steps': 41904, 'loss/train': 1.8202886581420898} 11/07/2021 03:06:35 - INFO - __main__ - Step 41906: {'lr': 0.0004155467892309893, 'samples': 8045952, 'steps': 41905, 'loss/train': 1.405146837234497} 11/07/2021 03:06:36 - INFO - __main__ - Step 41907: {'lr': 0.0004155428126459092, 'samples': 8046144, 'steps': 41906, 'loss/train': 1.0993465185165405} 11/07/2021 03:06:36 - INFO - __main__ - Step 41908: {'lr': 0.00041553883598623804, 'samples': 8046336, 'steps': 41907, 'loss/train': 1.3472046852111816} 11/07/2021 03:06:36 - INFO - __main__ - Step 41909: {'lr': 0.00041553485925197763, 'samples': 8046528, 'steps': 41908, 'loss/train': 2.0090084075927734} 11/07/2021 03:06:37 - INFO - __main__ - Step 41910: {'lr': 0.00041553088244312975, 'samples': 8046720, 'steps': 41909, 'loss/train': 1.7823117971420288} 11/07/2021 03:06:38 - INFO - __main__ - Step 41911: {'lr': 0.0004155269055596963, 'samples': 8046912, 'steps': 41910, 'loss/train': 0.9449849128723145} 11/07/2021 03:06:38 - INFO - __main__ - Step 41912: {'lr': 0.0004155229286016789, 'samples': 8047104, 'steps': 41911, 'loss/train': 1.4676487445831299} 11/07/2021 03:06:39 - INFO - __main__ - Step 41913: {'lr': 0.0004155189515690794, 'samples': 8047296, 'steps': 41912, 'loss/train': 1.838443636894226} 11/07/2021 03:06:39 - INFO - __main__ - Step 41914: {'lr': 0.0004155149744618997, 'samples': 8047488, 'steps': 41913, 'loss/train': 1.6639630794525146} 11/07/2021 03:06:39 - INFO - __main__ - Step 41915: {'lr': 0.0004155109972801414, 'samples': 8047680, 'steps': 41914, 'loss/train': 1.3994457721710205} 11/07/2021 03:06:40 - INFO - __main__ - Step 41916: {'lr': 0.0004155070200238065, 'samples': 8047872, 'steps': 41915, 'loss/train': 1.8797807693481445} 11/07/2021 03:06:41 - INFO - __main__ - Step 41917: {'lr': 0.00041550304269289664, 'samples': 8048064, 'steps': 41916, 'loss/train': 1.755118489265442} 11/07/2021 03:06:41 - INFO - __main__ - Step 41918: {'lr': 0.00041549906528741366, 'samples': 8048256, 'steps': 41917, 'loss/train': 2.0328023433685303} 11/07/2021 03:06:41 - INFO - __main__ - Step 41919: {'lr': 0.0004154950878073594, 'samples': 8048448, 'steps': 41918, 'loss/train': 1.6511229276657104} 11/07/2021 03:06:42 - INFO - __main__ - Step 41920: {'lr': 0.0004154911102527356, 'samples': 8048640, 'steps': 41919, 'loss/train': 1.6904728412628174} 11/07/2021 03:06:42 - INFO - __main__ - Step 41921: {'lr': 0.00041548713262354396, 'samples': 8048832, 'steps': 41920, 'loss/train': 1.1226547956466675} 11/07/2021 03:06:43 - INFO - __main__ - Step 41922: {'lr': 0.0004154831549197865, 'samples': 8049024, 'steps': 41921, 'loss/train': 1.5018033981323242} 11/07/2021 03:06:43 - INFO - __main__ - Step 41923: {'lr': 0.0004154791771414648, 'samples': 8049216, 'steps': 41922, 'loss/train': 2.225022077560425} 11/07/2021 03:06:44 - INFO - __main__ - Step 41924: {'lr': 0.0004154751992885808, 'samples': 8049408, 'steps': 41923, 'loss/train': 0.46176427602767944} 11/07/2021 03:06:44 - INFO - __main__ - Step 41925: {'lr': 0.0004154712213611362, 'samples': 8049600, 'steps': 41924, 'loss/train': 1.9820481538772583} 11/07/2021 03:06:44 - INFO - __main__ - Step 41926: {'lr': 0.0004154672433591328, 'samples': 8049792, 'steps': 41925, 'loss/train': 1.159741997718811} 11/07/2021 03:06:46 - INFO - __main__ - Step 41927: {'lr': 0.0004154632652825724, 'samples': 8049984, 'steps': 41926, 'loss/train': 1.5462394952774048} 11/07/2021 03:06:46 - INFO - __main__ - Step 41928: {'lr': 0.00041545928713145687, 'samples': 8050176, 'steps': 41927, 'loss/train': 1.55469810962677} 11/07/2021 03:06:46 - INFO - __main__ - Step 41929: {'lr': 0.00041545530890578784, 'samples': 8050368, 'steps': 41928, 'loss/train': 1.1376702785491943} 11/07/2021 03:06:47 - INFO - __main__ - Step 41930: {'lr': 0.00041545133060556734, 'samples': 8050560, 'steps': 41929, 'loss/train': 1.464155912399292} 11/07/2021 03:06:47 - INFO - __main__ - Step 41931: {'lr': 0.00041544735223079693, 'samples': 8050752, 'steps': 41930, 'loss/train': 1.6106220483779907} 11/07/2021 03:06:48 - INFO - __main__ - Step 41932: {'lr': 0.0004154433737814786, 'samples': 8050944, 'steps': 41931, 'loss/train': 1.9208873510360718} 11/07/2021 03:06:48 - INFO - __main__ - Step 41933: {'lr': 0.0004154393952576139, 'samples': 8051136, 'steps': 41932, 'loss/train': 1.5260471105575562} 11/07/2021 03:06:49 - INFO - __main__ - Step 41934: {'lr': 0.00041543541665920483, 'samples': 8051328, 'steps': 41933, 'loss/train': 1.2168070077896118} 11/07/2021 03:06:49 - INFO - __main__ - Step 41935: {'lr': 0.000415431437986253, 'samples': 8051520, 'steps': 41934, 'loss/train': 1.0677088499069214} 11/07/2021 03:06:49 - INFO - __main__ - Step 41936: {'lr': 0.00041542745923876047, 'samples': 8051712, 'steps': 41935, 'loss/train': 0.22388498485088348} 11/07/2021 03:06:51 - INFO - __main__ - Step 41937: {'lr': 0.00041542348041672886, 'samples': 8051904, 'steps': 41936, 'loss/train': 0.9962683916091919} 11/07/2021 03:06:51 - INFO - __main__ - Step 41938: {'lr': 0.00041541950152015997, 'samples': 8052096, 'steps': 41937, 'loss/train': 1.61919105052948} 11/07/2021 03:06:51 - INFO - __main__ - Step 41939: {'lr': 0.0004154155225490555, 'samples': 8052288, 'steps': 41938, 'loss/train': 2.00990629196167} 11/07/2021 03:06:52 - INFO - __main__ - Step 41940: {'lr': 0.0004154115435034175, 'samples': 8052480, 'steps': 41939, 'loss/train': 1.5125808715820312} 11/07/2021 03:06:52 - INFO - __main__ - Step 41941: {'lr': 0.00041540756438324746, 'samples': 8052672, 'steps': 41940, 'loss/train': 2.0472888946533203} 11/07/2021 03:06:52 - INFO - __main__ - Step 41942: {'lr': 0.0004154035851885474, 'samples': 8052864, 'steps': 41941, 'loss/train': 1.1914267539978027} 11/07/2021 03:06:54 - INFO - __main__ - Step 41943: {'lr': 0.0004153996059193191, 'samples': 8053056, 'steps': 41942, 'loss/train': 0.6326480507850647} 11/07/2021 03:06:54 - INFO - __main__ - Step 41944: {'lr': 0.0004153956265755642, 'samples': 8053248, 'steps': 41943, 'loss/train': 1.6083602905273438} 11/07/2021 03:06:54 - INFO - __main__ - Step 41945: {'lr': 0.0004153916471572846, 'samples': 8053440, 'steps': 41944, 'loss/train': 0.9648074507713318} 11/07/2021 03:06:55 - INFO - __main__ - Step 41946: {'lr': 0.0004153876676644821, 'samples': 8053632, 'steps': 41945, 'loss/train': 1.5843206644058228} 11/07/2021 03:06:55 - INFO - __main__ - Step 41947: {'lr': 0.0004153836880971585, 'samples': 8053824, 'steps': 41946, 'loss/train': 0.8655702471733093} 11/07/2021 03:06:56 - INFO - __main__ - Step 41948: {'lr': 0.00041537970845531547, 'samples': 8054016, 'steps': 41947, 'loss/train': 1.5090172290802002} 11/07/2021 03:06:56 - INFO - __main__ - Step 41949: {'lr': 0.00041537572873895503, 'samples': 8054208, 'steps': 41948, 'loss/train': 1.765095829963684} 11/07/2021 03:06:57 - INFO - __main__ - Step 41950: {'lr': 0.00041537174894807873, 'samples': 8054400, 'steps': 41949, 'loss/train': 1.4930294752120972} 11/07/2021 03:06:57 - INFO - __main__ - Step 41951: {'lr': 0.00041536776908268847, 'samples': 8054592, 'steps': 41950, 'loss/train': 1.2215300798416138} 11/07/2021 03:06:57 - INFO - __main__ - Step 41952: {'lr': 0.00041536378914278603, 'samples': 8054784, 'steps': 41951, 'loss/train': 1.7548632621765137} 11/07/2021 03:06:58 - INFO - __main__ - Step 41953: {'lr': 0.00041535980912837326, 'samples': 8054976, 'steps': 41952, 'loss/train': 1.6592971086502075} 11/07/2021 03:07:00 - INFO - __main__ - Step 41954: {'lr': 0.00041535582903945195, 'samples': 8055168, 'steps': 41953, 'loss/train': 1.9381015300750732} 11/07/2021 03:07:00 - INFO - __main__ - Step 41955: {'lr': 0.00041535184887602384, 'samples': 8055360, 'steps': 41954, 'loss/train': 1.328749656677246} 11/07/2021 03:07:00 - INFO - __main__ - Step 41956: {'lr': 0.0004153478686380907, 'samples': 8055552, 'steps': 41955, 'loss/train': 1.5953503847122192} 11/07/2021 03:07:01 - INFO - __main__ - Step 41957: {'lr': 0.0004153438883256544, 'samples': 8055744, 'steps': 41956, 'loss/train': 1.5133731365203857} 11/07/2021 03:07:01 - INFO - __main__ - Step 41958: {'lr': 0.0004153399079387167, 'samples': 8055936, 'steps': 41957, 'loss/train': 1.5228304862976074} 11/07/2021 03:07:01 - INFO - __main__ - Step 41959: {'lr': 0.00041533592747727935, 'samples': 8056128, 'steps': 41958, 'loss/train': 1.3258435726165771} 11/07/2021 03:07:02 - INFO - __main__ - Step 41960: {'lr': 0.00041533194694134414, 'samples': 8056320, 'steps': 41959, 'loss/train': 1.7467302083969116} 11/07/2021 03:07:03 - INFO - __main__ - Step 41961: {'lr': 0.00041532796633091297, 'samples': 8056512, 'steps': 41960, 'loss/train': 1.7610151767730713} 11/07/2021 03:07:03 - INFO - __main__ - Step 41962: {'lr': 0.00041532398564598757, 'samples': 8056704, 'steps': 41961, 'loss/train': 1.536446213722229} 11/07/2021 03:07:03 - INFO - __main__ - Step 41963: {'lr': 0.0004153200048865697, 'samples': 8056896, 'steps': 41962, 'loss/train': 1.8210513591766357} 11/07/2021 03:07:04 - INFO - __main__ - Step 41964: {'lr': 0.0004153160240526612, 'samples': 8057088, 'steps': 41963, 'loss/train': 1.6114858388900757} 11/07/2021 03:07:04 - INFO - __main__ - Step 41965: {'lr': 0.0004153120431442639, 'samples': 8057280, 'steps': 41964, 'loss/train': 1.4403822422027588} 11/07/2021 03:07:05 - INFO - __main__ - Step 41966: {'lr': 0.00041530806216137953, 'samples': 8057472, 'steps': 41965, 'loss/train': 1.6566977500915527} 11/07/2021 03:07:06 - INFO - __main__ - Step 41967: {'lr': 0.00041530408110400987, 'samples': 8057664, 'steps': 41966, 'loss/train': 1.118053913116455} 11/07/2021 03:07:06 - INFO - __main__ - Step 41968: {'lr': 0.00041530009997215665, 'samples': 8057856, 'steps': 41967, 'loss/train': 1.4030784368515015} 11/07/2021 03:07:06 - INFO - __main__ - Step 41969: {'lr': 0.00041529611876582194, 'samples': 8058048, 'steps': 41968, 'loss/train': 1.3769654035568237} 11/07/2021 03:07:07 - INFO - __main__ - Step 41970: {'lr': 0.00041529213748500726, 'samples': 8058240, 'steps': 41969, 'loss/train': 1.3986589908599854} 11/07/2021 03:07:07 - INFO - __main__ - Step 41971: {'lr': 0.0004152881561297145, 'samples': 8058432, 'steps': 41970, 'loss/train': 1.9423401355743408} 11/07/2021 03:07:08 - INFO - __main__ - Step 41972: {'lr': 0.0004152841746999454, 'samples': 8058624, 'steps': 41971, 'loss/train': 1.5982258319854736} 11/07/2021 03:07:08 - INFO - __main__ - Step 41973: {'lr': 0.00041528019319570186, 'samples': 8058816, 'steps': 41972, 'loss/train': 1.6509596109390259} 11/07/2021 03:07:09 - INFO - __main__ - Step 41974: {'lr': 0.0004152762116169856, 'samples': 8059008, 'steps': 41973, 'loss/train': 1.199228048324585} 11/07/2021 03:07:09 - INFO - __main__ - Step 41975: {'lr': 0.00041527222996379844, 'samples': 8059200, 'steps': 41974, 'loss/train': 1.4981478452682495} 11/07/2021 03:07:09 - INFO - __main__ - Step 41976: {'lr': 0.0004152682482361422, 'samples': 8059392, 'steps': 41975, 'loss/train': 1.662643313407898} 11/07/2021 03:07:11 - INFO - __main__ - Step 41977: {'lr': 0.0004152642664340185, 'samples': 8059584, 'steps': 41976, 'loss/train': 1.1987229585647583} 11/07/2021 03:07:11 - INFO - __main__ - Step 41978: {'lr': 0.00041526028455742936, 'samples': 8059776, 'steps': 41977, 'loss/train': 0.48487409949302673} 11/07/2021 03:07:12 - INFO - __main__ - Step 41979: {'lr': 0.0004152563026063765, 'samples': 8059968, 'steps': 41978, 'loss/train': 1.4140325784683228} 11/07/2021 03:07:12 - INFO - __main__ - Step 41980: {'lr': 0.00041525232058086173, 'samples': 8060160, 'steps': 41979, 'loss/train': 1.611794114112854} 11/07/2021 03:07:12 - INFO - __main__ - Step 41981: {'lr': 0.0004152483384808867, 'samples': 8060352, 'steps': 41980, 'loss/train': 1.3852312564849854} 11/07/2021 03:07:13 - INFO - __main__ - Step 41982: {'lr': 0.0004152443563064534, 'samples': 8060544, 'steps': 41981, 'loss/train': 0.39391547441482544} 11/07/2021 03:07:14 - INFO - __main__ - Step 41983: {'lr': 0.00041524037405756356, 'samples': 8060736, 'steps': 41982, 'loss/train': 1.5800052881240845} 11/07/2021 03:07:14 - INFO - __main__ - Step 41984: {'lr': 0.0004152363917342189, 'samples': 8060928, 'steps': 41983, 'loss/train': 1.337543249130249} 11/07/2021 03:07:15 - INFO - __main__ - Step 41985: {'lr': 0.00041523240933642134, 'samples': 8061120, 'steps': 41984, 'loss/train': 1.818328857421875} 11/07/2021 03:07:15 - INFO - __main__ - Step 41986: {'lr': 0.00041522842686417255, 'samples': 8061312, 'steps': 41985, 'loss/train': 1.1234190464019775} 11/07/2021 03:07:15 - INFO - __main__ - Step 41987: {'lr': 0.0004152244443174744, 'samples': 8061504, 'steps': 41986, 'loss/train': 0.8166891932487488} 11/07/2021 03:07:16 - INFO - __main__ - Step 41988: {'lr': 0.00041522046169632863, 'samples': 8061696, 'steps': 41987, 'loss/train': 2.011791706085205} 11/07/2021 03:07:16 - INFO - __main__ - Step 41989: {'lr': 0.0004152164790007371, 'samples': 8061888, 'steps': 41988, 'loss/train': 1.5016108751296997} 11/07/2021 03:07:17 - INFO - __main__ - Step 41990: {'lr': 0.00041521249623070164, 'samples': 8062080, 'steps': 41989, 'loss/train': 1.6521884202957153} 11/07/2021 03:07:17 - INFO - __main__ - Step 41991: {'lr': 0.0004152085133862239, 'samples': 8062272, 'steps': 41990, 'loss/train': 1.3991754055023193} 11/07/2021 03:07:18 - INFO - __main__ - Step 41992: {'lr': 0.0004152045304673058, 'samples': 8062464, 'steps': 41991, 'loss/train': 1.167265772819519} 11/07/2021 03:07:18 - INFO - __main__ - Step 41993: {'lr': 0.000415200547473949, 'samples': 8062656, 'steps': 41992, 'loss/train': 1.5853790044784546} 11/07/2021 03:07:19 - INFO - __main__ - Step 41994: {'lr': 0.00041519656440615544, 'samples': 8062848, 'steps': 41993, 'loss/train': 1.2686554193496704} 11/07/2021 03:07:20 - INFO - __main__ - Step 41995: {'lr': 0.00041519258126392685, 'samples': 8063040, 'steps': 41994, 'loss/train': 1.9380366802215576} 11/07/2021 03:07:20 - INFO - __main__ - Step 41996: {'lr': 0.00041518859804726507, 'samples': 8063232, 'steps': 41995, 'loss/train': 1.2416760921478271} 11/07/2021 03:07:20 - INFO - __main__ - Step 41997: {'lr': 0.00041518461475617183, 'samples': 8063424, 'steps': 41996, 'loss/train': 1.2034986019134521} 11/07/2021 03:07:21 - INFO - __main__ - Step 41998: {'lr': 0.00041518063139064893, 'samples': 8063616, 'steps': 41997, 'loss/train': 1.7212826013565063} 11/07/2021 03:07:21 - INFO - __main__ - Step 41999: {'lr': 0.0004151766479506982, 'samples': 8063808, 'steps': 41998, 'loss/train': 1.2917431592941284} 11/07/2021 03:07:23 - INFO - __main__ - Step 42000: {'lr': 0.0004151726644363214, 'samples': 8064000, 'steps': 41999, 'loss/train': 1.8012648820877075} 11/07/2021 03:07:23 - INFO - __main__ - Step 42001: {'lr': 0.00041516868084752034, 'samples': 8064192, 'steps': 42000, 'loss/train': 1.4202122688293457} 11/07/2021 03:07:23 - INFO - __main__ - Step 42002: {'lr': 0.0004151646971842968, 'samples': 8064384, 'steps': 42001, 'loss/train': 1.7842530012130737} 11/07/2021 03:07:24 - INFO - __main__ - Step 42003: {'lr': 0.00041516071344665275, 'samples': 8064576, 'steps': 42002, 'loss/train': 1.8126933574676514} 11/07/2021 03:07:24 - INFO - __main__ - Step 42004: {'lr': 0.00041515672963458975, 'samples': 8064768, 'steps': 42003, 'loss/train': 1.757590413093567} 11/07/2021 03:07:25 - INFO - __main__ - Step 42005: {'lr': 0.00041515274574810965, 'samples': 8064960, 'steps': 42004, 'loss/train': 1.5378392934799194} 11/07/2021 03:07:25 - INFO - __main__ - Step 42006: {'lr': 0.00041514876178721426, 'samples': 8065152, 'steps': 42005, 'loss/train': 1.984458565711975} 11/07/2021 03:07:25 - INFO - __main__ - Step 42007: {'lr': 0.0004151447777519054, 'samples': 8065344, 'steps': 42006, 'loss/train': 1.7512965202331543} 11/07/2021 03:07:26 - INFO - __main__ - Step 42008: {'lr': 0.00041514079364218483, 'samples': 8065536, 'steps': 42007, 'loss/train': 1.270204782485962} 11/07/2021 03:07:27 - INFO - __main__ - Step 42009: {'lr': 0.0004151368094580544, 'samples': 8065728, 'steps': 42008, 'loss/train': 1.3795816898345947} 11/07/2021 03:07:27 - INFO - __main__ - Step 42010: {'lr': 0.0004151328251995159, 'samples': 8065920, 'steps': 42009, 'loss/train': 1.1478238105773926} 11/07/2021 03:07:27 - INFO - __main__ - Step 42011: {'lr': 0.000415128840866571, 'samples': 8066112, 'steps': 42010, 'loss/train': 5.797067642211914} 11/07/2021 03:07:28 - INFO - __main__ - Step 42012: {'lr': 0.00041512485645922164, 'samples': 8066304, 'steps': 42011, 'loss/train': 1.7063237428665161} 11/07/2021 03:07:28 - INFO - __main__ - Step 42013: {'lr': 0.0004151208719774696, 'samples': 8066496, 'steps': 42012, 'loss/train': 1.5984175205230713} 11/07/2021 03:07:29 - INFO - __main__ - Step 42014: {'lr': 0.0004151168874213166, 'samples': 8066688, 'steps': 42013, 'loss/train': 1.807763934135437} 11/07/2021 03:07:29 - INFO - __main__ - Step 42015: {'lr': 0.00041511290279076454, 'samples': 8066880, 'steps': 42014, 'loss/train': 1.4820877313613892} 11/07/2021 03:07:30 - INFO - __main__ - Step 42016: {'lr': 0.0004151089180858151, 'samples': 8067072, 'steps': 42015, 'loss/train': 1.1027518510818481} 11/07/2021 03:07:30 - INFO - __main__ - Step 42017: {'lr': 0.00041510493330647015, 'samples': 8067264, 'steps': 42016, 'loss/train': 1.7387123107910156} 11/07/2021 03:07:31 - INFO - __main__ - Step 42018: {'lr': 0.00041510094845273145, 'samples': 8067456, 'steps': 42017, 'loss/train': 1.5490429401397705} 11/07/2021 03:07:32 - INFO - __main__ - Step 42019: {'lr': 0.0004150969635246008, 'samples': 8067648, 'steps': 42018, 'loss/train': 1.4055429697036743} 11/07/2021 03:07:32 - INFO - __main__ - Step 42020: {'lr': 0.00041509297852208003, 'samples': 8067840, 'steps': 42019, 'loss/train': 1.8538402318954468} 11/07/2021 03:07:32 - INFO - __main__ - Step 42021: {'lr': 0.00041508899344517094, 'samples': 8068032, 'steps': 42020, 'loss/train': 1.2977536916732788} 11/07/2021 03:07:33 - INFO - __main__ - Step 42022: {'lr': 0.0004150850082938752, 'samples': 8068224, 'steps': 42021, 'loss/train': 1.055382251739502} 11/07/2021 03:07:33 - INFO - __main__ - Step 42023: {'lr': 0.00041508102306819485, 'samples': 8068416, 'steps': 42022, 'loss/train': 1.807405710220337} 11/07/2021 03:07:34 - INFO - __main__ - Step 42024: {'lr': 0.0004150770377681314, 'samples': 8068608, 'steps': 42023, 'loss/train': 1.8987107276916504} 11/07/2021 03:07:34 - INFO - __main__ - Step 42025: {'lr': 0.00041507305239368684, 'samples': 8068800, 'steps': 42024, 'loss/train': 1.1936386823654175} 11/07/2021 03:07:35 - INFO - __main__ - Step 42026: {'lr': 0.0004150690669448629, 'samples': 8068992, 'steps': 42025, 'loss/train': 1.6065560579299927} 11/07/2021 03:07:35 - INFO - __main__ - Step 42027: {'lr': 0.0004150650814216614, 'samples': 8069184, 'steps': 42026, 'loss/train': 2.0050127506256104} 11/07/2021 03:07:36 - INFO - __main__ - Step 42028: {'lr': 0.0004150610958240841, 'samples': 8069376, 'steps': 42027, 'loss/train': 1.268518328666687} 11/07/2021 03:07:36 - INFO - __main__ - Step 42029: {'lr': 0.00041505711015213284, 'samples': 8069568, 'steps': 42028, 'loss/train': 1.7336264848709106} 11/07/2021 03:07:37 - INFO - __main__ - Step 42030: {'lr': 0.0004150531244058094, 'samples': 8069760, 'steps': 42029, 'loss/train': 1.4684981107711792} 11/07/2021 03:07:37 - INFO - __main__ - Step 42031: {'lr': 0.00041504913858511557, 'samples': 8069952, 'steps': 42030, 'loss/train': 1.1485469341278076} 11/07/2021 03:07:38 - INFO - __main__ - Step 42032: {'lr': 0.0004150451526900531, 'samples': 8070144, 'steps': 42031, 'loss/train': 1.8169656991958618} 11/07/2021 03:07:38 - INFO - __main__ - Step 42033: {'lr': 0.00041504116672062385, 'samples': 8070336, 'steps': 42032, 'loss/train': 1.820695400238037} 11/07/2021 03:07:38 - INFO - __main__ - Step 42034: {'lr': 0.0004150371806768296, 'samples': 8070528, 'steps': 42033, 'loss/train': 1.4767593145370483} 11/07/2021 03:07:39 - INFO - __main__ - Step 42035: {'lr': 0.00041503319455867216, 'samples': 8070720, 'steps': 42034, 'loss/train': 1.8394572734832764} 11/07/2021 03:07:40 - INFO - __main__ - Step 42036: {'lr': 0.0004150292083661533, 'samples': 8070912, 'steps': 42035, 'loss/train': 1.9457099437713623} 11/07/2021 03:07:40 - INFO - __main__ - Step 42037: {'lr': 0.00041502522209927486, 'samples': 8071104, 'steps': 42036, 'loss/train': 2.1062285900115967} 11/07/2021 03:07:40 - INFO - __main__ - Step 42038: {'lr': 0.00041502123575803854, 'samples': 8071296, 'steps': 42037, 'loss/train': 1.010448932647705} 11/07/2021 03:07:41 - INFO - __main__ - Step 42039: {'lr': 0.0004150172493424462, 'samples': 8071488, 'steps': 42038, 'loss/train': 1.797959804534912} 11/07/2021 03:07:41 - INFO - __main__ - Step 42040: {'lr': 0.00041501326285249963, 'samples': 8071680, 'steps': 42039, 'loss/train': 1.6893484592437744} 11/07/2021 03:07:42 - INFO - __main__ - Step 42041: {'lr': 0.0004150092762882007, 'samples': 8071872, 'steps': 42040, 'loss/train': 1.5143741369247437} 11/07/2021 03:07:43 - INFO - __main__ - Step 42042: {'lr': 0.00041500528964955106, 'samples': 8072064, 'steps': 42041, 'loss/train': 1.83730149269104} 11/07/2021 03:07:43 - INFO - __main__ - Step 42043: {'lr': 0.0004150013029365527, 'samples': 8072256, 'steps': 42042, 'loss/train': 1.7041666507720947} 11/07/2021 03:07:43 - INFO - __main__ - Step 42044: {'lr': 0.0004149973161492072, 'samples': 8072448, 'steps': 42043, 'loss/train': 1.4257150888442993} 11/07/2021 03:07:44 - INFO - __main__ - Step 42045: {'lr': 0.0004149933292875164, 'samples': 8072640, 'steps': 42044, 'loss/train': 1.5595922470092773} 11/07/2021 03:07:44 - INFO - __main__ - Step 42046: {'lr': 0.0004149893423514822, 'samples': 8072832, 'steps': 42045, 'loss/train': 1.4120824337005615} 11/07/2021 03:07:45 - INFO - __main__ - Step 42047: {'lr': 0.0004149853553411064, 'samples': 8073024, 'steps': 42046, 'loss/train': 1.8442844152450562} 11/07/2021 03:07:45 - INFO - __main__ - Step 42048: {'lr': 0.00041498136825639074, 'samples': 8073216, 'steps': 42047, 'loss/train': 1.3788831233978271} 11/07/2021 03:07:46 - INFO - __main__ - Step 42049: {'lr': 0.000414977381097337, 'samples': 8073408, 'steps': 42048, 'loss/train': 0.8893455862998962} 11/07/2021 03:07:46 - INFO - __main__ - Step 42050: {'lr': 0.000414973393863947, 'samples': 8073600, 'steps': 42049, 'loss/train': 1.5420819520950317} 11/07/2021 03:07:46 - INFO - __main__ - Step 42051: {'lr': 0.0004149694065562225, 'samples': 8073792, 'steps': 42050, 'loss/train': 0.9688058495521545} 11/07/2021 03:07:48 - INFO - __main__ - Step 42052: {'lr': 0.0004149654191741654, 'samples': 8073984, 'steps': 42051, 'loss/train': 1.6391124725341797} 11/07/2021 03:07:48 - INFO - __main__ - Step 42053: {'lr': 0.0004149614317177774, 'samples': 8074176, 'steps': 42052, 'loss/train': 1.605303168296814} 11/07/2021 03:07:48 - INFO - __main__ - Step 42054: {'lr': 0.00041495744418706027, 'samples': 8074368, 'steps': 42053, 'loss/train': 1.5720794200897217} 11/07/2021 03:07:49 - INFO - __main__ - Step 42055: {'lr': 0.00041495345658201587, 'samples': 8074560, 'steps': 42054, 'loss/train': 1.0581259727478027} 11/07/2021 03:07:49 - INFO - __main__ - Step 42056: {'lr': 0.00041494946890264606, 'samples': 8074752, 'steps': 42055, 'loss/train': 1.635392189025879} 11/07/2021 03:07:50 - INFO - __main__ - Step 42057: {'lr': 0.00041494548114895255, 'samples': 8074944, 'steps': 42056, 'loss/train': 1.5409810543060303} 11/07/2021 03:07:51 - INFO - __main__ - Step 42058: {'lr': 0.0004149414933209371, 'samples': 8075136, 'steps': 42057, 'loss/train': 1.8882447481155396} 11/07/2021 03:07:51 - INFO - __main__ - Step 42059: {'lr': 0.00041493750541860165, 'samples': 8075328, 'steps': 42058, 'loss/train': 0.4278101921081543} 11/07/2021 03:07:51 - INFO - __main__ - Step 42060: {'lr': 0.0004149335174419478, 'samples': 8075520, 'steps': 42059, 'loss/train': 1.5596524477005005} 11/07/2021 03:07:52 - INFO - __main__ - Step 42061: {'lr': 0.0004149295293909775, 'samples': 8075712, 'steps': 42060, 'loss/train': 1.5847713947296143} 11/07/2021 03:07:53 - INFO - __main__ - Step 42062: {'lr': 0.0004149255412656925, 'samples': 8075904, 'steps': 42061, 'loss/train': 1.6529903411865234} 11/07/2021 03:07:53 - INFO - __main__ - Step 42063: {'lr': 0.00041492155306609456, 'samples': 8076096, 'steps': 42062, 'loss/train': 2.4510741233825684} 11/07/2021 03:07:53 - INFO - __main__ - Step 42064: {'lr': 0.00041491756479218557, 'samples': 8076288, 'steps': 42063, 'loss/train': 1.7397103309631348} 11/07/2021 03:07:54 - INFO - __main__ - Step 42065: {'lr': 0.0004149135764439672, 'samples': 8076480, 'steps': 42064, 'loss/train': 1.273585557937622} 11/07/2021 03:07:54 - INFO - __main__ - Step 42066: {'lr': 0.0004149095880214414, 'samples': 8076672, 'steps': 42065, 'loss/train': 1.6326723098754883} 11/07/2021 03:07:55 - INFO - __main__ - Step 42067: {'lr': 0.00041490559952460983, 'samples': 8076864, 'steps': 42066, 'loss/train': 1.0523433685302734} 11/07/2021 03:07:55 - INFO - __main__ - Step 42068: {'lr': 0.00041490161095347435, 'samples': 8077056, 'steps': 42067, 'loss/train': 1.2187541723251343} 11/07/2021 03:07:56 - INFO - __main__ - Step 42069: {'lr': 0.00041489762230803676, 'samples': 8077248, 'steps': 42068, 'loss/train': 1.519520878791809} 11/07/2021 03:07:56 - INFO - __main__ - Step 42070: {'lr': 0.00041489363358829885, 'samples': 8077440, 'steps': 42069, 'loss/train': 1.3521915674209595} 11/07/2021 03:07:57 - INFO - __main__ - Step 42071: {'lr': 0.0004148896447942624, 'samples': 8077632, 'steps': 42070, 'loss/train': 1.5754438638687134} 11/07/2021 03:07:57 - INFO - __main__ - Step 42072: {'lr': 0.00041488565592592917, 'samples': 8077824, 'steps': 42071, 'loss/train': 1.6142688989639282} 11/07/2021 03:07:58 - INFO - __main__ - Step 42073: {'lr': 0.0004148816669833011, 'samples': 8078016, 'steps': 42072, 'loss/train': 2.0739517211914062} 11/07/2021 03:07:58 - INFO - __main__ - Step 42074: {'lr': 0.0004148776779663799, 'samples': 8078208, 'steps': 42073, 'loss/train': 1.590319275856018} 11/07/2021 03:07:59 - INFO - __main__ - Step 42075: {'lr': 0.00041487368887516726, 'samples': 8078400, 'steps': 42074, 'loss/train': 1.4445720911026} 11/07/2021 03:07:59 - INFO - __main__ - Step 42076: {'lr': 0.00041486969970966516, 'samples': 8078592, 'steps': 42075, 'loss/train': 1.8851046562194824} 11/07/2021 03:07:59 - INFO - __main__ - Step 42077: {'lr': 0.0004148657104698753, 'samples': 8078784, 'steps': 42076, 'loss/train': 1.7685871124267578} 11/07/2021 03:08:00 - INFO - __main__ - Step 42078: {'lr': 0.00041486172115579945, 'samples': 8078976, 'steps': 42077, 'loss/train': 1.370727777481079} 11/07/2021 03:08:01 - INFO - __main__ - Step 42079: {'lr': 0.00041485773176743953, 'samples': 8079168, 'steps': 42078, 'loss/train': 1.630990743637085} 11/07/2021 03:08:01 - INFO - __main__ - Step 42080: {'lr': 0.00041485374230479724, 'samples': 8079360, 'steps': 42079, 'loss/train': 1.9803260564804077} 11/07/2021 03:08:01 - INFO - __main__ - Step 42081: {'lr': 0.00041484975276787436, 'samples': 8079552, 'steps': 42080, 'loss/train': 1.5335677862167358} 11/07/2021 03:08:02 - INFO - __main__ - Step 42082: {'lr': 0.00041484576315667273, 'samples': 8079744, 'steps': 42081, 'loss/train': 1.72471284866333} 11/07/2021 03:08:03 - INFO - __main__ - Step 42083: {'lr': 0.0004148417734711941, 'samples': 8079936, 'steps': 42082, 'loss/train': 0.442223459482193} 11/07/2021 03:08:03 - INFO - __main__ - Step 42084: {'lr': 0.00041483778371144046, 'samples': 8080128, 'steps': 42083, 'loss/train': 1.3410874605178833} 11/07/2021 03:08:04 - INFO - __main__ - Step 42085: {'lr': 0.0004148337938774134, 'samples': 8080320, 'steps': 42084, 'loss/train': 1.4568349123001099} 11/07/2021 03:08:04 - INFO - __main__ - Step 42086: {'lr': 0.00041482980396911467, 'samples': 8080512, 'steps': 42085, 'loss/train': 0.7727944850921631} 11/07/2021 03:08:04 - INFO - __main__ - Step 42087: {'lr': 0.0004148258139865463, 'samples': 8080704, 'steps': 42086, 'loss/train': 2.0954015254974365} 11/07/2021 03:08:05 - INFO - __main__ - Step 42088: {'lr': 0.00041482182392970984, 'samples': 8080896, 'steps': 42087, 'loss/train': 1.3977972269058228} 11/07/2021 03:08:06 - INFO - __main__ - Step 42089: {'lr': 0.00041481783379860725, 'samples': 8081088, 'steps': 42088, 'loss/train': 1.567556381225586} 11/07/2021 03:08:06 - INFO - __main__ - Step 42090: {'lr': 0.0004148138435932404, 'samples': 8081280, 'steps': 42089, 'loss/train': 1.4468871355056763} 11/07/2021 03:08:06 - INFO - __main__ - Step 42091: {'lr': 0.0004148098533136109, 'samples': 8081472, 'steps': 42090, 'loss/train': 2.4713995456695557} 11/07/2021 03:08:07 - INFO - __main__ - Step 42092: {'lr': 0.0004148058629597206, 'samples': 8081664, 'steps': 42091, 'loss/train': 1.3342759609222412} 11/07/2021 03:08:07 - INFO - __main__ - Step 42093: {'lr': 0.0004148018725315713, 'samples': 8081856, 'steps': 42092, 'loss/train': 1.1993120908737183} 11/07/2021 03:08:08 - INFO - __main__ - Step 42094: {'lr': 0.00041479788202916483, 'samples': 8082048, 'steps': 42093, 'loss/train': 1.8482271432876587} 11/07/2021 03:08:09 - INFO - __main__ - Step 42095: {'lr': 0.000414793891452503, 'samples': 8082240, 'steps': 42094, 'loss/train': 1.5280399322509766} 11/07/2021 03:08:09 - INFO - __main__ - Step 42096: {'lr': 0.0004147899008015876, 'samples': 8082432, 'steps': 42095, 'loss/train': 1.5832175016403198} 11/07/2021 03:08:09 - INFO - __main__ - Step 42097: {'lr': 0.0004147859100764204, 'samples': 8082624, 'steps': 42096, 'loss/train': 1.7992056608200073} 11/07/2021 03:08:10 - INFO - __main__ - Step 42098: {'lr': 0.0004147819192770033, 'samples': 8082816, 'steps': 42097, 'loss/train': 1.5658690929412842} 11/07/2021 03:08:11 - INFO - __main__ - Step 42099: {'lr': 0.00041477792840333784, 'samples': 8083008, 'steps': 42098, 'loss/train': 1.2517671585083008} 11/07/2021 03:08:11 - INFO - __main__ - Step 42100: {'lr': 0.00041477393745542607, 'samples': 8083200, 'steps': 42099, 'loss/train': 1.3536474704742432} 11/07/2021 03:08:11 - INFO - __main__ - Step 42101: {'lr': 0.0004147699464332697, 'samples': 8083392, 'steps': 42100, 'loss/train': 1.4670751094818115} 11/07/2021 03:08:12 - INFO - __main__ - Step 42102: {'lr': 0.0004147659553368706, 'samples': 8083584, 'steps': 42101, 'loss/train': 2.541309118270874} 11/07/2021 03:08:12 - INFO - __main__ - Step 42103: {'lr': 0.00041476196416623034, 'samples': 8083776, 'steps': 42102, 'loss/train': 1.3247640132904053} 11/07/2021 03:08:13 - INFO - __main__ - Step 42104: {'lr': 0.0004147579729213511, 'samples': 8083968, 'steps': 42103, 'loss/train': 1.835842251777649} 11/07/2021 03:08:13 - INFO - __main__ - Step 42105: {'lr': 0.0004147539816022343, 'samples': 8084160, 'steps': 42104, 'loss/train': 1.3662902116775513} 11/07/2021 03:08:14 - INFO - __main__ - Step 42106: {'lr': 0.0004147499902088819, 'samples': 8084352, 'steps': 42105, 'loss/train': 1.5861386060714722} 11/07/2021 03:08:14 - INFO - __main__ - Step 42107: {'lr': 0.0004147459987412958, 'samples': 8084544, 'steps': 42106, 'loss/train': 1.6038364171981812} 11/07/2021 03:08:14 - INFO - __main__ - Step 42108: {'lr': 0.0004147420071994776, 'samples': 8084736, 'steps': 42107, 'loss/train': 1.1665230989456177} 11/07/2021 03:08:16 - INFO - __main__ - Step 42109: {'lr': 0.0004147380155834293, 'samples': 8084928, 'steps': 42108, 'loss/train': 1.8599544763565063} 11/07/2021 03:08:16 - INFO - __main__ - Step 42110: {'lr': 0.0004147340238931525, 'samples': 8085120, 'steps': 42109, 'loss/train': 1.515739917755127} 11/07/2021 03:08:16 - INFO - __main__ - Step 42111: {'lr': 0.0004147300321286491, 'samples': 8085312, 'steps': 42110, 'loss/train': 1.7114287614822388} 11/07/2021 03:08:17 - INFO - __main__ - Step 42112: {'lr': 0.0004147260402899209, 'samples': 8085504, 'steps': 42111, 'loss/train': 1.681444764137268} 11/07/2021 03:08:17 - INFO - __main__ - Step 42113: {'lr': 0.0004147220483769697, 'samples': 8085696, 'steps': 42112, 'loss/train': 1.788677453994751} 11/07/2021 03:08:18 - INFO - __main__ - Step 42114: {'lr': 0.0004147180563897972, 'samples': 8085888, 'steps': 42113, 'loss/train': 2.0268678665161133} 11/07/2021 03:08:18 - INFO - __main__ - Step 42115: {'lr': 0.0004147140643284054, 'samples': 8086080, 'steps': 42114, 'loss/train': 1.668137550354004} 11/07/2021 03:08:19 - INFO - __main__ - Step 42116: {'lr': 0.00041471007219279595, 'samples': 8086272, 'steps': 42115, 'loss/train': 1.519972562789917} 11/07/2021 03:08:19 - INFO - __main__ - Step 42117: {'lr': 0.0004147060799829707, 'samples': 8086464, 'steps': 42116, 'loss/train': 1.638015627861023} 11/07/2021 03:08:19 - INFO - __main__ - Step 42118: {'lr': 0.00041470208769893137, 'samples': 8086656, 'steps': 42117, 'loss/train': 1.7440546751022339} 11/07/2021 03:08:20 - INFO - __main__ - Step 42119: {'lr': 0.0004146980953406799, 'samples': 8086848, 'steps': 42118, 'loss/train': 0.6300422549247742} 11/07/2021 03:08:21 - INFO - __main__ - Step 42120: {'lr': 0.000414694102908218, 'samples': 8087040, 'steps': 42119, 'loss/train': 0.47657132148742676} 11/07/2021 03:08:21 - INFO - __main__ - Step 42121: {'lr': 0.0004146901104015474, 'samples': 8087232, 'steps': 42120, 'loss/train': 1.5050257444381714} 11/07/2021 03:08:21 - INFO - __main__ - Step 42122: {'lr': 0.00041468611782067, 'samples': 8087424, 'steps': 42121, 'loss/train': 1.2551136016845703} 11/07/2021 03:08:22 - INFO - __main__ - Step 42123: {'lr': 0.0004146821251655877, 'samples': 8087616, 'steps': 42122, 'loss/train': 1.520211100578308} 11/07/2021 03:08:22 - INFO - __main__ - Step 42124: {'lr': 0.000414678132436302, 'samples': 8087808, 'steps': 42123, 'loss/train': 0.9685293436050415} 11/07/2021 03:08:23 - INFO - __main__ - Step 42125: {'lr': 0.000414674139632815, 'samples': 8088000, 'steps': 42124, 'loss/train': 1.4324889183044434} 11/07/2021 03:08:23 - INFO - __main__ - Step 42126: {'lr': 0.0004146701467551283, 'samples': 8088192, 'steps': 42125, 'loss/train': 1.3058773279190063} 11/07/2021 03:08:24 - INFO - __main__ - Step 42127: {'lr': 0.0004146661538032438, 'samples': 8088384, 'steps': 42126, 'loss/train': 1.5557634830474854} 11/07/2021 03:08:24 - INFO - __main__ - Step 42128: {'lr': 0.0004146621607771633, 'samples': 8088576, 'steps': 42127, 'loss/train': 1.6212557554244995} 11/07/2021 03:08:25 - INFO - __main__ - Step 42129: {'lr': 0.00041465816767688853, 'samples': 8088768, 'steps': 42128, 'loss/train': 1.4947006702423096} 11/07/2021 03:08:26 - INFO - __main__ - Step 42130: {'lr': 0.0004146541745024214, 'samples': 8088960, 'steps': 42129, 'loss/train': 1.1805391311645508} 11/07/2021 03:08:26 - INFO - __main__ - Step 42131: {'lr': 0.00041465018125376354, 'samples': 8089152, 'steps': 42130, 'loss/train': 1.5250442028045654} 11/07/2021 03:08:26 - INFO - __main__ - Step 42132: {'lr': 0.0004146461879309169, 'samples': 8089344, 'steps': 42131, 'loss/train': 1.5029743909835815} 11/07/2021 03:08:27 - INFO - __main__ - Step 42133: {'lr': 0.0004146421945338832, 'samples': 8089536, 'steps': 42132, 'loss/train': 1.6332733631134033} 11/07/2021 03:08:27 - INFO - __main__ - Step 42134: {'lr': 0.0004146382010626643, 'samples': 8089728, 'steps': 42133, 'loss/train': 1.3687775135040283} 11/07/2021 03:08:28 - INFO - __main__ - Step 42135: {'lr': 0.000414634207517262, 'samples': 8089920, 'steps': 42134, 'loss/train': 1.0351780652999878} 11/07/2021 03:08:28 - INFO - __main__ - Step 42136: {'lr': 0.000414630213897678, 'samples': 8090112, 'steps': 42135, 'loss/train': 1.5677454471588135} 11/07/2021 03:08:29 - INFO - __main__ - Step 42137: {'lr': 0.00041462622020391416, 'samples': 8090304, 'steps': 42136, 'loss/train': 1.659880518913269} 11/07/2021 03:08:29 - INFO - __main__ - Step 42138: {'lr': 0.00041462222643597236, 'samples': 8090496, 'steps': 42137, 'loss/train': 1.4060335159301758} 11/07/2021 03:08:29 - INFO - __main__ - Step 42139: {'lr': 0.00041461823259385423, 'samples': 8090688, 'steps': 42138, 'loss/train': 1.3353675603866577} 11/07/2021 03:08:30 - INFO - __main__ - Step 42140: {'lr': 0.00041461423867756176, 'samples': 8090880, 'steps': 42139, 'loss/train': 1.480120301246643} 11/07/2021 03:08:31 - INFO - __main__ - Step 42141: {'lr': 0.00041461024468709664, 'samples': 8091072, 'steps': 42140, 'loss/train': 1.1149752140045166} 11/07/2021 03:08:31 - INFO - __main__ - Step 42142: {'lr': 0.0004146062506224606, 'samples': 8091264, 'steps': 42141, 'loss/train': 1.4817534685134888} 11/07/2021 03:08:31 - INFO - __main__ - Step 42143: {'lr': 0.0004146022564836556, 'samples': 8091456, 'steps': 42142, 'loss/train': 1.6729769706726074} 11/07/2021 03:08:32 - INFO - __main__ - Step 42144: {'lr': 0.0004145982622706833, 'samples': 8091648, 'steps': 42143, 'loss/train': 1.3165689706802368} 11/07/2021 03:08:33 - INFO - __main__ - Step 42145: {'lr': 0.00041459426798354563, 'samples': 8091840, 'steps': 42144, 'loss/train': 1.4829295873641968} 11/07/2021 03:08:33 - INFO - __main__ - Step 42146: {'lr': 0.00041459027362224433, 'samples': 8092032, 'steps': 42145, 'loss/train': 1.44984769821167} 11/07/2021 03:08:34 - INFO - __main__ - Step 42147: {'lr': 0.00041458627918678116, 'samples': 8092224, 'steps': 42146, 'loss/train': 1.9205005168914795} 11/07/2021 03:08:34 - INFO - __main__ - Step 42148: {'lr': 0.00041458228467715786, 'samples': 8092416, 'steps': 42147, 'loss/train': 1.452275037765503} 11/07/2021 03:08:34 - INFO - __main__ - Step 42149: {'lr': 0.00041457829009337643, 'samples': 8092608, 'steps': 42148, 'loss/train': 1.1981793642044067} 11/07/2021 03:08:35 - INFO - __main__ - Step 42150: {'lr': 0.00041457429543543856, 'samples': 8092800, 'steps': 42149, 'loss/train': 1.450162410736084} 11/07/2021 03:08:36 - INFO - __main__ - Step 42151: {'lr': 0.0004145703007033461, 'samples': 8092992, 'steps': 42150, 'loss/train': 1.1431949138641357} 11/07/2021 03:08:36 - INFO - __main__ - Step 42152: {'lr': 0.00041456630589710073, 'samples': 8093184, 'steps': 42151, 'loss/train': 1.1344993114471436} 11/07/2021 03:08:36 - INFO - __main__ - Step 42153: {'lr': 0.0004145623110167043, 'samples': 8093376, 'steps': 42152, 'loss/train': 1.4127930402755737} 11/07/2021 03:08:37 - INFO - __main__ - Step 42154: {'lr': 0.00041455831606215863, 'samples': 8093568, 'steps': 42153, 'loss/train': 1.4258630275726318} 11/07/2021 03:08:37 - INFO - __main__ - Step 42155: {'lr': 0.0004145543210334656, 'samples': 8093760, 'steps': 42154, 'loss/train': 1.334218144416809} 11/07/2021 03:08:38 - INFO - __main__ - Step 42156: {'lr': 0.00041455032593062685, 'samples': 8093952, 'steps': 42155, 'loss/train': 1.757309913635254} 11/07/2021 03:08:39 - INFO - __main__ - Step 42157: {'lr': 0.00041454633075364427, 'samples': 8094144, 'steps': 42156, 'loss/train': 1.4084231853485107} 11/07/2021 03:08:39 - INFO - __main__ - Step 42158: {'lr': 0.00041454233550251976, 'samples': 8094336, 'steps': 42157, 'loss/train': 1.2516793012619019} 11/07/2021 03:08:39 - INFO - __main__ - Step 42159: {'lr': 0.0004145383401772549, 'samples': 8094528, 'steps': 42158, 'loss/train': 1.6665518283843994} 11/07/2021 03:08:40 - INFO - __main__ - Step 42160: {'lr': 0.00041453434477785165, 'samples': 8094720, 'steps': 42159, 'loss/train': 1.5540876388549805} 11/07/2021 03:08:41 - INFO - __main__ - Step 42161: {'lr': 0.0004145303493043118, 'samples': 8094912, 'steps': 42160, 'loss/train': 1.2934569120407104} 11/07/2021 03:08:41 - INFO - __main__ - Step 42162: {'lr': 0.000414526353756637, 'samples': 8095104, 'steps': 42161, 'loss/train': 2.090319871902466} 11/07/2021 03:08:41 - INFO - __main__ - Step 42163: {'lr': 0.0004145223581348292, 'samples': 8095296, 'steps': 42162, 'loss/train': 1.8299297094345093} 11/07/2021 03:08:42 - INFO - __main__ - Step 42164: {'lr': 0.00041451836243889027, 'samples': 8095488, 'steps': 42163, 'loss/train': 1.2692837715148926} 11/07/2021 03:08:42 - INFO - __main__ - Step 42165: {'lr': 0.0004145143666688218, 'samples': 8095680, 'steps': 42164, 'loss/train': 1.8240784406661987} 11/07/2021 03:08:43 - INFO - __main__ - Step 42166: {'lr': 0.0004145103708246257, 'samples': 8095872, 'steps': 42165, 'loss/train': 1.1021740436553955} 11/07/2021 03:08:43 - INFO - __main__ - Step 42167: {'lr': 0.0004145063749063038, 'samples': 8096064, 'steps': 42166, 'loss/train': 1.6214840412139893} 11/07/2021 03:08:44 - INFO - __main__ - Step 42168: {'lr': 0.00041450237891385783, 'samples': 8096256, 'steps': 42167, 'loss/train': 1.618085265159607} 11/07/2021 03:08:44 - INFO - __main__ - Step 42169: {'lr': 0.00041449838284728964, 'samples': 8096448, 'steps': 42168, 'loss/train': 1.386372685432434} 11/07/2021 03:08:45 - INFO - __main__ - Step 42170: {'lr': 0.000414494386706601, 'samples': 8096640, 'steps': 42169, 'loss/train': 1.309335708618164} 11/07/2021 03:08:45 - INFO - __main__ - Step 42171: {'lr': 0.00041449039049179385, 'samples': 8096832, 'steps': 42170, 'loss/train': 1.3361847400665283} 11/07/2021 03:08:46 - INFO - __main__ - Step 42172: {'lr': 0.0004144863942028697, 'samples': 8097024, 'steps': 42171, 'loss/train': 1.3556065559387207} 11/07/2021 03:08:46 - INFO - __main__ - Step 42173: {'lr': 0.0004144823978398306, 'samples': 8097216, 'steps': 42172, 'loss/train': 1.8618935346603394} 11/07/2021 03:08:47 - INFO - __main__ - Step 42174: {'lr': 0.0004144784014026782, 'samples': 8097408, 'steps': 42173, 'loss/train': 1.6203159093856812} 11/07/2021 03:08:47 - INFO - __main__ - Step 42175: {'lr': 0.0004144744048914145, 'samples': 8097600, 'steps': 42174, 'loss/train': 1.4742738008499146} 11/07/2021 03:08:47 - INFO - __main__ - Step 42176: {'lr': 0.0004144704083060411, 'samples': 8097792, 'steps': 42175, 'loss/train': 1.5544403791427612} 11/07/2021 03:08:48 - INFO - __main__ - Step 42177: {'lr': 0.00041446641164655983, 'samples': 8097984, 'steps': 42176, 'loss/train': 1.346947431564331} 11/07/2021 03:08:49 - INFO - __main__ - Step 42178: {'lr': 0.0004144624149129727, 'samples': 8098176, 'steps': 42177, 'loss/train': 1.5605933666229248} 11/07/2021 03:08:49 - INFO - __main__ - Step 42179: {'lr': 0.00041445841810528117, 'samples': 8098368, 'steps': 42178, 'loss/train': 1.7424789667129517} 11/07/2021 03:08:49 - INFO - __main__ - Step 42180: {'lr': 0.00041445442122348727, 'samples': 8098560, 'steps': 42179, 'loss/train': 0.6647708415985107} 11/07/2021 03:08:50 - INFO - __main__ - Step 42181: {'lr': 0.0004144504242675927, 'samples': 8098752, 'steps': 42180, 'loss/train': 1.6002750396728516} 11/07/2021 03:08:50 - INFO - __main__ - Step 42182: {'lr': 0.0004144464272375994, 'samples': 8098944, 'steps': 42181, 'loss/train': 1.1403971910476685} 11/07/2021 03:08:51 - INFO - __main__ - Step 42183: {'lr': 0.000414442430133509, 'samples': 8099136, 'steps': 42182, 'loss/train': 1.608879566192627} 11/07/2021 03:08:52 - INFO - __main__ - Step 42184: {'lr': 0.00041443843295532333, 'samples': 8099328, 'steps': 42183, 'loss/train': 1.4282817840576172} 11/07/2021 03:08:52 - INFO - __main__ - Step 42185: {'lr': 0.0004144344357030444, 'samples': 8099520, 'steps': 42184, 'loss/train': 1.5298147201538086} 11/07/2021 03:08:52 - INFO - __main__ - Step 42186: {'lr': 0.0004144304383766737, 'samples': 8099712, 'steps': 42185, 'loss/train': 1.4202641248703003} 11/07/2021 03:08:53 - INFO - __main__ - Step 42187: {'lr': 0.0004144264409762133, 'samples': 8099904, 'steps': 42186, 'loss/train': 1.4413903951644897} 11/07/2021 03:08:54 - INFO - __main__ - Step 42188: {'lr': 0.0004144224435016648, 'samples': 8100096, 'steps': 42187, 'loss/train': 1.535606861114502} 11/07/2021 03:08:54 - INFO - __main__ - Step 42189: {'lr': 0.00041441844595303015, 'samples': 8100288, 'steps': 42188, 'loss/train': 1.5828893184661865} 11/07/2021 03:08:54 - INFO - __main__ - Step 42190: {'lr': 0.0004144144483303111, 'samples': 8100480, 'steps': 42189, 'loss/train': 1.4190727472305298} 11/07/2021 03:08:55 - INFO - __main__ - Step 42191: {'lr': 0.00041441045063350933, 'samples': 8100672, 'steps': 42190, 'loss/train': 1.7690839767456055} 11/07/2021 03:08:55 - INFO - __main__ - Step 42192: {'lr': 0.00041440645286262677, 'samples': 8100864, 'steps': 42191, 'loss/train': 1.7696083784103394} 11/07/2021 03:08:56 - INFO - __main__ - Step 42193: {'lr': 0.0004144024550176653, 'samples': 8101056, 'steps': 42192, 'loss/train': 1.3551652431488037} 11/07/2021 03:08:57 - INFO - __main__ - Step 42194: {'lr': 0.0004143984570986265, 'samples': 8101248, 'steps': 42193, 'loss/train': 1.4273933172225952} 11/07/2021 03:08:57 - INFO - __main__ - Step 42195: {'lr': 0.00041439445910551235, 'samples': 8101440, 'steps': 42194, 'loss/train': 1.6295689344406128} 11/07/2021 03:08:57 - INFO - __main__ - Step 42196: {'lr': 0.00041439046103832454, 'samples': 8101632, 'steps': 42195, 'loss/train': 1.0149160623550415} 11/07/2021 03:08:58 - INFO - __main__ - Step 42197: {'lr': 0.000414386462897065, 'samples': 8101824, 'steps': 42196, 'loss/train': 1.359559416770935} 11/07/2021 03:08:59 - INFO - __main__ - Step 42198: {'lr': 0.00041438246468173545, 'samples': 8102016, 'steps': 42197, 'loss/train': 1.8362070322036743} 11/07/2021 03:08:59 - INFO - __main__ - Step 42199: {'lr': 0.0004143784663923377, 'samples': 8102208, 'steps': 42198, 'loss/train': 1.4988839626312256} 11/07/2021 03:08:59 - INFO - __main__ - Step 42200: {'lr': 0.00041437446802887354, 'samples': 8102400, 'steps': 42199, 'loss/train': 0.7154588103294373} 11/07/2021 03:09:00 - INFO - __main__ - Step 42201: {'lr': 0.0004143704695913447, 'samples': 8102592, 'steps': 42200, 'loss/train': 1.3718774318695068} 11/07/2021 03:09:00 - INFO - __main__ - Step 42202: {'lr': 0.0004143664710797531, 'samples': 8102784, 'steps': 42201, 'loss/train': 1.7068943977355957} 11/07/2021 03:09:01 - INFO - __main__ - Step 42203: {'lr': 0.0004143624724941006, 'samples': 8102976, 'steps': 42202, 'loss/train': 1.0936533212661743} 11/07/2021 03:09:01 - INFO - __main__ - Step 42204: {'lr': 0.00041435847383438886, 'samples': 8103168, 'steps': 42203, 'loss/train': 1.7383671998977661} 11/07/2021 03:09:02 - INFO - __main__ - Step 42205: {'lr': 0.0004143544751006197, 'samples': 8103360, 'steps': 42204, 'loss/train': 1.5881863832473755} 11/07/2021 03:09:02 - INFO - __main__ - Step 42206: {'lr': 0.000414350476292795, 'samples': 8103552, 'steps': 42205, 'loss/train': 1.4384769201278687} 11/07/2021 03:09:02 - INFO - __main__ - Step 42207: {'lr': 0.0004143464774109164, 'samples': 8103744, 'steps': 42206, 'loss/train': 1.3117130994796753} 11/07/2021 03:09:03 - INFO - __main__ - Step 42208: {'lr': 0.0004143424784549859, 'samples': 8103936, 'steps': 42207, 'loss/train': 0.45743227005004883} 11/07/2021 03:09:04 - INFO - __main__ - Step 42209: {'lr': 0.00041433847942500516, 'samples': 8104128, 'steps': 42208, 'loss/train': 1.2702810764312744} 11/07/2021 03:09:04 - INFO - __main__ - Step 42210: {'lr': 0.0004143344803209761, 'samples': 8104320, 'steps': 42209, 'loss/train': 1.4717539548873901} 11/07/2021 03:09:04 - INFO - __main__ - Step 42211: {'lr': 0.0004143304811429005, 'samples': 8104512, 'steps': 42210, 'loss/train': 1.5789469480514526} 11/07/2021 03:09:05 - INFO - __main__ - Step 42212: {'lr': 0.00041432648189078006, 'samples': 8104704, 'steps': 42211, 'loss/train': 1.544179081916809} 11/07/2021 03:09:06 - INFO - __main__ - Step 42213: {'lr': 0.0004143224825646166, 'samples': 8104896, 'steps': 42212, 'loss/train': 1.3781150579452515} 11/07/2021 03:09:06 - INFO - __main__ - Step 42214: {'lr': 0.000414318483164412, 'samples': 8105088, 'steps': 42213, 'loss/train': 1.4762358665466309} 11/07/2021 03:09:07 - INFO - __main__ - Step 42215: {'lr': 0.000414314483690168, 'samples': 8105280, 'steps': 42214, 'loss/train': 1.572849988937378} 11/07/2021 03:09:07 - INFO - __main__ - Step 42216: {'lr': 0.00041431048414188645, 'samples': 8105472, 'steps': 42215, 'loss/train': 1.6689866781234741} 11/07/2021 03:09:07 - INFO - __main__ - Step 42217: {'lr': 0.00041430648451956913, 'samples': 8105664, 'steps': 42216, 'loss/train': 1.4321280717849731} 11/07/2021 03:09:08 - INFO - __main__ - Step 42218: {'lr': 0.00041430248482321794, 'samples': 8105856, 'steps': 42217, 'loss/train': 1.486643671989441} 11/07/2021 03:09:09 - INFO - __main__ - Step 42219: {'lr': 0.00041429848505283444, 'samples': 8106048, 'steps': 42218, 'loss/train': 1.625313401222229} 11/07/2021 03:09:09 - INFO - __main__ - Step 42220: {'lr': 0.00041429448520842064, 'samples': 8106240, 'steps': 42219, 'loss/train': 1.6230796575546265} 11/07/2021 03:09:09 - INFO - __main__ - Step 42221: {'lr': 0.0004142904852899783, 'samples': 8106432, 'steps': 42220, 'loss/train': 1.590558409690857} 11/07/2021 03:09:10 - INFO - __main__ - Step 42222: {'lr': 0.0004142864852975092, 'samples': 8106624, 'steps': 42221, 'loss/train': 1.4351541996002197} 11/07/2021 03:09:10 - INFO - __main__ - Step 42223: {'lr': 0.00041428248523101507, 'samples': 8106816, 'steps': 42222, 'loss/train': 1.552646279335022} 11/07/2021 03:09:11 - INFO - __main__ - Step 42224: {'lr': 0.0004142784850904978, 'samples': 8107008, 'steps': 42223, 'loss/train': 1.7166039943695068} 11/07/2021 03:09:11 - INFO - __main__ - Step 42225: {'lr': 0.00041427448487595933, 'samples': 8107200, 'steps': 42224, 'loss/train': 1.5216922760009766} 11/07/2021 03:09:12 - INFO - __main__ - Step 42226: {'lr': 0.0004142704845874012, 'samples': 8107392, 'steps': 42225, 'loss/train': 1.6146934032440186} 11/07/2021 03:09:12 - INFO - __main__ - Step 42227: {'lr': 0.00041426648422482527, 'samples': 8107584, 'steps': 42226, 'loss/train': 1.2154557704925537} 11/07/2021 03:09:12 - INFO - __main__ - Step 42228: {'lr': 0.0004142624837882335, 'samples': 8107776, 'steps': 42227, 'loss/train': 1.4375699758529663} 11/07/2021 03:09:14 - INFO - __main__ - Step 42229: {'lr': 0.0004142584832776275, 'samples': 8107968, 'steps': 42228, 'loss/train': 1.8489353656768799} 11/07/2021 03:09:14 - INFO - __main__ - Step 42230: {'lr': 0.00041425448269300923, 'samples': 8108160, 'steps': 42229, 'loss/train': 1.63296377658844} 11/07/2021 03:09:14 - INFO - __main__ - Step 42231: {'lr': 0.00041425048203438036, 'samples': 8108352, 'steps': 42230, 'loss/train': 2.0668020248413086} 11/07/2021 03:09:15 - INFO - __main__ - Step 42232: {'lr': 0.0004142464813017429, 'samples': 8108544, 'steps': 42231, 'loss/train': 1.516548752784729} 11/07/2021 03:09:15 - INFO - __main__ - Step 42233: {'lr': 0.0004142424804950984, 'samples': 8108736, 'steps': 42232, 'loss/train': 1.2571018934249878} 11/07/2021 03:09:16 - INFO - __main__ - Step 42234: {'lr': 0.00041423847961444873, 'samples': 8108928, 'steps': 42233, 'loss/train': 1.8132482767105103} 11/07/2021 03:09:16 - INFO - __main__ - Step 42235: {'lr': 0.0004142344786597958, 'samples': 8109120, 'steps': 42234, 'loss/train': 1.7678147554397583} 11/07/2021 03:09:17 - INFO - __main__ - Step 42236: {'lr': 0.0004142304776311413, 'samples': 8109312, 'steps': 42235, 'loss/train': 1.5495364665985107} 11/07/2021 03:09:17 - INFO - __main__ - Step 42237: {'lr': 0.0004142264765284871, 'samples': 8109504, 'steps': 42236, 'loss/train': 1.6588215827941895} 11/07/2021 03:09:17 - INFO - __main__ - Step 42238: {'lr': 0.0004142224753518351, 'samples': 8109696, 'steps': 42237, 'loss/train': 1.3605972528457642} 11/07/2021 03:09:18 - INFO - __main__ - Step 42239: {'lr': 0.00041421847410118685, 'samples': 8109888, 'steps': 42238, 'loss/train': 1.2603652477264404} 11/07/2021 03:09:19 - INFO - __main__ - Step 42240: {'lr': 0.00041421447277654436, 'samples': 8110080, 'steps': 42239, 'loss/train': 1.2082425355911255} 11/07/2021 03:09:19 - INFO - __main__ - Step 42241: {'lr': 0.0004142104713779093, 'samples': 8110272, 'steps': 42240, 'loss/train': 1.4556949138641357} 11/07/2021 03:09:20 - INFO - __main__ - Step 42242: {'lr': 0.00041420646990528355, 'samples': 8110464, 'steps': 42241, 'loss/train': 1.4256129264831543} 11/07/2021 03:09:20 - INFO - __main__ - Step 42243: {'lr': 0.0004142024683586689, 'samples': 8110656, 'steps': 42242, 'loss/train': 1.448241114616394} 11/07/2021 03:09:20 - INFO - __main__ - Step 42244: {'lr': 0.00041419846673806715, 'samples': 8110848, 'steps': 42243, 'loss/train': 1.2812025547027588} 11/07/2021 03:09:21 - INFO - __main__ - Step 42245: {'lr': 0.0004141944650434801, 'samples': 8111040, 'steps': 42244, 'loss/train': 1.5415171384811401} 11/07/2021 03:09:22 - INFO - __main__ - Step 42246: {'lr': 0.00041419046327490964, 'samples': 8111232, 'steps': 42245, 'loss/train': 1.428864598274231} 11/07/2021 03:09:22 - INFO - __main__ - Step 42247: {'lr': 0.00041418646143235737, 'samples': 8111424, 'steps': 42246, 'loss/train': 2.096343517303467} 11/07/2021 03:09:22 - INFO - __main__ - Step 42248: {'lr': 0.0004141824595158253, 'samples': 8111616, 'steps': 42247, 'loss/train': 1.5350630283355713} 11/07/2021 03:09:23 - INFO - __main__ - Step 42249: {'lr': 0.0004141784575253151, 'samples': 8111808, 'steps': 42248, 'loss/train': 1.8915627002716064} 11/07/2021 03:09:24 - INFO - __main__ - Step 42250: {'lr': 0.0004141744554608287, 'samples': 8112000, 'steps': 42249, 'loss/train': 1.0861523151397705} 11/07/2021 03:09:24 - INFO - __main__ - Step 42251: {'lr': 0.00041417045332236776, 'samples': 8112192, 'steps': 42250, 'loss/train': 1.1745171546936035} 11/07/2021 03:09:24 - INFO - __main__ - Step 42252: {'lr': 0.0004141664511099341, 'samples': 8112384, 'steps': 42251, 'loss/train': 1.573246955871582} 11/07/2021 03:09:25 - INFO - __main__ - Step 42253: {'lr': 0.00041416244882352965, 'samples': 8112576, 'steps': 42252, 'loss/train': 0.9070172905921936} 11/07/2021 03:09:25 - INFO - __main__ - Step 42254: {'lr': 0.00041415844646315613, 'samples': 8112768, 'steps': 42253, 'loss/train': 1.758210301399231} 11/07/2021 03:09:26 - INFO - __main__ - Step 42255: {'lr': 0.0004141544440288153, 'samples': 8112960, 'steps': 42254, 'loss/train': 2.242783546447754} 11/07/2021 03:09:26 - INFO - __main__ - Step 42256: {'lr': 0.0004141504415205091, 'samples': 8113152, 'steps': 42255, 'loss/train': 1.3303405046463013} 11/07/2021 03:09:27 - INFO - __main__ - Step 42257: {'lr': 0.0004141464389382391, 'samples': 8113344, 'steps': 42256, 'loss/train': 1.8815373182296753} 11/07/2021 03:09:27 - INFO - __main__ - Step 42258: {'lr': 0.0004141424362820073, 'samples': 8113536, 'steps': 42257, 'loss/train': 1.2732740640640259} 11/07/2021 03:09:28 - INFO - __main__ - Step 42259: {'lr': 0.0004141384335518155, 'samples': 8113728, 'steps': 42258, 'loss/train': 1.6466920375823975} 11/07/2021 03:09:28 - INFO - __main__ - Step 42260: {'lr': 0.00041413443074766543, 'samples': 8113920, 'steps': 42259, 'loss/train': 1.194049596786499} 11/07/2021 03:09:29 - INFO - __main__ - Step 42261: {'lr': 0.000414130427869559, 'samples': 8114112, 'steps': 42260, 'loss/train': 1.1260548830032349} 11/07/2021 03:09:29 - INFO - __main__ - Step 42262: {'lr': 0.0004141264249174978, 'samples': 8114304, 'steps': 42261, 'loss/train': 1.7151681184768677} 11/07/2021 03:09:30 - INFO - __main__ - Step 42263: {'lr': 0.00041412242189148383, 'samples': 8114496, 'steps': 42262, 'loss/train': 1.5122243165969849} 11/07/2021 03:09:30 - INFO - __main__ - Step 42264: {'lr': 0.00041411841879151877, 'samples': 8114688, 'steps': 42263, 'loss/train': 1.6099839210510254} 11/07/2021 03:09:30 - INFO - __main__ - Step 42265: {'lr': 0.00041411441561760455, 'samples': 8114880, 'steps': 42264, 'loss/train': 1.6253814697265625} 11/07/2021 03:09:31 - INFO - __main__ - Step 42266: {'lr': 0.0004141104123697429, 'samples': 8115072, 'steps': 42265, 'loss/train': 1.6564679145812988} 11/07/2021 03:09:32 - INFO - __main__ - Step 42267: {'lr': 0.00041410640904793563, 'samples': 8115264, 'steps': 42266, 'loss/train': 0.8678615093231201} 11/07/2021 03:09:32 - INFO - __main__ - Step 42268: {'lr': 0.0004141024056521845, 'samples': 8115456, 'steps': 42267, 'loss/train': 0.7243680357933044} 11/07/2021 03:09:32 - INFO - __main__ - Step 42269: {'lr': 0.0004140984021824914, 'samples': 8115648, 'steps': 42268, 'loss/train': 1.637137770652771} 11/07/2021 03:09:33 - INFO - __main__ - Step 42270: {'lr': 0.0004140943986388581, 'samples': 8115840, 'steps': 42269, 'loss/train': 1.5717798471450806} 11/07/2021 03:09:34 - INFO - __main__ - Step 42271: {'lr': 0.00041409039502128634, 'samples': 8116032, 'steps': 42270, 'loss/train': 1.5168558359146118} 11/07/2021 03:09:35 - INFO - __main__ - Step 42272: {'lr': 0.000414086391329778, 'samples': 8116224, 'steps': 42271, 'loss/train': 1.1119903326034546} 11/07/2021 03:09:35 - INFO - __main__ - Step 42273: {'lr': 0.0004140823875643349, 'samples': 8116416, 'steps': 42272, 'loss/train': 1.3936454057693481} 11/07/2021 03:09:36 - INFO - __main__ - Step 42274: {'lr': 0.00041407838372495883, 'samples': 8116608, 'steps': 42273, 'loss/train': 1.0360863208770752} 11/07/2021 03:09:36 - INFO - __main__ - Step 42275: {'lr': 0.00041407437981165154, 'samples': 8116800, 'steps': 42274, 'loss/train': 1.3648031949996948} 11/07/2021 03:09:36 - INFO - __main__ - Step 42276: {'lr': 0.0004140703758244148, 'samples': 8116992, 'steps': 42275, 'loss/train': 1.9282615184783936} 11/07/2021 03:09:37 - INFO - __main__ - Step 42277: {'lr': 0.00041406637176325054, 'samples': 8117184, 'steps': 42276, 'loss/train': 1.2744916677474976} 11/07/2021 03:09:38 - INFO - __main__ - Step 42278: {'lr': 0.00041406236762816053, 'samples': 8117376, 'steps': 42277, 'loss/train': 1.9428569078445435} 11/07/2021 03:09:38 - INFO - __main__ - Step 42279: {'lr': 0.0004140583634191465, 'samples': 8117568, 'steps': 42278, 'loss/train': 1.5542657375335693} 11/07/2021 03:09:38 - INFO - __main__ - Step 42280: {'lr': 0.00041405435913621037, 'samples': 8117760, 'steps': 42279, 'loss/train': 0.7622489333152771} 11/07/2021 03:09:39 - INFO - __main__ - Step 42281: {'lr': 0.0004140503547793538, 'samples': 8117952, 'steps': 42280, 'loss/train': 1.6333931684494019} 11/07/2021 03:09:39 - INFO - __main__ - Step 42282: {'lr': 0.00041404635034857876, 'samples': 8118144, 'steps': 42281, 'loss/train': 1.736616849899292} 11/07/2021 03:09:40 - INFO - __main__ - Step 42283: {'lr': 0.00041404234584388683, 'samples': 8118336, 'steps': 42282, 'loss/train': 1.5493872165679932} 11/07/2021 03:09:40 - INFO - __main__ - Step 42284: {'lr': 0.00041403834126528007, 'samples': 8118528, 'steps': 42283, 'loss/train': 1.6267235279083252} 11/07/2021 03:09:41 - INFO - __main__ - Step 42285: {'lr': 0.00041403433661276015, 'samples': 8118720, 'steps': 42284, 'loss/train': 1.4420843124389648} 11/07/2021 03:09:41 - INFO - __main__ - Step 42286: {'lr': 0.0004140303318863288, 'samples': 8118912, 'steps': 42285, 'loss/train': 1.6940083503723145} 11/07/2021 03:09:41 - INFO - __main__ - Step 42287: {'lr': 0.00041402632708598797, 'samples': 8119104, 'steps': 42286, 'loss/train': 1.706137776374817} 11/07/2021 03:09:43 - INFO - __main__ - Step 42288: {'lr': 0.0004140223222117394, 'samples': 8119296, 'steps': 42287, 'loss/train': 1.910693883895874} 11/07/2021 03:09:43 - INFO - __main__ - Step 42289: {'lr': 0.00041401831726358497, 'samples': 8119488, 'steps': 42288, 'loss/train': 1.5870105028152466} 11/07/2021 03:09:43 - INFO - __main__ - Step 42290: {'lr': 0.0004140143122415263, 'samples': 8119680, 'steps': 42289, 'loss/train': 1.8212772607803345} 11/07/2021 03:09:44 - INFO - __main__ - Step 42291: {'lr': 0.0004140103071455654, 'samples': 8119872, 'steps': 42290, 'loss/train': 1.7726240158081055} 11/07/2021 03:09:44 - INFO - __main__ - Step 42292: {'lr': 0.000414006301975704, 'samples': 8120064, 'steps': 42291, 'loss/train': 1.9648642539978027} 11/07/2021 03:09:44 - INFO - __main__ - Step 42293: {'lr': 0.0004140022967319439, 'samples': 8120256, 'steps': 42292, 'loss/train': 1.6367963552474976} 11/07/2021 03:09:45 - INFO - __main__ - Step 42294: {'lr': 0.0004139982914142868, 'samples': 8120448, 'steps': 42293, 'loss/train': 1.9807569980621338} 11/07/2021 03:09:46 - INFO - __main__ - Step 42295: {'lr': 0.0004139942860227346, 'samples': 8120640, 'steps': 42294, 'loss/train': 1.6433264017105103} 11/07/2021 03:09:46 - INFO - __main__ - Step 42296: {'lr': 0.00041399028055728914, 'samples': 8120832, 'steps': 42295, 'loss/train': 1.5786479711532593} 11/07/2021 03:09:46 - INFO - __main__ - Step 42297: {'lr': 0.0004139862750179523, 'samples': 8121024, 'steps': 42296, 'loss/train': 1.858721375465393} 11/07/2021 03:09:47 - INFO - __main__ - Step 42298: {'lr': 0.0004139822694047256, 'samples': 8121216, 'steps': 42297, 'loss/train': 1.4344762563705444} 11/07/2021 03:09:48 - INFO - __main__ - Step 42299: {'lr': 0.0004139782637176112, 'samples': 8121408, 'steps': 42298, 'loss/train': 1.577717900276184} 11/07/2021 03:09:48 - INFO - __main__ - Step 42300: {'lr': 0.0004139742579566106, 'samples': 8121600, 'steps': 42299, 'loss/train': 1.0138734579086304} 11/07/2021 03:09:49 - INFO - __main__ - Step 42301: {'lr': 0.00041397025212172573, 'samples': 8121792, 'steps': 42300, 'loss/train': 1.2797232866287231} 11/07/2021 03:09:49 - INFO - __main__ - Step 42302: {'lr': 0.00041396624621295843, 'samples': 8121984, 'steps': 42301, 'loss/train': 1.844573974609375} 11/07/2021 03:09:49 - INFO - __main__ - Step 42303: {'lr': 0.00041396224023031045, 'samples': 8122176, 'steps': 42302, 'loss/train': 1.121030569076538} 11/07/2021 03:09:50 - INFO - __main__ - Step 42304: {'lr': 0.0004139582341737836, 'samples': 8122368, 'steps': 42303, 'loss/train': 1.6290957927703857} 11/07/2021 03:09:51 - INFO - __main__ - Step 42305: {'lr': 0.0004139542280433797, 'samples': 8122560, 'steps': 42304, 'loss/train': 1.3857667446136475} 11/07/2021 03:09:51 - INFO - __main__ - Step 42306: {'lr': 0.00041395022183910064, 'samples': 8122752, 'steps': 42305, 'loss/train': 1.6097029447555542} 11/07/2021 03:09:51 - INFO - __main__ - Step 42307: {'lr': 0.00041394621556094805, 'samples': 8122944, 'steps': 42306, 'loss/train': 1.5338916778564453} 11/07/2021 03:09:52 - INFO - __main__ - Step 42308: {'lr': 0.0004139422092089239, 'samples': 8123136, 'steps': 42307, 'loss/train': 1.7702220678329468} 11/07/2021 03:09:53 - INFO - __main__ - Step 42309: {'lr': 0.0004139382027830298, 'samples': 8123328, 'steps': 42308, 'loss/train': 1.5488650798797607} 11/07/2021 03:09:53 - INFO - __main__ - Step 42310: {'lr': 0.00041393419628326777, 'samples': 8123520, 'steps': 42309, 'loss/train': 1.628334879875183} 11/07/2021 03:09:54 - INFO - __main__ - Step 42311: {'lr': 0.00041393018970963945, 'samples': 8123712, 'steps': 42310, 'loss/train': 1.7775880098342896} 11/07/2021 03:09:54 - INFO - __main__ - Step 42312: {'lr': 0.00041392618306214683, 'samples': 8123904, 'steps': 42311, 'loss/train': 1.2932323217391968} 11/07/2021 03:09:54 - INFO - __main__ - Step 42313: {'lr': 0.0004139221763407915, 'samples': 8124096, 'steps': 42312, 'loss/train': 1.6325318813323975} 11/07/2021 03:09:55 - INFO - __main__ - Step 42314: {'lr': 0.00041391816954557543, 'samples': 8124288, 'steps': 42313, 'loss/train': 0.11254347860813141} 11/07/2021 03:09:56 - INFO - __main__ - Step 42315: {'lr': 0.00041391416267650034, 'samples': 8124480, 'steps': 42314, 'loss/train': 1.9047186374664307} 11/07/2021 03:09:56 - INFO - __main__ - Step 42316: {'lr': 0.00041391015573356805, 'samples': 8124672, 'steps': 42315, 'loss/train': 1.5465834140777588} 11/07/2021 03:09:56 - INFO - __main__ - Step 42317: {'lr': 0.0004139061487167804, 'samples': 8124864, 'steps': 42316, 'loss/train': 1.3714628219604492} 11/07/2021 03:09:57 - INFO - __main__ - Step 42318: {'lr': 0.00041390214162613916, 'samples': 8125056, 'steps': 42317, 'loss/train': 1.8584504127502441} 11/07/2021 03:09:57 - INFO - __main__ - Step 42319: {'lr': 0.00041389813446164614, 'samples': 8125248, 'steps': 42318, 'loss/train': 1.3471084833145142} 11/07/2021 03:09:58 - INFO - __main__ - Step 42320: {'lr': 0.0004138941272233031, 'samples': 8125440, 'steps': 42319, 'loss/train': 1.346096396446228} 11/07/2021 03:09:58 - INFO - __main__ - Step 42321: {'lr': 0.0004138901199111119, 'samples': 8125632, 'steps': 42320, 'loss/train': 1.64899480342865} 11/07/2021 03:09:59 - INFO - __main__ - Step 42322: {'lr': 0.00041388611252507446, 'samples': 8125824, 'steps': 42321, 'loss/train': 1.300271987915039} 11/07/2021 03:09:59 - INFO - __main__ - Step 42323: {'lr': 0.0004138821050651923, 'samples': 8126016, 'steps': 42322, 'loss/train': 1.3816546201705933} 11/07/2021 03:10:00 - INFO - __main__ - Step 42324: {'lr': 0.00041387809753146756, 'samples': 8126208, 'steps': 42323, 'loss/train': 1.7210004329681396} 11/07/2021 03:10:00 - INFO - __main__ - Step 42325: {'lr': 0.00041387408992390177, 'samples': 8126400, 'steps': 42324, 'loss/train': 1.4438029527664185} 11/07/2021 03:10:01 - INFO - __main__ - Step 42326: {'lr': 0.0004138700822424968, 'samples': 8126592, 'steps': 42325, 'loss/train': 1.495758056640625} 11/07/2021 03:10:01 - INFO - __main__ - Step 42327: {'lr': 0.0004138660744872547, 'samples': 8126784, 'steps': 42326, 'loss/train': 1.3418431282043457} 11/07/2021 03:10:02 - INFO - __main__ - Step 42328: {'lr': 0.00041386206665817684, 'samples': 8126976, 'steps': 42327, 'loss/train': 5.740207195281982} 11/07/2021 03:10:02 - INFO - __main__ - Step 42329: {'lr': 0.0004138580587552654, 'samples': 8127168, 'steps': 42328, 'loss/train': 1.4846197366714478} 11/07/2021 03:10:02 - INFO - __main__ - Step 42330: {'lr': 0.000413854050778522, 'samples': 8127360, 'steps': 42329, 'loss/train': 1.207411289215088} 11/07/2021 03:10:03 - INFO - __main__ - Step 42331: {'lr': 0.00041385004272794846, 'samples': 8127552, 'steps': 42330, 'loss/train': 1.4573063850402832} 11/07/2021 03:10:04 - INFO - __main__ - Step 42332: {'lr': 0.0004138460346035467, 'samples': 8127744, 'steps': 42331, 'loss/train': 1.7764642238616943} 11/07/2021 03:10:04 - INFO - __main__ - Step 42333: {'lr': 0.0004138420264053184, 'samples': 8127936, 'steps': 42332, 'loss/train': 1.7020927667617798} 11/07/2021 03:10:05 - INFO - __main__ - Step 42334: {'lr': 0.00041383801813326543, 'samples': 8128128, 'steps': 42333, 'loss/train': 1.3734804391860962} 11/07/2021 03:10:05 - INFO - __main__ - Step 42335: {'lr': 0.00041383400978738956, 'samples': 8128320, 'steps': 42334, 'loss/train': 1.374592900276184} 11/07/2021 03:10:05 - INFO - __main__ - Step 42336: {'lr': 0.0004138300013676926, 'samples': 8128512, 'steps': 42335, 'loss/train': 1.3566697835922241} 11/07/2021 03:10:06 - INFO - __main__ - Step 42337: {'lr': 0.0004138259928741764, 'samples': 8128704, 'steps': 42336, 'loss/train': 1.46265709400177} 11/07/2021 03:10:07 - INFO - __main__ - Step 42338: {'lr': 0.0004138219843068427, 'samples': 8128896, 'steps': 42337, 'loss/train': 1.2423937320709229} 11/07/2021 03:10:07 - INFO - __main__ - Step 42339: {'lr': 0.00041381797566569345, 'samples': 8129088, 'steps': 42338, 'loss/train': 1.4230663776397705} 11/07/2021 03:10:07 - INFO - __main__ - Step 42340: {'lr': 0.0004138139669507303, 'samples': 8129280, 'steps': 42339, 'loss/train': 1.0463687181472778} 11/07/2021 03:10:08 - INFO - __main__ - Step 42341: {'lr': 0.000413809958161955, 'samples': 8129472, 'steps': 42340, 'loss/train': 1.5473113059997559} 11/07/2021 03:10:09 - INFO - __main__ - Step 42342: {'lr': 0.0004138059492993695, 'samples': 8129664, 'steps': 42341, 'loss/train': 1.9287792444229126} 11/07/2021 03:10:09 - INFO - __main__ - Step 42343: {'lr': 0.0004138019403629756, 'samples': 8129856, 'steps': 42342, 'loss/train': 1.5084797143936157} 11/07/2021 03:10:09 - INFO - __main__ - Step 42344: {'lr': 0.0004137979313527751, 'samples': 8130048, 'steps': 42343, 'loss/train': 1.1683118343353271} 11/07/2021 03:10:10 - INFO - __main__ - Step 42345: {'lr': 0.00041379392226876974, 'samples': 8130240, 'steps': 42344, 'loss/train': 1.669350028038025} 11/07/2021 03:10:10 - INFO - __main__ - Step 42346: {'lr': 0.0004137899131109614, 'samples': 8130432, 'steps': 42345, 'loss/train': 1.6195399761199951} 11/07/2021 03:10:11 - INFO - __main__ - Step 42347: {'lr': 0.0004137859038793518, 'samples': 8130624, 'steps': 42346, 'loss/train': 2.136760950088501} 11/07/2021 03:10:12 - INFO - __main__ - Step 42348: {'lr': 0.0004137818945739428, 'samples': 8130816, 'steps': 42347, 'loss/train': 1.791646122932434} 11/07/2021 03:10:12 - INFO - __main__ - Step 42349: {'lr': 0.00041377788519473624, 'samples': 8131008, 'steps': 42348, 'loss/train': 1.6567926406860352} 11/07/2021 03:10:12 - INFO - __main__ - Step 42350: {'lr': 0.0004137738757417339, 'samples': 8131200, 'steps': 42349, 'loss/train': 1.6222318410873413} 11/07/2021 03:10:13 - INFO - __main__ - Step 42351: {'lr': 0.0004137698662149375, 'samples': 8131392, 'steps': 42350, 'loss/train': 0.6834756135940552} 11/07/2021 03:10:14 - INFO - __main__ - Step 42352: {'lr': 0.00041376585661434903, 'samples': 8131584, 'steps': 42351, 'loss/train': 1.8909810781478882} 11/07/2021 03:10:14 - INFO - __main__ - Step 42353: {'lr': 0.0004137618469399702, 'samples': 8131776, 'steps': 42352, 'loss/train': 1.8924025297164917} 11/07/2021 03:10:15 - INFO - __main__ - Step 42354: {'lr': 0.0004137578371918027, 'samples': 8131968, 'steps': 42353, 'loss/train': 1.5220388174057007} 11/07/2021 03:10:15 - INFO - __main__ - Step 42355: {'lr': 0.00041375382736984857, 'samples': 8132160, 'steps': 42354, 'loss/train': 1.323188066482544} 11/07/2021 03:10:15 - INFO - __main__ - Step 42356: {'lr': 0.0004137498174741094, 'samples': 8132352, 'steps': 42355, 'loss/train': 1.8916676044464111} 11/07/2021 03:10:16 - INFO - __main__ - Step 42357: {'lr': 0.0004137458075045871, 'samples': 8132544, 'steps': 42356, 'loss/train': 0.9286091923713684} 11/07/2021 03:10:17 - INFO - __main__ - Step 42358: {'lr': 0.0004137417974612835, 'samples': 8132736, 'steps': 42357, 'loss/train': 1.803433895111084} 11/07/2021 03:10:18 - INFO - __main__ - Step 42359: {'lr': 0.0004137377873442004, 'samples': 8132928, 'steps': 42358, 'loss/train': 1.3678960800170898} 11/07/2021 03:10:18 - INFO - __main__ - Step 42360: {'lr': 0.00041373377715333946, 'samples': 8133120, 'steps': 42359, 'loss/train': 1.5719913244247437} 11/07/2021 03:10:19 - INFO - __main__ - Step 42361: {'lr': 0.00041372976688870266, 'samples': 8133312, 'steps': 42360, 'loss/train': 1.2116249799728394} 11/07/2021 03:10:19 - INFO - __main__ - Step 42362: {'lr': 0.0004137257565502918, 'samples': 8133504, 'steps': 42361, 'loss/train': 1.4815484285354614} 11/07/2021 03:10:19 - INFO - __main__ - Step 42363: {'lr': 0.00041372174613810863, 'samples': 8133696, 'steps': 42362, 'loss/train': 1.6371750831604004} 11/07/2021 03:10:20 - INFO - __main__ - Step 42364: {'lr': 0.00041371773565215494, 'samples': 8133888, 'steps': 42363, 'loss/train': 1.8453580141067505} 11/07/2021 03:10:21 - INFO - __main__ - Step 42365: {'lr': 0.00041371372509243256, 'samples': 8134080, 'steps': 42364, 'loss/train': 1.7084465026855469} 11/07/2021 03:10:21 - INFO - __main__ - Step 42366: {'lr': 0.00041370971445894335, 'samples': 8134272, 'steps': 42365, 'loss/train': 1.9868760108947754} 11/07/2021 03:10:21 - INFO - __main__ - Step 42367: {'lr': 0.00041370570375168903, 'samples': 8134464, 'steps': 42366, 'loss/train': 1.4556933641433716} 11/07/2021 03:10:22 - INFO - __main__ - Step 42368: {'lr': 0.00041370169297067145, 'samples': 8134656, 'steps': 42367, 'loss/train': 2.0079658031463623} 11/07/2021 03:10:22 - INFO - __main__ - Step 42369: {'lr': 0.00041369768211589245, 'samples': 8134848, 'steps': 42368, 'loss/train': 1.5647902488708496} 11/07/2021 03:10:22 - INFO - __main__ - Step 42370: {'lr': 0.0004136936711873537, 'samples': 8135040, 'steps': 42369, 'loss/train': 1.5075159072875977} 11/07/2021 03:10:23 - INFO - __main__ - Step 42371: {'lr': 0.0004136896601850572, 'samples': 8135232, 'steps': 42370, 'loss/train': 1.371673345565796} 11/07/2021 03:10:24 - INFO - __main__ - Step 42372: {'lr': 0.0004136856491090046, 'samples': 8135424, 'steps': 42371, 'loss/train': 1.6827363967895508} 11/07/2021 03:10:24 - INFO - __main__ - Step 42373: {'lr': 0.0004136816379591979, 'samples': 8135616, 'steps': 42372, 'loss/train': 2.1556642055511475} 11/07/2021 03:10:24 - INFO - __main__ - Step 42374: {'lr': 0.0004136776267356387, 'samples': 8135808, 'steps': 42373, 'loss/train': 1.7191283702850342} 11/07/2021 03:10:25 - INFO - __main__ - Step 42375: {'lr': 0.0004136736154383288, 'samples': 8136000, 'steps': 42374, 'loss/train': 1.7905488014221191} 11/07/2021 03:10:26 - INFO - __main__ - Step 42376: {'lr': 0.00041366960406727024, 'samples': 8136192, 'steps': 42375, 'loss/train': 1.4930577278137207} 11/07/2021 03:10:26 - INFO - __main__ - Step 42377: {'lr': 0.00041366559262246463, 'samples': 8136384, 'steps': 42376, 'loss/train': 1.5554438829421997} 11/07/2021 03:10:27 - INFO - __main__ - Step 42378: {'lr': 0.00041366158110391375, 'samples': 8136576, 'steps': 42377, 'loss/train': 1.5728462934494019} 11/07/2021 03:10:27 - INFO - __main__ - Step 42379: {'lr': 0.0004136575695116196, 'samples': 8136768, 'steps': 42378, 'loss/train': 1.3974881172180176} 11/07/2021 03:10:27 - INFO - __main__ - Step 42380: {'lr': 0.0004136535578455838, 'samples': 8136960, 'steps': 42379, 'loss/train': 1.3594677448272705} 11/07/2021 03:10:28 - INFO - __main__ - Step 42381: {'lr': 0.0004136495461058083, 'samples': 8137152, 'steps': 42380, 'loss/train': 1.4582617282867432} 11/07/2021 03:10:29 - INFO - __main__ - Step 42382: {'lr': 0.0004136455342922948, 'samples': 8137344, 'steps': 42381, 'loss/train': 1.524465799331665} 11/07/2021 03:10:29 - INFO - __main__ - Step 42383: {'lr': 0.0004136415224050451, 'samples': 8137536, 'steps': 42382, 'loss/train': 1.5900578498840332} 11/07/2021 03:10:29 - INFO - __main__ - Step 42384: {'lr': 0.0004136375104440611, 'samples': 8137728, 'steps': 42383, 'loss/train': 1.1075342893600464} 11/07/2021 03:10:30 - INFO - __main__ - Step 42385: {'lr': 0.0004136334984093446, 'samples': 8137920, 'steps': 42384, 'loss/train': 0.9145182967185974} 11/07/2021 03:10:31 - INFO - __main__ - Step 42386: {'lr': 0.0004136294863008974, 'samples': 8138112, 'steps': 42385, 'loss/train': 5.843629837036133} 11/07/2021 03:10:31 - INFO - __main__ - Step 42387: {'lr': 0.00041362547411872116, 'samples': 8138304, 'steps': 42386, 'loss/train': 0.8810856342315674} 11/07/2021 03:10:31 - INFO - __main__ - Step 42388: {'lr': 0.00041362146186281777, 'samples': 8138496, 'steps': 42387, 'loss/train': 1.6148850917816162} 11/07/2021 03:10:32 - INFO - __main__ - Step 42389: {'lr': 0.00041361744953318923, 'samples': 8138688, 'steps': 42388, 'loss/train': 1.2505319118499756} 11/07/2021 03:10:32 - INFO - __main__ - Step 42390: {'lr': 0.0004136134371298371, 'samples': 8138880, 'steps': 42389, 'loss/train': 1.6921306848526} 11/07/2021 03:10:33 - INFO - __main__ - Step 42391: {'lr': 0.0004136094246527633, 'samples': 8139072, 'steps': 42390, 'loss/train': 1.567236065864563} 11/07/2021 03:10:34 - INFO - __main__ - Step 42392: {'lr': 0.0004136054121019697, 'samples': 8139264, 'steps': 42391, 'loss/train': 1.408864140510559} 11/07/2021 03:10:34 - INFO - __main__ - Step 42393: {'lr': 0.0004136013994774579, 'samples': 8139456, 'steps': 42392, 'loss/train': 1.432043194770813} 11/07/2021 03:10:34 - INFO - __main__ - Step 42394: {'lr': 0.00041359738677922993, 'samples': 8139648, 'steps': 42393, 'loss/train': 1.5740227699279785} 11/07/2021 03:10:35 - INFO - __main__ - Step 42395: {'lr': 0.00041359337400728746, 'samples': 8139840, 'steps': 42394, 'loss/train': 0.7346595525741577} 11/07/2021 03:10:35 - INFO - __main__ - Step 42396: {'lr': 0.00041358936116163224, 'samples': 8140032, 'steps': 42395, 'loss/train': 1.6192106008529663} 11/07/2021 03:10:36 - INFO - __main__ - Step 42397: {'lr': 0.00041358534824226635, 'samples': 8140224, 'steps': 42396, 'loss/train': 1.6648677587509155} 11/07/2021 03:10:36 - INFO - __main__ - Step 42398: {'lr': 0.0004135813352491913, 'samples': 8140416, 'steps': 42397, 'loss/train': 1.7205020189285278} 11/07/2021 03:10:37 - INFO - __main__ - Step 42399: {'lr': 0.00041357732218240905, 'samples': 8140608, 'steps': 42398, 'loss/train': 1.6916424036026} 11/07/2021 03:10:37 - INFO - __main__ - Step 42400: {'lr': 0.0004135733090419215, 'samples': 8140800, 'steps': 42399, 'loss/train': 1.7660330533981323} 11/07/2021 03:10:37 - INFO - __main__ - Step 42401: {'lr': 0.00041356929582773023, 'samples': 8140992, 'steps': 42400, 'loss/train': 1.6983288526535034} 11/07/2021 03:10:38 - INFO - __main__ - Step 42402: {'lr': 0.00041356528253983714, 'samples': 8141184, 'steps': 42401, 'loss/train': 1.5337250232696533} 11/07/2021 03:10:39 - INFO - __main__ - Step 42403: {'lr': 0.0004135612691782441, 'samples': 8141376, 'steps': 42402, 'loss/train': 2.1747608184814453} 11/07/2021 03:10:39 - INFO - __main__ - Step 42404: {'lr': 0.0004135572557429529, 'samples': 8141568, 'steps': 42403, 'loss/train': 1.6874691247940063} 11/07/2021 03:10:39 - INFO - __main__ - Step 42405: {'lr': 0.0004135532422339653, 'samples': 8141760, 'steps': 42404, 'loss/train': 1.4186465740203857} 11/07/2021 03:10:40 - INFO - __main__ - Step 42406: {'lr': 0.00041354922865128316, 'samples': 8141952, 'steps': 42405, 'loss/train': 1.2541090250015259} 11/07/2021 03:10:41 - INFO - __main__ - Step 42407: {'lr': 0.00041354521499490813, 'samples': 8142144, 'steps': 42406, 'loss/train': 1.4607620239257812} 11/07/2021 03:10:41 - INFO - __main__ - Step 42408: {'lr': 0.00041354120126484227, 'samples': 8142336, 'steps': 42407, 'loss/train': 1.6432466506958008} 11/07/2021 03:10:42 - INFO - __main__ - Step 42409: {'lr': 0.00041353718746108724, 'samples': 8142528, 'steps': 42408, 'loss/train': 1.1523303985595703} 11/07/2021 03:10:42 - INFO - __main__ - Step 42410: {'lr': 0.00041353317358364496, 'samples': 8142720, 'steps': 42409, 'loss/train': 1.0971400737762451} 11/07/2021 03:10:42 - INFO - __main__ - Step 42411: {'lr': 0.00041352915963251705, 'samples': 8142912, 'steps': 42410, 'loss/train': 1.179257869720459} 11/07/2021 03:10:43 - INFO - __main__ - Step 42412: {'lr': 0.00041352514560770545, 'samples': 8143104, 'steps': 42411, 'loss/train': 1.554078459739685} 11/07/2021 03:10:44 - INFO - __main__ - Step 42413: {'lr': 0.000413521131509212, 'samples': 8143296, 'steps': 42412, 'loss/train': 1.9412667751312256} 11/07/2021 03:10:44 - INFO - __main__ - Step 42414: {'lr': 0.0004135171173370383, 'samples': 8143488, 'steps': 42413, 'loss/train': 1.1289230585098267} 11/07/2021 03:10:44 - INFO - __main__ - Step 42415: {'lr': 0.00041351310309118653, 'samples': 8143680, 'steps': 42414, 'loss/train': 0.7985998392105103} 11/07/2021 03:10:45 - INFO - __main__ - Step 42416: {'lr': 0.00041350908877165805, 'samples': 8143872, 'steps': 42415, 'loss/train': 1.303896188735962} 11/07/2021 03:10:46 - INFO - __main__ - Step 42417: {'lr': 0.00041350507437845505, 'samples': 8144064, 'steps': 42416, 'loss/train': 1.6557601690292358} 11/07/2021 03:10:46 - INFO - __main__ - Step 42418: {'lr': 0.00041350105991157915, 'samples': 8144256, 'steps': 42417, 'loss/train': 1.5549960136413574} 11/07/2021 03:10:46 - INFO - __main__ - Step 42419: {'lr': 0.00041349704537103216, 'samples': 8144448, 'steps': 42418, 'loss/train': 1.4929468631744385} 11/07/2021 03:10:47 - INFO - __main__ - Step 42420: {'lr': 0.000413493030756816, 'samples': 8144640, 'steps': 42419, 'loss/train': 1.7966359853744507} 11/07/2021 03:10:47 - INFO - __main__ - Step 42421: {'lr': 0.0004134890160689323, 'samples': 8144832, 'steps': 42420, 'loss/train': 1.2032808065414429} 11/07/2021 03:10:48 - INFO - __main__ - Step 42422: {'lr': 0.000413485001307383, 'samples': 8145024, 'steps': 42421, 'loss/train': 1.6719205379486084} 11/07/2021 03:10:49 - INFO - __main__ - Step 42423: {'lr': 0.00041348098647216993, 'samples': 8145216, 'steps': 42422, 'loss/train': 1.4168541431427002} 11/07/2021 03:10:49 - INFO - __main__ - Step 42424: {'lr': 0.00041347697156329485, 'samples': 8145408, 'steps': 42423, 'loss/train': 1.4836190938949585} 11/07/2021 03:10:49 - INFO - __main__ - Step 42425: {'lr': 0.00041347295658075955, 'samples': 8145600, 'steps': 42424, 'loss/train': 1.6768486499786377} 11/07/2021 03:10:50 - INFO - __main__ - Step 42426: {'lr': 0.00041346894152456584, 'samples': 8145792, 'steps': 42425, 'loss/train': 1.65650475025177} 11/07/2021 03:10:50 - INFO - __main__ - Step 42427: {'lr': 0.00041346492639471555, 'samples': 8145984, 'steps': 42426, 'loss/train': 1.1028519868850708} 11/07/2021 03:10:51 - INFO - __main__ - Step 42428: {'lr': 0.0004134609111912105, 'samples': 8146176, 'steps': 42427, 'loss/train': 0.635445773601532} 11/07/2021 03:10:51 - INFO - __main__ - Step 42429: {'lr': 0.00041345689591405256, 'samples': 8146368, 'steps': 42428, 'loss/train': 1.371331810951233} 11/07/2021 03:10:52 - INFO - __main__ - Step 42430: {'lr': 0.0004134528805632434, 'samples': 8146560, 'steps': 42429, 'loss/train': 1.5687236785888672} 11/07/2021 03:10:52 - INFO - __main__ - Step 42431: {'lr': 0.00041344886513878485, 'samples': 8146752, 'steps': 42430, 'loss/train': 1.7651448249816895} 11/07/2021 03:10:52 - INFO - __main__ - Step 42432: {'lr': 0.00041344484964067873, 'samples': 8146944, 'steps': 42431, 'loss/train': 1.099167823791504} 11/07/2021 03:10:54 - INFO - __main__ - Step 42433: {'lr': 0.00041344083406892704, 'samples': 8147136, 'steps': 42432, 'loss/train': 1.468461513519287} 11/07/2021 03:10:54 - INFO - __main__ - Step 42434: {'lr': 0.0004134368184235313, 'samples': 8147328, 'steps': 42433, 'loss/train': 1.3953205347061157} 11/07/2021 03:10:54 - INFO - __main__ - Step 42435: {'lr': 0.0004134328027044935, 'samples': 8147520, 'steps': 42434, 'loss/train': 1.4772852659225464} 11/07/2021 03:10:55 - INFO - __main__ - Step 42436: {'lr': 0.0004134287869118154, 'samples': 8147712, 'steps': 42435, 'loss/train': 1.740281343460083} 11/07/2021 03:10:55 - INFO - __main__ - Step 42437: {'lr': 0.0004134247710454988, 'samples': 8147904, 'steps': 42436, 'loss/train': 1.4107420444488525} 11/07/2021 03:10:55 - INFO - __main__ - Step 42438: {'lr': 0.00041342075510554554, 'samples': 8148096, 'steps': 42437, 'loss/train': 1.4461491107940674} 11/07/2021 03:10:56 - INFO - __main__ - Step 42439: {'lr': 0.0004134167390919574, 'samples': 8148288, 'steps': 42438, 'loss/train': 1.2683757543563843} 11/07/2021 03:10:57 - INFO - __main__ - Step 42440: {'lr': 0.0004134127230047362, 'samples': 8148480, 'steps': 42439, 'loss/train': 1.5026410818099976} 11/07/2021 03:10:57 - INFO - __main__ - Step 42441: {'lr': 0.00041340870684388375, 'samples': 8148672, 'steps': 42440, 'loss/train': 1.4890927076339722} 11/07/2021 03:10:57 - INFO - __main__ - Step 42442: {'lr': 0.00041340469060940183, 'samples': 8148864, 'steps': 42441, 'loss/train': 1.7618567943572998} 11/07/2021 03:10:58 - INFO - __main__ - Step 42443: {'lr': 0.0004134006743012923, 'samples': 8149056, 'steps': 42442, 'loss/train': 1.6861194372177124} 11/07/2021 03:10:59 - INFO - __main__ - Step 42444: {'lr': 0.00041339665791955695, 'samples': 8149248, 'steps': 42443, 'loss/train': 1.6956210136413574} 11/07/2021 03:10:59 - INFO - __main__ - Step 42445: {'lr': 0.00041339264146419757, 'samples': 8149440, 'steps': 42444, 'loss/train': 1.5215418338775635} 11/07/2021 03:10:59 - INFO - __main__ - Step 42446: {'lr': 0.000413388624935216, 'samples': 8149632, 'steps': 42445, 'loss/train': 1.765880823135376} 11/07/2021 03:11:00 - INFO - __main__ - Step 42447: {'lr': 0.00041338460833261403, 'samples': 8149824, 'steps': 42446, 'loss/train': 1.688170075416565} 11/07/2021 03:11:00 - INFO - __main__ - Step 42448: {'lr': 0.0004133805916563935, 'samples': 8150016, 'steps': 42447, 'loss/train': 1.2641940116882324} 11/07/2021 03:11:01 - INFO - __main__ - Step 42449: {'lr': 0.00041337657490655625, 'samples': 8150208, 'steps': 42448, 'loss/train': 1.819398283958435} 11/07/2021 03:11:01 - INFO - __main__ - Step 42450: {'lr': 0.00041337255808310394, 'samples': 8150400, 'steps': 42449, 'loss/train': 1.7530895471572876} 11/07/2021 03:11:02 - INFO - __main__ - Step 42451: {'lr': 0.0004133685411860385, 'samples': 8150592, 'steps': 42450, 'loss/train': 1.6568763256072998} 11/07/2021 03:11:02 - INFO - __main__ - Step 42452: {'lr': 0.0004133645242153617, 'samples': 8150784, 'steps': 42451, 'loss/train': 1.6520686149597168} 11/07/2021 03:11:03 - INFO - __main__ - Step 42453: {'lr': 0.0004133605071710754, 'samples': 8150976, 'steps': 42452, 'loss/train': 1.8637690544128418} 11/07/2021 03:11:04 - INFO - __main__ - Step 42454: {'lr': 0.00041335649005318133, 'samples': 8151168, 'steps': 42453, 'loss/train': 2.0649642944335938} 11/07/2021 03:11:04 - INFO - __main__ - Step 42455: {'lr': 0.0004133524728616814, 'samples': 8151360, 'steps': 42454, 'loss/train': 1.7693240642547607} 11/07/2021 03:11:04 - INFO - __main__ - Step 42456: {'lr': 0.00041334845559657735, 'samples': 8151552, 'steps': 42455, 'loss/train': 1.2166308164596558} 11/07/2021 03:11:05 - INFO - __main__ - Step 42457: {'lr': 0.00041334443825787097, 'samples': 8151744, 'steps': 42456, 'loss/train': 1.0859808921813965} 11/07/2021 03:11:05 - INFO - __main__ - Step 42458: {'lr': 0.0004133404208455642, 'samples': 8151936, 'steps': 42457, 'loss/train': 1.2377136945724487} 11/07/2021 03:11:06 - INFO - __main__ - Step 42459: {'lr': 0.00041333640335965865, 'samples': 8152128, 'steps': 42458, 'loss/train': 1.615179181098938} 11/07/2021 03:11:06 - INFO - __main__ - Step 42460: {'lr': 0.0004133323858001563, 'samples': 8152320, 'steps': 42459, 'loss/train': 1.0365239381790161} 11/07/2021 03:11:07 - INFO - __main__ - Step 42461: {'lr': 0.0004133283681670589, 'samples': 8152512, 'steps': 42460, 'loss/train': 1.2702975273132324} 11/07/2021 03:11:07 - INFO - __main__ - Step 42462: {'lr': 0.0004133243504603682, 'samples': 8152704, 'steps': 42461, 'loss/train': 1.4783105850219727} 11/07/2021 03:11:07 - INFO - __main__ - Step 42463: {'lr': 0.0004133203326800861, 'samples': 8152896, 'steps': 42462, 'loss/train': 1.4924323558807373} 11/07/2021 03:11:08 - INFO - __main__ - Step 42464: {'lr': 0.0004133163148262144, 'samples': 8153088, 'steps': 42463, 'loss/train': 1.103530764579773} 11/07/2021 03:11:09 - INFO - __main__ - Step 42465: {'lr': 0.00041331229689875487, 'samples': 8153280, 'steps': 42464, 'loss/train': 1.96661376953125} 11/07/2021 03:11:09 - INFO - __main__ - Step 42466: {'lr': 0.0004133082788977093, 'samples': 8153472, 'steps': 42465, 'loss/train': 1.1008806228637695} 11/07/2021 03:11:10 - INFO - __main__ - Step 42467: {'lr': 0.00041330426082307963, 'samples': 8153664, 'steps': 42466, 'loss/train': 1.5295052528381348} 11/07/2021 03:11:10 - INFO - __main__ - Step 42468: {'lr': 0.0004133002426748675, 'samples': 8153856, 'steps': 42467, 'loss/train': 1.3837577104568481} 11/07/2021 03:11:10 - INFO - __main__ - Step 42469: {'lr': 0.0004132962244530749, 'samples': 8154048, 'steps': 42468, 'loss/train': 1.902879238128662} 11/07/2021 03:11:11 - INFO - __main__ - Step 42470: {'lr': 0.0004132922061577035, 'samples': 8154240, 'steps': 42469, 'loss/train': 1.6521797180175781} 11/07/2021 03:11:12 - INFO - __main__ - Step 42471: {'lr': 0.0004132881877887551, 'samples': 8154432, 'steps': 42470, 'loss/train': 1.247977375984192} 11/07/2021 03:11:12 - INFO - __main__ - Step 42472: {'lr': 0.0004132841693462315, 'samples': 8154624, 'steps': 42471, 'loss/train': 1.312026858329773} 11/07/2021 03:11:12 - INFO - __main__ - Step 42473: {'lr': 0.0004132801508301347, 'samples': 8154816, 'steps': 42472, 'loss/train': 1.3885867595672607} 11/07/2021 03:11:13 - INFO - __main__ - Step 42474: {'lr': 0.0004132761322404663, 'samples': 8155008, 'steps': 42473, 'loss/train': 0.9789870977401733} 11/07/2021 03:11:14 - INFO - __main__ - Step 42475: {'lr': 0.00041327211357722825, 'samples': 8155200, 'steps': 42474, 'loss/train': 0.8852501511573792} 11/07/2021 03:11:14 - INFO - __main__ - Step 42476: {'lr': 0.00041326809484042235, 'samples': 8155392, 'steps': 42475, 'loss/train': 1.377720594406128} 11/07/2021 03:11:15 - INFO - __main__ - Step 42477: {'lr': 0.0004132640760300503, 'samples': 8155584, 'steps': 42476, 'loss/train': 1.496697187423706} 11/07/2021 03:11:15 - INFO - __main__ - Step 42478: {'lr': 0.000413260057146114, 'samples': 8155776, 'steps': 42477, 'loss/train': 1.9076615571975708} 11/07/2021 03:11:15 - INFO - __main__ - Step 42479: {'lr': 0.00041325603818861517, 'samples': 8155968, 'steps': 42478, 'loss/train': 1.923953652381897} 11/07/2021 03:11:16 - INFO - __main__ - Step 42480: {'lr': 0.0004132520191575558, 'samples': 8156160, 'steps': 42479, 'loss/train': 1.850592017173767} 11/07/2021 03:11:16 - INFO - __main__ - Step 42481: {'lr': 0.0004132480000529375, 'samples': 8156352, 'steps': 42480, 'loss/train': 1.101906180381775} 11/07/2021 03:11:17 - INFO - __main__ - Step 42482: {'lr': 0.0004132439808747622, 'samples': 8156544, 'steps': 42481, 'loss/train': 1.372361421585083} 11/07/2021 03:11:17 - INFO - __main__ - Step 42483: {'lr': 0.00041323996162303167, 'samples': 8156736, 'steps': 42482, 'loss/train': 1.4120784997940063} 11/07/2021 03:11:18 - INFO - __main__ - Step 42484: {'lr': 0.0004132359422977477, 'samples': 8156928, 'steps': 42483, 'loss/train': 1.6222646236419678} 11/07/2021 03:11:19 - INFO - __main__ - Step 42485: {'lr': 0.0004132319228989122, 'samples': 8157120, 'steps': 42484, 'loss/train': 1.6639920473098755} 11/07/2021 03:11:19 - INFO - __main__ - Step 42486: {'lr': 0.00041322790342652695, 'samples': 8157312, 'steps': 42485, 'loss/train': 0.6607436537742615} 11/07/2021 03:11:19 - INFO - __main__ - Step 42487: {'lr': 0.00041322388388059366, 'samples': 8157504, 'steps': 42486, 'loss/train': 1.8186523914337158} 11/07/2021 03:11:20 - INFO - __main__ - Step 42488: {'lr': 0.0004132198642611142, 'samples': 8157696, 'steps': 42487, 'loss/train': 0.4286053478717804} 11/07/2021 03:11:20 - INFO - __main__ - Step 42489: {'lr': 0.0004132158445680904, 'samples': 8157888, 'steps': 42488, 'loss/train': 1.5934503078460693} 11/07/2021 03:11:21 - INFO - __main__ - Step 42490: {'lr': 0.0004132118248015241, 'samples': 8158080, 'steps': 42489, 'loss/train': 1.6939036846160889} 11/07/2021 03:11:21 - INFO - __main__ - Step 42491: {'lr': 0.000413207804961417, 'samples': 8158272, 'steps': 42490, 'loss/train': 1.4884772300720215} 11/07/2021 03:11:22 - INFO - __main__ - Step 42492: {'lr': 0.000413203785047771, 'samples': 8158464, 'steps': 42491, 'loss/train': 1.7436952590942383} 11/07/2021 03:11:22 - INFO - __main__ - Step 42493: {'lr': 0.00041319976506058785, 'samples': 8158656, 'steps': 42492, 'loss/train': 0.9843049049377441} 11/07/2021 03:11:22 - INFO - __main__ - Step 42494: {'lr': 0.00041319574499986957, 'samples': 8158848, 'steps': 42493, 'loss/train': 1.8652119636535645} 11/07/2021 03:11:23 - INFO - __main__ - Step 42495: {'lr': 0.0004131917248656177, 'samples': 8159040, 'steps': 42494, 'loss/train': 1.465429663658142} 11/07/2021 03:11:24 - INFO - __main__ - Step 42496: {'lr': 0.0004131877046578341, 'samples': 8159232, 'steps': 42495, 'loss/train': 1.3295153379440308} 11/07/2021 03:11:24 - INFO - __main__ - Step 42497: {'lr': 0.0004131836843765207, 'samples': 8159424, 'steps': 42496, 'loss/train': 1.6745126247406006} 11/07/2021 03:11:24 - INFO - __main__ - Step 42498: {'lr': 0.00041317966402167923, 'samples': 8159616, 'steps': 42497, 'loss/train': 1.2073416709899902} 11/07/2021 03:11:25 - INFO - __main__ - Step 42499: {'lr': 0.0004131756435933115, 'samples': 8159808, 'steps': 42498, 'loss/train': 1.2210643291473389} 11/07/2021 03:11:25 - INFO - __main__ - Step 42500: {'lr': 0.00041317162309141944, 'samples': 8160000, 'steps': 42499, 'loss/train': 1.4471495151519775} 11/07/2021 03:11:26 - INFO - __main__ - Step 42501: {'lr': 0.00041316760251600474, 'samples': 8160192, 'steps': 42500, 'loss/train': 1.6119571924209595} 11/07/2021 03:11:27 - INFO - __main__ - Step 42502: {'lr': 0.00041316358186706915, 'samples': 8160384, 'steps': 42501, 'loss/train': 0.9305280447006226} 11/07/2021 03:11:27 - INFO - __main__ - Step 42503: {'lr': 0.0004131595611446146, 'samples': 8160576, 'steps': 42502, 'loss/train': 1.4976180791854858} 11/07/2021 03:11:27 - INFO - __main__ - Step 42504: {'lr': 0.0004131555403486429, 'samples': 8160768, 'steps': 42503, 'loss/train': 1.690905213356018} 11/07/2021 03:11:28 - INFO - __main__ - Step 42505: {'lr': 0.00041315151947915577, 'samples': 8160960, 'steps': 42504, 'loss/train': 1.7861652374267578} 11/07/2021 03:11:29 - INFO - __main__ - Step 42506: {'lr': 0.0004131474985361551, 'samples': 8161152, 'steps': 42505, 'loss/train': 1.7889028787612915} 11/07/2021 03:11:29 - INFO - __main__ - Step 42507: {'lr': 0.0004131434775196428, 'samples': 8161344, 'steps': 42506, 'loss/train': 1.2052451372146606} 11/07/2021 03:11:29 - INFO - __main__ - Step 42508: {'lr': 0.0004131394564296205, 'samples': 8161536, 'steps': 42507, 'loss/train': 1.2064223289489746} 11/07/2021 03:11:30 - INFO - __main__ - Step 42509: {'lr': 0.00041313543526609, 'samples': 8161728, 'steps': 42508, 'loss/train': 1.6265215873718262} 11/07/2021 03:11:30 - INFO - __main__ - Step 42510: {'lr': 0.00041313141402905324, 'samples': 8161920, 'steps': 42509, 'loss/train': 1.6610273122787476} 11/07/2021 03:11:31 - INFO - __main__ - Step 42511: {'lr': 0.00041312739271851196, 'samples': 8162112, 'steps': 42510, 'loss/train': 1.5313423871994019} 11/07/2021 03:11:31 - INFO - __main__ - Step 42512: {'lr': 0.0004131233713344681, 'samples': 8162304, 'steps': 42511, 'loss/train': 1.8610824346542358} 11/07/2021 03:11:32 - INFO - __main__ - Step 42513: {'lr': 0.0004131193498769232, 'samples': 8162496, 'steps': 42512, 'loss/train': 1.57438325881958} 11/07/2021 03:11:32 - INFO - __main__ - Step 42514: {'lr': 0.0004131153283458794, 'samples': 8162688, 'steps': 42513, 'loss/train': 1.455344796180725} 11/07/2021 03:11:32 - INFO - __main__ - Step 42515: {'lr': 0.00041311130674133824, 'samples': 8162880, 'steps': 42514, 'loss/train': 1.7059967517852783} 11/07/2021 03:11:33 - INFO - __main__ - Step 42516: {'lr': 0.0004131072850633017, 'samples': 8163072, 'steps': 42515, 'loss/train': 1.568089246749878} 11/07/2021 03:11:34 - INFO - __main__ - Step 42517: {'lr': 0.0004131032633117715, 'samples': 8163264, 'steps': 42516, 'loss/train': 1.0315004587173462} 11/07/2021 03:11:34 - INFO - __main__ - Step 42518: {'lr': 0.0004130992414867495, 'samples': 8163456, 'steps': 42517, 'loss/train': 1.199580430984497} 11/07/2021 03:11:35 - INFO - __main__ - Step 42519: {'lr': 0.0004130952195882375, 'samples': 8163648, 'steps': 42518, 'loss/train': 1.2123099565505981} 11/07/2021 03:11:35 - INFO - __main__ - Step 42520: {'lr': 0.0004130911976162373, 'samples': 8163840, 'steps': 42519, 'loss/train': 1.5549166202545166} 11/07/2021 03:11:36 - INFO - __main__ - Step 42521: {'lr': 0.0004130871755707508, 'samples': 8164032, 'steps': 42520, 'loss/train': 1.4823044538497925} 11/07/2021 03:11:36 - INFO - __main__ - Step 42522: {'lr': 0.0004130831534517796, 'samples': 8164224, 'steps': 42521, 'loss/train': 1.6366355419158936} 11/07/2021 03:11:37 - INFO - __main__ - Step 42523: {'lr': 0.00041307913125932574, 'samples': 8164416, 'steps': 42522, 'loss/train': 1.189083456993103} 11/07/2021 03:11:37 - INFO - __main__ - Step 42524: {'lr': 0.00041307510899339097, 'samples': 8164608, 'steps': 42523, 'loss/train': 1.2785511016845703} 11/07/2021 03:11:37 - INFO - __main__ - Step 42525: {'lr': 0.00041307108665397695, 'samples': 8164800, 'steps': 42524, 'loss/train': 1.7492245435714722} 11/07/2021 03:11:39 - INFO - __main__ - Step 42526: {'lr': 0.00041306706424108563, 'samples': 8164992, 'steps': 42525, 'loss/train': 1.626507043838501} 11/07/2021 03:11:39 - INFO - __main__ - Step 42527: {'lr': 0.0004130630417547189, 'samples': 8165184, 'steps': 42526, 'loss/train': 1.6828103065490723} 11/07/2021 03:11:39 - INFO - __main__ - Step 42528: {'lr': 0.00041305901919487845, 'samples': 8165376, 'steps': 42527, 'loss/train': 1.7451270818710327} 11/07/2021 03:11:40 - INFO - __main__ - Step 42529: {'lr': 0.0004130549965615661, 'samples': 8165568, 'steps': 42528, 'loss/train': 1.639512538909912} 11/07/2021 03:11:40 - INFO - __main__ - Step 42530: {'lr': 0.00041305097385478375, 'samples': 8165760, 'steps': 42529, 'loss/train': 1.508062481880188} 11/07/2021 03:11:40 - INFO - __main__ - Step 42531: {'lr': 0.00041304695107453307, 'samples': 8165952, 'steps': 42530, 'loss/train': 1.6047009229660034} 11/07/2021 03:11:41 - INFO - __main__ - Step 42532: {'lr': 0.000413042928220816, 'samples': 8166144, 'steps': 42531, 'loss/train': 0.9355162978172302} 11/07/2021 03:11:42 - INFO - __main__ - Step 42533: {'lr': 0.0004130389052936342, 'samples': 8166336, 'steps': 42532, 'loss/train': 1.5015809535980225} 11/07/2021 03:11:42 - INFO - __main__ - Step 42534: {'lr': 0.0004130348822929897, 'samples': 8166528, 'steps': 42533, 'loss/train': 1.8318469524383545} 11/07/2021 03:11:42 - INFO - __main__ - Step 42535: {'lr': 0.0004130308592188842, 'samples': 8166720, 'steps': 42534, 'loss/train': 1.5886842012405396} 11/07/2021 03:11:43 - INFO - __main__ - Step 42536: {'lr': 0.0004130268360713194, 'samples': 8166912, 'steps': 42535, 'loss/train': 1.339535117149353} 11/07/2021 03:11:44 - INFO - __main__ - Step 42537: {'lr': 0.0004130228128502973, 'samples': 8167104, 'steps': 42536, 'loss/train': 0.9194119572639465} 11/07/2021 03:11:44 - INFO - __main__ - Step 42538: {'lr': 0.0004130187895558196, 'samples': 8167296, 'steps': 42537, 'loss/train': 1.4769353866577148} 11/07/2021 03:11:44 - INFO - __main__ - Step 42539: {'lr': 0.00041301476618788827, 'samples': 8167488, 'steps': 42538, 'loss/train': 0.9612076282501221} 11/07/2021 03:11:45 - INFO - __main__ - Step 42540: {'lr': 0.0004130107427465049, 'samples': 8167680, 'steps': 42539, 'loss/train': 1.7081719636917114} 11/07/2021 03:11:45 - INFO - __main__ - Step 42541: {'lr': 0.00041300671923167145, 'samples': 8167872, 'steps': 42540, 'loss/train': 1.4179370403289795} 11/07/2021 03:11:46 - INFO - __main__ - Step 42542: {'lr': 0.00041300269564338956, 'samples': 8168064, 'steps': 42541, 'loss/train': 1.123705267906189} 11/07/2021 03:11:47 - INFO - __main__ - Step 42543: {'lr': 0.0004129986719816613, 'samples': 8168256, 'steps': 42542, 'loss/train': 1.6389899253845215} 11/07/2021 03:11:47 - INFO - __main__ - Step 42544: {'lr': 0.0004129946482464883, 'samples': 8168448, 'steps': 42543, 'loss/train': 0.9306034445762634} 11/07/2021 03:11:47 - INFO - __main__ - Step 42545: {'lr': 0.0004129906244378724, 'samples': 8168640, 'steps': 42544, 'loss/train': 1.734245777130127} 11/07/2021 03:11:48 - INFO - __main__ - Step 42546: {'lr': 0.0004129866005558155, 'samples': 8168832, 'steps': 42545, 'loss/train': 1.7263352870941162} 11/07/2021 03:11:49 - INFO - __main__ - Step 42547: {'lr': 0.00041298257660031935, 'samples': 8169024, 'steps': 42546, 'loss/train': 0.9408368468284607} 11/07/2021 03:11:49 - INFO - __main__ - Step 42548: {'lr': 0.00041297855257138577, 'samples': 8169216, 'steps': 42547, 'loss/train': 1.384551763534546} 11/07/2021 03:11:49 - INFO - __main__ - Step 42549: {'lr': 0.0004129745284690165, 'samples': 8169408, 'steps': 42548, 'loss/train': 1.6924649477005005} 11/07/2021 03:11:50 - INFO - __main__ - Step 42550: {'lr': 0.0004129705042932135, 'samples': 8169600, 'steps': 42549, 'loss/train': 1.3338024616241455} 11/07/2021 03:11:50 - INFO - __main__ - Step 42551: {'lr': 0.0004129664800439785, 'samples': 8169792, 'steps': 42550, 'loss/train': 1.6320832967758179} 11/07/2021 03:11:51 - INFO - __main__ - Step 42552: {'lr': 0.0004129624557213133, 'samples': 8169984, 'steps': 42551, 'loss/train': 1.28696870803833} 11/07/2021 03:11:51 - INFO - __main__ - Step 42553: {'lr': 0.00041295843132521973, 'samples': 8170176, 'steps': 42552, 'loss/train': 1.7522774934768677} 11/07/2021 03:11:52 - INFO - __main__ - Step 42554: {'lr': 0.0004129544068556996, 'samples': 8170368, 'steps': 42553, 'loss/train': 1.6836118698120117} 11/07/2021 03:11:52 - INFO - __main__ - Step 42555: {'lr': 0.00041295038231275473, 'samples': 8170560, 'steps': 42554, 'loss/train': 1.6868772506713867} 11/07/2021 03:11:53 - INFO - __main__ - Step 42556: {'lr': 0.0004129463576963869, 'samples': 8170752, 'steps': 42555, 'loss/train': 1.503050446510315} 11/07/2021 03:11:54 - INFO - __main__ - Step 42557: {'lr': 0.000412942333006598, 'samples': 8170944, 'steps': 42556, 'loss/train': 1.6459203958511353} 11/07/2021 03:11:54 - INFO - __main__ - Step 42558: {'lr': 0.0004129383082433898, 'samples': 8171136, 'steps': 42557, 'loss/train': 1.4009777307510376} 11/07/2021 03:11:54 - INFO - __main__ - Step 42559: {'lr': 0.0004129342834067641, 'samples': 8171328, 'steps': 42558, 'loss/train': 1.706690788269043} 11/07/2021 03:11:55 - INFO - __main__ - Step 42560: {'lr': 0.0004129302584967227, 'samples': 8171520, 'steps': 42559, 'loss/train': 1.6396253108978271} 11/07/2021 03:11:55 - INFO - __main__ - Step 42561: {'lr': 0.0004129262335132675, 'samples': 8171712, 'steps': 42560, 'loss/train': 0.891944408416748} 11/07/2021 03:11:55 - INFO - __main__ - Step 42562: {'lr': 0.00041292220845640023, 'samples': 8171904, 'steps': 42561, 'loss/train': 1.710152506828308} 11/07/2021 03:11:56 - INFO - __main__ - Step 42563: {'lr': 0.00041291818332612275, 'samples': 8172096, 'steps': 42562, 'loss/train': 1.7914820909500122} 11/07/2021 03:11:57 - INFO - __main__ - Step 42564: {'lr': 0.00041291415812243676, 'samples': 8172288, 'steps': 42563, 'loss/train': 1.2920377254486084} 11/07/2021 03:11:57 - INFO - __main__ - Step 42565: {'lr': 0.0004129101328453442, 'samples': 8172480, 'steps': 42564, 'loss/train': 1.6780205965042114} 11/07/2021 03:11:57 - INFO - __main__ - Step 42566: {'lr': 0.0004129061074948469, 'samples': 8172672, 'steps': 42565, 'loss/train': 1.525704264640808} 11/07/2021 03:11:58 - INFO - __main__ - Step 42567: {'lr': 0.0004129020820709466, 'samples': 8172864, 'steps': 42566, 'loss/train': 1.5468403100967407} 11/07/2021 03:11:59 - INFO - __main__ - Step 42568: {'lr': 0.00041289805657364516, 'samples': 8173056, 'steps': 42567, 'loss/train': 1.4239075183868408} 11/07/2021 03:11:59 - INFO - __main__ - Step 42569: {'lr': 0.0004128940310029443, 'samples': 8173248, 'steps': 42568, 'loss/train': 1.1596369743347168} 11/07/2021 03:12:00 - INFO - __main__ - Step 42570: {'lr': 0.0004128900053588459, 'samples': 8173440, 'steps': 42569, 'loss/train': 1.5229151248931885} 11/07/2021 03:12:00 - INFO - __main__ - Step 42571: {'lr': 0.00041288597964135186, 'samples': 8173632, 'steps': 42570, 'loss/train': 1.7631556987762451} 11/07/2021 03:12:00 - INFO - __main__ - Step 42572: {'lr': 0.0004128819538504639, 'samples': 8173824, 'steps': 42571, 'loss/train': 1.4214872121810913} 11/07/2021 03:12:01 - INFO - __main__ - Step 42573: {'lr': 0.00041287792798618374, 'samples': 8174016, 'steps': 42572, 'loss/train': 0.9737385511398315} 11/07/2021 03:12:02 - INFO - __main__ - Step 42574: {'lr': 0.00041287390204851343, 'samples': 8174208, 'steps': 42573, 'loss/train': 1.2740235328674316} 11/07/2021 03:12:02 - INFO - __main__ - Step 42575: {'lr': 0.0004128698760374546, 'samples': 8174400, 'steps': 42574, 'loss/train': 1.4505579471588135} 11/07/2021 03:12:02 - INFO - __main__ - Step 42576: {'lr': 0.0004128658499530091, 'samples': 8174592, 'steps': 42575, 'loss/train': 1.3641626834869385} 11/07/2021 03:12:03 - INFO - __main__ - Step 42577: {'lr': 0.00041286182379517876, 'samples': 8174784, 'steps': 42576, 'loss/train': 1.3425848484039307} 11/07/2021 03:12:04 - INFO - __main__ - Step 42578: {'lr': 0.00041285779756396543, 'samples': 8174976, 'steps': 42577, 'loss/train': 1.4676402807235718} 11/07/2021 03:12:04 - INFO - __main__ - Step 42579: {'lr': 0.00041285377125937085, 'samples': 8175168, 'steps': 42578, 'loss/train': 1.6525946855545044} 11/07/2021 03:12:04 - INFO - __main__ - Step 42580: {'lr': 0.0004128497448813969, 'samples': 8175360, 'steps': 42579, 'loss/train': 1.211531400680542} 11/07/2021 03:12:05 - INFO - __main__ - Step 42581: {'lr': 0.0004128457184300454, 'samples': 8175552, 'steps': 42580, 'loss/train': 0.8959876894950867} 11/07/2021 03:12:05 - INFO - __main__ - Step 42582: {'lr': 0.0004128416919053181, 'samples': 8175744, 'steps': 42581, 'loss/train': 1.1853410005569458} 11/07/2021 03:12:06 - INFO - __main__ - Step 42583: {'lr': 0.0004128376653072168, 'samples': 8175936, 'steps': 42582, 'loss/train': 1.5495693683624268} 11/07/2021 03:12:07 - INFO - __main__ - Step 42584: {'lr': 0.0004128336386357434, 'samples': 8176128, 'steps': 42583, 'loss/train': 1.5811612606048584} 11/07/2021 03:12:07 - INFO - __main__ - Step 42585: {'lr': 0.0004128296118908997, 'samples': 8176320, 'steps': 42584, 'loss/train': 1.4197684526443481} 11/07/2021 03:12:07 - INFO - __main__ - Step 42586: {'lr': 0.0004128255850726874, 'samples': 8176512, 'steps': 42585, 'loss/train': 1.4086215496063232} 11/07/2021 03:12:08 - INFO - __main__ - Step 42587: {'lr': 0.0004128215581811085, 'samples': 8176704, 'steps': 42586, 'loss/train': 1.300828456878662} 11/07/2021 03:12:09 - INFO - __main__ - Step 42588: {'lr': 0.0004128175312161647, 'samples': 8176896, 'steps': 42587, 'loss/train': 0.3088771402835846} 11/07/2021 03:12:09 - INFO - __main__ - Step 42589: {'lr': 0.00041281350417785777, 'samples': 8177088, 'steps': 42588, 'loss/train': 1.5267343521118164} 11/07/2021 03:12:09 - INFO - __main__ - Step 42590: {'lr': 0.00041280947706618965, 'samples': 8177280, 'steps': 42589, 'loss/train': 1.41680109500885} 11/07/2021 03:12:10 - INFO - __main__ - Step 42591: {'lr': 0.0004128054498811621, 'samples': 8177472, 'steps': 42590, 'loss/train': 0.8744376301765442} 11/07/2021 03:12:10 - INFO - __main__ - Step 42592: {'lr': 0.0004128014226227769, 'samples': 8177664, 'steps': 42591, 'loss/train': 1.8009610176086426} 11/07/2021 03:12:11 - INFO - __main__ - Step 42593: {'lr': 0.00041279739529103586, 'samples': 8177856, 'steps': 42592, 'loss/train': 2.0615074634552} 11/07/2021 03:12:11 - INFO - __main__ - Step 42594: {'lr': 0.0004127933678859409, 'samples': 8178048, 'steps': 42593, 'loss/train': 1.7933173179626465} 11/07/2021 03:12:12 - INFO - __main__ - Step 42595: {'lr': 0.00041278934040749375, 'samples': 8178240, 'steps': 42594, 'loss/train': 1.6331143379211426} 11/07/2021 03:12:12 - INFO - __main__ - Step 42596: {'lr': 0.0004127853128556962, 'samples': 8178432, 'steps': 42595, 'loss/train': 1.4697948694229126} 11/07/2021 03:12:12 - INFO - __main__ - Step 42597: {'lr': 0.00041278128523055015, 'samples': 8178624, 'steps': 42596, 'loss/train': 1.5395692586898804} 11/07/2021 03:12:13 - INFO - __main__ - Step 42598: {'lr': 0.0004127772575320573, 'samples': 8178816, 'steps': 42597, 'loss/train': 1.289622187614441} 11/07/2021 03:12:15 - INFO - __main__ - Step 42599: {'lr': 0.0004127732297602196, 'samples': 8179008, 'steps': 42598, 'loss/train': 0.9867827296257019} 11/07/2021 03:12:15 - INFO - __main__ - Step 42600: {'lr': 0.0004127692019150387, 'samples': 8179200, 'steps': 42599, 'loss/train': 1.7771762609481812} 11/07/2021 03:12:15 - INFO - __main__ - Step 42601: {'lr': 0.00041276517399651657, 'samples': 8179392, 'steps': 42600, 'loss/train': 1.8227523565292358} 11/07/2021 03:12:16 - INFO - __main__ - Step 42602: {'lr': 0.00041276114600465497, 'samples': 8179584, 'steps': 42601, 'loss/train': 1.7748034000396729} 11/07/2021 03:12:16 - INFO - __main__ - Step 42603: {'lr': 0.0004127571179394557, 'samples': 8179776, 'steps': 42602, 'loss/train': 1.3474419116973877} 11/07/2021 03:12:16 - INFO - __main__ - Step 42604: {'lr': 0.0004127530898009205, 'samples': 8179968, 'steps': 42603, 'loss/train': 1.3940340280532837} 11/07/2021 03:12:17 - INFO - __main__ - Step 42605: {'lr': 0.00041274906158905137, 'samples': 8180160, 'steps': 42604, 'loss/train': 2.1176109313964844} 11/07/2021 03:12:18 - INFO - __main__ - Step 42606: {'lr': 0.00041274503330384997, 'samples': 8180352, 'steps': 42605, 'loss/train': 1.880507469177246} 11/07/2021 03:12:18 - INFO - __main__ - Step 42607: {'lr': 0.0004127410049453182, 'samples': 8180544, 'steps': 42606, 'loss/train': 1.521466612815857} 11/07/2021 03:12:18 - INFO - __main__ - Step 42608: {'lr': 0.00041273697651345785, 'samples': 8180736, 'steps': 42607, 'loss/train': 1.7024065256118774} 11/07/2021 03:12:19 - INFO - __main__ - Step 42609: {'lr': 0.00041273294800827075, 'samples': 8180928, 'steps': 42608, 'loss/train': 1.6238617897033691} 11/07/2021 03:12:20 - INFO - __main__ - Step 42610: {'lr': 0.00041272891942975863, 'samples': 8181120, 'steps': 42609, 'loss/train': 1.1740303039550781} 11/07/2021 03:12:20 - INFO - __main__ - Step 42611: {'lr': 0.00041272489077792343, 'samples': 8181312, 'steps': 42610, 'loss/train': 1.7574613094329834} 11/07/2021 03:12:21 - INFO - __main__ - Step 42612: {'lr': 0.0004127208620527669, 'samples': 8181504, 'steps': 42611, 'loss/train': 1.3476653099060059} 11/07/2021 03:12:21 - INFO - __main__ - Step 42613: {'lr': 0.00041271683325429075, 'samples': 8181696, 'steps': 42612, 'loss/train': 0.977882981300354} 11/07/2021 03:12:21 - INFO - __main__ - Step 42614: {'lr': 0.00041271280438249705, 'samples': 8181888, 'steps': 42613, 'loss/train': 1.575322151184082} 11/07/2021 03:12:22 - INFO - __main__ - Step 42615: {'lr': 0.00041270877543738744, 'samples': 8182080, 'steps': 42614, 'loss/train': 1.1944984197616577} 11/07/2021 03:12:23 - INFO - __main__ - Step 42616: {'lr': 0.0004127047464189637, 'samples': 8182272, 'steps': 42615, 'loss/train': 0.4800504446029663} 11/07/2021 03:12:23 - INFO - __main__ - Step 42617: {'lr': 0.0004127007173272278, 'samples': 8182464, 'steps': 42616, 'loss/train': 1.5213189125061035} 11/07/2021 03:12:23 - INFO - __main__ - Step 42618: {'lr': 0.0004126966881621814, 'samples': 8182656, 'steps': 42617, 'loss/train': 1.9201672077178955} 11/07/2021 03:12:24 - INFO - __main__ - Step 42619: {'lr': 0.0004126926589238264, 'samples': 8182848, 'steps': 42618, 'loss/train': 1.2446012496948242} 11/07/2021 03:12:24 - INFO - __main__ - Step 42620: {'lr': 0.00041268862961216457, 'samples': 8183040, 'steps': 42619, 'loss/train': 1.7546820640563965} 11/07/2021 03:12:25 - INFO - __main__ - Step 42621: {'lr': 0.00041268460022719783, 'samples': 8183232, 'steps': 42620, 'loss/train': 1.547313928604126} 11/07/2021 03:12:25 - INFO - __main__ - Step 42622: {'lr': 0.0004126805707689279, 'samples': 8183424, 'steps': 42621, 'loss/train': 1.7227954864501953} 11/07/2021 03:12:26 - INFO - __main__ - Step 42623: {'lr': 0.0004126765412373566, 'samples': 8183616, 'steps': 42622, 'loss/train': 2.0326387882232666} 11/07/2021 03:12:26 - INFO - __main__ - Step 42624: {'lr': 0.0004126725116324858, 'samples': 8183808, 'steps': 42623, 'loss/train': 1.348909854888916} 11/07/2021 03:12:26 - INFO - __main__ - Step 42625: {'lr': 0.00041266848195431715, 'samples': 8184000, 'steps': 42624, 'loss/train': 1.6020936965942383} 11/07/2021 03:12:27 - INFO - __main__ - Step 42626: {'lr': 0.00041266445220285267, 'samples': 8184192, 'steps': 42625, 'loss/train': 0.6952012181282043} 11/07/2021 03:12:28 - INFO - __main__ - Step 42627: {'lr': 0.0004126604223780941, 'samples': 8184384, 'steps': 42626, 'loss/train': 1.5037245750427246} 11/07/2021 03:12:28 - INFO - __main__ - Step 42628: {'lr': 0.00041265639248004327, 'samples': 8184576, 'steps': 42627, 'loss/train': 1.6028532981872559} 11/07/2021 03:12:29 - INFO - __main__ - Step 42629: {'lr': 0.000412652362508702, 'samples': 8184768, 'steps': 42628, 'loss/train': 0.7027224898338318} 11/07/2021 03:12:29 - INFO - __main__ - Step 42630: {'lr': 0.000412648332464072, 'samples': 8184960, 'steps': 42629, 'loss/train': 1.5392451286315918} 11/07/2021 03:12:30 - INFO - __main__ - Step 42631: {'lr': 0.00041264430234615526, 'samples': 8185152, 'steps': 42630, 'loss/train': 1.8497998714447021} 11/07/2021 03:12:30 - INFO - __main__ - Step 42632: {'lr': 0.0004126402721549535, 'samples': 8185344, 'steps': 42631, 'loss/train': 1.470371127128601} 11/07/2021 03:12:31 - INFO - __main__ - Step 42633: {'lr': 0.00041263624189046846, 'samples': 8185536, 'steps': 42632, 'loss/train': 1.7528550624847412} 11/07/2021 03:12:31 - INFO - __main__ - Step 42634: {'lr': 0.0004126322115527021, 'samples': 8185728, 'steps': 42633, 'loss/train': 1.517307996749878} 11/07/2021 03:12:31 - INFO - __main__ - Step 42635: {'lr': 0.00041262818114165615, 'samples': 8185920, 'steps': 42634, 'loss/train': 1.2638063430786133} 11/07/2021 03:12:32 - INFO - __main__ - Step 42636: {'lr': 0.0004126241506573325, 'samples': 8186112, 'steps': 42635, 'loss/train': 1.5525659322738647} 11/07/2021 03:12:33 - INFO - __main__ - Step 42637: {'lr': 0.00041262012009973283, 'samples': 8186304, 'steps': 42636, 'loss/train': 1.163089394569397} 11/07/2021 03:12:33 - INFO - __main__ - Step 42638: {'lr': 0.0004126160894688591, 'samples': 8186496, 'steps': 42637, 'loss/train': 1.1772278547286987} 11/07/2021 03:12:33 - INFO - __main__ - Step 42639: {'lr': 0.00041261205876471307, 'samples': 8186688, 'steps': 42638, 'loss/train': 1.3485753536224365} 11/07/2021 03:12:34 - INFO - __main__ - Step 42640: {'lr': 0.0004126080279872966, 'samples': 8186880, 'steps': 42639, 'loss/train': 1.5482761859893799} 11/07/2021 03:12:34 - INFO - __main__ - Step 42641: {'lr': 0.0004126039971366114, 'samples': 8187072, 'steps': 42640, 'loss/train': 1.2749180793762207} 11/07/2021 03:12:35 - INFO - __main__ - Step 42642: {'lr': 0.0004125999662126594, 'samples': 8187264, 'steps': 42641, 'loss/train': 1.9490081071853638} 11/07/2021 03:12:35 - INFO - __main__ - Step 42643: {'lr': 0.00041259593521544223, 'samples': 8187456, 'steps': 42642, 'loss/train': 1.4983640909194946} 11/07/2021 03:12:36 - INFO - __main__ - Step 42644: {'lr': 0.00041259190414496194, 'samples': 8187648, 'steps': 42643, 'loss/train': 1.4390331506729126} 11/07/2021 03:12:36 - INFO - __main__ - Step 42645: {'lr': 0.00041258787300122026, 'samples': 8187840, 'steps': 42644, 'loss/train': 1.4660903215408325} 11/07/2021 03:12:36 - INFO - __main__ - Step 42646: {'lr': 0.000412583841784219, 'samples': 8188032, 'steps': 42645, 'loss/train': 1.397388219833374} 11/07/2021 03:12:37 - INFO - __main__ - Step 42647: {'lr': 0.00041257981049395997, 'samples': 8188224, 'steps': 42646, 'loss/train': 1.726476788520813} 11/07/2021 03:12:38 - INFO - __main__ - Step 42648: {'lr': 0.000412575779130445, 'samples': 8188416, 'steps': 42647, 'loss/train': 1.4296152591705322} 11/07/2021 03:12:38 - INFO - __main__ - Step 42649: {'lr': 0.0004125717476936758, 'samples': 8188608, 'steps': 42648, 'loss/train': 0.9558641910552979} 11/07/2021 03:12:39 - INFO - __main__ - Step 42650: {'lr': 0.0004125677161836543, 'samples': 8188800, 'steps': 42649, 'loss/train': 1.4704724550247192} 11/07/2021 03:12:39 - INFO - __main__ - Step 42651: {'lr': 0.00041256368460038237, 'samples': 8188992, 'steps': 42650, 'loss/train': 1.8188363313674927} 11/07/2021 03:12:40 - INFO - __main__ - Step 42652: {'lr': 0.00041255965294386174, 'samples': 8189184, 'steps': 42651, 'loss/train': 1.3506090641021729} 11/07/2021 03:12:40 - INFO - __main__ - Step 42653: {'lr': 0.00041255562121409416, 'samples': 8189376, 'steps': 42652, 'loss/train': 0.8035365343093872} 11/07/2021 03:12:41 - INFO - __main__ - Step 42654: {'lr': 0.0004125515894110816, 'samples': 8189568, 'steps': 42653, 'loss/train': 1.6913963556289673} 11/07/2021 03:12:41 - INFO - __main__ - Step 42655: {'lr': 0.00041254755753482574, 'samples': 8189760, 'steps': 42654, 'loss/train': 1.364007830619812} 11/07/2021 03:12:41 - INFO - __main__ - Step 42656: {'lr': 0.00041254352558532854, 'samples': 8189952, 'steps': 42655, 'loss/train': 1.3132096529006958} 11/07/2021 03:12:42 - INFO - __main__ - Step 42657: {'lr': 0.0004125394935625917, 'samples': 8190144, 'steps': 42656, 'loss/train': 0.9017446041107178} 11/07/2021 03:12:43 - INFO - __main__ - Step 42658: {'lr': 0.00041253546146661704, 'samples': 8190336, 'steps': 42657, 'loss/train': 1.883015513420105} 11/07/2021 03:12:43 - INFO - __main__ - Step 42659: {'lr': 0.00041253142929740643, 'samples': 8190528, 'steps': 42658, 'loss/train': 1.9374349117279053} 11/07/2021 03:12:43 - INFO - __main__ - Step 42660: {'lr': 0.00041252739705496165, 'samples': 8190720, 'steps': 42659, 'loss/train': 1.5369181632995605} 11/07/2021 03:12:44 - INFO - __main__ - Step 42661: {'lr': 0.00041252336473928455, 'samples': 8190912, 'steps': 42660, 'loss/train': 1.5200303792953491} 11/07/2021 03:12:45 - INFO - __main__ - Step 42662: {'lr': 0.00041251933235037695, 'samples': 8191104, 'steps': 42661, 'loss/train': 1.5997662544250488} 11/07/2021 03:12:45 - INFO - __main__ - Step 42663: {'lr': 0.00041251529988824067, 'samples': 8191296, 'steps': 42662, 'loss/train': 1.662327766418457} 11/07/2021 03:12:46 - INFO - __main__ - Step 42664: {'lr': 0.0004125112673528775, 'samples': 8191488, 'steps': 42663, 'loss/train': 1.5672930479049683} 11/07/2021 03:12:46 - INFO - __main__ - Step 42665: {'lr': 0.0004125072347442892, 'samples': 8191680, 'steps': 42664, 'loss/train': 1.6102161407470703} 11/07/2021 03:12:46 - INFO - __main__ - Step 42666: {'lr': 0.0004125032020624776, 'samples': 8191872, 'steps': 42665, 'loss/train': 1.5029171705245972} 11/07/2021 03:12:47 - INFO - __main__ - Step 42667: {'lr': 0.0004124991693074447, 'samples': 8192064, 'steps': 42666, 'loss/train': 1.9619196653366089} 11/07/2021 03:12:48 - INFO - __main__ - Step 42668: {'lr': 0.00041249513647919207, 'samples': 8192256, 'steps': 42667, 'loss/train': 2.0680155754089355} 11/07/2021 03:12:48 - INFO - __main__ - Step 42669: {'lr': 0.00041249110357772167, 'samples': 8192448, 'steps': 42668, 'loss/train': 1.3066715002059937} 11/07/2021 03:12:48 - INFO - __main__ - Step 42670: {'lr': 0.00041248707060303536, 'samples': 8192640, 'steps': 42669, 'loss/train': 1.6666626930236816} 11/07/2021 03:12:49 - INFO - __main__ - Step 42671: {'lr': 0.00041248303755513484, 'samples': 8192832, 'steps': 42670, 'loss/train': 1.66847562789917} 11/07/2021 03:12:49 - INFO - __main__ - Step 42672: {'lr': 0.00041247900443402194, 'samples': 8193024, 'steps': 42671, 'loss/train': 1.0965551137924194} 11/07/2021 03:12:50 - INFO - __main__ - Step 42673: {'lr': 0.00041247497123969844, 'samples': 8193216, 'steps': 42672, 'loss/train': 1.3833045959472656} 11/07/2021 03:12:50 - INFO - __main__ - Step 42674: {'lr': 0.00041247093797216637, 'samples': 8193408, 'steps': 42673, 'loss/train': 1.230155348777771} 11/07/2021 03:12:51 - INFO - __main__ - Step 42675: {'lr': 0.00041246690463142733, 'samples': 8193600, 'steps': 42674, 'loss/train': 0.7003434896469116} 11/07/2021 03:12:51 - INFO - __main__ - Step 42676: {'lr': 0.0004124628712174833, 'samples': 8193792, 'steps': 42675, 'loss/train': 1.5358424186706543} 11/07/2021 03:12:51 - INFO - __main__ - Step 42677: {'lr': 0.0004124588377303359, 'samples': 8193984, 'steps': 42676, 'loss/train': 1.5736559629440308} 11/07/2021 03:12:52 - INFO - __main__ - Step 42678: {'lr': 0.00041245480416998704, 'samples': 8194176, 'steps': 42677, 'loss/train': 1.054282307624817} 11/07/2021 03:12:53 - INFO - __main__ - Step 42679: {'lr': 0.00041245077053643866, 'samples': 8194368, 'steps': 42678, 'loss/train': 1.107468605041504} 11/07/2021 03:12:53 - INFO - __main__ - Step 42680: {'lr': 0.0004124467368296924, 'samples': 8194560, 'steps': 42679, 'loss/train': 1.5272530317306519} 11/07/2021 03:12:54 - INFO - __main__ - Step 42681: {'lr': 0.00041244270304975004, 'samples': 8194752, 'steps': 42680, 'loss/train': 0.5860555768013} 11/07/2021 03:12:54 - INFO - __main__ - Step 42682: {'lr': 0.0004124386691966137, 'samples': 8194944, 'steps': 42681, 'loss/train': 1.4744832515716553} 11/07/2021 03:12:55 - INFO - __main__ - Step 42683: {'lr': 0.00041243463527028493, 'samples': 8195136, 'steps': 42682, 'loss/train': 1.290229320526123} 11/07/2021 03:12:55 - INFO - __main__ - Step 42684: {'lr': 0.0004124306012707656, 'samples': 8195328, 'steps': 42683, 'loss/train': 1.3682605028152466} 11/07/2021 03:12:56 - INFO - __main__ - Step 42685: {'lr': 0.00041242656719805754, 'samples': 8195520, 'steps': 42684, 'loss/train': 1.3809823989868164} 11/07/2021 03:12:56 - INFO - __main__ - Step 42686: {'lr': 0.0004124225330521626, 'samples': 8195712, 'steps': 42685, 'loss/train': 1.5804864168167114} 11/07/2021 03:12:56 - INFO - __main__ - Step 42687: {'lr': 0.0004124184988330826, 'samples': 8195904, 'steps': 42686, 'loss/train': 0.9367903470993042} 11/07/2021 03:12:57 - INFO - __main__ - Step 42688: {'lr': 0.0004124144645408192, 'samples': 8196096, 'steps': 42687, 'loss/train': 1.4681004285812378} 11/07/2021 03:12:58 - INFO - __main__ - Step 42689: {'lr': 0.0004124104301753745, 'samples': 8196288, 'steps': 42688, 'loss/train': 1.3483312129974365} 11/07/2021 03:12:58 - INFO - __main__ - Step 42690: {'lr': 0.0004124063957367501, 'samples': 8196480, 'steps': 42689, 'loss/train': 1.2217729091644287} 11/07/2021 03:12:58 - INFO - __main__ - Step 42691: {'lr': 0.0004124023612249479, 'samples': 8196672, 'steps': 42690, 'loss/train': 1.4882365465164185} 11/07/2021 03:12:59 - INFO - __main__ - Step 42692: {'lr': 0.0004123983266399697, 'samples': 8196864, 'steps': 42691, 'loss/train': 1.5725913047790527} 11/07/2021 03:13:00 - INFO - __main__ - Step 42693: {'lr': 0.0004123942919818173, 'samples': 8197056, 'steps': 42692, 'loss/train': 1.2359098196029663} 11/07/2021 03:13:00 - INFO - __main__ - Step 42694: {'lr': 0.00041239025725049256, 'samples': 8197248, 'steps': 42693, 'loss/train': 1.1791036128997803} 11/07/2021 03:13:01 - INFO - __main__ - Step 42695: {'lr': 0.0004123862224459973, 'samples': 8197440, 'steps': 42694, 'loss/train': 1.843918800354004} 11/07/2021 03:13:01 - INFO - __main__ - Step 42696: {'lr': 0.0004123821875683333, 'samples': 8197632, 'steps': 42695, 'loss/train': 1.4438482522964478} 11/07/2021 03:13:01 - INFO - __main__ - Step 42697: {'lr': 0.0004123781526175023, 'samples': 8197824, 'steps': 42696, 'loss/train': 1.5149061679840088} 11/07/2021 03:13:02 - INFO - __main__ - Step 42698: {'lr': 0.0004123741175935063, 'samples': 8198016, 'steps': 42697, 'loss/train': 1.6861848831176758} 11/07/2021 03:13:03 - INFO - __main__ - Step 42699: {'lr': 0.000412370082496347, 'samples': 8198208, 'steps': 42698, 'loss/train': 1.707339882850647} 11/07/2021 03:13:03 - INFO - __main__ - Step 42700: {'lr': 0.0004123660473260263, 'samples': 8198400, 'steps': 42699, 'loss/train': 0.6675590872764587} 11/07/2021 03:13:03 - INFO - __main__ - Step 42701: {'lr': 0.0004123620120825459, 'samples': 8198592, 'steps': 42700, 'loss/train': 1.6229008436203003} 11/07/2021 03:13:04 - INFO - __main__ - Step 42702: {'lr': 0.00041235797676590776, 'samples': 8198784, 'steps': 42701, 'loss/train': 1.2133471965789795} 11/07/2021 03:13:04 - INFO - __main__ - Step 42703: {'lr': 0.0004123539413761136, 'samples': 8198976, 'steps': 42702, 'loss/train': 0.5439752340316772} 11/07/2021 03:13:05 - INFO - __main__ - Step 42704: {'lr': 0.0004123499059131652, 'samples': 8199168, 'steps': 42703, 'loss/train': 1.476960301399231} 11/07/2021 03:13:06 - INFO - __main__ - Step 42705: {'lr': 0.00041234587037706447, 'samples': 8199360, 'steps': 42704, 'loss/train': 1.2985483407974243} 11/07/2021 03:13:06 - INFO - __main__ - Step 42706: {'lr': 0.0004123418347678132, 'samples': 8199552, 'steps': 42705, 'loss/train': 1.567990779876709} 11/07/2021 03:13:06 - INFO - __main__ - Step 42707: {'lr': 0.00041233779908541316, 'samples': 8199744, 'steps': 42706, 'loss/train': 1.5503088235855103} 11/07/2021 03:13:07 - INFO - __main__ - Step 42708: {'lr': 0.0004123337633298662, 'samples': 8199936, 'steps': 42707, 'loss/train': 1.386027455329895} 11/07/2021 03:13:08 - INFO - __main__ - Step 42709: {'lr': 0.0004123297275011743, 'samples': 8200128, 'steps': 42708, 'loss/train': 1.4431533813476562} 11/07/2021 03:13:08 - INFO - __main__ - Step 42710: {'lr': 0.00041232569159933895, 'samples': 8200320, 'steps': 42709, 'loss/train': 1.3087667226791382} 11/07/2021 03:13:08 - INFO - __main__ - Step 42711: {'lr': 0.00041232165562436225, 'samples': 8200512, 'steps': 42710, 'loss/train': 1.0742384195327759} 11/07/2021 03:13:09 - INFO - __main__ - Step 42712: {'lr': 0.00041231761957624593, 'samples': 8200704, 'steps': 42711, 'loss/train': 1.6528187990188599} 11/07/2021 03:13:09 - INFO - __main__ - Step 42713: {'lr': 0.0004123135834549917, 'samples': 8200896, 'steps': 42712, 'loss/train': 1.6122461557388306} 11/07/2021 03:13:10 - INFO - __main__ - Step 42714: {'lr': 0.00041230954726060155, 'samples': 8201088, 'steps': 42713, 'loss/train': 1.4251964092254639} 11/07/2021 03:13:11 - INFO - __main__ - Step 42715: {'lr': 0.00041230551099307724, 'samples': 8201280, 'steps': 42714, 'loss/train': 1.4186371564865112} 11/07/2021 03:13:11 - INFO - __main__ - Step 42716: {'lr': 0.0004123014746524205, 'samples': 8201472, 'steps': 42715, 'loss/train': 1.5294647216796875} 11/07/2021 03:13:11 - INFO - __main__ - Step 42717: {'lr': 0.0004122974382386333, 'samples': 8201664, 'steps': 42716, 'loss/train': 1.4531598091125488} 11/07/2021 03:13:12 - INFO - __main__ - Step 42718: {'lr': 0.00041229340175171733, 'samples': 8201856, 'steps': 42717, 'loss/train': 1.565722942352295} 11/07/2021 03:13:13 - INFO - __main__ - Step 42719: {'lr': 0.00041228936519167446, 'samples': 8202048, 'steps': 42718, 'loss/train': 1.530666708946228} 11/07/2021 03:13:13 - INFO - __main__ - Step 42720: {'lr': 0.00041228532855850655, 'samples': 8202240, 'steps': 42719, 'loss/train': 1.1392285823822021} 11/07/2021 03:13:13 - INFO - __main__ - Step 42721: {'lr': 0.0004122812918522153, 'samples': 8202432, 'steps': 42720, 'loss/train': 0.7674376368522644} 11/07/2021 03:13:14 - INFO - __main__ - Step 42722: {'lr': 0.0004122772550728027, 'samples': 8202624, 'steps': 42721, 'loss/train': 1.3797781467437744} 11/07/2021 03:13:14 - INFO - __main__ - Step 42723: {'lr': 0.0004122732182202703, 'samples': 8202816, 'steps': 42722, 'loss/train': 1.799514651298523} 11/07/2021 03:13:15 - INFO - __main__ - Step 42724: {'lr': 0.0004122691812946202, 'samples': 8203008, 'steps': 42723, 'loss/train': 1.4491488933563232} 11/07/2021 03:13:16 - INFO - __main__ - Step 42725: {'lr': 0.00041226514429585417, 'samples': 8203200, 'steps': 42724, 'loss/train': 1.598483920097351} 11/07/2021 03:13:16 - INFO - __main__ - Step 42726: {'lr': 0.0004122611072239739, 'samples': 8203392, 'steps': 42725, 'loss/train': 1.3325248956680298} 11/07/2021 03:13:17 - INFO - __main__ - Step 42727: {'lr': 0.00041225707007898127, 'samples': 8203584, 'steps': 42726, 'loss/train': 1.7285935878753662} 11/07/2021 03:13:17 - INFO - __main__ - Step 42728: {'lr': 0.0004122530328608781, 'samples': 8203776, 'steps': 42727, 'loss/train': 1.6887344121932983} 11/07/2021 03:13:17 - INFO - __main__ - Step 42729: {'lr': 0.00041224899556966635, 'samples': 8203968, 'steps': 42728, 'loss/train': 1.861165165901184} 11/07/2021 03:13:18 - INFO - __main__ - Step 42730: {'lr': 0.00041224495820534757, 'samples': 8204160, 'steps': 42729, 'loss/train': 0.25296056270599365} 11/07/2021 03:13:19 - INFO - __main__ - Step 42731: {'lr': 0.00041224092076792374, 'samples': 8204352, 'steps': 42730, 'loss/train': 1.3531886339187622} 11/07/2021 03:13:19 - INFO - __main__ - Step 42732: {'lr': 0.0004122368832573967, 'samples': 8204544, 'steps': 42731, 'loss/train': 1.4255421161651611} 11/07/2021 03:13:19 - INFO - __main__ - Step 42733: {'lr': 0.00041223284567376816, 'samples': 8204736, 'steps': 42732, 'loss/train': 1.5796672105789185} 11/07/2021 03:13:20 - INFO - __main__ - Step 42734: {'lr': 0.00041222880801704005, 'samples': 8204928, 'steps': 42733, 'loss/train': 1.8132939338684082} 11/07/2021 03:13:21 - INFO - __main__ - Step 42735: {'lr': 0.0004122247702872141, 'samples': 8205120, 'steps': 42734, 'loss/train': 2.2760121822357178} 11/07/2021 03:13:21 - INFO - __main__ - Step 42736: {'lr': 0.0004122207324842923, 'samples': 8205312, 'steps': 42735, 'loss/train': 1.2232327461242676} 11/07/2021 03:13:22 - INFO - __main__ - Step 42737: {'lr': 0.00041221669460827614, 'samples': 8205504, 'steps': 42736, 'loss/train': 1.5809918642044067} 11/07/2021 03:13:22 - INFO - __main__ - Step 42738: {'lr': 0.00041221265665916776, 'samples': 8205696, 'steps': 42737, 'loss/train': 1.5916699171066284} 11/07/2021 03:13:22 - INFO - __main__ - Step 42739: {'lr': 0.00041220861863696886, 'samples': 8205888, 'steps': 42738, 'loss/train': 1.396811604499817} 11/07/2021 03:13:23 - INFO - __main__ - Step 42740: {'lr': 0.0004122045805416812, 'samples': 8206080, 'steps': 42739, 'loss/train': 1.6565934419631958} 11/07/2021 03:13:24 - INFO - __main__ - Step 42741: {'lr': 0.00041220054237330674, 'samples': 8206272, 'steps': 42740, 'loss/train': 1.755387783050537} 11/07/2021 03:13:24 - INFO - __main__ - Step 42742: {'lr': 0.00041219650413184714, 'samples': 8206464, 'steps': 42741, 'loss/train': 1.520103096961975} 11/07/2021 03:13:24 - INFO - __main__ - Step 42743: {'lr': 0.00041219246581730435, 'samples': 8206656, 'steps': 42742, 'loss/train': 1.721859097480774} 11/07/2021 03:13:25 - INFO - __main__ - Step 42744: {'lr': 0.0004121884274296801, 'samples': 8206848, 'steps': 42743, 'loss/train': 1.856993556022644} 11/07/2021 03:13:26 - INFO - __main__ - Step 42745: {'lr': 0.00041218438896897623, 'samples': 8207040, 'steps': 42744, 'loss/train': 1.4189162254333496} 11/07/2021 03:13:26 - INFO - __main__ - Step 42746: {'lr': 0.00041218035043519464, 'samples': 8207232, 'steps': 42745, 'loss/train': 1.5349032878875732} 11/07/2021 03:13:27 - INFO - __main__ - Step 42747: {'lr': 0.00041217631182833707, 'samples': 8207424, 'steps': 42746, 'loss/train': 1.169258713722229} 11/07/2021 03:13:27 - INFO - __main__ - Step 42748: {'lr': 0.00041217227314840535, 'samples': 8207616, 'steps': 42747, 'loss/train': 2.701878547668457} 11/07/2021 03:13:27 - INFO - __main__ - Step 42749: {'lr': 0.00041216823439540134, 'samples': 8207808, 'steps': 42748, 'loss/train': 1.5812207460403442} 11/07/2021 03:13:28 - INFO - __main__ - Step 42750: {'lr': 0.0004121641955693268, 'samples': 8208000, 'steps': 42749, 'loss/train': 1.693583369255066} 11/07/2021 03:13:29 - INFO - __main__ - Step 42751: {'lr': 0.00041216015667018357, 'samples': 8208192, 'steps': 42750, 'loss/train': 1.6943845748901367} 11/07/2021 03:13:29 - INFO - __main__ - Step 42752: {'lr': 0.00041215611769797344, 'samples': 8208384, 'steps': 42751, 'loss/train': 1.8799508810043335} 11/07/2021 03:13:29 - INFO - __main__ - Step 42753: {'lr': 0.00041215207865269833, 'samples': 8208576, 'steps': 42752, 'loss/train': 2.027010917663574} 11/07/2021 03:13:30 - INFO - __main__ - Step 42754: {'lr': 0.00041214803953435993, 'samples': 8208768, 'steps': 42753, 'loss/train': 1.3380992412567139} 11/07/2021 03:13:30 - INFO - __main__ - Step 42755: {'lr': 0.0004121440003429602, 'samples': 8208960, 'steps': 42754, 'loss/train': 1.271234393119812} 11/07/2021 03:13:31 - INFO - __main__ - Step 42756: {'lr': 0.0004121399610785008, 'samples': 8209152, 'steps': 42755, 'loss/train': 1.6511046886444092} 11/07/2021 03:13:32 - INFO - __main__ - Step 42757: {'lr': 0.00041213592174098367, 'samples': 8209344, 'steps': 42756, 'loss/train': 1.5800201892852783} 11/07/2021 03:13:32 - INFO - __main__ - Step 42758: {'lr': 0.00041213188233041065, 'samples': 8209536, 'steps': 42757, 'loss/train': 1.890745759010315} 11/07/2021 03:13:32 - INFO - __main__ - Step 42759: {'lr': 0.00041212784284678345, 'samples': 8209728, 'steps': 42758, 'loss/train': 1.4945995807647705} 11/07/2021 03:13:33 - INFO - __main__ - Step 42760: {'lr': 0.0004121238032901039, 'samples': 8209920, 'steps': 42759, 'loss/train': 1.3198996782302856} 11/07/2021 03:13:34 - INFO - __main__ - Step 42761: {'lr': 0.00041211976366037394, 'samples': 8210112, 'steps': 42760, 'loss/train': 0.8576481342315674} 11/07/2021 03:13:34 - INFO - __main__ - Step 42762: {'lr': 0.0004121157239575953, 'samples': 8210304, 'steps': 42761, 'loss/train': 1.662672758102417} 11/07/2021 03:13:35 - INFO - __main__ - Step 42763: {'lr': 0.0004121116841817699, 'samples': 8210496, 'steps': 42762, 'loss/train': 0.672544002532959} 11/07/2021 03:13:35 - INFO - __main__ - Step 42764: {'lr': 0.00041210764433289936, 'samples': 8210688, 'steps': 42763, 'loss/train': 1.4414068460464478} 11/07/2021 03:13:35 - INFO - __main__ - Step 42765: {'lr': 0.0004121036044109856, 'samples': 8210880, 'steps': 42764, 'loss/train': 1.4708746671676636} 11/07/2021 03:13:36 - INFO - __main__ - Step 42766: {'lr': 0.00041209956441603054, 'samples': 8211072, 'steps': 42765, 'loss/train': 1.7228068113327026} 11/07/2021 03:13:37 - INFO - __main__ - Step 42767: {'lr': 0.0004120955243480359, 'samples': 8211264, 'steps': 42766, 'loss/train': 1.7478610277175903} 11/07/2021 03:13:37 - INFO - __main__ - Step 42768: {'lr': 0.0004120914842070035, 'samples': 8211456, 'steps': 42767, 'loss/train': 1.1757432222366333} 11/07/2021 03:13:37 - INFO - __main__ - Step 42769: {'lr': 0.0004120874439929352, 'samples': 8211648, 'steps': 42768, 'loss/train': 1.5053163766860962} 11/07/2021 03:13:38 - INFO - __main__ - Step 42770: {'lr': 0.00041208340370583275, 'samples': 8211840, 'steps': 42769, 'loss/train': 0.9787638783454895} 11/07/2021 03:13:39 - INFO - __main__ - Step 42771: {'lr': 0.0004120793633456981, 'samples': 8212032, 'steps': 42770, 'loss/train': 1.8949719667434692} 11/07/2021 03:13:39 - INFO - __main__ - Step 42772: {'lr': 0.0004120753229125329, 'samples': 8212224, 'steps': 42771, 'loss/train': 1.3817518949508667} 11/07/2021 03:13:40 - INFO - __main__ - Step 42773: {'lr': 0.00041207128240633906, 'samples': 8212416, 'steps': 42772, 'loss/train': 1.1997261047363281} 11/07/2021 03:13:40 - INFO - __main__ - Step 42774: {'lr': 0.0004120672418271184, 'samples': 8212608, 'steps': 42773, 'loss/train': 1.54781174659729} 11/07/2021 03:13:40 - INFO - __main__ - Step 42775: {'lr': 0.0004120632011748728, 'samples': 8212800, 'steps': 42774, 'loss/train': 1.4456589221954346} 11/07/2021 03:13:41 - INFO - __main__ - Step 42776: {'lr': 0.00041205916044960406, 'samples': 8212992, 'steps': 42775, 'loss/train': 3.485616683959961} 11/07/2021 03:13:42 - INFO - __main__ - Step 42777: {'lr': 0.0004120551196513139, 'samples': 8213184, 'steps': 42776, 'loss/train': 1.132140040397644} 11/07/2021 03:13:42 - INFO - __main__ - Step 42778: {'lr': 0.0004120510787800042, 'samples': 8213376, 'steps': 42777, 'loss/train': 1.948042869567871} 11/07/2021 03:13:42 - INFO - __main__ - Step 42779: {'lr': 0.0004120470378356768, 'samples': 8213568, 'steps': 42778, 'loss/train': 1.4851789474487305} 11/07/2021 03:13:43 - INFO - __main__ - Step 42780: {'lr': 0.00041204299681833344, 'samples': 8213760, 'steps': 42779, 'loss/train': 1.4981358051300049} 11/07/2021 03:13:43 - INFO - __main__ - Step 42781: {'lr': 0.00041203895572797613, 'samples': 8213952, 'steps': 42780, 'loss/train': 1.876665472984314} 11/07/2021 03:13:44 - INFO - __main__ - Step 42782: {'lr': 0.00041203491456460653, 'samples': 8214144, 'steps': 42781, 'loss/train': 1.4150429964065552} 11/07/2021 03:13:44 - INFO - __main__ - Step 42783: {'lr': 0.00041203087332822644, 'samples': 8214336, 'steps': 42782, 'loss/train': 1.4986529350280762} 11/07/2021 03:13:45 - INFO - __main__ - Step 42784: {'lr': 0.0004120268320188378, 'samples': 8214528, 'steps': 42783, 'loss/train': 1.3611279726028442} 11/07/2021 03:13:45 - INFO - __main__ - Step 42785: {'lr': 0.00041202279063644234, 'samples': 8214720, 'steps': 42784, 'loss/train': 1.2365634441375732} 11/07/2021 03:13:45 - INFO - __main__ - Step 42786: {'lr': 0.00041201874918104185, 'samples': 8214912, 'steps': 42785, 'loss/train': 1.448770523071289} 11/07/2021 03:13:47 - INFO - __main__ - Step 42787: {'lr': 0.0004120147076526383, 'samples': 8215104, 'steps': 42786, 'loss/train': 1.3730212450027466} 11/07/2021 03:13:47 - INFO - __main__ - Step 42788: {'lr': 0.0004120106660512334, 'samples': 8215296, 'steps': 42787, 'loss/train': 1.5505905151367188} 11/07/2021 03:13:48 - INFO - __main__ - Step 42789: {'lr': 0.000412006624376829, 'samples': 8215488, 'steps': 42788, 'loss/train': 1.574457049369812} 11/07/2021 03:13:48 - INFO - __main__ - Step 42790: {'lr': 0.0004120025826294269, 'samples': 8215680, 'steps': 42789, 'loss/train': 1.6592400074005127} 11/07/2021 03:13:48 - INFO - __main__ - Step 42791: {'lr': 0.00041199854080902897, 'samples': 8215872, 'steps': 42790, 'loss/train': 1.792243242263794} 11/07/2021 03:13:49 - INFO - __main__ - Step 42792: {'lr': 0.00041199449891563694, 'samples': 8216064, 'steps': 42791, 'loss/train': 1.2733299732208252} 11/07/2021 03:13:50 - INFO - __main__ - Step 42793: {'lr': 0.00041199045694925273, 'samples': 8216256, 'steps': 42792, 'loss/train': 1.6514403820037842} 11/07/2021 03:13:50 - INFO - __main__ - Step 42794: {'lr': 0.0004119864149098781, 'samples': 8216448, 'steps': 42793, 'loss/train': 2.1267051696777344} 11/07/2021 03:13:50 - INFO - __main__ - Step 42795: {'lr': 0.0004119823727975149, 'samples': 8216640, 'steps': 42794, 'loss/train': 1.120583176612854} 11/07/2021 03:13:51 - INFO - __main__ - Step 42796: {'lr': 0.00041197833061216494, 'samples': 8216832, 'steps': 42795, 'loss/train': 1.3337584733963013} 11/07/2021 03:13:51 - INFO - __main__ - Step 42797: {'lr': 0.00041197428835383, 'samples': 8217024, 'steps': 42796, 'loss/train': 0.7551344037055969} 11/07/2021 03:13:52 - INFO - __main__ - Step 42798: {'lr': 0.00041197024602251204, 'samples': 8217216, 'steps': 42797, 'loss/train': 1.4640302658081055} 11/07/2021 03:13:53 - INFO - __main__ - Step 42799: {'lr': 0.0004119662036182127, 'samples': 8217408, 'steps': 42798, 'loss/train': 1.2800933122634888} 11/07/2021 03:13:53 - INFO - __main__ - Step 42800: {'lr': 0.00041196216114093397, 'samples': 8217600, 'steps': 42799, 'loss/train': 1.4961788654327393} 11/07/2021 03:13:53 - INFO - __main__ - Step 42801: {'lr': 0.00041195811859067756, 'samples': 8217792, 'steps': 42800, 'loss/train': 1.5047647953033447} 11/07/2021 03:13:54 - INFO - __main__ - Step 42802: {'lr': 0.0004119540759674453, 'samples': 8217984, 'steps': 42801, 'loss/train': 1.2064974308013916} 11/07/2021 03:13:54 - INFO - __main__ - Step 42803: {'lr': 0.000411950033271239, 'samples': 8218176, 'steps': 42802, 'loss/train': 1.220013976097107} 11/07/2021 03:13:55 - INFO - __main__ - Step 42804: {'lr': 0.0004119459905020606, 'samples': 8218368, 'steps': 42803, 'loss/train': 1.2036381959915161} 11/07/2021 03:13:55 - INFO - __main__ - Step 42805: {'lr': 0.0004119419476599118, 'samples': 8218560, 'steps': 42804, 'loss/train': 1.4537402391433716} 11/07/2021 03:13:56 - INFO - __main__ - Step 42806: {'lr': 0.0004119379047447944, 'samples': 8218752, 'steps': 42805, 'loss/train': 1.662406086921692} 11/07/2021 03:13:56 - INFO - __main__ - Step 42807: {'lr': 0.00041193386175671033, 'samples': 8218944, 'steps': 42806, 'loss/train': 1.4320244789123535} 11/07/2021 03:13:56 - INFO - __main__ - Step 42808: {'lr': 0.0004119298186956613, 'samples': 8219136, 'steps': 42807, 'loss/train': 1.7451808452606201} 11/07/2021 03:13:57 - INFO - __main__ - Step 42809: {'lr': 0.00041192577556164924, 'samples': 8219328, 'steps': 42808, 'loss/train': 3.0425539016723633} 11/07/2021 03:13:58 - INFO - __main__ - Step 42810: {'lr': 0.000411921732354676, 'samples': 8219520, 'steps': 42809, 'loss/train': 1.4344040155410767} 11/07/2021 03:13:58 - INFO - __main__ - Step 42811: {'lr': 0.00041191768907474326, 'samples': 8219712, 'steps': 42810, 'loss/train': 1.871738314628601} 11/07/2021 03:13:58 - INFO - __main__ - Step 42812: {'lr': 0.00041191364572185286, 'samples': 8219904, 'steps': 42811, 'loss/train': 1.4439513683319092} 11/07/2021 03:13:59 - INFO - __main__ - Step 42813: {'lr': 0.0004119096022960067, 'samples': 8220096, 'steps': 42812, 'loss/train': 1.6381014585494995} 11/07/2021 03:14:00 - INFO - __main__ - Step 42814: {'lr': 0.0004119055587972066, 'samples': 8220288, 'steps': 42813, 'loss/train': 1.3307337760925293} 11/07/2021 03:14:00 - INFO - __main__ - Step 42815: {'lr': 0.0004119015152254543, 'samples': 8220480, 'steps': 42814, 'loss/train': 1.7919366359710693} 11/07/2021 03:14:01 - INFO - __main__ - Step 42816: {'lr': 0.00041189747158075176, 'samples': 8220672, 'steps': 42815, 'loss/train': 1.4417839050292969} 11/07/2021 03:14:01 - INFO - __main__ - Step 42817: {'lr': 0.00041189342786310067, 'samples': 8220864, 'steps': 42816, 'loss/train': 1.1310538053512573} 11/07/2021 03:14:01 - INFO - __main__ - Step 42818: {'lr': 0.0004118893840725029, 'samples': 8221056, 'steps': 42817, 'loss/train': 1.6472176313400269} 11/07/2021 03:14:02 - INFO - __main__ - Step 42819: {'lr': 0.0004118853402089603, 'samples': 8221248, 'steps': 42818, 'loss/train': 0.6757338643074036} 11/07/2021 03:14:03 - INFO - __main__ - Step 42820: {'lr': 0.0004118812962724746, 'samples': 8221440, 'steps': 42819, 'loss/train': 1.600584626197815} 11/07/2021 03:14:03 - INFO - __main__ - Step 42821: {'lr': 0.00041187725226304775, 'samples': 8221632, 'steps': 42820, 'loss/train': 1.9334253072738647} 11/07/2021 03:14:04 - INFO - __main__ - Step 42822: {'lr': 0.0004118732081806814, 'samples': 8221824, 'steps': 42821, 'loss/train': 1.7592384815216064} 11/07/2021 03:14:04 - INFO - __main__ - Step 42823: {'lr': 0.0004118691640253777, 'samples': 8222016, 'steps': 42822, 'loss/train': 0.5113934874534607} 11/07/2021 03:14:04 - INFO - __main__ - Step 42824: {'lr': 0.00041186511979713806, 'samples': 8222208, 'steps': 42823, 'loss/train': 1.31758451461792} 11/07/2021 03:14:05 - INFO - __main__ - Step 42825: {'lr': 0.00041186107549596453, 'samples': 8222400, 'steps': 42824, 'loss/train': 1.43822181224823} 11/07/2021 03:14:06 - INFO - __main__ - Step 42826: {'lr': 0.0004118570311218589, 'samples': 8222592, 'steps': 42825, 'loss/train': 1.5050877332687378} 11/07/2021 03:14:06 - INFO - __main__ - Step 42827: {'lr': 0.00041185298667482294, 'samples': 8222784, 'steps': 42826, 'loss/train': 1.505362868309021} 11/07/2021 03:14:06 - INFO - __main__ - Step 42828: {'lr': 0.0004118489421548586, 'samples': 8222976, 'steps': 42827, 'loss/train': 0.5131959915161133} 11/07/2021 03:14:07 - INFO - __main__ - Step 42829: {'lr': 0.00041184489756196764, 'samples': 8223168, 'steps': 42828, 'loss/train': 1.1229454278945923} 11/07/2021 03:14:07 - INFO - __main__ - Step 42830: {'lr': 0.0004118408528961519, 'samples': 8223360, 'steps': 42829, 'loss/train': 1.5232681035995483} 11/07/2021 03:14:08 - INFO - __main__ - Step 42831: {'lr': 0.00041183680815741307, 'samples': 8223552, 'steps': 42830, 'loss/train': 1.2058498859405518} 11/07/2021 03:14:08 - INFO - __main__ - Step 42832: {'lr': 0.0004118327633457531, 'samples': 8223744, 'steps': 42831, 'loss/train': 0.5994836688041687} 11/07/2021 03:14:09 - INFO - __main__ - Step 42833: {'lr': 0.00041182871846117373, 'samples': 8223936, 'steps': 42832, 'loss/train': 1.479069471359253} 11/07/2021 03:14:09 - INFO - __main__ - Step 42834: {'lr': 0.0004118246735036769, 'samples': 8224128, 'steps': 42833, 'loss/train': 1.2316757440567017} 11/07/2021 03:14:09 - INFO - __main__ - Step 42835: {'lr': 0.0004118206284732644, 'samples': 8224320, 'steps': 42834, 'loss/train': 1.3874062299728394} 11/07/2021 03:14:10 - INFO - __main__ - Step 42836: {'lr': 0.000411816583369938, 'samples': 8224512, 'steps': 42835, 'loss/train': 1.6726195812225342} 11/07/2021 03:14:11 - INFO - __main__ - Step 42837: {'lr': 0.0004118125381936996, 'samples': 8224704, 'steps': 42836, 'loss/train': 1.5835198163986206} 11/07/2021 03:14:11 - INFO - __main__ - Step 42838: {'lr': 0.0004118084929445508, 'samples': 8224896, 'steps': 42837, 'loss/train': 1.3595654964447021} 11/07/2021 03:14:11 - INFO - __main__ - Step 42839: {'lr': 0.0004118044476224937, 'samples': 8225088, 'steps': 42838, 'loss/train': 1.2652711868286133} 11/07/2021 03:14:12 - INFO - __main__ - Step 42840: {'lr': 0.00041180040222753, 'samples': 8225280, 'steps': 42839, 'loss/train': 1.5912714004516602} 11/07/2021 03:14:13 - INFO - __main__ - Step 42841: {'lr': 0.00041179635675966155, 'samples': 8225472, 'steps': 42840, 'loss/train': 1.1543596982955933} 11/07/2021 03:14:13 - INFO - __main__ - Step 42842: {'lr': 0.00041179231121889014, 'samples': 8225664, 'steps': 42841, 'loss/train': 0.6163930892944336} 11/07/2021 03:14:14 - INFO - __main__ - Step 42843: {'lr': 0.0004117882656052176, 'samples': 8225856, 'steps': 42842, 'loss/train': 1.5155036449432373} 11/07/2021 03:14:14 - INFO - __main__ - Step 42844: {'lr': 0.0004117842199186458, 'samples': 8226048, 'steps': 42843, 'loss/train': 1.2637860774993896} 11/07/2021 03:14:14 - INFO - __main__ - Step 42845: {'lr': 0.00041178017415917655, 'samples': 8226240, 'steps': 42844, 'loss/train': 1.3483153581619263} 11/07/2021 03:14:15 - INFO - __main__ - Step 42846: {'lr': 0.00041177612832681156, 'samples': 8226432, 'steps': 42845, 'loss/train': 1.0798571109771729} 11/07/2021 03:14:16 - INFO - __main__ - Step 42847: {'lr': 0.00041177208242155285, 'samples': 8226624, 'steps': 42846, 'loss/train': 1.6547551155090332} 11/07/2021 03:14:16 - INFO - __main__ - Step 42848: {'lr': 0.000411768036443402, 'samples': 8226816, 'steps': 42847, 'loss/train': 2.1480069160461426} 11/07/2021 03:14:17 - INFO - __main__ - Step 42849: {'lr': 0.0004117639903923611, 'samples': 8227008, 'steps': 42848, 'loss/train': 0.9353717565536499} 11/07/2021 03:14:17 - INFO - __main__ - Step 42850: {'lr': 0.00041175994426843177, 'samples': 8227200, 'steps': 42849, 'loss/train': 1.4175337553024292} 11/07/2021 03:14:18 - INFO - __main__ - Step 42851: {'lr': 0.00041175589807161597, 'samples': 8227392, 'steps': 42850, 'loss/train': 1.5138330459594727} 11/07/2021 03:14:18 - INFO - __main__ - Step 42852: {'lr': 0.0004117518518019154, 'samples': 8227584, 'steps': 42851, 'loss/train': 1.3619396686553955} 11/07/2021 03:14:19 - INFO - __main__ - Step 42853: {'lr': 0.00041174780545933195, 'samples': 8227776, 'steps': 42852, 'loss/train': 1.7273114919662476} 11/07/2021 03:14:19 - INFO - __main__ - Step 42854: {'lr': 0.0004117437590438674, 'samples': 8227968, 'steps': 42853, 'loss/train': 1.7855308055877686} 11/07/2021 03:14:19 - INFO - __main__ - Step 42855: {'lr': 0.0004117397125555237, 'samples': 8228160, 'steps': 42854, 'loss/train': 1.7407039403915405} 11/07/2021 03:14:21 - INFO - __main__ - Step 42856: {'lr': 0.00041173566599430245, 'samples': 8228352, 'steps': 42855, 'loss/train': 1.6007099151611328} 11/07/2021 03:14:22 - INFO - __main__ - Step 42857: {'lr': 0.00041173161936020573, 'samples': 8228544, 'steps': 42856, 'loss/train': 1.495436668395996} 11/07/2021 03:14:22 - INFO - __main__ - Step 42858: {'lr': 0.0004117275726532352, 'samples': 8228736, 'steps': 42857, 'loss/train': 1.219700574874878} 11/07/2021 03:14:22 - INFO - __main__ - Step 42859: {'lr': 0.0004117235258733927, 'samples': 8228928, 'steps': 42858, 'loss/train': 1.6199307441711426} 11/07/2021 03:14:23 - INFO - __main__ - Step 42860: {'lr': 0.00041171947902068006, 'samples': 8229120, 'steps': 42859, 'loss/train': 1.9934972524642944} 11/07/2021 03:14:23 - INFO - __main__ - Step 42861: {'lr': 0.00041171543209509923, 'samples': 8229312, 'steps': 42860, 'loss/train': 1.7413346767425537} 11/07/2021 03:14:23 - INFO - __main__ - Step 42862: {'lr': 0.0004117113850966517, 'samples': 8229504, 'steps': 42861, 'loss/train': 1.840273141860962} 11/07/2021 03:14:24 - INFO - __main__ - Step 42863: {'lr': 0.00041170733802533974, 'samples': 8229696, 'steps': 42862, 'loss/train': 1.7821216583251953} 11/07/2021 03:14:25 - INFO - __main__ - Step 42864: {'lr': 0.0004117032908811649, 'samples': 8229888, 'steps': 42863, 'loss/train': 1.765580177307129} 11/07/2021 03:14:25 - INFO - __main__ - Step 42865: {'lr': 0.000411699243664129, 'samples': 8230080, 'steps': 42864, 'loss/train': 0.8765882253646851} 11/07/2021 03:14:26 - INFO - __main__ - Step 42866: {'lr': 0.00041169519637423394, 'samples': 8230272, 'steps': 42865, 'loss/train': 1.6690140962600708} 11/07/2021 03:14:26 - INFO - __main__ - Step 42867: {'lr': 0.0004116911490114815, 'samples': 8230464, 'steps': 42866, 'loss/train': 0.9538524150848389} 11/07/2021 03:14:26 - INFO - __main__ - Step 42868: {'lr': 0.0004116871015758735, 'samples': 8230656, 'steps': 42867, 'loss/train': 1.6665130853652954} 11/07/2021 03:14:27 - INFO - __main__ - Step 42869: {'lr': 0.0004116830540674118, 'samples': 8230848, 'steps': 42868, 'loss/train': 1.5690141916275024} 11/07/2021 03:14:28 - INFO - __main__ - Step 42870: {'lr': 0.00041167900648609825, 'samples': 8231040, 'steps': 42869, 'loss/train': 1.5212374925613403} 11/07/2021 03:14:28 - INFO - __main__ - Step 42871: {'lr': 0.00041167495883193464, 'samples': 8231232, 'steps': 42870, 'loss/train': 1.4287992715835571} 11/07/2021 03:14:28 - INFO - __main__ - Step 42872: {'lr': 0.00041167091110492273, 'samples': 8231424, 'steps': 42871, 'loss/train': 1.6299023628234863} 11/07/2021 03:14:29 - INFO - __main__ - Step 42873: {'lr': 0.0004116668633050644, 'samples': 8231616, 'steps': 42872, 'loss/train': 1.3518669605255127} 11/07/2021 03:14:29 - INFO - __main__ - Step 42874: {'lr': 0.0004116628154323616, 'samples': 8231808, 'steps': 42873, 'loss/train': 1.5255669355392456} 11/07/2021 03:14:30 - INFO - __main__ - Step 42875: {'lr': 0.0004116587674868159, 'samples': 8232000, 'steps': 42874, 'loss/train': 1.3380085229873657} 11/07/2021 03:14:31 - INFO - __main__ - Step 42876: {'lr': 0.00041165471946842924, 'samples': 8232192, 'steps': 42875, 'loss/train': 1.9206820726394653} 11/07/2021 03:14:31 - INFO - __main__ - Step 42877: {'lr': 0.00041165067137720356, 'samples': 8232384, 'steps': 42876, 'loss/train': 1.344838261604309} 11/07/2021 03:14:31 - INFO - __main__ - Step 42878: {'lr': 0.00041164662321314054, 'samples': 8232576, 'steps': 42877, 'loss/train': 1.2000056505203247} 11/07/2021 03:14:32 - INFO - __main__ - Step 42879: {'lr': 0.000411642574976242, 'samples': 8232768, 'steps': 42878, 'loss/train': 1.7511587142944336} 11/07/2021 03:14:33 - INFO - __main__ - Step 42880: {'lr': 0.0004116385266665099, 'samples': 8232960, 'steps': 42879, 'loss/train': 0.9567857384681702} 11/07/2021 03:14:33 - INFO - __main__ - Step 42881: {'lr': 0.0004116344782839459, 'samples': 8233152, 'steps': 42880, 'loss/train': 1.3815840482711792} 11/07/2021 03:14:33 - INFO - __main__ - Step 42882: {'lr': 0.00041163042982855194, 'samples': 8233344, 'steps': 42881, 'loss/train': 1.416580080986023} 11/07/2021 03:14:34 - INFO - __main__ - Step 42883: {'lr': 0.00041162638130032975, 'samples': 8233536, 'steps': 42882, 'loss/train': 1.3377119302749634} 11/07/2021 03:14:34 - INFO - __main__ - Step 42884: {'lr': 0.00041162233269928126, 'samples': 8233728, 'steps': 42883, 'loss/train': 1.66068434715271} 11/07/2021 03:14:35 - INFO - __main__ - Step 42885: {'lr': 0.0004116182840254082, 'samples': 8233920, 'steps': 42884, 'loss/train': 1.219361662864685} 11/07/2021 03:14:35 - INFO - __main__ - Step 42886: {'lr': 0.0004116142352787125, 'samples': 8234112, 'steps': 42885, 'loss/train': 1.7923866510391235} 11/07/2021 03:14:36 - INFO - __main__ - Step 42887: {'lr': 0.00041161018645919593, 'samples': 8234304, 'steps': 42886, 'loss/train': 1.2530755996704102} 11/07/2021 03:14:36 - INFO - __main__ - Step 42888: {'lr': 0.00041160613756686015, 'samples': 8234496, 'steps': 42887, 'loss/train': 1.4656062126159668} 11/07/2021 03:14:37 - INFO - __main__ - Step 42889: {'lr': 0.00041160208860170725, 'samples': 8234688, 'steps': 42888, 'loss/train': 1.395034670829773} 11/07/2021 03:14:38 - INFO - __main__ - Step 42890: {'lr': 0.000411598039563739, 'samples': 8234880, 'steps': 42889, 'loss/train': 1.0422735214233398} 11/07/2021 03:14:38 - INFO - __main__ - Step 42891: {'lr': 0.0004115939904529571, 'samples': 8235072, 'steps': 42890, 'loss/train': 1.6529215574264526} 11/07/2021 03:14:38 - INFO - __main__ - Step 42892: {'lr': 0.00041158994126936347, 'samples': 8235264, 'steps': 42891, 'loss/train': 1.6952149868011475} 11/07/2021 03:14:39 - INFO - __main__ - Step 42893: {'lr': 0.0004115858920129598, 'samples': 8235456, 'steps': 42892, 'loss/train': 1.9112871885299683} 11/07/2021 03:14:39 - INFO - __main__ - Step 42894: {'lr': 0.0004115818426837481, 'samples': 8235648, 'steps': 42893, 'loss/train': 1.6131938695907593} 11/07/2021 03:14:39 - INFO - __main__ - Step 42895: {'lr': 0.0004115777932817301, 'samples': 8235840, 'steps': 42894, 'loss/train': 1.5090101957321167} 11/07/2021 03:14:41 - INFO - __main__ - Step 42896: {'lr': 0.00041157374380690765, 'samples': 8236032, 'steps': 42895, 'loss/train': 4.94158935546875} 11/07/2021 03:14:41 - INFO - __main__ - Step 42897: {'lr': 0.0004115696942592826, 'samples': 8236224, 'steps': 42896, 'loss/train': 1.6551597118377686} 11/07/2021 03:14:41 - INFO - __main__ - Step 42898: {'lr': 0.0004115656446388567, 'samples': 8236416, 'steps': 42897, 'loss/train': 1.551294207572937} 11/07/2021 03:14:42 - INFO - __main__ - Step 42899: {'lr': 0.00041156159494563183, 'samples': 8236608, 'steps': 42898, 'loss/train': 1.627911925315857} 11/07/2021 03:14:42 - INFO - __main__ - Step 42900: {'lr': 0.00041155754517960974, 'samples': 8236800, 'steps': 42899, 'loss/train': 1.7276084423065186} 11/07/2021 03:14:42 - INFO - __main__ - Step 42901: {'lr': 0.00041155349534079236, 'samples': 8236992, 'steps': 42900, 'loss/train': 1.9185916185379028} 11/07/2021 03:14:43 - INFO - __main__ - Step 42902: {'lr': 0.0004115494454291815, 'samples': 8237184, 'steps': 42901, 'loss/train': 1.4276373386383057} 11/07/2021 03:14:44 - INFO - __main__ - Step 42903: {'lr': 0.0004115453954447789, 'samples': 8237376, 'steps': 42902, 'loss/train': 1.8472546339035034} 11/07/2021 03:14:44 - INFO - __main__ - Step 42904: {'lr': 0.0004115413453875865, 'samples': 8237568, 'steps': 42903, 'loss/train': 0.8522806167602539} 11/07/2021 03:14:44 - INFO - __main__ - Step 42905: {'lr': 0.000411537295257606, 'samples': 8237760, 'steps': 42904, 'loss/train': 1.500504493713379} 11/07/2021 03:14:45 - INFO - __main__ - Step 42906: {'lr': 0.00041153324505483933, 'samples': 8237952, 'steps': 42905, 'loss/train': 1.5426833629608154} 11/07/2021 03:14:46 - INFO - __main__ - Step 42907: {'lr': 0.0004115291947792882, 'samples': 8238144, 'steps': 42906, 'loss/train': 1.536191463470459} 11/07/2021 03:14:46 - INFO - __main__ - Step 42908: {'lr': 0.00041152514443095454, 'samples': 8238336, 'steps': 42907, 'loss/train': 1.0617425441741943} 11/07/2021 03:14:46 - INFO - __main__ - Step 42909: {'lr': 0.00041152109400984015, 'samples': 8238528, 'steps': 42908, 'loss/train': 2.1334660053253174} 11/07/2021 03:14:47 - INFO - __main__ - Step 42910: {'lr': 0.0004115170435159469, 'samples': 8238720, 'steps': 42909, 'loss/train': 1.8392415046691895} 11/07/2021 03:14:47 - INFO - __main__ - Step 42911: {'lr': 0.00041151299294927657, 'samples': 8238912, 'steps': 42910, 'loss/train': 1.5095146894454956} 11/07/2021 03:14:48 - INFO - __main__ - Step 42912: {'lr': 0.0004115089423098309, 'samples': 8239104, 'steps': 42911, 'loss/train': 1.5403552055358887} 11/07/2021 03:14:49 - INFO - __main__ - Step 42913: {'lr': 0.00041150489159761186, 'samples': 8239296, 'steps': 42912, 'loss/train': 1.5937893390655518} 11/07/2021 03:14:49 - INFO - __main__ - Step 42914: {'lr': 0.00041150084081262105, 'samples': 8239488, 'steps': 42913, 'loss/train': 1.7277464866638184} 11/07/2021 03:14:49 - INFO - __main__ - Step 42915: {'lr': 0.0004114967899548606, 'samples': 8239680, 'steps': 42914, 'loss/train': 1.2665598392486572} 11/07/2021 03:14:50 - INFO - __main__ - Step 42916: {'lr': 0.0004114927390243322, 'samples': 8239872, 'steps': 42915, 'loss/train': 1.7183034420013428} 11/07/2021 03:14:51 - INFO - __main__ - Step 42917: {'lr': 0.00041148868802103766, 'samples': 8240064, 'steps': 42916, 'loss/train': 1.6050708293914795} 11/07/2021 03:14:51 - INFO - __main__ - Step 42918: {'lr': 0.00041148463694497874, 'samples': 8240256, 'steps': 42917, 'loss/train': 1.6935396194458008} 11/07/2021 03:14:51 - INFO - __main__ - Step 42919: {'lr': 0.00041148058579615733, 'samples': 8240448, 'steps': 42918, 'loss/train': 1.258515477180481} 11/07/2021 03:14:52 - INFO - __main__ - Step 42920: {'lr': 0.00041147653457457534, 'samples': 8240640, 'steps': 42919, 'loss/train': 1.4335256814956665} 11/07/2021 03:14:52 - INFO - __main__ - Step 42921: {'lr': 0.0004114724832802345, 'samples': 8240832, 'steps': 42920, 'loss/train': 1.331539511680603} 11/07/2021 03:14:52 - INFO - __main__ - Step 42922: {'lr': 0.0004114684319131366, 'samples': 8241024, 'steps': 42921, 'loss/train': 0.9531698822975159} 11/07/2021 03:14:53 - INFO - __main__ - Step 42923: {'lr': 0.00041146438047328347, 'samples': 8241216, 'steps': 42922, 'loss/train': 1.325085163116455} 11/07/2021 03:14:54 - INFO - __main__ - Step 42924: {'lr': 0.0004114603289606771, 'samples': 8241408, 'steps': 42923, 'loss/train': 2.0017459392547607} 11/07/2021 03:14:54 - INFO - __main__ - Step 42925: {'lr': 0.00041145627737531915, 'samples': 8241600, 'steps': 42924, 'loss/train': 1.332912802696228} 11/07/2021 03:14:55 - INFO - __main__ - Step 42926: {'lr': 0.0004114522257172115, 'samples': 8241792, 'steps': 42925, 'loss/train': 1.446260929107666} 11/07/2021 03:14:55 - INFO - __main__ - Step 42927: {'lr': 0.000411448173986356, 'samples': 8241984, 'steps': 42926, 'loss/train': 1.5102177858352661} 11/07/2021 03:14:56 - INFO - __main__ - Step 42928: {'lr': 0.0004114441221827544, 'samples': 8242176, 'steps': 42927, 'loss/train': 1.4338157176971436} 11/07/2021 03:14:56 - INFO - __main__ - Step 42929: {'lr': 0.0004114400703064085, 'samples': 8242368, 'steps': 42928, 'loss/train': 1.3309646844863892} 11/07/2021 03:14:57 - INFO - __main__ - Step 42930: {'lr': 0.0004114360183573203, 'samples': 8242560, 'steps': 42929, 'loss/train': 1.402104139328003} 11/07/2021 03:14:57 - INFO - __main__ - Step 42931: {'lr': 0.0004114319663354915, 'samples': 8242752, 'steps': 42930, 'loss/train': 1.8404631614685059} 11/07/2021 03:14:57 - INFO - __main__ - Step 42932: {'lr': 0.000411427914240924, 'samples': 8242944, 'steps': 42931, 'loss/train': 1.820628046989441} 11/07/2021 03:14:58 - INFO - __main__ - Step 42933: {'lr': 0.0004114238620736195, 'samples': 8243136, 'steps': 42932, 'loss/train': 1.276533603668213} 11/07/2021 03:14:59 - INFO - __main__ - Step 42934: {'lr': 0.00041141980983357986, 'samples': 8243328, 'steps': 42933, 'loss/train': 1.1286767721176147} 11/07/2021 03:14:59 - INFO - __main__ - Step 42935: {'lr': 0.000411415757520807, 'samples': 8243520, 'steps': 42934, 'loss/train': 1.6131216287612915} 11/07/2021 03:14:59 - INFO - __main__ - Step 42936: {'lr': 0.00041141170513530267, 'samples': 8243712, 'steps': 42935, 'loss/train': 1.4220731258392334} 11/07/2021 03:15:00 - INFO - __main__ - Step 42937: {'lr': 0.0004114076526770688, 'samples': 8243904, 'steps': 42936, 'loss/train': 1.399277687072754} 11/07/2021 03:15:01 - INFO - __main__ - Step 42938: {'lr': 0.000411403600146107, 'samples': 8244096, 'steps': 42937, 'loss/train': 2.115095376968384} 11/07/2021 03:15:01 - INFO - __main__ - Step 42939: {'lr': 0.0004113995475424193, 'samples': 8244288, 'steps': 42938, 'loss/train': 1.4036604166030884} 11/07/2021 03:15:02 - INFO - __main__ - Step 42940: {'lr': 0.0004113954948660075, 'samples': 8244480, 'steps': 42939, 'loss/train': 1.0579941272735596} 11/07/2021 03:15:02 - INFO - __main__ - Step 42941: {'lr': 0.00041139144211687327, 'samples': 8244672, 'steps': 42940, 'loss/train': 1.5197367668151855} 11/07/2021 03:15:02 - INFO - __main__ - Step 42942: {'lr': 0.0004113873892950186, 'samples': 8244864, 'steps': 42941, 'loss/train': 1.0686310529708862} 11/07/2021 03:15:03 - INFO - __main__ - Step 42943: {'lr': 0.00041138333640044523, 'samples': 8245056, 'steps': 42942, 'loss/train': 1.4850374460220337} 11/07/2021 03:15:04 - INFO - __main__ - Step 42944: {'lr': 0.0004113792834331551, 'samples': 8245248, 'steps': 42943, 'loss/train': 1.482663869857788} 11/07/2021 03:15:04 - INFO - __main__ - Step 42945: {'lr': 0.00041137523039314994, 'samples': 8245440, 'steps': 42944, 'loss/train': 1.2159746885299683} 11/07/2021 03:15:05 - INFO - __main__ - Step 42946: {'lr': 0.0004113711772804315, 'samples': 8245632, 'steps': 42945, 'loss/train': 1.1424510478973389} 11/07/2021 03:15:05 - INFO - __main__ - Step 42947: {'lr': 0.0004113671240950018, 'samples': 8245824, 'steps': 42946, 'loss/train': 1.3112131357192993} 11/07/2021 03:15:06 - INFO - __main__ - Step 42948: {'lr': 0.0004113630708368625, 'samples': 8246016, 'steps': 42947, 'loss/train': 1.9154735803604126} 11/07/2021 03:15:06 - INFO - __main__ - Step 42949: {'lr': 0.0004113590175060155, 'samples': 8246208, 'steps': 42948, 'loss/train': 1.2409237623214722} 11/07/2021 03:15:07 - INFO - __main__ - Step 42950: {'lr': 0.00041135496410246264, 'samples': 8246400, 'steps': 42949, 'loss/train': 1.1103994846343994} 11/07/2021 03:15:07 - INFO - __main__ - Step 42951: {'lr': 0.0004113509106262058, 'samples': 8246592, 'steps': 42950, 'loss/train': 1.343586802482605} 11/07/2021 03:15:07 - INFO - __main__ - Step 42952: {'lr': 0.00041134685707724656, 'samples': 8246784, 'steps': 42951, 'loss/train': 1.314520239830017} 11/07/2021 03:15:08 - INFO - __main__ - Step 42953: {'lr': 0.000411342803455587, 'samples': 8246976, 'steps': 42952, 'loss/train': 1.8426026105880737} 11/07/2021 03:15:09 - INFO - __main__ - Step 42954: {'lr': 0.0004113387497612289, 'samples': 8247168, 'steps': 42953, 'loss/train': 1.7123295068740845} 11/07/2021 03:15:09 - INFO - __main__ - Step 42955: {'lr': 0.00041133469599417393, 'samples': 8247360, 'steps': 42954, 'loss/train': 1.0952670574188232} 11/07/2021 03:15:09 - INFO - __main__ - Step 42956: {'lr': 0.00041133064215442415, 'samples': 8247552, 'steps': 42955, 'loss/train': 2.494164228439331} 11/07/2021 03:15:10 - INFO - __main__ - Step 42957: {'lr': 0.0004113265882419812, 'samples': 8247744, 'steps': 42956, 'loss/train': 1.4035422801971436} 11/07/2021 03:15:10 - INFO - __main__ - Step 42958: {'lr': 0.0004113225342568471, 'samples': 8247936, 'steps': 42957, 'loss/train': 1.3970733880996704} 11/07/2021 03:15:11 - INFO - __main__ - Step 42959: {'lr': 0.00041131848019902343, 'samples': 8248128, 'steps': 42958, 'loss/train': 1.5247231721878052} 11/07/2021 03:15:11 - INFO - __main__ - Step 42960: {'lr': 0.0004113144260685122, 'samples': 8248320, 'steps': 42959, 'loss/train': 1.2599166631698608} 11/07/2021 03:15:12 - INFO - __main__ - Step 42961: {'lr': 0.00041131037186531514, 'samples': 8248512, 'steps': 42960, 'loss/train': 1.5106650590896606} 11/07/2021 03:15:12 - INFO - __main__ - Step 42962: {'lr': 0.00041130631758943414, 'samples': 8248704, 'steps': 42961, 'loss/train': 1.5878769159317017} 11/07/2021 03:15:13 - INFO - __main__ - Step 42963: {'lr': 0.00041130226324087094, 'samples': 8248896, 'steps': 42962, 'loss/train': 1.6092218160629272} 11/07/2021 03:15:14 - INFO - __main__ - Step 42964: {'lr': 0.00041129820881962754, 'samples': 8249088, 'steps': 42963, 'loss/train': 1.5334964990615845} 11/07/2021 03:15:14 - INFO - __main__ - Step 42965: {'lr': 0.0004112941543257056, 'samples': 8249280, 'steps': 42964, 'loss/train': 1.873719573020935} 11/07/2021 03:15:14 - INFO - __main__ - Step 42966: {'lr': 0.00041129009975910704, 'samples': 8249472, 'steps': 42965, 'loss/train': 1.4746450185775757} 11/07/2021 03:15:15 - INFO - __main__ - Step 42967: {'lr': 0.00041128604511983356, 'samples': 8249664, 'steps': 42966, 'loss/train': 2.0043632984161377} 11/07/2021 03:15:15 - INFO - __main__ - Step 42968: {'lr': 0.00041128199040788715, 'samples': 8249856, 'steps': 42967, 'loss/train': 1.3230992555618286} 11/07/2021 03:15:16 - INFO - __main__ - Step 42969: {'lr': 0.00041127793562326955, 'samples': 8250048, 'steps': 42968, 'loss/train': 1.7417832612991333} 11/07/2021 03:15:16 - INFO - __main__ - Step 42970: {'lr': 0.0004112738807659826, 'samples': 8250240, 'steps': 42969, 'loss/train': 1.4938559532165527} 11/07/2021 03:15:17 - INFO - __main__ - Step 42971: {'lr': 0.00041126982583602817, 'samples': 8250432, 'steps': 42970, 'loss/train': 1.7032666206359863} 11/07/2021 03:15:17 - INFO - __main__ - Step 42972: {'lr': 0.00041126577083340797, 'samples': 8250624, 'steps': 42971, 'loss/train': 1.1354724168777466} 11/07/2021 03:15:17 - INFO - __main__ - Step 42973: {'lr': 0.000411261715758124, 'samples': 8250816, 'steps': 42972, 'loss/train': 1.4441967010498047} 11/07/2021 03:15:18 - INFO - __main__ - Step 42974: {'lr': 0.0004112576606101779, 'samples': 8251008, 'steps': 42973, 'loss/train': 1.3982219696044922} 11/07/2021 03:15:19 - INFO - __main__ - Step 42975: {'lr': 0.0004112536053895716, 'samples': 8251200, 'steps': 42974, 'loss/train': 1.5185602903366089} 11/07/2021 03:15:19 - INFO - __main__ - Step 42976: {'lr': 0.0004112495500963069, 'samples': 8251392, 'steps': 42975, 'loss/train': 1.5261902809143066} 11/07/2021 03:15:19 - INFO - __main__ - Step 42977: {'lr': 0.00041124549473038564, 'samples': 8251584, 'steps': 42976, 'loss/train': 1.7653063535690308} 11/07/2021 03:15:20 - INFO - __main__ - Step 42978: {'lr': 0.0004112414392918097, 'samples': 8251776, 'steps': 42977, 'loss/train': 1.3035284280776978} 11/07/2021 03:15:20 - INFO - __main__ - Step 42979: {'lr': 0.00041123738378058083, 'samples': 8251968, 'steps': 42978, 'loss/train': 1.7096556425094604} 11/07/2021 03:15:21 - INFO - __main__ - Step 42980: {'lr': 0.0004112333281967009, 'samples': 8252160, 'steps': 42979, 'loss/train': 1.40980863571167} 11/07/2021 03:15:22 - INFO - __main__ - Step 42981: {'lr': 0.00041122927254017173, 'samples': 8252352, 'steps': 42980, 'loss/train': 1.3556780815124512} 11/07/2021 03:15:22 - INFO - __main__ - Step 42982: {'lr': 0.0004112252168109951, 'samples': 8252544, 'steps': 42981, 'loss/train': 1.8229740858078003} 11/07/2021 03:15:22 - INFO - __main__ - Step 42983: {'lr': 0.0004112211610091728, 'samples': 8252736, 'steps': 42982, 'loss/train': 2.1034083366394043} 11/07/2021 03:15:23 - INFO - __main__ - Step 42984: {'lr': 0.0004112171051347069, 'samples': 8252928, 'steps': 42983, 'loss/train': 1.8783169984817505} 11/07/2021 03:15:24 - INFO - __main__ - Step 42985: {'lr': 0.00041121304918759893, 'samples': 8253120, 'steps': 42984, 'loss/train': 1.5309916734695435} 11/07/2021 03:15:24 - INFO - __main__ - Step 42986: {'lr': 0.00041120899316785095, 'samples': 8253312, 'steps': 42985, 'loss/train': 1.4424855709075928} 11/07/2021 03:15:24 - INFO - __main__ - Step 42987: {'lr': 0.00041120493707546456, 'samples': 8253504, 'steps': 42986, 'loss/train': 1.3083442449569702} 11/07/2021 03:15:25 - INFO - __main__ - Step 42988: {'lr': 0.00041120088091044183, 'samples': 8253696, 'steps': 42987, 'loss/train': 1.4893476963043213} 11/07/2021 03:15:25 - INFO - __main__ - Step 42989: {'lr': 0.0004111968246727844, 'samples': 8253888, 'steps': 42988, 'loss/train': 1.3382315635681152} 11/07/2021 03:15:26 - INFO - __main__ - Step 42990: {'lr': 0.0004111927683624942, 'samples': 8254080, 'steps': 42989, 'loss/train': 0.8450368046760559} 11/07/2021 03:15:27 - INFO - __main__ - Step 42991: {'lr': 0.00041118871197957306, 'samples': 8254272, 'steps': 42990, 'loss/train': 0.8592576384544373} 11/07/2021 03:15:27 - INFO - __main__ - Step 42992: {'lr': 0.00041118465552402274, 'samples': 8254464, 'steps': 42991, 'loss/train': 2.1071560382843018} 11/07/2021 03:15:27 - INFO - __main__ - Step 42993: {'lr': 0.00041118059899584503, 'samples': 8254656, 'steps': 42992, 'loss/train': 1.7796658277511597} 11/07/2021 03:15:28 - INFO - __main__ - Step 42994: {'lr': 0.00041117654239504193, 'samples': 8254848, 'steps': 42993, 'loss/train': 1.23753023147583} 11/07/2021 03:15:29 - INFO - __main__ - Step 42995: {'lr': 0.0004111724857216151, 'samples': 8255040, 'steps': 42994, 'loss/train': 1.5601799488067627} 11/07/2021 03:15:29 - INFO - __main__ - Step 42996: {'lr': 0.0004111684289755665, 'samples': 8255232, 'steps': 42995, 'loss/train': 1.4935163259506226} 11/07/2021 03:15:29 - INFO - __main__ - Step 42997: {'lr': 0.00041116437215689785, 'samples': 8255424, 'steps': 42996, 'loss/train': 1.6728167533874512} 11/07/2021 03:15:30 - INFO - __main__ - Step 42998: {'lr': 0.000411160315265611, 'samples': 8255616, 'steps': 42997, 'loss/train': 1.5341050624847412} 11/07/2021 03:15:30 - INFO - __main__ - Step 42999: {'lr': 0.0004111562583017079, 'samples': 8255808, 'steps': 42998, 'loss/train': 1.1012388467788696} 11/07/2021 03:15:31 - INFO - __main__ - Step 43000: {'lr': 0.00041115220126519014, 'samples': 8256000, 'steps': 42999, 'loss/train': 1.7845770120620728} 11/07/2021 03:15:31 - INFO - __main__ - Step 43001: {'lr': 0.00041114814415605977, 'samples': 8256192, 'steps': 43000, 'loss/train': 0.8801016807556152} 11/07/2021 03:15:32 - INFO - __main__ - Step 43002: {'lr': 0.0004111440869743185, 'samples': 8256384, 'steps': 43001, 'loss/train': 1.5452907085418701} 11/07/2021 03:15:32 - INFO - __main__ - Step 43003: {'lr': 0.00041114002971996824, 'samples': 8256576, 'steps': 43002, 'loss/train': 0.9452287554740906} 11/07/2021 03:15:33 - INFO - __main__ - Step 43004: {'lr': 0.0004111359723930107, 'samples': 8256768, 'steps': 43003, 'loss/train': 0.4670843183994293} 11/07/2021 03:15:33 - INFO - __main__ - Step 43005: {'lr': 0.00041113191499344784, 'samples': 8256960, 'steps': 43004, 'loss/train': 1.4178489446640015} 11/07/2021 03:15:34 - INFO - __main__ - Step 43006: {'lr': 0.0004111278575212814, 'samples': 8257152, 'steps': 43005, 'loss/train': 1.4189621210098267} 11/07/2021 03:15:34 - INFO - __main__ - Step 43007: {'lr': 0.0004111237999765132, 'samples': 8257344, 'steps': 43006, 'loss/train': 1.805106520652771} 11/07/2021 03:15:35 - INFO - __main__ - Step 43008: {'lr': 0.0004111197423591452, 'samples': 8257536, 'steps': 43007, 'loss/train': 1.9079796075820923} 11/07/2021 03:15:35 - INFO - __main__ - Step 43009: {'lr': 0.000411115684669179, 'samples': 8257728, 'steps': 43008, 'loss/train': 1.5675806999206543} 11/07/2021 03:15:35 - INFO - __main__ - Step 43010: {'lr': 0.00041111162690661665, 'samples': 8257920, 'steps': 43009, 'loss/train': 1.6696821451187134} 11/07/2021 03:15:36 - INFO - __main__ - Step 43011: {'lr': 0.00041110756907145984, 'samples': 8258112, 'steps': 43010, 'loss/train': 1.4286752939224243} 11/07/2021 03:15:37 - INFO - __main__ - Step 43012: {'lr': 0.0004111035111637105, 'samples': 8258304, 'steps': 43011, 'loss/train': 1.3758095502853394} 11/07/2021 03:15:37 - INFO - __main__ - Step 43013: {'lr': 0.00041109945318337034, 'samples': 8258496, 'steps': 43012, 'loss/train': 1.6210780143737793} 11/07/2021 03:15:37 - INFO - __main__ - Step 43014: {'lr': 0.00041109539513044127, 'samples': 8258688, 'steps': 43013, 'loss/train': 1.3206629753112793} 11/07/2021 03:15:38 - INFO - __main__ - Step 43015: {'lr': 0.0004110913370049251, 'samples': 8258880, 'steps': 43014, 'loss/train': 1.639256238937378} 11/07/2021 03:15:39 - INFO - __main__ - Step 43016: {'lr': 0.00041108727880682363, 'samples': 8259072, 'steps': 43015, 'loss/train': 0.8471148014068604} 11/07/2021 03:15:39 - INFO - __main__ - Step 43017: {'lr': 0.0004110832205361388, 'samples': 8259264, 'steps': 43016, 'loss/train': 1.240605354309082} 11/07/2021 03:15:40 - INFO - __main__ - Step 43018: {'lr': 0.0004110791621928723, 'samples': 8259456, 'steps': 43017, 'loss/train': 1.4540150165557861} 11/07/2021 03:15:40 - INFO - __main__ - Step 43019: {'lr': 0.00041107510377702604, 'samples': 8259648, 'steps': 43018, 'loss/train': 1.5557411909103394} 11/07/2021 03:15:40 - INFO - __main__ - Step 43020: {'lr': 0.00041107104528860186, 'samples': 8259840, 'steps': 43019, 'loss/train': 1.4920012950897217} 11/07/2021 03:15:41 - INFO - __main__ - Step 43021: {'lr': 0.00041106698672760145, 'samples': 8260032, 'steps': 43020, 'loss/train': 1.1121315956115723} 11/07/2021 03:15:42 - INFO - __main__ - Step 43022: {'lr': 0.0004110629280940268, 'samples': 8260224, 'steps': 43021, 'loss/train': 1.8375533819198608} 11/07/2021 03:15:42 - INFO - __main__ - Step 43023: {'lr': 0.0004110588693878796, 'samples': 8260416, 'steps': 43022, 'loss/train': 1.7017621994018555} 11/07/2021 03:15:42 - INFO - __main__ - Step 43024: {'lr': 0.0004110548106091619, 'samples': 8260608, 'steps': 43023, 'loss/train': 1.8397775888442993} 11/07/2021 03:15:43 - INFO - __main__ - Step 43025: {'lr': 0.00041105075175787534, 'samples': 8260800, 'steps': 43024, 'loss/train': 1.5974022150039673} 11/07/2021 03:15:43 - INFO - __main__ - Step 43026: {'lr': 0.00041104669283402174, 'samples': 8260992, 'steps': 43025, 'loss/train': 1.4904776811599731} 11/07/2021 03:15:44 - INFO - __main__ - Step 43027: {'lr': 0.00041104263383760304, 'samples': 8261184, 'steps': 43026, 'loss/train': 1.4006849527359009} 11/07/2021 03:15:45 - INFO - __main__ - Step 43028: {'lr': 0.000411038574768621, 'samples': 8261376, 'steps': 43027, 'loss/train': 1.6243817806243896} 11/07/2021 03:15:45 - INFO - __main__ - Step 43029: {'lr': 0.00041103451562707745, 'samples': 8261568, 'steps': 43028, 'loss/train': 1.3424921035766602} 11/07/2021 03:15:45 - INFO - __main__ - Step 43030: {'lr': 0.0004110304564129742, 'samples': 8261760, 'steps': 43029, 'loss/train': 1.4041417837142944} 11/07/2021 03:15:46 - INFO - __main__ - Step 43031: {'lr': 0.00041102639712631316, 'samples': 8261952, 'steps': 43030, 'loss/train': 1.4108160734176636} 11/07/2021 03:15:47 - INFO - __main__ - Step 43032: {'lr': 0.0004110223377670962, 'samples': 8262144, 'steps': 43031, 'loss/train': 1.3687145709991455} 11/07/2021 03:15:47 - INFO - __main__ - Step 43033: {'lr': 0.0004110182783353249, 'samples': 8262336, 'steps': 43032, 'loss/train': 1.0869172811508179} 11/07/2021 03:15:47 - INFO - __main__ - Step 43034: {'lr': 0.0004110142188310013, 'samples': 8262528, 'steps': 43033, 'loss/train': 1.3496339321136475} 11/07/2021 03:15:48 - INFO - __main__ - Step 43035: {'lr': 0.0004110101592541272, 'samples': 8262720, 'steps': 43034, 'loss/train': 1.3196594715118408} 11/07/2021 03:15:48 - INFO - __main__ - Step 43036: {'lr': 0.0004110060996047044, 'samples': 8262912, 'steps': 43035, 'loss/train': 1.082405686378479} 11/07/2021 03:15:50 - INFO - __main__ - Step 43037: {'lr': 0.00041100203988273475, 'samples': 8263104, 'steps': 43036, 'loss/train': 1.2167576551437378} 11/07/2021 03:15:50 - INFO - __main__ - Step 43038: {'lr': 0.0004109979800882201, 'samples': 8263296, 'steps': 43037, 'loss/train': 1.5957521200180054} 11/07/2021 03:15:50 - INFO - __main__ - Step 43039: {'lr': 0.00041099392022116214, 'samples': 8263488, 'steps': 43038, 'loss/train': 1.3459219932556152} 11/07/2021 03:15:51 - INFO - __main__ - Step 43040: {'lr': 0.0004109898602815629, 'samples': 8263680, 'steps': 43039, 'loss/train': 1.0219038724899292} 11/07/2021 03:15:51 - INFO - __main__ - Step 43041: {'lr': 0.000410985800269424, 'samples': 8263872, 'steps': 43040, 'loss/train': 1.612999439239502} 11/07/2021 03:15:51 - INFO - __main__ - Step 43042: {'lr': 0.00041098174018474747, 'samples': 8264064, 'steps': 43041, 'loss/train': 1.31856369972229} 11/07/2021 03:15:52 - INFO - __main__ - Step 43043: {'lr': 0.000410977680027535, 'samples': 8264256, 'steps': 43042, 'loss/train': 1.392549991607666} 11/07/2021 03:15:53 - INFO - __main__ - Step 43044: {'lr': 0.00041097361979778853, 'samples': 8264448, 'steps': 43043, 'loss/train': 1.1940284967422485} 11/07/2021 03:15:53 - INFO - __main__ - Step 43045: {'lr': 0.00041096955949550983, 'samples': 8264640, 'steps': 43044, 'loss/train': 1.3849601745605469} 11/07/2021 03:15:54 - INFO - __main__ - Step 43046: {'lr': 0.00041096549912070067, 'samples': 8264832, 'steps': 43045, 'loss/train': 1.7899335622787476} 11/07/2021 03:15:54 - INFO - __main__ - Step 43047: {'lr': 0.000410961438673363, 'samples': 8265024, 'steps': 43046, 'loss/train': 1.868234395980835} 11/07/2021 03:15:55 - INFO - __main__ - Step 43048: {'lr': 0.0004109573781534985, 'samples': 8265216, 'steps': 43047, 'loss/train': 1.6222096681594849} 11/07/2021 03:15:56 - INFO - __main__ - Step 43049: {'lr': 0.0004109533175611092, 'samples': 8265408, 'steps': 43048, 'loss/train': 1.1791523694992065} 11/07/2021 03:15:56 - INFO - __main__ - Step 43050: {'lr': 0.0004109492568961968, 'samples': 8265600, 'steps': 43049, 'loss/train': 1.5527454614639282} 11/07/2021 03:15:56 - INFO - __main__ - Step 43051: {'lr': 0.00041094519615876313, 'samples': 8265792, 'steps': 43050, 'loss/train': 1.4125049114227295} 11/07/2021 03:15:57 - INFO - __main__ - Step 43052: {'lr': 0.0004109411353488101, 'samples': 8265984, 'steps': 43051, 'loss/train': 1.371485948562622} 11/07/2021 03:15:58 - INFO - __main__ - Step 43053: {'lr': 0.00041093707446633934, 'samples': 8266176, 'steps': 43052, 'loss/train': 1.9206199645996094} 11/07/2021 03:15:58 - INFO - __main__ - Step 43054: {'lr': 0.00041093301351135294, 'samples': 8266368, 'steps': 43053, 'loss/train': 2.1060402393341064} 11/07/2021 03:15:58 - INFO - __main__ - Step 43055: {'lr': 0.00041092895248385255, 'samples': 8266560, 'steps': 43054, 'loss/train': 1.3890399932861328} 11/07/2021 03:15:59 - INFO - __main__ - Step 43056: {'lr': 0.00041092489138384, 'samples': 8266752, 'steps': 43055, 'loss/train': 5.796188831329346} 11/07/2021 03:15:59 - INFO - __main__ - Step 43057: {'lr': 0.0004109208302113173, 'samples': 8266944, 'steps': 43056, 'loss/train': 1.612684965133667} 11/07/2021 03:16:00 - INFO - __main__ - Step 43058: {'lr': 0.00041091676896628604, 'samples': 8267136, 'steps': 43057, 'loss/train': 5.75294828414917} 11/07/2021 03:16:01 - INFO - __main__ - Step 43059: {'lr': 0.00041091270764874823, 'samples': 8267328, 'steps': 43058, 'loss/train': 1.6078474521636963} 11/07/2021 03:16:01 - INFO - __main__ - Step 43060: {'lr': 0.0004109086462587056, 'samples': 8267520, 'steps': 43059, 'loss/train': 1.4386402368545532} 11/07/2021 03:16:01 - INFO - __main__ - Step 43061: {'lr': 0.0004109045847961601, 'samples': 8267712, 'steps': 43060, 'loss/train': 1.534162163734436} 11/07/2021 03:16:02 - INFO - __main__ - Step 43062: {'lr': 0.0004109005232611134, 'samples': 8267904, 'steps': 43061, 'loss/train': 1.4820033311843872} 11/07/2021 03:16:02 - INFO - __main__ - Step 43063: {'lr': 0.00041089646165356743, 'samples': 8268096, 'steps': 43062, 'loss/train': 0.9905995726585388} 11/07/2021 03:16:03 - INFO - __main__ - Step 43064: {'lr': 0.000410892399973524, 'samples': 8268288, 'steps': 43063, 'loss/train': 1.497308373451233} 11/07/2021 03:16:03 - INFO - __main__ - Step 43065: {'lr': 0.00041088833822098495, 'samples': 8268480, 'steps': 43064, 'loss/train': 1.0966325998306274} 11/07/2021 03:16:04 - INFO - __main__ - Step 43066: {'lr': 0.00041088427639595206, 'samples': 8268672, 'steps': 43065, 'loss/train': 1.3141721487045288} 11/07/2021 03:16:04 - INFO - __main__ - Step 43067: {'lr': 0.0004108802144984273, 'samples': 8268864, 'steps': 43066, 'loss/train': 1.7734742164611816} 11/07/2021 03:16:04 - INFO - __main__ - Step 43068: {'lr': 0.0004108761525284123, 'samples': 8269056, 'steps': 43067, 'loss/train': 1.7632601261138916} 11/07/2021 03:16:05 - INFO - __main__ - Step 43069: {'lr': 0.000410872090485909, 'samples': 8269248, 'steps': 43068, 'loss/train': 1.366463541984558} 11/07/2021 03:16:06 - INFO - __main__ - Step 43070: {'lr': 0.00041086802837091916, 'samples': 8269440, 'steps': 43069, 'loss/train': 2.006185293197632} 11/07/2021 03:16:06 - INFO - __main__ - Step 43071: {'lr': 0.00041086396618344475, 'samples': 8269632, 'steps': 43070, 'loss/train': 1.924782395362854} 11/07/2021 03:16:06 - INFO - __main__ - Step 43072: {'lr': 0.0004108599039234875, 'samples': 8269824, 'steps': 43071, 'loss/train': 1.1278479099273682} 11/07/2021 03:16:07 - INFO - __main__ - Step 43073: {'lr': 0.00041085584159104925, 'samples': 8270016, 'steps': 43072, 'loss/train': 1.4849249124526978} 11/07/2021 03:16:08 - INFO - __main__ - Step 43074: {'lr': 0.00041085177918613185, 'samples': 8270208, 'steps': 43073, 'loss/train': 1.31929612159729} 11/07/2021 03:16:08 - INFO - __main__ - Step 43075: {'lr': 0.0004108477167087371, 'samples': 8270400, 'steps': 43074, 'loss/train': 1.661223292350769} 11/07/2021 03:16:09 - INFO - __main__ - Step 43076: {'lr': 0.0004108436541588669, 'samples': 8270592, 'steps': 43075, 'loss/train': 1.6085768938064575} 11/07/2021 03:16:09 - INFO - __main__ - Step 43077: {'lr': 0.000410839591536523, 'samples': 8270784, 'steps': 43076, 'loss/train': 1.5742981433868408} 11/07/2021 03:16:09 - INFO - __main__ - Step 43078: {'lr': 0.00041083552884170726, 'samples': 8270976, 'steps': 43077, 'loss/train': 1.8481992483139038} 11/07/2021 03:16:10 - INFO - __main__ - Step 43079: {'lr': 0.0004108314660744216, 'samples': 8271168, 'steps': 43078, 'loss/train': 1.462916612625122} 11/07/2021 03:16:11 - INFO - __main__ - Step 43080: {'lr': 0.0004108274032346676, 'samples': 8271360, 'steps': 43079, 'loss/train': 2.329413890838623} 11/07/2021 03:16:11 - INFO - __main__ - Step 43081: {'lr': 0.0004108233403224474, 'samples': 8271552, 'steps': 43080, 'loss/train': 1.365371823310852} 11/07/2021 03:16:11 - INFO - __main__ - Step 43082: {'lr': 0.0004108192773377626, 'samples': 8271744, 'steps': 43081, 'loss/train': 1.0467580556869507} 11/07/2021 03:16:12 - INFO - __main__ - Step 43083: {'lr': 0.0004108152142806151, 'samples': 8271936, 'steps': 43082, 'loss/train': 1.5733226537704468} 11/07/2021 03:16:12 - INFO - __main__ - Step 43084: {'lr': 0.00041081115115100677, 'samples': 8272128, 'steps': 43083, 'loss/train': 1.3022276163101196} 11/07/2021 03:16:13 - INFO - __main__ - Step 43085: {'lr': 0.0004108070879489395, 'samples': 8272320, 'steps': 43084, 'loss/train': 1.8730179071426392} 11/07/2021 03:16:13 - INFO - __main__ - Step 43086: {'lr': 0.0004108030246744149, 'samples': 8272512, 'steps': 43085, 'loss/train': 1.9628807306289673} 11/07/2021 03:16:14 - INFO - __main__ - Step 43087: {'lr': 0.00041079896132743506, 'samples': 8272704, 'steps': 43086, 'loss/train': 1.5374175310134888} 11/07/2021 03:16:14 - INFO - __main__ - Step 43088: {'lr': 0.0004107948979080016, 'samples': 8272896, 'steps': 43087, 'loss/train': 1.434190273284912} 11/07/2021 03:16:14 - INFO - __main__ - Step 43089: {'lr': 0.00041079083441611646, 'samples': 8273088, 'steps': 43088, 'loss/train': 0.9195286631584167} 11/07/2021 03:16:16 - INFO - __main__ - Step 43090: {'lr': 0.0004107867708517815, 'samples': 8273280, 'steps': 43089, 'loss/train': 1.2906053066253662} 11/07/2021 03:16:16 - INFO - __main__ - Step 43091: {'lr': 0.0004107827072149984, 'samples': 8273472, 'steps': 43090, 'loss/train': 1.2737891674041748} 11/07/2021 03:16:16 - INFO - __main__ - Step 43092: {'lr': 0.0004107786435057692, 'samples': 8273664, 'steps': 43091, 'loss/train': 1.444359302520752} 11/07/2021 03:16:17 - INFO - __main__ - Step 43093: {'lr': 0.0004107745797240956, 'samples': 8273856, 'steps': 43092, 'loss/train': 1.8389931917190552} 11/07/2021 03:16:17 - INFO - __main__ - Step 43094: {'lr': 0.0004107705158699794, 'samples': 8274048, 'steps': 43093, 'loss/train': 0.8773282170295715} 11/07/2021 03:16:18 - INFO - __main__ - Step 43095: {'lr': 0.00041076645194342254, 'samples': 8274240, 'steps': 43094, 'loss/train': 2.0294175148010254} 11/07/2021 03:16:18 - INFO - __main__ - Step 43096: {'lr': 0.00041076238794442675, 'samples': 8274432, 'steps': 43095, 'loss/train': 1.283133864402771} 11/07/2021 03:16:19 - INFO - __main__ - Step 43097: {'lr': 0.00041075832387299396, 'samples': 8274624, 'steps': 43096, 'loss/train': 1.3659342527389526} 11/07/2021 03:16:19 - INFO - __main__ - Step 43098: {'lr': 0.00041075425972912595, 'samples': 8274816, 'steps': 43097, 'loss/train': 1.7125060558319092} 11/07/2021 03:16:19 - INFO - __main__ - Step 43099: {'lr': 0.00041075019551282455, 'samples': 8275008, 'steps': 43098, 'loss/train': 1.3746939897537231} 11/07/2021 03:16:20 - INFO - __main__ - Step 43100: {'lr': 0.00041074613122409157, 'samples': 8275200, 'steps': 43099, 'loss/train': 1.4970327615737915} 11/07/2021 03:16:21 - INFO - __main__ - Step 43101: {'lr': 0.0004107420668629289, 'samples': 8275392, 'steps': 43100, 'loss/train': 1.3181568384170532} 11/07/2021 03:16:21 - INFO - __main__ - Step 43102: {'lr': 0.00041073800242933826, 'samples': 8275584, 'steps': 43101, 'loss/train': 1.4100422859191895} 11/07/2021 03:16:21 - INFO - __main__ - Step 43103: {'lr': 0.00041073393792332157, 'samples': 8275776, 'steps': 43102, 'loss/train': 1.6294130086898804} 11/07/2021 03:16:22 - INFO - __main__ - Step 43104: {'lr': 0.0004107298733448807, 'samples': 8275968, 'steps': 43103, 'loss/train': 1.350721001625061} 11/07/2021 03:16:23 - INFO - __main__ - Step 43105: {'lr': 0.0004107258086940174, 'samples': 8276160, 'steps': 43104, 'loss/train': 1.189432978630066} 11/07/2021 03:16:23 - INFO - __main__ - Step 43106: {'lr': 0.0004107217439707336, 'samples': 8276352, 'steps': 43105, 'loss/train': 1.315584659576416} 11/07/2021 03:16:23 - INFO - __main__ - Step 43107: {'lr': 0.000410717679175031, 'samples': 8276544, 'steps': 43106, 'loss/train': 1.1594332456588745} 11/07/2021 03:16:24 - INFO - __main__ - Step 43108: {'lr': 0.00041071361430691143, 'samples': 8276736, 'steps': 43107, 'loss/train': 1.5375863313674927} 11/07/2021 03:16:24 - INFO - __main__ - Step 43109: {'lr': 0.00041070954936637687, 'samples': 8276928, 'steps': 43108, 'loss/train': 1.8577443361282349} 11/07/2021 03:16:25 - INFO - __main__ - Step 43110: {'lr': 0.00041070548435342903, 'samples': 8277120, 'steps': 43109, 'loss/train': 1.6667335033416748} 11/07/2021 03:16:25 - INFO - __main__ - Step 43111: {'lr': 0.00041070141926806983, 'samples': 8277312, 'steps': 43110, 'loss/train': 1.0912951231002808} 11/07/2021 03:16:26 - INFO - __main__ - Step 43112: {'lr': 0.00041069735411030105, 'samples': 8277504, 'steps': 43111, 'loss/train': 1.5038973093032837} 11/07/2021 03:16:26 - INFO - __main__ - Step 43113: {'lr': 0.00041069328888012447, 'samples': 8277696, 'steps': 43112, 'loss/train': 1.277441382408142} 11/07/2021 03:16:26 - INFO - __main__ - Step 43114: {'lr': 0.000410689223577542, 'samples': 8277888, 'steps': 43113, 'loss/train': 1.4587876796722412} 11/07/2021 03:16:27 - INFO - __main__ - Step 43115: {'lr': 0.00041068515820255543, 'samples': 8278080, 'steps': 43114, 'loss/train': 1.666568636894226} 11/07/2021 03:16:28 - INFO - __main__ - Step 43116: {'lr': 0.00041068109275516665, 'samples': 8278272, 'steps': 43115, 'loss/train': 1.3361376523971558} 11/07/2021 03:16:28 - INFO - __main__ - Step 43117: {'lr': 0.0004106770272353774, 'samples': 8278464, 'steps': 43116, 'loss/train': 1.3129546642303467} 11/07/2021 03:16:29 - INFO - __main__ - Step 43118: {'lr': 0.00041067296164318956, 'samples': 8278656, 'steps': 43117, 'loss/train': 1.4723830223083496} 11/07/2021 03:16:29 - INFO - __main__ - Step 43119: {'lr': 0.000410668895978605, 'samples': 8278848, 'steps': 43118, 'loss/train': 1.167095422744751} 11/07/2021 03:16:29 - INFO - __main__ - Step 43120: {'lr': 0.0004106648302416255, 'samples': 8279040, 'steps': 43119, 'loss/train': 1.4202436208724976} 11/07/2021 03:16:30 - INFO - __main__ - Step 43121: {'lr': 0.0004106607644322529, 'samples': 8279232, 'steps': 43120, 'loss/train': 1.5817418098449707} 11/07/2021 03:16:31 - INFO - __main__ - Step 43122: {'lr': 0.00041065669855048896, 'samples': 8279424, 'steps': 43121, 'loss/train': 1.5932525396347046} 11/07/2021 03:16:31 - INFO - __main__ - Step 43123: {'lr': 0.0004106526325963357, 'samples': 8279616, 'steps': 43122, 'loss/train': 1.3598804473876953} 11/07/2021 03:16:31 - INFO - __main__ - Step 43124: {'lr': 0.0004106485665697948, 'samples': 8279808, 'steps': 43123, 'loss/train': 1.0472019910812378} 11/07/2021 03:16:32 - INFO - __main__ - Step 43125: {'lr': 0.00041064450047086814, 'samples': 8280000, 'steps': 43124, 'loss/train': 1.4605704545974731} 11/07/2021 03:16:33 - INFO - __main__ - Step 43126: {'lr': 0.00041064043429955756, 'samples': 8280192, 'steps': 43125, 'loss/train': 1.3245314359664917} 11/07/2021 03:16:33 - INFO - __main__ - Step 43127: {'lr': 0.0004106363680558649, 'samples': 8280384, 'steps': 43126, 'loss/train': 1.6327924728393555} 11/07/2021 03:16:33 - INFO - __main__ - Step 43128: {'lr': 0.0004106323017397919, 'samples': 8280576, 'steps': 43127, 'loss/train': 1.4311386346817017} 11/07/2021 03:16:34 - INFO - __main__ - Step 43129: {'lr': 0.00041062823535134053, 'samples': 8280768, 'steps': 43128, 'loss/train': 1.6788736581802368} 11/07/2021 03:16:34 - INFO - __main__ - Step 43130: {'lr': 0.0004106241688905126, 'samples': 8280960, 'steps': 43129, 'loss/train': 1.4539369344711304} 11/07/2021 03:16:35 - INFO - __main__ - Step 43131: {'lr': 0.00041062010235730974, 'samples': 8281152, 'steps': 43130, 'loss/train': 1.5070507526397705} 11/07/2021 03:16:36 - INFO - __main__ - Step 43132: {'lr': 0.0004106160357517341, 'samples': 8281344, 'steps': 43131, 'loss/train': 1.384362816810608} 11/07/2021 03:16:36 - INFO - __main__ - Step 43133: {'lr': 0.00041061196907378727, 'samples': 8281536, 'steps': 43132, 'loss/train': 0.20600904524326324} 11/07/2021 03:16:36 - INFO - __main__ - Step 43134: {'lr': 0.00041060790232347116, 'samples': 8281728, 'steps': 43133, 'loss/train': 1.1068603992462158} 11/07/2021 03:16:37 - INFO - __main__ - Step 43135: {'lr': 0.00041060383550078764, 'samples': 8281920, 'steps': 43134, 'loss/train': 1.7642035484313965} 11/07/2021 03:16:37 - INFO - __main__ - Step 43136: {'lr': 0.00041059976860573845, 'samples': 8282112, 'steps': 43135, 'loss/train': 1.1896979808807373} 11/07/2021 03:16:38 - INFO - __main__ - Step 43137: {'lr': 0.00041059570163832555, 'samples': 8282304, 'steps': 43136, 'loss/train': 0.7403428554534912} 11/07/2021 03:16:38 - INFO - __main__ - Step 43138: {'lr': 0.00041059163459855066, 'samples': 8282496, 'steps': 43137, 'loss/train': 1.6915414333343506} 11/07/2021 03:16:39 - INFO - __main__ - Step 43139: {'lr': 0.00041058756748641573, 'samples': 8282688, 'steps': 43138, 'loss/train': 1.559566855430603} 11/07/2021 03:16:39 - INFO - __main__ - Step 43140: {'lr': 0.0004105835003019225, 'samples': 8282880, 'steps': 43139, 'loss/train': 1.4372590780258179} 11/07/2021 03:16:39 - INFO - __main__ - Step 43141: {'lr': 0.00041057943304507273, 'samples': 8283072, 'steps': 43140, 'loss/train': 1.3585083484649658} 11/07/2021 03:16:41 - INFO - __main__ - Step 43142: {'lr': 0.0004105753657158684, 'samples': 8283264, 'steps': 43141, 'loss/train': 1.2696479558944702} 11/07/2021 03:16:41 - INFO - __main__ - Step 43143: {'lr': 0.00041057129831431133, 'samples': 8283456, 'steps': 43142, 'loss/train': 1.361433982849121} 11/07/2021 03:16:41 - INFO - __main__ - Step 43144: {'lr': 0.00041056723084040324, 'samples': 8283648, 'steps': 43143, 'loss/train': 1.8104569911956787} 11/07/2021 03:16:42 - INFO - __main__ - Step 43145: {'lr': 0.00041056316329414613, 'samples': 8283840, 'steps': 43144, 'loss/train': 1.4259611368179321} 11/07/2021 03:16:42 - INFO - __main__ - Step 43146: {'lr': 0.00041055909567554166, 'samples': 8284032, 'steps': 43145, 'loss/train': 1.4371272325515747} 11/07/2021 03:16:43 - INFO - __main__ - Step 43147: {'lr': 0.00041055502798459175, 'samples': 8284224, 'steps': 43146, 'loss/train': 1.4179667234420776} 11/07/2021 03:16:43 - INFO - __main__ - Step 43148: {'lr': 0.00041055096022129823, 'samples': 8284416, 'steps': 43147, 'loss/train': 1.4169361591339111} 11/07/2021 03:16:44 - INFO - __main__ - Step 43149: {'lr': 0.0004105468923856629, 'samples': 8284608, 'steps': 43148, 'loss/train': 1.2737438678741455} 11/07/2021 03:16:44 - INFO - __main__ - Step 43150: {'lr': 0.00041054282447768763, 'samples': 8284800, 'steps': 43149, 'loss/train': 1.6200916767120361} 11/07/2021 03:16:44 - INFO - __main__ - Step 43151: {'lr': 0.00041053875649737424, 'samples': 8284992, 'steps': 43150, 'loss/train': 1.7988002300262451} 11/07/2021 03:16:45 - INFO - __main__ - Step 43152: {'lr': 0.0004105346884447246, 'samples': 8285184, 'steps': 43151, 'loss/train': 1.8118313550949097} 11/07/2021 03:16:46 - INFO - __main__ - Step 43153: {'lr': 0.00041053062031974055, 'samples': 8285376, 'steps': 43152, 'loss/train': 1.5103808641433716} 11/07/2021 03:16:46 - INFO - __main__ - Step 43154: {'lr': 0.00041052655212242377, 'samples': 8285568, 'steps': 43153, 'loss/train': 1.352491021156311} 11/07/2021 03:16:46 - INFO - __main__ - Step 43155: {'lr': 0.00041052248385277623, 'samples': 8285760, 'steps': 43154, 'loss/train': 1.6922416687011719} 11/07/2021 03:16:47 - INFO - __main__ - Step 43156: {'lr': 0.0004105184155107998, 'samples': 8285952, 'steps': 43155, 'loss/train': 1.8941594362258911} 11/07/2021 03:16:48 - INFO - __main__ - Step 43157: {'lr': 0.00041051434709649614, 'samples': 8286144, 'steps': 43156, 'loss/train': 1.6118606328964233} 11/07/2021 03:16:48 - INFO - __main__ - Step 43158: {'lr': 0.0004105102786098672, 'samples': 8286336, 'steps': 43157, 'loss/train': 1.1860041618347168} 11/07/2021 03:16:49 - INFO - __main__ - Step 43159: {'lr': 0.0004105062100509149, 'samples': 8286528, 'steps': 43158, 'loss/train': 1.4262062311172485} 11/07/2021 03:16:49 - INFO - __main__ - Step 43160: {'lr': 0.000410502141419641, 'samples': 8286720, 'steps': 43159, 'loss/train': 1.2417993545532227} 11/07/2021 03:16:49 - INFO - __main__ - Step 43161: {'lr': 0.00041049807271604724, 'samples': 8286912, 'steps': 43160, 'loss/train': 1.6057826280593872} 11/07/2021 03:16:50 - INFO - __main__ - Step 43162: {'lr': 0.00041049400394013545, 'samples': 8287104, 'steps': 43161, 'loss/train': 0.7456337809562683} 11/07/2021 03:16:51 - INFO - __main__ - Step 43163: {'lr': 0.0004104899350919077, 'samples': 8287296, 'steps': 43162, 'loss/train': 1.8297420740127563} 11/07/2021 03:16:52 - INFO - __main__ - Step 43164: {'lr': 0.0004104858661713655, 'samples': 8287488, 'steps': 43163, 'loss/train': 1.7945384979248047} 11/07/2021 03:16:52 - INFO - __main__ - Step 43165: {'lr': 0.00041048179717851095, 'samples': 8287680, 'steps': 43164, 'loss/train': 0.2556303143501282} 11/07/2021 03:16:52 - INFO - __main__ - Step 43166: {'lr': 0.00041047772811334584, 'samples': 8287872, 'steps': 43165, 'loss/train': 1.3993481397628784} 11/07/2021 03:16:53 - INFO - __main__ - Step 43167: {'lr': 0.0004104736589758719, 'samples': 8288064, 'steps': 43166, 'loss/train': 0.8999582529067993} 11/07/2021 03:16:53 - INFO - __main__ - Step 43168: {'lr': 0.0004104695897660909, 'samples': 8288256, 'steps': 43167, 'loss/train': 1.4187109470367432} 11/07/2021 03:16:54 - INFO - __main__ - Step 43169: {'lr': 0.0004104655204840048, 'samples': 8288448, 'steps': 43168, 'loss/train': 1.7086986303329468} 11/07/2021 03:16:54 - INFO - __main__ - Step 43170: {'lr': 0.0004104614511296155, 'samples': 8288640, 'steps': 43169, 'loss/train': 1.5957424640655518} 11/07/2021 03:16:55 - INFO - __main__ - Step 43171: {'lr': 0.00041045738170292467, 'samples': 8288832, 'steps': 43170, 'loss/train': 1.7372387647628784} 11/07/2021 03:16:55 - INFO - __main__ - Step 43172: {'lr': 0.0004104533122039342, 'samples': 8289024, 'steps': 43171, 'loss/train': 1.630563735961914} 11/07/2021 03:16:55 - INFO - __main__ - Step 43173: {'lr': 0.00041044924263264603, 'samples': 8289216, 'steps': 43172, 'loss/train': 1.5493334531784058} 11/07/2021 03:16:56 - INFO - __main__ - Step 43174: {'lr': 0.00041044517298906194, 'samples': 8289408, 'steps': 43173, 'loss/train': 1.1043447256088257} 11/07/2021 03:16:57 - INFO - __main__ - Step 43175: {'lr': 0.0004104411032731836, 'samples': 8289600, 'steps': 43174, 'loss/train': 1.4027479887008667} 11/07/2021 03:16:57 - INFO - __main__ - Step 43176: {'lr': 0.00041043703348501304, 'samples': 8289792, 'steps': 43175, 'loss/train': 0.8495373129844666} 11/07/2021 03:16:57 - INFO - __main__ - Step 43177: {'lr': 0.0004104329636245521, 'samples': 8289984, 'steps': 43176, 'loss/train': 0.77286696434021} 11/07/2021 03:16:58 - INFO - __main__ - Step 43178: {'lr': 0.0004104288936918024, 'samples': 8290176, 'steps': 43177, 'loss/train': 1.7297497987747192} 11/07/2021 03:16:59 - INFO - __main__ - Step 43179: {'lr': 0.00041042482368676604, 'samples': 8290368, 'steps': 43178, 'loss/train': 1.2108728885650635} 11/07/2021 03:16:59 - INFO - __main__ - Step 43180: {'lr': 0.00041042075360944464, 'samples': 8290560, 'steps': 43179, 'loss/train': 1.4625720977783203} 11/07/2021 03:17:00 - INFO - __main__ - Step 43181: {'lr': 0.0004104166834598402, 'samples': 8290752, 'steps': 43180, 'loss/train': 1.3745132684707642} 11/07/2021 03:17:00 - INFO - __main__ - Step 43182: {'lr': 0.00041041261323795437, 'samples': 8290944, 'steps': 43181, 'loss/train': 1.5282878875732422} 11/07/2021 03:17:00 - INFO - __main__ - Step 43183: {'lr': 0.0004104085429437892, 'samples': 8291136, 'steps': 43182, 'loss/train': 1.267673373222351} 11/07/2021 03:17:01 - INFO - __main__ - Step 43184: {'lr': 0.00041040447257734635, 'samples': 8291328, 'steps': 43183, 'loss/train': 1.4752342700958252} 11/07/2021 03:17:02 - INFO - __main__ - Step 43185: {'lr': 0.00041040040213862774, 'samples': 8291520, 'steps': 43184, 'loss/train': 2.0098776817321777} 11/07/2021 03:17:02 - INFO - __main__ - Step 43186: {'lr': 0.00041039633162763523, 'samples': 8291712, 'steps': 43185, 'loss/train': 1.0623544454574585} 11/07/2021 03:17:02 - INFO - __main__ - Step 43187: {'lr': 0.00041039226104437056, 'samples': 8291904, 'steps': 43186, 'loss/train': 1.8754194974899292} 11/07/2021 03:17:03 - INFO - __main__ - Step 43188: {'lr': 0.0004103881903888356, 'samples': 8292096, 'steps': 43187, 'loss/train': 1.6514359712600708} 11/07/2021 03:17:04 - INFO - __main__ - Step 43189: {'lr': 0.0004103841196610322, 'samples': 8292288, 'steps': 43188, 'loss/train': 2.032471179962158} 11/07/2021 03:17:04 - INFO - __main__ - Step 43190: {'lr': 0.0004103800488609622, 'samples': 8292480, 'steps': 43189, 'loss/train': 1.1504318714141846} 11/07/2021 03:17:04 - INFO - __main__ - Step 43191: {'lr': 0.0004103759779886274, 'samples': 8292672, 'steps': 43190, 'loss/train': 1.742537021636963} 11/07/2021 03:17:05 - INFO - __main__ - Step 43192: {'lr': 0.0004103719070440297, 'samples': 8292864, 'steps': 43191, 'loss/train': 0.9761320352554321} 11/07/2021 03:17:05 - INFO - __main__ - Step 43193: {'lr': 0.00041036783602717086, 'samples': 8293056, 'steps': 43192, 'loss/train': 1.222903847694397} 11/07/2021 03:17:06 - INFO - __main__ - Step 43194: {'lr': 0.00041036376493805286, 'samples': 8293248, 'steps': 43193, 'loss/train': 2.652095317840576} 11/07/2021 03:17:07 - INFO - __main__ - Step 43195: {'lr': 0.0004103596937766773, 'samples': 8293440, 'steps': 43194, 'loss/train': 1.6091325283050537} 11/07/2021 03:17:07 - INFO - __main__ - Step 43196: {'lr': 0.00041035562254304614, 'samples': 8293632, 'steps': 43195, 'loss/train': 1.3011974096298218} 11/07/2021 03:17:07 - INFO - __main__ - Step 43197: {'lr': 0.00041035155123716127, 'samples': 8293824, 'steps': 43196, 'loss/train': 0.3642762005329132} 11/07/2021 03:17:08 - INFO - __main__ - Step 43198: {'lr': 0.00041034747985902446, 'samples': 8294016, 'steps': 43197, 'loss/train': 2.0115673542022705} 11/07/2021 03:17:08 - INFO - __main__ - Step 43199: {'lr': 0.0004103434084086375, 'samples': 8294208, 'steps': 43198, 'loss/train': 1.4424716234207153} 11/07/2021 03:17:09 - INFO - __main__ - Step 43200: {'lr': 0.0004103393368860023, 'samples': 8294400, 'steps': 43199, 'loss/train': 1.7488442659378052} 11/07/2021 03:17:09 - INFO - __main__ - Step 43201: {'lr': 0.0004103352652911206, 'samples': 8294592, 'steps': 43200, 'loss/train': 1.2876766920089722} 11/07/2021 03:17:10 - INFO - __main__ - Step 43202: {'lr': 0.0004103311936239944, 'samples': 8294784, 'steps': 43201, 'loss/train': 1.2778476476669312} 11/07/2021 03:17:10 - INFO - __main__ - Step 43203: {'lr': 0.0004103271218846254, 'samples': 8294976, 'steps': 43202, 'loss/train': 1.2290693521499634} 11/07/2021 03:17:10 - INFO - __main__ - Step 43204: {'lr': 0.00041032305007301554, 'samples': 8295168, 'steps': 43203, 'loss/train': 1.2880079746246338} 11/07/2021 03:17:11 - INFO - __main__ - Step 43205: {'lr': 0.00041031897818916645, 'samples': 8295360, 'steps': 43204, 'loss/train': 1.504037618637085} 11/07/2021 03:17:12 - INFO - __main__ - Step 43206: {'lr': 0.0004103149062330802, 'samples': 8295552, 'steps': 43205, 'loss/train': 1.6308549642562866} 11/07/2021 03:17:12 - INFO - __main__ - Step 43207: {'lr': 0.00041031083420475854, 'samples': 8295744, 'steps': 43206, 'loss/train': 1.379603624343872} 11/07/2021 03:17:12 - INFO - __main__ - Step 43208: {'lr': 0.00041030676210420324, 'samples': 8295936, 'steps': 43207, 'loss/train': 1.3084861040115356} 11/07/2021 03:17:13 - INFO - __main__ - Step 43209: {'lr': 0.0004103026899314162, 'samples': 8296128, 'steps': 43208, 'loss/train': 1.4617557525634766} 11/07/2021 03:17:14 - INFO - __main__ - Step 43210: {'lr': 0.00041029861768639934, 'samples': 8296320, 'steps': 43209, 'loss/train': 1.5682384967803955} 11/07/2021 03:17:14 - INFO - __main__ - Step 43211: {'lr': 0.0004102945453691542, 'samples': 8296512, 'steps': 43210, 'loss/train': 1.2275004386901855} 11/07/2021 03:17:15 - INFO - __main__ - Step 43212: {'lr': 0.00041029047297968293, 'samples': 8296704, 'steps': 43211, 'loss/train': 1.4966442584991455} 11/07/2021 03:17:15 - INFO - __main__ - Step 43213: {'lr': 0.00041028640051798726, 'samples': 8296896, 'steps': 43212, 'loss/train': 1.5281322002410889} 11/07/2021 03:17:15 - INFO - __main__ - Step 43214: {'lr': 0.000410282327984069, 'samples': 8297088, 'steps': 43213, 'loss/train': 1.1413795948028564} 11/07/2021 03:17:16 - INFO - __main__ - Step 43215: {'lr': 0.00041027825537792993, 'samples': 8297280, 'steps': 43214, 'loss/train': 1.04563307762146} 11/07/2021 03:17:17 - INFO - __main__ - Step 43216: {'lr': 0.0004102741826995721, 'samples': 8297472, 'steps': 43215, 'loss/train': 1.8254573345184326} 11/07/2021 03:17:17 - INFO - __main__ - Step 43217: {'lr': 0.000410270109948997, 'samples': 8297664, 'steps': 43216, 'loss/train': 1.4102542400360107} 11/07/2021 03:17:17 - INFO - __main__ - Step 43218: {'lr': 0.0004102660371262068, 'samples': 8297856, 'steps': 43217, 'loss/train': 1.1999146938323975} 11/07/2021 03:17:18 - INFO - __main__ - Step 43219: {'lr': 0.0004102619642312031, 'samples': 8298048, 'steps': 43218, 'loss/train': 1.255764365196228} 11/07/2021 03:17:18 - INFO - __main__ - Step 43220: {'lr': 0.00041025789126398793, 'samples': 8298240, 'steps': 43219, 'loss/train': 1.6038835048675537} 11/07/2021 03:17:20 - INFO - __main__ - Step 43221: {'lr': 0.000410253818224563, 'samples': 8298432, 'steps': 43220, 'loss/train': 1.781392216682434} 11/07/2021 03:17:20 - INFO - __main__ - Step 43222: {'lr': 0.0004102497451129302, 'samples': 8298624, 'steps': 43221, 'loss/train': 1.4568729400634766} 11/07/2021 03:17:20 - INFO - __main__ - Step 43223: {'lr': 0.00041024567192909125, 'samples': 8298816, 'steps': 43222, 'loss/train': 1.552573561668396} 11/07/2021 03:17:21 - INFO - __main__ - Step 43224: {'lr': 0.0004102415986730481, 'samples': 8299008, 'steps': 43223, 'loss/train': 0.9100706577301025} 11/07/2021 03:17:21 - INFO - __main__ - Step 43225: {'lr': 0.0004102375253448026, 'samples': 8299200, 'steps': 43224, 'loss/train': 1.3224427700042725} 11/07/2021 03:17:21 - INFO - __main__ - Step 43226: {'lr': 0.0004102334519443565, 'samples': 8299392, 'steps': 43225, 'loss/train': 0.4940943717956543} 11/07/2021 03:17:23 - INFO - __main__ - Step 43227: {'lr': 0.0004102293784717117, 'samples': 8299584, 'steps': 43226, 'loss/train': 0.4784579575061798} 11/07/2021 03:17:23 - INFO - __main__ - Step 43228: {'lr': 0.00041022530492687006, 'samples': 8299776, 'steps': 43227, 'loss/train': 1.7686057090759277} 11/07/2021 03:17:24 - INFO - __main__ - Step 43229: {'lr': 0.0004102212313098333, 'samples': 8299968, 'steps': 43228, 'loss/train': 1.4608936309814453} 11/07/2021 03:17:24 - INFO - __main__ - Step 43230: {'lr': 0.00041021715762060336, 'samples': 8300160, 'steps': 43229, 'loss/train': 1.6046879291534424} 11/07/2021 03:17:24 - INFO - __main__ - Step 43231: {'lr': 0.000410213083859182, 'samples': 8300352, 'steps': 43230, 'loss/train': 1.4580544233322144} 11/07/2021 03:17:25 - INFO - __main__ - Step 43232: {'lr': 0.0004102090100255711, 'samples': 8300544, 'steps': 43231, 'loss/train': 4.313381195068359} 11/07/2021 03:17:26 - INFO - __main__ - Step 43233: {'lr': 0.00041020493611977263, 'samples': 8300736, 'steps': 43232, 'loss/train': 3.5713274478912354} 11/07/2021 03:17:26 - INFO - __main__ - Step 43234: {'lr': 0.0004102008621417881, 'samples': 8300928, 'steps': 43233, 'loss/train': 1.7774174213409424} 11/07/2021 03:17:26 - INFO - __main__ - Step 43235: {'lr': 0.0004101967880916196, 'samples': 8301120, 'steps': 43234, 'loss/train': 1.1446168422698975} 11/07/2021 03:17:27 - INFO - __main__ - Step 43236: {'lr': 0.00041019271396926894, 'samples': 8301312, 'steps': 43235, 'loss/train': 1.3731508255004883} 11/07/2021 03:17:27 - INFO - __main__ - Step 43237: {'lr': 0.0004101886397747379, 'samples': 8301504, 'steps': 43236, 'loss/train': 1.533296823501587} 11/07/2021 03:17:28 - INFO - __main__ - Step 43238: {'lr': 0.0004101845655080283, 'samples': 8301696, 'steps': 43237, 'loss/train': 1.5787715911865234} 11/07/2021 03:17:28 - INFO - __main__ - Step 43239: {'lr': 0.00041018049116914204, 'samples': 8301888, 'steps': 43238, 'loss/train': 1.5141563415527344} 11/07/2021 03:17:29 - INFO - __main__ - Step 43240: {'lr': 0.00041017641675808095, 'samples': 8302080, 'steps': 43239, 'loss/train': 0.6786595582962036} 11/07/2021 03:17:29 - INFO - __main__ - Step 43241: {'lr': 0.00041017234227484675, 'samples': 8302272, 'steps': 43240, 'loss/train': 1.649290919303894} 11/07/2021 03:17:29 - INFO - __main__ - Step 43242: {'lr': 0.0004101682677194414, 'samples': 8302464, 'steps': 43241, 'loss/train': 1.368617057800293} 11/07/2021 03:17:30 - INFO - __main__ - Step 43243: {'lr': 0.0004101641930918667, 'samples': 8302656, 'steps': 43242, 'loss/train': 1.4992783069610596} 11/07/2021 03:17:31 - INFO - __main__ - Step 43244: {'lr': 0.00041016011839212446, 'samples': 8302848, 'steps': 43243, 'loss/train': 1.781638503074646} 11/07/2021 03:17:31 - INFO - __main__ - Step 43245: {'lr': 0.0004101560436202166, 'samples': 8303040, 'steps': 43244, 'loss/train': 1.5576189756393433} 11/07/2021 03:17:31 - INFO - __main__ - Step 43246: {'lr': 0.0004101519687761449, 'samples': 8303232, 'steps': 43245, 'loss/train': 1.2122910022735596} 11/07/2021 03:17:32 - INFO - __main__ - Step 43247: {'lr': 0.00041014789385991114, 'samples': 8303424, 'steps': 43246, 'loss/train': 1.5584133863449097} 11/07/2021 03:17:33 - INFO - __main__ - Step 43248: {'lr': 0.00041014381887151727, 'samples': 8303616, 'steps': 43247, 'loss/train': 1.2769432067871094} 11/07/2021 03:17:33 - INFO - __main__ - Step 43249: {'lr': 0.00041013974381096503, 'samples': 8303808, 'steps': 43248, 'loss/train': 1.1447575092315674} 11/07/2021 03:17:34 - INFO - __main__ - Step 43250: {'lr': 0.00041013566867825627, 'samples': 8304000, 'steps': 43249, 'loss/train': 1.616087794303894} 11/07/2021 03:17:34 - INFO - __main__ - Step 43251: {'lr': 0.00041013159347339293, 'samples': 8304192, 'steps': 43250, 'loss/train': 0.3687068819999695} 11/07/2021 03:17:34 - INFO - __main__ - Step 43252: {'lr': 0.0004101275181963767, 'samples': 8304384, 'steps': 43251, 'loss/train': 1.7518508434295654} 11/07/2021 03:17:35 - INFO - __main__ - Step 43253: {'lr': 0.0004101234428472095, 'samples': 8304576, 'steps': 43252, 'loss/train': 2.2655272483825684} 11/07/2021 03:17:36 - INFO - __main__ - Step 43254: {'lr': 0.0004101193674258931, 'samples': 8304768, 'steps': 43253, 'loss/train': 1.2787197828292847} 11/07/2021 03:17:36 - INFO - __main__ - Step 43255: {'lr': 0.00041011529193242947, 'samples': 8304960, 'steps': 43254, 'loss/train': 1.5614346265792847} 11/07/2021 03:17:36 - INFO - __main__ - Step 43256: {'lr': 0.00041011121636682024, 'samples': 8305152, 'steps': 43255, 'loss/train': 1.4917633533477783} 11/07/2021 03:17:37 - INFO - __main__ - Step 43257: {'lr': 0.0004101071407290675, 'samples': 8305344, 'steps': 43256, 'loss/train': 1.564497709274292} 11/07/2021 03:17:37 - INFO - __main__ - Step 43258: {'lr': 0.00041010306501917287, 'samples': 8305536, 'steps': 43257, 'loss/train': 1.4174599647521973} 11/07/2021 03:17:38 - INFO - __main__ - Step 43259: {'lr': 0.0004100989892371383, 'samples': 8305728, 'steps': 43258, 'loss/train': 1.2914807796478271} 11/07/2021 03:17:38 - INFO - __main__ - Step 43260: {'lr': 0.00041009491338296557, 'samples': 8305920, 'steps': 43259, 'loss/train': 1.1329169273376465} 11/07/2021 03:17:39 - INFO - __main__ - Step 43261: {'lr': 0.00041009083745665654, 'samples': 8306112, 'steps': 43260, 'loss/train': 1.5572667121887207} 11/07/2021 03:17:39 - INFO - __main__ - Step 43262: {'lr': 0.0004100867614582131, 'samples': 8306304, 'steps': 43261, 'loss/train': 1.5937113761901855} 11/07/2021 03:17:40 - INFO - __main__ - Step 43263: {'lr': 0.00041008268538763703, 'samples': 8306496, 'steps': 43262, 'loss/train': 1.6277613639831543} 11/07/2021 03:17:41 - INFO - __main__ - Step 43264: {'lr': 0.00041007860924493014, 'samples': 8306688, 'steps': 43263, 'loss/train': 1.5388847589492798} 11/07/2021 03:17:41 - INFO - __main__ - Step 43265: {'lr': 0.0004100745330300943, 'samples': 8306880, 'steps': 43264, 'loss/train': 1.5251654386520386} 11/07/2021 03:17:42 - INFO - __main__ - Step 43266: {'lr': 0.0004100704567431314, 'samples': 8307072, 'steps': 43265, 'loss/train': 1.2249115705490112} 11/07/2021 03:17:42 - INFO - __main__ - Step 43267: {'lr': 0.0004100663803840431, 'samples': 8307264, 'steps': 43266, 'loss/train': 0.8777159452438354} 11/07/2021 03:17:42 - INFO - __main__ - Step 43268: {'lr': 0.0004100623039528315, 'samples': 8307456, 'steps': 43267, 'loss/train': 0.4611353874206543} 11/07/2021 03:17:43 - INFO - __main__ - Step 43269: {'lr': 0.0004100582274494982, 'samples': 8307648, 'steps': 43268, 'loss/train': 1.3887360095977783} 11/07/2021 03:17:44 - INFO - __main__ - Step 43270: {'lr': 0.00041005415087404516, 'samples': 8307840, 'steps': 43269, 'loss/train': 1.452347993850708} 11/07/2021 03:17:44 - INFO - __main__ - Step 43271: {'lr': 0.0004100500742264742, 'samples': 8308032, 'steps': 43270, 'loss/train': 0.9966098070144653} 11/07/2021 03:17:44 - INFO - __main__ - Step 43272: {'lr': 0.0004100459975067871, 'samples': 8308224, 'steps': 43271, 'loss/train': 0.7639972567558289} 11/07/2021 03:17:45 - INFO - __main__ - Step 43273: {'lr': 0.0004100419207149858, 'samples': 8308416, 'steps': 43272, 'loss/train': 1.5415605306625366} 11/07/2021 03:17:46 - INFO - __main__ - Step 43274: {'lr': 0.0004100378438510721, 'samples': 8308608, 'steps': 43273, 'loss/train': 1.5262449979782104} 11/07/2021 03:17:46 - INFO - __main__ - Step 43275: {'lr': 0.00041003376691504777, 'samples': 8308800, 'steps': 43274, 'loss/train': 1.361143708229065} 11/07/2021 03:17:47 - INFO - __main__ - Step 43276: {'lr': 0.0004100296899069147, 'samples': 8308992, 'steps': 43275, 'loss/train': 1.4873331785202026} 11/07/2021 03:17:47 - INFO - __main__ - Step 43277: {'lr': 0.0004100256128266747, 'samples': 8309184, 'steps': 43276, 'loss/train': 1.2578943967819214} 11/07/2021 03:17:47 - INFO - __main__ - Step 43278: {'lr': 0.00041002153567432965, 'samples': 8309376, 'steps': 43277, 'loss/train': 1.4881389141082764} 11/07/2021 03:17:49 - INFO - __main__ - Step 43279: {'lr': 0.00041001745844988134, 'samples': 8309568, 'steps': 43278, 'loss/train': 1.335159182548523} 11/07/2021 03:17:49 - INFO - __main__ - Step 43280: {'lr': 0.00041001338115333175, 'samples': 8309760, 'steps': 43279, 'loss/train': 1.672161340713501} 11/07/2021 03:17:49 - INFO - __main__ - Step 43281: {'lr': 0.0004100093037846825, 'samples': 8309952, 'steps': 43280, 'loss/train': 0.9645368456840515} 11/07/2021 03:17:50 - INFO - __main__ - Step 43282: {'lr': 0.0004100052263439355, 'samples': 8310144, 'steps': 43281, 'loss/train': 1.6344153881072998} 11/07/2021 03:17:50 - INFO - __main__ - Step 43283: {'lr': 0.00041000114883109264, 'samples': 8310336, 'steps': 43282, 'loss/train': 1.7180469036102295} 11/07/2021 03:17:51 - INFO - __main__ - Step 43284: {'lr': 0.00040999707124615573, 'samples': 8310528, 'steps': 43283, 'loss/train': 1.3519318103790283} 11/07/2021 03:17:52 - INFO - __main__ - Step 43285: {'lr': 0.00040999299358912664, 'samples': 8310720, 'steps': 43284, 'loss/train': 1.628510594367981} 11/07/2021 03:17:52 - INFO - __main__ - Step 43286: {'lr': 0.00040998891586000716, 'samples': 8310912, 'steps': 43285, 'loss/train': 1.4235750436782837} 11/07/2021 03:17:52 - INFO - __main__ - Step 43287: {'lr': 0.0004099848380587992, 'samples': 8311104, 'steps': 43286, 'loss/train': 1.61976957321167} 11/07/2021 03:17:53 - INFO - __main__ - Step 43288: {'lr': 0.00040998076018550444, 'samples': 8311296, 'steps': 43287, 'loss/train': 1.476173758506775} 11/07/2021 03:17:54 - INFO - __main__ - Step 43289: {'lr': 0.00040997668224012485, 'samples': 8311488, 'steps': 43288, 'loss/train': 1.4979326725006104} 11/07/2021 03:17:54 - INFO - __main__ - Step 43290: {'lr': 0.00040997260422266223, 'samples': 8311680, 'steps': 43289, 'loss/train': 1.5309162139892578} 11/07/2021 03:17:54 - INFO - __main__ - Step 43291: {'lr': 0.00040996852613311844, 'samples': 8311872, 'steps': 43290, 'loss/train': 1.7398695945739746} 11/07/2021 03:17:55 - INFO - __main__ - Step 43292: {'lr': 0.00040996444797149526, 'samples': 8312064, 'steps': 43291, 'loss/train': 1.7311522960662842} 11/07/2021 03:17:55 - INFO - __main__ - Step 43293: {'lr': 0.0004099603697377946, 'samples': 8312256, 'steps': 43292, 'loss/train': 1.2518097162246704} 11/07/2021 03:17:55 - INFO - __main__ - Step 43294: {'lr': 0.0004099562914320183, 'samples': 8312448, 'steps': 43293, 'loss/train': 1.6182531118392944} 11/07/2021 03:17:57 - INFO - __main__ - Step 43295: {'lr': 0.0004099522130541681, 'samples': 8312640, 'steps': 43294, 'loss/train': 1.4488192796707153} 11/07/2021 03:17:57 - INFO - __main__ - Step 43296: {'lr': 0.000409948134604246, 'samples': 8312832, 'steps': 43295, 'loss/train': 1.589702844619751} 11/07/2021 03:17:57 - INFO - __main__ - Step 43297: {'lr': 0.0004099440560822536, 'samples': 8313024, 'steps': 43296, 'loss/train': 1.4996671676635742} 11/07/2021 03:17:58 - INFO - __main__ - Step 43298: {'lr': 0.000409939977488193, 'samples': 8313216, 'steps': 43297, 'loss/train': 2.0168118476867676} 11/07/2021 03:17:58 - INFO - __main__ - Step 43299: {'lr': 0.0004099358988220658, 'samples': 8313408, 'steps': 43298, 'loss/train': 1.5424015522003174} 11/07/2021 03:18:00 - INFO - __main__ - Step 43300: {'lr': 0.00040993182008387406, 'samples': 8313600, 'steps': 43299, 'loss/train': 1.5230618715286255} 11/07/2021 03:18:00 - INFO - __main__ - Step 43301: {'lr': 0.0004099277412736195, 'samples': 8313792, 'steps': 43300, 'loss/train': 0.788213312625885} 11/07/2021 03:18:01 - INFO - __main__ - Step 43302: {'lr': 0.0004099236623913039, 'samples': 8313984, 'steps': 43301, 'loss/train': 1.7703386545181274} 11/07/2021 03:18:01 - INFO - __main__ - Step 43303: {'lr': 0.0004099195834369292, 'samples': 8314176, 'steps': 43302, 'loss/train': 1.5457040071487427} 11/07/2021 03:18:02 - INFO - __main__ - Step 43304: {'lr': 0.0004099155044104972, 'samples': 8314368, 'steps': 43303, 'loss/train': 0.8869584202766418} 11/07/2021 03:18:02 - INFO - __main__ - Step 43305: {'lr': 0.00040991142531200973, 'samples': 8314560, 'steps': 43304, 'loss/train': 1.570747971534729} 11/07/2021 03:18:02 - INFO - __main__ - Step 43306: {'lr': 0.0004099073461414686, 'samples': 8314752, 'steps': 43305, 'loss/train': 1.8002310991287231} 11/07/2021 03:18:03 - INFO - __main__ - Step 43307: {'lr': 0.0004099032668988758, 'samples': 8314944, 'steps': 43306, 'loss/train': 2.0311312675476074} 11/07/2021 03:18:04 - INFO - __main__ - Step 43308: {'lr': 0.00040989918758423306, 'samples': 8315136, 'steps': 43307, 'loss/train': 1.678662896156311} 11/07/2021 03:18:04 - INFO - __main__ - Step 43309: {'lr': 0.0004098951081975421, 'samples': 8315328, 'steps': 43308, 'loss/train': 1.566868782043457} 11/07/2021 03:18:04 - INFO - __main__ - Step 43310: {'lr': 0.0004098910287388049, 'samples': 8315520, 'steps': 43309, 'loss/train': 1.680838942527771} 11/07/2021 03:18:05 - INFO - __main__ - Step 43311: {'lr': 0.00040988694920802326, 'samples': 8315712, 'steps': 43310, 'loss/train': 1.9421578645706177} 11/07/2021 03:18:05 - INFO - __main__ - Step 43312: {'lr': 0.0004098828696051991, 'samples': 8315904, 'steps': 43311, 'loss/train': 0.9963335990905762} 11/07/2021 03:18:06 - INFO - __main__ - Step 43313: {'lr': 0.00040987878993033417, 'samples': 8316096, 'steps': 43312, 'loss/train': 1.7013061046600342} 11/07/2021 03:18:06 - INFO - __main__ - Step 43314: {'lr': 0.0004098747101834303, 'samples': 8316288, 'steps': 43313, 'loss/train': 1.45722234249115} 11/07/2021 03:18:07 - INFO - __main__ - Step 43315: {'lr': 0.00040987063036448934, 'samples': 8316480, 'steps': 43314, 'loss/train': 1.5777240991592407} 11/07/2021 03:18:07 - INFO - __main__ - Step 43316: {'lr': 0.0004098665504735132, 'samples': 8316672, 'steps': 43315, 'loss/train': 1.9435744285583496} 11/07/2021 03:18:07 - INFO - __main__ - Step 43317: {'lr': 0.0004098624705105036, 'samples': 8316864, 'steps': 43316, 'loss/train': 1.5735280513763428} 11/07/2021 03:18:08 - INFO - __main__ - Step 43318: {'lr': 0.00040985839047546243, 'samples': 8317056, 'steps': 43317, 'loss/train': 2.057516574859619} 11/07/2021 03:18:09 - INFO - __main__ - Step 43319: {'lr': 0.00040985431036839155, 'samples': 8317248, 'steps': 43318, 'loss/train': 1.3936246633529663} 11/07/2021 03:18:09 - INFO - __main__ - Step 43320: {'lr': 0.00040985023018929277, 'samples': 8317440, 'steps': 43319, 'loss/train': 1.7613195180892944} 11/07/2021 03:18:09 - INFO - __main__ - Step 43321: {'lr': 0.000409846149938168, 'samples': 8317632, 'steps': 43320, 'loss/train': 1.836496114730835} 11/07/2021 03:18:10 - INFO - __main__ - Step 43322: {'lr': 0.000409842069615019, 'samples': 8317824, 'steps': 43321, 'loss/train': 1.7254339456558228} 11/07/2021 03:18:11 - INFO - __main__ - Step 43323: {'lr': 0.0004098379892198476, 'samples': 8318016, 'steps': 43322, 'loss/train': 1.6206070184707642} 11/07/2021 03:18:11 - INFO - __main__ - Step 43324: {'lr': 0.0004098339087526557, 'samples': 8318208, 'steps': 43323, 'loss/train': 2.0613462924957275} 11/07/2021 03:18:12 - INFO - __main__ - Step 43325: {'lr': 0.00040982982821344505, 'samples': 8318400, 'steps': 43324, 'loss/train': 0.8795347213745117} 11/07/2021 03:18:12 - INFO - __main__ - Step 43326: {'lr': 0.0004098257476022176, 'samples': 8318592, 'steps': 43325, 'loss/train': 1.7156808376312256} 11/07/2021 03:18:12 - INFO - __main__ - Step 43327: {'lr': 0.00040982166691897517, 'samples': 8318784, 'steps': 43326, 'loss/train': 1.533298373222351} 11/07/2021 03:18:13 - INFO - __main__ - Step 43328: {'lr': 0.00040981758616371943, 'samples': 8318976, 'steps': 43327, 'loss/train': 1.332958459854126} 11/07/2021 03:18:14 - INFO - __main__ - Step 43329: {'lr': 0.00040981350533645245, 'samples': 8319168, 'steps': 43328, 'loss/train': 1.8397305011749268} 11/07/2021 03:18:14 - INFO - __main__ - Step 43330: {'lr': 0.00040980942443717596, 'samples': 8319360, 'steps': 43329, 'loss/train': 1.0857611894607544} 11/07/2021 03:18:14 - INFO - __main__ - Step 43331: {'lr': 0.0004098053434658918, 'samples': 8319552, 'steps': 43330, 'loss/train': 1.4920378923416138} 11/07/2021 03:18:15 - INFO - __main__ - Step 43332: {'lr': 0.0004098012624226018, 'samples': 8319744, 'steps': 43331, 'loss/train': 1.2828925848007202} 11/07/2021 03:18:15 - INFO - __main__ - Step 43333: {'lr': 0.00040979718130730786, 'samples': 8319936, 'steps': 43332, 'loss/train': 0.9293913841247559} 11/07/2021 03:18:16 - INFO - __main__ - Step 43334: {'lr': 0.0004097931001200118, 'samples': 8320128, 'steps': 43333, 'loss/train': 1.555694818496704} 11/07/2021 03:18:16 - INFO - __main__ - Step 43335: {'lr': 0.00040978901886071543, 'samples': 8320320, 'steps': 43334, 'loss/train': 1.77414071559906} 11/07/2021 03:18:17 - INFO - __main__ - Step 43336: {'lr': 0.0004097849375294205, 'samples': 8320512, 'steps': 43335, 'loss/train': 1.270373821258545} 11/07/2021 03:18:17 - INFO - __main__ - Step 43337: {'lr': 0.000409780856126129, 'samples': 8320704, 'steps': 43336, 'loss/train': 1.5385404825210571} 11/07/2021 03:18:17 - INFO - __main__ - Step 43338: {'lr': 0.00040977677465084275, 'samples': 8320896, 'steps': 43337, 'loss/train': 1.318015217781067} 11/07/2021 03:18:18 - INFO - __main__ - Step 43339: {'lr': 0.00040977269310356345, 'samples': 8321088, 'steps': 43338, 'loss/train': 1.5474750995635986} 11/07/2021 03:18:19 - INFO - __main__ - Step 43340: {'lr': 0.00040976861148429313, 'samples': 8321280, 'steps': 43339, 'loss/train': 1.669358491897583} 11/07/2021 03:18:19 - INFO - __main__ - Step 43341: {'lr': 0.0004097645297930335, 'samples': 8321472, 'steps': 43340, 'loss/train': 1.5698802471160889} 11/07/2021 03:18:20 - INFO - __main__ - Step 43342: {'lr': 0.00040976044802978645, 'samples': 8321664, 'steps': 43341, 'loss/train': 1.496842622756958} 11/07/2021 03:18:20 - INFO - __main__ - Step 43343: {'lr': 0.0004097563661945538, 'samples': 8321856, 'steps': 43342, 'loss/train': 5.575445652008057} 11/07/2021 03:18:21 - INFO - __main__ - Step 43344: {'lr': 0.0004097522842873374, 'samples': 8322048, 'steps': 43343, 'loss/train': 1.5706019401550293} 11/07/2021 03:18:21 - INFO - __main__ - Step 43345: {'lr': 0.0004097482023081391, 'samples': 8322240, 'steps': 43344, 'loss/train': 1.1677210330963135} 11/07/2021 03:18:22 - INFO - __main__ - Step 43346: {'lr': 0.00040974412025696067, 'samples': 8322432, 'steps': 43345, 'loss/train': 1.5173113346099854} 11/07/2021 03:18:22 - INFO - __main__ - Step 43347: {'lr': 0.0004097400381338041, 'samples': 8322624, 'steps': 43346, 'loss/train': 1.7230134010314941} 11/07/2021 03:18:22 - INFO - __main__ - Step 43348: {'lr': 0.0004097359559386711, 'samples': 8322816, 'steps': 43347, 'loss/train': 1.76523756980896} 11/07/2021 03:18:23 - INFO - __main__ - Step 43349: {'lr': 0.0004097318736715635, 'samples': 8323008, 'steps': 43348, 'loss/train': 0.9458571076393127} 11/07/2021 03:18:24 - INFO - __main__ - Step 43350: {'lr': 0.0004097277913324832, 'samples': 8323200, 'steps': 43349, 'loss/train': 1.1856716871261597} 11/07/2021 03:18:24 - INFO - __main__ - Step 43351: {'lr': 0.000409723708921432, 'samples': 8323392, 'steps': 43350, 'loss/train': 2.379688024520874} 11/07/2021 03:18:24 - INFO - __main__ - Step 43352: {'lr': 0.0004097196264384118, 'samples': 8323584, 'steps': 43351, 'loss/train': 1.0327445268630981} 11/07/2021 03:18:25 - INFO - __main__ - Step 43353: {'lr': 0.00040971554388342436, 'samples': 8323776, 'steps': 43352, 'loss/train': 1.4225362539291382} 11/07/2021 03:18:26 - INFO - __main__ - Step 43354: {'lr': 0.00040971146125647165, 'samples': 8323968, 'steps': 43353, 'loss/train': 1.3241119384765625} 11/07/2021 03:18:26 - INFO - __main__ - Step 43355: {'lr': 0.00040970737855755535, 'samples': 8324160, 'steps': 43354, 'loss/train': 1.253584384918213} 11/07/2021 03:18:26 - INFO - __main__ - Step 43356: {'lr': 0.00040970329578667735, 'samples': 8324352, 'steps': 43355, 'loss/train': 1.532841444015503} 11/07/2021 03:18:27 - INFO - __main__ - Step 43357: {'lr': 0.00040969921294383956, 'samples': 8324544, 'steps': 43356, 'loss/train': 0.44970759749412537} 11/07/2021 03:18:27 - INFO - __main__ - Step 43358: {'lr': 0.00040969513002904375, 'samples': 8324736, 'steps': 43357, 'loss/train': 1.4711352586746216} 11/07/2021 03:18:27 - INFO - __main__ - Step 43359: {'lr': 0.0004096910470422918, 'samples': 8324928, 'steps': 43358, 'loss/train': 0.8249011635780334} 11/07/2021 03:18:29 - INFO - __main__ - Step 43360: {'lr': 0.0004096869639835855, 'samples': 8325120, 'steps': 43359, 'loss/train': 1.2207739353179932} 11/07/2021 03:18:29 - INFO - __main__ - Step 43361: {'lr': 0.0004096828808529267, 'samples': 8325312, 'steps': 43360, 'loss/train': 1.2354450225830078} 11/07/2021 03:18:29 - INFO - __main__ - Step 43362: {'lr': 0.0004096787976503173, 'samples': 8325504, 'steps': 43361, 'loss/train': 1.9712218046188354} 11/07/2021 03:18:30 - INFO - __main__ - Step 43363: {'lr': 0.0004096747143757591, 'samples': 8325696, 'steps': 43362, 'loss/train': 0.833960771560669} 11/07/2021 03:18:30 - INFO - __main__ - Step 43364: {'lr': 0.0004096706310292539, 'samples': 8325888, 'steps': 43363, 'loss/train': 1.145098328590393} 11/07/2021 03:18:31 - INFO - __main__ - Step 43365: {'lr': 0.0004096665476108036, 'samples': 8326080, 'steps': 43364, 'loss/train': 1.7072386741638184} 11/07/2021 03:18:32 - INFO - __main__ - Step 43366: {'lr': 0.00040966246412040995, 'samples': 8326272, 'steps': 43365, 'loss/train': 1.7308281660079956} 11/07/2021 03:18:32 - INFO - __main__ - Step 43367: {'lr': 0.00040965838055807493, 'samples': 8326464, 'steps': 43366, 'loss/train': 1.5208653211593628} 11/07/2021 03:18:32 - INFO - __main__ - Step 43368: {'lr': 0.00040965429692380034, 'samples': 8326656, 'steps': 43367, 'loss/train': 1.3145965337753296} 11/07/2021 03:18:33 - INFO - __main__ - Step 43369: {'lr': 0.00040965021321758796, 'samples': 8326848, 'steps': 43368, 'loss/train': 1.3993343114852905} 11/07/2021 03:18:34 - INFO - __main__ - Step 43370: {'lr': 0.00040964612943943964, 'samples': 8327040, 'steps': 43369, 'loss/train': 1.4707213640213013} 11/07/2021 03:18:34 - INFO - __main__ - Step 43371: {'lr': 0.00040964204558935726, 'samples': 8327232, 'steps': 43370, 'loss/train': 1.248171329498291} 11/07/2021 03:18:34 - INFO - __main__ - Step 43372: {'lr': 0.00040963796166734257, 'samples': 8327424, 'steps': 43371, 'loss/train': 1.6246415376663208} 11/07/2021 03:18:35 - INFO - __main__ - Step 43373: {'lr': 0.00040963387767339757, 'samples': 8327616, 'steps': 43372, 'loss/train': 1.0944591760635376} 11/07/2021 03:18:35 - INFO - __main__ - Step 43374: {'lr': 0.00040962979360752394, 'samples': 8327808, 'steps': 43373, 'loss/train': 1.5124233961105347} 11/07/2021 03:18:36 - INFO - __main__ - Step 43375: {'lr': 0.0004096257094697236, 'samples': 8328000, 'steps': 43374, 'loss/train': 1.634065866470337} 11/07/2021 03:18:36 - INFO - __main__ - Step 43376: {'lr': 0.00040962162525999833, 'samples': 8328192, 'steps': 43375, 'loss/train': 0.593380868434906} 11/07/2021 03:18:37 - INFO - __main__ - Step 43377: {'lr': 0.00040961754097835015, 'samples': 8328384, 'steps': 43376, 'loss/train': 1.536962866783142} 11/07/2021 03:18:37 - INFO - __main__ - Step 43378: {'lr': 0.00040961345662478065, 'samples': 8328576, 'steps': 43377, 'loss/train': 1.2107462882995605} 11/07/2021 03:18:37 - INFO - __main__ - Step 43379: {'lr': 0.00040960937219929186, 'samples': 8328768, 'steps': 43378, 'loss/train': 1.471487045288086} 11/07/2021 03:18:38 - INFO - __main__ - Step 43380: {'lr': 0.00040960528770188554, 'samples': 8328960, 'steps': 43379, 'loss/train': 1.4032756090164185} 11/07/2021 03:18:39 - INFO - __main__ - Step 43381: {'lr': 0.00040960120313256356, 'samples': 8329152, 'steps': 43380, 'loss/train': 1.7656457424163818} 11/07/2021 03:18:39 - INFO - __main__ - Step 43382: {'lr': 0.0004095971184913277, 'samples': 8329344, 'steps': 43381, 'loss/train': 1.5677131414413452} 11/07/2021 03:18:39 - INFO - __main__ - Step 43383: {'lr': 0.0004095930337781798, 'samples': 8329536, 'steps': 43382, 'loss/train': 0.2914910912513733} 11/07/2021 03:18:40 - INFO - __main__ - Step 43384: {'lr': 0.00040958894899312183, 'samples': 8329728, 'steps': 43383, 'loss/train': 1.640594244003296} 11/07/2021 03:18:40 - INFO - __main__ - Step 43385: {'lr': 0.0004095848641361555, 'samples': 8329920, 'steps': 43384, 'loss/train': 1.6570206880569458} 11/07/2021 03:18:41 - INFO - __main__ - Step 43386: {'lr': 0.0004095807792072827, 'samples': 8330112, 'steps': 43385, 'loss/train': 0.9187992811203003} 11/07/2021 03:18:42 - INFO - __main__ - Step 43387: {'lr': 0.00040957669420650525, 'samples': 8330304, 'steps': 43386, 'loss/train': 1.5217148065567017} 11/07/2021 03:18:42 - INFO - __main__ - Step 43388: {'lr': 0.000409572609133825, 'samples': 8330496, 'steps': 43387, 'loss/train': 1.2924803495407104} 11/07/2021 03:18:42 - INFO - __main__ - Step 43389: {'lr': 0.00040956852398924383, 'samples': 8330688, 'steps': 43388, 'loss/train': 1.3369059562683105} 11/07/2021 03:18:43 - INFO - __main__ - Step 43390: {'lr': 0.0004095644387727635, 'samples': 8330880, 'steps': 43389, 'loss/train': 1.7199703454971313} 11/07/2021 03:18:44 - INFO - __main__ - Step 43391: {'lr': 0.0004095603534843859, 'samples': 8331072, 'steps': 43390, 'loss/train': 2.3008205890655518} 11/07/2021 03:18:44 - INFO - __main__ - Step 43392: {'lr': 0.00040955626812411297, 'samples': 8331264, 'steps': 43391, 'loss/train': 1.1952695846557617} 11/07/2021 03:18:44 - INFO - __main__ - Step 43393: {'lr': 0.0004095521826919463, 'samples': 8331456, 'steps': 43392, 'loss/train': 1.5825469493865967} 11/07/2021 03:18:45 - INFO - __main__ - Step 43394: {'lr': 0.0004095480971878879, 'samples': 8331648, 'steps': 43393, 'loss/train': 1.059404730796814} 11/07/2021 03:18:45 - INFO - __main__ - Step 43395: {'lr': 0.0004095440116119397, 'samples': 8331840, 'steps': 43394, 'loss/train': 1.9554314613342285} 11/07/2021 03:18:46 - INFO - __main__ - Step 43396: {'lr': 0.00040953992596410335, 'samples': 8332032, 'steps': 43395, 'loss/train': 1.4320061206817627} 11/07/2021 03:18:46 - INFO - __main__ - Step 43397: {'lr': 0.0004095358402443808, 'samples': 8332224, 'steps': 43396, 'loss/train': 0.6256570219993591} 11/07/2021 03:18:47 - INFO - __main__ - Step 43398: {'lr': 0.0004095317544527738, 'samples': 8332416, 'steps': 43397, 'loss/train': 1.4908761978149414} 11/07/2021 03:18:47 - INFO - __main__ - Step 43399: {'lr': 0.00040952766858928433, 'samples': 8332608, 'steps': 43398, 'loss/train': 1.5086199045181274} 11/07/2021 03:18:47 - INFO - __main__ - Step 43400: {'lr': 0.0004095235826539141, 'samples': 8332800, 'steps': 43399, 'loss/train': 1.3647236824035645} 11/07/2021 03:18:49 - INFO - __main__ - Step 43401: {'lr': 0.00040951949664666504, 'samples': 8332992, 'steps': 43400, 'loss/train': 1.3735185861587524} 11/07/2021 03:18:49 - INFO - __main__ - Step 43402: {'lr': 0.00040951541056753895, 'samples': 8333184, 'steps': 43401, 'loss/train': 1.4975714683532715} 11/07/2021 03:18:50 - INFO - __main__ - Step 43403: {'lr': 0.00040951132441653773, 'samples': 8333376, 'steps': 43402, 'loss/train': 1.4579033851623535} 11/07/2021 03:18:50 - INFO - __main__ - Step 43404: {'lr': 0.00040950723819366307, 'samples': 8333568, 'steps': 43403, 'loss/train': 1.461193323135376} 11/07/2021 03:18:50 - INFO - __main__ - Step 43405: {'lr': 0.000409503151898917, 'samples': 8333760, 'steps': 43404, 'loss/train': 0.5103493928909302} 11/07/2021 03:18:51 - INFO - __main__ - Step 43406: {'lr': 0.0004094990655323012, 'samples': 8333952, 'steps': 43405, 'loss/train': 0.9520642161369324} 11/07/2021 03:18:52 - INFO - __main__ - Step 43407: {'lr': 0.00040949497909381757, 'samples': 8334144, 'steps': 43406, 'loss/train': 1.6208157539367676} 11/07/2021 03:18:52 - INFO - __main__ - Step 43408: {'lr': 0.000409490892583468, 'samples': 8334336, 'steps': 43407, 'loss/train': 1.2611219882965088} 11/07/2021 03:18:52 - INFO - __main__ - Step 43409: {'lr': 0.0004094868060012543, 'samples': 8334528, 'steps': 43408, 'loss/train': 1.7201714515686035} 11/07/2021 03:18:53 - INFO - __main__ - Step 43410: {'lr': 0.0004094827193471783, 'samples': 8334720, 'steps': 43409, 'loss/train': 1.157602071762085} 11/07/2021 03:18:53 - INFO - __main__ - Step 43411: {'lr': 0.00040947863262124186, 'samples': 8334912, 'steps': 43410, 'loss/train': 1.817247748374939} 11/07/2021 03:18:54 - INFO - __main__ - Step 43412: {'lr': 0.0004094745458234468, 'samples': 8335104, 'steps': 43411, 'loss/train': 1.3322869539260864} 11/07/2021 03:18:55 - INFO - __main__ - Step 43413: {'lr': 0.00040947045895379494, 'samples': 8335296, 'steps': 43412, 'loss/train': 1.4618539810180664} 11/07/2021 03:18:55 - INFO - __main__ - Step 43414: {'lr': 0.00040946637201228815, 'samples': 8335488, 'steps': 43413, 'loss/train': 0.8857395052909851} 11/07/2021 03:18:55 - INFO - __main__ - Step 43415: {'lr': 0.00040946228499892835, 'samples': 8335680, 'steps': 43414, 'loss/train': 0.5324795246124268} 11/07/2021 03:18:56 - INFO - __main__ - Step 43416: {'lr': 0.0004094581979137172, 'samples': 8335872, 'steps': 43415, 'loss/train': 1.338943362236023} 11/07/2021 03:18:57 - INFO - __main__ - Step 43417: {'lr': 0.00040945411075665674, 'samples': 8336064, 'steps': 43416, 'loss/train': 1.0641642808914185} 11/07/2021 03:18:57 - INFO - __main__ - Step 43418: {'lr': 0.0004094500235277486, 'samples': 8336256, 'steps': 43417, 'loss/train': 1.5129536390304565} 11/07/2021 03:18:57 - INFO - __main__ - Step 43419: {'lr': 0.0004094459362269949, 'samples': 8336448, 'steps': 43418, 'loss/train': 1.694779396057129} 11/07/2021 03:18:58 - INFO - __main__ - Step 43420: {'lr': 0.0004094418488543972, 'samples': 8336640, 'steps': 43419, 'loss/train': 1.850289225578308} 11/07/2021 03:18:58 - INFO - __main__ - Step 43421: {'lr': 0.00040943776140995756, 'samples': 8336832, 'steps': 43420, 'loss/train': 1.336556077003479} 11/07/2021 03:18:58 - INFO - __main__ - Step 43422: {'lr': 0.0004094336738936777, 'samples': 8337024, 'steps': 43421, 'loss/train': 1.7144724130630493} 11/07/2021 03:18:59 - INFO - __main__ - Step 43423: {'lr': 0.0004094295863055594, 'samples': 8337216, 'steps': 43422, 'loss/train': 1.6323271989822388} 11/07/2021 03:19:00 - INFO - __main__ - Step 43424: {'lr': 0.0004094254986456046, 'samples': 8337408, 'steps': 43423, 'loss/train': 1.2456058263778687} 11/07/2021 03:19:00 - INFO - __main__ - Step 43425: {'lr': 0.0004094214109138152, 'samples': 8337600, 'steps': 43424, 'loss/train': 1.344637155532837} 11/07/2021 03:19:00 - INFO - __main__ - Step 43426: {'lr': 0.000409417323110193, 'samples': 8337792, 'steps': 43425, 'loss/train': 1.5101417303085327} 11/07/2021 03:19:01 - INFO - __main__ - Step 43427: {'lr': 0.00040941323523473975, 'samples': 8337984, 'steps': 43426, 'loss/train': 1.7929035425186157} 11/07/2021 03:19:02 - INFO - __main__ - Step 43428: {'lr': 0.00040940914728745736, 'samples': 8338176, 'steps': 43427, 'loss/train': 0.9796424508094788} 11/07/2021 03:19:02 - INFO - __main__ - Step 43429: {'lr': 0.0004094050592683477, 'samples': 8338368, 'steps': 43428, 'loss/train': 1.762089729309082} 11/07/2021 03:19:03 - INFO - __main__ - Step 43430: {'lr': 0.00040940097117741255, 'samples': 8338560, 'steps': 43429, 'loss/train': 1.084628939628601} 11/07/2021 03:19:03 - INFO - __main__ - Step 43431: {'lr': 0.00040939688301465377, 'samples': 8338752, 'steps': 43430, 'loss/train': 1.2713582515716553} 11/07/2021 03:19:03 - INFO - __main__ - Step 43432: {'lr': 0.0004093927947800732, 'samples': 8338944, 'steps': 43431, 'loss/train': 1.6452714204788208} 11/07/2021 03:19:04 - INFO - __main__ - Step 43433: {'lr': 0.00040938870647367275, 'samples': 8339136, 'steps': 43432, 'loss/train': 1.3768030405044556} 11/07/2021 03:19:05 - INFO - __main__ - Step 43434: {'lr': 0.0004093846180954542, 'samples': 8339328, 'steps': 43433, 'loss/train': 1.5188106298446655} 11/07/2021 03:19:05 - INFO - __main__ - Step 43435: {'lr': 0.00040938052964541936, 'samples': 8339520, 'steps': 43434, 'loss/train': 1.2342990636825562} 11/07/2021 03:19:05 - INFO - __main__ - Step 43436: {'lr': 0.0004093764411235702, 'samples': 8339712, 'steps': 43435, 'loss/train': 1.8071941137313843} 11/07/2021 03:19:06 - INFO - __main__ - Step 43437: {'lr': 0.00040937235252990834, 'samples': 8339904, 'steps': 43436, 'loss/train': 1.6887180805206299} 11/07/2021 03:19:07 - INFO - __main__ - Step 43438: {'lr': 0.00040936826386443585, 'samples': 8340096, 'steps': 43437, 'loss/train': 1.5172128677368164} 11/07/2021 03:19:07 - INFO - __main__ - Step 43439: {'lr': 0.00040936417512715454, 'samples': 8340288, 'steps': 43438, 'loss/train': 1.392961025238037} 11/07/2021 03:19:07 - INFO - __main__ - Step 43440: {'lr': 0.00040936008631806603, 'samples': 8340480, 'steps': 43439, 'loss/train': 1.736971378326416} 11/07/2021 03:19:08 - INFO - __main__ - Step 43441: {'lr': 0.00040935599743717243, 'samples': 8340672, 'steps': 43440, 'loss/train': 1.6567108631134033} 11/07/2021 03:19:08 - INFO - __main__ - Step 43442: {'lr': 0.00040935190848447544, 'samples': 8340864, 'steps': 43441, 'loss/train': 1.3322232961654663} 11/07/2021 03:19:09 - INFO - __main__ - Step 43443: {'lr': 0.000409347819459977, 'samples': 8341056, 'steps': 43442, 'loss/train': 1.6457480192184448} 11/07/2021 03:19:09 - INFO - __main__ - Step 43444: {'lr': 0.0004093437303636788, 'samples': 8341248, 'steps': 43443, 'loss/train': 1.3809614181518555} 11/07/2021 03:19:10 - INFO - __main__ - Step 43445: {'lr': 0.0004093396411955829, 'samples': 8341440, 'steps': 43444, 'loss/train': 1.338553547859192} 11/07/2021 03:19:10 - INFO - __main__ - Step 43446: {'lr': 0.0004093355519556908, 'samples': 8341632, 'steps': 43445, 'loss/train': 1.2826430797576904} 11/07/2021 03:19:10 - INFO - __main__ - Step 43447: {'lr': 0.0004093314626440048, 'samples': 8341824, 'steps': 43446, 'loss/train': 1.4386062622070312} 11/07/2021 03:19:11 - INFO - __main__ - Step 43448: {'lr': 0.0004093273732605264, 'samples': 8342016, 'steps': 43447, 'loss/train': 1.561930775642395} 11/07/2021 03:19:12 - INFO - __main__ - Step 43449: {'lr': 0.0004093232838052575, 'samples': 8342208, 'steps': 43448, 'loss/train': 1.3393933773040771} 11/07/2021 03:19:12 - INFO - __main__ - Step 43450: {'lr': 0.0004093191942782001, 'samples': 8342400, 'steps': 43449, 'loss/train': 1.4493123292922974} 11/07/2021 03:19:13 - INFO - __main__ - Step 43451: {'lr': 0.0004093151046793558, 'samples': 8342592, 'steps': 43450, 'loss/train': 4.328895568847656} 11/07/2021 03:19:13 - INFO - __main__ - Step 43452: {'lr': 0.00040931101500872656, 'samples': 8342784, 'steps': 43451, 'loss/train': 1.560447096824646} 11/07/2021 03:19:14 - INFO - __main__ - Step 43453: {'lr': 0.00040930692526631443, 'samples': 8342976, 'steps': 43452, 'loss/train': 1.3717985153198242} 11/07/2021 03:19:14 - INFO - __main__ - Step 43454: {'lr': 0.0004093028354521209, 'samples': 8343168, 'steps': 43453, 'loss/train': 1.4804021120071411} 11/07/2021 03:19:15 - INFO - __main__ - Step 43455: {'lr': 0.000409298745566148, 'samples': 8343360, 'steps': 43454, 'loss/train': 1.7922828197479248} 11/07/2021 03:19:15 - INFO - __main__ - Step 43456: {'lr': 0.00040929465560839753, 'samples': 8343552, 'steps': 43455, 'loss/train': 1.7526626586914062} 11/07/2021 03:19:15 - INFO - __main__ - Step 43457: {'lr': 0.00040929056557887137, 'samples': 8343744, 'steps': 43456, 'loss/train': 1.7522269487380981} 11/07/2021 03:19:16 - INFO - __main__ - Step 43458: {'lr': 0.0004092864754775713, 'samples': 8343936, 'steps': 43457, 'loss/train': 1.1358518600463867} 11/07/2021 03:19:17 - INFO - __main__ - Step 43459: {'lr': 0.00040928238530449926, 'samples': 8344128, 'steps': 43458, 'loss/train': 1.2265043258666992} 11/07/2021 03:19:17 - INFO - __main__ - Step 43460: {'lr': 0.00040927829505965694, 'samples': 8344320, 'steps': 43459, 'loss/train': 1.6911441087722778} 11/07/2021 03:19:17 - INFO - __main__ - Step 43461: {'lr': 0.00040927420474304646, 'samples': 8344512, 'steps': 43460, 'loss/train': 0.21975359320640564} 11/07/2021 03:19:18 - INFO - __main__ - Step 43462: {'lr': 0.00040927011435466933, 'samples': 8344704, 'steps': 43461, 'loss/train': 1.3408634662628174} 11/07/2021 03:19:18 - INFO - __main__ - Step 43463: {'lr': 0.0004092660238945276, 'samples': 8344896, 'steps': 43462, 'loss/train': 1.6607204675674438} 11/07/2021 03:19:19 - INFO - __main__ - Step 43464: {'lr': 0.00040926193336262304, 'samples': 8345088, 'steps': 43463, 'loss/train': 1.8812296390533447} 11/07/2021 03:19:20 - INFO - __main__ - Step 43465: {'lr': 0.0004092578427589575, 'samples': 8345280, 'steps': 43464, 'loss/train': 1.7003211975097656} 11/07/2021 03:19:20 - INFO - __main__ - Step 43466: {'lr': 0.0004092537520835328, 'samples': 8345472, 'steps': 43465, 'loss/train': 1.2302883863449097} 11/07/2021 03:19:20 - INFO - __main__ - Step 43467: {'lr': 0.0004092496613363509, 'samples': 8345664, 'steps': 43466, 'loss/train': 1.0020233392715454} 11/07/2021 03:19:21 - INFO - __main__ - Step 43468: {'lr': 0.0004092455705174135, 'samples': 8345856, 'steps': 43467, 'loss/train': 1.6291698217391968} 11/07/2021 03:19:22 - INFO - __main__ - Step 43469: {'lr': 0.00040924147962672253, 'samples': 8346048, 'steps': 43468, 'loss/train': 3.498680353164673} 11/07/2021 03:19:22 - INFO - __main__ - Step 43470: {'lr': 0.00040923738866427986, 'samples': 8346240, 'steps': 43469, 'loss/train': 1.2424981594085693} 11/07/2021 03:19:23 - INFO - __main__ - Step 43471: {'lr': 0.00040923329763008714, 'samples': 8346432, 'steps': 43470, 'loss/train': 1.70469331741333} 11/07/2021 03:19:23 - INFO - __main__ - Step 43472: {'lr': 0.0004092292065241464, 'samples': 8346624, 'steps': 43471, 'loss/train': 1.1266168355941772} 11/07/2021 03:19:23 - INFO - __main__ - Step 43473: {'lr': 0.00040922511534645953, 'samples': 8346816, 'steps': 43472, 'loss/train': 1.3175348043441772} 11/07/2021 03:19:24 - INFO - __main__ - Step 43474: {'lr': 0.0004092210240970282, 'samples': 8347008, 'steps': 43473, 'loss/train': 1.4208704233169556} 11/07/2021 03:19:25 - INFO - __main__ - Step 43475: {'lr': 0.0004092169327758544, 'samples': 8347200, 'steps': 43474, 'loss/train': 1.222894549369812} 11/07/2021 03:19:25 - INFO - __main__ - Step 43476: {'lr': 0.0004092128413829398, 'samples': 8347392, 'steps': 43475, 'loss/train': 1.2349470853805542} 11/07/2021 03:19:25 - INFO - __main__ - Step 43477: {'lr': 0.0004092087499182864, 'samples': 8347584, 'steps': 43476, 'loss/train': 1.730302333831787} 11/07/2021 03:19:26 - INFO - __main__ - Step 43478: {'lr': 0.000409204658381896, 'samples': 8347776, 'steps': 43477, 'loss/train': 1.6098095178604126} 11/07/2021 03:19:26 - INFO - __main__ - Step 43479: {'lr': 0.00040920056677377047, 'samples': 8347968, 'steps': 43478, 'loss/train': 1.6416910886764526} 11/07/2021 03:19:27 - INFO - __main__ - Step 43480: {'lr': 0.00040919647509391155, 'samples': 8348160, 'steps': 43479, 'loss/train': 1.114418387413025} 11/07/2021 03:19:28 - INFO - __main__ - Step 43481: {'lr': 0.0004091923833423212, 'samples': 8348352, 'steps': 43480, 'loss/train': 1.4143720865249634} 11/07/2021 03:19:28 - INFO - __main__ - Step 43482: {'lr': 0.00040918829151900127, 'samples': 8348544, 'steps': 43481, 'loss/train': 1.4186195135116577} 11/07/2021 03:19:28 - INFO - __main__ - Step 43483: {'lr': 0.0004091841996239535, 'samples': 8348736, 'steps': 43482, 'loss/train': 1.563791275024414} 11/07/2021 03:19:29 - INFO - __main__ - Step 43484: {'lr': 0.00040918010765717976, 'samples': 8348928, 'steps': 43483, 'loss/train': 1.2986820936203003} 11/07/2021 03:19:30 - INFO - __main__ - Step 43485: {'lr': 0.00040917601561868194, 'samples': 8349120, 'steps': 43484, 'loss/train': 1.6048474311828613} 11/07/2021 03:19:30 - INFO - __main__ - Step 43486: {'lr': 0.00040917192350846187, 'samples': 8349312, 'steps': 43485, 'loss/train': 1.2884235382080078} 11/07/2021 03:19:31 - INFO - __main__ - Step 43487: {'lr': 0.00040916783132652134, 'samples': 8349504, 'steps': 43486, 'loss/train': 1.8406164646148682} 11/07/2021 03:19:31 - INFO - __main__ - Step 43488: {'lr': 0.0004091637390728623, 'samples': 8349696, 'steps': 43487, 'loss/train': 1.7488003969192505} 11/07/2021 03:19:31 - INFO - __main__ - Step 43489: {'lr': 0.00040915964674748665, 'samples': 8349888, 'steps': 43488, 'loss/train': 1.1958898305892944} 11/07/2021 03:19:32 - INFO - __main__ - Step 43490: {'lr': 0.0004091555543503959, 'samples': 8350080, 'steps': 43489, 'loss/train': 1.5925389528274536} 11/07/2021 03:19:33 - INFO - __main__ - Step 43491: {'lr': 0.00040915146188159223, 'samples': 8350272, 'steps': 43490, 'loss/train': 0.8500160574913025} 11/07/2021 03:19:33 - INFO - __main__ - Step 43492: {'lr': 0.0004091473693410773, 'samples': 8350464, 'steps': 43491, 'loss/train': 1.240749716758728} 11/07/2021 03:19:33 - INFO - __main__ - Step 43493: {'lr': 0.0004091432767288531, 'samples': 8350656, 'steps': 43492, 'loss/train': 1.4671683311462402} 11/07/2021 03:19:34 - INFO - __main__ - Step 43494: {'lr': 0.0004091391840449213, 'samples': 8350848, 'steps': 43493, 'loss/train': 1.5638720989227295} 11/07/2021 03:19:34 - INFO - __main__ - Step 43495: {'lr': 0.00040913509128928394, 'samples': 8351040, 'steps': 43494, 'loss/train': 1.8460904359817505} 11/07/2021 03:19:35 - INFO - __main__ - Step 43496: {'lr': 0.00040913099846194274, 'samples': 8351232, 'steps': 43495, 'loss/train': 1.6968833208084106} 11/07/2021 03:19:35 - INFO - __main__ - Step 43497: {'lr': 0.00040912690556289957, 'samples': 8351424, 'steps': 43496, 'loss/train': 0.21595387160778046} 11/07/2021 03:19:36 - INFO - __main__ - Step 43498: {'lr': 0.0004091228125921562, 'samples': 8351616, 'steps': 43497, 'loss/train': 1.8241119384765625} 11/07/2021 03:19:36 - INFO - __main__ - Step 43499: {'lr': 0.0004091187195497146, 'samples': 8351808, 'steps': 43498, 'loss/train': 2.1405017375946045} 11/07/2021 03:19:36 - INFO - __main__ - Step 43500: {'lr': 0.00040911462643557656, 'samples': 8352000, 'steps': 43499, 'loss/train': 1.618061900138855} 11/07/2021 03:19:37 - INFO - __main__ - Step 43501: {'lr': 0.0004091105332497439, 'samples': 8352192, 'steps': 43500, 'loss/train': 1.3820098638534546} 11/07/2021 03:19:38 - INFO - __main__ - Step 43502: {'lr': 0.0004091064399922185, 'samples': 8352384, 'steps': 43501, 'loss/train': 1.3085178136825562} 11/07/2021 03:19:38 - INFO - __main__ - Step 43503: {'lr': 0.0004091023466630023, 'samples': 8352576, 'steps': 43502, 'loss/train': 1.8581750392913818} 11/07/2021 03:19:38 - INFO - __main__ - Step 43504: {'lr': 0.00040909825326209694, 'samples': 8352768, 'steps': 43503, 'loss/train': 1.267033576965332} 11/07/2021 03:19:39 - INFO - __main__ - Step 43505: {'lr': 0.0004090941597895043, 'samples': 8352960, 'steps': 43504, 'loss/train': 1.826012134552002} 11/07/2021 03:19:40 - INFO - __main__ - Step 43506: {'lr': 0.0004090900662452264, 'samples': 8353152, 'steps': 43505, 'loss/train': 1.538798451423645} 11/07/2021 03:19:40 - INFO - __main__ - Step 43507: {'lr': 0.00040908597262926484, 'samples': 8353344, 'steps': 43506, 'loss/train': 1.5585148334503174} 11/07/2021 03:19:41 - INFO - __main__ - Step 43508: {'lr': 0.0004090818789416217, 'samples': 8353536, 'steps': 43507, 'loss/train': 1.1718560457229614} 11/07/2021 03:19:41 - INFO - __main__ - Step 43509: {'lr': 0.0004090777851822988, 'samples': 8353728, 'steps': 43508, 'loss/train': 1.2051889896392822} 11/07/2021 03:19:41 - INFO - __main__ - Step 43510: {'lr': 0.0004090736913512977, 'samples': 8353920, 'steps': 43509, 'loss/train': 1.932144284248352} 11/07/2021 03:19:42 - INFO - __main__ - Step 43511: {'lr': 0.0004090695974486206, 'samples': 8354112, 'steps': 43510, 'loss/train': 1.5004621744155884} 11/07/2021 03:19:43 - INFO - __main__ - Step 43512: {'lr': 0.00040906550347426907, 'samples': 8354304, 'steps': 43511, 'loss/train': 0.28656384348869324} 11/07/2021 03:19:43 - INFO - __main__ - Step 43513: {'lr': 0.0004090614094282452, 'samples': 8354496, 'steps': 43512, 'loss/train': 1.5482137203216553} 11/07/2021 03:19:43 - INFO - __main__ - Step 43514: {'lr': 0.00040905731531055067, 'samples': 8354688, 'steps': 43513, 'loss/train': 1.636155366897583} 11/07/2021 03:19:44 - INFO - __main__ - Step 43515: {'lr': 0.0004090532211211874, 'samples': 8354880, 'steps': 43514, 'loss/train': 1.1567301750183105} 11/07/2021 03:19:44 - INFO - __main__ - Step 43516: {'lr': 0.0004090491268601572, 'samples': 8355072, 'steps': 43515, 'loss/train': 1.7579503059387207} 11/07/2021 03:19:45 - INFO - __main__ - Step 43517: {'lr': 0.0004090450325274618, 'samples': 8355264, 'steps': 43516, 'loss/train': 1.4337153434753418} 11/07/2021 03:19:45 - INFO - __main__ - Step 43518: {'lr': 0.0004090409381231033, 'samples': 8355456, 'steps': 43517, 'loss/train': 0.9164023399353027} 11/07/2021 03:19:46 - INFO - __main__ - Step 43519: {'lr': 0.0004090368436470833, 'samples': 8355648, 'steps': 43518, 'loss/train': 1.3550169467926025} 11/07/2021 03:19:46 - INFO - __main__ - Step 43520: {'lr': 0.0004090327490994038, 'samples': 8355840, 'steps': 43519, 'loss/train': 1.3841397762298584} 11/07/2021 03:19:47 - INFO - __main__ - Step 43521: {'lr': 0.00040902865448006663, 'samples': 8356032, 'steps': 43520, 'loss/train': 1.8165466785430908} 11/07/2021 03:19:48 - INFO - __main__ - Step 43522: {'lr': 0.0004090245597890736, 'samples': 8356224, 'steps': 43521, 'loss/train': 1.598728895187378} 11/07/2021 03:19:48 - INFO - __main__ - Step 43523: {'lr': 0.00040902046502642656, 'samples': 8356416, 'steps': 43522, 'loss/train': 1.8639800548553467} 11/07/2021 03:19:48 - INFO - __main__ - Step 43524: {'lr': 0.0004090163701921273, 'samples': 8356608, 'steps': 43523, 'loss/train': 1.2822519540786743} 11/07/2021 03:19:49 - INFO - __main__ - Step 43525: {'lr': 0.0004090122752861777, 'samples': 8356800, 'steps': 43524, 'loss/train': 1.387771725654602} 11/07/2021 03:19:49 - INFO - __main__ - Step 43526: {'lr': 0.0004090081803085797, 'samples': 8356992, 'steps': 43525, 'loss/train': 1.2362226247787476} 11/07/2021 03:19:50 - INFO - __main__ - Step 43527: {'lr': 0.00040900408525933505, 'samples': 8357184, 'steps': 43526, 'loss/train': 1.8942064046859741} 11/07/2021 03:19:51 - INFO - __main__ - Step 43528: {'lr': 0.0004089999901384456, 'samples': 8357376, 'steps': 43527, 'loss/train': 1.1758595705032349} 11/07/2021 03:19:51 - INFO - __main__ - Step 43529: {'lr': 0.00040899589494591316, 'samples': 8357568, 'steps': 43528, 'loss/train': 1.031545639038086} 11/07/2021 03:19:51 - INFO - __main__ - Step 43530: {'lr': 0.0004089917996817397, 'samples': 8357760, 'steps': 43529, 'loss/train': 1.3493149280548096} 11/07/2021 03:19:52 - INFO - __main__ - Step 43531: {'lr': 0.00040898770434592694, 'samples': 8357952, 'steps': 43530, 'loss/train': 1.3495090007781982} 11/07/2021 03:19:53 - INFO - __main__ - Step 43532: {'lr': 0.0004089836089384768, 'samples': 8358144, 'steps': 43531, 'loss/train': 1.5462332963943481} 11/07/2021 03:19:53 - INFO - __main__ - Step 43533: {'lr': 0.0004089795134593911, 'samples': 8358336, 'steps': 43532, 'loss/train': 1.2180896997451782} 11/07/2021 03:19:54 - INFO - __main__ - Step 43534: {'lr': 0.00040897541790867165, 'samples': 8358528, 'steps': 43533, 'loss/train': 2.5205626487731934} 11/07/2021 03:19:54 - INFO - __main__ - Step 43535: {'lr': 0.00040897132228632035, 'samples': 8358720, 'steps': 43534, 'loss/train': 1.61453115940094} 11/07/2021 03:19:54 - INFO - __main__ - Step 43536: {'lr': 0.000408967226592339, 'samples': 8358912, 'steps': 43535, 'loss/train': 1.846745252609253} 11/07/2021 03:19:56 - INFO - __main__ - Step 43537: {'lr': 0.00040896313082672953, 'samples': 8359104, 'steps': 43536, 'loss/train': 0.9068235754966736} 11/07/2021 03:19:56 - INFO - __main__ - Step 43538: {'lr': 0.0004089590349894937, 'samples': 8359296, 'steps': 43537, 'loss/train': 1.7170244455337524} 11/07/2021 03:19:56 - INFO - __main__ - Step 43539: {'lr': 0.0004089549390806334, 'samples': 8359488, 'steps': 43538, 'loss/train': 1.5198299884796143} 11/07/2021 03:19:57 - INFO - __main__ - Step 43540: {'lr': 0.0004089508431001504, 'samples': 8359680, 'steps': 43539, 'loss/train': 1.860291600227356} 11/07/2021 03:19:57 - INFO - __main__ - Step 43541: {'lr': 0.00040894674704804667, 'samples': 8359872, 'steps': 43540, 'loss/train': 1.6128144264221191} 11/07/2021 03:19:57 - INFO - __main__ - Step 43542: {'lr': 0.00040894265092432397, 'samples': 8360064, 'steps': 43541, 'loss/train': 1.2706501483917236} 11/07/2021 03:19:58 - INFO - __main__ - Step 43543: {'lr': 0.0004089385547289841, 'samples': 8360256, 'steps': 43542, 'loss/train': 0.9065102338790894} 11/07/2021 03:19:59 - INFO - __main__ - Step 43544: {'lr': 0.00040893445846202904, 'samples': 8360448, 'steps': 43543, 'loss/train': 1.482672929763794} 11/07/2021 03:19:59 - INFO - __main__ - Step 43545: {'lr': 0.00040893036212346056, 'samples': 8360640, 'steps': 43544, 'loss/train': 1.546755075454712} 11/07/2021 03:19:59 - INFO - __main__ - Step 43546: {'lr': 0.00040892626571328053, 'samples': 8360832, 'steps': 43545, 'loss/train': 1.0674269199371338} 11/07/2021 03:20:00 - INFO - __main__ - Step 43547: {'lr': 0.00040892216923149073, 'samples': 8361024, 'steps': 43546, 'loss/train': 1.7720543146133423} 11/07/2021 03:20:01 - INFO - __main__ - Step 43548: {'lr': 0.000408918072678093, 'samples': 8361216, 'steps': 43547, 'loss/train': 1.5051828622817993} 11/07/2021 03:20:01 - INFO - __main__ - Step 43549: {'lr': 0.0004089139760530893, 'samples': 8361408, 'steps': 43548, 'loss/train': 1.3326727151870728} 11/07/2021 03:20:02 - INFO - __main__ - Step 43550: {'lr': 0.0004089098793564815, 'samples': 8361600, 'steps': 43549, 'loss/train': 1.5760725736618042} 11/07/2021 03:20:02 - INFO - __main__ - Step 43551: {'lr': 0.00040890578258827125, 'samples': 8361792, 'steps': 43550, 'loss/train': 1.712302327156067} 11/07/2021 03:20:02 - INFO - __main__ - Step 43552: {'lr': 0.00040890168574846055, 'samples': 8361984, 'steps': 43551, 'loss/train': 1.5927711725234985} 11/07/2021 03:20:03 - INFO - __main__ - Step 43553: {'lr': 0.0004088975888370512, 'samples': 8362176, 'steps': 43552, 'loss/train': 1.4326268434524536} 11/07/2021 03:20:04 - INFO - __main__ - Step 43554: {'lr': 0.00040889349185404503, 'samples': 8362368, 'steps': 43553, 'loss/train': 1.8387601375579834} 11/07/2021 03:20:04 - INFO - __main__ - Step 43555: {'lr': 0.00040888939479944385, 'samples': 8362560, 'steps': 43554, 'loss/train': 1.3779672384262085} 11/07/2021 03:20:04 - INFO - __main__ - Step 43556: {'lr': 0.00040888529767324966, 'samples': 8362752, 'steps': 43555, 'loss/train': 1.569291114807129} 11/07/2021 03:20:05 - INFO - __main__ - Step 43557: {'lr': 0.0004088812004754642, 'samples': 8362944, 'steps': 43556, 'loss/train': 0.6441032886505127} 11/07/2021 03:20:06 - INFO - __main__ - Step 43558: {'lr': 0.00040887710320608927, 'samples': 8363136, 'steps': 43557, 'loss/train': 0.5327775478363037} 11/07/2021 03:20:06 - INFO - __main__ - Step 43559: {'lr': 0.00040887300586512677, 'samples': 8363328, 'steps': 43558, 'loss/train': 1.2207534313201904} 11/07/2021 03:20:06 - INFO - __main__ - Step 43560: {'lr': 0.0004088689084525786, 'samples': 8363520, 'steps': 43559, 'loss/train': 1.6354135274887085} 11/07/2021 03:20:07 - INFO - __main__ - Step 43561: {'lr': 0.0004088648109684465, 'samples': 8363712, 'steps': 43560, 'loss/train': 1.0811123847961426} 11/07/2021 03:20:07 - INFO - __main__ - Step 43562: {'lr': 0.00040886071341273236, 'samples': 8363904, 'steps': 43561, 'loss/train': 1.5641906261444092} 11/07/2021 03:20:08 - INFO - __main__ - Step 43563: {'lr': 0.0004088566157854381, 'samples': 8364096, 'steps': 43562, 'loss/train': 0.8359418511390686} 11/07/2021 03:20:08 - INFO - __main__ - Step 43564: {'lr': 0.0004088525180865654, 'samples': 8364288, 'steps': 43563, 'loss/train': 1.4746079444885254} 11/07/2021 03:20:09 - INFO - __main__ - Step 43565: {'lr': 0.0004088484203161163, 'samples': 8364480, 'steps': 43564, 'loss/train': 1.2377235889434814} 11/07/2021 03:20:09 - INFO - __main__ - Step 43566: {'lr': 0.0004088443224740925, 'samples': 8364672, 'steps': 43565, 'loss/train': 0.6066043972969055} 11/07/2021 03:20:09 - INFO - __main__ - Step 43567: {'lr': 0.00040884022456049595, 'samples': 8364864, 'steps': 43566, 'loss/train': 1.4646358489990234} 11/07/2021 03:20:10 - INFO - __main__ - Step 43568: {'lr': 0.00040883612657532844, 'samples': 8365056, 'steps': 43567, 'loss/train': 0.7460005879402161} 11/07/2021 03:20:11 - INFO - __main__ - Step 43569: {'lr': 0.0004088320285185918, 'samples': 8365248, 'steps': 43568, 'loss/train': 1.2883244752883911} 11/07/2021 03:20:11 - INFO - __main__ - Step 43570: {'lr': 0.0004088279303902879, 'samples': 8365440, 'steps': 43569, 'loss/train': 1.4460668563842773} 11/07/2021 03:20:12 - INFO - __main__ - Step 43571: {'lr': 0.0004088238321904185, 'samples': 8365632, 'steps': 43570, 'loss/train': 1.7092125415802002} 11/07/2021 03:20:12 - INFO - __main__ - Step 43572: {'lr': 0.00040881973391898563, 'samples': 8365824, 'steps': 43571, 'loss/train': 1.3663840293884277} 11/07/2021 03:20:12 - INFO - __main__ - Step 43573: {'lr': 0.00040881563557599107, 'samples': 8366016, 'steps': 43572, 'loss/train': 1.221165418624878} 11/07/2021 03:20:13 - INFO - __main__ - Step 43574: {'lr': 0.00040881153716143656, 'samples': 8366208, 'steps': 43573, 'loss/train': 1.2819992303848267} 11/07/2021 03:20:14 - INFO - __main__ - Step 43575: {'lr': 0.000408807438675324, 'samples': 8366400, 'steps': 43574, 'loss/train': 1.6424840688705444} 11/07/2021 03:20:14 - INFO - __main__ - Step 43576: {'lr': 0.0004088033401176554, 'samples': 8366592, 'steps': 43575, 'loss/train': 1.4731272459030151} 11/07/2021 03:20:14 - INFO - __main__ - Step 43577: {'lr': 0.00040879924148843233, 'samples': 8366784, 'steps': 43576, 'loss/train': 1.5095930099487305} 11/07/2021 03:20:15 - INFO - __main__ - Step 43578: {'lr': 0.00040879514278765685, 'samples': 8366976, 'steps': 43577, 'loss/train': 1.735713005065918} 11/07/2021 03:20:16 - INFO - __main__ - Step 43579: {'lr': 0.00040879104401533064, 'samples': 8367168, 'steps': 43578, 'loss/train': 1.2982885837554932} 11/07/2021 03:20:16 - INFO - __main__ - Step 43580: {'lr': 0.0004087869451714557, 'samples': 8367360, 'steps': 43579, 'loss/train': 1.4081155061721802} 11/07/2021 03:20:16 - INFO - __main__ - Step 43581: {'lr': 0.0004087828462560338, 'samples': 8367552, 'steps': 43580, 'loss/train': 1.8157951831817627} 11/07/2021 03:20:17 - INFO - __main__ - Step 43582: {'lr': 0.0004087787472690668, 'samples': 8367744, 'steps': 43581, 'loss/train': 1.1368916034698486} 11/07/2021 03:20:17 - INFO - __main__ - Step 43583: {'lr': 0.00040877464821055656, 'samples': 8367936, 'steps': 43582, 'loss/train': 1.5306081771850586} 11/07/2021 03:20:18 - INFO - __main__ - Step 43584: {'lr': 0.00040877054908050495, 'samples': 8368128, 'steps': 43583, 'loss/train': 1.5217291116714478} 11/07/2021 03:20:18 - INFO - __main__ - Step 43585: {'lr': 0.0004087664498789137, 'samples': 8368320, 'steps': 43584, 'loss/train': 1.1860651969909668} 11/07/2021 03:20:19 - INFO - __main__ - Step 43586: {'lr': 0.00040876235060578476, 'samples': 8368512, 'steps': 43585, 'loss/train': 1.665792465209961} 11/07/2021 03:20:19 - INFO - __main__ - Step 43587: {'lr': 0.00040875825126112, 'samples': 8368704, 'steps': 43586, 'loss/train': 1.4056962728500366} 11/07/2021 03:20:19 - INFO - __main__ - Step 43588: {'lr': 0.00040875415184492113, 'samples': 8368896, 'steps': 43587, 'loss/train': 1.6160039901733398} 11/07/2021 03:20:21 - INFO - __main__ - Step 43589: {'lr': 0.0004087500523571902, 'samples': 8369088, 'steps': 43588, 'loss/train': 1.5002211332321167} 11/07/2021 03:20:21 - INFO - __main__ - Step 43590: {'lr': 0.00040874595279792884, 'samples': 8369280, 'steps': 43589, 'loss/train': 1.3868716955184937} 11/07/2021 03:20:21 - INFO - __main__ - Step 43591: {'lr': 0.00040874185316713905, 'samples': 8369472, 'steps': 43590, 'loss/train': 1.3115204572677612} 11/07/2021 03:20:22 - INFO - __main__ - Step 43592: {'lr': 0.00040873775346482265, 'samples': 8369664, 'steps': 43591, 'loss/train': 0.7397732734680176} 11/07/2021 03:20:22 - INFO - __main__ - Step 43593: {'lr': 0.0004087336536909815, 'samples': 8369856, 'steps': 43592, 'loss/train': 1.7234948873519897} 11/07/2021 03:20:24 - INFO - __main__ - Step 43594: {'lr': 0.00040872955384561735, 'samples': 8370048, 'steps': 43593, 'loss/train': 1.3786344528198242} 11/07/2021 03:20:24 - INFO - __main__ - Step 43595: {'lr': 0.00040872545392873214, 'samples': 8370240, 'steps': 43594, 'loss/train': 1.480933427810669} 11/07/2021 03:20:24 - INFO - __main__ - Step 43596: {'lr': 0.00040872135394032764, 'samples': 8370432, 'steps': 43595, 'loss/train': 1.488183856010437} 11/07/2021 03:20:25 - INFO - __main__ - Step 43597: {'lr': 0.0004087172538804058, 'samples': 8370624, 'steps': 43596, 'loss/train': 1.3239374160766602} 11/07/2021 03:20:25 - INFO - __main__ - Step 43598: {'lr': 0.0004087131537489685, 'samples': 8370816, 'steps': 43597, 'loss/train': 1.4526338577270508} 11/07/2021 03:20:25 - INFO - __main__ - Step 43599: {'lr': 0.00040870905354601733, 'samples': 8371008, 'steps': 43598, 'loss/train': 1.9269925355911255} 11/07/2021 03:20:26 - INFO - __main__ - Step 43600: {'lr': 0.0004087049532715544, 'samples': 8371200, 'steps': 43599, 'loss/train': 0.2446974217891693} 11/07/2021 03:20:27 - INFO - __main__ - Step 43601: {'lr': 0.00040870085292558147, 'samples': 8371392, 'steps': 43600, 'loss/train': 1.1627293825149536} 11/07/2021 03:20:27 - INFO - __main__ - Step 43602: {'lr': 0.0004086967525081003, 'samples': 8371584, 'steps': 43601, 'loss/train': 1.3988213539123535} 11/07/2021 03:20:27 - INFO - __main__ - Step 43603: {'lr': 0.00040869265201911285, 'samples': 8371776, 'steps': 43602, 'loss/train': 0.8341631889343262} 11/07/2021 03:20:28 - INFO - __main__ - Step 43604: {'lr': 0.00040868855145862105, 'samples': 8371968, 'steps': 43603, 'loss/train': 1.744335412979126} 11/07/2021 03:20:29 - INFO - __main__ - Step 43605: {'lr': 0.00040868445082662655, 'samples': 8372160, 'steps': 43604, 'loss/train': 1.4655396938323975} 11/07/2021 03:20:29 - INFO - __main__ - Step 43606: {'lr': 0.0004086803501231313, 'samples': 8372352, 'steps': 43605, 'loss/train': 1.306358814239502} 11/07/2021 03:20:29 - INFO - __main__ - Step 43607: {'lr': 0.00040867624934813715, 'samples': 8372544, 'steps': 43606, 'loss/train': 2.9461324214935303} 11/07/2021 03:20:30 - INFO - __main__ - Step 43608: {'lr': 0.00040867214850164594, 'samples': 8372736, 'steps': 43607, 'loss/train': 0.9181822538375854} 11/07/2021 03:20:30 - INFO - __main__ - Step 43609: {'lr': 0.0004086680475836594, 'samples': 8372928, 'steps': 43608, 'loss/train': 0.5646260976791382} 11/07/2021 03:20:31 - INFO - __main__ - Step 43610: {'lr': 0.0004086639465941796, 'samples': 8373120, 'steps': 43609, 'loss/train': 1.3706705570220947} 11/07/2021 03:20:32 - INFO - __main__ - Step 43611: {'lr': 0.00040865984553320825, 'samples': 8373312, 'steps': 43610, 'loss/train': 1.3446487188339233} 11/07/2021 03:20:32 - INFO - __main__ - Step 43612: {'lr': 0.0004086557444007472, 'samples': 8373504, 'steps': 43611, 'loss/train': 1.6764800548553467} 11/07/2021 03:20:32 - INFO - __main__ - Step 43613: {'lr': 0.0004086516431967984, 'samples': 8373696, 'steps': 43612, 'loss/train': 1.6255570650100708} 11/07/2021 03:20:33 - INFO - __main__ - Step 43614: {'lr': 0.0004086475419213635, 'samples': 8373888, 'steps': 43613, 'loss/train': 1.2821707725524902} 11/07/2021 03:20:33 - INFO - __main__ - Step 43615: {'lr': 0.0004086434405744445, 'samples': 8374080, 'steps': 43614, 'loss/train': 1.3963944911956787} 11/07/2021 03:20:34 - INFO - __main__ - Step 43616: {'lr': 0.00040863933915604323, 'samples': 8374272, 'steps': 43615, 'loss/train': 0.8696134686470032} 11/07/2021 03:20:34 - INFO - __main__ - Step 43617: {'lr': 0.00040863523766616157, 'samples': 8374464, 'steps': 43616, 'loss/train': 1.4096627235412598} 11/07/2021 03:20:35 - INFO - __main__ - Step 43618: {'lr': 0.0004086311361048012, 'samples': 8374656, 'steps': 43617, 'loss/train': 1.5101468563079834} 11/07/2021 03:20:35 - INFO - __main__ - Step 43619: {'lr': 0.0004086270344719642, 'samples': 8374848, 'steps': 43618, 'loss/train': 1.7593225240707397} 11/07/2021 03:20:35 - INFO - __main__ - Step 43620: {'lr': 0.00040862293276765227, 'samples': 8375040, 'steps': 43619, 'loss/train': 1.3444418907165527} 11/07/2021 03:20:36 - INFO - __main__ - Step 43621: {'lr': 0.00040861883099186725, 'samples': 8375232, 'steps': 43620, 'loss/train': 1.2412489652633667} 11/07/2021 03:20:37 - INFO - __main__ - Step 43622: {'lr': 0.0004086147291446111, 'samples': 8375424, 'steps': 43621, 'loss/train': 1.0515379905700684} 11/07/2021 03:20:37 - INFO - __main__ - Step 43623: {'lr': 0.0004086106272258856, 'samples': 8375616, 'steps': 43622, 'loss/train': 0.8363951444625854} 11/07/2021 03:20:37 - INFO - __main__ - Step 43624: {'lr': 0.0004086065252356925, 'samples': 8375808, 'steps': 43623, 'loss/train': 1.4724141359329224} 11/07/2021 03:20:38 - INFO - __main__ - Step 43625: {'lr': 0.00040860242317403383, 'samples': 8376000, 'steps': 43624, 'loss/train': 1.4095970392227173} 11/07/2021 03:20:39 - INFO - __main__ - Step 43626: {'lr': 0.0004085983210409114, 'samples': 8376192, 'steps': 43625, 'loss/train': 1.4020105600357056} 11/07/2021 03:20:40 - INFO - __main__ - Step 43627: {'lr': 0.00040859421883632696, 'samples': 8376384, 'steps': 43626, 'loss/train': 1.2069751024246216} 11/07/2021 03:20:40 - INFO - __main__ - Step 43628: {'lr': 0.0004085901165602824, 'samples': 8376576, 'steps': 43627, 'loss/train': 1.2861889600753784} 11/07/2021 03:20:40 - INFO - __main__ - Step 43629: {'lr': 0.00040858601421277956, 'samples': 8376768, 'steps': 43628, 'loss/train': 1.0271927118301392} 11/07/2021 03:20:41 - INFO - __main__ - Step 43630: {'lr': 0.00040858191179382044, 'samples': 8376960, 'steps': 43629, 'loss/train': 1.1741310358047485} 11/07/2021 03:20:41 - INFO - __main__ - Step 43631: {'lr': 0.0004085778093034066, 'samples': 8377152, 'steps': 43630, 'loss/train': 1.7343759536743164} 11/07/2021 03:20:42 - INFO - __main__ - Step 43632: {'lr': 0.0004085737067415401, 'samples': 8377344, 'steps': 43631, 'loss/train': 1.6797895431518555} 11/07/2021 03:20:42 - INFO - __main__ - Step 43633: {'lr': 0.00040856960410822277, 'samples': 8377536, 'steps': 43632, 'loss/train': 1.4149872064590454} 11/07/2021 03:20:43 - INFO - __main__ - Step 43634: {'lr': 0.0004085655014034564, 'samples': 8377728, 'steps': 43633, 'loss/train': 1.6078990697860718} 11/07/2021 03:20:43 - INFO - __main__ - Step 43635: {'lr': 0.0004085613986272428, 'samples': 8377920, 'steps': 43634, 'loss/train': 5.793621063232422} 11/07/2021 03:20:43 - INFO - __main__ - Step 43636: {'lr': 0.0004085572957795839, 'samples': 8378112, 'steps': 43635, 'loss/train': 1.4916640520095825} 11/07/2021 03:20:44 - INFO - __main__ - Step 43637: {'lr': 0.00040855319286048163, 'samples': 8378304, 'steps': 43636, 'loss/train': 1.4157909154891968} 11/07/2021 03:20:45 - INFO - __main__ - Step 43638: {'lr': 0.0004085490898699377, 'samples': 8378496, 'steps': 43637, 'loss/train': 1.3781392574310303} 11/07/2021 03:20:45 - INFO - __main__ - Step 43639: {'lr': 0.0004085449868079539, 'samples': 8378688, 'steps': 43638, 'loss/train': 1.5037422180175781} 11/07/2021 03:20:46 - INFO - __main__ - Step 43640: {'lr': 0.00040854088367453225, 'samples': 8378880, 'steps': 43639, 'loss/train': 1.285473346710205} 11/07/2021 03:20:46 - INFO - __main__ - Step 43641: {'lr': 0.00040853678046967454, 'samples': 8379072, 'steps': 43640, 'loss/train': 1.3737177848815918} 11/07/2021 03:20:46 - INFO - __main__ - Step 43642: {'lr': 0.00040853267719338256, 'samples': 8379264, 'steps': 43641, 'loss/train': 1.3150479793548584} 11/07/2021 03:20:47 - INFO - __main__ - Step 43643: {'lr': 0.00040852857384565824, 'samples': 8379456, 'steps': 43642, 'loss/train': 1.0506174564361572} 11/07/2021 03:20:48 - INFO - __main__ - Step 43644: {'lr': 0.00040852447042650337, 'samples': 8379648, 'steps': 43643, 'loss/train': 1.6337863206863403} 11/07/2021 03:20:48 - INFO - __main__ - Step 43645: {'lr': 0.0004085203669359198, 'samples': 8379840, 'steps': 43644, 'loss/train': 1.521552324295044} 11/07/2021 03:20:48 - INFO - __main__ - Step 43646: {'lr': 0.0004085162633739095, 'samples': 8380032, 'steps': 43645, 'loss/train': 1.760745882987976} 11/07/2021 03:20:49 - INFO - __main__ - Step 43647: {'lr': 0.0004085121597404741, 'samples': 8380224, 'steps': 43646, 'loss/train': 1.9856324195861816} 11/07/2021 03:20:50 - INFO - __main__ - Step 43648: {'lr': 0.0004085080560356156, 'samples': 8380416, 'steps': 43647, 'loss/train': 1.5533442497253418} 11/07/2021 03:20:50 - INFO - __main__ - Step 43649: {'lr': 0.0004085039522593358, 'samples': 8380608, 'steps': 43648, 'loss/train': 1.524967908859253} 11/07/2021 03:20:51 - INFO - __main__ - Step 43650: {'lr': 0.0004084998484116366, 'samples': 8380800, 'steps': 43649, 'loss/train': 1.6469630002975464} 11/07/2021 03:20:51 - INFO - __main__ - Step 43651: {'lr': 0.0004084957444925198, 'samples': 8380992, 'steps': 43650, 'loss/train': 1.450783371925354} 11/07/2021 03:20:51 - INFO - __main__ - Step 43652: {'lr': 0.0004084916405019873, 'samples': 8381184, 'steps': 43651, 'loss/train': 1.3922834396362305} 11/07/2021 03:20:52 - INFO - __main__ - Step 43653: {'lr': 0.0004084875364400409, 'samples': 8381376, 'steps': 43652, 'loss/train': 1.9974443912506104} 11/07/2021 03:20:53 - INFO - __main__ - Step 43654: {'lr': 0.0004084834323066824, 'samples': 8381568, 'steps': 43653, 'loss/train': 1.7659012079238892} 11/07/2021 03:20:53 - INFO - __main__ - Step 43655: {'lr': 0.00040847932810191375, 'samples': 8381760, 'steps': 43654, 'loss/train': 1.340469479560852} 11/07/2021 03:20:53 - INFO - __main__ - Step 43656: {'lr': 0.00040847522382573675, 'samples': 8381952, 'steps': 43655, 'loss/train': 1.574964165687561} 11/07/2021 03:20:54 - INFO - __main__ - Step 43657: {'lr': 0.0004084711194781533, 'samples': 8382144, 'steps': 43656, 'loss/train': 1.0405417680740356} 11/07/2021 03:20:54 - INFO - __main__ - Step 43658: {'lr': 0.00040846701505916516, 'samples': 8382336, 'steps': 43657, 'loss/train': 1.6210185289382935} 11/07/2021 03:20:55 - INFO - __main__ - Step 43659: {'lr': 0.00040846291056877425, 'samples': 8382528, 'steps': 43658, 'loss/train': 1.4585506916046143} 11/07/2021 03:20:55 - INFO - __main__ - Step 43660: {'lr': 0.0004084588060069824, 'samples': 8382720, 'steps': 43659, 'loss/train': 1.3846935033798218} 11/07/2021 03:20:56 - INFO - __main__ - Step 43661: {'lr': 0.0004084547013737915, 'samples': 8382912, 'steps': 43660, 'loss/train': 1.5579522848129272} 11/07/2021 03:20:56 - INFO - __main__ - Step 43662: {'lr': 0.00040845059666920323, 'samples': 8383104, 'steps': 43661, 'loss/train': 1.7791005373001099} 11/07/2021 03:20:57 - INFO - __main__ - Step 43663: {'lr': 0.0004084464918932197, 'samples': 8383296, 'steps': 43662, 'loss/train': 1.4406996965408325} 11/07/2021 03:20:58 - INFO - __main__ - Step 43664: {'lr': 0.0004084423870458426, 'samples': 8383488, 'steps': 43663, 'loss/train': 1.6090887784957886} 11/07/2021 03:20:58 - INFO - __main__ - Step 43665: {'lr': 0.00040843828212707366, 'samples': 8383680, 'steps': 43664, 'loss/train': 1.5177305936813354} 11/07/2021 03:20:58 - INFO - __main__ - Step 43666: {'lr': 0.00040843417713691505, 'samples': 8383872, 'steps': 43665, 'loss/train': 0.8475781083106995} 11/07/2021 03:20:59 - INFO - __main__ - Step 43667: {'lr': 0.0004084300720753684, 'samples': 8384064, 'steps': 43666, 'loss/train': 1.6888883113861084} 11/07/2021 03:20:59 - INFO - __main__ - Step 43668: {'lr': 0.0004084259669424356, 'samples': 8384256, 'steps': 43667, 'loss/train': 1.521574854850769} 11/07/2021 03:21:00 - INFO - __main__ - Step 43669: {'lr': 0.0004084218617381185, 'samples': 8384448, 'steps': 43668, 'loss/train': 1.4842170476913452} 11/07/2021 03:21:00 - INFO - __main__ - Step 43670: {'lr': 0.00040841775646241897, 'samples': 8384640, 'steps': 43669, 'loss/train': 0.7249814867973328} 11/07/2021 03:21:01 - INFO - __main__ - Step 43671: {'lr': 0.0004084136511153388, 'samples': 8384832, 'steps': 43670, 'loss/train': 1.9210768938064575} 11/07/2021 03:21:01 - INFO - __main__ - Step 43672: {'lr': 0.0004084095456968799, 'samples': 8385024, 'steps': 43671, 'loss/train': 1.5453283786773682} 11/07/2021 03:21:02 - INFO - __main__ - Step 43673: {'lr': 0.0004084054402070441, 'samples': 8385216, 'steps': 43672, 'loss/train': 1.5406078100204468} 11/07/2021 03:21:02 - INFO - __main__ - Step 43674: {'lr': 0.0004084013346458333, 'samples': 8385408, 'steps': 43673, 'loss/train': 1.7233799695968628} 11/07/2021 03:21:03 - INFO - __main__ - Step 43675: {'lr': 0.00040839722901324924, 'samples': 8385600, 'steps': 43674, 'loss/train': 0.7722538709640503} 11/07/2021 03:21:03 - INFO - __main__ - Step 43676: {'lr': 0.00040839312330929377, 'samples': 8385792, 'steps': 43675, 'loss/train': 1.3467720746994019} 11/07/2021 03:21:04 - INFO - __main__ - Step 43677: {'lr': 0.00040838901753396896, 'samples': 8385984, 'steps': 43676, 'loss/train': 1.4793387651443481} 11/07/2021 03:21:04 - INFO - __main__ - Step 43678: {'lr': 0.0004083849116872764, 'samples': 8386176, 'steps': 43677, 'loss/train': 1.6407707929611206} 11/07/2021 03:21:05 - INFO - __main__ - Step 43679: {'lr': 0.0004083808057692181, 'samples': 8386368, 'steps': 43678, 'loss/train': 1.5521893501281738} 11/07/2021 03:21:05 - INFO - __main__ - Step 43680: {'lr': 0.00040837669977979586, 'samples': 8386560, 'steps': 43679, 'loss/train': 1.1769499778747559} 11/07/2021 03:21:06 - INFO - __main__ - Step 43681: {'lr': 0.00040837259371901145, 'samples': 8386752, 'steps': 43680, 'loss/train': 1.577649474143982} 11/07/2021 03:21:06 - INFO - __main__ - Step 43682: {'lr': 0.00040836848758686687, 'samples': 8386944, 'steps': 43681, 'loss/train': 1.474136233329773} 11/07/2021 03:21:06 - INFO - __main__ - Step 43683: {'lr': 0.00040836438138336384, 'samples': 8387136, 'steps': 43682, 'loss/train': 1.5819343328475952} 11/07/2021 03:21:07 - INFO - __main__ - Step 43684: {'lr': 0.00040836027510850426, 'samples': 8387328, 'steps': 43683, 'loss/train': 1.250996708869934} 11/07/2021 03:21:08 - INFO - __main__ - Step 43685: {'lr': 0.00040835616876229, 'samples': 8387520, 'steps': 43684, 'loss/train': 1.6471080780029297} 11/07/2021 03:21:08 - INFO - __main__ - Step 43686: {'lr': 0.00040835206234472287, 'samples': 8387712, 'steps': 43685, 'loss/train': 1.461700439453125} 11/07/2021 03:21:08 - INFO - __main__ - Step 43687: {'lr': 0.0004083479558558048, 'samples': 8387904, 'steps': 43686, 'loss/train': 1.5194342136383057} 11/07/2021 03:21:09 - INFO - __main__ - Step 43688: {'lr': 0.0004083438492955376, 'samples': 8388096, 'steps': 43687, 'loss/train': 1.0905221700668335} 11/07/2021 03:21:10 - INFO - __main__ - Step 43689: {'lr': 0.00040833974266392306, 'samples': 8388288, 'steps': 43688, 'loss/train': 1.6670489311218262} 11/07/2021 03:21:10 - INFO - __main__ - Step 43690: {'lr': 0.00040833563596096305, 'samples': 8388480, 'steps': 43689, 'loss/train': 1.2590972185134888} 11/07/2021 03:21:10 - INFO - __main__ - Step 43691: {'lr': 0.0004083315291866595, 'samples': 8388672, 'steps': 43690, 'loss/train': 1.5689071416854858} 11/07/2021 03:21:11 - INFO - __main__ - Step 43692: {'lr': 0.00040832742234101415, 'samples': 8388864, 'steps': 43691, 'loss/train': 1.4217921495437622} 11/07/2021 03:21:11 - INFO - __main__ - Step 43693: {'lr': 0.00040832331542402895, 'samples': 8389056, 'steps': 43692, 'loss/train': 1.8546339273452759} 11/07/2021 03:21:12 - INFO - __main__ - Step 43694: {'lr': 0.0004083192084357057, 'samples': 8389248, 'steps': 43693, 'loss/train': 1.5490301847457886} 11/07/2021 03:21:13 - INFO - __main__ - Step 43695: {'lr': 0.0004083151013760462, 'samples': 8389440, 'steps': 43694, 'loss/train': 1.5929126739501953} 11/07/2021 03:21:13 - INFO - __main__ - Step 43696: {'lr': 0.0004083109942450524, 'samples': 8389632, 'steps': 43695, 'loss/train': 1.4311790466308594} 11/07/2021 03:21:13 - INFO - __main__ - Step 43697: {'lr': 0.00040830688704272615, 'samples': 8389824, 'steps': 43696, 'loss/train': 1.2046213150024414} 11/07/2021 03:21:14 - INFO - __main__ - Step 43698: {'lr': 0.0004083027797690693, 'samples': 8390016, 'steps': 43697, 'loss/train': 1.4289475679397583} 11/07/2021 03:21:14 - INFO - __main__ - Step 43699: {'lr': 0.0004082986724240835, 'samples': 8390208, 'steps': 43698, 'loss/train': 1.9071747064590454} 11/07/2021 03:21:15 - INFO - __main__ - Step 43700: {'lr': 0.00040829456500777084, 'samples': 8390400, 'steps': 43699, 'loss/train': 1.3858799934387207} 11/07/2021 03:21:15 - INFO - __main__ - Step 43701: {'lr': 0.00040829045752013317, 'samples': 8390592, 'steps': 43700, 'loss/train': 1.8750146627426147} 11/07/2021 03:21:16 - INFO - __main__ - Step 43702: {'lr': 0.00040828634996117213, 'samples': 8390784, 'steps': 43701, 'loss/train': 1.3986989259719849} 11/07/2021 03:21:16 - INFO - __main__ - Step 43703: {'lr': 0.0004082822423308897, 'samples': 8390976, 'steps': 43702, 'loss/train': 0.9039545059204102} 11/07/2021 03:21:16 - INFO - __main__ - Step 43704: {'lr': 0.00040827813462928784, 'samples': 8391168, 'steps': 43703, 'loss/train': 1.2571094036102295} 11/07/2021 03:21:17 - INFO - __main__ - Step 43705: {'lr': 0.0004082740268563683, 'samples': 8391360, 'steps': 43704, 'loss/train': 1.4285101890563965} 11/07/2021 03:21:18 - INFO - __main__ - Step 43706: {'lr': 0.0004082699190121329, 'samples': 8391552, 'steps': 43705, 'loss/train': 0.6200053691864014} 11/07/2021 03:21:18 - INFO - __main__ - Step 43707: {'lr': 0.00040826581109658345, 'samples': 8391744, 'steps': 43706, 'loss/train': 1.3370939493179321} 11/07/2021 03:21:19 - INFO - __main__ - Step 43708: {'lr': 0.00040826170310972196, 'samples': 8391936, 'steps': 43707, 'loss/train': 1.3259520530700684} 11/07/2021 03:21:19 - INFO - __main__ - Step 43709: {'lr': 0.0004082575950515501, 'samples': 8392128, 'steps': 43708, 'loss/train': 1.5314348936080933} 11/07/2021 03:21:19 - INFO - __main__ - Step 43710: {'lr': 0.00040825348692206985, 'samples': 8392320, 'steps': 43709, 'loss/train': 1.7945643663406372} 11/07/2021 03:21:20 - INFO - __main__ - Step 43711: {'lr': 0.0004082493787212831, 'samples': 8392512, 'steps': 43710, 'loss/train': 1.4325158596038818} 11/07/2021 03:21:21 - INFO - __main__ - Step 43712: {'lr': 0.00040824527044919153, 'samples': 8392704, 'steps': 43711, 'loss/train': 1.5164660215377808} 11/07/2021 03:21:21 - INFO - __main__ - Step 43713: {'lr': 0.0004082411621057971, 'samples': 8392896, 'steps': 43712, 'loss/train': 1.6946877241134644} 11/07/2021 03:21:21 - INFO - __main__ - Step 43714: {'lr': 0.00040823705369110163, 'samples': 8393088, 'steps': 43713, 'loss/train': 0.49761763215065} 11/07/2021 03:21:22 - INFO - __main__ - Step 43715: {'lr': 0.000408232945205107, 'samples': 8393280, 'steps': 43714, 'loss/train': 1.9720444679260254} 11/07/2021 03:21:23 - INFO - __main__ - Step 43716: {'lr': 0.00040822883664781506, 'samples': 8393472, 'steps': 43715, 'loss/train': 0.9192848205566406} 11/07/2021 03:21:23 - INFO - __main__ - Step 43717: {'lr': 0.0004082247280192276, 'samples': 8393664, 'steps': 43716, 'loss/train': 1.2968155145645142} 11/07/2021 03:21:23 - INFO - __main__ - Step 43718: {'lr': 0.00040822061931934656, 'samples': 8393856, 'steps': 43717, 'loss/train': 1.8738207817077637} 11/07/2021 03:21:24 - INFO - __main__ - Step 43719: {'lr': 0.00040821651054817376, 'samples': 8394048, 'steps': 43718, 'loss/train': 1.6693713665008545} 11/07/2021 03:21:24 - INFO - __main__ - Step 43720: {'lr': 0.000408212401705711, 'samples': 8394240, 'steps': 43719, 'loss/train': 1.4084855318069458} 11/07/2021 03:21:25 - INFO - __main__ - Step 43721: {'lr': 0.0004082082927919602, 'samples': 8394432, 'steps': 43720, 'loss/train': 0.9644858241081238} 11/07/2021 03:21:25 - INFO - __main__ - Step 43722: {'lr': 0.0004082041838069232, 'samples': 8394624, 'steps': 43721, 'loss/train': 1.4714794158935547} 11/07/2021 03:21:26 - INFO - __main__ - Step 43723: {'lr': 0.0004082000747506018, 'samples': 8394816, 'steps': 43722, 'loss/train': 1.086302638053894} 11/07/2021 03:21:26 - INFO - __main__ - Step 43724: {'lr': 0.00040819596562299793, 'samples': 8395008, 'steps': 43723, 'loss/train': 0.6590692400932312} 11/07/2021 03:21:27 - INFO - __main__ - Step 43725: {'lr': 0.0004081918564241134, 'samples': 8395200, 'steps': 43724, 'loss/train': 1.3129510879516602} 11/07/2021 03:21:29 - INFO - __main__ - Step 43726: {'lr': 0.00040818774715395, 'samples': 8395392, 'steps': 43725, 'loss/train': 1.5787527561187744} 11/07/2021 03:21:29 - INFO - __main__ - Step 43727: {'lr': 0.0004081836378125097, 'samples': 8395584, 'steps': 43726, 'loss/train': 1.050729751586914} 11/07/2021 03:21:29 - INFO - __main__ - Step 43728: {'lr': 0.00040817952839979424, 'samples': 8395776, 'steps': 43727, 'loss/train': 1.8111873865127563} 11/07/2021 03:21:30 - INFO - __main__ - Step 43729: {'lr': 0.00040817541891580557, 'samples': 8395968, 'steps': 43728, 'loss/train': 0.9503833055496216} 11/07/2021 03:21:30 - INFO - __main__ - Step 43730: {'lr': 0.00040817130936054546, 'samples': 8396160, 'steps': 43729, 'loss/train': 1.7883265018463135} 11/07/2021 03:21:30 - INFO - __main__ - Step 43731: {'lr': 0.00040816719973401586, 'samples': 8396352, 'steps': 43730, 'loss/train': 1.64004647731781} 11/07/2021 03:21:31 - INFO - __main__ - Step 43732: {'lr': 0.0004081630900362185, 'samples': 8396544, 'steps': 43731, 'loss/train': 1.5553715229034424} 11/07/2021 03:21:32 - INFO - __main__ - Step 43733: {'lr': 0.0004081589802671553, 'samples': 8396736, 'steps': 43732, 'loss/train': 1.3468505144119263} 11/07/2021 03:21:32 - INFO - __main__ - Step 43734: {'lr': 0.00040815487042682814, 'samples': 8396928, 'steps': 43733, 'loss/train': 1.5261868238449097} 11/07/2021 03:21:32 - INFO - __main__ - Step 43735: {'lr': 0.0004081507605152388, 'samples': 8397120, 'steps': 43734, 'loss/train': 0.9943277835845947} 11/07/2021 03:21:33 - INFO - __main__ - Step 43736: {'lr': 0.0004081466505323892, 'samples': 8397312, 'steps': 43735, 'loss/train': 0.5817473530769348} 11/07/2021 03:21:33 - INFO - __main__ - Step 43737: {'lr': 0.0004081425404782811, 'samples': 8397504, 'steps': 43736, 'loss/train': 1.6696367263793945} 11/07/2021 03:21:34 - INFO - __main__ - Step 43738: {'lr': 0.00040813843035291655, 'samples': 8397696, 'steps': 43737, 'loss/train': 1.6473397016525269} 11/07/2021 03:21:35 - INFO - __main__ - Step 43739: {'lr': 0.00040813432015629714, 'samples': 8397888, 'steps': 43738, 'loss/train': 1.7417515516281128} 11/07/2021 03:21:35 - INFO - __main__ - Step 43740: {'lr': 0.0004081302098884249, 'samples': 8398080, 'steps': 43739, 'loss/train': 0.8534517288208008} 11/07/2021 03:21:35 - INFO - __main__ - Step 43741: {'lr': 0.0004081260995493015, 'samples': 8398272, 'steps': 43740, 'loss/train': 2.0336036682128906} 11/07/2021 03:21:36 - INFO - __main__ - Step 43742: {'lr': 0.0004081219891389291, 'samples': 8398464, 'steps': 43741, 'loss/train': 1.8651096820831299} 11/07/2021 03:21:36 - INFO - __main__ - Step 43743: {'lr': 0.0004081178786573092, 'samples': 8398656, 'steps': 43742, 'loss/train': 1.3008315563201904} 11/07/2021 03:21:37 - INFO - __main__ - Step 43744: {'lr': 0.000408113768104444, 'samples': 8398848, 'steps': 43743, 'loss/train': 1.471579909324646} 11/07/2021 03:21:38 - INFO - __main__ - Step 43745: {'lr': 0.0004081096574803351, 'samples': 8399040, 'steps': 43744, 'loss/train': 1.38153076171875} 11/07/2021 03:21:38 - INFO - __main__ - Step 43746: {'lr': 0.00040810554678498434, 'samples': 8399232, 'steps': 43745, 'loss/train': 1.2050849199295044} 11/07/2021 03:21:38 - INFO - __main__ - Step 43747: {'lr': 0.00040810143601839377, 'samples': 8399424, 'steps': 43746, 'loss/train': 1.6005651950836182} 11/07/2021 03:21:39 - INFO - __main__ - Step 43748: {'lr': 0.0004080973251805651, 'samples': 8399616, 'steps': 43747, 'loss/train': 1.2557823657989502} 11/07/2021 03:21:40 - INFO - __main__ - Step 43749: {'lr': 0.0004080932142715002, 'samples': 8399808, 'steps': 43748, 'loss/train': 1.5913665294647217} 11/07/2021 03:21:40 - INFO - __main__ - Step 43750: {'lr': 0.000408089103291201, 'samples': 8400000, 'steps': 43749, 'loss/train': 1.661261796951294} 11/07/2021 03:21:40 - INFO - __main__ - Step 43751: {'lr': 0.0004080849922396692, 'samples': 8400192, 'steps': 43750, 'loss/train': 1.527084231376648} 11/07/2021 03:21:41 - INFO - __main__ - Step 43752: {'lr': 0.00040808088111690677, 'samples': 8400384, 'steps': 43751, 'loss/train': 1.6774927377700806} 11/07/2021 03:21:41 - INFO - __main__ - Step 43753: {'lr': 0.00040807676992291557, 'samples': 8400576, 'steps': 43752, 'loss/train': 1.3135204315185547} 11/07/2021 03:21:42 - INFO - __main__ - Step 43754: {'lr': 0.0004080726586576974, 'samples': 8400768, 'steps': 43753, 'loss/train': 1.443676471710205} 11/07/2021 03:21:42 - INFO - __main__ - Step 43755: {'lr': 0.0004080685473212541, 'samples': 8400960, 'steps': 43754, 'loss/train': 1.5122127532958984} 11/07/2021 03:21:43 - INFO - __main__ - Step 43756: {'lr': 0.0004080644359135876, 'samples': 8401152, 'steps': 43755, 'loss/train': 1.314313530921936} 11/07/2021 03:21:43 - INFO - __main__ - Step 43757: {'lr': 0.00040806032443469967, 'samples': 8401344, 'steps': 43756, 'loss/train': 1.472224473953247} 11/07/2021 03:21:44 - INFO - __main__ - Step 43758: {'lr': 0.0004080562128845923, 'samples': 8401536, 'steps': 43757, 'loss/train': 1.3135592937469482} 11/07/2021 03:21:44 - INFO - __main__ - Step 43759: {'lr': 0.0004080521012632671, 'samples': 8401728, 'steps': 43758, 'loss/train': 1.6687555313110352} 11/07/2021 03:21:45 - INFO - __main__ - Step 43760: {'lr': 0.00040804798957072607, 'samples': 8401920, 'steps': 43759, 'loss/train': 1.6299394369125366} 11/07/2021 03:21:45 - INFO - __main__ - Step 43761: {'lr': 0.0004080438778069711, 'samples': 8402112, 'steps': 43760, 'loss/train': 1.4132717847824097} 11/07/2021 03:21:46 - INFO - __main__ - Step 43762: {'lr': 0.000408039765972004, 'samples': 8402304, 'steps': 43761, 'loss/train': 1.6096906661987305} 11/07/2021 03:21:46 - INFO - __main__ - Step 43763: {'lr': 0.0004080356540658266, 'samples': 8402496, 'steps': 43762, 'loss/train': 1.373243808746338} 11/07/2021 03:21:46 - INFO - __main__ - Step 43764: {'lr': 0.00040803154208844086, 'samples': 8402688, 'steps': 43763, 'loss/train': 1.814076542854309} 11/07/2021 03:21:47 - INFO - __main__ - Step 43765: {'lr': 0.00040802743003984845, 'samples': 8402880, 'steps': 43764, 'loss/train': 1.7175999879837036} 11/07/2021 03:21:48 - INFO - __main__ - Step 43766: {'lr': 0.0004080233179200513, 'samples': 8403072, 'steps': 43765, 'loss/train': 1.591747522354126} 11/07/2021 03:21:48 - INFO - __main__ - Step 43767: {'lr': 0.00040801920572905133, 'samples': 8403264, 'steps': 43766, 'loss/train': 1.5183857679367065} 11/07/2021 03:21:49 - INFO - __main__ - Step 43768: {'lr': 0.0004080150934668503, 'samples': 8403456, 'steps': 43767, 'loss/train': 1.1555874347686768} 11/07/2021 03:21:49 - INFO - __main__ - Step 43769: {'lr': 0.00040801098113345014, 'samples': 8403648, 'steps': 43768, 'loss/train': 1.8131109476089478} 11/07/2021 03:21:50 - INFO - __main__ - Step 43770: {'lr': 0.00040800686872885267, 'samples': 8403840, 'steps': 43769, 'loss/train': 0.3235270082950592} 11/07/2021 03:21:50 - INFO - __main__ - Step 43771: {'lr': 0.0004080027562530598, 'samples': 8404032, 'steps': 43770, 'loss/train': 1.4960527420043945} 11/07/2021 03:21:51 - INFO - __main__ - Step 43772: {'lr': 0.0004079986437060733, 'samples': 8404224, 'steps': 43771, 'loss/train': 2.1998274326324463} 11/07/2021 03:21:51 - INFO - __main__ - Step 43773: {'lr': 0.00040799453108789497, 'samples': 8404416, 'steps': 43772, 'loss/train': 1.7890753746032715} 11/07/2021 03:21:51 - INFO - __main__ - Step 43774: {'lr': 0.0004079904183985268, 'samples': 8404608, 'steps': 43773, 'loss/train': 1.5287282466888428} 11/07/2021 03:21:52 - INFO - __main__ - Step 43775: {'lr': 0.00040798630563797055, 'samples': 8404800, 'steps': 43774, 'loss/train': 1.4617946147918701} 11/07/2021 03:21:53 - INFO - __main__ - Step 43776: {'lr': 0.00040798219280622816, 'samples': 8404992, 'steps': 43775, 'loss/train': 1.5505151748657227} 11/07/2021 03:21:53 - INFO - __main__ - Step 43777: {'lr': 0.0004079780799033014, 'samples': 8405184, 'steps': 43776, 'loss/train': 1.1835485696792603} 11/07/2021 03:21:53 - INFO - __main__ - Step 43778: {'lr': 0.0004079739669291922, 'samples': 8405376, 'steps': 43777, 'loss/train': 1.4083938598632812} 11/07/2021 03:21:54 - INFO - __main__ - Step 43779: {'lr': 0.0004079698538839023, 'samples': 8405568, 'steps': 43778, 'loss/train': 1.3506239652633667} 11/07/2021 03:21:55 - INFO - __main__ - Step 43780: {'lr': 0.00040796574076743366, 'samples': 8405760, 'steps': 43779, 'loss/train': 1.6787636280059814} 11/07/2021 03:21:55 - INFO - __main__ - Step 43781: {'lr': 0.00040796162757978803, 'samples': 8405952, 'steps': 43780, 'loss/train': 2.036315679550171} 11/07/2021 03:21:55 - INFO - __main__ - Step 43782: {'lr': 0.00040795751432096746, 'samples': 8406144, 'steps': 43781, 'loss/train': 1.39372980594635} 11/07/2021 03:21:56 - INFO - __main__ - Step 43783: {'lr': 0.00040795340099097357, 'samples': 8406336, 'steps': 43782, 'loss/train': 1.23551607131958} 11/07/2021 03:21:56 - INFO - __main__ - Step 43784: {'lr': 0.00040794928758980837, 'samples': 8406528, 'steps': 43783, 'loss/train': 1.1073113679885864} 11/07/2021 03:21:56 - INFO - __main__ - Step 43785: {'lr': 0.0004079451741174737, 'samples': 8406720, 'steps': 43784, 'loss/train': 1.5919462442398071} 11/07/2021 03:21:58 - INFO - __main__ - Step 43786: {'lr': 0.00040794106057397123, 'samples': 8406912, 'steps': 43785, 'loss/train': 1.5587059259414673} 11/07/2021 03:21:58 - INFO - __main__ - Step 43787: {'lr': 0.00040793694695930304, 'samples': 8407104, 'steps': 43786, 'loss/train': 1.360137701034546} 11/07/2021 03:21:59 - INFO - __main__ - Step 43788: {'lr': 0.00040793283327347085, 'samples': 8407296, 'steps': 43787, 'loss/train': 1.580074667930603} 11/07/2021 03:21:59 - INFO - __main__ - Step 43789: {'lr': 0.00040792871951647657, 'samples': 8407488, 'steps': 43788, 'loss/train': 1.572245478630066} 11/07/2021 03:21:59 - INFO - __main__ - Step 43790: {'lr': 0.00040792460568832214, 'samples': 8407680, 'steps': 43789, 'loss/train': 1.5273981094360352} 11/07/2021 03:22:00 - INFO - __main__ - Step 43791: {'lr': 0.00040792049178900924, 'samples': 8407872, 'steps': 43790, 'loss/train': 1.3839747905731201} 11/07/2021 03:22:01 - INFO - __main__ - Step 43792: {'lr': 0.00040791637781853983, 'samples': 8408064, 'steps': 43791, 'loss/train': 1.33804190158844} 11/07/2021 03:22:01 - INFO - __main__ - Step 43793: {'lr': 0.0004079122637769157, 'samples': 8408256, 'steps': 43792, 'loss/train': 1.3798296451568604} 11/07/2021 03:22:02 - INFO - __main__ - Step 43794: {'lr': 0.0004079081496641388, 'samples': 8408448, 'steps': 43793, 'loss/train': 1.4129741191864014} 11/07/2021 03:22:02 - INFO - __main__ - Step 43795: {'lr': 0.0004079040354802109, 'samples': 8408640, 'steps': 43794, 'loss/train': 0.3329477906227112} 11/07/2021 03:22:03 - INFO - __main__ - Step 43796: {'lr': 0.00040789992122513386, 'samples': 8408832, 'steps': 43795, 'loss/train': 0.7639191150665283} 11/07/2021 03:22:04 - INFO - __main__ - Step 43797: {'lr': 0.00040789580689890953, 'samples': 8409024, 'steps': 43796, 'loss/train': 1.3062173128128052} 11/07/2021 03:22:04 - INFO - __main__ - Step 43798: {'lr': 0.00040789169250153985, 'samples': 8409216, 'steps': 43797, 'loss/train': 1.3329349756240845} 11/07/2021 03:22:04 - INFO - __main__ - Step 43799: {'lr': 0.00040788757803302656, 'samples': 8409408, 'steps': 43798, 'loss/train': 1.2266077995300293} 11/07/2021 03:22:05 - INFO - __main__ - Step 43800: {'lr': 0.00040788346349337156, 'samples': 8409600, 'steps': 43799, 'loss/train': 1.3185638189315796} 11/07/2021 03:22:05 - INFO - __main__ - Step 43801: {'lr': 0.00040787934888257673, 'samples': 8409792, 'steps': 43800, 'loss/train': 1.1510093212127686} 11/07/2021 03:22:06 - INFO - __main__ - Step 43802: {'lr': 0.00040787523420064394, 'samples': 8409984, 'steps': 43801, 'loss/train': 1.5760995149612427} 11/07/2021 03:22:06 - INFO - __main__ - Step 43803: {'lr': 0.00040787111944757496, 'samples': 8410176, 'steps': 43802, 'loss/train': 1.4465820789337158} 11/07/2021 03:22:07 - INFO - __main__ - Step 43804: {'lr': 0.0004078670046233717, 'samples': 8410368, 'steps': 43803, 'loss/train': 1.5060796737670898} 11/07/2021 03:22:07 - INFO - __main__ - Step 43805: {'lr': 0.000407862889728036, 'samples': 8410560, 'steps': 43804, 'loss/train': 1.6155372858047485} 11/07/2021 03:22:07 - INFO - __main__ - Step 43806: {'lr': 0.0004078587747615697, 'samples': 8410752, 'steps': 43805, 'loss/train': 1.4102935791015625} 11/07/2021 03:22:08 - INFO - __main__ - Step 43807: {'lr': 0.00040785465972397475, 'samples': 8410944, 'steps': 43806, 'loss/train': 1.7546608448028564} 11/07/2021 03:22:09 - INFO - __main__ - Step 43808: {'lr': 0.0004078505446152528, 'samples': 8411136, 'steps': 43807, 'loss/train': 1.7220182418823242} 11/07/2021 03:22:09 - INFO - __main__ - Step 43809: {'lr': 0.0004078464294354059, 'samples': 8411328, 'steps': 43808, 'loss/train': 1.6403729915618896} 11/07/2021 03:22:09 - INFO - __main__ - Step 43810: {'lr': 0.00040784231418443585, 'samples': 8411520, 'steps': 43809, 'loss/train': 1.6539579629898071} 11/07/2021 03:22:10 - INFO - __main__ - Step 43811: {'lr': 0.00040783819886234445, 'samples': 8411712, 'steps': 43810, 'loss/train': 1.6177512407302856} 11/07/2021 03:22:11 - INFO - __main__ - Step 43812: {'lr': 0.00040783408346913366, 'samples': 8411904, 'steps': 43811, 'loss/train': 1.1286613941192627} 11/07/2021 03:22:11 - INFO - __main__ - Step 43813: {'lr': 0.00040782996800480523, 'samples': 8412096, 'steps': 43812, 'loss/train': 1.3283767700195312} 11/07/2021 03:22:12 - INFO - __main__ - Step 43814: {'lr': 0.000407825852469361, 'samples': 8412288, 'steps': 43813, 'loss/train': 1.4667810201644897} 11/07/2021 03:22:12 - INFO - __main__ - Step 43815: {'lr': 0.00040782173686280287, 'samples': 8412480, 'steps': 43814, 'loss/train': 1.4546928405761719} 11/07/2021 03:22:12 - INFO - __main__ - Step 43816: {'lr': 0.0004078176211851328, 'samples': 8412672, 'steps': 43815, 'loss/train': 1.089661717414856} 11/07/2021 03:22:13 - INFO - __main__ - Step 43817: {'lr': 0.0004078135054363524, 'samples': 8412864, 'steps': 43816, 'loss/train': 1.2308920621871948} 11/07/2021 03:22:14 - INFO - __main__ - Step 43818: {'lr': 0.00040780938961646385, 'samples': 8413056, 'steps': 43817, 'loss/train': 1.5190150737762451} 11/07/2021 03:22:14 - INFO - __main__ - Step 43819: {'lr': 0.00040780527372546874, 'samples': 8413248, 'steps': 43818, 'loss/train': 1.7634469270706177} 11/07/2021 03:22:14 - INFO - __main__ - Step 43820: {'lr': 0.000407801157763369, 'samples': 8413440, 'steps': 43819, 'loss/train': 1.3441243171691895} 11/07/2021 03:22:15 - INFO - __main__ - Step 43821: {'lr': 0.0004077970417301665, 'samples': 8413632, 'steps': 43820, 'loss/train': 1.578310251235962} 11/07/2021 03:22:15 - INFO - __main__ - Step 43822: {'lr': 0.00040779292562586304, 'samples': 8413824, 'steps': 43821, 'loss/train': 1.3511230945587158} 11/07/2021 03:22:16 - INFO - __main__ - Step 43823: {'lr': 0.0004077888094504606, 'samples': 8414016, 'steps': 43822, 'loss/train': 1.7738044261932373} 11/07/2021 03:22:16 - INFO - __main__ - Step 43824: {'lr': 0.0004077846932039609, 'samples': 8414208, 'steps': 43823, 'loss/train': 1.5807143449783325} 11/07/2021 03:22:17 - INFO - __main__ - Step 43825: {'lr': 0.00040778057688636594, 'samples': 8414400, 'steps': 43824, 'loss/train': 1.0032193660736084} 11/07/2021 03:22:17 - INFO - __main__ - Step 43826: {'lr': 0.00040777646049767736, 'samples': 8414592, 'steps': 43825, 'loss/train': 1.5744746923446655} 11/07/2021 03:22:17 - INFO - __main__ - Step 43827: {'lr': 0.0004077723440378972, 'samples': 8414784, 'steps': 43826, 'loss/train': 1.5103524923324585} 11/07/2021 03:22:18 - INFO - __main__ - Step 43828: {'lr': 0.0004077682275070273, 'samples': 8414976, 'steps': 43827, 'loss/train': 1.714619517326355} 11/07/2021 03:22:19 - INFO - __main__ - Step 43829: {'lr': 0.00040776411090506944, 'samples': 8415168, 'steps': 43828, 'loss/train': 4.008372783660889} 11/07/2021 03:22:19 - INFO - __main__ - Step 43830: {'lr': 0.0004077599942320255, 'samples': 8415360, 'steps': 43829, 'loss/train': 1.4760822057724} 11/07/2021 03:22:19 - INFO - __main__ - Step 43831: {'lr': 0.00040775587748789733, 'samples': 8415552, 'steps': 43830, 'loss/train': 1.6377016305923462} 11/07/2021 03:22:20 - INFO - __main__ - Step 43832: {'lr': 0.0004077517606726868, 'samples': 8415744, 'steps': 43831, 'loss/train': 1.2174091339111328} 11/07/2021 03:22:21 - INFO - __main__ - Step 43833: {'lr': 0.0004077476437863958, 'samples': 8415936, 'steps': 43832, 'loss/train': 1.558376431465149} 11/07/2021 03:22:22 - INFO - __main__ - Step 43834: {'lr': 0.0004077435268290261, 'samples': 8416128, 'steps': 43833, 'loss/train': 1.772778034210205} 11/07/2021 03:22:22 - INFO - __main__ - Step 43835: {'lr': 0.0004077394098005796, 'samples': 8416320, 'steps': 43834, 'loss/train': 1.7632755041122437} 11/07/2021 03:22:22 - INFO - __main__ - Step 43836: {'lr': 0.00040773529270105816, 'samples': 8416512, 'steps': 43835, 'loss/train': 0.5030577182769775} 11/07/2021 03:22:23 - INFO - __main__ - Step 43837: {'lr': 0.0004077311755304637, 'samples': 8416704, 'steps': 43836, 'loss/train': 1.724934458732605} 11/07/2021 03:22:24 - INFO - __main__ - Step 43838: {'lr': 0.000407727058288798, 'samples': 8416896, 'steps': 43837, 'loss/train': 1.2645070552825928} 11/07/2021 03:22:24 - INFO - __main__ - Step 43839: {'lr': 0.00040772294097606276, 'samples': 8417088, 'steps': 43838, 'loss/train': 1.1850857734680176} 11/07/2021 03:22:24 - INFO - __main__ - Step 43840: {'lr': 0.0004077188235922601, 'samples': 8417280, 'steps': 43839, 'loss/train': 1.8185043334960938} 11/07/2021 03:22:25 - INFO - __main__ - Step 43841: {'lr': 0.0004077147061373918, 'samples': 8417472, 'steps': 43840, 'loss/train': 1.5139338970184326} 11/07/2021 03:22:25 - INFO - __main__ - Step 43842: {'lr': 0.00040771058861145963, 'samples': 8417664, 'steps': 43841, 'loss/train': 1.6452528238296509} 11/07/2021 03:22:25 - INFO - __main__ - Step 43843: {'lr': 0.0004077064710144656, 'samples': 8417856, 'steps': 43842, 'loss/train': 2.128436326980591} 11/07/2021 03:22:27 - INFO - __main__ - Step 43844: {'lr': 0.0004077023533464114, 'samples': 8418048, 'steps': 43843, 'loss/train': 1.321452260017395} 11/07/2021 03:22:27 - INFO - __main__ - Step 43845: {'lr': 0.000407698235607299, 'samples': 8418240, 'steps': 43844, 'loss/train': 1.3933018445968628} 11/07/2021 03:22:27 - INFO - __main__ - Step 43846: {'lr': 0.0004076941177971301, 'samples': 8418432, 'steps': 43845, 'loss/train': 1.6112278699874878} 11/07/2021 03:22:28 - INFO - __main__ - Step 43847: {'lr': 0.0004076899999159067, 'samples': 8418624, 'steps': 43846, 'loss/train': 1.155909538269043} 11/07/2021 03:22:28 - INFO - __main__ - Step 43848: {'lr': 0.0004076858819636307, 'samples': 8418816, 'steps': 43847, 'loss/train': 1.2335573434829712} 11/07/2021 03:22:29 - INFO - __main__ - Step 43849: {'lr': 0.0004076817639403038, 'samples': 8419008, 'steps': 43848, 'loss/train': 1.659220576286316} 11/07/2021 03:22:29 - INFO - __main__ - Step 43850: {'lr': 0.0004076776458459279, 'samples': 8419200, 'steps': 43849, 'loss/train': 1.4065468311309814} 11/07/2021 03:22:30 - INFO - __main__ - Step 43851: {'lr': 0.00040767352768050503, 'samples': 8419392, 'steps': 43850, 'loss/train': 1.8235328197479248} 11/07/2021 03:22:30 - INFO - __main__ - Step 43852: {'lr': 0.0004076694094440368, 'samples': 8419584, 'steps': 43851, 'loss/train': 1.302893042564392} 11/07/2021 03:22:30 - INFO - __main__ - Step 43853: {'lr': 0.0004076652911365252, 'samples': 8419776, 'steps': 43852, 'loss/train': 1.7538639307022095} 11/07/2021 03:22:32 - INFO - __main__ - Step 43854: {'lr': 0.00040766117275797196, 'samples': 8419968, 'steps': 43853, 'loss/train': 2.043196201324463} 11/07/2021 03:22:32 - INFO - __main__ - Step 43855: {'lr': 0.0004076570543083792, 'samples': 8420160, 'steps': 43854, 'loss/train': 1.650506615638733} 11/07/2021 03:22:32 - INFO - __main__ - Step 43856: {'lr': 0.0004076529357877485, 'samples': 8420352, 'steps': 43855, 'loss/train': 1.1953593492507935} 11/07/2021 03:22:33 - INFO - __main__ - Step 43857: {'lr': 0.00040764881719608184, 'samples': 8420544, 'steps': 43856, 'loss/train': 1.5082013607025146} 11/07/2021 03:22:33 - INFO - __main__ - Step 43858: {'lr': 0.000407644698533381, 'samples': 8420736, 'steps': 43857, 'loss/train': 1.5169827938079834} 11/07/2021 03:22:34 - INFO - __main__ - Step 43859: {'lr': 0.00040764057979964793, 'samples': 8420928, 'steps': 43858, 'loss/train': 1.1407544612884521} 11/07/2021 03:22:34 - INFO - __main__ - Step 43860: {'lr': 0.0004076364609948844, 'samples': 8421120, 'steps': 43859, 'loss/train': 1.04957115650177} 11/07/2021 03:22:35 - INFO - __main__ - Step 43861: {'lr': 0.0004076323421190924, 'samples': 8421312, 'steps': 43860, 'loss/train': 1.3893458843231201} 11/07/2021 03:22:35 - INFO - __main__ - Step 43862: {'lr': 0.0004076282231722737, 'samples': 8421504, 'steps': 43861, 'loss/train': 1.1427700519561768} 11/07/2021 03:22:35 - INFO - __main__ - Step 43863: {'lr': 0.0004076241041544301, 'samples': 8421696, 'steps': 43862, 'loss/train': 1.4993537664413452} 11/07/2021 03:22:36 - INFO - __main__ - Step 43864: {'lr': 0.00040761998506556353, 'samples': 8421888, 'steps': 43863, 'loss/train': 1.5167622566223145} 11/07/2021 03:22:37 - INFO - __main__ - Step 43865: {'lr': 0.0004076158659056758, 'samples': 8422080, 'steps': 43864, 'loss/train': 1.1694954633712769} 11/07/2021 03:22:37 - INFO - __main__ - Step 43866: {'lr': 0.00040761174667476883, 'samples': 8422272, 'steps': 43865, 'loss/train': 1.4639588594436646} 11/07/2021 03:22:37 - INFO - __main__ - Step 43867: {'lr': 0.0004076076273728444, 'samples': 8422464, 'steps': 43866, 'loss/train': 1.7881133556365967} 11/07/2021 03:22:38 - INFO - __main__ - Step 43868: {'lr': 0.0004076035079999045, 'samples': 8422656, 'steps': 43867, 'loss/train': 1.3311805725097656} 11/07/2021 03:22:39 - INFO - __main__ - Step 43869: {'lr': 0.0004075993885559508, 'samples': 8422848, 'steps': 43868, 'loss/train': 1.6587876081466675} 11/07/2021 03:22:39 - INFO - __main__ - Step 43870: {'lr': 0.0004075952690409852, 'samples': 8423040, 'steps': 43869, 'loss/train': 1.6615227460861206} 11/07/2021 03:22:39 - INFO - __main__ - Step 43871: {'lr': 0.00040759114945500974, 'samples': 8423232, 'steps': 43870, 'loss/train': 1.629590392112732} 11/07/2021 03:22:40 - INFO - __main__ - Step 43872: {'lr': 0.0004075870297980261, 'samples': 8423424, 'steps': 43871, 'loss/train': 1.5323020219802856} 11/07/2021 03:22:40 - INFO - __main__ - Step 43873: {'lr': 0.0004075829100700361, 'samples': 8423616, 'steps': 43872, 'loss/train': 1.6959728002548218} 11/07/2021 03:22:41 - INFO - __main__ - Step 43874: {'lr': 0.0004075787902710417, 'samples': 8423808, 'steps': 43873, 'loss/train': 1.9394711256027222} 11/07/2021 03:22:42 - INFO - __main__ - Step 43875: {'lr': 0.0004075746704010448, 'samples': 8424000, 'steps': 43874, 'loss/train': 1.5391508340835571} 11/07/2021 03:22:42 - INFO - __main__ - Step 43876: {'lr': 0.0004075705504600471, 'samples': 8424192, 'steps': 43875, 'loss/train': 1.6477059125900269} 11/07/2021 03:22:42 - INFO - __main__ - Step 43877: {'lr': 0.00040756643044805057, 'samples': 8424384, 'steps': 43876, 'loss/train': 2.9302780628204346} 11/07/2021 03:22:43 - INFO - __main__ - Step 43878: {'lr': 0.0004075623103650571, 'samples': 8424576, 'steps': 43877, 'loss/train': 1.6522893905639648} 11/07/2021 03:22:44 - INFO - __main__ - Step 43879: {'lr': 0.00040755819021106844, 'samples': 8424768, 'steps': 43878, 'loss/train': 1.4377820491790771} 11/07/2021 03:22:44 - INFO - __main__ - Step 43880: {'lr': 0.00040755406998608645, 'samples': 8424960, 'steps': 43879, 'loss/train': 1.9909913539886475} 11/07/2021 03:22:44 - INFO - __main__ - Step 43881: {'lr': 0.00040754994969011306, 'samples': 8425152, 'steps': 43880, 'loss/train': 0.9266546964645386} 11/07/2021 03:22:45 - INFO - __main__ - Step 43882: {'lr': 0.00040754582932315007, 'samples': 8425344, 'steps': 43881, 'loss/train': 2.1872682571411133} 11/07/2021 03:22:45 - INFO - __main__ - Step 43883: {'lr': 0.0004075417088851994, 'samples': 8425536, 'steps': 43882, 'loss/train': 1.6716867685317993} 11/07/2021 03:22:46 - INFO - __main__ - Step 43884: {'lr': 0.0004075375883762629, 'samples': 8425728, 'steps': 43883, 'loss/train': 1.638662338256836} 11/07/2021 03:22:47 - INFO - __main__ - Step 43885: {'lr': 0.0004075334677963423, 'samples': 8425920, 'steps': 43884, 'loss/train': 1.7723662853240967} 11/07/2021 03:22:47 - INFO - __main__ - Step 43886: {'lr': 0.0004075293471454396, 'samples': 8426112, 'steps': 43885, 'loss/train': 1.765032172203064} 11/07/2021 03:22:47 - INFO - __main__ - Step 43887: {'lr': 0.0004075252264235566, 'samples': 8426304, 'steps': 43886, 'loss/train': 1.8197784423828125} 11/07/2021 03:22:48 - INFO - __main__ - Step 43888: {'lr': 0.0004075211056306951, 'samples': 8426496, 'steps': 43887, 'loss/train': 1.9503345489501953} 11/07/2021 03:22:48 - INFO - __main__ - Step 43889: {'lr': 0.00040751698476685716, 'samples': 8426688, 'steps': 43888, 'loss/train': 1.4646555185317993} 11/07/2021 03:22:49 - INFO - __main__ - Step 43890: {'lr': 0.00040751286383204437, 'samples': 8426880, 'steps': 43889, 'loss/train': 1.5433449745178223} 11/07/2021 03:22:49 - INFO - __main__ - Step 43891: {'lr': 0.0004075087428262588, 'samples': 8427072, 'steps': 43890, 'loss/train': 1.3730559349060059} 11/07/2021 03:22:50 - INFO - __main__ - Step 43892: {'lr': 0.0004075046217495022, 'samples': 8427264, 'steps': 43891, 'loss/train': 0.6554791331291199} 11/07/2021 03:22:50 - INFO - __main__ - Step 43893: {'lr': 0.00040750050060177643, 'samples': 8427456, 'steps': 43892, 'loss/train': 1.933127760887146} 11/07/2021 03:22:50 - INFO - __main__ - Step 43894: {'lr': 0.00040749637938308336, 'samples': 8427648, 'steps': 43893, 'loss/train': 1.17872953414917} 11/07/2021 03:22:51 - INFO - __main__ - Step 43895: {'lr': 0.00040749225809342485, 'samples': 8427840, 'steps': 43894, 'loss/train': 1.7261704206466675} 11/07/2021 03:22:52 - INFO - __main__ - Step 43896: {'lr': 0.00040748813673280277, 'samples': 8428032, 'steps': 43895, 'loss/train': 1.2042407989501953} 11/07/2021 03:22:52 - INFO - __main__ - Step 43897: {'lr': 0.0004074840153012189, 'samples': 8428224, 'steps': 43896, 'loss/train': 1.368628978729248} 11/07/2021 03:22:52 - INFO - __main__ - Step 43898: {'lr': 0.0004074798937986753, 'samples': 8428416, 'steps': 43897, 'loss/train': 1.738026738166809} 11/07/2021 03:22:53 - INFO - __main__ - Step 43899: {'lr': 0.00040747577222517364, 'samples': 8428608, 'steps': 43898, 'loss/train': 1.714708685874939} 11/07/2021 03:22:54 - INFO - __main__ - Step 43900: {'lr': 0.0004074716505807158, 'samples': 8428800, 'steps': 43899, 'loss/train': 1.4866786003112793} 11/07/2021 03:22:55 - INFO - __main__ - Step 43901: {'lr': 0.0004074675288653037, 'samples': 8428992, 'steps': 43900, 'loss/train': 1.5645619630813599} 11/07/2021 03:22:55 - INFO - __main__ - Step 43902: {'lr': 0.0004074634070789391, 'samples': 8429184, 'steps': 43901, 'loss/train': 0.1000712662935257} 11/07/2021 03:22:55 - INFO - __main__ - Step 43903: {'lr': 0.0004074592852216239, 'samples': 8429376, 'steps': 43902, 'loss/train': 1.6530612707138062} 11/07/2021 03:22:56 - INFO - __main__ - Step 43904: {'lr': 0.0004074551632933601, 'samples': 8429568, 'steps': 43903, 'loss/train': 1.089402437210083} 11/07/2021 03:22:56 - INFO - __main__ - Step 43905: {'lr': 0.00040745104129414933, 'samples': 8429760, 'steps': 43904, 'loss/train': 0.9136307835578918} 11/07/2021 03:22:57 - INFO - __main__ - Step 43906: {'lr': 0.0004074469192239936, 'samples': 8429952, 'steps': 43905, 'loss/train': 1.4113155603408813} 11/07/2021 03:22:57 - INFO - __main__ - Step 43907: {'lr': 0.0004074427970828947, 'samples': 8430144, 'steps': 43906, 'loss/train': 1.4035757780075073} 11/07/2021 03:22:58 - INFO - __main__ - Step 43908: {'lr': 0.00040743867487085444, 'samples': 8430336, 'steps': 43907, 'loss/train': 1.4519786834716797} 11/07/2021 03:22:58 - INFO - __main__ - Step 43909: {'lr': 0.0004074345525878748, 'samples': 8430528, 'steps': 43908, 'loss/train': 1.6051146984100342} 11/07/2021 03:22:58 - INFO - __main__ - Step 43910: {'lr': 0.0004074304302339576, 'samples': 8430720, 'steps': 43909, 'loss/train': 1.672196388244629} 11/07/2021 03:22:59 - INFO - __main__ - Step 43911: {'lr': 0.0004074263078091046, 'samples': 8430912, 'steps': 43910, 'loss/train': 1.1506792306900024} 11/07/2021 03:23:00 - INFO - __main__ - Step 43912: {'lr': 0.00040742218531331786, 'samples': 8431104, 'steps': 43911, 'loss/train': 2.5236992835998535} 11/07/2021 03:23:00 - INFO - __main__ - Step 43913: {'lr': 0.0004074180627465991, 'samples': 8431296, 'steps': 43912, 'loss/train': 1.912400245666504} 11/07/2021 03:23:00 - INFO - __main__ - Step 43914: {'lr': 0.00040741394010895013, 'samples': 8431488, 'steps': 43913, 'loss/train': 1.4635186195373535} 11/07/2021 03:23:01 - INFO - __main__ - Step 43915: {'lr': 0.0004074098174003729, 'samples': 8431680, 'steps': 43914, 'loss/train': 1.7027950286865234} 11/07/2021 03:23:02 - INFO - __main__ - Step 43916: {'lr': 0.0004074056946208692, 'samples': 8431872, 'steps': 43915, 'loss/train': 1.5679969787597656} 11/07/2021 03:23:02 - INFO - __main__ - Step 43917: {'lr': 0.0004074015717704409, 'samples': 8432064, 'steps': 43916, 'loss/train': 1.717900037765503} 11/07/2021 03:23:03 - INFO - __main__ - Step 43918: {'lr': 0.00040739744884908994, 'samples': 8432256, 'steps': 43917, 'loss/train': 1.867804765701294} 11/07/2021 03:23:03 - INFO - __main__ - Step 43919: {'lr': 0.00040739332585681807, 'samples': 8432448, 'steps': 43918, 'loss/train': 1.896952509880066} 11/07/2021 03:23:03 - INFO - __main__ - Step 43920: {'lr': 0.00040738920279362724, 'samples': 8432640, 'steps': 43919, 'loss/train': 1.7845344543457031} 11/07/2021 03:23:04 - INFO - __main__ - Step 43921: {'lr': 0.00040738507965951923, 'samples': 8432832, 'steps': 43920, 'loss/train': 1.416855812072754} 11/07/2021 03:23:05 - INFO - __main__ - Step 43922: {'lr': 0.0004073809564544959, 'samples': 8433024, 'steps': 43921, 'loss/train': 2.4920897483825684} 11/07/2021 03:23:05 - INFO - __main__ - Step 43923: {'lr': 0.0004073768331785592, 'samples': 8433216, 'steps': 43922, 'loss/train': 1.7586002349853516} 11/07/2021 03:23:05 - INFO - __main__ - Step 43924: {'lr': 0.0004073727098317109, 'samples': 8433408, 'steps': 43923, 'loss/train': 1.983863353729248} 11/07/2021 03:23:06 - INFO - __main__ - Step 43925: {'lr': 0.0004073685864139529, 'samples': 8433600, 'steps': 43924, 'loss/train': 1.6089143753051758} 11/07/2021 03:23:07 - INFO - __main__ - Step 43926: {'lr': 0.00040736446292528704, 'samples': 8433792, 'steps': 43925, 'loss/train': 2.148613214492798} 11/07/2021 03:23:07 - INFO - __main__ - Step 43927: {'lr': 0.0004073603393657152, 'samples': 8433984, 'steps': 43926, 'loss/train': 1.701894998550415} 11/07/2021 03:23:08 - INFO - __main__ - Step 43928: {'lr': 0.0004073562157352392, 'samples': 8434176, 'steps': 43927, 'loss/train': 1.439337968826294} 11/07/2021 03:23:08 - INFO - __main__ - Step 43929: {'lr': 0.00040735209203386093, 'samples': 8434368, 'steps': 43928, 'loss/train': 1.6041038036346436} 11/07/2021 03:23:08 - INFO - __main__ - Step 43930: {'lr': 0.00040734796826158226, 'samples': 8434560, 'steps': 43929, 'loss/train': 1.6103554964065552} 11/07/2021 03:23:09 - INFO - __main__ - Step 43931: {'lr': 0.000407343844418405, 'samples': 8434752, 'steps': 43930, 'loss/train': 1.2818981409072876} 11/07/2021 03:23:10 - INFO - __main__ - Step 43932: {'lr': 0.000407339720504331, 'samples': 8434944, 'steps': 43931, 'loss/train': 1.7273222208023071} 11/07/2021 03:23:10 - INFO - __main__ - Step 43933: {'lr': 0.00040733559651936216, 'samples': 8435136, 'steps': 43932, 'loss/train': 9.138843536376953} 11/07/2021 03:23:11 - INFO - __main__ - Step 43934: {'lr': 0.0004073314724635003, 'samples': 8435328, 'steps': 43933, 'loss/train': 0.8566093444824219} 11/07/2021 03:23:11 - INFO - __main__ - Step 43935: {'lr': 0.0004073273483367474, 'samples': 8435520, 'steps': 43934, 'loss/train': 0.11346945911645889} 11/07/2021 03:23:11 - INFO - __main__ - Step 43936: {'lr': 0.0004073232241391052, 'samples': 8435712, 'steps': 43935, 'loss/train': 1.9948738813400269} 11/07/2021 03:23:12 - INFO - __main__ - Step 43937: {'lr': 0.00040731909987057547, 'samples': 8435904, 'steps': 43936, 'loss/train': 1.584019660949707} 11/07/2021 03:23:13 - INFO - __main__ - Step 43938: {'lr': 0.0004073149755311603, 'samples': 8436096, 'steps': 43937, 'loss/train': 1.7624531984329224} 11/07/2021 03:23:13 - INFO - __main__ - Step 43939: {'lr': 0.0004073108511208614, 'samples': 8436288, 'steps': 43938, 'loss/train': 1.6295770406723022} 11/07/2021 03:23:13 - INFO - __main__ - Step 43940: {'lr': 0.0004073067266396807, 'samples': 8436480, 'steps': 43939, 'loss/train': 1.6054798364639282} 11/07/2021 03:23:14 - INFO - __main__ - Step 43941: {'lr': 0.00040730260208761995, 'samples': 8436672, 'steps': 43940, 'loss/train': 1.169405460357666} 11/07/2021 03:23:15 - INFO - __main__ - Step 43942: {'lr': 0.0004072984774646811, 'samples': 8436864, 'steps': 43941, 'loss/train': 1.4220243692398071} 11/07/2021 03:23:15 - INFO - __main__ - Step 43943: {'lr': 0.0004072943527708659, 'samples': 8437056, 'steps': 43942, 'loss/train': 1.10098397731781} 11/07/2021 03:23:15 - INFO - __main__ - Step 43944: {'lr': 0.00040729022800617637, 'samples': 8437248, 'steps': 43943, 'loss/train': 1.892876148223877} 11/07/2021 03:23:16 - INFO - __main__ - Step 43945: {'lr': 0.00040728610317061433, 'samples': 8437440, 'steps': 43944, 'loss/train': 1.7797027826309204} 11/07/2021 03:23:16 - INFO - __main__ - Step 43946: {'lr': 0.0004072819782641816, 'samples': 8437632, 'steps': 43945, 'loss/train': 2.0013530254364014} 11/07/2021 03:23:17 - INFO - __main__ - Step 43947: {'lr': 0.00040727785328687995, 'samples': 8437824, 'steps': 43946, 'loss/train': 1.4861782789230347} 11/07/2021 03:23:17 - INFO - __main__ - Step 43948: {'lr': 0.00040727372823871135, 'samples': 8438016, 'steps': 43947, 'loss/train': 1.3490138053894043} 11/07/2021 03:23:18 - INFO - __main__ - Step 43949: {'lr': 0.00040726960311967766, 'samples': 8438208, 'steps': 43948, 'loss/train': 1.4066849946975708} 11/07/2021 03:23:18 - INFO - __main__ - Step 43950: {'lr': 0.0004072654779297807, 'samples': 8438400, 'steps': 43949, 'loss/train': 1.1034836769104004} 11/07/2021 03:23:19 - INFO - __main__ - Step 43951: {'lr': 0.0004072613526690223, 'samples': 8438592, 'steps': 43950, 'loss/train': 1.2344754934310913} 11/07/2021 03:23:19 - INFO - __main__ - Step 43952: {'lr': 0.00040725722733740444, 'samples': 8438784, 'steps': 43951, 'loss/train': 1.6640533208847046} 11/07/2021 03:23:20 - INFO - __main__ - Step 43953: {'lr': 0.0004072531019349289, 'samples': 8438976, 'steps': 43952, 'loss/train': 0.542209267616272} 11/07/2021 03:23:21 - INFO - __main__ - Step 43954: {'lr': 0.00040724897646159753, 'samples': 8439168, 'steps': 43953, 'loss/train': 1.1740684509277344} 11/07/2021 03:23:21 - INFO - __main__ - Step 43955: {'lr': 0.0004072448509174121, 'samples': 8439360, 'steps': 43954, 'loss/train': 1.7182492017745972} 11/07/2021 03:23:21 - INFO - __main__ - Step 43956: {'lr': 0.00040724072530237465, 'samples': 8439552, 'steps': 43955, 'loss/train': 1.1453046798706055} 11/07/2021 03:23:22 - INFO - __main__ - Step 43957: {'lr': 0.00040723659961648694, 'samples': 8439744, 'steps': 43956, 'loss/train': 2.543769598007202} 11/07/2021 03:23:22 - INFO - __main__ - Step 43958: {'lr': 0.0004072324738597509, 'samples': 8439936, 'steps': 43957, 'loss/train': 0.9986402988433838} 11/07/2021 03:23:23 - INFO - __main__ - Step 43959: {'lr': 0.00040722834803216834, 'samples': 8440128, 'steps': 43958, 'loss/train': 1.1566802263259888} 11/07/2021 03:23:23 - INFO - __main__ - Step 43960: {'lr': 0.000407224222133741, 'samples': 8440320, 'steps': 43959, 'loss/train': 2.107487440109253} 11/07/2021 03:23:24 - INFO - __main__ - Step 43961: {'lr': 0.00040722009616447094, 'samples': 8440512, 'steps': 43960, 'loss/train': 1.6063014268875122} 11/07/2021 03:23:24 - INFO - __main__ - Step 43962: {'lr': 0.0004072159701243599, 'samples': 8440704, 'steps': 43961, 'loss/train': 1.5708634853363037} 11/07/2021 03:23:24 - INFO - __main__ - Step 43963: {'lr': 0.00040721184401340977, 'samples': 8440896, 'steps': 43962, 'loss/train': 1.6888006925582886} 11/07/2021 03:23:25 - INFO - __main__ - Step 43964: {'lr': 0.00040720771783162236, 'samples': 8441088, 'steps': 43963, 'loss/train': 1.2426689863204956} 11/07/2021 03:23:26 - INFO - __main__ - Step 43965: {'lr': 0.0004072035915789997, 'samples': 8441280, 'steps': 43964, 'loss/train': 1.6193268299102783} 11/07/2021 03:23:26 - INFO - __main__ - Step 43966: {'lr': 0.0004071994652555434, 'samples': 8441472, 'steps': 43965, 'loss/train': 1.2662886381149292} 11/07/2021 03:23:27 - INFO - __main__ - Step 43967: {'lr': 0.0004071953388612555, 'samples': 8441664, 'steps': 43966, 'loss/train': 1.4906792640686035} 11/07/2021 03:23:27 - INFO - __main__ - Step 43968: {'lr': 0.0004071912123961379, 'samples': 8441856, 'steps': 43967, 'loss/train': 1.512399435043335} 11/07/2021 03:23:28 - INFO - __main__ - Step 43969: {'lr': 0.00040718708586019226, 'samples': 8442048, 'steps': 43968, 'loss/train': 0.19833578169345856} 11/07/2021 03:23:28 - INFO - __main__ - Step 43970: {'lr': 0.00040718295925342053, 'samples': 8442240, 'steps': 43969, 'loss/train': 1.3765857219696045} 11/07/2021 03:23:29 - INFO - __main__ - Step 43971: {'lr': 0.0004071788325758246, 'samples': 8442432, 'steps': 43970, 'loss/train': 0.7994476556777954} 11/07/2021 03:23:29 - INFO - __main__ - Step 43972: {'lr': 0.00040717470582740634, 'samples': 8442624, 'steps': 43971, 'loss/train': 1.7003074884414673} 11/07/2021 03:23:29 - INFO - __main__ - Step 43973: {'lr': 0.0004071705790081676, 'samples': 8442816, 'steps': 43972, 'loss/train': 1.42214834690094} 11/07/2021 03:23:30 - INFO - __main__ - Step 43974: {'lr': 0.0004071664521181102, 'samples': 8443008, 'steps': 43973, 'loss/train': 1.4173136949539185} 11/07/2021 03:23:31 - INFO - __main__ - Step 43975: {'lr': 0.00040716232515723596, 'samples': 8443200, 'steps': 43974, 'loss/train': 1.5742828845977783} 11/07/2021 03:23:31 - INFO - __main__ - Step 43976: {'lr': 0.00040715819812554686, 'samples': 8443392, 'steps': 43975, 'loss/train': 1.956968069076538} 11/07/2021 03:23:31 - INFO - __main__ - Step 43977: {'lr': 0.0004071540710230447, 'samples': 8443584, 'steps': 43976, 'loss/train': 1.3919167518615723} 11/07/2021 03:23:32 - INFO - __main__ - Step 43978: {'lr': 0.0004071499438497314, 'samples': 8443776, 'steps': 43977, 'loss/train': 1.4417613744735718} 11/07/2021 03:23:32 - INFO - __main__ - Step 43979: {'lr': 0.0004071458166056087, 'samples': 8443968, 'steps': 43978, 'loss/train': 1.6836180686950684} 11/07/2021 03:23:34 - INFO - __main__ - Step 43980: {'lr': 0.00040714168929067854, 'samples': 8444160, 'steps': 43979, 'loss/train': 1.9319530725479126} 11/07/2021 03:23:34 - INFO - __main__ - Step 43981: {'lr': 0.0004071375619049427, 'samples': 8444352, 'steps': 43980, 'loss/train': 1.5899041891098022} 11/07/2021 03:23:34 - INFO - __main__ - Step 43982: {'lr': 0.0004071334344484031, 'samples': 8444544, 'steps': 43981, 'loss/train': 2.048292636871338} 11/07/2021 03:23:35 - INFO - __main__ - Step 43983: {'lr': 0.00040712930692106164, 'samples': 8444736, 'steps': 43982, 'loss/train': 1.5842952728271484} 11/07/2021 03:23:35 - INFO - __main__ - Step 43984: {'lr': 0.00040712517932292016, 'samples': 8444928, 'steps': 43983, 'loss/train': 1.2995017766952515} 11/07/2021 03:23:36 - INFO - __main__ - Step 43985: {'lr': 0.00040712105165398044, 'samples': 8445120, 'steps': 43984, 'loss/train': 0.8532267808914185} 11/07/2021 03:23:36 - INFO - __main__ - Step 43986: {'lr': 0.0004071169239142445, 'samples': 8445312, 'steps': 43985, 'loss/train': 1.5599069595336914} 11/07/2021 03:23:37 - INFO - __main__ - Step 43987: {'lr': 0.000407112796103714, 'samples': 8445504, 'steps': 43986, 'loss/train': 1.4275630712509155} 11/07/2021 03:23:37 - INFO - __main__ - Step 43988: {'lr': 0.0004071086682223909, 'samples': 8445696, 'steps': 43987, 'loss/train': 1.1857975721359253} 11/07/2021 03:23:37 - INFO - __main__ - Step 43989: {'lr': 0.0004071045402702771, 'samples': 8445888, 'steps': 43988, 'loss/train': 1.6939241886138916} 11/07/2021 03:23:38 - INFO - __main__ - Step 43990: {'lr': 0.0004071004122473744, 'samples': 8446080, 'steps': 43989, 'loss/train': 1.6355971097946167} 11/07/2021 03:23:39 - INFO - __main__ - Step 43991: {'lr': 0.0004070962841536847, 'samples': 8446272, 'steps': 43990, 'loss/train': 1.6357855796813965} 11/07/2021 03:23:39 - INFO - __main__ - Step 43992: {'lr': 0.0004070921559892098, 'samples': 8446464, 'steps': 43991, 'loss/train': 1.360775113105774} 11/07/2021 03:23:39 - INFO - __main__ - Step 43993: {'lr': 0.00040708802775395165, 'samples': 8446656, 'steps': 43992, 'loss/train': 1.4375793933868408} 11/07/2021 03:23:40 - INFO - __main__ - Step 43994: {'lr': 0.000407083899447912, 'samples': 8446848, 'steps': 43993, 'loss/train': 1.5229673385620117} 11/07/2021 03:23:41 - INFO - __main__ - Step 43995: {'lr': 0.00040707977107109285, 'samples': 8447040, 'steps': 43994, 'loss/train': 1.6514801979064941} 11/07/2021 03:23:41 - INFO - __main__ - Step 43996: {'lr': 0.00040707564262349594, 'samples': 8447232, 'steps': 43995, 'loss/train': 1.9328930377960205} 11/07/2021 03:23:42 - INFO - __main__ - Step 43997: {'lr': 0.0004070715141051231, 'samples': 8447424, 'steps': 43996, 'loss/train': 1.518604040145874} 11/07/2021 03:23:42 - INFO - __main__ - Step 43998: {'lr': 0.00040706738551597634, 'samples': 8447616, 'steps': 43997, 'loss/train': 1.8714951276779175} 11/07/2021 03:23:42 - INFO - __main__ - Step 43999: {'lr': 0.0004070632568560574, 'samples': 8447808, 'steps': 43998, 'loss/train': 1.267977237701416} 11/07/2021 03:23:43 - INFO - __main__ - Step 44000: {'lr': 0.0004070591281253682, 'samples': 8448000, 'steps': 43999, 'loss/train': 1.512953519821167} 11/07/2021 03:23:44 - INFO - __main__ - Step 44001: {'lr': 0.0004070549993239106, 'samples': 8448192, 'steps': 44000, 'loss/train': 0.831805408000946} 11/07/2021 03:23:44 - INFO - __main__ - Step 44002: {'lr': 0.0004070508704516864, 'samples': 8448384, 'steps': 44001, 'loss/train': 1.5522176027297974} 11/07/2021 03:23:44 - INFO - __main__ - Step 44003: {'lr': 0.00040704674150869753, 'samples': 8448576, 'steps': 44002, 'loss/train': 1.5985102653503418} 11/07/2021 03:23:45 - INFO - __main__ - Step 44004: {'lr': 0.0004070426124949458, 'samples': 8448768, 'steps': 44003, 'loss/train': 1.6318305730819702} 11/07/2021 03:23:45 - INFO - __main__ - Step 44005: {'lr': 0.00040703848341043313, 'samples': 8448960, 'steps': 44004, 'loss/train': 1.3904163837432861} 11/07/2021 03:23:46 - INFO - __main__ - Step 44006: {'lr': 0.00040703435425516136, 'samples': 8449152, 'steps': 44005, 'loss/train': 1.818678855895996} 11/07/2021 03:23:46 - INFO - __main__ - Step 44007: {'lr': 0.0004070302250291322, 'samples': 8449344, 'steps': 44006, 'loss/train': 1.2677369117736816} 11/07/2021 03:23:47 - INFO - __main__ - Step 44008: {'lr': 0.0004070260957323478, 'samples': 8449536, 'steps': 44007, 'loss/train': 1.5789427757263184} 11/07/2021 03:23:47 - INFO - __main__ - Step 44009: {'lr': 0.0004070219663648098, 'samples': 8449728, 'steps': 44008, 'loss/train': 1.146945595741272} 11/07/2021 03:23:47 - INFO - __main__ - Step 44010: {'lr': 0.0004070178369265201, 'samples': 8449920, 'steps': 44009, 'loss/train': 1.254859447479248} 11/07/2021 03:23:49 - INFO - __main__ - Step 44011: {'lr': 0.00040701370741748057, 'samples': 8450112, 'steps': 44010, 'loss/train': 1.4340729713439941} 11/07/2021 03:23:49 - INFO - __main__ - Step 44012: {'lr': 0.0004070095778376932, 'samples': 8450304, 'steps': 44011, 'loss/train': 1.708217978477478} 11/07/2021 03:23:49 - INFO - __main__ - Step 44013: {'lr': 0.0004070054481871597, 'samples': 8450496, 'steps': 44012, 'loss/train': 1.3653974533081055} 11/07/2021 03:23:50 - INFO - __main__ - Step 44014: {'lr': 0.00040700131846588185, 'samples': 8450688, 'steps': 44013, 'loss/train': 0.32702651619911194} 11/07/2021 03:23:50 - INFO - __main__ - Step 44015: {'lr': 0.0004069971886738617, 'samples': 8450880, 'steps': 44014, 'loss/train': 1.6158820390701294} 11/07/2021 03:23:51 - INFO - __main__ - Step 44016: {'lr': 0.00040699305881110103, 'samples': 8451072, 'steps': 44015, 'loss/train': 1.2307311296463013} 11/07/2021 03:23:52 - INFO - __main__ - Step 44017: {'lr': 0.00040698892887760174, 'samples': 8451264, 'steps': 44016, 'loss/train': 2.1337993144989014} 11/07/2021 03:23:52 - INFO - __main__ - Step 44018: {'lr': 0.00040698479887336567, 'samples': 8451456, 'steps': 44017, 'loss/train': 1.6925996541976929} 11/07/2021 03:23:52 - INFO - __main__ - Step 44019: {'lr': 0.00040698066879839463, 'samples': 8451648, 'steps': 44018, 'loss/train': 1.8600653409957886} 11/07/2021 03:23:53 - INFO - __main__ - Step 44020: {'lr': 0.00040697653865269057, 'samples': 8451840, 'steps': 44019, 'loss/train': 0.4142761826515198} 11/07/2021 03:23:54 - INFO - __main__ - Step 44021: {'lr': 0.00040697240843625527, 'samples': 8452032, 'steps': 44020, 'loss/train': 1.5758490562438965} 11/07/2021 03:23:54 - INFO - __main__ - Step 44022: {'lr': 0.00040696827814909063, 'samples': 8452224, 'steps': 44021, 'loss/train': 1.2229807376861572} 11/07/2021 03:23:54 - INFO - __main__ - Step 44023: {'lr': 0.0004069641477911985, 'samples': 8452416, 'steps': 44022, 'loss/train': 1.3245600461959839} 11/07/2021 03:23:55 - INFO - __main__ - Step 44024: {'lr': 0.00040696001736258077, 'samples': 8452608, 'steps': 44023, 'loss/train': 1.4945148229599} 11/07/2021 03:23:55 - INFO - __main__ - Step 44025: {'lr': 0.0004069558868632393, 'samples': 8452800, 'steps': 44024, 'loss/train': 1.1728746891021729} 11/07/2021 03:23:56 - INFO - __main__ - Step 44026: {'lr': 0.0004069517562931759, 'samples': 8452992, 'steps': 44025, 'loss/train': 1.3753551244735718} 11/07/2021 03:23:56 - INFO - __main__ - Step 44027: {'lr': 0.0004069476256523924, 'samples': 8453184, 'steps': 44026, 'loss/train': 1.3952306509017944} 11/07/2021 03:23:57 - INFO - __main__ - Step 44028: {'lr': 0.0004069434949408908, 'samples': 8453376, 'steps': 44027, 'loss/train': 1.2849841117858887} 11/07/2021 03:23:57 - INFO - __main__ - Step 44029: {'lr': 0.0004069393641586728, 'samples': 8453568, 'steps': 44028, 'loss/train': 1.9058088064193726} 11/07/2021 03:23:57 - INFO - __main__ - Step 44030: {'lr': 0.00040693523330574043, 'samples': 8453760, 'steps': 44029, 'loss/train': 1.7052431106567383} 11/07/2021 03:23:58 - INFO - __main__ - Step 44031: {'lr': 0.0004069311023820954, 'samples': 8453952, 'steps': 44030, 'loss/train': 0.7730574011802673} 11/07/2021 03:23:59 - INFO - __main__ - Step 44032: {'lr': 0.0004069269713877397, 'samples': 8454144, 'steps': 44031, 'loss/train': 1.3204773664474487} 11/07/2021 03:23:59 - INFO - __main__ - Step 44033: {'lr': 0.00040692284032267515, 'samples': 8454336, 'steps': 44032, 'loss/train': 1.355200171470642} 11/07/2021 03:24:00 - INFO - __main__ - Step 44034: {'lr': 0.0004069187091869035, 'samples': 8454528, 'steps': 44033, 'loss/train': 1.5818285942077637} 11/07/2021 03:24:00 - INFO - __main__ - Step 44035: {'lr': 0.00040691457798042673, 'samples': 8454720, 'steps': 44034, 'loss/train': 1.3490406274795532} 11/07/2021 03:24:01 - INFO - __main__ - Step 44036: {'lr': 0.00040691044670324673, 'samples': 8454912, 'steps': 44035, 'loss/train': 1.9681199789047241} 11/07/2021 03:24:01 - INFO - __main__ - Step 44037: {'lr': 0.00040690631535536526, 'samples': 8455104, 'steps': 44036, 'loss/train': 1.5073906183242798} 11/07/2021 03:24:02 - INFO - __main__ - Step 44038: {'lr': 0.00040690218393678426, 'samples': 8455296, 'steps': 44037, 'loss/train': 1.4101229906082153} 11/07/2021 03:24:02 - INFO - __main__ - Step 44039: {'lr': 0.0004068980524475054, 'samples': 8455488, 'steps': 44038, 'loss/train': 1.77540123462677} 11/07/2021 03:24:03 - INFO - __main__ - Step 44040: {'lr': 0.00040689392088753097, 'samples': 8455680, 'steps': 44039, 'loss/train': 5.637470245361328} 11/07/2021 03:24:03 - INFO - __main__ - Step 44041: {'lr': 0.00040688978925686235, 'samples': 8455872, 'steps': 44040, 'loss/train': 1.8101271390914917} 11/07/2021 03:24:04 - INFO - __main__ - Step 44042: {'lr': 0.00040688565755550164, 'samples': 8456064, 'steps': 44041, 'loss/train': 1.7539114952087402} 11/07/2021 03:24:04 - INFO - __main__ - Step 44043: {'lr': 0.00040688152578345074, 'samples': 8456256, 'steps': 44042, 'loss/train': 1.4729076623916626} 11/07/2021 03:24:05 - INFO - __main__ - Step 44044: {'lr': 0.0004068773939407114, 'samples': 8456448, 'steps': 44043, 'loss/train': 1.6007177829742432} 11/07/2021 03:24:05 - INFO - __main__ - Step 44045: {'lr': 0.0004068732620272856, 'samples': 8456640, 'steps': 44044, 'loss/train': 1.4922720193862915} 11/07/2021 03:24:05 - INFO - __main__ - Step 44046: {'lr': 0.000406869130043175, 'samples': 8456832, 'steps': 44045, 'loss/train': 1.3924622535705566} 11/07/2021 03:24:06 - INFO - __main__ - Step 44047: {'lr': 0.0004068649979883817, 'samples': 8457024, 'steps': 44046, 'loss/train': 1.539118766784668} 11/07/2021 03:24:07 - INFO - __main__ - Step 44048: {'lr': 0.0004068608658629074, 'samples': 8457216, 'steps': 44047, 'loss/train': 1.7003668546676636} 11/07/2021 03:24:07 - INFO - __main__ - Step 44049: {'lr': 0.000406856733666754, 'samples': 8457408, 'steps': 44048, 'loss/train': 1.2621194124221802} 11/07/2021 03:24:07 - INFO - __main__ - Step 44050: {'lr': 0.00040685260139992343, 'samples': 8457600, 'steps': 44049, 'loss/train': 1.989881157875061} 11/07/2021 03:24:08 - INFO - __main__ - Step 44051: {'lr': 0.00040684846906241745, 'samples': 8457792, 'steps': 44050, 'loss/train': 1.3429982662200928} 11/07/2021 03:24:09 - INFO - __main__ - Step 44052: {'lr': 0.000406844336654238, 'samples': 8457984, 'steps': 44051, 'loss/train': 1.6912051439285278} 11/07/2021 03:24:09 - INFO - __main__ - Step 44053: {'lr': 0.00040684020417538694, 'samples': 8458176, 'steps': 44052, 'loss/train': 1.2079888582229614} 11/07/2021 03:24:09 - INFO - __main__ - Step 44054: {'lr': 0.00040683607162586604, 'samples': 8458368, 'steps': 44053, 'loss/train': 1.5336663722991943} 11/07/2021 03:24:10 - INFO - __main__ - Step 44055: {'lr': 0.00040683193900567727, 'samples': 8458560, 'steps': 44054, 'loss/train': 1.1132365465164185} 11/07/2021 03:24:10 - INFO - __main__ - Step 44056: {'lr': 0.00040682780631482243, 'samples': 8458752, 'steps': 44055, 'loss/train': 1.7476736307144165} 11/07/2021 03:24:10 - INFO - __main__ - Step 44057: {'lr': 0.0004068236735533034, 'samples': 8458944, 'steps': 44056, 'loss/train': 1.392434000968933} 11/07/2021 03:24:11 - INFO - __main__ - Step 44058: {'lr': 0.00040681954072112206, 'samples': 8459136, 'steps': 44057, 'loss/train': 1.7331249713897705} 11/07/2021 03:24:12 - INFO - __main__ - Step 44059: {'lr': 0.0004068154078182802, 'samples': 8459328, 'steps': 44058, 'loss/train': 0.8346248269081116} 11/07/2021 03:24:12 - INFO - __main__ - Step 44060: {'lr': 0.00040681127484477983, 'samples': 8459520, 'steps': 44059, 'loss/train': 1.481729507446289} 11/07/2021 03:24:12 - INFO - __main__ - Step 44061: {'lr': 0.0004068071418006226, 'samples': 8459712, 'steps': 44060, 'loss/train': 1.9142131805419922} 11/07/2021 03:24:13 - INFO - __main__ - Step 44062: {'lr': 0.0004068030086858106, 'samples': 8459904, 'steps': 44061, 'loss/train': 1.3857892751693726} 11/07/2021 03:24:14 - INFO - __main__ - Step 44063: {'lr': 0.00040679887550034555, 'samples': 8460096, 'steps': 44062, 'loss/train': 1.4364193677902222} 11/07/2021 03:24:14 - INFO - __main__ - Step 44064: {'lr': 0.0004067947422442293, 'samples': 8460288, 'steps': 44063, 'loss/train': 1.6256428956985474} 11/07/2021 03:24:15 - INFO - __main__ - Step 44065: {'lr': 0.00040679060891746384, 'samples': 8460480, 'steps': 44064, 'loss/train': 1.7078979015350342} 11/07/2021 03:24:15 - INFO - __main__ - Step 44066: {'lr': 0.00040678647552005087, 'samples': 8460672, 'steps': 44065, 'loss/train': 1.907792329788208} 11/07/2021 03:24:15 - INFO - __main__ - Step 44067: {'lr': 0.00040678234205199237, 'samples': 8460864, 'steps': 44066, 'loss/train': 1.3730069398880005} 11/07/2021 03:24:16 - INFO - __main__ - Step 44068: {'lr': 0.0004067782085132902, 'samples': 8461056, 'steps': 44067, 'loss/train': 1.428218960762024} 11/07/2021 03:24:17 - INFO - __main__ - Step 44069: {'lr': 0.00040677407490394616, 'samples': 8461248, 'steps': 44068, 'loss/train': 1.4499595165252686} 11/07/2021 03:24:17 - INFO - __main__ - Step 44070: {'lr': 0.0004067699412239622, 'samples': 8461440, 'steps': 44069, 'loss/train': 1.5239028930664062} 11/07/2021 03:24:17 - INFO - __main__ - Step 44071: {'lr': 0.00040676580747334, 'samples': 8461632, 'steps': 44070, 'loss/train': 1.042952537536621} 11/07/2021 03:24:18 - INFO - __main__ - Step 44072: {'lr': 0.0004067616736520816, 'samples': 8461824, 'steps': 44071, 'loss/train': 1.2719451189041138} 11/07/2021 03:24:19 - INFO - __main__ - Step 44073: {'lr': 0.0004067575397601888, 'samples': 8462016, 'steps': 44072, 'loss/train': 1.175555944442749} 11/07/2021 03:24:19 - INFO - __main__ - Step 44074: {'lr': 0.0004067534057976635, 'samples': 8462208, 'steps': 44073, 'loss/train': 1.5510388612747192} 11/07/2021 03:24:19 - INFO - __main__ - Step 44075: {'lr': 0.0004067492717645075, 'samples': 8462400, 'steps': 44074, 'loss/train': 1.422773003578186} 11/07/2021 03:24:20 - INFO - __main__ - Step 44076: {'lr': 0.00040674513766072274, 'samples': 8462592, 'steps': 44075, 'loss/train': 0.9936408996582031} 11/07/2021 03:24:20 - INFO - __main__ - Step 44077: {'lr': 0.000406741003486311, 'samples': 8462784, 'steps': 44076, 'loss/train': 1.760259985923767} 11/07/2021 03:24:21 - INFO - __main__ - Step 44078: {'lr': 0.00040673686924127416, 'samples': 8462976, 'steps': 44077, 'loss/train': 1.571046233177185} 11/07/2021 03:24:22 - INFO - __main__ - Step 44079: {'lr': 0.0004067327349256142, 'samples': 8463168, 'steps': 44078, 'loss/train': 1.288905382156372} 11/07/2021 03:24:22 - INFO - __main__ - Step 44080: {'lr': 0.00040672860053933286, 'samples': 8463360, 'steps': 44079, 'loss/train': 1.5099854469299316} 11/07/2021 03:24:22 - INFO - __main__ - Step 44081: {'lr': 0.00040672446608243194, 'samples': 8463552, 'steps': 44080, 'loss/train': 1.4610154628753662} 11/07/2021 03:24:23 - INFO - __main__ - Step 44082: {'lr': 0.0004067203315549135, 'samples': 8463744, 'steps': 44081, 'loss/train': 1.413544774055481} 11/07/2021 03:24:24 - INFO - __main__ - Step 44083: {'lr': 0.00040671619695677923, 'samples': 8463936, 'steps': 44082, 'loss/train': 1.3504700660705566} 11/07/2021 03:24:24 - INFO - __main__ - Step 44084: {'lr': 0.00040671206228803117, 'samples': 8464128, 'steps': 44083, 'loss/train': 1.6227068901062012} 11/07/2021 03:24:24 - INFO - __main__ - Step 44085: {'lr': 0.0004067079275486709, 'samples': 8464320, 'steps': 44084, 'loss/train': 0.2816094756126404} 11/07/2021 03:24:25 - INFO - __main__ - Step 44086: {'lr': 0.00040670379273870054, 'samples': 8464512, 'steps': 44085, 'loss/train': 1.8726481199264526} 11/07/2021 03:24:25 - INFO - __main__ - Step 44087: {'lr': 0.00040669965785812193, 'samples': 8464704, 'steps': 44086, 'loss/train': 1.5070987939834595} 11/07/2021 03:24:25 - INFO - __main__ - Step 44088: {'lr': 0.00040669552290693677, 'samples': 8464896, 'steps': 44087, 'loss/train': 1.7143826484680176} 11/07/2021 03:24:27 - INFO - __main__ - Step 44089: {'lr': 0.0004066913878851471, 'samples': 8465088, 'steps': 44088, 'loss/train': 1.077122688293457} 11/07/2021 03:24:27 - INFO - __main__ - Step 44090: {'lr': 0.00040668725279275464, 'samples': 8465280, 'steps': 44089, 'loss/train': 1.257422685623169} 11/07/2021 03:24:27 - INFO - __main__ - Step 44091: {'lr': 0.0004066831176297614, 'samples': 8465472, 'steps': 44090, 'loss/train': 1.3948299884796143} 11/07/2021 03:24:28 - INFO - __main__ - Step 44092: {'lr': 0.0004066789823961691, 'samples': 8465664, 'steps': 44091, 'loss/train': 1.5024408102035522} 11/07/2021 03:24:28 - INFO - __main__ - Step 44093: {'lr': 0.00040667484709197967, 'samples': 8465856, 'steps': 44092, 'loss/train': 0.9710264205932617} 11/07/2021 03:24:29 - INFO - __main__ - Step 44094: {'lr': 0.00040667071171719503, 'samples': 8466048, 'steps': 44093, 'loss/train': 1.559989094734192} 11/07/2021 03:24:29 - INFO - __main__ - Step 44095: {'lr': 0.00040666657627181697, 'samples': 8466240, 'steps': 44094, 'loss/train': 1.1567920446395874} 11/07/2021 03:24:30 - INFO - __main__ - Step 44096: {'lr': 0.00040666244075584736, 'samples': 8466432, 'steps': 44095, 'loss/train': 1.9180774688720703} 11/07/2021 03:24:30 - INFO - __main__ - Step 44097: {'lr': 0.000406658305169288, 'samples': 8466624, 'steps': 44096, 'loss/train': 1.5806243419647217} 11/07/2021 03:24:30 - INFO - __main__ - Step 44098: {'lr': 0.000406654169512141, 'samples': 8466816, 'steps': 44097, 'loss/train': 1.4634579420089722} 11/07/2021 03:24:32 - INFO - __main__ - Step 44099: {'lr': 0.0004066500337844078, 'samples': 8467008, 'steps': 44098, 'loss/train': 1.3353631496429443} 11/07/2021 03:24:32 - INFO - __main__ - Step 44100: {'lr': 0.0004066458979860907, 'samples': 8467200, 'steps': 44099, 'loss/train': 1.8623902797698975} 11/07/2021 03:24:33 - INFO - __main__ - Step 44101: {'lr': 0.00040664176211719136, 'samples': 8467392, 'steps': 44100, 'loss/train': 1.426758885383606} 11/07/2021 03:24:33 - INFO - __main__ - Step 44102: {'lr': 0.00040663762617771163, 'samples': 8467584, 'steps': 44101, 'loss/train': 1.5183041095733643} 11/07/2021 03:24:33 - INFO - __main__ - Step 44103: {'lr': 0.00040663349016765337, 'samples': 8467776, 'steps': 44102, 'loss/train': 1.2168281078338623} 11/07/2021 03:24:34 - INFO - __main__ - Step 44104: {'lr': 0.00040662935408701853, 'samples': 8467968, 'steps': 44103, 'loss/train': 1.5888041257858276} 11/07/2021 03:24:35 - INFO - __main__ - Step 44105: {'lr': 0.00040662521793580886, 'samples': 8468160, 'steps': 44104, 'loss/train': 0.3741858899593353} 11/07/2021 03:24:35 - INFO - __main__ - Step 44106: {'lr': 0.0004066210817140263, 'samples': 8468352, 'steps': 44105, 'loss/train': 1.7300732135772705} 11/07/2021 03:24:35 - INFO - __main__ - Step 44107: {'lr': 0.0004066169454216727, 'samples': 8468544, 'steps': 44106, 'loss/train': 2.1087207794189453} 11/07/2021 03:24:36 - INFO - __main__ - Step 44108: {'lr': 0.00040661280905875, 'samples': 8468736, 'steps': 44107, 'loss/train': 1.2548373937606812} 11/07/2021 03:24:36 - INFO - __main__ - Step 44109: {'lr': 0.0004066086726252599, 'samples': 8468928, 'steps': 44108, 'loss/train': 1.4089781045913696} 11/07/2021 03:24:37 - INFO - __main__ - Step 44110: {'lr': 0.0004066045361212043, 'samples': 8469120, 'steps': 44109, 'loss/train': 0.9372249245643616} 11/07/2021 03:24:37 - INFO - __main__ - Step 44111: {'lr': 0.00040660039954658523, 'samples': 8469312, 'steps': 44110, 'loss/train': 1.983938455581665} 11/07/2021 03:24:38 - INFO - __main__ - Step 44112: {'lr': 0.0004065962629014044, 'samples': 8469504, 'steps': 44111, 'loss/train': 1.4359891414642334} 11/07/2021 03:24:38 - INFO - __main__ - Step 44113: {'lr': 0.00040659212618566364, 'samples': 8469696, 'steps': 44112, 'loss/train': 1.4150471687316895} 11/07/2021 03:24:39 - INFO - __main__ - Step 44114: {'lr': 0.000406587989399365, 'samples': 8469888, 'steps': 44113, 'loss/train': 1.5617057085037231} 11/07/2021 03:24:40 - INFO - __main__ - Step 44115: {'lr': 0.0004065838525425102, 'samples': 8470080, 'steps': 44114, 'loss/train': 0.9213770031929016} 11/07/2021 03:24:40 - INFO - __main__ - Step 44116: {'lr': 0.00040657971561510104, 'samples': 8470272, 'steps': 44115, 'loss/train': 1.8451734781265259} 11/07/2021 03:24:40 - INFO - __main__ - Step 44117: {'lr': 0.00040657557861713956, 'samples': 8470464, 'steps': 44116, 'loss/train': 1.2429790496826172} 11/07/2021 03:24:41 - INFO - __main__ - Step 44118: {'lr': 0.00040657144154862746, 'samples': 8470656, 'steps': 44117, 'loss/train': 1.3513708114624023} 11/07/2021 03:24:41 - INFO - __main__ - Step 44119: {'lr': 0.00040656730440956677, 'samples': 8470848, 'steps': 44118, 'loss/train': 1.3201446533203125} 11/07/2021 03:24:42 - INFO - __main__ - Step 44120: {'lr': 0.0004065631671999592, 'samples': 8471040, 'steps': 44119, 'loss/train': 1.51387357711792} 11/07/2021 03:24:43 - INFO - __main__ - Step 44121: {'lr': 0.0004065590299198068, 'samples': 8471232, 'steps': 44120, 'loss/train': 1.7190121412277222} 11/07/2021 03:24:43 - INFO - __main__ - Step 44122: {'lr': 0.00040655489256911123, 'samples': 8471424, 'steps': 44121, 'loss/train': 1.5446957349777222} 11/07/2021 03:24:43 - INFO - __main__ - Step 44123: {'lr': 0.00040655075514787445, 'samples': 8471616, 'steps': 44122, 'loss/train': 1.3534348011016846} 11/07/2021 03:24:44 - INFO - __main__ - Step 44124: {'lr': 0.0004065466176560983, 'samples': 8471808, 'steps': 44123, 'loss/train': 1.3408905267715454} 11/07/2021 03:24:45 - INFO - __main__ - Step 44125: {'lr': 0.0004065424800937847, 'samples': 8472000, 'steps': 44124, 'loss/train': 1.0104137659072876} 11/07/2021 03:24:45 - INFO - __main__ - Step 44126: {'lr': 0.0004065383424609354, 'samples': 8472192, 'steps': 44125, 'loss/train': 1.5084606409072876} 11/07/2021 03:24:45 - INFO - __main__ - Step 44127: {'lr': 0.00040653420475755245, 'samples': 8472384, 'steps': 44126, 'loss/train': 1.3555549383163452} 11/07/2021 03:24:46 - INFO - __main__ - Step 44128: {'lr': 0.0004065300669836375, 'samples': 8472576, 'steps': 44127, 'loss/train': 1.8347290754318237} 11/07/2021 03:24:46 - INFO - __main__ - Step 44129: {'lr': 0.0004065259291391926, 'samples': 8472768, 'steps': 44128, 'loss/train': 1.2331953048706055} 11/07/2021 03:24:47 - INFO - __main__ - Step 44130: {'lr': 0.0004065217912242195, 'samples': 8472960, 'steps': 44129, 'loss/train': 1.6485356092453003} 11/07/2021 03:24:47 - INFO - __main__ - Step 44131: {'lr': 0.00040651765323872, 'samples': 8473152, 'steps': 44130, 'loss/train': 1.753224492073059} 11/07/2021 03:24:48 - INFO - __main__ - Step 44132: {'lr': 0.0004065135151826962, 'samples': 8473344, 'steps': 44131, 'loss/train': 1.2589049339294434} 11/07/2021 03:24:48 - INFO - __main__ - Step 44133: {'lr': 0.00040650937705614975, 'samples': 8473536, 'steps': 44132, 'loss/train': 1.6444562673568726} 11/07/2021 03:24:48 - INFO - __main__ - Step 44134: {'lr': 0.0004065052388590826, 'samples': 8473728, 'steps': 44133, 'loss/train': 1.6255830526351929} 11/07/2021 03:24:50 - INFO - __main__ - Step 44135: {'lr': 0.00040650110059149664, 'samples': 8473920, 'steps': 44134, 'loss/train': 1.9925587177276611} 11/07/2021 03:24:50 - INFO - __main__ - Step 44136: {'lr': 0.0004064969622533937, 'samples': 8474112, 'steps': 44135, 'loss/train': 1.1963980197906494} 11/07/2021 03:24:50 - INFO - __main__ - Step 44137: {'lr': 0.0004064928238447756, 'samples': 8474304, 'steps': 44136, 'loss/train': 1.5025725364685059} 11/07/2021 03:24:51 - INFO - __main__ - Step 44138: {'lr': 0.00040648868536564427, 'samples': 8474496, 'steps': 44137, 'loss/train': 1.430065631866455} 11/07/2021 03:24:51 - INFO - __main__ - Step 44139: {'lr': 0.00040648454681600153, 'samples': 8474688, 'steps': 44138, 'loss/train': 1.4614596366882324} 11/07/2021 03:24:51 - INFO - __main__ - Step 44140: {'lr': 0.0004064804081958493, 'samples': 8474880, 'steps': 44139, 'loss/train': 1.4588491916656494} 11/07/2021 03:24:52 - INFO - __main__ - Step 44141: {'lr': 0.00040647626950518945, 'samples': 8475072, 'steps': 44140, 'loss/train': 1.6638232469558716} 11/07/2021 03:24:53 - INFO - __main__ - Step 44142: {'lr': 0.00040647213074402374, 'samples': 8475264, 'steps': 44141, 'loss/train': 1.2178179025650024} 11/07/2021 03:24:53 - INFO - __main__ - Step 44143: {'lr': 0.0004064679919123541, 'samples': 8475456, 'steps': 44142, 'loss/train': 1.2608200311660767} 11/07/2021 03:24:53 - INFO - __main__ - Step 44144: {'lr': 0.00040646385301018243, 'samples': 8475648, 'steps': 44143, 'loss/train': 1.7908116579055786} 11/07/2021 03:24:54 - INFO - __main__ - Step 44145: {'lr': 0.0004064597140375105, 'samples': 8475840, 'steps': 44144, 'loss/train': 0.6181836128234863} 11/07/2021 03:24:55 - INFO - __main__ - Step 44146: {'lr': 0.00040645557499434035, 'samples': 8476032, 'steps': 44145, 'loss/train': 1.198376178741455} 11/07/2021 03:24:55 - INFO - __main__ - Step 44147: {'lr': 0.0004064514358806737, 'samples': 8476224, 'steps': 44146, 'loss/train': 1.6143416166305542} 11/07/2021 03:24:55 - INFO - __main__ - Step 44148: {'lr': 0.00040644729669651235, 'samples': 8476416, 'steps': 44147, 'loss/train': 1.154239535331726} 11/07/2021 03:24:56 - INFO - __main__ - Step 44149: {'lr': 0.0004064431574418583, 'samples': 8476608, 'steps': 44148, 'loss/train': 1.6307626962661743} 11/07/2021 03:24:56 - INFO - __main__ - Step 44150: {'lr': 0.00040643901811671345, 'samples': 8476800, 'steps': 44149, 'loss/train': 1.8045600652694702} 11/07/2021 03:24:57 - INFO - __main__ - Step 44151: {'lr': 0.0004064348787210795, 'samples': 8476992, 'steps': 44150, 'loss/train': 0.6478796005249023} 11/07/2021 03:24:58 - INFO - __main__ - Step 44152: {'lr': 0.0004064307392549585, 'samples': 8477184, 'steps': 44151, 'loss/train': 1.6127792596817017} 11/07/2021 03:24:58 - INFO - __main__ - Step 44153: {'lr': 0.00040642659971835217, 'samples': 8477376, 'steps': 44152, 'loss/train': 1.827341079711914} 11/07/2021 03:24:58 - INFO - __main__ - Step 44154: {'lr': 0.0004064224601112625, 'samples': 8477568, 'steps': 44153, 'loss/train': 1.2901002168655396} 11/07/2021 03:24:59 - INFO - __main__ - Step 44155: {'lr': 0.0004064183204336912, 'samples': 8477760, 'steps': 44154, 'loss/train': 2.1328094005584717} 11/07/2021 03:25:00 - INFO - __main__ - Step 44156: {'lr': 0.00040641418068564024, 'samples': 8477952, 'steps': 44155, 'loss/train': 1.2231348752975464} 11/07/2021 03:25:00 - INFO - __main__ - Step 44157: {'lr': 0.0004064100408671114, 'samples': 8478144, 'steps': 44156, 'loss/train': 1.8266414403915405} 11/07/2021 03:25:00 - INFO - __main__ - Step 44158: {'lr': 0.0004064059009781067, 'samples': 8478336, 'steps': 44157, 'loss/train': 1.670493245124817} 11/07/2021 03:25:01 - INFO - __main__ - Step 44159: {'lr': 0.0004064017610186279, 'samples': 8478528, 'steps': 44158, 'loss/train': 1.9124197959899902} 11/07/2021 03:25:01 - INFO - __main__ - Step 44160: {'lr': 0.00040639762098867684, 'samples': 8478720, 'steps': 44159, 'loss/train': 1.1427425146102905} 11/07/2021 03:25:02 - INFO - __main__ - Step 44161: {'lr': 0.0004063934808882555, 'samples': 8478912, 'steps': 44160, 'loss/train': 1.389540433883667} 11/07/2021 03:25:02 - INFO - __main__ - Step 44162: {'lr': 0.0004063893407173656, 'samples': 8479104, 'steps': 44161, 'loss/train': 1.037030577659607} 11/07/2021 03:25:03 - INFO - __main__ - Step 44163: {'lr': 0.00040638520047600916, 'samples': 8479296, 'steps': 44162, 'loss/train': 1.5312376022338867} 11/07/2021 03:25:03 - INFO - __main__ - Step 44164: {'lr': 0.00040638106016418785, 'samples': 8479488, 'steps': 44163, 'loss/train': 1.4833199977874756} 11/07/2021 03:25:03 - INFO - __main__ - Step 44165: {'lr': 0.0004063769197819037, 'samples': 8479680, 'steps': 44164, 'loss/train': 1.6046035289764404} 11/07/2021 03:25:05 - INFO - __main__ - Step 44166: {'lr': 0.0004063727793291585, 'samples': 8479872, 'steps': 44165, 'loss/train': 1.6139516830444336} 11/07/2021 03:25:05 - INFO - __main__ - Step 44167: {'lr': 0.00040636863880595415, 'samples': 8480064, 'steps': 44166, 'loss/train': 1.361234426498413} 11/07/2021 03:25:05 - INFO - __main__ - Step 44168: {'lr': 0.0004063644982122926, 'samples': 8480256, 'steps': 44167, 'loss/train': 1.920316457748413} 11/07/2021 03:25:06 - INFO - __main__ - Step 44169: {'lr': 0.00040636035754817545, 'samples': 8480448, 'steps': 44168, 'loss/train': 1.5258365869522095} 11/07/2021 03:25:06 - INFO - __main__ - Step 44170: {'lr': 0.00040635621681360485, 'samples': 8480640, 'steps': 44169, 'loss/train': 1.5655819177627563} 11/07/2021 03:25:06 - INFO - __main__ - Step 44171: {'lr': 0.00040635207600858247, 'samples': 8480832, 'steps': 44170, 'loss/train': 1.3133772611618042} 11/07/2021 03:25:07 - INFO - __main__ - Step 44172: {'lr': 0.00040634793513311037, 'samples': 8481024, 'steps': 44171, 'loss/train': 1.8161416053771973} 11/07/2021 03:25:08 - INFO - __main__ - Step 44173: {'lr': 0.0004063437941871903, 'samples': 8481216, 'steps': 44172, 'loss/train': 0.9237746000289917} 11/07/2021 03:25:08 - INFO - __main__ - Step 44174: {'lr': 0.000406339653170824, 'samples': 8481408, 'steps': 44173, 'loss/train': 1.6315330266952515} 11/07/2021 03:25:08 - INFO - __main__ - Step 44175: {'lr': 0.00040633551208401356, 'samples': 8481600, 'steps': 44174, 'loss/train': 1.2439640760421753} 11/07/2021 03:25:09 - INFO - __main__ - Step 44176: {'lr': 0.0004063313709267607, 'samples': 8481792, 'steps': 44175, 'loss/train': 1.1543171405792236} 11/07/2021 03:25:10 - INFO - __main__ - Step 44177: {'lr': 0.0004063272296990674, 'samples': 8481984, 'steps': 44176, 'loss/train': 1.2038911581039429} 11/07/2021 03:25:10 - INFO - __main__ - Step 44178: {'lr': 0.00040632308840093533, 'samples': 8482176, 'steps': 44177, 'loss/train': 1.9420859813690186} 11/07/2021 03:25:10 - INFO - __main__ - Step 44179: {'lr': 0.0004063189470323666, 'samples': 8482368, 'steps': 44178, 'loss/train': 1.3965636491775513} 11/07/2021 03:25:11 - INFO - __main__ - Step 44180: {'lr': 0.000406314805593363, 'samples': 8482560, 'steps': 44179, 'loss/train': 1.4306188821792603} 11/07/2021 03:25:11 - INFO - __main__ - Step 44181: {'lr': 0.00040631066408392636, 'samples': 8482752, 'steps': 44180, 'loss/train': 0.47852659225463867} 11/07/2021 03:25:12 - INFO - __main__ - Step 44182: {'lr': 0.0004063065225040584, 'samples': 8482944, 'steps': 44181, 'loss/train': 1.3241499662399292} 11/07/2021 03:25:12 - INFO - __main__ - Step 44183: {'lr': 0.0004063023808537613, 'samples': 8483136, 'steps': 44182, 'loss/train': 1.2116507291793823} 11/07/2021 03:25:13 - INFO - __main__ - Step 44184: {'lr': 0.00040629823913303665, 'samples': 8483328, 'steps': 44183, 'loss/train': 0.8394481539726257} 11/07/2021 03:25:13 - INFO - __main__ - Step 44185: {'lr': 0.0004062940973418865, 'samples': 8483520, 'steps': 44184, 'loss/train': 1.467079758644104} 11/07/2021 03:25:14 - INFO - __main__ - Step 44186: {'lr': 0.00040628995548031254, 'samples': 8483712, 'steps': 44185, 'loss/train': 1.486828327178955} 11/07/2021 03:25:15 - INFO - __main__ - Step 44187: {'lr': 0.00040628581354831687, 'samples': 8483904, 'steps': 44186, 'loss/train': 1.712790608406067} 11/07/2021 03:25:15 - INFO - __main__ - Step 44188: {'lr': 0.0004062816715459011, 'samples': 8484096, 'steps': 44187, 'loss/train': 1.5028607845306396} 11/07/2021 03:25:15 - INFO - __main__ - Step 44189: {'lr': 0.0004062775294730673, 'samples': 8484288, 'steps': 44188, 'loss/train': 1.4444500207901} 11/07/2021 03:25:16 - INFO - __main__ - Step 44190: {'lr': 0.0004062733873298172, 'samples': 8484480, 'steps': 44189, 'loss/train': 1.873066782951355} 11/07/2021 03:25:16 - INFO - __main__ - Step 44191: {'lr': 0.0004062692451161528, 'samples': 8484672, 'steps': 44190, 'loss/train': 1.6633840799331665} 11/07/2021 03:25:17 - INFO - __main__ - Step 44192: {'lr': 0.00040626510283207586, 'samples': 8484864, 'steps': 44191, 'loss/train': 1.141286015510559} 11/07/2021 03:25:18 - INFO - __main__ - Step 44193: {'lr': 0.00040626096047758823, 'samples': 8485056, 'steps': 44192, 'loss/train': 1.1355482339859009} 11/07/2021 03:25:18 - INFO - __main__ - Step 44194: {'lr': 0.0004062568180526919, 'samples': 8485248, 'steps': 44193, 'loss/train': 0.19759449362754822} 11/07/2021 03:25:18 - INFO - __main__ - Step 44195: {'lr': 0.0004062526755573886, 'samples': 8485440, 'steps': 44194, 'loss/train': 1.320232629776001} 11/07/2021 03:25:19 - INFO - __main__ - Step 44196: {'lr': 0.00040624853299168025, 'samples': 8485632, 'steps': 44195, 'loss/train': 1.5903716087341309} 11/07/2021 03:25:20 - INFO - __main__ - Step 44197: {'lr': 0.0004062443903555687, 'samples': 8485824, 'steps': 44196, 'loss/train': 1.3376597166061401} 11/07/2021 03:25:20 - INFO - __main__ - Step 44198: {'lr': 0.0004062402476490559, 'samples': 8486016, 'steps': 44197, 'loss/train': 1.3712801933288574} 11/07/2021 03:25:20 - INFO - __main__ - Step 44199: {'lr': 0.00040623610487214366, 'samples': 8486208, 'steps': 44198, 'loss/train': 1.0875322818756104} 11/07/2021 03:25:21 - INFO - __main__ - Step 44200: {'lr': 0.0004062319620248338, 'samples': 8486400, 'steps': 44199, 'loss/train': 1.6165579557418823} 11/07/2021 03:25:21 - INFO - __main__ - Step 44201: {'lr': 0.00040622781910712826, 'samples': 8486592, 'steps': 44200, 'loss/train': 1.1684272289276123} 11/07/2021 03:25:22 - INFO - __main__ - Step 44202: {'lr': 0.00040622367611902886, 'samples': 8486784, 'steps': 44201, 'loss/train': 1.704862117767334} 11/07/2021 03:25:23 - INFO - __main__ - Step 44203: {'lr': 0.0004062195330605375, 'samples': 8486976, 'steps': 44202, 'loss/train': 1.402276873588562} 11/07/2021 03:25:23 - INFO - __main__ - Step 44204: {'lr': 0.000406215389931656, 'samples': 8487168, 'steps': 44203, 'loss/train': 1.4944804906845093} 11/07/2021 03:25:23 - INFO - __main__ - Step 44205: {'lr': 0.0004062112467323863, 'samples': 8487360, 'steps': 44204, 'loss/train': 1.7162210941314697} 11/07/2021 03:25:24 - INFO - __main__ - Step 44206: {'lr': 0.00040620710346273015, 'samples': 8487552, 'steps': 44205, 'loss/train': 1.2356802225112915} 11/07/2021 03:25:24 - INFO - __main__ - Step 44207: {'lr': 0.00040620296012268956, 'samples': 8487744, 'steps': 44206, 'loss/train': 1.6394577026367188} 11/07/2021 03:25:25 - INFO - __main__ - Step 44208: {'lr': 0.0004061988167122663, 'samples': 8487936, 'steps': 44207, 'loss/train': 0.7263883352279663} 11/07/2021 03:25:25 - INFO - __main__ - Step 44209: {'lr': 0.00040619467323146224, 'samples': 8488128, 'steps': 44208, 'loss/train': 0.44673678278923035} 11/07/2021 03:25:26 - INFO - __main__ - Step 44210: {'lr': 0.0004061905296802793, 'samples': 8488320, 'steps': 44209, 'loss/train': 1.4260218143463135} 11/07/2021 03:25:26 - INFO - __main__ - Step 44211: {'lr': 0.00040618638605871934, 'samples': 8488512, 'steps': 44210, 'loss/train': 0.9377465844154358} 11/07/2021 03:25:27 - INFO - __main__ - Step 44212: {'lr': 0.00040618224236678413, 'samples': 8488704, 'steps': 44211, 'loss/train': 1.746358871459961} 11/07/2021 03:25:27 - INFO - __main__ - Step 44213: {'lr': 0.00040617809860447564, 'samples': 8488896, 'steps': 44212, 'loss/train': 0.8359420299530029} 11/07/2021 03:25:28 - INFO - __main__ - Step 44214: {'lr': 0.00040617395477179577, 'samples': 8489088, 'steps': 44213, 'loss/train': 1.3058966398239136} 11/07/2021 03:25:29 - INFO - __main__ - Step 44215: {'lr': 0.0004061698108687463, 'samples': 8489280, 'steps': 44214, 'loss/train': 1.6434102058410645} 11/07/2021 03:25:29 - INFO - __main__ - Step 44216: {'lr': 0.00040616566689532905, 'samples': 8489472, 'steps': 44215, 'loss/train': 1.4960986375808716} 11/07/2021 03:25:29 - INFO - __main__ - Step 44217: {'lr': 0.00040616152285154607, 'samples': 8489664, 'steps': 44216, 'loss/train': 1.7768678665161133} 11/07/2021 03:25:30 - INFO - __main__ - Step 44218: {'lr': 0.000406157378737399, 'samples': 8489856, 'steps': 44217, 'loss/train': 1.7725777626037598} 11/07/2021 03:25:30 - INFO - __main__ - Step 44219: {'lr': 0.0004061532345528899, 'samples': 8490048, 'steps': 44218, 'loss/train': 1.3695653676986694} 11/07/2021 03:25:31 - INFO - __main__ - Step 44220: {'lr': 0.00040614909029802054, 'samples': 8490240, 'steps': 44219, 'loss/train': 1.4920202493667603} 11/07/2021 03:25:32 - INFO - __main__ - Step 44221: {'lr': 0.0004061449459727928, 'samples': 8490432, 'steps': 44220, 'loss/train': 1.9846137762069702} 11/07/2021 03:25:32 - INFO - __main__ - Step 44222: {'lr': 0.0004061408015772086, 'samples': 8490624, 'steps': 44221, 'loss/train': 1.5419899225234985} 11/07/2021 03:25:32 - INFO - __main__ - Step 44223: {'lr': 0.0004061366571112698, 'samples': 8490816, 'steps': 44222, 'loss/train': 1.380388617515564} 11/07/2021 03:25:33 - INFO - __main__ - Step 44224: {'lr': 0.0004061325125749781, 'samples': 8491008, 'steps': 44223, 'loss/train': 1.3789750337600708} 11/07/2021 03:25:34 - INFO - __main__ - Step 44225: {'lr': 0.00040612836796833556, 'samples': 8491200, 'steps': 44224, 'loss/train': 1.9082430601119995} 11/07/2021 03:25:34 - INFO - __main__ - Step 44226: {'lr': 0.000406124223291344, 'samples': 8491392, 'steps': 44225, 'loss/train': 2.0185680389404297} 11/07/2021 03:25:34 - INFO - __main__ - Step 44227: {'lr': 0.0004061200785440052, 'samples': 8491584, 'steps': 44226, 'loss/train': 1.3868499994277954} 11/07/2021 03:25:35 - INFO - __main__ - Step 44228: {'lr': 0.0004061159337263213, 'samples': 8491776, 'steps': 44227, 'loss/train': 1.6316428184509277} 11/07/2021 03:25:35 - INFO - __main__ - Step 44229: {'lr': 0.0004061117888382938, 'samples': 8491968, 'steps': 44228, 'loss/train': 1.5498803853988647} 11/07/2021 03:25:36 - INFO - __main__ - Step 44230: {'lr': 0.00040610764387992475, 'samples': 8492160, 'steps': 44229, 'loss/train': 1.3144543170928955} 11/07/2021 03:25:37 - INFO - __main__ - Step 44231: {'lr': 0.0004061034988512161, 'samples': 8492352, 'steps': 44230, 'loss/train': 1.8794748783111572} 11/07/2021 03:25:37 - INFO - __main__ - Step 44232: {'lr': 0.0004060993537521695, 'samples': 8492544, 'steps': 44231, 'loss/train': 5.840709686279297} 11/07/2021 03:25:37 - INFO - __main__ - Step 44233: {'lr': 0.00040609520858278704, 'samples': 8492736, 'steps': 44232, 'loss/train': 1.6199346780776978} 11/07/2021 03:25:38 - INFO - __main__ - Step 44234: {'lr': 0.0004060910633430704, 'samples': 8492928, 'steps': 44233, 'loss/train': 1.3364590406417847} 11/07/2021 03:25:38 - INFO - __main__ - Step 44235: {'lr': 0.0004060869180330216, 'samples': 8493120, 'steps': 44234, 'loss/train': 1.4359380006790161} 11/07/2021 03:25:39 - INFO - __main__ - Step 44236: {'lr': 0.00040608277265264243, 'samples': 8493312, 'steps': 44235, 'loss/train': 1.476361870765686} 11/07/2021 03:25:39 - INFO - __main__ - Step 44237: {'lr': 0.0004060786272019348, 'samples': 8493504, 'steps': 44236, 'loss/train': 1.8258213996887207} 11/07/2021 03:25:40 - INFO - __main__ - Step 44238: {'lr': 0.00040607448168090044, 'samples': 8493696, 'steps': 44237, 'loss/train': 1.4443018436431885} 11/07/2021 03:25:40 - INFO - __main__ - Step 44239: {'lr': 0.00040607033608954136, 'samples': 8493888, 'steps': 44238, 'loss/train': 1.3991117477416992} 11/07/2021 03:25:40 - INFO - __main__ - Step 44240: {'lr': 0.0004060661904278595, 'samples': 8494080, 'steps': 44239, 'loss/train': 0.4276210069656372} 11/07/2021 03:25:41 - INFO - __main__ - Step 44241: {'lr': 0.0004060620446958565, 'samples': 8494272, 'steps': 44240, 'loss/train': 1.401721715927124} 11/07/2021 03:25:42 - INFO - __main__ - Step 44242: {'lr': 0.00040605789889353445, 'samples': 8494464, 'steps': 44241, 'loss/train': 1.4835010766983032} 11/07/2021 03:25:42 - INFO - __main__ - Step 44243: {'lr': 0.00040605375302089507, 'samples': 8494656, 'steps': 44242, 'loss/train': 1.3869136571884155} 11/07/2021 03:25:42 - INFO - __main__ - Step 44244: {'lr': 0.00040604960707794023, 'samples': 8494848, 'steps': 44243, 'loss/train': 1.373746395111084} 11/07/2021 03:25:43 - INFO - __main__ - Step 44245: {'lr': 0.00040604546106467196, 'samples': 8495040, 'steps': 44244, 'loss/train': 1.4466519355773926} 11/07/2021 03:25:43 - INFO - __main__ - Step 44246: {'lr': 0.00040604131498109193, 'samples': 8495232, 'steps': 44245, 'loss/train': 1.67646062374115} 11/07/2021 03:25:44 - INFO - __main__ - Step 44247: {'lr': 0.0004060371688272021, 'samples': 8495424, 'steps': 44246, 'loss/train': 1.4390851259231567} 11/07/2021 03:25:45 - INFO - __main__ - Step 44248: {'lr': 0.00040603302260300435, 'samples': 8495616, 'steps': 44247, 'loss/train': 1.0876033306121826} 11/07/2021 03:25:45 - INFO - __main__ - Step 44249: {'lr': 0.00040602887630850055, 'samples': 8495808, 'steps': 44248, 'loss/train': 1.3555651903152466} 11/07/2021 03:25:45 - INFO - __main__ - Step 44250: {'lr': 0.0004060247299436925, 'samples': 8496000, 'steps': 44249, 'loss/train': 2.00783371925354} 11/07/2021 03:25:46 - INFO - __main__ - Step 44251: {'lr': 0.0004060205835085821, 'samples': 8496192, 'steps': 44250, 'loss/train': 1.2769891023635864} 11/07/2021 03:25:46 - INFO - __main__ - Step 44252: {'lr': 0.00040601643700317126, 'samples': 8496384, 'steps': 44251, 'loss/train': 1.2505950927734375} 11/07/2021 03:25:47 - INFO - __main__ - Step 44253: {'lr': 0.0004060122904274618, 'samples': 8496576, 'steps': 44252, 'loss/train': 1.2134746313095093} 11/07/2021 03:25:47 - INFO - __main__ - Step 44254: {'lr': 0.0004060081437814557, 'samples': 8496768, 'steps': 44253, 'loss/train': 1.798974633216858} 11/07/2021 03:25:48 - INFO - __main__ - Step 44255: {'lr': 0.00040600399706515466, 'samples': 8496960, 'steps': 44254, 'loss/train': 1.435835599899292} 11/07/2021 03:25:48 - INFO - __main__ - Step 44256: {'lr': 0.0004059998502785606, 'samples': 8497152, 'steps': 44255, 'loss/train': 1.6909722089767456} 11/07/2021 03:25:49 - INFO - __main__ - Step 44257: {'lr': 0.0004059957034216755, 'samples': 8497344, 'steps': 44256, 'loss/train': 1.5016534328460693} 11/07/2021 03:25:49 - INFO - __main__ - Step 44258: {'lr': 0.00040599155649450106, 'samples': 8497536, 'steps': 44257, 'loss/train': 1.8113622665405273} 11/07/2021 03:25:50 - INFO - __main__ - Step 44259: {'lr': 0.00040598740949703927, 'samples': 8497728, 'steps': 44258, 'loss/train': 1.6369322538375854} 11/07/2021 03:25:50 - INFO - __main__ - Step 44260: {'lr': 0.00040598326242929195, 'samples': 8497920, 'steps': 44259, 'loss/train': 1.4528487920761108} 11/07/2021 03:25:50 - INFO - __main__ - Step 44261: {'lr': 0.00040597911529126096, 'samples': 8498112, 'steps': 44260, 'loss/train': 1.588715672492981} 11/07/2021 03:25:52 - INFO - __main__ - Step 44262: {'lr': 0.00040597496808294825, 'samples': 8498304, 'steps': 44261, 'loss/train': 1.552764654159546} 11/07/2021 03:25:52 - INFO - __main__ - Step 44263: {'lr': 0.0004059708208043556, 'samples': 8498496, 'steps': 44262, 'loss/train': 2.3060762882232666} 11/07/2021 03:25:52 - INFO - __main__ - Step 44264: {'lr': 0.00040596667345548486, 'samples': 8498688, 'steps': 44263, 'loss/train': 0.9599685668945312} 11/07/2021 03:25:53 - INFO - __main__ - Step 44265: {'lr': 0.00040596252603633797, 'samples': 8498880, 'steps': 44264, 'loss/train': 1.3154436349868774} 11/07/2021 03:25:53 - INFO - __main__ - Step 44266: {'lr': 0.0004059583785469168, 'samples': 8499072, 'steps': 44265, 'loss/train': 1.1292322874069214} 11/07/2021 03:25:54 - INFO - __main__ - Step 44267: {'lr': 0.00040595423098722315, 'samples': 8499264, 'steps': 44266, 'loss/train': 1.3291518688201904} 11/07/2021 03:25:54 - INFO - __main__ - Step 44268: {'lr': 0.000405950083357259, 'samples': 8499456, 'steps': 44267, 'loss/train': 1.5495879650115967} 11/07/2021 03:25:55 - INFO - __main__ - Step 44269: {'lr': 0.0004059459356570261, 'samples': 8499648, 'steps': 44268, 'loss/train': 1.3936069011688232} 11/07/2021 03:25:55 - INFO - __main__ - Step 44270: {'lr': 0.00040594178788652636, 'samples': 8499840, 'steps': 44269, 'loss/train': 1.0680961608886719} 11/07/2021 03:25:55 - INFO - __main__ - Step 44271: {'lr': 0.00040593764004576166, 'samples': 8500032, 'steps': 44270, 'loss/train': 0.7102341651916504} 11/07/2021 03:25:57 - INFO - __main__ - Step 44272: {'lr': 0.0004059334921347339, 'samples': 8500224, 'steps': 44271, 'loss/train': 1.6032512187957764} 11/07/2021 03:25:57 - INFO - __main__ - Step 44273: {'lr': 0.00040592934415344486, 'samples': 8500416, 'steps': 44272, 'loss/train': 1.1632397174835205} 11/07/2021 03:25:57 - INFO - __main__ - Step 44274: {'lr': 0.0004059251961018965, 'samples': 8500608, 'steps': 44273, 'loss/train': 1.0936400890350342} 11/07/2021 03:25:58 - INFO - __main__ - Step 44275: {'lr': 0.00040592104798009066, 'samples': 8500800, 'steps': 44274, 'loss/train': 1.1963216066360474} 11/07/2021 03:25:58 - INFO - __main__ - Step 44276: {'lr': 0.00040591689978802917, 'samples': 8500992, 'steps': 44275, 'loss/train': 1.8550727367401123} 11/07/2021 03:26:00 - INFO - __main__ - Step 44277: {'lr': 0.0004059127515257139, 'samples': 8501184, 'steps': 44276, 'loss/train': 1.220307469367981} 11/07/2021 03:26:00 - INFO - __main__ - Step 44278: {'lr': 0.0004059086031931468, 'samples': 8501376, 'steps': 44277, 'loss/train': 1.9647818803787231} 11/07/2021 03:26:00 - INFO - __main__ - Step 44279: {'lr': 0.00040590445479032965, 'samples': 8501568, 'steps': 44278, 'loss/train': 1.5263757705688477} 11/07/2021 03:26:01 - INFO - __main__ - Step 44280: {'lr': 0.0004059003063172644, 'samples': 8501760, 'steps': 44279, 'loss/train': 1.7530311346054077} 11/07/2021 03:26:01 - INFO - __main__ - Step 44281: {'lr': 0.0004058961577739529, 'samples': 8501952, 'steps': 44280, 'loss/train': 1.3085688352584839} 11/07/2021 03:26:01 - INFO - __main__ - Step 44282: {'lr': 0.00040589200916039703, 'samples': 8502144, 'steps': 44281, 'loss/train': 1.7258718013763428} 11/07/2021 03:26:02 - INFO - __main__ - Step 44283: {'lr': 0.0004058878604765985, 'samples': 8502336, 'steps': 44282, 'loss/train': 2.434744119644165} 11/07/2021 03:26:03 - INFO - __main__ - Step 44284: {'lr': 0.00040588371172255936, 'samples': 8502528, 'steps': 44283, 'loss/train': 2.6195433139801025} 11/07/2021 03:26:03 - INFO - __main__ - Step 44285: {'lr': 0.0004058795628982814, 'samples': 8502720, 'steps': 44284, 'loss/train': 1.3945242166519165} 11/07/2021 03:26:03 - INFO - __main__ - Step 44286: {'lr': 0.0004058754140037666, 'samples': 8502912, 'steps': 44285, 'loss/train': 1.7013885974884033} 11/07/2021 03:26:04 - INFO - __main__ - Step 44287: {'lr': 0.00040587126503901664, 'samples': 8503104, 'steps': 44286, 'loss/train': 1.5036070346832275} 11/07/2021 03:26:04 - INFO - __main__ - Step 44288: {'lr': 0.0004058671160040336, 'samples': 8503296, 'steps': 44287, 'loss/train': 1.908082365989685} 11/07/2021 03:26:05 - INFO - __main__ - Step 44289: {'lr': 0.0004058629668988192, 'samples': 8503488, 'steps': 44288, 'loss/train': 1.5681588649749756} 11/07/2021 03:26:05 - INFO - __main__ - Step 44290: {'lr': 0.0004058588177233753, 'samples': 8503680, 'steps': 44289, 'loss/train': 1.7509156465530396} 11/07/2021 03:26:06 - INFO - __main__ - Step 44291: {'lr': 0.0004058546684777039, 'samples': 8503872, 'steps': 44290, 'loss/train': 1.3371413946151733} 11/07/2021 03:26:06 - INFO - __main__ - Step 44292: {'lr': 0.0004058505191618067, 'samples': 8504064, 'steps': 44291, 'loss/train': 1.466626763343811} 11/07/2021 03:26:07 - INFO - __main__ - Step 44293: {'lr': 0.00040584636977568573, 'samples': 8504256, 'steps': 44292, 'loss/train': 1.5148591995239258} 11/07/2021 03:26:07 - INFO - __main__ - Step 44294: {'lr': 0.0004058422203193428, 'samples': 8504448, 'steps': 44293, 'loss/train': 2.143653392791748} 11/07/2021 03:26:08 - INFO - __main__ - Step 44295: {'lr': 0.0004058380707927798, 'samples': 8504640, 'steps': 44294, 'loss/train': 1.8551056385040283} 11/07/2021 03:26:08 - INFO - __main__ - Step 44296: {'lr': 0.00040583392119599847, 'samples': 8504832, 'steps': 44295, 'loss/train': 1.5859216451644897} 11/07/2021 03:26:09 - INFO - __main__ - Step 44297: {'lr': 0.0004058297715290008, 'samples': 8505024, 'steps': 44296, 'loss/train': 1.3397605419158936} 11/07/2021 03:26:09 - INFO - __main__ - Step 44298: {'lr': 0.00040582562179178864, 'samples': 8505216, 'steps': 44297, 'loss/train': 1.5907195806503296} 11/07/2021 03:26:10 - INFO - __main__ - Step 44299: {'lr': 0.0004058214719843639, 'samples': 8505408, 'steps': 44298, 'loss/train': 1.302096962928772} 11/07/2021 03:26:11 - INFO - __main__ - Step 44300: {'lr': 0.0004058173221067284, 'samples': 8505600, 'steps': 44299, 'loss/train': 1.5051329135894775} 11/07/2021 03:26:11 - INFO - __main__ - Step 44301: {'lr': 0.00040581317215888403, 'samples': 8505792, 'steps': 44300, 'loss/train': 1.4881658554077148} 11/07/2021 03:26:11 - INFO - __main__ - Step 44302: {'lr': 0.0004058090221408326, 'samples': 8505984, 'steps': 44301, 'loss/train': 1.6845630407333374} 11/07/2021 03:26:12 - INFO - __main__ - Step 44303: {'lr': 0.0004058048720525761, 'samples': 8506176, 'steps': 44302, 'loss/train': 1.5862226486206055} 11/07/2021 03:26:13 - INFO - __main__ - Step 44304: {'lr': 0.00040580072189411626, 'samples': 8506368, 'steps': 44303, 'loss/train': 0.34425705671310425} 11/07/2021 03:26:13 - INFO - __main__ - Step 44305: {'lr': 0.00040579657166545503, 'samples': 8506560, 'steps': 44304, 'loss/train': 1.3707748651504517} 11/07/2021 03:26:13 - INFO - __main__ - Step 44306: {'lr': 0.0004057924213665943, 'samples': 8506752, 'steps': 44305, 'loss/train': 1.7618228197097778} 11/07/2021 03:26:14 - INFO - __main__ - Step 44307: {'lr': 0.0004057882709975359, 'samples': 8506944, 'steps': 44306, 'loss/train': 1.6465903520584106} 11/07/2021 03:26:14 - INFO - __main__ - Step 44308: {'lr': 0.0004057841205582817, 'samples': 8507136, 'steps': 44307, 'loss/train': 1.5074667930603027} 11/07/2021 03:26:15 - INFO - __main__ - Step 44309: {'lr': 0.0004057799700488336, 'samples': 8507328, 'steps': 44308, 'loss/train': 1.3298178911209106} 11/07/2021 03:26:16 - INFO - __main__ - Step 44310: {'lr': 0.0004057758194691934, 'samples': 8507520, 'steps': 44309, 'loss/train': 1.3904099464416504} 11/07/2021 03:26:16 - INFO - __main__ - Step 44311: {'lr': 0.00040577166881936304, 'samples': 8507712, 'steps': 44310, 'loss/train': 1.4899998903274536} 11/07/2021 03:26:16 - INFO - __main__ - Step 44312: {'lr': 0.0004057675180993444, 'samples': 8507904, 'steps': 44311, 'loss/train': 1.7797423601150513} 11/07/2021 03:26:17 - INFO - __main__ - Step 44313: {'lr': 0.00040576336730913933, 'samples': 8508096, 'steps': 44312, 'loss/train': 1.3314696550369263} 11/07/2021 03:26:17 - INFO - __main__ - Step 44314: {'lr': 0.00040575921644874966, 'samples': 8508288, 'steps': 44313, 'loss/train': 1.326983094215393} 11/07/2021 03:26:19 - INFO - __main__ - Step 44315: {'lr': 0.00040575506551817725, 'samples': 8508480, 'steps': 44314, 'loss/train': 0.7640935778617859} 11/07/2021 03:26:19 - INFO - __main__ - Step 44316: {'lr': 0.00040575091451742405, 'samples': 8508672, 'steps': 44315, 'loss/train': 1.2644010782241821} 11/07/2021 03:26:19 - INFO - __main__ - Step 44317: {'lr': 0.0004057467634464919, 'samples': 8508864, 'steps': 44316, 'loss/train': 1.4013490676879883} 11/07/2021 03:26:20 - INFO - __main__ - Step 44318: {'lr': 0.00040574261230538267, 'samples': 8509056, 'steps': 44317, 'loss/train': 1.2443079948425293} 11/07/2021 03:26:20 - INFO - __main__ - Step 44319: {'lr': 0.0004057384610940982, 'samples': 8509248, 'steps': 44318, 'loss/train': 1.4569404125213623} 11/07/2021 03:26:20 - INFO - __main__ - Step 44320: {'lr': 0.0004057343098126404, 'samples': 8509440, 'steps': 44319, 'loss/train': 1.319869041442871} 11/07/2021 03:26:21 - INFO - __main__ - Step 44321: {'lr': 0.0004057301584610111, 'samples': 8509632, 'steps': 44320, 'loss/train': 1.4804351329803467} 11/07/2021 03:26:22 - INFO - __main__ - Step 44322: {'lr': 0.00040572600703921223, 'samples': 8509824, 'steps': 44321, 'loss/train': 3.2597482204437256} 11/07/2021 03:26:22 - INFO - __main__ - Step 44323: {'lr': 0.0004057218555472456, 'samples': 8510016, 'steps': 44322, 'loss/train': 1.5574032068252563} 11/07/2021 03:26:22 - INFO - __main__ - Step 44324: {'lr': 0.0004057177039851131, 'samples': 8510208, 'steps': 44323, 'loss/train': 1.564391016960144} 11/07/2021 03:26:23 - INFO - __main__ - Step 44325: {'lr': 0.00040571355235281657, 'samples': 8510400, 'steps': 44324, 'loss/train': 1.9127159118652344} 11/07/2021 03:26:24 - INFO - __main__ - Step 44326: {'lr': 0.00040570940065035797, 'samples': 8510592, 'steps': 44325, 'loss/train': 1.2782944440841675} 11/07/2021 03:26:24 - INFO - __main__ - Step 44327: {'lr': 0.0004057052488777392, 'samples': 8510784, 'steps': 44326, 'loss/train': 1.6044999361038208} 11/07/2021 03:26:25 - INFO - __main__ - Step 44328: {'lr': 0.0004057010970349619, 'samples': 8510976, 'steps': 44327, 'loss/train': 1.5265976190567017} 11/07/2021 03:26:25 - INFO - __main__ - Step 44329: {'lr': 0.00040569694512202815, 'samples': 8511168, 'steps': 44328, 'loss/train': 1.3547221422195435} 11/07/2021 03:26:25 - INFO - __main__ - Step 44330: {'lr': 0.00040569279313893976, 'samples': 8511360, 'steps': 44329, 'loss/train': 1.4769420623779297} 11/07/2021 03:26:26 - INFO - __main__ - Step 44331: {'lr': 0.0004056886410856986, 'samples': 8511552, 'steps': 44330, 'loss/train': 0.3407726585865021} 11/07/2021 03:26:27 - INFO - __main__ - Step 44332: {'lr': 0.0004056844889623065, 'samples': 8511744, 'steps': 44331, 'loss/train': 1.1814073324203491} 11/07/2021 03:26:27 - INFO - __main__ - Step 44333: {'lr': 0.0004056803367687654, 'samples': 8511936, 'steps': 44332, 'loss/train': 1.6243948936462402} 11/07/2021 03:26:27 - INFO - __main__ - Step 44334: {'lr': 0.0004056761845050772, 'samples': 8512128, 'steps': 44333, 'loss/train': 0.6642376184463501} 11/07/2021 03:26:28 - INFO - __main__ - Step 44335: {'lr': 0.0004056720321712436, 'samples': 8512320, 'steps': 44334, 'loss/train': 1.2679871320724487} 11/07/2021 03:26:29 - INFO - __main__ - Step 44336: {'lr': 0.00040566787976726665, 'samples': 8512512, 'steps': 44335, 'loss/train': 1.2967005968093872} 11/07/2021 03:26:29 - INFO - __main__ - Step 44337: {'lr': 0.00040566372729314813, 'samples': 8512704, 'steps': 44336, 'loss/train': 1.4064103364944458} 11/07/2021 03:26:29 - INFO - __main__ - Step 44338: {'lr': 0.00040565957474889, 'samples': 8512896, 'steps': 44337, 'loss/train': 1.4833481311798096} 11/07/2021 03:26:30 - INFO - __main__ - Step 44339: {'lr': 0.000405655422134494, 'samples': 8513088, 'steps': 44338, 'loss/train': 2.3924691677093506} 11/07/2021 03:26:30 - INFO - __main__ - Step 44340: {'lr': 0.0004056512694499621, 'samples': 8513280, 'steps': 44339, 'loss/train': 0.9818007946014404} 11/07/2021 03:26:31 - INFO - __main__ - Step 44341: {'lr': 0.0004056471166952961, 'samples': 8513472, 'steps': 44340, 'loss/train': 1.5734854936599731} 11/07/2021 03:26:32 - INFO - __main__ - Step 44342: {'lr': 0.0004056429638704979, 'samples': 8513664, 'steps': 44341, 'loss/train': 1.7564226388931274} 11/07/2021 03:26:32 - INFO - __main__ - Step 44343: {'lr': 0.0004056388109755695, 'samples': 8513856, 'steps': 44342, 'loss/train': 1.9729844331741333} 11/07/2021 03:26:32 - INFO - __main__ - Step 44344: {'lr': 0.0004056346580105126, 'samples': 8514048, 'steps': 44343, 'loss/train': 0.17679743468761444} 11/07/2021 03:26:33 - INFO - __main__ - Step 44345: {'lr': 0.00040563050497532905, 'samples': 8514240, 'steps': 44344, 'loss/train': 1.5594099760055542} 11/07/2021 03:26:33 - INFO - __main__ - Step 44346: {'lr': 0.00040562635187002083, 'samples': 8514432, 'steps': 44345, 'loss/train': 1.5085008144378662} 11/07/2021 03:26:34 - INFO - __main__ - Step 44347: {'lr': 0.0004056221986945898, 'samples': 8514624, 'steps': 44346, 'loss/train': 0.818427562713623} 11/07/2021 03:26:35 - INFO - __main__ - Step 44348: {'lr': 0.0004056180454490378, 'samples': 8514816, 'steps': 44347, 'loss/train': 0.6687682867050171} 11/07/2021 03:26:35 - INFO - __main__ - Step 44349: {'lr': 0.00040561389213336673, 'samples': 8515008, 'steps': 44348, 'loss/train': 1.2913715839385986} 11/07/2021 03:26:35 - INFO - __main__ - Step 44350: {'lr': 0.00040560973874757844, 'samples': 8515200, 'steps': 44349, 'loss/train': 1.7278624773025513} 11/07/2021 03:26:36 - INFO - __main__ - Step 44351: {'lr': 0.0004056055852916748, 'samples': 8515392, 'steps': 44350, 'loss/train': 1.3829728364944458} 11/07/2021 03:26:37 - INFO - __main__ - Step 44352: {'lr': 0.0004056014317656577, 'samples': 8515584, 'steps': 44351, 'loss/train': 1.8176770210266113} 11/07/2021 03:26:37 - INFO - __main__ - Step 44353: {'lr': 0.00040559727816952897, 'samples': 8515776, 'steps': 44352, 'loss/train': 1.4659550189971924} 11/07/2021 03:26:37 - INFO - __main__ - Step 44354: {'lr': 0.0004055931245032904, 'samples': 8515968, 'steps': 44353, 'loss/train': 1.4851115942001343} 11/07/2021 03:26:38 - INFO - __main__ - Step 44355: {'lr': 0.0004055889707669441, 'samples': 8516160, 'steps': 44354, 'loss/train': 1.867988109588623} 11/07/2021 03:26:38 - INFO - __main__ - Step 44356: {'lr': 0.0004055848169604919, 'samples': 8516352, 'steps': 44355, 'loss/train': 1.2548911571502686} 11/07/2021 03:26:39 - INFO - __main__ - Step 44357: {'lr': 0.00040558066308393536, 'samples': 8516544, 'steps': 44356, 'loss/train': 1.6776759624481201} 11/07/2021 03:26:40 - INFO - __main__ - Step 44358: {'lr': 0.0004055765091372767, 'samples': 8516736, 'steps': 44357, 'loss/train': 1.2604798078536987} 11/07/2021 03:26:40 - INFO - __main__ - Step 44359: {'lr': 0.0004055723551205177, 'samples': 8516928, 'steps': 44358, 'loss/train': 2.0085549354553223} 11/07/2021 03:26:40 - INFO - __main__ - Step 44360: {'lr': 0.0004055682010336601, 'samples': 8517120, 'steps': 44359, 'loss/train': 1.4113619327545166} 11/07/2021 03:26:41 - INFO - __main__ - Step 44361: {'lr': 0.0004055640468767059, 'samples': 8517312, 'steps': 44360, 'loss/train': 1.4180389642715454} 11/07/2021 03:26:42 - INFO - __main__ - Step 44362: {'lr': 0.000405559892649657, 'samples': 8517504, 'steps': 44361, 'loss/train': 1.191287875175476} 11/07/2021 03:26:42 - INFO - __main__ - Step 44363: {'lr': 0.00040555573835251513, 'samples': 8517696, 'steps': 44362, 'loss/train': 1.7261238098144531} 11/07/2021 03:26:42 - INFO - __main__ - Step 44364: {'lr': 0.00040555158398528237, 'samples': 8517888, 'steps': 44363, 'loss/train': 1.5277279615402222} 11/07/2021 03:26:43 - INFO - __main__ - Step 44365: {'lr': 0.0004055474295479603, 'samples': 8518080, 'steps': 44364, 'loss/train': 1.3089200258255005} 11/07/2021 03:26:43 - INFO - __main__ - Step 44366: {'lr': 0.00040554327504055106, 'samples': 8518272, 'steps': 44365, 'loss/train': 1.2779258489608765} 11/07/2021 03:26:45 - INFO - __main__ - Step 44367: {'lr': 0.0004055391204630564, 'samples': 8518464, 'steps': 44366, 'loss/train': 1.3282432556152344} 11/07/2021 03:26:45 - INFO - __main__ - Step 44368: {'lr': 0.0004055349658154782, 'samples': 8518656, 'steps': 44367, 'loss/train': 1.0060210227966309} 11/07/2021 03:26:46 - INFO - __main__ - Step 44369: {'lr': 0.00040553081109781844, 'samples': 8518848, 'steps': 44368, 'loss/train': 1.7433545589447021} 11/07/2021 03:26:46 - INFO - __main__ - Step 44370: {'lr': 0.0004055266563100788, 'samples': 8519040, 'steps': 44369, 'loss/train': 1.9142252206802368} 11/07/2021 03:26:46 - INFO - __main__ - Step 44371: {'lr': 0.00040552250145226124, 'samples': 8519232, 'steps': 44370, 'loss/train': 1.2746949195861816} 11/07/2021 03:26:47 - INFO - __main__ - Step 44372: {'lr': 0.0004055183465243676, 'samples': 8519424, 'steps': 44371, 'loss/train': 0.8926762938499451} 11/07/2021 03:26:47 - INFO - __main__ - Step 44373: {'lr': 0.0004055141915263999, 'samples': 8519616, 'steps': 44372, 'loss/train': 1.221543788909912} 11/07/2021 03:26:48 - INFO - __main__ - Step 44374: {'lr': 0.0004055100364583598, 'samples': 8519808, 'steps': 44373, 'loss/train': 1.3261895179748535} 11/07/2021 03:26:48 - INFO - __main__ - Step 44375: {'lr': 0.0004055058813202493, 'samples': 8520000, 'steps': 44374, 'loss/train': 2.2200098037719727} 11/07/2021 03:26:49 - INFO - __main__ - Step 44376: {'lr': 0.0004055017261120704, 'samples': 8520192, 'steps': 44375, 'loss/train': 2.082080125808716} 11/07/2021 03:26:49 - INFO - __main__ - Step 44377: {'lr': 0.00040549757083382465, 'samples': 8520384, 'steps': 44376, 'loss/train': 1.1706246137619019} 11/07/2021 03:26:50 - INFO - __main__ - Step 44378: {'lr': 0.00040549341548551415, 'samples': 8520576, 'steps': 44377, 'loss/train': 1.8282917737960815} 11/07/2021 03:26:51 - INFO - __main__ - Step 44379: {'lr': 0.0004054892600671407, 'samples': 8520768, 'steps': 44378, 'loss/train': 1.7403497695922852} 11/07/2021 03:26:51 - INFO - __main__ - Step 44380: {'lr': 0.00040548510457870623, 'samples': 8520960, 'steps': 44379, 'loss/train': 1.3733513355255127} 11/07/2021 03:26:51 - INFO - __main__ - Step 44381: {'lr': 0.00040548094902021257, 'samples': 8521152, 'steps': 44380, 'loss/train': 1.5659174919128418} 11/07/2021 03:26:52 - INFO - __main__ - Step 44382: {'lr': 0.00040547679339166155, 'samples': 8521344, 'steps': 44381, 'loss/train': 1.7189534902572632} 11/07/2021 03:26:52 - INFO - __main__ - Step 44383: {'lr': 0.0004054726376930551, 'samples': 8521536, 'steps': 44382, 'loss/train': 1.4297466278076172} 11/07/2021 03:26:53 - INFO - __main__ - Step 44384: {'lr': 0.0004054684819243951, 'samples': 8521728, 'steps': 44383, 'loss/train': 1.544453501701355} 11/07/2021 03:26:53 - INFO - __main__ - Step 44385: {'lr': 0.0004054643260856834, 'samples': 8521920, 'steps': 44384, 'loss/train': 1.738171935081482} 11/07/2021 03:26:54 - INFO - __main__ - Step 44386: {'lr': 0.00040546017017692183, 'samples': 8522112, 'steps': 44385, 'loss/train': 1.7193278074264526} 11/07/2021 03:26:54 - INFO - __main__ - Step 44387: {'lr': 0.00040545601419811236, 'samples': 8522304, 'steps': 44386, 'loss/train': 1.71592378616333} 11/07/2021 03:26:54 - INFO - __main__ - Step 44388: {'lr': 0.00040545185814925676, 'samples': 8522496, 'steps': 44387, 'loss/train': 1.3953267335891724} 11/07/2021 03:26:55 - INFO - __main__ - Step 44389: {'lr': 0.00040544770203035705, 'samples': 8522688, 'steps': 44388, 'loss/train': 1.3401683568954468} 11/07/2021 03:26:56 - INFO - __main__ - Step 44390: {'lr': 0.0004054435458414149, 'samples': 8522880, 'steps': 44389, 'loss/train': 1.2182083129882812} 11/07/2021 03:26:56 - INFO - __main__ - Step 44391: {'lr': 0.0004054393895824323, 'samples': 8523072, 'steps': 44390, 'loss/train': 1.2936636209487915} 11/07/2021 03:26:56 - INFO - __main__ - Step 44392: {'lr': 0.00040543523325341116, 'samples': 8523264, 'steps': 44391, 'loss/train': 1.199650764465332} 11/07/2021 03:26:57 - INFO - __main__ - Step 44393: {'lr': 0.0004054310768543532, 'samples': 8523456, 'steps': 44392, 'loss/train': 0.5333095788955688} 11/07/2021 03:26:58 - INFO - __main__ - Step 44394: {'lr': 0.00040542692038526054, 'samples': 8523648, 'steps': 44393, 'loss/train': 1.777122139930725} 11/07/2021 03:26:58 - INFO - __main__ - Step 44395: {'lr': 0.0004054227638461348, 'samples': 8523840, 'steps': 44394, 'loss/train': 1.1267532110214233} 11/07/2021 03:26:59 - INFO - __main__ - Step 44396: {'lr': 0.000405418607236978, 'samples': 8524032, 'steps': 44395, 'loss/train': 1.3103433847427368} 11/07/2021 03:26:59 - INFO - __main__ - Step 44397: {'lr': 0.00040541445055779197, 'samples': 8524224, 'steps': 44396, 'loss/train': 1.1464890241622925} 11/07/2021 03:26:59 - INFO - __main__ - Step 44398: {'lr': 0.0004054102938085786, 'samples': 8524416, 'steps': 44397, 'loss/train': 1.3520127534866333} 11/07/2021 03:27:01 - INFO - __main__ - Step 44399: {'lr': 0.0004054061369893397, 'samples': 8524608, 'steps': 44398, 'loss/train': 1.9427788257598877} 11/07/2021 03:27:01 - INFO - __main__ - Step 44400: {'lr': 0.0004054019801000772, 'samples': 8524800, 'steps': 44399, 'loss/train': 1.9333767890930176} 11/07/2021 03:27:01 - INFO - __main__ - Step 44401: {'lr': 0.00040539782314079304, 'samples': 8524992, 'steps': 44400, 'loss/train': 1.6394872665405273} 11/07/2021 03:27:02 - INFO - __main__ - Step 44402: {'lr': 0.000405393666111489, 'samples': 8525184, 'steps': 44401, 'loss/train': 1.2938517332077026} 11/07/2021 03:27:02 - INFO - __main__ - Step 44403: {'lr': 0.0004053895090121669, 'samples': 8525376, 'steps': 44402, 'loss/train': 1.545699119567871} 11/07/2021 03:27:03 - INFO - __main__ - Step 44404: {'lr': 0.00040538535184282877, 'samples': 8525568, 'steps': 44403, 'loss/train': 0.23328112065792084} 11/07/2021 03:27:03 - INFO - __main__ - Step 44405: {'lr': 0.00040538119460347636, 'samples': 8525760, 'steps': 44404, 'loss/train': 1.5613592863082886} 11/07/2021 03:27:04 - INFO - __main__ - Step 44406: {'lr': 0.0004053770372941116, 'samples': 8525952, 'steps': 44405, 'loss/train': 1.7085882425308228} 11/07/2021 03:27:04 - INFO - __main__ - Step 44407: {'lr': 0.00040537287991473627, 'samples': 8526144, 'steps': 44406, 'loss/train': 1.5492923259735107} 11/07/2021 03:27:04 - INFO - __main__ - Step 44408: {'lr': 0.0004053687224653524, 'samples': 8526336, 'steps': 44407, 'loss/train': 1.4747073650360107} 11/07/2021 03:27:05 - INFO - __main__ - Step 44409: {'lr': 0.0004053645649459617, 'samples': 8526528, 'steps': 44408, 'loss/train': 1.1547192335128784} 11/07/2021 03:27:06 - INFO - __main__ - Step 44410: {'lr': 0.0004053604073565662, 'samples': 8526720, 'steps': 44409, 'loss/train': 1.2398037910461426} 11/07/2021 03:27:06 - INFO - __main__ - Step 44411: {'lr': 0.0004053562496971677, 'samples': 8526912, 'steps': 44410, 'loss/train': 2.591552972793579} 11/07/2021 03:27:07 - INFO - __main__ - Step 44412: {'lr': 0.00040535209196776803, 'samples': 8527104, 'steps': 44411, 'loss/train': 1.2528629302978516} 11/07/2021 03:27:07 - INFO - __main__ - Step 44413: {'lr': 0.00040534793416836915, 'samples': 8527296, 'steps': 44412, 'loss/train': 1.191092610359192} 11/07/2021 03:27:07 - INFO - __main__ - Step 44414: {'lr': 0.00040534377629897276, 'samples': 8527488, 'steps': 44413, 'loss/train': 0.8791242241859436} 11/07/2021 03:27:08 - INFO - __main__ - Step 44415: {'lr': 0.000405339618359581, 'samples': 8527680, 'steps': 44414, 'loss/train': 0.21375149488449097} 11/07/2021 03:27:09 - INFO - __main__ - Step 44416: {'lr': 0.0004053354603501956, 'samples': 8527872, 'steps': 44415, 'loss/train': 1.3721272945404053} 11/07/2021 03:27:09 - INFO - __main__ - Step 44417: {'lr': 0.0004053313022708184, 'samples': 8528064, 'steps': 44416, 'loss/train': 0.752591609954834} 11/07/2021 03:27:09 - INFO - __main__ - Step 44418: {'lr': 0.00040532714412145135, 'samples': 8528256, 'steps': 44417, 'loss/train': 2.0578362941741943} 11/07/2021 03:27:10 - INFO - __main__ - Step 44419: {'lr': 0.0004053229859020962, 'samples': 8528448, 'steps': 44418, 'loss/train': 1.4786136150360107} 11/07/2021 03:27:11 - INFO - __main__ - Step 44420: {'lr': 0.00040531882761275496, 'samples': 8528640, 'steps': 44419, 'loss/train': 1.2803568840026855} 11/07/2021 03:27:12 - INFO - __main__ - Step 44421: {'lr': 0.00040531466925342947, 'samples': 8528832, 'steps': 44420, 'loss/train': 1.6242843866348267} 11/07/2021 03:27:12 - INFO - __main__ - Step 44422: {'lr': 0.0004053105108241216, 'samples': 8529024, 'steps': 44421, 'loss/train': 1.488799810409546} 11/07/2021 03:27:12 - INFO - __main__ - Step 44423: {'lr': 0.0004053063523248331, 'samples': 8529216, 'steps': 44422, 'loss/train': 1.6097091436386108} 11/07/2021 03:27:13 - INFO - __main__ - Step 44424: {'lr': 0.0004053021937555661, 'samples': 8529408, 'steps': 44423, 'loss/train': 1.3957725763320923} 11/07/2021 03:27:13 - INFO - __main__ - Step 44425: {'lr': 0.00040529803511632224, 'samples': 8529600, 'steps': 44424, 'loss/train': 0.8210239410400391} 11/07/2021 03:27:14 - INFO - __main__ - Step 44426: {'lr': 0.0004052938764071035, 'samples': 8529792, 'steps': 44425, 'loss/train': 0.4482980966567993} 11/07/2021 03:27:14 - INFO - __main__ - Step 44427: {'lr': 0.00040528971762791177, 'samples': 8529984, 'steps': 44426, 'loss/train': 1.7034776210784912} 11/07/2021 03:27:15 - INFO - __main__ - Step 44428: {'lr': 0.0004052855587787488, 'samples': 8530176, 'steps': 44427, 'loss/train': 1.948665976524353} 11/07/2021 03:27:15 - INFO - __main__ - Step 44429: {'lr': 0.0004052813998596167, 'samples': 8530368, 'steps': 44428, 'loss/train': 1.4644088745117188} 11/07/2021 03:27:15 - INFO - __main__ - Step 44430: {'lr': 0.0004052772408705171, 'samples': 8530560, 'steps': 44429, 'loss/train': 1.6136597394943237} 11/07/2021 03:27:17 - INFO - __main__ - Step 44431: {'lr': 0.000405273081811452, 'samples': 8530752, 'steps': 44430, 'loss/train': 1.1703969240188599} 11/07/2021 03:27:17 - INFO - __main__ - Step 44432: {'lr': 0.0004052689226824232, 'samples': 8530944, 'steps': 44431, 'loss/train': 1.5301213264465332} 11/07/2021 03:27:17 - INFO - __main__ - Step 44433: {'lr': 0.0004052647634834327, 'samples': 8531136, 'steps': 44432, 'loss/train': 1.528038501739502} 11/07/2021 03:27:18 - INFO - __main__ - Step 44434: {'lr': 0.00040526060421448216, 'samples': 8531328, 'steps': 44433, 'loss/train': 1.3487976789474487} 11/07/2021 03:27:18 - INFO - __main__ - Step 44435: {'lr': 0.00040525644487557366, 'samples': 8531520, 'steps': 44434, 'loss/train': 1.7728455066680908} 11/07/2021 03:27:19 - INFO - __main__ - Step 44436: {'lr': 0.000405252285466709, 'samples': 8531712, 'steps': 44435, 'loss/train': 1.8397818803787231} 11/07/2021 03:27:19 - INFO - __main__ - Step 44437: {'lr': 0.0004052481259878901, 'samples': 8531904, 'steps': 44436, 'loss/train': 1.6660261154174805} 11/07/2021 03:27:20 - INFO - __main__ - Step 44438: {'lr': 0.00040524396643911874, 'samples': 8532096, 'steps': 44437, 'loss/train': 1.6685140132904053} 11/07/2021 03:27:20 - INFO - __main__ - Step 44439: {'lr': 0.00040523980682039684, 'samples': 8532288, 'steps': 44438, 'loss/train': 1.6078511476516724} 11/07/2021 03:27:20 - INFO - __main__ - Step 44440: {'lr': 0.00040523564713172634, 'samples': 8532480, 'steps': 44439, 'loss/train': 1.8439935445785522} 11/07/2021 03:27:21 - INFO - __main__ - Step 44441: {'lr': 0.000405231487373109, 'samples': 8532672, 'steps': 44440, 'loss/train': 0.8348057866096497} 11/07/2021 03:27:22 - INFO - __main__ - Step 44442: {'lr': 0.00040522732754454674, 'samples': 8532864, 'steps': 44441, 'loss/train': 1.5335814952850342} 11/07/2021 03:27:22 - INFO - __main__ - Step 44443: {'lr': 0.0004052231676460415, 'samples': 8533056, 'steps': 44442, 'loss/train': 1.5462127923965454} 11/07/2021 03:27:22 - INFO - __main__ - Step 44444: {'lr': 0.000405219007677595, 'samples': 8533248, 'steps': 44443, 'loss/train': 1.638048529624939} 11/07/2021 03:27:23 - INFO - __main__ - Step 44445: {'lr': 0.0004052148476392093, 'samples': 8533440, 'steps': 44444, 'loss/train': 1.5250041484832764} 11/07/2021 03:27:23 - INFO - __main__ - Step 44446: {'lr': 0.00040521068753088615, 'samples': 8533632, 'steps': 44445, 'loss/train': 1.178097128868103} 11/07/2021 03:27:26 - INFO - __main__ - Step 44447: {'lr': 0.0004052065273526274, 'samples': 8533824, 'steps': 44446, 'loss/train': 1.603786587715149} 11/07/2021 03:27:26 - INFO - __main__ - Step 44448: {'lr': 0.0004052023671044351, 'samples': 8534016, 'steps': 44447, 'loss/train': 1.2792620658874512} 11/07/2021 03:27:26 - INFO - __main__ - Step 44449: {'lr': 0.0004051982067863109, 'samples': 8534208, 'steps': 44448, 'loss/train': 1.9062073230743408} 11/07/2021 03:27:27 - INFO - __main__ - Step 44450: {'lr': 0.0004051940463982569, 'samples': 8534400, 'steps': 44449, 'loss/train': 1.2241114377975464} 11/07/2021 03:27:27 - INFO - __main__ - Step 44451: {'lr': 0.0004051898859402748, 'samples': 8534592, 'steps': 44450, 'loss/train': 1.7529017925262451} 11/07/2021 03:27:27 - INFO - __main__ - Step 44452: {'lr': 0.00040518572541236653, 'samples': 8534784, 'steps': 44451, 'loss/train': 1.7296701669692993} 11/07/2021 03:27:28 - INFO - __main__ - Step 44453: {'lr': 0.00040518156481453397, 'samples': 8534976, 'steps': 44452, 'loss/train': 1.0338215827941895} 11/07/2021 03:27:29 - INFO - __main__ - Step 44454: {'lr': 0.0004051774041467789, 'samples': 8535168, 'steps': 44453, 'loss/train': 1.2485108375549316} 11/07/2021 03:27:29 - INFO - __main__ - Step 44455: {'lr': 0.00040517324340910347, 'samples': 8535360, 'steps': 44454, 'loss/train': 1.7952251434326172} 11/07/2021 03:27:29 - INFO - __main__ - Step 44456: {'lr': 0.0004051690826015092, 'samples': 8535552, 'steps': 44455, 'loss/train': 1.1251962184906006} 11/07/2021 03:27:30 - INFO - __main__ - Step 44457: {'lr': 0.0004051649217239982, 'samples': 8535744, 'steps': 44456, 'loss/train': 1.3085439205169678} 11/07/2021 03:27:30 - INFO - __main__ - Step 44458: {'lr': 0.00040516076077657233, 'samples': 8535936, 'steps': 44457, 'loss/train': 1.619061827659607} 11/07/2021 03:27:31 - INFO - __main__ - Step 44459: {'lr': 0.0004051565997592334, 'samples': 8536128, 'steps': 44458, 'loss/train': 1.7020214796066284} 11/07/2021 03:27:32 - INFO - __main__ - Step 44460: {'lr': 0.0004051524386719832, 'samples': 8536320, 'steps': 44459, 'loss/train': 1.3638756275177002} 11/07/2021 03:27:32 - INFO - __main__ - Step 44461: {'lr': 0.0004051482775148238, 'samples': 8536512, 'steps': 44460, 'loss/train': 1.798329472541809} 11/07/2021 03:27:32 - INFO - __main__ - Step 44462: {'lr': 0.00040514411628775695, 'samples': 8536704, 'steps': 44461, 'loss/train': 1.083493709564209} 11/07/2021 03:27:33 - INFO - __main__ - Step 44463: {'lr': 0.0004051399549907846, 'samples': 8536896, 'steps': 44462, 'loss/train': 1.439456582069397} 11/07/2021 03:27:33 - INFO - __main__ - Step 44464: {'lr': 0.0004051357936239085, 'samples': 8537088, 'steps': 44463, 'loss/train': 0.927096962928772} 11/07/2021 03:27:34 - INFO - __main__ - Step 44465: {'lr': 0.0004051316321871307, 'samples': 8537280, 'steps': 44464, 'loss/train': 1.1415132284164429} 11/07/2021 03:27:35 - INFO - __main__ - Step 44466: {'lr': 0.0004051274706804529, 'samples': 8537472, 'steps': 44465, 'loss/train': 0.8037278056144714} 11/07/2021 03:27:35 - INFO - __main__ - Step 44467: {'lr': 0.00040512330910387706, 'samples': 8537664, 'steps': 44466, 'loss/train': 2.71468186378479} 11/07/2021 03:27:35 - INFO - __main__ - Step 44468: {'lr': 0.0004051191474574051, 'samples': 8537856, 'steps': 44467, 'loss/train': 1.15324068069458} 11/07/2021 03:27:36 - INFO - __main__ - Step 44469: {'lr': 0.0004051149857410388, 'samples': 8538048, 'steps': 44468, 'loss/train': 1.7898513078689575} 11/07/2021 03:27:37 - INFO - __main__ - Step 44470: {'lr': 0.00040511082395478014, 'samples': 8538240, 'steps': 44469, 'loss/train': 1.243414282798767} 11/07/2021 03:27:37 - INFO - __main__ - Step 44471: {'lr': 0.0004051066620986309, 'samples': 8538432, 'steps': 44470, 'loss/train': 2.240720510482788} 11/07/2021 03:27:37 - INFO - __main__ - Step 44472: {'lr': 0.00040510250017259297, 'samples': 8538624, 'steps': 44471, 'loss/train': 1.2709674835205078} 11/07/2021 03:27:38 - INFO - __main__ - Step 44473: {'lr': 0.0004050983381766683, 'samples': 8538816, 'steps': 44472, 'loss/train': 1.600716233253479} 11/07/2021 03:27:38 - INFO - __main__ - Step 44474: {'lr': 0.00040509417611085864, 'samples': 8539008, 'steps': 44473, 'loss/train': 1.0161057710647583} 11/07/2021 03:27:40 - INFO - __main__ - Step 44475: {'lr': 0.000405090013975166, 'samples': 8539200, 'steps': 44474, 'loss/train': 1.0554980039596558} 11/07/2021 03:27:40 - INFO - __main__ - Step 44476: {'lr': 0.0004050858517695921, 'samples': 8539392, 'steps': 44475, 'loss/train': 2.087064027786255} 11/07/2021 03:27:40 - INFO - __main__ - Step 44477: {'lr': 0.00040508168949413904, 'samples': 8539584, 'steps': 44476, 'loss/train': 1.0634537935256958} 11/07/2021 03:27:41 - INFO - __main__ - Step 44478: {'lr': 0.00040507752714880854, 'samples': 8539776, 'steps': 44477, 'loss/train': 1.3781660795211792} 11/07/2021 03:27:41 - INFO - __main__ - Step 44479: {'lr': 0.0004050733647336024, 'samples': 8539968, 'steps': 44478, 'loss/train': 1.4802095890045166} 11/07/2021 03:27:42 - INFO - __main__ - Step 44480: {'lr': 0.00040506920224852265, 'samples': 8540160, 'steps': 44479, 'loss/train': 1.452307105064392} 11/07/2021 03:27:43 - INFO - __main__ - Step 44481: {'lr': 0.0004050650396935711, 'samples': 8540352, 'steps': 44480, 'loss/train': 1.2733694314956665} 11/07/2021 03:27:43 - INFO - __main__ - Step 44482: {'lr': 0.00040506087706874966, 'samples': 8540544, 'steps': 44481, 'loss/train': 1.220317006111145} 11/07/2021 03:27:43 - INFO - __main__ - Step 44483: {'lr': 0.00040505671437406017, 'samples': 8540736, 'steps': 44482, 'loss/train': 0.8834923505783081} 11/07/2021 03:27:44 - INFO - __main__ - Step 44484: {'lr': 0.00040505255160950453, 'samples': 8540928, 'steps': 44483, 'loss/train': 1.421524167060852} 11/07/2021 03:27:44 - INFO - __main__ - Step 44485: {'lr': 0.00040504838877508464, 'samples': 8541120, 'steps': 44484, 'loss/train': 1.5100693702697754} 11/07/2021 03:27:44 - INFO - __main__ - Step 44486: {'lr': 0.0004050442258708022, 'samples': 8541312, 'steps': 44485, 'loss/train': 0.9665480256080627} 11/07/2021 03:27:45 - INFO - __main__ - Step 44487: {'lr': 0.0004050400628966594, 'samples': 8541504, 'steps': 44486, 'loss/train': 1.4220572710037231} 11/07/2021 03:27:46 - INFO - __main__ - Step 44488: {'lr': 0.0004050358998526578, 'samples': 8541696, 'steps': 44487, 'loss/train': 1.3052656650543213} 11/07/2021 03:27:46 - INFO - __main__ - Step 44489: {'lr': 0.00040503173673879945, 'samples': 8541888, 'steps': 44488, 'loss/train': 1.27193021774292} 11/07/2021 03:27:46 - INFO - __main__ - Step 44490: {'lr': 0.00040502757355508626, 'samples': 8542080, 'steps': 44489, 'loss/train': 0.7407692670822144} 11/07/2021 03:27:47 - INFO - __main__ - Step 44491: {'lr': 0.00040502341030152, 'samples': 8542272, 'steps': 44490, 'loss/train': 1.129051923751831} 11/07/2021 03:27:48 - INFO - __main__ - Step 44492: {'lr': 0.0004050192469781025, 'samples': 8542464, 'steps': 44491, 'loss/train': 1.6179014444351196} 11/07/2021 03:27:48 - INFO - __main__ - Step 44493: {'lr': 0.00040501508358483583, 'samples': 8542656, 'steps': 44492, 'loss/train': 1.8394125699996948} 11/07/2021 03:27:49 - INFO - __main__ - Step 44494: {'lr': 0.00040501092012172173, 'samples': 8542848, 'steps': 44493, 'loss/train': 1.7730039358139038} 11/07/2021 03:27:49 - INFO - __main__ - Step 44495: {'lr': 0.0004050067565887621, 'samples': 8543040, 'steps': 44494, 'loss/train': 1.4558534622192383} 11/07/2021 03:27:49 - INFO - __main__ - Step 44496: {'lr': 0.00040500259298595874, 'samples': 8543232, 'steps': 44495, 'loss/train': 1.5732234716415405} 11/07/2021 03:27:50 - INFO - __main__ - Step 44497: {'lr': 0.00040499842931331374, 'samples': 8543424, 'steps': 44496, 'loss/train': 1.3753433227539062} 11/07/2021 03:27:51 - INFO - __main__ - Step 44498: {'lr': 0.0004049942655708287, 'samples': 8543616, 'steps': 44497, 'loss/train': 1.4764783382415771} 11/07/2021 03:27:51 - INFO - __main__ - Step 44499: {'lr': 0.0004049901017585058, 'samples': 8543808, 'steps': 44498, 'loss/train': 1.58024263381958} 11/07/2021 03:27:51 - INFO - __main__ - Step 44500: {'lr': 0.00040498593787634664, 'samples': 8544000, 'steps': 44499, 'loss/train': 1.0468982458114624} 11/07/2021 03:27:52 - INFO - __main__ - Step 44501: {'lr': 0.0004049817739243532, 'samples': 8544192, 'steps': 44500, 'loss/train': 1.7301188707351685} 11/07/2021 03:27:53 - INFO - __main__ - Step 44502: {'lr': 0.0004049776099025274, 'samples': 8544384, 'steps': 44501, 'loss/train': 1.7797633409500122} 11/07/2021 03:27:53 - INFO - __main__ - Step 44503: {'lr': 0.000404973445810871, 'samples': 8544576, 'steps': 44502, 'loss/train': 1.3407477140426636} 11/07/2021 03:27:53 - INFO - __main__ - Step 44504: {'lr': 0.00040496928164938614, 'samples': 8544768, 'steps': 44503, 'loss/train': 1.2654181718826294} 11/07/2021 03:27:54 - INFO - __main__ - Step 44505: {'lr': 0.0004049651174180744, 'samples': 8544960, 'steps': 44504, 'loss/train': 1.4537317752838135} 11/07/2021 03:27:54 - INFO - __main__ - Step 44506: {'lr': 0.00040496095311693775, 'samples': 8545152, 'steps': 44505, 'loss/train': 1.857408881187439} 11/07/2021 03:27:55 - INFO - __main__ - Step 44507: {'lr': 0.0004049567887459781, 'samples': 8545344, 'steps': 44506, 'loss/train': 1.611707091331482} 11/07/2021 03:27:56 - INFO - __main__ - Step 44508: {'lr': 0.0004049526243051973, 'samples': 8545536, 'steps': 44507, 'loss/train': 1.5558937788009644} 11/07/2021 03:27:56 - INFO - __main__ - Step 44509: {'lr': 0.0004049484597945973, 'samples': 8545728, 'steps': 44508, 'loss/train': 2.009676218032837} 11/07/2021 03:27:56 - INFO - __main__ - Step 44510: {'lr': 0.00040494429521417983, 'samples': 8545920, 'steps': 44509, 'loss/train': 1.0804678201675415} 11/07/2021 03:27:57 - INFO - __main__ - Step 44511: {'lr': 0.0004049401305639469, 'samples': 8546112, 'steps': 44510, 'loss/train': 1.7514135837554932} 11/07/2021 03:27:57 - INFO - __main__ - Step 44512: {'lr': 0.00040493596584390034, 'samples': 8546304, 'steps': 44511, 'loss/train': 1.0365068912506104} 11/07/2021 03:27:58 - INFO - __main__ - Step 44513: {'lr': 0.00040493180105404203, 'samples': 8546496, 'steps': 44512, 'loss/train': 1.2823036909103394} 11/07/2021 03:27:58 - INFO - __main__ - Step 44514: {'lr': 0.0004049276361943738, 'samples': 8546688, 'steps': 44513, 'loss/train': 1.1235660314559937} 11/07/2021 03:27:59 - INFO - __main__ - Step 44515: {'lr': 0.0004049234712648976, 'samples': 8546880, 'steps': 44514, 'loss/train': 1.4431331157684326} 11/07/2021 03:27:59 - INFO - __main__ - Step 44516: {'lr': 0.00040491930626561525, 'samples': 8547072, 'steps': 44515, 'loss/train': 1.423165202140808} 11/07/2021 03:27:59 - INFO - __main__ - Step 44517: {'lr': 0.00040491514119652875, 'samples': 8547264, 'steps': 44516, 'loss/train': 1.4906201362609863} 11/07/2021 03:28:01 - INFO - __main__ - Step 44518: {'lr': 0.00040491097605763974, 'samples': 8547456, 'steps': 44517, 'loss/train': 1.781027913093567} 11/07/2021 03:28:01 - INFO - __main__ - Step 44519: {'lr': 0.00040490681084895034, 'samples': 8547648, 'steps': 44518, 'loss/train': 1.086724042892456} 11/07/2021 03:28:01 - INFO - __main__ - Step 44520: {'lr': 0.00040490264557046217, 'samples': 8547840, 'steps': 44519, 'loss/train': 1.5045785903930664} 11/07/2021 03:28:02 - INFO - __main__ - Step 44521: {'lr': 0.0004048984802221774, 'samples': 8548032, 'steps': 44520, 'loss/train': 1.6920466423034668} 11/07/2021 03:28:02 - INFO - __main__ - Step 44522: {'lr': 0.0004048943148040977, 'samples': 8548224, 'steps': 44521, 'loss/train': 1.6950204372406006} 11/07/2021 03:28:03 - INFO - __main__ - Step 44523: {'lr': 0.0004048901493162251, 'samples': 8548416, 'steps': 44522, 'loss/train': 1.0734343528747559} 11/07/2021 03:28:03 - INFO - __main__ - Step 44524: {'lr': 0.00040488598375856133, 'samples': 8548608, 'steps': 44523, 'loss/train': 1.0315029621124268} 11/07/2021 03:28:04 - INFO - __main__ - Step 44525: {'lr': 0.0004048818181311083, 'samples': 8548800, 'steps': 44524, 'loss/train': 1.529782772064209} 11/07/2021 03:28:04 - INFO - __main__ - Step 44526: {'lr': 0.000404877652433868, 'samples': 8548992, 'steps': 44525, 'loss/train': 1.3307926654815674} 11/07/2021 03:28:04 - INFO - __main__ - Step 44527: {'lr': 0.0004048734866668421, 'samples': 8549184, 'steps': 44526, 'loss/train': 1.581064224243164} 11/07/2021 03:28:05 - INFO - __main__ - Step 44528: {'lr': 0.0004048693208300327, 'samples': 8549376, 'steps': 44527, 'loss/train': 1.7866863012313843} 11/07/2021 03:28:06 - INFO - __main__ - Step 44529: {'lr': 0.00040486515492344145, 'samples': 8549568, 'steps': 44528, 'loss/train': 1.6121472120285034} 11/07/2021 03:28:06 - INFO - __main__ - Step 44530: {'lr': 0.00040486098894707044, 'samples': 8549760, 'steps': 44529, 'loss/train': 0.9334121346473694} 11/07/2021 03:28:06 - INFO - __main__ - Step 44531: {'lr': 0.00040485682290092144, 'samples': 8549952, 'steps': 44530, 'loss/train': 1.5065734386444092} 11/07/2021 03:28:07 - INFO - __main__ - Step 44532: {'lr': 0.0004048526567849964, 'samples': 8550144, 'steps': 44531, 'loss/train': 1.7208983898162842} 11/07/2021 03:28:08 - INFO - __main__ - Step 44533: {'lr': 0.00040484849059929705, 'samples': 8550336, 'steps': 44532, 'loss/train': 1.720125436782837} 11/07/2021 03:28:08 - INFO - __main__ - Step 44534: {'lr': 0.00040484432434382547, 'samples': 8550528, 'steps': 44533, 'loss/train': 1.2826281785964966} 11/07/2021 03:28:09 - INFO - __main__ - Step 44535: {'lr': 0.0004048401580185833, 'samples': 8550720, 'steps': 44534, 'loss/train': 1.655936598777771} 11/07/2021 03:28:09 - INFO - __main__ - Step 44536: {'lr': 0.00040483599162357257, 'samples': 8550912, 'steps': 44535, 'loss/train': 1.1506872177124023} 11/07/2021 03:28:09 - INFO - __main__ - Step 44537: {'lr': 0.0004048318251587952, 'samples': 8551104, 'steps': 44536, 'loss/train': 1.6509345769882202} 11/07/2021 03:28:10 - INFO - __main__ - Step 44538: {'lr': 0.000404827658624253, 'samples': 8551296, 'steps': 44537, 'loss/train': 1.1471433639526367} 11/07/2021 03:28:11 - INFO - __main__ - Step 44539: {'lr': 0.00040482349201994785, 'samples': 8551488, 'steps': 44538, 'loss/train': 1.1694883108139038} 11/07/2021 03:28:11 - INFO - __main__ - Step 44540: {'lr': 0.00040481932534588153, 'samples': 8551680, 'steps': 44539, 'loss/train': 1.3767096996307373} 11/07/2021 03:28:11 - INFO - __main__ - Step 44541: {'lr': 0.00040481515860205607, 'samples': 8551872, 'steps': 44540, 'loss/train': 1.4552617073059082} 11/07/2021 03:28:12 - INFO - __main__ - Step 44542: {'lr': 0.00040481099178847326, 'samples': 8552064, 'steps': 44541, 'loss/train': 1.4863961935043335} 11/07/2021 03:28:12 - INFO - __main__ - Step 44543: {'lr': 0.000404806824905135, 'samples': 8552256, 'steps': 44542, 'loss/train': 1.2956562042236328} 11/07/2021 03:28:13 - INFO - __main__ - Step 44544: {'lr': 0.0004048026579520433, 'samples': 8552448, 'steps': 44543, 'loss/train': 1.751367449760437} 11/07/2021 03:28:13 - INFO - __main__ - Step 44545: {'lr': 0.00040479849092919974, 'samples': 8552640, 'steps': 44544, 'loss/train': 1.4386411905288696} 11/07/2021 03:28:14 - INFO - __main__ - Step 44546: {'lr': 0.00040479432383660644, 'samples': 8552832, 'steps': 44545, 'loss/train': 1.5601409673690796} 11/07/2021 03:28:14 - INFO - __main__ - Step 44547: {'lr': 0.00040479015667426523, 'samples': 8553024, 'steps': 44546, 'loss/train': 1.5965445041656494} 11/07/2021 03:28:14 - INFO - __main__ - Step 44548: {'lr': 0.00040478598944217794, 'samples': 8553216, 'steps': 44547, 'loss/train': 1.6914118528366089} 11/07/2021 03:28:15 - INFO - __main__ - Step 44549: {'lr': 0.0004047818221403464, 'samples': 8553408, 'steps': 44548, 'loss/train': 1.3184523582458496} 11/07/2021 03:28:16 - INFO - __main__ - Step 44550: {'lr': 0.0004047776547687727, 'samples': 8553600, 'steps': 44549, 'loss/train': 1.451835036277771} 11/07/2021 03:28:16 - INFO - __main__ - Step 44551: {'lr': 0.00040477348732745853, 'samples': 8553792, 'steps': 44550, 'loss/train': 1.5977486371994019} 11/07/2021 03:28:17 - INFO - __main__ - Step 44552: {'lr': 0.0004047693198164058, 'samples': 8553984, 'steps': 44551, 'loss/train': 1.5908708572387695} 11/07/2021 03:28:17 - INFO - __main__ - Step 44553: {'lr': 0.0004047651522356164, 'samples': 8554176, 'steps': 44552, 'loss/train': 1.6753019094467163} 11/07/2021 03:28:17 - INFO - __main__ - Step 44554: {'lr': 0.0004047609845850922, 'samples': 8554368, 'steps': 44553, 'loss/train': 1.3718253374099731} 11/07/2021 03:28:19 - INFO - __main__ - Step 44555: {'lr': 0.0004047568168648351, 'samples': 8554560, 'steps': 44554, 'loss/train': 1.522674798965454} 11/07/2021 03:28:19 - INFO - __main__ - Step 44556: {'lr': 0.00040475264907484696, 'samples': 8554752, 'steps': 44555, 'loss/train': 1.0965365171432495} 11/07/2021 03:28:19 - INFO - __main__ - Step 44557: {'lr': 0.0004047484812151296, 'samples': 8554944, 'steps': 44556, 'loss/train': 1.554131269454956} 11/07/2021 03:28:20 - INFO - __main__ - Step 44558: {'lr': 0.00040474431328568506, 'samples': 8555136, 'steps': 44557, 'loss/train': 1.3718769550323486} 11/07/2021 03:28:20 - INFO - __main__ - Step 44559: {'lr': 0.00040474014528651514, 'samples': 8555328, 'steps': 44558, 'loss/train': 1.8229775428771973} 11/07/2021 03:28:20 - INFO - __main__ - Step 44560: {'lr': 0.00040473597721762164, 'samples': 8555520, 'steps': 44559, 'loss/train': 0.5324205756187439} 11/07/2021 03:28:22 - INFO - __main__ - Step 44561: {'lr': 0.00040473180907900645, 'samples': 8555712, 'steps': 44560, 'loss/train': 0.978965699672699} 11/07/2021 03:28:22 - INFO - __main__ - Step 44562: {'lr': 0.0004047276408706716, 'samples': 8555904, 'steps': 44561, 'loss/train': 1.4872220754623413} 11/07/2021 03:28:22 - INFO - __main__ - Step 44563: {'lr': 0.00040472347259261875, 'samples': 8556096, 'steps': 44562, 'loss/train': 1.306357741355896} 11/07/2021 03:28:23 - INFO - __main__ - Step 44564: {'lr': 0.00040471930424485, 'samples': 8556288, 'steps': 44563, 'loss/train': 0.8524765968322754} 11/07/2021 03:28:23 - INFO - __main__ - Step 44565: {'lr': 0.0004047151358273671, 'samples': 8556480, 'steps': 44564, 'loss/train': 1.5363770723342896} 11/07/2021 03:28:24 - INFO - __main__ - Step 44566: {'lr': 0.00040471096734017185, 'samples': 8556672, 'steps': 44565, 'loss/train': 0.9901103377342224} 11/07/2021 03:28:24 - INFO - __main__ - Step 44567: {'lr': 0.0004047067987832663, 'samples': 8556864, 'steps': 44566, 'loss/train': 1.4449656009674072} 11/07/2021 03:28:25 - INFO - __main__ - Step 44568: {'lr': 0.00040470263015665234, 'samples': 8557056, 'steps': 44567, 'loss/train': 1.0699820518493652} 11/07/2021 03:28:25 - INFO - __main__ - Step 44569: {'lr': 0.00040469846146033164, 'samples': 8557248, 'steps': 44568, 'loss/train': 0.8161872625350952} 11/07/2021 03:28:25 - INFO - __main__ - Step 44570: {'lr': 0.00040469429269430617, 'samples': 8557440, 'steps': 44569, 'loss/train': 0.8928823471069336} 11/07/2021 03:28:26 - INFO - __main__ - Step 44571: {'lr': 0.00040469012385857794, 'samples': 8557632, 'steps': 44570, 'loss/train': 1.259041428565979} 11/07/2021 03:28:27 - INFO - __main__ - Step 44572: {'lr': 0.0004046859549531487, 'samples': 8557824, 'steps': 44571, 'loss/train': 1.3198877573013306} 11/07/2021 03:28:27 - INFO - __main__ - Step 44573: {'lr': 0.0004046817859780203, 'samples': 8558016, 'steps': 44572, 'loss/train': 1.0246363878250122} 11/07/2021 03:28:28 - INFO - __main__ - Step 44574: {'lr': 0.00040467761693319473, 'samples': 8558208, 'steps': 44573, 'loss/train': 1.1041311025619507} 11/07/2021 03:28:28 - INFO - __main__ - Step 44575: {'lr': 0.0004046734478186738, 'samples': 8558400, 'steps': 44574, 'loss/train': 1.3575173616409302} 11/07/2021 03:28:29 - INFO - __main__ - Step 44576: {'lr': 0.0004046692786344594, 'samples': 8558592, 'steps': 44575, 'loss/train': 1.237064003944397} 11/07/2021 03:28:29 - INFO - __main__ - Step 44577: {'lr': 0.0004046651093805534, 'samples': 8558784, 'steps': 44576, 'loss/train': 1.3218896389007568} 11/07/2021 03:28:30 - INFO - __main__ - Step 44578: {'lr': 0.0004046609400569577, 'samples': 8558976, 'steps': 44577, 'loss/train': 1.2574677467346191} 11/07/2021 03:28:30 - INFO - __main__ - Step 44579: {'lr': 0.00040465677066367424, 'samples': 8559168, 'steps': 44578, 'loss/train': 1.6030867099761963} 11/07/2021 03:28:30 - INFO - __main__ - Step 44580: {'lr': 0.0004046526012007047, 'samples': 8559360, 'steps': 44579, 'loss/train': 1.4407395124435425} 11/07/2021 03:28:31 - INFO - __main__ - Step 44581: {'lr': 0.0004046484316680511, 'samples': 8559552, 'steps': 44580, 'loss/train': 0.8182333111763} 11/07/2021 03:28:32 - INFO - __main__ - Step 44582: {'lr': 0.0004046442620657154, 'samples': 8559744, 'steps': 44581, 'loss/train': 1.3196399211883545} 11/07/2021 03:28:32 - INFO - __main__ - Step 44583: {'lr': 0.00040464009239369925, 'samples': 8559936, 'steps': 44582, 'loss/train': 1.4967689514160156} 11/07/2021 03:28:32 - INFO - __main__ - Step 44584: {'lr': 0.0004046359226520048, 'samples': 8560128, 'steps': 44583, 'loss/train': 1.42673921585083} 11/07/2021 03:28:33 - INFO - __main__ - Step 44585: {'lr': 0.0004046317528406337, 'samples': 8560320, 'steps': 44584, 'loss/train': 0.9684894680976868} 11/07/2021 03:28:34 - INFO - __main__ - Step 44586: {'lr': 0.0004046275829595879, 'samples': 8560512, 'steps': 44585, 'loss/train': 1.9079174995422363} 11/07/2021 03:28:34 - INFO - __main__ - Step 44587: {'lr': 0.0004046234130088694, 'samples': 8560704, 'steps': 44586, 'loss/train': 1.0878044366836548} 11/07/2021 03:28:34 - INFO - __main__ - Step 44588: {'lr': 0.00040461924298847987, 'samples': 8560896, 'steps': 44587, 'loss/train': 2.121105432510376} 11/07/2021 03:28:35 - INFO - __main__ - Step 44589: {'lr': 0.0004046150728984214, 'samples': 8561088, 'steps': 44588, 'loss/train': 1.2639451026916504} 11/07/2021 03:28:35 - INFO - __main__ - Step 44590: {'lr': 0.00040461090273869566, 'samples': 8561280, 'steps': 44589, 'loss/train': 1.383887529373169} 11/07/2021 03:28:36 - INFO - __main__ - Step 44591: {'lr': 0.0004046067325093047, 'samples': 8561472, 'steps': 44590, 'loss/train': 1.1634901762008667} 11/07/2021 03:28:37 - INFO - __main__ - Step 44592: {'lr': 0.00040460256221025025, 'samples': 8561664, 'steps': 44591, 'loss/train': 1.2618054151535034} 11/07/2021 03:28:37 - INFO - __main__ - Step 44593: {'lr': 0.00040459839184153436, 'samples': 8561856, 'steps': 44592, 'loss/train': 1.6785650253295898} 11/07/2021 03:28:37 - INFO - __main__ - Step 44594: {'lr': 0.00040459422140315876, 'samples': 8562048, 'steps': 44593, 'loss/train': 1.0582304000854492} 11/07/2021 03:28:38 - INFO - __main__ - Step 44595: {'lr': 0.00040459005089512544, 'samples': 8562240, 'steps': 44594, 'loss/train': 1.5507423877716064} 11/07/2021 03:28:38 - INFO - __main__ - Step 44596: {'lr': 0.0004045858803174362, 'samples': 8562432, 'steps': 44595, 'loss/train': 1.3976720571517944} 11/07/2021 03:28:39 - INFO - __main__ - Step 44597: {'lr': 0.0004045817096700929, 'samples': 8562624, 'steps': 44596, 'loss/train': 1.7540621757507324} 11/07/2021 03:28:39 - INFO - __main__ - Step 44598: {'lr': 0.0004045775389530976, 'samples': 8562816, 'steps': 44597, 'loss/train': 1.73148512840271} 11/07/2021 03:28:40 - INFO - __main__ - Step 44599: {'lr': 0.00040457336816645195, 'samples': 8563008, 'steps': 44598, 'loss/train': 1.1353774070739746} 11/07/2021 03:28:40 - INFO - __main__ - Step 44600: {'lr': 0.000404569197310158, 'samples': 8563200, 'steps': 44599, 'loss/train': 1.5680984258651733} 11/07/2021 03:28:40 - INFO - __main__ - Step 44601: {'lr': 0.0004045650263842174, 'samples': 8563392, 'steps': 44600, 'loss/train': 1.0853443145751953} 11/07/2021 03:28:41 - INFO - __main__ - Step 44602: {'lr': 0.0004045608553886323, 'samples': 8563584, 'steps': 44601, 'loss/train': 1.4845424890518188} 11/07/2021 03:28:42 - INFO - __main__ - Step 44603: {'lr': 0.0004045566843234044, 'samples': 8563776, 'steps': 44602, 'loss/train': 1.0669195652008057} 11/07/2021 03:28:42 - INFO - __main__ - Step 44604: {'lr': 0.0004045525131885357, 'samples': 8563968, 'steps': 44603, 'loss/train': 1.5860636234283447} 11/07/2021 03:28:42 - INFO - __main__ - Step 44605: {'lr': 0.0004045483419840281, 'samples': 8564160, 'steps': 44604, 'loss/train': 1.49483060836792} 11/07/2021 03:28:43 - INFO - __main__ - Step 44606: {'lr': 0.00040454417070988325, 'samples': 8564352, 'steps': 44605, 'loss/train': 1.434079647064209} 11/07/2021 03:28:44 - INFO - __main__ - Step 44607: {'lr': 0.0004045399993661033, 'samples': 8564544, 'steps': 44606, 'loss/train': 1.501473069190979} 11/07/2021 03:28:44 - INFO - __main__ - Step 44608: {'lr': 0.00040453582795268994, 'samples': 8564736, 'steps': 44607, 'loss/train': 0.5401839017868042} 11/07/2021 03:28:45 - INFO - __main__ - Step 44609: {'lr': 0.00040453165646964505, 'samples': 8564928, 'steps': 44608, 'loss/train': 1.4329159259796143} 11/07/2021 03:28:45 - INFO - __main__ - Step 44610: {'lr': 0.00040452748491697074, 'samples': 8565120, 'steps': 44609, 'loss/train': 1.2819854021072388} 11/07/2021 03:28:46 - INFO - __main__ - Step 44611: {'lr': 0.00040452331329466864, 'samples': 8565312, 'steps': 44610, 'loss/train': 1.503104329109192} 11/07/2021 03:28:46 - INFO - __main__ - Step 44612: {'lr': 0.0004045191416027407, 'samples': 8565504, 'steps': 44611, 'loss/train': 0.9699654579162598} 11/07/2021 03:28:47 - INFO - __main__ - Step 44613: {'lr': 0.0004045149698411889, 'samples': 8565696, 'steps': 44612, 'loss/train': 0.3859573304653168} 11/07/2021 03:28:47 - INFO - __main__ - Step 44614: {'lr': 0.000404510798010015, 'samples': 8565888, 'steps': 44613, 'loss/train': 2.097834587097168} 11/07/2021 03:28:48 - INFO - __main__ - Step 44615: {'lr': 0.0004045066261092209, 'samples': 8566080, 'steps': 44614, 'loss/train': 1.0142295360565186} 11/07/2021 03:28:48 - INFO - __main__ - Step 44616: {'lr': 0.0004045024541388085, 'samples': 8566272, 'steps': 44615, 'loss/train': 1.4237236976623535} 11/07/2021 03:28:48 - INFO - __main__ - Step 44617: {'lr': 0.0004044982820987797, 'samples': 8566464, 'steps': 44616, 'loss/train': 1.5492023229599} 11/07/2021 03:28:49 - INFO - __main__ - Step 44618: {'lr': 0.0004044941099891364, 'samples': 8566656, 'steps': 44617, 'loss/train': 1.244526743888855} 11/07/2021 03:28:50 - INFO - __main__ - Step 44619: {'lr': 0.0004044899378098803, 'samples': 8566848, 'steps': 44618, 'loss/train': 1.1378777027130127} 11/07/2021 03:28:50 - INFO - __main__ - Step 44620: {'lr': 0.00040448576556101356, 'samples': 8567040, 'steps': 44619, 'loss/train': 1.8612985610961914} 11/07/2021 03:28:51 - INFO - __main__ - Step 44621: {'lr': 0.0004044815932425379, 'samples': 8567232, 'steps': 44620, 'loss/train': 2.050100088119507} 11/07/2021 03:28:51 - INFO - __main__ - Step 44622: {'lr': 0.0004044774208544551, 'samples': 8567424, 'steps': 44621, 'loss/train': 1.3216272592544556} 11/07/2021 03:28:52 - INFO - __main__ - Step 44623: {'lr': 0.00040447324839676727, 'samples': 8567616, 'steps': 44622, 'loss/train': 1.1530163288116455} 11/07/2021 03:28:52 - INFO - __main__ - Step 44624: {'lr': 0.00040446907586947614, 'samples': 8567808, 'steps': 44623, 'loss/train': 1.5983481407165527} 11/07/2021 03:28:53 - INFO - __main__ - Step 44625: {'lr': 0.0004044649032725836, 'samples': 8568000, 'steps': 44624, 'loss/train': 1.199302315711975} 11/07/2021 03:28:53 - INFO - __main__ - Step 44626: {'lr': 0.00040446073060609156, 'samples': 8568192, 'steps': 44625, 'loss/train': 1.1727397441864014} 11/07/2021 03:28:53 - INFO - __main__ - Step 44627: {'lr': 0.00040445655787000196, 'samples': 8568384, 'steps': 44626, 'loss/train': 1.0947909355163574} 11/07/2021 03:28:54 - INFO - __main__ - Step 44628: {'lr': 0.0004044523850643166, 'samples': 8568576, 'steps': 44627, 'loss/train': 1.65569269657135} 11/07/2021 03:28:55 - INFO - __main__ - Step 44629: {'lr': 0.0004044482121890374, 'samples': 8568768, 'steps': 44628, 'loss/train': 1.7430261373519897} 11/07/2021 03:28:55 - INFO - __main__ - Step 44630: {'lr': 0.00040444403924416614, 'samples': 8568960, 'steps': 44629, 'loss/train': 1.1327166557312012} 11/07/2021 03:28:55 - INFO - __main__ - Step 44631: {'lr': 0.00040443986622970486, 'samples': 8569152, 'steps': 44630, 'loss/train': 1.8160648345947266} 11/07/2021 03:28:56 - INFO - __main__ - Step 44632: {'lr': 0.0004044356931456553, 'samples': 8569344, 'steps': 44631, 'loss/train': 1.0969573259353638} 11/07/2021 03:28:57 - INFO - __main__ - Step 44633: {'lr': 0.00040443151999201946, 'samples': 8569536, 'steps': 44632, 'loss/train': 1.370672345161438} 11/07/2021 03:28:57 - INFO - __main__ - Step 44634: {'lr': 0.00040442734676879907, 'samples': 8569728, 'steps': 44633, 'loss/train': 1.0866888761520386} 11/07/2021 03:28:57 - INFO - __main__ - Step 44635: {'lr': 0.0004044231734759961, 'samples': 8569920, 'steps': 44634, 'loss/train': 1.5982840061187744} 11/07/2021 03:28:58 - INFO - __main__ - Step 44636: {'lr': 0.00040441900011361256, 'samples': 8570112, 'steps': 44635, 'loss/train': 1.1409536600112915} 11/07/2021 03:28:58 - INFO - __main__ - Step 44637: {'lr': 0.0004044148266816501, 'samples': 8570304, 'steps': 44636, 'loss/train': 1.6011314392089844} 11/07/2021 03:28:59 - INFO - __main__ - Step 44638: {'lr': 0.0004044106531801107, 'samples': 8570496, 'steps': 44637, 'loss/train': 1.7269203662872314} 11/07/2021 03:29:00 - INFO - __main__ - Step 44639: {'lr': 0.0004044064796089963, 'samples': 8570688, 'steps': 44638, 'loss/train': 1.596115231513977} 11/07/2021 03:29:00 - INFO - __main__ - Step 44640: {'lr': 0.0004044023059683087, 'samples': 8570880, 'steps': 44639, 'loss/train': 1.7004880905151367} 11/07/2021 03:29:00 - INFO - __main__ - Step 44641: {'lr': 0.00040439813225804977, 'samples': 8571072, 'steps': 44640, 'loss/train': 1.777889609336853} 11/07/2021 03:29:01 - INFO - __main__ - Step 44642: {'lr': 0.00040439395847822145, 'samples': 8571264, 'steps': 44641, 'loss/train': 1.8670952320098877} 11/07/2021 03:29:02 - INFO - __main__ - Step 44643: {'lr': 0.00040438978462882557, 'samples': 8571456, 'steps': 44642, 'loss/train': 1.3878732919692993} 11/07/2021 03:29:02 - INFO - __main__ - Step 44644: {'lr': 0.0004043856107098641, 'samples': 8571648, 'steps': 44643, 'loss/train': 1.3951754570007324} 11/07/2021 03:29:02 - INFO - __main__ - Step 44645: {'lr': 0.0004043814367213388, 'samples': 8571840, 'steps': 44644, 'loss/train': 1.39116370677948} 11/07/2021 03:29:03 - INFO - __main__ - Step 44646: {'lr': 0.00040437726266325164, 'samples': 8572032, 'steps': 44645, 'loss/train': 1.547196388244629} 11/07/2021 03:29:03 - INFO - __main__ - Step 44647: {'lr': 0.00040437308853560444, 'samples': 8572224, 'steps': 44646, 'loss/train': 1.3838136196136475} 11/07/2021 03:29:04 - INFO - __main__ - Step 44648: {'lr': 0.0004043689143383991, 'samples': 8572416, 'steps': 44647, 'loss/train': 1.6577091217041016} 11/07/2021 03:29:04 - INFO - __main__ - Step 44649: {'lr': 0.00040436474007163754, 'samples': 8572608, 'steps': 44648, 'loss/train': 1.0719048976898193} 11/07/2021 03:29:05 - INFO - __main__ - Step 44650: {'lr': 0.0004043605657353216, 'samples': 8572800, 'steps': 44649, 'loss/train': 1.302512288093567} 11/07/2021 03:29:05 - INFO - __main__ - Step 44651: {'lr': 0.00040435639132945314, 'samples': 8572992, 'steps': 44650, 'loss/train': 1.2456927299499512} 11/07/2021 03:29:05 - INFO - __main__ - Step 44652: {'lr': 0.0004043522168540341, 'samples': 8573184, 'steps': 44651, 'loss/train': 1.5196424722671509} 11/07/2021 03:29:06 - INFO - __main__ - Step 44653: {'lr': 0.0004043480423090664, 'samples': 8573376, 'steps': 44652, 'loss/train': 1.4495714902877808} 11/07/2021 03:29:07 - INFO - __main__ - Step 44654: {'lr': 0.0004043438676945518, 'samples': 8573568, 'steps': 44653, 'loss/train': 1.6864550113677979} 11/07/2021 03:29:07 - INFO - __main__ - Step 44655: {'lr': 0.0004043396930104922, 'samples': 8573760, 'steps': 44654, 'loss/train': 1.440232515335083} 11/07/2021 03:29:07 - INFO - __main__ - Step 44656: {'lr': 0.0004043355182568895, 'samples': 8573952, 'steps': 44655, 'loss/train': 1.5735957622528076} 11/07/2021 03:29:08 - INFO - __main__ - Step 44657: {'lr': 0.00040433134343374565, 'samples': 8574144, 'steps': 44656, 'loss/train': 1.6642217636108398} 11/07/2021 03:29:08 - INFO - __main__ - Step 44658: {'lr': 0.0004043271685410625, 'samples': 8574336, 'steps': 44657, 'loss/train': 1.1298478841781616} 11/07/2021 03:29:09 - INFO - __main__ - Step 44659: {'lr': 0.00040432299357884185, 'samples': 8574528, 'steps': 44658, 'loss/train': 1.385873794555664} 11/07/2021 03:29:10 - INFO - __main__ - Step 44660: {'lr': 0.0004043188185470856, 'samples': 8574720, 'steps': 44659, 'loss/train': 1.2078728675842285} 11/07/2021 03:29:10 - INFO - __main__ - Step 44661: {'lr': 0.00040431464344579585, 'samples': 8574912, 'steps': 44660, 'loss/train': 1.7153549194335938} 11/07/2021 03:29:10 - INFO - __main__ - Step 44662: {'lr': 0.00040431046827497415, 'samples': 8575104, 'steps': 44661, 'loss/train': 1.582525610923767} 11/07/2021 03:29:11 - INFO - __main__ - Step 44663: {'lr': 0.00040430629303462256, 'samples': 8575296, 'steps': 44662, 'loss/train': 1.4888883829116821} 11/07/2021 03:29:12 - INFO - __main__ - Step 44664: {'lr': 0.000404302117724743, 'samples': 8575488, 'steps': 44663, 'loss/train': 1.955739974975586} 11/07/2021 03:29:12 - INFO - __main__ - Step 44665: {'lr': 0.00040429794234533726, 'samples': 8575680, 'steps': 44664, 'loss/train': 1.2550443410873413} 11/07/2021 03:29:12 - INFO - __main__ - Step 44666: {'lr': 0.0004042937668964072, 'samples': 8575872, 'steps': 44665, 'loss/train': 2.086137056350708} 11/07/2021 03:29:13 - INFO - __main__ - Step 44667: {'lr': 0.00040428959137795475, 'samples': 8576064, 'steps': 44666, 'loss/train': 1.512468695640564} 11/07/2021 03:29:13 - INFO - __main__ - Step 44668: {'lr': 0.0004042854157899818, 'samples': 8576256, 'steps': 44667, 'loss/train': 1.9976961612701416} 11/07/2021 03:29:13 - INFO - __main__ - Step 44669: {'lr': 0.0004042812401324902, 'samples': 8576448, 'steps': 44668, 'loss/train': 1.7271101474761963} 11/07/2021 03:29:14 - INFO - __main__ - Step 44670: {'lr': 0.0004042770644054819, 'samples': 8576640, 'steps': 44669, 'loss/train': 1.6623976230621338} 11/07/2021 03:29:15 - INFO - __main__ - Step 44671: {'lr': 0.0004042728886089587, 'samples': 8576832, 'steps': 44670, 'loss/train': 1.8340626955032349} 11/07/2021 03:29:15 - INFO - __main__ - Step 44672: {'lr': 0.00040426871274292257, 'samples': 8577024, 'steps': 44671, 'loss/train': 1.5917364358901978} 11/07/2021 03:29:16 - INFO - __main__ - Step 44673: {'lr': 0.00040426453680737534, 'samples': 8577216, 'steps': 44672, 'loss/train': 1.7523205280303955} 11/07/2021 03:29:16 - INFO - __main__ - Step 44674: {'lr': 0.0004042603608023189, 'samples': 8577408, 'steps': 44673, 'loss/train': 1.5502620935440063} 11/07/2021 03:29:17 - INFO - __main__ - Step 44675: {'lr': 0.00040425618472775504, 'samples': 8577600, 'steps': 44674, 'loss/train': 1.4467202425003052} 11/07/2021 03:29:17 - INFO - __main__ - Step 44676: {'lr': 0.0004042520085836857, 'samples': 8577792, 'steps': 44675, 'loss/train': 1.835953712463379} 11/07/2021 03:29:18 - INFO - __main__ - Step 44677: {'lr': 0.0004042478323701129, 'samples': 8577984, 'steps': 44676, 'loss/train': 1.275037169456482} 11/07/2021 03:29:18 - INFO - __main__ - Step 44678: {'lr': 0.00040424365608703836, 'samples': 8578176, 'steps': 44677, 'loss/train': 1.4688793420791626} 11/07/2021 03:29:18 - INFO - __main__ - Step 44679: {'lr': 0.00040423947973446404, 'samples': 8578368, 'steps': 44678, 'loss/train': 1.7626153230667114} 11/07/2021 03:29:19 - INFO - __main__ - Step 44680: {'lr': 0.00040423530331239177, 'samples': 8578560, 'steps': 44679, 'loss/train': 1.0789210796356201} 11/07/2021 03:29:20 - INFO - __main__ - Step 44681: {'lr': 0.0004042311268208234, 'samples': 8578752, 'steps': 44680, 'loss/train': 2.3214893341064453} 11/07/2021 03:29:20 - INFO - __main__ - Step 44682: {'lr': 0.00040422695025976084, 'samples': 8578944, 'steps': 44681, 'loss/train': 1.3355746269226074} 11/07/2021 03:29:21 - INFO - __main__ - Step 44683: {'lr': 0.00040422277362920614, 'samples': 8579136, 'steps': 44682, 'loss/train': 0.9555754661560059} 11/07/2021 03:29:21 - INFO - __main__ - Step 44684: {'lr': 0.0004042185969291609, 'samples': 8579328, 'steps': 44683, 'loss/train': 1.8307428359985352} 11/07/2021 03:29:22 - INFO - __main__ - Step 44685: {'lr': 0.00040421442015962727, 'samples': 8579520, 'steps': 44684, 'loss/train': 1.598419189453125} 11/07/2021 03:29:22 - INFO - __main__ - Step 44686: {'lr': 0.0004042102433206069, 'samples': 8579712, 'steps': 44685, 'loss/train': 1.1682043075561523} 11/07/2021 03:29:23 - INFO - __main__ - Step 44687: {'lr': 0.0004042060664121018, 'samples': 8579904, 'steps': 44686, 'loss/train': 0.7249529361724854} 11/07/2021 03:29:23 - INFO - __main__ - Step 44688: {'lr': 0.00040420188943411385, 'samples': 8580096, 'steps': 44687, 'loss/train': 1.834280014038086} 11/07/2021 03:29:23 - INFO - __main__ - Step 44689: {'lr': 0.0004041977123866448, 'samples': 8580288, 'steps': 44688, 'loss/train': 1.1947107315063477} 11/07/2021 03:29:24 - INFO - __main__ - Step 44690: {'lr': 0.0004041935352696968, 'samples': 8580480, 'steps': 44689, 'loss/train': 1.2123805284500122} 11/07/2021 03:29:25 - INFO - __main__ - Step 44691: {'lr': 0.00040418935808327153, 'samples': 8580672, 'steps': 44690, 'loss/train': 0.9247638583183289} 11/07/2021 03:29:25 - INFO - __main__ - Step 44692: {'lr': 0.00040418518082737087, 'samples': 8580864, 'steps': 44691, 'loss/train': 1.1153234243392944} 11/07/2021 03:29:25 - INFO - __main__ - Step 44693: {'lr': 0.0004041810035019967, 'samples': 8581056, 'steps': 44692, 'loss/train': 1.3872288465499878} 11/07/2021 03:29:26 - INFO - __main__ - Step 44694: {'lr': 0.00040417682610715107, 'samples': 8581248, 'steps': 44693, 'loss/train': 1.0482349395751953} 11/07/2021 03:29:27 - INFO - __main__ - Step 44695: {'lr': 0.00040417264864283563, 'samples': 8581440, 'steps': 44694, 'loss/train': 1.252543568611145} 11/07/2021 03:29:27 - INFO - __main__ - Step 44696: {'lr': 0.00040416847110905243, 'samples': 8581632, 'steps': 44695, 'loss/train': 1.1403671503067017} 11/07/2021 03:29:28 - INFO - __main__ - Step 44697: {'lr': 0.0004041642935058033, 'samples': 8581824, 'steps': 44696, 'loss/train': 1.35580313205719} 11/07/2021 03:29:28 - INFO - __main__ - Step 44698: {'lr': 0.0004041601158330901, 'samples': 8582016, 'steps': 44697, 'loss/train': 1.7104305028915405} 11/07/2021 03:29:28 - INFO - __main__ - Step 44699: {'lr': 0.0004041559380909148, 'samples': 8582208, 'steps': 44698, 'loss/train': 1.700950264930725} 11/07/2021 03:29:29 - INFO - __main__ - Step 44700: {'lr': 0.00040415176027927915, 'samples': 8582400, 'steps': 44699, 'loss/train': 1.3662071228027344} 11/07/2021 03:29:31 - INFO - __main__ - Step 44701: {'lr': 0.00040414758239818506, 'samples': 8582592, 'steps': 44700, 'loss/train': 1.6446884870529175} 11/07/2021 03:29:31 - INFO - __main__ - Step 44702: {'lr': 0.00040414340444763455, 'samples': 8582784, 'steps': 44701, 'loss/train': 1.5361683368682861} 11/07/2021 03:29:32 - INFO - __main__ - Step 44703: {'lr': 0.0004041392264276292, 'samples': 8582976, 'steps': 44702, 'loss/train': 1.8980095386505127} 11/07/2021 03:29:32 - INFO - __main__ - Step 44704: {'lr': 0.00040413504833817127, 'samples': 8583168, 'steps': 44703, 'loss/train': 1.8210697174072266} 11/07/2021 03:29:32 - INFO - __main__ - Step 44705: {'lr': 0.0004041308701792625, 'samples': 8583360, 'steps': 44704, 'loss/train': 1.8195379972457886} 11/07/2021 03:29:33 - INFO - __main__ - Step 44706: {'lr': 0.00040412669195090466, 'samples': 8583552, 'steps': 44705, 'loss/train': 1.787896752357483} 11/07/2021 03:29:33 - INFO - __main__ - Step 44707: {'lr': 0.0004041225136530997, 'samples': 8583744, 'steps': 44706, 'loss/train': 1.7658010721206665} 11/07/2021 03:29:34 - INFO - __main__ - Step 44708: {'lr': 0.0004041183352858495, 'samples': 8583936, 'steps': 44707, 'loss/train': 1.7904207706451416} 11/07/2021 03:29:34 - INFO - __main__ - Step 44709: {'lr': 0.00040411415684915596, 'samples': 8584128, 'steps': 44708, 'loss/train': 1.2527177333831787} 11/07/2021 03:29:35 - INFO - __main__ - Step 44710: {'lr': 0.000404109978343021, 'samples': 8584320, 'steps': 44709, 'loss/train': 1.5017738342285156} 11/07/2021 03:29:35 - INFO - __main__ - Step 44711: {'lr': 0.0004041057997674464, 'samples': 8584512, 'steps': 44710, 'loss/train': 1.3378225564956665} 11/07/2021 03:29:36 - INFO - __main__ - Step 44712: {'lr': 0.0004041016211224342, 'samples': 8584704, 'steps': 44711, 'loss/train': 1.3807090520858765} 11/07/2021 03:29:37 - INFO - __main__ - Step 44713: {'lr': 0.0004040974424079862, 'samples': 8584896, 'steps': 44712, 'loss/train': 1.5382866859436035} 11/07/2021 03:29:37 - INFO - __main__ - Step 44714: {'lr': 0.00040409326362410416, 'samples': 8585088, 'steps': 44713, 'loss/train': 1.2872508764266968} 11/07/2021 03:29:37 - INFO - __main__ - Step 44715: {'lr': 0.0004040890847707901, 'samples': 8585280, 'steps': 44714, 'loss/train': 1.6032570600509644} 11/07/2021 03:29:38 - INFO - __main__ - Step 44716: {'lr': 0.0004040849058480459, 'samples': 8585472, 'steps': 44715, 'loss/train': 0.18380995094776154} 11/07/2021 03:29:38 - INFO - __main__ - Step 44717: {'lr': 0.0004040807268558734, 'samples': 8585664, 'steps': 44716, 'loss/train': 1.8361215591430664} 11/07/2021 03:29:38 - INFO - __main__ - Step 44718: {'lr': 0.0004040765477942745, 'samples': 8585856, 'steps': 44717, 'loss/train': 1.7135276794433594} 11/07/2021 03:29:39 - INFO - __main__ - Step 44719: {'lr': 0.0004040723686632512, 'samples': 8586048, 'steps': 44718, 'loss/train': 1.422973394393921} 11/07/2021 03:29:40 - INFO - __main__ - Step 44720: {'lr': 0.00040406818946280514, 'samples': 8586240, 'steps': 44719, 'loss/train': 0.0921250507235527} 11/07/2021 03:29:40 - INFO - __main__ - Step 44721: {'lr': 0.0004040640101929384, 'samples': 8586432, 'steps': 44720, 'loss/train': 1.2875210046768188} 11/07/2021 03:29:41 - INFO - __main__ - Step 44722: {'lr': 0.0004040598308536527, 'samples': 8586624, 'steps': 44721, 'loss/train': 1.5017659664154053} 11/07/2021 03:29:41 - INFO - __main__ - Step 44723: {'lr': 0.0004040556514449501, 'samples': 8586816, 'steps': 44722, 'loss/train': 1.0461225509643555} 11/07/2021 03:29:42 - INFO - __main__ - Step 44724: {'lr': 0.0004040514719668324, 'samples': 8587008, 'steps': 44723, 'loss/train': 1.646589756011963} 11/07/2021 03:29:42 - INFO - __main__ - Step 44725: {'lr': 0.00040404729241930144, 'samples': 8587200, 'steps': 44724, 'loss/train': 1.349840521812439} 11/07/2021 03:29:43 - INFO - __main__ - Step 44726: {'lr': 0.0004040431128023592, 'samples': 8587392, 'steps': 44725, 'loss/train': 1.336923599243164} 11/07/2021 03:29:43 - INFO - __main__ - Step 44727: {'lr': 0.0004040389331160075, 'samples': 8587584, 'steps': 44726, 'loss/train': 1.6357241868972778} 11/07/2021 03:29:43 - INFO - __main__ - Step 44728: {'lr': 0.00040403475336024816, 'samples': 8587776, 'steps': 44727, 'loss/train': 1.8353484869003296} 11/07/2021 03:29:45 - INFO - __main__ - Step 44729: {'lr': 0.0004040305735350832, 'samples': 8587968, 'steps': 44728, 'loss/train': 1.692384123802185} 11/07/2021 03:29:45 - INFO - __main__ - Step 44730: {'lr': 0.00040402639364051443, 'samples': 8588160, 'steps': 44729, 'loss/train': 1.8110960721969604} 11/07/2021 03:29:45 - INFO - __main__ - Step 44731: {'lr': 0.0004040222136765437, 'samples': 8588352, 'steps': 44730, 'loss/train': 1.1847339868545532} 11/07/2021 03:29:46 - INFO - __main__ - Step 44732: {'lr': 0.000404018033643173, 'samples': 8588544, 'steps': 44731, 'loss/train': 1.5125484466552734} 11/07/2021 03:29:46 - INFO - __main__ - Step 44733: {'lr': 0.00040401385354040415, 'samples': 8588736, 'steps': 44732, 'loss/train': 1.4958770275115967} 11/07/2021 03:29:46 - INFO - __main__ - Step 44734: {'lr': 0.00040400967336823903, 'samples': 8588928, 'steps': 44733, 'loss/train': 1.5879772901535034} 11/07/2021 03:29:47 - INFO - __main__ - Step 44735: {'lr': 0.0004040054931266795, 'samples': 8589120, 'steps': 44734, 'loss/train': 1.4642417430877686} 11/07/2021 03:29:48 - INFO - __main__ - Step 44736: {'lr': 0.0004040013128157275, 'samples': 8589312, 'steps': 44735, 'loss/train': 1.6132031679153442} 11/07/2021 03:29:48 - INFO - __main__ - Step 44737: {'lr': 0.00040399713243538483, 'samples': 8589504, 'steps': 44736, 'loss/train': 0.6882599592208862} 11/07/2021 03:29:48 - INFO - __main__ - Step 44738: {'lr': 0.00040399295198565344, 'samples': 8589696, 'steps': 44737, 'loss/train': 1.5418144464492798} 11/07/2021 03:29:49 - INFO - __main__ - Step 44739: {'lr': 0.0004039887714665352, 'samples': 8589888, 'steps': 44738, 'loss/train': 1.4107799530029297} 11/07/2021 03:29:50 - INFO - __main__ - Step 44740: {'lr': 0.0004039845908780321, 'samples': 8590080, 'steps': 44739, 'loss/train': 1.3109534978866577} 11/07/2021 03:29:50 - INFO - __main__ - Step 44741: {'lr': 0.00040398041022014585, 'samples': 8590272, 'steps': 44740, 'loss/train': 1.3458914756774902} 11/07/2021 03:29:51 - INFO - __main__ - Step 44742: {'lr': 0.0004039762294928784, 'samples': 8590464, 'steps': 44741, 'loss/train': 1.0974065065383911} 11/07/2021 03:29:51 - INFO - __main__ - Step 44743: {'lr': 0.0004039720486962316, 'samples': 8590656, 'steps': 44742, 'loss/train': 1.3014174699783325} 11/07/2021 03:29:51 - INFO - __main__ - Step 44744: {'lr': 0.00040396786783020747, 'samples': 8590848, 'steps': 44743, 'loss/train': 1.29957115650177} 11/07/2021 03:29:52 - INFO - __main__ - Step 44745: {'lr': 0.00040396368689480766, 'samples': 8591040, 'steps': 44744, 'loss/train': 1.4198511838912964} 11/07/2021 03:29:53 - INFO - __main__ - Step 44746: {'lr': 0.00040395950589003425, 'samples': 8591232, 'steps': 44745, 'loss/train': 1.7467460632324219} 11/07/2021 03:29:53 - INFO - __main__ - Step 44747: {'lr': 0.00040395532481588914, 'samples': 8591424, 'steps': 44746, 'loss/train': 1.654396414756775} 11/07/2021 03:29:53 - INFO - __main__ - Step 44748: {'lr': 0.00040395114367237407, 'samples': 8591616, 'steps': 44747, 'loss/train': 1.1026297807693481} 11/07/2021 03:29:54 - INFO - __main__ - Step 44749: {'lr': 0.00040394696245949093, 'samples': 8591808, 'steps': 44748, 'loss/train': 1.638931155204773} 11/07/2021 03:29:55 - INFO - __main__ - Step 44750: {'lr': 0.0004039427811772417, 'samples': 8592000, 'steps': 44749, 'loss/train': 1.6664801836013794} 11/07/2021 03:29:55 - INFO - __main__ - Step 44751: {'lr': 0.0004039385998256283, 'samples': 8592192, 'steps': 44750, 'loss/train': 1.2520142793655396} 11/07/2021 03:29:55 - INFO - __main__ - Step 44752: {'lr': 0.0004039344184046525, 'samples': 8592384, 'steps': 44751, 'loss/train': 1.4764256477355957} 11/07/2021 03:29:56 - INFO - __main__ - Step 44753: {'lr': 0.00040393023691431617, 'samples': 8592576, 'steps': 44752, 'loss/train': 1.5981147289276123} 11/07/2021 03:29:56 - INFO - __main__ - Step 44754: {'lr': 0.00040392605535462137, 'samples': 8592768, 'steps': 44753, 'loss/train': 1.1540776491165161} 11/07/2021 03:29:57 - INFO - __main__ - Step 44755: {'lr': 0.00040392187372556977, 'samples': 8592960, 'steps': 44754, 'loss/train': 1.6268656253814697} 11/07/2021 03:29:57 - INFO - __main__ - Step 44756: {'lr': 0.00040391769202716333, 'samples': 8593152, 'steps': 44755, 'loss/train': 1.6105319261550903} 11/07/2021 03:29:58 - INFO - __main__ - Step 44757: {'lr': 0.00040391351025940406, 'samples': 8593344, 'steps': 44756, 'loss/train': 1.4816174507141113} 11/07/2021 03:29:58 - INFO - __main__ - Step 44758: {'lr': 0.00040390932842229363, 'samples': 8593536, 'steps': 44757, 'loss/train': 1.7078075408935547} 11/07/2021 03:29:59 - INFO - __main__ - Step 44759: {'lr': 0.0004039051465158341, 'samples': 8593728, 'steps': 44758, 'loss/train': 1.6025429964065552} 11/07/2021 03:29:59 - INFO - __main__ - Step 44760: {'lr': 0.0004039009645400272, 'samples': 8593920, 'steps': 44759, 'loss/train': 1.4869424104690552} 11/07/2021 03:30:00 - INFO - __main__ - Step 44761: {'lr': 0.00040389678249487504, 'samples': 8594112, 'steps': 44760, 'loss/train': 1.7148278951644897} 11/07/2021 03:30:00 - INFO - __main__ - Step 44762: {'lr': 0.00040389260038037924, 'samples': 8594304, 'steps': 44761, 'loss/train': 0.8487159609794617} 11/07/2021 03:30:01 - INFO - __main__ - Step 44763: {'lr': 0.0004038884181965419, 'samples': 8594496, 'steps': 44762, 'loss/train': 1.2550697326660156} 11/07/2021 03:30:01 - INFO - __main__ - Step 44764: {'lr': 0.0004038842359433647, 'samples': 8594688, 'steps': 44763, 'loss/train': 1.3257486820220947} 11/07/2021 03:30:01 - INFO - __main__ - Step 44765: {'lr': 0.0004038800536208497, 'samples': 8594880, 'steps': 44764, 'loss/train': 1.4142779111862183} 11/07/2021 03:30:02 - INFO - __main__ - Step 44766: {'lr': 0.00040387587122899877, 'samples': 8595072, 'steps': 44765, 'loss/train': 1.2997912168502808} 11/07/2021 03:30:03 - INFO - __main__ - Step 44767: {'lr': 0.0004038716887678137, 'samples': 8595264, 'steps': 44766, 'loss/train': 1.6130790710449219} 11/07/2021 03:30:03 - INFO - __main__ - Step 44768: {'lr': 0.0004038675062372964, 'samples': 8595456, 'steps': 44767, 'loss/train': 1.2441644668579102} 11/07/2021 03:30:03 - INFO - __main__ - Step 44769: {'lr': 0.00040386332363744884, 'samples': 8595648, 'steps': 44768, 'loss/train': 1.7680883407592773} 11/07/2021 03:30:04 - INFO - __main__ - Step 44770: {'lr': 0.0004038591409682728, 'samples': 8595840, 'steps': 44769, 'loss/train': 1.4171826839447021} 11/07/2021 03:30:05 - INFO - __main__ - Step 44771: {'lr': 0.00040385495822977015, 'samples': 8596032, 'steps': 44770, 'loss/train': 1.5394569635391235} 11/07/2021 03:30:05 - INFO - __main__ - Step 44772: {'lr': 0.00040385077542194294, 'samples': 8596224, 'steps': 44771, 'loss/train': 1.4642919301986694} 11/07/2021 03:30:05 - INFO - __main__ - Step 44773: {'lr': 0.0004038465925447929, 'samples': 8596416, 'steps': 44772, 'loss/train': 1.5884572267532349} 11/07/2021 03:30:06 - INFO - __main__ - Step 44774: {'lr': 0.00040384240959832196, 'samples': 8596608, 'steps': 44773, 'loss/train': 1.3437590599060059} 11/07/2021 03:30:06 - INFO - __main__ - Step 44775: {'lr': 0.000403838226582532, 'samples': 8596800, 'steps': 44774, 'loss/train': 1.4359939098358154} 11/07/2021 03:30:07 - INFO - __main__ - Step 44776: {'lr': 0.00040383404349742484, 'samples': 8596992, 'steps': 44775, 'loss/train': 1.4305771589279175} 11/07/2021 03:30:08 - INFO - __main__ - Step 44777: {'lr': 0.0004038298603430025, 'samples': 8597184, 'steps': 44776, 'loss/train': 1.1964409351348877} 11/07/2021 03:30:08 - INFO - __main__ - Step 44778: {'lr': 0.0004038256771192668, 'samples': 8597376, 'steps': 44777, 'loss/train': 1.8335461616516113} 11/07/2021 03:30:08 - INFO - __main__ - Step 44779: {'lr': 0.00040382149382621967, 'samples': 8597568, 'steps': 44778, 'loss/train': 1.5217831134796143} 11/07/2021 03:30:09 - INFO - __main__ - Step 44780: {'lr': 0.00040381731046386295, 'samples': 8597760, 'steps': 44779, 'loss/train': 1.361086130142212} 11/07/2021 03:30:10 - INFO - __main__ - Step 44781: {'lr': 0.0004038131270321984, 'samples': 8597952, 'steps': 44780, 'loss/train': 1.297823429107666} 11/07/2021 03:30:10 - INFO - __main__ - Step 44782: {'lr': 0.0004038089435312281, 'samples': 8598144, 'steps': 44781, 'loss/train': 1.009280800819397} 11/07/2021 03:30:10 - INFO - __main__ - Step 44783: {'lr': 0.0004038047599609539, 'samples': 8598336, 'steps': 44782, 'loss/train': 0.9909611344337463} 11/07/2021 03:30:11 - INFO - __main__ - Step 44784: {'lr': 0.00040380057632137756, 'samples': 8598528, 'steps': 44783, 'loss/train': 1.3801733255386353} 11/07/2021 03:30:11 - INFO - __main__ - Step 44785: {'lr': 0.0004037963926125011, 'samples': 8598720, 'steps': 44784, 'loss/train': 1.8758233785629272} 11/07/2021 03:30:12 - INFO - __main__ - Step 44786: {'lr': 0.00040379220883432644, 'samples': 8598912, 'steps': 44785, 'loss/train': 1.2698369026184082} 11/07/2021 03:30:12 - INFO - __main__ - Step 44787: {'lr': 0.0004037880249868553, 'samples': 8599104, 'steps': 44786, 'loss/train': 1.7087428569793701} 11/07/2021 03:30:13 - INFO - __main__ - Step 44788: {'lr': 0.00040378384107008967, 'samples': 8599296, 'steps': 44787, 'loss/train': 1.9226173162460327} 11/07/2021 03:30:13 - INFO - __main__ - Step 44789: {'lr': 0.00040377965708403133, 'samples': 8599488, 'steps': 44788, 'loss/train': 1.6939905881881714} 11/07/2021 03:30:13 - INFO - __main__ - Step 44790: {'lr': 0.00040377547302868235, 'samples': 8599680, 'steps': 44789, 'loss/train': 5.499805927276611} 11/07/2021 03:30:14 - INFO - __main__ - Step 44791: {'lr': 0.00040377128890404444, 'samples': 8599872, 'steps': 44790, 'loss/train': 1.661346435546875} 11/07/2021 03:30:15 - INFO - __main__ - Step 44792: {'lr': 0.00040376710471011967, 'samples': 8600064, 'steps': 44791, 'loss/train': 1.880703091621399} 11/07/2021 03:30:15 - INFO - __main__ - Step 44793: {'lr': 0.0004037629204469098, 'samples': 8600256, 'steps': 44792, 'loss/train': 1.6165194511413574} 11/07/2021 03:30:16 - INFO - __main__ - Step 44794: {'lr': 0.0004037587361144166, 'samples': 8600448, 'steps': 44793, 'loss/train': 1.1653132438659668} 11/07/2021 03:30:16 - INFO - __main__ - Step 44795: {'lr': 0.0004037545517126422, 'samples': 8600640, 'steps': 44794, 'loss/train': 1.2117013931274414} 11/07/2021 03:30:16 - INFO - __main__ - Step 44796: {'lr': 0.0004037503672415883, 'samples': 8600832, 'steps': 44795, 'loss/train': 1.3438433408737183} 11/07/2021 03:30:17 - INFO - __main__ - Step 44797: {'lr': 0.000403746182701257, 'samples': 8601024, 'steps': 44796, 'loss/train': 1.2403674125671387} 11/07/2021 03:30:18 - INFO - __main__ - Step 44798: {'lr': 0.0004037419980916499, 'samples': 8601216, 'steps': 44797, 'loss/train': 1.3725799322128296} 11/07/2021 03:30:18 - INFO - __main__ - Step 44799: {'lr': 0.00040373781341276904, 'samples': 8601408, 'steps': 44798, 'loss/train': 1.3333415985107422} 11/07/2021 03:30:18 - INFO - __main__ - Step 44800: {'lr': 0.00040373362866461633, 'samples': 8601600, 'steps': 44799, 'loss/train': 1.6242492198944092} 11/07/2021 03:30:19 - INFO - __main__ - Step 44801: {'lr': 0.0004037294438471936, 'samples': 8601792, 'steps': 44800, 'loss/train': 1.2437806129455566} 11/07/2021 03:30:20 - INFO - __main__ - Step 44802: {'lr': 0.00040372525896050285, 'samples': 8601984, 'steps': 44801, 'loss/train': 1.6958515644073486} 11/07/2021 03:30:20 - INFO - __main__ - Step 44803: {'lr': 0.0004037210740045457, 'samples': 8602176, 'steps': 44802, 'loss/train': 1.30084228515625} 11/07/2021 03:30:21 - INFO - __main__ - Step 44804: {'lr': 0.0004037168889793243, 'samples': 8602368, 'steps': 44803, 'loss/train': 1.8645272254943848} 11/07/2021 03:30:21 - INFO - __main__ - Step 44805: {'lr': 0.0004037127038848404, 'samples': 8602560, 'steps': 44804, 'loss/train': 1.5042856931686401} 11/07/2021 03:30:21 - INFO - __main__ - Step 44806: {'lr': 0.00040370851872109604, 'samples': 8602752, 'steps': 44805, 'loss/train': 1.5103329420089722} 11/07/2021 03:30:22 - INFO - __main__ - Step 44807: {'lr': 0.0004037043334880929, 'samples': 8602944, 'steps': 44806, 'loss/train': 1.3800625801086426} 11/07/2021 03:30:23 - INFO - __main__ - Step 44808: {'lr': 0.000403700148185833, 'samples': 8603136, 'steps': 44807, 'loss/train': 1.4422687292099} 11/07/2021 03:30:23 - INFO - __main__ - Step 44809: {'lr': 0.00040369596281431816, 'samples': 8603328, 'steps': 44808, 'loss/train': 1.1962133646011353} 11/07/2021 03:30:23 - INFO - __main__ - Step 44810: {'lr': 0.0004036917773735502, 'samples': 8603520, 'steps': 44809, 'loss/train': 1.5136005878448486} 11/07/2021 03:30:24 - INFO - __main__ - Step 44811: {'lr': 0.00040368759186353123, 'samples': 8603712, 'steps': 44810, 'loss/train': 1.7433708906173706} 11/07/2021 03:30:25 - INFO - __main__ - Step 44812: {'lr': 0.0004036834062842629, 'samples': 8603904, 'steps': 44811, 'loss/train': 1.8308992385864258} 11/07/2021 03:30:25 - INFO - __main__ - Step 44813: {'lr': 0.00040367922063574735, 'samples': 8604096, 'steps': 44812, 'loss/train': 1.7019158601760864} 11/07/2021 03:30:26 - INFO - __main__ - Step 44814: {'lr': 0.0004036750349179862, 'samples': 8604288, 'steps': 44813, 'loss/train': 1.2187209129333496} 11/07/2021 03:30:26 - INFO - __main__ - Step 44815: {'lr': 0.00040367084913098153, 'samples': 8604480, 'steps': 44814, 'loss/train': 1.3914854526519775} 11/07/2021 03:30:26 - INFO - __main__ - Step 44816: {'lr': 0.000403666663274735, 'samples': 8604672, 'steps': 44815, 'loss/train': 1.2438373565673828} 11/07/2021 03:30:27 - INFO - __main__ - Step 44817: {'lr': 0.0004036624773492488, 'samples': 8604864, 'steps': 44816, 'loss/train': 1.1171629428863525} 11/07/2021 03:30:28 - INFO - __main__ - Step 44818: {'lr': 0.0004036582913545246, 'samples': 8605056, 'steps': 44817, 'loss/train': 1.2431228160858154} 11/07/2021 03:30:28 - INFO - __main__ - Step 44819: {'lr': 0.0004036541052905643, 'samples': 8605248, 'steps': 44818, 'loss/train': 1.629530906677246} 11/07/2021 03:30:28 - INFO - __main__ - Step 44820: {'lr': 0.0004036499191573699, 'samples': 8605440, 'steps': 44819, 'loss/train': 2.1798417568206787} 11/07/2021 03:30:29 - INFO - __main__ - Step 44821: {'lr': 0.00040364573295494316, 'samples': 8605632, 'steps': 44820, 'loss/train': 1.426156997680664} 11/07/2021 03:30:30 - INFO - __main__ - Step 44822: {'lr': 0.00040364154668328604, 'samples': 8605824, 'steps': 44821, 'loss/train': 1.6849960088729858} 11/07/2021 03:30:30 - INFO - __main__ - Step 44823: {'lr': 0.0004036373603424004, 'samples': 8606016, 'steps': 44822, 'loss/train': 1.3584458827972412} 11/07/2021 03:30:30 - INFO - __main__ - Step 44824: {'lr': 0.00040363317393228814, 'samples': 8606208, 'steps': 44823, 'loss/train': 0.9686586260795593} 11/07/2021 03:30:31 - INFO - __main__ - Step 44825: {'lr': 0.00040362898745295117, 'samples': 8606400, 'steps': 44824, 'loss/train': 1.492126703262329} 11/07/2021 03:30:31 - INFO - __main__ - Step 44826: {'lr': 0.00040362480090439136, 'samples': 8606592, 'steps': 44825, 'loss/train': 1.4815083742141724} 11/07/2021 03:30:32 - INFO - __main__ - Step 44827: {'lr': 0.00040362061428661055, 'samples': 8606784, 'steps': 44826, 'loss/train': 1.5363249778747559} 11/07/2021 03:30:33 - INFO - __main__ - Step 44828: {'lr': 0.0004036164275996107, 'samples': 8606976, 'steps': 44827, 'loss/train': 1.1376135349273682} 11/07/2021 03:30:33 - INFO - __main__ - Step 44829: {'lr': 0.00040361224084339365, 'samples': 8607168, 'steps': 44828, 'loss/train': 1.5848695039749146} 11/07/2021 03:30:33 - INFO - __main__ - Step 44830: {'lr': 0.00040360805401796124, 'samples': 8607360, 'steps': 44829, 'loss/train': 1.6936755180358887} 11/07/2021 03:30:34 - INFO - __main__ - Step 44831: {'lr': 0.0004036038671233154, 'samples': 8607552, 'steps': 44830, 'loss/train': 1.593983769416809} 11/07/2021 03:30:34 - INFO - __main__ - Step 44832: {'lr': 0.00040359968015945814, 'samples': 8607744, 'steps': 44831, 'loss/train': 1.419744849205017} 11/07/2021 03:30:35 - INFO - __main__ - Step 44833: {'lr': 0.0004035954931263912, 'samples': 8607936, 'steps': 44832, 'loss/train': 1.3327523469924927} 11/07/2021 03:30:35 - INFO - __main__ - Step 44834: {'lr': 0.00040359130602411644, 'samples': 8608128, 'steps': 44833, 'loss/train': 1.508933186531067} 11/07/2021 03:30:36 - INFO - __main__ - Step 44835: {'lr': 0.0004035871188526358, 'samples': 8608320, 'steps': 44834, 'loss/train': 1.6264845132827759} 11/07/2021 03:30:36 - INFO - __main__ - Step 44836: {'lr': 0.00040358293161195125, 'samples': 8608512, 'steps': 44835, 'loss/train': 1.378198266029358} 11/07/2021 03:30:37 - INFO - __main__ - Step 44837: {'lr': 0.0004035787443020645, 'samples': 8608704, 'steps': 44836, 'loss/train': 1.2456293106079102} 11/07/2021 03:30:37 - INFO - __main__ - Step 44838: {'lr': 0.00040357455692297765, 'samples': 8608896, 'steps': 44837, 'loss/train': 1.5283297300338745} 11/07/2021 03:30:38 - INFO - __main__ - Step 44839: {'lr': 0.0004035703694746924, 'samples': 8609088, 'steps': 44838, 'loss/train': 1.4481676816940308} 11/07/2021 03:30:38 - INFO - __main__ - Step 44840: {'lr': 0.0004035661819572108, 'samples': 8609280, 'steps': 44839, 'loss/train': 1.4853111505508423} 11/07/2021 03:30:39 - INFO - __main__ - Step 44841: {'lr': 0.0004035619943705345, 'samples': 8609472, 'steps': 44840, 'loss/train': 1.4644232988357544} 11/07/2021 03:30:39 - INFO - __main__ - Step 44842: {'lr': 0.0004035578067146657, 'samples': 8609664, 'steps': 44841, 'loss/train': 0.952649712562561} 11/07/2021 03:30:40 - INFO - __main__ - Step 44843: {'lr': 0.000403553618989606, 'samples': 8609856, 'steps': 44842, 'loss/train': 1.4051940441131592} 11/07/2021 03:30:40 - INFO - __main__ - Step 44844: {'lr': 0.0004035494311953575, 'samples': 8610048, 'steps': 44843, 'loss/train': 1.7837588787078857} 11/07/2021 03:30:41 - INFO - __main__ - Step 44845: {'lr': 0.0004035452433319219, 'samples': 8610240, 'steps': 44844, 'loss/train': 1.64267897605896} 11/07/2021 03:30:41 - INFO - __main__ - Step 44846: {'lr': 0.0004035410553993012, 'samples': 8610432, 'steps': 44845, 'loss/train': 1.8207674026489258} 11/07/2021 03:30:41 - INFO - __main__ - Step 44847: {'lr': 0.00040353686739749733, 'samples': 8610624, 'steps': 44846, 'loss/train': 1.419385552406311} 11/07/2021 03:30:42 - INFO - __main__ - Step 44848: {'lr': 0.0004035326793265121, 'samples': 8610816, 'steps': 44847, 'loss/train': 1.539034366607666} 11/07/2021 03:30:43 - INFO - __main__ - Step 44849: {'lr': 0.0004035284911863474, 'samples': 8611008, 'steps': 44848, 'loss/train': 1.7744563817977905} 11/07/2021 03:30:43 - INFO - __main__ - Step 44850: {'lr': 0.00040352430297700513, 'samples': 8611200, 'steps': 44849, 'loss/train': 1.3370999097824097} 11/07/2021 03:30:43 - INFO - __main__ - Step 44851: {'lr': 0.00040352011469848713, 'samples': 8611392, 'steps': 44850, 'loss/train': 0.7737088203430176} 11/07/2021 03:30:44 - INFO - __main__ - Step 44852: {'lr': 0.00040351592635079535, 'samples': 8611584, 'steps': 44851, 'loss/train': 1.7571678161621094} 11/07/2021 03:30:45 - INFO - __main__ - Step 44853: {'lr': 0.0004035117379339318, 'samples': 8611776, 'steps': 44852, 'loss/train': 1.7419499158859253} 11/07/2021 03:30:45 - INFO - __main__ - Step 44854: {'lr': 0.00040350754944789815, 'samples': 8611968, 'steps': 44853, 'loss/train': 1.3507704734802246} 11/07/2021 03:30:45 - INFO - __main__ - Step 44855: {'lr': 0.0004035033608926963, 'samples': 8612160, 'steps': 44854, 'loss/train': 1.4348763227462769} 11/07/2021 03:30:46 - INFO - __main__ - Step 44856: {'lr': 0.0004034991722683282, 'samples': 8612352, 'steps': 44855, 'loss/train': 1.1511130332946777} 11/07/2021 03:30:46 - INFO - __main__ - Step 44857: {'lr': 0.0004034949835747958, 'samples': 8612544, 'steps': 44856, 'loss/train': 1.2454063892364502} 11/07/2021 03:30:47 - INFO - __main__ - Step 44858: {'lr': 0.00040349079481210096, 'samples': 8612736, 'steps': 44857, 'loss/train': 1.4043110609054565} 11/07/2021 03:30:48 - INFO - __main__ - Step 44859: {'lr': 0.00040348660598024547, 'samples': 8612928, 'steps': 44858, 'loss/train': 1.4020954370498657} 11/07/2021 03:30:48 - INFO - __main__ - Step 44860: {'lr': 0.0004034824170792313, 'samples': 8613120, 'steps': 44859, 'loss/train': 0.940532386302948} 11/07/2021 03:30:48 - INFO - __main__ - Step 44861: {'lr': 0.0004034782281090603, 'samples': 8613312, 'steps': 44860, 'loss/train': 1.7690776586532593} 11/07/2021 03:30:49 - INFO - __main__ - Step 44862: {'lr': 0.00040347403906973445, 'samples': 8613504, 'steps': 44861, 'loss/train': 1.2716156244277954} 11/07/2021 03:30:49 - INFO - __main__ - Step 44863: {'lr': 0.0004034698499612555, 'samples': 8613696, 'steps': 44862, 'loss/train': 1.0258671045303345} 11/07/2021 03:30:50 - INFO - __main__ - Step 44864: {'lr': 0.00040346566078362545, 'samples': 8613888, 'steps': 44863, 'loss/train': 1.5426487922668457} 11/07/2021 03:30:50 - INFO - __main__ - Step 44865: {'lr': 0.0004034614715368461, 'samples': 8614080, 'steps': 44864, 'loss/train': 1.569562554359436} 11/07/2021 03:30:51 - INFO - __main__ - Step 44866: {'lr': 0.0004034572822209194, 'samples': 8614272, 'steps': 44865, 'loss/train': 1.3128806352615356} 11/07/2021 03:30:51 - INFO - __main__ - Step 44867: {'lr': 0.00040345309283584726, 'samples': 8614464, 'steps': 44866, 'loss/train': 1.4803099632263184} 11/07/2021 03:30:51 - INFO - __main__ - Step 44868: {'lr': 0.0004034489033816314, 'samples': 8614656, 'steps': 44867, 'loss/train': 1.3495128154754639} 11/07/2021 03:30:52 - INFO - __main__ - Step 44869: {'lr': 0.00040344471385827396, 'samples': 8614848, 'steps': 44868, 'loss/train': 1.1803743839263916} 11/07/2021 03:30:53 - INFO - __main__ - Step 44870: {'lr': 0.00040344052426577665, 'samples': 8615040, 'steps': 44869, 'loss/train': 1.2572747468948364} 11/07/2021 03:30:53 - INFO - __main__ - Step 44871: {'lr': 0.0004034363346041414, 'samples': 8615232, 'steps': 44870, 'loss/train': 1.3079253435134888} 11/07/2021 03:30:53 - INFO - __main__ - Step 44872: {'lr': 0.0004034321448733701, 'samples': 8615424, 'steps': 44871, 'loss/train': 1.5915842056274414} 11/07/2021 03:30:54 - INFO - __main__ - Step 44873: {'lr': 0.00040342795507346464, 'samples': 8615616, 'steps': 44872, 'loss/train': 1.5641024112701416} 11/07/2021 03:30:55 - INFO - __main__ - Step 44874: {'lr': 0.000403423765204427, 'samples': 8615808, 'steps': 44873, 'loss/train': 1.731332540512085} 11/07/2021 03:30:55 - INFO - __main__ - Step 44875: {'lr': 0.0004034195752662589, 'samples': 8616000, 'steps': 44874, 'loss/train': 1.3966073989868164} 11/07/2021 03:30:55 - INFO - __main__ - Step 44876: {'lr': 0.00040341538525896233, 'samples': 8616192, 'steps': 44875, 'loss/train': 1.4717856645584106} 11/07/2021 03:30:56 - INFO - __main__ - Step 44877: {'lr': 0.0004034111951825391, 'samples': 8616384, 'steps': 44876, 'loss/train': 1.5354030132293701} 11/07/2021 03:30:56 - INFO - __main__ - Step 44878: {'lr': 0.00040340700503699116, 'samples': 8616576, 'steps': 44877, 'loss/train': 1.6030645370483398} 11/07/2021 03:30:57 - INFO - __main__ - Step 44879: {'lr': 0.0004034028148223204, 'samples': 8616768, 'steps': 44878, 'loss/train': 1.5579181909561157} 11/07/2021 03:30:58 - INFO - __main__ - Step 44880: {'lr': 0.0004033986245385288, 'samples': 8616960, 'steps': 44879, 'loss/train': 1.655372977256775} 11/07/2021 03:30:58 - INFO - __main__ - Step 44881: {'lr': 0.0004033944341856181, 'samples': 8617152, 'steps': 44880, 'loss/train': 1.4386367797851562} 11/07/2021 03:30:58 - INFO - __main__ - Step 44882: {'lr': 0.00040339024376359015, 'samples': 8617344, 'steps': 44881, 'loss/train': 0.927916407585144} 11/07/2021 03:30:59 - INFO - __main__ - Step 44883: {'lr': 0.000403386053272447, 'samples': 8617536, 'steps': 44882, 'loss/train': 1.5074671506881714} 11/07/2021 03:31:00 - INFO - __main__ - Step 44884: {'lr': 0.0004033818627121904, 'samples': 8617728, 'steps': 44883, 'loss/train': 1.0531666278839111} 11/07/2021 03:31:00 - INFO - __main__ - Step 44885: {'lr': 0.00040337767208282235, 'samples': 8617920, 'steps': 44884, 'loss/train': 1.7778396606445312} 11/07/2021 03:31:00 - INFO - __main__ - Step 44886: {'lr': 0.00040337348138434466, 'samples': 8618112, 'steps': 44885, 'loss/train': 0.8337576389312744} 11/07/2021 03:31:01 - INFO - __main__ - Step 44887: {'lr': 0.00040336929061675933, 'samples': 8618304, 'steps': 44886, 'loss/train': 1.292196273803711} 11/07/2021 03:31:01 - INFO - __main__ - Step 44888: {'lr': 0.0004033650997800681, 'samples': 8618496, 'steps': 44887, 'loss/train': 1.5180169343948364} 11/07/2021 03:31:02 - INFO - __main__ - Step 44889: {'lr': 0.00040336090887427284, 'samples': 8618688, 'steps': 44888, 'loss/train': 1.934960126876831} 11/07/2021 03:31:02 - INFO - __main__ - Step 44890: {'lr': 0.00040335671789937564, 'samples': 8618880, 'steps': 44889, 'loss/train': 1.5152138471603394} 11/07/2021 03:31:03 - INFO - __main__ - Step 44891: {'lr': 0.00040335252685537817, 'samples': 8619072, 'steps': 44890, 'loss/train': 0.4493248760700226} 11/07/2021 03:31:03 - INFO - __main__ - Step 44892: {'lr': 0.0004033483357422825, 'samples': 8619264, 'steps': 44891, 'loss/train': 1.8648028373718262} 11/07/2021 03:31:03 - INFO - __main__ - Step 44893: {'lr': 0.0004033441445600904, 'samples': 8619456, 'steps': 44892, 'loss/train': 1.6660622358322144} 11/07/2021 03:31:04 - INFO - __main__ - Step 44894: {'lr': 0.0004033399533088038, 'samples': 8619648, 'steps': 44893, 'loss/train': 1.3583667278289795} 11/07/2021 03:31:05 - INFO - __main__ - Step 44895: {'lr': 0.00040333576198842456, 'samples': 8619840, 'steps': 44894, 'loss/train': 1.1685876846313477} 11/07/2021 03:31:05 - INFO - __main__ - Step 44896: {'lr': 0.00040333157059895463, 'samples': 8620032, 'steps': 44895, 'loss/train': 1.2783938646316528} 11/07/2021 03:31:06 - INFO - __main__ - Step 44897: {'lr': 0.0004033273791403959, 'samples': 8620224, 'steps': 44896, 'loss/train': 1.4491678476333618} 11/07/2021 03:31:06 - INFO - __main__ - Step 44898: {'lr': 0.0004033231876127501, 'samples': 8620416, 'steps': 44897, 'loss/train': 1.4834508895874023} 11/07/2021 03:31:06 - INFO - __main__ - Step 44899: {'lr': 0.00040331899601601934, 'samples': 8620608, 'steps': 44898, 'loss/train': 0.9562569856643677} 11/07/2021 03:31:07 - INFO - __main__ - Step 44900: {'lr': 0.0004033148043502054, 'samples': 8620800, 'steps': 44899, 'loss/train': 1.7440876960754395} 11/07/2021 03:31:08 - INFO - __main__ - Step 44901: {'lr': 0.00040331061261531014, 'samples': 8620992, 'steps': 44900, 'loss/train': 1.2397050857543945} 11/07/2021 03:31:08 - INFO - __main__ - Step 44902: {'lr': 0.0004033064208113355, 'samples': 8621184, 'steps': 44901, 'loss/train': 1.4030208587646484} 11/07/2021 03:31:08 - INFO - __main__ - Step 44903: {'lr': 0.00040330222893828334, 'samples': 8621376, 'steps': 44902, 'loss/train': 1.8505945205688477} 11/07/2021 03:31:09 - INFO - __main__ - Step 44904: {'lr': 0.0004032980369961555, 'samples': 8621568, 'steps': 44903, 'loss/train': 1.1294523477554321} 11/07/2021 03:31:10 - INFO - __main__ - Step 44905: {'lr': 0.000403293844984954, 'samples': 8621760, 'steps': 44904, 'loss/train': 0.1676974892616272} 11/07/2021 03:31:10 - INFO - __main__ - Step 44906: {'lr': 0.00040328965290468066, 'samples': 8621952, 'steps': 44905, 'loss/train': 1.710390329360962} 11/07/2021 03:31:10 - INFO - __main__ - Step 44907: {'lr': 0.00040328546075533745, 'samples': 8622144, 'steps': 44906, 'loss/train': 1.4258091449737549} 11/07/2021 03:31:11 - INFO - __main__ - Step 44908: {'lr': 0.00040328126853692606, 'samples': 8622336, 'steps': 44907, 'loss/train': 1.154900312423706} 11/07/2021 03:31:11 - INFO - __main__ - Step 44909: {'lr': 0.00040327707624944855, 'samples': 8622528, 'steps': 44908, 'loss/train': 1.5896992683410645} 11/07/2021 03:31:12 - INFO - __main__ - Step 44910: {'lr': 0.0004032728838929067, 'samples': 8622720, 'steps': 44909, 'loss/train': 0.9224819540977478} 11/07/2021 03:31:13 - INFO - __main__ - Step 44911: {'lr': 0.0004032686914673025, 'samples': 8622912, 'steps': 44910, 'loss/train': 1.7258622646331787} 11/07/2021 03:31:13 - INFO - __main__ - Step 44912: {'lr': 0.00040326449897263775, 'samples': 8623104, 'steps': 44911, 'loss/train': 1.3852179050445557} 11/07/2021 03:31:13 - INFO - __main__ - Step 44913: {'lr': 0.0004032603064089144, 'samples': 8623296, 'steps': 44912, 'loss/train': 0.12013912200927734} 11/07/2021 03:31:14 - INFO - __main__ - Step 44914: {'lr': 0.00040325611377613435, 'samples': 8623488, 'steps': 44913, 'loss/train': 2.0965540409088135} 11/07/2021 03:31:15 - INFO - __main__ - Step 44915: {'lr': 0.00040325192107429944, 'samples': 8623680, 'steps': 44914, 'loss/train': 1.8563235998153687} 11/07/2021 03:31:15 - INFO - __main__ - Step 44916: {'lr': 0.00040324772830341163, 'samples': 8623872, 'steps': 44915, 'loss/train': 1.5661081075668335} 11/07/2021 03:31:15 - INFO - __main__ - Step 44917: {'lr': 0.0004032435354634726, 'samples': 8624064, 'steps': 44916, 'loss/train': 1.2690373659133911} 11/07/2021 03:31:16 - INFO - __main__ - Step 44918: {'lr': 0.00040323934255448457, 'samples': 8624256, 'steps': 44917, 'loss/train': 1.1036372184753418} 11/07/2021 03:31:16 - INFO - __main__ - Step 44919: {'lr': 0.00040323514957644915, 'samples': 8624448, 'steps': 44918, 'loss/train': 1.6021407842636108} 11/07/2021 03:31:17 - INFO - __main__ - Step 44920: {'lr': 0.00040323095652936843, 'samples': 8624640, 'steps': 44919, 'loss/train': 1.4129055738449097} 11/07/2021 03:31:17 - INFO - __main__ - Step 44921: {'lr': 0.00040322676341324415, 'samples': 8624832, 'steps': 44920, 'loss/train': 1.3669313192367554} 11/07/2021 03:31:18 - INFO - __main__ - Step 44922: {'lr': 0.0004032225702280783, 'samples': 8625024, 'steps': 44921, 'loss/train': 1.6862813234329224} 11/07/2021 03:31:18 - INFO - __main__ - Step 44923: {'lr': 0.00040321837697387264, 'samples': 8625216, 'steps': 44922, 'loss/train': 1.9148911237716675} 11/07/2021 03:31:18 - INFO - __main__ - Step 44924: {'lr': 0.00040321418365062915, 'samples': 8625408, 'steps': 44923, 'loss/train': 0.8510093688964844} 11/07/2021 03:31:19 - INFO - __main__ - Step 44925: {'lr': 0.00040320999025834973, 'samples': 8625600, 'steps': 44924, 'loss/train': 1.509555459022522} 11/07/2021 03:31:20 - INFO - __main__ - Step 44926: {'lr': 0.0004032057967970363, 'samples': 8625792, 'steps': 44925, 'loss/train': 1.7970837354660034} 11/07/2021 03:31:20 - INFO - __main__ - Step 44927: {'lr': 0.0004032016032666907, 'samples': 8625984, 'steps': 44926, 'loss/train': 1.5563510656356812} 11/07/2021 03:31:21 - INFO - __main__ - Step 44928: {'lr': 0.00040319740966731477, 'samples': 8626176, 'steps': 44927, 'loss/train': 1.2547262907028198} 11/07/2021 03:31:21 - INFO - __main__ - Step 44929: {'lr': 0.0004031932159989105, 'samples': 8626368, 'steps': 44928, 'loss/train': 1.417038917541504} 11/07/2021 03:31:21 - INFO - __main__ - Step 44930: {'lr': 0.0004031890222614797, 'samples': 8626560, 'steps': 44929, 'loss/train': 1.4200637340545654} 11/07/2021 03:31:22 - INFO - __main__ - Step 44931: {'lr': 0.0004031848284550243, 'samples': 8626752, 'steps': 44930, 'loss/train': 1.3945986032485962} 11/07/2021 03:31:23 - INFO - __main__ - Step 44932: {'lr': 0.0004031806345795462, 'samples': 8626944, 'steps': 44931, 'loss/train': 0.8418196439743042} 11/07/2021 03:31:23 - INFO - __main__ - Step 44933: {'lr': 0.0004031764406350472, 'samples': 8627136, 'steps': 44932, 'loss/train': 1.5948578119277954} 11/07/2021 03:31:23 - INFO - __main__ - Step 44934: {'lr': 0.0004031722466215293, 'samples': 8627328, 'steps': 44933, 'loss/train': 1.464234709739685} 11/07/2021 03:31:24 - INFO - __main__ - Step 44935: {'lr': 0.00040316805253899434, 'samples': 8627520, 'steps': 44934, 'loss/train': 0.7063742280006409} 11/07/2021 03:31:25 - INFO - __main__ - Step 44936: {'lr': 0.0004031638583874443, 'samples': 8627712, 'steps': 44935, 'loss/train': 1.4003729820251465} 11/07/2021 03:31:25 - INFO - __main__ - Step 44937: {'lr': 0.0004031596641668809, 'samples': 8627904, 'steps': 44936, 'loss/train': 1.2112001180648804} 11/07/2021 03:31:26 - INFO - __main__ - Step 44938: {'lr': 0.0004031554698773061, 'samples': 8628096, 'steps': 44937, 'loss/train': 1.67475426197052} 11/07/2021 03:31:26 - INFO - __main__ - Step 44939: {'lr': 0.0004031512755187219, 'samples': 8628288, 'steps': 44938, 'loss/train': 1.6906245946884155} 11/07/2021 03:31:26 - INFO - __main__ - Step 44940: {'lr': 0.00040314708109113003, 'samples': 8628480, 'steps': 44939, 'loss/train': 1.8159323930740356} 11/07/2021 03:31:28 - INFO - __main__ - Step 44941: {'lr': 0.0004031428865945325, 'samples': 8628672, 'steps': 44940, 'loss/train': 1.7222596406936646} 11/07/2021 03:31:28 - INFO - __main__ - Step 44942: {'lr': 0.0004031386920289311, 'samples': 8628864, 'steps': 44941, 'loss/train': 0.9303457736968994} 11/07/2021 03:31:28 - INFO - __main__ - Step 44943: {'lr': 0.0004031344973943278, 'samples': 8629056, 'steps': 44942, 'loss/train': 1.8656607866287231} 11/07/2021 03:31:29 - INFO - __main__ - Step 44944: {'lr': 0.00040313030269072445, 'samples': 8629248, 'steps': 44943, 'loss/train': 0.9886725544929504} 11/07/2021 03:31:29 - INFO - __main__ - Step 44945: {'lr': 0.00040312610791812286, 'samples': 8629440, 'steps': 44944, 'loss/train': 1.2826176881790161} 11/07/2021 03:31:30 - INFO - __main__ - Step 44946: {'lr': 0.00040312191307652513, 'samples': 8629632, 'steps': 44945, 'loss/train': 0.7697887420654297} 11/07/2021 03:31:30 - INFO - __main__ - Step 44947: {'lr': 0.000403117718165933, 'samples': 8629824, 'steps': 44946, 'loss/train': 0.953285276889801} 11/07/2021 03:31:31 - INFO - __main__ - Step 44948: {'lr': 0.00040311352318634844, 'samples': 8630016, 'steps': 44947, 'loss/train': 5.663651466369629} 11/07/2021 03:31:31 - INFO - __main__ - Step 44949: {'lr': 0.00040310932813777316, 'samples': 8630208, 'steps': 44948, 'loss/train': 1.1987284421920776} 11/07/2021 03:31:31 - INFO - __main__ - Step 44950: {'lr': 0.0004031051330202092, 'samples': 8630400, 'steps': 44949, 'loss/train': 1.4727267026901245} 11/07/2021 03:31:32 - INFO - __main__ - Step 44951: {'lr': 0.00040310093783365854, 'samples': 8630592, 'steps': 44950, 'loss/train': 1.728103756904602} 11/07/2021 03:31:33 - INFO - __main__ - Step 44952: {'lr': 0.0004030967425781229, 'samples': 8630784, 'steps': 44951, 'loss/train': 1.317302942276001} 11/07/2021 03:31:33 - INFO - __main__ - Step 44953: {'lr': 0.0004030925472536042, 'samples': 8630976, 'steps': 44952, 'loss/train': 1.6258738040924072} 11/07/2021 03:31:34 - INFO - __main__ - Step 44954: {'lr': 0.0004030883518601044, 'samples': 8631168, 'steps': 44953, 'loss/train': 1.1118228435516357} 11/07/2021 03:31:34 - INFO - __main__ - Step 44955: {'lr': 0.0004030841563976254, 'samples': 8631360, 'steps': 44954, 'loss/train': 1.2092745304107666} 11/07/2021 03:31:35 - INFO - __main__ - Step 44956: {'lr': 0.00040307996086616895, 'samples': 8631552, 'steps': 44955, 'loss/train': 1.5450530052185059} 11/07/2021 03:31:35 - INFO - __main__ - Step 44957: {'lr': 0.00040307576526573704, 'samples': 8631744, 'steps': 44956, 'loss/train': 1.7838172912597656} 11/07/2021 03:31:36 - INFO - __main__ - Step 44958: {'lr': 0.00040307156959633154, 'samples': 8631936, 'steps': 44957, 'loss/train': 1.2368890047073364} 11/07/2021 03:31:36 - INFO - __main__ - Step 44959: {'lr': 0.00040306737385795437, 'samples': 8632128, 'steps': 44958, 'loss/train': 1.431859016418457} 11/07/2021 03:31:36 - INFO - __main__ - Step 44960: {'lr': 0.00040306317805060746, 'samples': 8632320, 'steps': 44959, 'loss/train': 1.3809988498687744} 11/07/2021 03:31:37 - INFO - __main__ - Step 44961: {'lr': 0.0004030589821742926, 'samples': 8632512, 'steps': 44960, 'loss/train': 1.3988364934921265} 11/07/2021 03:31:38 - INFO - __main__ - Step 44962: {'lr': 0.00040305478622901177, 'samples': 8632704, 'steps': 44961, 'loss/train': 1.5435221195220947} 11/07/2021 03:31:38 - INFO - __main__ - Step 44963: {'lr': 0.0004030505902147668, 'samples': 8632896, 'steps': 44962, 'loss/train': 1.8923532962799072} 11/07/2021 03:31:38 - INFO - __main__ - Step 44964: {'lr': 0.00040304639413155953, 'samples': 8633088, 'steps': 44963, 'loss/train': 1.4311248064041138} 11/07/2021 03:31:39 - INFO - __main__ - Step 44965: {'lr': 0.0004030421979793919, 'samples': 8633280, 'steps': 44964, 'loss/train': 1.5051273107528687} 11/07/2021 03:31:40 - INFO - __main__ - Step 44966: {'lr': 0.0004030380017582659, 'samples': 8633472, 'steps': 44965, 'loss/train': 1.7004728317260742} 11/07/2021 03:31:40 - INFO - __main__ - Step 44967: {'lr': 0.0004030338054681833, 'samples': 8633664, 'steps': 44966, 'loss/train': 1.6326181888580322} 11/07/2021 03:31:40 - INFO - __main__ - Step 44968: {'lr': 0.0004030296091091461, 'samples': 8633856, 'steps': 44967, 'loss/train': 1.38742995262146} 11/07/2021 03:31:41 - INFO - __main__ - Step 44969: {'lr': 0.000403025412681156, 'samples': 8634048, 'steps': 44968, 'loss/train': 1.190015196800232} 11/07/2021 03:31:41 - INFO - __main__ - Step 44970: {'lr': 0.00040302121618421505, 'samples': 8634240, 'steps': 44969, 'loss/train': 1.7225620746612549} 11/07/2021 03:31:42 - INFO - __main__ - Step 44971: {'lr': 0.0004030170196183252, 'samples': 8634432, 'steps': 44970, 'loss/train': 1.1247191429138184} 11/07/2021 03:31:42 - INFO - __main__ - Step 44972: {'lr': 0.00040301282298348806, 'samples': 8634624, 'steps': 44971, 'loss/train': 1.6943212747573853} 11/07/2021 03:31:43 - INFO - __main__ - Step 44973: {'lr': 0.0004030086262797058, 'samples': 8634816, 'steps': 44972, 'loss/train': 1.40690279006958} 11/07/2021 03:31:43 - INFO - __main__ - Step 44974: {'lr': 0.0004030044295069803, 'samples': 8635008, 'steps': 44973, 'loss/train': 1.1578707695007324} 11/07/2021 03:31:43 - INFO - __main__ - Step 44975: {'lr': 0.00040300023266531327, 'samples': 8635200, 'steps': 44974, 'loss/train': 1.1108335256576538} 11/07/2021 03:31:45 - INFO - __main__ - Step 44976: {'lr': 0.0004029960357547067, 'samples': 8635392, 'steps': 44975, 'loss/train': 1.3263057470321655} 11/07/2021 03:31:45 - INFO - __main__ - Step 44977: {'lr': 0.0004029918387751625, 'samples': 8635584, 'steps': 44976, 'loss/train': 1.6945290565490723} 11/07/2021 03:31:45 - INFO - __main__ - Step 44978: {'lr': 0.00040298764172668253, 'samples': 8635776, 'steps': 44977, 'loss/train': 1.4941730499267578} 11/07/2021 03:31:46 - INFO - __main__ - Step 44979: {'lr': 0.00040298344460926866, 'samples': 8635968, 'steps': 44978, 'loss/train': 1.4797956943511963} 11/07/2021 03:31:46 - INFO - __main__ - Step 44980: {'lr': 0.0004029792474229228, 'samples': 8636160, 'steps': 44979, 'loss/train': 1.5584986209869385} 11/07/2021 03:31:46 - INFO - __main__ - Step 44981: {'lr': 0.00040297505016764697, 'samples': 8636352, 'steps': 44980, 'loss/train': 1.4460642337799072} 11/07/2021 03:31:47 - INFO - __main__ - Step 44982: {'lr': 0.00040297085284344284, 'samples': 8636544, 'steps': 44981, 'loss/train': 1.5774402618408203} 11/07/2021 03:31:48 - INFO - __main__ - Step 44983: {'lr': 0.0004029666554503124, 'samples': 8636736, 'steps': 44982, 'loss/train': 1.447740077972412} 11/07/2021 03:31:48 - INFO - __main__ - Step 44984: {'lr': 0.0004029624579882576, 'samples': 8636928, 'steps': 44983, 'loss/train': 2.202698230743408} 11/07/2021 03:31:48 - INFO - __main__ - Step 44985: {'lr': 0.00040295826045728023, 'samples': 8637120, 'steps': 44984, 'loss/train': 0.47387027740478516} 11/07/2021 03:31:49 - INFO - __main__ - Step 44986: {'lr': 0.00040295406285738224, 'samples': 8637312, 'steps': 44985, 'loss/train': 1.4053353071212769} 11/07/2021 03:31:50 - INFO - __main__ - Step 44987: {'lr': 0.00040294986518856553, 'samples': 8637504, 'steps': 44986, 'loss/train': 1.6060630083084106} 11/07/2021 03:31:50 - INFO - __main__ - Step 44988: {'lr': 0.00040294566745083195, 'samples': 8637696, 'steps': 44987, 'loss/train': 1.1187083721160889} 11/07/2021 03:31:51 - INFO - __main__ - Step 44989: {'lr': 0.00040294146964418344, 'samples': 8637888, 'steps': 44988, 'loss/train': 1.3277955055236816} 11/07/2021 03:31:51 - INFO - __main__ - Step 44990: {'lr': 0.00040293727176862184, 'samples': 8638080, 'steps': 44989, 'loss/train': 1.8343409299850464} 11/07/2021 03:31:51 - INFO - __main__ - Step 44991: {'lr': 0.000402933073824149, 'samples': 8638272, 'steps': 44990, 'loss/train': 0.9720051288604736} 11/07/2021 03:31:52 - INFO - __main__ - Step 44992: {'lr': 0.000402928875810767, 'samples': 8638464, 'steps': 44991, 'loss/train': 1.4146634340286255} 11/07/2021 03:31:53 - INFO - __main__ - Step 44993: {'lr': 0.00040292467772847754, 'samples': 8638656, 'steps': 44992, 'loss/train': 1.0446237325668335} 11/07/2021 03:31:53 - INFO - __main__ - Step 44994: {'lr': 0.00040292047957728264, 'samples': 8638848, 'steps': 44993, 'loss/train': 1.6617484092712402} 11/07/2021 03:31:53 - INFO - __main__ - Step 44995: {'lr': 0.00040291628135718404, 'samples': 8639040, 'steps': 44994, 'loss/train': 1.681424617767334} 11/07/2021 03:31:54 - INFO - __main__ - Step 44996: {'lr': 0.0004029120830681838, 'samples': 8639232, 'steps': 44995, 'loss/train': 1.5093427896499634} 11/07/2021 03:31:55 - INFO - __main__ - Step 44997: {'lr': 0.0004029078847102837, 'samples': 8639424, 'steps': 44996, 'loss/train': 1.4904605150222778} 11/07/2021 03:31:55 - INFO - __main__ - Step 44998: {'lr': 0.00040290368628348564, 'samples': 8639616, 'steps': 44997, 'loss/train': 0.495597243309021} 11/07/2021 03:31:56 - INFO - __main__ - Step 44999: {'lr': 0.00040289948778779157, 'samples': 8639808, 'steps': 44998, 'loss/train': 1.3495784997940063} 11/07/2021 03:31:56 - INFO - __main__ - Step 45000: {'lr': 0.00040289528922320334, 'samples': 8640000, 'steps': 44999, 'loss/train': 0.798037052154541} 11/07/2021 03:31:56 - INFO - __main__ - Evaluating and saving model checkpoint 11/07/2021 03:35:09 - INFO - __main__ - Step 45000: {'loss/eval': 1.406919240951538, 'perplexity': 4.0833563804626465} 11/07/2021 03:35:20 - WARNING - huggingface_hub.repository - Adding files tracked by Git LFS: ['wandb/run-20211106_211610-dtkf2u0m/run-dtkf2u0m.wandb']. This may take a bit of time if the files are large. 11/07/2021 03:35:24 - WARNING - huggingface_hub.repository - Several commits (3) will be pushed upstream. 11/07/2021 03:35:24 - WARNING - huggingface_hub.repository - The progress bars may be unreliable. 11/07/2021 03:35:49 - WARNING - huggingface_hub.repository - To https://huggingface.co/lvwerra/codeparrot-small d425b2d..5ed0776 proud-haze-135 -> proud-haze-135 11/07/2021 03:35:50 - INFO - __main__ - Step 45001: {'lr': 0.00040289109058972285, 'samples': 8640192, 'steps': 45000, 'loss/train': 1.378940463066101} 11/07/2021 03:35:51 - INFO - __main__ - Step 45002: {'lr': 0.000402886891887352, 'samples': 8640384, 'steps': 45001, 'loss/train': 0.7639281749725342} 11/07/2021 03:35:52 - INFO - __main__ - Step 45003: {'lr': 0.0004028826931160927, 'samples': 8640576, 'steps': 45002, 'loss/train': 1.7666077613830566} 11/07/2021 03:35:52 - INFO - __main__ - Step 45004: {'lr': 0.0004028784942759468, 'samples': 8640768, 'steps': 45003, 'loss/train': 1.9681998491287231} 11/07/2021 03:35:53 - INFO - __main__ - Step 45005: {'lr': 0.0004028742953669162, 'samples': 8640960, 'steps': 45004, 'loss/train': 1.333229899406433} 11/07/2021 03:35:53 - INFO - __main__ - Step 45006: {'lr': 0.0004028700963890028, 'samples': 8641152, 'steps': 45005, 'loss/train': 1.756109356880188} 11/07/2021 03:35:53 - INFO - __main__ - Step 45007: {'lr': 0.0004028658973422085, 'samples': 8641344, 'steps': 45006, 'loss/train': 1.7226879596710205} 11/07/2021 03:35:54 - INFO - __main__ - Step 45008: {'lr': 0.0004028616982265352, 'samples': 8641536, 'steps': 45007, 'loss/train': 1.0147008895874023} 11/07/2021 03:35:55 - INFO - __main__ - Step 45009: {'lr': 0.0004028574990419848, 'samples': 8641728, 'steps': 45008, 'loss/train': 1.5768661499023438} 11/07/2021 03:35:55 - INFO - __main__ - Step 45010: {'lr': 0.0004028532997885591, 'samples': 8641920, 'steps': 45009, 'loss/train': 1.1671724319458008} 11/07/2021 03:35:55 - INFO - __main__ - Step 45011: {'lr': 0.0004028491004662601, 'samples': 8642112, 'steps': 45010, 'loss/train': 1.2716879844665527} 11/07/2021 03:35:56 - INFO - __main__ - Step 45012: {'lr': 0.0004028449010750896, 'samples': 8642304, 'steps': 45011, 'loss/train': 1.6644291877746582} 11/07/2021 03:35:57 - INFO - __main__ - Step 45013: {'lr': 0.0004028407016150496, 'samples': 8642496, 'steps': 45012, 'loss/train': 1.2869713306427002} 11/07/2021 03:35:57 - INFO - __main__ - Step 45014: {'lr': 0.000402836502086142, 'samples': 8642688, 'steps': 45013, 'loss/train': 1.0818111896514893} 11/07/2021 03:35:57 - INFO - __main__ - Step 45015: {'lr': 0.00040283230248836855, 'samples': 8642880, 'steps': 45014, 'loss/train': 1.1113263368606567} 11/07/2021 03:35:58 - INFO - __main__ - Step 45016: {'lr': 0.0004028281028217312, 'samples': 8643072, 'steps': 45015, 'loss/train': 1.3489185571670532} 11/07/2021 03:35:58 - INFO - __main__ - Step 45017: {'lr': 0.00040282390308623195, 'samples': 8643264, 'steps': 45016, 'loss/train': 1.355971336364746} 11/07/2021 03:35:59 - INFO - __main__ - Step 45018: {'lr': 0.0004028197032818726, 'samples': 8643456, 'steps': 45017, 'loss/train': 1.0107563734054565} 11/07/2021 03:36:00 - INFO - __main__ - Step 45019: {'lr': 0.00040281550340865493, 'samples': 8643648, 'steps': 45018, 'loss/train': 1.496358871459961} 11/07/2021 03:36:00 - INFO - __main__ - Step 45020: {'lr': 0.000402811303466581, 'samples': 8643840, 'steps': 45019, 'loss/train': 1.4261424541473389} 11/07/2021 03:36:00 - INFO - __main__ - Step 45021: {'lr': 0.00040280710345565277, 'samples': 8644032, 'steps': 45020, 'loss/train': 1.3391042947769165} 11/07/2021 03:36:01 - INFO - __main__ - Step 45022: {'lr': 0.0004028029033758719, 'samples': 8644224, 'steps': 45021, 'loss/train': 1.426767349243164} 11/07/2021 03:36:02 - INFO - __main__ - Step 45023: {'lr': 0.00040279870322724044, 'samples': 8644416, 'steps': 45022, 'loss/train': 1.71662175655365} 11/07/2021 03:36:02 - INFO - __main__ - Step 45024: {'lr': 0.00040279450300976025, 'samples': 8644608, 'steps': 45023, 'loss/train': 1.476520299911499} 11/07/2021 03:36:02 - INFO - __main__ - Step 45025: {'lr': 0.0004027903027234332, 'samples': 8644800, 'steps': 45024, 'loss/train': 1.5499037504196167} 11/07/2021 03:36:03 - INFO - __main__ - Step 45026: {'lr': 0.0004027861023682612, 'samples': 8644992, 'steps': 45025, 'loss/train': 0.5358538031578064} 11/07/2021 03:36:03 - INFO - __main__ - Step 45027: {'lr': 0.00040278190194424613, 'samples': 8645184, 'steps': 45026, 'loss/train': 0.8318495750427246} 11/07/2021 03:36:04 - INFO - __main__ - Step 45028: {'lr': 0.0004027777014513899, 'samples': 8645376, 'steps': 45027, 'loss/train': 1.6739071607589722} 11/07/2021 03:36:04 - INFO - __main__ - Step 45029: {'lr': 0.0004027735008896944, 'samples': 8645568, 'steps': 45028, 'loss/train': 1.6734158992767334} 11/07/2021 03:36:05 - INFO - __main__ - Step 45030: {'lr': 0.0004027693002591615, 'samples': 8645760, 'steps': 45029, 'loss/train': 1.218596339225769} 11/07/2021 03:36:05 - INFO - __main__ - Step 45031: {'lr': 0.0004027650995597931, 'samples': 8645952, 'steps': 45030, 'loss/train': 1.261763572692871} 11/07/2021 03:36:05 - INFO - __main__ - Step 45032: {'lr': 0.0004027608987915912, 'samples': 8646144, 'steps': 45031, 'loss/train': 1.4001818895339966} 11/07/2021 03:36:06 - INFO - __main__ - Step 45033: {'lr': 0.0004027566979545574, 'samples': 8646336, 'steps': 45032, 'loss/train': 1.6564899682998657} 11/07/2021 03:36:07 - INFO - __main__ - Step 45034: {'lr': 0.000402752497048694, 'samples': 8646528, 'steps': 45033, 'loss/train': 1.373465657234192} 11/07/2021 03:36:07 - INFO - __main__ - Step 45035: {'lr': 0.0004027482960740026, 'samples': 8646720, 'steps': 45034, 'loss/train': 1.378868818283081} 11/07/2021 03:36:08 - INFO - __main__ - Step 45036: {'lr': 0.00040274409503048513, 'samples': 8646912, 'steps': 45035, 'loss/train': 1.5058993101119995} 11/07/2021 03:36:08 - INFO - __main__ - Step 45037: {'lr': 0.0004027398939181436, 'samples': 8647104, 'steps': 45036, 'loss/train': 1.4314427375793457} 11/07/2021 03:36:08 - INFO - __main__ - Step 45038: {'lr': 0.00040273569273697974, 'samples': 8647296, 'steps': 45037, 'loss/train': 1.5494295358657837} 11/07/2021 03:36:09 - INFO - __main__ - Step 45039: {'lr': 0.0004027314914869956, 'samples': 8647488, 'steps': 45038, 'loss/train': 1.506453514099121} 11/07/2021 03:36:10 - INFO - __main__ - Step 45040: {'lr': 0.000402727290168193, 'samples': 8647680, 'steps': 45039, 'loss/train': 1.688452959060669} 11/07/2021 03:36:10 - INFO - __main__ - Step 45041: {'lr': 0.00040272308878057383, 'samples': 8647872, 'steps': 45040, 'loss/train': 1.3499056100845337} 11/07/2021 03:36:10 - INFO - __main__ - Step 45042: {'lr': 0.0004027188873241401, 'samples': 8648064, 'steps': 45041, 'loss/train': 1.4158624410629272} 11/07/2021 03:36:11 - INFO - __main__ - Step 45043: {'lr': 0.00040271468579889346, 'samples': 8648256, 'steps': 45042, 'loss/train': 1.4179600477218628} 11/07/2021 03:36:12 - INFO - __main__ - Step 45044: {'lr': 0.0004027104842048359, 'samples': 8648448, 'steps': 45043, 'loss/train': 2.341157913208008} 11/07/2021 03:36:12 - INFO - __main__ - Step 45045: {'lr': 0.0004027062825419695, 'samples': 8648640, 'steps': 45044, 'loss/train': 1.744893193244934} 11/07/2021 03:36:12 - INFO - __main__ - Step 45046: {'lr': 0.0004027020808102959, 'samples': 8648832, 'steps': 45045, 'loss/train': 0.9090373516082764} 11/07/2021 03:36:13 - INFO - __main__ - Step 45047: {'lr': 0.0004026978790098171, 'samples': 8649024, 'steps': 45046, 'loss/train': 0.878197193145752} 11/07/2021 03:36:13 - INFO - __main__ - Step 45048: {'lr': 0.0004026936771405351, 'samples': 8649216, 'steps': 45047, 'loss/train': 1.6723681688308716} 11/07/2021 03:36:14 - INFO - __main__ - Step 45049: {'lr': 0.0004026894752024516, 'samples': 8649408, 'steps': 45048, 'loss/train': 1.3421533107757568} 11/07/2021 03:36:14 - INFO - __main__ - Step 45050: {'lr': 0.00040268527319556856, 'samples': 8649600, 'steps': 45049, 'loss/train': 1.3463534116744995} 11/07/2021 03:36:15 - INFO - __main__ - Step 45051: {'lr': 0.0004026810711198879, 'samples': 8649792, 'steps': 45050, 'loss/train': 1.0824987888336182} 11/07/2021 03:36:15 - INFO - __main__ - Step 45052: {'lr': 0.00040267686897541157, 'samples': 8649984, 'steps': 45051, 'loss/train': 1.6015963554382324} 11/07/2021 03:36:15 - INFO - __main__ - Step 45053: {'lr': 0.0004026726667621413, 'samples': 8650176, 'steps': 45052, 'loss/train': 1.3904969692230225} 11/07/2021 03:36:16 - INFO - __main__ - Step 45054: {'lr': 0.00040266846448007914, 'samples': 8650368, 'steps': 45053, 'loss/train': 1.6913669109344482} 11/07/2021 03:36:17 - INFO - __main__ - Step 45055: {'lr': 0.00040266426212922697, 'samples': 8650560, 'steps': 45054, 'loss/train': 1.5606504678726196} 11/07/2021 03:36:17 - INFO - __main__ - Step 45056: {'lr': 0.00040266005970958656, 'samples': 8650752, 'steps': 45055, 'loss/train': 1.2225046157836914} 11/07/2021 03:36:17 - INFO - __main__ - Step 45057: {'lr': 0.0004026558572211599, 'samples': 8650944, 'steps': 45056, 'loss/train': 1.4816373586654663} 11/07/2021 03:36:18 - INFO - __main__ - Step 45058: {'lr': 0.00040265165466394894, 'samples': 8651136, 'steps': 45057, 'loss/train': 1.6577447652816772} 11/07/2021 03:36:19 - INFO - __main__ - Step 45059: {'lr': 0.00040264745203795536, 'samples': 8651328, 'steps': 45058, 'loss/train': 1.5056010484695435} 11/07/2021 03:36:19 - INFO - __main__ - Step 45060: {'lr': 0.0004026432493431813, 'samples': 8651520, 'steps': 45059, 'loss/train': 1.4441595077514648} 11/07/2021 03:36:20 - INFO - __main__ - Step 45061: {'lr': 0.0004026390465796286, 'samples': 8651712, 'steps': 45060, 'loss/train': 1.7447130680084229} 11/07/2021 03:36:20 - INFO - __main__ - Step 45062: {'lr': 0.000402634843747299, 'samples': 8651904, 'steps': 45061, 'loss/train': 1.3498313426971436} 11/07/2021 03:36:20 - INFO - __main__ - Step 45063: {'lr': 0.0004026306408461945, 'samples': 8652096, 'steps': 45062, 'loss/train': 1.6140329837799072} 11/07/2021 03:36:21 - INFO - __main__ - Step 45064: {'lr': 0.000402626437876317, 'samples': 8652288, 'steps': 45063, 'loss/train': 1.8729219436645508} 11/07/2021 03:36:22 - INFO - __main__ - Step 45065: {'lr': 0.00040262223483766835, 'samples': 8652480, 'steps': 45064, 'loss/train': 1.4350508451461792} 11/07/2021 03:36:22 - INFO - __main__ - Step 45066: {'lr': 0.0004026180317302506, 'samples': 8652672, 'steps': 45065, 'loss/train': 1.5817408561706543} 11/07/2021 03:36:22 - INFO - __main__ - Step 45067: {'lr': 0.0004026138285540654, 'samples': 8652864, 'steps': 45066, 'loss/train': 1.3502610921859741} 11/07/2021 03:36:23 - INFO - __main__ - Step 45068: {'lr': 0.0004026096253091148, 'samples': 8653056, 'steps': 45067, 'loss/train': 1.4997808933258057} 11/07/2021 03:36:23 - INFO - __main__ - Step 45069: {'lr': 0.00040260542199540064, 'samples': 8653248, 'steps': 45068, 'loss/train': 1.4410582780838013} 11/07/2021 03:36:24 - INFO - __main__ - Step 45070: {'lr': 0.00040260121861292484, 'samples': 8653440, 'steps': 45069, 'loss/train': 1.253722071647644} 11/07/2021 03:36:25 - INFO - __main__ - Step 45071: {'lr': 0.0004025970151616893, 'samples': 8653632, 'steps': 45070, 'loss/train': 1.573169469833374} 11/07/2021 03:36:25 - INFO - __main__ - Step 45072: {'lr': 0.0004025928116416959, 'samples': 8653824, 'steps': 45071, 'loss/train': 1.4829283952713013} 11/07/2021 03:36:25 - INFO - __main__ - Step 45073: {'lr': 0.0004025886080529465, 'samples': 8654016, 'steps': 45072, 'loss/train': 2.308427095413208} 11/07/2021 03:36:26 - INFO - __main__ - Step 45074: {'lr': 0.00040258440439544307, 'samples': 8654208, 'steps': 45073, 'loss/train': 1.4862397909164429} 11/07/2021 03:36:27 - INFO - __main__ - Step 45075: {'lr': 0.0004025802006691874, 'samples': 8654400, 'steps': 45074, 'loss/train': 1.362560510635376} 11/07/2021 03:36:27 - INFO - __main__ - Step 45076: {'lr': 0.0004025759968741816, 'samples': 8654592, 'steps': 45075, 'loss/train': 1.7087560892105103} 11/07/2021 03:36:27 - INFO - __main__ - Step 45077: {'lr': 0.00040257179301042724, 'samples': 8654784, 'steps': 45076, 'loss/train': 1.1875423192977905} 11/07/2021 03:36:28 - INFO - __main__ - Step 45078: {'lr': 0.00040256758907792646, 'samples': 8654976, 'steps': 45077, 'loss/train': 1.640230417251587} 11/07/2021 03:36:28 - INFO - __main__ - Step 45079: {'lr': 0.0004025633850766811, 'samples': 8655168, 'steps': 45078, 'loss/train': 0.8937728404998779} 11/07/2021 03:36:29 - INFO - __main__ - Step 45080: {'lr': 0.00040255918100669296, 'samples': 8655360, 'steps': 45079, 'loss/train': 1.7140698432922363} 11/07/2021 03:36:30 - INFO - __main__ - Step 45081: {'lr': 0.000402554976867964, 'samples': 8655552, 'steps': 45080, 'loss/train': 2.8504831790924072} 11/07/2021 03:36:30 - INFO - __main__ - Step 45082: {'lr': 0.00040255077266049624, 'samples': 8655744, 'steps': 45081, 'loss/train': 1.5986260175704956} 11/07/2021 03:36:30 - INFO - __main__ - Step 45083: {'lr': 0.0004025465683842914, 'samples': 8655936, 'steps': 45082, 'loss/train': 1.4731183052062988} 11/07/2021 03:36:31 - INFO - __main__ - Step 45084: {'lr': 0.0004025423640393514, 'samples': 8656128, 'steps': 45083, 'loss/train': 1.6889382600784302} 11/07/2021 03:36:31 - INFO - __main__ - Step 45085: {'lr': 0.0004025381596256782, 'samples': 8656320, 'steps': 45084, 'loss/train': 1.5213496685028076} 11/07/2021 03:36:32 - INFO - __main__ - Step 45086: {'lr': 0.0004025339551432736, 'samples': 8656512, 'steps': 45085, 'loss/train': 1.6633069515228271} 11/07/2021 03:36:32 - INFO - __main__ - Step 45087: {'lr': 0.0004025297505921396, 'samples': 8656704, 'steps': 45086, 'loss/train': 1.361456036567688} 11/07/2021 03:36:33 - INFO - __main__ - Step 45088: {'lr': 0.00040252554597227795, 'samples': 8656896, 'steps': 45087, 'loss/train': 1.7299543619155884} 11/07/2021 03:36:33 - INFO - __main__ - Step 45089: {'lr': 0.00040252134128369085, 'samples': 8657088, 'steps': 45088, 'loss/train': 1.6604050397872925} 11/07/2021 03:36:34 - INFO - __main__ - Step 45090: {'lr': 0.00040251713652637985, 'samples': 8657280, 'steps': 45089, 'loss/train': 1.785449743270874} 11/07/2021 03:36:34 - INFO - __main__ - Step 45091: {'lr': 0.00040251293170034697, 'samples': 8657472, 'steps': 45090, 'loss/train': 1.5653769969940186} 11/07/2021 03:36:35 - INFO - __main__ - Step 45092: {'lr': 0.00040250872680559416, 'samples': 8657664, 'steps': 45091, 'loss/train': 1.049634337425232} 11/07/2021 03:36:35 - INFO - __main__ - Step 45093: {'lr': 0.00040250452184212326, 'samples': 8657856, 'steps': 45092, 'loss/train': 1.2109078168869019} 11/07/2021 03:36:36 - INFO - __main__ - Step 45094: {'lr': 0.00040250031680993617, 'samples': 8658048, 'steps': 45093, 'loss/train': 1.3026163578033447} 11/07/2021 03:36:36 - INFO - __main__ - Step 45095: {'lr': 0.0004024961117090348, 'samples': 8658240, 'steps': 45094, 'loss/train': 1.310563087463379} 11/07/2021 03:36:36 - INFO - __main__ - Step 45096: {'lr': 0.00040249190653942105, 'samples': 8658432, 'steps': 45095, 'loss/train': 1.729379415512085} 11/07/2021 03:36:37 - INFO - __main__ - Step 45097: {'lr': 0.00040248770130109677, 'samples': 8658624, 'steps': 45096, 'loss/train': 1.745119571685791} 11/07/2021 03:36:38 - INFO - __main__ - Step 45098: {'lr': 0.0004024834959940639, 'samples': 8658816, 'steps': 45097, 'loss/train': 1.3489664793014526} 11/07/2021 03:36:38 - INFO - __main__ - Step 45099: {'lr': 0.0004024792906183243, 'samples': 8659008, 'steps': 45098, 'loss/train': 1.7792701721191406} 11/07/2021 03:36:38 - INFO - __main__ - Step 45100: {'lr': 0.0004024750851738799, 'samples': 8659200, 'steps': 45099, 'loss/train': 1.3792343139648438} 11/07/2021 03:36:39 - INFO - __main__ - Step 45101: {'lr': 0.00040247087966073253, 'samples': 8659392, 'steps': 45100, 'loss/train': 0.5739343762397766} 11/07/2021 03:36:40 - INFO - __main__ - Step 45102: {'lr': 0.00040246667407888427, 'samples': 8659584, 'steps': 45101, 'loss/train': 1.3865184783935547} 11/07/2021 03:36:40 - INFO - __main__ - Step 45103: {'lr': 0.0004024624684283368, 'samples': 8659776, 'steps': 45102, 'loss/train': 1.595341682434082} 11/07/2021 03:36:40 - INFO - __main__ - Step 45104: {'lr': 0.000402458262709092, 'samples': 8659968, 'steps': 45103, 'loss/train': 1.605851650238037} 11/07/2021 03:36:41 - INFO - __main__ - Step 45105: {'lr': 0.00040245405692115193, 'samples': 8660160, 'steps': 45104, 'loss/train': 1.5262550115585327} 11/07/2021 03:36:41 - INFO - __main__ - Step 45106: {'lr': 0.0004024498510645185, 'samples': 8660352, 'steps': 45105, 'loss/train': 1.4625663757324219} 11/07/2021 03:36:42 - INFO - __main__ - Step 45107: {'lr': 0.0004024456451391934, 'samples': 8660544, 'steps': 45106, 'loss/train': 0.90992271900177} 11/07/2021 03:36:43 - INFO - __main__ - Step 45108: {'lr': 0.0004024414391451787, 'samples': 8660736, 'steps': 45107, 'loss/train': 1.426047921180725} 11/07/2021 03:36:43 - INFO - __main__ - Step 45109: {'lr': 0.00040243723308247624, 'samples': 8660928, 'steps': 45108, 'loss/train': 1.5061769485473633} 11/07/2021 03:36:43 - INFO - __main__ - Step 45110: {'lr': 0.0004024330269510879, 'samples': 8661120, 'steps': 45109, 'loss/train': 1.8153321743011475} 11/07/2021 03:36:44 - INFO - __main__ - Step 45111: {'lr': 0.00040242882075101563, 'samples': 8661312, 'steps': 45110, 'loss/train': 1.7206075191497803} 11/07/2021 03:36:45 - INFO - __main__ - Step 45112: {'lr': 0.0004024246144822612, 'samples': 8661504, 'steps': 45111, 'loss/train': 1.9978528022766113} 11/07/2021 03:36:45 - INFO - __main__ - Step 45113: {'lr': 0.00040242040814482665, 'samples': 8661696, 'steps': 45112, 'loss/train': 1.1356459856033325} 11/07/2021 03:36:45 - INFO - __main__ - Step 45114: {'lr': 0.00040241620173871385, 'samples': 8661888, 'steps': 45113, 'loss/train': 1.4631052017211914} 11/07/2021 03:36:46 - INFO - __main__ - Step 45115: {'lr': 0.0004024119952639246, 'samples': 8662080, 'steps': 45114, 'loss/train': 1.5599628686904907} 11/07/2021 03:36:46 - INFO - __main__ - Step 45116: {'lr': 0.00040240778872046093, 'samples': 8662272, 'steps': 45115, 'loss/train': 1.4171319007873535} 11/07/2021 03:36:47 - INFO - __main__ - Step 45117: {'lr': 0.00040240358210832456, 'samples': 8662464, 'steps': 45116, 'loss/train': 1.6910568475723267} 11/07/2021 03:36:47 - INFO - __main__ - Step 45118: {'lr': 0.00040239937542751753, 'samples': 8662656, 'steps': 45117, 'loss/train': 1.4217894077301025} 11/07/2021 03:36:48 - INFO - __main__ - Step 45119: {'lr': 0.0004023951686780417, 'samples': 8662848, 'steps': 45118, 'loss/train': 1.3507628440856934} 11/07/2021 03:36:48 - INFO - __main__ - Step 45120: {'lr': 0.000402390961859899, 'samples': 8663040, 'steps': 45119, 'loss/train': 1.2889933586120605} 11/07/2021 03:36:48 - INFO - __main__ - Step 45121: {'lr': 0.00040238675497309117, 'samples': 8663232, 'steps': 45120, 'loss/train': 1.2425684928894043} 11/07/2021 03:36:50 - INFO - __main__ - Step 45122: {'lr': 0.0004023825480176204, 'samples': 8663424, 'steps': 45121, 'loss/train': 1.8090465068817139} 11/07/2021 03:36:50 - INFO - __main__ - Step 45123: {'lr': 0.0004023783409934882, 'samples': 8663616, 'steps': 45122, 'loss/train': 1.250329613685608} 11/07/2021 03:36:50 - INFO - __main__ - Step 45124: {'lr': 0.00040237413390069684, 'samples': 8663808, 'steps': 45123, 'loss/train': 1.3898934125900269} 11/07/2021 03:36:51 - INFO - __main__ - Step 45125: {'lr': 0.000402369926739248, 'samples': 8664000, 'steps': 45124, 'loss/train': 1.5924787521362305} 11/07/2021 03:36:51 - INFO - __main__ - Step 45126: {'lr': 0.0004023657195091436, 'samples': 8664192, 'steps': 45125, 'loss/train': 1.8747450113296509} 11/07/2021 03:36:52 - INFO - __main__ - Step 45127: {'lr': 0.00040236151221038555, 'samples': 8664384, 'steps': 45126, 'loss/train': 1.515708327293396} 11/07/2021 03:36:52 - INFO - __main__ - Step 45128: {'lr': 0.00040235730484297573, 'samples': 8664576, 'steps': 45127, 'loss/train': 1.0064326524734497} 11/07/2021 03:36:53 - INFO - __main__ - Step 45129: {'lr': 0.00040235309740691607, 'samples': 8664768, 'steps': 45128, 'loss/train': 1.7677335739135742} 11/07/2021 03:36:53 - INFO - __main__ - Step 45130: {'lr': 0.0004023488899022085, 'samples': 8664960, 'steps': 45129, 'loss/train': 1.232685923576355} 11/07/2021 03:36:53 - INFO - __main__ - Step 45131: {'lr': 0.00040234468232885483, 'samples': 8665152, 'steps': 45130, 'loss/train': 1.8822057247161865} 11/07/2021 03:36:54 - INFO - __main__ - Step 45132: {'lr': 0.00040234047468685704, 'samples': 8665344, 'steps': 45131, 'loss/train': 0.13900701701641083} 11/07/2021 03:36:55 - INFO - __main__ - Step 45133: {'lr': 0.00040233626697621695, 'samples': 8665536, 'steps': 45132, 'loss/train': 1.5698128938674927} 11/07/2021 03:36:55 - INFO - __main__ - Step 45134: {'lr': 0.0004023320591969365, 'samples': 8665728, 'steps': 45133, 'loss/train': 1.1442500352859497} 11/07/2021 03:36:55 - INFO - __main__ - Step 45135: {'lr': 0.00040232785134901755, 'samples': 8665920, 'steps': 45134, 'loss/train': 1.6900111436843872} 11/07/2021 03:36:56 - INFO - __main__ - Step 45136: {'lr': 0.0004023236434324621, 'samples': 8666112, 'steps': 45135, 'loss/train': 1.533385157585144} 11/07/2021 03:36:56 - INFO - __main__ - Step 45137: {'lr': 0.0004023194354472719, 'samples': 8666304, 'steps': 45136, 'loss/train': 1.4602547883987427} 11/07/2021 03:36:57 - INFO - __main__ - Step 45138: {'lr': 0.0004023152273934489, 'samples': 8666496, 'steps': 45137, 'loss/train': 1.6864073276519775} 11/07/2021 03:36:58 - INFO - __main__ - Step 45139: {'lr': 0.000402311019270995, 'samples': 8666688, 'steps': 45138, 'loss/train': 1.7880440950393677} 11/07/2021 03:36:58 - INFO - __main__ - Step 45140: {'lr': 0.00040230681107991217, 'samples': 8666880, 'steps': 45139, 'loss/train': 0.9193914532661438} 11/07/2021 03:36:58 - INFO - __main__ - Step 45141: {'lr': 0.0004023026028202021, 'samples': 8667072, 'steps': 45140, 'loss/train': 1.1637225151062012} 11/07/2021 03:36:59 - INFO - __main__ - Step 45142: {'lr': 0.000402298394491867, 'samples': 8667264, 'steps': 45141, 'loss/train': 1.384219765663147} 11/07/2021 03:37:00 - INFO - __main__ - Step 45143: {'lr': 0.0004022941860949085, 'samples': 8667456, 'steps': 45142, 'loss/train': 1.5145856142044067} 11/07/2021 03:37:00 - INFO - __main__ - Step 45144: {'lr': 0.0004022899776293287, 'samples': 8667648, 'steps': 45143, 'loss/train': 1.5330946445465088} 11/07/2021 03:37:00 - INFO - __main__ - Step 45145: {'lr': 0.00040228576909512927, 'samples': 8667840, 'steps': 45144, 'loss/train': 1.4673269987106323} 11/07/2021 03:37:01 - INFO - __main__ - Step 45146: {'lr': 0.0004022815604923122, 'samples': 8668032, 'steps': 45145, 'loss/train': 1.427357792854309} 11/07/2021 03:37:01 - INFO - __main__ - Step 45147: {'lr': 0.00040227735182087954, 'samples': 8668224, 'steps': 45146, 'loss/train': 1.6355878114700317} 11/07/2021 03:37:02 - INFO - __main__ - Step 45148: {'lr': 0.00040227314308083296, 'samples': 8668416, 'steps': 45147, 'loss/train': 1.5041543245315552} 11/07/2021 03:37:03 - INFO - __main__ - Step 45149: {'lr': 0.0004022689342721745, 'samples': 8668608, 'steps': 45148, 'loss/train': 1.401368498802185} 11/07/2021 03:37:03 - INFO - __main__ - Step 45150: {'lr': 0.000402264725394906, 'samples': 8668800, 'steps': 45149, 'loss/train': 1.3359441757202148} 11/07/2021 03:37:03 - INFO - __main__ - Step 45151: {'lr': 0.00040226051644902925, 'samples': 8668992, 'steps': 45150, 'loss/train': 1.7657395601272583} 11/07/2021 03:37:04 - INFO - __main__ - Step 45152: {'lr': 0.0004022563074345464, 'samples': 8669184, 'steps': 45151, 'loss/train': 1.005380630493164} 11/07/2021 03:37:05 - INFO - __main__ - Step 45153: {'lr': 0.00040225209835145916, 'samples': 8669376, 'steps': 45152, 'loss/train': 1.4632766246795654} 11/07/2021 03:37:05 - INFO - __main__ - Step 45154: {'lr': 0.0004022478891997695, 'samples': 8669568, 'steps': 45153, 'loss/train': 1.425004243850708} 11/07/2021 03:37:05 - INFO - __main__ - Step 45155: {'lr': 0.0004022436799794792, 'samples': 8669760, 'steps': 45154, 'loss/train': 1.634901523590088} 11/07/2021 03:37:06 - INFO - __main__ - Step 45156: {'lr': 0.0004022394706905904, 'samples': 8669952, 'steps': 45155, 'loss/train': 1.547129511833191} 11/07/2021 03:37:06 - INFO - __main__ - Step 45157: {'lr': 0.0004022352613331047, 'samples': 8670144, 'steps': 45156, 'loss/train': 1.5003315210342407} 11/07/2021 03:37:07 - INFO - __main__ - Step 45158: {'lr': 0.0004022310519070242, 'samples': 8670336, 'steps': 45157, 'loss/train': 1.1571340560913086} 11/07/2021 03:37:07 - INFO - __main__ - Step 45159: {'lr': 0.00040222684241235075, 'samples': 8670528, 'steps': 45158, 'loss/train': 1.305799961090088} 11/07/2021 03:37:08 - INFO - __main__ - Step 45160: {'lr': 0.00040222263284908616, 'samples': 8670720, 'steps': 45159, 'loss/train': 1.6285111904144287} 11/07/2021 03:37:08 - INFO - __main__ - Step 45161: {'lr': 0.00040221842321723245, 'samples': 8670912, 'steps': 45160, 'loss/train': 1.3914912939071655} 11/07/2021 03:37:08 - INFO - __main__ - Step 45162: {'lr': 0.0004022142135167915, 'samples': 8671104, 'steps': 45161, 'loss/train': 1.4020293951034546} 11/07/2021 03:37:09 - INFO - __main__ - Step 45163: {'lr': 0.0004022100037477652, 'samples': 8671296, 'steps': 45162, 'loss/train': 1.3243298530578613} 11/07/2021 03:37:10 - INFO - __main__ - Step 45164: {'lr': 0.0004022057939101553, 'samples': 8671488, 'steps': 45163, 'loss/train': 1.6081992387771606} 11/07/2021 03:37:10 - INFO - __main__ - Step 45165: {'lr': 0.0004022015840039639, 'samples': 8671680, 'steps': 45164, 'loss/train': 0.20652131736278534} 11/07/2021 03:37:11 - INFO - __main__ - Step 45166: {'lr': 0.00040219737402919284, 'samples': 8671872, 'steps': 45165, 'loss/train': 1.3107188940048218} 11/07/2021 03:37:11 - INFO - __main__ - Step 45167: {'lr': 0.0004021931639858439, 'samples': 8672064, 'steps': 45166, 'loss/train': 1.767072081565857} 11/07/2021 03:37:11 - INFO - __main__ - Step 45168: {'lr': 0.00040218895387391913, 'samples': 8672256, 'steps': 45167, 'loss/train': 1.615883708000183} 11/07/2021 03:37:12 - INFO - __main__ - Step 45169: {'lr': 0.0004021847436934204, 'samples': 8672448, 'steps': 45168, 'loss/train': 1.5700477361679077} 11/07/2021 03:37:13 - INFO - __main__ - Step 45170: {'lr': 0.0004021805334443496, 'samples': 8672640, 'steps': 45169, 'loss/train': 1.1653289794921875} 11/07/2021 03:37:13 - INFO - __main__ - Step 45171: {'lr': 0.00040217632312670846, 'samples': 8672832, 'steps': 45170, 'loss/train': 1.2083021402359009} 11/07/2021 03:37:13 - INFO - __main__ - Step 45172: {'lr': 0.0004021721127404991, 'samples': 8673024, 'steps': 45171, 'loss/train': 1.7216506004333496} 11/07/2021 03:37:14 - INFO - __main__ - Step 45173: {'lr': 0.0004021679022857233, 'samples': 8673216, 'steps': 45172, 'loss/train': 1.6763455867767334} 11/07/2021 03:37:15 - INFO - __main__ - Step 45174: {'lr': 0.000402163691762383, 'samples': 8673408, 'steps': 45173, 'loss/train': 1.2438222169876099} 11/07/2021 03:37:15 - INFO - __main__ - Step 45175: {'lr': 0.00040215948117048006, 'samples': 8673600, 'steps': 45174, 'loss/train': 1.4948920011520386} 11/07/2021 03:37:15 - INFO - __main__ - Step 45176: {'lr': 0.00040215527051001653, 'samples': 8673792, 'steps': 45175, 'loss/train': 1.5023847818374634} 11/07/2021 03:37:16 - INFO - __main__ - Step 45177: {'lr': 0.00040215105978099407, 'samples': 8673984, 'steps': 45176, 'loss/train': 1.5538227558135986} 11/07/2021 03:37:16 - INFO - __main__ - Step 45178: {'lr': 0.00040214684898341475, 'samples': 8674176, 'steps': 45177, 'loss/train': 1.6242763996124268} 11/07/2021 03:37:18 - INFO - __main__ - Step 45179: {'lr': 0.00040214263811728034, 'samples': 8674368, 'steps': 45178, 'loss/train': 1.148213267326355} 11/07/2021 03:37:18 - INFO - __main__ - Step 45180: {'lr': 0.00040213842718259287, 'samples': 8674560, 'steps': 45179, 'loss/train': 1.8194248676300049} 11/07/2021 03:37:19 - INFO - __main__ - Step 45181: {'lr': 0.00040213421617935416, 'samples': 8674752, 'steps': 45180, 'loss/train': 1.411015510559082} 11/07/2021 03:37:19 - INFO - __main__ - Step 45182: {'lr': 0.000402130005107566, 'samples': 8674944, 'steps': 45181, 'loss/train': 3.0634677410125732} 11/07/2021 03:37:19 - INFO - __main__ - Step 45183: {'lr': 0.0004021257939672306, 'samples': 8675136, 'steps': 45182, 'loss/train': 1.4965283870697021} 11/07/2021 03:37:20 - INFO - __main__ - Step 45184: {'lr': 0.0004021215827583496, 'samples': 8675328, 'steps': 45183, 'loss/train': 1.5582106113433838} 11/07/2021 03:37:21 - INFO - __main__ - Step 45185: {'lr': 0.0004021173714809249, 'samples': 8675520, 'steps': 45184, 'loss/train': 1.5984597206115723} 11/07/2021 03:37:21 - INFO - __main__ - Step 45186: {'lr': 0.0004021131601349585, 'samples': 8675712, 'steps': 45185, 'loss/train': 1.2047393321990967} 11/07/2021 03:37:21 - INFO - __main__ - Step 45187: {'lr': 0.0004021089487204522, 'samples': 8675904, 'steps': 45186, 'loss/train': 1.9748467206954956} 11/07/2021 03:37:22 - INFO - __main__ - Step 45188: {'lr': 0.00040210473723740803, 'samples': 8676096, 'steps': 45187, 'loss/train': 1.7441582679748535} 11/07/2021 03:37:22 - INFO - __main__ - Step 45189: {'lr': 0.0004021005256858279, 'samples': 8676288, 'steps': 45188, 'loss/train': 1.3486766815185547} 11/07/2021 03:37:23 - INFO - __main__ - Step 45190: {'lr': 0.00040209631406571344, 'samples': 8676480, 'steps': 45189, 'loss/train': 2.29198956489563} 11/07/2021 03:37:23 - INFO - __main__ - Step 45191: {'lr': 0.00040209210237706684, 'samples': 8676672, 'steps': 45190, 'loss/train': 1.65663743019104} 11/07/2021 03:37:24 - INFO - __main__ - Step 45192: {'lr': 0.0004020878906198898, 'samples': 8676864, 'steps': 45191, 'loss/train': 1.2061249017715454} 11/07/2021 03:37:24 - INFO - __main__ - Step 45193: {'lr': 0.0004020836787941844, 'samples': 8677056, 'steps': 45192, 'loss/train': 1.2055654525756836} 11/07/2021 03:37:25 - INFO - __main__ - Step 45194: {'lr': 0.0004020794668999524, 'samples': 8677248, 'steps': 45193, 'loss/train': 1.764271855354309} 11/07/2021 03:37:26 - INFO - __main__ - Step 45195: {'lr': 0.0004020752549371957, 'samples': 8677440, 'steps': 45194, 'loss/train': 1.267154335975647} 11/07/2021 03:37:26 - INFO - __main__ - Step 45196: {'lr': 0.00040207104290591633, 'samples': 8677632, 'steps': 45195, 'loss/train': 1.8496373891830444} 11/07/2021 03:37:26 - INFO - __main__ - Step 45197: {'lr': 0.000402066830806116, 'samples': 8677824, 'steps': 45196, 'loss/train': 1.0303630828857422} 11/07/2021 03:37:27 - INFO - __main__ - Step 45198: {'lr': 0.0004020626186377967, 'samples': 8678016, 'steps': 45197, 'loss/train': 1.5434365272521973} 11/07/2021 03:37:27 - INFO - __main__ - Step 45199: {'lr': 0.00040205840640096036, 'samples': 8678208, 'steps': 45198, 'loss/train': 1.3023549318313599} 11/07/2021 03:37:27 - INFO - __main__ - Step 45200: {'lr': 0.0004020541940956089, 'samples': 8678400, 'steps': 45199, 'loss/train': 1.1285649538040161} 11/07/2021 03:37:28 - INFO - __main__ - Step 45201: {'lr': 0.0004020499817217441, 'samples': 8678592, 'steps': 45200, 'loss/train': 1.0267317295074463} 11/07/2021 03:37:29 - INFO - __main__ - Step 45202: {'lr': 0.000402045769279368, 'samples': 8678784, 'steps': 45201, 'loss/train': 1.3042110204696655} 11/07/2021 03:37:29 - INFO - __main__ - Step 45203: {'lr': 0.0004020415567684823, 'samples': 8678976, 'steps': 45202, 'loss/train': 1.1630033254623413} 11/07/2021 03:37:30 - INFO - __main__ - Step 45204: {'lr': 0.0004020373441890891, 'samples': 8679168, 'steps': 45203, 'loss/train': 2.08655047416687} 11/07/2021 03:37:30 - INFO - __main__ - Step 45205: {'lr': 0.00040203313154119026, 'samples': 8679360, 'steps': 45204, 'loss/train': 1.9520084857940674} 11/07/2021 03:37:32 - INFO - __main__ - Step 45206: {'lr': 0.00040202891882478754, 'samples': 8679552, 'steps': 45205, 'loss/train': 1.7195074558258057} 11/07/2021 03:37:32 - INFO - __main__ - Step 45207: {'lr': 0.000402024706039883, 'samples': 8679744, 'steps': 45206, 'loss/train': 1.6180075407028198} 11/07/2021 03:37:33 - INFO - __main__ - Step 45208: {'lr': 0.0004020204931864785, 'samples': 8679936, 'steps': 45207, 'loss/train': 1.2656595706939697} 11/07/2021 03:37:33 - INFO - __main__ - Step 45209: {'lr': 0.0004020162802645758, 'samples': 8680128, 'steps': 45208, 'loss/train': 1.5247958898544312} 11/07/2021 03:37:33 - INFO - __main__ - Step 45210: {'lr': 0.000402012067274177, 'samples': 8680320, 'steps': 45209, 'loss/train': 1.5917531251907349} 11/07/2021 03:37:34 - INFO - __main__ - Step 45211: {'lr': 0.0004020078542152839, 'samples': 8680512, 'steps': 45210, 'loss/train': 2.631469249725342} 11/07/2021 03:37:34 - INFO - __main__ - Step 45212: {'lr': 0.0004020036410878984, 'samples': 8680704, 'steps': 45211, 'loss/train': 1.7594704627990723} 11/07/2021 03:37:35 - INFO - __main__ - Step 45213: {'lr': 0.0004019994278920224, 'samples': 8680896, 'steps': 45212, 'loss/train': 1.6504069566726685} 11/07/2021 03:37:35 - INFO - __main__ - Step 45214: {'lr': 0.00040199521462765776, 'samples': 8681088, 'steps': 45213, 'loss/train': 1.2237399816513062} 11/07/2021 03:37:36 - INFO - __main__ - Step 45215: {'lr': 0.0004019910012948065, 'samples': 8681280, 'steps': 45214, 'loss/train': 1.6197471618652344} 11/07/2021 03:37:36 - INFO - __main__ - Step 45216: {'lr': 0.0004019867878934704, 'samples': 8681472, 'steps': 45215, 'loss/train': 1.4788774251937866} 11/07/2021 03:37:36 - INFO - __main__ - Step 45217: {'lr': 0.0004019825744236514, 'samples': 8681664, 'steps': 45216, 'loss/train': 1.4490773677825928} 11/07/2021 03:37:37 - INFO - __main__ - Step 45218: {'lr': 0.0004019783608853513, 'samples': 8681856, 'steps': 45217, 'loss/train': 1.8118454217910767} 11/07/2021 03:37:38 - INFO - __main__ - Step 45219: {'lr': 0.0004019741472785723, 'samples': 8682048, 'steps': 45218, 'loss/train': 1.7203056812286377} 11/07/2021 03:37:38 - INFO - __main__ - Step 45220: {'lr': 0.0004019699336033159, 'samples': 8682240, 'steps': 45219, 'loss/train': 1.4860272407531738} 11/07/2021 03:37:39 - INFO - __main__ - Step 45221: {'lr': 0.0004019657198595843, 'samples': 8682432, 'steps': 45220, 'loss/train': 1.7316553592681885} 11/07/2021 03:37:39 - INFO - __main__ - Step 45222: {'lr': 0.00040196150604737924, 'samples': 8682624, 'steps': 45221, 'loss/train': 1.0518038272857666} 11/07/2021 03:37:39 - INFO - __main__ - Step 45223: {'lr': 0.0004019572921667027, 'samples': 8682816, 'steps': 45222, 'loss/train': 1.50096595287323} 11/07/2021 03:37:40 - INFO - __main__ - Step 45224: {'lr': 0.0004019530782175566, 'samples': 8683008, 'steps': 45223, 'loss/train': 1.5049973726272583} 11/07/2021 03:37:41 - INFO - __main__ - Step 45225: {'lr': 0.00040194886419994274, 'samples': 8683200, 'steps': 45224, 'loss/train': 1.1065824031829834} 11/07/2021 03:37:41 - INFO - __main__ - Step 45226: {'lr': 0.0004019446501138631, 'samples': 8683392, 'steps': 45225, 'loss/train': 1.344346046447754} 11/07/2021 03:37:41 - INFO - __main__ - Step 45227: {'lr': 0.0004019404359593195, 'samples': 8683584, 'steps': 45226, 'loss/train': 0.24694529175758362} 11/07/2021 03:37:42 - INFO - __main__ - Step 45228: {'lr': 0.0004019362217363138, 'samples': 8683776, 'steps': 45227, 'loss/train': 1.2918033599853516} 11/07/2021 03:37:43 - INFO - __main__ - Step 45229: {'lr': 0.00040193200744484815, 'samples': 8683968, 'steps': 45228, 'loss/train': 1.40713632106781} 11/07/2021 03:37:43 - INFO - __main__ - Step 45230: {'lr': 0.00040192779308492423, 'samples': 8684160, 'steps': 45229, 'loss/train': 1.5547404289245605} 11/07/2021 03:37:44 - INFO - __main__ - Step 45231: {'lr': 0.00040192357865654395, 'samples': 8684352, 'steps': 45230, 'loss/train': 1.2795825004577637} 11/07/2021 03:37:44 - INFO - __main__ - Step 45232: {'lr': 0.00040191936415970926, 'samples': 8684544, 'steps': 45231, 'loss/train': 1.4769785404205322} 11/07/2021 03:37:44 - INFO - __main__ - Step 45233: {'lr': 0.00040191514959442206, 'samples': 8684736, 'steps': 45232, 'loss/train': 1.2793304920196533} 11/07/2021 03:37:45 - INFO - __main__ - Step 45234: {'lr': 0.0004019109349606842, 'samples': 8684928, 'steps': 45233, 'loss/train': 1.544562578201294} 11/07/2021 03:37:46 - INFO - __main__ - Step 45235: {'lr': 0.0004019067202584977, 'samples': 8685120, 'steps': 45234, 'loss/train': 2.054046392440796} 11/07/2021 03:37:46 - INFO - __main__ - Step 45236: {'lr': 0.0004019025054878643, 'samples': 8685312, 'steps': 45235, 'loss/train': 3.486762762069702} 11/07/2021 03:37:46 - INFO - __main__ - Step 45237: {'lr': 0.00040189829064878605, 'samples': 8685504, 'steps': 45236, 'loss/train': 1.7212954759597778} 11/07/2021 03:37:47 - INFO - __main__ - Step 45238: {'lr': 0.0004018940757412647, 'samples': 8685696, 'steps': 45237, 'loss/train': 0.9980286359786987} 11/07/2021 03:37:48 - INFO - __main__ - Step 45239: {'lr': 0.0004018898607653022, 'samples': 8685888, 'steps': 45238, 'loss/train': 1.3442773818969727} 11/07/2021 03:37:48 - INFO - __main__ - Step 45240: {'lr': 0.00040188564572090057, 'samples': 8686080, 'steps': 45239, 'loss/train': 1.4004093408584595} 11/07/2021 03:37:48 - INFO - __main__ - Step 45241: {'lr': 0.00040188143060806156, 'samples': 8686272, 'steps': 45240, 'loss/train': 1.0079407691955566} 11/07/2021 03:37:49 - INFO - __main__ - Step 45242: {'lr': 0.0004018772154267871, 'samples': 8686464, 'steps': 45241, 'loss/train': 1.5502294301986694} 11/07/2021 03:37:49 - INFO - __main__ - Step 45243: {'lr': 0.0004018730001770792, 'samples': 8686656, 'steps': 45242, 'loss/train': 1.4918146133422852} 11/07/2021 03:37:50 - INFO - __main__ - Step 45244: {'lr': 0.00040186878485893955, 'samples': 8686848, 'steps': 45243, 'loss/train': 1.7983713150024414} 11/07/2021 03:37:50 - INFO - __main__ - Step 45245: {'lr': 0.0004018645694723703, 'samples': 8687040, 'steps': 45244, 'loss/train': 1.4935053586959839} 11/07/2021 03:37:51 - INFO - __main__ - Step 45246: {'lr': 0.00040186035401737307, 'samples': 8687232, 'steps': 45245, 'loss/train': 1.4526677131652832} 11/07/2021 03:37:51 - INFO - __main__ - Step 45247: {'lr': 0.00040185613849395, 'samples': 8687424, 'steps': 45246, 'loss/train': 1.2876986265182495} 11/07/2021 03:37:51 - INFO - __main__ - Step 45248: {'lr': 0.0004018519229021029, 'samples': 8687616, 'steps': 45247, 'loss/train': 0.5878735780715942} 11/07/2021 03:37:53 - INFO - __main__ - Step 45249: {'lr': 0.0004018477072418336, 'samples': 8687808, 'steps': 45248, 'loss/train': 1.623437523841858} 11/07/2021 03:37:53 - INFO - __main__ - Step 45250: {'lr': 0.00040184349151314413, 'samples': 8688000, 'steps': 45249, 'loss/train': 1.5730112791061401} 11/07/2021 03:37:53 - INFO - __main__ - Step 45251: {'lr': 0.0004018392757160363, 'samples': 8688192, 'steps': 45250, 'loss/train': 0.9747245907783508} 11/07/2021 03:37:54 - INFO - __main__ - Step 45252: {'lr': 0.00040183505985051204, 'samples': 8688384, 'steps': 45251, 'loss/train': 1.9808520078659058} 11/07/2021 03:37:54 - INFO - __main__ - Step 45253: {'lr': 0.0004018308439165733, 'samples': 8688576, 'steps': 45252, 'loss/train': 0.8069427609443665} 11/07/2021 03:37:54 - INFO - __main__ - Step 45254: {'lr': 0.00040182662791422185, 'samples': 8688768, 'steps': 45253, 'loss/train': 1.6230127811431885} 11/07/2021 03:37:55 - INFO - __main__ - Step 45255: {'lr': 0.0004018224118434597, 'samples': 8688960, 'steps': 45254, 'loss/train': 1.6706337928771973} 11/07/2021 03:37:56 - INFO - __main__ - Step 45256: {'lr': 0.0004018181957042887, 'samples': 8689152, 'steps': 45255, 'loss/train': 1.0620611906051636} 11/07/2021 03:37:56 - INFO - __main__ - Step 45257: {'lr': 0.00040181397949671073, 'samples': 8689344, 'steps': 45256, 'loss/train': 1.2024199962615967} 11/07/2021 03:37:56 - INFO - __main__ - Step 45258: {'lr': 0.00040180976322072776, 'samples': 8689536, 'steps': 45257, 'loss/train': 1.508073091506958} 11/07/2021 03:37:57 - INFO - __main__ - Step 45259: {'lr': 0.0004018055468763416, 'samples': 8689728, 'steps': 45258, 'loss/train': 1.4503211975097656} 11/07/2021 03:37:58 - INFO - __main__ - Step 45260: {'lr': 0.0004018013304635543, 'samples': 8689920, 'steps': 45259, 'loss/train': 1.3935219049453735} 11/07/2021 03:37:58 - INFO - __main__ - Step 45261: {'lr': 0.0004017971139823676, 'samples': 8690112, 'steps': 45260, 'loss/train': 1.8210779428482056} 11/07/2021 03:37:59 - INFO - __main__ - Step 45262: {'lr': 0.0004017928974327835, 'samples': 8690304, 'steps': 45261, 'loss/train': 1.3265295028686523} 11/07/2021 03:37:59 - INFO - __main__ - Step 45263: {'lr': 0.00040178868081480393, 'samples': 8690496, 'steps': 45262, 'loss/train': 1.6089332103729248} 11/07/2021 03:37:59 - INFO - __main__ - Step 45264: {'lr': 0.00040178446412843054, 'samples': 8690688, 'steps': 45263, 'loss/train': 1.2602695226669312} 11/07/2021 03:38:00 - INFO - __main__ - Step 45265: {'lr': 0.0004017802473736655, 'samples': 8690880, 'steps': 45264, 'loss/train': 1.212044358253479} 11/07/2021 03:38:01 - INFO - __main__ - Step 45266: {'lr': 0.00040177603055051065, 'samples': 8691072, 'steps': 45265, 'loss/train': 1.4239474534988403} 11/07/2021 03:38:01 - INFO - __main__ - Step 45267: {'lr': 0.0004017718136589679, 'samples': 8691264, 'steps': 45266, 'loss/train': 1.1865142583847046} 11/07/2021 03:38:01 - INFO - __main__ - Step 45268: {'lr': 0.000401767596699039, 'samples': 8691456, 'steps': 45267, 'loss/train': 1.5081130266189575} 11/07/2021 03:38:02 - INFO - __main__ - Step 45269: {'lr': 0.00040176337967072603, 'samples': 8691648, 'steps': 45268, 'loss/train': 1.8244184255599976} 11/07/2021 03:38:03 - INFO - __main__ - Step 45270: {'lr': 0.0004017591625740308, 'samples': 8691840, 'steps': 45269, 'loss/train': 1.4279512166976929} 11/07/2021 03:38:03 - INFO - __main__ - Step 45271: {'lr': 0.0004017549454089553, 'samples': 8692032, 'steps': 45270, 'loss/train': 1.4407144784927368} 11/07/2021 03:38:03 - INFO - __main__ - Step 45272: {'lr': 0.00040175072817550127, 'samples': 8692224, 'steps': 45271, 'loss/train': 1.6822588443756104} 11/07/2021 03:38:04 - INFO - __main__ - Step 45273: {'lr': 0.00040174651087367076, 'samples': 8692416, 'steps': 45272, 'loss/train': 1.109861969947815} 11/07/2021 03:38:04 - INFO - __main__ - Step 45274: {'lr': 0.0004017422935034656, 'samples': 8692608, 'steps': 45273, 'loss/train': 1.4292773008346558} 11/07/2021 03:38:05 - INFO - __main__ - Step 45275: {'lr': 0.00040173807606488763, 'samples': 8692800, 'steps': 45274, 'loss/train': 0.8860165476799011} 11/07/2021 03:38:06 - INFO - __main__ - Step 45276: {'lr': 0.0004017338585579389, 'samples': 8692992, 'steps': 45275, 'loss/train': 1.3372459411621094} 11/07/2021 03:38:06 - INFO - __main__ - Step 45277: {'lr': 0.0004017296409826213, 'samples': 8693184, 'steps': 45276, 'loss/train': 1.598120927810669} 11/07/2021 03:38:06 - INFO - __main__ - Step 45278: {'lr': 0.00040172542333893657, 'samples': 8693376, 'steps': 45277, 'loss/train': 2.6153388023376465} 11/07/2021 03:38:07 - INFO - __main__ - Step 45279: {'lr': 0.00040172120562688673, 'samples': 8693568, 'steps': 45278, 'loss/train': 1.7494827508926392} 11/07/2021 03:38:07 - INFO - __main__ - Step 45280: {'lr': 0.00040171698784647366, 'samples': 8693760, 'steps': 45279, 'loss/train': 1.1776270866394043} 11/07/2021 03:38:08 - INFO - __main__ - Step 45281: {'lr': 0.00040171276999769926, 'samples': 8693952, 'steps': 45280, 'loss/train': 1.368177056312561} 11/07/2021 03:38:08 - INFO - __main__ - Step 45282: {'lr': 0.00040170855208056537, 'samples': 8694144, 'steps': 45281, 'loss/train': 1.5612167119979858} 11/07/2021 03:38:09 - INFO - __main__ - Step 45283: {'lr': 0.000401704334095074, 'samples': 8694336, 'steps': 45282, 'loss/train': 1.2405672073364258} 11/07/2021 03:38:09 - INFO - __main__ - Step 45284: {'lr': 0.00040170011604122704, 'samples': 8694528, 'steps': 45283, 'loss/train': 1.4168163537979126} 11/07/2021 03:38:09 - INFO - __main__ - Step 45285: {'lr': 0.0004016958979190263, 'samples': 8694720, 'steps': 45284, 'loss/train': 1.6693315505981445} 11/07/2021 03:38:11 - INFO - __main__ - Step 45286: {'lr': 0.0004016916797284738, 'samples': 8694912, 'steps': 45285, 'loss/train': 1.5633200407028198} 11/07/2021 03:38:11 - INFO - __main__ - Step 45287: {'lr': 0.00040168746146957123, 'samples': 8695104, 'steps': 45286, 'loss/train': 1.2865655422210693} 11/07/2021 03:38:11 - INFO - __main__ - Step 45288: {'lr': 0.0004016832431423207, 'samples': 8695296, 'steps': 45287, 'loss/train': 1.1564030647277832} 11/07/2021 03:38:12 - INFO - __main__ - Step 45289: {'lr': 0.00040167902474672404, 'samples': 8695488, 'steps': 45288, 'loss/train': 1.4032548666000366} 11/07/2021 03:38:12 - INFO - __main__ - Step 45290: {'lr': 0.0004016748062827832, 'samples': 8695680, 'steps': 45289, 'loss/train': 1.2627499103546143} 11/07/2021 03:38:13 - INFO - __main__ - Step 45291: {'lr': 0.00040167058775049993, 'samples': 8695872, 'steps': 45290, 'loss/train': 1.611716628074646} 11/07/2021 03:38:13 - INFO - __main__ - Step 45292: {'lr': 0.0004016663691498763, 'samples': 8696064, 'steps': 45291, 'loss/train': 1.4768520593643188} 11/07/2021 03:38:14 - INFO - __main__ - Step 45293: {'lr': 0.00040166215048091414, 'samples': 8696256, 'steps': 45292, 'loss/train': 1.205528974533081} 11/07/2021 03:38:14 - INFO - __main__ - Step 45294: {'lr': 0.0004016579317436153, 'samples': 8696448, 'steps': 45293, 'loss/train': 1.9122101068496704} 11/07/2021 03:38:14 - INFO - __main__ - Step 45295: {'lr': 0.0004016537129379818, 'samples': 8696640, 'steps': 45294, 'loss/train': 1.5891315937042236} 11/07/2021 03:38:15 - INFO - __main__ - Step 45296: {'lr': 0.0004016494940640155, 'samples': 8696832, 'steps': 45295, 'loss/train': 1.4022443294525146} 11/07/2021 03:38:16 - INFO - __main__ - Step 45297: {'lr': 0.0004016452751217183, 'samples': 8697024, 'steps': 45296, 'loss/train': 1.3902982473373413} 11/07/2021 03:38:16 - INFO - __main__ - Step 45298: {'lr': 0.00040164105611109195, 'samples': 8697216, 'steps': 45297, 'loss/train': 1.7416541576385498} 11/07/2021 03:38:17 - INFO - __main__ - Step 45299: {'lr': 0.0004016368370321386, 'samples': 8697408, 'steps': 45298, 'loss/train': 1.1704825162887573} 11/07/2021 03:38:17 - INFO - __main__ - Step 45300: {'lr': 0.00040163261788485994, 'samples': 8697600, 'steps': 45299, 'loss/train': 1.501841425895691} 11/07/2021 03:38:18 - INFO - __main__ - Step 45301: {'lr': 0.00040162839866925804, 'samples': 8697792, 'steps': 45300, 'loss/train': 1.65465247631073} 11/07/2021 03:38:18 - INFO - __main__ - Step 45302: {'lr': 0.0004016241793853347, 'samples': 8697984, 'steps': 45301, 'loss/train': 1.2076376676559448} 11/07/2021 03:38:19 - INFO - __main__ - Step 45303: {'lr': 0.00040161996003309174, 'samples': 8698176, 'steps': 45302, 'loss/train': 1.5029069185256958} 11/07/2021 03:38:19 - INFO - __main__ - Step 45304: {'lr': 0.00040161574061253134, 'samples': 8698368, 'steps': 45303, 'loss/train': 1.6785801649093628} 11/07/2021 03:38:19 - INFO - __main__ - Step 45305: {'lr': 0.0004016115211236552, 'samples': 8698560, 'steps': 45304, 'loss/train': 1.2267104387283325} 11/07/2021 03:38:20 - INFO - __main__ - Step 45306: {'lr': 0.0004016073015664651, 'samples': 8698752, 'steps': 45305, 'loss/train': 1.4622952938079834} 11/07/2021 03:38:21 - INFO - __main__ - Step 45307: {'lr': 0.0004016030819409632, 'samples': 8698944, 'steps': 45306, 'loss/train': 1.6274431943893433} 11/07/2021 03:38:21 - INFO - __main__ - Step 45308: {'lr': 0.00040159886224715126, 'samples': 8699136, 'steps': 45307, 'loss/train': 1.5141170024871826} 11/07/2021 03:38:21 - INFO - __main__ - Step 45309: {'lr': 0.0004015946424850312, 'samples': 8699328, 'steps': 45308, 'loss/train': 1.6308294534683228} 11/07/2021 03:38:22 - INFO - __main__ - Step 45310: {'lr': 0.000401590422654605, 'samples': 8699520, 'steps': 45309, 'loss/train': 1.4504419565200806} 11/07/2021 03:38:23 - INFO - __main__ - Step 45311: {'lr': 0.00040158620275587443, 'samples': 8699712, 'steps': 45310, 'loss/train': 0.8374848961830139} 11/07/2021 03:38:23 - INFO - __main__ - Step 45312: {'lr': 0.0004015819827888415, 'samples': 8699904, 'steps': 45311, 'loss/train': 1.1242897510528564} 11/07/2021 03:38:23 - INFO - __main__ - Step 45313: {'lr': 0.00040157776275350805, 'samples': 8700096, 'steps': 45312, 'loss/train': 1.6858326196670532} 11/07/2021 03:38:24 - INFO - __main__ - Step 45314: {'lr': 0.000401573542649876, 'samples': 8700288, 'steps': 45313, 'loss/train': 1.314355731010437} 11/07/2021 03:38:24 - INFO - __main__ - Step 45315: {'lr': 0.0004015693224779472, 'samples': 8700480, 'steps': 45314, 'loss/train': 1.3788398504257202} 11/07/2021 03:38:24 - INFO - __main__ - Step 45316: {'lr': 0.0004015651022377237, 'samples': 8700672, 'steps': 45315, 'loss/train': 1.6739813089370728} 11/07/2021 03:38:26 - INFO - __main__ - Step 45317: {'lr': 0.00040156088192920726, 'samples': 8700864, 'steps': 45316, 'loss/train': 1.5884895324707031} 11/07/2021 03:38:26 - INFO - __main__ - Step 45318: {'lr': 0.0004015566615523998, 'samples': 8701056, 'steps': 45317, 'loss/train': 1.2685041427612305} 11/07/2021 03:38:26 - INFO - __main__ - Step 45319: {'lr': 0.00040155244110730325, 'samples': 8701248, 'steps': 45318, 'loss/train': 2.00679349899292} 11/07/2021 03:38:27 - INFO - __main__ - Step 45320: {'lr': 0.00040154822059391954, 'samples': 8701440, 'steps': 45319, 'loss/train': 1.6689691543579102} 11/07/2021 03:38:27 - INFO - __main__ - Step 45321: {'lr': 0.00040154400001225055, 'samples': 8701632, 'steps': 45320, 'loss/train': 1.1222220659255981} 11/07/2021 03:38:28 - INFO - __main__ - Step 45322: {'lr': 0.00040153977936229813, 'samples': 8701824, 'steps': 45321, 'loss/train': 1.3170971870422363} 11/07/2021 03:38:28 - INFO - __main__ - Step 45323: {'lr': 0.00040153555864406423, 'samples': 8702016, 'steps': 45322, 'loss/train': 0.9419315457344055} 11/07/2021 03:38:29 - INFO - __main__ - Step 45324: {'lr': 0.0004015313378575508, 'samples': 8702208, 'steps': 45323, 'loss/train': 1.348008394241333} 11/07/2021 03:38:29 - INFO - __main__ - Step 45325: {'lr': 0.00040152711700275963, 'samples': 8702400, 'steps': 45324, 'loss/train': 1.7534323930740356} 11/07/2021 03:38:29 - INFO - __main__ - Step 45326: {'lr': 0.0004015228960796927, 'samples': 8702592, 'steps': 45325, 'loss/train': 1.7659834623336792} 11/07/2021 03:38:30 - INFO - __main__ - Step 45327: {'lr': 0.0004015186750883518, 'samples': 8702784, 'steps': 45326, 'loss/train': 0.9564913511276245} 11/07/2021 03:38:31 - INFO - __main__ - Step 45328: {'lr': 0.0004015144540287391, 'samples': 8702976, 'steps': 45327, 'loss/train': 1.4444808959960938} 11/07/2021 03:38:31 - INFO - __main__ - Step 45329: {'lr': 0.0004015102329008562, 'samples': 8703168, 'steps': 45328, 'loss/train': 0.23689793050289154} 11/07/2021 03:38:31 - INFO - __main__ - Step 45330: {'lr': 0.0004015060117047051, 'samples': 8703360, 'steps': 45329, 'loss/train': 1.4495890140533447} 11/07/2021 03:38:32 - INFO - __main__ - Step 45331: {'lr': 0.0004015017904402879, 'samples': 8703552, 'steps': 45330, 'loss/train': 1.1690396070480347} 11/07/2021 03:38:33 - INFO - __main__ - Step 45332: {'lr': 0.00040149756910760616, 'samples': 8703744, 'steps': 45331, 'loss/train': 1.6938713788986206} 11/07/2021 03:38:33 - INFO - __main__ - Step 45333: {'lr': 0.000401493347706662, 'samples': 8703936, 'steps': 45332, 'loss/train': 1.1185976266860962} 11/07/2021 03:38:33 - INFO - __main__ - Step 45334: {'lr': 0.00040148912623745733, 'samples': 8704128, 'steps': 45333, 'loss/train': 1.6920937299728394} 11/07/2021 03:38:34 - INFO - __main__ - Step 45335: {'lr': 0.0004014849046999939, 'samples': 8704320, 'steps': 45334, 'loss/train': 1.3867719173431396} 11/07/2021 03:38:34 - INFO - __main__ - Step 45336: {'lr': 0.00040148068309427376, 'samples': 8704512, 'steps': 45335, 'loss/train': 1.4321553707122803} 11/07/2021 03:38:35 - INFO - __main__ - Step 45337: {'lr': 0.00040147646142029884, 'samples': 8704704, 'steps': 45336, 'loss/train': 1.781887173652649} 11/07/2021 03:38:36 - INFO - __main__ - Step 45338: {'lr': 0.0004014722396780709, 'samples': 8704896, 'steps': 45337, 'loss/train': 1.198248267173767} 11/07/2021 03:38:36 - INFO - __main__ - Step 45339: {'lr': 0.00040146801786759183, 'samples': 8705088, 'steps': 45338, 'loss/train': 1.3500672578811646} 11/07/2021 03:38:36 - INFO - __main__ - Step 45340: {'lr': 0.00040146379598886376, 'samples': 8705280, 'steps': 45339, 'loss/train': 1.4676085710525513} 11/07/2021 03:38:37 - INFO - __main__ - Step 45341: {'lr': 0.00040145957404188825, 'samples': 8705472, 'steps': 45340, 'loss/train': 1.3935436010360718} 11/07/2021 03:38:38 - INFO - __main__ - Step 45342: {'lr': 0.00040145535202666747, 'samples': 8705664, 'steps': 45341, 'loss/train': 0.6357443332672119} 11/07/2021 03:38:38 - INFO - __main__ - Step 45343: {'lr': 0.0004014511299432033, 'samples': 8705856, 'steps': 45342, 'loss/train': 1.6631529331207275} 11/07/2021 03:38:39 - INFO - __main__ - Step 45344: {'lr': 0.0004014469077914976, 'samples': 8706048, 'steps': 45343, 'loss/train': 1.384961485862732} 11/07/2021 03:38:39 - INFO - __main__ - Step 45345: {'lr': 0.0004014426855715523, 'samples': 8706240, 'steps': 45344, 'loss/train': 1.0772547721862793} 11/07/2021 03:38:39 - INFO - __main__ - Step 45346: {'lr': 0.00040143846328336913, 'samples': 8706432, 'steps': 45345, 'loss/train': 1.3336265087127686} 11/07/2021 03:38:40 - INFO - __main__ - Step 45347: {'lr': 0.00040143424092695015, 'samples': 8706624, 'steps': 45346, 'loss/train': 1.4853858947753906} 11/07/2021 03:38:40 - INFO - __main__ - Step 45348: {'lr': 0.00040143001850229733, 'samples': 8706816, 'steps': 45347, 'loss/train': 2.2581679821014404} 11/07/2021 03:38:41 - INFO - __main__ - Step 45349: {'lr': 0.00040142579600941237, 'samples': 8707008, 'steps': 45348, 'loss/train': 1.3928520679473877} 11/07/2021 03:38:41 - INFO - __main__ - Step 45350: {'lr': 0.0004014215734482973, 'samples': 8707200, 'steps': 45349, 'loss/train': 1.4989149570465088} 11/07/2021 03:38:42 - INFO - __main__ - Step 45351: {'lr': 0.00040141735081895407, 'samples': 8707392, 'steps': 45350, 'loss/train': 1.6037946939468384} 11/07/2021 03:38:42 - INFO - __main__ - Step 45352: {'lr': 0.00040141312812138453, 'samples': 8707584, 'steps': 45351, 'loss/train': 1.6483333110809326} 11/07/2021 03:38:42 - INFO - __main__ - Step 45353: {'lr': 0.0004014089053555905, 'samples': 8707776, 'steps': 45352, 'loss/train': 1.1922721862792969} 11/07/2021 03:38:44 - INFO - __main__ - Step 45354: {'lr': 0.000401404682521574, 'samples': 8707968, 'steps': 45353, 'loss/train': 1.1696412563323975} 11/07/2021 03:38:44 - INFO - __main__ - Step 45355: {'lr': 0.0004014004596193368, 'samples': 8708160, 'steps': 45354, 'loss/train': 1.2211811542510986} 11/07/2021 03:38:44 - INFO - __main__ - Step 45356: {'lr': 0.000401396236648881, 'samples': 8708352, 'steps': 45355, 'loss/train': 1.2534793615341187} 11/07/2021 03:38:45 - INFO - __main__ - Step 45357: {'lr': 0.00040139201361020827, 'samples': 8708544, 'steps': 45356, 'loss/train': 1.3304888010025024} 11/07/2021 03:38:45 - INFO - __main__ - Step 45358: {'lr': 0.0004013877905033208, 'samples': 8708736, 'steps': 45357, 'loss/train': 0.8877719044685364} 11/07/2021 03:38:46 - INFO - __main__ - Step 45359: {'lr': 0.0004013835673282202, 'samples': 8708928, 'steps': 45358, 'loss/train': 1.0555520057678223} 11/07/2021 03:38:46 - INFO - __main__ - Step 45360: {'lr': 0.00040137934408490856, 'samples': 8709120, 'steps': 45359, 'loss/train': 1.185915231704712} 11/07/2021 03:38:47 - INFO - __main__ - Step 45361: {'lr': 0.0004013751207733877, 'samples': 8709312, 'steps': 45360, 'loss/train': 1.7336360216140747} 11/07/2021 03:38:47 - INFO - __main__ - Step 45362: {'lr': 0.0004013708973936595, 'samples': 8709504, 'steps': 45361, 'loss/train': 1.5897088050842285} 11/07/2021 03:38:47 - INFO - __main__ - Step 45363: {'lr': 0.000401366673945726, 'samples': 8709696, 'steps': 45362, 'loss/train': 1.6161272525787354} 11/07/2021 03:38:49 - INFO - __main__ - Step 45364: {'lr': 0.00040136245042958897, 'samples': 8709888, 'steps': 45363, 'loss/train': 1.0643367767333984} 11/07/2021 03:38:49 - INFO - __main__ - Step 45365: {'lr': 0.00040135822684525036, 'samples': 8710080, 'steps': 45364, 'loss/train': 1.150641679763794} 11/07/2021 03:38:50 - INFO - __main__ - Step 45366: {'lr': 0.0004013540031927121, 'samples': 8710272, 'steps': 45365, 'loss/train': 1.0963282585144043} 11/07/2021 03:38:50 - INFO - __main__ - Step 45367: {'lr': 0.000401349779471976, 'samples': 8710464, 'steps': 45366, 'loss/train': 1.2358239889144897} 11/07/2021 03:38:50 - INFO - __main__ - Step 45368: {'lr': 0.000401345555683044, 'samples': 8710656, 'steps': 45367, 'loss/train': 1.6147366762161255} 11/07/2021 03:38:51 - INFO - __main__ - Step 45369: {'lr': 0.00040134133182591813, 'samples': 8710848, 'steps': 45368, 'loss/train': 1.0666077136993408} 11/07/2021 03:38:52 - INFO - __main__ - Step 45370: {'lr': 0.0004013371079006001, 'samples': 8711040, 'steps': 45369, 'loss/train': 0.46323254704475403} 11/07/2021 03:38:52 - INFO - __main__ - Step 45371: {'lr': 0.000401332883907092, 'samples': 8711232, 'steps': 45370, 'loss/train': 1.4854381084442139} 11/07/2021 03:38:52 - INFO - __main__ - Step 45372: {'lr': 0.00040132865984539556, 'samples': 8711424, 'steps': 45371, 'loss/train': 1.0855177640914917} 11/07/2021 03:38:53 - INFO - __main__ - Step 45373: {'lr': 0.0004013244357155128, 'samples': 8711616, 'steps': 45372, 'loss/train': 1.092380404472351} 11/07/2021 03:38:53 - INFO - __main__ - Step 45374: {'lr': 0.0004013202115174456, 'samples': 8711808, 'steps': 45373, 'loss/train': 0.912677526473999} 11/07/2021 03:38:54 - INFO - __main__ - Step 45375: {'lr': 0.0004013159872511958, 'samples': 8712000, 'steps': 45374, 'loss/train': 1.3166635036468506} 11/07/2021 03:38:55 - INFO - __main__ - Step 45376: {'lr': 0.0004013117629167653, 'samples': 8712192, 'steps': 45375, 'loss/train': 1.071495771408081} 11/07/2021 03:38:55 - INFO - __main__ - Step 45377: {'lr': 0.0004013075385141561, 'samples': 8712384, 'steps': 45376, 'loss/train': 1.6424611806869507} 11/07/2021 03:38:55 - INFO - __main__ - Step 45378: {'lr': 0.0004013033140433702, 'samples': 8712576, 'steps': 45377, 'loss/train': 0.13290008902549744} 11/07/2021 03:38:56 - INFO - __main__ - Step 45379: {'lr': 0.0004012990895044092, 'samples': 8712768, 'steps': 45378, 'loss/train': 1.250949740409851} 11/07/2021 03:38:57 - INFO - __main__ - Step 45380: {'lr': 0.0004012948648972752, 'samples': 8712960, 'steps': 45379, 'loss/train': 0.9296746253967285} 11/07/2021 03:38:57 - INFO - __main__ - Step 45381: {'lr': 0.00040129064022197006, 'samples': 8713152, 'steps': 45380, 'loss/train': 1.6221343278884888} 11/07/2021 03:38:58 - INFO - __main__ - Step 45382: {'lr': 0.0004012864154784957, 'samples': 8713344, 'steps': 45381, 'loss/train': 1.529241681098938} 11/07/2021 03:38:58 - INFO - __main__ - Step 45383: {'lr': 0.00040128219066685403, 'samples': 8713536, 'steps': 45382, 'loss/train': 1.508192539215088} 11/07/2021 03:38:58 - INFO - __main__ - Step 45384: {'lr': 0.00040127796578704703, 'samples': 8713728, 'steps': 45383, 'loss/train': 1.7660785913467407} 11/07/2021 03:38:59 - INFO - __main__ - Step 45385: {'lr': 0.00040127374083907634, 'samples': 8713920, 'steps': 45384, 'loss/train': 1.0196280479431152} 11/07/2021 03:39:00 - INFO - __main__ - Step 45386: {'lr': 0.00040126951582294414, 'samples': 8714112, 'steps': 45385, 'loss/train': 0.7238351106643677} 11/07/2021 03:39:00 - INFO - __main__ - Step 45387: {'lr': 0.00040126529073865216, 'samples': 8714304, 'steps': 45386, 'loss/train': 3.4579100608825684} 11/07/2021 03:39:00 - INFO - __main__ - Step 45388: {'lr': 0.00040126106558620246, 'samples': 8714496, 'steps': 45387, 'loss/train': 0.8059271574020386} 11/07/2021 03:39:01 - INFO - __main__ - Step 45389: {'lr': 0.0004012568403655967, 'samples': 8714688, 'steps': 45388, 'loss/train': 1.2644438743591309} 11/07/2021 03:39:01 - INFO - __main__ - Step 45390: {'lr': 0.00040125261507683706, 'samples': 8714880, 'steps': 45389, 'loss/train': 0.9288297891616821} 11/07/2021 03:39:02 - INFO - __main__ - Step 45391: {'lr': 0.0004012483897199254, 'samples': 8715072, 'steps': 45390, 'loss/train': 1.1532604694366455} 11/07/2021 03:39:03 - INFO - __main__ - Step 45392: {'lr': 0.0004012441642948635, 'samples': 8715264, 'steps': 45391, 'loss/train': 1.7337646484375} 11/07/2021 03:39:03 - INFO - __main__ - Step 45393: {'lr': 0.0004012399388016533, 'samples': 8715456, 'steps': 45392, 'loss/train': 1.607285499572754} 11/07/2021 03:39:03 - INFO - __main__ - Step 45394: {'lr': 0.00040123571324029663, 'samples': 8715648, 'steps': 45393, 'loss/train': 0.2502501606941223} 11/07/2021 03:39:04 - INFO - __main__ - Step 45395: {'lr': 0.0004012314876107956, 'samples': 8715840, 'steps': 45394, 'loss/train': 1.7108367681503296} 11/07/2021 03:39:04 - INFO - __main__ - Step 45396: {'lr': 0.00040122726191315196, 'samples': 8716032, 'steps': 45395, 'loss/train': 1.3642891645431519} 11/07/2021 03:39:05 - INFO - __main__ - Step 45397: {'lr': 0.00040122303614736763, 'samples': 8716224, 'steps': 45396, 'loss/train': 1.5683138370513916} 11/07/2021 03:39:05 - INFO - __main__ - Step 45398: {'lr': 0.00040121881031344455, 'samples': 8716416, 'steps': 45397, 'loss/train': 1.4296704530715942} 11/07/2021 03:39:06 - INFO - __main__ - Step 45399: {'lr': 0.00040121458441138457, 'samples': 8716608, 'steps': 45398, 'loss/train': 1.3734186887741089} 11/07/2021 03:39:06 - INFO - __main__ - Step 45400: {'lr': 0.0004012103584411897, 'samples': 8716800, 'steps': 45399, 'loss/train': 1.0634467601776123} 11/07/2021 03:39:06 - INFO - __main__ - Step 45401: {'lr': 0.0004012061324028617, 'samples': 8716992, 'steps': 45400, 'loss/train': 1.677510380744934} 11/07/2021 03:39:07 - INFO - __main__ - Step 45402: {'lr': 0.0004012019062964026, 'samples': 8717184, 'steps': 45401, 'loss/train': 1.3154534101486206} 11/07/2021 03:39:08 - INFO - __main__ - Step 45403: {'lr': 0.00040119768012181423, 'samples': 8717376, 'steps': 45402, 'loss/train': 1.3625843524932861} 11/07/2021 03:39:08 - INFO - __main__ - Step 45404: {'lr': 0.0004011934538790986, 'samples': 8717568, 'steps': 45403, 'loss/train': 1.2091530561447144} 11/07/2021 03:39:08 - INFO - __main__ - Step 45405: {'lr': 0.00040118922756825735, 'samples': 8717760, 'steps': 45404, 'loss/train': 1.4153841733932495} 11/07/2021 03:39:09 - INFO - __main__ - Step 45406: {'lr': 0.00040118500118929267, 'samples': 8717952, 'steps': 45405, 'loss/train': 1.504854679107666} 11/07/2021 03:39:10 - INFO - __main__ - Step 45407: {'lr': 0.00040118077474220643, 'samples': 8718144, 'steps': 45406, 'loss/train': 1.545736312866211} 11/07/2021 03:39:10 - INFO - __main__ - Step 45408: {'lr': 0.00040117654822700047, 'samples': 8718336, 'steps': 45407, 'loss/train': 0.8987617492675781} 11/07/2021 03:39:11 - INFO - __main__ - Step 45409: {'lr': 0.0004011723216436766, 'samples': 8718528, 'steps': 45408, 'loss/train': 1.142067313194275} 11/07/2021 03:39:11 - INFO - __main__ - Step 45410: {'lr': 0.0004011680949922368, 'samples': 8718720, 'steps': 45409, 'loss/train': 1.677101492881775} 11/07/2021 03:39:11 - INFO - __main__ - Step 45411: {'lr': 0.00040116386827268304, 'samples': 8718912, 'steps': 45410, 'loss/train': 1.656983733177185} 11/07/2021 03:39:13 - INFO - __main__ - Step 45412: {'lr': 0.0004011596414850172, 'samples': 8719104, 'steps': 45411, 'loss/train': 1.2456414699554443} 11/07/2021 03:39:14 - INFO - __main__ - Step 45413: {'lr': 0.0004011554146292411, 'samples': 8719296, 'steps': 45412, 'loss/train': 1.2986106872558594} 11/07/2021 03:39:14 - INFO - __main__ - Step 45414: {'lr': 0.0004011511877053567, 'samples': 8719488, 'steps': 45413, 'loss/train': 1.574819803237915} 11/07/2021 03:39:14 - INFO - __main__ - Step 45415: {'lr': 0.0004011469607133659, 'samples': 8719680, 'steps': 45414, 'loss/train': 0.9463472962379456} 11/07/2021 03:39:15 - INFO - __main__ - Step 45416: {'lr': 0.0004011427336532707, 'samples': 8719872, 'steps': 45415, 'loss/train': 2.732780694961548} 11/07/2021 03:39:15 - INFO - __main__ - Step 45417: {'lr': 0.00040113850652507286, 'samples': 8720064, 'steps': 45416, 'loss/train': 2.6750824451446533} 11/07/2021 03:39:15 - INFO - __main__ - Step 45418: {'lr': 0.00040113427932877434, 'samples': 8720256, 'steps': 45417, 'loss/train': 2.024507522583008} 11/07/2021 03:39:16 - INFO - __main__ - Step 45419: {'lr': 0.00040113005206437704, 'samples': 8720448, 'steps': 45418, 'loss/train': 1.1875247955322266} 11/07/2021 03:39:17 - INFO - __main__ - Step 45420: {'lr': 0.00040112582473188284, 'samples': 8720640, 'steps': 45419, 'loss/train': 1.6446876525878906} 11/07/2021 03:39:17 - INFO - __main__ - Step 45421: {'lr': 0.00040112159733129375, 'samples': 8720832, 'steps': 45420, 'loss/train': 1.5601922273635864} 11/07/2021 03:39:17 - INFO - __main__ - Step 45422: {'lr': 0.00040111736986261155, 'samples': 8721024, 'steps': 45421, 'loss/train': 1.513543963432312} 11/07/2021 03:39:18 - INFO - __main__ - Step 45423: {'lr': 0.00040111314232583816, 'samples': 8721216, 'steps': 45422, 'loss/train': 1.7309056520462036} 11/07/2021 03:39:19 - INFO - __main__ - Step 45424: {'lr': 0.0004011089147209756, 'samples': 8721408, 'steps': 45423, 'loss/train': 1.6408305168151855} 11/07/2021 03:39:19 - INFO - __main__ - Step 45425: {'lr': 0.00040110468704802573, 'samples': 8721600, 'steps': 45424, 'loss/train': 1.5653102397918701} 11/07/2021 03:39:20 - INFO - __main__ - Step 45426: {'lr': 0.00040110045930699033, 'samples': 8721792, 'steps': 45425, 'loss/train': 1.1400521993637085} 11/07/2021 03:39:20 - INFO - __main__ - Step 45427: {'lr': 0.00040109623149787137, 'samples': 8721984, 'steps': 45426, 'loss/train': 1.363743782043457} 11/07/2021 03:39:20 - INFO - __main__ - Step 45428: {'lr': 0.0004010920036206709, 'samples': 8722176, 'steps': 45427, 'loss/train': 1.199953317642212} 11/07/2021 03:39:22 - INFO - __main__ - Step 45429: {'lr': 0.00040108777567539057, 'samples': 8722368, 'steps': 45428, 'loss/train': 1.6662077903747559} 11/07/2021 03:39:22 - INFO - __main__ - Step 45430: {'lr': 0.00040108354766203247, 'samples': 8722560, 'steps': 45429, 'loss/train': 1.6846674680709839} 11/07/2021 03:39:22 - INFO - __main__ - Step 45431: {'lr': 0.0004010793195805985, 'samples': 8722752, 'steps': 45430, 'loss/train': 1.2505202293395996} 11/07/2021 03:39:23 - INFO - __main__ - Step 45432: {'lr': 0.0004010750914310905, 'samples': 8722944, 'steps': 45431, 'loss/train': 1.168033480644226} 11/07/2021 03:39:23 - INFO - __main__ - Step 45433: {'lr': 0.0004010708632135104, 'samples': 8723136, 'steps': 45432, 'loss/train': 1.0324722528457642} 11/07/2021 03:39:23 - INFO - __main__ - Step 45434: {'lr': 0.00040106663492786007, 'samples': 8723328, 'steps': 45433, 'loss/train': 5.603840351104736} 11/07/2021 03:39:24 - INFO - __main__ - Step 45435: {'lr': 0.00040106240657414137, 'samples': 8723520, 'steps': 45434, 'loss/train': 5.6717705726623535} 11/07/2021 03:39:25 - INFO - __main__ - Step 45436: {'lr': 0.0004010581781523564, 'samples': 8723712, 'steps': 45435, 'loss/train': 5.517861366271973} 11/07/2021 03:39:25 - INFO - __main__ - Step 45437: {'lr': 0.0004010539496625069, 'samples': 8723904, 'steps': 45436, 'loss/train': 1.426464557647705} 11/07/2021 03:39:26 - INFO - __main__ - Step 45438: {'lr': 0.00040104972110459493, 'samples': 8724096, 'steps': 45437, 'loss/train': 1.9749648571014404} 11/07/2021 03:39:26 - INFO - __main__ - Step 45439: {'lr': 0.00040104549247862217, 'samples': 8724288, 'steps': 45438, 'loss/train': 1.8147975206375122} 11/07/2021 03:39:26 - INFO - __main__ - Step 45440: {'lr': 0.0004010412637845906, 'samples': 8724480, 'steps': 45439, 'loss/train': 1.4899234771728516} 11/07/2021 03:39:27 - INFO - __main__ - Step 45441: {'lr': 0.00040103703502250223, 'samples': 8724672, 'steps': 45440, 'loss/train': 1.7324793338775635} 11/07/2021 03:39:28 - INFO - __main__ - Step 45442: {'lr': 0.0004010328061923589, 'samples': 8724864, 'steps': 45441, 'loss/train': 1.6355983018875122} 11/07/2021 03:39:28 - INFO - __main__ - Step 45443: {'lr': 0.00040102857729416256, 'samples': 8725056, 'steps': 45442, 'loss/train': 1.6216485500335693} 11/07/2021 03:39:28 - INFO - __main__ - Step 45444: {'lr': 0.000401024348327915, 'samples': 8725248, 'steps': 45443, 'loss/train': 1.4130525588989258} 11/07/2021 03:39:29 - INFO - __main__ - Step 45445: {'lr': 0.00040102011929361826, 'samples': 8725440, 'steps': 45444, 'loss/train': 1.080099105834961} 11/07/2021 03:39:30 - INFO - __main__ - Step 45446: {'lr': 0.00040101589019127416, 'samples': 8725632, 'steps': 45445, 'loss/train': 1.3785549402236938} 11/07/2021 03:39:30 - INFO - __main__ - Step 45447: {'lr': 0.0004010116610208846, 'samples': 8725824, 'steps': 45446, 'loss/train': 1.7756900787353516} 11/07/2021 03:39:30 - INFO - __main__ - Step 45448: {'lr': 0.0004010074317824516, 'samples': 8726016, 'steps': 45447, 'loss/train': 1.0014840364456177} 11/07/2021 03:39:31 - INFO - __main__ - Step 45449: {'lr': 0.0004010032024759769, 'samples': 8726208, 'steps': 45448, 'loss/train': 1.5262198448181152} 11/07/2021 03:39:31 - INFO - __main__ - Step 45450: {'lr': 0.0004009989731014625, 'samples': 8726400, 'steps': 45449, 'loss/train': 1.2290147542953491} 11/07/2021 03:39:33 - INFO - __main__ - Step 45451: {'lr': 0.00040099474365891033, 'samples': 8726592, 'steps': 45450, 'loss/train': 1.4418271780014038} 11/07/2021 03:39:33 - INFO - __main__ - Step 45452: {'lr': 0.0004009905141483222, 'samples': 8726784, 'steps': 45451, 'loss/train': 1.9713292121887207} 11/07/2021 03:39:33 - INFO - __main__ - Step 45453: {'lr': 0.0004009862845697001, 'samples': 8726976, 'steps': 45452, 'loss/train': 1.6822173595428467} 11/07/2021 03:39:34 - INFO - __main__ - Step 45454: {'lr': 0.00040098205492304596, 'samples': 8727168, 'steps': 45453, 'loss/train': 1.695788860321045} 11/07/2021 03:39:34 - INFO - __main__ - Step 45455: {'lr': 0.00040097782520836156, 'samples': 8727360, 'steps': 45454, 'loss/train': 1.7513108253479004} 11/07/2021 03:39:34 - INFO - __main__ - Step 45456: {'lr': 0.00040097359542564894, 'samples': 8727552, 'steps': 45455, 'loss/train': 1.728674292564392} 11/07/2021 03:39:35 - INFO - __main__ - Step 45457: {'lr': 0.0004009693655749099, 'samples': 8727744, 'steps': 45456, 'loss/train': 1.1296839714050293} 11/07/2021 03:39:36 - INFO - __main__ - Step 45458: {'lr': 0.00040096513565614645, 'samples': 8727936, 'steps': 45457, 'loss/train': 1.2750709056854248} 11/07/2021 03:39:36 - INFO - __main__ - Step 45459: {'lr': 0.00040096090566936037, 'samples': 8728128, 'steps': 45458, 'loss/train': 0.9658198952674866} 11/07/2021 03:39:37 - INFO - __main__ - Step 45460: {'lr': 0.00040095667561455367, 'samples': 8728320, 'steps': 45459, 'loss/train': 1.7042561769485474} 11/07/2021 03:39:37 - INFO - __main__ - Step 45461: {'lr': 0.00040095244549172824, 'samples': 8728512, 'steps': 45460, 'loss/train': 1.1562546491622925} 11/07/2021 03:39:37 - INFO - __main__ - Step 45462: {'lr': 0.00040094821530088594, 'samples': 8728704, 'steps': 45461, 'loss/train': 1.7081876993179321} 11/07/2021 03:39:38 - INFO - __main__ - Step 45463: {'lr': 0.0004009439850420287, 'samples': 8728896, 'steps': 45462, 'loss/train': 1.6255519390106201} 11/07/2021 03:39:39 - INFO - __main__ - Step 45464: {'lr': 0.00040093975471515843, 'samples': 8729088, 'steps': 45463, 'loss/train': 1.6070562601089478} 11/07/2021 03:39:39 - INFO - __main__ - Step 45465: {'lr': 0.00040093552432027713, 'samples': 8729280, 'steps': 45464, 'loss/train': 1.1172105073928833} 11/07/2021 03:39:39 - INFO - __main__ - Step 45466: {'lr': 0.0004009312938573865, 'samples': 8729472, 'steps': 45465, 'loss/train': 1.5569968223571777} 11/07/2021 03:39:40 - INFO - __main__ - Step 45467: {'lr': 0.00040092706332648856, 'samples': 8729664, 'steps': 45466, 'loss/train': 1.3887341022491455} 11/07/2021 03:39:41 - INFO - __main__ - Step 45468: {'lr': 0.00040092283272758525, 'samples': 8729856, 'steps': 45467, 'loss/train': 1.8437894582748413} 11/07/2021 03:39:41 - INFO - __main__ - Step 45469: {'lr': 0.00040091860206067844, 'samples': 8730048, 'steps': 45468, 'loss/train': 1.4862357378005981} 11/07/2021 03:39:41 - INFO - __main__ - Step 45470: {'lr': 0.00040091437132577004, 'samples': 8730240, 'steps': 45469, 'loss/train': 1.660348653793335} 11/07/2021 03:39:42 - INFO - __main__ - Step 45471: {'lr': 0.0004009101405228619, 'samples': 8730432, 'steps': 45470, 'loss/train': 1.6826940774917603} 11/07/2021 03:39:42 - INFO - __main__ - Step 45472: {'lr': 0.00040090590965195604, 'samples': 8730624, 'steps': 45471, 'loss/train': 0.5386844873428345} 11/07/2021 03:39:42 - INFO - __main__ - Step 45473: {'lr': 0.0004009016787130543, 'samples': 8730816, 'steps': 45472, 'loss/train': 6.135570049285889} 11/07/2021 03:39:43 - INFO - __main__ - Step 45474: {'lr': 0.0004008974477061586, 'samples': 8731008, 'steps': 45473, 'loss/train': 0.7260832786560059} 11/07/2021 03:39:44 - INFO - __main__ - Step 45475: {'lr': 0.0004008932166312708, 'samples': 8731200, 'steps': 45474, 'loss/train': 1.4672410488128662} 11/07/2021 03:39:44 - INFO - __main__ - Step 45476: {'lr': 0.0004008889854883929, 'samples': 8731392, 'steps': 45475, 'loss/train': 0.9254458546638489} 11/07/2021 03:39:45 - INFO - __main__ - Step 45477: {'lr': 0.0004008847542775267, 'samples': 8731584, 'steps': 45476, 'loss/train': 1.8162497282028198} 11/07/2021 03:39:45 - INFO - __main__ - Step 45478: {'lr': 0.00040088052299867415, 'samples': 8731776, 'steps': 45477, 'loss/train': 1.2034556865692139} 11/07/2021 03:39:46 - INFO - __main__ - Step 45479: {'lr': 0.0004008762916518372, 'samples': 8731968, 'steps': 45478, 'loss/train': 1.6396222114562988} 11/07/2021 03:39:46 - INFO - __main__ - Step 45480: {'lr': 0.0004008720602370177, 'samples': 8732160, 'steps': 45479, 'loss/train': 1.153206467628479} 11/07/2021 03:39:47 - INFO - __main__ - Step 45481: {'lr': 0.00040086782875421755, 'samples': 8732352, 'steps': 45480, 'loss/train': 1.8851182460784912} 11/07/2021 03:39:47 - INFO - __main__ - Step 45482: {'lr': 0.0004008635972034388, 'samples': 8732544, 'steps': 45481, 'loss/train': 1.4990321397781372} 11/07/2021 03:39:47 - INFO - __main__ - Step 45483: {'lr': 0.0004008593655846831, 'samples': 8732736, 'steps': 45482, 'loss/train': 1.3766307830810547} 11/07/2021 03:39:48 - INFO - __main__ - Step 45484: {'lr': 0.0004008551338979526, 'samples': 8732928, 'steps': 45483, 'loss/train': 0.743320643901825} 11/07/2021 03:39:49 - INFO - __main__ - Step 45485: {'lr': 0.00040085090214324906, 'samples': 8733120, 'steps': 45484, 'loss/train': 1.4501512050628662} 11/07/2021 03:39:49 - INFO - __main__ - Step 45486: {'lr': 0.00040084667032057444, 'samples': 8733312, 'steps': 45485, 'loss/train': 1.6061315536499023} 11/07/2021 03:39:49 - INFO - __main__ - Step 45487: {'lr': 0.00040084243842993065, 'samples': 8733504, 'steps': 45486, 'loss/train': 1.3426865339279175} 11/07/2021 03:39:50 - INFO - __main__ - Step 45488: {'lr': 0.0004008382064713195, 'samples': 8733696, 'steps': 45487, 'loss/train': 1.6358917951583862} 11/07/2021 03:39:51 - INFO - __main__ - Step 45489: {'lr': 0.0004008339744447431, 'samples': 8733888, 'steps': 45488, 'loss/train': 1.4708629846572876} 11/07/2021 03:39:51 - INFO - __main__ - Step 45490: {'lr': 0.0004008297423502032, 'samples': 8734080, 'steps': 45489, 'loss/train': 1.5823044776916504} 11/07/2021 03:39:52 - INFO - __main__ - Step 45491: {'lr': 0.0004008255101877017, 'samples': 8734272, 'steps': 45490, 'loss/train': 1.4473228454589844} 11/07/2021 03:39:52 - INFO - __main__ - Step 45492: {'lr': 0.00040082127795724066, 'samples': 8734464, 'steps': 45491, 'loss/train': 1.4682440757751465} 11/07/2021 03:39:52 - INFO - __main__ - Step 45493: {'lr': 0.00040081704565882176, 'samples': 8734656, 'steps': 45492, 'loss/train': 1.3882434368133545} 11/07/2021 03:39:53 - INFO - __main__ - Step 45494: {'lr': 0.00040081281329244707, 'samples': 8734848, 'steps': 45493, 'loss/train': 6.106868743896484} 11/07/2021 03:39:54 - INFO - __main__ - Step 45495: {'lr': 0.00040080858085811844, 'samples': 8735040, 'steps': 45494, 'loss/train': 0.2559010982513428} 11/07/2021 03:39:54 - INFO - __main__ - Step 45496: {'lr': 0.00040080434835583777, 'samples': 8735232, 'steps': 45495, 'loss/train': 0.9081894159317017} 11/07/2021 03:39:54 - INFO - __main__ - Step 45497: {'lr': 0.00040080011578560705, 'samples': 8735424, 'steps': 45496, 'loss/train': 1.5792264938354492} 11/07/2021 03:39:55 - INFO - __main__ - Step 45498: {'lr': 0.0004007958831474281, 'samples': 8735616, 'steps': 45497, 'loss/train': 1.1584725379943848} 11/07/2021 03:39:55 - INFO - __main__ - Step 45499: {'lr': 0.0004007916504413029, 'samples': 8735808, 'steps': 45498, 'loss/train': 0.7571787238121033} 11/07/2021 03:39:56 - INFO - __main__ - Step 45500: {'lr': 0.00040078741766723326, 'samples': 8736000, 'steps': 45499, 'loss/train': 1.438258171081543} 11/07/2021 03:39:56 - INFO - __main__ - Step 45501: {'lr': 0.00040078318482522114, 'samples': 8736192, 'steps': 45500, 'loss/train': 1.6170825958251953} 11/07/2021 03:39:57 - INFO - __main__ - Step 45502: {'lr': 0.0004007789519152684, 'samples': 8736384, 'steps': 45501, 'loss/train': 1.430743932723999} 11/07/2021 03:39:57 - INFO - __main__ - Step 45503: {'lr': 0.00040077471893737703, 'samples': 8736576, 'steps': 45502, 'loss/train': 1.7867926359176636} 11/07/2021 03:39:58 - INFO - __main__ - Step 45504: {'lr': 0.0004007704858915489, 'samples': 8736768, 'steps': 45503, 'loss/train': 1.342382788658142} 11/07/2021 03:39:59 - INFO - __main__ - Step 45505: {'lr': 0.00040076625277778594, 'samples': 8736960, 'steps': 45504, 'loss/train': 1.2320177555084229} 11/07/2021 03:39:59 - INFO - __main__ - Step 45506: {'lr': 0.00040076201959609003, 'samples': 8737152, 'steps': 45505, 'loss/train': 1.6842968463897705} 11/07/2021 03:39:59 - INFO - __main__ - Step 45507: {'lr': 0.00040075778634646305, 'samples': 8737344, 'steps': 45506, 'loss/train': 1.3534555435180664} 11/07/2021 03:40:00 - INFO - __main__ - Step 45508: {'lr': 0.0004007535530289069, 'samples': 8737536, 'steps': 45507, 'loss/train': 1.2427611351013184} 11/07/2021 03:40:00 - INFO - __main__ - Step 45509: {'lr': 0.0004007493196434236, 'samples': 8737728, 'steps': 45508, 'loss/train': 1.5556435585021973} 11/07/2021 03:40:01 - INFO - __main__ - Step 45510: {'lr': 0.0004007450861900149, 'samples': 8737920, 'steps': 45509, 'loss/train': 1.7099498510360718} 11/07/2021 03:40:02 - INFO - __main__ - Step 45511: {'lr': 0.00040074085266868285, 'samples': 8738112, 'steps': 45510, 'loss/train': 1.4670841693878174} 11/07/2021 03:40:02 - INFO - __main__ - Step 45512: {'lr': 0.0004007366190794294, 'samples': 8738304, 'steps': 45511, 'loss/train': 1.3476924896240234} 11/07/2021 03:40:02 - INFO - __main__ - Step 45513: {'lr': 0.00040073238542225623, 'samples': 8738496, 'steps': 45512, 'loss/train': 1.453534483909607} 11/07/2021 03:40:03 - INFO - __main__ - Step 45514: {'lr': 0.00040072815169716534, 'samples': 8738688, 'steps': 45513, 'loss/train': 1.599866509437561} 11/07/2021 03:40:03 - INFO - __main__ - Step 45515: {'lr': 0.00040072391790415873, 'samples': 8738880, 'steps': 45514, 'loss/train': 1.2118515968322754} 11/07/2021 03:40:04 - INFO - __main__ - Step 45516: {'lr': 0.00040071968404323824, 'samples': 8739072, 'steps': 45515, 'loss/train': 1.3849482536315918} 11/07/2021 03:40:04 - INFO - __main__ - Step 45517: {'lr': 0.0004007154501144058, 'samples': 8739264, 'steps': 45516, 'loss/train': 1.4673330783843994} 11/07/2021 03:40:05 - INFO - __main__ - Step 45518: {'lr': 0.00040071121611766325, 'samples': 8739456, 'steps': 45517, 'loss/train': 1.6368898153305054} 11/07/2021 03:40:05 - INFO - __main__ - Step 45519: {'lr': 0.00040070698205301266, 'samples': 8739648, 'steps': 45518, 'loss/train': 1.6441377401351929} 11/07/2021 03:40:05 - INFO - __main__ - Step 45520: {'lr': 0.0004007027479204557, 'samples': 8739840, 'steps': 45519, 'loss/train': 1.7209645509719849} 11/07/2021 03:40:06 - INFO - __main__ - Step 45521: {'lr': 0.0004006985137199945, 'samples': 8740032, 'steps': 45520, 'loss/train': 1.7317477464675903} 11/07/2021 03:40:07 - INFO - __main__ - Step 45522: {'lr': 0.00040069427945163083, 'samples': 8740224, 'steps': 45521, 'loss/train': 1.6250065565109253} 11/07/2021 03:40:07 - INFO - __main__ - Step 45523: {'lr': 0.00040069004511536667, 'samples': 8740416, 'steps': 45522, 'loss/train': 1.2221133708953857} 11/07/2021 03:40:07 - INFO - __main__ - Step 45524: {'lr': 0.00040068581071120386, 'samples': 8740608, 'steps': 45523, 'loss/train': 1.7659928798675537} 11/07/2021 03:40:08 - INFO - __main__ - Step 45525: {'lr': 0.00040068157623914435, 'samples': 8740800, 'steps': 45524, 'loss/train': 1.4572179317474365} 11/07/2021 03:40:09 - INFO - __main__ - Step 45526: {'lr': 0.0004006773416991901, 'samples': 8740992, 'steps': 45525, 'loss/train': 0.5819351077079773} 11/07/2021 03:40:09 - INFO - __main__ - Step 45527: {'lr': 0.00040067310709134295, 'samples': 8741184, 'steps': 45526, 'loss/train': 1.801347017288208} 11/07/2021 03:40:09 - INFO - __main__ - Step 45528: {'lr': 0.0004006688724156048, 'samples': 8741376, 'steps': 45527, 'loss/train': 1.456350326538086} 11/07/2021 03:40:10 - INFO - __main__ - Step 45529: {'lr': 0.00040066463767197757, 'samples': 8741568, 'steps': 45528, 'loss/train': 1.5421572923660278} 11/07/2021 03:40:10 - INFO - __main__ - Step 45530: {'lr': 0.00040066040286046325, 'samples': 8741760, 'steps': 45529, 'loss/train': 1.0882627964019775} 11/07/2021 03:40:11 - INFO - __main__ - Step 45531: {'lr': 0.0004006561679810636, 'samples': 8741952, 'steps': 45530, 'loss/train': 1.636697769165039} 11/07/2021 03:40:12 - INFO - __main__ - Step 45532: {'lr': 0.0004006519330337807, 'samples': 8742144, 'steps': 45531, 'loss/train': 1.5281553268432617} 11/07/2021 03:40:12 - INFO - __main__ - Step 45533: {'lr': 0.0004006476980186163, 'samples': 8742336, 'steps': 45532, 'loss/train': 0.9776030778884888} 11/07/2021 03:40:12 - INFO - __main__ - Step 45534: {'lr': 0.0004006434629355723, 'samples': 8742528, 'steps': 45533, 'loss/train': 1.47916841506958} 11/07/2021 03:40:13 - INFO - __main__ - Step 45535: {'lr': 0.0004006392277846508, 'samples': 8742720, 'steps': 45534, 'loss/train': 1.5749417543411255} 11/07/2021 03:40:14 - INFO - __main__ - Step 45536: {'lr': 0.00040063499256585354, 'samples': 8742912, 'steps': 45535, 'loss/train': 1.0047314167022705} 11/07/2021 03:40:14 - INFO - __main__ - Step 45537: {'lr': 0.00040063075727918247, 'samples': 8743104, 'steps': 45536, 'loss/train': 1.3174585103988647} 11/07/2021 03:40:14 - INFO - __main__ - Step 45538: {'lr': 0.0004006265219246395, 'samples': 8743296, 'steps': 45537, 'loss/train': 1.3374698162078857} 11/07/2021 03:40:15 - INFO - __main__ - Step 45539: {'lr': 0.00040062228650222657, 'samples': 8743488, 'steps': 45538, 'loss/train': 1.4565569162368774} 11/07/2021 03:40:15 - INFO - __main__ - Step 45540: {'lr': 0.00040061805101194553, 'samples': 8743680, 'steps': 45539, 'loss/train': 1.4614648818969727} 11/07/2021 03:40:16 - INFO - __main__ - Step 45541: {'lr': 0.00040061381545379837, 'samples': 8743872, 'steps': 45540, 'loss/train': 1.1748137474060059} 11/07/2021 03:40:17 - INFO - __main__ - Step 45542: {'lr': 0.00040060957982778687, 'samples': 8744064, 'steps': 45541, 'loss/train': 1.3833144903182983} 11/07/2021 03:40:17 - INFO - __main__ - Step 45543: {'lr': 0.0004006053441339131, 'samples': 8744256, 'steps': 45542, 'loss/train': 1.6318830251693726} 11/07/2021 03:40:17 - INFO - __main__ - Step 45544: {'lr': 0.00040060110837217885, 'samples': 8744448, 'steps': 45543, 'loss/train': 1.6543395519256592} 11/07/2021 03:40:18 - INFO - __main__ - Step 45545: {'lr': 0.000400596872542586, 'samples': 8744640, 'steps': 45544, 'loss/train': 1.4727685451507568} 11/07/2021 03:40:18 - INFO - __main__ - Step 45546: {'lr': 0.0004005926366451367, 'samples': 8744832, 'steps': 45545, 'loss/train': 1.6064571142196655} 11/07/2021 03:40:19 - INFO - __main__ - Step 45547: {'lr': 0.0004005884006798325, 'samples': 8745024, 'steps': 45546, 'loss/train': 1.4628363847732544} 11/07/2021 03:40:19 - INFO - __main__ - Step 45548: {'lr': 0.0004005841646466756, 'samples': 8745216, 'steps': 45547, 'loss/train': 1.2657325267791748} 11/07/2021 03:40:20 - INFO - __main__ - Step 45549: {'lr': 0.00040057992854566774, 'samples': 8745408, 'steps': 45548, 'loss/train': 1.6489057540893555} 11/07/2021 03:40:20 - INFO - __main__ - Step 45550: {'lr': 0.0004005756923768109, 'samples': 8745600, 'steps': 45549, 'loss/train': 1.4323945045471191} 11/07/2021 03:40:21 - INFO - __main__ - Step 45551: {'lr': 0.0004005714561401069, 'samples': 8745792, 'steps': 45550, 'loss/train': 1.2886767387390137} 11/07/2021 03:40:21 - INFO - __main__ - Step 45552: {'lr': 0.0004005672198355579, 'samples': 8745984, 'steps': 45551, 'loss/train': 1.3043690919876099} 11/07/2021 03:40:22 - INFO - __main__ - Step 45553: {'lr': 0.00040056298346316554, 'samples': 8746176, 'steps': 45552, 'loss/train': 1.378745675086975} 11/07/2021 03:40:22 - INFO - __main__ - Step 45554: {'lr': 0.0004005587470229318, 'samples': 8746368, 'steps': 45553, 'loss/train': 1.0702617168426514} 11/07/2021 03:40:23 - INFO - __main__ - Step 45555: {'lr': 0.00040055451051485865, 'samples': 8746560, 'steps': 45554, 'loss/train': 2.052830457687378} 11/07/2021 03:40:23 - INFO - __main__ - Step 45556: {'lr': 0.0004005502739389479, 'samples': 8746752, 'steps': 45555, 'loss/train': 1.501568078994751} 11/07/2021 03:40:24 - INFO - __main__ - Step 45557: {'lr': 0.00040054603729520154, 'samples': 8746944, 'steps': 45556, 'loss/train': 1.662977933883667} 11/07/2021 03:40:24 - INFO - __main__ - Step 45558: {'lr': 0.00040054180058362156, 'samples': 8747136, 'steps': 45557, 'loss/train': 1.621247410774231} 11/07/2021 03:40:25 - INFO - __main__ - Step 45559: {'lr': 0.0004005375638042097, 'samples': 8747328, 'steps': 45558, 'loss/train': 1.490097999572754} 11/07/2021 03:40:25 - INFO - __main__ - Step 45560: {'lr': 0.0004005333269569679, 'samples': 8747520, 'steps': 45559, 'loss/train': 1.5717151165008545} 11/07/2021 03:40:25 - INFO - __main__ - Step 45561: {'lr': 0.0004005290900418982, 'samples': 8747712, 'steps': 45560, 'loss/train': 1.599551796913147} 11/07/2021 03:40:26 - INFO - __main__ - Step 45562: {'lr': 0.0004005248530590023, 'samples': 8747904, 'steps': 45561, 'loss/train': 1.0087416172027588} 11/07/2021 03:40:27 - INFO - __main__ - Step 45563: {'lr': 0.0004005206160082823, 'samples': 8748096, 'steps': 45562, 'loss/train': 1.5920844078063965} 11/07/2021 03:40:27 - INFO - __main__ - Step 45564: {'lr': 0.00040051637888973996, 'samples': 8748288, 'steps': 45563, 'loss/train': 1.5179200172424316} 11/07/2021 03:40:28 - INFO - __main__ - Step 45565: {'lr': 0.0004005121417033773, 'samples': 8748480, 'steps': 45564, 'loss/train': 0.9567902088165283} 11/07/2021 03:40:28 - INFO - __main__ - Step 45566: {'lr': 0.0004005079044491963, 'samples': 8748672, 'steps': 45565, 'loss/train': 0.9599171280860901} 11/07/2021 03:40:28 - INFO - __main__ - Step 45567: {'lr': 0.0004005036671271986, 'samples': 8748864, 'steps': 45566, 'loss/train': 1.2490531206130981} 11/07/2021 03:40:29 - INFO - __main__ - Step 45568: {'lr': 0.00040049942973738626, 'samples': 8749056, 'steps': 45567, 'loss/train': 1.963448405265808} 11/07/2021 03:40:30 - INFO - __main__ - Step 45569: {'lr': 0.00040049519227976135, 'samples': 8749248, 'steps': 45568, 'loss/train': 1.647210955619812} 11/07/2021 03:40:30 - INFO - __main__ - Step 45570: {'lr': 0.0004004909547543255, 'samples': 8749440, 'steps': 45569, 'loss/train': 1.090729832649231} 11/07/2021 03:40:30 - INFO - __main__ - Step 45571: {'lr': 0.0004004867171610808, 'samples': 8749632, 'steps': 45570, 'loss/train': 1.408889651298523} 11/07/2021 03:40:31 - INFO - __main__ - Step 45572: {'lr': 0.00040048247950002917, 'samples': 8749824, 'steps': 45571, 'loss/train': 1.5346219539642334} 11/07/2021 03:40:32 - INFO - __main__ - Step 45573: {'lr': 0.0004004782417711724, 'samples': 8750016, 'steps': 45572, 'loss/train': 0.6711319088935852} 11/07/2021 03:40:32 - INFO - __main__ - Step 45574: {'lr': 0.0004004740039745124, 'samples': 8750208, 'steps': 45573, 'loss/train': 1.6857950687408447} 11/07/2021 03:40:33 - INFO - __main__ - Step 45575: {'lr': 0.0004004697661100512, 'samples': 8750400, 'steps': 45574, 'loss/train': 1.100041151046753} 11/07/2021 03:40:33 - INFO - __main__ - Step 45576: {'lr': 0.0004004655281777906, 'samples': 8750592, 'steps': 45575, 'loss/train': 1.2100518941879272} 11/07/2021 03:40:33 - INFO - __main__ - Step 45577: {'lr': 0.0004004612901777326, 'samples': 8750784, 'steps': 45576, 'loss/train': 1.9163987636566162} 11/07/2021 03:40:34 - INFO - __main__ - Step 45578: {'lr': 0.000400457052109879, 'samples': 8750976, 'steps': 45577, 'loss/train': 1.7235652208328247} 11/07/2021 03:40:35 - INFO - __main__ - Step 45579: {'lr': 0.0004004528139742319, 'samples': 8751168, 'steps': 45578, 'loss/train': 1.5605733394622803} 11/07/2021 03:40:35 - INFO - __main__ - Step 45580: {'lr': 0.00040044857577079294, 'samples': 8751360, 'steps': 45579, 'loss/train': 1.3955143690109253} 11/07/2021 03:40:35 - INFO - __main__ - Step 45581: {'lr': 0.00040044433749956434, 'samples': 8751552, 'steps': 45580, 'loss/train': 1.2570422887802124} 11/07/2021 03:40:36 - INFO - __main__ - Step 45582: {'lr': 0.0004004400991605477, 'samples': 8751744, 'steps': 45581, 'loss/train': 0.8221997618675232} 11/07/2021 03:40:36 - INFO - __main__ - Step 45583: {'lr': 0.0004004358607537451, 'samples': 8751936, 'steps': 45582, 'loss/train': 1.2752317190170288} 11/07/2021 03:40:37 - INFO - __main__ - Step 45584: {'lr': 0.0004004316222791584, 'samples': 8752128, 'steps': 45583, 'loss/train': 1.6482517719268799} 11/07/2021 03:40:38 - INFO - __main__ - Step 45585: {'lr': 0.00040042738373678954, 'samples': 8752320, 'steps': 45584, 'loss/train': 1.456422209739685} 11/07/2021 03:40:38 - INFO - __main__ - Step 45586: {'lr': 0.0004004231451266406, 'samples': 8752512, 'steps': 45585, 'loss/train': 1.0222355127334595} 11/07/2021 03:40:38 - INFO - __main__ - Step 45587: {'lr': 0.0004004189064487131, 'samples': 8752704, 'steps': 45586, 'loss/train': 0.8327646255493164} 11/07/2021 03:40:39 - INFO - __main__ - Step 45588: {'lr': 0.00040041466770300923, 'samples': 8752896, 'steps': 45587, 'loss/train': 1.407906413078308} 11/07/2021 03:40:40 - INFO - __main__ - Step 45589: {'lr': 0.00040041042888953085, 'samples': 8753088, 'steps': 45588, 'loss/train': 1.678365707397461} 11/07/2021 03:40:40 - INFO - __main__ - Step 45590: {'lr': 0.0004004061900082798, 'samples': 8753280, 'steps': 45589, 'loss/train': 1.6984779834747314} 11/07/2021 03:40:41 - INFO - __main__ - Step 45591: {'lr': 0.00040040195105925803, 'samples': 8753472, 'steps': 45590, 'loss/train': 1.6859689950942993} 11/07/2021 03:40:41 - INFO - __main__ - Step 45592: {'lr': 0.00040039771204246756, 'samples': 8753664, 'steps': 45591, 'loss/train': 2.851062536239624} 11/07/2021 03:40:41 - INFO - __main__ - Step 45593: {'lr': 0.0004003934729579101, 'samples': 8753856, 'steps': 45592, 'loss/train': 1.8652925491333008} 11/07/2021 03:40:42 - INFO - __main__ - Step 45594: {'lr': 0.0004003892338055877, 'samples': 8754048, 'steps': 45593, 'loss/train': 1.4023840427398682} 11/07/2021 03:40:43 - INFO - __main__ - Step 45595: {'lr': 0.0004003849945855023, 'samples': 8754240, 'steps': 45594, 'loss/train': 1.2967686653137207} 11/07/2021 03:40:43 - INFO - __main__ - Step 45596: {'lr': 0.0004003807552976556, 'samples': 8754432, 'steps': 45595, 'loss/train': 0.9110772013664246} 11/07/2021 03:40:43 - INFO - __main__ - Step 45597: {'lr': 0.00040037651594204975, 'samples': 8754624, 'steps': 45596, 'loss/train': 1.5004764795303345} 11/07/2021 03:40:44 - INFO - __main__ - Step 45598: {'lr': 0.00040037227651868655, 'samples': 8754816, 'steps': 45597, 'loss/train': 0.45010626316070557} 11/07/2021 03:40:44 - INFO - __main__ - Step 45599: {'lr': 0.000400368037027568, 'samples': 8755008, 'steps': 45598, 'loss/train': 1.6588164567947388} 11/07/2021 03:40:45 - INFO - __main__ - Step 45600: {'lr': 0.0004003637974686958, 'samples': 8755200, 'steps': 45599, 'loss/train': 1.6795395612716675} 11/07/2021 03:40:46 - INFO - __main__ - Step 45601: {'lr': 0.000400359557842072, 'samples': 8755392, 'steps': 45600, 'loss/train': 1.6923891305923462} 11/07/2021 03:40:46 - INFO - __main__ - Step 45602: {'lr': 0.00040035531814769853, 'samples': 8755584, 'steps': 45601, 'loss/train': 1.6309795379638672} 11/07/2021 03:40:46 - INFO - __main__ - Step 45603: {'lr': 0.0004003510783855774, 'samples': 8755776, 'steps': 45602, 'loss/train': 1.6606926918029785} 11/07/2021 03:40:47 - INFO - __main__ - Step 45604: {'lr': 0.00040034683855571027, 'samples': 8755968, 'steps': 45603, 'loss/train': 1.921417236328125} 11/07/2021 03:40:48 - INFO - __main__ - Step 45605: {'lr': 0.00040034259865809915, 'samples': 8756160, 'steps': 45604, 'loss/train': 1.2512096166610718} 11/07/2021 03:40:48 - INFO - __main__ - Step 45606: {'lr': 0.00040033835869274605, 'samples': 8756352, 'steps': 45605, 'loss/train': 0.9501252174377441} 11/07/2021 03:40:48 - INFO - __main__ - Step 45607: {'lr': 0.00040033411865965276, 'samples': 8756544, 'steps': 45606, 'loss/train': 1.3827056884765625} 11/07/2021 03:40:49 - INFO - __main__ - Step 45608: {'lr': 0.0004003298785588212, 'samples': 8756736, 'steps': 45607, 'loss/train': 1.574720025062561} 11/07/2021 03:40:49 - INFO - __main__ - Step 45609: {'lr': 0.00040032563839025335, 'samples': 8756928, 'steps': 45608, 'loss/train': 1.3279770612716675} 11/07/2021 03:40:50 - INFO - __main__ - Step 45610: {'lr': 0.00040032139815395114, 'samples': 8757120, 'steps': 45609, 'loss/train': 1.8429594039916992} 11/07/2021 03:40:51 - INFO - __main__ - Step 45611: {'lr': 0.00040031715784991643, 'samples': 8757312, 'steps': 45610, 'loss/train': 1.3361419439315796} 11/07/2021 03:40:51 - INFO - __main__ - Step 45612: {'lr': 0.000400312917478151, 'samples': 8757504, 'steps': 45611, 'loss/train': 1.6061185598373413} 11/07/2021 03:40:51 - INFO - __main__ - Step 45613: {'lr': 0.000400308677038657, 'samples': 8757696, 'steps': 45612, 'loss/train': 1.4026153087615967} 11/07/2021 03:40:52 - INFO - __main__ - Step 45614: {'lr': 0.0004003044365314362, 'samples': 8757888, 'steps': 45613, 'loss/train': 1.5190362930297852} 11/07/2021 03:40:53 - INFO - __main__ - Step 45615: {'lr': 0.0004003001959564906, 'samples': 8758080, 'steps': 45614, 'loss/train': 0.32917556166648865} 11/07/2021 03:40:54 - INFO - __main__ - Step 45616: {'lr': 0.000400295955313822, 'samples': 8758272, 'steps': 45615, 'loss/train': 1.3965507745742798} 11/07/2021 03:40:54 - INFO - __main__ - Step 45617: {'lr': 0.0004002917146034323, 'samples': 8758464, 'steps': 45616, 'loss/train': 1.3469103574752808} 11/07/2021 03:40:54 - INFO - __main__ - Step 45618: {'lr': 0.0004002874738253235, 'samples': 8758656, 'steps': 45617, 'loss/train': 1.355541467666626} 11/07/2021 03:40:55 - INFO - __main__ - Step 45619: {'lr': 0.00040028323297949754, 'samples': 8758848, 'steps': 45618, 'loss/train': 1.3055799007415771} 11/07/2021 03:40:55 - INFO - __main__ - Step 45620: {'lr': 0.0004002789920659563, 'samples': 8759040, 'steps': 45619, 'loss/train': 1.5339677333831787} 11/07/2021 03:40:56 - INFO - __main__ - Step 45621: {'lr': 0.0004002747510847016, 'samples': 8759232, 'steps': 45620, 'loss/train': 1.6751906871795654} 11/07/2021 03:40:57 - INFO - __main__ - Step 45622: {'lr': 0.0004002705100357354, 'samples': 8759424, 'steps': 45621, 'loss/train': 1.5554715394973755} 11/07/2021 03:40:57 - INFO - __main__ - Step 45623: {'lr': 0.00040026626891905963, 'samples': 8759616, 'steps': 45622, 'loss/train': 1.5582956075668335} 11/07/2021 03:40:57 - INFO - __main__ - Step 45624: {'lr': 0.00040026202773467623, 'samples': 8759808, 'steps': 45623, 'loss/train': 1.737872838973999} 11/07/2021 03:40:58 - INFO - __main__ - Step 45625: {'lr': 0.00040025778648258706, 'samples': 8760000, 'steps': 45624, 'loss/train': 1.5353635549545288} 11/07/2021 03:40:59 - INFO - __main__ - Step 45626: {'lr': 0.00040025354516279413, 'samples': 8760192, 'steps': 45625, 'loss/train': 0.9963065981864929} 11/07/2021 03:40:59 - INFO - __main__ - Step 45627: {'lr': 0.0004002493037752992, 'samples': 8760384, 'steps': 45626, 'loss/train': 1.507947564125061} 11/07/2021 03:40:59 - INFO - __main__ - Step 45628: {'lr': 0.0004002450623201043, 'samples': 8760576, 'steps': 45627, 'loss/train': 1.379876732826233} 11/07/2021 03:41:00 - INFO - __main__ - Step 45629: {'lr': 0.0004002408207972111, 'samples': 8760768, 'steps': 45628, 'loss/train': 2.1403496265411377} 11/07/2021 03:41:00 - INFO - __main__ - Step 45630: {'lr': 0.00040023657920662195, 'samples': 8760960, 'steps': 45629, 'loss/train': 1.431511402130127} 11/07/2021 03:41:00 - INFO - __main__ - Step 45631: {'lr': 0.0004002323375483384, 'samples': 8761152, 'steps': 45630, 'loss/train': 1.7480183839797974} 11/07/2021 03:41:01 - INFO - __main__ - Step 45632: {'lr': 0.00040022809582236245, 'samples': 8761344, 'steps': 45631, 'loss/train': 2.1796576976776123} 11/07/2021 03:41:02 - INFO - __main__ - Step 45633: {'lr': 0.0004002238540286961, 'samples': 8761536, 'steps': 45632, 'loss/train': 1.638445258140564} 11/07/2021 03:41:02 - INFO - __main__ - Step 45634: {'lr': 0.00040021961216734123, 'samples': 8761728, 'steps': 45633, 'loss/train': 1.6624153852462769} 11/07/2021 03:41:03 - INFO - __main__ - Step 45635: {'lr': 0.0004002153702382997, 'samples': 8761920, 'steps': 45634, 'loss/train': 1.3904660940170288} 11/07/2021 03:41:03 - INFO - __main__ - Step 45636: {'lr': 0.0004002111282415734, 'samples': 8762112, 'steps': 45635, 'loss/train': 0.7281976342201233} 11/07/2021 03:41:04 - INFO - __main__ - Step 45637: {'lr': 0.00040020688617716427, 'samples': 8762304, 'steps': 45636, 'loss/train': 1.0489119291305542} 11/07/2021 03:41:04 - INFO - __main__ - Step 45638: {'lr': 0.0004002026440450742, 'samples': 8762496, 'steps': 45637, 'loss/train': 1.5963797569274902} 11/07/2021 03:41:05 - INFO - __main__ - Step 45639: {'lr': 0.0004001984018453052, 'samples': 8762688, 'steps': 45638, 'loss/train': 1.5868746042251587} 11/07/2021 03:41:05 - INFO - __main__ - Step 45640: {'lr': 0.0004001941595778592, 'samples': 8762880, 'steps': 45639, 'loss/train': 1.3480441570281982} 11/07/2021 03:41:05 - INFO - __main__ - Step 45641: {'lr': 0.0004001899172427379, 'samples': 8763072, 'steps': 45640, 'loss/train': 1.348357081413269} 11/07/2021 03:41:06 - INFO - __main__ - Step 45642: {'lr': 0.00040018567483994337, 'samples': 8763264, 'steps': 45641, 'loss/train': 1.5343412160873413} 11/07/2021 03:41:07 - INFO - __main__ - Step 45643: {'lr': 0.00040018143236947756, 'samples': 8763456, 'steps': 45642, 'loss/train': 1.5094057321548462} 11/07/2021 03:41:07 - INFO - __main__ - Step 45644: {'lr': 0.0004001771898313422, 'samples': 8763648, 'steps': 45643, 'loss/train': 1.577721357345581} 11/07/2021 03:41:07 - INFO - __main__ - Step 45645: {'lr': 0.00040017294722553945, 'samples': 8763840, 'steps': 45644, 'loss/train': 1.5198816061019897} 11/07/2021 03:41:08 - INFO - __main__ - Step 45646: {'lr': 0.000400168704552071, 'samples': 8764032, 'steps': 45645, 'loss/train': 1.4890958070755005} 11/07/2021 03:41:09 - INFO - __main__ - Step 45647: {'lr': 0.0004001644618109389, 'samples': 8764224, 'steps': 45646, 'loss/train': 2.0204715728759766} 11/07/2021 03:41:09 - INFO - __main__ - Step 45648: {'lr': 0.00040016021900214497, 'samples': 8764416, 'steps': 45647, 'loss/train': 1.0654419660568237} 11/07/2021 03:41:09 - INFO - __main__ - Step 45649: {'lr': 0.00040015597612569115, 'samples': 8764608, 'steps': 45648, 'loss/train': 1.324616551399231} 11/07/2021 03:41:10 - INFO - __main__ - Step 45650: {'lr': 0.00040015173318157937, 'samples': 8764800, 'steps': 45649, 'loss/train': 1.0149785280227661} 11/07/2021 03:41:10 - INFO - __main__ - Step 45651: {'lr': 0.00040014749016981154, 'samples': 8764992, 'steps': 45650, 'loss/train': 1.5370380878448486} 11/07/2021 03:41:11 - INFO - __main__ - Step 45652: {'lr': 0.00040014324709038965, 'samples': 8765184, 'steps': 45651, 'loss/train': 1.5011380910873413} 11/07/2021 03:41:12 - INFO - __main__ - Step 45653: {'lr': 0.00040013900394331544, 'samples': 8765376, 'steps': 45652, 'loss/train': 1.340386152267456} 11/07/2021 03:41:12 - INFO - __main__ - Step 45654: {'lr': 0.0004001347607285909, 'samples': 8765568, 'steps': 45653, 'loss/train': 1.6123696565628052} 11/07/2021 03:41:12 - INFO - __main__ - Step 45655: {'lr': 0.000400130517446218, 'samples': 8765760, 'steps': 45654, 'loss/train': 1.710909128189087} 11/07/2021 03:41:13 - INFO - __main__ - Step 45656: {'lr': 0.00040012627409619853, 'samples': 8765952, 'steps': 45655, 'loss/train': 1.3030675649642944} 11/07/2021 03:41:13 - INFO - __main__ - Step 45657: {'lr': 0.00040012203067853457, 'samples': 8766144, 'steps': 45656, 'loss/train': 1.6493674516677856} 11/07/2021 03:41:14 - INFO - __main__ - Step 45658: {'lr': 0.0004001177871932279, 'samples': 8766336, 'steps': 45657, 'loss/train': 1.0549403429031372} 11/07/2021 03:41:15 - INFO - __main__ - Step 45659: {'lr': 0.00040011354364028053, 'samples': 8766528, 'steps': 45658, 'loss/train': 1.245045781135559} 11/07/2021 03:41:15 - INFO - __main__ - Step 45660: {'lr': 0.00040010930001969426, 'samples': 8766720, 'steps': 45659, 'loss/train': 0.8746814727783203} 11/07/2021 03:41:15 - INFO - __main__ - Step 45661: {'lr': 0.00040010505633147106, 'samples': 8766912, 'steps': 45660, 'loss/train': 0.6673946976661682} 11/07/2021 03:41:16 - INFO - __main__ - Step 45662: {'lr': 0.00040010081257561283, 'samples': 8767104, 'steps': 45661, 'loss/train': 0.17925159633159637} 11/07/2021 03:41:17 - INFO - __main__ - Step 45663: {'lr': 0.0004000965687521215, 'samples': 8767296, 'steps': 45662, 'loss/train': 1.4898158311843872} 11/07/2021 03:41:17 - INFO - __main__ - Step 45664: {'lr': 0.0004000923248609989, 'samples': 8767488, 'steps': 45663, 'loss/train': 1.8594963550567627} 11/07/2021 03:41:17 - INFO - __main__ - Step 45665: {'lr': 0.00040008808090224714, 'samples': 8767680, 'steps': 45664, 'loss/train': 1.7202855348587036} 11/07/2021 03:41:18 - INFO - __main__ - Step 45666: {'lr': 0.0004000838368758679, 'samples': 8767872, 'steps': 45665, 'loss/train': 1.6891522407531738} 11/07/2021 03:41:18 - INFO - __main__ - Step 45667: {'lr': 0.00040007959278186327, 'samples': 8768064, 'steps': 45666, 'loss/train': 1.3225188255310059} 11/07/2021 03:41:19 - INFO - __main__ - Step 45668: {'lr': 0.0004000753486202351, 'samples': 8768256, 'steps': 45667, 'loss/train': 1.238524317741394} 11/07/2021 03:41:20 - INFO - __main__ - Step 45669: {'lr': 0.0004000711043909853, 'samples': 8768448, 'steps': 45668, 'loss/train': 1.3674970865249634} 11/07/2021 03:41:20 - INFO - __main__ - Step 45670: {'lr': 0.0004000668600941157, 'samples': 8768640, 'steps': 45669, 'loss/train': 1.4492086172103882} 11/07/2021 03:41:20 - INFO - __main__ - Step 45671: {'lr': 0.00040006261572962833, 'samples': 8768832, 'steps': 45670, 'loss/train': 1.239437222480774} 11/07/2021 03:41:21 - INFO - __main__ - Step 45672: {'lr': 0.00040005837129752496, 'samples': 8769024, 'steps': 45671, 'loss/train': 1.1618527173995972} 11/07/2021 03:41:22 - INFO - __main__ - Step 45673: {'lr': 0.00040005412679780777, 'samples': 8769216, 'steps': 45672, 'loss/train': 1.4424184560775757} 11/07/2021 03:41:22 - INFO - __main__ - Step 45674: {'lr': 0.00040004988223047843, 'samples': 8769408, 'steps': 45673, 'loss/train': 1.3270950317382812} 11/07/2021 03:41:22 - INFO - __main__ - Step 45675: {'lr': 0.0004000456375955389, 'samples': 8769600, 'steps': 45674, 'loss/train': 1.350642442703247} 11/07/2021 03:41:23 - INFO - __main__ - Step 45676: {'lr': 0.00040004139289299127, 'samples': 8769792, 'steps': 45675, 'loss/train': 1.709375023841858} 11/07/2021 03:41:23 - INFO - __main__ - Step 45677: {'lr': 0.0004000371481228371, 'samples': 8769984, 'steps': 45676, 'loss/train': 1.4423998594284058} 11/07/2021 03:41:24 - INFO - __main__ - Step 45678: {'lr': 0.00040003290328507855, 'samples': 8770176, 'steps': 45677, 'loss/train': 1.486462116241455} 11/07/2021 03:41:25 - INFO - __main__ - Step 45679: {'lr': 0.0004000286583797176, 'samples': 8770368, 'steps': 45678, 'loss/train': 1.642520546913147} 11/07/2021 03:41:25 - INFO - __main__ - Step 45680: {'lr': 0.000400024413406756, 'samples': 8770560, 'steps': 45679, 'loss/train': 1.4172120094299316} 11/07/2021 03:41:25 - INFO - __main__ - Step 45681: {'lr': 0.0004000201683661957, 'samples': 8770752, 'steps': 45680, 'loss/train': 1.7763307094573975} 11/07/2021 03:41:26 - INFO - __main__ - Step 45682: {'lr': 0.0004000159232580386, 'samples': 8770944, 'steps': 45681, 'loss/train': 1.6562503576278687} 11/07/2021 03:41:27 - INFO - __main__ - Step 45683: {'lr': 0.0004000116780822867, 'samples': 8771136, 'steps': 45682, 'loss/train': 1.8312898874282837} 11/07/2021 03:41:27 - INFO - __main__ - Step 45684: {'lr': 0.0004000074328389418, 'samples': 8771328, 'steps': 45683, 'loss/train': 1.7056604623794556} 11/07/2021 03:41:28 - INFO - __main__ - Step 45685: {'lr': 0.0004000031875280059, 'samples': 8771520, 'steps': 45684, 'loss/train': 1.4973245859146118} 11/07/2021 03:41:28 - INFO - __main__ - Step 45686: {'lr': 0.00039999894214948087, 'samples': 8771712, 'steps': 45685, 'loss/train': 1.3996328115463257} 11/07/2021 03:41:28 - INFO - __main__ - Step 45687: {'lr': 0.00039999469670336864, 'samples': 8771904, 'steps': 45686, 'loss/train': 0.9094541072845459} 11/07/2021 03:41:29 - INFO - __main__ - Step 45688: {'lr': 0.0003999904511896711, 'samples': 8772096, 'steps': 45687, 'loss/train': 1.5477782487869263} 11/07/2021 03:41:30 - INFO - __main__ - Step 45689: {'lr': 0.00039998620560839014, 'samples': 8772288, 'steps': 45688, 'loss/train': 1.755375862121582} 11/07/2021 03:41:30 - INFO - __main__ - Step 45690: {'lr': 0.0003999819599595278, 'samples': 8772480, 'steps': 45689, 'loss/train': 1.5153800249099731} 11/07/2021 03:41:30 - INFO - __main__ - Step 45691: {'lr': 0.00039997771424308583, 'samples': 8772672, 'steps': 45690, 'loss/train': 1.8970978260040283} 11/07/2021 03:41:31 - INFO - __main__ - Step 45692: {'lr': 0.0003999734684590662, 'samples': 8772864, 'steps': 45691, 'loss/train': 1.9008773565292358} 11/07/2021 03:41:31 - INFO - __main__ - Step 45693: {'lr': 0.0003999692226074709, 'samples': 8773056, 'steps': 45692, 'loss/train': 1.364372968673706} 11/07/2021 03:41:32 - INFO - __main__ - Step 45694: {'lr': 0.0003999649766883018, 'samples': 8773248, 'steps': 45693, 'loss/train': 1.855595588684082} 11/07/2021 03:41:32 - INFO - __main__ - Step 45695: {'lr': 0.0003999607307015607, 'samples': 8773440, 'steps': 45694, 'loss/train': 0.847008466720581} 11/07/2021 03:41:33 - INFO - __main__ - Step 45696: {'lr': 0.00039995648464724966, 'samples': 8773632, 'steps': 45695, 'loss/train': 1.5284357070922852} 11/07/2021 03:41:33 - INFO - __main__ - Step 45697: {'lr': 0.00039995223852537054, 'samples': 8773824, 'steps': 45696, 'loss/train': 1.5149009227752686} 11/07/2021 03:41:34 - INFO - __main__ - Step 45698: {'lr': 0.0003999479923359253, 'samples': 8774016, 'steps': 45697, 'loss/train': 1.5185508728027344} 11/07/2021 03:41:34 - INFO - __main__ - Step 45699: {'lr': 0.0003999437460789157, 'samples': 8774208, 'steps': 45698, 'loss/train': 1.335496425628662} 11/07/2021 03:41:35 - INFO - __main__ - Step 45700: {'lr': 0.0003999394997543439, 'samples': 8774400, 'steps': 45699, 'loss/train': 0.5011448264122009} 11/07/2021 03:41:35 - INFO - __main__ - Step 45701: {'lr': 0.0003999352533622116, 'samples': 8774592, 'steps': 45700, 'loss/train': 2.193835973739624} 11/07/2021 03:41:36 - INFO - __main__ - Step 45702: {'lr': 0.00039993100690252084, 'samples': 8774784, 'steps': 45701, 'loss/train': 1.5538792610168457} 11/07/2021 03:41:36 - INFO - __main__ - Step 45703: {'lr': 0.00039992676037527337, 'samples': 8774976, 'steps': 45702, 'loss/train': 1.1894370317459106} 11/07/2021 03:41:36 - INFO - __main__ - Step 45704: {'lr': 0.0003999225137804713, 'samples': 8775168, 'steps': 45703, 'loss/train': 1.1973448991775513} 11/07/2021 03:41:37 - INFO - __main__ - Step 45705: {'lr': 0.0003999182671181164, 'samples': 8775360, 'steps': 45704, 'loss/train': 1.148694396018982} 11/07/2021 03:41:38 - INFO - __main__ - Step 45706: {'lr': 0.00039991402038821067, 'samples': 8775552, 'steps': 45705, 'loss/train': 1.6992937326431274} 11/07/2021 03:41:38 - INFO - __main__ - Step 45707: {'lr': 0.00039990977359075607, 'samples': 8775744, 'steps': 45706, 'loss/train': 1.647392988204956} 11/07/2021 03:41:38 - INFO - __main__ - Step 45708: {'lr': 0.00039990552672575436, 'samples': 8775936, 'steps': 45707, 'loss/train': 1.611392617225647} 11/07/2021 03:41:39 - INFO - __main__ - Step 45709: {'lr': 0.00039990127979320757, 'samples': 8776128, 'steps': 45708, 'loss/train': 1.4701683521270752} 11/07/2021 03:41:39 - INFO - __main__ - Step 45710: {'lr': 0.00039989703279311753, 'samples': 8776320, 'steps': 45709, 'loss/train': 1.3313275575637817} 11/07/2021 03:41:40 - INFO - __main__ - Step 45711: {'lr': 0.00039989278572548625, 'samples': 8776512, 'steps': 45710, 'loss/train': 1.3801134824752808} 11/07/2021 03:41:41 - INFO - __main__ - Step 45712: {'lr': 0.00039988853859031557, 'samples': 8776704, 'steps': 45711, 'loss/train': 1.8523447513580322} 11/07/2021 03:41:41 - INFO - __main__ - Step 45713: {'lr': 0.0003998842913876074, 'samples': 8776896, 'steps': 45712, 'loss/train': 1.0423424243927002} 11/07/2021 03:41:41 - INFO - __main__ - Step 45714: {'lr': 0.0003998800441173637, 'samples': 8777088, 'steps': 45713, 'loss/train': 1.2443095445632935} 11/07/2021 03:41:42 - INFO - __main__ - Step 45715: {'lr': 0.00039987579677958643, 'samples': 8777280, 'steps': 45714, 'loss/train': 1.7403079271316528} 11/07/2021 03:41:42 - INFO - __main__ - Step 45716: {'lr': 0.0003998715493742774, 'samples': 8777472, 'steps': 45715, 'loss/train': 1.786270260810852} 11/07/2021 03:41:43 - INFO - __main__ - Step 45717: {'lr': 0.0003998673019014385, 'samples': 8777664, 'steps': 45716, 'loss/train': 1.768802523612976} 11/07/2021 03:41:43 - INFO - __main__ - Step 45718: {'lr': 0.0003998630543610717, 'samples': 8777856, 'steps': 45717, 'loss/train': 1.3839589357376099} 11/07/2021 03:41:44 - INFO - __main__ - Step 45719: {'lr': 0.00039985880675317897, 'samples': 8778048, 'steps': 45718, 'loss/train': 1.6259337663650513} 11/07/2021 03:41:44 - INFO - __main__ - Step 45720: {'lr': 0.0003998545590777622, 'samples': 8778240, 'steps': 45719, 'loss/train': 1.936063289642334} 11/07/2021 03:41:45 - INFO - __main__ - Step 45721: {'lr': 0.0003998503113348233, 'samples': 8778432, 'steps': 45720, 'loss/train': 1.554116129875183} 11/07/2021 03:41:46 - INFO - __main__ - Step 45722: {'lr': 0.0003998460635243641, 'samples': 8778624, 'steps': 45721, 'loss/train': 1.700764536857605} 11/07/2021 03:41:46 - INFO - __main__ - Step 45723: {'lr': 0.00039984181564638654, 'samples': 8778816, 'steps': 45722, 'loss/train': 1.6266900300979614} 11/07/2021 03:41:46 - INFO - __main__ - Step 45724: {'lr': 0.00039983756770089264, 'samples': 8779008, 'steps': 45723, 'loss/train': 0.8411117792129517} 11/07/2021 03:41:47 - INFO - __main__ - Step 45725: {'lr': 0.0003998333196878843, 'samples': 8779200, 'steps': 45724, 'loss/train': 1.3577433824539185} 11/07/2021 03:41:47 - INFO - __main__ - Step 45726: {'lr': 0.00039982907160736325, 'samples': 8779392, 'steps': 45725, 'loss/train': 1.300414800643921} 11/07/2021 03:41:47 - INFO - __main__ - Step 45727: {'lr': 0.00039982482345933155, 'samples': 8779584, 'steps': 45726, 'loss/train': 1.2126566171646118} 11/07/2021 03:41:48 - INFO - __main__ - Step 45728: {'lr': 0.00039982057524379124, 'samples': 8779776, 'steps': 45727, 'loss/train': 1.459671974182129} 11/07/2021 03:41:49 - INFO - __main__ - Step 45729: {'lr': 0.00039981632696074396, 'samples': 8779968, 'steps': 45728, 'loss/train': 1.786897897720337} 11/07/2021 03:41:49 - INFO - __main__ - Step 45730: {'lr': 0.00039981207861019175, 'samples': 8780160, 'steps': 45729, 'loss/train': 1.1281368732452393} 11/07/2021 03:41:49 - INFO - __main__ - Step 45731: {'lr': 0.0003998078301921365, 'samples': 8780352, 'steps': 45730, 'loss/train': 1.603014349937439} 11/07/2021 03:41:50 - INFO - __main__ - Step 45732: {'lr': 0.00039980358170658026, 'samples': 8780544, 'steps': 45731, 'loss/train': 1.6706631183624268} 11/07/2021 03:41:51 - INFO - __main__ - Step 45733: {'lr': 0.0003997993331535248, 'samples': 8780736, 'steps': 45732, 'loss/train': 1.5991101264953613} 11/07/2021 03:41:51 - INFO - __main__ - Step 45734: {'lr': 0.0003997950845329721, 'samples': 8780928, 'steps': 45733, 'loss/train': 1.6309759616851807} 11/07/2021 03:41:52 - INFO - __main__ - Step 45735: {'lr': 0.000399790835844924, 'samples': 8781120, 'steps': 45734, 'loss/train': 1.2797209024429321} 11/07/2021 03:41:52 - INFO - __main__ - Step 45736: {'lr': 0.00039978658708938244, 'samples': 8781312, 'steps': 45735, 'loss/train': 1.7494573593139648} 11/07/2021 03:41:52 - INFO - __main__ - Step 45737: {'lr': 0.00039978233826634934, 'samples': 8781504, 'steps': 45736, 'loss/train': 1.0572928190231323} 11/07/2021 03:41:54 - INFO - __main__ - Step 45738: {'lr': 0.0003997780893758267, 'samples': 8781696, 'steps': 45737, 'loss/train': 1.5383487939834595} 11/07/2021 03:41:54 - INFO - __main__ - Step 45739: {'lr': 0.0003997738404178164, 'samples': 8781888, 'steps': 45738, 'loss/train': 2.2209999561309814} 11/07/2021 03:41:54 - INFO - __main__ - Step 45740: {'lr': 0.00039976959139232017, 'samples': 8782080, 'steps': 45739, 'loss/train': 1.2510405778884888} 11/07/2021 03:41:55 - INFO - __main__ - Step 45741: {'lr': 0.0003997653422993402, 'samples': 8782272, 'steps': 45740, 'loss/train': 1.1893982887268066} 11/07/2021 03:41:55 - INFO - __main__ - Step 45742: {'lr': 0.0003997610931388782, 'samples': 8782464, 'steps': 45741, 'loss/train': 1.5910414457321167} 11/07/2021 03:41:56 - INFO - __main__ - Step 45743: {'lr': 0.0003997568439109363, 'samples': 8782656, 'steps': 45742, 'loss/train': 1.802226185798645} 11/07/2021 03:41:56 - INFO - __main__ - Step 45744: {'lr': 0.00039975259461551613, 'samples': 8782848, 'steps': 45743, 'loss/train': 1.9506511688232422} 11/07/2021 03:41:57 - INFO - __main__ - Step 45745: {'lr': 0.0003997483452526198, 'samples': 8783040, 'steps': 45744, 'loss/train': 1.4046062231063843} 11/07/2021 03:41:57 - INFO - __main__ - Step 45746: {'lr': 0.0003997440958222491, 'samples': 8783232, 'steps': 45745, 'loss/train': 1.6467649936676025} 11/07/2021 03:41:57 - INFO - __main__ - Step 45747: {'lr': 0.0003997398463244062, 'samples': 8783424, 'steps': 45746, 'loss/train': 1.4050190448760986} 11/07/2021 03:41:59 - INFO - __main__ - Step 45748: {'lr': 0.00039973559675909274, 'samples': 8783616, 'steps': 45747, 'loss/train': 1.5863054990768433} 11/07/2021 03:41:59 - INFO - __main__ - Step 45749: {'lr': 0.00039973134712631067, 'samples': 8783808, 'steps': 45748, 'loss/train': 1.5905817747116089} 11/07/2021 03:41:59 - INFO - __main__ - Step 45750: {'lr': 0.00039972709742606207, 'samples': 8784000, 'steps': 45749, 'loss/train': 1.5976406335830688} 11/07/2021 03:42:00 - INFO - __main__ - Step 45751: {'lr': 0.00039972284765834866, 'samples': 8784192, 'steps': 45750, 'loss/train': 1.4078799486160278} 11/07/2021 03:42:00 - INFO - __main__ - Step 45752: {'lr': 0.00039971859782317245, 'samples': 8784384, 'steps': 45751, 'loss/train': 0.5994760990142822} 11/07/2021 03:42:01 - INFO - __main__ - Step 45753: {'lr': 0.0003997143479205354, 'samples': 8784576, 'steps': 45752, 'loss/train': 1.6960455179214478} 11/07/2021 03:42:01 - INFO - __main__ - Step 45754: {'lr': 0.0003997100979504394, 'samples': 8784768, 'steps': 45753, 'loss/train': 1.2282835245132446} 11/07/2021 03:42:02 - INFO - __main__ - Step 45755: {'lr': 0.00039970584791288626, 'samples': 8784960, 'steps': 45754, 'loss/train': 1.7166662216186523} 11/07/2021 03:42:02 - INFO - __main__ - Step 45756: {'lr': 0.000399701597807878, 'samples': 8785152, 'steps': 45755, 'loss/train': 1.2824113368988037} 11/07/2021 03:42:02 - INFO - __main__ - Step 45757: {'lr': 0.00039969734763541657, 'samples': 8785344, 'steps': 45756, 'loss/train': 0.9290876984596252} 11/07/2021 03:42:03 - INFO - __main__ - Step 45758: {'lr': 0.00039969309739550373, 'samples': 8785536, 'steps': 45757, 'loss/train': 1.5645976066589355} 11/07/2021 03:42:04 - INFO - __main__ - Step 45759: {'lr': 0.0003996888470881416, 'samples': 8785728, 'steps': 45758, 'loss/train': 1.6670734882354736} 11/07/2021 03:42:04 - INFO - __main__ - Step 45760: {'lr': 0.0003996845967133319, 'samples': 8785920, 'steps': 45759, 'loss/train': 1.7307648658752441} 11/07/2021 03:42:04 - INFO - __main__ - Step 45761: {'lr': 0.0003996803462710766, 'samples': 8786112, 'steps': 45760, 'loss/train': 1.9626553058624268} 11/07/2021 03:42:05 - INFO - __main__ - Step 45762: {'lr': 0.00039967609576137774, 'samples': 8786304, 'steps': 45761, 'loss/train': 1.5805244445800781} 11/07/2021 03:42:05 - INFO - __main__ - Step 45763: {'lr': 0.0003996718451842371, 'samples': 8786496, 'steps': 45762, 'loss/train': 1.5933947563171387} 11/07/2021 03:42:06 - INFO - __main__ - Step 45764: {'lr': 0.00039966759453965664, 'samples': 8786688, 'steps': 45763, 'loss/train': 2.0527541637420654} 11/07/2021 03:42:07 - INFO - __main__ - Step 45765: {'lr': 0.00039966334382763826, 'samples': 8786880, 'steps': 45764, 'loss/train': 1.156002402305603} 11/07/2021 03:42:07 - INFO - __main__ - Step 45766: {'lr': 0.00039965909304818387, 'samples': 8787072, 'steps': 45765, 'loss/train': 0.7822408080101013} 11/07/2021 03:42:07 - INFO - __main__ - Step 45767: {'lr': 0.00039965484220129546, 'samples': 8787264, 'steps': 45766, 'loss/train': 1.55080246925354} 11/07/2021 03:42:08 - INFO - __main__ - Step 45768: {'lr': 0.0003996505912869749, 'samples': 8787456, 'steps': 45767, 'loss/train': 1.7155765295028687} 11/07/2021 03:42:09 - INFO - __main__ - Step 45769: {'lr': 0.000399646340305224, 'samples': 8787648, 'steps': 45768, 'loss/train': 1.4348912239074707} 11/07/2021 03:42:09 - INFO - __main__ - Step 45770: {'lr': 0.00039964208925604485, 'samples': 8787840, 'steps': 45769, 'loss/train': 0.3281099200248718} 11/07/2021 03:42:09 - INFO - __main__ - Step 45771: {'lr': 0.0003996378381394392, 'samples': 8788032, 'steps': 45770, 'loss/train': 1.5719586610794067} 11/07/2021 03:42:10 - INFO - __main__ - Step 45772: {'lr': 0.00039963358695540907, 'samples': 8788224, 'steps': 45771, 'loss/train': 1.2992075681686401} 11/07/2021 03:42:10 - INFO - __main__ - Step 45773: {'lr': 0.0003996293357039564, 'samples': 8788416, 'steps': 45772, 'loss/train': 1.2013145685195923} 11/07/2021 03:42:11 - INFO - __main__ - Step 45774: {'lr': 0.0003996250843850831, 'samples': 8788608, 'steps': 45773, 'loss/train': 1.9157236814498901} 11/07/2021 03:42:11 - INFO - __main__ - Step 45775: {'lr': 0.000399620832998791, 'samples': 8788800, 'steps': 45774, 'loss/train': 1.9511425495147705} 11/07/2021 03:42:12 - INFO - __main__ - Step 45776: {'lr': 0.000399616581545082, 'samples': 8788992, 'steps': 45775, 'loss/train': 1.366453766822815} 11/07/2021 03:42:12 - INFO - __main__ - Step 45777: {'lr': 0.0003996123300239581, 'samples': 8789184, 'steps': 45776, 'loss/train': 1.2759289741516113} 11/07/2021 03:42:12 - INFO - __main__ - Step 45778: {'lr': 0.0003996080784354212, 'samples': 8789376, 'steps': 45777, 'loss/train': 1.4267598390579224} 11/07/2021 03:42:13 - INFO - __main__ - Step 45779: {'lr': 0.0003996038267794733, 'samples': 8789568, 'steps': 45778, 'loss/train': 1.9631978273391724} 11/07/2021 03:42:14 - INFO - __main__ - Step 45780: {'lr': 0.0003995995750561161, 'samples': 8789760, 'steps': 45779, 'loss/train': 1.7855234146118164} 11/07/2021 03:42:14 - INFO - __main__ - Step 45781: {'lr': 0.00039959532326535175, 'samples': 8789952, 'steps': 45780, 'loss/train': 1.3308883905410767} 11/07/2021 03:42:15 - INFO - __main__ - Step 45782: {'lr': 0.000399591071407182, 'samples': 8790144, 'steps': 45781, 'loss/train': 1.2953991889953613} 11/07/2021 03:42:15 - INFO - __main__ - Step 45783: {'lr': 0.0003995868194816088, 'samples': 8790336, 'steps': 45782, 'loss/train': 1.6726969480514526} 11/07/2021 03:42:15 - INFO - __main__ - Step 45784: {'lr': 0.0003995825674886341, 'samples': 8790528, 'steps': 45783, 'loss/train': 1.6071983575820923} 11/07/2021 03:42:16 - INFO - __main__ - Step 45785: {'lr': 0.00039957831542825983, 'samples': 8790720, 'steps': 45784, 'loss/train': 1.026556372642517} 11/07/2021 03:42:17 - INFO - __main__ - Step 45786: {'lr': 0.0003995740633004878, 'samples': 8790912, 'steps': 45785, 'loss/train': 1.1194347143173218} 11/07/2021 03:42:17 - INFO - __main__ - Step 45787: {'lr': 0.00039956981110532007, 'samples': 8791104, 'steps': 45786, 'loss/train': 1.8880589008331299} 11/07/2021 03:42:17 - INFO - __main__ - Step 45788: {'lr': 0.0003995655588427586, 'samples': 8791296, 'steps': 45787, 'loss/train': 1.363681674003601} 11/07/2021 03:42:18 - INFO - __main__ - Step 45789: {'lr': 0.00039956130651280504, 'samples': 8791488, 'steps': 45788, 'loss/train': 1.6982097625732422} 11/07/2021 03:42:19 - INFO - __main__ - Step 45790: {'lr': 0.0003995570541154615, 'samples': 8791680, 'steps': 45789, 'loss/train': 1.8806573152542114} 11/07/2021 03:42:19 - INFO - __main__ - Step 45791: {'lr': 0.0003995528016507298, 'samples': 8791872, 'steps': 45790, 'loss/train': 1.3060063123703003} 11/07/2021 03:42:19 - INFO - __main__ - Step 45792: {'lr': 0.000399548549118612, 'samples': 8792064, 'steps': 45791, 'loss/train': 1.6437419652938843} 11/07/2021 03:42:20 - INFO - __main__ - Step 45793: {'lr': 0.00039954429651910993, 'samples': 8792256, 'steps': 45792, 'loss/train': 1.2622495889663696} 11/07/2021 03:42:20 - INFO - __main__ - Step 45794: {'lr': 0.00039954004385222555, 'samples': 8792448, 'steps': 45793, 'loss/train': 1.5217324495315552} 11/07/2021 03:42:21 - INFO - __main__ - Step 45795: {'lr': 0.00039953579111796065, 'samples': 8792640, 'steps': 45794, 'loss/train': 1.6628124713897705} 11/07/2021 03:42:21 - INFO - __main__ - Step 45796: {'lr': 0.00039953153831631726, 'samples': 8792832, 'steps': 45795, 'loss/train': 1.499194622039795} 11/07/2021 03:42:22 - INFO - __main__ - Step 45797: {'lr': 0.0003995272854472972, 'samples': 8793024, 'steps': 45796, 'loss/train': 1.5747181177139282} 11/07/2021 03:42:22 - INFO - __main__ - Step 45798: {'lr': 0.00039952303251090254, 'samples': 8793216, 'steps': 45797, 'loss/train': 1.9704207181930542} 11/07/2021 03:42:22 - INFO - __main__ - Step 45799: {'lr': 0.00039951877950713513, 'samples': 8793408, 'steps': 45798, 'loss/train': 1.6811201572418213} 11/07/2021 03:42:23 - INFO - __main__ - Step 45800: {'lr': 0.0003995145264359968, 'samples': 8793600, 'steps': 45799, 'loss/train': 1.8199926614761353} 11/07/2021 03:42:24 - INFO - __main__ - Step 45801: {'lr': 0.00039951027329748957, 'samples': 8793792, 'steps': 45800, 'loss/train': 1.422680377960205} 11/07/2021 03:42:24 - INFO - __main__ - Step 45802: {'lr': 0.0003995060200916153, 'samples': 8793984, 'steps': 45801, 'loss/train': 1.3478022813796997} 11/07/2021 03:42:24 - INFO - __main__ - Step 45803: {'lr': 0.0003995017668183759, 'samples': 8794176, 'steps': 45802, 'loss/train': 1.246759057044983} 11/07/2021 03:42:25 - INFO - __main__ - Step 45804: {'lr': 0.0003994975134777733, 'samples': 8794368, 'steps': 45803, 'loss/train': 1.6146609783172607} 11/07/2021 03:42:26 - INFO - __main__ - Step 45805: {'lr': 0.00039949326006980944, 'samples': 8794560, 'steps': 45804, 'loss/train': 1.7832012176513672} 11/07/2021 03:42:26 - INFO - __main__ - Step 45806: {'lr': 0.0003994890065944863, 'samples': 8794752, 'steps': 45805, 'loss/train': 1.466225504875183} 11/07/2021 03:42:27 - INFO - __main__ - Step 45807: {'lr': 0.00039948475305180567, 'samples': 8794944, 'steps': 45806, 'loss/train': 1.4615516662597656} 11/07/2021 03:42:27 - INFO - __main__ - Step 45808: {'lr': 0.0003994804994417695, 'samples': 8795136, 'steps': 45807, 'loss/train': 1.4964948892593384} 11/07/2021 03:42:27 - INFO - __main__ - Step 45809: {'lr': 0.0003994762457643797, 'samples': 8795328, 'steps': 45808, 'loss/train': 1.6247048377990723} 11/07/2021 03:42:28 - INFO - __main__ - Step 45810: {'lr': 0.0003994719920196383, 'samples': 8795520, 'steps': 45809, 'loss/train': 1.7376784086227417} 11/07/2021 03:42:29 - INFO - __main__ - Step 45811: {'lr': 0.00039946773820754704, 'samples': 8795712, 'steps': 45810, 'loss/train': 1.5790966749191284} 11/07/2021 03:42:29 - INFO - __main__ - Step 45812: {'lr': 0.00039946348432810797, 'samples': 8795904, 'steps': 45811, 'loss/train': 1.4171829223632812} 11/07/2021 03:42:29 - INFO - __main__ - Step 45813: {'lr': 0.0003994592303813229, 'samples': 8796096, 'steps': 45812, 'loss/train': 1.3879013061523438} 11/07/2021 03:42:30 - INFO - __main__ - Step 45814: {'lr': 0.00039945497636719384, 'samples': 8796288, 'steps': 45813, 'loss/train': 1.4347091913223267} 11/07/2021 03:42:30 - INFO - __main__ - Step 45815: {'lr': 0.00039945072228572275, 'samples': 8796480, 'steps': 45814, 'loss/train': 0.9813772439956665} 11/07/2021 03:42:31 - INFO - __main__ - Step 45816: {'lr': 0.0003994464681369114, 'samples': 8796672, 'steps': 45815, 'loss/train': 1.7121654748916626} 11/07/2021 03:42:32 - INFO - __main__ - Step 45817: {'lr': 0.0003994422139207618, 'samples': 8796864, 'steps': 45816, 'loss/train': 1.9688409566879272} 11/07/2021 03:42:32 - INFO - __main__ - Step 45818: {'lr': 0.00039943795963727583, 'samples': 8797056, 'steps': 45817, 'loss/train': 1.3847655057907104} 11/07/2021 03:42:32 - INFO - __main__ - Step 45819: {'lr': 0.0003994337052864554, 'samples': 8797248, 'steps': 45818, 'loss/train': 1.5700711011886597} 11/07/2021 03:42:33 - INFO - __main__ - Step 45820: {'lr': 0.00039942945086830246, 'samples': 8797440, 'steps': 45819, 'loss/train': 2.300741195678711} 11/07/2021 03:42:34 - INFO - __main__ - Step 45821: {'lr': 0.00039942519638281893, 'samples': 8797632, 'steps': 45820, 'loss/train': 1.5391618013381958} 11/07/2021 03:42:34 - INFO - __main__ - Step 45822: {'lr': 0.0003994209418300068, 'samples': 8797824, 'steps': 45821, 'loss/train': 1.2126270532608032} 11/07/2021 03:42:34 - INFO - __main__ - Step 45823: {'lr': 0.0003994166872098677, 'samples': 8798016, 'steps': 45822, 'loss/train': 1.7054393291473389} 11/07/2021 03:42:35 - INFO - __main__ - Step 45824: {'lr': 0.0003994124325224039, 'samples': 8798208, 'steps': 45823, 'loss/train': 1.7563220262527466} 11/07/2021 03:42:35 - INFO - __main__ - Step 45825: {'lr': 0.00039940817776761706, 'samples': 8798400, 'steps': 45824, 'loss/train': 1.543555498123169} 11/07/2021 03:42:37 - INFO - __main__ - Step 45826: {'lr': 0.0003994039229455093, 'samples': 8798592, 'steps': 45825, 'loss/train': 1.7660768032073975} 11/07/2021 03:42:37 - INFO - __main__ - Step 45827: {'lr': 0.00039939966805608234, 'samples': 8798784, 'steps': 45826, 'loss/train': 0.9904483556747437} 11/07/2021 03:42:38 - INFO - __main__ - Step 45828: {'lr': 0.0003993954130993383, 'samples': 8798976, 'steps': 45827, 'loss/train': 1.0330867767333984} 11/07/2021 03:42:38 - INFO - __main__ - Step 45829: {'lr': 0.0003993911580752789, 'samples': 8799168, 'steps': 45828, 'loss/train': 1.1959236860275269} 11/07/2021 03:42:38 - INFO - __main__ - Step 45830: {'lr': 0.00039938690298390624, 'samples': 8799360, 'steps': 45829, 'loss/train': 1.4125633239746094} 11/07/2021 03:42:39 - INFO - __main__ - Step 45831: {'lr': 0.00039938264782522206, 'samples': 8799552, 'steps': 45830, 'loss/train': 1.7893304824829102} 11/07/2021 03:42:39 - INFO - __main__ - Step 45832: {'lr': 0.0003993783925992284, 'samples': 8799744, 'steps': 45831, 'loss/train': 0.11289530247449875} 11/07/2021 03:42:40 - INFO - __main__ - Step 45833: {'lr': 0.00039937413730592713, 'samples': 8799936, 'steps': 45832, 'loss/train': 1.5255755186080933} 11/07/2021 03:42:40 - INFO - __main__ - Step 45834: {'lr': 0.0003993698819453202, 'samples': 8800128, 'steps': 45833, 'loss/train': 1.3934648036956787} 11/07/2021 03:42:41 - INFO - __main__ - Step 45835: {'lr': 0.00039936562651740956, 'samples': 8800320, 'steps': 45834, 'loss/train': 1.3922640085220337} 11/07/2021 03:42:41 - INFO - __main__ - Step 45836: {'lr': 0.00039936137102219695, 'samples': 8800512, 'steps': 45835, 'loss/train': 1.3778247833251953} 11/07/2021 03:42:41 - INFO - __main__ - Step 45837: {'lr': 0.0003993571154596845, 'samples': 8800704, 'steps': 45836, 'loss/train': 1.7465338706970215} 11/07/2021 03:42:43 - INFO - __main__ - Step 45838: {'lr': 0.00039935285982987403, 'samples': 8800896, 'steps': 45837, 'loss/train': 1.6409074068069458} 11/07/2021 03:42:43 - INFO - __main__ - Step 45839: {'lr': 0.0003993486041327674, 'samples': 8801088, 'steps': 45838, 'loss/train': 1.664171814918518} 11/07/2021 03:42:43 - INFO - __main__ - Step 45840: {'lr': 0.00039934434836836664, 'samples': 8801280, 'steps': 45839, 'loss/train': 1.5493240356445312} 11/07/2021 03:42:44 - INFO - __main__ - Step 45841: {'lr': 0.00039934009253667356, 'samples': 8801472, 'steps': 45840, 'loss/train': 1.693857192993164} 11/07/2021 03:42:44 - INFO - __main__ - Step 45842: {'lr': 0.0003993358366376903, 'samples': 8801664, 'steps': 45841, 'loss/train': 0.4450680911540985} 11/07/2021 03:42:44 - INFO - __main__ - Step 45843: {'lr': 0.0003993315806714185, 'samples': 8801856, 'steps': 45842, 'loss/train': 1.3008158206939697} 11/07/2021 03:42:45 - INFO - __main__ - Step 45844: {'lr': 0.0003993273246378602, 'samples': 8802048, 'steps': 45843, 'loss/train': 0.5249027609825134} 11/07/2021 03:42:46 - INFO - __main__ - Step 45845: {'lr': 0.00039932306853701735, 'samples': 8802240, 'steps': 45844, 'loss/train': 1.738145351409912} 11/07/2021 03:42:46 - INFO - __main__ - Step 45846: {'lr': 0.0003993188123688918, 'samples': 8802432, 'steps': 45845, 'loss/train': 1.5204349756240845} 11/07/2021 03:42:46 - INFO - __main__ - Step 45847: {'lr': 0.00039931455613348546, 'samples': 8802624, 'steps': 45846, 'loss/train': 1.2769067287445068} 11/07/2021 03:42:47 - INFO - __main__ - Step 45848: {'lr': 0.0003993102998308004, 'samples': 8802816, 'steps': 45847, 'loss/train': 1.755358099937439} 11/07/2021 03:42:48 - INFO - __main__ - Step 45849: {'lr': 0.0003993060434608383, 'samples': 8803008, 'steps': 45848, 'loss/train': 1.435434341430664} 11/07/2021 03:42:48 - INFO - __main__ - Step 45850: {'lr': 0.0003993017870236012, 'samples': 8803200, 'steps': 45849, 'loss/train': 1.9264001846313477} 11/07/2021 03:42:48 - INFO - __main__ - Step 45851: {'lr': 0.0003992975305190911, 'samples': 8803392, 'steps': 45850, 'loss/train': 1.6069012880325317} 11/07/2021 03:42:49 - INFO - __main__ - Step 45852: {'lr': 0.0003992932739473098, 'samples': 8803584, 'steps': 45851, 'loss/train': 1.504792332649231} 11/07/2021 03:42:49 - INFO - __main__ - Step 45853: {'lr': 0.0003992890173082593, 'samples': 8803776, 'steps': 45852, 'loss/train': 1.418312430381775} 11/07/2021 03:42:50 - INFO - __main__ - Step 45854: {'lr': 0.00039928476060194137, 'samples': 8803968, 'steps': 45853, 'loss/train': 1.2826389074325562} 11/07/2021 03:42:50 - INFO - __main__ - Step 45855: {'lr': 0.0003992805038283581, 'samples': 8804160, 'steps': 45854, 'loss/train': 1.3450167179107666} 11/07/2021 03:42:51 - INFO - __main__ - Step 45856: {'lr': 0.0003992762469875113, 'samples': 8804352, 'steps': 45855, 'loss/train': 0.4302082359790802} 11/07/2021 03:42:51 - INFO - __main__ - Step 45857: {'lr': 0.00039927199007940294, 'samples': 8804544, 'steps': 45856, 'loss/train': 1.6461188793182373} 11/07/2021 03:42:52 - INFO - __main__ - Step 45858: {'lr': 0.00039926773310403497, 'samples': 8804736, 'steps': 45857, 'loss/train': 2.005046844482422} 11/07/2021 03:42:53 - INFO - __main__ - Step 45859: {'lr': 0.0003992634760614092, 'samples': 8804928, 'steps': 45858, 'loss/train': 1.6766897439956665} 11/07/2021 03:42:53 - INFO - __main__ - Step 45860: {'lr': 0.00039925921895152765, 'samples': 8805120, 'steps': 45859, 'loss/train': 1.3352468013763428} 11/07/2021 03:42:53 - INFO - __main__ - Step 45861: {'lr': 0.00039925496177439226, 'samples': 8805312, 'steps': 45860, 'loss/train': 1.4517005681991577} 11/07/2021 03:42:54 - INFO - __main__ - Step 45862: {'lr': 0.0003992507045300048, 'samples': 8805504, 'steps': 45861, 'loss/train': 1.3755040168762207} 11/07/2021 03:42:54 - INFO - __main__ - Step 45863: {'lr': 0.00039924644721836734, 'samples': 8805696, 'steps': 45862, 'loss/train': 2.094545602798462} 11/07/2021 03:42:55 - INFO - __main__ - Step 45864: {'lr': 0.0003992421898394817, 'samples': 8805888, 'steps': 45863, 'loss/train': 1.4637939929962158} 11/07/2021 03:42:55 - INFO - __main__ - Step 45865: {'lr': 0.00039923793239334974, 'samples': 8806080, 'steps': 45864, 'loss/train': 1.7148493528366089} 11/07/2021 03:42:56 - INFO - __main__ - Step 45866: {'lr': 0.0003992336748799736, 'samples': 8806272, 'steps': 45865, 'loss/train': 1.241485595703125} 11/07/2021 03:42:56 - INFO - __main__ - Step 45867: {'lr': 0.00039922941729935503, 'samples': 8806464, 'steps': 45866, 'loss/train': 1.6325886249542236} 11/07/2021 03:42:56 - INFO - __main__ - Step 45868: {'lr': 0.000399225159651496, 'samples': 8806656, 'steps': 45867, 'loss/train': 1.3532395362854004} 11/07/2021 03:42:58 - INFO - __main__ - Step 45869: {'lr': 0.0003992209019363984, 'samples': 8806848, 'steps': 45868, 'loss/train': 1.514560580253601} 11/07/2021 03:42:58 - INFO - __main__ - Step 45870: {'lr': 0.0003992166441540641, 'samples': 8807040, 'steps': 45869, 'loss/train': 1.3086159229278564} 11/07/2021 03:42:58 - INFO - __main__ - Step 45871: {'lr': 0.00039921238630449515, 'samples': 8807232, 'steps': 45870, 'loss/train': 1.616173505783081} 11/07/2021 03:42:59 - INFO - __main__ - Step 45872: {'lr': 0.0003992081283876934, 'samples': 8807424, 'steps': 45871, 'loss/train': 1.00423264503479} 11/07/2021 03:42:59 - INFO - __main__ - Step 45873: {'lr': 0.00039920387040366076, 'samples': 8807616, 'steps': 45872, 'loss/train': 1.6508270502090454} 11/07/2021 03:43:00 - INFO - __main__ - Step 45874: {'lr': 0.00039919961235239913, 'samples': 8807808, 'steps': 45873, 'loss/train': 0.48083221912384033} 11/07/2021 03:43:01 - INFO - __main__ - Step 45875: {'lr': 0.0003991953542339105, 'samples': 8808000, 'steps': 45874, 'loss/train': 1.726269006729126} 11/07/2021 03:43:01 - INFO - __main__ - Step 45876: {'lr': 0.00039919109604819676, 'samples': 8808192, 'steps': 45875, 'loss/train': 0.335126668214798} 11/07/2021 03:43:01 - INFO - __main__ - Step 45877: {'lr': 0.00039918683779525976, 'samples': 8808384, 'steps': 45876, 'loss/train': 1.3062458038330078} 11/07/2021 03:43:02 - INFO - __main__ - Step 45878: {'lr': 0.0003991825794751015, 'samples': 8808576, 'steps': 45877, 'loss/train': 1.2087069749832153} 11/07/2021 03:43:02 - INFO - __main__ - Step 45879: {'lr': 0.0003991783210877239, 'samples': 8808768, 'steps': 45878, 'loss/train': 1.5349632501602173} 11/07/2021 03:43:03 - INFO - __main__ - Step 45880: {'lr': 0.00039917406263312885, 'samples': 8808960, 'steps': 45879, 'loss/train': 1.3000528812408447} 11/07/2021 03:43:03 - INFO - __main__ - Step 45881: {'lr': 0.0003991698041113182, 'samples': 8809152, 'steps': 45880, 'loss/train': 1.5514869689941406} 11/07/2021 03:43:04 - INFO - __main__ - Step 45882: {'lr': 0.000399165545522294, 'samples': 8809344, 'steps': 45881, 'loss/train': 1.4706882238388062} 11/07/2021 03:43:04 - INFO - __main__ - Step 45883: {'lr': 0.0003991612868660581, 'samples': 8809536, 'steps': 45882, 'loss/train': 1.2744070291519165} 11/07/2021 03:43:04 - INFO - __main__ - Step 45884: {'lr': 0.0003991570281426124, 'samples': 8809728, 'steps': 45883, 'loss/train': 2.4730279445648193} 11/07/2021 03:43:05 - INFO - __main__ - Step 45885: {'lr': 0.0003991527693519589, 'samples': 8809920, 'steps': 45884, 'loss/train': 1.9310436248779297} 11/07/2021 03:43:06 - INFO - __main__ - Step 45886: {'lr': 0.0003991485104940994, 'samples': 8810112, 'steps': 45885, 'loss/train': 1.477452039718628} 11/07/2021 03:43:06 - INFO - __main__ - Step 45887: {'lr': 0.0003991442515690359, 'samples': 8810304, 'steps': 45886, 'loss/train': 1.0899360179901123} 11/07/2021 03:43:06 - INFO - __main__ - Step 45888: {'lr': 0.00039913999257677025, 'samples': 8810496, 'steps': 45887, 'loss/train': 2.2320735454559326} 11/07/2021 03:43:07 - INFO - __main__ - Step 45889: {'lr': 0.0003991357335173045, 'samples': 8810688, 'steps': 45888, 'loss/train': 1.5972838401794434} 11/07/2021 03:43:08 - INFO - __main__ - Step 45890: {'lr': 0.0003991314743906405, 'samples': 8810880, 'steps': 45889, 'loss/train': 1.3330001831054688} 11/07/2021 03:43:08 - INFO - __main__ - Step 45891: {'lr': 0.0003991272151967801, 'samples': 8811072, 'steps': 45890, 'loss/train': 1.6927576065063477} 11/07/2021 03:43:08 - INFO - __main__ - Step 45892: {'lr': 0.0003991229559357253, 'samples': 8811264, 'steps': 45891, 'loss/train': 1.166650414466858} 11/07/2021 03:43:09 - INFO - __main__ - Step 45893: {'lr': 0.00039911869660747804, 'samples': 8811456, 'steps': 45892, 'loss/train': 1.2685664892196655} 11/07/2021 03:43:09 - INFO - __main__ - Step 45894: {'lr': 0.0003991144372120401, 'samples': 8811648, 'steps': 45893, 'loss/train': 1.2748992443084717} 11/07/2021 03:43:10 - INFO - __main__ - Step 45895: {'lr': 0.0003991101777494136, 'samples': 8811840, 'steps': 45894, 'loss/train': 1.6891132593154907} 11/07/2021 03:43:11 - INFO - __main__ - Step 45896: {'lr': 0.0003991059182196003, 'samples': 8812032, 'steps': 45895, 'loss/train': 1.1875171661376953} 11/07/2021 03:43:11 - INFO - __main__ - Step 45897: {'lr': 0.00039910165862260216, 'samples': 8812224, 'steps': 45896, 'loss/train': 1.4112639427185059} 11/07/2021 03:43:11 - INFO - __main__ - Step 45898: {'lr': 0.0003990973989584211, 'samples': 8812416, 'steps': 45897, 'loss/train': 1.9724289178848267} 11/07/2021 03:43:12 - INFO - __main__ - Step 45899: {'lr': 0.00039909313922705913, 'samples': 8812608, 'steps': 45898, 'loss/train': 1.9549881219863892} 11/07/2021 03:43:13 - INFO - __main__ - Step 45900: {'lr': 0.000399088879428518, 'samples': 8812800, 'steps': 45899, 'loss/train': 2.485609531402588} 11/07/2021 03:43:13 - INFO - __main__ - Step 45901: {'lr': 0.0003990846195627998, 'samples': 8812992, 'steps': 45900, 'loss/train': 1.1123993396759033} 11/07/2021 03:43:13 - INFO - __main__ - Step 45902: {'lr': 0.0003990803596299064, 'samples': 8813184, 'steps': 45901, 'loss/train': 1.7796376943588257} 11/07/2021 03:43:14 - INFO - __main__ - Step 45903: {'lr': 0.0003990760996298396, 'samples': 8813376, 'steps': 45902, 'loss/train': 1.5923742055892944} 11/07/2021 03:43:14 - INFO - __main__ - Step 45904: {'lr': 0.0003990718395626014, 'samples': 8813568, 'steps': 45903, 'loss/train': 1.4409897327423096} 11/07/2021 03:43:14 - INFO - __main__ - Step 45905: {'lr': 0.0003990675794281938, 'samples': 8813760, 'steps': 45904, 'loss/train': 1.350804090499878} 11/07/2021 03:43:15 - INFO - __main__ - Step 45906: {'lr': 0.00039906331922661857, 'samples': 8813952, 'steps': 45905, 'loss/train': 1.6716240644454956} 11/07/2021 03:43:16 - INFO - __main__ - Step 45907: {'lr': 0.00039905905895787775, 'samples': 8814144, 'steps': 45906, 'loss/train': 1.2691327333450317} 11/07/2021 03:43:16 - INFO - __main__ - Step 45908: {'lr': 0.00039905479862197327, 'samples': 8814336, 'steps': 45907, 'loss/train': 1.3989278078079224} 11/07/2021 03:43:16 - INFO - __main__ - Step 45909: {'lr': 0.00039905053821890697, 'samples': 8814528, 'steps': 45908, 'loss/train': 1.3683829307556152} 11/07/2021 03:43:17 - INFO - __main__ - Step 45910: {'lr': 0.0003990462777486808, 'samples': 8814720, 'steps': 45909, 'loss/train': 0.9254510402679443} 11/07/2021 03:43:18 - INFO - __main__ - Step 45911: {'lr': 0.00039904201721129663, 'samples': 8814912, 'steps': 45910, 'loss/train': 1.3869608640670776} 11/07/2021 03:43:18 - INFO - __main__ - Step 45912: {'lr': 0.00039903775660675645, 'samples': 8815104, 'steps': 45911, 'loss/train': 1.241188883781433} 11/07/2021 03:43:18 - INFO - __main__ - Step 45913: {'lr': 0.00039903349593506214, 'samples': 8815296, 'steps': 45912, 'loss/train': 1.3880771398544312} 11/07/2021 03:43:19 - INFO - __main__ - Step 45914: {'lr': 0.0003990292351962157, 'samples': 8815488, 'steps': 45913, 'loss/train': 1.409548044204712} 11/07/2021 03:43:19 - INFO - __main__ - Step 45915: {'lr': 0.00039902497439021895, 'samples': 8815680, 'steps': 45914, 'loss/train': 1.898686170578003} 11/07/2021 03:43:20 - INFO - __main__ - Step 45916: {'lr': 0.0003990207135170738, 'samples': 8815872, 'steps': 45915, 'loss/train': 1.645846962928772} 11/07/2021 03:43:21 - INFO - __main__ - Step 45917: {'lr': 0.00039901645257678234, 'samples': 8816064, 'steps': 45916, 'loss/train': 1.190329909324646} 11/07/2021 03:43:21 - INFO - __main__ - Step 45918: {'lr': 0.0003990121915693462, 'samples': 8816256, 'steps': 45917, 'loss/train': 1.5712729692459106} 11/07/2021 03:43:21 - INFO - __main__ - Step 45919: {'lr': 0.0003990079304947676, 'samples': 8816448, 'steps': 45918, 'loss/train': 1.7502552270889282} 11/07/2021 03:43:22 - INFO - __main__ - Step 45920: {'lr': 0.00039900366935304824, 'samples': 8816640, 'steps': 45919, 'loss/train': 1.6001427173614502} 11/07/2021 03:43:23 - INFO - __main__ - Step 45921: {'lr': 0.0003989994081441902, 'samples': 8816832, 'steps': 45920, 'loss/train': 1.2713396549224854} 11/07/2021 03:43:23 - INFO - __main__ - Step 45922: {'lr': 0.00039899514686819526, 'samples': 8817024, 'steps': 45921, 'loss/train': 1.0933136940002441} 11/07/2021 03:43:23 - INFO - __main__ - Step 45923: {'lr': 0.00039899088552506544, 'samples': 8817216, 'steps': 45922, 'loss/train': 1.9176268577575684} 11/07/2021 03:43:24 - INFO - __main__ - Step 45924: {'lr': 0.00039898662411480264, 'samples': 8817408, 'steps': 45923, 'loss/train': 1.3488006591796875} 11/07/2021 03:43:24 - INFO - __main__ - Step 45925: {'lr': 0.00039898236263740875, 'samples': 8817600, 'steps': 45924, 'loss/train': 1.6533674001693726} 11/07/2021 03:43:25 - INFO - __main__ - Step 45926: {'lr': 0.00039897810109288566, 'samples': 8817792, 'steps': 45925, 'loss/train': 0.8493460416793823} 11/07/2021 03:43:26 - INFO - __main__ - Step 45927: {'lr': 0.0003989738394812354, 'samples': 8817984, 'steps': 45926, 'loss/train': 1.268776535987854} 11/07/2021 03:43:26 - INFO - __main__ - Step 45928: {'lr': 0.0003989695778024598, 'samples': 8818176, 'steps': 45927, 'loss/train': 1.2660177946090698} 11/07/2021 03:43:26 - INFO - __main__ - Step 45929: {'lr': 0.00039896531605656085, 'samples': 8818368, 'steps': 45928, 'loss/train': 1.4705090522766113} 11/07/2021 03:43:27 - INFO - __main__ - Step 45930: {'lr': 0.00039896105424354035, 'samples': 8818560, 'steps': 45929, 'loss/train': 1.8615349531173706} 11/07/2021 03:43:28 - INFO - __main__ - Step 45931: {'lr': 0.0003989567923634003, 'samples': 8818752, 'steps': 45930, 'loss/train': 1.7084077596664429} 11/07/2021 03:43:28 - INFO - __main__ - Step 45932: {'lr': 0.00039895253041614265, 'samples': 8818944, 'steps': 45931, 'loss/train': 1.4123179912567139} 11/07/2021 03:43:28 - INFO - __main__ - Step 45933: {'lr': 0.00039894826840176933, 'samples': 8819136, 'steps': 45932, 'loss/train': 1.1559531688690186} 11/07/2021 03:43:29 - INFO - __main__ - Step 45934: {'lr': 0.00039894400632028217, 'samples': 8819328, 'steps': 45933, 'loss/train': 1.6073169708251953} 11/07/2021 03:43:29 - INFO - __main__ - Step 45935: {'lr': 0.00039893974417168316, 'samples': 8819520, 'steps': 45934, 'loss/train': 1.4241822957992554} 11/07/2021 03:43:30 - INFO - __main__ - Step 45936: {'lr': 0.00039893548195597415, 'samples': 8819712, 'steps': 45935, 'loss/train': 1.7503966093063354} 11/07/2021 03:43:31 - INFO - __main__ - Step 45937: {'lr': 0.0003989312196731572, 'samples': 8819904, 'steps': 45936, 'loss/train': 1.5599544048309326} 11/07/2021 03:43:31 - INFO - __main__ - Step 45938: {'lr': 0.0003989269573232341, 'samples': 8820096, 'steps': 45937, 'loss/train': 1.580369234085083} 11/07/2021 03:43:31 - INFO - __main__ - Step 45939: {'lr': 0.0003989226949062068, 'samples': 8820288, 'steps': 45938, 'loss/train': 1.3357412815093994} 11/07/2021 03:43:32 - INFO - __main__ - Step 45940: {'lr': 0.00039891843242207726, 'samples': 8820480, 'steps': 45939, 'loss/train': 1.5565556287765503} 11/07/2021 03:43:32 - INFO - __main__ - Step 45941: {'lr': 0.00039891416987084726, 'samples': 8820672, 'steps': 45940, 'loss/train': 1.424456238746643} 11/07/2021 03:43:33 - INFO - __main__ - Step 45942: {'lr': 0.00039890990725251896, 'samples': 8820864, 'steps': 45941, 'loss/train': 2.265129327774048} 11/07/2021 03:43:34 - INFO - __main__ - Step 45943: {'lr': 0.0003989056445670941, 'samples': 8821056, 'steps': 45942, 'loss/train': 1.5895918607711792} 11/07/2021 03:43:34 - INFO - __main__ - Step 45944: {'lr': 0.0003989013818145747, 'samples': 8821248, 'steps': 45943, 'loss/train': 1.706114649772644} 11/07/2021 03:43:34 - INFO - __main__ - Step 45945: {'lr': 0.0003988971189949626, 'samples': 8821440, 'steps': 45944, 'loss/train': 1.637607216835022} 11/07/2021 03:43:35 - INFO - __main__ - Step 45946: {'lr': 0.0003988928561082598, 'samples': 8821632, 'steps': 45945, 'loss/train': 1.3578442335128784} 11/07/2021 03:43:35 - INFO - __main__ - Step 45947: {'lr': 0.0003988885931544681, 'samples': 8821824, 'steps': 45946, 'loss/train': 1.0622464418411255} 11/07/2021 03:43:37 - INFO - __main__ - Step 45948: {'lr': 0.0003988843301335895, 'samples': 8822016, 'steps': 45947, 'loss/train': 1.4858222007751465} 11/07/2021 03:43:37 - INFO - __main__ - Step 45949: {'lr': 0.00039888006704562594, 'samples': 8822208, 'steps': 45948, 'loss/train': 1.3465840816497803} 11/07/2021 03:43:37 - INFO - __main__ - Step 45950: {'lr': 0.0003988758038905794, 'samples': 8822400, 'steps': 45949, 'loss/train': 1.2762097120285034} 11/07/2021 03:43:38 - INFO - __main__ - Step 45951: {'lr': 0.00039887154066845166, 'samples': 8822592, 'steps': 45950, 'loss/train': 1.5280088186264038} 11/07/2021 03:43:38 - INFO - __main__ - Step 45952: {'lr': 0.00039886727737924464, 'samples': 8822784, 'steps': 45951, 'loss/train': 0.44151145219802856} 11/07/2021 03:43:39 - INFO - __main__ - Step 45953: {'lr': 0.00039886301402296037, 'samples': 8822976, 'steps': 45952, 'loss/train': 0.2146332859992981} 11/07/2021 03:43:39 - INFO - __main__ - Step 45954: {'lr': 0.00039885875059960074, 'samples': 8823168, 'steps': 45953, 'loss/train': 1.481507420539856} 11/07/2021 03:43:40 - INFO - __main__ - Step 45955: {'lr': 0.0003988544871091676, 'samples': 8823360, 'steps': 45954, 'loss/train': 1.226210355758667} 11/07/2021 03:43:40 - INFO - __main__ - Step 45956: {'lr': 0.000398850223551663, 'samples': 8823552, 'steps': 45955, 'loss/train': 1.7058351039886475} 11/07/2021 03:43:40 - INFO - __main__ - Step 45957: {'lr': 0.00039884595992708877, 'samples': 8823744, 'steps': 45956, 'loss/train': 1.4703701734542847} 11/07/2021 03:43:41 - INFO - __main__ - Step 45958: {'lr': 0.00039884169623544683, 'samples': 8823936, 'steps': 45957, 'loss/train': 1.5774255990982056} 11/07/2021 03:43:42 - INFO - __main__ - Step 45959: {'lr': 0.0003988374324767391, 'samples': 8824128, 'steps': 45958, 'loss/train': 1.3722915649414062} 11/07/2021 03:43:42 - INFO - __main__ - Step 45960: {'lr': 0.0003988331686509675, 'samples': 8824320, 'steps': 45959, 'loss/train': 1.1074728965759277} 11/07/2021 03:43:42 - INFO - __main__ - Step 45961: {'lr': 0.000398828904758134, 'samples': 8824512, 'steps': 45960, 'loss/train': 1.7804874181747437} 11/07/2021 03:43:43 - INFO - __main__ - Step 45962: {'lr': 0.0003988246407982405, 'samples': 8824704, 'steps': 45961, 'loss/train': 1.5980979204177856} 11/07/2021 03:43:43 - INFO - __main__ - Step 45963: {'lr': 0.00039882037677128895, 'samples': 8824896, 'steps': 45962, 'loss/train': 2.0666005611419678} 11/07/2021 03:43:44 - INFO - __main__ - Step 45964: {'lr': 0.0003988161126772812, 'samples': 8825088, 'steps': 45963, 'loss/train': 0.9492558240890503} 11/07/2021 03:43:44 - INFO - __main__ - Step 45965: {'lr': 0.0003988118485162192, 'samples': 8825280, 'steps': 45964, 'loss/train': 1.4683305025100708} 11/07/2021 03:43:45 - INFO - __main__ - Step 45966: {'lr': 0.00039880758428810487, 'samples': 8825472, 'steps': 45965, 'loss/train': 1.331489086151123} 11/07/2021 03:43:45 - INFO - __main__ - Step 45967: {'lr': 0.00039880331999294017, 'samples': 8825664, 'steps': 45966, 'loss/train': 1.5512917041778564} 11/07/2021 03:43:46 - INFO - __main__ - Step 45968: {'lr': 0.00039879905563072694, 'samples': 8825856, 'steps': 45967, 'loss/train': 0.9686439037322998} 11/07/2021 03:43:47 - INFO - __main__ - Step 45969: {'lr': 0.00039879479120146725, 'samples': 8826048, 'steps': 45968, 'loss/train': 1.2109235525131226} 11/07/2021 03:43:47 - INFO - __main__ - Step 45970: {'lr': 0.0003987905267051628, 'samples': 8826240, 'steps': 45969, 'loss/train': 1.4375460147857666} 11/07/2021 03:43:47 - INFO - __main__ - Step 45971: {'lr': 0.0003987862621418157, 'samples': 8826432, 'steps': 45970, 'loss/train': 1.5557785034179688} 11/07/2021 03:43:48 - INFO - __main__ - Step 45972: {'lr': 0.0003987819975114278, 'samples': 8826624, 'steps': 45971, 'loss/train': 1.1495202779769897} 11/07/2021 03:43:48 - INFO - __main__ - Step 45973: {'lr': 0.000398777732814001, 'samples': 8826816, 'steps': 45972, 'loss/train': 1.341982126235962} 11/07/2021 03:43:49 - INFO - __main__ - Step 45974: {'lr': 0.0003987734680495373, 'samples': 8827008, 'steps': 45973, 'loss/train': 1.3633899688720703} 11/07/2021 03:43:49 - INFO - __main__ - Step 45975: {'lr': 0.0003987692032180385, 'samples': 8827200, 'steps': 45974, 'loss/train': 0.8215121030807495} 11/07/2021 03:43:50 - INFO - __main__ - Step 45976: {'lr': 0.00039876493831950664, 'samples': 8827392, 'steps': 45975, 'loss/train': 1.5936977863311768} 11/07/2021 03:43:50 - INFO - __main__ - Step 45977: {'lr': 0.00039876067335394363, 'samples': 8827584, 'steps': 45976, 'loss/train': 1.0867680311203003} 11/07/2021 03:43:50 - INFO - __main__ - Step 45978: {'lr': 0.0003987564083213513, 'samples': 8827776, 'steps': 45977, 'loss/train': 1.5268322229385376} 11/07/2021 03:43:51 - INFO - __main__ - Step 45979: {'lr': 0.00039875214322173167, 'samples': 8827968, 'steps': 45978, 'loss/train': 1.3771189451217651} 11/07/2021 03:43:52 - INFO - __main__ - Step 45980: {'lr': 0.00039874787805508656, 'samples': 8828160, 'steps': 45979, 'loss/train': 1.4589320421218872} 11/07/2021 03:43:52 - INFO - __main__ - Step 45981: {'lr': 0.000398743612821418, 'samples': 8828352, 'steps': 45980, 'loss/train': 1.7522274255752563} 11/07/2021 03:43:52 - INFO - __main__ - Step 45982: {'lr': 0.0003987393475207278, 'samples': 8828544, 'steps': 45981, 'loss/train': 1.7427177429199219} 11/07/2021 03:43:53 - INFO - __main__ - Step 45983: {'lr': 0.000398735082153018, 'samples': 8828736, 'steps': 45982, 'loss/train': 1.525829792022705} 11/07/2021 03:43:54 - INFO - __main__ - Step 45984: {'lr': 0.00039873081671829046, 'samples': 8828928, 'steps': 45983, 'loss/train': 2.171912670135498} 11/07/2021 03:43:54 - INFO - __main__ - Step 45985: {'lr': 0.0003987265512165471, 'samples': 8829120, 'steps': 45984, 'loss/train': 1.4670724868774414} 11/07/2021 03:43:55 - INFO - __main__ - Step 45986: {'lr': 0.0003987222856477899, 'samples': 8829312, 'steps': 45985, 'loss/train': 1.321970820426941} 11/07/2021 03:43:55 - INFO - __main__ - Step 45987: {'lr': 0.0003987180200120207, 'samples': 8829504, 'steps': 45986, 'loss/train': 1.2274272441864014} 11/07/2021 03:43:56 - INFO - __main__ - Step 45988: {'lr': 0.0003987137543092414, 'samples': 8829696, 'steps': 45987, 'loss/train': 1.7335909605026245} 11/07/2021 03:43:56 - INFO - __main__ - Step 45989: {'lr': 0.0003987094885394541, 'samples': 8829888, 'steps': 45988, 'loss/train': 1.610555648803711} 11/07/2021 03:43:57 - INFO - __main__ - Step 45990: {'lr': 0.0003987052227026605, 'samples': 8830080, 'steps': 45989, 'loss/train': 1.0616350173950195} 11/07/2021 03:43:57 - INFO - __main__ - Step 45991: {'lr': 0.0003987009567988626, 'samples': 8830272, 'steps': 45990, 'loss/train': 1.3533833026885986} 11/07/2021 03:43:58 - INFO - __main__ - Step 45992: {'lr': 0.00039869669082806243, 'samples': 8830464, 'steps': 45991, 'loss/train': 0.27393484115600586} 11/07/2021 03:43:58 - INFO - __main__ - Step 45993: {'lr': 0.0003986924247902618, 'samples': 8830656, 'steps': 45992, 'loss/train': 1.3827497959136963} 11/07/2021 03:43:59 - INFO - __main__ - Step 45994: {'lr': 0.00039868815868546257, 'samples': 8830848, 'steps': 45993, 'loss/train': 2.0358173847198486} 11/07/2021 03:43:59 - INFO - __main__ - Step 45995: {'lr': 0.00039868389251366686, 'samples': 8831040, 'steps': 45994, 'loss/train': 2.121223211288452} 11/07/2021 03:44:00 - INFO - __main__ - Step 45996: {'lr': 0.00039867962627487645, 'samples': 8831232, 'steps': 45995, 'loss/train': 1.6265567541122437} 11/07/2021 03:44:00 - INFO - __main__ - Step 45997: {'lr': 0.0003986753599690933, 'samples': 8831424, 'steps': 45996, 'loss/train': 1.1627484560012817} 11/07/2021 03:44:00 - INFO - __main__ - Step 45998: {'lr': 0.00039867109359631935, 'samples': 8831616, 'steps': 45997, 'loss/train': 1.7465667724609375} 11/07/2021 03:44:01 - INFO - __main__ - Step 45999: {'lr': 0.00039866682715655646, 'samples': 8831808, 'steps': 45998, 'loss/train': 1.0794578790664673} 11/07/2021 03:44:02 - INFO - __main__ - Step 46000: {'lr': 0.00039866256064980657, 'samples': 8832000, 'steps': 45999, 'loss/train': 1.949453592300415} 11/07/2021 03:44:02 - INFO - __main__ - Step 46001: {'lr': 0.0003986582940760717, 'samples': 8832192, 'steps': 46000, 'loss/train': 1.27757728099823} 11/07/2021 03:44:02 - INFO - __main__ - Step 46002: {'lr': 0.0003986540274353536, 'samples': 8832384, 'steps': 46001, 'loss/train': 1.7763686180114746} 11/07/2021 03:44:03 - INFO - __main__ - Step 46003: {'lr': 0.00039864976072765437, 'samples': 8832576, 'steps': 46002, 'loss/train': 1.5143035650253296} 11/07/2021 03:44:03 - INFO - __main__ - Step 46004: {'lr': 0.0003986454939529758, 'samples': 8832768, 'steps': 46003, 'loss/train': 1.384948968887329} 11/07/2021 03:44:04 - INFO - __main__ - Step 46005: {'lr': 0.0003986412271113199, 'samples': 8832960, 'steps': 46004, 'loss/train': 1.4078046083450317} 11/07/2021 03:44:04 - INFO - __main__ - Step 46006: {'lr': 0.0003986369602026886, 'samples': 8833152, 'steps': 46005, 'loss/train': 1.493535041809082} 11/07/2021 03:44:05 - INFO - __main__ - Step 46007: {'lr': 0.0003986326932270836, 'samples': 8833344, 'steps': 46006, 'loss/train': 1.1166813373565674} 11/07/2021 03:44:05 - INFO - __main__ - Step 46008: {'lr': 0.00039862842618450717, 'samples': 8833536, 'steps': 46007, 'loss/train': 2.2529072761535645} 11/07/2021 03:44:05 - INFO - __main__ - Step 46009: {'lr': 0.00039862415907496103, 'samples': 8833728, 'steps': 46008, 'loss/train': 1.8821719884872437} 11/07/2021 03:44:07 - INFO - __main__ - Step 46010: {'lr': 0.00039861989189844715, 'samples': 8833920, 'steps': 46009, 'loss/train': 1.2184703350067139} 11/07/2021 03:44:07 - INFO - __main__ - Step 46011: {'lr': 0.00039861562465496735, 'samples': 8834112, 'steps': 46010, 'loss/train': 1.2818342447280884} 11/07/2021 03:44:07 - INFO - __main__ - Step 46012: {'lr': 0.00039861135734452376, 'samples': 8834304, 'steps': 46011, 'loss/train': 1.7484780550003052} 11/07/2021 03:44:08 - INFO - __main__ - Step 46013: {'lr': 0.00039860708996711816, 'samples': 8834496, 'steps': 46012, 'loss/train': 1.7105348110198975} 11/07/2021 03:44:08 - INFO - __main__ - Step 46014: {'lr': 0.00039860282252275245, 'samples': 8834688, 'steps': 46013, 'loss/train': 0.4886361360549927} 11/07/2021 03:44:09 - INFO - __main__ - Step 46015: {'lr': 0.0003985985550114286, 'samples': 8834880, 'steps': 46014, 'loss/train': 1.743740439414978} 11/07/2021 03:44:10 - INFO - __main__ - Step 46016: {'lr': 0.00039859428743314857, 'samples': 8835072, 'steps': 46015, 'loss/train': 1.7782355546951294} 11/07/2021 03:44:10 - INFO - __main__ - Step 46017: {'lr': 0.0003985900197879142, 'samples': 8835264, 'steps': 46016, 'loss/train': 2.0538876056671143} 11/07/2021 03:44:10 - INFO - __main__ - Step 46018: {'lr': 0.00039858575207572756, 'samples': 8835456, 'steps': 46017, 'loss/train': 1.385968804359436} 11/07/2021 03:44:11 - INFO - __main__ - Step 46019: {'lr': 0.00039858148429659036, 'samples': 8835648, 'steps': 46018, 'loss/train': 1.4984182119369507} 11/07/2021 03:44:11 - INFO - __main__ - Step 46020: {'lr': 0.0003985772164505047, 'samples': 8835840, 'steps': 46019, 'loss/train': 0.7730764746665955} 11/07/2021 03:44:11 - INFO - __main__ - Step 46021: {'lr': 0.0003985729485374724, 'samples': 8836032, 'steps': 46020, 'loss/train': 1.6437476873397827} 11/07/2021 03:44:13 - INFO - __main__ - Step 46022: {'lr': 0.0003985686805574954, 'samples': 8836224, 'steps': 46021, 'loss/train': 1.77516770362854} 11/07/2021 03:44:13 - INFO - __main__ - Step 46023: {'lr': 0.00039856441251057573, 'samples': 8836416, 'steps': 46022, 'loss/train': 1.629174828529358} 11/07/2021 03:44:13 - INFO - __main__ - Step 46024: {'lr': 0.0003985601443967152, 'samples': 8836608, 'steps': 46023, 'loss/train': 1.600903868675232} 11/07/2021 03:44:14 - INFO - __main__ - Step 46025: {'lr': 0.0003985558762159157, 'samples': 8836800, 'steps': 46024, 'loss/train': 1.515991449356079} 11/07/2021 03:44:14 - INFO - __main__ - Step 46026: {'lr': 0.0003985516079681793, 'samples': 8836992, 'steps': 46025, 'loss/train': 1.7451043128967285} 11/07/2021 03:44:15 - INFO - __main__ - Step 46027: {'lr': 0.0003985473396535078, 'samples': 8837184, 'steps': 46026, 'loss/train': 1.4028428792953491} 11/07/2021 03:44:15 - INFO - __main__ - Step 46028: {'lr': 0.00039854307127190316, 'samples': 8837376, 'steps': 46027, 'loss/train': 1.4161070585250854} 11/07/2021 03:44:16 - INFO - __main__ - Step 46029: {'lr': 0.0003985388028233673, 'samples': 8837568, 'steps': 46028, 'loss/train': 1.7701877355575562} 11/07/2021 03:44:16 - INFO - __main__ - Step 46030: {'lr': 0.0003985345343079022, 'samples': 8837760, 'steps': 46029, 'loss/train': 1.0398225784301758} 11/07/2021 03:44:16 - INFO - __main__ - Step 46031: {'lr': 0.00039853026572550965, 'samples': 8837952, 'steps': 46030, 'loss/train': 2.184058666229248} 11/07/2021 03:44:17 - INFO - __main__ - Step 46032: {'lr': 0.0003985259970761917, 'samples': 8838144, 'steps': 46031, 'loss/train': 1.8846038579940796} 11/07/2021 03:44:18 - INFO - __main__ - Step 46033: {'lr': 0.0003985217283599502, 'samples': 8838336, 'steps': 46032, 'loss/train': 1.2859147787094116} 11/07/2021 03:44:18 - INFO - __main__ - Step 46034: {'lr': 0.0003985174595767871, 'samples': 8838528, 'steps': 46033, 'loss/train': 1.7278329133987427} 11/07/2021 03:44:18 - INFO - __main__ - Step 46035: {'lr': 0.0003985131907267043, 'samples': 8838720, 'steps': 46034, 'loss/train': 1.7227964401245117} 11/07/2021 03:44:19 - INFO - __main__ - Step 46036: {'lr': 0.00039850892180970387, 'samples': 8838912, 'steps': 46035, 'loss/train': 2.0506961345672607} 11/07/2021 03:44:20 - INFO - __main__ - Step 46037: {'lr': 0.0003985046528257875, 'samples': 8839104, 'steps': 46036, 'loss/train': 0.9985599517822266} 11/07/2021 03:44:20 - INFO - __main__ - Step 46038: {'lr': 0.00039850038377495727, 'samples': 8839296, 'steps': 46037, 'loss/train': 1.7415443658828735} 11/07/2021 03:44:21 - INFO - __main__ - Step 46039: {'lr': 0.000398496114657215, 'samples': 8839488, 'steps': 46038, 'loss/train': 1.8762344121932983} 11/07/2021 03:44:21 - INFO - __main__ - Step 46040: {'lr': 0.0003984918454725628, 'samples': 8839680, 'steps': 46039, 'loss/train': 1.3955578804016113} 11/07/2021 03:44:21 - INFO - __main__ - Step 46041: {'lr': 0.0003984875762210023, 'samples': 8839872, 'steps': 46040, 'loss/train': 1.3260287046432495} 11/07/2021 03:44:22 - INFO - __main__ - Step 46042: {'lr': 0.0003984833069025357, 'samples': 8840064, 'steps': 46041, 'loss/train': 1.4703521728515625} 11/07/2021 03:44:23 - INFO - __main__ - Step 46043: {'lr': 0.00039847903751716486, 'samples': 8840256, 'steps': 46042, 'loss/train': 1.4357061386108398} 11/07/2021 03:44:23 - INFO - __main__ - Step 46044: {'lr': 0.00039847476806489153, 'samples': 8840448, 'steps': 46043, 'loss/train': 0.6424424052238464} 11/07/2021 03:44:23 - INFO - __main__ - Step 46045: {'lr': 0.00039847049854571784, 'samples': 8840640, 'steps': 46044, 'loss/train': 1.3334059715270996} 11/07/2021 03:44:24 - INFO - __main__ - Step 46046: {'lr': 0.00039846622895964556, 'samples': 8840832, 'steps': 46045, 'loss/train': 1.4812532663345337} 11/07/2021 03:44:25 - INFO - __main__ - Step 46047: {'lr': 0.0003984619593066767, 'samples': 8841024, 'steps': 46046, 'loss/train': 1.4225276708602905} 11/07/2021 03:44:25 - INFO - __main__ - Step 46048: {'lr': 0.0003984576895868132, 'samples': 8841216, 'steps': 46047, 'loss/train': 1.6545523405075073} 11/07/2021 03:44:25 - INFO - __main__ - Step 46049: {'lr': 0.000398453419800057, 'samples': 8841408, 'steps': 46048, 'loss/train': 1.4333152770996094} 11/07/2021 03:44:26 - INFO - __main__ - Step 46050: {'lr': 0.00039844914994640994, 'samples': 8841600, 'steps': 46049, 'loss/train': 1.543885350227356} 11/07/2021 03:44:26 - INFO - __main__ - Step 46051: {'lr': 0.00039844488002587397, 'samples': 8841792, 'steps': 46050, 'loss/train': 1.1068906784057617} 11/07/2021 03:44:27 - INFO - __main__ - Step 46052: {'lr': 0.00039844061003845114, 'samples': 8841984, 'steps': 46051, 'loss/train': 1.8480699062347412} 11/07/2021 03:44:28 - INFO - __main__ - Step 46053: {'lr': 0.00039843633998414306, 'samples': 8842176, 'steps': 46052, 'loss/train': 1.6766722202301025} 11/07/2021 03:44:28 - INFO - __main__ - Step 46054: {'lr': 0.000398432069862952, 'samples': 8842368, 'steps': 46053, 'loss/train': 1.7152425050735474} 11/07/2021 03:44:29 - INFO - __main__ - Step 46055: {'lr': 0.00039842779967487967, 'samples': 8842560, 'steps': 46054, 'loss/train': 0.5176750421524048} 11/07/2021 03:44:29 - INFO - __main__ - Step 46056: {'lr': 0.0003984235294199281, 'samples': 8842752, 'steps': 46055, 'loss/train': 0.8775113821029663} 11/07/2021 03:44:29 - INFO - __main__ - Step 46057: {'lr': 0.0003984192590980992, 'samples': 8842944, 'steps': 46056, 'loss/train': 1.3182798624038696} 11/07/2021 03:44:30 - INFO - __main__ - Step 46058: {'lr': 0.00039841498870939483, 'samples': 8843136, 'steps': 46057, 'loss/train': 0.20144513249397278} 11/07/2021 03:44:31 - INFO - __main__ - Step 46059: {'lr': 0.000398410718253817, 'samples': 8843328, 'steps': 46058, 'loss/train': 1.1564953327178955} 11/07/2021 03:44:31 - INFO - __main__ - Step 46060: {'lr': 0.00039840644773136757, 'samples': 8843520, 'steps': 46059, 'loss/train': 1.2651718854904175} 11/07/2021 03:44:31 - INFO - __main__ - Step 46061: {'lr': 0.0003984021771420484, 'samples': 8843712, 'steps': 46060, 'loss/train': 1.7208924293518066} 11/07/2021 03:44:32 - INFO - __main__ - Step 46062: {'lr': 0.0003983979064858616, 'samples': 8843904, 'steps': 46061, 'loss/train': 1.408997654914856} 11/07/2021 03:44:33 - INFO - __main__ - Step 46063: {'lr': 0.000398393635762809, 'samples': 8844096, 'steps': 46062, 'loss/train': 1.4301825761795044} 11/07/2021 03:44:33 - INFO - __main__ - Step 46064: {'lr': 0.0003983893649728925, 'samples': 8844288, 'steps': 46063, 'loss/train': 1.620592474937439} 11/07/2021 03:44:34 - INFO - __main__ - Step 46065: {'lr': 0.000398385094116114, 'samples': 8844480, 'steps': 46064, 'loss/train': 1.2645494937896729} 11/07/2021 03:44:34 - INFO - __main__ - Step 46066: {'lr': 0.0003983808231924755, 'samples': 8844672, 'steps': 46065, 'loss/train': 1.045702338218689} 11/07/2021 03:44:34 - INFO - __main__ - Step 46067: {'lr': 0.0003983765522019789, 'samples': 8844864, 'steps': 46066, 'loss/train': 1.748497486114502} 11/07/2021 03:44:35 - INFO - __main__ - Step 46068: {'lr': 0.0003983722811446261, 'samples': 8845056, 'steps': 46067, 'loss/train': 1.3432093858718872} 11/07/2021 03:44:36 - INFO - __main__ - Step 46069: {'lr': 0.00039836801002041903, 'samples': 8845248, 'steps': 46068, 'loss/train': 1.3738770484924316} 11/07/2021 03:44:36 - INFO - __main__ - Step 46070: {'lr': 0.00039836373882935967, 'samples': 8845440, 'steps': 46069, 'loss/train': 1.4351294040679932} 11/07/2021 03:44:36 - INFO - __main__ - Step 46071: {'lr': 0.0003983594675714498, 'samples': 8845632, 'steps': 46070, 'loss/train': 1.1481633186340332} 11/07/2021 03:44:37 - INFO - __main__ - Step 46072: {'lr': 0.0003983551962466915, 'samples': 8845824, 'steps': 46071, 'loss/train': 1.5206433534622192} 11/07/2021 03:44:37 - INFO - __main__ - Step 46073: {'lr': 0.0003983509248550867, 'samples': 8846016, 'steps': 46072, 'loss/train': 1.1233588457107544} 11/07/2021 03:44:38 - INFO - __main__ - Step 46074: {'lr': 0.00039834665339663725, 'samples': 8846208, 'steps': 46073, 'loss/train': 1.2450916767120361} 11/07/2021 03:44:38 - INFO - __main__ - Step 46075: {'lr': 0.00039834238187134497, 'samples': 8846400, 'steps': 46074, 'loss/train': 1.1505430936813354} 11/07/2021 03:44:39 - INFO - __main__ - Step 46076: {'lr': 0.00039833811027921196, 'samples': 8846592, 'steps': 46075, 'loss/train': 1.3425941467285156} 11/07/2021 03:44:39 - INFO - __main__ - Step 46077: {'lr': 0.00039833383862024016, 'samples': 8846784, 'steps': 46076, 'loss/train': 1.7394100427627563} 11/07/2021 03:44:39 - INFO - __main__ - Step 46078: {'lr': 0.00039832956689443135, 'samples': 8846976, 'steps': 46077, 'loss/train': 1.5554648637771606} 11/07/2021 03:44:40 - INFO - __main__ - Step 46079: {'lr': 0.00039832529510178756, 'samples': 8847168, 'steps': 46078, 'loss/train': 1.199279546737671} 11/07/2021 03:44:41 - INFO - __main__ - Step 46080: {'lr': 0.0003983210232423107, 'samples': 8847360, 'steps': 46079, 'loss/train': 1.7897357940673828} 11/07/2021 03:44:41 - INFO - __main__ - Step 46081: {'lr': 0.00039831675131600253, 'samples': 8847552, 'steps': 46080, 'loss/train': 1.701612949371338} 11/07/2021 03:44:41 - INFO - __main__ - Step 46082: {'lr': 0.0003983124793228653, 'samples': 8847744, 'steps': 46081, 'loss/train': 1.5686990022659302} 11/07/2021 03:44:42 - INFO - __main__ - Step 46083: {'lr': 0.00039830820726290063, 'samples': 8847936, 'steps': 46082, 'loss/train': 1.6925108432769775} 11/07/2021 03:44:43 - INFO - __main__ - Step 46084: {'lr': 0.0003983039351361106, 'samples': 8848128, 'steps': 46083, 'loss/train': 1.5960572957992554} 11/07/2021 03:44:43 - INFO - __main__ - Step 46085: {'lr': 0.0003982996629424972, 'samples': 8848320, 'steps': 46084, 'loss/train': 1.156279444694519} 11/07/2021 03:44:43 - INFO - __main__ - Step 46086: {'lr': 0.0003982953906820622, 'samples': 8848512, 'steps': 46085, 'loss/train': 1.071182131767273} 11/07/2021 03:44:44 - INFO - __main__ - Step 46087: {'lr': 0.0003982911183548075, 'samples': 8848704, 'steps': 46086, 'loss/train': 1.5461595058441162} 11/07/2021 03:44:44 - INFO - __main__ - Step 46088: {'lr': 0.0003982868459607352, 'samples': 8848896, 'steps': 46087, 'loss/train': 1.517485499382019} 11/07/2021 03:44:45 - INFO - __main__ - Step 46089: {'lr': 0.0003982825734998471, 'samples': 8849088, 'steps': 46088, 'loss/train': 1.4021140336990356} 11/07/2021 03:44:46 - INFO - __main__ - Step 46090: {'lr': 0.0003982783009721452, 'samples': 8849280, 'steps': 46089, 'loss/train': 1.940238118171692} 11/07/2021 03:44:46 - INFO - __main__ - Step 46091: {'lr': 0.00039827402837763136, 'samples': 8849472, 'steps': 46090, 'loss/train': 1.4375869035720825} 11/07/2021 03:44:46 - INFO - __main__ - Step 46092: {'lr': 0.00039826975571630754, 'samples': 8849664, 'steps': 46091, 'loss/train': 0.8164916634559631} 11/07/2021 03:44:47 - INFO - __main__ - Step 46093: {'lr': 0.0003982654829881757, 'samples': 8849856, 'steps': 46092, 'loss/train': 0.9763711094856262} 11/07/2021 03:44:48 - INFO - __main__ - Step 46094: {'lr': 0.0003982612101932376, 'samples': 8850048, 'steps': 46093, 'loss/train': 1.353725790977478} 11/07/2021 03:44:48 - INFO - __main__ - Step 46095: {'lr': 0.0003982569373314954, 'samples': 8850240, 'steps': 46094, 'loss/train': 1.9268068075180054} 11/07/2021 03:44:48 - INFO - __main__ - Step 46096: {'lr': 0.0003982526644029508, 'samples': 8850432, 'steps': 46095, 'loss/train': 1.31769859790802} 11/07/2021 03:44:49 - INFO - __main__ - Step 46097: {'lr': 0.000398248391407606, 'samples': 8850624, 'steps': 46096, 'loss/train': 1.2756290435791016} 11/07/2021 03:44:49 - INFO - __main__ - Step 46098: {'lr': 0.0003982441183454627, 'samples': 8850816, 'steps': 46097, 'loss/train': 1.1033422946929932} 11/07/2021 03:44:49 - INFO - __main__ - Step 46099: {'lr': 0.0003982398452165228, 'samples': 8851008, 'steps': 46098, 'loss/train': 1.5437897443771362} 11/07/2021 03:44:50 - INFO - __main__ - Step 46100: {'lr': 0.0003982355720207884, 'samples': 8851200, 'steps': 46099, 'loss/train': 1.0908828973770142} 11/07/2021 03:44:51 - INFO - __main__ - Step 46101: {'lr': 0.00039823129875826127, 'samples': 8851392, 'steps': 46100, 'loss/train': 1.4641016721725464} 11/07/2021 03:44:51 - INFO - __main__ - Step 46102: {'lr': 0.0003982270254289435, 'samples': 8851584, 'steps': 46101, 'loss/train': 1.1825180053710938} 11/07/2021 03:44:51 - INFO - __main__ - Step 46103: {'lr': 0.0003982227520328368, 'samples': 8851776, 'steps': 46102, 'loss/train': 1.260934829711914} 11/07/2021 03:44:52 - INFO - __main__ - Step 46104: {'lr': 0.0003982184785699433, 'samples': 8851968, 'steps': 46103, 'loss/train': 1.7066675424575806} 11/07/2021 03:44:53 - INFO - __main__ - Step 46105: {'lr': 0.00039821420504026486, 'samples': 8852160, 'steps': 46104, 'loss/train': 1.4572490453720093} 11/07/2021 03:44:53 - INFO - __main__ - Step 46106: {'lr': 0.00039820993144380333, 'samples': 8852352, 'steps': 46105, 'loss/train': 1.6005284786224365} 11/07/2021 03:44:54 - INFO - __main__ - Step 46107: {'lr': 0.0003982056577805607, 'samples': 8852544, 'steps': 46106, 'loss/train': 1.1769062280654907} 11/07/2021 03:44:54 - INFO - __main__ - Step 46108: {'lr': 0.00039820138405053887, 'samples': 8852736, 'steps': 46107, 'loss/train': 1.3909072875976562} 11/07/2021 03:44:54 - INFO - __main__ - Step 46109: {'lr': 0.0003981971102537398, 'samples': 8852928, 'steps': 46108, 'loss/train': 1.658613920211792} 11/07/2021 03:44:55 - INFO - __main__ - Step 46110: {'lr': 0.00039819283639016547, 'samples': 8853120, 'steps': 46109, 'loss/train': 1.5829380750656128} 11/07/2021 03:44:56 - INFO - __main__ - Step 46111: {'lr': 0.00039818856245981766, 'samples': 8853312, 'steps': 46110, 'loss/train': 2.1844825744628906} 11/07/2021 03:44:56 - INFO - __main__ - Step 46112: {'lr': 0.0003981842884626984, 'samples': 8853504, 'steps': 46111, 'loss/train': 0.8375652432441711} 11/07/2021 03:44:56 - INFO - __main__ - Step 46113: {'lr': 0.0003981800143988095, 'samples': 8853696, 'steps': 46112, 'loss/train': 1.705703854560852} 11/07/2021 03:44:57 - INFO - __main__ - Step 46114: {'lr': 0.00039817574026815305, 'samples': 8853888, 'steps': 46113, 'loss/train': 1.6477971076965332} 11/07/2021 03:44:58 - INFO - __main__ - Step 46115: {'lr': 0.0003981714660707309, 'samples': 8854080, 'steps': 46114, 'loss/train': 1.4718902111053467} 11/07/2021 03:44:58 - INFO - __main__ - Step 46116: {'lr': 0.00039816719180654493, 'samples': 8854272, 'steps': 46115, 'loss/train': 0.8173640370368958} 11/07/2021 03:44:58 - INFO - __main__ - Step 46117: {'lr': 0.0003981629174755972, 'samples': 8854464, 'steps': 46116, 'loss/train': 1.6102850437164307} 11/07/2021 03:44:59 - INFO - __main__ - Step 46118: {'lr': 0.0003981586430778895, 'samples': 8854656, 'steps': 46117, 'loss/train': 1.548747181892395} 11/07/2021 03:44:59 - INFO - __main__ - Step 46119: {'lr': 0.0003981543686134238, 'samples': 8854848, 'steps': 46118, 'loss/train': 1.253394603729248} 11/07/2021 03:45:00 - INFO - __main__ - Step 46120: {'lr': 0.000398150094082202, 'samples': 8855040, 'steps': 46119, 'loss/train': 1.7422831058502197} 11/07/2021 03:45:00 - INFO - __main__ - Step 46121: {'lr': 0.000398145819484226, 'samples': 8855232, 'steps': 46120, 'loss/train': 1.6735376119613647} 11/07/2021 03:45:01 - INFO - __main__ - Step 46122: {'lr': 0.00039814154481949786, 'samples': 8855424, 'steps': 46121, 'loss/train': 1.60072660446167} 11/07/2021 03:45:01 - INFO - __main__ - Step 46123: {'lr': 0.00039813727008801945, 'samples': 8855616, 'steps': 46122, 'loss/train': 1.3049921989440918} 11/07/2021 03:45:02 - INFO - __main__ - Step 46124: {'lr': 0.00039813299528979263, 'samples': 8855808, 'steps': 46123, 'loss/train': 1.8983211517333984} 11/07/2021 03:45:03 - INFO - __main__ - Step 46125: {'lr': 0.0003981287204248194, 'samples': 8856000, 'steps': 46124, 'loss/train': 1.713747262954712} 11/07/2021 03:45:03 - INFO - __main__ - Step 46126: {'lr': 0.0003981244454931017, 'samples': 8856192, 'steps': 46125, 'loss/train': 1.5147662162780762} 11/07/2021 03:45:03 - INFO - __main__ - Step 46127: {'lr': 0.00039812017049464126, 'samples': 8856384, 'steps': 46126, 'loss/train': 1.7749470472335815} 11/07/2021 03:45:04 - INFO - __main__ - Step 46128: {'lr': 0.0003981158954294403, 'samples': 8856576, 'steps': 46127, 'loss/train': 1.315519094467163} 11/07/2021 03:45:04 - INFO - __main__ - Step 46129: {'lr': 0.00039811162029750047, 'samples': 8856768, 'steps': 46128, 'loss/train': 1.5389231443405151} 11/07/2021 03:45:04 - INFO - __main__ - Step 46130: {'lr': 0.00039810734509882395, 'samples': 8856960, 'steps': 46129, 'loss/train': 1.6367995738983154} 11/07/2021 03:45:05 - INFO - __main__ - Step 46131: {'lr': 0.0003981030698334125, 'samples': 8857152, 'steps': 46130, 'loss/train': 0.968117892742157} 11/07/2021 03:45:06 - INFO - __main__ - Step 46132: {'lr': 0.00039809879450126805, 'samples': 8857344, 'steps': 46131, 'loss/train': 1.5910096168518066} 11/07/2021 03:45:06 - INFO - __main__ - Step 46133: {'lr': 0.00039809451910239257, 'samples': 8857536, 'steps': 46132, 'loss/train': 1.5100864171981812} 11/07/2021 03:45:06 - INFO - __main__ - Step 46134: {'lr': 0.000398090243636788, 'samples': 8857728, 'steps': 46133, 'loss/train': 1.3803400993347168} 11/07/2021 03:45:07 - INFO - __main__ - Step 46135: {'lr': 0.00039808596810445636, 'samples': 8857920, 'steps': 46134, 'loss/train': 1.642842411994934} 11/07/2021 03:45:08 - INFO - __main__ - Step 46136: {'lr': 0.0003980816925053994, 'samples': 8858112, 'steps': 46135, 'loss/train': 1.5238131284713745} 11/07/2021 03:45:08 - INFO - __main__ - Step 46137: {'lr': 0.0003980774168396191, 'samples': 8858304, 'steps': 46136, 'loss/train': 1.399741291999817} 11/07/2021 03:45:08 - INFO - __main__ - Step 46138: {'lr': 0.00039807314110711735, 'samples': 8858496, 'steps': 46137, 'loss/train': 1.3536529541015625} 11/07/2021 03:45:09 - INFO - __main__ - Step 46139: {'lr': 0.0003980688653078962, 'samples': 8858688, 'steps': 46138, 'loss/train': 1.0252282619476318} 11/07/2021 03:45:09 - INFO - __main__ - Step 46140: {'lr': 0.00039806458944195743, 'samples': 8858880, 'steps': 46139, 'loss/train': 2.035419464111328} 11/07/2021 03:45:10 - INFO - __main__ - Step 46141: {'lr': 0.00039806031350930315, 'samples': 8859072, 'steps': 46140, 'loss/train': 1.6623613834381104} 11/07/2021 03:45:11 - INFO - __main__ - Step 46142: {'lr': 0.00039805603750993514, 'samples': 8859264, 'steps': 46141, 'loss/train': 1.3882590532302856} 11/07/2021 03:45:11 - INFO - __main__ - Step 46143: {'lr': 0.0003980517614438553, 'samples': 8859456, 'steps': 46142, 'loss/train': 2.1687562465667725} 11/07/2021 03:45:11 - INFO - __main__ - Step 46144: {'lr': 0.00039804748531106565, 'samples': 8859648, 'steps': 46143, 'loss/train': 1.4556162357330322} 11/07/2021 03:45:12 - INFO - __main__ - Step 46145: {'lr': 0.0003980432091115681, 'samples': 8859840, 'steps': 46144, 'loss/train': 1.7229753732681274} 11/07/2021 03:45:13 - INFO - __main__ - Step 46146: {'lr': 0.0003980389328453646, 'samples': 8860032, 'steps': 46145, 'loss/train': 1.2073771953582764} 11/07/2021 03:45:13 - INFO - __main__ - Step 46147: {'lr': 0.00039803465651245694, 'samples': 8860224, 'steps': 46146, 'loss/train': 1.692874550819397} 11/07/2021 03:45:13 - INFO - __main__ - Step 46148: {'lr': 0.00039803038011284724, 'samples': 8860416, 'steps': 46147, 'loss/train': 1.5015639066696167} 11/07/2021 03:45:14 - INFO - __main__ - Step 46149: {'lr': 0.00039802610364653737, 'samples': 8860608, 'steps': 46148, 'loss/train': 1.696057915687561} 11/07/2021 03:45:14 - INFO - __main__ - Step 46150: {'lr': 0.00039802182711352906, 'samples': 8860800, 'steps': 46149, 'loss/train': 1.4315892457962036} 11/07/2021 03:45:15 - INFO - __main__ - Step 46151: {'lr': 0.0003980175505138246, 'samples': 8860992, 'steps': 46150, 'loss/train': 1.3779420852661133} 11/07/2021 03:45:15 - INFO - __main__ - Step 46152: {'lr': 0.0003980132738474256, 'samples': 8861184, 'steps': 46151, 'loss/train': 1.1602864265441895} 11/07/2021 03:45:16 - INFO - __main__ - Step 46153: {'lr': 0.0003980089971143341, 'samples': 8861376, 'steps': 46152, 'loss/train': 1.6327952146530151} 11/07/2021 03:45:16 - INFO - __main__ - Step 46154: {'lr': 0.000398004720314552, 'samples': 8861568, 'steps': 46153, 'loss/train': 2.127473831176758} 11/07/2021 03:45:17 - INFO - __main__ - Step 46155: {'lr': 0.00039800044344808134, 'samples': 8861760, 'steps': 46154, 'loss/train': 1.058316707611084} 11/07/2021 03:45:18 - INFO - __main__ - Step 46156: {'lr': 0.00039799616651492394, 'samples': 8861952, 'steps': 46155, 'loss/train': 1.5582176446914673} 11/07/2021 03:45:18 - INFO - __main__ - Step 46157: {'lr': 0.00039799188951508176, 'samples': 8862144, 'steps': 46156, 'loss/train': 1.5612353086471558} 11/07/2021 03:45:18 - INFO - __main__ - Step 46158: {'lr': 0.0003979876124485567, 'samples': 8862336, 'steps': 46157, 'loss/train': 0.9640724062919617} 11/07/2021 03:45:19 - INFO - __main__ - Step 46159: {'lr': 0.0003979833353153507, 'samples': 8862528, 'steps': 46158, 'loss/train': 1.6415303945541382} 11/07/2021 03:45:19 - INFO - __main__ - Step 46160: {'lr': 0.00039797905811546564, 'samples': 8862720, 'steps': 46159, 'loss/train': 1.5795783996582031} 11/07/2021 03:45:19 - INFO - __main__ - Step 46161: {'lr': 0.0003979747808489036, 'samples': 8862912, 'steps': 46160, 'loss/train': 1.205873727798462} 11/07/2021 03:45:20 - INFO - __main__ - Step 46162: {'lr': 0.0003979705035156663, 'samples': 8863104, 'steps': 46161, 'loss/train': 1.7263001203536987} 11/07/2021 03:45:21 - INFO - __main__ - Step 46163: {'lr': 0.0003979662261157558, 'samples': 8863296, 'steps': 46162, 'loss/train': 1.7620081901550293} 11/07/2021 03:45:21 - INFO - __main__ - Step 46164: {'lr': 0.00039796194864917414, 'samples': 8863488, 'steps': 46163, 'loss/train': 1.6041735410690308} 11/07/2021 03:45:21 - INFO - __main__ - Step 46165: {'lr': 0.00039795767111592303, 'samples': 8863680, 'steps': 46164, 'loss/train': 1.8693697452545166} 11/07/2021 03:45:22 - INFO - __main__ - Step 46166: {'lr': 0.00039795339351600444, 'samples': 8863872, 'steps': 46165, 'loss/train': 1.9089528322219849} 11/07/2021 03:45:23 - INFO - __main__ - Step 46167: {'lr': 0.0003979491158494203, 'samples': 8864064, 'steps': 46166, 'loss/train': 1.438377022743225} 11/07/2021 03:45:23 - INFO - __main__ - Step 46168: {'lr': 0.00039794483811617267, 'samples': 8864256, 'steps': 46167, 'loss/train': 1.8377418518066406} 11/07/2021 03:45:23 - INFO - __main__ - Step 46169: {'lr': 0.0003979405603162633, 'samples': 8864448, 'steps': 46168, 'loss/train': 1.2616311311721802} 11/07/2021 03:45:24 - INFO - __main__ - Step 46170: {'lr': 0.0003979362824496942, 'samples': 8864640, 'steps': 46169, 'loss/train': 1.7195957899093628} 11/07/2021 03:45:24 - INFO - __main__ - Step 46171: {'lr': 0.00039793200451646737, 'samples': 8864832, 'steps': 46170, 'loss/train': 1.6643671989440918} 11/07/2021 03:45:25 - INFO - __main__ - Step 46172: {'lr': 0.0003979277265165846, 'samples': 8865024, 'steps': 46171, 'loss/train': 1.4506498575210571} 11/07/2021 03:45:26 - INFO - __main__ - Step 46173: {'lr': 0.00039792344845004793, 'samples': 8865216, 'steps': 46172, 'loss/train': 1.3304592370986938} 11/07/2021 03:45:26 - INFO - __main__ - Step 46174: {'lr': 0.00039791917031685914, 'samples': 8865408, 'steps': 46173, 'loss/train': 1.5090699195861816} 11/07/2021 03:45:26 - INFO - __main__ - Step 46175: {'lr': 0.0003979148921170203, 'samples': 8865600, 'steps': 46174, 'loss/train': 1.5561892986297607} 11/07/2021 03:45:27 - INFO - __main__ - Step 46176: {'lr': 0.0003979106138505333, 'samples': 8865792, 'steps': 46175, 'loss/train': 1.3784784078598022} 11/07/2021 03:45:28 - INFO - __main__ - Step 46177: {'lr': 0.00039790633551740006, 'samples': 8865984, 'steps': 46176, 'loss/train': 1.4233170747756958} 11/07/2021 03:45:28 - INFO - __main__ - Step 46178: {'lr': 0.0003979020571176226, 'samples': 8866176, 'steps': 46177, 'loss/train': 1.3791457414627075} 11/07/2021 03:45:28 - INFO - __main__ - Step 46179: {'lr': 0.00039789777865120257, 'samples': 8866368, 'steps': 46178, 'loss/train': 1.4233858585357666} 11/07/2021 03:45:29 - INFO - __main__ - Step 46180: {'lr': 0.0003978935001181422, 'samples': 8866560, 'steps': 46179, 'loss/train': 1.8352843523025513} 11/07/2021 03:45:29 - INFO - __main__ - Step 46181: {'lr': 0.0003978892215184433, 'samples': 8866752, 'steps': 46180, 'loss/train': 1.3255808353424072} 11/07/2021 03:45:30 - INFO - __main__ - Step 46182: {'lr': 0.00039788494285210774, 'samples': 8866944, 'steps': 46181, 'loss/train': 1.5028743743896484} 11/07/2021 03:45:31 - INFO - __main__ - Step 46183: {'lr': 0.0003978806641191376, 'samples': 8867136, 'steps': 46182, 'loss/train': 1.8030539751052856} 11/07/2021 03:45:31 - INFO - __main__ - Step 46184: {'lr': 0.0003978763853195346, 'samples': 8867328, 'steps': 46183, 'loss/train': 1.3502064943313599} 11/07/2021 03:45:31 - INFO - __main__ - Step 46185: {'lr': 0.0003978721064533009, 'samples': 8867520, 'steps': 46184, 'loss/train': 1.466660976409912} 11/07/2021 03:45:32 - INFO - __main__ - Step 46186: {'lr': 0.0003978678275204383, 'samples': 8867712, 'steps': 46185, 'loss/train': 1.0451267957687378} 11/07/2021 03:45:32 - INFO - __main__ - Step 46187: {'lr': 0.00039786354852094864, 'samples': 8867904, 'steps': 46186, 'loss/train': 1.3738776445388794} 11/07/2021 03:45:33 - INFO - __main__ - Step 46188: {'lr': 0.00039785926945483396, 'samples': 8868096, 'steps': 46187, 'loss/train': 1.433671236038208} 11/07/2021 03:45:33 - INFO - __main__ - Step 46189: {'lr': 0.00039785499032209625, 'samples': 8868288, 'steps': 46188, 'loss/train': 1.3277513980865479} 11/07/2021 03:45:34 - INFO - __main__ - Step 46190: {'lr': 0.0003978507111227373, 'samples': 8868480, 'steps': 46189, 'loss/train': 1.5594428777694702} 11/07/2021 03:45:34 - INFO - __main__ - Step 46191: {'lr': 0.00039784643185675916, 'samples': 8868672, 'steps': 46190, 'loss/train': 1.5130149126052856} 11/07/2021 03:45:34 - INFO - __main__ - Step 46192: {'lr': 0.0003978421525241637, 'samples': 8868864, 'steps': 46191, 'loss/train': 1.791502833366394} 11/07/2021 03:45:35 - INFO - __main__ - Step 46193: {'lr': 0.00039783787312495277, 'samples': 8869056, 'steps': 46192, 'loss/train': 1.1747045516967773} 11/07/2021 03:45:36 - INFO - __main__ - Step 46194: {'lr': 0.0003978335936591284, 'samples': 8869248, 'steps': 46193, 'loss/train': 1.7039309740066528} 11/07/2021 03:45:36 - INFO - __main__ - Step 46195: {'lr': 0.00039782931412669253, 'samples': 8869440, 'steps': 46194, 'loss/train': 1.482157826423645} 11/07/2021 03:45:36 - INFO - __main__ - Step 46196: {'lr': 0.000397825034527647, 'samples': 8869632, 'steps': 46195, 'loss/train': 1.8387107849121094} 11/07/2021 03:45:37 - INFO - __main__ - Step 46197: {'lr': 0.0003978207548619939, 'samples': 8869824, 'steps': 46196, 'loss/train': 1.5056120157241821} 11/07/2021 03:45:38 - INFO - __main__ - Step 46198: {'lr': 0.000397816475129735, 'samples': 8870016, 'steps': 46197, 'loss/train': 1.4632428884506226} 11/07/2021 03:45:38 - INFO - __main__ - Step 46199: {'lr': 0.0003978121953308722, 'samples': 8870208, 'steps': 46198, 'loss/train': 1.8056995868682861} 11/07/2021 03:45:38 - INFO - __main__ - Step 46200: {'lr': 0.0003978079154654075, 'samples': 8870400, 'steps': 46199, 'loss/train': 1.4516160488128662} 11/07/2021 03:45:39 - INFO - __main__ - Step 46201: {'lr': 0.000397803635533343, 'samples': 8870592, 'steps': 46200, 'loss/train': 1.0014290809631348} 11/07/2021 03:45:39 - INFO - __main__ - Step 46202: {'lr': 0.00039779935553468026, 'samples': 8870784, 'steps': 46201, 'loss/train': 1.3329901695251465} 11/07/2021 03:45:40 - INFO - __main__ - Step 46203: {'lr': 0.0003977950754694215, 'samples': 8870976, 'steps': 46202, 'loss/train': 1.5694855451583862} 11/07/2021 03:45:41 - INFO - __main__ - Step 46204: {'lr': 0.00039779079533756856, 'samples': 8871168, 'steps': 46203, 'loss/train': 1.2100770473480225} 11/07/2021 03:45:41 - INFO - __main__ - Step 46205: {'lr': 0.00039778651513912343, 'samples': 8871360, 'steps': 46204, 'loss/train': 1.1733067035675049} 11/07/2021 03:45:41 - INFO - __main__ - Step 46206: {'lr': 0.00039778223487408796, 'samples': 8871552, 'steps': 46205, 'loss/train': 0.9480448961257935} 11/07/2021 03:45:42 - INFO - __main__ - Step 46207: {'lr': 0.000397777954542464, 'samples': 8871744, 'steps': 46206, 'loss/train': 1.1663490533828735} 11/07/2021 03:45:43 - INFO - __main__ - Step 46208: {'lr': 0.0003977736741442537, 'samples': 8871936, 'steps': 46207, 'loss/train': 1.4362285137176514} 11/07/2021 03:45:43 - INFO - __main__ - Step 46209: {'lr': 0.00039776939367945874, 'samples': 8872128, 'steps': 46208, 'loss/train': 1.7439661026000977} 11/07/2021 03:45:44 - INFO - __main__ - Step 46210: {'lr': 0.00039776511314808125, 'samples': 8872320, 'steps': 46209, 'loss/train': 1.4757336378097534} 11/07/2021 03:45:44 - INFO - __main__ - Step 46211: {'lr': 0.00039776083255012307, 'samples': 8872512, 'steps': 46210, 'loss/train': 1.7445675134658813} 11/07/2021 03:45:44 - INFO - __main__ - Step 46212: {'lr': 0.0003977565518855861, 'samples': 8872704, 'steps': 46211, 'loss/train': 1.7271639108657837} 11/07/2021 03:45:45 - INFO - __main__ - Step 46213: {'lr': 0.0003977522711544723, 'samples': 8872896, 'steps': 46212, 'loss/train': 1.2396310567855835} 11/07/2021 03:45:46 - INFO - __main__ - Step 46214: {'lr': 0.00039774799035678367, 'samples': 8873088, 'steps': 46213, 'loss/train': 1.4215750694274902} 11/07/2021 03:45:46 - INFO - __main__ - Step 46215: {'lr': 0.000397743709492522, 'samples': 8873280, 'steps': 46214, 'loss/train': 1.7732235193252563} 11/07/2021 03:45:46 - INFO - __main__ - Step 46216: {'lr': 0.0003977394285616893, 'samples': 8873472, 'steps': 46215, 'loss/train': 1.045384407043457} 11/07/2021 03:45:47 - INFO - __main__ - Step 46217: {'lr': 0.0003977351475642876, 'samples': 8873664, 'steps': 46216, 'loss/train': 0.8771900534629822} 11/07/2021 03:45:47 - INFO - __main__ - Step 46218: {'lr': 0.00039773086650031866, 'samples': 8873856, 'steps': 46217, 'loss/train': 0.9818655848503113} 11/07/2021 03:45:48 - INFO - __main__ - Step 46219: {'lr': 0.00039772658536978443, 'samples': 8874048, 'steps': 46218, 'loss/train': 1.6137886047363281} 11/07/2021 03:45:48 - INFO - __main__ - Step 46220: {'lr': 0.00039772230417268697, 'samples': 8874240, 'steps': 46219, 'loss/train': 1.6870713233947754} 11/07/2021 03:45:49 - INFO - __main__ - Step 46221: {'lr': 0.00039771802290902806, 'samples': 8874432, 'steps': 46220, 'loss/train': 1.9500912427902222} 11/07/2021 03:45:49 - INFO - __main__ - Step 46222: {'lr': 0.0003977137415788097, 'samples': 8874624, 'steps': 46221, 'loss/train': 1.2778922319412231} 11/07/2021 03:45:49 - INFO - __main__ - Step 46223: {'lr': 0.00039770946018203375, 'samples': 8874816, 'steps': 46222, 'loss/train': 1.569501519203186} 11/07/2021 03:45:51 - INFO - __main__ - Step 46224: {'lr': 0.00039770517871870226, 'samples': 8875008, 'steps': 46223, 'loss/train': 1.6372594833374023} 11/07/2021 03:45:51 - INFO - __main__ - Step 46225: {'lr': 0.00039770089718881707, 'samples': 8875200, 'steps': 46224, 'loss/train': 0.9442867040634155} 11/07/2021 03:45:51 - INFO - __main__ - Step 46226: {'lr': 0.00039769661559238014, 'samples': 8875392, 'steps': 46225, 'loss/train': 1.5672374963760376} 11/07/2021 03:45:52 - INFO - __main__ - Step 46227: {'lr': 0.0003976923339293934, 'samples': 8875584, 'steps': 46226, 'loss/train': 1.2659668922424316} 11/07/2021 03:45:52 - INFO - __main__ - Step 46228: {'lr': 0.0003976880521998588, 'samples': 8875776, 'steps': 46227, 'loss/train': 1.7109959125518799} 11/07/2021 03:45:52 - INFO - __main__ - Step 46229: {'lr': 0.00039768377040377823, 'samples': 8875968, 'steps': 46228, 'loss/train': 1.5612633228302002} 11/07/2021 03:45:53 - INFO - __main__ - Step 46230: {'lr': 0.00039767948854115356, 'samples': 8876160, 'steps': 46229, 'loss/train': 3.5253870487213135} 11/07/2021 03:45:54 - INFO - __main__ - Step 46231: {'lr': 0.0003976752066119869, 'samples': 8876352, 'steps': 46230, 'loss/train': 1.3928388357162476} 11/07/2021 03:45:54 - INFO - __main__ - Step 46232: {'lr': 0.00039767092461628, 'samples': 8876544, 'steps': 46231, 'loss/train': 1.063865303993225} 11/07/2021 03:45:55 - INFO - __main__ - Step 46233: {'lr': 0.0003976666425540349, 'samples': 8876736, 'steps': 46232, 'loss/train': 1.5887515544891357} 11/07/2021 03:45:55 - INFO - __main__ - Step 46234: {'lr': 0.00039766236042525346, 'samples': 8876928, 'steps': 46233, 'loss/train': 1.708302617073059} 11/07/2021 03:45:56 - INFO - __main__ - Step 46235: {'lr': 0.0003976580782299376, 'samples': 8877120, 'steps': 46234, 'loss/train': 1.5173242092132568} 11/07/2021 03:45:56 - INFO - __main__ - Step 46236: {'lr': 0.0003976537959680894, 'samples': 8877312, 'steps': 46235, 'loss/train': 1.6224896907806396} 11/07/2021 03:45:57 - INFO - __main__ - Step 46237: {'lr': 0.0003976495136397106, 'samples': 8877504, 'steps': 46236, 'loss/train': 1.1167081594467163} 11/07/2021 03:45:57 - INFO - __main__ - Step 46238: {'lr': 0.0003976452312448032, 'samples': 8877696, 'steps': 46237, 'loss/train': 1.2795268297195435} 11/07/2021 03:45:57 - INFO - __main__ - Step 46239: {'lr': 0.0003976409487833692, 'samples': 8877888, 'steps': 46238, 'loss/train': 1.6856943368911743} 11/07/2021 03:45:58 - INFO - __main__ - Step 46240: {'lr': 0.0003976366662554104, 'samples': 8878080, 'steps': 46239, 'loss/train': 0.9027916789054871} 11/07/2021 03:45:59 - INFO - __main__ - Step 46241: {'lr': 0.0003976323836609288, 'samples': 8878272, 'steps': 46240, 'loss/train': 1.8561947345733643} 11/07/2021 03:45:59 - INFO - __main__ - Step 46242: {'lr': 0.00039762810099992644, 'samples': 8878464, 'steps': 46241, 'loss/train': 1.3961539268493652} 11/07/2021 03:46:00 - INFO - __main__ - Step 46243: {'lr': 0.00039762381827240496, 'samples': 8878656, 'steps': 46242, 'loss/train': 0.9099176526069641} 11/07/2021 03:46:00 - INFO - __main__ - Step 46244: {'lr': 0.00039761953547836655, 'samples': 8878848, 'steps': 46243, 'loss/train': 2.1442995071411133} 11/07/2021 03:46:01 - INFO - __main__ - Step 46245: {'lr': 0.00039761525261781304, 'samples': 8879040, 'steps': 46244, 'loss/train': 1.3313018083572388} 11/07/2021 03:46:01 - INFO - __main__ - Step 46246: {'lr': 0.00039761096969074644, 'samples': 8879232, 'steps': 46245, 'loss/train': 1.868284821510315} 11/07/2021 03:46:02 - INFO - __main__ - Step 46247: {'lr': 0.0003976066866971686, 'samples': 8879424, 'steps': 46246, 'loss/train': 1.4452598094940186} 11/07/2021 03:46:02 - INFO - __main__ - Step 46248: {'lr': 0.0003976024036370814, 'samples': 8879616, 'steps': 46247, 'loss/train': 1.410376787185669} 11/07/2021 03:46:02 - INFO - __main__ - Step 46249: {'lr': 0.0003975981205104868, 'samples': 8879808, 'steps': 46248, 'loss/train': 0.6749045252799988} 11/07/2021 03:46:03 - INFO - __main__ - Step 46250: {'lr': 0.0003975938373173868, 'samples': 8880000, 'steps': 46249, 'loss/train': 1.77179753780365} 11/07/2021 03:46:04 - INFO - __main__ - Step 46251: {'lr': 0.00039758955405778344, 'samples': 8880192, 'steps': 46250, 'loss/train': 1.5711568593978882} 11/07/2021 03:46:04 - INFO - __main__ - Step 46252: {'lr': 0.0003975852707316784, 'samples': 8880384, 'steps': 46251, 'loss/train': 1.2575101852416992} 11/07/2021 03:46:05 - INFO - __main__ - Step 46253: {'lr': 0.00039758098733907364, 'samples': 8880576, 'steps': 46252, 'loss/train': 1.2835415601730347} 11/07/2021 03:46:05 - INFO - __main__ - Step 46254: {'lr': 0.00039757670387997125, 'samples': 8880768, 'steps': 46253, 'loss/train': 1.1759365797042847} 11/07/2021 03:46:05 - INFO - __main__ - Step 46255: {'lr': 0.000397572420354373, 'samples': 8880960, 'steps': 46254, 'loss/train': 1.6075844764709473} 11/07/2021 03:46:06 - INFO - __main__ - Step 46256: {'lr': 0.00039756813676228097, 'samples': 8881152, 'steps': 46255, 'loss/train': 1.6370705366134644} 11/07/2021 03:46:07 - INFO - __main__ - Step 46257: {'lr': 0.00039756385310369703, 'samples': 8881344, 'steps': 46256, 'loss/train': 1.8248659372329712} 11/07/2021 03:46:07 - INFO - __main__ - Step 46258: {'lr': 0.00039755956937862305, 'samples': 8881536, 'steps': 46257, 'loss/train': 1.6447463035583496} 11/07/2021 03:46:07 - INFO - __main__ - Step 46259: {'lr': 0.000397555285587061, 'samples': 8881728, 'steps': 46258, 'loss/train': 1.0949921607971191} 11/07/2021 03:46:08 - INFO - __main__ - Step 46260: {'lr': 0.0003975510017290128, 'samples': 8881920, 'steps': 46259, 'loss/train': 1.5389002561569214} 11/07/2021 03:46:08 - INFO - __main__ - Step 46261: {'lr': 0.00039754671780448044, 'samples': 8882112, 'steps': 46260, 'loss/train': 1.0770015716552734} 11/07/2021 03:46:09 - INFO - __main__ - Step 46262: {'lr': 0.00039754243381346575, 'samples': 8882304, 'steps': 46261, 'loss/train': 1.4575480222702026} 11/07/2021 03:46:10 - INFO - __main__ - Step 46263: {'lr': 0.0003975381497559708, 'samples': 8882496, 'steps': 46262, 'loss/train': 0.6957834959030151} 11/07/2021 03:46:10 - INFO - __main__ - Step 46264: {'lr': 0.00039753386563199733, 'samples': 8882688, 'steps': 46263, 'loss/train': 1.280260443687439} 11/07/2021 03:46:10 - INFO - __main__ - Step 46265: {'lr': 0.0003975295814415475, 'samples': 8882880, 'steps': 46264, 'loss/train': 1.0015782117843628} 11/07/2021 03:46:11 - INFO - __main__ - Step 46266: {'lr': 0.000397525297184623, 'samples': 8883072, 'steps': 46265, 'loss/train': 1.1495505571365356} 11/07/2021 03:46:12 - INFO - __main__ - Step 46267: {'lr': 0.000397521012861226, 'samples': 8883264, 'steps': 46266, 'loss/train': 1.4827730655670166} 11/07/2021 03:46:12 - INFO - __main__ - Step 46268: {'lr': 0.0003975167284713582, 'samples': 8883456, 'steps': 46267, 'loss/train': 0.9695247411727905} 11/07/2021 03:46:12 - INFO - __main__ - Step 46269: {'lr': 0.0003975124440150217, 'samples': 8883648, 'steps': 46268, 'loss/train': 1.5522502660751343} 11/07/2021 03:46:13 - INFO - __main__ - Step 46270: {'lr': 0.0003975081594922183, 'samples': 8883840, 'steps': 46269, 'loss/train': 1.0997180938720703} 11/07/2021 03:46:13 - INFO - __main__ - Step 46271: {'lr': 0.00039750387490295006, 'samples': 8884032, 'steps': 46270, 'loss/train': 1.3116071224212646} 11/07/2021 03:46:14 - INFO - __main__ - Step 46272: {'lr': 0.00039749959024721883, 'samples': 8884224, 'steps': 46271, 'loss/train': 1.546545386314392} 11/07/2021 03:46:14 - INFO - __main__ - Step 46273: {'lr': 0.00039749530552502654, 'samples': 8884416, 'steps': 46272, 'loss/train': 1.332622766494751} 11/07/2021 03:46:15 - INFO - __main__ - Step 46274: {'lr': 0.0003974910207363752, 'samples': 8884608, 'steps': 46273, 'loss/train': 1.4993009567260742} 11/07/2021 03:46:15 - INFO - __main__ - Step 46275: {'lr': 0.00039748673588126674, 'samples': 8884800, 'steps': 46274, 'loss/train': 1.944024682044983} 11/07/2021 03:46:16 - INFO - __main__ - Step 46276: {'lr': 0.00039748245095970285, 'samples': 8884992, 'steps': 46275, 'loss/train': 1.0248795747756958} 11/07/2021 03:46:17 - INFO - __main__ - Step 46277: {'lr': 0.0003974781659716857, 'samples': 8885184, 'steps': 46276, 'loss/train': 1.0836387872695923} 11/07/2021 03:46:17 - INFO - __main__ - Step 46278: {'lr': 0.00039747388091721723, 'samples': 8885376, 'steps': 46277, 'loss/train': 1.4748361110687256} 11/07/2021 03:46:17 - INFO - __main__ - Step 46279: {'lr': 0.00039746959579629924, 'samples': 8885568, 'steps': 46278, 'loss/train': 1.4609371423721313} 11/07/2021 03:46:18 - INFO - __main__ - Step 46280: {'lr': 0.00039746531060893387, 'samples': 8885760, 'steps': 46279, 'loss/train': 1.255275845527649} 11/07/2021 03:46:18 - INFO - __main__ - Step 46281: {'lr': 0.00039746102535512273, 'samples': 8885952, 'steps': 46280, 'loss/train': 1.3065893650054932} 11/07/2021 03:46:19 - INFO - __main__ - Step 46282: {'lr': 0.000397456740034868, 'samples': 8886144, 'steps': 46281, 'loss/train': 1.4119219779968262} 11/07/2021 03:46:19 - INFO - __main__ - Step 46283: {'lr': 0.00039745245464817156, 'samples': 8886336, 'steps': 46282, 'loss/train': 1.323103666305542} 11/07/2021 03:46:20 - INFO - __main__ - Step 46284: {'lr': 0.0003974481691950352, 'samples': 8886528, 'steps': 46283, 'loss/train': 1.300934910774231} 11/07/2021 03:46:20 - INFO - __main__ - Step 46285: {'lr': 0.00039744388367546113, 'samples': 8886720, 'steps': 46284, 'loss/train': 1.2084866762161255} 11/07/2021 03:46:20 - INFO - __main__ - Step 46286: {'lr': 0.0003974395980894511, 'samples': 8886912, 'steps': 46285, 'loss/train': 1.968017339706421} 11/07/2021 03:46:21 - INFO - __main__ - Step 46287: {'lr': 0.000397435312437007, 'samples': 8887104, 'steps': 46286, 'loss/train': 1.5359374284744263} 11/07/2021 03:46:22 - INFO - __main__ - Step 46288: {'lr': 0.0003974310267181308, 'samples': 8887296, 'steps': 46287, 'loss/train': 1.4335994720458984} 11/07/2021 03:46:22 - INFO - __main__ - Step 46289: {'lr': 0.00039742674093282447, 'samples': 8887488, 'steps': 46288, 'loss/train': 0.9824011921882629} 11/07/2021 03:46:23 - INFO - __main__ - Step 46290: {'lr': 0.00039742245508109, 'samples': 8887680, 'steps': 46289, 'loss/train': 1.5129871368408203} 11/07/2021 03:46:23 - INFO - __main__ - Step 46291: {'lr': 0.0003974181691629292, 'samples': 8887872, 'steps': 46290, 'loss/train': 0.4517260491847992} 11/07/2021 03:46:23 - INFO - __main__ - Step 46292: {'lr': 0.00039741388317834404, 'samples': 8888064, 'steps': 46291, 'loss/train': 0.2578364312648773} 11/07/2021 03:46:24 - INFO - __main__ - Step 46293: {'lr': 0.0003974095971273365, 'samples': 8888256, 'steps': 46292, 'loss/train': 1.7126384973526} 11/07/2021 03:46:25 - INFO - __main__ - Step 46294: {'lr': 0.0003974053110099084, 'samples': 8888448, 'steps': 46293, 'loss/train': 1.5983866453170776} 11/07/2021 03:46:25 - INFO - __main__ - Step 46295: {'lr': 0.00039740102482606175, 'samples': 8888640, 'steps': 46294, 'loss/train': 1.3725718259811401} 11/07/2021 03:46:25 - INFO - __main__ - Step 46296: {'lr': 0.0003973967385757985, 'samples': 8888832, 'steps': 46295, 'loss/train': 1.0838935375213623} 11/07/2021 03:46:26 - INFO - __main__ - Step 46297: {'lr': 0.00039739245225912055, 'samples': 8889024, 'steps': 46296, 'loss/train': 1.899027943611145} 11/07/2021 03:46:27 - INFO - __main__ - Step 46298: {'lr': 0.0003973881658760298, 'samples': 8889216, 'steps': 46297, 'loss/train': 1.5178961753845215} 11/07/2021 03:46:27 - INFO - __main__ - Step 46299: {'lr': 0.0003973838794265283, 'samples': 8889408, 'steps': 46298, 'loss/train': 1.2332338094711304} 11/07/2021 03:46:27 - INFO - __main__ - Step 46300: {'lr': 0.00039737959291061785, 'samples': 8889600, 'steps': 46299, 'loss/train': 1.6719191074371338} 11/07/2021 03:46:28 - INFO - __main__ - Step 46301: {'lr': 0.00039737530632830045, 'samples': 8889792, 'steps': 46300, 'loss/train': 1.134166955947876} 11/07/2021 03:46:28 - INFO - __main__ - Step 46302: {'lr': 0.000397371019679578, 'samples': 8889984, 'steps': 46301, 'loss/train': 1.2153406143188477} 11/07/2021 03:46:29 - INFO - __main__ - Step 46303: {'lr': 0.00039736673296445233, 'samples': 8890176, 'steps': 46302, 'loss/train': 1.4334412813186646} 11/07/2021 03:46:30 - INFO - __main__ - Step 46304: {'lr': 0.00039736244618292563, 'samples': 8890368, 'steps': 46303, 'loss/train': 1.919948935508728} 11/07/2021 03:46:30 - INFO - __main__ - Step 46305: {'lr': 0.0003973581593349997, 'samples': 8890560, 'steps': 46304, 'loss/train': 1.259997010231018} 11/07/2021 03:46:30 - INFO - __main__ - Step 46306: {'lr': 0.00039735387242067637, 'samples': 8890752, 'steps': 46305, 'loss/train': 1.6165543794631958} 11/07/2021 03:46:31 - INFO - __main__ - Step 46307: {'lr': 0.0003973495854399577, 'samples': 8890944, 'steps': 46306, 'loss/train': 1.770179271697998} 11/07/2021 03:46:32 - INFO - __main__ - Step 46308: {'lr': 0.0003973452983928456, 'samples': 8891136, 'steps': 46307, 'loss/train': 1.8767409324645996} 11/07/2021 03:46:32 - INFO - __main__ - Step 46309: {'lr': 0.00039734101127934194, 'samples': 8891328, 'steps': 46308, 'loss/train': 1.3616315126419067} 11/07/2021 03:46:32 - INFO - __main__ - Step 46310: {'lr': 0.0003973367240994487, 'samples': 8891520, 'steps': 46309, 'loss/train': 1.2925238609313965} 11/07/2021 03:46:33 - INFO - __main__ - Step 46311: {'lr': 0.00039733243685316776, 'samples': 8891712, 'steps': 46310, 'loss/train': 1.692766547203064} 11/07/2021 03:46:33 - INFO - __main__ - Step 46312: {'lr': 0.00039732814954050125, 'samples': 8891904, 'steps': 46311, 'loss/train': 1.7525407075881958} 11/07/2021 03:46:34 - INFO - __main__ - Step 46313: {'lr': 0.0003973238621614508, 'samples': 8892096, 'steps': 46312, 'loss/train': 1.7229480743408203} 11/07/2021 03:46:34 - INFO - __main__ - Step 46314: {'lr': 0.0003973195747160185, 'samples': 8892288, 'steps': 46313, 'loss/train': 1.8279120922088623} 11/07/2021 03:46:35 - INFO - __main__ - Step 46315: {'lr': 0.00039731528720420635, 'samples': 8892480, 'steps': 46314, 'loss/train': 1.6845635175704956} 11/07/2021 03:46:35 - INFO - __main__ - Step 46316: {'lr': 0.00039731099962601613, 'samples': 8892672, 'steps': 46315, 'loss/train': 1.5683223009109497} 11/07/2021 03:46:35 - INFO - __main__ - Step 46317: {'lr': 0.0003973067119814499, 'samples': 8892864, 'steps': 46316, 'loss/train': 1.3998371362686157} 11/07/2021 03:46:36 - INFO - __main__ - Step 46318: {'lr': 0.00039730242427050955, 'samples': 8893056, 'steps': 46317, 'loss/train': 1.473682165145874} 11/07/2021 03:46:37 - INFO - __main__ - Step 46319: {'lr': 0.00039729813649319704, 'samples': 8893248, 'steps': 46318, 'loss/train': 1.5312484502792358} 11/07/2021 03:46:37 - INFO - __main__ - Step 46320: {'lr': 0.0003972938486495141, 'samples': 8893440, 'steps': 46319, 'loss/train': 1.6194556951522827} 11/07/2021 03:46:38 - INFO - __main__ - Step 46321: {'lr': 0.000397289560739463, 'samples': 8893632, 'steps': 46320, 'loss/train': 0.7224371433258057} 11/07/2021 03:46:38 - INFO - __main__ - Step 46322: {'lr': 0.0003972852727630454, 'samples': 8893824, 'steps': 46321, 'loss/train': 1.4943556785583496} 11/07/2021 03:46:39 - INFO - __main__ - Step 46323: {'lr': 0.0003972809847202633, 'samples': 8894016, 'steps': 46322, 'loss/train': 1.1739274263381958} 11/07/2021 03:46:39 - INFO - __main__ - Step 46324: {'lr': 0.0003972766966111187, 'samples': 8894208, 'steps': 46323, 'loss/train': 1.3836803436279297} 11/07/2021 03:46:40 - INFO - __main__ - Step 46325: {'lr': 0.0003972724084356135, 'samples': 8894400, 'steps': 46324, 'loss/train': 1.1018271446228027} 11/07/2021 03:46:40 - INFO - __main__ - Step 46326: {'lr': 0.0003972681201937497, 'samples': 8894592, 'steps': 46325, 'loss/train': 1.9598287343978882} 11/07/2021 03:46:40 - INFO - __main__ - Step 46327: {'lr': 0.00039726383188552907, 'samples': 8894784, 'steps': 46326, 'loss/train': 0.4965382516384125} 11/07/2021 03:46:41 - INFO - __main__ - Step 46328: {'lr': 0.0003972595435109536, 'samples': 8894976, 'steps': 46327, 'loss/train': 1.0187140703201294} 11/07/2021 03:46:42 - INFO - __main__ - Step 46329: {'lr': 0.0003972552550700253, 'samples': 8895168, 'steps': 46328, 'loss/train': 1.9053086042404175} 11/07/2021 03:46:42 - INFO - __main__ - Step 46330: {'lr': 0.00039725096656274605, 'samples': 8895360, 'steps': 46329, 'loss/train': 1.474176287651062} 11/07/2021 03:46:42 - INFO - __main__ - Step 46331: {'lr': 0.0003972466779891178, 'samples': 8895552, 'steps': 46330, 'loss/train': 1.58558988571167} 11/07/2021 03:46:43 - INFO - __main__ - Step 46332: {'lr': 0.00039724238934914246, 'samples': 8895744, 'steps': 46331, 'loss/train': 1.1344085931777954} 11/07/2021 03:46:43 - INFO - __main__ - Step 46333: {'lr': 0.00039723810064282194, 'samples': 8895936, 'steps': 46332, 'loss/train': 1.906126856803894} 11/07/2021 03:46:44 - INFO - __main__ - Step 46334: {'lr': 0.00039723381187015827, 'samples': 8896128, 'steps': 46333, 'loss/train': 1.4070894718170166} 11/07/2021 03:46:44 - INFO - __main__ - Step 46335: {'lr': 0.00039722952303115325, 'samples': 8896320, 'steps': 46334, 'loss/train': 1.662004828453064} 11/07/2021 03:46:45 - INFO - __main__ - Step 46336: {'lr': 0.00039722523412580893, 'samples': 8896512, 'steps': 46335, 'loss/train': 1.8109190464019775} 11/07/2021 03:46:45 - INFO - __main__ - Step 46337: {'lr': 0.00039722094515412716, 'samples': 8896704, 'steps': 46336, 'loss/train': 1.2489622831344604} 11/07/2021 03:46:45 - INFO - __main__ - Step 46338: {'lr': 0.0003972166561161099, 'samples': 8896896, 'steps': 46337, 'loss/train': 1.6360827684402466} 11/07/2021 03:46:46 - INFO - __main__ - Step 46339: {'lr': 0.0003972123670117591, 'samples': 8897088, 'steps': 46338, 'loss/train': 1.5947308540344238} 11/07/2021 03:46:47 - INFO - __main__ - Step 46340: {'lr': 0.0003972080778410767, 'samples': 8897280, 'steps': 46339, 'loss/train': 1.5650063753128052} 11/07/2021 03:46:47 - INFO - __main__ - Step 46341: {'lr': 0.0003972037886040646, 'samples': 8897472, 'steps': 46340, 'loss/train': 1.557324767112732} 11/07/2021 03:46:47 - INFO - __main__ - Step 46342: {'lr': 0.0003971994993007247, 'samples': 8897664, 'steps': 46341, 'loss/train': 1.0299310684204102} 11/07/2021 03:46:48 - INFO - __main__ - Step 46343: {'lr': 0.000397195209931059, 'samples': 8897856, 'steps': 46342, 'loss/train': 1.5515167713165283} 11/07/2021 03:46:49 - INFO - __main__ - Step 46344: {'lr': 0.00039719092049506945, 'samples': 8898048, 'steps': 46343, 'loss/train': 1.3159284591674805} 11/07/2021 03:46:49 - INFO - __main__ - Step 46345: {'lr': 0.0003971866309927579, 'samples': 8898240, 'steps': 46344, 'loss/train': 1.5566840171813965} 11/07/2021 03:46:50 - INFO - __main__ - Step 46346: {'lr': 0.0003971823414241263, 'samples': 8898432, 'steps': 46345, 'loss/train': 1.3025474548339844} 11/07/2021 03:46:50 - INFO - __main__ - Step 46347: {'lr': 0.00039717805178917666, 'samples': 8898624, 'steps': 46346, 'loss/train': 0.9771838188171387} 11/07/2021 03:46:50 - INFO - __main__ - Step 46348: {'lr': 0.0003971737620879109, 'samples': 8898816, 'steps': 46347, 'loss/train': 1.1943190097808838} 11/07/2021 03:46:52 - INFO - __main__ - Step 46349: {'lr': 0.00039716947232033086, 'samples': 8899008, 'steps': 46348, 'loss/train': 1.3805917501449585} 11/07/2021 03:46:52 - INFO - __main__ - Step 46350: {'lr': 0.0003971651824864385, 'samples': 8899200, 'steps': 46349, 'loss/train': 1.2455881834030151} 11/07/2021 03:46:52 - INFO - __main__ - Step 46351: {'lr': 0.0003971608925862358, 'samples': 8899392, 'steps': 46350, 'loss/train': 0.6013545393943787} 11/07/2021 03:46:53 - INFO - __main__ - Step 46352: {'lr': 0.0003971566026197247, 'samples': 8899584, 'steps': 46351, 'loss/train': 1.3850561380386353} 11/07/2021 03:46:53 - INFO - __main__ - Step 46353: {'lr': 0.0003971523125869071, 'samples': 8899776, 'steps': 46352, 'loss/train': 1.5100414752960205} 11/07/2021 03:46:54 - INFO - __main__ - Step 46354: {'lr': 0.0003971480224877849, 'samples': 8899968, 'steps': 46353, 'loss/train': 1.7108681201934814} 11/07/2021 03:46:54 - INFO - __main__ - Step 46355: {'lr': 0.0003971437323223601, 'samples': 8900160, 'steps': 46354, 'loss/train': 1.50070321559906} 11/07/2021 03:46:55 - INFO - __main__ - Step 46356: {'lr': 0.0003971394420906346, 'samples': 8900352, 'steps': 46355, 'loss/train': 1.4882370233535767} 11/07/2021 03:46:55 - INFO - __main__ - Step 46357: {'lr': 0.0003971351517926103, 'samples': 8900544, 'steps': 46356, 'loss/train': 1.826380729675293} 11/07/2021 03:46:55 - INFO - __main__ - Step 46358: {'lr': 0.00039713086142828926, 'samples': 8900736, 'steps': 46357, 'loss/train': 1.4136426448822021} 11/07/2021 03:46:56 - INFO - __main__ - Step 46359: {'lr': 0.0003971265709976732, 'samples': 8900928, 'steps': 46358, 'loss/train': 1.0631334781646729} 11/07/2021 03:46:57 - INFO - __main__ - Step 46360: {'lr': 0.0003971222805007643, 'samples': 8901120, 'steps': 46359, 'loss/train': 1.5096248388290405} 11/07/2021 03:46:57 - INFO - __main__ - Step 46361: {'lr': 0.0003971179899375643, 'samples': 8901312, 'steps': 46360, 'loss/train': 1.5115386247634888} 11/07/2021 03:46:57 - INFO - __main__ - Step 46362: {'lr': 0.0003971136993080753, 'samples': 8901504, 'steps': 46361, 'loss/train': 1.3785995244979858} 11/07/2021 03:46:58 - INFO - __main__ - Step 46363: {'lr': 0.000397109408612299, 'samples': 8901696, 'steps': 46362, 'loss/train': 1.4775710105895996} 11/07/2021 03:46:58 - INFO - __main__ - Step 46364: {'lr': 0.0003971051178502375, 'samples': 8901888, 'steps': 46363, 'loss/train': 1.3644808530807495} 11/07/2021 03:46:59 - INFO - __main__ - Step 46365: {'lr': 0.00039710082702189276, 'samples': 8902080, 'steps': 46364, 'loss/train': 0.9544762372970581} 11/07/2021 03:46:59 - INFO - __main__ - Step 46366: {'lr': 0.0003970965361272667, 'samples': 8902272, 'steps': 46365, 'loss/train': 1.3058061599731445} 11/07/2021 03:47:00 - INFO - __main__ - Step 46367: {'lr': 0.0003970922451663611, 'samples': 8902464, 'steps': 46366, 'loss/train': 1.5385879278182983} 11/07/2021 03:47:00 - INFO - __main__ - Step 46368: {'lr': 0.0003970879541391781, 'samples': 8902656, 'steps': 46367, 'loss/train': 1.2907299995422363} 11/07/2021 03:47:01 - INFO - __main__ - Step 46369: {'lr': 0.0003970836630457194, 'samples': 8902848, 'steps': 46368, 'loss/train': 1.6435942649841309} 11/07/2021 03:47:02 - INFO - __main__ - Step 46370: {'lr': 0.00039707937188598717, 'samples': 8903040, 'steps': 46369, 'loss/train': 1.3786275386810303} 11/07/2021 03:47:02 - INFO - __main__ - Step 46371: {'lr': 0.00039707508065998324, 'samples': 8903232, 'steps': 46370, 'loss/train': 1.4974207878112793} 11/07/2021 03:47:03 - INFO - __main__ - Step 46372: {'lr': 0.0003970707893677095, 'samples': 8903424, 'steps': 46371, 'loss/train': 1.7725350856781006} 11/07/2021 03:47:03 - INFO - __main__ - Step 46373: {'lr': 0.00039706649800916804, 'samples': 8903616, 'steps': 46372, 'loss/train': 1.5043054819107056} 11/07/2021 03:47:03 - INFO - __main__ - Step 46374: {'lr': 0.0003970622065843607, 'samples': 8903808, 'steps': 46373, 'loss/train': 5.2578558921813965} 11/07/2021 03:47:04 - INFO - __main__ - Step 46375: {'lr': 0.00039705791509328926, 'samples': 8904000, 'steps': 46374, 'loss/train': 1.6006724834442139} 11/07/2021 03:47:05 - INFO - __main__ - Step 46376: {'lr': 0.0003970536235359558, 'samples': 8904192, 'steps': 46375, 'loss/train': 1.1772557497024536} 11/07/2021 03:47:05 - INFO - __main__ - Step 46377: {'lr': 0.00039704933191236225, 'samples': 8904384, 'steps': 46376, 'loss/train': 1.2849663496017456} 11/07/2021 03:47:05 - INFO - __main__ - Step 46378: {'lr': 0.00039704504022251066, 'samples': 8904576, 'steps': 46377, 'loss/train': 1.5546324253082275} 11/07/2021 03:47:06 - INFO - __main__ - Step 46379: {'lr': 0.00039704074846640277, 'samples': 8904768, 'steps': 46378, 'loss/train': 1.1840581893920898} 11/07/2021 03:47:06 - INFO - __main__ - Step 46380: {'lr': 0.0003970364566440406, 'samples': 8904960, 'steps': 46379, 'loss/train': 1.5081672668457031} 11/07/2021 03:47:07 - INFO - __main__ - Step 46381: {'lr': 0.000397032164755426, 'samples': 8905152, 'steps': 46380, 'loss/train': 1.852556824684143} 11/07/2021 03:47:07 - INFO - __main__ - Step 46382: {'lr': 0.0003970278728005611, 'samples': 8905344, 'steps': 46381, 'loss/train': 1.6030914783477783} 11/07/2021 03:47:08 - INFO - __main__ - Step 46383: {'lr': 0.0003970235807794476, 'samples': 8905536, 'steps': 46382, 'loss/train': 1.5848695039749146} 11/07/2021 03:47:08 - INFO - __main__ - Step 46384: {'lr': 0.00039701928869208757, 'samples': 8905728, 'steps': 46383, 'loss/train': 2.116671323776245} 11/07/2021 03:47:09 - INFO - __main__ - Step 46385: {'lr': 0.0003970149965384829, 'samples': 8905920, 'steps': 46384, 'loss/train': 1.5274428129196167} 11/07/2021 03:47:10 - INFO - __main__ - Step 46386: {'lr': 0.00039701070431863564, 'samples': 8906112, 'steps': 46385, 'loss/train': 1.8301100730895996} 11/07/2021 03:47:10 - INFO - __main__ - Step 46387: {'lr': 0.00039700641203254755, 'samples': 8906304, 'steps': 46386, 'loss/train': 1.2952507734298706} 11/07/2021 03:47:10 - INFO - __main__ - Step 46388: {'lr': 0.0003970021196802206, 'samples': 8906496, 'steps': 46387, 'loss/train': 1.4825998544692993} 11/07/2021 03:47:11 - INFO - __main__ - Step 46389: {'lr': 0.0003969978272616569, 'samples': 8906688, 'steps': 46388, 'loss/train': 1.300925612449646} 11/07/2021 03:47:11 - INFO - __main__ - Step 46390: {'lr': 0.0003969935347768581, 'samples': 8906880, 'steps': 46389, 'loss/train': 1.5207626819610596} 11/07/2021 03:47:12 - INFO - __main__ - Step 46391: {'lr': 0.00039698924222582636, 'samples': 8907072, 'steps': 46390, 'loss/train': 1.572201132774353} 11/07/2021 03:47:12 - INFO - __main__ - Step 46392: {'lr': 0.00039698494960856346, 'samples': 8907264, 'steps': 46391, 'loss/train': 1.227784276008606} 11/07/2021 03:47:13 - INFO - __main__ - Step 46393: {'lr': 0.0003969806569250716, 'samples': 8907456, 'steps': 46392, 'loss/train': 0.8939191102981567} 11/07/2021 03:47:13 - INFO - __main__ - Step 46394: {'lr': 0.0003969763641753523, 'samples': 8907648, 'steps': 46393, 'loss/train': 1.826757788658142} 11/07/2021 03:47:14 - INFO - __main__ - Step 46395: {'lr': 0.00039697207135940785, 'samples': 8907840, 'steps': 46394, 'loss/train': 0.6387439370155334} 11/07/2021 03:47:14 - INFO - __main__ - Step 46396: {'lr': 0.00039696777847724, 'samples': 8908032, 'steps': 46395, 'loss/train': 1.2974773645401} 11/07/2021 03:47:15 - INFO - __main__ - Step 46397: {'lr': 0.00039696348552885075, 'samples': 8908224, 'steps': 46396, 'loss/train': 1.2351579666137695} 11/07/2021 03:47:15 - INFO - __main__ - Step 46398: {'lr': 0.000396959192514242, 'samples': 8908416, 'steps': 46397, 'loss/train': 1.7253797054290771} 11/07/2021 03:47:16 - INFO - __main__ - Step 46399: {'lr': 0.0003969548994334158, 'samples': 8908608, 'steps': 46398, 'loss/train': 1.3931994438171387} 11/07/2021 03:47:16 - INFO - __main__ - Step 46400: {'lr': 0.0003969506062863739, 'samples': 8908800, 'steps': 46399, 'loss/train': 1.0712485313415527} 11/07/2021 03:47:16 - INFO - __main__ - Step 46401: {'lr': 0.0003969463130731183, 'samples': 8908992, 'steps': 46400, 'loss/train': 1.5049532651901245} 11/07/2021 03:47:17 - INFO - __main__ - Step 46402: {'lr': 0.00039694201979365094, 'samples': 8909184, 'steps': 46401, 'loss/train': 1.2959758043289185} 11/07/2021 03:47:18 - INFO - __main__ - Step 46403: {'lr': 0.00039693772644797386, 'samples': 8909376, 'steps': 46402, 'loss/train': 1.4447095394134521} 11/07/2021 03:47:18 - INFO - __main__ - Step 46404: {'lr': 0.0003969334330360889, 'samples': 8909568, 'steps': 46403, 'loss/train': 1.9214714765548706} 11/07/2021 03:47:18 - INFO - __main__ - Step 46405: {'lr': 0.000396929139557998, 'samples': 8909760, 'steps': 46404, 'loss/train': 1.6759204864501953} 11/07/2021 03:47:19 - INFO - __main__ - Step 46406: {'lr': 0.00039692484601370305, 'samples': 8909952, 'steps': 46405, 'loss/train': 1.6240415573120117} 11/07/2021 03:47:20 - INFO - __main__ - Step 46407: {'lr': 0.0003969205524032061, 'samples': 8910144, 'steps': 46406, 'loss/train': 1.2875351905822754} 11/07/2021 03:47:20 - INFO - __main__ - Step 46408: {'lr': 0.00039691625872650895, 'samples': 8910336, 'steps': 46407, 'loss/train': 1.4437205791473389} 11/07/2021 03:47:20 - INFO - __main__ - Step 46409: {'lr': 0.00039691196498361364, 'samples': 8910528, 'steps': 46408, 'loss/train': 1.4613850116729736} 11/07/2021 03:47:21 - INFO - __main__ - Step 46410: {'lr': 0.0003969076711745221, 'samples': 8910720, 'steps': 46409, 'loss/train': 1.6706717014312744} 11/07/2021 03:47:21 - INFO - __main__ - Step 46411: {'lr': 0.00039690337729923617, 'samples': 8910912, 'steps': 46410, 'loss/train': 1.7761319875717163} 11/07/2021 03:47:22 - INFO - __main__ - Step 46412: {'lr': 0.0003968990833577578, 'samples': 8911104, 'steps': 46411, 'loss/train': 1.0088707208633423} 11/07/2021 03:47:23 - INFO - __main__ - Step 46413: {'lr': 0.00039689478935008905, 'samples': 8911296, 'steps': 46412, 'loss/train': 1.9144214391708374} 11/07/2021 03:47:23 - INFO - __main__ - Step 46414: {'lr': 0.00039689049527623176, 'samples': 8911488, 'steps': 46413, 'loss/train': 1.185842752456665} 11/07/2021 03:47:23 - INFO - __main__ - Step 46415: {'lr': 0.0003968862011361879, 'samples': 8911680, 'steps': 46414, 'loss/train': 1.3232680559158325} 11/07/2021 03:47:24 - INFO - __main__ - Step 46416: {'lr': 0.0003968819069299593, 'samples': 8911872, 'steps': 46415, 'loss/train': 1.2034891843795776} 11/07/2021 03:47:25 - INFO - __main__ - Step 46417: {'lr': 0.0003968776126575481, 'samples': 8912064, 'steps': 46416, 'loss/train': 1.278905987739563} 11/07/2021 03:47:25 - INFO - __main__ - Step 46418: {'lr': 0.000396873318318956, 'samples': 8912256, 'steps': 46417, 'loss/train': 1.3681342601776123} 11/07/2021 03:47:25 - INFO - __main__ - Step 46419: {'lr': 0.00039686902391418514, 'samples': 8912448, 'steps': 46418, 'loss/train': 1.267077088356018} 11/07/2021 03:47:26 - INFO - __main__ - Step 46420: {'lr': 0.00039686472944323734, 'samples': 8912640, 'steps': 46419, 'loss/train': 1.7939561605453491} 11/07/2021 03:47:26 - INFO - __main__ - Step 46421: {'lr': 0.0003968604349061145, 'samples': 8912832, 'steps': 46420, 'loss/train': 1.28379487991333} 11/07/2021 03:47:26 - INFO - __main__ - Step 46422: {'lr': 0.0003968561403028187, 'samples': 8913024, 'steps': 46421, 'loss/train': 1.4254707098007202} 11/07/2021 03:47:27 - INFO - __main__ - Step 46423: {'lr': 0.00039685184563335174, 'samples': 8913216, 'steps': 46422, 'loss/train': 1.5502345561981201} 11/07/2021 03:47:28 - INFO - __main__ - Step 46424: {'lr': 0.00039684755089771555, 'samples': 8913408, 'steps': 46423, 'loss/train': 1.2274820804595947} 11/07/2021 03:47:28 - INFO - __main__ - Step 46425: {'lr': 0.0003968432560959122, 'samples': 8913600, 'steps': 46424, 'loss/train': 1.656483769416809} 11/07/2021 03:47:28 - INFO - __main__ - Step 46426: {'lr': 0.00039683896122794354, 'samples': 8913792, 'steps': 46425, 'loss/train': 1.4895061254501343} 11/07/2021 03:47:29 - INFO - __main__ - Step 46427: {'lr': 0.0003968346662938115, 'samples': 8913984, 'steps': 46426, 'loss/train': 1.6468385457992554} 11/07/2021 03:47:30 - INFO - __main__ - Step 46428: {'lr': 0.00039683037129351805, 'samples': 8914176, 'steps': 46427, 'loss/train': 1.4098467826843262} 11/07/2021 03:47:30 - INFO - __main__ - Step 46429: {'lr': 0.000396826076227065, 'samples': 8914368, 'steps': 46428, 'loss/train': 1.276272177696228} 11/07/2021 03:47:31 - INFO - __main__ - Step 46430: {'lr': 0.00039682178109445447, 'samples': 8914560, 'steps': 46429, 'loss/train': 1.4634912014007568} 11/07/2021 03:47:31 - INFO - __main__ - Step 46431: {'lr': 0.0003968174858956883, 'samples': 8914752, 'steps': 46430, 'loss/train': 1.4141737222671509} 11/07/2021 03:47:31 - INFO - __main__ - Step 46432: {'lr': 0.0003968131906307684, 'samples': 8914944, 'steps': 46431, 'loss/train': 1.7467100620269775} 11/07/2021 03:47:32 - INFO - __main__ - Step 46433: {'lr': 0.00039680889529969686, 'samples': 8915136, 'steps': 46432, 'loss/train': 1.4590861797332764} 11/07/2021 03:47:33 - INFO - __main__ - Step 46434: {'lr': 0.0003968045999024754, 'samples': 8915328, 'steps': 46433, 'loss/train': 1.3307888507843018} 11/07/2021 03:47:33 - INFO - __main__ - Step 46435: {'lr': 0.0003968003044391061, 'samples': 8915520, 'steps': 46434, 'loss/train': 1.1671562194824219} 11/07/2021 03:47:33 - INFO - __main__ - Step 46436: {'lr': 0.00039679600890959077, 'samples': 8915712, 'steps': 46435, 'loss/train': 1.6333740949630737} 11/07/2021 03:47:34 - INFO - __main__ - Step 46437: {'lr': 0.0003967917133139315, 'samples': 8915904, 'steps': 46436, 'loss/train': 1.6342260837554932} 11/07/2021 03:47:35 - INFO - __main__ - Step 46438: {'lr': 0.00039678741765213006, 'samples': 8916096, 'steps': 46437, 'loss/train': 1.4422754049301147} 11/07/2021 03:47:35 - INFO - __main__ - Step 46439: {'lr': 0.0003967831219241885, 'samples': 8916288, 'steps': 46438, 'loss/train': 0.8115842938423157} 11/07/2021 03:47:35 - INFO - __main__ - Step 46440: {'lr': 0.00039677882613010885, 'samples': 8916480, 'steps': 46439, 'loss/train': 1.4022616147994995} 11/07/2021 03:47:36 - INFO - __main__ - Step 46441: {'lr': 0.0003967745302698928, 'samples': 8916672, 'steps': 46440, 'loss/train': 1.2536962032318115} 11/07/2021 03:47:36 - INFO - __main__ - Step 46442: {'lr': 0.0003967702343435424, 'samples': 8916864, 'steps': 46441, 'loss/train': 1.6496021747589111} 11/07/2021 03:47:37 - INFO - __main__ - Step 46443: {'lr': 0.00039676593835105966, 'samples': 8917056, 'steps': 46442, 'loss/train': 1.66366446018219} 11/07/2021 03:47:37 - INFO - __main__ - Step 46444: {'lr': 0.0003967616422924465, 'samples': 8917248, 'steps': 46443, 'loss/train': 0.9641647934913635} 11/07/2021 03:47:38 - INFO - __main__ - Step 46445: {'lr': 0.0003967573461677047, 'samples': 8917440, 'steps': 46444, 'loss/train': 1.5705403089523315} 11/07/2021 03:47:38 - INFO - __main__ - Step 46446: {'lr': 0.0003967530499768364, 'samples': 8917632, 'steps': 46445, 'loss/train': 1.5934021472930908} 11/07/2021 03:47:39 - INFO - __main__ - Step 46447: {'lr': 0.00039674875371984336, 'samples': 8917824, 'steps': 46446, 'loss/train': 1.6371523141860962} 11/07/2021 03:47:39 - INFO - __main__ - Step 46448: {'lr': 0.0003967444573967277, 'samples': 8918016, 'steps': 46447, 'loss/train': 1.5475355386734009} 11/07/2021 03:47:40 - INFO - __main__ - Step 46449: {'lr': 0.0003967401610074911, 'samples': 8918208, 'steps': 46448, 'loss/train': 1.378336787223816} 11/07/2021 03:47:40 - INFO - __main__ - Step 46450: {'lr': 0.0003967358645521357, 'samples': 8918400, 'steps': 46449, 'loss/train': 1.6174300909042358} 11/07/2021 03:47:41 - INFO - __main__ - Step 46451: {'lr': 0.00039673156803066346, 'samples': 8918592, 'steps': 46450, 'loss/train': 1.4035089015960693} 11/07/2021 03:47:41 - INFO - __main__ - Step 46452: {'lr': 0.00039672727144307617, 'samples': 8918784, 'steps': 46451, 'loss/train': 1.4450703859329224} 11/07/2021 03:47:41 - INFO - __main__ - Step 46453: {'lr': 0.0003967229747893759, 'samples': 8918976, 'steps': 46452, 'loss/train': 1.6976627111434937} 11/07/2021 03:47:42 - INFO - __main__ - Step 46454: {'lr': 0.0003967186780695645, 'samples': 8919168, 'steps': 46453, 'loss/train': 1.2306568622589111} 11/07/2021 03:47:43 - INFO - __main__ - Step 46455: {'lr': 0.0003967143812836439, 'samples': 8919360, 'steps': 46454, 'loss/train': 1.285661220550537} 11/07/2021 03:47:43 - INFO - __main__ - Step 46456: {'lr': 0.00039671008443161604, 'samples': 8919552, 'steps': 46455, 'loss/train': 1.6329265832901} 11/07/2021 03:47:43 - INFO - __main__ - Step 46457: {'lr': 0.00039670578751348283, 'samples': 8919744, 'steps': 46456, 'loss/train': 1.6372334957122803} 11/07/2021 03:47:44 - INFO - __main__ - Step 46458: {'lr': 0.0003967014905292464, 'samples': 8919936, 'steps': 46457, 'loss/train': 0.9876204133033752} 11/07/2021 03:47:45 - INFO - __main__ - Step 46459: {'lr': 0.0003966971934789084, 'samples': 8920128, 'steps': 46458, 'loss/train': 1.5900052785873413} 11/07/2021 03:47:45 - INFO - __main__ - Step 46460: {'lr': 0.0003966928963624711, 'samples': 8920320, 'steps': 46459, 'loss/train': 1.9930756092071533} 11/07/2021 03:47:46 - INFO - __main__ - Step 46461: {'lr': 0.0003966885991799361, 'samples': 8920512, 'steps': 46460, 'loss/train': 1.638953685760498} 11/07/2021 03:47:46 - INFO - __main__ - Step 46462: {'lr': 0.0003966843019313055, 'samples': 8920704, 'steps': 46461, 'loss/train': 1.3511921167373657} 11/07/2021 03:47:46 - INFO - __main__ - Step 46463: {'lr': 0.00039668000461658126, 'samples': 8920896, 'steps': 46462, 'loss/train': 0.6511127948760986} 11/07/2021 03:47:47 - INFO - __main__ - Step 46464: {'lr': 0.00039667570723576516, 'samples': 8921088, 'steps': 46463, 'loss/train': 0.8745549917221069} 11/07/2021 03:47:48 - INFO - __main__ - Step 46465: {'lr': 0.0003966714097888594, 'samples': 8921280, 'steps': 46464, 'loss/train': 1.5564099550247192} 11/07/2021 03:47:48 - INFO - __main__ - Step 46466: {'lr': 0.0003966671122758657, 'samples': 8921472, 'steps': 46465, 'loss/train': 1.396647334098816} 11/07/2021 03:47:48 - INFO - __main__ - Step 46467: {'lr': 0.00039666281469678604, 'samples': 8921664, 'steps': 46466, 'loss/train': 1.562657356262207} 11/07/2021 03:47:49 - INFO - __main__ - Step 46468: {'lr': 0.0003966585170516224, 'samples': 8921856, 'steps': 46467, 'loss/train': 1.7159804105758667} 11/07/2021 03:47:49 - INFO - __main__ - Step 46469: {'lr': 0.0003966542193403767, 'samples': 8922048, 'steps': 46468, 'loss/train': 1.0989930629730225} 11/07/2021 03:47:50 - INFO - __main__ - Step 46470: {'lr': 0.00039664992156305086, 'samples': 8922240, 'steps': 46469, 'loss/train': 1.3452023267745972} 11/07/2021 03:47:50 - INFO - __main__ - Step 46471: {'lr': 0.00039664562371964683, 'samples': 8922432, 'steps': 46470, 'loss/train': 1.5813894271850586} 11/07/2021 03:47:51 - INFO - __main__ - Step 46472: {'lr': 0.00039664132581016654, 'samples': 8922624, 'steps': 46471, 'loss/train': 1.7242380380630493} 11/07/2021 03:47:51 - INFO - __main__ - Step 46473: {'lr': 0.000396637027834612, 'samples': 8922816, 'steps': 46472, 'loss/train': 1.4563004970550537} 11/07/2021 03:47:51 - INFO - __main__ - Step 46474: {'lr': 0.000396632729792985, 'samples': 8923008, 'steps': 46473, 'loss/train': 1.0452595949172974} 11/07/2021 03:47:52 - INFO - __main__ - Step 46475: {'lr': 0.00039662843168528756, 'samples': 8923200, 'steps': 46474, 'loss/train': 1.0003538131713867} 11/07/2021 03:47:53 - INFO - __main__ - Step 46476: {'lr': 0.0003966241335115216, 'samples': 8923392, 'steps': 46475, 'loss/train': 1.5979691743850708} 11/07/2021 03:47:53 - INFO - __main__ - Step 46477: {'lr': 0.0003966198352716891, 'samples': 8923584, 'steps': 46476, 'loss/train': 1.578640341758728} 11/07/2021 03:47:54 - INFO - __main__ - Step 46478: {'lr': 0.000396615536965792, 'samples': 8923776, 'steps': 46477, 'loss/train': 1.4766262769699097} 11/07/2021 03:47:54 - INFO - __main__ - Step 46479: {'lr': 0.00039661123859383214, 'samples': 8923968, 'steps': 46478, 'loss/train': 1.2574434280395508} 11/07/2021 03:47:55 - INFO - __main__ - Step 46480: {'lr': 0.0003966069401558116, 'samples': 8924160, 'steps': 46479, 'loss/train': 1.872518539428711} 11/07/2021 03:47:55 - INFO - __main__ - Step 46481: {'lr': 0.0003966026416517321, 'samples': 8924352, 'steps': 46480, 'loss/train': 1.8166900873184204} 11/07/2021 03:47:56 - INFO - __main__ - Step 46482: {'lr': 0.0003965983430815958, 'samples': 8924544, 'steps': 46481, 'loss/train': 1.5014415979385376} 11/07/2021 03:47:56 - INFO - __main__ - Step 46483: {'lr': 0.00039659404444540456, 'samples': 8924736, 'steps': 46482, 'loss/train': 1.8454792499542236} 11/07/2021 03:47:57 - INFO - __main__ - Step 46484: {'lr': 0.0003965897457431602, 'samples': 8924928, 'steps': 46483, 'loss/train': 1.2768256664276123} 11/07/2021 03:47:57 - INFO - __main__ - Step 46485: {'lr': 0.00039658544697486486, 'samples': 8925120, 'steps': 46484, 'loss/train': 0.8908268213272095} 11/07/2021 03:47:58 - INFO - __main__ - Step 46486: {'lr': 0.0003965811481405204, 'samples': 8925312, 'steps': 46485, 'loss/train': 1.6718217134475708} 11/07/2021 03:47:58 - INFO - __main__ - Step 46487: {'lr': 0.00039657684924012873, 'samples': 8925504, 'steps': 46486, 'loss/train': 1.548094391822815} 11/07/2021 03:47:59 - INFO - __main__ - Step 46488: {'lr': 0.0003965725502736917, 'samples': 8925696, 'steps': 46487, 'loss/train': 1.6084216833114624} 11/07/2021 03:47:59 - INFO - __main__ - Step 46489: {'lr': 0.0003965682512412114, 'samples': 8925888, 'steps': 46488, 'loss/train': 1.1312930583953857} 11/07/2021 03:47:59 - INFO - __main__ - Step 46490: {'lr': 0.0003965639521426897, 'samples': 8926080, 'steps': 46489, 'loss/train': 1.8306597471237183} 11/07/2021 03:48:00 - INFO - __main__ - Step 46491: {'lr': 0.0003965596529781286, 'samples': 8926272, 'steps': 46490, 'loss/train': 1.3911941051483154} 11/07/2021 03:48:01 - INFO - __main__ - Step 46492: {'lr': 0.0003965553537475299, 'samples': 8926464, 'steps': 46491, 'loss/train': 2.3198862075805664} 11/07/2021 03:48:01 - INFO - __main__ - Step 46493: {'lr': 0.0003965510544508957, 'samples': 8926656, 'steps': 46492, 'loss/train': 1.6831574440002441} 11/07/2021 03:48:01 - INFO - __main__ - Step 46494: {'lr': 0.0003965467550882278, 'samples': 8926848, 'steps': 46493, 'loss/train': 1.5054762363433838} 11/07/2021 03:48:02 - INFO - __main__ - Step 46495: {'lr': 0.0003965424556595282, 'samples': 8927040, 'steps': 46494, 'loss/train': 1.6593341827392578} 11/07/2021 03:48:03 - INFO - __main__ - Step 46496: {'lr': 0.0003965381561647988, 'samples': 8927232, 'steps': 46495, 'loss/train': 1.6570336818695068} 11/07/2021 03:48:03 - INFO - __main__ - Step 46497: {'lr': 0.0003965338566040416, 'samples': 8927424, 'steps': 46496, 'loss/train': 1.7234264612197876} 11/07/2021 03:48:03 - INFO - __main__ - Step 46498: {'lr': 0.0003965295569772585, 'samples': 8927616, 'steps': 46497, 'loss/train': 1.8450464010238647} 11/07/2021 03:48:04 - INFO - __main__ - Step 46499: {'lr': 0.00039652525728445145, 'samples': 8927808, 'steps': 46498, 'loss/train': 1.6731663942337036} 11/07/2021 03:48:04 - INFO - __main__ - Step 46500: {'lr': 0.00039652095752562246, 'samples': 8928000, 'steps': 46499, 'loss/train': 1.2277780771255493} 11/07/2021 03:48:05 - INFO - __main__ - Step 46501: {'lr': 0.00039651665770077326, 'samples': 8928192, 'steps': 46500, 'loss/train': 1.7918261289596558} 11/07/2021 03:48:06 - INFO - __main__ - Step 46502: {'lr': 0.00039651235780990596, 'samples': 8928384, 'steps': 46501, 'loss/train': 1.228326678276062} 11/07/2021 03:48:06 - INFO - __main__ - Step 46503: {'lr': 0.00039650805785302245, 'samples': 8928576, 'steps': 46502, 'loss/train': 1.630692481994629} 11/07/2021 03:48:07 - INFO - __main__ - Step 46504: {'lr': 0.0003965037578301247, 'samples': 8928768, 'steps': 46503, 'loss/train': 1.2357648611068726} 11/07/2021 03:48:07 - INFO - __main__ - Step 46505: {'lr': 0.00039649945774121453, 'samples': 8928960, 'steps': 46504, 'loss/train': 1.0091569423675537} 11/07/2021 03:48:07 - INFO - __main__ - Step 46506: {'lr': 0.0003964951575862941, 'samples': 8929152, 'steps': 46505, 'loss/train': 1.7667460441589355} 11/07/2021 03:48:08 - INFO - __main__ - Step 46507: {'lr': 0.00039649085736536517, 'samples': 8929344, 'steps': 46506, 'loss/train': 0.6111887693405151} 11/07/2021 03:48:09 - INFO - __main__ - Step 46508: {'lr': 0.0003964865570784296, 'samples': 8929536, 'steps': 46507, 'loss/train': 1.2216861248016357} 11/07/2021 03:48:09 - INFO - __main__ - Step 46509: {'lr': 0.00039648225672548953, 'samples': 8929728, 'steps': 46508, 'loss/train': 1.1576570272445679} 11/07/2021 03:48:09 - INFO - __main__ - Step 46510: {'lr': 0.00039647795630654687, 'samples': 8929920, 'steps': 46509, 'loss/train': 1.639652132987976} 11/07/2021 03:48:10 - INFO - __main__ - Step 46511: {'lr': 0.00039647365582160345, 'samples': 8930112, 'steps': 46510, 'loss/train': 1.9230080842971802} 11/07/2021 03:48:12 - INFO - __main__ - Step 46512: {'lr': 0.00039646935527066124, 'samples': 8930304, 'steps': 46511, 'loss/train': 1.6091551780700684} 11/07/2021 03:48:13 - INFO - __main__ - Step 46513: {'lr': 0.00039646505465372223, 'samples': 8930496, 'steps': 46512, 'loss/train': 1.5849133729934692} 11/07/2021 03:48:13 - INFO - __main__ - Step 46514: {'lr': 0.0003964607539707884, 'samples': 8930688, 'steps': 46513, 'loss/train': 3.213456392288208} 11/07/2021 03:48:13 - INFO - __main__ - Step 46515: {'lr': 0.0003964564532218615, 'samples': 8930880, 'steps': 46514, 'loss/train': 3.0650737285614014} 11/07/2021 03:48:14 - INFO - __main__ - Step 46516: {'lr': 0.0003964521524069436, 'samples': 8931072, 'steps': 46515, 'loss/train': 3.095937967300415} 11/07/2021 03:48:14 - INFO - __main__ - Step 46517: {'lr': 0.00039644785152603666, 'samples': 8931264, 'steps': 46516, 'loss/train': 3.1416046619415283} 11/07/2021 03:48:14 - INFO - __main__ - Step 46518: {'lr': 0.0003964435505791425, 'samples': 8931456, 'steps': 46517, 'loss/train': 1.620643973350525} 11/07/2021 03:48:15 - INFO - __main__ - Step 46519: {'lr': 0.0003964392495662632, 'samples': 8931648, 'steps': 46518, 'loss/train': 1.5931931734085083} 11/07/2021 03:48:16 - INFO - __main__ - Step 46520: {'lr': 0.0003964349484874007, 'samples': 8931840, 'steps': 46519, 'loss/train': 1.1818599700927734} 11/07/2021 03:48:16 - INFO - __main__ - Step 46521: {'lr': 0.00039643064734255675, 'samples': 8932032, 'steps': 46520, 'loss/train': 1.839593529701233} 11/07/2021 03:48:16 - INFO - __main__ - Step 46522: {'lr': 0.0003964263461317334, 'samples': 8932224, 'steps': 46521, 'loss/train': 1.2800414562225342} 11/07/2021 03:48:17 - INFO - __main__ - Step 46523: {'lr': 0.0003964220448549327, 'samples': 8932416, 'steps': 46522, 'loss/train': 1.3235206604003906} 11/07/2021 03:48:17 - INFO - __main__ - Step 46524: {'lr': 0.0003964177435121565, 'samples': 8932608, 'steps': 46523, 'loss/train': 1.8287054300308228} 11/07/2021 03:48:18 - INFO - __main__ - Step 46525: {'lr': 0.00039641344210340665, 'samples': 8932800, 'steps': 46524, 'loss/train': 1.641908049583435} 11/07/2021 03:48:18 - INFO - __main__ - Step 46526: {'lr': 0.00039640914062868515, 'samples': 8932992, 'steps': 46525, 'loss/train': 1.56090247631073} 11/07/2021 03:48:19 - INFO - __main__ - Step 46527: {'lr': 0.000396404839087994, 'samples': 8933184, 'steps': 46526, 'loss/train': 0.6343189477920532} 11/07/2021 03:48:19 - INFO - __main__ - Step 46528: {'lr': 0.0003964005374813351, 'samples': 8933376, 'steps': 46527, 'loss/train': 1.3692983388900757} 11/07/2021 03:48:20 - INFO - __main__ - Step 46529: {'lr': 0.0003963962358087103, 'samples': 8933568, 'steps': 46528, 'loss/train': 1.516156554222107} 11/07/2021 03:48:21 - INFO - __main__ - Step 46530: {'lr': 0.00039639193407012166, 'samples': 8933760, 'steps': 46529, 'loss/train': 1.586659550666809} 11/07/2021 03:48:21 - INFO - __main__ - Step 46531: {'lr': 0.00039638763226557106, 'samples': 8933952, 'steps': 46530, 'loss/train': 1.5941252708435059} 11/07/2021 03:48:21 - INFO - __main__ - Step 46532: {'lr': 0.0003963833303950605, 'samples': 8934144, 'steps': 46531, 'loss/train': 1.4129873514175415} 11/07/2021 03:48:22 - INFO - __main__ - Step 46533: {'lr': 0.00039637902845859185, 'samples': 8934336, 'steps': 46532, 'loss/train': 1.332977294921875} 11/07/2021 03:48:22 - INFO - __main__ - Step 46534: {'lr': 0.00039637472645616704, 'samples': 8934528, 'steps': 46533, 'loss/train': 1.740139126777649} 11/07/2021 03:48:23 - INFO - __main__ - Step 46535: {'lr': 0.00039637042438778804, 'samples': 8934720, 'steps': 46534, 'loss/train': 1.2434988021850586} 11/07/2021 03:48:24 - INFO - __main__ - Step 46536: {'lr': 0.0003963661222534568, 'samples': 8934912, 'steps': 46535, 'loss/train': 1.1036967039108276} 11/07/2021 03:48:24 - INFO - __main__ - Step 46537: {'lr': 0.00039636182005317524, 'samples': 8935104, 'steps': 46536, 'loss/train': 1.4403520822525024} 11/07/2021 03:48:24 - INFO - __main__ - Step 46538: {'lr': 0.0003963575177869453, 'samples': 8935296, 'steps': 46537, 'loss/train': 1.4397002458572388} 11/07/2021 03:48:25 - INFO - __main__ - Step 46539: {'lr': 0.00039635321545476894, 'samples': 8935488, 'steps': 46538, 'loss/train': 1.5650548934936523} 11/07/2021 03:48:26 - INFO - __main__ - Step 46540: {'lr': 0.00039634891305664806, 'samples': 8935680, 'steps': 46539, 'loss/train': 2.071367025375366} 11/07/2021 03:48:26 - INFO - __main__ - Step 46541: {'lr': 0.00039634461059258466, 'samples': 8935872, 'steps': 46540, 'loss/train': 1.7746798992156982} 11/07/2021 03:48:26 - INFO - __main__ - Step 46542: {'lr': 0.0003963403080625806, 'samples': 8936064, 'steps': 46541, 'loss/train': 1.6014556884765625} 11/07/2021 03:48:27 - INFO - __main__ - Step 46543: {'lr': 0.00039633600546663784, 'samples': 8936256, 'steps': 46542, 'loss/train': 1.584721326828003} 11/07/2021 03:48:27 - INFO - __main__ - Step 46544: {'lr': 0.00039633170280475833, 'samples': 8936448, 'steps': 46543, 'loss/train': 1.3739217519760132} 11/07/2021 03:48:28 - INFO - __main__ - Step 46545: {'lr': 0.000396327400076944, 'samples': 8936640, 'steps': 46544, 'loss/train': 1.6864553689956665} 11/07/2021 03:48:28 - INFO - __main__ - Step 46546: {'lr': 0.0003963230972831968, 'samples': 8936832, 'steps': 46545, 'loss/train': 1.6099401712417603} 11/07/2021 03:48:29 - INFO - __main__ - Step 46547: {'lr': 0.0003963187944235188, 'samples': 8937024, 'steps': 46546, 'loss/train': 1.7182683944702148} 11/07/2021 03:48:29 - INFO - __main__ - Step 46548: {'lr': 0.00039631449149791164, 'samples': 8937216, 'steps': 46547, 'loss/train': 1.5881284475326538} 11/07/2021 03:48:29 - INFO - __main__ - Step 46549: {'lr': 0.0003963101885063776, 'samples': 8937408, 'steps': 46548, 'loss/train': 1.5946006774902344} 11/07/2021 03:48:31 - INFO - __main__ - Step 46550: {'lr': 0.00039630588544891835, 'samples': 8937600, 'steps': 46549, 'loss/train': 1.1868088245391846} 11/07/2021 03:48:31 - INFO - __main__ - Step 46551: {'lr': 0.0003963015823255359, 'samples': 8937792, 'steps': 46550, 'loss/train': 1.6669026613235474} 11/07/2021 03:48:31 - INFO - __main__ - Step 46552: {'lr': 0.00039629727913623213, 'samples': 8937984, 'steps': 46551, 'loss/train': 1.6167420148849487} 11/07/2021 03:48:32 - INFO - __main__ - Step 46553: {'lr': 0.0003962929758810092, 'samples': 8938176, 'steps': 46552, 'loss/train': 1.8316158056259155} 11/07/2021 03:48:32 - INFO - __main__ - Step 46554: {'lr': 0.00039628867255986887, 'samples': 8938368, 'steps': 46553, 'loss/train': 2.182403087615967} 11/07/2021 03:48:33 - INFO - __main__ - Step 46555: {'lr': 0.0003962843691728132, 'samples': 8938560, 'steps': 46554, 'loss/train': 1.6754432916641235} 11/07/2021 03:48:33 - INFO - __main__ - Step 46556: {'lr': 0.000396280065719844, 'samples': 8938752, 'steps': 46555, 'loss/train': 1.8905868530273438} 11/07/2021 03:48:34 - INFO - __main__ - Step 46557: {'lr': 0.0003962757622009632, 'samples': 8938944, 'steps': 46556, 'loss/train': 1.4807912111282349} 11/07/2021 03:48:34 - INFO - __main__ - Step 46558: {'lr': 0.0003962714586161729, 'samples': 8939136, 'steps': 46557, 'loss/train': 1.1115580797195435} 11/07/2021 03:48:34 - INFO - __main__ - Step 46559: {'lr': 0.0003962671549654748, 'samples': 8939328, 'steps': 46558, 'loss/train': 0.7697739601135254} 11/07/2021 03:48:35 - INFO - __main__ - Step 46560: {'lr': 0.00039626285124887107, 'samples': 8939520, 'steps': 46559, 'loss/train': 1.631230115890503} 11/07/2021 03:48:36 - INFO - __main__ - Step 46561: {'lr': 0.00039625854746636356, 'samples': 8939712, 'steps': 46560, 'loss/train': 1.4189014434814453} 11/07/2021 03:48:36 - INFO - __main__ - Step 46562: {'lr': 0.0003962542436179542, 'samples': 8939904, 'steps': 46561, 'loss/train': 1.2800192832946777} 11/07/2021 03:48:37 - INFO - __main__ - Step 46563: {'lr': 0.0003962499397036449, 'samples': 8940096, 'steps': 46562, 'loss/train': 1.153496503829956} 11/07/2021 03:48:37 - INFO - __main__ - Step 46564: {'lr': 0.0003962456357234377, 'samples': 8940288, 'steps': 46563, 'loss/train': 1.1571232080459595} 11/07/2021 03:48:38 - INFO - __main__ - Step 46565: {'lr': 0.0003962413316773344, 'samples': 8940480, 'steps': 46564, 'loss/train': 1.484740138053894} 11/07/2021 03:48:38 - INFO - __main__ - Step 46566: {'lr': 0.000396237027565337, 'samples': 8940672, 'steps': 46565, 'loss/train': 2.0003304481506348} 11/07/2021 03:48:39 - INFO - __main__ - Step 46567: {'lr': 0.00039623272338744754, 'samples': 8940864, 'steps': 46566, 'loss/train': 1.615573763847351} 11/07/2021 03:48:39 - INFO - __main__ - Step 46568: {'lr': 0.00039622841914366784, 'samples': 8941056, 'steps': 46567, 'loss/train': 1.545198917388916} 11/07/2021 03:48:39 - INFO - __main__ - Step 46569: {'lr': 0.0003962241148339999, 'samples': 8941248, 'steps': 46568, 'loss/train': 1.480456829071045} 11/07/2021 03:48:40 - INFO - __main__ - Step 46570: {'lr': 0.0003962198104584456, 'samples': 8941440, 'steps': 46569, 'loss/train': 1.4060744047164917} 11/07/2021 03:48:41 - INFO - __main__ - Step 46571: {'lr': 0.00039621550601700683, 'samples': 8941632, 'steps': 46570, 'loss/train': 1.5286009311676025} 11/07/2021 03:48:41 - INFO - __main__ - Step 46572: {'lr': 0.0003962112015096857, 'samples': 8941824, 'steps': 46571, 'loss/train': 1.2991770505905151} 11/07/2021 03:48:42 - INFO - __main__ - Step 46573: {'lr': 0.00039620689693648404, 'samples': 8942016, 'steps': 46572, 'loss/train': 0.5695983171463013} 11/07/2021 03:48:42 - INFO - __main__ - Step 46574: {'lr': 0.0003962025922974038, 'samples': 8942208, 'steps': 46573, 'loss/train': 1.6276212930679321} 11/07/2021 03:48:42 - INFO - __main__ - Step 46575: {'lr': 0.00039619828759244693, 'samples': 8942400, 'steps': 46574, 'loss/train': 1.5821086168289185} 11/07/2021 03:48:43 - INFO - __main__ - Step 46576: {'lr': 0.00039619398282161536, 'samples': 8942592, 'steps': 46575, 'loss/train': 1.1809090375900269} 11/07/2021 03:48:44 - INFO - __main__ - Step 46577: {'lr': 0.000396189677984911, 'samples': 8942784, 'steps': 46576, 'loss/train': 1.2262035608291626} 11/07/2021 03:48:44 - INFO - __main__ - Step 46578: {'lr': 0.00039618537308233593, 'samples': 8942976, 'steps': 46577, 'loss/train': 1.1441209316253662} 11/07/2021 03:48:44 - INFO - __main__ - Step 46579: {'lr': 0.00039618106811389187, 'samples': 8943168, 'steps': 46578, 'loss/train': 1.8602534532546997} 11/07/2021 03:48:45 - INFO - __main__ - Step 46580: {'lr': 0.00039617676307958095, 'samples': 8943360, 'steps': 46579, 'loss/train': 1.4358470439910889} 11/07/2021 03:48:46 - INFO - __main__ - Step 46581: {'lr': 0.000396172457979405, 'samples': 8943552, 'steps': 46580, 'loss/train': 0.9704254269599915} 11/07/2021 03:48:46 - INFO - __main__ - Step 46582: {'lr': 0.0003961681528133661, 'samples': 8943744, 'steps': 46581, 'loss/train': 1.5471473932266235} 11/07/2021 03:48:46 - INFO - __main__ - Step 46583: {'lr': 0.00039616384758146594, 'samples': 8943936, 'steps': 46582, 'loss/train': 1.4239884614944458} 11/07/2021 03:48:47 - INFO - __main__ - Step 46584: {'lr': 0.0003961595422837067, 'samples': 8944128, 'steps': 46583, 'loss/train': 1.2873655557632446} 11/07/2021 03:48:47 - INFO - __main__ - Step 46585: {'lr': 0.0003961552369200902, 'samples': 8944320, 'steps': 46584, 'loss/train': 1.6000466346740723} 11/07/2021 03:48:49 - INFO - __main__ - Step 46586: {'lr': 0.0003961509314906184, 'samples': 8944512, 'steps': 46585, 'loss/train': 1.4519565105438232} 11/07/2021 03:48:49 - INFO - __main__ - Step 46587: {'lr': 0.00039614662599529325, 'samples': 8944704, 'steps': 46586, 'loss/train': 1.7165050506591797} 11/07/2021 03:48:49 - INFO - __main__ - Step 46588: {'lr': 0.0003961423204341167, 'samples': 8944896, 'steps': 46587, 'loss/train': 1.883649230003357} 11/07/2021 03:48:50 - INFO - __main__ - Step 46589: {'lr': 0.00039613801480709065, 'samples': 8945088, 'steps': 46588, 'loss/train': 1.7284959554672241} 11/07/2021 03:48:50 - INFO - __main__ - Step 46590: {'lr': 0.00039613370911421706, 'samples': 8945280, 'steps': 46589, 'loss/train': 1.1167792081832886} 11/07/2021 03:48:50 - INFO - __main__ - Step 46591: {'lr': 0.00039612940335549793, 'samples': 8945472, 'steps': 46590, 'loss/train': 1.5884649753570557} 11/07/2021 03:48:51 - INFO - __main__ - Step 46592: {'lr': 0.0003961250975309351, 'samples': 8945664, 'steps': 46591, 'loss/train': 1.5987987518310547} 11/07/2021 03:48:52 - INFO - __main__ - Step 46593: {'lr': 0.0003961207916405305, 'samples': 8945856, 'steps': 46592, 'loss/train': 1.3909083604812622} 11/07/2021 03:48:52 - INFO - __main__ - Step 46594: {'lr': 0.00039611648568428626, 'samples': 8946048, 'steps': 46593, 'loss/train': 1.7174088954925537} 11/07/2021 03:48:52 - INFO - __main__ - Step 46595: {'lr': 0.0003961121796622041, 'samples': 8946240, 'steps': 46594, 'loss/train': 1.463890552520752} 11/07/2021 03:48:53 - INFO - __main__ - Step 46596: {'lr': 0.000396107873574286, 'samples': 8946432, 'steps': 46595, 'loss/train': 1.6015509366989136} 11/07/2021 03:48:54 - INFO - __main__ - Step 46597: {'lr': 0.00039610356742053403, 'samples': 8946624, 'steps': 46596, 'loss/train': 0.988956868648529} 11/07/2021 03:48:54 - INFO - __main__ - Step 46598: {'lr': 0.0003960992612009501, 'samples': 8946816, 'steps': 46597, 'loss/train': 1.4584623575210571} 11/07/2021 03:48:54 - INFO - __main__ - Step 46599: {'lr': 0.0003960949549155359, 'samples': 8947008, 'steps': 46598, 'loss/train': 1.323596715927124} 11/07/2021 03:48:55 - INFO - __main__ - Step 46600: {'lr': 0.0003960906485642938, 'samples': 8947200, 'steps': 46599, 'loss/train': 1.269243597984314} 11/07/2021 03:48:55 - INFO - __main__ - Step 46601: {'lr': 0.0003960863421472254, 'samples': 8947392, 'steps': 46600, 'loss/train': 1.7932299375534058} 11/07/2021 03:48:56 - INFO - __main__ - Step 46602: {'lr': 0.00039608203566433273, 'samples': 8947584, 'steps': 46601, 'loss/train': 1.4171199798583984} 11/07/2021 03:48:57 - INFO - __main__ - Step 46603: {'lr': 0.00039607772911561776, 'samples': 8947776, 'steps': 46602, 'loss/train': 1.5545135736465454} 11/07/2021 03:48:57 - INFO - __main__ - Step 46604: {'lr': 0.00039607342250108234, 'samples': 8947968, 'steps': 46603, 'loss/train': 1.7056605815887451} 11/07/2021 03:48:57 - INFO - __main__ - Step 46605: {'lr': 0.0003960691158207287, 'samples': 8948160, 'steps': 46604, 'loss/train': 1.4816783666610718} 11/07/2021 03:48:58 - INFO - __main__ - Step 46606: {'lr': 0.0003960648090745584, 'samples': 8948352, 'steps': 46605, 'loss/train': 1.4529831409454346} 11/07/2021 03:48:59 - INFO - __main__ - Step 46607: {'lr': 0.00039606050226257354, 'samples': 8948544, 'steps': 46606, 'loss/train': 2.054903030395508} 11/07/2021 03:48:59 - INFO - __main__ - Step 46608: {'lr': 0.00039605619538477617, 'samples': 8948736, 'steps': 46607, 'loss/train': 1.7597931623458862} 11/07/2021 03:49:00 - INFO - __main__ - Step 46609: {'lr': 0.00039605188844116815, 'samples': 8948928, 'steps': 46608, 'loss/train': 1.247678279876709} 11/07/2021 03:49:00 - INFO - __main__ - Step 46610: {'lr': 0.0003960475814317512, 'samples': 8949120, 'steps': 46609, 'loss/train': 2.0703155994415283} 11/07/2021 03:49:00 - INFO - __main__ - Step 46611: {'lr': 0.0003960432743565277, 'samples': 8949312, 'steps': 46610, 'loss/train': 1.2479217052459717} 11/07/2021 03:49:01 - INFO - __main__ - Step 46612: {'lr': 0.00039603896721549924, 'samples': 8949504, 'steps': 46611, 'loss/train': 0.7806252837181091} 11/07/2021 03:49:02 - INFO - __main__ - Step 46613: {'lr': 0.0003960346600086679, 'samples': 8949696, 'steps': 46612, 'loss/train': 1.7482831478118896} 11/07/2021 03:49:02 - INFO - __main__ - Step 46614: {'lr': 0.0003960303527360356, 'samples': 8949888, 'steps': 46613, 'loss/train': 1.7881076335906982} 11/07/2021 03:49:02 - INFO - __main__ - Step 46615: {'lr': 0.00039602604539760425, 'samples': 8950080, 'steps': 46614, 'loss/train': 1.4864344596862793} 11/07/2021 03:49:03 - INFO - __main__ - Step 46616: {'lr': 0.0003960217379933758, 'samples': 8950272, 'steps': 46615, 'loss/train': 0.23502303659915924} 11/07/2021 03:49:03 - INFO - __main__ - Step 46617: {'lr': 0.00039601743052335224, 'samples': 8950464, 'steps': 46616, 'loss/train': 1.6170903444290161} 11/07/2021 03:49:04 - INFO - __main__ - Step 46618: {'lr': 0.00039601312298753554, 'samples': 8950656, 'steps': 46617, 'loss/train': 1.5993255376815796} 11/07/2021 03:49:04 - INFO - __main__ - Step 46619: {'lr': 0.0003960088153859275, 'samples': 8950848, 'steps': 46618, 'loss/train': 1.3210651874542236} 11/07/2021 03:49:05 - INFO - __main__ - Step 46620: {'lr': 0.0003960045077185301, 'samples': 8951040, 'steps': 46619, 'loss/train': 1.428301453590393} 11/07/2021 03:49:05 - INFO - __main__ - Step 46621: {'lr': 0.0003960001999853454, 'samples': 8951232, 'steps': 46620, 'loss/train': 1.6477384567260742} 11/07/2021 03:49:06 - INFO - __main__ - Step 46622: {'lr': 0.00039599589218637535, 'samples': 8951424, 'steps': 46621, 'loss/train': 1.698339581489563} 11/07/2021 03:49:06 - INFO - __main__ - Step 46623: {'lr': 0.00039599158432162163, 'samples': 8951616, 'steps': 46622, 'loss/train': 1.2999848127365112} 11/07/2021 03:49:07 - INFO - __main__ - Step 46624: {'lr': 0.00039598727639108644, 'samples': 8951808, 'steps': 46623, 'loss/train': 1.4000439643859863} 11/07/2021 03:49:07 - INFO - __main__ - Step 46625: {'lr': 0.00039598296839477167, 'samples': 8952000, 'steps': 46624, 'loss/train': 1.7494179010391235} 11/07/2021 03:49:08 - INFO - __main__ - Step 46626: {'lr': 0.00039597866033267917, 'samples': 8952192, 'steps': 46625, 'loss/train': 1.659546136856079} 11/07/2021 03:49:08 - INFO - __main__ - Step 46627: {'lr': 0.00039597435220481094, 'samples': 8952384, 'steps': 46626, 'loss/train': 1.4539217948913574} 11/07/2021 03:49:09 - INFO - __main__ - Step 46628: {'lr': 0.0003959700440111689, 'samples': 8952576, 'steps': 46627, 'loss/train': 1.1460973024368286} 11/07/2021 03:49:10 - INFO - __main__ - Step 46629: {'lr': 0.00039596573575175506, 'samples': 8952768, 'steps': 46628, 'loss/train': 1.5490814447402954} 11/07/2021 03:49:10 - INFO - __main__ - Step 46630: {'lr': 0.00039596142742657125, 'samples': 8952960, 'steps': 46629, 'loss/train': 0.5390429496765137} 11/07/2021 03:49:10 - INFO - __main__ - Step 46631: {'lr': 0.00039595711903561947, 'samples': 8953152, 'steps': 46630, 'loss/train': 1.2273478507995605} 11/07/2021 03:49:11 - INFO - __main__ - Step 46632: {'lr': 0.0003959528105789018, 'samples': 8953344, 'steps': 46631, 'loss/train': 2.5275912284851074} 11/07/2021 03:49:11 - INFO - __main__ - Step 46633: {'lr': 0.00039594850205641985, 'samples': 8953536, 'steps': 46632, 'loss/train': 2.9538755416870117} 11/07/2021 03:49:12 - INFO - __main__ - Step 46634: {'lr': 0.0003959441934681759, 'samples': 8953728, 'steps': 46633, 'loss/train': 1.9540854692459106} 11/07/2021 03:49:12 - INFO - __main__ - Step 46635: {'lr': 0.00039593988481417174, 'samples': 8953920, 'steps': 46634, 'loss/train': 1.362870454788208} 11/07/2021 03:49:13 - INFO - __main__ - Step 46636: {'lr': 0.0003959355760944093, 'samples': 8954112, 'steps': 46635, 'loss/train': 1.3720383644104004} 11/07/2021 03:49:13 - INFO - __main__ - Step 46637: {'lr': 0.0003959312673088905, 'samples': 8954304, 'steps': 46636, 'loss/train': 1.8665322065353394} 11/07/2021 03:49:13 - INFO - __main__ - Step 46638: {'lr': 0.0003959269584576173, 'samples': 8954496, 'steps': 46637, 'loss/train': 1.192275881767273} 11/07/2021 03:49:14 - INFO - __main__ - Step 46639: {'lr': 0.00039592264954059177, 'samples': 8954688, 'steps': 46638, 'loss/train': 1.9641491174697876} 11/07/2021 03:49:15 - INFO - __main__ - Step 46640: {'lr': 0.00039591834055781566, 'samples': 8954880, 'steps': 46639, 'loss/train': 1.2691432237625122} 11/07/2021 03:49:15 - INFO - __main__ - Step 46641: {'lr': 0.0003959140315092911, 'samples': 8955072, 'steps': 46640, 'loss/train': 1.4783892631530762} 11/07/2021 03:49:16 - INFO - __main__ - Step 46642: {'lr': 0.00039590972239501984, 'samples': 8955264, 'steps': 46641, 'loss/train': 1.8274744749069214} 11/07/2021 03:49:16 - INFO - __main__ - Step 46643: {'lr': 0.0003959054132150039, 'samples': 8955456, 'steps': 46642, 'loss/train': 1.2819947004318237} 11/07/2021 03:49:16 - INFO - __main__ - Step 46644: {'lr': 0.00039590110396924526, 'samples': 8955648, 'steps': 46643, 'loss/train': 2.006920337677002} 11/07/2021 03:49:17 - INFO - __main__ - Step 46645: {'lr': 0.0003958967946577459, 'samples': 8955840, 'steps': 46644, 'loss/train': 1.395865797996521} 11/07/2021 03:49:18 - INFO - __main__ - Step 46646: {'lr': 0.0003958924852805076, 'samples': 8956032, 'steps': 46645, 'loss/train': 1.7654708623886108} 11/07/2021 03:49:18 - INFO - __main__ - Step 46647: {'lr': 0.00039588817583753236, 'samples': 8956224, 'steps': 46646, 'loss/train': 1.3263219594955444} 11/07/2021 03:49:18 - INFO - __main__ - Step 46648: {'lr': 0.0003958838663288223, 'samples': 8956416, 'steps': 46647, 'loss/train': 1.6043410301208496} 11/07/2021 03:49:19 - INFO - __main__ - Step 46649: {'lr': 0.00039587955675437917, 'samples': 8956608, 'steps': 46648, 'loss/train': 1.3933658599853516} 11/07/2021 03:49:20 - INFO - __main__ - Step 46650: {'lr': 0.00039587524711420487, 'samples': 8956800, 'steps': 46649, 'loss/train': 1.3795287609100342} 11/07/2021 03:49:20 - INFO - __main__ - Step 46651: {'lr': 0.00039587093740830147, 'samples': 8956992, 'steps': 46650, 'loss/train': 0.8048598766326904} 11/07/2021 03:49:20 - INFO - __main__ - Step 46652: {'lr': 0.0003958666276366709, 'samples': 8957184, 'steps': 46651, 'loss/train': 1.2594974040985107} 11/07/2021 03:49:21 - INFO - __main__ - Step 46653: {'lr': 0.00039586231779931516, 'samples': 8957376, 'steps': 46652, 'loss/train': 0.3006884753704071} 11/07/2021 03:49:21 - INFO - __main__ - Step 46654: {'lr': 0.000395858007896236, 'samples': 8957568, 'steps': 46653, 'loss/train': 1.400397777557373} 11/07/2021 03:49:22 - INFO - __main__ - Step 46655: {'lr': 0.0003958536979274355, 'samples': 8957760, 'steps': 46654, 'loss/train': 1.100865125656128} 11/07/2021 03:49:23 - INFO - __main__ - Step 46656: {'lr': 0.00039584938789291563, 'samples': 8957952, 'steps': 46655, 'loss/train': 1.5183429718017578} 11/07/2021 03:49:23 - INFO - __main__ - Step 46657: {'lr': 0.0003958450777926782, 'samples': 8958144, 'steps': 46656, 'loss/train': 1.4449796676635742} 11/07/2021 03:49:23 - INFO - __main__ - Step 46658: {'lr': 0.00039584076762672526, 'samples': 8958336, 'steps': 46657, 'loss/train': 1.4669604301452637} 11/07/2021 03:49:24 - INFO - __main__ - Step 46659: {'lr': 0.0003958364573950587, 'samples': 8958528, 'steps': 46658, 'loss/train': 1.6107738018035889} 11/07/2021 03:49:25 - INFO - __main__ - Step 46660: {'lr': 0.00039583214709768054, 'samples': 8958720, 'steps': 46659, 'loss/train': 1.4138153791427612} 11/07/2021 03:49:25 - INFO - __main__ - Step 46661: {'lr': 0.0003958278367345926, 'samples': 8958912, 'steps': 46660, 'loss/train': 1.656166911125183} 11/07/2021 03:49:25 - INFO - __main__ - Step 46662: {'lr': 0.00039582352630579697, 'samples': 8959104, 'steps': 46661, 'loss/train': 1.3021039962768555} 11/07/2021 03:49:26 - INFO - __main__ - Step 46663: {'lr': 0.00039581921581129543, 'samples': 8959296, 'steps': 46662, 'loss/train': 1.5526115894317627} 11/07/2021 03:49:26 - INFO - __main__ - Step 46664: {'lr': 0.00039581490525109005, 'samples': 8959488, 'steps': 46663, 'loss/train': 1.0968859195709229} 11/07/2021 03:49:27 - INFO - __main__ - Step 46665: {'lr': 0.00039581059462518266, 'samples': 8959680, 'steps': 46664, 'loss/train': 1.296531081199646} 11/07/2021 03:49:27 - INFO - __main__ - Step 46666: {'lr': 0.00039580628393357534, 'samples': 8959872, 'steps': 46665, 'loss/train': 1.4447951316833496} 11/07/2021 03:49:28 - INFO - __main__ - Step 46667: {'lr': 0.0003958019731762699, 'samples': 8960064, 'steps': 46666, 'loss/train': 1.739152193069458} 11/07/2021 03:49:28 - INFO - __main__ - Step 46668: {'lr': 0.0003957976623532684, 'samples': 8960256, 'steps': 46667, 'loss/train': 1.4870009422302246} 11/07/2021 03:49:29 - INFO - __main__ - Step 46669: {'lr': 0.0003957933514645727, 'samples': 8960448, 'steps': 46668, 'loss/train': 1.5848653316497803} 11/07/2021 03:49:30 - INFO - __main__ - Step 46670: {'lr': 0.00039578904051018474, 'samples': 8960640, 'steps': 46669, 'loss/train': 1.4008269309997559} 11/07/2021 03:49:30 - INFO - __main__ - Step 46671: {'lr': 0.00039578472949010644, 'samples': 8960832, 'steps': 46670, 'loss/train': 1.578660249710083} 11/07/2021 03:49:30 - INFO - __main__ - Step 46672: {'lr': 0.00039578041840433986, 'samples': 8961024, 'steps': 46671, 'loss/train': 1.870067834854126} 11/07/2021 03:49:31 - INFO - __main__ - Step 46673: {'lr': 0.00039577610725288694, 'samples': 8961216, 'steps': 46672, 'loss/train': 1.4752981662750244} 11/07/2021 03:49:31 - INFO - __main__ - Step 46674: {'lr': 0.0003957717960357494, 'samples': 8961408, 'steps': 46673, 'loss/train': 1.3758714199066162} 11/07/2021 03:49:31 - INFO - __main__ - Step 46675: {'lr': 0.0003957674847529295, 'samples': 8961600, 'steps': 46674, 'loss/train': 1.70477294921875} 11/07/2021 03:49:32 - INFO - __main__ - Step 46676: {'lr': 0.00039576317340442893, 'samples': 8961792, 'steps': 46675, 'loss/train': 1.4040971994400024} 11/07/2021 03:49:33 - INFO - __main__ - Step 46677: {'lr': 0.00039575886199024976, 'samples': 8961984, 'steps': 46676, 'loss/train': 1.8378627300262451} 11/07/2021 03:49:33 - INFO - __main__ - Step 46678: {'lr': 0.0003957545505103939, 'samples': 8962176, 'steps': 46677, 'loss/train': 2.1272621154785156} 11/07/2021 03:49:33 - INFO - __main__ - Step 46679: {'lr': 0.0003957502389648632, 'samples': 8962368, 'steps': 46678, 'loss/train': 0.7058058381080627} 11/07/2021 03:49:34 - INFO - __main__ - Step 46680: {'lr': 0.00039574592735365976, 'samples': 8962560, 'steps': 46679, 'loss/train': 1.6294348239898682} 11/07/2021 03:49:35 - INFO - __main__ - Step 46681: {'lr': 0.00039574161567678545, 'samples': 8962752, 'steps': 46680, 'loss/train': 0.8606553673744202} 11/07/2021 03:49:35 - INFO - __main__ - Step 46682: {'lr': 0.00039573730393424226, 'samples': 8962944, 'steps': 46681, 'loss/train': 1.6506311893463135} 11/07/2021 03:49:36 - INFO - __main__ - Step 46683: {'lr': 0.000395732992126032, 'samples': 8963136, 'steps': 46682, 'loss/train': 1.470301866531372} 11/07/2021 03:49:36 - INFO - __main__ - Step 46684: {'lr': 0.00039572868025215677, 'samples': 8963328, 'steps': 46683, 'loss/train': 1.0998947620391846} 11/07/2021 03:49:36 - INFO - __main__ - Step 46685: {'lr': 0.0003957243683126184, 'samples': 8963520, 'steps': 46684, 'loss/train': 1.2275121212005615} 11/07/2021 03:49:37 - INFO - __main__ - Step 46686: {'lr': 0.00039572005630741886, 'samples': 8963712, 'steps': 46685, 'loss/train': 1.9175535440444946} 11/07/2021 03:49:38 - INFO - __main__ - Step 46687: {'lr': 0.00039571574423656017, 'samples': 8963904, 'steps': 46686, 'loss/train': 1.5435117483139038} 11/07/2021 03:49:38 - INFO - __main__ - Step 46688: {'lr': 0.0003957114321000442, 'samples': 8964096, 'steps': 46687, 'loss/train': 1.4155611991882324} 11/07/2021 03:49:38 - INFO - __main__ - Step 46689: {'lr': 0.0003957071198978729, 'samples': 8964288, 'steps': 46688, 'loss/train': 1.3612478971481323} 11/07/2021 03:49:39 - INFO - __main__ - Step 46690: {'lr': 0.00039570280763004823, 'samples': 8964480, 'steps': 46689, 'loss/train': 1.7550060749053955} 11/07/2021 03:49:40 - INFO - __main__ - Step 46691: {'lr': 0.0003956984952965721, 'samples': 8964672, 'steps': 46690, 'loss/train': 1.3893109560012817} 11/07/2021 03:49:40 - INFO - __main__ - Step 46692: {'lr': 0.0003956941828974465, 'samples': 8964864, 'steps': 46691, 'loss/train': 1.536023497581482} 11/07/2021 03:49:40 - INFO - __main__ - Step 46693: {'lr': 0.0003956898704326733, 'samples': 8965056, 'steps': 46692, 'loss/train': 1.3925052881240845} 11/07/2021 03:49:41 - INFO - __main__ - Step 46694: {'lr': 0.00039568555790225456, 'samples': 8965248, 'steps': 46693, 'loss/train': 1.4903935194015503} 11/07/2021 03:49:41 - INFO - __main__ - Step 46695: {'lr': 0.00039568124530619213, 'samples': 8965440, 'steps': 46694, 'loss/train': 1.5666130781173706} 11/07/2021 03:49:41 - INFO - __main__ - Step 46696: {'lr': 0.00039567693264448803, 'samples': 8965632, 'steps': 46695, 'loss/train': 0.9935513138771057} 11/07/2021 03:49:42 - INFO - __main__ - Step 46697: {'lr': 0.00039567261991714406, 'samples': 8965824, 'steps': 46696, 'loss/train': 1.3019336462020874} 11/07/2021 03:49:43 - INFO - __main__ - Step 46698: {'lr': 0.00039566830712416226, 'samples': 8966016, 'steps': 46697, 'loss/train': 1.6125619411468506} 11/07/2021 03:49:43 - INFO - __main__ - Step 46699: {'lr': 0.0003956639942655446, 'samples': 8966208, 'steps': 46698, 'loss/train': 1.5985229015350342} 11/07/2021 03:49:44 - INFO - __main__ - Step 46700: {'lr': 0.000395659681341293, 'samples': 8966400, 'steps': 46699, 'loss/train': 1.6518969535827637} 11/07/2021 03:49:44 - INFO - __main__ - Step 46701: {'lr': 0.00039565536835140934, 'samples': 8966592, 'steps': 46700, 'loss/train': 1.3528852462768555} 11/07/2021 03:49:45 - INFO - __main__ - Step 46702: {'lr': 0.00039565105529589575, 'samples': 8966784, 'steps': 46701, 'loss/train': 1.752949833869934} 11/07/2021 03:49:45 - INFO - __main__ - Step 46703: {'lr': 0.00039564674217475393, 'samples': 8966976, 'steps': 46702, 'loss/train': 2.194688558578491} 11/07/2021 03:49:46 - INFO - __main__ - Step 46704: {'lr': 0.00039564242898798595, 'samples': 8967168, 'steps': 46703, 'loss/train': 1.578883409500122} 11/07/2021 03:49:46 - INFO - __main__ - Step 46705: {'lr': 0.00039563811573559377, 'samples': 8967360, 'steps': 46704, 'loss/train': 1.5374468564987183} 11/07/2021 03:49:46 - INFO - __main__ - Step 46706: {'lr': 0.00039563380241757927, 'samples': 8967552, 'steps': 46705, 'loss/train': 1.447409749031067} 11/07/2021 03:49:47 - INFO - __main__ - Step 46707: {'lr': 0.00039562948903394446, 'samples': 8967744, 'steps': 46706, 'loss/train': 1.3885626792907715} 11/07/2021 03:49:48 - INFO - __main__ - Step 46708: {'lr': 0.00039562517558469124, 'samples': 8967936, 'steps': 46707, 'loss/train': 1.3380175828933716} 11/07/2021 03:49:48 - INFO - __main__ - Step 46709: {'lr': 0.00039562086206982157, 'samples': 8968128, 'steps': 46708, 'loss/train': 1.9125405550003052} 11/07/2021 03:49:48 - INFO - __main__ - Step 46710: {'lr': 0.0003956165484893374, 'samples': 8968320, 'steps': 46709, 'loss/train': 1.4979057312011719} 11/07/2021 03:49:49 - INFO - __main__ - Step 46711: {'lr': 0.0003956122348432406, 'samples': 8968512, 'steps': 46710, 'loss/train': 1.2842931747436523} 11/07/2021 03:49:50 - INFO - __main__ - Step 46712: {'lr': 0.0003956079211315332, 'samples': 8968704, 'steps': 46711, 'loss/train': 1.184146523475647} 11/07/2021 03:49:50 - INFO - __main__ - Step 46713: {'lr': 0.00039560360735421706, 'samples': 8968896, 'steps': 46712, 'loss/train': 1.3897055387496948} 11/07/2021 03:49:51 - INFO - __main__ - Step 46714: {'lr': 0.0003955992935112943, 'samples': 8969088, 'steps': 46713, 'loss/train': 1.3282452821731567} 11/07/2021 03:49:51 - INFO - __main__ - Step 46715: {'lr': 0.00039559497960276667, 'samples': 8969280, 'steps': 46714, 'loss/train': 0.5708993673324585} 11/07/2021 03:49:51 - INFO - __main__ - Step 46716: {'lr': 0.0003955906656286362, 'samples': 8969472, 'steps': 46715, 'loss/train': 1.396579623222351} 11/07/2021 03:49:53 - INFO - __main__ - Step 46717: {'lr': 0.00039558635158890487, 'samples': 8969664, 'steps': 46716, 'loss/train': 1.4984301328659058} 11/07/2021 03:49:53 - INFO - __main__ - Step 46718: {'lr': 0.0003955820374835745, 'samples': 8969856, 'steps': 46717, 'loss/train': 1.558255672454834} 11/07/2021 03:49:53 - INFO - __main__ - Step 46719: {'lr': 0.0003955777233126472, 'samples': 8970048, 'steps': 46718, 'loss/train': 1.1341357231140137} 11/07/2021 03:49:54 - INFO - __main__ - Step 46720: {'lr': 0.00039557340907612473, 'samples': 8970240, 'steps': 46719, 'loss/train': 1.982488989830017} 11/07/2021 03:49:54 - INFO - __main__ - Step 46721: {'lr': 0.00039556909477400914, 'samples': 8970432, 'steps': 46720, 'loss/train': 1.388935923576355} 11/07/2021 03:49:55 - INFO - __main__ - Step 46722: {'lr': 0.00039556478040630246, 'samples': 8970624, 'steps': 46721, 'loss/train': 1.6528886556625366} 11/07/2021 03:49:55 - INFO - __main__ - Step 46723: {'lr': 0.0003955604659730064, 'samples': 8970816, 'steps': 46722, 'loss/train': 1.4015668630599976} 11/07/2021 03:49:56 - INFO - __main__ - Step 46724: {'lr': 0.00039555615147412315, 'samples': 8971008, 'steps': 46723, 'loss/train': 1.8984806537628174} 11/07/2021 03:49:56 - INFO - __main__ - Step 46725: {'lr': 0.00039555183690965454, 'samples': 8971200, 'steps': 46724, 'loss/train': 1.2106280326843262} 11/07/2021 03:49:56 - INFO - __main__ - Step 46726: {'lr': 0.00039554752227960243, 'samples': 8971392, 'steps': 46725, 'loss/train': 1.5564759969711304} 11/07/2021 03:49:57 - INFO - __main__ - Step 46727: {'lr': 0.0003955432075839689, 'samples': 8971584, 'steps': 46726, 'loss/train': 1.1629358530044556} 11/07/2021 03:49:58 - INFO - __main__ - Step 46728: {'lr': 0.00039553889282275585, 'samples': 8971776, 'steps': 46727, 'loss/train': 0.1853945404291153} 11/07/2021 03:49:58 - INFO - __main__ - Step 46729: {'lr': 0.0003955345779959653, 'samples': 8971968, 'steps': 46728, 'loss/train': 0.9729139804840088} 11/07/2021 03:49:59 - INFO - __main__ - Step 46730: {'lr': 0.00039553026310359897, 'samples': 8972160, 'steps': 46729, 'loss/train': 2.072021961212158} 11/07/2021 03:49:59 - INFO - __main__ - Step 46731: {'lr': 0.000395525948145659, 'samples': 8972352, 'steps': 46730, 'loss/train': 1.3280072212219238} 11/07/2021 03:50:00 - INFO - __main__ - Step 46732: {'lr': 0.0003955216331221473, 'samples': 8972544, 'steps': 46731, 'loss/train': 1.6797558069229126} 11/07/2021 03:50:00 - INFO - __main__ - Step 46733: {'lr': 0.00039551731803306577, 'samples': 8972736, 'steps': 46732, 'loss/train': 1.758447527885437} 11/07/2021 03:50:01 - INFO - __main__ - Step 46734: {'lr': 0.0003955130028784165, 'samples': 8972928, 'steps': 46733, 'loss/train': 2.373305559158325} 11/07/2021 03:50:01 - INFO - __main__ - Step 46735: {'lr': 0.0003955086876582012, 'samples': 8973120, 'steps': 46734, 'loss/train': 1.5346252918243408} 11/07/2021 03:50:01 - INFO - __main__ - Step 46736: {'lr': 0.000395504372372422, 'samples': 8973312, 'steps': 46735, 'loss/train': 1.6995505094528198} 11/07/2021 03:50:02 - INFO - __main__ - Step 46737: {'lr': 0.0003955000570210807, 'samples': 8973504, 'steps': 46736, 'loss/train': 1.3359993696212769} 11/07/2021 03:50:03 - INFO - __main__ - Step 46738: {'lr': 0.0003954957416041793, 'samples': 8973696, 'steps': 46737, 'loss/train': 1.7690136432647705} 11/07/2021 03:50:03 - INFO - __main__ - Step 46739: {'lr': 0.0003954914261217198, 'samples': 8973888, 'steps': 46738, 'loss/train': 1.6240804195404053} 11/07/2021 03:50:03 - INFO - __main__ - Step 46740: {'lr': 0.0003954871105737042, 'samples': 8974080, 'steps': 46739, 'loss/train': 1.0496690273284912} 11/07/2021 03:50:04 - INFO - __main__ - Step 46741: {'lr': 0.00039548279496013424, 'samples': 8974272, 'steps': 46740, 'loss/train': 0.8729541301727295} 11/07/2021 03:50:04 - INFO - __main__ - Step 46742: {'lr': 0.000395478479281012, 'samples': 8974464, 'steps': 46741, 'loss/train': 1.4796406030654907} 11/07/2021 03:50:05 - INFO - __main__ - Step 46743: {'lr': 0.00039547416353633946, 'samples': 8974656, 'steps': 46742, 'loss/train': 1.3999308347702026} 11/07/2021 03:50:06 - INFO - __main__ - Step 46744: {'lr': 0.00039546984772611843, 'samples': 8974848, 'steps': 46743, 'loss/train': 1.0811153650283813} 11/07/2021 03:50:06 - INFO - __main__ - Step 46745: {'lr': 0.00039546553185035093, 'samples': 8975040, 'steps': 46744, 'loss/train': 1.0817866325378418} 11/07/2021 03:50:06 - INFO - __main__ - Step 46746: {'lr': 0.00039546121590903897, 'samples': 8975232, 'steps': 46745, 'loss/train': 1.5537179708480835} 11/07/2021 03:50:07 - INFO - __main__ - Step 46747: {'lr': 0.0003954568999021844, 'samples': 8975424, 'steps': 46746, 'loss/train': 1.363145351409912} 11/07/2021 03:50:08 - INFO - __main__ - Step 46748: {'lr': 0.0003954525838297892, 'samples': 8975616, 'steps': 46747, 'loss/train': 1.4305704832077026} 11/07/2021 03:50:08 - INFO - __main__ - Step 46749: {'lr': 0.0003954482676918553, 'samples': 8975808, 'steps': 46748, 'loss/train': 1.5961601734161377} 11/07/2021 03:50:08 - INFO - __main__ - Step 46750: {'lr': 0.00039544395148838465, 'samples': 8976000, 'steps': 46749, 'loss/train': 1.72967529296875} 11/07/2021 03:50:09 - INFO - __main__ - Step 46751: {'lr': 0.0003954396352193792, 'samples': 8976192, 'steps': 46750, 'loss/train': 2.361015796661377} 11/07/2021 03:50:09 - INFO - __main__ - Step 46752: {'lr': 0.000395435318884841, 'samples': 8976384, 'steps': 46751, 'loss/train': 1.7659987211227417} 11/07/2021 03:50:10 - INFO - __main__ - Step 46753: {'lr': 0.0003954310024847717, 'samples': 8976576, 'steps': 46752, 'loss/train': 1.489812970161438} 11/07/2021 03:50:10 - INFO - __main__ - Step 46754: {'lr': 0.00039542668601917353, 'samples': 8976768, 'steps': 46753, 'loss/train': 1.301104187965393} 11/07/2021 03:50:11 - INFO - __main__ - Step 46755: {'lr': 0.0003954223694880483, 'samples': 8976960, 'steps': 46754, 'loss/train': 1.5028026103973389} 11/07/2021 03:50:11 - INFO - __main__ - Step 46756: {'lr': 0.0003954180528913981, 'samples': 8977152, 'steps': 46755, 'loss/train': 1.4305599927902222} 11/07/2021 03:50:11 - INFO - __main__ - Step 46757: {'lr': 0.0003954137362292247, 'samples': 8977344, 'steps': 46756, 'loss/train': 1.4253391027450562} 11/07/2021 03:50:12 - INFO - __main__ - Step 46758: {'lr': 0.0003954094195015301, 'samples': 8977536, 'steps': 46757, 'loss/train': 1.3867909908294678} 11/07/2021 03:50:13 - INFO - __main__ - Step 46759: {'lr': 0.0003954051027083163, 'samples': 8977728, 'steps': 46758, 'loss/train': 1.9530441761016846} 11/07/2021 03:50:13 - INFO - __main__ - Step 46760: {'lr': 0.0003954007858495852, 'samples': 8977920, 'steps': 46759, 'loss/train': 1.2998592853546143} 11/07/2021 03:50:13 - INFO - __main__ - Step 46761: {'lr': 0.00039539646892533867, 'samples': 8978112, 'steps': 46760, 'loss/train': 1.5191389322280884} 11/07/2021 03:50:14 - INFO - __main__ - Step 46762: {'lr': 0.00039539215193557886, 'samples': 8978304, 'steps': 46761, 'loss/train': 1.5478785037994385} 11/07/2021 03:50:14 - INFO - __main__ - Step 46763: {'lr': 0.0003953878348803075, 'samples': 8978496, 'steps': 46762, 'loss/train': 1.9524608850479126} 11/07/2021 03:50:15 - INFO - __main__ - Step 46764: {'lr': 0.0003953835177595266, 'samples': 8978688, 'steps': 46763, 'loss/train': 1.666530966758728} 11/07/2021 03:50:16 - INFO - __main__ - Step 46765: {'lr': 0.0003953792005732382, 'samples': 8978880, 'steps': 46764, 'loss/train': 1.1611963510513306} 11/07/2021 03:50:16 - INFO - __main__ - Step 46766: {'lr': 0.0003953748833214442, 'samples': 8979072, 'steps': 46765, 'loss/train': 1.4376670122146606} 11/07/2021 03:50:16 - INFO - __main__ - Step 46767: {'lr': 0.00039537056600414647, 'samples': 8979264, 'steps': 46766, 'loss/train': 1.1385133266448975} 11/07/2021 03:50:17 - INFO - __main__ - Step 46768: {'lr': 0.00039536624862134695, 'samples': 8979456, 'steps': 46767, 'loss/train': 1.679625391960144} 11/07/2021 03:50:18 - INFO - __main__ - Step 46769: {'lr': 0.00039536193117304774, 'samples': 8979648, 'steps': 46768, 'loss/train': 1.402443766593933} 11/07/2021 03:50:18 - INFO - __main__ - Step 46770: {'lr': 0.0003953576136592507, 'samples': 8979840, 'steps': 46769, 'loss/train': 1.2000861167907715} 11/07/2021 03:50:18 - INFO - __main__ - Step 46771: {'lr': 0.0003953532960799577, 'samples': 8980032, 'steps': 46770, 'loss/train': 1.1951671838760376} 11/07/2021 03:50:19 - INFO - __main__ - Step 46772: {'lr': 0.0003953489784351707, 'samples': 8980224, 'steps': 46771, 'loss/train': 1.7271140813827515} 11/07/2021 03:50:19 - INFO - __main__ - Step 46773: {'lr': 0.0003953446607248918, 'samples': 8980416, 'steps': 46772, 'loss/train': 1.2765376567840576} 11/07/2021 03:50:20 - INFO - __main__ - Step 46774: {'lr': 0.00039534034294912276, 'samples': 8980608, 'steps': 46773, 'loss/train': 1.1294169425964355} 11/07/2021 03:50:20 - INFO - __main__ - Step 46775: {'lr': 0.0003953360251078656, 'samples': 8980800, 'steps': 46774, 'loss/train': 1.6824162006378174} 11/07/2021 03:50:21 - INFO - __main__ - Step 46776: {'lr': 0.0003953317072011224, 'samples': 8980992, 'steps': 46775, 'loss/train': 1.4009029865264893} 11/07/2021 03:50:21 - INFO - __main__ - Step 46777: {'lr': 0.0003953273892288949, 'samples': 8981184, 'steps': 46776, 'loss/train': 0.33751192688941956} 11/07/2021 03:50:21 - INFO - __main__ - Step 46778: {'lr': 0.00039532307119118505, 'samples': 8981376, 'steps': 46777, 'loss/train': 1.763453722000122} 11/07/2021 03:50:23 - INFO - __main__ - Step 46779: {'lr': 0.00039531875308799493, 'samples': 8981568, 'steps': 46778, 'loss/train': 1.458613634109497} 11/07/2021 03:50:23 - INFO - __main__ - Step 46780: {'lr': 0.0003953144349193264, 'samples': 8981760, 'steps': 46779, 'loss/train': 1.495517373085022} 11/07/2021 03:50:23 - INFO - __main__ - Step 46781: {'lr': 0.0003953101166851814, 'samples': 8981952, 'steps': 46780, 'loss/train': 1.5929330587387085} 11/07/2021 03:50:24 - INFO - __main__ - Step 46782: {'lr': 0.0003953057983855619, 'samples': 8982144, 'steps': 46781, 'loss/train': 1.6176116466522217} 11/07/2021 03:50:24 - INFO - __main__ - Step 46783: {'lr': 0.00039530148002046996, 'samples': 8982336, 'steps': 46782, 'loss/train': 1.8476134538650513} 11/07/2021 03:50:25 - INFO - __main__ - Step 46784: {'lr': 0.0003952971615899074, 'samples': 8982528, 'steps': 46783, 'loss/train': 1.4362748861312866} 11/07/2021 03:50:25 - INFO - __main__ - Step 46785: {'lr': 0.00039529284309387607, 'samples': 8982720, 'steps': 46784, 'loss/train': 1.153825283050537} 11/07/2021 03:50:26 - INFO - __main__ - Step 46786: {'lr': 0.0003952885245323781, 'samples': 8982912, 'steps': 46785, 'loss/train': 1.3790760040283203} 11/07/2021 03:50:26 - INFO - __main__ - Step 46787: {'lr': 0.00039528420590541536, 'samples': 8983104, 'steps': 46786, 'loss/train': 1.3740510940551758} 11/07/2021 03:50:26 - INFO - __main__ - Step 46788: {'lr': 0.0003952798872129897, 'samples': 8983296, 'steps': 46787, 'loss/train': 0.6646178960800171} 11/07/2021 03:50:28 - INFO - __main__ - Step 46789: {'lr': 0.00039527556845510336, 'samples': 8983488, 'steps': 46788, 'loss/train': 1.5769122838974} 11/07/2021 03:50:28 - INFO - __main__ - Step 46790: {'lr': 0.00039527124963175796, 'samples': 8983680, 'steps': 46789, 'loss/train': 1.3731993436813354} 11/07/2021 03:50:28 - INFO - __main__ - Step 46791: {'lr': 0.0003952669307429556, 'samples': 8983872, 'steps': 46790, 'loss/train': 2.1312155723571777} 11/07/2021 03:50:29 - INFO - __main__ - Step 46792: {'lr': 0.00039526261178869816, 'samples': 8984064, 'steps': 46791, 'loss/train': 1.5580544471740723} 11/07/2021 03:50:29 - INFO - __main__ - Step 46793: {'lr': 0.0003952582927689877, 'samples': 8984256, 'steps': 46792, 'loss/train': 1.2427005767822266} 11/07/2021 03:50:29 - INFO - __main__ - Step 46794: {'lr': 0.00039525397368382604, 'samples': 8984448, 'steps': 46793, 'loss/train': 1.6376162767410278} 11/07/2021 03:50:30 - INFO - __main__ - Step 46795: {'lr': 0.0003952496545332152, 'samples': 8984640, 'steps': 46794, 'loss/train': 1.7411525249481201} 11/07/2021 03:50:31 - INFO - __main__ - Step 46796: {'lr': 0.00039524533531715714, 'samples': 8984832, 'steps': 46795, 'loss/train': 1.1954646110534668} 11/07/2021 03:50:31 - INFO - __main__ - Step 46797: {'lr': 0.00039524101603565377, 'samples': 8985024, 'steps': 46796, 'loss/train': 1.771838903427124} 11/07/2021 03:50:31 - INFO - __main__ - Step 46798: {'lr': 0.000395236696688707, 'samples': 8985216, 'steps': 46797, 'loss/train': 1.5596638917922974} 11/07/2021 03:50:32 - INFO - __main__ - Step 46799: {'lr': 0.0003952323772763188, 'samples': 8985408, 'steps': 46798, 'loss/train': 1.5280567407608032} 11/07/2021 03:50:33 - INFO - __main__ - Step 46800: {'lr': 0.00039522805779849116, 'samples': 8985600, 'steps': 46799, 'loss/train': 1.419503092765808} 11/07/2021 03:50:33 - INFO - __main__ - Step 46801: {'lr': 0.000395223738255226, 'samples': 8985792, 'steps': 46800, 'loss/train': 1.2403419017791748} 11/07/2021 03:50:33 - INFO - __main__ - Step 46802: {'lr': 0.00039521941864652525, 'samples': 8985984, 'steps': 46801, 'loss/train': 1.4845149517059326} 11/07/2021 03:50:34 - INFO - __main__ - Step 46803: {'lr': 0.0003952150989723909, 'samples': 8986176, 'steps': 46802, 'loss/train': 1.7280097007751465} 11/07/2021 03:50:34 - INFO - __main__ - Step 46804: {'lr': 0.00039521077923282486, 'samples': 8986368, 'steps': 46803, 'loss/train': 1.6100494861602783} 11/07/2021 03:50:35 - INFO - __main__ - Step 46805: {'lr': 0.00039520645942782906, 'samples': 8986560, 'steps': 46804, 'loss/train': 1.4668149948120117} 11/07/2021 03:50:36 - INFO - __main__ - Step 46806: {'lr': 0.00039520213955740555, 'samples': 8986752, 'steps': 46805, 'loss/train': 1.8643771409988403} 11/07/2021 03:50:36 - INFO - __main__ - Step 46807: {'lr': 0.0003951978196215561, 'samples': 8986944, 'steps': 46806, 'loss/train': 2.0712478160858154} 11/07/2021 03:50:36 - INFO - __main__ - Step 46808: {'lr': 0.00039519349962028276, 'samples': 8987136, 'steps': 46807, 'loss/train': 0.9385669231414795} 11/07/2021 03:50:37 - INFO - __main__ - Step 46809: {'lr': 0.0003951891795535875, 'samples': 8987328, 'steps': 46808, 'loss/train': 1.5247825384140015} 11/07/2021 03:50:38 - INFO - __main__ - Step 46810: {'lr': 0.00039518485942147233, 'samples': 8987520, 'steps': 46809, 'loss/train': 1.3851916790008545} 11/07/2021 03:50:38 - INFO - __main__ - Step 46811: {'lr': 0.0003951805392239389, 'samples': 8987712, 'steps': 46810, 'loss/train': 0.7063491940498352} 11/07/2021 03:50:39 - INFO - __main__ - Step 46812: {'lr': 0.00039517621896098954, 'samples': 8987904, 'steps': 46811, 'loss/train': 1.5973904132843018} 11/07/2021 03:50:39 - INFO - __main__ - Step 46813: {'lr': 0.00039517189863262593, 'samples': 8988096, 'steps': 46812, 'loss/train': 1.5485116243362427} 11/07/2021 03:50:39 - INFO - __main__ - Step 46814: {'lr': 0.00039516757823885006, 'samples': 8988288, 'steps': 46813, 'loss/train': 1.8475719690322876} 11/07/2021 03:50:40 - INFO - __main__ - Step 46815: {'lr': 0.000395163257779664, 'samples': 8988480, 'steps': 46814, 'loss/train': 1.8050289154052734} 11/07/2021 03:50:41 - INFO - __main__ - Step 46816: {'lr': 0.00039515893725506956, 'samples': 8988672, 'steps': 46815, 'loss/train': 1.5417001247406006} 11/07/2021 03:50:42 - INFO - __main__ - Step 46817: {'lr': 0.0003951546166650688, 'samples': 8988864, 'steps': 46816, 'loss/train': 4.985804557800293} 11/07/2021 03:50:42 - INFO - __main__ - Step 46818: {'lr': 0.0003951502960096636, 'samples': 8989056, 'steps': 46817, 'loss/train': 5.222023010253906} 11/07/2021 03:50:42 - INFO - __main__ - Step 46819: {'lr': 0.00039514597528885587, 'samples': 8989248, 'steps': 46818, 'loss/train': 1.0148247480392456} 11/07/2021 03:50:43 - INFO - __main__ - Step 46820: {'lr': 0.0003951416545026476, 'samples': 8989440, 'steps': 46819, 'loss/train': 1.453107237815857} 11/07/2021 03:50:43 - INFO - __main__ - Step 46821: {'lr': 0.0003951373336510408, 'samples': 8989632, 'steps': 46820, 'loss/train': 1.2581894397735596} 11/07/2021 03:50:44 - INFO - __main__ - Step 46822: {'lr': 0.00039513301273403733, 'samples': 8989824, 'steps': 46821, 'loss/train': 1.9668676853179932} 11/07/2021 03:50:44 - INFO - __main__ - Step 46823: {'lr': 0.0003951286917516392, 'samples': 8990016, 'steps': 46822, 'loss/train': 1.7566052675247192} 11/07/2021 03:50:45 - INFO - __main__ - Step 46824: {'lr': 0.00039512437070384827, 'samples': 8990208, 'steps': 46823, 'loss/train': 1.5792790651321411} 11/07/2021 03:50:45 - INFO - __main__ - Step 46825: {'lr': 0.00039512004959066653, 'samples': 8990400, 'steps': 46824, 'loss/train': 1.590653419494629} 11/07/2021 03:50:45 - INFO - __main__ - Step 46826: {'lr': 0.00039511572841209597, 'samples': 8990592, 'steps': 46825, 'loss/train': 1.6206929683685303} 11/07/2021 03:50:46 - INFO - __main__ - Step 46827: {'lr': 0.00039511140716813847, 'samples': 8990784, 'steps': 46826, 'loss/train': 1.5135914087295532} 11/07/2021 03:50:47 - INFO - __main__ - Step 46828: {'lr': 0.00039510708585879605, 'samples': 8990976, 'steps': 46827, 'loss/train': 1.8345533609390259} 11/07/2021 03:50:47 - INFO - __main__ - Step 46829: {'lr': 0.00039510276448407054, 'samples': 8991168, 'steps': 46828, 'loss/train': 1.2339359521865845} 11/07/2021 03:50:47 - INFO - __main__ - Step 46830: {'lr': 0.00039509844304396407, 'samples': 8991360, 'steps': 46829, 'loss/train': 0.958311140537262} 11/07/2021 03:50:48 - INFO - __main__ - Step 46831: {'lr': 0.00039509412153847847, 'samples': 8991552, 'steps': 46830, 'loss/train': 1.6364647150039673} 11/07/2021 03:50:48 - INFO - __main__ - Step 46832: {'lr': 0.00039508979996761564, 'samples': 8991744, 'steps': 46831, 'loss/train': 1.422149896621704} 11/07/2021 03:50:49 - INFO - __main__ - Step 46833: {'lr': 0.00039508547833137753, 'samples': 8991936, 'steps': 46832, 'loss/train': 1.893943190574646} 11/07/2021 03:50:50 - INFO - __main__ - Step 46834: {'lr': 0.0003950811566297662, 'samples': 8992128, 'steps': 46833, 'loss/train': 1.193856120109558} 11/07/2021 03:50:50 - INFO - __main__ - Step 46835: {'lr': 0.00039507683486278357, 'samples': 8992320, 'steps': 46834, 'loss/train': 2.5559275150299072} 11/07/2021 03:50:50 - INFO - __main__ - Step 46836: {'lr': 0.00039507251303043156, 'samples': 8992512, 'steps': 46835, 'loss/train': 1.5294876098632812} 11/07/2021 03:50:51 - INFO - __main__ - Step 46837: {'lr': 0.0003950681911327121, 'samples': 8992704, 'steps': 46836, 'loss/train': 1.7873066663742065} 11/07/2021 03:50:52 - INFO - __main__ - Step 46838: {'lr': 0.00039506386916962714, 'samples': 8992896, 'steps': 46837, 'loss/train': 1.2543444633483887} 11/07/2021 03:50:52 - INFO - __main__ - Step 46839: {'lr': 0.0003950595471411786, 'samples': 8993088, 'steps': 46838, 'loss/train': 1.2466020584106445} 11/07/2021 03:50:52 - INFO - __main__ - Step 46840: {'lr': 0.00039505522504736855, 'samples': 8993280, 'steps': 46839, 'loss/train': 1.3969309329986572} 11/07/2021 03:50:53 - INFO - __main__ - Step 46841: {'lr': 0.00039505090288819876, 'samples': 8993472, 'steps': 46840, 'loss/train': 2.016864776611328} 11/07/2021 03:50:53 - INFO - __main__ - Step 46842: {'lr': 0.00039504658066367136, 'samples': 8993664, 'steps': 46841, 'loss/train': 1.4573959112167358} 11/07/2021 03:50:54 - INFO - __main__ - Step 46843: {'lr': 0.0003950422583737882, 'samples': 8993856, 'steps': 46842, 'loss/train': 1.6924041509628296} 11/07/2021 03:50:54 - INFO - __main__ - Step 46844: {'lr': 0.0003950379360185512, 'samples': 8994048, 'steps': 46843, 'loss/train': 1.1295289993286133} 11/07/2021 03:50:55 - INFO - __main__ - Step 46845: {'lr': 0.00039503361359796235, 'samples': 8994240, 'steps': 46844, 'loss/train': 0.9984075427055359} 11/07/2021 03:50:55 - INFO - __main__ - Step 46846: {'lr': 0.00039502929111202357, 'samples': 8994432, 'steps': 46845, 'loss/train': 1.9570423364639282} 11/07/2021 03:50:55 - INFO - __main__ - Step 46847: {'lr': 0.0003950249685607369, 'samples': 8994624, 'steps': 46846, 'loss/train': 1.361639142036438} 11/07/2021 03:50:56 - INFO - __main__ - Step 46848: {'lr': 0.00039502064594410414, 'samples': 8994816, 'steps': 46847, 'loss/train': 1.080751895904541} 11/07/2021 03:50:57 - INFO - __main__ - Step 46849: {'lr': 0.00039501632326212734, 'samples': 8995008, 'steps': 46848, 'loss/train': 1.4507310390472412} 11/07/2021 03:50:57 - INFO - __main__ - Step 46850: {'lr': 0.00039501200051480844, 'samples': 8995200, 'steps': 46849, 'loss/train': 1.6919381618499756} 11/07/2021 03:50:57 - INFO - __main__ - Step 46851: {'lr': 0.0003950076777021494, 'samples': 8995392, 'steps': 46850, 'loss/train': 1.6522619724273682} 11/07/2021 03:50:58 - INFO - __main__ - Step 46852: {'lr': 0.00039500335482415205, 'samples': 8995584, 'steps': 46851, 'loss/train': 1.5006636381149292} 11/07/2021 03:50:58 - INFO - __main__ - Step 46853: {'lr': 0.00039499903188081856, 'samples': 8995776, 'steps': 46852, 'loss/train': 1.6721848249435425} 11/07/2021 03:50:59 - INFO - __main__ - Step 46854: {'lr': 0.0003949947088721506, 'samples': 8995968, 'steps': 46853, 'loss/train': 1.4820759296417236} 11/07/2021 03:51:00 - INFO - __main__ - Step 46855: {'lr': 0.0003949903857981503, 'samples': 8996160, 'steps': 46854, 'loss/train': 1.4208883047103882} 11/07/2021 03:51:00 - INFO - __main__ - Step 46856: {'lr': 0.0003949860626588196, 'samples': 8996352, 'steps': 46855, 'loss/train': 1.5806963443756104} 11/07/2021 03:51:00 - INFO - __main__ - Step 46857: {'lr': 0.0003949817394541604, 'samples': 8996544, 'steps': 46856, 'loss/train': 1.4058988094329834} 11/07/2021 03:51:01 - INFO - __main__ - Step 46858: {'lr': 0.0003949774161841747, 'samples': 8996736, 'steps': 46857, 'loss/train': 2.2320330142974854} 11/07/2021 03:51:02 - INFO - __main__ - Step 46859: {'lr': 0.0003949730928488644, 'samples': 8996928, 'steps': 46858, 'loss/train': 1.3717622756958008} 11/07/2021 03:51:02 - INFO - __main__ - Step 46860: {'lr': 0.0003949687694482314, 'samples': 8997120, 'steps': 46859, 'loss/train': 1.7184524536132812} 11/07/2021 03:51:02 - INFO - __main__ - Step 46861: {'lr': 0.0003949644459822778, 'samples': 8997312, 'steps': 46860, 'loss/train': 1.2574100494384766} 11/07/2021 03:51:03 - INFO - __main__ - Step 46862: {'lr': 0.00039496012245100536, 'samples': 8997504, 'steps': 46861, 'loss/train': 1.7844125032424927} 11/07/2021 03:51:03 - INFO - __main__ - Step 46863: {'lr': 0.0003949557988544162, 'samples': 8997696, 'steps': 46862, 'loss/train': 1.4378299713134766} 11/07/2021 03:51:04 - INFO - __main__ - Step 46864: {'lr': 0.0003949514751925122, 'samples': 8997888, 'steps': 46863, 'loss/train': 1.038225769996643} 11/07/2021 03:51:04 - INFO - __main__ - Step 46865: {'lr': 0.00039494715146529526, 'samples': 8998080, 'steps': 46864, 'loss/train': 1.1390782594680786} 11/07/2021 03:51:05 - INFO - __main__ - Step 46866: {'lr': 0.00039494282767276736, 'samples': 8998272, 'steps': 46865, 'loss/train': 1.4852098226547241} 11/07/2021 03:51:05 - INFO - __main__ - Step 46867: {'lr': 0.0003949385038149305, 'samples': 8998464, 'steps': 46866, 'loss/train': 1.7658873796463013} 11/07/2021 03:51:06 - INFO - __main__ - Step 46868: {'lr': 0.0003949341798917866, 'samples': 8998656, 'steps': 46867, 'loss/train': 0.9245157837867737} 11/07/2021 03:51:07 - INFO - __main__ - Step 46869: {'lr': 0.00039492985590333754, 'samples': 8998848, 'steps': 46868, 'loss/train': 1.5905011892318726} 11/07/2021 03:51:08 - INFO - __main__ - Step 46870: {'lr': 0.00039492553184958533, 'samples': 8999040, 'steps': 46869, 'loss/train': 0.830938458442688} 11/07/2021 03:51:08 - INFO - __main__ - Step 46871: {'lr': 0.00039492120773053195, 'samples': 8999232, 'steps': 46870, 'loss/train': 1.3371855020523071} 11/07/2021 03:51:09 - INFO - __main__ - Step 46872: {'lr': 0.0003949168835461793, 'samples': 8999424, 'steps': 46871, 'loss/train': 1.7193821668624878} 11/07/2021 03:51:09 - INFO - __main__ - Step 46873: {'lr': 0.0003949125592965293, 'samples': 8999616, 'steps': 46872, 'loss/train': 1.5219018459320068} 11/07/2021 03:51:09 - INFO - __main__ - Step 46874: {'lr': 0.000394908234981584, 'samples': 8999808, 'steps': 46873, 'loss/train': 1.4934570789337158} 11/07/2021 03:51:10 - INFO - __main__ - Step 46875: {'lr': 0.00039490391060134525, 'samples': 9000000, 'steps': 46874, 'loss/train': 1.3278827667236328} 11/07/2021 03:51:11 - INFO - __main__ - Step 46876: {'lr': 0.000394899586155815, 'samples': 9000192, 'steps': 46875, 'loss/train': 1.5030622482299805} 11/07/2021 03:51:11 - INFO - __main__ - Step 46877: {'lr': 0.00039489526164499536, 'samples': 9000384, 'steps': 46876, 'loss/train': 1.18563973903656} 11/07/2021 03:51:11 - INFO - __main__ - Step 46878: {'lr': 0.000394890937068888, 'samples': 9000576, 'steps': 46877, 'loss/train': 1.779812216758728} 11/07/2021 03:51:12 - INFO - __main__ - Step 46879: {'lr': 0.00039488661242749506, 'samples': 9000768, 'steps': 46878, 'loss/train': 1.6924418210983276} 11/07/2021 03:51:12 - INFO - __main__ - Step 46880: {'lr': 0.00039488228772081846, 'samples': 9000960, 'steps': 46879, 'loss/train': 0.20111282169818878} 11/07/2021 03:51:13 - INFO - __main__ - Step 46881: {'lr': 0.00039487796294886016, 'samples': 9001152, 'steps': 46880, 'loss/train': 1.9703088998794556} 11/07/2021 03:51:14 - INFO - __main__ - Step 46882: {'lr': 0.0003948736381116221, 'samples': 9001344, 'steps': 46881, 'loss/train': 1.535144329071045} 11/07/2021 03:51:14 - INFO - __main__ - Step 46883: {'lr': 0.0003948693132091061, 'samples': 9001536, 'steps': 46882, 'loss/train': 1.4550069570541382} 11/07/2021 03:51:14 - INFO - __main__ - Step 46884: {'lr': 0.00039486498824131434, 'samples': 9001728, 'steps': 46883, 'loss/train': 1.181460976600647} 11/07/2021 03:51:15 - INFO - __main__ - Step 46885: {'lr': 0.00039486066320824865, 'samples': 9001920, 'steps': 46884, 'loss/train': 0.7617834806442261} 11/07/2021 03:51:16 - INFO - __main__ - Step 46886: {'lr': 0.00039485633810991096, 'samples': 9002112, 'steps': 46885, 'loss/train': 1.6075130701065063} 11/07/2021 03:51:16 - INFO - __main__ - Step 46887: {'lr': 0.0003948520129463032, 'samples': 9002304, 'steps': 46886, 'loss/train': 1.8204573392868042} 11/07/2021 03:51:16 - INFO - __main__ - Step 46888: {'lr': 0.0003948476877174274, 'samples': 9002496, 'steps': 46887, 'loss/train': 1.8663369417190552} 11/07/2021 03:51:17 - INFO - __main__ - Step 46889: {'lr': 0.0003948433624232854, 'samples': 9002688, 'steps': 46888, 'loss/train': 2.045262098312378} 11/07/2021 03:51:17 - INFO - __main__ - Step 46890: {'lr': 0.0003948390370638794, 'samples': 9002880, 'steps': 46889, 'loss/train': 1.6313608884811401} 11/07/2021 03:51:17 - INFO - __main__ - Step 46891: {'lr': 0.000394834711639211, 'samples': 9003072, 'steps': 46890, 'loss/train': 1.6466128826141357} 11/07/2021 03:51:19 - INFO - __main__ - Step 46892: {'lr': 0.00039483038614928235, 'samples': 9003264, 'steps': 46891, 'loss/train': 1.4804253578186035} 11/07/2021 03:51:19 - INFO - __main__ - Step 46893: {'lr': 0.0003948260605940953, 'samples': 9003456, 'steps': 46892, 'loss/train': 1.5738805532455444} 11/07/2021 03:51:19 - INFO - __main__ - Step 46894: {'lr': 0.00039482173497365193, 'samples': 9003648, 'steps': 46893, 'loss/train': 1.5786223411560059} 11/07/2021 03:51:20 - INFO - __main__ - Step 46895: {'lr': 0.0003948174092879541, 'samples': 9003840, 'steps': 46894, 'loss/train': 1.8035955429077148} 11/07/2021 03:51:20 - INFO - __main__ - Step 46896: {'lr': 0.0003948130835370038, 'samples': 9004032, 'steps': 46895, 'loss/train': 1.8617504835128784} 11/07/2021 03:51:21 - INFO - __main__ - Step 46897: {'lr': 0.000394808757720803, 'samples': 9004224, 'steps': 46896, 'loss/train': 0.9014905691146851} 11/07/2021 03:51:21 - INFO - __main__ - Step 46898: {'lr': 0.00039480443183935357, 'samples': 9004416, 'steps': 46897, 'loss/train': 1.4515459537506104} 11/07/2021 03:51:22 - INFO - __main__ - Step 46899: {'lr': 0.0003948001058926575, 'samples': 9004608, 'steps': 46898, 'loss/train': 2.697383403778076} 11/07/2021 03:51:22 - INFO - __main__ - Step 46900: {'lr': 0.0003947957798807167, 'samples': 9004800, 'steps': 46899, 'loss/train': 1.7489532232284546} 11/07/2021 03:51:22 - INFO - __main__ - Step 46901: {'lr': 0.00039479145380353313, 'samples': 9004992, 'steps': 46900, 'loss/train': 1.0459482669830322} 11/07/2021 03:51:24 - INFO - __main__ - Step 46902: {'lr': 0.0003947871276611088, 'samples': 9005184, 'steps': 46901, 'loss/train': 1.5989751815795898} 11/07/2021 03:51:24 - INFO - __main__ - Step 46903: {'lr': 0.0003947828014534457, 'samples': 9005376, 'steps': 46902, 'loss/train': 1.4633909463882446} 11/07/2021 03:51:24 - INFO - __main__ - Step 46904: {'lr': 0.00039477847518054566, 'samples': 9005568, 'steps': 46903, 'loss/train': 1.4922444820404053} 11/07/2021 03:51:25 - INFO - __main__ - Step 46905: {'lr': 0.00039477414884241064, 'samples': 9005760, 'steps': 46904, 'loss/train': 1.4760546684265137} 11/07/2021 03:51:25 - INFO - __main__ - Step 46906: {'lr': 0.0003947698224390426, 'samples': 9005952, 'steps': 46905, 'loss/train': 2.0090863704681396} 11/07/2021 03:51:25 - INFO - __main__ - Step 46907: {'lr': 0.0003947654959704435, 'samples': 9006144, 'steps': 46906, 'loss/train': 1.7975330352783203} 11/07/2021 03:51:26 - INFO - __main__ - Step 46908: {'lr': 0.00039476116943661544, 'samples': 9006336, 'steps': 46907, 'loss/train': 1.886009931564331} 11/07/2021 03:51:27 - INFO - __main__ - Step 46909: {'lr': 0.00039475684283756007, 'samples': 9006528, 'steps': 46908, 'loss/train': 0.6918396949768066} 11/07/2021 03:51:27 - INFO - __main__ - Step 46910: {'lr': 0.0003947525161732797, 'samples': 9006720, 'steps': 46909, 'loss/train': 1.0997514724731445} 11/07/2021 03:51:27 - INFO - __main__ - Step 46911: {'lr': 0.0003947481894437759, 'samples': 9006912, 'steps': 46910, 'loss/train': 1.4464973211288452} 11/07/2021 03:51:28 - INFO - __main__ - Step 46912: {'lr': 0.0003947438626490508, 'samples': 9007104, 'steps': 46911, 'loss/train': 1.5185309648513794} 11/07/2021 03:51:29 - INFO - __main__ - Step 46913: {'lr': 0.0003947395357891064, 'samples': 9007296, 'steps': 46912, 'loss/train': 0.4749510586261749} 11/07/2021 03:51:29 - INFO - __main__ - Step 46914: {'lr': 0.00039473520886394465, 'samples': 9007488, 'steps': 46913, 'loss/train': 1.6673336029052734} 11/07/2021 03:51:29 - INFO - __main__ - Step 46915: {'lr': 0.00039473088187356737, 'samples': 9007680, 'steps': 46914, 'loss/train': 1.1993536949157715} 11/07/2021 03:51:30 - INFO - __main__ - Step 46916: {'lr': 0.0003947265548179766, 'samples': 9007872, 'steps': 46915, 'loss/train': 1.507056474685669} 11/07/2021 03:51:30 - INFO - __main__ - Step 46917: {'lr': 0.00039472222769717434, 'samples': 9008064, 'steps': 46916, 'loss/train': 1.8981199264526367} 11/07/2021 03:51:31 - INFO - __main__ - Step 46918: {'lr': 0.00039471790051116243, 'samples': 9008256, 'steps': 46917, 'loss/train': 1.3224080801010132} 11/07/2021 03:51:32 - INFO - __main__ - Step 46919: {'lr': 0.0003947135732599428, 'samples': 9008448, 'steps': 46918, 'loss/train': 1.5564310550689697} 11/07/2021 03:51:32 - INFO - __main__ - Step 46920: {'lr': 0.0003947092459435176, 'samples': 9008640, 'steps': 46919, 'loss/train': 0.5554118752479553} 11/07/2021 03:51:32 - INFO - __main__ - Step 46921: {'lr': 0.0003947049185618886, 'samples': 9008832, 'steps': 46920, 'loss/train': 0.7767159342765808} 11/07/2021 03:51:33 - INFO - __main__ - Step 46922: {'lr': 0.0003947005911150577, 'samples': 9009024, 'steps': 46921, 'loss/train': 1.5369144678115845} 11/07/2021 03:51:34 - INFO - __main__ - Step 46923: {'lr': 0.0003946962636030271, 'samples': 9009216, 'steps': 46922, 'loss/train': 1.928104281425476} 11/07/2021 03:51:34 - INFO - __main__ - Step 46924: {'lr': 0.00039469193602579856, 'samples': 9009408, 'steps': 46923, 'loss/train': 1.6119428873062134} 11/07/2021 03:51:34 - INFO - __main__ - Step 46925: {'lr': 0.000394687608383374, 'samples': 9009600, 'steps': 46924, 'loss/train': 1.768648386001587} 11/07/2021 03:51:35 - INFO - __main__ - Step 46926: {'lr': 0.0003946832806757554, 'samples': 9009792, 'steps': 46925, 'loss/train': 1.144248127937317} 11/07/2021 03:51:35 - INFO - __main__ - Step 46927: {'lr': 0.00039467895290294484, 'samples': 9009984, 'steps': 46926, 'loss/train': 1.4360101222991943} 11/07/2021 03:51:36 - INFO - __main__ - Step 46928: {'lr': 0.00039467462506494416, 'samples': 9010176, 'steps': 46927, 'loss/train': 1.2297422885894775} 11/07/2021 03:51:37 - INFO - __main__ - Step 46929: {'lr': 0.0003946702971617553, 'samples': 9010368, 'steps': 46928, 'loss/train': 1.675126075744629} 11/07/2021 03:51:37 - INFO - __main__ - Step 46930: {'lr': 0.00039466596919338027, 'samples': 9010560, 'steps': 46929, 'loss/train': 1.154676079750061} 11/07/2021 03:51:37 - INFO - __main__ - Step 46931: {'lr': 0.000394661641159821, 'samples': 9010752, 'steps': 46930, 'loss/train': 1.3921693563461304} 11/07/2021 03:51:38 - INFO - __main__ - Step 46932: {'lr': 0.00039465731306107937, 'samples': 9010944, 'steps': 46931, 'loss/train': 1.4811137914657593} 11/07/2021 03:51:39 - INFO - __main__ - Step 46933: {'lr': 0.0003946529848971574, 'samples': 9011136, 'steps': 46932, 'loss/train': 0.9219220280647278} 11/07/2021 03:51:39 - INFO - __main__ - Step 46934: {'lr': 0.00039464865666805706, 'samples': 9011328, 'steps': 46933, 'loss/train': 1.5291695594787598} 11/07/2021 03:51:39 - INFO - __main__ - Step 46935: {'lr': 0.00039464432837378025, 'samples': 9011520, 'steps': 46934, 'loss/train': 1.7101067304611206} 11/07/2021 03:51:40 - INFO - __main__ - Step 46936: {'lr': 0.0003946400000143289, 'samples': 9011712, 'steps': 46935, 'loss/train': 1.0490752458572388} 11/07/2021 03:51:40 - INFO - __main__ - Step 46937: {'lr': 0.000394635671589705, 'samples': 9011904, 'steps': 46936, 'loss/train': 1.351618766784668} 11/07/2021 03:51:41 - INFO - __main__ - Step 46938: {'lr': 0.0003946313430999106, 'samples': 9012096, 'steps': 46937, 'loss/train': 1.6533046960830688} 11/07/2021 03:51:41 - INFO - __main__ - Step 46939: {'lr': 0.0003946270145449475, 'samples': 9012288, 'steps': 46938, 'loss/train': 1.3199681043624878} 11/07/2021 03:51:42 - INFO - __main__ - Step 46940: {'lr': 0.00039462268592481767, 'samples': 9012480, 'steps': 46939, 'loss/train': 0.9378066062927246} 11/07/2021 03:51:42 - INFO - __main__ - Step 46941: {'lr': 0.00039461835723952313, 'samples': 9012672, 'steps': 46940, 'loss/train': 1.3353723287582397} 11/07/2021 03:51:43 - INFO - __main__ - Step 46942: {'lr': 0.0003946140284890657, 'samples': 9012864, 'steps': 46941, 'loss/train': 1.2372130155563354} 11/07/2021 03:51:44 - INFO - __main__ - Step 46943: {'lr': 0.0003946096996734475, 'samples': 9013056, 'steps': 46942, 'loss/train': 1.3074778318405151} 11/07/2021 03:51:44 - INFO - __main__ - Step 46944: {'lr': 0.00039460537079267035, 'samples': 9013248, 'steps': 46943, 'loss/train': 1.737030267715454} 11/07/2021 03:51:44 - INFO - __main__ - Step 46945: {'lr': 0.00039460104184673627, 'samples': 9013440, 'steps': 46944, 'loss/train': 1.4304100275039673} 11/07/2021 03:51:45 - INFO - __main__ - Step 46946: {'lr': 0.00039459671283564727, 'samples': 9013632, 'steps': 46945, 'loss/train': 1.438733696937561} 11/07/2021 03:51:45 - INFO - __main__ - Step 46947: {'lr': 0.0003945923837594051, 'samples': 9013824, 'steps': 46946, 'loss/train': 1.9903538227081299} 11/07/2021 03:51:46 - INFO - __main__ - Step 46948: {'lr': 0.0003945880546180119, 'samples': 9014016, 'steps': 46947, 'loss/train': 1.6213985681533813} 11/07/2021 03:51:46 - INFO - __main__ - Step 46949: {'lr': 0.00039458372541146955, 'samples': 9014208, 'steps': 46948, 'loss/train': 1.5832176208496094} 11/07/2021 03:51:47 - INFO - __main__ - Step 46950: {'lr': 0.00039457939613978, 'samples': 9014400, 'steps': 46949, 'loss/train': 1.7206088304519653} 11/07/2021 03:51:47 - INFO - __main__ - Step 46951: {'lr': 0.0003945750668029452, 'samples': 9014592, 'steps': 46950, 'loss/train': 1.3618377447128296} 11/07/2021 03:51:47 - INFO - __main__ - Step 46952: {'lr': 0.0003945707374009671, 'samples': 9014784, 'steps': 46951, 'loss/train': 1.8693825006484985} 11/07/2021 03:51:48 - INFO - __main__ - Step 46953: {'lr': 0.0003945664079338477, 'samples': 9014976, 'steps': 46952, 'loss/train': 1.589695930480957} 11/07/2021 03:51:49 - INFO - __main__ - Step 46954: {'lr': 0.0003945620784015888, 'samples': 9015168, 'steps': 46953, 'loss/train': 1.9525930881500244} 11/07/2021 03:51:49 - INFO - __main__ - Step 46955: {'lr': 0.00039455774880419256, 'samples': 9015360, 'steps': 46954, 'loss/train': 1.226714849472046} 11/07/2021 03:51:49 - INFO - __main__ - Step 46956: {'lr': 0.00039455341914166074, 'samples': 9015552, 'steps': 46955, 'loss/train': 0.9247803688049316} 11/07/2021 03:51:50 - INFO - __main__ - Step 46957: {'lr': 0.0003945490894139955, 'samples': 9015744, 'steps': 46956, 'loss/train': 1.0902832746505737} 11/07/2021 03:51:50 - INFO - __main__ - Step 46958: {'lr': 0.0003945447596211986, 'samples': 9015936, 'steps': 46957, 'loss/train': 1.6723014116287231} 11/07/2021 03:51:51 - INFO - __main__ - Step 46959: {'lr': 0.0003945404297632721, 'samples': 9016128, 'steps': 46958, 'loss/train': 1.616858959197998} 11/07/2021 03:51:52 - INFO - __main__ - Step 46960: {'lr': 0.00039453609984021787, 'samples': 9016320, 'steps': 46959, 'loss/train': 1.7943397760391235} 11/07/2021 03:51:52 - INFO - __main__ - Step 46961: {'lr': 0.00039453176985203785, 'samples': 9016512, 'steps': 46960, 'loss/train': 1.3130934238433838} 11/07/2021 03:51:52 - INFO - __main__ - Step 46962: {'lr': 0.0003945274397987342, 'samples': 9016704, 'steps': 46961, 'loss/train': 1.7671287059783936} 11/07/2021 03:51:53 - INFO - __main__ - Step 46963: {'lr': 0.0003945231096803086, 'samples': 9016896, 'steps': 46962, 'loss/train': 1.2953639030456543} 11/07/2021 03:51:54 - INFO - __main__ - Step 46964: {'lr': 0.0003945187794967632, 'samples': 9017088, 'steps': 46963, 'loss/train': 1.5047909021377563} 11/07/2021 03:51:54 - INFO - __main__ - Step 46965: {'lr': 0.00039451444924809976, 'samples': 9017280, 'steps': 46964, 'loss/train': 1.5116136074066162} 11/07/2021 03:51:54 - INFO - __main__ - Step 46966: {'lr': 0.0003945101189343204, 'samples': 9017472, 'steps': 46965, 'loss/train': 1.4135128259658813} 11/07/2021 03:51:55 - INFO - __main__ - Step 46967: {'lr': 0.000394505788555427, 'samples': 9017664, 'steps': 46966, 'loss/train': 1.447141408920288} 11/07/2021 03:51:55 - INFO - __main__ - Step 46968: {'lr': 0.0003945014581114215, 'samples': 9017856, 'steps': 46967, 'loss/train': 1.9308451414108276} 11/07/2021 03:51:56 - INFO - __main__ - Step 46969: {'lr': 0.00039449712760230584, 'samples': 9018048, 'steps': 46968, 'loss/train': 1.1975317001342773} 11/07/2021 03:51:56 - INFO - __main__ - Step 46970: {'lr': 0.0003944927970280821, 'samples': 9018240, 'steps': 46969, 'loss/train': 1.501655101776123} 11/07/2021 03:51:57 - INFO - __main__ - Step 46971: {'lr': 0.00039448846638875213, 'samples': 9018432, 'steps': 46970, 'loss/train': 1.0903911590576172} 11/07/2021 03:51:57 - INFO - __main__ - Step 46972: {'lr': 0.00039448413568431785, 'samples': 9018624, 'steps': 46971, 'loss/train': 1.5875543355941772} 11/07/2021 03:51:57 - INFO - __main__ - Step 46973: {'lr': 0.0003944798049147812, 'samples': 9018816, 'steps': 46972, 'loss/train': 1.8218967914581299} 11/07/2021 03:51:58 - INFO - __main__ - Step 46974: {'lr': 0.00039447547408014426, 'samples': 9019008, 'steps': 46973, 'loss/train': 1.572045087814331} 11/07/2021 03:51:59 - INFO - __main__ - Step 46975: {'lr': 0.00039447114318040885, 'samples': 9019200, 'steps': 46974, 'loss/train': 1.3558261394500732} 11/07/2021 03:51:59 - INFO - __main__ - Step 46976: {'lr': 0.000394466812215577, 'samples': 9019392, 'steps': 46975, 'loss/train': 1.5541266202926636} 11/07/2021 03:51:59 - INFO - __main__ - Step 46977: {'lr': 0.0003944624811856506, 'samples': 9019584, 'steps': 46976, 'loss/train': 1.6195271015167236} 11/07/2021 03:52:00 - INFO - __main__ - Step 46978: {'lr': 0.0003944581500906317, 'samples': 9019776, 'steps': 46977, 'loss/train': 1.519793152809143} 11/07/2021 03:52:01 - INFO - __main__ - Step 46979: {'lr': 0.00039445381893052215, 'samples': 9019968, 'steps': 46978, 'loss/train': 1.248785376548767} 11/07/2021 03:52:01 - INFO - __main__ - Step 46980: {'lr': 0.0003944494877053239, 'samples': 9020160, 'steps': 46979, 'loss/train': 1.2922887802124023} 11/07/2021 03:52:01 - INFO - __main__ - Step 46981: {'lr': 0.00039444515641503896, 'samples': 9020352, 'steps': 46980, 'loss/train': 1.960410237312317} 11/07/2021 03:52:02 - INFO - __main__ - Step 46982: {'lr': 0.00039444082505966926, 'samples': 9020544, 'steps': 46981, 'loss/train': 1.441537857055664} 11/07/2021 03:52:02 - INFO - __main__ - Step 46983: {'lr': 0.0003944364936392168, 'samples': 9020736, 'steps': 46982, 'loss/train': 1.4293638467788696} 11/07/2021 03:52:02 - INFO - __main__ - Step 46984: {'lr': 0.0003944321621536835, 'samples': 9020928, 'steps': 46983, 'loss/train': 1.1249370574951172} 11/07/2021 03:52:04 - INFO - __main__ - Step 46985: {'lr': 0.00039442783060307117, 'samples': 9021120, 'steps': 46984, 'loss/train': 1.44890296459198} 11/07/2021 03:52:04 - INFO - __main__ - Step 46986: {'lr': 0.00039442349898738204, 'samples': 9021312, 'steps': 46985, 'loss/train': 1.7546148300170898} 11/07/2021 03:52:04 - INFO - __main__ - Step 46987: {'lr': 0.0003944191673066178, 'samples': 9021504, 'steps': 46986, 'loss/train': 1.1470195055007935} 11/07/2021 03:52:05 - INFO - __main__ - Step 46988: {'lr': 0.00039441483556078055, 'samples': 9021696, 'steps': 46987, 'loss/train': 1.1848145723342896} 11/07/2021 03:52:05 - INFO - __main__ - Step 46989: {'lr': 0.0003944105037498722, 'samples': 9021888, 'steps': 46988, 'loss/train': 1.8253114223480225} 11/07/2021 03:52:06 - INFO - __main__ - Step 46990: {'lr': 0.0003944061718738947, 'samples': 9022080, 'steps': 46989, 'loss/train': 1.395495891571045} 11/07/2021 03:52:06 - INFO - __main__ - Step 46991: {'lr': 0.00039440183993285006, 'samples': 9022272, 'steps': 46990, 'loss/train': 1.5553473234176636} 11/07/2021 03:52:07 - INFO - __main__ - Step 46992: {'lr': 0.0003943975079267401, 'samples': 9022464, 'steps': 46991, 'loss/train': 1.4578441381454468} 11/07/2021 03:52:07 - INFO - __main__ - Step 46993: {'lr': 0.0003943931758555669, 'samples': 9022656, 'steps': 46992, 'loss/train': 1.4696886539459229} 11/07/2021 03:52:07 - INFO - __main__ - Step 46994: {'lr': 0.0003943888437193324, 'samples': 9022848, 'steps': 46993, 'loss/train': 1.2073811292648315} 11/07/2021 03:52:08 - INFO - __main__ - Step 46995: {'lr': 0.00039438451151803844, 'samples': 9023040, 'steps': 46994, 'loss/train': 1.1929799318313599} 11/07/2021 03:52:09 - INFO - __main__ - Step 46996: {'lr': 0.000394380179251687, 'samples': 9023232, 'steps': 46995, 'loss/train': 1.5996979475021362} 11/07/2021 03:52:09 - INFO - __main__ - Step 46997: {'lr': 0.0003943758469202802, 'samples': 9023424, 'steps': 46996, 'loss/train': 1.0800632238388062} 11/07/2021 03:52:09 - INFO - __main__ - Step 46998: {'lr': 0.0003943715145238198, 'samples': 9023616, 'steps': 46997, 'loss/train': 1.2664647102355957} 11/07/2021 03:52:10 - INFO - __main__ - Step 46999: {'lr': 0.00039436718206230795, 'samples': 9023808, 'steps': 46998, 'loss/train': 1.4994295835494995} 11/07/2021 03:52:11 - INFO - __main__ - Step 47000: {'lr': 0.0003943628495357463, 'samples': 9024000, 'steps': 46999, 'loss/train': 1.680883765220642} 11/07/2021 03:52:11 - INFO - __main__ - Step 47001: {'lr': 0.00039435851694413705, 'samples': 9024192, 'steps': 47000, 'loss/train': 2.1149520874023438} 11/07/2021 03:52:12 - INFO - __main__ - Step 47002: {'lr': 0.00039435418428748206, 'samples': 9024384, 'steps': 47001, 'loss/train': 1.675614833831787} 11/07/2021 03:52:12 - INFO - __main__ - Step 47003: {'lr': 0.00039434985156578333, 'samples': 9024576, 'steps': 47002, 'loss/train': 1.2153077125549316} 11/07/2021 03:52:12 - INFO - __main__ - Step 47004: {'lr': 0.0003943455187790428, 'samples': 9024768, 'steps': 47003, 'loss/train': 1.455561637878418} 11/07/2021 03:52:13 - INFO - __main__ - Step 47005: {'lr': 0.0003943411859272624, 'samples': 9024960, 'steps': 47004, 'loss/train': 1.614766001701355} 11/07/2021 03:52:14 - INFO - __main__ - Step 47006: {'lr': 0.0003943368530104441, 'samples': 9025152, 'steps': 47005, 'loss/train': 1.222779631614685} 11/07/2021 03:52:14 - INFO - __main__ - Step 47007: {'lr': 0.00039433252002858975, 'samples': 9025344, 'steps': 47006, 'loss/train': 1.3408775329589844} 11/07/2021 03:52:14 - INFO - __main__ - Step 47008: {'lr': 0.0003943281869817015, 'samples': 9025536, 'steps': 47007, 'loss/train': 0.7637006640434265} 11/07/2021 03:52:15 - INFO - __main__ - Step 47009: {'lr': 0.0003943238538697811, 'samples': 9025728, 'steps': 47008, 'loss/train': 1.2307101488113403} 11/07/2021 03:52:15 - INFO - __main__ - Step 47010: {'lr': 0.00039431952069283067, 'samples': 9025920, 'steps': 47009, 'loss/train': 1.1147801876068115} 11/07/2021 03:52:16 - INFO - __main__ - Step 47011: {'lr': 0.00039431518745085205, 'samples': 9026112, 'steps': 47010, 'loss/train': 1.3025678396224976} 11/07/2021 03:52:16 - INFO - __main__ - Step 47012: {'lr': 0.00039431085414384727, 'samples': 9026304, 'steps': 47011, 'loss/train': 1.6826845407485962} 11/07/2021 03:52:17 - INFO - __main__ - Step 47013: {'lr': 0.0003943065207718182, 'samples': 9026496, 'steps': 47012, 'loss/train': 1.8111188411712646} 11/07/2021 03:52:17 - INFO - __main__ - Step 47014: {'lr': 0.0003943021873347669, 'samples': 9026688, 'steps': 47013, 'loss/train': 1.3840372562408447} 11/07/2021 03:52:18 - INFO - __main__ - Step 47015: {'lr': 0.00039429785383269524, 'samples': 9026880, 'steps': 47014, 'loss/train': 1.157009482383728} 11/07/2021 03:52:19 - INFO - __main__ - Step 47016: {'lr': 0.00039429352026560516, 'samples': 9027072, 'steps': 47015, 'loss/train': 1.1459460258483887} 11/07/2021 03:52:19 - INFO - __main__ - Step 47017: {'lr': 0.0003942891866334987, 'samples': 9027264, 'steps': 47016, 'loss/train': 1.6639070510864258} 11/07/2021 03:52:19 - INFO - __main__ - Step 47018: {'lr': 0.00039428485293637773, 'samples': 9027456, 'steps': 47017, 'loss/train': 1.348271131515503} 11/07/2021 03:52:20 - INFO - __main__ - Step 47019: {'lr': 0.00039428051917424423, 'samples': 9027648, 'steps': 47018, 'loss/train': 1.3297394514083862} 11/07/2021 03:52:20 - INFO - __main__ - Step 47020: {'lr': 0.0003942761853471002, 'samples': 9027840, 'steps': 47019, 'loss/train': 1.7127476930618286} 11/07/2021 03:52:21 - INFO - __main__ - Step 47021: {'lr': 0.0003942718514549475, 'samples': 9028032, 'steps': 47020, 'loss/train': 1.5943137407302856} 11/07/2021 03:52:21 - INFO - __main__ - Step 47022: {'lr': 0.0003942675174977881, 'samples': 9028224, 'steps': 47021, 'loss/train': 1.2140249013900757} 11/07/2021 03:52:22 - INFO - __main__ - Step 47023: {'lr': 0.000394263183475624, 'samples': 9028416, 'steps': 47022, 'loss/train': 1.5225287675857544} 11/07/2021 03:52:22 - INFO - __main__ - Step 47024: {'lr': 0.0003942588493884571, 'samples': 9028608, 'steps': 47023, 'loss/train': 1.6169816255569458} 11/07/2021 03:52:22 - INFO - __main__ - Step 47025: {'lr': 0.00039425451523628953, 'samples': 9028800, 'steps': 47024, 'loss/train': 1.2692021131515503} 11/07/2021 03:52:23 - INFO - __main__ - Step 47026: {'lr': 0.00039425018101912305, 'samples': 9028992, 'steps': 47025, 'loss/train': 1.7158222198486328} 11/07/2021 03:52:24 - INFO - __main__ - Step 47027: {'lr': 0.00039424584673695956, 'samples': 9029184, 'steps': 47026, 'loss/train': 1.682454228401184} 11/07/2021 03:52:24 - INFO - __main__ - Step 47028: {'lr': 0.0003942415123898012, 'samples': 9029376, 'steps': 47027, 'loss/train': 1.6168550252914429} 11/07/2021 03:52:24 - INFO - __main__ - Step 47029: {'lr': 0.0003942371779776498, 'samples': 9029568, 'steps': 47028, 'loss/train': 1.5480886697769165} 11/07/2021 03:52:25 - INFO - __main__ - Step 47030: {'lr': 0.00039423284350050735, 'samples': 9029760, 'steps': 47029, 'loss/train': 1.8418940305709839} 11/07/2021 03:52:26 - INFO - __main__ - Step 47031: {'lr': 0.0003942285089583759, 'samples': 9029952, 'steps': 47030, 'loss/train': 1.7178999185562134} 11/07/2021 03:52:26 - INFO - __main__ - Step 47032: {'lr': 0.0003942241743512572, 'samples': 9030144, 'steps': 47031, 'loss/train': 1.4074722528457642} 11/07/2021 03:52:26 - INFO - __main__ - Step 47033: {'lr': 0.00039421983967915337, 'samples': 9030336, 'steps': 47032, 'loss/train': 1.7305998802185059} 11/07/2021 03:52:27 - INFO - __main__ - Step 47034: {'lr': 0.00039421550494206625, 'samples': 9030528, 'steps': 47033, 'loss/train': 1.7071924209594727} 11/07/2021 03:52:27 - INFO - __main__ - Step 47035: {'lr': 0.0003942111701399979, 'samples': 9030720, 'steps': 47034, 'loss/train': 1.5228930711746216} 11/07/2021 03:52:28 - INFO - __main__ - Step 47036: {'lr': 0.0003942068352729502, 'samples': 9030912, 'steps': 47035, 'loss/train': 1.6422492265701294} 11/07/2021 03:52:29 - INFO - __main__ - Step 47037: {'lr': 0.0003942025003409252, 'samples': 9031104, 'steps': 47036, 'loss/train': 1.4419137239456177} 11/07/2021 03:52:29 - INFO - __main__ - Step 47038: {'lr': 0.0003941981653439247, 'samples': 9031296, 'steps': 47037, 'loss/train': 1.5991615056991577} 11/07/2021 03:52:29 - INFO - __main__ - Step 47039: {'lr': 0.00039419383028195076, 'samples': 9031488, 'steps': 47038, 'loss/train': 1.3113130331039429} 11/07/2021 03:52:30 - INFO - __main__ - Step 47040: {'lr': 0.00039418949515500524, 'samples': 9031680, 'steps': 47039, 'loss/train': 1.5550031661987305} 11/07/2021 03:52:31 - INFO - __main__ - Step 47041: {'lr': 0.0003941851599630902, 'samples': 9031872, 'steps': 47040, 'loss/train': 1.4652179479599} 11/07/2021 03:52:31 - INFO - __main__ - Step 47042: {'lr': 0.00039418082470620756, 'samples': 9032064, 'steps': 47041, 'loss/train': 1.7316834926605225} 11/07/2021 03:52:32 - INFO - __main__ - Step 47043: {'lr': 0.0003941764893843593, 'samples': 9032256, 'steps': 47042, 'loss/train': 1.5148168802261353} 11/07/2021 03:52:32 - INFO - __main__ - Step 47044: {'lr': 0.0003941721539975473, 'samples': 9032448, 'steps': 47043, 'loss/train': 1.3945996761322021} 11/07/2021 03:52:32 - INFO - __main__ - Step 47045: {'lr': 0.0003941678185457736, 'samples': 9032640, 'steps': 47044, 'loss/train': 1.6497108936309814} 11/07/2021 03:52:33 - INFO - __main__ - Step 47046: {'lr': 0.00039416348302904005, 'samples': 9032832, 'steps': 47045, 'loss/train': 1.4801838397979736} 11/07/2021 03:52:34 - INFO - __main__ - Step 47047: {'lr': 0.0003941591474473487, 'samples': 9033024, 'steps': 47046, 'loss/train': 1.6212077140808105} 11/07/2021 03:52:34 - INFO - __main__ - Step 47048: {'lr': 0.0003941548118007014, 'samples': 9033216, 'steps': 47047, 'loss/train': 1.920916199684143} 11/07/2021 03:52:34 - INFO - __main__ - Step 47049: {'lr': 0.00039415047608910023, 'samples': 9033408, 'steps': 47048, 'loss/train': 1.437896966934204} 11/07/2021 03:52:35 - INFO - __main__ - Step 47050: {'lr': 0.000394146140312547, 'samples': 9033600, 'steps': 47049, 'loss/train': 1.3306047916412354} 11/07/2021 03:52:35 - INFO - __main__ - Step 47051: {'lr': 0.0003941418044710438, 'samples': 9033792, 'steps': 47050, 'loss/train': 1.4316445589065552} 11/07/2021 03:52:36 - INFO - __main__ - Step 47052: {'lr': 0.00039413746856459253, 'samples': 9033984, 'steps': 47051, 'loss/train': 0.7563554644584656} 11/07/2021 03:52:37 - INFO - __main__ - Step 47053: {'lr': 0.0003941331325931952, 'samples': 9034176, 'steps': 47052, 'loss/train': 1.5452617406845093} 11/07/2021 03:52:37 - INFO - __main__ - Step 47054: {'lr': 0.0003941287965568536, 'samples': 9034368, 'steps': 47053, 'loss/train': 1.8800292015075684} 11/07/2021 03:52:37 - INFO - __main__ - Step 47055: {'lr': 0.0003941244604555698, 'samples': 9034560, 'steps': 47054, 'loss/train': 1.7606195211410522} 11/07/2021 03:52:38 - INFO - __main__ - Step 47056: {'lr': 0.0003941201242893457, 'samples': 9034752, 'steps': 47055, 'loss/train': 1.5376604795455933} 11/07/2021 03:52:39 - INFO - __main__ - Step 47057: {'lr': 0.00039411578805818344, 'samples': 9034944, 'steps': 47056, 'loss/train': 1.383442997932434} 11/07/2021 03:52:39 - INFO - __main__ - Step 47058: {'lr': 0.00039411145176208477, 'samples': 9035136, 'steps': 47057, 'loss/train': 1.3758971691131592} 11/07/2021 03:52:39 - INFO - __main__ - Step 47059: {'lr': 0.0003941071154010517, 'samples': 9035328, 'steps': 47058, 'loss/train': 1.4077143669128418} 11/07/2021 03:52:40 - INFO - __main__ - Step 47060: {'lr': 0.00039410277897508617, 'samples': 9035520, 'steps': 47059, 'loss/train': 1.337977647781372} 11/07/2021 03:52:40 - INFO - __main__ - Step 47061: {'lr': 0.00039409844248419014, 'samples': 9035712, 'steps': 47060, 'loss/train': 1.5199073553085327} 11/07/2021 03:52:41 - INFO - __main__ - Step 47062: {'lr': 0.0003940941059283656, 'samples': 9035904, 'steps': 47061, 'loss/train': 1.421905517578125} 11/07/2021 03:52:41 - INFO - __main__ - Step 47063: {'lr': 0.00039408976930761444, 'samples': 9036096, 'steps': 47062, 'loss/train': 1.5089776515960693} 11/07/2021 03:52:42 - INFO - __main__ - Step 47064: {'lr': 0.00039408543262193867, 'samples': 9036288, 'steps': 47063, 'loss/train': 1.375567078590393} 11/07/2021 03:52:42 - INFO - __main__ - Step 47065: {'lr': 0.00039408109587134034, 'samples': 9036480, 'steps': 47064, 'loss/train': 0.798215925693512} 11/07/2021 03:52:42 - INFO - __main__ - Step 47066: {'lr': 0.00039407675905582117, 'samples': 9036672, 'steps': 47065, 'loss/train': 1.7782467603683472} 11/07/2021 03:52:44 - INFO - __main__ - Step 47067: {'lr': 0.00039407242217538317, 'samples': 9036864, 'steps': 47066, 'loss/train': 1.2912510633468628} 11/07/2021 03:52:44 - INFO - __main__ - Step 47068: {'lr': 0.0003940680852300285, 'samples': 9037056, 'steps': 47067, 'loss/train': 1.5498754978179932} 11/07/2021 03:52:44 - INFO - __main__ - Step 47069: {'lr': 0.00039406374821975893, 'samples': 9037248, 'steps': 47068, 'loss/train': 1.6175334453582764} 11/07/2021 03:52:45 - INFO - __main__ - Step 47070: {'lr': 0.00039405941114457644, 'samples': 9037440, 'steps': 47069, 'loss/train': 1.3006986379623413} 11/07/2021 03:52:45 - INFO - __main__ - Step 47071: {'lr': 0.000394055074004483, 'samples': 9037632, 'steps': 47070, 'loss/train': 1.5169404745101929} 11/07/2021 03:52:46 - INFO - __main__ - Step 47072: {'lr': 0.0003940507367994806, 'samples': 9037824, 'steps': 47071, 'loss/train': 0.5934475660324097} 11/07/2021 03:52:46 - INFO - __main__ - Step 47073: {'lr': 0.00039404639952957116, 'samples': 9038016, 'steps': 47072, 'loss/train': 1.216092586517334} 11/07/2021 03:52:47 - INFO - __main__ - Step 47074: {'lr': 0.00039404206219475655, 'samples': 9038208, 'steps': 47073, 'loss/train': 1.2290641069412231} 11/07/2021 03:52:47 - INFO - __main__ - Step 47075: {'lr': 0.00039403772479503895, 'samples': 9038400, 'steps': 47074, 'loss/train': 1.8244417905807495} 11/07/2021 03:52:47 - INFO - __main__ - Step 47076: {'lr': 0.0003940333873304201, 'samples': 9038592, 'steps': 47075, 'loss/train': 0.9801872968673706} 11/07/2021 03:52:49 - INFO - __main__ - Step 47077: {'lr': 0.000394029049800902, 'samples': 9038784, 'steps': 47076, 'loss/train': 1.3451482057571411} 11/07/2021 03:52:49 - INFO - __main__ - Step 47078: {'lr': 0.00039402471220648675, 'samples': 9038976, 'steps': 47077, 'loss/train': 1.9589600563049316} 11/07/2021 03:52:49 - INFO - __main__ - Step 47079: {'lr': 0.000394020374547176, 'samples': 9039168, 'steps': 47078, 'loss/train': 1.6597225666046143} 11/07/2021 03:52:50 - INFO - __main__ - Step 47080: {'lr': 0.00039401603682297204, 'samples': 9039360, 'steps': 47079, 'loss/train': 1.3895072937011719} 11/07/2021 03:52:50 - INFO - __main__ - Step 47081: {'lr': 0.0003940116990338766, 'samples': 9039552, 'steps': 47080, 'loss/train': 2.7593445777893066} 11/07/2021 03:52:50 - INFO - __main__ - Step 47082: {'lr': 0.00039400736117989175, 'samples': 9039744, 'steps': 47081, 'loss/train': 1.3985692262649536} 11/07/2021 03:52:51 - INFO - __main__ - Step 47083: {'lr': 0.0003940030232610194, 'samples': 9039936, 'steps': 47082, 'loss/train': 1.3660260438919067} 11/07/2021 03:52:52 - INFO - __main__ - Step 47084: {'lr': 0.0003939986852772615, 'samples': 9040128, 'steps': 47083, 'loss/train': 1.7843916416168213} 11/07/2021 03:52:52 - INFO - __main__ - Step 47085: {'lr': 0.00039399434722862004, 'samples': 9040320, 'steps': 47084, 'loss/train': 1.2024017572402954} 11/07/2021 03:52:53 - INFO - __main__ - Step 47086: {'lr': 0.00039399000911509685, 'samples': 9040512, 'steps': 47085, 'loss/train': 1.2676604986190796} 11/07/2021 03:52:53 - INFO - __main__ - Step 47087: {'lr': 0.00039398567093669413, 'samples': 9040704, 'steps': 47086, 'loss/train': 1.5416532754898071} 11/07/2021 03:52:54 - INFO - __main__ - Step 47088: {'lr': 0.00039398133269341357, 'samples': 9040896, 'steps': 47087, 'loss/train': 1.261775016784668} 11/07/2021 03:52:54 - INFO - __main__ - Step 47089: {'lr': 0.0003939769943852573, 'samples': 9041088, 'steps': 47088, 'loss/train': 1.8220853805541992} 11/07/2021 03:52:55 - INFO - __main__ - Step 47090: {'lr': 0.0003939726560122272, 'samples': 9041280, 'steps': 47089, 'loss/train': 0.8334203362464905} 11/07/2021 03:52:55 - INFO - __main__ - Step 47091: {'lr': 0.00039396831757432526, 'samples': 9041472, 'steps': 47090, 'loss/train': 1.4785126447677612} 11/07/2021 03:52:55 - INFO - __main__ - Step 47092: {'lr': 0.0003939639790715535, 'samples': 9041664, 'steps': 47091, 'loss/train': 1.4800597429275513} 11/07/2021 03:52:56 - INFO - __main__ - Step 47093: {'lr': 0.0003939596405039136, 'samples': 9041856, 'steps': 47092, 'loss/train': 1.4908806085586548} 11/07/2021 03:52:57 - INFO - __main__ - Step 47094: {'lr': 0.00039395530187140784, 'samples': 9042048, 'steps': 47093, 'loss/train': 1.5633057355880737} 11/07/2021 03:52:57 - INFO - __main__ - Step 47095: {'lr': 0.000393950963174038, 'samples': 9042240, 'steps': 47094, 'loss/train': 1.6771138906478882} 11/07/2021 03:52:57 - INFO - __main__ - Step 47096: {'lr': 0.00039394662441180606, 'samples': 9042432, 'steps': 47095, 'loss/train': 1.8062840700149536} 11/07/2021 03:52:58 - INFO - __main__ - Step 47097: {'lr': 0.000393942285584714, 'samples': 9042624, 'steps': 47096, 'loss/train': 1.3849742412567139} 11/07/2021 03:52:58 - INFO - __main__ - Step 47098: {'lr': 0.00039393794669276386, 'samples': 9042816, 'steps': 47097, 'loss/train': 1.4430208206176758} 11/07/2021 03:52:59 - INFO - __main__ - Step 47099: {'lr': 0.00039393360773595744, 'samples': 9043008, 'steps': 47098, 'loss/train': 1.518402099609375} 11/07/2021 03:52:59 - INFO - __main__ - Step 47100: {'lr': 0.0003939292687142967, 'samples': 9043200, 'steps': 47099, 'loss/train': 1.5874766111373901} 11/07/2021 03:53:00 - INFO - __main__ - Step 47101: {'lr': 0.0003939249296277837, 'samples': 9043392, 'steps': 47100, 'loss/train': 1.4214836359024048} 11/07/2021 03:53:00 - INFO - __main__ - Step 47102: {'lr': 0.0003939205904764204, 'samples': 9043584, 'steps': 47101, 'loss/train': 1.459752082824707} 11/07/2021 03:53:01 - INFO - __main__ - Step 47103: {'lr': 0.00039391625126020856, 'samples': 9043776, 'steps': 47102, 'loss/train': 1.505184292793274} 11/07/2021 03:53:02 - INFO - __main__ - Step 47104: {'lr': 0.0003939119119791504, 'samples': 9043968, 'steps': 47103, 'loss/train': 1.4024347066879272} 11/07/2021 03:53:02 - INFO - __main__ - Step 47105: {'lr': 0.0003939075726332477, 'samples': 9044160, 'steps': 47104, 'loss/train': 1.4824903011322021} 11/07/2021 03:53:02 - INFO - __main__ - Step 47106: {'lr': 0.00039390323322250253, 'samples': 9044352, 'steps': 47105, 'loss/train': 0.8480738997459412} 11/07/2021 03:53:03 - INFO - __main__ - Step 47107: {'lr': 0.0003938988937469168, 'samples': 9044544, 'steps': 47106, 'loss/train': 1.2670586109161377} 11/07/2021 03:53:03 - INFO - __main__ - Step 47108: {'lr': 0.0003938945542064923, 'samples': 9044736, 'steps': 47107, 'loss/train': 1.4854414463043213} 11/07/2021 03:53:04 - INFO - __main__ - Step 47109: {'lr': 0.00039389021460123125, 'samples': 9044928, 'steps': 47108, 'loss/train': 1.1902029514312744} 11/07/2021 03:53:04 - INFO - __main__ - Step 47110: {'lr': 0.0003938858749311355, 'samples': 9045120, 'steps': 47109, 'loss/train': 1.9038214683532715} 11/07/2021 03:53:05 - INFO - __main__ - Step 47111: {'lr': 0.00039388153519620696, 'samples': 9045312, 'steps': 47110, 'loss/train': 1.5505082607269287} 11/07/2021 03:53:05 - INFO - __main__ - Step 47112: {'lr': 0.0003938771953964476, 'samples': 9045504, 'steps': 47111, 'loss/train': 1.3920332193374634} 11/07/2021 03:53:05 - INFO - __main__ - Step 47113: {'lr': 0.0003938728555318594, 'samples': 9045696, 'steps': 47112, 'loss/train': 2.713216543197632} 11/07/2021 03:53:06 - INFO - __main__ - Step 47114: {'lr': 0.00039386851560244433, 'samples': 9045888, 'steps': 47113, 'loss/train': 1.6680505275726318} 11/07/2021 03:53:07 - INFO - __main__ - Step 47115: {'lr': 0.0003938641756082043, 'samples': 9046080, 'steps': 47114, 'loss/train': 1.3867429494857788} 11/07/2021 03:53:07 - INFO - __main__ - Step 47116: {'lr': 0.00039385983554914136, 'samples': 9046272, 'steps': 47115, 'loss/train': 1.3547860383987427} 11/07/2021 03:53:08 - INFO - __main__ - Step 47117: {'lr': 0.0003938554954252573, 'samples': 9046464, 'steps': 47116, 'loss/train': 1.5629254579544067} 11/07/2021 03:53:08 - INFO - __main__ - Step 47118: {'lr': 0.00039385115523655426, 'samples': 9046656, 'steps': 47117, 'loss/train': 1.1610107421875} 11/07/2021 03:53:09 - INFO - __main__ - Step 47119: {'lr': 0.00039384681498303407, 'samples': 9046848, 'steps': 47118, 'loss/train': 1.762439489364624} 11/07/2021 03:53:09 - INFO - __main__ - Step 47120: {'lr': 0.0003938424746646988, 'samples': 9047040, 'steps': 47119, 'loss/train': 1.3170665502548218} 11/07/2021 03:53:10 - INFO - __main__ - Step 47121: {'lr': 0.00039383813428155027, 'samples': 9047232, 'steps': 47120, 'loss/train': 2.4549214839935303} 11/07/2021 03:53:10 - INFO - __main__ - Step 47122: {'lr': 0.0003938337938335904, 'samples': 9047424, 'steps': 47121, 'loss/train': 1.5323492288589478} 11/07/2021 03:53:10 - INFO - __main__ - Step 47123: {'lr': 0.00039382945332082136, 'samples': 9047616, 'steps': 47122, 'loss/train': 1.4084267616271973} 11/07/2021 03:53:11 - INFO - __main__ - Step 47124: {'lr': 0.00039382511274324496, 'samples': 9047808, 'steps': 47123, 'loss/train': 1.4346362352371216} 11/07/2021 03:53:12 - INFO - __main__ - Step 47125: {'lr': 0.0003938207721008632, 'samples': 9048000, 'steps': 47124, 'loss/train': 1.3514961004257202} 11/07/2021 03:53:12 - INFO - __main__ - Step 47126: {'lr': 0.00039381643139367806, 'samples': 9048192, 'steps': 47125, 'loss/train': 1.743735909461975} 11/07/2021 03:53:12 - INFO - __main__ - Step 47127: {'lr': 0.00039381209062169136, 'samples': 9048384, 'steps': 47126, 'loss/train': 1.2722545862197876} 11/07/2021 03:53:13 - INFO - __main__ - Step 47128: {'lr': 0.0003938077497849052, 'samples': 9048576, 'steps': 47127, 'loss/train': 1.4940422773361206} 11/07/2021 03:53:13 - INFO - __main__ - Step 47129: {'lr': 0.00039380340888332143, 'samples': 9048768, 'steps': 47128, 'loss/train': 1.303487777709961} 11/07/2021 03:53:14 - INFO - __main__ - Step 47130: {'lr': 0.0003937990679169421, 'samples': 9048960, 'steps': 47129, 'loss/train': 2.0714499950408936} 11/07/2021 03:53:15 - INFO - __main__ - Step 47131: {'lr': 0.0003937947268857692, 'samples': 9049152, 'steps': 47130, 'loss/train': 1.6812490224838257} 11/07/2021 03:53:15 - INFO - __main__ - Step 47132: {'lr': 0.00039379038578980454, 'samples': 9049344, 'steps': 47131, 'loss/train': 1.5768859386444092} 11/07/2021 03:53:15 - INFO - __main__ - Step 47133: {'lr': 0.0003937860446290502, 'samples': 9049536, 'steps': 47132, 'loss/train': 1.3159254789352417} 11/07/2021 03:53:16 - INFO - __main__ - Step 47134: {'lr': 0.0003937817034035081, 'samples': 9049728, 'steps': 47133, 'loss/train': 1.0606609582901} 11/07/2021 03:53:17 - INFO - __main__ - Step 47135: {'lr': 0.00039377736211318004, 'samples': 9049920, 'steps': 47134, 'loss/train': 2.44612717628479} 11/07/2021 03:53:17 - INFO - __main__ - Step 47136: {'lr': 0.0003937730207580682, 'samples': 9050112, 'steps': 47135, 'loss/train': 1.7027770280838013} 11/07/2021 03:53:17 - INFO - __main__ - Step 47137: {'lr': 0.0003937686793381745, 'samples': 9050304, 'steps': 47136, 'loss/train': 0.6332939863204956} 11/07/2021 03:53:18 - INFO - __main__ - Step 47138: {'lr': 0.0003937643378535009, 'samples': 9050496, 'steps': 47137, 'loss/train': 1.0706485509872437} 11/07/2021 03:53:18 - INFO - __main__ - Step 47139: {'lr': 0.0003937599963040491, 'samples': 9050688, 'steps': 47138, 'loss/train': 0.40077632665634155} 11/07/2021 03:53:19 - INFO - __main__ - Step 47140: {'lr': 0.0003937556546898214, 'samples': 9050880, 'steps': 47139, 'loss/train': 1.3255983591079712} 11/07/2021 03:53:20 - INFO - __main__ - Step 47141: {'lr': 0.0003937513130108197, 'samples': 9051072, 'steps': 47140, 'loss/train': 1.567197322845459} 11/07/2021 03:53:20 - INFO - __main__ - Step 47142: {'lr': 0.00039374697126704573, 'samples': 9051264, 'steps': 47141, 'loss/train': 1.435327172279358} 11/07/2021 03:53:20 - INFO - __main__ - Step 47143: {'lr': 0.0003937426294585017, 'samples': 9051456, 'steps': 47142, 'loss/train': 0.9999099373817444} 11/07/2021 03:53:21 - INFO - __main__ - Step 47144: {'lr': 0.00039373828758518936, 'samples': 9051648, 'steps': 47143, 'loss/train': 0.8797643780708313} 11/07/2021 03:53:21 - INFO - __main__ - Step 47145: {'lr': 0.00039373394564711086, 'samples': 9051840, 'steps': 47144, 'loss/train': 0.6948489546775818} 11/07/2021 03:53:22 - INFO - __main__ - Step 47146: {'lr': 0.00039372960364426803, 'samples': 9052032, 'steps': 47145, 'loss/train': 1.5431687831878662} 11/07/2021 03:53:22 - INFO - __main__ - Step 47147: {'lr': 0.0003937252615766628, 'samples': 9052224, 'steps': 47146, 'loss/train': 1.4942766427993774} 11/07/2021 03:53:23 - INFO - __main__ - Step 47148: {'lr': 0.0003937209194442973, 'samples': 9052416, 'steps': 47147, 'loss/train': 1.663383960723877} 11/07/2021 03:53:23 - INFO - __main__ - Step 47149: {'lr': 0.00039371657724717325, 'samples': 9052608, 'steps': 47148, 'loss/train': 1.6548125743865967} 11/07/2021 03:53:24 - INFO - __main__ - Step 47150: {'lr': 0.0003937122349852928, 'samples': 9052800, 'steps': 47149, 'loss/train': 1.6805589199066162} 11/07/2021 03:53:24 - INFO - __main__ - Step 47151: {'lr': 0.0003937078926586578, 'samples': 9052992, 'steps': 47150, 'loss/train': 1.630454421043396} 11/07/2021 03:53:25 - INFO - __main__ - Step 47152: {'lr': 0.0003937035502672703, 'samples': 9053184, 'steps': 47151, 'loss/train': 1.3253780603408813} 11/07/2021 03:53:25 - INFO - __main__ - Step 47153: {'lr': 0.0003936992078111321, 'samples': 9053376, 'steps': 47152, 'loss/train': 2.368971109390259} 11/07/2021 03:53:26 - INFO - __main__ - Step 47154: {'lr': 0.0003936948652902453, 'samples': 9053568, 'steps': 47153, 'loss/train': 1.6653461456298828} 11/07/2021 03:53:26 - INFO - __main__ - Step 47155: {'lr': 0.0003936905227046119, 'samples': 9053760, 'steps': 47154, 'loss/train': 1.3452043533325195} 11/07/2021 03:53:27 - INFO - __main__ - Step 47156: {'lr': 0.00039368618005423365, 'samples': 9053952, 'steps': 47155, 'loss/train': 1.4380110502243042} 11/07/2021 03:53:27 - INFO - __main__ - Step 47157: {'lr': 0.00039368183733911265, 'samples': 9054144, 'steps': 47156, 'loss/train': 1.4627124071121216} 11/07/2021 03:53:28 - INFO - __main__ - Step 47158: {'lr': 0.00039367749455925086, 'samples': 9054336, 'steps': 47157, 'loss/train': 1.1693251132965088} 11/07/2021 03:53:28 - INFO - __main__ - Step 47159: {'lr': 0.0003936731517146502, 'samples': 9054528, 'steps': 47158, 'loss/train': 1.4604130983352661} 11/07/2021 03:53:28 - INFO - __main__ - Step 47160: {'lr': 0.0003936688088053126, 'samples': 9054720, 'steps': 47159, 'loss/train': 1.8615742921829224} 11/07/2021 03:53:29 - INFO - __main__ - Step 47161: {'lr': 0.0003936644658312401, 'samples': 9054912, 'steps': 47160, 'loss/train': 1.6003049612045288} 11/07/2021 03:53:30 - INFO - __main__ - Step 47162: {'lr': 0.0003936601227924346, 'samples': 9055104, 'steps': 47161, 'loss/train': 0.5940176248550415} 11/07/2021 03:53:30 - INFO - __main__ - Step 47163: {'lr': 0.00039365577968889805, 'samples': 9055296, 'steps': 47162, 'loss/train': 1.779089093208313} 11/07/2021 03:53:30 - INFO - __main__ - Step 47164: {'lr': 0.0003936514365206324, 'samples': 9055488, 'steps': 47163, 'loss/train': 1.7254983186721802} 11/07/2021 03:53:31 - INFO - __main__ - Step 47165: {'lr': 0.00039364709328763966, 'samples': 9055680, 'steps': 47164, 'loss/train': 1.2700613737106323} 11/07/2021 03:53:31 - INFO - __main__ - Step 47166: {'lr': 0.00039364274998992177, 'samples': 9055872, 'steps': 47165, 'loss/train': 1.04393470287323} 11/07/2021 03:53:32 - INFO - __main__ - Step 47167: {'lr': 0.00039363840662748063, 'samples': 9056064, 'steps': 47166, 'loss/train': 0.7460552453994751} 11/07/2021 03:53:32 - INFO - __main__ - Step 47168: {'lr': 0.0003936340632003183, 'samples': 9056256, 'steps': 47167, 'loss/train': 1.065807580947876} 11/07/2021 03:53:33 - INFO - __main__ - Step 47169: {'lr': 0.0003936297197084366, 'samples': 9056448, 'steps': 47168, 'loss/train': 1.5235415697097778} 11/07/2021 03:53:33 - INFO - __main__ - Step 47170: {'lr': 0.00039362537615183764, 'samples': 9056640, 'steps': 47169, 'loss/train': 1.1110341548919678} 11/07/2021 03:53:33 - INFO - __main__ - Step 47171: {'lr': 0.0003936210325305233, 'samples': 9056832, 'steps': 47170, 'loss/train': 1.6105690002441406} 11/07/2021 03:53:35 - INFO - __main__ - Step 47172: {'lr': 0.0003936166888444954, 'samples': 9057024, 'steps': 47171, 'loss/train': 1.2022067308425903} 11/07/2021 03:53:35 - INFO - __main__ - Step 47173: {'lr': 0.0003936123450937562, 'samples': 9057216, 'steps': 47172, 'loss/train': 1.0590406656265259} 11/07/2021 03:53:35 - INFO - __main__ - Step 47174: {'lr': 0.0003936080012783075, 'samples': 9057408, 'steps': 47173, 'loss/train': 1.7635650634765625} 11/07/2021 03:53:36 - INFO - __main__ - Step 47175: {'lr': 0.0003936036573981512, 'samples': 9057600, 'steps': 47174, 'loss/train': 1.543225884437561} 11/07/2021 03:53:36 - INFO - __main__ - Step 47176: {'lr': 0.00039359931345328927, 'samples': 9057792, 'steps': 47175, 'loss/train': 1.3579483032226562} 11/07/2021 03:53:37 - INFO - __main__ - Step 47177: {'lr': 0.0003935949694437237, 'samples': 9057984, 'steps': 47176, 'loss/train': 1.6006814241409302} 11/07/2021 03:53:37 - INFO - __main__ - Step 47178: {'lr': 0.00039359062536945645, 'samples': 9058176, 'steps': 47177, 'loss/train': 0.7601653933525085} 11/07/2021 03:53:38 - INFO - __main__ - Step 47179: {'lr': 0.00039358628123048955, 'samples': 9058368, 'steps': 47178, 'loss/train': 1.717115044593811} 11/07/2021 03:53:38 - INFO - __main__ - Step 47180: {'lr': 0.0003935819370268249, 'samples': 9058560, 'steps': 47179, 'loss/train': 1.3778393268585205} 11/07/2021 03:53:38 - INFO - __main__ - Step 47181: {'lr': 0.00039357759275846437, 'samples': 9058752, 'steps': 47180, 'loss/train': 1.2944941520690918} 11/07/2021 03:53:40 - INFO - __main__ - Step 47182: {'lr': 0.00039357324842541, 'samples': 9058944, 'steps': 47181, 'loss/train': 1.4966262578964233} 11/07/2021 03:53:40 - INFO - __main__ - Step 47183: {'lr': 0.0003935689040276638, 'samples': 9059136, 'steps': 47182, 'loss/train': 2.011629104614258} 11/07/2021 03:53:40 - INFO - __main__ - Step 47184: {'lr': 0.0003935645595652276, 'samples': 9059328, 'steps': 47183, 'loss/train': 1.5474724769592285} 11/07/2021 03:53:41 - INFO - __main__ - Step 47185: {'lr': 0.0003935602150381034, 'samples': 9059520, 'steps': 47184, 'loss/train': 1.3871104717254639} 11/07/2021 03:53:41 - INFO - __main__ - Step 47186: {'lr': 0.00039355587044629325, 'samples': 9059712, 'steps': 47185, 'loss/train': 2.540052652359009} 11/07/2021 03:53:42 - INFO - __main__ - Step 47187: {'lr': 0.00039355152578979903, 'samples': 9059904, 'steps': 47186, 'loss/train': 1.1637755632400513} 11/07/2021 03:53:42 - INFO - __main__ - Step 47188: {'lr': 0.0003935471810686228, 'samples': 9060096, 'steps': 47187, 'loss/train': 1.387712001800537} 11/07/2021 03:53:43 - INFO - __main__ - Step 47189: {'lr': 0.0003935428362827662, 'samples': 9060288, 'steps': 47188, 'loss/train': 1.6322131156921387} 11/07/2021 03:53:43 - INFO - __main__ - Step 47190: {'lr': 0.0003935384914322316, 'samples': 9060480, 'steps': 47189, 'loss/train': 0.8436173796653748} 11/07/2021 03:53:44 - INFO - __main__ - Step 47191: {'lr': 0.0003935341465170207, 'samples': 9060672, 'steps': 47190, 'loss/train': 1.3562065362930298} 11/07/2021 03:53:44 - INFO - __main__ - Step 47192: {'lr': 0.0003935298015371355, 'samples': 9060864, 'steps': 47191, 'loss/train': 1.4174747467041016} 11/07/2021 03:53:46 - INFO - __main__ - Step 47193: {'lr': 0.0003935254564925781, 'samples': 9061056, 'steps': 47192, 'loss/train': 1.27580988407135} 11/07/2021 03:53:46 - INFO - __main__ - Step 47194: {'lr': 0.0003935211113833502, 'samples': 9061248, 'steps': 47193, 'loss/train': 1.6711721420288086} 11/07/2021 03:53:47 - INFO - __main__ - Step 47195: {'lr': 0.00039351676620945396, 'samples': 9061440, 'steps': 47194, 'loss/train': 0.5922019481658936} 11/07/2021 03:53:47 - INFO - __main__ - Step 47196: {'lr': 0.00039351242097089133, 'samples': 9061632, 'steps': 47195, 'loss/train': 1.8144773244857788} 11/07/2021 03:53:47 - INFO - __main__ - Step 47197: {'lr': 0.0003935080756676641, 'samples': 9061824, 'steps': 47196, 'loss/train': 1.7800637483596802} 11/07/2021 03:53:48 - INFO - __main__ - Step 47198: {'lr': 0.0003935037302997745, 'samples': 9062016, 'steps': 47197, 'loss/train': 1.7908663749694824} 11/07/2021 03:53:48 - INFO - __main__ - Step 47199: {'lr': 0.00039349938486722425, 'samples': 9062208, 'steps': 47198, 'loss/train': 1.4536714553833008} 11/07/2021 03:53:48 - INFO - __main__ - Step 47200: {'lr': 0.0003934950393700154, 'samples': 9062400, 'steps': 47199, 'loss/train': 1.3902226686477661} 11/07/2021 03:53:49 - INFO - __main__ - Step 47201: {'lr': 0.0003934906938081499, 'samples': 9062592, 'steps': 47200, 'loss/train': 1.2590264081954956} 11/07/2021 03:53:50 - INFO - __main__ - Step 47202: {'lr': 0.0003934863481816297, 'samples': 9062784, 'steps': 47201, 'loss/train': 1.6010756492614746} 11/07/2021 03:53:50 - INFO - __main__ - Step 47203: {'lr': 0.00039348200249045675, 'samples': 9062976, 'steps': 47202, 'loss/train': 1.2708886861801147} 11/07/2021 03:53:51 - INFO - __main__ - Step 47204: {'lr': 0.000393477656734633, 'samples': 9063168, 'steps': 47203, 'loss/train': 2.0616767406463623} 11/07/2021 03:53:51 - INFO - __main__ - Step 47205: {'lr': 0.0003934733109141605, 'samples': 9063360, 'steps': 47204, 'loss/train': 1.5255537033081055} 11/07/2021 03:53:52 - INFO - __main__ - Step 47206: {'lr': 0.00039346896502904117, 'samples': 9063552, 'steps': 47205, 'loss/train': 1.4788717031478882} 11/07/2021 03:53:52 - INFO - __main__ - Step 47207: {'lr': 0.0003934646190792769, 'samples': 9063744, 'steps': 47206, 'loss/train': 1.8538322448730469} 11/07/2021 03:53:53 - INFO - __main__ - Step 47208: {'lr': 0.00039346027306486964, 'samples': 9063936, 'steps': 47207, 'loss/train': 1.713992714881897} 11/07/2021 03:53:53 - INFO - __main__ - Step 47209: {'lr': 0.00039345592698582146, 'samples': 9064128, 'steps': 47208, 'loss/train': 1.1504381895065308} 11/07/2021 03:53:53 - INFO - __main__ - Step 47210: {'lr': 0.00039345158084213417, 'samples': 9064320, 'steps': 47209, 'loss/train': 1.6446284055709839} 11/07/2021 03:53:54 - INFO - __main__ - Step 47211: {'lr': 0.0003934472346338099, 'samples': 9064512, 'steps': 47210, 'loss/train': 1.9809157848358154} 11/07/2021 03:53:55 - INFO - __main__ - Step 47212: {'lr': 0.00039344288836085046, 'samples': 9064704, 'steps': 47211, 'loss/train': 1.4329661130905151} 11/07/2021 03:53:55 - INFO - __main__ - Step 47213: {'lr': 0.0003934385420232579, 'samples': 9064896, 'steps': 47212, 'loss/train': 1.3324871063232422} 11/07/2021 03:53:55 - INFO - __main__ - Step 47214: {'lr': 0.0003934341956210341, 'samples': 9065088, 'steps': 47213, 'loss/train': 1.3765357732772827} 11/07/2021 03:53:56 - INFO - __main__ - Step 47215: {'lr': 0.0003934298491541811, 'samples': 9065280, 'steps': 47214, 'loss/train': 1.4130184650421143} 11/07/2021 03:53:57 - INFO - __main__ - Step 47216: {'lr': 0.0003934255026227008, 'samples': 9065472, 'steps': 47215, 'loss/train': 1.6666706800460815} 11/07/2021 03:53:57 - INFO - __main__ - Step 47217: {'lr': 0.0003934211560265952, 'samples': 9065664, 'steps': 47216, 'loss/train': 1.3997408151626587} 11/07/2021 03:53:58 - INFO - __main__ - Step 47218: {'lr': 0.0003934168093658663, 'samples': 9065856, 'steps': 47217, 'loss/train': 2.0830512046813965} 11/07/2021 03:53:58 - INFO - __main__ - Step 47219: {'lr': 0.0003934124626405159, 'samples': 9066048, 'steps': 47218, 'loss/train': 1.5073606967926025} 11/07/2021 03:53:58 - INFO - __main__ - Step 47220: {'lr': 0.00039340811585054615, 'samples': 9066240, 'steps': 47219, 'loss/train': 1.524347186088562} 11/07/2021 03:53:59 - INFO - __main__ - Step 47221: {'lr': 0.0003934037689959589, 'samples': 9066432, 'steps': 47220, 'loss/train': 1.5292742252349854} 11/07/2021 03:54:00 - INFO - __main__ - Step 47222: {'lr': 0.00039339942207675604, 'samples': 9066624, 'steps': 47221, 'loss/train': 1.8891303539276123} 11/07/2021 03:54:00 - INFO - __main__ - Step 47223: {'lr': 0.0003933950750929397, 'samples': 9066816, 'steps': 47222, 'loss/train': 0.27736398577690125} 11/07/2021 03:54:00 - INFO - __main__ - Step 47224: {'lr': 0.0003933907280445117, 'samples': 9067008, 'steps': 47223, 'loss/train': 1.1352673768997192} 11/07/2021 03:54:01 - INFO - __main__ - Step 47225: {'lr': 0.00039338638093147404, 'samples': 9067200, 'steps': 47224, 'loss/train': 0.9194772839546204} 11/07/2021 03:54:02 - INFO - __main__ - Step 47226: {'lr': 0.00039338203375382873, 'samples': 9067392, 'steps': 47225, 'loss/train': 1.056955099105835} 11/07/2021 03:54:02 - INFO - __main__ - Step 47227: {'lr': 0.00039337768651157766, 'samples': 9067584, 'steps': 47226, 'loss/train': 1.5889086723327637} 11/07/2021 03:54:02 - INFO - __main__ - Step 47228: {'lr': 0.0003933733392047228, 'samples': 9067776, 'steps': 47227, 'loss/train': 1.2220077514648438} 11/07/2021 03:54:03 - INFO - __main__ - Step 47229: {'lr': 0.0003933689918332662, 'samples': 9067968, 'steps': 47228, 'loss/train': 1.3765571117401123} 11/07/2021 03:54:03 - INFO - __main__ - Step 47230: {'lr': 0.0003933646443972097, 'samples': 9068160, 'steps': 47229, 'loss/train': 0.5726089477539062} 11/07/2021 03:54:03 - INFO - __main__ - Step 47231: {'lr': 0.0003933602968965553, 'samples': 9068352, 'steps': 47230, 'loss/train': 1.0503922700881958} 11/07/2021 03:54:05 - INFO - __main__ - Step 47232: {'lr': 0.00039335594933130494, 'samples': 9068544, 'steps': 47231, 'loss/train': 1.624923586845398} 11/07/2021 03:54:06 - INFO - __main__ - Step 47233: {'lr': 0.0003933516017014607, 'samples': 9068736, 'steps': 47232, 'loss/train': 1.2353731393814087} 11/07/2021 03:54:06 - INFO - __main__ - Step 47234: {'lr': 0.0003933472540070243, 'samples': 9068928, 'steps': 47233, 'loss/train': 2.365288019180298} 11/07/2021 03:54:06 - INFO - __main__ - Step 47235: {'lr': 0.00039334290624799795, 'samples': 9069120, 'steps': 47234, 'loss/train': 3.6028764247894287} 11/07/2021 03:54:07 - INFO - __main__ - Step 47236: {'lr': 0.0003933385584243834, 'samples': 9069312, 'steps': 47235, 'loss/train': 2.2998764514923096} 11/07/2021 03:54:07 - INFO - __main__ - Step 47237: {'lr': 0.0003933342105361828, 'samples': 9069504, 'steps': 47236, 'loss/train': 1.322514295578003} 11/07/2021 03:54:08 - INFO - __main__ - Step 47238: {'lr': 0.000393329862583398, 'samples': 9069696, 'steps': 47237, 'loss/train': 1.5863367319107056} 11/07/2021 03:54:09 - INFO - __main__ - Step 47239: {'lr': 0.00039332551456603093, 'samples': 9069888, 'steps': 47238, 'loss/train': 1.2961407899856567} 11/07/2021 03:54:09 - INFO - __main__ - Step 47240: {'lr': 0.00039332116648408365, 'samples': 9070080, 'steps': 47239, 'loss/train': 2.511929988861084} 11/07/2021 03:54:10 - INFO - __main__ - Step 47241: {'lr': 0.00039331681833755804, 'samples': 9070272, 'steps': 47240, 'loss/train': 1.450066089630127} 11/07/2021 03:54:10 - INFO - __main__ - Step 47242: {'lr': 0.00039331247012645604, 'samples': 9070464, 'steps': 47241, 'loss/train': 0.961763858795166} 11/07/2021 03:54:10 - INFO - __main__ - Step 47243: {'lr': 0.00039330812185077967, 'samples': 9070656, 'steps': 47242, 'loss/train': 1.3809430599212646} 11/07/2021 03:54:11 - INFO - __main__ - Step 47244: {'lr': 0.0003933037735105309, 'samples': 9070848, 'steps': 47243, 'loss/train': 1.3170738220214844} 11/07/2021 03:54:12 - INFO - __main__ - Step 47245: {'lr': 0.00039329942510571165, 'samples': 9071040, 'steps': 47244, 'loss/train': 1.5395973920822144} 11/07/2021 03:54:12 - INFO - __main__ - Step 47246: {'lr': 0.0003932950766363239, 'samples': 9071232, 'steps': 47245, 'loss/train': 1.5195993185043335} 11/07/2021 03:54:12 - INFO - __main__ - Step 47247: {'lr': 0.00039329072810236965, 'samples': 9071424, 'steps': 47246, 'loss/train': 0.65678471326828} 11/07/2021 03:54:13 - INFO - __main__ - Step 47248: {'lr': 0.0003932863795038507, 'samples': 9071616, 'steps': 47247, 'loss/train': 0.8310577273368835} 11/07/2021 03:54:14 - INFO - __main__ - Step 47249: {'lr': 0.0003932820308407692, 'samples': 9071808, 'steps': 47248, 'loss/train': 1.3056358098983765} 11/07/2021 03:54:14 - INFO - __main__ - Step 47250: {'lr': 0.000393277682113127, 'samples': 9072000, 'steps': 47249, 'loss/train': 1.0573945045471191} 11/07/2021 03:54:14 - INFO - __main__ - Step 47251: {'lr': 0.00039327333332092606, 'samples': 9072192, 'steps': 47250, 'loss/train': 1.5271329879760742} 11/07/2021 03:54:15 - INFO - __main__ - Step 47252: {'lr': 0.0003932689844641684, 'samples': 9072384, 'steps': 47251, 'loss/train': 1.4134752750396729} 11/07/2021 03:54:15 - INFO - __main__ - Step 47253: {'lr': 0.00039326463554285597, 'samples': 9072576, 'steps': 47252, 'loss/train': 1.1137325763702393} 11/07/2021 03:54:15 - INFO - __main__ - Step 47254: {'lr': 0.00039326028655699063, 'samples': 9072768, 'steps': 47253, 'loss/train': 1.1397970914840698} 11/07/2021 03:54:17 - INFO - __main__ - Step 47255: {'lr': 0.0003932559375065745, 'samples': 9072960, 'steps': 47254, 'loss/train': 1.6555663347244263} 11/07/2021 03:54:17 - INFO - __main__ - Step 47256: {'lr': 0.00039325158839160937, 'samples': 9073152, 'steps': 47255, 'loss/train': 1.5813300609588623} 11/07/2021 03:54:17 - INFO - __main__ - Step 47257: {'lr': 0.0003932472392120974, 'samples': 9073344, 'steps': 47256, 'loss/train': 1.7237550020217896} 11/07/2021 03:54:18 - INFO - __main__ - Step 47258: {'lr': 0.00039324288996804026, 'samples': 9073536, 'steps': 47257, 'loss/train': 1.1574243307113647} 11/07/2021 03:54:18 - INFO - __main__ - Step 47259: {'lr': 0.0003932385406594402, 'samples': 9073728, 'steps': 47258, 'loss/train': 1.146239161491394} 11/07/2021 03:54:19 - INFO - __main__ - Step 47260: {'lr': 0.0003932341912862991, 'samples': 9073920, 'steps': 47259, 'loss/train': 0.8379764556884766} 11/07/2021 03:54:19 - INFO - __main__ - Step 47261: {'lr': 0.0003932298418486188, 'samples': 9074112, 'steps': 47260, 'loss/train': 1.6379982233047485} 11/07/2021 03:54:20 - INFO - __main__ - Step 47262: {'lr': 0.00039322549234640136, 'samples': 9074304, 'steps': 47261, 'loss/train': 1.5093833208084106} 11/07/2021 03:54:20 - INFO - __main__ - Step 47263: {'lr': 0.00039322114277964875, 'samples': 9074496, 'steps': 47262, 'loss/train': 1.1236997842788696} 11/07/2021 03:54:20 - INFO - __main__ - Step 47264: {'lr': 0.0003932167931483629, 'samples': 9074688, 'steps': 47263, 'loss/train': 1.0301036834716797} 11/07/2021 03:54:21 - INFO - __main__ - Step 47265: {'lr': 0.00039321244345254583, 'samples': 9074880, 'steps': 47264, 'loss/train': 1.3616799116134644} 11/07/2021 03:54:22 - INFO - __main__ - Step 47266: {'lr': 0.0003932080936921993, 'samples': 9075072, 'steps': 47265, 'loss/train': 1.1745116710662842} 11/07/2021 03:54:22 - INFO - __main__ - Step 47267: {'lr': 0.00039320374386732555, 'samples': 9075264, 'steps': 47266, 'loss/train': 1.5966567993164062} 11/07/2021 03:54:22 - INFO - __main__ - Step 47268: {'lr': 0.00039319939397792635, 'samples': 9075456, 'steps': 47267, 'loss/train': 1.515926480293274} 11/07/2021 03:54:23 - INFO - __main__ - Step 47269: {'lr': 0.00039319504402400367, 'samples': 9075648, 'steps': 47268, 'loss/train': 0.9507867097854614} 11/07/2021 03:54:24 - INFO - __main__ - Step 47270: {'lr': 0.0003931906940055596, 'samples': 9075840, 'steps': 47269, 'loss/train': 1.687520146369934} 11/07/2021 03:54:24 - INFO - __main__ - Step 47271: {'lr': 0.00039318634392259593, 'samples': 9076032, 'steps': 47270, 'loss/train': 1.403967022895813} 11/07/2021 03:54:24 - INFO - __main__ - Step 47272: {'lr': 0.00039318199377511476, 'samples': 9076224, 'steps': 47271, 'loss/train': 1.505491852760315} 11/07/2021 03:54:25 - INFO - __main__ - Step 47273: {'lr': 0.00039317764356311803, 'samples': 9076416, 'steps': 47272, 'loss/train': 1.9129726886749268} 11/07/2021 03:54:25 - INFO - __main__ - Step 47274: {'lr': 0.00039317329328660754, 'samples': 9076608, 'steps': 47273, 'loss/train': 1.8173123598098755} 11/07/2021 03:54:26 - INFO - __main__ - Step 47275: {'lr': 0.0003931689429455855, 'samples': 9076800, 'steps': 47274, 'loss/train': 1.7296642065048218} 11/07/2021 03:54:27 - INFO - __main__ - Step 47276: {'lr': 0.00039316459254005364, 'samples': 9076992, 'steps': 47275, 'loss/train': 1.330780029296875} 11/07/2021 03:54:27 - INFO - __main__ - Step 47277: {'lr': 0.00039316024207001403, 'samples': 9077184, 'steps': 47276, 'loss/train': 1.7858941555023193} 11/07/2021 03:54:27 - INFO - __main__ - Step 47278: {'lr': 0.0003931558915354687, 'samples': 9077376, 'steps': 47277, 'loss/train': 1.0815973281860352} 11/07/2021 03:54:28 - INFO - __main__ - Step 47279: {'lr': 0.00039315154093641947, 'samples': 9077568, 'steps': 47278, 'loss/train': 1.7182049751281738} 11/07/2021 03:54:28 - INFO - __main__ - Step 47280: {'lr': 0.00039314719027286837, 'samples': 9077760, 'steps': 47279, 'loss/train': 1.3329108953475952} 11/07/2021 03:54:29 - INFO - __main__ - Step 47281: {'lr': 0.00039314283954481737, 'samples': 9077952, 'steps': 47280, 'loss/train': 1.2826067209243774} 11/07/2021 03:54:29 - INFO - __main__ - Step 47282: {'lr': 0.00039313848875226844, 'samples': 9078144, 'steps': 47281, 'loss/train': 1.6733689308166504} 11/07/2021 03:54:30 - INFO - __main__ - Step 47283: {'lr': 0.0003931341378952235, 'samples': 9078336, 'steps': 47282, 'loss/train': 1.5830260515213013} 11/07/2021 03:54:30 - INFO - __main__ - Step 47284: {'lr': 0.0003931297869736845, 'samples': 9078528, 'steps': 47283, 'loss/train': 1.3615508079528809} 11/07/2021 03:54:30 - INFO - __main__ - Step 47285: {'lr': 0.0003931254359876535, 'samples': 9078720, 'steps': 47284, 'loss/train': 1.4981908798217773} 11/07/2021 03:54:31 - INFO - __main__ - Step 47286: {'lr': 0.00039312108493713227, 'samples': 9078912, 'steps': 47285, 'loss/train': 1.6365206241607666} 11/07/2021 03:54:32 - INFO - __main__ - Step 47287: {'lr': 0.00039311673382212296, 'samples': 9079104, 'steps': 47286, 'loss/train': 1.4383234977722168} 11/07/2021 03:54:32 - INFO - __main__ - Step 47288: {'lr': 0.0003931123826426275, 'samples': 9079296, 'steps': 47287, 'loss/train': 1.3046178817749023} 11/07/2021 03:54:33 - INFO - __main__ - Step 47289: {'lr': 0.00039310803139864777, 'samples': 9079488, 'steps': 47288, 'loss/train': 1.2773518562316895} 11/07/2021 03:54:33 - INFO - __main__ - Step 47290: {'lr': 0.0003931036800901857, 'samples': 9079680, 'steps': 47289, 'loss/train': 1.4533926248550415} 11/07/2021 03:54:34 - INFO - __main__ - Step 47291: {'lr': 0.0003930993287172434, 'samples': 9079872, 'steps': 47290, 'loss/train': 1.3367587327957153} 11/07/2021 03:54:34 - INFO - __main__ - Step 47292: {'lr': 0.0003930949772798227, 'samples': 9080064, 'steps': 47291, 'loss/train': 1.9066798686981201} 11/07/2021 03:54:35 - INFO - __main__ - Step 47293: {'lr': 0.00039309062577792565, 'samples': 9080256, 'steps': 47292, 'loss/train': 1.4607013463974} 11/07/2021 03:54:35 - INFO - __main__ - Step 47294: {'lr': 0.0003930862742115542, 'samples': 9080448, 'steps': 47293, 'loss/train': 1.6229429244995117} 11/07/2021 03:54:35 - INFO - __main__ - Step 47295: {'lr': 0.0003930819225807102, 'samples': 9080640, 'steps': 47294, 'loss/train': 1.2199546098709106} 11/07/2021 03:54:37 - INFO - __main__ - Step 47296: {'lr': 0.00039307757088539574, 'samples': 9080832, 'steps': 47295, 'loss/train': 1.463995099067688} 11/07/2021 03:54:37 - INFO - __main__ - Step 47297: {'lr': 0.0003930732191256128, 'samples': 9081024, 'steps': 47296, 'loss/train': 1.3913321495056152} 11/07/2021 03:54:37 - INFO - __main__ - Step 47298: {'lr': 0.00039306886730136316, 'samples': 9081216, 'steps': 47297, 'loss/train': 1.3702958822250366} 11/07/2021 03:54:38 - INFO - __main__ - Step 47299: {'lr': 0.00039306451541264896, 'samples': 9081408, 'steps': 47298, 'loss/train': 1.3876159191131592} 11/07/2021 03:54:38 - INFO - __main__ - Step 47300: {'lr': 0.0003930601634594721, 'samples': 9081600, 'steps': 47299, 'loss/train': 1.3369739055633545} 11/07/2021 03:54:38 - INFO - __main__ - Step 47301: {'lr': 0.0003930558114418345, 'samples': 9081792, 'steps': 47300, 'loss/train': 1.3893908262252808} 11/07/2021 03:54:39 - INFO - __main__ - Step 47302: {'lr': 0.0003930514593597382, 'samples': 9081984, 'steps': 47301, 'loss/train': 1.543992519378662} 11/07/2021 03:54:40 - INFO - __main__ - Step 47303: {'lr': 0.00039304710721318505, 'samples': 9082176, 'steps': 47302, 'loss/train': 1.5732660293579102} 11/07/2021 03:54:40 - INFO - __main__ - Step 47304: {'lr': 0.0003930427550021771, 'samples': 9082368, 'steps': 47303, 'loss/train': 1.4729195833206177} 11/07/2021 03:54:40 - INFO - __main__ - Step 47305: {'lr': 0.00039303840272671636, 'samples': 9082560, 'steps': 47304, 'loss/train': 2.4504709243774414} 11/07/2021 03:54:41 - INFO - __main__ - Step 47306: {'lr': 0.00039303405038680465, 'samples': 9082752, 'steps': 47305, 'loss/train': 1.43690824508667} 11/07/2021 03:54:42 - INFO - __main__ - Step 47307: {'lr': 0.00039302969798244407, 'samples': 9082944, 'steps': 47306, 'loss/train': 1.5622464418411255} 11/07/2021 03:54:42 - INFO - __main__ - Step 47308: {'lr': 0.0003930253455136365, 'samples': 9083136, 'steps': 47307, 'loss/train': 1.2252328395843506} 11/07/2021 03:54:42 - INFO - __main__ - Step 47309: {'lr': 0.0003930209929803839, 'samples': 9083328, 'steps': 47308, 'loss/train': 1.747268557548523} 11/07/2021 03:54:43 - INFO - __main__ - Step 47310: {'lr': 0.0003930166403826883, 'samples': 9083520, 'steps': 47309, 'loss/train': 1.8754584789276123} 11/07/2021 03:54:43 - INFO - __main__ - Step 47311: {'lr': 0.00039301228772055147, 'samples': 9083712, 'steps': 47310, 'loss/train': 1.5060009956359863} 11/07/2021 03:54:44 - INFO - __main__ - Step 47312: {'lr': 0.0003930079349939756, 'samples': 9083904, 'steps': 47311, 'loss/train': 1.5311397314071655} 11/07/2021 03:54:44 - INFO - __main__ - Step 47313: {'lr': 0.00039300358220296255, 'samples': 9084096, 'steps': 47312, 'loss/train': 1.609533667564392} 11/07/2021 03:54:45 - INFO - __main__ - Step 47314: {'lr': 0.0003929992293475143, 'samples': 9084288, 'steps': 47313, 'loss/train': 1.5221704244613647} 11/07/2021 03:54:45 - INFO - __main__ - Step 47315: {'lr': 0.00039299487642763286, 'samples': 9084480, 'steps': 47314, 'loss/train': 0.9802134037017822} 11/07/2021 03:54:46 - INFO - __main__ - Step 47316: {'lr': 0.00039299052344332, 'samples': 9084672, 'steps': 47315, 'loss/train': 1.5102630853652954} 11/07/2021 03:54:47 - INFO - __main__ - Step 47317: {'lr': 0.00039298617039457796, 'samples': 9084864, 'steps': 47316, 'loss/train': 1.2505311965942383} 11/07/2021 03:54:47 - INFO - __main__ - Step 47318: {'lr': 0.0003929818172814085, 'samples': 9085056, 'steps': 47317, 'loss/train': 1.1996872425079346} 11/07/2021 03:54:47 - INFO - __main__ - Step 47319: {'lr': 0.00039297746410381357, 'samples': 9085248, 'steps': 47318, 'loss/train': 1.3600307703018188} 11/07/2021 03:54:48 - INFO - __main__ - Step 47320: {'lr': 0.00039297311086179535, 'samples': 9085440, 'steps': 47319, 'loss/train': 1.2544894218444824} 11/07/2021 03:54:48 - INFO - __main__ - Step 47321: {'lr': 0.00039296875755535557, 'samples': 9085632, 'steps': 47320, 'loss/train': 1.227154016494751} 11/07/2021 03:54:48 - INFO - __main__ - Step 47322: {'lr': 0.0003929644041844962, 'samples': 9085824, 'steps': 47321, 'loss/train': 1.1505377292633057} 11/07/2021 03:54:50 - INFO - __main__ - Step 47323: {'lr': 0.00039296005074921937, 'samples': 9086016, 'steps': 47322, 'loss/train': 1.7143731117248535} 11/07/2021 03:54:50 - INFO - __main__ - Step 47324: {'lr': 0.0003929556972495269, 'samples': 9086208, 'steps': 47323, 'loss/train': 1.8348923921585083} 11/07/2021 03:54:50 - INFO - __main__ - Step 47325: {'lr': 0.00039295134368542083, 'samples': 9086400, 'steps': 47324, 'loss/train': 1.1800689697265625} 11/07/2021 03:54:51 - INFO - __main__ - Step 47326: {'lr': 0.000392946990056903, 'samples': 9086592, 'steps': 47325, 'loss/train': 1.5850028991699219} 11/07/2021 03:54:51 - INFO - __main__ - Step 47327: {'lr': 0.00039294263636397564, 'samples': 9086784, 'steps': 47326, 'loss/train': 1.642444133758545} 11/07/2021 03:54:52 - INFO - __main__ - Step 47328: {'lr': 0.00039293828260664047, 'samples': 9086976, 'steps': 47327, 'loss/train': 2.3432319164276123} 11/07/2021 03:54:52 - INFO - __main__ - Step 47329: {'lr': 0.0003929339287848994, 'samples': 9087168, 'steps': 47328, 'loss/train': 1.3766820430755615} 11/07/2021 03:54:53 - INFO - __main__ - Step 47330: {'lr': 0.00039292957489875456, 'samples': 9087360, 'steps': 47329, 'loss/train': 1.3282623291015625} 11/07/2021 03:54:53 - INFO - __main__ - Step 47331: {'lr': 0.00039292522094820794, 'samples': 9087552, 'steps': 47330, 'loss/train': 2.0509135723114014} 11/07/2021 03:54:53 - INFO - __main__ - Step 47332: {'lr': 0.00039292086693326134, 'samples': 9087744, 'steps': 47331, 'loss/train': 1.2490166425704956} 11/07/2021 03:54:54 - INFO - __main__ - Step 47333: {'lr': 0.0003929165128539168, 'samples': 9087936, 'steps': 47332, 'loss/train': 1.6519125699996948} 11/07/2021 03:54:55 - INFO - __main__ - Step 47334: {'lr': 0.0003929121587101764, 'samples': 9088128, 'steps': 47333, 'loss/train': 1.5916945934295654} 11/07/2021 03:54:55 - INFO - __main__ - Step 47335: {'lr': 0.00039290780450204187, 'samples': 9088320, 'steps': 47334, 'loss/train': 1.6293517351150513} 11/07/2021 03:54:55 - INFO - __main__ - Step 47336: {'lr': 0.00039290345022951535, 'samples': 9088512, 'steps': 47335, 'loss/train': 2.4737918376922607} 11/07/2021 03:54:56 - INFO - __main__ - Step 47337: {'lr': 0.0003928990958925987, 'samples': 9088704, 'steps': 47336, 'loss/train': 1.8430202007293701} 11/07/2021 03:54:57 - INFO - __main__ - Step 47338: {'lr': 0.0003928947414912939, 'samples': 9088896, 'steps': 47337, 'loss/train': 1.7198305130004883} 11/07/2021 03:54:57 - INFO - __main__ - Step 47339: {'lr': 0.00039289038702560304, 'samples': 9089088, 'steps': 47338, 'loss/train': 1.2610658407211304} 11/07/2021 03:54:58 - INFO - __main__ - Step 47340: {'lr': 0.0003928860324955279, 'samples': 9089280, 'steps': 47339, 'loss/train': 1.1008061170578003} 11/07/2021 03:54:58 - INFO - __main__ - Step 47341: {'lr': 0.00039288167790107055, 'samples': 9089472, 'steps': 47340, 'loss/train': 1.7525180578231812} 11/07/2021 03:54:58 - INFO - __main__ - Step 47342: {'lr': 0.00039287732324223287, 'samples': 9089664, 'steps': 47341, 'loss/train': 1.2478035688400269} 11/07/2021 03:54:59 - INFO - __main__ - Step 47343: {'lr': 0.0003928729685190169, 'samples': 9089856, 'steps': 47342, 'loss/train': 1.0368107557296753} 11/07/2021 03:55:00 - INFO - __main__ - Step 47344: {'lr': 0.00039286861373142456, 'samples': 9090048, 'steps': 47343, 'loss/train': 1.490264892578125} 11/07/2021 03:55:00 - INFO - __main__ - Step 47345: {'lr': 0.0003928642588794579, 'samples': 9090240, 'steps': 47344, 'loss/train': 1.3091282844543457} 11/07/2021 03:55:00 - INFO - __main__ - Step 47346: {'lr': 0.0003928599039631187, 'samples': 9090432, 'steps': 47345, 'loss/train': 1.1811167001724243} 11/07/2021 03:55:01 - INFO - __main__ - Step 47347: {'lr': 0.00039285554898240907, 'samples': 9090624, 'steps': 47346, 'loss/train': 1.5830038785934448} 11/07/2021 03:55:01 - INFO - __main__ - Step 47348: {'lr': 0.0003928511939373309, 'samples': 9090816, 'steps': 47347, 'loss/train': 1.7371083498001099} 11/07/2021 03:55:02 - INFO - __main__ - Step 47349: {'lr': 0.0003928468388278863, 'samples': 9091008, 'steps': 47348, 'loss/train': 1.4600836038589478} 11/07/2021 03:55:03 - INFO - __main__ - Step 47350: {'lr': 0.00039284248365407704, 'samples': 9091200, 'steps': 47349, 'loss/train': 1.5137965679168701} 11/07/2021 03:55:03 - INFO - __main__ - Step 47351: {'lr': 0.00039283812841590514, 'samples': 9091392, 'steps': 47350, 'loss/train': 1.3401132822036743} 11/07/2021 03:55:03 - INFO - __main__ - Step 47352: {'lr': 0.0003928337731133727, 'samples': 9091584, 'steps': 47351, 'loss/train': 0.8940303921699524} 11/07/2021 03:55:04 - INFO - __main__ - Step 47353: {'lr': 0.0003928294177464814, 'samples': 9091776, 'steps': 47352, 'loss/train': 1.344061017036438} 11/07/2021 03:55:05 - INFO - __main__ - Step 47354: {'lr': 0.0003928250623152335, 'samples': 9091968, 'steps': 47353, 'loss/train': 1.3783444166183472} 11/07/2021 03:55:05 - INFO - __main__ - Step 47355: {'lr': 0.00039282070681963076, 'samples': 9092160, 'steps': 47354, 'loss/train': 0.9003660678863525} 11/07/2021 03:55:05 - INFO - __main__ - Step 47356: {'lr': 0.00039281635125967525, 'samples': 9092352, 'steps': 47355, 'loss/train': 1.6121208667755127} 11/07/2021 03:55:06 - INFO - __main__ - Step 47357: {'lr': 0.00039281199563536887, 'samples': 9092544, 'steps': 47356, 'loss/train': 1.4227135181427002} 11/07/2021 03:55:06 - INFO - __main__ - Step 47358: {'lr': 0.00039280763994671363, 'samples': 9092736, 'steps': 47357, 'loss/train': 1.783883810043335} 11/07/2021 03:55:07 - INFO - __main__ - Step 47359: {'lr': 0.0003928032841937115, 'samples': 9092928, 'steps': 47358, 'loss/train': 0.9126694202423096} 11/07/2021 03:55:07 - INFO - __main__ - Step 47360: {'lr': 0.0003927989283763643, 'samples': 9093120, 'steps': 47359, 'loss/train': 0.3783952295780182} 11/07/2021 03:55:08 - INFO - __main__ - Step 47361: {'lr': 0.0003927945724946742, 'samples': 9093312, 'steps': 47360, 'loss/train': 1.4427589178085327} 11/07/2021 03:55:08 - INFO - __main__ - Step 47362: {'lr': 0.00039279021654864307, 'samples': 9093504, 'steps': 47361, 'loss/train': 1.2977060079574585} 11/07/2021 03:55:08 - INFO - __main__ - Step 47363: {'lr': 0.0003927858605382728, 'samples': 9093696, 'steps': 47362, 'loss/train': 1.5949335098266602} 11/07/2021 03:55:09 - INFO - __main__ - Step 47364: {'lr': 0.0003927815044635655, 'samples': 9093888, 'steps': 47363, 'loss/train': 1.5859639644622803} 11/07/2021 03:55:10 - INFO - __main__ - Step 47365: {'lr': 0.00039277714832452304, 'samples': 9094080, 'steps': 47364, 'loss/train': 0.8822575807571411} 11/07/2021 03:55:10 - INFO - __main__ - Step 47366: {'lr': 0.0003927727921211474, 'samples': 9094272, 'steps': 47365, 'loss/train': 1.8432468175888062} 11/07/2021 03:55:11 - INFO - __main__ - Step 47367: {'lr': 0.00039276843585344046, 'samples': 9094464, 'steps': 47366, 'loss/train': 0.81095951795578} 11/07/2021 03:55:11 - INFO - __main__ - Step 47368: {'lr': 0.0003927640795214044, 'samples': 9094656, 'steps': 47367, 'loss/train': 1.6812350749969482} 11/07/2021 03:55:11 - INFO - __main__ - Step 47369: {'lr': 0.00039275972312504103, 'samples': 9094848, 'steps': 47368, 'loss/train': 1.4239321947097778} 11/07/2021 03:55:13 - INFO - __main__ - Step 47370: {'lr': 0.0003927553666643523, 'samples': 9095040, 'steps': 47369, 'loss/train': 1.0802626609802246} 11/07/2021 03:55:13 - INFO - __main__ - Step 47371: {'lr': 0.0003927510101393401, 'samples': 9095232, 'steps': 47370, 'loss/train': 1.0999367237091064} 11/07/2021 03:55:13 - INFO - __main__ - Step 47372: {'lr': 0.0003927466535500066, 'samples': 9095424, 'steps': 47371, 'loss/train': 1.2307920455932617} 11/07/2021 03:55:14 - INFO - __main__ - Step 47373: {'lr': 0.00039274229689635365, 'samples': 9095616, 'steps': 47372, 'loss/train': 1.0841302871704102} 11/07/2021 03:55:14 - INFO - __main__ - Step 47374: {'lr': 0.00039273794017838327, 'samples': 9095808, 'steps': 47373, 'loss/train': 1.7814555168151855} 11/07/2021 03:55:15 - INFO - __main__ - Step 47375: {'lr': 0.0003927335833960973, 'samples': 9096000, 'steps': 47374, 'loss/train': 1.539723515510559} 11/07/2021 03:55:15 - INFO - __main__ - Step 47376: {'lr': 0.00039272922654949783, 'samples': 9096192, 'steps': 47375, 'loss/train': 1.4435195922851562} 11/07/2021 03:55:16 - INFO - __main__ - Step 47377: {'lr': 0.0003927248696385868, 'samples': 9096384, 'steps': 47376, 'loss/train': 1.781643271446228} 11/07/2021 03:55:16 - INFO - __main__ - Step 47378: {'lr': 0.00039272051266336607, 'samples': 9096576, 'steps': 47377, 'loss/train': 1.3502711057662964} 11/07/2021 03:55:16 - INFO - __main__ - Step 47379: {'lr': 0.00039271615562383775, 'samples': 9096768, 'steps': 47378, 'loss/train': 0.17254693806171417} 11/07/2021 03:55:18 - INFO - __main__ - Step 47380: {'lr': 0.00039271179852000366, 'samples': 9096960, 'steps': 47379, 'loss/train': 1.5878491401672363} 11/07/2021 03:55:18 - INFO - __main__ - Step 47381: {'lr': 0.0003927074413518659, 'samples': 9097152, 'steps': 47380, 'loss/train': 1.3975498676300049} 11/07/2021 03:55:18 - INFO - __main__ - Step 47382: {'lr': 0.0003927030841194263, 'samples': 9097344, 'steps': 47381, 'loss/train': 1.100394606590271} 11/07/2021 03:55:19 - INFO - __main__ - Step 47383: {'lr': 0.00039269872682268697, 'samples': 9097536, 'steps': 47382, 'loss/train': 1.9096540212631226} 11/07/2021 03:55:19 - INFO - __main__ - Step 47384: {'lr': 0.00039269436946164977, 'samples': 9097728, 'steps': 47383, 'loss/train': 1.53835928440094} 11/07/2021 03:55:20 - INFO - __main__ - Step 47385: {'lr': 0.00039269001203631667, 'samples': 9097920, 'steps': 47384, 'loss/train': 1.1829365491867065} 11/07/2021 03:55:20 - INFO - __main__ - Step 47386: {'lr': 0.0003926856545466896, 'samples': 9098112, 'steps': 47385, 'loss/train': 1.4961042404174805} 11/07/2021 03:55:21 - INFO - __main__ - Step 47387: {'lr': 0.0003926812969927707, 'samples': 9098304, 'steps': 47386, 'loss/train': 1.2566967010498047} 11/07/2021 03:55:21 - INFO - __main__ - Step 47388: {'lr': 0.0003926769393745617, 'samples': 9098496, 'steps': 47387, 'loss/train': 1.3491863012313843} 11/07/2021 03:55:21 - INFO - __main__ - Step 47389: {'lr': 0.0003926725816920648, 'samples': 9098688, 'steps': 47388, 'loss/train': 1.4302196502685547} 11/07/2021 03:55:22 - INFO - __main__ - Step 47390: {'lr': 0.0003926682239452817, 'samples': 9098880, 'steps': 47389, 'loss/train': 1.1231874227523804} 11/07/2021 03:55:23 - INFO - __main__ - Step 47391: {'lr': 0.00039266386613421455, 'samples': 9099072, 'steps': 47390, 'loss/train': 1.0468133687973022} 11/07/2021 03:55:23 - INFO - __main__ - Step 47392: {'lr': 0.00039265950825886523, 'samples': 9099264, 'steps': 47391, 'loss/train': 1.6112629175186157} 11/07/2021 03:55:23 - INFO - __main__ - Step 47393: {'lr': 0.00039265515031923585, 'samples': 9099456, 'steps': 47392, 'loss/train': 1.6024153232574463} 11/07/2021 03:55:24 - INFO - __main__ - Step 47394: {'lr': 0.0003926507923153282, 'samples': 9099648, 'steps': 47393, 'loss/train': 1.5982778072357178} 11/07/2021 03:55:24 - INFO - __main__ - Step 47395: {'lr': 0.0003926464342471443, 'samples': 9099840, 'steps': 47394, 'loss/train': 1.7416530847549438} 11/07/2021 03:55:25 - INFO - __main__ - Step 47396: {'lr': 0.00039264207611468607, 'samples': 9100032, 'steps': 47395, 'loss/train': 1.781440019607544} 11/07/2021 03:55:26 - INFO - __main__ - Step 47397: {'lr': 0.00039263771791795554, 'samples': 9100224, 'steps': 47396, 'loss/train': 1.5347706079483032} 11/07/2021 03:55:26 - INFO - __main__ - Step 47398: {'lr': 0.0003926333596569547, 'samples': 9100416, 'steps': 47397, 'loss/train': 1.39937162399292} 11/07/2021 03:55:26 - INFO - __main__ - Step 47399: {'lr': 0.00039262900133168544, 'samples': 9100608, 'steps': 47398, 'loss/train': 1.6870687007904053} 11/07/2021 03:55:27 - INFO - __main__ - Step 47400: {'lr': 0.0003926246429421497, 'samples': 9100800, 'steps': 47399, 'loss/train': 1.5279653072357178} 11/07/2021 03:55:28 - INFO - __main__ - Step 47401: {'lr': 0.00039262028448834964, 'samples': 9100992, 'steps': 47400, 'loss/train': 1.6076257228851318} 11/07/2021 03:55:28 - INFO - __main__ - Step 47402: {'lr': 0.00039261592597028696, 'samples': 9101184, 'steps': 47401, 'loss/train': 1.4856348037719727} 11/07/2021 03:55:28 - INFO - __main__ - Step 47403: {'lr': 0.0003926115673879638, 'samples': 9101376, 'steps': 47402, 'loss/train': 1.4722977876663208} 11/07/2021 03:55:29 - INFO - __main__ - Step 47404: {'lr': 0.000392607208741382, 'samples': 9101568, 'steps': 47403, 'loss/train': 1.549590826034546} 11/07/2021 03:55:29 - INFO - __main__ - Step 47405: {'lr': 0.00039260285003054365, 'samples': 9101760, 'steps': 47404, 'loss/train': 1.8400324583053589} 11/07/2021 03:55:30 - INFO - __main__ - Step 47406: {'lr': 0.0003925984912554507, 'samples': 9101952, 'steps': 47405, 'loss/train': 1.9346383810043335} 11/07/2021 03:55:31 - INFO - __main__ - Step 47407: {'lr': 0.00039259413241610495, 'samples': 9102144, 'steps': 47406, 'loss/train': 1.3195863962173462} 11/07/2021 03:55:31 - INFO - __main__ - Step 47408: {'lr': 0.0003925897735125086, 'samples': 9102336, 'steps': 47407, 'loss/train': 5.765100479125977} 11/07/2021 03:55:31 - INFO - __main__ - Step 47409: {'lr': 0.00039258541454466344, 'samples': 9102528, 'steps': 47408, 'loss/train': 1.1543376445770264} 11/07/2021 03:55:32 - INFO - __main__ - Step 47410: {'lr': 0.0003925810555125715, 'samples': 9102720, 'steps': 47409, 'loss/train': 1.2631092071533203} 11/07/2021 03:55:33 - INFO - __main__ - Step 47411: {'lr': 0.00039257669641623474, 'samples': 9102912, 'steps': 47410, 'loss/train': 1.3123632669448853} 11/07/2021 03:55:33 - INFO - __main__ - Step 47412: {'lr': 0.0003925723372556551, 'samples': 9103104, 'steps': 47411, 'loss/train': 2.1971917152404785} 11/07/2021 03:55:34 - INFO - __main__ - Step 47413: {'lr': 0.00039256797803083457, 'samples': 9103296, 'steps': 47412, 'loss/train': 0.8656166791915894} 11/07/2021 03:55:34 - INFO - __main__ - Step 47414: {'lr': 0.00039256361874177517, 'samples': 9103488, 'steps': 47413, 'loss/train': 1.8716926574707031} 11/07/2021 03:55:34 - INFO - __main__ - Step 47415: {'lr': 0.0003925592593884787, 'samples': 9103680, 'steps': 47414, 'loss/train': 1.5350236892700195} 11/07/2021 03:55:35 - INFO - __main__ - Step 47416: {'lr': 0.0003925548999709473, 'samples': 9103872, 'steps': 47415, 'loss/train': 1.506656527519226} 11/07/2021 03:55:36 - INFO - __main__ - Step 47417: {'lr': 0.00039255054048918284, 'samples': 9104064, 'steps': 47416, 'loss/train': 0.6237403154373169} 11/07/2021 03:55:36 - INFO - __main__ - Step 47418: {'lr': 0.00039254618094318726, 'samples': 9104256, 'steps': 47417, 'loss/train': 1.6589988470077515} 11/07/2021 03:55:36 - INFO - __main__ - Step 47419: {'lr': 0.0003925418213329627, 'samples': 9104448, 'steps': 47418, 'loss/train': 1.7542097568511963} 11/07/2021 03:55:37 - INFO - __main__ - Step 47420: {'lr': 0.0003925374616585109, 'samples': 9104640, 'steps': 47419, 'loss/train': 1.64899480342865} 11/07/2021 03:55:37 - INFO - __main__ - Step 47421: {'lr': 0.00039253310191983393, 'samples': 9104832, 'steps': 47420, 'loss/train': 1.9085698127746582} 11/07/2021 03:55:38 - INFO - __main__ - Step 47422: {'lr': 0.0003925287421169337, 'samples': 9105024, 'steps': 47421, 'loss/train': 1.615214228630066} 11/07/2021 03:55:38 - INFO - __main__ - Step 47423: {'lr': 0.00039252438224981237, 'samples': 9105216, 'steps': 47422, 'loss/train': 1.4789481163024902} 11/07/2021 03:55:39 - INFO - __main__ - Step 47424: {'lr': 0.0003925200223184716, 'samples': 9105408, 'steps': 47423, 'loss/train': 1.1823432445526123} 11/07/2021 03:55:39 - INFO - __main__ - Step 47425: {'lr': 0.0003925156623229136, 'samples': 9105600, 'steps': 47424, 'loss/train': 1.7379939556121826} 11/07/2021 03:55:40 - INFO - __main__ - Step 47426: {'lr': 0.00039251130226314015, 'samples': 9105792, 'steps': 47425, 'loss/train': 1.4799647331237793} 11/07/2021 03:55:41 - INFO - __main__ - Step 47427: {'lr': 0.00039250694213915335, 'samples': 9105984, 'steps': 47426, 'loss/train': 1.6108179092407227} 11/07/2021 03:55:41 - INFO - __main__ - Step 47428: {'lr': 0.0003925025819509551, 'samples': 9106176, 'steps': 47427, 'loss/train': 1.475036382675171} 11/07/2021 03:55:41 - INFO - __main__ - Step 47429: {'lr': 0.00039249822169854745, 'samples': 9106368, 'steps': 47428, 'loss/train': 0.7688894271850586} 11/07/2021 03:55:42 - INFO - __main__ - Step 47430: {'lr': 0.0003924938613819322, 'samples': 9106560, 'steps': 47429, 'loss/train': 1.7876664400100708} 11/07/2021 03:55:42 - INFO - __main__ - Step 47431: {'lr': 0.0003924895010011115, 'samples': 9106752, 'steps': 47430, 'loss/train': 0.8254964351654053} 11/07/2021 03:55:43 - INFO - __main__ - Step 47432: {'lr': 0.0003924851405560872, 'samples': 9106944, 'steps': 47431, 'loss/train': 1.0238627195358276} 11/07/2021 03:55:43 - INFO - __main__ - Step 47433: {'lr': 0.00039248078004686126, 'samples': 9107136, 'steps': 47432, 'loss/train': 1.3875761032104492} 11/07/2021 03:55:44 - INFO - __main__ - Step 47434: {'lr': 0.00039247641947343575, 'samples': 9107328, 'steps': 47433, 'loss/train': 1.2660794258117676} 11/07/2021 03:55:44 - INFO - __main__ - Step 47435: {'lr': 0.0003924720588358126, 'samples': 9107520, 'steps': 47434, 'loss/train': 1.3477479219436646} 11/07/2021 03:55:45 - INFO - __main__ - Step 47436: {'lr': 0.0003924676981339936, 'samples': 9107712, 'steps': 47435, 'loss/train': 1.7307621240615845} 11/07/2021 03:55:45 - INFO - __main__ - Step 47437: {'lr': 0.00039246333736798095, 'samples': 9107904, 'steps': 47436, 'loss/train': 1.5367916822433472} 11/07/2021 03:55:46 - INFO - __main__ - Step 47438: {'lr': 0.0003924589765377765, 'samples': 9108096, 'steps': 47437, 'loss/train': 1.5305283069610596} 11/07/2021 03:55:46 - INFO - __main__ - Step 47439: {'lr': 0.00039245461564338223, 'samples': 9108288, 'steps': 47438, 'loss/train': 1.7045104503631592} 11/07/2021 03:55:47 - INFO - __main__ - Step 47440: {'lr': 0.00039245025468480013, 'samples': 9108480, 'steps': 47439, 'loss/train': 1.171530842781067} 11/07/2021 03:55:47 - INFO - __main__ - Step 47441: {'lr': 0.00039244589366203207, 'samples': 9108672, 'steps': 47440, 'loss/train': 0.5102721452713013} 11/07/2021 03:55:47 - INFO - __main__ - Step 47442: {'lr': 0.0003924415325750802, 'samples': 9108864, 'steps': 47441, 'loss/train': 1.3438969850540161} 11/07/2021 03:55:48 - INFO - __main__ - Step 47443: {'lr': 0.0003924371714239463, 'samples': 9109056, 'steps': 47442, 'loss/train': 1.5555357933044434} 11/07/2021 03:55:49 - INFO - __main__ - Step 47444: {'lr': 0.0003924328102086324, 'samples': 9109248, 'steps': 47443, 'loss/train': 1.4037777185440063} 11/07/2021 03:55:49 - INFO - __main__ - Step 47445: {'lr': 0.0003924284489291405, 'samples': 9109440, 'steps': 47444, 'loss/train': 1.8709050416946411} 11/07/2021 03:55:49 - INFO - __main__ - Step 47446: {'lr': 0.00039242408758547256, 'samples': 9109632, 'steps': 47445, 'loss/train': 1.5296556949615479} 11/07/2021 03:55:50 - INFO - __main__ - Step 47447: {'lr': 0.0003924197261776304, 'samples': 9109824, 'steps': 47446, 'loss/train': 1.7294964790344238} 11/07/2021 03:55:50 - INFO - __main__ - Step 47448: {'lr': 0.0003924153647056163, 'samples': 9110016, 'steps': 47447, 'loss/train': 1.8322336673736572} 11/07/2021 03:55:51 - INFO - __main__ - Step 47449: {'lr': 0.0003924110031694319, 'samples': 9110208, 'steps': 47448, 'loss/train': 1.8327605724334717} 11/07/2021 03:55:51 - INFO - __main__ - Step 47450: {'lr': 0.00039240664156907937, 'samples': 9110400, 'steps': 47449, 'loss/train': 1.2270948886871338} 11/07/2021 03:55:52 - INFO - __main__ - Step 47451: {'lr': 0.00039240227990456055, 'samples': 9110592, 'steps': 47450, 'loss/train': 1.0061084032058716} 11/07/2021 03:55:52 - INFO - __main__ - Step 47452: {'lr': 0.00039239791817587746, 'samples': 9110784, 'steps': 47451, 'loss/train': 1.542582631111145} 11/07/2021 03:55:52 - INFO - __main__ - Step 47453: {'lr': 0.0003923935563830321, 'samples': 9110976, 'steps': 47452, 'loss/train': 1.3199650049209595} 11/07/2021 03:55:53 - INFO - __main__ - Step 47454: {'lr': 0.0003923891945260264, 'samples': 9111168, 'steps': 47453, 'loss/train': 1.102708101272583} 11/07/2021 03:55:54 - INFO - __main__ - Step 47455: {'lr': 0.00039238483260486235, 'samples': 9111360, 'steps': 47454, 'loss/train': 0.989068865776062} 11/07/2021 03:55:54 - INFO - __main__ - Step 47456: {'lr': 0.0003923804706195418, 'samples': 9111552, 'steps': 47455, 'loss/train': 1.8135876655578613} 11/07/2021 03:55:54 - INFO - __main__ - Step 47457: {'lr': 0.0003923761085700669, 'samples': 9111744, 'steps': 47456, 'loss/train': 1.2370156049728394} 11/07/2021 03:55:55 - INFO - __main__ - Step 47458: {'lr': 0.0003923717464564395, 'samples': 9111936, 'steps': 47457, 'loss/train': 0.9308407306671143} 11/07/2021 03:55:56 - INFO - __main__ - Step 47459: {'lr': 0.00039236738427866154, 'samples': 9112128, 'steps': 47458, 'loss/train': 0.8917006850242615} 11/07/2021 03:55:56 - INFO - __main__ - Step 47460: {'lr': 0.000392363022036735, 'samples': 9112320, 'steps': 47459, 'loss/train': 1.387584924697876} 11/07/2021 03:55:57 - INFO - __main__ - Step 47461: {'lr': 0.00039235865973066196, 'samples': 9112512, 'steps': 47460, 'loss/train': 1.2784820795059204} 11/07/2021 03:55:57 - INFO - __main__ - Step 47462: {'lr': 0.00039235429736044435, 'samples': 9112704, 'steps': 47461, 'loss/train': 1.7527347803115845} 11/07/2021 03:55:58 - INFO - __main__ - Step 47463: {'lr': 0.00039234993492608404, 'samples': 9112896, 'steps': 47462, 'loss/train': 1.625871181488037} 11/07/2021 03:55:58 - INFO - __main__ - Step 47464: {'lr': 0.0003923455724275831, 'samples': 9113088, 'steps': 47463, 'loss/train': 1.7436282634735107} 11/07/2021 03:55:59 - INFO - __main__ - Step 47465: {'lr': 0.0003923412098649433, 'samples': 9113280, 'steps': 47464, 'loss/train': 1.4446229934692383} 11/07/2021 03:55:59 - INFO - __main__ - Step 47466: {'lr': 0.0003923368472381668, 'samples': 9113472, 'steps': 47465, 'loss/train': 1.7699695825576782} 11/07/2021 03:56:00 - INFO - __main__ - Step 47467: {'lr': 0.0003923324845472556, 'samples': 9113664, 'steps': 47466, 'loss/train': 1.5300990343093872} 11/07/2021 03:56:00 - INFO - __main__ - Step 47468: {'lr': 0.0003923281217922115, 'samples': 9113856, 'steps': 47467, 'loss/train': 1.4730432033538818} 11/07/2021 03:56:01 - INFO - __main__ - Step 47469: {'lr': 0.0003923237589730366, 'samples': 9114048, 'steps': 47468, 'loss/train': 1.4974443912506104} 11/07/2021 03:56:01 - INFO - __main__ - Step 47470: {'lr': 0.00039231939608973276, 'samples': 9114240, 'steps': 47469, 'loss/train': 1.9974877834320068} 11/07/2021 03:56:02 - INFO - __main__ - Step 47471: {'lr': 0.000392315033142302, 'samples': 9114432, 'steps': 47470, 'loss/train': 1.3896907567977905} 11/07/2021 03:56:02 - INFO - __main__ - Step 47472: {'lr': 0.0003923106701307463, 'samples': 9114624, 'steps': 47471, 'loss/train': 1.5565924644470215} 11/07/2021 03:56:02 - INFO - __main__ - Step 47473: {'lr': 0.0003923063070550676, 'samples': 9114816, 'steps': 47472, 'loss/train': 1.4438856840133667} 11/07/2021 03:56:03 - INFO - __main__ - Step 47474: {'lr': 0.00039230194391526784, 'samples': 9115008, 'steps': 47473, 'loss/train': 1.4795085191726685} 11/07/2021 03:56:04 - INFO - __main__ - Step 47475: {'lr': 0.00039229758071134907, 'samples': 9115200, 'steps': 47474, 'loss/train': 2.37557053565979} 11/07/2021 03:56:04 - INFO - __main__ - Step 47476: {'lr': 0.0003922932174433132, 'samples': 9115392, 'steps': 47475, 'loss/train': 1.6281499862670898} 11/07/2021 03:56:04 - INFO - __main__ - Step 47477: {'lr': 0.0003922888541111622, 'samples': 9115584, 'steps': 47476, 'loss/train': 1.0745799541473389} 11/07/2021 03:56:05 - INFO - __main__ - Step 47478: {'lr': 0.00039228449071489804, 'samples': 9115776, 'steps': 47477, 'loss/train': 1.604480266571045} 11/07/2021 03:56:06 - INFO - __main__ - Step 47479: {'lr': 0.0003922801272545227, 'samples': 9115968, 'steps': 47478, 'loss/train': 1.740954041481018} 11/07/2021 03:56:06 - INFO - __main__ - Step 47480: {'lr': 0.000392275763730038, 'samples': 9116160, 'steps': 47479, 'loss/train': 1.3065632581710815} 11/07/2021 03:56:07 - INFO - __main__ - Step 47481: {'lr': 0.00039227140014144615, 'samples': 9116352, 'steps': 47480, 'loss/train': 1.776518702507019} 11/07/2021 03:56:07 - INFO - __main__ - Step 47482: {'lr': 0.00039226703648874905, 'samples': 9116544, 'steps': 47481, 'loss/train': 1.07040274143219} 11/07/2021 03:56:07 - INFO - __main__ - Step 47483: {'lr': 0.00039226267277194855, 'samples': 9116736, 'steps': 47482, 'loss/train': 1.2167857885360718} 11/07/2021 03:56:08 - INFO - __main__ - Step 47484: {'lr': 0.0003922583089910467, 'samples': 9116928, 'steps': 47483, 'loss/train': 1.297946810722351} 11/07/2021 03:56:09 - INFO - __main__ - Step 47485: {'lr': 0.0003922539451460454, 'samples': 9117120, 'steps': 47484, 'loss/train': 1.4696154594421387} 11/07/2021 03:56:09 - INFO - __main__ - Step 47486: {'lr': 0.00039224958123694676, 'samples': 9117312, 'steps': 47485, 'loss/train': 1.5161038637161255} 11/07/2021 03:56:09 - INFO - __main__ - Step 47487: {'lr': 0.0003922452172637526, 'samples': 9117504, 'steps': 47486, 'loss/train': 0.7221962809562683} 11/07/2021 03:56:10 - INFO - __main__ - Step 47488: {'lr': 0.000392240853226465, 'samples': 9117696, 'steps': 47487, 'loss/train': 1.6920087337493896} 11/07/2021 03:56:11 - INFO - __main__ - Step 47489: {'lr': 0.0003922364891250858, 'samples': 9117888, 'steps': 47488, 'loss/train': 1.1007860898971558} 11/07/2021 03:56:11 - INFO - __main__ - Step 47490: {'lr': 0.00039223212495961704, 'samples': 9118080, 'steps': 47489, 'loss/train': 1.5745450258255005} 11/07/2021 03:56:11 - INFO - __main__ - Step 47491: {'lr': 0.0003922277607300607, 'samples': 9118272, 'steps': 47490, 'loss/train': 0.9680019021034241} 11/07/2021 03:56:12 - INFO - __main__ - Step 47492: {'lr': 0.0003922233964364187, 'samples': 9118464, 'steps': 47491, 'loss/train': 1.7139664888381958} 11/07/2021 03:56:12 - INFO - __main__ - Step 47493: {'lr': 0.000392219032078693, 'samples': 9118656, 'steps': 47492, 'loss/train': 1.9633903503417969} 11/07/2021 03:56:13 - INFO - __main__ - Step 47494: {'lr': 0.0003922146676568856, 'samples': 9118848, 'steps': 47493, 'loss/train': 1.7013767957687378} 11/07/2021 03:56:13 - INFO - __main__ - Step 47495: {'lr': 0.0003922103031709986, 'samples': 9119040, 'steps': 47494, 'loss/train': 1.2616795301437378} 11/07/2021 03:56:14 - INFO - __main__ - Step 47496: {'lr': 0.0003922059386210337, 'samples': 9119232, 'steps': 47495, 'loss/train': 1.4971474409103394} 11/07/2021 03:56:14 - INFO - __main__ - Step 47497: {'lr': 0.0003922015740069931, 'samples': 9119424, 'steps': 47496, 'loss/train': 0.1893942803144455} 11/07/2021 03:56:15 - INFO - __main__ - Step 47498: {'lr': 0.0003921972093288786, 'samples': 9119616, 'steps': 47497, 'loss/train': 1.8189650774002075} 11/07/2021 03:56:15 - INFO - __main__ - Step 47499: {'lr': 0.00039219284458669217, 'samples': 9119808, 'steps': 47498, 'loss/train': 1.4831254482269287} 11/07/2021 03:56:16 - INFO - __main__ - Step 47500: {'lr': 0.00039218847978043594, 'samples': 9120000, 'steps': 47499, 'loss/train': 1.6738368272781372} 11/07/2021 03:56:16 - INFO - __main__ - Step 47501: {'lr': 0.00039218411491011176, 'samples': 9120192, 'steps': 47500, 'loss/train': 1.4098420143127441} 11/07/2021 03:56:17 - INFO - __main__ - Step 47502: {'lr': 0.0003921797499757216, 'samples': 9120384, 'steps': 47501, 'loss/train': 1.5153719186782837} 11/07/2021 03:56:17 - INFO - __main__ - Step 47503: {'lr': 0.0003921753849772674, 'samples': 9120576, 'steps': 47502, 'loss/train': 1.432418942451477} 11/07/2021 03:56:17 - INFO - __main__ - Step 47504: {'lr': 0.0003921710199147512, 'samples': 9120768, 'steps': 47503, 'loss/train': 1.4112839698791504} 11/07/2021 03:56:18 - INFO - __main__ - Step 47505: {'lr': 0.0003921666547881749, 'samples': 9120960, 'steps': 47504, 'loss/train': 1.5197889804840088} 11/07/2021 03:56:19 - INFO - __main__ - Step 47506: {'lr': 0.00039216228959754055, 'samples': 9121152, 'steps': 47505, 'loss/train': 1.8048980236053467} 11/07/2021 03:56:19 - INFO - __main__ - Step 47507: {'lr': 0.00039215792434285, 'samples': 9121344, 'steps': 47506, 'loss/train': 1.5157513618469238} 11/07/2021 03:56:19 - INFO - __main__ - Step 47508: {'lr': 0.00039215355902410534, 'samples': 9121536, 'steps': 47507, 'loss/train': 1.1093748807907104} 11/07/2021 03:56:20 - INFO - __main__ - Step 47509: {'lr': 0.0003921491936413085, 'samples': 9121728, 'steps': 47508, 'loss/train': 1.6741187572479248} 11/07/2021 03:56:21 - INFO - __main__ - Step 47510: {'lr': 0.0003921448281944614, 'samples': 9121920, 'steps': 47509, 'loss/train': 1.2055332660675049} 11/07/2021 03:56:21 - INFO - __main__ - Step 47511: {'lr': 0.000392140462683566, 'samples': 9122112, 'steps': 47510, 'loss/train': 1.3611558675765991} 11/07/2021 03:56:22 - INFO - __main__ - Step 47512: {'lr': 0.0003921360971086243, 'samples': 9122304, 'steps': 47511, 'loss/train': 1.3768153190612793} 11/07/2021 03:56:22 - INFO - __main__ - Step 47513: {'lr': 0.0003921317314696383, 'samples': 9122496, 'steps': 47512, 'loss/train': 1.7975044250488281} 11/07/2021 03:56:22 - INFO - __main__ - Step 47514: {'lr': 0.0003921273657666099, 'samples': 9122688, 'steps': 47513, 'loss/train': 1.3423588275909424} 11/07/2021 03:56:23 - INFO - __main__ - Step 47515: {'lr': 0.0003921229999995412, 'samples': 9122880, 'steps': 47514, 'loss/train': 1.1245650053024292} 11/07/2021 03:56:24 - INFO - __main__ - Step 47516: {'lr': 0.000392118634168434, 'samples': 9123072, 'steps': 47515, 'loss/train': 0.41801056265830994} 11/07/2021 03:56:24 - INFO - __main__ - Step 47517: {'lr': 0.00039211426827329035, 'samples': 9123264, 'steps': 47516, 'loss/train': 0.7599082589149475} 11/07/2021 03:56:24 - INFO - __main__ - Step 47518: {'lr': 0.0003921099023141121, 'samples': 9123456, 'steps': 47517, 'loss/train': 1.2782249450683594} 11/07/2021 03:56:25 - INFO - __main__ - Step 47519: {'lr': 0.0003921055362909015, 'samples': 9123648, 'steps': 47518, 'loss/train': 1.3840163946151733} 11/07/2021 03:56:25 - INFO - __main__ - Step 47520: {'lr': 0.0003921011702036602, 'samples': 9123840, 'steps': 47519, 'loss/train': 0.9894071817398071} 11/07/2021 03:56:26 - INFO - __main__ - Step 47521: {'lr': 0.00039209680405239035, 'samples': 9124032, 'steps': 47520, 'loss/train': 1.6704676151275635} 11/07/2021 03:56:26 - INFO - __main__ - Step 47522: {'lr': 0.0003920924378370939, 'samples': 9124224, 'steps': 47521, 'loss/train': 1.1027514934539795} 11/07/2021 03:56:27 - INFO - __main__ - Step 47523: {'lr': 0.0003920880715577728, 'samples': 9124416, 'steps': 47522, 'loss/train': 1.4846141338348389} 11/07/2021 03:56:27 - INFO - __main__ - Step 47524: {'lr': 0.00039208370521442895, 'samples': 9124608, 'steps': 47523, 'loss/train': 1.6287258863449097} 11/07/2021 03:56:28 - INFO - __main__ - Step 47525: {'lr': 0.0003920793388070644, 'samples': 9124800, 'steps': 47524, 'loss/train': 0.7328007221221924} 11/07/2021 03:56:29 - INFO - __main__ - Step 47526: {'lr': 0.0003920749723356811, 'samples': 9124992, 'steps': 47525, 'loss/train': 1.7460342645645142} 11/07/2021 03:56:29 - INFO - __main__ - Step 47527: {'lr': 0.000392070605800281, 'samples': 9125184, 'steps': 47526, 'loss/train': 1.5260841846466064} 11/07/2021 03:56:29 - INFO - __main__ - Step 47528: {'lr': 0.00039206623920086603, 'samples': 9125376, 'steps': 47527, 'loss/train': 1.522653579711914} 11/07/2021 03:56:30 - INFO - __main__ - Step 47529: {'lr': 0.0003920618725374383, 'samples': 9125568, 'steps': 47528, 'loss/train': 1.4565163850784302} 11/07/2021 03:56:30 - INFO - __main__ - Step 47530: {'lr': 0.00039205750580999964, 'samples': 9125760, 'steps': 47529, 'loss/train': 1.4106204509735107} 11/07/2021 03:56:31 - INFO - __main__ - Step 47531: {'lr': 0.0003920531390185521, 'samples': 9125952, 'steps': 47530, 'loss/train': 1.9258077144622803} 11/07/2021 03:56:31 - INFO - __main__ - Step 47532: {'lr': 0.00039204877216309755, 'samples': 9126144, 'steps': 47531, 'loss/train': 1.7408628463745117} 11/07/2021 03:56:32 - INFO - __main__ - Step 47533: {'lr': 0.00039204440524363805, 'samples': 9126336, 'steps': 47532, 'loss/train': 1.299311637878418} 11/07/2021 03:56:32 - INFO - __main__ - Step 47534: {'lr': 0.0003920400382601755, 'samples': 9126528, 'steps': 47533, 'loss/train': 1.6594051122665405} 11/07/2021 03:56:32 - INFO - __main__ - Step 47535: {'lr': 0.00039203567121271187, 'samples': 9126720, 'steps': 47534, 'loss/train': 1.7957714796066284} 11/07/2021 03:56:34 - INFO - __main__ - Step 47536: {'lr': 0.00039203130410124927, 'samples': 9126912, 'steps': 47535, 'loss/train': 2.0496859550476074} 11/07/2021 03:56:34 - INFO - __main__ - Step 47537: {'lr': 0.0003920269369257895, 'samples': 9127104, 'steps': 47536, 'loss/train': 1.5177720785140991} 11/07/2021 03:56:34 - INFO - __main__ - Step 47538: {'lr': 0.0003920225696863345, 'samples': 9127296, 'steps': 47537, 'loss/train': 1.370882272720337} 11/07/2021 03:56:35 - INFO - __main__ - Step 47539: {'lr': 0.00039201820238288644, 'samples': 9127488, 'steps': 47538, 'loss/train': 1.5021008253097534} 11/07/2021 03:56:35 - INFO - __main__ - Step 47540: {'lr': 0.00039201383501544706, 'samples': 9127680, 'steps': 47539, 'loss/train': 1.5323901176452637} 11/07/2021 03:56:36 - INFO - __main__ - Step 47541: {'lr': 0.00039200946758401856, 'samples': 9127872, 'steps': 47540, 'loss/train': 1.6588393449783325} 11/07/2021 03:56:36 - INFO - __main__ - Step 47542: {'lr': 0.00039200510008860273, 'samples': 9128064, 'steps': 47541, 'loss/train': 1.0969041585922241} 11/07/2021 03:56:37 - INFO - __main__ - Step 47543: {'lr': 0.0003920007325292016, 'samples': 9128256, 'steps': 47542, 'loss/train': 1.2662030458450317} 11/07/2021 03:56:37 - INFO - __main__ - Step 47544: {'lr': 0.00039199636490581713, 'samples': 9128448, 'steps': 47543, 'loss/train': 1.4820476770401} 11/07/2021 03:56:37 - INFO - __main__ - Step 47545: {'lr': 0.00039199199721845127, 'samples': 9128640, 'steps': 47544, 'loss/train': 0.8880251049995422} 11/07/2021 03:56:38 - INFO - __main__ - Step 47546: {'lr': 0.000391987629467106, 'samples': 9128832, 'steps': 47545, 'loss/train': 1.5734833478927612} 11/07/2021 03:56:39 - INFO - __main__ - Step 47547: {'lr': 0.00039198326165178335, 'samples': 9129024, 'steps': 47546, 'loss/train': 1.2641962766647339} 11/07/2021 03:56:39 - INFO - __main__ - Step 47548: {'lr': 0.0003919788937724852, 'samples': 9129216, 'steps': 47547, 'loss/train': 1.4985029697418213} 11/07/2021 03:56:39 - INFO - __main__ - Step 47549: {'lr': 0.0003919745258292135, 'samples': 9129408, 'steps': 47548, 'loss/train': 1.4992035627365112} 11/07/2021 03:56:40 - INFO - __main__ - Step 47550: {'lr': 0.00039197015782197034, 'samples': 9129600, 'steps': 47549, 'loss/train': 1.4774483442306519} 11/07/2021 03:56:40 - INFO - __main__ - Step 47551: {'lr': 0.0003919657897507576, 'samples': 9129792, 'steps': 47550, 'loss/train': 1.6427239179611206} 11/07/2021 03:56:41 - INFO - __main__ - Step 47552: {'lr': 0.0003919614216155772, 'samples': 9129984, 'steps': 47551, 'loss/train': 1.5218361616134644} 11/07/2021 03:56:42 - INFO - __main__ - Step 47553: {'lr': 0.0003919570534164313, 'samples': 9130176, 'steps': 47552, 'loss/train': 1.2969653606414795} 11/07/2021 03:56:42 - INFO - __main__ - Step 47554: {'lr': 0.0003919526851533216, 'samples': 9130368, 'steps': 47553, 'loss/train': 1.785011887550354} 11/07/2021 03:56:42 - INFO - __main__ - Step 47555: {'lr': 0.00039194831682625033, 'samples': 9130560, 'steps': 47554, 'loss/train': 1.4270840883255005} 11/07/2021 03:56:43 - INFO - __main__ - Step 47556: {'lr': 0.0003919439484352193, 'samples': 9130752, 'steps': 47555, 'loss/train': 0.7276812791824341} 11/07/2021 03:56:44 - INFO - __main__ - Step 47557: {'lr': 0.00039193957998023057, 'samples': 9130944, 'steps': 47556, 'loss/train': 1.7058446407318115} 11/07/2021 03:56:44 - INFO - __main__ - Step 47558: {'lr': 0.000391935211461286, 'samples': 9131136, 'steps': 47557, 'loss/train': 1.4903485774993896} 11/07/2021 03:56:44 - INFO - __main__ - Step 47559: {'lr': 0.00039193084287838755, 'samples': 9131328, 'steps': 47558, 'loss/train': 1.2844045162200928} 11/07/2021 03:56:45 - INFO - __main__ - Step 47560: {'lr': 0.0003919264742315373, 'samples': 9131520, 'steps': 47559, 'loss/train': 1.8127259016036987} 11/07/2021 03:56:45 - INFO - __main__ - Step 47561: {'lr': 0.00039192210552073723, 'samples': 9131712, 'steps': 47560, 'loss/train': 1.9596339464187622} 11/07/2021 03:56:46 - INFO - __main__ - Step 47562: {'lr': 0.0003919177367459892, 'samples': 9131904, 'steps': 47561, 'loss/train': 1.9238076210021973} 11/07/2021 03:56:47 - INFO - __main__ - Step 47563: {'lr': 0.00039191336790729526, 'samples': 9132096, 'steps': 47562, 'loss/train': 1.9140310287475586} 11/07/2021 03:56:47 - INFO - __main__ - Step 47564: {'lr': 0.00039190899900465727, 'samples': 9132288, 'steps': 47563, 'loss/train': 1.6951509714126587} 11/07/2021 03:56:47 - INFO - __main__ - Step 47565: {'lr': 0.0003919046300380773, 'samples': 9132480, 'steps': 47564, 'loss/train': 0.3123338520526886} 11/07/2021 03:56:48 - INFO - __main__ - Step 47566: {'lr': 0.00039190026100755735, 'samples': 9132672, 'steps': 47565, 'loss/train': 1.3794103860855103} 11/07/2021 03:56:49 - INFO - __main__ - Step 47567: {'lr': 0.00039189589191309927, 'samples': 9132864, 'steps': 47566, 'loss/train': 0.9913450479507446} 11/07/2021 03:56:49 - INFO - __main__ - Step 47568: {'lr': 0.00039189152275470514, 'samples': 9133056, 'steps': 47567, 'loss/train': 0.6241242289543152} 11/07/2021 03:56:49 - INFO - __main__ - Step 47569: {'lr': 0.0003918871535323769, 'samples': 9133248, 'steps': 47568, 'loss/train': 1.2644298076629639} 11/07/2021 03:56:50 - INFO - __main__ - Step 47570: {'lr': 0.0003918827842461165, 'samples': 9133440, 'steps': 47569, 'loss/train': 1.5793612003326416} 11/07/2021 03:56:50 - INFO - __main__ - Step 47571: {'lr': 0.0003918784148959258, 'samples': 9133632, 'steps': 47570, 'loss/train': 0.8174675107002258} 11/07/2021 03:56:51 - INFO - __main__ - Step 47572: {'lr': 0.0003918740454818069, 'samples': 9133824, 'steps': 47571, 'loss/train': 1.1769851446151733} 11/07/2021 03:56:52 - INFO - __main__ - Step 47573: {'lr': 0.0003918696760037618, 'samples': 9134016, 'steps': 47572, 'loss/train': 1.6718579530715942} 11/07/2021 03:56:52 - INFO - __main__ - Step 47574: {'lr': 0.0003918653064617924, 'samples': 9134208, 'steps': 47573, 'loss/train': 1.5838192701339722} 11/07/2021 03:56:52 - INFO - __main__ - Step 47575: {'lr': 0.00039186093685590064, 'samples': 9134400, 'steps': 47574, 'loss/train': 1.1955496072769165} 11/07/2021 03:56:53 - INFO - __main__ - Step 47576: {'lr': 0.0003918565671860886, 'samples': 9134592, 'steps': 47575, 'loss/train': 1.5702279806137085} 11/07/2021 03:56:53 - INFO - __main__ - Step 47577: {'lr': 0.00039185219745235816, 'samples': 9134784, 'steps': 47576, 'loss/train': 1.6203112602233887} 11/07/2021 03:56:54 - INFO - __main__ - Step 47578: {'lr': 0.0003918478276547113, 'samples': 9134976, 'steps': 47577, 'loss/train': 1.670324444770813} 11/07/2021 03:56:54 - INFO - __main__ - Step 47579: {'lr': 0.00039184345779315, 'samples': 9135168, 'steps': 47578, 'loss/train': 1.572860836982727} 11/07/2021 03:56:55 - INFO - __main__ - Step 47580: {'lr': 0.0003918390878676762, 'samples': 9135360, 'steps': 47579, 'loss/train': 1.5355112552642822} 11/07/2021 03:56:55 - INFO - __main__ - Step 47581: {'lr': 0.00039183471787829194, 'samples': 9135552, 'steps': 47580, 'loss/train': 1.6907731294631958} 11/07/2021 03:56:56 - INFO - __main__ - Step 47582: {'lr': 0.0003918303478249991, 'samples': 9135744, 'steps': 47581, 'loss/train': 1.6832005977630615} 11/07/2021 03:56:57 - INFO - __main__ - Step 47583: {'lr': 0.0003918259777077997, 'samples': 9135936, 'steps': 47582, 'loss/train': 1.2778152227401733} 11/07/2021 03:56:57 - INFO - __main__ - Step 47584: {'lr': 0.00039182160752669577, 'samples': 9136128, 'steps': 47583, 'loss/train': 1.6662622690200806} 11/07/2021 03:56:58 - INFO - __main__ - Step 47585: {'lr': 0.0003918172372816892, 'samples': 9136320, 'steps': 47584, 'loss/train': 1.4052339792251587} 11/07/2021 03:56:58 - INFO - __main__ - Step 47586: {'lr': 0.0003918128669727818, 'samples': 9136512, 'steps': 47585, 'loss/train': 0.2713298201560974} 11/07/2021 03:56:58 - INFO - __main__ - Step 47587: {'lr': 0.00039180849659997593, 'samples': 9136704, 'steps': 47586, 'loss/train': 0.8063874840736389} 11/07/2021 03:57:00 - INFO - __main__ - Step 47588: {'lr': 0.00039180412616327323, 'samples': 9136896, 'steps': 47587, 'loss/train': 1.20669424533844} 11/07/2021 03:57:00 - INFO - __main__ - Step 47589: {'lr': 0.00039179975566267585, 'samples': 9137088, 'steps': 47588, 'loss/train': 1.0589284896850586} 11/07/2021 03:57:00 - INFO - __main__ - Step 47590: {'lr': 0.00039179538509818556, 'samples': 9137280, 'steps': 47589, 'loss/train': 1.230054259300232} 11/07/2021 03:57:01 - INFO - __main__ - Step 47591: {'lr': 0.0003917910144698046, 'samples': 9137472, 'steps': 47590, 'loss/train': 2.2915971279144287} 11/07/2021 03:57:01 - INFO - __main__ - Step 47592: {'lr': 0.0003917866437775347, 'samples': 9137664, 'steps': 47591, 'loss/train': 1.0403106212615967} 11/07/2021 03:57:02 - INFO - __main__ - Step 47593: {'lr': 0.000391782273021378, 'samples': 9137856, 'steps': 47592, 'loss/train': 1.5003052949905396} 11/07/2021 03:57:02 - INFO - __main__ - Step 47594: {'lr': 0.00039177790220133637, 'samples': 9138048, 'steps': 47593, 'loss/train': 1.8745750188827515} 11/07/2021 03:57:03 - INFO - __main__ - Step 47595: {'lr': 0.0003917735313174117, 'samples': 9138240, 'steps': 47594, 'loss/train': 1.5211420059204102} 11/07/2021 03:57:03 - INFO - __main__ - Step 47596: {'lr': 0.0003917691603696062, 'samples': 9138432, 'steps': 47595, 'loss/train': 1.5756067037582397} 11/07/2021 03:57:03 - INFO - __main__ - Step 47597: {'lr': 0.0003917647893579217, 'samples': 9138624, 'steps': 47596, 'loss/train': 1.660907506942749} 11/07/2021 03:57:04 - INFO - __main__ - Step 47598: {'lr': 0.0003917604182823601, 'samples': 9138816, 'steps': 47597, 'loss/train': 1.3687254190444946} 11/07/2021 03:57:05 - INFO - __main__ - Step 47599: {'lr': 0.00039175604714292346, 'samples': 9139008, 'steps': 47598, 'loss/train': 1.4359571933746338} 11/07/2021 03:57:05 - INFO - __main__ - Step 47600: {'lr': 0.00039175167593961377, 'samples': 9139200, 'steps': 47599, 'loss/train': 1.042655110359192} 11/07/2021 03:57:06 - INFO - __main__ - Step 47601: {'lr': 0.0003917473046724329, 'samples': 9139392, 'steps': 47600, 'loss/train': 1.4571832418441772} 11/07/2021 03:57:06 - INFO - __main__ - Step 47602: {'lr': 0.000391742933341383, 'samples': 9139584, 'steps': 47601, 'loss/train': 1.5106868743896484} 11/07/2021 03:57:06 - INFO - __main__ - Step 47603: {'lr': 0.00039173856194646585, 'samples': 9139776, 'steps': 47602, 'loss/train': 1.426308035850525} 11/07/2021 03:57:08 - INFO - __main__ - Step 47604: {'lr': 0.00039173419048768343, 'samples': 9139968, 'steps': 47603, 'loss/train': 1.333543300628662} 11/07/2021 03:57:08 - INFO - __main__ - Step 47605: {'lr': 0.0003917298189650378, 'samples': 9140160, 'steps': 47604, 'loss/train': 2.3469433784484863} 11/07/2021 03:57:08 - INFO - __main__ - Step 47606: {'lr': 0.00039172544737853097, 'samples': 9140352, 'steps': 47605, 'loss/train': 0.9758121967315674} 11/07/2021 03:57:09 - INFO - __main__ - Step 47607: {'lr': 0.00039172107572816477, 'samples': 9140544, 'steps': 47606, 'loss/train': 1.5648424625396729} 11/07/2021 03:57:09 - INFO - __main__ - Step 47608: {'lr': 0.00039171670401394134, 'samples': 9140736, 'steps': 47607, 'loss/train': 0.8682147860527039} 11/07/2021 03:57:10 - INFO - __main__ - Step 47609: {'lr': 0.00039171233223586247, 'samples': 9140928, 'steps': 47608, 'loss/train': 1.2661722898483276} 11/07/2021 03:57:10 - INFO - __main__ - Step 47610: {'lr': 0.0003917079603939302, 'samples': 9141120, 'steps': 47609, 'loss/train': 2.0972232818603516} 11/07/2021 03:57:11 - INFO - __main__ - Step 47611: {'lr': 0.0003917035884881465, 'samples': 9141312, 'steps': 47610, 'loss/train': 0.6600740551948547} 11/07/2021 03:57:11 - INFO - __main__ - Step 47612: {'lr': 0.00039169921651851337, 'samples': 9141504, 'steps': 47611, 'loss/train': 1.7421717643737793} 11/07/2021 03:57:11 - INFO - __main__ - Step 47613: {'lr': 0.0003916948444850328, 'samples': 9141696, 'steps': 47612, 'loss/train': 1.5109429359436035} 11/07/2021 03:57:12 - INFO - __main__ - Step 47614: {'lr': 0.0003916904723877067, 'samples': 9141888, 'steps': 47613, 'loss/train': 1.4767059087753296} 11/07/2021 03:57:13 - INFO - __main__ - Step 47615: {'lr': 0.000391686100226537, 'samples': 9142080, 'steps': 47614, 'loss/train': 1.5321447849273682} 11/07/2021 03:57:13 - INFO - __main__ - Step 47616: {'lr': 0.00039168172800152577, 'samples': 9142272, 'steps': 47615, 'loss/train': 1.3128496408462524} 11/07/2021 03:57:14 - INFO - __main__ - Step 47617: {'lr': 0.0003916773557126749, 'samples': 9142464, 'steps': 47616, 'loss/train': 1.225580096244812} 11/07/2021 03:57:14 - INFO - __main__ - Step 47618: {'lr': 0.00039167298335998646, 'samples': 9142656, 'steps': 47617, 'loss/train': 1.678581953048706} 11/07/2021 03:57:15 - INFO - __main__ - Step 47619: {'lr': 0.0003916686109434624, 'samples': 9142848, 'steps': 47618, 'loss/train': 1.4629886150360107} 11/07/2021 03:57:15 - INFO - __main__ - Step 47620: {'lr': 0.00039166423846310463, 'samples': 9143040, 'steps': 47619, 'loss/train': 1.1578612327575684} 11/07/2021 03:57:16 - INFO - __main__ - Step 47621: {'lr': 0.00039165986591891506, 'samples': 9143232, 'steps': 47620, 'loss/train': 1.5875498056411743} 11/07/2021 03:57:16 - INFO - __main__ - Step 47622: {'lr': 0.0003916554933108958, 'samples': 9143424, 'steps': 47621, 'loss/train': 1.1253294944763184} 11/07/2021 03:57:16 - INFO - __main__ - Step 47623: {'lr': 0.00039165112063904874, 'samples': 9143616, 'steps': 47622, 'loss/train': 0.9231517314910889} 11/07/2021 03:57:18 - INFO - __main__ - Step 47624: {'lr': 0.0003916467479033759, 'samples': 9143808, 'steps': 47623, 'loss/train': 1.4769951105117798} 11/07/2021 03:57:18 - INFO - __main__ - Step 47625: {'lr': 0.00039164237510387915, 'samples': 9144000, 'steps': 47624, 'loss/train': 1.0370756387710571} 11/07/2021 03:57:18 - INFO - __main__ - Step 47626: {'lr': 0.0003916380022405606, 'samples': 9144192, 'steps': 47625, 'loss/train': 1.2719224691390991} 11/07/2021 03:57:19 - INFO - __main__ - Step 47627: {'lr': 0.0003916336293134222, 'samples': 9144384, 'steps': 47626, 'loss/train': 2.3809704780578613} 11/07/2021 03:57:19 - INFO - __main__ - Step 47628: {'lr': 0.0003916292563224657, 'samples': 9144576, 'steps': 47627, 'loss/train': 1.146511197090149} 11/07/2021 03:57:19 - INFO - __main__ - Step 47629: {'lr': 0.00039162488326769334, 'samples': 9144768, 'steps': 47628, 'loss/train': 2.0270538330078125} 11/07/2021 03:57:21 - INFO - __main__ - Step 47630: {'lr': 0.00039162051014910706, 'samples': 9144960, 'steps': 47629, 'loss/train': 2.115088939666748} 11/07/2021 03:57:21 - INFO - __main__ - Step 47631: {'lr': 0.0003916161369667087, 'samples': 9145152, 'steps': 47630, 'loss/train': 1.8053927421569824} 11/07/2021 03:57:21 - INFO - __main__ - Step 47632: {'lr': 0.0003916117637205003, 'samples': 9145344, 'steps': 47631, 'loss/train': 1.1802260875701904} 11/07/2021 03:57:22 - INFO - __main__ - Step 47633: {'lr': 0.00039160739041048376, 'samples': 9145536, 'steps': 47632, 'loss/train': 0.5280793309211731} 11/07/2021 03:57:22 - INFO - __main__ - Step 47634: {'lr': 0.0003916030170366612, 'samples': 9145728, 'steps': 47633, 'loss/train': 1.577242136001587} 11/07/2021 03:57:23 - INFO - __main__ - Step 47635: {'lr': 0.0003915986435990345, 'samples': 9145920, 'steps': 47634, 'loss/train': 1.6110584735870361} 11/07/2021 03:57:23 - INFO - __main__ - Step 47636: {'lr': 0.0003915942700976056, 'samples': 9146112, 'steps': 47635, 'loss/train': 1.3024262189865112} 11/07/2021 03:57:24 - INFO - __main__ - Step 47637: {'lr': 0.0003915898965323765, 'samples': 9146304, 'steps': 47636, 'loss/train': 1.7504475116729736} 11/07/2021 03:57:24 - INFO - __main__ - Step 47638: {'lr': 0.00039158552290334927, 'samples': 9146496, 'steps': 47637, 'loss/train': 1.4919499158859253} 11/07/2021 03:57:24 - INFO - __main__ - Step 47639: {'lr': 0.00039158114921052567, 'samples': 9146688, 'steps': 47638, 'loss/train': 1.6135183572769165} 11/07/2021 03:57:25 - INFO - __main__ - Step 47640: {'lr': 0.0003915767754539078, 'samples': 9146880, 'steps': 47639, 'loss/train': 1.364461898803711} 11/07/2021 03:57:26 - INFO - __main__ - Step 47641: {'lr': 0.0003915724016334977, 'samples': 9147072, 'steps': 47640, 'loss/train': 1.3528691530227661} 11/07/2021 03:57:26 - INFO - __main__ - Step 47642: {'lr': 0.00039156802774929723, 'samples': 9147264, 'steps': 47641, 'loss/train': 1.877328634262085} 11/07/2021 03:57:26 - INFO - __main__ - Step 47643: {'lr': 0.00039156365380130844, 'samples': 9147456, 'steps': 47642, 'loss/train': 1.2187020778656006} 11/07/2021 03:57:27 - INFO - __main__ - Step 47644: {'lr': 0.00039155927978953316, 'samples': 9147648, 'steps': 47643, 'loss/train': 1.3663796186447144} 11/07/2021 03:57:28 - INFO - __main__ - Step 47645: {'lr': 0.00039155490571397345, 'samples': 9147840, 'steps': 47644, 'loss/train': 1.871665596961975} 11/07/2021 03:57:29 - INFO - __main__ - Step 47646: {'lr': 0.0003915505315746313, 'samples': 9148032, 'steps': 47645, 'loss/train': 2.014326333999634} 11/07/2021 03:57:29 - INFO - __main__ - Step 47647: {'lr': 0.00039154615737150867, 'samples': 9148224, 'steps': 47646, 'loss/train': 1.3724406957626343} 11/07/2021 03:57:29 - INFO - __main__ - Step 47648: {'lr': 0.00039154178310460755, 'samples': 9148416, 'steps': 47647, 'loss/train': 1.3263518810272217} 11/07/2021 03:57:30 - INFO - __main__ - Step 47649: {'lr': 0.00039153740877392987, 'samples': 9148608, 'steps': 47648, 'loss/train': 0.2772655189037323} 11/07/2021 03:57:31 - INFO - __main__ - Step 47650: {'lr': 0.0003915330343794777, 'samples': 9148800, 'steps': 47649, 'loss/train': 1.6966464519500732} 11/07/2021 03:57:31 - INFO - __main__ - Step 47651: {'lr': 0.0003915286599212529, 'samples': 9148992, 'steps': 47650, 'loss/train': 1.5087082386016846} 11/07/2021 03:57:32 - INFO - __main__ - Step 47652: {'lr': 0.0003915242853992573, 'samples': 9149184, 'steps': 47651, 'loss/train': 1.1331799030303955} 11/07/2021 03:57:32 - INFO - __main__ - Step 47653: {'lr': 0.0003915199108134932, 'samples': 9149376, 'steps': 47652, 'loss/train': 0.2149980217218399} 11/07/2021 03:57:32 - INFO - __main__ - Step 47654: {'lr': 0.00039151553616396234, 'samples': 9149568, 'steps': 47653, 'loss/train': 5.551827430725098} 11/07/2021 03:57:33 - INFO - __main__ - Step 47655: {'lr': 0.0003915111614506668, 'samples': 9149760, 'steps': 47654, 'loss/train': 1.6605746746063232} 11/07/2021 03:57:34 - INFO - __main__ - Step 47656: {'lr': 0.0003915067866736085, 'samples': 9149952, 'steps': 47655, 'loss/train': 1.5566972494125366} 11/07/2021 03:57:34 - INFO - __main__ - Step 47657: {'lr': 0.0003915024118327895, 'samples': 9150144, 'steps': 47656, 'loss/train': 1.962587594985962} 11/07/2021 03:57:34 - INFO - __main__ - Step 47658: {'lr': 0.0003914980369282116, 'samples': 9150336, 'steps': 47657, 'loss/train': 1.4869587421417236} 11/07/2021 03:57:35 - INFO - __main__ - Step 47659: {'lr': 0.0003914936619598769, 'samples': 9150528, 'steps': 47658, 'loss/train': 1.1580917835235596} 11/07/2021 03:57:36 - INFO - __main__ - Step 47660: {'lr': 0.0003914892869277873, 'samples': 9150720, 'steps': 47659, 'loss/train': 1.6406500339508057} 11/07/2021 03:57:36 - INFO - __main__ - Step 47661: {'lr': 0.0003914849118319449, 'samples': 9150912, 'steps': 47660, 'loss/train': 1.7226920127868652} 11/07/2021 03:57:37 - INFO - __main__ - Step 47662: {'lr': 0.0003914805366723515, 'samples': 9151104, 'steps': 47661, 'loss/train': 1.6189788579940796} 11/07/2021 03:57:37 - INFO - __main__ - Step 47663: {'lr': 0.0003914761614490092, 'samples': 9151296, 'steps': 47662, 'loss/train': 1.6396089792251587} 11/07/2021 03:57:37 - INFO - __main__ - Step 47664: {'lr': 0.0003914717861619199, 'samples': 9151488, 'steps': 47663, 'loss/train': 1.578438639640808} 11/07/2021 03:57:38 - INFO - __main__ - Step 47665: {'lr': 0.00039146741081108567, 'samples': 9151680, 'steps': 47664, 'loss/train': 1.9214646816253662} 11/07/2021 03:57:39 - INFO - __main__ - Step 47666: {'lr': 0.0003914630353965083, 'samples': 9151872, 'steps': 47665, 'loss/train': 1.3176829814910889} 11/07/2021 03:57:39 - INFO - __main__ - Step 47667: {'lr': 0.00039145865991818994, 'samples': 9152064, 'steps': 47666, 'loss/train': 1.5965214967727661} 11/07/2021 03:57:39 - INFO - __main__ - Step 47668: {'lr': 0.00039145428437613246, 'samples': 9152256, 'steps': 47667, 'loss/train': 1.41964852809906} 11/07/2021 03:57:40 - INFO - __main__ - Step 47669: {'lr': 0.0003914499087703379, 'samples': 9152448, 'steps': 47668, 'loss/train': 1.509716510772705} 11/07/2021 03:57:40 - INFO - __main__ - Step 47670: {'lr': 0.00039144553310080816, 'samples': 9152640, 'steps': 47669, 'loss/train': 1.100142002105713} 11/07/2021 03:57:41 - INFO - __main__ - Step 47671: {'lr': 0.0003914411573675453, 'samples': 9152832, 'steps': 47670, 'loss/train': 1.584984302520752} 11/07/2021 03:57:42 - INFO - __main__ - Step 47672: {'lr': 0.0003914367815705511, 'samples': 9153024, 'steps': 47671, 'loss/train': 1.3642358779907227} 11/07/2021 03:57:42 - INFO - __main__ - Step 47673: {'lr': 0.00039143240570982776, 'samples': 9153216, 'steps': 47672, 'loss/train': 1.5138477087020874} 11/07/2021 03:57:42 - INFO - __main__ - Step 47674: {'lr': 0.00039142802978537716, 'samples': 9153408, 'steps': 47673, 'loss/train': 1.6956623792648315} 11/07/2021 03:57:43 - INFO - __main__ - Step 47675: {'lr': 0.00039142365379720123, 'samples': 9153600, 'steps': 47674, 'loss/train': 1.4973416328430176} 11/07/2021 03:57:44 - INFO - __main__ - Step 47676: {'lr': 0.0003914192777453021, 'samples': 9153792, 'steps': 47675, 'loss/train': 1.4953629970550537} 11/07/2021 03:57:44 - INFO - __main__ - Step 47677: {'lr': 0.00039141490162968154, 'samples': 9153984, 'steps': 47676, 'loss/train': 1.174144983291626} 11/07/2021 03:57:44 - INFO - __main__ - Step 47678: {'lr': 0.0003914105254503416, 'samples': 9154176, 'steps': 47677, 'loss/train': 1.7978628873825073} 11/07/2021 03:57:45 - INFO - __main__ - Step 47679: {'lr': 0.00039140614920728424, 'samples': 9154368, 'steps': 47678, 'loss/train': 1.5725959539413452} 11/07/2021 03:57:45 - INFO - __main__ - Step 47680: {'lr': 0.0003914017729005115, 'samples': 9154560, 'steps': 47679, 'loss/train': 1.680406928062439} 11/07/2021 03:57:46 - INFO - __main__ - Step 47681: {'lr': 0.00039139739653002527, 'samples': 9154752, 'steps': 47680, 'loss/train': 0.5801538825035095} 11/07/2021 03:57:46 - INFO - __main__ - Step 47682: {'lr': 0.00039139302009582753, 'samples': 9154944, 'steps': 47681, 'loss/train': 1.4638181924819946} 11/07/2021 03:57:47 - INFO - __main__ - Step 47683: {'lr': 0.00039138864359792035, 'samples': 9155136, 'steps': 47682, 'loss/train': 1.611740231513977} 11/07/2021 03:57:47 - INFO - __main__ - Step 47684: {'lr': 0.0003913842670363056, 'samples': 9155328, 'steps': 47683, 'loss/train': 0.9025747776031494} 11/07/2021 03:57:47 - INFO - __main__ - Step 47685: {'lr': 0.0003913798904109853, 'samples': 9155520, 'steps': 47684, 'loss/train': 1.3374627828598022} 11/07/2021 03:57:49 - INFO - __main__ - Step 47686: {'lr': 0.0003913755137219614, 'samples': 9155712, 'steps': 47685, 'loss/train': 1.321630597114563} 11/07/2021 03:57:49 - INFO - __main__ - Step 47687: {'lr': 0.00039137113696923587, 'samples': 9155904, 'steps': 47686, 'loss/train': 1.7227809429168701} 11/07/2021 03:57:49 - INFO - __main__ - Step 47688: {'lr': 0.00039136676015281063, 'samples': 9156096, 'steps': 47687, 'loss/train': 0.8350129723548889} 11/07/2021 03:57:50 - INFO - __main__ - Step 47689: {'lr': 0.00039136238327268776, 'samples': 9156288, 'steps': 47688, 'loss/train': 1.270866870880127} 11/07/2021 03:57:50 - INFO - __main__ - Step 47690: {'lr': 0.0003913580063288692, 'samples': 9156480, 'steps': 47689, 'loss/train': 1.9323718547821045} 11/07/2021 03:57:51 - INFO - __main__ - Step 47691: {'lr': 0.0003913536293213569, 'samples': 9156672, 'steps': 47690, 'loss/train': 1.5638900995254517} 11/07/2021 03:57:51 - INFO - __main__ - Step 47692: {'lr': 0.00039134925225015277, 'samples': 9156864, 'steps': 47691, 'loss/train': 1.785346269607544} 11/07/2021 03:57:52 - INFO - __main__ - Step 47693: {'lr': 0.0003913448751152589, 'samples': 9157056, 'steps': 47692, 'loss/train': 0.6156972050666809} 11/07/2021 03:57:52 - INFO - __main__ - Step 47694: {'lr': 0.0003913404979166772, 'samples': 9157248, 'steps': 47693, 'loss/train': 1.6768780946731567} 11/07/2021 03:57:53 - INFO - __main__ - Step 47695: {'lr': 0.00039133612065440964, 'samples': 9157440, 'steps': 47694, 'loss/train': 1.6212339401245117} 11/07/2021 03:57:53 - INFO - __main__ - Step 47696: {'lr': 0.0003913317433284582, 'samples': 9157632, 'steps': 47695, 'loss/train': 1.614753246307373} 11/07/2021 03:57:53 - INFO - __main__ - Step 47697: {'lr': 0.0003913273659388249, 'samples': 9157824, 'steps': 47696, 'loss/train': 1.357269287109375} 11/07/2021 03:57:54 - INFO - __main__ - Step 47698: {'lr': 0.0003913229884855117, 'samples': 9158016, 'steps': 47697, 'loss/train': 0.7561484575271606} 11/07/2021 03:57:55 - INFO - __main__ - Step 47699: {'lr': 0.00039131861096852044, 'samples': 9158208, 'steps': 47698, 'loss/train': 1.6597344875335693} 11/07/2021 03:57:55 - INFO - __main__ - Step 47700: {'lr': 0.0003913142333878533, 'samples': 9158400, 'steps': 47699, 'loss/train': 1.3496906757354736} 11/07/2021 03:57:55 - INFO - __main__ - Step 47701: {'lr': 0.0003913098557435121, 'samples': 9158592, 'steps': 47700, 'loss/train': 1.3063585758209229} 11/07/2021 03:57:56 - INFO - __main__ - Step 47702: {'lr': 0.00039130547803549877, 'samples': 9158784, 'steps': 47701, 'loss/train': 1.5292190313339233} 11/07/2021 03:57:57 - INFO - __main__ - Step 47703: {'lr': 0.00039130110026381547, 'samples': 9158976, 'steps': 47702, 'loss/train': 1.587748408317566} 11/07/2021 03:57:57 - INFO - __main__ - Step 47704: {'lr': 0.00039129672242846407, 'samples': 9159168, 'steps': 47703, 'loss/train': 0.2601267695426941} 11/07/2021 03:57:58 - INFO - __main__ - Step 47705: {'lr': 0.0003912923445294465, 'samples': 9159360, 'steps': 47704, 'loss/train': 1.6256636381149292} 11/07/2021 03:57:58 - INFO - __main__ - Step 47706: {'lr': 0.00039128796656676487, 'samples': 9159552, 'steps': 47705, 'loss/train': 1.1044323444366455} 11/07/2021 03:57:58 - INFO - __main__ - Step 47707: {'lr': 0.000391283588540421, 'samples': 9159744, 'steps': 47706, 'loss/train': 1.5699584484100342} 11/07/2021 03:57:59 - INFO - __main__ - Step 47708: {'lr': 0.00039127921045041693, 'samples': 9159936, 'steps': 47707, 'loss/train': 1.3472719192504883} 11/07/2021 03:58:00 - INFO - __main__ - Step 47709: {'lr': 0.00039127483229675457, 'samples': 9160128, 'steps': 47708, 'loss/train': 1.4134140014648438} 11/07/2021 03:58:00 - INFO - __main__ - Step 47710: {'lr': 0.0003912704540794361, 'samples': 9160320, 'steps': 47709, 'loss/train': 1.2979545593261719} 11/07/2021 03:58:00 - INFO - __main__ - Step 47711: {'lr': 0.0003912660757984632, 'samples': 9160512, 'steps': 47710, 'loss/train': 1.8867838382720947} 11/07/2021 03:58:01 - INFO - __main__ - Step 47712: {'lr': 0.00039126169745383807, 'samples': 9160704, 'steps': 47711, 'loss/train': 0.4019244313240051} 11/07/2021 03:58:01 - INFO - __main__ - Step 47713: {'lr': 0.00039125731904556254, 'samples': 9160896, 'steps': 47712, 'loss/train': 1.565820336341858} 11/07/2021 03:58:02 - INFO - __main__ - Step 47714: {'lr': 0.0003912529405736387, 'samples': 9161088, 'steps': 47713, 'loss/train': 1.0823204517364502} 11/07/2021 03:58:02 - INFO - __main__ - Step 47715: {'lr': 0.00039124856203806834, 'samples': 9161280, 'steps': 47714, 'loss/train': 1.3622620105743408} 11/07/2021 03:58:03 - INFO - __main__ - Step 47716: {'lr': 0.0003912441834388537, 'samples': 9161472, 'steps': 47715, 'loss/train': 1.5373780727386475} 11/07/2021 03:58:03 - INFO - __main__ - Step 47717: {'lr': 0.00039123980477599664, 'samples': 9161664, 'steps': 47716, 'loss/train': 1.94687020778656} 11/07/2021 03:58:04 - INFO - __main__ - Step 47718: {'lr': 0.00039123542604949904, 'samples': 9161856, 'steps': 47717, 'loss/train': 1.64690363407135} 11/07/2021 03:58:04 - INFO - __main__ - Step 47719: {'lr': 0.0003912310472593629, 'samples': 9162048, 'steps': 47718, 'loss/train': 1.5679748058319092} 11/07/2021 03:58:05 - INFO - __main__ - Step 47720: {'lr': 0.0003912266684055902, 'samples': 9162240, 'steps': 47719, 'loss/train': 1.5285804271697998} 11/07/2021 03:58:05 - INFO - __main__ - Step 47721: {'lr': 0.000391222289488183, 'samples': 9162432, 'steps': 47720, 'loss/train': 1.8171290159225464} 11/07/2021 03:58:06 - INFO - __main__ - Step 47722: {'lr': 0.00039121791050714317, 'samples': 9162624, 'steps': 47721, 'loss/train': 1.580133318901062} 11/07/2021 03:58:06 - INFO - __main__ - Step 47723: {'lr': 0.0003912135314624728, 'samples': 9162816, 'steps': 47722, 'loss/train': 1.504193902015686} 11/07/2021 03:58:07 - INFO - __main__ - Step 47724: {'lr': 0.00039120915235417377, 'samples': 9163008, 'steps': 47723, 'loss/train': 1.044930100440979} 11/07/2021 03:58:07 - INFO - __main__ - Step 47725: {'lr': 0.0003912047731822481, 'samples': 9163200, 'steps': 47724, 'loss/train': 1.6905770301818848} 11/07/2021 03:58:08 - INFO - __main__ - Step 47726: {'lr': 0.0003912003939466977, 'samples': 9163392, 'steps': 47725, 'loss/train': 1.3502663373947144} 11/07/2021 03:58:08 - INFO - __main__ - Step 47727: {'lr': 0.0003911960146475245, 'samples': 9163584, 'steps': 47726, 'loss/train': 0.49609217047691345} 11/07/2021 03:58:08 - INFO - __main__ - Step 47728: {'lr': 0.0003911916352847307, 'samples': 9163776, 'steps': 47727, 'loss/train': 1.264794945716858} 11/07/2021 03:58:10 - INFO - __main__ - Step 47729: {'lr': 0.0003911872558583181, 'samples': 9163968, 'steps': 47728, 'loss/train': 1.5488823652267456} 11/07/2021 03:58:10 - INFO - __main__ - Step 47730: {'lr': 0.00039118287636828866, 'samples': 9164160, 'steps': 47729, 'loss/train': 1.5856084823608398} 11/07/2021 03:58:10 - INFO - __main__ - Step 47731: {'lr': 0.0003911784968146444, 'samples': 9164352, 'steps': 47730, 'loss/train': 1.1186045408248901} 11/07/2021 03:58:11 - INFO - __main__ - Step 47732: {'lr': 0.00039117411719738726, 'samples': 9164544, 'steps': 47731, 'loss/train': 1.676872968673706} 11/07/2021 03:58:11 - INFO - __main__ - Step 47733: {'lr': 0.0003911697375165193, 'samples': 9164736, 'steps': 47732, 'loss/train': 1.6229221820831299} 11/07/2021 03:58:12 - INFO - __main__ - Step 47734: {'lr': 0.00039116535777204237, 'samples': 9164928, 'steps': 47733, 'loss/train': 1.2042266130447388} 11/07/2021 03:58:12 - INFO - __main__ - Step 47735: {'lr': 0.00039116097796395856, 'samples': 9165120, 'steps': 47734, 'loss/train': 1.3100491762161255} 11/07/2021 03:58:13 - INFO - __main__ - Step 47736: {'lr': 0.00039115659809226975, 'samples': 9165312, 'steps': 47735, 'loss/train': 1.545833706855774} 11/07/2021 03:58:13 - INFO - __main__ - Step 47737: {'lr': 0.00039115221815697797, 'samples': 9165504, 'steps': 47736, 'loss/train': 1.6659634113311768} 11/07/2021 03:58:13 - INFO - __main__ - Step 47738: {'lr': 0.00039114783815808526, 'samples': 9165696, 'steps': 47737, 'loss/train': 1.2204612493515015} 11/07/2021 03:58:14 - INFO - __main__ - Step 47739: {'lr': 0.0003911434580955934, 'samples': 9165888, 'steps': 47738, 'loss/train': 1.8816306591033936} 11/07/2021 03:58:15 - INFO - __main__ - Step 47740: {'lr': 0.00039113907796950453, 'samples': 9166080, 'steps': 47739, 'loss/train': 1.467585802078247} 11/07/2021 03:58:15 - INFO - __main__ - Step 47741: {'lr': 0.0003911346977798206, 'samples': 9166272, 'steps': 47740, 'loss/train': 0.9358709454536438} 11/07/2021 03:58:16 - INFO - __main__ - Step 47742: {'lr': 0.0003911303175265435, 'samples': 9166464, 'steps': 47741, 'loss/train': 1.3167362213134766} 11/07/2021 03:58:16 - INFO - __main__ - Step 47743: {'lr': 0.00039112593720967524, 'samples': 9166656, 'steps': 47742, 'loss/train': 1.5671089887619019} 11/07/2021 03:58:16 - INFO - __main__ - Step 47744: {'lr': 0.00039112155682921785, 'samples': 9166848, 'steps': 47743, 'loss/train': 1.520840048789978} 11/07/2021 03:58:17 - INFO - __main__ - Step 47745: {'lr': 0.00039111717638517325, 'samples': 9167040, 'steps': 47744, 'loss/train': 1.286608099937439} 11/07/2021 03:58:18 - INFO - __main__ - Step 47746: {'lr': 0.00039111279587754344, 'samples': 9167232, 'steps': 47745, 'loss/train': 1.2636497020721436} 11/07/2021 03:58:18 - INFO - __main__ - Step 47747: {'lr': 0.0003911084153063303, 'samples': 9167424, 'steps': 47746, 'loss/train': 1.6070020198822021} 11/07/2021 03:58:18 - INFO - __main__ - Step 47748: {'lr': 0.000391104034671536, 'samples': 9167616, 'steps': 47747, 'loss/train': 1.3143073320388794} 11/07/2021 03:58:19 - INFO - __main__ - Step 47749: {'lr': 0.00039109965397316236, 'samples': 9167808, 'steps': 47748, 'loss/train': 1.0709642171859741} 11/07/2021 03:58:20 - INFO - __main__ - Step 47750: {'lr': 0.0003910952732112114, 'samples': 9168000, 'steps': 47749, 'loss/train': 1.06427800655365} 11/07/2021 03:58:20 - INFO - __main__ - Step 47751: {'lr': 0.00039109089238568507, 'samples': 9168192, 'steps': 47750, 'loss/train': 1.802921175956726} 11/07/2021 03:58:20 - INFO - __main__ - Step 47752: {'lr': 0.00039108651149658534, 'samples': 9168384, 'steps': 47751, 'loss/train': 1.2981460094451904} 11/07/2021 03:58:21 - INFO - __main__ - Step 47753: {'lr': 0.0003910821305439143, 'samples': 9168576, 'steps': 47752, 'loss/train': 1.3508820533752441} 11/07/2021 03:58:21 - INFO - __main__ - Step 47754: {'lr': 0.00039107774952767374, 'samples': 9168768, 'steps': 47753, 'loss/train': 4.926101207733154} 11/07/2021 03:58:22 - INFO - __main__ - Step 47755: {'lr': 0.0003910733684478657, 'samples': 9168960, 'steps': 47754, 'loss/train': 1.0816140174865723} 11/07/2021 03:58:23 - INFO - __main__ - Step 47756: {'lr': 0.00039106898730449223, 'samples': 9169152, 'steps': 47755, 'loss/train': 1.4479185342788696} 11/07/2021 03:58:23 - INFO - __main__ - Step 47757: {'lr': 0.0003910646060975553, 'samples': 9169344, 'steps': 47756, 'loss/train': 1.2211220264434814} 11/07/2021 03:58:23 - INFO - __main__ - Step 47758: {'lr': 0.00039106022482705675, 'samples': 9169536, 'steps': 47757, 'loss/train': 1.4629676342010498} 11/07/2021 03:58:24 - INFO - __main__ - Step 47759: {'lr': 0.0003910558434929987, 'samples': 9169728, 'steps': 47758, 'loss/train': 1.2839739322662354} 11/07/2021 03:58:25 - INFO - __main__ - Step 47760: {'lr': 0.000391051462095383, 'samples': 9169920, 'steps': 47759, 'loss/train': 1.7330576181411743} 11/07/2021 03:58:25 - INFO - __main__ - Step 47761: {'lr': 0.0003910470806342117, 'samples': 9170112, 'steps': 47760, 'loss/train': 1.3753608465194702} 11/07/2021 03:58:25 - INFO - __main__ - Step 47762: {'lr': 0.00039104269910948675, 'samples': 9170304, 'steps': 47761, 'loss/train': 1.8073952198028564} 11/07/2021 03:58:26 - INFO - __main__ - Step 47763: {'lr': 0.00039103831752121024, 'samples': 9170496, 'steps': 47762, 'loss/train': 1.5527784824371338} 11/07/2021 03:58:26 - INFO - __main__ - Step 47764: {'lr': 0.00039103393586938394, 'samples': 9170688, 'steps': 47763, 'loss/train': 1.7788493633270264} 11/07/2021 03:58:26 - INFO - __main__ - Step 47765: {'lr': 0.00039102955415401, 'samples': 9170880, 'steps': 47764, 'loss/train': 1.4334397315979004} 11/07/2021 03:58:27 - INFO - __main__ - Step 47766: {'lr': 0.00039102517237509025, 'samples': 9171072, 'steps': 47765, 'loss/train': 1.1334584951400757} 11/07/2021 03:58:28 - INFO - __main__ - Step 47767: {'lr': 0.0003910207905326267, 'samples': 9171264, 'steps': 47766, 'loss/train': 1.5549800395965576} 11/07/2021 03:58:28 - INFO - __main__ - Step 47768: {'lr': 0.00039101640862662147, 'samples': 9171456, 'steps': 47767, 'loss/train': 1.2709290981292725} 11/07/2021 03:58:28 - INFO - __main__ - Step 47769: {'lr': 0.0003910120266570764, 'samples': 9171648, 'steps': 47768, 'loss/train': 2.0093159675598145} 11/07/2021 03:58:29 - INFO - __main__ - Step 47770: {'lr': 0.0003910076446239934, 'samples': 9171840, 'steps': 47769, 'loss/train': 1.5359703302383423} 11/07/2021 03:58:30 - INFO - __main__ - Step 47771: {'lr': 0.00039100326252737463, 'samples': 9172032, 'steps': 47770, 'loss/train': 1.9358810186386108} 11/07/2021 03:58:30 - INFO - __main__ - Step 47772: {'lr': 0.00039099888036722187, 'samples': 9172224, 'steps': 47771, 'loss/train': 1.156240701675415} 11/07/2021 03:58:31 - INFO - __main__ - Step 47773: {'lr': 0.00039099449814353725, 'samples': 9172416, 'steps': 47772, 'loss/train': 0.9753404259681702} 11/07/2021 03:58:31 - INFO - __main__ - Step 47774: {'lr': 0.00039099011585632266, 'samples': 9172608, 'steps': 47773, 'loss/train': 1.1997112035751343} 11/07/2021 03:58:31 - INFO - __main__ - Step 47775: {'lr': 0.0003909857335055801, 'samples': 9172800, 'steps': 47774, 'loss/train': 1.3026461601257324} 11/07/2021 03:58:32 - INFO - __main__ - Step 47776: {'lr': 0.00039098135109131156, 'samples': 9172992, 'steps': 47775, 'loss/train': 1.5997029542922974} 11/07/2021 03:58:33 - INFO - __main__ - Step 47777: {'lr': 0.00039097696861351895, 'samples': 9173184, 'steps': 47776, 'loss/train': 1.5098634958267212} 11/07/2021 03:58:33 - INFO - __main__ - Step 47778: {'lr': 0.00039097258607220445, 'samples': 9173376, 'steps': 47777, 'loss/train': 1.658744215965271} 11/07/2021 03:58:33 - INFO - __main__ - Step 47779: {'lr': 0.00039096820346736974, 'samples': 9173568, 'steps': 47778, 'loss/train': 1.496510624885559} 11/07/2021 03:58:34 - INFO - __main__ - Step 47780: {'lr': 0.00039096382079901695, 'samples': 9173760, 'steps': 47779, 'loss/train': 1.2784366607666016} 11/07/2021 03:58:35 - INFO - __main__ - Step 47781: {'lr': 0.000390959438067148, 'samples': 9173952, 'steps': 47780, 'loss/train': 1.5558462142944336} 11/07/2021 03:58:35 - INFO - __main__ - Step 47782: {'lr': 0.000390955055271765, 'samples': 9174144, 'steps': 47781, 'loss/train': 1.5842334032058716} 11/07/2021 03:58:35 - INFO - __main__ - Step 47783: {'lr': 0.00039095067241286973, 'samples': 9174336, 'steps': 47782, 'loss/train': 1.5696221590042114} 11/07/2021 03:58:36 - INFO - __main__ - Step 47784: {'lr': 0.00039094628949046435, 'samples': 9174528, 'steps': 47783, 'loss/train': 1.2933385372161865} 11/07/2021 03:58:36 - INFO - __main__ - Step 47785: {'lr': 0.0003909419065045507, 'samples': 9174720, 'steps': 47784, 'loss/train': 1.1484545469284058} 11/07/2021 03:58:37 - INFO - __main__ - Step 47786: {'lr': 0.0003909375234551308, 'samples': 9174912, 'steps': 47785, 'loss/train': 1.266628623008728} 11/07/2021 03:58:37 - INFO - __main__ - Step 47787: {'lr': 0.0003909331403422066, 'samples': 9175104, 'steps': 47786, 'loss/train': 1.4414047002792358} 11/07/2021 03:58:38 - INFO - __main__ - Step 47788: {'lr': 0.00039092875716578013, 'samples': 9175296, 'steps': 47787, 'loss/train': 1.4967223405838013} 11/07/2021 03:58:38 - INFO - __main__ - Step 47789: {'lr': 0.00039092437392585335, 'samples': 9175488, 'steps': 47788, 'loss/train': 1.2454779148101807} 11/07/2021 03:58:39 - INFO - __main__ - Step 47790: {'lr': 0.0003909199906224282, 'samples': 9175680, 'steps': 47789, 'loss/train': 1.5636887550354004} 11/07/2021 03:58:40 - INFO - __main__ - Step 47791: {'lr': 0.00039091560725550676, 'samples': 9175872, 'steps': 47790, 'loss/train': 1.3521119356155396} 11/07/2021 03:58:40 - INFO - __main__ - Step 47792: {'lr': 0.0003909112238250908, 'samples': 9176064, 'steps': 47791, 'loss/train': 1.4724849462509155} 11/07/2021 03:58:40 - INFO - __main__ - Step 47793: {'lr': 0.0003909068403311825, 'samples': 9176256, 'steps': 47792, 'loss/train': 1.5105748176574707} 11/07/2021 03:58:41 - INFO - __main__ - Step 47794: {'lr': 0.0003909024567737837, 'samples': 9176448, 'steps': 47793, 'loss/train': 1.4884341955184937} 11/07/2021 03:58:41 - INFO - __main__ - Step 47795: {'lr': 0.0003908980731528965, 'samples': 9176640, 'steps': 47794, 'loss/train': 1.392460584640503} 11/07/2021 03:58:42 - INFO - __main__ - Step 47796: {'lr': 0.0003908936894685227, 'samples': 9176832, 'steps': 47795, 'loss/train': 1.4147835969924927} 11/07/2021 03:58:42 - INFO - __main__ - Step 47797: {'lr': 0.0003908893057206644, 'samples': 9177024, 'steps': 47796, 'loss/train': 1.644497036933899} 11/07/2021 03:58:43 - INFO - __main__ - Step 47798: {'lr': 0.00039088492190932365, 'samples': 9177216, 'steps': 47797, 'loss/train': 1.58420729637146} 11/07/2021 03:58:43 - INFO - __main__ - Step 47799: {'lr': 0.00039088053803450223, 'samples': 9177408, 'steps': 47798, 'loss/train': 1.7875009775161743} 11/07/2021 03:58:43 - INFO - __main__ - Step 47800: {'lr': 0.00039087615409620223, 'samples': 9177600, 'steps': 47799, 'loss/train': 1.3872160911560059} 11/07/2021 03:58:44 - INFO - __main__ - Step 47801: {'lr': 0.00039087177009442567, 'samples': 9177792, 'steps': 47800, 'loss/train': 1.6980464458465576} 11/07/2021 03:58:45 - INFO - __main__ - Step 47802: {'lr': 0.0003908673860291744, 'samples': 9177984, 'steps': 47801, 'loss/train': 1.910475254058838} 11/07/2021 03:58:45 - INFO - __main__ - Step 47803: {'lr': 0.0003908630019004504, 'samples': 9178176, 'steps': 47802, 'loss/train': 1.361932396888733} 11/07/2021 03:58:45 - INFO - __main__ - Step 47804: {'lr': 0.00039085861770825586, 'samples': 9178368, 'steps': 47803, 'loss/train': 1.6999609470367432} 11/07/2021 03:58:46 - INFO - __main__ - Step 47805: {'lr': 0.00039085423345259254, 'samples': 9178560, 'steps': 47804, 'loss/train': 1.305261492729187} 11/07/2021 03:58:46 - INFO - __main__ - Step 47806: {'lr': 0.00039084984913346246, 'samples': 9178752, 'steps': 47805, 'loss/train': 1.737400770187378} 11/07/2021 03:58:47 - INFO - __main__ - Step 47807: {'lr': 0.0003908454647508676, 'samples': 9178944, 'steps': 47806, 'loss/train': 1.7078434228897095} 11/07/2021 03:58:48 - INFO - __main__ - Step 47808: {'lr': 0.0003908410803048099, 'samples': 9179136, 'steps': 47807, 'loss/train': 0.9526119232177734} 11/07/2021 03:58:48 - INFO - __main__ - Step 47809: {'lr': 0.0003908366957952915, 'samples': 9179328, 'steps': 47808, 'loss/train': 1.8848559856414795} 11/07/2021 03:58:48 - INFO - __main__ - Step 47810: {'lr': 0.0003908323112223142, 'samples': 9179520, 'steps': 47809, 'loss/train': 1.385873556137085} 11/07/2021 03:58:49 - INFO - __main__ - Step 47811: {'lr': 0.0003908279265858801, 'samples': 9179712, 'steps': 47810, 'loss/train': 1.7560125589370728} 11/07/2021 03:58:50 - INFO - __main__ - Step 47812: {'lr': 0.00039082354188599094, 'samples': 9179904, 'steps': 47811, 'loss/train': 0.9998289942741394} 11/07/2021 03:58:50 - INFO - __main__ - Step 47813: {'lr': 0.00039081915712264897, 'samples': 9180096, 'steps': 47812, 'loss/train': 1.5740004777908325} 11/07/2021 03:58:50 - INFO - __main__ - Step 47814: {'lr': 0.000390814772295856, 'samples': 9180288, 'steps': 47813, 'loss/train': 0.9608702063560486} 11/07/2021 03:58:51 - INFO - __main__ - Step 47815: {'lr': 0.0003908103874056142, 'samples': 9180480, 'steps': 47814, 'loss/train': 1.3893488645553589} 11/07/2021 03:58:51 - INFO - __main__ - Step 47816: {'lr': 0.0003908060024519253, 'samples': 9180672, 'steps': 47815, 'loss/train': 1.414729118347168} 11/07/2021 03:58:51 - INFO - __main__ - Step 47817: {'lr': 0.0003908016174347915, 'samples': 9180864, 'steps': 47816, 'loss/train': 1.5888200998306274} 11/07/2021 03:58:52 - INFO - __main__ - Step 47818: {'lr': 0.00039079723235421456, 'samples': 9181056, 'steps': 47817, 'loss/train': 1.9803375005722046} 11/07/2021 03:58:53 - INFO - __main__ - Step 47819: {'lr': 0.0003907928472101966, 'samples': 9181248, 'steps': 47818, 'loss/train': 1.477423071861267} 11/07/2021 03:58:53 - INFO - __main__ - Step 47820: {'lr': 0.00039078846200273955, 'samples': 9181440, 'steps': 47819, 'loss/train': 1.858801007270813} 11/07/2021 03:58:53 - INFO - __main__ - Step 47821: {'lr': 0.00039078407673184536, 'samples': 9181632, 'steps': 47820, 'loss/train': 1.4473891258239746} 11/07/2021 03:58:54 - INFO - __main__ - Step 47822: {'lr': 0.000390779691397516, 'samples': 9181824, 'steps': 47821, 'loss/train': 1.2635656595230103} 11/07/2021 03:58:55 - INFO - __main__ - Step 47823: {'lr': 0.0003907753059997536, 'samples': 9182016, 'steps': 47822, 'loss/train': 1.6431529521942139} 11/07/2021 03:58:55 - INFO - __main__ - Step 47824: {'lr': 0.00039077092053855996, 'samples': 9182208, 'steps': 47823, 'loss/train': 1.7903770208358765} 11/07/2021 03:58:56 - INFO - __main__ - Step 47825: {'lr': 0.0003907665350139371, 'samples': 9182400, 'steps': 47824, 'loss/train': 1.5683518648147583} 11/07/2021 03:58:56 - INFO - __main__ - Step 47826: {'lr': 0.00039076214942588704, 'samples': 9182592, 'steps': 47825, 'loss/train': 1.5268745422363281} 11/07/2021 03:58:56 - INFO - __main__ - Step 47827: {'lr': 0.00039075776377441176, 'samples': 9182784, 'steps': 47826, 'loss/train': 1.1185357570648193} 11/07/2021 03:58:57 - INFO - __main__ - Step 47828: {'lr': 0.00039075337805951314, 'samples': 9182976, 'steps': 47827, 'loss/train': 5.700735092163086} 11/07/2021 03:58:58 - INFO - __main__ - Step 47829: {'lr': 0.0003907489922811932, 'samples': 9183168, 'steps': 47828, 'loss/train': 1.038259744644165} 11/07/2021 03:58:58 - INFO - __main__ - Step 47830: {'lr': 0.000390744606439454, 'samples': 9183360, 'steps': 47829, 'loss/train': 1.5854010581970215} 11/07/2021 03:58:58 - INFO - __main__ - Step 47831: {'lr': 0.00039074022053429746, 'samples': 9183552, 'steps': 47830, 'loss/train': 1.1093767881393433} 11/07/2021 03:58:59 - INFO - __main__ - Step 47832: {'lr': 0.00039073583456572547, 'samples': 9183744, 'steps': 47831, 'loss/train': 1.3338769674301147} 11/07/2021 03:59:00 - INFO - __main__ - Step 47833: {'lr': 0.0003907314485337402, 'samples': 9183936, 'steps': 47832, 'loss/train': 1.903498649597168} 11/07/2021 03:59:00 - INFO - __main__ - Step 47834: {'lr': 0.00039072706243834345, 'samples': 9184128, 'steps': 47833, 'loss/train': 1.6174588203430176} 11/07/2021 03:59:01 - INFO - __main__ - Step 47835: {'lr': 0.0003907226762795372, 'samples': 9184320, 'steps': 47834, 'loss/train': 1.1398457288742065} 11/07/2021 03:59:01 - INFO - __main__ - Step 47836: {'lr': 0.0003907182900573235, 'samples': 9184512, 'steps': 47835, 'loss/train': 1.6860932111740112} 11/07/2021 03:59:01 - INFO - __main__ - Step 47837: {'lr': 0.00039071390377170434, 'samples': 9184704, 'steps': 47836, 'loss/train': 1.1040606498718262} 11/07/2021 03:59:02 - INFO - __main__ - Step 47838: {'lr': 0.00039070951742268173, 'samples': 9184896, 'steps': 47837, 'loss/train': 1.4555193185806274} 11/07/2021 03:59:03 - INFO - __main__ - Step 47839: {'lr': 0.00039070513101025753, 'samples': 9185088, 'steps': 47838, 'loss/train': 1.118199348449707} 11/07/2021 03:59:03 - INFO - __main__ - Step 47840: {'lr': 0.00039070074453443374, 'samples': 9185280, 'steps': 47839, 'loss/train': 1.9513616561889648} 11/07/2021 03:59:03 - INFO - __main__ - Step 47841: {'lr': 0.0003906963579952124, 'samples': 9185472, 'steps': 47840, 'loss/train': 1.395675539970398} 11/07/2021 03:59:04 - INFO - __main__ - Step 47842: {'lr': 0.0003906919713925954, 'samples': 9185664, 'steps': 47841, 'loss/train': 1.8071616888046265} 11/07/2021 03:59:04 - INFO - __main__ - Step 47843: {'lr': 0.00039068758472658483, 'samples': 9185856, 'steps': 47842, 'loss/train': 1.3935356140136719} 11/07/2021 03:59:05 - INFO - __main__ - Step 47844: {'lr': 0.0003906831979971826, 'samples': 9186048, 'steps': 47843, 'loss/train': 1.501913070678711} 11/07/2021 03:59:05 - INFO - __main__ - Step 47845: {'lr': 0.0003906788112043907, 'samples': 9186240, 'steps': 47844, 'loss/train': 1.1235847473144531} 11/07/2021 03:59:06 - INFO - __main__ - Step 47846: {'lr': 0.00039067442434821106, 'samples': 9186432, 'steps': 47845, 'loss/train': 0.49357911944389343} 11/07/2021 03:59:06 - INFO - __main__ - Step 47847: {'lr': 0.0003906700374286457, 'samples': 9186624, 'steps': 47846, 'loss/train': 1.6865296363830566} 11/07/2021 03:59:06 - INFO - __main__ - Step 47848: {'lr': 0.0003906656504456966, 'samples': 9186816, 'steps': 47847, 'loss/train': 1.388684868812561} 11/07/2021 03:59:08 - INFO - __main__ - Step 47849: {'lr': 0.0003906612633993657, 'samples': 9187008, 'steps': 47848, 'loss/train': 0.9301236867904663} 11/07/2021 03:59:08 - INFO - __main__ - Step 47850: {'lr': 0.00039065687628965506, 'samples': 9187200, 'steps': 47849, 'loss/train': 1.4606879949569702} 11/07/2021 03:59:08 - INFO - __main__ - Step 47851: {'lr': 0.0003906524891165666, 'samples': 9187392, 'steps': 47850, 'loss/train': 1.518761157989502} 11/07/2021 03:59:09 - INFO - __main__ - Step 47852: {'lr': 0.00039064810188010223, 'samples': 9187584, 'steps': 47851, 'loss/train': 1.4743632078170776} 11/07/2021 03:59:09 - INFO - __main__ - Step 47853: {'lr': 0.000390643714580264, 'samples': 9187776, 'steps': 47852, 'loss/train': 2.037034511566162} 11/07/2021 03:59:10 - INFO - __main__ - Step 47854: {'lr': 0.000390639327217054, 'samples': 9187968, 'steps': 47853, 'loss/train': 1.6816785335540771} 11/07/2021 03:59:10 - INFO - __main__ - Step 47855: {'lr': 0.000390634939790474, 'samples': 9188160, 'steps': 47854, 'loss/train': 1.4877616167068481} 11/07/2021 03:59:11 - INFO - __main__ - Step 47856: {'lr': 0.00039063055230052605, 'samples': 9188352, 'steps': 47855, 'loss/train': 1.093326210975647} 11/07/2021 03:59:11 - INFO - __main__ - Step 47857: {'lr': 0.00039062616474721217, 'samples': 9188544, 'steps': 47856, 'loss/train': 1.781771183013916} 11/07/2021 03:59:11 - INFO - __main__ - Step 47858: {'lr': 0.00039062177713053436, 'samples': 9188736, 'steps': 47857, 'loss/train': 1.5823264122009277} 11/07/2021 03:59:12 - INFO - __main__ - Step 47859: {'lr': 0.00039061738945049454, 'samples': 9188928, 'steps': 47858, 'loss/train': 4.710807800292969} 11/07/2021 03:59:12 - INFO - __main__ - Step 47860: {'lr': 0.0003906130017070946, 'samples': 9189120, 'steps': 47859, 'loss/train': 1.006072759628296} 11/07/2021 03:59:13 - INFO - __main__ - Step 47861: {'lr': 0.0003906086139003366, 'samples': 9189312, 'steps': 47860, 'loss/train': 1.132124662399292} 11/07/2021 03:59:14 - INFO - __main__ - Step 47862: {'lr': 0.00039060422603022266, 'samples': 9189504, 'steps': 47861, 'loss/train': 1.6611148118972778} 11/07/2021 03:59:14 - INFO - __main__ - Step 47863: {'lr': 0.0003905998380967546, 'samples': 9189696, 'steps': 47862, 'loss/train': 1.276165246963501} 11/07/2021 03:59:14 - INFO - __main__ - Step 47864: {'lr': 0.00039059545009993436, 'samples': 9189888, 'steps': 47863, 'loss/train': 1.817383885383606} 11/07/2021 03:59:15 - INFO - __main__ - Step 47865: {'lr': 0.00039059106203976403, 'samples': 9190080, 'steps': 47864, 'loss/train': 1.0590981245040894} 11/07/2021 03:59:16 - INFO - __main__ - Step 47866: {'lr': 0.00039058667391624546, 'samples': 9190272, 'steps': 47865, 'loss/train': 1.774393916130066} 11/07/2021 03:59:17 - INFO - __main__ - Step 47867: {'lr': 0.00039058228572938074, 'samples': 9190464, 'steps': 47866, 'loss/train': 1.5428546667099} 11/07/2021 03:59:17 - INFO - __main__ - Step 47868: {'lr': 0.00039057789747917184, 'samples': 9190656, 'steps': 47867, 'loss/train': 0.7667586207389832} 11/07/2021 03:59:17 - INFO - __main__ - Step 47869: {'lr': 0.00039057350916562065, 'samples': 9190848, 'steps': 47868, 'loss/train': 1.5777109861373901} 11/07/2021 03:59:18 - INFO - __main__ - Step 47870: {'lr': 0.0003905691207887293, 'samples': 9191040, 'steps': 47869, 'loss/train': 2.0030999183654785} 11/07/2021 03:59:18 - INFO - __main__ - Step 47871: {'lr': 0.00039056473234849964, 'samples': 9191232, 'steps': 47870, 'loss/train': 1.908658742904663} 11/07/2021 03:59:18 - INFO - __main__ - Step 47872: {'lr': 0.0003905603438449337, 'samples': 9191424, 'steps': 47871, 'loss/train': 1.2018303871154785} 11/07/2021 03:59:19 - INFO - __main__ - Step 47873: {'lr': 0.00039055595527803333, 'samples': 9191616, 'steps': 47872, 'loss/train': 1.4805277585983276} 11/07/2021 03:59:20 - INFO - __main__ - Step 47874: {'lr': 0.00039055156664780067, 'samples': 9191808, 'steps': 47873, 'loss/train': 0.9325542449951172} 11/07/2021 03:59:20 - INFO - __main__ - Step 47875: {'lr': 0.00039054717795423765, 'samples': 9192000, 'steps': 47874, 'loss/train': 1.3526469469070435} 11/07/2021 03:59:20 - INFO - __main__ - Step 47876: {'lr': 0.0003905427891973463, 'samples': 9192192, 'steps': 47875, 'loss/train': 1.7165586948394775} 11/07/2021 03:59:21 - INFO - __main__ - Step 47877: {'lr': 0.0003905384003771285, 'samples': 9192384, 'steps': 47876, 'loss/train': 1.5303162336349487} 11/07/2021 03:59:22 - INFO - __main__ - Step 47878: {'lr': 0.00039053401149358625, 'samples': 9192576, 'steps': 47877, 'loss/train': 1.5087376832962036} 11/07/2021 03:59:22 - INFO - __main__ - Step 47879: {'lr': 0.0003905296225467215, 'samples': 9192768, 'steps': 47878, 'loss/train': 1.519276738166809} 11/07/2021 03:59:23 - INFO - __main__ - Step 47880: {'lr': 0.0003905252335365364, 'samples': 9192960, 'steps': 47879, 'loss/train': 1.481291651725769} 11/07/2021 03:59:23 - INFO - __main__ - Step 47881: {'lr': 0.00039052084446303264, 'samples': 9193152, 'steps': 47880, 'loss/train': 1.1391723155975342} 11/07/2021 03:59:23 - INFO - __main__ - Step 47882: {'lr': 0.0003905164553262125, 'samples': 9193344, 'steps': 47881, 'loss/train': 0.9642443656921387} 11/07/2021 03:59:24 - INFO - __main__ - Step 47883: {'lr': 0.0003905120661260777, 'samples': 9193536, 'steps': 47882, 'loss/train': 1.2184077501296997} 11/07/2021 03:59:25 - INFO - __main__ - Step 47884: {'lr': 0.00039050767686263035, 'samples': 9193728, 'steps': 47883, 'loss/train': 1.4414434432983398} 11/07/2021 03:59:25 - INFO - __main__ - Step 47885: {'lr': 0.0003905032875358725, 'samples': 9193920, 'steps': 47884, 'loss/train': 1.474022626876831} 11/07/2021 03:59:26 - INFO - __main__ - Step 47886: {'lr': 0.00039049889814580597, 'samples': 9194112, 'steps': 47885, 'loss/train': 1.4804164171218872} 11/07/2021 03:59:26 - INFO - __main__ - Step 47887: {'lr': 0.00039049450869243276, 'samples': 9194304, 'steps': 47886, 'loss/train': 0.8207853436470032} 11/07/2021 03:59:27 - INFO - __main__ - Step 47888: {'lr': 0.00039049011917575494, 'samples': 9194496, 'steps': 47887, 'loss/train': 1.946252465248108} 11/07/2021 03:59:27 - INFO - __main__ - Step 47889: {'lr': 0.00039048572959577446, 'samples': 9194688, 'steps': 47888, 'loss/train': 1.5656707286834717} 11/07/2021 03:59:28 - INFO - __main__ - Step 47890: {'lr': 0.0003904813399524932, 'samples': 9194880, 'steps': 47889, 'loss/train': 1.0549839735031128} 11/07/2021 03:59:28 - INFO - __main__ - Step 47891: {'lr': 0.0003904769502459133, 'samples': 9195072, 'steps': 47890, 'loss/train': 0.9227809906005859} 11/07/2021 03:59:28 - INFO - __main__ - Step 47892: {'lr': 0.0003904725604760366, 'samples': 9195264, 'steps': 47891, 'loss/train': 1.466859221458435} 11/07/2021 03:59:29 - INFO - __main__ - Step 47893: {'lr': 0.0003904681706428652, 'samples': 9195456, 'steps': 47892, 'loss/train': 1.8413870334625244} 11/07/2021 03:59:30 - INFO - __main__ - Step 47894: {'lr': 0.000390463780746401, 'samples': 9195648, 'steps': 47893, 'loss/train': 0.5830784440040588} 11/07/2021 03:59:30 - INFO - __main__ - Step 47895: {'lr': 0.00039045939078664595, 'samples': 9195840, 'steps': 47894, 'loss/train': 1.316049575805664} 11/07/2021 03:59:30 - INFO - __main__ - Step 47896: {'lr': 0.0003904550007636021, 'samples': 9196032, 'steps': 47895, 'loss/train': 0.8721176385879517} 11/07/2021 03:59:31 - INFO - __main__ - Step 47897: {'lr': 0.00039045061067727126, 'samples': 9196224, 'steps': 47896, 'loss/train': 1.7862621545791626} 11/07/2021 03:59:31 - INFO - __main__ - Step 47898: {'lr': 0.0003904462205276557, 'samples': 9196416, 'steps': 47897, 'loss/train': 1.522489070892334} 11/07/2021 03:59:32 - INFO - __main__ - Step 47899: {'lr': 0.0003904418303147572, 'samples': 9196608, 'steps': 47898, 'loss/train': 1.3559387922286987} 11/07/2021 03:59:33 - INFO - __main__ - Step 47900: {'lr': 0.0003904374400385777, 'samples': 9196800, 'steps': 47899, 'loss/train': 1.9666988849639893} 11/07/2021 03:59:33 - INFO - __main__ - Step 47901: {'lr': 0.0003904330496991194, 'samples': 9196992, 'steps': 47900, 'loss/train': 1.569732427597046} 11/07/2021 03:59:33 - INFO - __main__ - Step 47902: {'lr': 0.00039042865929638404, 'samples': 9197184, 'steps': 47901, 'loss/train': 1.6634944677352905} 11/07/2021 03:59:34 - INFO - __main__ - Step 47903: {'lr': 0.00039042426883037376, 'samples': 9197376, 'steps': 47902, 'loss/train': 1.6189223527908325} 11/07/2021 03:59:35 - INFO - __main__ - Step 47904: {'lr': 0.00039041987830109036, 'samples': 9197568, 'steps': 47903, 'loss/train': 1.027011752128601} 11/07/2021 03:59:35 - INFO - __main__ - Step 47905: {'lr': 0.000390415487708536, 'samples': 9197760, 'steps': 47904, 'loss/train': 1.4380768537521362} 11/07/2021 03:59:35 - INFO - __main__ - Step 47906: {'lr': 0.0003904110970527126, 'samples': 9197952, 'steps': 47905, 'loss/train': 1.5336869955062866} 11/07/2021 03:59:36 - INFO - __main__ - Step 47907: {'lr': 0.00039040670633362206, 'samples': 9198144, 'steps': 47906, 'loss/train': 1.5007959604263306} 11/07/2021 03:59:36 - INFO - __main__ - Step 47908: {'lr': 0.00039040231555126647, 'samples': 9198336, 'steps': 47907, 'loss/train': 1.2818505764007568} 11/07/2021 03:59:37 - INFO - __main__ - Step 47909: {'lr': 0.0003903979247056478, 'samples': 9198528, 'steps': 47908, 'loss/train': 1.2131530046463013} 11/07/2021 03:59:37 - INFO - __main__ - Step 47910: {'lr': 0.00039039353379676796, 'samples': 9198720, 'steps': 47909, 'loss/train': 1.4607211351394653} 11/07/2021 03:59:38 - INFO - __main__ - Step 47911: {'lr': 0.0003903891428246289, 'samples': 9198912, 'steps': 47910, 'loss/train': 1.980078101158142} 11/07/2021 03:59:38 - INFO - __main__ - Step 47912: {'lr': 0.0003903847517892328, 'samples': 9199104, 'steps': 47911, 'loss/train': 1.5750683546066284} 11/07/2021 03:59:38 - INFO - __main__ - Step 47913: {'lr': 0.00039038036069058137, 'samples': 9199296, 'steps': 47912, 'loss/train': 1.5380631685256958} 11/07/2021 03:59:39 - INFO - __main__ - Step 47914: {'lr': 0.0003903759695286768, 'samples': 9199488, 'steps': 47913, 'loss/train': 1.278486728668213} 11/07/2021 03:59:40 - INFO - __main__ - Step 47915: {'lr': 0.0003903715783035209, 'samples': 9199680, 'steps': 47914, 'loss/train': 0.8846134543418884} 11/07/2021 03:59:40 - INFO - __main__ - Step 47916: {'lr': 0.00039036718701511577, 'samples': 9199872, 'steps': 47915, 'loss/train': 1.2928320169448853} 11/07/2021 03:59:41 - INFO - __main__ - Step 47917: {'lr': 0.00039036279566346334, 'samples': 9200064, 'steps': 47916, 'loss/train': 1.5142719745635986} 11/07/2021 03:59:41 - INFO - __main__ - Step 47918: {'lr': 0.0003903584042485656, 'samples': 9200256, 'steps': 47917, 'loss/train': 0.7032201290130615} 11/07/2021 03:59:42 - INFO - __main__ - Step 47919: {'lr': 0.0003903540127704246, 'samples': 9200448, 'steps': 47918, 'loss/train': 1.6926442384719849} 11/07/2021 03:59:42 - INFO - __main__ - Step 47920: {'lr': 0.0003903496212290422, 'samples': 9200640, 'steps': 47919, 'loss/train': 0.8472903370857239} 11/07/2021 03:59:43 - INFO - __main__ - Step 47921: {'lr': 0.00039034522962442045, 'samples': 9200832, 'steps': 47920, 'loss/train': 0.6675543189048767} 11/07/2021 03:59:43 - INFO - __main__ - Step 47922: {'lr': 0.0003903408379565612, 'samples': 9201024, 'steps': 47921, 'loss/train': 1.4430902004241943} 11/07/2021 03:59:43 - INFO - __main__ - Step 47923: {'lr': 0.0003903364462254666, 'samples': 9201216, 'steps': 47922, 'loss/train': 1.4382398128509521} 11/07/2021 03:59:44 - INFO - __main__ - Step 47924: {'lr': 0.0003903320544311386, 'samples': 9201408, 'steps': 47923, 'loss/train': 1.2611238956451416} 11/07/2021 03:59:45 - INFO - __main__ - Step 47925: {'lr': 0.0003903276625735791, 'samples': 9201600, 'steps': 47924, 'loss/train': 1.5550249814987183} 11/07/2021 03:59:45 - INFO - __main__ - Step 47926: {'lr': 0.00039032327065279015, 'samples': 9201792, 'steps': 47925, 'loss/train': 0.6004175543785095} 11/07/2021 03:59:45 - INFO - __main__ - Step 47927: {'lr': 0.0003903188786687737, 'samples': 9201984, 'steps': 47926, 'loss/train': 1.0430054664611816} 11/07/2021 03:59:46 - INFO - __main__ - Step 47928: {'lr': 0.0003903144866215317, 'samples': 9202176, 'steps': 47927, 'loss/train': 1.3700852394104004} 11/07/2021 03:59:47 - INFO - __main__ - Step 47929: {'lr': 0.0003903100945110661, 'samples': 9202368, 'steps': 47928, 'loss/train': 1.3961900472640991} 11/07/2021 03:59:47 - INFO - __main__ - Step 47930: {'lr': 0.00039030570233737903, 'samples': 9202560, 'steps': 47929, 'loss/train': 1.1518902778625488} 11/07/2021 03:59:48 - INFO - __main__ - Step 47931: {'lr': 0.0003903013101004724, 'samples': 9202752, 'steps': 47930, 'loss/train': 1.6772443056106567} 11/07/2021 03:59:48 - INFO - __main__ - Step 47932: {'lr': 0.00039029691780034814, 'samples': 9202944, 'steps': 47931, 'loss/train': 1.384472370147705} 11/07/2021 03:59:48 - INFO - __main__ - Step 47933: {'lr': 0.00039029252543700823, 'samples': 9203136, 'steps': 47932, 'loss/train': 1.481557846069336} 11/07/2021 03:59:49 - INFO - __main__ - Step 47934: {'lr': 0.0003902881330104546, 'samples': 9203328, 'steps': 47933, 'loss/train': 1.4954843521118164} 11/07/2021 03:59:50 - INFO - __main__ - Step 47935: {'lr': 0.00039028374052068937, 'samples': 9203520, 'steps': 47934, 'loss/train': 1.54979407787323} 11/07/2021 03:59:50 - INFO - __main__ - Step 47936: {'lr': 0.0003902793479677145, 'samples': 9203712, 'steps': 47935, 'loss/train': 1.6774086952209473} 11/07/2021 03:59:50 - INFO - __main__ - Step 47937: {'lr': 0.00039027495535153185, 'samples': 9203904, 'steps': 47936, 'loss/train': 1.4777233600616455} 11/07/2021 03:59:51 - INFO - __main__ - Step 47938: {'lr': 0.0003902705626721435, 'samples': 9204096, 'steps': 47937, 'loss/train': 0.9641197323799133} 11/07/2021 03:59:51 - INFO - __main__ - Step 47939: {'lr': 0.00039026616992955145, 'samples': 9204288, 'steps': 47938, 'loss/train': 1.1526705026626587} 11/07/2021 03:59:52 - INFO - __main__ - Step 47940: {'lr': 0.0003902617771237575, 'samples': 9204480, 'steps': 47939, 'loss/train': 1.3181544542312622} 11/07/2021 03:59:52 - INFO - __main__ - Step 47941: {'lr': 0.0003902573842547639, 'samples': 9204672, 'steps': 47940, 'loss/train': 1.266067385673523} 11/07/2021 03:59:53 - INFO - __main__ - Step 47942: {'lr': 0.00039025299132257243, 'samples': 9204864, 'steps': 47941, 'loss/train': 1.1862119436264038} 11/07/2021 03:59:53 - INFO - __main__ - Step 47943: {'lr': 0.00039024859832718505, 'samples': 9205056, 'steps': 47942, 'loss/train': 1.8826212882995605} 11/07/2021 03:59:54 - INFO - __main__ - Step 47944: {'lr': 0.0003902442052686039, 'samples': 9205248, 'steps': 47943, 'loss/train': 1.5753989219665527} 11/07/2021 03:59:55 - INFO - __main__ - Step 47945: {'lr': 0.00039023981214683087, 'samples': 9205440, 'steps': 47944, 'loss/train': 1.1622357368469238} 11/07/2021 03:59:55 - INFO - __main__ - Step 47946: {'lr': 0.0003902354189618679, 'samples': 9205632, 'steps': 47945, 'loss/train': 1.7712570428848267} 11/07/2021 03:59:55 - INFO - __main__ - Step 47947: {'lr': 0.00039023102571371707, 'samples': 9205824, 'steps': 47946, 'loss/train': 1.6194078922271729} 11/07/2021 03:59:56 - INFO - __main__ - Step 47948: {'lr': 0.0003902266324023803, 'samples': 9206016, 'steps': 47947, 'loss/train': 1.4775843620300293} 11/07/2021 03:59:56 - INFO - __main__ - Step 47949: {'lr': 0.00039022223902785954, 'samples': 9206208, 'steps': 47948, 'loss/train': 1.5144954919815063} 11/07/2021 03:59:57 - INFO - __main__ - Step 47950: {'lr': 0.0003902178455901568, 'samples': 9206400, 'steps': 47949, 'loss/train': 1.2559685707092285} 11/07/2021 03:59:57 - INFO - __main__ - Step 47951: {'lr': 0.00039021345208927404, 'samples': 9206592, 'steps': 47950, 'loss/train': 1.4207682609558105} 11/07/2021 03:59:58 - INFO - __main__ - Step 47952: {'lr': 0.0003902090585252133, 'samples': 9206784, 'steps': 47951, 'loss/train': 1.2095712423324585} 11/07/2021 03:59:58 - INFO - __main__ - Step 47953: {'lr': 0.0003902046648979766, 'samples': 9206976, 'steps': 47952, 'loss/train': 1.166861891746521} 11/07/2021 03:59:59 - INFO - __main__ - Step 47954: {'lr': 0.00039020027120756573, 'samples': 9207168, 'steps': 47953, 'loss/train': 0.9656014442443848} 11/07/2021 03:59:59 - INFO - __main__ - Step 47955: {'lr': 0.00039019587745398276, 'samples': 9207360, 'steps': 47954, 'loss/train': 1.032138466835022} 11/07/2021 04:00:00 - INFO - __main__ - Step 47956: {'lr': 0.0003901914836372298, 'samples': 9207552, 'steps': 47955, 'loss/train': 1.3611185550689697} 11/07/2021 04:00:00 - INFO - __main__ - Step 47957: {'lr': 0.00039018708975730864, 'samples': 9207744, 'steps': 47956, 'loss/train': 1.2215360403060913} 11/07/2021 04:00:01 - INFO - __main__ - Step 47958: {'lr': 0.0003901826958142214, 'samples': 9207936, 'steps': 47957, 'loss/train': 1.4965671300888062} 11/07/2021 04:00:01 - INFO - __main__ - Step 47959: {'lr': 0.0003901783018079699, 'samples': 9208128, 'steps': 47958, 'loss/train': 1.5038868188858032} 11/07/2021 04:00:02 - INFO - __main__ - Step 47960: {'lr': 0.0003901739077385563, 'samples': 9208320, 'steps': 47959, 'loss/train': 1.7007519006729126} 11/07/2021 04:00:02 - INFO - __main__ - Step 47961: {'lr': 0.0003901695136059825, 'samples': 9208512, 'steps': 47960, 'loss/train': 1.1992660760879517} 11/07/2021 04:00:03 - INFO - __main__ - Step 47962: {'lr': 0.00039016511941025045, 'samples': 9208704, 'steps': 47961, 'loss/train': 2.0137312412261963} 11/07/2021 04:00:03 - INFO - __main__ - Step 47963: {'lr': 0.0003901607251513622, 'samples': 9208896, 'steps': 47962, 'loss/train': 1.7977275848388672} 11/07/2021 04:00:03 - INFO - __main__ - Step 47964: {'lr': 0.0003901563308293197, 'samples': 9209088, 'steps': 47963, 'loss/train': 1.5365867614746094} 11/07/2021 04:00:04 - INFO - __main__ - Step 47965: {'lr': 0.0003901519364441248, 'samples': 9209280, 'steps': 47964, 'loss/train': 1.5069364309310913} 11/07/2021 04:00:05 - INFO - __main__ - Step 47966: {'lr': 0.0003901475419957797, 'samples': 9209472, 'steps': 47965, 'loss/train': 1.3223764896392822} 11/07/2021 04:00:05 - INFO - __main__ - Step 47967: {'lr': 0.0003901431474842863, 'samples': 9209664, 'steps': 47966, 'loss/train': 1.5235975980758667} 11/07/2021 04:00:05 - INFO - __main__ - Step 47968: {'lr': 0.0003901387529096465, 'samples': 9209856, 'steps': 47967, 'loss/train': 1.5078519582748413} 11/07/2021 04:00:06 - INFO - __main__ - Step 47969: {'lr': 0.0003901343582718624, 'samples': 9210048, 'steps': 47968, 'loss/train': 1.3114722967147827} 11/07/2021 04:00:06 - INFO - __main__ - Step 47970: {'lr': 0.0003901299635709359, 'samples': 9210240, 'steps': 47969, 'loss/train': 1.4865241050720215} 11/07/2021 04:00:07 - INFO - __main__ - Step 47971: {'lr': 0.00039012556880686897, 'samples': 9210432, 'steps': 47970, 'loss/train': 0.869248628616333} 11/07/2021 04:00:08 - INFO - __main__ - Step 47972: {'lr': 0.00039012117397966363, 'samples': 9210624, 'steps': 47971, 'loss/train': 1.566467046737671} 11/07/2021 04:00:08 - INFO - __main__ - Step 47973: {'lr': 0.00039011677908932184, 'samples': 9210816, 'steps': 47972, 'loss/train': 1.6380442380905151} 11/07/2021 04:00:08 - INFO - __main__ - Step 47974: {'lr': 0.00039011238413584566, 'samples': 9211008, 'steps': 47973, 'loss/train': 2.1799044609069824} 11/07/2021 04:00:09 - INFO - __main__ - Step 47975: {'lr': 0.0003901079891192369, 'samples': 9211200, 'steps': 47974, 'loss/train': 0.16520646214485168} 11/07/2021 04:00:10 - INFO - __main__ - Step 47976: {'lr': 0.00039010359403949776, 'samples': 9211392, 'steps': 47975, 'loss/train': 1.1269757747650146} 11/07/2021 04:00:10 - INFO - __main__ - Step 47977: {'lr': 0.00039009919889663005, 'samples': 9211584, 'steps': 47976, 'loss/train': 2.0871849060058594} 11/07/2021 04:00:10 - INFO - __main__ - Step 47978: {'lr': 0.00039009480369063575, 'samples': 9211776, 'steps': 47977, 'loss/train': 1.1506580114364624} 11/07/2021 04:00:11 - INFO - __main__ - Step 47979: {'lr': 0.000390090408421517, 'samples': 9211968, 'steps': 47978, 'loss/train': 1.7109673023223877} 11/07/2021 04:00:11 - INFO - __main__ - Step 47980: {'lr': 0.0003900860130892756, 'samples': 9212160, 'steps': 47979, 'loss/train': 1.2497364282608032} 11/07/2021 04:00:12 - INFO - __main__ - Step 47981: {'lr': 0.0003900816176939136, 'samples': 9212352, 'steps': 47980, 'loss/train': 1.489717960357666} 11/07/2021 04:00:12 - INFO - __main__ - Step 47982: {'lr': 0.000390077222235433, 'samples': 9212544, 'steps': 47981, 'loss/train': 1.4342354536056519} 11/07/2021 04:00:13 - INFO - __main__ - Step 47983: {'lr': 0.0003900728267138357, 'samples': 9212736, 'steps': 47982, 'loss/train': 1.5929369926452637} 11/07/2021 04:00:13 - INFO - __main__ - Step 47984: {'lr': 0.0003900684311291238, 'samples': 9212928, 'steps': 47983, 'loss/train': 0.9831579327583313} 11/07/2021 04:00:14 - INFO - __main__ - Step 47985: {'lr': 0.0003900640354812992, 'samples': 9213120, 'steps': 47984, 'loss/train': 1.508144736289978} 11/07/2021 04:00:15 - INFO - __main__ - Step 47986: {'lr': 0.000390059639770364, 'samples': 9213312, 'steps': 47985, 'loss/train': 1.419266700744629} 11/07/2021 04:00:15 - INFO - __main__ - Step 47987: {'lr': 0.0003900552439963201, 'samples': 9213504, 'steps': 47986, 'loss/train': 1.4877800941467285} 11/07/2021 04:00:15 - INFO - __main__ - Step 47988: {'lr': 0.0003900508481591694, 'samples': 9213696, 'steps': 47987, 'loss/train': 1.4193735122680664} 11/07/2021 04:00:16 - INFO - __main__ - Step 47989: {'lr': 0.00039004645225891387, 'samples': 9213888, 'steps': 47988, 'loss/train': 1.3641928434371948} 11/07/2021 04:00:16 - INFO - __main__ - Step 47990: {'lr': 0.0003900420562955557, 'samples': 9214080, 'steps': 47989, 'loss/train': 1.297823429107666} 11/07/2021 04:00:16 - INFO - __main__ - Step 47991: {'lr': 0.0003900376602690966, 'samples': 9214272, 'steps': 47990, 'loss/train': 1.5094558000564575} 11/07/2021 04:00:17 - INFO - __main__ - Step 47992: {'lr': 0.0003900332641795388, 'samples': 9214464, 'steps': 47991, 'loss/train': 1.933498740196228} 11/07/2021 04:00:18 - INFO - __main__ - Step 47993: {'lr': 0.0003900288680268842, 'samples': 9214656, 'steps': 47992, 'loss/train': 1.278651237487793} 11/07/2021 04:00:18 - INFO - __main__ - Step 47994: {'lr': 0.00039002447181113464, 'samples': 9214848, 'steps': 47993, 'loss/train': 2.1139328479766846} 11/07/2021 04:00:18 - INFO - __main__ - Step 47995: {'lr': 0.0003900200755322923, 'samples': 9215040, 'steps': 47994, 'loss/train': 1.141119360923767} 11/07/2021 04:00:19 - INFO - __main__ - Step 47996: {'lr': 0.0003900156791903591, 'samples': 9215232, 'steps': 47995, 'loss/train': 1.348052740097046} 11/07/2021 04:00:20 - INFO - __main__ - Step 47997: {'lr': 0.0003900112827853369, 'samples': 9215424, 'steps': 47996, 'loss/train': 1.3135474920272827} 11/07/2021 04:00:20 - INFO - __main__ - Step 47998: {'lr': 0.0003900068863172278, 'samples': 9215616, 'steps': 47997, 'loss/train': 1.4085980653762817} 11/07/2021 04:00:21 - INFO - __main__ - Step 47999: {'lr': 0.0003900024897860338, 'samples': 9215808, 'steps': 47998, 'loss/train': 1.8113974332809448} 11/07/2021 04:00:21 - INFO - __main__ - Step 48000: {'lr': 0.00038999809319175684, 'samples': 9216000, 'steps': 47999, 'loss/train': 0.4227254092693329} 11/07/2021 04:00:21 - INFO - __main__ - Step 48001: {'lr': 0.0003899936965343989, 'samples': 9216192, 'steps': 48000, 'loss/train': 1.8167829513549805} 11/07/2021 04:00:22 - INFO - __main__ - Step 48002: {'lr': 0.00038998929981396194, 'samples': 9216384, 'steps': 48001, 'loss/train': 1.4759230613708496} 11/07/2021 04:00:23 - INFO - __main__ - Step 48003: {'lr': 0.0003899849030304479, 'samples': 9216576, 'steps': 48002, 'loss/train': 2.544011116027832} 11/07/2021 04:00:23 - INFO - __main__ - Step 48004: {'lr': 0.0003899805061838589, 'samples': 9216768, 'steps': 48003, 'loss/train': 1.7079887390136719} 11/07/2021 04:00:24 - INFO - __main__ - Step 48005: {'lr': 0.0003899761092741968, 'samples': 9216960, 'steps': 48004, 'loss/train': 1.6452378034591675} 11/07/2021 04:00:24 - INFO - __main__ - Step 48006: {'lr': 0.00038997171230146366, 'samples': 9217152, 'steps': 48005, 'loss/train': 1.3807014226913452} 11/07/2021 04:00:24 - INFO - __main__ - Step 48007: {'lr': 0.0003899673152656614, 'samples': 9217344, 'steps': 48006, 'loss/train': 1.7523833513259888} 11/07/2021 04:00:25 - INFO - __main__ - Step 48008: {'lr': 0.0003899629181667921, 'samples': 9217536, 'steps': 48007, 'loss/train': 1.784700632095337} 11/07/2021 04:00:26 - INFO - __main__ - Step 48009: {'lr': 0.0003899585210048576, 'samples': 9217728, 'steps': 48008, 'loss/train': 1.3834120035171509} 11/07/2021 04:00:26 - INFO - __main__ - Step 48010: {'lr': 0.0003899541237798599, 'samples': 9217920, 'steps': 48009, 'loss/train': 1.6663899421691895} 11/07/2021 04:00:26 - INFO - __main__ - Step 48011: {'lr': 0.0003899497264918012, 'samples': 9218112, 'steps': 48010, 'loss/train': 1.1437512636184692} 11/07/2021 04:00:27 - INFO - __main__ - Step 48012: {'lr': 0.00038994532914068313, 'samples': 9218304, 'steps': 48011, 'loss/train': 1.5247350931167603} 11/07/2021 04:00:28 - INFO - __main__ - Step 48013: {'lr': 0.00038994093172650804, 'samples': 9218496, 'steps': 48012, 'loss/train': 1.5147181749343872} 11/07/2021 04:00:28 - INFO - __main__ - Step 48014: {'lr': 0.00038993653424927754, 'samples': 9218688, 'steps': 48013, 'loss/train': 1.5079004764556885} 11/07/2021 04:00:29 - INFO - __main__ - Step 48015: {'lr': 0.00038993213670899385, 'samples': 9218880, 'steps': 48014, 'loss/train': 1.1028234958648682} 11/07/2021 04:00:29 - INFO - __main__ - Step 48016: {'lr': 0.000389927739105659, 'samples': 9219072, 'steps': 48015, 'loss/train': 1.5839699506759644} 11/07/2021 04:00:29 - INFO - __main__ - Step 48017: {'lr': 0.0003899233414392748, 'samples': 9219264, 'steps': 48016, 'loss/train': 1.3747262954711914} 11/07/2021 04:00:30 - INFO - __main__ - Step 48018: {'lr': 0.0003899189437098433, 'samples': 9219456, 'steps': 48017, 'loss/train': 1.2862333059310913} 11/07/2021 04:00:31 - INFO - __main__ - Step 48019: {'lr': 0.00038991454591736643, 'samples': 9219648, 'steps': 48018, 'loss/train': 1.1262879371643066} 11/07/2021 04:00:31 - INFO - __main__ - Step 48020: {'lr': 0.00038991014806184635, 'samples': 9219840, 'steps': 48019, 'loss/train': 1.2946993112564087} 11/07/2021 04:00:31 - INFO - __main__ - Step 48021: {'lr': 0.0003899057501432848, 'samples': 9220032, 'steps': 48020, 'loss/train': 1.6199647188186646} 11/07/2021 04:00:32 - INFO - __main__ - Step 48022: {'lr': 0.0003899013521616839, 'samples': 9220224, 'steps': 48021, 'loss/train': 1.3253999948501587} 11/07/2021 04:00:33 - INFO - __main__ - Step 48023: {'lr': 0.0003898969541170456, 'samples': 9220416, 'steps': 48022, 'loss/train': 1.4175102710723877} 11/07/2021 04:00:33 - INFO - __main__ - Step 48024: {'lr': 0.0003898925560093719, 'samples': 9220608, 'steps': 48023, 'loss/train': 1.3709696531295776} 11/07/2021 04:00:33 - INFO - __main__ - Step 48025: {'lr': 0.00038988815783866485, 'samples': 9220800, 'steps': 48024, 'loss/train': 1.568740725517273} 11/07/2021 04:00:34 - INFO - __main__ - Step 48026: {'lr': 0.00038988375960492626, 'samples': 9220992, 'steps': 48025, 'loss/train': 1.8158372640609741} 11/07/2021 04:00:34 - INFO - __main__ - Step 48027: {'lr': 0.0003898793613081583, 'samples': 9221184, 'steps': 48026, 'loss/train': 1.3739103078842163} 11/07/2021 04:00:36 - INFO - __main__ - Step 48028: {'lr': 0.0003898749629483628, 'samples': 9221376, 'steps': 48027, 'loss/train': 2.0244140625} 11/07/2021 04:00:36 - INFO - __main__ - Step 48029: {'lr': 0.00038987056452554177, 'samples': 9221568, 'steps': 48028, 'loss/train': 1.6520586013793945} 11/07/2021 04:00:36 - INFO - __main__ - Step 48030: {'lr': 0.0003898661660396973, 'samples': 9221760, 'steps': 48029, 'loss/train': 1.8628299236297607} 11/07/2021 04:00:37 - INFO - __main__ - Step 48031: {'lr': 0.00038986176749083117, 'samples': 9221952, 'steps': 48030, 'loss/train': 0.3103807270526886} 11/07/2021 04:00:37 - INFO - __main__ - Step 48032: {'lr': 0.0003898573688789456, 'samples': 9222144, 'steps': 48031, 'loss/train': 0.3190688192844391} 11/07/2021 04:00:38 - INFO - __main__ - Step 48033: {'lr': 0.0003898529702040424, 'samples': 9222336, 'steps': 48032, 'loss/train': 1.669013261795044} 11/07/2021 04:00:38 - INFO - __main__ - Step 48034: {'lr': 0.00038984857146612365, 'samples': 9222528, 'steps': 48033, 'loss/train': 1.5042675733566284} 11/07/2021 04:00:39 - INFO - __main__ - Step 48035: {'lr': 0.00038984417266519126, 'samples': 9222720, 'steps': 48034, 'loss/train': 1.2486090660095215} 11/07/2021 04:00:39 - INFO - __main__ - Step 48036: {'lr': 0.00038983977380124726, 'samples': 9222912, 'steps': 48035, 'loss/train': 1.455985426902771} 11/07/2021 04:00:39 - INFO - __main__ - Step 48037: {'lr': 0.0003898353748742936, 'samples': 9223104, 'steps': 48036, 'loss/train': 1.2599023580551147} 11/07/2021 04:00:40 - INFO - __main__ - Step 48038: {'lr': 0.00038983097588433225, 'samples': 9223296, 'steps': 48037, 'loss/train': 1.5512021780014038} 11/07/2021 04:00:41 - INFO - __main__ - Step 48039: {'lr': 0.00038982657683136524, 'samples': 9223488, 'steps': 48038, 'loss/train': 1.6784405708312988} 11/07/2021 04:00:41 - INFO - __main__ - Step 48040: {'lr': 0.00038982217771539466, 'samples': 9223680, 'steps': 48039, 'loss/train': 1.306701898574829} 11/07/2021 04:00:42 - INFO - __main__ - Step 48041: {'lr': 0.0003898177785364222, 'samples': 9223872, 'steps': 48040, 'loss/train': 1.0747727155685425} 11/07/2021 04:00:42 - INFO - __main__ - Step 48042: {'lr': 0.00038981337929445004, 'samples': 9224064, 'steps': 48041, 'loss/train': 1.4584308862686157} 11/07/2021 04:00:42 - INFO - __main__ - Step 48043: {'lr': 0.0003898089799894802, 'samples': 9224256, 'steps': 48042, 'loss/train': 1.5404750108718872} 11/07/2021 04:00:43 - INFO - __main__ - Step 48044: {'lr': 0.0003898045806215145, 'samples': 9224448, 'steps': 48043, 'loss/train': 1.6170690059661865} 11/07/2021 04:00:44 - INFO - __main__ - Step 48045: {'lr': 0.00038980018119055506, 'samples': 9224640, 'steps': 48044, 'loss/train': 1.2645432949066162} 11/07/2021 04:00:44 - INFO - __main__ - Step 48046: {'lr': 0.00038979578169660384, 'samples': 9224832, 'steps': 48045, 'loss/train': 1.5899505615234375} 11/07/2021 04:00:44 - INFO - __main__ - Step 48047: {'lr': 0.0003897913821396628, 'samples': 9225024, 'steps': 48046, 'loss/train': 1.07871675491333} 11/07/2021 04:00:45 - INFO - __main__ - Step 48048: {'lr': 0.0003897869825197339, 'samples': 9225216, 'steps': 48047, 'loss/train': 2.3908605575561523} 11/07/2021 04:00:46 - INFO - __main__ - Step 48049: {'lr': 0.0003897825828368191, 'samples': 9225408, 'steps': 48048, 'loss/train': 1.2328208684921265} 11/07/2021 04:00:46 - INFO - __main__ - Step 48050: {'lr': 0.0003897781830909204, 'samples': 9225600, 'steps': 48049, 'loss/train': 1.1049468517303467} 11/07/2021 04:00:46 - INFO - __main__ - Step 48051: {'lr': 0.00038977378328203987, 'samples': 9225792, 'steps': 48050, 'loss/train': 1.367971658706665} 11/07/2021 04:00:47 - INFO - __main__ - Step 48052: {'lr': 0.0003897693834101794, 'samples': 9225984, 'steps': 48051, 'loss/train': 1.1159980297088623} 11/07/2021 04:00:47 - INFO - __main__ - Step 48053: {'lr': 0.00038976498347534106, 'samples': 9226176, 'steps': 48052, 'loss/train': 1.4800593852996826} 11/07/2021 04:00:48 - INFO - __main__ - Step 48054: {'lr': 0.0003897605834775267, 'samples': 9226368, 'steps': 48053, 'loss/train': 1.5713735818862915} 11/07/2021 04:00:49 - INFO - __main__ - Step 48055: {'lr': 0.00038975618341673845, 'samples': 9226560, 'steps': 48054, 'loss/train': 1.028491735458374} 11/07/2021 04:00:49 - INFO - __main__ - Step 48056: {'lr': 0.0003897517832929782, 'samples': 9226752, 'steps': 48055, 'loss/train': 1.6298503875732422} 11/07/2021 04:00:49 - INFO - __main__ - Step 48057: {'lr': 0.00038974738310624797, 'samples': 9226944, 'steps': 48056, 'loss/train': 1.0520175695419312} 11/07/2021 04:00:50 - INFO - __main__ - Step 48058: {'lr': 0.00038974298285654967, 'samples': 9227136, 'steps': 48057, 'loss/train': 1.0178232192993164} 11/07/2021 04:00:51 - INFO - __main__ - Step 48059: {'lr': 0.0003897385825438854, 'samples': 9227328, 'steps': 48058, 'loss/train': 1.4590984582901} 11/07/2021 04:00:51 - INFO - __main__ - Step 48060: {'lr': 0.0003897341821682571, 'samples': 9227520, 'steps': 48059, 'loss/train': 0.2315509021282196} 11/07/2021 04:00:51 - INFO - __main__ - Step 48061: {'lr': 0.0003897297817296667, 'samples': 9227712, 'steps': 48060, 'loss/train': 1.2649005651474} 11/07/2021 04:00:52 - INFO - __main__ - Step 48062: {'lr': 0.00038972538122811613, 'samples': 9227904, 'steps': 48061, 'loss/train': 1.64288330078125} 11/07/2021 04:00:52 - INFO - __main__ - Step 48063: {'lr': 0.00038972098066360753, 'samples': 9228096, 'steps': 48062, 'loss/train': 1.4780681133270264} 11/07/2021 04:00:52 - INFO - __main__ - Step 48064: {'lr': 0.0003897165800361427, 'samples': 9228288, 'steps': 48063, 'loss/train': 1.5908677577972412} 11/07/2021 04:00:53 - INFO - __main__ - Step 48065: {'lr': 0.0003897121793457239, 'samples': 9228480, 'steps': 48064, 'loss/train': 1.5628256797790527} 11/07/2021 04:00:54 - INFO - __main__ - Step 48066: {'lr': 0.0003897077785923529, 'samples': 9228672, 'steps': 48065, 'loss/train': 1.6030704975128174} 11/07/2021 04:00:54 - INFO - __main__ - Step 48067: {'lr': 0.0003897033777760318, 'samples': 9228864, 'steps': 48066, 'loss/train': 1.1855614185333252} 11/07/2021 04:00:55 - INFO - __main__ - Step 48068: {'lr': 0.0003896989768967624, 'samples': 9229056, 'steps': 48067, 'loss/train': 1.5664584636688232} 11/07/2021 04:00:55 - INFO - __main__ - Step 48069: {'lr': 0.0003896945759545468, 'samples': 9229248, 'steps': 48068, 'loss/train': 1.6907851696014404} 11/07/2021 04:00:56 - INFO - __main__ - Step 48070: {'lr': 0.000389690174949387, 'samples': 9229440, 'steps': 48069, 'loss/train': 1.2275843620300293} 11/07/2021 04:00:56 - INFO - __main__ - Step 48071: {'lr': 0.00038968577388128503, 'samples': 9229632, 'steps': 48070, 'loss/train': 1.7831544876098633} 11/07/2021 04:00:57 - INFO - __main__ - Step 48072: {'lr': 0.00038968137275024274, 'samples': 9229824, 'steps': 48071, 'loss/train': 1.374072551727295} 11/07/2021 04:00:57 - INFO - __main__ - Step 48073: {'lr': 0.0003896769715562622, 'samples': 9230016, 'steps': 48072, 'loss/train': 1.4499943256378174} 11/07/2021 04:00:57 - INFO - __main__ - Step 48074: {'lr': 0.0003896725702993453, 'samples': 9230208, 'steps': 48073, 'loss/train': 0.9299537539482117} 11/07/2021 04:00:58 - INFO - __main__ - Step 48075: {'lr': 0.0003896681689794942, 'samples': 9230400, 'steps': 48074, 'loss/train': 1.2336264848709106} 11/07/2021 04:00:59 - INFO - __main__ - Step 48076: {'lr': 0.00038966376759671075, 'samples': 9230592, 'steps': 48075, 'loss/train': 1.3669625520706177} 11/07/2021 04:00:59 - INFO - __main__ - Step 48077: {'lr': 0.00038965936615099694, 'samples': 9230784, 'steps': 48076, 'loss/train': 1.556752324104309} 11/07/2021 04:00:59 - INFO - __main__ - Step 48078: {'lr': 0.0003896549646423548, 'samples': 9230976, 'steps': 48077, 'loss/train': 1.269991159439087} 11/07/2021 04:01:00 - INFO - __main__ - Step 48079: {'lr': 0.0003896505630707863, 'samples': 9231168, 'steps': 48078, 'loss/train': 1.0601564645767212} 11/07/2021 04:01:01 - INFO - __main__ - Step 48080: {'lr': 0.00038964616143629337, 'samples': 9231360, 'steps': 48079, 'loss/train': 1.5609855651855469} 11/07/2021 04:01:01 - INFO - __main__ - Step 48081: {'lr': 0.00038964175973887807, 'samples': 9231552, 'steps': 48080, 'loss/train': 1.0855125188827515} 11/07/2021 04:01:02 - INFO - __main__ - Step 48082: {'lr': 0.0003896373579785423, 'samples': 9231744, 'steps': 48081, 'loss/train': 1.7724723815917969} 11/07/2021 04:01:02 - INFO - __main__ - Step 48083: {'lr': 0.00038963295615528803, 'samples': 9231936, 'steps': 48082, 'loss/train': 1.3519500494003296} 11/07/2021 04:01:02 - INFO - __main__ - Step 48084: {'lr': 0.00038962855426911746, 'samples': 9232128, 'steps': 48083, 'loss/train': 1.6795637607574463} 11/07/2021 04:01:03 - INFO - __main__ - Step 48085: {'lr': 0.00038962415232003233, 'samples': 9232320, 'steps': 48084, 'loss/train': 1.6219850778579712} 11/07/2021 04:01:04 - INFO - __main__ - Step 48086: {'lr': 0.00038961975030803474, 'samples': 9232512, 'steps': 48085, 'loss/train': 1.5682246685028076} 11/07/2021 04:01:04 - INFO - __main__ - Step 48087: {'lr': 0.00038961534823312664, 'samples': 9232704, 'steps': 48086, 'loss/train': 1.5759390592575073} 11/07/2021 04:01:04 - INFO - __main__ - Step 48088: {'lr': 0.00038961094609531, 'samples': 9232896, 'steps': 48087, 'loss/train': 1.3574557304382324} 11/07/2021 04:01:05 - INFO - __main__ - Step 48089: {'lr': 0.00038960654389458684, 'samples': 9233088, 'steps': 48088, 'loss/train': 1.263695478439331} 11/07/2021 04:01:05 - INFO - __main__ - Step 48090: {'lr': 0.0003896021416309591, 'samples': 9233280, 'steps': 48089, 'loss/train': 1.3953235149383545} 11/07/2021 04:01:06 - INFO - __main__ - Step 48091: {'lr': 0.0003895977393044288, 'samples': 9233472, 'steps': 48090, 'loss/train': 1.786818504333496} 11/07/2021 04:01:06 - INFO - __main__ - Step 48092: {'lr': 0.00038959333691499794, 'samples': 9233664, 'steps': 48091, 'loss/train': 1.076110601425171} 11/07/2021 04:01:07 - INFO - __main__ - Step 48093: {'lr': 0.00038958893446266844, 'samples': 9233856, 'steps': 48092, 'loss/train': 1.581133246421814} 11/07/2021 04:01:07 - INFO - __main__ - Step 48094: {'lr': 0.00038958453194744237, 'samples': 9234048, 'steps': 48093, 'loss/train': 1.5100988149642944} 11/07/2021 04:01:07 - INFO - __main__ - Step 48095: {'lr': 0.0003895801293693216, 'samples': 9234240, 'steps': 48094, 'loss/train': 1.3519607782363892} 11/07/2021 04:01:09 - INFO - __main__ - Step 48096: {'lr': 0.0003895757267283082, 'samples': 9234432, 'steps': 48095, 'loss/train': 1.5387576818466187} 11/07/2021 04:01:09 - INFO - __main__ - Step 48097: {'lr': 0.0003895713240244042, 'samples': 9234624, 'steps': 48096, 'loss/train': 1.3814644813537598} 11/07/2021 04:01:09 - INFO - __main__ - Step 48098: {'lr': 0.0003895669212576114, 'samples': 9234816, 'steps': 48097, 'loss/train': 1.592847228050232} 11/07/2021 04:01:10 - INFO - __main__ - Step 48099: {'lr': 0.000389562518427932, 'samples': 9235008, 'steps': 48098, 'loss/train': 1.7075380086898804} 11/07/2021 04:01:10 - INFO - __main__ - Step 48100: {'lr': 0.00038955811553536787, 'samples': 9235200, 'steps': 48099, 'loss/train': 1.761032223701477} 11/07/2021 04:01:11 - INFO - __main__ - Step 48101: {'lr': 0.00038955371257992096, 'samples': 9235392, 'steps': 48100, 'loss/train': 2.054033041000366} 11/07/2021 04:01:11 - INFO - __main__ - Step 48102: {'lr': 0.0003895493095615933, 'samples': 9235584, 'steps': 48101, 'loss/train': 1.3840248584747314} 11/07/2021 04:01:12 - INFO - __main__ - Step 48103: {'lr': 0.00038954490648038687, 'samples': 9235776, 'steps': 48102, 'loss/train': 1.024058222770691} 11/07/2021 04:01:12 - INFO - __main__ - Step 48104: {'lr': 0.0003895405033363037, 'samples': 9235968, 'steps': 48103, 'loss/train': 1.359765648841858} 11/07/2021 04:01:12 - INFO - __main__ - Step 48105: {'lr': 0.0003895361001293457, 'samples': 9236160, 'steps': 48104, 'loss/train': 1.476563811302185} 11/07/2021 04:01:13 - INFO - __main__ - Step 48106: {'lr': 0.0003895316968595149, 'samples': 9236352, 'steps': 48105, 'loss/train': 1.8598228693008423} 11/07/2021 04:01:14 - INFO - __main__ - Step 48107: {'lr': 0.0003895272935268133, 'samples': 9236544, 'steps': 48106, 'loss/train': 1.3268775939941406} 11/07/2021 04:01:14 - INFO - __main__ - Step 48108: {'lr': 0.0003895228901312428, 'samples': 9236736, 'steps': 48107, 'loss/train': 0.6876649856567383} 11/07/2021 04:01:15 - INFO - __main__ - Step 48109: {'lr': 0.0003895184866728054, 'samples': 9236928, 'steps': 48108, 'loss/train': 1.5355066061019897} 11/07/2021 04:01:15 - INFO - __main__ - Step 48110: {'lr': 0.0003895140831515033, 'samples': 9237120, 'steps': 48109, 'loss/train': 1.507046103477478} 11/07/2021 04:01:15 - INFO - __main__ - Step 48111: {'lr': 0.0003895096795673381, 'samples': 9237312, 'steps': 48110, 'loss/train': 1.1604622602462769} 11/07/2021 04:01:16 - INFO - __main__ - Step 48112: {'lr': 0.0003895052759203121, 'samples': 9237504, 'steps': 48111, 'loss/train': 1.9194079637527466} 11/07/2021 04:01:17 - INFO - __main__ - Step 48113: {'lr': 0.0003895008722104272, 'samples': 9237696, 'steps': 48112, 'loss/train': 1.3934123516082764} 11/07/2021 04:01:17 - INFO - __main__ - Step 48114: {'lr': 0.00038949646843768526, 'samples': 9237888, 'steps': 48113, 'loss/train': 1.328235387802124} 11/07/2021 04:01:17 - INFO - __main__ - Step 48115: {'lr': 0.00038949206460208845, 'samples': 9238080, 'steps': 48114, 'loss/train': 1.8984450101852417} 11/07/2021 04:01:18 - INFO - __main__ - Step 48116: {'lr': 0.0003894876607036386, 'samples': 9238272, 'steps': 48115, 'loss/train': 1.4652715921401978} 11/07/2021 04:01:18 - INFO - __main__ - Step 48117: {'lr': 0.0003894832567423379, 'samples': 9238464, 'steps': 48116, 'loss/train': 1.687768578529358} 11/07/2021 04:01:19 - INFO - __main__ - Step 48118: {'lr': 0.00038947885271818807, 'samples': 9238656, 'steps': 48117, 'loss/train': 1.3348737955093384} 11/07/2021 04:01:20 - INFO - __main__ - Step 48119: {'lr': 0.0003894744486311912, 'samples': 9238848, 'steps': 48118, 'loss/train': 0.9331466555595398} 11/07/2021 04:01:20 - INFO - __main__ - Step 48120: {'lr': 0.00038947004448134937, 'samples': 9239040, 'steps': 48119, 'loss/train': 1.088590145111084} 11/07/2021 04:01:20 - INFO - __main__ - Step 48121: {'lr': 0.0003894656402686645, 'samples': 9239232, 'steps': 48120, 'loss/train': 2.1469006538391113} 11/07/2021 04:01:21 - INFO - __main__ - Step 48122: {'lr': 0.00038946123599313846, 'samples': 9239424, 'steps': 48121, 'loss/train': 1.62726628780365} 11/07/2021 04:01:22 - INFO - __main__ - Step 48123: {'lr': 0.0003894568316547734, 'samples': 9239616, 'steps': 48122, 'loss/train': 2.0312588214874268} 11/07/2021 04:01:22 - INFO - __main__ - Step 48124: {'lr': 0.00038945242725357127, 'samples': 9239808, 'steps': 48123, 'loss/train': 1.563032627105713} 11/07/2021 04:01:22 - INFO - __main__ - Step 48125: {'lr': 0.000389448022789534, 'samples': 9240000, 'steps': 48124, 'loss/train': 1.9691152572631836} 11/07/2021 04:01:23 - INFO - __main__ - Step 48126: {'lr': 0.0003894436182626636, 'samples': 9240192, 'steps': 48125, 'loss/train': 1.8409271240234375} 11/07/2021 04:01:23 - INFO - __main__ - Step 48127: {'lr': 0.00038943921367296213, 'samples': 9240384, 'steps': 48126, 'loss/train': 1.4550058841705322} 11/07/2021 04:01:24 - INFO - __main__ - Step 48128: {'lr': 0.00038943480902043146, 'samples': 9240576, 'steps': 48127, 'loss/train': 1.4334383010864258} 11/07/2021 04:01:24 - INFO - __main__ - Step 48129: {'lr': 0.0003894304043050736, 'samples': 9240768, 'steps': 48128, 'loss/train': 1.6067302227020264} 11/07/2021 04:01:25 - INFO - __main__ - Step 48130: {'lr': 0.0003894259995268905, 'samples': 9240960, 'steps': 48129, 'loss/train': 1.6146119832992554} 11/07/2021 04:01:25 - INFO - __main__ - Step 48131: {'lr': 0.00038942159468588423, 'samples': 9241152, 'steps': 48130, 'loss/train': 1.4710017442703247} 11/07/2021 04:01:25 - INFO - __main__ - Step 48132: {'lr': 0.00038941718978205674, 'samples': 9241344, 'steps': 48131, 'loss/train': 1.373228669166565} 11/07/2021 04:01:27 - INFO - __main__ - Step 48133: {'lr': 0.0003894127848154101, 'samples': 9241536, 'steps': 48132, 'loss/train': 1.3340827226638794} 11/07/2021 04:01:27 - INFO - __main__ - Step 48134: {'lr': 0.0003894083797859461, 'samples': 9241728, 'steps': 48133, 'loss/train': 1.859207034111023} 11/07/2021 04:01:27 - INFO - __main__ - Step 48135: {'lr': 0.00038940397469366695, 'samples': 9241920, 'steps': 48134, 'loss/train': 1.445658564567566} 11/07/2021 04:01:28 - INFO - __main__ - Step 48136: {'lr': 0.0003893995695385744, 'samples': 9242112, 'steps': 48135, 'loss/train': 1.6205159425735474} 11/07/2021 04:01:28 - INFO - __main__ - Step 48137: {'lr': 0.0003893951643206706, 'samples': 9242304, 'steps': 48136, 'loss/train': 0.8118087649345398} 11/07/2021 04:01:28 - INFO - __main__ - Step 48138: {'lr': 0.00038939075903995744, 'samples': 9242496, 'steps': 48137, 'loss/train': 1.5155673027038574} 11/07/2021 04:01:29 - INFO - __main__ - Step 48139: {'lr': 0.000389386353696437, 'samples': 9242688, 'steps': 48138, 'loss/train': 1.2779062986373901} 11/07/2021 04:01:30 - INFO - __main__ - Step 48140: {'lr': 0.0003893819482901113, 'samples': 9242880, 'steps': 48139, 'loss/train': 1.4969934225082397} 11/07/2021 04:01:30 - INFO - __main__ - Step 48141: {'lr': 0.0003893775428209822, 'samples': 9243072, 'steps': 48140, 'loss/train': 1.910718321800232} 11/07/2021 04:01:30 - INFO - __main__ - Step 48142: {'lr': 0.00038937313728905164, 'samples': 9243264, 'steps': 48141, 'loss/train': 1.4000948667526245} 11/07/2021 04:01:31 - INFO - __main__ - Step 48143: {'lr': 0.0003893687316943218, 'samples': 9243456, 'steps': 48142, 'loss/train': 1.5593124628067017} 11/07/2021 04:01:32 - INFO - __main__ - Step 48144: {'lr': 0.0003893643260367945, 'samples': 9243648, 'steps': 48143, 'loss/train': 2.035222053527832} 11/07/2021 04:01:32 - INFO - __main__ - Step 48145: {'lr': 0.00038935992031647183, 'samples': 9243840, 'steps': 48144, 'loss/train': 1.6396360397338867} 11/07/2021 04:01:33 - INFO - __main__ - Step 48146: {'lr': 0.00038935551453335573, 'samples': 9244032, 'steps': 48145, 'loss/train': 1.7988249063491821} 11/07/2021 04:01:33 - INFO - __main__ - Step 48147: {'lr': 0.00038935110868744817, 'samples': 9244224, 'steps': 48146, 'loss/train': 1.5400888919830322} 11/07/2021 04:01:33 - INFO - __main__ - Step 48148: {'lr': 0.0003893467027787511, 'samples': 9244416, 'steps': 48147, 'loss/train': 1.4044010639190674} 11/07/2021 04:01:34 - INFO - __main__ - Step 48149: {'lr': 0.00038934229680726663, 'samples': 9244608, 'steps': 48148, 'loss/train': 1.3595432043075562} 11/07/2021 04:01:35 - INFO - __main__ - Step 48150: {'lr': 0.0003893378907729966, 'samples': 9244800, 'steps': 48149, 'loss/train': 1.5146124362945557} 11/07/2021 04:01:35 - INFO - __main__ - Step 48151: {'lr': 0.0003893334846759431, 'samples': 9244992, 'steps': 48150, 'loss/train': 1.3512797355651855} 11/07/2021 04:01:35 - INFO - __main__ - Step 48152: {'lr': 0.0003893290785161081, 'samples': 9245184, 'steps': 48151, 'loss/train': 1.6466460227966309} 11/07/2021 04:01:36 - INFO - __main__ - Step 48153: {'lr': 0.00038932467229349353, 'samples': 9245376, 'steps': 48152, 'loss/train': 1.788167119026184} 11/07/2021 04:01:37 - INFO - __main__ - Step 48154: {'lr': 0.0003893202660081014, 'samples': 9245568, 'steps': 48153, 'loss/train': 1.78120756149292} 11/07/2021 04:01:37 - INFO - __main__ - Step 48155: {'lr': 0.00038931585965993384, 'samples': 9245760, 'steps': 48154, 'loss/train': 1.456383228302002} 11/07/2021 04:01:38 - INFO - __main__ - Step 48156: {'lr': 0.0003893114532489926, 'samples': 9245952, 'steps': 48155, 'loss/train': 0.7910884022712708} 11/07/2021 04:01:38 - INFO - __main__ - Step 48157: {'lr': 0.00038930704677527975, 'samples': 9246144, 'steps': 48156, 'loss/train': 1.9671176671981812} 11/07/2021 04:01:38 - INFO - __main__ - Step 48158: {'lr': 0.00038930264023879737, 'samples': 9246336, 'steps': 48157, 'loss/train': 1.5886300802230835} 11/07/2021 04:01:39 - INFO - __main__ - Step 48159: {'lr': 0.0003892982336395473, 'samples': 9246528, 'steps': 48158, 'loss/train': 1.5728018283843994} 11/07/2021 04:01:40 - INFO - __main__ - Step 48160: {'lr': 0.00038929382697753157, 'samples': 9246720, 'steps': 48159, 'loss/train': 1.4440854787826538} 11/07/2021 04:01:40 - INFO - __main__ - Step 48161: {'lr': 0.00038928942025275227, 'samples': 9246912, 'steps': 48160, 'loss/train': 1.386595606803894} 11/07/2021 04:01:41 - INFO - __main__ - Step 48162: {'lr': 0.00038928501346521127, 'samples': 9247104, 'steps': 48161, 'loss/train': 1.5824897289276123} 11/07/2021 04:01:41 - INFO - __main__ - Step 48163: {'lr': 0.0003892806066149106, 'samples': 9247296, 'steps': 48162, 'loss/train': 1.7923336029052734} 11/07/2021 04:01:41 - INFO - __main__ - Step 48164: {'lr': 0.00038927619970185225, 'samples': 9247488, 'steps': 48163, 'loss/train': 1.387700080871582} 11/07/2021 04:01:43 - INFO - __main__ - Step 48165: {'lr': 0.0003892717927260382, 'samples': 9247680, 'steps': 48164, 'loss/train': 1.330069661140442} 11/07/2021 04:01:43 - INFO - __main__ - Step 48166: {'lr': 0.00038926738568747035, 'samples': 9247872, 'steps': 48165, 'loss/train': 1.217962384223938} 11/07/2021 04:01:43 - INFO - __main__ - Step 48167: {'lr': 0.0003892629785861509, 'samples': 9248064, 'steps': 48166, 'loss/train': 1.3875772953033447} 11/07/2021 04:01:44 - INFO - __main__ - Step 48168: {'lr': 0.00038925857142208155, 'samples': 9248256, 'steps': 48167, 'loss/train': 1.591727614402771} 11/07/2021 04:01:44 - INFO - __main__ - Step 48169: {'lr': 0.0003892541641952645, 'samples': 9248448, 'steps': 48168, 'loss/train': 2.152301788330078} 11/07/2021 04:01:46 - INFO - __main__ - Step 48170: {'lr': 0.00038924975690570173, 'samples': 9248640, 'steps': 48169, 'loss/train': 0.714447021484375} 11/07/2021 04:01:46 - INFO - __main__ - Step 48171: {'lr': 0.0003892453495533951, 'samples': 9248832, 'steps': 48170, 'loss/train': 1.6348806619644165} 11/07/2021 04:01:47 - INFO - __main__ - Step 48172: {'lr': 0.0003892409421383467, 'samples': 9249024, 'steps': 48171, 'loss/train': 1.3700920343399048} 11/07/2021 04:01:47 - INFO - __main__ - Step 48173: {'lr': 0.0003892365346605584, 'samples': 9249216, 'steps': 48172, 'loss/train': 1.094175100326538} 11/07/2021 04:01:47 - INFO - __main__ - Step 48174: {'lr': 0.0003892321271200324, 'samples': 9249408, 'steps': 48173, 'loss/train': 1.1373343467712402} 11/07/2021 04:01:48 - INFO - __main__ - Step 48175: {'lr': 0.0003892277195167705, 'samples': 9249600, 'steps': 48174, 'loss/train': 0.7871014475822449} 11/07/2021 04:01:48 - INFO - __main__ - Step 48176: {'lr': 0.00038922331185077465, 'samples': 9249792, 'steps': 48175, 'loss/train': 0.7428754568099976} 11/07/2021 04:01:49 - INFO - __main__ - Step 48177: {'lr': 0.000389218904122047, 'samples': 9249984, 'steps': 48176, 'loss/train': 1.4595086574554443} 11/07/2021 04:01:49 - INFO - __main__ - Step 48178: {'lr': 0.00038921449633058945, 'samples': 9250176, 'steps': 48177, 'loss/train': 1.8247631788253784} 11/07/2021 04:01:50 - INFO - __main__ - Step 48179: {'lr': 0.00038921008847640407, 'samples': 9250368, 'steps': 48178, 'loss/train': 1.262622594833374} 11/07/2021 04:01:50 - INFO - __main__ - Step 48180: {'lr': 0.0003892056805594926, 'samples': 9250560, 'steps': 48179, 'loss/train': 1.541310429573059} 11/07/2021 04:01:50 - INFO - __main__ - Step 48181: {'lr': 0.0003892012725798574, 'samples': 9250752, 'steps': 48180, 'loss/train': 1.705743432044983} 11/07/2021 04:01:51 - INFO - __main__ - Step 48182: {'lr': 0.00038919686453750015, 'samples': 9250944, 'steps': 48181, 'loss/train': 1.821557641029358} 11/07/2021 04:01:52 - INFO - __main__ - Step 48183: {'lr': 0.0003891924564324229, 'samples': 9251136, 'steps': 48182, 'loss/train': 1.426300287246704} 11/07/2021 04:01:52 - INFO - __main__ - Step 48184: {'lr': 0.0003891880482646277, 'samples': 9251328, 'steps': 48183, 'loss/train': 1.5814155340194702} 11/07/2021 04:01:52 - INFO - __main__ - Step 48185: {'lr': 0.00038918364003411656, 'samples': 9251520, 'steps': 48184, 'loss/train': 1.4798948764801025} 11/07/2021 04:01:53 - INFO - __main__ - Step 48186: {'lr': 0.0003891792317408914, 'samples': 9251712, 'steps': 48185, 'loss/train': 1.9678324460983276} 11/07/2021 04:01:54 - INFO - __main__ - Step 48187: {'lr': 0.00038917482338495424, 'samples': 9251904, 'steps': 48186, 'loss/train': 1.6393036842346191} 11/07/2021 04:01:54 - INFO - __main__ - Step 48188: {'lr': 0.000389170414966307, 'samples': 9252096, 'steps': 48187, 'loss/train': 1.6283665895462036} 11/07/2021 04:01:55 - INFO - __main__ - Step 48189: {'lr': 0.0003891660064849518, 'samples': 9252288, 'steps': 48188, 'loss/train': 0.8630964756011963} 11/07/2021 04:01:55 - INFO - __main__ - Step 48190: {'lr': 0.00038916159794089044, 'samples': 9252480, 'steps': 48189, 'loss/train': 1.1445237398147583} 11/07/2021 04:01:55 - INFO - __main__ - Step 48191: {'lr': 0.00038915718933412515, 'samples': 9252672, 'steps': 48190, 'loss/train': 1.79466712474823} 11/07/2021 04:01:56 - INFO - __main__ - Step 48192: {'lr': 0.0003891527806646576, 'samples': 9252864, 'steps': 48191, 'loss/train': 1.5653927326202393} 11/07/2021 04:01:57 - INFO - __main__ - Step 48193: {'lr': 0.0003891483719324901, 'samples': 9253056, 'steps': 48192, 'loss/train': 1.4417861700057983} 11/07/2021 04:01:57 - INFO - __main__ - Step 48194: {'lr': 0.00038914396313762445, 'samples': 9253248, 'steps': 48193, 'loss/train': 1.647060751914978} 11/07/2021 04:01:57 - INFO - __main__ - Step 48195: {'lr': 0.00038913955428006265, 'samples': 9253440, 'steps': 48194, 'loss/train': 1.4309707880020142} 11/07/2021 04:01:58 - INFO - __main__ - Step 48196: {'lr': 0.00038913514535980675, 'samples': 9253632, 'steps': 48195, 'loss/train': 1.375997543334961} 11/07/2021 04:01:59 - INFO - __main__ - Step 48197: {'lr': 0.0003891307363768587, 'samples': 9253824, 'steps': 48196, 'loss/train': 0.729505717754364} 11/07/2021 04:01:59 - INFO - __main__ - Step 48198: {'lr': 0.00038912632733122045, 'samples': 9254016, 'steps': 48197, 'loss/train': 1.4610792398452759} 11/07/2021 04:01:59 - INFO - __main__ - Step 48199: {'lr': 0.000389121918222894, 'samples': 9254208, 'steps': 48198, 'loss/train': 1.446216344833374} 11/07/2021 04:02:00 - INFO - __main__ - Step 48200: {'lr': 0.0003891175090518814, 'samples': 9254400, 'steps': 48199, 'loss/train': 1.6533331871032715} 11/07/2021 04:02:00 - INFO - __main__ - Step 48201: {'lr': 0.00038911309981818466, 'samples': 9254592, 'steps': 48200, 'loss/train': 1.6140072345733643} 11/07/2021 04:02:01 - INFO - __main__ - Step 48202: {'lr': 0.00038910869052180563, 'samples': 9254784, 'steps': 48201, 'loss/train': 1.157281756401062} 11/07/2021 04:02:02 - INFO - __main__ - Step 48203: {'lr': 0.00038910428116274644, 'samples': 9254976, 'steps': 48202, 'loss/train': 1.405438780784607} 11/07/2021 04:02:02 - INFO - __main__ - Step 48204: {'lr': 0.0003890998717410089, 'samples': 9255168, 'steps': 48203, 'loss/train': 1.6671035289764404} 11/07/2021 04:02:02 - INFO - __main__ - Step 48205: {'lr': 0.0003890954622565952, 'samples': 9255360, 'steps': 48204, 'loss/train': 1.69551682472229} 11/07/2021 04:02:03 - INFO - __main__ - Step 48206: {'lr': 0.00038909105270950716, 'samples': 9255552, 'steps': 48205, 'loss/train': 1.541272521018982} 11/07/2021 04:02:03 - INFO - __main__ - Step 48207: {'lr': 0.0003890866430997468, 'samples': 9255744, 'steps': 48206, 'loss/train': 1.7470780611038208} 11/07/2021 04:02:04 - INFO - __main__ - Step 48208: {'lr': 0.0003890822334273163, 'samples': 9255936, 'steps': 48207, 'loss/train': 1.031080722808838} 11/07/2021 04:02:04 - INFO - __main__ - Step 48209: {'lr': 0.0003890778236922174, 'samples': 9256128, 'steps': 48208, 'loss/train': 1.8261090517044067} 11/07/2021 04:02:05 - INFO - __main__ - Step 48210: {'lr': 0.00038907341389445217, 'samples': 9256320, 'steps': 48209, 'loss/train': 1.4540414810180664} 11/07/2021 04:02:05 - INFO - __main__ - Step 48211: {'lr': 0.0003890690040340226, 'samples': 9256512, 'steps': 48210, 'loss/train': 0.7340781688690186} 11/07/2021 04:02:05 - INFO - __main__ - Step 48212: {'lr': 0.00038906459411093075, 'samples': 9256704, 'steps': 48211, 'loss/train': 0.9523065686225891} 11/07/2021 04:02:07 - INFO - __main__ - Step 48213: {'lr': 0.0003890601841251785, 'samples': 9256896, 'steps': 48212, 'loss/train': 1.6918346881866455} 11/07/2021 04:02:07 - INFO - __main__ - Step 48214: {'lr': 0.0003890557740767678, 'samples': 9257088, 'steps': 48213, 'loss/train': 1.09598708152771} 11/07/2021 04:02:07 - INFO - __main__ - Step 48215: {'lr': 0.00038905136396570085, 'samples': 9257280, 'steps': 48214, 'loss/train': 1.8478615283966064} 11/07/2021 04:02:08 - INFO - __main__ - Step 48216: {'lr': 0.0003890469537919794, 'samples': 9257472, 'steps': 48215, 'loss/train': 0.5915213227272034} 11/07/2021 04:02:08 - INFO - __main__ - Step 48217: {'lr': 0.0003890425435556055, 'samples': 9257664, 'steps': 48216, 'loss/train': 2.091214418411255} 11/07/2021 04:02:09 - INFO - __main__ - Step 48218: {'lr': 0.0003890381332565813, 'samples': 9257856, 'steps': 48217, 'loss/train': 1.7031760215759277} 11/07/2021 04:02:09 - INFO - __main__ - Step 48219: {'lr': 0.00038903372289490865, 'samples': 9258048, 'steps': 48218, 'loss/train': 1.9759994745254517} 11/07/2021 04:02:10 - INFO - __main__ - Step 48220: {'lr': 0.0003890293124705895, 'samples': 9258240, 'steps': 48219, 'loss/train': 1.4671889543533325} 11/07/2021 04:02:10 - INFO - __main__ - Step 48221: {'lr': 0.0003890249019836259, 'samples': 9258432, 'steps': 48220, 'loss/train': 1.2330702543258667} 11/07/2021 04:02:10 - INFO - __main__ - Step 48222: {'lr': 0.0003890204914340198, 'samples': 9258624, 'steps': 48221, 'loss/train': 1.453384280204773} 11/07/2021 04:02:12 - INFO - __main__ - Step 48223: {'lr': 0.00038901608082177327, 'samples': 9258816, 'steps': 48222, 'loss/train': 1.6290801763534546} 11/07/2021 04:02:12 - INFO - __main__ - Step 48224: {'lr': 0.0003890116701468882, 'samples': 9259008, 'steps': 48223, 'loss/train': 1.69619882106781} 11/07/2021 04:02:12 - INFO - __main__ - Step 48225: {'lr': 0.0003890072594093666, 'samples': 9259200, 'steps': 48224, 'loss/train': 1.2480615377426147} 11/07/2021 04:02:13 - INFO - __main__ - Step 48226: {'lr': 0.00038900284860921046, 'samples': 9259392, 'steps': 48225, 'loss/train': 0.24857133626937866} 11/07/2021 04:02:13 - INFO - __main__ - Step 48227: {'lr': 0.00038899843774642184, 'samples': 9259584, 'steps': 48226, 'loss/train': 1.8875969648361206} 11/07/2021 04:02:13 - INFO - __main__ - Step 48228: {'lr': 0.00038899402682100265, 'samples': 9259776, 'steps': 48227, 'loss/train': 1.4447287321090698} 11/07/2021 04:02:14 - INFO - __main__ - Step 48229: {'lr': 0.0003889896158329549, 'samples': 9259968, 'steps': 48228, 'loss/train': 1.9886767864227295} 11/07/2021 04:02:15 - INFO - __main__ - Step 48230: {'lr': 0.00038898520478228055, 'samples': 9260160, 'steps': 48229, 'loss/train': 1.613850474357605} 11/07/2021 04:02:15 - INFO - __main__ - Step 48231: {'lr': 0.00038898079366898164, 'samples': 9260352, 'steps': 48230, 'loss/train': 1.478806495666504} 11/07/2021 04:02:15 - INFO - __main__ - Step 48232: {'lr': 0.0003889763824930601, 'samples': 9260544, 'steps': 48231, 'loss/train': 1.3042471408843994} 11/07/2021 04:02:16 - INFO - __main__ - Step 48233: {'lr': 0.00038897197125451795, 'samples': 9260736, 'steps': 48232, 'loss/train': 1.3413423299789429} 11/07/2021 04:02:17 - INFO - __main__ - Step 48234: {'lr': 0.0003889675599533572, 'samples': 9260928, 'steps': 48233, 'loss/train': 1.5150249004364014} 11/07/2021 04:02:17 - INFO - __main__ - Step 48235: {'lr': 0.0003889631485895798, 'samples': 9261120, 'steps': 48234, 'loss/train': 1.4947305917739868} 11/07/2021 04:02:17 - INFO - __main__ - Step 48236: {'lr': 0.00038895873716318776, 'samples': 9261312, 'steps': 48235, 'loss/train': 1.218025803565979} 11/07/2021 04:02:18 - INFO - __main__ - Step 48237: {'lr': 0.000388954325674183, 'samples': 9261504, 'steps': 48236, 'loss/train': 1.2372782230377197} 11/07/2021 04:02:18 - INFO - __main__ - Step 48238: {'lr': 0.00038894991412256766, 'samples': 9261696, 'steps': 48237, 'loss/train': 1.2564854621887207} 11/07/2021 04:02:19 - INFO - __main__ - Step 48239: {'lr': 0.00038894550250834355, 'samples': 9261888, 'steps': 48238, 'loss/train': 1.5338630676269531} 11/07/2021 04:02:20 - INFO - __main__ - Step 48240: {'lr': 0.00038894109083151274, 'samples': 9262080, 'steps': 48239, 'loss/train': 1.4853929281234741} 11/07/2021 04:02:20 - INFO - __main__ - Step 48241: {'lr': 0.0003889366790920773, 'samples': 9262272, 'steps': 48240, 'loss/train': 1.1592860221862793} 11/07/2021 04:02:20 - INFO - __main__ - Step 48242: {'lr': 0.00038893226729003904, 'samples': 9262464, 'steps': 48241, 'loss/train': 2.0898427963256836} 11/07/2021 04:02:21 - INFO - __main__ - Step 48243: {'lr': 0.0003889278554254001, 'samples': 9262656, 'steps': 48242, 'loss/train': 1.4382511377334595} 11/07/2021 04:02:22 - INFO - __main__ - Step 48244: {'lr': 0.00038892344349816246, 'samples': 9262848, 'steps': 48243, 'loss/train': 1.4409617185592651} 11/07/2021 04:02:22 - INFO - __main__ - Step 48245: {'lr': 0.00038891903150832795, 'samples': 9263040, 'steps': 48244, 'loss/train': 1.440793752670288} 11/07/2021 04:02:22 - INFO - __main__ - Step 48246: {'lr': 0.00038891461945589866, 'samples': 9263232, 'steps': 48245, 'loss/train': 1.4871621131896973} 11/07/2021 04:02:23 - INFO - __main__ - Step 48247: {'lr': 0.0003889102073408767, 'samples': 9263424, 'steps': 48246, 'loss/train': 2.115586996078491} 11/07/2021 04:02:23 - INFO - __main__ - Step 48248: {'lr': 0.0003889057951632639, 'samples': 9263616, 'steps': 48247, 'loss/train': 1.8211729526519775} 11/07/2021 04:02:23 - INFO - __main__ - Step 48249: {'lr': 0.0003889013829230623, 'samples': 9263808, 'steps': 48248, 'loss/train': 1.6548411846160889} 11/07/2021 04:02:24 - INFO - __main__ - Step 48250: {'lr': 0.00038889697062027384, 'samples': 9264000, 'steps': 48249, 'loss/train': 1.5534456968307495} 11/07/2021 04:02:25 - INFO - __main__ - Step 48251: {'lr': 0.00038889255825490053, 'samples': 9264192, 'steps': 48250, 'loss/train': 1.0798821449279785} 11/07/2021 04:02:25 - INFO - __main__ - Step 48252: {'lr': 0.0003888881458269444, 'samples': 9264384, 'steps': 48251, 'loss/train': 0.9305204153060913} 11/07/2021 04:02:26 - INFO - __main__ - Step 48253: {'lr': 0.00038888373333640746, 'samples': 9264576, 'steps': 48252, 'loss/train': 1.3799680471420288} 11/07/2021 04:02:26 - INFO - __main__ - Step 48254: {'lr': 0.00038887932078329165, 'samples': 9264768, 'steps': 48253, 'loss/train': 1.4742240905761719} 11/07/2021 04:02:27 - INFO - __main__ - Step 48255: {'lr': 0.00038887490816759895, 'samples': 9264960, 'steps': 48254, 'loss/train': 1.489135980606079} 11/07/2021 04:02:27 - INFO - __main__ - Step 48256: {'lr': 0.00038887049548933135, 'samples': 9265152, 'steps': 48255, 'loss/train': 1.4327009916305542} 11/07/2021 04:02:28 - INFO - __main__ - Step 48257: {'lr': 0.0003888660827484908, 'samples': 9265344, 'steps': 48256, 'loss/train': 1.4369255304336548} 11/07/2021 04:02:28 - INFO - __main__ - Step 48258: {'lr': 0.00038886166994507945, 'samples': 9265536, 'steps': 48257, 'loss/train': 1.502833604812622} 11/07/2021 04:02:28 - INFO - __main__ - Step 48259: {'lr': 0.00038885725707909905, 'samples': 9265728, 'steps': 48258, 'loss/train': 0.3147200047969818} 11/07/2021 04:02:29 - INFO - __main__ - Step 48260: {'lr': 0.0003888528441505518, 'samples': 9265920, 'steps': 48259, 'loss/train': 1.8659920692443848} 11/07/2021 04:02:30 - INFO - __main__ - Step 48261: {'lr': 0.00038884843115943955, 'samples': 9266112, 'steps': 48260, 'loss/train': 0.5704777836799622} 11/07/2021 04:02:30 - INFO - __main__ - Step 48262: {'lr': 0.00038884401810576434, 'samples': 9266304, 'steps': 48261, 'loss/train': 1.202111005783081} 11/07/2021 04:02:30 - INFO - __main__ - Step 48263: {'lr': 0.0003888396049895282, 'samples': 9266496, 'steps': 48262, 'loss/train': 1.4878652095794678} 11/07/2021 04:02:31 - INFO - __main__ - Step 48264: {'lr': 0.000388835191810733, 'samples': 9266688, 'steps': 48263, 'loss/train': 1.6896651983261108} 11/07/2021 04:02:32 - INFO - __main__ - Step 48265: {'lr': 0.0003888307785693809, 'samples': 9266880, 'steps': 48264, 'loss/train': 1.4856317043304443} 11/07/2021 04:02:32 - INFO - __main__ - Step 48266: {'lr': 0.0003888263652654738, 'samples': 9267072, 'steps': 48265, 'loss/train': 1.5039575099945068} 11/07/2021 04:02:33 - INFO - __main__ - Step 48267: {'lr': 0.0003888219518990136, 'samples': 9267264, 'steps': 48266, 'loss/train': 1.9758280515670776} 11/07/2021 04:02:33 - INFO - __main__ - Step 48268: {'lr': 0.0003888175384700024, 'samples': 9267456, 'steps': 48267, 'loss/train': 1.226232647895813} 11/07/2021 04:02:33 - INFO - __main__ - Step 48269: {'lr': 0.0003888131249784421, 'samples': 9267648, 'steps': 48268, 'loss/train': 1.7522835731506348} 11/07/2021 04:02:34 - INFO - __main__ - Step 48270: {'lr': 0.00038880871142433484, 'samples': 9267840, 'steps': 48269, 'loss/train': 0.9955446124076843} 11/07/2021 04:02:35 - INFO - __main__ - Step 48271: {'lr': 0.0003888042978076825, 'samples': 9268032, 'steps': 48270, 'loss/train': 1.7882648706436157} 11/07/2021 04:02:35 - INFO - __main__ - Step 48272: {'lr': 0.00038879988412848706, 'samples': 9268224, 'steps': 48271, 'loss/train': 1.4117809534072876} 11/07/2021 04:02:35 - INFO - __main__ - Step 48273: {'lr': 0.00038879547038675054, 'samples': 9268416, 'steps': 48272, 'loss/train': 1.4808791875839233} 11/07/2021 04:02:36 - INFO - __main__ - Step 48274: {'lr': 0.0003887910565824749, 'samples': 9268608, 'steps': 48273, 'loss/train': 0.9658954739570618} 11/07/2021 04:02:36 - INFO - __main__ - Step 48275: {'lr': 0.0003887866427156622, 'samples': 9268800, 'steps': 48274, 'loss/train': 1.368213415145874} 11/07/2021 04:02:37 - INFO - __main__ - Step 48276: {'lr': 0.00038878222878631444, 'samples': 9268992, 'steps': 48275, 'loss/train': 1.7484121322631836} 11/07/2021 04:02:37 - INFO - __main__ - Step 48277: {'lr': 0.0003887778147944334, 'samples': 9269184, 'steps': 48276, 'loss/train': 1.3552138805389404} 11/07/2021 04:02:38 - INFO - __main__ - Step 48278: {'lr': 0.0003887734007400213, 'samples': 9269376, 'steps': 48277, 'loss/train': 1.819169044494629} 11/07/2021 04:02:38 - INFO - __main__ - Step 48279: {'lr': 0.00038876898662308, 'samples': 9269568, 'steps': 48278, 'loss/train': 1.8303667306900024} 11/07/2021 04:02:39 - INFO - __main__ - Step 48280: {'lr': 0.00038876457244361166, 'samples': 9269760, 'steps': 48279, 'loss/train': 1.3897451162338257} 11/07/2021 04:02:40 - INFO - __main__ - Step 48281: {'lr': 0.000388760158201618, 'samples': 9269952, 'steps': 48280, 'loss/train': 1.4322813749313354} 11/07/2021 04:02:40 - INFO - __main__ - Step 48282: {'lr': 0.0003887557438971012, 'samples': 9270144, 'steps': 48281, 'loss/train': 1.7044888734817505} 11/07/2021 04:02:40 - INFO - __main__ - Step 48283: {'lr': 0.0003887513295300632, 'samples': 9270336, 'steps': 48282, 'loss/train': 1.4639825820922852} 11/07/2021 04:02:41 - INFO - __main__ - Step 48284: {'lr': 0.00038874691510050604, 'samples': 9270528, 'steps': 48283, 'loss/train': 1.6526075601577759} 11/07/2021 04:02:41 - INFO - __main__ - Step 48285: {'lr': 0.00038874250060843163, 'samples': 9270720, 'steps': 48284, 'loss/train': 1.1600265502929688} 11/07/2021 04:02:42 - INFO - __main__ - Step 48286: {'lr': 0.00038873808605384197, 'samples': 9270912, 'steps': 48285, 'loss/train': 1.3651691675186157} 11/07/2021 04:02:42 - INFO - __main__ - Step 48287: {'lr': 0.0003887336714367391, 'samples': 9271104, 'steps': 48286, 'loss/train': 1.148716688156128} 11/07/2021 04:02:43 - INFO - __main__ - Step 48288: {'lr': 0.00038872925675712493, 'samples': 9271296, 'steps': 48287, 'loss/train': 1.4003381729125977} 11/07/2021 04:02:43 - INFO - __main__ - Step 48289: {'lr': 0.0003887248420150016, 'samples': 9271488, 'steps': 48288, 'loss/train': 1.4225640296936035} 11/07/2021 04:02:43 - INFO - __main__ - Step 48290: {'lr': 0.00038872042721037087, 'samples': 9271680, 'steps': 48289, 'loss/train': 1.2903283834457397} 11/07/2021 04:02:45 - INFO - __main__ - Step 48291: {'lr': 0.00038871601234323494, 'samples': 9271872, 'steps': 48290, 'loss/train': 0.3443036377429962} 11/07/2021 04:02:45 - INFO - __main__ - Step 48292: {'lr': 0.00038871159741359567, 'samples': 9272064, 'steps': 48291, 'loss/train': 1.823111891746521} 11/07/2021 04:02:45 - INFO - __main__ - Step 48293: {'lr': 0.0003887071824214551, 'samples': 9272256, 'steps': 48292, 'loss/train': 1.5963959693908691} 11/07/2021 04:02:46 - INFO - __main__ - Step 48294: {'lr': 0.0003887027673668152, 'samples': 9272448, 'steps': 48293, 'loss/train': 1.734969139099121} 11/07/2021 04:02:46 - INFO - __main__ - Step 48295: {'lr': 0.0003886983522496781, 'samples': 9272640, 'steps': 48294, 'loss/train': 1.4370068311691284} 11/07/2021 04:02:47 - INFO - __main__ - Step 48296: {'lr': 0.00038869393707004554, 'samples': 9272832, 'steps': 48295, 'loss/train': 1.698050618171692} 11/07/2021 04:02:47 - INFO - __main__ - Step 48297: {'lr': 0.00038868952182791964, 'samples': 9273024, 'steps': 48296, 'loss/train': 1.2806745767593384} 11/07/2021 04:02:48 - INFO - __main__ - Step 48298: {'lr': 0.0003886851065233024, 'samples': 9273216, 'steps': 48297, 'loss/train': 1.7508585453033447} 11/07/2021 04:02:48 - INFO - __main__ - Step 48299: {'lr': 0.0003886806911561958, 'samples': 9273408, 'steps': 48298, 'loss/train': 1.9955202341079712} 11/07/2021 04:02:48 - INFO - __main__ - Step 48300: {'lr': 0.0003886762757266018, 'samples': 9273600, 'steps': 48299, 'loss/train': 1.306843876838684} 11/07/2021 04:02:50 - INFO - __main__ - Step 48301: {'lr': 0.0003886718602345224, 'samples': 9273792, 'steps': 48300, 'loss/train': 1.6101634502410889} 11/07/2021 04:02:50 - INFO - __main__ - Step 48302: {'lr': 0.0003886674446799596, 'samples': 9273984, 'steps': 48301, 'loss/train': 1.5550583600997925} 11/07/2021 04:02:50 - INFO - __main__ - Step 48303: {'lr': 0.00038866302906291546, 'samples': 9274176, 'steps': 48302, 'loss/train': 1.1555699110031128} 11/07/2021 04:02:51 - INFO - __main__ - Step 48304: {'lr': 0.0003886586133833918, 'samples': 9274368, 'steps': 48303, 'loss/train': 1.4424365758895874} 11/07/2021 04:02:51 - INFO - __main__ - Step 48305: {'lr': 0.00038865419764139077, 'samples': 9274560, 'steps': 48304, 'loss/train': 0.8644227385520935} 11/07/2021 04:02:51 - INFO - __main__ - Step 48306: {'lr': 0.00038864978183691425, 'samples': 9274752, 'steps': 48305, 'loss/train': 0.1560346931219101} 11/07/2021 04:02:52 - INFO - __main__ - Step 48307: {'lr': 0.00038864536596996437, 'samples': 9274944, 'steps': 48306, 'loss/train': 1.4520589113235474} 11/07/2021 04:02:53 - INFO - __main__ - Step 48308: {'lr': 0.0003886409500405429, 'samples': 9275136, 'steps': 48307, 'loss/train': 1.5580068826675415} 11/07/2021 04:02:53 - INFO - __main__ - Step 48309: {'lr': 0.00038863653404865207, 'samples': 9275328, 'steps': 48308, 'loss/train': 1.8992919921875} 11/07/2021 04:02:53 - INFO - __main__ - Step 48310: {'lr': 0.0003886321179942937, 'samples': 9275520, 'steps': 48309, 'loss/train': 1.8710448741912842} 11/07/2021 04:02:54 - INFO - __main__ - Step 48311: {'lr': 0.0003886277018774699, 'samples': 9275712, 'steps': 48310, 'loss/train': 1.650238037109375} 11/07/2021 04:02:55 - INFO - __main__ - Step 48312: {'lr': 0.0003886232856981825, 'samples': 9275904, 'steps': 48311, 'loss/train': 1.7266851663589478} 11/07/2021 04:02:55 - INFO - __main__ - Step 48313: {'lr': 0.00038861886945643363, 'samples': 9276096, 'steps': 48312, 'loss/train': 0.19809167087078094} 11/07/2021 04:02:55 - INFO - __main__ - Step 48314: {'lr': 0.00038861445315222523, 'samples': 9276288, 'steps': 48313, 'loss/train': 2.008556365966797} 11/07/2021 04:02:56 - INFO - __main__ - Step 48315: {'lr': 0.00038861003678555936, 'samples': 9276480, 'steps': 48314, 'loss/train': 1.3083784580230713} 11/07/2021 04:02:56 - INFO - __main__ - Step 48316: {'lr': 0.00038860562035643786, 'samples': 9276672, 'steps': 48315, 'loss/train': 1.8233020305633545} 11/07/2021 04:02:57 - INFO - __main__ - Step 48317: {'lr': 0.00038860120386486285, 'samples': 9276864, 'steps': 48316, 'loss/train': 1.6587094068527222} 11/07/2021 04:02:58 - INFO - __main__ - Step 48318: {'lr': 0.00038859678731083627, 'samples': 9277056, 'steps': 48317, 'loss/train': 1.7899813652038574} 11/07/2021 04:02:58 - INFO - __main__ - Step 48319: {'lr': 0.0003885923706943601, 'samples': 9277248, 'steps': 48318, 'loss/train': 1.2071336507797241} 11/07/2021 04:02:58 - INFO - __main__ - Step 48320: {'lr': 0.00038858795401543634, 'samples': 9277440, 'steps': 48319, 'loss/train': 1.0648425817489624} 11/07/2021 04:02:59 - INFO - __main__ - Step 48321: {'lr': 0.000388583537274067, 'samples': 9277632, 'steps': 48320, 'loss/train': 1.409128189086914} 11/07/2021 04:03:00 - INFO - __main__ - Step 48322: {'lr': 0.0003885791204702541, 'samples': 9277824, 'steps': 48321, 'loss/train': 1.2754244804382324} 11/07/2021 04:03:00 - INFO - __main__ - Step 48323: {'lr': 0.0003885747036039995, 'samples': 9278016, 'steps': 48322, 'loss/train': 1.552413821220398} 11/07/2021 04:03:00 - INFO - __main__ - Step 48324: {'lr': 0.0003885702866753054, 'samples': 9278208, 'steps': 48323, 'loss/train': 1.6152557134628296} 11/07/2021 04:03:01 - INFO - __main__ - Step 48325: {'lr': 0.00038856586968417353, 'samples': 9278400, 'steps': 48324, 'loss/train': 1.4518879652023315} 11/07/2021 04:03:01 - INFO - __main__ - Step 48326: {'lr': 0.00038856145263060606, 'samples': 9278592, 'steps': 48325, 'loss/train': 1.1021263599395752} 11/07/2021 04:03:02 - INFO - __main__ - Step 48327: {'lr': 0.00038855703551460497, 'samples': 9278784, 'steps': 48326, 'loss/train': 1.3933674097061157} 11/07/2021 04:03:02 - INFO - __main__ - Step 48328: {'lr': 0.00038855261833617216, 'samples': 9278976, 'steps': 48327, 'loss/train': 1.8830376863479614} 11/07/2021 04:03:03 - INFO - __main__ - Step 48329: {'lr': 0.00038854820109530974, 'samples': 9279168, 'steps': 48328, 'loss/train': 1.5740693807601929} 11/07/2021 04:03:03 - INFO - __main__ - Step 48330: {'lr': 0.00038854378379201966, 'samples': 9279360, 'steps': 48329, 'loss/train': 1.3352789878845215} 11/07/2021 04:03:04 - INFO - __main__ - Step 48331: {'lr': 0.0003885393664263038, 'samples': 9279552, 'steps': 48330, 'loss/train': 1.5021089315414429} 11/07/2021 04:03:05 - INFO - __main__ - Step 48332: {'lr': 0.00038853494899816434, 'samples': 9279744, 'steps': 48331, 'loss/train': 1.460158348083496} 11/07/2021 04:03:05 - INFO - __main__ - Step 48333: {'lr': 0.0003885305315076031, 'samples': 9279936, 'steps': 48332, 'loss/train': 1.4276036024093628} 11/07/2021 04:03:05 - INFO - __main__ - Step 48334: {'lr': 0.0003885261139546221, 'samples': 9280128, 'steps': 48333, 'loss/train': 1.0807746648788452} 11/07/2021 04:03:06 - INFO - __main__ - Step 48335: {'lr': 0.00038852169633922344, 'samples': 9280320, 'steps': 48334, 'loss/train': 1.3793299198150635} 11/07/2021 04:03:06 - INFO - __main__ - Step 48336: {'lr': 0.00038851727866140906, 'samples': 9280512, 'steps': 48335, 'loss/train': 1.3977899551391602} 11/07/2021 04:03:08 - INFO - __main__ - Step 48337: {'lr': 0.00038851286092118095, 'samples': 9280704, 'steps': 48336, 'loss/train': 1.2920801639556885} 11/07/2021 04:03:08 - INFO - __main__ - Step 48338: {'lr': 0.0003885084431185411, 'samples': 9280896, 'steps': 48337, 'loss/train': 1.3549143075942993} 11/07/2021 04:03:08 - INFO - __main__ - Step 48339: {'lr': 0.0003885040252534913, 'samples': 9281088, 'steps': 48338, 'loss/train': 2.4132800102233887} 11/07/2021 04:03:09 - INFO - __main__ - Step 48340: {'lr': 0.00038849960732603386, 'samples': 9281280, 'steps': 48339, 'loss/train': 2.2466821670532227} 11/07/2021 04:03:09 - INFO - __main__ - Step 48341: {'lr': 0.00038849518933617064, 'samples': 9281472, 'steps': 48340, 'loss/train': 1.1273099184036255} 11/07/2021 04:03:09 - INFO - __main__ - Step 48342: {'lr': 0.0003884907712839036, 'samples': 9281664, 'steps': 48341, 'loss/train': 1.5950285196304321} 11/07/2021 04:03:10 - INFO - __main__ - Step 48343: {'lr': 0.00038848635316923475, 'samples': 9281856, 'steps': 48342, 'loss/train': 1.6004289388656616} 11/07/2021 04:03:11 - INFO - __main__ - Step 48344: {'lr': 0.0003884819349921661, 'samples': 9282048, 'steps': 48343, 'loss/train': 1.8100656270980835} 11/07/2021 04:03:11 - INFO - __main__ - Step 48345: {'lr': 0.0003884775167526996, 'samples': 9282240, 'steps': 48344, 'loss/train': 1.493229627609253} 11/07/2021 04:03:11 - INFO - __main__ - Step 48346: {'lr': 0.0003884730984508373, 'samples': 9282432, 'steps': 48345, 'loss/train': 1.6365907192230225} 11/07/2021 04:03:12 - INFO - __main__ - Step 48347: {'lr': 0.0003884686800865812, 'samples': 9282624, 'steps': 48346, 'loss/train': 1.643216609954834} 11/07/2021 04:03:13 - INFO - __main__ - Step 48348: {'lr': 0.0003884642616599331, 'samples': 9282816, 'steps': 48347, 'loss/train': 1.2912529706954956} 11/07/2021 04:03:13 - INFO - __main__ - Step 48349: {'lr': 0.00038845984317089526, 'samples': 9283008, 'steps': 48348, 'loss/train': 1.9691541194915771} 11/07/2021 04:03:14 - INFO - __main__ - Step 48350: {'lr': 0.00038845542461946953, 'samples': 9283200, 'steps': 48349, 'loss/train': 1.8574779033660889} 11/07/2021 04:03:14 - INFO - __main__ - Step 48351: {'lr': 0.00038845100600565794, 'samples': 9283392, 'steps': 48350, 'loss/train': 1.6028879880905151} 11/07/2021 04:03:14 - INFO - __main__ - Step 48352: {'lr': 0.00038844658732946244, 'samples': 9283584, 'steps': 48351, 'loss/train': 1.471935510635376} 11/07/2021 04:03:15 - INFO - __main__ - Step 48353: {'lr': 0.000388442168590885, 'samples': 9283776, 'steps': 48352, 'loss/train': 1.7875932455062866} 11/07/2021 04:03:16 - INFO - __main__ - Step 48354: {'lr': 0.00038843774978992773, 'samples': 9283968, 'steps': 48353, 'loss/train': 1.2040324211120605} 11/07/2021 04:03:16 - INFO - __main__ - Step 48355: {'lr': 0.0003884333309265925, 'samples': 9284160, 'steps': 48354, 'loss/train': 2.1289470195770264} 11/07/2021 04:03:16 - INFO - __main__ - Step 48356: {'lr': 0.00038842891200088135, 'samples': 9284352, 'steps': 48355, 'loss/train': 1.300454020500183} 11/07/2021 04:03:17 - INFO - __main__ - Step 48357: {'lr': 0.0003884244930127963, 'samples': 9284544, 'steps': 48356, 'loss/train': 1.789948582649231} 11/07/2021 04:03:17 - INFO - __main__ - Step 48358: {'lr': 0.0003884200739623393, 'samples': 9284736, 'steps': 48357, 'loss/train': 1.8772468566894531} 11/07/2021 04:03:18 - INFO - __main__ - Step 48359: {'lr': 0.00038841565484951237, 'samples': 9284928, 'steps': 48358, 'loss/train': 1.9208248853683472} 11/07/2021 04:03:19 - INFO - __main__ - Step 48360: {'lr': 0.0003884112356743175, 'samples': 9285120, 'steps': 48359, 'loss/train': 1.1249557733535767} 11/07/2021 04:03:19 - INFO - __main__ - Step 48361: {'lr': 0.0003884068164367566, 'samples': 9285312, 'steps': 48360, 'loss/train': 1.8961440324783325} 11/07/2021 04:03:19 - INFO - __main__ - Step 48362: {'lr': 0.00038840239713683165, 'samples': 9285504, 'steps': 48361, 'loss/train': 1.3289682865142822} 11/07/2021 04:03:20 - INFO - __main__ - Step 48363: {'lr': 0.0003883979777745449, 'samples': 9285696, 'steps': 48362, 'loss/train': 1.716339111328125} 11/07/2021 04:03:20 - INFO - __main__ - Step 48364: {'lr': 0.00038839355834989806, 'samples': 9285888, 'steps': 48363, 'loss/train': 1.509454369544983} 11/07/2021 04:03:21 - INFO - __main__ - Step 48365: {'lr': 0.0003883891388628932, 'samples': 9286080, 'steps': 48364, 'loss/train': 1.4280014038085938} 11/07/2021 04:03:21 - INFO - __main__ - Step 48366: {'lr': 0.0003883847193135323, 'samples': 9286272, 'steps': 48365, 'loss/train': 1.377169132232666} 11/07/2021 04:03:22 - INFO - __main__ - Step 48367: {'lr': 0.0003883802997018174, 'samples': 9286464, 'steps': 48366, 'loss/train': 0.9458029270172119} 11/07/2021 04:03:22 - INFO - __main__ - Step 48368: {'lr': 0.00038837588002775054, 'samples': 9286656, 'steps': 48367, 'loss/train': 1.3015624284744263} 11/07/2021 04:03:22 - INFO - __main__ - Step 48369: {'lr': 0.0003883714602913336, 'samples': 9286848, 'steps': 48368, 'loss/train': 1.306085467338562} 11/07/2021 04:03:23 - INFO - __main__ - Step 48370: {'lr': 0.00038836704049256864, 'samples': 9287040, 'steps': 48369, 'loss/train': 1.64092218875885} 11/07/2021 04:03:24 - INFO - __main__ - Step 48371: {'lr': 0.0003883626206314577, 'samples': 9287232, 'steps': 48370, 'loss/train': 1.1958248615264893} 11/07/2021 04:03:24 - INFO - __main__ - Step 48372: {'lr': 0.0003883582007080025, 'samples': 9287424, 'steps': 48371, 'loss/train': 1.6516435146331787} 11/07/2021 04:03:25 - INFO - __main__ - Step 48373: {'lr': 0.0003883537807222054, 'samples': 9287616, 'steps': 48372, 'loss/train': 5.795014381408691} 11/07/2021 04:03:25 - INFO - __main__ - Step 48374: {'lr': 0.0003883493606740681, 'samples': 9287808, 'steps': 48373, 'loss/train': 1.2726349830627441} 11/07/2021 04:03:26 - INFO - __main__ - Step 48375: {'lr': 0.0003883449405635928, 'samples': 9288000, 'steps': 48374, 'loss/train': 1.3027281761169434} 11/07/2021 04:03:26 - INFO - __main__ - Step 48376: {'lr': 0.0003883405203907814, 'samples': 9288192, 'steps': 48375, 'loss/train': 1.632309079170227} 11/07/2021 04:03:27 - INFO - __main__ - Step 48377: {'lr': 0.0003883361001556359, 'samples': 9288384, 'steps': 48376, 'loss/train': 1.4304492473602295} 11/07/2021 04:03:27 - INFO - __main__ - Step 48378: {'lr': 0.0003883316798581582, 'samples': 9288576, 'steps': 48377, 'loss/train': 1.9488767385482788} 11/07/2021 04:03:27 - INFO - __main__ - Step 48379: {'lr': 0.0003883272594983505, 'samples': 9288768, 'steps': 48378, 'loss/train': 1.4861701726913452} 11/07/2021 04:03:28 - INFO - __main__ - Step 48380: {'lr': 0.00038832283907621457, 'samples': 9288960, 'steps': 48379, 'loss/train': 1.530775547027588} 11/07/2021 04:03:29 - INFO - __main__ - Step 48381: {'lr': 0.00038831841859175253, 'samples': 9289152, 'steps': 48380, 'loss/train': 1.578474998474121} 11/07/2021 04:03:29 - INFO - __main__ - Step 48382: {'lr': 0.0003883139980449664, 'samples': 9289344, 'steps': 48381, 'loss/train': 1.6181931495666504} 11/07/2021 04:03:29 - INFO - __main__ - Step 48383: {'lr': 0.00038830957743585807, 'samples': 9289536, 'steps': 48382, 'loss/train': 1.1623398065567017} 11/07/2021 04:03:30 - INFO - __main__ - Step 48384: {'lr': 0.0003883051567644296, 'samples': 9289728, 'steps': 48383, 'loss/train': 1.6592649221420288} 11/07/2021 04:03:31 - INFO - __main__ - Step 48385: {'lr': 0.00038830073603068297, 'samples': 9289920, 'steps': 48384, 'loss/train': 1.9188312292099} 11/07/2021 04:03:31 - INFO - __main__ - Step 48386: {'lr': 0.00038829631523462003, 'samples': 9290112, 'steps': 48385, 'loss/train': 1.1328415870666504} 11/07/2021 04:03:32 - INFO - __main__ - Step 48387: {'lr': 0.000388291894376243, 'samples': 9290304, 'steps': 48386, 'loss/train': 0.8931834101676941} 11/07/2021 04:03:32 - INFO - __main__ - Step 48388: {'lr': 0.0003882874734555538, 'samples': 9290496, 'steps': 48387, 'loss/train': 0.7181499600410461} 11/07/2021 04:03:32 - INFO - __main__ - Step 48389: {'lr': 0.00038828305247255447, 'samples': 9290688, 'steps': 48388, 'loss/train': 1.7743628025054932} 11/07/2021 04:03:33 - INFO - __main__ - Step 48390: {'lr': 0.00038827863142724685, 'samples': 9290880, 'steps': 48389, 'loss/train': 1.139664888381958} 11/07/2021 04:03:34 - INFO - __main__ - Step 48391: {'lr': 0.00038827421031963294, 'samples': 9291072, 'steps': 48390, 'loss/train': 1.466491937637329} 11/07/2021 04:03:34 - INFO - __main__ - Step 48392: {'lr': 0.0003882697891497149, 'samples': 9291264, 'steps': 48391, 'loss/train': 0.5570085048675537} 11/07/2021 04:03:35 - INFO - __main__ - Step 48393: {'lr': 0.00038826536791749454, 'samples': 9291456, 'steps': 48392, 'loss/train': 1.432281494140625} 11/07/2021 04:03:35 - INFO - __main__ - Step 48394: {'lr': 0.00038826094662297404, 'samples': 9291648, 'steps': 48393, 'loss/train': 1.030002474784851} 11/07/2021 04:03:35 - INFO - __main__ - Step 48395: {'lr': 0.0003882565252661553, 'samples': 9291840, 'steps': 48394, 'loss/train': 1.17636239528656} 11/07/2021 04:03:36 - INFO - __main__ - Step 48396: {'lr': 0.00038825210384704024, 'samples': 9292032, 'steps': 48395, 'loss/train': 1.313944935798645} 11/07/2021 04:03:37 - INFO - __main__ - Step 48397: {'lr': 0.0003882476823656309, 'samples': 9292224, 'steps': 48396, 'loss/train': 1.5488330125808716} 11/07/2021 04:03:37 - INFO - __main__ - Step 48398: {'lr': 0.00038824326082192935, 'samples': 9292416, 'steps': 48397, 'loss/train': 1.6451550722122192} 11/07/2021 04:03:37 - INFO - __main__ - Step 48399: {'lr': 0.0003882388392159375, 'samples': 9292608, 'steps': 48398, 'loss/train': 1.3582696914672852} 11/07/2021 04:03:38 - INFO - __main__ - Step 48400: {'lr': 0.0003882344175476573, 'samples': 9292800, 'steps': 48399, 'loss/train': 1.2250689268112183} 11/07/2021 04:03:39 - INFO - __main__ - Step 48401: {'lr': 0.00038822999581709087, 'samples': 9292992, 'steps': 48400, 'loss/train': 1.4877376556396484} 11/07/2021 04:03:39 - INFO - __main__ - Step 48402: {'lr': 0.0003882255740242401, 'samples': 9293184, 'steps': 48401, 'loss/train': 1.5930143594741821} 11/07/2021 04:03:39 - INFO - __main__ - Step 48403: {'lr': 0.0003882211521691071, 'samples': 9293376, 'steps': 48402, 'loss/train': 1.1649909019470215} 11/07/2021 04:03:40 - INFO - __main__ - Step 48404: {'lr': 0.0003882167302516937, 'samples': 9293568, 'steps': 48403, 'loss/train': 1.1903587579727173} 11/07/2021 04:03:40 - INFO - __main__ - Step 48405: {'lr': 0.000388212308272002, 'samples': 9293760, 'steps': 48404, 'loss/train': 1.2432438135147095} 11/07/2021 04:03:41 - INFO - __main__ - Step 48406: {'lr': 0.00038820788623003397, 'samples': 9293952, 'steps': 48405, 'loss/train': 1.5587519407272339} 11/07/2021 04:03:41 - INFO - __main__ - Step 48407: {'lr': 0.00038820346412579156, 'samples': 9294144, 'steps': 48406, 'loss/train': 1.460087776184082} 11/07/2021 04:03:42 - INFO - __main__ - Step 48408: {'lr': 0.0003881990419592768, 'samples': 9294336, 'steps': 48407, 'loss/train': 1.2280032634735107} 11/07/2021 04:03:42 - INFO - __main__ - Step 48409: {'lr': 0.00038819461973049177, 'samples': 9294528, 'steps': 48408, 'loss/train': 1.981165885925293} 11/07/2021 04:03:43 - INFO - __main__ - Step 48410: {'lr': 0.00038819019743943834, 'samples': 9294720, 'steps': 48409, 'loss/train': 1.597578763961792} 11/07/2021 04:03:44 - INFO - __main__ - Step 48411: {'lr': 0.00038818577508611854, 'samples': 9294912, 'steps': 48410, 'loss/train': 1.4958175420761108} 11/07/2021 04:03:44 - INFO - __main__ - Step 48412: {'lr': 0.00038818135267053435, 'samples': 9295104, 'steps': 48411, 'loss/train': 1.221787691116333} 11/07/2021 04:03:44 - INFO - __main__ - Step 48413: {'lr': 0.00038817693019268775, 'samples': 9295296, 'steps': 48412, 'loss/train': 1.3178467750549316} 11/07/2021 04:03:45 - INFO - __main__ - Step 48414: {'lr': 0.0003881725076525808, 'samples': 9295488, 'steps': 48413, 'loss/train': 1.1740386486053467} 11/07/2021 04:03:45 - INFO - __main__ - Step 48415: {'lr': 0.0003881680850502154, 'samples': 9295680, 'steps': 48414, 'loss/train': 1.349912405014038} 11/07/2021 04:03:46 - INFO - __main__ - Step 48416: {'lr': 0.00038816366238559366, 'samples': 9295872, 'steps': 48415, 'loss/train': 1.637817621231079} 11/07/2021 04:03:47 - INFO - __main__ - Step 48417: {'lr': 0.00038815923965871747, 'samples': 9296064, 'steps': 48416, 'loss/train': 1.4408929347991943} 11/07/2021 04:03:47 - INFO - __main__ - Step 48418: {'lr': 0.00038815481686958883, 'samples': 9296256, 'steps': 48417, 'loss/train': 1.164324164390564} 11/07/2021 04:03:47 - INFO - __main__ - Step 48419: {'lr': 0.0003881503940182098, 'samples': 9296448, 'steps': 48418, 'loss/train': 1.32057785987854} 11/07/2021 04:03:48 - INFO - __main__ - Step 48420: {'lr': 0.0003881459711045823, 'samples': 9296640, 'steps': 48419, 'loss/train': 1.1111409664154053} 11/07/2021 04:03:48 - INFO - __main__ - Step 48421: {'lr': 0.0003881415481287084, 'samples': 9296832, 'steps': 48420, 'loss/train': 1.074395775794983} 11/07/2021 04:03:49 - INFO - __main__ - Step 48422: {'lr': 0.00038813712509058995, 'samples': 9297024, 'steps': 48421, 'loss/train': 1.415626883506775} 11/07/2021 04:03:49 - INFO - __main__ - Step 48423: {'lr': 0.0003881327019902292, 'samples': 9297216, 'steps': 48422, 'loss/train': 1.8866455554962158} 11/07/2021 04:03:50 - INFO - __main__ - Step 48424: {'lr': 0.00038812827882762793, 'samples': 9297408, 'steps': 48423, 'loss/train': 1.7330360412597656} 11/07/2021 04:03:50 - INFO - __main__ - Step 48425: {'lr': 0.00038812385560278815, 'samples': 9297600, 'steps': 48424, 'loss/train': 1.602062702178955} 11/07/2021 04:03:50 - INFO - __main__ - Step 48426: {'lr': 0.0003881194323157119, 'samples': 9297792, 'steps': 48425, 'loss/train': 1.510677695274353} 11/07/2021 04:03:51 - INFO - __main__ - Step 48427: {'lr': 0.00038811500896640116, 'samples': 9297984, 'steps': 48426, 'loss/train': 1.292872428894043} 11/07/2021 04:03:52 - INFO - __main__ - Step 48428: {'lr': 0.0003881105855548579, 'samples': 9298176, 'steps': 48427, 'loss/train': 1.6281934976577759} 11/07/2021 04:03:52 - INFO - __main__ - Step 48429: {'lr': 0.00038810616208108416, 'samples': 9298368, 'steps': 48428, 'loss/train': 1.389033555984497} 11/07/2021 04:03:52 - INFO - __main__ - Step 48430: {'lr': 0.00038810173854508204, 'samples': 9298560, 'steps': 48429, 'loss/train': 1.1105941534042358} 11/07/2021 04:03:53 - INFO - __main__ - Step 48431: {'lr': 0.0003880973149468533, 'samples': 9298752, 'steps': 48430, 'loss/train': 1.383050799369812} 11/07/2021 04:03:54 - INFO - __main__ - Step 48432: {'lr': 0.00038809289128640003, 'samples': 9298944, 'steps': 48431, 'loss/train': 1.530485987663269} 11/07/2021 04:03:54 - INFO - __main__ - Step 48433: {'lr': 0.00038808846756372426, 'samples': 9299136, 'steps': 48432, 'loss/train': 1.4259238243103027} 11/07/2021 04:03:54 - INFO - __main__ - Step 48434: {'lr': 0.0003880840437788279, 'samples': 9299328, 'steps': 48433, 'loss/train': 1.7784948348999023} 11/07/2021 04:03:55 - INFO - __main__ - Step 48435: {'lr': 0.00038807961993171306, 'samples': 9299520, 'steps': 48434, 'loss/train': 1.703410267829895} 11/07/2021 04:03:55 - INFO - __main__ - Step 48436: {'lr': 0.00038807519602238174, 'samples': 9299712, 'steps': 48435, 'loss/train': 1.0404880046844482} 11/07/2021 04:03:56 - INFO - __main__ - Step 48437: {'lr': 0.00038807077205083577, 'samples': 9299904, 'steps': 48436, 'loss/train': 1.5217667818069458} 11/07/2021 04:03:56 - INFO - __main__ - Step 48438: {'lr': 0.0003880663480170772, 'samples': 9300096, 'steps': 48437, 'loss/train': 1.5590276718139648} 11/07/2021 04:03:57 - INFO - __main__ - Step 48439: {'lr': 0.00038806192392110817, 'samples': 9300288, 'steps': 48438, 'loss/train': 1.4246101379394531} 11/07/2021 04:03:57 - INFO - __main__ - Step 48440: {'lr': 0.0003880574997629305, 'samples': 9300480, 'steps': 48439, 'loss/train': 1.5377458333969116} 11/07/2021 04:03:58 - INFO - __main__ - Step 48441: {'lr': 0.0003880530755425462, 'samples': 9300672, 'steps': 48440, 'loss/train': 1.6214735507965088} 11/07/2021 04:03:58 - INFO - __main__ - Step 48442: {'lr': 0.0003880486512599574, 'samples': 9300864, 'steps': 48441, 'loss/train': 0.9166438579559326} 11/07/2021 04:03:59 - INFO - __main__ - Step 48443: {'lr': 0.00038804422691516606, 'samples': 9301056, 'steps': 48442, 'loss/train': 1.3133119344711304} 11/07/2021 04:03:59 - INFO - __main__ - Step 48444: {'lr': 0.0003880398025081741, 'samples': 9301248, 'steps': 48443, 'loss/train': 1.359276533126831} 11/07/2021 04:04:00 - INFO - __main__ - Step 48445: {'lr': 0.0003880353780389834, 'samples': 9301440, 'steps': 48444, 'loss/train': 0.9887827634811401} 11/07/2021 04:04:00 - INFO - __main__ - Step 48446: {'lr': 0.0003880309535075962, 'samples': 9301632, 'steps': 48445, 'loss/train': 1.2172267436981201} 11/07/2021 04:04:01 - INFO - __main__ - Step 48447: {'lr': 0.00038802652891401434, 'samples': 9301824, 'steps': 48446, 'loss/train': 2.194840669631958} 11/07/2021 04:04:01 - INFO - __main__ - Step 48448: {'lr': 0.0003880221042582399, 'samples': 9302016, 'steps': 48447, 'loss/train': 1.3867621421813965} 11/07/2021 04:04:02 - INFO - __main__ - Step 48449: {'lr': 0.0003880176795402748, 'samples': 9302208, 'steps': 48448, 'loss/train': 1.2776654958724976} 11/07/2021 04:04:02 - INFO - __main__ - Step 48450: {'lr': 0.00038801325476012113, 'samples': 9302400, 'steps': 48449, 'loss/train': 1.5412228107452393} 11/07/2021 04:04:02 - INFO - __main__ - Step 48451: {'lr': 0.00038800882991778073, 'samples': 9302592, 'steps': 48450, 'loss/train': 1.4281809329986572} 11/07/2021 04:04:03 - INFO - __main__ - Step 48452: {'lr': 0.00038800440501325574, 'samples': 9302784, 'steps': 48451, 'loss/train': 1.0717188119888306} 11/07/2021 04:04:04 - INFO - __main__ - Step 48453: {'lr': 0.000387999980046548, 'samples': 9302976, 'steps': 48452, 'loss/train': 1.6209484338760376} 11/07/2021 04:04:04 - INFO - __main__ - Step 48454: {'lr': 0.0003879955550176597, 'samples': 9303168, 'steps': 48453, 'loss/train': 2.196815252304077} 11/07/2021 04:04:05 - INFO - __main__ - Step 48455: {'lr': 0.00038799112992659267, 'samples': 9303360, 'steps': 48454, 'loss/train': 1.5383121967315674} 11/07/2021 04:04:05 - INFO - __main__ - Step 48456: {'lr': 0.00038798670477334894, 'samples': 9303552, 'steps': 48455, 'loss/train': 1.4590588808059692} 11/07/2021 04:04:05 - INFO - __main__ - Step 48457: {'lr': 0.00038798227955793066, 'samples': 9303744, 'steps': 48456, 'loss/train': 1.6237387657165527} 11/07/2021 04:04:06 - INFO - __main__ - Step 48458: {'lr': 0.0003879778542803396, 'samples': 9303936, 'steps': 48457, 'loss/train': 1.506496787071228} 11/07/2021 04:04:07 - INFO - __main__ - Step 48459: {'lr': 0.00038797342894057783, 'samples': 9304128, 'steps': 48458, 'loss/train': 0.8236624002456665} 11/07/2021 04:04:07 - INFO - __main__ - Step 48460: {'lr': 0.0003879690035386474, 'samples': 9304320, 'steps': 48459, 'loss/train': 1.6890909671783447} 11/07/2021 04:04:07 - INFO - __main__ - Step 48461: {'lr': 0.0003879645780745503, 'samples': 9304512, 'steps': 48460, 'loss/train': 1.3325366973876953} 11/07/2021 04:04:08 - INFO - __main__ - Step 48462: {'lr': 0.0003879601525482884, 'samples': 9304704, 'steps': 48461, 'loss/train': 1.5774303674697876} 11/07/2021 04:04:09 - INFO - __main__ - Step 48463: {'lr': 0.00038795572695986394, 'samples': 9304896, 'steps': 48462, 'loss/train': 1.639383316040039} 11/07/2021 04:04:09 - INFO - __main__ - Step 48464: {'lr': 0.00038795130130927857, 'samples': 9305088, 'steps': 48463, 'loss/train': 1.8844096660614014} 11/07/2021 04:04:09 - INFO - __main__ - Step 48465: {'lr': 0.0003879468755965346, 'samples': 9305280, 'steps': 48464, 'loss/train': 1.598170518875122} 11/07/2021 04:04:10 - INFO - __main__ - Step 48466: {'lr': 0.00038794244982163383, 'samples': 9305472, 'steps': 48465, 'loss/train': 1.6323853731155396} 11/07/2021 04:04:10 - INFO - __main__ - Step 48467: {'lr': 0.0003879380239845783, 'samples': 9305664, 'steps': 48466, 'loss/train': 0.7035285830497742} 11/07/2021 04:04:11 - INFO - __main__ - Step 48468: {'lr': 0.0003879335980853701, 'samples': 9305856, 'steps': 48467, 'loss/train': 1.063156008720398} 11/07/2021 04:04:12 - INFO - __main__ - Step 48469: {'lr': 0.00038792917212401114, 'samples': 9306048, 'steps': 48468, 'loss/train': 1.3900890350341797} 11/07/2021 04:04:12 - INFO - __main__ - Step 48470: {'lr': 0.0003879247461005034, 'samples': 9306240, 'steps': 48469, 'loss/train': 1.5695993900299072} 11/07/2021 04:04:12 - INFO - __main__ - Step 48471: {'lr': 0.0003879203200148489, 'samples': 9306432, 'steps': 48470, 'loss/train': 0.6920669674873352} 11/07/2021 04:04:13 - INFO - __main__ - Step 48472: {'lr': 0.0003879158938670496, 'samples': 9306624, 'steps': 48471, 'loss/train': 1.632412314414978} 11/07/2021 04:04:14 - INFO - __main__ - Step 48473: {'lr': 0.0003879114676571076, 'samples': 9306816, 'steps': 48472, 'loss/train': 1.1928110122680664} 11/07/2021 04:04:14 - INFO - __main__ - Step 48474: {'lr': 0.00038790704138502475, 'samples': 9307008, 'steps': 48473, 'loss/train': 1.233225703239441} 11/07/2021 04:04:14 - INFO - __main__ - Step 48475: {'lr': 0.0003879026150508032, 'samples': 9307200, 'steps': 48474, 'loss/train': 1.964782476425171} 11/07/2021 04:04:15 - INFO - __main__ - Step 48476: {'lr': 0.00038789818865444473, 'samples': 9307392, 'steps': 48475, 'loss/train': 2.022186040878296} 11/07/2021 04:04:15 - INFO - __main__ - Step 48477: {'lr': 0.0003878937621959516, 'samples': 9307584, 'steps': 48476, 'loss/train': 1.3771586418151855} 11/07/2021 04:04:16 - INFO - __main__ - Step 48478: {'lr': 0.0003878893356753256, 'samples': 9307776, 'steps': 48477, 'loss/train': 1.628832221031189} 11/07/2021 04:04:16 - INFO - __main__ - Step 48479: {'lr': 0.0003878849090925688, 'samples': 9307968, 'steps': 48478, 'loss/train': 1.2131322622299194} 11/07/2021 04:04:17 - INFO - __main__ - Step 48480: {'lr': 0.00038788048244768316, 'samples': 9308160, 'steps': 48479, 'loss/train': 1.4445890188217163} 11/07/2021 04:04:17 - INFO - __main__ - Step 48481: {'lr': 0.00038787605574067076, 'samples': 9308352, 'steps': 48480, 'loss/train': 1.3065617084503174} 11/07/2021 04:04:18 - INFO - __main__ - Step 48482: {'lr': 0.0003878716289715335, 'samples': 9308544, 'steps': 48481, 'loss/train': 1.4773340225219727} 11/07/2021 04:04:18 - INFO - __main__ - Step 48483: {'lr': 0.0003878672021402734, 'samples': 9308736, 'steps': 48482, 'loss/train': 1.5814417600631714} 11/07/2021 04:04:19 - INFO - __main__ - Step 48484: {'lr': 0.00038786277524689245, 'samples': 9308928, 'steps': 48483, 'loss/train': 1.6066398620605469} 11/07/2021 04:04:19 - INFO - __main__ - Step 48485: {'lr': 0.0003878583482913927, 'samples': 9309120, 'steps': 48484, 'loss/train': 1.5111255645751953} 11/07/2021 04:04:20 - INFO - __main__ - Step 48486: {'lr': 0.00038785392127377603, 'samples': 9309312, 'steps': 48485, 'loss/train': 1.7543655633926392} 11/07/2021 04:04:20 - INFO - __main__ - Step 48487: {'lr': 0.0003878494941940447, 'samples': 9309504, 'steps': 48486, 'loss/train': 1.267889380455017} 11/07/2021 04:04:20 - INFO - __main__ - Step 48488: {'lr': 0.0003878450670522004, 'samples': 9309696, 'steps': 48487, 'loss/train': 1.1851766109466553} 11/07/2021 04:04:21 - INFO - __main__ - Step 48489: {'lr': 0.00038784063984824516, 'samples': 9309888, 'steps': 48488, 'loss/train': 1.4379669427871704} 11/07/2021 04:04:22 - INFO - __main__ - Step 48490: {'lr': 0.00038783621258218115, 'samples': 9310080, 'steps': 48489, 'loss/train': 1.6245023012161255} 11/07/2021 04:04:22 - INFO - __main__ - Step 48491: {'lr': 0.00038783178525401025, 'samples': 9310272, 'steps': 48490, 'loss/train': 1.602535367012024} 11/07/2021 04:04:22 - INFO - __main__ - Step 48492: {'lr': 0.00038782735786373445, 'samples': 9310464, 'steps': 48491, 'loss/train': 1.8196521997451782} 11/07/2021 04:04:23 - INFO - __main__ - Step 48493: {'lr': 0.00038782293041135583, 'samples': 9310656, 'steps': 48492, 'loss/train': 1.6887304782867432} 11/07/2021 04:04:24 - INFO - __main__ - Step 48494: {'lr': 0.0003878185028968763, 'samples': 9310848, 'steps': 48493, 'loss/train': 1.529370665550232} 11/07/2021 04:04:24 - INFO - __main__ - Step 48495: {'lr': 0.00038781407532029785, 'samples': 9311040, 'steps': 48494, 'loss/train': 1.4546703100204468} 11/07/2021 04:04:25 - INFO - __main__ - Step 48496: {'lr': 0.0003878096476816225, 'samples': 9311232, 'steps': 48495, 'loss/train': 1.2742546796798706} 11/07/2021 04:04:25 - INFO - __main__ - Step 48497: {'lr': 0.0003878052199808523, 'samples': 9311424, 'steps': 48496, 'loss/train': 1.48627769947052} 11/07/2021 04:04:25 - INFO - __main__ - Step 48498: {'lr': 0.0003878007922179891, 'samples': 9311616, 'steps': 48497, 'loss/train': 1.6182938814163208} 11/07/2021 04:04:26 - INFO - __main__ - Step 48499: {'lr': 0.0003877963643930351, 'samples': 9311808, 'steps': 48498, 'loss/train': 1.3741947412490845} 11/07/2021 04:04:27 - INFO - __main__ - Step 48500: {'lr': 0.00038779193650599213, 'samples': 9312000, 'steps': 48499, 'loss/train': 1.2534310817718506} 11/07/2021 04:04:27 - INFO - __main__ - Step 48501: {'lr': 0.0003877875085568622, 'samples': 9312192, 'steps': 48500, 'loss/train': 1.628035306930542} 11/07/2021 04:04:27 - INFO - __main__ - Step 48502: {'lr': 0.0003877830805456474, 'samples': 9312384, 'steps': 48501, 'loss/train': 1.7516915798187256} 11/07/2021 04:04:28 - INFO - __main__ - Step 48503: {'lr': 0.00038777865247234967, 'samples': 9312576, 'steps': 48502, 'loss/train': 1.5887657403945923} 11/07/2021 04:04:28 - INFO - __main__ - Step 48504: {'lr': 0.00038777422433697106, 'samples': 9312768, 'steps': 48503, 'loss/train': 1.4785151481628418} 11/07/2021 04:04:29 - INFO - __main__ - Step 48505: {'lr': 0.00038776979613951347, 'samples': 9312960, 'steps': 48504, 'loss/train': 1.5877552032470703} 11/07/2021 04:04:29 - INFO - __main__ - Step 48506: {'lr': 0.00038776536787997885, 'samples': 9313152, 'steps': 48505, 'loss/train': 1.8600739240646362} 11/07/2021 04:04:30 - INFO - __main__ - Step 48507: {'lr': 0.0003877609395583693, 'samples': 9313344, 'steps': 48506, 'loss/train': 1.3988336324691772} 11/07/2021 04:04:30 - INFO - __main__ - Step 48508: {'lr': 0.0003877565111746869, 'samples': 9313536, 'steps': 48507, 'loss/train': 1.3312302827835083} 11/07/2021 04:04:31 - INFO - __main__ - Step 48509: {'lr': 0.00038775208272893346, 'samples': 9313728, 'steps': 48508, 'loss/train': 1.4225656986236572} 11/07/2021 04:04:32 - INFO - __main__ - Step 48510: {'lr': 0.0003877476542211111, 'samples': 9313920, 'steps': 48509, 'loss/train': 1.4918937683105469} 11/07/2021 04:04:33 - INFO - __main__ - Step 48511: {'lr': 0.0003877432256512218, 'samples': 9314112, 'steps': 48510, 'loss/train': 1.3282244205474854} 11/07/2021 04:04:33 - INFO - __main__ - Step 48512: {'lr': 0.00038773879701926747, 'samples': 9314304, 'steps': 48511, 'loss/train': 1.7111784219741821} 11/07/2021 04:04:33 - INFO - __main__ - Step 48513: {'lr': 0.0003877343683252501, 'samples': 9314496, 'steps': 48512, 'loss/train': 1.8285577297210693} 11/07/2021 04:04:34 - INFO - __main__ - Step 48514: {'lr': 0.00038772993956917183, 'samples': 9314688, 'steps': 48513, 'loss/train': 1.6901429891586304} 11/07/2021 04:04:34 - INFO - __main__ - Step 48515: {'lr': 0.00038772551075103457, 'samples': 9314880, 'steps': 48514, 'loss/train': 2.77165150642395} 11/07/2021 04:04:34 - INFO - __main__ - Step 48516: {'lr': 0.00038772108187084034, 'samples': 9315072, 'steps': 48515, 'loss/train': 2.760432004928589} 11/07/2021 04:04:35 - INFO - __main__ - Step 48517: {'lr': 0.00038771665292859116, 'samples': 9315264, 'steps': 48516, 'loss/train': 1.6580708026885986} 11/07/2021 04:04:36 - INFO - __main__ - Step 48518: {'lr': 0.00038771222392428885, 'samples': 9315456, 'steps': 48517, 'loss/train': 1.2777594327926636} 11/07/2021 04:04:36 - INFO - __main__ - Step 48519: {'lr': 0.0003877077948579356, 'samples': 9315648, 'steps': 48518, 'loss/train': 1.2819809913635254} 11/07/2021 04:04:36 - INFO - __main__ - Step 48520: {'lr': 0.00038770336572953334, 'samples': 9315840, 'steps': 48519, 'loss/train': 1.8734691143035889} 11/07/2021 04:04:37 - INFO - __main__ - Step 48521: {'lr': 0.00038769893653908404, 'samples': 9316032, 'steps': 48520, 'loss/train': 1.5955742597579956} 11/07/2021 04:04:38 - INFO - __main__ - Step 48522: {'lr': 0.0003876945072865898, 'samples': 9316224, 'steps': 48521, 'loss/train': 1.3457098007202148} 11/07/2021 04:04:38 - INFO - __main__ - Step 48523: {'lr': 0.0003876900779720525, 'samples': 9316416, 'steps': 48522, 'loss/train': 1.5795559883117676} 11/07/2021 04:04:39 - INFO - __main__ - Step 48524: {'lr': 0.0003876856485954742, 'samples': 9316608, 'steps': 48523, 'loss/train': 1.3283982276916504} 11/07/2021 04:04:39 - INFO - __main__ - Step 48525: {'lr': 0.00038768121915685685, 'samples': 9316800, 'steps': 48524, 'loss/train': 1.6154831647872925} 11/07/2021 04:04:39 - INFO - __main__ - Step 48526: {'lr': 0.00038767678965620245, 'samples': 9316992, 'steps': 48525, 'loss/train': 1.2984143495559692} 11/07/2021 04:04:40 - INFO - __main__ - Step 48527: {'lr': 0.00038767236009351304, 'samples': 9317184, 'steps': 48526, 'loss/train': 1.514677882194519} 11/07/2021 04:04:41 - INFO - __main__ - Step 48528: {'lr': 0.00038766793046879057, 'samples': 9317376, 'steps': 48527, 'loss/train': 0.9569680094718933} 11/07/2021 04:04:41 - INFO - __main__ - Step 48529: {'lr': 0.000387663500782037, 'samples': 9317568, 'steps': 48528, 'loss/train': 1.884013295173645} 11/07/2021 04:04:41 - INFO - __main__ - Step 48530: {'lr': 0.00038765907103325447, 'samples': 9317760, 'steps': 48529, 'loss/train': 1.5891244411468506} 11/07/2021 04:04:42 - INFO - __main__ - Step 48531: {'lr': 0.00038765464122244485, 'samples': 9317952, 'steps': 48530, 'loss/train': 1.4183497428894043} 11/07/2021 04:04:42 - INFO - __main__ - Step 48532: {'lr': 0.0003876502113496102, 'samples': 9318144, 'steps': 48531, 'loss/train': 1.2370476722717285} 11/07/2021 04:04:43 - INFO - __main__ - Step 48533: {'lr': 0.00038764578141475245, 'samples': 9318336, 'steps': 48532, 'loss/train': 1.615739345550537} 11/07/2021 04:04:43 - INFO - __main__ - Step 48534: {'lr': 0.0003876413514178736, 'samples': 9318528, 'steps': 48533, 'loss/train': 1.2059228420257568} 11/07/2021 04:04:44 - INFO - __main__ - Step 48535: {'lr': 0.0003876369213589758, 'samples': 9318720, 'steps': 48534, 'loss/train': 1.2580654621124268} 11/07/2021 04:04:44 - INFO - __main__ - Step 48536: {'lr': 0.0003876324912380608, 'samples': 9318912, 'steps': 48535, 'loss/train': 1.6176378726959229} 11/07/2021 04:04:44 - INFO - __main__ - Step 48537: {'lr': 0.00038762806105513084, 'samples': 9319104, 'steps': 48536, 'loss/train': 1.6030935049057007} 11/07/2021 04:04:45 - INFO - __main__ - Step 48538: {'lr': 0.0003876236308101877, 'samples': 9319296, 'steps': 48537, 'loss/train': 1.8924974203109741} 11/07/2021 04:04:46 - INFO - __main__ - Step 48539: {'lr': 0.0003876192005032335, 'samples': 9319488, 'steps': 48538, 'loss/train': 0.9740433692932129} 11/07/2021 04:04:46 - INFO - __main__ - Step 48540: {'lr': 0.00038761477013427026, 'samples': 9319680, 'steps': 48539, 'loss/train': 1.4934413433074951} 11/07/2021 04:04:47 - INFO - __main__ - Step 48541: {'lr': 0.00038761033970329987, 'samples': 9319872, 'steps': 48540, 'loss/train': 1.0842349529266357} 11/07/2021 04:04:47 - INFO - __main__ - Step 48542: {'lr': 0.00038760590921032445, 'samples': 9320064, 'steps': 48541, 'loss/train': 1.3521828651428223} 11/07/2021 04:04:48 - INFO - __main__ - Step 48543: {'lr': 0.0003876014786553459, 'samples': 9320256, 'steps': 48542, 'loss/train': 1.6854872703552246} 11/07/2021 04:04:48 - INFO - __main__ - Step 48544: {'lr': 0.00038759704803836625, 'samples': 9320448, 'steps': 48543, 'loss/train': 1.2868236303329468} 11/07/2021 04:04:49 - INFO - __main__ - Step 48545: {'lr': 0.00038759261735938743, 'samples': 9320640, 'steps': 48544, 'loss/train': 1.4686287641525269} 11/07/2021 04:04:49 - INFO - __main__ - Step 48546: {'lr': 0.00038758818661841155, 'samples': 9320832, 'steps': 48545, 'loss/train': 1.323473572731018} 11/07/2021 04:04:49 - INFO - __main__ - Step 48547: {'lr': 0.0003875837558154406, 'samples': 9321024, 'steps': 48546, 'loss/train': 1.3346006870269775} 11/07/2021 04:04:50 - INFO - __main__ - Step 48548: {'lr': 0.0003875793249504765, 'samples': 9321216, 'steps': 48547, 'loss/train': 1.331971526145935} 11/07/2021 04:04:51 - INFO - __main__ - Step 48549: {'lr': 0.00038757489402352124, 'samples': 9321408, 'steps': 48548, 'loss/train': 2.03489089012146} 11/07/2021 04:04:51 - INFO - __main__ - Step 48550: {'lr': 0.0003875704630345769, 'samples': 9321600, 'steps': 48549, 'loss/train': 1.404166579246521} 11/07/2021 04:04:51 - INFO - __main__ - Step 48551: {'lr': 0.00038756603198364544, 'samples': 9321792, 'steps': 48550, 'loss/train': 1.6891363859176636} 11/07/2021 04:04:52 - INFO - __main__ - Step 48552: {'lr': 0.0003875616008707288, 'samples': 9321984, 'steps': 48551, 'loss/train': 1.9975979328155518} 11/07/2021 04:04:53 - INFO - __main__ - Step 48553: {'lr': 0.00038755716969582913, 'samples': 9322176, 'steps': 48552, 'loss/train': 1.307058572769165} 11/07/2021 04:04:53 - INFO - __main__ - Step 48554: {'lr': 0.0003875527384589482, 'samples': 9322368, 'steps': 48553, 'loss/train': 1.2819502353668213} 11/07/2021 04:04:54 - INFO - __main__ - Step 48555: {'lr': 0.00038754830716008815, 'samples': 9322560, 'steps': 48554, 'loss/train': 1.1141554117202759} 11/07/2021 04:04:54 - INFO - __main__ - Step 48556: {'lr': 0.000387543875799251, 'samples': 9322752, 'steps': 48555, 'loss/train': 1.530539631843567} 11/07/2021 04:04:54 - INFO - __main__ - Step 48557: {'lr': 0.0003875394443764387, 'samples': 9322944, 'steps': 48556, 'loss/train': 2.102464199066162} 11/07/2021 04:04:55 - INFO - __main__ - Step 48558: {'lr': 0.00038753501289165324, 'samples': 9323136, 'steps': 48557, 'loss/train': 1.6528323888778687} 11/07/2021 04:04:56 - INFO - __main__ - Step 48559: {'lr': 0.0003875305813448966, 'samples': 9323328, 'steps': 48558, 'loss/train': 1.4500070810317993} 11/07/2021 04:04:56 - INFO - __main__ - Step 48560: {'lr': 0.00038752614973617085, 'samples': 9323520, 'steps': 48559, 'loss/train': 1.2675766944885254} 11/07/2021 04:04:56 - INFO - __main__ - Step 48561: {'lr': 0.0003875217180654779, 'samples': 9323712, 'steps': 48560, 'loss/train': 0.9635229706764221} 11/07/2021 04:04:57 - INFO - __main__ - Step 48562: {'lr': 0.00038751728633281974, 'samples': 9323904, 'steps': 48561, 'loss/train': 1.6561272144317627} 11/07/2021 04:04:57 - INFO - __main__ - Step 48563: {'lr': 0.00038751285453819846, 'samples': 9324096, 'steps': 48562, 'loss/train': 1.4693461656570435} 11/07/2021 04:04:59 - INFO - __main__ - Step 48564: {'lr': 0.000387508422681616, 'samples': 9324288, 'steps': 48563, 'loss/train': 1.416921854019165} 11/07/2021 04:04:59 - INFO - __main__ - Step 48565: {'lr': 0.0003875039907630744, 'samples': 9324480, 'steps': 48564, 'loss/train': 1.456447720527649} 11/07/2021 04:04:59 - INFO - __main__ - Step 48566: {'lr': 0.0003874995587825756, 'samples': 9324672, 'steps': 48565, 'loss/train': 1.5661808252334595} 11/07/2021 04:05:00 - INFO - __main__ - Step 48567: {'lr': 0.00038749512674012167, 'samples': 9324864, 'steps': 48566, 'loss/train': 1.9703129529953003} 11/07/2021 04:05:00 - INFO - __main__ - Step 48568: {'lr': 0.0003874906946357145, 'samples': 9325056, 'steps': 48567, 'loss/train': 1.4652469158172607} 11/07/2021 04:05:01 - INFO - __main__ - Step 48569: {'lr': 0.00038748626246935613, 'samples': 9325248, 'steps': 48568, 'loss/train': 1.4368247985839844} 11/07/2021 04:05:01 - INFO - __main__ - Step 48570: {'lr': 0.0003874818302410486, 'samples': 9325440, 'steps': 48569, 'loss/train': 0.7973642349243164} 11/07/2021 04:05:02 - INFO - __main__ - Step 48571: {'lr': 0.00038747739795079396, 'samples': 9325632, 'steps': 48570, 'loss/train': 1.7440383434295654} 11/07/2021 04:05:02 - INFO - __main__ - Step 48572: {'lr': 0.000387472965598594, 'samples': 9325824, 'steps': 48571, 'loss/train': 1.2948087453842163} 11/07/2021 04:05:03 - INFO - __main__ - Step 48573: {'lr': 0.0003874685331844509, 'samples': 9326016, 'steps': 48572, 'loss/train': 1.4181630611419678} 11/07/2021 04:05:03 - INFO - __main__ - Step 48574: {'lr': 0.0003874641007083666, 'samples': 9326208, 'steps': 48573, 'loss/train': 1.457755208015442} 11/07/2021 04:05:05 - INFO - __main__ - Step 48575: {'lr': 0.00038745966817034305, 'samples': 9326400, 'steps': 48574, 'loss/train': 1.319076418876648} 11/07/2021 04:05:05 - INFO - __main__ - Step 48576: {'lr': 0.0003874552355703823, 'samples': 9326592, 'steps': 48575, 'loss/train': 0.9433940052986145} 11/07/2021 04:05:05 - INFO - __main__ - Step 48577: {'lr': 0.00038745080290848635, 'samples': 9326784, 'steps': 48576, 'loss/train': 1.485592246055603} 11/07/2021 04:05:06 - INFO - __main__ - Step 48578: {'lr': 0.0003874463701846573, 'samples': 9326976, 'steps': 48577, 'loss/train': 0.6104253530502319} 11/07/2021 04:05:06 - INFO - __main__ - Step 48579: {'lr': 0.0003874419373988969, 'samples': 9327168, 'steps': 48578, 'loss/train': 1.7105915546417236} 11/07/2021 04:05:06 - INFO - __main__ - Step 48580: {'lr': 0.0003874375045512073, 'samples': 9327360, 'steps': 48579, 'loss/train': 1.3981499671936035} 11/07/2021 04:05:07 - INFO - __main__ - Step 48581: {'lr': 0.0003874330716415905, 'samples': 9327552, 'steps': 48580, 'loss/train': 1.525861382484436} 11/07/2021 04:05:08 - INFO - __main__ - Step 48582: {'lr': 0.00038742863867004853, 'samples': 9327744, 'steps': 48581, 'loss/train': 1.4717320203781128} 11/07/2021 04:05:08 - INFO - __main__ - Step 48583: {'lr': 0.0003874242056365833, 'samples': 9327936, 'steps': 48582, 'loss/train': 1.9312816858291626} 11/07/2021 04:05:08 - INFO - __main__ - Step 48584: {'lr': 0.0003874197725411969, 'samples': 9328128, 'steps': 48583, 'loss/train': 0.9575197100639343} 11/07/2021 04:05:09 - INFO - __main__ - Step 48585: {'lr': 0.00038741533938389117, 'samples': 9328320, 'steps': 48584, 'loss/train': 1.4876394271850586} 11/07/2021 04:05:10 - INFO - __main__ - Step 48586: {'lr': 0.00038741090616466824, 'samples': 9328512, 'steps': 48585, 'loss/train': 1.5372319221496582} 11/07/2021 04:05:10 - INFO - __main__ - Step 48587: {'lr': 0.0003874064728835301, 'samples': 9328704, 'steps': 48586, 'loss/train': 1.9714494943618774} 11/07/2021 04:05:10 - INFO - __main__ - Step 48588: {'lr': 0.0003874020395404787, 'samples': 9328896, 'steps': 48587, 'loss/train': 1.817679524421692} 11/07/2021 04:05:11 - INFO - __main__ - Step 48589: {'lr': 0.00038739760613551606, 'samples': 9329088, 'steps': 48588, 'loss/train': 1.5934373140335083} 11/07/2021 04:05:11 - INFO - __main__ - Step 48590: {'lr': 0.0003873931726686442, 'samples': 9329280, 'steps': 48589, 'loss/train': 1.5713540315628052} 11/07/2021 04:05:12 - INFO - __main__ - Step 48591: {'lr': 0.0003873887391398651, 'samples': 9329472, 'steps': 48590, 'loss/train': 1.4419883489608765} 11/07/2021 04:05:13 - INFO - __main__ - Step 48592: {'lr': 0.0003873843055491807, 'samples': 9329664, 'steps': 48591, 'loss/train': 1.312111496925354} 11/07/2021 04:05:13 - INFO - __main__ - Step 48593: {'lr': 0.00038737987189659315, 'samples': 9329856, 'steps': 48592, 'loss/train': 1.7823280096054077} 11/07/2021 04:05:13 - INFO - __main__ - Step 48594: {'lr': 0.00038737543818210423, 'samples': 9330048, 'steps': 48593, 'loss/train': 1.4560304880142212} 11/07/2021 04:05:14 - INFO - __main__ - Step 48595: {'lr': 0.00038737100440571615, 'samples': 9330240, 'steps': 48594, 'loss/train': 1.3260122537612915} 11/07/2021 04:05:15 - INFO - __main__ - Step 48596: {'lr': 0.00038736657056743075, 'samples': 9330432, 'steps': 48595, 'loss/train': 1.53359854221344} 11/07/2021 04:05:15 - INFO - __main__ - Step 48597: {'lr': 0.0003873621366672502, 'samples': 9330624, 'steps': 48596, 'loss/train': 1.49400794506073} 11/07/2021 04:05:15 - INFO - __main__ - Step 48598: {'lr': 0.0003873577027051763, 'samples': 9330816, 'steps': 48597, 'loss/train': 1.5908323526382446} 11/07/2021 04:05:16 - INFO - __main__ - Step 48599: {'lr': 0.0003873532686812111, 'samples': 9331008, 'steps': 48598, 'loss/train': 1.642113447189331} 11/07/2021 04:05:16 - INFO - __main__ - Step 48600: {'lr': 0.0003873488345953567, 'samples': 9331200, 'steps': 48599, 'loss/train': 1.4471324682235718} 11/07/2021 04:05:17 - INFO - __main__ - Step 48601: {'lr': 0.00038734440044761503, 'samples': 9331392, 'steps': 48600, 'loss/train': 0.10087435692548752} 11/07/2021 04:05:17 - INFO - __main__ - Step 48602: {'lr': 0.0003873399662379881, 'samples': 9331584, 'steps': 48601, 'loss/train': 1.194433569908142} 11/07/2021 04:05:18 - INFO - __main__ - Step 48603: {'lr': 0.00038733553196647786, 'samples': 9331776, 'steps': 48602, 'loss/train': 1.6700736284255981} 11/07/2021 04:05:18 - INFO - __main__ - Step 48604: {'lr': 0.00038733109763308644, 'samples': 9331968, 'steps': 48603, 'loss/train': 1.0045156478881836} 11/07/2021 04:05:18 - INFO - __main__ - Step 48605: {'lr': 0.0003873266632378157, 'samples': 9332160, 'steps': 48604, 'loss/train': 1.76278555393219} 11/07/2021 04:05:19 - INFO - __main__ - Step 48606: {'lr': 0.00038732222878066764, 'samples': 9332352, 'steps': 48605, 'loss/train': 1.3059808015823364} 11/07/2021 04:05:20 - INFO - __main__ - Step 48607: {'lr': 0.0003873177942616444, 'samples': 9332544, 'steps': 48606, 'loss/train': 1.3572828769683838} 11/07/2021 04:05:20 - INFO - __main__ - Step 48608: {'lr': 0.0003873133596807478, 'samples': 9332736, 'steps': 48607, 'loss/train': 1.509686827659607} 11/07/2021 04:05:21 - INFO - __main__ - Step 48609: {'lr': 0.00038730892503797986, 'samples': 9332928, 'steps': 48608, 'loss/train': 1.3287807703018188} 11/07/2021 04:05:21 - INFO - __main__ - Step 48610: {'lr': 0.00038730449033334277, 'samples': 9333120, 'steps': 48609, 'loss/train': 1.5132238864898682} 11/07/2021 04:05:21 - INFO - __main__ - Step 48611: {'lr': 0.00038730005556683833, 'samples': 9333312, 'steps': 48610, 'loss/train': 1.4505895376205444} 11/07/2021 04:05:22 - INFO - __main__ - Step 48612: {'lr': 0.00038729562073846856, 'samples': 9333504, 'steps': 48611, 'loss/train': 1.8530577421188354} 11/07/2021 04:05:23 - INFO - __main__ - Step 48613: {'lr': 0.00038729118584823557, 'samples': 9333696, 'steps': 48612, 'loss/train': 1.5153831243515015} 11/07/2021 04:05:23 - INFO - __main__ - Step 48614: {'lr': 0.0003872867508961413, 'samples': 9333888, 'steps': 48613, 'loss/train': 1.310623288154602} 11/07/2021 04:05:23 - INFO - __main__ - Step 48615: {'lr': 0.00038728231588218767, 'samples': 9334080, 'steps': 48614, 'loss/train': 1.5036399364471436} 11/07/2021 04:05:24 - INFO - __main__ - Step 48616: {'lr': 0.00038727788080637684, 'samples': 9334272, 'steps': 48615, 'loss/train': 1.897681713104248} 11/07/2021 04:05:25 - INFO - __main__ - Step 48617: {'lr': 0.00038727344566871064, 'samples': 9334464, 'steps': 48616, 'loss/train': 2.2115097045898438} 11/07/2021 04:05:25 - INFO - __main__ - Step 48618: {'lr': 0.00038726901046919114, 'samples': 9334656, 'steps': 48617, 'loss/train': 1.8675711154937744} 11/07/2021 04:05:26 - INFO - __main__ - Step 48619: {'lr': 0.00038726457520782046, 'samples': 9334848, 'steps': 48618, 'loss/train': 1.9020841121673584} 11/07/2021 04:05:26 - INFO - __main__ - Step 48620: {'lr': 0.00038726013988460027, 'samples': 9335040, 'steps': 48619, 'loss/train': 1.3993057012557983} 11/07/2021 04:05:26 - INFO - __main__ - Step 48621: {'lr': 0.00038725570449953296, 'samples': 9335232, 'steps': 48620, 'loss/train': 0.7827754020690918} 11/07/2021 04:05:27 - INFO - __main__ - Step 48622: {'lr': 0.0003872512690526203, 'samples': 9335424, 'steps': 48621, 'loss/train': 1.1186741590499878} 11/07/2021 04:05:28 - INFO - __main__ - Step 48623: {'lr': 0.0003872468335438643, 'samples': 9335616, 'steps': 48622, 'loss/train': 1.4439775943756104} 11/07/2021 04:05:28 - INFO - __main__ - Step 48624: {'lr': 0.000387242397973267, 'samples': 9335808, 'steps': 48623, 'loss/train': 1.7906427383422852} 11/07/2021 04:05:28 - INFO - __main__ - Step 48625: {'lr': 0.0003872379623408304, 'samples': 9336000, 'steps': 48624, 'loss/train': 1.492393970489502} 11/07/2021 04:05:29 - INFO - __main__ - Step 48626: {'lr': 0.0003872335266465565, 'samples': 9336192, 'steps': 48625, 'loss/train': 1.3923840522766113} 11/07/2021 04:05:30 - INFO - __main__ - Step 48627: {'lr': 0.00038722909089044735, 'samples': 9336384, 'steps': 48626, 'loss/train': 1.0367521047592163} 11/07/2021 04:05:30 - INFO - __main__ - Step 48628: {'lr': 0.0003872246550725048, 'samples': 9336576, 'steps': 48627, 'loss/train': 1.5686945915222168} 11/07/2021 04:05:30 - INFO - __main__ - Step 48629: {'lr': 0.000387220219192731, 'samples': 9336768, 'steps': 48628, 'loss/train': 1.4293190240859985} 11/07/2021 04:05:31 - INFO - __main__ - Step 48630: {'lr': 0.00038721578325112785, 'samples': 9336960, 'steps': 48629, 'loss/train': 2.342914342880249} 11/07/2021 04:05:31 - INFO - __main__ - Step 48631: {'lr': 0.00038721134724769733, 'samples': 9337152, 'steps': 48630, 'loss/train': 1.2137004137039185} 11/07/2021 04:05:32 - INFO - __main__ - Step 48632: {'lr': 0.00038720691118244164, 'samples': 9337344, 'steps': 48631, 'loss/train': 1.630367398262024} 11/07/2021 04:05:32 - INFO - __main__ - Step 48633: {'lr': 0.00038720247505536257, 'samples': 9337536, 'steps': 48632, 'loss/train': 1.723755955696106} 11/07/2021 04:05:33 - INFO - __main__ - Step 48634: {'lr': 0.0003871980388664621, 'samples': 9337728, 'steps': 48633, 'loss/train': 1.821600317955017} 11/07/2021 04:05:33 - INFO - __main__ - Step 48635: {'lr': 0.00038719360261574233, 'samples': 9337920, 'steps': 48634, 'loss/train': 1.6551693677902222} 11/07/2021 04:05:34 - INFO - __main__ - Step 48636: {'lr': 0.00038718916630320533, 'samples': 9338112, 'steps': 48635, 'loss/train': 1.846117615699768} 11/07/2021 04:05:34 - INFO - __main__ - Step 48637: {'lr': 0.0003871847299288529, 'samples': 9338304, 'steps': 48636, 'loss/train': 1.1855734586715698} 11/07/2021 04:05:35 - INFO - __main__ - Step 48638: {'lr': 0.00038718029349268723, 'samples': 9338496, 'steps': 48637, 'loss/train': 1.9610881805419922} 11/07/2021 04:05:35 - INFO - __main__ - Step 48639: {'lr': 0.00038717585699471024, 'samples': 9338688, 'steps': 48638, 'loss/train': 1.5299619436264038} 11/07/2021 04:05:36 - INFO - __main__ - Step 48640: {'lr': 0.0003871714204349239, 'samples': 9338880, 'steps': 48639, 'loss/train': 1.39769446849823} 11/07/2021 04:05:36 - INFO - __main__ - Step 48641: {'lr': 0.00038716698381333027, 'samples': 9339072, 'steps': 48640, 'loss/train': 1.4764127731323242} 11/07/2021 04:05:36 - INFO - __main__ - Step 48642: {'lr': 0.0003871625471299313, 'samples': 9339264, 'steps': 48641, 'loss/train': 1.362342357635498} 11/07/2021 04:05:37 - INFO - __main__ - Step 48643: {'lr': 0.00038715811038472894, 'samples': 9339456, 'steps': 48642, 'loss/train': 1.7338573932647705} 11/07/2021 04:05:38 - INFO - __main__ - Step 48644: {'lr': 0.0003871536735777252, 'samples': 9339648, 'steps': 48643, 'loss/train': 1.346001386642456} 11/07/2021 04:05:38 - INFO - __main__ - Step 48645: {'lr': 0.0003871492367089223, 'samples': 9339840, 'steps': 48644, 'loss/train': 1.7608494758605957} 11/07/2021 04:05:38 - INFO - __main__ - Step 48646: {'lr': 0.000387144799778322, 'samples': 9340032, 'steps': 48645, 'loss/train': 1.3227531909942627} 11/07/2021 04:05:39 - INFO - __main__ - Step 48647: {'lr': 0.00038714036278592636, 'samples': 9340224, 'steps': 48646, 'loss/train': 1.2336411476135254} 11/07/2021 04:05:40 - INFO - __main__ - Step 48648: {'lr': 0.0003871359257317374, 'samples': 9340416, 'steps': 48647, 'loss/train': 1.5839163064956665} 11/07/2021 04:05:40 - INFO - __main__ - Step 48649: {'lr': 0.0003871314886157571, 'samples': 9340608, 'steps': 48648, 'loss/train': 1.714418649673462} 11/07/2021 04:05:40 - INFO - __main__ - Step 48650: {'lr': 0.0003871270514379874, 'samples': 9340800, 'steps': 48649, 'loss/train': 0.7539232969284058} 11/07/2021 04:05:41 - INFO - __main__ - Step 48651: {'lr': 0.00038712261419843056, 'samples': 9340992, 'steps': 48650, 'loss/train': 1.4385876655578613} 11/07/2021 04:05:41 - INFO - __main__ - Step 48652: {'lr': 0.00038711817689708817, 'samples': 9341184, 'steps': 48651, 'loss/train': 1.1543158292770386} 11/07/2021 04:05:42 - INFO - __main__ - Step 48653: {'lr': 0.00038711373953396257, 'samples': 9341376, 'steps': 48652, 'loss/train': 1.554872751235962} 11/07/2021 04:05:43 - INFO - __main__ - Step 48654: {'lr': 0.0003871093021090556, 'samples': 9341568, 'steps': 48653, 'loss/train': 0.905467689037323} 11/07/2021 04:05:43 - INFO - __main__ - Step 48655: {'lr': 0.0003871048646223693, 'samples': 9341760, 'steps': 48654, 'loss/train': 1.8039259910583496} 11/07/2021 04:05:43 - INFO - __main__ - Step 48656: {'lr': 0.00038710042707390557, 'samples': 9341952, 'steps': 48655, 'loss/train': 1.3409221172332764} 11/07/2021 04:05:44 - INFO - __main__ - Step 48657: {'lr': 0.00038709598946366666, 'samples': 9342144, 'steps': 48656, 'loss/train': 0.9013664722442627} 11/07/2021 04:05:44 - INFO - __main__ - Step 48658: {'lr': 0.00038709155179165436, 'samples': 9342336, 'steps': 48657, 'loss/train': 1.5325487852096558} 11/07/2021 04:05:45 - INFO - __main__ - Step 48659: {'lr': 0.00038708711405787067, 'samples': 9342528, 'steps': 48658, 'loss/train': 1.698494553565979} 11/07/2021 04:05:45 - INFO - __main__ - Step 48660: {'lr': 0.0003870826762623177, 'samples': 9342720, 'steps': 48659, 'loss/train': 1.3557522296905518} 11/07/2021 04:05:46 - INFO - __main__ - Step 48661: {'lr': 0.00038707823840499736, 'samples': 9342912, 'steps': 48660, 'loss/train': 1.452558159828186} 11/07/2021 04:05:46 - INFO - __main__ - Step 48662: {'lr': 0.0003870738004859117, 'samples': 9343104, 'steps': 48661, 'loss/train': 1.7299939393997192} 11/07/2021 04:05:47 - INFO - __main__ - Step 48663: {'lr': 0.0003870693625050626, 'samples': 9343296, 'steps': 48662, 'loss/train': 1.2825325727462769} 11/07/2021 04:05:47 - INFO - __main__ - Step 48664: {'lr': 0.00038706492446245234, 'samples': 9343488, 'steps': 48663, 'loss/train': 1.4849152565002441} 11/07/2021 04:05:48 - INFO - __main__ - Step 48665: {'lr': 0.00038706048635808266, 'samples': 9343680, 'steps': 48664, 'loss/train': 1.9088115692138672} 11/07/2021 04:05:48 - INFO - __main__ - Step 48666: {'lr': 0.0003870560481919556, 'samples': 9343872, 'steps': 48665, 'loss/train': 1.2525205612182617} 11/07/2021 04:05:49 - INFO - __main__ - Step 48667: {'lr': 0.00038705160996407325, 'samples': 9344064, 'steps': 48666, 'loss/train': 1.3481857776641846} 11/07/2021 04:05:49 - INFO - __main__ - Step 48668: {'lr': 0.00038704717167443753, 'samples': 9344256, 'steps': 48667, 'loss/train': 1.5626063346862793} 11/07/2021 04:05:49 - INFO - __main__ - Step 48669: {'lr': 0.0003870427333230505, 'samples': 9344448, 'steps': 48668, 'loss/train': 1.2384713888168335} 11/07/2021 04:05:50 - INFO - __main__ - Step 48670: {'lr': 0.00038703829490991407, 'samples': 9344640, 'steps': 48669, 'loss/train': 1.6280044317245483} 11/07/2021 04:05:51 - INFO - __main__ - Step 48671: {'lr': 0.0003870338564350303, 'samples': 9344832, 'steps': 48670, 'loss/train': 1.4496784210205078} 11/07/2021 04:05:51 - INFO - __main__ - Step 48672: {'lr': 0.0003870294178984013, 'samples': 9345024, 'steps': 48671, 'loss/train': 1.2935545444488525} 11/07/2021 04:05:51 - INFO - __main__ - Step 48673: {'lr': 0.0003870249793000289, 'samples': 9345216, 'steps': 48672, 'loss/train': 1.9649289846420288} 11/07/2021 04:05:52 - INFO - __main__ - Step 48674: {'lr': 0.0003870205406399151, 'samples': 9345408, 'steps': 48673, 'loss/train': 1.470001459121704} 11/07/2021 04:05:53 - INFO - __main__ - Step 48675: {'lr': 0.000387016101918062, 'samples': 9345600, 'steps': 48674, 'loss/train': 1.6249600648880005} 11/07/2021 04:05:53 - INFO - __main__ - Step 48676: {'lr': 0.0003870116631344716, 'samples': 9345792, 'steps': 48675, 'loss/train': 1.3065425157546997} 11/07/2021 04:05:53 - INFO - __main__ - Step 48677: {'lr': 0.0003870072242891458, 'samples': 9345984, 'steps': 48676, 'loss/train': 2.1314406394958496} 11/07/2021 04:05:54 - INFO - __main__ - Step 48678: {'lr': 0.0003870027853820867, 'samples': 9346176, 'steps': 48677, 'loss/train': 1.4698635339736938} 11/07/2021 04:05:54 - INFO - __main__ - Step 48679: {'lr': 0.0003869983464132962, 'samples': 9346368, 'steps': 48678, 'loss/train': 1.4177656173706055} 11/07/2021 04:05:55 - INFO - __main__ - Step 48680: {'lr': 0.0003869939073827764, 'samples': 9346560, 'steps': 48679, 'loss/train': 1.283326506614685} 11/07/2021 04:05:55 - INFO - __main__ - Step 48681: {'lr': 0.00038698946829052926, 'samples': 9346752, 'steps': 48680, 'loss/train': 0.9898263812065125} 11/07/2021 04:05:56 - INFO - __main__ - Step 48682: {'lr': 0.00038698502913655673, 'samples': 9346944, 'steps': 48681, 'loss/train': 1.7014774084091187} 11/07/2021 04:05:56 - INFO - __main__ - Step 48683: {'lr': 0.00038698058992086095, 'samples': 9347136, 'steps': 48682, 'loss/train': 1.427775263786316} 11/07/2021 04:05:57 - INFO - __main__ - Step 48684: {'lr': 0.0003869761506434438, 'samples': 9347328, 'steps': 48683, 'loss/train': 1.2640846967697144} 11/07/2021 04:05:58 - INFO - __main__ - Step 48685: {'lr': 0.0003869717113043073, 'samples': 9347520, 'steps': 48684, 'loss/train': 1.5891711711883545} 11/07/2021 04:05:58 - INFO - __main__ - Step 48686: {'lr': 0.00038696727190345347, 'samples': 9347712, 'steps': 48685, 'loss/train': 1.3345023393630981} 11/07/2021 04:05:58 - INFO - __main__ - Step 48687: {'lr': 0.00038696283244088426, 'samples': 9347904, 'steps': 48686, 'loss/train': 1.5201863050460815} 11/07/2021 04:05:59 - INFO - __main__ - Step 48688: {'lr': 0.0003869583929166017, 'samples': 9348096, 'steps': 48687, 'loss/train': 1.3590421676635742} 11/07/2021 04:05:59 - INFO - __main__ - Step 48689: {'lr': 0.0003869539533306079, 'samples': 9348288, 'steps': 48688, 'loss/train': 1.1935977935791016} 11/07/2021 04:06:00 - INFO - __main__ - Step 48690: {'lr': 0.00038694951368290463, 'samples': 9348480, 'steps': 48689, 'loss/train': 1.6029000282287598} 11/07/2021 04:06:00 - INFO - __main__ - Step 48691: {'lr': 0.0003869450739734941, 'samples': 9348672, 'steps': 48690, 'loss/train': 1.3300920724868774} 11/07/2021 04:06:01 - INFO - __main__ - Step 48692: {'lr': 0.00038694063420237823, 'samples': 9348864, 'steps': 48691, 'loss/train': 0.9210855960845947} 11/07/2021 04:06:01 - INFO - __main__ - Step 48693: {'lr': 0.00038693619436955907, 'samples': 9349056, 'steps': 48692, 'loss/train': 1.2502361536026} 11/07/2021 04:06:01 - INFO - __main__ - Step 48694: {'lr': 0.0003869317544750385, 'samples': 9349248, 'steps': 48693, 'loss/train': 1.711861252784729} 11/07/2021 04:06:02 - INFO - __main__ - Step 48695: {'lr': 0.0003869273145188186, 'samples': 9349440, 'steps': 48694, 'loss/train': 1.7460613250732422} 11/07/2021 04:06:03 - INFO - __main__ - Step 48696: {'lr': 0.00038692287450090143, 'samples': 9349632, 'steps': 48695, 'loss/train': 1.3120661973953247} 11/07/2021 04:06:03 - INFO - __main__ - Step 48697: {'lr': 0.0003869184344212888, 'samples': 9349824, 'steps': 48696, 'loss/train': 1.6759048700332642} 11/07/2021 04:06:03 - INFO - __main__ - Step 48698: {'lr': 0.00038691399427998296, 'samples': 9350016, 'steps': 48697, 'loss/train': 1.2368032932281494} 11/07/2021 04:06:04 - INFO - __main__ - Step 48699: {'lr': 0.0003869095540769858, 'samples': 9350208, 'steps': 48698, 'loss/train': 1.4147051572799683} 11/07/2021 04:06:04 - INFO - __main__ - Step 48700: {'lr': 0.0003869051138122992, 'samples': 9350400, 'steps': 48699, 'loss/train': 1.457667350769043} 11/07/2021 04:06:05 - INFO - __main__ - Step 48701: {'lr': 0.0003869006734859253, 'samples': 9350592, 'steps': 48700, 'loss/train': 1.5897791385650635} 11/07/2021 04:06:06 - INFO - __main__ - Step 48702: {'lr': 0.00038689623309786617, 'samples': 9350784, 'steps': 48701, 'loss/train': 1.6684839725494385} 11/07/2021 04:06:06 - INFO - __main__ - Step 48703: {'lr': 0.00038689179264812356, 'samples': 9350976, 'steps': 48702, 'loss/train': 1.1511362791061401} 11/07/2021 04:06:06 - INFO - __main__ - Step 48704: {'lr': 0.00038688735213669967, 'samples': 9351168, 'steps': 48703, 'loss/train': 1.2333346605300903} 11/07/2021 04:06:07 - INFO - __main__ - Step 48705: {'lr': 0.00038688291156359654, 'samples': 9351360, 'steps': 48704, 'loss/train': 1.3094733953475952} 11/07/2021 04:06:08 - INFO - __main__ - Step 48706: {'lr': 0.000386878470928816, 'samples': 9351552, 'steps': 48705, 'loss/train': 1.6706844568252563} 11/07/2021 04:06:09 - INFO - __main__ - Step 48707: {'lr': 0.0003868740302323601, 'samples': 9351744, 'steps': 48706, 'loss/train': 1.051388144493103} 11/07/2021 04:06:09 - INFO - __main__ - Step 48708: {'lr': 0.00038686958947423096, 'samples': 9351936, 'steps': 48707, 'loss/train': 1.7953312397003174} 11/07/2021 04:06:10 - INFO - __main__ - Step 48709: {'lr': 0.00038686514865443047, 'samples': 9352128, 'steps': 48708, 'loss/train': 1.4169048070907593} 11/07/2021 04:06:10 - INFO - __main__ - Step 48710: {'lr': 0.00038686070777296057, 'samples': 9352320, 'steps': 48709, 'loss/train': 1.9127233028411865} 11/07/2021 04:06:10 - INFO - __main__ - Step 48711: {'lr': 0.00038685626682982347, 'samples': 9352512, 'steps': 48710, 'loss/train': 1.7541173696517944} 11/07/2021 04:06:11 - INFO - __main__ - Step 48712: {'lr': 0.000386851825825021, 'samples': 9352704, 'steps': 48711, 'loss/train': 1.7800002098083496} 11/07/2021 04:06:12 - INFO - __main__ - Step 48713: {'lr': 0.0003868473847585552, 'samples': 9352896, 'steps': 48712, 'loss/train': 0.9547432065010071} 11/07/2021 04:06:12 - INFO - __main__ - Step 48714: {'lr': 0.00038684294363042806, 'samples': 9353088, 'steps': 48713, 'loss/train': 1.3911314010620117} 11/07/2021 04:06:12 - INFO - __main__ - Step 48715: {'lr': 0.00038683850244064164, 'samples': 9353280, 'steps': 48714, 'loss/train': 1.717429280281067} 11/07/2021 04:06:13 - INFO - __main__ - Step 48716: {'lr': 0.0003868340611891978, 'samples': 9353472, 'steps': 48715, 'loss/train': 0.7260299324989319} 11/07/2021 04:06:13 - INFO - __main__ - Step 48717: {'lr': 0.0003868296198760988, 'samples': 9353664, 'steps': 48716, 'loss/train': 1.784231185913086} 11/07/2021 04:06:14 - INFO - __main__ - Step 48718: {'lr': 0.00038682517850134634, 'samples': 9353856, 'steps': 48717, 'loss/train': 1.1612917184829712} 11/07/2021 04:06:14 - INFO - __main__ - Step 48719: {'lr': 0.0003868207370649427, 'samples': 9354048, 'steps': 48718, 'loss/train': 1.3193871974945068} 11/07/2021 04:06:15 - INFO - __main__ - Step 48720: {'lr': 0.0003868162955668897, 'samples': 9354240, 'steps': 48719, 'loss/train': 1.63193678855896} 11/07/2021 04:06:15 - INFO - __main__ - Step 48721: {'lr': 0.0003868118540071894, 'samples': 9354432, 'steps': 48720, 'loss/train': 1.3199032545089722} 11/07/2021 04:06:16 - INFO - __main__ - Step 48722: {'lr': 0.0003868074123858437, 'samples': 9354624, 'steps': 48721, 'loss/train': 1.2110531330108643} 11/07/2021 04:06:17 - INFO - __main__ - Step 48723: {'lr': 0.0003868029707028548, 'samples': 9354816, 'steps': 48722, 'loss/train': 1.7190054655075073} 11/07/2021 04:06:17 - INFO - __main__ - Step 48724: {'lr': 0.00038679852895822454, 'samples': 9355008, 'steps': 48723, 'loss/train': 1.5598641633987427} 11/07/2021 04:06:17 - INFO - __main__ - Step 48725: {'lr': 0.000386794087151955, 'samples': 9355200, 'steps': 48724, 'loss/train': 1.8229436874389648} 11/07/2021 04:06:18 - INFO - __main__ - Step 48726: {'lr': 0.00038678964528404816, 'samples': 9355392, 'steps': 48725, 'loss/train': 1.6857901811599731} 11/07/2021 04:06:18 - INFO - __main__ - Step 48727: {'lr': 0.000386785203354506, 'samples': 9355584, 'steps': 48726, 'loss/train': 1.457085132598877} 11/07/2021 04:06:18 - INFO - __main__ - Step 48728: {'lr': 0.0003867807613633305, 'samples': 9355776, 'steps': 48727, 'loss/train': 1.4536830186843872} 11/07/2021 04:06:19 - INFO - __main__ - Step 48729: {'lr': 0.0003867763193105237, 'samples': 9355968, 'steps': 48728, 'loss/train': 1.7361259460449219} 11/07/2021 04:06:20 - INFO - __main__ - Step 48730: {'lr': 0.00038677187719608763, 'samples': 9356160, 'steps': 48729, 'loss/train': 1.7625551223754883} 11/07/2021 04:06:20 - INFO - __main__ - Step 48731: {'lr': 0.00038676743502002434, 'samples': 9356352, 'steps': 48730, 'loss/train': 1.319649338722229} 11/07/2021 04:06:20 - INFO - __main__ - Step 48732: {'lr': 0.0003867629927823357, 'samples': 9356544, 'steps': 48731, 'loss/train': 1.083548665046692} 11/07/2021 04:06:21 - INFO - __main__ - Step 48733: {'lr': 0.0003867585504830237, 'samples': 9356736, 'steps': 48732, 'loss/train': 1.518478512763977} 11/07/2021 04:06:22 - INFO - __main__ - Step 48734: {'lr': 0.00038675410812209044, 'samples': 9356928, 'steps': 48733, 'loss/train': 1.7668431997299194} 11/07/2021 04:06:22 - INFO - __main__ - Step 48735: {'lr': 0.0003867496656995379, 'samples': 9357120, 'steps': 48734, 'loss/train': 1.438031554222107} 11/07/2021 04:06:22 - INFO - __main__ - Step 48736: {'lr': 0.0003867452232153681, 'samples': 9357312, 'steps': 48735, 'loss/train': 1.722999095916748} 11/07/2021 04:06:23 - INFO - __main__ - Step 48737: {'lr': 0.00038674078066958296, 'samples': 9357504, 'steps': 48736, 'loss/train': 1.2756949663162231} 11/07/2021 04:06:23 - INFO - __main__ - Step 48738: {'lr': 0.0003867363380621846, 'samples': 9357696, 'steps': 48737, 'loss/train': 0.6910433173179626} 11/07/2021 04:06:24 - INFO - __main__ - Step 48739: {'lr': 0.0003867318953931749, 'samples': 9357888, 'steps': 48738, 'loss/train': 1.8019530773162842} 11/07/2021 04:06:25 - INFO - __main__ - Step 48740: {'lr': 0.00038672745266255594, 'samples': 9358080, 'steps': 48739, 'loss/train': 1.5454462766647339} 11/07/2021 04:06:25 - INFO - __main__ - Step 48741: {'lr': 0.0003867230098703297, 'samples': 9358272, 'steps': 48740, 'loss/train': 1.5047858953475952} 11/07/2021 04:06:25 - INFO - __main__ - Step 48742: {'lr': 0.00038671856701649813, 'samples': 9358464, 'steps': 48741, 'loss/train': 1.7610464096069336} 11/07/2021 04:06:26 - INFO - __main__ - Step 48743: {'lr': 0.0003867141241010633, 'samples': 9358656, 'steps': 48742, 'loss/train': 1.0106375217437744} 11/07/2021 04:06:27 - INFO - __main__ - Step 48744: {'lr': 0.00038670968112402724, 'samples': 9358848, 'steps': 48743, 'loss/train': 2.2555599212646484} 11/07/2021 04:06:27 - INFO - __main__ - Step 48745: {'lr': 0.00038670523808539194, 'samples': 9359040, 'steps': 48744, 'loss/train': 1.3200651407241821} 11/07/2021 04:06:27 - INFO - __main__ - Step 48746: {'lr': 0.0003867007949851593, 'samples': 9359232, 'steps': 48745, 'loss/train': 1.2698509693145752} 11/07/2021 04:06:28 - INFO - __main__ - Step 48747: {'lr': 0.0003866963518233314, 'samples': 9359424, 'steps': 48746, 'loss/train': 1.5615946054458618} 11/07/2021 04:06:28 - INFO - __main__ - Step 48748: {'lr': 0.00038669190859991025, 'samples': 9359616, 'steps': 48747, 'loss/train': 1.2709089517593384} 11/07/2021 04:06:29 - INFO - __main__ - Step 48749: {'lr': 0.00038668746531489787, 'samples': 9359808, 'steps': 48748, 'loss/train': 1.6371272802352905} 11/07/2021 04:06:29 - INFO - __main__ - Step 48750: {'lr': 0.0003866830219682962, 'samples': 9360000, 'steps': 48749, 'loss/train': 1.6976897716522217} 11/07/2021 04:06:30 - INFO - __main__ - Step 48751: {'lr': 0.00038667857856010727, 'samples': 9360192, 'steps': 48750, 'loss/train': 1.5372999906539917} 11/07/2021 04:06:30 - INFO - __main__ - Step 48752: {'lr': 0.00038667413509033306, 'samples': 9360384, 'steps': 48751, 'loss/train': 1.2858636379241943} 11/07/2021 04:06:30 - INFO - __main__ - Step 48753: {'lr': 0.0003866696915589756, 'samples': 9360576, 'steps': 48752, 'loss/train': 1.4410368204116821} 11/07/2021 04:06:32 - INFO - __main__ - Step 48754: {'lr': 0.0003866652479660369, 'samples': 9360768, 'steps': 48753, 'loss/train': 1.517182469367981} 11/07/2021 04:06:32 - INFO - __main__ - Step 48755: {'lr': 0.00038666080431151896, 'samples': 9360960, 'steps': 48754, 'loss/train': 1.2897025346755981} 11/07/2021 04:06:32 - INFO - __main__ - Step 48756: {'lr': 0.00038665636059542367, 'samples': 9361152, 'steps': 48755, 'loss/train': 1.6465325355529785} 11/07/2021 04:06:33 - INFO - __main__ - Step 48757: {'lr': 0.00038665191681775323, 'samples': 9361344, 'steps': 48756, 'loss/train': 1.4895535707473755} 11/07/2021 04:06:33 - INFO - __main__ - Step 48758: {'lr': 0.00038664747297850955, 'samples': 9361536, 'steps': 48757, 'loss/train': 1.6872446537017822} 11/07/2021 04:06:34 - INFO - __main__ - Step 48759: {'lr': 0.00038664302907769456, 'samples': 9361728, 'steps': 48758, 'loss/train': 1.286729335784912} 11/07/2021 04:06:34 - INFO - __main__ - Step 48760: {'lr': 0.00038663858511531034, 'samples': 9361920, 'steps': 48759, 'loss/train': 1.6060450077056885} 11/07/2021 04:06:35 - INFO - __main__ - Step 48761: {'lr': 0.000386634141091359, 'samples': 9362112, 'steps': 48760, 'loss/train': 1.1402372121810913} 11/07/2021 04:06:35 - INFO - __main__ - Step 48762: {'lr': 0.0003866296970058423, 'samples': 9362304, 'steps': 48761, 'loss/train': 1.502384901046753} 11/07/2021 04:06:35 - INFO - __main__ - Step 48763: {'lr': 0.0003866252528587624, 'samples': 9362496, 'steps': 48762, 'loss/train': 1.5161304473876953} 11/07/2021 04:06:37 - INFO - __main__ - Step 48764: {'lr': 0.00038662080865012127, 'samples': 9362688, 'steps': 48763, 'loss/train': 1.6916911602020264} 11/07/2021 04:06:37 - INFO - __main__ - Step 48765: {'lr': 0.00038661636437992093, 'samples': 9362880, 'steps': 48764, 'loss/train': 1.4508171081542969} 11/07/2021 04:06:37 - INFO - __main__ - Step 48766: {'lr': 0.0003866119200481634, 'samples': 9363072, 'steps': 48765, 'loss/train': 1.3959133625030518} 11/07/2021 04:06:38 - INFO - __main__ - Step 48767: {'lr': 0.00038660747565485054, 'samples': 9363264, 'steps': 48766, 'loss/train': 1.5454134941101074} 11/07/2021 04:06:38 - INFO - __main__ - Step 48768: {'lr': 0.0003866030311999845, 'samples': 9363456, 'steps': 48767, 'loss/train': 1.689577579498291} 11/07/2021 04:06:39 - INFO - __main__ - Step 48769: {'lr': 0.0003865985866835673, 'samples': 9363648, 'steps': 48768, 'loss/train': 1.476892113685608} 11/07/2021 04:06:39 - INFO - __main__ - Step 48770: {'lr': 0.00038659414210560087, 'samples': 9363840, 'steps': 48769, 'loss/train': 1.6476308107376099} 11/07/2021 04:06:40 - INFO - __main__ - Step 48771: {'lr': 0.00038658969746608717, 'samples': 9364032, 'steps': 48770, 'loss/train': 1.1548690795898438} 11/07/2021 04:06:40 - INFO - __main__ - Step 48772: {'lr': 0.0003865852527650283, 'samples': 9364224, 'steps': 48771, 'loss/train': 0.3828035891056061} 11/07/2021 04:06:40 - INFO - __main__ - Step 48773: {'lr': 0.0003865808080024262, 'samples': 9364416, 'steps': 48772, 'loss/train': 1.2640275955200195} 11/07/2021 04:06:41 - INFO - __main__ - Step 48774: {'lr': 0.00038657636317828293, 'samples': 9364608, 'steps': 48773, 'loss/train': 1.4242490530014038} 11/07/2021 04:06:42 - INFO - __main__ - Step 48775: {'lr': 0.00038657191829260043, 'samples': 9364800, 'steps': 48774, 'loss/train': 1.7121540307998657} 11/07/2021 04:06:42 - INFO - __main__ - Step 48776: {'lr': 0.00038656747334538073, 'samples': 9364992, 'steps': 48775, 'loss/train': 1.2917346954345703} 11/07/2021 04:06:42 - INFO - __main__ - Step 48777: {'lr': 0.00038656302833662583, 'samples': 9365184, 'steps': 48776, 'loss/train': 2.1382129192352295} 11/07/2021 04:06:43 - INFO - __main__ - Step 48778: {'lr': 0.00038655858326633774, 'samples': 9365376, 'steps': 48777, 'loss/train': 1.293482780456543} 11/07/2021 04:06:43 - INFO - __main__ - Step 48779: {'lr': 0.0003865541381345185, 'samples': 9365568, 'steps': 48778, 'loss/train': 0.16194438934326172} 11/07/2021 04:06:45 - INFO - __main__ - Step 48780: {'lr': 0.00038654969294117, 'samples': 9365760, 'steps': 48779, 'loss/train': 0.9568668007850647} 11/07/2021 04:06:45 - INFO - __main__ - Step 48781: {'lr': 0.0003865452476862944, 'samples': 9365952, 'steps': 48780, 'loss/train': 1.5685209035873413} 11/07/2021 04:06:45 - INFO - __main__ - Step 48782: {'lr': 0.0003865408023698935, 'samples': 9366144, 'steps': 48781, 'loss/train': 1.5595369338989258} 11/07/2021 04:06:46 - INFO - __main__ - Step 48783: {'lr': 0.00038653635699196956, 'samples': 9366336, 'steps': 48782, 'loss/train': 1.7438932657241821} 11/07/2021 04:06:46 - INFO - __main__ - Step 48784: {'lr': 0.0003865319115525244, 'samples': 9366528, 'steps': 48783, 'loss/train': 1.029396414756775} 11/07/2021 04:06:46 - INFO - __main__ - Step 48785: {'lr': 0.00038652746605156, 'samples': 9366720, 'steps': 48784, 'loss/train': 1.8966786861419678} 11/07/2021 04:06:47 - INFO - __main__ - Step 48786: {'lr': 0.0003865230204890785, 'samples': 9366912, 'steps': 48785, 'loss/train': 1.8196535110473633} 11/07/2021 04:06:48 - INFO - __main__ - Step 48787: {'lr': 0.0003865185748650818, 'samples': 9367104, 'steps': 48786, 'loss/train': 1.0815938711166382} 11/07/2021 04:06:48 - INFO - __main__ - Step 48788: {'lr': 0.00038651412917957195, 'samples': 9367296, 'steps': 48787, 'loss/train': 1.6344361305236816} 11/07/2021 04:06:48 - INFO - __main__ - Step 48789: {'lr': 0.000386509683432551, 'samples': 9367488, 'steps': 48788, 'loss/train': 1.7716069221496582} 11/07/2021 04:06:49 - INFO - __main__ - Step 48790: {'lr': 0.0003865052376240208, 'samples': 9367680, 'steps': 48789, 'loss/train': 1.2671294212341309} 11/07/2021 04:06:50 - INFO - __main__ - Step 48791: {'lr': 0.00038650079175398346, 'samples': 9367872, 'steps': 48790, 'loss/train': 1.7354562282562256} 11/07/2021 04:06:50 - INFO - __main__ - Step 48792: {'lr': 0.00038649634582244095, 'samples': 9368064, 'steps': 48791, 'loss/train': 1.3500326871871948} 11/07/2021 04:06:50 - INFO - __main__ - Step 48793: {'lr': 0.0003864918998293954, 'samples': 9368256, 'steps': 48792, 'loss/train': 1.6196799278259277} 11/07/2021 04:06:51 - INFO - __main__ - Step 48794: {'lr': 0.0003864874537748486, 'samples': 9368448, 'steps': 48793, 'loss/train': 1.5681240558624268} 11/07/2021 04:06:51 - INFO - __main__ - Step 48795: {'lr': 0.00038648300765880276, 'samples': 9368640, 'steps': 48794, 'loss/train': 1.248744249343872} 11/07/2021 04:06:52 - INFO - __main__ - Step 48796: {'lr': 0.0003864785614812597, 'samples': 9368832, 'steps': 48795, 'loss/train': 1.4846022129058838} 11/07/2021 04:06:53 - INFO - __main__ - Step 48797: {'lr': 0.00038647411524222146, 'samples': 9369024, 'steps': 48796, 'loss/train': 1.6658082008361816} 11/07/2021 04:06:53 - INFO - __main__ - Step 48798: {'lr': 0.00038646966894169014, 'samples': 9369216, 'steps': 48797, 'loss/train': 1.4880821704864502} 11/07/2021 04:06:53 - INFO - __main__ - Step 48799: {'lr': 0.00038646522257966776, 'samples': 9369408, 'steps': 48798, 'loss/train': 1.0046395063400269} 11/07/2021 04:06:54 - INFO - __main__ - Step 48800: {'lr': 0.0003864607761561562, 'samples': 9369600, 'steps': 48799, 'loss/train': 1.652105689048767} 11/07/2021 04:06:56 - INFO - __main__ - Step 48801: {'lr': 0.00038645632967115753, 'samples': 9369792, 'steps': 48800, 'loss/train': 1.5499067306518555} 11/07/2021 04:06:56 - INFO - __main__ - Step 48802: {'lr': 0.0003864518831246737, 'samples': 9369984, 'steps': 48801, 'loss/train': 1.5321955680847168} 11/07/2021 04:06:56 - INFO - __main__ - Step 48803: {'lr': 0.00038644743651670684, 'samples': 9370176, 'steps': 48802, 'loss/train': 1.5658891201019287} 11/07/2021 04:06:57 - INFO - __main__ - Step 48804: {'lr': 0.00038644298984725876, 'samples': 9370368, 'steps': 48803, 'loss/train': 1.4779611825942993} 11/07/2021 04:06:57 - INFO - __main__ - Step 48805: {'lr': 0.00038643854311633166, 'samples': 9370560, 'steps': 48804, 'loss/train': 1.4240643978118896} 11/07/2021 04:06:57 - INFO - __main__ - Step 48806: {'lr': 0.0003864340963239275, 'samples': 9370752, 'steps': 48805, 'loss/train': 1.3121813535690308} 11/07/2021 04:06:58 - INFO - __main__ - Step 48807: {'lr': 0.00038642964947004815, 'samples': 9370944, 'steps': 48806, 'loss/train': 0.7261683344841003} 11/07/2021 04:06:59 - INFO - __main__ - Step 48808: {'lr': 0.0003864252025546957, 'samples': 9371136, 'steps': 48807, 'loss/train': 0.8198906183242798} 11/07/2021 04:06:59 - INFO - __main__ - Step 48809: {'lr': 0.00038642075557787225, 'samples': 9371328, 'steps': 48808, 'loss/train': 1.670855164527893} 11/07/2021 04:07:00 - INFO - __main__ - Step 48810: {'lr': 0.0003864163085395797, 'samples': 9371520, 'steps': 48809, 'loss/train': 1.4442169666290283} 11/07/2021 04:07:00 - INFO - __main__ - Step 48811: {'lr': 0.00038641186143982, 'samples': 9371712, 'steps': 48810, 'loss/train': 1.5569305419921875} 11/07/2021 04:07:00 - INFO - __main__ - Step 48812: {'lr': 0.0003864074142785952, 'samples': 9371904, 'steps': 48811, 'loss/train': 1.8886054754257202} 11/07/2021 04:07:01 - INFO - __main__ - Step 48813: {'lr': 0.0003864029670559074, 'samples': 9372096, 'steps': 48812, 'loss/train': 0.931264340877533} 11/07/2021 04:07:02 - INFO - __main__ - Step 48814: {'lr': 0.0003863985197717585, 'samples': 9372288, 'steps': 48813, 'loss/train': 0.7957919239997864} 11/07/2021 04:07:02 - INFO - __main__ - Step 48815: {'lr': 0.0003863940724261505, 'samples': 9372480, 'steps': 48814, 'loss/train': 1.1304203271865845} 11/07/2021 04:07:02 - INFO - __main__ - Step 48816: {'lr': 0.0003863896250190855, 'samples': 9372672, 'steps': 48815, 'loss/train': 1.3601429462432861} 11/07/2021 04:07:03 - INFO - __main__ - Step 48817: {'lr': 0.00038638517755056534, 'samples': 9372864, 'steps': 48816, 'loss/train': 1.6283661127090454} 11/07/2021 04:07:03 - INFO - __main__ - Step 48818: {'lr': 0.00038638073002059223, 'samples': 9373056, 'steps': 48817, 'loss/train': 1.3443467617034912} 11/07/2021 04:07:04 - INFO - __main__ - Step 48819: {'lr': 0.000386376282429168, 'samples': 9373248, 'steps': 48818, 'loss/train': 0.7243291139602661} 11/07/2021 04:07:04 - INFO - __main__ - Step 48820: {'lr': 0.0003863718347762948, 'samples': 9373440, 'steps': 48819, 'loss/train': 1.125797152519226} 11/07/2021 04:07:05 - INFO - __main__ - Step 48821: {'lr': 0.0003863673870619744, 'samples': 9373632, 'steps': 48820, 'loss/train': 1.2432880401611328} 11/07/2021 04:07:05 - INFO - __main__ - Step 48822: {'lr': 0.00038636293928620915, 'samples': 9373824, 'steps': 48821, 'loss/train': 1.2100565433502197} 11/07/2021 04:07:05 - INFO - __main__ - Step 48823: {'lr': 0.0003863584914490007, 'samples': 9374016, 'steps': 48822, 'loss/train': 1.6470533609390259} 11/07/2021 04:07:07 - INFO - __main__ - Step 48824: {'lr': 0.0003863540435503513, 'samples': 9374208, 'steps': 48823, 'loss/train': 0.5824220180511475} 11/07/2021 04:07:07 - INFO - __main__ - Step 48825: {'lr': 0.0003863495955902629, 'samples': 9374400, 'steps': 48824, 'loss/train': 1.147403597831726} 11/07/2021 04:07:08 - INFO - __main__ - Step 48826: {'lr': 0.00038634514756873746, 'samples': 9374592, 'steps': 48825, 'loss/train': 1.6314716339111328} 11/07/2021 04:07:08 - INFO - __main__ - Step 48827: {'lr': 0.000386340699485777, 'samples': 9374784, 'steps': 48826, 'loss/train': 0.47161123156547546} 11/07/2021 04:07:08 - INFO - __main__ - Step 48828: {'lr': 0.0003863362513413835, 'samples': 9374976, 'steps': 48827, 'loss/train': 1.3701505661010742} 11/07/2021 04:07:09 - INFO - __main__ - Step 48829: {'lr': 0.00038633180313555894, 'samples': 9375168, 'steps': 48828, 'loss/train': 0.1184760332107544} 11/07/2021 04:07:10 - INFO - __main__ - Step 48830: {'lr': 0.0003863273548683054, 'samples': 9375360, 'steps': 48829, 'loss/train': 1.616905927658081} 11/07/2021 04:07:10 - INFO - __main__ - Step 48831: {'lr': 0.0003863229065396249, 'samples': 9375552, 'steps': 48830, 'loss/train': 1.8326122760772705} 11/07/2021 04:07:10 - INFO - __main__ - Step 48832: {'lr': 0.0003863184581495194, 'samples': 9375744, 'steps': 48831, 'loss/train': 1.295698642730713} 11/07/2021 04:07:11 - INFO - __main__ - Step 48833: {'lr': 0.0003863140096979909, 'samples': 9375936, 'steps': 48832, 'loss/train': 1.1749823093414307} 11/07/2021 04:07:12 - INFO - __main__ - Step 48834: {'lr': 0.00038630956118504146, 'samples': 9376128, 'steps': 48833, 'loss/train': 1.8343381881713867} 11/07/2021 04:07:12 - INFO - __main__ - Step 48835: {'lr': 0.00038630511261067294, 'samples': 9376320, 'steps': 48834, 'loss/train': 1.4257701635360718} 11/07/2021 04:07:13 - INFO - __main__ - Step 48836: {'lr': 0.0003863006639748875, 'samples': 9376512, 'steps': 48835, 'loss/train': 1.5344891548156738} 11/07/2021 04:07:13 - INFO - __main__ - Step 48837: {'lr': 0.000386296215277687, 'samples': 9376704, 'steps': 48836, 'loss/train': 0.5804466605186462} 11/07/2021 04:07:13 - INFO - __main__ - Step 48838: {'lr': 0.0003862917665190736, 'samples': 9376896, 'steps': 48837, 'loss/train': 1.2600109577178955} 11/07/2021 04:07:14 - INFO - __main__ - Step 48839: {'lr': 0.0003862873176990492, 'samples': 9377088, 'steps': 48838, 'loss/train': 1.8333112001419067} 11/07/2021 04:07:15 - INFO - __main__ - Step 48840: {'lr': 0.00038628286881761594, 'samples': 9377280, 'steps': 48839, 'loss/train': 1.3519837856292725} 11/07/2021 04:07:15 - INFO - __main__ - Step 48841: {'lr': 0.0003862784198747756, 'samples': 9377472, 'steps': 48840, 'loss/train': 0.9140884280204773} 11/07/2021 04:07:15 - INFO - __main__ - Step 48842: {'lr': 0.0003862739708705304, 'samples': 9377664, 'steps': 48841, 'loss/train': 1.6649800539016724} 11/07/2021 04:07:16 - INFO - __main__ - Step 48843: {'lr': 0.0003862695218048822, 'samples': 9377856, 'steps': 48842, 'loss/train': 1.3832347393035889} 11/07/2021 04:07:16 - INFO - __main__ - Step 48844: {'lr': 0.000386265072677833, 'samples': 9378048, 'steps': 48843, 'loss/train': 1.4973256587982178} 11/07/2021 04:07:17 - INFO - __main__ - Step 48845: {'lr': 0.00038626062348938494, 'samples': 9378240, 'steps': 48844, 'loss/train': 1.8129611015319824} 11/07/2021 04:07:18 - INFO - __main__ - Step 48846: {'lr': 0.00038625617423954, 'samples': 9378432, 'steps': 48845, 'loss/train': 1.4013532400131226} 11/07/2021 04:07:18 - INFO - __main__ - Step 48847: {'lr': 0.00038625172492829995, 'samples': 9378624, 'steps': 48846, 'loss/train': 1.7761856317520142} 11/07/2021 04:07:18 - INFO - __main__ - Step 48848: {'lr': 0.00038624727555566714, 'samples': 9378816, 'steps': 48847, 'loss/train': 2.2022461891174316} 11/07/2021 04:07:19 - INFO - __main__ - Step 48849: {'lr': 0.0003862428261216433, 'samples': 9379008, 'steps': 48848, 'loss/train': 1.5194182395935059} 11/07/2021 04:07:20 - INFO - __main__ - Step 48850: {'lr': 0.00038623837662623065, 'samples': 9379200, 'steps': 48849, 'loss/train': 1.8473443984985352} 11/07/2021 04:07:20 - INFO - __main__ - Step 48851: {'lr': 0.000386233927069431, 'samples': 9379392, 'steps': 48850, 'loss/train': 1.5626877546310425} 11/07/2021 04:07:20 - INFO - __main__ - Step 48852: {'lr': 0.0003862294774512465, 'samples': 9379584, 'steps': 48851, 'loss/train': 1.4937726259231567} 11/07/2021 04:07:21 - INFO - __main__ - Step 48853: {'lr': 0.00038622502777167913, 'samples': 9379776, 'steps': 48852, 'loss/train': 1.0545097589492798} 11/07/2021 04:07:21 - INFO - __main__ - Step 48854: {'lr': 0.00038622057803073075, 'samples': 9379968, 'steps': 48853, 'loss/train': 1.2151628732681274} 11/07/2021 04:07:21 - INFO - __main__ - Step 48855: {'lr': 0.0003862161282284036, 'samples': 9380160, 'steps': 48854, 'loss/train': 1.763312816619873} 11/07/2021 04:07:22 - INFO - __main__ - Step 48856: {'lr': 0.00038621167836469945, 'samples': 9380352, 'steps': 48855, 'loss/train': 1.5793697834014893} 11/07/2021 04:07:23 - INFO - __main__ - Step 48857: {'lr': 0.0003862072284396205, 'samples': 9380544, 'steps': 48856, 'loss/train': 1.4016894102096558} 11/07/2021 04:07:23 - INFO - __main__ - Step 48858: {'lr': 0.00038620277845316867, 'samples': 9380736, 'steps': 48857, 'loss/train': 1.4486711025238037} 11/07/2021 04:07:23 - INFO - __main__ - Step 48859: {'lr': 0.00038619832840534586, 'samples': 9380928, 'steps': 48858, 'loss/train': 0.8741102814674377} 11/07/2021 04:07:24 - INFO - __main__ - Step 48860: {'lr': 0.0003861938782961544, 'samples': 9381120, 'steps': 48859, 'loss/train': 1.3810038566589355} 11/07/2021 04:07:25 - INFO - __main__ - Step 48861: {'lr': 0.0003861894281255959, 'samples': 9381312, 'steps': 48860, 'loss/train': 1.1368257999420166} 11/07/2021 04:07:25 - INFO - __main__ - Step 48862: {'lr': 0.0003861849778936726, 'samples': 9381504, 'steps': 48861, 'loss/train': 1.2794278860092163} 11/07/2021 04:07:26 - INFO - __main__ - Step 48863: {'lr': 0.00038618052760038647, 'samples': 9381696, 'steps': 48862, 'loss/train': 1.540213942527771} 11/07/2021 04:07:26 - INFO - __main__ - Step 48864: {'lr': 0.00038617607724573944, 'samples': 9381888, 'steps': 48863, 'loss/train': 1.45363187789917} 11/07/2021 04:07:26 - INFO - __main__ - Step 48865: {'lr': 0.0003861716268297336, 'samples': 9382080, 'steps': 48864, 'loss/train': 0.20641732215881348} 11/07/2021 04:07:27 - INFO - __main__ - Step 48866: {'lr': 0.000386167176352371, 'samples': 9382272, 'steps': 48865, 'loss/train': 2.019467830657959} 11/07/2021 04:07:28 - INFO - __main__ - Step 48867: {'lr': 0.00038616272581365354, 'samples': 9382464, 'steps': 48866, 'loss/train': 1.715158462524414} 11/07/2021 04:07:28 - INFO - __main__ - Step 48868: {'lr': 0.00038615827521358315, 'samples': 9382656, 'steps': 48867, 'loss/train': 1.4227601289749146} 11/07/2021 04:07:28 - INFO - __main__ - Step 48869: {'lr': 0.00038615382455216204, 'samples': 9382848, 'steps': 48868, 'loss/train': 3.3128716945648193} 11/07/2021 04:07:29 - INFO - __main__ - Step 48870: {'lr': 0.0003861493738293921, 'samples': 9383040, 'steps': 48869, 'loss/train': 1.4049416780471802} 11/07/2021 04:07:30 - INFO - __main__ - Step 48871: {'lr': 0.0003861449230452753, 'samples': 9383232, 'steps': 48870, 'loss/train': 1.6238514184951782} 11/07/2021 04:07:30 - INFO - __main__ - Step 48872: {'lr': 0.00038614047219981374, 'samples': 9383424, 'steps': 48871, 'loss/train': 1.6514708995819092} 11/07/2021 04:07:31 - INFO - __main__ - Step 48873: {'lr': 0.0003861360212930094, 'samples': 9383616, 'steps': 48872, 'loss/train': 1.457114338874817} 11/07/2021 04:07:31 - INFO - __main__ - Step 48874: {'lr': 0.0003861315703248643, 'samples': 9383808, 'steps': 48873, 'loss/train': 2.123868703842163} 11/07/2021 04:07:31 - INFO - __main__ - Step 48875: {'lr': 0.0003861271192953804, 'samples': 9384000, 'steps': 48874, 'loss/train': 1.9054794311523438} 11/07/2021 04:07:32 - INFO - __main__ - Step 48876: {'lr': 0.00038612266820455964, 'samples': 9384192, 'steps': 48875, 'loss/train': 1.4032111167907715} 11/07/2021 04:07:33 - INFO - __main__ - Step 48877: {'lr': 0.0003861182170524041, 'samples': 9384384, 'steps': 48876, 'loss/train': 1.3080118894577026} 11/07/2021 04:07:33 - INFO - __main__ - Step 48878: {'lr': 0.0003861137658389159, 'samples': 9384576, 'steps': 48877, 'loss/train': 1.5100162029266357} 11/07/2021 04:07:33 - INFO - __main__ - Step 48879: {'lr': 0.0003861093145640969, 'samples': 9384768, 'steps': 48878, 'loss/train': 1.6183574199676514} 11/07/2021 04:07:34 - INFO - __main__ - Step 48880: {'lr': 0.00038610486322794915, 'samples': 9384960, 'steps': 48879, 'loss/train': 1.8066436052322388} 11/07/2021 04:07:34 - INFO - __main__ - Step 48881: {'lr': 0.0003861004118304746, 'samples': 9385152, 'steps': 48880, 'loss/train': 1.4952269792556763} 11/07/2021 04:07:35 - INFO - __main__ - Step 48882: {'lr': 0.0003860959603716754, 'samples': 9385344, 'steps': 48881, 'loss/train': 0.9843176603317261} 11/07/2021 04:07:35 - INFO - __main__ - Step 48883: {'lr': 0.00038609150885155337, 'samples': 9385536, 'steps': 48882, 'loss/train': 1.4871866703033447} 11/07/2021 04:07:36 - INFO - __main__ - Step 48884: {'lr': 0.0003860870572701106, 'samples': 9385728, 'steps': 48883, 'loss/train': 1.55475914478302} 11/07/2021 04:07:36 - INFO - __main__ - Step 48885: {'lr': 0.0003860826056273492, 'samples': 9385920, 'steps': 48884, 'loss/train': 1.460806131362915} 11/07/2021 04:07:36 - INFO - __main__ - Step 48886: {'lr': 0.0003860781539232709, 'samples': 9386112, 'steps': 48885, 'loss/train': 5.611880302429199} 11/07/2021 04:07:38 - INFO - __main__ - Step 48887: {'lr': 0.0003860737021578781, 'samples': 9386304, 'steps': 48886, 'loss/train': 1.2217185497283936} 11/07/2021 04:07:38 - INFO - __main__ - Step 48888: {'lr': 0.00038606925033117246, 'samples': 9386496, 'steps': 48887, 'loss/train': 1.5372991561889648} 11/07/2021 04:07:38 - INFO - __main__ - Step 48889: {'lr': 0.00038606479844315614, 'samples': 9386688, 'steps': 48888, 'loss/train': 1.4254859685897827} 11/07/2021 04:07:39 - INFO - __main__ - Step 48890: {'lr': 0.00038606034649383116, 'samples': 9386880, 'steps': 48889, 'loss/train': 1.2352256774902344} 11/07/2021 04:07:39 - INFO - __main__ - Step 48891: {'lr': 0.0003860558944831994, 'samples': 9387072, 'steps': 48890, 'loss/train': 0.6035268902778625} 11/07/2021 04:07:39 - INFO - __main__ - Step 48892: {'lr': 0.000386051442411263, 'samples': 9387264, 'steps': 48891, 'loss/train': 1.3484644889831543} 11/07/2021 04:07:41 - INFO - __main__ - Step 48893: {'lr': 0.00038604699027802394, 'samples': 9387456, 'steps': 48892, 'loss/train': 1.492655634880066} 11/07/2021 04:07:41 - INFO - __main__ - Step 48894: {'lr': 0.0003860425380834842, 'samples': 9387648, 'steps': 48893, 'loss/train': 1.4083945751190186} 11/07/2021 04:07:41 - INFO - __main__ - Step 48895: {'lr': 0.0003860380858276458, 'samples': 9387840, 'steps': 48894, 'loss/train': 0.21939469873905182} 11/07/2021 04:07:42 - INFO - __main__ - Step 48896: {'lr': 0.0003860336335105107, 'samples': 9388032, 'steps': 48895, 'loss/train': 1.3290843963623047} 11/07/2021 04:07:42 - INFO - __main__ - Step 48897: {'lr': 0.000386029181132081, 'samples': 9388224, 'steps': 48896, 'loss/train': 1.3532027006149292} 11/07/2021 04:07:43 - INFO - __main__ - Step 48898: {'lr': 0.0003860247286923586, 'samples': 9388416, 'steps': 48897, 'loss/train': 1.551863193511963} 11/07/2021 04:07:43 - INFO - __main__ - Step 48899: {'lr': 0.0003860202761913455, 'samples': 9388608, 'steps': 48898, 'loss/train': 1.5765019655227661} 11/07/2021 04:07:44 - INFO - __main__ - Step 48900: {'lr': 0.00038601582362904384, 'samples': 9388800, 'steps': 48899, 'loss/train': 1.5646566152572632} 11/07/2021 04:07:44 - INFO - __main__ - Step 48901: {'lr': 0.0003860113710054556, 'samples': 9388992, 'steps': 48900, 'loss/train': 1.491734266281128} 11/07/2021 04:07:44 - INFO - __main__ - Step 48902: {'lr': 0.00038600691832058265, 'samples': 9389184, 'steps': 48901, 'loss/train': 1.0709919929504395} 11/07/2021 04:07:46 - INFO - __main__ - Step 48903: {'lr': 0.0003860024655744271, 'samples': 9389376, 'steps': 48902, 'loss/train': 0.95680832862854} 11/07/2021 04:07:46 - INFO - __main__ - Step 48904: {'lr': 0.000385998012766991, 'samples': 9389568, 'steps': 48903, 'loss/train': 1.5748766660690308} 11/07/2021 04:07:46 - INFO - __main__ - Step 48905: {'lr': 0.0003859935598982762, 'samples': 9389760, 'steps': 48904, 'loss/train': 1.882698893547058} 11/07/2021 04:07:47 - INFO - __main__ - Step 48906: {'lr': 0.0003859891069682848, 'samples': 9389952, 'steps': 48905, 'loss/train': 1.5582358837127686} 11/07/2021 04:07:47 - INFO - __main__ - Step 48907: {'lr': 0.0003859846539770189, 'samples': 9390144, 'steps': 48906, 'loss/train': 0.37549105286598206} 11/07/2021 04:07:48 - INFO - __main__ - Step 48908: {'lr': 0.0003859802009244804, 'samples': 9390336, 'steps': 48907, 'loss/train': 1.8381954431533813} 11/07/2021 04:07:49 - INFO - __main__ - Step 48909: {'lr': 0.00038597574781067123, 'samples': 9390528, 'steps': 48908, 'loss/train': 1.480832815170288} 11/07/2021 04:07:49 - INFO - __main__ - Step 48910: {'lr': 0.0003859712946355936, 'samples': 9390720, 'steps': 48909, 'loss/train': 0.8164600133895874} 11/07/2021 04:07:49 - INFO - __main__ - Step 48911: {'lr': 0.0003859668413992493, 'samples': 9390912, 'steps': 48910, 'loss/train': 1.281607985496521} 11/07/2021 04:07:50 - INFO - __main__ - Step 48912: {'lr': 0.0003859623881016404, 'samples': 9391104, 'steps': 48911, 'loss/train': 1.6252738237380981} 11/07/2021 04:07:50 - INFO - __main__ - Step 48913: {'lr': 0.000385957934742769, 'samples': 9391296, 'steps': 48912, 'loss/train': 1.3718078136444092} 11/07/2021 04:07:51 - INFO - __main__ - Step 48914: {'lr': 0.0003859534813226372, 'samples': 9391488, 'steps': 48913, 'loss/train': 1.0985984802246094} 11/07/2021 04:07:51 - INFO - __main__ - Step 48915: {'lr': 0.00038594902784124663, 'samples': 9391680, 'steps': 48914, 'loss/train': 1.525246500968933} 11/07/2021 04:07:52 - INFO - __main__ - Step 48916: {'lr': 0.00038594457429859966, 'samples': 9391872, 'steps': 48915, 'loss/train': 1.7037945985794067} 11/07/2021 04:07:52 - INFO - __main__ - Step 48917: {'lr': 0.00038594012069469814, 'samples': 9392064, 'steps': 48916, 'loss/train': 1.8779573440551758} 11/07/2021 04:07:52 - INFO - __main__ - Step 48918: {'lr': 0.0003859356670295441, 'samples': 9392256, 'steps': 48917, 'loss/train': 1.5259239673614502} 11/07/2021 04:07:54 - INFO - __main__ - Step 48919: {'lr': 0.00038593121330313953, 'samples': 9392448, 'steps': 48918, 'loss/train': 1.8216310739517212} 11/07/2021 04:07:54 - INFO - __main__ - Step 48920: {'lr': 0.0003859267595154865, 'samples': 9392640, 'steps': 48919, 'loss/train': 1.6950819492340088} 11/07/2021 04:07:54 - INFO - __main__ - Step 48921: {'lr': 0.0003859223056665869, 'samples': 9392832, 'steps': 48920, 'loss/train': 2.4829678535461426} 11/07/2021 04:07:55 - INFO - __main__ - Step 48922: {'lr': 0.00038591785175644283, 'samples': 9393024, 'steps': 48921, 'loss/train': 1.4854564666748047} 11/07/2021 04:07:55 - INFO - __main__ - Step 48923: {'lr': 0.0003859133977850563, 'samples': 9393216, 'steps': 48922, 'loss/train': 1.3652397394180298} 11/07/2021 04:07:56 - INFO - __main__ - Step 48924: {'lr': 0.00038590894375242925, 'samples': 9393408, 'steps': 48923, 'loss/train': 1.6962120532989502} 11/07/2021 04:07:56 - INFO - __main__ - Step 48925: {'lr': 0.0003859044896585637, 'samples': 9393600, 'steps': 48924, 'loss/train': 1.3781741857528687} 11/07/2021 04:07:57 - INFO - __main__ - Step 48926: {'lr': 0.00038590003550346177, 'samples': 9393792, 'steps': 48925, 'loss/train': 1.7190457582473755} 11/07/2021 04:07:57 - INFO - __main__ - Step 48927: {'lr': 0.0003858955812871254, 'samples': 9393984, 'steps': 48926, 'loss/train': 1.9883801937103271} 11/07/2021 04:07:57 - INFO - __main__ - Step 48928: {'lr': 0.0003858911270095565, 'samples': 9394176, 'steps': 48927, 'loss/train': 1.4579230546951294} 11/07/2021 04:07:59 - INFO - __main__ - Step 48929: {'lr': 0.00038588667267075715, 'samples': 9394368, 'steps': 48928, 'loss/train': 1.4992196559906006} 11/07/2021 04:07:59 - INFO - __main__ - Step 48930: {'lr': 0.0003858822182707294, 'samples': 9394560, 'steps': 48929, 'loss/train': 1.516557216644287} 11/07/2021 04:07:59 - INFO - __main__ - Step 48931: {'lr': 0.00038587776380947516, 'samples': 9394752, 'steps': 48930, 'loss/train': 1.510056495666504} 11/07/2021 04:08:00 - INFO - __main__ - Step 48932: {'lr': 0.0003858733092869966, 'samples': 9394944, 'steps': 48931, 'loss/train': 1.8778561353683472} 11/07/2021 04:08:00 - INFO - __main__ - Step 48933: {'lr': 0.00038586885470329554, 'samples': 9395136, 'steps': 48932, 'loss/train': 1.3887089490890503} 11/07/2021 04:08:00 - INFO - __main__ - Step 48934: {'lr': 0.0003858644000583741, 'samples': 9395328, 'steps': 48933, 'loss/train': 1.4813086986541748} 11/07/2021 04:08:01 - INFO - __main__ - Step 48935: {'lr': 0.0003858599453522342, 'samples': 9395520, 'steps': 48934, 'loss/train': 1.7310360670089722} 11/07/2021 04:08:02 - INFO - __main__ - Step 48936: {'lr': 0.000385855490584878, 'samples': 9395712, 'steps': 48935, 'loss/train': 1.3012902736663818} 11/07/2021 04:08:02 - INFO - __main__ - Step 48937: {'lr': 0.0003858510357563074, 'samples': 9395904, 'steps': 48936, 'loss/train': 1.43248450756073} 11/07/2021 04:08:02 - INFO - __main__ - Step 48938: {'lr': 0.00038584658086652433, 'samples': 9396096, 'steps': 48937, 'loss/train': 1.7660819292068481} 11/07/2021 04:08:03 - INFO - __main__ - Step 48939: {'lr': 0.00038584212591553105, 'samples': 9396288, 'steps': 48938, 'loss/train': 1.2785720825195312} 11/07/2021 04:08:04 - INFO - __main__ - Step 48940: {'lr': 0.00038583767090332924, 'samples': 9396480, 'steps': 48939, 'loss/train': 1.5622167587280273} 11/07/2021 04:08:04 - INFO - __main__ - Step 48941: {'lr': 0.00038583321582992113, 'samples': 9396672, 'steps': 48940, 'loss/train': 1.2461129426956177} 11/07/2021 04:08:04 - INFO - __main__ - Step 48942: {'lr': 0.0003858287606953087, 'samples': 9396864, 'steps': 48941, 'loss/train': 1.3549952507019043} 11/07/2021 04:08:05 - INFO - __main__ - Step 48943: {'lr': 0.00038582430549949386, 'samples': 9397056, 'steps': 48942, 'loss/train': 1.319166660308838} 11/07/2021 04:08:05 - INFO - __main__ - Step 48944: {'lr': 0.00038581985024247877, 'samples': 9397248, 'steps': 48943, 'loss/train': 1.4726316928863525} 11/07/2021 04:08:06 - INFO - __main__ - Step 48945: {'lr': 0.0003858153949242653, 'samples': 9397440, 'steps': 48944, 'loss/train': 1.5756710767745972} 11/07/2021 04:08:07 - INFO - __main__ - Step 48946: {'lr': 0.00038581093954485554, 'samples': 9397632, 'steps': 48945, 'loss/train': 1.4263874292373657} 11/07/2021 04:08:07 - INFO - __main__ - Step 48947: {'lr': 0.00038580648410425146, 'samples': 9397824, 'steps': 48946, 'loss/train': 1.4357798099517822} 11/07/2021 04:08:07 - INFO - __main__ - Step 48948: {'lr': 0.00038580202860245507, 'samples': 9398016, 'steps': 48947, 'loss/train': 1.2987250089645386} 11/07/2021 04:08:08 - INFO - __main__ - Step 48949: {'lr': 0.00038579757303946826, 'samples': 9398208, 'steps': 48948, 'loss/train': 1.1880955696105957} 11/07/2021 04:08:09 - INFO - __main__ - Step 48950: {'lr': 0.0003857931174152933, 'samples': 9398400, 'steps': 48949, 'loss/train': 1.6143276691436768} 11/07/2021 04:08:09 - INFO - __main__ - Step 48951: {'lr': 0.000385788661729932, 'samples': 9398592, 'steps': 48950, 'loss/train': 1.5052846670150757} 11/07/2021 04:08:09 - INFO - __main__ - Step 48952: {'lr': 0.0003857842059833865, 'samples': 9398784, 'steps': 48951, 'loss/train': 1.6773607730865479} 11/07/2021 04:08:10 - INFO - __main__ - Step 48953: {'lr': 0.0003857797501756587, 'samples': 9398976, 'steps': 48952, 'loss/train': 1.5313549041748047} 11/07/2021 04:08:10 - INFO - __main__ - Step 48954: {'lr': 0.0003857752943067506, 'samples': 9399168, 'steps': 48953, 'loss/train': 1.5578811168670654} 11/07/2021 04:08:12 - INFO - __main__ - Step 48955: {'lr': 0.0003857708383766643, 'samples': 9399360, 'steps': 48954, 'loss/train': 1.220923662185669} 11/07/2021 04:08:12 - INFO - __main__ - Step 48956: {'lr': 0.00038576638238540167, 'samples': 9399552, 'steps': 48955, 'loss/train': 1.686860203742981} 11/07/2021 04:08:12 - INFO - __main__ - Step 48957: {'lr': 0.00038576192633296485, 'samples': 9399744, 'steps': 48956, 'loss/train': 1.7237331867218018} 11/07/2021 04:08:13 - INFO - __main__ - Step 48958: {'lr': 0.00038575747021935583, 'samples': 9399936, 'steps': 48957, 'loss/train': 1.772165060043335} 11/07/2021 04:08:13 - INFO - __main__ - Step 48959: {'lr': 0.0003857530140445765, 'samples': 9400128, 'steps': 48958, 'loss/train': 1.250779151916504} 11/07/2021 04:08:13 - INFO - __main__ - Step 48960: {'lr': 0.00038574855780862903, 'samples': 9400320, 'steps': 48959, 'loss/train': 1.3282482624053955} 11/07/2021 04:08:14 - INFO - __main__ - Step 48961: {'lr': 0.0003857441015115154, 'samples': 9400512, 'steps': 48960, 'loss/train': 1.4221696853637695} 11/07/2021 04:08:15 - INFO - __main__ - Step 48962: {'lr': 0.00038573964515323754, 'samples': 9400704, 'steps': 48961, 'loss/train': 1.850115418434143} 11/07/2021 04:08:15 - INFO - __main__ - Step 48963: {'lr': 0.0003857351887337974, 'samples': 9400896, 'steps': 48962, 'loss/train': 1.2149375677108765} 11/07/2021 04:08:15 - INFO - __main__ - Step 48964: {'lr': 0.00038573073225319724, 'samples': 9401088, 'steps': 48963, 'loss/train': 1.45426607131958} 11/07/2021 04:08:16 - INFO - __main__ - Step 48965: {'lr': 0.00038572627571143873, 'samples': 9401280, 'steps': 48964, 'loss/train': 1.7876821756362915} 11/07/2021 04:08:17 - INFO - __main__ - Step 48966: {'lr': 0.0003857218191085242, 'samples': 9401472, 'steps': 48965, 'loss/train': 1.346888542175293} 11/07/2021 04:08:17 - INFO - __main__ - Step 48967: {'lr': 0.0003857173624444554, 'samples': 9401664, 'steps': 48966, 'loss/train': 0.31117236614227295} 11/07/2021 04:08:18 - INFO - __main__ - Step 48968: {'lr': 0.00038571290571923455, 'samples': 9401856, 'steps': 48967, 'loss/train': 0.902428150177002} 11/07/2021 04:08:18 - INFO - __main__ - Step 48969: {'lr': 0.0003857084489328635, 'samples': 9402048, 'steps': 48968, 'loss/train': 1.6733211278915405} 11/07/2021 04:08:18 - INFO - __main__ - Step 48970: {'lr': 0.00038570399208534437, 'samples': 9402240, 'steps': 48969, 'loss/train': 1.519181489944458} 11/07/2021 04:08:19 - INFO - __main__ - Step 48971: {'lr': 0.000385699535176679, 'samples': 9402432, 'steps': 48970, 'loss/train': 1.7384952306747437} 11/07/2021 04:08:20 - INFO - __main__ - Step 48972: {'lr': 0.00038569507820686956, 'samples': 9402624, 'steps': 48971, 'loss/train': 1.4874610900878906} 11/07/2021 04:08:20 - INFO - __main__ - Step 48973: {'lr': 0.000385690621175918, 'samples': 9402816, 'steps': 48972, 'loss/train': 1.9224824905395508} 11/07/2021 04:08:20 - INFO - __main__ - Step 48974: {'lr': 0.0003856861640838265, 'samples': 9403008, 'steps': 48973, 'loss/train': 2.112514019012451} 11/07/2021 04:08:21 - INFO - __main__ - Step 48975: {'lr': 0.00038568170693059677, 'samples': 9403200, 'steps': 48974, 'loss/train': 1.4942660331726074} 11/07/2021 04:08:22 - INFO - __main__ - Step 48976: {'lr': 0.000385677249716231, 'samples': 9403392, 'steps': 48975, 'loss/train': 1.478994369506836} 11/07/2021 04:08:22 - INFO - __main__ - Step 48977: {'lr': 0.0003856727924407311, 'samples': 9403584, 'steps': 48976, 'loss/train': 1.1730769872665405} 11/07/2021 04:08:22 - INFO - __main__ - Step 48978: {'lr': 0.0003856683351040992, 'samples': 9403776, 'steps': 48977, 'loss/train': 1.257746696472168} 11/07/2021 04:08:23 - INFO - __main__ - Step 48979: {'lr': 0.00038566387770633715, 'samples': 9403968, 'steps': 48978, 'loss/train': 1.9970284700393677} 11/07/2021 04:08:23 - INFO - __main__ - Step 48980: {'lr': 0.00038565942024744703, 'samples': 9404160, 'steps': 48979, 'loss/train': 1.560881495475769} 11/07/2021 04:08:24 - INFO - __main__ - Step 48981: {'lr': 0.000385654962727431, 'samples': 9404352, 'steps': 48980, 'loss/train': 1.696020483970642} 11/07/2021 04:08:25 - INFO - __main__ - Step 48982: {'lr': 0.00038565050514629087, 'samples': 9404544, 'steps': 48981, 'loss/train': 1.3196929693222046} 11/07/2021 04:08:25 - INFO - __main__ - Step 48983: {'lr': 0.0003856460475040288, 'samples': 9404736, 'steps': 48982, 'loss/train': 1.0660433769226074} 11/07/2021 04:08:25 - INFO - __main__ - Step 48984: {'lr': 0.00038564158980064657, 'samples': 9404928, 'steps': 48983, 'loss/train': 1.5505517721176147} 11/07/2021 04:08:26 - INFO - __main__ - Step 48985: {'lr': 0.0003856371320361464, 'samples': 9405120, 'steps': 48984, 'loss/train': 1.1489191055297852} 11/07/2021 04:08:27 - INFO - __main__ - Step 48986: {'lr': 0.00038563267421053024, 'samples': 9405312, 'steps': 48985, 'loss/train': 1.3692070245742798} 11/07/2021 04:08:27 - INFO - __main__ - Step 48987: {'lr': 0.0003856282163238001, 'samples': 9405504, 'steps': 48986, 'loss/train': 1.6641390323638916} 11/07/2021 04:08:27 - INFO - __main__ - Step 48988: {'lr': 0.000385623758375958, 'samples': 9405696, 'steps': 48987, 'loss/train': 1.8186664581298828} 11/07/2021 04:08:28 - INFO - __main__ - Step 48989: {'lr': 0.0003856193003670058, 'samples': 9405888, 'steps': 48988, 'loss/train': 0.9343728423118591} 11/07/2021 04:08:28 - INFO - __main__ - Step 48990: {'lr': 0.0003856148422969458, 'samples': 9406080, 'steps': 48989, 'loss/train': 1.5853612422943115} 11/07/2021 04:08:29 - INFO - __main__ - Step 48991: {'lr': 0.0003856103841657797, 'samples': 9406272, 'steps': 48990, 'loss/train': 1.4318612813949585} 11/07/2021 04:08:29 - INFO - __main__ - Step 48992: {'lr': 0.00038560592597350975, 'samples': 9406464, 'steps': 48991, 'loss/train': 1.2812211513519287} 11/07/2021 04:08:30 - INFO - __main__ - Step 48993: {'lr': 0.0003856014677201378, 'samples': 9406656, 'steps': 48992, 'loss/train': 1.5667595863342285} 11/07/2021 04:08:30 - INFO - __main__ - Step 48994: {'lr': 0.000385597009405666, 'samples': 9406848, 'steps': 48993, 'loss/train': 1.3088542222976685} 11/07/2021 04:08:30 - INFO - __main__ - Step 48995: {'lr': 0.0003855925510300962, 'samples': 9407040, 'steps': 48994, 'loss/train': 1.3461893796920776} 11/07/2021 04:08:31 - INFO - __main__ - Step 48996: {'lr': 0.0003855880925934305, 'samples': 9407232, 'steps': 48995, 'loss/train': 1.308261513710022} 11/07/2021 04:08:32 - INFO - __main__ - Step 48997: {'lr': 0.000385583634095671, 'samples': 9407424, 'steps': 48996, 'loss/train': 1.569661259651184} 11/07/2021 04:08:32 - INFO - __main__ - Step 48998: {'lr': 0.00038557917553681944, 'samples': 9407616, 'steps': 48997, 'loss/train': 1.5256402492523193} 11/07/2021 04:08:32 - INFO - __main__ - Step 48999: {'lr': 0.00038557471691687804, 'samples': 9407808, 'steps': 48998, 'loss/train': 1.4784225225448608} 11/07/2021 04:08:33 - INFO - __main__ - Step 49000: {'lr': 0.0003855702582358489, 'samples': 9408000, 'steps': 48999, 'loss/train': 1.7442666292190552} 11/07/2021 04:08:33 - INFO - __main__ - Step 49001: {'lr': 0.00038556579949373384, 'samples': 9408192, 'steps': 49000, 'loss/train': 1.5192310810089111} 11/07/2021 04:08:34 - INFO - __main__ - Step 49002: {'lr': 0.00038556134069053484, 'samples': 9408384, 'steps': 49001, 'loss/train': 0.9820490479469299} 11/07/2021 04:08:34 - INFO - __main__ - Step 49003: {'lr': 0.00038555688182625406, 'samples': 9408576, 'steps': 49002, 'loss/train': 1.5996290445327759} 11/07/2021 04:08:35 - INFO - __main__ - Step 49004: {'lr': 0.0003855524229008934, 'samples': 9408768, 'steps': 49003, 'loss/train': 0.8507100939750671} 11/07/2021 04:08:35 - INFO - __main__ - Step 49005: {'lr': 0.0003855479639144549, 'samples': 9408960, 'steps': 49004, 'loss/train': 1.7345751523971558} 11/07/2021 04:08:35 - INFO - __main__ - Step 49006: {'lr': 0.0003855435048669406, 'samples': 9409152, 'steps': 49005, 'loss/train': 1.1710102558135986} 11/07/2021 04:08:37 - INFO - __main__ - Step 49007: {'lr': 0.0003855390457583525, 'samples': 9409344, 'steps': 49006, 'loss/train': 1.5970712900161743} 11/07/2021 04:08:37 - INFO - __main__ - Step 49008: {'lr': 0.0003855345865886926, 'samples': 9409536, 'steps': 49007, 'loss/train': 2.054039239883423} 11/07/2021 04:08:37 - INFO - __main__ - Step 49009: {'lr': 0.0003855301273579629, 'samples': 9409728, 'steps': 49008, 'loss/train': 4.088986396789551} 11/07/2021 04:08:38 - INFO - __main__ - Step 49010: {'lr': 0.0003855256680661654, 'samples': 9409920, 'steps': 49009, 'loss/train': 1.3908464908599854} 11/07/2021 04:08:38 - INFO - __main__ - Step 49011: {'lr': 0.00038552120871330217, 'samples': 9410112, 'steps': 49010, 'loss/train': 1.4406582117080688} 11/07/2021 04:08:38 - INFO - __main__ - Step 49012: {'lr': 0.0003855167492993751, 'samples': 9410304, 'steps': 49011, 'loss/train': 1.5320438146591187} 11/07/2021 04:08:39 - INFO - __main__ - Step 49013: {'lr': 0.00038551228982438635, 'samples': 9410496, 'steps': 49012, 'loss/train': 1.4877569675445557} 11/07/2021 04:08:40 - INFO - __main__ - Step 49014: {'lr': 0.00038550783028833786, 'samples': 9410688, 'steps': 49013, 'loss/train': 1.3163621425628662} 11/07/2021 04:08:40 - INFO - __main__ - Step 49015: {'lr': 0.00038550337069123155, 'samples': 9410880, 'steps': 49014, 'loss/train': 1.7298897504806519} 11/07/2021 04:08:41 - INFO - __main__ - Step 49016: {'lr': 0.00038549891103306953, 'samples': 9411072, 'steps': 49015, 'loss/train': 1.3940982818603516} 11/07/2021 04:08:41 - INFO - __main__ - Step 49017: {'lr': 0.00038549445131385386, 'samples': 9411264, 'steps': 49016, 'loss/train': 1.5628517866134644} 11/07/2021 04:08:42 - INFO - __main__ - Step 49018: {'lr': 0.00038548999153358645, 'samples': 9411456, 'steps': 49017, 'loss/train': 1.1688588857650757} 11/07/2021 04:08:42 - INFO - __main__ - Step 49019: {'lr': 0.0003854855316922693, 'samples': 9411648, 'steps': 49018, 'loss/train': 1.7715271711349487} 11/07/2021 04:08:43 - INFO - __main__ - Step 49020: {'lr': 0.0003854810717899045, 'samples': 9411840, 'steps': 49019, 'loss/train': 1.3426858186721802} 11/07/2021 04:08:43 - INFO - __main__ - Step 49021: {'lr': 0.0003854766118264941, 'samples': 9412032, 'steps': 49020, 'loss/train': 1.2353349924087524} 11/07/2021 04:08:43 - INFO - __main__ - Step 49022: {'lr': 0.0003854721518020399, 'samples': 9412224, 'steps': 49021, 'loss/train': 0.7917335033416748} 11/07/2021 04:08:44 - INFO - __main__ - Step 49023: {'lr': 0.00038546769171654403, 'samples': 9412416, 'steps': 49022, 'loss/train': 2.011986255645752} 11/07/2021 04:08:45 - INFO - __main__ - Step 49024: {'lr': 0.00038546323157000856, 'samples': 9412608, 'steps': 49023, 'loss/train': 1.5429515838623047} 11/07/2021 04:08:45 - INFO - __main__ - Step 49025: {'lr': 0.00038545877136243544, 'samples': 9412800, 'steps': 49024, 'loss/train': 1.461340308189392} 11/07/2021 04:08:46 - INFO - __main__ - Step 49026: {'lr': 0.00038545431109382667, 'samples': 9412992, 'steps': 49025, 'loss/train': 1.071258544921875} 11/07/2021 04:08:46 - INFO - __main__ - Step 49027: {'lr': 0.0003854498507641843, 'samples': 9413184, 'steps': 49026, 'loss/train': 2.099436044692993} 11/07/2021 04:08:46 - INFO - __main__ - Step 49028: {'lr': 0.00038544539037351037, 'samples': 9413376, 'steps': 49027, 'loss/train': 1.1202818155288696} 11/07/2021 04:08:47 - INFO - __main__ - Step 49029: {'lr': 0.0003854409299218068, 'samples': 9413568, 'steps': 49028, 'loss/train': 1.7763105630874634} 11/07/2021 04:08:48 - INFO - __main__ - Step 49030: {'lr': 0.00038543646940907564, 'samples': 9413760, 'steps': 49029, 'loss/train': 5.898472309112549} 11/07/2021 04:08:48 - INFO - __main__ - Step 49031: {'lr': 0.0003854320088353188, 'samples': 9413952, 'steps': 49030, 'loss/train': 1.3018929958343506} 11/07/2021 04:08:48 - INFO - __main__ - Step 49032: {'lr': 0.0003854275482005385, 'samples': 9414144, 'steps': 49031, 'loss/train': 1.8933767080307007} 11/07/2021 04:08:49 - INFO - __main__ - Step 49033: {'lr': 0.0003854230875047366, 'samples': 9414336, 'steps': 49032, 'loss/train': 1.587097406387329} 11/07/2021 04:08:49 - INFO - __main__ - Step 49034: {'lr': 0.0003854186267479151, 'samples': 9414528, 'steps': 49033, 'loss/train': 1.6172665357589722} 11/07/2021 04:08:51 - INFO - __main__ - Step 49035: {'lr': 0.00038541416593007615, 'samples': 9414720, 'steps': 49034, 'loss/train': 1.136541724205017} 11/07/2021 04:08:51 - INFO - __main__ - Step 49036: {'lr': 0.00038540970505122164, 'samples': 9414912, 'steps': 49035, 'loss/train': 1.8691534996032715} 11/07/2021 04:08:51 - INFO - __main__ - Step 49037: {'lr': 0.0003854052441113536, 'samples': 9415104, 'steps': 49036, 'loss/train': 1.267198085784912} 11/07/2021 04:08:52 - INFO - __main__ - Step 49038: {'lr': 0.00038540078311047397, 'samples': 9415296, 'steps': 49037, 'loss/train': 1.221850872039795} 11/07/2021 04:08:52 - INFO - __main__ - Step 49039: {'lr': 0.0003853963220485849, 'samples': 9415488, 'steps': 49038, 'loss/train': 1.516257882118225} 11/07/2021 04:08:52 - INFO - __main__ - Step 49040: {'lr': 0.00038539186092568833, 'samples': 9415680, 'steps': 49039, 'loss/train': 1.5481401681900024} 11/07/2021 04:08:53 - INFO - __main__ - Step 49041: {'lr': 0.00038538739974178633, 'samples': 9415872, 'steps': 49040, 'loss/train': 1.4090861082077026} 11/07/2021 04:08:54 - INFO - __main__ - Step 49042: {'lr': 0.00038538293849688077, 'samples': 9416064, 'steps': 49041, 'loss/train': 1.515071153640747} 11/07/2021 04:08:54 - INFO - __main__ - Step 49043: {'lr': 0.0003853784771909739, 'samples': 9416256, 'steps': 49042, 'loss/train': 1.0329222679138184} 11/07/2021 04:08:54 - INFO - __main__ - Step 49044: {'lr': 0.0003853740158240674, 'samples': 9416448, 'steps': 49043, 'loss/train': 1.4012494087219238} 11/07/2021 04:08:55 - INFO - __main__ - Step 49045: {'lr': 0.0003853695543961635, 'samples': 9416640, 'steps': 49044, 'loss/train': 2.0041351318359375} 11/07/2021 04:08:56 - INFO - __main__ - Step 49046: {'lr': 0.00038536509290726417, 'samples': 9416832, 'steps': 49045, 'loss/train': 1.7326302528381348} 11/07/2021 04:08:56 - INFO - __main__ - Step 49047: {'lr': 0.00038536063135737145, 'samples': 9417024, 'steps': 49046, 'loss/train': 1.3995091915130615} 11/07/2021 04:08:56 - INFO - __main__ - Step 49048: {'lr': 0.0003853561697464874, 'samples': 9417216, 'steps': 49047, 'loss/train': 1.129688024520874} 11/07/2021 04:08:57 - INFO - __main__ - Step 49049: {'lr': 0.0003853517080746138, 'samples': 9417408, 'steps': 49048, 'loss/train': 1.1902576684951782} 11/07/2021 04:08:57 - INFO - __main__ - Step 49050: {'lr': 0.00038534724634175285, 'samples': 9417600, 'steps': 49049, 'loss/train': 1.5384318828582764} 11/07/2021 04:08:58 - INFO - __main__ - Step 49051: {'lr': 0.0003853427845479065, 'samples': 9417792, 'steps': 49050, 'loss/train': 1.6219627857208252} 11/07/2021 04:08:59 - INFO - __main__ - Step 49052: {'lr': 0.0003853383226930768, 'samples': 9417984, 'steps': 49051, 'loss/train': 0.9121947288513184} 11/07/2021 04:08:59 - INFO - __main__ - Step 49053: {'lr': 0.00038533386077726573, 'samples': 9418176, 'steps': 49052, 'loss/train': 0.8772579431533813} 11/07/2021 04:08:59 - INFO - __main__ - Step 49054: {'lr': 0.00038532939880047535, 'samples': 9418368, 'steps': 49053, 'loss/train': 1.5285707712173462} 11/07/2021 04:09:00 - INFO - __main__ - Step 49055: {'lr': 0.00038532493676270765, 'samples': 9418560, 'steps': 49054, 'loss/train': 0.7943586707115173} 11/07/2021 04:09:00 - INFO - __main__ - Step 49056: {'lr': 0.0003853204746639646, 'samples': 9418752, 'steps': 49055, 'loss/train': 1.632174015045166} 11/07/2021 04:09:01 - INFO - __main__ - Step 49057: {'lr': 0.0003853160125042482, 'samples': 9418944, 'steps': 49056, 'loss/train': 1.5486323833465576} 11/07/2021 04:09:01 - INFO - __main__ - Step 49058: {'lr': 0.00038531155028356047, 'samples': 9419136, 'steps': 49057, 'loss/train': 1.500724196434021} 11/07/2021 04:09:02 - INFO - __main__ - Step 49059: {'lr': 0.0003853070880019035, 'samples': 9419328, 'steps': 49058, 'loss/train': 1.1665802001953125} 11/07/2021 04:09:02 - INFO - __main__ - Step 49060: {'lr': 0.0003853026256592792, 'samples': 9419520, 'steps': 49059, 'loss/train': 1.3292691707611084} 11/07/2021 04:09:02 - INFO - __main__ - Step 49061: {'lr': 0.0003852981632556897, 'samples': 9419712, 'steps': 49060, 'loss/train': 1.4833451509475708} 11/07/2021 04:09:03 - INFO - __main__ - Step 49062: {'lr': 0.0003852937007911369, 'samples': 9419904, 'steps': 49061, 'loss/train': 1.846569299697876} 11/07/2021 04:09:04 - INFO - __main__ - Step 49063: {'lr': 0.00038528923826562287, 'samples': 9420096, 'steps': 49062, 'loss/train': 1.3322322368621826} 11/07/2021 04:09:04 - INFO - __main__ - Step 49064: {'lr': 0.00038528477567914955, 'samples': 9420288, 'steps': 49063, 'loss/train': 1.1554535627365112} 11/07/2021 04:09:04 - INFO - __main__ - Step 49065: {'lr': 0.000385280313031719, 'samples': 9420480, 'steps': 49064, 'loss/train': 1.8195546865463257} 11/07/2021 04:09:05 - INFO - __main__ - Step 49066: {'lr': 0.00038527585032333326, 'samples': 9420672, 'steps': 49065, 'loss/train': 1.331484317779541} 11/07/2021 04:09:06 - INFO - __main__ - Step 49067: {'lr': 0.00038527138755399423, 'samples': 9420864, 'steps': 49066, 'loss/train': 1.426199197769165} 11/07/2021 04:09:06 - INFO - __main__ - Step 49068: {'lr': 0.00038526692472370407, 'samples': 9421056, 'steps': 49067, 'loss/train': 1.358767032623291} 11/07/2021 04:09:07 - INFO - __main__ - Step 49069: {'lr': 0.0003852624618324647, 'samples': 9421248, 'steps': 49068, 'loss/train': 1.6837997436523438} 11/07/2021 04:09:07 - INFO - __main__ - Step 49070: {'lr': 0.0003852579988802782, 'samples': 9421440, 'steps': 49069, 'loss/train': 2.1985177993774414} 11/07/2021 04:09:07 - INFO - __main__ - Step 49071: {'lr': 0.00038525353586714645, 'samples': 9421632, 'steps': 49070, 'loss/train': 1.5465973615646362} 11/07/2021 04:09:08 - INFO - __main__ - Step 49072: {'lr': 0.0003852490727930716, 'samples': 9421824, 'steps': 49071, 'loss/train': 1.1003345251083374} 11/07/2021 04:09:09 - INFO - __main__ - Step 49073: {'lr': 0.00038524460965805557, 'samples': 9422016, 'steps': 49072, 'loss/train': 1.042018175125122} 11/07/2021 04:09:09 - INFO - __main__ - Step 49074: {'lr': 0.00038524014646210044, 'samples': 9422208, 'steps': 49073, 'loss/train': 1.722677230834961} 11/07/2021 04:09:09 - INFO - __main__ - Step 49075: {'lr': 0.00038523568320520817, 'samples': 9422400, 'steps': 49074, 'loss/train': 1.2833806276321411} 11/07/2021 04:09:10 - INFO - __main__ - Step 49076: {'lr': 0.0003852312198873808, 'samples': 9422592, 'steps': 49075, 'loss/train': 1.5856560468673706} 11/07/2021 04:09:11 - INFO - __main__ - Step 49077: {'lr': 0.0003852267565086203, 'samples': 9422784, 'steps': 49076, 'loss/train': 1.6125866174697876} 11/07/2021 04:09:11 - INFO - __main__ - Step 49078: {'lr': 0.0003852222930689288, 'samples': 9422976, 'steps': 49077, 'loss/train': 1.5924015045166016} 11/07/2021 04:09:11 - INFO - __main__ - Step 49079: {'lr': 0.00038521782956830807, 'samples': 9423168, 'steps': 49078, 'loss/train': 1.728890299797058} 11/07/2021 04:09:12 - INFO - __main__ - Step 49080: {'lr': 0.00038521336600676035, 'samples': 9423360, 'steps': 49079, 'loss/train': 1.4587135314941406} 11/07/2021 04:09:12 - INFO - __main__ - Step 49081: {'lr': 0.00038520890238428763, 'samples': 9423552, 'steps': 49080, 'loss/train': 1.463641881942749} 11/07/2021 04:09:13 - INFO - __main__ - Step 49082: {'lr': 0.00038520443870089185, 'samples': 9423744, 'steps': 49081, 'loss/train': 1.0472511053085327} 11/07/2021 04:09:13 - INFO - __main__ - Step 49083: {'lr': 0.00038519997495657497, 'samples': 9423936, 'steps': 49082, 'loss/train': 1.4112025499343872} 11/07/2021 04:09:14 - INFO - __main__ - Step 49084: {'lr': 0.0003851955111513391, 'samples': 9424128, 'steps': 49083, 'loss/train': 0.6797143816947937} 11/07/2021 04:09:14 - INFO - __main__ - Step 49085: {'lr': 0.0003851910472851862, 'samples': 9424320, 'steps': 49084, 'loss/train': 1.422843098640442} 11/07/2021 04:09:14 - INFO - __main__ - Step 49086: {'lr': 0.0003851865833581183, 'samples': 9424512, 'steps': 49085, 'loss/train': 1.7759983539581299} 11/07/2021 04:09:15 - INFO - __main__ - Step 49087: {'lr': 0.0003851821193701375, 'samples': 9424704, 'steps': 49086, 'loss/train': 1.171252727508545} 11/07/2021 04:09:16 - INFO - __main__ - Step 49088: {'lr': 0.0003851776553212456, 'samples': 9424896, 'steps': 49087, 'loss/train': 0.863366425037384} 11/07/2021 04:09:16 - INFO - __main__ - Step 49089: {'lr': 0.0003851731912114448, 'samples': 9425088, 'steps': 49088, 'loss/train': 1.486554741859436} 11/07/2021 04:09:17 - INFO - __main__ - Step 49090: {'lr': 0.00038516872704073704, 'samples': 9425280, 'steps': 49089, 'loss/train': 1.8933355808258057} 11/07/2021 04:09:17 - INFO - __main__ - Step 49091: {'lr': 0.0003851642628091243, 'samples': 9425472, 'steps': 49090, 'loss/train': 1.3501158952713013} 11/07/2021 04:09:18 - INFO - __main__ - Step 49092: {'lr': 0.0003851597985166087, 'samples': 9425664, 'steps': 49091, 'loss/train': 1.405320167541504} 11/07/2021 04:09:18 - INFO - __main__ - Step 49093: {'lr': 0.0003851553341631921, 'samples': 9425856, 'steps': 49092, 'loss/train': 1.4765554666519165} 11/07/2021 04:09:19 - INFO - __main__ - Step 49094: {'lr': 0.0003851508697488766, 'samples': 9426048, 'steps': 49093, 'loss/train': 1.0383342504501343} 11/07/2021 04:09:19 - INFO - __main__ - Step 49095: {'lr': 0.0003851464052736643, 'samples': 9426240, 'steps': 49094, 'loss/train': 0.8753216862678528} 11/07/2021 04:09:19 - INFO - __main__ - Step 49096: {'lr': 0.00038514194073755706, 'samples': 9426432, 'steps': 49095, 'loss/train': 1.3852145671844482} 11/07/2021 04:09:20 - INFO - __main__ - Step 49097: {'lr': 0.00038513747614055696, 'samples': 9426624, 'steps': 49096, 'loss/train': 1.2463265657424927} 11/07/2021 04:09:21 - INFO - __main__ - Step 49098: {'lr': 0.0003851330114826659, 'samples': 9426816, 'steps': 49097, 'loss/train': 1.6080673933029175} 11/07/2021 04:09:21 - INFO - __main__ - Step 49099: {'lr': 0.0003851285467638861, 'samples': 9427008, 'steps': 49098, 'loss/train': 1.8316123485565186} 11/07/2021 04:09:21 - INFO - __main__ - Step 49100: {'lr': 0.00038512408198421936, 'samples': 9427200, 'steps': 49099, 'loss/train': 1.3645503520965576} 11/07/2021 04:09:22 - INFO - __main__ - Step 49101: {'lr': 0.0003851196171436679, 'samples': 9427392, 'steps': 49100, 'loss/train': 1.1806963682174683} 11/07/2021 04:09:23 - INFO - __main__ - Step 49102: {'lr': 0.0003851151522422336, 'samples': 9427584, 'steps': 49101, 'loss/train': 0.46137189865112305} 11/07/2021 04:09:23 - INFO - __main__ - Step 49103: {'lr': 0.0003851106872799185, 'samples': 9427776, 'steps': 49102, 'loss/train': 1.374953269958496} 11/07/2021 04:09:23 - INFO - __main__ - Step 49104: {'lr': 0.00038510622225672455, 'samples': 9427968, 'steps': 49103, 'loss/train': 1.1852757930755615} 11/07/2021 04:09:24 - INFO - __main__ - Step 49105: {'lr': 0.0003851017571726539, 'samples': 9428160, 'steps': 49104, 'loss/train': 0.906430184841156} 11/07/2021 04:09:24 - INFO - __main__ - Step 49106: {'lr': 0.00038509729202770843, 'samples': 9428352, 'steps': 49105, 'loss/train': 1.8787739276885986} 11/07/2021 04:09:24 - INFO - __main__ - Step 49107: {'lr': 0.00038509282682189016, 'samples': 9428544, 'steps': 49106, 'loss/train': 1.4721879959106445} 11/07/2021 04:09:26 - INFO - __main__ - Step 49108: {'lr': 0.0003850883615552012, 'samples': 9428736, 'steps': 49107, 'loss/train': 1.5572868585586548} 11/07/2021 04:09:26 - INFO - __main__ - Step 49109: {'lr': 0.0003850838962276436, 'samples': 9428928, 'steps': 49108, 'loss/train': 1.6854112148284912} 11/07/2021 04:09:26 - INFO - __main__ - Step 49110: {'lr': 0.0003850794308392192, 'samples': 9429120, 'steps': 49109, 'loss/train': 1.3845703601837158} 11/07/2021 04:09:27 - INFO - __main__ - Step 49111: {'lr': 0.00038507496538993006, 'samples': 9429312, 'steps': 49110, 'loss/train': 1.334354281425476} 11/07/2021 04:09:27 - INFO - __main__ - Step 49112: {'lr': 0.00038507049987977825, 'samples': 9429504, 'steps': 49111, 'loss/train': 1.1155405044555664} 11/07/2021 04:09:28 - INFO - __main__ - Step 49113: {'lr': 0.0003850660343087657, 'samples': 9429696, 'steps': 49112, 'loss/train': 0.5999600291252136} 11/07/2021 04:09:28 - INFO - __main__ - Step 49114: {'lr': 0.0003850615686768946, 'samples': 9429888, 'steps': 49113, 'loss/train': 1.5678521394729614} 11/07/2021 04:09:29 - INFO - __main__ - Step 49115: {'lr': 0.00038505710298416683, 'samples': 9430080, 'steps': 49114, 'loss/train': 1.5501654148101807} 11/07/2021 04:09:29 - INFO - __main__ - Step 49116: {'lr': 0.00038505263723058437, 'samples': 9430272, 'steps': 49115, 'loss/train': 0.7990596890449524} 11/07/2021 04:09:29 - INFO - __main__ - Step 49117: {'lr': 0.0003850481714161492, 'samples': 9430464, 'steps': 49116, 'loss/train': 1.1202987432479858} 11/07/2021 04:09:30 - INFO - __main__ - Step 49118: {'lr': 0.00038504370554086353, 'samples': 9430656, 'steps': 49117, 'loss/train': 1.5684027671813965} 11/07/2021 04:09:31 - INFO - __main__ - Step 49119: {'lr': 0.0003850392396047292, 'samples': 9430848, 'steps': 49118, 'loss/train': 1.6090108156204224} 11/07/2021 04:09:31 - INFO - __main__ - Step 49120: {'lr': 0.0003850347736077483, 'samples': 9431040, 'steps': 49119, 'loss/train': 1.100080132484436} 11/07/2021 04:09:31 - INFO - __main__ - Step 49121: {'lr': 0.0003850303075499227, 'samples': 9431232, 'steps': 49120, 'loss/train': 1.6162917613983154} 11/07/2021 04:09:32 - INFO - __main__ - Step 49122: {'lr': 0.0003850258414312547, 'samples': 9431424, 'steps': 49121, 'loss/train': 1.0960960388183594} 11/07/2021 04:09:33 - INFO - __main__ - Step 49123: {'lr': 0.000385021375251746, 'samples': 9431616, 'steps': 49122, 'loss/train': 1.4721559286117554} 11/07/2021 04:09:33 - INFO - __main__ - Step 49124: {'lr': 0.00038501690901139883, 'samples': 9431808, 'steps': 49123, 'loss/train': 0.6333041787147522} 11/07/2021 04:09:34 - INFO - __main__ - Step 49125: {'lr': 0.0003850124427102151, 'samples': 9432000, 'steps': 49124, 'loss/train': 2.3519115447998047} 11/07/2021 04:09:34 - INFO - __main__ - Step 49126: {'lr': 0.0003850079763481968, 'samples': 9432192, 'steps': 49125, 'loss/train': 1.4221237897872925} 11/07/2021 04:09:34 - INFO - __main__ - Step 49127: {'lr': 0.0003850035099253461, 'samples': 9432384, 'steps': 49126, 'loss/train': 5.797683238983154} 11/07/2021 04:09:35 - INFO - __main__ - Step 49128: {'lr': 0.00038499904344166483, 'samples': 9432576, 'steps': 49127, 'loss/train': 0.7352468967437744} 11/07/2021 04:09:36 - INFO - __main__ - Step 49129: {'lr': 0.0003849945768971551, 'samples': 9432768, 'steps': 49128, 'loss/train': 1.9143232107162476} 11/07/2021 04:09:36 - INFO - __main__ - Step 49130: {'lr': 0.0003849901102918189, 'samples': 9432960, 'steps': 49129, 'loss/train': 1.6455557346343994} 11/07/2021 04:09:36 - INFO - __main__ - Step 49131: {'lr': 0.00038498564362565826, 'samples': 9433152, 'steps': 49130, 'loss/train': 1.6137969493865967} 11/07/2021 04:09:37 - INFO - __main__ - Step 49132: {'lr': 0.0003849811768986751, 'samples': 9433344, 'steps': 49131, 'loss/train': 1.4530037641525269} 11/07/2021 04:09:37 - INFO - __main__ - Step 49133: {'lr': 0.0003849767101108715, 'samples': 9433536, 'steps': 49132, 'loss/train': 1.8602375984191895} 11/07/2021 04:09:38 - INFO - __main__ - Step 49134: {'lr': 0.0003849722432622495, 'samples': 9433728, 'steps': 49133, 'loss/train': 1.441968321800232} 11/07/2021 04:09:39 - INFO - __main__ - Step 49135: {'lr': 0.0003849677763528111, 'samples': 9433920, 'steps': 49134, 'loss/train': 1.8222562074661255} 11/07/2021 04:09:39 - INFO - __main__ - Step 49136: {'lr': 0.0003849633093825583, 'samples': 9434112, 'steps': 49135, 'loss/train': 1.0792269706726074} 11/07/2021 04:09:39 - INFO - __main__ - Step 49137: {'lr': 0.00038495884235149316, 'samples': 9434304, 'steps': 49136, 'loss/train': 1.2116094827651978} 11/07/2021 04:09:40 - INFO - __main__ - Step 49138: {'lr': 0.0003849543752596176, 'samples': 9434496, 'steps': 49137, 'loss/train': 1.5266435146331787} 11/07/2021 04:09:41 - INFO - __main__ - Step 49139: {'lr': 0.00038494990810693366, 'samples': 9434688, 'steps': 49138, 'loss/train': 1.8355218172073364} 11/07/2021 04:09:41 - INFO - __main__ - Step 49140: {'lr': 0.0003849454408934434, 'samples': 9434880, 'steps': 49139, 'loss/train': 1.7328380346298218} 11/07/2021 04:09:41 - INFO - __main__ - Step 49141: {'lr': 0.0003849409736191488, 'samples': 9435072, 'steps': 49140, 'loss/train': 1.72958505153656} 11/07/2021 04:09:42 - INFO - __main__ - Step 49142: {'lr': 0.00038493650628405196, 'samples': 9435264, 'steps': 49141, 'loss/train': 1.4418506622314453} 11/07/2021 04:09:42 - INFO - __main__ - Step 49143: {'lr': 0.0003849320388881547, 'samples': 9435456, 'steps': 49142, 'loss/train': 1.5139061212539673} 11/07/2021 04:09:43 - INFO - __main__ - Step 49144: {'lr': 0.0003849275714314592, 'samples': 9435648, 'steps': 49143, 'loss/train': 1.3376448154449463} 11/07/2021 04:09:43 - INFO - __main__ - Step 49145: {'lr': 0.0003849231039139674, 'samples': 9435840, 'steps': 49144, 'loss/train': 1.3212387561798096} 11/07/2021 04:09:44 - INFO - __main__ - Step 49146: {'lr': 0.00038491863633568135, 'samples': 9436032, 'steps': 49145, 'loss/train': 1.1440930366516113} 11/07/2021 04:09:44 - INFO - __main__ - Step 49147: {'lr': 0.000384914168696603, 'samples': 9436224, 'steps': 49146, 'loss/train': 1.305210828781128} 11/07/2021 04:09:45 - INFO - __main__ - Step 49148: {'lr': 0.0003849097009967344, 'samples': 9436416, 'steps': 49147, 'loss/train': 1.2001196146011353} 11/07/2021 04:09:46 - INFO - __main__ - Step 49149: {'lr': 0.0003849052332360777, 'samples': 9436608, 'steps': 49148, 'loss/train': 1.4368815422058105} 11/07/2021 04:09:46 - INFO - __main__ - Step 49150: {'lr': 0.0003849007654146347, 'samples': 9436800, 'steps': 49149, 'loss/train': 1.491844654083252} 11/07/2021 04:09:46 - INFO - __main__ - Step 49151: {'lr': 0.0003848962975324074, 'samples': 9436992, 'steps': 49150, 'loss/train': 1.9037903547286987} 11/07/2021 04:09:47 - INFO - __main__ - Step 49152: {'lr': 0.00038489182958939804, 'samples': 9437184, 'steps': 49151, 'loss/train': 1.6102936267852783} 11/07/2021 04:09:47 - INFO - __main__ - Step 49153: {'lr': 0.00038488736158560845, 'samples': 9437376, 'steps': 49152, 'loss/train': 1.3841238021850586} 11/07/2021 04:09:47 - INFO - __main__ - Step 49154: {'lr': 0.00038488289352104065, 'samples': 9437568, 'steps': 49153, 'loss/train': 1.4052152633666992} 11/07/2021 04:09:48 - INFO - __main__ - Step 49155: {'lr': 0.0003848784253956968, 'samples': 9437760, 'steps': 49154, 'loss/train': 1.306625485420227} 11/07/2021 04:09:49 - INFO - __main__ - Step 49156: {'lr': 0.00038487395720957884, 'samples': 9437952, 'steps': 49155, 'loss/train': 1.4390147924423218} 11/07/2021 04:09:49 - INFO - __main__ - Step 49157: {'lr': 0.0003848694889626886, 'samples': 9438144, 'steps': 49156, 'loss/train': 1.962949275970459} 11/07/2021 04:09:49 - INFO - __main__ - Step 49158: {'lr': 0.0003848650206550284, 'samples': 9438336, 'steps': 49157, 'loss/train': 0.8605524897575378} 11/07/2021 04:09:50 - INFO - __main__ - Step 49159: {'lr': 0.0003848605522866, 'samples': 9438528, 'steps': 49158, 'loss/train': 0.9240766167640686} 11/07/2021 04:09:51 - INFO - __main__ - Step 49160: {'lr': 0.00038485608385740555, 'samples': 9438720, 'steps': 49159, 'loss/train': 1.4867128133773804} 11/07/2021 04:09:51 - INFO - __main__ - Step 49161: {'lr': 0.00038485161536744707, 'samples': 9438912, 'steps': 49160, 'loss/train': 1.9888595342636108} 11/07/2021 04:09:52 - INFO - __main__ - Step 49162: {'lr': 0.0003848471468167265, 'samples': 9439104, 'steps': 49161, 'loss/train': 1.83589768409729} 11/07/2021 04:09:52 - INFO - __main__ - Step 49163: {'lr': 0.00038484267820524586, 'samples': 9439296, 'steps': 49162, 'loss/train': 1.7949270009994507} 11/07/2021 04:09:52 - INFO - __main__ - Step 49164: {'lr': 0.00038483820953300724, 'samples': 9439488, 'steps': 49163, 'loss/train': 1.58258855342865} 11/07/2021 04:09:53 - INFO - __main__ - Step 49165: {'lr': 0.00038483374080001254, 'samples': 9439680, 'steps': 49164, 'loss/train': 1.699041485786438} 11/07/2021 04:09:54 - INFO - __main__ - Step 49166: {'lr': 0.00038482927200626386, 'samples': 9439872, 'steps': 49165, 'loss/train': 1.4731369018554688} 11/07/2021 04:09:54 - INFO - __main__ - Step 49167: {'lr': 0.0003848248031517633, 'samples': 9440064, 'steps': 49166, 'loss/train': 1.3327513933181763} 11/07/2021 04:09:54 - INFO - __main__ - Step 49168: {'lr': 0.00038482033423651256, 'samples': 9440256, 'steps': 49167, 'loss/train': 1.5444903373718262} 11/07/2021 04:09:55 - INFO - __main__ - Step 49169: {'lr': 0.00038481586526051406, 'samples': 9440448, 'steps': 49168, 'loss/train': 1.2590073347091675} 11/07/2021 04:09:56 - INFO - __main__ - Step 49170: {'lr': 0.0003848113962237695, 'samples': 9440640, 'steps': 49169, 'loss/train': 1.7843161821365356} 11/07/2021 04:09:56 - INFO - __main__ - Step 49171: {'lr': 0.00038480692712628104, 'samples': 9440832, 'steps': 49170, 'loss/train': 1.728171467781067} 11/07/2021 04:09:57 - INFO - __main__ - Step 49172: {'lr': 0.0003848024579680506, 'samples': 9441024, 'steps': 49171, 'loss/train': 1.5877866744995117} 11/07/2021 04:09:57 - INFO - __main__ - Step 49173: {'lr': 0.00038479798874908026, 'samples': 9441216, 'steps': 49172, 'loss/train': 1.536706566810608} 11/07/2021 04:09:57 - INFO - __main__ - Step 49174: {'lr': 0.00038479351946937206, 'samples': 9441408, 'steps': 49173, 'loss/train': 1.5378828048706055} 11/07/2021 04:09:58 - INFO - __main__ - Step 49175: {'lr': 0.000384789050128928, 'samples': 9441600, 'steps': 49174, 'loss/train': 1.402969479560852} 11/07/2021 04:09:59 - INFO - __main__ - Step 49176: {'lr': 0.0003847845807277501, 'samples': 9441792, 'steps': 49175, 'loss/train': 1.3299713134765625} 11/07/2021 04:09:59 - INFO - __main__ - Step 49177: {'lr': 0.0003847801112658403, 'samples': 9441984, 'steps': 49176, 'loss/train': 1.167944312095642} 11/07/2021 04:09:59 - INFO - __main__ - Step 49178: {'lr': 0.0003847756417432007, 'samples': 9442176, 'steps': 49177, 'loss/train': 1.6673598289489746} 11/07/2021 04:10:00 - INFO - __main__ - Step 49179: {'lr': 0.00038477117215983316, 'samples': 9442368, 'steps': 49178, 'loss/train': 1.6264417171478271} 11/07/2021 04:10:01 - INFO - __main__ - Step 49180: {'lr': 0.0003847667025157399, 'samples': 9442560, 'steps': 49179, 'loss/train': 1.0405162572860718} 11/07/2021 04:10:01 - INFO - __main__ - Step 49181: {'lr': 0.0003847622328109228, 'samples': 9442752, 'steps': 49180, 'loss/train': 1.2665191888809204} 11/07/2021 04:10:01 - INFO - __main__ - Step 49182: {'lr': 0.000384757763045384, 'samples': 9442944, 'steps': 49181, 'loss/train': 1.2967607975006104} 11/07/2021 04:10:02 - INFO - __main__ - Step 49183: {'lr': 0.0003847532932191254, 'samples': 9443136, 'steps': 49182, 'loss/train': 1.3285562992095947} 11/07/2021 04:10:02 - INFO - __main__ - Step 49184: {'lr': 0.000384748823332149, 'samples': 9443328, 'steps': 49183, 'loss/train': 1.5022859573364258} 11/07/2021 04:10:03 - INFO - __main__ - Step 49185: {'lr': 0.0003847443533844569, 'samples': 9443520, 'steps': 49184, 'loss/train': 2.1024880409240723} 11/07/2021 04:10:04 - INFO - __main__ - Step 49186: {'lr': 0.000384739883376051, 'samples': 9443712, 'steps': 49185, 'loss/train': 1.6192253828048706} 11/07/2021 04:10:04 - INFO - __main__ - Step 49187: {'lr': 0.0003847354133069335, 'samples': 9443904, 'steps': 49186, 'loss/train': 1.399524450302124} 11/07/2021 04:10:04 - INFO - __main__ - Step 49188: {'lr': 0.0003847309431771062, 'samples': 9444096, 'steps': 49187, 'loss/train': 1.550622582435608} 11/07/2021 04:10:05 - INFO - __main__ - Step 49189: {'lr': 0.00038472647298657135, 'samples': 9444288, 'steps': 49188, 'loss/train': 1.5158143043518066} 11/07/2021 04:10:05 - INFO - __main__ - Step 49190: {'lr': 0.0003847220027353308, 'samples': 9444480, 'steps': 49189, 'loss/train': 1.0012966394424438} 11/07/2021 04:10:07 - INFO - __main__ - Step 49191: {'lr': 0.0003847175324233865, 'samples': 9444672, 'steps': 49190, 'loss/train': 1.4463376998901367} 11/07/2021 04:10:07 - INFO - __main__ - Step 49192: {'lr': 0.00038471306205074054, 'samples': 9444864, 'steps': 49191, 'loss/train': 1.320358157157898} 11/07/2021 04:10:07 - INFO - __main__ - Step 49193: {'lr': 0.00038470859161739504, 'samples': 9445056, 'steps': 49192, 'loss/train': 1.2056214809417725} 11/07/2021 04:10:08 - INFO - __main__ - Step 49194: {'lr': 0.00038470412112335184, 'samples': 9445248, 'steps': 49193, 'loss/train': 1.3874770402908325} 11/07/2021 04:10:08 - INFO - __main__ - Step 49195: {'lr': 0.0003846996505686131, 'samples': 9445440, 'steps': 49194, 'loss/train': 1.157554268836975} 11/07/2021 04:10:09 - INFO - __main__ - Step 49196: {'lr': 0.00038469517995318083, 'samples': 9445632, 'steps': 49195, 'loss/train': 1.8665660619735718} 11/07/2021 04:10:09 - INFO - __main__ - Step 49197: {'lr': 0.000384690709277057, 'samples': 9445824, 'steps': 49196, 'loss/train': 1.7280787229537964} 11/07/2021 04:10:10 - INFO - __main__ - Step 49198: {'lr': 0.0003846862385402435, 'samples': 9446016, 'steps': 49197, 'loss/train': 0.7833828330039978} 11/07/2021 04:10:10 - INFO - __main__ - Step 49199: {'lr': 0.00038468176774274253, 'samples': 9446208, 'steps': 49198, 'loss/train': 1.6302024126052856} 11/07/2021 04:10:10 - INFO - __main__ - Step 49200: {'lr': 0.000384677296884556, 'samples': 9446400, 'steps': 49199, 'loss/train': 1.3116899728775024} 11/07/2021 04:10:11 - INFO - __main__ - Step 49201: {'lr': 0.000384672825965686, 'samples': 9446592, 'steps': 49200, 'loss/train': 1.413428783416748} 11/07/2021 04:10:12 - INFO - __main__ - Step 49202: {'lr': 0.0003846683549861344, 'samples': 9446784, 'steps': 49201, 'loss/train': 1.2374444007873535} 11/07/2021 04:10:12 - INFO - __main__ - Step 49203: {'lr': 0.00038466388394590344, 'samples': 9446976, 'steps': 49202, 'loss/train': 1.4004100561141968} 11/07/2021 04:10:12 - INFO - __main__ - Step 49204: {'lr': 0.00038465941284499493, 'samples': 9447168, 'steps': 49203, 'loss/train': 1.2180466651916504} 11/07/2021 04:10:13 - INFO - __main__ - Step 49205: {'lr': 0.00038465494168341105, 'samples': 9447360, 'steps': 49204, 'loss/train': 1.7740188837051392} 11/07/2021 04:10:14 - INFO - __main__ - Step 49206: {'lr': 0.00038465047046115365, 'samples': 9447552, 'steps': 49205, 'loss/train': 0.7401227951049805} 11/07/2021 04:10:14 - INFO - __main__ - Step 49207: {'lr': 0.00038464599917822483, 'samples': 9447744, 'steps': 49206, 'loss/train': 1.6348817348480225} 11/07/2021 04:10:15 - INFO - __main__ - Step 49208: {'lr': 0.00038464152783462667, 'samples': 9447936, 'steps': 49207, 'loss/train': 1.5200368165969849} 11/07/2021 04:10:15 - INFO - __main__ - Step 49209: {'lr': 0.0003846370564303611, 'samples': 9448128, 'steps': 49208, 'loss/train': 1.825721263885498} 11/07/2021 04:10:15 - INFO - __main__ - Step 49210: {'lr': 0.00038463258496543014, 'samples': 9448320, 'steps': 49209, 'loss/train': 1.2741966247558594} 11/07/2021 04:10:16 - INFO - __main__ - Step 49211: {'lr': 0.0003846281134398358, 'samples': 9448512, 'steps': 49210, 'loss/train': 1.2416861057281494} 11/07/2021 04:10:17 - INFO - __main__ - Step 49212: {'lr': 0.0003846236418535801, 'samples': 9448704, 'steps': 49211, 'loss/train': 1.73604154586792} 11/07/2021 04:10:17 - INFO - __main__ - Step 49213: {'lr': 0.00038461917020666506, 'samples': 9448896, 'steps': 49212, 'loss/train': 2.193845272064209} 11/07/2021 04:10:17 - INFO - __main__ - Step 49214: {'lr': 0.0003846146984990927, 'samples': 9449088, 'steps': 49213, 'loss/train': 1.6842647790908813} 11/07/2021 04:10:18 - INFO - __main__ - Step 49215: {'lr': 0.00038461022673086506, 'samples': 9449280, 'steps': 49214, 'loss/train': 1.3088513612747192} 11/07/2021 04:10:19 - INFO - __main__ - Step 49216: {'lr': 0.0003846057549019841, 'samples': 9449472, 'steps': 49215, 'loss/train': 1.2411233186721802} 11/07/2021 04:10:19 - INFO - __main__ - Step 49217: {'lr': 0.0003846012830124519, 'samples': 9449664, 'steps': 49216, 'loss/train': 1.495794653892517} 11/07/2021 04:10:19 - INFO - __main__ - Step 49218: {'lr': 0.0003845968110622704, 'samples': 9449856, 'steps': 49217, 'loss/train': 1.6284512281417847} 11/07/2021 04:10:20 - INFO - __main__ - Step 49219: {'lr': 0.0003845923390514417, 'samples': 9450048, 'steps': 49218, 'loss/train': 1.5216821432113647} 11/07/2021 04:10:20 - INFO - __main__ - Step 49220: {'lr': 0.0003845878669799677, 'samples': 9450240, 'steps': 49219, 'loss/train': 1.9224973917007446} 11/07/2021 04:10:21 - INFO - __main__ - Step 49221: {'lr': 0.00038458339484785057, 'samples': 9450432, 'steps': 49220, 'loss/train': 1.6108232736587524} 11/07/2021 04:10:22 - INFO - __main__ - Step 49222: {'lr': 0.00038457892265509214, 'samples': 9450624, 'steps': 49221, 'loss/train': 1.7062171697616577} 11/07/2021 04:10:22 - INFO - __main__ - Step 49223: {'lr': 0.00038457445040169467, 'samples': 9450816, 'steps': 49222, 'loss/train': 0.9867490530014038} 11/07/2021 04:10:22 - INFO - __main__ - Step 49224: {'lr': 0.00038456997808765993, 'samples': 9451008, 'steps': 49223, 'loss/train': 1.5299254655838013} 11/07/2021 04:10:23 - INFO - __main__ - Step 49225: {'lr': 0.00038456550571299, 'samples': 9451200, 'steps': 49224, 'loss/train': 1.4590548276901245} 11/07/2021 04:10:23 - INFO - __main__ - Step 49226: {'lr': 0.000384561033277687, 'samples': 9451392, 'steps': 49225, 'loss/train': 0.8083837628364563} 11/07/2021 04:10:24 - INFO - __main__ - Step 49227: {'lr': 0.00038455656078175283, 'samples': 9451584, 'steps': 49226, 'loss/train': 1.7535966634750366} 11/07/2021 04:10:25 - INFO - __main__ - Step 49228: {'lr': 0.0003845520882251895, 'samples': 9451776, 'steps': 49227, 'loss/train': 1.3637335300445557} 11/07/2021 04:10:25 - INFO - __main__ - Step 49229: {'lr': 0.00038454761560799915, 'samples': 9451968, 'steps': 49228, 'loss/train': 1.4770692586898804} 11/07/2021 04:10:25 - INFO - __main__ - Step 49230: {'lr': 0.0003845431429301838, 'samples': 9452160, 'steps': 49229, 'loss/train': 2.0705811977386475} 11/07/2021 04:10:26 - INFO - __main__ - Step 49231: {'lr': 0.0003845386701917453, 'samples': 9452352, 'steps': 49230, 'loss/train': 1.1304152011871338} 11/07/2021 04:10:27 - INFO - __main__ - Step 49232: {'lr': 0.0003845341973926857, 'samples': 9452544, 'steps': 49231, 'loss/train': 1.4169456958770752} 11/07/2021 04:10:27 - INFO - __main__ - Step 49233: {'lr': 0.0003845297245330071, 'samples': 9452736, 'steps': 49232, 'loss/train': 1.4708280563354492} 11/07/2021 04:10:27 - INFO - __main__ - Step 49234: {'lr': 0.0003845252516127115, 'samples': 9452928, 'steps': 49233, 'loss/train': 1.5530120134353638} 11/07/2021 04:10:28 - INFO - __main__ - Step 49235: {'lr': 0.0003845207786318009, 'samples': 9453120, 'steps': 49234, 'loss/train': 2.7697346210479736} 11/07/2021 04:10:28 - INFO - __main__ - Step 49236: {'lr': 0.0003845163055902773, 'samples': 9453312, 'steps': 49235, 'loss/train': 1.5223881006240845} 11/07/2021 04:10:28 - INFO - __main__ - Step 49237: {'lr': 0.0003845118324881428, 'samples': 9453504, 'steps': 49236, 'loss/train': 1.5188398361206055} 11/07/2021 04:10:29 - INFO - __main__ - Step 49238: {'lr': 0.00038450735932539927, 'samples': 9453696, 'steps': 49237, 'loss/train': 1.7778433561325073} 11/07/2021 04:10:30 - INFO - __main__ - Step 49239: {'lr': 0.0003845028861020488, 'samples': 9453888, 'steps': 49238, 'loss/train': 1.739320993423462} 11/07/2021 04:10:30 - INFO - __main__ - Step 49240: {'lr': 0.0003844984128180934, 'samples': 9454080, 'steps': 49239, 'loss/train': 1.4633797407150269} 11/07/2021 04:10:30 - INFO - __main__ - Step 49241: {'lr': 0.00038449393947353507, 'samples': 9454272, 'steps': 49240, 'loss/train': 1.2488583326339722} 11/07/2021 04:10:31 - INFO - __main__ - Step 49242: {'lr': 0.00038448946606837585, 'samples': 9454464, 'steps': 49241, 'loss/train': 1.33491051197052} 11/07/2021 04:10:32 - INFO - __main__ - Step 49243: {'lr': 0.00038448499260261787, 'samples': 9454656, 'steps': 49242, 'loss/train': 3.0733556747436523} 11/07/2021 04:10:32 - INFO - __main__ - Step 49244: {'lr': 0.0003844805190762629, 'samples': 9454848, 'steps': 49243, 'loss/train': 1.5128308534622192} 11/07/2021 04:10:32 - INFO - __main__ - Step 49245: {'lr': 0.00038447604548931313, 'samples': 9455040, 'steps': 49244, 'loss/train': 1.3653062582015991} 11/07/2021 04:10:33 - INFO - __main__ - Step 49246: {'lr': 0.0003844715718417705, 'samples': 9455232, 'steps': 49245, 'loss/train': 1.8834713697433472} 11/07/2021 04:10:33 - INFO - __main__ - Step 49247: {'lr': 0.0003844670981336371, 'samples': 9455424, 'steps': 49246, 'loss/train': 1.5269482135772705} 11/07/2021 04:10:34 - INFO - __main__ - Step 49248: {'lr': 0.000384462624364915, 'samples': 9455616, 'steps': 49247, 'loss/train': 1.0469377040863037} 11/07/2021 04:10:34 - INFO - __main__ - Step 49249: {'lr': 0.00038445815053560596, 'samples': 9455808, 'steps': 49248, 'loss/train': 1.388178825378418} 11/07/2021 04:10:35 - INFO - __main__ - Step 49250: {'lr': 0.00038445367664571216, 'samples': 9456000, 'steps': 49249, 'loss/train': 1.3149610757827759} 11/07/2021 04:10:35 - INFO - __main__ - Step 49251: {'lr': 0.00038444920269523563, 'samples': 9456192, 'steps': 49250, 'loss/train': 1.2185102701187134} 11/07/2021 04:10:36 - INFO - __main__ - Step 49252: {'lr': 0.0003844447286841783, 'samples': 9456384, 'steps': 49251, 'loss/train': 1.2820465564727783} 11/07/2021 04:10:37 - INFO - __main__ - Step 49253: {'lr': 0.0003844402546125424, 'samples': 9456576, 'steps': 49252, 'loss/train': 1.7159557342529297} 11/07/2021 04:10:37 - INFO - __main__ - Step 49254: {'lr': 0.00038443578048032975, 'samples': 9456768, 'steps': 49253, 'loss/train': 1.5664453506469727} 11/07/2021 04:10:37 - INFO - __main__ - Step 49255: {'lr': 0.0003844313062875423, 'samples': 9456960, 'steps': 49254, 'loss/train': 0.8009182214736938} 11/07/2021 04:10:38 - INFO - __main__ - Step 49256: {'lr': 0.00038442683203418227, 'samples': 9457152, 'steps': 49255, 'loss/train': 1.176375389099121} 11/07/2021 04:10:38 - INFO - __main__ - Step 49257: {'lr': 0.0003844223577202516, 'samples': 9457344, 'steps': 49256, 'loss/train': 0.899752140045166} 11/07/2021 04:10:39 - INFO - __main__ - Step 49258: {'lr': 0.00038441788334575225, 'samples': 9457536, 'steps': 49257, 'loss/train': 1.9049264192581177} 11/07/2021 04:10:39 - INFO - __main__ - Step 49259: {'lr': 0.0003844134089106863, 'samples': 9457728, 'steps': 49258, 'loss/train': 1.2303478717803955} 11/07/2021 04:10:40 - INFO - __main__ - Step 49260: {'lr': 0.00038440893441505573, 'samples': 9457920, 'steps': 49259, 'loss/train': 1.3818678855895996} 11/07/2021 04:10:40 - INFO - __main__ - Step 49261: {'lr': 0.0003844044598588625, 'samples': 9458112, 'steps': 49260, 'loss/train': 1.93372642993927} 11/07/2021 04:10:40 - INFO - __main__ - Step 49262: {'lr': 0.0003843999852421088, 'samples': 9458304, 'steps': 49261, 'loss/train': 1.7044050693511963} 11/07/2021 04:10:41 - INFO - __main__ - Step 49263: {'lr': 0.0003843955105647965, 'samples': 9458496, 'steps': 49262, 'loss/train': 1.1875362396240234} 11/07/2021 04:10:42 - INFO - __main__ - Step 49264: {'lr': 0.0003843910358269277, 'samples': 9458688, 'steps': 49263, 'loss/train': 1.5284796953201294} 11/07/2021 04:10:42 - INFO - __main__ - Step 49265: {'lr': 0.0003843865610285043, 'samples': 9458880, 'steps': 49264, 'loss/train': 1.6451259851455688} 11/07/2021 04:10:42 - INFO - __main__ - Step 49266: {'lr': 0.0003843820861695284, 'samples': 9459072, 'steps': 49265, 'loss/train': 1.1807126998901367} 11/07/2021 04:10:43 - INFO - __main__ - Step 49267: {'lr': 0.00038437761125000204, 'samples': 9459264, 'steps': 49266, 'loss/train': 2.00026273727417} 11/07/2021 04:10:44 - INFO - __main__ - Step 49268: {'lr': 0.00038437313626992723, 'samples': 9459456, 'steps': 49267, 'loss/train': 0.694304347038269} 11/07/2021 04:10:44 - INFO - __main__ - Step 49269: {'lr': 0.0003843686612293059, 'samples': 9459648, 'steps': 49268, 'loss/train': 1.9365851879119873} 11/07/2021 04:10:45 - INFO - __main__ - Step 49270: {'lr': 0.0003843641861281402, 'samples': 9459840, 'steps': 49269, 'loss/train': 1.5159986019134521} 11/07/2021 04:10:45 - INFO - __main__ - Step 49271: {'lr': 0.00038435971096643196, 'samples': 9460032, 'steps': 49270, 'loss/train': 1.4350037574768066} 11/07/2021 04:10:45 - INFO - __main__ - Step 49272: {'lr': 0.00038435523574418336, 'samples': 9460224, 'steps': 49271, 'loss/train': 1.6132951974868774} 11/07/2021 04:10:46 - INFO - __main__ - Step 49273: {'lr': 0.0003843507604613964, 'samples': 9460416, 'steps': 49272, 'loss/train': 1.7995132207870483} 11/07/2021 04:10:47 - INFO - __main__ - Step 49274: {'lr': 0.00038434628511807296, 'samples': 9460608, 'steps': 49273, 'loss/train': 1.0257656574249268} 11/07/2021 04:10:47 - INFO - __main__ - Step 49275: {'lr': 0.00038434180971421523, 'samples': 9460800, 'steps': 49274, 'loss/train': 1.4039250612258911} 11/07/2021 04:10:47 - INFO - __main__ - Step 49276: {'lr': 0.0003843373342498251, 'samples': 9460992, 'steps': 49275, 'loss/train': 1.385812520980835} 11/07/2021 04:10:48 - INFO - __main__ - Step 49277: {'lr': 0.00038433285872490475, 'samples': 9461184, 'steps': 49276, 'loss/train': 1.634580135345459} 11/07/2021 04:10:48 - INFO - __main__ - Step 49278: {'lr': 0.000384328383139456, 'samples': 9461376, 'steps': 49277, 'loss/train': 1.0532727241516113} 11/07/2021 04:10:49 - INFO - __main__ - Step 49279: {'lr': 0.000384323907493481, 'samples': 9461568, 'steps': 49278, 'loss/train': 1.9170868396759033} 11/07/2021 04:10:49 - INFO - __main__ - Step 49280: {'lr': 0.0003843194317869817, 'samples': 9461760, 'steps': 49279, 'loss/train': 1.1617804765701294} 11/07/2021 04:10:50 - INFO - __main__ - Step 49281: {'lr': 0.0003843149560199601, 'samples': 9461952, 'steps': 49280, 'loss/train': 1.3052281141281128} 11/07/2021 04:10:50 - INFO - __main__ - Step 49282: {'lr': 0.0003843104801924183, 'samples': 9462144, 'steps': 49281, 'loss/train': 1.4169832468032837} 11/07/2021 04:10:51 - INFO - __main__ - Step 49283: {'lr': 0.00038430600430435825, 'samples': 9462336, 'steps': 49282, 'loss/train': 1.314243197441101} 11/07/2021 04:10:52 - INFO - __main__ - Step 49284: {'lr': 0.000384301528355782, 'samples': 9462528, 'steps': 49283, 'loss/train': 1.324676275253296} 11/07/2021 04:10:52 - INFO - __main__ - Step 49285: {'lr': 0.00038429705234669157, 'samples': 9462720, 'steps': 49284, 'loss/train': 1.4951428174972534} 11/07/2021 04:10:52 - INFO - __main__ - Step 49286: {'lr': 0.00038429257627708893, 'samples': 9462912, 'steps': 49285, 'loss/train': 1.447383999824524} 11/07/2021 04:10:53 - INFO - __main__ - Step 49287: {'lr': 0.00038428810014697615, 'samples': 9463104, 'steps': 49286, 'loss/train': 1.6952389478683472} 11/07/2021 04:10:53 - INFO - __main__ - Step 49288: {'lr': 0.00038428362395635514, 'samples': 9463296, 'steps': 49287, 'loss/train': 1.4429686069488525} 11/07/2021 04:10:54 - INFO - __main__ - Step 49289: {'lr': 0.0003842791477052281, 'samples': 9463488, 'steps': 49288, 'loss/train': 1.15886390209198} 11/07/2021 04:10:54 - INFO - __main__ - Step 49290: {'lr': 0.00038427467139359696, 'samples': 9463680, 'steps': 49289, 'loss/train': 1.347989559173584} 11/07/2021 04:10:55 - INFO - __main__ - Step 49291: {'lr': 0.00038427019502146364, 'samples': 9463872, 'steps': 49290, 'loss/train': 1.9227449893951416} 11/07/2021 04:10:55 - INFO - __main__ - Step 49292: {'lr': 0.0003842657185888303, 'samples': 9464064, 'steps': 49291, 'loss/train': 0.09805437177419662} 11/07/2021 04:10:56 - INFO - __main__ - Step 49293: {'lr': 0.00038426124209569885, 'samples': 9464256, 'steps': 49292, 'loss/train': 1.6097315549850464} 11/07/2021 04:10:57 - INFO - __main__ - Step 49294: {'lr': 0.00038425676554207133, 'samples': 9464448, 'steps': 49293, 'loss/train': 1.3904743194580078} 11/07/2021 04:10:57 - INFO - __main__ - Step 49295: {'lr': 0.0003842522889279499, 'samples': 9464640, 'steps': 49294, 'loss/train': 1.4921461343765259} 11/07/2021 04:10:57 - INFO - __main__ - Step 49296: {'lr': 0.00038424781225333636, 'samples': 9464832, 'steps': 49295, 'loss/train': 1.443150281906128} 11/07/2021 04:10:58 - INFO - __main__ - Step 49297: {'lr': 0.0003842433355182329, 'samples': 9465024, 'steps': 49296, 'loss/train': 1.2435979843139648} 11/07/2021 04:10:58 - INFO - __main__ - Step 49298: {'lr': 0.0003842388587226414, 'samples': 9465216, 'steps': 49297, 'loss/train': 1.4616177082061768} 11/07/2021 04:10:59 - INFO - __main__ - Step 49299: {'lr': 0.000384234381866564, 'samples': 9465408, 'steps': 49298, 'loss/train': 1.6488696336746216} 11/07/2021 04:10:59 - INFO - __main__ - Step 49300: {'lr': 0.00038422990495000267, 'samples': 9465600, 'steps': 49299, 'loss/train': 1.6438243389129639} 11/07/2021 04:11:00 - INFO - __main__ - Step 49301: {'lr': 0.00038422542797295935, 'samples': 9465792, 'steps': 49300, 'loss/train': 1.4698615074157715} 11/07/2021 04:11:00 - INFO - __main__ - Step 49302: {'lr': 0.0003842209509354362, 'samples': 9465984, 'steps': 49301, 'loss/train': 1.4135884046554565} 11/07/2021 04:11:00 - INFO - __main__ - Step 49303: {'lr': 0.00038421647383743505, 'samples': 9466176, 'steps': 49302, 'loss/train': 1.5925277471542358} 11/07/2021 04:11:01 - INFO - __main__ - Step 49304: {'lr': 0.00038421199667895814, 'samples': 9466368, 'steps': 49303, 'loss/train': 1.4623899459838867} 11/07/2021 04:11:02 - INFO - __main__ - Step 49305: {'lr': 0.0003842075194600073, 'samples': 9466560, 'steps': 49304, 'loss/train': 1.2562298774719238} 11/07/2021 04:11:02 - INFO - __main__ - Step 49306: {'lr': 0.00038420304218058466, 'samples': 9466752, 'steps': 49305, 'loss/train': 1.2742486000061035} 11/07/2021 04:11:03 - INFO - __main__ - Step 49307: {'lr': 0.00038419856484069216, 'samples': 9466944, 'steps': 49306, 'loss/train': 1.4497098922729492} 11/07/2021 04:11:03 - INFO - __main__ - Step 49308: {'lr': 0.0003841940874403319, 'samples': 9467136, 'steps': 49307, 'loss/train': 1.211452603340149} 11/07/2021 04:11:03 - INFO - __main__ - Step 49309: {'lr': 0.0003841896099795058, 'samples': 9467328, 'steps': 49308, 'loss/train': 1.1877186298370361} 11/07/2021 04:11:04 - INFO - __main__ - Step 49310: {'lr': 0.00038418513245821605, 'samples': 9467520, 'steps': 49309, 'loss/train': 1.3652452230453491} 11/07/2021 04:11:05 - INFO - __main__ - Step 49311: {'lr': 0.0003841806548764645, 'samples': 9467712, 'steps': 49310, 'loss/train': 1.6228737831115723} 11/07/2021 04:11:05 - INFO - __main__ - Step 49312: {'lr': 0.0003841761772342531, 'samples': 9467904, 'steps': 49311, 'loss/train': 2.3533477783203125} 11/07/2021 04:11:05 - INFO - __main__ - Step 49313: {'lr': 0.0003841716995315841, 'samples': 9468096, 'steps': 49312, 'loss/train': 1.301640272140503} 11/07/2021 04:11:06 - INFO - __main__ - Step 49314: {'lr': 0.00038416722176845943, 'samples': 9468288, 'steps': 49313, 'loss/train': 1.537453055381775} 11/07/2021 04:11:07 - INFO - __main__ - Step 49315: {'lr': 0.000384162743944881, 'samples': 9468480, 'steps': 49314, 'loss/train': 1.6197184324264526} 11/07/2021 04:11:07 - INFO - __main__ - Step 49316: {'lr': 0.0003841582660608509, 'samples': 9468672, 'steps': 49315, 'loss/train': 1.285697102546692} 11/07/2021 04:11:07 - INFO - __main__ - Step 49317: {'lr': 0.00038415378811637124, 'samples': 9468864, 'steps': 49316, 'loss/train': 1.6587913036346436} 11/07/2021 04:11:08 - INFO - __main__ - Step 49318: {'lr': 0.00038414931011144393, 'samples': 9469056, 'steps': 49317, 'loss/train': 1.7063889503479004} 11/07/2021 04:11:08 - INFO - __main__ - Step 49319: {'lr': 0.000384144832046071, 'samples': 9469248, 'steps': 49318, 'loss/train': 1.5879881381988525} 11/07/2021 04:11:10 - INFO - __main__ - Step 49320: {'lr': 0.0003841403539202545, 'samples': 9469440, 'steps': 49319, 'loss/train': 2.182030439376831} 11/07/2021 04:11:10 - INFO - __main__ - Step 49321: {'lr': 0.00038413587573399635, 'samples': 9469632, 'steps': 49320, 'loss/train': 1.5305895805358887} 11/07/2021 04:11:10 - INFO - __main__ - Step 49322: {'lr': 0.0003841313974872986, 'samples': 9469824, 'steps': 49321, 'loss/train': 1.6020350456237793} 11/07/2021 04:11:11 - INFO - __main__ - Step 49323: {'lr': 0.00038412691918016345, 'samples': 9470016, 'steps': 49322, 'loss/train': 0.9629158973693848} 11/07/2021 04:11:11 - INFO - __main__ - Step 49324: {'lr': 0.00038412244081259273, 'samples': 9470208, 'steps': 49323, 'loss/train': 1.748894214630127} 11/07/2021 04:11:11 - INFO - __main__ - Step 49325: {'lr': 0.00038411796238458853, 'samples': 9470400, 'steps': 49324, 'loss/train': 2.147814989089966} 11/07/2021 04:11:12 - INFO - __main__ - Step 49326: {'lr': 0.00038411348389615286, 'samples': 9470592, 'steps': 49325, 'loss/train': 2.2232298851013184} 11/07/2021 04:11:13 - INFO - __main__ - Step 49327: {'lr': 0.00038410900534728765, 'samples': 9470784, 'steps': 49326, 'loss/train': 1.5854507684707642} 11/07/2021 04:11:13 - INFO - __main__ - Step 49328: {'lr': 0.000384104526737995, 'samples': 9470976, 'steps': 49327, 'loss/train': 1.6244455575942993} 11/07/2021 04:11:13 - INFO - __main__ - Step 49329: {'lr': 0.0003841000480682769, 'samples': 9471168, 'steps': 49328, 'loss/train': 1.4646588563919067} 11/07/2021 04:11:14 - INFO - __main__ - Step 49330: {'lr': 0.0003840955693381355, 'samples': 9471360, 'steps': 49329, 'loss/train': 1.893558382987976} 11/07/2021 04:11:15 - INFO - __main__ - Step 49331: {'lr': 0.0003840910905475726, 'samples': 9471552, 'steps': 49330, 'loss/train': 1.3998960256576538} 11/07/2021 04:11:15 - INFO - __main__ - Step 49332: {'lr': 0.0003840866116965904, 'samples': 9471744, 'steps': 49331, 'loss/train': 1.8675849437713623} 11/07/2021 04:11:15 - INFO - __main__ - Step 49333: {'lr': 0.00038408213278519083, 'samples': 9471936, 'steps': 49332, 'loss/train': 1.5504200458526611} 11/07/2021 04:11:16 - INFO - __main__ - Step 49334: {'lr': 0.0003840776538133759, 'samples': 9472128, 'steps': 49333, 'loss/train': 1.301131248474121} 11/07/2021 04:11:16 - INFO - __main__ - Step 49335: {'lr': 0.00038407317478114764, 'samples': 9472320, 'steps': 49334, 'loss/train': 1.5315876007080078} 11/07/2021 04:11:17 - INFO - __main__ - Step 49336: {'lr': 0.00038406869568850805, 'samples': 9472512, 'steps': 49335, 'loss/train': 0.8331318497657776} 11/07/2021 04:11:18 - INFO - __main__ - Step 49337: {'lr': 0.00038406421653545926, 'samples': 9472704, 'steps': 49336, 'loss/train': 1.8083441257476807} 11/07/2021 04:11:18 - INFO - __main__ - Step 49338: {'lr': 0.00038405973732200317, 'samples': 9472896, 'steps': 49337, 'loss/train': 1.7773988246917725} 11/07/2021 04:11:18 - INFO - __main__ - Step 49339: {'lr': 0.0003840552580481418, 'samples': 9473088, 'steps': 49338, 'loss/train': 1.477247714996338} 11/07/2021 04:11:19 - INFO - __main__ - Step 49340: {'lr': 0.00038405077871387716, 'samples': 9473280, 'steps': 49339, 'loss/train': 1.6360654830932617} 11/07/2021 04:11:19 - INFO - __main__ - Step 49341: {'lr': 0.00038404629931921137, 'samples': 9473472, 'steps': 49340, 'loss/train': 5.7670135498046875} 11/07/2021 04:11:20 - INFO - __main__ - Step 49342: {'lr': 0.0003840418198641463, 'samples': 9473664, 'steps': 49341, 'loss/train': 1.1676645278930664} 11/07/2021 04:11:20 - INFO - __main__ - Step 49343: {'lr': 0.0003840373403486842, 'samples': 9473856, 'steps': 49342, 'loss/train': 1.524045467376709} 11/07/2021 04:11:21 - INFO - __main__ - Step 49344: {'lr': 0.0003840328607728269, 'samples': 9474048, 'steps': 49343, 'loss/train': 1.4302656650543213} 11/07/2021 04:11:21 - INFO - __main__ - Step 49345: {'lr': 0.0003840283811365764, 'samples': 9474240, 'steps': 49344, 'loss/train': 1.4958720207214355} 11/07/2021 04:11:21 - INFO - __main__ - Step 49346: {'lr': 0.00038402390143993484, 'samples': 9474432, 'steps': 49345, 'loss/train': 1.3384087085723877} 11/07/2021 04:11:23 - INFO - __main__ - Step 49347: {'lr': 0.0003840194216829042, 'samples': 9474624, 'steps': 49346, 'loss/train': 1.161323070526123} 11/07/2021 04:11:23 - INFO - __main__ - Step 49348: {'lr': 0.00038401494186548633, 'samples': 9474816, 'steps': 49347, 'loss/train': 1.4390065670013428} 11/07/2021 04:11:23 - INFO - __main__ - Step 49349: {'lr': 0.0003840104619876835, 'samples': 9475008, 'steps': 49348, 'loss/train': 1.1753920316696167} 11/07/2021 04:11:24 - INFO - __main__ - Step 49350: {'lr': 0.0003840059820494976, 'samples': 9475200, 'steps': 49349, 'loss/train': 1.6433159112930298} 11/07/2021 04:11:24 - INFO - __main__ - Step 49351: {'lr': 0.00038400150205093075, 'samples': 9475392, 'steps': 49350, 'loss/train': 1.245421290397644} 11/07/2021 04:11:24 - INFO - __main__ - Step 49352: {'lr': 0.00038399702199198486, 'samples': 9475584, 'steps': 49351, 'loss/train': 0.9821666479110718} 11/07/2021 04:11:25 - INFO - __main__ - Step 49353: {'lr': 0.00038399254187266186, 'samples': 9475776, 'steps': 49352, 'loss/train': 1.5022464990615845} 11/07/2021 04:11:26 - INFO - __main__ - Step 49354: {'lr': 0.000383988061692964, 'samples': 9475968, 'steps': 49353, 'loss/train': 1.3901777267456055} 11/07/2021 04:11:26 - INFO - __main__ - Step 49355: {'lr': 0.0003839835814528931, 'samples': 9476160, 'steps': 49354, 'loss/train': 1.644857406616211} 11/07/2021 04:11:26 - INFO - __main__ - Step 49356: {'lr': 0.0003839791011524514, 'samples': 9476352, 'steps': 49355, 'loss/train': 1.5316932201385498} 11/07/2021 04:11:27 - INFO - __main__ - Step 49357: {'lr': 0.0003839746207916407, 'samples': 9476544, 'steps': 49356, 'loss/train': 1.4775909185409546} 11/07/2021 04:11:28 - INFO - __main__ - Step 49358: {'lr': 0.0003839701403704631, 'samples': 9476736, 'steps': 49357, 'loss/train': 1.4452733993530273} 11/07/2021 04:11:28 - INFO - __main__ - Step 49359: {'lr': 0.00038396565988892063, 'samples': 9476928, 'steps': 49358, 'loss/train': 0.7642771601676941} 11/07/2021 04:11:28 - INFO - __main__ - Step 49360: {'lr': 0.00038396117934701537, 'samples': 9477120, 'steps': 49359, 'loss/train': 1.2526742219924927} 11/07/2021 04:11:29 - INFO - __main__ - Step 49361: {'lr': 0.00038395669874474915, 'samples': 9477312, 'steps': 49360, 'loss/train': 1.5693467855453491} 11/07/2021 04:11:29 - INFO - __main__ - Step 49362: {'lr': 0.00038395221808212415, 'samples': 9477504, 'steps': 49361, 'loss/train': 1.4363118410110474} 11/07/2021 04:11:30 - INFO - __main__ - Step 49363: {'lr': 0.0003839477373591423, 'samples': 9477696, 'steps': 49362, 'loss/train': 1.6547024250030518} 11/07/2021 04:11:31 - INFO - __main__ - Step 49364: {'lr': 0.0003839432565758059, 'samples': 9477888, 'steps': 49363, 'loss/train': 1.3071367740631104} 11/07/2021 04:11:31 - INFO - __main__ - Step 49365: {'lr': 0.0003839387757321165, 'samples': 9478080, 'steps': 49364, 'loss/train': 1.572064995765686} 11/07/2021 04:11:31 - INFO - __main__ - Step 49366: {'lr': 0.0003839342948280764, 'samples': 9478272, 'steps': 49365, 'loss/train': 1.5092639923095703} 11/07/2021 04:11:32 - INFO - __main__ - Step 49367: {'lr': 0.00038392981386368763, 'samples': 9478464, 'steps': 49366, 'loss/train': 1.7656980752944946} 11/07/2021 04:11:33 - INFO - __main__ - Step 49368: {'lr': 0.0003839253328389521, 'samples': 9478656, 'steps': 49367, 'loss/train': 1.7381960153579712} 11/07/2021 04:11:33 - INFO - __main__ - Step 49369: {'lr': 0.00038392085175387186, 'samples': 9478848, 'steps': 49368, 'loss/train': 1.2247587442398071} 11/07/2021 04:11:33 - INFO - __main__ - Step 49370: {'lr': 0.000383916370608449, 'samples': 9479040, 'steps': 49369, 'loss/train': 1.2737410068511963} 11/07/2021 04:11:34 - INFO - __main__ - Step 49371: {'lr': 0.0003839118894026855, 'samples': 9479232, 'steps': 49370, 'loss/train': 1.4327012300491333} 11/07/2021 04:11:34 - INFO - __main__ - Step 49372: {'lr': 0.0003839074081365833, 'samples': 9479424, 'steps': 49371, 'loss/train': 1.0670971870422363} 11/07/2021 04:11:35 - INFO - __main__ - Step 49373: {'lr': 0.0003839029268101446, 'samples': 9479616, 'steps': 49372, 'loss/train': 1.4543306827545166} 11/07/2021 04:11:36 - INFO - __main__ - Step 49374: {'lr': 0.00038389844542337123, 'samples': 9479808, 'steps': 49373, 'loss/train': 1.070849895477295} 11/07/2021 04:11:36 - INFO - __main__ - Step 49375: {'lr': 0.0003838939639762653, 'samples': 9480000, 'steps': 49374, 'loss/train': 1.360690951347351} 11/07/2021 04:11:36 - INFO - __main__ - Step 49376: {'lr': 0.00038388948246882883, 'samples': 9480192, 'steps': 49375, 'loss/train': 1.9821124076843262} 11/07/2021 04:11:37 - INFO - __main__ - Step 49377: {'lr': 0.0003838850009010638, 'samples': 9480384, 'steps': 49376, 'loss/train': 1.5155435800552368} 11/07/2021 04:11:38 - INFO - __main__ - Step 49378: {'lr': 0.0003838805192729723, 'samples': 9480576, 'steps': 49377, 'loss/train': 1.7489532232284546} 11/07/2021 04:11:38 - INFO - __main__ - Step 49379: {'lr': 0.00038387603758455624, 'samples': 9480768, 'steps': 49378, 'loss/train': 1.4840288162231445} 11/07/2021 04:11:38 - INFO - __main__ - Step 49380: {'lr': 0.00038387155583581773, 'samples': 9480960, 'steps': 49379, 'loss/train': 1.4947314262390137} 11/07/2021 04:11:39 - INFO - __main__ - Step 49381: {'lr': 0.00038386707402675877, 'samples': 9481152, 'steps': 49380, 'loss/train': 1.3950653076171875} 11/07/2021 04:11:39 - INFO - __main__ - Step 49382: {'lr': 0.00038386259215738135, 'samples': 9481344, 'steps': 49381, 'loss/train': 1.6808557510375977} 11/07/2021 04:11:40 - INFO - __main__ - Step 49383: {'lr': 0.0003838581102276876, 'samples': 9481536, 'steps': 49382, 'loss/train': 1.1713740825653076} 11/07/2021 04:11:40 - INFO - __main__ - Step 49384: {'lr': 0.00038385362823767935, 'samples': 9481728, 'steps': 49383, 'loss/train': 1.228548526763916} 11/07/2021 04:11:41 - INFO - __main__ - Step 49385: {'lr': 0.00038384914618735873, 'samples': 9481920, 'steps': 49384, 'loss/train': 1.4046412706375122} 11/07/2021 04:11:41 - INFO - __main__ - Step 49386: {'lr': 0.0003838446640767278, 'samples': 9482112, 'steps': 49385, 'loss/train': 1.7546875476837158} 11/07/2021 04:11:41 - INFO - __main__ - Step 49387: {'lr': 0.00038384018190578843, 'samples': 9482304, 'steps': 49386, 'loss/train': 1.4690765142440796} 11/07/2021 04:11:42 - INFO - __main__ - Step 49388: {'lr': 0.0003838356996745429, 'samples': 9482496, 'steps': 49387, 'loss/train': 0.9885172843933105} 11/07/2021 04:11:43 - INFO - __main__ - Step 49389: {'lr': 0.00038383121738299296, 'samples': 9482688, 'steps': 49388, 'loss/train': 1.657820463180542} 11/07/2021 04:11:43 - INFO - __main__ - Step 49390: {'lr': 0.00038382673503114075, 'samples': 9482880, 'steps': 49389, 'loss/train': 1.3290120363235474} 11/07/2021 04:11:44 - INFO - __main__ - Step 49391: {'lr': 0.0003838222526189883, 'samples': 9483072, 'steps': 49390, 'loss/train': 1.2781789302825928} 11/07/2021 04:11:44 - INFO - __main__ - Step 49392: {'lr': 0.0003838177701465376, 'samples': 9483264, 'steps': 49391, 'loss/train': 1.478083848953247} 11/07/2021 04:11:44 - INFO - __main__ - Step 49393: {'lr': 0.00038381328761379063, 'samples': 9483456, 'steps': 49392, 'loss/train': 2.3239057064056396} 11/07/2021 04:11:46 - INFO - __main__ - Step 49394: {'lr': 0.0003838088050207496, 'samples': 9483648, 'steps': 49393, 'loss/train': 1.419954776763916} 11/07/2021 04:11:46 - INFO - __main__ - Step 49395: {'lr': 0.00038380432236741625, 'samples': 9483840, 'steps': 49394, 'loss/train': 1.784666657447815} 11/07/2021 04:11:46 - INFO - __main__ - Step 49396: {'lr': 0.0003837998396537927, 'samples': 9484032, 'steps': 49395, 'loss/train': 0.4355214238166809} 11/07/2021 04:11:47 - INFO - __main__ - Step 49397: {'lr': 0.0003837953568798811, 'samples': 9484224, 'steps': 49396, 'loss/train': 1.6687462329864502} 11/07/2021 04:11:47 - INFO - __main__ - Step 49398: {'lr': 0.00038379087404568333, 'samples': 9484416, 'steps': 49397, 'loss/train': 1.2138727903366089} 11/07/2021 04:11:48 - INFO - __main__ - Step 49399: {'lr': 0.00038378639115120154, 'samples': 9484608, 'steps': 49398, 'loss/train': 1.3672698736190796} 11/07/2021 04:11:48 - INFO - __main__ - Step 49400: {'lr': 0.0003837819081964377, 'samples': 9484800, 'steps': 49399, 'loss/train': 1.1022813320159912} 11/07/2021 04:11:49 - INFO - __main__ - Step 49401: {'lr': 0.0003837774251813936, 'samples': 9484992, 'steps': 49400, 'loss/train': 1.3086178302764893} 11/07/2021 04:11:49 - INFO - __main__ - Step 49402: {'lr': 0.0003837729421060716, 'samples': 9485184, 'steps': 49401, 'loss/train': 1.7074296474456787} 11/07/2021 04:11:49 - INFO - __main__ - Step 49403: {'lr': 0.00038376845897047354, 'samples': 9485376, 'steps': 49402, 'loss/train': 0.9208493828773499} 11/07/2021 04:11:50 - INFO - __main__ - Step 49404: {'lr': 0.00038376397577460144, 'samples': 9485568, 'steps': 49403, 'loss/train': 1.593536376953125} 11/07/2021 04:11:51 - INFO - __main__ - Step 49405: {'lr': 0.00038375949251845745, 'samples': 9485760, 'steps': 49404, 'loss/train': 1.5746198892593384} 11/07/2021 04:11:51 - INFO - __main__ - Step 49406: {'lr': 0.0003837550092020434, 'samples': 9485952, 'steps': 49405, 'loss/train': 1.6538019180297852} 11/07/2021 04:11:52 - INFO - __main__ - Step 49407: {'lr': 0.0003837505258253615, 'samples': 9486144, 'steps': 49406, 'loss/train': 1.4615103006362915} 11/07/2021 04:11:52 - INFO - __main__ - Step 49408: {'lr': 0.0003837460423884136, 'samples': 9486336, 'steps': 49407, 'loss/train': 1.2624928951263428} 11/07/2021 04:11:53 - INFO - __main__ - Step 49409: {'lr': 0.00038374155889120176, 'samples': 9486528, 'steps': 49408, 'loss/train': 1.5209566354751587} 11/07/2021 04:11:53 - INFO - __main__ - Step 49410: {'lr': 0.0003837370753337281, 'samples': 9486720, 'steps': 49409, 'loss/train': 1.5793596506118774} 11/07/2021 04:11:54 - INFO - __main__ - Step 49411: {'lr': 0.00038373259171599463, 'samples': 9486912, 'steps': 49410, 'loss/train': 1.3892780542373657} 11/07/2021 04:11:54 - INFO - __main__ - Step 49412: {'lr': 0.0003837281080380033, 'samples': 9487104, 'steps': 49411, 'loss/train': 1.6055463552474976} 11/07/2021 04:11:54 - INFO - __main__ - Step 49413: {'lr': 0.00038372362429975603, 'samples': 9487296, 'steps': 49412, 'loss/train': 1.5732831954956055} 11/07/2021 04:11:55 - INFO - __main__ - Step 49414: {'lr': 0.0003837191405012551, 'samples': 9487488, 'steps': 49413, 'loss/train': 1.428322434425354} 11/07/2021 04:11:56 - INFO - __main__ - Step 49415: {'lr': 0.00038371465664250226, 'samples': 9487680, 'steps': 49414, 'loss/train': 1.847044825553894} 11/07/2021 04:11:56 - INFO - __main__ - Step 49416: {'lr': 0.0003837101727234997, 'samples': 9487872, 'steps': 49415, 'loss/train': 1.5304168462753296} 11/07/2021 04:11:56 - INFO - __main__ - Step 49417: {'lr': 0.0003837056887442495, 'samples': 9488064, 'steps': 49416, 'loss/train': 1.2969977855682373} 11/07/2021 04:11:57 - INFO - __main__ - Step 49418: {'lr': 0.00038370120470475355, 'samples': 9488256, 'steps': 49417, 'loss/train': 1.2422821521759033} 11/07/2021 04:11:58 - INFO - __main__ - Step 49419: {'lr': 0.0003836967206050138, 'samples': 9488448, 'steps': 49418, 'loss/train': 1.557572841644287} 11/07/2021 04:11:58 - INFO - __main__ - Step 49420: {'lr': 0.0003836922364450325, 'samples': 9488640, 'steps': 49419, 'loss/train': 1.4618909358978271} 11/07/2021 04:11:58 - INFO - __main__ - Step 49421: {'lr': 0.0003836877522248114, 'samples': 9488832, 'steps': 49420, 'loss/train': 3.2358062267303467} 11/07/2021 04:11:59 - INFO - __main__ - Step 49422: {'lr': 0.0003836832679443527, 'samples': 9489024, 'steps': 49421, 'loss/train': 1.930889368057251} 11/07/2021 04:11:59 - INFO - __main__ - Step 49423: {'lr': 0.00038367878360365845, 'samples': 9489216, 'steps': 49422, 'loss/train': 1.6684209108352661} 11/07/2021 04:12:00 - INFO - __main__ - Step 49424: {'lr': 0.00038367429920273054, 'samples': 9489408, 'steps': 49423, 'loss/train': 1.5331238508224487} 11/07/2021 04:12:01 - INFO - __main__ - Step 49425: {'lr': 0.00038366981474157114, 'samples': 9489600, 'steps': 49424, 'loss/train': 1.2940645217895508} 11/07/2021 04:12:01 - INFO - __main__ - Step 49426: {'lr': 0.00038366533022018214, 'samples': 9489792, 'steps': 49425, 'loss/train': 1.228728175163269} 11/07/2021 04:12:01 - INFO - __main__ - Step 49427: {'lr': 0.0003836608456385655, 'samples': 9489984, 'steps': 49426, 'loss/train': 2.0489604473114014} 11/07/2021 04:12:02 - INFO - __main__ - Step 49428: {'lr': 0.00038365636099672347, 'samples': 9490176, 'steps': 49427, 'loss/train': 0.9330426454544067} 11/07/2021 04:12:02 - INFO - __main__ - Step 49429: {'lr': 0.0003836518762946579, 'samples': 9490368, 'steps': 49428, 'loss/train': 1.604009747505188} 11/07/2021 04:12:03 - INFO - __main__ - Step 49430: {'lr': 0.0003836473915323709, 'samples': 9490560, 'steps': 49429, 'loss/train': 1.3320634365081787} 11/07/2021 04:12:03 - INFO - __main__ - Step 49431: {'lr': 0.0003836429067098645, 'samples': 9490752, 'steps': 49430, 'loss/train': 2.4549779891967773} 11/07/2021 04:12:04 - INFO - __main__ - Step 49432: {'lr': 0.0003836384218271405, 'samples': 9490944, 'steps': 49431, 'loss/train': 1.0012286901474} 11/07/2021 04:12:04 - INFO - __main__ - Step 49433: {'lr': 0.00038363393688420116, 'samples': 9491136, 'steps': 49432, 'loss/train': 1.3346449136734009} 11/07/2021 04:12:05 - INFO - __main__ - Step 49434: {'lr': 0.0003836294518810485, 'samples': 9491328, 'steps': 49433, 'loss/train': 1.3577319383621216} 11/07/2021 04:12:05 - INFO - __main__ - Step 49435: {'lr': 0.00038362496681768434, 'samples': 9491520, 'steps': 49434, 'loss/train': 1.2545278072357178} 11/07/2021 04:12:06 - INFO - __main__ - Step 49436: {'lr': 0.0003836204816941109, 'samples': 9491712, 'steps': 49435, 'loss/train': 1.6246488094329834} 11/07/2021 04:12:06 - INFO - __main__ - Step 49437: {'lr': 0.0003836159965103301, 'samples': 9491904, 'steps': 49436, 'loss/train': 1.687782883644104} 11/07/2021 04:12:07 - INFO - __main__ - Step 49438: {'lr': 0.0003836115112663441, 'samples': 9492096, 'steps': 49437, 'loss/train': 1.801659345626831} 11/07/2021 04:12:07 - INFO - __main__ - Step 49439: {'lr': 0.0003836070259621548, 'samples': 9492288, 'steps': 49438, 'loss/train': 1.387070655822754} 11/07/2021 04:12:08 - INFO - __main__ - Step 49440: {'lr': 0.0003836025405977641, 'samples': 9492480, 'steps': 49439, 'loss/train': 1.3248119354248047} 11/07/2021 04:12:08 - INFO - __main__ - Step 49441: {'lr': 0.00038359805517317427, 'samples': 9492672, 'steps': 49440, 'loss/train': 1.3795644044876099} 11/07/2021 04:12:09 - INFO - __main__ - Step 49442: {'lr': 0.00038359356968838723, 'samples': 9492864, 'steps': 49441, 'loss/train': 1.2761085033416748} 11/07/2021 04:12:09 - INFO - __main__ - Step 49443: {'lr': 0.00038358908414340485, 'samples': 9493056, 'steps': 49442, 'loss/train': 1.6001322269439697} 11/07/2021 04:12:09 - INFO - __main__ - Step 49444: {'lr': 0.0003835845985382294, 'samples': 9493248, 'steps': 49443, 'loss/train': 1.4786072969436646} 11/07/2021 04:12:10 - INFO - __main__ - Step 49445: {'lr': 0.00038358011287286287, 'samples': 9493440, 'steps': 49444, 'loss/train': 0.9966360330581665} 11/07/2021 04:12:11 - INFO - __main__ - Step 49446: {'lr': 0.0003835756271473071, 'samples': 9493632, 'steps': 49445, 'loss/train': 1.651698112487793} 11/07/2021 04:12:11 - INFO - __main__ - Step 49447: {'lr': 0.0003835711413615642, 'samples': 9493824, 'steps': 49446, 'loss/train': 1.3871666193008423} 11/07/2021 04:12:11 - INFO - __main__ - Step 49448: {'lr': 0.0003835666555156362, 'samples': 9494016, 'steps': 49447, 'loss/train': 1.2935926914215088} 11/07/2021 04:12:12 - INFO - __main__ - Step 49449: {'lr': 0.00038356216960952515, 'samples': 9494208, 'steps': 49448, 'loss/train': 1.6006450653076172} 11/07/2021 04:12:13 - INFO - __main__ - Step 49450: {'lr': 0.0003835576836432331, 'samples': 9494400, 'steps': 49449, 'loss/train': 0.46942609548568726} 11/07/2021 04:12:13 - INFO - __main__ - Step 49451: {'lr': 0.000383553197616762, 'samples': 9494592, 'steps': 49450, 'loss/train': 1.724548101425171} 11/07/2021 04:12:14 - INFO - __main__ - Step 49452: {'lr': 0.00038354871153011385, 'samples': 9494784, 'steps': 49451, 'loss/train': 1.2632355690002441} 11/07/2021 04:12:14 - INFO - __main__ - Step 49453: {'lr': 0.0003835442253832907, 'samples': 9494976, 'steps': 49452, 'loss/train': 1.3236713409423828} 11/07/2021 04:12:14 - INFO - __main__ - Step 49454: {'lr': 0.00038353973917629457, 'samples': 9495168, 'steps': 49453, 'loss/train': 1.5322283506393433} 11/07/2021 04:12:15 - INFO - __main__ - Step 49455: {'lr': 0.0003835352529091275, 'samples': 9495360, 'steps': 49454, 'loss/train': 1.0614477396011353} 11/07/2021 04:12:16 - INFO - __main__ - Step 49456: {'lr': 0.0003835307665817915, 'samples': 9495552, 'steps': 49455, 'loss/train': 1.7836710214614868} 11/07/2021 04:12:16 - INFO - __main__ - Step 49457: {'lr': 0.0003835262801942887, 'samples': 9495744, 'steps': 49456, 'loss/train': 0.15671800076961517} 11/07/2021 04:12:17 - INFO - __main__ - Step 49458: {'lr': 0.000383521793746621, 'samples': 9495936, 'steps': 49457, 'loss/train': 1.6529656648635864} 11/07/2021 04:12:17 - INFO - __main__ - Step 49459: {'lr': 0.00038351730723879034, 'samples': 9496128, 'steps': 49458, 'loss/train': 1.468443751335144} 11/07/2021 04:12:17 - INFO - __main__ - Step 49460: {'lr': 0.0003835128206707989, 'samples': 9496320, 'steps': 49459, 'loss/train': 1.6167665719985962} 11/07/2021 04:12:18 - INFO - __main__ - Step 49461: {'lr': 0.00038350833404264865, 'samples': 9496512, 'steps': 49460, 'loss/train': 1.1963554620742798} 11/07/2021 04:12:19 - INFO - __main__ - Step 49462: {'lr': 0.0003835038473543416, 'samples': 9496704, 'steps': 49461, 'loss/train': 1.8096206188201904} 11/07/2021 04:12:19 - INFO - __main__ - Step 49463: {'lr': 0.0003834993606058798, 'samples': 9496896, 'steps': 49462, 'loss/train': 1.7199034690856934} 11/07/2021 04:12:19 - INFO - __main__ - Step 49464: {'lr': 0.00038349487379726513, 'samples': 9497088, 'steps': 49463, 'loss/train': 1.3855326175689697} 11/07/2021 04:12:20 - INFO - __main__ - Step 49465: {'lr': 0.0003834903869284999, 'samples': 9497280, 'steps': 49464, 'loss/train': 1.2433205842971802} 11/07/2021 04:12:21 - INFO - __main__ - Step 49466: {'lr': 0.00038348589999958585, 'samples': 9497472, 'steps': 49465, 'loss/train': 1.5063279867172241} 11/07/2021 04:12:21 - INFO - __main__ - Step 49467: {'lr': 0.00038348141301052505, 'samples': 9497664, 'steps': 49466, 'loss/train': 1.8365730047225952} 11/07/2021 04:12:21 - INFO - __main__ - Step 49468: {'lr': 0.00038347692596131977, 'samples': 9497856, 'steps': 49467, 'loss/train': 1.5028265714645386} 11/07/2021 04:12:22 - INFO - __main__ - Step 49469: {'lr': 0.0003834724388519717, 'samples': 9498048, 'steps': 49468, 'loss/train': 1.3604669570922852} 11/07/2021 04:12:22 - INFO - __main__ - Step 49470: {'lr': 0.00038346795168248306, 'samples': 9498240, 'steps': 49469, 'loss/train': 1.525028944015503} 11/07/2021 04:12:23 - INFO - __main__ - Step 49471: {'lr': 0.00038346346445285585, 'samples': 9498432, 'steps': 49470, 'loss/train': 1.1660608053207397} 11/07/2021 04:12:24 - INFO - __main__ - Step 49472: {'lr': 0.0003834589771630921, 'samples': 9498624, 'steps': 49471, 'loss/train': 1.2024121284484863} 11/07/2021 04:12:24 - INFO - __main__ - Step 49473: {'lr': 0.0003834544898131936, 'samples': 9498816, 'steps': 49472, 'loss/train': 1.4175405502319336} 11/07/2021 04:12:25 - INFO - __main__ - Step 49474: {'lr': 0.00038345000240316276, 'samples': 9499008, 'steps': 49473, 'loss/train': 1.046689748764038} 11/07/2021 04:12:25 - INFO - __main__ - Step 49475: {'lr': 0.00038344551493300135, 'samples': 9499200, 'steps': 49474, 'loss/train': 1.3881487846374512} 11/07/2021 04:12:26 - INFO - __main__ - Step 49476: {'lr': 0.00038344102740271144, 'samples': 9499392, 'steps': 49475, 'loss/train': 1.2882369756698608} 11/07/2021 04:12:26 - INFO - __main__ - Step 49477: {'lr': 0.00038343653981229504, 'samples': 9499584, 'steps': 49476, 'loss/train': 1.7036164999008179} 11/07/2021 04:12:27 - INFO - __main__ - Step 49478: {'lr': 0.00038343205216175426, 'samples': 9499776, 'steps': 49477, 'loss/train': 0.9137195348739624} 11/07/2021 04:12:27 - INFO - __main__ - Step 49479: {'lr': 0.000383427564451091, 'samples': 9499968, 'steps': 49478, 'loss/train': 1.1925657987594604} 11/07/2021 04:12:27 - INFO - __main__ - Step 49480: {'lr': 0.00038342307668030737, 'samples': 9500160, 'steps': 49479, 'loss/train': 1.835159182548523} 11/07/2021 04:12:28 - INFO - __main__ - Step 49481: {'lr': 0.0003834185888494053, 'samples': 9500352, 'steps': 49480, 'loss/train': 1.017730951309204} 11/07/2021 04:12:29 - INFO - __main__ - Step 49482: {'lr': 0.00038341410095838694, 'samples': 9500544, 'steps': 49481, 'loss/train': 1.4684760570526123} 11/07/2021 04:12:29 - INFO - __main__ - Step 49483: {'lr': 0.0003834096130072542, 'samples': 9500736, 'steps': 49482, 'loss/train': 1.460621953010559} 11/07/2021 04:12:29 - INFO - __main__ - Step 49484: {'lr': 0.00038340512499600917, 'samples': 9500928, 'steps': 49483, 'loss/train': 0.677061915397644} 11/07/2021 04:12:30 - INFO - __main__ - Step 49485: {'lr': 0.00038340063692465386, 'samples': 9501120, 'steps': 49484, 'loss/train': 0.8293577432632446} 11/07/2021 04:12:30 - INFO - __main__ - Step 49486: {'lr': 0.00038339614879319027, 'samples': 9501312, 'steps': 49485, 'loss/train': 1.5253651142120361} 11/07/2021 04:12:31 - INFO - __main__ - Step 49487: {'lr': 0.00038339166060162046, 'samples': 9501504, 'steps': 49486, 'loss/train': 1.2394787073135376} 11/07/2021 04:12:31 - INFO - __main__ - Step 49488: {'lr': 0.00038338717234994633, 'samples': 9501696, 'steps': 49487, 'loss/train': 0.6876512765884399} 11/07/2021 04:12:32 - INFO - __main__ - Step 49489: {'lr': 0.0003833826840381701, 'samples': 9501888, 'steps': 49488, 'loss/train': 1.709553599357605} 11/07/2021 04:12:32 - INFO - __main__ - Step 49490: {'lr': 0.00038337819566629363, 'samples': 9502080, 'steps': 49489, 'loss/train': 2.0343925952911377} 11/07/2021 04:12:33 - INFO - __main__ - Step 49491: {'lr': 0.000383373707234319, 'samples': 9502272, 'steps': 49490, 'loss/train': 1.097631573677063} 11/07/2021 04:12:34 - INFO - __main__ - Step 49492: {'lr': 0.0003833692187422483, 'samples': 9502464, 'steps': 49491, 'loss/train': 1.8399280309677124} 11/07/2021 04:12:34 - INFO - __main__ - Step 49493: {'lr': 0.0003833647301900835, 'samples': 9502656, 'steps': 49492, 'loss/train': 1.6749378442764282} 11/07/2021 04:12:34 - INFO - __main__ - Step 49494: {'lr': 0.00038336024157782655, 'samples': 9502848, 'steps': 49493, 'loss/train': 1.6002864837646484} 11/07/2021 04:12:35 - INFO - __main__ - Step 49495: {'lr': 0.00038335575290547954, 'samples': 9503040, 'steps': 49494, 'loss/train': 1.5806015729904175} 11/07/2021 04:12:35 - INFO - __main__ - Step 49496: {'lr': 0.0003833512641730445, 'samples': 9503232, 'steps': 49495, 'loss/train': 0.9982802867889404} 11/07/2021 04:12:36 - INFO - __main__ - Step 49497: {'lr': 0.0003833467753805234, 'samples': 9503424, 'steps': 49496, 'loss/train': 1.2644473314285278} 11/07/2021 04:12:36 - INFO - __main__ - Step 49498: {'lr': 0.00038334228652791837, 'samples': 9503616, 'steps': 49497, 'loss/train': 1.4780821800231934} 11/07/2021 04:12:37 - INFO - __main__ - Step 49499: {'lr': 0.00038333779761523133, 'samples': 9503808, 'steps': 49498, 'loss/train': 1.47004234790802} 11/07/2021 04:12:37 - INFO - __main__ - Step 49500: {'lr': 0.0003833333086424643, 'samples': 9504000, 'steps': 49499, 'loss/train': 1.6942075490951538} 11/07/2021 04:12:38 - INFO - __main__ - Step 49501: {'lr': 0.00038332881960961943, 'samples': 9504192, 'steps': 49500, 'loss/train': 1.52151620388031} 11/07/2021 04:12:38 - INFO - __main__ - Step 49502: {'lr': 0.0003833243305166986, 'samples': 9504384, 'steps': 49501, 'loss/train': 1.5208748579025269} 11/07/2021 04:12:39 - INFO - __main__ - Step 49503: {'lr': 0.00038331984136370377, 'samples': 9504576, 'steps': 49502, 'loss/train': 1.6068562269210815} 11/07/2021 04:12:39 - INFO - __main__ - Step 49504: {'lr': 0.0003833153521506372, 'samples': 9504768, 'steps': 49503, 'loss/train': 1.0254100561141968} 11/07/2021 04:12:40 - INFO - __main__ - Step 49505: {'lr': 0.00038331086287750083, 'samples': 9504960, 'steps': 49504, 'loss/train': 1.1298511028289795} 11/07/2021 04:12:40 - INFO - __main__ - Step 49506: {'lr': 0.0003833063735442966, 'samples': 9505152, 'steps': 49505, 'loss/train': 1.3094515800476074} 11/07/2021 04:12:41 - INFO - __main__ - Step 49507: {'lr': 0.0003833018841510265, 'samples': 9505344, 'steps': 49506, 'loss/train': 1.68741774559021} 11/07/2021 04:12:41 - INFO - __main__ - Step 49508: {'lr': 0.00038329739469769277, 'samples': 9505536, 'steps': 49507, 'loss/train': 1.2099201679229736} 11/07/2021 04:12:42 - INFO - __main__ - Step 49509: {'lr': 0.0003832929051842972, 'samples': 9505728, 'steps': 49508, 'loss/train': 1.4359480142593384} 11/07/2021 04:12:42 - INFO - __main__ - Step 49510: {'lr': 0.0003832884156108418, 'samples': 9505920, 'steps': 49509, 'loss/train': 1.4834827184677124} 11/07/2021 04:12:42 - INFO - __main__ - Step 49511: {'lr': 0.0003832839259773289, 'samples': 9506112, 'steps': 49510, 'loss/train': 1.4688080549240112} 11/07/2021 04:12:43 - INFO - __main__ - Step 49512: {'lr': 0.00038327943628376025, 'samples': 9506304, 'steps': 49511, 'loss/train': 1.4338685274124146} 11/07/2021 04:12:44 - INFO - __main__ - Step 49513: {'lr': 0.00038327494653013787, 'samples': 9506496, 'steps': 49512, 'loss/train': 1.4766474962234497} 11/07/2021 04:12:44 - INFO - __main__ - Step 49514: {'lr': 0.00038327045671646386, 'samples': 9506688, 'steps': 49513, 'loss/train': 1.6842774152755737} 11/07/2021 04:12:45 - INFO - __main__ - Step 49515: {'lr': 0.00038326596684274035, 'samples': 9506880, 'steps': 49514, 'loss/train': 2.441283702850342} 11/07/2021 04:12:45 - INFO - __main__ - Step 49516: {'lr': 0.00038326147690896916, 'samples': 9507072, 'steps': 49515, 'loss/train': 1.7583577632904053} 11/07/2021 04:12:45 - INFO - __main__ - Step 49517: {'lr': 0.00038325698691515247, 'samples': 9507264, 'steps': 49516, 'loss/train': 1.6572788953781128} 11/07/2021 04:12:46 - INFO - __main__ - Step 49518: {'lr': 0.00038325249686129223, 'samples': 9507456, 'steps': 49517, 'loss/train': 1.929858684539795} 11/07/2021 04:12:47 - INFO - __main__ - Step 49519: {'lr': 0.0003832480067473904, 'samples': 9507648, 'steps': 49518, 'loss/train': 1.293485164642334} 11/07/2021 04:12:47 - INFO - __main__ - Step 49520: {'lr': 0.0003832435165734491, 'samples': 9507840, 'steps': 49519, 'loss/train': 1.440374493598938} 11/07/2021 04:12:47 - INFO - __main__ - Step 49521: {'lr': 0.0003832390263394704, 'samples': 9508032, 'steps': 49520, 'loss/train': 1.3797657489776611} 11/07/2021 04:12:48 - INFO - __main__ - Step 49522: {'lr': 0.0003832345360454561, 'samples': 9508224, 'steps': 49521, 'loss/train': 1.4090845584869385} 11/07/2021 04:12:49 - INFO - __main__ - Step 49523: {'lr': 0.00038323004569140853, 'samples': 9508416, 'steps': 49522, 'loss/train': 1.6031702756881714} 11/07/2021 04:12:49 - INFO - __main__ - Step 49524: {'lr': 0.0003832255552773295, 'samples': 9508608, 'steps': 49523, 'loss/train': 0.7252176403999329} 11/07/2021 04:12:50 - INFO - __main__ - Step 49525: {'lr': 0.00038322106480322105, 'samples': 9508800, 'steps': 49524, 'loss/train': 1.2934327125549316} 11/07/2021 04:12:50 - INFO - __main__ - Step 49526: {'lr': 0.00038321657426908527, 'samples': 9508992, 'steps': 49525, 'loss/train': 1.1309657096862793} 11/07/2021 04:12:50 - INFO - __main__ - Step 49527: {'lr': 0.0003832120836749242, 'samples': 9509184, 'steps': 49526, 'loss/train': 0.998445987701416} 11/07/2021 04:12:51 - INFO - __main__ - Step 49528: {'lr': 0.0003832075930207398, 'samples': 9509376, 'steps': 49527, 'loss/train': 1.4018816947937012} 11/07/2021 04:12:52 - INFO - __main__ - Step 49529: {'lr': 0.0003832031023065341, 'samples': 9509568, 'steps': 49528, 'loss/train': 1.630361557006836} 11/07/2021 04:12:52 - INFO - __main__ - Step 49530: {'lr': 0.0003831986115323092, 'samples': 9509760, 'steps': 49529, 'loss/train': 0.6628460884094238} 11/07/2021 04:12:52 - INFO - __main__ - Step 49531: {'lr': 0.00038319412069806694, 'samples': 9509952, 'steps': 49530, 'loss/train': 1.198029637336731} 11/07/2021 04:12:53 - INFO - __main__ - Step 49532: {'lr': 0.00038318962980380956, 'samples': 9510144, 'steps': 49531, 'loss/train': 1.4488250017166138} 11/07/2021 04:12:53 - INFO - __main__ - Step 49533: {'lr': 0.0003831851388495389, 'samples': 9510336, 'steps': 49532, 'loss/train': 1.459463119506836} 11/07/2021 04:12:54 - INFO - __main__ - Step 49534: {'lr': 0.0003831806478352572, 'samples': 9510528, 'steps': 49533, 'loss/train': 1.526487112045288} 11/07/2021 04:12:55 - INFO - __main__ - Step 49535: {'lr': 0.00038317615676096623, 'samples': 9510720, 'steps': 49534, 'loss/train': 1.530272126197815} 11/07/2021 04:12:55 - INFO - __main__ - Step 49536: {'lr': 0.00038317166562666817, 'samples': 9510912, 'steps': 49535, 'loss/train': 1.554976224899292} 11/07/2021 04:12:55 - INFO - __main__ - Step 49537: {'lr': 0.00038316717443236505, 'samples': 9511104, 'steps': 49536, 'loss/train': 1.4371247291564941} 11/07/2021 04:12:56 - INFO - __main__ - Step 49538: {'lr': 0.0003831626831780588, 'samples': 9511296, 'steps': 49537, 'loss/train': 1.523795485496521} 11/07/2021 04:12:57 - INFO - __main__ - Step 49539: {'lr': 0.0003831581918637516, 'samples': 9511488, 'steps': 49538, 'loss/train': 1.5631895065307617} 11/07/2021 04:12:57 - INFO - __main__ - Step 49540: {'lr': 0.0003831537004894453, 'samples': 9511680, 'steps': 49539, 'loss/train': 1.807240605354309} 11/07/2021 04:12:57 - INFO - __main__ - Step 49541: {'lr': 0.000383149209055142, 'samples': 9511872, 'steps': 49540, 'loss/train': 1.528426170349121} 11/07/2021 04:12:58 - INFO - __main__ - Step 49542: {'lr': 0.00038314471756084373, 'samples': 9512064, 'steps': 49541, 'loss/train': 1.4981565475463867} 11/07/2021 04:12:58 - INFO - __main__ - Step 49543: {'lr': 0.0003831402260065525, 'samples': 9512256, 'steps': 49542, 'loss/train': 1.3794786930084229} 11/07/2021 04:12:59 - INFO - __main__ - Step 49544: {'lr': 0.00038313573439227035, 'samples': 9512448, 'steps': 49543, 'loss/train': 1.2986475229263306} 11/07/2021 04:13:00 - INFO - __main__ - Step 49545: {'lr': 0.0003831312427179993, 'samples': 9512640, 'steps': 49544, 'loss/train': 1.5060278177261353} 11/07/2021 04:13:00 - INFO - __main__ - Step 49546: {'lr': 0.00038312675098374136, 'samples': 9512832, 'steps': 49545, 'loss/train': 1.2425042390823364} 11/07/2021 04:13:00 - INFO - __main__ - Step 49547: {'lr': 0.0003831222591894985, 'samples': 9513024, 'steps': 49546, 'loss/train': 1.5853462219238281} 11/07/2021 04:13:01 - INFO - __main__ - Step 49548: {'lr': 0.0003831177673352729, 'samples': 9513216, 'steps': 49547, 'loss/train': 1.7374829053878784} 11/07/2021 04:13:01 - INFO - __main__ - Step 49549: {'lr': 0.00038311327542106646, 'samples': 9513408, 'steps': 49548, 'loss/train': 1.6388118267059326} 11/07/2021 04:13:03 - INFO - __main__ - Step 49550: {'lr': 0.00038310878344688116, 'samples': 9513600, 'steps': 49549, 'loss/train': 0.2743910551071167} 11/07/2021 04:13:03 - INFO - __main__ - Step 49551: {'lr': 0.0003831042914127192, 'samples': 9513792, 'steps': 49550, 'loss/train': 1.3564828634262085} 11/07/2021 04:13:03 - INFO - __main__ - Step 49552: {'lr': 0.00038309979931858243, 'samples': 9513984, 'steps': 49551, 'loss/train': 2.0853230953216553} 11/07/2021 04:13:04 - INFO - __main__ - Step 49553: {'lr': 0.00038309530716447297, 'samples': 9514176, 'steps': 49552, 'loss/train': 1.543075680732727} 11/07/2021 04:13:04 - INFO - __main__ - Step 49554: {'lr': 0.00038309081495039275, 'samples': 9514368, 'steps': 49553, 'loss/train': 0.3885495364665985} 11/07/2021 04:13:05 - INFO - __main__ - Step 49555: {'lr': 0.00038308632267634396, 'samples': 9514560, 'steps': 49554, 'loss/train': 0.8817199468612671} 11/07/2021 04:13:05 - INFO - __main__ - Step 49556: {'lr': 0.00038308183034232844, 'samples': 9514752, 'steps': 49555, 'loss/train': 1.533594012260437} 11/07/2021 04:13:06 - INFO - __main__ - Step 49557: {'lr': 0.0003830773379483484, 'samples': 9514944, 'steps': 49556, 'loss/train': 1.54350745677948} 11/07/2021 04:13:06 - INFO - __main__ - Step 49558: {'lr': 0.0003830728454944057, 'samples': 9515136, 'steps': 49557, 'loss/train': 1.152811050415039} 11/07/2021 04:13:06 - INFO - __main__ - Step 49559: {'lr': 0.00038306835298050255, 'samples': 9515328, 'steps': 49558, 'loss/train': 1.62739896774292} 11/07/2021 04:13:07 - INFO - __main__ - Step 49560: {'lr': 0.0003830638604066407, 'samples': 9515520, 'steps': 49559, 'loss/train': 1.3035004138946533} 11/07/2021 04:13:08 - INFO - __main__ - Step 49561: {'lr': 0.00038305936777282233, 'samples': 9515712, 'steps': 49560, 'loss/train': 1.7126566171646118} 11/07/2021 04:13:08 - INFO - __main__ - Step 49562: {'lr': 0.00038305487507904956, 'samples': 9515904, 'steps': 49561, 'loss/train': 1.2861318588256836} 11/07/2021 04:13:08 - INFO - __main__ - Step 49563: {'lr': 0.0003830503823253243, 'samples': 9516096, 'steps': 49562, 'loss/train': 1.5957990884780884} 11/07/2021 04:13:09 - INFO - __main__ - Step 49564: {'lr': 0.0003830458895116485, 'samples': 9516288, 'steps': 49563, 'loss/train': 1.307597041130066} 11/07/2021 04:13:09 - INFO - __main__ - Step 49565: {'lr': 0.0003830413966380243, 'samples': 9516480, 'steps': 49564, 'loss/train': 1.5037707090377808} 11/07/2021 04:13:10 - INFO - __main__ - Step 49566: {'lr': 0.00038303690370445384, 'samples': 9516672, 'steps': 49565, 'loss/train': 0.9930906295776367} 11/07/2021 04:13:10 - INFO - __main__ - Step 49567: {'lr': 0.00038303241071093884, 'samples': 9516864, 'steps': 49566, 'loss/train': 1.5315903425216675} 11/07/2021 04:13:11 - INFO - __main__ - Step 49568: {'lr': 0.00038302791765748156, 'samples': 9517056, 'steps': 49567, 'loss/train': 1.6297444105148315} 11/07/2021 04:13:11 - INFO - __main__ - Step 49569: {'lr': 0.0003830234245440839, 'samples': 9517248, 'steps': 49568, 'loss/train': 1.4744263887405396} 11/07/2021 04:13:12 - INFO - __main__ - Step 49570: {'lr': 0.000383018931370748, 'samples': 9517440, 'steps': 49569, 'loss/train': 1.4099575281143188} 11/07/2021 04:13:13 - INFO - __main__ - Step 49571: {'lr': 0.00038301443813747583, 'samples': 9517632, 'steps': 49570, 'loss/train': 1.4269685745239258} 11/07/2021 04:13:13 - INFO - __main__ - Step 49572: {'lr': 0.00038300994484426936, 'samples': 9517824, 'steps': 49571, 'loss/train': 1.3018014430999756} 11/07/2021 04:13:13 - INFO - __main__ - Step 49573: {'lr': 0.0003830054514911307, 'samples': 9518016, 'steps': 49572, 'loss/train': 1.1602410078048706} 11/07/2021 04:13:14 - INFO - __main__ - Step 49574: {'lr': 0.0003830009580780618, 'samples': 9518208, 'steps': 49573, 'loss/train': 1.6300928592681885} 11/07/2021 04:13:14 - INFO - __main__ - Step 49575: {'lr': 0.00038299646460506474, 'samples': 9518400, 'steps': 49574, 'loss/train': 1.4427337646484375} 11/07/2021 04:13:15 - INFO - __main__ - Step 49576: {'lr': 0.0003829919710721415, 'samples': 9518592, 'steps': 49575, 'loss/train': 1.8379251956939697} 11/07/2021 04:13:16 - INFO - __main__ - Step 49577: {'lr': 0.0003829874774792941, 'samples': 9518784, 'steps': 49576, 'loss/train': 1.6046229600906372} 11/07/2021 04:13:16 - INFO - __main__ - Step 49578: {'lr': 0.00038298298382652467, 'samples': 9518976, 'steps': 49577, 'loss/train': 1.7580783367156982} 11/07/2021 04:13:16 - INFO - __main__ - Step 49579: {'lr': 0.00038297849011383517, 'samples': 9519168, 'steps': 49578, 'loss/train': 1.5407004356384277} 11/07/2021 04:13:17 - INFO - __main__ - Step 49580: {'lr': 0.0003829739963412276, 'samples': 9519360, 'steps': 49579, 'loss/train': 0.37775328755378723} 11/07/2021 04:13:18 - INFO - __main__ - Step 49581: {'lr': 0.000382969502508704, 'samples': 9519552, 'steps': 49580, 'loss/train': 1.6808826923370361} 11/07/2021 04:13:18 - INFO - __main__ - Step 49582: {'lr': 0.0003829650086162663, 'samples': 9519744, 'steps': 49581, 'loss/train': 1.637237310409546} 11/07/2021 04:13:18 - INFO - __main__ - Step 49583: {'lr': 0.0003829605146639167, 'samples': 9519936, 'steps': 49582, 'loss/train': 1.4391934871673584} 11/07/2021 04:13:19 - INFO - __main__ - Step 49584: {'lr': 0.00038295602065165714, 'samples': 9520128, 'steps': 49583, 'loss/train': 1.8345682621002197} 11/07/2021 04:13:19 - INFO - __main__ - Step 49585: {'lr': 0.0003829515265794896, 'samples': 9520320, 'steps': 49584, 'loss/train': 1.8743866682052612} 11/07/2021 04:13:20 - INFO - __main__ - Step 49586: {'lr': 0.00038294703244741625, 'samples': 9520512, 'steps': 49585, 'loss/train': 1.6653952598571777} 11/07/2021 04:13:20 - INFO - __main__ - Step 49587: {'lr': 0.000382942538255439, 'samples': 9520704, 'steps': 49586, 'loss/train': 1.7852160930633545} 11/07/2021 04:13:21 - INFO - __main__ - Step 49588: {'lr': 0.0003829380440035598, 'samples': 9520896, 'steps': 49587, 'loss/train': 1.1733639240264893} 11/07/2021 04:13:21 - INFO - __main__ - Step 49589: {'lr': 0.0003829335496917808, 'samples': 9521088, 'steps': 49588, 'loss/train': 0.9976392388343811} 11/07/2021 04:13:21 - INFO - __main__ - Step 49590: {'lr': 0.000382929055320104, 'samples': 9521280, 'steps': 49589, 'loss/train': 1.4055172204971313} 11/07/2021 04:13:22 - INFO - __main__ - Step 49591: {'lr': 0.0003829245608885315, 'samples': 9521472, 'steps': 49590, 'loss/train': 1.155929446220398} 11/07/2021 04:13:23 - INFO - __main__ - Step 49592: {'lr': 0.0003829200663970652, 'samples': 9521664, 'steps': 49591, 'loss/train': 2.5761666297912598} 11/07/2021 04:13:23 - INFO - __main__ - Step 49593: {'lr': 0.00038291557184570713, 'samples': 9521856, 'steps': 49592, 'loss/train': 1.8351387977600098} 11/07/2021 04:13:23 - INFO - __main__ - Step 49594: {'lr': 0.0003829110772344594, 'samples': 9522048, 'steps': 49593, 'loss/train': 1.4460923671722412} 11/07/2021 04:13:24 - INFO - __main__ - Step 49595: {'lr': 0.000382906582563324, 'samples': 9522240, 'steps': 49594, 'loss/train': 1.391068935394287} 11/07/2021 04:13:25 - INFO - __main__ - Step 49596: {'lr': 0.00038290208783230286, 'samples': 9522432, 'steps': 49595, 'loss/train': 1.0812073945999146} 11/07/2021 04:13:25 - INFO - __main__ - Step 49597: {'lr': 0.00038289759304139815, 'samples': 9522624, 'steps': 49596, 'loss/train': 1.5305726528167725} 11/07/2021 04:13:25 - INFO - __main__ - Step 49598: {'lr': 0.0003828930981906118, 'samples': 9522816, 'steps': 49597, 'loss/train': 1.4933536052703857} 11/07/2021 04:13:26 - INFO - __main__ - Step 49599: {'lr': 0.000382888603279946, 'samples': 9523008, 'steps': 49598, 'loss/train': 0.9710624814033508} 11/07/2021 04:13:26 - INFO - __main__ - Step 49600: {'lr': 0.00038288410830940246, 'samples': 9523200, 'steps': 49599, 'loss/train': 1.8931074142456055} 11/07/2021 04:13:27 - INFO - __main__ - Step 49601: {'lr': 0.00038287961327898346, 'samples': 9523392, 'steps': 49600, 'loss/train': 0.9926786422729492} 11/07/2021 04:13:28 - INFO - __main__ - Step 49602: {'lr': 0.000382875118188691, 'samples': 9523584, 'steps': 49601, 'loss/train': 1.5702821016311646} 11/07/2021 04:13:28 - INFO - __main__ - Step 49603: {'lr': 0.000382870623038527, 'samples': 9523776, 'steps': 49602, 'loss/train': 1.3586236238479614} 11/07/2021 04:13:28 - INFO - __main__ - Step 49604: {'lr': 0.0003828661278284936, 'samples': 9523968, 'steps': 49603, 'loss/train': 1.2675609588623047} 11/07/2021 04:13:29 - INFO - __main__ - Step 49605: {'lr': 0.00038286163255859276, 'samples': 9524160, 'steps': 49604, 'loss/train': 1.3705264329910278} 11/07/2021 04:13:30 - INFO - __main__ - Step 49606: {'lr': 0.0003828571372288265, 'samples': 9524352, 'steps': 49605, 'loss/train': 1.6547865867614746} 11/07/2021 04:13:30 - INFO - __main__ - Step 49607: {'lr': 0.00038285264183919696, 'samples': 9524544, 'steps': 49606, 'loss/train': 1.2476061582565308} 11/07/2021 04:13:30 - INFO - __main__ - Step 49608: {'lr': 0.00038284814638970594, 'samples': 9524736, 'steps': 49607, 'loss/train': 1.2930322885513306} 11/07/2021 04:13:31 - INFO - __main__ - Step 49609: {'lr': 0.00038284365088035564, 'samples': 9524928, 'steps': 49608, 'loss/train': 1.6116275787353516} 11/07/2021 04:13:31 - INFO - __main__ - Step 49610: {'lr': 0.00038283915531114806, 'samples': 9525120, 'steps': 49609, 'loss/train': 1.7082544565200806} 11/07/2021 04:13:32 - INFO - __main__ - Step 49611: {'lr': 0.0003828346596820852, 'samples': 9525312, 'steps': 49610, 'loss/train': 1.5115309953689575} 11/07/2021 04:13:32 - INFO - __main__ - Step 49612: {'lr': 0.00038283016399316905, 'samples': 9525504, 'steps': 49611, 'loss/train': 1.3873403072357178} 11/07/2021 04:13:33 - INFO - __main__ - Step 49613: {'lr': 0.00038282566824440176, 'samples': 9525696, 'steps': 49612, 'loss/train': 1.6959043741226196} 11/07/2021 04:13:33 - INFO - __main__ - Step 49614: {'lr': 0.0003828211724357852, 'samples': 9525888, 'steps': 49613, 'loss/train': 5.5632734298706055} 11/07/2021 04:13:33 - INFO - __main__ - Step 49615: {'lr': 0.00038281667656732144, 'samples': 9526080, 'steps': 49614, 'loss/train': 1.538751482963562} 11/07/2021 04:13:34 - INFO - __main__ - Step 49616: {'lr': 0.0003828121806390126, 'samples': 9526272, 'steps': 49615, 'loss/train': 1.3263506889343262} 11/07/2021 04:13:35 - INFO - __main__ - Step 49617: {'lr': 0.0003828076846508606, 'samples': 9526464, 'steps': 49616, 'loss/train': 1.384452223777771} 11/07/2021 04:13:35 - INFO - __main__ - Step 49618: {'lr': 0.00038280318860286756, 'samples': 9526656, 'steps': 49617, 'loss/train': 1.475805640220642} 11/07/2021 04:13:36 - INFO - __main__ - Step 49619: {'lr': 0.0003827986924950354, 'samples': 9526848, 'steps': 49618, 'loss/train': 1.462049961090088} 11/07/2021 04:13:36 - INFO - __main__ - Step 49620: {'lr': 0.0003827941963273663, 'samples': 9527040, 'steps': 49619, 'loss/train': 1.3185629844665527} 11/07/2021 04:13:36 - INFO - __main__ - Step 49621: {'lr': 0.00038278970009986206, 'samples': 9527232, 'steps': 49620, 'loss/train': 1.1094467639923096} 11/07/2021 04:13:37 - INFO - __main__ - Step 49622: {'lr': 0.0003827852038125249, 'samples': 9527424, 'steps': 49621, 'loss/train': 1.55144202709198} 11/07/2021 04:13:38 - INFO - __main__ - Step 49623: {'lr': 0.00038278070746535674, 'samples': 9527616, 'steps': 49622, 'loss/train': 1.518005132675171} 11/07/2021 04:13:38 - INFO - __main__ - Step 49624: {'lr': 0.0003827762110583597, 'samples': 9527808, 'steps': 49623, 'loss/train': 1.5901243686676025} 11/07/2021 04:13:38 - INFO - __main__ - Step 49625: {'lr': 0.0003827717145915357, 'samples': 9528000, 'steps': 49624, 'loss/train': 1.5442684888839722} 11/07/2021 04:13:39 - INFO - __main__ - Step 49626: {'lr': 0.0003827672180648868, 'samples': 9528192, 'steps': 49625, 'loss/train': 1.7559553384780884} 11/07/2021 04:13:40 - INFO - __main__ - Step 49627: {'lr': 0.0003827627214784151, 'samples': 9528384, 'steps': 49626, 'loss/train': 1.4572389125823975} 11/07/2021 04:13:40 - INFO - __main__ - Step 49628: {'lr': 0.0003827582248321225, 'samples': 9528576, 'steps': 49627, 'loss/train': 1.6016181707382202} 11/07/2021 04:13:41 - INFO - __main__ - Step 49629: {'lr': 0.0003827537281260111, 'samples': 9528768, 'steps': 49628, 'loss/train': 1.3121598958969116} 11/07/2021 04:13:41 - INFO - __main__ - Step 49630: {'lr': 0.00038274923136008294, 'samples': 9528960, 'steps': 49629, 'loss/train': 1.4197397232055664} 11/07/2021 04:13:41 - INFO - __main__ - Step 49631: {'lr': 0.00038274473453434, 'samples': 9529152, 'steps': 49630, 'loss/train': 1.2599914073944092} 11/07/2021 04:13:42 - INFO - __main__ - Step 49632: {'lr': 0.0003827402376487844, 'samples': 9529344, 'steps': 49631, 'loss/train': 1.630716323852539} 11/07/2021 04:13:43 - INFO - __main__ - Step 49633: {'lr': 0.0003827357407034181, 'samples': 9529536, 'steps': 49632, 'loss/train': 1.7098501920700073} 11/07/2021 04:13:43 - INFO - __main__ - Step 49634: {'lr': 0.00038273124369824304, 'samples': 9529728, 'steps': 49633, 'loss/train': 1.7722996473312378} 11/07/2021 04:13:43 - INFO - __main__ - Step 49635: {'lr': 0.00038272674663326136, 'samples': 9529920, 'steps': 49634, 'loss/train': 1.3559863567352295} 11/07/2021 04:13:44 - INFO - __main__ - Step 49636: {'lr': 0.000382722249508475, 'samples': 9530112, 'steps': 49635, 'loss/train': 1.5778757333755493} 11/07/2021 04:13:44 - INFO - __main__ - Step 49637: {'lr': 0.00038271775232388616, 'samples': 9530304, 'steps': 49636, 'loss/train': 1.48716402053833} 11/07/2021 04:13:45 - INFO - __main__ - Step 49638: {'lr': 0.00038271325507949666, 'samples': 9530496, 'steps': 49637, 'loss/train': 1.8214058876037598} 11/07/2021 04:13:45 - INFO - __main__ - Step 49639: {'lr': 0.00038270875777530864, 'samples': 9530688, 'steps': 49638, 'loss/train': 1.3735169172286987} 11/07/2021 04:13:46 - INFO - __main__ - Step 49640: {'lr': 0.0003827042604113241, 'samples': 9530880, 'steps': 49639, 'loss/train': 1.2777843475341797} 11/07/2021 04:13:46 - INFO - __main__ - Step 49641: {'lr': 0.0003826997629875451, 'samples': 9531072, 'steps': 49640, 'loss/train': 1.8156064748764038} 11/07/2021 04:13:46 - INFO - __main__ - Step 49642: {'lr': 0.0003826952655039736, 'samples': 9531264, 'steps': 49641, 'loss/train': 1.287994384765625} 11/07/2021 04:13:48 - INFO - __main__ - Step 49643: {'lr': 0.0003826907679606117, 'samples': 9531456, 'steps': 49642, 'loss/train': 1.5666861534118652} 11/07/2021 04:13:48 - INFO - __main__ - Step 49644: {'lr': 0.00038268627035746133, 'samples': 9531648, 'steps': 49643, 'loss/train': 1.2110158205032349} 11/07/2021 04:13:48 - INFO - __main__ - Step 49645: {'lr': 0.00038268177269452463, 'samples': 9531840, 'steps': 49644, 'loss/train': 1.4190071821212769} 11/07/2021 04:13:49 - INFO - __main__ - Step 49646: {'lr': 0.0003826772749718036, 'samples': 9532032, 'steps': 49645, 'loss/train': 1.5874351263046265} 11/07/2021 04:13:49 - INFO - __main__ - Step 49647: {'lr': 0.00038267277718930014, 'samples': 9532224, 'steps': 49646, 'loss/train': 1.654268503189087} 11/07/2021 04:13:49 - INFO - __main__ - Step 49648: {'lr': 0.0003826682793470164, 'samples': 9532416, 'steps': 49647, 'loss/train': 0.908328652381897} 11/07/2021 04:13:50 - INFO - __main__ - Step 49649: {'lr': 0.0003826637814449544, 'samples': 9532608, 'steps': 49648, 'loss/train': 1.3017082214355469} 11/07/2021 04:13:51 - INFO - __main__ - Step 49650: {'lr': 0.00038265928348311614, 'samples': 9532800, 'steps': 49649, 'loss/train': 1.3552396297454834} 11/07/2021 04:13:51 - INFO - __main__ - Step 49651: {'lr': 0.0003826547854615037, 'samples': 9532992, 'steps': 49650, 'loss/train': 1.3476513624191284} 11/07/2021 04:13:51 - INFO - __main__ - Step 49652: {'lr': 0.000382650287380119, 'samples': 9533184, 'steps': 49651, 'loss/train': 0.6778313517570496} 11/07/2021 04:13:52 - INFO - __main__ - Step 49653: {'lr': 0.00038264578923896415, 'samples': 9533376, 'steps': 49652, 'loss/train': 1.587056279182434} 11/07/2021 04:13:53 - INFO - __main__ - Step 49654: {'lr': 0.00038264129103804113, 'samples': 9533568, 'steps': 49653, 'loss/train': 1.9432332515716553} 11/07/2021 04:13:53 - INFO - __main__ - Step 49655: {'lr': 0.00038263679277735196, 'samples': 9533760, 'steps': 49654, 'loss/train': 1.1152772903442383} 11/07/2021 04:13:54 - INFO - __main__ - Step 49656: {'lr': 0.0003826322944568988, 'samples': 9533952, 'steps': 49655, 'loss/train': 0.20244891941547394} 11/07/2021 04:13:54 - INFO - __main__ - Step 49657: {'lr': 0.00038262779607668354, 'samples': 9534144, 'steps': 49656, 'loss/train': 1.5896071195602417} 11/07/2021 04:13:55 - INFO - __main__ - Step 49658: {'lr': 0.0003826232976367082, 'samples': 9534336, 'steps': 49657, 'loss/train': 1.1312906742095947} 11/07/2021 04:13:55 - INFO - __main__ - Step 49659: {'lr': 0.0003826187991369749, 'samples': 9534528, 'steps': 49658, 'loss/train': 1.2865389585494995} 11/07/2021 04:13:56 - INFO - __main__ - Step 49660: {'lr': 0.00038261430057748557, 'samples': 9534720, 'steps': 49659, 'loss/train': 1.4453037977218628} 11/07/2021 04:13:56 - INFO - __main__ - Step 49661: {'lr': 0.0003826098019582423, 'samples': 9534912, 'steps': 49660, 'loss/train': 1.836003065109253} 11/07/2021 04:13:57 - INFO - __main__ - Step 49662: {'lr': 0.00038260530327924715, 'samples': 9535104, 'steps': 49661, 'loss/train': 1.378966212272644} 11/07/2021 04:13:57 - INFO - __main__ - Step 49663: {'lr': 0.00038260080454050207, 'samples': 9535296, 'steps': 49662, 'loss/train': 1.3185806274414062} 11/07/2021 04:13:58 - INFO - __main__ - Step 49664: {'lr': 0.00038259630574200904, 'samples': 9535488, 'steps': 49663, 'loss/train': 1.0712865591049194} 11/07/2021 04:13:58 - INFO - __main__ - Step 49665: {'lr': 0.0003825918068837702, 'samples': 9535680, 'steps': 49664, 'loss/train': 1.7213788032531738} 11/07/2021 04:13:59 - INFO - __main__ - Step 49666: {'lr': 0.00038258730796578757, 'samples': 9535872, 'steps': 49665, 'loss/train': 0.6380241513252258} 11/07/2021 04:13:59 - INFO - __main__ - Step 49667: {'lr': 0.0003825828089880631, 'samples': 9536064, 'steps': 49666, 'loss/train': 1.4698467254638672} 11/07/2021 04:13:59 - INFO - __main__ - Step 49668: {'lr': 0.00038257830995059894, 'samples': 9536256, 'steps': 49667, 'loss/train': 1.3259644508361816} 11/07/2021 04:14:00 - INFO - __main__ - Step 49669: {'lr': 0.00038257381085339694, 'samples': 9536448, 'steps': 49668, 'loss/train': 1.43675696849823} 11/07/2021 04:14:01 - INFO - __main__ - Step 49670: {'lr': 0.00038256931169645925, 'samples': 9536640, 'steps': 49669, 'loss/train': 1.3211148977279663} 11/07/2021 04:14:01 - INFO - __main__ - Step 49671: {'lr': 0.00038256481247978793, 'samples': 9536832, 'steps': 49670, 'loss/train': 1.5604088306427002} 11/07/2021 04:14:01 - INFO - __main__ - Step 49672: {'lr': 0.00038256031320338494, 'samples': 9537024, 'steps': 49671, 'loss/train': 1.1204627752304077} 11/07/2021 04:14:02 - INFO - __main__ - Step 49673: {'lr': 0.0003825558138672523, 'samples': 9537216, 'steps': 49672, 'loss/train': 1.0878816843032837} 11/07/2021 04:14:03 - INFO - __main__ - Step 49674: {'lr': 0.00038255131447139203, 'samples': 9537408, 'steps': 49673, 'loss/train': 1.3592828512191772} 11/07/2021 04:14:03 - INFO - __main__ - Step 49675: {'lr': 0.00038254681501580625, 'samples': 9537600, 'steps': 49674, 'loss/train': 1.5508614778518677} 11/07/2021 04:14:03 - INFO - __main__ - Step 49676: {'lr': 0.00038254231550049686, 'samples': 9537792, 'steps': 49675, 'loss/train': 1.5368692874908447} 11/07/2021 04:14:04 - INFO - __main__ - Step 49677: {'lr': 0.00038253781592546593, 'samples': 9537984, 'steps': 49676, 'loss/train': 1.016180396080017} 11/07/2021 04:14:04 - INFO - __main__ - Step 49678: {'lr': 0.0003825333162907155, 'samples': 9538176, 'steps': 49677, 'loss/train': 1.0984134674072266} 11/07/2021 04:14:05 - INFO - __main__ - Step 49679: {'lr': 0.0003825288165962477, 'samples': 9538368, 'steps': 49678, 'loss/train': 1.4270671606063843} 11/07/2021 04:14:06 - INFO - __main__ - Step 49680: {'lr': 0.0003825243168420644, 'samples': 9538560, 'steps': 49679, 'loss/train': 1.6393283605575562} 11/07/2021 04:14:06 - INFO - __main__ - Step 49681: {'lr': 0.00038251981702816767, 'samples': 9538752, 'steps': 49680, 'loss/train': 1.3077753782272339} 11/07/2021 04:14:06 - INFO - __main__ - Step 49682: {'lr': 0.00038251531715455955, 'samples': 9538944, 'steps': 49681, 'loss/train': 1.278901219367981} 11/07/2021 04:14:07 - INFO - __main__ - Step 49683: {'lr': 0.00038251081722124214, 'samples': 9539136, 'steps': 49682, 'loss/train': 1.7971502542495728} 11/07/2021 04:14:08 - INFO - __main__ - Step 49684: {'lr': 0.0003825063172282174, 'samples': 9539328, 'steps': 49683, 'loss/train': 1.0549887418746948} 11/07/2021 04:14:08 - INFO - __main__ - Step 49685: {'lr': 0.00038250181717548726, 'samples': 9539520, 'steps': 49684, 'loss/train': 1.6527125835418701} 11/07/2021 04:14:08 - INFO - __main__ - Step 49686: {'lr': 0.0003824973170630539, 'samples': 9539712, 'steps': 49685, 'loss/train': 1.698177456855774} 11/07/2021 04:14:09 - INFO - __main__ - Step 49687: {'lr': 0.0003824928168909193, 'samples': 9539904, 'steps': 49686, 'loss/train': 1.2676993608474731} 11/07/2021 04:14:09 - INFO - __main__ - Step 49688: {'lr': 0.00038248831665908546, 'samples': 9540096, 'steps': 49687, 'loss/train': 1.2248433828353882} 11/07/2021 04:14:10 - INFO - __main__ - Step 49689: {'lr': 0.0003824838163675545, 'samples': 9540288, 'steps': 49688, 'loss/train': 1.0999497175216675} 11/07/2021 04:14:10 - INFO - __main__ - Step 49690: {'lr': 0.0003824793160163283, 'samples': 9540480, 'steps': 49689, 'loss/train': 1.3124828338623047} 11/07/2021 04:14:11 - INFO - __main__ - Step 49691: {'lr': 0.000382474815605409, 'samples': 9540672, 'steps': 49690, 'loss/train': 1.3977371454238892} 11/07/2021 04:14:11 - INFO - __main__ - Step 49692: {'lr': 0.00038247031513479856, 'samples': 9540864, 'steps': 49691, 'loss/train': 2.3644144535064697} 11/07/2021 04:14:12 - INFO - __main__ - Step 49693: {'lr': 0.0003824658146044991, 'samples': 9541056, 'steps': 49692, 'loss/train': 1.680859088897705} 11/07/2021 04:14:13 - INFO - __main__ - Step 49694: {'lr': 0.0003824613140145125, 'samples': 9541248, 'steps': 49693, 'loss/train': 1.5952850580215454} 11/07/2021 04:14:13 - INFO - __main__ - Step 49695: {'lr': 0.00038245681336484096, 'samples': 9541440, 'steps': 49694, 'loss/train': 1.8383580446243286} 11/07/2021 04:14:13 - INFO - __main__ - Step 49696: {'lr': 0.00038245231265548633, 'samples': 9541632, 'steps': 49695, 'loss/train': 0.7911908030509949} 11/07/2021 04:14:14 - INFO - __main__ - Step 49697: {'lr': 0.0003824478118864508, 'samples': 9541824, 'steps': 49696, 'loss/train': 1.490234375} 11/07/2021 04:14:14 - INFO - __main__ - Step 49698: {'lr': 0.0003824433110577363, 'samples': 9542016, 'steps': 49697, 'loss/train': 0.9387646317481995} 11/07/2021 04:14:15 - INFO - __main__ - Step 49699: {'lr': 0.0003824388101693449, 'samples': 9542208, 'steps': 49698, 'loss/train': 1.9955414533615112} 11/07/2021 04:14:15 - INFO - __main__ - Step 49700: {'lr': 0.00038243430922127865, 'samples': 9542400, 'steps': 49699, 'loss/train': 1.8206192255020142} 11/07/2021 04:14:16 - INFO - __main__ - Step 49701: {'lr': 0.00038242980821353954, 'samples': 9542592, 'steps': 49700, 'loss/train': 1.4436894655227661} 11/07/2021 04:14:16 - INFO - __main__ - Step 49702: {'lr': 0.00038242530714612953, 'samples': 9542784, 'steps': 49701, 'loss/train': 1.7632184028625488} 11/07/2021 04:14:16 - INFO - __main__ - Step 49703: {'lr': 0.00038242080601905083, 'samples': 9542976, 'steps': 49702, 'loss/train': 1.850001573562622} 11/07/2021 04:14:18 - INFO - __main__ - Step 49704: {'lr': 0.0003824163048323053, 'samples': 9543168, 'steps': 49703, 'loss/train': 1.4575488567352295} 11/07/2021 04:14:18 - INFO - __main__ - Step 49705: {'lr': 0.000382411803585895, 'samples': 9543360, 'steps': 49704, 'loss/train': 0.9507234692573547} 11/07/2021 04:14:18 - INFO - __main__ - Step 49706: {'lr': 0.000382407302279822, 'samples': 9543552, 'steps': 49705, 'loss/train': 0.6334741115570068} 11/07/2021 04:14:19 - INFO - __main__ - Step 49707: {'lr': 0.0003824028009140883, 'samples': 9543744, 'steps': 49706, 'loss/train': 1.6214922666549683} 11/07/2021 04:14:19 - INFO - __main__ - Step 49708: {'lr': 0.000382398299488696, 'samples': 9543936, 'steps': 49707, 'loss/train': 1.3992692232131958} 11/07/2021 04:14:20 - INFO - __main__ - Step 49709: {'lr': 0.000382393798003647, 'samples': 9544128, 'steps': 49708, 'loss/train': 1.5029001235961914} 11/07/2021 04:14:20 - INFO - __main__ - Step 49710: {'lr': 0.00038238929645894345, 'samples': 9544320, 'steps': 49709, 'loss/train': 1.2773594856262207} 11/07/2021 04:14:21 - INFO - __main__ - Step 49711: {'lr': 0.00038238479485458725, 'samples': 9544512, 'steps': 49710, 'loss/train': 1.6366949081420898} 11/07/2021 04:14:21 - INFO - __main__ - Step 49712: {'lr': 0.0003823802931905806, 'samples': 9544704, 'steps': 49711, 'loss/train': 1.3928391933441162} 11/07/2021 04:14:21 - INFO - __main__ - Step 49713: {'lr': 0.0003823757914669254, 'samples': 9544896, 'steps': 49712, 'loss/train': 0.8958503603935242} 11/07/2021 04:14:22 - INFO - __main__ - Step 49714: {'lr': 0.00038237128968362366, 'samples': 9545088, 'steps': 49713, 'loss/train': 1.345287561416626} 11/07/2021 04:14:23 - INFO - __main__ - Step 49715: {'lr': 0.0003823667878406776, 'samples': 9545280, 'steps': 49714, 'loss/train': 1.3177357912063599} 11/07/2021 04:14:23 - INFO - __main__ - Step 49716: {'lr': 0.000382362285938089, 'samples': 9545472, 'steps': 49715, 'loss/train': 1.5471429824829102} 11/07/2021 04:14:23 - INFO - __main__ - Step 49717: {'lr': 0.00038235778397586, 'samples': 9545664, 'steps': 49716, 'loss/train': 1.2901102304458618} 11/07/2021 04:14:24 - INFO - __main__ - Step 49718: {'lr': 0.00038235328195399253, 'samples': 9545856, 'steps': 49717, 'loss/train': 1.408123254776001} 11/07/2021 04:14:25 - INFO - __main__ - Step 49719: {'lr': 0.0003823487798724888, 'samples': 9546048, 'steps': 49718, 'loss/train': 0.8959937691688538} 11/07/2021 04:14:25 - INFO - __main__ - Step 49720: {'lr': 0.00038234427773135084, 'samples': 9546240, 'steps': 49719, 'loss/train': 1.272355556488037} 11/07/2021 04:14:26 - INFO - __main__ - Step 49721: {'lr': 0.00038233977553058055, 'samples': 9546432, 'steps': 49720, 'loss/train': 1.237605333328247} 11/07/2021 04:14:26 - INFO - __main__ - Step 49722: {'lr': 0.0003823352732701799, 'samples': 9546624, 'steps': 49721, 'loss/train': 2.2506821155548096} 11/07/2021 04:14:27 - INFO - __main__ - Step 49723: {'lr': 0.0003823307709501511, 'samples': 9546816, 'steps': 49722, 'loss/train': 1.407764196395874} 11/07/2021 04:14:28 - INFO - __main__ - Step 49724: {'lr': 0.0003823262685704961, 'samples': 9547008, 'steps': 49723, 'loss/train': 0.09105194360017776} 11/07/2021 04:14:28 - INFO - __main__ - Step 49725: {'lr': 0.00038232176613121687, 'samples': 9547200, 'steps': 49724, 'loss/train': 1.2535902261734009} 11/07/2021 04:14:28 - INFO - __main__ - Step 49726: {'lr': 0.00038231726363231554, 'samples': 9547392, 'steps': 49725, 'loss/train': 0.0777641236782074} 11/07/2021 04:14:29 - INFO - __main__ - Step 49727: {'lr': 0.0003823127610737941, 'samples': 9547584, 'steps': 49726, 'loss/train': 0.7437136173248291} 11/07/2021 04:14:29 - INFO - __main__ - Step 49728: {'lr': 0.00038230825845565454, 'samples': 9547776, 'steps': 49727, 'loss/train': 1.60201895236969} 11/07/2021 04:14:30 - INFO - __main__ - Step 49729: {'lr': 0.00038230375577789894, 'samples': 9547968, 'steps': 49728, 'loss/train': 1.3844270706176758} 11/07/2021 04:14:30 - INFO - __main__ - Step 49730: {'lr': 0.0003822992530405293, 'samples': 9548160, 'steps': 49729, 'loss/train': 1.0805037021636963} 11/07/2021 04:14:31 - INFO - __main__ - Step 49731: {'lr': 0.00038229475024354766, 'samples': 9548352, 'steps': 49730, 'loss/train': 0.09550980478525162} 11/07/2021 04:14:31 - INFO - __main__ - Step 49732: {'lr': 0.00038229024738695605, 'samples': 9548544, 'steps': 49731, 'loss/train': 1.4066355228424072} 11/07/2021 04:14:32 - INFO - __main__ - Step 49733: {'lr': 0.0003822857444707565, 'samples': 9548736, 'steps': 49732, 'loss/train': 0.9648271203041077} 11/07/2021 04:14:32 - INFO - __main__ - Step 49734: {'lr': 0.00038228124149495104, 'samples': 9548928, 'steps': 49733, 'loss/train': 0.6856678128242493} 11/07/2021 04:14:33 - INFO - __main__ - Step 49735: {'lr': 0.0003822767384595417, 'samples': 9549120, 'steps': 49734, 'loss/train': 1.0965052843093872} 11/07/2021 04:14:33 - INFO - __main__ - Step 49736: {'lr': 0.0003822722353645305, 'samples': 9549312, 'steps': 49735, 'loss/train': 1.3487507104873657} 11/07/2021 04:14:34 - INFO - __main__ - Step 49737: {'lr': 0.00038226773220991937, 'samples': 9549504, 'steps': 49736, 'loss/train': 1.2100340127944946} 11/07/2021 04:14:34 - INFO - __main__ - Step 49738: {'lr': 0.0003822632289957105, 'samples': 9549696, 'steps': 49737, 'loss/train': 1.3853355646133423} 11/07/2021 04:14:34 - INFO - __main__ - Step 49739: {'lr': 0.000382258725721906, 'samples': 9549888, 'steps': 49738, 'loss/train': 1.366625189781189} 11/07/2021 04:14:35 - INFO - __main__ - Step 49740: {'lr': 0.0003822542223885076, 'samples': 9550080, 'steps': 49739, 'loss/train': 2.083750009536743} 11/07/2021 04:14:36 - INFO - __main__ - Step 49741: {'lr': 0.0003822497189955175, 'samples': 9550272, 'steps': 49740, 'loss/train': 1.1018315553665161} 11/07/2021 04:14:36 - INFO - __main__ - Step 49742: {'lr': 0.0003822452155429378, 'samples': 9550464, 'steps': 49741, 'loss/train': 1.5830544233322144} 11/07/2021 04:14:36 - INFO - __main__ - Step 49743: {'lr': 0.0003822407120307704, 'samples': 9550656, 'steps': 49742, 'loss/train': 1.3785125017166138} 11/07/2021 04:14:37 - INFO - __main__ - Step 49744: {'lr': 0.0003822362084590174, 'samples': 9550848, 'steps': 49743, 'loss/train': 0.9274885654449463} 11/07/2021 04:14:38 - INFO - __main__ - Step 49745: {'lr': 0.0003822317048276808, 'samples': 9551040, 'steps': 49744, 'loss/train': 1.155612587928772} 11/07/2021 04:14:38 - INFO - __main__ - Step 49746: {'lr': 0.0003822272011367626, 'samples': 9551232, 'steps': 49745, 'loss/train': 1.5381557941436768} 11/07/2021 04:14:39 - INFO - __main__ - Step 49747: {'lr': 0.0003822226973862649, 'samples': 9551424, 'steps': 49746, 'loss/train': 1.1861746311187744} 11/07/2021 04:14:39 - INFO - __main__ - Step 49748: {'lr': 0.00038221819357618967, 'samples': 9551616, 'steps': 49747, 'loss/train': 1.2957367897033691} 11/07/2021 04:14:39 - INFO - __main__ - Step 49749: {'lr': 0.0003822136897065389, 'samples': 9551808, 'steps': 49748, 'loss/train': 1.6322989463806152} 11/07/2021 04:14:40 - INFO - __main__ - Step 49750: {'lr': 0.0003822091857773148, 'samples': 9552000, 'steps': 49749, 'loss/train': 1.41116464138031} 11/07/2021 04:14:41 - INFO - __main__ - Step 49751: {'lr': 0.00038220468178851917, 'samples': 9552192, 'steps': 49750, 'loss/train': 1.1397852897644043} 11/07/2021 04:14:41 - INFO - __main__ - Step 49752: {'lr': 0.00038220017774015427, 'samples': 9552384, 'steps': 49751, 'loss/train': 1.9566864967346191} 11/07/2021 04:14:41 - INFO - __main__ - Step 49753: {'lr': 0.00038219567363222183, 'samples': 9552576, 'steps': 49752, 'loss/train': 1.3994569778442383} 11/07/2021 04:14:42 - INFO - __main__ - Step 49754: {'lr': 0.00038219116946472425, 'samples': 9552768, 'steps': 49753, 'loss/train': 1.2265424728393555} 11/07/2021 04:14:43 - INFO - __main__ - Step 49755: {'lr': 0.0003821866652376633, 'samples': 9552960, 'steps': 49754, 'loss/train': 1.6414146423339844} 11/07/2021 04:14:43 - INFO - __main__ - Step 49756: {'lr': 0.0003821821609510411, 'samples': 9553152, 'steps': 49755, 'loss/train': 1.7572424411773682} 11/07/2021 04:14:44 - INFO - __main__ - Step 49757: {'lr': 0.0003821776566048596, 'samples': 9553344, 'steps': 49756, 'loss/train': 1.6133787631988525} 11/07/2021 04:14:44 - INFO - __main__ - Step 49758: {'lr': 0.0003821731521991209, 'samples': 9553536, 'steps': 49757, 'loss/train': 0.19680075347423553} 11/07/2021 04:14:44 - INFO - __main__ - Step 49759: {'lr': 0.00038216864773382703, 'samples': 9553728, 'steps': 49758, 'loss/train': 1.293948769569397} 11/07/2021 04:14:46 - INFO - __main__ - Step 49760: {'lr': 0.00038216414320898004, 'samples': 9553920, 'steps': 49759, 'loss/train': 1.4094274044036865} 11/07/2021 04:14:46 - INFO - __main__ - Step 49761: {'lr': 0.0003821596386245819, 'samples': 9554112, 'steps': 49760, 'loss/train': 1.4295532703399658} 11/07/2021 04:14:46 - INFO - __main__ - Step 49762: {'lr': 0.00038215513398063465, 'samples': 9554304, 'steps': 49761, 'loss/train': 1.2866480350494385} 11/07/2021 04:14:47 - INFO - __main__ - Step 49763: {'lr': 0.00038215062927714037, 'samples': 9554496, 'steps': 49762, 'loss/train': 1.7364535331726074} 11/07/2021 04:14:47 - INFO - __main__ - Step 49764: {'lr': 0.000382146124514101, 'samples': 9554688, 'steps': 49763, 'loss/train': 1.5931779146194458} 11/07/2021 04:14:48 - INFO - __main__ - Step 49765: {'lr': 0.00038214161969151865, 'samples': 9554880, 'steps': 49764, 'loss/train': 1.1106199026107788} 11/07/2021 04:14:48 - INFO - __main__ - Step 49766: {'lr': 0.0003821371148093954, 'samples': 9555072, 'steps': 49765, 'loss/train': 1.1809937953948975} 11/07/2021 04:14:49 - INFO - __main__ - Step 49767: {'lr': 0.0003821326098677331, 'samples': 9555264, 'steps': 49766, 'loss/train': 1.4365804195404053} 11/07/2021 04:14:49 - INFO - __main__ - Step 49768: {'lr': 0.00038212810486653394, 'samples': 9555456, 'steps': 49767, 'loss/train': 1.1563007831573486} 11/07/2021 04:14:49 - INFO - __main__ - Step 49769: {'lr': 0.0003821235998057999, 'samples': 9555648, 'steps': 49768, 'loss/train': 1.485763430595398} 11/07/2021 04:14:50 - INFO - __main__ - Step 49770: {'lr': 0.00038211909468553295, 'samples': 9555840, 'steps': 49769, 'loss/train': 1.531764030456543} 11/07/2021 04:14:51 - INFO - __main__ - Step 49771: {'lr': 0.00038211458950573526, 'samples': 9556032, 'steps': 49770, 'loss/train': 1.1895573139190674} 11/07/2021 04:14:51 - INFO - __main__ - Step 49772: {'lr': 0.0003821100842664087, 'samples': 9556224, 'steps': 49771, 'loss/train': 1.6334819793701172} 11/07/2021 04:14:51 - INFO - __main__ - Step 49773: {'lr': 0.00038210557896755536, 'samples': 9556416, 'steps': 49772, 'loss/train': 1.3276512622833252} 11/07/2021 04:14:52 - INFO - __main__ - Step 49774: {'lr': 0.0003821010736091774, 'samples': 9556608, 'steps': 49773, 'loss/train': 1.833406686782837} 11/07/2021 04:14:52 - INFO - __main__ - Step 49775: {'lr': 0.00038209656819127664, 'samples': 9556800, 'steps': 49774, 'loss/train': 1.2205287218093872} 11/07/2021 04:14:53 - INFO - __main__ - Step 49776: {'lr': 0.0003820920627138552, 'samples': 9556992, 'steps': 49775, 'loss/train': 1.4490100145339966} 11/07/2021 04:14:54 - INFO - __main__ - Step 49777: {'lr': 0.00038208755717691515, 'samples': 9557184, 'steps': 49776, 'loss/train': 1.194211483001709} 11/07/2021 04:14:54 - INFO - __main__ - Step 49778: {'lr': 0.00038208305158045846, 'samples': 9557376, 'steps': 49777, 'loss/train': 1.0601788759231567} 11/07/2021 04:14:54 - INFO - __main__ - Step 49779: {'lr': 0.0003820785459244872, 'samples': 9557568, 'steps': 49778, 'loss/train': 1.5217045545578003} 11/07/2021 04:14:55 - INFO - __main__ - Step 49780: {'lr': 0.00038207404020900343, 'samples': 9557760, 'steps': 49779, 'loss/train': 1.3851622343063354} 11/07/2021 04:14:56 - INFO - __main__ - Step 49781: {'lr': 0.0003820695344340091, 'samples': 9557952, 'steps': 49780, 'loss/train': 0.9521015882492065} 11/07/2021 04:14:56 - INFO - __main__ - Step 49782: {'lr': 0.00038206502859950624, 'samples': 9558144, 'steps': 49781, 'loss/train': 0.9233091473579407} 11/07/2021 04:14:56 - INFO - __main__ - Step 49783: {'lr': 0.000382060522705497, 'samples': 9558336, 'steps': 49782, 'loss/train': 1.2780437469482422} 11/07/2021 04:14:57 - INFO - __main__ - Step 49784: {'lr': 0.0003820560167519832, 'samples': 9558528, 'steps': 49783, 'loss/train': 1.763771891593933} 11/07/2021 04:14:57 - INFO - __main__ - Step 49785: {'lr': 0.000382051510738967, 'samples': 9558720, 'steps': 49784, 'loss/train': 1.419110655784607} 11/07/2021 04:14:58 - INFO - __main__ - Step 49786: {'lr': 0.0003820470046664506, 'samples': 9558912, 'steps': 49785, 'loss/train': 1.7713582515716553} 11/07/2021 04:14:58 - INFO - __main__ - Step 49787: {'lr': 0.0003820424985344357, 'samples': 9559104, 'steps': 49786, 'loss/train': 1.8338513374328613} 11/07/2021 04:14:59 - INFO - __main__ - Step 49788: {'lr': 0.0003820379923429246, 'samples': 9559296, 'steps': 49787, 'loss/train': 1.5551120042800903} 11/07/2021 04:14:59 - INFO - __main__ - Step 49789: {'lr': 0.00038203348609191915, 'samples': 9559488, 'steps': 49788, 'loss/train': 1.0834661722183228} 11/07/2021 04:15:00 - INFO - __main__ - Step 49790: {'lr': 0.00038202897978142144, 'samples': 9559680, 'steps': 49789, 'loss/train': 1.7385804653167725} 11/07/2021 04:15:01 - INFO - __main__ - Step 49791: {'lr': 0.00038202447341143355, 'samples': 9559872, 'steps': 49790, 'loss/train': 1.4318112134933472} 11/07/2021 04:15:01 - INFO - __main__ - Step 49792: {'lr': 0.0003820199669819574, 'samples': 9560064, 'steps': 49791, 'loss/train': 0.9425163269042969} 11/07/2021 04:15:01 - INFO - __main__ - Step 49793: {'lr': 0.00038201546049299517, 'samples': 9560256, 'steps': 49792, 'loss/train': 1.0969769954681396} 11/07/2021 04:15:02 - INFO - __main__ - Step 49794: {'lr': 0.00038201095394454874, 'samples': 9560448, 'steps': 49793, 'loss/train': 1.5377111434936523} 11/07/2021 04:15:02 - INFO - __main__ - Step 49795: {'lr': 0.0003820064473366203, 'samples': 9560640, 'steps': 49794, 'loss/train': 1.321353793144226} 11/07/2021 04:15:03 - INFO - __main__ - Step 49796: {'lr': 0.00038200194066921166, 'samples': 9560832, 'steps': 49795, 'loss/train': 1.2820968627929688} 11/07/2021 04:15:03 - INFO - __main__ - Step 49797: {'lr': 0.00038199743394232513, 'samples': 9561024, 'steps': 49796, 'loss/train': 1.299493670463562} 11/07/2021 04:15:04 - INFO - __main__ - Step 49798: {'lr': 0.0003819929271559625, 'samples': 9561216, 'steps': 49797, 'loss/train': 1.5363938808441162} 11/07/2021 04:15:04 - INFO - __main__ - Step 49799: {'lr': 0.00038198842031012594, 'samples': 9561408, 'steps': 49798, 'loss/train': 1.254093050956726} 11/07/2021 04:15:04 - INFO - __main__ - Step 49800: {'lr': 0.00038198391340481735, 'samples': 9561600, 'steps': 49799, 'loss/train': 1.1182819604873657} 11/07/2021 04:15:05 - INFO - __main__ - Step 49801: {'lr': 0.0003819794064400389, 'samples': 9561792, 'steps': 49800, 'loss/train': 1.2071176767349243} 11/07/2021 04:15:06 - INFO - __main__ - Step 49802: {'lr': 0.00038197489941579264, 'samples': 9561984, 'steps': 49801, 'loss/train': 1.5347254276275635} 11/07/2021 04:15:06 - INFO - __main__ - Step 49803: {'lr': 0.00038197039233208043, 'samples': 9562176, 'steps': 49802, 'loss/train': 1.8398714065551758} 11/07/2021 04:15:06 - INFO - __main__ - Step 49804: {'lr': 0.0003819658851889044, 'samples': 9562368, 'steps': 49803, 'loss/train': 1.399654507637024} 11/07/2021 04:15:07 - INFO - __main__ - Step 49805: {'lr': 0.00038196137798626663, 'samples': 9562560, 'steps': 49804, 'loss/train': 0.5613115429878235} 11/07/2021 04:15:08 - INFO - __main__ - Step 49806: {'lr': 0.00038195687072416906, 'samples': 9562752, 'steps': 49805, 'loss/train': 1.5593262910842896} 11/07/2021 04:15:08 - INFO - __main__ - Step 49807: {'lr': 0.00038195236340261374, 'samples': 9562944, 'steps': 49806, 'loss/train': 1.1653640270233154} 11/07/2021 04:15:08 - INFO - __main__ - Step 49808: {'lr': 0.0003819478560216029, 'samples': 9563136, 'steps': 49807, 'loss/train': 1.3279402256011963} 11/07/2021 04:15:09 - INFO - __main__ - Step 49809: {'lr': 0.00038194334858113817, 'samples': 9563328, 'steps': 49808, 'loss/train': 1.3673834800720215} 11/07/2021 04:15:09 - INFO - __main__ - Step 49810: {'lr': 0.0003819388410812219, 'samples': 9563520, 'steps': 49809, 'loss/train': 1.2251176834106445} 11/07/2021 04:15:10 - INFO - __main__ - Step 49811: {'lr': 0.00038193433352185597, 'samples': 9563712, 'steps': 49810, 'loss/train': 2.7023043632507324} 11/07/2021 04:15:11 - INFO - __main__ - Step 49812: {'lr': 0.0003819298259030425, 'samples': 9563904, 'steps': 49811, 'loss/train': 1.3174453973770142} 11/07/2021 04:15:11 - INFO - __main__ - Step 49813: {'lr': 0.00038192531822478347, 'samples': 9564096, 'steps': 49812, 'loss/train': 1.591962456703186} 11/07/2021 04:15:11 - INFO - __main__ - Step 49814: {'lr': 0.000381920810487081, 'samples': 9564288, 'steps': 49813, 'loss/train': 1.6464207172393799} 11/07/2021 04:15:12 - INFO - __main__ - Step 49815: {'lr': 0.0003819163026899369, 'samples': 9564480, 'steps': 49814, 'loss/train': 1.0955125093460083} 11/07/2021 04:15:13 - INFO - __main__ - Step 49816: {'lr': 0.00038191179483335346, 'samples': 9564672, 'steps': 49815, 'loss/train': 1.1044607162475586} 11/07/2021 04:15:13 - INFO - __main__ - Step 49817: {'lr': 0.0003819072869173326, 'samples': 9564864, 'steps': 49816, 'loss/train': 0.9763187766075134} 11/07/2021 04:15:13 - INFO - __main__ - Step 49818: {'lr': 0.0003819027789418764, 'samples': 9565056, 'steps': 49817, 'loss/train': 1.5007073879241943} 11/07/2021 04:15:14 - INFO - __main__ - Step 49819: {'lr': 0.0003818982709069867, 'samples': 9565248, 'steps': 49818, 'loss/train': 1.4065707921981812} 11/07/2021 04:15:14 - INFO - __main__ - Step 49820: {'lr': 0.00038189376281266575, 'samples': 9565440, 'steps': 49819, 'loss/train': 1.1974198818206787} 11/07/2021 04:15:14 - INFO - __main__ - Step 49821: {'lr': 0.00038188925465891554, 'samples': 9565632, 'steps': 49820, 'loss/train': 1.432716727256775} 11/07/2021 04:15:15 - INFO - __main__ - Step 49822: {'lr': 0.000381884746445738, 'samples': 9565824, 'steps': 49821, 'loss/train': 1.4346973896026611} 11/07/2021 04:15:16 - INFO - __main__ - Step 49823: {'lr': 0.0003818802381731353, 'samples': 9566016, 'steps': 49822, 'loss/train': 1.4307574033737183} 11/07/2021 04:15:16 - INFO - __main__ - Step 49824: {'lr': 0.00038187572984110937, 'samples': 9566208, 'steps': 49823, 'loss/train': 1.0929710865020752} 11/07/2021 04:15:16 - INFO - __main__ - Step 49825: {'lr': 0.00038187122144966225, 'samples': 9566400, 'steps': 49824, 'loss/train': 1.65118408203125} 11/07/2021 04:15:17 - INFO - __main__ - Step 49826: {'lr': 0.000381866712998796, 'samples': 9566592, 'steps': 49825, 'loss/train': 1.4581557512283325} 11/07/2021 04:15:18 - INFO - __main__ - Step 49827: {'lr': 0.0003818622044885126, 'samples': 9566784, 'steps': 49826, 'loss/train': 1.3367974758148193} 11/07/2021 04:15:18 - INFO - __main__ - Step 49828: {'lr': 0.00038185769591881426, 'samples': 9566976, 'steps': 49827, 'loss/train': 2.494285821914673} 11/07/2021 04:15:19 - INFO - __main__ - Step 49829: {'lr': 0.00038185318728970277, 'samples': 9567168, 'steps': 49828, 'loss/train': 1.6039881706237793} 11/07/2021 04:15:19 - INFO - __main__ - Step 49830: {'lr': 0.00038184867860118036, 'samples': 9567360, 'steps': 49829, 'loss/train': 1.405824899673462} 11/07/2021 04:15:19 - INFO - __main__ - Step 49831: {'lr': 0.0003818441698532488, 'samples': 9567552, 'steps': 49830, 'loss/train': 1.1581023931503296} 11/07/2021 04:15:21 - INFO - __main__ - Step 49832: {'lr': 0.00038183966104591037, 'samples': 9567744, 'steps': 49831, 'loss/train': 1.0995320081710815} 11/07/2021 04:15:21 - INFO - __main__ - Step 49833: {'lr': 0.0003818351521791671, 'samples': 9567936, 'steps': 49832, 'loss/train': 1.6082440614700317} 11/07/2021 04:15:21 - INFO - __main__ - Step 49834: {'lr': 0.0003818306432530209, 'samples': 9568128, 'steps': 49833, 'loss/train': 1.2923341989517212} 11/07/2021 04:15:22 - INFO - __main__ - Step 49835: {'lr': 0.0003818261342674738, 'samples': 9568320, 'steps': 49834, 'loss/train': 0.09563875943422318} 11/07/2021 04:15:22 - INFO - __main__ - Step 49836: {'lr': 0.00038182162522252795, 'samples': 9568512, 'steps': 49835, 'loss/train': 1.1273518800735474} 11/07/2021 04:15:23 - INFO - __main__ - Step 49837: {'lr': 0.0003818171161181853, 'samples': 9568704, 'steps': 49836, 'loss/train': 1.4690061807632446} 11/07/2021 04:15:23 - INFO - __main__ - Step 49838: {'lr': 0.00038181260695444784, 'samples': 9568896, 'steps': 49837, 'loss/train': 1.4633288383483887} 11/07/2021 04:15:24 - INFO - __main__ - Step 49839: {'lr': 0.00038180809773131764, 'samples': 9569088, 'steps': 49838, 'loss/train': 1.6124905347824097} 11/07/2021 04:15:24 - INFO - __main__ - Step 49840: {'lr': 0.0003818035884487968, 'samples': 9569280, 'steps': 49839, 'loss/train': 1.2007817029953003} 11/07/2021 04:15:24 - INFO - __main__ - Step 49841: {'lr': 0.0003817990791068873, 'samples': 9569472, 'steps': 49840, 'loss/train': 1.1484363079071045} 11/07/2021 04:15:25 - INFO - __main__ - Step 49842: {'lr': 0.00038179456970559116, 'samples': 9569664, 'steps': 49841, 'loss/train': 1.6722135543823242} 11/07/2021 04:15:26 - INFO - __main__ - Step 49843: {'lr': 0.0003817900602449104, 'samples': 9569856, 'steps': 49842, 'loss/train': 1.3057899475097656} 11/07/2021 04:15:26 - INFO - __main__ - Step 49844: {'lr': 0.0003817855507248471, 'samples': 9570048, 'steps': 49843, 'loss/train': 0.6849073767662048} 11/07/2021 04:15:26 - INFO - __main__ - Step 49845: {'lr': 0.00038178104114540326, 'samples': 9570240, 'steps': 49844, 'loss/train': 1.0210819244384766} 11/07/2021 04:15:27 - INFO - __main__ - Step 49846: {'lr': 0.0003817765315065809, 'samples': 9570432, 'steps': 49845, 'loss/train': 1.4170243740081787} 11/07/2021 04:15:28 - INFO - __main__ - Step 49847: {'lr': 0.000381772021808382, 'samples': 9570624, 'steps': 49846, 'loss/train': 1.5046032667160034} 11/07/2021 04:15:28 - INFO - __main__ - Step 49848: {'lr': 0.00038176751205080885, 'samples': 9570816, 'steps': 49847, 'loss/train': 1.5529192686080933} 11/07/2021 04:15:28 - INFO - __main__ - Step 49849: {'lr': 0.00038176300223386313, 'samples': 9571008, 'steps': 49848, 'loss/train': 1.2158565521240234} 11/07/2021 04:15:29 - INFO - __main__ - Step 49850: {'lr': 0.00038175849235754704, 'samples': 9571200, 'steps': 49849, 'loss/train': 1.4324151277542114} 11/07/2021 04:15:29 - INFO - __main__ - Step 49851: {'lr': 0.00038175398242186264, 'samples': 9571392, 'steps': 49850, 'loss/train': 1.5142985582351685} 11/07/2021 04:15:30 - INFO - __main__ - Step 49852: {'lr': 0.00038174947242681194, 'samples': 9571584, 'steps': 49851, 'loss/train': 1.4615650177001953} 11/07/2021 04:15:31 - INFO - __main__ - Step 49853: {'lr': 0.000381744962372397, 'samples': 9571776, 'steps': 49852, 'loss/train': 1.3350597620010376} 11/07/2021 04:15:31 - INFO - __main__ - Step 49854: {'lr': 0.00038174045225861976, 'samples': 9571968, 'steps': 49853, 'loss/train': 1.1813478469848633} 11/07/2021 04:15:31 - INFO - __main__ - Step 49855: {'lr': 0.00038173594208548234, 'samples': 9572160, 'steps': 49854, 'loss/train': 1.561554193496704} 11/07/2021 04:15:32 - INFO - __main__ - Step 49856: {'lr': 0.00038173143185298665, 'samples': 9572352, 'steps': 49855, 'loss/train': 0.08423556387424469} 11/07/2021 04:15:32 - INFO - __main__ - Step 49857: {'lr': 0.00038172692156113484, 'samples': 9572544, 'steps': 49856, 'loss/train': 1.0528861284255981} 11/07/2021 04:15:33 - INFO - __main__ - Step 49858: {'lr': 0.000381722411209929, 'samples': 9572736, 'steps': 49857, 'loss/train': 0.5608619451522827} 11/07/2021 04:15:33 - INFO - __main__ - Step 49859: {'lr': 0.00038171790079937097, 'samples': 9572928, 'steps': 49858, 'loss/train': 1.4621555805206299} 11/07/2021 04:15:34 - INFO - __main__ - Step 49860: {'lr': 0.000381713390329463, 'samples': 9573120, 'steps': 49859, 'loss/train': 1.6315885782241821} 11/07/2021 04:15:34 - INFO - __main__ - Step 49861: {'lr': 0.00038170887980020683, 'samples': 9573312, 'steps': 49860, 'loss/train': 1.1106321811676025} 11/07/2021 04:15:34 - INFO - __main__ - Step 49862: {'lr': 0.0003817043692116049, 'samples': 9573504, 'steps': 49861, 'loss/train': 1.6139997243881226} 11/07/2021 04:15:36 - INFO - __main__ - Step 49863: {'lr': 0.00038169985856365885, 'samples': 9573696, 'steps': 49862, 'loss/train': 1.5891435146331787} 11/07/2021 04:15:36 - INFO - __main__ - Step 49864: {'lr': 0.00038169534785637097, 'samples': 9573888, 'steps': 49863, 'loss/train': 1.7982343435287476} 11/07/2021 04:15:36 - INFO - __main__ - Step 49865: {'lr': 0.00038169083708974313, 'samples': 9574080, 'steps': 49864, 'loss/train': 1.48331618309021} 11/07/2021 04:15:37 - INFO - __main__ - Step 49866: {'lr': 0.0003816863262637774, 'samples': 9574272, 'steps': 49865, 'loss/train': 1.1913880109786987} 11/07/2021 04:15:37 - INFO - __main__ - Step 49867: {'lr': 0.0003816818153784759, 'samples': 9574464, 'steps': 49866, 'loss/train': 0.8057406544685364} 11/07/2021 04:15:38 - INFO - __main__ - Step 49868: {'lr': 0.00038167730443384063, 'samples': 9574656, 'steps': 49867, 'loss/train': 1.673979640007019} 11/07/2021 04:15:38 - INFO - __main__ - Step 49869: {'lr': 0.0003816727934298736, 'samples': 9574848, 'steps': 49868, 'loss/train': 1.4075284004211426} 11/07/2021 04:15:39 - INFO - __main__ - Step 49870: {'lr': 0.0003816682823665768, 'samples': 9575040, 'steps': 49869, 'loss/train': 1.7056241035461426} 11/07/2021 04:15:39 - INFO - __main__ - Step 49871: {'lr': 0.0003816637712439523, 'samples': 9575232, 'steps': 49870, 'loss/train': 1.5761786699295044} 11/07/2021 04:15:39 - INFO - __main__ - Step 49872: {'lr': 0.0003816592600620021, 'samples': 9575424, 'steps': 49871, 'loss/train': 1.599300742149353} 11/07/2021 04:15:40 - INFO - __main__ - Step 49873: {'lr': 0.0003816547488207284, 'samples': 9575616, 'steps': 49872, 'loss/train': 1.250908613204956} 11/07/2021 04:15:41 - INFO - __main__ - Step 49874: {'lr': 0.00038165023752013294, 'samples': 9575808, 'steps': 49873, 'loss/train': 1.2514135837554932} 11/07/2021 04:15:41 - INFO - __main__ - Step 49875: {'lr': 0.00038164572616021807, 'samples': 9576000, 'steps': 49874, 'loss/train': 1.779987096786499} 11/07/2021 04:15:41 - INFO - __main__ - Step 49876: {'lr': 0.0003816412147409856, 'samples': 9576192, 'steps': 49875, 'loss/train': 1.3635135889053345} 11/07/2021 04:15:42 - INFO - __main__ - Step 49877: {'lr': 0.0003816367032624376, 'samples': 9576384, 'steps': 49876, 'loss/train': 1.3634365797042847} 11/07/2021 04:15:43 - INFO - __main__ - Step 49878: {'lr': 0.0003816321917245761, 'samples': 9576576, 'steps': 49877, 'loss/train': 1.286026120185852} 11/07/2021 04:15:44 - INFO - __main__ - Step 49879: {'lr': 0.00038162768012740323, 'samples': 9576768, 'steps': 49878, 'loss/train': 1.2849881649017334} 11/07/2021 04:15:44 - INFO - __main__ - Step 49880: {'lr': 0.00038162316847092096, 'samples': 9576960, 'steps': 49879, 'loss/train': 1.0298362970352173} 11/07/2021 04:15:45 - INFO - __main__ - Step 49881: {'lr': 0.0003816186567551313, 'samples': 9577152, 'steps': 49880, 'loss/train': 0.4664343297481537} 11/07/2021 04:15:45 - INFO - __main__ - Step 49882: {'lr': 0.0003816141449800364, 'samples': 9577344, 'steps': 49881, 'loss/train': 1.3210618495941162} 11/07/2021 04:15:46 - INFO - __main__ - Step 49883: {'lr': 0.00038160963314563806, 'samples': 9577536, 'steps': 49882, 'loss/train': 1.3121057748794556} 11/07/2021 04:15:46 - INFO - __main__ - Step 49884: {'lr': 0.00038160512125193853, 'samples': 9577728, 'steps': 49883, 'loss/train': 1.4728535413742065} 11/07/2021 04:15:47 - INFO - __main__ - Step 49885: {'lr': 0.0003816006092989397, 'samples': 9577920, 'steps': 49884, 'loss/train': 1.027085542678833} 11/07/2021 04:15:47 - INFO - __main__ - Step 49886: {'lr': 0.0003815960972866437, 'samples': 9578112, 'steps': 49885, 'loss/train': 0.914968729019165} 11/07/2021 04:15:47 - INFO - __main__ - Step 49887: {'lr': 0.00038159158521505255, 'samples': 9578304, 'steps': 49886, 'loss/train': 1.4157140254974365} 11/07/2021 04:15:48 - INFO - __main__ - Step 49888: {'lr': 0.0003815870730841683, 'samples': 9578496, 'steps': 49887, 'loss/train': 1.4689892530441284} 11/07/2021 04:15:49 - INFO - __main__ - Step 49889: {'lr': 0.00038158256089399287, 'samples': 9578688, 'steps': 49888, 'loss/train': 1.092774748802185} 11/07/2021 04:15:49 - INFO - __main__ - Step 49890: {'lr': 0.0003815780486445284, 'samples': 9578880, 'steps': 49889, 'loss/train': 1.4610096216201782} 11/07/2021 04:15:49 - INFO - __main__ - Step 49891: {'lr': 0.00038157353633577686, 'samples': 9579072, 'steps': 49890, 'loss/train': 1.5496063232421875} 11/07/2021 04:15:50 - INFO - __main__ - Step 49892: {'lr': 0.0003815690239677403, 'samples': 9579264, 'steps': 49891, 'loss/train': 0.18603570759296417} 11/07/2021 04:15:51 - INFO - __main__ - Step 49893: {'lr': 0.00038156451154042084, 'samples': 9579456, 'steps': 49892, 'loss/train': 1.3252862691879272} 11/07/2021 04:15:51 - INFO - __main__ - Step 49894: {'lr': 0.0003815599990538203, 'samples': 9579648, 'steps': 49893, 'loss/train': 1.69198739528656} 11/07/2021 04:15:52 - INFO - __main__ - Step 49895: {'lr': 0.00038155548650794103, 'samples': 9579840, 'steps': 49894, 'loss/train': 1.2539637088775635} 11/07/2021 04:15:52 - INFO - __main__ - Step 49896: {'lr': 0.00038155097390278484, 'samples': 9580032, 'steps': 49895, 'loss/train': 1.6797884702682495} 11/07/2021 04:15:52 - INFO - __main__ - Step 49897: {'lr': 0.0003815464612383538, 'samples': 9580224, 'steps': 49896, 'loss/train': 1.6274105310440063} 11/07/2021 04:15:53 - INFO - __main__ - Step 49898: {'lr': 0.0003815419485146499, 'samples': 9580416, 'steps': 49897, 'loss/train': 1.5162203311920166} 11/07/2021 04:15:54 - INFO - __main__ - Step 49899: {'lr': 0.0003815374357316753, 'samples': 9580608, 'steps': 49898, 'loss/train': 1.5220880508422852} 11/07/2021 04:15:54 - INFO - __main__ - Step 49900: {'lr': 0.0003815329228894319, 'samples': 9580800, 'steps': 49899, 'loss/train': 1.3859000205993652} 11/07/2021 04:15:54 - INFO - __main__ - Step 49901: {'lr': 0.0003815284099879218, 'samples': 9580992, 'steps': 49900, 'loss/train': 1.3224958181381226} 11/07/2021 04:15:55 - INFO - __main__ - Step 49902: {'lr': 0.00038152389702714705, 'samples': 9581184, 'steps': 49901, 'loss/train': 1.0632014274597168} 11/07/2021 04:15:56 - INFO - __main__ - Step 49903: {'lr': 0.0003815193840071097, 'samples': 9581376, 'steps': 49902, 'loss/train': 1.4790116548538208} 11/07/2021 04:15:56 - INFO - __main__ - Step 49904: {'lr': 0.0003815148709278117, 'samples': 9581568, 'steps': 49903, 'loss/train': 1.837923526763916} 11/07/2021 04:15:56 - INFO - __main__ - Step 49905: {'lr': 0.00038151035778925514, 'samples': 9581760, 'steps': 49904, 'loss/train': 1.3551061153411865} 11/07/2021 04:15:57 - INFO - __main__ - Step 49906: {'lr': 0.000381505844591442, 'samples': 9581952, 'steps': 49905, 'loss/train': 1.1105365753173828} 11/07/2021 04:15:57 - INFO - __main__ - Step 49907: {'lr': 0.0003815013313343744, 'samples': 9582144, 'steps': 49906, 'loss/train': 1.5107008218765259} 11/07/2021 04:15:58 - INFO - __main__ - Step 49908: {'lr': 0.0003814968180180544, 'samples': 9582336, 'steps': 49907, 'loss/train': 1.464564323425293} 11/07/2021 04:15:59 - INFO - __main__ - Step 49909: {'lr': 0.00038149230464248386, 'samples': 9582528, 'steps': 49908, 'loss/train': 1.664136528968811} 11/07/2021 04:15:59 - INFO - __main__ - Step 49910: {'lr': 0.000381487791207665, 'samples': 9582720, 'steps': 49909, 'loss/train': 1.5816500186920166} 11/07/2021 04:15:59 - INFO - __main__ - Step 49911: {'lr': 0.0003814832777135997, 'samples': 9582912, 'steps': 49910, 'loss/train': 1.1990145444869995} 11/07/2021 04:16:00 - INFO - __main__ - Step 49912: {'lr': 0.00038147876416029004, 'samples': 9583104, 'steps': 49911, 'loss/train': 1.7254343032836914} 11/07/2021 04:16:00 - INFO - __main__ - Step 49913: {'lr': 0.0003814742505477381, 'samples': 9583296, 'steps': 49912, 'loss/train': 1.4630495309829712} 11/07/2021 04:16:01 - INFO - __main__ - Step 49914: {'lr': 0.0003814697368759459, 'samples': 9583488, 'steps': 49913, 'loss/train': 1.644455075263977} 11/07/2021 04:16:01 - INFO - __main__ - Step 49915: {'lr': 0.0003814652231449155, 'samples': 9583680, 'steps': 49914, 'loss/train': 1.118870735168457} 11/07/2021 04:16:02 - INFO - __main__ - Step 49916: {'lr': 0.0003814607093546489, 'samples': 9583872, 'steps': 49915, 'loss/train': 1.2629168033599854} 11/07/2021 04:16:02 - INFO - __main__ - Step 49917: {'lr': 0.0003814561955051481, 'samples': 9584064, 'steps': 49916, 'loss/train': 1.5991111993789673} 11/07/2021 04:16:02 - INFO - __main__ - Step 49918: {'lr': 0.00038145168159641515, 'samples': 9584256, 'steps': 49917, 'loss/train': 1.3697715997695923} 11/07/2021 04:16:03 - INFO - __main__ - Step 49919: {'lr': 0.0003814471676284521, 'samples': 9584448, 'steps': 49918, 'loss/train': 1.385772705078125} 11/07/2021 04:16:04 - INFO - __main__ - Step 49920: {'lr': 0.00038144265360126107, 'samples': 9584640, 'steps': 49919, 'loss/train': 1.071580410003662} 11/07/2021 04:16:04 - INFO - __main__ - Step 49921: {'lr': 0.00038143813951484396, 'samples': 9584832, 'steps': 49920, 'loss/train': 1.748077392578125} 11/07/2021 04:16:05 - INFO - __main__ - Step 49922: {'lr': 0.0003814336253692028, 'samples': 9585024, 'steps': 49921, 'loss/train': 0.9060306549072266} 11/07/2021 04:16:05 - INFO - __main__ - Step 49923: {'lr': 0.0003814291111643397, 'samples': 9585216, 'steps': 49922, 'loss/train': 1.5653313398361206} 11/07/2021 04:16:06 - INFO - __main__ - Step 49924: {'lr': 0.00038142459690025665, 'samples': 9585408, 'steps': 49923, 'loss/train': 1.4163196086883545} 11/07/2021 04:16:06 - INFO - __main__ - Step 49925: {'lr': 0.0003814200825769558, 'samples': 9585600, 'steps': 49924, 'loss/train': 1.6841281652450562} 11/07/2021 04:16:07 - INFO - __main__ - Step 49926: {'lr': 0.000381415568194439, 'samples': 9585792, 'steps': 49925, 'loss/train': 1.4563446044921875} 11/07/2021 04:16:07 - INFO - __main__ - Step 49927: {'lr': 0.00038141105375270846, 'samples': 9585984, 'steps': 49926, 'loss/train': 1.2546554803848267} 11/07/2021 04:16:07 - INFO - __main__ - Step 49928: {'lr': 0.00038140653925176606, 'samples': 9586176, 'steps': 49927, 'loss/train': 1.588482141494751} 11/07/2021 04:16:08 - INFO - __main__ - Step 49929: {'lr': 0.0003814020246916139, 'samples': 9586368, 'steps': 49928, 'loss/train': 0.6331830620765686} 11/07/2021 04:16:09 - INFO - __main__ - Step 49930: {'lr': 0.000381397510072254, 'samples': 9586560, 'steps': 49929, 'loss/train': 0.9107598662376404} 11/07/2021 04:16:09 - INFO - __main__ - Step 49931: {'lr': 0.0003813929953936884, 'samples': 9586752, 'steps': 49930, 'loss/train': 1.4552175998687744} 11/07/2021 04:16:09 - INFO - __main__ - Step 49932: {'lr': 0.00038138848065591923, 'samples': 9586944, 'steps': 49931, 'loss/train': 1.0978375673294067} 11/07/2021 04:16:10 - INFO - __main__ - Step 49933: {'lr': 0.00038138396585894843, 'samples': 9587136, 'steps': 49932, 'loss/train': 1.5072349309921265} 11/07/2021 04:16:11 - INFO - __main__ - Step 49934: {'lr': 0.0003813794510027779, 'samples': 9587328, 'steps': 49933, 'loss/train': 1.138684868812561} 11/07/2021 04:16:11 - INFO - __main__ - Step 49935: {'lr': 0.00038137493608741, 'samples': 9587520, 'steps': 49934, 'loss/train': 1.2428410053253174} 11/07/2021 04:16:12 - INFO - __main__ - Step 49936: {'lr': 0.0003813704211128465, 'samples': 9587712, 'steps': 49935, 'loss/train': 0.08512834459543228} 11/07/2021 04:16:12 - INFO - __main__ - Step 49937: {'lr': 0.0003813659060790895, 'samples': 9587904, 'steps': 49936, 'loss/train': 1.707636833190918} 11/07/2021 04:16:12 - INFO - __main__ - Step 49938: {'lr': 0.00038136139098614107, 'samples': 9588096, 'steps': 49937, 'loss/train': 1.393393874168396} 11/07/2021 04:16:13 - INFO - __main__ - Step 49939: {'lr': 0.00038135687583400326, 'samples': 9588288, 'steps': 49938, 'loss/train': 1.4983201026916504} 11/07/2021 04:16:14 - INFO - __main__ - Step 49940: {'lr': 0.000381352360622678, 'samples': 9588480, 'steps': 49939, 'loss/train': 1.282761812210083} 11/07/2021 04:16:14 - INFO - __main__ - Step 49941: {'lr': 0.00038134784535216737, 'samples': 9588672, 'steps': 49940, 'loss/train': 1.4226652383804321} 11/07/2021 04:16:14 - INFO - __main__ - Step 49942: {'lr': 0.0003813433300224735, 'samples': 9588864, 'steps': 49941, 'loss/train': 1.063042163848877} 11/07/2021 04:16:15 - INFO - __main__ - Step 49943: {'lr': 0.0003813388146335983, 'samples': 9589056, 'steps': 49942, 'loss/train': 1.04142427444458} 11/07/2021 04:16:16 - INFO - __main__ - Step 49944: {'lr': 0.00038133429918554395, 'samples': 9589248, 'steps': 49943, 'loss/train': 1.3146531581878662} 11/07/2021 04:16:16 - INFO - __main__ - Step 49945: {'lr': 0.00038132978367831226, 'samples': 9589440, 'steps': 49944, 'loss/train': 1.3345072269439697} 11/07/2021 04:16:17 - INFO - __main__ - Step 49946: {'lr': 0.00038132526811190547, 'samples': 9589632, 'steps': 49945, 'loss/train': 1.4192546606063843} 11/07/2021 04:16:17 - INFO - __main__ - Step 49947: {'lr': 0.00038132075248632557, 'samples': 9589824, 'steps': 49946, 'loss/train': 1.4883534908294678} 11/07/2021 04:16:17 - INFO - __main__ - Step 49948: {'lr': 0.0003813162368015745, 'samples': 9590016, 'steps': 49947, 'loss/train': 1.16780424118042} 11/07/2021 04:16:18 - INFO - __main__ - Step 49949: {'lr': 0.00038131172105765446, 'samples': 9590208, 'steps': 49948, 'loss/train': 1.5168664455413818} 11/07/2021 04:16:19 - INFO - __main__ - Step 49950: {'lr': 0.0003813072052545673, 'samples': 9590400, 'steps': 49949, 'loss/train': 1.0772747993469238} 11/07/2021 04:16:19 - INFO - __main__ - Step 49951: {'lr': 0.00038130268939231513, 'samples': 9590592, 'steps': 49950, 'loss/train': 1.8183636665344238} 11/07/2021 04:16:19 - INFO - __main__ - Step 49952: {'lr': 0.0003812981734709, 'samples': 9590784, 'steps': 49951, 'loss/train': 0.702171266078949} 11/07/2021 04:16:20 - INFO - __main__ - Step 49953: {'lr': 0.00038129365749032395, 'samples': 9590976, 'steps': 49952, 'loss/train': 1.9576735496520996} 11/07/2021 04:16:21 - INFO - __main__ - Step 49954: {'lr': 0.000381289141450589, 'samples': 9591168, 'steps': 49953, 'loss/train': 2.3979005813598633} 11/07/2021 04:16:21 - INFO - __main__ - Step 49955: {'lr': 0.00038128462535169715, 'samples': 9591360, 'steps': 49954, 'loss/train': 1.3407514095306396} 11/07/2021 04:16:21 - INFO - __main__ - Step 49956: {'lr': 0.00038128010919365066, 'samples': 9591552, 'steps': 49955, 'loss/train': 1.8588253259658813} 11/07/2021 04:16:22 - INFO - __main__ - Step 49957: {'lr': 0.0003812755929764512, 'samples': 9591744, 'steps': 49956, 'loss/train': 1.5412076711654663} 11/07/2021 04:16:22 - INFO - __main__ - Step 49958: {'lr': 0.000381271076700101, 'samples': 9591936, 'steps': 49957, 'loss/train': 1.4377400875091553} 11/07/2021 04:16:23 - INFO - __main__ - Step 49959: {'lr': 0.00038126656036460206, 'samples': 9592128, 'steps': 49958, 'loss/train': 1.0640348196029663} 11/07/2021 04:16:23 - INFO - __main__ - Step 49960: {'lr': 0.0003812620439699565, 'samples': 9592320, 'steps': 49959, 'loss/train': 1.584822177886963} 11/07/2021 04:16:24 - INFO - __main__ - Step 49961: {'lr': 0.00038125752751616625, 'samples': 9592512, 'steps': 49960, 'loss/train': 1.4965132474899292} 11/07/2021 04:16:24 - INFO - __main__ - Step 49962: {'lr': 0.00038125301100323344, 'samples': 9592704, 'steps': 49961, 'loss/train': 1.340742826461792} 11/07/2021 04:16:24 - INFO - __main__ - Step 49963: {'lr': 0.00038124849443116, 'samples': 9592896, 'steps': 49962, 'loss/train': 1.480609655380249} 11/07/2021 04:16:26 - INFO - __main__ - Step 49964: {'lr': 0.000381243977799948, 'samples': 9593088, 'steps': 49963, 'loss/train': 1.2617064714431763} 11/07/2021 04:16:26 - INFO - __main__ - Step 49965: {'lr': 0.0003812394611095995, 'samples': 9593280, 'steps': 49964, 'loss/train': 1.535764217376709} 11/07/2021 04:16:26 - INFO - __main__ - Step 49966: {'lr': 0.0003812349443601165, 'samples': 9593472, 'steps': 49965, 'loss/train': 1.6400853395462036} 11/07/2021 04:16:27 - INFO - __main__ - Step 49967: {'lr': 0.0003812304275515012, 'samples': 9593664, 'steps': 49966, 'loss/train': 1.1503257751464844} 11/07/2021 04:16:27 - INFO - __main__ - Step 49968: {'lr': 0.00038122591068375536, 'samples': 9593856, 'steps': 49967, 'loss/train': 1.3187167644500732} 11/07/2021 04:16:27 - INFO - __main__ - Step 49969: {'lr': 0.00038122139375688116, 'samples': 9594048, 'steps': 49968, 'loss/train': 1.1606495380401611} 11/07/2021 04:16:29 - INFO - __main__ - Step 49970: {'lr': 0.0003812168767708807, 'samples': 9594240, 'steps': 49969, 'loss/train': 0.9344202280044556} 11/07/2021 04:16:29 - INFO - __main__ - Step 49971: {'lr': 0.0003812123597257559, 'samples': 9594432, 'steps': 49970, 'loss/train': 1.361876368522644} 11/07/2021 04:16:29 - INFO - __main__ - Step 49972: {'lr': 0.00038120784262150875, 'samples': 9594624, 'steps': 49971, 'loss/train': 1.2185523509979248} 11/07/2021 04:16:30 - INFO - __main__ - Step 49973: {'lr': 0.0003812033254581414, 'samples': 9594816, 'steps': 49972, 'loss/train': 0.9987174272537231} 11/07/2021 04:16:30 - INFO - __main__ - Step 49974: {'lr': 0.0003811988082356559, 'samples': 9595008, 'steps': 49973, 'loss/train': 1.5329139232635498} 11/07/2021 04:16:31 - INFO - __main__ - Step 49975: {'lr': 0.0003811942909540542, 'samples': 9595200, 'steps': 49974, 'loss/train': 1.565413475036621} 11/07/2021 04:16:31 - INFO - __main__ - Step 49976: {'lr': 0.0003811897736133385, 'samples': 9595392, 'steps': 49975, 'loss/train': 1.2943179607391357} 11/07/2021 04:16:32 - INFO - __main__ - Step 49977: {'lr': 0.0003811852562135106, 'samples': 9595584, 'steps': 49976, 'loss/train': 1.1904795169830322} 11/07/2021 04:16:32 - INFO - __main__ - Step 49978: {'lr': 0.0003811807387545727, 'samples': 9595776, 'steps': 49977, 'loss/train': 1.2342473268508911} 11/07/2021 04:16:32 - INFO - __main__ - Step 49979: {'lr': 0.0003811762212365267, 'samples': 9595968, 'steps': 49978, 'loss/train': 1.3623075485229492} 11/07/2021 04:16:33 - INFO - __main__ - Step 49980: {'lr': 0.0003811717036593748, 'samples': 9596160, 'steps': 49979, 'loss/train': 0.9780320525169373} 11/07/2021 04:16:34 - INFO - __main__ - Step 49981: {'lr': 0.00038116718602311896, 'samples': 9596352, 'steps': 49980, 'loss/train': 1.3779079914093018} 11/07/2021 04:16:34 - INFO - __main__ - Step 49982: {'lr': 0.00038116266832776113, 'samples': 9596544, 'steps': 49981, 'loss/train': 0.8242561221122742} 11/07/2021 04:16:34 - INFO - __main__ - Step 49983: {'lr': 0.0003811581505733035, 'samples': 9596736, 'steps': 49982, 'loss/train': 1.616904616355896} 11/07/2021 04:16:35 - INFO - __main__ - Step 49984: {'lr': 0.000381153632759748, 'samples': 9596928, 'steps': 49983, 'loss/train': 1.6792519092559814} 11/07/2021 04:16:36 - INFO - __main__ - Step 49985: {'lr': 0.0003811491148870967, 'samples': 9597120, 'steps': 49984, 'loss/train': 1.758325457572937} 11/07/2021 04:16:36 - INFO - __main__ - Step 49986: {'lr': 0.0003811445969553516, 'samples': 9597312, 'steps': 49985, 'loss/train': 1.5427839756011963} 11/07/2021 04:16:37 - INFO - __main__ - Step 49987: {'lr': 0.00038114007896451486, 'samples': 9597504, 'steps': 49986, 'loss/train': 1.2429639101028442} 11/07/2021 04:16:37 - INFO - __main__ - Step 49988: {'lr': 0.0003811355609145883, 'samples': 9597696, 'steps': 49987, 'loss/train': 1.6669725179672241} 11/07/2021 04:16:37 - INFO - __main__ - Step 49989: {'lr': 0.0003811310428055742, 'samples': 9597888, 'steps': 49988, 'loss/train': 2.0266692638397217} 11/07/2021 04:16:38 - INFO - __main__ - Step 49990: {'lr': 0.00038112652463747444, 'samples': 9598080, 'steps': 49989, 'loss/train': 1.3608561754226685} 11/07/2021 04:16:39 - INFO - __main__ - Step 49991: {'lr': 0.00038112200641029104, 'samples': 9598272, 'steps': 49990, 'loss/train': 1.6590118408203125} 11/07/2021 04:16:39 - INFO - __main__ - Step 49992: {'lr': 0.00038111748812402616, 'samples': 9598464, 'steps': 49991, 'loss/train': 0.44961920380592346} 11/07/2021 04:16:39 - INFO - __main__ - Step 49993: {'lr': 0.0003811129697786817, 'samples': 9598656, 'steps': 49992, 'loss/train': 1.4356154203414917} 11/07/2021 04:16:40 - INFO - __main__ - Step 49994: {'lr': 0.00038110845137425976, 'samples': 9598848, 'steps': 49993, 'loss/train': 1.4192280769348145} 11/07/2021 04:16:41 - INFO - __main__ - Step 49995: {'lr': 0.0003811039329107624, 'samples': 9599040, 'steps': 49994, 'loss/train': 0.9220527410507202} 11/07/2021 04:16:42 - INFO - __main__ - Step 49996: {'lr': 0.00038109941438819165, 'samples': 9599232, 'steps': 49995, 'loss/train': 5.768974304199219} 11/07/2021 04:16:42 - INFO - __main__ - Step 49997: {'lr': 0.00038109489580654955, 'samples': 9599424, 'steps': 49996, 'loss/train': 1.6446173191070557} 11/07/2021 04:16:42 - INFO - __main__ - Step 49998: {'lr': 0.00038109037716583806, 'samples': 9599616, 'steps': 49997, 'loss/train': 1.6709755659103394} 11/07/2021 04:16:43 - INFO - __main__ - Step 49999: {'lr': 0.0003810858584660593, 'samples': 9599808, 'steps': 49998, 'loss/train': 1.74168062210083} 11/07/2021 04:16:43 - INFO - __main__ - Step 50000: {'lr': 0.0003810813397072152, 'samples': 9600000, 'steps': 49999, 'loss/train': 1.699582576751709} 11/07/2021 04:16:43 - INFO - __main__ - Step 50001: {'lr': 0.00038107682088930797, 'samples': 9600192, 'steps': 50000, 'loss/train': 1.4355201721191406} 11/07/2021 04:16:44 - INFO - __main__ - Step 50002: {'lr': 0.00038107230201233944, 'samples': 9600384, 'steps': 50001, 'loss/train': 1.7963310480117798} 11/07/2021 04:16:45 - INFO - __main__ - Step 50003: {'lr': 0.00038106778307631187, 'samples': 9600576, 'steps': 50002, 'loss/train': 1.7608848810195923} 11/07/2021 04:16:45 - INFO - __main__ - Step 50004: {'lr': 0.0003810632640812271, 'samples': 9600768, 'steps': 50003, 'loss/train': 1.6591436862945557} 11/07/2021 04:16:45 - INFO - __main__ - Step 50005: {'lr': 0.00038105874502708726, 'samples': 9600960, 'steps': 50004, 'loss/train': 1.6088366508483887} 11/07/2021 04:16:46 - INFO - __main__ - Step 50006: {'lr': 0.0003810542259138944, 'samples': 9601152, 'steps': 50005, 'loss/train': 1.5151917934417725} 11/07/2021 04:16:47 - INFO - __main__ - Step 50007: {'lr': 0.0003810497067416505, 'samples': 9601344, 'steps': 50006, 'loss/train': 1.273219347000122} 11/07/2021 04:16:47 - INFO - __main__ - Step 50008: {'lr': 0.0003810451875103576, 'samples': 9601536, 'steps': 50007, 'loss/train': 1.481964349746704} 11/07/2021 04:16:47 - INFO - __main__ - Step 50009: {'lr': 0.0003810406682200178, 'samples': 9601728, 'steps': 50008, 'loss/train': 1.6708675622940063} 11/07/2021 04:16:48 - INFO - __main__ - Step 50010: {'lr': 0.0003810361488706331, 'samples': 9601920, 'steps': 50009, 'loss/train': 1.252386212348938} 11/07/2021 04:16:48 - INFO - __main__ - Step 50011: {'lr': 0.0003810316294622056, 'samples': 9602112, 'steps': 50010, 'loss/train': 1.366594672203064} 11/07/2021 04:16:49 - INFO - __main__ - Step 50012: {'lr': 0.0003810271099947371, 'samples': 9602304, 'steps': 50011, 'loss/train': 1.3563555479049683} 11/07/2021 04:16:50 - INFO - __main__ - Step 50013: {'lr': 0.00038102259046822993, 'samples': 9602496, 'steps': 50012, 'loss/train': 1.6523642539978027} 11/07/2021 04:16:50 - INFO - __main__ - Step 50014: {'lr': 0.00038101807088268595, 'samples': 9602688, 'steps': 50013, 'loss/train': 1.2176542282104492} 11/07/2021 04:16:50 - INFO - __main__ - Step 50015: {'lr': 0.00038101355123810733, 'samples': 9602880, 'steps': 50014, 'loss/train': 1.590633511543274} 11/07/2021 04:16:51 - INFO - __main__ - Step 50016: {'lr': 0.00038100903153449596, 'samples': 9603072, 'steps': 50015, 'loss/train': 1.1036028861999512} 11/07/2021 04:16:52 - INFO - __main__ - Step 50017: {'lr': 0.00038100451177185395, 'samples': 9603264, 'steps': 50016, 'loss/train': 1.8031105995178223} 11/07/2021 04:16:52 - INFO - __main__ - Step 50018: {'lr': 0.0003809999919501833, 'samples': 9603456, 'steps': 50017, 'loss/train': 1.193867564201355} 11/07/2021 04:16:52 - INFO - __main__ - Step 50019: {'lr': 0.00038099547206948617, 'samples': 9603648, 'steps': 50018, 'loss/train': 1.4448760747909546} 11/07/2021 04:16:53 - INFO - __main__ - Step 50020: {'lr': 0.0003809909521297644, 'samples': 9603840, 'steps': 50019, 'loss/train': 1.6645346879959106} 11/07/2021 04:16:53 - INFO - __main__ - Step 50021: {'lr': 0.00038098643213102014, 'samples': 9604032, 'steps': 50020, 'loss/train': 1.0006576776504517} 11/07/2021 04:16:54 - INFO - __main__ - Step 50022: {'lr': 0.0003809819120732554, 'samples': 9604224, 'steps': 50021, 'loss/train': 1.2743947505950928} 11/07/2021 04:16:55 - INFO - __main__ - Step 50023: {'lr': 0.00038097739195647233, 'samples': 9604416, 'steps': 50022, 'loss/train': 1.3045101165771484} 11/07/2021 04:16:55 - INFO - __main__ - Step 50024: {'lr': 0.0003809728717806728, 'samples': 9604608, 'steps': 50023, 'loss/train': 0.8881009221076965} 11/07/2021 04:16:55 - INFO - __main__ - Step 50025: {'lr': 0.00038096835154585897, 'samples': 9604800, 'steps': 50024, 'loss/train': 0.4098493754863739} 11/07/2021 04:16:56 - INFO - __main__ - Step 50026: {'lr': 0.0003809638312520327, 'samples': 9604992, 'steps': 50025, 'loss/train': 1.20095956325531} 11/07/2021 04:16:57 - INFO - __main__ - Step 50027: {'lr': 0.0003809593108991962, 'samples': 9605184, 'steps': 50026, 'loss/train': 1.4733937978744507} 11/07/2021 04:16:57 - INFO - __main__ - Step 50028: {'lr': 0.0003809547904873515, 'samples': 9605376, 'steps': 50027, 'loss/train': 1.335333228111267} 11/07/2021 04:16:58 - INFO - __main__ - Step 50029: {'lr': 0.0003809502700165006, 'samples': 9605568, 'steps': 50028, 'loss/train': 1.0062395334243774} 11/07/2021 04:16:58 - INFO - __main__ - Step 50030: {'lr': 0.00038094574948664554, 'samples': 9605760, 'steps': 50029, 'loss/train': 0.22823107242584229} 11/07/2021 04:16:59 - INFO - __main__ - Step 50031: {'lr': 0.00038094122889778824, 'samples': 9605952, 'steps': 50030, 'loss/train': 1.204109787940979} 11/07/2021 04:17:00 - INFO - __main__ - Step 50032: {'lr': 0.000380936708249931, 'samples': 9606144, 'steps': 50031, 'loss/train': 0.7152600884437561} 11/07/2021 04:17:00 - INFO - __main__ - Step 50033: {'lr': 0.0003809321875430756, 'samples': 9606336, 'steps': 50032, 'loss/train': 1.490403175354004} 11/07/2021 04:17:00 - INFO - __main__ - Step 50034: {'lr': 0.0003809276667772241, 'samples': 9606528, 'steps': 50033, 'loss/train': 1.2764545679092407} 11/07/2021 04:17:01 - INFO - __main__ - Step 50035: {'lr': 0.00038092314595237873, 'samples': 9606720, 'steps': 50034, 'loss/train': 1.7428356409072876} 11/07/2021 04:17:01 - INFO - __main__ - Step 50036: {'lr': 0.0003809186250685414, 'samples': 9606912, 'steps': 50035, 'loss/train': 1.3986958265304565} 11/07/2021 04:17:02 - INFO - __main__ - Step 50037: {'lr': 0.0003809141041257141, 'samples': 9607104, 'steps': 50036, 'loss/train': 1.1598021984100342} 11/07/2021 04:17:02 - INFO - __main__ - Step 50038: {'lr': 0.000380909583123899, 'samples': 9607296, 'steps': 50037, 'loss/train': 1.2968944311141968} 11/07/2021 04:17:03 - INFO - __main__ - Step 50039: {'lr': 0.00038090506206309805, 'samples': 9607488, 'steps': 50038, 'loss/train': 1.6650761365890503} 11/07/2021 04:17:03 - INFO - __main__ - Step 50040: {'lr': 0.00038090054094331324, 'samples': 9607680, 'steps': 50039, 'loss/train': 1.1097683906555176} 11/07/2021 04:17:03 - INFO - __main__ - Step 50041: {'lr': 0.0003808960197645467, 'samples': 9607872, 'steps': 50040, 'loss/train': 1.4362438917160034} 11/07/2021 04:17:05 - INFO - __main__ - Step 50042: {'lr': 0.00038089149852680036, 'samples': 9608064, 'steps': 50041, 'loss/train': 1.7006573677062988} 11/07/2021 04:17:05 - INFO - __main__ - Step 50043: {'lr': 0.00038088697723007647, 'samples': 9608256, 'steps': 50042, 'loss/train': 1.070981740951538} 11/07/2021 04:17:05 - INFO - __main__ - Step 50044: {'lr': 0.00038088245587437685, 'samples': 9608448, 'steps': 50043, 'loss/train': 1.192070722579956} 11/07/2021 04:17:06 - INFO - __main__ - Step 50045: {'lr': 0.00038087793445970363, 'samples': 9608640, 'steps': 50044, 'loss/train': 1.2683120965957642} 11/07/2021 04:17:06 - INFO - __main__ - Step 50046: {'lr': 0.0003808734129860588, 'samples': 9608832, 'steps': 50045, 'loss/train': 1.0339076519012451} 11/07/2021 04:17:06 - INFO - __main__ - Step 50047: {'lr': 0.0003808688914534445, 'samples': 9609024, 'steps': 50046, 'loss/train': 1.137500286102295} 11/07/2021 04:17:07 - INFO - __main__ - Step 50048: {'lr': 0.00038086436986186267, 'samples': 9609216, 'steps': 50047, 'loss/train': 1.3772196769714355} 11/07/2021 04:17:08 - INFO - __main__ - Step 50049: {'lr': 0.00038085984821131536, 'samples': 9609408, 'steps': 50048, 'loss/train': 1.4623816013336182} 11/07/2021 04:17:08 - INFO - __main__ - Step 50050: {'lr': 0.00038085532650180464, 'samples': 9609600, 'steps': 50049, 'loss/train': 0.5126489996910095} 11/07/2021 04:17:09 - INFO - __main__ - Step 50051: {'lr': 0.0003808508047333325, 'samples': 9609792, 'steps': 50050, 'loss/train': 0.09533369541168213} 11/07/2021 04:17:09 - INFO - __main__ - Step 50052: {'lr': 0.000380846282905901, 'samples': 9609984, 'steps': 50051, 'loss/train': 1.4151746034622192} 11/07/2021 04:17:10 - INFO - __main__ - Step 50053: {'lr': 0.0003808417610195122, 'samples': 9610176, 'steps': 50052, 'loss/train': 1.6462810039520264} 11/07/2021 04:17:10 - INFO - __main__ - Step 50054: {'lr': 0.0003808372390741681, 'samples': 9610368, 'steps': 50053, 'loss/train': 1.304959774017334} 11/07/2021 04:17:11 - INFO - __main__ - Step 50055: {'lr': 0.0003808327170698708, 'samples': 9610560, 'steps': 50054, 'loss/train': 1.4613430500030518} 11/07/2021 04:17:11 - INFO - __main__ - Step 50056: {'lr': 0.0003808281950066223, 'samples': 9610752, 'steps': 50055, 'loss/train': 1.3599810600280762} 11/07/2021 04:17:11 - INFO - __main__ - Step 50057: {'lr': 0.0003808236728844246, 'samples': 9610944, 'steps': 50056, 'loss/train': 0.9918197989463806} 11/07/2021 04:17:12 - INFO - __main__ - Step 50058: {'lr': 0.0003808191507032798, 'samples': 9611136, 'steps': 50057, 'loss/train': 1.5752508640289307} 11/07/2021 04:17:13 - INFO - __main__ - Step 50059: {'lr': 0.00038081462846318984, 'samples': 9611328, 'steps': 50058, 'loss/train': 1.5404647588729858} 11/07/2021 04:17:13 - INFO - __main__ - Step 50060: {'lr': 0.000380810106164157, 'samples': 9611520, 'steps': 50059, 'loss/train': 1.406663179397583} 11/07/2021 04:17:13 - INFO - __main__ - Step 50061: {'lr': 0.000380805583806183, 'samples': 9611712, 'steps': 50060, 'loss/train': 1.639754056930542} 11/07/2021 04:17:14 - INFO - __main__ - Step 50062: {'lr': 0.00038080106138927, 'samples': 9611904, 'steps': 50061, 'loss/train': 1.8533631563186646} 11/07/2021 04:17:15 - INFO - __main__ - Step 50063: {'lr': 0.00038079653891342016, 'samples': 9612096, 'steps': 50062, 'loss/train': 1.365235686302185} 11/07/2021 04:17:15 - INFO - __main__ - Step 50064: {'lr': 0.0003807920163786353, 'samples': 9612288, 'steps': 50063, 'loss/train': 1.032162070274353} 11/07/2021 04:17:15 - INFO - __main__ - Step 50065: {'lr': 0.00038078749378491763, 'samples': 9612480, 'steps': 50064, 'loss/train': 1.5522141456604004} 11/07/2021 04:17:16 - INFO - __main__ - Step 50066: {'lr': 0.00038078297113226925, 'samples': 9612672, 'steps': 50065, 'loss/train': 0.29551154375076294} 11/07/2021 04:17:16 - INFO - __main__ - Step 50067: {'lr': 0.00038077844842069193, 'samples': 9612864, 'steps': 50066, 'loss/train': 1.5626736879348755} 11/07/2021 04:17:17 - INFO - __main__ - Step 50068: {'lr': 0.00038077392565018784, 'samples': 9613056, 'steps': 50067, 'loss/train': 1.2820627689361572} 11/07/2021 04:17:18 - INFO - __main__ - Step 50069: {'lr': 0.0003807694028207591, 'samples': 9613248, 'steps': 50068, 'loss/train': 1.2960739135742188} 11/07/2021 04:17:18 - INFO - __main__ - Step 50070: {'lr': 0.0003807648799324077, 'samples': 9613440, 'steps': 50069, 'loss/train': 1.5695760250091553} 11/07/2021 04:17:18 - INFO - __main__ - Step 50071: {'lr': 0.0003807603569851357, 'samples': 9613632, 'steps': 50070, 'loss/train': 1.5570412874221802} 11/07/2021 04:17:19 - INFO - __main__ - Step 50072: {'lr': 0.0003807558339789451, 'samples': 9613824, 'steps': 50071, 'loss/train': 1.4880906343460083} 11/07/2021 04:17:20 - INFO - __main__ - Step 50073: {'lr': 0.00038075131091383783, 'samples': 9614016, 'steps': 50072, 'loss/train': 1.4011688232421875} 11/07/2021 04:17:20 - INFO - __main__ - Step 50074: {'lr': 0.0003807467877898161, 'samples': 9614208, 'steps': 50073, 'loss/train': 1.616499662399292} 11/07/2021 04:17:20 - INFO - __main__ - Step 50075: {'lr': 0.00038074226460688186, 'samples': 9614400, 'steps': 50074, 'loss/train': 1.4088376760482788} 11/07/2021 04:17:21 - INFO - __main__ - Step 50076: {'lr': 0.0003807377413650372, 'samples': 9614592, 'steps': 50075, 'loss/train': 1.7282475233078003} 11/07/2021 04:17:21 - INFO - __main__ - Step 50077: {'lr': 0.0003807332180642842, 'samples': 9614784, 'steps': 50076, 'loss/train': 1.5338096618652344} 11/07/2021 04:17:22 - INFO - __main__ - Step 50078: {'lr': 0.00038072869470462465, 'samples': 9614976, 'steps': 50077, 'loss/train': 1.6608787775039673} 11/07/2021 04:17:23 - INFO - __main__ - Step 50079: {'lr': 0.00038072417128606095, 'samples': 9615168, 'steps': 50078, 'loss/train': 0.6714621186256409} 11/07/2021 04:17:23 - INFO - __main__ - Step 50080: {'lr': 0.00038071964780859486, 'samples': 9615360, 'steps': 50079, 'loss/train': 1.2888237237930298} 11/07/2021 04:17:23 - INFO - __main__ - Step 50081: {'lr': 0.0003807151242722285, 'samples': 9615552, 'steps': 50080, 'loss/train': 1.4392421245574951} 11/07/2021 04:17:24 - INFO - __main__ - Step 50082: {'lr': 0.00038071060067696393, 'samples': 9615744, 'steps': 50081, 'loss/train': 1.5196335315704346} 11/07/2021 04:17:25 - INFO - __main__ - Step 50083: {'lr': 0.00038070607702280325, 'samples': 9615936, 'steps': 50082, 'loss/train': 1.734366774559021} 11/07/2021 04:17:25 - INFO - __main__ - Step 50084: {'lr': 0.00038070155330974844, 'samples': 9616128, 'steps': 50083, 'loss/train': 1.2994858026504517} 11/07/2021 04:17:25 - INFO - __main__ - Step 50085: {'lr': 0.0003806970295378014, 'samples': 9616320, 'steps': 50084, 'loss/train': 1.5340774059295654} 11/07/2021 04:17:26 - INFO - __main__ - Step 50086: {'lr': 0.00038069250570696433, 'samples': 9616512, 'steps': 50085, 'loss/train': 1.9248086214065552} 11/07/2021 04:17:26 - INFO - __main__ - Step 50087: {'lr': 0.00038068798181723927, 'samples': 9616704, 'steps': 50086, 'loss/train': 1.6424987316131592} 11/07/2021 04:17:27 - INFO - __main__ - Step 50088: {'lr': 0.00038068345786862825, 'samples': 9616896, 'steps': 50087, 'loss/train': 0.971694827079773} 11/07/2021 04:17:28 - INFO - __main__ - Step 50089: {'lr': 0.0003806789338611333, 'samples': 9617088, 'steps': 50088, 'loss/train': 0.5746945142745972} 11/07/2021 04:17:28 - INFO - __main__ - Step 50090: {'lr': 0.00038067440979475635, 'samples': 9617280, 'steps': 50089, 'loss/train': 1.6796151399612427} 11/07/2021 04:17:28 - INFO - __main__ - Step 50091: {'lr': 0.0003806698856694996, 'samples': 9617472, 'steps': 50090, 'loss/train': 1.5130348205566406} 11/07/2021 04:17:29 - INFO - __main__ - Step 50092: {'lr': 0.00038066536148536495, 'samples': 9617664, 'steps': 50091, 'loss/train': 1.341958999633789} 11/07/2021 04:17:30 - INFO - __main__ - Step 50093: {'lr': 0.00038066083724235455, 'samples': 9617856, 'steps': 50092, 'loss/train': 1.1158323287963867} 11/07/2021 04:17:30 - INFO - __main__ - Step 50094: {'lr': 0.00038065631294047035, 'samples': 9618048, 'steps': 50093, 'loss/train': 1.3453494310379028} 11/07/2021 04:17:30 - INFO - __main__ - Step 50095: {'lr': 0.0003806517885797145, 'samples': 9618240, 'steps': 50094, 'loss/train': 1.2856875658035278} 11/07/2021 04:17:31 - INFO - __main__ - Step 50096: {'lr': 0.0003806472641600889, 'samples': 9618432, 'steps': 50095, 'loss/train': 1.086360216140747} 11/07/2021 04:17:31 - INFO - __main__ - Step 50097: {'lr': 0.00038064273968159575, 'samples': 9618624, 'steps': 50096, 'loss/train': 1.6092201471328735} 11/07/2021 04:17:33 - INFO - __main__ - Step 50098: {'lr': 0.00038063821514423694, 'samples': 9618816, 'steps': 50097, 'loss/train': 1.7410180568695068} 11/07/2021 04:17:33 - INFO - __main__ - Step 50099: {'lr': 0.00038063369054801456, 'samples': 9619008, 'steps': 50098, 'loss/train': 1.2810670137405396} 11/07/2021 04:17:33 - INFO - __main__ - Step 50100: {'lr': 0.00038062916589293064, 'samples': 9619200, 'steps': 50099, 'loss/train': 0.9716135859489441} 11/07/2021 04:17:34 - INFO - __main__ - Step 50101: {'lr': 0.0003806246411789872, 'samples': 9619392, 'steps': 50100, 'loss/train': 1.5850729942321777} 11/07/2021 04:17:34 - INFO - __main__ - Step 50102: {'lr': 0.00038062011640618636, 'samples': 9619584, 'steps': 50101, 'loss/train': 1.6093177795410156} 11/07/2021 04:17:35 - INFO - __main__ - Step 50103: {'lr': 0.00038061559157453014, 'samples': 9619776, 'steps': 50102, 'loss/train': 0.5811560153961182} 11/07/2021 04:17:35 - INFO - __main__ - Step 50104: {'lr': 0.00038061106668402055, 'samples': 9619968, 'steps': 50103, 'loss/train': 1.1450581550598145} 11/07/2021 04:17:36 - INFO - __main__ - Step 50105: {'lr': 0.0003806065417346596, 'samples': 9620160, 'steps': 50104, 'loss/train': 1.740180253982544} 11/07/2021 04:17:36 - INFO - __main__ - Step 50106: {'lr': 0.00038060201672644934, 'samples': 9620352, 'steps': 50105, 'loss/train': 1.3290950059890747} 11/07/2021 04:17:36 - INFO - __main__ - Step 50107: {'lr': 0.00038059749165939184, 'samples': 9620544, 'steps': 50106, 'loss/train': 1.5256736278533936} 11/07/2021 04:17:37 - INFO - __main__ - Step 50108: {'lr': 0.00038059296653348917, 'samples': 9620736, 'steps': 50107, 'loss/train': 1.0681893825531006} 11/07/2021 04:17:38 - INFO - __main__ - Step 50109: {'lr': 0.00038058844134874326, 'samples': 9620928, 'steps': 50108, 'loss/train': 1.5296568870544434} 11/07/2021 04:17:38 - INFO - __main__ - Step 50110: {'lr': 0.0003805839161051563, 'samples': 9621120, 'steps': 50109, 'loss/train': 1.5743610858917236} 11/07/2021 04:17:38 - INFO - __main__ - Step 50111: {'lr': 0.00038057939080273016, 'samples': 9621312, 'steps': 50110, 'loss/train': 1.1973341703414917} 11/07/2021 04:17:39 - INFO - __main__ - Step 50112: {'lr': 0.00038057486544146703, 'samples': 9621504, 'steps': 50111, 'loss/train': 1.5013760328292847} 11/07/2021 04:17:40 - INFO - __main__ - Step 50113: {'lr': 0.0003805703400213688, 'samples': 9621696, 'steps': 50112, 'loss/train': 1.4528602361679077} 11/07/2021 04:17:40 - INFO - __main__ - Step 50114: {'lr': 0.0003805658145424376, 'samples': 9621888, 'steps': 50113, 'loss/train': 1.760488748550415} 11/07/2021 04:17:40 - INFO - __main__ - Step 50115: {'lr': 0.00038056128900467546, 'samples': 9622080, 'steps': 50114, 'loss/train': 1.398864984512329} 11/07/2021 04:17:41 - INFO - __main__ - Step 50116: {'lr': 0.00038055676340808446, 'samples': 9622272, 'steps': 50115, 'loss/train': 1.3213566541671753} 11/07/2021 04:17:41 - INFO - __main__ - Step 50117: {'lr': 0.00038055223775266666, 'samples': 9622464, 'steps': 50116, 'loss/train': 1.8905999660491943} 11/07/2021 04:17:42 - INFO - __main__ - Step 50118: {'lr': 0.0003805477120384239, 'samples': 9622656, 'steps': 50117, 'loss/train': 1.3616106510162354} 11/07/2021 04:17:43 - INFO - __main__ - Step 50119: {'lr': 0.00038054318626535845, 'samples': 9622848, 'steps': 50118, 'loss/train': 1.5988349914550781} 11/07/2021 04:17:43 - INFO - __main__ - Step 50120: {'lr': 0.00038053866043347216, 'samples': 9623040, 'steps': 50119, 'loss/train': 1.7721415758132935} 11/07/2021 04:17:43 - INFO - __main__ - Step 50121: {'lr': 0.00038053413454276725, 'samples': 9623232, 'steps': 50120, 'loss/train': 1.3219261169433594} 11/07/2021 04:17:44 - INFO - __main__ - Step 50122: {'lr': 0.00038052960859324557, 'samples': 9623424, 'steps': 50121, 'loss/train': 1.76235830783844} 11/07/2021 04:17:44 - INFO - __main__ - Step 50123: {'lr': 0.0003805250825849094, 'samples': 9623616, 'steps': 50122, 'loss/train': 1.686732530593872} 11/07/2021 04:17:45 - INFO - __main__ - Step 50124: {'lr': 0.0003805205565177606, 'samples': 9623808, 'steps': 50123, 'loss/train': 1.3251734972000122} 11/07/2021 04:17:45 - INFO - __main__ - Step 50125: {'lr': 0.0003805160303918013, 'samples': 9624000, 'steps': 50124, 'loss/train': 1.3244572877883911} 11/07/2021 04:17:46 - INFO - __main__ - Step 50126: {'lr': 0.0003805115042070333, 'samples': 9624192, 'steps': 50125, 'loss/train': 1.2289057970046997} 11/07/2021 04:17:46 - INFO - __main__ - Step 50127: {'lr': 0.000380506977963459, 'samples': 9624384, 'steps': 50126, 'loss/train': 1.5427542924880981} 11/07/2021 04:17:46 - INFO - __main__ - Step 50128: {'lr': 0.00038050245166108024, 'samples': 9624576, 'steps': 50127, 'loss/train': 1.2708112001419067} 11/07/2021 04:17:47 - INFO - __main__ - Step 50129: {'lr': 0.000380497925299899, 'samples': 9624768, 'steps': 50128, 'loss/train': 1.509828805923462} 11/07/2021 04:17:48 - INFO - __main__ - Step 50130: {'lr': 0.0003804933988799175, 'samples': 9624960, 'steps': 50129, 'loss/train': 1.2576930522918701} 11/07/2021 04:17:48 - INFO - __main__ - Step 50131: {'lr': 0.0003804888724011377, 'samples': 9625152, 'steps': 50130, 'loss/train': 1.5663971900939941} 11/07/2021 04:17:48 - INFO - __main__ - Step 50132: {'lr': 0.00038048434586356164, 'samples': 9625344, 'steps': 50131, 'loss/train': 1.1456104516983032} 11/07/2021 04:17:49 - INFO - __main__ - Step 50133: {'lr': 0.0003804798192671912, 'samples': 9625536, 'steps': 50132, 'loss/train': 1.7524334192276} 11/07/2021 04:17:50 - INFO - __main__ - Step 50134: {'lr': 0.00038047529261202876, 'samples': 9625728, 'steps': 50133, 'loss/train': 1.5693469047546387} 11/07/2021 04:17:50 - INFO - __main__ - Step 50135: {'lr': 0.0003804707658980761, 'samples': 9625920, 'steps': 50134, 'loss/train': 1.1924694776535034} 11/07/2021 04:17:51 - INFO - __main__ - Step 50136: {'lr': 0.0003804662391253352, 'samples': 9626112, 'steps': 50135, 'loss/train': 1.474402666091919} 11/07/2021 04:17:51 - INFO - __main__ - Step 50137: {'lr': 0.00038046171229380837, 'samples': 9626304, 'steps': 50136, 'loss/train': 1.3929524421691895} 11/07/2021 04:17:51 - INFO - __main__ - Step 50138: {'lr': 0.0003804571854034975, 'samples': 9626496, 'steps': 50137, 'loss/train': 0.9163033366203308} 11/07/2021 04:17:52 - INFO - __main__ - Step 50139: {'lr': 0.0003804526584544046, 'samples': 9626688, 'steps': 50138, 'loss/train': 1.1019681692123413} 11/07/2021 04:17:53 - INFO - __main__ - Step 50140: {'lr': 0.0003804481314465317, 'samples': 9626880, 'steps': 50139, 'loss/train': 1.2081000804901123} 11/07/2021 04:17:53 - INFO - __main__ - Step 50141: {'lr': 0.0003804436043798809, 'samples': 9627072, 'steps': 50140, 'loss/train': 1.5302804708480835} 11/07/2021 04:17:53 - INFO - __main__ - Step 50142: {'lr': 0.00038043907725445424, 'samples': 9627264, 'steps': 50141, 'loss/train': 1.393977403640747} 11/07/2021 04:17:54 - INFO - __main__ - Step 50143: {'lr': 0.00038043455007025375, 'samples': 9627456, 'steps': 50142, 'loss/train': 1.6601444482803345} 11/07/2021 04:17:55 - INFO - __main__ - Step 50144: {'lr': 0.00038043002282728153, 'samples': 9627648, 'steps': 50143, 'loss/train': 1.873679757118225} 11/07/2021 04:17:55 - INFO - __main__ - Step 50145: {'lr': 0.00038042549552553954, 'samples': 9627840, 'steps': 50144, 'loss/train': 1.777874231338501} 11/07/2021 04:17:56 - INFO - __main__ - Step 50146: {'lr': 0.00038042096816502967, 'samples': 9628032, 'steps': 50145, 'loss/train': 1.4701316356658936} 11/07/2021 04:17:56 - INFO - __main__ - Step 50147: {'lr': 0.0003804164407457543, 'samples': 9628224, 'steps': 50146, 'loss/train': 1.5121428966522217} 11/07/2021 04:17:56 - INFO - __main__ - Step 50148: {'lr': 0.0003804119132677152, 'samples': 9628416, 'steps': 50147, 'loss/train': 1.4496904611587524} 11/07/2021 04:17:57 - INFO - __main__ - Step 50149: {'lr': 0.0003804073857309145, 'samples': 9628608, 'steps': 50148, 'loss/train': 2.126142740249634} 11/07/2021 04:17:58 - INFO - __main__ - Step 50150: {'lr': 0.00038040285813535434, 'samples': 9628800, 'steps': 50149, 'loss/train': 1.5035032033920288} 11/07/2021 04:17:58 - INFO - __main__ - Step 50151: {'lr': 0.0003803983304810367, 'samples': 9628992, 'steps': 50150, 'loss/train': 0.773347795009613} 11/07/2021 04:17:58 - INFO - __main__ - Step 50152: {'lr': 0.0003803938027679634, 'samples': 9629184, 'steps': 50151, 'loss/train': 0.7962072491645813} 11/07/2021 04:17:59 - INFO - __main__ - Step 50153: {'lr': 0.0003803892749961368, 'samples': 9629376, 'steps': 50152, 'loss/train': 1.6893115043640137} 11/07/2021 04:18:00 - INFO - __main__ - Step 50154: {'lr': 0.0003803847471655587, 'samples': 9629568, 'steps': 50153, 'loss/train': 1.2985191345214844} 11/07/2021 04:18:00 - INFO - __main__ - Step 50155: {'lr': 0.00038038021927623133, 'samples': 9629760, 'steps': 50154, 'loss/train': 2.1850473880767822} 11/07/2021 04:18:00 - INFO - __main__ - Step 50156: {'lr': 0.00038037569132815663, 'samples': 9629952, 'steps': 50155, 'loss/train': 1.6614058017730713} 11/07/2021 04:18:01 - INFO - __main__ - Step 50157: {'lr': 0.0003803711633213367, 'samples': 9630144, 'steps': 50156, 'loss/train': 0.8183517456054688} 11/07/2021 04:18:01 - INFO - __main__ - Step 50158: {'lr': 0.0003803666352557735, 'samples': 9630336, 'steps': 50157, 'loss/train': 1.7416718006134033} 11/07/2021 04:18:02 - INFO - __main__ - Step 50159: {'lr': 0.0003803621071314691, 'samples': 9630528, 'steps': 50158, 'loss/train': 1.6603349447250366} 11/07/2021 04:18:03 - INFO - __main__ - Step 50160: {'lr': 0.0003803575789484255, 'samples': 9630720, 'steps': 50159, 'loss/train': 1.5331896543502808} 11/07/2021 04:18:03 - INFO - __main__ - Step 50161: {'lr': 0.0003803530507066448, 'samples': 9630912, 'steps': 50160, 'loss/train': 1.3435156345367432} 11/07/2021 04:18:03 - INFO - __main__ - Step 50162: {'lr': 0.00038034852240612907, 'samples': 9631104, 'steps': 50161, 'loss/train': 1.4418418407440186} 11/07/2021 04:18:04 - INFO - __main__ - Step 50163: {'lr': 0.00038034399404688024, 'samples': 9631296, 'steps': 50162, 'loss/train': 0.3798968195915222} 11/07/2021 04:18:04 - INFO - __main__ - Step 50164: {'lr': 0.00038033946562890055, 'samples': 9631488, 'steps': 50163, 'loss/train': 0.9605880975723267} 11/07/2021 04:18:05 - INFO - __main__ - Step 50165: {'lr': 0.0003803349371521918, 'samples': 9631680, 'steps': 50164, 'loss/train': 1.5995622873306274} 11/07/2021 04:18:06 - INFO - __main__ - Step 50166: {'lr': 0.00038033040861675617, 'samples': 9631872, 'steps': 50165, 'loss/train': 1.0809211730957031} 11/07/2021 04:18:06 - INFO - __main__ - Step 50167: {'lr': 0.0003803258800225956, 'samples': 9632064, 'steps': 50166, 'loss/train': 0.08810996264219284} 11/07/2021 04:18:06 - INFO - __main__ - Step 50168: {'lr': 0.0003803213513697123, 'samples': 9632256, 'steps': 50167, 'loss/train': 1.607441782951355} 11/07/2021 04:18:07 - INFO - __main__ - Step 50169: {'lr': 0.0003803168226581082, 'samples': 9632448, 'steps': 50168, 'loss/train': 1.309495210647583} 11/07/2021 04:18:08 - INFO - __main__ - Step 50170: {'lr': 0.00038031229388778526, 'samples': 9632640, 'steps': 50169, 'loss/train': 1.2352304458618164} 11/07/2021 04:18:08 - INFO - __main__ - Step 50171: {'lr': 0.00038030776505874577, 'samples': 9632832, 'steps': 50170, 'loss/train': 1.6889320611953735} 11/07/2021 04:18:09 - INFO - __main__ - Step 50172: {'lr': 0.0003803032361709915, 'samples': 9633024, 'steps': 50171, 'loss/train': 1.7311201095581055} 11/07/2021 04:18:09 - INFO - __main__ - Step 50173: {'lr': 0.00038029870722452455, 'samples': 9633216, 'steps': 50172, 'loss/train': 1.1639690399169922} 11/07/2021 04:18:09 - INFO - __main__ - Step 50174: {'lr': 0.0003802941782193471, 'samples': 9633408, 'steps': 50173, 'loss/train': 0.15695008635520935} 11/07/2021 04:18:10 - INFO - __main__ - Step 50175: {'lr': 0.00038028964915546107, 'samples': 9633600, 'steps': 50174, 'loss/train': 1.5289998054504395} 11/07/2021 04:18:11 - INFO - __main__ - Step 50176: {'lr': 0.00038028512003286853, 'samples': 9633792, 'steps': 50175, 'loss/train': 1.2912172079086304} 11/07/2021 04:18:11 - INFO - __main__ - Step 50177: {'lr': 0.00038028059085157165, 'samples': 9633984, 'steps': 50176, 'loss/train': 1.3492580652236938} 11/07/2021 04:18:11 - INFO - __main__ - Step 50178: {'lr': 0.0003802760616115722, 'samples': 9634176, 'steps': 50177, 'loss/train': 1.8885968923568726} 11/07/2021 04:18:12 - INFO - __main__ - Step 50179: {'lr': 0.0003802715323128724, 'samples': 9634368, 'steps': 50178, 'loss/train': 1.1622658967971802} 11/07/2021 04:18:13 - INFO - __main__ - Step 50180: {'lr': 0.00038026700295547424, 'samples': 9634560, 'steps': 50179, 'loss/train': 1.4751503467559814} 11/07/2021 04:18:13 - INFO - __main__ - Step 50181: {'lr': 0.0003802624735393798, 'samples': 9634752, 'steps': 50180, 'loss/train': 1.3558752536773682} 11/07/2021 04:18:14 - INFO - __main__ - Step 50182: {'lr': 0.00038025794406459115, 'samples': 9634944, 'steps': 50181, 'loss/train': 1.3577591180801392} 11/07/2021 04:18:14 - INFO - __main__ - Step 50183: {'lr': 0.00038025341453111017, 'samples': 9635136, 'steps': 50182, 'loss/train': 1.314875841140747} 11/07/2021 04:18:14 - INFO - __main__ - Step 50184: {'lr': 0.0003802488849389391, 'samples': 9635328, 'steps': 50183, 'loss/train': 0.12867824733257294} 11/07/2021 04:18:15 - INFO - __main__ - Step 50185: {'lr': 0.0003802443552880799, 'samples': 9635520, 'steps': 50184, 'loss/train': 1.5836074352264404} 11/07/2021 04:18:16 - INFO - __main__ - Step 50186: {'lr': 0.00038023982557853456, 'samples': 9635712, 'steps': 50185, 'loss/train': 1.537987470626831} 11/07/2021 04:18:16 - INFO - __main__ - Step 50187: {'lr': 0.00038023529581030516, 'samples': 9635904, 'steps': 50186, 'loss/train': 1.3645615577697754} 11/07/2021 04:18:16 - INFO - __main__ - Step 50188: {'lr': 0.00038023076598339375, 'samples': 9636096, 'steps': 50187, 'loss/train': 1.2964589595794678} 11/07/2021 04:18:17 - INFO - __main__ - Step 50189: {'lr': 0.0003802262360978024, 'samples': 9636288, 'steps': 50188, 'loss/train': 4.468301296234131} 11/07/2021 04:18:18 - INFO - __main__ - Step 50190: {'lr': 0.00038022170615353314, 'samples': 9636480, 'steps': 50189, 'loss/train': 0.9755254983901978} 11/07/2021 04:18:18 - INFO - __main__ - Step 50191: {'lr': 0.00038021717615058795, 'samples': 9636672, 'steps': 50190, 'loss/train': 1.2911059856414795} 11/07/2021 04:18:18 - INFO - __main__ - Step 50192: {'lr': 0.00038021264608896884, 'samples': 9636864, 'steps': 50191, 'loss/train': 1.4070249795913696} 11/07/2021 04:18:19 - INFO - __main__ - Step 50193: {'lr': 0.000380208115968678, 'samples': 9637056, 'steps': 50192, 'loss/train': 1.7199753522872925} 11/07/2021 04:18:19 - INFO - __main__ - Step 50194: {'lr': 0.00038020358578971737, 'samples': 9637248, 'steps': 50193, 'loss/train': 0.8250120878219604} 11/07/2021 04:18:19 - INFO - __main__ - Step 50195: {'lr': 0.000380199055552089, 'samples': 9637440, 'steps': 50194, 'loss/train': 1.2280455827713013} 11/07/2021 04:18:21 - INFO - __main__ - Step 50196: {'lr': 0.000380194525255795, 'samples': 9637632, 'steps': 50195, 'loss/train': 0.7293029427528381} 11/07/2021 04:18:21 - INFO - __main__ - Step 50197: {'lr': 0.0003801899949008373, 'samples': 9637824, 'steps': 50196, 'loss/train': 1.4373277425765991} 11/07/2021 04:18:21 - INFO - __main__ - Step 50198: {'lr': 0.000380185464487218, 'samples': 9638016, 'steps': 50197, 'loss/train': 1.2845509052276611} 11/07/2021 04:18:22 - INFO - __main__ - Step 50199: {'lr': 0.00038018093401493916, 'samples': 9638208, 'steps': 50198, 'loss/train': 1.3581730127334595} 11/07/2021 04:18:22 - INFO - __main__ - Step 50200: {'lr': 0.00038017640348400286, 'samples': 9638400, 'steps': 50199, 'loss/train': 1.4440293312072754} 11/07/2021 04:18:23 - INFO - __main__ - Step 50201: {'lr': 0.000380171872894411, 'samples': 9638592, 'steps': 50200, 'loss/train': 1.8245569467544556} 11/07/2021 04:18:24 - INFO - __main__ - Step 50202: {'lr': 0.00038016734224616565, 'samples': 9638784, 'steps': 50201, 'loss/train': 1.4053648710250854} 11/07/2021 04:18:24 - INFO - __main__ - Step 50203: {'lr': 0.000380162811539269, 'samples': 9638976, 'steps': 50202, 'loss/train': 1.519346833229065} 11/07/2021 04:18:24 - INFO - __main__ - Step 50204: {'lr': 0.0003801582807737229, 'samples': 9639168, 'steps': 50203, 'loss/train': 0.9944263100624084} 11/07/2021 04:18:25 - INFO - __main__ - Step 50205: {'lr': 0.00038015374994952966, 'samples': 9639360, 'steps': 50204, 'loss/train': 1.582240343093872} 11/07/2021 04:18:25 - INFO - __main__ - Step 50206: {'lr': 0.0003801492190666911, 'samples': 9639552, 'steps': 50205, 'loss/train': 1.3527615070343018} 11/07/2021 04:18:26 - INFO - __main__ - Step 50207: {'lr': 0.00038014468812520917, 'samples': 9639744, 'steps': 50206, 'loss/train': 1.7546110153198242} 11/07/2021 04:18:26 - INFO - __main__ - Step 50208: {'lr': 0.00038014015712508617, 'samples': 9639936, 'steps': 50207, 'loss/train': 1.6559122800827026} 11/07/2021 04:18:27 - INFO - __main__ - Step 50209: {'lr': 0.000380135626066324, 'samples': 9640128, 'steps': 50208, 'loss/train': 1.6707357168197632} 11/07/2021 04:18:27 - INFO - __main__ - Step 50210: {'lr': 0.00038013109494892467, 'samples': 9640320, 'steps': 50209, 'loss/train': 1.460334062576294} 11/07/2021 04:18:27 - INFO - __main__ - Step 50211: {'lr': 0.00038012656377289035, 'samples': 9640512, 'steps': 50210, 'loss/train': 1.4711841344833374} 11/07/2021 04:18:29 - INFO - __main__ - Step 50212: {'lr': 0.000380122032538223, 'samples': 9640704, 'steps': 50211, 'loss/train': 1.8600960969924927} 11/07/2021 04:18:29 - INFO - __main__ - Step 50213: {'lr': 0.0003801175012449246, 'samples': 9640896, 'steps': 50212, 'loss/train': 1.8999093770980835} 11/07/2021 04:18:29 - INFO - __main__ - Step 50214: {'lr': 0.0003801129698929974, 'samples': 9641088, 'steps': 50213, 'loss/train': 1.8671318292617798} 11/07/2021 04:18:30 - INFO - __main__ - Step 50215: {'lr': 0.00038010843848244316, 'samples': 9641280, 'steps': 50214, 'loss/train': 1.1005767583847046} 11/07/2021 04:18:30 - INFO - __main__ - Step 50216: {'lr': 0.00038010390701326415, 'samples': 9641472, 'steps': 50215, 'loss/train': 1.3407621383666992} 11/07/2021 04:18:31 - INFO - __main__ - Step 50217: {'lr': 0.00038009937548546223, 'samples': 9641664, 'steps': 50216, 'loss/train': 1.4449464082717896} 11/07/2021 04:18:31 - INFO - __main__ - Step 50218: {'lr': 0.0003800948438990397, 'samples': 9641856, 'steps': 50217, 'loss/train': 0.5882311463356018} 11/07/2021 04:18:32 - INFO - __main__ - Step 50219: {'lr': 0.0003800903122539983, 'samples': 9642048, 'steps': 50218, 'loss/train': 1.342165470123291} 11/07/2021 04:18:32 - INFO - __main__ - Step 50220: {'lr': 0.00038008578055034024, 'samples': 9642240, 'steps': 50219, 'loss/train': 1.6090655326843262} 11/07/2021 04:18:32 - INFO - __main__ - Step 50221: {'lr': 0.0003800812487880676, 'samples': 9642432, 'steps': 50220, 'loss/train': 2.008596658706665} 11/07/2021 04:18:33 - INFO - __main__ - Step 50222: {'lr': 0.00038007671696718226, 'samples': 9642624, 'steps': 50221, 'loss/train': 1.2963448762893677} 11/07/2021 04:18:34 - INFO - __main__ - Step 50223: {'lr': 0.0003800721850876864, 'samples': 9642816, 'steps': 50222, 'loss/train': 1.3931059837341309} 11/07/2021 04:18:34 - INFO - __main__ - Step 50224: {'lr': 0.00038006765314958205, 'samples': 9643008, 'steps': 50223, 'loss/train': 1.0996228456497192} 11/07/2021 04:18:34 - INFO - __main__ - Step 50225: {'lr': 0.00038006312115287125, 'samples': 9643200, 'steps': 50224, 'loss/train': 0.7568684220314026} 11/07/2021 04:18:35 - INFO - __main__ - Step 50226: {'lr': 0.00038005858909755596, 'samples': 9643392, 'steps': 50225, 'loss/train': 1.444982647895813} 11/07/2021 04:18:36 - INFO - __main__ - Step 50227: {'lr': 0.00038005405698363824, 'samples': 9643584, 'steps': 50226, 'loss/train': 1.4651623964309692} 11/07/2021 04:18:36 - INFO - __main__ - Step 50228: {'lr': 0.0003800495248111202, 'samples': 9643776, 'steps': 50227, 'loss/train': 0.8650681376457214} 11/07/2021 04:18:36 - INFO - __main__ - Step 50229: {'lr': 0.00038004499258000393, 'samples': 9643968, 'steps': 50228, 'loss/train': 2.6625075340270996} 11/07/2021 04:18:37 - INFO - __main__ - Step 50230: {'lr': 0.0003800404602902913, 'samples': 9644160, 'steps': 50229, 'loss/train': 1.6091790199279785} 11/07/2021 04:18:37 - INFO - __main__ - Step 50231: {'lr': 0.0003800359279419845, 'samples': 9644352, 'steps': 50230, 'loss/train': 1.3057498931884766} 11/07/2021 04:18:38 - INFO - __main__ - Step 50232: {'lr': 0.0003800313955350855, 'samples': 9644544, 'steps': 50231, 'loss/train': 0.8586352467536926} 11/07/2021 04:18:38 - INFO - __main__ - Step 50233: {'lr': 0.0003800268630695963, 'samples': 9644736, 'steps': 50232, 'loss/train': 1.289310097694397} 11/07/2021 04:18:39 - INFO - __main__ - Step 50234: {'lr': 0.00038002233054551906, 'samples': 9644928, 'steps': 50233, 'loss/train': 1.217408537864685} 11/07/2021 04:18:39 - INFO - __main__ - Step 50235: {'lr': 0.00038001779796285575, 'samples': 9645120, 'steps': 50234, 'loss/train': 1.2559170722961426} 11/07/2021 04:18:40 - INFO - __main__ - Step 50236: {'lr': 0.0003800132653216084, 'samples': 9645312, 'steps': 50235, 'loss/train': 1.3718836307525635} 11/07/2021 04:18:41 - INFO - __main__ - Step 50237: {'lr': 0.00038000873262177914, 'samples': 9645504, 'steps': 50236, 'loss/train': 1.396490454673767} 11/07/2021 04:18:41 - INFO - __main__ - Step 50238: {'lr': 0.00038000419986336997, 'samples': 9645696, 'steps': 50237, 'loss/train': 1.6394122838974} 11/07/2021 04:18:41 - INFO - __main__ - Step 50239: {'lr': 0.0003799996670463828, 'samples': 9645888, 'steps': 50238, 'loss/train': 1.2472902536392212} 11/07/2021 04:18:42 - INFO - __main__ - Step 50240: {'lr': 0.0003799951341708199, 'samples': 9646080, 'steps': 50239, 'loss/train': 1.4720319509506226} 11/07/2021 04:18:42 - INFO - __main__ - Step 50241: {'lr': 0.0003799906012366832, 'samples': 9646272, 'steps': 50240, 'loss/train': 0.07744947820901871} 11/07/2021 04:18:43 - INFO - __main__ - Step 50242: {'lr': 0.0003799860682439746, 'samples': 9646464, 'steps': 50241, 'loss/train': 1.135290503501892} 11/07/2021 04:18:43 - INFO - __main__ - Step 50243: {'lr': 0.0003799815351926964, 'samples': 9646656, 'steps': 50242, 'loss/train': 2.1058695316314697} 11/07/2021 04:18:44 - INFO - __main__ - Step 50244: {'lr': 0.0003799770020828505, 'samples': 9646848, 'steps': 50243, 'loss/train': 1.246898889541626} 11/07/2021 04:18:44 - INFO - __main__ - Step 50245: {'lr': 0.000379972468914439, 'samples': 9647040, 'steps': 50244, 'loss/train': 1.1737421751022339} 11/07/2021 04:18:45 - INFO - __main__ - Step 50246: {'lr': 0.0003799679356874639, 'samples': 9647232, 'steps': 50245, 'loss/train': 1.6934196949005127} 11/07/2021 04:18:45 - INFO - __main__ - Step 50247: {'lr': 0.0003799634024019272, 'samples': 9647424, 'steps': 50246, 'loss/train': 1.6743232011795044} 11/07/2021 04:18:46 - INFO - __main__ - Step 50248: {'lr': 0.0003799588690578311, 'samples': 9647616, 'steps': 50247, 'loss/train': 5.710962772369385} 11/07/2021 04:18:46 - INFO - __main__ - Step 50249: {'lr': 0.0003799543356551773, 'samples': 9647808, 'steps': 50248, 'loss/train': 1.0744068622589111} 11/07/2021 04:18:47 - INFO - __main__ - Step 50250: {'lr': 0.00037994980219396835, 'samples': 9648000, 'steps': 50249, 'loss/train': 1.2862998247146606} 11/07/2021 04:18:47 - INFO - __main__ - Step 50251: {'lr': 0.00037994526867420595, 'samples': 9648192, 'steps': 50250, 'loss/train': 1.4021929502487183} 11/07/2021 04:18:47 - INFO - __main__ - Step 50252: {'lr': 0.0003799407350958922, 'samples': 9648384, 'steps': 50251, 'loss/train': 1.3082607984542847} 11/07/2021 04:18:48 - INFO - __main__ - Step 50253: {'lr': 0.00037993620145902914, 'samples': 9648576, 'steps': 50252, 'loss/train': 0.08240436017513275} 11/07/2021 04:18:49 - INFO - __main__ - Step 50254: {'lr': 0.00037993166776361883, 'samples': 9648768, 'steps': 50253, 'loss/train': 1.5516008138656616} 11/07/2021 04:18:49 - INFO - __main__ - Step 50255: {'lr': 0.0003799271340096633, 'samples': 9648960, 'steps': 50254, 'loss/train': 1.3444350957870483} 11/07/2021 04:18:49 - INFO - __main__ - Step 50256: {'lr': 0.00037992260019716463, 'samples': 9649152, 'steps': 50255, 'loss/train': 1.1952298879623413} 11/07/2021 04:18:50 - INFO - __main__ - Step 50257: {'lr': 0.00037991806632612485, 'samples': 9649344, 'steps': 50256, 'loss/train': 1.0896855592727661} 11/07/2021 04:18:51 - INFO - __main__ - Step 50258: {'lr': 0.000379913532396546, 'samples': 9649536, 'steps': 50257, 'loss/train': 1.1467939615249634} 11/07/2021 04:18:51 - INFO - __main__ - Step 50259: {'lr': 0.0003799089984084302, 'samples': 9649728, 'steps': 50258, 'loss/train': 1.575662612915039} 11/07/2021 04:18:51 - INFO - __main__ - Step 50260: {'lr': 0.00037990446436177925, 'samples': 9649920, 'steps': 50259, 'loss/train': 1.17159104347229} 11/07/2021 04:18:52 - INFO - __main__ - Step 50261: {'lr': 0.0003798999302565954, 'samples': 9650112, 'steps': 50260, 'loss/train': 1.5639235973358154} 11/07/2021 04:18:52 - INFO - __main__ - Step 50262: {'lr': 0.0003798953960928807, 'samples': 9650304, 'steps': 50261, 'loss/train': 1.9105384349822998} 11/07/2021 04:18:54 - INFO - __main__ - Step 50263: {'lr': 0.0003798908618706371, 'samples': 9650496, 'steps': 50262, 'loss/train': 1.8271366357803345} 11/07/2021 04:18:54 - INFO - __main__ - Step 50264: {'lr': 0.0003798863275898667, 'samples': 9650688, 'steps': 50263, 'loss/train': 1.162956953048706} 11/07/2021 04:18:54 - INFO - __main__ - Step 50265: {'lr': 0.00037988179325057156, 'samples': 9650880, 'steps': 50264, 'loss/train': 1.6414152383804321} 11/07/2021 04:18:55 - INFO - __main__ - Step 50266: {'lr': 0.0003798772588527536, 'samples': 9651072, 'steps': 50265, 'loss/train': 1.4641472101211548} 11/07/2021 04:18:55 - INFO - __main__ - Step 50267: {'lr': 0.000379872724396415, 'samples': 9651264, 'steps': 50266, 'loss/train': 0.9950963258743286} 11/07/2021 04:18:55 - INFO - __main__ - Step 50268: {'lr': 0.00037986818988155775, 'samples': 9651456, 'steps': 50267, 'loss/train': 1.4459974765777588} 11/07/2021 04:18:57 - INFO - __main__ - Step 50269: {'lr': 0.0003798636553081839, 'samples': 9651648, 'steps': 50268, 'loss/train': 1.2771360874176025} 11/07/2021 04:18:57 - INFO - __main__ - Step 50270: {'lr': 0.0003798591206762955, 'samples': 9651840, 'steps': 50269, 'loss/train': 1.2188576459884644} 11/07/2021 04:18:57 - INFO - __main__ - Step 50271: {'lr': 0.0003798545859858945, 'samples': 9652032, 'steps': 50270, 'loss/train': 1.3124229907989502} 11/07/2021 04:18:58 - INFO - __main__ - Step 50272: {'lr': 0.0003798500512369832, 'samples': 9652224, 'steps': 50271, 'loss/train': 1.11708402633667} 11/07/2021 04:18:58 - INFO - __main__ - Step 50273: {'lr': 0.00037984551642956336, 'samples': 9652416, 'steps': 50272, 'loss/train': 1.3915640115737915} 11/07/2021 04:18:59 - INFO - __main__ - Step 50274: {'lr': 0.0003798409815636371, 'samples': 9652608, 'steps': 50273, 'loss/train': 1.2666629552841187} 11/07/2021 04:18:59 - INFO - __main__ - Step 50275: {'lr': 0.00037983644663920656, 'samples': 9652800, 'steps': 50274, 'loss/train': 1.1245124340057373} 11/07/2021 04:19:00 - INFO - __main__ - Step 50276: {'lr': 0.0003798319116562737, 'samples': 9652992, 'steps': 50275, 'loss/train': 1.2249014377593994} 11/07/2021 04:19:00 - INFO - __main__ - Step 50277: {'lr': 0.00037982737661484056, 'samples': 9653184, 'steps': 50276, 'loss/train': 1.4194586277008057} 11/07/2021 04:19:01 - INFO - __main__ - Step 50278: {'lr': 0.00037982284151490933, 'samples': 9653376, 'steps': 50277, 'loss/train': 0.9645190834999084} 11/07/2021 04:19:02 - INFO - __main__ - Step 50279: {'lr': 0.00037981830635648177, 'samples': 9653568, 'steps': 50278, 'loss/train': 1.2584521770477295} 11/07/2021 04:19:02 - INFO - __main__ - Step 50280: {'lr': 0.0003798137711395602, 'samples': 9653760, 'steps': 50279, 'loss/train': 1.2484103441238403} 11/07/2021 04:19:02 - INFO - __main__ - Step 50281: {'lr': 0.00037980923586414646, 'samples': 9653952, 'steps': 50280, 'loss/train': 1.3946747779846191} 11/07/2021 04:19:03 - INFO - __main__ - Step 50282: {'lr': 0.0003798047005302427, 'samples': 9654144, 'steps': 50281, 'loss/train': 1.743774175643921} 11/07/2021 04:19:03 - INFO - __main__ - Step 50283: {'lr': 0.000379800165137851, 'samples': 9654336, 'steps': 50282, 'loss/train': 1.0040243864059448} 11/07/2021 04:19:04 - INFO - __main__ - Step 50284: {'lr': 0.00037979562968697324, 'samples': 9654528, 'steps': 50283, 'loss/train': 1.5647839307785034} 11/07/2021 04:19:04 - INFO - __main__ - Step 50285: {'lr': 0.0003797910941776117, 'samples': 9654720, 'steps': 50284, 'loss/train': 0.07238461077213287} 11/07/2021 04:19:05 - INFO - __main__ - Step 50286: {'lr': 0.00037978655860976826, 'samples': 9654912, 'steps': 50285, 'loss/train': 1.6101865768432617} 11/07/2021 04:19:05 - INFO - __main__ - Step 50287: {'lr': 0.00037978202298344496, 'samples': 9655104, 'steps': 50286, 'loss/train': 1.595892310142517} 11/07/2021 04:19:05 - INFO - __main__ - Step 50288: {'lr': 0.0003797774872986439, 'samples': 9655296, 'steps': 50287, 'loss/train': 1.5872743129730225} 11/07/2021 04:19:06 - INFO - __main__ - Step 50289: {'lr': 0.00037977295155536706, 'samples': 9655488, 'steps': 50288, 'loss/train': 1.563896894454956} 11/07/2021 04:19:07 - INFO - __main__ - Step 50290: {'lr': 0.00037976841575361665, 'samples': 9655680, 'steps': 50289, 'loss/train': 1.583510398864746} 11/07/2021 04:19:07 - INFO - __main__ - Step 50291: {'lr': 0.00037976387989339445, 'samples': 9655872, 'steps': 50290, 'loss/train': 1.1028339862823486} 11/07/2021 04:19:07 - INFO - __main__ - Step 50292: {'lr': 0.0003797593439747028, 'samples': 9656064, 'steps': 50291, 'loss/train': 1.603823184967041} 11/07/2021 04:19:08 - INFO - __main__ - Step 50293: {'lr': 0.0003797548079975435, 'samples': 9656256, 'steps': 50292, 'loss/train': 1.4887603521347046} 11/07/2021 04:19:08 - INFO - __main__ - Step 50294: {'lr': 0.0003797502719619187, 'samples': 9656448, 'steps': 50293, 'loss/train': 1.4273159503936768} 11/07/2021 04:19:09 - INFO - __main__ - Step 50295: {'lr': 0.0003797457358678304, 'samples': 9656640, 'steps': 50294, 'loss/train': 1.3308401107788086} 11/07/2021 04:19:10 - INFO - __main__ - Step 50296: {'lr': 0.0003797411997152807, 'samples': 9656832, 'steps': 50295, 'loss/train': 0.9965770244598389} 11/07/2021 04:19:10 - INFO - __main__ - Step 50297: {'lr': 0.0003797366635042716, 'samples': 9657024, 'steps': 50296, 'loss/train': 1.4599796533584595} 11/07/2021 04:19:10 - INFO - __main__ - Step 50298: {'lr': 0.0003797321272348052, 'samples': 9657216, 'steps': 50297, 'loss/train': 1.2639737129211426} 11/07/2021 04:19:11 - INFO - __main__ - Step 50299: {'lr': 0.00037972759090688354, 'samples': 9657408, 'steps': 50298, 'loss/train': 1.2504427433013916} 11/07/2021 04:19:12 - INFO - __main__ - Step 50300: {'lr': 0.0003797230545205086, 'samples': 9657600, 'steps': 50299, 'loss/train': 1.3856931924819946} 11/07/2021 04:19:12 - INFO - __main__ - Step 50301: {'lr': 0.00037971851807568237, 'samples': 9657792, 'steps': 50300, 'loss/train': 1.50432288646698} 11/07/2021 04:19:12 - INFO - __main__ - Step 50302: {'lr': 0.000379713981572407, 'samples': 9657984, 'steps': 50301, 'loss/train': 1.360334873199463} 11/07/2021 04:19:13 - INFO - __main__ - Step 50303: {'lr': 0.0003797094450106846, 'samples': 9658176, 'steps': 50302, 'loss/train': 1.5017192363739014} 11/07/2021 04:19:13 - INFO - __main__ - Step 50304: {'lr': 0.00037970490839051707, 'samples': 9658368, 'steps': 50303, 'loss/train': 1.1480355262756348} 11/07/2021 04:19:14 - INFO - __main__ - Step 50305: {'lr': 0.00037970037171190655, 'samples': 9658560, 'steps': 50304, 'loss/train': 1.4229942560195923} 11/07/2021 04:19:15 - INFO - __main__ - Step 50306: {'lr': 0.000379695834974855, 'samples': 9658752, 'steps': 50305, 'loss/train': 0.122550368309021} 11/07/2021 04:19:15 - INFO - __main__ - Step 50307: {'lr': 0.0003796912981793645, 'samples': 9658944, 'steps': 50306, 'loss/train': 1.2545815706253052} 11/07/2021 04:19:15 - INFO - __main__ - Step 50308: {'lr': 0.0003796867613254371, 'samples': 9659136, 'steps': 50307, 'loss/train': 1.4521191120147705} 11/07/2021 04:19:16 - INFO - __main__ - Step 50309: {'lr': 0.0003796822244130749, 'samples': 9659328, 'steps': 50308, 'loss/train': 1.373075246810913} 11/07/2021 04:19:17 - INFO - __main__ - Step 50310: {'lr': 0.00037967768744227984, 'samples': 9659520, 'steps': 50309, 'loss/train': 1.7056126594543457} 11/07/2021 04:19:17 - INFO - __main__ - Step 50311: {'lr': 0.000379673150413054, 'samples': 9659712, 'steps': 50310, 'loss/train': 1.5011404752731323} 11/07/2021 04:19:17 - INFO - __main__ - Step 50312: {'lr': 0.00037966861332539947, 'samples': 9659904, 'steps': 50311, 'loss/train': 1.3286014795303345} 11/07/2021 04:19:18 - INFO - __main__ - Step 50313: {'lr': 0.0003796640761793183, 'samples': 9660096, 'steps': 50312, 'loss/train': 1.5061498880386353} 11/07/2021 04:19:18 - INFO - __main__ - Step 50314: {'lr': 0.00037965953897481244, 'samples': 9660288, 'steps': 50313, 'loss/train': 1.6554516553878784} 11/07/2021 04:19:19 - INFO - __main__ - Step 50315: {'lr': 0.00037965500171188406, 'samples': 9660480, 'steps': 50314, 'loss/train': 1.2993578910827637} 11/07/2021 04:19:19 - INFO - __main__ - Step 50316: {'lr': 0.00037965046439053507, 'samples': 9660672, 'steps': 50315, 'loss/train': 0.6360580325126648} 11/07/2021 04:19:20 - INFO - __main__ - Step 50317: {'lr': 0.00037964592701076753, 'samples': 9660864, 'steps': 50316, 'loss/train': 1.4160159826278687} 11/07/2021 04:19:20 - INFO - __main__ - Step 50318: {'lr': 0.00037964138957258367, 'samples': 9661056, 'steps': 50317, 'loss/train': 0.6304948925971985} 11/07/2021 04:19:20 - INFO - __main__ - Step 50319: {'lr': 0.0003796368520759854, 'samples': 9661248, 'steps': 50318, 'loss/train': 1.1022051572799683} 11/07/2021 04:19:22 - INFO - __main__ - Step 50320: {'lr': 0.00037963231452097467, 'samples': 9661440, 'steps': 50319, 'loss/train': 1.4201772212982178} 11/07/2021 04:19:22 - INFO - __main__ - Step 50321: {'lr': 0.00037962777690755365, 'samples': 9661632, 'steps': 50320, 'loss/train': 1.3736419677734375} 11/07/2021 04:19:22 - INFO - __main__ - Step 50322: {'lr': 0.00037962323923572427, 'samples': 9661824, 'steps': 50321, 'loss/train': 1.2573219537734985} 11/07/2021 04:19:23 - INFO - __main__ - Step 50323: {'lr': 0.0003796187015054888, 'samples': 9662016, 'steps': 50322, 'loss/train': 1.9343341588974} 11/07/2021 04:19:23 - INFO - __main__ - Step 50324: {'lr': 0.00037961416371684907, 'samples': 9662208, 'steps': 50323, 'loss/train': 0.29651209712028503} 11/07/2021 04:19:24 - INFO - __main__ - Step 50325: {'lr': 0.0003796096258698073, 'samples': 9662400, 'steps': 50324, 'loss/train': 1.1519948244094849} 11/07/2021 04:19:24 - INFO - __main__ - Step 50326: {'lr': 0.0003796050879643653, 'samples': 9662592, 'steps': 50325, 'loss/train': 1.5263431072235107} 11/07/2021 04:19:25 - INFO - __main__ - Step 50327: {'lr': 0.0003796005500005253, 'samples': 9662784, 'steps': 50326, 'loss/train': 1.3878707885742188} 11/07/2021 04:19:25 - INFO - __main__ - Step 50328: {'lr': 0.0003795960119782893, 'samples': 9662976, 'steps': 50327, 'loss/train': 0.9455971121788025} 11/07/2021 04:19:25 - INFO - __main__ - Step 50329: {'lr': 0.0003795914738976594, 'samples': 9663168, 'steps': 50328, 'loss/train': 1.1657785177230835} 11/07/2021 04:19:26 - INFO - __main__ - Step 50330: {'lr': 0.00037958693575863747, 'samples': 9663360, 'steps': 50329, 'loss/train': 1.1909018754959106} 11/07/2021 04:19:27 - INFO - __main__ - Step 50331: {'lr': 0.0003795823975612257, 'samples': 9663552, 'steps': 50330, 'loss/train': 1.3688995838165283} 11/07/2021 04:19:27 - INFO - __main__ - Step 50332: {'lr': 0.0003795778593054261, 'samples': 9663744, 'steps': 50331, 'loss/train': 1.1265391111373901} 11/07/2021 04:19:28 - INFO - __main__ - Step 50333: {'lr': 0.00037957332099124066, 'samples': 9663936, 'steps': 50332, 'loss/train': 1.5496602058410645} 11/07/2021 04:19:28 - INFO - __main__ - Step 50334: {'lr': 0.00037956878261867163, 'samples': 9664128, 'steps': 50333, 'loss/train': 1.4843788146972656} 11/07/2021 04:19:29 - INFO - __main__ - Step 50335: {'lr': 0.0003795642441877208, 'samples': 9664320, 'steps': 50334, 'loss/train': 1.6729861497879028} 11/07/2021 04:19:29 - INFO - __main__ - Step 50336: {'lr': 0.0003795597056983903, 'samples': 9664512, 'steps': 50335, 'loss/train': 1.5528745651245117} 11/07/2021 04:19:30 - INFO - __main__ - Step 50337: {'lr': 0.0003795551671506823, 'samples': 9664704, 'steps': 50336, 'loss/train': 1.9175281524658203} 11/07/2021 04:19:30 - INFO - __main__ - Step 50338: {'lr': 0.0003795506285445987, 'samples': 9664896, 'steps': 50337, 'loss/train': 1.3358761072158813} 11/07/2021 04:19:30 - INFO - __main__ - Step 50339: {'lr': 0.0003795460898801415, 'samples': 9665088, 'steps': 50338, 'loss/train': 1.4042829275131226} 11/07/2021 04:19:31 - INFO - __main__ - Step 50340: {'lr': 0.00037954155115731294, 'samples': 9665280, 'steps': 50339, 'loss/train': 1.368674635887146} 11/07/2021 04:19:32 - INFO - __main__ - Step 50341: {'lr': 0.0003795370123761149, 'samples': 9665472, 'steps': 50340, 'loss/train': 1.57216215133667} 11/07/2021 04:19:32 - INFO - __main__ - Step 50342: {'lr': 0.00037953247353654946, 'samples': 9665664, 'steps': 50341, 'loss/train': 1.0828821659088135} 11/07/2021 04:19:32 - INFO - __main__ - Step 50343: {'lr': 0.00037952793463861867, 'samples': 9665856, 'steps': 50342, 'loss/train': 1.0894135236740112} 11/07/2021 04:19:33 - INFO - __main__ - Step 50344: {'lr': 0.0003795233956823246, 'samples': 9666048, 'steps': 50343, 'loss/train': 1.427432894706726} 11/07/2021 04:19:34 - INFO - __main__ - Step 50345: {'lr': 0.0003795188566676694, 'samples': 9666240, 'steps': 50344, 'loss/train': 1.0008103847503662} 11/07/2021 04:19:35 - INFO - __main__ - Step 50346: {'lr': 0.00037951431759465496, 'samples': 9666432, 'steps': 50345, 'loss/train': 1.2678896188735962} 11/07/2021 04:19:35 - INFO - __main__ - Step 50347: {'lr': 0.0003795097784632833, 'samples': 9666624, 'steps': 50346, 'loss/train': 1.6455512046813965} 11/07/2021 04:19:35 - INFO - __main__ - Step 50348: {'lr': 0.00037950523927355657, 'samples': 9666816, 'steps': 50347, 'loss/train': 1.758359670639038} 11/07/2021 04:19:36 - INFO - __main__ - Step 50349: {'lr': 0.0003795007000254768, 'samples': 9667008, 'steps': 50348, 'loss/train': 1.6530606746673584} 11/07/2021 04:19:36 - INFO - __main__ - Step 50350: {'lr': 0.00037949616071904593, 'samples': 9667200, 'steps': 50349, 'loss/train': 5.764377593994141} 11/07/2021 04:19:36 - INFO - __main__ - Step 50351: {'lr': 0.0003794916213542662, 'samples': 9667392, 'steps': 50350, 'loss/train': 1.215613842010498} 11/07/2021 04:19:37 - INFO - __main__ - Step 50352: {'lr': 0.00037948708193113947, 'samples': 9667584, 'steps': 50351, 'loss/train': 0.9372159838676453} 11/07/2021 04:19:38 - INFO - __main__ - Step 50353: {'lr': 0.00037948254244966786, 'samples': 9667776, 'steps': 50352, 'loss/train': 1.5439207553863525} 11/07/2021 04:19:38 - INFO - __main__ - Step 50354: {'lr': 0.00037947800290985344, 'samples': 9667968, 'steps': 50353, 'loss/train': 1.3743999004364014} 11/07/2021 04:19:38 - INFO - __main__ - Step 50355: {'lr': 0.00037947346331169816, 'samples': 9668160, 'steps': 50354, 'loss/train': 1.4863427877426147} 11/07/2021 04:19:39 - INFO - __main__ - Step 50356: {'lr': 0.00037946892365520423, 'samples': 9668352, 'steps': 50355, 'loss/train': 1.680455207824707} 11/07/2021 04:19:40 - INFO - __main__ - Step 50357: {'lr': 0.00037946438394037356, 'samples': 9668544, 'steps': 50356, 'loss/train': 1.0879682302474976} 11/07/2021 04:19:40 - INFO - __main__ - Step 50358: {'lr': 0.00037945984416720826, 'samples': 9668736, 'steps': 50357, 'loss/train': 1.3697302341461182} 11/07/2021 04:19:41 - INFO - __main__ - Step 50359: {'lr': 0.0003794553043357104, 'samples': 9668928, 'steps': 50358, 'loss/train': 1.5519213676452637} 11/07/2021 04:19:41 - INFO - __main__ - Step 50360: {'lr': 0.0003794507644458819, 'samples': 9669120, 'steps': 50359, 'loss/train': 1.4177567958831787} 11/07/2021 04:19:41 - INFO - __main__ - Step 50361: {'lr': 0.00037944622449772485, 'samples': 9669312, 'steps': 50360, 'loss/train': 1.1745578050613403} 11/07/2021 04:19:42 - INFO - __main__ - Step 50362: {'lr': 0.0003794416844912414, 'samples': 9669504, 'steps': 50361, 'loss/train': 1.5298815965652466} 11/07/2021 04:19:43 - INFO - __main__ - Step 50363: {'lr': 0.0003794371444264335, 'samples': 9669696, 'steps': 50362, 'loss/train': 1.569003939628601} 11/07/2021 04:19:43 - INFO - __main__ - Step 50364: {'lr': 0.00037943260430330317, 'samples': 9669888, 'steps': 50363, 'loss/train': 1.3630812168121338} 11/07/2021 04:19:44 - INFO - __main__ - Step 50365: {'lr': 0.00037942806412185254, 'samples': 9670080, 'steps': 50364, 'loss/train': 1.5234861373901367} 11/07/2021 04:19:44 - INFO - __main__ - Step 50366: {'lr': 0.0003794235238820837, 'samples': 9670272, 'steps': 50365, 'loss/train': 1.5544954538345337} 11/07/2021 04:19:44 - INFO - __main__ - Step 50367: {'lr': 0.0003794189835839985, 'samples': 9670464, 'steps': 50366, 'loss/train': 1.484514832496643} 11/07/2021 04:19:45 - INFO - __main__ - Step 50368: {'lr': 0.0003794144432275992, 'samples': 9670656, 'steps': 50367, 'loss/train': 0.7086828947067261} 11/07/2021 04:19:46 - INFO - __main__ - Step 50369: {'lr': 0.0003794099028128877, 'samples': 9670848, 'steps': 50368, 'loss/train': 1.7169569730758667} 11/07/2021 04:19:46 - INFO - __main__ - Step 50370: {'lr': 0.0003794053623398661, 'samples': 9671040, 'steps': 50369, 'loss/train': 1.511048436164856} 11/07/2021 04:19:46 - INFO - __main__ - Step 50371: {'lr': 0.00037940082180853643, 'samples': 9671232, 'steps': 50370, 'loss/train': 1.7683322429656982} 11/07/2021 04:19:47 - INFO - __main__ - Step 50372: {'lr': 0.0003793962812189008, 'samples': 9671424, 'steps': 50371, 'loss/train': 1.5911660194396973} 11/07/2021 04:19:48 - INFO - __main__ - Step 50373: {'lr': 0.00037939174057096114, 'samples': 9671616, 'steps': 50372, 'loss/train': 1.3682141304016113} 11/07/2021 04:19:48 - INFO - __main__ - Step 50374: {'lr': 0.0003793871998647196, 'samples': 9671808, 'steps': 50373, 'loss/train': 1.1681374311447144} 11/07/2021 04:19:49 - INFO - __main__ - Step 50375: {'lr': 0.00037938265910017813, 'samples': 9672000, 'steps': 50374, 'loss/train': 1.4625418186187744} 11/07/2021 04:19:49 - INFO - __main__ - Step 50376: {'lr': 0.0003793781182773388, 'samples': 9672192, 'steps': 50375, 'loss/train': 1.2847131490707397} 11/07/2021 04:19:49 - INFO - __main__ - Step 50377: {'lr': 0.00037937357739620383, 'samples': 9672384, 'steps': 50376, 'loss/train': 1.396962285041809} 11/07/2021 04:19:50 - INFO - __main__ - Step 50378: {'lr': 0.000379369036456775, 'samples': 9672576, 'steps': 50377, 'loss/train': 1.40147066116333} 11/07/2021 04:19:51 - INFO - __main__ - Step 50379: {'lr': 0.00037936449545905457, 'samples': 9672768, 'steps': 50378, 'loss/train': 1.1427726745605469} 11/07/2021 04:19:51 - INFO - __main__ - Step 50380: {'lr': 0.0003793599544030444, 'samples': 9672960, 'steps': 50379, 'loss/train': 0.7213249802589417} 11/07/2021 04:19:51 - INFO - __main__ - Step 50381: {'lr': 0.00037935541328874665, 'samples': 9673152, 'steps': 50380, 'loss/train': 0.06385089457035065} 11/07/2021 04:19:52 - INFO - __main__ - Step 50382: {'lr': 0.0003793508721161634, 'samples': 9673344, 'steps': 50381, 'loss/train': 1.3770196437835693} 11/07/2021 04:19:53 - INFO - __main__ - Step 50383: {'lr': 0.00037934633088529656, 'samples': 9673536, 'steps': 50382, 'loss/train': 1.456033706665039} 11/07/2021 04:19:53 - INFO - __main__ - Step 50384: {'lr': 0.00037934178959614834, 'samples': 9673728, 'steps': 50383, 'loss/train': 0.8930547833442688} 11/07/2021 04:19:54 - INFO - __main__ - Step 50385: {'lr': 0.00037933724824872067, 'samples': 9673920, 'steps': 50384, 'loss/train': 1.4878311157226562} 11/07/2021 04:19:54 - INFO - __main__ - Step 50386: {'lr': 0.00037933270684301567, 'samples': 9674112, 'steps': 50385, 'loss/train': 1.4187685251235962} 11/07/2021 04:19:54 - INFO - __main__ - Step 50387: {'lr': 0.00037932816537903535, 'samples': 9674304, 'steps': 50386, 'loss/train': 1.256471037864685} 11/07/2021 04:19:55 - INFO - __main__ - Step 50388: {'lr': 0.0003793236238567817, 'samples': 9674496, 'steps': 50387, 'loss/train': 1.0572724342346191} 11/07/2021 04:19:56 - INFO - __main__ - Step 50389: {'lr': 0.00037931908227625686, 'samples': 9674688, 'steps': 50388, 'loss/train': 1.6994491815567017} 11/07/2021 04:19:56 - INFO - __main__ - Step 50390: {'lr': 0.0003793145406374628, 'samples': 9674880, 'steps': 50389, 'loss/train': 1.6414024829864502} 11/07/2021 04:19:57 - INFO - __main__ - Step 50391: {'lr': 0.0003793099989404016, 'samples': 9675072, 'steps': 50390, 'loss/train': 0.18833202123641968} 11/07/2021 04:19:57 - INFO - __main__ - Step 50392: {'lr': 0.00037930545718507536, 'samples': 9675264, 'steps': 50391, 'loss/train': 1.4667836427688599} 11/07/2021 04:19:58 - INFO - __main__ - Step 50393: {'lr': 0.000379300915371486, 'samples': 9675456, 'steps': 50392, 'loss/train': 1.5009870529174805} 11/07/2021 04:19:58 - INFO - __main__ - Step 50394: {'lr': 0.00037929637349963573, 'samples': 9675648, 'steps': 50393, 'loss/train': 1.3425657749176025} 11/07/2021 04:19:59 - INFO - __main__ - Step 50395: {'lr': 0.00037929183156952653, 'samples': 9675840, 'steps': 50394, 'loss/train': 1.0846537351608276} 11/07/2021 04:19:59 - INFO - __main__ - Step 50396: {'lr': 0.00037928728958116034, 'samples': 9676032, 'steps': 50395, 'loss/train': 1.7205784320831299} 11/07/2021 04:19:59 - INFO - __main__ - Step 50397: {'lr': 0.0003792827475345393, 'samples': 9676224, 'steps': 50396, 'loss/train': 2.1761927604675293} 11/07/2021 04:20:00 - INFO - __main__ - Step 50398: {'lr': 0.00037927820542966545, 'samples': 9676416, 'steps': 50397, 'loss/train': 1.4321573972702026} 11/07/2021 04:20:01 - INFO - __main__ - Step 50399: {'lr': 0.0003792736632665409, 'samples': 9676608, 'steps': 50398, 'loss/train': 1.4543241262435913} 11/07/2021 04:20:01 - INFO - __main__ - Step 50400: {'lr': 0.0003792691210451676, 'samples': 9676800, 'steps': 50399, 'loss/train': 1.7749804258346558} 11/07/2021 04:20:01 - INFO - __main__ - Step 50401: {'lr': 0.0003792645787655476, 'samples': 9676992, 'steps': 50400, 'loss/train': 1.9975613355636597} 11/07/2021 04:20:02 - INFO - __main__ - Step 50402: {'lr': 0.000379260036427683, 'samples': 9677184, 'steps': 50401, 'loss/train': 2.0390677452087402} 11/07/2021 04:20:03 - INFO - __main__ - Step 50403: {'lr': 0.0003792554940315758, 'samples': 9677376, 'steps': 50402, 'loss/train': 0.8864619731903076} 11/07/2021 04:20:03 - INFO - __main__ - Step 50404: {'lr': 0.00037925095157722807, 'samples': 9677568, 'steps': 50403, 'loss/train': 0.966880738735199} 11/07/2021 04:20:03 - INFO - __main__ - Step 50405: {'lr': 0.0003792464090646419, 'samples': 9677760, 'steps': 50404, 'loss/train': 1.2450422048568726} 11/07/2021 04:20:04 - INFO - __main__ - Step 50406: {'lr': 0.00037924186649381924, 'samples': 9677952, 'steps': 50405, 'loss/train': 1.5221196413040161} 11/07/2021 04:20:04 - INFO - __main__ - Step 50407: {'lr': 0.00037923732386476225, 'samples': 9678144, 'steps': 50406, 'loss/train': 0.800771951675415} 11/07/2021 04:20:05 - INFO - __main__ - Step 50408: {'lr': 0.0003792327811774728, 'samples': 9678336, 'steps': 50407, 'loss/train': 1.0960415601730347} 11/07/2021 04:20:05 - INFO - __main__ - Step 50409: {'lr': 0.00037922823843195317, 'samples': 9678528, 'steps': 50408, 'loss/train': 0.9421395659446716} 11/07/2021 04:20:06 - INFO - __main__ - Step 50410: {'lr': 0.00037922369562820525, 'samples': 9678720, 'steps': 50409, 'loss/train': 1.4728403091430664} 11/07/2021 04:20:06 - INFO - __main__ - Step 50411: {'lr': 0.00037921915276623106, 'samples': 9678912, 'steps': 50410, 'loss/train': 1.227642297744751} 11/07/2021 04:20:07 - INFO - __main__ - Step 50412: {'lr': 0.00037921460984603284, 'samples': 9679104, 'steps': 50411, 'loss/train': 1.0253260135650635} 11/07/2021 04:20:08 - INFO - __main__ - Step 50413: {'lr': 0.0003792100668676125, 'samples': 9679296, 'steps': 50412, 'loss/train': 1.12844717502594} 11/07/2021 04:20:08 - INFO - __main__ - Step 50414: {'lr': 0.000379205523830972, 'samples': 9679488, 'steps': 50413, 'loss/train': 1.1844233274459839} 11/07/2021 04:20:08 - INFO - __main__ - Step 50415: {'lr': 0.0003792009807361135, 'samples': 9679680, 'steps': 50414, 'loss/train': 1.5617855787277222} 11/07/2021 04:20:09 - INFO - __main__ - Step 50416: {'lr': 0.00037919643758303913, 'samples': 9679872, 'steps': 50415, 'loss/train': 1.4274672269821167} 11/07/2021 04:20:09 - INFO - __main__ - Step 50417: {'lr': 0.0003791918943717507, 'samples': 9680064, 'steps': 50416, 'loss/train': 1.6753476858139038} 11/07/2021 04:20:09 - INFO - __main__ - Step 50418: {'lr': 0.0003791873511022505, 'samples': 9680256, 'steps': 50417, 'loss/train': 1.258516550064087} 11/07/2021 04:20:10 - INFO - __main__ - Step 50419: {'lr': 0.0003791828077745405, 'samples': 9680448, 'steps': 50418, 'loss/train': 1.3263862133026123} 11/07/2021 04:20:11 - INFO - __main__ - Step 50420: {'lr': 0.00037917826438862263, 'samples': 9680640, 'steps': 50419, 'loss/train': 1.6234769821166992} 11/07/2021 04:20:11 - INFO - __main__ - Step 50421: {'lr': 0.0003791737209444991, 'samples': 9680832, 'steps': 50420, 'loss/train': 1.2203651666641235} 11/07/2021 04:20:11 - INFO - __main__ - Step 50422: {'lr': 0.00037916917744217185, 'samples': 9681024, 'steps': 50421, 'loss/train': 1.0994569063186646} 11/07/2021 04:20:12 - INFO - __main__ - Step 50423: {'lr': 0.0003791646338816429, 'samples': 9681216, 'steps': 50422, 'loss/train': 1.255558967590332} 11/07/2021 04:20:13 - INFO - __main__ - Step 50424: {'lr': 0.0003791600902629144, 'samples': 9681408, 'steps': 50423, 'loss/train': 1.5016043186187744} 11/07/2021 04:20:13 - INFO - __main__ - Step 50425: {'lr': 0.0003791555465859884, 'samples': 9681600, 'steps': 50424, 'loss/train': 1.4460242986679077} 11/07/2021 04:20:14 - INFO - __main__ - Step 50426: {'lr': 0.0003791510028508669, 'samples': 9681792, 'steps': 50425, 'loss/train': 0.0705665796995163} 11/07/2021 04:20:14 - INFO - __main__ - Step 50427: {'lr': 0.0003791464590575519, 'samples': 9681984, 'steps': 50426, 'loss/train': 1.5198925733566284} 11/07/2021 04:20:14 - INFO - __main__ - Step 50428: {'lr': 0.0003791419152060455, 'samples': 9682176, 'steps': 50427, 'loss/train': 1.438368558883667} 11/07/2021 04:20:15 - INFO - __main__ - Step 50429: {'lr': 0.00037913737129634977, 'samples': 9682368, 'steps': 50428, 'loss/train': 1.4559372663497925} 11/07/2021 04:20:16 - INFO - __main__ - Step 50430: {'lr': 0.00037913282732846676, 'samples': 9682560, 'steps': 50429, 'loss/train': 1.1034575700759888} 11/07/2021 04:20:16 - INFO - __main__ - Step 50431: {'lr': 0.0003791282833023985, 'samples': 9682752, 'steps': 50430, 'loss/train': 0.6443070769309998} 11/07/2021 04:20:16 - INFO - __main__ - Step 50432: {'lr': 0.0003791237392181469, 'samples': 9682944, 'steps': 50431, 'loss/train': 1.4389891624450684} 11/07/2021 04:20:17 - INFO - __main__ - Step 50433: {'lr': 0.0003791191950757143, 'samples': 9683136, 'steps': 50432, 'loss/train': 1.4601596593856812} 11/07/2021 04:20:18 - INFO - __main__ - Step 50434: {'lr': 0.0003791146508751025, 'samples': 9683328, 'steps': 50433, 'loss/train': 1.0389693975448608} 11/07/2021 04:20:18 - INFO - __main__ - Step 50435: {'lr': 0.00037911010661631364, 'samples': 9683520, 'steps': 50434, 'loss/train': 1.6238583326339722} 11/07/2021 04:20:18 - INFO - __main__ - Step 50436: {'lr': 0.0003791055622993498, 'samples': 9683712, 'steps': 50435, 'loss/train': 1.509716510772705} 11/07/2021 04:20:19 - INFO - __main__ - Step 50437: {'lr': 0.0003791010179242129, 'samples': 9683904, 'steps': 50436, 'loss/train': 1.1959272623062134} 11/07/2021 04:20:19 - INFO - __main__ - Step 50438: {'lr': 0.0003790964734909051, 'samples': 9684096, 'steps': 50437, 'loss/train': 1.2539235353469849} 11/07/2021 04:20:20 - INFO - __main__ - Step 50439: {'lr': 0.00037909192899942846, 'samples': 9684288, 'steps': 50438, 'loss/train': 1.107391357421875} 11/07/2021 04:20:21 - INFO - __main__ - Step 50440: {'lr': 0.00037908738444978495, 'samples': 9684480, 'steps': 50439, 'loss/train': 1.3870924711227417} 11/07/2021 04:20:21 - INFO - __main__ - Step 50441: {'lr': 0.00037908283984197666, 'samples': 9684672, 'steps': 50440, 'loss/train': 1.7239774465560913} 11/07/2021 04:20:21 - INFO - __main__ - Step 50442: {'lr': 0.0003790782951760057, 'samples': 9684864, 'steps': 50441, 'loss/train': 1.4376455545425415} 11/07/2021 04:20:22 - INFO - __main__ - Step 50443: {'lr': 0.000379073750451874, 'samples': 9685056, 'steps': 50442, 'loss/train': 1.0883122682571411} 11/07/2021 04:20:23 - INFO - __main__ - Step 50444: {'lr': 0.00037906920566958363, 'samples': 9685248, 'steps': 50443, 'loss/train': 1.358377456665039} 11/07/2021 04:20:23 - INFO - __main__ - Step 50445: {'lr': 0.0003790646608291367, 'samples': 9685440, 'steps': 50444, 'loss/train': 1.310596227645874} 11/07/2021 04:20:23 - INFO - __main__ - Step 50446: {'lr': 0.00037906011593053527, 'samples': 9685632, 'steps': 50445, 'loss/train': 1.6248341798782349} 11/07/2021 04:20:24 - INFO - __main__ - Step 50447: {'lr': 0.00037905557097378127, 'samples': 9685824, 'steps': 50446, 'loss/train': 1.4741896390914917} 11/07/2021 04:20:24 - INFO - __main__ - Step 50448: {'lr': 0.00037905102595887685, 'samples': 9686016, 'steps': 50447, 'loss/train': 1.413648247718811} 11/07/2021 04:20:25 - INFO - __main__ - Step 50449: {'lr': 0.00037904648088582407, 'samples': 9686208, 'steps': 50448, 'loss/train': 1.1509143114089966} 11/07/2021 04:20:25 - INFO - __main__ - Step 50450: {'lr': 0.0003790419357546249, 'samples': 9686400, 'steps': 50449, 'loss/train': 1.2753708362579346} 11/07/2021 04:20:26 - INFO - __main__ - Step 50451: {'lr': 0.0003790373905652814, 'samples': 9686592, 'steps': 50450, 'loss/train': 1.6979026794433594} 11/07/2021 04:20:26 - INFO - __main__ - Step 50452: {'lr': 0.0003790328453177957, 'samples': 9686784, 'steps': 50451, 'loss/train': 1.5667774677276611} 11/07/2021 04:20:26 - INFO - __main__ - Step 50453: {'lr': 0.0003790283000121697, 'samples': 9686976, 'steps': 50452, 'loss/train': 1.496895670890808} 11/07/2021 04:20:27 - INFO - __main__ - Step 50454: {'lr': 0.0003790237546484056, 'samples': 9687168, 'steps': 50453, 'loss/train': 1.5901826620101929} 11/07/2021 04:20:28 - INFO - __main__ - Step 50455: {'lr': 0.00037901920922650534, 'samples': 9687360, 'steps': 50454, 'loss/train': 1.4874805212020874} 11/07/2021 04:20:28 - INFO - __main__ - Step 50456: {'lr': 0.0003790146637464711, 'samples': 9687552, 'steps': 50455, 'loss/train': 1.5322622060775757} 11/07/2021 04:20:28 - INFO - __main__ - Step 50457: {'lr': 0.0003790101182083048, 'samples': 9687744, 'steps': 50456, 'loss/train': 2.1499805450439453} 11/07/2021 04:20:29 - INFO - __main__ - Step 50458: {'lr': 0.0003790055726120085, 'samples': 9687936, 'steps': 50457, 'loss/train': 1.5269300937652588} 11/07/2021 04:20:29 - INFO - __main__ - Step 50459: {'lr': 0.0003790010269575844, 'samples': 9688128, 'steps': 50458, 'loss/train': 0.9556301236152649} 11/07/2021 04:20:31 - INFO - __main__ - Step 50460: {'lr': 0.00037899648124503426, 'samples': 9688320, 'steps': 50459, 'loss/train': 1.2562488317489624} 11/07/2021 04:20:31 - INFO - __main__ - Step 50461: {'lr': 0.0003789919354743604, 'samples': 9688512, 'steps': 50460, 'loss/train': 1.3984532356262207} 11/07/2021 04:20:31 - INFO - __main__ - Step 50462: {'lr': 0.00037898738964556474, 'samples': 9688704, 'steps': 50461, 'loss/train': 1.095470905303955} 11/07/2021 04:20:32 - INFO - __main__ - Step 50463: {'lr': 0.0003789828437586494, 'samples': 9688896, 'steps': 50462, 'loss/train': 0.21125854551792145} 11/07/2021 04:20:32 - INFO - __main__ - Step 50464: {'lr': 0.0003789782978136163, 'samples': 9689088, 'steps': 50463, 'loss/train': 1.7625631093978882} 11/07/2021 04:20:33 - INFO - __main__ - Step 50465: {'lr': 0.0003789737518104676, 'samples': 9689280, 'steps': 50464, 'loss/train': 1.1391310691833496} 11/07/2021 04:20:34 - INFO - __main__ - Step 50466: {'lr': 0.0003789692057492053, 'samples': 9689472, 'steps': 50465, 'loss/train': 0.7298038005828857} 11/07/2021 04:20:34 - INFO - __main__ - Step 50467: {'lr': 0.0003789646596298315, 'samples': 9689664, 'steps': 50466, 'loss/train': 0.6776963472366333} 11/07/2021 04:20:34 - INFO - __main__ - Step 50468: {'lr': 0.0003789601134523482, 'samples': 9689856, 'steps': 50467, 'loss/train': 1.4561177492141724} 11/07/2021 04:20:35 - INFO - __main__ - Step 50469: {'lr': 0.0003789555672167575, 'samples': 9690048, 'steps': 50468, 'loss/train': 1.309482455253601} 11/07/2021 04:20:36 - INFO - __main__ - Step 50470: {'lr': 0.00037895102092306134, 'samples': 9690240, 'steps': 50469, 'loss/train': 1.7961630821228027} 11/07/2021 04:20:36 - INFO - __main__ - Step 50471: {'lr': 0.00037894647457126186, 'samples': 9690432, 'steps': 50470, 'loss/train': 1.5374242067337036} 11/07/2021 04:20:36 - INFO - __main__ - Step 50472: {'lr': 0.00037894192816136107, 'samples': 9690624, 'steps': 50471, 'loss/train': 1.2444005012512207} 11/07/2021 04:20:37 - INFO - __main__ - Step 50473: {'lr': 0.00037893738169336114, 'samples': 9690816, 'steps': 50472, 'loss/train': 1.5550144910812378} 11/07/2021 04:20:37 - INFO - __main__ - Step 50474: {'lr': 0.00037893283516726397, 'samples': 9691008, 'steps': 50473, 'loss/train': 1.3162457942962646} 11/07/2021 04:20:38 - INFO - __main__ - Step 50475: {'lr': 0.0003789282885830716, 'samples': 9691200, 'steps': 50474, 'loss/train': 2.113288164138794} 11/07/2021 04:20:38 - INFO - __main__ - Step 50476: {'lr': 0.0003789237419407862, 'samples': 9691392, 'steps': 50475, 'loss/train': 1.8515410423278809} 11/07/2021 04:20:39 - INFO - __main__ - Step 50477: {'lr': 0.00037891919524040964, 'samples': 9691584, 'steps': 50476, 'loss/train': 1.5828161239624023} 11/07/2021 04:20:39 - INFO - __main__ - Step 50478: {'lr': 0.0003789146484819442, 'samples': 9691776, 'steps': 50477, 'loss/train': 0.8145389556884766} 11/07/2021 04:20:40 - INFO - __main__ - Step 50479: {'lr': 0.00037891010166539175, 'samples': 9691968, 'steps': 50478, 'loss/train': 1.2109582424163818} 11/07/2021 04:20:41 - INFO - __main__ - Step 50480: {'lr': 0.00037890555479075437, 'samples': 9692160, 'steps': 50479, 'loss/train': 1.516937255859375} 11/07/2021 04:20:42 - INFO - __main__ - Step 50481: {'lr': 0.0003789010078580342, 'samples': 9692352, 'steps': 50480, 'loss/train': 1.3102186918258667} 11/07/2021 04:20:42 - INFO - __main__ - Step 50482: {'lr': 0.00037889646086723325, 'samples': 9692544, 'steps': 50481, 'loss/train': 1.1893430948257446} 11/07/2021 04:20:42 - INFO - __main__ - Step 50483: {'lr': 0.0003788919138183534, 'samples': 9692736, 'steps': 50482, 'loss/train': 1.4249972105026245} 11/07/2021 04:20:43 - INFO - __main__ - Step 50484: {'lr': 0.000378887366711397, 'samples': 9692928, 'steps': 50483, 'loss/train': 1.0710270404815674} 11/07/2021 04:20:43 - INFO - __main__ - Step 50485: {'lr': 0.0003788828195463658, 'samples': 9693120, 'steps': 50484, 'loss/train': 0.8438765406608582} 11/07/2021 04:20:44 - INFO - __main__ - Step 50486: {'lr': 0.0003788782723232621, 'samples': 9693312, 'steps': 50485, 'loss/train': 1.1044734716415405} 11/07/2021 04:20:44 - INFO - __main__ - Step 50487: {'lr': 0.00037887372504208784, 'samples': 9693504, 'steps': 50486, 'loss/train': 0.1660122126340866} 11/07/2021 04:20:45 - INFO - __main__ - Step 50488: {'lr': 0.000378869177702845, 'samples': 9693696, 'steps': 50487, 'loss/train': 1.5479238033294678} 11/07/2021 04:20:45 - INFO - __main__ - Step 50489: {'lr': 0.00037886463030553576, 'samples': 9693888, 'steps': 50488, 'loss/train': 0.8453708291053772} 11/07/2021 04:20:45 - INFO - __main__ - Step 50490: {'lr': 0.0003788600828501621, 'samples': 9694080, 'steps': 50489, 'loss/train': 1.5299060344696045} 11/07/2021 04:20:47 - INFO - __main__ - Step 50491: {'lr': 0.000378855535336726, 'samples': 9694272, 'steps': 50490, 'loss/train': 1.4592078924179077} 11/07/2021 04:20:47 - INFO - __main__ - Step 50492: {'lr': 0.00037885098776522966, 'samples': 9694464, 'steps': 50491, 'loss/train': 1.3624205589294434} 11/07/2021 04:20:47 - INFO - __main__ - Step 50493: {'lr': 0.00037884644013567504, 'samples': 9694656, 'steps': 50492, 'loss/train': 1.2569674253463745} 11/07/2021 04:20:48 - INFO - __main__ - Step 50494: {'lr': 0.0003788418924480642, 'samples': 9694848, 'steps': 50493, 'loss/train': 1.4081571102142334} 11/07/2021 04:20:48 - INFO - __main__ - Step 50495: {'lr': 0.00037883734470239914, 'samples': 9695040, 'steps': 50494, 'loss/train': 0.1726246476173401} 11/07/2021 04:20:49 - INFO - __main__ - Step 50496: {'lr': 0.00037883279689868203, 'samples': 9695232, 'steps': 50495, 'loss/train': 1.0943998098373413} 11/07/2021 04:20:49 - INFO - __main__ - Step 50497: {'lr': 0.00037882824903691484, 'samples': 9695424, 'steps': 50496, 'loss/train': 1.4701080322265625} 11/07/2021 04:20:50 - INFO - __main__ - Step 50498: {'lr': 0.00037882370111709963, 'samples': 9695616, 'steps': 50497, 'loss/train': 1.3461015224456787} 11/07/2021 04:20:50 - INFO - __main__ - Step 50499: {'lr': 0.00037881915313923845, 'samples': 9695808, 'steps': 50498, 'loss/train': 1.5255160331726074} 11/07/2021 04:20:50 - INFO - __main__ - Step 50500: {'lr': 0.0003788146051033333, 'samples': 9696000, 'steps': 50499, 'loss/train': 1.5375534296035767} 11/07/2021 04:20:51 - INFO - __main__ - Step 50501: {'lr': 0.0003788100570093863, 'samples': 9696192, 'steps': 50500, 'loss/train': 1.058691143989563} 11/07/2021 04:20:52 - INFO - __main__ - Step 50502: {'lr': 0.0003788055088573995, 'samples': 9696384, 'steps': 50501, 'loss/train': 1.8498750925064087} 11/07/2021 04:20:52 - INFO - __main__ - Step 50503: {'lr': 0.0003788009606473749, 'samples': 9696576, 'steps': 50502, 'loss/train': 1.3221867084503174} 11/07/2021 04:20:52 - INFO - __main__ - Step 50504: {'lr': 0.0003787964123793146, 'samples': 9696768, 'steps': 50503, 'loss/train': 1.1646900177001953} 11/07/2021 04:20:53 - INFO - __main__ - Step 50505: {'lr': 0.0003787918640532206, 'samples': 9696960, 'steps': 50504, 'loss/train': 1.4190033674240112} 11/07/2021 04:20:54 - INFO - __main__ - Step 50506: {'lr': 0.000378787315669095, 'samples': 9697152, 'steps': 50505, 'loss/train': 1.230149507522583} 11/07/2021 04:20:54 - INFO - __main__ - Step 50507: {'lr': 0.00037878276722693984, 'samples': 9697344, 'steps': 50506, 'loss/train': 1.09401273727417} 11/07/2021 04:20:55 - INFO - __main__ - Step 50508: {'lr': 0.00037877821872675705, 'samples': 9697536, 'steps': 50507, 'loss/train': 1.4707863330841064} 11/07/2021 04:20:55 - INFO - __main__ - Step 50509: {'lr': 0.00037877367016854886, 'samples': 9697728, 'steps': 50508, 'loss/train': 1.4535642862319946} 11/07/2021 04:20:55 - INFO - __main__ - Step 50510: {'lr': 0.00037876912155231725, 'samples': 9697920, 'steps': 50509, 'loss/train': 1.3505984544754028} 11/07/2021 04:20:56 - INFO - __main__ - Step 50511: {'lr': 0.0003787645728780642, 'samples': 9698112, 'steps': 50510, 'loss/train': 1.0581302642822266} 11/07/2021 04:20:57 - INFO - __main__ - Step 50512: {'lr': 0.0003787600241457918, 'samples': 9698304, 'steps': 50511, 'loss/train': 1.6496787071228027} 11/07/2021 04:20:57 - INFO - __main__ - Step 50513: {'lr': 0.0003787554753555022, 'samples': 9698496, 'steps': 50512, 'loss/train': 1.4941242933273315} 11/07/2021 04:20:57 - INFO - __main__ - Step 50514: {'lr': 0.00037875092650719737, 'samples': 9698688, 'steps': 50513, 'loss/train': 1.7991743087768555} 11/07/2021 04:20:58 - INFO - __main__ - Step 50515: {'lr': 0.0003787463776008794, 'samples': 9698880, 'steps': 50514, 'loss/train': 1.6457631587982178} 11/07/2021 04:20:58 - INFO - __main__ - Step 50516: {'lr': 0.00037874182863655015, 'samples': 9699072, 'steps': 50515, 'loss/train': 3.9654839038848877} 11/07/2021 04:20:59 - INFO - __main__ - Step 50517: {'lr': 0.00037873727961421197, 'samples': 9699264, 'steps': 50516, 'loss/train': 1.1913834810256958} 11/07/2021 04:21:00 - INFO - __main__ - Step 50518: {'lr': 0.00037873273053386664, 'samples': 9699456, 'steps': 50517, 'loss/train': 1.2977443933486938} 11/07/2021 04:21:00 - INFO - __main__ - Step 50519: {'lr': 0.00037872818139551633, 'samples': 9699648, 'steps': 50518, 'loss/train': 0.13847365975379944} 11/07/2021 04:21:01 - INFO - __main__ - Step 50520: {'lr': 0.0003787236321991632, 'samples': 9699840, 'steps': 50519, 'loss/train': 1.0413885116577148} 11/07/2021 04:21:01 - INFO - __main__ - Step 50521: {'lr': 0.0003787190829448092, 'samples': 9700032, 'steps': 50520, 'loss/train': 0.9799793362617493} 11/07/2021 04:21:01 - INFO - __main__ - Step 50522: {'lr': 0.00037871453363245625, 'samples': 9700224, 'steps': 50521, 'loss/train': 1.574615716934204} 11/07/2021 04:21:02 - INFO - __main__ - Step 50523: {'lr': 0.0003787099842621066, 'samples': 9700416, 'steps': 50522, 'loss/train': 1.5450891256332397} 11/07/2021 04:21:03 - INFO - __main__ - Step 50524: {'lr': 0.0003787054348337621, 'samples': 9700608, 'steps': 50523, 'loss/train': 1.689807653427124} 11/07/2021 04:21:03 - INFO - __main__ - Step 50525: {'lr': 0.000378700885347425, 'samples': 9700800, 'steps': 50524, 'loss/train': 1.999502182006836} 11/07/2021 04:21:03 - INFO - __main__ - Step 50526: {'lr': 0.0003786963358030973, 'samples': 9700992, 'steps': 50525, 'loss/train': 1.3054684400558472} 11/07/2021 04:21:04 - INFO - __main__ - Step 50527: {'lr': 0.000378691786200781, 'samples': 9701184, 'steps': 50526, 'loss/train': 1.6954087018966675} 11/07/2021 04:21:05 - INFO - __main__ - Step 50528: {'lr': 0.0003786872365404781, 'samples': 9701376, 'steps': 50527, 'loss/train': 1.4089579582214355} 11/07/2021 04:21:05 - INFO - __main__ - Step 50529: {'lr': 0.00037868268682219073, 'samples': 9701568, 'steps': 50528, 'loss/train': 1.2623209953308105} 11/07/2021 04:21:05 - INFO - __main__ - Step 50530: {'lr': 0.000378678137045921, 'samples': 9701760, 'steps': 50529, 'loss/train': 1.2020975351333618} 11/07/2021 04:21:06 - INFO - __main__ - Step 50531: {'lr': 0.0003786735872116709, 'samples': 9701952, 'steps': 50530, 'loss/train': 1.2616205215454102} 11/07/2021 04:21:06 - INFO - __main__ - Step 50532: {'lr': 0.00037866903731944234, 'samples': 9702144, 'steps': 50531, 'loss/train': 1.9756957292556763} 11/07/2021 04:21:07 - INFO - __main__ - Step 50533: {'lr': 0.0003786644873692376, 'samples': 9702336, 'steps': 50532, 'loss/train': 1.620253562927246} 11/07/2021 04:21:08 - INFO - __main__ - Step 50534: {'lr': 0.0003786599373610586, 'samples': 9702528, 'steps': 50533, 'loss/train': 1.2405271530151367} 11/07/2021 04:21:08 - INFO - __main__ - Step 50535: {'lr': 0.00037865538729490745, 'samples': 9702720, 'steps': 50534, 'loss/train': 1.2175302505493164} 11/07/2021 04:21:08 - INFO - __main__ - Step 50536: {'lr': 0.00037865083717078605, 'samples': 9702912, 'steps': 50535, 'loss/train': 1.144630789756775} 11/07/2021 04:21:09 - INFO - __main__ - Step 50537: {'lr': 0.00037864628698869676, 'samples': 9703104, 'steps': 50536, 'loss/train': 1.092538833618164} 11/07/2021 04:21:10 - INFO - __main__ - Step 50538: {'lr': 0.0003786417367486413, 'samples': 9703296, 'steps': 50537, 'loss/train': 1.1753264665603638} 11/07/2021 04:21:10 - INFO - __main__ - Step 50539: {'lr': 0.00037863718645062184, 'samples': 9703488, 'steps': 50538, 'loss/train': 1.6783736944198608} 11/07/2021 04:21:10 - INFO - __main__ - Step 50540: {'lr': 0.00037863263609464056, 'samples': 9703680, 'steps': 50539, 'loss/train': 1.5667228698730469} 11/07/2021 04:21:11 - INFO - __main__ - Step 50541: {'lr': 0.00037862808568069935, 'samples': 9703872, 'steps': 50540, 'loss/train': 1.5426188707351685} 11/07/2021 04:21:11 - INFO - __main__ - Step 50542: {'lr': 0.00037862353520880026, 'samples': 9704064, 'steps': 50541, 'loss/train': 1.5579650402069092} 11/07/2021 04:21:12 - INFO - __main__ - Step 50543: {'lr': 0.0003786189846789454, 'samples': 9704256, 'steps': 50542, 'loss/train': 1.1260631084442139} 11/07/2021 04:21:12 - INFO - __main__ - Step 50544: {'lr': 0.00037861443409113683, 'samples': 9704448, 'steps': 50543, 'loss/train': 1.2102491855621338} 11/07/2021 04:21:13 - INFO - __main__ - Step 50545: {'lr': 0.0003786098834453766, 'samples': 9704640, 'steps': 50544, 'loss/train': 1.4704492092132568} 11/07/2021 04:21:13 - INFO - __main__ - Step 50546: {'lr': 0.00037860533274166675, 'samples': 9704832, 'steps': 50545, 'loss/train': 1.130420446395874} 11/07/2021 04:21:13 - INFO - __main__ - Step 50547: {'lr': 0.0003786007819800094, 'samples': 9705024, 'steps': 50546, 'loss/train': 1.5193778276443481} 11/07/2021 04:21:14 - INFO - __main__ - Step 50548: {'lr': 0.00037859623116040633, 'samples': 9705216, 'steps': 50547, 'loss/train': 1.3470855951309204} 11/07/2021 04:21:15 - INFO - __main__ - Step 50549: {'lr': 0.00037859168028285984, 'samples': 9705408, 'steps': 50548, 'loss/train': 1.4055453538894653} 11/07/2021 04:21:15 - INFO - __main__ - Step 50550: {'lr': 0.000378587129347372, 'samples': 9705600, 'steps': 50549, 'loss/train': 1.0417683124542236} 11/07/2021 04:21:16 - INFO - __main__ - Step 50551: {'lr': 0.00037858257835394473, 'samples': 9705792, 'steps': 50550, 'loss/train': 1.7074761390686035} 11/07/2021 04:21:16 - INFO - __main__ - Step 50552: {'lr': 0.0003785780273025802, 'samples': 9705984, 'steps': 50551, 'loss/train': 1.2066880464553833} 11/07/2021 04:21:17 - INFO - __main__ - Step 50553: {'lr': 0.00037857347619328033, 'samples': 9706176, 'steps': 50552, 'loss/train': 1.751990556716919} 11/07/2021 04:21:17 - INFO - __main__ - Step 50554: {'lr': 0.0003785689250260472, 'samples': 9706368, 'steps': 50553, 'loss/train': 1.3436176776885986} 11/07/2021 04:21:18 - INFO - __main__ - Step 50555: {'lr': 0.00037856437380088295, 'samples': 9706560, 'steps': 50554, 'loss/train': 1.865929365158081} 11/07/2021 04:21:18 - INFO - __main__ - Step 50556: {'lr': 0.0003785598225177896, 'samples': 9706752, 'steps': 50555, 'loss/train': 1.5532974004745483} 11/07/2021 04:21:18 - INFO - __main__ - Step 50557: {'lr': 0.0003785552711767691, 'samples': 9706944, 'steps': 50556, 'loss/train': 1.744436502456665} 11/07/2021 04:21:19 - INFO - __main__ - Step 50558: {'lr': 0.0003785507197778236, 'samples': 9707136, 'steps': 50557, 'loss/train': 1.4375718832015991} 11/07/2021 04:21:20 - INFO - __main__ - Step 50559: {'lr': 0.0003785461683209552, 'samples': 9707328, 'steps': 50558, 'loss/train': 1.1379846334457397} 11/07/2021 04:21:20 - INFO - __main__ - Step 50560: {'lr': 0.00037854161680616586, 'samples': 9707520, 'steps': 50559, 'loss/train': 1.375577688217163} 11/07/2021 04:21:21 - INFO - __main__ - Step 50561: {'lr': 0.00037853706523345766, 'samples': 9707712, 'steps': 50560, 'loss/train': 1.7637293338775635} 11/07/2021 04:21:21 - INFO - __main__ - Step 50562: {'lr': 0.0003785325136028326, 'samples': 9707904, 'steps': 50561, 'loss/train': 1.5140193700790405} 11/07/2021 04:21:21 - INFO - __main__ - Step 50563: {'lr': 0.0003785279619142927, 'samples': 9708096, 'steps': 50562, 'loss/train': 1.4685074090957642} 11/07/2021 04:21:22 - INFO - __main__ - Step 50564: {'lr': 0.0003785234101678402, 'samples': 9708288, 'steps': 50563, 'loss/train': 1.5050429105758667} 11/07/2021 04:21:23 - INFO - __main__ - Step 50565: {'lr': 0.000378518858363477, 'samples': 9708480, 'steps': 50564, 'loss/train': 1.1843069791793823} 11/07/2021 04:21:23 - INFO - __main__ - Step 50566: {'lr': 0.00037851430650120516, 'samples': 9708672, 'steps': 50565, 'loss/train': 1.375612497329712} 11/07/2021 04:21:23 - INFO - __main__ - Step 50567: {'lr': 0.00037850975458102686, 'samples': 9708864, 'steps': 50566, 'loss/train': 1.0893933773040771} 11/07/2021 04:21:24 - INFO - __main__ - Step 50568: {'lr': 0.000378505202602944, 'samples': 9709056, 'steps': 50567, 'loss/train': 1.0624316930770874} 11/07/2021 04:21:25 - INFO - __main__ - Step 50569: {'lr': 0.0003785006505669586, 'samples': 9709248, 'steps': 50568, 'loss/train': 1.5698699951171875} 11/07/2021 04:21:25 - INFO - __main__ - Step 50570: {'lr': 0.0003784960984730728, 'samples': 9709440, 'steps': 50569, 'loss/train': 1.3673588037490845} 11/07/2021 04:21:25 - INFO - __main__ - Step 50571: {'lr': 0.00037849154632128867, 'samples': 9709632, 'steps': 50570, 'loss/train': 1.5583531856536865} 11/07/2021 04:21:26 - INFO - __main__ - Step 50572: {'lr': 0.0003784869941116082, 'samples': 9709824, 'steps': 50571, 'loss/train': 1.3499821424484253} 11/07/2021 04:21:26 - INFO - __main__ - Step 50573: {'lr': 0.00037848244184403356, 'samples': 9710016, 'steps': 50572, 'loss/train': 1.1018770933151245} 11/07/2021 04:21:27 - INFO - __main__ - Step 50574: {'lr': 0.0003784778895185667, 'samples': 9710208, 'steps': 50573, 'loss/train': 1.6546837091445923} 11/07/2021 04:21:27 - INFO - __main__ - Step 50575: {'lr': 0.00037847333713520966, 'samples': 9710400, 'steps': 50574, 'loss/train': 2.1528480052948} 11/07/2021 04:21:28 - INFO - __main__ - Step 50576: {'lr': 0.0003784687846939645, 'samples': 9710592, 'steps': 50575, 'loss/train': 1.3178619146347046} 11/07/2021 04:21:28 - INFO - __main__ - Step 50577: {'lr': 0.00037846423219483325, 'samples': 9710784, 'steps': 50576, 'loss/train': 1.562099814414978} 11/07/2021 04:21:29 - INFO - __main__ - Step 50578: {'lr': 0.00037845967963781807, 'samples': 9710976, 'steps': 50577, 'loss/train': 1.1327686309814453} 11/07/2021 04:21:29 - INFO - __main__ - Step 50579: {'lr': 0.00037845512702292097, 'samples': 9711168, 'steps': 50578, 'loss/train': 1.1399965286254883} 11/07/2021 04:21:30 - INFO - __main__ - Step 50580: {'lr': 0.00037845057435014384, 'samples': 9711360, 'steps': 50579, 'loss/train': 1.2374995946884155} 11/07/2021 04:21:30 - INFO - __main__ - Step 50581: {'lr': 0.000378446021619489, 'samples': 9711552, 'steps': 50580, 'loss/train': 1.163236379623413} 11/07/2021 04:21:31 - INFO - __main__ - Step 50582: {'lr': 0.0003784414688309583, 'samples': 9711744, 'steps': 50581, 'loss/train': 0.9576605558395386} 11/07/2021 04:21:31 - INFO - __main__ - Step 50583: {'lr': 0.0003784369159845539, 'samples': 9711936, 'steps': 50582, 'loss/train': 1.6346780061721802} 11/07/2021 04:21:31 - INFO - __main__ - Step 50584: {'lr': 0.00037843236308027776, 'samples': 9712128, 'steps': 50583, 'loss/train': 1.5481826066970825} 11/07/2021 04:21:32 - INFO - __main__ - Step 50585: {'lr': 0.000378427810118132, 'samples': 9712320, 'steps': 50584, 'loss/train': 1.5417025089263916} 11/07/2021 04:21:33 - INFO - __main__ - Step 50586: {'lr': 0.0003784232570981186, 'samples': 9712512, 'steps': 50585, 'loss/train': 1.3001868724822998} 11/07/2021 04:21:33 - INFO - __main__ - Step 50587: {'lr': 0.0003784187040202398, 'samples': 9712704, 'steps': 50586, 'loss/train': 1.3041154146194458} 11/07/2021 04:21:33 - INFO - __main__ - Step 50588: {'lr': 0.0003784141508844974, 'samples': 9712896, 'steps': 50587, 'loss/train': 1.7065526247024536} 11/07/2021 04:21:34 - INFO - __main__ - Step 50589: {'lr': 0.00037840959769089354, 'samples': 9713088, 'steps': 50588, 'loss/train': 1.3818120956420898} 11/07/2021 04:21:35 - INFO - __main__ - Step 50590: {'lr': 0.00037840504443943033, 'samples': 9713280, 'steps': 50589, 'loss/train': 1.0090471506118774} 11/07/2021 04:21:35 - INFO - __main__ - Step 50591: {'lr': 0.00037840049113010976, 'samples': 9713472, 'steps': 50590, 'loss/train': 1.7250033617019653} 11/07/2021 04:21:35 - INFO - __main__ - Step 50592: {'lr': 0.000378395937762934, 'samples': 9713664, 'steps': 50591, 'loss/train': 1.4000662565231323} 11/07/2021 04:21:36 - INFO - __main__ - Step 50593: {'lr': 0.000378391384337905, 'samples': 9713856, 'steps': 50592, 'loss/train': 1.5273271799087524} 11/07/2021 04:21:36 - INFO - __main__ - Step 50594: {'lr': 0.00037838683085502473, 'samples': 9714048, 'steps': 50593, 'loss/train': 0.9857616424560547} 11/07/2021 04:21:37 - INFO - __main__ - Step 50595: {'lr': 0.0003783822773142954, 'samples': 9714240, 'steps': 50594, 'loss/train': 1.7231225967407227} 11/07/2021 04:21:37 - INFO - __main__ - Step 50596: {'lr': 0.00037837772371571897, 'samples': 9714432, 'steps': 50595, 'loss/train': 1.5911321640014648} 11/07/2021 04:21:38 - INFO - __main__ - Step 50597: {'lr': 0.0003783731700592975, 'samples': 9714624, 'steps': 50596, 'loss/train': 1.0957367420196533} 11/07/2021 04:21:38 - INFO - __main__ - Step 50598: {'lr': 0.0003783686163450332, 'samples': 9714816, 'steps': 50597, 'loss/train': 1.6767088174819946} 11/07/2021 04:21:38 - INFO - __main__ - Step 50599: {'lr': 0.0003783640625729278, 'samples': 9715008, 'steps': 50598, 'loss/train': 1.8420186042785645} 11/07/2021 04:21:40 - INFO - __main__ - Step 50600: {'lr': 0.00037835950874298365, 'samples': 9715200, 'steps': 50599, 'loss/train': 1.783567190170288} 11/07/2021 04:21:40 - INFO - __main__ - Step 50601: {'lr': 0.0003783549548552027, 'samples': 9715392, 'steps': 50600, 'loss/train': 1.1593835353851318} 11/07/2021 04:21:40 - INFO - __main__ - Step 50602: {'lr': 0.00037835040090958684, 'samples': 9715584, 'steps': 50601, 'loss/train': 1.4136946201324463} 11/07/2021 04:21:41 - INFO - __main__ - Step 50603: {'lr': 0.0003783458469061384, 'samples': 9715776, 'steps': 50602, 'loss/train': 1.1507467031478882} 11/07/2021 04:21:41 - INFO - __main__ - Step 50604: {'lr': 0.0003783412928448593, 'samples': 9715968, 'steps': 50603, 'loss/train': 1.043258786201477} 11/07/2021 04:21:42 - INFO - __main__ - Step 50605: {'lr': 0.00037833673872575153, 'samples': 9716160, 'steps': 50604, 'loss/train': 1.5505529642105103} 11/07/2021 04:21:42 - INFO - __main__ - Step 50606: {'lr': 0.00037833218454881725, 'samples': 9716352, 'steps': 50605, 'loss/train': 1.3890246152877808} 11/07/2021 04:21:43 - INFO - __main__ - Step 50607: {'lr': 0.0003783276303140584, 'samples': 9716544, 'steps': 50606, 'loss/train': 1.0809133052825928} 11/07/2021 04:21:43 - INFO - __main__ - Step 50608: {'lr': 0.0003783230760214772, 'samples': 9716736, 'steps': 50607, 'loss/train': 1.2744044065475464} 11/07/2021 04:21:43 - INFO - __main__ - Step 50609: {'lr': 0.00037831852167107563, 'samples': 9716928, 'steps': 50608, 'loss/train': 0.9171972870826721} 11/07/2021 04:21:44 - INFO - __main__ - Step 50610: {'lr': 0.0003783139672628556, 'samples': 9717120, 'steps': 50609, 'loss/train': 1.0866856575012207} 11/07/2021 04:21:45 - INFO - __main__ - Step 50611: {'lr': 0.0003783094127968193, 'samples': 9717312, 'steps': 50610, 'loss/train': 0.7773082852363586} 11/07/2021 04:21:45 - INFO - __main__ - Step 50612: {'lr': 0.0003783048582729688, 'samples': 9717504, 'steps': 50611, 'loss/train': 1.7726795673370361} 11/07/2021 04:21:46 - INFO - __main__ - Step 50613: {'lr': 0.0003783003036913061, 'samples': 9717696, 'steps': 50612, 'loss/train': 1.2064554691314697} 11/07/2021 04:21:46 - INFO - __main__ - Step 50614: {'lr': 0.0003782957490518332, 'samples': 9717888, 'steps': 50613, 'loss/train': 1.362608551979065} 11/07/2021 04:21:47 - INFO - __main__ - Step 50615: {'lr': 0.00037829119435455226, 'samples': 9718080, 'steps': 50614, 'loss/train': 1.2910335063934326} 11/07/2021 04:21:47 - INFO - __main__ - Step 50616: {'lr': 0.00037828663959946527, 'samples': 9718272, 'steps': 50615, 'loss/train': 1.0467013120651245} 11/07/2021 04:21:48 - INFO - __main__ - Step 50617: {'lr': 0.0003782820847865743, 'samples': 9718464, 'steps': 50616, 'loss/train': 1.650442361831665} 11/07/2021 04:21:48 - INFO - __main__ - Step 50618: {'lr': 0.0003782775299158815, 'samples': 9718656, 'steps': 50617, 'loss/train': 0.8560373783111572} 11/07/2021 04:21:48 - INFO - __main__ - Step 50619: {'lr': 0.0003782729749873887, 'samples': 9718848, 'steps': 50618, 'loss/train': 1.4592068195343018} 11/07/2021 04:21:49 - INFO - __main__ - Step 50620: {'lr': 0.0003782684200010981, 'samples': 9719040, 'steps': 50619, 'loss/train': 1.7602537870407104} 11/07/2021 04:21:50 - INFO - __main__ - Step 50621: {'lr': 0.0003782638649570118, 'samples': 9719232, 'steps': 50620, 'loss/train': 1.3714299201965332} 11/07/2021 04:21:50 - INFO - __main__ - Step 50622: {'lr': 0.00037825930985513177, 'samples': 9719424, 'steps': 50621, 'loss/train': 1.1895204782485962} 11/07/2021 04:21:50 - INFO - __main__ - Step 50623: {'lr': 0.00037825475469546, 'samples': 9719616, 'steps': 50622, 'loss/train': 1.2780832052230835} 11/07/2021 04:21:51 - INFO - __main__ - Step 50624: {'lr': 0.00037825019947799863, 'samples': 9719808, 'steps': 50623, 'loss/train': 1.401994228363037} 11/07/2021 04:21:52 - INFO - __main__ - Step 50625: {'lr': 0.0003782456442027498, 'samples': 9720000, 'steps': 50624, 'loss/train': 2.208725929260254} 11/07/2021 04:21:52 - INFO - __main__ - Step 50626: {'lr': 0.0003782410888697153, 'samples': 9720192, 'steps': 50625, 'loss/train': 2.153963088989258} 11/07/2021 04:21:52 - INFO - __main__ - Step 50627: {'lr': 0.00037823653347889745, 'samples': 9720384, 'steps': 50626, 'loss/train': 0.7837381958961487} 11/07/2021 04:21:53 - INFO - __main__ - Step 50628: {'lr': 0.0003782319780302982, 'samples': 9720576, 'steps': 50627, 'loss/train': 1.6824977397918701} 11/07/2021 04:21:53 - INFO - __main__ - Step 50629: {'lr': 0.00037822742252391963, 'samples': 9720768, 'steps': 50628, 'loss/train': 1.5285260677337646} 11/07/2021 04:21:54 - INFO - __main__ - Step 50630: {'lr': 0.0003782228669597637, 'samples': 9720960, 'steps': 50629, 'loss/train': 0.8713791966438293} 11/07/2021 04:21:54 - INFO - __main__ - Step 50631: {'lr': 0.00037821831133783246, 'samples': 9721152, 'steps': 50630, 'loss/train': 1.676653265953064} 11/07/2021 04:21:55 - INFO - __main__ - Step 50632: {'lr': 0.00037821375565812816, 'samples': 9721344, 'steps': 50631, 'loss/train': 1.2008812427520752} 11/07/2021 04:21:55 - INFO - __main__ - Step 50633: {'lr': 0.00037820919992065263, 'samples': 9721536, 'steps': 50632, 'loss/train': 1.4452593326568604} 11/07/2021 04:21:55 - INFO - __main__ - Step 50634: {'lr': 0.00037820464412540805, 'samples': 9721728, 'steps': 50633, 'loss/train': 1.4773658514022827} 11/07/2021 04:21:56 - INFO - __main__ - Step 50635: {'lr': 0.0003782000882723965, 'samples': 9721920, 'steps': 50634, 'loss/train': 1.4276493787765503} 11/07/2021 04:21:57 - INFO - __main__ - Step 50636: {'lr': 0.00037819553236161985, 'samples': 9722112, 'steps': 50635, 'loss/train': 1.3189899921417236} 11/07/2021 04:21:57 - INFO - __main__ - Step 50637: {'lr': 0.0003781909763930803, 'samples': 9722304, 'steps': 50636, 'loss/train': 1.030458927154541} 11/07/2021 04:21:57 - INFO - __main__ - Step 50638: {'lr': 0.00037818642036677993, 'samples': 9722496, 'steps': 50637, 'loss/train': 1.0234310626983643} 11/07/2021 04:21:58 - INFO - __main__ - Step 50639: {'lr': 0.00037818186428272064, 'samples': 9722688, 'steps': 50638, 'loss/train': 1.3208693265914917} 11/07/2021 04:21:59 - INFO - __main__ - Step 50640: {'lr': 0.00037817730814090466, 'samples': 9722880, 'steps': 50639, 'loss/train': 1.2710895538330078} 11/07/2021 04:21:59 - INFO - __main__ - Step 50641: {'lr': 0.000378172751941334, 'samples': 9723072, 'steps': 50640, 'loss/train': 1.9764162302017212} 11/07/2021 04:22:00 - INFO - __main__ - Step 50642: {'lr': 0.0003781681956840106, 'samples': 9723264, 'steps': 50641, 'loss/train': 1.6260383129119873} 11/07/2021 04:22:00 - INFO - __main__ - Step 50643: {'lr': 0.0003781636393689366, 'samples': 9723456, 'steps': 50642, 'loss/train': 1.8933266401290894} 11/07/2021 04:22:00 - INFO - __main__ - Step 50644: {'lr': 0.0003781590829961141, 'samples': 9723648, 'steps': 50643, 'loss/train': 1.2901744842529297} 11/07/2021 04:22:01 - INFO - __main__ - Step 50645: {'lr': 0.000378154526565545, 'samples': 9723840, 'steps': 50644, 'loss/train': 1.571274757385254} 11/07/2021 04:22:02 - INFO - __main__ - Step 50646: {'lr': 0.00037814997007723153, 'samples': 9724032, 'steps': 50645, 'loss/train': 1.4527223110198975} 11/07/2021 04:22:02 - INFO - __main__ - Step 50647: {'lr': 0.0003781454135311756, 'samples': 9724224, 'steps': 50646, 'loss/train': 1.5039423704147339} 11/07/2021 04:22:02 - INFO - __main__ - Step 50648: {'lr': 0.0003781408569273794, 'samples': 9724416, 'steps': 50647, 'loss/train': 0.5214137434959412} 11/07/2021 04:22:03 - INFO - __main__ - Step 50649: {'lr': 0.0003781363002658448, 'samples': 9724608, 'steps': 50648, 'loss/train': 1.3853232860565186} 11/07/2021 04:22:03 - INFO - __main__ - Step 50650: {'lr': 0.000378131743546574, 'samples': 9724800, 'steps': 50649, 'loss/train': 1.7496758699417114} 11/07/2021 04:22:04 - INFO - __main__ - Step 50651: {'lr': 0.000378127186769569, 'samples': 9724992, 'steps': 50650, 'loss/train': 1.6479120254516602} 11/07/2021 04:22:05 - INFO - __main__ - Step 50652: {'lr': 0.00037812262993483194, 'samples': 9725184, 'steps': 50651, 'loss/train': 1.4466067552566528} 11/07/2021 04:22:05 - INFO - __main__ - Step 50653: {'lr': 0.0003781180730423648, 'samples': 9725376, 'steps': 50652, 'loss/train': 1.361926555633545} 11/07/2021 04:22:05 - INFO - __main__ - Step 50654: {'lr': 0.00037811351609216956, 'samples': 9725568, 'steps': 50653, 'loss/train': 1.057162880897522} 11/07/2021 04:22:06 - INFO - __main__ - Step 50655: {'lr': 0.00037810895908424837, 'samples': 9725760, 'steps': 50654, 'loss/train': 1.5812091827392578} 11/07/2021 04:22:07 - INFO - __main__ - Step 50656: {'lr': 0.0003781044020186033, 'samples': 9725952, 'steps': 50655, 'loss/train': 1.335661768913269} 11/07/2021 04:22:07 - INFO - __main__ - Step 50657: {'lr': 0.0003780998448952363, 'samples': 9726144, 'steps': 50656, 'loss/train': 1.419960856437683} 11/07/2021 04:22:07 - INFO - __main__ - Step 50658: {'lr': 0.0003780952877141495, 'samples': 9726336, 'steps': 50657, 'loss/train': 0.1022462397813797} 11/07/2021 04:22:08 - INFO - __main__ - Step 50659: {'lr': 0.0003780907304753449, 'samples': 9726528, 'steps': 50658, 'loss/train': 1.390694260597229} 11/07/2021 04:22:08 - INFO - __main__ - Step 50660: {'lr': 0.0003780861731788247, 'samples': 9726720, 'steps': 50659, 'loss/train': 1.6995733976364136} 11/07/2021 04:22:09 - INFO - __main__ - Step 50661: {'lr': 0.0003780816158245908, 'samples': 9726912, 'steps': 50660, 'loss/train': 1.3688750267028809} 11/07/2021 04:22:09 - INFO - __main__ - Step 50662: {'lr': 0.0003780770584126453, 'samples': 9727104, 'steps': 50661, 'loss/train': 1.521982192993164} 11/07/2021 04:22:10 - INFO - __main__ - Step 50663: {'lr': 0.0003780725009429903, 'samples': 9727296, 'steps': 50662, 'loss/train': 1.281760334968567} 11/07/2021 04:22:10 - INFO - __main__ - Step 50664: {'lr': 0.00037806794341562773, 'samples': 9727488, 'steps': 50663, 'loss/train': 0.08871020376682281} 11/07/2021 04:22:11 - INFO - __main__ - Step 50665: {'lr': 0.00037806338583055976, 'samples': 9727680, 'steps': 50664, 'loss/train': 1.4168148040771484} 11/07/2021 04:22:12 - INFO - __main__ - Step 50666: {'lr': 0.0003780588281877884, 'samples': 9727872, 'steps': 50665, 'loss/train': 1.6245907545089722} 11/07/2021 04:22:12 - INFO - __main__ - Step 50667: {'lr': 0.00037805427048731566, 'samples': 9728064, 'steps': 50666, 'loss/train': 1.5741922855377197} 11/07/2021 04:22:12 - INFO - __main__ - Step 50668: {'lr': 0.0003780497127291437, 'samples': 9728256, 'steps': 50667, 'loss/train': 1.537471055984497} 11/07/2021 04:22:13 - INFO - __main__ - Step 50669: {'lr': 0.0003780451549132745, 'samples': 9728448, 'steps': 50668, 'loss/train': 1.3848398923873901} 11/07/2021 04:22:13 - INFO - __main__ - Step 50670: {'lr': 0.00037804059703971016, 'samples': 9728640, 'steps': 50669, 'loss/train': 1.594283938407898} 11/07/2021 04:22:14 - INFO - __main__ - Step 50671: {'lr': 0.00037803603910845264, 'samples': 9728832, 'steps': 50670, 'loss/train': 0.884373664855957} 11/07/2021 04:22:14 - INFO - __main__ - Step 50672: {'lr': 0.00037803148111950407, 'samples': 9729024, 'steps': 50671, 'loss/train': 0.6188684701919556} 11/07/2021 04:22:15 - INFO - __main__ - Step 50673: {'lr': 0.0003780269230728665, 'samples': 9729216, 'steps': 50672, 'loss/train': 1.2383602857589722} 11/07/2021 04:22:15 - INFO - __main__ - Step 50674: {'lr': 0.000378022364968542, 'samples': 9729408, 'steps': 50673, 'loss/train': 1.4449785947799683} 11/07/2021 04:22:15 - INFO - __main__ - Step 50675: {'lr': 0.00037801780680653263, 'samples': 9729600, 'steps': 50674, 'loss/train': 1.3962541818618774} 11/07/2021 04:22:16 - INFO - __main__ - Step 50676: {'lr': 0.0003780132485868403, 'samples': 9729792, 'steps': 50675, 'loss/train': 1.3470677137374878} 11/07/2021 04:22:17 - INFO - __main__ - Step 50677: {'lr': 0.0003780086903094673, 'samples': 9729984, 'steps': 50676, 'loss/train': 1.3629646301269531} 11/07/2021 04:22:17 - INFO - __main__ - Step 50678: {'lr': 0.0003780041319744154, 'samples': 9730176, 'steps': 50677, 'loss/train': 1.3893108367919922} 11/07/2021 04:22:17 - INFO - __main__ - Step 50679: {'lr': 0.00037799957358168693, 'samples': 9730368, 'steps': 50678, 'loss/train': 1.2100180387496948} 11/07/2021 04:22:18 - INFO - __main__ - Step 50680: {'lr': 0.0003779950151312838, 'samples': 9730560, 'steps': 50679, 'loss/train': 1.3467028141021729} 11/07/2021 04:22:19 - INFO - __main__ - Step 50681: {'lr': 0.0003779904566232081, 'samples': 9730752, 'steps': 50680, 'loss/train': 1.2199546098709106} 11/07/2021 04:22:19 - INFO - __main__ - Step 50682: {'lr': 0.0003779858980574619, 'samples': 9730944, 'steps': 50681, 'loss/train': 1.311398983001709} 11/07/2021 04:22:20 - INFO - __main__ - Step 50683: {'lr': 0.0003779813394340472, 'samples': 9731136, 'steps': 50682, 'loss/train': 1.609716534614563} 11/07/2021 04:22:20 - INFO - __main__ - Step 50684: {'lr': 0.0003779767807529661, 'samples': 9731328, 'steps': 50683, 'loss/train': 1.0118694305419922} 11/07/2021 04:22:20 - INFO - __main__ - Step 50685: {'lr': 0.0003779722220142206, 'samples': 9731520, 'steps': 50684, 'loss/train': 1.3198235034942627} 11/07/2021 04:22:21 - INFO - __main__ - Step 50686: {'lr': 0.00037796766321781286, 'samples': 9731712, 'steps': 50685, 'loss/train': 1.7070567607879639} 11/07/2021 04:22:22 - INFO - __main__ - Step 50687: {'lr': 0.00037796310436374474, 'samples': 9731904, 'steps': 50686, 'loss/train': 3.05145525932312} 11/07/2021 04:22:22 - INFO - __main__ - Step 50688: {'lr': 0.0003779585454520186, 'samples': 9732096, 'steps': 50687, 'loss/train': 1.5338678359985352} 11/07/2021 04:22:22 - INFO - __main__ - Step 50689: {'lr': 0.0003779539864826362, 'samples': 9732288, 'steps': 50688, 'loss/train': 1.711365818977356} 11/07/2021 04:22:23 - INFO - __main__ - Step 50690: {'lr': 0.0003779494274555997, 'samples': 9732480, 'steps': 50689, 'loss/train': 1.7390422821044922} 11/07/2021 04:22:24 - INFO - __main__ - Step 50691: {'lr': 0.0003779448683709111, 'samples': 9732672, 'steps': 50690, 'loss/train': 1.5304346084594727} 11/07/2021 04:22:24 - INFO - __main__ - Step 50692: {'lr': 0.0003779403092285727, 'samples': 9732864, 'steps': 50691, 'loss/train': 1.2473922967910767} 11/07/2021 04:22:24 - INFO - __main__ - Step 50693: {'lr': 0.00037793575002858625, 'samples': 9733056, 'steps': 50692, 'loss/train': 1.4359720945358276} 11/07/2021 04:22:25 - INFO - __main__ - Step 50694: {'lr': 0.00037793119077095396, 'samples': 9733248, 'steps': 50693, 'loss/train': 1.259575605392456} 11/07/2021 04:22:25 - INFO - __main__ - Step 50695: {'lr': 0.00037792663145567784, 'samples': 9733440, 'steps': 50694, 'loss/train': 1.3352851867675781} 11/07/2021 04:22:26 - INFO - __main__ - Step 50696: {'lr': 0.00037792207208275995, 'samples': 9733632, 'steps': 50695, 'loss/train': 2.020500659942627} 11/07/2021 04:22:27 - INFO - __main__ - Step 50697: {'lr': 0.0003779175126522023, 'samples': 9733824, 'steps': 50696, 'loss/train': 1.638195514678955} 11/07/2021 04:22:27 - INFO - __main__ - Step 50698: {'lr': 0.0003779129531640071, 'samples': 9734016, 'steps': 50697, 'loss/train': 1.8014445304870605} 11/07/2021 04:22:27 - INFO - __main__ - Step 50699: {'lr': 0.0003779083936181762, 'samples': 9734208, 'steps': 50698, 'loss/train': 1.3109745979309082} 11/07/2021 04:22:28 - INFO - __main__ - Step 50700: {'lr': 0.0003779038340147118, 'samples': 9734400, 'steps': 50699, 'loss/train': 1.7507801055908203} 11/07/2021 04:22:29 - INFO - __main__ - Step 50701: {'lr': 0.0003778992743536159, 'samples': 9734592, 'steps': 50700, 'loss/train': 1.6560417413711548} 11/07/2021 04:22:29 - INFO - __main__ - Step 50702: {'lr': 0.0003778947146348906, 'samples': 9734784, 'steps': 50701, 'loss/train': 1.7197080850601196} 11/07/2021 04:22:29 - INFO - __main__ - Step 50703: {'lr': 0.00037789015485853786, 'samples': 9734976, 'steps': 50702, 'loss/train': 1.6700061559677124} 11/07/2021 04:22:30 - INFO - __main__ - Step 50704: {'lr': 0.0003778855950245598, 'samples': 9735168, 'steps': 50703, 'loss/train': 1.8428999185562134} 11/07/2021 04:22:30 - INFO - __main__ - Step 50705: {'lr': 0.00037788103513295844, 'samples': 9735360, 'steps': 50704, 'loss/train': 1.0536830425262451} 11/07/2021 04:22:31 - INFO - __main__ - Step 50706: {'lr': 0.00037787647518373586, 'samples': 9735552, 'steps': 50705, 'loss/train': 1.2545115947723389} 11/07/2021 04:22:31 - INFO - __main__ - Step 50707: {'lr': 0.0003778719151768941, 'samples': 9735744, 'steps': 50706, 'loss/train': 1.4495525360107422} 11/07/2021 04:22:32 - INFO - __main__ - Step 50708: {'lr': 0.0003778673551124353, 'samples': 9735936, 'steps': 50707, 'loss/train': 0.7083872556686401} 11/07/2021 04:22:32 - INFO - __main__ - Step 50709: {'lr': 0.0003778627949903615, 'samples': 9736128, 'steps': 50708, 'loss/train': 1.3951613903045654} 11/07/2021 04:22:33 - INFO - __main__ - Step 50710: {'lr': 0.00037785823481067455, 'samples': 9736320, 'steps': 50709, 'loss/train': 1.2276339530944824} 11/07/2021 04:22:33 - INFO - __main__ - Step 50711: {'lr': 0.0003778536745733767, 'samples': 9736512, 'steps': 50710, 'loss/train': 1.5364985466003418} 11/07/2021 04:22:34 - INFO - __main__ - Step 50712: {'lr': 0.00037784911427846997, 'samples': 9736704, 'steps': 50711, 'loss/train': 0.06656911224126816} 11/07/2021 04:22:34 - INFO - __main__ - Step 50713: {'lr': 0.0003778445539259564, 'samples': 9736896, 'steps': 50712, 'loss/train': 1.4633636474609375} 11/07/2021 04:22:35 - INFO - __main__ - Step 50714: {'lr': 0.000377839993515838, 'samples': 9737088, 'steps': 50713, 'loss/train': 1.334006667137146} 11/07/2021 04:22:35 - INFO - __main__ - Step 50715: {'lr': 0.000377835433048117, 'samples': 9737280, 'steps': 50714, 'loss/train': 1.427729606628418} 11/07/2021 04:22:35 - INFO - __main__ - Step 50716: {'lr': 0.00037783087252279523, 'samples': 9737472, 'steps': 50715, 'loss/train': 1.0502132177352905} 11/07/2021 04:22:36 - INFO - __main__ - Step 50717: {'lr': 0.0003778263119398748, 'samples': 9737664, 'steps': 50716, 'loss/train': 1.142493486404419} 11/07/2021 04:22:37 - INFO - __main__ - Step 50718: {'lr': 0.00037782175129935793, 'samples': 9737856, 'steps': 50717, 'loss/train': 1.661091923713684} 11/07/2021 04:22:37 - INFO - __main__ - Step 50719: {'lr': 0.0003778171906012464, 'samples': 9738048, 'steps': 50718, 'loss/train': 1.5472112894058228} 11/07/2021 04:22:37 - INFO - __main__ - Step 50720: {'lr': 0.0003778126298455425, 'samples': 9738240, 'steps': 50719, 'loss/train': 0.9596140384674072} 11/07/2021 04:22:38 - INFO - __main__ - Step 50721: {'lr': 0.0003778080690322483, 'samples': 9738432, 'steps': 50720, 'loss/train': 1.7238438129425049} 11/07/2021 04:22:39 - INFO - __main__ - Step 50722: {'lr': 0.0003778035081613656, 'samples': 9738624, 'steps': 50721, 'loss/train': 0.9496026635169983} 11/07/2021 04:22:39 - INFO - __main__ - Step 50723: {'lr': 0.00037779894723289666, 'samples': 9738816, 'steps': 50722, 'loss/train': 0.970390260219574} 11/07/2021 04:22:39 - INFO - __main__ - Step 50724: {'lr': 0.00037779438624684346, 'samples': 9739008, 'steps': 50723, 'loss/train': 1.4665848016738892} 11/07/2021 04:22:40 - INFO - __main__ - Step 50725: {'lr': 0.00037778982520320813, 'samples': 9739200, 'steps': 50724, 'loss/train': 1.033931851387024} 11/07/2021 04:22:40 - INFO - __main__ - Step 50726: {'lr': 0.00037778526410199266, 'samples': 9739392, 'steps': 50725, 'loss/train': 1.2364975214004517} 11/07/2021 04:22:41 - INFO - __main__ - Step 50727: {'lr': 0.0003777807029431992, 'samples': 9739584, 'steps': 50726, 'loss/train': 0.11913690716028214} 11/07/2021 04:22:42 - INFO - __main__ - Step 50728: {'lr': 0.0003777761417268296, 'samples': 9739776, 'steps': 50727, 'loss/train': 1.2327946424484253} 11/07/2021 04:22:42 - INFO - __main__ - Step 50729: {'lr': 0.00037777158045288606, 'samples': 9739968, 'steps': 50728, 'loss/train': 1.6647366285324097} 11/07/2021 04:22:42 - INFO - __main__ - Step 50730: {'lr': 0.00037776701912137066, 'samples': 9740160, 'steps': 50729, 'loss/train': 1.1865135431289673} 11/07/2021 04:22:43 - INFO - __main__ - Step 50731: {'lr': 0.00037776245773228547, 'samples': 9740352, 'steps': 50730, 'loss/train': 0.562919020652771} 11/07/2021 04:22:43 - INFO - __main__ - Step 50732: {'lr': 0.0003777578962856324, 'samples': 9740544, 'steps': 50731, 'loss/train': 1.2084091901779175} 11/07/2021 04:22:44 - INFO - __main__ - Step 50733: {'lr': 0.0003777533347814136, 'samples': 9740736, 'steps': 50732, 'loss/train': 1.3344764709472656} 11/07/2021 04:22:44 - INFO - __main__ - Step 50734: {'lr': 0.0003777487732196312, 'samples': 9740928, 'steps': 50733, 'loss/train': 1.5014675855636597} 11/07/2021 04:22:45 - INFO - __main__ - Step 50735: {'lr': 0.00037774421160028705, 'samples': 9741120, 'steps': 50734, 'loss/train': 1.603287935256958} 11/07/2021 04:22:45 - INFO - __main__ - Step 50736: {'lr': 0.0003777396499233834, 'samples': 9741312, 'steps': 50735, 'loss/train': 2.493743658065796} 11/07/2021 04:22:45 - INFO - __main__ - Step 50737: {'lr': 0.00037773508818892223, 'samples': 9741504, 'steps': 50736, 'loss/train': 1.6289881467819214} 11/07/2021 04:22:46 - INFO - __main__ - Step 50738: {'lr': 0.0003777305263969056, 'samples': 9741696, 'steps': 50737, 'loss/train': 1.4261382818222046} 11/07/2021 04:22:47 - INFO - __main__ - Step 50739: {'lr': 0.00037772596454733554, 'samples': 9741888, 'steps': 50738, 'loss/train': 1.0800403356552124} 11/07/2021 04:22:47 - INFO - __main__ - Step 50740: {'lr': 0.00037772140264021416, 'samples': 9742080, 'steps': 50739, 'loss/train': 1.165719747543335} 11/07/2021 04:22:47 - INFO - __main__ - Step 50741: {'lr': 0.00037771684067554345, 'samples': 9742272, 'steps': 50740, 'loss/train': 1.4455782175064087} 11/07/2021 04:22:48 - INFO - __main__ - Step 50742: {'lr': 0.0003777122786533255, 'samples': 9742464, 'steps': 50741, 'loss/train': 1.5963486433029175} 11/07/2021 04:22:49 - INFO - __main__ - Step 50743: {'lr': 0.0003777077165735625, 'samples': 9742656, 'steps': 50742, 'loss/train': 1.311700463294983} 11/07/2021 04:22:49 - INFO - __main__ - Step 50744: {'lr': 0.0003777031544362562, 'samples': 9742848, 'steps': 50743, 'loss/train': 1.2413666248321533} 11/07/2021 04:22:50 - INFO - __main__ - Step 50745: {'lr': 0.0003776985922414089, 'samples': 9743040, 'steps': 50744, 'loss/train': 1.693814754486084} 11/07/2021 04:22:50 - INFO - __main__ - Step 50746: {'lr': 0.0003776940299890226, 'samples': 9743232, 'steps': 50745, 'loss/train': 1.498221516609192} 11/07/2021 04:22:50 - INFO - __main__ - Step 50747: {'lr': 0.0003776894676790993, 'samples': 9743424, 'steps': 50746, 'loss/train': 0.10506287217140198} 11/07/2021 04:22:51 - INFO - __main__ - Step 50748: {'lr': 0.0003776849053116411, 'samples': 9743616, 'steps': 50747, 'loss/train': 1.1215659379959106} 11/07/2021 04:22:52 - INFO - __main__ - Step 50749: {'lr': 0.00037768034288665015, 'samples': 9743808, 'steps': 50748, 'loss/train': 1.410152554512024} 11/07/2021 04:22:52 - INFO - __main__ - Step 50750: {'lr': 0.0003776757804041283, 'samples': 9744000, 'steps': 50749, 'loss/train': 1.4149744510650635} 11/07/2021 04:22:52 - INFO - __main__ - Step 50751: {'lr': 0.00037767121786407774, 'samples': 9744192, 'steps': 50750, 'loss/train': 1.5454553365707397} 11/07/2021 04:22:53 - INFO - __main__ - Step 50752: {'lr': 0.00037766665526650054, 'samples': 9744384, 'steps': 50751, 'loss/train': 1.801805019378662} 11/07/2021 04:22:54 - INFO - __main__ - Step 50753: {'lr': 0.0003776620926113986, 'samples': 9744576, 'steps': 50752, 'loss/train': 1.524074912071228} 11/07/2021 04:22:54 - INFO - __main__ - Step 50754: {'lr': 0.0003776575298987742, 'samples': 9744768, 'steps': 50753, 'loss/train': 1.9182108640670776} 11/07/2021 04:22:54 - INFO - __main__ - Step 50755: {'lr': 0.00037765296712862927, 'samples': 9744960, 'steps': 50754, 'loss/train': 1.1489320993423462} 11/07/2021 04:22:55 - INFO - __main__ - Step 50756: {'lr': 0.00037764840430096593, 'samples': 9745152, 'steps': 50755, 'loss/train': 0.532424807548523} 11/07/2021 04:22:55 - INFO - __main__ - Step 50757: {'lr': 0.0003776438414157861, 'samples': 9745344, 'steps': 50756, 'loss/train': 1.3537510633468628} 11/07/2021 04:22:56 - INFO - __main__ - Step 50758: {'lr': 0.00037763927847309195, 'samples': 9745536, 'steps': 50757, 'loss/train': 0.9443285465240479} 11/07/2021 04:22:57 - INFO - __main__ - Step 50759: {'lr': 0.00037763471547288554, 'samples': 9745728, 'steps': 50758, 'loss/train': 1.5967708826065063} 11/07/2021 04:22:57 - INFO - __main__ - Step 50760: {'lr': 0.00037763015241516887, 'samples': 9745920, 'steps': 50759, 'loss/train': 1.276456356048584} 11/07/2021 04:22:57 - INFO - __main__ - Step 50761: {'lr': 0.00037762558929994394, 'samples': 9746112, 'steps': 50760, 'loss/train': 1.8176703453063965} 11/07/2021 04:22:58 - INFO - __main__ - Step 50762: {'lr': 0.00037762102612721305, 'samples': 9746304, 'steps': 50761, 'loss/train': 0.9380192160606384} 11/07/2021 04:22:59 - INFO - __main__ - Step 50763: {'lr': 0.00037761646289697796, 'samples': 9746496, 'steps': 50762, 'loss/train': 1.3880196809768677} 11/07/2021 04:22:59 - INFO - __main__ - Step 50764: {'lr': 0.0003776118996092409, 'samples': 9746688, 'steps': 50763, 'loss/train': 1.337918758392334} 11/07/2021 04:22:59 - INFO - __main__ - Step 50765: {'lr': 0.00037760733626400396, 'samples': 9746880, 'steps': 50764, 'loss/train': 1.2872546911239624} 11/07/2021 04:23:00 - INFO - __main__ - Step 50766: {'lr': 0.00037760277286126906, 'samples': 9747072, 'steps': 50765, 'loss/train': 1.5700017213821411} 11/07/2021 04:23:00 - INFO - __main__ - Step 50767: {'lr': 0.00037759820940103827, 'samples': 9747264, 'steps': 50766, 'loss/train': 1.6338597536087036} 11/07/2021 04:23:00 - INFO - __main__ - Step 50768: {'lr': 0.0003775936458833138, 'samples': 9747456, 'steps': 50767, 'loss/train': 0.943385124206543} 11/07/2021 04:23:01 - INFO - __main__ - Step 50769: {'lr': 0.00037758908230809757, 'samples': 9747648, 'steps': 50768, 'loss/train': 1.0664221048355103} 11/07/2021 04:23:02 - INFO - __main__ - Step 50770: {'lr': 0.0003775845186753917, 'samples': 9747840, 'steps': 50769, 'loss/train': 1.719895362854004} 11/07/2021 04:23:02 - INFO - __main__ - Step 50771: {'lr': 0.00037757995498519814, 'samples': 9748032, 'steps': 50770, 'loss/train': 1.3944305181503296} 11/07/2021 04:23:02 - INFO - __main__ - Step 50772: {'lr': 0.00037757539123751906, 'samples': 9748224, 'steps': 50771, 'loss/train': 1.4768362045288086} 11/07/2021 04:23:03 - INFO - __main__ - Step 50773: {'lr': 0.00037757082743235644, 'samples': 9748416, 'steps': 50772, 'loss/train': 1.4655301570892334} 11/07/2021 04:23:04 - INFO - __main__ - Step 50774: {'lr': 0.00037756626356971236, 'samples': 9748608, 'steps': 50773, 'loss/train': 0.987450897693634} 11/07/2021 04:23:04 - INFO - __main__ - Step 50775: {'lr': 0.00037756169964958897, 'samples': 9748800, 'steps': 50774, 'loss/train': 1.2702594995498657} 11/07/2021 04:23:04 - INFO - __main__ - Step 50776: {'lr': 0.00037755713567198823, 'samples': 9748992, 'steps': 50775, 'loss/train': 1.52090585231781} 11/07/2021 04:23:05 - INFO - __main__ - Step 50777: {'lr': 0.00037755257163691214, 'samples': 9749184, 'steps': 50776, 'loss/train': 1.83846116065979} 11/07/2021 04:23:05 - INFO - __main__ - Step 50778: {'lr': 0.00037754800754436293, 'samples': 9749376, 'steps': 50777, 'loss/train': 1.0825878381729126} 11/07/2021 04:23:06 - INFO - __main__ - Step 50779: {'lr': 0.0003775434433943425, 'samples': 9749568, 'steps': 50778, 'loss/train': 1.220787763595581} 11/07/2021 04:23:06 - INFO - __main__ - Step 50780: {'lr': 0.00037753887918685295, 'samples': 9749760, 'steps': 50779, 'loss/train': 1.7025530338287354} 11/07/2021 04:23:07 - INFO - __main__ - Step 50781: {'lr': 0.0003775343149218964, 'samples': 9749952, 'steps': 50780, 'loss/train': 1.3414987325668335} 11/07/2021 04:23:07 - INFO - __main__ - Step 50782: {'lr': 0.0003775297505994748, 'samples': 9750144, 'steps': 50781, 'loss/train': 1.3007538318634033} 11/07/2021 04:23:07 - INFO - __main__ - Step 50783: {'lr': 0.0003775251862195903, 'samples': 9750336, 'steps': 50782, 'loss/train': 1.2600034475326538} 11/07/2021 04:23:09 - INFO - __main__ - Step 50784: {'lr': 0.0003775206217822449, 'samples': 9750528, 'steps': 50783, 'loss/train': 1.5636194944381714} 11/07/2021 04:23:09 - INFO - __main__ - Step 50785: {'lr': 0.00037751605728744063, 'samples': 9750720, 'steps': 50784, 'loss/train': 1.5546804666519165} 11/07/2021 04:23:09 - INFO - __main__ - Step 50786: {'lr': 0.0003775114927351797, 'samples': 9750912, 'steps': 50785, 'loss/train': 1.2205405235290527} 11/07/2021 04:23:10 - INFO - __main__ - Step 50787: {'lr': 0.00037750692812546396, 'samples': 9751104, 'steps': 50786, 'loss/train': 1.3504419326782227} 11/07/2021 04:23:10 - INFO - __main__ - Step 50788: {'lr': 0.00037750236345829557, 'samples': 9751296, 'steps': 50787, 'loss/train': 1.6261910200119019} 11/07/2021 04:23:11 - INFO - __main__ - Step 50789: {'lr': 0.0003774977987336767, 'samples': 9751488, 'steps': 50788, 'loss/train': 1.0199942588806152} 11/07/2021 04:23:11 - INFO - __main__ - Step 50790: {'lr': 0.0003774932339516092, 'samples': 9751680, 'steps': 50789, 'loss/train': 1.037049651145935} 11/07/2021 04:23:12 - INFO - __main__ - Step 50791: {'lr': 0.00037748866911209525, 'samples': 9751872, 'steps': 50790, 'loss/train': 1.5314488410949707} 11/07/2021 04:23:12 - INFO - __main__ - Step 50792: {'lr': 0.00037748410421513677, 'samples': 9752064, 'steps': 50791, 'loss/train': 0.8069332838058472} 11/07/2021 04:23:12 - INFO - __main__ - Step 50793: {'lr': 0.000377479539260736, 'samples': 9752256, 'steps': 50792, 'loss/train': 1.3279070854187012} 11/07/2021 04:23:13 - INFO - __main__ - Step 50794: {'lr': 0.0003774749742488949, 'samples': 9752448, 'steps': 50793, 'loss/train': 1.1581848859786987} 11/07/2021 04:23:14 - INFO - __main__ - Step 50795: {'lr': 0.0003774704091796156, 'samples': 9752640, 'steps': 50794, 'loss/train': 1.2445685863494873} 11/07/2021 04:23:14 - INFO - __main__ - Step 50796: {'lr': 0.00037746584405290006, 'samples': 9752832, 'steps': 50795, 'loss/train': 1.3859024047851562} 11/07/2021 04:23:14 - INFO - __main__ - Step 50797: {'lr': 0.00037746127886875035, 'samples': 9753024, 'steps': 50796, 'loss/train': 1.5053210258483887} 11/07/2021 04:23:15 - INFO - __main__ - Step 50798: {'lr': 0.0003774567136271686, 'samples': 9753216, 'steps': 50797, 'loss/train': 1.9043644666671753} 11/07/2021 04:23:16 - INFO - __main__ - Step 50799: {'lr': 0.0003774521483281568, 'samples': 9753408, 'steps': 50798, 'loss/train': 1.689976692199707} 11/07/2021 04:23:16 - INFO - __main__ - Step 50800: {'lr': 0.00037744758297171706, 'samples': 9753600, 'steps': 50799, 'loss/train': 1.27703058719635} 11/07/2021 04:23:17 - INFO - __main__ - Step 50801: {'lr': 0.00037744301755785137, 'samples': 9753792, 'steps': 50800, 'loss/train': 1.5397943258285522} 11/07/2021 04:23:17 - INFO - __main__ - Step 50802: {'lr': 0.0003774384520865618, 'samples': 9753984, 'steps': 50801, 'loss/train': 1.4204710721969604} 11/07/2021 04:23:17 - INFO - __main__ - Step 50803: {'lr': 0.0003774338865578505, 'samples': 9754176, 'steps': 50802, 'loss/train': 1.2714866399765015} 11/07/2021 04:23:18 - INFO - __main__ - Step 50804: {'lr': 0.00037742932097171945, 'samples': 9754368, 'steps': 50803, 'loss/train': 1.2413955926895142} 11/07/2021 04:23:19 - INFO - __main__ - Step 50805: {'lr': 0.0003774247553281707, 'samples': 9754560, 'steps': 50804, 'loss/train': 1.5885212421417236} 11/07/2021 04:23:19 - INFO - __main__ - Step 50806: {'lr': 0.00037742018962720625, 'samples': 9754752, 'steps': 50805, 'loss/train': 1.8339276313781738} 11/07/2021 04:23:19 - INFO - __main__ - Step 50807: {'lr': 0.0003774156238688282, 'samples': 9754944, 'steps': 50806, 'loss/train': 1.2531362771987915} 11/07/2021 04:23:20 - INFO - __main__ - Step 50808: {'lr': 0.00037741105805303874, 'samples': 9755136, 'steps': 50807, 'loss/train': 1.233770489692688} 11/07/2021 04:23:21 - INFO - __main__ - Step 50809: {'lr': 0.0003774064921798399, 'samples': 9755328, 'steps': 50808, 'loss/train': 1.383651614189148} 11/07/2021 04:23:21 - INFO - __main__ - Step 50810: {'lr': 0.00037740192624923354, 'samples': 9755520, 'steps': 50809, 'loss/train': 1.0668141841888428} 11/07/2021 04:23:22 - INFO - __main__ - Step 50811: {'lr': 0.00037739736026122186, 'samples': 9755712, 'steps': 50810, 'loss/train': 1.4635947942733765} 11/07/2021 04:23:22 - INFO - __main__ - Step 50812: {'lr': 0.00037739279421580683, 'samples': 9755904, 'steps': 50811, 'loss/train': 1.1940083503723145} 11/07/2021 04:23:22 - INFO - __main__ - Step 50813: {'lr': 0.00037738822811299067, 'samples': 9756096, 'steps': 50812, 'loss/train': 1.7556402683258057} 11/07/2021 04:23:23 - INFO - __main__ - Step 50814: {'lr': 0.00037738366195277527, 'samples': 9756288, 'steps': 50813, 'loss/train': 1.406965970993042} 11/07/2021 04:23:24 - INFO - __main__ - Step 50815: {'lr': 0.0003773790957351628, 'samples': 9756480, 'steps': 50814, 'loss/train': 1.1583935022354126} 11/07/2021 04:23:24 - INFO - __main__ - Step 50816: {'lr': 0.00037737452946015533, 'samples': 9756672, 'steps': 50815, 'loss/train': 1.148431420326233} 11/07/2021 04:23:24 - INFO - __main__ - Step 50817: {'lr': 0.0003773699631277548, 'samples': 9756864, 'steps': 50816, 'loss/train': 1.3440370559692383} 11/07/2021 04:23:25 - INFO - __main__ - Step 50818: {'lr': 0.00037736539673796334, 'samples': 9757056, 'steps': 50817, 'loss/train': 1.1187483072280884} 11/07/2021 04:23:26 - INFO - __main__ - Step 50819: {'lr': 0.00037736083029078294, 'samples': 9757248, 'steps': 50818, 'loss/train': 1.1490564346313477} 11/07/2021 04:23:27 - INFO - __main__ - Step 50820: {'lr': 0.00037735626378621577, 'samples': 9757440, 'steps': 50819, 'loss/train': 1.4225025177001953} 11/07/2021 04:23:27 - INFO - __main__ - Step 50821: {'lr': 0.00037735169722426384, 'samples': 9757632, 'steps': 50820, 'loss/train': 5.91687536239624} 11/07/2021 04:23:27 - INFO - __main__ - Step 50822: {'lr': 0.0003773471306049292, 'samples': 9757824, 'steps': 50821, 'loss/train': 5.8392863273620605} 11/07/2021 04:23:28 - INFO - __main__ - Step 50823: {'lr': 0.00037734256392821393, 'samples': 9758016, 'steps': 50822, 'loss/train': 0.9705638289451599} 11/07/2021 04:23:28 - INFO - __main__ - Step 50824: {'lr': 0.00037733799719411997, 'samples': 9758208, 'steps': 50823, 'loss/train': 1.161920428276062} 11/07/2021 04:23:28 - INFO - __main__ - Step 50825: {'lr': 0.00037733343040264954, 'samples': 9758400, 'steps': 50824, 'loss/train': 2.0557708740234375} 11/07/2021 04:23:29 - INFO - __main__ - Step 50826: {'lr': 0.00037732886355380465, 'samples': 9758592, 'steps': 50825, 'loss/train': 1.666393518447876} 11/07/2021 04:23:30 - INFO - __main__ - Step 50827: {'lr': 0.00037732429664758725, 'samples': 9758784, 'steps': 50826, 'loss/train': 1.5360578298568726} 11/07/2021 04:23:30 - INFO - __main__ - Step 50828: {'lr': 0.0003773197296839996, 'samples': 9758976, 'steps': 50827, 'loss/train': 1.7051162719726562} 11/07/2021 04:23:30 - INFO - __main__ - Step 50829: {'lr': 0.00037731516266304355, 'samples': 9759168, 'steps': 50828, 'loss/train': 1.089630365371704} 11/07/2021 04:23:31 - INFO - __main__ - Step 50830: {'lr': 0.00037731059558472136, 'samples': 9759360, 'steps': 50829, 'loss/train': 1.4240920543670654} 11/07/2021 04:23:32 - INFO - __main__ - Step 50831: {'lr': 0.00037730602844903495, 'samples': 9759552, 'steps': 50830, 'loss/train': 1.5600533485412598} 11/07/2021 04:23:32 - INFO - __main__ - Step 50832: {'lr': 0.00037730146125598634, 'samples': 9759744, 'steps': 50831, 'loss/train': 1.3985621929168701} 11/07/2021 04:23:32 - INFO - __main__ - Step 50833: {'lr': 0.0003772968940055777, 'samples': 9759936, 'steps': 50832, 'loss/train': 1.2548178434371948} 11/07/2021 04:23:33 - INFO - __main__ - Step 50834: {'lr': 0.000377292326697811, 'samples': 9760128, 'steps': 50833, 'loss/train': 0.6145873665809631} 11/07/2021 04:23:33 - INFO - __main__ - Step 50835: {'lr': 0.00037728775933268844, 'samples': 9760320, 'steps': 50834, 'loss/train': 1.3260642290115356} 11/07/2021 04:23:34 - INFO - __main__ - Step 50836: {'lr': 0.0003772831919102119, 'samples': 9760512, 'steps': 50835, 'loss/train': 1.3751753568649292} 11/07/2021 04:23:35 - INFO - __main__ - Step 50837: {'lr': 0.00037727862443038353, 'samples': 9760704, 'steps': 50836, 'loss/train': 1.3612593412399292} 11/07/2021 04:23:35 - INFO - __main__ - Step 50838: {'lr': 0.00037727405689320535, 'samples': 9760896, 'steps': 50837, 'loss/train': 1.3940775394439697} 11/07/2021 04:23:35 - INFO - __main__ - Step 50839: {'lr': 0.00037726948929867955, 'samples': 9761088, 'steps': 50838, 'loss/train': 1.1814788579940796} 11/07/2021 04:23:36 - INFO - __main__ - Step 50840: {'lr': 0.00037726492164680796, 'samples': 9761280, 'steps': 50839, 'loss/train': 0.9084142446517944} 11/07/2021 04:23:37 - INFO - __main__ - Step 50841: {'lr': 0.00037726035393759286, 'samples': 9761472, 'steps': 50840, 'loss/train': 1.009061574935913} 11/07/2021 04:23:37 - INFO - __main__ - Step 50842: {'lr': 0.00037725578617103605, 'samples': 9761664, 'steps': 50841, 'loss/train': 1.5422435998916626} 11/07/2021 04:23:38 - INFO - __main__ - Step 50843: {'lr': 0.00037725121834713995, 'samples': 9761856, 'steps': 50842, 'loss/train': 0.6560593247413635} 11/07/2021 04:23:38 - INFO - __main__ - Step 50844: {'lr': 0.0003772466504659063, 'samples': 9762048, 'steps': 50843, 'loss/train': 1.5334835052490234} 11/07/2021 04:23:38 - INFO - __main__ - Step 50845: {'lr': 0.00037724208252733725, 'samples': 9762240, 'steps': 50844, 'loss/train': 1.6152210235595703} 11/07/2021 04:23:39 - INFO - __main__ - Step 50846: {'lr': 0.000377237514531435, 'samples': 9762432, 'steps': 50845, 'loss/train': 0.46458733081817627} 11/07/2021 04:23:40 - INFO - __main__ - Step 50847: {'lr': 0.0003772329464782014, 'samples': 9762624, 'steps': 50846, 'loss/train': 0.9685691595077515} 11/07/2021 04:23:40 - INFO - __main__ - Step 50848: {'lr': 0.00037722837836763856, 'samples': 9762816, 'steps': 50847, 'loss/train': 1.5266244411468506} 11/07/2021 04:23:40 - INFO - __main__ - Step 50849: {'lr': 0.0003772238101997486, 'samples': 9763008, 'steps': 50848, 'loss/train': 1.5096989870071411} 11/07/2021 04:23:41 - INFO - __main__ - Step 50850: {'lr': 0.0003772192419745336, 'samples': 9763200, 'steps': 50849, 'loss/train': 1.3950402736663818} 11/07/2021 04:23:41 - INFO - __main__ - Step 50851: {'lr': 0.0003772146736919956, 'samples': 9763392, 'steps': 50850, 'loss/train': 1.409664511680603} 11/07/2021 04:23:42 - INFO - __main__ - Step 50852: {'lr': 0.0003772101053521366, 'samples': 9763584, 'steps': 50851, 'loss/train': 0.7390536665916443} 11/07/2021 04:23:43 - INFO - __main__ - Step 50853: {'lr': 0.0003772055369549586, 'samples': 9763776, 'steps': 50852, 'loss/train': 1.3061412572860718} 11/07/2021 04:23:43 - INFO - __main__ - Step 50854: {'lr': 0.0003772009685004638, 'samples': 9763968, 'steps': 50853, 'loss/train': 1.6045711040496826} 11/07/2021 04:23:43 - INFO - __main__ - Step 50855: {'lr': 0.0003771963999886543, 'samples': 9764160, 'steps': 50854, 'loss/train': 1.455470323562622} 11/07/2021 04:23:44 - INFO - __main__ - Step 50856: {'lr': 0.000377191831419532, 'samples': 9764352, 'steps': 50855, 'loss/train': 1.2654258012771606} 11/07/2021 04:23:45 - INFO - __main__ - Step 50857: {'lr': 0.000377187262793099, 'samples': 9764544, 'steps': 50856, 'loss/train': 0.07830142974853516} 11/07/2021 04:23:45 - INFO - __main__ - Step 50858: {'lr': 0.0003771826941093574, 'samples': 9764736, 'steps': 50857, 'loss/train': 1.545218586921692} 11/07/2021 04:23:45 - INFO - __main__ - Step 50859: {'lr': 0.0003771781253683092, 'samples': 9764928, 'steps': 50858, 'loss/train': 1.5167969465255737} 11/07/2021 04:23:46 - INFO - __main__ - Step 50860: {'lr': 0.00037717355656995653, 'samples': 9765120, 'steps': 50859, 'loss/train': 0.9585990309715271} 11/07/2021 04:23:46 - INFO - __main__ - Step 50861: {'lr': 0.0003771689877143015, 'samples': 9765312, 'steps': 50860, 'loss/train': 0.734885573387146} 11/07/2021 04:23:47 - INFO - __main__ - Step 50862: {'lr': 0.000377164418801346, 'samples': 9765504, 'steps': 50861, 'loss/train': 1.4071004390716553} 11/07/2021 04:23:47 - INFO - __main__ - Step 50863: {'lr': 0.0003771598498310922, 'samples': 9765696, 'steps': 50862, 'loss/train': 1.0230638980865479} 11/07/2021 04:23:48 - INFO - __main__ - Step 50864: {'lr': 0.0003771552808035421, 'samples': 9765888, 'steps': 50863, 'loss/train': 1.6071019172668457} 11/07/2021 04:23:48 - INFO - __main__ - Step 50865: {'lr': 0.0003771507117186978, 'samples': 9766080, 'steps': 50864, 'loss/train': 1.5947484970092773} 11/07/2021 04:23:49 - INFO - __main__ - Step 50866: {'lr': 0.0003771461425765614, 'samples': 9766272, 'steps': 50865, 'loss/train': 1.2703443765640259} 11/07/2021 04:23:50 - INFO - __main__ - Step 50867: {'lr': 0.00037714157337713483, 'samples': 9766464, 'steps': 50866, 'loss/train': 1.5110069513320923} 11/07/2021 04:23:50 - INFO - __main__ - Step 50868: {'lr': 0.0003771370041204203, 'samples': 9766656, 'steps': 50867, 'loss/train': 1.8053284883499146} 11/07/2021 04:23:50 - INFO - __main__ - Step 50869: {'lr': 0.0003771324348064198, 'samples': 9766848, 'steps': 50868, 'loss/train': 1.1192725896835327} 11/07/2021 04:23:51 - INFO - __main__ - Step 50870: {'lr': 0.00037712786543513534, 'samples': 9767040, 'steps': 50869, 'loss/train': 1.066977858543396} 11/07/2021 04:23:51 - INFO - __main__ - Step 50871: {'lr': 0.000377123296006569, 'samples': 9767232, 'steps': 50870, 'loss/train': 1.4052960872650146} 11/07/2021 04:23:52 - INFO - __main__ - Step 50872: {'lr': 0.000377118726520723, 'samples': 9767424, 'steps': 50871, 'loss/train': 1.4068541526794434} 11/07/2021 04:23:52 - INFO - __main__ - Step 50873: {'lr': 0.0003771141569775991, 'samples': 9767616, 'steps': 50872, 'loss/train': 1.9116144180297852} 11/07/2021 04:23:53 - INFO - __main__ - Step 50874: {'lr': 0.0003771095873771996, 'samples': 9767808, 'steps': 50873, 'loss/train': 1.5371664762496948} 11/07/2021 04:23:53 - INFO - __main__ - Step 50875: {'lr': 0.0003771050177195265, 'samples': 9768000, 'steps': 50874, 'loss/train': 1.178216814994812} 11/07/2021 04:23:53 - INFO - __main__ - Step 50876: {'lr': 0.0003771004480045818, 'samples': 9768192, 'steps': 50875, 'loss/train': 1.0844007730484009} 11/07/2021 04:23:54 - INFO - __main__ - Step 50877: {'lr': 0.00037709587823236767, 'samples': 9768384, 'steps': 50876, 'loss/train': 1.0935975313186646} 11/07/2021 04:23:55 - INFO - __main__ - Step 50878: {'lr': 0.00037709130840288605, 'samples': 9768576, 'steps': 50877, 'loss/train': 1.5046992301940918} 11/07/2021 04:23:56 - INFO - __main__ - Step 50879: {'lr': 0.00037708673851613903, 'samples': 9768768, 'steps': 50878, 'loss/train': 1.512107014656067} 11/07/2021 04:23:56 - INFO - __main__ - Step 50880: {'lr': 0.00037708216857212863, 'samples': 9768960, 'steps': 50879, 'loss/train': 1.4767941236495972} 11/07/2021 04:23:56 - INFO - __main__ - Step 50881: {'lr': 0.0003770775985708571, 'samples': 9769152, 'steps': 50880, 'loss/train': 1.2471498250961304} 11/07/2021 04:23:57 - INFO - __main__ - Step 50882: {'lr': 0.0003770730285123263, 'samples': 9769344, 'steps': 50881, 'loss/train': 1.632147192955017} 11/07/2021 04:23:58 - INFO - __main__ - Step 50883: {'lr': 0.0003770684583965384, 'samples': 9769536, 'steps': 50882, 'loss/train': 0.994832456111908} 11/07/2021 04:23:58 - INFO - __main__ - Step 50884: {'lr': 0.0003770638882234953, 'samples': 9769728, 'steps': 50883, 'loss/train': 1.422627568244934} 11/07/2021 04:23:58 - INFO - __main__ - Step 50885: {'lr': 0.0003770593179931993, 'samples': 9769920, 'steps': 50884, 'loss/train': 1.1419776678085327} 11/07/2021 04:23:59 - INFO - __main__ - Step 50886: {'lr': 0.00037705474770565215, 'samples': 9770112, 'steps': 50885, 'loss/train': 1.7810052633285522} 11/07/2021 04:23:59 - INFO - __main__ - Step 50887: {'lr': 0.00037705017736085623, 'samples': 9770304, 'steps': 50886, 'loss/train': 1.781874179840088} 11/07/2021 04:23:59 - INFO - __main__ - Step 50888: {'lr': 0.00037704560695881346, 'samples': 9770496, 'steps': 50887, 'loss/train': 1.0416572093963623} 11/07/2021 04:24:01 - INFO - __main__ - Step 50889: {'lr': 0.0003770410364995259, 'samples': 9770688, 'steps': 50888, 'loss/train': 1.0607938766479492} 11/07/2021 04:24:01 - INFO - __main__ - Step 50890: {'lr': 0.00037703646598299554, 'samples': 9770880, 'steps': 50889, 'loss/train': 1.4012714624404907} 11/07/2021 04:24:02 - INFO - __main__ - Step 50891: {'lr': 0.00037703189540922463, 'samples': 9771072, 'steps': 50890, 'loss/train': 1.1869800090789795} 11/07/2021 04:24:02 - INFO - __main__ - Step 50892: {'lr': 0.000377027324778215, 'samples': 9771264, 'steps': 50891, 'loss/train': 1.6377372741699219} 11/07/2021 04:24:02 - INFO - __main__ - Step 50893: {'lr': 0.0003770227540899689, 'samples': 9771456, 'steps': 50892, 'loss/train': 1.1716175079345703} 11/07/2021 04:24:03 - INFO - __main__ - Step 50894: {'lr': 0.0003770181833444882, 'samples': 9771648, 'steps': 50893, 'loss/train': 0.227674663066864} 11/07/2021 04:24:04 - INFO - __main__ - Step 50895: {'lr': 0.0003770136125417751, 'samples': 9771840, 'steps': 50894, 'loss/train': 1.4408305883407593} 11/07/2021 04:24:04 - INFO - __main__ - Step 50896: {'lr': 0.0003770090416818317, 'samples': 9772032, 'steps': 50895, 'loss/train': 1.4755432605743408} 11/07/2021 04:24:05 - INFO - __main__ - Step 50897: {'lr': 0.00037700447076465996, 'samples': 9772224, 'steps': 50896, 'loss/train': 1.4067610502243042} 11/07/2021 04:24:05 - INFO - __main__ - Step 50898: {'lr': 0.0003769998997902619, 'samples': 9772416, 'steps': 50897, 'loss/train': 1.731428623199463} 11/07/2021 04:24:05 - INFO - __main__ - Step 50899: {'lr': 0.00037699532875863976, 'samples': 9772608, 'steps': 50898, 'loss/train': 1.5574582815170288} 11/07/2021 04:24:06 - INFO - __main__ - Step 50900: {'lr': 0.0003769907576697954, 'samples': 9772800, 'steps': 50899, 'loss/train': 1.311132550239563} 11/07/2021 04:24:07 - INFO - __main__ - Step 50901: {'lr': 0.000376986186523731, 'samples': 9772992, 'steps': 50900, 'loss/train': 1.404496192932129} 11/07/2021 04:24:07 - INFO - __main__ - Step 50902: {'lr': 0.0003769816153204485, 'samples': 9773184, 'steps': 50901, 'loss/train': 1.2235862016677856} 11/07/2021 04:24:07 - INFO - __main__ - Step 50903: {'lr': 0.00037697704405995015, 'samples': 9773376, 'steps': 50902, 'loss/train': 1.45487642288208} 11/07/2021 04:24:08 - INFO - __main__ - Step 50904: {'lr': 0.0003769724727422379, 'samples': 9773568, 'steps': 50903, 'loss/train': 1.3382630348205566} 11/07/2021 04:24:09 - INFO - __main__ - Step 50905: {'lr': 0.0003769679013673137, 'samples': 9773760, 'steps': 50904, 'loss/train': 1.6522085666656494} 11/07/2021 04:24:09 - INFO - __main__ - Step 50906: {'lr': 0.00037696332993517983, 'samples': 9773952, 'steps': 50905, 'loss/train': 1.360925316810608} 11/07/2021 04:24:09 - INFO - __main__ - Step 50907: {'lr': 0.0003769587584458382, 'samples': 9774144, 'steps': 50906, 'loss/train': 1.3342199325561523} 11/07/2021 04:24:10 - INFO - __main__ - Step 50908: {'lr': 0.00037695418689929095, 'samples': 9774336, 'steps': 50907, 'loss/train': 1.4487676620483398} 11/07/2021 04:24:10 - INFO - __main__ - Step 50909: {'lr': 0.00037694961529554006, 'samples': 9774528, 'steps': 50908, 'loss/train': 1.2917283773422241} 11/07/2021 04:24:11 - INFO - __main__ - Step 50910: {'lr': 0.0003769450436345877, 'samples': 9774720, 'steps': 50909, 'loss/train': 1.2469871044158936} 11/07/2021 04:24:11 - INFO - __main__ - Step 50911: {'lr': 0.00037694047191643576, 'samples': 9774912, 'steps': 50910, 'loss/train': 0.7166115045547485} 11/07/2021 04:24:12 - INFO - __main__ - Step 50912: {'lr': 0.00037693590014108646, 'samples': 9775104, 'steps': 50911, 'loss/train': 1.480364203453064} 11/07/2021 04:24:12 - INFO - __main__ - Step 50913: {'lr': 0.0003769313283085418, 'samples': 9775296, 'steps': 50912, 'loss/train': 1.0824471712112427} 11/07/2021 04:24:12 - INFO - __main__ - Step 50914: {'lr': 0.0003769267564188038, 'samples': 9775488, 'steps': 50913, 'loss/train': 1.4203639030456543} 11/07/2021 04:24:13 - INFO - __main__ - Step 50915: {'lr': 0.0003769221844718746, 'samples': 9775680, 'steps': 50914, 'loss/train': 1.8766120672225952} 11/07/2021 04:24:14 - INFO - __main__ - Step 50916: {'lr': 0.00037691761246775625, 'samples': 9775872, 'steps': 50915, 'loss/train': 1.2549500465393066} 11/07/2021 04:24:14 - INFO - __main__ - Step 50917: {'lr': 0.00037691304040645074, 'samples': 9776064, 'steps': 50916, 'loss/train': 1.245263695716858} 11/07/2021 04:24:15 - INFO - __main__ - Step 50918: {'lr': 0.00037690846828796024, 'samples': 9776256, 'steps': 50917, 'loss/train': 1.5781947374343872} 11/07/2021 04:24:15 - INFO - __main__ - Step 50919: {'lr': 0.00037690389611228664, 'samples': 9776448, 'steps': 50918, 'loss/train': 1.6627144813537598} 11/07/2021 04:24:16 - INFO - __main__ - Step 50920: {'lr': 0.00037689932387943216, 'samples': 9776640, 'steps': 50919, 'loss/train': 1.681615948677063} 11/07/2021 04:24:16 - INFO - __main__ - Step 50921: {'lr': 0.0003768947515893988, 'samples': 9776832, 'steps': 50920, 'loss/train': 2.0066945552825928} 11/07/2021 04:24:17 - INFO - __main__ - Step 50922: {'lr': 0.0003768901792421886, 'samples': 9777024, 'steps': 50921, 'loss/train': 1.188463568687439} 11/07/2021 04:24:17 - INFO - __main__ - Step 50923: {'lr': 0.0003768856068378036, 'samples': 9777216, 'steps': 50922, 'loss/train': 1.2847148180007935} 11/07/2021 04:24:17 - INFO - __main__ - Step 50924: {'lr': 0.000376881034376246, 'samples': 9777408, 'steps': 50923, 'loss/train': 1.717268943786621} 11/07/2021 04:24:18 - INFO - __main__ - Step 50925: {'lr': 0.0003768764618575178, 'samples': 9777600, 'steps': 50924, 'loss/train': 1.064644694328308} 11/07/2021 04:24:19 - INFO - __main__ - Step 50926: {'lr': 0.00037687188928162087, 'samples': 9777792, 'steps': 50925, 'loss/train': 1.1622135639190674} 11/07/2021 04:24:19 - INFO - __main__ - Step 50927: {'lr': 0.00037686731664855755, 'samples': 9777984, 'steps': 50926, 'loss/train': 1.6185857057571411} 11/07/2021 04:24:19 - INFO - __main__ - Step 50928: {'lr': 0.0003768627439583297, 'samples': 9778176, 'steps': 50927, 'loss/train': 1.2021294832229614} 11/07/2021 04:24:20 - INFO - __main__ - Step 50929: {'lr': 0.00037685817121093946, 'samples': 9778368, 'steps': 50928, 'loss/train': 1.309601068496704} 11/07/2021 04:24:21 - INFO - __main__ - Step 50930: {'lr': 0.000376853598406389, 'samples': 9778560, 'steps': 50929, 'loss/train': 0.9625622630119324} 11/07/2021 04:24:21 - INFO - __main__ - Step 50931: {'lr': 0.00037684902554468015, 'samples': 9778752, 'steps': 50930, 'loss/train': 1.2327609062194824} 11/07/2021 04:24:22 - INFO - __main__ - Step 50932: {'lr': 0.0003768444526258151, 'samples': 9778944, 'steps': 50931, 'loss/train': 1.4789179563522339} 11/07/2021 04:24:22 - INFO - __main__ - Step 50933: {'lr': 0.0003768398796497959, 'samples': 9779136, 'steps': 50932, 'loss/train': 1.0722277164459229} 11/07/2021 04:24:22 - INFO - __main__ - Step 50934: {'lr': 0.00037683530661662457, 'samples': 9779328, 'steps': 50933, 'loss/train': 1.4856834411621094} 11/07/2021 04:24:23 - INFO - __main__ - Step 50935: {'lr': 0.00037683073352630327, 'samples': 9779520, 'steps': 50934, 'loss/train': 1.6969283819198608} 11/07/2021 04:24:24 - INFO - __main__ - Step 50936: {'lr': 0.000376826160378834, 'samples': 9779712, 'steps': 50935, 'loss/train': 1.0604041814804077} 11/07/2021 04:24:24 - INFO - __main__ - Step 50937: {'lr': 0.0003768215871742188, 'samples': 9779904, 'steps': 50936, 'loss/train': 1.147591471672058} 11/07/2021 04:24:24 - INFO - __main__ - Step 50938: {'lr': 0.00037681701391245983, 'samples': 9780096, 'steps': 50937, 'loss/train': 0.8042525053024292} 11/07/2021 04:24:25 - INFO - __main__ - Step 50939: {'lr': 0.0003768124405935589, 'samples': 9780288, 'steps': 50938, 'loss/train': 1.5059691667556763} 11/07/2021 04:24:25 - INFO - __main__ - Step 50940: {'lr': 0.00037680786721751834, 'samples': 9780480, 'steps': 50939, 'loss/train': 1.6961679458618164} 11/07/2021 04:24:26 - INFO - __main__ - Step 50941: {'lr': 0.0003768032937843401, 'samples': 9780672, 'steps': 50940, 'loss/train': 1.6141343116760254} 11/07/2021 04:24:26 - INFO - __main__ - Step 50942: {'lr': 0.00037679872029402627, 'samples': 9780864, 'steps': 50941, 'loss/train': 1.6517798900604248} 11/07/2021 04:24:27 - INFO - __main__ - Step 50943: {'lr': 0.0003767941467465789, 'samples': 9781056, 'steps': 50942, 'loss/train': 2.1457533836364746} 11/07/2021 04:24:27 - INFO - __main__ - Step 50944: {'lr': 0.000376789573142, 'samples': 9781248, 'steps': 50943, 'loss/train': 1.5147534608840942} 11/07/2021 04:24:28 - INFO - __main__ - Step 50945: {'lr': 0.0003767849994802918, 'samples': 9781440, 'steps': 50944, 'loss/train': 1.5164519548416138} 11/07/2021 04:24:29 - INFO - __main__ - Step 50946: {'lr': 0.0003767804257614561, 'samples': 9781632, 'steps': 50945, 'loss/train': 1.181636095046997} 11/07/2021 04:24:29 - INFO - __main__ - Step 50947: {'lr': 0.00037677585198549516, 'samples': 9781824, 'steps': 50946, 'loss/train': 1.5399096012115479} 11/07/2021 04:24:29 - INFO - __main__ - Step 50948: {'lr': 0.00037677127815241086, 'samples': 9782016, 'steps': 50947, 'loss/train': 1.3078244924545288} 11/07/2021 04:24:30 - INFO - __main__ - Step 50949: {'lr': 0.00037676670426220547, 'samples': 9782208, 'steps': 50948, 'loss/train': 1.358941912651062} 11/07/2021 04:24:30 - INFO - __main__ - Step 50950: {'lr': 0.00037676213031488095, 'samples': 9782400, 'steps': 50949, 'loss/train': 1.3406785726547241} 11/07/2021 04:24:31 - INFO - __main__ - Step 50951: {'lr': 0.0003767575563104394, 'samples': 9782592, 'steps': 50950, 'loss/train': 1.4531294107437134} 11/07/2021 04:24:31 - INFO - __main__ - Step 50952: {'lr': 0.00037675298224888287, 'samples': 9782784, 'steps': 50951, 'loss/train': 1.345122218132019} 11/07/2021 04:24:32 - INFO - __main__ - Step 50953: {'lr': 0.0003767484081302133, 'samples': 9782976, 'steps': 50952, 'loss/train': 1.617568850517273} 11/07/2021 04:24:32 - INFO - __main__ - Step 50954: {'lr': 0.000376743833954433, 'samples': 9783168, 'steps': 50953, 'loss/train': 1.5393524169921875} 11/07/2021 04:24:33 - INFO - __main__ - Step 50955: {'lr': 0.00037673925972154376, 'samples': 9783360, 'steps': 50954, 'loss/train': 1.5331530570983887} 11/07/2021 04:24:34 - INFO - __main__ - Step 50956: {'lr': 0.00037673468543154777, 'samples': 9783552, 'steps': 50955, 'loss/train': 0.05478181689977646} 11/07/2021 04:24:34 - INFO - __main__ - Step 50957: {'lr': 0.0003767301110844472, 'samples': 9783744, 'steps': 50956, 'loss/train': 1.8871591091156006} 11/07/2021 04:24:34 - INFO - __main__ - Step 50958: {'lr': 0.0003767255366802439, 'samples': 9783936, 'steps': 50957, 'loss/train': 1.2720681428909302} 11/07/2021 04:24:35 - INFO - __main__ - Step 50959: {'lr': 0.00037672096221894004, 'samples': 9784128, 'steps': 50958, 'loss/train': 1.5616569519042969} 11/07/2021 04:24:35 - INFO - __main__ - Step 50960: {'lr': 0.0003767163877005376, 'samples': 9784320, 'steps': 50959, 'loss/train': 1.6533688306808472} 11/07/2021 04:24:36 - INFO - __main__ - Step 50961: {'lr': 0.0003767118131250388, 'samples': 9784512, 'steps': 50960, 'loss/train': 2.0300369262695312} 11/07/2021 04:24:36 - INFO - __main__ - Step 50962: {'lr': 0.00037670723849244557, 'samples': 9784704, 'steps': 50961, 'loss/train': 1.1564273834228516} 11/07/2021 04:24:37 - INFO - __main__ - Step 50963: {'lr': 0.0003767026638027601, 'samples': 9784896, 'steps': 50962, 'loss/train': 1.2109038829803467} 11/07/2021 04:24:37 - INFO - __main__ - Step 50964: {'lr': 0.00037669808905598434, 'samples': 9785088, 'steps': 50963, 'loss/train': 2.1062874794006348} 11/07/2021 04:24:37 - INFO - __main__ - Step 50965: {'lr': 0.0003766935142521203, 'samples': 9785280, 'steps': 50964, 'loss/train': 1.395483136177063} 11/07/2021 04:24:38 - INFO - __main__ - Step 50966: {'lr': 0.00037668893939117023, 'samples': 9785472, 'steps': 50965, 'loss/train': 1.069337010383606} 11/07/2021 04:24:39 - INFO - __main__ - Step 50967: {'lr': 0.000376684364473136, 'samples': 9785664, 'steps': 50966, 'loss/train': 1.7145888805389404} 11/07/2021 04:24:39 - INFO - __main__ - Step 50968: {'lr': 0.00037667978949801974, 'samples': 9785856, 'steps': 50967, 'loss/train': 1.388250470161438} 11/07/2021 04:24:39 - INFO - __main__ - Step 50969: {'lr': 0.00037667521446582355, 'samples': 9786048, 'steps': 50968, 'loss/train': 1.8198034763336182} 11/07/2021 04:24:40 - INFO - __main__ - Step 50970: {'lr': 0.00037667063937654944, 'samples': 9786240, 'steps': 50969, 'loss/train': 1.6997214555740356} 11/07/2021 04:24:41 - INFO - __main__ - Step 50971: {'lr': 0.00037666606423019956, 'samples': 9786432, 'steps': 50970, 'loss/train': 1.4291136264801025} 11/07/2021 04:24:41 - INFO - __main__ - Step 50972: {'lr': 0.00037666148902677576, 'samples': 9786624, 'steps': 50971, 'loss/train': 1.5241873264312744} 11/07/2021 04:24:42 - INFO - __main__ - Step 50973: {'lr': 0.0003766569137662804, 'samples': 9786816, 'steps': 50972, 'loss/train': 1.5081205368041992} 11/07/2021 04:24:42 - INFO - __main__ - Step 50974: {'lr': 0.00037665233844871534, 'samples': 9787008, 'steps': 50973, 'loss/train': 1.5813207626342773} 11/07/2021 04:24:42 - INFO - __main__ - Step 50975: {'lr': 0.0003766477630740827, 'samples': 9787200, 'steps': 50974, 'loss/train': 0.9070396423339844} 11/07/2021 04:24:43 - INFO - __main__ - Step 50976: {'lr': 0.00037664318764238445, 'samples': 9787392, 'steps': 50975, 'loss/train': 0.4607886075973511} 11/07/2021 04:24:44 - INFO - __main__ - Step 50977: {'lr': 0.0003766386121536228, 'samples': 9787584, 'steps': 50976, 'loss/train': 3.141408681869507} 11/07/2021 04:24:44 - INFO - __main__ - Step 50978: {'lr': 0.00037663403660779984, 'samples': 9787776, 'steps': 50977, 'loss/train': 1.2351731061935425} 11/07/2021 04:24:44 - INFO - __main__ - Step 50979: {'lr': 0.00037662946100491736, 'samples': 9787968, 'steps': 50978, 'loss/train': 1.4202797412872314} 11/07/2021 04:24:45 - INFO - __main__ - Step 50980: {'lr': 0.00037662488534497766, 'samples': 9788160, 'steps': 50979, 'loss/train': 0.7293932437896729} 11/07/2021 04:24:45 - INFO - __main__ - Step 50981: {'lr': 0.0003766203096279828, 'samples': 9788352, 'steps': 50980, 'loss/train': 2.311060667037964} 11/07/2021 04:24:46 - INFO - __main__ - Step 50982: {'lr': 0.00037661573385393477, 'samples': 9788544, 'steps': 50981, 'loss/train': 1.1850734949111938} 11/07/2021 04:24:47 - INFO - __main__ - Step 50983: {'lr': 0.0003766111580228356, 'samples': 9788736, 'steps': 50982, 'loss/train': 1.104164719581604} 11/07/2021 04:24:47 - INFO - __main__ - Step 50984: {'lr': 0.00037660658213468744, 'samples': 9788928, 'steps': 50983, 'loss/train': 1.3133814334869385} 11/07/2021 04:24:47 - INFO - __main__ - Step 50985: {'lr': 0.00037660200618949225, 'samples': 9789120, 'steps': 50984, 'loss/train': 1.201141357421875} 11/07/2021 04:24:48 - INFO - __main__ - Step 50986: {'lr': 0.0003765974301872522, 'samples': 9789312, 'steps': 50985, 'loss/train': 1.0673134326934814} 11/07/2021 04:24:49 - INFO - __main__ - Step 50987: {'lr': 0.0003765928541279693, 'samples': 9789504, 'steps': 50986, 'loss/train': 1.4994068145751953} 11/07/2021 04:24:49 - INFO - __main__ - Step 50988: {'lr': 0.0003765882780116455, 'samples': 9789696, 'steps': 50987, 'loss/train': 0.5558611750602722} 11/07/2021 04:24:49 - INFO - __main__ - Step 50989: {'lr': 0.0003765837018382831, 'samples': 9789888, 'steps': 50988, 'loss/train': 1.6570909023284912} 11/07/2021 04:24:50 - INFO - __main__ - Step 50990: {'lr': 0.0003765791256078841, 'samples': 9790080, 'steps': 50989, 'loss/train': 1.2512147426605225} 11/07/2021 04:24:50 - INFO - __main__ - Step 50991: {'lr': 0.00037657454932045036, 'samples': 9790272, 'steps': 50990, 'loss/train': 0.9478741884231567} 11/07/2021 04:24:51 - INFO - __main__ - Step 50992: {'lr': 0.00037656997297598417, 'samples': 9790464, 'steps': 50991, 'loss/train': 1.110074520111084} 11/07/2021 04:24:51 - INFO - __main__ - Step 50993: {'lr': 0.0003765653965744874, 'samples': 9790656, 'steps': 50992, 'loss/train': 1.4602985382080078} 11/07/2021 04:24:52 - INFO - __main__ - Step 50994: {'lr': 0.00037656082011596224, 'samples': 9790848, 'steps': 50993, 'loss/train': 1.0839636325836182} 11/07/2021 04:24:52 - INFO - __main__ - Step 50995: {'lr': 0.00037655624360041084, 'samples': 9791040, 'steps': 50994, 'loss/train': 0.879666805267334} 11/07/2021 04:24:52 - INFO - __main__ - Step 50996: {'lr': 0.00037655166702783507, 'samples': 9791232, 'steps': 50995, 'loss/train': 1.4642044305801392} 11/07/2021 04:24:54 - INFO - __main__ - Step 50997: {'lr': 0.0003765470903982371, 'samples': 9791424, 'steps': 50996, 'loss/train': 1.9917117357254028} 11/07/2021 04:24:54 - INFO - __main__ - Step 50998: {'lr': 0.0003765425137116189, 'samples': 9791616, 'steps': 50997, 'loss/train': 1.369557499885559} 11/07/2021 04:24:54 - INFO - __main__ - Step 50999: {'lr': 0.00037653793696798267, 'samples': 9791808, 'steps': 50998, 'loss/train': 1.602256178855896} 11/07/2021 04:24:55 - INFO - __main__ - Step 51000: {'lr': 0.0003765333601673303, 'samples': 9792000, 'steps': 50999, 'loss/train': 1.2036160230636597} 11/07/2021 04:24:55 - INFO - __main__ - Step 51001: {'lr': 0.0003765287833096641, 'samples': 9792192, 'steps': 51000, 'loss/train': 1.0194517374038696} 11/07/2021 04:24:56 - INFO - __main__ - Step 51002: {'lr': 0.00037652420639498583, 'samples': 9792384, 'steps': 51001, 'loss/train': 0.9271731376647949} 11/07/2021 04:24:56 - INFO - __main__ - Step 51003: {'lr': 0.00037651962942329784, 'samples': 9792576, 'steps': 51002, 'loss/train': 1.1674859523773193} 11/07/2021 04:24:57 - INFO - __main__ - Step 51004: {'lr': 0.0003765150523946019, 'samples': 9792768, 'steps': 51003, 'loss/train': 1.4975206851959229} 11/07/2021 04:24:57 - INFO - __main__ - Step 51005: {'lr': 0.00037651047530890035, 'samples': 9792960, 'steps': 51004, 'loss/train': 1.5388249158859253} 11/07/2021 04:24:57 - INFO - __main__ - Step 51006: {'lr': 0.0003765058981661952, 'samples': 9793152, 'steps': 51005, 'loss/train': 1.3791348934173584} 11/07/2021 04:24:58 - INFO - __main__ - Step 51007: {'lr': 0.0003765013209664883, 'samples': 9793344, 'steps': 51006, 'loss/train': 1.4319292306900024} 11/07/2021 04:24:59 - INFO - __main__ - Step 51008: {'lr': 0.00037649674370978195, 'samples': 9793536, 'steps': 51007, 'loss/train': 0.7983530759811401} 11/07/2021 04:24:59 - INFO - __main__ - Step 51009: {'lr': 0.000376492166396078, 'samples': 9793728, 'steps': 51008, 'loss/train': 1.3422341346740723} 11/07/2021 04:24:59 - INFO - __main__ - Step 51010: {'lr': 0.0003764875890253787, 'samples': 9793920, 'steps': 51009, 'loss/train': 1.3380659818649292} 11/07/2021 04:25:00 - INFO - __main__ - Step 51011: {'lr': 0.0003764830115976861, 'samples': 9794112, 'steps': 51010, 'loss/train': 2.4802086353302} 11/07/2021 04:25:00 - INFO - __main__ - Step 51012: {'lr': 0.00037647843411300213, 'samples': 9794304, 'steps': 51011, 'loss/train': 0.9203962087631226} 11/07/2021 04:25:01 - INFO - __main__ - Step 51013: {'lr': 0.00037647385657132895, 'samples': 9794496, 'steps': 51012, 'loss/train': 1.365518569946289} 11/07/2021 04:25:02 - INFO - __main__ - Step 51014: {'lr': 0.0003764692789726686, 'samples': 9794688, 'steps': 51013, 'loss/train': 0.8856534957885742} 11/07/2021 04:25:02 - INFO - __main__ - Step 51015: {'lr': 0.00037646470131702314, 'samples': 9794880, 'steps': 51014, 'loss/train': 1.6138612031936646} 11/07/2021 04:25:02 - INFO - __main__ - Step 51016: {'lr': 0.00037646012360439463, 'samples': 9795072, 'steps': 51015, 'loss/train': 1.444564700126648} 11/07/2021 04:25:03 - INFO - __main__ - Step 51017: {'lr': 0.0003764555458347851, 'samples': 9795264, 'steps': 51016, 'loss/train': 1.4704002141952515} 11/07/2021 04:25:04 - INFO - __main__ - Step 51018: {'lr': 0.00037645096800819684, 'samples': 9795456, 'steps': 51017, 'loss/train': 1.9046685695648193} 11/07/2021 04:25:04 - INFO - __main__ - Step 51019: {'lr': 0.00037644639012463155, 'samples': 9795648, 'steps': 51018, 'loss/train': 1.228771448135376} 11/07/2021 04:25:04 - INFO - __main__ - Step 51020: {'lr': 0.00037644181218409156, 'samples': 9795840, 'steps': 51019, 'loss/train': 1.7821251153945923} 11/07/2021 04:25:05 - INFO - __main__ - Step 51021: {'lr': 0.0003764372341865788, 'samples': 9796032, 'steps': 51020, 'loss/train': 1.356489658355713} 11/07/2021 04:25:05 - INFO - __main__ - Step 51022: {'lr': 0.00037643265613209533, 'samples': 9796224, 'steps': 51021, 'loss/train': 1.0176609754562378} 11/07/2021 04:25:06 - INFO - __main__ - Step 51023: {'lr': 0.00037642807802064327, 'samples': 9796416, 'steps': 51022, 'loss/train': 1.3684649467468262} 11/07/2021 04:25:07 - INFO - __main__ - Step 51024: {'lr': 0.00037642349985222474, 'samples': 9796608, 'steps': 51023, 'loss/train': 1.8060178756713867} 11/07/2021 04:25:07 - INFO - __main__ - Step 51025: {'lr': 0.0003764189216268417, 'samples': 9796800, 'steps': 51024, 'loss/train': 1.5317825078964233} 11/07/2021 04:25:07 - INFO - __main__ - Step 51026: {'lr': 0.0003764143433444962, 'samples': 9796992, 'steps': 51025, 'loss/train': 1.35077702999115} 11/07/2021 04:25:08 - INFO - __main__ - Step 51027: {'lr': 0.00037640976500519035, 'samples': 9797184, 'steps': 51026, 'loss/train': 2.0202128887176514} 11/07/2021 04:25:09 - INFO - __main__ - Step 51028: {'lr': 0.0003764051866089262, 'samples': 9797376, 'steps': 51027, 'loss/train': 1.3916025161743164} 11/07/2021 04:25:09 - INFO - __main__ - Step 51029: {'lr': 0.00037640060815570585, 'samples': 9797568, 'steps': 51028, 'loss/train': 1.1106539964675903} 11/07/2021 04:25:09 - INFO - __main__ - Step 51030: {'lr': 0.0003763960296455314, 'samples': 9797760, 'steps': 51029, 'loss/train': 1.4245356321334839} 11/07/2021 04:25:10 - INFO - __main__ - Step 51031: {'lr': 0.0003763914510784048, 'samples': 9797952, 'steps': 51030, 'loss/train': 1.5696529150009155} 11/07/2021 04:25:10 - INFO - __main__ - Step 51032: {'lr': 0.00037638687245432817, 'samples': 9798144, 'steps': 51031, 'loss/train': 1.0840160846710205} 11/07/2021 04:25:11 - INFO - __main__ - Step 51033: {'lr': 0.00037638229377330356, 'samples': 9798336, 'steps': 51032, 'loss/train': 1.2295622825622559} 11/07/2021 04:25:12 - INFO - __main__ - Step 51034: {'lr': 0.00037637771503533303, 'samples': 9798528, 'steps': 51033, 'loss/train': 1.0383323431015015} 11/07/2021 04:25:12 - INFO - __main__ - Step 51035: {'lr': 0.00037637313624041863, 'samples': 9798720, 'steps': 51034, 'loss/train': 1.4757288694381714} 11/07/2021 04:25:12 - INFO - __main__ - Step 51036: {'lr': 0.00037636855738856247, 'samples': 9798912, 'steps': 51035, 'loss/train': 1.072751522064209} 11/07/2021 04:25:13 - INFO - __main__ - Step 51037: {'lr': 0.00037636397847976656, 'samples': 9799104, 'steps': 51036, 'loss/train': 1.022415041923523} 11/07/2021 04:25:14 - INFO - __main__ - Step 51038: {'lr': 0.00037635939951403307, 'samples': 9799296, 'steps': 51037, 'loss/train': 1.2919124364852905} 11/07/2021 04:25:14 - INFO - __main__ - Step 51039: {'lr': 0.00037635482049136395, 'samples': 9799488, 'steps': 51038, 'loss/train': 2.3762500286102295} 11/07/2021 04:25:14 - INFO - __main__ - Step 51040: {'lr': 0.0003763502414117612, 'samples': 9799680, 'steps': 51039, 'loss/train': 1.3238046169281006} 11/07/2021 04:25:15 - INFO - __main__ - Step 51041: {'lr': 0.0003763456622752271, 'samples': 9799872, 'steps': 51040, 'loss/train': 1.8432047367095947} 11/07/2021 04:25:15 - INFO - __main__ - Step 51042: {'lr': 0.0003763410830817635, 'samples': 9800064, 'steps': 51041, 'loss/train': 1.0838677883148193} 11/07/2021 04:25:15 - INFO - __main__ - Step 51043: {'lr': 0.00037633650383137263, 'samples': 9800256, 'steps': 51042, 'loss/train': 0.8608529567718506} 11/07/2021 04:25:16 - INFO - __main__ - Step 51044: {'lr': 0.0003763319245240565, 'samples': 9800448, 'steps': 51043, 'loss/train': 1.2664395570755005} 11/07/2021 04:25:17 - INFO - __main__ - Step 51045: {'lr': 0.00037632734515981715, 'samples': 9800640, 'steps': 51044, 'loss/train': 1.5198123455047607} 11/07/2021 04:25:17 - INFO - __main__ - Step 51046: {'lr': 0.00037632276573865657, 'samples': 9800832, 'steps': 51045, 'loss/train': 2.2718794345855713} 11/07/2021 04:25:18 - INFO - __main__ - Step 51047: {'lr': 0.00037631818626057695, 'samples': 9801024, 'steps': 51046, 'loss/train': 1.485799789428711} 11/07/2021 04:25:18 - INFO - __main__ - Step 51048: {'lr': 0.0003763136067255803, 'samples': 9801216, 'steps': 51047, 'loss/train': 1.0966688394546509} 11/07/2021 04:25:19 - INFO - __main__ - Step 51049: {'lr': 0.00037630902713366865, 'samples': 9801408, 'steps': 51048, 'loss/train': 1.274632215499878} 11/07/2021 04:25:19 - INFO - __main__ - Step 51050: {'lr': 0.00037630444748484415, 'samples': 9801600, 'steps': 51049, 'loss/train': 1.422975778579712} 11/07/2021 04:25:20 - INFO - __main__ - Step 51051: {'lr': 0.00037629986777910885, 'samples': 9801792, 'steps': 51050, 'loss/train': 1.4993656873703003} 11/07/2021 04:25:20 - INFO - __main__ - Step 51052: {'lr': 0.00037629528801646475, 'samples': 9801984, 'steps': 51051, 'loss/train': 1.592901349067688} 11/07/2021 04:25:20 - INFO - __main__ - Step 51053: {'lr': 0.0003762907081969139, 'samples': 9802176, 'steps': 51052, 'loss/train': 1.0042757987976074} 11/07/2021 04:25:21 - INFO - __main__ - Step 51054: {'lr': 0.00037628612832045846, 'samples': 9802368, 'steps': 51053, 'loss/train': 1.1692345142364502} 11/07/2021 04:25:22 - INFO - __main__ - Step 51055: {'lr': 0.0003762815483871004, 'samples': 9802560, 'steps': 51054, 'loss/train': 0.5680693984031677} 11/07/2021 04:25:22 - INFO - __main__ - Step 51056: {'lr': 0.00037627696839684176, 'samples': 9802752, 'steps': 51055, 'loss/train': 1.7787619829177856} 11/07/2021 04:25:22 - INFO - __main__ - Step 51057: {'lr': 0.0003762723883496848, 'samples': 9802944, 'steps': 51056, 'loss/train': 1.5501517057418823} 11/07/2021 04:25:23 - INFO - __main__ - Step 51058: {'lr': 0.00037626780824563145, 'samples': 9803136, 'steps': 51057, 'loss/train': 1.1457315683364868} 11/07/2021 04:25:24 - INFO - __main__ - Step 51059: {'lr': 0.0003762632280846837, 'samples': 9803328, 'steps': 51058, 'loss/train': 1.0320316553115845} 11/07/2021 04:25:24 - INFO - __main__ - Step 51060: {'lr': 0.00037625864786684364, 'samples': 9803520, 'steps': 51059, 'loss/train': 1.1877084970474243} 11/07/2021 04:25:25 - INFO - __main__ - Step 51061: {'lr': 0.00037625406759211346, 'samples': 9803712, 'steps': 51060, 'loss/train': 1.035565733909607} 11/07/2021 04:25:25 - INFO - __main__ - Step 51062: {'lr': 0.00037624948726049513, 'samples': 9803904, 'steps': 51061, 'loss/train': 1.4257559776306152} 11/07/2021 04:25:25 - INFO - __main__ - Step 51063: {'lr': 0.0003762449068719907, 'samples': 9804096, 'steps': 51062, 'loss/train': 1.0471153259277344} 11/07/2021 04:25:26 - INFO - __main__ - Step 51064: {'lr': 0.00037624032642660234, 'samples': 9804288, 'steps': 51063, 'loss/train': 1.2665207386016846} 11/07/2021 04:25:27 - INFO - __main__ - Step 51065: {'lr': 0.00037623574592433195, 'samples': 9804480, 'steps': 51064, 'loss/train': 1.2061656713485718} 11/07/2021 04:25:27 - INFO - __main__ - Step 51066: {'lr': 0.00037623116536518176, 'samples': 9804672, 'steps': 51065, 'loss/train': 0.8431649804115295} 11/07/2021 04:25:27 - INFO - __main__ - Step 51067: {'lr': 0.00037622658474915373, 'samples': 9804864, 'steps': 51066, 'loss/train': 1.7555702924728394} 11/07/2021 04:25:28 - INFO - __main__ - Step 51068: {'lr': 0.0003762220040762499, 'samples': 9805056, 'steps': 51067, 'loss/train': 1.3444541692733765} 11/07/2021 04:25:29 - INFO - __main__ - Step 51069: {'lr': 0.0003762174233464724, 'samples': 9805248, 'steps': 51068, 'loss/train': 1.6199666261672974} 11/07/2021 04:25:29 - INFO - __main__ - Step 51070: {'lr': 0.00037621284255982324, 'samples': 9805440, 'steps': 51069, 'loss/train': 1.1775859594345093} 11/07/2021 04:25:30 - INFO - __main__ - Step 51071: {'lr': 0.0003762082617163046, 'samples': 9805632, 'steps': 51070, 'loss/train': 1.2825325727462769} 11/07/2021 04:25:30 - INFO - __main__ - Step 51072: {'lr': 0.0003762036808159185, 'samples': 9805824, 'steps': 51071, 'loss/train': 1.4918760061264038} 11/07/2021 04:25:30 - INFO - __main__ - Step 51073: {'lr': 0.0003761990998586669, 'samples': 9806016, 'steps': 51072, 'loss/train': 1.979184627532959} 11/07/2021 04:25:31 - INFO - __main__ - Step 51074: {'lr': 0.0003761945188445519, 'samples': 9806208, 'steps': 51073, 'loss/train': 1.551345944404602} 11/07/2021 04:25:32 - INFO - __main__ - Step 51075: {'lr': 0.00037618993777357567, 'samples': 9806400, 'steps': 51074, 'loss/train': 1.4490280151367188} 11/07/2021 04:25:32 - INFO - __main__ - Step 51076: {'lr': 0.00037618535664574014, 'samples': 9806592, 'steps': 51075, 'loss/train': 1.307779312133789} 11/07/2021 04:25:32 - INFO - __main__ - Step 51077: {'lr': 0.0003761807754610475, 'samples': 9806784, 'steps': 51076, 'loss/train': 1.5131794214248657} 11/07/2021 04:25:33 - INFO - __main__ - Step 51078: {'lr': 0.0003761761942194997, 'samples': 9806976, 'steps': 51077, 'loss/train': 1.1490602493286133} 11/07/2021 04:25:34 - INFO - __main__ - Step 51079: {'lr': 0.00037617161292109887, 'samples': 9807168, 'steps': 51078, 'loss/train': 1.1951687335968018} 11/07/2021 04:25:34 - INFO - __main__ - Step 51080: {'lr': 0.0003761670315658471, 'samples': 9807360, 'steps': 51079, 'loss/train': 1.4678398370742798} 11/07/2021 04:25:34 - INFO - __main__ - Step 51081: {'lr': 0.0003761624501537463, 'samples': 9807552, 'steps': 51080, 'loss/train': 1.7070611715316772} 11/07/2021 04:25:35 - INFO - __main__ - Step 51082: {'lr': 0.00037615786868479875, 'samples': 9807744, 'steps': 51081, 'loss/train': 1.6293083429336548} 11/07/2021 04:25:35 - INFO - __main__ - Step 51083: {'lr': 0.0003761532871590063, 'samples': 9807936, 'steps': 51082, 'loss/train': 2.0467350482940674} 11/07/2021 04:25:36 - INFO - __main__ - Step 51084: {'lr': 0.0003761487055763713, 'samples': 9808128, 'steps': 51083, 'loss/train': 1.0840213298797607} 11/07/2021 04:25:37 - INFO - __main__ - Step 51085: {'lr': 0.0003761441239368955, 'samples': 9808320, 'steps': 51084, 'loss/train': 1.654410481452942} 11/07/2021 04:25:37 - INFO - __main__ - Step 51086: {'lr': 0.0003761395422405811, 'samples': 9808512, 'steps': 51085, 'loss/train': 1.5213103294372559} 11/07/2021 04:25:37 - INFO - __main__ - Step 51087: {'lr': 0.00037613496048743023, 'samples': 9808704, 'steps': 51086, 'loss/train': 2.194960832595825} 11/07/2021 04:25:38 - INFO - __main__ - Step 51088: {'lr': 0.00037613037867744494, 'samples': 9808896, 'steps': 51087, 'loss/train': 1.4452787637710571} 11/07/2021 04:25:39 - INFO - __main__ - Step 51089: {'lr': 0.00037612579681062713, 'samples': 9809088, 'steps': 51088, 'loss/train': 2.0537967681884766} 11/07/2021 04:25:39 - INFO - __main__ - Step 51090: {'lr': 0.000376121214886979, 'samples': 9809280, 'steps': 51089, 'loss/train': 1.2467275857925415} 11/07/2021 04:25:39 - INFO - __main__ - Step 51091: {'lr': 0.00037611663290650267, 'samples': 9809472, 'steps': 51090, 'loss/train': 1.3502446413040161} 11/07/2021 04:25:40 - INFO - __main__ - Step 51092: {'lr': 0.0003761120508692001, 'samples': 9809664, 'steps': 51091, 'loss/train': 1.2698564529418945} 11/07/2021 04:25:40 - INFO - __main__ - Step 51093: {'lr': 0.00037610746877507343, 'samples': 9809856, 'steps': 51092, 'loss/train': 1.6742361783981323} 11/07/2021 04:25:41 - INFO - __main__ - Step 51094: {'lr': 0.0003761028866241246, 'samples': 9810048, 'steps': 51093, 'loss/train': 1.6833693981170654} 11/07/2021 04:25:41 - INFO - __main__ - Step 51095: {'lr': 0.00037609830441635573, 'samples': 9810240, 'steps': 51094, 'loss/train': 1.6488521099090576} 11/07/2021 04:25:42 - INFO - __main__ - Step 51096: {'lr': 0.00037609372215176897, 'samples': 9810432, 'steps': 51095, 'loss/train': 1.3324769735336304} 11/07/2021 04:25:42 - INFO - __main__ - Step 51097: {'lr': 0.0003760891398303663, 'samples': 9810624, 'steps': 51096, 'loss/train': 1.8812788724899292} 11/07/2021 04:25:42 - INFO - __main__ - Step 51098: {'lr': 0.0003760845574521499, 'samples': 9810816, 'steps': 51097, 'loss/train': 1.2763627767562866} 11/07/2021 04:25:43 - INFO - __main__ - Step 51099: {'lr': 0.00037607997501712165, 'samples': 9811008, 'steps': 51098, 'loss/train': 1.563471794128418} 11/07/2021 04:25:44 - INFO - __main__ - Step 51100: {'lr': 0.0003760753925252838, 'samples': 9811200, 'steps': 51099, 'loss/train': 0.26753655076026917} 11/07/2021 04:25:44 - INFO - __main__ - Step 51101: {'lr': 0.0003760708099766382, 'samples': 9811392, 'steps': 51100, 'loss/train': 2.09893798828125} 11/07/2021 04:25:45 - INFO - __main__ - Step 51102: {'lr': 0.00037606622737118713, 'samples': 9811584, 'steps': 51101, 'loss/train': 1.1198339462280273} 11/07/2021 04:25:45 - INFO - __main__ - Step 51103: {'lr': 0.00037606164470893247, 'samples': 9811776, 'steps': 51102, 'loss/train': 1.3035331964492798} 11/07/2021 04:25:45 - INFO - __main__ - Step 51104: {'lr': 0.00037605706198987646, 'samples': 9811968, 'steps': 51103, 'loss/train': 0.9974083304405212} 11/07/2021 04:25:46 - INFO - __main__ - Step 51105: {'lr': 0.0003760524792140211, 'samples': 9812160, 'steps': 51104, 'loss/train': 1.4258350133895874} 11/07/2021 04:25:47 - INFO - __main__ - Step 51106: {'lr': 0.0003760478963813684, 'samples': 9812352, 'steps': 51105, 'loss/train': 1.657906413078308} 11/07/2021 04:25:47 - INFO - __main__ - Step 51107: {'lr': 0.00037604331349192047, 'samples': 9812544, 'steps': 51106, 'loss/train': 1.748923659324646} 11/07/2021 04:25:47 - INFO - __main__ - Step 51108: {'lr': 0.00037603873054567927, 'samples': 9812736, 'steps': 51107, 'loss/train': 1.3222150802612305} 11/07/2021 04:25:48 - INFO - __main__ - Step 51109: {'lr': 0.00037603414754264707, 'samples': 9812928, 'steps': 51108, 'loss/train': 1.42711341381073} 11/07/2021 04:25:49 - INFO - __main__ - Step 51110: {'lr': 0.00037602956448282577, 'samples': 9813120, 'steps': 51109, 'loss/train': 1.2570126056671143} 11/07/2021 04:25:49 - INFO - __main__ - Step 51111: {'lr': 0.00037602498136621754, 'samples': 9813312, 'steps': 51110, 'loss/train': 1.5092865228652954} 11/07/2021 04:25:50 - INFO - __main__ - Step 51112: {'lr': 0.00037602039819282444, 'samples': 9813504, 'steps': 51111, 'loss/train': 1.6016576290130615} 11/07/2021 04:25:50 - INFO - __main__ - Step 51113: {'lr': 0.00037601581496264847, 'samples': 9813696, 'steps': 51112, 'loss/train': 1.3010882139205933} 11/07/2021 04:25:50 - INFO - __main__ - Step 51114: {'lr': 0.0003760112316756917, 'samples': 9813888, 'steps': 51113, 'loss/train': 1.7741448879241943} 11/07/2021 04:25:51 - INFO - __main__ - Step 51115: {'lr': 0.0003760066483319562, 'samples': 9814080, 'steps': 51114, 'loss/train': 0.8605861067771912} 11/07/2021 04:25:52 - INFO - __main__ - Step 51116: {'lr': 0.000376002064931444, 'samples': 9814272, 'steps': 51115, 'loss/train': 1.5896106958389282} 11/07/2021 04:25:52 - INFO - __main__ - Step 51117: {'lr': 0.00037599748147415724, 'samples': 9814464, 'steps': 51116, 'loss/train': 1.5450607538223267} 11/07/2021 04:25:52 - INFO - __main__ - Step 51118: {'lr': 0.000375992897960098, 'samples': 9814656, 'steps': 51117, 'loss/train': 1.7376378774642944} 11/07/2021 04:25:53 - INFO - __main__ - Step 51119: {'lr': 0.0003759883143892683, 'samples': 9814848, 'steps': 51118, 'loss/train': 2.2202067375183105} 11/07/2021 04:25:53 - INFO - __main__ - Step 51120: {'lr': 0.00037598373076167023, 'samples': 9815040, 'steps': 51119, 'loss/train': 1.8037248849868774} 11/07/2021 04:25:54 - INFO - __main__ - Step 51121: {'lr': 0.0003759791470773058, 'samples': 9815232, 'steps': 51120, 'loss/train': 1.4552325010299683} 11/07/2021 04:25:54 - INFO - __main__ - Step 51122: {'lr': 0.0003759745633361771, 'samples': 9815424, 'steps': 51121, 'loss/train': 1.3235969543457031} 11/07/2021 04:25:55 - INFO - __main__ - Step 51123: {'lr': 0.0003759699795382863, 'samples': 9815616, 'steps': 51122, 'loss/train': 1.767889142036438} 11/07/2021 04:25:55 - INFO - __main__ - Step 51124: {'lr': 0.00037596539568363524, 'samples': 9815808, 'steps': 51123, 'loss/train': 1.190832257270813} 11/07/2021 04:25:56 - INFO - __main__ - Step 51125: {'lr': 0.0003759608117722262, 'samples': 9816000, 'steps': 51124, 'loss/train': 1.942044734954834} 11/07/2021 04:25:57 - INFO - __main__ - Step 51126: {'lr': 0.00037595622780406114, 'samples': 9816192, 'steps': 51125, 'loss/train': 1.6755667924880981} 11/07/2021 04:25:57 - INFO - __main__ - Step 51127: {'lr': 0.0003759516437791421, 'samples': 9816384, 'steps': 51126, 'loss/train': 5.812788963317871} 11/07/2021 04:25:57 - INFO - __main__ - Step 51128: {'lr': 0.0003759470596974712, 'samples': 9816576, 'steps': 51127, 'loss/train': 1.7180052995681763} 11/07/2021 04:25:58 - INFO - __main__ - Step 51129: {'lr': 0.0003759424755590505, 'samples': 9816768, 'steps': 51128, 'loss/train': 1.8356326818466187} 11/07/2021 04:25:58 - INFO - __main__ - Step 51130: {'lr': 0.0003759378913638822, 'samples': 9816960, 'steps': 51129, 'loss/train': 1.433469295501709} 11/07/2021 04:25:58 - INFO - __main__ - Step 51131: {'lr': 0.0003759333071119681, 'samples': 9817152, 'steps': 51130, 'loss/train': 1.8264704942703247} 11/07/2021 04:26:00 - INFO - __main__ - Step 51132: {'lr': 0.0003759287228033104, 'samples': 9817344, 'steps': 51131, 'loss/train': 1.4732524156570435} 11/07/2021 04:26:00 - INFO - __main__ - Step 51133: {'lr': 0.0003759241384379112, 'samples': 9817536, 'steps': 51132, 'loss/train': 1.5256009101867676} 11/07/2021 04:26:01 - INFO - __main__ - Step 51134: {'lr': 0.0003759195540157725, 'samples': 9817728, 'steps': 51133, 'loss/train': 5.581474304199219} 11/07/2021 04:26:01 - INFO - __main__ - Step 51135: {'lr': 0.00037591496953689644, 'samples': 9817920, 'steps': 51134, 'loss/train': 5.611433982849121} 11/07/2021 04:26:01 - INFO - __main__ - Step 51136: {'lr': 0.00037591038500128495, 'samples': 9818112, 'steps': 51135, 'loss/train': 1.5781525373458862} 11/07/2021 04:26:02 - INFO - __main__ - Step 51137: {'lr': 0.00037590580040894024, 'samples': 9818304, 'steps': 51136, 'loss/train': 1.3154970407485962} 11/07/2021 04:26:03 - INFO - __main__ - Step 51138: {'lr': 0.0003759012157598643, 'samples': 9818496, 'steps': 51137, 'loss/train': 1.8281892538070679} 11/07/2021 04:26:03 - INFO - __main__ - Step 51139: {'lr': 0.00037589663105405924, 'samples': 9818688, 'steps': 51138, 'loss/train': 1.3921667337417603} 11/07/2021 04:26:03 - INFO - __main__ - Step 51140: {'lr': 0.00037589204629152705, 'samples': 9818880, 'steps': 51139, 'loss/train': 1.0147649049758911} 11/07/2021 04:26:04 - INFO - __main__ - Step 51141: {'lr': 0.00037588746147226994, 'samples': 9819072, 'steps': 51140, 'loss/train': 1.3558980226516724} 11/07/2021 04:26:04 - INFO - __main__ - Step 51142: {'lr': 0.00037588287659628977, 'samples': 9819264, 'steps': 51141, 'loss/train': 1.4923806190490723} 11/07/2021 04:26:05 - INFO - __main__ - Step 51143: {'lr': 0.0003758782916635888, 'samples': 9819456, 'steps': 51142, 'loss/train': 1.5209615230560303} 11/07/2021 04:26:05 - INFO - __main__ - Step 51144: {'lr': 0.000375873706674169, 'samples': 9819648, 'steps': 51143, 'loss/train': 1.5310009717941284} 11/07/2021 04:26:06 - INFO - __main__ - Step 51145: {'lr': 0.0003758691216280324, 'samples': 9819840, 'steps': 51144, 'loss/train': 1.2682783603668213} 11/07/2021 04:26:06 - INFO - __main__ - Step 51146: {'lr': 0.00037586453652518117, 'samples': 9820032, 'steps': 51145, 'loss/train': 1.249147891998291} 11/07/2021 04:26:06 - INFO - __main__ - Step 51147: {'lr': 0.00037585995136561734, 'samples': 9820224, 'steps': 51146, 'loss/train': 1.5766587257385254} 11/07/2021 04:26:07 - INFO - __main__ - Step 51148: {'lr': 0.0003758553661493429, 'samples': 9820416, 'steps': 51147, 'loss/train': 1.3657782077789307} 11/07/2021 04:26:08 - INFO - __main__ - Step 51149: {'lr': 0.00037585078087635994, 'samples': 9820608, 'steps': 51148, 'loss/train': 1.239439845085144} 11/07/2021 04:26:08 - INFO - __main__ - Step 51150: {'lr': 0.00037584619554667065, 'samples': 9820800, 'steps': 51149, 'loss/train': 1.6626681089401245} 11/07/2021 04:26:09 - INFO - __main__ - Step 51151: {'lr': 0.000375841610160277, 'samples': 9820992, 'steps': 51150, 'loss/train': 1.2145030498504639} 11/07/2021 04:26:09 - INFO - __main__ - Step 51152: {'lr': 0.00037583702471718106, 'samples': 9821184, 'steps': 51151, 'loss/train': 1.277573823928833} 11/07/2021 04:26:09 - INFO - __main__ - Step 51153: {'lr': 0.00037583243921738484, 'samples': 9821376, 'steps': 51152, 'loss/train': 1.323180079460144} 11/07/2021 04:26:10 - INFO - __main__ - Step 51154: {'lr': 0.0003758278536608905, 'samples': 9821568, 'steps': 51153, 'loss/train': 1.5218793153762817} 11/07/2021 04:26:11 - INFO - __main__ - Step 51155: {'lr': 0.00037582326804770004, 'samples': 9821760, 'steps': 51154, 'loss/train': 1.1672931909561157} 11/07/2021 04:26:11 - INFO - __main__ - Step 51156: {'lr': 0.0003758186823778156, 'samples': 9821952, 'steps': 51155, 'loss/train': 1.6178406476974487} 11/07/2021 04:26:11 - INFO - __main__ - Step 51157: {'lr': 0.0003758140966512392, 'samples': 9822144, 'steps': 51156, 'loss/train': 1.4110069274902344} 11/07/2021 04:26:12 - INFO - __main__ - Step 51158: {'lr': 0.0003758095108679729, 'samples': 9822336, 'steps': 51157, 'loss/train': 1.5689750909805298} 11/07/2021 04:26:13 - INFO - __main__ - Step 51159: {'lr': 0.0003758049250280188, 'samples': 9822528, 'steps': 51158, 'loss/train': 1.546869158744812} 11/07/2021 04:26:13 - INFO - __main__ - Step 51160: {'lr': 0.0003758003391313789, 'samples': 9822720, 'steps': 51159, 'loss/train': 1.3399982452392578} 11/07/2021 04:26:13 - INFO - __main__ - Step 51161: {'lr': 0.00037579575317805525, 'samples': 9822912, 'steps': 51160, 'loss/train': 1.4853978157043457} 11/07/2021 04:26:14 - INFO - __main__ - Step 51162: {'lr': 0.00037579116716805007, 'samples': 9823104, 'steps': 51161, 'loss/train': 1.910470962524414} 11/07/2021 04:26:14 - INFO - __main__ - Step 51163: {'lr': 0.00037578658110136535, 'samples': 9823296, 'steps': 51162, 'loss/train': 1.8178781270980835} 11/07/2021 04:26:15 - INFO - __main__ - Step 51164: {'lr': 0.00037578199497800304, 'samples': 9823488, 'steps': 51163, 'loss/train': 1.254246711730957} 11/07/2021 04:26:16 - INFO - __main__ - Step 51165: {'lr': 0.0003757774087979654, 'samples': 9823680, 'steps': 51164, 'loss/train': 1.3284828662872314} 11/07/2021 04:26:16 - INFO - __main__ - Step 51166: {'lr': 0.0003757728225612543, 'samples': 9823872, 'steps': 51165, 'loss/train': 1.1570911407470703} 11/07/2021 04:26:16 - INFO - __main__ - Step 51167: {'lr': 0.00037576823626787203, 'samples': 9824064, 'steps': 51166, 'loss/train': 1.0579010248184204} 11/07/2021 04:26:17 - INFO - __main__ - Step 51168: {'lr': 0.00037576364991782045, 'samples': 9824256, 'steps': 51167, 'loss/train': 1.2708224058151245} 11/07/2021 04:26:18 - INFO - __main__ - Step 51169: {'lr': 0.00037575906351110174, 'samples': 9824448, 'steps': 51168, 'loss/train': 1.0911803245544434} 11/07/2021 04:26:18 - INFO - __main__ - Step 51170: {'lr': 0.0003757544770477179, 'samples': 9824640, 'steps': 51169, 'loss/train': 1.6708488464355469} 11/07/2021 04:26:18 - INFO - __main__ - Step 51171: {'lr': 0.00037574989052767106, 'samples': 9824832, 'steps': 51170, 'loss/train': 1.4156115055084229} 11/07/2021 04:26:19 - INFO - __main__ - Step 51172: {'lr': 0.0003757453039509633, 'samples': 9825024, 'steps': 51171, 'loss/train': 1.4718499183654785} 11/07/2021 04:26:19 - INFO - __main__ - Step 51173: {'lr': 0.0003757407173175966, 'samples': 9825216, 'steps': 51172, 'loss/train': 1.422802448272705} 11/07/2021 04:26:20 - INFO - __main__ - Step 51174: {'lr': 0.00037573613062757304, 'samples': 9825408, 'steps': 51173, 'loss/train': 1.4421409368515015} 11/07/2021 04:26:20 - INFO - __main__ - Step 51175: {'lr': 0.00037573154388089483, 'samples': 9825600, 'steps': 51174, 'loss/train': 1.019473910331726} 11/07/2021 04:26:21 - INFO - __main__ - Step 51176: {'lr': 0.00037572695707756385, 'samples': 9825792, 'steps': 51175, 'loss/train': 1.5388219356536865} 11/07/2021 04:26:21 - INFO - __main__ - Step 51177: {'lr': 0.0003757223702175822, 'samples': 9825984, 'steps': 51176, 'loss/train': 1.487369179725647} 11/07/2021 04:26:22 - INFO - __main__ - Step 51178: {'lr': 0.00037571778330095206, 'samples': 9826176, 'steps': 51177, 'loss/train': 0.8048746585845947} 11/07/2021 04:26:22 - INFO - __main__ - Step 51179: {'lr': 0.00037571319632767543, 'samples': 9826368, 'steps': 51178, 'loss/train': 1.4266879558563232} 11/07/2021 04:26:23 - INFO - __main__ - Step 51180: {'lr': 0.0003757086092977544, 'samples': 9826560, 'steps': 51179, 'loss/train': 1.2909984588623047} 11/07/2021 04:26:23 - INFO - __main__ - Step 51181: {'lr': 0.00037570402221119093, 'samples': 9826752, 'steps': 51180, 'loss/train': 1.4604463577270508} 11/07/2021 04:26:24 - INFO - __main__ - Step 51182: {'lr': 0.0003756994350679872, 'samples': 9826944, 'steps': 51181, 'loss/train': 1.4334055185317993} 11/07/2021 04:26:24 - INFO - __main__ - Step 51183: {'lr': 0.00037569484786814525, 'samples': 9827136, 'steps': 51182, 'loss/train': 1.5371993780136108} 11/07/2021 04:26:24 - INFO - __main__ - Step 51184: {'lr': 0.0003756902606116671, 'samples': 9827328, 'steps': 51183, 'loss/train': 1.7246803045272827} 11/07/2021 04:26:26 - INFO - __main__ - Step 51185: {'lr': 0.00037568567329855483, 'samples': 9827520, 'steps': 51184, 'loss/train': 1.031441330909729} 11/07/2021 04:26:26 - INFO - __main__ - Step 51186: {'lr': 0.00037568108592881067, 'samples': 9827712, 'steps': 51185, 'loss/train': 0.8346840739250183} 11/07/2021 04:26:26 - INFO - __main__ - Step 51187: {'lr': 0.00037567649850243646, 'samples': 9827904, 'steps': 51186, 'loss/train': 1.7006884813308716} 11/07/2021 04:26:27 - INFO - __main__ - Step 51188: {'lr': 0.00037567191101943437, 'samples': 9828096, 'steps': 51187, 'loss/train': 1.8177293539047241} 11/07/2021 04:26:27 - INFO - __main__ - Step 51189: {'lr': 0.00037566732347980647, 'samples': 9828288, 'steps': 51188, 'loss/train': 1.2582699060440063} 11/07/2021 04:26:28 - INFO - __main__ - Step 51190: {'lr': 0.0003756627358835548, 'samples': 9828480, 'steps': 51189, 'loss/train': 1.6479040384292603} 11/07/2021 04:26:28 - INFO - __main__ - Step 51191: {'lr': 0.00037565814823068143, 'samples': 9828672, 'steps': 51190, 'loss/train': 1.7629849910736084} 11/07/2021 04:26:29 - INFO - __main__ - Step 51192: {'lr': 0.0003756535605211885, 'samples': 9828864, 'steps': 51191, 'loss/train': 0.8963702917098999} 11/07/2021 04:26:29 - INFO - __main__ - Step 51193: {'lr': 0.000375648972755078, 'samples': 9829056, 'steps': 51192, 'loss/train': 1.5291481018066406} 11/07/2021 04:26:29 - INFO - __main__ - Step 51194: {'lr': 0.00037564438493235195, 'samples': 9829248, 'steps': 51193, 'loss/train': 1.6048918962478638} 11/07/2021 04:26:31 - INFO - __main__ - Step 51195: {'lr': 0.0003756397970530125, 'samples': 9829440, 'steps': 51194, 'loss/train': 1.336006999015808} 11/07/2021 04:26:31 - INFO - __main__ - Step 51196: {'lr': 0.00037563520911706175, 'samples': 9829632, 'steps': 51195, 'loss/train': 1.1494961977005005} 11/07/2021 04:26:32 - INFO - __main__ - Step 51197: {'lr': 0.0003756306211245016, 'samples': 9829824, 'steps': 51196, 'loss/train': 1.3077516555786133} 11/07/2021 04:26:32 - INFO - __main__ - Step 51198: {'lr': 0.0003756260330753343, 'samples': 9830016, 'steps': 51197, 'loss/train': 1.2972228527069092} 11/07/2021 04:26:32 - INFO - __main__ - Step 51199: {'lr': 0.00037562144496956193, 'samples': 9830208, 'steps': 51198, 'loss/train': 0.075208380818367} 11/07/2021 04:26:33 - INFO - __main__ - Step 51200: {'lr': 0.0003756168568071864, 'samples': 9830400, 'steps': 51199, 'loss/train': 1.175175666809082} 11/07/2021 04:26:34 - INFO - __main__ - Step 51201: {'lr': 0.0003756122685882098, 'samples': 9830592, 'steps': 51200, 'loss/train': 1.2902253866195679} 11/07/2021 04:26:34 - INFO - __main__ - Step 51202: {'lr': 0.00037560768031263427, 'samples': 9830784, 'steps': 51201, 'loss/train': 1.6287318468093872} 11/07/2021 04:26:34 - INFO - __main__ - Step 51203: {'lr': 0.0003756030919804619, 'samples': 9830976, 'steps': 51202, 'loss/train': 1.1665724515914917} 11/07/2021 04:26:35 - INFO - __main__ - Step 51204: {'lr': 0.00037559850359169465, 'samples': 9831168, 'steps': 51203, 'loss/train': 1.0926939249038696} 11/07/2021 04:26:35 - INFO - __main__ - Step 51205: {'lr': 0.0003755939151463347, 'samples': 9831360, 'steps': 51204, 'loss/train': 1.5217796564102173} 11/07/2021 04:26:36 - INFO - __main__ - Step 51206: {'lr': 0.0003755893266443842, 'samples': 9831552, 'steps': 51205, 'loss/train': 1.3197687864303589} 11/07/2021 04:26:37 - INFO - __main__ - Step 51207: {'lr': 0.0003755847380858449, 'samples': 9831744, 'steps': 51206, 'loss/train': 1.1206849813461304} 11/07/2021 04:26:37 - INFO - __main__ - Step 51208: {'lr': 0.0003755801494707191, 'samples': 9831936, 'steps': 51207, 'loss/train': 1.1840324401855469} 11/07/2021 04:26:37 - INFO - __main__ - Step 51209: {'lr': 0.00037557556079900886, 'samples': 9832128, 'steps': 51208, 'loss/train': 1.5369545221328735} 11/07/2021 04:26:38 - INFO - __main__ - Step 51210: {'lr': 0.0003755709720707161, 'samples': 9832320, 'steps': 51209, 'loss/train': 1.1344716548919678} 11/07/2021 04:26:39 - INFO - __main__ - Step 51211: {'lr': 0.00037556638328584314, 'samples': 9832512, 'steps': 51210, 'loss/train': 1.7113068103790283} 11/07/2021 04:26:39 - INFO - __main__ - Step 51212: {'lr': 0.0003755617944443919, 'samples': 9832704, 'steps': 51211, 'loss/train': 1.5996965169906616} 11/07/2021 04:26:39 - INFO - __main__ - Step 51213: {'lr': 0.00037555720554636443, 'samples': 9832896, 'steps': 51212, 'loss/train': 1.3318369388580322} 11/07/2021 04:26:40 - INFO - __main__ - Step 51214: {'lr': 0.00037555261659176275, 'samples': 9833088, 'steps': 51213, 'loss/train': 1.6860949993133545} 11/07/2021 04:26:40 - INFO - __main__ - Step 51215: {'lr': 0.00037554802758058903, 'samples': 9833280, 'steps': 51214, 'loss/train': 1.3410645723342896} 11/07/2021 04:26:41 - INFO - __main__ - Step 51216: {'lr': 0.0003755434385128453, 'samples': 9833472, 'steps': 51215, 'loss/train': 1.4011985063552856} 11/07/2021 04:26:41 - INFO - __main__ - Step 51217: {'lr': 0.00037553884938853365, 'samples': 9833664, 'steps': 51216, 'loss/train': 1.1416559219360352} 11/07/2021 04:26:42 - INFO - __main__ - Step 51218: {'lr': 0.0003755342602076561, 'samples': 9833856, 'steps': 51217, 'loss/train': 1.0293290615081787} 11/07/2021 04:26:42 - INFO - __main__ - Step 51219: {'lr': 0.0003755296709702148, 'samples': 9834048, 'steps': 51218, 'loss/train': 1.7509585618972778} 11/07/2021 04:26:42 - INFO - __main__ - Step 51220: {'lr': 0.0003755250816762118, 'samples': 9834240, 'steps': 51219, 'loss/train': 1.5241022109985352} 11/07/2021 04:26:44 - INFO - __main__ - Step 51221: {'lr': 0.00037552049232564906, 'samples': 9834432, 'steps': 51220, 'loss/train': 1.921941876411438} 11/07/2021 04:26:44 - INFO - __main__ - Step 51222: {'lr': 0.0003755159029185288, 'samples': 9834624, 'steps': 51221, 'loss/train': 1.2071889638900757} 11/07/2021 04:26:44 - INFO - __main__ - Step 51223: {'lr': 0.0003755113134548529, 'samples': 9834816, 'steps': 51222, 'loss/train': 1.339905858039856} 11/07/2021 04:26:45 - INFO - __main__ - Step 51224: {'lr': 0.00037550672393462357, 'samples': 9835008, 'steps': 51223, 'loss/train': 1.5776480436325073} 11/07/2021 04:26:45 - INFO - __main__ - Step 51225: {'lr': 0.0003755021343578429, 'samples': 9835200, 'steps': 51224, 'loss/train': 1.6987099647521973} 11/07/2021 04:26:46 - INFO - __main__ - Step 51226: {'lr': 0.0003754975447245129, 'samples': 9835392, 'steps': 51225, 'loss/train': 1.362648844718933} 11/07/2021 04:26:46 - INFO - __main__ - Step 51227: {'lr': 0.00037549295503463563, 'samples': 9835584, 'steps': 51226, 'loss/train': 1.7931948900222778} 11/07/2021 04:26:47 - INFO - __main__ - Step 51228: {'lr': 0.0003754883652882132, 'samples': 9835776, 'steps': 51227, 'loss/train': 1.7281259298324585} 11/07/2021 04:26:47 - INFO - __main__ - Step 51229: {'lr': 0.00037548377548524755, 'samples': 9835968, 'steps': 51228, 'loss/train': 1.7389373779296875} 11/07/2021 04:26:47 - INFO - __main__ - Step 51230: {'lr': 0.0003754791856257409, 'samples': 9836160, 'steps': 51229, 'loss/train': 1.3119885921478271} 11/07/2021 04:26:48 - INFO - __main__ - Step 51231: {'lr': 0.00037547459570969527, 'samples': 9836352, 'steps': 51230, 'loss/train': 1.4407776594161987} 11/07/2021 04:26:49 - INFO - __main__ - Step 51232: {'lr': 0.0003754700057371127, 'samples': 9836544, 'steps': 51231, 'loss/train': 1.476395606994629} 11/07/2021 04:26:49 - INFO - __main__ - Step 51233: {'lr': 0.0003754654157079954, 'samples': 9836736, 'steps': 51232, 'loss/train': 1.6063034534454346} 11/07/2021 04:26:49 - INFO - __main__ - Step 51234: {'lr': 0.00037546082562234516, 'samples': 9836928, 'steps': 51233, 'loss/train': 1.552552580833435} 11/07/2021 04:26:50 - INFO - __main__ - Step 51235: {'lr': 0.00037545623548016426, 'samples': 9837120, 'steps': 51234, 'loss/train': 1.502600073814392} 11/07/2021 04:26:51 - INFO - __main__ - Step 51236: {'lr': 0.00037545164528145474, 'samples': 9837312, 'steps': 51235, 'loss/train': 1.330193281173706} 11/07/2021 04:26:51 - INFO - __main__ - Step 51237: {'lr': 0.00037544705502621866, 'samples': 9837504, 'steps': 51236, 'loss/train': 1.2627513408660889} 11/07/2021 04:26:52 - INFO - __main__ - Step 51238: {'lr': 0.000375442464714458, 'samples': 9837696, 'steps': 51237, 'loss/train': 2.0160419940948486} 11/07/2021 04:26:52 - INFO - __main__ - Step 51239: {'lr': 0.000375437874346175, 'samples': 9837888, 'steps': 51238, 'loss/train': 1.600635051727295} 11/07/2021 04:26:52 - INFO - __main__ - Step 51240: {'lr': 0.0003754332839213716, 'samples': 9838080, 'steps': 51239, 'loss/train': 1.4182648658752441} 11/07/2021 04:26:53 - INFO - __main__ - Step 51241: {'lr': 0.00037542869344004987, 'samples': 9838272, 'steps': 51240, 'loss/train': 1.7461457252502441} 11/07/2021 04:26:54 - INFO - __main__ - Step 51242: {'lr': 0.0003754241029022119, 'samples': 9838464, 'steps': 51241, 'loss/train': 1.2371352910995483} 11/07/2021 04:26:54 - INFO - __main__ - Step 51243: {'lr': 0.00037541951230785975, 'samples': 9838656, 'steps': 51242, 'loss/train': 1.3465973138809204} 11/07/2021 04:26:54 - INFO - __main__ - Step 51244: {'lr': 0.00037541492165699554, 'samples': 9838848, 'steps': 51243, 'loss/train': 1.5332742929458618} 11/07/2021 04:26:55 - INFO - __main__ - Step 51245: {'lr': 0.0003754103309496213, 'samples': 9839040, 'steps': 51244, 'loss/train': 1.625928521156311} 11/07/2021 04:26:55 - INFO - __main__ - Step 51246: {'lr': 0.00037540574018573913, 'samples': 9839232, 'steps': 51245, 'loss/train': 1.4563348293304443} 11/07/2021 04:26:56 - INFO - __main__ - Step 51247: {'lr': 0.00037540114936535107, 'samples': 9839424, 'steps': 51246, 'loss/train': 0.9256128668785095} 11/07/2021 04:26:56 - INFO - __main__ - Step 51248: {'lr': 0.0003753965584884591, 'samples': 9839616, 'steps': 51247, 'loss/train': 1.4945613145828247} 11/07/2021 04:26:57 - INFO - __main__ - Step 51249: {'lr': 0.00037539196755506546, 'samples': 9839808, 'steps': 51248, 'loss/train': 1.4203832149505615} 11/07/2021 04:26:57 - INFO - __main__ - Step 51250: {'lr': 0.0003753873765651721, 'samples': 9840000, 'steps': 51249, 'loss/train': 1.7863932847976685} 11/07/2021 04:26:57 - INFO - __main__ - Step 51251: {'lr': 0.0003753827855187811, 'samples': 9840192, 'steps': 51250, 'loss/train': 1.2520095109939575} 11/07/2021 04:26:59 - INFO - __main__ - Step 51252: {'lr': 0.00037537819441589457, 'samples': 9840384, 'steps': 51251, 'loss/train': 1.857222318649292} 11/07/2021 04:26:59 - INFO - __main__ - Step 51253: {'lr': 0.0003753736032565146, 'samples': 9840576, 'steps': 51252, 'loss/train': 1.4762943983078003} 11/07/2021 04:26:59 - INFO - __main__ - Step 51254: {'lr': 0.0003753690120406432, 'samples': 9840768, 'steps': 51253, 'loss/train': 1.2805707454681396} 11/07/2021 04:27:00 - INFO - __main__ - Step 51255: {'lr': 0.00037536442076828235, 'samples': 9840960, 'steps': 51254, 'loss/train': 1.5488632917404175} 11/07/2021 04:27:00 - INFO - __main__ - Step 51256: {'lr': 0.00037535982943943437, 'samples': 9841152, 'steps': 51255, 'loss/train': 1.2295396327972412} 11/07/2021 04:27:00 - INFO - __main__ - Step 51257: {'lr': 0.0003753552380541011, 'samples': 9841344, 'steps': 51256, 'loss/train': 1.6443824768066406} 11/07/2021 04:27:02 - INFO - __main__ - Step 51258: {'lr': 0.00037535064661228476, 'samples': 9841536, 'steps': 51257, 'loss/train': 1.77608060836792} 11/07/2021 04:27:02 - INFO - __main__ - Step 51259: {'lr': 0.00037534605511398736, 'samples': 9841728, 'steps': 51258, 'loss/train': 1.365010380744934} 11/07/2021 04:27:02 - INFO - __main__ - Step 51260: {'lr': 0.0003753414635592109, 'samples': 9841920, 'steps': 51259, 'loss/train': 1.306699514389038} 11/07/2021 04:27:03 - INFO - __main__ - Step 51261: {'lr': 0.0003753368719479575, 'samples': 9842112, 'steps': 51260, 'loss/train': 1.4501153230667114} 11/07/2021 04:27:03 - INFO - __main__ - Step 51262: {'lr': 0.00037533228028022923, 'samples': 9842304, 'steps': 51261, 'loss/train': 0.8957582116127014} 11/07/2021 04:27:04 - INFO - __main__ - Step 51263: {'lr': 0.0003753276885560283, 'samples': 9842496, 'steps': 51262, 'loss/train': 1.4621988534927368} 11/07/2021 04:27:04 - INFO - __main__ - Step 51264: {'lr': 0.0003753230967753566, 'samples': 9842688, 'steps': 51263, 'loss/train': 1.0747171640396118} 11/07/2021 04:27:05 - INFO - __main__ - Step 51265: {'lr': 0.00037531850493821616, 'samples': 9842880, 'steps': 51264, 'loss/train': 1.7383825778961182} 11/07/2021 04:27:05 - INFO - __main__ - Step 51266: {'lr': 0.00037531391304460916, 'samples': 9843072, 'steps': 51265, 'loss/train': 1.8504222631454468} 11/07/2021 04:27:05 - INFO - __main__ - Step 51267: {'lr': 0.00037530932109453767, 'samples': 9843264, 'steps': 51266, 'loss/train': 0.8919113874435425} 11/07/2021 04:27:06 - INFO - __main__ - Step 51268: {'lr': 0.00037530472908800375, 'samples': 9843456, 'steps': 51267, 'loss/train': 1.6534337997436523} 11/07/2021 04:27:07 - INFO - __main__ - Step 51269: {'lr': 0.0003753001370250094, 'samples': 9843648, 'steps': 51268, 'loss/train': 1.3912287950515747} 11/07/2021 04:27:07 - INFO - __main__ - Step 51270: {'lr': 0.00037529554490555686, 'samples': 9843840, 'steps': 51269, 'loss/train': 1.5490639209747314} 11/07/2021 04:27:07 - INFO - __main__ - Step 51271: {'lr': 0.00037529095272964796, 'samples': 9844032, 'steps': 51270, 'loss/train': 0.9925702810287476} 11/07/2021 04:27:08 - INFO - __main__ - Step 51272: {'lr': 0.0003752863604972849, 'samples': 9844224, 'steps': 51271, 'loss/train': 1.397322177886963} 11/07/2021 04:27:09 - INFO - __main__ - Step 51273: {'lr': 0.00037528176820846975, 'samples': 9844416, 'steps': 51272, 'loss/train': 1.414917230606079} 11/07/2021 04:27:09 - INFO - __main__ - Step 51274: {'lr': 0.00037527717586320457, 'samples': 9844608, 'steps': 51273, 'loss/train': 1.25178062915802} 11/07/2021 04:27:10 - INFO - __main__ - Step 51275: {'lr': 0.00037527258346149153, 'samples': 9844800, 'steps': 51274, 'loss/train': 1.7030956745147705} 11/07/2021 04:27:10 - INFO - __main__ - Step 51276: {'lr': 0.0003752679910033325, 'samples': 9844992, 'steps': 51275, 'loss/train': 1.7786269187927246} 11/07/2021 04:27:10 - INFO - __main__ - Step 51277: {'lr': 0.00037526339848872956, 'samples': 9845184, 'steps': 51276, 'loss/train': 1.3212450742721558} 11/07/2021 04:27:11 - INFO - __main__ - Step 51278: {'lr': 0.000375258805917685, 'samples': 9845376, 'steps': 51277, 'loss/train': 1.3959523439407349} 11/07/2021 04:27:12 - INFO - __main__ - Step 51279: {'lr': 0.0003752542132902007, 'samples': 9845568, 'steps': 51278, 'loss/train': 1.5075372457504272} 11/07/2021 04:27:12 - INFO - __main__ - Step 51280: {'lr': 0.00037524962060627885, 'samples': 9845760, 'steps': 51279, 'loss/train': 1.4516735076904297} 11/07/2021 04:27:12 - INFO - __main__ - Step 51281: {'lr': 0.0003752450278659214, 'samples': 9845952, 'steps': 51280, 'loss/train': 1.4751049280166626} 11/07/2021 04:27:13 - INFO - __main__ - Step 51282: {'lr': 0.00037524043506913045, 'samples': 9846144, 'steps': 51281, 'loss/train': 1.726582646369934} 11/07/2021 04:27:13 - INFO - __main__ - Step 51283: {'lr': 0.0003752358422159081, 'samples': 9846336, 'steps': 51282, 'loss/train': 1.9660826921463013} 11/07/2021 04:27:14 - INFO - __main__ - Step 51284: {'lr': 0.0003752312493062564, 'samples': 9846528, 'steps': 51283, 'loss/train': 1.5405163764953613} 11/07/2021 04:27:14 - INFO - __main__ - Step 51285: {'lr': 0.0003752266563401775, 'samples': 9846720, 'steps': 51284, 'loss/train': 1.2178876399993896} 11/07/2021 04:27:15 - INFO - __main__ - Step 51286: {'lr': 0.00037522206331767335, 'samples': 9846912, 'steps': 51285, 'loss/train': 1.6081781387329102} 11/07/2021 04:27:15 - INFO - __main__ - Step 51287: {'lr': 0.00037521747023874606, 'samples': 9847104, 'steps': 51286, 'loss/train': 1.0516690015792847} 11/07/2021 04:27:15 - INFO - __main__ - Step 51288: {'lr': 0.0003752128771033978, 'samples': 9847296, 'steps': 51287, 'loss/train': 1.0194778442382812} 11/07/2021 04:27:17 - INFO - __main__ - Step 51289: {'lr': 0.0003752082839116304, 'samples': 9847488, 'steps': 51288, 'loss/train': 1.3066163063049316} 11/07/2021 04:27:17 - INFO - __main__ - Step 51290: {'lr': 0.0003752036906634462, 'samples': 9847680, 'steps': 51289, 'loss/train': 1.0381736755371094} 11/07/2021 04:27:17 - INFO - __main__ - Step 51291: {'lr': 0.0003751990973588471, 'samples': 9847872, 'steps': 51290, 'loss/train': 2.442082166671753} 11/07/2021 04:27:18 - INFO - __main__ - Step 51292: {'lr': 0.0003751945039978353, 'samples': 9848064, 'steps': 51291, 'loss/train': 1.7478818893432617} 11/07/2021 04:27:18 - INFO - __main__ - Step 51293: {'lr': 0.00037518991058041267, 'samples': 9848256, 'steps': 51292, 'loss/train': 1.6824642419815063} 11/07/2021 04:27:20 - INFO - __main__ - Step 51294: {'lr': 0.00037518531710658144, 'samples': 9848448, 'steps': 51293, 'loss/train': 1.36592435836792} 11/07/2021 04:27:20 - INFO - __main__ - Step 51295: {'lr': 0.0003751807235763437, 'samples': 9848640, 'steps': 51294, 'loss/train': 1.5273423194885254} 11/07/2021 04:27:21 - INFO - __main__ - Step 51296: {'lr': 0.00037517612998970136, 'samples': 9848832, 'steps': 51295, 'loss/train': 1.6209473609924316} 11/07/2021 04:27:21 - INFO - __main__ - Step 51297: {'lr': 0.00037517153634665664, 'samples': 9849024, 'steps': 51296, 'loss/train': 1.7631850242614746} 11/07/2021 04:27:21 - INFO - __main__ - Step 51298: {'lr': 0.0003751669426472115, 'samples': 9849216, 'steps': 51297, 'loss/train': 1.006211280822754} 11/07/2021 04:27:22 - INFO - __main__ - Step 51299: {'lr': 0.0003751623488913681, 'samples': 9849408, 'steps': 51298, 'loss/train': 1.5204555988311768} 11/07/2021 04:27:22 - INFO - __main__ - Step 51300: {'lr': 0.00037515775507912855, 'samples': 9849600, 'steps': 51299, 'loss/train': 1.7774275541305542} 11/07/2021 04:27:23 - INFO - __main__ - Step 51301: {'lr': 0.0003751531612104948, 'samples': 9849792, 'steps': 51300, 'loss/train': 0.8453087210655212} 11/07/2021 04:27:24 - INFO - __main__ - Step 51302: {'lr': 0.00037514856728546893, 'samples': 9849984, 'steps': 51301, 'loss/train': 1.785905122756958} 11/07/2021 04:27:24 - INFO - __main__ - Step 51303: {'lr': 0.00037514397330405306, 'samples': 9850176, 'steps': 51302, 'loss/train': 1.390616536140442} 11/07/2021 04:27:24 - INFO - __main__ - Step 51304: {'lr': 0.00037513937926624924, 'samples': 9850368, 'steps': 51303, 'loss/train': 1.6205112934112549} 11/07/2021 04:27:25 - INFO - __main__ - Step 51305: {'lr': 0.0003751347851720596, 'samples': 9850560, 'steps': 51304, 'loss/train': 1.3488975763320923} 11/07/2021 04:27:25 - INFO - __main__ - Step 51306: {'lr': 0.00037513019102148606, 'samples': 9850752, 'steps': 51305, 'loss/train': 1.1012285947799683} 11/07/2021 04:27:26 - INFO - __main__ - Step 51307: {'lr': 0.0003751255968145309, 'samples': 9850944, 'steps': 51306, 'loss/train': 1.2730046510696411} 11/07/2021 04:27:26 - INFO - __main__ - Step 51308: {'lr': 0.00037512100255119603, 'samples': 9851136, 'steps': 51307, 'loss/train': 1.618487000465393} 11/07/2021 04:27:27 - INFO - __main__ - Step 51309: {'lr': 0.0003751164082314835, 'samples': 9851328, 'steps': 51308, 'loss/train': 1.5488232374191284} 11/07/2021 04:27:27 - INFO - __main__ - Step 51310: {'lr': 0.00037511181385539553, 'samples': 9851520, 'steps': 51309, 'loss/train': 0.549214243888855} 11/07/2021 04:27:28 - INFO - __main__ - Step 51311: {'lr': 0.00037510721942293415, 'samples': 9851712, 'steps': 51310, 'loss/train': 1.7520240545272827} 11/07/2021 04:27:28 - INFO - __main__ - Step 51312: {'lr': 0.0003751026249341013, 'samples': 9851904, 'steps': 51311, 'loss/train': 1.4599018096923828} 11/07/2021 04:27:29 - INFO - __main__ - Step 51313: {'lr': 0.0003750980303888991, 'samples': 9852096, 'steps': 51312, 'loss/train': 1.903134822845459} 11/07/2021 04:27:29 - INFO - __main__ - Step 51314: {'lr': 0.0003750934357873298, 'samples': 9852288, 'steps': 51313, 'loss/train': 1.5627148151397705} 11/07/2021 04:27:30 - INFO - __main__ - Step 51315: {'lr': 0.00037508884112939523, 'samples': 9852480, 'steps': 51314, 'loss/train': 1.7844643592834473} 11/07/2021 04:27:30 - INFO - __main__ - Step 51316: {'lr': 0.0003750842464150975, 'samples': 9852672, 'steps': 51315, 'loss/train': 1.4671173095703125} 11/07/2021 04:27:30 - INFO - __main__ - Step 51317: {'lr': 0.0003750796516444389, 'samples': 9852864, 'steps': 51316, 'loss/train': 1.6843032836914062} 11/07/2021 04:27:31 - INFO - __main__ - Step 51318: {'lr': 0.0003750750568174212, 'samples': 9853056, 'steps': 51317, 'loss/train': 1.5260761976242065} 11/07/2021 04:27:32 - INFO - __main__ - Step 51319: {'lr': 0.00037507046193404665, 'samples': 9853248, 'steps': 51318, 'loss/train': 1.6331720352172852} 11/07/2021 04:27:32 - INFO - __main__ - Step 51320: {'lr': 0.0003750658669943173, 'samples': 9853440, 'steps': 51319, 'loss/train': 1.2930999994277954} 11/07/2021 04:27:32 - INFO - __main__ - Step 51321: {'lr': 0.00037506127199823523, 'samples': 9853632, 'steps': 51320, 'loss/train': 0.9876997470855713} 11/07/2021 04:27:33 - INFO - __main__ - Step 51322: {'lr': 0.00037505667694580244, 'samples': 9853824, 'steps': 51321, 'loss/train': 1.3453137874603271} 11/07/2021 04:27:34 - INFO - __main__ - Step 51323: {'lr': 0.000375052081837021, 'samples': 9854016, 'steps': 51322, 'loss/train': 2.2082297801971436} 11/07/2021 04:27:34 - INFO - __main__ - Step 51324: {'lr': 0.0003750474866718931, 'samples': 9854208, 'steps': 51323, 'loss/train': 0.22028931975364685} 11/07/2021 04:27:35 - INFO - __main__ - Step 51325: {'lr': 0.0003750428914504207, 'samples': 9854400, 'steps': 51324, 'loss/train': 1.3859983682632446} 11/07/2021 04:27:35 - INFO - __main__ - Step 51326: {'lr': 0.0003750382961726059, 'samples': 9854592, 'steps': 51325, 'loss/train': 2.015860080718994} 11/07/2021 04:27:35 - INFO - __main__ - Step 51327: {'lr': 0.0003750337008384508, 'samples': 9854784, 'steps': 51326, 'loss/train': 1.8525826930999756} 11/07/2021 04:27:36 - INFO - __main__ - Step 51328: {'lr': 0.0003750291054479574, 'samples': 9854976, 'steps': 51327, 'loss/train': 1.1386030912399292} 11/07/2021 04:27:37 - INFO - __main__ - Step 51329: {'lr': 0.0003750245100011278, 'samples': 9855168, 'steps': 51328, 'loss/train': 1.5202986001968384} 11/07/2021 04:27:37 - INFO - __main__ - Step 51330: {'lr': 0.00037501991449796415, 'samples': 9855360, 'steps': 51329, 'loss/train': 0.1436721235513687} 11/07/2021 04:27:37 - INFO - __main__ - Step 51331: {'lr': 0.0003750153189384684, 'samples': 9855552, 'steps': 51330, 'loss/train': 1.4653372764587402} 11/07/2021 04:27:38 - INFO - __main__ - Step 51332: {'lr': 0.00037501072332264267, 'samples': 9855744, 'steps': 51331, 'loss/train': 1.437253713607788} 11/07/2021 04:27:38 - INFO - __main__ - Step 51333: {'lr': 0.0003750061276504891, 'samples': 9855936, 'steps': 51332, 'loss/train': 1.316138505935669} 11/07/2021 04:27:39 - INFO - __main__ - Step 51334: {'lr': 0.0003750015319220097, 'samples': 9856128, 'steps': 51333, 'loss/train': 1.7797185182571411} 11/07/2021 04:27:39 - INFO - __main__ - Step 51335: {'lr': 0.0003749969361372065, 'samples': 9856320, 'steps': 51334, 'loss/train': 1.5513956546783447} 11/07/2021 04:27:40 - INFO - __main__ - Step 51336: {'lr': 0.0003749923402960816, 'samples': 9856512, 'steps': 51335, 'loss/train': 1.8163543939590454} 11/07/2021 04:27:40 - INFO - __main__ - Step 51337: {'lr': 0.00037498774439863704, 'samples': 9856704, 'steps': 51336, 'loss/train': 1.3807144165039062} 11/07/2021 04:27:41 - INFO - __main__ - Step 51338: {'lr': 0.000374983148444875, 'samples': 9856896, 'steps': 51337, 'loss/train': 1.6411243677139282} 11/07/2021 04:27:41 - INFO - __main__ - Step 51339: {'lr': 0.00037497855243479744, 'samples': 9857088, 'steps': 51338, 'loss/train': 1.4071072340011597} 11/07/2021 04:27:42 - INFO - __main__ - Step 51340: {'lr': 0.0003749739563684065, 'samples': 9857280, 'steps': 51339, 'loss/train': 0.8938208818435669} 11/07/2021 04:27:42 - INFO - __main__ - Step 51341: {'lr': 0.00037496936024570426, 'samples': 9857472, 'steps': 51340, 'loss/train': 1.4135487079620361} 11/07/2021 04:27:43 - INFO - __main__ - Step 51342: {'lr': 0.0003749647640666927, 'samples': 9857664, 'steps': 51341, 'loss/train': 0.7178032398223877} 11/07/2021 04:27:43 - INFO - __main__ - Step 51343: {'lr': 0.000374960167831374, 'samples': 9857856, 'steps': 51342, 'loss/train': 1.3337452411651611} 11/07/2021 04:27:44 - INFO - __main__ - Step 51344: {'lr': 0.00037495557153975016, 'samples': 9858048, 'steps': 51343, 'loss/train': 1.2852369546890259} 11/07/2021 04:27:44 - INFO - __main__ - Step 51345: {'lr': 0.0003749509751918232, 'samples': 9858240, 'steps': 51344, 'loss/train': 1.4947065114974976} 11/07/2021 04:27:45 - INFO - __main__ - Step 51346: {'lr': 0.0003749463787875953, 'samples': 9858432, 'steps': 51345, 'loss/train': 1.188281774520874} 11/07/2021 04:27:45 - INFO - __main__ - Step 51347: {'lr': 0.00037494178232706847, 'samples': 9858624, 'steps': 51346, 'loss/train': 1.556554913520813} 11/07/2021 04:27:45 - INFO - __main__ - Step 51348: {'lr': 0.00037493718581024484, 'samples': 9858816, 'steps': 51347, 'loss/train': 1.08109450340271} 11/07/2021 04:27:46 - INFO - __main__ - Step 51349: {'lr': 0.0003749325892371264, 'samples': 9859008, 'steps': 51348, 'loss/train': 1.705818772315979} 11/07/2021 04:27:47 - INFO - __main__ - Step 51350: {'lr': 0.0003749279926077153, 'samples': 9859200, 'steps': 51349, 'loss/train': 1.6825324296951294} 11/07/2021 04:27:47 - INFO - __main__ - Step 51351: {'lr': 0.0003749233959220136, 'samples': 9859392, 'steps': 51350, 'loss/train': 1.872248649597168} 11/07/2021 04:27:47 - INFO - __main__ - Step 51352: {'lr': 0.00037491879918002323, 'samples': 9859584, 'steps': 51351, 'loss/train': 1.6915991306304932} 11/07/2021 04:27:48 - INFO - __main__ - Step 51353: {'lr': 0.0003749142023817465, 'samples': 9859776, 'steps': 51352, 'loss/train': 1.0961235761642456} 11/07/2021 04:27:48 - INFO - __main__ - Step 51354: {'lr': 0.00037490960552718534, 'samples': 9859968, 'steps': 51353, 'loss/train': 1.577583909034729} 11/07/2021 04:27:49 - INFO - __main__ - Step 51355: {'lr': 0.00037490500861634183, 'samples': 9860160, 'steps': 51354, 'loss/train': 1.4419667720794678} 11/07/2021 04:27:50 - INFO - __main__ - Step 51356: {'lr': 0.00037490041164921803, 'samples': 9860352, 'steps': 51355, 'loss/train': 1.8501923084259033} 11/07/2021 04:27:50 - INFO - __main__ - Step 51357: {'lr': 0.000374895814625816, 'samples': 9860544, 'steps': 51356, 'loss/train': 1.4689220190048218} 11/07/2021 04:27:50 - INFO - __main__ - Step 51358: {'lr': 0.00037489121754613787, 'samples': 9860736, 'steps': 51357, 'loss/train': 1.227744698524475} 11/07/2021 04:27:51 - INFO - __main__ - Step 51359: {'lr': 0.00037488662041018574, 'samples': 9860928, 'steps': 51358, 'loss/train': 1.2650045156478882} 11/07/2021 04:27:52 - INFO - __main__ - Step 51360: {'lr': 0.00037488202321796156, 'samples': 9861120, 'steps': 51359, 'loss/train': 1.6581660509109497} 11/07/2021 04:27:52 - INFO - __main__ - Step 51361: {'lr': 0.0003748774259694675, 'samples': 9861312, 'steps': 51360, 'loss/train': 1.516729474067688} 11/07/2021 04:27:52 - INFO - __main__ - Step 51362: {'lr': 0.00037487282866470565, 'samples': 9861504, 'steps': 51361, 'loss/train': 1.3707294464111328} 11/07/2021 04:27:53 - INFO - __main__ - Step 51363: {'lr': 0.00037486823130367786, 'samples': 9861696, 'steps': 51362, 'loss/train': 1.912338137626648} 11/07/2021 04:27:53 - INFO - __main__ - Step 51364: {'lr': 0.0003748636338863865, 'samples': 9861888, 'steps': 51363, 'loss/train': 1.4236916303634644} 11/07/2021 04:27:54 - INFO - __main__ - Step 51365: {'lr': 0.0003748590364128335, 'samples': 9862080, 'steps': 51364, 'loss/train': 1.3337184190750122} 11/07/2021 04:27:54 - INFO - __main__ - Step 51366: {'lr': 0.00037485443888302095, 'samples': 9862272, 'steps': 51365, 'loss/train': 1.347653865814209} 11/07/2021 04:27:55 - INFO - __main__ - Step 51367: {'lr': 0.00037484984129695096, 'samples': 9862464, 'steps': 51366, 'loss/train': 1.1018328666687012} 11/07/2021 04:27:55 - INFO - __main__ - Step 51368: {'lr': 0.00037484524365462545, 'samples': 9862656, 'steps': 51367, 'loss/train': 2.0757765769958496} 11/07/2021 04:27:55 - INFO - __main__ - Step 51369: {'lr': 0.0003748406459560466, 'samples': 9862848, 'steps': 51368, 'loss/train': 1.8313307762145996} 11/07/2021 04:27:56 - INFO - __main__ - Step 51370: {'lr': 0.0003748360482012166, 'samples': 9863040, 'steps': 51369, 'loss/train': 1.4531660079956055} 11/07/2021 04:27:57 - INFO - __main__ - Step 51371: {'lr': 0.00037483145039013735, 'samples': 9863232, 'steps': 51370, 'loss/train': 1.2641748189926147} 11/07/2021 04:27:57 - INFO - __main__ - Step 51372: {'lr': 0.0003748268525228109, 'samples': 9863424, 'steps': 51371, 'loss/train': 1.3420898914337158} 11/07/2021 04:27:58 - INFO - __main__ - Step 51373: {'lr': 0.00037482225459923945, 'samples': 9863616, 'steps': 51372, 'loss/train': 1.6101536750793457} 11/07/2021 04:27:58 - INFO - __main__ - Step 51374: {'lr': 0.00037481765661942506, 'samples': 9863808, 'steps': 51373, 'loss/train': 0.9312657117843628} 11/07/2021 04:27:59 - INFO - __main__ - Step 51375: {'lr': 0.0003748130585833697, 'samples': 9864000, 'steps': 51374, 'loss/train': 1.268650770187378} 11/07/2021 04:27:59 - INFO - __main__ - Step 51376: {'lr': 0.0003748084604910755, 'samples': 9864192, 'steps': 51375, 'loss/train': 1.219296932220459} 11/07/2021 04:28:00 - INFO - __main__ - Step 51377: {'lr': 0.0003748038623425446, 'samples': 9864384, 'steps': 51376, 'loss/train': 1.5603306293487549} 11/07/2021 04:28:00 - INFO - __main__ - Step 51378: {'lr': 0.00037479926413777896, 'samples': 9864576, 'steps': 51377, 'loss/train': 1.482375144958496} 11/07/2021 04:28:00 - INFO - __main__ - Step 51379: {'lr': 0.0003747946658767807, 'samples': 9864768, 'steps': 51378, 'loss/train': 0.4298912286758423} 11/07/2021 04:28:01 - INFO - __main__ - Step 51380: {'lr': 0.0003747900675595519, 'samples': 9864960, 'steps': 51379, 'loss/train': 1.7146047353744507} 11/07/2021 04:28:02 - INFO - __main__ - Step 51381: {'lr': 0.00037478546918609464, 'samples': 9865152, 'steps': 51380, 'loss/train': 2.189980983734131} 11/07/2021 04:28:02 - INFO - __main__ - Step 51382: {'lr': 0.00037478087075641095, 'samples': 9865344, 'steps': 51381, 'loss/train': 1.391196846961975} 11/07/2021 04:28:02 - INFO - __main__ - Step 51383: {'lr': 0.00037477627227050286, 'samples': 9865536, 'steps': 51382, 'loss/train': 1.5402201414108276} 11/07/2021 04:28:03 - INFO - __main__ - Step 51384: {'lr': 0.0003747716737283726, 'samples': 9865728, 'steps': 51383, 'loss/train': 2.444733142852783} 11/07/2021 04:28:03 - INFO - __main__ - Step 51385: {'lr': 0.00037476707513002213, 'samples': 9865920, 'steps': 51384, 'loss/train': 1.5121098756790161} 11/07/2021 04:28:04 - INFO - __main__ - Step 51386: {'lr': 0.0003747624764754535, 'samples': 9866112, 'steps': 51385, 'loss/train': 1.090367078781128} 11/07/2021 04:28:04 - INFO - __main__ - Step 51387: {'lr': 0.00037475787776466887, 'samples': 9866304, 'steps': 51386, 'loss/train': 1.01362943649292} 11/07/2021 04:28:05 - INFO - __main__ - Step 51388: {'lr': 0.00037475327899767026, 'samples': 9866496, 'steps': 51387, 'loss/train': 1.2422531843185425} 11/07/2021 04:28:05 - INFO - __main__ - Step 51389: {'lr': 0.0003747486801744597, 'samples': 9866688, 'steps': 51388, 'loss/train': 2.2478537559509277} 11/07/2021 04:28:06 - INFO - __main__ - Step 51390: {'lr': 0.0003747440812950393, 'samples': 9866880, 'steps': 51389, 'loss/train': 1.2774142026901245} 11/07/2021 04:28:06 - INFO - __main__ - Step 51391: {'lr': 0.0003747394823594112, 'samples': 9867072, 'steps': 51390, 'loss/train': 1.6010338068008423} 11/07/2021 04:28:07 - INFO - __main__ - Step 51392: {'lr': 0.00037473488336757743, 'samples': 9867264, 'steps': 51391, 'loss/train': 2.4366376399993896} 11/07/2021 04:28:07 - INFO - __main__ - Step 51393: {'lr': 0.00037473028431954006, 'samples': 9867456, 'steps': 51392, 'loss/train': 1.157906174659729} 11/07/2021 04:28:08 - INFO - __main__ - Step 51394: {'lr': 0.00037472568521530107, 'samples': 9867648, 'steps': 51393, 'loss/train': 1.1847411394119263} 11/07/2021 04:28:08 - INFO - __main__ - Step 51395: {'lr': 0.0003747210860548627, 'samples': 9867840, 'steps': 51394, 'loss/train': 0.23853078484535217} 11/07/2021 04:28:09 - INFO - __main__ - Step 51396: {'lr': 0.00037471648683822683, 'samples': 9868032, 'steps': 51395, 'loss/train': 1.126974105834961} 11/07/2021 04:28:09 - INFO - __main__ - Step 51397: {'lr': 0.0003747118875653957, 'samples': 9868224, 'steps': 51396, 'loss/train': 1.9312546253204346} 11/07/2021 04:28:10 - INFO - __main__ - Step 51398: {'lr': 0.00037470728823637135, 'samples': 9868416, 'steps': 51397, 'loss/train': 1.4620530605316162} 11/07/2021 04:28:10 - INFO - __main__ - Step 51399: {'lr': 0.0003747026888511558, 'samples': 9868608, 'steps': 51398, 'loss/train': 1.5368403196334839} 11/07/2021 04:28:10 - INFO - __main__ - Step 51400: {'lr': 0.00037469808940975106, 'samples': 9868800, 'steps': 51399, 'loss/train': 1.1409534215927124} 11/07/2021 04:28:11 - INFO - __main__ - Step 51401: {'lr': 0.00037469348991215934, 'samples': 9868992, 'steps': 51400, 'loss/train': 1.626787543296814} 11/07/2021 04:28:12 - INFO - __main__ - Step 51402: {'lr': 0.00037468889035838264, 'samples': 9869184, 'steps': 51401, 'loss/train': 1.4606435298919678} 11/07/2021 04:28:12 - INFO - __main__ - Step 51403: {'lr': 0.0003746842907484231, 'samples': 9869376, 'steps': 51402, 'loss/train': 3.1921544075012207} 11/07/2021 04:28:12 - INFO - __main__ - Step 51404: {'lr': 0.0003746796910822827, 'samples': 9869568, 'steps': 51403, 'loss/train': 1.2902007102966309} 11/07/2021 04:28:13 - INFO - __main__ - Step 51405: {'lr': 0.0003746750913599636, 'samples': 9869760, 'steps': 51404, 'loss/train': 0.8975526690483093} 11/07/2021 04:28:13 - INFO - __main__ - Step 51406: {'lr': 0.00037467049158146777, 'samples': 9869952, 'steps': 51405, 'loss/train': 1.7837495803833008} 11/07/2021 04:28:14 - INFO - __main__ - Step 51407: {'lr': 0.00037466589174679733, 'samples': 9870144, 'steps': 51406, 'loss/train': 1.249118685722351} 11/07/2021 04:28:14 - INFO - __main__ - Step 51408: {'lr': 0.0003746612918559544, 'samples': 9870336, 'steps': 51407, 'loss/train': 1.4978562593460083} 11/07/2021 04:28:15 - INFO - __main__ - Step 51409: {'lr': 0.00037465669190894107, 'samples': 9870528, 'steps': 51408, 'loss/train': 1.2435975074768066} 11/07/2021 04:28:15 - INFO - __main__ - Step 51410: {'lr': 0.00037465209190575927, 'samples': 9870720, 'steps': 51409, 'loss/train': 1.4231942892074585} 11/07/2021 04:28:16 - INFO - __main__ - Step 51411: {'lr': 0.00037464749184641123, 'samples': 9870912, 'steps': 51410, 'loss/train': 1.044649362564087} 11/07/2021 04:28:17 - INFO - __main__ - Step 51412: {'lr': 0.0003746428917308989, 'samples': 9871104, 'steps': 51411, 'loss/train': 1.5776267051696777} 11/07/2021 04:28:17 - INFO - __main__ - Step 51413: {'lr': 0.0003746382915592244, 'samples': 9871296, 'steps': 51412, 'loss/train': 1.2381223440170288} 11/07/2021 04:28:17 - INFO - __main__ - Step 51414: {'lr': 0.0003746336913313898, 'samples': 9871488, 'steps': 51413, 'loss/train': 1.206687569618225} 11/07/2021 04:28:18 - INFO - __main__ - Step 51415: {'lr': 0.0003746290910473973, 'samples': 9871680, 'steps': 51414, 'loss/train': 1.6536258459091187} 11/07/2021 04:28:18 - INFO - __main__ - Step 51416: {'lr': 0.00037462449070724876, 'samples': 9871872, 'steps': 51415, 'loss/train': 1.7118861675262451} 11/07/2021 04:28:19 - INFO - __main__ - Step 51417: {'lr': 0.00037461989031094636, 'samples': 9872064, 'steps': 51416, 'loss/train': 1.6540234088897705} 11/07/2021 04:28:20 - INFO - __main__ - Step 51418: {'lr': 0.00037461528985849215, 'samples': 9872256, 'steps': 51417, 'loss/train': 1.264769196510315} 11/07/2021 04:28:20 - INFO - __main__ - Step 51419: {'lr': 0.0003746106893498882, 'samples': 9872448, 'steps': 51418, 'loss/train': 1.1097873449325562} 11/07/2021 04:28:20 - INFO - __main__ - Step 51420: {'lr': 0.00037460608878513656, 'samples': 9872640, 'steps': 51419, 'loss/train': 1.7027997970581055} 11/07/2021 04:28:21 - INFO - __main__ - Step 51421: {'lr': 0.00037460148816423946, 'samples': 9872832, 'steps': 51420, 'loss/train': 1.830729365348816} 11/07/2021 04:28:21 - INFO - __main__ - Step 51422: {'lr': 0.0003745968874871988, 'samples': 9873024, 'steps': 51421, 'loss/train': 1.856423020362854} 11/07/2021 04:28:21 - INFO - __main__ - Step 51423: {'lr': 0.00037459228675401667, 'samples': 9873216, 'steps': 51422, 'loss/train': 0.4582367539405823} 11/07/2021 04:28:22 - INFO - __main__ - Step 51424: {'lr': 0.00037458768596469516, 'samples': 9873408, 'steps': 51423, 'loss/train': 1.4918971061706543} 11/07/2021 04:28:23 - INFO - __main__ - Step 51425: {'lr': 0.0003745830851192364, 'samples': 9873600, 'steps': 51424, 'loss/train': 1.3095556497573853} 11/07/2021 04:28:23 - INFO - __main__ - Step 51426: {'lr': 0.00037457848421764247, 'samples': 9873792, 'steps': 51425, 'loss/train': 1.410119652748108} 11/07/2021 04:28:23 - INFO - __main__ - Step 51427: {'lr': 0.0003745738832599153, 'samples': 9873984, 'steps': 51426, 'loss/train': 1.9977155923843384} 11/07/2021 04:28:24 - INFO - __main__ - Step 51428: {'lr': 0.0003745692822460572, 'samples': 9874176, 'steps': 51427, 'loss/train': 1.139614224433899} 11/07/2021 04:28:25 - INFO - __main__ - Step 51429: {'lr': 0.00037456468117607, 'samples': 9874368, 'steps': 51428, 'loss/train': 1.6224833726882935} 11/07/2021 04:28:26 - INFO - __main__ - Step 51430: {'lr': 0.0003745600800499559, 'samples': 9874560, 'steps': 51429, 'loss/train': 1.3608607053756714} 11/07/2021 04:28:26 - INFO - __main__ - Step 51431: {'lr': 0.0003745554788677169, 'samples': 9874752, 'steps': 51430, 'loss/train': 1.467066764831543} 11/07/2021 04:28:26 - INFO - __main__ - Step 51432: {'lr': 0.0003745508776293551, 'samples': 9874944, 'steps': 51431, 'loss/train': 1.8557852506637573} 11/07/2021 04:28:27 - INFO - __main__ - Step 51433: {'lr': 0.0003745462763348727, 'samples': 9875136, 'steps': 51432, 'loss/train': 1.9213792085647583} 11/07/2021 04:28:27 - INFO - __main__ - Step 51434: {'lr': 0.00037454167498427165, 'samples': 9875328, 'steps': 51433, 'loss/train': 1.5387630462646484} 11/07/2021 04:28:28 - INFO - __main__ - Step 51435: {'lr': 0.0003745370735775541, 'samples': 9875520, 'steps': 51434, 'loss/train': 1.3750271797180176} 11/07/2021 04:28:28 - INFO - __main__ - Step 51436: {'lr': 0.00037453247211472195, 'samples': 9875712, 'steps': 51435, 'loss/train': 2.651184320449829} 11/07/2021 04:28:29 - INFO - __main__ - Step 51437: {'lr': 0.0003745278705957774, 'samples': 9875904, 'steps': 51436, 'loss/train': 1.0046931505203247} 11/07/2021 04:28:29 - INFO - __main__ - Step 51438: {'lr': 0.00037452326902072256, 'samples': 9876096, 'steps': 51437, 'loss/train': 1.501657485961914} 11/07/2021 04:28:30 - INFO - __main__ - Step 51439: {'lr': 0.0003745186673895594, 'samples': 9876288, 'steps': 51438, 'loss/train': 1.5010449886322021} 11/07/2021 04:28:31 - INFO - __main__ - Step 51440: {'lr': 0.0003745140657022901, 'samples': 9876480, 'steps': 51439, 'loss/train': 1.5103965997695923} 11/07/2021 04:28:31 - INFO - __main__ - Step 51441: {'lr': 0.0003745094639589167, 'samples': 9876672, 'steps': 51440, 'loss/train': 1.458385705947876} 11/07/2021 04:28:31 - INFO - __main__ - Step 51442: {'lr': 0.00037450486215944123, 'samples': 9876864, 'steps': 51441, 'loss/train': 1.954463005065918} 11/07/2021 04:28:32 - INFO - __main__ - Step 51443: {'lr': 0.0003745002603038658, 'samples': 9877056, 'steps': 51442, 'loss/train': 1.5138039588928223} 11/07/2021 04:28:32 - INFO - __main__ - Step 51444: {'lr': 0.00037449565839219246, 'samples': 9877248, 'steps': 51443, 'loss/train': 0.8293909430503845} 11/07/2021 04:28:33 - INFO - __main__ - Step 51445: {'lr': 0.0003744910564244233, 'samples': 9877440, 'steps': 51444, 'loss/train': 0.7137060761451721} 11/07/2021 04:28:33 - INFO - __main__ - Step 51446: {'lr': 0.0003744864544005604, 'samples': 9877632, 'steps': 51445, 'loss/train': 1.6284750699996948} 11/07/2021 04:28:34 - INFO - __main__ - Step 51447: {'lr': 0.0003744818523206058, 'samples': 9877824, 'steps': 51446, 'loss/train': 1.0235060453414917} 11/07/2021 04:28:34 - INFO - __main__ - Step 51448: {'lr': 0.00037447725018456167, 'samples': 9878016, 'steps': 51447, 'loss/train': 1.4038679599761963} 11/07/2021 04:28:34 - INFO - __main__ - Step 51449: {'lr': 0.00037447264799243, 'samples': 9878208, 'steps': 51448, 'loss/train': 1.4147348403930664} 11/07/2021 04:28:35 - INFO - __main__ - Step 51450: {'lr': 0.00037446804574421276, 'samples': 9878400, 'steps': 51449, 'loss/train': 1.1394082307815552} 11/07/2021 04:28:36 - INFO - __main__ - Step 51451: {'lr': 0.00037446344343991224, 'samples': 9878592, 'steps': 51450, 'loss/train': 1.2706471681594849} 11/07/2021 04:28:36 - INFO - __main__ - Step 51452: {'lr': 0.0003744588410795304, 'samples': 9878784, 'steps': 51451, 'loss/train': 1.8876612186431885} 11/07/2021 04:28:37 - INFO - __main__ - Step 51453: {'lr': 0.00037445423866306926, 'samples': 9878976, 'steps': 51452, 'loss/train': 1.4991883039474487} 11/07/2021 04:28:37 - INFO - __main__ - Step 51454: {'lr': 0.00037444963619053103, 'samples': 9879168, 'steps': 51453, 'loss/train': 1.1963261365890503} 11/07/2021 04:28:37 - INFO - __main__ - Step 51455: {'lr': 0.00037444503366191776, 'samples': 9879360, 'steps': 51454, 'loss/train': 1.015403389930725} 11/07/2021 04:28:38 - INFO - __main__ - Step 51456: {'lr': 0.00037444043107723134, 'samples': 9879552, 'steps': 51455, 'loss/train': 1.8410333395004272} 11/07/2021 04:28:39 - INFO - __main__ - Step 51457: {'lr': 0.0003744358284364741, 'samples': 9879744, 'steps': 51456, 'loss/train': 1.4342031478881836} 11/07/2021 04:28:39 - INFO - __main__ - Step 51458: {'lr': 0.00037443122573964794, 'samples': 9879936, 'steps': 51457, 'loss/train': 1.4363067150115967} 11/07/2021 04:28:39 - INFO - __main__ - Step 51459: {'lr': 0.000374426622986755, 'samples': 9880128, 'steps': 51458, 'loss/train': 1.6961021423339844} 11/07/2021 04:28:40 - INFO - __main__ - Step 51460: {'lr': 0.0003744220201777974, 'samples': 9880320, 'steps': 51459, 'loss/train': 1.4590411186218262} 11/07/2021 04:28:41 - INFO - __main__ - Step 51461: {'lr': 0.0003744174173127771, 'samples': 9880512, 'steps': 51460, 'loss/train': 1.4540116786956787} 11/07/2021 04:28:41 - INFO - __main__ - Step 51462: {'lr': 0.00037441281439169624, 'samples': 9880704, 'steps': 51461, 'loss/train': 1.4658534526824951} 11/07/2021 04:28:41 - INFO - __main__ - Step 51463: {'lr': 0.0003744082114145568, 'samples': 9880896, 'steps': 51462, 'loss/train': 1.1046414375305176} 11/07/2021 04:28:42 - INFO - __main__ - Step 51464: {'lr': 0.00037440360838136106, 'samples': 9881088, 'steps': 51463, 'loss/train': 1.3326470851898193} 11/07/2021 04:28:42 - INFO - __main__ - Step 51465: {'lr': 0.0003743990052921109, 'samples': 9881280, 'steps': 51464, 'loss/train': 1.2350364923477173} 11/07/2021 04:28:43 - INFO - __main__ - Step 51466: {'lr': 0.00037439440214680854, 'samples': 9881472, 'steps': 51465, 'loss/train': 1.0318598747253418} 11/07/2021 04:28:44 - INFO - __main__ - Step 51467: {'lr': 0.00037438979894545595, 'samples': 9881664, 'steps': 51466, 'loss/train': 1.6322429180145264} 11/07/2021 04:28:44 - INFO - __main__ - Step 51468: {'lr': 0.0003743851956880553, 'samples': 9881856, 'steps': 51467, 'loss/train': 1.3701001405715942} 11/07/2021 04:28:44 - INFO - __main__ - Step 51469: {'lr': 0.00037438059237460846, 'samples': 9882048, 'steps': 51468, 'loss/train': 1.480985164642334} 11/07/2021 04:28:45 - INFO - __main__ - Step 51470: {'lr': 0.0003743759890051177, 'samples': 9882240, 'steps': 51469, 'loss/train': 1.0128283500671387} 11/07/2021 04:28:46 - INFO - __main__ - Step 51471: {'lr': 0.00037437138557958505, 'samples': 9882432, 'steps': 51470, 'loss/train': 1.2985695600509644} 11/07/2021 04:28:46 - INFO - __main__ - Step 51472: {'lr': 0.0003743667820980126, 'samples': 9882624, 'steps': 51471, 'loss/train': 1.4783293008804321} 11/07/2021 04:28:46 - INFO - __main__ - Step 51473: {'lr': 0.0003743621785604024, 'samples': 9882816, 'steps': 51472, 'loss/train': 1.547150731086731} 11/07/2021 04:28:47 - INFO - __main__ - Step 51474: {'lr': 0.00037435757496675646, 'samples': 9883008, 'steps': 51473, 'loss/train': 1.023967981338501} 11/07/2021 04:28:47 - INFO - __main__ - Step 51475: {'lr': 0.000374352971317077, 'samples': 9883200, 'steps': 51474, 'loss/train': 1.2801886796951294} 11/07/2021 04:28:48 - INFO - __main__ - Step 51476: {'lr': 0.0003743483676113659, 'samples': 9883392, 'steps': 51475, 'loss/train': 1.77716064453125} 11/07/2021 04:28:48 - INFO - __main__ - Step 51477: {'lr': 0.00037434376384962544, 'samples': 9883584, 'steps': 51476, 'loss/train': 1.5756834745407104} 11/07/2021 04:28:49 - INFO - __main__ - Step 51478: {'lr': 0.00037433916003185757, 'samples': 9883776, 'steps': 51477, 'loss/train': 1.6418976783752441} 11/07/2021 04:28:49 - INFO - __main__ - Step 51479: {'lr': 0.0003743345561580644, 'samples': 9883968, 'steps': 51478, 'loss/train': 1.2006781101226807} 11/07/2021 04:28:49 - INFO - __main__ - Step 51480: {'lr': 0.0003743299522282479, 'samples': 9884160, 'steps': 51479, 'loss/train': 1.4376378059387207} 11/07/2021 04:28:51 - INFO - __main__ - Step 51481: {'lr': 0.0003743253482424104, 'samples': 9884352, 'steps': 51480, 'loss/train': 1.3342554569244385} 11/07/2021 04:28:51 - INFO - __main__ - Step 51482: {'lr': 0.00037432074420055376, 'samples': 9884544, 'steps': 51481, 'loss/train': 1.9823734760284424} 11/07/2021 04:28:51 - INFO - __main__ - Step 51483: {'lr': 0.00037431614010268013, 'samples': 9884736, 'steps': 51482, 'loss/train': 1.7683123350143433} 11/07/2021 04:28:52 - INFO - __main__ - Step 51484: {'lr': 0.0003743115359487915, 'samples': 9884928, 'steps': 51483, 'loss/train': 1.566193699836731} 11/07/2021 04:28:52 - INFO - __main__ - Step 51485: {'lr': 0.00037430693173889, 'samples': 9885120, 'steps': 51484, 'loss/train': 1.6051998138427734} 11/07/2021 04:28:53 - INFO - __main__ - Step 51486: {'lr': 0.00037430232747297774, 'samples': 9885312, 'steps': 51485, 'loss/train': 1.6360217332839966} 11/07/2021 04:28:53 - INFO - __main__ - Step 51487: {'lr': 0.00037429772315105683, 'samples': 9885504, 'steps': 51486, 'loss/train': 1.1560131311416626} 11/07/2021 04:28:54 - INFO - __main__ - Step 51488: {'lr': 0.0003742931187731293, 'samples': 9885696, 'steps': 51487, 'loss/train': 0.9204044938087463} 11/07/2021 04:28:54 - INFO - __main__ - Step 51489: {'lr': 0.00037428851433919707, 'samples': 9885888, 'steps': 51488, 'loss/train': 1.2909021377563477} 11/07/2021 04:28:54 - INFO - __main__ - Step 51490: {'lr': 0.0003742839098492625, 'samples': 9886080, 'steps': 51489, 'loss/train': 1.2641663551330566} 11/07/2021 04:28:55 - INFO - __main__ - Step 51491: {'lr': 0.0003742793053033274, 'samples': 9886272, 'steps': 51490, 'loss/train': 1.4978505373001099} 11/07/2021 04:28:56 - INFO - __main__ - Step 51492: {'lr': 0.000374274700701394, 'samples': 9886464, 'steps': 51491, 'loss/train': 1.688596487045288} 11/07/2021 04:28:56 - INFO - __main__ - Step 51493: {'lr': 0.00037427009604346437, 'samples': 9886656, 'steps': 51492, 'loss/train': 1.7720634937286377} 11/07/2021 04:28:57 - INFO - __main__ - Step 51494: {'lr': 0.0003742654913295405, 'samples': 9886848, 'steps': 51493, 'loss/train': 1.5528498888015747} 11/07/2021 04:28:57 - INFO - __main__ - Step 51495: {'lr': 0.0003742608865596246, 'samples': 9887040, 'steps': 51494, 'loss/train': 1.5911250114440918} 11/07/2021 04:28:57 - INFO - __main__ - Step 51496: {'lr': 0.0003742562817337186, 'samples': 9887232, 'steps': 51495, 'loss/train': 1.3602335453033447} 11/07/2021 04:28:58 - INFO - __main__ - Step 51497: {'lr': 0.0003742516768518247, 'samples': 9887424, 'steps': 51496, 'loss/train': 1.474177598953247} 11/07/2021 04:28:59 - INFO - __main__ - Step 51498: {'lr': 0.0003742470719139448, 'samples': 9887616, 'steps': 51497, 'loss/train': 1.5765888690948486} 11/07/2021 04:28:59 - INFO - __main__ - Step 51499: {'lr': 0.0003742424669200811, 'samples': 9887808, 'steps': 51498, 'loss/train': 1.1993672847747803} 11/07/2021 04:28:59 - INFO - __main__ - Step 51500: {'lr': 0.00037423786187023574, 'samples': 9888000, 'steps': 51499, 'loss/train': 0.9676117300987244} 11/07/2021 04:29:00 - INFO - __main__ - Step 51501: {'lr': 0.00037423325676441064, 'samples': 9888192, 'steps': 51500, 'loss/train': 0.9103842973709106} 11/07/2021 04:29:01 - INFO - __main__ - Step 51502: {'lr': 0.0003742286516026081, 'samples': 9888384, 'steps': 51501, 'loss/train': 1.3378478288650513} 11/07/2021 04:29:01 - INFO - __main__ - Step 51503: {'lr': 0.0003742240463848299, 'samples': 9888576, 'steps': 51502, 'loss/train': 1.6188141107559204} 11/07/2021 04:29:01 - INFO - __main__ - Step 51504: {'lr': 0.0003742194411110783, 'samples': 9888768, 'steps': 51503, 'loss/train': 1.5715343952178955} 11/07/2021 04:29:02 - INFO - __main__ - Step 51505: {'lr': 0.00037421483578135536, 'samples': 9888960, 'steps': 51504, 'loss/train': 1.6032418012619019} 11/07/2021 04:29:02 - INFO - __main__ - Step 51506: {'lr': 0.0003742102303956631, 'samples': 9889152, 'steps': 51505, 'loss/train': 1.5006656646728516} 11/07/2021 04:29:03 - INFO - __main__ - Step 51507: {'lr': 0.0003742056249540036, 'samples': 9889344, 'steps': 51506, 'loss/train': 1.2101598978042603} 11/07/2021 04:29:03 - INFO - __main__ - Step 51508: {'lr': 0.00037420101945637906, 'samples': 9889536, 'steps': 51507, 'loss/train': 1.6999436616897583} 11/07/2021 04:29:04 - INFO - __main__ - Step 51509: {'lr': 0.00037419641390279136, 'samples': 9889728, 'steps': 51508, 'loss/train': 0.9679996967315674} 11/07/2021 04:29:04 - INFO - __main__ - Step 51510: {'lr': 0.00037419180829324273, 'samples': 9889920, 'steps': 51509, 'loss/train': 1.518884539604187} 11/07/2021 04:29:04 - INFO - __main__ - Step 51511: {'lr': 0.0003741872026277351, 'samples': 9890112, 'steps': 51510, 'loss/train': 1.2138888835906982} 11/07/2021 04:29:05 - INFO - __main__ - Step 51512: {'lr': 0.00037418259690627075, 'samples': 9890304, 'steps': 51511, 'loss/train': 2.737327814102173} 11/07/2021 04:29:06 - INFO - __main__ - Step 51513: {'lr': 0.0003741779911288516, 'samples': 9890496, 'steps': 51512, 'loss/train': 1.3141067028045654} 11/07/2021 04:29:06 - INFO - __main__ - Step 51514: {'lr': 0.0003741733852954797, 'samples': 9890688, 'steps': 51513, 'loss/train': 1.173534631729126} 11/07/2021 04:29:07 - INFO - __main__ - Step 51515: {'lr': 0.00037416877940615737, 'samples': 9890880, 'steps': 51514, 'loss/train': 1.575916051864624} 11/07/2021 04:29:07 - INFO - __main__ - Step 51516: {'lr': 0.00037416417346088635, 'samples': 9891072, 'steps': 51515, 'loss/train': 1.3990917205810547} 11/07/2021 04:29:07 - INFO - __main__ - Step 51517: {'lr': 0.0003741595674596688, 'samples': 9891264, 'steps': 51516, 'loss/train': 2.040133237838745} 11/07/2021 04:29:08 - INFO - __main__ - Step 51518: {'lr': 0.000374154961402507, 'samples': 9891456, 'steps': 51517, 'loss/train': 1.1698777675628662} 11/07/2021 04:29:09 - INFO - __main__ - Step 51519: {'lr': 0.00037415035528940284, 'samples': 9891648, 'steps': 51518, 'loss/train': 1.1059638261795044} 11/07/2021 04:29:09 - INFO - __main__ - Step 51520: {'lr': 0.00037414574912035845, 'samples': 9891840, 'steps': 51519, 'loss/train': 1.148276448249817} 11/07/2021 04:29:09 - INFO - __main__ - Step 51521: {'lr': 0.0003741411428953759, 'samples': 9892032, 'steps': 51520, 'loss/train': 1.3834562301635742} 11/07/2021 04:29:10 - INFO - __main__ - Step 51522: {'lr': 0.00037413653661445736, 'samples': 9892224, 'steps': 51521, 'loss/train': 1.5016969442367554} 11/07/2021 04:29:11 - INFO - __main__ - Step 51523: {'lr': 0.00037413193027760466, 'samples': 9892416, 'steps': 51522, 'loss/train': 1.725628137588501} 11/07/2021 04:29:11 - INFO - __main__ - Step 51524: {'lr': 0.00037412732388482015, 'samples': 9892608, 'steps': 51523, 'loss/train': 0.6794720888137817} 11/07/2021 04:29:12 - INFO - __main__ - Step 51525: {'lr': 0.0003741227174361057, 'samples': 9892800, 'steps': 51524, 'loss/train': 0.9042429327964783} 11/07/2021 04:29:12 - INFO - __main__ - Step 51526: {'lr': 0.00037411811093146345, 'samples': 9892992, 'steps': 51525, 'loss/train': 1.3141316175460815} 11/07/2021 04:29:12 - INFO - __main__ - Step 51527: {'lr': 0.0003741135043708956, 'samples': 9893184, 'steps': 51526, 'loss/train': 1.630672574043274} 11/07/2021 04:29:13 - INFO - __main__ - Step 51528: {'lr': 0.000374108897754404, 'samples': 9893376, 'steps': 51527, 'loss/train': 1.5166720151901245} 11/07/2021 04:29:14 - INFO - __main__ - Step 51529: {'lr': 0.00037410429108199097, 'samples': 9893568, 'steps': 51528, 'loss/train': 1.4905604124069214} 11/07/2021 04:29:14 - INFO - __main__ - Step 51530: {'lr': 0.0003740996843536584, 'samples': 9893760, 'steps': 51529, 'loss/train': 1.3705830574035645} 11/07/2021 04:29:14 - INFO - __main__ - Step 51531: {'lr': 0.00037409507756940843, 'samples': 9893952, 'steps': 51530, 'loss/train': 1.4777553081512451} 11/07/2021 04:29:15 - INFO - __main__ - Step 51532: {'lr': 0.00037409047072924307, 'samples': 9894144, 'steps': 51531, 'loss/train': 1.2640812397003174} 11/07/2021 04:29:15 - INFO - __main__ - Step 51533: {'lr': 0.0003740858638331646, 'samples': 9894336, 'steps': 51532, 'loss/train': 0.8791767358779907} 11/07/2021 04:29:16 - INFO - __main__ - Step 51534: {'lr': 0.0003740812568811748, 'samples': 9894528, 'steps': 51533, 'loss/train': 1.3411303758621216} 11/07/2021 04:29:17 - INFO - __main__ - Step 51535: {'lr': 0.000374076649873276, 'samples': 9894720, 'steps': 51534, 'loss/train': 1.0225270986557007} 11/07/2021 04:29:17 - INFO - __main__ - Step 51536: {'lr': 0.00037407204280947014, 'samples': 9894912, 'steps': 51535, 'loss/train': 1.6323903799057007} 11/07/2021 04:29:17 - INFO - __main__ - Step 51537: {'lr': 0.0003740674356897593, 'samples': 9895104, 'steps': 51536, 'loss/train': 0.9552247524261475} 11/07/2021 04:29:18 - INFO - __main__ - Step 51538: {'lr': 0.0003740628285141457, 'samples': 9895296, 'steps': 51537, 'loss/train': 1.5924347639083862} 11/07/2021 04:29:19 - INFO - __main__ - Step 51539: {'lr': 0.00037405822128263125, 'samples': 9895488, 'steps': 51538, 'loss/train': 0.8937223553657532} 11/07/2021 04:29:19 - INFO - __main__ - Step 51540: {'lr': 0.000374053613995218, 'samples': 9895680, 'steps': 51539, 'loss/train': 1.7730861902236938} 11/07/2021 04:29:19 - INFO - __main__ - Step 51541: {'lr': 0.0003740490066519082, 'samples': 9895872, 'steps': 51540, 'loss/train': 5.167013645172119} 11/07/2021 04:29:20 - INFO - __main__ - Step 51542: {'lr': 0.0003740443992527038, 'samples': 9896064, 'steps': 51541, 'loss/train': 1.4915814399719238} 11/07/2021 04:29:20 - INFO - __main__ - Step 51543: {'lr': 0.00037403979179760687, 'samples': 9896256, 'steps': 51542, 'loss/train': 1.727198600769043} 11/07/2021 04:29:20 - INFO - __main__ - Step 51544: {'lr': 0.0003740351842866196, 'samples': 9896448, 'steps': 51543, 'loss/train': 1.2733923196792603} 11/07/2021 04:29:21 - INFO - __main__ - Step 51545: {'lr': 0.0003740305767197439, 'samples': 9896640, 'steps': 51544, 'loss/train': 1.4166321754455566} 11/07/2021 04:29:22 - INFO - __main__ - Step 51546: {'lr': 0.0003740259690969821, 'samples': 9896832, 'steps': 51545, 'loss/train': 1.5141226053237915} 11/07/2021 04:29:22 - INFO - __main__ - Step 51547: {'lr': 0.00037402136141833595, 'samples': 9897024, 'steps': 51546, 'loss/train': 1.39847993850708} 11/07/2021 04:29:22 - INFO - __main__ - Step 51548: {'lr': 0.0003740167536838077, 'samples': 9897216, 'steps': 51547, 'loss/train': 1.4686611890792847} 11/07/2021 04:29:23 - INFO - __main__ - Step 51549: {'lr': 0.0003740121458933995, 'samples': 9897408, 'steps': 51548, 'loss/train': 2.0743227005004883} 11/07/2021 04:29:24 - INFO - __main__ - Step 51550: {'lr': 0.0003740075380471133, 'samples': 9897600, 'steps': 51549, 'loss/train': 1.748748540878296} 11/07/2021 04:29:24 - INFO - __main__ - Step 51551: {'lr': 0.0003740029301449512, 'samples': 9897792, 'steps': 51550, 'loss/train': 1.677230715751648} 11/07/2021 04:29:25 - INFO - __main__ - Step 51552: {'lr': 0.0003739983221869153, 'samples': 9897984, 'steps': 51551, 'loss/train': 0.8572708368301392} 11/07/2021 04:29:25 - INFO - __main__ - Step 51553: {'lr': 0.00037399371417300766, 'samples': 9898176, 'steps': 51552, 'loss/train': 1.0359253883361816} 11/07/2021 04:29:25 - INFO - __main__ - Step 51554: {'lr': 0.00037398910610323034, 'samples': 9898368, 'steps': 51553, 'loss/train': 0.7630068063735962} 11/07/2021 04:29:26 - INFO - __main__ - Step 51555: {'lr': 0.0003739844979775855, 'samples': 9898560, 'steps': 51554, 'loss/train': 5.784372806549072} 11/07/2021 04:29:27 - INFO - __main__ - Step 51556: {'lr': 0.0003739798897960752, 'samples': 9898752, 'steps': 51555, 'loss/train': 2.1621899604797363} 11/07/2021 04:29:27 - INFO - __main__ - Step 51557: {'lr': 0.00037397528155870134, 'samples': 9898944, 'steps': 51556, 'loss/train': 1.7338827848434448} 11/07/2021 04:29:27 - INFO - __main__ - Step 51558: {'lr': 0.00037397067326546616, 'samples': 9899136, 'steps': 51557, 'loss/train': 1.0028150081634521} 11/07/2021 04:29:28 - INFO - __main__ - Step 51559: {'lr': 0.0003739660649163718, 'samples': 9899328, 'steps': 51558, 'loss/train': 1.5446821451187134} 11/07/2021 04:29:28 - INFO - __main__ - Step 51560: {'lr': 0.0003739614565114202, 'samples': 9899520, 'steps': 51559, 'loss/train': 1.6628481149673462} 11/07/2021 04:29:29 - INFO - __main__ - Step 51561: {'lr': 0.00037395684805061345, 'samples': 9899712, 'steps': 51560, 'loss/train': 1.4018369913101196} 11/07/2021 04:29:30 - INFO - __main__ - Step 51562: {'lr': 0.00037395223953395375, 'samples': 9899904, 'steps': 51561, 'loss/train': 1.3891258239746094} 11/07/2021 04:29:30 - INFO - __main__ - Step 51563: {'lr': 0.000373947630961443, 'samples': 9900096, 'steps': 51562, 'loss/train': 1.5041695833206177} 11/07/2021 04:29:30 - INFO - __main__ - Step 51564: {'lr': 0.00037394302233308336, 'samples': 9900288, 'steps': 51563, 'loss/train': 1.408871054649353} 11/07/2021 04:29:31 - INFO - __main__ - Step 51565: {'lr': 0.0003739384136488769, 'samples': 9900480, 'steps': 51564, 'loss/train': 1.5967864990234375} 11/07/2021 04:29:32 - INFO - __main__ - Step 51566: {'lr': 0.00037393380490882575, 'samples': 9900672, 'steps': 51565, 'loss/train': 1.0944602489471436} 11/07/2021 04:29:32 - INFO - __main__ - Step 51567: {'lr': 0.0003739291961129319, 'samples': 9900864, 'steps': 51566, 'loss/train': 1.4621555805206299} 11/07/2021 04:29:32 - INFO - __main__ - Step 51568: {'lr': 0.0003739245872611975, 'samples': 9901056, 'steps': 51567, 'loss/train': 1.713440179824829} 11/07/2021 04:29:33 - INFO - __main__ - Step 51569: {'lr': 0.0003739199783536246, 'samples': 9901248, 'steps': 51568, 'loss/train': 1.344717025756836} 11/07/2021 04:29:33 - INFO - __main__ - Step 51570: {'lr': 0.0003739153693902152, 'samples': 9901440, 'steps': 51569, 'loss/train': 1.5534286499023438} 11/07/2021 04:29:34 - INFO - __main__ - Step 51571: {'lr': 0.0003739107603709715, 'samples': 9901632, 'steps': 51570, 'loss/train': 1.600929856300354} 11/07/2021 04:29:35 - INFO - __main__ - Step 51572: {'lr': 0.00037390615129589554, 'samples': 9901824, 'steps': 51571, 'loss/train': 1.3709287643432617} 11/07/2021 04:29:35 - INFO - __main__ - Step 51573: {'lr': 0.00037390154216498933, 'samples': 9902016, 'steps': 51572, 'loss/train': 1.6646630764007568} 11/07/2021 04:29:35 - INFO - __main__ - Step 51574: {'lr': 0.000373896932978255, 'samples': 9902208, 'steps': 51573, 'loss/train': 0.6080346703529358} 11/07/2021 04:29:36 - INFO - __main__ - Step 51575: {'lr': 0.00037389232373569463, 'samples': 9902400, 'steps': 51574, 'loss/train': 1.31934654712677} 11/07/2021 04:29:37 - INFO - __main__ - Step 51576: {'lr': 0.0003738877144373104, 'samples': 9902592, 'steps': 51575, 'loss/train': 1.0798628330230713} 11/07/2021 04:29:37 - INFO - __main__ - Step 51577: {'lr': 0.0003738831050831042, 'samples': 9902784, 'steps': 51576, 'loss/train': 1.7884399890899658} 11/07/2021 04:29:37 - INFO - __main__ - Step 51578: {'lr': 0.0003738784956730781, 'samples': 9902976, 'steps': 51577, 'loss/train': 1.9427136182785034} 11/07/2021 04:29:38 - INFO - __main__ - Step 51579: {'lr': 0.0003738738862072343, 'samples': 9903168, 'steps': 51578, 'loss/train': 1.6550885438919067} 11/07/2021 04:29:38 - INFO - __main__ - Step 51580: {'lr': 0.00037386927668557493, 'samples': 9903360, 'steps': 51579, 'loss/train': 1.5623819828033447} 11/07/2021 04:29:38 - INFO - __main__ - Step 51581: {'lr': 0.0003738646671081019, 'samples': 9903552, 'steps': 51580, 'loss/train': 1.5108397006988525} 11/07/2021 04:29:39 - INFO - __main__ - Step 51582: {'lr': 0.00037386005747481744, 'samples': 9903744, 'steps': 51581, 'loss/train': 1.5365359783172607} 11/07/2021 04:29:40 - INFO - __main__ - Step 51583: {'lr': 0.00037385544778572346, 'samples': 9903936, 'steps': 51582, 'loss/train': 1.7460014820098877} 11/07/2021 04:29:40 - INFO - __main__ - Step 51584: {'lr': 0.00037385083804082213, 'samples': 9904128, 'steps': 51583, 'loss/train': 1.2553999423980713} 11/07/2021 04:29:40 - INFO - __main__ - Step 51585: {'lr': 0.00037384622824011555, 'samples': 9904320, 'steps': 51584, 'loss/train': 0.9383065104484558} 11/07/2021 04:29:41 - INFO - __main__ - Step 51586: {'lr': 0.00037384161838360574, 'samples': 9904512, 'steps': 51585, 'loss/train': 1.4494212865829468} 11/07/2021 04:29:42 - INFO - __main__ - Step 51587: {'lr': 0.00037383700847129487, 'samples': 9904704, 'steps': 51586, 'loss/train': 1.7529298067092896} 11/07/2021 04:29:42 - INFO - __main__ - Step 51588: {'lr': 0.0003738323985031849, 'samples': 9904896, 'steps': 51587, 'loss/train': 1.9423640966415405} 11/07/2021 04:29:43 - INFO - __main__ - Step 51589: {'lr': 0.000373827788479278, 'samples': 9905088, 'steps': 51588, 'loss/train': 1.507910132408142} 11/07/2021 04:29:43 - INFO - __main__ - Step 51590: {'lr': 0.0003738231783995762, 'samples': 9905280, 'steps': 51589, 'loss/train': 1.803034782409668} 11/07/2021 04:29:43 - INFO - __main__ - Step 51591: {'lr': 0.00037381856826408156, 'samples': 9905472, 'steps': 51590, 'loss/train': 1.5332369804382324} 11/07/2021 04:29:45 - INFO - __main__ - Step 51592: {'lr': 0.00037381395807279625, 'samples': 9905664, 'steps': 51591, 'loss/train': 0.134771466255188} 11/07/2021 04:29:45 - INFO - __main__ - Step 51593: {'lr': 0.0003738093478257222, 'samples': 9905856, 'steps': 51592, 'loss/train': 1.1799582242965698} 11/07/2021 04:29:45 - INFO - __main__ - Step 51594: {'lr': 0.0003738047375228616, 'samples': 9906048, 'steps': 51593, 'loss/train': 1.308282732963562} 11/07/2021 04:29:46 - INFO - __main__ - Step 51595: {'lr': 0.00037380012716421647, 'samples': 9906240, 'steps': 51594, 'loss/train': 1.6689704656600952} 11/07/2021 04:29:46 - INFO - __main__ - Step 51596: {'lr': 0.00037379551674978896, 'samples': 9906432, 'steps': 51595, 'loss/train': 1.452520728111267} 11/07/2021 04:29:47 - INFO - __main__ - Step 51597: {'lr': 0.0003737909062795811, 'samples': 9906624, 'steps': 51596, 'loss/train': 1.5214353799819946} 11/07/2021 04:29:47 - INFO - __main__ - Step 51598: {'lr': 0.00037378629575359493, 'samples': 9906816, 'steps': 51597, 'loss/train': 2.1359145641326904} 11/07/2021 04:29:48 - INFO - __main__ - Step 51599: {'lr': 0.0003737816851718326, 'samples': 9907008, 'steps': 51598, 'loss/train': 1.743646502494812} 11/07/2021 04:29:48 - INFO - __main__ - Step 51600: {'lr': 0.0003737770745342961, 'samples': 9907200, 'steps': 51599, 'loss/train': 1.3906631469726562} 11/07/2021 04:29:48 - INFO - __main__ - Step 51601: {'lr': 0.0003737724638409876, 'samples': 9907392, 'steps': 51600, 'loss/train': 1.3511393070220947} 11/07/2021 04:29:49 - INFO - __main__ - Step 51602: {'lr': 0.00037376785309190913, 'samples': 9907584, 'steps': 51601, 'loss/train': 1.3243608474731445} 11/07/2021 04:29:50 - INFO - __main__ - Step 51603: {'lr': 0.0003737632422870628, 'samples': 9907776, 'steps': 51602, 'loss/train': 1.7266608476638794} 11/07/2021 04:29:50 - INFO - __main__ - Step 51604: {'lr': 0.00037375863142645064, 'samples': 9907968, 'steps': 51603, 'loss/train': 1.8057512044906616} 11/07/2021 04:29:50 - INFO - __main__ - Step 51605: {'lr': 0.00037375402051007477, 'samples': 9908160, 'steps': 51604, 'loss/train': 0.5424093008041382} 11/07/2021 04:29:51 - INFO - __main__ - Step 51606: {'lr': 0.00037374940953793724, 'samples': 9908352, 'steps': 51605, 'loss/train': 1.5453418493270874} 11/07/2021 04:29:51 - INFO - __main__ - Step 51607: {'lr': 0.00037374479851004006, 'samples': 9908544, 'steps': 51606, 'loss/train': 1.4249101877212524} 11/07/2021 04:29:52 - INFO - __main__ - Step 51608: {'lr': 0.0003737401874263855, 'samples': 9908736, 'steps': 51607, 'loss/train': 1.4591652154922485} 11/07/2021 04:29:53 - INFO - __main__ - Step 51609: {'lr': 0.0003737355762869755, 'samples': 9908928, 'steps': 51608, 'loss/train': 1.513169765472412} 11/07/2021 04:29:53 - INFO - __main__ - Step 51610: {'lr': 0.0003737309650918121, 'samples': 9909120, 'steps': 51609, 'loss/train': 1.3394849300384521} 11/07/2021 04:29:53 - INFO - __main__ - Step 51611: {'lr': 0.0003737263538408975, 'samples': 9909312, 'steps': 51610, 'loss/train': 1.6213732957839966} 11/07/2021 04:29:54 - INFO - __main__ - Step 51612: {'lr': 0.0003737217425342336, 'samples': 9909504, 'steps': 51611, 'loss/train': 1.1712919473648071} 11/07/2021 04:29:55 - INFO - __main__ - Step 51613: {'lr': 0.0003737171311718227, 'samples': 9909696, 'steps': 51612, 'loss/train': 0.9208439588546753} 11/07/2021 04:29:55 - INFO - __main__ - Step 51614: {'lr': 0.0003737125197536667, 'samples': 9909888, 'steps': 51613, 'loss/train': 1.4358117580413818} 11/07/2021 04:29:55 - INFO - __main__ - Step 51615: {'lr': 0.0003737079082797678, 'samples': 9910080, 'steps': 51614, 'loss/train': 1.3692514896392822} 11/07/2021 04:29:56 - INFO - __main__ - Step 51616: {'lr': 0.000373703296750128, 'samples': 9910272, 'steps': 51615, 'loss/train': 1.4786714315414429} 11/07/2021 04:29:56 - INFO - __main__ - Step 51617: {'lr': 0.0003736986851647495, 'samples': 9910464, 'steps': 51616, 'loss/train': 1.227534294128418} 11/07/2021 04:29:57 - INFO - __main__ - Step 51618: {'lr': 0.00037369407352363417, 'samples': 9910656, 'steps': 51617, 'loss/train': 1.4258657693862915} 11/07/2021 04:29:57 - INFO - __main__ - Step 51619: {'lr': 0.0003736894618267842, 'samples': 9910848, 'steps': 51618, 'loss/train': 1.7483607530593872} 11/07/2021 04:29:58 - INFO - __main__ - Step 51620: {'lr': 0.0003736848500742017, 'samples': 9911040, 'steps': 51619, 'loss/train': 1.0665727853775024} 11/07/2021 04:29:58 - INFO - __main__ - Step 51621: {'lr': 0.0003736802382658887, 'samples': 9911232, 'steps': 51620, 'loss/train': 1.5844364166259766} 11/07/2021 04:29:58 - INFO - __main__ - Step 51622: {'lr': 0.00037367562640184735, 'samples': 9911424, 'steps': 51621, 'loss/train': 1.685865044593811} 11/07/2021 04:29:59 - INFO - __main__ - Step 51623: {'lr': 0.0003736710144820796, 'samples': 9911616, 'steps': 51622, 'loss/train': 2.9197614192962646} 11/07/2021 04:30:00 - INFO - __main__ - Step 51624: {'lr': 0.00037366640250658767, 'samples': 9911808, 'steps': 51623, 'loss/train': 1.7044905424118042} 11/07/2021 04:30:00 - INFO - __main__ - Step 51625: {'lr': 0.00037366179047537354, 'samples': 9912000, 'steps': 51624, 'loss/train': 1.2015422582626343} 11/07/2021 04:30:01 - INFO - __main__ - Step 51626: {'lr': 0.0003736571783884393, 'samples': 9912192, 'steps': 51625, 'loss/train': 1.9773355722427368} 11/07/2021 04:30:01 - INFO - __main__ - Step 51627: {'lr': 0.00037365256624578695, 'samples': 9912384, 'steps': 51626, 'loss/train': 1.3225902318954468} 11/07/2021 04:30:02 - INFO - __main__ - Step 51628: {'lr': 0.0003736479540474188, 'samples': 9912576, 'steps': 51627, 'loss/train': 0.1793370544910431} 11/07/2021 04:30:02 - INFO - __main__ - Step 51629: {'lr': 0.00037364334179333674, 'samples': 9912768, 'steps': 51628, 'loss/train': 1.239203929901123} 11/07/2021 04:30:03 - INFO - __main__ - Step 51630: {'lr': 0.00037363872948354294, 'samples': 9912960, 'steps': 51629, 'loss/train': 1.4803258180618286} 11/07/2021 04:30:03 - INFO - __main__ - Step 51631: {'lr': 0.00037363411711803935, 'samples': 9913152, 'steps': 51630, 'loss/train': 1.6821731328964233} 11/07/2021 04:30:03 - INFO - __main__ - Step 51632: {'lr': 0.0003736295046968282, 'samples': 9913344, 'steps': 51631, 'loss/train': 1.213153600692749} 11/07/2021 04:30:04 - INFO - __main__ - Step 51633: {'lr': 0.0003736248922199115, 'samples': 9913536, 'steps': 51632, 'loss/train': 1.9221079349517822} 11/07/2021 04:30:05 - INFO - __main__ - Step 51634: {'lr': 0.0003736202796872913, 'samples': 9913728, 'steps': 51633, 'loss/train': 1.223962426185608} 11/07/2021 04:30:05 - INFO - __main__ - Step 51635: {'lr': 0.00037361566709896964, 'samples': 9913920, 'steps': 51634, 'loss/train': 1.9123646020889282} 11/07/2021 04:30:05 - INFO - __main__ - Step 51636: {'lr': 0.00037361105445494884, 'samples': 9914112, 'steps': 51635, 'loss/train': 1.6113739013671875} 11/07/2021 04:30:06 - INFO - __main__ - Step 51637: {'lr': 0.0003736064417552307, 'samples': 9914304, 'steps': 51636, 'loss/train': 1.8846906423568726} 11/07/2021 04:30:06 - INFO - __main__ - Step 51638: {'lr': 0.0003736018289998174, 'samples': 9914496, 'steps': 51637, 'loss/train': 2.0408263206481934} 11/07/2021 04:30:07 - INFO - __main__ - Step 51639: {'lr': 0.00037359721618871107, 'samples': 9914688, 'steps': 51638, 'loss/train': 0.6913129091262817} 11/07/2021 04:30:07 - INFO - __main__ - Step 51640: {'lr': 0.0003735926033219137, 'samples': 9914880, 'steps': 51639, 'loss/train': 1.5510092973709106} 11/07/2021 04:30:08 - INFO - __main__ - Step 51641: {'lr': 0.00037358799039942744, 'samples': 9915072, 'steps': 51640, 'loss/train': 1.5725252628326416} 11/07/2021 04:30:08 - INFO - __main__ - Step 51642: {'lr': 0.00037358337742125433, 'samples': 9915264, 'steps': 51641, 'loss/train': 1.0196442604064941} 11/07/2021 04:30:08 - INFO - __main__ - Step 51643: {'lr': 0.0003735787643873965, 'samples': 9915456, 'steps': 51642, 'loss/train': 0.9055829048156738} 11/07/2021 04:30:10 - INFO - __main__ - Step 51644: {'lr': 0.00037357415129785586, 'samples': 9915648, 'steps': 51643, 'loss/train': 0.6788956522941589} 11/07/2021 04:30:10 - INFO - __main__ - Step 51645: {'lr': 0.00037356953815263473, 'samples': 9915840, 'steps': 51644, 'loss/train': 1.7450796365737915} 11/07/2021 04:30:10 - INFO - __main__ - Step 51646: {'lr': 0.00037356492495173505, 'samples': 9916032, 'steps': 51645, 'loss/train': 1.5990597009658813} 11/07/2021 04:30:11 - INFO - __main__ - Step 51647: {'lr': 0.00037356031169515894, 'samples': 9916224, 'steps': 51646, 'loss/train': 1.8649356365203857} 11/07/2021 04:30:11 - INFO - __main__ - Step 51648: {'lr': 0.0003735556983829084, 'samples': 9916416, 'steps': 51647, 'loss/train': 2.3101606369018555} 11/07/2021 04:30:12 - INFO - __main__ - Step 51649: {'lr': 0.00037355108501498557, 'samples': 9916608, 'steps': 51648, 'loss/train': 0.7279643416404724} 11/07/2021 04:30:12 - INFO - __main__ - Step 51650: {'lr': 0.0003735464715913926, 'samples': 9916800, 'steps': 51649, 'loss/train': 1.4655834436416626} 11/07/2021 04:30:13 - INFO - __main__ - Step 51651: {'lr': 0.00037354185811213145, 'samples': 9916992, 'steps': 51650, 'loss/train': 1.3757160902023315} 11/07/2021 04:30:13 - INFO - __main__ - Step 51652: {'lr': 0.0003735372445772042, 'samples': 9917184, 'steps': 51651, 'loss/train': 1.4900068044662476} 11/07/2021 04:30:13 - INFO - __main__ - Step 51653: {'lr': 0.00037353263098661304, 'samples': 9917376, 'steps': 51652, 'loss/train': 1.225886583328247} 11/07/2021 04:30:15 - INFO - __main__ - Step 51654: {'lr': 0.00037352801734036, 'samples': 9917568, 'steps': 51653, 'loss/train': 1.6471189260482788} 11/07/2021 04:30:15 - INFO - __main__ - Step 51655: {'lr': 0.00037352340363844706, 'samples': 9917760, 'steps': 51654, 'loss/train': 1.3263717889785767} 11/07/2021 04:30:15 - INFO - __main__ - Step 51656: {'lr': 0.00037351878988087646, 'samples': 9917952, 'steps': 51655, 'loss/train': 1.8218783140182495} 11/07/2021 04:30:16 - INFO - __main__ - Step 51657: {'lr': 0.0003735141760676501, 'samples': 9918144, 'steps': 51656, 'loss/train': 1.5391809940338135} 11/07/2021 04:30:16 - INFO - __main__ - Step 51658: {'lr': 0.0003735095621987703, 'samples': 9918336, 'steps': 51657, 'loss/train': 1.4478362798690796} 11/07/2021 04:30:17 - INFO - __main__ - Step 51659: {'lr': 0.00037350494827423884, 'samples': 9918528, 'steps': 51658, 'loss/train': 1.8777034282684326} 11/07/2021 04:30:17 - INFO - __main__ - Step 51660: {'lr': 0.00037350033429405806, 'samples': 9918720, 'steps': 51659, 'loss/train': 1.7261641025543213} 11/07/2021 04:30:18 - INFO - __main__ - Step 51661: {'lr': 0.0003734957202582299, 'samples': 9918912, 'steps': 51660, 'loss/train': 1.31928551197052} 11/07/2021 04:30:18 - INFO - __main__ - Step 51662: {'lr': 0.00037349110616675653, 'samples': 9919104, 'steps': 51661, 'loss/train': 1.650587558746338} 11/07/2021 04:30:18 - INFO - __main__ - Step 51663: {'lr': 0.0003734864920196399, 'samples': 9919296, 'steps': 51662, 'loss/train': 1.3911428451538086} 11/07/2021 04:30:19 - INFO - __main__ - Step 51664: {'lr': 0.0003734818778168823, 'samples': 9919488, 'steps': 51663, 'loss/train': 1.6656628847122192} 11/07/2021 04:30:20 - INFO - __main__ - Step 51665: {'lr': 0.0003734772635584855, 'samples': 9919680, 'steps': 51664, 'loss/train': 1.6247785091400146} 11/07/2021 04:30:20 - INFO - __main__ - Step 51666: {'lr': 0.0003734726492444518, 'samples': 9919872, 'steps': 51665, 'loss/train': 1.460745930671692} 11/07/2021 04:30:20 - INFO - __main__ - Step 51667: {'lr': 0.00037346803487478325, 'samples': 9920064, 'steps': 51666, 'loss/train': 1.350220799446106} 11/07/2021 04:30:21 - INFO - __main__ - Step 51668: {'lr': 0.0003734634204494819, 'samples': 9920256, 'steps': 51667, 'loss/train': 1.5722851753234863} 11/07/2021 04:30:21 - INFO - __main__ - Step 51669: {'lr': 0.0003734588059685499, 'samples': 9920448, 'steps': 51668, 'loss/train': 1.341074824333191} 11/07/2021 04:30:22 - INFO - __main__ - Step 51670: {'lr': 0.0003734541914319892, 'samples': 9920640, 'steps': 51669, 'loss/train': 2.1882638931274414} 11/07/2021 04:30:22 - INFO - __main__ - Step 51671: {'lr': 0.0003734495768398019, 'samples': 9920832, 'steps': 51670, 'loss/train': 1.5213651657104492} 11/07/2021 04:30:23 - INFO - __main__ - Step 51672: {'lr': 0.00037344496219199016, 'samples': 9921024, 'steps': 51671, 'loss/train': 1.61359703540802} 11/07/2021 04:30:23 - INFO - __main__ - Step 51673: {'lr': 0.0003734403474885561, 'samples': 9921216, 'steps': 51672, 'loss/train': 1.300546646118164} 11/07/2021 04:30:24 - INFO - __main__ - Step 51674: {'lr': 0.00037343573272950167, 'samples': 9921408, 'steps': 51673, 'loss/train': 1.4245498180389404} 11/07/2021 04:30:25 - INFO - __main__ - Step 51675: {'lr': 0.00037343111791482897, 'samples': 9921600, 'steps': 51674, 'loss/train': 1.6947277784347534} 11/07/2021 04:30:25 - INFO - __main__ - Step 51676: {'lr': 0.0003734265030445401, 'samples': 9921792, 'steps': 51675, 'loss/train': 1.4293681383132935} 11/07/2021 04:30:25 - INFO - __main__ - Step 51677: {'lr': 0.0003734218881186372, 'samples': 9921984, 'steps': 51676, 'loss/train': 1.6062393188476562} 11/07/2021 04:30:26 - INFO - __main__ - Step 51678: {'lr': 0.00037341727313712237, 'samples': 9922176, 'steps': 51677, 'loss/train': 1.2721912860870361} 11/07/2021 04:30:26 - INFO - __main__ - Step 51679: {'lr': 0.0003734126580999975, 'samples': 9922368, 'steps': 51678, 'loss/train': 1.5690890550613403} 11/07/2021 04:30:27 - INFO - __main__ - Step 51680: {'lr': 0.0003734080430072649, 'samples': 9922560, 'steps': 51679, 'loss/train': 1.523789882659912} 11/07/2021 04:30:27 - INFO - __main__ - Step 51681: {'lr': 0.0003734034278589265, 'samples': 9922752, 'steps': 51680, 'loss/train': 1.4328149557113647} 11/07/2021 04:30:28 - INFO - __main__ - Step 51682: {'lr': 0.0003733988126549843, 'samples': 9922944, 'steps': 51681, 'loss/train': 1.4673817157745361} 11/07/2021 04:30:28 - INFO - __main__ - Step 51683: {'lr': 0.0003733941973954407, 'samples': 9923136, 'steps': 51682, 'loss/train': 1.2372491359710693} 11/07/2021 04:30:28 - INFO - __main__ - Step 51684: {'lr': 0.00037338958208029744, 'samples': 9923328, 'steps': 51683, 'loss/train': 1.281899333000183} 11/07/2021 04:30:30 - INFO - __main__ - Step 51685: {'lr': 0.0003733849667095568, 'samples': 9923520, 'steps': 51684, 'loss/train': 1.811161756515503} 11/07/2021 04:30:30 - INFO - __main__ - Step 51686: {'lr': 0.00037338035128322075, 'samples': 9923712, 'steps': 51685, 'loss/train': 1.7999763488769531} 11/07/2021 04:30:30 - INFO - __main__ - Step 51687: {'lr': 0.00037337573580129143, 'samples': 9923904, 'steps': 51686, 'loss/train': 0.15734606981277466} 11/07/2021 04:30:31 - INFO - __main__ - Step 51688: {'lr': 0.0003733711202637709, 'samples': 9924096, 'steps': 51687, 'loss/train': 2.9371252059936523} 11/07/2021 04:30:31 - INFO - __main__ - Step 51689: {'lr': 0.00037336650467066125, 'samples': 9924288, 'steps': 51688, 'loss/train': 1.6733360290527344} 11/07/2021 04:30:31 - INFO - __main__ - Step 51690: {'lr': 0.0003733618890219646, 'samples': 9924480, 'steps': 51689, 'loss/train': 1.1849936246871948} 11/07/2021 04:30:33 - INFO - __main__ - Step 51691: {'lr': 0.000373357273317683, 'samples': 9924672, 'steps': 51690, 'loss/train': 5.7699737548828125} 11/07/2021 04:30:33 - INFO - __main__ - Step 51692: {'lr': 0.00037335265755781844, 'samples': 9924864, 'steps': 51691, 'loss/train': 1.5762732028961182} 11/07/2021 04:30:33 - INFO - __main__ - Step 51693: {'lr': 0.00037334804174237314, 'samples': 9925056, 'steps': 51692, 'loss/train': 0.768830418586731} 11/07/2021 04:30:34 - INFO - __main__ - Step 51694: {'lr': 0.0003733434258713491, 'samples': 9925248, 'steps': 51693, 'loss/train': 1.4546159505844116} 11/07/2021 04:30:34 - INFO - __main__ - Step 51695: {'lr': 0.00037333880994474834, 'samples': 9925440, 'steps': 51694, 'loss/train': 0.9135294556617737} 11/07/2021 04:30:35 - INFO - __main__ - Step 51696: {'lr': 0.00037333419396257307, 'samples': 9925632, 'steps': 51695, 'loss/train': 1.7703784704208374} 11/07/2021 04:30:36 - INFO - __main__ - Step 51697: {'lr': 0.00037332957792482534, 'samples': 9925824, 'steps': 51696, 'loss/train': 1.1710397005081177} 11/07/2021 04:30:36 - INFO - __main__ - Step 51698: {'lr': 0.0003733249618315072, 'samples': 9926016, 'steps': 51697, 'loss/train': 1.419105052947998} 11/07/2021 04:30:36 - INFO - __main__ - Step 51699: {'lr': 0.0003733203456826207, 'samples': 9926208, 'steps': 51698, 'loss/train': 1.4877972602844238} 11/07/2021 04:30:37 - INFO - __main__ - Step 51700: {'lr': 0.000373315729478168, 'samples': 9926400, 'steps': 51699, 'loss/train': 1.1597901582717896} 11/07/2021 04:30:37 - INFO - __main__ - Step 51701: {'lr': 0.0003733111132181511, 'samples': 9926592, 'steps': 51700, 'loss/train': 1.6634995937347412} 11/07/2021 04:30:38 - INFO - __main__ - Step 51702: {'lr': 0.0003733064969025721, 'samples': 9926784, 'steps': 51701, 'loss/train': 1.2895931005477905} 11/07/2021 04:30:38 - INFO - __main__ - Step 51703: {'lr': 0.00037330188053143323, 'samples': 9926976, 'steps': 51702, 'loss/train': 1.1735255718231201} 11/07/2021 04:30:39 - INFO - __main__ - Step 51704: {'lr': 0.0003732972641047363, 'samples': 9927168, 'steps': 51703, 'loss/train': 1.7416150569915771} 11/07/2021 04:30:39 - INFO - __main__ - Step 51705: {'lr': 0.0003732926476224835, 'samples': 9927360, 'steps': 51704, 'loss/train': 1.1643649339675903} 11/07/2021 04:30:39 - INFO - __main__ - Step 51706: {'lr': 0.00037328803108467704, 'samples': 9927552, 'steps': 51705, 'loss/train': 1.7630983591079712} 11/07/2021 04:30:41 - INFO - __main__ - Step 51707: {'lr': 0.0003732834144913188, 'samples': 9927744, 'steps': 51706, 'loss/train': 1.2046449184417725} 11/07/2021 04:30:41 - INFO - __main__ - Step 51708: {'lr': 0.00037327879784241095, 'samples': 9927936, 'steps': 51707, 'loss/train': 1.187599778175354} 11/07/2021 04:30:41 - INFO - __main__ - Step 51709: {'lr': 0.00037327418113795565, 'samples': 9928128, 'steps': 51708, 'loss/train': 1.437038779258728} 11/07/2021 04:30:42 - INFO - __main__ - Step 51710: {'lr': 0.0003732695643779549, 'samples': 9928320, 'steps': 51709, 'loss/train': 1.5829737186431885} 11/07/2021 04:30:42 - INFO - __main__ - Step 51711: {'lr': 0.0003732649475624108, 'samples': 9928512, 'steps': 51710, 'loss/train': 1.6034432649612427} 11/07/2021 04:30:43 - INFO - __main__ - Step 51712: {'lr': 0.0003732603306913254, 'samples': 9928704, 'steps': 51711, 'loss/train': 1.2415457963943481} 11/07/2021 04:30:44 - INFO - __main__ - Step 51713: {'lr': 0.00037325571376470074, 'samples': 9928896, 'steps': 51712, 'loss/train': 1.7289055585861206} 11/07/2021 04:30:44 - INFO - __main__ - Step 51714: {'lr': 0.00037325109678253897, 'samples': 9929088, 'steps': 51713, 'loss/train': 1.754821538925171} 11/07/2021 04:30:45 - INFO - __main__ - Step 51715: {'lr': 0.0003732464797448422, 'samples': 9929280, 'steps': 51714, 'loss/train': 1.5865949392318726} 11/07/2021 04:30:45 - INFO - __main__ - Step 51716: {'lr': 0.0003732418626516125, 'samples': 9929472, 'steps': 51715, 'loss/train': 1.9165054559707642} 11/07/2021 04:30:47 - INFO - __main__ - Step 51717: {'lr': 0.0003732372455028519, 'samples': 9929664, 'steps': 51716, 'loss/train': 0.8452095985412598} 11/07/2021 04:30:47 - INFO - __main__ - Step 51718: {'lr': 0.00037323262829856246, 'samples': 9929856, 'steps': 51717, 'loss/train': 1.0609043836593628} 11/07/2021 04:30:47 - INFO - __main__ - Step 51719: {'lr': 0.00037322801103874633, 'samples': 9930048, 'steps': 51718, 'loss/train': 2.9352524280548096} 11/07/2021 04:30:48 - INFO - __main__ - Step 51720: {'lr': 0.00037322339372340555, 'samples': 9930240, 'steps': 51719, 'loss/train': 1.8885968923568726} 11/07/2021 04:30:48 - INFO - __main__ - Step 51721: {'lr': 0.0003732187763525421, 'samples': 9930432, 'steps': 51720, 'loss/train': 1.7703211307525635} 11/07/2021 04:30:48 - INFO - __main__ - Step 51722: {'lr': 0.00037321415892615833, 'samples': 9930624, 'steps': 51721, 'loss/train': 0.9961236715316772} 11/07/2021 04:30:49 - INFO - __main__ - Step 51723: {'lr': 0.0003732095414442561, 'samples': 9930816, 'steps': 51722, 'loss/train': 1.6001304388046265} 11/07/2021 04:30:50 - INFO - __main__ - Step 51724: {'lr': 0.00037320492390683756, 'samples': 9931008, 'steps': 51723, 'loss/train': 1.4640626907348633} 11/07/2021 04:30:50 - INFO - __main__ - Step 51725: {'lr': 0.00037320030631390476, 'samples': 9931200, 'steps': 51724, 'loss/train': 1.0389316082000732} 11/07/2021 04:30:51 - INFO - __main__ - Step 51726: {'lr': 0.00037319568866545983, 'samples': 9931392, 'steps': 51725, 'loss/train': 0.3944765627384186} 11/07/2021 04:30:51 - INFO - __main__ - Step 51727: {'lr': 0.00037319107096150483, 'samples': 9931584, 'steps': 51726, 'loss/train': 1.802032470703125} 11/07/2021 04:30:51 - INFO - __main__ - Step 51728: {'lr': 0.00037318645320204183, 'samples': 9931776, 'steps': 51727, 'loss/train': 1.6470043659210205} 11/07/2021 04:30:52 - INFO - __main__ - Step 51729: {'lr': 0.0003731818353870729, 'samples': 9931968, 'steps': 51728, 'loss/train': 1.5274968147277832} 11/07/2021 04:30:53 - INFO - __main__ - Step 51730: {'lr': 0.00037317721751660014, 'samples': 9932160, 'steps': 51729, 'loss/train': 1.6125394105911255} 11/07/2021 04:30:53 - INFO - __main__ - Step 51731: {'lr': 0.00037317259959062564, 'samples': 9932352, 'steps': 51730, 'loss/train': 1.8147977590560913} 11/07/2021 04:30:53 - INFO - __main__ - Step 51732: {'lr': 0.0003731679816091514, 'samples': 9932544, 'steps': 51731, 'loss/train': 1.1446830034255981} 11/07/2021 04:30:54 - INFO - __main__ - Step 51733: {'lr': 0.00037316336357217966, 'samples': 9932736, 'steps': 51732, 'loss/train': 1.0742896795272827} 11/07/2021 04:30:55 - INFO - __main__ - Step 51734: {'lr': 0.0003731587454797124, 'samples': 9932928, 'steps': 51733, 'loss/train': 1.5206941366195679} 11/07/2021 04:30:55 - INFO - __main__ - Step 51735: {'lr': 0.0003731541273317517, 'samples': 9933120, 'steps': 51734, 'loss/train': 1.9338865280151367} 11/07/2021 04:30:55 - INFO - __main__ - Step 51736: {'lr': 0.0003731495091282996, 'samples': 9933312, 'steps': 51735, 'loss/train': 1.4734269380569458} 11/07/2021 04:30:56 - INFO - __main__ - Step 51737: {'lr': 0.0003731448908693583, 'samples': 9933504, 'steps': 51736, 'loss/train': 1.5363223552703857} 11/07/2021 04:30:56 - INFO - __main__ - Step 51738: {'lr': 0.0003731402725549298, 'samples': 9933696, 'steps': 51737, 'loss/train': 1.4811780452728271} 11/07/2021 04:30:57 - INFO - __main__ - Step 51739: {'lr': 0.0003731356541850162, 'samples': 9933888, 'steps': 51738, 'loss/train': 3.218954086303711} 11/07/2021 04:30:58 - INFO - __main__ - Step 51740: {'lr': 0.0003731310357596195, 'samples': 9934080, 'steps': 51739, 'loss/train': 1.2267725467681885} 11/07/2021 04:30:58 - INFO - __main__ - Step 51741: {'lr': 0.0003731264172787419, 'samples': 9934272, 'steps': 51740, 'loss/train': 1.9757555723190308} 11/07/2021 04:30:58 - INFO - __main__ - Step 51742: {'lr': 0.0003731217987423854, 'samples': 9934464, 'steps': 51741, 'loss/train': 1.652662754058838} 11/07/2021 04:30:59 - INFO - __main__ - Step 51743: {'lr': 0.00037311718015055215, 'samples': 9934656, 'steps': 51742, 'loss/train': 1.73580002784729} 11/07/2021 04:30:59 - INFO - __main__ - Step 51744: {'lr': 0.0003731125615032442, 'samples': 9934848, 'steps': 51743, 'loss/train': 1.3464809656143188} 11/07/2021 04:31:00 - INFO - __main__ - Step 51745: {'lr': 0.0003731079428004637, 'samples': 9935040, 'steps': 51744, 'loss/train': 1.7845275402069092} 11/07/2021 04:31:00 - INFO - __main__ - Step 51746: {'lr': 0.00037310332404221256, 'samples': 9935232, 'steps': 51745, 'loss/train': 2.0145421028137207} 11/07/2021 04:31:01 - INFO - __main__ - Step 51747: {'lr': 0.000373098705228493, 'samples': 9935424, 'steps': 51746, 'loss/train': 1.3998632431030273} 11/07/2021 04:31:01 - INFO - __main__ - Step 51748: {'lr': 0.00037309408635930705, 'samples': 9935616, 'steps': 51747, 'loss/train': 1.2065773010253906} 11/07/2021 04:31:01 - INFO - __main__ - Step 51749: {'lr': 0.0003730894674346568, 'samples': 9935808, 'steps': 51748, 'loss/train': 1.1705305576324463} 11/07/2021 04:31:02 - INFO - __main__ - Step 51750: {'lr': 0.00037308484845454434, 'samples': 9936000, 'steps': 51749, 'loss/train': 1.5436286926269531} 11/07/2021 04:31:03 - INFO - __main__ - Step 51751: {'lr': 0.0003730802294189718, 'samples': 9936192, 'steps': 51750, 'loss/train': 1.499884843826294} 11/07/2021 04:31:03 - INFO - __main__ - Step 51752: {'lr': 0.00037307561032794113, 'samples': 9936384, 'steps': 51751, 'loss/train': 1.10182785987854} 11/07/2021 04:31:03 - INFO - __main__ - Step 51753: {'lr': 0.0003730709911814545, 'samples': 9936576, 'steps': 51752, 'loss/train': 1.6749976873397827} 11/07/2021 04:31:04 - INFO - __main__ - Step 51754: {'lr': 0.000373066371979514, 'samples': 9936768, 'steps': 51753, 'loss/train': 1.492154598236084} 11/07/2021 04:31:05 - INFO - __main__ - Step 51755: {'lr': 0.00037306175272212166, 'samples': 9936960, 'steps': 51754, 'loss/train': 1.5916110277175903} 11/07/2021 04:31:05 - INFO - __main__ - Step 51756: {'lr': 0.0003730571334092796, 'samples': 9937152, 'steps': 51755, 'loss/train': 1.5051785707473755} 11/07/2021 04:31:05 - INFO - __main__ - Step 51757: {'lr': 0.00037305251404099, 'samples': 9937344, 'steps': 51756, 'loss/train': 1.2896778583526611} 11/07/2021 04:31:06 - INFO - __main__ - Step 51758: {'lr': 0.00037304789461725473, 'samples': 9937536, 'steps': 51757, 'loss/train': 1.5637562274932861} 11/07/2021 04:31:06 - INFO - __main__ - Step 51759: {'lr': 0.000373043275138076, 'samples': 9937728, 'steps': 51758, 'loss/train': 1.2381839752197266} 11/07/2021 04:31:07 - INFO - __main__ - Step 51760: {'lr': 0.00037303865560345587, 'samples': 9937920, 'steps': 51759, 'loss/train': 1.8927297592163086} 11/07/2021 04:31:08 - INFO - __main__ - Step 51761: {'lr': 0.00037303403601339643, 'samples': 9938112, 'steps': 51760, 'loss/train': 1.7042573690414429} 11/07/2021 04:31:08 - INFO - __main__ - Step 51762: {'lr': 0.0003730294163678997, 'samples': 9938304, 'steps': 51761, 'loss/train': 1.8782144784927368} 11/07/2021 04:31:08 - INFO - __main__ - Step 51763: {'lr': 0.00037302479666696787, 'samples': 9938496, 'steps': 51762, 'loss/train': 1.4256799221038818} 11/07/2021 04:31:09 - INFO - __main__ - Step 51764: {'lr': 0.000373020176910603, 'samples': 9938688, 'steps': 51763, 'loss/train': 0.7905405759811401} 11/07/2021 04:31:09 - INFO - __main__ - Step 51765: {'lr': 0.00037301555709880706, 'samples': 9938880, 'steps': 51764, 'loss/train': 1.828656554222107} 11/07/2021 04:31:10 - INFO - __main__ - Step 51766: {'lr': 0.00037301093723158223, 'samples': 9939072, 'steps': 51765, 'loss/train': 0.8808240294456482} 11/07/2021 04:31:10 - INFO - __main__ - Step 51767: {'lr': 0.0003730063173089306, 'samples': 9939264, 'steps': 51766, 'loss/train': 0.9141917824745178} 11/07/2021 04:31:11 - INFO - __main__ - Step 51768: {'lr': 0.0003730016973308542, 'samples': 9939456, 'steps': 51767, 'loss/train': 1.409717082977295} 11/07/2021 04:31:11 - INFO - __main__ - Step 51769: {'lr': 0.0003729970772973551, 'samples': 9939648, 'steps': 51768, 'loss/train': 1.1435794830322266} 11/07/2021 04:31:11 - INFO - __main__ - Step 51770: {'lr': 0.00037299245720843544, 'samples': 9939840, 'steps': 51769, 'loss/train': 1.2731410264968872} 11/07/2021 04:31:12 - INFO - __main__ - Step 51771: {'lr': 0.0003729878370640973, 'samples': 9940032, 'steps': 51770, 'loss/train': 0.7276386618614197} 11/07/2021 04:31:13 - INFO - __main__ - Step 51772: {'lr': 0.0003729832168643428, 'samples': 9940224, 'steps': 51771, 'loss/train': 1.9008690118789673} 11/07/2021 04:31:13 - INFO - __main__ - Step 51773: {'lr': 0.00037297859660917384, 'samples': 9940416, 'steps': 51772, 'loss/train': 1.7732874155044556} 11/07/2021 04:31:14 - INFO - __main__ - Step 51774: {'lr': 0.00037297397629859266, 'samples': 9940608, 'steps': 51773, 'loss/train': 1.7407798767089844} 11/07/2021 04:31:14 - INFO - __main__ - Step 51775: {'lr': 0.0003729693559326013, 'samples': 9940800, 'steps': 51774, 'loss/train': 2.1384971141815186} 11/07/2021 04:31:15 - INFO - __main__ - Step 51776: {'lr': 0.00037296473551120185, 'samples': 9940992, 'steps': 51775, 'loss/train': 1.0660651922225952} 11/07/2021 04:31:16 - INFO - __main__ - Step 51777: {'lr': 0.00037296011503439643, 'samples': 9941184, 'steps': 51776, 'loss/train': 1.4928719997406006} 11/07/2021 04:31:16 - INFO - __main__ - Step 51778: {'lr': 0.00037295549450218704, 'samples': 9941376, 'steps': 51777, 'loss/train': 0.5091902017593384} 11/07/2021 04:31:17 - INFO - __main__ - Step 51779: {'lr': 0.0003729508739145758, 'samples': 9941568, 'steps': 51778, 'loss/train': 1.3106775283813477} 11/07/2021 04:31:17 - INFO - __main__ - Step 51780: {'lr': 0.0003729462532715648, 'samples': 9941760, 'steps': 51779, 'loss/train': 1.698462963104248} 11/07/2021 04:31:17 - INFO - __main__ - Step 51781: {'lr': 0.0003729416325731561, 'samples': 9941952, 'steps': 51780, 'loss/train': 1.79434335231781} 11/07/2021 04:31:18 - INFO - __main__ - Step 51782: {'lr': 0.0003729370118193518, 'samples': 9942144, 'steps': 51781, 'loss/train': 1.734641671180725} 11/07/2021 04:31:19 - INFO - __main__ - Step 51783: {'lr': 0.00037293239101015397, 'samples': 9942336, 'steps': 51782, 'loss/train': 1.9667565822601318} 11/07/2021 04:31:19 - INFO - __main__ - Step 51784: {'lr': 0.0003729277701455648, 'samples': 9942528, 'steps': 51783, 'loss/train': 1.8873034715652466} 11/07/2021 04:31:19 - INFO - __main__ - Step 51785: {'lr': 0.00037292314922558615, 'samples': 9942720, 'steps': 51784, 'loss/train': 0.9962705373764038} 11/07/2021 04:31:20 - INFO - __main__ - Step 51786: {'lr': 0.0003729185282502203, 'samples': 9942912, 'steps': 51785, 'loss/train': 1.6829118728637695} 11/07/2021 04:31:20 - INFO - __main__ - Step 51787: {'lr': 0.00037291390721946914, 'samples': 9943104, 'steps': 51786, 'loss/train': 1.8479154109954834} 11/07/2021 04:31:21 - INFO - __main__ - Step 51788: {'lr': 0.00037290928613333495, 'samples': 9943296, 'steps': 51787, 'loss/train': 1.1238924264907837} 11/07/2021 04:31:22 - INFO - __main__ - Step 51789: {'lr': 0.00037290466499181977, 'samples': 9943488, 'steps': 51788, 'loss/train': 1.8052947521209717} 11/07/2021 04:31:22 - INFO - __main__ - Step 51790: {'lr': 0.0003729000437949256, 'samples': 9943680, 'steps': 51789, 'loss/train': 5.505935192108154} 11/07/2021 04:31:22 - INFO - __main__ - Step 51791: {'lr': 0.0003728954225426546, 'samples': 9943872, 'steps': 51790, 'loss/train': 5.436152935028076} 11/07/2021 04:31:23 - INFO - __main__ - Step 51792: {'lr': 0.00037289080123500886, 'samples': 9944064, 'steps': 51791, 'loss/train': 1.7828023433685303} 11/07/2021 04:31:23 - INFO - __main__ - Step 51793: {'lr': 0.0003728861798719903, 'samples': 9944256, 'steps': 51792, 'loss/train': 1.127197504043579} 11/07/2021 04:31:24 - INFO - __main__ - Step 51794: {'lr': 0.00037288155845360116, 'samples': 9944448, 'steps': 51793, 'loss/train': 1.2389522790908813} 11/07/2021 04:31:25 - INFO - __main__ - Step 51795: {'lr': 0.00037287693697984355, 'samples': 9944640, 'steps': 51794, 'loss/train': 1.238544225692749} 11/07/2021 04:31:25 - INFO - __main__ - Step 51796: {'lr': 0.0003728723154507195, 'samples': 9944832, 'steps': 51795, 'loss/train': 1.530659556388855} 11/07/2021 04:31:25 - INFO - __main__ - Step 51797: {'lr': 0.000372867693866231, 'samples': 9945024, 'steps': 51796, 'loss/train': 0.9943727254867554} 11/07/2021 04:31:26 - INFO - __main__ - Step 51798: {'lr': 0.0003728630722263803, 'samples': 9945216, 'steps': 51797, 'loss/train': 1.7220540046691895} 11/07/2021 04:31:27 - INFO - __main__ - Step 51799: {'lr': 0.0003728584505311693, 'samples': 9945408, 'steps': 51798, 'loss/train': 1.9942365884780884} 11/07/2021 04:31:27 - INFO - __main__ - Step 51800: {'lr': 0.0003728538287806002, 'samples': 9945600, 'steps': 51799, 'loss/train': 1.9418936967849731} 11/07/2021 04:31:27 - INFO - __main__ - Step 51801: {'lr': 0.00037284920697467505, 'samples': 9945792, 'steps': 51800, 'loss/train': 1.168082356452942} 11/07/2021 04:31:28 - INFO - __main__ - Step 51802: {'lr': 0.00037284458511339604, 'samples': 9945984, 'steps': 51801, 'loss/train': 1.4020640850067139} 11/07/2021 04:31:28 - INFO - __main__ - Step 51803: {'lr': 0.00037283996319676505, 'samples': 9946176, 'steps': 51802, 'loss/train': 1.0403871536254883} 11/07/2021 04:31:29 - INFO - __main__ - Step 51804: {'lr': 0.0003728353412247843, 'samples': 9946368, 'steps': 51803, 'loss/train': 1.0290791988372803} 11/07/2021 04:31:29 - INFO - __main__ - Step 51805: {'lr': 0.0003728307191974558, 'samples': 9946560, 'steps': 51804, 'loss/train': 1.4733210802078247} 11/07/2021 04:31:30 - INFO - __main__ - Step 51806: {'lr': 0.00037282609711478175, 'samples': 9946752, 'steps': 51805, 'loss/train': 1.1654385328292847} 11/07/2021 04:31:30 - INFO - __main__ - Step 51807: {'lr': 0.00037282147497676415, 'samples': 9946944, 'steps': 51806, 'loss/train': 1.5371730327606201} 11/07/2021 04:31:30 - INFO - __main__ - Step 51808: {'lr': 0.000372816852783405, 'samples': 9947136, 'steps': 51807, 'loss/train': 1.8102275133132935} 11/07/2021 04:31:31 - INFO - __main__ - Step 51809: {'lr': 0.0003728122305347066, 'samples': 9947328, 'steps': 51808, 'loss/train': 1.6196417808532715} 11/07/2021 04:31:33 - INFO - __main__ - Step 51810: {'lr': 0.00037280760823067086, 'samples': 9947520, 'steps': 51809, 'loss/train': 1.4983396530151367} 11/07/2021 04:31:33 - INFO - __main__ - Step 51811: {'lr': 0.00037280298587129984, 'samples': 9947712, 'steps': 51810, 'loss/train': 1.291886329650879} 11/07/2021 04:31:33 - INFO - __main__ - Step 51812: {'lr': 0.0003727983634565958, 'samples': 9947904, 'steps': 51811, 'loss/train': 2.8370022773742676} 11/07/2021 04:31:34 - INFO - __main__ - Step 51813: {'lr': 0.0003727937409865606, 'samples': 9948096, 'steps': 51812, 'loss/train': 2.879993438720703} 11/07/2021 04:31:34 - INFO - __main__ - Step 51814: {'lr': 0.0003727891184611965, 'samples': 9948288, 'steps': 51813, 'loss/train': 1.6335194110870361} 11/07/2021 04:31:34 - INFO - __main__ - Step 51815: {'lr': 0.0003727844958805055, 'samples': 9948480, 'steps': 51814, 'loss/train': 1.2594209909439087} 11/07/2021 04:31:35 - INFO - __main__ - Step 51816: {'lr': 0.0003727798732444897, 'samples': 9948672, 'steps': 51815, 'loss/train': 1.3646434545516968} 11/07/2021 04:31:36 - INFO - __main__ - Step 51817: {'lr': 0.00037277525055315114, 'samples': 9948864, 'steps': 51816, 'loss/train': 1.7088550329208374} 11/07/2021 04:31:36 - INFO - __main__ - Step 51818: {'lr': 0.0003727706278064921, 'samples': 9949056, 'steps': 51817, 'loss/train': 1.1961368322372437} 11/07/2021 04:31:36 - INFO - __main__ - Step 51819: {'lr': 0.00037276600500451434, 'samples': 9949248, 'steps': 51818, 'loss/train': 0.8588838577270508} 11/07/2021 04:31:37 - INFO - __main__ - Step 51820: {'lr': 0.00037276138214722016, 'samples': 9949440, 'steps': 51819, 'loss/train': 1.0528807640075684} 11/07/2021 04:31:38 - INFO - __main__ - Step 51821: {'lr': 0.0003727567592346116, 'samples': 9949632, 'steps': 51820, 'loss/train': 1.587601900100708} 11/07/2021 04:31:38 - INFO - __main__ - Step 51822: {'lr': 0.00037275213626669076, 'samples': 9949824, 'steps': 51821, 'loss/train': 1.6551111936569214} 11/07/2021 04:31:39 - INFO - __main__ - Step 51823: {'lr': 0.00037274751324345966, 'samples': 9950016, 'steps': 51822, 'loss/train': 1.622349739074707} 11/07/2021 04:31:39 - INFO - __main__ - Step 51824: {'lr': 0.0003727428901649205, 'samples': 9950208, 'steps': 51823, 'loss/train': 2.177377939224243} 11/07/2021 04:31:39 - INFO - __main__ - Step 51825: {'lr': 0.00037273826703107527, 'samples': 9950400, 'steps': 51824, 'loss/train': 1.2760615348815918} 11/07/2021 04:31:40 - INFO - __main__ - Step 51826: {'lr': 0.000372733643841926, 'samples': 9950592, 'steps': 51825, 'loss/train': 1.5299488306045532} 11/07/2021 04:31:41 - INFO - __main__ - Step 51827: {'lr': 0.00037272902059747487, 'samples': 9950784, 'steps': 51826, 'loss/train': 1.2025662660598755} 11/07/2021 04:31:41 - INFO - __main__ - Step 51828: {'lr': 0.00037272439729772397, 'samples': 9950976, 'steps': 51827, 'loss/train': 1.5960502624511719} 11/07/2021 04:31:41 - INFO - __main__ - Step 51829: {'lr': 0.00037271977394267534, 'samples': 9951168, 'steps': 51828, 'loss/train': 1.3377803564071655} 11/07/2021 04:31:42 - INFO - __main__ - Step 51830: {'lr': 0.0003727151505323311, 'samples': 9951360, 'steps': 51829, 'loss/train': 0.9705298542976379} 11/07/2021 04:31:43 - INFO - __main__ - Step 51831: {'lr': 0.0003727105270666933, 'samples': 9951552, 'steps': 51830, 'loss/train': 1.0445075035095215} 11/07/2021 04:31:43 - INFO - __main__ - Step 51832: {'lr': 0.00037270590354576396, 'samples': 9951744, 'steps': 51831, 'loss/train': 1.5251946449279785} 11/07/2021 04:31:43 - INFO - __main__ - Step 51833: {'lr': 0.0003727012799695453, 'samples': 9951936, 'steps': 51832, 'loss/train': 1.1685718297958374} 11/07/2021 04:31:44 - INFO - __main__ - Step 51834: {'lr': 0.0003726966563380393, 'samples': 9952128, 'steps': 51833, 'loss/train': 1.306240200996399} 11/07/2021 04:31:44 - INFO - __main__ - Step 51835: {'lr': 0.00037269203265124807, 'samples': 9952320, 'steps': 51834, 'loss/train': 1.5331083536148071} 11/07/2021 04:31:45 - INFO - __main__ - Step 51836: {'lr': 0.00037268740890917374, 'samples': 9952512, 'steps': 51835, 'loss/train': 1.4534672498703003} 11/07/2021 04:31:46 - INFO - __main__ - Step 51837: {'lr': 0.0003726827851118183, 'samples': 9952704, 'steps': 51836, 'loss/train': 1.3268409967422485} 11/07/2021 04:31:46 - INFO - __main__ - Step 51838: {'lr': 0.00037267816125918394, 'samples': 9952896, 'steps': 51837, 'loss/train': 1.8532732725143433} 11/07/2021 04:31:46 - INFO - __main__ - Step 51839: {'lr': 0.00037267353735127276, 'samples': 9953088, 'steps': 51838, 'loss/train': 1.5745549201965332} 11/07/2021 04:31:47 - INFO - __main__ - Step 51840: {'lr': 0.00037266891338808667, 'samples': 9953280, 'steps': 51839, 'loss/train': 1.067560076713562} 11/07/2021 04:31:47 - INFO - __main__ - Step 51841: {'lr': 0.00037266428936962785, 'samples': 9953472, 'steps': 51840, 'loss/train': 1.5043675899505615} 11/07/2021 04:31:48 - INFO - __main__ - Step 51842: {'lr': 0.00037265966529589846, 'samples': 9953664, 'steps': 51841, 'loss/train': 1.3698827028274536} 11/07/2021 04:31:48 - INFO - __main__ - Step 51843: {'lr': 0.0003726550411669005, 'samples': 9953856, 'steps': 51842, 'loss/train': 0.9738919734954834} 11/07/2021 04:31:49 - INFO - __main__ - Step 51844: {'lr': 0.000372650416982636, 'samples': 9954048, 'steps': 51843, 'loss/train': 2.3924238681793213} 11/07/2021 04:31:49 - INFO - __main__ - Step 51845: {'lr': 0.0003726457927431073, 'samples': 9954240, 'steps': 51844, 'loss/train': 1.8212895393371582} 11/07/2021 04:31:50 - INFO - __main__ - Step 51846: {'lr': 0.0003726411684483161, 'samples': 9954432, 'steps': 51845, 'loss/train': 1.4313552379608154} 11/07/2021 04:31:50 - INFO - __main__ - Step 51847: {'lr': 0.0003726365440982648, 'samples': 9954624, 'steps': 51846, 'loss/train': 1.6480627059936523} 11/07/2021 04:31:51 - INFO - __main__ - Step 51848: {'lr': 0.00037263191969295537, 'samples': 9954816, 'steps': 51847, 'loss/train': 1.3523963689804077} 11/07/2021 04:31:51 - INFO - __main__ - Step 51849: {'lr': 0.0003726272952323898, 'samples': 9955008, 'steps': 51848, 'loss/train': 1.597180962562561} 11/07/2021 04:31:51 - INFO - __main__ - Step 51850: {'lr': 0.0003726226707165703, 'samples': 9955200, 'steps': 51849, 'loss/train': 1.6327004432678223} 11/07/2021 04:31:52 - INFO - __main__ - Step 51851: {'lr': 0.000372618046145499, 'samples': 9955392, 'steps': 51850, 'loss/train': 1.4291988611221313} 11/07/2021 04:31:53 - INFO - __main__ - Step 51852: {'lr': 0.0003726134215191778, 'samples': 9955584, 'steps': 51851, 'loss/train': 1.3706315755844116} 11/07/2021 04:31:53 - INFO - __main__ - Step 51853: {'lr': 0.0003726087968376089, 'samples': 9955776, 'steps': 51852, 'loss/train': 1.2131381034851074} 11/07/2021 04:31:54 - INFO - __main__ - Step 51854: {'lr': 0.0003726041721007944, 'samples': 9955968, 'steps': 51853, 'loss/train': 5.908267498016357} 11/07/2021 04:31:54 - INFO - __main__ - Step 51855: {'lr': 0.0003725995473087363, 'samples': 9956160, 'steps': 51854, 'loss/train': 1.3329787254333496} 11/07/2021 04:31:54 - INFO - __main__ - Step 51856: {'lr': 0.0003725949224614368, 'samples': 9956352, 'steps': 51855, 'loss/train': 1.276503562927246} 11/07/2021 04:31:55 - INFO - __main__ - Step 51857: {'lr': 0.00037259029755889783, 'samples': 9956544, 'steps': 51856, 'loss/train': 0.7896950244903564} 11/07/2021 04:31:56 - INFO - __main__ - Step 51858: {'lr': 0.00037258567260112165, 'samples': 9956736, 'steps': 51857, 'loss/train': 0.9399859309196472} 11/07/2021 04:31:56 - INFO - __main__ - Step 51859: {'lr': 0.00037258104758811024, 'samples': 9956928, 'steps': 51858, 'loss/train': 1.4322720766067505} 11/07/2021 04:31:56 - INFO - __main__ - Step 51860: {'lr': 0.00037257642251986567, 'samples': 9957120, 'steps': 51859, 'loss/train': 2.2745914459228516} 11/07/2021 04:31:57 - INFO - __main__ - Step 51861: {'lr': 0.00037257179739639006, 'samples': 9957312, 'steps': 51860, 'loss/train': 1.4123642444610596} 11/07/2021 04:31:57 - INFO - __main__ - Step 51862: {'lr': 0.00037256717221768556, 'samples': 9957504, 'steps': 51861, 'loss/train': 1.6009799242019653} 11/07/2021 04:31:58 - INFO - __main__ - Step 51863: {'lr': 0.0003725625469837541, 'samples': 9957696, 'steps': 51862, 'loss/train': 1.8816026449203491} 11/07/2021 04:31:58 - INFO - __main__ - Step 51864: {'lr': 0.00037255792169459785, 'samples': 9957888, 'steps': 51863, 'loss/train': 1.9500659704208374} 11/07/2021 04:31:59 - INFO - __main__ - Step 51865: {'lr': 0.00037255329635021896, 'samples': 9958080, 'steps': 51864, 'loss/train': 1.6771234273910522} 11/07/2021 04:31:59 - INFO - __main__ - Step 51866: {'lr': 0.0003725486709506194, 'samples': 9958272, 'steps': 51865, 'loss/train': 1.736471176147461} 11/07/2021 04:31:59 - INFO - __main__ - Step 51867: {'lr': 0.0003725440454958013, 'samples': 9958464, 'steps': 51866, 'loss/train': 1.9499398469924927} 11/07/2021 04:32:00 - INFO - __main__ - Step 51868: {'lr': 0.0003725394199857667, 'samples': 9958656, 'steps': 51867, 'loss/train': 1.7287940979003906} 11/07/2021 04:32:01 - INFO - __main__ - Step 51869: {'lr': 0.0003725347944205178, 'samples': 9958848, 'steps': 51868, 'loss/train': 1.9378396272659302} 11/07/2021 04:32:01 - INFO - __main__ - Step 51870: {'lr': 0.0003725301688000566, 'samples': 9959040, 'steps': 51869, 'loss/train': 2.1761879920959473} 11/07/2021 04:32:02 - INFO - __main__ - Step 51871: {'lr': 0.0003725255431243852, 'samples': 9959232, 'steps': 51870, 'loss/train': 1.4214507341384888} 11/07/2021 04:32:02 - INFO - __main__ - Step 51872: {'lr': 0.00037252091739350566, 'samples': 9959424, 'steps': 51871, 'loss/train': 1.181527853012085} 11/07/2021 04:32:03 - INFO - __main__ - Step 51873: {'lr': 0.0003725162916074201, 'samples': 9959616, 'steps': 51872, 'loss/train': 1.4796147346496582} 11/07/2021 04:32:03 - INFO - __main__ - Step 51874: {'lr': 0.0003725116657661306, 'samples': 9959808, 'steps': 51873, 'loss/train': 1.6419570446014404} 11/07/2021 04:32:04 - INFO - __main__ - Step 51875: {'lr': 0.00037250703986963917, 'samples': 9960000, 'steps': 51874, 'loss/train': 1.7105646133422852} 11/07/2021 04:32:04 - INFO - __main__ - Step 51876: {'lr': 0.000372502413917948, 'samples': 9960192, 'steps': 51875, 'loss/train': 1.1473350524902344} 11/07/2021 04:32:04 - INFO - __main__ - Step 51877: {'lr': 0.00037249778791105916, 'samples': 9960384, 'steps': 51876, 'loss/train': 1.1271415948867798} 11/07/2021 04:32:06 - INFO - __main__ - Step 51878: {'lr': 0.0003724931618489747, 'samples': 9960576, 'steps': 51877, 'loss/train': 1.3002792596817017} 11/07/2021 04:32:06 - INFO - __main__ - Step 51879: {'lr': 0.0003724885357316967, 'samples': 9960768, 'steps': 51878, 'loss/train': 1.4516825675964355} 11/07/2021 04:32:06 - INFO - __main__ - Step 51880: {'lr': 0.00037248390955922726, 'samples': 9960960, 'steps': 51879, 'loss/train': 1.018768072128296} 11/07/2021 04:32:07 - INFO - __main__ - Step 51881: {'lr': 0.00037247928333156844, 'samples': 9961152, 'steps': 51880, 'loss/train': 1.8426998853683472} 11/07/2021 04:32:07 - INFO - __main__ - Step 51882: {'lr': 0.0003724746570487223, 'samples': 9961344, 'steps': 51881, 'loss/train': 1.809626817703247} 11/07/2021 04:32:07 - INFO - __main__ - Step 51883: {'lr': 0.00037247003071069106, 'samples': 9961536, 'steps': 51882, 'loss/train': 1.4476546049118042} 11/07/2021 04:32:08 - INFO - __main__ - Step 51884: {'lr': 0.0003724654043174767, 'samples': 9961728, 'steps': 51883, 'loss/train': 1.1984243392944336} 11/07/2021 04:32:09 - INFO - __main__ - Step 51885: {'lr': 0.0003724607778690813, 'samples': 9961920, 'steps': 51884, 'loss/train': 1.2851240634918213} 11/07/2021 04:32:09 - INFO - __main__ - Step 51886: {'lr': 0.00037245615136550695, 'samples': 9962112, 'steps': 51885, 'loss/train': 1.0749436616897583} 11/07/2021 04:32:09 - INFO - __main__ - Step 51887: {'lr': 0.00037245152480675577, 'samples': 9962304, 'steps': 51886, 'loss/train': 1.9282810688018799} 11/07/2021 04:32:10 - INFO - __main__ - Step 51888: {'lr': 0.0003724468981928298, 'samples': 9962496, 'steps': 51887, 'loss/train': 1.6814134120941162} 11/07/2021 04:32:10 - INFO - __main__ - Step 51889: {'lr': 0.00037244227152373113, 'samples': 9962688, 'steps': 51888, 'loss/train': 1.3974967002868652} 11/07/2021 04:32:11 - INFO - __main__ - Step 51890: {'lr': 0.0003724376447994619, 'samples': 9962880, 'steps': 51889, 'loss/train': 1.7117220163345337} 11/07/2021 04:32:12 - INFO - __main__ - Step 51891: {'lr': 0.00037243301802002414, 'samples': 9963072, 'steps': 51890, 'loss/train': 1.6015797853469849} 11/07/2021 04:32:12 - INFO - __main__ - Step 51892: {'lr': 0.00037242839118542, 'samples': 9963264, 'steps': 51891, 'loss/train': 1.636836051940918} 11/07/2021 04:32:12 - INFO - __main__ - Step 51893: {'lr': 0.00037242376429565143, 'samples': 9963456, 'steps': 51892, 'loss/train': 1.7007267475128174} 11/07/2021 04:32:13 - INFO - __main__ - Step 51894: {'lr': 0.0003724191373507206, 'samples': 9963648, 'steps': 51893, 'loss/train': 0.9272041916847229} 11/07/2021 04:32:14 - INFO - __main__ - Step 51895: {'lr': 0.00037241451035062965, 'samples': 9963840, 'steps': 51894, 'loss/train': 1.2658214569091797} 11/07/2021 04:32:14 - INFO - __main__ - Step 51896: {'lr': 0.0003724098832953806, 'samples': 9964032, 'steps': 51895, 'loss/train': 1.257627248764038} 11/07/2021 04:32:14 - INFO - __main__ - Step 51897: {'lr': 0.00037240525618497555, 'samples': 9964224, 'steps': 51896, 'loss/train': 1.2878658771514893} 11/07/2021 04:32:15 - INFO - __main__ - Step 51898: {'lr': 0.00037240062901941663, 'samples': 9964416, 'steps': 51897, 'loss/train': 1.421335220336914} 11/07/2021 04:32:15 - INFO - __main__ - Step 51899: {'lr': 0.0003723960017987058, 'samples': 9964608, 'steps': 51898, 'loss/train': 1.9077128171920776} 11/07/2021 04:32:16 - INFO - __main__ - Step 51900: {'lr': 0.00037239137452284527, 'samples': 9964800, 'steps': 51899, 'loss/train': 1.497809648513794} 11/07/2021 04:32:16 - INFO - __main__ - Step 51901: {'lr': 0.0003723867471918371, 'samples': 9964992, 'steps': 51900, 'loss/train': 1.8138912916183472} 11/07/2021 04:32:17 - INFO - __main__ - Step 51902: {'lr': 0.00037238211980568326, 'samples': 9965184, 'steps': 51901, 'loss/train': 0.4053778350353241} 11/07/2021 04:32:17 - INFO - __main__ - Step 51903: {'lr': 0.00037237749236438593, 'samples': 9965376, 'steps': 51902, 'loss/train': 1.2938474416732788} 11/07/2021 04:32:18 - INFO - __main__ - Step 51904: {'lr': 0.0003723728648679472, 'samples': 9965568, 'steps': 51903, 'loss/train': 1.5468488931655884} 11/07/2021 04:32:18 - INFO - __main__ - Step 51905: {'lr': 0.0003723682373163693, 'samples': 9965760, 'steps': 51904, 'loss/train': 1.8980106115341187} 11/07/2021 04:32:19 - INFO - __main__ - Step 51906: {'lr': 0.0003723636097096539, 'samples': 9965952, 'steps': 51905, 'loss/train': 1.3081457614898682} 11/07/2021 04:32:19 - INFO - __main__ - Step 51907: {'lr': 0.00037235898204780347, 'samples': 9966144, 'steps': 51906, 'loss/train': 1.562143325805664} 11/07/2021 04:32:20 - INFO - __main__ - Step 51908: {'lr': 0.00037235435433082004, 'samples': 9966336, 'steps': 51907, 'loss/train': 1.954079270362854} 11/07/2021 04:32:20 - INFO - __main__ - Step 51909: {'lr': 0.0003723497265587055, 'samples': 9966528, 'steps': 51908, 'loss/train': 0.8910297751426697} 11/07/2021 04:32:21 - INFO - __main__ - Step 51910: {'lr': 0.0003723450987314622, 'samples': 9966720, 'steps': 51909, 'loss/train': 1.4652249813079834} 11/07/2021 04:32:21 - INFO - __main__ - Step 51911: {'lr': 0.00037234047084909195, 'samples': 9966912, 'steps': 51910, 'loss/train': 1.4521297216415405} 11/07/2021 04:32:22 - INFO - __main__ - Step 51912: {'lr': 0.0003723358429115971, 'samples': 9967104, 'steps': 51911, 'loss/train': 1.370575189590454} 11/07/2021 04:32:22 - INFO - __main__ - Step 51913: {'lr': 0.00037233121491897953, 'samples': 9967296, 'steps': 51912, 'loss/train': 0.8230983018875122} 11/07/2021 04:32:22 - INFO - __main__ - Step 51914: {'lr': 0.00037232658687124135, 'samples': 9967488, 'steps': 51913, 'loss/train': 1.110690951347351} 11/07/2021 04:32:23 - INFO - __main__ - Step 51915: {'lr': 0.00037232195876838484, 'samples': 9967680, 'steps': 51914, 'loss/train': 1.117390513420105} 11/07/2021 04:32:24 - INFO - __main__ - Step 51916: {'lr': 0.00037231733061041176, 'samples': 9967872, 'steps': 51915, 'loss/train': 1.2352626323699951} 11/07/2021 04:32:24 - INFO - __main__ - Step 51917: {'lr': 0.0003723127023973245, 'samples': 9968064, 'steps': 51916, 'loss/train': 1.2384299039840698} 11/07/2021 04:32:24 - INFO - __main__ - Step 51918: {'lr': 0.00037230807412912505, 'samples': 9968256, 'steps': 51917, 'loss/train': 1.602675437927246} 11/07/2021 04:32:25 - INFO - __main__ - Step 51919: {'lr': 0.00037230344580581543, 'samples': 9968448, 'steps': 51918, 'loss/train': 1.3293837308883667} 11/07/2021 04:32:26 - INFO - __main__ - Step 51920: {'lr': 0.00037229881742739776, 'samples': 9968640, 'steps': 51919, 'loss/train': 1.4339882135391235} 11/07/2021 04:32:26 - INFO - __main__ - Step 51921: {'lr': 0.0003722941889938741, 'samples': 9968832, 'steps': 51920, 'loss/train': 1.3232321739196777} 11/07/2021 04:32:26 - INFO - __main__ - Step 51922: {'lr': 0.0003722895605052466, 'samples': 9969024, 'steps': 51921, 'loss/train': 1.449321985244751} 11/07/2021 04:32:27 - INFO - __main__ - Step 51923: {'lr': 0.0003722849319615173, 'samples': 9969216, 'steps': 51922, 'loss/train': 1.5120227336883545} 11/07/2021 04:32:27 - INFO - __main__ - Step 51924: {'lr': 0.0003722803033626883, 'samples': 9969408, 'steps': 51923, 'loss/train': 1.466483235359192} 11/07/2021 04:32:28 - INFO - __main__ - Step 51925: {'lr': 0.0003722756747087617, 'samples': 9969600, 'steps': 51924, 'loss/train': 1.7208658456802368} 11/07/2021 04:32:29 - INFO - __main__ - Step 51926: {'lr': 0.0003722710459997395, 'samples': 9969792, 'steps': 51925, 'loss/train': 1.3644756078720093} 11/07/2021 04:32:29 - INFO - __main__ - Step 51927: {'lr': 0.00037226641723562393, 'samples': 9969984, 'steps': 51926, 'loss/train': 1.3063249588012695} 11/07/2021 04:32:29 - INFO - __main__ - Step 51928: {'lr': 0.000372261788416417, 'samples': 9970176, 'steps': 51927, 'loss/train': 1.3560901880264282} 11/07/2021 04:32:30 - INFO - __main__ - Step 51929: {'lr': 0.00037225715954212075, 'samples': 9970368, 'steps': 51928, 'loss/train': 1.6959338188171387} 11/07/2021 04:32:30 - INFO - __main__ - Step 51930: {'lr': 0.00037225253061273734, 'samples': 9970560, 'steps': 51929, 'loss/train': 1.0584107637405396} 11/07/2021 04:32:31 - INFO - __main__ - Step 51931: {'lr': 0.0003722479016282688, 'samples': 9970752, 'steps': 51930, 'loss/train': 1.3664273023605347} 11/07/2021 04:32:31 - INFO - __main__ - Step 51932: {'lr': 0.00037224327258871724, 'samples': 9970944, 'steps': 51931, 'loss/train': 1.2679414749145508} 11/07/2021 04:32:32 - INFO - __main__ - Step 51933: {'lr': 0.00037223864349408484, 'samples': 9971136, 'steps': 51932, 'loss/train': 1.3450313806533813} 11/07/2021 04:32:32 - INFO - __main__ - Step 51934: {'lr': 0.0003722340143443735, 'samples': 9971328, 'steps': 51933, 'loss/train': 1.2071459293365479} 11/07/2021 04:32:32 - INFO - __main__ - Step 51935: {'lr': 0.0003722293851395854, 'samples': 9971520, 'steps': 51934, 'loss/train': 1.6511484384536743} 11/07/2021 04:32:33 - INFO - __main__ - Step 51936: {'lr': 0.00037222475587972263, 'samples': 9971712, 'steps': 51935, 'loss/train': 1.1503103971481323} 11/07/2021 04:32:34 - INFO - __main__ - Step 51937: {'lr': 0.00037222012656478733, 'samples': 9971904, 'steps': 51936, 'loss/train': 1.4829190969467163} 11/07/2021 04:32:34 - INFO - __main__ - Step 51938: {'lr': 0.00037221549719478145, 'samples': 9972096, 'steps': 51937, 'loss/train': 1.1945439577102661} 11/07/2021 04:32:34 - INFO - __main__ - Step 51939: {'lr': 0.0003722108677697072, 'samples': 9972288, 'steps': 51938, 'loss/train': 1.6184502840042114} 11/07/2021 04:32:35 - INFO - __main__ - Step 51940: {'lr': 0.00037220623828956655, 'samples': 9972480, 'steps': 51939, 'loss/train': 1.6197539567947388} 11/07/2021 04:32:36 - INFO - __main__ - Step 51941: {'lr': 0.00037220160875436176, 'samples': 9972672, 'steps': 51940, 'loss/train': 1.4190608263015747} 11/07/2021 04:32:37 - INFO - __main__ - Step 51942: {'lr': 0.0003721969791640948, 'samples': 9972864, 'steps': 51941, 'loss/train': 1.4613615274429321} 11/07/2021 04:32:37 - INFO - __main__ - Step 51943: {'lr': 0.0003721923495187677, 'samples': 9973056, 'steps': 51942, 'loss/train': 1.7495886087417603} 11/07/2021 04:32:38 - INFO - __main__ - Step 51944: {'lr': 0.00037218771981838264, 'samples': 9973248, 'steps': 51943, 'loss/train': 1.5991290807724} 11/07/2021 04:32:38 - INFO - __main__ - Step 51945: {'lr': 0.0003721830900629416, 'samples': 9973440, 'steps': 51944, 'loss/train': 1.4240131378173828} 11/07/2021 04:32:38 - INFO - __main__ - Step 51946: {'lr': 0.00037217846025244686, 'samples': 9973632, 'steps': 51945, 'loss/train': 1.5212458372116089} 11/07/2021 04:32:39 - INFO - __main__ - Step 51947: {'lr': 0.0003721738303869004, 'samples': 9973824, 'steps': 51946, 'loss/train': 4.068519592285156} 11/07/2021 04:32:40 - INFO - __main__ - Step 51948: {'lr': 0.0003721692004663042, 'samples': 9974016, 'steps': 51947, 'loss/train': 4.253776550292969} 11/07/2021 04:32:40 - INFO - __main__ - Step 51949: {'lr': 0.0003721645704906605, 'samples': 9974208, 'steps': 51948, 'loss/train': 1.685030221939087} 11/07/2021 04:32:40 - INFO - __main__ - Step 51950: {'lr': 0.0003721599404599713, 'samples': 9974400, 'steps': 51949, 'loss/train': 0.7306886911392212} 11/07/2021 04:32:41 - INFO - __main__ - Step 51951: {'lr': 0.0003721553103742388, 'samples': 9974592, 'steps': 51950, 'loss/train': 1.0275191068649292} 11/07/2021 04:32:41 - INFO - __main__ - Step 51952: {'lr': 0.00037215068023346495, 'samples': 9974784, 'steps': 51951, 'loss/train': 1.06412935256958} 11/07/2021 04:32:43 - INFO - __main__ - Step 51953: {'lr': 0.0003721460500376518, 'samples': 9974976, 'steps': 51952, 'loss/train': 1.450171709060669} 11/07/2021 04:32:43 - INFO - __main__ - Step 51954: {'lr': 0.00037214141978680166, 'samples': 9975168, 'steps': 51953, 'loss/train': 2.867142915725708} 11/07/2021 04:32:44 - INFO - __main__ - Step 51955: {'lr': 0.00037213678948091637, 'samples': 9975360, 'steps': 51954, 'loss/train': 2.8771166801452637} 11/07/2021 04:32:44 - INFO - __main__ - Step 51956: {'lr': 0.0003721321591199982, 'samples': 9975552, 'steps': 51955, 'loss/train': 2.8488717079162598} 11/07/2021 04:32:44 - INFO - __main__ - Step 51957: {'lr': 0.00037212752870404917, 'samples': 9975744, 'steps': 51956, 'loss/train': 1.3689314126968384} 11/07/2021 04:32:45 - INFO - __main__ - Step 51958: {'lr': 0.0003721228982330713, 'samples': 9975936, 'steps': 51957, 'loss/train': 1.8371143341064453} 11/07/2021 04:32:45 - INFO - __main__ - Step 51959: {'lr': 0.0003721182677070668, 'samples': 9976128, 'steps': 51958, 'loss/train': 0.7011808753013611} 11/07/2021 04:32:46 - INFO - __main__ - Step 51960: {'lr': 0.00037211363712603767, 'samples': 9976320, 'steps': 51959, 'loss/train': 1.4102351665496826} 11/07/2021 04:32:47 - INFO - __main__ - Step 51961: {'lr': 0.00037210900648998604, 'samples': 9976512, 'steps': 51960, 'loss/train': 1.3600716590881348} 11/07/2021 04:32:47 - INFO - __main__ - Step 51962: {'lr': 0.0003721043757989139, 'samples': 9976704, 'steps': 51961, 'loss/train': 1.119926929473877} 11/07/2021 04:32:47 - INFO - __main__ - Step 51963: {'lr': 0.0003720997450528235, 'samples': 9976896, 'steps': 51962, 'loss/train': 1.2507145404815674} 11/07/2021 04:32:48 - INFO - __main__ - Step 51964: {'lr': 0.0003720951142517168, 'samples': 9977088, 'steps': 51963, 'loss/train': 1.713240623474121} 11/07/2021 04:32:48 - INFO - __main__ - Step 51965: {'lr': 0.0003720904833955959, 'samples': 9977280, 'steps': 51964, 'loss/train': 2.1338562965393066} 11/07/2021 04:32:49 - INFO - __main__ - Step 51966: {'lr': 0.000372085852484463, 'samples': 9977472, 'steps': 51965, 'loss/train': 2.168433904647827} 11/07/2021 04:32:50 - INFO - __main__ - Step 51967: {'lr': 0.00037208122151832004, 'samples': 9977664, 'steps': 51966, 'loss/train': 1.1912193298339844} 11/07/2021 04:32:50 - INFO - __main__ - Step 51968: {'lr': 0.0003720765904971691, 'samples': 9977856, 'steps': 51967, 'loss/train': 2.077446699142456} 11/07/2021 04:32:50 - INFO - __main__ - Step 51969: {'lr': 0.0003720719594210124, 'samples': 9978048, 'steps': 51968, 'loss/train': 1.4788835048675537} 11/07/2021 04:32:51 - INFO - __main__ - Step 51970: {'lr': 0.00037206732828985197, 'samples': 9978240, 'steps': 51969, 'loss/train': 0.874961793422699} 11/07/2021 04:32:52 - INFO - __main__ - Step 51971: {'lr': 0.00037206269710368987, 'samples': 9978432, 'steps': 51970, 'loss/train': 1.4377036094665527} 11/07/2021 04:32:52 - INFO - __main__ - Step 51972: {'lr': 0.0003720580658625282, 'samples': 9978624, 'steps': 51971, 'loss/train': 1.9104055166244507} 11/07/2021 04:32:52 - INFO - __main__ - Step 51973: {'lr': 0.00037205343456636907, 'samples': 9978816, 'steps': 51972, 'loss/train': 1.0862919092178345} 11/07/2021 04:32:53 - INFO - __main__ - Step 51974: {'lr': 0.0003720488032152145, 'samples': 9979008, 'steps': 51973, 'loss/train': 1.4456400871276855} 11/07/2021 04:32:53 - INFO - __main__ - Step 51975: {'lr': 0.0003720441718090667, 'samples': 9979200, 'steps': 51974, 'loss/train': 1.407605528831482} 11/07/2021 04:32:55 - INFO - __main__ - Step 51976: {'lr': 0.0003720395403479276, 'samples': 9979392, 'steps': 51975, 'loss/train': 1.2332403659820557} 11/07/2021 04:32:56 - INFO - __main__ - Step 51977: {'lr': 0.00037203490883179935, 'samples': 9979584, 'steps': 51976, 'loss/train': 1.1306183338165283} 11/07/2021 04:32:56 - INFO - __main__ - Step 51978: {'lr': 0.0003720302772606841, 'samples': 9979776, 'steps': 51977, 'loss/train': 1.499109148979187} 11/07/2021 04:32:56 - INFO - __main__ - Step 51979: {'lr': 0.00037202564563458394, 'samples': 9979968, 'steps': 51978, 'loss/train': 1.4061548709869385} 11/07/2021 04:32:57 - INFO - __main__ - Step 51980: {'lr': 0.00037202101395350084, 'samples': 9980160, 'steps': 51979, 'loss/train': 1.3858675956726074} 11/07/2021 04:32:57 - INFO - __main__ - Step 51981: {'lr': 0.0003720163822174369, 'samples': 9980352, 'steps': 51980, 'loss/train': 1.4802196025848389} 11/07/2021 04:32:57 - INFO - __main__ - Step 51982: {'lr': 0.0003720117504263944, 'samples': 9980544, 'steps': 51981, 'loss/train': 1.8351459503173828} 11/07/2021 04:32:58 - INFO - __main__ - Step 51983: {'lr': 0.0003720071185803752, 'samples': 9980736, 'steps': 51982, 'loss/train': 1.2026005983352661} 11/07/2021 04:32:59 - INFO - __main__ - Step 51984: {'lr': 0.00037200248667938155, 'samples': 9980928, 'steps': 51983, 'loss/train': 1.4524941444396973} 11/07/2021 04:32:59 - INFO - __main__ - Step 51985: {'lr': 0.00037199785472341536, 'samples': 9981120, 'steps': 51984, 'loss/train': 1.9673750400543213} 11/07/2021 04:32:59 - INFO - __main__ - Step 51986: {'lr': 0.00037199322271247887, 'samples': 9981312, 'steps': 51985, 'loss/train': 1.7801264524459839} 11/07/2021 04:33:00 - INFO - __main__ - Step 51987: {'lr': 0.00037198859064657415, 'samples': 9981504, 'steps': 51986, 'loss/train': 0.8017715215682983} 11/07/2021 04:33:00 - INFO - __main__ - Step 51988: {'lr': 0.0003719839585257032, 'samples': 9981696, 'steps': 51987, 'loss/train': 1.621757984161377} 11/07/2021 04:33:01 - INFO - __main__ - Step 51989: {'lr': 0.0003719793263498681, 'samples': 9981888, 'steps': 51988, 'loss/train': 1.4853129386901855} 11/07/2021 04:33:02 - INFO - __main__ - Step 51990: {'lr': 0.00037197469411907115, 'samples': 9982080, 'steps': 51989, 'loss/train': 1.7996766567230225} 11/07/2021 04:33:02 - INFO - __main__ - Step 51991: {'lr': 0.0003719700618333142, 'samples': 9982272, 'steps': 51990, 'loss/train': 1.4016635417938232} 11/07/2021 04:33:02 - INFO - __main__ - Step 51992: {'lr': 0.0003719654294925994, 'samples': 9982464, 'steps': 51991, 'loss/train': 1.3064631223678589} 11/07/2021 04:33:03 - INFO - __main__ - Step 51993: {'lr': 0.00037196079709692894, 'samples': 9982656, 'steps': 51992, 'loss/train': 1.7431122064590454} 11/07/2021 04:33:04 - INFO - __main__ - Step 51994: {'lr': 0.0003719561646463048, 'samples': 9982848, 'steps': 51993, 'loss/train': 1.6113237142562866} 11/07/2021 04:33:04 - INFO - __main__ - Step 51995: {'lr': 0.00037195153214072903, 'samples': 9983040, 'steps': 51994, 'loss/train': 1.4810059070587158} 11/07/2021 04:33:04 - INFO - __main__ - Step 51996: {'lr': 0.0003719468995802038, 'samples': 9983232, 'steps': 51995, 'loss/train': 1.485140085220337} 11/07/2021 04:33:05 - INFO - __main__ - Step 51997: {'lr': 0.0003719422669647312, 'samples': 9983424, 'steps': 51996, 'loss/train': 1.5733298063278198} 11/07/2021 04:33:05 - INFO - __main__ - Step 51998: {'lr': 0.0003719376342943133, 'samples': 9983616, 'steps': 51997, 'loss/train': 1.1706762313842773} 11/07/2021 04:33:06 - INFO - __main__ - Step 51999: {'lr': 0.00037193300156895223, 'samples': 9983808, 'steps': 51998, 'loss/train': 1.4785041809082031} 11/07/2021 04:33:06 - INFO - __main__ - Step 52000: {'lr': 0.00037192836878864995, 'samples': 9984000, 'steps': 51999, 'loss/train': 1.6460516452789307} 11/07/2021 04:33:07 - INFO - __main__ - Step 52001: {'lr': 0.00037192373595340864, 'samples': 9984192, 'steps': 52000, 'loss/train': 1.6769099235534668} 11/07/2021 04:33:07 - INFO - __main__ - Step 52002: {'lr': 0.0003719191030632304, 'samples': 9984384, 'steps': 52001, 'loss/train': 1.7142972946166992} 11/07/2021 04:33:07 - INFO - __main__ - Step 52003: {'lr': 0.0003719144701181173, 'samples': 9984576, 'steps': 52002, 'loss/train': 1.564306616783142} 11/07/2021 04:33:08 - INFO - __main__ - Step 52004: {'lr': 0.0003719098371180714, 'samples': 9984768, 'steps': 52003, 'loss/train': 1.6705996990203857} 11/07/2021 04:33:09 - INFO - __main__ - Step 52005: {'lr': 0.00037190520406309483, 'samples': 9984960, 'steps': 52004, 'loss/train': 1.523310899734497} 11/07/2021 04:33:09 - INFO - __main__ - Step 52006: {'lr': 0.00037190057095318966, 'samples': 9985152, 'steps': 52005, 'loss/train': 1.764891505241394} 11/07/2021 04:33:10 - INFO - __main__ - Step 52007: {'lr': 0.00037189593778835794, 'samples': 9985344, 'steps': 52006, 'loss/train': 0.8343120217323303} 11/07/2021 04:33:10 - INFO - __main__ - Step 52008: {'lr': 0.0003718913045686018, 'samples': 9985536, 'steps': 52007, 'loss/train': 0.5164644122123718} 11/07/2021 04:33:10 - INFO - __main__ - Step 52009: {'lr': 0.0003718866712939233, 'samples': 9985728, 'steps': 52008, 'loss/train': 1.4166632890701294} 11/07/2021 04:33:11 - INFO - __main__ - Step 52010: {'lr': 0.00037188203796432464, 'samples': 9985920, 'steps': 52009, 'loss/train': 0.5160446166992188} 11/07/2021 04:33:12 - INFO - __main__ - Step 52011: {'lr': 0.00037187740457980776, 'samples': 9986112, 'steps': 52010, 'loss/train': 1.352651596069336} 11/07/2021 04:33:12 - INFO - __main__ - Step 52012: {'lr': 0.0003718727711403748, 'samples': 9986304, 'steps': 52011, 'loss/train': 1.4476298093795776} 11/07/2021 04:33:12 - INFO - __main__ - Step 52013: {'lr': 0.00037186813764602785, 'samples': 9986496, 'steps': 52012, 'loss/train': 0.8069983720779419} 11/07/2021 04:33:13 - INFO - __main__ - Step 52014: {'lr': 0.00037186350409676894, 'samples': 9986688, 'steps': 52013, 'loss/train': 1.9650357961654663} 11/07/2021 04:33:14 - INFO - __main__ - Step 52015: {'lr': 0.00037185887049260023, 'samples': 9986880, 'steps': 52014, 'loss/train': 1.585021734237671} 11/07/2021 04:33:14 - INFO - __main__ - Step 52016: {'lr': 0.0003718542368335239, 'samples': 9987072, 'steps': 52015, 'loss/train': 1.3363664150238037} 11/07/2021 04:33:14 - INFO - __main__ - Step 52017: {'lr': 0.0003718496031195419, 'samples': 9987264, 'steps': 52016, 'loss/train': 1.2658580541610718} 11/07/2021 04:33:15 - INFO - __main__ - Step 52018: {'lr': 0.00037184496935065625, 'samples': 9987456, 'steps': 52017, 'loss/train': 1.2854598760604858} 11/07/2021 04:33:15 - INFO - __main__ - Step 52019: {'lr': 0.0003718403355268692, 'samples': 9987648, 'steps': 52018, 'loss/train': 1.6694667339324951} 11/07/2021 04:33:16 - INFO - __main__ - Step 52020: {'lr': 0.0003718357016481828, 'samples': 9987840, 'steps': 52019, 'loss/train': 1.105134129524231} 11/07/2021 04:33:16 - INFO - __main__ - Step 52021: {'lr': 0.00037183106771459905, 'samples': 9988032, 'steps': 52020, 'loss/train': 1.6343470811843872} 11/07/2021 04:33:17 - INFO - __main__ - Step 52022: {'lr': 0.00037182643372612014, 'samples': 9988224, 'steps': 52021, 'loss/train': 1.064949631690979} 11/07/2021 04:33:17 - INFO - __main__ - Step 52023: {'lr': 0.00037182179968274807, 'samples': 9988416, 'steps': 52022, 'loss/train': 1.6126441955566406} 11/07/2021 04:33:18 - INFO - __main__ - Step 52024: {'lr': 0.00037181716558448507, 'samples': 9988608, 'steps': 52023, 'loss/train': 1.5883631706237793} 11/07/2021 04:33:19 - INFO - __main__ - Step 52025: {'lr': 0.0003718125314313331, 'samples': 9988800, 'steps': 52024, 'loss/train': 1.135672688484192} 11/07/2021 04:33:19 - INFO - __main__ - Step 52026: {'lr': 0.0003718078972232943, 'samples': 9988992, 'steps': 52025, 'loss/train': 1.3619341850280762} 11/07/2021 04:33:19 - INFO - __main__ - Step 52027: {'lr': 0.0003718032629603707, 'samples': 9989184, 'steps': 52026, 'loss/train': 1.456044316291809} 11/07/2021 04:33:20 - INFO - __main__ - Step 52028: {'lr': 0.00037179862864256444, 'samples': 9989376, 'steps': 52027, 'loss/train': 1.5300452709197998} 11/07/2021 04:33:20 - INFO - __main__ - Step 52029: {'lr': 0.00037179399426987757, 'samples': 9989568, 'steps': 52028, 'loss/train': 1.679802656173706} 11/07/2021 04:33:21 - INFO - __main__ - Step 52030: {'lr': 0.0003717893598423122, 'samples': 9989760, 'steps': 52029, 'loss/train': 1.2130610942840576} 11/07/2021 04:33:21 - INFO - __main__ - Step 52031: {'lr': 0.0003717847253598705, 'samples': 9989952, 'steps': 52030, 'loss/train': 1.6011747121810913} 11/07/2021 04:33:22 - INFO - __main__ - Step 52032: {'lr': 0.0003717800908225544, 'samples': 9990144, 'steps': 52031, 'loss/train': 0.8002707958221436} 11/07/2021 04:33:22 - INFO - __main__ - Step 52033: {'lr': 0.0003717754562303661, 'samples': 9990336, 'steps': 52032, 'loss/train': 1.558195948600769} 11/07/2021 04:33:22 - INFO - __main__ - Step 52034: {'lr': 0.00037177082158330773, 'samples': 9990528, 'steps': 52033, 'loss/train': 1.7184574604034424} 11/07/2021 04:33:23 - INFO - __main__ - Step 52035: {'lr': 0.0003717661868813812, 'samples': 9990720, 'steps': 52034, 'loss/train': 1.2530268430709839} 11/07/2021 04:33:24 - INFO - __main__ - Step 52036: {'lr': 0.00037176155212458875, 'samples': 9990912, 'steps': 52035, 'loss/train': 1.1753389835357666} 11/07/2021 04:33:24 - INFO - __main__ - Step 52037: {'lr': 0.0003717569173129324, 'samples': 9991104, 'steps': 52036, 'loss/train': 1.2855608463287354} 11/07/2021 04:33:25 - INFO - __main__ - Step 52038: {'lr': 0.0003717522824464143, 'samples': 9991296, 'steps': 52037, 'loss/train': 1.7887563705444336} 11/07/2021 04:33:25 - INFO - __main__ - Step 52039: {'lr': 0.0003717476475250365, 'samples': 9991488, 'steps': 52038, 'loss/train': 1.3528296947479248} 11/07/2021 04:33:25 - INFO - __main__ - Step 52040: {'lr': 0.0003717430125488011, 'samples': 9991680, 'steps': 52039, 'loss/train': 1.8824183940887451} 11/07/2021 04:33:26 - INFO - __main__ - Step 52041: {'lr': 0.0003717383775177101, 'samples': 9991872, 'steps': 52040, 'loss/train': 1.8882192373275757} 11/07/2021 04:33:27 - INFO - __main__ - Step 52042: {'lr': 0.0003717337424317657, 'samples': 9992064, 'steps': 52041, 'loss/train': 1.4328219890594482} 11/07/2021 04:33:27 - INFO - __main__ - Step 52043: {'lr': 0.00037172910729097006, 'samples': 9992256, 'steps': 52042, 'loss/train': 1.5730630159378052} 11/07/2021 04:33:28 - INFO - __main__ - Step 52044: {'lr': 0.000371724472095325, 'samples': 9992448, 'steps': 52043, 'loss/train': 1.2859323024749756} 11/07/2021 04:33:28 - INFO - __main__ - Step 52045: {'lr': 0.00037171983684483286, 'samples': 9992640, 'steps': 52044, 'loss/train': 1.8064703941345215} 11/07/2021 04:33:29 - INFO - __main__ - Step 52046: {'lr': 0.00037171520153949565, 'samples': 9992832, 'steps': 52045, 'loss/train': 1.1901495456695557} 11/07/2021 04:33:29 - INFO - __main__ - Step 52047: {'lr': 0.00037171056617931543, 'samples': 9993024, 'steps': 52046, 'loss/train': 1.4095689058303833} 11/07/2021 04:33:30 - INFO - __main__ - Step 52048: {'lr': 0.00037170593076429426, 'samples': 9993216, 'steps': 52047, 'loss/train': 1.3955882787704468} 11/07/2021 04:33:30 - INFO - __main__ - Step 52049: {'lr': 0.00037170129529443436, 'samples': 9993408, 'steps': 52048, 'loss/train': 1.4494365453720093} 11/07/2021 04:33:30 - INFO - __main__ - Step 52050: {'lr': 0.0003716966597697377, 'samples': 9993600, 'steps': 52049, 'loss/train': 1.5685220956802368} 11/07/2021 04:33:31 - INFO - __main__ - Step 52051: {'lr': 0.0003716920241902064, 'samples': 9993792, 'steps': 52050, 'loss/train': 1.1115155220031738} 11/07/2021 04:33:32 - INFO - __main__ - Step 52052: {'lr': 0.0003716873885558425, 'samples': 9993984, 'steps': 52051, 'loss/train': 1.441890835762024} 11/07/2021 04:33:32 - INFO - __main__ - Step 52053: {'lr': 0.0003716827528666482, 'samples': 9994176, 'steps': 52052, 'loss/train': 1.656650424003601} 11/07/2021 04:33:32 - INFO - __main__ - Step 52054: {'lr': 0.0003716781171226255, 'samples': 9994368, 'steps': 52053, 'loss/train': 1.5867408514022827} 11/07/2021 04:33:33 - INFO - __main__ - Step 52055: {'lr': 0.00037167348132377656, 'samples': 9994560, 'steps': 52054, 'loss/train': 1.499980092048645} 11/07/2021 04:33:34 - INFO - __main__ - Step 52056: {'lr': 0.0003716688454701034, 'samples': 9994752, 'steps': 52055, 'loss/train': 1.607343316078186} 11/07/2021 04:33:34 - INFO - __main__ - Step 52057: {'lr': 0.00037166420956160815, 'samples': 9994944, 'steps': 52056, 'loss/train': 0.8065264821052551} 11/07/2021 04:33:34 - INFO - __main__ - Step 52058: {'lr': 0.0003716595735982928, 'samples': 9995136, 'steps': 52057, 'loss/train': 1.3024451732635498} 11/07/2021 04:33:35 - INFO - __main__ - Step 52059: {'lr': 0.0003716549375801597, 'samples': 9995328, 'steps': 52058, 'loss/train': 2.1499733924865723} 11/07/2021 04:33:35 - INFO - __main__ - Step 52060: {'lr': 0.0003716503015072106, 'samples': 9995520, 'steps': 52059, 'loss/train': 1.2450579404830933} 11/07/2021 04:33:36 - INFO - __main__ - Step 52061: {'lr': 0.00037164566537944776, 'samples': 9995712, 'steps': 52060, 'loss/train': 2.1072373390197754} 11/07/2021 04:33:37 - INFO - __main__ - Step 52062: {'lr': 0.00037164102919687335, 'samples': 9995904, 'steps': 52061, 'loss/train': 1.5504851341247559} 11/07/2021 04:33:37 - INFO - __main__ - Step 52063: {'lr': 0.00037163639295948933, 'samples': 9996096, 'steps': 52062, 'loss/train': 1.3318660259246826} 11/07/2021 04:33:37 - INFO - __main__ - Step 52064: {'lr': 0.0003716317566672978, 'samples': 9996288, 'steps': 52063, 'loss/train': 1.722896933555603} 11/07/2021 04:33:38 - INFO - __main__ - Step 52065: {'lr': 0.00037162712032030095, 'samples': 9996480, 'steps': 52064, 'loss/train': 1.8784534931182861} 11/07/2021 04:33:38 - INFO - __main__ - Step 52066: {'lr': 0.00037162248391850076, 'samples': 9996672, 'steps': 52065, 'loss/train': 1.5277537107467651} 11/07/2021 04:33:39 - INFO - __main__ - Step 52067: {'lr': 0.0003716178474618993, 'samples': 9996864, 'steps': 52066, 'loss/train': 1.6308097839355469} 11/07/2021 04:33:39 - INFO - __main__ - Step 52068: {'lr': 0.0003716132109504988, 'samples': 9997056, 'steps': 52067, 'loss/train': 1.5583269596099854} 11/07/2021 04:33:40 - INFO - __main__ - Step 52069: {'lr': 0.0003716085743843012, 'samples': 9997248, 'steps': 52068, 'loss/train': 0.9065039157867432} 11/07/2021 04:33:40 - INFO - __main__ - Step 52070: {'lr': 0.0003716039377633087, 'samples': 9997440, 'steps': 52069, 'loss/train': 1.153981328010559} 11/07/2021 04:33:41 - INFO - __main__ - Step 52071: {'lr': 0.00037159930108752326, 'samples': 9997632, 'steps': 52070, 'loss/train': 1.4276835918426514} 11/07/2021 04:33:42 - INFO - __main__ - Step 52072: {'lr': 0.0003715946643569471, 'samples': 9997824, 'steps': 52071, 'loss/train': 1.7018718719482422} 11/07/2021 04:33:42 - INFO - __main__ - Step 52073: {'lr': 0.0003715900275715823, 'samples': 9998016, 'steps': 52072, 'loss/train': 1.0280437469482422} 11/07/2021 04:33:42 - INFO - __main__ - Step 52074: {'lr': 0.0003715853907314309, 'samples': 9998208, 'steps': 52073, 'loss/train': 0.7347991466522217} 11/07/2021 04:33:43 - INFO - __main__ - Step 52075: {'lr': 0.0003715807538364949, 'samples': 9998400, 'steps': 52074, 'loss/train': 1.3218401670455933} 11/07/2021 04:33:43 - INFO - __main__ - Step 52076: {'lr': 0.00037157611688677666, 'samples': 9998592, 'steps': 52075, 'loss/train': 1.8234870433807373} 11/07/2021 04:33:44 - INFO - __main__ - Step 52077: {'lr': 0.000371571479882278, 'samples': 9998784, 'steps': 52076, 'loss/train': 0.9849280118942261} 11/07/2021 04:33:44 - INFO - __main__ - Step 52078: {'lr': 0.00037156684282300105, 'samples': 9998976, 'steps': 52077, 'loss/train': 1.448745846748352} 11/07/2021 04:33:45 - INFO - __main__ - Step 52079: {'lr': 0.00037156220570894806, 'samples': 9999168, 'steps': 52078, 'loss/train': 1.3544305562973022} 11/07/2021 04:33:45 - INFO - __main__ - Step 52080: {'lr': 0.00037155756854012097, 'samples': 9999360, 'steps': 52079, 'loss/train': 1.0842026472091675} 11/07/2021 04:33:46 - INFO - __main__ - Step 52081: {'lr': 0.000371552931316522, 'samples': 9999552, 'steps': 52080, 'loss/train': 0.9628018736839294} 11/07/2021 04:33:47 - INFO - __main__ - Step 52082: {'lr': 0.00037154829403815307, 'samples': 9999744, 'steps': 52081, 'loss/train': 1.5966683626174927} 11/07/2021 04:33:47 - INFO - __main__ - Step 52083: {'lr': 0.0003715436567050163, 'samples': 9999936, 'steps': 52082, 'loss/train': 1.1053435802459717} 11/07/2021 04:33:47 - INFO - __main__ - Step 52084: {'lr': 0.0003715390193171139, 'samples': 10000128, 'steps': 52083, 'loss/train': 2.0694639682769775} 11/07/2021 04:33:48 - INFO - __main__ - Step 52085: {'lr': 0.0003715343818744479, 'samples': 10000320, 'steps': 52084, 'loss/train': 1.1751145124435425} 11/07/2021 04:33:48 - INFO - __main__ - Step 52086: {'lr': 0.0003715297443770203, 'samples': 10000512, 'steps': 52085, 'loss/train': 1.136919379234314} 11/07/2021 04:33:49 - INFO - __main__ - Step 52087: {'lr': 0.0003715251068248334, 'samples': 10000704, 'steps': 52086, 'loss/train': 1.6594806909561157} 11/07/2021 04:33:50 - INFO - __main__ - Step 52088: {'lr': 0.00037152046921788906, 'samples': 10000896, 'steps': 52087, 'loss/train': 0.5346752405166626} 11/07/2021 04:33:50 - INFO - __main__ - Step 52089: {'lr': 0.00037151583155618957, 'samples': 10001088, 'steps': 52088, 'loss/train': 1.593422770500183} 11/07/2021 04:33:50 - INFO - __main__ - Step 52090: {'lr': 0.00037151119383973684, 'samples': 10001280, 'steps': 52089, 'loss/train': 1.4512618780136108} 11/07/2021 04:33:51 - INFO - __main__ - Step 52091: {'lr': 0.0003715065560685331, 'samples': 10001472, 'steps': 52090, 'loss/train': 5.8047614097595215} 11/07/2021 04:33:51 - INFO - __main__ - Step 52092: {'lr': 0.00037150191824258027, 'samples': 10001664, 'steps': 52091, 'loss/train': 1.467519998550415} 11/07/2021 04:33:52 - INFO - __main__ - Step 52093: {'lr': 0.00037149728036188067, 'samples': 10001856, 'steps': 52092, 'loss/train': 1.4725695848464966} 11/07/2021 04:33:52 - INFO - __main__ - Step 52094: {'lr': 0.0003714926424264363, 'samples': 10002048, 'steps': 52093, 'loss/train': 1.010266661643982} 11/07/2021 04:33:53 - INFO - __main__ - Step 52095: {'lr': 0.00037148800443624906, 'samples': 10002240, 'steps': 52094, 'loss/train': 1.4820456504821777} 11/07/2021 04:33:53 - INFO - __main__ - Step 52096: {'lr': 0.0003714833663913213, 'samples': 10002432, 'steps': 52095, 'loss/train': 1.5023173093795776} 11/07/2021 04:33:53 - INFO - __main__ - Step 52097: {'lr': 0.00037147872829165497, 'samples': 10002624, 'steps': 52096, 'loss/train': 1.4011294841766357} 11/07/2021 04:33:54 - INFO - __main__ - Step 52098: {'lr': 0.00037147409013725226, 'samples': 10002816, 'steps': 52097, 'loss/train': 1.8304104804992676} 11/07/2021 04:33:55 - INFO - __main__ - Step 52099: {'lr': 0.00037146945192811513, 'samples': 10003008, 'steps': 52098, 'loss/train': 1.5196019411087036} 11/07/2021 04:33:55 - INFO - __main__ - Step 52100: {'lr': 0.00037146481366424585, 'samples': 10003200, 'steps': 52099, 'loss/train': 1.5309158563613892} 11/07/2021 04:33:56 - INFO - __main__ - Step 52101: {'lr': 0.0003714601753456463, 'samples': 10003392, 'steps': 52100, 'loss/train': 1.7246387004852295} 11/07/2021 04:33:56 - INFO - __main__ - Step 52102: {'lr': 0.0003714555369723187, 'samples': 10003584, 'steps': 52101, 'loss/train': 1.4665919542312622} 11/07/2021 04:33:56 - INFO - __main__ - Step 52103: {'lr': 0.00037145089854426504, 'samples': 10003776, 'steps': 52102, 'loss/train': 1.6272002458572388} 11/07/2021 04:33:57 - INFO - __main__ - Step 52104: {'lr': 0.0003714462600614876, 'samples': 10003968, 'steps': 52103, 'loss/train': 1.3589495420455933} 11/07/2021 04:33:58 - INFO - __main__ - Step 52105: {'lr': 0.0003714416215239883, 'samples': 10004160, 'steps': 52104, 'loss/train': 1.5083616971969604} 11/07/2021 04:33:58 - INFO - __main__ - Step 52106: {'lr': 0.00037143698293176923, 'samples': 10004352, 'steps': 52105, 'loss/train': 1.0739424228668213} 11/07/2021 04:33:58 - INFO - __main__ - Step 52107: {'lr': 0.0003714323442848326, 'samples': 10004544, 'steps': 52106, 'loss/train': 1.4502241611480713} 11/07/2021 04:33:59 - INFO - __main__ - Step 52108: {'lr': 0.0003714277055831804, 'samples': 10004736, 'steps': 52107, 'loss/train': 1.6501597166061401} 11/07/2021 04:34:00 - INFO - __main__ - Step 52109: {'lr': 0.00037142306682681476, 'samples': 10004928, 'steps': 52108, 'loss/train': 1.2590582370758057} 11/07/2021 04:34:00 - INFO - __main__ - Step 52110: {'lr': 0.00037141842801573775, 'samples': 10005120, 'steps': 52109, 'loss/train': 1.1598715782165527} 11/07/2021 04:34:00 - INFO - __main__ - Step 52111: {'lr': 0.00037141378914995146, 'samples': 10005312, 'steps': 52110, 'loss/train': 1.8247171640396118} 11/07/2021 04:34:01 - INFO - __main__ - Step 52112: {'lr': 0.000371409150229458, 'samples': 10005504, 'steps': 52111, 'loss/train': 1.4721322059631348} 11/07/2021 04:34:01 - INFO - __main__ - Step 52113: {'lr': 0.00037140451125425945, 'samples': 10005696, 'steps': 52112, 'loss/train': 1.5720247030258179} 11/07/2021 04:34:02 - INFO - __main__ - Step 52114: {'lr': 0.0003713998722243579, 'samples': 10005888, 'steps': 52113, 'loss/train': 1.7562553882598877} 11/07/2021 04:34:02 - INFO - __main__ - Step 52115: {'lr': 0.00037139523313975544, 'samples': 10006080, 'steps': 52114, 'loss/train': 1.492295742034912} 11/07/2021 04:34:03 - INFO - __main__ - Step 52116: {'lr': 0.00037139059400045416, 'samples': 10006272, 'steps': 52115, 'loss/train': 1.6417118310928345} 11/07/2021 04:34:03 - INFO - __main__ - Step 52117: {'lr': 0.00037138595480645613, 'samples': 10006464, 'steps': 52116, 'loss/train': 1.6110855340957642} 11/07/2021 04:34:03 - INFO - __main__ - Step 52118: {'lr': 0.0003713813155577635, 'samples': 10006656, 'steps': 52117, 'loss/train': 1.171640396118164} 11/07/2021 04:34:05 - INFO - __main__ - Step 52119: {'lr': 0.0003713766762543783, 'samples': 10006848, 'steps': 52118, 'loss/train': 1.4638890027999878} 11/07/2021 04:34:05 - INFO - __main__ - Step 52120: {'lr': 0.0003713720368963027, 'samples': 10007040, 'steps': 52119, 'loss/train': 1.145456075668335} 11/07/2021 04:34:05 - INFO - __main__ - Step 52121: {'lr': 0.0003713673974835387, 'samples': 10007232, 'steps': 52120, 'loss/train': 1.0905083417892456} 11/07/2021 04:34:06 - INFO - __main__ - Step 52122: {'lr': 0.0003713627580160884, 'samples': 10007424, 'steps': 52121, 'loss/train': 1.7812553644180298} 11/07/2021 04:34:06 - INFO - __main__ - Step 52123: {'lr': 0.0003713581184939539, 'samples': 10007616, 'steps': 52122, 'loss/train': 1.7905954122543335} 11/07/2021 04:34:07 - INFO - __main__ - Step 52124: {'lr': 0.00037135347891713733, 'samples': 10007808, 'steps': 52123, 'loss/train': 0.9739303588867188} 11/07/2021 04:34:07 - INFO - __main__ - Step 52125: {'lr': 0.00037134883928564074, 'samples': 10008000, 'steps': 52124, 'loss/train': 2.040052890777588} 11/07/2021 04:34:08 - INFO - __main__ - Step 52126: {'lr': 0.00037134419959946626, 'samples': 10008192, 'steps': 52125, 'loss/train': 1.1834639310836792} 11/07/2021 04:34:08 - INFO - __main__ - Step 52127: {'lr': 0.00037133955985861595, 'samples': 10008384, 'steps': 52126, 'loss/train': 1.6054444313049316} 11/07/2021 04:34:08 - INFO - __main__ - Step 52128: {'lr': 0.00037133492006309187, 'samples': 10008576, 'steps': 52127, 'loss/train': 0.986207127571106} 11/07/2021 04:34:09 - INFO - __main__ - Step 52129: {'lr': 0.00037133028021289625, 'samples': 10008768, 'steps': 52128, 'loss/train': 2.1998698711395264} 11/07/2021 04:34:10 - INFO - __main__ - Step 52130: {'lr': 0.000371325640308031, 'samples': 10008960, 'steps': 52129, 'loss/train': 1.144845724105835} 11/07/2021 04:34:10 - INFO - __main__ - Step 52131: {'lr': 0.0003713210003484982, 'samples': 10009152, 'steps': 52130, 'loss/train': 1.2223379611968994} 11/07/2021 04:34:10 - INFO - __main__ - Step 52132: {'lr': 0.00037131636033430017, 'samples': 10009344, 'steps': 52131, 'loss/train': 1.4668699502944946} 11/07/2021 04:34:11 - INFO - __main__ - Step 52133: {'lr': 0.0003713117202654388, 'samples': 10009536, 'steps': 52132, 'loss/train': 1.0695420503616333} 11/07/2021 04:34:12 - INFO - __main__ - Step 52134: {'lr': 0.0003713070801419163, 'samples': 10009728, 'steps': 52133, 'loss/train': 1.5132064819335938} 11/07/2021 04:34:12 - INFO - __main__ - Step 52135: {'lr': 0.00037130243996373466, 'samples': 10009920, 'steps': 52134, 'loss/train': 1.7165154218673706} 11/07/2021 04:34:13 - INFO - __main__ - Step 52136: {'lr': 0.00037129779973089596, 'samples': 10010112, 'steps': 52135, 'loss/train': 1.2871824502944946} 11/07/2021 04:34:13 - INFO - __main__ - Step 52137: {'lr': 0.0003712931594434024, 'samples': 10010304, 'steps': 52136, 'loss/train': 1.0461310148239136} 11/07/2021 04:34:13 - INFO - __main__ - Step 52138: {'lr': 0.000371288519101256, 'samples': 10010496, 'steps': 52137, 'loss/train': 1.2248376607894897} 11/07/2021 04:34:14 - INFO - __main__ - Step 52139: {'lr': 0.00037128387870445883, 'samples': 10010688, 'steps': 52138, 'loss/train': 1.1509482860565186} 11/07/2021 04:34:15 - INFO - __main__ - Step 52140: {'lr': 0.00037127923825301315, 'samples': 10010880, 'steps': 52139, 'loss/train': 1.6599177122116089} 11/07/2021 04:34:15 - INFO - __main__ - Step 52141: {'lr': 0.0003712745977469208, 'samples': 10011072, 'steps': 52140, 'loss/train': 2.065338134765625} 11/07/2021 04:34:15 - INFO - __main__ - Step 52142: {'lr': 0.000371269957186184, 'samples': 10011264, 'steps': 52141, 'loss/train': 1.4618128538131714} 11/07/2021 04:34:16 - INFO - __main__ - Step 52143: {'lr': 0.0003712653165708048, 'samples': 10011456, 'steps': 52142, 'loss/train': 0.9149712324142456} 11/07/2021 04:34:16 - INFO - __main__ - Step 52144: {'lr': 0.00037126067590078537, 'samples': 10011648, 'steps': 52143, 'loss/train': 1.1253894567489624} 11/07/2021 04:34:17 - INFO - __main__ - Step 52145: {'lr': 0.00037125603517612773, 'samples': 10011840, 'steps': 52144, 'loss/train': 2.2957513332366943} 11/07/2021 04:34:17 - INFO - __main__ - Step 52146: {'lr': 0.00037125139439683405, 'samples': 10012032, 'steps': 52145, 'loss/train': 1.360108494758606} 11/07/2021 04:34:18 - INFO - __main__ - Step 52147: {'lr': 0.00037124675356290635, 'samples': 10012224, 'steps': 52146, 'loss/train': 1.1610556840896606} 11/07/2021 04:34:18 - INFO - __main__ - Step 52148: {'lr': 0.00037124211267434667, 'samples': 10012416, 'steps': 52147, 'loss/train': 1.3958730697631836} 11/07/2021 04:34:18 - INFO - __main__ - Step 52149: {'lr': 0.0003712374717311572, 'samples': 10012608, 'steps': 52148, 'loss/train': 1.436025619506836} 11/07/2021 04:34:20 - INFO - __main__ - Step 52150: {'lr': 0.00037123283073333996, 'samples': 10012800, 'steps': 52149, 'loss/train': 0.7066998481750488} 11/07/2021 04:34:20 - INFO - __main__ - Step 52151: {'lr': 0.0003712281896808971, 'samples': 10012992, 'steps': 52150, 'loss/train': 1.125436782836914} 11/07/2021 04:34:20 - INFO - __main__ - Step 52152: {'lr': 0.0003712235485738307, 'samples': 10013184, 'steps': 52151, 'loss/train': 1.9018964767456055} 11/07/2021 04:34:21 - INFO - __main__ - Step 52153: {'lr': 0.0003712189074121428, 'samples': 10013376, 'steps': 52152, 'loss/train': 1.6347655057907104} 11/07/2021 04:34:21 - INFO - __main__ - Step 52154: {'lr': 0.0003712142661958356, 'samples': 10013568, 'steps': 52153, 'loss/train': 1.5352139472961426} 11/07/2021 04:34:22 - INFO - __main__ - Step 52155: {'lr': 0.0003712096249249111, 'samples': 10013760, 'steps': 52154, 'loss/train': 1.4936988353729248} 11/07/2021 04:34:22 - INFO - __main__ - Step 52156: {'lr': 0.00037120498359937136, 'samples': 10013952, 'steps': 52155, 'loss/train': 2.040149450302124} 11/07/2021 04:34:23 - INFO - __main__ - Step 52157: {'lr': 0.0003712003422192186, 'samples': 10014144, 'steps': 52156, 'loss/train': 1.3096764087677002} 11/07/2021 04:34:23 - INFO - __main__ - Step 52158: {'lr': 0.00037119570078445477, 'samples': 10014336, 'steps': 52157, 'loss/train': 1.6156455278396606} 11/07/2021 04:34:23 - INFO - __main__ - Step 52159: {'lr': 0.00037119105929508207, 'samples': 10014528, 'steps': 52158, 'loss/train': 1.2151671648025513} 11/07/2021 04:34:24 - INFO - __main__ - Step 52160: {'lr': 0.0003711864177511025, 'samples': 10014720, 'steps': 52159, 'loss/train': 1.809687614440918} 11/07/2021 04:34:25 - INFO - __main__ - Step 52161: {'lr': 0.0003711817761525183, 'samples': 10014912, 'steps': 52160, 'loss/train': 1.2535426616668701} 11/07/2021 04:34:25 - INFO - __main__ - Step 52162: {'lr': 0.00037117713449933136, 'samples': 10015104, 'steps': 52161, 'loss/train': 1.4790703058242798} 11/07/2021 04:34:25 - INFO - __main__ - Step 52163: {'lr': 0.0003711724927915439, 'samples': 10015296, 'steps': 52162, 'loss/train': 1.4326823949813843} 11/07/2021 04:34:26 - INFO - __main__ - Step 52164: {'lr': 0.000371167851029158, 'samples': 10015488, 'steps': 52163, 'loss/train': 1.416654109954834} 11/07/2021 04:34:26 - INFO - __main__ - Step 52165: {'lr': 0.0003711632092121757, 'samples': 10015680, 'steps': 52164, 'loss/train': 1.4773273468017578} 11/07/2021 04:34:27 - INFO - __main__ - Step 52166: {'lr': 0.00037115856734059916, 'samples': 10015872, 'steps': 52165, 'loss/train': 0.7316718101501465} 11/07/2021 04:34:27 - INFO - __main__ - Step 52167: {'lr': 0.0003711539254144305, 'samples': 10016064, 'steps': 52166, 'loss/train': 0.7960385084152222} 11/07/2021 04:34:28 - INFO - __main__ - Step 52168: {'lr': 0.0003711492834336717, 'samples': 10016256, 'steps': 52167, 'loss/train': 1.4833855628967285} 11/07/2021 04:34:28 - INFO - __main__ - Step 52169: {'lr': 0.00037114464139832487, 'samples': 10016448, 'steps': 52168, 'loss/train': 1.1186985969543457} 11/07/2021 04:34:28 - INFO - __main__ - Step 52170: {'lr': 0.00037113999930839215, 'samples': 10016640, 'steps': 52169, 'loss/train': 1.6359552145004272} 11/07/2021 04:34:29 - INFO - __main__ - Step 52171: {'lr': 0.00037113535716387565, 'samples': 10016832, 'steps': 52170, 'loss/train': 0.7417372465133667} 11/07/2021 04:34:30 - INFO - __main__ - Step 52172: {'lr': 0.00037113071496477733, 'samples': 10017024, 'steps': 52171, 'loss/train': 1.2818551063537598} 11/07/2021 04:34:30 - INFO - __main__ - Step 52173: {'lr': 0.0003711260727110995, 'samples': 10017216, 'steps': 52172, 'loss/train': 1.242611050605774} 11/07/2021 04:34:30 - INFO - __main__ - Step 52174: {'lr': 0.0003711214304028441, 'samples': 10017408, 'steps': 52173, 'loss/train': 1.686108946800232} 11/07/2021 04:34:31 - INFO - __main__ - Step 52175: {'lr': 0.00037111678804001324, 'samples': 10017600, 'steps': 52174, 'loss/train': 0.7480536103248596} 11/07/2021 04:34:32 - INFO - __main__ - Step 52176: {'lr': 0.00037111214562260896, 'samples': 10017792, 'steps': 52175, 'loss/train': 1.652687907218933} 11/07/2021 04:34:32 - INFO - __main__ - Step 52177: {'lr': 0.0003711075031506335, 'samples': 10017984, 'steps': 52176, 'loss/train': 1.6718931198120117} 11/07/2021 04:34:33 - INFO - __main__ - Step 52178: {'lr': 0.0003711028606240888, 'samples': 10018176, 'steps': 52177, 'loss/train': 1.234239101409912} 11/07/2021 04:34:33 - INFO - __main__ - Step 52179: {'lr': 0.00037109821804297706, 'samples': 10018368, 'steps': 52178, 'loss/train': 0.9123637676239014} 11/07/2021 04:34:33 - INFO - __main__ - Step 52180: {'lr': 0.00037109357540730033, 'samples': 10018560, 'steps': 52179, 'loss/train': 0.998516321182251} 11/07/2021 04:34:34 - INFO - __main__ - Step 52181: {'lr': 0.00037108893271706075, 'samples': 10018752, 'steps': 52180, 'loss/train': 1.4245970249176025} 11/07/2021 04:34:35 - INFO - __main__ - Step 52182: {'lr': 0.0003710842899722603, 'samples': 10018944, 'steps': 52181, 'loss/train': 0.6998801231384277} 11/07/2021 04:34:35 - INFO - __main__ - Step 52183: {'lr': 0.00037107964717290117, 'samples': 10019136, 'steps': 52182, 'loss/train': 1.276153326034546} 11/07/2021 04:34:35 - INFO - __main__ - Step 52184: {'lr': 0.0003710750043189854, 'samples': 10019328, 'steps': 52183, 'loss/train': 1.2006757259368896} 11/07/2021 04:34:36 - INFO - __main__ - Step 52185: {'lr': 0.0003710703614105151, 'samples': 10019520, 'steps': 52184, 'loss/train': 0.8838239908218384} 11/07/2021 04:34:37 - INFO - __main__ - Step 52186: {'lr': 0.0003710657184474924, 'samples': 10019712, 'steps': 52185, 'loss/train': 1.9587353467941284} 11/07/2021 04:34:37 - INFO - __main__ - Step 52187: {'lr': 0.00037106107542991937, 'samples': 10019904, 'steps': 52186, 'loss/train': 1.2354145050048828} 11/07/2021 04:34:37 - INFO - __main__ - Step 52188: {'lr': 0.00037105643235779803, 'samples': 10020096, 'steps': 52187, 'loss/train': 1.402997374534607} 11/07/2021 04:34:38 - INFO - __main__ - Step 52189: {'lr': 0.0003710517892311305, 'samples': 10020288, 'steps': 52188, 'loss/train': 1.4153070449829102} 11/07/2021 04:34:38 - INFO - __main__ - Step 52190: {'lr': 0.00037104714604991896, 'samples': 10020480, 'steps': 52189, 'loss/train': 1.4302728176116943} 11/07/2021 04:34:38 - INFO - __main__ - Step 52191: {'lr': 0.0003710425028141654, 'samples': 10020672, 'steps': 52190, 'loss/train': 1.6049634218215942} 11/07/2021 04:34:40 - INFO - __main__ - Step 52192: {'lr': 0.000371037859523872, 'samples': 10020864, 'steps': 52191, 'loss/train': 1.017406702041626} 11/07/2021 04:34:40 - INFO - __main__ - Step 52193: {'lr': 0.00037103321617904076, 'samples': 10021056, 'steps': 52192, 'loss/train': 1.6501271724700928} 11/07/2021 04:34:40 - INFO - __main__ - Step 52194: {'lr': 0.00037102857277967387, 'samples': 10021248, 'steps': 52193, 'loss/train': 1.695813775062561} 11/07/2021 04:34:41 - INFO - __main__ - Step 52195: {'lr': 0.0003710239293257734, 'samples': 10021440, 'steps': 52194, 'loss/train': 1.6397981643676758} 11/07/2021 04:34:41 - INFO - __main__ - Step 52196: {'lr': 0.00037101928581734136, 'samples': 10021632, 'steps': 52195, 'loss/train': 1.3900978565216064} 11/07/2021 04:34:42 - INFO - __main__ - Step 52197: {'lr': 0.00037101464225437986, 'samples': 10021824, 'steps': 52196, 'loss/train': 1.402624249458313} 11/07/2021 04:34:42 - INFO - __main__ - Step 52198: {'lr': 0.0003710099986368911, 'samples': 10022016, 'steps': 52197, 'loss/train': 1.3117009401321411} 11/07/2021 04:34:43 - INFO - __main__ - Step 52199: {'lr': 0.0003710053549648771, 'samples': 10022208, 'steps': 52198, 'loss/train': 1.4527676105499268} 11/07/2021 04:34:43 - INFO - __main__ - Step 52200: {'lr': 0.00037100071123833994, 'samples': 10022400, 'steps': 52199, 'loss/train': 1.0785988569259644} 11/07/2021 04:34:43 - INFO - __main__ - Step 52201: {'lr': 0.0003709960674572817, 'samples': 10022592, 'steps': 52200, 'loss/train': 1.6540666818618774} 11/07/2021 04:34:45 - INFO - __main__ - Step 52202: {'lr': 0.00037099142362170454, 'samples': 10022784, 'steps': 52201, 'loss/train': 1.695608377456665} 11/07/2021 04:34:45 - INFO - __main__ - Step 52203: {'lr': 0.0003709867797316105, 'samples': 10022976, 'steps': 52202, 'loss/train': 1.1415761709213257} 11/07/2021 04:34:46 - INFO - __main__ - Step 52204: {'lr': 0.0003709821357870016, 'samples': 10023168, 'steps': 52203, 'loss/train': 0.9897325038909912} 11/07/2021 04:34:46 - INFO - __main__ - Step 52205: {'lr': 0.0003709774917878802, 'samples': 10023360, 'steps': 52204, 'loss/train': 1.4016780853271484} 11/07/2021 04:34:46 - INFO - __main__ - Step 52206: {'lr': 0.00037097284773424805, 'samples': 10023552, 'steps': 52205, 'loss/train': 0.12023314088582993} 11/07/2021 04:34:47 - INFO - __main__ - Step 52207: {'lr': 0.0003709682036261075, 'samples': 10023744, 'steps': 52206, 'loss/train': 1.1758884191513062} 11/07/2021 04:34:48 - INFO - __main__ - Step 52208: {'lr': 0.00037096355946346045, 'samples': 10023936, 'steps': 52207, 'loss/train': 1.5671637058258057} 11/07/2021 04:34:48 - INFO - __main__ - Step 52209: {'lr': 0.00037095891524630914, 'samples': 10024128, 'steps': 52208, 'loss/train': 1.3916738033294678} 11/07/2021 04:34:48 - INFO - __main__ - Step 52210: {'lr': 0.00037095427097465564, 'samples': 10024320, 'steps': 52209, 'loss/train': 0.7325248122215271} 11/07/2021 04:34:49 - INFO - __main__ - Step 52211: {'lr': 0.00037094962664850194, 'samples': 10024512, 'steps': 52210, 'loss/train': 1.4294133186340332} 11/07/2021 04:34:50 - INFO - __main__ - Step 52212: {'lr': 0.00037094498226785023, 'samples': 10024704, 'steps': 52211, 'loss/train': 0.8006132245063782} 11/07/2021 04:34:50 - INFO - __main__ - Step 52213: {'lr': 0.00037094033783270256, 'samples': 10024896, 'steps': 52212, 'loss/train': 1.655676007270813} 11/07/2021 04:34:50 - INFO - __main__ - Step 52214: {'lr': 0.0003709356933430611, 'samples': 10025088, 'steps': 52213, 'loss/train': 1.8293867111206055} 11/07/2021 04:34:51 - INFO - __main__ - Step 52215: {'lr': 0.00037093104879892786, 'samples': 10025280, 'steps': 52214, 'loss/train': 1.439482569694519} 11/07/2021 04:34:51 - INFO - __main__ - Step 52216: {'lr': 0.000370926404200305, 'samples': 10025472, 'steps': 52215, 'loss/train': 0.9110205769538879} 11/07/2021 04:34:52 - INFO - __main__ - Step 52217: {'lr': 0.0003709217595471945, 'samples': 10025664, 'steps': 52216, 'loss/train': 1.7956041097640991} 11/07/2021 04:34:53 - INFO - __main__ - Step 52218: {'lr': 0.0003709171148395985, 'samples': 10025856, 'steps': 52217, 'loss/train': 1.5604493618011475} 11/07/2021 04:34:53 - INFO - __main__ - Step 52219: {'lr': 0.00037091247007751916, 'samples': 10026048, 'steps': 52218, 'loss/train': 1.6892719268798828} 11/07/2021 04:34:53 - INFO - __main__ - Step 52220: {'lr': 0.0003709078252609585, 'samples': 10026240, 'steps': 52219, 'loss/train': 1.3288346529006958} 11/07/2021 04:34:54 - INFO - __main__ - Step 52221: {'lr': 0.0003709031803899187, 'samples': 10026432, 'steps': 52220, 'loss/train': 1.9864845275878906} 11/07/2021 04:34:54 - INFO - __main__ - Step 52222: {'lr': 0.0003708985354644017, 'samples': 10026624, 'steps': 52221, 'loss/train': 1.2254571914672852} 11/07/2021 04:34:55 - INFO - __main__ - Step 52223: {'lr': 0.00037089389048440975, 'samples': 10026816, 'steps': 52222, 'loss/train': 1.5201977491378784} 11/07/2021 04:34:55 - INFO - __main__ - Step 52224: {'lr': 0.0003708892454499448, 'samples': 10027008, 'steps': 52223, 'loss/train': 2.122084856033325} 11/07/2021 04:34:56 - INFO - __main__ - Step 52225: {'lr': 0.00037088460036100915, 'samples': 10027200, 'steps': 52224, 'loss/train': 2.207977771759033} 11/07/2021 04:34:56 - INFO - __main__ - Step 52226: {'lr': 0.0003708799552176046, 'samples': 10027392, 'steps': 52225, 'loss/train': 1.3794692754745483} 11/07/2021 04:34:56 - INFO - __main__ - Step 52227: {'lr': 0.0003708753100197336, 'samples': 10027584, 'steps': 52226, 'loss/train': 1.742241382598877} 11/07/2021 04:34:57 - INFO - __main__ - Step 52228: {'lr': 0.00037087066476739795, 'samples': 10027776, 'steps': 52227, 'loss/train': 1.3941805362701416} 11/07/2021 04:34:58 - INFO - __main__ - Step 52229: {'lr': 0.0003708660194605998, 'samples': 10027968, 'steps': 52228, 'loss/train': 1.2187187671661377} 11/07/2021 04:34:58 - INFO - __main__ - Step 52230: {'lr': 0.0003708613740993414, 'samples': 10028160, 'steps': 52229, 'loss/train': 1.5902061462402344} 11/07/2021 04:34:59 - INFO - __main__ - Step 52231: {'lr': 0.00037085672868362464, 'samples': 10028352, 'steps': 52230, 'loss/train': 1.5531061887741089} 11/07/2021 04:34:59 - INFO - __main__ - Step 52232: {'lr': 0.0003708520832134518, 'samples': 10028544, 'steps': 52231, 'loss/train': 2.109830141067505} 11/07/2021 04:35:00 - INFO - __main__ - Step 52233: {'lr': 0.00037084743768882474, 'samples': 10028736, 'steps': 52232, 'loss/train': 1.4185560941696167} 11/07/2021 04:35:00 - INFO - __main__ - Step 52234: {'lr': 0.00037084279210974577, 'samples': 10028928, 'steps': 52233, 'loss/train': 1.798834204673767} 11/07/2021 04:35:01 - INFO - __main__ - Step 52235: {'lr': 0.00037083814647621686, 'samples': 10029120, 'steps': 52234, 'loss/train': 2.322261333465576} 11/07/2021 04:35:01 - INFO - __main__ - Step 52236: {'lr': 0.0003708335007882402, 'samples': 10029312, 'steps': 52235, 'loss/train': 1.1212799549102783} 11/07/2021 04:35:01 - INFO - __main__ - Step 52237: {'lr': 0.00037082885504581775, 'samples': 10029504, 'steps': 52236, 'loss/train': 0.9621900320053101} 11/07/2021 04:35:02 - INFO - __main__ - Step 52238: {'lr': 0.0003708242092489518, 'samples': 10029696, 'steps': 52237, 'loss/train': 1.47548508644104} 11/07/2021 04:35:03 - INFO - __main__ - Step 52239: {'lr': 0.0003708195633976442, 'samples': 10029888, 'steps': 52238, 'loss/train': 0.9711529612541199} 11/07/2021 04:35:03 - INFO - __main__ - Step 52240: {'lr': 0.0003708149174918972, 'samples': 10030080, 'steps': 52239, 'loss/train': 1.5734766721725464} 11/07/2021 04:35:03 - INFO - __main__ - Step 52241: {'lr': 0.000370810271531713, 'samples': 10030272, 'steps': 52240, 'loss/train': 1.292687177658081} 11/07/2021 04:35:04 - INFO - __main__ - Step 52242: {'lr': 0.0003708056255170934, 'samples': 10030464, 'steps': 52241, 'loss/train': 1.3701890707015991} 11/07/2021 04:35:04 - INFO - __main__ - Step 52243: {'lr': 0.0003708009794480407, 'samples': 10030656, 'steps': 52242, 'loss/train': 1.7831813097000122} 11/07/2021 04:35:05 - INFO - __main__ - Step 52244: {'lr': 0.0003707963333245569, 'samples': 10030848, 'steps': 52243, 'loss/train': 1.6027790307998657} 11/07/2021 04:35:05 - INFO - __main__ - Step 52245: {'lr': 0.0003707916871466442, 'samples': 10031040, 'steps': 52244, 'loss/train': 1.0570629835128784} 11/07/2021 04:35:06 - INFO - __main__ - Step 52246: {'lr': 0.0003707870409143046, 'samples': 10031232, 'steps': 52245, 'loss/train': 1.3571195602416992} 11/07/2021 04:35:06 - INFO - __main__ - Step 52247: {'lr': 0.00037078239462754023, 'samples': 10031424, 'steps': 52246, 'loss/train': 1.5466933250427246} 11/07/2021 04:35:06 - INFO - __main__ - Step 52248: {'lr': 0.0003707777482863532, 'samples': 10031616, 'steps': 52247, 'loss/train': 1.3029929399490356} 11/07/2021 04:35:07 - INFO - __main__ - Step 52249: {'lr': 0.00037077310189074554, 'samples': 10031808, 'steps': 52248, 'loss/train': 1.5803481340408325} 11/07/2021 04:35:08 - INFO - __main__ - Step 52250: {'lr': 0.0003707684554407194, 'samples': 10032000, 'steps': 52249, 'loss/train': 1.0084435939788818} 11/07/2021 04:35:08 - INFO - __main__ - Step 52251: {'lr': 0.0003707638089362769, 'samples': 10032192, 'steps': 52250, 'loss/train': 1.7735447883605957} 11/07/2021 04:35:08 - INFO - __main__ - Step 52252: {'lr': 0.00037075916237742, 'samples': 10032384, 'steps': 52251, 'loss/train': 1.7647929191589355} 11/07/2021 04:35:09 - INFO - __main__ - Step 52253: {'lr': 0.00037075451576415095, 'samples': 10032576, 'steps': 52252, 'loss/train': 2.0002384185791016} 11/07/2021 04:35:10 - INFO - __main__ - Step 52254: {'lr': 0.00037074986909647173, 'samples': 10032768, 'steps': 52253, 'loss/train': 1.2115262746810913} 11/07/2021 04:35:10 - INFO - __main__ - Step 52255: {'lr': 0.00037074522237438455, 'samples': 10032960, 'steps': 52254, 'loss/train': 1.3711144924163818} 11/07/2021 04:35:11 - INFO - __main__ - Step 52256: {'lr': 0.0003707405755978914, 'samples': 10033152, 'steps': 52255, 'loss/train': 1.4815617799758911} 11/07/2021 04:35:11 - INFO - __main__ - Step 52257: {'lr': 0.00037073592876699443, 'samples': 10033344, 'steps': 52256, 'loss/train': 1.395278811454773} 11/07/2021 04:35:11 - INFO - __main__ - Step 52258: {'lr': 0.0003707312818816956, 'samples': 10033536, 'steps': 52257, 'loss/train': 1.6057864427566528} 11/07/2021 04:35:12 - INFO - __main__ - Step 52259: {'lr': 0.00037072663494199724, 'samples': 10033728, 'steps': 52258, 'loss/train': 1.6924219131469727} 11/07/2021 04:35:13 - INFO - __main__ - Step 52260: {'lr': 0.0003707219879479013, 'samples': 10033920, 'steps': 52259, 'loss/train': 0.9788321852684021} 11/07/2021 04:35:13 - INFO - __main__ - Step 52261: {'lr': 0.0003707173408994099, 'samples': 10034112, 'steps': 52260, 'loss/train': 1.6294881105422974} 11/07/2021 04:35:13 - INFO - __main__ - Step 52262: {'lr': 0.0003707126937965251, 'samples': 10034304, 'steps': 52261, 'loss/train': 1.7520924806594849} 11/07/2021 04:35:14 - INFO - __main__ - Step 52263: {'lr': 0.0003707080466392491, 'samples': 10034496, 'steps': 52262, 'loss/train': 1.4391309022903442} 11/07/2021 04:35:15 - INFO - __main__ - Step 52264: {'lr': 0.0003707033994275838, 'samples': 10034688, 'steps': 52263, 'loss/train': 1.4674715995788574} 11/07/2021 04:35:15 - INFO - __main__ - Step 52265: {'lr': 0.0003706987521615315, 'samples': 10034880, 'steps': 52264, 'loss/train': 1.7465715408325195} 11/07/2021 04:35:15 - INFO - __main__ - Step 52266: {'lr': 0.0003706941048410941, 'samples': 10035072, 'steps': 52265, 'loss/train': 1.4335037469863892} 11/07/2021 04:35:16 - INFO - __main__ - Step 52267: {'lr': 0.0003706894574662739, 'samples': 10035264, 'steps': 52266, 'loss/train': 1.3291215896606445} 11/07/2021 04:35:16 - INFO - __main__ - Step 52268: {'lr': 0.0003706848100370729, 'samples': 10035456, 'steps': 52267, 'loss/train': 1.7214386463165283} 11/07/2021 04:35:17 - INFO - __main__ - Step 52269: {'lr': 0.00037068016255349315, 'samples': 10035648, 'steps': 52268, 'loss/train': 0.7420121431350708} 11/07/2021 04:35:17 - INFO - __main__ - Step 52270: {'lr': 0.0003706755150155368, 'samples': 10035840, 'steps': 52269, 'loss/train': 1.389876127243042} 11/07/2021 04:35:18 - INFO - __main__ - Step 52271: {'lr': 0.0003706708674232059, 'samples': 10036032, 'steps': 52270, 'loss/train': 1.0616005659103394} 11/07/2021 04:35:18 - INFO - __main__ - Step 52272: {'lr': 0.0003706662197765025, 'samples': 10036224, 'steps': 52271, 'loss/train': 1.2601686716079712} 11/07/2021 04:35:19 - INFO - __main__ - Step 52273: {'lr': 0.00037066157207542885, 'samples': 10036416, 'steps': 52272, 'loss/train': 1.566015362739563} 11/07/2021 04:35:19 - INFO - __main__ - Step 52274: {'lr': 0.00037065692431998695, 'samples': 10036608, 'steps': 52273, 'loss/train': 1.3770115375518799} 11/07/2021 04:35:20 - INFO - __main__ - Step 52275: {'lr': 0.00037065227651017897, 'samples': 10036800, 'steps': 52274, 'loss/train': 1.5455814599990845} 11/07/2021 04:35:20 - INFO - __main__ - Step 52276: {'lr': 0.0003706476286460068, 'samples': 10036992, 'steps': 52275, 'loss/train': 1.2364351749420166} 11/07/2021 04:35:21 - INFO - __main__ - Step 52277: {'lr': 0.0003706429807274728, 'samples': 10037184, 'steps': 52276, 'loss/train': 1.5304927825927734} 11/07/2021 04:35:21 - INFO - __main__ - Step 52278: {'lr': 0.0003706383327545788, 'samples': 10037376, 'steps': 52277, 'loss/train': 1.2667349576950073} 11/07/2021 04:35:21 - INFO - __main__ - Step 52279: {'lr': 0.0003706336847273271, 'samples': 10037568, 'steps': 52278, 'loss/train': 1.5852817296981812} 11/07/2021 04:35:22 - INFO - __main__ - Step 52280: {'lr': 0.00037062903664571975, 'samples': 10037760, 'steps': 52279, 'loss/train': 1.7066222429275513} 11/07/2021 04:35:23 - INFO - __main__ - Step 52281: {'lr': 0.00037062438850975877, 'samples': 10037952, 'steps': 52280, 'loss/train': 1.5558053255081177} 11/07/2021 04:35:23 - INFO - __main__ - Step 52282: {'lr': 0.00037061974031944635, 'samples': 10038144, 'steps': 52281, 'loss/train': 1.5429325103759766} 11/07/2021 04:35:23 - INFO - __main__ - Step 52283: {'lr': 0.0003706150920747845, 'samples': 10038336, 'steps': 52282, 'loss/train': 1.3963603973388672} 11/07/2021 04:35:24 - INFO - __main__ - Step 52284: {'lr': 0.00037061044377577535, 'samples': 10038528, 'steps': 52283, 'loss/train': 1.558125376701355} 11/07/2021 04:35:25 - INFO - __main__ - Step 52285: {'lr': 0.00037060579542242094, 'samples': 10038720, 'steps': 52284, 'loss/train': 1.5957164764404297} 11/07/2021 04:35:25 - INFO - __main__ - Step 52286: {'lr': 0.00037060114701472355, 'samples': 10038912, 'steps': 52285, 'loss/train': 1.7144229412078857} 11/07/2021 04:35:25 - INFO - __main__ - Step 52287: {'lr': 0.00037059649855268503, 'samples': 10039104, 'steps': 52286, 'loss/train': 1.2599412202835083} 11/07/2021 04:35:26 - INFO - __main__ - Step 52288: {'lr': 0.0003705918500363077, 'samples': 10039296, 'steps': 52287, 'loss/train': 1.2666865587234497} 11/07/2021 04:35:26 - INFO - __main__ - Step 52289: {'lr': 0.0003705872014655934, 'samples': 10039488, 'steps': 52288, 'loss/train': 1.5479722023010254} 11/07/2021 04:35:27 - INFO - __main__ - Step 52290: {'lr': 0.0003705825528405445, 'samples': 10039680, 'steps': 52289, 'loss/train': 1.3783215284347534} 11/07/2021 04:35:27 - INFO - __main__ - Step 52291: {'lr': 0.0003705779041611629, 'samples': 10039872, 'steps': 52290, 'loss/train': 1.649381160736084} 11/07/2021 04:35:28 - INFO - __main__ - Step 52292: {'lr': 0.00037057325542745075, 'samples': 10040064, 'steps': 52291, 'loss/train': 1.252488374710083} 11/07/2021 04:35:28 - INFO - __main__ - Step 52293: {'lr': 0.00037056860663941014, 'samples': 10040256, 'steps': 52292, 'loss/train': 1.1404080390930176} 11/07/2021 04:35:29 - INFO - __main__ - Step 52294: {'lr': 0.0003705639577970432, 'samples': 10040448, 'steps': 52293, 'loss/train': 1.6178735494613647} 11/07/2021 04:35:29 - INFO - __main__ - Step 52295: {'lr': 0.00037055930890035203, 'samples': 10040640, 'steps': 52294, 'loss/train': 1.131771206855774} 11/07/2021 04:35:30 - INFO - __main__ - Step 52296: {'lr': 0.00037055465994933866, 'samples': 10040832, 'steps': 52295, 'loss/train': 1.2351644039154053} 11/07/2021 04:35:30 - INFO - __main__ - Step 52297: {'lr': 0.00037055001094400523, 'samples': 10041024, 'steps': 52296, 'loss/train': 1.5820939540863037} 11/07/2021 04:35:31 - INFO - __main__ - Step 52298: {'lr': 0.0003705453618843538, 'samples': 10041216, 'steps': 52297, 'loss/train': 1.3582496643066406} 11/07/2021 04:35:31 - INFO - __main__ - Step 52299: {'lr': 0.00037054071277038654, 'samples': 10041408, 'steps': 52298, 'loss/train': 1.574306607246399} 11/07/2021 04:35:31 - INFO - __main__ - Step 52300: {'lr': 0.00037053606360210544, 'samples': 10041600, 'steps': 52299, 'loss/train': 1.3616039752960205} 11/07/2021 04:35:32 - INFO - __main__ - Step 52301: {'lr': 0.00037053141437951264, 'samples': 10041792, 'steps': 52300, 'loss/train': 1.6979727745056152} 11/07/2021 04:35:33 - INFO - __main__ - Step 52302: {'lr': 0.00037052676510261043, 'samples': 10041984, 'steps': 52301, 'loss/train': 1.4811590909957886} 11/07/2021 04:35:33 - INFO - __main__ - Step 52303: {'lr': 0.00037052211577140047, 'samples': 10042176, 'steps': 52302, 'loss/train': 1.540258765220642} 11/07/2021 04:35:33 - INFO - __main__ - Step 52304: {'lr': 0.00037051746638588526, 'samples': 10042368, 'steps': 52303, 'loss/train': 2.055814504623413} 11/07/2021 04:35:34 - INFO - __main__ - Step 52305: {'lr': 0.00037051281694606666, 'samples': 10042560, 'steps': 52304, 'loss/train': 1.7825428247451782} 11/07/2021 04:35:35 - INFO - __main__ - Step 52306: {'lr': 0.00037050816745194686, 'samples': 10042752, 'steps': 52305, 'loss/train': 1.316963791847229} 11/07/2021 04:35:36 - INFO - __main__ - Step 52307: {'lr': 0.00037050351790352795, 'samples': 10042944, 'steps': 52306, 'loss/train': 0.2811761200428009} 11/07/2021 04:35:36 - INFO - __main__ - Step 52308: {'lr': 0.00037049886830081203, 'samples': 10043136, 'steps': 52307, 'loss/train': 2.200223207473755} 11/07/2021 04:35:36 - INFO - __main__ - Step 52309: {'lr': 0.00037049421864380116, 'samples': 10043328, 'steps': 52308, 'loss/train': 1.544974684715271} 11/07/2021 04:35:37 - INFO - __main__ - Step 52310: {'lr': 0.00037048956893249746, 'samples': 10043520, 'steps': 52309, 'loss/train': 1.5108259916305542} 11/07/2021 04:35:37 - INFO - __main__ - Step 52311: {'lr': 0.00037048491916690304, 'samples': 10043712, 'steps': 52310, 'loss/train': 1.7455787658691406} 11/07/2021 04:35:38 - INFO - __main__ - Step 52312: {'lr': 0.00037048026934701997, 'samples': 10043904, 'steps': 52311, 'loss/train': 0.5629734992980957} 11/07/2021 04:35:38 - INFO - __main__ - Step 52313: {'lr': 0.0003704756194728503, 'samples': 10044096, 'steps': 52312, 'loss/train': 2.0434930324554443} 11/07/2021 04:35:39 - INFO - __main__ - Step 52314: {'lr': 0.0003704709695443962, 'samples': 10044288, 'steps': 52313, 'loss/train': 1.4854761362075806} 11/07/2021 04:35:39 - INFO - __main__ - Step 52315: {'lr': 0.00037046631956165975, 'samples': 10044480, 'steps': 52314, 'loss/train': 1.2033756971359253} 11/07/2021 04:35:40 - INFO - __main__ - Step 52316: {'lr': 0.00037046166952464307, 'samples': 10044672, 'steps': 52315, 'loss/train': 1.3256874084472656} 11/07/2021 04:35:41 - INFO - __main__ - Step 52317: {'lr': 0.00037045701943334814, 'samples': 10044864, 'steps': 52316, 'loss/train': 2.1899333000183105} 11/07/2021 04:35:41 - INFO - __main__ - Step 52318: {'lr': 0.0003704523692877772, 'samples': 10045056, 'steps': 52317, 'loss/train': 0.2942962050437927} 11/07/2021 04:35:41 - INFO - __main__ - Step 52319: {'lr': 0.00037044771908793225, 'samples': 10045248, 'steps': 52318, 'loss/train': 1.5847573280334473} 11/07/2021 04:35:42 - INFO - __main__ - Step 52320: {'lr': 0.0003704430688338154, 'samples': 10045440, 'steps': 52319, 'loss/train': 1.1405531167984009} 11/07/2021 04:35:42 - INFO - __main__ - Step 52321: {'lr': 0.0003704384185254288, 'samples': 10045632, 'steps': 52320, 'loss/train': 1.4911326169967651} 11/07/2021 04:35:43 - INFO - __main__ - Step 52322: {'lr': 0.00037043376816277453, 'samples': 10045824, 'steps': 52321, 'loss/train': 1.1727687120437622} 11/07/2021 04:35:44 - INFO - __main__ - Step 52323: {'lr': 0.00037042911774585465, 'samples': 10046016, 'steps': 52322, 'loss/train': 5.766360759735107} 11/07/2021 04:35:44 - INFO - __main__ - Step 52324: {'lr': 0.0003704244672746712, 'samples': 10046208, 'steps': 52323, 'loss/train': 1.3843739032745361} 11/07/2021 04:35:44 - INFO - __main__ - Step 52325: {'lr': 0.00037041981674922644, 'samples': 10046400, 'steps': 52324, 'loss/train': 1.3703349828720093} 11/07/2021 04:35:45 - INFO - __main__ - Step 52326: {'lr': 0.00037041516616952223, 'samples': 10046592, 'steps': 52325, 'loss/train': 1.5113415718078613} 11/07/2021 04:35:45 - INFO - __main__ - Step 52327: {'lr': 0.0003704105155355609, 'samples': 10046784, 'steps': 52326, 'loss/train': 1.5702812671661377} 11/07/2021 04:35:46 - INFO - __main__ - Step 52328: {'lr': 0.0003704058648473445, 'samples': 10046976, 'steps': 52327, 'loss/train': 1.2943845987319946} 11/07/2021 04:35:46 - INFO - __main__ - Step 52329: {'lr': 0.000370401214104875, 'samples': 10047168, 'steps': 52328, 'loss/train': 1.3867039680480957} 11/07/2021 04:35:47 - INFO - __main__ - Step 52330: {'lr': 0.0003703965633081546, 'samples': 10047360, 'steps': 52329, 'loss/train': 1.4289169311523438} 11/07/2021 04:35:47 - INFO - __main__ - Step 52331: {'lr': 0.00037039191245718536, 'samples': 10047552, 'steps': 52330, 'loss/train': 1.3633909225463867} 11/07/2021 04:35:47 - INFO - __main__ - Step 52332: {'lr': 0.00037038726155196934, 'samples': 10047744, 'steps': 52331, 'loss/train': 1.0888973474502563} 11/07/2021 04:35:48 - INFO - __main__ - Step 52333: {'lr': 0.00037038261059250873, 'samples': 10047936, 'steps': 52332, 'loss/train': 1.523136854171753} 11/07/2021 04:35:49 - INFO - __main__ - Step 52334: {'lr': 0.0003703779595788056, 'samples': 10048128, 'steps': 52333, 'loss/train': 1.490991234779358} 11/07/2021 04:35:49 - INFO - __main__ - Step 52335: {'lr': 0.00037037330851086194, 'samples': 10048320, 'steps': 52334, 'loss/train': 1.7871202230453491} 11/07/2021 04:35:49 - INFO - __main__ - Step 52336: {'lr': 0.00037036865738868, 'samples': 10048512, 'steps': 52335, 'loss/train': 1.5942060947418213} 11/07/2021 04:35:50 - INFO - __main__ - Step 52337: {'lr': 0.00037036400621226175, 'samples': 10048704, 'steps': 52336, 'loss/train': 1.4056665897369385} 11/07/2021 04:35:51 - INFO - __main__ - Step 52338: {'lr': 0.00037035935498160933, 'samples': 10048896, 'steps': 52337, 'loss/train': 1.394268274307251} 11/07/2021 04:35:51 - INFO - __main__ - Step 52339: {'lr': 0.00037035470369672484, 'samples': 10049088, 'steps': 52338, 'loss/train': 1.4369909763336182} 11/07/2021 04:35:51 - INFO - __main__ - Step 52340: {'lr': 0.0003703500523576104, 'samples': 10049280, 'steps': 52339, 'loss/train': 1.4380542039871216} 11/07/2021 04:35:52 - INFO - __main__ - Step 52341: {'lr': 0.0003703454009642681, 'samples': 10049472, 'steps': 52340, 'loss/train': 0.6161519289016724} 11/07/2021 04:35:52 - INFO - __main__ - Step 52342: {'lr': 0.0003703407495167, 'samples': 10049664, 'steps': 52341, 'loss/train': 1.1656584739685059} 11/07/2021 04:35:53 - INFO - __main__ - Step 52343: {'lr': 0.0003703360980149082, 'samples': 10049856, 'steps': 52342, 'loss/train': 1.527284860610962} 11/07/2021 04:35:54 - INFO - __main__ - Step 52344: {'lr': 0.00037033144645889487, 'samples': 10050048, 'steps': 52343, 'loss/train': 0.4405560791492462} 11/07/2021 04:35:54 - INFO - __main__ - Step 52345: {'lr': 0.000370326794848662, 'samples': 10050240, 'steps': 52344, 'loss/train': 1.3896318674087524} 11/07/2021 04:35:54 - INFO - __main__ - Step 52346: {'lr': 0.00037032214318421174, 'samples': 10050432, 'steps': 52345, 'loss/train': 1.2713168859481812} 11/07/2021 04:35:55 - INFO - __main__ - Step 52347: {'lr': 0.00037031749146554616, 'samples': 10050624, 'steps': 52346, 'loss/train': 1.679565668106079} 11/07/2021 04:35:56 - INFO - __main__ - Step 52348: {'lr': 0.00037031283969266737, 'samples': 10050816, 'steps': 52347, 'loss/train': 1.7118709087371826} 11/07/2021 04:35:56 - INFO - __main__ - Step 52349: {'lr': 0.0003703081878655775, 'samples': 10051008, 'steps': 52348, 'loss/train': 1.497228980064392} 11/07/2021 04:35:56 - INFO - __main__ - Step 52350: {'lr': 0.00037030353598427866, 'samples': 10051200, 'steps': 52349, 'loss/train': 1.8344635963439941} 11/07/2021 04:35:57 - INFO - __main__ - Step 52351: {'lr': 0.0003702988840487728, 'samples': 10051392, 'steps': 52350, 'loss/train': 1.1844738721847534} 11/07/2021 04:35:57 - INFO - __main__ - Step 52352: {'lr': 0.0003702942320590622, 'samples': 10051584, 'steps': 52351, 'loss/train': 1.0559014081954956} 11/07/2021 04:35:58 - INFO - __main__ - Step 52353: {'lr': 0.00037028958001514886, 'samples': 10051776, 'steps': 52352, 'loss/train': 1.5483312606811523} 11/07/2021 04:35:58 - INFO - __main__ - Step 52354: {'lr': 0.00037028492791703484, 'samples': 10051968, 'steps': 52353, 'loss/train': 5.4565935134887695} 11/07/2021 04:35:59 - INFO - __main__ - Step 52355: {'lr': 0.0003702802757647223, 'samples': 10052160, 'steps': 52354, 'loss/train': 1.3493404388427734} 11/07/2021 04:35:59 - INFO - __main__ - Step 52356: {'lr': 0.0003702756235582134, 'samples': 10052352, 'steps': 52355, 'loss/train': 1.788238525390625} 11/07/2021 04:36:00 - INFO - __main__ - Step 52357: {'lr': 0.00037027097129751016, 'samples': 10052544, 'steps': 52356, 'loss/train': 1.8171045780181885} 11/07/2021 04:36:00 - INFO - __main__ - Step 52358: {'lr': 0.0003702663189826146, 'samples': 10052736, 'steps': 52357, 'loss/train': 1.5824415683746338} 11/07/2021 04:36:01 - INFO - __main__ - Step 52359: {'lr': 0.0003702616666135289, 'samples': 10052928, 'steps': 52358, 'loss/train': 1.1794646978378296} 11/07/2021 04:36:01 - INFO - __main__ - Step 52360: {'lr': 0.0003702570141902552, 'samples': 10053120, 'steps': 52359, 'loss/train': 1.9084322452545166} 11/07/2021 04:36:01 - INFO - __main__ - Step 52361: {'lr': 0.00037025236171279546, 'samples': 10053312, 'steps': 52360, 'loss/train': 1.4402989149093628} 11/07/2021 04:36:02 - INFO - __main__ - Step 52362: {'lr': 0.000370247709181152, 'samples': 10053504, 'steps': 52361, 'loss/train': 1.2373167276382446} 11/07/2021 04:36:02 - INFO - __main__ - Step 52363: {'lr': 0.00037024305659532665, 'samples': 10053696, 'steps': 52362, 'loss/train': 1.5906411409378052} 11/07/2021 04:36:03 - INFO - __main__ - Step 52364: {'lr': 0.00037023840395532167, 'samples': 10053888, 'steps': 52363, 'loss/train': 1.3774484395980835} 11/07/2021 04:36:04 - INFO - __main__ - Step 52365: {'lr': 0.0003702337512611391, 'samples': 10054080, 'steps': 52364, 'loss/train': 1.7504037618637085} 11/07/2021 04:36:04 - INFO - __main__ - Step 52366: {'lr': 0.00037022909851278107, 'samples': 10054272, 'steps': 52365, 'loss/train': 1.125939965248108} 11/07/2021 04:36:04 - INFO - __main__ - Step 52367: {'lr': 0.0003702244457102497, 'samples': 10054464, 'steps': 52366, 'loss/train': 1.6031582355499268} 11/07/2021 04:36:05 - INFO - __main__ - Step 52368: {'lr': 0.000370219792853547, 'samples': 10054656, 'steps': 52367, 'loss/train': 2.110670566558838} 11/07/2021 04:36:06 - INFO - __main__ - Step 52369: {'lr': 0.0003702151399426752, 'samples': 10054848, 'steps': 52368, 'loss/train': 0.9725801348686218} 11/07/2021 04:36:06 - INFO - __main__ - Step 52370: {'lr': 0.0003702104869776362, 'samples': 10055040, 'steps': 52369, 'loss/train': 1.8233392238616943} 11/07/2021 04:36:06 - INFO - __main__ - Step 52371: {'lr': 0.0003702058339584323, 'samples': 10055232, 'steps': 52370, 'loss/train': 1.6103260517120361} 11/07/2021 04:36:07 - INFO - __main__ - Step 52372: {'lr': 0.00037020118088506546, 'samples': 10055424, 'steps': 52371, 'loss/train': 1.3690156936645508} 11/07/2021 04:36:07 - INFO - __main__ - Step 52373: {'lr': 0.0003701965277575378, 'samples': 10055616, 'steps': 52372, 'loss/train': 1.5724278688430786} 11/07/2021 04:36:08 - INFO - __main__ - Step 52374: {'lr': 0.0003701918745758515, 'samples': 10055808, 'steps': 52373, 'loss/train': 1.6084142923355103} 11/07/2021 04:36:08 - INFO - __main__ - Step 52375: {'lr': 0.00037018722134000856, 'samples': 10056000, 'steps': 52374, 'loss/train': 1.272934913635254} 11/07/2021 04:36:09 - INFO - __main__ - Step 52376: {'lr': 0.00037018256805001115, 'samples': 10056192, 'steps': 52375, 'loss/train': 1.0911200046539307} 11/07/2021 04:36:09 - INFO - __main__ - Step 52377: {'lr': 0.00037017791470586126, 'samples': 10056384, 'steps': 52376, 'loss/train': 1.0611250400543213} 11/07/2021 04:36:09 - INFO - __main__ - Step 52378: {'lr': 0.0003701732613075611, 'samples': 10056576, 'steps': 52377, 'loss/train': 1.5138376951217651} 11/07/2021 04:36:11 - INFO - __main__ - Step 52379: {'lr': 0.00037016860785511274, 'samples': 10056768, 'steps': 52378, 'loss/train': 1.3715938329696655} 11/07/2021 04:36:11 - INFO - __main__ - Step 52380: {'lr': 0.00037016395434851825, 'samples': 10056960, 'steps': 52379, 'loss/train': 2.1460390090942383} 11/07/2021 04:36:11 - INFO - __main__ - Step 52381: {'lr': 0.0003701593007877797, 'samples': 10057152, 'steps': 52380, 'loss/train': 1.119550347328186} 11/07/2021 04:36:12 - INFO - __main__ - Step 52382: {'lr': 0.00037015464717289924, 'samples': 10057344, 'steps': 52381, 'loss/train': 0.9674318432807922} 11/07/2021 04:36:12 - INFO - __main__ - Step 52383: {'lr': 0.000370149993503879, 'samples': 10057536, 'steps': 52382, 'loss/train': 1.636091709136963} 11/07/2021 04:36:13 - INFO - __main__ - Step 52384: {'lr': 0.000370145339780721, 'samples': 10057728, 'steps': 52383, 'loss/train': 1.5504209995269775} 11/07/2021 04:36:13 - INFO - __main__ - Step 52385: {'lr': 0.0003701406860034273, 'samples': 10057920, 'steps': 52384, 'loss/train': 1.3792790174484253} 11/07/2021 04:36:14 - INFO - __main__ - Step 52386: {'lr': 0.0003701360321720001, 'samples': 10058112, 'steps': 52385, 'loss/train': 1.3007304668426514} 11/07/2021 04:36:14 - INFO - __main__ - Step 52387: {'lr': 0.0003701313782864415, 'samples': 10058304, 'steps': 52386, 'loss/train': 1.4700512886047363} 11/07/2021 04:36:14 - INFO - __main__ - Step 52388: {'lr': 0.0003701267243467535, 'samples': 10058496, 'steps': 52387, 'loss/train': 1.9685583114624023} 11/07/2021 04:36:15 - INFO - __main__ - Step 52389: {'lr': 0.00037012207035293834, 'samples': 10058688, 'steps': 52388, 'loss/train': 1.2751091718673706} 11/07/2021 04:36:16 - INFO - __main__ - Step 52390: {'lr': 0.00037011741630499796, 'samples': 10058880, 'steps': 52389, 'loss/train': 1.6515552997589111} 11/07/2021 04:36:16 - INFO - __main__ - Step 52391: {'lr': 0.00037011276220293447, 'samples': 10059072, 'steps': 52390, 'loss/train': 1.0018212795257568} 11/07/2021 04:36:16 - INFO - __main__ - Step 52392: {'lr': 0.0003701081080467501, 'samples': 10059264, 'steps': 52391, 'loss/train': 1.524664044380188} 11/07/2021 04:36:17 - INFO - __main__ - Step 52393: {'lr': 0.0003701034538364468, 'samples': 10059456, 'steps': 52392, 'loss/train': 1.8944721221923828} 11/07/2021 04:36:17 - INFO - __main__ - Step 52394: {'lr': 0.0003700987995720269, 'samples': 10059648, 'steps': 52393, 'loss/train': 0.9276136159896851} 11/07/2021 04:36:18 - INFO - __main__ - Step 52395: {'lr': 0.0003700941452534922, 'samples': 10059840, 'steps': 52394, 'loss/train': 1.7017980813980103} 11/07/2021 04:36:19 - INFO - __main__ - Step 52396: {'lr': 0.0003700894908808449, 'samples': 10060032, 'steps': 52395, 'loss/train': 1.5161792039871216} 11/07/2021 04:36:19 - INFO - __main__ - Step 52397: {'lr': 0.0003700848364540872, 'samples': 10060224, 'steps': 52396, 'loss/train': 1.536611557006836} 11/07/2021 04:36:19 - INFO - __main__ - Step 52398: {'lr': 0.0003700801819732211, 'samples': 10060416, 'steps': 52397, 'loss/train': 1.2143956422805786} 11/07/2021 04:36:20 - INFO - __main__ - Step 52399: {'lr': 0.0003700755274382487, 'samples': 10060608, 'steps': 52398, 'loss/train': 1.8598138093948364} 11/07/2021 04:36:21 - INFO - __main__ - Step 52400: {'lr': 0.0003700708728491722, 'samples': 10060800, 'steps': 52399, 'loss/train': 1.3368159532546997} 11/07/2021 04:36:21 - INFO - __main__ - Step 52401: {'lr': 0.0003700662182059936, 'samples': 10060992, 'steps': 52400, 'loss/train': 1.0483607053756714} 11/07/2021 04:36:21 - INFO - __main__ - Step 52402: {'lr': 0.0003700615635087149, 'samples': 10061184, 'steps': 52401, 'loss/train': 1.4616649150848389} 11/07/2021 04:36:22 - INFO - __main__ - Step 52403: {'lr': 0.00037005690875733843, 'samples': 10061376, 'steps': 52402, 'loss/train': 1.5702778100967407} 11/07/2021 04:36:22 - INFO - __main__ - Step 52404: {'lr': 0.00037005225395186616, 'samples': 10061568, 'steps': 52403, 'loss/train': 1.7624890804290771} 11/07/2021 04:36:23 - INFO - __main__ - Step 52405: {'lr': 0.00037004759909230016, 'samples': 10061760, 'steps': 52404, 'loss/train': 1.0354856252670288} 11/07/2021 04:36:24 - INFO - __main__ - Step 52406: {'lr': 0.0003700429441786426, 'samples': 10061952, 'steps': 52405, 'loss/train': 1.3917793035507202} 11/07/2021 04:36:24 - INFO - __main__ - Step 52407: {'lr': 0.0003700382892108955, 'samples': 10062144, 'steps': 52406, 'loss/train': 1.4974435567855835} 11/07/2021 04:36:24 - INFO - __main__ - Step 52408: {'lr': 0.000370033634189061, 'samples': 10062336, 'steps': 52407, 'loss/train': 1.3563601970672607} 11/07/2021 04:36:25 - INFO - __main__ - Step 52409: {'lr': 0.00037002897911314126, 'samples': 10062528, 'steps': 52408, 'loss/train': 1.632383108139038} 11/07/2021 04:36:26 - INFO - __main__ - Step 52410: {'lr': 0.0003700243239831382, 'samples': 10062720, 'steps': 52409, 'loss/train': 1.2046328783035278} 11/07/2021 04:36:26 - INFO - __main__ - Step 52411: {'lr': 0.00037001966879905414, 'samples': 10062912, 'steps': 52410, 'loss/train': 1.6029366254806519} 11/07/2021 04:36:26 - INFO - __main__ - Step 52412: {'lr': 0.00037001501356089103, 'samples': 10063104, 'steps': 52411, 'loss/train': 1.5252598524093628} 11/07/2021 04:36:27 - INFO - __main__ - Step 52413: {'lr': 0.00037001035826865096, 'samples': 10063296, 'steps': 52412, 'loss/train': 1.8335431814193726} 11/07/2021 04:36:27 - INFO - __main__ - Step 52414: {'lr': 0.00037000570292233613, 'samples': 10063488, 'steps': 52413, 'loss/train': 1.4575445652008057} 11/07/2021 04:36:28 - INFO - __main__ - Step 52415: {'lr': 0.00037000104752194857, 'samples': 10063680, 'steps': 52414, 'loss/train': 1.302187204360962} 11/07/2021 04:36:28 - INFO - __main__ - Step 52416: {'lr': 0.0003699963920674905, 'samples': 10063872, 'steps': 52415, 'loss/train': 1.4415488243103027} 11/07/2021 04:36:29 - INFO - __main__ - Step 52417: {'lr': 0.00036999173655896374, 'samples': 10064064, 'steps': 52416, 'loss/train': 1.3341501951217651} 11/07/2021 04:36:29 - INFO - __main__ - Step 52418: {'lr': 0.00036998708099637064, 'samples': 10064256, 'steps': 52417, 'loss/train': 1.7092583179473877} 11/07/2021 04:36:30 - INFO - __main__ - Step 52419: {'lr': 0.00036998242537971315, 'samples': 10064448, 'steps': 52418, 'loss/train': 1.554855227470398} 11/07/2021 04:36:31 - INFO - __main__ - Step 52420: {'lr': 0.00036997776970899344, 'samples': 10064640, 'steps': 52419, 'loss/train': 1.633256196975708} 11/07/2021 04:36:31 - INFO - __main__ - Step 52421: {'lr': 0.0003699731139842136, 'samples': 10064832, 'steps': 52420, 'loss/train': 1.1283069849014282} 11/07/2021 04:36:31 - INFO - __main__ - Step 52422: {'lr': 0.0003699684582053758, 'samples': 10065024, 'steps': 52421, 'loss/train': 0.8390668630599976} 11/07/2021 04:36:32 - INFO - __main__ - Step 52423: {'lr': 0.00036996380237248205, 'samples': 10065216, 'steps': 52422, 'loss/train': 1.277381181716919} 11/07/2021 04:36:32 - INFO - __main__ - Step 52424: {'lr': 0.0003699591464855344, 'samples': 10065408, 'steps': 52423, 'loss/train': 1.3982897996902466} 11/07/2021 04:36:32 - INFO - __main__ - Step 52425: {'lr': 0.00036995449054453503, 'samples': 10065600, 'steps': 52424, 'loss/train': 1.5712449550628662} 11/07/2021 04:36:33 - INFO - __main__ - Step 52426: {'lr': 0.00036994983454948605, 'samples': 10065792, 'steps': 52425, 'loss/train': 1.5282293558120728} 11/07/2021 04:36:34 - INFO - __main__ - Step 52427: {'lr': 0.0003699451785003895, 'samples': 10065984, 'steps': 52426, 'loss/train': 1.2385307550430298} 11/07/2021 04:36:34 - INFO - __main__ - Step 52428: {'lr': 0.0003699405223972475, 'samples': 10066176, 'steps': 52427, 'loss/train': 0.7437629103660583} 11/07/2021 04:36:34 - INFO - __main__ - Step 52429: {'lr': 0.0003699358662400622, 'samples': 10066368, 'steps': 52428, 'loss/train': 1.2149325609207153} 11/07/2021 04:36:35 - INFO - __main__ - Step 52430: {'lr': 0.00036993121002883557, 'samples': 10066560, 'steps': 52429, 'loss/train': 1.6896101236343384} 11/07/2021 04:36:36 - INFO - __main__ - Step 52431: {'lr': 0.0003699265537635698, 'samples': 10066752, 'steps': 52430, 'loss/train': 2.1384401321411133} 11/07/2021 04:36:36 - INFO - __main__ - Step 52432: {'lr': 0.000369921897444267, 'samples': 10066944, 'steps': 52431, 'loss/train': 1.595112681388855} 11/07/2021 04:36:37 - INFO - __main__ - Step 52433: {'lr': 0.00036991724107092927, 'samples': 10067136, 'steps': 52432, 'loss/train': 1.4829649925231934} 11/07/2021 04:36:37 - INFO - __main__ - Step 52434: {'lr': 0.00036991258464355863, 'samples': 10067328, 'steps': 52433, 'loss/train': 2.3356728553771973} 11/07/2021 04:36:37 - INFO - __main__ - Step 52435: {'lr': 0.00036990792816215726, 'samples': 10067520, 'steps': 52434, 'loss/train': 1.679817795753479} 11/07/2021 04:36:38 - INFO - __main__ - Step 52436: {'lr': 0.0003699032716267273, 'samples': 10067712, 'steps': 52435, 'loss/train': 1.3072232007980347} 11/07/2021 04:36:39 - INFO - __main__ - Step 52437: {'lr': 0.00036989861503727064, 'samples': 10067904, 'steps': 52436, 'loss/train': 1.4434219598770142} 11/07/2021 04:36:39 - INFO - __main__ - Step 52438: {'lr': 0.0003698939583937896, 'samples': 10068096, 'steps': 52437, 'loss/train': 1.5289331674575806} 11/07/2021 04:36:39 - INFO - __main__ - Step 52439: {'lr': 0.0003698893016962861, 'samples': 10068288, 'steps': 52438, 'loss/train': 1.2497870922088623} 11/07/2021 04:36:40 - INFO - __main__ - Step 52440: {'lr': 0.00036988464494476243, 'samples': 10068480, 'steps': 52439, 'loss/train': 1.7348291873931885} 11/07/2021 04:36:41 - INFO - __main__ - Step 52441: {'lr': 0.0003698799881392205, 'samples': 10068672, 'steps': 52440, 'loss/train': 1.2171458005905151} 11/07/2021 04:36:41 - INFO - __main__ - Step 52442: {'lr': 0.00036987533127966253, 'samples': 10068864, 'steps': 52441, 'loss/train': 1.4699468612670898} 11/07/2021 04:36:41 - INFO - __main__ - Step 52443: {'lr': 0.0003698706743660907, 'samples': 10069056, 'steps': 52442, 'loss/train': 1.6238996982574463} 11/07/2021 04:36:42 - INFO - __main__ - Step 52444: {'lr': 0.0003698660173985069, 'samples': 10069248, 'steps': 52443, 'loss/train': 1.011279821395874} 11/07/2021 04:36:42 - INFO - __main__ - Step 52445: {'lr': 0.0003698613603769133, 'samples': 10069440, 'steps': 52444, 'loss/train': 1.3448474407196045} 11/07/2021 04:36:43 - INFO - __main__ - Step 52446: {'lr': 0.00036985670330131205, 'samples': 10069632, 'steps': 52445, 'loss/train': 1.0529274940490723} 11/07/2021 04:36:43 - INFO - __main__ - Step 52447: {'lr': 0.0003698520461717052, 'samples': 10069824, 'steps': 52446, 'loss/train': 1.2374534606933594} 11/07/2021 04:36:44 - INFO - __main__ - Step 52448: {'lr': 0.0003698473889880949, 'samples': 10070016, 'steps': 52447, 'loss/train': 1.584261178970337} 11/07/2021 04:36:44 - INFO - __main__ - Step 52449: {'lr': 0.0003698427317504832, 'samples': 10070208, 'steps': 52448, 'loss/train': 1.6233322620391846} 11/07/2021 04:36:44 - INFO - __main__ - Step 52450: {'lr': 0.00036983807445887217, 'samples': 10070400, 'steps': 52449, 'loss/train': 0.7425636053085327} 11/07/2021 04:36:45 - INFO - __main__ - Step 52451: {'lr': 0.00036983341711326403, 'samples': 10070592, 'steps': 52450, 'loss/train': 1.4514048099517822} 11/07/2021 04:36:46 - INFO - __main__ - Step 52452: {'lr': 0.00036982875971366074, 'samples': 10070784, 'steps': 52451, 'loss/train': 1.2515755891799927} 11/07/2021 04:36:46 - INFO - __main__ - Step 52453: {'lr': 0.00036982410226006445, 'samples': 10070976, 'steps': 52452, 'loss/train': 1.7804089784622192} 11/07/2021 04:36:47 - INFO - __main__ - Step 52454: {'lr': 0.0003698194447524773, 'samples': 10071168, 'steps': 52453, 'loss/train': 1.7630228996276855} 11/07/2021 04:36:47 - INFO - __main__ - Step 52455: {'lr': 0.0003698147871909014, 'samples': 10071360, 'steps': 52454, 'loss/train': 1.496080756187439} 11/07/2021 04:36:48 - INFO - __main__ - Step 52456: {'lr': 0.0003698101295753388, 'samples': 10071552, 'steps': 52455, 'loss/train': 0.8906934261322021} 11/07/2021 04:36:48 - INFO - __main__ - Step 52457: {'lr': 0.00036980547190579153, 'samples': 10071744, 'steps': 52456, 'loss/train': 1.3202104568481445} 11/07/2021 04:36:49 - INFO - __main__ - Step 52458: {'lr': 0.0003698008141822618, 'samples': 10071936, 'steps': 52457, 'loss/train': 1.3986629247665405} 11/07/2021 04:36:49 - INFO - __main__ - Step 52459: {'lr': 0.00036979615640475165, 'samples': 10072128, 'steps': 52458, 'loss/train': 1.3181469440460205} 11/07/2021 04:36:49 - INFO - __main__ - Step 52460: {'lr': 0.0003697914985732632, 'samples': 10072320, 'steps': 52459, 'loss/train': 1.538682460784912} 11/07/2021 04:36:50 - INFO - __main__ - Step 52461: {'lr': 0.0003697868406877986, 'samples': 10072512, 'steps': 52460, 'loss/train': 1.3042988777160645} 11/07/2021 04:36:51 - INFO - __main__ - Step 52462: {'lr': 0.00036978218274835993, 'samples': 10072704, 'steps': 52461, 'loss/train': 1.4915683269500732} 11/07/2021 04:36:51 - INFO - __main__ - Step 52463: {'lr': 0.0003697775247549492, 'samples': 10072896, 'steps': 52462, 'loss/train': 1.5328110456466675} 11/07/2021 04:36:51 - INFO - __main__ - Step 52464: {'lr': 0.00036977286670756854, 'samples': 10073088, 'steps': 52463, 'loss/train': 1.7254937887191772} 11/07/2021 04:36:52 - INFO - __main__ - Step 52465: {'lr': 0.00036976820860622005, 'samples': 10073280, 'steps': 52464, 'loss/train': 1.226966142654419} 11/07/2021 04:36:52 - INFO - __main__ - Step 52466: {'lr': 0.00036976355045090594, 'samples': 10073472, 'steps': 52465, 'loss/train': 2.6069347858428955} 11/07/2021 04:36:53 - INFO - __main__ - Step 52467: {'lr': 0.00036975889224162816, 'samples': 10073664, 'steps': 52466, 'loss/train': 1.3844895362854004} 11/07/2021 04:36:53 - INFO - __main__ - Step 52468: {'lr': 0.000369754233978389, 'samples': 10073856, 'steps': 52467, 'loss/train': 1.1404250860214233} 11/07/2021 04:36:54 - INFO - __main__ - Step 52469: {'lr': 0.00036974957566119027, 'samples': 10074048, 'steps': 52468, 'loss/train': 1.866579532623291} 11/07/2021 04:36:54 - INFO - __main__ - Step 52470: {'lr': 0.00036974491729003427, 'samples': 10074240, 'steps': 52469, 'loss/train': 1.5345561504364014} 11/07/2021 04:36:54 - INFO - __main__ - Step 52471: {'lr': 0.00036974025886492306, 'samples': 10074432, 'steps': 52470, 'loss/train': 1.763494849205017} 11/07/2021 04:36:56 - INFO - __main__ - Step 52472: {'lr': 0.00036973560038585876, 'samples': 10074624, 'steps': 52471, 'loss/train': 1.8525594472885132} 11/07/2021 04:36:56 - INFO - __main__ - Step 52473: {'lr': 0.0003697309418528435, 'samples': 10074816, 'steps': 52472, 'loss/train': 1.4374958276748657} 11/07/2021 04:36:56 - INFO - __main__ - Step 52474: {'lr': 0.0003697262832658792, 'samples': 10075008, 'steps': 52473, 'loss/train': 1.283557415008545} 11/07/2021 04:36:57 - INFO - __main__ - Step 52475: {'lr': 0.00036972162462496817, 'samples': 10075200, 'steps': 52474, 'loss/train': 1.6452559232711792} 11/07/2021 04:36:57 - INFO - __main__ - Step 52476: {'lr': 0.0003697169659301124, 'samples': 10075392, 'steps': 52475, 'loss/train': 1.243672490119934} 11/07/2021 04:36:58 - INFO - __main__ - Step 52477: {'lr': 0.000369712307181314, 'samples': 10075584, 'steps': 52476, 'loss/train': 1.765824317932129} 11/07/2021 04:36:58 - INFO - __main__ - Step 52478: {'lr': 0.00036970764837857505, 'samples': 10075776, 'steps': 52477, 'loss/train': 0.7308313250541687} 11/07/2021 04:36:59 - INFO - __main__ - Step 52479: {'lr': 0.0003697029895218978, 'samples': 10075968, 'steps': 52478, 'loss/train': 1.3887279033660889} 11/07/2021 04:36:59 - INFO - __main__ - Step 52480: {'lr': 0.0003696983306112842, 'samples': 10076160, 'steps': 52479, 'loss/train': 1.4204062223434448} 11/07/2021 04:36:59 - INFO - __main__ - Step 52481: {'lr': 0.00036969367164673626, 'samples': 10076352, 'steps': 52480, 'loss/train': 1.1525694131851196} 11/07/2021 04:37:00 - INFO - __main__ - Step 52482: {'lr': 0.0003696890126282563, 'samples': 10076544, 'steps': 52481, 'loss/train': 1.8334134817123413} 11/07/2021 04:37:01 - INFO - __main__ - Step 52483: {'lr': 0.0003696843535558463, 'samples': 10076736, 'steps': 52482, 'loss/train': 1.369024634361267} 11/07/2021 04:37:01 - INFO - __main__ - Step 52484: {'lr': 0.0003696796944295084, 'samples': 10076928, 'steps': 52483, 'loss/train': 1.4560978412628174} 11/07/2021 04:37:01 - INFO - __main__ - Step 52485: {'lr': 0.00036967503524924463, 'samples': 10077120, 'steps': 52484, 'loss/train': 1.6482877731323242} 11/07/2021 04:37:02 - INFO - __main__ - Step 52486: {'lr': 0.00036967037601505715, 'samples': 10077312, 'steps': 52485, 'loss/train': 1.0509904623031616} 11/07/2021 04:37:03 - INFO - __main__ - Step 52487: {'lr': 0.000369665716726948, 'samples': 10077504, 'steps': 52486, 'loss/train': 1.792357325553894} 11/07/2021 04:37:03 - INFO - __main__ - Step 52488: {'lr': 0.0003696610573849194, 'samples': 10077696, 'steps': 52487, 'loss/train': 1.3979697227478027} 11/07/2021 04:37:04 - INFO - __main__ - Step 52489: {'lr': 0.0003696563979889733, 'samples': 10077888, 'steps': 52488, 'loss/train': 1.4037448167800903} 11/07/2021 04:37:04 - INFO - __main__ - Step 52490: {'lr': 0.00036965173853911195, 'samples': 10078080, 'steps': 52489, 'loss/train': 1.6520057916641235} 11/07/2021 04:37:04 - INFO - __main__ - Step 52491: {'lr': 0.0003696470790353373, 'samples': 10078272, 'steps': 52490, 'loss/train': 1.4462602138519287} 11/07/2021 04:37:05 - INFO - __main__ - Step 52492: {'lr': 0.0003696424194776516, 'samples': 10078464, 'steps': 52491, 'loss/train': 1.744876742362976} 11/07/2021 04:37:06 - INFO - __main__ - Step 52493: {'lr': 0.0003696377598660569, 'samples': 10078656, 'steps': 52492, 'loss/train': 1.4057873487472534} 11/07/2021 04:37:06 - INFO - __main__ - Step 52494: {'lr': 0.0003696331002005551, 'samples': 10078848, 'steps': 52493, 'loss/train': 1.3740580081939697} 11/07/2021 04:37:06 - INFO - __main__ - Step 52495: {'lr': 0.00036962844048114856, 'samples': 10079040, 'steps': 52494, 'loss/train': 0.8017370700836182} 11/07/2021 04:37:07 - INFO - __main__ - Step 52496: {'lr': 0.0003696237807078393, 'samples': 10079232, 'steps': 52495, 'loss/train': 1.0897456407546997} 11/07/2021 04:37:08 - INFO - __main__ - Step 52497: {'lr': 0.00036961912088062947, 'samples': 10079424, 'steps': 52496, 'loss/train': 1.2632558345794678} 11/07/2021 04:37:08 - INFO - __main__ - Step 52498: {'lr': 0.00036961446099952104, 'samples': 10079616, 'steps': 52497, 'loss/train': 1.6042799949645996} 11/07/2021 04:37:09 - INFO - __main__ - Step 52499: {'lr': 0.0003696098010645162, 'samples': 10079808, 'steps': 52498, 'loss/train': 1.7075955867767334} 11/07/2021 04:37:09 - INFO - __main__ - Step 52500: {'lr': 0.00036960514107561707, 'samples': 10080000, 'steps': 52499, 'loss/train': 1.529590368270874} 11/07/2021 04:37:09 - INFO - __main__ - Step 52501: {'lr': 0.00036960048103282564, 'samples': 10080192, 'steps': 52500, 'loss/train': 1.487412452697754} 11/07/2021 04:37:10 - INFO - __main__ - Step 52502: {'lr': 0.00036959582093614406, 'samples': 10080384, 'steps': 52501, 'loss/train': 0.9780846834182739} 11/07/2021 04:37:11 - INFO - __main__ - Step 52503: {'lr': 0.00036959116078557453, 'samples': 10080576, 'steps': 52502, 'loss/train': 1.3496427536010742} 11/07/2021 04:37:11 - INFO - __main__ - Step 52504: {'lr': 0.000369586500581119, 'samples': 10080768, 'steps': 52503, 'loss/train': 1.612030029296875} 11/07/2021 04:37:11 - INFO - __main__ - Step 52505: {'lr': 0.00036958184032277974, 'samples': 10080960, 'steps': 52504, 'loss/train': 1.4903218746185303} 11/07/2021 04:37:12 - INFO - __main__ - Step 52506: {'lr': 0.0003695771800105586, 'samples': 10081152, 'steps': 52505, 'loss/train': 1.9057798385620117} 11/07/2021 04:37:12 - INFO - __main__ - Step 52507: {'lr': 0.0003695725196444579, 'samples': 10081344, 'steps': 52506, 'loss/train': 1.758776307106018} 11/07/2021 04:37:13 - INFO - __main__ - Step 52508: {'lr': 0.0003695678592244797, 'samples': 10081536, 'steps': 52507, 'loss/train': 1.4301761388778687} 11/07/2021 04:37:13 - INFO - __main__ - Step 52509: {'lr': 0.00036956319875062604, 'samples': 10081728, 'steps': 52508, 'loss/train': 1.5318809747695923} 11/07/2021 04:37:14 - INFO - __main__ - Step 52510: {'lr': 0.0003695585382228991, 'samples': 10081920, 'steps': 52509, 'loss/train': 1.4332736730575562} 11/07/2021 04:37:14 - INFO - __main__ - Step 52511: {'lr': 0.0003695538776413009, 'samples': 10082112, 'steps': 52510, 'loss/train': 1.3565497398376465} 11/07/2021 04:37:14 - INFO - __main__ - Step 52512: {'lr': 0.0003695492170058335, 'samples': 10082304, 'steps': 52511, 'loss/train': 1.397042989730835} 11/07/2021 04:37:16 - INFO - __main__ - Step 52513: {'lr': 0.0003695445563164991, 'samples': 10082496, 'steps': 52512, 'loss/train': 1.3915115594863892} 11/07/2021 04:37:16 - INFO - __main__ - Step 52514: {'lr': 0.00036953989557329976, 'samples': 10082688, 'steps': 52513, 'loss/train': 1.6850425004959106} 11/07/2021 04:37:16 - INFO - __main__ - Step 52515: {'lr': 0.0003695352347762376, 'samples': 10082880, 'steps': 52514, 'loss/train': 1.2292646169662476} 11/07/2021 04:37:17 - INFO - __main__ - Step 52516: {'lr': 0.00036953057392531474, 'samples': 10083072, 'steps': 52515, 'loss/train': 1.9233672618865967} 11/07/2021 04:37:17 - INFO - __main__ - Step 52517: {'lr': 0.00036952591302053325, 'samples': 10083264, 'steps': 52516, 'loss/train': 1.7152197360992432} 11/07/2021 04:37:18 - INFO - __main__ - Step 52518: {'lr': 0.00036952125206189516, 'samples': 10083456, 'steps': 52517, 'loss/train': 1.3954459428787231} 11/07/2021 04:37:18 - INFO - __main__ - Step 52519: {'lr': 0.00036951659104940274, 'samples': 10083648, 'steps': 52518, 'loss/train': 1.5164927244186401} 11/07/2021 04:37:19 - INFO - __main__ - Step 52520: {'lr': 0.0003695119299830579, 'samples': 10083840, 'steps': 52519, 'loss/train': 0.9250338077545166} 11/07/2021 04:37:19 - INFO - __main__ - Step 52521: {'lr': 0.0003695072688628628, 'samples': 10084032, 'steps': 52520, 'loss/train': 1.6769369840621948} 11/07/2021 04:37:19 - INFO - __main__ - Step 52522: {'lr': 0.00036950260768881963, 'samples': 10084224, 'steps': 52521, 'loss/train': 1.140705943107605} 11/07/2021 04:37:20 - INFO - __main__ - Step 52523: {'lr': 0.00036949794646093045, 'samples': 10084416, 'steps': 52522, 'loss/train': 1.746580958366394} 11/07/2021 04:37:21 - INFO - __main__ - Step 52524: {'lr': 0.00036949328517919735, 'samples': 10084608, 'steps': 52523, 'loss/train': 1.4720600843429565} 11/07/2021 04:37:21 - INFO - __main__ - Step 52525: {'lr': 0.0003694886238436224, 'samples': 10084800, 'steps': 52524, 'loss/train': 1.255598545074463} 11/07/2021 04:37:22 - INFO - __main__ - Step 52526: {'lr': 0.0003694839624542077, 'samples': 10084992, 'steps': 52525, 'loss/train': 1.5190978050231934} 11/07/2021 04:37:22 - INFO - __main__ - Step 52527: {'lr': 0.0003694793010109553, 'samples': 10085184, 'steps': 52526, 'loss/train': 1.9542680978775024} 11/07/2021 04:37:23 - INFO - __main__ - Step 52528: {'lr': 0.00036947463951386743, 'samples': 10085376, 'steps': 52527, 'loss/train': 1.2795751094818115} 11/07/2021 04:37:23 - INFO - __main__ - Step 52529: {'lr': 0.0003694699779629461, 'samples': 10085568, 'steps': 52528, 'loss/train': 0.9457770586013794} 11/07/2021 04:37:24 - INFO - __main__ - Step 52530: {'lr': 0.0003694653163581936, 'samples': 10085760, 'steps': 52529, 'loss/train': 1.4678503274917603} 11/07/2021 04:37:24 - INFO - __main__ - Step 52531: {'lr': 0.0003694606546996117, 'samples': 10085952, 'steps': 52530, 'loss/train': 1.4220609664916992} 11/07/2021 04:37:24 - INFO - __main__ - Step 52532: {'lr': 0.0003694559929872028, 'samples': 10086144, 'steps': 52531, 'loss/train': 0.9893736243247986} 11/07/2021 04:37:26 - INFO - __main__ - Step 52533: {'lr': 0.00036945133122096875, 'samples': 10086336, 'steps': 52532, 'loss/train': 0.9789683222770691} 11/07/2021 04:37:26 - INFO - __main__ - Step 52534: {'lr': 0.0003694466694009118, 'samples': 10086528, 'steps': 52533, 'loss/train': 1.138243317604065} 11/07/2021 04:37:26 - INFO - __main__ - Step 52535: {'lr': 0.00036944200752703405, 'samples': 10086720, 'steps': 52534, 'loss/train': 1.3803220987319946} 11/07/2021 04:37:27 - INFO - __main__ - Step 52536: {'lr': 0.0003694373455993376, 'samples': 10086912, 'steps': 52535, 'loss/train': 1.5448083877563477} 11/07/2021 04:37:27 - INFO - __main__ - Step 52537: {'lr': 0.0003694326836178245, 'samples': 10087104, 'steps': 52536, 'loss/train': 1.1240888833999634} 11/07/2021 04:37:28 - INFO - __main__ - Step 52538: {'lr': 0.0003694280215824969, 'samples': 10087296, 'steps': 52537, 'loss/train': 2.725187301635742} 11/07/2021 04:37:29 - INFO - __main__ - Step 52539: {'lr': 0.0003694233594933568, 'samples': 10087488, 'steps': 52538, 'loss/train': 1.2964894771575928} 11/07/2021 04:37:29 - INFO - __main__ - Step 52540: {'lr': 0.00036941869735040647, 'samples': 10087680, 'steps': 52539, 'loss/train': 0.9070099592208862} 11/07/2021 04:37:29 - INFO - __main__ - Step 52541: {'lr': 0.0003694140351536479, 'samples': 10087872, 'steps': 52540, 'loss/train': 1.3074787855148315} 11/07/2021 04:37:30 - INFO - __main__ - Step 52542: {'lr': 0.00036940937290308315, 'samples': 10088064, 'steps': 52541, 'loss/train': 1.516692042350769} 11/07/2021 04:37:30 - INFO - __main__ - Step 52543: {'lr': 0.0003694047105987144, 'samples': 10088256, 'steps': 52542, 'loss/train': 0.9241036772727966} 11/07/2021 04:37:31 - INFO - __main__ - Step 52544: {'lr': 0.00036940004824054376, 'samples': 10088448, 'steps': 52543, 'loss/train': 1.690172791481018} 11/07/2021 04:37:31 - INFO - __main__ - Step 52545: {'lr': 0.0003693953858285733, 'samples': 10088640, 'steps': 52544, 'loss/train': 1.2397607564926147} 11/07/2021 04:37:32 - INFO - __main__ - Step 52546: {'lr': 0.0003693907233628051, 'samples': 10088832, 'steps': 52545, 'loss/train': 0.42579424381256104} 11/07/2021 04:37:32 - INFO - __main__ - Step 52547: {'lr': 0.00036938606084324123, 'samples': 10089024, 'steps': 52546, 'loss/train': 1.5375990867614746} 11/07/2021 04:37:32 - INFO - __main__ - Step 52548: {'lr': 0.00036938139826988393, 'samples': 10089216, 'steps': 52547, 'loss/train': 1.901663064956665} 11/07/2021 04:37:34 - INFO - __main__ - Step 52549: {'lr': 0.0003693767356427352, 'samples': 10089408, 'steps': 52548, 'loss/train': 1.3865082263946533} 11/07/2021 04:37:34 - INFO - __main__ - Step 52550: {'lr': 0.00036937207296179717, 'samples': 10089600, 'steps': 52549, 'loss/train': 1.1753380298614502} 11/07/2021 04:37:34 - INFO - __main__ - Step 52551: {'lr': 0.0003693674102270719, 'samples': 10089792, 'steps': 52550, 'loss/train': 1.980668306350708} 11/07/2021 04:37:35 - INFO - __main__ - Step 52552: {'lr': 0.0003693627474385615, 'samples': 10089984, 'steps': 52551, 'loss/train': 1.3791366815567017} 11/07/2021 04:37:35 - INFO - __main__ - Step 52553: {'lr': 0.00036935808459626806, 'samples': 10090176, 'steps': 52552, 'loss/train': 1.610526442527771} 11/07/2021 04:37:36 - INFO - __main__ - Step 52554: {'lr': 0.00036935342170019375, 'samples': 10090368, 'steps': 52553, 'loss/train': 1.6922630071640015} 11/07/2021 04:37:37 - INFO - __main__ - Step 52555: {'lr': 0.00036934875875034063, 'samples': 10090560, 'steps': 52554, 'loss/train': 1.4130017757415771} 11/07/2021 04:37:37 - INFO - __main__ - Step 52556: {'lr': 0.0003693440957467108, 'samples': 10090752, 'steps': 52555, 'loss/train': 1.6270310878753662} 11/07/2021 04:37:38 - INFO - __main__ - Step 52557: {'lr': 0.00036933943268930636, 'samples': 10090944, 'steps': 52556, 'loss/train': 1.6018462181091309} 11/07/2021 04:37:38 - INFO - __main__ - Step 52558: {'lr': 0.00036933476957812944, 'samples': 10091136, 'steps': 52557, 'loss/train': 1.3814212083816528} 11/07/2021 04:37:39 - INFO - __main__ - Step 52559: {'lr': 0.0003693301064131821, 'samples': 10091328, 'steps': 52558, 'loss/train': 0.15778601169586182} 11/07/2021 04:37:39 - INFO - __main__ - Step 52560: {'lr': 0.0003693254431944664, 'samples': 10091520, 'steps': 52559, 'loss/train': 1.6924690008163452} 11/07/2021 04:37:40 - INFO - __main__ - Step 52561: {'lr': 0.00036932077992198455, 'samples': 10091712, 'steps': 52560, 'loss/train': 1.5354642868041992} 11/07/2021 04:37:40 - INFO - __main__ - Step 52562: {'lr': 0.0003693161165957386, 'samples': 10091904, 'steps': 52561, 'loss/train': 0.11148352921009064} 11/07/2021 04:37:40 - INFO - __main__ - Step 52563: {'lr': 0.0003693114532157306, 'samples': 10092096, 'steps': 52562, 'loss/train': 0.8067753911018372} 11/07/2021 04:37:41 - INFO - __main__ - Step 52564: {'lr': 0.00036930678978196283, 'samples': 10092288, 'steps': 52563, 'loss/train': 1.5785139799118042} 11/07/2021 04:37:42 - INFO - __main__ - Step 52565: {'lr': 0.00036930212629443716, 'samples': 10092480, 'steps': 52564, 'loss/train': 1.1247729063034058} 11/07/2021 04:37:42 - INFO - __main__ - Step 52566: {'lr': 0.00036929746275315577, 'samples': 10092672, 'steps': 52565, 'loss/train': 1.0086208581924438} 11/07/2021 04:37:43 - INFO - __main__ - Step 52567: {'lr': 0.0003692927991581208, 'samples': 10092864, 'steps': 52566, 'loss/train': 0.7568204998970032} 11/07/2021 04:37:43 - INFO - __main__ - Step 52568: {'lr': 0.0003692881355093344, 'samples': 10093056, 'steps': 52567, 'loss/train': 1.6512137651443481} 11/07/2021 04:37:43 - INFO - __main__ - Step 52569: {'lr': 0.00036928347180679847, 'samples': 10093248, 'steps': 52568, 'loss/train': 0.9752391576766968} 11/07/2021 04:37:44 - INFO - __main__ - Step 52570: {'lr': 0.0003692788080505154, 'samples': 10093440, 'steps': 52569, 'loss/train': 1.4460619688034058} 11/07/2021 04:37:45 - INFO - __main__ - Step 52571: {'lr': 0.0003692741442404871, 'samples': 10093632, 'steps': 52570, 'loss/train': 1.1609233617782593} 11/07/2021 04:37:45 - INFO - __main__ - Step 52572: {'lr': 0.0003692694803767157, 'samples': 10093824, 'steps': 52571, 'loss/train': 1.4413496255874634} 11/07/2021 04:37:45 - INFO - __main__ - Step 52573: {'lr': 0.0003692648164592033, 'samples': 10094016, 'steps': 52572, 'loss/train': 1.7335922718048096} 11/07/2021 04:37:46 - INFO - __main__ - Step 52574: {'lr': 0.00036926015248795195, 'samples': 10094208, 'steps': 52573, 'loss/train': 1.4201934337615967} 11/07/2021 04:37:47 - INFO - __main__ - Step 52575: {'lr': 0.0003692554884629639, 'samples': 10094400, 'steps': 52574, 'loss/train': 1.4277849197387695} 11/07/2021 04:37:47 - INFO - __main__ - Step 52576: {'lr': 0.00036925082438424116, 'samples': 10094592, 'steps': 52575, 'loss/train': 0.9422187209129333} 11/07/2021 04:37:47 - INFO - __main__ - Step 52577: {'lr': 0.00036924616025178585, 'samples': 10094784, 'steps': 52576, 'loss/train': 1.1624577045440674} 11/07/2021 04:37:48 - INFO - __main__ - Step 52578: {'lr': 0.0003692414960656, 'samples': 10094976, 'steps': 52577, 'loss/train': 1.2759878635406494} 11/07/2021 04:37:48 - INFO - __main__ - Step 52579: {'lr': 0.00036923683182568586, 'samples': 10095168, 'steps': 52578, 'loss/train': 1.250462532043457} 11/07/2021 04:37:48 - INFO - __main__ - Step 52580: {'lr': 0.00036923216753204536, 'samples': 10095360, 'steps': 52579, 'loss/train': 1.490509033203125} 11/07/2021 04:37:49 - INFO - __main__ - Step 52581: {'lr': 0.00036922750318468074, 'samples': 10095552, 'steps': 52580, 'loss/train': 1.3488402366638184} 11/07/2021 04:37:50 - INFO - __main__ - Step 52582: {'lr': 0.00036922283878359396, 'samples': 10095744, 'steps': 52581, 'loss/train': 1.675639033317566} 11/07/2021 04:37:50 - INFO - __main__ - Step 52583: {'lr': 0.0003692181743287873, 'samples': 10095936, 'steps': 52582, 'loss/train': 1.3151415586471558} 11/07/2021 04:37:50 - INFO - __main__ - Step 52584: {'lr': 0.0003692135098202628, 'samples': 10096128, 'steps': 52583, 'loss/train': 1.478013515472412} 11/07/2021 04:37:51 - INFO - __main__ - Step 52585: {'lr': 0.0003692088452580225, 'samples': 10096320, 'steps': 52584, 'loss/train': 1.319058895111084} 11/07/2021 04:37:52 - INFO - __main__ - Step 52586: {'lr': 0.00036920418064206845, 'samples': 10096512, 'steps': 52585, 'loss/train': 1.2535852193832397} 11/07/2021 04:37:52 - INFO - __main__ - Step 52587: {'lr': 0.0003691995159724029, 'samples': 10096704, 'steps': 52586, 'loss/train': 1.552321195602417} 11/07/2021 04:37:52 - INFO - __main__ - Step 52588: {'lr': 0.00036919485124902785, 'samples': 10096896, 'steps': 52587, 'loss/train': 1.5046734809875488} 11/07/2021 04:37:53 - INFO - __main__ - Step 52589: {'lr': 0.00036919018647194545, 'samples': 10097088, 'steps': 52588, 'loss/train': 1.6909958124160767} 11/07/2021 04:37:53 - INFO - __main__ - Step 52590: {'lr': 0.0003691855216411578, 'samples': 10097280, 'steps': 52589, 'loss/train': 1.3594661951065063} 11/07/2021 04:37:54 - INFO - __main__ - Step 52591: {'lr': 0.00036918085675666706, 'samples': 10097472, 'steps': 52590, 'loss/train': 1.430137276649475} 11/07/2021 04:37:55 - INFO - __main__ - Step 52592: {'lr': 0.00036917619181847525, 'samples': 10097664, 'steps': 52591, 'loss/train': 1.3334394693374634} 11/07/2021 04:37:55 - INFO - __main__ - Step 52593: {'lr': 0.00036917152682658437, 'samples': 10097856, 'steps': 52592, 'loss/train': 1.7569663524627686} 11/07/2021 04:37:55 - INFO - __main__ - Step 52594: {'lr': 0.0003691668617809968, 'samples': 10098048, 'steps': 52593, 'loss/train': 1.6426643133163452} 11/07/2021 04:37:56 - INFO - __main__ - Step 52595: {'lr': 0.00036916219668171435, 'samples': 10098240, 'steps': 52594, 'loss/train': 1.2272065877914429} 11/07/2021 04:37:57 - INFO - __main__ - Step 52596: {'lr': 0.0003691575315287393, 'samples': 10098432, 'steps': 52595, 'loss/train': 0.9954767823219299} 11/07/2021 04:37:57 - INFO - __main__ - Step 52597: {'lr': 0.00036915286632207374, 'samples': 10098624, 'steps': 52596, 'loss/train': 1.4305698871612549} 11/07/2021 04:37:57 - INFO - __main__ - Step 52598: {'lr': 0.0003691482010617197, 'samples': 10098816, 'steps': 52597, 'loss/train': 1.2820786237716675} 11/07/2021 04:37:58 - INFO - __main__ - Step 52599: {'lr': 0.00036914353574767935, 'samples': 10099008, 'steps': 52598, 'loss/train': 0.8896142244338989} 11/07/2021 04:37:58 - INFO - __main__ - Step 52600: {'lr': 0.0003691388703799547, 'samples': 10099200, 'steps': 52599, 'loss/train': 1.439296841621399} 11/07/2021 04:37:59 - INFO - __main__ - Step 52601: {'lr': 0.00036913420495854793, 'samples': 10099392, 'steps': 52600, 'loss/train': 1.7042537927627563} 11/07/2021 04:37:59 - INFO - __main__ - Step 52602: {'lr': 0.00036912953948346115, 'samples': 10099584, 'steps': 52601, 'loss/train': 1.6592191457748413} 11/07/2021 04:38:00 - INFO - __main__ - Step 52603: {'lr': 0.00036912487395469645, 'samples': 10099776, 'steps': 52602, 'loss/train': 1.6329528093338013} 11/07/2021 04:38:00 - INFO - __main__ - Step 52604: {'lr': 0.0003691202083722559, 'samples': 10099968, 'steps': 52603, 'loss/train': 1.626839518547058} 11/07/2021 04:38:01 - INFO - __main__ - Step 52605: {'lr': 0.0003691155427361416, 'samples': 10100160, 'steps': 52604, 'loss/train': 1.8369107246398926} 11/07/2021 04:38:01 - INFO - __main__ - Step 52606: {'lr': 0.0003691108770463557, 'samples': 10100352, 'steps': 52605, 'loss/train': 1.1976712942123413} 11/07/2021 04:38:02 - INFO - __main__ - Step 52607: {'lr': 0.00036910621130290027, 'samples': 10100544, 'steps': 52606, 'loss/train': 1.3801106214523315} 11/07/2021 04:38:02 - INFO - __main__ - Step 52608: {'lr': 0.0003691015455057775, 'samples': 10100736, 'steps': 52607, 'loss/train': 0.8249280452728271} 11/07/2021 04:38:03 - INFO - __main__ - Step 52609: {'lr': 0.0003690968796549893, 'samples': 10100928, 'steps': 52608, 'loss/train': 1.2240235805511475} 11/07/2021 04:38:03 - INFO - __main__ - Step 52610: {'lr': 0.0003690922137505379, 'samples': 10101120, 'steps': 52609, 'loss/train': 0.8298952579498291} 11/07/2021 04:38:03 - INFO - __main__ - Step 52611: {'lr': 0.00036908754779242545, 'samples': 10101312, 'steps': 52610, 'loss/train': 0.6339261531829834} 11/07/2021 04:38:04 - INFO - __main__ - Step 52612: {'lr': 0.00036908288178065393, 'samples': 10101504, 'steps': 52611, 'loss/train': 1.789779782295227} 11/07/2021 04:38:05 - INFO - __main__ - Step 52613: {'lr': 0.00036907821571522553, 'samples': 10101696, 'steps': 52612, 'loss/train': 1.3123681545257568} 11/07/2021 04:38:05 - INFO - __main__ - Step 52614: {'lr': 0.0003690735495961423, 'samples': 10101888, 'steps': 52613, 'loss/train': 1.3512696027755737} 11/07/2021 04:38:05 - INFO - __main__ - Step 52615: {'lr': 0.0003690688834234064, 'samples': 10102080, 'steps': 52614, 'loss/train': 0.40644317865371704} 11/07/2021 04:38:06 - INFO - __main__ - Step 52616: {'lr': 0.0003690642171970198, 'samples': 10102272, 'steps': 52615, 'loss/train': 1.7856353521347046} 11/07/2021 04:38:07 - INFO - __main__ - Step 52617: {'lr': 0.0003690595509169848, 'samples': 10102464, 'steps': 52616, 'loss/train': 1.9296809434890747} 11/07/2021 04:38:07 - INFO - __main__ - Step 52618: {'lr': 0.00036905488458330337, 'samples': 10102656, 'steps': 52617, 'loss/train': 1.402385950088501} 11/07/2021 04:38:07 - INFO - __main__ - Step 52619: {'lr': 0.00036905021819597767, 'samples': 10102848, 'steps': 52618, 'loss/train': 1.3527421951293945} 11/07/2021 04:38:08 - INFO - __main__ - Step 52620: {'lr': 0.00036904555175500977, 'samples': 10103040, 'steps': 52619, 'loss/train': 1.1639877557754517} 11/07/2021 04:38:08 - INFO - __main__ - Step 52621: {'lr': 0.00036904088526040177, 'samples': 10103232, 'steps': 52620, 'loss/train': 1.4407010078430176} 11/07/2021 04:38:09 - INFO - __main__ - Step 52622: {'lr': 0.00036903621871215575, 'samples': 10103424, 'steps': 52621, 'loss/train': 1.1555688381195068} 11/07/2021 04:38:10 - INFO - __main__ - Step 52623: {'lr': 0.0003690315521102739, 'samples': 10103616, 'steps': 52622, 'loss/train': 1.330957293510437} 11/07/2021 04:38:10 - INFO - __main__ - Step 52624: {'lr': 0.0003690268854547583, 'samples': 10103808, 'steps': 52623, 'loss/train': 1.6458876132965088} 11/07/2021 04:38:10 - INFO - __main__ - Step 52625: {'lr': 0.00036902221874561097, 'samples': 10104000, 'steps': 52624, 'loss/train': 1.416123867034912} 11/07/2021 04:38:11 - INFO - __main__ - Step 52626: {'lr': 0.00036901755198283403, 'samples': 10104192, 'steps': 52625, 'loss/train': 1.401869773864746} 11/07/2021 04:38:12 - INFO - __main__ - Step 52627: {'lr': 0.0003690128851664297, 'samples': 10104384, 'steps': 52626, 'loss/train': 1.1351631879806519} 11/07/2021 04:38:12 - INFO - __main__ - Step 52628: {'lr': 0.0003690082182964, 'samples': 10104576, 'steps': 52627, 'loss/train': 1.4857701063156128} 11/07/2021 04:38:12 - INFO - __main__ - Step 52629: {'lr': 0.00036900355137274696, 'samples': 10104768, 'steps': 52628, 'loss/train': 0.9550331234931946} 11/07/2021 04:38:13 - INFO - __main__ - Step 52630: {'lr': 0.00036899888439547276, 'samples': 10104960, 'steps': 52629, 'loss/train': 1.4199917316436768} 11/07/2021 04:38:13 - INFO - __main__ - Step 52631: {'lr': 0.00036899421736457955, 'samples': 10105152, 'steps': 52630, 'loss/train': 1.479548454284668} 11/07/2021 04:38:14 - INFO - __main__ - Step 52632: {'lr': 0.00036898955028006936, 'samples': 10105344, 'steps': 52631, 'loss/train': 1.6742368936538696} 11/07/2021 04:38:14 - INFO - __main__ - Step 52633: {'lr': 0.0003689848831419443, 'samples': 10105536, 'steps': 52632, 'loss/train': 1.5146318674087524} 11/07/2021 04:38:15 - INFO - __main__ - Step 52634: {'lr': 0.0003689802159502065, 'samples': 10105728, 'steps': 52633, 'loss/train': 1.5963118076324463} 11/07/2021 04:38:15 - INFO - __main__ - Step 52635: {'lr': 0.00036897554870485804, 'samples': 10105920, 'steps': 52634, 'loss/train': 1.5557894706726074} 11/07/2021 04:38:15 - INFO - __main__ - Step 52636: {'lr': 0.000368970881405901, 'samples': 10106112, 'steps': 52635, 'loss/train': 1.2636913061141968} 11/07/2021 04:38:16 - INFO - __main__ - Step 52637: {'lr': 0.0003689662140533376, 'samples': 10106304, 'steps': 52636, 'loss/train': 1.3850836753845215} 11/07/2021 04:38:17 - INFO - __main__ - Step 52638: {'lr': 0.00036896154664716987, 'samples': 10106496, 'steps': 52637, 'loss/train': 1.4937160015106201} 11/07/2021 04:38:17 - INFO - __main__ - Step 52639: {'lr': 0.00036895687918739984, 'samples': 10106688, 'steps': 52638, 'loss/train': 1.4627037048339844} 11/07/2021 04:38:17 - INFO - __main__ - Step 52640: {'lr': 0.0003689522116740296, 'samples': 10106880, 'steps': 52639, 'loss/train': 1.375765323638916} 11/07/2021 04:38:18 - INFO - __main__ - Step 52641: {'lr': 0.0003689475441070615, 'samples': 10107072, 'steps': 52640, 'loss/train': 0.9709281921386719} 11/07/2021 04:38:19 - INFO - __main__ - Step 52642: {'lr': 0.0003689428764864974, 'samples': 10107264, 'steps': 52641, 'loss/train': 1.6289591789245605} 11/07/2021 04:38:19 - INFO - __main__ - Step 52643: {'lr': 0.0003689382088123394, 'samples': 10107456, 'steps': 52642, 'loss/train': 1.2757549285888672} 11/07/2021 04:38:20 - INFO - __main__ - Step 52644: {'lr': 0.0003689335410845898, 'samples': 10107648, 'steps': 52643, 'loss/train': 1.6775438785552979} 11/07/2021 04:38:20 - INFO - __main__ - Step 52645: {'lr': 0.00036892887330325054, 'samples': 10107840, 'steps': 52644, 'loss/train': 1.077232837677002} 11/07/2021 04:38:20 - INFO - __main__ - Step 52646: {'lr': 0.00036892420546832375, 'samples': 10108032, 'steps': 52645, 'loss/train': 0.8511205911636353} 11/07/2021 04:38:21 - INFO - __main__ - Step 52647: {'lr': 0.0003689195375798115, 'samples': 10108224, 'steps': 52646, 'loss/train': 1.0628376007080078} 11/07/2021 04:38:22 - INFO - __main__ - Step 52648: {'lr': 0.00036891486963771603, 'samples': 10108416, 'steps': 52647, 'loss/train': 1.3356902599334717} 11/07/2021 04:38:22 - INFO - __main__ - Step 52649: {'lr': 0.00036891020164203924, 'samples': 10108608, 'steps': 52648, 'loss/train': 1.426182508468628} 11/07/2021 04:38:22 - INFO - __main__ - Step 52650: {'lr': 0.00036890553359278345, 'samples': 10108800, 'steps': 52649, 'loss/train': 0.993011474609375} 11/07/2021 04:38:23 - INFO - __main__ - Step 52651: {'lr': 0.0003689008654899507, 'samples': 10108992, 'steps': 52650, 'loss/train': 1.282455563545227} 11/07/2021 04:38:23 - INFO - __main__ - Step 52652: {'lr': 0.00036889619733354297, 'samples': 10109184, 'steps': 52651, 'loss/train': 1.4460147619247437} 11/07/2021 04:38:24 - INFO - __main__ - Step 52653: {'lr': 0.0003688915291235625, 'samples': 10109376, 'steps': 52652, 'loss/train': 1.548343539237976} 11/07/2021 04:38:24 - INFO - __main__ - Step 52654: {'lr': 0.0003688868608600113, 'samples': 10109568, 'steps': 52653, 'loss/train': 1.2227991819381714} 11/07/2021 04:38:25 - INFO - __main__ - Step 52655: {'lr': 0.00036888219254289147, 'samples': 10109760, 'steps': 52654, 'loss/train': 1.3856556415557861} 11/07/2021 04:38:25 - INFO - __main__ - Step 52656: {'lr': 0.0003688775241722052, 'samples': 10109952, 'steps': 52655, 'loss/train': 1.3090105056762695} 11/07/2021 04:38:25 - INFO - __main__ - Step 52657: {'lr': 0.0003688728557479546, 'samples': 10110144, 'steps': 52656, 'loss/train': 1.471129298210144} 11/07/2021 04:38:27 - INFO - __main__ - Step 52658: {'lr': 0.00036886818727014173, 'samples': 10110336, 'steps': 52657, 'loss/train': 1.2394299507141113} 11/07/2021 04:38:27 - INFO - __main__ - Step 52659: {'lr': 0.0003688635187387686, 'samples': 10110528, 'steps': 52658, 'loss/train': 1.114780068397522} 11/07/2021 04:38:27 - INFO - __main__ - Step 52660: {'lr': 0.0003688588501538375, 'samples': 10110720, 'steps': 52659, 'loss/train': 1.5244759321212769} 11/07/2021 04:38:28 - INFO - __main__ - Step 52661: {'lr': 0.00036885418151535033, 'samples': 10110912, 'steps': 52660, 'loss/train': 1.6582838296890259} 11/07/2021 04:38:28 - INFO - __main__ - Step 52662: {'lr': 0.00036884951282330935, 'samples': 10111104, 'steps': 52661, 'loss/train': 1.3638769388198853} 11/07/2021 04:38:28 - INFO - __main__ - Step 52663: {'lr': 0.00036884484407771664, 'samples': 10111296, 'steps': 52662, 'loss/train': 1.2208144664764404} 11/07/2021 04:38:29 - INFO - __main__ - Step 52664: {'lr': 0.00036884017527857426, 'samples': 10111488, 'steps': 52663, 'loss/train': 1.3330110311508179} 11/07/2021 04:38:30 - INFO - __main__ - Step 52665: {'lr': 0.0003688355064258844, 'samples': 10111680, 'steps': 52664, 'loss/train': 0.10926000028848648} 11/07/2021 04:38:30 - INFO - __main__ - Step 52666: {'lr': 0.00036883083751964896, 'samples': 10111872, 'steps': 52665, 'loss/train': 1.4924814701080322} 11/07/2021 04:38:30 - INFO - __main__ - Step 52667: {'lr': 0.00036882616855987027, 'samples': 10112064, 'steps': 52666, 'loss/train': 1.1043123006820679} 11/07/2021 04:38:31 - INFO - __main__ - Step 52668: {'lr': 0.0003688214995465503, 'samples': 10112256, 'steps': 52667, 'loss/train': 1.3527469635009766} 11/07/2021 04:38:32 - INFO - __main__ - Step 52669: {'lr': 0.00036881683047969115, 'samples': 10112448, 'steps': 52668, 'loss/train': 1.0695163011550903} 11/07/2021 04:38:32 - INFO - __main__ - Step 52670: {'lr': 0.00036881216135929506, 'samples': 10112640, 'steps': 52669, 'loss/train': 1.1192641258239746} 11/07/2021 04:38:33 - INFO - __main__ - Step 52671: {'lr': 0.0003688074921853641, 'samples': 10112832, 'steps': 52670, 'loss/train': 1.308417797088623} 11/07/2021 04:38:33 - INFO - __main__ - Step 52672: {'lr': 0.0003688028229579002, 'samples': 10113024, 'steps': 52671, 'loss/train': 1.0733835697174072} 11/07/2021 04:38:33 - INFO - __main__ - Step 52673: {'lr': 0.0003687981536769056, 'samples': 10113216, 'steps': 52672, 'loss/train': 1.4950209856033325} 11/07/2021 04:38:34 - INFO - __main__ - Step 52674: {'lr': 0.00036879348434238235, 'samples': 10113408, 'steps': 52673, 'loss/train': 1.5509028434753418} 11/07/2021 04:38:35 - INFO - __main__ - Step 52675: {'lr': 0.00036878881495433264, 'samples': 10113600, 'steps': 52674, 'loss/train': 1.8097690343856812} 11/07/2021 04:38:35 - INFO - __main__ - Step 52676: {'lr': 0.0003687841455127585, 'samples': 10113792, 'steps': 52675, 'loss/train': 1.6956031322479248} 11/07/2021 04:38:35 - INFO - __main__ - Step 52677: {'lr': 0.0003687794760176621, 'samples': 10113984, 'steps': 52676, 'loss/train': 1.7950348854064941} 11/07/2021 04:38:36 - INFO - __main__ - Step 52678: {'lr': 0.0003687748064690455, 'samples': 10114176, 'steps': 52677, 'loss/train': 1.0014179944992065} 11/07/2021 04:38:37 - INFO - __main__ - Step 52679: {'lr': 0.0003687701368669108, 'samples': 10114368, 'steps': 52678, 'loss/train': 1.463243842124939} 11/07/2021 04:38:37 - INFO - __main__ - Step 52680: {'lr': 0.0003687654672112601, 'samples': 10114560, 'steps': 52679, 'loss/train': 1.2007852792739868} 11/07/2021 04:38:37 - INFO - __main__ - Step 52681: {'lr': 0.00036876079750209544, 'samples': 10114752, 'steps': 52680, 'loss/train': 1.4676156044006348} 11/07/2021 04:38:38 - INFO - __main__ - Step 52682: {'lr': 0.00036875612773941906, 'samples': 10114944, 'steps': 52681, 'loss/train': 1.2182501554489136} 11/07/2021 04:38:38 - INFO - __main__ - Step 52683: {'lr': 0.00036875145792323303, 'samples': 10115136, 'steps': 52682, 'loss/train': 1.168044090270996} 11/07/2021 04:38:39 - INFO - __main__ - Step 52684: {'lr': 0.0003687467880535394, 'samples': 10115328, 'steps': 52683, 'loss/train': 1.3582491874694824} 11/07/2021 04:38:40 - INFO - __main__ - Step 52685: {'lr': 0.00036874211813034034, 'samples': 10115520, 'steps': 52684, 'loss/train': 1.5834335088729858} 11/07/2021 04:38:40 - INFO - __main__ - Step 52686: {'lr': 0.00036873744815363785, 'samples': 10115712, 'steps': 52685, 'loss/train': 1.6150399446487427} 11/07/2021 04:38:40 - INFO - __main__ - Step 52687: {'lr': 0.0003687327781234341, 'samples': 10115904, 'steps': 52686, 'loss/train': 1.1509486436843872} 11/07/2021 04:38:41 - INFO - __main__ - Step 52688: {'lr': 0.0003687281080397312, 'samples': 10116096, 'steps': 52687, 'loss/train': 1.3488634824752808} 11/07/2021 04:38:42 - INFO - __main__ - Step 52689: {'lr': 0.0003687234379025313, 'samples': 10116288, 'steps': 52688, 'loss/train': 1.3734240531921387} 11/07/2021 04:38:42 - INFO - __main__ - Step 52690: {'lr': 0.00036871876771183635, 'samples': 10116480, 'steps': 52689, 'loss/train': 1.438968300819397} 11/07/2021 04:38:42 - INFO - __main__ - Step 52691: {'lr': 0.0003687140974676486, 'samples': 10116672, 'steps': 52690, 'loss/train': 1.4034918546676636} 11/07/2021 04:38:43 - INFO - __main__ - Step 52692: {'lr': 0.0003687094271699702, 'samples': 10116864, 'steps': 52691, 'loss/train': 0.7095943093299866} 11/07/2021 04:38:43 - INFO - __main__ - Step 52693: {'lr': 0.00036870475681880313, 'samples': 10117056, 'steps': 52692, 'loss/train': 1.2739990949630737} 11/07/2021 04:38:44 - INFO - __main__ - Step 52694: {'lr': 0.00036870008641414945, 'samples': 10117248, 'steps': 52693, 'loss/train': 1.3993279933929443} 11/07/2021 04:38:44 - INFO - __main__ - Step 52695: {'lr': 0.0003686954159560114, 'samples': 10117440, 'steps': 52694, 'loss/train': 0.857010543346405} 11/07/2021 04:38:45 - INFO - __main__ - Step 52696: {'lr': 0.00036869074544439097, 'samples': 10117632, 'steps': 52695, 'loss/train': 1.7480690479278564} 11/07/2021 04:38:45 - INFO - __main__ - Step 52697: {'lr': 0.00036868607487929034, 'samples': 10117824, 'steps': 52696, 'loss/train': 1.7123708724975586} 11/07/2021 04:38:45 - INFO - __main__ - Step 52698: {'lr': 0.00036868140426071165, 'samples': 10118016, 'steps': 52697, 'loss/train': 0.7053462266921997} 11/07/2021 04:38:46 - INFO - __main__ - Step 52699: {'lr': 0.00036867673358865696, 'samples': 10118208, 'steps': 52698, 'loss/train': 1.1842609643936157} 11/07/2021 04:38:47 - INFO - __main__ - Step 52700: {'lr': 0.0003686720628631283, 'samples': 10118400, 'steps': 52699, 'loss/train': 1.9817100763320923} 11/07/2021 04:38:47 - INFO - __main__ - Step 52701: {'lr': 0.0003686673920841278, 'samples': 10118592, 'steps': 52700, 'loss/train': 1.0959868431091309} 11/07/2021 04:38:47 - INFO - __main__ - Step 52702: {'lr': 0.0003686627212516577, 'samples': 10118784, 'steps': 52701, 'loss/train': 1.3468350172042847} 11/07/2021 04:38:48 - INFO - __main__ - Step 52703: {'lr': 0.0003686580503657199, 'samples': 10118976, 'steps': 52702, 'loss/train': 0.9252719879150391} 11/07/2021 04:38:49 - INFO - __main__ - Step 52704: {'lr': 0.00036865337942631674, 'samples': 10119168, 'steps': 52703, 'loss/train': 1.5211286544799805} 11/07/2021 04:38:49 - INFO - __main__ - Step 52705: {'lr': 0.00036864870843345015, 'samples': 10119360, 'steps': 52704, 'loss/train': 1.3844294548034668} 11/07/2021 04:38:50 - INFO - __main__ - Step 52706: {'lr': 0.00036864403738712226, 'samples': 10119552, 'steps': 52705, 'loss/train': 1.3543879985809326} 11/07/2021 04:38:50 - INFO - __main__ - Step 52707: {'lr': 0.00036863936628733524, 'samples': 10119744, 'steps': 52706, 'loss/train': 1.4449284076690674} 11/07/2021 04:38:50 - INFO - __main__ - Step 52708: {'lr': 0.0003686346951340911, 'samples': 10119936, 'steps': 52707, 'loss/train': 1.6860452890396118} 11/07/2021 04:38:51 - INFO - __main__ - Step 52709: {'lr': 0.000368630023927392, 'samples': 10120128, 'steps': 52708, 'loss/train': 1.3681421279907227} 11/07/2021 04:38:52 - INFO - __main__ - Step 52710: {'lr': 0.00036862535266724006, 'samples': 10120320, 'steps': 52709, 'loss/train': 1.2561464309692383} 11/07/2021 04:38:52 - INFO - __main__ - Step 52711: {'lr': 0.0003686206813536374, 'samples': 10120512, 'steps': 52710, 'loss/train': 1.1494148969650269} 11/07/2021 04:38:52 - INFO - __main__ - Step 52712: {'lr': 0.0003686160099865861, 'samples': 10120704, 'steps': 52711, 'loss/train': 1.4577538967132568} 11/07/2021 04:38:53 - INFO - __main__ - Step 52713: {'lr': 0.00036861133856608817, 'samples': 10120896, 'steps': 52712, 'loss/train': 0.9733343124389648} 11/07/2021 04:38:53 - INFO - __main__ - Step 52714: {'lr': 0.0003686066670921459, 'samples': 10121088, 'steps': 52713, 'loss/train': 1.50171959400177} 11/07/2021 04:38:54 - INFO - __main__ - Step 52715: {'lr': 0.00036860199556476125, 'samples': 10121280, 'steps': 52714, 'loss/train': 1.476938247680664} 11/07/2021 04:38:54 - INFO - __main__ - Step 52716: {'lr': 0.0003685973239839364, 'samples': 10121472, 'steps': 52715, 'loss/train': 1.5557302236557007} 11/07/2021 04:38:55 - INFO - __main__ - Step 52717: {'lr': 0.0003685926523496733, 'samples': 10121664, 'steps': 52716, 'loss/train': 1.514423131942749} 11/07/2021 04:38:55 - INFO - __main__ - Step 52718: {'lr': 0.0003685879806619743, 'samples': 10121856, 'steps': 52717, 'loss/train': 1.328827142715454} 11/07/2021 04:38:56 - INFO - __main__ - Step 52719: {'lr': 0.0003685833089208414, 'samples': 10122048, 'steps': 52718, 'loss/train': 1.6780064105987549} 11/07/2021 04:38:57 - INFO - __main__ - Step 52720: {'lr': 0.00036857863712627664, 'samples': 10122240, 'steps': 52719, 'loss/train': 0.09408082067966461} 11/07/2021 04:38:57 - INFO - __main__ - Step 52721: {'lr': 0.0003685739652782822, 'samples': 10122432, 'steps': 52720, 'loss/train': 1.4401472806930542} 11/07/2021 04:38:57 - INFO - __main__ - Step 52722: {'lr': 0.00036856929337686015, 'samples': 10122624, 'steps': 52721, 'loss/train': 1.3808116912841797} 11/07/2021 04:38:58 - INFO - __main__ - Step 52723: {'lr': 0.0003685646214220126, 'samples': 10122816, 'steps': 52722, 'loss/train': 0.9897241592407227} 11/07/2021 04:38:58 - INFO - __main__ - Step 52724: {'lr': 0.00036855994941374165, 'samples': 10123008, 'steps': 52723, 'loss/train': 1.315622091293335} 11/07/2021 04:38:59 - INFO - __main__ - Step 52725: {'lr': 0.0003685552773520495, 'samples': 10123200, 'steps': 52724, 'loss/train': 1.044396162033081} 11/07/2021 04:38:59 - INFO - __main__ - Step 52726: {'lr': 0.0003685506052369381, 'samples': 10123392, 'steps': 52725, 'loss/train': 1.548998236656189} 11/07/2021 04:39:00 - INFO - __main__ - Step 52727: {'lr': 0.00036854593306840955, 'samples': 10123584, 'steps': 52726, 'loss/train': 1.4104607105255127} 11/07/2021 04:39:00 - INFO - __main__ - Step 52728: {'lr': 0.0003685412608464661, 'samples': 10123776, 'steps': 52727, 'loss/train': 1.6745527982711792} 11/07/2021 04:39:00 - INFO - __main__ - Step 52729: {'lr': 0.00036853658857110986, 'samples': 10123968, 'steps': 52728, 'loss/train': 1.0913670063018799} 11/07/2021 04:39:01 - INFO - __main__ - Step 52730: {'lr': 0.0003685319162423428, 'samples': 10124160, 'steps': 52729, 'loss/train': 0.8038442730903625} 11/07/2021 04:39:02 - INFO - __main__ - Step 52731: {'lr': 0.0003685272438601671, 'samples': 10124352, 'steps': 52730, 'loss/train': 1.4698398113250732} 11/07/2021 04:39:02 - INFO - __main__ - Step 52732: {'lr': 0.0003685225714245848, 'samples': 10124544, 'steps': 52731, 'loss/train': 1.162439227104187} 11/07/2021 04:39:03 - INFO - __main__ - Step 52733: {'lr': 0.0003685178989355981, 'samples': 10124736, 'steps': 52732, 'loss/train': 1.3873324394226074} 11/07/2021 04:39:03 - INFO - __main__ - Step 52734: {'lr': 0.00036851322639320903, 'samples': 10124928, 'steps': 52733, 'loss/train': 1.532759666442871} 11/07/2021 04:39:04 - INFO - __main__ - Step 52735: {'lr': 0.00036850855379741984, 'samples': 10125120, 'steps': 52734, 'loss/train': 1.61179780960083} 11/07/2021 04:39:04 - INFO - __main__ - Step 52736: {'lr': 0.0003685038811482324, 'samples': 10125312, 'steps': 52735, 'loss/train': 1.3504177331924438} 11/07/2021 04:39:05 - INFO - __main__ - Step 52737: {'lr': 0.00036849920844564903, 'samples': 10125504, 'steps': 52736, 'loss/train': 1.3645915985107422} 11/07/2021 04:39:05 - INFO - __main__ - Step 52738: {'lr': 0.00036849453568967174, 'samples': 10125696, 'steps': 52737, 'loss/train': 1.734371304512024} 11/07/2021 04:39:05 - INFO - __main__ - Step 52739: {'lr': 0.0003684898628803026, 'samples': 10125888, 'steps': 52738, 'loss/train': 1.2029647827148438} 11/07/2021 04:39:06 - INFO - __main__ - Step 52740: {'lr': 0.00036848519001754374, 'samples': 10126080, 'steps': 52739, 'loss/train': 1.7320870161056519} 11/07/2021 04:39:07 - INFO - __main__ - Step 52741: {'lr': 0.0003684805171013973, 'samples': 10126272, 'steps': 52740, 'loss/train': 1.3814479112625122} 11/07/2021 04:39:07 - INFO - __main__ - Step 52742: {'lr': 0.00036847584413186537, 'samples': 10126464, 'steps': 52741, 'loss/train': 0.9930067658424377} 11/07/2021 04:39:07 - INFO - __main__ - Step 52743: {'lr': 0.0003684711711089501, 'samples': 10126656, 'steps': 52742, 'loss/train': 1.5547336339950562} 11/07/2021 04:39:08 - INFO - __main__ - Step 52744: {'lr': 0.00036846649803265344, 'samples': 10126848, 'steps': 52743, 'loss/train': 1.307753086090088} 11/07/2021 04:39:10 - INFO - __main__ - Step 52745: {'lr': 0.0003684618249029776, 'samples': 10127040, 'steps': 52744, 'loss/train': 1.7585844993591309} 11/07/2021 04:39:10 - INFO - __main__ - Step 52746: {'lr': 0.0003684571517199248, 'samples': 10127232, 'steps': 52745, 'loss/train': 1.3824597597122192} 11/07/2021 04:39:10 - INFO - __main__ - Step 52747: {'lr': 0.000368452478483497, 'samples': 10127424, 'steps': 52746, 'loss/train': 1.4839593172073364} 11/07/2021 04:39:11 - INFO - __main__ - Step 52748: {'lr': 0.0003684478051936964, 'samples': 10127616, 'steps': 52747, 'loss/train': 1.148180603981018} 11/07/2021 04:39:11 - INFO - __main__ - Step 52749: {'lr': 0.0003684431318505249, 'samples': 10127808, 'steps': 52748, 'loss/train': 1.2352585792541504} 11/07/2021 04:39:11 - INFO - __main__ - Step 52750: {'lr': 0.0003684384584539848, 'samples': 10128000, 'steps': 52749, 'loss/train': 1.4004900455474854} 11/07/2021 04:39:12 - INFO - __main__ - Step 52751: {'lr': 0.0003684337850040782, 'samples': 10128192, 'steps': 52750, 'loss/train': 1.7843750715255737} 11/07/2021 04:39:12 - INFO - __main__ - Step 52752: {'lr': 0.00036842911150080716, 'samples': 10128384, 'steps': 52751, 'loss/train': 1.7915418148040771} 11/07/2021 04:39:13 - INFO - __main__ - Step 52753: {'lr': 0.0003684244379441738, 'samples': 10128576, 'steps': 52752, 'loss/train': 0.08706151694059372} 11/07/2021 04:39:14 - INFO - __main__ - Step 52754: {'lr': 0.00036841976433418024, 'samples': 10128768, 'steps': 52753, 'loss/train': 1.140884280204773} 11/07/2021 04:39:14 - INFO - __main__ - Step 52755: {'lr': 0.0003684150906708285, 'samples': 10128960, 'steps': 52754, 'loss/train': 1.865882396697998} 11/07/2021 04:39:14 - INFO - __main__ - Step 52756: {'lr': 0.00036841041695412076, 'samples': 10129152, 'steps': 52755, 'loss/train': 1.543358564376831} 11/07/2021 04:39:15 - INFO - __main__ - Step 52757: {'lr': 0.00036840574318405914, 'samples': 10129344, 'steps': 52756, 'loss/train': 2.6298458576202393} 11/07/2021 04:39:16 - INFO - __main__ - Step 52758: {'lr': 0.00036840106936064567, 'samples': 10129536, 'steps': 52757, 'loss/train': 1.492418646812439} 11/07/2021 04:39:16 - INFO - __main__ - Step 52759: {'lr': 0.0003683963954838826, 'samples': 10129728, 'steps': 52758, 'loss/train': 1.6637871265411377} 11/07/2021 04:39:16 - INFO - __main__ - Step 52760: {'lr': 0.00036839172155377184, 'samples': 10129920, 'steps': 52759, 'loss/train': 0.6908199787139893} 11/07/2021 04:39:17 - INFO - __main__ - Step 52761: {'lr': 0.0003683870475703156, 'samples': 10130112, 'steps': 52760, 'loss/train': 1.5701454877853394} 11/07/2021 04:39:17 - INFO - __main__ - Step 52762: {'lr': 0.000368382373533516, 'samples': 10130304, 'steps': 52761, 'loss/train': 1.6280708312988281} 11/07/2021 04:39:18 - INFO - __main__ - Step 52763: {'lr': 0.0003683776994433752, 'samples': 10130496, 'steps': 52762, 'loss/train': 1.4984570741653442} 11/07/2021 04:39:18 - INFO - __main__ - Step 52764: {'lr': 0.0003683730252998951, 'samples': 10130688, 'steps': 52763, 'loss/train': 1.3527424335479736} 11/07/2021 04:39:19 - INFO - __main__ - Step 52765: {'lr': 0.00036836835110307803, 'samples': 10130880, 'steps': 52764, 'loss/train': 1.1018459796905518} 11/07/2021 04:39:19 - INFO - __main__ - Step 52766: {'lr': 0.00036836367685292605, 'samples': 10131072, 'steps': 52765, 'loss/train': 1.0537971258163452} 11/07/2021 04:39:19 - INFO - __main__ - Step 52767: {'lr': 0.00036835900254944114, 'samples': 10131264, 'steps': 52766, 'loss/train': 1.1499534845352173} 11/07/2021 04:39:20 - INFO - __main__ - Step 52768: {'lr': 0.0003683543281926255, 'samples': 10131456, 'steps': 52767, 'loss/train': 2.200810670852661} 11/07/2021 04:39:21 - INFO - __main__ - Step 52769: {'lr': 0.0003683496537824813, 'samples': 10131648, 'steps': 52768, 'loss/train': 3.037747621536255} 11/07/2021 04:39:21 - INFO - __main__ - Step 52770: {'lr': 0.0003683449793190105, 'samples': 10131840, 'steps': 52769, 'loss/train': 1.3351248502731323} 11/07/2021 04:39:22 - INFO - __main__ - Step 52771: {'lr': 0.0003683403048022153, 'samples': 10132032, 'steps': 52770, 'loss/train': 1.5194337368011475} 11/07/2021 04:39:22 - INFO - __main__ - Step 52772: {'lr': 0.0003683356302320978, 'samples': 10132224, 'steps': 52771, 'loss/train': 1.1175599098205566} 11/07/2021 04:39:22 - INFO - __main__ - Step 52773: {'lr': 0.00036833095560866007, 'samples': 10132416, 'steps': 52772, 'loss/train': 1.6043848991394043} 11/07/2021 04:39:23 - INFO - __main__ - Step 52774: {'lr': 0.00036832628093190424, 'samples': 10132608, 'steps': 52773, 'loss/train': 1.1120038032531738} 11/07/2021 04:39:24 - INFO - __main__ - Step 52775: {'lr': 0.0003683216062018324, 'samples': 10132800, 'steps': 52774, 'loss/train': 1.2036117315292358} 11/07/2021 04:39:24 - INFO - __main__ - Step 52776: {'lr': 0.0003683169314184467, 'samples': 10132992, 'steps': 52775, 'loss/train': 1.5896140336990356} 11/07/2021 04:39:24 - INFO - __main__ - Step 52777: {'lr': 0.00036831225658174915, 'samples': 10133184, 'steps': 52776, 'loss/train': 1.4444432258605957} 11/07/2021 04:39:25 - INFO - __main__ - Step 52778: {'lr': 0.000368307581691742, 'samples': 10133376, 'steps': 52777, 'loss/train': 1.8758273124694824} 11/07/2021 04:39:26 - INFO - __main__ - Step 52779: {'lr': 0.0003683029067484273, 'samples': 10133568, 'steps': 52778, 'loss/train': 1.510718822479248} 11/07/2021 04:39:26 - INFO - __main__ - Step 52780: {'lr': 0.0003682982317518071, 'samples': 10133760, 'steps': 52779, 'loss/train': 1.6195272207260132} 11/07/2021 04:39:27 - INFO - __main__ - Step 52781: {'lr': 0.00036829355670188355, 'samples': 10133952, 'steps': 52780, 'loss/train': 1.4773942232131958} 11/07/2021 04:39:27 - INFO - __main__ - Step 52782: {'lr': 0.0003682888815986587, 'samples': 10134144, 'steps': 52781, 'loss/train': 1.7710773944854736} 11/07/2021 04:39:27 - INFO - __main__ - Step 52783: {'lr': 0.00036828420644213474, 'samples': 10134336, 'steps': 52782, 'loss/train': 1.5674437284469604} 11/07/2021 04:39:28 - INFO - __main__ - Step 52784: {'lr': 0.00036827953123231373, 'samples': 10134528, 'steps': 52783, 'loss/train': 1.591394305229187} 11/07/2021 04:39:29 - INFO - __main__ - Step 52785: {'lr': 0.00036827485596919773, 'samples': 10134720, 'steps': 52784, 'loss/train': 1.6549326181411743} 11/07/2021 04:39:29 - INFO - __main__ - Step 52786: {'lr': 0.00036827018065278903, 'samples': 10134912, 'steps': 52785, 'loss/train': 1.7973638772964478} 11/07/2021 04:39:29 - INFO - __main__ - Step 52787: {'lr': 0.00036826550528308956, 'samples': 10135104, 'steps': 52786, 'loss/train': 1.8353632688522339} 11/07/2021 04:39:30 - INFO - __main__ - Step 52788: {'lr': 0.00036826082986010145, 'samples': 10135296, 'steps': 52787, 'loss/train': 1.3484896421432495} 11/07/2021 04:39:31 - INFO - __main__ - Step 52789: {'lr': 0.00036825615438382687, 'samples': 10135488, 'steps': 52788, 'loss/train': 1.712052345275879} 11/07/2021 04:39:31 - INFO - __main__ - Step 52790: {'lr': 0.00036825147885426786, 'samples': 10135680, 'steps': 52789, 'loss/train': 1.2510075569152832} 11/07/2021 04:39:31 - INFO - __main__ - Step 52791: {'lr': 0.00036824680327142656, 'samples': 10135872, 'steps': 52790, 'loss/train': 1.3238176107406616} 11/07/2021 04:39:32 - INFO - __main__ - Step 52792: {'lr': 0.0003682421276353051, 'samples': 10136064, 'steps': 52791, 'loss/train': 1.807876467704773} 11/07/2021 04:39:32 - INFO - __main__ - Step 52793: {'lr': 0.0003682374519459056, 'samples': 10136256, 'steps': 52792, 'loss/train': 1.330115556716919} 11/07/2021 04:39:33 - INFO - __main__ - Step 52794: {'lr': 0.00036823277620323, 'samples': 10136448, 'steps': 52793, 'loss/train': 1.3857930898666382} 11/07/2021 04:39:33 - INFO - __main__ - Step 52795: {'lr': 0.00036822810040728065, 'samples': 10136640, 'steps': 52794, 'loss/train': 1.3801594972610474} 11/07/2021 04:39:34 - INFO - __main__ - Step 52796: {'lr': 0.00036822342455805954, 'samples': 10136832, 'steps': 52795, 'loss/train': 1.4747073650360107} 11/07/2021 04:39:34 - INFO - __main__ - Step 52797: {'lr': 0.0003682187486555687, 'samples': 10137024, 'steps': 52796, 'loss/train': 1.7067310810089111} 11/07/2021 04:39:34 - INFO - __main__ - Step 52798: {'lr': 0.0003682140726998104, 'samples': 10137216, 'steps': 52797, 'loss/train': 1.6776131391525269} 11/07/2021 04:39:35 - INFO - __main__ - Step 52799: {'lr': 0.0003682093966907867, 'samples': 10137408, 'steps': 52798, 'loss/train': 1.3789921998977661} 11/07/2021 04:39:36 - INFO - __main__ - Step 52800: {'lr': 0.00036820472062849954, 'samples': 10137600, 'steps': 52799, 'loss/train': 2.107645034790039} 11/07/2021 04:39:36 - INFO - __main__ - Step 52801: {'lr': 0.0003682000445129512, 'samples': 10137792, 'steps': 52800, 'loss/train': 1.6758891344070435} 11/07/2021 04:39:36 - INFO - __main__ - Step 52802: {'lr': 0.00036819536834414374, 'samples': 10137984, 'steps': 52801, 'loss/train': 1.5608279705047607} 11/07/2021 04:39:37 - INFO - __main__ - Step 52803: {'lr': 0.00036819069212207933, 'samples': 10138176, 'steps': 52802, 'loss/train': 1.8600659370422363} 11/07/2021 04:39:38 - INFO - __main__ - Step 52804: {'lr': 0.00036818601584675994, 'samples': 10138368, 'steps': 52803, 'loss/train': 1.3875086307525635} 11/07/2021 04:39:38 - INFO - __main__ - Step 52805: {'lr': 0.0003681813395181878, 'samples': 10138560, 'steps': 52804, 'loss/train': 1.279493808746338} 11/07/2021 04:39:39 - INFO - __main__ - Step 52806: {'lr': 0.000368176663136365, 'samples': 10138752, 'steps': 52805, 'loss/train': 1.4216787815093994} 11/07/2021 04:39:39 - INFO - __main__ - Step 52807: {'lr': 0.00036817198670129357, 'samples': 10138944, 'steps': 52806, 'loss/train': 1.4187105894088745} 11/07/2021 04:39:39 - INFO - __main__ - Step 52808: {'lr': 0.00036816731021297567, 'samples': 10139136, 'steps': 52807, 'loss/train': 1.3704214096069336} 11/07/2021 04:39:40 - INFO - __main__ - Step 52809: {'lr': 0.0003681626336714134, 'samples': 10139328, 'steps': 52808, 'loss/train': 1.5069106817245483} 11/07/2021 04:39:41 - INFO - __main__ - Step 52810: {'lr': 0.00036815795707660886, 'samples': 10139520, 'steps': 52809, 'loss/train': 1.245521903038025} 11/07/2021 04:39:41 - INFO - __main__ - Step 52811: {'lr': 0.00036815328042856424, 'samples': 10139712, 'steps': 52810, 'loss/train': 1.384450078010559} 11/07/2021 04:39:42 - INFO - __main__ - Step 52812: {'lr': 0.0003681486037272815, 'samples': 10139904, 'steps': 52811, 'loss/train': 1.499172568321228} 11/07/2021 04:39:42 - INFO - __main__ - Step 52813: {'lr': 0.0003681439269727629, 'samples': 10140096, 'steps': 52812, 'loss/train': 2.122101068496704} 11/07/2021 04:39:42 - INFO - __main__ - Step 52814: {'lr': 0.00036813925016501036, 'samples': 10140288, 'steps': 52813, 'loss/train': 0.27971822023391724} 11/07/2021 04:39:43 - INFO - __main__ - Step 52815: {'lr': 0.00036813457330402616, 'samples': 10140480, 'steps': 52814, 'loss/train': 0.21748846769332886} 11/07/2021 04:39:44 - INFO - __main__ - Step 52816: {'lr': 0.0003681298963898124, 'samples': 10140672, 'steps': 52815, 'loss/train': 1.3669146299362183} 11/07/2021 04:39:44 - INFO - __main__ - Step 52817: {'lr': 0.000368125219422371, 'samples': 10140864, 'steps': 52816, 'loss/train': 1.6275256872177124} 11/07/2021 04:39:44 - INFO - __main__ - Step 52818: {'lr': 0.00036812054240170427, 'samples': 10141056, 'steps': 52817, 'loss/train': 1.4270051717758179} 11/07/2021 04:39:45 - INFO - __main__ - Step 52819: {'lr': 0.00036811586532781425, 'samples': 10141248, 'steps': 52818, 'loss/train': 1.4802923202514648} 11/07/2021 04:39:46 - INFO - __main__ - Step 52820: {'lr': 0.0003681111882007031, 'samples': 10141440, 'steps': 52819, 'loss/train': 1.1163501739501953} 11/07/2021 04:39:46 - INFO - __main__ - Step 52821: {'lr': 0.0003681065110203728, 'samples': 10141632, 'steps': 52820, 'loss/train': 1.1054524183273315} 11/07/2021 04:39:47 - INFO - __main__ - Step 52822: {'lr': 0.0003681018337868255, 'samples': 10141824, 'steps': 52821, 'loss/train': 1.3694102764129639} 11/07/2021 04:39:47 - INFO - __main__ - Step 52823: {'lr': 0.00036809715650006335, 'samples': 10142016, 'steps': 52822, 'loss/train': 1.2319449186325073} 11/07/2021 04:39:47 - INFO - __main__ - Step 52824: {'lr': 0.0003680924791600885, 'samples': 10142208, 'steps': 52823, 'loss/train': 2.1207361221313477} 11/07/2021 04:39:49 - INFO - __main__ - Step 52825: {'lr': 0.000368087801766903, 'samples': 10142400, 'steps': 52824, 'loss/train': 1.6262270212173462} 11/07/2021 04:39:49 - INFO - __main__ - Step 52826: {'lr': 0.0003680831243205089, 'samples': 10142592, 'steps': 52825, 'loss/train': 1.576689600944519} 11/07/2021 04:39:49 - INFO - __main__ - Step 52827: {'lr': 0.00036807844682090843, 'samples': 10142784, 'steps': 52826, 'loss/train': 1.8751413822174072} 11/07/2021 04:39:50 - INFO - __main__ - Step 52828: {'lr': 0.0003680737692681036, 'samples': 10142976, 'steps': 52827, 'loss/train': 0.7473287582397461} 11/07/2021 04:39:50 - INFO - __main__ - Step 52829: {'lr': 0.0003680690916620966, 'samples': 10143168, 'steps': 52828, 'loss/train': 2.2561402320861816} 11/07/2021 04:39:50 - INFO - __main__ - Step 52830: {'lr': 0.00036806441400288935, 'samples': 10143360, 'steps': 52829, 'loss/train': 1.2570043802261353} 11/07/2021 04:39:52 - INFO - __main__ - Step 52831: {'lr': 0.00036805973629048416, 'samples': 10143552, 'steps': 52830, 'loss/train': 0.8068132400512695} 11/07/2021 04:39:52 - INFO - __main__ - Step 52832: {'lr': 0.0003680550585248831, 'samples': 10143744, 'steps': 52831, 'loss/train': 1.494175910949707} 11/07/2021 04:39:52 - INFO - __main__ - Step 52833: {'lr': 0.0003680503807060883, 'samples': 10143936, 'steps': 52832, 'loss/train': 0.6244137287139893} 11/07/2021 04:39:53 - INFO - __main__ - Step 52834: {'lr': 0.0003680457028341018, 'samples': 10144128, 'steps': 52833, 'loss/train': 1.396943211555481} 11/07/2021 04:39:53 - INFO - __main__ - Step 52835: {'lr': 0.00036804102490892567, 'samples': 10144320, 'steps': 52834, 'loss/train': 0.6147857904434204} 11/07/2021 04:39:53 - INFO - __main__ - Step 52836: {'lr': 0.0003680363469305621, 'samples': 10144512, 'steps': 52835, 'loss/train': 1.2490501403808594} 11/07/2021 04:39:55 - INFO - __main__ - Step 52837: {'lr': 0.00036803166889901316, 'samples': 10144704, 'steps': 52836, 'loss/train': 1.5557249784469604} 11/07/2021 04:39:55 - INFO - __main__ - Step 52838: {'lr': 0.000368026990814281, 'samples': 10144896, 'steps': 52837, 'loss/train': 1.5296154022216797} 11/07/2021 04:39:55 - INFO - __main__ - Step 52839: {'lr': 0.00036802231267636773, 'samples': 10145088, 'steps': 52838, 'loss/train': 1.4874796867370605} 11/07/2021 04:39:56 - INFO - __main__ - Step 52840: {'lr': 0.0003680176344852754, 'samples': 10145280, 'steps': 52839, 'loss/train': 1.9834364652633667} 11/07/2021 04:39:56 - INFO - __main__ - Step 52841: {'lr': 0.00036801295624100616, 'samples': 10145472, 'steps': 52840, 'loss/train': 1.5413490533828735} 11/07/2021 04:39:56 - INFO - __main__ - Step 52842: {'lr': 0.00036800827794356206, 'samples': 10145664, 'steps': 52841, 'loss/train': 0.3478671908378601} 11/07/2021 04:39:57 - INFO - __main__ - Step 52843: {'lr': 0.0003680035995929453, 'samples': 10145856, 'steps': 52842, 'loss/train': 1.8136168718338013} 11/07/2021 04:39:58 - INFO - __main__ - Step 52844: {'lr': 0.00036799892118915785, 'samples': 10146048, 'steps': 52843, 'loss/train': 1.2765828371047974} 11/07/2021 04:39:58 - INFO - __main__ - Step 52845: {'lr': 0.0003679942427322019, 'samples': 10146240, 'steps': 52844, 'loss/train': 1.1193867921829224} 11/07/2021 04:39:58 - INFO - __main__ - Step 52846: {'lr': 0.00036798956422207975, 'samples': 10146432, 'steps': 52845, 'loss/train': 1.0044621229171753} 11/07/2021 04:39:59 - INFO - __main__ - Step 52847: {'lr': 0.0003679848856587932, 'samples': 10146624, 'steps': 52846, 'loss/train': 1.0877169370651245} 11/07/2021 04:40:00 - INFO - __main__ - Step 52848: {'lr': 0.0003679802070423445, 'samples': 10146816, 'steps': 52847, 'loss/train': 1.8449225425720215} 11/07/2021 04:40:00 - INFO - __main__ - Step 52849: {'lr': 0.0003679755283727357, 'samples': 10147008, 'steps': 52848, 'loss/train': 1.5287681818008423} 11/07/2021 04:40:01 - INFO - __main__ - Step 52850: {'lr': 0.0003679708496499689, 'samples': 10147200, 'steps': 52849, 'loss/train': 0.08660674095153809} 11/07/2021 04:40:01 - INFO - __main__ - Step 52851: {'lr': 0.0003679661708740463, 'samples': 10147392, 'steps': 52850, 'loss/train': 1.3134608268737793} 11/07/2021 04:40:01 - INFO - __main__ - Step 52852: {'lr': 0.00036796149204497, 'samples': 10147584, 'steps': 52851, 'loss/train': 1.192000150680542} 11/07/2021 04:40:02 - INFO - __main__ - Step 52853: {'lr': 0.0003679568131627421, 'samples': 10147776, 'steps': 52852, 'loss/train': 1.604162573814392} 11/07/2021 04:40:03 - INFO - __main__ - Step 52854: {'lr': 0.0003679521342273647, 'samples': 10147968, 'steps': 52853, 'loss/train': 1.0740938186645508} 11/07/2021 04:40:03 - INFO - __main__ - Step 52855: {'lr': 0.00036794745523883977, 'samples': 10148160, 'steps': 52854, 'loss/train': 1.07406485080719} 11/07/2021 04:40:03 - INFO - __main__ - Step 52856: {'lr': 0.0003679427761971696, 'samples': 10148352, 'steps': 52855, 'loss/train': 1.535104751586914} 11/07/2021 04:40:04 - INFO - __main__ - Step 52857: {'lr': 0.0003679380971023562, 'samples': 10148544, 'steps': 52856, 'loss/train': 1.2201820611953735} 11/07/2021 04:40:05 - INFO - __main__ - Step 52858: {'lr': 0.00036793341795440175, 'samples': 10148736, 'steps': 52857, 'loss/train': 0.13308845460414886} 11/07/2021 04:40:05 - INFO - __main__ - Step 52859: {'lr': 0.00036792873875330837, 'samples': 10148928, 'steps': 52858, 'loss/train': 1.3101484775543213} 11/07/2021 04:40:05 - INFO - __main__ - Step 52860: {'lr': 0.000367924059499078, 'samples': 10149120, 'steps': 52859, 'loss/train': 1.8716589212417603} 11/07/2021 04:40:06 - INFO - __main__ - Step 52861: {'lr': 0.000367919380191713, 'samples': 10149312, 'steps': 52860, 'loss/train': 1.605707049369812} 11/07/2021 04:40:06 - INFO - __main__ - Step 52862: {'lr': 0.0003679147008312153, 'samples': 10149504, 'steps': 52861, 'loss/train': 1.4423457384109497} 11/07/2021 04:40:07 - INFO - __main__ - Step 52863: {'lr': 0.000367910021417587, 'samples': 10149696, 'steps': 52862, 'loss/train': 1.2574059963226318} 11/07/2021 04:40:08 - INFO - __main__ - Step 52864: {'lr': 0.0003679053419508303, 'samples': 10149888, 'steps': 52863, 'loss/train': 1.2672683000564575} 11/07/2021 04:40:08 - INFO - __main__ - Step 52865: {'lr': 0.0003679006624309472, 'samples': 10150080, 'steps': 52864, 'loss/train': 1.4139201641082764} 11/07/2021 04:40:08 - INFO - __main__ - Step 52866: {'lr': 0.00036789598285794003, 'samples': 10150272, 'steps': 52865, 'loss/train': 1.1911715269088745} 11/07/2021 04:40:09 - INFO - __main__ - Step 52867: {'lr': 0.0003678913032318107, 'samples': 10150464, 'steps': 52866, 'loss/train': 1.4001514911651611} 11/07/2021 04:40:10 - INFO - __main__ - Step 52868: {'lr': 0.0003678866235525613, 'samples': 10150656, 'steps': 52867, 'loss/train': 1.3431363105773926} 11/07/2021 04:40:10 - INFO - __main__ - Step 52869: {'lr': 0.00036788194382019406, 'samples': 10150848, 'steps': 52868, 'loss/train': 1.4878185987472534} 11/07/2021 04:40:10 - INFO - __main__ - Step 52870: {'lr': 0.000367877264034711, 'samples': 10151040, 'steps': 52869, 'loss/train': 1.5815105438232422} 11/07/2021 04:40:11 - INFO - __main__ - Step 52871: {'lr': 0.0003678725841961144, 'samples': 10151232, 'steps': 52870, 'loss/train': 1.1997718811035156} 11/07/2021 04:40:11 - INFO - __main__ - Step 52872: {'lr': 0.00036786790430440606, 'samples': 10151424, 'steps': 52871, 'loss/train': 1.1991007328033447} 11/07/2021 04:40:12 - INFO - __main__ - Step 52873: {'lr': 0.0003678632243595883, 'samples': 10151616, 'steps': 52872, 'loss/train': 1.9988185167312622} 11/07/2021 04:40:12 - INFO - __main__ - Step 52874: {'lr': 0.0003678585443616632, 'samples': 10151808, 'steps': 52873, 'loss/train': 1.3749264478683472} 11/07/2021 04:40:13 - INFO - __main__ - Step 52875: {'lr': 0.0003678538643106329, 'samples': 10152000, 'steps': 52874, 'loss/train': 1.6679110527038574} 11/07/2021 04:40:13 - INFO - __main__ - Step 52876: {'lr': 0.0003678491842064995, 'samples': 10152192, 'steps': 52875, 'loss/train': 1.896567702293396} 11/07/2021 04:40:13 - INFO - __main__ - Step 52877: {'lr': 0.00036784450404926493, 'samples': 10152384, 'steps': 52876, 'loss/train': 1.790103554725647} 11/07/2021 04:40:14 - INFO - __main__ - Step 52878: {'lr': 0.00036783982383893155, 'samples': 10152576, 'steps': 52877, 'loss/train': 1.5069408416748047} 11/07/2021 04:40:15 - INFO - __main__ - Step 52879: {'lr': 0.0003678351435755014, 'samples': 10152768, 'steps': 52878, 'loss/train': 1.5018293857574463} 11/07/2021 04:40:15 - INFO - __main__ - Step 52880: {'lr': 0.0003678304632589764, 'samples': 10152960, 'steps': 52879, 'loss/train': 1.562927484512329} 11/07/2021 04:40:15 - INFO - __main__ - Step 52881: {'lr': 0.00036782578288935893, 'samples': 10153152, 'steps': 52880, 'loss/train': 1.2411366701126099} 11/07/2021 04:40:16 - INFO - __main__ - Step 52882: {'lr': 0.000367821102466651, 'samples': 10153344, 'steps': 52881, 'loss/train': 1.8379011154174805} 11/07/2021 04:40:16 - INFO - __main__ - Step 52883: {'lr': 0.0003678164219908546, 'samples': 10153536, 'steps': 52882, 'loss/train': 1.386399269104004} 11/07/2021 04:40:17 - INFO - __main__ - Step 52884: {'lr': 0.00036781174146197207, 'samples': 10153728, 'steps': 52883, 'loss/train': 1.314054012298584} 11/07/2021 04:40:17 - INFO - __main__ - Step 52885: {'lr': 0.00036780706088000524, 'samples': 10153920, 'steps': 52884, 'loss/train': 1.4697747230529785} 11/07/2021 04:40:18 - INFO - __main__ - Step 52886: {'lr': 0.0003678023802449564, 'samples': 10154112, 'steps': 52885, 'loss/train': 1.603981614112854} 11/07/2021 04:40:18 - INFO - __main__ - Step 52887: {'lr': 0.0003677976995568277, 'samples': 10154304, 'steps': 52886, 'loss/train': 3.247936487197876} 11/07/2021 04:40:19 - INFO - __main__ - Step 52888: {'lr': 0.00036779301881562115, 'samples': 10154496, 'steps': 52887, 'loss/train': 1.4492740631103516} 11/07/2021 04:40:20 - INFO - __main__ - Step 52889: {'lr': 0.00036778833802133886, 'samples': 10154688, 'steps': 52888, 'loss/train': 1.2354487180709839} 11/07/2021 04:40:20 - INFO - __main__ - Step 52890: {'lr': 0.000367783657173983, 'samples': 10154880, 'steps': 52889, 'loss/train': 1.3675084114074707} 11/07/2021 04:40:20 - INFO - __main__ - Step 52891: {'lr': 0.0003677789762735556, 'samples': 10155072, 'steps': 52890, 'loss/train': 1.591147541999817} 11/07/2021 04:40:21 - INFO - __main__ - Step 52892: {'lr': 0.0003677742953200588, 'samples': 10155264, 'steps': 52891, 'loss/train': 1.391609787940979} 11/07/2021 04:40:21 - INFO - __main__ - Step 52893: {'lr': 0.0003677696143134948, 'samples': 10155456, 'steps': 52892, 'loss/train': 1.3725422620773315} 11/07/2021 04:40:22 - INFO - __main__ - Step 52894: {'lr': 0.00036776493325386554, 'samples': 10155648, 'steps': 52893, 'loss/train': 1.142369031906128} 11/07/2021 04:40:22 - INFO - __main__ - Step 52895: {'lr': 0.00036776025214117325, 'samples': 10155840, 'steps': 52894, 'loss/train': 1.3131283521652222} 11/07/2021 04:40:23 - INFO - __main__ - Step 52896: {'lr': 0.00036775557097542, 'samples': 10156032, 'steps': 52895, 'loss/train': 1.5847556591033936} 11/07/2021 04:40:23 - INFO - __main__ - Step 52897: {'lr': 0.00036775088975660793, 'samples': 10156224, 'steps': 52896, 'loss/train': 0.18475568294525146} 11/07/2021 04:40:23 - INFO - __main__ - Step 52898: {'lr': 0.0003677462084847391, 'samples': 10156416, 'steps': 52897, 'loss/train': 1.4460605382919312} 11/07/2021 04:40:24 - INFO - __main__ - Step 52899: {'lr': 0.0003677415271598157, 'samples': 10156608, 'steps': 52898, 'loss/train': 1.1791458129882812} 11/07/2021 04:40:25 - INFO - __main__ - Step 52900: {'lr': 0.00036773684578183976, 'samples': 10156800, 'steps': 52899, 'loss/train': 1.582863450050354} 11/07/2021 04:40:25 - INFO - __main__ - Step 52901: {'lr': 0.00036773216435081335, 'samples': 10156992, 'steps': 52900, 'loss/train': 1.208898901939392} 11/07/2021 04:40:25 - INFO - __main__ - Step 52902: {'lr': 0.00036772748286673866, 'samples': 10157184, 'steps': 52901, 'loss/train': 1.6890238523483276} 11/07/2021 04:40:26 - INFO - __main__ - Step 52903: {'lr': 0.00036772280132961786, 'samples': 10157376, 'steps': 52902, 'loss/train': 0.6045779585838318} 11/07/2021 04:40:27 - INFO - __main__ - Step 52904: {'lr': 0.0003677181197394529, 'samples': 10157568, 'steps': 52903, 'loss/train': 1.7396961450576782} 11/07/2021 04:40:27 - INFO - __main__ - Step 52905: {'lr': 0.000367713438096246, 'samples': 10157760, 'steps': 52904, 'loss/train': 1.5900174379348755} 11/07/2021 04:40:28 - INFO - __main__ - Step 52906: {'lr': 0.00036770875639999923, 'samples': 10157952, 'steps': 52905, 'loss/train': 5.787691593170166} 11/07/2021 04:40:28 - INFO - __main__ - Step 52907: {'lr': 0.0003677040746507148, 'samples': 10158144, 'steps': 52906, 'loss/train': 1.204666018486023} 11/07/2021 04:40:28 - INFO - __main__ - Step 52908: {'lr': 0.00036769939284839463, 'samples': 10158336, 'steps': 52907, 'loss/train': 1.6088929176330566} 11/07/2021 04:40:29 - INFO - __main__ - Step 52909: {'lr': 0.000367694710993041, 'samples': 10158528, 'steps': 52908, 'loss/train': 1.2527908086776733} 11/07/2021 04:40:30 - INFO - __main__ - Step 52910: {'lr': 0.00036769002908465585, 'samples': 10158720, 'steps': 52909, 'loss/train': 1.308556318283081} 11/07/2021 04:40:30 - INFO - __main__ - Step 52911: {'lr': 0.0003676853471232415, 'samples': 10158912, 'steps': 52910, 'loss/train': 1.3853533267974854} 11/07/2021 04:40:31 - INFO - __main__ - Step 52912: {'lr': 0.00036768066510879985, 'samples': 10159104, 'steps': 52911, 'loss/train': 1.427247405052185} 11/07/2021 04:40:31 - INFO - __main__ - Step 52913: {'lr': 0.0003676759830413332, 'samples': 10159296, 'steps': 52912, 'loss/train': 1.8365446329116821} 11/07/2021 04:40:31 - INFO - __main__ - Step 52914: {'lr': 0.0003676713009208435, 'samples': 10159488, 'steps': 52913, 'loss/train': 1.6054060459136963} 11/07/2021 04:40:32 - INFO - __main__ - Step 52915: {'lr': 0.000367666618747333, 'samples': 10159680, 'steps': 52914, 'loss/train': 1.307416319847107} 11/07/2021 04:40:33 - INFO - __main__ - Step 52916: {'lr': 0.0003676619365208036, 'samples': 10159872, 'steps': 52915, 'loss/train': 1.5037816762924194} 11/07/2021 04:40:33 - INFO - __main__ - Step 52917: {'lr': 0.0003676572542412576, 'samples': 10160064, 'steps': 52916, 'loss/train': 1.2301470041275024} 11/07/2021 04:40:34 - INFO - __main__ - Step 52918: {'lr': 0.00036765257190869715, 'samples': 10160256, 'steps': 52917, 'loss/train': 1.2114273309707642} 11/07/2021 04:40:34 - INFO - __main__ - Step 52919: {'lr': 0.0003676478895231242, 'samples': 10160448, 'steps': 52918, 'loss/train': 1.2205685377120972} 11/07/2021 04:40:34 - INFO - __main__ - Step 52920: {'lr': 0.00036764320708454094, 'samples': 10160640, 'steps': 52919, 'loss/train': 0.9563878178596497} 11/07/2021 04:40:35 - INFO - __main__ - Step 52921: {'lr': 0.0003676385245929494, 'samples': 10160832, 'steps': 52920, 'loss/train': 1.5602089166641235} 11/07/2021 04:40:36 - INFO - __main__ - Step 52922: {'lr': 0.00036763384204835186, 'samples': 10161024, 'steps': 52921, 'loss/train': 1.5643709897994995} 11/07/2021 04:40:36 - INFO - __main__ - Step 52923: {'lr': 0.0003676291594507503, 'samples': 10161216, 'steps': 52922, 'loss/train': 1.4799211025238037} 11/07/2021 04:40:36 - INFO - __main__ - Step 52924: {'lr': 0.0003676244768001468, 'samples': 10161408, 'steps': 52923, 'loss/train': 1.402815341949463} 11/07/2021 04:40:37 - INFO - __main__ - Step 52925: {'lr': 0.00036761979409654353, 'samples': 10161600, 'steps': 52924, 'loss/train': 1.6642074584960938} 11/07/2021 04:40:38 - INFO - __main__ - Step 52926: {'lr': 0.0003676151113399427, 'samples': 10161792, 'steps': 52925, 'loss/train': 1.6063995361328125} 11/07/2021 04:40:38 - INFO - __main__ - Step 52927: {'lr': 0.0003676104285303463, 'samples': 10161984, 'steps': 52926, 'loss/train': 1.527408480644226} 11/07/2021 04:40:39 - INFO - __main__ - Step 52928: {'lr': 0.00036760574566775634, 'samples': 10162176, 'steps': 52927, 'loss/train': 1.8549025058746338} 11/07/2021 04:40:39 - INFO - __main__ - Step 52929: {'lr': 0.0003676010627521751, 'samples': 10162368, 'steps': 52928, 'loss/train': 0.8605169653892517} 11/07/2021 04:40:39 - INFO - __main__ - Step 52930: {'lr': 0.00036759637978360467, 'samples': 10162560, 'steps': 52929, 'loss/train': 1.3480849266052246} 11/07/2021 04:40:40 - INFO - __main__ - Step 52931: {'lr': 0.00036759169676204705, 'samples': 10162752, 'steps': 52930, 'loss/train': 3.5450997352600098} 11/07/2021 04:40:41 - INFO - __main__ - Step 52932: {'lr': 0.0003675870136875045, 'samples': 10162944, 'steps': 52931, 'loss/train': 1.152413010597229} 11/07/2021 04:40:41 - INFO - __main__ - Step 52933: {'lr': 0.00036758233055997905, 'samples': 10163136, 'steps': 52932, 'loss/train': 1.3054156303405762} 11/07/2021 04:40:42 - INFO - __main__ - Step 52934: {'lr': 0.0003675776473794728, 'samples': 10163328, 'steps': 52933, 'loss/train': 1.5548264980316162} 11/07/2021 04:40:42 - INFO - __main__ - Step 52935: {'lr': 0.00036757296414598786, 'samples': 10163520, 'steps': 52934, 'loss/train': 1.5610015392303467} 11/07/2021 04:40:42 - INFO - __main__ - Step 52936: {'lr': 0.00036756828085952637, 'samples': 10163712, 'steps': 52935, 'loss/train': 3.2426817417144775} 11/07/2021 04:40:43 - INFO - __main__ - Step 52937: {'lr': 0.0003675635975200904, 'samples': 10163904, 'steps': 52936, 'loss/train': 1.5200356245040894} 11/07/2021 04:40:44 - INFO - __main__ - Step 52938: {'lr': 0.0003675589141276821, 'samples': 10164096, 'steps': 52937, 'loss/train': 1.821755051612854} 11/07/2021 04:40:44 - INFO - __main__ - Step 52939: {'lr': 0.0003675542306823036, 'samples': 10164288, 'steps': 52938, 'loss/train': 1.5270503759384155} 11/07/2021 04:40:44 - INFO - __main__ - Step 52940: {'lr': 0.000367549547183957, 'samples': 10164480, 'steps': 52939, 'loss/train': 1.5565789937973022} 11/07/2021 04:40:45 - INFO - __main__ - Step 52941: {'lr': 0.0003675448636326443, 'samples': 10164672, 'steps': 52940, 'loss/train': 1.1866822242736816} 11/07/2021 04:40:46 - INFO - __main__ - Step 52942: {'lr': 0.0003675401800283678, 'samples': 10164864, 'steps': 52941, 'loss/train': 1.5489954948425293} 11/07/2021 04:40:46 - INFO - __main__ - Step 52943: {'lr': 0.0003675354963711294, 'samples': 10165056, 'steps': 52942, 'loss/train': 1.4503802061080933} 11/07/2021 04:40:47 - INFO - __main__ - Step 52944: {'lr': 0.00036753081266093136, 'samples': 10165248, 'steps': 52943, 'loss/train': 1.572939395904541} 11/07/2021 04:40:47 - INFO - __main__ - Step 52945: {'lr': 0.00036752612889777577, 'samples': 10165440, 'steps': 52944, 'loss/train': 1.523209810256958} 11/07/2021 04:40:47 - INFO - __main__ - Step 52946: {'lr': 0.0003675214450816647, 'samples': 10165632, 'steps': 52945, 'loss/train': 1.3453178405761719} 11/07/2021 04:40:48 - INFO - __main__ - Step 52947: {'lr': 0.00036751676121260035, 'samples': 10165824, 'steps': 52946, 'loss/train': 1.3668791055679321} 11/07/2021 04:40:49 - INFO - __main__ - Step 52948: {'lr': 0.00036751207729058465, 'samples': 10166016, 'steps': 52947, 'loss/train': 1.5065885782241821} 11/07/2021 04:40:49 - INFO - __main__ - Step 52949: {'lr': 0.00036750739331561986, 'samples': 10166208, 'steps': 52948, 'loss/train': 1.566748023033142} 11/07/2021 04:40:49 - INFO - __main__ - Step 52950: {'lr': 0.0003675027092877081, 'samples': 10166400, 'steps': 52949, 'loss/train': 1.1874886751174927} 11/07/2021 04:40:50 - INFO - __main__ - Step 52951: {'lr': 0.0003674980252068514, 'samples': 10166592, 'steps': 52950, 'loss/train': 0.9539691209793091} 11/07/2021 04:40:51 - INFO - __main__ - Step 52952: {'lr': 0.0003674933410730519, 'samples': 10166784, 'steps': 52951, 'loss/train': 1.2791460752487183} 11/07/2021 04:40:51 - INFO - __main__ - Step 52953: {'lr': 0.00036748865688631175, 'samples': 10166976, 'steps': 52952, 'loss/train': 1.7817262411117554} 11/07/2021 04:40:51 - INFO - __main__ - Step 52954: {'lr': 0.000367483972646633, 'samples': 10167168, 'steps': 52953, 'loss/train': 1.161556363105774} 11/07/2021 04:40:52 - INFO - __main__ - Step 52955: {'lr': 0.00036747928835401773, 'samples': 10167360, 'steps': 52954, 'loss/train': 0.9506720304489136} 11/07/2021 04:40:52 - INFO - __main__ - Step 52956: {'lr': 0.00036747460400846815, 'samples': 10167552, 'steps': 52955, 'loss/train': 1.0091511011123657} 11/07/2021 04:40:53 - INFO - __main__ - Step 52957: {'lr': 0.00036746991960998635, 'samples': 10167744, 'steps': 52956, 'loss/train': 1.3968636989593506} 11/07/2021 04:40:53 - INFO - __main__ - Step 52958: {'lr': 0.00036746523515857434, 'samples': 10167936, 'steps': 52957, 'loss/train': 1.1523995399475098} 11/07/2021 04:40:54 - INFO - __main__ - Step 52959: {'lr': 0.00036746055065423435, 'samples': 10168128, 'steps': 52958, 'loss/train': 1.3701913356781006} 11/07/2021 04:40:54 - INFO - __main__ - Step 52960: {'lr': 0.0003674558660969685, 'samples': 10168320, 'steps': 52959, 'loss/train': 1.5888440608978271} 11/07/2021 04:40:55 - INFO - __main__ - Step 52961: {'lr': 0.0003674511814867788, 'samples': 10168512, 'steps': 52960, 'loss/train': 1.3518424034118652} 11/07/2021 04:40:56 - INFO - __main__ - Step 52962: {'lr': 0.00036744649682366744, 'samples': 10168704, 'steps': 52961, 'loss/train': 1.3583984375} 11/07/2021 04:40:56 - INFO - __main__ - Step 52963: {'lr': 0.0003674418121076365, 'samples': 10168896, 'steps': 52962, 'loss/train': 1.5291193723678589} 11/07/2021 04:40:57 - INFO - __main__ - Step 52964: {'lr': 0.00036743712733868807, 'samples': 10169088, 'steps': 52963, 'loss/train': 1.021351933479309} 11/07/2021 04:40:57 - INFO - __main__ - Step 52965: {'lr': 0.00036743244251682424, 'samples': 10169280, 'steps': 52964, 'loss/train': 1.406258225440979} 11/07/2021 04:40:57 - INFO - __main__ - Step 52966: {'lr': 0.00036742775764204717, 'samples': 10169472, 'steps': 52965, 'loss/train': 0.17799949645996094} 11/07/2021 04:40:58 - INFO - __main__ - Step 52967: {'lr': 0.000367423072714359, 'samples': 10169664, 'steps': 52966, 'loss/train': 1.4866503477096558} 11/07/2021 04:40:59 - INFO - __main__ - Step 52968: {'lr': 0.00036741838773376187, 'samples': 10169856, 'steps': 52967, 'loss/train': 0.9869881272315979} 11/07/2021 04:40:59 - INFO - __main__ - Step 52969: {'lr': 0.00036741370270025776, 'samples': 10170048, 'steps': 52968, 'loss/train': 1.5774484872817993} 11/07/2021 04:40:59 - INFO - __main__ - Step 52970: {'lr': 0.0003674090176138488, 'samples': 10170240, 'steps': 52969, 'loss/train': 1.9227925539016724} 11/07/2021 04:41:00 - INFO - __main__ - Step 52971: {'lr': 0.0003674043324745372, 'samples': 10170432, 'steps': 52970, 'loss/train': 1.5562046766281128} 11/07/2021 04:41:00 - INFO - __main__ - Step 52972: {'lr': 0.000367399647282325, 'samples': 10170624, 'steps': 52971, 'loss/train': 1.7658380270004272} 11/07/2021 04:41:01 - INFO - __main__ - Step 52973: {'lr': 0.0003673949620372143, 'samples': 10170816, 'steps': 52972, 'loss/train': 1.072238802909851} 11/07/2021 04:41:02 - INFO - __main__ - Step 52974: {'lr': 0.0003673902767392074, 'samples': 10171008, 'steps': 52973, 'loss/train': 1.7072405815124512} 11/07/2021 04:41:02 - INFO - __main__ - Step 52975: {'lr': 0.00036738559138830613, 'samples': 10171200, 'steps': 52974, 'loss/train': 0.4392591416835785} 11/07/2021 04:41:02 - INFO - __main__ - Step 52976: {'lr': 0.0003673809059845127, 'samples': 10171392, 'steps': 52975, 'loss/train': 1.2999823093414307} 11/07/2021 04:41:03 - INFO - __main__ - Step 52977: {'lr': 0.00036737622052782933, 'samples': 10171584, 'steps': 52976, 'loss/train': 1.0242704153060913} 11/07/2021 04:41:04 - INFO - __main__ - Step 52978: {'lr': 0.000367371535018258, 'samples': 10171776, 'steps': 52977, 'loss/train': 0.625713586807251} 11/07/2021 04:41:04 - INFO - __main__ - Step 52979: {'lr': 0.00036736684945580083, 'samples': 10171968, 'steps': 52978, 'loss/train': 1.398547649383545} 11/07/2021 04:41:04 - INFO - __main__ - Step 52980: {'lr': 0.00036736216384046, 'samples': 10172160, 'steps': 52979, 'loss/train': 0.5298481583595276} 11/07/2021 04:41:05 - INFO - __main__ - Step 52981: {'lr': 0.00036735747817223766, 'samples': 10172352, 'steps': 52980, 'loss/train': 1.3302054405212402} 11/07/2021 04:41:05 - INFO - __main__ - Step 52982: {'lr': 0.00036735279245113573, 'samples': 10172544, 'steps': 52981, 'loss/train': 1.6523569822311401} 11/07/2021 04:41:06 - INFO - __main__ - Step 52983: {'lr': 0.0003673481066771565, 'samples': 10172736, 'steps': 52982, 'loss/train': 0.9141687154769897} 11/07/2021 04:41:07 - INFO - __main__ - Step 52984: {'lr': 0.00036734342085030205, 'samples': 10172928, 'steps': 52983, 'loss/train': 1.2575457096099854} 11/07/2021 04:41:07 - INFO - __main__ - Step 52985: {'lr': 0.0003673387349705744, 'samples': 10173120, 'steps': 52984, 'loss/train': 0.15876692533493042} 11/07/2021 04:41:07 - INFO - __main__ - Step 52986: {'lr': 0.00036733404903797575, 'samples': 10173312, 'steps': 52985, 'loss/train': 1.4802007675170898} 11/07/2021 04:41:08 - INFO - __main__ - Step 52987: {'lr': 0.00036732936305250826, 'samples': 10173504, 'steps': 52986, 'loss/train': 0.9805260300636292} 11/07/2021 04:41:09 - INFO - __main__ - Step 52988: {'lr': 0.00036732467701417387, 'samples': 10173696, 'steps': 52987, 'loss/train': 1.862620234489441} 11/07/2021 04:41:10 - INFO - __main__ - Step 52989: {'lr': 0.00036731999092297487, 'samples': 10173888, 'steps': 52988, 'loss/train': 1.2013376951217651} 11/07/2021 04:41:10 - INFO - __main__ - Step 52990: {'lr': 0.0003673153047789132, 'samples': 10174080, 'steps': 52989, 'loss/train': 0.9784761667251587} 11/07/2021 04:41:10 - INFO - __main__ - Step 52991: {'lr': 0.0003673106185819911, 'samples': 10174272, 'steps': 52990, 'loss/train': 1.5166722536087036} 11/07/2021 04:41:11 - INFO - __main__ - Step 52992: {'lr': 0.00036730593233221074, 'samples': 10174464, 'steps': 52991, 'loss/train': 1.002928376197815} 11/07/2021 04:41:11 - INFO - __main__ - Step 52993: {'lr': 0.000367301246029574, 'samples': 10174656, 'steps': 52992, 'loss/train': 0.31574416160583496} 11/07/2021 04:41:12 - INFO - __main__ - Step 52994: {'lr': 0.00036729655967408326, 'samples': 10174848, 'steps': 52993, 'loss/train': 0.4325564205646515} 11/07/2021 04:41:13 - INFO - __main__ - Step 52995: {'lr': 0.00036729187326574043, 'samples': 10175040, 'steps': 52994, 'loss/train': 1.2070051431655884} 11/07/2021 04:41:13 - INFO - __main__ - Step 52996: {'lr': 0.00036728718680454763, 'samples': 10175232, 'steps': 52995, 'loss/train': 1.5072743892669678} 11/07/2021 04:41:13 - INFO - __main__ - Step 52997: {'lr': 0.0003672825002905071, 'samples': 10175424, 'steps': 52996, 'loss/train': 1.7143254280090332} 11/07/2021 04:41:14 - INFO - __main__ - Step 52998: {'lr': 0.0003672778137236209, 'samples': 10175616, 'steps': 52997, 'loss/train': 1.6909080743789673} 11/07/2021 04:41:15 - INFO - __main__ - Step 52999: {'lr': 0.0003672731271038911, 'samples': 10175808, 'steps': 52998, 'loss/train': 1.1389824151992798} 11/07/2021 04:41:15 - INFO - __main__ - Step 53000: {'lr': 0.0003672684404313199, 'samples': 10176000, 'steps': 52999, 'loss/train': 2.1818671226501465} 11/07/2021 04:41:15 - INFO - __main__ - Step 53001: {'lr': 0.00036726375370590926, 'samples': 10176192, 'steps': 53000, 'loss/train': 1.1636219024658203} 11/07/2021 04:41:16 - INFO - __main__ - Step 53002: {'lr': 0.0003672590669276614, 'samples': 10176384, 'steps': 53001, 'loss/train': 1.425121784210205} 11/07/2021 04:41:16 - INFO - __main__ - Step 53003: {'lr': 0.0003672543800965784, 'samples': 10176576, 'steps': 53002, 'loss/train': 1.5879302024841309} 11/07/2021 04:41:17 - INFO - __main__ - Step 53004: {'lr': 0.00036724969321266245, 'samples': 10176768, 'steps': 53003, 'loss/train': 1.3570855855941772} 11/07/2021 04:41:17 - INFO - __main__ - Step 53005: {'lr': 0.0003672450062759156, 'samples': 10176960, 'steps': 53004, 'loss/train': 1.5459725856781006} 11/07/2021 04:41:18 - INFO - __main__ - Step 53006: {'lr': 0.00036724031928633995, 'samples': 10177152, 'steps': 53005, 'loss/train': 1.6546989679336548} 11/07/2021 04:41:18 - INFO - __main__ - Step 53007: {'lr': 0.00036723563224393753, 'samples': 10177344, 'steps': 53006, 'loss/train': 1.8153413534164429} 11/07/2021 04:41:18 - INFO - __main__ - Step 53008: {'lr': 0.0003672309451487106, 'samples': 10177536, 'steps': 53007, 'loss/train': 1.5242239236831665} 11/07/2021 04:41:20 - INFO - __main__ - Step 53009: {'lr': 0.0003672262580006612, 'samples': 10177728, 'steps': 53008, 'loss/train': 1.094207763671875} 11/07/2021 04:41:20 - INFO - __main__ - Step 53010: {'lr': 0.00036722157079979153, 'samples': 10177920, 'steps': 53009, 'loss/train': 1.5206443071365356} 11/07/2021 04:41:21 - INFO - __main__ - Step 53011: {'lr': 0.0003672168835461036, 'samples': 10178112, 'steps': 53010, 'loss/train': 1.6117490530014038} 11/07/2021 04:41:21 - INFO - __main__ - Step 53012: {'lr': 0.00036721219623959956, 'samples': 10178304, 'steps': 53011, 'loss/train': 0.5125022530555725} 11/07/2021 04:41:21 - INFO - __main__ - Step 53013: {'lr': 0.00036720750888028143, 'samples': 10178496, 'steps': 53012, 'loss/train': 0.4820607602596283} 11/07/2021 04:41:22 - INFO - __main__ - Step 53014: {'lr': 0.0003672028214681515, 'samples': 10178688, 'steps': 53013, 'loss/train': 0.36183875799179077} 11/07/2021 04:41:23 - INFO - __main__ - Step 53015: {'lr': 0.00036719813400321174, 'samples': 10178880, 'steps': 53014, 'loss/train': 1.2964993715286255} 11/07/2021 04:41:23 - INFO - __main__ - Step 53016: {'lr': 0.0003671934464854643, 'samples': 10179072, 'steps': 53015, 'loss/train': 1.4271796941757202} 11/07/2021 04:41:23 - INFO - __main__ - Step 53017: {'lr': 0.00036718875891491134, 'samples': 10179264, 'steps': 53016, 'loss/train': 1.3503791093826294} 11/07/2021 04:41:24 - INFO - __main__ - Step 53018: {'lr': 0.0003671840712915549, 'samples': 10179456, 'steps': 53017, 'loss/train': 1.1013667583465576} 11/07/2021 04:41:25 - INFO - __main__ - Step 53019: {'lr': 0.0003671793836153972, 'samples': 10179648, 'steps': 53018, 'loss/train': 1.2425892353057861} 11/07/2021 04:41:25 - INFO - __main__ - Step 53020: {'lr': 0.00036717469588644017, 'samples': 10179840, 'steps': 53019, 'loss/train': 1.2072207927703857} 11/07/2021 04:41:25 - INFO - __main__ - Step 53021: {'lr': 0.000367170008104686, 'samples': 10180032, 'steps': 53020, 'loss/train': 1.719152808189392} 11/07/2021 04:41:26 - INFO - __main__ - Step 53022: {'lr': 0.000367165320270137, 'samples': 10180224, 'steps': 53021, 'loss/train': 1.2317638397216797} 11/07/2021 04:41:26 - INFO - __main__ - Step 53023: {'lr': 0.000367160632382795, 'samples': 10180416, 'steps': 53022, 'loss/train': 0.9830251932144165} 11/07/2021 04:41:27 - INFO - __main__ - Step 53024: {'lr': 0.00036715594444266224, 'samples': 10180608, 'steps': 53023, 'loss/train': 0.9751511216163635} 11/07/2021 04:41:27 - INFO - __main__ - Step 53025: {'lr': 0.0003671512564497408, 'samples': 10180800, 'steps': 53024, 'loss/train': 0.847625195980072} 11/07/2021 04:41:28 - INFO - __main__ - Step 53026: {'lr': 0.0003671465684040328, 'samples': 10180992, 'steps': 53025, 'loss/train': 0.9451130032539368} 11/07/2021 04:41:28 - INFO - __main__ - Step 53027: {'lr': 0.00036714188030554046, 'samples': 10181184, 'steps': 53026, 'loss/train': 1.130821704864502} 11/07/2021 04:41:28 - INFO - __main__ - Step 53028: {'lr': 0.00036713719215426577, 'samples': 10181376, 'steps': 53027, 'loss/train': 1.5467818975448608} 11/07/2021 04:41:29 - INFO - __main__ - Step 53029: {'lr': 0.0003671325039502108, 'samples': 10181568, 'steps': 53028, 'loss/train': 1.7766822576522827} 11/07/2021 04:41:30 - INFO - __main__ - Step 53030: {'lr': 0.0003671278156933778, 'samples': 10181760, 'steps': 53029, 'loss/train': 1.1858930587768555} 11/07/2021 04:41:30 - INFO - __main__ - Step 53031: {'lr': 0.00036712312738376875, 'samples': 10181952, 'steps': 53030, 'loss/train': 1.2770707607269287} 11/07/2021 04:41:30 - INFO - __main__ - Step 53032: {'lr': 0.00036711843902138586, 'samples': 10182144, 'steps': 53031, 'loss/train': 1.662916660308838} 11/07/2021 04:41:31 - INFO - __main__ - Step 53033: {'lr': 0.0003671137506062312, 'samples': 10182336, 'steps': 53032, 'loss/train': 1.1410259008407593} 11/07/2021 04:41:32 - INFO - __main__ - Step 53034: {'lr': 0.000367109062138307, 'samples': 10182528, 'steps': 53033, 'loss/train': 1.4344712495803833} 11/07/2021 04:41:32 - INFO - __main__ - Step 53035: {'lr': 0.00036710437361761513, 'samples': 10182720, 'steps': 53034, 'loss/train': 0.7155401110649109} 11/07/2021 04:41:33 - INFO - __main__ - Step 53036: {'lr': 0.00036709968504415786, 'samples': 10182912, 'steps': 53035, 'loss/train': 1.6942758560180664} 11/07/2021 04:41:33 - INFO - __main__ - Step 53037: {'lr': 0.00036709499641793725, 'samples': 10183104, 'steps': 53036, 'loss/train': 1.1712610721588135} 11/07/2021 04:41:33 - INFO - __main__ - Step 53038: {'lr': 0.00036709030773895545, 'samples': 10183296, 'steps': 53037, 'loss/train': 1.5960693359375} 11/07/2021 04:41:35 - INFO - __main__ - Step 53039: {'lr': 0.0003670856190072146, 'samples': 10183488, 'steps': 53038, 'loss/train': 0.23049531877040863} 11/07/2021 04:41:35 - INFO - __main__ - Step 53040: {'lr': 0.00036708093022271677, 'samples': 10183680, 'steps': 53039, 'loss/train': 1.8371566534042358} 11/07/2021 04:41:35 - INFO - __main__ - Step 53041: {'lr': 0.0003670762413854641, 'samples': 10183872, 'steps': 53040, 'loss/train': 1.4111393690109253} 11/07/2021 04:41:36 - INFO - __main__ - Step 53042: {'lr': 0.0003670715524954587, 'samples': 10184064, 'steps': 53041, 'loss/train': 1.3444184064865112} 11/07/2021 04:41:36 - INFO - __main__ - Step 53043: {'lr': 0.0003670668635527026, 'samples': 10184256, 'steps': 53042, 'loss/train': 1.6000396013259888} 11/07/2021 04:41:37 - INFO - __main__ - Step 53044: {'lr': 0.00036706217455719805, 'samples': 10184448, 'steps': 53043, 'loss/train': 1.6131395101547241} 11/07/2021 04:41:37 - INFO - __main__ - Step 53045: {'lr': 0.000367057485508947, 'samples': 10184640, 'steps': 53044, 'loss/train': 1.2647408246994019} 11/07/2021 04:41:38 - INFO - __main__ - Step 53046: {'lr': 0.0003670527964079517, 'samples': 10184832, 'steps': 53045, 'loss/train': 1.4808870553970337} 11/07/2021 04:41:38 - INFO - __main__ - Step 53047: {'lr': 0.0003670481072542142, 'samples': 10185024, 'steps': 53046, 'loss/train': 1.9649957418441772} 11/07/2021 04:41:38 - INFO - __main__ - Step 53048: {'lr': 0.0003670434180477367, 'samples': 10185216, 'steps': 53047, 'loss/train': 0.874976396560669} 11/07/2021 04:41:39 - INFO - __main__ - Step 53049: {'lr': 0.00036703872878852115, 'samples': 10185408, 'steps': 53048, 'loss/train': 1.363416314125061} 11/07/2021 04:41:40 - INFO - __main__ - Step 53050: {'lr': 0.00036703403947656977, 'samples': 10185600, 'steps': 53049, 'loss/train': 1.3838443756103516} 11/07/2021 04:41:40 - INFO - __main__ - Step 53051: {'lr': 0.0003670293501118847, 'samples': 10185792, 'steps': 53050, 'loss/train': 1.5967652797698975} 11/07/2021 04:41:41 - INFO - __main__ - Step 53052: {'lr': 0.00036702466069446797, 'samples': 10185984, 'steps': 53051, 'loss/train': 1.5647650957107544} 11/07/2021 04:41:41 - INFO - __main__ - Step 53053: {'lr': 0.00036701997122432173, 'samples': 10186176, 'steps': 53052, 'loss/train': 1.5553793907165527} 11/07/2021 04:41:42 - INFO - __main__ - Step 53054: {'lr': 0.00036701528170144813, 'samples': 10186368, 'steps': 53053, 'loss/train': 2.1166625022888184} 11/07/2021 04:41:42 - INFO - __main__ - Step 53055: {'lr': 0.0003670105921258493, 'samples': 10186560, 'steps': 53054, 'loss/train': 1.009533166885376} 11/07/2021 04:41:43 - INFO - __main__ - Step 53056: {'lr': 0.0003670059024975272, 'samples': 10186752, 'steps': 53055, 'loss/train': 1.0486948490142822} 11/07/2021 04:41:43 - INFO - __main__ - Step 53057: {'lr': 0.00036700121281648415, 'samples': 10186944, 'steps': 53056, 'loss/train': 1.3450143337249756} 11/07/2021 04:41:43 - INFO - __main__ - Step 53058: {'lr': 0.000366996523082722, 'samples': 10187136, 'steps': 53057, 'loss/train': 1.5745469331741333} 11/07/2021 04:41:44 - INFO - __main__ - Step 53059: {'lr': 0.00036699183329624315, 'samples': 10187328, 'steps': 53058, 'loss/train': 0.5636777877807617} 11/07/2021 04:41:45 - INFO - __main__ - Step 53060: {'lr': 0.00036698714345704956, 'samples': 10187520, 'steps': 53059, 'loss/train': 1.2411562204360962} 11/07/2021 04:41:45 - INFO - __main__ - Step 53061: {'lr': 0.00036698245356514336, 'samples': 10187712, 'steps': 53060, 'loss/train': 1.4474177360534668} 11/07/2021 04:41:45 - INFO - __main__ - Step 53062: {'lr': 0.0003669777636205267, 'samples': 10187904, 'steps': 53061, 'loss/train': 1.518201470375061} 11/07/2021 04:41:46 - INFO - __main__ - Step 53063: {'lr': 0.00036697307362320165, 'samples': 10188096, 'steps': 53062, 'loss/train': 1.5803409814834595} 11/07/2021 04:41:46 - INFO - __main__ - Step 53064: {'lr': 0.0003669683835731703, 'samples': 10188288, 'steps': 53063, 'loss/train': 1.2509534358978271} 11/07/2021 04:41:47 - INFO - __main__ - Step 53065: {'lr': 0.00036696369347043477, 'samples': 10188480, 'steps': 53064, 'loss/train': 1.2445851564407349} 11/07/2021 04:41:48 - INFO - __main__ - Step 53066: {'lr': 0.00036695900331499735, 'samples': 10188672, 'steps': 53065, 'loss/train': 0.7373975515365601} 11/07/2021 04:41:48 - INFO - __main__ - Step 53067: {'lr': 0.0003669543131068599, 'samples': 10188864, 'steps': 53066, 'loss/train': 1.220491886138916} 11/07/2021 04:41:48 - INFO - __main__ - Step 53068: {'lr': 0.0003669496228460247, 'samples': 10189056, 'steps': 53067, 'loss/train': 1.3528779745101929} 11/07/2021 04:41:49 - INFO - __main__ - Step 53069: {'lr': 0.00036694493253249373, 'samples': 10189248, 'steps': 53068, 'loss/train': 0.6026855707168579} 11/07/2021 04:41:50 - INFO - __main__ - Step 53070: {'lr': 0.0003669402421662692, 'samples': 10189440, 'steps': 53069, 'loss/train': 1.7145239114761353} 11/07/2021 04:41:50 - INFO - __main__ - Step 53071: {'lr': 0.0003669355517473532, 'samples': 10189632, 'steps': 53070, 'loss/train': 2.2936959266662598} 11/07/2021 04:41:50 - INFO - __main__ - Step 53072: {'lr': 0.0003669308612757479, 'samples': 10189824, 'steps': 53071, 'loss/train': 1.1821892261505127} 11/07/2021 04:41:51 - INFO - __main__ - Step 53073: {'lr': 0.0003669261707514553, 'samples': 10190016, 'steps': 53072, 'loss/train': 1.20310640335083} 11/07/2021 04:41:51 - INFO - __main__ - Step 53074: {'lr': 0.0003669214801744776, 'samples': 10190208, 'steps': 53073, 'loss/train': 1.389641284942627} 11/07/2021 04:41:52 - INFO - __main__ - Step 53075: {'lr': 0.0003669167895448169, 'samples': 10190400, 'steps': 53074, 'loss/train': 1.560219645500183} 11/07/2021 04:41:52 - INFO - __main__ - Step 53076: {'lr': 0.0003669120988624752, 'samples': 10190592, 'steps': 53075, 'loss/train': 1.284456729888916} 11/07/2021 04:41:53 - INFO - __main__ - Step 53077: {'lr': 0.0003669074081274548, 'samples': 10190784, 'steps': 53076, 'loss/train': 1.1114064455032349} 11/07/2021 04:41:53 - INFO - __main__ - Step 53078: {'lr': 0.0003669027173397577, 'samples': 10190976, 'steps': 53077, 'loss/train': 1.3795138597488403} 11/07/2021 04:41:54 - INFO - __main__ - Step 53079: {'lr': 0.00036689802649938607, 'samples': 10191168, 'steps': 53078, 'loss/train': 1.1561951637268066} 11/07/2021 04:41:54 - INFO - __main__ - Step 53080: {'lr': 0.00036689333560634195, 'samples': 10191360, 'steps': 53079, 'loss/train': 1.5332540273666382} 11/07/2021 04:41:55 - INFO - __main__ - Step 53081: {'lr': 0.00036688864466062756, 'samples': 10191552, 'steps': 53080, 'loss/train': 1.370802402496338} 11/07/2021 04:41:55 - INFO - __main__ - Step 53082: {'lr': 0.0003668839536622449, 'samples': 10191744, 'steps': 53081, 'loss/train': 1.1882829666137695} 11/07/2021 04:41:56 - INFO - __main__ - Step 53083: {'lr': 0.0003668792626111962, 'samples': 10191936, 'steps': 53082, 'loss/train': 1.2191630601882935} 11/07/2021 04:41:56 - INFO - __main__ - Step 53084: {'lr': 0.0003668745715074834, 'samples': 10192128, 'steps': 53083, 'loss/train': 1.7221304178237915} 11/07/2021 04:41:57 - INFO - __main__ - Step 53085: {'lr': 0.00036686988035110877, 'samples': 10192320, 'steps': 53084, 'loss/train': 1.3087737560272217} 11/07/2021 04:41:57 - INFO - __main__ - Step 53086: {'lr': 0.0003668651891420744, 'samples': 10192512, 'steps': 53085, 'loss/train': 1.5288056135177612} 11/07/2021 04:41:58 - INFO - __main__ - Step 53087: {'lr': 0.0003668604978803823, 'samples': 10192704, 'steps': 53086, 'loss/train': 1.5008461475372314} 11/07/2021 04:41:58 - INFO - __main__ - Step 53088: {'lr': 0.0003668558065660348, 'samples': 10192896, 'steps': 53087, 'loss/train': 1.3333481550216675} 11/07/2021 04:41:58 - INFO - __main__ - Step 53089: {'lr': 0.0003668511151990338, 'samples': 10193088, 'steps': 53088, 'loss/train': 1.4734662771224976} 11/07/2021 04:41:59 - INFO - __main__ - Step 53090: {'lr': 0.0003668464237793815, 'samples': 10193280, 'steps': 53089, 'loss/train': 1.0968613624572754} 11/07/2021 04:42:00 - INFO - __main__ - Step 53091: {'lr': 0.00036684173230707996, 'samples': 10193472, 'steps': 53090, 'loss/train': 1.2754640579223633} 11/07/2021 04:42:00 - INFO - __main__ - Step 53092: {'lr': 0.00036683704078213137, 'samples': 10193664, 'steps': 53091, 'loss/train': 1.2075488567352295} 11/07/2021 04:42:00 - INFO - __main__ - Step 53093: {'lr': 0.00036683234920453783, 'samples': 10193856, 'steps': 53092, 'loss/train': 1.518633246421814} 11/07/2021 04:42:01 - INFO - __main__ - Step 53094: {'lr': 0.0003668276575743014, 'samples': 10194048, 'steps': 53093, 'loss/train': 1.2452999353408813} 11/07/2021 04:42:01 - INFO - __main__ - Step 53095: {'lr': 0.0003668229658914243, 'samples': 10194240, 'steps': 53094, 'loss/train': 1.1099741458892822} 11/07/2021 04:42:02 - INFO - __main__ - Step 53096: {'lr': 0.0003668182741559085, 'samples': 10194432, 'steps': 53095, 'loss/train': 1.2349625825881958} 11/07/2021 04:42:03 - INFO - __main__ - Step 53097: {'lr': 0.00036681358236775625, 'samples': 10194624, 'steps': 53096, 'loss/train': 1.0939483642578125} 11/07/2021 04:42:03 - INFO - __main__ - Step 53098: {'lr': 0.00036680889052696954, 'samples': 10194816, 'steps': 53097, 'loss/train': 1.4480918645858765} 11/07/2021 04:42:03 - INFO - __main__ - Step 53099: {'lr': 0.00036680419863355056, 'samples': 10195008, 'steps': 53098, 'loss/train': 0.1520826816558838} 11/07/2021 04:42:04 - INFO - __main__ - Step 53100: {'lr': 0.0003667995066875014, 'samples': 10195200, 'steps': 53099, 'loss/train': 1.2088161706924438} 11/07/2021 04:42:05 - INFO - __main__ - Step 53101: {'lr': 0.00036679481468882425, 'samples': 10195392, 'steps': 53100, 'loss/train': 1.375238299369812} 11/07/2021 04:42:05 - INFO - __main__ - Step 53102: {'lr': 0.00036679012263752115, 'samples': 10195584, 'steps': 53101, 'loss/train': 1.405340552330017} 11/07/2021 04:42:05 - INFO - __main__ - Step 53103: {'lr': 0.00036678543053359413, 'samples': 10195776, 'steps': 53102, 'loss/train': 1.8700934648513794} 11/07/2021 04:42:06 - INFO - __main__ - Step 53104: {'lr': 0.0003667807383770455, 'samples': 10195968, 'steps': 53103, 'loss/train': 1.4020655155181885} 11/07/2021 04:42:06 - INFO - __main__ - Step 53105: {'lr': 0.00036677604616787717, 'samples': 10196160, 'steps': 53104, 'loss/train': 1.789962887763977} 11/07/2021 04:42:07 - INFO - __main__ - Step 53106: {'lr': 0.00036677135390609145, 'samples': 10196352, 'steps': 53105, 'loss/train': 1.2877886295318604} 11/07/2021 04:42:07 - INFO - __main__ - Step 53107: {'lr': 0.0003667666615916903, 'samples': 10196544, 'steps': 53106, 'loss/train': 1.5096367597579956} 11/07/2021 04:42:08 - INFO - __main__ - Step 53108: {'lr': 0.00036676196922467595, 'samples': 10196736, 'steps': 53107, 'loss/train': 1.4057319164276123} 11/07/2021 04:42:08 - INFO - __main__ - Step 53109: {'lr': 0.00036675727680505045, 'samples': 10196928, 'steps': 53108, 'loss/train': 1.3050413131713867} 11/07/2021 04:42:08 - INFO - __main__ - Step 53110: {'lr': 0.0003667525843328159, 'samples': 10197120, 'steps': 53109, 'loss/train': 1.1219984292984009} 11/07/2021 04:42:09 - INFO - __main__ - Step 53111: {'lr': 0.0003667478918079744, 'samples': 10197312, 'steps': 53110, 'loss/train': 1.7990424633026123} 11/07/2021 04:42:10 - INFO - __main__ - Step 53112: {'lr': 0.0003667431992305281, 'samples': 10197504, 'steps': 53111, 'loss/train': 1.388959527015686} 11/07/2021 04:42:10 - INFO - __main__ - Step 53113: {'lr': 0.0003667385066004792, 'samples': 10197696, 'steps': 53112, 'loss/train': 1.0651360750198364} 11/07/2021 04:42:11 - INFO - __main__ - Step 53114: {'lr': 0.0003667338139178297, 'samples': 10197888, 'steps': 53113, 'loss/train': 0.6847978830337524} 11/07/2021 04:42:11 - INFO - __main__ - Step 53115: {'lr': 0.0003667291211825817, 'samples': 10198080, 'steps': 53114, 'loss/train': 1.4442719221115112} 11/07/2021 04:42:12 - INFO - __main__ - Step 53116: {'lr': 0.0003667244283947374, 'samples': 10198272, 'steps': 53115, 'loss/train': 1.5201616287231445} 11/07/2021 04:42:12 - INFO - __main__ - Step 53117: {'lr': 0.0003667197355542989, 'samples': 10198464, 'steps': 53116, 'loss/train': 1.1403840780258179} 11/07/2021 04:42:13 - INFO - __main__ - Step 53118: {'lr': 0.0003667150426612682, 'samples': 10198656, 'steps': 53117, 'loss/train': 1.546968698501587} 11/07/2021 04:42:13 - INFO - __main__ - Step 53119: {'lr': 0.0003667103497156475, 'samples': 10198848, 'steps': 53118, 'loss/train': 1.7902982234954834} 11/07/2021 04:42:13 - INFO - __main__ - Step 53120: {'lr': 0.00036670565671743905, 'samples': 10199040, 'steps': 53119, 'loss/train': 1.352241039276123} 11/07/2021 04:42:14 - INFO - __main__ - Step 53121: {'lr': 0.0003667009636666447, 'samples': 10199232, 'steps': 53120, 'loss/train': 1.0399212837219238} 11/07/2021 04:42:15 - INFO - __main__ - Step 53122: {'lr': 0.00036669627056326685, 'samples': 10199424, 'steps': 53121, 'loss/train': 1.4006602764129639} 11/07/2021 04:42:15 - INFO - __main__ - Step 53123: {'lr': 0.0003666915774073073, 'samples': 10199616, 'steps': 53122, 'loss/train': 1.2680741548538208} 11/07/2021 04:42:16 - INFO - __main__ - Step 53124: {'lr': 0.00036668688419876837, 'samples': 10199808, 'steps': 53123, 'loss/train': 1.5843561887741089} 11/07/2021 04:42:16 - INFO - __main__ - Step 53125: {'lr': 0.0003666821909376522, 'samples': 10200000, 'steps': 53124, 'loss/train': 1.2021772861480713} 11/07/2021 04:42:16 - INFO - __main__ - Step 53126: {'lr': 0.00036667749762396074, 'samples': 10200192, 'steps': 53125, 'loss/train': 1.378400444984436} 11/07/2021 04:42:17 - INFO - __main__ - Step 53127: {'lr': 0.0003666728042576962, 'samples': 10200384, 'steps': 53126, 'loss/train': 0.9234182238578796} 11/07/2021 04:42:18 - INFO - __main__ - Step 53128: {'lr': 0.0003666681108388608, 'samples': 10200576, 'steps': 53127, 'loss/train': 1.4577910900115967} 11/07/2021 04:42:18 - INFO - __main__ - Step 53129: {'lr': 0.0003666634173674565, 'samples': 10200768, 'steps': 53128, 'loss/train': 1.1644287109375} 11/07/2021 04:42:18 - INFO - __main__ - Step 53130: {'lr': 0.00036665872384348543, 'samples': 10200960, 'steps': 53129, 'loss/train': 2.0268161296844482} 11/07/2021 04:42:19 - INFO - __main__ - Step 53131: {'lr': 0.00036665403026694976, 'samples': 10201152, 'steps': 53130, 'loss/train': 1.7038955688476562} 11/07/2021 04:42:19 - INFO - __main__ - Step 53132: {'lr': 0.0003666493366378516, 'samples': 10201344, 'steps': 53131, 'loss/train': 2.0351192951202393} 11/07/2021 04:42:20 - INFO - __main__ - Step 53133: {'lr': 0.00036664464295619296, 'samples': 10201536, 'steps': 53132, 'loss/train': 1.5343565940856934} 11/07/2021 04:42:20 - INFO - __main__ - Step 53134: {'lr': 0.0003666399492219762, 'samples': 10201728, 'steps': 53133, 'loss/train': 1.140608549118042} 11/07/2021 04:42:21 - INFO - __main__ - Step 53135: {'lr': 0.0003666352554352032, 'samples': 10201920, 'steps': 53134, 'loss/train': 1.3192079067230225} 11/07/2021 04:42:21 - INFO - __main__ - Step 53136: {'lr': 0.00036663056159587614, 'samples': 10202112, 'steps': 53135, 'loss/train': 0.8051148653030396} 11/07/2021 04:42:22 - INFO - __main__ - Step 53137: {'lr': 0.0003666258677039971, 'samples': 10202304, 'steps': 53136, 'loss/train': 1.203921914100647} 11/07/2021 04:42:23 - INFO - __main__ - Step 53138: {'lr': 0.00036662117375956834, 'samples': 10202496, 'steps': 53137, 'loss/train': 1.6183037757873535} 11/07/2021 04:42:23 - INFO - __main__ - Step 53139: {'lr': 0.00036661647976259185, 'samples': 10202688, 'steps': 53138, 'loss/train': 1.3904577493667603} 11/07/2021 04:42:23 - INFO - __main__ - Step 53140: {'lr': 0.0003666117857130698, 'samples': 10202880, 'steps': 53139, 'loss/train': 1.312795639038086} 11/07/2021 04:42:24 - INFO - __main__ - Step 53141: {'lr': 0.00036660709161100423, 'samples': 10203072, 'steps': 53140, 'loss/train': 1.4761615991592407} 11/07/2021 04:42:24 - INFO - __main__ - Step 53142: {'lr': 0.0003666023974563973, 'samples': 10203264, 'steps': 53141, 'loss/train': 1.2925437688827515} 11/07/2021 04:42:25 - INFO - __main__ - Step 53143: {'lr': 0.0003665977032492511, 'samples': 10203456, 'steps': 53142, 'loss/train': 1.2739945650100708} 11/07/2021 04:42:26 - INFO - __main__ - Step 53144: {'lr': 0.00036659300898956784, 'samples': 10203648, 'steps': 53143, 'loss/train': 1.3361073732376099} 11/07/2021 04:42:26 - INFO - __main__ - Step 53145: {'lr': 0.0003665883146773496, 'samples': 10203840, 'steps': 53144, 'loss/train': 1.0176421403884888} 11/07/2021 04:42:26 - INFO - __main__ - Step 53146: {'lr': 0.0003665836203125984, 'samples': 10204032, 'steps': 53145, 'loss/train': 1.5041325092315674} 11/07/2021 04:42:27 - INFO - __main__ - Step 53147: {'lr': 0.0003665789258953164, 'samples': 10204224, 'steps': 53146, 'loss/train': 1.6452785730361938} 11/07/2021 04:42:27 - INFO - __main__ - Step 53148: {'lr': 0.00036657423142550576, 'samples': 10204416, 'steps': 53147, 'loss/train': 1.0984362363815308} 11/07/2021 04:42:28 - INFO - __main__ - Step 53149: {'lr': 0.00036656953690316865, 'samples': 10204608, 'steps': 53148, 'loss/train': 1.234283208847046} 11/07/2021 04:42:29 - INFO - __main__ - Step 53150: {'lr': 0.000366564842328307, 'samples': 10204800, 'steps': 53149, 'loss/train': 0.9838631749153137} 11/07/2021 04:42:29 - INFO - __main__ - Step 53151: {'lr': 0.0003665601477009231, 'samples': 10204992, 'steps': 53150, 'loss/train': 1.545081615447998} 11/07/2021 04:42:29 - INFO - __main__ - Step 53152: {'lr': 0.00036655545302101894, 'samples': 10205184, 'steps': 53151, 'loss/train': 1.600849986076355} 11/07/2021 04:42:30 - INFO - __main__ - Step 53153: {'lr': 0.00036655075828859673, 'samples': 10205376, 'steps': 53152, 'loss/train': 1.3004236221313477} 11/07/2021 04:42:31 - INFO - __main__ - Step 53154: {'lr': 0.0003665460635036585, 'samples': 10205568, 'steps': 53153, 'loss/train': 1.072939395904541} 11/07/2021 04:42:31 - INFO - __main__ - Step 53155: {'lr': 0.00036654136866620646, 'samples': 10205760, 'steps': 53154, 'loss/train': 1.4235090017318726} 11/07/2021 04:42:31 - INFO - __main__ - Step 53156: {'lr': 0.0003665366737762427, 'samples': 10205952, 'steps': 53155, 'loss/train': 1.2102850675582886} 11/07/2021 04:42:32 - INFO - __main__ - Step 53157: {'lr': 0.0003665319788337692, 'samples': 10206144, 'steps': 53156, 'loss/train': 1.6960684061050415} 11/07/2021 04:42:32 - INFO - __main__ - Step 53158: {'lr': 0.0003665272838387883, 'samples': 10206336, 'steps': 53157, 'loss/train': 1.5066572427749634} 11/07/2021 04:42:33 - INFO - __main__ - Step 53159: {'lr': 0.00036652258879130194, 'samples': 10206528, 'steps': 53158, 'loss/train': 1.4323980808258057} 11/07/2021 04:42:33 - INFO - __main__ - Step 53160: {'lr': 0.0003665178936913123, 'samples': 10206720, 'steps': 53159, 'loss/train': 1.448259711265564} 11/07/2021 04:42:34 - INFO - __main__ - Step 53161: {'lr': 0.0003665131985388215, 'samples': 10206912, 'steps': 53160, 'loss/train': 1.64296555519104} 11/07/2021 04:42:34 - INFO - __main__ - Step 53162: {'lr': 0.00036650850333383174, 'samples': 10207104, 'steps': 53161, 'loss/train': 1.4066882133483887} 11/07/2021 04:42:34 - INFO - __main__ - Step 53163: {'lr': 0.000366503808076345, 'samples': 10207296, 'steps': 53162, 'loss/train': 1.3050343990325928} 11/07/2021 04:42:35 - INFO - __main__ - Step 53164: {'lr': 0.00036649911276636336, 'samples': 10207488, 'steps': 53163, 'loss/train': 1.0233391523361206} 11/07/2021 04:42:36 - INFO - __main__ - Step 53165: {'lr': 0.0003664944174038891, 'samples': 10207680, 'steps': 53164, 'loss/train': 1.230887770652771} 11/07/2021 04:42:36 - INFO - __main__ - Step 53166: {'lr': 0.0003664897219889242, 'samples': 10207872, 'steps': 53165, 'loss/train': 1.853659987449646} 11/07/2021 04:42:36 - INFO - __main__ - Step 53167: {'lr': 0.0003664850265214709, 'samples': 10208064, 'steps': 53166, 'loss/train': 1.800600528717041} 11/07/2021 04:42:37 - INFO - __main__ - Step 53168: {'lr': 0.00036648033100153117, 'samples': 10208256, 'steps': 53167, 'loss/train': 1.7309788465499878} 11/07/2021 04:42:38 - INFO - __main__ - Step 53169: {'lr': 0.0003664756354291073, 'samples': 10208448, 'steps': 53168, 'loss/train': 1.5815919637680054} 11/07/2021 04:42:38 - INFO - __main__ - Step 53170: {'lr': 0.0003664709398042012, 'samples': 10208640, 'steps': 53169, 'loss/train': 1.2343692779541016} 11/07/2021 04:42:39 - INFO - __main__ - Step 53171: {'lr': 0.00036646624412681514, 'samples': 10208832, 'steps': 53170, 'loss/train': 1.4695652723312378} 11/07/2021 04:42:39 - INFO - __main__ - Step 53172: {'lr': 0.0003664615483969511, 'samples': 10209024, 'steps': 53171, 'loss/train': 1.4880211353302002} 11/07/2021 04:42:39 - INFO - __main__ - Step 53173: {'lr': 0.0003664568526146114, 'samples': 10209216, 'steps': 53172, 'loss/train': 1.166122555732727} 11/07/2021 04:42:40 - INFO - __main__ - Step 53174: {'lr': 0.000366452156779798, 'samples': 10209408, 'steps': 53173, 'loss/train': 1.0564590692520142} 11/07/2021 04:42:41 - INFO - __main__ - Step 53175: {'lr': 0.000366447460892513, 'samples': 10209600, 'steps': 53174, 'loss/train': 1.40821373462677} 11/07/2021 04:42:41 - INFO - __main__ - Step 53176: {'lr': 0.0003664427649527587, 'samples': 10209792, 'steps': 53175, 'loss/train': 1.0120983123779297} 11/07/2021 04:42:42 - INFO - __main__ - Step 53177: {'lr': 0.000366438068960537, 'samples': 10209984, 'steps': 53176, 'loss/train': 1.0967178344726562} 11/07/2021 04:42:42 - INFO - __main__ - Step 53178: {'lr': 0.0003664333729158501, 'samples': 10210176, 'steps': 53177, 'loss/train': 1.3859779834747314} 11/07/2021 04:42:43 - INFO - __main__ - Step 53179: {'lr': 0.0003664286768187002, 'samples': 10210368, 'steps': 53178, 'loss/train': 1.549895167350769} 11/07/2021 04:42:43 - INFO - __main__ - Step 53180: {'lr': 0.0003664239806690892, 'samples': 10210560, 'steps': 53179, 'loss/train': 1.21084463596344} 11/07/2021 04:42:44 - INFO - __main__ - Step 53181: {'lr': 0.00036641928446701943, 'samples': 10210752, 'steps': 53180, 'loss/train': 1.6016613245010376} 11/07/2021 04:42:44 - INFO - __main__ - Step 53182: {'lr': 0.00036641458821249295, 'samples': 10210944, 'steps': 53181, 'loss/train': 0.8361701965332031} 11/07/2021 04:42:44 - INFO - __main__ - Step 53183: {'lr': 0.00036640989190551184, 'samples': 10211136, 'steps': 53182, 'loss/train': 1.4793580770492554} 11/07/2021 04:42:45 - INFO - __main__ - Step 53184: {'lr': 0.00036640519554607823, 'samples': 10211328, 'steps': 53183, 'loss/train': 0.36974507570266724} 11/07/2021 04:42:46 - INFO - __main__ - Step 53185: {'lr': 0.00036640049913419417, 'samples': 10211520, 'steps': 53184, 'loss/train': 1.7542303800582886} 11/07/2021 04:42:46 - INFO - __main__ - Step 53186: {'lr': 0.00036639580266986183, 'samples': 10211712, 'steps': 53185, 'loss/train': 1.954509973526001} 11/07/2021 04:42:46 - INFO - __main__ - Step 53187: {'lr': 0.00036639110615308343, 'samples': 10211904, 'steps': 53186, 'loss/train': 1.5697658061981201} 11/07/2021 04:42:47 - INFO - __main__ - Step 53188: {'lr': 0.0003663864095838609, 'samples': 10212096, 'steps': 53187, 'loss/train': 1.2949846982955933} 11/07/2021 04:42:48 - INFO - __main__ - Step 53189: {'lr': 0.0003663817129621966, 'samples': 10212288, 'steps': 53188, 'loss/train': 1.4699655771255493} 11/07/2021 04:42:48 - INFO - __main__ - Step 53190: {'lr': 0.0003663770162880924, 'samples': 10212480, 'steps': 53189, 'loss/train': 1.5642396211624146} 11/07/2021 04:42:48 - INFO - __main__ - Step 53191: {'lr': 0.00036637231956155046, 'samples': 10212672, 'steps': 53190, 'loss/train': 0.9616280794143677} 11/07/2021 04:42:49 - INFO - __main__ - Step 53192: {'lr': 0.000366367622782573, 'samples': 10212864, 'steps': 53191, 'loss/train': 1.5829464197158813} 11/07/2021 04:42:49 - INFO - __main__ - Step 53193: {'lr': 0.0003663629259511621, 'samples': 10213056, 'steps': 53192, 'loss/train': 1.0942155122756958} 11/07/2021 04:42:50 - INFO - __main__ - Step 53194: {'lr': 0.00036635822906731986, 'samples': 10213248, 'steps': 53193, 'loss/train': 0.8965877294540405} 11/07/2021 04:42:50 - INFO - __main__ - Step 53195: {'lr': 0.0003663535321310484, 'samples': 10213440, 'steps': 53194, 'loss/train': 1.493140697479248} 11/07/2021 04:42:51 - INFO - __main__ - Step 53196: {'lr': 0.00036634883514234987, 'samples': 10213632, 'steps': 53195, 'loss/train': 1.5236399173736572} 11/07/2021 04:42:51 - INFO - __main__ - Step 53197: {'lr': 0.00036634413810122626, 'samples': 10213824, 'steps': 53196, 'loss/train': 1.4427578449249268} 11/07/2021 04:42:51 - INFO - __main__ - Step 53198: {'lr': 0.0003663394410076798, 'samples': 10214016, 'steps': 53197, 'loss/train': 1.128544807434082} 11/07/2021 04:42:52 - INFO - __main__ - Step 53199: {'lr': 0.00036633474386171263, 'samples': 10214208, 'steps': 53198, 'loss/train': 1.072456955909729} 11/07/2021 04:42:53 - INFO - __main__ - Step 53200: {'lr': 0.00036633004666332674, 'samples': 10214400, 'steps': 53199, 'loss/train': 1.6818119287490845} 11/07/2021 04:42:54 - INFO - __main__ - Step 53201: {'lr': 0.0003663253494125244, 'samples': 10214592, 'steps': 53200, 'loss/train': 1.135022521018982} 11/07/2021 04:42:54 - INFO - __main__ - Step 53202: {'lr': 0.0003663206521093076, 'samples': 10214784, 'steps': 53201, 'loss/train': 2.0167996883392334} 11/07/2021 04:42:54 - INFO - __main__ - Step 53203: {'lr': 0.00036631595475367855, 'samples': 10214976, 'steps': 53202, 'loss/train': 1.3336080312728882} 11/07/2021 04:42:55 - INFO - __main__ - Step 53204: {'lr': 0.0003663112573456393, 'samples': 10215168, 'steps': 53203, 'loss/train': 1.409445881843567} 11/07/2021 04:42:56 - INFO - __main__ - Step 53205: {'lr': 0.00036630655988519203, 'samples': 10215360, 'steps': 53204, 'loss/train': 0.08970138430595398} 11/07/2021 04:42:56 - INFO - __main__ - Step 53206: {'lr': 0.00036630186237233877, 'samples': 10215552, 'steps': 53205, 'loss/train': 1.392133116722107} 11/07/2021 04:42:56 - INFO - __main__ - Step 53207: {'lr': 0.00036629716480708174, 'samples': 10215744, 'steps': 53206, 'loss/train': 1.4997931718826294} 11/07/2021 04:42:57 - INFO - __main__ - Step 53208: {'lr': 0.00036629246718942294, 'samples': 10215936, 'steps': 53207, 'loss/train': 1.5354804992675781} 11/07/2021 04:42:57 - INFO - __main__ - Step 53209: {'lr': 0.0003662877695193646, 'samples': 10216128, 'steps': 53208, 'loss/train': 1.1489993333816528} 11/07/2021 04:42:58 - INFO - __main__ - Step 53210: {'lr': 0.00036628307179690877, 'samples': 10216320, 'steps': 53209, 'loss/train': 1.0780820846557617} 11/07/2021 04:42:58 - INFO - __main__ - Step 53211: {'lr': 0.0003662783740220576, 'samples': 10216512, 'steps': 53210, 'loss/train': 1.525235652923584} 11/07/2021 04:42:59 - INFO - __main__ - Step 53212: {'lr': 0.00036627367619481316, 'samples': 10216704, 'steps': 53211, 'loss/train': 1.4894222021102905} 11/07/2021 04:42:59 - INFO - __main__ - Step 53213: {'lr': 0.00036626897831517756, 'samples': 10216896, 'steps': 53212, 'loss/train': 0.07788790017366409} 11/07/2021 04:43:00 - INFO - __main__ - Step 53214: {'lr': 0.000366264280383153, 'samples': 10217088, 'steps': 53213, 'loss/train': 2.0759551525115967} 11/07/2021 04:43:01 - INFO - __main__ - Step 53215: {'lr': 0.00036625958239874156, 'samples': 10217280, 'steps': 53214, 'loss/train': 1.3804129362106323} 11/07/2021 04:43:01 - INFO - __main__ - Step 53216: {'lr': 0.0003662548843619454, 'samples': 10217472, 'steps': 53215, 'loss/train': 1.3251698017120361} 11/07/2021 04:43:01 - INFO - __main__ - Step 53217: {'lr': 0.00036625018627276646, 'samples': 10217664, 'steps': 53216, 'loss/train': 1.5707327127456665} 11/07/2021 04:43:02 - INFO - __main__ - Step 53218: {'lr': 0.0003662454881312071, 'samples': 10217856, 'steps': 53217, 'loss/train': 1.3364756107330322} 11/07/2021 04:43:02 - INFO - __main__ - Step 53219: {'lr': 0.0003662407899372692, 'samples': 10218048, 'steps': 53218, 'loss/train': 1.5194370746612549} 11/07/2021 04:43:04 - INFO - __main__ - Step 53220: {'lr': 0.000366236091690955, 'samples': 10218240, 'steps': 53219, 'loss/train': 1.5489014387130737} 11/07/2021 04:43:04 - INFO - __main__ - Step 53221: {'lr': 0.00036623139339226664, 'samples': 10218432, 'steps': 53220, 'loss/train': 1.1221784353256226} 11/07/2021 04:43:04 - INFO - __main__ - Step 53222: {'lr': 0.00036622669504120627, 'samples': 10218624, 'steps': 53221, 'loss/train': 1.5648833513259888} 11/07/2021 04:43:05 - INFO - __main__ - Step 53223: {'lr': 0.0003662219966377759, 'samples': 10218816, 'steps': 53222, 'loss/train': 1.6098930835723877} 11/07/2021 04:43:05 - INFO - __main__ - Step 53224: {'lr': 0.0003662172981819777, 'samples': 10219008, 'steps': 53223, 'loss/train': 1.4301648139953613} 11/07/2021 04:43:05 - INFO - __main__ - Step 53225: {'lr': 0.00036621259967381374, 'samples': 10219200, 'steps': 53224, 'loss/train': 1.5039647817611694} 11/07/2021 04:43:06 - INFO - __main__ - Step 53226: {'lr': 0.0003662079011132862, 'samples': 10219392, 'steps': 53225, 'loss/train': 1.1975923776626587} 11/07/2021 04:43:07 - INFO - __main__ - Step 53227: {'lr': 0.0003662032025003972, 'samples': 10219584, 'steps': 53226, 'loss/train': 1.125380516052246} 11/07/2021 04:43:07 - INFO - __main__ - Step 53228: {'lr': 0.0003661985038351488, 'samples': 10219776, 'steps': 53227, 'loss/train': 1.6176185607910156} 11/07/2021 04:43:07 - INFO - __main__ - Step 53229: {'lr': 0.0003661938051175432, 'samples': 10219968, 'steps': 53228, 'loss/train': 1.3787031173706055} 11/07/2021 04:43:08 - INFO - __main__ - Step 53230: {'lr': 0.0003661891063475824, 'samples': 10220160, 'steps': 53229, 'loss/train': 1.0745278596878052} 11/07/2021 04:43:09 - INFO - __main__ - Step 53231: {'lr': 0.0003661844075252686, 'samples': 10220352, 'steps': 53230, 'loss/train': 1.470468282699585} 11/07/2021 04:43:09 - INFO - __main__ - Step 53232: {'lr': 0.0003661797086506039, 'samples': 10220544, 'steps': 53231, 'loss/train': 1.6155319213867188} 11/07/2021 04:43:10 - INFO - __main__ - Step 53233: {'lr': 0.0003661750097235904, 'samples': 10220736, 'steps': 53232, 'loss/train': 1.4225972890853882} 11/07/2021 04:43:10 - INFO - __main__ - Step 53234: {'lr': 0.00036617031074423023, 'samples': 10220928, 'steps': 53233, 'loss/train': 1.4886188507080078} 11/07/2021 04:43:10 - INFO - __main__ - Step 53235: {'lr': 0.00036616561171252547, 'samples': 10221120, 'steps': 53234, 'loss/train': 1.8060020208358765} 11/07/2021 04:43:11 - INFO - __main__ - Step 53236: {'lr': 0.0003661609126284784, 'samples': 10221312, 'steps': 53235, 'loss/train': 1.5143846273422241} 11/07/2021 04:43:12 - INFO - __main__ - Step 53237: {'lr': 0.00036615621349209094, 'samples': 10221504, 'steps': 53236, 'loss/train': 0.9544692635536194} 11/07/2021 04:43:12 - INFO - __main__ - Step 53238: {'lr': 0.00036615151430336536, 'samples': 10221696, 'steps': 53237, 'loss/train': 1.2647370100021362} 11/07/2021 04:43:12 - INFO - __main__ - Step 53239: {'lr': 0.0003661468150623036, 'samples': 10221888, 'steps': 53238, 'loss/train': 1.6468429565429688} 11/07/2021 04:43:13 - INFO - __main__ - Step 53240: {'lr': 0.0003661421157689079, 'samples': 10222080, 'steps': 53239, 'loss/train': 0.07004977762699127} 11/07/2021 04:43:14 - INFO - __main__ - Step 53241: {'lr': 0.00036613741642318033, 'samples': 10222272, 'steps': 53240, 'loss/train': 1.4579503536224365} 11/07/2021 04:43:14 - INFO - __main__ - Step 53242: {'lr': 0.00036613271702512306, 'samples': 10222464, 'steps': 53241, 'loss/train': 1.2062925100326538} 11/07/2021 04:43:15 - INFO - __main__ - Step 53243: {'lr': 0.00036612801757473823, 'samples': 10222656, 'steps': 53242, 'loss/train': 1.461490511894226} 11/07/2021 04:43:15 - INFO - __main__ - Step 53244: {'lr': 0.00036612331807202785, 'samples': 10222848, 'steps': 53243, 'loss/train': 1.7669768333435059} 11/07/2021 04:43:15 - INFO - __main__ - Step 53245: {'lr': 0.00036611861851699415, 'samples': 10223040, 'steps': 53244, 'loss/train': 1.256188988685608} 11/07/2021 04:43:16 - INFO - __main__ - Step 53246: {'lr': 0.00036611391890963913, 'samples': 10223232, 'steps': 53245, 'loss/train': 1.417641520500183} 11/07/2021 04:43:17 - INFO - __main__ - Step 53247: {'lr': 0.000366109219249965, 'samples': 10223424, 'steps': 53246, 'loss/train': 1.5248044729232788} 11/07/2021 04:43:17 - INFO - __main__ - Step 53248: {'lr': 0.00036610451953797386, 'samples': 10223616, 'steps': 53247, 'loss/train': 1.5818085670471191} 11/07/2021 04:43:17 - INFO - __main__ - Step 53249: {'lr': 0.0003660998197736677, 'samples': 10223808, 'steps': 53248, 'loss/train': 1.3921006917953491} 11/07/2021 04:43:18 - INFO - __main__ - Step 53250: {'lr': 0.00036609511995704894, 'samples': 10224000, 'steps': 53249, 'loss/train': 1.4409935474395752} 11/07/2021 04:43:19 - INFO - __main__ - Step 53251: {'lr': 0.0003660904200881194, 'samples': 10224192, 'steps': 53250, 'loss/train': 1.338744044303894} 11/07/2021 04:43:19 - INFO - __main__ - Step 53252: {'lr': 0.00036608572016688136, 'samples': 10224384, 'steps': 53251, 'loss/train': 1.491830825805664} 11/07/2021 04:43:19 - INFO - __main__ - Step 53253: {'lr': 0.00036608102019333684, 'samples': 10224576, 'steps': 53252, 'loss/train': 1.646281361579895} 11/07/2021 04:43:20 - INFO - __main__ - Step 53254: {'lr': 0.00036607632016748796, 'samples': 10224768, 'steps': 53253, 'loss/train': 1.2800869941711426} 11/07/2021 04:43:20 - INFO - __main__ - Step 53255: {'lr': 0.00036607162008933696, 'samples': 10224960, 'steps': 53254, 'loss/train': 1.4255291223526} 11/07/2021 04:43:21 - INFO - __main__ - Step 53256: {'lr': 0.00036606691995888594, 'samples': 10225152, 'steps': 53255, 'loss/train': 1.3554235696792603} 11/07/2021 04:43:21 - INFO - __main__ - Step 53257: {'lr': 0.00036606221977613686, 'samples': 10225344, 'steps': 53256, 'loss/train': 1.6191420555114746} 11/07/2021 04:43:22 - INFO - __main__ - Step 53258: {'lr': 0.0003660575195410919, 'samples': 10225536, 'steps': 53257, 'loss/train': 1.1593029499053955} 11/07/2021 04:43:22 - INFO - __main__ - Step 53259: {'lr': 0.0003660528192537533, 'samples': 10225728, 'steps': 53258, 'loss/train': 1.086011528968811} 11/07/2021 04:43:22 - INFO - __main__ - Step 53260: {'lr': 0.00036604811891412296, 'samples': 10225920, 'steps': 53259, 'loss/train': 1.4210145473480225} 11/07/2021 04:43:24 - INFO - __main__ - Step 53261: {'lr': 0.00036604341852220325, 'samples': 10226112, 'steps': 53260, 'loss/train': 1.5465900897979736} 11/07/2021 04:43:24 - INFO - __main__ - Step 53262: {'lr': 0.00036603871807799616, 'samples': 10226304, 'steps': 53261, 'loss/train': 1.6001391410827637} 11/07/2021 04:43:24 - INFO - __main__ - Step 53263: {'lr': 0.0003660340175815038, 'samples': 10226496, 'steps': 53262, 'loss/train': 1.5498238801956177} 11/07/2021 04:43:25 - INFO - __main__ - Step 53264: {'lr': 0.0003660293170327283, 'samples': 10226688, 'steps': 53263, 'loss/train': 1.7648087739944458} 11/07/2021 04:43:25 - INFO - __main__ - Step 53265: {'lr': 0.0003660246164316717, 'samples': 10226880, 'steps': 53264, 'loss/train': 1.5192945003509521} 11/07/2021 04:43:26 - INFO - __main__ - Step 53266: {'lr': 0.00036601991577833634, 'samples': 10227072, 'steps': 53265, 'loss/train': 1.386931300163269} 11/07/2021 04:43:26 - INFO - __main__ - Step 53267: {'lr': 0.00036601521507272414, 'samples': 10227264, 'steps': 53266, 'loss/train': 1.0379760265350342} 11/07/2021 04:43:27 - INFO - __main__ - Step 53268: {'lr': 0.00036601051431483725, 'samples': 10227456, 'steps': 53267, 'loss/train': 1.528818130493164} 11/07/2021 04:43:27 - INFO - __main__ - Step 53269: {'lr': 0.0003660058135046778, 'samples': 10227648, 'steps': 53268, 'loss/train': 1.4720981121063232} 11/07/2021 04:43:27 - INFO - __main__ - Step 53270: {'lr': 0.000366001112642248, 'samples': 10227840, 'steps': 53269, 'loss/train': 1.534490942955017} 11/07/2021 04:43:28 - INFO - __main__ - Step 53271: {'lr': 0.00036599641172754984, 'samples': 10228032, 'steps': 53270, 'loss/train': 1.5040323734283447} 11/07/2021 04:43:29 - INFO - __main__ - Step 53272: {'lr': 0.0003659917107605854, 'samples': 10228224, 'steps': 53271, 'loss/train': 1.1398561000823975} 11/07/2021 04:43:29 - INFO - __main__ - Step 53273: {'lr': 0.000365987009741357, 'samples': 10228416, 'steps': 53272, 'loss/train': 0.8920938968658447} 11/07/2021 04:43:29 - INFO - __main__ - Step 53274: {'lr': 0.0003659823086698666, 'samples': 10228608, 'steps': 53273, 'loss/train': 1.3302233219146729} 11/07/2021 04:43:30 - INFO - __main__ - Step 53275: {'lr': 0.0003659776075461164, 'samples': 10228800, 'steps': 53274, 'loss/train': 1.2213811874389648} 11/07/2021 04:43:31 - INFO - __main__ - Step 53276: {'lr': 0.0003659729063701084, 'samples': 10228992, 'steps': 53275, 'loss/train': 1.399293303489685} 11/07/2021 04:43:31 - INFO - __main__ - Step 53277: {'lr': 0.00036596820514184485, 'samples': 10229184, 'steps': 53276, 'loss/train': 1.7406283617019653} 11/07/2021 04:43:32 - INFO - __main__ - Step 53278: {'lr': 0.00036596350386132784, 'samples': 10229376, 'steps': 53277, 'loss/train': 1.2684037685394287} 11/07/2021 04:43:32 - INFO - __main__ - Step 53279: {'lr': 0.0003659588025285594, 'samples': 10229568, 'steps': 53278, 'loss/train': 1.9442917108535767} 11/07/2021 04:43:32 - INFO - __main__ - Step 53280: {'lr': 0.0003659541011435418, 'samples': 10229760, 'steps': 53279, 'loss/train': 1.444143295288086} 11/07/2021 04:43:33 - INFO - __main__ - Step 53281: {'lr': 0.00036594939970627704, 'samples': 10229952, 'steps': 53280, 'loss/train': 1.5801076889038086} 11/07/2021 04:43:34 - INFO - __main__ - Step 53282: {'lr': 0.0003659446982167672, 'samples': 10230144, 'steps': 53281, 'loss/train': 1.7955842018127441} 11/07/2021 04:43:34 - INFO - __main__ - Step 53283: {'lr': 0.00036593999667501457, 'samples': 10230336, 'steps': 53282, 'loss/train': 1.5097438097000122} 11/07/2021 04:43:34 - INFO - __main__ - Step 53284: {'lr': 0.0003659352950810211, 'samples': 10230528, 'steps': 53283, 'loss/train': 1.2218964099884033} 11/07/2021 04:43:35 - INFO - __main__ - Step 53285: {'lr': 0.00036593059343478904, 'samples': 10230720, 'steps': 53284, 'loss/train': 1.3489710092544556} 11/07/2021 04:43:36 - INFO - __main__ - Step 53286: {'lr': 0.0003659258917363204, 'samples': 10230912, 'steps': 53285, 'loss/train': 1.256871223449707} 11/07/2021 04:43:36 - INFO - __main__ - Step 53287: {'lr': 0.0003659211899856173, 'samples': 10231104, 'steps': 53286, 'loss/train': 1.3437983989715576} 11/07/2021 04:43:36 - INFO - __main__ - Step 53288: {'lr': 0.0003659164881826819, 'samples': 10231296, 'steps': 53287, 'loss/train': 1.0385193824768066} 11/07/2021 04:43:37 - INFO - __main__ - Step 53289: {'lr': 0.00036591178632751635, 'samples': 10231488, 'steps': 53288, 'loss/train': 1.3750908374786377} 11/07/2021 04:43:37 - INFO - __main__ - Step 53290: {'lr': 0.00036590708442012275, 'samples': 10231680, 'steps': 53289, 'loss/train': 1.4072775840759277} 11/07/2021 04:43:38 - INFO - __main__ - Step 53291: {'lr': 0.0003659023824605033, 'samples': 10231872, 'steps': 53290, 'loss/train': 1.370407223701477} 11/07/2021 04:43:38 - INFO - __main__ - Step 53292: {'lr': 0.0003658976804486599, 'samples': 10232064, 'steps': 53291, 'loss/train': 1.3309603929519653} 11/07/2021 04:43:39 - INFO - __main__ - Step 53293: {'lr': 0.0003658929783845948, 'samples': 10232256, 'steps': 53292, 'loss/train': 1.2490254640579224} 11/07/2021 04:43:39 - INFO - __main__ - Step 53294: {'lr': 0.0003658882762683101, 'samples': 10232448, 'steps': 53293, 'loss/train': 1.263980507850647} 11/07/2021 04:43:39 - INFO - __main__ - Step 53295: {'lr': 0.000365883574099808, 'samples': 10232640, 'steps': 53294, 'loss/train': 1.3692395687103271} 11/07/2021 04:43:40 - INFO - __main__ - Step 53296: {'lr': 0.00036587887187909045, 'samples': 10232832, 'steps': 53295, 'loss/train': 1.2554272413253784} 11/07/2021 04:43:41 - INFO - __main__ - Step 53297: {'lr': 0.0003658741696061598, 'samples': 10233024, 'steps': 53296, 'loss/train': 1.6472340822219849} 11/07/2021 04:43:41 - INFO - __main__ - Step 53298: {'lr': 0.0003658694672810179, 'samples': 10233216, 'steps': 53297, 'loss/train': 1.2174272537231445} 11/07/2021 04:43:41 - INFO - __main__ - Step 53299: {'lr': 0.00036586476490366713, 'samples': 10233408, 'steps': 53298, 'loss/train': 0.06238739192485809} 11/07/2021 04:43:42 - INFO - __main__ - Step 53300: {'lr': 0.0003658600624741094, 'samples': 10233600, 'steps': 53299, 'loss/train': 1.3266628980636597} 11/07/2021 04:43:42 - INFO - __main__ - Step 53301: {'lr': 0.00036585535999234697, 'samples': 10233792, 'steps': 53300, 'loss/train': 1.4299750328063965} 11/07/2021 04:43:44 - INFO - __main__ - Step 53302: {'lr': 0.0003658506574583819, 'samples': 10233984, 'steps': 53301, 'loss/train': 1.2797915935516357} 11/07/2021 04:43:44 - INFO - __main__ - Step 53303: {'lr': 0.0003658459548722163, 'samples': 10234176, 'steps': 53302, 'loss/train': 1.6376073360443115} 11/07/2021 04:43:44 - INFO - __main__ - Step 53304: {'lr': 0.00036584125223385224, 'samples': 10234368, 'steps': 53303, 'loss/train': 1.6182200908660889} 11/07/2021 04:43:45 - INFO - __main__ - Step 53305: {'lr': 0.0003658365495432919, 'samples': 10234560, 'steps': 53304, 'loss/train': 1.4072526693344116} 11/07/2021 04:43:45 - INFO - __main__ - Step 53306: {'lr': 0.0003658318468005375, 'samples': 10234752, 'steps': 53305, 'loss/train': 1.190161943435669} 11/07/2021 04:43:46 - INFO - __main__ - Step 53307: {'lr': 0.000365827144005591, 'samples': 10234944, 'steps': 53306, 'loss/train': 1.2810604572296143} 11/07/2021 04:43:46 - INFO - __main__ - Step 53308: {'lr': 0.0003658224411584545, 'samples': 10235136, 'steps': 53307, 'loss/train': 1.5987610816955566} 11/07/2021 04:43:47 - INFO - __main__ - Step 53309: {'lr': 0.0003658177382591303, 'samples': 10235328, 'steps': 53308, 'loss/train': 1.5264441967010498} 11/07/2021 04:43:47 - INFO - __main__ - Step 53310: {'lr': 0.0003658130353076204, 'samples': 10235520, 'steps': 53309, 'loss/train': 1.1257977485656738} 11/07/2021 04:43:47 - INFO - __main__ - Step 53311: {'lr': 0.00036580833230392696, 'samples': 10235712, 'steps': 53310, 'loss/train': 1.4543594121932983} 11/07/2021 04:43:48 - INFO - __main__ - Step 53312: {'lr': 0.00036580362924805204, 'samples': 10235904, 'steps': 53311, 'loss/train': 1.5833771228790283} 11/07/2021 04:43:49 - INFO - __main__ - Step 53313: {'lr': 0.0003657989261399978, 'samples': 10236096, 'steps': 53312, 'loss/train': 1.826405644416809} 11/07/2021 04:43:49 - INFO - __main__ - Step 53314: {'lr': 0.0003657942229797663, 'samples': 10236288, 'steps': 53313, 'loss/train': 1.50407075881958} 11/07/2021 04:43:49 - INFO - __main__ - Step 53315: {'lr': 0.00036578951976735973, 'samples': 10236480, 'steps': 53314, 'loss/train': 1.4628835916519165} 11/07/2021 04:43:50 - INFO - __main__ - Step 53316: {'lr': 0.00036578481650278023, 'samples': 10236672, 'steps': 53315, 'loss/train': 1.3163783550262451} 11/07/2021 04:43:51 - INFO - __main__ - Step 53317: {'lr': 0.0003657801131860299, 'samples': 10236864, 'steps': 53316, 'loss/train': 1.312928557395935} 11/07/2021 04:43:51 - INFO - __main__ - Step 53318: {'lr': 0.0003657754098171108, 'samples': 10237056, 'steps': 53317, 'loss/train': 1.446033239364624} 11/07/2021 04:43:51 - INFO - __main__ - Step 53319: {'lr': 0.0003657707063960251, 'samples': 10237248, 'steps': 53318, 'loss/train': 1.3465927839279175} 11/07/2021 04:43:52 - INFO - __main__ - Step 53320: {'lr': 0.00036576600292277477, 'samples': 10237440, 'steps': 53319, 'loss/train': 1.060685634613037} 11/07/2021 04:43:52 - INFO - __main__ - Step 53321: {'lr': 0.0003657612993973622, 'samples': 10237632, 'steps': 53320, 'loss/train': 1.0209652185440063} 11/07/2021 04:43:53 - INFO - __main__ - Step 53322: {'lr': 0.00036575659581978935, 'samples': 10237824, 'steps': 53321, 'loss/train': 1.5221610069274902} 11/07/2021 04:43:53 - INFO - __main__ - Step 53323: {'lr': 0.0003657518921900583, 'samples': 10238016, 'steps': 53322, 'loss/train': 1.7708724737167358} 11/07/2021 04:43:54 - INFO - __main__ - Step 53324: {'lr': 0.0003657471885081714, 'samples': 10238208, 'steps': 53323, 'loss/train': 1.097522258758545} 11/07/2021 04:43:54 - INFO - __main__ - Step 53325: {'lr': 0.0003657424847741305, 'samples': 10238400, 'steps': 53324, 'loss/train': 1.5267057418823242} 11/07/2021 04:43:55 - INFO - __main__ - Step 53326: {'lr': 0.0003657377809879378, 'samples': 10238592, 'steps': 53325, 'loss/train': 1.7962665557861328} 11/07/2021 04:43:55 - INFO - __main__ - Step 53327: {'lr': 0.0003657330771495955, 'samples': 10238784, 'steps': 53326, 'loss/train': 1.2069767713546753} 11/07/2021 04:43:56 - INFO - __main__ - Step 53328: {'lr': 0.0003657283732591056, 'samples': 10238976, 'steps': 53327, 'loss/train': 1.2866443395614624} 11/07/2021 04:43:56 - INFO - __main__ - Step 53329: {'lr': 0.00036572366931647034, 'samples': 10239168, 'steps': 53328, 'loss/train': 1.5319390296936035} 11/07/2021 04:43:57 - INFO - __main__ - Step 53330: {'lr': 0.0003657189653216918, 'samples': 10239360, 'steps': 53329, 'loss/train': 0.8240731954574585} 11/07/2021 04:43:57 - INFO - __main__ - Step 53331: {'lr': 0.000365714261274772, 'samples': 10239552, 'steps': 53330, 'loss/train': 1.9161169528961182} 11/07/2021 04:43:57 - INFO - __main__ - Step 53332: {'lr': 0.00036570955717571315, 'samples': 10239744, 'steps': 53331, 'loss/train': 1.3911247253417969} 11/07/2021 04:43:58 - INFO - __main__ - Step 53333: {'lr': 0.0003657048530245174, 'samples': 10239936, 'steps': 53332, 'loss/train': 1.295586347579956} 11/07/2021 04:43:59 - INFO - __main__ - Step 53334: {'lr': 0.0003657001488211868, 'samples': 10240128, 'steps': 53333, 'loss/train': 0.737235426902771} 11/07/2021 04:43:59 - INFO - __main__ - Step 53335: {'lr': 0.00036569544456572346, 'samples': 10240320, 'steps': 53334, 'loss/train': 1.0752605199813843} 11/07/2021 04:44:00 - INFO - __main__ - Step 53336: {'lr': 0.0003656907402581296, 'samples': 10240512, 'steps': 53335, 'loss/train': 1.7795945405960083} 11/07/2021 04:44:00 - INFO - __main__ - Step 53337: {'lr': 0.00036568603589840734, 'samples': 10240704, 'steps': 53336, 'loss/train': 1.2930161952972412} 11/07/2021 04:44:01 - INFO - __main__ - Step 53338: {'lr': 0.00036568133148655855, 'samples': 10240896, 'steps': 53337, 'loss/train': 1.6007355451583862} 11/07/2021 04:44:01 - INFO - __main__ - Step 53339: {'lr': 0.0003656766270225857, 'samples': 10241088, 'steps': 53338, 'loss/train': 1.1706973314285278} 11/07/2021 04:44:02 - INFO - __main__ - Step 53340: {'lr': 0.00036567192250649066, 'samples': 10241280, 'steps': 53339, 'loss/train': 1.1047382354736328} 11/07/2021 04:44:02 - INFO - __main__ - Step 53341: {'lr': 0.0003656672179382757, 'samples': 10241472, 'steps': 53340, 'loss/train': 1.2220205068588257} 11/07/2021 04:44:02 - INFO - __main__ - Step 53342: {'lr': 0.00036566251331794284, 'samples': 10241664, 'steps': 53341, 'loss/train': 1.6235361099243164} 11/07/2021 04:44:03 - INFO - __main__ - Step 53343: {'lr': 0.00036565780864549423, 'samples': 10241856, 'steps': 53342, 'loss/train': 1.4334454536437988} 11/07/2021 04:44:04 - INFO - __main__ - Step 53344: {'lr': 0.00036565310392093204, 'samples': 10242048, 'steps': 53343, 'loss/train': 1.7549564838409424} 11/07/2021 04:44:04 - INFO - __main__ - Step 53345: {'lr': 0.0003656483991442583, 'samples': 10242240, 'steps': 53344, 'loss/train': 1.4066451787948608} 11/07/2021 04:44:04 - INFO - __main__ - Step 53346: {'lr': 0.0003656436943154752, 'samples': 10242432, 'steps': 53345, 'loss/train': 0.18494181334972382} 11/07/2021 04:44:05 - INFO - __main__ - Step 53347: {'lr': 0.0003656389894345848, 'samples': 10242624, 'steps': 53346, 'loss/train': 2.0516510009765625} 11/07/2021 04:44:06 - INFO - __main__ - Step 53348: {'lr': 0.0003656342845015893, 'samples': 10242816, 'steps': 53347, 'loss/train': 1.5392905473709106} 11/07/2021 04:44:06 - INFO - __main__ - Step 53349: {'lr': 0.00036562957951649075, 'samples': 10243008, 'steps': 53348, 'loss/train': 1.7591869831085205} 11/07/2021 04:44:06 - INFO - __main__ - Step 53350: {'lr': 0.00036562487447929133, 'samples': 10243200, 'steps': 53349, 'loss/train': 1.567441463470459} 11/07/2021 04:44:07 - INFO - __main__ - Step 53351: {'lr': 0.0003656201693899931, 'samples': 10243392, 'steps': 53350, 'loss/train': 1.250799298286438} 11/07/2021 04:44:07 - INFO - __main__ - Step 53352: {'lr': 0.0003656154642485982, 'samples': 10243584, 'steps': 53351, 'loss/train': 1.1381863355636597} 11/07/2021 04:44:08 - INFO - __main__ - Step 53353: {'lr': 0.00036561075905510874, 'samples': 10243776, 'steps': 53352, 'loss/train': 1.5367908477783203} 11/07/2021 04:44:09 - INFO - __main__ - Step 53354: {'lr': 0.00036560605380952686, 'samples': 10243968, 'steps': 53353, 'loss/train': 1.4778127670288086} 11/07/2021 04:44:09 - INFO - __main__ - Step 53355: {'lr': 0.00036560134851185475, 'samples': 10244160, 'steps': 53354, 'loss/train': 1.3822078704833984} 11/07/2021 04:44:09 - INFO - __main__ - Step 53356: {'lr': 0.00036559664316209437, 'samples': 10244352, 'steps': 53355, 'loss/train': 0.31846997141838074} 11/07/2021 04:44:10 - INFO - __main__ - Step 53357: {'lr': 0.00036559193776024794, 'samples': 10244544, 'steps': 53356, 'loss/train': 1.2317649126052856} 11/07/2021 04:44:11 - INFO - __main__ - Step 53358: {'lr': 0.00036558723230631764, 'samples': 10244736, 'steps': 53357, 'loss/train': 1.395480751991272} 11/07/2021 04:44:11 - INFO - __main__ - Step 53359: {'lr': 0.00036558252680030546, 'samples': 10244928, 'steps': 53358, 'loss/train': 1.1462469100952148} 11/07/2021 04:44:11 - INFO - __main__ - Step 53360: {'lr': 0.0003655778212422135, 'samples': 10245120, 'steps': 53359, 'loss/train': 1.554935097694397} 11/07/2021 04:44:12 - INFO - __main__ - Step 53361: {'lr': 0.0003655731156320441, 'samples': 10245312, 'steps': 53360, 'loss/train': 0.32667481899261475} 11/07/2021 04:44:12 - INFO - __main__ - Step 53362: {'lr': 0.00036556840996979914, 'samples': 10245504, 'steps': 53361, 'loss/train': 1.145167350769043} 11/07/2021 04:44:13 - INFO - __main__ - Step 53363: {'lr': 0.0003655637042554809, 'samples': 10245696, 'steps': 53362, 'loss/train': 1.7694740295410156} 11/07/2021 04:44:13 - INFO - __main__ - Step 53364: {'lr': 0.0003655589984890914, 'samples': 10245888, 'steps': 53363, 'loss/train': 0.7376524209976196} 11/07/2021 04:44:14 - INFO - __main__ - Step 53365: {'lr': 0.00036555429267063277, 'samples': 10246080, 'steps': 53364, 'loss/train': 1.1962106227874756} 11/07/2021 04:44:14 - INFO - __main__ - Step 53366: {'lr': 0.0003655495868001072, 'samples': 10246272, 'steps': 53365, 'loss/train': 1.0792287588119507} 11/07/2021 04:44:14 - INFO - __main__ - Step 53367: {'lr': 0.00036554488087751674, 'samples': 10246464, 'steps': 53366, 'loss/train': 1.6274288892745972} 11/07/2021 04:44:16 - INFO - __main__ - Step 53368: {'lr': 0.00036554017490286354, 'samples': 10246656, 'steps': 53367, 'loss/train': 1.4863463640213013} 11/07/2021 04:44:16 - INFO - __main__ - Step 53369: {'lr': 0.0003655354688761498, 'samples': 10246848, 'steps': 53368, 'loss/train': 1.4312962293624878} 11/07/2021 04:44:16 - INFO - __main__ - Step 53370: {'lr': 0.00036553076279737743, 'samples': 10247040, 'steps': 53369, 'loss/train': 2.0777783393859863} 11/07/2021 04:44:17 - INFO - __main__ - Step 53371: {'lr': 0.0003655260566665488, 'samples': 10247232, 'steps': 53370, 'loss/train': 1.3745750188827515} 11/07/2021 04:44:17 - INFO - __main__ - Step 53372: {'lr': 0.0003655213504836659, 'samples': 10247424, 'steps': 53371, 'loss/train': 1.7350544929504395} 11/07/2021 04:44:17 - INFO - __main__ - Step 53373: {'lr': 0.00036551664424873084, 'samples': 10247616, 'steps': 53372, 'loss/train': 1.1542366743087769} 11/07/2021 04:44:18 - INFO - __main__ - Step 53374: {'lr': 0.00036551193796174577, 'samples': 10247808, 'steps': 53373, 'loss/train': 1.0817580223083496} 11/07/2021 04:44:19 - INFO - __main__ - Step 53375: {'lr': 0.0003655072316227127, 'samples': 10248000, 'steps': 53374, 'loss/train': 1.3213239908218384} 11/07/2021 04:44:19 - INFO - __main__ - Step 53376: {'lr': 0.000365502525231634, 'samples': 10248192, 'steps': 53375, 'loss/train': 1.5007151365280151} 11/07/2021 04:44:19 - INFO - __main__ - Step 53377: {'lr': 0.00036549781878851155, 'samples': 10248384, 'steps': 53376, 'loss/train': 1.3726071119308472} 11/07/2021 04:44:20 - INFO - __main__ - Step 53378: {'lr': 0.0003654931122933476, 'samples': 10248576, 'steps': 53377, 'loss/train': 0.986018180847168} 11/07/2021 04:44:21 - INFO - __main__ - Step 53379: {'lr': 0.0003654884057461443, 'samples': 10248768, 'steps': 53378, 'loss/train': 1.1477071046829224} 11/07/2021 04:44:21 - INFO - __main__ - Step 53380: {'lr': 0.0003654836991469036, 'samples': 10248960, 'steps': 53379, 'loss/train': 1.3435907363891602} 11/07/2021 04:44:21 - INFO - __main__ - Step 53381: {'lr': 0.00036547899249562776, 'samples': 10249152, 'steps': 53380, 'loss/train': 1.4615086317062378} 11/07/2021 04:44:22 - INFO - __main__ - Step 53382: {'lr': 0.00036547428579231886, 'samples': 10249344, 'steps': 53381, 'loss/train': 1.4760574102401733} 11/07/2021 04:44:22 - INFO - __main__ - Step 53383: {'lr': 0.000365469579036979, 'samples': 10249536, 'steps': 53382, 'loss/train': 1.1706430912017822} 11/07/2021 04:44:23 - INFO - __main__ - Step 53384: {'lr': 0.00036546487222961045, 'samples': 10249728, 'steps': 53383, 'loss/train': 1.2544550895690918} 11/07/2021 04:44:24 - INFO - __main__ - Step 53385: {'lr': 0.0003654601653702151, 'samples': 10249920, 'steps': 53384, 'loss/train': 1.4860308170318604} 11/07/2021 04:44:24 - INFO - __main__ - Step 53386: {'lr': 0.0003654554584587952, 'samples': 10250112, 'steps': 53385, 'loss/train': 1.3248411417007446} 11/07/2021 04:44:24 - INFO - __main__ - Step 53387: {'lr': 0.0003654507514953529, 'samples': 10250304, 'steps': 53386, 'loss/train': 1.7449822425842285} 11/07/2021 04:44:25 - INFO - __main__ - Step 53388: {'lr': 0.0003654460444798902, 'samples': 10250496, 'steps': 53387, 'loss/train': 1.3748877048492432} 11/07/2021 04:44:26 - INFO - __main__ - Step 53389: {'lr': 0.00036544133741240936, 'samples': 10250688, 'steps': 53388, 'loss/train': 1.5594673156738281} 11/07/2021 04:44:26 - INFO - __main__ - Step 53390: {'lr': 0.0003654366302929124, 'samples': 10250880, 'steps': 53389, 'loss/train': 1.599009394645691} 11/07/2021 04:44:26 - INFO - __main__ - Step 53391: {'lr': 0.0003654319231214015, 'samples': 10251072, 'steps': 53390, 'loss/train': 1.4899357557296753} 11/07/2021 04:44:27 - INFO - __main__ - Step 53392: {'lr': 0.00036542721589787877, 'samples': 10251264, 'steps': 53391, 'loss/train': 1.1830111742019653} 11/07/2021 04:44:27 - INFO - __main__ - Step 53393: {'lr': 0.0003654225086223463, 'samples': 10251456, 'steps': 53392, 'loss/train': 2.0893449783325195} 11/07/2021 04:44:28 - INFO - __main__ - Step 53394: {'lr': 0.00036541780129480616, 'samples': 10251648, 'steps': 53393, 'loss/train': 1.5052937269210815} 11/07/2021 04:44:28 - INFO - __main__ - Step 53395: {'lr': 0.00036541309391526064, 'samples': 10251840, 'steps': 53394, 'loss/train': 0.8931623697280884} 11/07/2021 04:44:29 - INFO - __main__ - Step 53396: {'lr': 0.0003654083864837117, 'samples': 10252032, 'steps': 53395, 'loss/train': 1.2950444221496582} 11/07/2021 04:44:29 - INFO - __main__ - Step 53397: {'lr': 0.0003654036790001616, 'samples': 10252224, 'steps': 53396, 'loss/train': 1.530462622642517} 11/07/2021 04:44:29 - INFO - __main__ - Step 53398: {'lr': 0.00036539897146461227, 'samples': 10252416, 'steps': 53397, 'loss/train': 1.3619475364685059} 11/07/2021 04:44:30 - INFO - __main__ - Step 53399: {'lr': 0.000365394263877066, 'samples': 10252608, 'steps': 53398, 'loss/train': 1.2248635292053223} 11/07/2021 04:44:31 - INFO - __main__ - Step 53400: {'lr': 0.0003653895562375248, 'samples': 10252800, 'steps': 53399, 'loss/train': 1.672544002532959} 11/07/2021 04:44:31 - INFO - __main__ - Step 53401: {'lr': 0.0003653848485459909, 'samples': 10252992, 'steps': 53400, 'loss/train': 1.9200359582901} 11/07/2021 04:44:31 - INFO - __main__ - Step 53402: {'lr': 0.0003653801408024664, 'samples': 10253184, 'steps': 53401, 'loss/train': 1.3319880962371826} 11/07/2021 04:44:32 - INFO - __main__ - Step 53403: {'lr': 0.00036537543300695335, 'samples': 10253376, 'steps': 53402, 'loss/train': 1.3862860202789307} 11/07/2021 04:44:33 - INFO - __main__ - Step 53404: {'lr': 0.0003653707251594539, 'samples': 10253568, 'steps': 53403, 'loss/train': 0.7829197645187378} 11/07/2021 04:44:33 - INFO - __main__ - Step 53405: {'lr': 0.0003653660172599702, 'samples': 10253760, 'steps': 53404, 'loss/train': 1.0968180894851685} 11/07/2021 04:44:34 - INFO - __main__ - Step 53406: {'lr': 0.00036536130930850435, 'samples': 10253952, 'steps': 53405, 'loss/train': 1.2832274436950684} 11/07/2021 04:44:34 - INFO - __main__ - Step 53407: {'lr': 0.0003653566013050585, 'samples': 10254144, 'steps': 53406, 'loss/train': 1.0513582229614258} 11/07/2021 04:44:34 - INFO - __main__ - Step 53408: {'lr': 0.0003653518932496347, 'samples': 10254336, 'steps': 53407, 'loss/train': 1.6917462348937988} 11/07/2021 04:44:35 - INFO - __main__ - Step 53409: {'lr': 0.00036534718514223517, 'samples': 10254528, 'steps': 53408, 'loss/train': 1.126509428024292} 11/07/2021 04:44:36 - INFO - __main__ - Step 53410: {'lr': 0.00036534247698286195, 'samples': 10254720, 'steps': 53409, 'loss/train': 1.294304609298706} 11/07/2021 04:44:36 - INFO - __main__ - Step 53411: {'lr': 0.0003653377687715171, 'samples': 10254912, 'steps': 53410, 'loss/train': 1.2954366207122803} 11/07/2021 04:44:36 - INFO - __main__ - Step 53412: {'lr': 0.00036533306050820296, 'samples': 10255104, 'steps': 53411, 'loss/train': 1.6132702827453613} 11/07/2021 04:44:37 - INFO - __main__ - Step 53413: {'lr': 0.00036532835219292147, 'samples': 10255296, 'steps': 53412, 'loss/train': 1.3538674116134644} 11/07/2021 04:44:37 - INFO - __main__ - Step 53414: {'lr': 0.0003653236438256748, 'samples': 10255488, 'steps': 53413, 'loss/train': 1.5556293725967407} 11/07/2021 04:44:38 - INFO - __main__ - Step 53415: {'lr': 0.0003653189354064652, 'samples': 10255680, 'steps': 53414, 'loss/train': 1.6456143856048584} 11/07/2021 04:44:38 - INFO - __main__ - Step 53416: {'lr': 0.0003653142269352945, 'samples': 10255872, 'steps': 53415, 'loss/train': 1.5403021574020386} 11/07/2021 04:44:39 - INFO - __main__ - Step 53417: {'lr': 0.00036530951841216505, 'samples': 10256064, 'steps': 53416, 'loss/train': 1.4491617679595947} 11/07/2021 04:44:39 - INFO - __main__ - Step 53418: {'lr': 0.00036530480983707885, 'samples': 10256256, 'steps': 53417, 'loss/train': 1.2381377220153809} 11/07/2021 04:44:39 - INFO - __main__ - Step 53419: {'lr': 0.0003653001012100382, 'samples': 10256448, 'steps': 53418, 'loss/train': 1.6028646230697632} 11/07/2021 04:44:41 - INFO - __main__ - Step 53420: {'lr': 0.00036529539253104507, 'samples': 10256640, 'steps': 53419, 'loss/train': 1.0861432552337646} 11/07/2021 04:44:41 - INFO - __main__ - Step 53421: {'lr': 0.00036529068380010155, 'samples': 10256832, 'steps': 53420, 'loss/train': 1.8178462982177734} 11/07/2021 04:44:41 - INFO - __main__ - Step 53422: {'lr': 0.00036528597501720984, 'samples': 10257024, 'steps': 53421, 'loss/train': 0.979706346988678} 11/07/2021 04:44:42 - INFO - __main__ - Step 53423: {'lr': 0.00036528126618237206, 'samples': 10257216, 'steps': 53422, 'loss/train': 1.5682344436645508} 11/07/2021 04:44:42 - INFO - __main__ - Step 53424: {'lr': 0.00036527655729559036, 'samples': 10257408, 'steps': 53423, 'loss/train': 1.6208622455596924} 11/07/2021 04:44:43 - INFO - __main__ - Step 53425: {'lr': 0.0003652718483568668, 'samples': 10257600, 'steps': 53424, 'loss/train': 1.4337201118469238} 11/07/2021 04:44:43 - INFO - __main__ - Step 53426: {'lr': 0.00036526713936620354, 'samples': 10257792, 'steps': 53425, 'loss/train': 1.003976821899414} 11/07/2021 04:44:44 - INFO - __main__ - Step 53427: {'lr': 0.00036526243032360264, 'samples': 10257984, 'steps': 53426, 'loss/train': 1.253382682800293} 11/07/2021 04:44:44 - INFO - __main__ - Step 53428: {'lr': 0.0003652577212290663, 'samples': 10258176, 'steps': 53427, 'loss/train': 1.2909741401672363} 11/07/2021 04:44:44 - INFO - __main__ - Step 53429: {'lr': 0.0003652530120825966, 'samples': 10258368, 'steps': 53428, 'loss/train': 1.1331803798675537} 11/07/2021 04:44:45 - INFO - __main__ - Step 53430: {'lr': 0.0003652483028841956, 'samples': 10258560, 'steps': 53429, 'loss/train': 1.1578407287597656} 11/07/2021 04:44:46 - INFO - __main__ - Step 53431: {'lr': 0.0003652435936338656, 'samples': 10258752, 'steps': 53430, 'loss/train': 1.2707332372665405} 11/07/2021 04:44:46 - INFO - __main__ - Step 53432: {'lr': 0.00036523888433160864, 'samples': 10258944, 'steps': 53431, 'loss/train': 0.9340760111808777} 11/07/2021 04:44:46 - INFO - __main__ - Step 53433: {'lr': 0.00036523417497742673, 'samples': 10259136, 'steps': 53432, 'loss/train': 1.292786717414856} 11/07/2021 04:44:47 - INFO - __main__ - Step 53434: {'lr': 0.00036522946557132206, 'samples': 10259328, 'steps': 53433, 'loss/train': 1.3662357330322266} 11/07/2021 04:44:48 - INFO - __main__ - Step 53435: {'lr': 0.00036522475611329685, 'samples': 10259520, 'steps': 53434, 'loss/train': 1.433696985244751} 11/07/2021 04:44:48 - INFO - __main__ - Step 53436: {'lr': 0.00036522004660335304, 'samples': 10259712, 'steps': 53435, 'loss/train': 1.3966200351715088} 11/07/2021 04:44:49 - INFO - __main__ - Step 53437: {'lr': 0.000365215337041493, 'samples': 10259904, 'steps': 53436, 'loss/train': 1.2878247499465942} 11/07/2021 04:44:49 - INFO - __main__ - Step 53438: {'lr': 0.00036521062742771865, 'samples': 10260096, 'steps': 53437, 'loss/train': 0.956116795539856} 11/07/2021 04:44:49 - INFO - __main__ - Step 53439: {'lr': 0.0003652059177620322, 'samples': 10260288, 'steps': 53438, 'loss/train': 0.9694187641143799} 11/07/2021 04:44:50 - INFO - __main__ - Step 53440: {'lr': 0.00036520120804443563, 'samples': 10260480, 'steps': 53439, 'loss/train': 0.9357909560203552} 11/07/2021 04:44:51 - INFO - __main__ - Step 53441: {'lr': 0.00036519649827493117, 'samples': 10260672, 'steps': 53440, 'loss/train': 1.8616234064102173} 11/07/2021 04:44:51 - INFO - __main__ - Step 53442: {'lr': 0.000365191788453521, 'samples': 10260864, 'steps': 53441, 'loss/train': 1.3704938888549805} 11/07/2021 04:44:51 - INFO - __main__ - Step 53443: {'lr': 0.0003651870785802072, 'samples': 10261056, 'steps': 53442, 'loss/train': 0.5557447075843811} 11/07/2021 04:44:52 - INFO - __main__ - Step 53444: {'lr': 0.00036518236865499187, 'samples': 10261248, 'steps': 53443, 'loss/train': 1.4378814697265625} 11/07/2021 04:44:53 - INFO - __main__ - Step 53445: {'lr': 0.0003651776586778772, 'samples': 10261440, 'steps': 53444, 'loss/train': 1.5799425840377808} 11/07/2021 04:44:53 - INFO - __main__ - Step 53446: {'lr': 0.00036517294864886517, 'samples': 10261632, 'steps': 53445, 'loss/train': 2.1427876949310303} 11/07/2021 04:44:53 - INFO - __main__ - Step 53447: {'lr': 0.00036516823856795806, 'samples': 10261824, 'steps': 53446, 'loss/train': 0.9222480654716492} 11/07/2021 04:44:54 - INFO - __main__ - Step 53448: {'lr': 0.0003651635284351579, 'samples': 10262016, 'steps': 53447, 'loss/train': 1.2064964771270752} 11/07/2021 04:44:54 - INFO - __main__ - Step 53449: {'lr': 0.00036515881825046676, 'samples': 10262208, 'steps': 53448, 'loss/train': 1.5260143280029297} 11/07/2021 04:44:55 - INFO - __main__ - Step 53450: {'lr': 0.00036515410801388686, 'samples': 10262400, 'steps': 53449, 'loss/train': 1.222004771232605} 11/07/2021 04:44:55 - INFO - __main__ - Step 53451: {'lr': 0.0003651493977254204, 'samples': 10262592, 'steps': 53450, 'loss/train': 1.1987919807434082} 11/07/2021 04:44:56 - INFO - __main__ - Step 53452: {'lr': 0.0003651446873850693, 'samples': 10262784, 'steps': 53451, 'loss/train': 1.5520845651626587} 11/07/2021 04:44:56 - INFO - __main__ - Step 53453: {'lr': 0.0003651399769928358, 'samples': 10262976, 'steps': 53452, 'loss/train': 1.459889531135559} 11/07/2021 04:44:57 - INFO - __main__ - Step 53454: {'lr': 0.000365135266548722, 'samples': 10263168, 'steps': 53453, 'loss/train': 1.5366923809051514} 11/07/2021 04:44:58 - INFO - __main__ - Step 53455: {'lr': 0.00036513055605273, 'samples': 10263360, 'steps': 53454, 'loss/train': 3.485288143157959} 11/07/2021 04:44:58 - INFO - __main__ - Step 53456: {'lr': 0.0003651258455048619, 'samples': 10263552, 'steps': 53455, 'loss/train': 1.5448509454727173} 11/07/2021 04:44:58 - INFO - __main__ - Step 53457: {'lr': 0.00036512113490512, 'samples': 10263744, 'steps': 53456, 'loss/train': 1.5369842052459717} 11/07/2021 04:44:59 - INFO - __main__ - Step 53458: {'lr': 0.00036511642425350626, 'samples': 10263936, 'steps': 53457, 'loss/train': 1.3163940906524658} 11/07/2021 04:44:59 - INFO - __main__ - Step 53459: {'lr': 0.00036511171355002283, 'samples': 10264128, 'steps': 53458, 'loss/train': 1.2179282903671265} 11/07/2021 04:45:00 - INFO - __main__ - Step 53460: {'lr': 0.0003651070027946718, 'samples': 10264320, 'steps': 53459, 'loss/train': 1.1904709339141846} 11/07/2021 04:45:01 - INFO - __main__ - Step 53461: {'lr': 0.0003651022919874554, 'samples': 10264512, 'steps': 53460, 'loss/train': 1.4106436967849731} 11/07/2021 04:45:01 - INFO - __main__ - Step 53462: {'lr': 0.0003650975811283756, 'samples': 10264704, 'steps': 53461, 'loss/train': 1.307440996170044} 11/07/2021 04:45:01 - INFO - __main__ - Step 53463: {'lr': 0.00036509287021743465, 'samples': 10264896, 'steps': 53462, 'loss/train': 0.7371805906295776} 11/07/2021 04:45:02 - INFO - __main__ - Step 53464: {'lr': 0.00036508815925463456, 'samples': 10265088, 'steps': 53463, 'loss/train': 0.8744301199913025} 11/07/2021 04:45:02 - INFO - __main__ - Step 53465: {'lr': 0.0003650834482399776, 'samples': 10265280, 'steps': 53464, 'loss/train': 1.3795242309570312} 11/07/2021 04:45:03 - INFO - __main__ - Step 53466: {'lr': 0.00036507873717346584, 'samples': 10265472, 'steps': 53465, 'loss/train': 1.2720105648040771} 11/07/2021 04:45:03 - INFO - __main__ - Step 53467: {'lr': 0.00036507402605510134, 'samples': 10265664, 'steps': 53466, 'loss/train': 1.4956802129745483} 11/07/2021 04:45:04 - INFO - __main__ - Step 53468: {'lr': 0.00036506931488488627, 'samples': 10265856, 'steps': 53467, 'loss/train': 1.259634017944336} 11/07/2021 04:45:04 - INFO - __main__ - Step 53469: {'lr': 0.0003650646036628227, 'samples': 10266048, 'steps': 53468, 'loss/train': 1.3439853191375732} 11/07/2021 04:45:04 - INFO - __main__ - Step 53470: {'lr': 0.0003650598923889128, 'samples': 10266240, 'steps': 53469, 'loss/train': 1.5502805709838867} 11/07/2021 04:45:06 - INFO - __main__ - Step 53471: {'lr': 0.0003650551810631587, 'samples': 10266432, 'steps': 53470, 'loss/train': 1.3919814825057983} 11/07/2021 04:45:06 - INFO - __main__ - Step 53472: {'lr': 0.00036505046968556253, 'samples': 10266624, 'steps': 53471, 'loss/train': 1.6537296772003174} 11/07/2021 04:45:06 - INFO - __main__ - Step 53473: {'lr': 0.0003650457582561264, 'samples': 10266816, 'steps': 53472, 'loss/train': 4.414062023162842} 11/07/2021 04:45:07 - INFO - __main__ - Step 53474: {'lr': 0.0003650410467748524, 'samples': 10267008, 'steps': 53473, 'loss/train': 1.4036349058151245} 11/07/2021 04:45:07 - INFO - __main__ - Step 53475: {'lr': 0.0003650363352417427, 'samples': 10267200, 'steps': 53474, 'loss/train': 1.817798376083374} 11/07/2021 04:45:07 - INFO - __main__ - Step 53476: {'lr': 0.00036503162365679936, 'samples': 10267392, 'steps': 53475, 'loss/train': 1.4261497259140015} 11/07/2021 04:45:08 - INFO - __main__ - Step 53477: {'lr': 0.00036502691202002456, 'samples': 10267584, 'steps': 53476, 'loss/train': 1.447472095489502} 11/07/2021 04:45:09 - INFO - __main__ - Step 53478: {'lr': 0.00036502220033142045, 'samples': 10267776, 'steps': 53477, 'loss/train': 1.4116909503936768} 11/07/2021 04:45:09 - INFO - __main__ - Step 53479: {'lr': 0.0003650174885909891, 'samples': 10267968, 'steps': 53478, 'loss/train': 1.485318899154663} 11/07/2021 04:45:10 - INFO - __main__ - Step 53480: {'lr': 0.0003650127767987326, 'samples': 10268160, 'steps': 53479, 'loss/train': 1.534500002861023} 11/07/2021 04:45:10 - INFO - __main__ - Step 53481: {'lr': 0.00036500806495465315, 'samples': 10268352, 'steps': 53480, 'loss/train': 1.147910475730896} 11/07/2021 04:45:11 - INFO - __main__ - Step 53482: {'lr': 0.0003650033530587529, 'samples': 10268544, 'steps': 53481, 'loss/train': 1.1712579727172852} 11/07/2021 04:45:11 - INFO - __main__ - Step 53483: {'lr': 0.00036499864111103384, 'samples': 10268736, 'steps': 53482, 'loss/train': 1.265006184577942} 11/07/2021 04:45:12 - INFO - __main__ - Step 53484: {'lr': 0.00036499392911149817, 'samples': 10268928, 'steps': 53483, 'loss/train': 1.2264678478240967} 11/07/2021 04:45:12 - INFO - __main__ - Step 53485: {'lr': 0.00036498921706014804, 'samples': 10269120, 'steps': 53484, 'loss/train': 1.565567135810852} 11/07/2021 04:45:12 - INFO - __main__ - Step 53486: {'lr': 0.00036498450495698557, 'samples': 10269312, 'steps': 53485, 'loss/train': 1.284726858139038} 11/07/2021 04:45:13 - INFO - __main__ - Step 53487: {'lr': 0.00036497979280201276, 'samples': 10269504, 'steps': 53486, 'loss/train': 1.3235887289047241} 11/07/2021 04:45:14 - INFO - __main__ - Step 53488: {'lr': 0.0003649750805952319, 'samples': 10269696, 'steps': 53487, 'loss/train': 1.6617445945739746} 11/07/2021 04:45:14 - INFO - __main__ - Step 53489: {'lr': 0.000364970368336645, 'samples': 10269888, 'steps': 53488, 'loss/train': 1.4836846590042114} 11/07/2021 04:45:14 - INFO - __main__ - Step 53490: {'lr': 0.0003649656560262542, 'samples': 10270080, 'steps': 53489, 'loss/train': 1.16900634765625} 11/07/2021 04:45:15 - INFO - __main__ - Step 53491: {'lr': 0.00036496094366406166, 'samples': 10270272, 'steps': 53490, 'loss/train': 1.4340850114822388} 11/07/2021 04:45:16 - INFO - __main__ - Step 53492: {'lr': 0.0003649562312500696, 'samples': 10270464, 'steps': 53491, 'loss/train': 1.9902280569076538} 11/07/2021 04:45:16 - INFO - __main__ - Step 53493: {'lr': 0.00036495151878427994, 'samples': 10270656, 'steps': 53492, 'loss/train': 0.06550008803606033} 11/07/2021 04:45:17 - INFO - __main__ - Step 53494: {'lr': 0.00036494680626669495, 'samples': 10270848, 'steps': 53493, 'loss/train': 1.1288018226623535} 11/07/2021 04:45:17 - INFO - __main__ - Step 53495: {'lr': 0.00036494209369731666, 'samples': 10271040, 'steps': 53494, 'loss/train': 1.6085138320922852} 11/07/2021 04:45:17 - INFO - __main__ - Step 53496: {'lr': 0.0003649373810761473, 'samples': 10271232, 'steps': 53495, 'loss/train': 1.1278513669967651} 11/07/2021 04:45:18 - INFO - __main__ - Step 53497: {'lr': 0.00036493266840318886, 'samples': 10271424, 'steps': 53496, 'loss/train': 1.226789951324463} 11/07/2021 04:45:19 - INFO - __main__ - Step 53498: {'lr': 0.0003649279556784436, 'samples': 10271616, 'steps': 53497, 'loss/train': 1.1760696172714233} 11/07/2021 04:45:19 - INFO - __main__ - Step 53499: {'lr': 0.0003649232429019135, 'samples': 10271808, 'steps': 53498, 'loss/train': 1.5847797393798828} 11/07/2021 04:45:19 - INFO - __main__ - Step 53500: {'lr': 0.0003649185300736008, 'samples': 10272000, 'steps': 53499, 'loss/train': 1.463171124458313} 11/07/2021 04:45:20 - INFO - __main__ - Step 53501: {'lr': 0.0003649138171935076, 'samples': 10272192, 'steps': 53500, 'loss/train': 1.2854593992233276} 11/07/2021 04:45:21 - INFO - __main__ - Step 53502: {'lr': 0.0003649091042616359, 'samples': 10272384, 'steps': 53501, 'loss/train': 1.0506432056427002} 11/07/2021 04:45:21 - INFO - __main__ - Step 53503: {'lr': 0.000364904391277988, 'samples': 10272576, 'steps': 53502, 'loss/train': 0.06548427045345306} 11/07/2021 04:45:22 - INFO - __main__ - Step 53504: {'lr': 0.00036489967824256597, 'samples': 10272768, 'steps': 53503, 'loss/train': 1.4971596002578735} 11/07/2021 04:45:22 - INFO - __main__ - Step 53505: {'lr': 0.000364894965155372, 'samples': 10272960, 'steps': 53504, 'loss/train': 5.800402641296387} 11/07/2021 04:45:22 - INFO - __main__ - Step 53506: {'lr': 0.000364890252016408, 'samples': 10273152, 'steps': 53505, 'loss/train': 1.3495075702667236} 11/07/2021 04:45:23 - INFO - __main__ - Step 53507: {'lr': 0.0003648855388256763, 'samples': 10273344, 'steps': 53506, 'loss/train': 1.5951036214828491} 11/07/2021 04:45:24 - INFO - __main__ - Step 53508: {'lr': 0.0003648808255831789, 'samples': 10273536, 'steps': 53507, 'loss/train': 1.4750956296920776} 11/07/2021 04:45:24 - INFO - __main__ - Step 53509: {'lr': 0.00036487611228891805, 'samples': 10273728, 'steps': 53508, 'loss/train': 1.296762466430664} 11/07/2021 04:45:25 - INFO - __main__ - Step 53510: {'lr': 0.00036487139894289566, 'samples': 10273920, 'steps': 53509, 'loss/train': 1.7952351570129395} 11/07/2021 04:45:25 - INFO - __main__ - Step 53511: {'lr': 0.0003648666855451141, 'samples': 10274112, 'steps': 53510, 'loss/train': 1.3936107158660889} 11/07/2021 04:45:25 - INFO - __main__ - Step 53512: {'lr': 0.0003648619720955754, 'samples': 10274304, 'steps': 53511, 'loss/train': 1.4741666316986084} 11/07/2021 04:45:27 - INFO - __main__ - Step 53513: {'lr': 0.00036485725859428163, 'samples': 10274496, 'steps': 53512, 'loss/train': 0.07048851251602173} 11/07/2021 04:45:27 - INFO - __main__ - Step 53514: {'lr': 0.00036485254504123495, 'samples': 10274688, 'steps': 53513, 'loss/train': 0.936514139175415} 11/07/2021 04:45:27 - INFO - __main__ - Step 53515: {'lr': 0.00036484783143643745, 'samples': 10274880, 'steps': 53514, 'loss/train': 1.230549693107605} 11/07/2021 04:45:28 - INFO - __main__ - Step 53516: {'lr': 0.0003648431177798913, 'samples': 10275072, 'steps': 53515, 'loss/train': 1.5734517574310303} 11/07/2021 04:45:28 - INFO - __main__ - Step 53517: {'lr': 0.00036483840407159864, 'samples': 10275264, 'steps': 53516, 'loss/train': 1.1997522115707397} 11/07/2021 04:45:29 - INFO - __main__ - Step 53518: {'lr': 0.0003648336903115616, 'samples': 10275456, 'steps': 53517, 'loss/train': 1.3396321535110474} 11/07/2021 04:45:29 - INFO - __main__ - Step 53519: {'lr': 0.0003648289764997823, 'samples': 10275648, 'steps': 53518, 'loss/train': 1.186674952507019} 11/07/2021 04:45:30 - INFO - __main__ - Step 53520: {'lr': 0.00036482426263626265, 'samples': 10275840, 'steps': 53519, 'loss/train': 1.2877269983291626} 11/07/2021 04:45:30 - INFO - __main__ - Step 53521: {'lr': 0.0003648195487210051, 'samples': 10276032, 'steps': 53520, 'loss/train': 0.8472537398338318} 11/07/2021 04:45:30 - INFO - __main__ - Step 53522: {'lr': 0.0003648148347540116, 'samples': 10276224, 'steps': 53521, 'loss/train': 1.4158985614776611} 11/07/2021 04:45:31 - INFO - __main__ - Step 53523: {'lr': 0.0003648101207352843, 'samples': 10276416, 'steps': 53522, 'loss/train': 1.3949472904205322} 11/07/2021 04:45:32 - INFO - __main__ - Step 53524: {'lr': 0.00036480540666482535, 'samples': 10276608, 'steps': 53523, 'loss/train': 1.6620945930480957} 11/07/2021 04:45:32 - INFO - __main__ - Step 53525: {'lr': 0.00036480069254263693, 'samples': 10276800, 'steps': 53524, 'loss/train': 1.4589643478393555} 11/07/2021 04:45:32 - INFO - __main__ - Step 53526: {'lr': 0.000364795978368721, 'samples': 10276992, 'steps': 53525, 'loss/train': 1.1077258586883545} 11/07/2021 04:45:33 - INFO - __main__ - Step 53527: {'lr': 0.0003647912641430798, 'samples': 10277184, 'steps': 53526, 'loss/train': 1.6570274829864502} 11/07/2021 04:45:34 - INFO - __main__ - Step 53528: {'lr': 0.0003647865498657154, 'samples': 10277376, 'steps': 53527, 'loss/train': 1.724677324295044} 11/07/2021 04:45:34 - INFO - __main__ - Step 53529: {'lr': 0.0003647818355366299, 'samples': 10277568, 'steps': 53528, 'loss/train': 0.858352541923523} 11/07/2021 04:45:35 - INFO - __main__ - Step 53530: {'lr': 0.00036477712115582555, 'samples': 10277760, 'steps': 53529, 'loss/train': 1.4554532766342163} 11/07/2021 04:45:35 - INFO - __main__ - Step 53531: {'lr': 0.0003647724067233044, 'samples': 10277952, 'steps': 53530, 'loss/train': 1.0617146492004395} 11/07/2021 04:45:35 - INFO - __main__ - Step 53532: {'lr': 0.00036476769223906864, 'samples': 10278144, 'steps': 53531, 'loss/train': 0.09869363158941269} 11/07/2021 04:45:36 - INFO - __main__ - Step 53533: {'lr': 0.0003647629777031202, 'samples': 10278336, 'steps': 53532, 'loss/train': 1.4492369890213013} 11/07/2021 04:45:37 - INFO - __main__ - Step 53534: {'lr': 0.0003647582631154614, 'samples': 10278528, 'steps': 53533, 'loss/train': 1.1524577140808105} 11/07/2021 04:45:37 - INFO - __main__ - Step 53535: {'lr': 0.00036475354847609434, 'samples': 10278720, 'steps': 53534, 'loss/train': 1.536961555480957} 11/07/2021 04:45:37 - INFO - __main__ - Step 53536: {'lr': 0.000364748833785021, 'samples': 10278912, 'steps': 53535, 'loss/train': 1.051010012626648} 11/07/2021 04:45:38 - INFO - __main__ - Step 53537: {'lr': 0.0003647441190422437, 'samples': 10279104, 'steps': 53536, 'loss/train': 1.30070161819458} 11/07/2021 04:45:38 - INFO - __main__ - Step 53538: {'lr': 0.00036473940424776443, 'samples': 10279296, 'steps': 53537, 'loss/train': 1.432908535003662} 11/07/2021 04:45:39 - INFO - __main__ - Step 53539: {'lr': 0.0003647346894015853, 'samples': 10279488, 'steps': 53538, 'loss/train': 1.2249596118927002} 11/07/2021 04:45:39 - INFO - __main__ - Step 53540: {'lr': 0.0003647299745037085, 'samples': 10279680, 'steps': 53539, 'loss/train': 1.7161222696304321} 11/07/2021 04:45:40 - INFO - __main__ - Step 53541: {'lr': 0.00036472525955413626, 'samples': 10279872, 'steps': 53540, 'loss/train': 1.095268964767456} 11/07/2021 04:45:40 - INFO - __main__ - Step 53542: {'lr': 0.00036472054455287053, 'samples': 10280064, 'steps': 53541, 'loss/train': 1.3333752155303955} 11/07/2021 04:45:40 - INFO - __main__ - Step 53543: {'lr': 0.00036471582949991347, 'samples': 10280256, 'steps': 53542, 'loss/train': 1.6389453411102295} 11/07/2021 04:45:42 - INFO - __main__ - Step 53544: {'lr': 0.0003647111143952672, 'samples': 10280448, 'steps': 53543, 'loss/train': 1.4052221775054932} 11/07/2021 04:45:42 - INFO - __main__ - Step 53545: {'lr': 0.0003647063992389339, 'samples': 10280640, 'steps': 53544, 'loss/train': 1.2893710136413574} 11/07/2021 04:45:42 - INFO - __main__ - Step 53546: {'lr': 0.00036470168403091567, 'samples': 10280832, 'steps': 53545, 'loss/train': 1.4591162204742432} 11/07/2021 04:45:43 - INFO - __main__ - Step 53547: {'lr': 0.00036469696877121464, 'samples': 10281024, 'steps': 53546, 'loss/train': 1.500877022743225} 11/07/2021 04:45:43 - INFO - __main__ - Step 53548: {'lr': 0.000364692253459833, 'samples': 10281216, 'steps': 53547, 'loss/train': 1.5913686752319336} 11/07/2021 04:45:44 - INFO - __main__ - Step 53549: {'lr': 0.0003646875380967727, 'samples': 10281408, 'steps': 53548, 'loss/train': 1.3998956680297852} 11/07/2021 04:45:45 - INFO - __main__ - Step 53550: {'lr': 0.00036468282268203595, 'samples': 10281600, 'steps': 53549, 'loss/train': 1.3212279081344604} 11/07/2021 04:45:45 - INFO - __main__ - Step 53551: {'lr': 0.0003646781072156249, 'samples': 10281792, 'steps': 53550, 'loss/train': 0.31877216696739197} 11/07/2021 04:45:45 - INFO - __main__ - Step 53552: {'lr': 0.00036467339169754173, 'samples': 10281984, 'steps': 53551, 'loss/train': 1.8028637170791626} 11/07/2021 04:45:46 - INFO - __main__ - Step 53553: {'lr': 0.0003646686761277884, 'samples': 10282176, 'steps': 53552, 'loss/train': 1.836937665939331} 11/07/2021 04:45:46 - INFO - __main__ - Step 53554: {'lr': 0.00036466396050636725, 'samples': 10282368, 'steps': 53553, 'loss/train': 2.672879457473755} 11/07/2021 04:45:47 - INFO - __main__ - Step 53555: {'lr': 0.0003646592448332802, 'samples': 10282560, 'steps': 53554, 'loss/train': 1.848436951637268} 11/07/2021 04:45:48 - INFO - __main__ - Step 53556: {'lr': 0.00036465452910852946, 'samples': 10282752, 'steps': 53555, 'loss/train': 1.1363451480865479} 11/07/2021 04:45:48 - INFO - __main__ - Step 53557: {'lr': 0.00036464981333211724, 'samples': 10282944, 'steps': 53556, 'loss/train': 0.9401386976242065} 11/07/2021 04:45:48 - INFO - __main__ - Step 53558: {'lr': 0.0003646450975040455, 'samples': 10283136, 'steps': 53557, 'loss/train': 1.7269724607467651} 11/07/2021 04:45:49 - INFO - __main__ - Step 53559: {'lr': 0.00036464038162431657, 'samples': 10283328, 'steps': 53558, 'loss/train': 1.0810272693634033} 11/07/2021 04:45:50 - INFO - __main__ - Step 53560: {'lr': 0.00036463566569293235, 'samples': 10283520, 'steps': 53559, 'loss/train': 1.4755587577819824} 11/07/2021 04:45:50 - INFO - __main__ - Step 53561: {'lr': 0.0003646309497098951, 'samples': 10283712, 'steps': 53560, 'loss/train': 1.209079623222351} 11/07/2021 04:45:50 - INFO - __main__ - Step 53562: {'lr': 0.00036462623367520684, 'samples': 10283904, 'steps': 53561, 'loss/train': 1.545916199684143} 11/07/2021 04:45:51 - INFO - __main__ - Step 53563: {'lr': 0.00036462151758886985, 'samples': 10284096, 'steps': 53562, 'loss/train': 1.3853113651275635} 11/07/2021 04:45:51 - INFO - __main__ - Step 53564: {'lr': 0.0003646168014508861, 'samples': 10284288, 'steps': 53563, 'loss/train': 1.524139404296875} 11/07/2021 04:45:52 - INFO - __main__ - Step 53565: {'lr': 0.00036461208526125785, 'samples': 10284480, 'steps': 53564, 'loss/train': 1.2327629327774048} 11/07/2021 04:45:53 - INFO - __main__ - Step 53566: {'lr': 0.0003646073690199872, 'samples': 10284672, 'steps': 53565, 'loss/train': 1.8953274488449097} 11/07/2021 04:45:53 - INFO - __main__ - Step 53567: {'lr': 0.00036460265272707617, 'samples': 10284864, 'steps': 53566, 'loss/train': 1.569270372390747} 11/07/2021 04:45:53 - INFO - __main__ - Step 53568: {'lr': 0.000364597936382527, 'samples': 10285056, 'steps': 53567, 'loss/train': 1.6651109457015991} 11/07/2021 04:45:54 - INFO - __main__ - Step 53569: {'lr': 0.0003645932199863417, 'samples': 10285248, 'steps': 53568, 'loss/train': 0.5657076239585876} 11/07/2021 04:45:54 - INFO - __main__ - Step 53570: {'lr': 0.00036458850353852246, 'samples': 10285440, 'steps': 53569, 'loss/train': 1.2051150798797607} 11/07/2021 04:45:55 - INFO - __main__ - Step 53571: {'lr': 0.0003645837870390715, 'samples': 10285632, 'steps': 53570, 'loss/train': 1.6608856916427612} 11/07/2021 04:45:55 - INFO - __main__ - Step 53572: {'lr': 0.00036457907048799084, 'samples': 10285824, 'steps': 53571, 'loss/train': 1.7461100816726685} 11/07/2021 04:45:56 - INFO - __main__ - Step 53573: {'lr': 0.00036457435388528257, 'samples': 10286016, 'steps': 53572, 'loss/train': 1.378989338874817} 11/07/2021 04:45:56 - INFO - __main__ - Step 53574: {'lr': 0.0003645696372309488, 'samples': 10286208, 'steps': 53573, 'loss/train': 1.4414912462234497} 11/07/2021 04:45:57 - INFO - __main__ - Step 53575: {'lr': 0.00036456492052499185, 'samples': 10286400, 'steps': 53574, 'loss/train': 1.825268030166626} 11/07/2021 04:45:58 - INFO - __main__ - Step 53576: {'lr': 0.00036456020376741363, 'samples': 10286592, 'steps': 53575, 'loss/train': 1.3124676942825317} 11/07/2021 04:45:58 - INFO - __main__ - Step 53577: {'lr': 0.0003645554869582164, 'samples': 10286784, 'steps': 53576, 'loss/train': 1.6099679470062256} 11/07/2021 04:45:58 - INFO - __main__ - Step 53578: {'lr': 0.0003645507700974022, 'samples': 10286976, 'steps': 53577, 'loss/train': 1.296777606010437} 11/07/2021 04:45:59 - INFO - __main__ - Step 53579: {'lr': 0.00036454605318497323, 'samples': 10287168, 'steps': 53578, 'loss/train': 1.4289377927780151} 11/07/2021 04:45:59 - INFO - __main__ - Step 53580: {'lr': 0.00036454133622093154, 'samples': 10287360, 'steps': 53579, 'loss/train': 1.4092215299606323} 11/07/2021 04:46:00 - INFO - __main__ - Step 53581: {'lr': 0.00036453661920527933, 'samples': 10287552, 'steps': 53580, 'loss/train': 1.2635722160339355} 11/07/2021 04:46:00 - INFO - __main__ - Step 53582: {'lr': 0.0003645319021380186, 'samples': 10287744, 'steps': 53581, 'loss/train': 0.8141332268714905} 11/07/2021 04:46:01 - INFO - __main__ - Step 53583: {'lr': 0.00036452718501915165, 'samples': 10287936, 'steps': 53582, 'loss/train': 1.2602858543395996} 11/07/2021 04:46:01 - INFO - __main__ - Step 53584: {'lr': 0.00036452246784868047, 'samples': 10288128, 'steps': 53583, 'loss/train': 1.5883314609527588} 11/07/2021 04:46:01 - INFO - __main__ - Step 53585: {'lr': 0.0003645177506266072, 'samples': 10288320, 'steps': 53584, 'loss/train': 1.6093188524246216} 11/07/2021 04:46:03 - INFO - __main__ - Step 53586: {'lr': 0.0003645130333529342, 'samples': 10288512, 'steps': 53585, 'loss/train': 5.9109697341918945} 11/07/2021 04:46:03 - INFO - __main__ - Step 53587: {'lr': 0.0003645083160276632, 'samples': 10288704, 'steps': 53586, 'loss/train': 1.1899070739746094} 11/07/2021 04:46:03 - INFO - __main__ - Step 53588: {'lr': 0.0003645035986507966, 'samples': 10288896, 'steps': 53587, 'loss/train': 1.0773814916610718} 11/07/2021 04:46:04 - INFO - __main__ - Step 53589: {'lr': 0.00036449888122233636, 'samples': 10289088, 'steps': 53588, 'loss/train': 1.2378230094909668} 11/07/2021 04:46:04 - INFO - __main__ - Step 53590: {'lr': 0.00036449416374228474, 'samples': 10289280, 'steps': 53589, 'loss/train': 1.4816677570343018} 11/07/2021 04:46:04 - INFO - __main__ - Step 53591: {'lr': 0.00036448944621064386, 'samples': 10289472, 'steps': 53590, 'loss/train': 1.2232826948165894} 11/07/2021 04:46:05 - INFO - __main__ - Step 53592: {'lr': 0.00036448472862741577, 'samples': 10289664, 'steps': 53591, 'loss/train': 1.4840363264083862} 11/07/2021 04:46:06 - INFO - __main__ - Step 53593: {'lr': 0.0003644800109926026, 'samples': 10289856, 'steps': 53592, 'loss/train': 1.1795276403427124} 11/07/2021 04:46:06 - INFO - __main__ - Step 53594: {'lr': 0.00036447529330620653, 'samples': 10290048, 'steps': 53593, 'loss/train': 1.5766221284866333} 11/07/2021 04:46:07 - INFO - __main__ - Step 53595: {'lr': 0.0003644705755682296, 'samples': 10290240, 'steps': 53594, 'loss/train': 1.8372597694396973} 11/07/2021 04:46:07 - INFO - __main__ - Step 53596: {'lr': 0.00036446585777867406, 'samples': 10290432, 'steps': 53595, 'loss/train': 1.5694047212600708} 11/07/2021 04:46:08 - INFO - __main__ - Step 53597: {'lr': 0.0003644611399375419, 'samples': 10290624, 'steps': 53596, 'loss/train': 1.432969570159912} 11/07/2021 04:46:08 - INFO - __main__ - Step 53598: {'lr': 0.0003644564220448354, 'samples': 10290816, 'steps': 53597, 'loss/train': 1.5193006992340088} 11/07/2021 04:46:09 - INFO - __main__ - Step 53599: {'lr': 0.0003644517041005566, 'samples': 10291008, 'steps': 53598, 'loss/train': 1.0880459547042847} 11/07/2021 04:46:09 - INFO - __main__ - Step 53600: {'lr': 0.0003644469861047076, 'samples': 10291200, 'steps': 53599, 'loss/train': 1.2695410251617432} 11/07/2021 04:46:09 - INFO - __main__ - Step 53601: {'lr': 0.0003644422680572906, 'samples': 10291392, 'steps': 53600, 'loss/train': 1.6431523561477661} 11/07/2021 04:46:10 - INFO - __main__ - Step 53602: {'lr': 0.00036443754995830763, 'samples': 10291584, 'steps': 53601, 'loss/train': 1.2466450929641724} 11/07/2021 04:46:11 - INFO - __main__ - Step 53603: {'lr': 0.0003644328318077609, 'samples': 10291776, 'steps': 53602, 'loss/train': 0.5655657649040222} 11/07/2021 04:46:11 - INFO - __main__ - Step 53604: {'lr': 0.0003644281136056524, 'samples': 10291968, 'steps': 53603, 'loss/train': 1.275779366493225} 11/07/2021 04:46:11 - INFO - __main__ - Step 53605: {'lr': 0.00036442339535198444, 'samples': 10292160, 'steps': 53604, 'loss/train': 1.2544212341308594} 11/07/2021 04:46:12 - INFO - __main__ - Step 53606: {'lr': 0.00036441867704675913, 'samples': 10292352, 'steps': 53605, 'loss/train': 1.4334867000579834} 11/07/2021 04:46:13 - INFO - __main__ - Step 53607: {'lr': 0.00036441395868997843, 'samples': 10292544, 'steps': 53606, 'loss/train': 0.7503970265388489} 11/07/2021 04:46:13 - INFO - __main__ - Step 53608: {'lr': 0.00036440924028164457, 'samples': 10292736, 'steps': 53607, 'loss/train': 1.4109858274459839} 11/07/2021 04:46:13 - INFO - __main__ - Step 53609: {'lr': 0.0003644045218217597, 'samples': 10292928, 'steps': 53608, 'loss/train': 0.9213677644729614} 11/07/2021 04:46:14 - INFO - __main__ - Step 53610: {'lr': 0.000364399803310326, 'samples': 10293120, 'steps': 53609, 'loss/train': 1.2618991136550903} 11/07/2021 04:46:14 - INFO - __main__ - Step 53611: {'lr': 0.0003643950847473453, 'samples': 10293312, 'steps': 53610, 'loss/train': 1.7567811012268066} 11/07/2021 04:46:15 - INFO - __main__ - Step 53612: {'lr': 0.0003643903661328201, 'samples': 10293504, 'steps': 53611, 'loss/train': 1.2855720520019531} 11/07/2021 04:46:16 - INFO - __main__ - Step 53613: {'lr': 0.0003643856474667524, 'samples': 10293696, 'steps': 53612, 'loss/train': 2.0284533500671387} 11/07/2021 04:46:16 - INFO - __main__ - Step 53614: {'lr': 0.0003643809287491442, 'samples': 10293888, 'steps': 53613, 'loss/train': 2.2099084854125977} 11/07/2021 04:46:16 - INFO - __main__ - Step 53615: {'lr': 0.00036437620997999777, 'samples': 10294080, 'steps': 53614, 'loss/train': 0.9690789580345154} 11/07/2021 04:46:17 - INFO - __main__ - Step 53616: {'lr': 0.0003643714911593151, 'samples': 10294272, 'steps': 53615, 'loss/train': 1.2599529027938843} 11/07/2021 04:46:18 - INFO - __main__ - Step 53617: {'lr': 0.00036436677228709845, 'samples': 10294464, 'steps': 53616, 'loss/train': 1.177477240562439} 11/07/2021 04:46:18 - INFO - __main__ - Step 53618: {'lr': 0.00036436205336334995, 'samples': 10294656, 'steps': 53617, 'loss/train': 1.4854182004928589} 11/07/2021 04:46:18 - INFO - __main__ - Step 53619: {'lr': 0.0003643573343880716, 'samples': 10294848, 'steps': 53618, 'loss/train': 1.0214343070983887} 11/07/2021 04:46:19 - INFO - __main__ - Step 53620: {'lr': 0.00036435261536126566, 'samples': 10295040, 'steps': 53619, 'loss/train': 1.2438859939575195} 11/07/2021 04:46:19 - INFO - __main__ - Step 53621: {'lr': 0.0003643478962829342, 'samples': 10295232, 'steps': 53620, 'loss/train': 1.707126498222351} 11/07/2021 04:46:20 - INFO - __main__ - Step 53622: {'lr': 0.0003643431771530793, 'samples': 10295424, 'steps': 53621, 'loss/train': 1.0260186195373535} 11/07/2021 04:46:20 - INFO - __main__ - Step 53623: {'lr': 0.0003643384579717031, 'samples': 10295616, 'steps': 53622, 'loss/train': 1.2955429553985596} 11/07/2021 04:46:21 - INFO - __main__ - Step 53624: {'lr': 0.0003643337387388078, 'samples': 10295808, 'steps': 53623, 'loss/train': 1.1433353424072266} 11/07/2021 04:46:21 - INFO - __main__ - Step 53625: {'lr': 0.00036432901945439544, 'samples': 10296000, 'steps': 53624, 'loss/train': 1.1356937885284424} 11/07/2021 04:46:22 - INFO - __main__ - Step 53626: {'lr': 0.0003643243001184683, 'samples': 10296192, 'steps': 53625, 'loss/train': 1.1286875009536743} 11/07/2021 04:46:23 - INFO - __main__ - Step 53627: {'lr': 0.00036431958073102825, 'samples': 10296384, 'steps': 53626, 'loss/train': 1.6646336317062378} 11/07/2021 04:46:23 - INFO - __main__ - Step 53628: {'lr': 0.00036431486129207767, 'samples': 10296576, 'steps': 53627, 'loss/train': 1.510939598083496} 11/07/2021 04:46:23 - INFO - __main__ - Step 53629: {'lr': 0.00036431014180161853, 'samples': 10296768, 'steps': 53628, 'loss/train': 1.5132286548614502} 11/07/2021 04:46:24 - INFO - __main__ - Step 53630: {'lr': 0.000364305422259653, 'samples': 10296960, 'steps': 53629, 'loss/train': 1.8390154838562012} 11/07/2021 04:46:24 - INFO - __main__ - Step 53631: {'lr': 0.0003643007026661832, 'samples': 10297152, 'steps': 53630, 'loss/train': 1.5268847942352295} 11/07/2021 04:46:24 - INFO - __main__ - Step 53632: {'lr': 0.0003642959830212113, 'samples': 10297344, 'steps': 53631, 'loss/train': 0.9875638484954834} 11/07/2021 04:46:26 - INFO - __main__ - Step 53633: {'lr': 0.0003642912633247394, 'samples': 10297536, 'steps': 53632, 'loss/train': 1.898142695426941} 11/07/2021 04:46:26 - INFO - __main__ - Step 53634: {'lr': 0.0003642865435767696, 'samples': 10297728, 'steps': 53633, 'loss/train': 2.0873570442199707} 11/07/2021 04:46:26 - INFO - __main__ - Step 53635: {'lr': 0.00036428182377730407, 'samples': 10297920, 'steps': 53634, 'loss/train': 1.409475564956665} 11/07/2021 04:46:27 - INFO - __main__ - Step 53636: {'lr': 0.00036427710392634483, 'samples': 10298112, 'steps': 53635, 'loss/train': 1.2655452489852905} 11/07/2021 04:46:27 - INFO - __main__ - Step 53637: {'lr': 0.0003642723840238942, 'samples': 10298304, 'steps': 53636, 'loss/train': 1.5383204221725464} 11/07/2021 04:46:28 - INFO - __main__ - Step 53638: {'lr': 0.0003642676640699542, 'samples': 10298496, 'steps': 53637, 'loss/train': 1.6451209783554077} 11/07/2021 04:46:28 - INFO - __main__ - Step 53639: {'lr': 0.0003642629440645269, 'samples': 10298688, 'steps': 53638, 'loss/train': 1.602700114250183} 11/07/2021 04:46:29 - INFO - __main__ - Step 53640: {'lr': 0.00036425822400761444, 'samples': 10298880, 'steps': 53639, 'loss/train': 1.311402440071106} 11/07/2021 04:46:29 - INFO - __main__ - Step 53641: {'lr': 0.000364253503899219, 'samples': 10299072, 'steps': 53640, 'loss/train': 1.4763884544372559} 11/07/2021 04:46:29 - INFO - __main__ - Step 53642: {'lr': 0.00036424878373934275, 'samples': 10299264, 'steps': 53641, 'loss/train': 1.4716440439224243} 11/07/2021 04:46:30 - INFO - __main__ - Step 53643: {'lr': 0.0003642440635279877, 'samples': 10299456, 'steps': 53642, 'loss/train': 1.5461177825927734} 11/07/2021 04:46:31 - INFO - __main__ - Step 53644: {'lr': 0.0003642393432651561, 'samples': 10299648, 'steps': 53643, 'loss/train': 1.2750368118286133} 11/07/2021 04:46:31 - INFO - __main__ - Step 53645: {'lr': 0.00036423462295085, 'samples': 10299840, 'steps': 53644, 'loss/train': 2.0808663368225098} 11/07/2021 04:46:31 - INFO - __main__ - Step 53646: {'lr': 0.00036422990258507155, 'samples': 10300032, 'steps': 53645, 'loss/train': 1.3094879388809204} 11/07/2021 04:46:32 - INFO - __main__ - Step 53647: {'lr': 0.00036422518216782285, 'samples': 10300224, 'steps': 53646, 'loss/train': 1.192118525505066} 11/07/2021 04:46:33 - INFO - __main__ - Step 53648: {'lr': 0.00036422046169910604, 'samples': 10300416, 'steps': 53647, 'loss/train': 1.42933189868927} 11/07/2021 04:46:33 - INFO - __main__ - Step 53649: {'lr': 0.00036421574117892323, 'samples': 10300608, 'steps': 53648, 'loss/train': 0.10659755021333694} 11/07/2021 04:46:34 - INFO - __main__ - Step 53650: {'lr': 0.0003642110206072766, 'samples': 10300800, 'steps': 53649, 'loss/train': 1.4478704929351807} 11/07/2021 04:46:34 - INFO - __main__ - Step 53651: {'lr': 0.0003642062999841682, 'samples': 10300992, 'steps': 53650, 'loss/train': 1.446449637413025} 11/07/2021 04:46:34 - INFO - __main__ - Step 53652: {'lr': 0.00036420157930960027, 'samples': 10301184, 'steps': 53651, 'loss/train': 1.577466368675232} 11/07/2021 04:46:36 - INFO - __main__ - Step 53653: {'lr': 0.00036419685858357485, 'samples': 10301376, 'steps': 53652, 'loss/train': 1.2220356464385986} 11/07/2021 04:46:36 - INFO - __main__ - Step 53654: {'lr': 0.0003641921378060941, 'samples': 10301568, 'steps': 53653, 'loss/train': 0.8797346949577332} 11/07/2021 04:46:36 - INFO - __main__ - Step 53655: {'lr': 0.00036418741697716013, 'samples': 10301760, 'steps': 53654, 'loss/train': 1.463127851486206} 11/07/2021 04:46:37 - INFO - __main__ - Step 53656: {'lr': 0.00036418269609677506, 'samples': 10301952, 'steps': 53655, 'loss/train': 1.4317694902420044} 11/07/2021 04:46:37 - INFO - __main__ - Step 53657: {'lr': 0.000364177975164941, 'samples': 10302144, 'steps': 53656, 'loss/train': 1.1459680795669556} 11/07/2021 04:46:38 - INFO - __main__ - Step 53658: {'lr': 0.0003641732541816601, 'samples': 10302336, 'steps': 53657, 'loss/train': 1.4241100549697876} 11/07/2021 04:46:38 - INFO - __main__ - Step 53659: {'lr': 0.0003641685331469346, 'samples': 10302528, 'steps': 53658, 'loss/train': 1.5625414848327637} 11/07/2021 04:46:39 - INFO - __main__ - Step 53660: {'lr': 0.0003641638120607665, 'samples': 10302720, 'steps': 53659, 'loss/train': 1.0930854082107544} 11/07/2021 04:46:39 - INFO - __main__ - Step 53661: {'lr': 0.00036415909092315786, 'samples': 10302912, 'steps': 53660, 'loss/train': 1.5359793901443481} 11/07/2021 04:46:39 - INFO - __main__ - Step 53662: {'lr': 0.00036415436973411095, 'samples': 10303104, 'steps': 53661, 'loss/train': 1.6880676746368408} 11/07/2021 04:46:40 - INFO - __main__ - Step 53663: {'lr': 0.0003641496484936278, 'samples': 10303296, 'steps': 53662, 'loss/train': 1.3289389610290527} 11/07/2021 04:46:41 - INFO - __main__ - Step 53664: {'lr': 0.0003641449272017106, 'samples': 10303488, 'steps': 53663, 'loss/train': 1.5129036903381348} 11/07/2021 04:46:41 - INFO - __main__ - Step 53665: {'lr': 0.00036414020585836144, 'samples': 10303680, 'steps': 53664, 'loss/train': 1.5871593952178955} 11/07/2021 04:46:41 - INFO - __main__ - Step 53666: {'lr': 0.00036413548446358255, 'samples': 10303872, 'steps': 53665, 'loss/train': 1.5378220081329346} 11/07/2021 04:46:42 - INFO - __main__ - Step 53667: {'lr': 0.0003641307630173759, 'samples': 10304064, 'steps': 53666, 'loss/train': 1.207553744316101} 11/07/2021 04:46:43 - INFO - __main__ - Step 53668: {'lr': 0.0003641260415197437, 'samples': 10304256, 'steps': 53667, 'loss/train': 0.4405873119831085} 11/07/2021 04:46:43 - INFO - __main__ - Step 53669: {'lr': 0.0003641213199706881, 'samples': 10304448, 'steps': 53668, 'loss/train': 1.459364414215088} 11/07/2021 04:46:43 - INFO - __main__ - Step 53670: {'lr': 0.0003641165983702111, 'samples': 10304640, 'steps': 53669, 'loss/train': 1.4419173002243042} 11/07/2021 04:46:44 - INFO - __main__ - Step 53671: {'lr': 0.000364111876718315, 'samples': 10304832, 'steps': 53670, 'loss/train': 1.442563533782959} 11/07/2021 04:46:44 - INFO - __main__ - Step 53672: {'lr': 0.0003641071550150019, 'samples': 10305024, 'steps': 53671, 'loss/train': 1.708547592163086} 11/07/2021 04:46:45 - INFO - __main__ - Step 53673: {'lr': 0.00036410243326027373, 'samples': 10305216, 'steps': 53672, 'loss/train': 1.2650524377822876} 11/07/2021 04:46:46 - INFO - __main__ - Step 53674: {'lr': 0.0003640977114541328, 'samples': 10305408, 'steps': 53673, 'loss/train': 1.3750654458999634} 11/07/2021 04:46:46 - INFO - __main__ - Step 53675: {'lr': 0.0003640929895965813, 'samples': 10305600, 'steps': 53674, 'loss/train': 1.3330191373825073} 11/07/2021 04:46:46 - INFO - __main__ - Step 53676: {'lr': 0.0003640882676876212, 'samples': 10305792, 'steps': 53675, 'loss/train': 1.2945555448532104} 11/07/2021 04:46:47 - INFO - __main__ - Step 53677: {'lr': 0.0003640835457272547, 'samples': 10305984, 'steps': 53676, 'loss/train': 0.9961517453193665} 11/07/2021 04:46:47 - INFO - __main__ - Step 53678: {'lr': 0.00036407882371548394, 'samples': 10306176, 'steps': 53677, 'loss/train': 1.57715904712677} 11/07/2021 04:46:48 - INFO - __main__ - Step 53679: {'lr': 0.00036407410165231096, 'samples': 10306368, 'steps': 53678, 'loss/train': 1.3340072631835938} 11/07/2021 04:46:48 - INFO - __main__ - Step 53680: {'lr': 0.000364069379537738, 'samples': 10306560, 'steps': 53679, 'loss/train': 1.40402352809906} 11/07/2021 04:46:49 - INFO - __main__ - Step 53681: {'lr': 0.0003640646573717671, 'samples': 10306752, 'steps': 53680, 'loss/train': 1.870320200920105} 11/07/2021 04:46:49 - INFO - __main__ - Step 53682: {'lr': 0.00036405993515440044, 'samples': 10306944, 'steps': 53681, 'loss/train': 1.4479228258132935} 11/07/2021 04:46:49 - INFO - __main__ - Step 53683: {'lr': 0.0003640552128856401, 'samples': 10307136, 'steps': 53682, 'loss/train': 1.7776511907577515} 11/07/2021 04:46:50 - INFO - __main__ - Step 53684: {'lr': 0.00036405049056548834, 'samples': 10307328, 'steps': 53683, 'loss/train': 1.0751904249191284} 11/07/2021 04:46:51 - INFO - __main__ - Step 53685: {'lr': 0.0003640457681939471, 'samples': 10307520, 'steps': 53684, 'loss/train': 1.1244386434555054} 11/07/2021 04:46:51 - INFO - __main__ - Step 53686: {'lr': 0.0003640410457710186, 'samples': 10307712, 'steps': 53685, 'loss/train': 1.837347149848938} 11/07/2021 04:46:51 - INFO - __main__ - Step 53687: {'lr': 0.000364036323296705, 'samples': 10307904, 'steps': 53686, 'loss/train': 1.4676305055618286} 11/07/2021 04:46:52 - INFO - __main__ - Step 53688: {'lr': 0.0003640316007710084, 'samples': 10308096, 'steps': 53687, 'loss/train': 1.7935171127319336} 11/07/2021 04:46:53 - INFO - __main__ - Step 53689: {'lr': 0.0003640268781939309, 'samples': 10308288, 'steps': 53688, 'loss/train': 0.6592240333557129} 11/07/2021 04:46:53 - INFO - __main__ - Step 53690: {'lr': 0.0003640221555654747, 'samples': 10308480, 'steps': 53689, 'loss/train': 1.243087887763977} 11/07/2021 04:46:53 - INFO - __main__ - Step 53691: {'lr': 0.0003640174328856418, 'samples': 10308672, 'steps': 53690, 'loss/train': 1.3240450620651245} 11/07/2021 04:46:54 - INFO - __main__ - Step 53692: {'lr': 0.0003640127101544344, 'samples': 10308864, 'steps': 53691, 'loss/train': 1.3351621627807617} 11/07/2021 04:46:54 - INFO - __main__ - Step 53693: {'lr': 0.00036400798737185465, 'samples': 10309056, 'steps': 53692, 'loss/train': 1.294119954109192} 11/07/2021 04:46:55 - INFO - __main__ - Step 53694: {'lr': 0.0003640032645379047, 'samples': 10309248, 'steps': 53693, 'loss/train': 1.5483777523040771} 11/07/2021 04:46:56 - INFO - __main__ - Step 53695: {'lr': 0.0003639985416525866, 'samples': 10309440, 'steps': 53694, 'loss/train': 1.6617079973220825} 11/07/2021 04:46:56 - INFO - __main__ - Step 53696: {'lr': 0.00036399381871590254, 'samples': 10309632, 'steps': 53695, 'loss/train': 1.4084619283676147} 11/07/2021 04:46:56 - INFO - __main__ - Step 53697: {'lr': 0.0003639890957278546, 'samples': 10309824, 'steps': 53696, 'loss/train': 0.486763060092926} 11/07/2021 04:46:57 - INFO - __main__ - Step 53698: {'lr': 0.0003639843726884449, 'samples': 10310016, 'steps': 53697, 'loss/train': 0.8962081670761108} 11/07/2021 04:46:58 - INFO - __main__ - Step 53699: {'lr': 0.0003639796495976757, 'samples': 10310208, 'steps': 53698, 'loss/train': 1.4068572521209717} 11/07/2021 04:46:58 - INFO - __main__ - Step 53700: {'lr': 0.000363974926455549, 'samples': 10310400, 'steps': 53699, 'loss/train': 1.7828649282455444} 11/07/2021 04:46:58 - INFO - __main__ - Step 53701: {'lr': 0.0003639702032620669, 'samples': 10310592, 'steps': 53700, 'loss/train': 1.5869852304458618} 11/07/2021 04:46:59 - INFO - __main__ - Step 53702: {'lr': 0.00036396548001723164, 'samples': 10310784, 'steps': 53701, 'loss/train': 1.3004422187805176} 11/07/2021 04:46:59 - INFO - __main__ - Step 53703: {'lr': 0.00036396075672104523, 'samples': 10310976, 'steps': 53702, 'loss/train': 0.5950105786323547} 11/07/2021 04:47:00 - INFO - __main__ - Step 53704: {'lr': 0.00036395603337350987, 'samples': 10311168, 'steps': 53703, 'loss/train': 1.9966845512390137} 11/07/2021 04:47:01 - INFO - __main__ - Step 53705: {'lr': 0.0003639513099746277, 'samples': 10311360, 'steps': 53704, 'loss/train': 1.6448525190353394} 11/07/2021 04:47:01 - INFO - __main__ - Step 53706: {'lr': 0.0003639465865244008, 'samples': 10311552, 'steps': 53705, 'loss/train': 1.3442103862762451} 11/07/2021 04:47:01 - INFO - __main__ - Step 53707: {'lr': 0.0003639418630228314, 'samples': 10311744, 'steps': 53706, 'loss/train': 1.686835765838623} 11/07/2021 04:47:02 - INFO - __main__ - Step 53708: {'lr': 0.00036393713946992156, 'samples': 10311936, 'steps': 53707, 'loss/train': 1.5133798122406006} 11/07/2021 04:47:02 - INFO - __main__ - Step 53709: {'lr': 0.0003639324158656733, 'samples': 10312128, 'steps': 53708, 'loss/train': 1.7039297819137573} 11/07/2021 04:47:03 - INFO - __main__ - Step 53710: {'lr': 0.00036392769221008895, 'samples': 10312320, 'steps': 53709, 'loss/train': 1.2365678548812866} 11/07/2021 04:47:03 - INFO - __main__ - Step 53711: {'lr': 0.0003639229685031705, 'samples': 10312512, 'steps': 53710, 'loss/train': 1.77914559841156} 11/07/2021 04:47:04 - INFO - __main__ - Step 53712: {'lr': 0.0003639182447449201, 'samples': 10312704, 'steps': 53711, 'loss/train': 1.595699429512024} 11/07/2021 04:47:04 - INFO - __main__ - Step 53713: {'lr': 0.00036391352093533995, 'samples': 10312896, 'steps': 53712, 'loss/train': 1.299613118171692} 11/07/2021 04:47:04 - INFO - __main__ - Step 53714: {'lr': 0.0003639087970744321, 'samples': 10313088, 'steps': 53713, 'loss/train': 1.5028570890426636} 11/07/2021 04:47:05 - INFO - __main__ - Step 53715: {'lr': 0.00036390407316219865, 'samples': 10313280, 'steps': 53714, 'loss/train': 1.6350452899932861} 11/07/2021 04:47:06 - INFO - __main__ - Step 53716: {'lr': 0.0003638993491986419, 'samples': 10313472, 'steps': 53715, 'loss/train': 1.6072763204574585} 11/07/2021 04:47:06 - INFO - __main__ - Step 53717: {'lr': 0.0003638946251837637, 'samples': 10313664, 'steps': 53716, 'loss/train': 0.8431642651557922} 11/07/2021 04:47:07 - INFO - __main__ - Step 53718: {'lr': 0.0003638899011175664, 'samples': 10313856, 'steps': 53717, 'loss/train': 1.6525471210479736} 11/07/2021 04:47:07 - INFO - __main__ - Step 53719: {'lr': 0.00036388517700005214, 'samples': 10314048, 'steps': 53718, 'loss/train': 1.4922749996185303} 11/07/2021 04:47:08 - INFO - __main__ - Step 53720: {'lr': 0.00036388045283122295, 'samples': 10314240, 'steps': 53719, 'loss/train': 1.281136393547058} 11/07/2021 04:47:08 - INFO - __main__ - Step 53721: {'lr': 0.00036387572861108097, 'samples': 10314432, 'steps': 53720, 'loss/train': 1.217883825302124} 11/07/2021 04:47:09 - INFO - __main__ - Step 53722: {'lr': 0.0003638710043396283, 'samples': 10314624, 'steps': 53721, 'loss/train': 1.5383968353271484} 11/07/2021 04:47:09 - INFO - __main__ - Step 53723: {'lr': 0.0003638662800168672, 'samples': 10314816, 'steps': 53722, 'loss/train': 1.8818771839141846} 11/07/2021 04:47:09 - INFO - __main__ - Step 53724: {'lr': 0.00036386155564279967, 'samples': 10315008, 'steps': 53723, 'loss/train': 2.0879263877868652} 11/07/2021 04:47:10 - INFO - __main__ - Step 53725: {'lr': 0.00036385683121742786, 'samples': 10315200, 'steps': 53724, 'loss/train': 1.3340603113174438} 11/07/2021 04:47:11 - INFO - __main__ - Step 53726: {'lr': 0.00036385210674075394, 'samples': 10315392, 'steps': 53725, 'loss/train': 0.9048414826393127} 11/07/2021 04:47:11 - INFO - __main__ - Step 53727: {'lr': 0.00036384738221278, 'samples': 10315584, 'steps': 53726, 'loss/train': 1.2589433193206787} 11/07/2021 04:47:11 - INFO - __main__ - Step 53728: {'lr': 0.0003638426576335082, 'samples': 10315776, 'steps': 53727, 'loss/train': 1.4779775142669678} 11/07/2021 04:47:12 - INFO - __main__ - Step 53729: {'lr': 0.00036383793300294063, 'samples': 10315968, 'steps': 53728, 'loss/train': 1.1890757083892822} 11/07/2021 04:47:13 - INFO - __main__ - Step 53730: {'lr': 0.00036383320832107945, 'samples': 10316160, 'steps': 53729, 'loss/train': 0.902030885219574} 11/07/2021 04:47:13 - INFO - __main__ - Step 53731: {'lr': 0.0003638284835879268, 'samples': 10316352, 'steps': 53730, 'loss/train': 1.5771640539169312} 11/07/2021 04:47:14 - INFO - __main__ - Step 53732: {'lr': 0.0003638237588034848, 'samples': 10316544, 'steps': 53731, 'loss/train': 1.7153632640838623} 11/07/2021 04:47:14 - INFO - __main__ - Step 53733: {'lr': 0.00036381903396775556, 'samples': 10316736, 'steps': 53732, 'loss/train': 1.4391967058181763} 11/07/2021 04:47:14 - INFO - __main__ - Step 53734: {'lr': 0.00036381430908074126, 'samples': 10316928, 'steps': 53733, 'loss/train': 0.6550378799438477} 11/07/2021 04:47:15 - INFO - __main__ - Step 53735: {'lr': 0.00036380958414244393, 'samples': 10317120, 'steps': 53734, 'loss/train': 1.7250784635543823} 11/07/2021 04:47:16 - INFO - __main__ - Step 53736: {'lr': 0.0003638048591528658, 'samples': 10317312, 'steps': 53735, 'loss/train': 1.365229606628418} 11/07/2021 04:47:16 - INFO - __main__ - Step 53737: {'lr': 0.0003638001341120089, 'samples': 10317504, 'steps': 53736, 'loss/train': 1.5920060873031616} 11/07/2021 04:47:16 - INFO - __main__ - Step 53738: {'lr': 0.00036379540901987546, 'samples': 10317696, 'steps': 53737, 'loss/train': 1.2214041948318481} 11/07/2021 04:47:17 - INFO - __main__ - Step 53739: {'lr': 0.0003637906838764675, 'samples': 10317888, 'steps': 53738, 'loss/train': 1.0287377834320068} 11/07/2021 04:47:18 - INFO - __main__ - Step 53740: {'lr': 0.00036378595868178737, 'samples': 10318080, 'steps': 53739, 'loss/train': 0.5260177254676819} 11/07/2021 04:47:18 - INFO - __main__ - Step 53741: {'lr': 0.00036378123343583694, 'samples': 10318272, 'steps': 53740, 'loss/train': 4.54772424697876} 11/07/2021 04:47:18 - INFO - __main__ - Step 53742: {'lr': 0.0003637765081386184, 'samples': 10318464, 'steps': 53741, 'loss/train': 1.2605921030044556} 11/07/2021 04:47:19 - INFO - __main__ - Step 53743: {'lr': 0.000363771782790134, 'samples': 10318656, 'steps': 53742, 'loss/train': 1.553490400314331} 11/07/2021 04:47:19 - INFO - __main__ - Step 53744: {'lr': 0.0003637670573903857, 'samples': 10318848, 'steps': 53743, 'loss/train': 1.5717657804489136} 11/07/2021 04:47:20 - INFO - __main__ - Step 53745: {'lr': 0.0003637623319393758, 'samples': 10319040, 'steps': 53744, 'loss/train': 1.5152475833892822} 11/07/2021 04:47:21 - INFO - __main__ - Step 53746: {'lr': 0.0003637576064371063, 'samples': 10319232, 'steps': 53745, 'loss/train': 1.27506685256958} 11/07/2021 04:47:21 - INFO - __main__ - Step 53747: {'lr': 0.0003637528808835794, 'samples': 10319424, 'steps': 53746, 'loss/train': 1.5017693042755127} 11/07/2021 04:47:21 - INFO - __main__ - Step 53748: {'lr': 0.00036374815527879725, 'samples': 10319616, 'steps': 53747, 'loss/train': 0.9367703795433044} 11/07/2021 04:47:22 - INFO - __main__ - Step 53749: {'lr': 0.0003637434296227619, 'samples': 10319808, 'steps': 53748, 'loss/train': 1.4065848588943481} 11/07/2021 04:47:22 - INFO - __main__ - Step 53750: {'lr': 0.0003637387039154755, 'samples': 10320000, 'steps': 53749, 'loss/train': 1.6673144102096558} 11/07/2021 04:47:23 - INFO - __main__ - Step 53751: {'lr': 0.0003637339781569402, 'samples': 10320192, 'steps': 53750, 'loss/train': 1.3397016525268555} 11/07/2021 04:47:23 - INFO - __main__ - Step 53752: {'lr': 0.0003637292523471581, 'samples': 10320384, 'steps': 53751, 'loss/train': 1.1209821701049805} 11/07/2021 04:47:24 - INFO - __main__ - Step 53753: {'lr': 0.0003637245264861314, 'samples': 10320576, 'steps': 53752, 'loss/train': 1.556083083152771} 11/07/2021 04:47:24 - INFO - __main__ - Step 53754: {'lr': 0.0003637198005738622, 'samples': 10320768, 'steps': 53753, 'loss/train': 1.0590068101882935} 11/07/2021 04:47:24 - INFO - __main__ - Step 53755: {'lr': 0.0003637150746103526, 'samples': 10320960, 'steps': 53754, 'loss/train': 1.3447222709655762} 11/07/2021 04:47:26 - INFO - __main__ - Step 53756: {'lr': 0.0003637103485956047, 'samples': 10321152, 'steps': 53755, 'loss/train': 1.2610257863998413} 11/07/2021 04:47:26 - INFO - __main__ - Step 53757: {'lr': 0.0003637056225296207, 'samples': 10321344, 'steps': 53756, 'loss/train': 1.4213162660598755} 11/07/2021 04:47:26 - INFO - __main__ - Step 53758: {'lr': 0.00036370089641240264, 'samples': 10321536, 'steps': 53757, 'loss/train': 1.2108180522918701} 11/07/2021 04:47:27 - INFO - __main__ - Step 53759: {'lr': 0.0003636961702439527, 'samples': 10321728, 'steps': 53758, 'loss/train': 2.09029483795166} 11/07/2021 04:47:27 - INFO - __main__ - Step 53760: {'lr': 0.0003636914440242732, 'samples': 10321920, 'steps': 53759, 'loss/train': 0.6936137080192566} 11/07/2021 04:47:28 - INFO - __main__ - Step 53761: {'lr': 0.00036368671775336597, 'samples': 10322112, 'steps': 53760, 'loss/train': 5.76736307144165} 11/07/2021 04:47:28 - INFO - __main__ - Step 53762: {'lr': 0.00036368199143123326, 'samples': 10322304, 'steps': 53761, 'loss/train': 0.5955904126167297} 11/07/2021 04:47:29 - INFO - __main__ - Step 53763: {'lr': 0.0003636772650578772, 'samples': 10322496, 'steps': 53762, 'loss/train': 1.3145180940628052} 11/07/2021 04:47:29 - INFO - __main__ - Step 53764: {'lr': 0.0003636725386332999, 'samples': 10322688, 'steps': 53763, 'loss/train': 1.5261660814285278} 11/07/2021 04:47:30 - INFO - __main__ - Step 53765: {'lr': 0.00036366781215750355, 'samples': 10322880, 'steps': 53764, 'loss/train': 1.5784448385238647} 11/07/2021 04:47:31 - INFO - __main__ - Step 53766: {'lr': 0.0003636630856304902, 'samples': 10323072, 'steps': 53765, 'loss/train': 1.6527149677276611} 11/07/2021 04:47:31 - INFO - __main__ - Step 53767: {'lr': 0.0003636583590522621, 'samples': 10323264, 'steps': 53766, 'loss/train': 1.1597646474838257} 11/07/2021 04:47:31 - INFO - __main__ - Step 53768: {'lr': 0.00036365363242282117, 'samples': 10323456, 'steps': 53767, 'loss/train': 1.369114637374878} 11/07/2021 04:47:32 - INFO - __main__ - Step 53769: {'lr': 0.00036364890574216974, 'samples': 10323648, 'steps': 53768, 'loss/train': 1.6082525253295898} 11/07/2021 04:47:32 - INFO - __main__ - Step 53770: {'lr': 0.0003636441790103098, 'samples': 10323840, 'steps': 53769, 'loss/train': 1.1371461153030396} 11/07/2021 04:47:32 - INFO - __main__ - Step 53771: {'lr': 0.00036363945222724363, 'samples': 10324032, 'steps': 53770, 'loss/train': 1.4053053855895996} 11/07/2021 04:47:33 - INFO - __main__ - Step 53772: {'lr': 0.0003636347253929733, 'samples': 10324224, 'steps': 53771, 'loss/train': 1.359999179840088} 11/07/2021 04:47:34 - INFO - __main__ - Step 53773: {'lr': 0.0003636299985075008, 'samples': 10324416, 'steps': 53772, 'loss/train': 0.18141159415245056} 11/07/2021 04:47:34 - INFO - __main__ - Step 53774: {'lr': 0.00036362527157082845, 'samples': 10324608, 'steps': 53773, 'loss/train': 1.189090371131897} 11/07/2021 04:47:34 - INFO - __main__ - Step 53775: {'lr': 0.00036362054458295836, 'samples': 10324800, 'steps': 53774, 'loss/train': 1.1098508834838867} 11/07/2021 04:47:35 - INFO - __main__ - Step 53776: {'lr': 0.0003636158175438925, 'samples': 10324992, 'steps': 53775, 'loss/train': 1.6424130201339722} 11/07/2021 04:47:36 - INFO - __main__ - Step 53777: {'lr': 0.00036361109045363315, 'samples': 10325184, 'steps': 53776, 'loss/train': 1.2648173570632935} 11/07/2021 04:47:36 - INFO - __main__ - Step 53778: {'lr': 0.0003636063633121824, 'samples': 10325376, 'steps': 53777, 'loss/train': 1.6093236207962036} 11/07/2021 04:47:36 - INFO - __main__ - Step 53779: {'lr': 0.0003636016361195423, 'samples': 10325568, 'steps': 53778, 'loss/train': 1.2295337915420532} 11/07/2021 04:47:37 - INFO - __main__ - Step 53780: {'lr': 0.0003635969088757152, 'samples': 10325760, 'steps': 53779, 'loss/train': 1.4420655965805054} 11/07/2021 04:47:37 - INFO - __main__ - Step 53781: {'lr': 0.000363592181580703, 'samples': 10325952, 'steps': 53780, 'loss/train': 1.3411906957626343} 11/07/2021 04:47:38 - INFO - __main__ - Step 53782: {'lr': 0.00036358745423450793, 'samples': 10326144, 'steps': 53781, 'loss/train': 1.7579644918441772} 11/07/2021 04:47:38 - INFO - __main__ - Step 53783: {'lr': 0.00036358272683713214, 'samples': 10326336, 'steps': 53782, 'loss/train': 1.5507636070251465} 11/07/2021 04:47:39 - INFO - __main__ - Step 53784: {'lr': 0.00036357799938857766, 'samples': 10326528, 'steps': 53783, 'loss/train': 1.4037292003631592} 11/07/2021 04:47:39 - INFO - __main__ - Step 53785: {'lr': 0.0003635732718888467, 'samples': 10326720, 'steps': 53784, 'loss/train': 1.7858861684799194} 11/07/2021 04:47:39 - INFO - __main__ - Step 53786: {'lr': 0.0003635685443379414, 'samples': 10326912, 'steps': 53785, 'loss/train': 1.620208501815796} 11/07/2021 04:47:41 - INFO - __main__ - Step 53787: {'lr': 0.0003635638167358639, 'samples': 10327104, 'steps': 53786, 'loss/train': 1.504921793937683} 11/07/2021 04:47:41 - INFO - __main__ - Step 53788: {'lr': 0.00036355908908261624, 'samples': 10327296, 'steps': 53787, 'loss/train': 1.457025170326233} 11/07/2021 04:47:41 - INFO - __main__ - Step 53789: {'lr': 0.0003635543613782006, 'samples': 10327488, 'steps': 53788, 'loss/train': 1.4828907251358032} 11/07/2021 04:47:42 - INFO - __main__ - Step 53790: {'lr': 0.0003635496336226192, 'samples': 10327680, 'steps': 53789, 'loss/train': 1.8180770874023438} 11/07/2021 04:47:42 - INFO - __main__ - Step 53791: {'lr': 0.00036354490581587396, 'samples': 10327872, 'steps': 53790, 'loss/train': 1.270146131515503} 11/07/2021 04:47:43 - INFO - __main__ - Step 53792: {'lr': 0.0003635401779579672, 'samples': 10328064, 'steps': 53791, 'loss/train': 1.7751363515853882} 11/07/2021 04:47:43 - INFO - __main__ - Step 53793: {'lr': 0.000363535450048901, 'samples': 10328256, 'steps': 53792, 'loss/train': 1.0968623161315918} 11/07/2021 04:47:44 - INFO - __main__ - Step 53794: {'lr': 0.00036353072208867746, 'samples': 10328448, 'steps': 53793, 'loss/train': 1.5342804193496704} 11/07/2021 04:47:44 - INFO - __main__ - Step 53795: {'lr': 0.00036352599407729873, 'samples': 10328640, 'steps': 53794, 'loss/train': 2.254974126815796} 11/07/2021 04:47:44 - INFO - __main__ - Step 53796: {'lr': 0.00036352126601476697, 'samples': 10328832, 'steps': 53795, 'loss/train': 0.7802295088768005} 11/07/2021 04:47:46 - INFO - __main__ - Step 53797: {'lr': 0.0003635165379010842, 'samples': 10329024, 'steps': 53796, 'loss/train': 1.4556673765182495} 11/07/2021 04:47:46 - INFO - __main__ - Step 53798: {'lr': 0.0003635118097362528, 'samples': 10329216, 'steps': 53797, 'loss/train': 1.4433975219726562} 11/07/2021 04:47:46 - INFO - __main__ - Step 53799: {'lr': 0.0003635070815202746, 'samples': 10329408, 'steps': 53798, 'loss/train': 1.3020963668823242} 11/07/2021 04:47:47 - INFO - __main__ - Step 53800: {'lr': 0.0003635023532531518, 'samples': 10329600, 'steps': 53799, 'loss/train': 1.1585911512374878} 11/07/2021 04:47:47 - INFO - __main__ - Step 53801: {'lr': 0.00036349762493488667, 'samples': 10329792, 'steps': 53800, 'loss/train': 1.520574927330017} 11/07/2021 04:47:48 - INFO - __main__ - Step 53802: {'lr': 0.0003634928965654813, 'samples': 10329984, 'steps': 53801, 'loss/train': 1.0751932859420776} 11/07/2021 04:47:48 - INFO - __main__ - Step 53803: {'lr': 0.0003634881681449377, 'samples': 10330176, 'steps': 53802, 'loss/train': 1.605992317199707} 11/07/2021 04:47:49 - INFO - __main__ - Step 53804: {'lr': 0.00036348343967325814, 'samples': 10330368, 'steps': 53803, 'loss/train': 1.3527597188949585} 11/07/2021 04:47:49 - INFO - __main__ - Step 53805: {'lr': 0.00036347871115044466, 'samples': 10330560, 'steps': 53804, 'loss/train': 1.6451653242111206} 11/07/2021 04:47:49 - INFO - __main__ - Step 53806: {'lr': 0.0003634739825764995, 'samples': 10330752, 'steps': 53805, 'loss/train': 1.4473252296447754} 11/07/2021 04:47:50 - INFO - __main__ - Step 53807: {'lr': 0.00036346925395142467, 'samples': 10330944, 'steps': 53806, 'loss/train': 1.5503804683685303} 11/07/2021 04:47:51 - INFO - __main__ - Step 53808: {'lr': 0.00036346452527522233, 'samples': 10331136, 'steps': 53807, 'loss/train': 0.5679416656494141} 11/07/2021 04:47:51 - INFO - __main__ - Step 53809: {'lr': 0.0003634597965478946, 'samples': 10331328, 'steps': 53808, 'loss/train': 1.4964993000030518} 11/07/2021 04:47:51 - INFO - __main__ - Step 53810: {'lr': 0.00036345506776944364, 'samples': 10331520, 'steps': 53809, 'loss/train': 1.9264088869094849} 11/07/2021 04:47:52 - INFO - __main__ - Step 53811: {'lr': 0.00036345033893987164, 'samples': 10331712, 'steps': 53810, 'loss/train': 1.2241166830062866} 11/07/2021 04:47:53 - INFO - __main__ - Step 53812: {'lr': 0.00036344561005918064, 'samples': 10331904, 'steps': 53811, 'loss/train': 1.086397409439087} 11/07/2021 04:47:53 - INFO - __main__ - Step 53813: {'lr': 0.00036344088112737276, 'samples': 10332096, 'steps': 53812, 'loss/train': 1.5006593465805054} 11/07/2021 04:47:53 - INFO - __main__ - Step 53814: {'lr': 0.0003634361521444502, 'samples': 10332288, 'steps': 53813, 'loss/train': 1.4294464588165283} 11/07/2021 04:47:54 - INFO - __main__ - Step 53815: {'lr': 0.00036343142311041503, 'samples': 10332480, 'steps': 53814, 'loss/train': 1.3686741590499878} 11/07/2021 04:47:54 - INFO - __main__ - Step 53816: {'lr': 0.00036342669402526946, 'samples': 10332672, 'steps': 53815, 'loss/train': 1.4084200859069824} 11/07/2021 04:47:55 - INFO - __main__ - Step 53817: {'lr': 0.0003634219648890156, 'samples': 10332864, 'steps': 53816, 'loss/train': 1.3252638578414917} 11/07/2021 04:47:55 - INFO - __main__ - Step 53818: {'lr': 0.00036341723570165545, 'samples': 10333056, 'steps': 53817, 'loss/train': 1.0600765943527222} 11/07/2021 04:47:56 - INFO - __main__ - Step 53819: {'lr': 0.0003634125064631913, 'samples': 10333248, 'steps': 53818, 'loss/train': 1.2755149602890015} 11/07/2021 04:47:56 - INFO - __main__ - Step 53820: {'lr': 0.0003634077771736252, 'samples': 10333440, 'steps': 53819, 'loss/train': 1.4315431118011475} 11/07/2021 04:47:57 - INFO - __main__ - Step 53821: {'lr': 0.00036340304783295937, 'samples': 10333632, 'steps': 53820, 'loss/train': 1.654606819152832} 11/07/2021 04:47:57 - INFO - __main__ - Step 53822: {'lr': 0.0003633983184411958, 'samples': 10333824, 'steps': 53821, 'loss/train': 1.3569787740707397} 11/07/2021 04:47:58 - INFO - __main__ - Step 53823: {'lr': 0.00036339358899833675, 'samples': 10334016, 'steps': 53822, 'loss/train': 1.6146955490112305} 11/07/2021 04:47:59 - INFO - __main__ - Step 53824: {'lr': 0.00036338885950438425, 'samples': 10334208, 'steps': 53823, 'loss/train': 1.7868359088897705} 11/07/2021 04:47:59 - INFO - __main__ - Step 53825: {'lr': 0.00036338412995934056, 'samples': 10334400, 'steps': 53824, 'loss/train': 1.8714745044708252} 11/07/2021 04:47:59 - INFO - __main__ - Step 53826: {'lr': 0.00036337940036320764, 'samples': 10334592, 'steps': 53825, 'loss/train': 1.0929986238479614} 11/07/2021 04:48:00 - INFO - __main__ - Step 53827: {'lr': 0.0003633746707159877, 'samples': 10334784, 'steps': 53826, 'loss/train': 1.3598603010177612} 11/07/2021 04:48:01 - INFO - __main__ - Step 53828: {'lr': 0.00036336994101768304, 'samples': 10334976, 'steps': 53827, 'loss/train': 0.10197296738624573} 11/07/2021 04:48:01 - INFO - __main__ - Step 53829: {'lr': 0.00036336521126829554, 'samples': 10335168, 'steps': 53828, 'loss/train': 0.9619058966636658} 11/07/2021 04:48:01 - INFO - __main__ - Step 53830: {'lr': 0.00036336048146782743, 'samples': 10335360, 'steps': 53829, 'loss/train': 1.5208972692489624} 11/07/2021 04:48:02 - INFO - __main__ - Step 53831: {'lr': 0.00036335575161628076, 'samples': 10335552, 'steps': 53830, 'loss/train': 1.1052848100662231} 11/07/2021 04:48:02 - INFO - __main__ - Step 53832: {'lr': 0.0003633510217136578, 'samples': 10335744, 'steps': 53831, 'loss/train': 1.211521029472351} 11/07/2021 04:48:02 - INFO - __main__ - Step 53833: {'lr': 0.0003633462917599606, 'samples': 10335936, 'steps': 53832, 'loss/train': 1.2743152379989624} 11/07/2021 04:48:04 - INFO - __main__ - Step 53834: {'lr': 0.0003633415617551914, 'samples': 10336128, 'steps': 53833, 'loss/train': 1.2937463521957397} 11/07/2021 04:48:04 - INFO - __main__ - Step 53835: {'lr': 0.0003633368316993521, 'samples': 10336320, 'steps': 53834, 'loss/train': 1.0048315525054932} 11/07/2021 04:48:04 - INFO - __main__ - Step 53836: {'lr': 0.0003633321015924451, 'samples': 10336512, 'steps': 53835, 'loss/train': 1.6738377809524536} 11/07/2021 04:48:05 - INFO - __main__ - Step 53837: {'lr': 0.0003633273714344723, 'samples': 10336704, 'steps': 53836, 'loss/train': 1.8290668725967407} 11/07/2021 04:48:05 - INFO - __main__ - Step 53838: {'lr': 0.00036332264122543594, 'samples': 10336896, 'steps': 53837, 'loss/train': 1.311269998550415} 11/07/2021 04:48:06 - INFO - __main__ - Step 53839: {'lr': 0.00036331791096533815, 'samples': 10337088, 'steps': 53838, 'loss/train': 1.4427428245544434} 11/07/2021 04:48:06 - INFO - __main__ - Step 53840: {'lr': 0.0003633131806541811, 'samples': 10337280, 'steps': 53839, 'loss/train': 1.5543686151504517} 11/07/2021 04:48:07 - INFO - __main__ - Step 53841: {'lr': 0.000363308450291967, 'samples': 10337472, 'steps': 53840, 'loss/train': 1.4297562837600708} 11/07/2021 04:48:07 - INFO - __main__ - Step 53842: {'lr': 0.0003633037198786977, 'samples': 10337664, 'steps': 53841, 'loss/train': 1.35160493850708} 11/07/2021 04:48:07 - INFO - __main__ - Step 53843: {'lr': 0.0003632989894143755, 'samples': 10337856, 'steps': 53842, 'loss/train': 1.9040676355361938} 11/07/2021 04:48:09 - INFO - __main__ - Step 53844: {'lr': 0.0003632942588990025, 'samples': 10338048, 'steps': 53843, 'loss/train': 0.9160776734352112} 11/07/2021 04:48:09 - INFO - __main__ - Step 53845: {'lr': 0.00036328952833258096, 'samples': 10338240, 'steps': 53844, 'loss/train': 1.2757924795150757} 11/07/2021 04:48:09 - INFO - __main__ - Step 53846: {'lr': 0.0003632847977151128, 'samples': 10338432, 'steps': 53845, 'loss/train': 1.3739603757858276} 11/07/2021 04:48:10 - INFO - __main__ - Step 53847: {'lr': 0.0003632800670466003, 'samples': 10338624, 'steps': 53846, 'loss/train': 2.090402126312256} 11/07/2021 04:48:10 - INFO - __main__ - Step 53848: {'lr': 0.0003632753363270456, 'samples': 10338816, 'steps': 53847, 'loss/train': 1.2552708387374878} 11/07/2021 04:48:11 - INFO - __main__ - Step 53849: {'lr': 0.00036327060555645075, 'samples': 10339008, 'steps': 53848, 'loss/train': 2.2328643798828125} 11/07/2021 04:48:11 - INFO - __main__ - Step 53850: {'lr': 0.0003632658747348179, 'samples': 10339200, 'steps': 53849, 'loss/train': 1.481790542602539} 11/07/2021 04:48:12 - INFO - __main__ - Step 53851: {'lr': 0.0003632611438621492, 'samples': 10339392, 'steps': 53850, 'loss/train': 1.4229248762130737} 11/07/2021 04:48:12 - INFO - __main__ - Step 53852: {'lr': 0.00036325641293844674, 'samples': 10339584, 'steps': 53851, 'loss/train': 1.4889165163040161} 11/07/2021 04:48:13 - INFO - __main__ - Step 53853: {'lr': 0.0003632516819637127, 'samples': 10339776, 'steps': 53852, 'loss/train': 1.4895575046539307} 11/07/2021 04:48:14 - INFO - __main__ - Step 53854: {'lr': 0.0003632469509379492, 'samples': 10339968, 'steps': 53853, 'loss/train': 1.564021110534668} 11/07/2021 04:48:14 - INFO - __main__ - Step 53855: {'lr': 0.00036324221986115847, 'samples': 10340160, 'steps': 53854, 'loss/train': 1.301256775856018} 11/07/2021 04:48:14 - INFO - __main__ - Step 53856: {'lr': 0.00036323748873334246, 'samples': 10340352, 'steps': 53855, 'loss/train': 1.5127121210098267} 11/07/2021 04:48:15 - INFO - __main__ - Step 53857: {'lr': 0.00036323275755450335, 'samples': 10340544, 'steps': 53856, 'loss/train': 1.2701239585876465} 11/07/2021 04:48:15 - INFO - __main__ - Step 53858: {'lr': 0.00036322802632464336, 'samples': 10340736, 'steps': 53857, 'loss/train': 1.4404006004333496} 11/07/2021 04:48:16 - INFO - __main__ - Step 53859: {'lr': 0.00036322329504376457, 'samples': 10340928, 'steps': 53858, 'loss/train': 1.1640100479125977} 11/07/2021 04:48:16 - INFO - __main__ - Step 53860: {'lr': 0.0003632185637118691, 'samples': 10341120, 'steps': 53859, 'loss/train': 3.0354626178741455} 11/07/2021 04:48:17 - INFO - __main__ - Step 53861: {'lr': 0.0003632138323289591, 'samples': 10341312, 'steps': 53860, 'loss/train': 1.0776021480560303} 11/07/2021 04:48:17 - INFO - __main__ - Step 53862: {'lr': 0.00036320910089503665, 'samples': 10341504, 'steps': 53861, 'loss/train': 1.5987331867218018} 11/07/2021 04:48:17 - INFO - __main__ - Step 53863: {'lr': 0.00036320436941010396, 'samples': 10341696, 'steps': 53862, 'loss/train': 1.5577402114868164} 11/07/2021 04:48:19 - INFO - __main__ - Step 53864: {'lr': 0.00036319963787416313, 'samples': 10341888, 'steps': 53863, 'loss/train': 1.2340596914291382} 11/07/2021 04:48:19 - INFO - __main__ - Step 53865: {'lr': 0.0003631949062872163, 'samples': 10342080, 'steps': 53864, 'loss/train': 1.2378990650177002} 11/07/2021 04:48:19 - INFO - __main__ - Step 53866: {'lr': 0.0003631901746492656, 'samples': 10342272, 'steps': 53865, 'loss/train': 0.44143298268318176} 11/07/2021 04:48:20 - INFO - __main__ - Step 53867: {'lr': 0.0003631854429603131, 'samples': 10342464, 'steps': 53866, 'loss/train': 1.5051151514053345} 11/07/2021 04:48:20 - INFO - __main__ - Step 53868: {'lr': 0.00036318071122036104, 'samples': 10342656, 'steps': 53867, 'loss/train': 1.5813905000686646} 11/07/2021 04:48:20 - INFO - __main__ - Step 53869: {'lr': 0.0003631759794294115, 'samples': 10342848, 'steps': 53868, 'loss/train': 0.08255495876073837} 11/07/2021 04:48:21 - INFO - __main__ - Step 53870: {'lr': 0.00036317124758746656, 'samples': 10343040, 'steps': 53869, 'loss/train': 1.330836296081543} 11/07/2021 04:48:22 - INFO - __main__ - Step 53871: {'lr': 0.0003631665156945284, 'samples': 10343232, 'steps': 53870, 'loss/train': 1.091623306274414} 11/07/2021 04:48:22 - INFO - __main__ - Step 53872: {'lr': 0.0003631617837505992, 'samples': 10343424, 'steps': 53871, 'loss/train': 1.3825600147247314} 11/07/2021 04:48:22 - INFO - __main__ - Step 53873: {'lr': 0.00036315705175568103, 'samples': 10343616, 'steps': 53872, 'loss/train': 1.1355825662612915} 11/07/2021 04:48:23 - INFO - __main__ - Step 53874: {'lr': 0.000363152319709776, 'samples': 10343808, 'steps': 53873, 'loss/train': 1.1304973363876343} 11/07/2021 04:48:24 - INFO - __main__ - Step 53875: {'lr': 0.00036314758761288643, 'samples': 10344000, 'steps': 53874, 'loss/train': 1.3253768682479858} 11/07/2021 04:48:24 - INFO - __main__ - Step 53876: {'lr': 0.00036314285546501415, 'samples': 10344192, 'steps': 53875, 'loss/train': 1.359163522720337} 11/07/2021 04:48:24 - INFO - __main__ - Step 53877: {'lr': 0.0003631381232661615, 'samples': 10344384, 'steps': 53876, 'loss/train': 1.0204049348831177} 11/07/2021 04:48:25 - INFO - __main__ - Step 53878: {'lr': 0.0003631333910163305, 'samples': 10344576, 'steps': 53877, 'loss/train': 1.375170111656189} 11/07/2021 04:48:25 - INFO - __main__ - Step 53879: {'lr': 0.0003631286587155234, 'samples': 10344768, 'steps': 53878, 'loss/train': 1.1854912042617798} 11/07/2021 04:48:26 - INFO - __main__ - Step 53880: {'lr': 0.00036312392636374225, 'samples': 10344960, 'steps': 53879, 'loss/train': 1.249647855758667} 11/07/2021 04:48:26 - INFO - __main__ - Step 53881: {'lr': 0.00036311919396098927, 'samples': 10345152, 'steps': 53880, 'loss/train': 1.4819236993789673} 11/07/2021 04:48:27 - INFO - __main__ - Step 53882: {'lr': 0.0003631144615072665, 'samples': 10345344, 'steps': 53881, 'loss/train': 1.2886788845062256} 11/07/2021 04:48:27 - INFO - __main__ - Step 53883: {'lr': 0.000363109729002576, 'samples': 10345536, 'steps': 53882, 'loss/train': 1.022611379623413} 11/07/2021 04:48:28 - INFO - __main__ - Step 53884: {'lr': 0.0003631049964469201, 'samples': 10345728, 'steps': 53883, 'loss/train': 0.9048346877098083} 11/07/2021 04:48:29 - INFO - __main__ - Step 53885: {'lr': 0.0003631002638403008, 'samples': 10345920, 'steps': 53884, 'loss/train': 1.0623703002929688} 11/07/2021 04:48:29 - INFO - __main__ - Step 53886: {'lr': 0.0003630955311827202, 'samples': 10346112, 'steps': 53885, 'loss/train': 1.9823561906814575} 11/07/2021 04:48:29 - INFO - __main__ - Step 53887: {'lr': 0.0003630907984741806, 'samples': 10346304, 'steps': 53886, 'loss/train': 1.3129116296768188} 11/07/2021 04:48:30 - INFO - __main__ - Step 53888: {'lr': 0.00036308606571468406, 'samples': 10346496, 'steps': 53887, 'loss/train': 1.1289273500442505} 11/07/2021 04:48:30 - INFO - __main__ - Step 53889: {'lr': 0.00036308133290423257, 'samples': 10346688, 'steps': 53888, 'loss/train': 1.3464230298995972} 11/07/2021 04:48:31 - INFO - __main__ - Step 53890: {'lr': 0.00036307660004282846, 'samples': 10346880, 'steps': 53889, 'loss/train': 2.4567363262176514} 11/07/2021 04:48:31 - INFO - __main__ - Step 53891: {'lr': 0.0003630718671304737, 'samples': 10347072, 'steps': 53890, 'loss/train': 2.03482985496521} 11/07/2021 04:48:32 - INFO - __main__ - Step 53892: {'lr': 0.0003630671341671705, 'samples': 10347264, 'steps': 53891, 'loss/train': 1.603682518005371} 11/07/2021 04:48:32 - INFO - __main__ - Step 53893: {'lr': 0.0003630624011529211, 'samples': 10347456, 'steps': 53892, 'loss/train': 1.567132592201233} 11/07/2021 04:48:33 - INFO - __main__ - Step 53894: {'lr': 0.00036305766808772746, 'samples': 10347648, 'steps': 53893, 'loss/train': 1.824984073638916} 11/07/2021 04:48:34 - INFO - __main__ - Step 53895: {'lr': 0.0003630529349715918, 'samples': 10347840, 'steps': 53894, 'loss/train': 1.2993921041488647} 11/07/2021 04:48:34 - INFO - __main__ - Step 53896: {'lr': 0.0003630482018045163, 'samples': 10348032, 'steps': 53895, 'loss/train': 1.4212656021118164} 11/07/2021 04:48:34 - INFO - __main__ - Step 53897: {'lr': 0.0003630434685865029, 'samples': 10348224, 'steps': 53896, 'loss/train': 1.712717890739441} 11/07/2021 04:48:35 - INFO - __main__ - Step 53898: {'lr': 0.0003630387353175539, 'samples': 10348416, 'steps': 53897, 'loss/train': 1.80906081199646} 11/07/2021 04:48:35 - INFO - __main__ - Step 53899: {'lr': 0.0003630340019976713, 'samples': 10348608, 'steps': 53898, 'loss/train': 1.4683196544647217} 11/07/2021 04:48:35 - INFO - __main__ - Step 53900: {'lr': 0.0003630292686268575, 'samples': 10348800, 'steps': 53899, 'loss/train': 1.5854510068893433} 11/07/2021 04:48:36 - INFO - __main__ - Step 53901: {'lr': 0.00036302453520511437, 'samples': 10348992, 'steps': 53900, 'loss/train': 1.466431736946106} 11/07/2021 04:48:37 - INFO - __main__ - Step 53902: {'lr': 0.0003630198017324441, 'samples': 10349184, 'steps': 53901, 'loss/train': 1.3616070747375488} 11/07/2021 04:48:37 - INFO - __main__ - Step 53903: {'lr': 0.0003630150682088489, 'samples': 10349376, 'steps': 53902, 'loss/train': 1.537035346031189} 11/07/2021 04:48:37 - INFO - __main__ - Step 53904: {'lr': 0.00036301033463433086, 'samples': 10349568, 'steps': 53903, 'loss/train': 1.1756144762039185} 11/07/2021 04:48:38 - INFO - __main__ - Step 53905: {'lr': 0.0003630056010088921, 'samples': 10349760, 'steps': 53904, 'loss/train': 1.52498459815979} 11/07/2021 04:48:39 - INFO - __main__ - Step 53906: {'lr': 0.00036300086733253466, 'samples': 10349952, 'steps': 53905, 'loss/train': 1.091630220413208} 11/07/2021 04:48:39 - INFO - __main__ - Step 53907: {'lr': 0.0003629961336052609, 'samples': 10350144, 'steps': 53906, 'loss/train': 1.2740652561187744} 11/07/2021 04:48:39 - INFO - __main__ - Step 53908: {'lr': 0.0003629913998270728, 'samples': 10350336, 'steps': 53907, 'loss/train': 1.3619333505630493} 11/07/2021 04:48:40 - INFO - __main__ - Step 53909: {'lr': 0.00036298666599797247, 'samples': 10350528, 'steps': 53908, 'loss/train': 1.5594091415405273} 11/07/2021 04:48:40 - INFO - __main__ - Step 53910: {'lr': 0.00036298193211796215, 'samples': 10350720, 'steps': 53909, 'loss/train': 1.852355718612671} 11/07/2021 04:48:41 - INFO - __main__ - Step 53911: {'lr': 0.0003629771981870439, 'samples': 10350912, 'steps': 53910, 'loss/train': 1.3001776933670044} 11/07/2021 04:48:42 - INFO - __main__ - Step 53912: {'lr': 0.0003629724642052198, 'samples': 10351104, 'steps': 53911, 'loss/train': 1.5244321823120117} 11/07/2021 04:48:42 - INFO - __main__ - Step 53913: {'lr': 0.00036296773017249214, 'samples': 10351296, 'steps': 53912, 'loss/train': 1.7219111919403076} 11/07/2021 04:48:42 - INFO - __main__ - Step 53914: {'lr': 0.0003629629960888629, 'samples': 10351488, 'steps': 53913, 'loss/train': 0.431939959526062} 11/07/2021 04:48:43 - INFO - __main__ - Step 53915: {'lr': 0.00036295826195433434, 'samples': 10351680, 'steps': 53914, 'loss/train': 1.7920550107955933} 11/07/2021 04:48:44 - INFO - __main__ - Step 53916: {'lr': 0.0003629535277689085, 'samples': 10351872, 'steps': 53915, 'loss/train': 1.7504496574401855} 11/07/2021 04:48:44 - INFO - __main__ - Step 53917: {'lr': 0.00036294879353258755, 'samples': 10352064, 'steps': 53916, 'loss/train': 1.102777361869812} 11/07/2021 04:48:45 - INFO - __main__ - Step 53918: {'lr': 0.0003629440592453736, 'samples': 10352256, 'steps': 53917, 'loss/train': 1.5030699968338013} 11/07/2021 04:48:45 - INFO - __main__ - Step 53919: {'lr': 0.0003629393249072688, 'samples': 10352448, 'steps': 53918, 'loss/train': 1.0274672508239746} 11/07/2021 04:48:45 - INFO - __main__ - Step 53920: {'lr': 0.00036293459051827526, 'samples': 10352640, 'steps': 53919, 'loss/train': 1.5233278274536133} 11/07/2021 04:48:46 - INFO - __main__ - Step 53921: {'lr': 0.0003629298560783952, 'samples': 10352832, 'steps': 53920, 'loss/train': 1.2541234493255615} 11/07/2021 04:48:47 - INFO - __main__ - Step 53922: {'lr': 0.0003629251215876307, 'samples': 10353024, 'steps': 53921, 'loss/train': 1.6004103422164917} 11/07/2021 04:48:47 - INFO - __main__ - Step 53923: {'lr': 0.0003629203870459838, 'samples': 10353216, 'steps': 53922, 'loss/train': 1.385047435760498} 11/07/2021 04:48:47 - INFO - __main__ - Step 53924: {'lr': 0.00036291565245345677, 'samples': 10353408, 'steps': 53923, 'loss/train': 1.3251805305480957} 11/07/2021 04:48:48 - INFO - __main__ - Step 53925: {'lr': 0.0003629109178100516, 'samples': 10353600, 'steps': 53924, 'loss/train': 1.2210359573364258} 11/07/2021 04:48:48 - INFO - __main__ - Step 53926: {'lr': 0.0003629061831157706, 'samples': 10353792, 'steps': 53925, 'loss/train': 0.8227512836456299} 11/07/2021 04:48:49 - INFO - __main__ - Step 53927: {'lr': 0.00036290144837061586, 'samples': 10353984, 'steps': 53926, 'loss/train': 1.217120885848999} 11/07/2021 04:48:50 - INFO - __main__ - Step 53928: {'lr': 0.00036289671357458937, 'samples': 10354176, 'steps': 53927, 'loss/train': 0.7737321257591248} 11/07/2021 04:48:50 - INFO - __main__ - Step 53929: {'lr': 0.00036289197872769346, 'samples': 10354368, 'steps': 53928, 'loss/train': 1.5767134428024292} 11/07/2021 04:48:50 - INFO - __main__ - Step 53930: {'lr': 0.0003628872438299301, 'samples': 10354560, 'steps': 53929, 'loss/train': 1.3147779703140259} 11/07/2021 04:48:51 - INFO - __main__ - Step 53931: {'lr': 0.0003628825088813015, 'samples': 10354752, 'steps': 53930, 'loss/train': 0.9428694248199463} 11/07/2021 04:48:52 - INFO - __main__ - Step 53932: {'lr': 0.00036287777388180977, 'samples': 10354944, 'steps': 53931, 'loss/train': 1.4914923906326294} 11/07/2021 04:48:52 - INFO - __main__ - Step 53933: {'lr': 0.00036287303883145703, 'samples': 10355136, 'steps': 53932, 'loss/train': 1.3886076211929321} 11/07/2021 04:48:52 - INFO - __main__ - Step 53934: {'lr': 0.00036286830373024546, 'samples': 10355328, 'steps': 53933, 'loss/train': 1.3711997270584106} 11/07/2021 04:48:53 - INFO - __main__ - Step 53935: {'lr': 0.00036286356857817727, 'samples': 10355520, 'steps': 53934, 'loss/train': 1.2949281930923462} 11/07/2021 04:48:53 - INFO - __main__ - Step 53936: {'lr': 0.0003628588333752544, 'samples': 10355712, 'steps': 53935, 'loss/train': 1.4670205116271973} 11/07/2021 04:48:54 - INFO - __main__ - Step 53937: {'lr': 0.0003628540981214791, 'samples': 10355904, 'steps': 53936, 'loss/train': 1.432778239250183} 11/07/2021 04:48:54 - INFO - __main__ - Step 53938: {'lr': 0.00036284936281685354, 'samples': 10356096, 'steps': 53937, 'loss/train': 1.5690598487854004} 11/07/2021 04:48:55 - INFO - __main__ - Step 53939: {'lr': 0.0003628446274613797, 'samples': 10356288, 'steps': 53938, 'loss/train': 1.3823480606079102} 11/07/2021 04:48:55 - INFO - __main__ - Step 53940: {'lr': 0.00036283989205505987, 'samples': 10356480, 'steps': 53939, 'loss/train': 1.4392648935317993} 11/07/2021 04:48:55 - INFO - __main__ - Step 53941: {'lr': 0.00036283515659789615, 'samples': 10356672, 'steps': 53940, 'loss/train': 1.712378978729248} 11/07/2021 04:48:56 - INFO - __main__ - Step 53942: {'lr': 0.0003628304210898906, 'samples': 10356864, 'steps': 53941, 'loss/train': 1.7036980390548706} 11/07/2021 04:48:57 - INFO - __main__ - Step 53943: {'lr': 0.00036282568553104545, 'samples': 10357056, 'steps': 53942, 'loss/train': 1.7509475946426392} 11/07/2021 04:48:57 - INFO - __main__ - Step 53944: {'lr': 0.00036282094992136273, 'samples': 10357248, 'steps': 53943, 'loss/train': 1.8214190006256104} 11/07/2021 04:48:57 - INFO - __main__ - Step 53945: {'lr': 0.00036281621426084465, 'samples': 10357440, 'steps': 53944, 'loss/train': 1.5594524145126343} 11/07/2021 04:48:58 - INFO - __main__ - Step 53946: {'lr': 0.0003628114785494934, 'samples': 10357632, 'steps': 53945, 'loss/train': 1.8242552280426025} 11/07/2021 04:48:59 - INFO - __main__ - Step 53947: {'lr': 0.00036280674278731096, 'samples': 10357824, 'steps': 53946, 'loss/train': 1.0017260313034058} 11/07/2021 04:48:59 - INFO - __main__ - Step 53948: {'lr': 0.00036280200697429957, 'samples': 10358016, 'steps': 53947, 'loss/train': 1.238772988319397} 11/07/2021 04:48:59 - INFO - __main__ - Step 53949: {'lr': 0.00036279727111046127, 'samples': 10358208, 'steps': 53948, 'loss/train': 0.9045762419700623} 11/07/2021 04:49:00 - INFO - __main__ - Step 53950: {'lr': 0.0003627925351957983, 'samples': 10358400, 'steps': 53949, 'loss/train': 1.3031666278839111} 11/07/2021 04:49:00 - INFO - __main__ - Step 53951: {'lr': 0.0003627877992303128, 'samples': 10358592, 'steps': 53950, 'loss/train': 1.427990198135376} 11/07/2021 04:49:01 - INFO - __main__ - Step 53952: {'lr': 0.0003627830632140068, 'samples': 10358784, 'steps': 53951, 'loss/train': 1.4603978395462036} 11/07/2021 04:49:01 - INFO - __main__ - Step 53953: {'lr': 0.0003627783271468825, 'samples': 10358976, 'steps': 53952, 'loss/train': 1.7184420824050903} 11/07/2021 04:49:02 - INFO - __main__ - Step 53954: {'lr': 0.0003627735910289421, 'samples': 10359168, 'steps': 53953, 'loss/train': 1.546921730041504} 11/07/2021 04:49:02 - INFO - __main__ - Step 53955: {'lr': 0.0003627688548601876, 'samples': 10359360, 'steps': 53954, 'loss/train': 1.5725740194320679} 11/07/2021 04:49:03 - INFO - __main__ - Step 53956: {'lr': 0.00036276411864062116, 'samples': 10359552, 'steps': 53955, 'loss/train': 1.5984454154968262} 11/07/2021 04:49:03 - INFO - __main__ - Step 53957: {'lr': 0.00036275938237024505, 'samples': 10359744, 'steps': 53956, 'loss/train': 1.3893181085586548} 11/07/2021 04:49:04 - INFO - __main__ - Step 53958: {'lr': 0.00036275464604906116, 'samples': 10359936, 'steps': 53957, 'loss/train': 1.3340251445770264} 11/07/2021 04:49:04 - INFO - __main__ - Step 53959: {'lr': 0.0003627499096770719, 'samples': 10360128, 'steps': 53958, 'loss/train': 0.9483999609947205} 11/07/2021 04:49:05 - INFO - __main__ - Step 53960: {'lr': 0.0003627451732542791, 'samples': 10360320, 'steps': 53959, 'loss/train': 1.4981751441955566} 11/07/2021 04:49:05 - INFO - __main__ - Step 53961: {'lr': 0.00036274043678068526, 'samples': 10360512, 'steps': 53960, 'loss/train': 1.141883373260498} 11/07/2021 04:49:05 - INFO - __main__ - Step 53962: {'lr': 0.0003627357002562923, 'samples': 10360704, 'steps': 53961, 'loss/train': 1.1795448064804077} 11/07/2021 04:49:06 - INFO - __main__ - Step 53963: {'lr': 0.0003627309636811023, 'samples': 10360896, 'steps': 53962, 'loss/train': 1.356592059135437} 11/07/2021 04:49:07 - INFO - __main__ - Step 53964: {'lr': 0.00036272622705511745, 'samples': 10361088, 'steps': 53963, 'loss/train': 0.8316559195518494} 11/07/2021 04:49:07 - INFO - __main__ - Step 53965: {'lr': 0.0003627214903783399, 'samples': 10361280, 'steps': 53964, 'loss/train': 1.6369526386260986} 11/07/2021 04:49:07 - INFO - __main__ - Step 53966: {'lr': 0.00036271675365077185, 'samples': 10361472, 'steps': 53965, 'loss/train': 1.0900590419769287} 11/07/2021 04:49:08 - INFO - __main__ - Step 53967: {'lr': 0.0003627120168724153, 'samples': 10361664, 'steps': 53966, 'loss/train': 1.1406044960021973} 11/07/2021 04:49:09 - INFO - __main__ - Step 53968: {'lr': 0.00036270728004327246, 'samples': 10361856, 'steps': 53967, 'loss/train': 1.480116367340088} 11/07/2021 04:49:09 - INFO - __main__ - Step 53969: {'lr': 0.0003627025431633455, 'samples': 10362048, 'steps': 53968, 'loss/train': 1.4173986911773682} 11/07/2021 04:49:10 - INFO - __main__ - Step 53970: {'lr': 0.00036269780623263647, 'samples': 10362240, 'steps': 53969, 'loss/train': 1.6775959730148315} 11/07/2021 04:49:10 - INFO - __main__ - Step 53971: {'lr': 0.00036269306925114765, 'samples': 10362432, 'steps': 53970, 'loss/train': 0.3500784933567047} 11/07/2021 04:49:10 - INFO - __main__ - Step 53972: {'lr': 0.000362688332218881, 'samples': 10362624, 'steps': 53971, 'loss/train': 1.574920654296875} 11/07/2021 04:49:11 - INFO - __main__ - Step 53973: {'lr': 0.0003626835951358387, 'samples': 10362816, 'steps': 53972, 'loss/train': 1.0546258687973022} 11/07/2021 04:49:12 - INFO - __main__ - Step 53974: {'lr': 0.00036267885800202296, 'samples': 10363008, 'steps': 53973, 'loss/train': 1.5321637392044067} 11/07/2021 04:49:12 - INFO - __main__ - Step 53975: {'lr': 0.00036267412081743576, 'samples': 10363200, 'steps': 53974, 'loss/train': 1.3582209348678589} 11/07/2021 04:49:12 - INFO - __main__ - Step 53976: {'lr': 0.00036266938358207944, 'samples': 10363392, 'steps': 53975, 'loss/train': 1.6329455375671387} 11/07/2021 04:49:13 - INFO - __main__ - Step 53977: {'lr': 0.0003626646462959561, 'samples': 10363584, 'steps': 53976, 'loss/train': 1.1768256425857544} 11/07/2021 04:49:14 - INFO - __main__ - Step 53978: {'lr': 0.00036265990895906767, 'samples': 10363776, 'steps': 53977, 'loss/train': 1.0313069820404053} 11/07/2021 04:49:14 - INFO - __main__ - Step 53979: {'lr': 0.0003626551715714165, 'samples': 10363968, 'steps': 53978, 'loss/train': 1.4519003629684448} 11/07/2021 04:49:14 - INFO - __main__ - Step 53980: {'lr': 0.00036265043413300456, 'samples': 10364160, 'steps': 53979, 'loss/train': 1.3776179552078247} 11/07/2021 04:49:15 - INFO - __main__ - Step 53981: {'lr': 0.0003626456966438342, 'samples': 10364352, 'steps': 53980, 'loss/train': 0.06659340113401413} 11/07/2021 04:49:15 - INFO - __main__ - Step 53982: {'lr': 0.00036264095910390736, 'samples': 10364544, 'steps': 53981, 'loss/train': 1.3852801322937012} 11/07/2021 04:49:16 - INFO - __main__ - Step 53983: {'lr': 0.0003626362215132263, 'samples': 10364736, 'steps': 53982, 'loss/train': 1.8173325061798096} 11/07/2021 04:49:16 - INFO - __main__ - Step 53984: {'lr': 0.00036263148387179303, 'samples': 10364928, 'steps': 53983, 'loss/train': 1.6196330785751343} 11/07/2021 04:49:17 - INFO - __main__ - Step 53985: {'lr': 0.0003626267461796097, 'samples': 10365120, 'steps': 53984, 'loss/train': 1.427735447883606} 11/07/2021 04:49:17 - INFO - __main__ - Step 53986: {'lr': 0.0003626220084366786, 'samples': 10365312, 'steps': 53985, 'loss/train': 0.9255143404006958} 11/07/2021 04:49:18 - INFO - __main__ - Step 53987: {'lr': 0.0003626172706430017, 'samples': 10365504, 'steps': 53986, 'loss/train': 1.3839311599731445} 11/07/2021 04:49:18 - INFO - __main__ - Step 53988: {'lr': 0.0003626125327985812, 'samples': 10365696, 'steps': 53987, 'loss/train': 1.7437059879302979} 11/07/2021 04:49:19 - INFO - __main__ - Step 53989: {'lr': 0.0003626077949034193, 'samples': 10365888, 'steps': 53988, 'loss/train': 1.4239369630813599} 11/07/2021 04:49:19 - INFO - __main__ - Step 53990: {'lr': 0.000362603056957518, 'samples': 10366080, 'steps': 53989, 'loss/train': 1.8578649759292603} 11/07/2021 04:49:20 - INFO - __main__ - Step 53991: {'lr': 0.0003625983189608795, 'samples': 10366272, 'steps': 53990, 'loss/train': 1.660885214805603} 11/07/2021 04:49:20 - INFO - __main__ - Step 53992: {'lr': 0.00036259358091350597, 'samples': 10366464, 'steps': 53991, 'loss/train': 1.3914047479629517} 11/07/2021 04:49:20 - INFO - __main__ - Step 53993: {'lr': 0.0003625888428153995, 'samples': 10366656, 'steps': 53992, 'loss/train': 1.2779313325881958} 11/07/2021 04:49:21 - INFO - __main__ - Step 53994: {'lr': 0.0003625841046665622, 'samples': 10366848, 'steps': 53993, 'loss/train': 1.4684120416641235} 11/07/2021 04:49:22 - INFO - __main__ - Step 53995: {'lr': 0.00036257936646699626, 'samples': 10367040, 'steps': 53994, 'loss/train': 1.657848834991455} 11/07/2021 04:49:22 - INFO - __main__ - Step 53996: {'lr': 0.00036257462821670387, 'samples': 10367232, 'steps': 53995, 'loss/train': 0.1811019480228424} 11/07/2021 04:49:22 - INFO - __main__ - Step 53997: {'lr': 0.00036256988991568696, 'samples': 10367424, 'steps': 53996, 'loss/train': 0.8272801637649536} 11/07/2021 04:49:23 - INFO - __main__ - Step 53998: {'lr': 0.0003625651515639479, 'samples': 10367616, 'steps': 53997, 'loss/train': 1.1919171810150146} 11/07/2021 04:49:24 - INFO - __main__ - Step 53999: {'lr': 0.00036256041316148864, 'samples': 10367808, 'steps': 53998, 'loss/train': 1.85353684425354} 11/07/2021 04:49:24 - INFO - __main__ - Step 54000: {'lr': 0.0003625556747083114, 'samples': 10368000, 'steps': 53999, 'loss/train': 1.776261568069458} 11/07/2021 04:49:25 - INFO - __main__ - Step 54001: {'lr': 0.0003625509362044183, 'samples': 10368192, 'steps': 54000, 'loss/train': 0.9551630616188049} 11/07/2021 04:49:25 - INFO - __main__ - Step 54002: {'lr': 0.00036254619764981155, 'samples': 10368384, 'steps': 54001, 'loss/train': 2.972165584564209} 11/07/2021 04:49:25 - INFO - __main__ - Step 54003: {'lr': 0.0003625414590444932, 'samples': 10368576, 'steps': 54002, 'loss/train': 1.1933668851852417} 11/07/2021 04:49:26 - INFO - __main__ - Step 54004: {'lr': 0.0003625367203884654, 'samples': 10368768, 'steps': 54003, 'loss/train': 1.2040570974349976} 11/07/2021 04:49:27 - INFO - __main__ - Step 54005: {'lr': 0.0003625319816817303, 'samples': 10368960, 'steps': 54004, 'loss/train': 1.7131401300430298} 11/07/2021 04:49:27 - INFO - __main__ - Step 54006: {'lr': 0.00036252724292429, 'samples': 10369152, 'steps': 54005, 'loss/train': 1.2289642095565796} 11/07/2021 04:49:27 - INFO - __main__ - Step 54007: {'lr': 0.00036252250411614666, 'samples': 10369344, 'steps': 54006, 'loss/train': 2.046152353286743} 11/07/2021 04:49:28 - INFO - __main__ - Step 54008: {'lr': 0.0003625177652573024, 'samples': 10369536, 'steps': 54007, 'loss/train': 1.5472607612609863} 11/07/2021 04:49:28 - INFO - __main__ - Step 54009: {'lr': 0.0003625130263477595, 'samples': 10369728, 'steps': 54008, 'loss/train': 1.8445370197296143} 11/07/2021 04:49:29 - INFO - __main__ - Step 54010: {'lr': 0.00036250828738751986, 'samples': 10369920, 'steps': 54009, 'loss/train': 1.957759976387024} 11/07/2021 04:49:29 - INFO - __main__ - Step 54011: {'lr': 0.0003625035483765857, 'samples': 10370112, 'steps': 54010, 'loss/train': 1.6986782550811768} 11/07/2021 04:49:30 - INFO - __main__ - Step 54012: {'lr': 0.00036249880931495923, 'samples': 10370304, 'steps': 54011, 'loss/train': 1.4239552021026611} 11/07/2021 04:49:30 - INFO - __main__ - Step 54013: {'lr': 0.00036249407020264246, 'samples': 10370496, 'steps': 54012, 'loss/train': 1.3341420888900757} 11/07/2021 04:49:30 - INFO - __main__ - Step 54014: {'lr': 0.00036248933103963767, 'samples': 10370688, 'steps': 54013, 'loss/train': 1.6599293947219849} 11/07/2021 04:49:31 - INFO - __main__ - Step 54015: {'lr': 0.0003624845918259469, 'samples': 10370880, 'steps': 54014, 'loss/train': 1.383053183555603} 11/07/2021 04:49:32 - INFO - __main__ - Step 54016: {'lr': 0.00036247985256157236, 'samples': 10371072, 'steps': 54015, 'loss/train': 1.2755595445632935} 11/07/2021 04:49:32 - INFO - __main__ - Step 54017: {'lr': 0.0003624751132465161, 'samples': 10371264, 'steps': 54016, 'loss/train': 1.4104011058807373} 11/07/2021 04:49:32 - INFO - __main__ - Step 54018: {'lr': 0.00036247037388078017, 'samples': 10371456, 'steps': 54017, 'loss/train': 1.340372085571289} 11/07/2021 04:49:33 - INFO - __main__ - Step 54019: {'lr': 0.00036246563446436697, 'samples': 10371648, 'steps': 54018, 'loss/train': 1.0824288129806519} 11/07/2021 04:49:34 - INFO - __main__ - Step 54020: {'lr': 0.00036246089499727843, 'samples': 10371840, 'steps': 54019, 'loss/train': 1.7775789499282837} 11/07/2021 04:49:34 - INFO - __main__ - Step 54021: {'lr': 0.0003624561554795168, 'samples': 10372032, 'steps': 54020, 'loss/train': 1.7932307720184326} 11/07/2021 04:49:35 - INFO - __main__ - Step 54022: {'lr': 0.0003624514159110841, 'samples': 10372224, 'steps': 54021, 'loss/train': 1.6062067747116089} 11/07/2021 04:49:35 - INFO - __main__ - Step 54023: {'lr': 0.0003624466762919826, 'samples': 10372416, 'steps': 54022, 'loss/train': 1.559800386428833} 11/07/2021 04:49:35 - INFO - __main__ - Step 54024: {'lr': 0.00036244193662221427, 'samples': 10372608, 'steps': 54023, 'loss/train': 1.265794038772583} 11/07/2021 04:49:36 - INFO - __main__ - Step 54025: {'lr': 0.0003624371969017814, 'samples': 10372800, 'steps': 54024, 'loss/train': 1.1560970544815063} 11/07/2021 04:49:37 - INFO - __main__ - Step 54026: {'lr': 0.000362432457130686, 'samples': 10372992, 'steps': 54025, 'loss/train': 1.1742134094238281} 11/07/2021 04:49:37 - INFO - __main__ - Step 54027: {'lr': 0.0003624277173089303, 'samples': 10373184, 'steps': 54026, 'loss/train': 1.1121913194656372} 11/07/2021 04:49:37 - INFO - __main__ - Step 54028: {'lr': 0.0003624229774365165, 'samples': 10373376, 'steps': 54027, 'loss/train': 1.2578948736190796} 11/07/2021 04:49:38 - INFO - __main__ - Step 54029: {'lr': 0.00036241823751344656, 'samples': 10373568, 'steps': 54028, 'loss/train': 1.18523371219635} 11/07/2021 04:49:38 - INFO - __main__ - Step 54030: {'lr': 0.0003624134975397227, 'samples': 10373760, 'steps': 54029, 'loss/train': 0.6954852938652039} 11/07/2021 04:49:39 - INFO - __main__ - Step 54031: {'lr': 0.0003624087575153471, 'samples': 10373952, 'steps': 54030, 'loss/train': 1.4838982820510864} 11/07/2021 04:49:39 - INFO - __main__ - Step 54032: {'lr': 0.00036240401744032174, 'samples': 10374144, 'steps': 54031, 'loss/train': 1.252082109451294} 11/07/2021 04:49:40 - INFO - __main__ - Step 54033: {'lr': 0.00036239927731464896, 'samples': 10374336, 'steps': 54032, 'loss/train': 1.4678068161010742} 11/07/2021 04:49:40 - INFO - __main__ - Step 54034: {'lr': 0.0003623945371383307, 'samples': 10374528, 'steps': 54033, 'loss/train': 1.1606920957565308} 11/07/2021 04:49:40 - INFO - __main__ - Step 54035: {'lr': 0.0003623897969113693, 'samples': 10374720, 'steps': 54034, 'loss/train': 1.449933409690857} 11/07/2021 04:49:42 - INFO - __main__ - Step 54036: {'lr': 0.00036238505663376675, 'samples': 10374912, 'steps': 54035, 'loss/train': 1.7730284929275513} 11/07/2021 04:49:42 - INFO - __main__ - Step 54037: {'lr': 0.00036238031630552527, 'samples': 10375104, 'steps': 54036, 'loss/train': 1.622470736503601} 11/07/2021 04:49:42 - INFO - __main__ - Step 54038: {'lr': 0.0003623755759266469, 'samples': 10375296, 'steps': 54037, 'loss/train': 0.9323673248291016} 11/07/2021 04:49:43 - INFO - __main__ - Step 54039: {'lr': 0.00036237083549713387, 'samples': 10375488, 'steps': 54038, 'loss/train': 1.631394386291504} 11/07/2021 04:49:43 - INFO - __main__ - Step 54040: {'lr': 0.0003623660950169882, 'samples': 10375680, 'steps': 54039, 'loss/train': 1.4027773141860962} 11/07/2021 04:49:44 - INFO - __main__ - Step 54041: {'lr': 0.00036236135448621215, 'samples': 10375872, 'steps': 54040, 'loss/train': 1.392757773399353} 11/07/2021 04:49:44 - INFO - __main__ - Step 54042: {'lr': 0.0003623566139048078, 'samples': 10376064, 'steps': 54041, 'loss/train': 0.9803919196128845} 11/07/2021 04:49:45 - INFO - __main__ - Step 54043: {'lr': 0.00036235187327277735, 'samples': 10376256, 'steps': 54042, 'loss/train': 1.434658169746399} 11/07/2021 04:49:45 - INFO - __main__ - Step 54044: {'lr': 0.0003623471325901228, 'samples': 10376448, 'steps': 54043, 'loss/train': 1.3706212043762207} 11/07/2021 04:49:45 - INFO - __main__ - Step 54045: {'lr': 0.00036234239185684643, 'samples': 10376640, 'steps': 54044, 'loss/train': 1.1884236335754395} 11/07/2021 04:49:46 - INFO - __main__ - Step 54046: {'lr': 0.00036233765107295023, 'samples': 10376832, 'steps': 54045, 'loss/train': 2.0011510848999023} 11/07/2021 04:49:47 - INFO - __main__ - Step 54047: {'lr': 0.00036233291023843653, 'samples': 10377024, 'steps': 54046, 'loss/train': 1.5218291282653809} 11/07/2021 04:49:47 - INFO - __main__ - Step 54048: {'lr': 0.00036232816935330723, 'samples': 10377216, 'steps': 54047, 'loss/train': 1.739459753036499} 11/07/2021 04:49:47 - INFO - __main__ - Step 54049: {'lr': 0.00036232342841756467, 'samples': 10377408, 'steps': 54048, 'loss/train': 1.8697669506072998} 11/07/2021 04:49:48 - INFO - __main__ - Step 54050: {'lr': 0.00036231868743121095, 'samples': 10377600, 'steps': 54049, 'loss/train': 1.3852152824401855} 11/07/2021 04:49:48 - INFO - __main__ - Step 54051: {'lr': 0.0003623139463942481, 'samples': 10377792, 'steps': 54050, 'loss/train': 4.39796257019043} 11/07/2021 04:49:49 - INFO - __main__ - Step 54052: {'lr': 0.0003623092053066783, 'samples': 10377984, 'steps': 54051, 'loss/train': 1.9474323987960815} 11/07/2021 04:49:50 - INFO - __main__ - Step 54053: {'lr': 0.0003623044641685037, 'samples': 10378176, 'steps': 54052, 'loss/train': 1.0721606016159058} 11/07/2021 04:49:50 - INFO - __main__ - Step 54054: {'lr': 0.00036229972297972644, 'samples': 10378368, 'steps': 54053, 'loss/train': 1.2265321016311646} 11/07/2021 04:49:50 - INFO - __main__ - Step 54055: {'lr': 0.00036229498174034867, 'samples': 10378560, 'steps': 54054, 'loss/train': 1.8366349935531616} 11/07/2021 04:49:51 - INFO - __main__ - Step 54056: {'lr': 0.00036229024045037264, 'samples': 10378752, 'steps': 54055, 'loss/train': 1.0183594226837158} 11/07/2021 04:49:52 - INFO - __main__ - Step 54057: {'lr': 0.00036228549910980026, 'samples': 10378944, 'steps': 54056, 'loss/train': 1.1268253326416016} 11/07/2021 04:49:52 - INFO - __main__ - Step 54058: {'lr': 0.0003622807577186337, 'samples': 10379136, 'steps': 54057, 'loss/train': 1.5386079549789429} 11/07/2021 04:49:52 - INFO - __main__ - Step 54059: {'lr': 0.0003622760162768752, 'samples': 10379328, 'steps': 54058, 'loss/train': 1.5613408088684082} 11/07/2021 04:49:53 - INFO - __main__ - Step 54060: {'lr': 0.0003622712747845269, 'samples': 10379520, 'steps': 54059, 'loss/train': 1.7073429822921753} 11/07/2021 04:49:53 - INFO - __main__ - Step 54061: {'lr': 0.0003622665332415909, 'samples': 10379712, 'steps': 54060, 'loss/train': 1.2677898406982422} 11/07/2021 04:49:54 - INFO - __main__ - Step 54062: {'lr': 0.00036226179164806926, 'samples': 10379904, 'steps': 54061, 'loss/train': 0.06435222923755646} 11/07/2021 04:49:55 - INFO - __main__ - Step 54063: {'lr': 0.00036225705000396424, 'samples': 10380096, 'steps': 54062, 'loss/train': 1.371171236038208} 11/07/2021 04:49:55 - INFO - __main__ - Step 54064: {'lr': 0.000362252308309278, 'samples': 10380288, 'steps': 54063, 'loss/train': 1.5303770303726196} 11/07/2021 04:49:55 - INFO - __main__ - Step 54065: {'lr': 0.00036224756656401245, 'samples': 10380480, 'steps': 54064, 'loss/train': 1.3622829914093018} 11/07/2021 04:49:56 - INFO - __main__ - Step 54066: {'lr': 0.0003622428247681699, 'samples': 10380672, 'steps': 54065, 'loss/train': 1.0054705142974854} 11/07/2021 04:49:56 - INFO - __main__ - Step 54067: {'lr': 0.0003622380829217526, 'samples': 10380864, 'steps': 54066, 'loss/train': 1.0937042236328125} 11/07/2021 04:49:57 - INFO - __main__ - Step 54068: {'lr': 0.00036223334102476247, 'samples': 10381056, 'steps': 54067, 'loss/train': 1.4480668306350708} 11/07/2021 04:49:57 - INFO - __main__ - Step 54069: {'lr': 0.00036222859907720167, 'samples': 10381248, 'steps': 54068, 'loss/train': 1.1883262395858765} 11/07/2021 04:49:58 - INFO - __main__ - Step 54070: {'lr': 0.00036222385707907254, 'samples': 10381440, 'steps': 54069, 'loss/train': 1.5054994821548462} 11/07/2021 04:49:58 - INFO - __main__ - Step 54071: {'lr': 0.000362219115030377, 'samples': 10381632, 'steps': 54070, 'loss/train': 1.5895830392837524} 11/07/2021 04:49:58 - INFO - __main__ - Step 54072: {'lr': 0.0003622143729311172, 'samples': 10381824, 'steps': 54071, 'loss/train': 1.5016613006591797} 11/07/2021 04:49:59 - INFO - __main__ - Step 54073: {'lr': 0.00036220963078129536, 'samples': 10382016, 'steps': 54072, 'loss/train': 1.3931150436401367} 11/07/2021 04:50:00 - INFO - __main__ - Step 54074: {'lr': 0.0003622048885809136, 'samples': 10382208, 'steps': 54073, 'loss/train': 1.243634819984436} 11/07/2021 04:50:00 - INFO - __main__ - Step 54075: {'lr': 0.0003622001463299741, 'samples': 10382400, 'steps': 54074, 'loss/train': 1.7613712549209595} 11/07/2021 04:50:00 - INFO - __main__ - Step 54076: {'lr': 0.0003621954040284789, 'samples': 10382592, 'steps': 54075, 'loss/train': 0.41087788343429565} 11/07/2021 04:50:01 - INFO - __main__ - Step 54077: {'lr': 0.00036219066167643015, 'samples': 10382784, 'steps': 54076, 'loss/train': 1.0473359823226929} 11/07/2021 04:50:02 - INFO - __main__ - Step 54078: {'lr': 0.00036218591927383, 'samples': 10382976, 'steps': 54077, 'loss/train': 1.528517723083496} 11/07/2021 04:50:02 - INFO - __main__ - Step 54079: {'lr': 0.00036218117682068076, 'samples': 10383168, 'steps': 54078, 'loss/train': 1.5811829566955566} 11/07/2021 04:50:03 - INFO - __main__ - Step 54080: {'lr': 0.0003621764343169843, 'samples': 10383360, 'steps': 54079, 'loss/train': 1.828255295753479} 11/07/2021 04:50:03 - INFO - __main__ - Step 54081: {'lr': 0.0003621716917627429, 'samples': 10383552, 'steps': 54080, 'loss/train': 1.3414658308029175} 11/07/2021 04:50:03 - INFO - __main__ - Step 54082: {'lr': 0.0003621669491579587, 'samples': 10383744, 'steps': 54081, 'loss/train': 1.5407863855361938} 11/07/2021 04:50:04 - INFO - __main__ - Step 54083: {'lr': 0.0003621622065026337, 'samples': 10383936, 'steps': 54082, 'loss/train': 1.6347789764404297} 11/07/2021 04:50:05 - INFO - __main__ - Step 54084: {'lr': 0.0003621574637967702, 'samples': 10384128, 'steps': 54083, 'loss/train': 4.492691993713379} 11/07/2021 04:50:05 - INFO - __main__ - Step 54085: {'lr': 0.00036215272104037023, 'samples': 10384320, 'steps': 54084, 'loss/train': 1.2677205801010132} 11/07/2021 04:50:05 - INFO - __main__ - Step 54086: {'lr': 0.0003621479782334361, 'samples': 10384512, 'steps': 54085, 'loss/train': 1.4585630893707275} 11/07/2021 04:50:06 - INFO - __main__ - Step 54087: {'lr': 0.00036214323537596974, 'samples': 10384704, 'steps': 54086, 'loss/train': 0.7516101598739624} 11/07/2021 04:50:06 - INFO - __main__ - Step 54088: {'lr': 0.0003621384924679733, 'samples': 10384896, 'steps': 54087, 'loss/train': 2.0621416568756104} 11/07/2021 04:50:07 - INFO - __main__ - Step 54089: {'lr': 0.00036213374950944913, 'samples': 10385088, 'steps': 54088, 'loss/train': 1.4169466495513916} 11/07/2021 04:50:08 - INFO - __main__ - Step 54090: {'lr': 0.0003621290065003991, 'samples': 10385280, 'steps': 54089, 'loss/train': 1.8630374670028687} 11/07/2021 04:50:08 - INFO - __main__ - Step 54091: {'lr': 0.00036212426344082554, 'samples': 10385472, 'steps': 54090, 'loss/train': 0.8615546822547913} 11/07/2021 04:50:08 - INFO - __main__ - Step 54092: {'lr': 0.0003621195203307305, 'samples': 10385664, 'steps': 54091, 'loss/train': 1.5436806678771973} 11/07/2021 04:50:09 - INFO - __main__ - Step 54093: {'lr': 0.0003621147771701161, 'samples': 10385856, 'steps': 54092, 'loss/train': 2.168379306793213} 11/07/2021 04:50:10 - INFO - __main__ - Step 54094: {'lr': 0.00036211003395898456, 'samples': 10386048, 'steps': 54093, 'loss/train': 0.6066447496414185} 11/07/2021 04:50:10 - INFO - __main__ - Step 54095: {'lr': 0.0003621052906973379, 'samples': 10386240, 'steps': 54094, 'loss/train': 1.5131169557571411} 11/07/2021 04:50:10 - INFO - __main__ - Step 54096: {'lr': 0.0003621005473851784, 'samples': 10386432, 'steps': 54095, 'loss/train': 1.8245435953140259} 11/07/2021 04:50:11 - INFO - __main__ - Step 54097: {'lr': 0.0003620958040225081, 'samples': 10386624, 'steps': 54096, 'loss/train': 1.176207184791565} 11/07/2021 04:50:11 - INFO - __main__ - Step 54098: {'lr': 0.0003620910606093292, 'samples': 10386816, 'steps': 54097, 'loss/train': 1.4618706703186035} 11/07/2021 04:50:12 - INFO - __main__ - Step 54099: {'lr': 0.0003620863171456437, 'samples': 10387008, 'steps': 54098, 'loss/train': 1.1126728057861328} 11/07/2021 04:50:12 - INFO - __main__ - Step 54100: {'lr': 0.0003620815736314539, 'samples': 10387200, 'steps': 54099, 'loss/train': 1.6649500131607056} 11/07/2021 04:50:13 - INFO - __main__ - Step 54101: {'lr': 0.0003620768300667618, 'samples': 10387392, 'steps': 54100, 'loss/train': 1.5151457786560059} 11/07/2021 04:50:13 - INFO - __main__ - Step 54102: {'lr': 0.00036207208645156977, 'samples': 10387584, 'steps': 54101, 'loss/train': 1.6389533281326294} 11/07/2021 04:50:13 - INFO - __main__ - Step 54103: {'lr': 0.00036206734278587964, 'samples': 10387776, 'steps': 54102, 'loss/train': 1.6350964307785034} 11/07/2021 04:50:15 - INFO - __main__ - Step 54104: {'lr': 0.0003620625990696937, 'samples': 10387968, 'steps': 54103, 'loss/train': 1.5984524488449097} 11/07/2021 04:50:15 - INFO - __main__ - Step 54105: {'lr': 0.00036205785530301417, 'samples': 10388160, 'steps': 54104, 'loss/train': 1.4757647514343262} 11/07/2021 04:50:15 - INFO - __main__ - Step 54106: {'lr': 0.00036205311148584306, 'samples': 10388352, 'steps': 54105, 'loss/train': 1.5447732210159302} 11/07/2021 04:50:16 - INFO - __main__ - Step 54107: {'lr': 0.00036204836761818255, 'samples': 10388544, 'steps': 54106, 'loss/train': 1.7922970056533813} 11/07/2021 04:50:16 - INFO - __main__ - Step 54108: {'lr': 0.00036204362370003475, 'samples': 10388736, 'steps': 54107, 'loss/train': 1.2513302564620972} 11/07/2021 04:50:17 - INFO - __main__ - Step 54109: {'lr': 0.00036203887973140184, 'samples': 10388928, 'steps': 54108, 'loss/train': 2.252163887023926} 11/07/2021 04:50:17 - INFO - __main__ - Step 54110: {'lr': 0.000362034135712286, 'samples': 10389120, 'steps': 54109, 'loss/train': 1.445444107055664} 11/07/2021 04:50:18 - INFO - __main__ - Step 54111: {'lr': 0.00036202939164268924, 'samples': 10389312, 'steps': 54110, 'loss/train': 1.8695931434631348} 11/07/2021 04:50:18 - INFO - __main__ - Step 54112: {'lr': 0.0003620246475226138, 'samples': 10389504, 'steps': 54111, 'loss/train': 1.2193603515625} 11/07/2021 04:50:18 - INFO - __main__ - Step 54113: {'lr': 0.0003620199033520617, 'samples': 10389696, 'steps': 54112, 'loss/train': 0.8813428282737732} 11/07/2021 04:50:19 - INFO - __main__ - Step 54114: {'lr': 0.0003620151591310352, 'samples': 10389888, 'steps': 54113, 'loss/train': 1.7636504173278809} 11/07/2021 04:50:20 - INFO - __main__ - Step 54115: {'lr': 0.0003620104148595364, 'samples': 10390080, 'steps': 54114, 'loss/train': 1.3986188173294067} 11/07/2021 04:50:20 - INFO - __main__ - Step 54116: {'lr': 0.00036200567053756746, 'samples': 10390272, 'steps': 54115, 'loss/train': 1.3709259033203125} 11/07/2021 04:50:20 - INFO - __main__ - Step 54117: {'lr': 0.0003620009261651305, 'samples': 10390464, 'steps': 54116, 'loss/train': 1.6733927726745605} 11/07/2021 04:50:21 - INFO - __main__ - Step 54118: {'lr': 0.0003619961817422276, 'samples': 10390656, 'steps': 54117, 'loss/train': 1.5279405117034912} 11/07/2021 04:50:21 - INFO - __main__ - Step 54119: {'lr': 0.00036199143726886097, 'samples': 10390848, 'steps': 54118, 'loss/train': 1.7747327089309692} 11/07/2021 04:50:22 - INFO - __main__ - Step 54120: {'lr': 0.00036198669274503274, 'samples': 10391040, 'steps': 54119, 'loss/train': 1.4915437698364258} 11/07/2021 04:50:22 - INFO - __main__ - Step 54121: {'lr': 0.00036198194817074503, 'samples': 10391232, 'steps': 54120, 'loss/train': 1.5646147727966309} 11/07/2021 04:50:23 - INFO - __main__ - Step 54122: {'lr': 0.00036197720354599997, 'samples': 10391424, 'steps': 54121, 'loss/train': 1.1594526767730713} 11/07/2021 04:50:23 - INFO - __main__ - Step 54123: {'lr': 0.0003619724588707997, 'samples': 10391616, 'steps': 54122, 'loss/train': 1.6819772720336914} 11/07/2021 04:50:23 - INFO - __main__ - Step 54124: {'lr': 0.00036196771414514643, 'samples': 10391808, 'steps': 54123, 'loss/train': 1.259159803390503} 11/07/2021 04:50:25 - INFO - __main__ - Step 54125: {'lr': 0.0003619629693690422, 'samples': 10392000, 'steps': 54124, 'loss/train': 1.4479601383209229} 11/07/2021 04:50:25 - INFO - __main__ - Step 54126: {'lr': 0.00036195822454248916, 'samples': 10392192, 'steps': 54125, 'loss/train': 1.284369945526123} 11/07/2021 04:50:25 - INFO - __main__ - Step 54127: {'lr': 0.00036195347966548955, 'samples': 10392384, 'steps': 54126, 'loss/train': 5.417552471160889} 11/07/2021 04:50:26 - INFO - __main__ - Step 54128: {'lr': 0.0003619487347380454, 'samples': 10392576, 'steps': 54127, 'loss/train': 1.50994873046875} 11/07/2021 04:50:26 - INFO - __main__ - Step 54129: {'lr': 0.00036194398976015875, 'samples': 10392768, 'steps': 54128, 'loss/train': 2.001593589782715} 11/07/2021 04:50:27 - INFO - __main__ - Step 54130: {'lr': 0.00036193924473183205, 'samples': 10392960, 'steps': 54129, 'loss/train': 1.5888789892196655} 11/07/2021 04:50:28 - INFO - __main__ - Step 54131: {'lr': 0.00036193449965306714, 'samples': 10393152, 'steps': 54130, 'loss/train': 1.8174561262130737} 11/07/2021 04:50:28 - INFO - __main__ - Step 54132: {'lr': 0.0003619297545238663, 'samples': 10393344, 'steps': 54131, 'loss/train': 1.3752504587173462} 11/07/2021 04:50:28 - INFO - __main__ - Step 54133: {'lr': 0.00036192500934423163, 'samples': 10393536, 'steps': 54132, 'loss/train': 1.7326428890228271} 11/07/2021 04:50:29 - INFO - __main__ - Step 54134: {'lr': 0.0003619202641141652, 'samples': 10393728, 'steps': 54133, 'loss/train': 1.702778935432434} 11/07/2021 04:50:29 - INFO - __main__ - Step 54135: {'lr': 0.00036191551883366937, 'samples': 10393920, 'steps': 54134, 'loss/train': 1.3861414194107056} 11/07/2021 04:50:30 - INFO - __main__ - Step 54136: {'lr': 0.000361910773502746, 'samples': 10394112, 'steps': 54135, 'loss/train': 1.3360629081726074} 11/07/2021 04:50:31 - INFO - __main__ - Step 54137: {'lr': 0.00036190602812139757, 'samples': 10394304, 'steps': 54136, 'loss/train': 1.3992722034454346} 11/07/2021 04:50:31 - INFO - __main__ - Step 54138: {'lr': 0.00036190128268962586, 'samples': 10394496, 'steps': 54137, 'loss/train': 1.0293000936508179} 11/07/2021 04:50:31 - INFO - __main__ - Step 54139: {'lr': 0.00036189653720743317, 'samples': 10394688, 'steps': 54138, 'loss/train': 0.8107431530952454} 11/07/2021 04:50:32 - INFO - __main__ - Step 54140: {'lr': 0.0003618917916748216, 'samples': 10394880, 'steps': 54139, 'loss/train': 1.5401232242584229} 11/07/2021 04:50:33 - INFO - __main__ - Step 54141: {'lr': 0.00036188704609179333, 'samples': 10395072, 'steps': 54140, 'loss/train': 1.5690703392028809} 11/07/2021 04:50:33 - INFO - __main__ - Step 54142: {'lr': 0.00036188230045835053, 'samples': 10395264, 'steps': 54141, 'loss/train': 1.303375482559204} 11/07/2021 04:50:33 - INFO - __main__ - Step 54143: {'lr': 0.00036187755477449525, 'samples': 10395456, 'steps': 54142, 'loss/train': 0.09238816797733307} 11/07/2021 04:50:34 - INFO - __main__ - Step 54144: {'lr': 0.00036187280904022973, 'samples': 10395648, 'steps': 54143, 'loss/train': 1.3750076293945312} 11/07/2021 04:50:34 - INFO - __main__ - Step 54145: {'lr': 0.000361868063255556, 'samples': 10395840, 'steps': 54144, 'loss/train': 1.9195276498794556} 11/07/2021 04:50:35 - INFO - __main__ - Step 54146: {'lr': 0.00036186331742047627, 'samples': 10396032, 'steps': 54145, 'loss/train': 1.367439866065979} 11/07/2021 04:50:36 - INFO - __main__ - Step 54147: {'lr': 0.0003618585715349926, 'samples': 10396224, 'steps': 54146, 'loss/train': 1.5602264404296875} 11/07/2021 04:50:36 - INFO - __main__ - Step 54148: {'lr': 0.00036185382559910723, 'samples': 10396416, 'steps': 54147, 'loss/train': 1.2956527471542358} 11/07/2021 04:50:36 - INFO - __main__ - Step 54149: {'lr': 0.0003618490796128222, 'samples': 10396608, 'steps': 54148, 'loss/train': 1.2464632987976074} 11/07/2021 04:50:37 - INFO - __main__ - Step 54150: {'lr': 0.0003618443335761398, 'samples': 10396800, 'steps': 54149, 'loss/train': 1.4042271375656128} 11/07/2021 04:50:38 - INFO - __main__ - Step 54151: {'lr': 0.00036183958748906204, 'samples': 10396992, 'steps': 54150, 'loss/train': 1.5905922651290894} 11/07/2021 04:50:38 - INFO - __main__ - Step 54152: {'lr': 0.00036183484135159105, 'samples': 10397184, 'steps': 54151, 'loss/train': 1.538983702659607} 11/07/2021 04:50:38 - INFO - __main__ - Step 54153: {'lr': 0.000361830095163729, 'samples': 10397376, 'steps': 54152, 'loss/train': 1.7917529344558716} 11/07/2021 04:50:39 - INFO - __main__ - Step 54154: {'lr': 0.000361825348925478, 'samples': 10397568, 'steps': 54153, 'loss/train': 1.845569133758545} 11/07/2021 04:50:39 - INFO - __main__ - Step 54155: {'lr': 0.0003618206026368403, 'samples': 10397760, 'steps': 54154, 'loss/train': 1.2024149894714355} 11/07/2021 04:50:40 - INFO - __main__ - Step 54156: {'lr': 0.00036181585629781795, 'samples': 10397952, 'steps': 54155, 'loss/train': 1.522216558456421} 11/07/2021 04:50:40 - INFO - __main__ - Step 54157: {'lr': 0.0003618111099084131, 'samples': 10398144, 'steps': 54156, 'loss/train': 1.477552056312561} 11/07/2021 04:50:41 - INFO - __main__ - Step 54158: {'lr': 0.00036180636346862786, 'samples': 10398336, 'steps': 54157, 'loss/train': 1.573291301727295} 11/07/2021 04:50:41 - INFO - __main__ - Step 54159: {'lr': 0.0003618016169784645, 'samples': 10398528, 'steps': 54158, 'loss/train': 1.4481648206710815} 11/07/2021 04:50:42 - INFO - __main__ - Step 54160: {'lr': 0.0003617968704379249, 'samples': 10398720, 'steps': 54159, 'loss/train': 1.1955894231796265} 11/07/2021 04:50:42 - INFO - __main__ - Step 54161: {'lr': 0.0003617921238470114, 'samples': 10398912, 'steps': 54160, 'loss/train': 1.571686863899231} 11/07/2021 04:50:43 - INFO - __main__ - Step 54162: {'lr': 0.00036178737720572615, 'samples': 10399104, 'steps': 54161, 'loss/train': 0.8738097548484802} 11/07/2021 04:50:43 - INFO - __main__ - Step 54163: {'lr': 0.0003617826305140712, 'samples': 10399296, 'steps': 54162, 'loss/train': 0.8634712100028992} 11/07/2021 04:50:43 - INFO - __main__ - Step 54164: {'lr': 0.0003617778837720488, 'samples': 10399488, 'steps': 54163, 'loss/train': 1.8333909511566162} 11/07/2021 04:50:44 - INFO - __main__ - Step 54165: {'lr': 0.00036177313697966087, 'samples': 10399680, 'steps': 54164, 'loss/train': 1.1427308320999146} 11/07/2021 04:50:44 - INFO - __main__ - Step 54166: {'lr': 0.00036176839013690975, 'samples': 10399872, 'steps': 54165, 'loss/train': 0.8946309089660645} 11/07/2021 04:50:45 - INFO - __main__ - Step 54167: {'lr': 0.0003617636432437975, 'samples': 10400064, 'steps': 54166, 'loss/train': 2.0450594425201416} 11/07/2021 04:50:46 - INFO - __main__ - Step 54168: {'lr': 0.00036175889630032633, 'samples': 10400256, 'steps': 54167, 'loss/train': 1.6099635362625122} 11/07/2021 04:50:46 - INFO - __main__ - Step 54169: {'lr': 0.0003617541493064983, 'samples': 10400448, 'steps': 54168, 'loss/train': 3.37135648727417} 11/07/2021 04:50:46 - INFO - __main__ - Step 54170: {'lr': 0.00036174940226231555, 'samples': 10400640, 'steps': 54169, 'loss/train': 1.496845006942749} 11/07/2021 04:50:47 - INFO - __main__ - Step 54171: {'lr': 0.0003617446551677803, 'samples': 10400832, 'steps': 54170, 'loss/train': 1.6073050498962402} 11/07/2021 04:50:48 - INFO - __main__ - Step 54172: {'lr': 0.0003617399080228946, 'samples': 10401024, 'steps': 54171, 'loss/train': 1.785964012145996} 11/07/2021 04:50:48 - INFO - __main__ - Step 54173: {'lr': 0.0003617351608276606, 'samples': 10401216, 'steps': 54172, 'loss/train': 1.5745853185653687} 11/07/2021 04:50:48 - INFO - __main__ - Step 54174: {'lr': 0.00036173041358208047, 'samples': 10401408, 'steps': 54173, 'loss/train': 2.1846258640289307} 11/07/2021 04:50:49 - INFO - __main__ - Step 54175: {'lr': 0.0003617256662861563, 'samples': 10401600, 'steps': 54174, 'loss/train': 1.8046305179595947} 11/07/2021 04:50:49 - INFO - __main__ - Step 54176: {'lr': 0.00036172091893989033, 'samples': 10401792, 'steps': 54175, 'loss/train': 1.360886573791504} 11/07/2021 04:50:50 - INFO - __main__ - Step 54177: {'lr': 0.0003617161715432847, 'samples': 10401984, 'steps': 54176, 'loss/train': 2.502126455307007} 11/07/2021 04:50:50 - INFO - __main__ - Step 54178: {'lr': 0.0003617114240963414, 'samples': 10402176, 'steps': 54177, 'loss/train': 1.4828987121582031} 11/07/2021 04:50:51 - INFO - __main__ - Step 54179: {'lr': 0.00036170667659906263, 'samples': 10402368, 'steps': 54178, 'loss/train': 1.2660387754440308} 11/07/2021 04:50:51 - INFO - __main__ - Step 54180: {'lr': 0.0003617019290514506, 'samples': 10402560, 'steps': 54179, 'loss/train': 1.5110920667648315} 11/07/2021 04:50:51 - INFO - __main__ - Step 54181: {'lr': 0.0003616971814535074, 'samples': 10402752, 'steps': 54180, 'loss/train': 1.2024673223495483} 11/07/2021 04:50:52 - INFO - __main__ - Step 54182: {'lr': 0.0003616924338052352, 'samples': 10402944, 'steps': 54181, 'loss/train': 1.0852046012878418} 11/07/2021 04:50:53 - INFO - __main__ - Step 54183: {'lr': 0.00036168768610663605, 'samples': 10403136, 'steps': 54182, 'loss/train': 1.3982117176055908} 11/07/2021 04:50:53 - INFO - __main__ - Step 54184: {'lr': 0.0003616829383577123, 'samples': 10403328, 'steps': 54183, 'loss/train': 1.0698223114013672} 11/07/2021 04:50:54 - INFO - __main__ - Step 54185: {'lr': 0.00036167819055846575, 'samples': 10403520, 'steps': 54184, 'loss/train': 0.8570522665977478} 11/07/2021 04:50:54 - INFO - __main__ - Step 54186: {'lr': 0.0003616734427088988, 'samples': 10403712, 'steps': 54185, 'loss/train': 0.9845548868179321} 11/07/2021 04:50:55 - INFO - __main__ - Step 54187: {'lr': 0.00036166869480901354, 'samples': 10403904, 'steps': 54186, 'loss/train': 1.6993108987808228} 11/07/2021 04:50:55 - INFO - __main__ - Step 54188: {'lr': 0.0003616639468588121, 'samples': 10404096, 'steps': 54187, 'loss/train': 1.4021564722061157} 11/07/2021 04:50:56 - INFO - __main__ - Step 54189: {'lr': 0.00036165919885829654, 'samples': 10404288, 'steps': 54188, 'loss/train': 0.9608977437019348} 11/07/2021 04:50:56 - INFO - __main__ - Step 54190: {'lr': 0.0003616544508074691, 'samples': 10404480, 'steps': 54189, 'loss/train': 1.4246867895126343} 11/07/2021 04:50:56 - INFO - __main__ - Step 54191: {'lr': 0.00036164970270633195, 'samples': 10404672, 'steps': 54190, 'loss/train': 1.4522274732589722} 11/07/2021 04:50:57 - INFO - __main__ - Step 54192: {'lr': 0.0003616449545548871, 'samples': 10404864, 'steps': 54191, 'loss/train': 1.4007220268249512} 11/07/2021 04:50:58 - INFO - __main__ - Step 54193: {'lr': 0.00036164020635313677, 'samples': 10405056, 'steps': 54192, 'loss/train': 1.5226939916610718} 11/07/2021 04:50:58 - INFO - __main__ - Step 54194: {'lr': 0.0003616354581010831, 'samples': 10405248, 'steps': 54193, 'loss/train': 2.048875570297241} 11/07/2021 04:50:58 - INFO - __main__ - Step 54195: {'lr': 0.0003616307097987282, 'samples': 10405440, 'steps': 54194, 'loss/train': 1.852852702140808} 11/07/2021 04:50:59 - INFO - __main__ - Step 54196: {'lr': 0.00036162596144607425, 'samples': 10405632, 'steps': 54195, 'loss/train': 1.2445474863052368} 11/07/2021 04:50:59 - INFO - __main__ - Step 54197: {'lr': 0.00036162121304312336, 'samples': 10405824, 'steps': 54196, 'loss/train': 1.5783623456954956} 11/07/2021 04:51:00 - INFO - __main__ - Step 54198: {'lr': 0.0003616164645898776, 'samples': 10406016, 'steps': 54197, 'loss/train': 1.7226239442825317} 11/07/2021 04:51:01 - INFO - __main__ - Step 54199: {'lr': 0.0003616117160863393, 'samples': 10406208, 'steps': 54198, 'loss/train': 1.533337116241455} 11/07/2021 04:51:01 - INFO - __main__ - Step 54200: {'lr': 0.00036160696753251043, 'samples': 10406400, 'steps': 54199, 'loss/train': 0.7449395656585693} 11/07/2021 04:51:01 - INFO - __main__ - Step 54201: {'lr': 0.0003616022189283932, 'samples': 10406592, 'steps': 54200, 'loss/train': 1.16170072555542} 11/07/2021 04:51:02 - INFO - __main__ - Step 54202: {'lr': 0.00036159747027398963, 'samples': 10406784, 'steps': 54201, 'loss/train': 0.32056573033332825} 11/07/2021 04:51:03 - INFO - __main__ - Step 54203: {'lr': 0.0003615927215693021, 'samples': 10406976, 'steps': 54202, 'loss/train': 1.3610919713974} 11/07/2021 04:51:03 - INFO - __main__ - Step 54204: {'lr': 0.0003615879728143325, 'samples': 10407168, 'steps': 54203, 'loss/train': 1.335235834121704} 11/07/2021 04:51:03 - INFO - __main__ - Step 54205: {'lr': 0.00036158322400908316, 'samples': 10407360, 'steps': 54204, 'loss/train': 0.9815249443054199} 11/07/2021 04:51:04 - INFO - __main__ - Step 54206: {'lr': 0.00036157847515355614, 'samples': 10407552, 'steps': 54205, 'loss/train': 1.557557225227356} 11/07/2021 04:51:04 - INFO - __main__ - Step 54207: {'lr': 0.0003615737262477535, 'samples': 10407744, 'steps': 54206, 'loss/train': 1.1291550397872925} 11/07/2021 04:51:05 - INFO - __main__ - Step 54208: {'lr': 0.0003615689772916776, 'samples': 10407936, 'steps': 54207, 'loss/train': 1.934542179107666} 11/07/2021 04:51:05 - INFO - __main__ - Step 54209: {'lr': 0.00036156422828533035, 'samples': 10408128, 'steps': 54208, 'loss/train': 1.5808987617492676} 11/07/2021 04:51:06 - INFO - __main__ - Step 54210: {'lr': 0.000361559479228714, 'samples': 10408320, 'steps': 54209, 'loss/train': 1.5406129360198975} 11/07/2021 04:51:06 - INFO - __main__ - Step 54211: {'lr': 0.00036155473012183066, 'samples': 10408512, 'steps': 54210, 'loss/train': 1.6734875440597534} 11/07/2021 04:51:07 - INFO - __main__ - Step 54212: {'lr': 0.00036154998096468244, 'samples': 10408704, 'steps': 54211, 'loss/train': 1.8039016723632812} 11/07/2021 04:51:07 - INFO - __main__ - Step 54213: {'lr': 0.00036154523175727153, 'samples': 10408896, 'steps': 54212, 'loss/train': 1.470895528793335} 11/07/2021 04:51:08 - INFO - __main__ - Step 54214: {'lr': 0.00036154048249960015, 'samples': 10409088, 'steps': 54213, 'loss/train': 2.054375171661377} 11/07/2021 04:51:08 - INFO - __main__ - Step 54215: {'lr': 0.0003615357331916703, 'samples': 10409280, 'steps': 54214, 'loss/train': 2.149806022644043} 11/07/2021 04:51:09 - INFO - __main__ - Step 54216: {'lr': 0.0003615309838334841, 'samples': 10409472, 'steps': 54215, 'loss/train': 1.2030466794967651} 11/07/2021 04:51:09 - INFO - __main__ - Step 54217: {'lr': 0.00036152623442504386, 'samples': 10409664, 'steps': 54216, 'loss/train': 2.1831858158111572} 11/07/2021 04:51:09 - INFO - __main__ - Step 54218: {'lr': 0.0003615214849663516, 'samples': 10409856, 'steps': 54217, 'loss/train': 1.5984482765197754} 11/07/2021 04:51:10 - INFO - __main__ - Step 54219: {'lr': 0.0003615167354574094, 'samples': 10410048, 'steps': 54218, 'loss/train': 0.8072491884231567} 11/07/2021 04:51:11 - INFO - __main__ - Step 54220: {'lr': 0.0003615119858982196, 'samples': 10410240, 'steps': 54219, 'loss/train': 1.3046067953109741} 11/07/2021 04:51:11 - INFO - __main__ - Step 54221: {'lr': 0.0003615072362887841, 'samples': 10410432, 'steps': 54220, 'loss/train': 1.6609251499176025} 11/07/2021 04:51:11 - INFO - __main__ - Step 54222: {'lr': 0.0003615024866291052, 'samples': 10410624, 'steps': 54221, 'loss/train': 1.4929295778274536} 11/07/2021 04:51:12 - INFO - __main__ - Step 54223: {'lr': 0.0003614977369191851, 'samples': 10410816, 'steps': 54222, 'loss/train': 1.5850660800933838} 11/07/2021 04:51:13 - INFO - __main__ - Step 54224: {'lr': 0.00036149298715902573, 'samples': 10411008, 'steps': 54223, 'loss/train': 1.8780813217163086} 11/07/2021 04:51:13 - INFO - __main__ - Step 54225: {'lr': 0.00036148823734862934, 'samples': 10411200, 'steps': 54224, 'loss/train': 1.2997181415557861} 11/07/2021 04:51:14 - INFO - __main__ - Step 54226: {'lr': 0.00036148348748799816, 'samples': 10411392, 'steps': 54225, 'loss/train': 1.6346073150634766} 11/07/2021 04:51:14 - INFO - __main__ - Step 54227: {'lr': 0.00036147873757713417, 'samples': 10411584, 'steps': 54226, 'loss/train': 1.4231724739074707} 11/07/2021 04:51:14 - INFO - __main__ - Step 54228: {'lr': 0.0003614739876160396, 'samples': 10411776, 'steps': 54227, 'loss/train': 1.5670398473739624} 11/07/2021 04:51:15 - INFO - __main__ - Step 54229: {'lr': 0.0003614692376047165, 'samples': 10411968, 'steps': 54228, 'loss/train': 2.5385074615478516} 11/07/2021 04:51:16 - INFO - __main__ - Step 54230: {'lr': 0.00036146448754316717, 'samples': 10412160, 'steps': 54229, 'loss/train': 1.8390408754348755} 11/07/2021 04:51:16 - INFO - __main__ - Step 54231: {'lr': 0.0003614597374313937, 'samples': 10412352, 'steps': 54230, 'loss/train': 1.3580020666122437} 11/07/2021 04:51:16 - INFO - __main__ - Step 54232: {'lr': 0.00036145498726939806, 'samples': 10412544, 'steps': 54231, 'loss/train': 1.5656042098999023} 11/07/2021 04:51:17 - INFO - __main__ - Step 54233: {'lr': 0.0003614502370571826, 'samples': 10412736, 'steps': 54232, 'loss/train': 1.4175289869308472} 11/07/2021 04:51:17 - INFO - __main__ - Step 54234: {'lr': 0.00036144548679474943, 'samples': 10412928, 'steps': 54233, 'loss/train': 1.0778534412384033} 11/07/2021 04:51:18 - INFO - __main__ - Step 54235: {'lr': 0.0003614407364821005, 'samples': 10413120, 'steps': 54234, 'loss/train': 1.378931999206543} 11/07/2021 04:51:18 - INFO - __main__ - Step 54236: {'lr': 0.0003614359861192382, 'samples': 10413312, 'steps': 54235, 'loss/train': 0.9593347311019897} 11/07/2021 04:51:19 - INFO - __main__ - Step 54237: {'lr': 0.00036143123570616455, 'samples': 10413504, 'steps': 54236, 'loss/train': 1.707717776298523} 11/07/2021 04:51:19 - INFO - __main__ - Step 54238: {'lr': 0.0003614264852428817, 'samples': 10413696, 'steps': 54237, 'loss/train': 1.3806053400039673} 11/07/2021 04:51:19 - INFO - __main__ - Step 54239: {'lr': 0.0003614217347293918, 'samples': 10413888, 'steps': 54238, 'loss/train': 1.4995834827423096} 11/07/2021 04:51:20 - INFO - __main__ - Step 54240: {'lr': 0.000361416984165697, 'samples': 10414080, 'steps': 54239, 'loss/train': 1.6374309062957764} 11/07/2021 04:51:21 - INFO - __main__ - Step 54241: {'lr': 0.0003614122335517994, 'samples': 10414272, 'steps': 54240, 'loss/train': 1.4964238405227661} 11/07/2021 04:51:21 - INFO - __main__ - Step 54242: {'lr': 0.0003614074828877012, 'samples': 10414464, 'steps': 54241, 'loss/train': 2.1100287437438965} 11/07/2021 04:51:22 - INFO - __main__ - Step 54243: {'lr': 0.00036140273217340446, 'samples': 10414656, 'steps': 54242, 'loss/train': 0.5982230305671692} 11/07/2021 04:51:22 - INFO - __main__ - Step 54244: {'lr': 0.00036139798140891134, 'samples': 10414848, 'steps': 54243, 'loss/train': 1.047103762626648} 11/07/2021 04:51:23 - INFO - __main__ - Step 54245: {'lr': 0.0003613932305942241, 'samples': 10415040, 'steps': 54244, 'loss/train': 1.2297859191894531} 11/07/2021 04:51:23 - INFO - __main__ - Step 54246: {'lr': 0.00036138847972934477, 'samples': 10415232, 'steps': 54245, 'loss/train': 1.5348211526870728} 11/07/2021 04:51:24 - INFO - __main__ - Step 54247: {'lr': 0.0003613837288142755, 'samples': 10415424, 'steps': 54246, 'loss/train': 1.353988528251648} 11/07/2021 04:51:24 - INFO - __main__ - Step 54248: {'lr': 0.00036137897784901843, 'samples': 10415616, 'steps': 54247, 'loss/train': 1.3973597288131714} 11/07/2021 04:51:24 - INFO - __main__ - Step 54249: {'lr': 0.00036137422683357566, 'samples': 10415808, 'steps': 54248, 'loss/train': 1.4454056024551392} 11/07/2021 04:51:25 - INFO - __main__ - Step 54250: {'lr': 0.00036136947576794945, 'samples': 10416000, 'steps': 54249, 'loss/train': 1.7359259128570557} 11/07/2021 04:51:26 - INFO - __main__ - Step 54251: {'lr': 0.00036136472465214187, 'samples': 10416192, 'steps': 54250, 'loss/train': 1.2222212553024292} 11/07/2021 04:51:26 - INFO - __main__ - Step 54252: {'lr': 0.00036135997348615503, 'samples': 10416384, 'steps': 54251, 'loss/train': 1.7826566696166992} 11/07/2021 04:51:26 - INFO - __main__ - Step 54253: {'lr': 0.00036135522226999115, 'samples': 10416576, 'steps': 54252, 'loss/train': 1.589324951171875} 11/07/2021 04:51:27 - INFO - __main__ - Step 54254: {'lr': 0.00036135047100365223, 'samples': 10416768, 'steps': 54253, 'loss/train': 1.2796744108200073} 11/07/2021 04:51:27 - INFO - __main__ - Step 54255: {'lr': 0.00036134571968714056, 'samples': 10416960, 'steps': 54254, 'loss/train': 1.3641828298568726} 11/07/2021 04:51:28 - INFO - __main__ - Step 54256: {'lr': 0.00036134096832045825, 'samples': 10417152, 'steps': 54255, 'loss/train': 1.4050275087356567} 11/07/2021 04:51:29 - INFO - __main__ - Step 54257: {'lr': 0.0003613362169036074, 'samples': 10417344, 'steps': 54256, 'loss/train': 1.2682961225509644} 11/07/2021 04:51:29 - INFO - __main__ - Step 54258: {'lr': 0.00036133146543659026, 'samples': 10417536, 'steps': 54257, 'loss/train': 1.2486658096313477} 11/07/2021 04:51:29 - INFO - __main__ - Step 54259: {'lr': 0.00036132671391940875, 'samples': 10417728, 'steps': 54258, 'loss/train': 1.6154391765594482} 11/07/2021 04:51:30 - INFO - __main__ - Step 54260: {'lr': 0.0003613219623520652, 'samples': 10417920, 'steps': 54259, 'loss/train': 1.7659212350845337} 11/07/2021 04:51:31 - INFO - __main__ - Step 54261: {'lr': 0.00036131721073456163, 'samples': 10418112, 'steps': 54260, 'loss/train': 1.2127161026000977} 11/07/2021 04:51:31 - INFO - __main__ - Step 54262: {'lr': 0.0003613124590669003, 'samples': 10418304, 'steps': 54261, 'loss/train': 0.9236514568328857} 11/07/2021 04:51:31 - INFO - __main__ - Step 54263: {'lr': 0.0003613077073490832, 'samples': 10418496, 'steps': 54262, 'loss/train': 1.465780258178711} 11/07/2021 04:51:32 - INFO - __main__ - Step 54264: {'lr': 0.0003613029555811127, 'samples': 10418688, 'steps': 54263, 'loss/train': 1.7864091396331787} 11/07/2021 04:51:32 - INFO - __main__ - Step 54265: {'lr': 0.0003612982037629908, 'samples': 10418880, 'steps': 54264, 'loss/train': 1.5634664297103882} 11/07/2021 04:51:33 - INFO - __main__ - Step 54266: {'lr': 0.0003612934518947196, 'samples': 10419072, 'steps': 54265, 'loss/train': 1.3271775245666504} 11/07/2021 04:51:33 - INFO - __main__ - Step 54267: {'lr': 0.00036128869997630134, 'samples': 10419264, 'steps': 54266, 'loss/train': 1.2767351865768433} 11/07/2021 04:51:34 - INFO - __main__ - Step 54268: {'lr': 0.000361283948007738, 'samples': 10419456, 'steps': 54267, 'loss/train': 1.6081736087799072} 11/07/2021 04:51:34 - INFO - __main__ - Step 54269: {'lr': 0.00036127919598903186, 'samples': 10419648, 'steps': 54268, 'loss/train': 1.4866764545440674} 11/07/2021 04:51:35 - INFO - __main__ - Step 54270: {'lr': 0.00036127444392018503, 'samples': 10419840, 'steps': 54269, 'loss/train': 1.5276362895965576} 11/07/2021 04:51:35 - INFO - __main__ - Step 54271: {'lr': 0.00036126969180119977, 'samples': 10420032, 'steps': 54270, 'loss/train': 1.4294250011444092} 11/07/2021 04:51:36 - INFO - __main__ - Step 54272: {'lr': 0.000361264939632078, 'samples': 10420224, 'steps': 54271, 'loss/train': 1.4313125610351562} 11/07/2021 04:51:36 - INFO - __main__ - Step 54273: {'lr': 0.00036126018741282194, 'samples': 10420416, 'steps': 54272, 'loss/train': 1.4946861267089844} 11/07/2021 04:51:37 - INFO - __main__ - Step 54274: {'lr': 0.0003612554351434338, 'samples': 10420608, 'steps': 54273, 'loss/train': 0.9294034838676453} 11/07/2021 04:51:37 - INFO - __main__ - Step 54275: {'lr': 0.0003612506828239157, 'samples': 10420800, 'steps': 54274, 'loss/train': 1.4074270725250244} 11/07/2021 04:51:37 - INFO - __main__ - Step 54276: {'lr': 0.00036124593045426973, 'samples': 10420992, 'steps': 54275, 'loss/train': 1.4687080383300781} 11/07/2021 04:51:38 - INFO - __main__ - Step 54277: {'lr': 0.00036124117803449805, 'samples': 10421184, 'steps': 54276, 'loss/train': 1.4551621675491333} 11/07/2021 04:51:39 - INFO - __main__ - Step 54278: {'lr': 0.00036123642556460284, 'samples': 10421376, 'steps': 54277, 'loss/train': 1.1469041109085083} 11/07/2021 04:51:39 - INFO - __main__ - Step 54279: {'lr': 0.0003612316730445862, 'samples': 10421568, 'steps': 54278, 'loss/train': 1.6708741188049316} 11/07/2021 04:51:39 - INFO - __main__ - Step 54280: {'lr': 0.00036122692047445027, 'samples': 10421760, 'steps': 54279, 'loss/train': 1.777620792388916} 11/07/2021 04:51:40 - INFO - __main__ - Step 54281: {'lr': 0.00036122216785419725, 'samples': 10421952, 'steps': 54280, 'loss/train': 2.0170483589172363} 11/07/2021 04:51:41 - INFO - __main__ - Step 54282: {'lr': 0.00036121741518382915, 'samples': 10422144, 'steps': 54281, 'loss/train': 1.4133511781692505} 11/07/2021 04:51:41 - INFO - __main__ - Step 54283: {'lr': 0.00036121266246334825, 'samples': 10422336, 'steps': 54282, 'loss/train': 1.4715251922607422} 11/07/2021 04:51:42 - INFO - __main__ - Step 54284: {'lr': 0.00036120790969275667, 'samples': 10422528, 'steps': 54283, 'loss/train': 0.9404296278953552} 11/07/2021 04:51:42 - INFO - __main__ - Step 54285: {'lr': 0.0003612031568720565, 'samples': 10422720, 'steps': 54284, 'loss/train': 1.6253358125686646} 11/07/2021 04:51:42 - INFO - __main__ - Step 54286: {'lr': 0.0003611984040012499, 'samples': 10422912, 'steps': 54285, 'loss/train': 1.5697365999221802} 11/07/2021 04:51:43 - INFO - __main__ - Step 54287: {'lr': 0.000361193651080339, 'samples': 10423104, 'steps': 54286, 'loss/train': 1.9644566774368286} 11/07/2021 04:51:44 - INFO - __main__ - Step 54288: {'lr': 0.000361188898109326, 'samples': 10423296, 'steps': 54287, 'loss/train': 1.6080108880996704} 11/07/2021 04:51:44 - INFO - __main__ - Step 54289: {'lr': 0.00036118414508821295, 'samples': 10423488, 'steps': 54288, 'loss/train': 1.6078134775161743} 11/07/2021 04:51:44 - INFO - __main__ - Step 54290: {'lr': 0.0003611793920170021, 'samples': 10423680, 'steps': 54289, 'loss/train': 1.1907614469528198} 11/07/2021 04:51:45 - INFO - __main__ - Step 54291: {'lr': 0.0003611746388956955, 'samples': 10423872, 'steps': 54290, 'loss/train': 1.3729536533355713} 11/07/2021 04:51:46 - INFO - __main__ - Step 54292: {'lr': 0.00036116988572429534, 'samples': 10424064, 'steps': 54291, 'loss/train': 1.1345936059951782} 11/07/2021 04:51:46 - INFO - __main__ - Step 54293: {'lr': 0.0003611651325028037, 'samples': 10424256, 'steps': 54292, 'loss/train': 1.1989638805389404} 11/07/2021 04:51:46 - INFO - __main__ - Step 54294: {'lr': 0.0003611603792312228, 'samples': 10424448, 'steps': 54293, 'loss/train': 1.0696619749069214} 11/07/2021 04:51:47 - INFO - __main__ - Step 54295: {'lr': 0.0003611556259095547, 'samples': 10424640, 'steps': 54294, 'loss/train': 1.2648447751998901} 11/07/2021 04:51:47 - INFO - __main__ - Step 54296: {'lr': 0.00036115087253780164, 'samples': 10424832, 'steps': 54295, 'loss/train': 1.5976979732513428} 11/07/2021 04:51:48 - INFO - __main__ - Step 54297: {'lr': 0.0003611461191159657, 'samples': 10425024, 'steps': 54296, 'loss/train': 1.587814211845398} 11/07/2021 04:51:48 - INFO - __main__ - Step 54298: {'lr': 0.00036114136564404905, 'samples': 10425216, 'steps': 54297, 'loss/train': 1.583405613899231} 11/07/2021 04:51:49 - INFO - __main__ - Step 54299: {'lr': 0.0003611366121220538, 'samples': 10425408, 'steps': 54298, 'loss/train': 1.291266918182373} 11/07/2021 04:51:49 - INFO - __main__ - Step 54300: {'lr': 0.0003611318585499821, 'samples': 10425600, 'steps': 54299, 'loss/train': 1.4985302686691284} 11/07/2021 04:51:49 - INFO - __main__ - Step 54301: {'lr': 0.00036112710492783605, 'samples': 10425792, 'steps': 54300, 'loss/train': 1.4486124515533447} 11/07/2021 04:51:50 - INFO - __main__ - Step 54302: {'lr': 0.0003611223512556179, 'samples': 10425984, 'steps': 54301, 'loss/train': 1.560719609260559} 11/07/2021 04:51:51 - INFO - __main__ - Step 54303: {'lr': 0.0003611175975333297, 'samples': 10426176, 'steps': 54302, 'loss/train': 1.3340426683425903} 11/07/2021 04:51:51 - INFO - __main__ - Step 54304: {'lr': 0.0003611128437609737, 'samples': 10426368, 'steps': 54303, 'loss/train': 1.3235589265823364} 11/07/2021 04:51:51 - INFO - __main__ - Step 54305: {'lr': 0.00036110808993855195, 'samples': 10426560, 'steps': 54304, 'loss/train': 1.4664251804351807} 11/07/2021 04:51:52 - INFO - __main__ - Step 54306: {'lr': 0.0003611033360660666, 'samples': 10426752, 'steps': 54305, 'loss/train': 1.3883143663406372} 11/07/2021 04:51:52 - INFO - __main__ - Step 54307: {'lr': 0.00036109858214351977, 'samples': 10426944, 'steps': 54306, 'loss/train': 1.0334155559539795} 11/07/2021 04:51:53 - INFO - __main__ - Step 54308: {'lr': 0.0003610938281709136, 'samples': 10427136, 'steps': 54307, 'loss/train': 1.3422967195510864} 11/07/2021 04:51:54 - INFO - __main__ - Step 54309: {'lr': 0.0003610890741482503, 'samples': 10427328, 'steps': 54308, 'loss/train': 1.5439521074295044} 11/07/2021 04:51:54 - INFO - __main__ - Step 54310: {'lr': 0.000361084320075532, 'samples': 10427520, 'steps': 54309, 'loss/train': 1.237337350845337} 11/07/2021 04:51:54 - INFO - __main__ - Step 54311: {'lr': 0.00036107956595276083, 'samples': 10427712, 'steps': 54310, 'loss/train': 1.5016213655471802} 11/07/2021 04:51:55 - INFO - __main__ - Step 54312: {'lr': 0.00036107481177993897, 'samples': 10427904, 'steps': 54311, 'loss/train': 0.9651145935058594} 11/07/2021 04:51:56 - INFO - __main__ - Step 54313: {'lr': 0.0003610700575570684, 'samples': 10428096, 'steps': 54312, 'loss/train': 1.3069701194763184} 11/07/2021 04:51:56 - INFO - __main__ - Step 54314: {'lr': 0.00036106530328415136, 'samples': 10428288, 'steps': 54313, 'loss/train': 1.1593788862228394} 11/07/2021 04:51:56 - INFO - __main__ - Step 54315: {'lr': 0.0003610605489611901, 'samples': 10428480, 'steps': 54314, 'loss/train': 1.3796184062957764} 11/07/2021 04:51:57 - INFO - __main__ - Step 54316: {'lr': 0.0003610557945881866, 'samples': 10428672, 'steps': 54315, 'loss/train': 1.698350191116333} 11/07/2021 04:51:57 - INFO - __main__ - Step 54317: {'lr': 0.0003610510401651431, 'samples': 10428864, 'steps': 54316, 'loss/train': 1.696230411529541} 11/07/2021 04:51:58 - INFO - __main__ - Step 54318: {'lr': 0.00036104628569206176, 'samples': 10429056, 'steps': 54317, 'loss/train': 1.3979032039642334} 11/07/2021 04:51:58 - INFO - __main__ - Step 54319: {'lr': 0.00036104153116894465, 'samples': 10429248, 'steps': 54318, 'loss/train': 0.9612535834312439} 11/07/2021 04:51:59 - INFO - __main__ - Step 54320: {'lr': 0.00036103677659579393, 'samples': 10429440, 'steps': 54319, 'loss/train': 1.3281012773513794} 11/07/2021 04:51:59 - INFO - __main__ - Step 54321: {'lr': 0.0003610320219726118, 'samples': 10429632, 'steps': 54320, 'loss/train': 1.3183045387268066} 11/07/2021 04:52:00 - INFO - __main__ - Step 54322: {'lr': 0.00036102726729940026, 'samples': 10429824, 'steps': 54321, 'loss/train': 1.418987512588501} 11/07/2021 04:52:00 - INFO - __main__ - Step 54323: {'lr': 0.0003610225125761616, 'samples': 10430016, 'steps': 54322, 'loss/train': 1.4627366065979004} 11/07/2021 04:52:01 - INFO - __main__ - Step 54324: {'lr': 0.0003610177578028979, 'samples': 10430208, 'steps': 54323, 'loss/train': 1.9382911920547485} 11/07/2021 04:52:01 - INFO - __main__ - Step 54325: {'lr': 0.0003610130029796114, 'samples': 10430400, 'steps': 54324, 'loss/train': 1.7260798215866089} 11/07/2021 04:52:02 - INFO - __main__ - Step 54326: {'lr': 0.000361008248106304, 'samples': 10430592, 'steps': 54325, 'loss/train': 1.382545828819275} 11/07/2021 04:52:02 - INFO - __main__ - Step 54327: {'lr': 0.0003610034931829781, 'samples': 10430784, 'steps': 54326, 'loss/train': 1.1825271844863892} 11/07/2021 04:52:03 - INFO - __main__ - Step 54328: {'lr': 0.0003609987382096357, 'samples': 10430976, 'steps': 54327, 'loss/train': 2.017258882522583} 11/07/2021 04:52:03 - INFO - __main__ - Step 54329: {'lr': 0.00036099398318627896, 'samples': 10431168, 'steps': 54328, 'loss/train': 1.383423089981079} 11/07/2021 04:52:04 - INFO - __main__ - Step 54330: {'lr': 0.00036098922811291, 'samples': 10431360, 'steps': 54329, 'loss/train': 0.9380173683166504} 11/07/2021 04:52:04 - INFO - __main__ - Step 54331: {'lr': 0.00036098447298953107, 'samples': 10431552, 'steps': 54330, 'loss/train': 1.2224643230438232} 11/07/2021 04:52:04 - INFO - __main__ - Step 54332: {'lr': 0.00036097971781614435, 'samples': 10431744, 'steps': 54331, 'loss/train': 1.1786091327667236} 11/07/2021 04:52:05 - INFO - __main__ - Step 54333: {'lr': 0.0003609749625927518, 'samples': 10431936, 'steps': 54332, 'loss/train': 1.3576468229293823} 11/07/2021 04:52:06 - INFO - __main__ - Step 54334: {'lr': 0.0003609702073193556, 'samples': 10432128, 'steps': 54333, 'loss/train': 2.0165226459503174} 11/07/2021 04:52:06 - INFO - __main__ - Step 54335: {'lr': 0.000360965451995958, 'samples': 10432320, 'steps': 54334, 'loss/train': 1.2862190008163452} 11/07/2021 04:52:06 - INFO - __main__ - Step 54336: {'lr': 0.000360960696622561, 'samples': 10432512, 'steps': 54335, 'loss/train': 0.9898625612258911} 11/07/2021 04:52:07 - INFO - __main__ - Step 54337: {'lr': 0.0003609559411991669, 'samples': 10432704, 'steps': 54336, 'loss/train': 1.6091269254684448} 11/07/2021 04:52:08 - INFO - __main__ - Step 54338: {'lr': 0.00036095118572577773, 'samples': 10432896, 'steps': 54337, 'loss/train': 1.989393949508667} 11/07/2021 04:52:08 - INFO - __main__ - Step 54339: {'lr': 0.00036094643020239564, 'samples': 10433088, 'steps': 54338, 'loss/train': 0.6083117127418518} 11/07/2021 04:52:08 - INFO - __main__ - Step 54340: {'lr': 0.0003609416746290228, 'samples': 10433280, 'steps': 54339, 'loss/train': 0.7100966572761536} 11/07/2021 04:52:09 - INFO - __main__ - Step 54341: {'lr': 0.00036093691900566146, 'samples': 10433472, 'steps': 54340, 'loss/train': 1.5817288160324097} 11/07/2021 04:52:09 - INFO - __main__ - Step 54342: {'lr': 0.00036093216333231356, 'samples': 10433664, 'steps': 54341, 'loss/train': 1.5909347534179688} 11/07/2021 04:52:10 - INFO - __main__ - Step 54343: {'lr': 0.0003609274076089813, 'samples': 10433856, 'steps': 54342, 'loss/train': 1.1382523775100708} 11/07/2021 04:52:10 - INFO - __main__ - Step 54344: {'lr': 0.00036092265183566705, 'samples': 10434048, 'steps': 54343, 'loss/train': 1.5417569875717163} 11/07/2021 04:52:11 - INFO - __main__ - Step 54345: {'lr': 0.0003609178960123726, 'samples': 10434240, 'steps': 54344, 'loss/train': 1.293168544769287} 11/07/2021 04:52:11 - INFO - __main__ - Step 54346: {'lr': 0.0003609131401391003, 'samples': 10434432, 'steps': 54345, 'loss/train': 1.2680710554122925} 11/07/2021 04:52:11 - INFO - __main__ - Step 54347: {'lr': 0.00036090838421585223, 'samples': 10434624, 'steps': 54346, 'loss/train': 1.698599100112915} 11/07/2021 04:52:13 - INFO - __main__ - Step 54348: {'lr': 0.0003609036282426306, 'samples': 10434816, 'steps': 54347, 'loss/train': 1.044775366783142} 11/07/2021 04:52:13 - INFO - __main__ - Step 54349: {'lr': 0.0003608988722194375, 'samples': 10435008, 'steps': 54348, 'loss/train': 1.6415693759918213} 11/07/2021 04:52:13 - INFO - __main__ - Step 54350: {'lr': 0.000360894116146275, 'samples': 10435200, 'steps': 54349, 'loss/train': 1.753548264503479} 11/07/2021 04:52:14 - INFO - __main__ - Step 54351: {'lr': 0.0003608893600231454, 'samples': 10435392, 'steps': 54350, 'loss/train': 1.502634048461914} 11/07/2021 04:52:14 - INFO - __main__ - Step 54352: {'lr': 0.00036088460385005076, 'samples': 10435584, 'steps': 54351, 'loss/train': 1.4957200288772583} 11/07/2021 04:52:14 - INFO - __main__ - Step 54353: {'lr': 0.00036087984762699316, 'samples': 10435776, 'steps': 54352, 'loss/train': 1.4521698951721191} 11/07/2021 04:52:15 - INFO - __main__ - Step 54354: {'lr': 0.00036087509135397487, 'samples': 10435968, 'steps': 54353, 'loss/train': 1.2603263854980469} 11/07/2021 04:52:16 - INFO - __main__ - Step 54355: {'lr': 0.00036087033503099796, 'samples': 10436160, 'steps': 54354, 'loss/train': 1.5180668830871582} 11/07/2021 04:52:16 - INFO - __main__ - Step 54356: {'lr': 0.00036086557865806464, 'samples': 10436352, 'steps': 54355, 'loss/train': 1.1182160377502441} 11/07/2021 04:52:16 - INFO - __main__ - Step 54357: {'lr': 0.000360860822235177, 'samples': 10436544, 'steps': 54356, 'loss/train': 1.7119102478027344} 11/07/2021 04:52:17 - INFO - __main__ - Step 54358: {'lr': 0.0003608560657623371, 'samples': 10436736, 'steps': 54357, 'loss/train': 1.7710380554199219} 11/07/2021 04:52:18 - INFO - __main__ - Step 54359: {'lr': 0.0003608513092395472, 'samples': 10436928, 'steps': 54358, 'loss/train': 1.1594558954238892} 11/07/2021 04:52:18 - INFO - __main__ - Step 54360: {'lr': 0.00036084655266680946, 'samples': 10437120, 'steps': 54359, 'loss/train': 1.8689113855361938} 11/07/2021 04:52:19 - INFO - __main__ - Step 54361: {'lr': 0.00036084179604412594, 'samples': 10437312, 'steps': 54360, 'loss/train': 0.38729095458984375} 11/07/2021 04:52:19 - INFO - __main__ - Step 54362: {'lr': 0.00036083703937149877, 'samples': 10437504, 'steps': 54361, 'loss/train': 1.2443463802337646} 11/07/2021 04:52:19 - INFO - __main__ - Step 54363: {'lr': 0.0003608322826489302, 'samples': 10437696, 'steps': 54362, 'loss/train': 1.8838434219360352} 11/07/2021 04:52:20 - INFO - __main__ - Step 54364: {'lr': 0.00036082752587642225, 'samples': 10437888, 'steps': 54363, 'loss/train': 1.8106375932693481} 11/07/2021 04:52:21 - INFO - __main__ - Step 54365: {'lr': 0.00036082276905397714, 'samples': 10438080, 'steps': 54364, 'loss/train': 1.8011395931243896} 11/07/2021 04:52:21 - INFO - __main__ - Step 54366: {'lr': 0.0003608180121815971, 'samples': 10438272, 'steps': 54365, 'loss/train': 0.9508764147758484} 11/07/2021 04:52:21 - INFO - __main__ - Step 54367: {'lr': 0.0003608132552592841, 'samples': 10438464, 'steps': 54366, 'loss/train': 1.3006246089935303} 11/07/2021 04:52:22 - INFO - __main__ - Step 54368: {'lr': 0.0003608084982870404, 'samples': 10438656, 'steps': 54367, 'loss/train': 1.1658813953399658} 11/07/2021 04:52:22 - INFO - __main__ - Step 54369: {'lr': 0.00036080374126486804, 'samples': 10438848, 'steps': 54368, 'loss/train': 0.719493567943573} 11/07/2021 04:52:23 - INFO - __main__ - Step 54370: {'lr': 0.00036079898419276923, 'samples': 10439040, 'steps': 54369, 'loss/train': 1.668667197227478} 11/07/2021 04:52:23 - INFO - __main__ - Step 54371: {'lr': 0.0003607942270707461, 'samples': 10439232, 'steps': 54370, 'loss/train': 1.707180142402649} 11/07/2021 04:52:24 - INFO - __main__ - Step 54372: {'lr': 0.0003607894698988009, 'samples': 10439424, 'steps': 54371, 'loss/train': 1.9096475839614868} 11/07/2021 04:52:24 - INFO - __main__ - Step 54373: {'lr': 0.0003607847126769356, 'samples': 10439616, 'steps': 54372, 'loss/train': 1.17087984085083} 11/07/2021 04:52:24 - INFO - __main__ - Step 54374: {'lr': 0.0003607799554051524, 'samples': 10439808, 'steps': 54373, 'loss/train': 1.7106337547302246} 11/07/2021 04:52:25 - INFO - __main__ - Step 54375: {'lr': 0.0003607751980834535, 'samples': 10440000, 'steps': 54374, 'loss/train': 1.3186962604522705} 11/07/2021 04:52:26 - INFO - __main__ - Step 54376: {'lr': 0.00036077044071184094, 'samples': 10440192, 'steps': 54375, 'loss/train': 1.4960261583328247} 11/07/2021 04:52:26 - INFO - __main__ - Step 54377: {'lr': 0.00036076568329031694, 'samples': 10440384, 'steps': 54376, 'loss/train': 1.5511620044708252} 11/07/2021 04:52:26 - INFO - __main__ - Step 54378: {'lr': 0.0003607609258188837, 'samples': 10440576, 'steps': 54377, 'loss/train': 1.56057870388031} 11/07/2021 04:52:27 - INFO - __main__ - Step 54379: {'lr': 0.00036075616829754333, 'samples': 10440768, 'steps': 54378, 'loss/train': 1.3680694103240967} 11/07/2021 04:52:28 - INFO - __main__ - Step 54380: {'lr': 0.0003607514107262978, 'samples': 10440960, 'steps': 54379, 'loss/train': 1.5334746837615967} 11/07/2021 04:52:28 - INFO - __main__ - Step 54381: {'lr': 0.0003607466531051495, 'samples': 10441152, 'steps': 54380, 'loss/train': 0.6852833032608032} 11/07/2021 04:52:29 - INFO - __main__ - Step 54382: {'lr': 0.0003607418954341004, 'samples': 10441344, 'steps': 54381, 'loss/train': 1.2319045066833496} 11/07/2021 04:52:29 - INFO - __main__ - Step 54383: {'lr': 0.00036073713771315276, 'samples': 10441536, 'steps': 54382, 'loss/train': 1.784158706665039} 11/07/2021 04:52:29 - INFO - __main__ - Step 54384: {'lr': 0.00036073237994230863, 'samples': 10441728, 'steps': 54383, 'loss/train': 1.4005428552627563} 11/07/2021 04:52:31 - INFO - __main__ - Step 54385: {'lr': 0.0003607276221215702, 'samples': 10441920, 'steps': 54384, 'loss/train': 1.500992774963379} 11/07/2021 04:52:31 - INFO - __main__ - Step 54386: {'lr': 0.0003607228642509397, 'samples': 10442112, 'steps': 54385, 'loss/train': 1.3948326110839844} 11/07/2021 04:52:32 - INFO - __main__ - Step 54387: {'lr': 0.00036071810633041913, 'samples': 10442304, 'steps': 54386, 'loss/train': 1.016112208366394} 11/07/2021 04:52:32 - INFO - __main__ - Step 54388: {'lr': 0.0003607133483600107, 'samples': 10442496, 'steps': 54387, 'loss/train': 1.4534598588943481} 11/07/2021 04:52:32 - INFO - __main__ - Step 54389: {'lr': 0.00036070859033971646, 'samples': 10442688, 'steps': 54388, 'loss/train': 0.3873778283596039} 11/07/2021 04:52:33 - INFO - __main__ - Step 54390: {'lr': 0.00036070383226953875, 'samples': 10442880, 'steps': 54389, 'loss/train': 0.34228697419166565} 11/07/2021 04:52:34 - INFO - __main__ - Step 54391: {'lr': 0.0003606990741494795, 'samples': 10443072, 'steps': 54390, 'loss/train': 1.409023404121399} 11/07/2021 04:52:34 - INFO - __main__ - Step 54392: {'lr': 0.00036069431597954103, 'samples': 10443264, 'steps': 54391, 'loss/train': 1.137148141860962} 11/07/2021 04:52:35 - INFO - __main__ - Step 54393: {'lr': 0.0003606895577597254, 'samples': 10443456, 'steps': 54392, 'loss/train': 1.4473907947540283} 11/07/2021 04:52:35 - INFO - __main__ - Step 54394: {'lr': 0.0003606847994900347, 'samples': 10443648, 'steps': 54393, 'loss/train': 1.473389983177185} 11/07/2021 04:52:36 - INFO - __main__ - Step 54395: {'lr': 0.00036068004117047127, 'samples': 10443840, 'steps': 54394, 'loss/train': 1.6775680780410767} 11/07/2021 04:52:36 - INFO - __main__ - Step 54396: {'lr': 0.000360675282801037, 'samples': 10444032, 'steps': 54395, 'loss/train': 1.2410303354263306} 11/07/2021 04:52:37 - INFO - __main__ - Step 54397: {'lr': 0.0003606705243817342, 'samples': 10444224, 'steps': 54396, 'loss/train': 1.5199062824249268} 11/07/2021 04:52:37 - INFO - __main__ - Step 54398: {'lr': 0.00036066576591256496, 'samples': 10444416, 'steps': 54397, 'loss/train': 0.6575682163238525} 11/07/2021 04:52:37 - INFO - __main__ - Step 54399: {'lr': 0.00036066100739353145, 'samples': 10444608, 'steps': 54398, 'loss/train': 1.3232568502426147} 11/07/2021 04:52:38 - INFO - __main__ - Step 54400: {'lr': 0.0003606562488246358, 'samples': 10444800, 'steps': 54399, 'loss/train': 1.3682371377944946} 11/07/2021 04:52:39 - INFO - __main__ - Step 54401: {'lr': 0.00036065149020588015, 'samples': 10444992, 'steps': 54400, 'loss/train': 0.9206700325012207} 11/07/2021 04:52:39 - INFO - __main__ - Step 54402: {'lr': 0.00036064673153726664, 'samples': 10445184, 'steps': 54401, 'loss/train': 1.3973582983016968} 11/07/2021 04:52:39 - INFO - __main__ - Step 54403: {'lr': 0.0003606419728187974, 'samples': 10445376, 'steps': 54402, 'loss/train': 1.6272251605987549} 11/07/2021 04:52:40 - INFO - __main__ - Step 54404: {'lr': 0.00036063721405047463, 'samples': 10445568, 'steps': 54403, 'loss/train': 1.5553442239761353} 11/07/2021 04:52:40 - INFO - __main__ - Step 54405: {'lr': 0.00036063245523230037, 'samples': 10445760, 'steps': 54404, 'loss/train': 1.4646358489990234} 11/07/2021 04:52:41 - INFO - __main__ - Step 54406: {'lr': 0.0003606276963642769, 'samples': 10445952, 'steps': 54405, 'loss/train': 0.8105902075767517} 11/07/2021 04:52:42 - INFO - __main__ - Step 54407: {'lr': 0.00036062293744640637, 'samples': 10446144, 'steps': 54406, 'loss/train': 1.3325507640838623} 11/07/2021 04:52:42 - INFO - __main__ - Step 54408: {'lr': 0.0003606181784786907, 'samples': 10446336, 'steps': 54407, 'loss/train': 1.3325507640838623} 11/07/2021 04:52:42 - INFO - __main__ - Step 54409: {'lr': 0.00036061341946113225, 'samples': 10446528, 'steps': 54408, 'loss/train': 1.77774977684021} 11/07/2021 04:52:43 - INFO - __main__ - Step 54410: {'lr': 0.0003606086603937331, 'samples': 10446720, 'steps': 54409, 'loss/train': 0.8325642347335815} 11/07/2021 04:52:44 - INFO - __main__ - Step 54411: {'lr': 0.00036060390127649536, 'samples': 10446912, 'steps': 54410, 'loss/train': 1.4419652223587036} 11/07/2021 04:52:44 - INFO - __main__ - Step 54412: {'lr': 0.00036059914210942126, 'samples': 10447104, 'steps': 54411, 'loss/train': 0.8393263816833496} 11/07/2021 04:52:44 - INFO - __main__ - Step 54413: {'lr': 0.0003605943828925129, 'samples': 10447296, 'steps': 54412, 'loss/train': 1.6156970262527466} 11/07/2021 04:52:45 - INFO - __main__ - Step 54414: {'lr': 0.0003605896236257724, 'samples': 10447488, 'steps': 54413, 'loss/train': 1.5920202732086182} 11/07/2021 04:52:45 - INFO - __main__ - Step 54415: {'lr': 0.0003605848643092019, 'samples': 10447680, 'steps': 54414, 'loss/train': 1.5970395803451538} 11/07/2021 04:52:46 - INFO - __main__ - Step 54416: {'lr': 0.00036058010494280357, 'samples': 10447872, 'steps': 54415, 'loss/train': 1.5743522644042969} 11/07/2021 04:52:46 - INFO - __main__ - Step 54417: {'lr': 0.00036057534552657954, 'samples': 10448064, 'steps': 54416, 'loss/train': 0.698424756526947} 11/07/2021 04:52:47 - INFO - __main__ - Step 54418: {'lr': 0.000360570586060532, 'samples': 10448256, 'steps': 54417, 'loss/train': 0.8364660143852234} 11/07/2021 04:52:47 - INFO - __main__ - Step 54419: {'lr': 0.0003605658265446631, 'samples': 10448448, 'steps': 54418, 'loss/train': 1.6641874313354492} 11/07/2021 04:52:47 - INFO - __main__ - Step 54420: {'lr': 0.00036056106697897485, 'samples': 10448640, 'steps': 54419, 'loss/train': 1.6034648418426514} 11/07/2021 04:52:48 - INFO - __main__ - Step 54421: {'lr': 0.0003605563073634696, 'samples': 10448832, 'steps': 54420, 'loss/train': 1.2959027290344238} 11/07/2021 04:52:49 - INFO - __main__ - Step 54422: {'lr': 0.00036055154769814923, 'samples': 10449024, 'steps': 54421, 'loss/train': 1.5499179363250732} 11/07/2021 04:52:49 - INFO - __main__ - Step 54423: {'lr': 0.0003605467879830161, 'samples': 10449216, 'steps': 54422, 'loss/train': 1.3453330993652344} 11/07/2021 04:52:49 - INFO - __main__ - Step 54424: {'lr': 0.00036054202821807235, 'samples': 10449408, 'steps': 54423, 'loss/train': 0.702586829662323} 11/07/2021 04:52:50 - INFO - __main__ - Step 54425: {'lr': 0.00036053726840332004, 'samples': 10449600, 'steps': 54424, 'loss/train': 1.5528888702392578} 11/07/2021 04:52:50 - INFO - __main__ - Step 54426: {'lr': 0.00036053250853876134, 'samples': 10449792, 'steps': 54425, 'loss/train': 1.2884200811386108} 11/07/2021 04:52:51 - INFO - __main__ - Step 54427: {'lr': 0.0003605277486243984, 'samples': 10449984, 'steps': 54426, 'loss/train': 1.3678855895996094} 11/07/2021 04:52:52 - INFO - __main__ - Step 54428: {'lr': 0.0003605229886602334, 'samples': 10450176, 'steps': 54427, 'loss/train': 1.1845022439956665} 11/07/2021 04:52:52 - INFO - __main__ - Step 54429: {'lr': 0.0003605182286462683, 'samples': 10450368, 'steps': 54428, 'loss/train': 1.6093850135803223} 11/07/2021 04:52:52 - INFO - __main__ - Step 54430: {'lr': 0.00036051346858250556, 'samples': 10450560, 'steps': 54429, 'loss/train': 1.5918231010437012} 11/07/2021 04:52:53 - INFO - __main__ - Step 54431: {'lr': 0.0003605087084689471, 'samples': 10450752, 'steps': 54430, 'loss/train': 1.4565256834030151} 11/07/2021 04:52:54 - INFO - __main__ - Step 54432: {'lr': 0.0003605039483055951, 'samples': 10450944, 'steps': 54431, 'loss/train': 1.2546392679214478} 11/07/2021 04:52:54 - INFO - __main__ - Step 54433: {'lr': 0.00036049918809245173, 'samples': 10451136, 'steps': 54432, 'loss/train': 0.7608054876327515} 11/07/2021 04:52:54 - INFO - __main__ - Step 54434: {'lr': 0.00036049442782951915, 'samples': 10451328, 'steps': 54433, 'loss/train': 1.2936904430389404} 11/07/2021 04:52:55 - INFO - __main__ - Step 54435: {'lr': 0.00036048966751679945, 'samples': 10451520, 'steps': 54434, 'loss/train': 1.3748066425323486} 11/07/2021 04:52:55 - INFO - __main__ - Step 54436: {'lr': 0.0003604849071542948, 'samples': 10451712, 'steps': 54435, 'loss/train': 1.7087937593460083} 11/07/2021 04:52:56 - INFO - __main__ - Step 54437: {'lr': 0.0003604801467420074, 'samples': 10451904, 'steps': 54436, 'loss/train': 1.6028019189834595} 11/07/2021 04:52:56 - INFO - __main__ - Step 54438: {'lr': 0.00036047538627993937, 'samples': 10452096, 'steps': 54437, 'loss/train': 2.164919376373291} 11/07/2021 04:52:57 - INFO - __main__ - Step 54439: {'lr': 0.00036047062576809283, 'samples': 10452288, 'steps': 54438, 'loss/train': 1.4333831071853638} 11/07/2021 04:52:57 - INFO - __main__ - Step 54440: {'lr': 0.0003604658652064699, 'samples': 10452480, 'steps': 54439, 'loss/train': 0.7021957039833069} 11/07/2021 04:52:57 - INFO - __main__ - Step 54441: {'lr': 0.00036046110459507275, 'samples': 10452672, 'steps': 54440, 'loss/train': 1.6664040088653564} 11/07/2021 04:52:58 - INFO - __main__ - Step 54442: {'lr': 0.00036045634393390354, 'samples': 10452864, 'steps': 54441, 'loss/train': 1.2602707147598267} 11/07/2021 04:52:59 - INFO - __main__ - Step 54443: {'lr': 0.0003604515832229644, 'samples': 10453056, 'steps': 54442, 'loss/train': 0.9451863169670105} 11/07/2021 04:52:59 - INFO - __main__ - Step 54444: {'lr': 0.0003604468224622575, 'samples': 10453248, 'steps': 54443, 'loss/train': 1.4609249830245972} 11/07/2021 04:53:00 - INFO - __main__ - Step 54445: {'lr': 0.00036044206165178496, 'samples': 10453440, 'steps': 54444, 'loss/train': 1.35500168800354} 11/07/2021 04:53:00 - INFO - __main__ - Step 54446: {'lr': 0.00036043730079154897, 'samples': 10453632, 'steps': 54445, 'loss/train': 1.7862550020217896} 11/07/2021 04:53:01 - INFO - __main__ - Step 54447: {'lr': 0.00036043253988155157, 'samples': 10453824, 'steps': 54446, 'loss/train': 1.4353855848312378} 11/07/2021 04:53:01 - INFO - __main__ - Step 54448: {'lr': 0.00036042777892179503, 'samples': 10454016, 'steps': 54447, 'loss/train': 1.9089937210083008} 11/07/2021 04:53:02 - INFO - __main__ - Step 54449: {'lr': 0.0003604230179122814, 'samples': 10454208, 'steps': 54448, 'loss/train': 0.8357492685317993} 11/07/2021 04:53:02 - INFO - __main__ - Step 54450: {'lr': 0.0003604182568530128, 'samples': 10454400, 'steps': 54449, 'loss/train': 1.3180925846099854} 11/07/2021 04:53:02 - INFO - __main__ - Step 54451: {'lr': 0.0003604134957439915, 'samples': 10454592, 'steps': 54450, 'loss/train': 1.654521107673645} 11/07/2021 04:53:03 - INFO - __main__ - Step 54452: {'lr': 0.00036040873458521963, 'samples': 10454784, 'steps': 54451, 'loss/train': 1.3976401090621948} 11/07/2021 04:53:04 - INFO - __main__ - Step 54453: {'lr': 0.0003604039733766992, 'samples': 10454976, 'steps': 54452, 'loss/train': 0.8282707333564758} 11/07/2021 04:53:04 - INFO - __main__ - Step 54454: {'lr': 0.00036039921211843254, 'samples': 10455168, 'steps': 54453, 'loss/train': 1.454387903213501} 11/07/2021 04:53:04 - INFO - __main__ - Step 54455: {'lr': 0.0003603944508104216, 'samples': 10455360, 'steps': 54454, 'loss/train': 1.7368769645690918} 11/07/2021 04:53:05 - INFO - __main__ - Step 54456: {'lr': 0.0003603896894526687, 'samples': 10455552, 'steps': 54455, 'loss/train': 1.3560283184051514} 11/07/2021 04:53:06 - INFO - __main__ - Step 54457: {'lr': 0.00036038492804517586, 'samples': 10455744, 'steps': 54456, 'loss/train': 1.470711588859558} 11/07/2021 04:53:06 - INFO - __main__ - Step 54458: {'lr': 0.00036038016658794525, 'samples': 10455936, 'steps': 54457, 'loss/train': 1.2212845087051392} 11/07/2021 04:53:06 - INFO - __main__ - Step 54459: {'lr': 0.0003603754050809791, 'samples': 10456128, 'steps': 54458, 'loss/train': 0.828596830368042} 11/07/2021 04:53:07 - INFO - __main__ - Step 54460: {'lr': 0.0003603706435242795, 'samples': 10456320, 'steps': 54459, 'loss/train': 1.429358959197998} 11/07/2021 04:53:07 - INFO - __main__ - Step 54461: {'lr': 0.00036036588191784856, 'samples': 10456512, 'steps': 54460, 'loss/train': 1.4888195991516113} 11/07/2021 04:53:07 - INFO - __main__ - Step 54462: {'lr': 0.0003603611202616885, 'samples': 10456704, 'steps': 54461, 'loss/train': 1.4282782077789307} 11/07/2021 04:53:09 - INFO - __main__ - Step 54463: {'lr': 0.0003603563585558014, 'samples': 10456896, 'steps': 54462, 'loss/train': 0.9998105764389038} 11/07/2021 04:53:09 - INFO - __main__ - Step 54464: {'lr': 0.00036035159680018937, 'samples': 10457088, 'steps': 54463, 'loss/train': 1.499611258506775} 11/07/2021 04:53:09 - INFO - __main__ - Step 54465: {'lr': 0.00036034683499485467, 'samples': 10457280, 'steps': 54464, 'loss/train': 1.0547444820404053} 11/07/2021 04:53:10 - INFO - __main__ - Step 54466: {'lr': 0.0003603420731397994, 'samples': 10457472, 'steps': 54465, 'loss/train': 1.8761290311813354} 11/07/2021 04:53:10 - INFO - __main__ - Step 54467: {'lr': 0.00036033731123502567, 'samples': 10457664, 'steps': 54466, 'loss/train': 0.4542864263057709} 11/07/2021 04:53:11 - INFO - __main__ - Step 54468: {'lr': 0.00036033254928053565, 'samples': 10457856, 'steps': 54467, 'loss/train': 1.2216380834579468} 11/07/2021 04:53:11 - INFO - __main__ - Step 54469: {'lr': 0.0003603277872763315, 'samples': 10458048, 'steps': 54468, 'loss/train': 1.6646921634674072} 11/07/2021 04:53:12 - INFO - __main__ - Step 54470: {'lr': 0.0003603230252224153, 'samples': 10458240, 'steps': 54469, 'loss/train': 2.8909945487976074} 11/07/2021 04:53:12 - INFO - __main__ - Step 54471: {'lr': 0.0003603182631187893, 'samples': 10458432, 'steps': 54470, 'loss/train': 1.6032060384750366} 11/07/2021 04:53:12 - INFO - __main__ - Step 54472: {'lr': 0.00036031350096545555, 'samples': 10458624, 'steps': 54471, 'loss/train': 1.352306604385376} 11/07/2021 04:53:13 - INFO - __main__ - Step 54473: {'lr': 0.0003603087387624163, 'samples': 10458816, 'steps': 54472, 'loss/train': 1.335830569267273} 11/07/2021 04:53:14 - INFO - __main__ - Step 54474: {'lr': 0.0003603039765096736, 'samples': 10459008, 'steps': 54473, 'loss/train': 0.9098039269447327} 11/07/2021 04:53:14 - INFO - __main__ - Step 54475: {'lr': 0.00036029921420722966, 'samples': 10459200, 'steps': 54474, 'loss/train': 1.742047667503357} 11/07/2021 04:53:15 - INFO - __main__ - Step 54476: {'lr': 0.0003602944518550866, 'samples': 10459392, 'steps': 54475, 'loss/train': 1.7945836782455444} 11/07/2021 04:53:15 - INFO - __main__ - Step 54477: {'lr': 0.00036028968945324647, 'samples': 10459584, 'steps': 54476, 'loss/train': 1.4792283773422241} 11/07/2021 04:53:16 - INFO - __main__ - Step 54478: {'lr': 0.00036028492700171166, 'samples': 10459776, 'steps': 54477, 'loss/train': 1.4864531755447388} 11/07/2021 04:53:16 - INFO - __main__ - Step 54479: {'lr': 0.0003602801645004841, 'samples': 10459968, 'steps': 54478, 'loss/train': 1.3378815650939941} 11/07/2021 04:53:17 - INFO - __main__ - Step 54480: {'lr': 0.00036027540194956593, 'samples': 10460160, 'steps': 54479, 'loss/train': 1.5457781553268433} 11/07/2021 04:53:17 - INFO - __main__ - Step 54481: {'lr': 0.00036027063934895935, 'samples': 10460352, 'steps': 54480, 'loss/train': 1.1741148233413696} 11/07/2021 04:53:17 - INFO - __main__ - Step 54482: {'lr': 0.0003602658766986666, 'samples': 10460544, 'steps': 54481, 'loss/train': 0.20403732359409332} 11/07/2021 04:53:18 - INFO - __main__ - Step 54483: {'lr': 0.00036026111399868973, 'samples': 10460736, 'steps': 54482, 'loss/train': 1.2347005605697632} 11/07/2021 04:53:19 - INFO - __main__ - Step 54484: {'lr': 0.00036025635124903093, 'samples': 10460928, 'steps': 54483, 'loss/train': 1.3686023950576782} 11/07/2021 04:53:19 - INFO - __main__ - Step 54485: {'lr': 0.0003602515884496923, 'samples': 10461120, 'steps': 54484, 'loss/train': 2.0651817321777344} 11/07/2021 04:53:20 - INFO - __main__ - Step 54486: {'lr': 0.00036024682560067603, 'samples': 10461312, 'steps': 54485, 'loss/train': 1.4665913581848145} 11/07/2021 04:53:20 - INFO - __main__ - Step 54487: {'lr': 0.00036024206270198416, 'samples': 10461504, 'steps': 54486, 'loss/train': 1.2947725057601929} 11/07/2021 04:53:20 - INFO - __main__ - Step 54488: {'lr': 0.00036023729975361897, 'samples': 10461696, 'steps': 54487, 'loss/train': 1.147482991218567} 11/07/2021 04:53:21 - INFO - __main__ - Step 54489: {'lr': 0.00036023253675558257, 'samples': 10461888, 'steps': 54488, 'loss/train': 1.2997239828109741} 11/07/2021 04:53:22 - INFO - __main__ - Step 54490: {'lr': 0.0003602277737078771, 'samples': 10462080, 'steps': 54489, 'loss/train': 0.9417256712913513} 11/07/2021 04:53:22 - INFO - __main__ - Step 54491: {'lr': 0.00036022301061050467, 'samples': 10462272, 'steps': 54490, 'loss/train': 1.7079122066497803} 11/07/2021 04:53:22 - INFO - __main__ - Step 54492: {'lr': 0.00036021824746346746, 'samples': 10462464, 'steps': 54491, 'loss/train': 0.9305670261383057} 11/07/2021 04:53:23 - INFO - __main__ - Step 54493: {'lr': 0.00036021348426676754, 'samples': 10462656, 'steps': 54492, 'loss/train': 1.520282506942749} 11/07/2021 04:53:24 - INFO - __main__ - Step 54494: {'lr': 0.00036020872102040727, 'samples': 10462848, 'steps': 54493, 'loss/train': 1.3972678184509277} 11/07/2021 04:53:24 - INFO - __main__ - Step 54495: {'lr': 0.00036020395772438853, 'samples': 10463040, 'steps': 54494, 'loss/train': 1.6086684465408325} 11/07/2021 04:53:24 - INFO - __main__ - Step 54496: {'lr': 0.00036019919437871355, 'samples': 10463232, 'steps': 54495, 'loss/train': 1.4100995063781738} 11/07/2021 04:53:25 - INFO - __main__ - Step 54497: {'lr': 0.0003601944309833846, 'samples': 10463424, 'steps': 54496, 'loss/train': 1.7696830034255981} 11/07/2021 04:53:25 - INFO - __main__ - Step 54498: {'lr': 0.0003601896675384037, 'samples': 10463616, 'steps': 54497, 'loss/train': 0.6956066489219666} 11/07/2021 04:53:26 - INFO - __main__ - Step 54499: {'lr': 0.0003601849040437731, 'samples': 10463808, 'steps': 54498, 'loss/train': 1.7751015424728394} 11/07/2021 04:53:26 - INFO - __main__ - Step 54500: {'lr': 0.0003601801404994949, 'samples': 10464000, 'steps': 54499, 'loss/train': 1.4549936056137085} 11/07/2021 04:53:27 - INFO - __main__ - Step 54501: {'lr': 0.0003601753769055711, 'samples': 10464192, 'steps': 54500, 'loss/train': 1.4666833877563477} 11/07/2021 04:53:27 - INFO - __main__ - Step 54502: {'lr': 0.00036017061326200405, 'samples': 10464384, 'steps': 54501, 'loss/train': 1.3497642278671265} 11/07/2021 04:53:27 - INFO - __main__ - Step 54503: {'lr': 0.0003601658495687958, 'samples': 10464576, 'steps': 54502, 'loss/train': 1.423039197921753} 11/07/2021 04:53:29 - INFO - __main__ - Step 54504: {'lr': 0.0003601610858259485, 'samples': 10464768, 'steps': 54503, 'loss/train': 1.674121618270874} 11/07/2021 04:53:30 - INFO - __main__ - Step 54505: {'lr': 0.0003601563220334644, 'samples': 10464960, 'steps': 54504, 'loss/train': 0.5751160979270935} 11/07/2021 04:53:30 - INFO - __main__ - Step 54506: {'lr': 0.0003601515581913455, 'samples': 10465152, 'steps': 54505, 'loss/train': 0.6630145907402039} 11/07/2021 04:53:31 - INFO - __main__ - Step 54507: {'lr': 0.0003601467942995941, 'samples': 10465344, 'steps': 54506, 'loss/train': 1.765760898590088} 11/07/2021 04:53:31 - INFO - __main__ - Step 54508: {'lr': 0.00036014203035821213, 'samples': 10465536, 'steps': 54507, 'loss/train': 1.625728726387024} 11/07/2021 04:53:31 - INFO - __main__ - Step 54509: {'lr': 0.0003601372663672019, 'samples': 10465728, 'steps': 54508, 'loss/train': 1.4943439960479736} 11/07/2021 04:53:32 - INFO - __main__ - Step 54510: {'lr': 0.00036013250232656553, 'samples': 10465920, 'steps': 54509, 'loss/train': 1.8026031255722046} 11/07/2021 04:53:33 - INFO - __main__ - Step 54511: {'lr': 0.0003601277382363051, 'samples': 10466112, 'steps': 54510, 'loss/train': 1.1128720045089722} 11/07/2021 04:53:33 - INFO - __main__ - Step 54512: {'lr': 0.0003601229740964229, 'samples': 10466304, 'steps': 54511, 'loss/train': 1.4502888917922974} 11/07/2021 04:53:33 - INFO - __main__ - Step 54513: {'lr': 0.000360118209906921, 'samples': 10466496, 'steps': 54512, 'loss/train': 1.446730613708496} 11/07/2021 04:53:34 - INFO - __main__ - Step 54514: {'lr': 0.0003601134456678014, 'samples': 10466688, 'steps': 54513, 'loss/train': 1.6567314863204956} 11/07/2021 04:53:35 - INFO - __main__ - Step 54515: {'lr': 0.0003601086813790665, 'samples': 10466880, 'steps': 54514, 'loss/train': 1.2719297409057617} 11/07/2021 04:53:35 - INFO - __main__ - Step 54516: {'lr': 0.00036010391704071823, 'samples': 10467072, 'steps': 54515, 'loss/train': 1.5631134510040283} 11/07/2021 04:53:35 - INFO - __main__ - Step 54517: {'lr': 0.0003600991526527589, 'samples': 10467264, 'steps': 54516, 'loss/train': 1.6666017770767212} 11/07/2021 04:53:36 - INFO - __main__ - Step 54518: {'lr': 0.00036009438821519056, 'samples': 10467456, 'steps': 54517, 'loss/train': 1.1656901836395264} 11/07/2021 04:53:36 - INFO - __main__ - Step 54519: {'lr': 0.0003600896237280154, 'samples': 10467648, 'steps': 54518, 'loss/train': 1.5136204957962036} 11/07/2021 04:53:37 - INFO - __main__ - Step 54520: {'lr': 0.0003600848591912356, 'samples': 10467840, 'steps': 54519, 'loss/train': 2.2862720489501953} 11/07/2021 04:53:38 - INFO - __main__ - Step 54521: {'lr': 0.00036008009460485323, 'samples': 10468032, 'steps': 54520, 'loss/train': 1.4150656461715698} 11/07/2021 04:53:38 - INFO - __main__ - Step 54522: {'lr': 0.00036007532996887043, 'samples': 10468224, 'steps': 54521, 'loss/train': 1.6245402097702026} 11/07/2021 04:53:38 - INFO - __main__ - Step 54523: {'lr': 0.0003600705652832894, 'samples': 10468416, 'steps': 54522, 'loss/train': 1.3718273639678955} 11/07/2021 04:53:39 - INFO - __main__ - Step 54524: {'lr': 0.00036006580054811235, 'samples': 10468608, 'steps': 54523, 'loss/train': 0.33106160163879395} 11/07/2021 04:53:39 - INFO - __main__ - Step 54525: {'lr': 0.00036006103576334124, 'samples': 10468800, 'steps': 54524, 'loss/train': 1.3807979822158813} 11/07/2021 04:53:40 - INFO - __main__ - Step 54526: {'lr': 0.00036005627092897835, 'samples': 10468992, 'steps': 54525, 'loss/train': 1.3589504957199097} 11/07/2021 04:53:40 - INFO - __main__ - Step 54527: {'lr': 0.0003600515060450259, 'samples': 10469184, 'steps': 54526, 'loss/train': 1.1738468408584595} 11/07/2021 04:53:41 - INFO - __main__ - Step 54528: {'lr': 0.0003600467411114858, 'samples': 10469376, 'steps': 54527, 'loss/train': 1.4811841249465942} 11/07/2021 04:53:41 - INFO - __main__ - Step 54529: {'lr': 0.00036004197612836045, 'samples': 10469568, 'steps': 54528, 'loss/train': 1.2551336288452148} 11/07/2021 04:53:41 - INFO - __main__ - Step 54530: {'lr': 0.0003600372110956518, 'samples': 10469760, 'steps': 54529, 'loss/train': 1.477476954460144} 11/07/2021 04:53:42 - INFO - __main__ - Step 54531: {'lr': 0.0003600324460133621, 'samples': 10469952, 'steps': 54530, 'loss/train': 5.719179630279541} 11/07/2021 04:53:43 - INFO - __main__ - Step 54532: {'lr': 0.0003600276808814935, 'samples': 10470144, 'steps': 54531, 'loss/train': 1.3719375133514404} 11/07/2021 04:53:43 - INFO - __main__ - Step 54533: {'lr': 0.00036002291570004806, 'samples': 10470336, 'steps': 54532, 'loss/train': 0.9591979384422302} 11/07/2021 04:53:44 - INFO - __main__ - Step 54534: {'lr': 0.0003600181504690281, 'samples': 10470528, 'steps': 54533, 'loss/train': 1.4933171272277832} 11/07/2021 04:53:44 - INFO - __main__ - Step 54535: {'lr': 0.00036001338518843563, 'samples': 10470720, 'steps': 54534, 'loss/train': 1.0749701261520386} 11/07/2021 04:53:45 - INFO - __main__ - Step 54536: {'lr': 0.0003600086198582728, 'samples': 10470912, 'steps': 54535, 'loss/train': 1.3410433530807495} 11/07/2021 04:53:46 - INFO - __main__ - Step 54537: {'lr': 0.00036000385447854176, 'samples': 10471104, 'steps': 54536, 'loss/train': 1.4538391828536987} 11/07/2021 04:53:46 - INFO - __main__ - Step 54538: {'lr': 0.0003599990890492447, 'samples': 10471296, 'steps': 54537, 'loss/train': 1.852305293083191} 11/07/2021 04:53:46 - INFO - __main__ - Step 54539: {'lr': 0.00035999432357038374, 'samples': 10471488, 'steps': 54538, 'loss/train': 1.7534759044647217} 11/07/2021 04:53:47 - INFO - __main__ - Step 54540: {'lr': 0.0003599895580419611, 'samples': 10471680, 'steps': 54539, 'loss/train': 1.1182537078857422} 11/07/2021 04:53:47 - INFO - __main__ - Step 54541: {'lr': 0.0003599847924639788, 'samples': 10471872, 'steps': 54540, 'loss/train': 1.510610580444336} 11/07/2021 04:53:48 - INFO - __main__ - Step 54542: {'lr': 0.00035998002683643903, 'samples': 10472064, 'steps': 54541, 'loss/train': 0.08167430013418198} 11/07/2021 04:53:48 - INFO - __main__ - Step 54543: {'lr': 0.00035997526115934405, 'samples': 10472256, 'steps': 54542, 'loss/train': 0.7035478353500366} 11/07/2021 04:53:49 - INFO - __main__ - Step 54544: {'lr': 0.00035997049543269583, 'samples': 10472448, 'steps': 54543, 'loss/train': 1.052796721458435} 11/07/2021 04:53:49 - INFO - __main__ - Step 54545: {'lr': 0.0003599657296564966, 'samples': 10472640, 'steps': 54544, 'loss/train': 1.1861445903778076} 11/07/2021 04:53:49 - INFO - __main__ - Step 54546: {'lr': 0.00035996096383074855, 'samples': 10472832, 'steps': 54545, 'loss/train': 1.3695660829544067} 11/07/2021 04:53:50 - INFO - __main__ - Step 54547: {'lr': 0.0003599561979554538, 'samples': 10473024, 'steps': 54546, 'loss/train': 1.4467450380325317} 11/07/2021 04:53:51 - INFO - __main__ - Step 54548: {'lr': 0.0003599514320306144, 'samples': 10473216, 'steps': 54547, 'loss/train': 1.014519214630127} 11/07/2021 04:53:51 - INFO - __main__ - Step 54549: {'lr': 0.0003599466660562327, 'samples': 10473408, 'steps': 54548, 'loss/train': 1.5184181928634644} 11/07/2021 04:53:51 - INFO - __main__ - Step 54550: {'lr': 0.00035994190003231063, 'samples': 10473600, 'steps': 54549, 'loss/train': 1.8656283617019653} 11/07/2021 04:53:52 - INFO - __main__ - Step 54551: {'lr': 0.0003599371339588505, 'samples': 10473792, 'steps': 54550, 'loss/train': 1.8684369325637817} 11/07/2021 04:53:53 - INFO - __main__ - Step 54552: {'lr': 0.00035993236783585437, 'samples': 10473984, 'steps': 54551, 'loss/train': 1.1571929454803467} 11/07/2021 04:53:53 - INFO - __main__ - Step 54553: {'lr': 0.00035992760166332437, 'samples': 10474176, 'steps': 54552, 'loss/train': 1.4314864873886108} 11/07/2021 04:53:53 - INFO - __main__ - Step 54554: {'lr': 0.00035992283544126276, 'samples': 10474368, 'steps': 54553, 'loss/train': 1.5210908651351929} 11/07/2021 04:53:54 - INFO - __main__ - Step 54555: {'lr': 0.00035991806916967154, 'samples': 10474560, 'steps': 54554, 'loss/train': 1.3890198469161987} 11/07/2021 04:53:54 - INFO - __main__ - Step 54556: {'lr': 0.000359913302848553, 'samples': 10474752, 'steps': 54555, 'loss/train': 1.4590320587158203} 11/07/2021 04:53:55 - INFO - __main__ - Step 54557: {'lr': 0.0003599085364779092, 'samples': 10474944, 'steps': 54556, 'loss/train': 1.2427164316177368} 11/07/2021 04:53:55 - INFO - __main__ - Step 54558: {'lr': 0.0003599037700577423, 'samples': 10475136, 'steps': 54557, 'loss/train': 0.5811551213264465} 11/07/2021 04:53:56 - INFO - __main__ - Step 54559: {'lr': 0.0003598990035880545, 'samples': 10475328, 'steps': 54558, 'loss/train': 1.1282968521118164} 11/07/2021 04:53:56 - INFO - __main__ - Step 54560: {'lr': 0.0003598942370688479, 'samples': 10475520, 'steps': 54559, 'loss/train': 1.6033250093460083} 11/07/2021 04:53:57 - INFO - __main__ - Step 54561: {'lr': 0.0003598894705001246, 'samples': 10475712, 'steps': 54560, 'loss/train': 1.3572427034378052} 11/07/2021 04:53:58 - INFO - __main__ - Step 54562: {'lr': 0.00035988470388188684, 'samples': 10475904, 'steps': 54561, 'loss/train': 1.8462798595428467} 11/07/2021 04:53:59 - INFO - __main__ - Step 54563: {'lr': 0.0003598799372141367, 'samples': 10476096, 'steps': 54562, 'loss/train': 1.3394297361373901} 11/07/2021 04:53:59 - INFO - __main__ - Step 54564: {'lr': 0.00035987517049687633, 'samples': 10476288, 'steps': 54563, 'loss/train': 1.4395896196365356} 11/07/2021 04:53:59 - INFO - __main__ - Step 54565: {'lr': 0.0003598704037301079, 'samples': 10476480, 'steps': 54564, 'loss/train': 1.1239593029022217} 11/07/2021 04:54:00 - INFO - __main__ - Step 54566: {'lr': 0.00035986563691383364, 'samples': 10476672, 'steps': 54565, 'loss/train': 1.3218730688095093} 11/07/2021 04:54:00 - INFO - __main__ - Step 54567: {'lr': 0.0003598608700480556, 'samples': 10476864, 'steps': 54566, 'loss/train': 1.663745403289795} 11/07/2021 04:54:01 - INFO - __main__ - Step 54568: {'lr': 0.00035985610313277595, 'samples': 10477056, 'steps': 54567, 'loss/train': 1.3814071416854858} 11/07/2021 04:54:01 - INFO - __main__ - Step 54569: {'lr': 0.0003598513361679968, 'samples': 10477248, 'steps': 54568, 'loss/train': 1.009146809577942} 11/07/2021 04:54:02 - INFO - __main__ - Step 54570: {'lr': 0.00035984656915372034, 'samples': 10477440, 'steps': 54569, 'loss/train': 1.7624433040618896} 11/07/2021 04:54:02 - INFO - __main__ - Step 54571: {'lr': 0.0003598418020899487, 'samples': 10477632, 'steps': 54570, 'loss/train': 0.8357031345367432} 11/07/2021 04:54:02 - INFO - __main__ - Step 54572: {'lr': 0.0003598370349766841, 'samples': 10477824, 'steps': 54571, 'loss/train': 1.2402217388153076} 11/07/2021 04:54:04 - INFO - __main__ - Step 54573: {'lr': 0.0003598322678139285, 'samples': 10478016, 'steps': 54572, 'loss/train': 1.397077202796936} 11/07/2021 04:54:04 - INFO - __main__ - Step 54574: {'lr': 0.00035982750060168436, 'samples': 10478208, 'steps': 54573, 'loss/train': 1.3012789487838745} 11/07/2021 04:54:04 - INFO - __main__ - Step 54575: {'lr': 0.0003598227333399535, 'samples': 10478400, 'steps': 54574, 'loss/train': 1.774486780166626} 11/07/2021 04:54:05 - INFO - __main__ - Step 54576: {'lr': 0.00035981796602873825, 'samples': 10478592, 'steps': 54575, 'loss/train': 1.4299341440200806} 11/07/2021 04:54:05 - INFO - __main__ - Step 54577: {'lr': 0.00035981319866804074, 'samples': 10478784, 'steps': 54576, 'loss/train': 1.7281969785690308} 11/07/2021 04:54:05 - INFO - __main__ - Step 54578: {'lr': 0.00035980843125786306, 'samples': 10478976, 'steps': 54577, 'loss/train': 1.4509475231170654} 11/07/2021 04:54:06 - INFO - __main__ - Step 54579: {'lr': 0.0003598036637982074, 'samples': 10479168, 'steps': 54578, 'loss/train': 0.8924833536148071} 11/07/2021 04:54:07 - INFO - __main__ - Step 54580: {'lr': 0.00035979889628907593, 'samples': 10479360, 'steps': 54579, 'loss/train': 1.6304229497909546} 11/07/2021 04:54:07 - INFO - __main__ - Step 54581: {'lr': 0.0003597941287304708, 'samples': 10479552, 'steps': 54580, 'loss/train': 1.5461974143981934} 11/07/2021 04:54:07 - INFO - __main__ - Step 54582: {'lr': 0.0003597893611223941, 'samples': 10479744, 'steps': 54581, 'loss/train': 1.5604580640792847} 11/07/2021 04:54:08 - INFO - __main__ - Step 54583: {'lr': 0.00035978459346484794, 'samples': 10479936, 'steps': 54582, 'loss/train': 1.3171463012695312} 11/07/2021 04:54:10 - INFO - __main__ - Step 54584: {'lr': 0.0003597798257578346, 'samples': 10480128, 'steps': 54583, 'loss/train': 1.2966458797454834} 11/07/2021 04:54:11 - INFO - __main__ - Step 54585: {'lr': 0.0003597750580013561, 'samples': 10480320, 'steps': 54584, 'loss/train': 1.376119613647461} 11/07/2021 04:54:11 - INFO - __main__ - Step 54586: {'lr': 0.0003597702901954147, 'samples': 10480512, 'steps': 54585, 'loss/train': 1.302351713180542} 11/07/2021 04:54:11 - INFO - __main__ - Step 54587: {'lr': 0.00035976552234001256, 'samples': 10480704, 'steps': 54586, 'loss/train': 1.7879868745803833} 11/07/2021 04:54:12 - INFO - __main__ - Step 54588: {'lr': 0.00035976075443515176, 'samples': 10480896, 'steps': 54587, 'loss/train': 1.8037214279174805} 11/07/2021 04:54:12 - INFO - __main__ - Step 54589: {'lr': 0.0003597559864808344, 'samples': 10481088, 'steps': 54588, 'loss/train': 1.7866116762161255} 11/07/2021 04:54:12 - INFO - __main__ - Step 54590: {'lr': 0.0003597512184770627, 'samples': 10481280, 'steps': 54589, 'loss/train': 1.7747493982315063} 11/07/2021 04:54:13 - INFO - __main__ - Step 54591: {'lr': 0.0003597464504238388, 'samples': 10481472, 'steps': 54590, 'loss/train': 1.5926944017410278} 11/07/2021 04:54:14 - INFO - __main__ - Step 54592: {'lr': 0.00035974168232116486, 'samples': 10481664, 'steps': 54591, 'loss/train': 1.5279266834259033} 11/07/2021 04:54:14 - INFO - __main__ - Step 54593: {'lr': 0.00035973691416904297, 'samples': 10481856, 'steps': 54592, 'loss/train': 1.7045717239379883} 11/07/2021 04:54:14 - INFO - __main__ - Step 54594: {'lr': 0.0003597321459674754, 'samples': 10482048, 'steps': 54593, 'loss/train': 1.2987728118896484} 11/07/2021 04:54:15 - INFO - __main__ - Step 54595: {'lr': 0.0003597273777164641, 'samples': 10482240, 'steps': 54594, 'loss/train': 1.4758131504058838} 11/07/2021 04:54:15 - INFO - __main__ - Step 54596: {'lr': 0.00035972260941601145, 'samples': 10482432, 'steps': 54595, 'loss/train': 0.771090030670166} 11/07/2021 04:54:16 - INFO - __main__ - Step 54597: {'lr': 0.0003597178410661194, 'samples': 10482624, 'steps': 54596, 'loss/train': 1.8686833381652832} 11/07/2021 04:54:17 - INFO - __main__ - Step 54598: {'lr': 0.00035971307266679023, 'samples': 10482816, 'steps': 54597, 'loss/train': 0.07533127069473267} 11/07/2021 04:54:17 - INFO - __main__ - Step 54599: {'lr': 0.000359708304218026, 'samples': 10483008, 'steps': 54598, 'loss/train': 1.3319792747497559} 11/07/2021 04:54:17 - INFO - __main__ - Step 54600: {'lr': 0.00035970353571982897, 'samples': 10483200, 'steps': 54599, 'loss/train': 1.1695317029953003} 11/07/2021 04:54:18 - INFO - __main__ - Step 54601: {'lr': 0.0003596987671722012, 'samples': 10483392, 'steps': 54600, 'loss/train': 1.0282728672027588} 11/07/2021 04:54:19 - INFO - __main__ - Step 54602: {'lr': 0.00035969399857514484, 'samples': 10483584, 'steps': 54601, 'loss/train': 1.557262659072876} 11/07/2021 04:54:19 - INFO - __main__ - Step 54603: {'lr': 0.00035968922992866205, 'samples': 10483776, 'steps': 54602, 'loss/train': 1.3373254537582397} 11/07/2021 04:54:19 - INFO - __main__ - Step 54604: {'lr': 0.00035968446123275493, 'samples': 10483968, 'steps': 54603, 'loss/train': 1.6461188793182373} 11/07/2021 04:54:20 - INFO - __main__ - Step 54605: {'lr': 0.00035967969248742576, 'samples': 10484160, 'steps': 54604, 'loss/train': 1.3914114236831665} 11/07/2021 04:54:20 - INFO - __main__ - Step 54606: {'lr': 0.00035967492369267664, 'samples': 10484352, 'steps': 54605, 'loss/train': 1.6466848850250244} 11/07/2021 04:54:21 - INFO - __main__ - Step 54607: {'lr': 0.00035967015484850964, 'samples': 10484544, 'steps': 54606, 'loss/train': 1.8524785041809082} 11/07/2021 04:54:21 - INFO - __main__ - Step 54608: {'lr': 0.000359665385954927, 'samples': 10484736, 'steps': 54607, 'loss/train': 1.4837788343429565} 11/07/2021 04:54:22 - INFO - __main__ - Step 54609: {'lr': 0.00035966061701193073, 'samples': 10484928, 'steps': 54608, 'loss/train': 1.4896328449249268} 11/07/2021 04:54:22 - INFO - __main__ - Step 54610: {'lr': 0.00035965584801952316, 'samples': 10485120, 'steps': 54609, 'loss/train': 1.3101656436920166} 11/07/2021 04:54:23 - INFO - __main__ - Step 54611: {'lr': 0.0003596510789777064, 'samples': 10485312, 'steps': 54610, 'loss/train': 1.8135974407196045} 11/07/2021 04:54:23 - INFO - __main__ - Step 54612: {'lr': 0.0003596463098864825, 'samples': 10485504, 'steps': 54611, 'loss/train': 1.3681882619857788} 11/07/2021 04:54:24 - INFO - __main__ - Step 54613: {'lr': 0.00035964154074585365, 'samples': 10485696, 'steps': 54612, 'loss/train': 1.1502083539962769} 11/07/2021 04:54:24 - INFO - __main__ - Step 54614: {'lr': 0.00035963677155582204, 'samples': 10485888, 'steps': 54613, 'loss/train': 1.2346999645233154} 11/07/2021 04:54:25 - INFO - __main__ - Step 54615: {'lr': 0.0003596320023163898, 'samples': 10486080, 'steps': 54614, 'loss/train': 2.0042223930358887} 11/07/2021 04:54:25 - INFO - __main__ - Step 54616: {'lr': 0.000359627233027559, 'samples': 10486272, 'steps': 54615, 'loss/train': 1.206191062927246} 11/07/2021 04:54:25 - INFO - __main__ - Step 54617: {'lr': 0.0003596224636893319, 'samples': 10486464, 'steps': 54616, 'loss/train': 1.0956661701202393} 11/07/2021 04:54:26 - INFO - __main__ - Step 54618: {'lr': 0.0003596176943017107, 'samples': 10486656, 'steps': 54617, 'loss/train': 1.5135501623153687} 11/07/2021 04:54:27 - INFO - __main__ - Step 54619: {'lr': 0.0003596129248646974, 'samples': 10486848, 'steps': 54618, 'loss/train': 1.3166604042053223} 11/07/2021 04:54:27 - INFO - __main__ - Step 54620: {'lr': 0.0003596081553782942, 'samples': 10487040, 'steps': 54619, 'loss/train': 0.583804190158844} 11/07/2021 04:54:28 - INFO - __main__ - Step 54621: {'lr': 0.0003596033858425032, 'samples': 10487232, 'steps': 54620, 'loss/train': 1.8584426641464233} 11/07/2021 04:54:28 - INFO - __main__ - Step 54622: {'lr': 0.00035959861625732667, 'samples': 10487424, 'steps': 54621, 'loss/train': 1.3924795389175415} 11/07/2021 04:54:29 - INFO - __main__ - Step 54623: {'lr': 0.0003595938466227667, 'samples': 10487616, 'steps': 54622, 'loss/train': 1.4461191892623901} 11/07/2021 04:54:29 - INFO - __main__ - Step 54624: {'lr': 0.0003595890769388254, 'samples': 10487808, 'steps': 54623, 'loss/train': 1.0373965501785278} 11/07/2021 04:54:30 - INFO - __main__ - Step 54625: {'lr': 0.00035958430720550494, 'samples': 10488000, 'steps': 54624, 'loss/train': 1.4201768636703491} 11/07/2021 04:54:30 - INFO - __main__ - Step 54626: {'lr': 0.00035957953742280754, 'samples': 10488192, 'steps': 54625, 'loss/train': 1.5804579257965088} 11/07/2021 04:54:30 - INFO - __main__ - Step 54627: {'lr': 0.0003595747675907352, 'samples': 10488384, 'steps': 54626, 'loss/train': 1.476851463317871} 11/07/2021 04:54:31 - INFO - __main__ - Step 54628: {'lr': 0.0003595699977092902, 'samples': 10488576, 'steps': 54627, 'loss/train': 1.5772747993469238} 11/07/2021 04:54:32 - INFO - __main__ - Step 54629: {'lr': 0.00035956522777847474, 'samples': 10488768, 'steps': 54628, 'loss/train': 1.0126844644546509} 11/07/2021 04:54:32 - INFO - __main__ - Step 54630: {'lr': 0.00035956045779829085, 'samples': 10488960, 'steps': 54629, 'loss/train': 1.3361972570419312} 11/07/2021 04:54:33 - INFO - __main__ - Step 54631: {'lr': 0.00035955568776874057, 'samples': 10489152, 'steps': 54630, 'loss/train': 1.5533382892608643} 11/07/2021 04:54:33 - INFO - __main__ - Step 54632: {'lr': 0.0003595509176898263, 'samples': 10489344, 'steps': 54631, 'loss/train': 1.1519670486450195} 11/07/2021 04:54:33 - INFO - __main__ - Step 54633: {'lr': 0.0003595461475615501, 'samples': 10489536, 'steps': 54632, 'loss/train': 1.5794849395751953} 11/07/2021 04:54:34 - INFO - __main__ - Step 54634: {'lr': 0.00035954137738391405, 'samples': 10489728, 'steps': 54633, 'loss/train': 2.0183982849121094} 11/07/2021 04:54:35 - INFO - __main__ - Step 54635: {'lr': 0.00035953660715692037, 'samples': 10489920, 'steps': 54634, 'loss/train': 1.477344274520874} 11/07/2021 04:54:35 - INFO - __main__ - Step 54636: {'lr': 0.0003595318368805711, 'samples': 10490112, 'steps': 54635, 'loss/train': 1.7437412738800049} 11/07/2021 04:54:35 - INFO - __main__ - Step 54637: {'lr': 0.00035952706655486855, 'samples': 10490304, 'steps': 54636, 'loss/train': 1.31654691696167} 11/07/2021 04:54:36 - INFO - __main__ - Step 54638: {'lr': 0.0003595222961798148, 'samples': 10490496, 'steps': 54637, 'loss/train': 1.4689444303512573} 11/07/2021 04:54:37 - INFO - __main__ - Step 54639: {'lr': 0.000359517525755412, 'samples': 10490688, 'steps': 54638, 'loss/train': 1.7984954118728638} 11/07/2021 04:54:37 - INFO - __main__ - Step 54640: {'lr': 0.0003595127552816623, 'samples': 10490880, 'steps': 54639, 'loss/train': 0.9978362321853638} 11/07/2021 04:54:37 - INFO - __main__ - Step 54641: {'lr': 0.00035950798475856783, 'samples': 10491072, 'steps': 54640, 'loss/train': 1.6896424293518066} 11/07/2021 04:54:38 - INFO - __main__ - Step 54642: {'lr': 0.0003595032141861307, 'samples': 10491264, 'steps': 54641, 'loss/train': 1.4855388402938843} 11/07/2021 04:54:38 - INFO - __main__ - Step 54643: {'lr': 0.00035949844356435314, 'samples': 10491456, 'steps': 54642, 'loss/train': 1.5688718557357788} 11/07/2021 04:54:39 - INFO - __main__ - Step 54644: {'lr': 0.00035949367289323723, 'samples': 10491648, 'steps': 54643, 'loss/train': 1.24217689037323} 11/07/2021 04:54:40 - INFO - __main__ - Step 54645: {'lr': 0.00035948890217278525, 'samples': 10491840, 'steps': 54644, 'loss/train': 1.5060263872146606} 11/07/2021 04:54:40 - INFO - __main__ - Step 54646: {'lr': 0.0003594841314029992, 'samples': 10492032, 'steps': 54645, 'loss/train': 0.8719421029090881} 11/07/2021 04:54:40 - INFO - __main__ - Step 54647: {'lr': 0.00035947936058388134, 'samples': 10492224, 'steps': 54646, 'loss/train': 1.6886515617370605} 11/07/2021 04:54:41 - INFO - __main__ - Step 54648: {'lr': 0.00035947458971543375, 'samples': 10492416, 'steps': 54647, 'loss/train': 1.4497294425964355} 11/07/2021 04:54:42 - INFO - __main__ - Step 54649: {'lr': 0.00035946981879765854, 'samples': 10492608, 'steps': 54648, 'loss/train': 1.4971867799758911} 11/07/2021 04:54:42 - INFO - __main__ - Step 54650: {'lr': 0.000359465047830558, 'samples': 10492800, 'steps': 54649, 'loss/train': 1.4914140701293945} 11/07/2021 04:54:42 - INFO - __main__ - Step 54651: {'lr': 0.0003594602768141342, 'samples': 10492992, 'steps': 54650, 'loss/train': 1.6268728971481323} 11/07/2021 04:54:43 - INFO - __main__ - Step 54652: {'lr': 0.0003594555057483892, 'samples': 10493184, 'steps': 54651, 'loss/train': 0.7136030793190002} 11/07/2021 04:54:43 - INFO - __main__ - Step 54653: {'lr': 0.0003594507346333253, 'samples': 10493376, 'steps': 54652, 'loss/train': 1.345230221748352} 11/07/2021 04:54:44 - INFO - __main__ - Step 54654: {'lr': 0.00035944596346894456, 'samples': 10493568, 'steps': 54653, 'loss/train': 1.0661356449127197} 11/07/2021 04:54:45 - INFO - __main__ - Step 54655: {'lr': 0.00035944119225524916, 'samples': 10493760, 'steps': 54654, 'loss/train': 1.279564380645752} 11/07/2021 04:54:45 - INFO - __main__ - Step 54656: {'lr': 0.00035943642099224126, 'samples': 10493952, 'steps': 54655, 'loss/train': 1.2487595081329346} 11/07/2021 04:54:45 - INFO - __main__ - Step 54657: {'lr': 0.00035943164967992304, 'samples': 10494144, 'steps': 54656, 'loss/train': 1.5451102256774902} 11/07/2021 04:54:46 - INFO - __main__ - Step 54658: {'lr': 0.00035942687831829655, 'samples': 10494336, 'steps': 54657, 'loss/train': 1.196775197982788} 11/07/2021 04:54:47 - INFO - __main__ - Step 54659: {'lr': 0.000359422106907364, 'samples': 10494528, 'steps': 54658, 'loss/train': 1.6793714761734009} 11/07/2021 04:54:47 - INFO - __main__ - Step 54660: {'lr': 0.00035941733544712755, 'samples': 10494720, 'steps': 54659, 'loss/train': 0.9637711048126221} 11/07/2021 04:54:47 - INFO - __main__ - Step 54661: {'lr': 0.0003594125639375894, 'samples': 10494912, 'steps': 54660, 'loss/train': 1.2932835817337036} 11/07/2021 04:54:48 - INFO - __main__ - Step 54662: {'lr': 0.00035940779237875154, 'samples': 10495104, 'steps': 54661, 'loss/train': 1.0968278646469116} 11/07/2021 04:54:48 - INFO - __main__ - Step 54663: {'lr': 0.00035940302077061624, 'samples': 10495296, 'steps': 54662, 'loss/train': 1.633630394935608} 11/07/2021 04:54:48 - INFO - __main__ - Step 54664: {'lr': 0.0003593982491131857, 'samples': 10495488, 'steps': 54663, 'loss/train': 1.551399827003479} 11/07/2021 04:54:49 - INFO - __main__ - Step 54665: {'lr': 0.00035939347740646186, 'samples': 10495680, 'steps': 54664, 'loss/train': 2.6731960773468018} 11/07/2021 04:54:50 - INFO - __main__ - Step 54666: {'lr': 0.00035938870565044713, 'samples': 10495872, 'steps': 54665, 'loss/train': 1.301660180091858} 11/07/2021 04:54:50 - INFO - __main__ - Step 54667: {'lr': 0.0003593839338451435, 'samples': 10496064, 'steps': 54666, 'loss/train': 1.536627173423767} 11/07/2021 04:54:50 - INFO - __main__ - Step 54668: {'lr': 0.0003593791619905532, 'samples': 10496256, 'steps': 54667, 'loss/train': 1.3612464666366577} 11/07/2021 04:54:51 - INFO - __main__ - Step 54669: {'lr': 0.00035937439008667827, 'samples': 10496448, 'steps': 54668, 'loss/train': 1.4350947141647339} 11/07/2021 04:54:52 - INFO - __main__ - Step 54670: {'lr': 0.00035936961813352094, 'samples': 10496640, 'steps': 54669, 'loss/train': 2.1642894744873047} 11/07/2021 04:54:52 - INFO - __main__ - Step 54671: {'lr': 0.0003593648461310833, 'samples': 10496832, 'steps': 54670, 'loss/train': 1.3447295427322388} 11/07/2021 04:54:53 - INFO - __main__ - Step 54672: {'lr': 0.0003593600740793676, 'samples': 10497024, 'steps': 54671, 'loss/train': 1.5652481317520142} 11/07/2021 04:54:53 - INFO - __main__ - Step 54673: {'lr': 0.00035935530197837596, 'samples': 10497216, 'steps': 54672, 'loss/train': 1.2834434509277344} 11/07/2021 04:54:53 - INFO - __main__ - Step 54674: {'lr': 0.00035935052982811046, 'samples': 10497408, 'steps': 54673, 'loss/train': 1.7741061449050903} 11/07/2021 04:54:54 - INFO - __main__ - Step 54675: {'lr': 0.00035934575762857333, 'samples': 10497600, 'steps': 54674, 'loss/train': 1.2563782930374146} 11/07/2021 04:54:55 - INFO - __main__ - Step 54676: {'lr': 0.00035934098537976675, 'samples': 10497792, 'steps': 54675, 'loss/train': 1.5968042612075806} 11/07/2021 04:54:55 - INFO - __main__ - Step 54677: {'lr': 0.00035933621308169273, 'samples': 10497984, 'steps': 54676, 'loss/train': 1.2757591009140015} 11/07/2021 04:54:55 - INFO - __main__ - Step 54678: {'lr': 0.0003593314407343535, 'samples': 10498176, 'steps': 54677, 'loss/train': 1.7727009057998657} 11/07/2021 04:54:56 - INFO - __main__ - Step 54679: {'lr': 0.00035932666833775117, 'samples': 10498368, 'steps': 54678, 'loss/train': 1.224920630455017} 11/07/2021 04:54:57 - INFO - __main__ - Step 54680: {'lr': 0.00035932189589188803, 'samples': 10498560, 'steps': 54679, 'loss/train': 2.0108485221862793} 11/07/2021 04:54:57 - INFO - __main__ - Step 54681: {'lr': 0.00035931712339676617, 'samples': 10498752, 'steps': 54680, 'loss/train': 1.2102726697921753} 11/07/2021 04:54:57 - INFO - __main__ - Step 54682: {'lr': 0.00035931235085238754, 'samples': 10498944, 'steps': 54681, 'loss/train': 1.6900962591171265} 11/07/2021 04:54:58 - INFO - __main__ - Step 54683: {'lr': 0.0003593075782587545, 'samples': 10499136, 'steps': 54682, 'loss/train': 1.5737942457199097} 11/07/2021 04:54:58 - INFO - __main__ - Step 54684: {'lr': 0.0003593028056158692, 'samples': 10499328, 'steps': 54683, 'loss/train': 1.7900334596633911} 11/07/2021 04:54:58 - INFO - __main__ - Step 54685: {'lr': 0.0003592980329237337, 'samples': 10499520, 'steps': 54684, 'loss/train': 1.5666292905807495} 11/07/2021 04:54:59 - INFO - __main__ - Step 54686: {'lr': 0.0003592932601823502, 'samples': 10499712, 'steps': 54685, 'loss/train': 1.1327369213104248} 11/07/2021 04:55:00 - INFO - __main__ - Step 54687: {'lr': 0.0003592884873917209, 'samples': 10499904, 'steps': 54686, 'loss/train': 1.9120479822158813} 11/07/2021 04:55:00 - INFO - __main__ - Step 54688: {'lr': 0.0003592837145518479, 'samples': 10500096, 'steps': 54687, 'loss/train': 1.423751950263977} 11/07/2021 04:55:01 - INFO - __main__ - Step 54689: {'lr': 0.00035927894166273323, 'samples': 10500288, 'steps': 54688, 'loss/train': 0.8661104440689087} 11/07/2021 04:55:01 - INFO - __main__ - Step 54690: {'lr': 0.0003592741687243792, 'samples': 10500480, 'steps': 54689, 'loss/train': 1.3181918859481812} 11/07/2021 04:55:02 - INFO - __main__ - Step 54691: {'lr': 0.00035926939573678796, 'samples': 10500672, 'steps': 54690, 'loss/train': 0.8958945870399475} 11/07/2021 04:55:02 - INFO - __main__ - Step 54692: {'lr': 0.0003592646226999616, 'samples': 10500864, 'steps': 54691, 'loss/train': 1.375899076461792} 11/07/2021 04:55:02 - INFO - __main__ - Step 54693: {'lr': 0.0003592598496139023, 'samples': 10501056, 'steps': 54692, 'loss/train': 1.7506818771362305} 11/07/2021 04:55:03 - INFO - __main__ - Step 54694: {'lr': 0.0003592550764786122, 'samples': 10501248, 'steps': 54693, 'loss/train': 1.0960344076156616} 11/07/2021 04:55:03 - INFO - __main__ - Step 54695: {'lr': 0.00035925030329409343, 'samples': 10501440, 'steps': 54694, 'loss/train': 1.2517138719558716} 11/07/2021 04:55:04 - INFO - __main__ - Step 54696: {'lr': 0.0003592455300603481, 'samples': 10501632, 'steps': 54695, 'loss/train': 1.1153273582458496} 11/07/2021 04:55:05 - INFO - __main__ - Step 54697: {'lr': 0.0003592407567773785, 'samples': 10501824, 'steps': 54696, 'loss/train': 1.6166651248931885} 11/07/2021 04:55:05 - INFO - __main__ - Step 54698: {'lr': 0.0003592359834451866, 'samples': 10502016, 'steps': 54697, 'loss/train': 1.2193481922149658} 11/07/2021 04:55:05 - INFO - __main__ - Step 54699: {'lr': 0.0003592312100637748, 'samples': 10502208, 'steps': 54698, 'loss/train': 1.7973885536193848} 11/07/2021 04:55:06 - INFO - __main__ - Step 54700: {'lr': 0.00035922643663314504, 'samples': 10502400, 'steps': 54699, 'loss/train': 1.788243293762207} 11/07/2021 04:55:07 - INFO - __main__ - Step 54701: {'lr': 0.00035922166315329954, 'samples': 10502592, 'steps': 54700, 'loss/train': 1.3844908475875854} 11/07/2021 04:55:07 - INFO - __main__ - Step 54702: {'lr': 0.0003592168896242404, 'samples': 10502784, 'steps': 54701, 'loss/train': 1.2753205299377441} 11/07/2021 04:55:07 - INFO - __main__ - Step 54703: {'lr': 0.00035921211604596985, 'samples': 10502976, 'steps': 54702, 'loss/train': 1.247143030166626} 11/07/2021 04:55:08 - INFO - __main__ - Step 54704: {'lr': 0.00035920734241849, 'samples': 10503168, 'steps': 54703, 'loss/train': 1.7621675729751587} 11/07/2021 04:55:08 - INFO - __main__ - Step 54705: {'lr': 0.00035920256874180304, 'samples': 10503360, 'steps': 54704, 'loss/train': 1.524994134902954} 11/07/2021 04:55:09 - INFO - __main__ - Step 54706: {'lr': 0.00035919779501591097, 'samples': 10503552, 'steps': 54705, 'loss/train': 1.1982957124710083} 11/07/2021 04:55:09 - INFO - __main__ - Step 54707: {'lr': 0.00035919302124081613, 'samples': 10503744, 'steps': 54706, 'loss/train': 1.6131422519683838} 11/07/2021 04:55:10 - INFO - __main__ - Step 54708: {'lr': 0.0003591882474165207, 'samples': 10503936, 'steps': 54707, 'loss/train': 1.636912226676941} 11/07/2021 04:55:10 - INFO - __main__ - Step 54709: {'lr': 0.00035918347354302663, 'samples': 10504128, 'steps': 54708, 'loss/train': 1.732164740562439} 11/07/2021 04:55:10 - INFO - __main__ - Step 54710: {'lr': 0.00035917869962033615, 'samples': 10504320, 'steps': 54709, 'loss/train': 1.8825229406356812} 11/07/2021 04:55:11 - INFO - __main__ - Step 54711: {'lr': 0.00035917392564845146, 'samples': 10504512, 'steps': 54710, 'loss/train': 1.311721920967102} 11/07/2021 04:55:12 - INFO - __main__ - Step 54712: {'lr': 0.00035916915162737467, 'samples': 10504704, 'steps': 54711, 'loss/train': 1.5228288173675537} 11/07/2021 04:55:12 - INFO - __main__ - Step 54713: {'lr': 0.00035916437755710795, 'samples': 10504896, 'steps': 54712, 'loss/train': 1.5763819217681885} 11/07/2021 04:55:13 - INFO - __main__ - Step 54714: {'lr': 0.0003591596034376535, 'samples': 10505088, 'steps': 54713, 'loss/train': 1.1772390604019165} 11/07/2021 04:55:13 - INFO - __main__ - Step 54715: {'lr': 0.0003591548292690134, 'samples': 10505280, 'steps': 54714, 'loss/train': 1.0528409481048584} 11/07/2021 04:55:15 - INFO - __main__ - Step 54716: {'lr': 0.0003591500550511898, 'samples': 10505472, 'steps': 54715, 'loss/train': 1.5328580141067505} 11/07/2021 04:55:15 - INFO - __main__ - Step 54717: {'lr': 0.00035914528078418486, 'samples': 10505664, 'steps': 54716, 'loss/train': 1.5203498601913452} 11/07/2021 04:55:15 - INFO - __main__ - Step 54718: {'lr': 0.0003591405064680007, 'samples': 10505856, 'steps': 54717, 'loss/train': 1.7637076377868652} 11/07/2021 04:55:16 - INFO - __main__ - Step 54719: {'lr': 0.0003591357321026396, 'samples': 10506048, 'steps': 54718, 'loss/train': 1.7425967454910278} 11/07/2021 04:55:16 - INFO - __main__ - Step 54720: {'lr': 0.00035913095768810356, 'samples': 10506240, 'steps': 54719, 'loss/train': 1.5587249994277954} 11/07/2021 04:55:17 - INFO - __main__ - Step 54721: {'lr': 0.00035912618322439483, 'samples': 10506432, 'steps': 54720, 'loss/train': 1.2138665914535522} 11/07/2021 04:55:17 - INFO - __main__ - Step 54722: {'lr': 0.00035912140871151554, 'samples': 10506624, 'steps': 54721, 'loss/train': 0.9029083847999573} 11/07/2021 04:55:17 - INFO - __main__ - Step 54723: {'lr': 0.0003591166341494678, 'samples': 10506816, 'steps': 54722, 'loss/train': 1.1368441581726074} 11/07/2021 04:55:18 - INFO - __main__ - Step 54724: {'lr': 0.00035911185953825373, 'samples': 10507008, 'steps': 54723, 'loss/train': 2.8409199714660645} 11/07/2021 04:55:19 - INFO - __main__ - Step 54725: {'lr': 0.0003591070848778756, 'samples': 10507200, 'steps': 54724, 'loss/train': 1.2468154430389404} 11/07/2021 04:55:19 - INFO - __main__ - Step 54726: {'lr': 0.0003591023101683355, 'samples': 10507392, 'steps': 54725, 'loss/train': 1.3183799982070923} 11/07/2021 04:55:19 - INFO - __main__ - Step 54727: {'lr': 0.0003590975354096356, 'samples': 10507584, 'steps': 54726, 'loss/train': 1.333478331565857} 11/07/2021 04:55:20 - INFO - __main__ - Step 54728: {'lr': 0.000359092760601778, 'samples': 10507776, 'steps': 54727, 'loss/train': 0.8363317251205444} 11/07/2021 04:55:21 - INFO - __main__ - Step 54729: {'lr': 0.0003590879857447649, 'samples': 10507968, 'steps': 54728, 'loss/train': 1.8350917100906372} 11/07/2021 04:55:21 - INFO - __main__ - Step 54730: {'lr': 0.0003590832108385985, 'samples': 10508160, 'steps': 54729, 'loss/train': 1.5275685787200928} 11/07/2021 04:55:22 - INFO - __main__ - Step 54731: {'lr': 0.0003590784358832808, 'samples': 10508352, 'steps': 54730, 'loss/train': 1.3093987703323364} 11/07/2021 04:55:22 - INFO - __main__ - Step 54732: {'lr': 0.00035907366087881403, 'samples': 10508544, 'steps': 54731, 'loss/train': 1.8257447481155396} 11/07/2021 04:55:22 - INFO - __main__ - Step 54733: {'lr': 0.00035906888582520034, 'samples': 10508736, 'steps': 54732, 'loss/train': 2.0174505710601807} 11/07/2021 04:55:23 - INFO - __main__ - Step 54734: {'lr': 0.000359064110722442, 'samples': 10508928, 'steps': 54733, 'loss/train': 1.5022810697555542} 11/07/2021 04:55:24 - INFO - __main__ - Step 54735: {'lr': 0.00035905933557054103, 'samples': 10509120, 'steps': 54734, 'loss/train': 1.521554708480835} 11/07/2021 04:55:24 - INFO - __main__ - Step 54736: {'lr': 0.0003590545603694996, 'samples': 10509312, 'steps': 54735, 'loss/train': 0.6441707015037537} 11/07/2021 04:55:24 - INFO - __main__ - Step 54737: {'lr': 0.0003590497851193198, 'samples': 10509504, 'steps': 54736, 'loss/train': 1.5682966709136963} 11/07/2021 04:55:25 - INFO - __main__ - Step 54738: {'lr': 0.00035904500982000386, 'samples': 10509696, 'steps': 54737, 'loss/train': 1.404206395149231} 11/07/2021 04:55:25 - INFO - __main__ - Step 54739: {'lr': 0.0003590402344715539, 'samples': 10509888, 'steps': 54738, 'loss/train': 1.6242003440856934} 11/07/2021 04:55:26 - INFO - __main__ - Step 54740: {'lr': 0.00035903545907397215, 'samples': 10510080, 'steps': 54739, 'loss/train': 1.3122817277908325} 11/07/2021 04:55:26 - INFO - __main__ - Step 54741: {'lr': 0.0003590306836272608, 'samples': 10510272, 'steps': 54740, 'loss/train': 1.3039827346801758} 11/07/2021 04:55:27 - INFO - __main__ - Step 54742: {'lr': 0.0003590259081314218, 'samples': 10510464, 'steps': 54741, 'loss/train': 1.6676901578903198} 11/07/2021 04:55:27 - INFO - __main__ - Step 54743: {'lr': 0.00035902113258645733, 'samples': 10510656, 'steps': 54742, 'loss/train': 1.427638292312622} 11/07/2021 04:55:27 - INFO - __main__ - Step 54744: {'lr': 0.0003590163569923697, 'samples': 10510848, 'steps': 54743, 'loss/train': 1.2986115217208862} 11/07/2021 04:55:28 - INFO - __main__ - Step 54745: {'lr': 0.000359011581349161, 'samples': 10511040, 'steps': 54744, 'loss/train': 1.6545664072036743} 11/07/2021 04:55:29 - INFO - __main__ - Step 54746: {'lr': 0.00035900680565683333, 'samples': 10511232, 'steps': 54745, 'loss/train': 1.1662455797195435} 11/07/2021 04:55:29 - INFO - __main__ - Step 54747: {'lr': 0.00035900202991538894, 'samples': 10511424, 'steps': 54746, 'loss/train': 1.3269716501235962} 11/07/2021 04:55:29 - INFO - __main__ - Step 54748: {'lr': 0.00035899725412482985, 'samples': 10511616, 'steps': 54747, 'loss/train': 1.8401679992675781} 11/07/2021 04:55:30 - INFO - __main__ - Step 54749: {'lr': 0.00035899247828515837, 'samples': 10511808, 'steps': 54748, 'loss/train': 1.302247166633606} 11/07/2021 04:55:31 - INFO - __main__ - Step 54750: {'lr': 0.0003589877023963765, 'samples': 10512000, 'steps': 54749, 'loss/train': 1.7354470491409302} 11/07/2021 04:55:31 - INFO - __main__ - Step 54751: {'lr': 0.0003589829264584864, 'samples': 10512192, 'steps': 54750, 'loss/train': 1.776680827140808} 11/07/2021 04:55:31 - INFO - __main__ - Step 54752: {'lr': 0.00035897815047149033, 'samples': 10512384, 'steps': 54751, 'loss/train': 1.0573840141296387} 11/07/2021 04:55:32 - INFO - __main__ - Step 54753: {'lr': 0.00035897337443539036, 'samples': 10512576, 'steps': 54752, 'loss/train': 0.4910214841365814} 11/07/2021 04:55:32 - INFO - __main__ - Step 54754: {'lr': 0.0003589685983501887, 'samples': 10512768, 'steps': 54753, 'loss/train': 1.791231632232666} 11/07/2021 04:55:33 - INFO - __main__ - Step 54755: {'lr': 0.0003589638222158874, 'samples': 10512960, 'steps': 54754, 'loss/train': 1.1934956312179565} 11/07/2021 04:55:34 - INFO - __main__ - Step 54756: {'lr': 0.00035895904603248875, 'samples': 10513152, 'steps': 54755, 'loss/train': 1.5078513622283936} 11/07/2021 04:55:34 - INFO - __main__ - Step 54757: {'lr': 0.0003589542697999948, 'samples': 10513344, 'steps': 54756, 'loss/train': 1.753839135169983} 11/07/2021 04:55:34 - INFO - __main__ - Step 54758: {'lr': 0.00035894949351840784, 'samples': 10513536, 'steps': 54757, 'loss/train': 1.292758822441101} 11/07/2021 04:55:35 - INFO - __main__ - Step 54759: {'lr': 0.0003589447171877298, 'samples': 10513728, 'steps': 54758, 'loss/train': 1.2573915719985962} 11/07/2021 04:55:36 - INFO - __main__ - Step 54760: {'lr': 0.000358939940807963, 'samples': 10513920, 'steps': 54759, 'loss/train': 1.3286813497543335} 11/07/2021 04:55:36 - INFO - __main__ - Step 54761: {'lr': 0.00035893516437910956, 'samples': 10514112, 'steps': 54760, 'loss/train': 1.1425011157989502} 11/07/2021 04:55:37 - INFO - __main__ - Step 54762: {'lr': 0.00035893038790117156, 'samples': 10514304, 'steps': 54761, 'loss/train': 1.787023901939392} 11/07/2021 04:55:37 - INFO - __main__ - Step 54763: {'lr': 0.0003589256113741513, 'samples': 10514496, 'steps': 54762, 'loss/train': 1.1434789896011353} 11/07/2021 04:55:37 - INFO - __main__ - Step 54764: {'lr': 0.00035892083479805077, 'samples': 10514688, 'steps': 54763, 'loss/train': 1.3133426904678345} 11/07/2021 04:55:38 - INFO - __main__ - Step 54765: {'lr': 0.0003589160581728722, 'samples': 10514880, 'steps': 54764, 'loss/train': 1.6563326120376587} 11/07/2021 04:55:39 - INFO - __main__ - Step 54766: {'lr': 0.0003589112814986177, 'samples': 10515072, 'steps': 54765, 'loss/train': 0.22022061049938202} 11/07/2021 04:55:40 - INFO - __main__ - Step 54767: {'lr': 0.00035890650477528953, 'samples': 10515264, 'steps': 54766, 'loss/train': 1.168055534362793} 11/07/2021 04:55:40 - INFO - __main__ - Step 54768: {'lr': 0.00035890172800288965, 'samples': 10515456, 'steps': 54767, 'loss/train': 1.6907206773757935} 11/07/2021 04:55:40 - INFO - __main__ - Step 54769: {'lr': 0.0003588969511814205, 'samples': 10515648, 'steps': 54768, 'loss/train': 1.3247848749160767} 11/07/2021 04:55:41 - INFO - __main__ - Step 54770: {'lr': 0.00035889217431088396, 'samples': 10515840, 'steps': 54769, 'loss/train': 1.4610716104507446} 11/07/2021 04:55:41 - INFO - __main__ - Step 54771: {'lr': 0.00035888739739128227, 'samples': 10516032, 'steps': 54770, 'loss/train': 1.753288984298706} 11/07/2021 04:55:42 - INFO - __main__ - Step 54772: {'lr': 0.00035888262042261767, 'samples': 10516224, 'steps': 54771, 'loss/train': 1.5406216382980347} 11/07/2021 04:55:42 - INFO - __main__ - Step 54773: {'lr': 0.0003588778434048922, 'samples': 10516416, 'steps': 54772, 'loss/train': 1.367986798286438} 11/07/2021 04:55:43 - INFO - __main__ - Step 54774: {'lr': 0.0003588730663381081, 'samples': 10516608, 'steps': 54773, 'loss/train': 1.1686124801635742} 11/07/2021 04:55:43 - INFO - __main__ - Step 54775: {'lr': 0.00035886828922226737, 'samples': 10516800, 'steps': 54774, 'loss/train': 1.2851676940917969} 11/07/2021 04:55:44 - INFO - __main__ - Step 54776: {'lr': 0.00035886351205737237, 'samples': 10516992, 'steps': 54775, 'loss/train': 1.709902286529541} 11/07/2021 04:55:45 - INFO - __main__ - Step 54777: {'lr': 0.00035885873484342514, 'samples': 10517184, 'steps': 54776, 'loss/train': 0.9926208853721619} 11/07/2021 04:55:45 - INFO - __main__ - Step 54778: {'lr': 0.00035885395758042784, 'samples': 10517376, 'steps': 54777, 'loss/train': 1.3379695415496826} 11/07/2021 04:55:45 - INFO - __main__ - Step 54779: {'lr': 0.0003588491802683826, 'samples': 10517568, 'steps': 54778, 'loss/train': 1.153033971786499} 11/07/2021 04:55:46 - INFO - __main__ - Step 54780: {'lr': 0.0003588444029072916, 'samples': 10517760, 'steps': 54779, 'loss/train': 2.080644130706787} 11/07/2021 04:55:46 - INFO - __main__ - Step 54781: {'lr': 0.000358839625497157, 'samples': 10517952, 'steps': 54780, 'loss/train': 1.499314546585083} 11/07/2021 04:55:47 - INFO - __main__ - Step 54782: {'lr': 0.0003588348480379809, 'samples': 10518144, 'steps': 54781, 'loss/train': 1.6469850540161133} 11/07/2021 04:55:48 - INFO - __main__ - Step 54783: {'lr': 0.0003588300705297656, 'samples': 10518336, 'steps': 54782, 'loss/train': 1.3472864627838135} 11/07/2021 04:55:48 - INFO - __main__ - Step 54784: {'lr': 0.0003588252929725131, 'samples': 10518528, 'steps': 54783, 'loss/train': 1.4015719890594482} 11/07/2021 04:55:48 - INFO - __main__ - Step 54785: {'lr': 0.0003588205153662256, 'samples': 10518720, 'steps': 54784, 'loss/train': 1.296350359916687} 11/07/2021 04:55:49 - INFO - __main__ - Step 54786: {'lr': 0.0003588157377109052, 'samples': 10518912, 'steps': 54785, 'loss/train': 1.4564239978790283} 11/07/2021 04:55:49 - INFO - __main__ - Step 54787: {'lr': 0.0003588109600065541, 'samples': 10519104, 'steps': 54786, 'loss/train': 1.848463535308838} 11/07/2021 04:55:50 - INFO - __main__ - Step 54788: {'lr': 0.0003588061822531745, 'samples': 10519296, 'steps': 54787, 'loss/train': 1.4672874212265015} 11/07/2021 04:55:50 - INFO - __main__ - Step 54789: {'lr': 0.00035880140445076857, 'samples': 10519488, 'steps': 54788, 'loss/train': 1.2948073148727417} 11/07/2021 04:55:51 - INFO - __main__ - Step 54790: {'lr': 0.0003587966265993384, 'samples': 10519680, 'steps': 54789, 'loss/train': 1.4138782024383545} 11/07/2021 04:55:51 - INFO - __main__ - Step 54791: {'lr': 0.0003587918486988861, 'samples': 10519872, 'steps': 54790, 'loss/train': 1.0976370573043823} 11/07/2021 04:55:51 - INFO - __main__ - Step 54792: {'lr': 0.0003587870707494139, 'samples': 10520064, 'steps': 54791, 'loss/train': 1.3246370553970337} 11/07/2021 04:55:52 - INFO - __main__ - Step 54793: {'lr': 0.0003587822927509239, 'samples': 10520256, 'steps': 54792, 'loss/train': 1.5439074039459229} 11/07/2021 04:55:53 - INFO - __main__ - Step 54794: {'lr': 0.00035877751470341824, 'samples': 10520448, 'steps': 54793, 'loss/train': 1.418664813041687} 11/07/2021 04:55:53 - INFO - __main__ - Step 54795: {'lr': 0.00035877273660689916, 'samples': 10520640, 'steps': 54794, 'loss/train': 1.3769599199295044} 11/07/2021 04:55:53 - INFO - __main__ - Step 54796: {'lr': 0.0003587679584613688, 'samples': 10520832, 'steps': 54795, 'loss/train': 1.7042253017425537} 11/07/2021 04:55:54 - INFO - __main__ - Step 54797: {'lr': 0.00035876318026682925, 'samples': 10521024, 'steps': 54796, 'loss/train': 1.5919417142868042} 11/07/2021 04:55:55 - INFO - __main__ - Step 54798: {'lr': 0.0003587584020232827, 'samples': 10521216, 'steps': 54797, 'loss/train': 1.1344317197799683} 11/07/2021 04:55:55 - INFO - __main__ - Step 54799: {'lr': 0.00035875362373073125, 'samples': 10521408, 'steps': 54798, 'loss/train': 1.5724691152572632} 11/07/2021 04:55:56 - INFO - __main__ - Step 54800: {'lr': 0.00035874884538917705, 'samples': 10521600, 'steps': 54799, 'loss/train': 1.3933013677597046} 11/07/2021 04:55:56 - INFO - __main__ - Step 54801: {'lr': 0.0003587440669986224, 'samples': 10521792, 'steps': 54800, 'loss/train': 1.2266603708267212} 11/07/2021 04:55:56 - INFO - __main__ - Step 54802: {'lr': 0.00035873928855906933, 'samples': 10521984, 'steps': 54801, 'loss/train': 1.2729222774505615} 11/07/2021 04:55:58 - INFO - __main__ - Step 54803: {'lr': 0.00035873451007052, 'samples': 10522176, 'steps': 54802, 'loss/train': 1.3956317901611328} 11/07/2021 04:55:58 - INFO - __main__ - Step 54804: {'lr': 0.00035872973153297657, 'samples': 10522368, 'steps': 54803, 'loss/train': 1.8258665800094604} 11/07/2021 04:55:58 - INFO - __main__ - Step 54805: {'lr': 0.0003587249529464412, 'samples': 10522560, 'steps': 54804, 'loss/train': 1.849915623664856} 11/07/2021 04:55:59 - INFO - __main__ - Step 54806: {'lr': 0.00035872017431091605, 'samples': 10522752, 'steps': 54805, 'loss/train': 1.55470609664917} 11/07/2021 04:55:59 - INFO - __main__ - Step 54807: {'lr': 0.0003587153956264033, 'samples': 10522944, 'steps': 54806, 'loss/train': 1.1715747117996216} 11/07/2021 04:56:00 - INFO - __main__ - Step 54808: {'lr': 0.00035871061689290496, 'samples': 10523136, 'steps': 54807, 'loss/train': 2.1414685249328613} 11/07/2021 04:56:00 - INFO - __main__ - Step 54809: {'lr': 0.00035870583811042347, 'samples': 10523328, 'steps': 54808, 'loss/train': 1.671202301979065} 11/07/2021 04:56:01 - INFO - __main__ - Step 54810: {'lr': 0.0003587010592789607, 'samples': 10523520, 'steps': 54809, 'loss/train': 1.3291504383087158} 11/07/2021 04:56:01 - INFO - __main__ - Step 54811: {'lr': 0.0003586962803985189, 'samples': 10523712, 'steps': 54810, 'loss/train': 1.4139256477355957} 11/07/2021 04:56:01 - INFO - __main__ - Step 54812: {'lr': 0.00035869150146910025, 'samples': 10523904, 'steps': 54811, 'loss/train': 1.8183473348617554} 11/07/2021 04:56:02 - INFO - __main__ - Step 54813: {'lr': 0.00035868672249070684, 'samples': 10524096, 'steps': 54812, 'loss/train': 1.4273749589920044} 11/07/2021 04:56:03 - INFO - __main__ - Step 54814: {'lr': 0.00035868194346334094, 'samples': 10524288, 'steps': 54813, 'loss/train': 1.55579674243927} 11/07/2021 04:56:03 - INFO - __main__ - Step 54815: {'lr': 0.0003586771643870046, 'samples': 10524480, 'steps': 54814, 'loss/train': 1.4467002153396606} 11/07/2021 04:56:03 - INFO - __main__ - Step 54816: {'lr': 0.0003586723852617, 'samples': 10524672, 'steps': 54815, 'loss/train': 1.8852932453155518} 11/07/2021 04:56:04 - INFO - __main__ - Step 54817: {'lr': 0.00035866760608742934, 'samples': 10524864, 'steps': 54816, 'loss/train': 1.3477979898452759} 11/07/2021 04:56:04 - INFO - __main__ - Step 54818: {'lr': 0.0003586628268641947, 'samples': 10525056, 'steps': 54817, 'loss/train': 2.309300661087036} 11/07/2021 04:56:05 - INFO - __main__ - Step 54819: {'lr': 0.00035865804759199825, 'samples': 10525248, 'steps': 54818, 'loss/train': 1.7186243534088135} 11/07/2021 04:56:06 - INFO - __main__ - Step 54820: {'lr': 0.00035865326827084224, 'samples': 10525440, 'steps': 54819, 'loss/train': 1.4784338474273682} 11/07/2021 04:56:06 - INFO - __main__ - Step 54821: {'lr': 0.00035864848890072864, 'samples': 10525632, 'steps': 54820, 'loss/train': 0.2255532443523407} 11/07/2021 04:56:06 - INFO - __main__ - Step 54822: {'lr': 0.0003586437094816598, 'samples': 10525824, 'steps': 54821, 'loss/train': 1.672151803970337} 11/07/2021 04:56:07 - INFO - __main__ - Step 54823: {'lr': 0.00035863893001363776, 'samples': 10526016, 'steps': 54822, 'loss/train': 1.2856768369674683} 11/07/2021 04:56:08 - INFO - __main__ - Step 54824: {'lr': 0.0003586341504966647, 'samples': 10526208, 'steps': 54823, 'loss/train': 1.5706262588500977} 11/07/2021 04:56:08 - INFO - __main__ - Step 54825: {'lr': 0.00035862937093074273, 'samples': 10526400, 'steps': 54824, 'loss/train': 1.5209360122680664} 11/07/2021 04:56:08 - INFO - __main__ - Step 54826: {'lr': 0.000358624591315874, 'samples': 10526592, 'steps': 54825, 'loss/train': 1.3436338901519775} 11/07/2021 04:56:09 - INFO - __main__ - Step 54827: {'lr': 0.0003586198116520608, 'samples': 10526784, 'steps': 54826, 'loss/train': 1.5253491401672363} 11/07/2021 04:56:09 - INFO - __main__ - Step 54828: {'lr': 0.0003586150319393051, 'samples': 10526976, 'steps': 54827, 'loss/train': 1.2950458526611328} 11/07/2021 04:56:10 - INFO - __main__ - Step 54829: {'lr': 0.00035861025217760924, 'samples': 10527168, 'steps': 54828, 'loss/train': 1.3377083539962769} 11/07/2021 04:56:10 - INFO - __main__ - Step 54830: {'lr': 0.00035860547236697525, 'samples': 10527360, 'steps': 54829, 'loss/train': 1.8295022249221802} 11/07/2021 04:56:11 - INFO - __main__ - Step 54831: {'lr': 0.0003586006925074053, 'samples': 10527552, 'steps': 54830, 'loss/train': 1.5570582151412964} 11/07/2021 04:56:11 - INFO - __main__ - Step 54832: {'lr': 0.0003585959125989015, 'samples': 10527744, 'steps': 54831, 'loss/train': 1.6938668489456177} 11/07/2021 04:56:11 - INFO - __main__ - Step 54833: {'lr': 0.00035859113264146607, 'samples': 10527936, 'steps': 54832, 'loss/train': 1.421567440032959} 11/07/2021 04:56:12 - INFO - __main__ - Step 54834: {'lr': 0.00035858635263510117, 'samples': 10528128, 'steps': 54833, 'loss/train': 1.3689855337142944} 11/07/2021 04:56:13 - INFO - __main__ - Step 54835: {'lr': 0.00035858157257980894, 'samples': 10528320, 'steps': 54834, 'loss/train': 1.3991369009017944} 11/07/2021 04:56:13 - INFO - __main__ - Step 54836: {'lr': 0.0003585767924755916, 'samples': 10528512, 'steps': 54835, 'loss/train': 1.5207114219665527} 11/07/2021 04:56:13 - INFO - __main__ - Step 54837: {'lr': 0.0003585720123224512, 'samples': 10528704, 'steps': 54836, 'loss/train': 1.5550506114959717} 11/07/2021 04:56:14 - INFO - __main__ - Step 54838: {'lr': 0.00035856723212038987, 'samples': 10528896, 'steps': 54837, 'loss/train': 1.6766122579574585} 11/07/2021 04:56:16 - INFO - __main__ - Step 54839: {'lr': 0.0003585624518694098, 'samples': 10529088, 'steps': 54838, 'loss/train': 1.085741639137268} 11/07/2021 04:56:16 - INFO - __main__ - Step 54840: {'lr': 0.00035855767156951323, 'samples': 10529280, 'steps': 54839, 'loss/train': 1.7896482944488525} 11/07/2021 04:56:16 - INFO - __main__ - Step 54841: {'lr': 0.0003585528912207022, 'samples': 10529472, 'steps': 54840, 'loss/train': 1.8680771589279175} 11/07/2021 04:56:17 - INFO - __main__ - Step 54842: {'lr': 0.0003585481108229789, 'samples': 10529664, 'steps': 54841, 'loss/train': 1.7494255304336548} 11/07/2021 04:56:17 - INFO - __main__ - Step 54843: {'lr': 0.0003585433303763456, 'samples': 10529856, 'steps': 54842, 'loss/train': 1.7449979782104492} 11/07/2021 04:56:17 - INFO - __main__ - Step 54844: {'lr': 0.0003585385498808043, 'samples': 10530048, 'steps': 54843, 'loss/train': 1.756689190864563} 11/07/2021 04:56:18 - INFO - __main__ - Step 54845: {'lr': 0.00035853376933635717, 'samples': 10530240, 'steps': 54844, 'loss/train': 1.3483474254608154} 11/07/2021 04:56:19 - INFO - __main__ - Step 54846: {'lr': 0.0003585289887430064, 'samples': 10530432, 'steps': 54845, 'loss/train': 0.672095537185669} 11/07/2021 04:56:19 - INFO - __main__ - Step 54847: {'lr': 0.0003585242081007542, 'samples': 10530624, 'steps': 54846, 'loss/train': 1.7137326002120972} 11/07/2021 04:56:19 - INFO - __main__ - Step 54848: {'lr': 0.0003585194274096026, 'samples': 10530816, 'steps': 54847, 'loss/train': 1.1771575212478638} 11/07/2021 04:56:20 - INFO - __main__ - Step 54849: {'lr': 0.00035851464666955383, 'samples': 10531008, 'steps': 54848, 'loss/train': 1.3936138153076172} 11/07/2021 04:56:20 - INFO - __main__ - Step 54850: {'lr': 0.0003585098658806101, 'samples': 10531200, 'steps': 54849, 'loss/train': 1.1228164434432983} 11/07/2021 04:56:21 - INFO - __main__ - Step 54851: {'lr': 0.00035850508504277345, 'samples': 10531392, 'steps': 54850, 'loss/train': 1.3927152156829834} 11/07/2021 04:56:22 - INFO - __main__ - Step 54852: {'lr': 0.0003585003041560461, 'samples': 10531584, 'steps': 54851, 'loss/train': 1.5183683633804321} 11/07/2021 04:56:22 - INFO - __main__ - Step 54853: {'lr': 0.00035849552322043016, 'samples': 10531776, 'steps': 54852, 'loss/train': 1.6004995107650757} 11/07/2021 04:56:22 - INFO - __main__ - Step 54854: {'lr': 0.0003584907422359278, 'samples': 10531968, 'steps': 54853, 'loss/train': 1.502145528793335} 11/07/2021 04:56:23 - INFO - __main__ - Step 54855: {'lr': 0.00035848596120254125, 'samples': 10532160, 'steps': 54854, 'loss/train': 1.7841987609863281} 11/07/2021 04:56:24 - INFO - __main__ - Step 54856: {'lr': 0.0003584811801202726, 'samples': 10532352, 'steps': 54855, 'loss/train': 1.1733726263046265} 11/07/2021 04:56:24 - INFO - __main__ - Step 54857: {'lr': 0.00035847639898912395, 'samples': 10532544, 'steps': 54856, 'loss/train': 1.5412342548370361} 11/07/2021 04:56:25 - INFO - __main__ - Step 54858: {'lr': 0.00035847161780909746, 'samples': 10532736, 'steps': 54857, 'loss/train': 1.5491420030593872} 11/07/2021 04:56:25 - INFO - __main__ - Step 54859: {'lr': 0.0003584668365801954, 'samples': 10532928, 'steps': 54858, 'loss/train': 1.6432112455368042} 11/07/2021 04:56:25 - INFO - __main__ - Step 54860: {'lr': 0.00035846205530241985, 'samples': 10533120, 'steps': 54859, 'loss/train': 1.6675420999526978} 11/07/2021 04:56:26 - INFO - __main__ - Step 54861: {'lr': 0.00035845727397577296, 'samples': 10533312, 'steps': 54860, 'loss/train': 2.051708698272705} 11/07/2021 04:56:27 - INFO - __main__ - Step 54862: {'lr': 0.0003584524926002569, 'samples': 10533504, 'steps': 54861, 'loss/train': 2.053427219390869} 11/07/2021 04:56:27 - INFO - __main__ - Step 54863: {'lr': 0.00035844771117587396, 'samples': 10533696, 'steps': 54862, 'loss/train': 1.2147974967956543} 11/07/2021 04:56:27 - INFO - __main__ - Step 54864: {'lr': 0.0003584429297026259, 'samples': 10533888, 'steps': 54863, 'loss/train': 1.6371829509735107} 11/07/2021 04:56:28 - INFO - __main__ - Step 54865: {'lr': 0.00035843814818051537, 'samples': 10534080, 'steps': 54864, 'loss/train': 1.8123425245285034} 11/07/2021 04:56:29 - INFO - __main__ - Step 54866: {'lr': 0.0003584333666095441, 'samples': 10534272, 'steps': 54865, 'loss/train': 1.2196067571640015} 11/07/2021 04:56:29 - INFO - __main__ - Step 54867: {'lr': 0.0003584285849897145, 'samples': 10534464, 'steps': 54866, 'loss/train': 1.0219111442565918} 11/07/2021 04:56:30 - INFO - __main__ - Step 54868: {'lr': 0.00035842380332102864, 'samples': 10534656, 'steps': 54867, 'loss/train': 1.749085545539856} 11/07/2021 04:56:30 - INFO - __main__ - Step 54869: {'lr': 0.0003584190216034887, 'samples': 10534848, 'steps': 54868, 'loss/train': 1.6745190620422363} 11/07/2021 04:56:30 - INFO - __main__ - Step 54870: {'lr': 0.0003584142398370969, 'samples': 10535040, 'steps': 54869, 'loss/train': 1.1375041007995605} 11/07/2021 04:56:32 - INFO - __main__ - Step 54871: {'lr': 0.0003584094580218552, 'samples': 10535232, 'steps': 54870, 'loss/train': 1.648248314857483} 11/07/2021 04:56:32 - INFO - __main__ - Step 54872: {'lr': 0.00035840467615776584, 'samples': 10535424, 'steps': 54871, 'loss/train': 1.3192315101623535} 11/07/2021 04:56:32 - INFO - __main__ - Step 54873: {'lr': 0.0003583998942448311, 'samples': 10535616, 'steps': 54872, 'loss/train': 1.1297112703323364} 11/07/2021 04:56:33 - INFO - __main__ - Step 54874: {'lr': 0.000358395112283053, 'samples': 10535808, 'steps': 54873, 'loss/train': 0.830177903175354} 11/07/2021 04:56:33 - INFO - __main__ - Step 54875: {'lr': 0.00035839033027243374, 'samples': 10536000, 'steps': 54874, 'loss/train': 1.4521688222885132} 11/07/2021 04:56:33 - INFO - __main__ - Step 54876: {'lr': 0.0003583855482129755, 'samples': 10536192, 'steps': 54875, 'loss/train': 1.3463702201843262} 11/07/2021 04:56:34 - INFO - __main__ - Step 54877: {'lr': 0.0003583807661046804, 'samples': 10536384, 'steps': 54876, 'loss/train': 1.3776121139526367} 11/07/2021 04:56:35 - INFO - __main__ - Step 54878: {'lr': 0.0003583759839475506, 'samples': 10536576, 'steps': 54877, 'loss/train': 1.6864997148513794} 11/07/2021 04:56:35 - INFO - __main__ - Step 54879: {'lr': 0.00035837120174158824, 'samples': 10536768, 'steps': 54878, 'loss/train': 1.2307826280593872} 11/07/2021 04:56:35 - INFO - __main__ - Step 54880: {'lr': 0.00035836641948679544, 'samples': 10536960, 'steps': 54879, 'loss/train': 1.410631537437439} 11/07/2021 04:56:36 - INFO - __main__ - Step 54881: {'lr': 0.0003583616371831745, 'samples': 10537152, 'steps': 54880, 'loss/train': 1.5849672555923462} 11/07/2021 04:56:37 - INFO - __main__ - Step 54882: {'lr': 0.0003583568548307274, 'samples': 10537344, 'steps': 54881, 'loss/train': 1.581113338470459} 11/07/2021 04:56:37 - INFO - __main__ - Step 54883: {'lr': 0.0003583520724294564, 'samples': 10537536, 'steps': 54882, 'loss/train': 1.610122561454773} 11/07/2021 04:56:37 - INFO - __main__ - Step 54884: {'lr': 0.0003583472899793636, 'samples': 10537728, 'steps': 54883, 'loss/train': 1.4421613216400146} 11/07/2021 04:56:38 - INFO - __main__ - Step 54885: {'lr': 0.0003583425074804512, 'samples': 10537920, 'steps': 54884, 'loss/train': 1.5189831256866455} 11/07/2021 04:56:38 - INFO - __main__ - Step 54886: {'lr': 0.0003583377249327213, 'samples': 10538112, 'steps': 54885, 'loss/train': 1.2513885498046875} 11/07/2021 04:56:39 - INFO - __main__ - Step 54887: {'lr': 0.00035833294233617626, 'samples': 10538304, 'steps': 54886, 'loss/train': 0.901969850063324} 11/07/2021 04:56:40 - INFO - __main__ - Step 54888: {'lr': 0.0003583281596908179, 'samples': 10538496, 'steps': 54887, 'loss/train': 1.3590373992919922} 11/07/2021 04:56:40 - INFO - __main__ - Step 54889: {'lr': 0.00035832337699664865, 'samples': 10538688, 'steps': 54888, 'loss/train': 1.3569122552871704} 11/07/2021 04:56:40 - INFO - __main__ - Step 54890: {'lr': 0.0003583185942536704, 'samples': 10538880, 'steps': 54889, 'loss/train': 1.6509311199188232} 11/07/2021 04:56:41 - INFO - __main__ - Step 54891: {'lr': 0.00035831381146188556, 'samples': 10539072, 'steps': 54890, 'loss/train': 1.7270584106445312} 11/07/2021 04:56:42 - INFO - __main__ - Step 54892: {'lr': 0.00035830902862129627, 'samples': 10539264, 'steps': 54891, 'loss/train': 1.676131010055542} 11/07/2021 04:56:42 - INFO - __main__ - Step 54893: {'lr': 0.0003583042457319045, 'samples': 10539456, 'steps': 54892, 'loss/train': 1.7393121719360352} 11/07/2021 04:56:42 - INFO - __main__ - Step 54894: {'lr': 0.0003582994627937125, 'samples': 10539648, 'steps': 54893, 'loss/train': 1.097292423248291} 11/07/2021 04:56:43 - INFO - __main__ - Step 54895: {'lr': 0.00035829467980672247, 'samples': 10539840, 'steps': 54894, 'loss/train': 2.1243093013763428} 11/07/2021 04:56:43 - INFO - __main__ - Step 54896: {'lr': 0.00035828989677093656, 'samples': 10540032, 'steps': 54895, 'loss/train': 1.4508275985717773} 11/07/2021 04:56:44 - INFO - __main__ - Step 54897: {'lr': 0.00035828511368635684, 'samples': 10540224, 'steps': 54896, 'loss/train': 1.221198558807373} 11/07/2021 04:56:44 - INFO - __main__ - Step 54898: {'lr': 0.0003582803305529856, 'samples': 10540416, 'steps': 54897, 'loss/train': 1.4680225849151611} 11/07/2021 04:56:45 - INFO - __main__ - Step 54899: {'lr': 0.0003582755473708248, 'samples': 10540608, 'steps': 54898, 'loss/train': 1.4800821542739868} 11/07/2021 04:56:45 - INFO - __main__ - Step 54900: {'lr': 0.00035827076413987675, 'samples': 10540800, 'steps': 54899, 'loss/train': 0.9042848348617554} 11/07/2021 04:56:46 - INFO - __main__ - Step 54901: {'lr': 0.00035826598086014357, 'samples': 10540992, 'steps': 54900, 'loss/train': 1.3569746017456055} 11/07/2021 04:56:47 - INFO - __main__ - Step 54902: {'lr': 0.0003582611975316274, 'samples': 10541184, 'steps': 54901, 'loss/train': 1.4082578420639038} 11/07/2021 04:56:47 - INFO - __main__ - Step 54903: {'lr': 0.00035825641415433045, 'samples': 10541376, 'steps': 54902, 'loss/train': 1.1146998405456543} 11/07/2021 04:56:47 - INFO - __main__ - Step 54904: {'lr': 0.0003582516307282548, 'samples': 10541568, 'steps': 54903, 'loss/train': 1.0259791612625122} 11/07/2021 04:56:48 - INFO - __main__ - Step 54905: {'lr': 0.00035824684725340263, 'samples': 10541760, 'steps': 54904, 'loss/train': 1.6249933242797852} 11/07/2021 04:56:48 - INFO - __main__ - Step 54906: {'lr': 0.00035824206372977606, 'samples': 10541952, 'steps': 54905, 'loss/train': 1.9509371519088745} 11/07/2021 04:56:49 - INFO - __main__ - Step 54907: {'lr': 0.00035823728015737735, 'samples': 10542144, 'steps': 54906, 'loss/train': 1.9668883085250854} 11/07/2021 04:56:50 - INFO - __main__ - Step 54908: {'lr': 0.0003582324965362086, 'samples': 10542336, 'steps': 54907, 'loss/train': 1.0915250778198242} 11/07/2021 04:56:50 - INFO - __main__ - Step 54909: {'lr': 0.0003582277128662719, 'samples': 10542528, 'steps': 54908, 'loss/train': 1.6986463069915771} 11/07/2021 04:56:50 - INFO - __main__ - Step 54910: {'lr': 0.00035822292914756954, 'samples': 10542720, 'steps': 54909, 'loss/train': 1.019133448600769} 11/07/2021 04:56:51 - INFO - __main__ - Step 54911: {'lr': 0.00035821814538010356, 'samples': 10542912, 'steps': 54910, 'loss/train': 1.1962698698043823} 11/07/2021 04:56:51 - INFO - __main__ - Step 54912: {'lr': 0.00035821336156387614, 'samples': 10543104, 'steps': 54911, 'loss/train': 1.3298062086105347} 11/07/2021 04:56:52 - INFO - __main__ - Step 54913: {'lr': 0.00035820857769888943, 'samples': 10543296, 'steps': 54912, 'loss/train': 1.249089241027832} 11/07/2021 04:56:52 - INFO - __main__ - Step 54914: {'lr': 0.0003582037937851456, 'samples': 10543488, 'steps': 54913, 'loss/train': 1.4023022651672363} 11/07/2021 04:56:53 - INFO - __main__ - Step 54915: {'lr': 0.00035819900982264684, 'samples': 10543680, 'steps': 54914, 'loss/train': 1.013444423675537} 11/07/2021 04:56:53 - INFO - __main__ - Step 54916: {'lr': 0.0003581942258113953, 'samples': 10543872, 'steps': 54915, 'loss/train': 1.0797520875930786} 11/07/2021 04:56:53 - INFO - __main__ - Step 54917: {'lr': 0.00035818944175139314, 'samples': 10544064, 'steps': 54916, 'loss/train': 1.686318039894104} 11/07/2021 04:56:54 - INFO - __main__ - Step 54918: {'lr': 0.0003581846576426423, 'samples': 10544256, 'steps': 54917, 'loss/train': 1.8161580562591553} 11/07/2021 04:56:55 - INFO - __main__ - Step 54919: {'lr': 0.0003581798734851453, 'samples': 10544448, 'steps': 54918, 'loss/train': 1.405177354812622} 11/07/2021 04:56:55 - INFO - __main__ - Step 54920: {'lr': 0.00035817508927890406, 'samples': 10544640, 'steps': 54919, 'loss/train': 1.540561556816101} 11/07/2021 04:56:55 - INFO - __main__ - Step 54921: {'lr': 0.00035817030502392083, 'samples': 10544832, 'steps': 54920, 'loss/train': 1.726969599723816} 11/07/2021 04:56:56 - INFO - __main__ - Step 54922: {'lr': 0.0003581655207201977, 'samples': 10545024, 'steps': 54921, 'loss/train': 0.9665097594261169} 11/07/2021 04:56:57 - INFO - __main__ - Step 54923: {'lr': 0.00035816073636773686, 'samples': 10545216, 'steps': 54922, 'loss/train': 1.431938648223877} 11/07/2021 04:56:57 - INFO - __main__ - Step 54924: {'lr': 0.0003581559519665405, 'samples': 10545408, 'steps': 54923, 'loss/train': 1.5702717304229736} 11/07/2021 04:56:58 - INFO - __main__ - Step 54925: {'lr': 0.0003581511675166107, 'samples': 10545600, 'steps': 54924, 'loss/train': 1.6298109292984009} 11/07/2021 04:56:58 - INFO - __main__ - Step 54926: {'lr': 0.00035814638301794966, 'samples': 10545792, 'steps': 54925, 'loss/train': 1.6957331895828247} 11/07/2021 04:56:58 - INFO - __main__ - Step 54927: {'lr': 0.0003581415984705595, 'samples': 10545984, 'steps': 54926, 'loss/train': 2.11731219291687} 11/07/2021 04:56:59 - INFO - __main__ - Step 54928: {'lr': 0.0003581368138744424, 'samples': 10546176, 'steps': 54927, 'loss/train': 1.3969091176986694} 11/07/2021 04:57:00 - INFO - __main__ - Step 54929: {'lr': 0.00035813202922960056, 'samples': 10546368, 'steps': 54928, 'loss/train': 1.469664454460144} 11/07/2021 04:57:00 - INFO - __main__ - Step 54930: {'lr': 0.00035812724453603614, 'samples': 10546560, 'steps': 54929, 'loss/train': 1.673553466796875} 11/07/2021 04:57:00 - INFO - __main__ - Step 54931: {'lr': 0.00035812245979375114, 'samples': 10546752, 'steps': 54930, 'loss/train': 1.3558012247085571} 11/07/2021 04:57:01 - INFO - __main__ - Step 54932: {'lr': 0.0003581176750027479, 'samples': 10546944, 'steps': 54931, 'loss/train': 1.2782957553863525} 11/07/2021 04:57:02 - INFO - __main__ - Step 54933: {'lr': 0.00035811289016302847, 'samples': 10547136, 'steps': 54932, 'loss/train': 0.5100459456443787} 11/07/2021 04:57:02 - INFO - __main__ - Step 54934: {'lr': 0.000358108105274595, 'samples': 10547328, 'steps': 54933, 'loss/train': 1.7913730144500732} 11/07/2021 04:57:02 - INFO - __main__ - Step 54935: {'lr': 0.0003581033203374498, 'samples': 10547520, 'steps': 54934, 'loss/train': 2.0409095287323} 11/07/2021 04:57:03 - INFO - __main__ - Step 54936: {'lr': 0.0003580985353515948, 'samples': 10547712, 'steps': 54935, 'loss/train': 1.9671167135238647} 11/07/2021 04:57:03 - INFO - __main__ - Step 54937: {'lr': 0.0003580937503170324, 'samples': 10547904, 'steps': 54936, 'loss/train': 1.8756842613220215} 11/07/2021 04:57:03 - INFO - __main__ - Step 54938: {'lr': 0.00035808896523376456, 'samples': 10548096, 'steps': 54937, 'loss/train': 1.152647614479065} 11/07/2021 04:57:05 - INFO - __main__ - Step 54939: {'lr': 0.00035808418010179345, 'samples': 10548288, 'steps': 54938, 'loss/train': 1.2513864040374756} 11/07/2021 04:57:05 - INFO - __main__ - Step 54940: {'lr': 0.0003580793949211213, 'samples': 10548480, 'steps': 54939, 'loss/train': 0.8941771984100342} 11/07/2021 04:57:05 - INFO - __main__ - Step 54941: {'lr': 0.00035807460969175027, 'samples': 10548672, 'steps': 54940, 'loss/train': 1.5848958492279053} 11/07/2021 04:57:06 - INFO - __main__ - Step 54942: {'lr': 0.0003580698244136825, 'samples': 10548864, 'steps': 54941, 'loss/train': 1.5485233068466187} 11/07/2021 04:57:06 - INFO - __main__ - Step 54943: {'lr': 0.0003580650390869201, 'samples': 10549056, 'steps': 54942, 'loss/train': 1.5843173265457153} 11/07/2021 04:57:07 - INFO - __main__ - Step 54944: {'lr': 0.0003580602537114653, 'samples': 10549248, 'steps': 54943, 'loss/train': 1.5410370826721191} 11/07/2021 04:57:07 - INFO - __main__ - Step 54945: {'lr': 0.0003580554682873202, 'samples': 10549440, 'steps': 54944, 'loss/train': 1.7643157243728638} 11/07/2021 04:57:08 - INFO - __main__ - Step 54946: {'lr': 0.00035805068281448687, 'samples': 10549632, 'steps': 54945, 'loss/train': 1.6490561962127686} 11/07/2021 04:57:08 - INFO - __main__ - Step 54947: {'lr': 0.00035804589729296766, 'samples': 10549824, 'steps': 54946, 'loss/train': 1.2998273372650146} 11/07/2021 04:57:08 - INFO - __main__ - Step 54948: {'lr': 0.00035804111172276464, 'samples': 10550016, 'steps': 54947, 'loss/train': 1.4899073839187622} 11/07/2021 04:57:09 - INFO - __main__ - Step 54949: {'lr': 0.00035803632610388, 'samples': 10550208, 'steps': 54948, 'loss/train': 1.5985949039459229} 11/07/2021 04:57:10 - INFO - __main__ - Step 54950: {'lr': 0.0003580315404363158, 'samples': 10550400, 'steps': 54949, 'loss/train': 1.704199194908142} 11/07/2021 04:57:10 - INFO - __main__ - Step 54951: {'lr': 0.0003580267547200743, 'samples': 10550592, 'steps': 54950, 'loss/train': 1.5796351432800293} 11/07/2021 04:57:11 - INFO - __main__ - Step 54952: {'lr': 0.00035802196895515757, 'samples': 10550784, 'steps': 54951, 'loss/train': 1.6180907487869263} 11/07/2021 04:57:11 - INFO - __main__ - Step 54953: {'lr': 0.00035801718314156785, 'samples': 10550976, 'steps': 54952, 'loss/train': 1.3222569227218628} 11/07/2021 04:57:12 - INFO - __main__ - Step 54954: {'lr': 0.00035801239727930716, 'samples': 10551168, 'steps': 54953, 'loss/train': 1.3321624994277954} 11/07/2021 04:57:12 - INFO - __main__ - Step 54955: {'lr': 0.00035800761136837783, 'samples': 10551360, 'steps': 54954, 'loss/train': 1.2968820333480835} 11/07/2021 04:57:13 - INFO - __main__ - Step 54956: {'lr': 0.0003580028254087819, 'samples': 10551552, 'steps': 54955, 'loss/train': 1.2862237691879272} 11/07/2021 04:57:13 - INFO - __main__ - Step 54957: {'lr': 0.00035799803940052163, 'samples': 10551744, 'steps': 54956, 'loss/train': 1.720456838607788} 11/07/2021 04:57:13 - INFO - __main__ - Step 54958: {'lr': 0.00035799325334359906, 'samples': 10551936, 'steps': 54957, 'loss/train': 1.7125890254974365} 11/07/2021 04:57:14 - INFO - __main__ - Step 54959: {'lr': 0.00035798846723801635, 'samples': 10552128, 'steps': 54958, 'loss/train': 1.3855805397033691} 11/07/2021 04:57:15 - INFO - __main__ - Step 54960: {'lr': 0.0003579836810837758, 'samples': 10552320, 'steps': 54959, 'loss/train': 1.5711387395858765} 11/07/2021 04:57:15 - INFO - __main__ - Step 54961: {'lr': 0.0003579788948808794, 'samples': 10552512, 'steps': 54960, 'loss/train': 1.1467931270599365} 11/07/2021 04:57:15 - INFO - __main__ - Step 54962: {'lr': 0.0003579741086293294, 'samples': 10552704, 'steps': 54961, 'loss/train': 1.2307853698730469} 11/07/2021 04:57:16 - INFO - __main__ - Step 54963: {'lr': 0.00035796932232912793, 'samples': 10552896, 'steps': 54962, 'loss/train': 1.5768113136291504} 11/07/2021 04:57:17 - INFO - __main__ - Step 54964: {'lr': 0.00035796453598027725, 'samples': 10553088, 'steps': 54963, 'loss/train': 1.4683003425598145} 11/07/2021 04:57:17 - INFO - __main__ - Step 54965: {'lr': 0.0003579597495827793, 'samples': 10553280, 'steps': 54964, 'loss/train': 1.5094189643859863} 11/07/2021 04:57:18 - INFO - __main__ - Step 54966: {'lr': 0.0003579549631366363, 'samples': 10553472, 'steps': 54965, 'loss/train': 1.278933048248291} 11/07/2021 04:57:18 - INFO - __main__ - Step 54967: {'lr': 0.0003579501766418505, 'samples': 10553664, 'steps': 54966, 'loss/train': 0.46214476227760315} 11/07/2021 04:57:18 - INFO - __main__ - Step 54968: {'lr': 0.0003579453900984241, 'samples': 10553856, 'steps': 54967, 'loss/train': 1.4764328002929688} 11/07/2021 04:57:19 - INFO - __main__ - Step 54969: {'lr': 0.0003579406035063591, 'samples': 10554048, 'steps': 54968, 'loss/train': 1.3582720756530762} 11/07/2021 04:57:20 - INFO - __main__ - Step 54970: {'lr': 0.0003579358168656577, 'samples': 10554240, 'steps': 54969, 'loss/train': 1.1853805780410767} 11/07/2021 04:57:20 - INFO - __main__ - Step 54971: {'lr': 0.00035793103017632224, 'samples': 10554432, 'steps': 54970, 'loss/train': 1.7739368677139282} 11/07/2021 04:57:21 - INFO - __main__ - Step 54972: {'lr': 0.0003579262434383546, 'samples': 10554624, 'steps': 54971, 'loss/train': 1.290099859237671} 11/07/2021 04:57:21 - INFO - __main__ - Step 54973: {'lr': 0.0003579214566517571, 'samples': 10554816, 'steps': 54972, 'loss/train': 1.2865920066833496} 11/07/2021 04:57:22 - INFO - __main__ - Step 54974: {'lr': 0.00035791666981653184, 'samples': 10555008, 'steps': 54973, 'loss/train': 0.5530202388763428} 11/07/2021 04:57:23 - INFO - __main__ - Step 54975: {'lr': 0.00035791188293268094, 'samples': 10555200, 'steps': 54974, 'loss/train': 1.7353119850158691} 11/07/2021 04:57:23 - INFO - __main__ - Step 54976: {'lr': 0.00035790709600020667, 'samples': 10555392, 'steps': 54975, 'loss/train': 1.444684386253357} 11/07/2021 04:57:23 - INFO - __main__ - Step 54977: {'lr': 0.00035790230901911114, 'samples': 10555584, 'steps': 54976, 'loss/train': 0.6288148760795593} 11/07/2021 04:57:24 - INFO - __main__ - Step 54978: {'lr': 0.00035789752198939646, 'samples': 10555776, 'steps': 54977, 'loss/train': 0.6787629127502441} 11/07/2021 04:57:24 - INFO - __main__ - Step 54979: {'lr': 0.00035789273491106485, 'samples': 10555968, 'steps': 54978, 'loss/train': 1.2206088304519653} 11/07/2021 04:57:25 - INFO - __main__ - Step 54980: {'lr': 0.00035788794778411837, 'samples': 10556160, 'steps': 54979, 'loss/train': 0.4907543957233429} 11/07/2021 04:57:25 - INFO - __main__ - Step 54981: {'lr': 0.0003578831606085593, 'samples': 10556352, 'steps': 54980, 'loss/train': 1.2815356254577637} 11/07/2021 04:57:26 - INFO - __main__ - Step 54982: {'lr': 0.00035787837338438976, 'samples': 10556544, 'steps': 54981, 'loss/train': 0.5801774859428406} 11/07/2021 04:57:26 - INFO - __main__ - Step 54983: {'lr': 0.00035787358611161186, 'samples': 10556736, 'steps': 54982, 'loss/train': 1.256115436553955} 11/07/2021 04:57:26 - INFO - __main__ - Step 54984: {'lr': 0.0003578687987902278, 'samples': 10556928, 'steps': 54983, 'loss/train': 0.5729624032974243} 11/07/2021 04:57:28 - INFO - __main__ - Step 54985: {'lr': 0.00035786401142023975, 'samples': 10557120, 'steps': 54984, 'loss/train': 1.3811451196670532} 11/07/2021 04:57:28 - INFO - __main__ - Step 54986: {'lr': 0.00035785922400164983, 'samples': 10557312, 'steps': 54985, 'loss/train': 1.2688241004943848} 11/07/2021 04:57:28 - INFO - __main__ - Step 54987: {'lr': 0.00035785443653446017, 'samples': 10557504, 'steps': 54986, 'loss/train': 1.331153392791748} 11/07/2021 04:57:29 - INFO - __main__ - Step 54988: {'lr': 0.000357849649018673, 'samples': 10557696, 'steps': 54987, 'loss/train': 1.2635180950164795} 11/07/2021 04:57:29 - INFO - __main__ - Step 54989: {'lr': 0.0003578448614542904, 'samples': 10557888, 'steps': 54988, 'loss/train': 0.9246165752410889} 11/07/2021 04:57:30 - INFO - __main__ - Step 54990: {'lr': 0.0003578400738413146, 'samples': 10558080, 'steps': 54989, 'loss/train': 1.4906529188156128} 11/07/2021 04:57:30 - INFO - __main__ - Step 54991: {'lr': 0.00035783528617974774, 'samples': 10558272, 'steps': 54990, 'loss/train': 0.8243188261985779} 11/07/2021 04:57:31 - INFO - __main__ - Step 54992: {'lr': 0.000357830498469592, 'samples': 10558464, 'steps': 54991, 'loss/train': 1.657012939453125} 11/07/2021 04:57:31 - INFO - __main__ - Step 54993: {'lr': 0.0003578257107108494, 'samples': 10558656, 'steps': 54992, 'loss/train': 1.5241446495056152} 11/07/2021 04:57:31 - INFO - __main__ - Step 54994: {'lr': 0.0003578209229035222, 'samples': 10558848, 'steps': 54993, 'loss/train': 0.8030633926391602} 11/07/2021 04:57:32 - INFO - __main__ - Step 54995: {'lr': 0.0003578161350476127, 'samples': 10559040, 'steps': 54994, 'loss/train': 1.4325087070465088} 11/07/2021 04:57:33 - INFO - __main__ - Step 54996: {'lr': 0.00035781134714312277, 'samples': 10559232, 'steps': 54995, 'loss/train': 1.1097663640975952} 11/07/2021 04:57:33 - INFO - __main__ - Step 54997: {'lr': 0.0003578065591900548, 'samples': 10559424, 'steps': 54996, 'loss/train': 1.9446706771850586} 11/07/2021 04:57:34 - INFO - __main__ - Step 54998: {'lr': 0.0003578017711884108, 'samples': 10559616, 'steps': 54997, 'loss/train': 0.1882682591676712} 11/07/2021 04:57:34 - INFO - __main__ - Step 54999: {'lr': 0.000357796983138193, 'samples': 10559808, 'steps': 54998, 'loss/train': 1.3456265926361084} 11/07/2021 04:57:35 - INFO - __main__ - Step 55000: {'lr': 0.0003577921950394035, 'samples': 10560000, 'steps': 54999, 'loss/train': 1.5979148149490356} 11/07/2021 04:57:35 - INFO - __main__ - Step 55001: {'lr': 0.00035778740689204456, 'samples': 10560192, 'steps': 55000, 'loss/train': 1.1905561685562134} 11/07/2021 04:57:36 - INFO - __main__ - Step 55002: {'lr': 0.0003577826186961183, 'samples': 10560384, 'steps': 55001, 'loss/train': 0.9871057868003845} 11/07/2021 04:57:36 - INFO - __main__ - Step 55003: {'lr': 0.0003577778304516268, 'samples': 10560576, 'steps': 55002, 'loss/train': 1.6301368474960327} 11/07/2021 04:57:37 - INFO - __main__ - Step 55004: {'lr': 0.0003577730421585723, 'samples': 10560768, 'steps': 55003, 'loss/train': 1.747949242591858} 11/07/2021 04:57:37 - INFO - __main__ - Step 55005: {'lr': 0.00035776825381695693, 'samples': 10560960, 'steps': 55004, 'loss/train': 0.9194250702857971} 11/07/2021 04:57:38 - INFO - __main__ - Step 55006: {'lr': 0.0003577634654267828, 'samples': 10561152, 'steps': 55005, 'loss/train': 1.667545199394226} 11/07/2021 04:57:38 - INFO - __main__ - Step 55007: {'lr': 0.0003577586769880522, 'samples': 10561344, 'steps': 55006, 'loss/train': 1.3301728963851929} 11/07/2021 04:57:39 - INFO - __main__ - Step 55008: {'lr': 0.00035775388850076714, 'samples': 10561536, 'steps': 55007, 'loss/train': 1.6160812377929688} 11/07/2021 04:57:39 - INFO - __main__ - Step 55009: {'lr': 0.0003577490999649298, 'samples': 10561728, 'steps': 55008, 'loss/train': 2.0927774906158447} 11/07/2021 04:57:39 - INFO - __main__ - Step 55010: {'lr': 0.0003577443113805425, 'samples': 10561920, 'steps': 55009, 'loss/train': 1.492805004119873} 11/07/2021 04:57:40 - INFO - __main__ - Step 55011: {'lr': 0.00035773952274760723, 'samples': 10562112, 'steps': 55010, 'loss/train': 1.437565565109253} 11/07/2021 04:57:41 - INFO - __main__ - Step 55012: {'lr': 0.00035773473406612615, 'samples': 10562304, 'steps': 55011, 'loss/train': 1.6526776552200317} 11/07/2021 04:57:41 - INFO - __main__ - Step 55013: {'lr': 0.0003577299453361015, 'samples': 10562496, 'steps': 55012, 'loss/train': 1.2323278188705444} 11/07/2021 04:57:41 - INFO - __main__ - Step 55014: {'lr': 0.00035772515655753536, 'samples': 10562688, 'steps': 55013, 'loss/train': 1.0117645263671875} 11/07/2021 04:57:42 - INFO - __main__ - Step 55015: {'lr': 0.00035772036773042994, 'samples': 10562880, 'steps': 55014, 'loss/train': 1.9306058883666992} 11/07/2021 04:57:43 - INFO - __main__ - Step 55016: {'lr': 0.00035771557885478744, 'samples': 10563072, 'steps': 55015, 'loss/train': 1.0696053504943848} 11/07/2021 04:57:43 - INFO - __main__ - Step 55017: {'lr': 0.0003577107899306099, 'samples': 10563264, 'steps': 55016, 'loss/train': 1.4434189796447754} 11/07/2021 04:57:43 - INFO - __main__ - Step 55018: {'lr': 0.00035770600095789957, 'samples': 10563456, 'steps': 55017, 'loss/train': 1.374860167503357} 11/07/2021 04:57:44 - INFO - __main__ - Step 55019: {'lr': 0.0003577012119366586, 'samples': 10563648, 'steps': 55018, 'loss/train': 1.7924306392669678} 11/07/2021 04:57:44 - INFO - __main__ - Step 55020: {'lr': 0.00035769642286688903, 'samples': 10563840, 'steps': 55019, 'loss/train': 1.1546483039855957} 11/07/2021 04:57:45 - INFO - __main__ - Step 55021: {'lr': 0.00035769163374859325, 'samples': 10564032, 'steps': 55020, 'loss/train': 1.1657540798187256} 11/07/2021 04:57:45 - INFO - __main__ - Step 55022: {'lr': 0.0003576868445817732, 'samples': 10564224, 'steps': 55021, 'loss/train': 1.4878591299057007} 11/07/2021 04:57:46 - INFO - __main__ - Step 55023: {'lr': 0.0003576820553664311, 'samples': 10564416, 'steps': 55022, 'loss/train': 1.4696134328842163} 11/07/2021 04:57:46 - INFO - __main__ - Step 55024: {'lr': 0.0003576772661025691, 'samples': 10564608, 'steps': 55023, 'loss/train': 1.193994402885437} 11/07/2021 04:57:46 - INFO - __main__ - Step 55025: {'lr': 0.0003576724767901895, 'samples': 10564800, 'steps': 55024, 'loss/train': 1.5176256895065308} 11/07/2021 04:57:48 - INFO - __main__ - Step 55026: {'lr': 0.00035766768742929436, 'samples': 10564992, 'steps': 55025, 'loss/train': 1.5638397932052612} 11/07/2021 04:57:48 - INFO - __main__ - Step 55027: {'lr': 0.00035766289801988574, 'samples': 10565184, 'steps': 55026, 'loss/train': 1.2947450876235962} 11/07/2021 04:57:48 - INFO - __main__ - Step 55028: {'lr': 0.00035765810856196585, 'samples': 10565376, 'steps': 55027, 'loss/train': 1.4492391347885132} 11/07/2021 04:57:49 - INFO - __main__ - Step 55029: {'lr': 0.00035765331905553686, 'samples': 10565568, 'steps': 55028, 'loss/train': 1.794018030166626} 11/07/2021 04:57:49 - INFO - __main__ - Step 55030: {'lr': 0.000357648529500601, 'samples': 10565760, 'steps': 55029, 'loss/train': 1.7248971462249756} 11/07/2021 04:57:50 - INFO - __main__ - Step 55031: {'lr': 0.00035764373989716035, 'samples': 10565952, 'steps': 55030, 'loss/train': 0.8968067169189453} 11/07/2021 04:57:50 - INFO - __main__ - Step 55032: {'lr': 0.0003576389502452172, 'samples': 10566144, 'steps': 55031, 'loss/train': 1.6573113203048706} 11/07/2021 04:57:51 - INFO - __main__ - Step 55033: {'lr': 0.0003576341605447735, 'samples': 10566336, 'steps': 55032, 'loss/train': 1.3684130907058716} 11/07/2021 04:57:51 - INFO - __main__ - Step 55034: {'lr': 0.0003576293707958315, 'samples': 10566528, 'steps': 55033, 'loss/train': 1.1835936307907104} 11/07/2021 04:57:51 - INFO - __main__ - Step 55035: {'lr': 0.0003576245809983934, 'samples': 10566720, 'steps': 55034, 'loss/train': 1.1828055381774902} 11/07/2021 04:57:52 - INFO - __main__ - Step 55036: {'lr': 0.0003576197911524613, 'samples': 10566912, 'steps': 55035, 'loss/train': 1.211046814918518} 11/07/2021 04:57:53 - INFO - __main__ - Step 55037: {'lr': 0.0003576150012580374, 'samples': 10567104, 'steps': 55036, 'loss/train': 1.112857699394226} 11/07/2021 04:57:53 - INFO - __main__ - Step 55038: {'lr': 0.00035761021131512383, 'samples': 10567296, 'steps': 55037, 'loss/train': 1.4128081798553467} 11/07/2021 04:57:54 - INFO - __main__ - Step 55039: {'lr': 0.00035760542132372275, 'samples': 10567488, 'steps': 55038, 'loss/train': 1.4762065410614014} 11/07/2021 04:57:54 - INFO - __main__ - Step 55040: {'lr': 0.00035760063128383637, 'samples': 10567680, 'steps': 55039, 'loss/train': 1.4175198078155518} 11/07/2021 04:57:54 - INFO - __main__ - Step 55041: {'lr': 0.0003575958411954668, 'samples': 10567872, 'steps': 55040, 'loss/train': 2.396432638168335} 11/07/2021 04:57:55 - INFO - __main__ - Step 55042: {'lr': 0.00035759105105861614, 'samples': 10568064, 'steps': 55041, 'loss/train': 1.0133460760116577} 11/07/2021 04:57:56 - INFO - __main__ - Step 55043: {'lr': 0.00035758626087328664, 'samples': 10568256, 'steps': 55042, 'loss/train': 1.1450141668319702} 11/07/2021 04:57:56 - INFO - __main__ - Step 55044: {'lr': 0.00035758147063948056, 'samples': 10568448, 'steps': 55043, 'loss/train': 2.798823595046997} 11/07/2021 04:57:56 - INFO - __main__ - Step 55045: {'lr': 0.00035757668035719974, 'samples': 10568640, 'steps': 55044, 'loss/train': 1.7040687799453735} 11/07/2021 04:57:57 - INFO - __main__ - Step 55046: {'lr': 0.00035757189002644664, 'samples': 10568832, 'steps': 55045, 'loss/train': 1.5583311319351196} 11/07/2021 04:57:58 - INFO - __main__ - Step 55047: {'lr': 0.00035756709964722324, 'samples': 10569024, 'steps': 55046, 'loss/train': 1.4309717416763306} 11/07/2021 04:57:58 - INFO - __main__ - Step 55048: {'lr': 0.00035756230921953183, 'samples': 10569216, 'steps': 55047, 'loss/train': 1.5375730991363525} 11/07/2021 04:57:58 - INFO - __main__ - Step 55049: {'lr': 0.0003575575187433744, 'samples': 10569408, 'steps': 55048, 'loss/train': 1.8171958923339844} 11/07/2021 04:57:59 - INFO - __main__ - Step 55050: {'lr': 0.0003575527282187533, 'samples': 10569600, 'steps': 55049, 'loss/train': 1.4335161447525024} 11/07/2021 04:57:59 - INFO - __main__ - Step 55051: {'lr': 0.00035754793764567063, 'samples': 10569792, 'steps': 55050, 'loss/train': 1.3283638954162598} 11/07/2021 04:58:00 - INFO - __main__ - Step 55052: {'lr': 0.0003575431470241285, 'samples': 10569984, 'steps': 55051, 'loss/train': 0.7707861661911011} 11/07/2021 04:58:00 - INFO - __main__ - Step 55053: {'lr': 0.000357538356354129, 'samples': 10570176, 'steps': 55052, 'loss/train': 1.370285153388977} 11/07/2021 04:58:01 - INFO - __main__ - Step 55054: {'lr': 0.0003575335656356744, 'samples': 10570368, 'steps': 55053, 'loss/train': 1.6500614881515503} 11/07/2021 04:58:01 - INFO - __main__ - Step 55055: {'lr': 0.0003575287748687669, 'samples': 10570560, 'steps': 55054, 'loss/train': 0.6962327361106873} 11/07/2021 04:58:02 - INFO - __main__ - Step 55056: {'lr': 0.0003575239840534086, 'samples': 10570752, 'steps': 55055, 'loss/train': 1.1676795482635498} 11/07/2021 04:58:03 - INFO - __main__ - Step 55057: {'lr': 0.00035751919318960157, 'samples': 10570944, 'steps': 55056, 'loss/train': 1.1781184673309326} 11/07/2021 04:58:03 - INFO - __main__ - Step 55058: {'lr': 0.0003575144022773481, 'samples': 10571136, 'steps': 55057, 'loss/train': 1.1429388523101807} 11/07/2021 04:58:03 - INFO - __main__ - Step 55059: {'lr': 0.00035750961131665034, 'samples': 10571328, 'steps': 55058, 'loss/train': 1.7127009630203247} 11/07/2021 04:58:04 - INFO - __main__ - Step 55060: {'lr': 0.0003575048203075103, 'samples': 10571520, 'steps': 55059, 'loss/train': 1.3227245807647705} 11/07/2021 04:58:04 - INFO - __main__ - Step 55061: {'lr': 0.0003575000292499303, 'samples': 10571712, 'steps': 55060, 'loss/train': 1.6034808158874512} 11/07/2021 04:58:04 - INFO - __main__ - Step 55062: {'lr': 0.0003574952381439125, 'samples': 10571904, 'steps': 55061, 'loss/train': 0.8018674254417419} 11/07/2021 04:58:05 - INFO - __main__ - Step 55063: {'lr': 0.0003574904469894589, 'samples': 10572096, 'steps': 55062, 'loss/train': 1.1603502035140991} 11/07/2021 04:58:06 - INFO - __main__ - Step 55064: {'lr': 0.00035748565578657185, 'samples': 10572288, 'steps': 55063, 'loss/train': 1.4817630052566528} 11/07/2021 04:58:06 - INFO - __main__ - Step 55065: {'lr': 0.0003574808645352534, 'samples': 10572480, 'steps': 55064, 'loss/train': 1.4915812015533447} 11/07/2021 04:58:07 - INFO - __main__ - Step 55066: {'lr': 0.00035747607323550573, 'samples': 10572672, 'steps': 55065, 'loss/train': 0.9114412069320679} 11/07/2021 04:58:07 - INFO - __main__ - Step 55067: {'lr': 0.000357471281887331, 'samples': 10572864, 'steps': 55066, 'loss/train': 1.3542866706848145} 11/07/2021 04:58:08 - INFO - __main__ - Step 55068: {'lr': 0.0003574664904907314, 'samples': 10573056, 'steps': 55067, 'loss/train': 1.4151079654693604} 11/07/2021 04:58:08 - INFO - __main__ - Step 55069: {'lr': 0.00035746169904570896, 'samples': 10573248, 'steps': 55068, 'loss/train': 1.4811794757843018} 11/07/2021 04:58:09 - INFO - __main__ - Step 55070: {'lr': 0.000357456907552266, 'samples': 10573440, 'steps': 55069, 'loss/train': 1.2116820812225342} 11/07/2021 04:58:09 - INFO - __main__ - Step 55071: {'lr': 0.00035745211601040464, 'samples': 10573632, 'steps': 55070, 'loss/train': 2.0120761394500732} 11/07/2021 04:58:09 - INFO - __main__ - Step 55072: {'lr': 0.000357447324420127, 'samples': 10573824, 'steps': 55071, 'loss/train': 1.7886202335357666} 11/07/2021 04:58:10 - INFO - __main__ - Step 55073: {'lr': 0.00035744253278143526, 'samples': 10574016, 'steps': 55072, 'loss/train': 2.2052876949310303} 11/07/2021 04:58:11 - INFO - __main__ - Step 55074: {'lr': 0.0003574377410943315, 'samples': 10574208, 'steps': 55073, 'loss/train': 1.3213140964508057} 11/07/2021 04:58:11 - INFO - __main__ - Step 55075: {'lr': 0.00035743294935881804, 'samples': 10574400, 'steps': 55074, 'loss/train': 1.1828625202178955} 11/07/2021 04:58:11 - INFO - __main__ - Step 55076: {'lr': 0.0003574281575748969, 'samples': 10574592, 'steps': 55075, 'loss/train': 0.9429059624671936} 11/07/2021 04:58:12 - INFO - __main__ - Step 55077: {'lr': 0.0003574233657425703, 'samples': 10574784, 'steps': 55076, 'loss/train': 1.2194664478302002} 11/07/2021 04:58:13 - INFO - __main__ - Step 55078: {'lr': 0.0003574185738618404, 'samples': 10574976, 'steps': 55077, 'loss/train': 1.5214813947677612} 11/07/2021 04:58:13 - INFO - __main__ - Step 55079: {'lr': 0.00035741378193270934, 'samples': 10575168, 'steps': 55078, 'loss/train': 2.3881895542144775} 11/07/2021 04:58:14 - INFO - __main__ - Step 55080: {'lr': 0.00035740898995517933, 'samples': 10575360, 'steps': 55079, 'loss/train': 3.341811180114746} 11/07/2021 04:58:14 - INFO - __main__ - Step 55081: {'lr': 0.00035740419792925244, 'samples': 10575552, 'steps': 55080, 'loss/train': 1.680826187133789} 11/07/2021 04:58:14 - INFO - __main__ - Step 55082: {'lr': 0.0003573994058549309, 'samples': 10575744, 'steps': 55081, 'loss/train': 1.295062780380249} 11/07/2021 04:58:15 - INFO - __main__ - Step 55083: {'lr': 0.00035739461373221677, 'samples': 10575936, 'steps': 55082, 'loss/train': 1.247595191001892} 11/07/2021 04:58:16 - INFO - __main__ - Step 55084: {'lr': 0.00035738982156111233, 'samples': 10576128, 'steps': 55083, 'loss/train': 1.5760958194732666} 11/07/2021 04:58:16 - INFO - __main__ - Step 55085: {'lr': 0.0003573850293416198, 'samples': 10576320, 'steps': 55084, 'loss/train': 1.8488069772720337} 11/07/2021 04:58:16 - INFO - __main__ - Step 55086: {'lr': 0.00035738023707374114, 'samples': 10576512, 'steps': 55085, 'loss/train': 1.074326992034912} 11/07/2021 04:58:17 - INFO - __main__ - Step 55087: {'lr': 0.0003573754447574785, 'samples': 10576704, 'steps': 55086, 'loss/train': 1.8997424840927124} 11/07/2021 04:58:17 - INFO - __main__ - Step 55088: {'lr': 0.0003573706523928343, 'samples': 10576896, 'steps': 55087, 'loss/train': 1.388343334197998} 11/07/2021 04:58:18 - INFO - __main__ - Step 55089: {'lr': 0.00035736585997981046, 'samples': 10577088, 'steps': 55088, 'loss/train': 1.54605233669281} 11/07/2021 04:58:18 - INFO - __main__ - Step 55090: {'lr': 0.00035736106751840926, 'samples': 10577280, 'steps': 55089, 'loss/train': 1.278705358505249} 11/07/2021 04:58:19 - INFO - __main__ - Step 55091: {'lr': 0.00035735627500863275, 'samples': 10577472, 'steps': 55090, 'loss/train': 1.3363699913024902} 11/07/2021 04:58:19 - INFO - __main__ - Step 55092: {'lr': 0.00035735148245048326, 'samples': 10577664, 'steps': 55091, 'loss/train': 1.4652860164642334} 11/07/2021 04:58:20 - INFO - __main__ - Step 55093: {'lr': 0.0003573466898439628, 'samples': 10577856, 'steps': 55092, 'loss/train': 1.192568302154541} 11/07/2021 04:58:21 - INFO - __main__ - Step 55094: {'lr': 0.00035734189718907364, 'samples': 10578048, 'steps': 55093, 'loss/train': 1.6429756879806519} 11/07/2021 04:58:21 - INFO - __main__ - Step 55095: {'lr': 0.00035733710448581773, 'samples': 10578240, 'steps': 55094, 'loss/train': 1.6769375801086426} 11/07/2021 04:58:21 - INFO - __main__ - Step 55096: {'lr': 0.0003573323117341975, 'samples': 10578432, 'steps': 55095, 'loss/train': 1.516952395439148} 11/07/2021 04:58:22 - INFO - __main__ - Step 55097: {'lr': 0.00035732751893421494, 'samples': 10578624, 'steps': 55096, 'loss/train': 1.70482337474823} 11/07/2021 04:58:22 - INFO - __main__ - Step 55098: {'lr': 0.0003573227260858723, 'samples': 10578816, 'steps': 55097, 'loss/train': 1.5933622121810913} 11/07/2021 04:58:24 - INFO - __main__ - Step 55099: {'lr': 0.00035731793318917167, 'samples': 10579008, 'steps': 55098, 'loss/train': 1.691129207611084} 11/07/2021 04:58:24 - INFO - __main__ - Step 55100: {'lr': 0.0003573131402441152, 'samples': 10579200, 'steps': 55099, 'loss/train': 1.790935754776001} 11/07/2021 04:58:25 - INFO - __main__ - Step 55101: {'lr': 0.0003573083472507051, 'samples': 10579392, 'steps': 55100, 'loss/train': 1.7911534309387207} 11/07/2021 04:58:25 - INFO - __main__ - Step 55102: {'lr': 0.00035730355420894355, 'samples': 10579584, 'steps': 55101, 'loss/train': 1.7881889343261719} 11/07/2021 04:58:25 - INFO - __main__ - Step 55103: {'lr': 0.00035729876111883265, 'samples': 10579776, 'steps': 55102, 'loss/train': 1.4734764099121094} 11/07/2021 04:58:26 - INFO - __main__ - Step 55104: {'lr': 0.0003572939679803746, 'samples': 10579968, 'steps': 55103, 'loss/train': 1.5647242069244385} 11/07/2021 04:58:26 - INFO - __main__ - Step 55105: {'lr': 0.00035728917479357154, 'samples': 10580160, 'steps': 55104, 'loss/train': 1.3219374418258667} 11/07/2021 04:58:27 - INFO - __main__ - Step 55106: {'lr': 0.00035728438155842556, 'samples': 10580352, 'steps': 55105, 'loss/train': 1.6424331665039062} 11/07/2021 04:58:27 - INFO - __main__ - Step 55107: {'lr': 0.000357279588274939, 'samples': 10580544, 'steps': 55106, 'loss/train': 1.7871103286743164} 11/07/2021 04:58:28 - INFO - __main__ - Step 55108: {'lr': 0.00035727479494311387, 'samples': 10580736, 'steps': 55107, 'loss/train': 0.7385053634643555} 11/07/2021 04:58:28 - INFO - __main__ - Step 55109: {'lr': 0.0003572700015629524, 'samples': 10580928, 'steps': 55108, 'loss/train': 1.3223023414611816} 11/07/2021 04:58:28 - INFO - __main__ - Step 55110: {'lr': 0.0003572652081344566, 'samples': 10581120, 'steps': 55109, 'loss/train': 1.6981300115585327} 11/07/2021 04:58:30 - INFO - __main__ - Step 55111: {'lr': 0.00035726041465762885, 'samples': 10581312, 'steps': 55110, 'loss/train': 1.184633493423462} 11/07/2021 04:58:30 - INFO - __main__ - Step 55112: {'lr': 0.0003572556211324713, 'samples': 10581504, 'steps': 55111, 'loss/train': 1.4075344800949097} 11/07/2021 04:58:30 - INFO - __main__ - Step 55113: {'lr': 0.0003572508275589859, 'samples': 10581696, 'steps': 55112, 'loss/train': 1.3846694231033325} 11/07/2021 04:58:31 - INFO - __main__ - Step 55114: {'lr': 0.00035724603393717493, 'samples': 10581888, 'steps': 55113, 'loss/train': 1.4604871273040771} 11/07/2021 04:58:31 - INFO - __main__ - Step 55115: {'lr': 0.00035724124026704064, 'samples': 10582080, 'steps': 55114, 'loss/train': 1.9760327339172363} 11/07/2021 04:58:31 - INFO - __main__ - Step 55116: {'lr': 0.000357236446548585, 'samples': 10582272, 'steps': 55115, 'loss/train': 0.805184543132782} 11/07/2021 04:58:32 - INFO - __main__ - Step 55117: {'lr': 0.0003572316527818103, 'samples': 10582464, 'steps': 55116, 'loss/train': 1.3658256530761719} 11/07/2021 04:58:33 - INFO - __main__ - Step 55118: {'lr': 0.00035722685896671876, 'samples': 10582656, 'steps': 55117, 'loss/train': 1.6515177488327026} 11/07/2021 04:58:33 - INFO - __main__ - Step 55119: {'lr': 0.00035722206510331237, 'samples': 10582848, 'steps': 55118, 'loss/train': 1.3519814014434814} 11/07/2021 04:58:33 - INFO - __main__ - Step 55120: {'lr': 0.0003572172711915934, 'samples': 10583040, 'steps': 55119, 'loss/train': 1.557307243347168} 11/07/2021 04:58:34 - INFO - __main__ - Step 55121: {'lr': 0.0003572124772315639, 'samples': 10583232, 'steps': 55120, 'loss/train': 1.3858848810195923} 11/07/2021 04:58:35 - INFO - __main__ - Step 55122: {'lr': 0.0003572076832232262, 'samples': 10583424, 'steps': 55121, 'loss/train': 1.3028733730316162} 11/07/2021 04:58:35 - INFO - __main__ - Step 55123: {'lr': 0.0003572028891665823, 'samples': 10583616, 'steps': 55122, 'loss/train': 1.1942554712295532} 11/07/2021 04:58:36 - INFO - __main__ - Step 55124: {'lr': 0.00035719809506163454, 'samples': 10583808, 'steps': 55123, 'loss/train': 1.3237128257751465} 11/07/2021 04:58:36 - INFO - __main__ - Step 55125: {'lr': 0.0003571933009083849, 'samples': 10584000, 'steps': 55124, 'loss/train': 1.6525145769119263} 11/07/2021 04:58:36 - INFO - __main__ - Step 55126: {'lr': 0.00035718850670683565, 'samples': 10584192, 'steps': 55125, 'loss/train': 1.6280956268310547} 11/07/2021 04:58:37 - INFO - __main__ - Step 55127: {'lr': 0.00035718371245698887, 'samples': 10584384, 'steps': 55126, 'loss/train': 0.13471516966819763} 11/07/2021 04:58:38 - INFO - __main__ - Step 55128: {'lr': 0.0003571789181588468, 'samples': 10584576, 'steps': 55127, 'loss/train': 1.616760015487671} 11/07/2021 04:58:38 - INFO - __main__ - Step 55129: {'lr': 0.00035717412381241153, 'samples': 10584768, 'steps': 55128, 'loss/train': 1.5680081844329834} 11/07/2021 04:58:38 - INFO - __main__ - Step 55130: {'lr': 0.00035716932941768525, 'samples': 10584960, 'steps': 55129, 'loss/train': 1.6211940050125122} 11/07/2021 04:58:39 - INFO - __main__ - Step 55131: {'lr': 0.0003571645349746702, 'samples': 10585152, 'steps': 55130, 'loss/train': 1.2587850093841553} 11/07/2021 04:58:40 - INFO - __main__ - Step 55132: {'lr': 0.00035715974048336843, 'samples': 10585344, 'steps': 55131, 'loss/train': 1.1344618797302246} 11/07/2021 04:58:40 - INFO - __main__ - Step 55133: {'lr': 0.0003571549459437821, 'samples': 10585536, 'steps': 55132, 'loss/train': 1.1305006742477417} 11/07/2021 04:58:41 - INFO - __main__ - Step 55134: {'lr': 0.00035715015135591346, 'samples': 10585728, 'steps': 55133, 'loss/train': 1.698072910308838} 11/07/2021 04:58:41 - INFO - __main__ - Step 55135: {'lr': 0.0003571453567197645, 'samples': 10585920, 'steps': 55134, 'loss/train': 1.8285516500473022} 11/07/2021 04:58:41 - INFO - __main__ - Step 55136: {'lr': 0.0003571405620353376, 'samples': 10586112, 'steps': 55135, 'loss/train': 1.2474533319473267} 11/07/2021 04:58:42 - INFO - __main__ - Step 55137: {'lr': 0.00035713576730263475, 'samples': 10586304, 'steps': 55136, 'loss/train': 1.4400640726089478} 11/07/2021 04:58:43 - INFO - __main__ - Step 55138: {'lr': 0.0003571309725216582, 'samples': 10586496, 'steps': 55137, 'loss/train': 0.987962543964386} 11/07/2021 04:58:43 - INFO - __main__ - Step 55139: {'lr': 0.0003571261776924102, 'samples': 10586688, 'steps': 55138, 'loss/train': 1.189251184463501} 11/07/2021 04:58:43 - INFO - __main__ - Step 55140: {'lr': 0.00035712138281489264, 'samples': 10586880, 'steps': 55139, 'loss/train': 0.8905391693115234} 11/07/2021 04:58:44 - INFO - __main__ - Step 55141: {'lr': 0.0003571165878891079, 'samples': 10587072, 'steps': 55140, 'loss/train': 1.3201361894607544} 11/07/2021 04:58:45 - INFO - __main__ - Step 55142: {'lr': 0.00035711179291505806, 'samples': 10587264, 'steps': 55141, 'loss/train': 1.5405439138412476} 11/07/2021 04:58:45 - INFO - __main__ - Step 55143: {'lr': 0.0003571069978927453, 'samples': 10587456, 'steps': 55142, 'loss/train': 1.4250941276550293} 11/07/2021 04:58:45 - INFO - __main__ - Step 55144: {'lr': 0.00035710220282217175, 'samples': 10587648, 'steps': 55143, 'loss/train': 0.9400364756584167} 11/07/2021 04:58:46 - INFO - __main__ - Step 55145: {'lr': 0.0003570974077033397, 'samples': 10587840, 'steps': 55144, 'loss/train': 1.5619584321975708} 11/07/2021 04:58:46 - INFO - __main__ - Step 55146: {'lr': 0.00035709261253625115, 'samples': 10588032, 'steps': 55145, 'loss/train': 0.9394890666007996} 11/07/2021 04:58:47 - INFO - __main__ - Step 55147: {'lr': 0.00035708781732090835, 'samples': 10588224, 'steps': 55146, 'loss/train': 1.5081124305725098} 11/07/2021 04:58:48 - INFO - __main__ - Step 55148: {'lr': 0.00035708302205731334, 'samples': 10588416, 'steps': 55147, 'loss/train': 1.3798731565475464} 11/07/2021 04:58:48 - INFO - __main__ - Step 55149: {'lr': 0.00035707822674546847, 'samples': 10588608, 'steps': 55148, 'loss/train': 1.479280710220337} 11/07/2021 04:58:48 - INFO - __main__ - Step 55150: {'lr': 0.00035707343138537584, 'samples': 10588800, 'steps': 55149, 'loss/train': 1.2436096668243408} 11/07/2021 04:58:49 - INFO - __main__ - Step 55151: {'lr': 0.00035706863597703746, 'samples': 10588992, 'steps': 55150, 'loss/train': 1.5817415714263916} 11/07/2021 04:58:50 - INFO - __main__ - Step 55152: {'lr': 0.00035706384052045567, 'samples': 10589184, 'steps': 55151, 'loss/train': 1.42521071434021} 11/07/2021 04:58:50 - INFO - __main__ - Step 55153: {'lr': 0.0003570590450156325, 'samples': 10589376, 'steps': 55152, 'loss/train': 1.4141230583190918} 11/07/2021 04:58:51 - INFO - __main__ - Step 55154: {'lr': 0.00035705424946257027, 'samples': 10589568, 'steps': 55153, 'loss/train': 1.715566873550415} 11/07/2021 04:58:51 - INFO - __main__ - Step 55155: {'lr': 0.000357049453861271, 'samples': 10589760, 'steps': 55154, 'loss/train': 0.5692832469940186} 11/07/2021 04:58:51 - INFO - __main__ - Step 55156: {'lr': 0.00035704465821173695, 'samples': 10589952, 'steps': 55155, 'loss/train': 1.2147572040557861} 11/07/2021 04:58:52 - INFO - __main__ - Step 55157: {'lr': 0.00035703986251397015, 'samples': 10590144, 'steps': 55156, 'loss/train': 1.3184512853622437} 11/07/2021 04:58:53 - INFO - __main__ - Step 55158: {'lr': 0.00035703506676797284, 'samples': 10590336, 'steps': 55157, 'loss/train': 1.4686580896377563} 11/07/2021 04:58:53 - INFO - __main__ - Step 55159: {'lr': 0.00035703027097374717, 'samples': 10590528, 'steps': 55158, 'loss/train': 1.6541765928268433} 11/07/2021 04:58:53 - INFO - __main__ - Step 55160: {'lr': 0.00035702547513129533, 'samples': 10590720, 'steps': 55159, 'loss/train': 1.4428073167800903} 11/07/2021 04:58:54 - INFO - __main__ - Step 55161: {'lr': 0.0003570206792406195, 'samples': 10590912, 'steps': 55160, 'loss/train': 1.1663644313812256} 11/07/2021 04:58:54 - INFO - __main__ - Step 55162: {'lr': 0.0003570158833017219, 'samples': 10591104, 'steps': 55161, 'loss/train': 1.5408821105957031} 11/07/2021 04:58:55 - INFO - __main__ - Step 55163: {'lr': 0.0003570110873146044, 'samples': 10591296, 'steps': 55162, 'loss/train': 1.4008692502975464} 11/07/2021 04:58:55 - INFO - __main__ - Step 55164: {'lr': 0.0003570062912792694, 'samples': 10591488, 'steps': 55163, 'loss/train': 1.1255208253860474} 11/07/2021 04:58:56 - INFO - __main__ - Step 55165: {'lr': 0.0003570014951957191, 'samples': 10591680, 'steps': 55164, 'loss/train': 1.3669341802597046} 11/07/2021 04:58:56 - INFO - __main__ - Step 55166: {'lr': 0.00035699669906395554, 'samples': 10591872, 'steps': 55165, 'loss/train': 1.8413978815078735} 11/07/2021 04:58:57 - INFO - __main__ - Step 55167: {'lr': 0.00035699190288398093, 'samples': 10592064, 'steps': 55166, 'loss/train': 1.1969252824783325} 11/07/2021 04:58:58 - INFO - __main__ - Step 55168: {'lr': 0.0003569871066557974, 'samples': 10592256, 'steps': 55167, 'loss/train': 1.5171173810958862} 11/07/2021 04:58:59 - INFO - __main__ - Step 55169: {'lr': 0.0003569823103794071, 'samples': 10592448, 'steps': 55168, 'loss/train': 1.1714458465576172} 11/07/2021 04:58:59 - INFO - __main__ - Step 55170: {'lr': 0.0003569775140548122, 'samples': 10592640, 'steps': 55169, 'loss/train': 1.2222853899002075} 11/07/2021 04:58:59 - INFO - __main__ - Step 55171: {'lr': 0.00035697271768201494, 'samples': 10592832, 'steps': 55170, 'loss/train': 0.13801901042461395} 11/07/2021 04:59:00 - INFO - __main__ - Step 55172: {'lr': 0.0003569679212610175, 'samples': 10593024, 'steps': 55171, 'loss/train': 0.31321340799331665} 11/07/2021 04:59:00 - INFO - __main__ - Step 55173: {'lr': 0.00035696312479182186, 'samples': 10593216, 'steps': 55172, 'loss/train': 0.2924405038356781} 11/07/2021 04:59:01 - INFO - __main__ - Step 55174: {'lr': 0.0003569583282744303, 'samples': 10593408, 'steps': 55173, 'loss/train': 0.39519545435905457} 11/07/2021 04:59:02 - INFO - __main__ - Step 55175: {'lr': 0.00035695353170884494, 'samples': 10593600, 'steps': 55174, 'loss/train': 1.630731463432312} 11/07/2021 04:59:02 - INFO - __main__ - Step 55176: {'lr': 0.000356948735095068, 'samples': 10593792, 'steps': 55175, 'loss/train': 1.5511635541915894} 11/07/2021 04:59:02 - INFO - __main__ - Step 55177: {'lr': 0.0003569439384331016, 'samples': 10593984, 'steps': 55176, 'loss/train': 1.8013595342636108} 11/07/2021 04:59:03 - INFO - __main__ - Step 55178: {'lr': 0.00035693914172294796, 'samples': 10594176, 'steps': 55177, 'loss/train': 1.6304501295089722} 11/07/2021 04:59:04 - INFO - __main__ - Step 55179: {'lr': 0.0003569343449646092, 'samples': 10594368, 'steps': 55178, 'loss/train': 1.5907942056655884} 11/07/2021 04:59:04 - INFO - __main__ - Step 55180: {'lr': 0.0003569295481580874, 'samples': 10594560, 'steps': 55179, 'loss/train': 1.7025096416473389} 11/07/2021 04:59:04 - INFO - __main__ - Step 55181: {'lr': 0.0003569247513033848, 'samples': 10594752, 'steps': 55180, 'loss/train': 1.6277837753295898} 11/07/2021 04:59:05 - INFO - __main__ - Step 55182: {'lr': 0.00035691995440050364, 'samples': 10594944, 'steps': 55181, 'loss/train': 1.5280399322509766} 11/07/2021 04:59:05 - INFO - __main__ - Step 55183: {'lr': 0.0003569151574494459, 'samples': 10595136, 'steps': 55182, 'loss/train': 2.082932472229004} 11/07/2021 04:59:05 - INFO - __main__ - Step 55184: {'lr': 0.00035691036045021384, 'samples': 10595328, 'steps': 55183, 'loss/train': 1.4191107749938965} 11/07/2021 04:59:06 - INFO - __main__ - Step 55185: {'lr': 0.0003569055634028097, 'samples': 10595520, 'steps': 55184, 'loss/train': 1.123810052871704} 11/07/2021 04:59:07 - INFO - __main__ - Step 55186: {'lr': 0.00035690076630723555, 'samples': 10595712, 'steps': 55185, 'loss/train': 1.7939107418060303} 11/07/2021 04:59:07 - INFO - __main__ - Step 55187: {'lr': 0.0003568959691634935, 'samples': 10595904, 'steps': 55186, 'loss/train': 1.6975566148757935} 11/07/2021 04:59:07 - INFO - __main__ - Step 55188: {'lr': 0.0003568911719715858, 'samples': 10596096, 'steps': 55187, 'loss/train': 1.7787225246429443} 11/07/2021 04:59:08 - INFO - __main__ - Step 55189: {'lr': 0.00035688637473151464, 'samples': 10596288, 'steps': 55188, 'loss/train': 1.1431853771209717} 11/07/2021 04:59:09 - INFO - __main__ - Step 55190: {'lr': 0.0003568815774432821, 'samples': 10596480, 'steps': 55189, 'loss/train': 1.4977154731750488} 11/07/2021 04:59:10 - INFO - __main__ - Step 55191: {'lr': 0.00035687678010689033, 'samples': 10596672, 'steps': 55190, 'loss/train': 0.1889737993478775} 11/07/2021 04:59:10 - INFO - __main__ - Step 55192: {'lr': 0.00035687198272234163, 'samples': 10596864, 'steps': 55191, 'loss/train': 1.3264491558074951} 11/07/2021 04:59:10 - INFO - __main__ - Step 55193: {'lr': 0.00035686718528963804, 'samples': 10597056, 'steps': 55192, 'loss/train': 1.0734184980392456} 11/07/2021 04:59:11 - INFO - __main__ - Step 55194: {'lr': 0.00035686238780878167, 'samples': 10597248, 'steps': 55193, 'loss/train': 1.3196901082992554} 11/07/2021 04:59:12 - INFO - __main__ - Step 55195: {'lr': 0.0003568575902797748, 'samples': 10597440, 'steps': 55194, 'loss/train': 1.5095797777175903} 11/07/2021 04:59:12 - INFO - __main__ - Step 55196: {'lr': 0.0003568527927026195, 'samples': 10597632, 'steps': 55195, 'loss/train': 1.3618577718734741} 11/07/2021 04:59:12 - INFO - __main__ - Step 55197: {'lr': 0.000356847995077318, 'samples': 10597824, 'steps': 55196, 'loss/train': 1.2174633741378784} 11/07/2021 04:59:13 - INFO - __main__ - Step 55198: {'lr': 0.0003568431974038725, 'samples': 10598016, 'steps': 55197, 'loss/train': 2.0520200729370117} 11/07/2021 04:59:13 - INFO - __main__ - Step 55199: {'lr': 0.0003568383996822851, 'samples': 10598208, 'steps': 55198, 'loss/train': 1.5670710802078247} 11/07/2021 04:59:14 - INFO - __main__ - Step 55200: {'lr': 0.0003568336019125579, 'samples': 10598400, 'steps': 55199, 'loss/train': 1.8527283668518066} 11/07/2021 04:59:14 - INFO - __main__ - Step 55201: {'lr': 0.0003568288040946931, 'samples': 10598592, 'steps': 55200, 'loss/train': 1.5662264823913574} 11/07/2021 04:59:15 - INFO - __main__ - Step 55202: {'lr': 0.000356824006228693, 'samples': 10598784, 'steps': 55201, 'loss/train': 1.0995025634765625} 11/07/2021 04:59:15 - INFO - __main__ - Step 55203: {'lr': 0.0003568192083145596, 'samples': 10598976, 'steps': 55202, 'loss/train': 1.422552466392517} 11/07/2021 04:59:15 - INFO - __main__ - Step 55204: {'lr': 0.0003568144103522951, 'samples': 10599168, 'steps': 55203, 'loss/train': 1.1638541221618652} 11/07/2021 04:59:16 - INFO - __main__ - Step 55205: {'lr': 0.00035680961234190166, 'samples': 10599360, 'steps': 55204, 'loss/train': 1.671225905418396} 11/07/2021 04:59:17 - INFO - __main__ - Step 55206: {'lr': 0.00035680481428338156, 'samples': 10599552, 'steps': 55205, 'loss/train': 1.1339223384857178} 11/07/2021 04:59:17 - INFO - __main__ - Step 55207: {'lr': 0.0003568000161767368, 'samples': 10599744, 'steps': 55206, 'loss/train': 1.547345519065857} 11/07/2021 04:59:18 - INFO - __main__ - Step 55208: {'lr': 0.0003567952180219696, 'samples': 10599936, 'steps': 55207, 'loss/train': 0.9051850438117981} 11/07/2021 04:59:18 - INFO - __main__ - Step 55209: {'lr': 0.00035679041981908206, 'samples': 10600128, 'steps': 55208, 'loss/train': 1.6167963743209839} 11/07/2021 04:59:19 - INFO - __main__ - Step 55210: {'lr': 0.0003567856215680765, 'samples': 10600320, 'steps': 55209, 'loss/train': 1.4095200300216675} 11/07/2021 04:59:19 - INFO - __main__ - Step 55211: {'lr': 0.0003567808232689549, 'samples': 10600512, 'steps': 55210, 'loss/train': 1.5192729234695435} 11/07/2021 04:59:20 - INFO - __main__ - Step 55212: {'lr': 0.00035677602492171953, 'samples': 10600704, 'steps': 55211, 'loss/train': 1.4212462902069092} 11/07/2021 04:59:20 - INFO - __main__ - Step 55213: {'lr': 0.0003567712265263726, 'samples': 10600896, 'steps': 55212, 'loss/train': 1.7365148067474365} 11/07/2021 04:59:20 - INFO - __main__ - Step 55214: {'lr': 0.0003567664280829161, 'samples': 10601088, 'steps': 55213, 'loss/train': 1.2166881561279297} 11/07/2021 04:59:21 - INFO - __main__ - Step 55215: {'lr': 0.0003567616295913524, 'samples': 10601280, 'steps': 55214, 'loss/train': 1.1411106586456299} 11/07/2021 04:59:22 - INFO - __main__ - Step 55216: {'lr': 0.0003567568310516834, 'samples': 10601472, 'steps': 55215, 'loss/train': 1.3288177251815796} 11/07/2021 04:59:22 - INFO - __main__ - Step 55217: {'lr': 0.0003567520324639116, 'samples': 10601664, 'steps': 55216, 'loss/train': 1.064652442932129} 11/07/2021 04:59:22 - INFO - __main__ - Step 55218: {'lr': 0.0003567472338280389, 'samples': 10601856, 'steps': 55217, 'loss/train': 1.6468113660812378} 11/07/2021 04:59:23 - INFO - __main__ - Step 55219: {'lr': 0.00035674243514406754, 'samples': 10602048, 'steps': 55218, 'loss/train': 1.756061315536499} 11/07/2021 04:59:23 - INFO - __main__ - Step 55220: {'lr': 0.00035673763641199974, 'samples': 10602240, 'steps': 55219, 'loss/train': 1.2694787979125977} 11/07/2021 04:59:24 - INFO - __main__ - Step 55221: {'lr': 0.0003567328376318375, 'samples': 10602432, 'steps': 55220, 'loss/train': 1.4165061712265015} 11/07/2021 04:59:25 - INFO - __main__ - Step 55222: {'lr': 0.0003567280388035832, 'samples': 10602624, 'steps': 55221, 'loss/train': 0.7822771668434143} 11/07/2021 04:59:25 - INFO - __main__ - Step 55223: {'lr': 0.0003567232399272388, 'samples': 10602816, 'steps': 55222, 'loss/train': 1.3410395383834839} 11/07/2021 04:59:25 - INFO - __main__ - Step 55224: {'lr': 0.0003567184410028066, 'samples': 10603008, 'steps': 55223, 'loss/train': 1.2008610963821411} 11/07/2021 04:59:26 - INFO - __main__ - Step 55225: {'lr': 0.0003567136420302887, 'samples': 10603200, 'steps': 55224, 'loss/train': 1.2869157791137695} 11/07/2021 04:59:27 - INFO - __main__ - Step 55226: {'lr': 0.00035670884300968735, 'samples': 10603392, 'steps': 55225, 'loss/train': 1.367020606994629} 11/07/2021 04:59:27 - INFO - __main__ - Step 55227: {'lr': 0.0003567040439410046, 'samples': 10603584, 'steps': 55226, 'loss/train': 1.6534112691879272} 11/07/2021 04:59:27 - INFO - __main__ - Step 55228: {'lr': 0.0003566992448242427, 'samples': 10603776, 'steps': 55227, 'loss/train': 1.0427751541137695} 11/07/2021 04:59:28 - INFO - __main__ - Step 55229: {'lr': 0.0003566944456594036, 'samples': 10603968, 'steps': 55228, 'loss/train': 1.3932331800460815} 11/07/2021 04:59:28 - INFO - __main__ - Step 55230: {'lr': 0.00035668964644648975, 'samples': 10604160, 'steps': 55229, 'loss/train': 1.6934592723846436} 11/07/2021 04:59:28 - INFO - __main__ - Step 55231: {'lr': 0.0003566848471855032, 'samples': 10604352, 'steps': 55230, 'loss/train': 1.457594633102417} 11/07/2021 04:59:29 - INFO - __main__ - Step 55232: {'lr': 0.0003566800478764461, 'samples': 10604544, 'steps': 55231, 'loss/train': 1.8326622247695923} 11/07/2021 04:59:30 - INFO - __main__ - Step 55233: {'lr': 0.00035667524851932066, 'samples': 10604736, 'steps': 55232, 'loss/train': 1.4921889305114746} 11/07/2021 04:59:30 - INFO - __main__ - Step 55234: {'lr': 0.0003566704491141289, 'samples': 10604928, 'steps': 55233, 'loss/train': 1.7390446662902832} 11/07/2021 04:59:30 - INFO - __main__ - Step 55235: {'lr': 0.0003566656496608731, 'samples': 10605120, 'steps': 55234, 'loss/train': 1.584368109703064} 11/07/2021 04:59:31 - INFO - __main__ - Step 55236: {'lr': 0.0003566608501595554, 'samples': 10605312, 'steps': 55235, 'loss/train': 1.5843379497528076} 11/07/2021 04:59:32 - INFO - __main__ - Step 55237: {'lr': 0.000356656050610178, 'samples': 10605504, 'steps': 55236, 'loss/train': 1.2228553295135498} 11/07/2021 04:59:32 - INFO - __main__ - Step 55238: {'lr': 0.000356651251012743, 'samples': 10605696, 'steps': 55237, 'loss/train': 1.507058024406433} 11/07/2021 04:59:32 - INFO - __main__ - Step 55239: {'lr': 0.0003566464513672527, 'samples': 10605888, 'steps': 55238, 'loss/train': 1.0552634000778198} 11/07/2021 04:59:33 - INFO - __main__ - Step 55240: {'lr': 0.00035664165167370907, 'samples': 10606080, 'steps': 55239, 'loss/train': 1.8468682765960693} 11/07/2021 04:59:33 - INFO - __main__ - Step 55241: {'lr': 0.0003566368519321144, 'samples': 10606272, 'steps': 55240, 'loss/train': 0.9865829944610596} 11/07/2021 04:59:34 - INFO - __main__ - Step 55242: {'lr': 0.0003566320521424707, 'samples': 10606464, 'steps': 55241, 'loss/train': 1.405584454536438} 11/07/2021 04:59:35 - INFO - __main__ - Step 55243: {'lr': 0.0003566272523047803, 'samples': 10606656, 'steps': 55242, 'loss/train': 1.6511468887329102} 11/07/2021 04:59:35 - INFO - __main__ - Step 55244: {'lr': 0.00035662245241904533, 'samples': 10606848, 'steps': 55243, 'loss/train': 1.335024118423462} 11/07/2021 04:59:35 - INFO - __main__ - Step 55245: {'lr': 0.0003566176524852679, 'samples': 10607040, 'steps': 55244, 'loss/train': 1.1019014120101929} 11/07/2021 04:59:36 - INFO - __main__ - Step 55246: {'lr': 0.00035661285250345023, 'samples': 10607232, 'steps': 55245, 'loss/train': 1.3142695426940918} 11/07/2021 04:59:37 - INFO - __main__ - Step 55247: {'lr': 0.00035660805247359444, 'samples': 10607424, 'steps': 55246, 'loss/train': 1.6288530826568604} 11/07/2021 04:59:37 - INFO - __main__ - Step 55248: {'lr': 0.0003566032523957027, 'samples': 10607616, 'steps': 55247, 'loss/train': 1.4779716730117798} 11/07/2021 04:59:37 - INFO - __main__ - Step 55249: {'lr': 0.00035659845226977715, 'samples': 10607808, 'steps': 55248, 'loss/train': 1.170922875404358} 11/07/2021 04:59:38 - INFO - __main__ - Step 55250: {'lr': 0.00035659365209582004, 'samples': 10608000, 'steps': 55249, 'loss/train': 1.358385682106018} 11/07/2021 04:59:38 - INFO - __main__ - Step 55251: {'lr': 0.00035658885187383343, 'samples': 10608192, 'steps': 55250, 'loss/train': 1.4741021394729614} 11/07/2021 04:59:39 - INFO - __main__ - Step 55252: {'lr': 0.0003565840516038196, 'samples': 10608384, 'steps': 55251, 'loss/train': 1.5403051376342773} 11/07/2021 04:59:39 - INFO - __main__ - Step 55253: {'lr': 0.00035657925128578064, 'samples': 10608576, 'steps': 55252, 'loss/train': 1.1264894008636475} 11/07/2021 04:59:40 - INFO - __main__ - Step 55254: {'lr': 0.00035657445091971863, 'samples': 10608768, 'steps': 55253, 'loss/train': 1.2137291431427002} 11/07/2021 04:59:40 - INFO - __main__ - Step 55255: {'lr': 0.00035656965050563584, 'samples': 10608960, 'steps': 55254, 'loss/train': 1.276440978050232} 11/07/2021 04:59:40 - INFO - __main__ - Step 55256: {'lr': 0.0003565648500435344, 'samples': 10609152, 'steps': 55255, 'loss/train': 1.6435093879699707} 11/07/2021 04:59:41 - INFO - __main__ - Step 55257: {'lr': 0.0003565600495334165, 'samples': 10609344, 'steps': 55256, 'loss/train': 0.828242838382721} 11/07/2021 04:59:42 - INFO - __main__ - Step 55258: {'lr': 0.0003565552489752843, 'samples': 10609536, 'steps': 55257, 'loss/train': 1.0416570901870728} 11/07/2021 04:59:42 - INFO - __main__ - Step 55259: {'lr': 0.0003565504483691399, 'samples': 10609728, 'steps': 55258, 'loss/train': 1.0516154766082764} 11/07/2021 04:59:42 - INFO - __main__ - Step 55260: {'lr': 0.0003565456477149856, 'samples': 10609920, 'steps': 55259, 'loss/train': 0.9261599779129028} 11/07/2021 04:59:43 - INFO - __main__ - Step 55261: {'lr': 0.0003565408470128234, 'samples': 10610112, 'steps': 55260, 'loss/train': 1.4984734058380127} 11/07/2021 04:59:44 - INFO - __main__ - Step 55262: {'lr': 0.00035653604626265556, 'samples': 10610304, 'steps': 55261, 'loss/train': 0.5827086567878723} 11/07/2021 04:59:44 - INFO - __main__ - Step 55263: {'lr': 0.00035653124546448423, 'samples': 10610496, 'steps': 55262, 'loss/train': 1.7296804189682007} 11/07/2021 04:59:45 - INFO - __main__ - Step 55264: {'lr': 0.0003565264446183116, 'samples': 10610688, 'steps': 55263, 'loss/train': 2.3289999961853027} 11/07/2021 04:59:45 - INFO - __main__ - Step 55265: {'lr': 0.00035652164372413975, 'samples': 10610880, 'steps': 55264, 'loss/train': 1.4187769889831543} 11/07/2021 04:59:45 - INFO - __main__ - Step 55266: {'lr': 0.0003565168427819709, 'samples': 10611072, 'steps': 55265, 'loss/train': 2.162388801574707} 11/07/2021 04:59:46 - INFO - __main__ - Step 55267: {'lr': 0.00035651204179180723, 'samples': 10611264, 'steps': 55266, 'loss/train': 1.417816162109375} 11/07/2021 04:59:47 - INFO - __main__ - Step 55268: {'lr': 0.00035650724075365084, 'samples': 10611456, 'steps': 55267, 'loss/train': 1.6270649433135986} 11/07/2021 04:59:47 - INFO - __main__ - Step 55269: {'lr': 0.000356502439667504, 'samples': 10611648, 'steps': 55268, 'loss/train': 2.0392963886260986} 11/07/2021 04:59:48 - INFO - __main__ - Step 55270: {'lr': 0.0003564976385333687, 'samples': 10611840, 'steps': 55269, 'loss/train': 1.2095744609832764} 11/07/2021 04:59:48 - INFO - __main__ - Step 55271: {'lr': 0.00035649283735124723, 'samples': 10612032, 'steps': 55270, 'loss/train': 1.201027512550354} 11/07/2021 04:59:48 - INFO - __main__ - Step 55272: {'lr': 0.0003564880361211418, 'samples': 10612224, 'steps': 55271, 'loss/train': 1.771641731262207} 11/07/2021 04:59:49 - INFO - __main__ - Step 55273: {'lr': 0.00035648323484305445, 'samples': 10612416, 'steps': 55272, 'loss/train': 1.5814708471298218} 11/07/2021 04:59:50 - INFO - __main__ - Step 55274: {'lr': 0.00035647843351698736, 'samples': 10612608, 'steps': 55273, 'loss/train': 1.9617000818252563} 11/07/2021 04:59:50 - INFO - __main__ - Step 55275: {'lr': 0.0003564736321429428, 'samples': 10612800, 'steps': 55274, 'loss/train': 1.4196951389312744} 11/07/2021 04:59:50 - INFO - __main__ - Step 55276: {'lr': 0.00035646883072092285, 'samples': 10612992, 'steps': 55275, 'loss/train': 1.2456954717636108} 11/07/2021 04:59:51 - INFO - __main__ - Step 55277: {'lr': 0.00035646402925092966, 'samples': 10613184, 'steps': 55276, 'loss/train': 1.528760552406311} 11/07/2021 04:59:52 - INFO - __main__ - Step 55278: {'lr': 0.00035645922773296546, 'samples': 10613376, 'steps': 55277, 'loss/train': 1.3044077157974243} 11/07/2021 04:59:52 - INFO - __main__ - Step 55279: {'lr': 0.0003564544261670324, 'samples': 10613568, 'steps': 55278, 'loss/train': 1.3315811157226562} 11/07/2021 04:59:52 - INFO - __main__ - Step 55280: {'lr': 0.0003564496245531326, 'samples': 10613760, 'steps': 55279, 'loss/train': 1.7874959707260132} 11/07/2021 04:59:53 - INFO - __main__ - Step 55281: {'lr': 0.0003564448228912682, 'samples': 10613952, 'steps': 55280, 'loss/train': 1.3334084749221802} 11/07/2021 04:59:53 - INFO - __main__ - Step 55282: {'lr': 0.0003564400211814414, 'samples': 10614144, 'steps': 55281, 'loss/train': 1.3982524871826172} 11/07/2021 04:59:54 - INFO - __main__ - Step 55283: {'lr': 0.0003564352194236544, 'samples': 10614336, 'steps': 55282, 'loss/train': 1.5548148155212402} 11/07/2021 04:59:55 - INFO - __main__ - Step 55284: {'lr': 0.00035643041761790936, 'samples': 10614528, 'steps': 55283, 'loss/train': 1.3802969455718994} 11/07/2021 04:59:55 - INFO - __main__ - Step 55285: {'lr': 0.00035642561576420834, 'samples': 10614720, 'steps': 55284, 'loss/train': 1.5726593732833862} 11/07/2021 04:59:55 - INFO - __main__ - Step 55286: {'lr': 0.00035642081386255366, 'samples': 10614912, 'steps': 55285, 'loss/train': 1.7341140508651733} 11/07/2021 04:59:56 - INFO - __main__ - Step 55287: {'lr': 0.0003564160119129473, 'samples': 10615104, 'steps': 55286, 'loss/train': 1.3809142112731934} 11/07/2021 04:59:56 - INFO - __main__ - Step 55288: {'lr': 0.0003564112099153916, 'samples': 10615296, 'steps': 55287, 'loss/train': 0.09288747608661652} 11/07/2021 04:59:57 - INFO - __main__ - Step 55289: {'lr': 0.00035640640786988866, 'samples': 10615488, 'steps': 55288, 'loss/train': 1.3767549991607666} 11/07/2021 04:59:58 - INFO - __main__ - Step 55290: {'lr': 0.0003564016057764406, 'samples': 10615680, 'steps': 55289, 'loss/train': 1.3744242191314697} 11/07/2021 04:59:58 - INFO - __main__ - Step 55291: {'lr': 0.00035639680363504965, 'samples': 10615872, 'steps': 55290, 'loss/train': 0.11196830868721008} 11/07/2021 04:59:59 - INFO - __main__ - Step 55292: {'lr': 0.0003563920014457179, 'samples': 10616064, 'steps': 55291, 'loss/train': 1.2871259450912476} 11/07/2021 04:59:59 - INFO - __main__ - Step 55293: {'lr': 0.0003563871992084476, 'samples': 10616256, 'steps': 55292, 'loss/train': 1.16878342628479} 11/07/2021 05:00:00 - INFO - __main__ - Step 55294: {'lr': 0.0003563823969232409, 'samples': 10616448, 'steps': 55293, 'loss/train': 1.460994005203247} 11/07/2021 05:00:00 - INFO - __main__ - Step 55295: {'lr': 0.0003563775945900999, 'samples': 10616640, 'steps': 55294, 'loss/train': 0.4720584750175476} 11/07/2021 05:00:01 - INFO - __main__ - Step 55296: {'lr': 0.00035637279220902677, 'samples': 10616832, 'steps': 55295, 'loss/train': 1.453580379486084} 11/07/2021 05:00:01 - INFO - __main__ - Step 55297: {'lr': 0.00035636798978002374, 'samples': 10617024, 'steps': 55296, 'loss/train': 1.009641408920288} 11/07/2021 05:00:01 - INFO - __main__ - Step 55298: {'lr': 0.00035636318730309285, 'samples': 10617216, 'steps': 55297, 'loss/train': 1.3926947116851807} 11/07/2021 05:00:02 - INFO - __main__ - Step 55299: {'lr': 0.0003563583847782364, 'samples': 10617408, 'steps': 55298, 'loss/train': 1.8583132028579712} 11/07/2021 05:00:03 - INFO - __main__ - Step 55300: {'lr': 0.0003563535822054565, 'samples': 10617600, 'steps': 55299, 'loss/train': 1.485907793045044} 11/07/2021 05:00:03 - INFO - __main__ - Step 55301: {'lr': 0.00035634877958475535, 'samples': 10617792, 'steps': 55300, 'loss/train': 1.32669198513031} 11/07/2021 05:00:03 - INFO - __main__ - Step 55302: {'lr': 0.0003563439769161351, 'samples': 10617984, 'steps': 55301, 'loss/train': 1.3650391101837158} 11/07/2021 05:00:04 - INFO - __main__ - Step 55303: {'lr': 0.00035633917419959784, 'samples': 10618176, 'steps': 55302, 'loss/train': 1.4098504781723022} 11/07/2021 05:00:05 - INFO - __main__ - Step 55304: {'lr': 0.0003563343714351458, 'samples': 10618368, 'steps': 55303, 'loss/train': 0.8533596992492676} 11/07/2021 05:00:06 - INFO - __main__ - Step 55305: {'lr': 0.0003563295686227811, 'samples': 10618560, 'steps': 55304, 'loss/train': 1.6199250221252441} 11/07/2021 05:00:06 - INFO - __main__ - Step 55306: {'lr': 0.000356324765762506, 'samples': 10618752, 'steps': 55305, 'loss/train': 1.3163352012634277} 11/07/2021 05:00:06 - INFO - __main__ - Step 55307: {'lr': 0.0003563199628543226, 'samples': 10618944, 'steps': 55306, 'loss/train': 0.8771881461143494} 11/07/2021 05:00:07 - INFO - __main__ - Step 55308: {'lr': 0.00035631515989823306, 'samples': 10619136, 'steps': 55307, 'loss/train': 1.5897128582000732} 11/07/2021 05:00:08 - INFO - __main__ - Step 55309: {'lr': 0.0003563103568942395, 'samples': 10619328, 'steps': 55308, 'loss/train': 0.20608165860176086} 11/07/2021 05:00:08 - INFO - __main__ - Step 55310: {'lr': 0.0003563055538423441, 'samples': 10619520, 'steps': 55309, 'loss/train': 1.151322603225708} 11/07/2021 05:00:08 - INFO - __main__ - Step 55311: {'lr': 0.00035630075074254917, 'samples': 10619712, 'steps': 55310, 'loss/train': 1.5774955749511719} 11/07/2021 05:00:09 - INFO - __main__ - Step 55312: {'lr': 0.0003562959475948567, 'samples': 10619904, 'steps': 55311, 'loss/train': 1.6166630983352661} 11/07/2021 05:00:09 - INFO - __main__ - Step 55313: {'lr': 0.00035629114439926897, 'samples': 10620096, 'steps': 55312, 'loss/train': 1.3984489440917969} 11/07/2021 05:00:10 - INFO - __main__ - Step 55314: {'lr': 0.00035628634115578806, 'samples': 10620288, 'steps': 55313, 'loss/train': 1.7488480806350708} 11/07/2021 05:00:11 - INFO - __main__ - Step 55315: {'lr': 0.00035628153786441616, 'samples': 10620480, 'steps': 55314, 'loss/train': 1.669127345085144} 11/07/2021 05:00:11 - INFO - __main__ - Step 55316: {'lr': 0.0003562767345251554, 'samples': 10620672, 'steps': 55315, 'loss/train': 1.605141282081604} 11/07/2021 05:00:11 - INFO - __main__ - Step 55317: {'lr': 0.00035627193113800797, 'samples': 10620864, 'steps': 55316, 'loss/train': 1.0980522632598877} 11/07/2021 05:00:12 - INFO - __main__ - Step 55318: {'lr': 0.0003562671277029761, 'samples': 10621056, 'steps': 55317, 'loss/train': 1.114978313446045} 11/07/2021 05:00:13 - INFO - __main__ - Step 55319: {'lr': 0.00035626232422006186, 'samples': 10621248, 'steps': 55318, 'loss/train': 0.24382823705673218} 11/07/2021 05:00:13 - INFO - __main__ - Step 55320: {'lr': 0.0003562575206892676, 'samples': 10621440, 'steps': 55319, 'loss/train': 1.6088790893554688} 11/07/2021 05:00:13 - INFO - __main__ - Step 55321: {'lr': 0.0003562527171105952, 'samples': 10621632, 'steps': 55320, 'loss/train': 1.1404860019683838} 11/07/2021 05:00:14 - INFO - __main__ - Step 55322: {'lr': 0.000356247913484047, 'samples': 10621824, 'steps': 55321, 'loss/train': 1.7770678997039795} 11/07/2021 05:00:14 - INFO - __main__ - Step 55323: {'lr': 0.00035624310980962516, 'samples': 10622016, 'steps': 55322, 'loss/train': 1.3695993423461914} 11/07/2021 05:00:16 - INFO - __main__ - Step 55324: {'lr': 0.0003562383060873318, 'samples': 10622208, 'steps': 55323, 'loss/train': 1.3157083988189697} 11/07/2021 05:00:16 - INFO - __main__ - Step 55325: {'lr': 0.000356233502317169, 'samples': 10622400, 'steps': 55324, 'loss/train': 1.2763516902923584} 11/07/2021 05:00:17 - INFO - __main__ - Step 55326: {'lr': 0.00035622869849913916, 'samples': 10622592, 'steps': 55325, 'loss/train': 1.2261382341384888} 11/07/2021 05:00:17 - INFO - __main__ - Step 55327: {'lr': 0.00035622389463324424, 'samples': 10622784, 'steps': 55326, 'loss/train': 1.374090313911438} 11/07/2021 05:00:17 - INFO - __main__ - Step 55328: {'lr': 0.0003562190907194865, 'samples': 10622976, 'steps': 55327, 'loss/train': 1.3394628763198853} 11/07/2021 05:00:18 - INFO - __main__ - Step 55329: {'lr': 0.00035621428675786804, 'samples': 10623168, 'steps': 55328, 'loss/train': 0.39241358637809753} 11/07/2021 05:00:19 - INFO - __main__ - Step 55330: {'lr': 0.0003562094827483911, 'samples': 10623360, 'steps': 55329, 'loss/train': 0.10898163914680481} 11/07/2021 05:00:19 - INFO - __main__ - Step 55331: {'lr': 0.0003562046786910578, 'samples': 10623552, 'steps': 55330, 'loss/train': 1.525092363357544} 11/07/2021 05:00:19 - INFO - __main__ - Step 55332: {'lr': 0.0003561998745858703, 'samples': 10623744, 'steps': 55331, 'loss/train': 1.3611594438552856} 11/07/2021 05:00:20 - INFO - __main__ - Step 55333: {'lr': 0.00035619507043283075, 'samples': 10623936, 'steps': 55332, 'loss/train': 1.5670605897903442} 11/07/2021 05:00:20 - INFO - __main__ - Step 55334: {'lr': 0.0003561902662319414, 'samples': 10624128, 'steps': 55333, 'loss/train': 0.6823264360427856} 11/07/2021 05:00:21 - INFO - __main__ - Step 55335: {'lr': 0.00035618546198320426, 'samples': 10624320, 'steps': 55334, 'loss/train': 1.4306527376174927} 11/07/2021 05:00:22 - INFO - __main__ - Step 55336: {'lr': 0.0003561806576866217, 'samples': 10624512, 'steps': 55335, 'loss/train': 0.6125316619873047} 11/07/2021 05:00:22 - INFO - __main__ - Step 55337: {'lr': 0.0003561758533421957, 'samples': 10624704, 'steps': 55336, 'loss/train': 1.3668783903121948} 11/07/2021 05:00:22 - INFO - __main__ - Step 55338: {'lr': 0.00035617104894992854, 'samples': 10624896, 'steps': 55337, 'loss/train': 1.2590347528457642} 11/07/2021 05:00:23 - INFO - __main__ - Step 55339: {'lr': 0.00035616624450982227, 'samples': 10625088, 'steps': 55338, 'loss/train': 1.9527186155319214} 11/07/2021 05:00:24 - INFO - __main__ - Step 55340: {'lr': 0.0003561614400218792, 'samples': 10625280, 'steps': 55339, 'loss/train': 1.564045786857605} 11/07/2021 05:00:24 - INFO - __main__ - Step 55341: {'lr': 0.00035615663548610145, 'samples': 10625472, 'steps': 55340, 'loss/train': 1.387307047843933} 11/07/2021 05:00:24 - INFO - __main__ - Step 55342: {'lr': 0.0003561518309024911, 'samples': 10625664, 'steps': 55341, 'loss/train': 1.4752954244613647} 11/07/2021 05:00:25 - INFO - __main__ - Step 55343: {'lr': 0.0003561470262710504, 'samples': 10625856, 'steps': 55342, 'loss/train': 1.4978200197219849} 11/07/2021 05:00:25 - INFO - __main__ - Step 55344: {'lr': 0.00035614222159178143, 'samples': 10626048, 'steps': 55343, 'loss/train': 1.1438692808151245} 11/07/2021 05:00:26 - INFO - __main__ - Step 55345: {'lr': 0.00035613741686468646, 'samples': 10626240, 'steps': 55344, 'loss/train': 1.4912155866622925} 11/07/2021 05:00:27 - INFO - __main__ - Step 55346: {'lr': 0.0003561326120897676, 'samples': 10626432, 'steps': 55345, 'loss/train': 1.2749402523040771} 11/07/2021 05:00:27 - INFO - __main__ - Step 55347: {'lr': 0.00035612780726702707, 'samples': 10626624, 'steps': 55346, 'loss/train': 1.9267258644104004} 11/07/2021 05:00:27 - INFO - __main__ - Step 55348: {'lr': 0.00035612300239646694, 'samples': 10626816, 'steps': 55347, 'loss/train': 1.3484121561050415} 11/07/2021 05:00:28 - INFO - __main__ - Step 55349: {'lr': 0.00035611819747808943, 'samples': 10627008, 'steps': 55348, 'loss/train': 1.1575568914413452} 11/07/2021 05:00:28 - INFO - __main__ - Step 55350: {'lr': 0.00035611339251189665, 'samples': 10627200, 'steps': 55349, 'loss/train': 1.8071938753128052} 11/07/2021 05:00:29 - INFO - __main__ - Step 55351: {'lr': 0.0003561085874978909, 'samples': 10627392, 'steps': 55350, 'loss/train': 2.0694077014923096} 11/07/2021 05:00:29 - INFO - __main__ - Step 55352: {'lr': 0.00035610378243607424, 'samples': 10627584, 'steps': 55351, 'loss/train': 1.7644639015197754} 11/07/2021 05:00:30 - INFO - __main__ - Step 55353: {'lr': 0.0003560989773264488, 'samples': 10627776, 'steps': 55352, 'loss/train': 1.836050271987915} 11/07/2021 05:00:30 - INFO - __main__ - Step 55354: {'lr': 0.00035609417216901683, 'samples': 10627968, 'steps': 55353, 'loss/train': 0.966948390007019} 11/07/2021 05:00:30 - INFO - __main__ - Step 55355: {'lr': 0.00035608936696378046, 'samples': 10628160, 'steps': 55354, 'loss/train': 1.9758682250976562} 11/07/2021 05:00:31 - INFO - __main__ - Step 55356: {'lr': 0.0003560845617107419, 'samples': 10628352, 'steps': 55355, 'loss/train': 1.2718393802642822} 11/07/2021 05:00:32 - INFO - __main__ - Step 55357: {'lr': 0.0003560797564099032, 'samples': 10628544, 'steps': 55356, 'loss/train': 1.1412488222122192} 11/07/2021 05:00:32 - INFO - __main__ - Step 55358: {'lr': 0.00035607495106126664, 'samples': 10628736, 'steps': 55357, 'loss/train': 1.7440805435180664} 11/07/2021 05:00:33 - INFO - __main__ - Step 55359: {'lr': 0.0003560701456648343, 'samples': 10628928, 'steps': 55358, 'loss/train': 1.2522573471069336} 11/07/2021 05:00:33 - INFO - __main__ - Step 55360: {'lr': 0.0003560653402206085, 'samples': 10629120, 'steps': 55359, 'loss/train': 1.4632554054260254} 11/07/2021 05:00:34 - INFO - __main__ - Step 55361: {'lr': 0.0003560605347285912, 'samples': 10629312, 'steps': 55360, 'loss/train': 1.911437749862671} 11/07/2021 05:00:34 - INFO - __main__ - Step 55362: {'lr': 0.0003560557291887847, 'samples': 10629504, 'steps': 55361, 'loss/train': 1.4185771942138672} 11/07/2021 05:00:35 - INFO - __main__ - Step 55363: {'lr': 0.0003560509236011911, 'samples': 10629696, 'steps': 55362, 'loss/train': 1.55713951587677} 11/07/2021 05:00:35 - INFO - __main__ - Step 55364: {'lr': 0.0003560461179658125, 'samples': 10629888, 'steps': 55363, 'loss/train': 2.0547432899475098} 11/07/2021 05:00:35 - INFO - __main__ - Step 55365: {'lr': 0.0003560413122826513, 'samples': 10630080, 'steps': 55364, 'loss/train': 1.1069855690002441} 11/07/2021 05:00:36 - INFO - __main__ - Step 55366: {'lr': 0.0003560365065517095, 'samples': 10630272, 'steps': 55365, 'loss/train': 1.6910406351089478} 11/07/2021 05:00:37 - INFO - __main__ - Step 55367: {'lr': 0.0003560317007729893, 'samples': 10630464, 'steps': 55366, 'loss/train': 1.2973557710647583} 11/07/2021 05:00:37 - INFO - __main__ - Step 55368: {'lr': 0.00035602689494649274, 'samples': 10630656, 'steps': 55367, 'loss/train': 1.5025098323822021} 11/07/2021 05:00:37 - INFO - __main__ - Step 55369: {'lr': 0.0003560220890722222, 'samples': 10630848, 'steps': 55368, 'loss/train': 1.8217743635177612} 11/07/2021 05:00:38 - INFO - __main__ - Step 55370: {'lr': 0.00035601728315017966, 'samples': 10631040, 'steps': 55369, 'loss/train': 1.04408597946167} 11/07/2021 05:00:39 - INFO - __main__ - Step 55371: {'lr': 0.00035601247718036744, 'samples': 10631232, 'steps': 55370, 'loss/train': 1.1525126695632935} 11/07/2021 05:00:39 - INFO - __main__ - Step 55372: {'lr': 0.00035600767116278765, 'samples': 10631424, 'steps': 55371, 'loss/train': 1.5420783758163452} 11/07/2021 05:00:39 - INFO - __main__ - Step 55373: {'lr': 0.0003560028650974424, 'samples': 10631616, 'steps': 55372, 'loss/train': 1.541002869606018} 11/07/2021 05:00:40 - INFO - __main__ - Step 55374: {'lr': 0.0003559980589843339, 'samples': 10631808, 'steps': 55373, 'loss/train': 1.1891919374465942} 11/07/2021 05:00:40 - INFO - __main__ - Step 55375: {'lr': 0.0003559932528234643, 'samples': 10632000, 'steps': 55374, 'loss/train': 1.2719550132751465} 11/07/2021 05:00:41 - INFO - __main__ - Step 55376: {'lr': 0.0003559884466148358, 'samples': 10632192, 'steps': 55375, 'loss/train': 1.1132653951644897} 11/07/2021 05:00:41 - INFO - __main__ - Step 55377: {'lr': 0.0003559836403584505, 'samples': 10632384, 'steps': 55376, 'loss/train': 1.6261625289916992} 11/07/2021 05:00:42 - INFO - __main__ - Step 55378: {'lr': 0.00035597883405431066, 'samples': 10632576, 'steps': 55377, 'loss/train': 1.1378587484359741} 11/07/2021 05:00:42 - INFO - __main__ - Step 55379: {'lr': 0.0003559740277024183, 'samples': 10632768, 'steps': 55378, 'loss/train': 1.080365777015686} 11/07/2021 05:00:43 - INFO - __main__ - Step 55380: {'lr': 0.0003559692213027758, 'samples': 10632960, 'steps': 55379, 'loss/train': 1.2772960662841797} 11/07/2021 05:00:43 - INFO - __main__ - Step 55381: {'lr': 0.00035596441485538513, 'samples': 10633152, 'steps': 55380, 'loss/train': 0.5054352283477783} 11/07/2021 05:00:44 - INFO - __main__ - Step 55382: {'lr': 0.00035595960836024856, 'samples': 10633344, 'steps': 55381, 'loss/train': 0.09342030435800552} 11/07/2021 05:00:44 - INFO - __main__ - Step 55383: {'lr': 0.00035595480181736816, 'samples': 10633536, 'steps': 55382, 'loss/train': 0.9672507047653198} 11/07/2021 05:00:45 - INFO - __main__ - Step 55384: {'lr': 0.0003559499952267462, 'samples': 10633728, 'steps': 55383, 'loss/train': 1.3447973728179932} 11/07/2021 05:00:45 - INFO - __main__ - Step 55385: {'lr': 0.00035594518858838485, 'samples': 10633920, 'steps': 55384, 'loss/train': 1.415655255317688} 11/07/2021 05:00:46 - INFO - __main__ - Step 55386: {'lr': 0.0003559403819022862, 'samples': 10634112, 'steps': 55385, 'loss/train': 1.2938896417617798} 11/07/2021 05:00:46 - INFO - __main__ - Step 55387: {'lr': 0.0003559355751684525, 'samples': 10634304, 'steps': 55386, 'loss/train': 1.2487860918045044} 11/07/2021 05:00:47 - INFO - __main__ - Step 55388: {'lr': 0.00035593076838688576, 'samples': 10634496, 'steps': 55387, 'loss/train': 1.5711008310317993} 11/07/2021 05:00:47 - INFO - __main__ - Step 55389: {'lr': 0.0003559259615575883, 'samples': 10634688, 'steps': 55388, 'loss/train': 0.8426490426063538} 11/07/2021 05:00:47 - INFO - __main__ - Step 55390: {'lr': 0.00035592115468056223, 'samples': 10634880, 'steps': 55389, 'loss/train': 1.5957858562469482} 11/07/2021 05:00:48 - INFO - __main__ - Step 55391: {'lr': 0.0003559163477558098, 'samples': 10635072, 'steps': 55390, 'loss/train': 1.5772674083709717} 11/07/2021 05:00:49 - INFO - __main__ - Step 55392: {'lr': 0.000355911540783333, 'samples': 10635264, 'steps': 55391, 'loss/train': 1.4916237592697144} 11/07/2021 05:00:49 - INFO - __main__ - Step 55393: {'lr': 0.0003559067337631341, 'samples': 10635456, 'steps': 55392, 'loss/train': 1.1380609273910522} 11/07/2021 05:00:49 - INFO - __main__ - Step 55394: {'lr': 0.0003559019266952153, 'samples': 10635648, 'steps': 55393, 'loss/train': 1.0099992752075195} 11/07/2021 05:00:50 - INFO - __main__ - Step 55395: {'lr': 0.0003558971195795787, 'samples': 10635840, 'steps': 55394, 'loss/train': 1.6600886583328247} 11/07/2021 05:00:50 - INFO - __main__ - Step 55396: {'lr': 0.00035589231241622653, 'samples': 10636032, 'steps': 55395, 'loss/train': 1.3273447751998901} 11/07/2021 05:00:51 - INFO - __main__ - Step 55397: {'lr': 0.0003558875052051609, 'samples': 10636224, 'steps': 55396, 'loss/train': 1.3793827295303345} 11/07/2021 05:00:52 - INFO - __main__ - Step 55398: {'lr': 0.000355882697946384, 'samples': 10636416, 'steps': 55397, 'loss/train': 1.4172426462173462} 11/07/2021 05:00:52 - INFO - __main__ - Step 55399: {'lr': 0.00035587789063989793, 'samples': 10636608, 'steps': 55398, 'loss/train': 1.429591417312622} 11/07/2021 05:00:52 - INFO - __main__ - Step 55400: {'lr': 0.0003558730832857049, 'samples': 10636800, 'steps': 55399, 'loss/train': 1.3588968515396118} 11/07/2021 05:00:53 - INFO - __main__ - Step 55401: {'lr': 0.00035586827588380724, 'samples': 10636992, 'steps': 55400, 'loss/train': 1.4501574039459229} 11/07/2021 05:00:54 - INFO - __main__ - Step 55402: {'lr': 0.00035586346843420694, 'samples': 10637184, 'steps': 55401, 'loss/train': 1.075702428817749} 11/07/2021 05:00:54 - INFO - __main__ - Step 55403: {'lr': 0.0003558586609369061, 'samples': 10637376, 'steps': 55402, 'loss/train': 1.8202695846557617} 11/07/2021 05:00:54 - INFO - __main__ - Step 55404: {'lr': 0.000355853853391907, 'samples': 10637568, 'steps': 55403, 'loss/train': 2.1382062435150146} 11/07/2021 05:00:55 - INFO - __main__ - Step 55405: {'lr': 0.0003558490457992118, 'samples': 10637760, 'steps': 55404, 'loss/train': 0.784355878829956} 11/07/2021 05:00:55 - INFO - __main__ - Step 55406: {'lr': 0.00035584423815882265, 'samples': 10637952, 'steps': 55405, 'loss/train': 1.167901873588562} 11/07/2021 05:00:56 - INFO - __main__ - Step 55407: {'lr': 0.00035583943047074173, 'samples': 10638144, 'steps': 55406, 'loss/train': 1.8354178667068481} 11/07/2021 05:00:57 - INFO - __main__ - Step 55408: {'lr': 0.00035583462273497125, 'samples': 10638336, 'steps': 55407, 'loss/train': 1.2057093381881714} 11/07/2021 05:00:57 - INFO - __main__ - Step 55409: {'lr': 0.0003558298149515132, 'samples': 10638528, 'steps': 55408, 'loss/train': 1.6236366033554077} 11/07/2021 05:00:57 - INFO - __main__ - Step 55410: {'lr': 0.00035582500712037, 'samples': 10638720, 'steps': 55409, 'loss/train': 1.6326333284378052} 11/07/2021 05:00:58 - INFO - __main__ - Step 55411: {'lr': 0.0003558201992415436, 'samples': 10638912, 'steps': 55410, 'loss/train': 0.08839976787567139} 11/07/2021 05:00:59 - INFO - __main__ - Step 55412: {'lr': 0.00035581539131503625, 'samples': 10639104, 'steps': 55411, 'loss/train': 1.2671127319335938} 11/07/2021 05:00:59 - INFO - __main__ - Step 55413: {'lr': 0.00035581058334085015, 'samples': 10639296, 'steps': 55412, 'loss/train': 1.4644861221313477} 11/07/2021 05:00:59 - INFO - __main__ - Step 55414: {'lr': 0.00035580577531898745, 'samples': 10639488, 'steps': 55413, 'loss/train': 1.4764949083328247} 11/07/2021 05:01:00 - INFO - __main__ - Step 55415: {'lr': 0.00035580096724945027, 'samples': 10639680, 'steps': 55414, 'loss/train': 1.5854401588439941} 11/07/2021 05:01:00 - INFO - __main__ - Step 55416: {'lr': 0.00035579615913224077, 'samples': 10639872, 'steps': 55415, 'loss/train': 1.243757724761963} 11/07/2021 05:01:01 - INFO - __main__ - Step 55417: {'lr': 0.0003557913509673612, 'samples': 10640064, 'steps': 55416, 'loss/train': 1.207960844039917} 11/07/2021 05:01:02 - INFO - __main__ - Step 55418: {'lr': 0.0003557865427548137, 'samples': 10640256, 'steps': 55417, 'loss/train': 0.07647102326154709} 11/07/2021 05:01:02 - INFO - __main__ - Step 55419: {'lr': 0.0003557817344946004, 'samples': 10640448, 'steps': 55418, 'loss/train': 1.532102108001709} 11/07/2021 05:01:02 - INFO - __main__ - Step 55420: {'lr': 0.0003557769261867235, 'samples': 10640640, 'steps': 55419, 'loss/train': 1.1410948038101196} 11/07/2021 05:01:03 - INFO - __main__ - Step 55421: {'lr': 0.0003557721178311851, 'samples': 10640832, 'steps': 55420, 'loss/train': 1.5957534313201904} 11/07/2021 05:01:04 - INFO - __main__ - Step 55422: {'lr': 0.0003557673094279874, 'samples': 10641024, 'steps': 55421, 'loss/train': 1.580186128616333} 11/07/2021 05:01:04 - INFO - __main__ - Step 55423: {'lr': 0.00035576250097713263, 'samples': 10641216, 'steps': 55422, 'loss/train': 1.1744462251663208} 11/07/2021 05:01:05 - INFO - __main__ - Step 55424: {'lr': 0.00035575769247862295, 'samples': 10641408, 'steps': 55423, 'loss/train': 1.4780709743499756} 11/07/2021 05:01:05 - INFO - __main__ - Step 55425: {'lr': 0.0003557528839324604, 'samples': 10641600, 'steps': 55424, 'loss/train': 0.10940485447645187} 11/07/2021 05:01:05 - INFO - __main__ - Step 55426: {'lr': 0.0003557480753386473, 'samples': 10641792, 'steps': 55425, 'loss/train': 1.5092653036117554} 11/07/2021 05:01:06 - INFO - __main__ - Step 55427: {'lr': 0.0003557432666971857, 'samples': 10641984, 'steps': 55426, 'loss/train': 1.1131150722503662} 11/07/2021 05:01:07 - INFO - __main__ - Step 55428: {'lr': 0.0003557384580080778, 'samples': 10642176, 'steps': 55427, 'loss/train': 1.2832558155059814} 11/07/2021 05:01:07 - INFO - __main__ - Step 55429: {'lr': 0.0003557336492713258, 'samples': 10642368, 'steps': 55428, 'loss/train': 1.6966832876205444} 11/07/2021 05:01:07 - INFO - __main__ - Step 55430: {'lr': 0.00035572884048693193, 'samples': 10642560, 'steps': 55429, 'loss/train': 1.5837956666946411} 11/07/2021 05:01:08 - INFO - __main__ - Step 55431: {'lr': 0.0003557240316548982, 'samples': 10642752, 'steps': 55430, 'loss/train': 1.6265196800231934} 11/07/2021 05:01:09 - INFO - __main__ - Step 55432: {'lr': 0.0003557192227752268, 'samples': 10642944, 'steps': 55431, 'loss/train': 1.322920322418213} 11/07/2021 05:01:09 - INFO - __main__ - Step 55433: {'lr': 0.00035571441384792005, 'samples': 10643136, 'steps': 55432, 'loss/train': 1.5318617820739746} 11/07/2021 05:01:09 - INFO - __main__ - Step 55434: {'lr': 0.00035570960487298, 'samples': 10643328, 'steps': 55433, 'loss/train': 1.135300636291504} 11/07/2021 05:01:10 - INFO - __main__ - Step 55435: {'lr': 0.00035570479585040883, 'samples': 10643520, 'steps': 55434, 'loss/train': 1.3599398136138916} 11/07/2021 05:01:10 - INFO - __main__ - Step 55436: {'lr': 0.00035569998678020866, 'samples': 10643712, 'steps': 55435, 'loss/train': 1.7021390199661255} 11/07/2021 05:01:11 - INFO - __main__ - Step 55437: {'lr': 0.0003556951776623817, 'samples': 10643904, 'steps': 55436, 'loss/train': 1.6679959297180176} 11/07/2021 05:01:12 - INFO - __main__ - Step 55438: {'lr': 0.0003556903684969302, 'samples': 10644096, 'steps': 55437, 'loss/train': 1.413353681564331} 11/07/2021 05:01:12 - INFO - __main__ - Step 55439: {'lr': 0.0003556855592838562, 'samples': 10644288, 'steps': 55438, 'loss/train': 0.949391782283783} 11/07/2021 05:01:12 - INFO - __main__ - Step 55440: {'lr': 0.00035568075002316194, 'samples': 10644480, 'steps': 55439, 'loss/train': 1.5333319902420044} 11/07/2021 05:01:13 - INFO - __main__ - Step 55441: {'lr': 0.0003556759407148496, 'samples': 10644672, 'steps': 55440, 'loss/train': 1.7259225845336914} 11/07/2021 05:01:14 - INFO - __main__ - Step 55442: {'lr': 0.00035567113135892125, 'samples': 10644864, 'steps': 55441, 'loss/train': 1.1740769147872925} 11/07/2021 05:01:14 - INFO - __main__ - Step 55443: {'lr': 0.0003556663219553791, 'samples': 10645056, 'steps': 55442, 'loss/train': 1.4690876007080078} 11/07/2021 05:01:14 - INFO - __main__ - Step 55444: {'lr': 0.00035566151250422543, 'samples': 10645248, 'steps': 55443, 'loss/train': 1.029263973236084} 11/07/2021 05:01:15 - INFO - __main__ - Step 55445: {'lr': 0.0003556567030054622, 'samples': 10645440, 'steps': 55444, 'loss/train': 2.324265480041504} 11/07/2021 05:01:15 - INFO - __main__ - Step 55446: {'lr': 0.00035565189345909177, 'samples': 10645632, 'steps': 55445, 'loss/train': 1.2199026346206665} 11/07/2021 05:01:16 - INFO - __main__ - Step 55447: {'lr': 0.0003556470838651162, 'samples': 10645824, 'steps': 55446, 'loss/train': 0.14488187432289124} 11/07/2021 05:01:17 - INFO - __main__ - Step 55448: {'lr': 0.0003556422742235377, 'samples': 10646016, 'steps': 55447, 'loss/train': 1.4290181398391724} 11/07/2021 05:01:17 - INFO - __main__ - Step 55449: {'lr': 0.0003556374645343584, 'samples': 10646208, 'steps': 55448, 'loss/train': 1.3962857723236084} 11/07/2021 05:01:17 - INFO - __main__ - Step 55450: {'lr': 0.0003556326547975805, 'samples': 10646400, 'steps': 55449, 'loss/train': 1.8636422157287598} 11/07/2021 05:01:18 - INFO - __main__ - Step 55451: {'lr': 0.0003556278450132062, 'samples': 10646592, 'steps': 55450, 'loss/train': 0.9242614507675171} 11/07/2021 05:01:18 - INFO - __main__ - Step 55452: {'lr': 0.0003556230351812375, 'samples': 10646784, 'steps': 55451, 'loss/train': 1.8774685859680176} 11/07/2021 05:01:19 - INFO - __main__ - Step 55453: {'lr': 0.00035561822530167677, 'samples': 10646976, 'steps': 55452, 'loss/train': 1.392649531364441} 11/07/2021 05:01:19 - INFO - __main__ - Step 55454: {'lr': 0.0003556134153745261, 'samples': 10647168, 'steps': 55453, 'loss/train': 1.306005597114563} 11/07/2021 05:01:20 - INFO - __main__ - Step 55455: {'lr': 0.0003556086053997877, 'samples': 10647360, 'steps': 55454, 'loss/train': 1.3837898969650269} 11/07/2021 05:01:20 - INFO - __main__ - Step 55456: {'lr': 0.0003556037953774636, 'samples': 10647552, 'steps': 55455, 'loss/train': 2.020146369934082} 11/07/2021 05:01:20 - INFO - __main__ - Step 55457: {'lr': 0.0003555989853075561, 'samples': 10647744, 'steps': 55456, 'loss/train': 1.6230862140655518} 11/07/2021 05:01:21 - INFO - __main__ - Step 55458: {'lr': 0.0003555941751900673, 'samples': 10647936, 'steps': 55457, 'loss/train': 1.6130120754241943} 11/07/2021 05:01:22 - INFO - __main__ - Step 55459: {'lr': 0.00035558936502499944, 'samples': 10648128, 'steps': 55458, 'loss/train': 1.3010731935501099} 11/07/2021 05:01:22 - INFO - __main__ - Step 55460: {'lr': 0.00035558455481235463, 'samples': 10648320, 'steps': 55459, 'loss/train': 1.4402084350585938} 11/07/2021 05:01:22 - INFO - __main__ - Step 55461: {'lr': 0.000355579744552135, 'samples': 10648512, 'steps': 55460, 'loss/train': 1.2482751607894897} 11/07/2021 05:01:23 - INFO - __main__ - Step 55462: {'lr': 0.00035557493424434285, 'samples': 10648704, 'steps': 55461, 'loss/train': 1.2822860479354858} 11/07/2021 05:01:24 - INFO - __main__ - Step 55463: {'lr': 0.0003555701238889802, 'samples': 10648896, 'steps': 55462, 'loss/train': 1.7497222423553467} 11/07/2021 05:01:24 - INFO - __main__ - Step 55464: {'lr': 0.0003555653134860493, 'samples': 10649088, 'steps': 55463, 'loss/train': 1.5999925136566162} 11/07/2021 05:01:24 - INFO - __main__ - Step 55465: {'lr': 0.00035556050303555233, 'samples': 10649280, 'steps': 55464, 'loss/train': 1.547693133354187} 11/07/2021 05:01:25 - INFO - __main__ - Step 55466: {'lr': 0.00035555569253749135, 'samples': 10649472, 'steps': 55465, 'loss/train': 1.3012789487838745} 11/07/2021 05:01:25 - INFO - __main__ - Step 55467: {'lr': 0.0003555508819918687, 'samples': 10649664, 'steps': 55466, 'loss/train': 1.346777081489563} 11/07/2021 05:01:26 - INFO - __main__ - Step 55468: {'lr': 0.0003555460713986864, 'samples': 10649856, 'steps': 55467, 'loss/train': 1.7407342195510864} 11/07/2021 05:01:27 - INFO - __main__ - Step 55469: {'lr': 0.00035554126075794666, 'samples': 10650048, 'steps': 55468, 'loss/train': 1.5766143798828125} 11/07/2021 05:01:27 - INFO - __main__ - Step 55470: {'lr': 0.0003555364500696517, 'samples': 10650240, 'steps': 55469, 'loss/train': 1.2244343757629395} 11/07/2021 05:01:28 - INFO - __main__ - Step 55471: {'lr': 0.0003555316393338036, 'samples': 10650432, 'steps': 55470, 'loss/train': 0.45889508724212646} 11/07/2021 05:01:28 - INFO - __main__ - Step 55472: {'lr': 0.0003555268285504045, 'samples': 10650624, 'steps': 55471, 'loss/train': 1.283272624015808} 11/07/2021 05:01:29 - INFO - __main__ - Step 55473: {'lr': 0.00035552201771945675, 'samples': 10650816, 'steps': 55472, 'loss/train': 1.3844774961471558} 11/07/2021 05:01:29 - INFO - __main__ - Step 55474: {'lr': 0.0003555172068409624, 'samples': 10651008, 'steps': 55473, 'loss/train': 1.8402692079544067} 11/07/2021 05:01:30 - INFO - __main__ - Step 55475: {'lr': 0.0003555123959149236, 'samples': 10651200, 'steps': 55474, 'loss/train': 1.477421760559082} 11/07/2021 05:01:30 - INFO - __main__ - Step 55476: {'lr': 0.00035550758494134257, 'samples': 10651392, 'steps': 55475, 'loss/train': 1.8838911056518555} 11/07/2021 05:01:30 - INFO - __main__ - Step 55477: {'lr': 0.0003555027739202214, 'samples': 10651584, 'steps': 55476, 'loss/train': 1.3513555526733398} 11/07/2021 05:01:31 - INFO - __main__ - Step 55478: {'lr': 0.00035549796285156234, 'samples': 10651776, 'steps': 55477, 'loss/train': 0.6545446515083313} 11/07/2021 05:01:32 - INFO - __main__ - Step 55479: {'lr': 0.0003554931517353675, 'samples': 10651968, 'steps': 55478, 'loss/train': 1.3153272867202759} 11/07/2021 05:01:32 - INFO - __main__ - Step 55480: {'lr': 0.0003554883405716391, 'samples': 10652160, 'steps': 55479, 'loss/train': 1.3092068433761597} 11/07/2021 05:01:32 - INFO - __main__ - Step 55481: {'lr': 0.0003554835293603793, 'samples': 10652352, 'steps': 55480, 'loss/train': 2.0189783573150635} 11/07/2021 05:01:33 - INFO - __main__ - Step 55482: {'lr': 0.0003554787181015903, 'samples': 10652544, 'steps': 55481, 'loss/train': 1.1473562717437744} 11/07/2021 05:01:33 - INFO - __main__ - Step 55483: {'lr': 0.0003554739067952741, 'samples': 10652736, 'steps': 55482, 'loss/train': 1.3377685546875} 11/07/2021 05:01:34 - INFO - __main__ - Step 55484: {'lr': 0.00035546909544143304, 'samples': 10652928, 'steps': 55483, 'loss/train': 1.6772104501724243} 11/07/2021 05:01:34 - INFO - __main__ - Step 55485: {'lr': 0.00035546428404006913, 'samples': 10653120, 'steps': 55484, 'loss/train': 1.154433250427246} 11/07/2021 05:01:35 - INFO - __main__ - Step 55486: {'lr': 0.0003554594725911848, 'samples': 10653312, 'steps': 55485, 'loss/train': 0.08047390729188919} 11/07/2021 05:01:35 - INFO - __main__ - Step 55487: {'lr': 0.00035545466109478195, 'samples': 10653504, 'steps': 55486, 'loss/train': 1.0241944789886475} 11/07/2021 05:01:35 - INFO - __main__ - Step 55488: {'lr': 0.00035544984955086296, 'samples': 10653696, 'steps': 55487, 'loss/train': 1.5227360725402832} 11/07/2021 05:01:37 - INFO - __main__ - Step 55489: {'lr': 0.00035544503795942984, 'samples': 10653888, 'steps': 55488, 'loss/train': 1.5348587036132812} 11/07/2021 05:01:37 - INFO - __main__ - Step 55490: {'lr': 0.00035544022632048476, 'samples': 10654080, 'steps': 55489, 'loss/train': 2.0114195346832275} 11/07/2021 05:01:37 - INFO - __main__ - Step 55491: {'lr': 0.00035543541463402994, 'samples': 10654272, 'steps': 55490, 'loss/train': 1.1516433954238892} 11/07/2021 05:01:38 - INFO - __main__ - Step 55492: {'lr': 0.0003554306029000676, 'samples': 10654464, 'steps': 55491, 'loss/train': 1.52472984790802} 11/07/2021 05:01:38 - INFO - __main__ - Step 55493: {'lr': 0.00035542579111859986, 'samples': 10654656, 'steps': 55492, 'loss/train': 1.7366708517074585} 11/07/2021 05:01:39 - INFO - __main__ - Step 55494: {'lr': 0.0003554209792896289, 'samples': 10654848, 'steps': 55493, 'loss/train': 1.614952564239502} 11/07/2021 05:01:39 - INFO - __main__ - Step 55495: {'lr': 0.00035541616741315685, 'samples': 10655040, 'steps': 55494, 'loss/train': 1.4215108156204224} 11/07/2021 05:01:40 - INFO - __main__ - Step 55496: {'lr': 0.0003554113554891859, 'samples': 10655232, 'steps': 55495, 'loss/train': 1.2625516653060913} 11/07/2021 05:01:40 - INFO - __main__ - Step 55497: {'lr': 0.0003554065435177183, 'samples': 10655424, 'steps': 55496, 'loss/train': 1.1632428169250488} 11/07/2021 05:01:40 - INFO - __main__ - Step 55498: {'lr': 0.00035540173149875597, 'samples': 10655616, 'steps': 55497, 'loss/train': 1.403335690498352} 11/07/2021 05:01:42 - INFO - __main__ - Step 55499: {'lr': 0.00035539691943230135, 'samples': 10655808, 'steps': 55498, 'loss/train': 1.8241478204727173} 11/07/2021 05:01:42 - INFO - __main__ - Step 55500: {'lr': 0.00035539210731835646, 'samples': 10656000, 'steps': 55499, 'loss/train': 1.243905782699585} 11/07/2021 05:01:42 - INFO - __main__ - Step 55501: {'lr': 0.00035538729515692356, 'samples': 10656192, 'steps': 55500, 'loss/train': 1.7358616590499878} 11/07/2021 05:01:43 - INFO - __main__ - Step 55502: {'lr': 0.0003553824829480048, 'samples': 10656384, 'steps': 55501, 'loss/train': 1.4769762754440308} 11/07/2021 05:01:43 - INFO - __main__ - Step 55503: {'lr': 0.00035537767069160234, 'samples': 10656576, 'steps': 55502, 'loss/train': 1.0820908546447754} 11/07/2021 05:01:44 - INFO - __main__ - Step 55504: {'lr': 0.00035537285838771823, 'samples': 10656768, 'steps': 55503, 'loss/train': 1.6202404499053955} 11/07/2021 05:01:44 - INFO - __main__ - Step 55505: {'lr': 0.00035536804603635474, 'samples': 10656960, 'steps': 55504, 'loss/train': 1.3159245252609253} 11/07/2021 05:01:45 - INFO - __main__ - Step 55506: {'lr': 0.00035536323363751405, 'samples': 10657152, 'steps': 55505, 'loss/train': 1.5448861122131348} 11/07/2021 05:01:45 - INFO - __main__ - Step 55507: {'lr': 0.0003553584211911983, 'samples': 10657344, 'steps': 55506, 'loss/train': 1.380665898323059} 11/07/2021 05:01:45 - INFO - __main__ - Step 55508: {'lr': 0.00035535360869740973, 'samples': 10657536, 'steps': 55507, 'loss/train': 1.0124974250793457} 11/07/2021 05:01:46 - INFO - __main__ - Step 55509: {'lr': 0.00035534879615615046, 'samples': 10657728, 'steps': 55508, 'loss/train': 1.0134326219558716} 11/07/2021 05:01:47 - INFO - __main__ - Step 55510: {'lr': 0.0003553439835674226, 'samples': 10657920, 'steps': 55509, 'loss/train': 1.1719768047332764} 11/07/2021 05:01:47 - INFO - __main__ - Step 55511: {'lr': 0.00035533917093122835, 'samples': 10658112, 'steps': 55510, 'loss/train': 1.3605214357376099} 11/07/2021 05:01:48 - INFO - __main__ - Step 55512: {'lr': 0.00035533435824756986, 'samples': 10658304, 'steps': 55511, 'loss/train': 1.351001501083374} 11/07/2021 05:01:48 - INFO - __main__ - Step 55513: {'lr': 0.00035532954551644944, 'samples': 10658496, 'steps': 55512, 'loss/train': 1.2918845415115356} 11/07/2021 05:01:48 - INFO - __main__ - Step 55514: {'lr': 0.0003553247327378691, 'samples': 10658688, 'steps': 55513, 'loss/train': 1.2629348039627075} 11/07/2021 05:01:49 - INFO - __main__ - Step 55515: {'lr': 0.0003553199199118311, 'samples': 10658880, 'steps': 55514, 'loss/train': 1.4175992012023926} 11/07/2021 05:01:50 - INFO - __main__ - Step 55516: {'lr': 0.00035531510703833754, 'samples': 10659072, 'steps': 55515, 'loss/train': 1.5937676429748535} 11/07/2021 05:01:50 - INFO - __main__ - Step 55517: {'lr': 0.00035531029411739056, 'samples': 10659264, 'steps': 55516, 'loss/train': 1.3809469938278198} 11/07/2021 05:01:50 - INFO - __main__ - Step 55518: {'lr': 0.00035530548114899243, 'samples': 10659456, 'steps': 55517, 'loss/train': 1.5462132692337036} 11/07/2021 05:01:51 - INFO - __main__ - Step 55519: {'lr': 0.00035530066813314534, 'samples': 10659648, 'steps': 55518, 'loss/train': 1.0150903463363647} 11/07/2021 05:01:52 - INFO - __main__ - Step 55520: {'lr': 0.0003552958550698513, 'samples': 10659840, 'steps': 55519, 'loss/train': 1.4693259000778198} 11/07/2021 05:01:52 - INFO - __main__ - Step 55521: {'lr': 0.00035529104195911255, 'samples': 10660032, 'steps': 55520, 'loss/train': 0.9689898490905762} 11/07/2021 05:01:52 - INFO - __main__ - Step 55522: {'lr': 0.00035528622880093145, 'samples': 10660224, 'steps': 55521, 'loss/train': 1.226699948310852} 11/07/2021 05:01:53 - INFO - __main__ - Step 55523: {'lr': 0.00035528141559530984, 'samples': 10660416, 'steps': 55522, 'loss/train': 1.4950664043426514} 11/07/2021 05:01:53 - INFO - __main__ - Step 55524: {'lr': 0.0003552766023422501, 'samples': 10660608, 'steps': 55523, 'loss/train': 1.5928208827972412} 11/07/2021 05:01:54 - INFO - __main__ - Step 55525: {'lr': 0.00035527178904175435, 'samples': 10660800, 'steps': 55524, 'loss/train': 1.644692063331604} 11/07/2021 05:01:54 - INFO - __main__ - Step 55526: {'lr': 0.0003552669756938247, 'samples': 10660992, 'steps': 55525, 'loss/train': 1.6959644556045532} 11/07/2021 05:01:55 - INFO - __main__ - Step 55527: {'lr': 0.0003552621622984634, 'samples': 10661184, 'steps': 55526, 'loss/train': 1.4864704608917236} 11/07/2021 05:01:55 - INFO - __main__ - Step 55528: {'lr': 0.00035525734885567275, 'samples': 10661376, 'steps': 55527, 'loss/train': 1.743637204170227} 11/07/2021 05:01:55 - INFO - __main__ - Step 55529: {'lr': 0.0003552525353654546, 'samples': 10661568, 'steps': 55528, 'loss/train': 4.9168701171875} 11/07/2021 05:01:56 - INFO - __main__ - Step 55530: {'lr': 0.0003552477218278113, 'samples': 10661760, 'steps': 55529, 'loss/train': 1.2728878259658813} 11/07/2021 05:01:57 - INFO - __main__ - Step 55531: {'lr': 0.00035524290824274504, 'samples': 10661952, 'steps': 55530, 'loss/train': 1.0321156978607178} 11/07/2021 05:01:57 - INFO - __main__ - Step 55532: {'lr': 0.0003552380946102579, 'samples': 10662144, 'steps': 55531, 'loss/train': 0.8566840291023254} 11/07/2021 05:01:57 - INFO - __main__ - Step 55533: {'lr': 0.0003552332809303521, 'samples': 10662336, 'steps': 55532, 'loss/train': 0.9766098260879517} 11/07/2021 05:01:58 - INFO - __main__ - Step 55534: {'lr': 0.0003552284672030298, 'samples': 10662528, 'steps': 55533, 'loss/train': 1.8043036460876465} 11/07/2021 05:01:59 - INFO - __main__ - Step 55535: {'lr': 0.0003552236534282933, 'samples': 10662720, 'steps': 55534, 'loss/train': 1.1812506914138794} 11/07/2021 05:01:59 - INFO - __main__ - Step 55536: {'lr': 0.00035521883960614456, 'samples': 10662912, 'steps': 55535, 'loss/train': 1.427696704864502} 11/07/2021 05:02:00 - INFO - __main__ - Step 55537: {'lr': 0.0003552140257365858, 'samples': 10663104, 'steps': 55536, 'loss/train': 1.419637680053711} 11/07/2021 05:02:00 - INFO - __main__ - Step 55538: {'lr': 0.00035520921181961924, 'samples': 10663296, 'steps': 55537, 'loss/train': 1.5478235483169556} 11/07/2021 05:02:00 - INFO - __main__ - Step 55539: {'lr': 0.00035520439785524703, 'samples': 10663488, 'steps': 55538, 'loss/train': 1.291942834854126} 11/07/2021 05:02:01 - INFO - __main__ - Step 55540: {'lr': 0.00035519958384347134, 'samples': 10663680, 'steps': 55539, 'loss/train': 1.566218376159668} 11/07/2021 05:02:02 - INFO - __main__ - Step 55541: {'lr': 0.00035519476978429433, 'samples': 10663872, 'steps': 55540, 'loss/train': 1.5057703256607056} 11/07/2021 05:02:02 - INFO - __main__ - Step 55542: {'lr': 0.0003551899556777183, 'samples': 10664064, 'steps': 55541, 'loss/train': 1.5911173820495605} 11/07/2021 05:02:02 - INFO - __main__ - Step 55543: {'lr': 0.00035518514152374514, 'samples': 10664256, 'steps': 55542, 'loss/train': 1.2359411716461182} 11/07/2021 05:02:03 - INFO - __main__ - Step 55544: {'lr': 0.00035518032732237724, 'samples': 10664448, 'steps': 55543, 'loss/train': 1.1860450506210327} 11/07/2021 05:02:04 - INFO - __main__ - Step 55545: {'lr': 0.00035517551307361674, 'samples': 10664640, 'steps': 55544, 'loss/train': 3.10489821434021} 11/07/2021 05:02:04 - INFO - __main__ - Step 55546: {'lr': 0.0003551706987774657, 'samples': 10664832, 'steps': 55545, 'loss/train': 1.326438307762146} 11/07/2021 05:02:05 - INFO - __main__ - Step 55547: {'lr': 0.00035516588443392644, 'samples': 10665024, 'steps': 55546, 'loss/train': 1.4802407026290894} 11/07/2021 05:02:05 - INFO - __main__ - Step 55548: {'lr': 0.00035516107004300107, 'samples': 10665216, 'steps': 55547, 'loss/train': 1.7567200660705566} 11/07/2021 05:02:05 - INFO - __main__ - Step 55549: {'lr': 0.00035515625560469174, 'samples': 10665408, 'steps': 55548, 'loss/train': 1.3075168132781982} 11/07/2021 05:02:06 - INFO - __main__ - Step 55550: {'lr': 0.00035515144111900054, 'samples': 10665600, 'steps': 55549, 'loss/train': 1.2171063423156738} 11/07/2021 05:02:07 - INFO - __main__ - Step 55551: {'lr': 0.00035514662658592977, 'samples': 10665792, 'steps': 55550, 'loss/train': 1.5514367818832397} 11/07/2021 05:02:07 - INFO - __main__ - Step 55552: {'lr': 0.0003551418120054816, 'samples': 10665984, 'steps': 55551, 'loss/train': 1.5149608850479126} 11/07/2021 05:02:07 - INFO - __main__ - Step 55553: {'lr': 0.0003551369973776581, 'samples': 10666176, 'steps': 55552, 'loss/train': 1.4682872295379639} 11/07/2021 05:02:08 - INFO - __main__ - Step 55554: {'lr': 0.0003551321827024615, 'samples': 10666368, 'steps': 55553, 'loss/train': 1.1639363765716553} 11/07/2021 05:02:09 - INFO - __main__ - Step 55555: {'lr': 0.0003551273679798939, 'samples': 10666560, 'steps': 55554, 'loss/train': 1.7590302228927612} 11/07/2021 05:02:09 - INFO - __main__ - Step 55556: {'lr': 0.00035512255320995764, 'samples': 10666752, 'steps': 55555, 'loss/train': 1.9666401147842407} 11/07/2021 05:02:09 - INFO - __main__ - Step 55557: {'lr': 0.0003551177383926547, 'samples': 10666944, 'steps': 55556, 'loss/train': 1.5923014879226685} 11/07/2021 05:02:10 - INFO - __main__ - Step 55558: {'lr': 0.00035511292352798736, 'samples': 10667136, 'steps': 55557, 'loss/train': 1.565259575843811} 11/07/2021 05:02:10 - INFO - __main__ - Step 55559: {'lr': 0.0003551081086159578, 'samples': 10667328, 'steps': 55558, 'loss/train': 1.5437698364257812} 11/07/2021 05:02:11 - INFO - __main__ - Step 55560: {'lr': 0.0003551032936565681, 'samples': 10667520, 'steps': 55559, 'loss/train': 1.3968523740768433} 11/07/2021 05:02:12 - INFO - __main__ - Step 55561: {'lr': 0.0003550984786498205, 'samples': 10667712, 'steps': 55560, 'loss/train': 1.0897365808486938} 11/07/2021 05:02:12 - INFO - __main__ - Step 55562: {'lr': 0.0003550936635957171, 'samples': 10667904, 'steps': 55561, 'loss/train': 1.755570888519287} 11/07/2021 05:02:12 - INFO - __main__ - Step 55563: {'lr': 0.00035508884849426014, 'samples': 10668096, 'steps': 55562, 'loss/train': 1.15757417678833} 11/07/2021 05:02:13 - INFO - __main__ - Step 55564: {'lr': 0.0003550840333454518, 'samples': 10668288, 'steps': 55563, 'loss/train': 1.2781257629394531} 11/07/2021 05:02:13 - INFO - __main__ - Step 55565: {'lr': 0.00035507921814929415, 'samples': 10668480, 'steps': 55564, 'loss/train': 1.5428166389465332} 11/07/2021 05:02:14 - INFO - __main__ - Step 55566: {'lr': 0.0003550744029057895, 'samples': 10668672, 'steps': 55565, 'loss/train': 1.2315104007720947} 11/07/2021 05:02:14 - INFO - __main__ - Step 55567: {'lr': 0.0003550695876149399, 'samples': 10668864, 'steps': 55566, 'loss/train': 1.3412801027297974} 11/07/2021 05:02:15 - INFO - __main__ - Step 55568: {'lr': 0.00035506477227674753, 'samples': 10669056, 'steps': 55567, 'loss/train': 0.994350790977478} 11/07/2021 05:02:15 - INFO - __main__ - Step 55569: {'lr': 0.0003550599568912147, 'samples': 10669248, 'steps': 55568, 'loss/train': 1.5082281827926636} 11/07/2021 05:02:15 - INFO - __main__ - Step 55570: {'lr': 0.00035505514145834337, 'samples': 10669440, 'steps': 55569, 'loss/train': 0.8763818144798279} 11/07/2021 05:02:16 - INFO - __main__ - Step 55571: {'lr': 0.0003550503259781359, 'samples': 10669632, 'steps': 55570, 'loss/train': 1.2236778736114502} 11/07/2021 05:02:17 - INFO - __main__ - Step 55572: {'lr': 0.0003550455104505943, 'samples': 10669824, 'steps': 55571, 'loss/train': 1.1714478731155396} 11/07/2021 05:02:17 - INFO - __main__ - Step 55573: {'lr': 0.00035504069487572086, 'samples': 10670016, 'steps': 55572, 'loss/train': 1.3182233572006226} 11/07/2021 05:02:17 - INFO - __main__ - Step 55574: {'lr': 0.00035503587925351767, 'samples': 10670208, 'steps': 55573, 'loss/train': 1.1601752042770386} 11/07/2021 05:02:18 - INFO - __main__ - Step 55575: {'lr': 0.00035503106358398694, 'samples': 10670400, 'steps': 55574, 'loss/train': 1.5089037418365479} 11/07/2021 05:02:19 - INFO - __main__ - Step 55576: {'lr': 0.0003550262478671309, 'samples': 10670592, 'steps': 55575, 'loss/train': 1.5794835090637207} 11/07/2021 05:02:19 - INFO - __main__ - Step 55577: {'lr': 0.00035502143210295163, 'samples': 10670784, 'steps': 55576, 'loss/train': 1.2077217102050781} 11/07/2021 05:02:20 - INFO - __main__ - Step 55578: {'lr': 0.0003550166162914513, 'samples': 10670976, 'steps': 55577, 'loss/train': 1.2896077632904053} 11/07/2021 05:02:20 - INFO - __main__ - Step 55579: {'lr': 0.00035501180043263203, 'samples': 10671168, 'steps': 55578, 'loss/train': 1.2451173067092896} 11/07/2021 05:02:20 - INFO - __main__ - Step 55580: {'lr': 0.00035500698452649613, 'samples': 10671360, 'steps': 55579, 'loss/train': 1.1873743534088135} 11/07/2021 05:02:21 - INFO - __main__ - Step 55581: {'lr': 0.00035500216857304575, 'samples': 10671552, 'steps': 55580, 'loss/train': 1.361046552658081} 11/07/2021 05:02:22 - INFO - __main__ - Step 55582: {'lr': 0.000354997352572283, 'samples': 10671744, 'steps': 55581, 'loss/train': 1.1923409700393677} 11/07/2021 05:02:22 - INFO - __main__ - Step 55583: {'lr': 0.00035499253652421, 'samples': 10671936, 'steps': 55582, 'loss/train': 1.2904736995697021} 11/07/2021 05:02:22 - INFO - __main__ - Step 55584: {'lr': 0.000354987720428829, 'samples': 10672128, 'steps': 55583, 'loss/train': 1.4541096687316895} 11/07/2021 05:02:23 - INFO - __main__ - Step 55585: {'lr': 0.00035498290428614217, 'samples': 10672320, 'steps': 55584, 'loss/train': 1.910554051399231} 11/07/2021 05:02:24 - INFO - __main__ - Step 55586: {'lr': 0.0003549780880961516, 'samples': 10672512, 'steps': 55585, 'loss/train': 1.4025523662567139} 11/07/2021 05:02:24 - INFO - __main__ - Step 55587: {'lr': 0.00035497327185885966, 'samples': 10672704, 'steps': 55586, 'loss/train': 1.4680031538009644} 11/07/2021 05:02:24 - INFO - __main__ - Step 55588: {'lr': 0.00035496845557426824, 'samples': 10672896, 'steps': 55587, 'loss/train': 1.2862597703933716} 11/07/2021 05:02:25 - INFO - __main__ - Step 55589: {'lr': 0.0003549636392423798, 'samples': 10673088, 'steps': 55588, 'loss/train': 4.9805216789245605} 11/07/2021 05:02:25 - INFO - __main__ - Step 55590: {'lr': 0.00035495882286319625, 'samples': 10673280, 'steps': 55589, 'loss/train': 1.4303516149520874} 11/07/2021 05:02:26 - INFO - __main__ - Step 55591: {'lr': 0.0003549540064367199, 'samples': 10673472, 'steps': 55590, 'loss/train': 1.1181601285934448} 11/07/2021 05:02:26 - INFO - __main__ - Step 55592: {'lr': 0.0003549491899629529, 'samples': 10673664, 'steps': 55591, 'loss/train': 1.3412623405456543} 11/07/2021 05:02:27 - INFO - __main__ - Step 55593: {'lr': 0.00035494437344189746, 'samples': 10673856, 'steps': 55592, 'loss/train': 1.5310654640197754} 11/07/2021 05:02:27 - INFO - __main__ - Step 55594: {'lr': 0.0003549395568735556, 'samples': 10674048, 'steps': 55593, 'loss/train': 1.5789437294006348} 11/07/2021 05:02:28 - INFO - __main__ - Step 55595: {'lr': 0.00035493474025792966, 'samples': 10674240, 'steps': 55594, 'loss/train': 1.375770092010498} 11/07/2021 05:02:29 - INFO - __main__ - Step 55596: {'lr': 0.0003549299235950218, 'samples': 10674432, 'steps': 55595, 'loss/train': 1.112752914428711} 11/07/2021 05:02:29 - INFO - __main__ - Step 55597: {'lr': 0.000354925106884834, 'samples': 10674624, 'steps': 55596, 'loss/train': 1.0414875745773315} 11/07/2021 05:02:29 - INFO - __main__ - Step 55598: {'lr': 0.0003549202901273687, 'samples': 10674816, 'steps': 55597, 'loss/train': 1.5901310443878174} 11/07/2021 05:02:30 - INFO - __main__ - Step 55599: {'lr': 0.00035491547332262786, 'samples': 10675008, 'steps': 55598, 'loss/train': 1.6778441667556763} 11/07/2021 05:02:30 - INFO - __main__ - Step 55600: {'lr': 0.00035491065647061377, 'samples': 10675200, 'steps': 55599, 'loss/train': 1.326326608657837} 11/07/2021 05:02:31 - INFO - __main__ - Step 55601: {'lr': 0.0003549058395713285, 'samples': 10675392, 'steps': 55600, 'loss/train': 1.043556809425354} 11/07/2021 05:02:31 - INFO - __main__ - Step 55602: {'lr': 0.00035490102262477436, 'samples': 10675584, 'steps': 55601, 'loss/train': 1.8972359895706177} 11/07/2021 05:02:32 - INFO - __main__ - Step 55603: {'lr': 0.0003548962056309534, 'samples': 10675776, 'steps': 55602, 'loss/train': 0.05763842165470123} 11/07/2021 05:02:32 - INFO - __main__ - Step 55604: {'lr': 0.0003548913885898678, 'samples': 10675968, 'steps': 55603, 'loss/train': 1.331835389137268} 11/07/2021 05:02:32 - INFO - __main__ - Step 55605: {'lr': 0.0003548865715015198, 'samples': 10676160, 'steps': 55604, 'loss/train': 1.4535555839538574} 11/07/2021 05:02:34 - INFO - __main__ - Step 55606: {'lr': 0.00035488175436591146, 'samples': 10676352, 'steps': 55605, 'loss/train': 1.5322768688201904} 11/07/2021 05:02:34 - INFO - __main__ - Step 55607: {'lr': 0.00035487693718304504, 'samples': 10676544, 'steps': 55606, 'loss/train': 1.1713974475860596} 11/07/2021 05:02:34 - INFO - __main__ - Step 55608: {'lr': 0.00035487211995292276, 'samples': 10676736, 'steps': 55607, 'loss/train': 1.6466498374938965} 11/07/2021 05:02:35 - INFO - __main__ - Step 55609: {'lr': 0.00035486730267554666, 'samples': 10676928, 'steps': 55608, 'loss/train': 1.2568548917770386} 11/07/2021 05:02:35 - INFO - __main__ - Step 55610: {'lr': 0.000354862485350919, 'samples': 10677120, 'steps': 55609, 'loss/train': 1.1440668106079102} 11/07/2021 05:02:35 - INFO - __main__ - Step 55611: {'lr': 0.0003548576679790419, 'samples': 10677312, 'steps': 55610, 'loss/train': 0.8220343589782715} 11/07/2021 05:02:36 - INFO - __main__ - Step 55612: {'lr': 0.00035485285055991754, 'samples': 10677504, 'steps': 55611, 'loss/train': 1.3758190870285034} 11/07/2021 05:02:37 - INFO - __main__ - Step 55613: {'lr': 0.00035484803309354814, 'samples': 10677696, 'steps': 55612, 'loss/train': 1.9407711029052734} 11/07/2021 05:02:37 - INFO - __main__ - Step 55614: {'lr': 0.0003548432155799358, 'samples': 10677888, 'steps': 55613, 'loss/train': 1.3740770816802979} 11/07/2021 05:02:38 - INFO - __main__ - Step 55615: {'lr': 0.00035483839801908276, 'samples': 10678080, 'steps': 55614, 'loss/train': 1.1971131563186646} 11/07/2021 05:02:38 - INFO - __main__ - Step 55616: {'lr': 0.00035483358041099117, 'samples': 10678272, 'steps': 55615, 'loss/train': 1.2182449102401733} 11/07/2021 05:02:39 - INFO - __main__ - Step 55617: {'lr': 0.00035482876275566317, 'samples': 10678464, 'steps': 55616, 'loss/train': 1.621694564819336} 11/07/2021 05:02:39 - INFO - __main__ - Step 55618: {'lr': 0.00035482394505310087, 'samples': 10678656, 'steps': 55617, 'loss/train': 1.561873435974121} 11/07/2021 05:02:40 - INFO - __main__ - Step 55619: {'lr': 0.0003548191273033066, 'samples': 10678848, 'steps': 55618, 'loss/train': 0.8010305762290955} 11/07/2021 05:02:40 - INFO - __main__ - Step 55620: {'lr': 0.0003548143095062825, 'samples': 10679040, 'steps': 55619, 'loss/train': 1.3462319374084473} 11/07/2021 05:02:40 - INFO - __main__ - Step 55621: {'lr': 0.00035480949166203057, 'samples': 10679232, 'steps': 55620, 'loss/train': 1.859246015548706} 11/07/2021 05:02:41 - INFO - __main__ - Step 55622: {'lr': 0.00035480467377055314, 'samples': 10679424, 'steps': 55621, 'loss/train': 1.6003825664520264} 11/07/2021 05:02:42 - INFO - __main__ - Step 55623: {'lr': 0.00035479985583185237, 'samples': 10679616, 'steps': 55622, 'loss/train': 0.9864547848701477} 11/07/2021 05:02:42 - INFO - __main__ - Step 55624: {'lr': 0.0003547950378459304, 'samples': 10679808, 'steps': 55623, 'loss/train': 1.4512523412704468} 11/07/2021 05:02:42 - INFO - __main__ - Step 55625: {'lr': 0.00035479021981278935, 'samples': 10680000, 'steps': 55624, 'loss/train': 1.546105146408081} 11/07/2021 05:02:43 - INFO - __main__ - Step 55626: {'lr': 0.0003547854017324315, 'samples': 10680192, 'steps': 55625, 'loss/train': 1.1211549043655396} 11/07/2021 05:02:44 - INFO - __main__ - Step 55627: {'lr': 0.000354780583604859, 'samples': 10680384, 'steps': 55626, 'loss/train': 2.3880438804626465} 11/07/2021 05:02:44 - INFO - __main__ - Step 55628: {'lr': 0.0003547757654300739, 'samples': 10680576, 'steps': 55627, 'loss/train': 1.8930439949035645} 11/07/2021 05:02:45 - INFO - __main__ - Step 55629: {'lr': 0.0003547709472080785, 'samples': 10680768, 'steps': 55628, 'loss/train': 1.3211060762405396} 11/07/2021 05:02:45 - INFO - __main__ - Step 55630: {'lr': 0.00035476612893887494, 'samples': 10680960, 'steps': 55629, 'loss/train': 0.43241170048713684} 11/07/2021 05:02:45 - INFO - __main__ - Step 55631: {'lr': 0.0003547613106224653, 'samples': 10681152, 'steps': 55630, 'loss/train': 1.4644253253936768} 11/07/2021 05:02:46 - INFO - __main__ - Step 55632: {'lr': 0.0003547564922588519, 'samples': 10681344, 'steps': 55631, 'loss/train': 1.2657135725021362} 11/07/2021 05:02:47 - INFO - __main__ - Step 55633: {'lr': 0.0003547516738480369, 'samples': 10681536, 'steps': 55632, 'loss/train': 0.6447150707244873} 11/07/2021 05:02:47 - INFO - __main__ - Step 55634: {'lr': 0.0003547468553900223, 'samples': 10681728, 'steps': 55633, 'loss/train': 0.8582543134689331} 11/07/2021 05:02:47 - INFO - __main__ - Step 55635: {'lr': 0.0003547420368848104, 'samples': 10681920, 'steps': 55634, 'loss/train': 1.463858962059021} 11/07/2021 05:02:48 - INFO - __main__ - Step 55636: {'lr': 0.0003547372183324034, 'samples': 10682112, 'steps': 55635, 'loss/train': 1.176918625831604} 11/07/2021 05:02:49 - INFO - __main__ - Step 55637: {'lr': 0.0003547323997328034, 'samples': 10682304, 'steps': 55636, 'loss/train': 1.400640606880188} 11/07/2021 05:02:49 - INFO - __main__ - Step 55638: {'lr': 0.0003547275810860126, 'samples': 10682496, 'steps': 55637, 'loss/train': 1.6146177053451538} 11/07/2021 05:02:50 - INFO - __main__ - Step 55639: {'lr': 0.00035472276239203315, 'samples': 10682688, 'steps': 55638, 'loss/train': 1.731985092163086} 11/07/2021 05:02:50 - INFO - __main__ - Step 55640: {'lr': 0.00035471794365086724, 'samples': 10682880, 'steps': 55639, 'loss/train': 1.5149266719818115} 11/07/2021 05:02:50 - INFO - __main__ - Step 55641: {'lr': 0.00035471312486251707, 'samples': 10683072, 'steps': 55640, 'loss/train': 1.0393422842025757} 11/07/2021 05:02:51 - INFO - __main__ - Step 55642: {'lr': 0.0003547083060269848, 'samples': 10683264, 'steps': 55641, 'loss/train': 1.256230115890503} 11/07/2021 05:02:52 - INFO - __main__ - Step 55643: {'lr': 0.00035470348714427256, 'samples': 10683456, 'steps': 55642, 'loss/train': 1.4536901712417603} 11/07/2021 05:02:52 - INFO - __main__ - Step 55644: {'lr': 0.0003546986682143825, 'samples': 10683648, 'steps': 55643, 'loss/train': 1.0605077743530273} 11/07/2021 05:02:53 - INFO - __main__ - Step 55645: {'lr': 0.0003546938492373169, 'samples': 10683840, 'steps': 55644, 'loss/train': 1.6237866878509521} 11/07/2021 05:02:53 - INFO - __main__ - Step 55646: {'lr': 0.0003546890302130778, 'samples': 10684032, 'steps': 55645, 'loss/train': 1.383773922920227} 11/07/2021 05:02:54 - INFO - __main__ - Step 55647: {'lr': 0.0003546842111416675, 'samples': 10684224, 'steps': 55646, 'loss/train': 2.5310633182525635} 11/07/2021 05:02:54 - INFO - __main__ - Step 55648: {'lr': 0.0003546793920230881, 'samples': 10684416, 'steps': 55647, 'loss/train': 1.7178682088851929} 11/07/2021 05:02:55 - INFO - __main__ - Step 55649: {'lr': 0.0003546745728573418, 'samples': 10684608, 'steps': 55648, 'loss/train': 1.7007197141647339} 11/07/2021 05:02:55 - INFO - __main__ - Step 55650: {'lr': 0.0003546697536444307, 'samples': 10684800, 'steps': 55649, 'loss/train': 1.089084267616272} 11/07/2021 05:02:55 - INFO - __main__ - Step 55651: {'lr': 0.00035466493438435703, 'samples': 10684992, 'steps': 55650, 'loss/train': 1.3870054483413696} 11/07/2021 05:02:57 - INFO - __main__ - Step 55652: {'lr': 0.000354660115077123, 'samples': 10685184, 'steps': 55651, 'loss/train': 1.2411152124404907} 11/07/2021 05:02:57 - INFO - __main__ - Step 55653: {'lr': 0.0003546552957227307, 'samples': 10685376, 'steps': 55652, 'loss/train': 1.4214160442352295} 11/07/2021 05:02:57 - INFO - __main__ - Step 55654: {'lr': 0.0003546504763211823, 'samples': 10685568, 'steps': 55653, 'loss/train': 1.6649715900421143} 11/07/2021 05:02:58 - INFO - __main__ - Step 55655: {'lr': 0.0003546456568724801, 'samples': 10685760, 'steps': 55654, 'loss/train': 2.0860495567321777} 11/07/2021 05:02:58 - INFO - __main__ - Step 55656: {'lr': 0.0003546408373766262, 'samples': 10685952, 'steps': 55655, 'loss/train': 1.3127493858337402} 11/07/2021 05:02:59 - INFO - __main__ - Step 55657: {'lr': 0.0003546360178336226, 'samples': 10686144, 'steps': 55656, 'loss/train': 1.2331385612487793} 11/07/2021 05:02:59 - INFO - __main__ - Step 55658: {'lr': 0.0003546311982434717, 'samples': 10686336, 'steps': 55657, 'loss/train': 1.3542340993881226} 11/07/2021 05:03:00 - INFO - __main__ - Step 55659: {'lr': 0.00035462637860617563, 'samples': 10686528, 'steps': 55658, 'loss/train': 1.4579474925994873} 11/07/2021 05:03:00 - INFO - __main__ - Step 55660: {'lr': 0.00035462155892173654, 'samples': 10686720, 'steps': 55659, 'loss/train': 1.4526863098144531} 11/07/2021 05:03:00 - INFO - __main__ - Step 55661: {'lr': 0.0003546167391901566, 'samples': 10686912, 'steps': 55660, 'loss/train': 1.1624071598052979} 11/07/2021 05:03:01 - INFO - __main__ - Step 55662: {'lr': 0.0003546119194114379, 'samples': 10687104, 'steps': 55661, 'loss/train': 1.5463212728500366} 11/07/2021 05:03:02 - INFO - __main__ - Step 55663: {'lr': 0.00035460709958558273, 'samples': 10687296, 'steps': 55662, 'loss/train': 1.1025784015655518} 11/07/2021 05:03:02 - INFO - __main__ - Step 55664: {'lr': 0.0003546022797125932, 'samples': 10687488, 'steps': 55663, 'loss/train': 1.408887267112732} 11/07/2021 05:03:02 - INFO - __main__ - Step 55665: {'lr': 0.00035459745979247146, 'samples': 10687680, 'steps': 55664, 'loss/train': 1.5203648805618286} 11/07/2021 05:03:03 - INFO - __main__ - Step 55666: {'lr': 0.00035459263982521975, 'samples': 10687872, 'steps': 55665, 'loss/train': 1.1913645267486572} 11/07/2021 05:03:03 - INFO - __main__ - Step 55667: {'lr': 0.00035458781981084026, 'samples': 10688064, 'steps': 55666, 'loss/train': 1.3048712015151978} 11/07/2021 05:03:04 - INFO - __main__ - Step 55668: {'lr': 0.00035458299974933506, 'samples': 10688256, 'steps': 55667, 'loss/train': 1.9954164028167725} 11/07/2021 05:03:05 - INFO - __main__ - Step 55669: {'lr': 0.00035457817964070637, 'samples': 10688448, 'steps': 55668, 'loss/train': 1.4409780502319336} 11/07/2021 05:03:05 - INFO - __main__ - Step 55670: {'lr': 0.0003545733594849564, 'samples': 10688640, 'steps': 55669, 'loss/train': 1.484114408493042} 11/07/2021 05:03:05 - INFO - __main__ - Step 55671: {'lr': 0.0003545685392820873, 'samples': 10688832, 'steps': 55670, 'loss/train': 1.255584716796875} 11/07/2021 05:03:06 - INFO - __main__ - Step 55672: {'lr': 0.0003545637190321012, 'samples': 10689024, 'steps': 55671, 'loss/train': 0.7551189064979553} 11/07/2021 05:03:07 - INFO - __main__ - Step 55673: {'lr': 0.00035455889873500026, 'samples': 10689216, 'steps': 55672, 'loss/train': 1.4032310247421265} 11/07/2021 05:03:07 - INFO - __main__ - Step 55674: {'lr': 0.00035455407839078673, 'samples': 10689408, 'steps': 55673, 'loss/train': 1.6993948221206665} 11/07/2021 05:03:07 - INFO - __main__ - Step 55675: {'lr': 0.00035454925799946273, 'samples': 10689600, 'steps': 55674, 'loss/train': 1.3456084728240967} 11/07/2021 05:03:08 - INFO - __main__ - Step 55676: {'lr': 0.0003545444375610306, 'samples': 10689792, 'steps': 55675, 'loss/train': 1.4696476459503174} 11/07/2021 05:03:08 - INFO - __main__ - Step 55677: {'lr': 0.0003545396170754922, 'samples': 10689984, 'steps': 55676, 'loss/train': 0.8574563264846802} 11/07/2021 05:03:09 - INFO - __main__ - Step 55678: {'lr': 0.0003545347965428498, 'samples': 10690176, 'steps': 55677, 'loss/train': 1.7760615348815918} 11/07/2021 05:03:09 - INFO - __main__ - Step 55679: {'lr': 0.00035452997596310576, 'samples': 10690368, 'steps': 55678, 'loss/train': 1.7549160718917847} 11/07/2021 05:03:10 - INFO - __main__ - Step 55680: {'lr': 0.00035452515533626204, 'samples': 10690560, 'steps': 55679, 'loss/train': 1.2228102684020996} 11/07/2021 05:03:10 - INFO - __main__ - Step 55681: {'lr': 0.00035452033466232095, 'samples': 10690752, 'steps': 55680, 'loss/train': 1.4908599853515625} 11/07/2021 05:03:11 - INFO - __main__ - Step 55682: {'lr': 0.0003545155139412847, 'samples': 10690944, 'steps': 55681, 'loss/train': 1.7345000505447388} 11/07/2021 05:03:11 - INFO - __main__ - Step 55683: {'lr': 0.00035451069317315526, 'samples': 10691136, 'steps': 55682, 'loss/train': 1.1505171060562134} 11/07/2021 05:03:12 - INFO - __main__ - Step 55684: {'lr': 0.00035450587235793493, 'samples': 10691328, 'steps': 55683, 'loss/train': 1.6514390707015991} 11/07/2021 05:03:12 - INFO - __main__ - Step 55685: {'lr': 0.0003545010514956258, 'samples': 10691520, 'steps': 55684, 'loss/train': 1.8190395832061768} 11/07/2021 05:03:13 - INFO - __main__ - Step 55686: {'lr': 0.0003544962305862302, 'samples': 10691712, 'steps': 55685, 'loss/train': 1.4316520690917969} 11/07/2021 05:03:13 - INFO - __main__ - Step 55687: {'lr': 0.0003544914096297502, 'samples': 10691904, 'steps': 55686, 'loss/train': 1.3914636373519897} 11/07/2021 05:03:13 - INFO - __main__ - Step 55688: {'lr': 0.000354486588626188, 'samples': 10692096, 'steps': 55687, 'loss/train': 1.4303710460662842} 11/07/2021 05:03:14 - INFO - __main__ - Step 55689: {'lr': 0.00035448176757554574, 'samples': 10692288, 'steps': 55688, 'loss/train': 1.674943208694458} 11/07/2021 05:03:15 - INFO - __main__ - Step 55690: {'lr': 0.0003544769464778256, 'samples': 10692480, 'steps': 55689, 'loss/train': 2.552509069442749} 11/07/2021 05:03:15 - INFO - __main__ - Step 55691: {'lr': 0.00035447212533302975, 'samples': 10692672, 'steps': 55690, 'loss/train': 1.3160301446914673} 11/07/2021 05:03:15 - INFO - __main__ - Step 55692: {'lr': 0.00035446730414116036, 'samples': 10692864, 'steps': 55691, 'loss/train': 1.3368175029754639} 11/07/2021 05:03:16 - INFO - __main__ - Step 55693: {'lr': 0.00035446248290221967, 'samples': 10693056, 'steps': 55692, 'loss/train': 1.7469947338104248} 11/07/2021 05:03:17 - INFO - __main__ - Step 55694: {'lr': 0.00035445766161620976, 'samples': 10693248, 'steps': 55693, 'loss/train': 1.2765860557556152} 11/07/2021 05:03:17 - INFO - __main__ - Step 55695: {'lr': 0.00035445284028313284, 'samples': 10693440, 'steps': 55694, 'loss/train': 1.7284281253814697} 11/07/2021 05:03:17 - INFO - __main__ - Step 55696: {'lr': 0.00035444801890299103, 'samples': 10693632, 'steps': 55695, 'loss/train': 1.036617398262024} 11/07/2021 05:03:18 - INFO - __main__ - Step 55697: {'lr': 0.0003544431974757866, 'samples': 10693824, 'steps': 55696, 'loss/train': 0.9842028021812439} 11/07/2021 05:03:18 - INFO - __main__ - Step 55698: {'lr': 0.00035443837600152174, 'samples': 10694016, 'steps': 55697, 'loss/train': 1.3776626586914062} 11/07/2021 05:03:19 - INFO - __main__ - Step 55699: {'lr': 0.00035443355448019854, 'samples': 10694208, 'steps': 55698, 'loss/train': 1.433046579360962} 11/07/2021 05:03:19 - INFO - __main__ - Step 55700: {'lr': 0.0003544287329118191, 'samples': 10694400, 'steps': 55699, 'loss/train': 1.37983238697052} 11/07/2021 05:03:20 - INFO - __main__ - Step 55701: {'lr': 0.0003544239112963857, 'samples': 10694592, 'steps': 55700, 'loss/train': 1.3031537532806396} 11/07/2021 05:03:20 - INFO - __main__ - Step 55702: {'lr': 0.0003544190896339006, 'samples': 10694784, 'steps': 55701, 'loss/train': 1.7372462749481201} 11/07/2021 05:03:21 - INFO - __main__ - Step 55703: {'lr': 0.00035441426792436574, 'samples': 10694976, 'steps': 55702, 'loss/train': 1.5571116209030151} 11/07/2021 05:03:22 - INFO - __main__ - Step 55704: {'lr': 0.0003544094461677836, 'samples': 10695168, 'steps': 55703, 'loss/train': 1.8758482933044434} 11/07/2021 05:03:22 - INFO - __main__ - Step 55705: {'lr': 0.000354404624364156, 'samples': 10695360, 'steps': 55704, 'loss/train': 1.7318722009658813} 11/07/2021 05:03:22 - INFO - __main__ - Step 55706: {'lr': 0.00035439980251348533, 'samples': 10695552, 'steps': 55705, 'loss/train': 1.0945961475372314} 11/07/2021 05:03:23 - INFO - __main__ - Step 55707: {'lr': 0.0003543949806157738, 'samples': 10695744, 'steps': 55706, 'loss/train': 1.7378387451171875} 11/07/2021 05:03:23 - INFO - __main__ - Step 55708: {'lr': 0.0003543901586710234, 'samples': 10695936, 'steps': 55707, 'loss/train': 1.4655539989471436} 11/07/2021 05:03:23 - INFO - __main__ - Step 55709: {'lr': 0.00035438533667923644, 'samples': 10696128, 'steps': 55708, 'loss/train': 1.4681276082992554} 11/07/2021 05:03:24 - INFO - __main__ - Step 55710: {'lr': 0.0003543805146404151, 'samples': 10696320, 'steps': 55709, 'loss/train': 1.5186320543289185} 11/07/2021 05:03:25 - INFO - __main__ - Step 55711: {'lr': 0.0003543756925545615, 'samples': 10696512, 'steps': 55710, 'loss/train': 1.5182175636291504} 11/07/2021 05:03:25 - INFO - __main__ - Step 55712: {'lr': 0.0003543708704216778, 'samples': 10696704, 'steps': 55711, 'loss/train': 1.3943164348602295} 11/07/2021 05:03:25 - INFO - __main__ - Step 55713: {'lr': 0.00035436604824176616, 'samples': 10696896, 'steps': 55712, 'loss/train': 1.1336770057678223} 11/07/2021 05:03:26 - INFO - __main__ - Step 55714: {'lr': 0.0003543612260148288, 'samples': 10697088, 'steps': 55713, 'loss/train': 1.567400336265564} 11/07/2021 05:03:27 - INFO - __main__ - Step 55715: {'lr': 0.0003543564037408679, 'samples': 10697280, 'steps': 55714, 'loss/train': 1.5476655960083008} 11/07/2021 05:03:27 - INFO - __main__ - Step 55716: {'lr': 0.00035435158141988564, 'samples': 10697472, 'steps': 55715, 'loss/train': 1.5898628234863281} 11/07/2021 05:03:27 - INFO - __main__ - Step 55717: {'lr': 0.0003543467590518842, 'samples': 10697664, 'steps': 55716, 'loss/train': 1.419777274131775} 11/07/2021 05:03:28 - INFO - __main__ - Step 55718: {'lr': 0.00035434193663686566, 'samples': 10697856, 'steps': 55717, 'loss/train': 1.6237761974334717} 11/07/2021 05:03:28 - INFO - __main__ - Step 55719: {'lr': 0.0003543371141748323, 'samples': 10698048, 'steps': 55718, 'loss/train': 1.4816151857376099} 11/07/2021 05:03:30 - INFO - __main__ - Step 55720: {'lr': 0.0003543322916657862, 'samples': 10698240, 'steps': 55719, 'loss/train': 1.4831697940826416} 11/07/2021 05:03:30 - INFO - __main__ - Step 55721: {'lr': 0.0003543274691097295, 'samples': 10698432, 'steps': 55720, 'loss/train': 1.7541534900665283} 11/07/2021 05:03:30 - INFO - __main__ - Step 55722: {'lr': 0.00035432264650666457, 'samples': 10698624, 'steps': 55721, 'loss/train': 1.258467674255371} 11/07/2021 05:03:31 - INFO - __main__ - Step 55723: {'lr': 0.0003543178238565935, 'samples': 10698816, 'steps': 55722, 'loss/train': 1.677546739578247} 11/07/2021 05:03:31 - INFO - __main__ - Step 55724: {'lr': 0.0003543130011595183, 'samples': 10699008, 'steps': 55723, 'loss/train': 1.0825533866882324} 11/07/2021 05:03:32 - INFO - __main__ - Step 55725: {'lr': 0.0003543081784154414, 'samples': 10699200, 'steps': 55724, 'loss/train': 0.9049617648124695} 11/07/2021 05:03:32 - INFO - __main__ - Step 55726: {'lr': 0.00035430335562436474, 'samples': 10699392, 'steps': 55725, 'loss/train': 1.016931414604187} 11/07/2021 05:03:33 - INFO - __main__ - Step 55727: {'lr': 0.00035429853278629063, 'samples': 10699584, 'steps': 55726, 'loss/train': 1.0655796527862549} 11/07/2021 05:03:33 - INFO - __main__ - Step 55728: {'lr': 0.00035429370990122124, 'samples': 10699776, 'steps': 55727, 'loss/train': 1.3815515041351318} 11/07/2021 05:03:33 - INFO - __main__ - Step 55729: {'lr': 0.0003542888869691586, 'samples': 10699968, 'steps': 55728, 'loss/train': 1.3427400588989258} 11/07/2021 05:03:34 - INFO - __main__ - Step 55730: {'lr': 0.00035428406399010516, 'samples': 10700160, 'steps': 55729, 'loss/train': 1.446705937385559} 11/07/2021 05:03:35 - INFO - __main__ - Step 55731: {'lr': 0.00035427924096406287, 'samples': 10700352, 'steps': 55730, 'loss/train': 1.5031720399856567} 11/07/2021 05:03:35 - INFO - __main__ - Step 55732: {'lr': 0.00035427441789103397, 'samples': 10700544, 'steps': 55731, 'loss/train': 1.099146842956543} 11/07/2021 05:03:35 - INFO - __main__ - Step 55733: {'lr': 0.0003542695947710206, 'samples': 10700736, 'steps': 55732, 'loss/train': 1.984749674797058} 11/07/2021 05:03:36 - INFO - __main__ - Step 55734: {'lr': 0.00035426477160402495, 'samples': 10700928, 'steps': 55733, 'loss/train': 1.5043107271194458} 11/07/2021 05:03:37 - INFO - __main__ - Step 55735: {'lr': 0.0003542599483900492, 'samples': 10701120, 'steps': 55734, 'loss/train': 1.2688812017440796} 11/07/2021 05:03:37 - INFO - __main__ - Step 55736: {'lr': 0.00035425512512909555, 'samples': 10701312, 'steps': 55735, 'loss/train': 1.0769466161727905} 11/07/2021 05:03:37 - INFO - __main__ - Step 55737: {'lr': 0.00035425030182116617, 'samples': 10701504, 'steps': 55736, 'loss/train': 1.056416392326355} 11/07/2021 05:03:38 - INFO - __main__ - Step 55738: {'lr': 0.0003542454784662632, 'samples': 10701696, 'steps': 55737, 'loss/train': 1.3985483646392822} 11/07/2021 05:03:38 - INFO - __main__ - Step 55739: {'lr': 0.00035424065506438877, 'samples': 10701888, 'steps': 55738, 'loss/train': 1.5039695501327515} 11/07/2021 05:03:39 - INFO - __main__ - Step 55740: {'lr': 0.0003542358316155452, 'samples': 10702080, 'steps': 55739, 'loss/train': 1.6511571407318115} 11/07/2021 05:03:39 - INFO - __main__ - Step 55741: {'lr': 0.00035423100811973453, 'samples': 10702272, 'steps': 55740, 'loss/train': 1.7982109785079956} 11/07/2021 05:03:40 - INFO - __main__ - Step 55742: {'lr': 0.00035422618457695893, 'samples': 10702464, 'steps': 55741, 'loss/train': 1.3135054111480713} 11/07/2021 05:03:40 - INFO - __main__ - Step 55743: {'lr': 0.0003542213609872207, 'samples': 10702656, 'steps': 55742, 'loss/train': 1.3639512062072754} 11/07/2021 05:03:41 - INFO - __main__ - Step 55744: {'lr': 0.0003542165373505219, 'samples': 10702848, 'steps': 55743, 'loss/train': 1.1558784246444702} 11/07/2021 05:03:42 - INFO - __main__ - Step 55745: {'lr': 0.0003542117136668647, 'samples': 10703040, 'steps': 55744, 'loss/train': 1.5514709949493408} 11/07/2021 05:03:42 - INFO - __main__ - Step 55746: {'lr': 0.0003542068899362514, 'samples': 10703232, 'steps': 55745, 'loss/train': 1.05830717086792} 11/07/2021 05:03:43 - INFO - __main__ - Step 55747: {'lr': 0.000354202066158684, 'samples': 10703424, 'steps': 55746, 'loss/train': 1.4797271490097046} 11/07/2021 05:03:43 - INFO - __main__ - Step 55748: {'lr': 0.0003541972423341648, 'samples': 10703616, 'steps': 55747, 'loss/train': 0.926233172416687} 11/07/2021 05:03:43 - INFO - __main__ - Step 55749: {'lr': 0.0003541924184626959, 'samples': 10703808, 'steps': 55748, 'loss/train': 1.4752252101898193} 11/07/2021 05:03:44 - INFO - __main__ - Step 55750: {'lr': 0.00035418759454427953, 'samples': 10704000, 'steps': 55749, 'loss/train': 1.860127329826355} 11/07/2021 05:03:45 - INFO - __main__ - Step 55751: {'lr': 0.00035418277057891776, 'samples': 10704192, 'steps': 55750, 'loss/train': 1.6169946193695068} 11/07/2021 05:03:45 - INFO - __main__ - Step 55752: {'lr': 0.00035417794656661297, 'samples': 10704384, 'steps': 55751, 'loss/train': 1.3335211277008057} 11/07/2021 05:03:45 - INFO - __main__ - Step 55753: {'lr': 0.0003541731225073671, 'samples': 10704576, 'steps': 55752, 'loss/train': 1.571130394935608} 11/07/2021 05:03:46 - INFO - __main__ - Step 55754: {'lr': 0.0003541682984011825, 'samples': 10704768, 'steps': 55753, 'loss/train': 1.1665571928024292} 11/07/2021 05:03:46 - INFO - __main__ - Step 55755: {'lr': 0.00035416347424806124, 'samples': 10704960, 'steps': 55754, 'loss/train': 1.2701736688613892} 11/07/2021 05:03:47 - INFO - __main__ - Step 55756: {'lr': 0.00035415865004800553, 'samples': 10705152, 'steps': 55755, 'loss/train': 1.2953274250030518} 11/07/2021 05:03:47 - INFO - __main__ - Step 55757: {'lr': 0.00035415382580101753, 'samples': 10705344, 'steps': 55756, 'loss/train': 1.617814302444458} 11/07/2021 05:03:48 - INFO - __main__ - Step 55758: {'lr': 0.00035414900150709946, 'samples': 10705536, 'steps': 55757, 'loss/train': 1.571497917175293} 11/07/2021 05:03:48 - INFO - __main__ - Step 55759: {'lr': 0.00035414417716625343, 'samples': 10705728, 'steps': 55758, 'loss/train': 1.2555686235427856} 11/07/2021 05:03:48 - INFO - __main__ - Step 55760: {'lr': 0.00035413935277848156, 'samples': 10705920, 'steps': 55759, 'loss/train': 1.4754409790039062} 11/07/2021 05:03:50 - INFO - __main__ - Step 55761: {'lr': 0.00035413452834378624, 'samples': 10706112, 'steps': 55760, 'loss/train': 1.7791800498962402} 11/07/2021 05:03:50 - INFO - __main__ - Step 55762: {'lr': 0.0003541297038621694, 'samples': 10706304, 'steps': 55761, 'loss/train': 1.4712127447128296} 11/07/2021 05:03:50 - INFO - __main__ - Step 55763: {'lr': 0.00035412487933363335, 'samples': 10706496, 'steps': 55762, 'loss/train': 0.13739466667175293} 11/07/2021 05:03:51 - INFO - __main__ - Step 55764: {'lr': 0.00035412005475818033, 'samples': 10706688, 'steps': 55763, 'loss/train': 1.9544419050216675} 11/07/2021 05:03:51 - INFO - __main__ - Step 55765: {'lr': 0.0003541152301358124, 'samples': 10706880, 'steps': 55764, 'loss/train': 1.2634907960891724} 11/07/2021 05:03:52 - INFO - __main__ - Step 55766: {'lr': 0.0003541104054665316, 'samples': 10707072, 'steps': 55765, 'loss/train': 1.4438755512237549} 11/07/2021 05:03:52 - INFO - __main__ - Step 55767: {'lr': 0.0003541055807503404, 'samples': 10707264, 'steps': 55766, 'loss/train': 1.4670195579528809} 11/07/2021 05:03:53 - INFO - __main__ - Step 55768: {'lr': 0.0003541007559872408, 'samples': 10707456, 'steps': 55767, 'loss/train': 1.658206820487976} 11/07/2021 05:03:53 - INFO - __main__ - Step 55769: {'lr': 0.000354095931177235, 'samples': 10707648, 'steps': 55768, 'loss/train': 1.565147876739502} 11/07/2021 05:03:53 - INFO - __main__ - Step 55770: {'lr': 0.0003540911063203252, 'samples': 10707840, 'steps': 55769, 'loss/train': 1.3767871856689453} 11/07/2021 05:03:54 - INFO - __main__ - Step 55771: {'lr': 0.00035408628141651356, 'samples': 10708032, 'steps': 55770, 'loss/train': 1.4693639278411865} 11/07/2021 05:03:55 - INFO - __main__ - Step 55772: {'lr': 0.0003540814564658022, 'samples': 10708224, 'steps': 55771, 'loss/train': 1.3036214113235474} 11/07/2021 05:03:55 - INFO - __main__ - Step 55773: {'lr': 0.00035407663146819337, 'samples': 10708416, 'steps': 55772, 'loss/train': 1.8937538862228394} 11/07/2021 05:03:55 - INFO - __main__ - Step 55774: {'lr': 0.0003540718064236892, 'samples': 10708608, 'steps': 55773, 'loss/train': 1.8074778318405151} 11/07/2021 05:03:56 - INFO - __main__ - Step 55775: {'lr': 0.0003540669813322919, 'samples': 10708800, 'steps': 55774, 'loss/train': 1.460472583770752} 11/07/2021 05:03:56 - INFO - __main__ - Step 55776: {'lr': 0.00035406215619400357, 'samples': 10708992, 'steps': 55775, 'loss/train': 1.9324228763580322} 11/07/2021 05:03:57 - INFO - __main__ - Step 55777: {'lr': 0.00035405733100882654, 'samples': 10709184, 'steps': 55776, 'loss/train': 1.4910401105880737} 11/07/2021 05:03:58 - INFO - __main__ - Step 55778: {'lr': 0.0003540525057767628, 'samples': 10709376, 'steps': 55777, 'loss/train': 1.679534912109375} 11/07/2021 05:03:58 - INFO - __main__ - Step 55779: {'lr': 0.0003540476804978146, 'samples': 10709568, 'steps': 55778, 'loss/train': 1.5839797258377075} 11/07/2021 05:03:58 - INFO - __main__ - Step 55780: {'lr': 0.00035404285517198417, 'samples': 10709760, 'steps': 55779, 'loss/train': 1.5468313694000244} 11/07/2021 05:03:59 - INFO - __main__ - Step 55781: {'lr': 0.00035403802979927355, 'samples': 10709952, 'steps': 55780, 'loss/train': 1.651582956314087} 11/07/2021 05:04:00 - INFO - __main__ - Step 55782: {'lr': 0.0003540332043796851, 'samples': 10710144, 'steps': 55781, 'loss/train': 1.212729811668396} 11/07/2021 05:04:00 - INFO - __main__ - Step 55783: {'lr': 0.00035402837891322083, 'samples': 10710336, 'steps': 55782, 'loss/train': 1.1240630149841309} 11/07/2021 05:04:00 - INFO - __main__ - Step 55784: {'lr': 0.00035402355339988307, 'samples': 10710528, 'steps': 55783, 'loss/train': 1.4452277421951294} 11/07/2021 05:04:01 - INFO - __main__ - Step 55785: {'lr': 0.00035401872783967384, 'samples': 10710720, 'steps': 55784, 'loss/train': 1.333119511604309} 11/07/2021 05:04:01 - INFO - __main__ - Step 55786: {'lr': 0.00035401390223259536, 'samples': 10710912, 'steps': 55785, 'loss/train': 1.4963150024414062} 11/07/2021 05:04:02 - INFO - __main__ - Step 55787: {'lr': 0.0003540090765786498, 'samples': 10711104, 'steps': 55786, 'loss/train': 1.3810827732086182} 11/07/2021 05:04:03 - INFO - __main__ - Step 55788: {'lr': 0.0003540042508778394, 'samples': 10711296, 'steps': 55787, 'loss/train': 6.768846035003662} 11/07/2021 05:04:03 - INFO - __main__ - Step 55789: {'lr': 0.00035399942513016623, 'samples': 10711488, 'steps': 55788, 'loss/train': 1.3474509716033936} 11/07/2021 05:04:03 - INFO - __main__ - Step 55790: {'lr': 0.0003539945993356326, 'samples': 10711680, 'steps': 55789, 'loss/train': 1.1473783254623413} 11/07/2021 05:04:04 - INFO - __main__ - Step 55791: {'lr': 0.0003539897734942406, 'samples': 10711872, 'steps': 55790, 'loss/train': 1.2376493215560913} 11/07/2021 05:04:04 - INFO - __main__ - Step 55792: {'lr': 0.00035398494760599243, 'samples': 10712064, 'steps': 55791, 'loss/train': 1.5866944789886475} 11/07/2021 05:04:05 - INFO - __main__ - Step 55793: {'lr': 0.00035398012167089016, 'samples': 10712256, 'steps': 55792, 'loss/train': 1.3174208402633667} 11/07/2021 05:04:05 - INFO - __main__ - Step 55794: {'lr': 0.0003539752956889361, 'samples': 10712448, 'steps': 55793, 'loss/train': 1.541314721107483} 11/07/2021 05:04:06 - INFO - __main__ - Step 55795: {'lr': 0.00035397046966013235, 'samples': 10712640, 'steps': 55794, 'loss/train': 1.03879714012146} 11/07/2021 05:04:06 - INFO - __main__ - Step 55796: {'lr': 0.00035396564358448115, 'samples': 10712832, 'steps': 55795, 'loss/train': 1.6715527772903442} 11/07/2021 05:04:06 - INFO - __main__ - Step 55797: {'lr': 0.00035396081746198467, 'samples': 10713024, 'steps': 55796, 'loss/train': 0.9951236844062805} 11/07/2021 05:04:08 - INFO - __main__ - Step 55798: {'lr': 0.000353955991292645, 'samples': 10713216, 'steps': 55797, 'loss/train': 1.7825127840042114} 11/07/2021 05:04:08 - INFO - __main__ - Step 55799: {'lr': 0.00035395116507646435, 'samples': 10713408, 'steps': 55798, 'loss/train': 1.3425874710083008} 11/07/2021 05:04:08 - INFO - __main__ - Step 55800: {'lr': 0.00035394633881344497, 'samples': 10713600, 'steps': 55799, 'loss/train': 1.4844225645065308} 11/07/2021 05:04:09 - INFO - __main__ - Step 55801: {'lr': 0.00035394151250358886, 'samples': 10713792, 'steps': 55800, 'loss/train': 1.30772066116333} 11/07/2021 05:04:09 - INFO - __main__ - Step 55802: {'lr': 0.00035393668614689837, 'samples': 10713984, 'steps': 55801, 'loss/train': 1.5883336067199707} 11/07/2021 05:04:09 - INFO - __main__ - Step 55803: {'lr': 0.00035393185974337565, 'samples': 10714176, 'steps': 55802, 'loss/train': 1.5440160036087036} 11/07/2021 05:04:10 - INFO - __main__ - Step 55804: {'lr': 0.0003539270332930228, 'samples': 10714368, 'steps': 55803, 'loss/train': 1.5361167192459106} 11/07/2021 05:04:11 - INFO - __main__ - Step 55805: {'lr': 0.00035392220679584206, 'samples': 10714560, 'steps': 55804, 'loss/train': 0.9911054372787476} 11/07/2021 05:04:11 - INFO - __main__ - Step 55806: {'lr': 0.0003539173802518356, 'samples': 10714752, 'steps': 55805, 'loss/train': 1.6084593534469604} 11/07/2021 05:04:12 - INFO - __main__ - Step 55807: {'lr': 0.0003539125536610055, 'samples': 10714944, 'steps': 55806, 'loss/train': 0.6804617643356323} 11/07/2021 05:04:12 - INFO - __main__ - Step 55808: {'lr': 0.00035390772702335405, 'samples': 10715136, 'steps': 55807, 'loss/train': 1.5012800693511963} 11/07/2021 05:04:13 - INFO - __main__ - Step 55809: {'lr': 0.0003539029003388833, 'samples': 10715328, 'steps': 55808, 'loss/train': 1.6742792129516602} 11/07/2021 05:04:13 - INFO - __main__ - Step 55810: {'lr': 0.0003538980736075956, 'samples': 10715520, 'steps': 55809, 'loss/train': 1.4297534227371216} 11/07/2021 05:04:14 - INFO - __main__ - Step 55811: {'lr': 0.0003538932468294931, 'samples': 10715712, 'steps': 55810, 'loss/train': 1.3954731225967407} 11/07/2021 05:04:14 - INFO - __main__ - Step 55812: {'lr': 0.0003538884200045778, 'samples': 10715904, 'steps': 55811, 'loss/train': 1.2371412515640259} 11/07/2021 05:04:14 - INFO - __main__ - Step 55813: {'lr': 0.00035388359313285196, 'samples': 10716096, 'steps': 55812, 'loss/train': 1.278853178024292} 11/07/2021 05:04:15 - INFO - __main__ - Step 55814: {'lr': 0.0003538787662143178, 'samples': 10716288, 'steps': 55813, 'loss/train': 1.3808411359786987} 11/07/2021 05:04:16 - INFO - __main__ - Step 55815: {'lr': 0.00035387393924897747, 'samples': 10716480, 'steps': 55814, 'loss/train': 1.3120583295822144} 11/07/2021 05:04:16 - INFO - __main__ - Step 55816: {'lr': 0.0003538691122368332, 'samples': 10716672, 'steps': 55815, 'loss/train': 1.2973705530166626} 11/07/2021 05:04:16 - INFO - __main__ - Step 55817: {'lr': 0.00035386428517788707, 'samples': 10716864, 'steps': 55816, 'loss/train': 1.9615147113800049} 11/07/2021 05:04:17 - INFO - __main__ - Step 55818: {'lr': 0.00035385945807214124, 'samples': 10717056, 'steps': 55817, 'loss/train': 1.3233554363250732} 11/07/2021 05:04:18 - INFO - __main__ - Step 55819: {'lr': 0.000353854630919598, 'samples': 10717248, 'steps': 55818, 'loss/train': 1.0138968229293823} 11/07/2021 05:04:18 - INFO - __main__ - Step 55820: {'lr': 0.0003538498037202595, 'samples': 10717440, 'steps': 55819, 'loss/train': 1.7565664052963257} 11/07/2021 05:04:18 - INFO - __main__ - Step 55821: {'lr': 0.0003538449764741278, 'samples': 10717632, 'steps': 55820, 'loss/train': 1.3925102949142456} 11/07/2021 05:04:19 - INFO - __main__ - Step 55822: {'lr': 0.00035384014918120527, 'samples': 10717824, 'steps': 55821, 'loss/train': 1.0944126844406128} 11/07/2021 05:04:19 - INFO - __main__ - Step 55823: {'lr': 0.00035383532184149393, 'samples': 10718016, 'steps': 55822, 'loss/train': 1.1503760814666748} 11/07/2021 05:04:20 - INFO - __main__ - Step 55824: {'lr': 0.00035383049445499596, 'samples': 10718208, 'steps': 55823, 'loss/train': 1.5366030931472778} 11/07/2021 05:04:20 - INFO - __main__ - Step 55825: {'lr': 0.0003538256670217135, 'samples': 10718400, 'steps': 55824, 'loss/train': 1.2309882640838623} 11/07/2021 05:04:21 - INFO - __main__ - Step 55826: {'lr': 0.0003538208395416489, 'samples': 10718592, 'steps': 55825, 'loss/train': 1.2972712516784668} 11/07/2021 05:04:21 - INFO - __main__ - Step 55827: {'lr': 0.00035381601201480426, 'samples': 10718784, 'steps': 55826, 'loss/train': 1.76887845993042} 11/07/2021 05:04:21 - INFO - __main__ - Step 55828: {'lr': 0.00035381118444118167, 'samples': 10718976, 'steps': 55827, 'loss/train': 0.9605786204338074} 11/07/2021 05:04:23 - INFO - __main__ - Step 55829: {'lr': 0.00035380635682078334, 'samples': 10719168, 'steps': 55828, 'loss/train': 1.267890453338623} 11/07/2021 05:04:23 - INFO - __main__ - Step 55830: {'lr': 0.00035380152915361144, 'samples': 10719360, 'steps': 55829, 'loss/train': 1.2921608686447144} 11/07/2021 05:04:23 - INFO - __main__ - Step 55831: {'lr': 0.00035379670143966826, 'samples': 10719552, 'steps': 55830, 'loss/train': 2.0029077529907227} 11/07/2021 05:04:24 - INFO - __main__ - Step 55832: {'lr': 0.00035379187367895584, 'samples': 10719744, 'steps': 55831, 'loss/train': 1.0866672992706299} 11/07/2021 05:04:24 - INFO - __main__ - Step 55833: {'lr': 0.0003537870458714765, 'samples': 10719936, 'steps': 55832, 'loss/train': 0.10541039705276489} 11/07/2021 05:04:25 - INFO - __main__ - Step 55834: {'lr': 0.0003537822180172322, 'samples': 10720128, 'steps': 55833, 'loss/train': 1.5007840394973755} 11/07/2021 05:04:25 - INFO - __main__ - Step 55835: {'lr': 0.00035377739011622524, 'samples': 10720320, 'steps': 55834, 'loss/train': 1.2568243741989136} 11/07/2021 05:04:26 - INFO - __main__ - Step 55836: {'lr': 0.0003537725621684578, 'samples': 10720512, 'steps': 55835, 'loss/train': 1.7705588340759277} 11/07/2021 05:04:26 - INFO - __main__ - Step 55837: {'lr': 0.0003537677341739321, 'samples': 10720704, 'steps': 55836, 'loss/train': 1.0530240535736084} 11/07/2021 05:04:26 - INFO - __main__ - Step 55838: {'lr': 0.0003537629061326503, 'samples': 10720896, 'steps': 55837, 'loss/train': 0.9166146516799927} 11/07/2021 05:04:28 - INFO - __main__ - Step 55839: {'lr': 0.0003537580780446144, 'samples': 10721088, 'steps': 55838, 'loss/train': 1.3045827150344849} 11/07/2021 05:04:28 - INFO - __main__ - Step 55840: {'lr': 0.0003537532499098268, 'samples': 10721280, 'steps': 55839, 'loss/train': 1.0616631507873535} 11/07/2021 05:04:28 - INFO - __main__ - Step 55841: {'lr': 0.0003537484217282895, 'samples': 10721472, 'steps': 55840, 'loss/train': 1.56183922290802} 11/07/2021 05:04:29 - INFO - __main__ - Step 55842: {'lr': 0.00035374359350000484, 'samples': 10721664, 'steps': 55841, 'loss/train': 1.5610754489898682} 11/07/2021 05:04:29 - INFO - __main__ - Step 55843: {'lr': 0.0003537387652249749, 'samples': 10721856, 'steps': 55842, 'loss/train': 0.9415774941444397} 11/07/2021 05:04:30 - INFO - __main__ - Step 55844: {'lr': 0.0003537339369032019, 'samples': 10722048, 'steps': 55843, 'loss/train': 1.4559112787246704} 11/07/2021 05:04:30 - INFO - __main__ - Step 55845: {'lr': 0.0003537291085346879, 'samples': 10722240, 'steps': 55844, 'loss/train': 0.9914204478263855} 11/07/2021 05:04:31 - INFO - __main__ - Step 55846: {'lr': 0.0003537242801194353, 'samples': 10722432, 'steps': 55845, 'loss/train': 1.1230731010437012} 11/07/2021 05:04:31 - INFO - __main__ - Step 55847: {'lr': 0.000353719451657446, 'samples': 10722624, 'steps': 55846, 'loss/train': 2.628389596939087} 11/07/2021 05:04:31 - INFO - __main__ - Step 55848: {'lr': 0.0003537146231487224, 'samples': 10722816, 'steps': 55847, 'loss/train': 1.2518551349639893} 11/07/2021 05:04:32 - INFO - __main__ - Step 55849: {'lr': 0.0003537097945932666, 'samples': 10723008, 'steps': 55848, 'loss/train': 1.021565318107605} 11/07/2021 05:04:33 - INFO - __main__ - Step 55850: {'lr': 0.00035370496599108073, 'samples': 10723200, 'steps': 55849, 'loss/train': 1.3686603307724} 11/07/2021 05:04:33 - INFO - __main__ - Step 55851: {'lr': 0.00035370013734216697, 'samples': 10723392, 'steps': 55850, 'loss/train': 1.6993813514709473} 11/07/2021 05:04:34 - INFO - __main__ - Step 55852: {'lr': 0.0003536953086465276, 'samples': 10723584, 'steps': 55851, 'loss/train': 1.298002004623413} 11/07/2021 05:04:34 - INFO - __main__ - Step 55853: {'lr': 0.0003536904799041647, 'samples': 10723776, 'steps': 55852, 'loss/train': 1.31694495677948} 11/07/2021 05:04:35 - INFO - __main__ - Step 55854: {'lr': 0.00035368565111508043, 'samples': 10723968, 'steps': 55853, 'loss/train': 1.5860368013381958} 11/07/2021 05:04:35 - INFO - __main__ - Step 55855: {'lr': 0.000353680822279277, 'samples': 10724160, 'steps': 55854, 'loss/train': 1.274109959602356} 11/07/2021 05:04:36 - INFO - __main__ - Step 55856: {'lr': 0.00035367599339675664, 'samples': 10724352, 'steps': 55855, 'loss/train': 0.953221321105957} 11/07/2021 05:04:36 - INFO - __main__ - Step 55857: {'lr': 0.0003536711644675215, 'samples': 10724544, 'steps': 55856, 'loss/train': 1.5896235704421997} 11/07/2021 05:04:36 - INFO - __main__ - Step 55858: {'lr': 0.0003536663354915737, 'samples': 10724736, 'steps': 55857, 'loss/train': 0.6878846287727356} 11/07/2021 05:04:38 - INFO - __main__ - Step 55859: {'lr': 0.00035366150646891543, 'samples': 10724928, 'steps': 55858, 'loss/train': 1.3998024463653564} 11/07/2021 05:04:38 - INFO - __main__ - Step 55860: {'lr': 0.0003536566773995489, 'samples': 10725120, 'steps': 55859, 'loss/train': 1.8037703037261963} 11/07/2021 05:04:38 - INFO - __main__ - Step 55861: {'lr': 0.0003536518482834763, 'samples': 10725312, 'steps': 55860, 'loss/train': 2.0232062339782715} 11/07/2021 05:04:39 - INFO - __main__ - Step 55862: {'lr': 0.0003536470191206997, 'samples': 10725504, 'steps': 55861, 'loss/train': 1.6955820322036743} 11/07/2021 05:04:39 - INFO - __main__ - Step 55863: {'lr': 0.00035364218991122145, 'samples': 10725696, 'steps': 55862, 'loss/train': 1.434757113456726} 11/07/2021 05:04:40 - INFO - __main__ - Step 55864: {'lr': 0.00035363736065504355, 'samples': 10725888, 'steps': 55863, 'loss/train': 1.400452971458435} 11/07/2021 05:04:40 - INFO - __main__ - Step 55865: {'lr': 0.0003536325313521683, 'samples': 10726080, 'steps': 55864, 'loss/train': 1.664194107055664} 11/07/2021 05:04:41 - INFO - __main__ - Step 55866: {'lr': 0.0003536277020025978, 'samples': 10726272, 'steps': 55865, 'loss/train': 1.6159415245056152} 11/07/2021 05:04:41 - INFO - __main__ - Step 55867: {'lr': 0.0003536228726063343, 'samples': 10726464, 'steps': 55866, 'loss/train': 0.9347060322761536} 11/07/2021 05:04:41 - INFO - __main__ - Step 55868: {'lr': 0.00035361804316337987, 'samples': 10726656, 'steps': 55867, 'loss/train': 1.5989532470703125} 11/07/2021 05:04:42 - INFO - __main__ - Step 55869: {'lr': 0.00035361321367373676, 'samples': 10726848, 'steps': 55868, 'loss/train': 1.2443060874938965} 11/07/2021 05:04:43 - INFO - __main__ - Step 55870: {'lr': 0.00035360838413740715, 'samples': 10727040, 'steps': 55869, 'loss/train': 1.1219351291656494} 11/07/2021 05:04:43 - INFO - __main__ - Step 55871: {'lr': 0.0003536035545543933, 'samples': 10727232, 'steps': 55870, 'loss/train': 0.5671005249023438} 11/07/2021 05:04:43 - INFO - __main__ - Step 55872: {'lr': 0.00035359872492469715, 'samples': 10727424, 'steps': 55871, 'loss/train': 1.500485897064209} 11/07/2021 05:04:44 - INFO - __main__ - Step 55873: {'lr': 0.0003535938952483211, 'samples': 10727616, 'steps': 55872, 'loss/train': 1.4232797622680664} 11/07/2021 05:04:45 - INFO - __main__ - Step 55874: {'lr': 0.00035358906552526714, 'samples': 10727808, 'steps': 55873, 'loss/train': 1.3123780488967896} 11/07/2021 05:04:45 - INFO - __main__ - Step 55875: {'lr': 0.0003535842357555376, 'samples': 10728000, 'steps': 55874, 'loss/train': 1.4959659576416016} 11/07/2021 05:04:46 - INFO - __main__ - Step 55876: {'lr': 0.0003535794059391346, 'samples': 10728192, 'steps': 55875, 'loss/train': 1.2305139303207397} 11/07/2021 05:04:46 - INFO - __main__ - Step 55877: {'lr': 0.00035357457607606034, 'samples': 10728384, 'steps': 55876, 'loss/train': 1.3052294254302979} 11/07/2021 05:04:46 - INFO - __main__ - Step 55878: {'lr': 0.00035356974616631697, 'samples': 10728576, 'steps': 55877, 'loss/train': 1.7017009258270264} 11/07/2021 05:04:47 - INFO - __main__ - Step 55879: {'lr': 0.00035356491620990667, 'samples': 10728768, 'steps': 55878, 'loss/train': 1.325434923171997} 11/07/2021 05:04:48 - INFO - __main__ - Step 55880: {'lr': 0.0003535600862068316, 'samples': 10728960, 'steps': 55879, 'loss/train': 1.3903403282165527} 11/07/2021 05:04:48 - INFO - __main__ - Step 55881: {'lr': 0.00035355525615709393, 'samples': 10729152, 'steps': 55880, 'loss/train': 1.5887025594711304} 11/07/2021 05:04:49 - INFO - __main__ - Step 55882: {'lr': 0.0003535504260606959, 'samples': 10729344, 'steps': 55881, 'loss/train': 1.132631778717041} 11/07/2021 05:04:49 - INFO - __main__ - Step 55883: {'lr': 0.00035354559591763965, 'samples': 10729536, 'steps': 55882, 'loss/train': 1.4556214809417725} 11/07/2021 05:04:49 - INFO - __main__ - Step 55884: {'lr': 0.0003535407657279273, 'samples': 10729728, 'steps': 55883, 'loss/train': 1.1831425428390503} 11/07/2021 05:04:50 - INFO - __main__ - Step 55885: {'lr': 0.00035353593549156115, 'samples': 10729920, 'steps': 55884, 'loss/train': 1.3290823698043823} 11/07/2021 05:04:51 - INFO - __main__ - Step 55886: {'lr': 0.00035353110520854324, 'samples': 10730112, 'steps': 55885, 'loss/train': 1.6130807399749756} 11/07/2021 05:04:51 - INFO - __main__ - Step 55887: {'lr': 0.0003535262748788759, 'samples': 10730304, 'steps': 55886, 'loss/train': 1.4945546388626099} 11/07/2021 05:04:51 - INFO - __main__ - Step 55888: {'lr': 0.00035352144450256115, 'samples': 10730496, 'steps': 55887, 'loss/train': 1.2880640029907227} 11/07/2021 05:04:52 - INFO - __main__ - Step 55889: {'lr': 0.00035351661407960125, 'samples': 10730688, 'steps': 55888, 'loss/train': 1.4798877239227295} 11/07/2021 05:04:53 - INFO - __main__ - Step 55890: {'lr': 0.0003535117836099983, 'samples': 10730880, 'steps': 55889, 'loss/train': 1.4757639169692993} 11/07/2021 05:04:53 - INFO - __main__ - Step 55891: {'lr': 0.00035350695309375465, 'samples': 10731072, 'steps': 55890, 'loss/train': 1.1573599576950073} 11/07/2021 05:04:53 - INFO - __main__ - Step 55892: {'lr': 0.00035350212253087233, 'samples': 10731264, 'steps': 55891, 'loss/train': 1.365740180015564} 11/07/2021 05:04:54 - INFO - __main__ - Step 55893: {'lr': 0.0003534972919213535, 'samples': 10731456, 'steps': 55892, 'loss/train': 1.3830736875534058} 11/07/2021 05:04:54 - INFO - __main__ - Step 55894: {'lr': 0.0003534924612652004, 'samples': 10731648, 'steps': 55893, 'loss/train': 1.2669957876205444} 11/07/2021 05:04:55 - INFO - __main__ - Step 55895: {'lr': 0.00035348763056241515, 'samples': 10731840, 'steps': 55894, 'loss/train': 1.4290146827697754} 11/07/2021 05:04:55 - INFO - __main__ - Step 55896: {'lr': 0.0003534827998130001, 'samples': 10732032, 'steps': 55895, 'loss/train': 1.075005292892456} 11/07/2021 05:04:56 - INFO - __main__ - Step 55897: {'lr': 0.00035347796901695716, 'samples': 10732224, 'steps': 55896, 'loss/train': 1.3419417142868042} 11/07/2021 05:04:56 - INFO - __main__ - Step 55898: {'lr': 0.0003534731381742888, 'samples': 10732416, 'steps': 55897, 'loss/train': 1.0262254476547241} 11/07/2021 05:04:56 - INFO - __main__ - Step 55899: {'lr': 0.0003534683072849969, 'samples': 10732608, 'steps': 55898, 'loss/train': 1.3884958028793335} 11/07/2021 05:04:57 - INFO - __main__ - Step 55900: {'lr': 0.0003534634763490838, 'samples': 10732800, 'steps': 55899, 'loss/train': 1.6960127353668213} 11/07/2021 05:04:58 - INFO - __main__ - Step 55901: {'lr': 0.0003534586453665517, 'samples': 10732992, 'steps': 55900, 'loss/train': 1.274144172668457} 11/07/2021 05:04:58 - INFO - __main__ - Step 55902: {'lr': 0.00035345381433740273, 'samples': 10733184, 'steps': 55901, 'loss/train': 1.7366596460342407} 11/07/2021 05:04:59 - INFO - __main__ - Step 55903: {'lr': 0.00035344898326163907, 'samples': 10733376, 'steps': 55902, 'loss/train': 1.1894418001174927} 11/07/2021 05:04:59 - INFO - __main__ - Step 55904: {'lr': 0.00035344415213926284, 'samples': 10733568, 'steps': 55903, 'loss/train': 1.2441020011901855} 11/07/2021 05:05:00 - INFO - __main__ - Step 55905: {'lr': 0.0003534393209702764, 'samples': 10733760, 'steps': 55904, 'loss/train': 0.24876174330711365} 11/07/2021 05:05:00 - INFO - __main__ - Step 55906: {'lr': 0.0003534344897546816, 'samples': 10733952, 'steps': 55905, 'loss/train': 1.4499990940093994} 11/07/2021 05:05:01 - INFO - __main__ - Step 55907: {'lr': 0.00035342965849248097, 'samples': 10734144, 'steps': 55906, 'loss/train': 1.3530652523040771} 11/07/2021 05:05:01 - INFO - __main__ - Step 55908: {'lr': 0.00035342482718367645, 'samples': 10734336, 'steps': 55907, 'loss/train': 0.10861533880233765} 11/07/2021 05:05:01 - INFO - __main__ - Step 55909: {'lr': 0.0003534199958282703, 'samples': 10734528, 'steps': 55908, 'loss/train': 1.851309895515442} 11/07/2021 05:05:02 - INFO - __main__ - Step 55910: {'lr': 0.00035341516442626475, 'samples': 10734720, 'steps': 55909, 'loss/train': 1.528367519378662} 11/07/2021 05:05:03 - INFO - __main__ - Step 55911: {'lr': 0.0003534103329776619, 'samples': 10734912, 'steps': 55910, 'loss/train': 1.3974149227142334} 11/07/2021 05:05:03 - INFO - __main__ - Step 55912: {'lr': 0.000353405501482464, 'samples': 10735104, 'steps': 55911, 'loss/train': 1.1010369062423706} 11/07/2021 05:05:04 - INFO - __main__ - Step 55913: {'lr': 0.0003534006699406731, 'samples': 10735296, 'steps': 55912, 'loss/train': 1.5172953605651855} 11/07/2021 05:05:04 - INFO - __main__ - Step 55914: {'lr': 0.0003533958383522915, 'samples': 10735488, 'steps': 55913, 'loss/train': 1.2249624729156494} 11/07/2021 05:05:05 - INFO - __main__ - Step 55915: {'lr': 0.0003533910067173213, 'samples': 10735680, 'steps': 55914, 'loss/train': 1.1259907484054565} 11/07/2021 05:05:05 - INFO - __main__ - Step 55916: {'lr': 0.0003533861750357647, 'samples': 10735872, 'steps': 55915, 'loss/train': 1.15132737159729} 11/07/2021 05:05:06 - INFO - __main__ - Step 55917: {'lr': 0.0003533813433076239, 'samples': 10736064, 'steps': 55916, 'loss/train': 1.443205714225769} 11/07/2021 05:05:06 - INFO - __main__ - Step 55918: {'lr': 0.00035337651153290113, 'samples': 10736256, 'steps': 55917, 'loss/train': 1.978428602218628} 11/07/2021 05:05:06 - INFO - __main__ - Step 55919: {'lr': 0.00035337167971159837, 'samples': 10736448, 'steps': 55918, 'loss/train': 1.5240490436553955} 11/07/2021 05:05:07 - INFO - __main__ - Step 55920: {'lr': 0.000353366847843718, 'samples': 10736640, 'steps': 55919, 'loss/train': 1.230600357055664} 11/07/2021 05:05:08 - INFO - __main__ - Step 55921: {'lr': 0.0003533620159292621, 'samples': 10736832, 'steps': 55920, 'loss/train': 1.642690896987915} 11/07/2021 05:05:08 - INFO - __main__ - Step 55922: {'lr': 0.0003533571839682329, 'samples': 10737024, 'steps': 55921, 'loss/train': 1.8003747463226318} 11/07/2021 05:05:09 - INFO - __main__ - Step 55923: {'lr': 0.00035335235196063254, 'samples': 10737216, 'steps': 55922, 'loss/train': 0.1256265640258789} 11/07/2021 05:05:09 - INFO - __main__ - Step 55924: {'lr': 0.0003533475199064632, 'samples': 10737408, 'steps': 55923, 'loss/train': 1.00397527217865} 11/07/2021 05:05:09 - INFO - __main__ - Step 55925: {'lr': 0.00035334268780572707, 'samples': 10737600, 'steps': 55924, 'loss/train': 1.1774753332138062} 11/07/2021 05:05:10 - INFO - __main__ - Step 55926: {'lr': 0.0003533378556584263, 'samples': 10737792, 'steps': 55925, 'loss/train': 1.18112051486969} 11/07/2021 05:05:11 - INFO - __main__ - Step 55927: {'lr': 0.0003533330234645631, 'samples': 10737984, 'steps': 55926, 'loss/train': 1.4713926315307617} 11/07/2021 05:05:11 - INFO - __main__ - Step 55928: {'lr': 0.00035332819122413963, 'samples': 10738176, 'steps': 55927, 'loss/train': 1.5009284019470215} 11/07/2021 05:05:11 - INFO - __main__ - Step 55929: {'lr': 0.00035332335893715805, 'samples': 10738368, 'steps': 55928, 'loss/train': 1.3760688304901123} 11/07/2021 05:05:12 - INFO - __main__ - Step 55930: {'lr': 0.00035331852660362055, 'samples': 10738560, 'steps': 55929, 'loss/train': 1.3465180397033691} 11/07/2021 05:05:13 - INFO - __main__ - Step 55931: {'lr': 0.00035331369422352937, 'samples': 10738752, 'steps': 55930, 'loss/train': 1.4027389287948608} 11/07/2021 05:05:13 - INFO - __main__ - Step 55932: {'lr': 0.00035330886179688666, 'samples': 10738944, 'steps': 55931, 'loss/train': 1.3508884906768799} 11/07/2021 05:05:13 - INFO - __main__ - Step 55933: {'lr': 0.0003533040293236945, 'samples': 10739136, 'steps': 55932, 'loss/train': 1.6447728872299194} 11/07/2021 05:05:14 - INFO - __main__ - Step 55934: {'lr': 0.0003532991968039552, 'samples': 10739328, 'steps': 55933, 'loss/train': 1.617796540260315} 11/07/2021 05:05:14 - INFO - __main__ - Step 55935: {'lr': 0.0003532943642376708, 'samples': 10739520, 'steps': 55934, 'loss/train': 0.9896909594535828} 11/07/2021 05:05:15 - INFO - __main__ - Step 55936: {'lr': 0.00035328953162484355, 'samples': 10739712, 'steps': 55935, 'loss/train': 0.7301599383354187} 11/07/2021 05:05:16 - INFO - __main__ - Step 55937: {'lr': 0.00035328469896547566, 'samples': 10739904, 'steps': 55936, 'loss/train': 1.390870213508606} 11/07/2021 05:05:16 - INFO - __main__ - Step 55938: {'lr': 0.0003532798662595693, 'samples': 10740096, 'steps': 55937, 'loss/train': 1.366762399673462} 11/07/2021 05:05:16 - INFO - __main__ - Step 55939: {'lr': 0.00035327503350712666, 'samples': 10740288, 'steps': 55938, 'loss/train': 1.447092890739441} 11/07/2021 05:05:17 - INFO - __main__ - Step 55940: {'lr': 0.0003532702007081498, 'samples': 10740480, 'steps': 55939, 'loss/train': 1.8277941942214966} 11/07/2021 05:05:18 - INFO - __main__ - Step 55941: {'lr': 0.000353265367862641, 'samples': 10740672, 'steps': 55940, 'loss/train': 1.4005780220031738} 11/07/2021 05:05:18 - INFO - __main__ - Step 55942: {'lr': 0.0003532605349706025, 'samples': 10740864, 'steps': 55941, 'loss/train': 1.4999957084655762} 11/07/2021 05:05:18 - INFO - __main__ - Step 55943: {'lr': 0.00035325570203203626, 'samples': 10741056, 'steps': 55942, 'loss/train': 1.7539501190185547} 11/07/2021 05:05:19 - INFO - __main__ - Step 55944: {'lr': 0.0003532508690469447, 'samples': 10741248, 'steps': 55943, 'loss/train': 1.5305148363113403} 11/07/2021 05:05:19 - INFO - __main__ - Step 55945: {'lr': 0.0003532460360153299, 'samples': 10741440, 'steps': 55944, 'loss/train': 1.5526245832443237} 11/07/2021 05:05:20 - INFO - __main__ - Step 55946: {'lr': 0.000353241202937194, 'samples': 10741632, 'steps': 55945, 'loss/train': 1.1486482620239258} 11/07/2021 05:05:21 - INFO - __main__ - Step 55947: {'lr': 0.00035323636981253914, 'samples': 10741824, 'steps': 55946, 'loss/train': 1.5427560806274414} 11/07/2021 05:05:21 - INFO - __main__ - Step 55948: {'lr': 0.00035323153664136765, 'samples': 10742016, 'steps': 55947, 'loss/train': 1.8074579238891602} 11/07/2021 05:05:21 - INFO - __main__ - Step 55949: {'lr': 0.00035322670342368155, 'samples': 10742208, 'steps': 55948, 'loss/train': 1.2773897647857666} 11/07/2021 05:05:22 - INFO - __main__ - Step 55950: {'lr': 0.0003532218701594832, 'samples': 10742400, 'steps': 55949, 'loss/train': 1.4659587144851685} 11/07/2021 05:05:22 - INFO - __main__ - Step 55951: {'lr': 0.0003532170368487746, 'samples': 10742592, 'steps': 55950, 'loss/train': 1.7059035301208496} 11/07/2021 05:05:23 - INFO - __main__ - Step 55952: {'lr': 0.00035321220349155796, 'samples': 10742784, 'steps': 55951, 'loss/train': 1.6169774532318115} 11/07/2021 05:05:24 - INFO - __main__ - Step 55953: {'lr': 0.00035320737008783556, 'samples': 10742976, 'steps': 55952, 'loss/train': 1.169887661933899} 11/07/2021 05:05:24 - INFO - __main__ - Step 55954: {'lr': 0.0003532025366376095, 'samples': 10743168, 'steps': 55953, 'loss/train': 1.4586212635040283} 11/07/2021 05:05:24 - INFO - __main__ - Step 55955: {'lr': 0.0003531977031408819, 'samples': 10743360, 'steps': 55954, 'loss/train': 1.5239309072494507} 11/07/2021 05:05:25 - INFO - __main__ - Step 55956: {'lr': 0.0003531928695976551, 'samples': 10743552, 'steps': 55955, 'loss/train': 1.4125745296478271} 11/07/2021 05:05:25 - INFO - __main__ - Step 55957: {'lr': 0.00035318803600793117, 'samples': 10743744, 'steps': 55956, 'loss/train': 1.0789803266525269} 11/07/2021 05:05:26 - INFO - __main__ - Step 55958: {'lr': 0.00035318320237171224, 'samples': 10743936, 'steps': 55957, 'loss/train': 1.5693883895874023} 11/07/2021 05:05:26 - INFO - __main__ - Step 55959: {'lr': 0.0003531783686890006, 'samples': 10744128, 'steps': 55958, 'loss/train': 1.3762418031692505} 11/07/2021 05:05:27 - INFO - __main__ - Step 55960: {'lr': 0.0003531735349597984, 'samples': 10744320, 'steps': 55959, 'loss/train': 0.825796902179718} 11/07/2021 05:05:27 - INFO - __main__ - Step 55961: {'lr': 0.0003531687011841077, 'samples': 10744512, 'steps': 55960, 'loss/train': 1.4723906517028809} 11/07/2021 05:05:27 - INFO - __main__ - Step 55962: {'lr': 0.0003531638673619309, 'samples': 10744704, 'steps': 55961, 'loss/train': 1.387729525566101} 11/07/2021 05:05:28 - INFO - __main__ - Step 55963: {'lr': 0.00035315903349327, 'samples': 10744896, 'steps': 55962, 'loss/train': 1.4891746044158936} 11/07/2021 05:05:29 - INFO - __main__ - Step 55964: {'lr': 0.00035315419957812725, 'samples': 10745088, 'steps': 55963, 'loss/train': 1.6799993515014648} 11/07/2021 05:05:29 - INFO - __main__ - Step 55965: {'lr': 0.0003531493656165047, 'samples': 10745280, 'steps': 55964, 'loss/train': 1.3721928596496582} 11/07/2021 05:05:29 - INFO - __main__ - Step 55966: {'lr': 0.00035314453160840476, 'samples': 10745472, 'steps': 55965, 'loss/train': 1.483290433883667} 11/07/2021 05:05:30 - INFO - __main__ - Step 55967: {'lr': 0.00035313969755382946, 'samples': 10745664, 'steps': 55966, 'loss/train': 1.7456120252609253} 11/07/2021 05:05:31 - INFO - __main__ - Step 55968: {'lr': 0.000353134863452781, 'samples': 10745856, 'steps': 55967, 'loss/train': 1.7259849309921265} 11/07/2021 05:05:31 - INFO - __main__ - Step 55969: {'lr': 0.00035313002930526156, 'samples': 10746048, 'steps': 55968, 'loss/train': 0.07837814092636108} 11/07/2021 05:05:31 - INFO - __main__ - Step 55970: {'lr': 0.00035312519511127325, 'samples': 10746240, 'steps': 55969, 'loss/train': 1.245185375213623} 11/07/2021 05:05:32 - INFO - __main__ - Step 55971: {'lr': 0.0003531203608708184, 'samples': 10746432, 'steps': 55970, 'loss/train': 0.6822187900543213} 11/07/2021 05:05:32 - INFO - __main__ - Step 55972: {'lr': 0.00035311552658389914, 'samples': 10746624, 'steps': 55971, 'loss/train': 0.39193999767303467} 11/07/2021 05:05:33 - INFO - __main__ - Step 55973: {'lr': 0.00035311069225051755, 'samples': 10746816, 'steps': 55972, 'loss/train': 1.1932096481323242} 11/07/2021 05:05:34 - INFO - __main__ - Step 55974: {'lr': 0.0003531058578706759, 'samples': 10747008, 'steps': 55973, 'loss/train': 1.864783525466919} 11/07/2021 05:05:34 - INFO - __main__ - Step 55975: {'lr': 0.00035310102344437636, 'samples': 10747200, 'steps': 55974, 'loss/train': 1.5394617319107056} 11/07/2021 05:05:34 - INFO - __main__ - Step 55976: {'lr': 0.00035309618897162097, 'samples': 10747392, 'steps': 55975, 'loss/train': 2.1839776039123535} 11/07/2021 05:05:35 - INFO - __main__ - Step 55977: {'lr': 0.0003530913544524121, 'samples': 10747584, 'steps': 55976, 'loss/train': 1.5453792810440063} 11/07/2021 05:05:35 - INFO - __main__ - Step 55978: {'lr': 0.00035308651988675194, 'samples': 10747776, 'steps': 55977, 'loss/train': 1.2105815410614014} 11/07/2021 05:05:36 - INFO - __main__ - Step 55979: {'lr': 0.0003530816852746426, 'samples': 10747968, 'steps': 55978, 'loss/train': 1.77792489528656} 11/07/2021 05:05:37 - INFO - __main__ - Step 55980: {'lr': 0.00035307685061608605, 'samples': 10748160, 'steps': 55979, 'loss/train': 1.562185287475586} 11/07/2021 05:05:37 - INFO - __main__ - Step 55981: {'lr': 0.00035307201591108485, 'samples': 10748352, 'steps': 55980, 'loss/train': 0.111997589468956} 11/07/2021 05:05:37 - INFO - __main__ - Step 55982: {'lr': 0.0003530671811596409, 'samples': 10748544, 'steps': 55981, 'loss/train': 0.3349896967411041} 11/07/2021 05:05:38 - INFO - __main__ - Step 55983: {'lr': 0.00035306234636175646, 'samples': 10748736, 'steps': 55982, 'loss/train': 0.15526923537254333} 11/07/2021 05:05:39 - INFO - __main__ - Step 55984: {'lr': 0.0003530575115174337, 'samples': 10748928, 'steps': 55983, 'loss/train': 1.426676630973816} 11/07/2021 05:05:39 - INFO - __main__ - Step 55985: {'lr': 0.00035305267662667485, 'samples': 10749120, 'steps': 55984, 'loss/train': 1.652669072151184} 11/07/2021 05:05:40 - INFO - __main__ - Step 55986: {'lr': 0.0003530478416894821, 'samples': 10749312, 'steps': 55985, 'loss/train': 1.0717411041259766} 11/07/2021 05:05:40 - INFO - __main__ - Step 55987: {'lr': 0.00035304300670585754, 'samples': 10749504, 'steps': 55986, 'loss/train': 1.5535953044891357} 11/07/2021 05:05:40 - INFO - __main__ - Step 55988: {'lr': 0.0003530381716758034, 'samples': 10749696, 'steps': 55987, 'loss/train': 1.174253225326538} 11/07/2021 05:05:41 - INFO - __main__ - Step 55989: {'lr': 0.00035303333659932187, 'samples': 10749888, 'steps': 55988, 'loss/train': 1.393289566040039} 11/07/2021 05:05:42 - INFO - __main__ - Step 55990: {'lr': 0.000353028501476415, 'samples': 10750080, 'steps': 55989, 'loss/train': 1.339763879776001} 11/07/2021 05:05:42 - INFO - __main__ - Step 55991: {'lr': 0.0003530236663070852, 'samples': 10750272, 'steps': 55990, 'loss/train': 1.4543182849884033} 11/07/2021 05:05:42 - INFO - __main__ - Step 55992: {'lr': 0.00035301883109133456, 'samples': 10750464, 'steps': 55991, 'loss/train': 1.587518334388733} 11/07/2021 05:05:43 - INFO - __main__ - Step 55993: {'lr': 0.0003530139958291651, 'samples': 10750656, 'steps': 55992, 'loss/train': 1.3030308485031128} 11/07/2021 05:05:44 - INFO - __main__ - Step 55994: {'lr': 0.0003530091605205792, 'samples': 10750848, 'steps': 55993, 'loss/train': 0.8324307799339294} 11/07/2021 05:05:44 - INFO - __main__ - Step 55995: {'lr': 0.0003530043251655789, 'samples': 10751040, 'steps': 55994, 'loss/train': 0.6881075501441956} 11/07/2021 05:05:44 - INFO - __main__ - Step 55996: {'lr': 0.00035299948976416645, 'samples': 10751232, 'steps': 55995, 'loss/train': 1.4833433628082275} 11/07/2021 05:05:45 - INFO - __main__ - Step 55997: {'lr': 0.00035299465431634403, 'samples': 10751424, 'steps': 55996, 'loss/train': 1.2558081150054932} 11/07/2021 05:05:45 - INFO - __main__ - Step 55998: {'lr': 0.00035298981882211385, 'samples': 10751616, 'steps': 55997, 'loss/train': 1.4812978506088257} 11/07/2021 05:05:46 - INFO - __main__ - Step 55999: {'lr': 0.00035298498328147803, 'samples': 10751808, 'steps': 55998, 'loss/train': 0.9314523935317993} 11/07/2021 05:05:47 - INFO - __main__ - Step 56000: {'lr': 0.00035298014769443874, 'samples': 10752000, 'steps': 55999, 'loss/train': 1.2715140581130981} 11/07/2021 05:05:47 - INFO - __main__ - Step 56001: {'lr': 0.0003529753120609982, 'samples': 10752192, 'steps': 56000, 'loss/train': 1.534266710281372} 11/07/2021 05:05:47 - INFO - __main__ - Step 56002: {'lr': 0.0003529704763811585, 'samples': 10752384, 'steps': 56001, 'loss/train': 1.180860996246338} 11/07/2021 05:05:48 - INFO - __main__ - Step 56003: {'lr': 0.000352965640654922, 'samples': 10752576, 'steps': 56002, 'loss/train': 1.4908504486083984} 11/07/2021 05:05:48 - INFO - __main__ - Step 56004: {'lr': 0.0003529608048822908, 'samples': 10752768, 'steps': 56003, 'loss/train': 1.4740095138549805} 11/07/2021 05:05:49 - INFO - __main__ - Step 56005: {'lr': 0.0003529559690632669, 'samples': 10752960, 'steps': 56004, 'loss/train': 2.202659845352173} 11/07/2021 05:05:50 - INFO - __main__ - Step 56006: {'lr': 0.00035295113319785276, 'samples': 10753152, 'steps': 56005, 'loss/train': 1.436906337738037} 11/07/2021 05:05:50 - INFO - __main__ - Step 56007: {'lr': 0.0003529462972860504, 'samples': 10753344, 'steps': 56006, 'loss/train': 1.2278133630752563} 11/07/2021 05:05:50 - INFO - __main__ - Step 56008: {'lr': 0.000352941461327862, 'samples': 10753536, 'steps': 56007, 'loss/train': 1.1644611358642578} 11/07/2021 05:05:51 - INFO - __main__ - Step 56009: {'lr': 0.0003529366253232897, 'samples': 10753728, 'steps': 56008, 'loss/train': 1.4787629842758179} 11/07/2021 05:05:52 - INFO - __main__ - Step 56010: {'lr': 0.00035293178927233587, 'samples': 10753920, 'steps': 56009, 'loss/train': 1.521426796913147} 11/07/2021 05:05:52 - INFO - __main__ - Step 56011: {'lr': 0.0003529269531750025, 'samples': 10754112, 'steps': 56010, 'loss/train': 1.2113900184631348} 11/07/2021 05:05:52 - INFO - __main__ - Step 56012: {'lr': 0.0003529221170312919, 'samples': 10754304, 'steps': 56011, 'loss/train': 1.088667392730713} 11/07/2021 05:05:53 - INFO - __main__ - Step 56013: {'lr': 0.0003529172808412061, 'samples': 10754496, 'steps': 56012, 'loss/train': 0.11348990350961685} 11/07/2021 05:05:53 - INFO - __main__ - Step 56014: {'lr': 0.0003529124446047474, 'samples': 10754688, 'steps': 56013, 'loss/train': 1.4001039266586304} 11/07/2021 05:05:54 - INFO - __main__ - Step 56015: {'lr': 0.0003529076083219179, 'samples': 10754880, 'steps': 56014, 'loss/train': 1.681990146636963} 11/07/2021 05:05:55 - INFO - __main__ - Step 56016: {'lr': 0.0003529027719927199, 'samples': 10755072, 'steps': 56015, 'loss/train': 1.2354339361190796} 11/07/2021 05:05:55 - INFO - __main__ - Step 56017: {'lr': 0.00035289793561715544, 'samples': 10755264, 'steps': 56016, 'loss/train': 1.2206889390945435} 11/07/2021 05:05:55 - INFO - __main__ - Step 56018: {'lr': 0.0003528930991952267, 'samples': 10755456, 'steps': 56017, 'loss/train': 0.7317473888397217} 11/07/2021 05:05:56 - INFO - __main__ - Step 56019: {'lr': 0.00035288826272693606, 'samples': 10755648, 'steps': 56018, 'loss/train': 1.7954611778259277} 11/07/2021 05:05:57 - INFO - __main__ - Step 56020: {'lr': 0.0003528834262122855, 'samples': 10755840, 'steps': 56019, 'loss/train': 1.266148328781128} 11/07/2021 05:05:57 - INFO - __main__ - Step 56021: {'lr': 0.00035287858965127723, 'samples': 10756032, 'steps': 56020, 'loss/train': 0.9832165241241455} 11/07/2021 05:05:57 - INFO - __main__ - Step 56022: {'lr': 0.00035287375304391343, 'samples': 10756224, 'steps': 56021, 'loss/train': 1.249596118927002} 11/07/2021 05:05:58 - INFO - __main__ - Step 56023: {'lr': 0.00035286891639019636, 'samples': 10756416, 'steps': 56022, 'loss/train': 1.4130103588104248} 11/07/2021 05:05:58 - INFO - __main__ - Step 56024: {'lr': 0.00035286407969012813, 'samples': 10756608, 'steps': 56023, 'loss/train': 1.6560721397399902} 11/07/2021 05:05:59 - INFO - __main__ - Step 56025: {'lr': 0.00035285924294371085, 'samples': 10756800, 'steps': 56024, 'loss/train': 1.474733591079712} 11/07/2021 05:05:59 - INFO - __main__ - Step 56026: {'lr': 0.00035285440615094696, 'samples': 10756992, 'steps': 56025, 'loss/train': 1.5125707387924194} 11/07/2021 05:06:00 - INFO - __main__ - Step 56027: {'lr': 0.0003528495693118383, 'samples': 10757184, 'steps': 56026, 'loss/train': 1.1230933666229248} 11/07/2021 05:06:00 - INFO - __main__ - Step 56028: {'lr': 0.0003528447324263873, 'samples': 10757376, 'steps': 56027, 'loss/train': 0.8970708250999451} 11/07/2021 05:06:00 - INFO - __main__ - Step 56029: {'lr': 0.000352839895494596, 'samples': 10757568, 'steps': 56028, 'loss/train': 1.7706984281539917} 11/07/2021 05:06:02 - INFO - __main__ - Step 56030: {'lr': 0.00035283505851646665, 'samples': 10757760, 'steps': 56029, 'loss/train': 1.450102686882019} 11/07/2021 05:06:02 - INFO - __main__ - Step 56031: {'lr': 0.0003528302214920014, 'samples': 10757952, 'steps': 56030, 'loss/train': 1.749725341796875} 11/07/2021 05:06:02 - INFO - __main__ - Step 56032: {'lr': 0.0003528253844212024, 'samples': 10758144, 'steps': 56031, 'loss/train': 0.9640790224075317} 11/07/2021 05:06:03 - INFO - __main__ - Step 56033: {'lr': 0.00035282054730407196, 'samples': 10758336, 'steps': 56032, 'loss/train': 1.684749960899353} 11/07/2021 05:06:03 - INFO - __main__ - Step 56034: {'lr': 0.00035281571014061214, 'samples': 10758528, 'steps': 56033, 'loss/train': 1.329773187637329} 11/07/2021 05:06:04 - INFO - __main__ - Step 56035: {'lr': 0.0003528108729308251, 'samples': 10758720, 'steps': 56034, 'loss/train': 1.843616247177124} 11/07/2021 05:06:04 - INFO - __main__ - Step 56036: {'lr': 0.0003528060356747131, 'samples': 10758912, 'steps': 56035, 'loss/train': 1.5963677167892456} 11/07/2021 05:06:05 - INFO - __main__ - Step 56037: {'lr': 0.0003528011983722783, 'samples': 10759104, 'steps': 56036, 'loss/train': 1.7013540267944336} 11/07/2021 05:06:05 - INFO - __main__ - Step 56038: {'lr': 0.0003527963610235229, 'samples': 10759296, 'steps': 56037, 'loss/train': 0.8886352777481079} 11/07/2021 05:06:05 - INFO - __main__ - Step 56039: {'lr': 0.000352791523628449, 'samples': 10759488, 'steps': 56038, 'loss/train': 1.77224600315094} 11/07/2021 05:06:06 - INFO - __main__ - Step 56040: {'lr': 0.0003527866861870588, 'samples': 10759680, 'steps': 56039, 'loss/train': 1.5709130764007568} 11/07/2021 05:06:07 - INFO - __main__ - Step 56041: {'lr': 0.00035278184869935454, 'samples': 10759872, 'steps': 56040, 'loss/train': 0.9209262728691101} 11/07/2021 05:06:07 - INFO - __main__ - Step 56042: {'lr': 0.0003527770111653383, 'samples': 10760064, 'steps': 56041, 'loss/train': 1.5137721300125122} 11/07/2021 05:06:07 - INFO - __main__ - Step 56043: {'lr': 0.0003527721735850124, 'samples': 10760256, 'steps': 56042, 'loss/train': 1.4074609279632568} 11/07/2021 05:06:08 - INFO - __main__ - Step 56044: {'lr': 0.0003527673359583789, 'samples': 10760448, 'steps': 56043, 'loss/train': 1.074844241142273} 11/07/2021 05:06:09 - INFO - __main__ - Step 56045: {'lr': 0.00035276249828544004, 'samples': 10760640, 'steps': 56044, 'loss/train': 1.5094329118728638} 11/07/2021 05:06:09 - INFO - __main__ - Step 56046: {'lr': 0.0003527576605661981, 'samples': 10760832, 'steps': 56045, 'loss/train': 1.6409003734588623} 11/07/2021 05:06:10 - INFO - __main__ - Step 56047: {'lr': 0.00035275282280065493, 'samples': 10761024, 'steps': 56046, 'loss/train': 1.4655380249023438} 11/07/2021 05:06:10 - INFO - __main__ - Step 56048: {'lr': 0.00035274798498881305, 'samples': 10761216, 'steps': 56047, 'loss/train': 0.4239441454410553} 11/07/2021 05:06:10 - INFO - __main__ - Step 56049: {'lr': 0.00035274314713067454, 'samples': 10761408, 'steps': 56048, 'loss/train': 1.879326581954956} 11/07/2021 05:06:11 - INFO - __main__ - Step 56050: {'lr': 0.00035273830922624147, 'samples': 10761600, 'steps': 56049, 'loss/train': 1.8466169834136963} 11/07/2021 05:06:12 - INFO - __main__ - Step 56051: {'lr': 0.00035273347127551616, 'samples': 10761792, 'steps': 56050, 'loss/train': 1.1744219064712524} 11/07/2021 05:06:12 - INFO - __main__ - Step 56052: {'lr': 0.00035272863327850067, 'samples': 10761984, 'steps': 56051, 'loss/train': 1.4762969017028809} 11/07/2021 05:06:12 - INFO - __main__ - Step 56053: {'lr': 0.00035272379523519734, 'samples': 10762176, 'steps': 56052, 'loss/train': 1.756905436515808} 11/07/2021 05:06:13 - INFO - __main__ - Step 56054: {'lr': 0.0003527189571456082, 'samples': 10762368, 'steps': 56053, 'loss/train': 0.1097700297832489} 11/07/2021 05:06:14 - INFO - __main__ - Step 56055: {'lr': 0.00035271411900973545, 'samples': 10762560, 'steps': 56054, 'loss/train': 1.5177136659622192} 11/07/2021 05:06:14 - INFO - __main__ - Step 56056: {'lr': 0.00035270928082758134, 'samples': 10762752, 'steps': 56055, 'loss/train': 0.9365915060043335} 11/07/2021 05:06:14 - INFO - __main__ - Step 56057: {'lr': 0.00035270444259914794, 'samples': 10762944, 'steps': 56056, 'loss/train': 1.0476480722427368} 11/07/2021 05:06:15 - INFO - __main__ - Step 56058: {'lr': 0.0003526996043244376, 'samples': 10763136, 'steps': 56057, 'loss/train': 1.4696296453475952} 11/07/2021 05:06:15 - INFO - __main__ - Step 56059: {'lr': 0.0003526947660034524, 'samples': 10763328, 'steps': 56058, 'loss/train': 1.3283686637878418} 11/07/2021 05:06:16 - INFO - __main__ - Step 56060: {'lr': 0.0003526899276361945, 'samples': 10763520, 'steps': 56059, 'loss/train': 1.6950527429580688} 11/07/2021 05:06:16 - INFO - __main__ - Step 56061: {'lr': 0.00035268508922266614, 'samples': 10763712, 'steps': 56060, 'loss/train': 1.4874831438064575} 11/07/2021 05:06:17 - INFO - __main__ - Step 56062: {'lr': 0.00035268025076286936, 'samples': 10763904, 'steps': 56061, 'loss/train': 1.465022087097168} 11/07/2021 05:06:17 - INFO - __main__ - Step 56063: {'lr': 0.00035267541225680654, 'samples': 10764096, 'steps': 56062, 'loss/train': 1.7769290208816528} 11/07/2021 05:06:17 - INFO - __main__ - Step 56064: {'lr': 0.00035267057370447967, 'samples': 10764288, 'steps': 56063, 'loss/train': 1.2261261940002441} 11/07/2021 05:06:19 - INFO - __main__ - Step 56065: {'lr': 0.00035266573510589114, 'samples': 10764480, 'steps': 56064, 'loss/train': 2.025089979171753} 11/07/2021 05:06:19 - INFO - __main__ - Step 56066: {'lr': 0.00035266089646104296, 'samples': 10764672, 'steps': 56065, 'loss/train': 1.5993165969848633} 11/07/2021 05:06:19 - INFO - __main__ - Step 56067: {'lr': 0.00035265605776993735, 'samples': 10764864, 'steps': 56066, 'loss/train': 0.9580615162849426} 11/07/2021 05:06:20 - INFO - __main__ - Step 56068: {'lr': 0.0003526512190325765, 'samples': 10765056, 'steps': 56067, 'loss/train': 0.8672080039978027} 11/07/2021 05:06:20 - INFO - __main__ - Step 56069: {'lr': 0.0003526463802489626, 'samples': 10765248, 'steps': 56068, 'loss/train': 1.4725104570388794} 11/07/2021 05:06:20 - INFO - __main__ - Step 56070: {'lr': 0.00035264154141909787, 'samples': 10765440, 'steps': 56069, 'loss/train': 0.6661514043807983} 11/07/2021 05:06:21 - INFO - __main__ - Step 56071: {'lr': 0.00035263670254298443, 'samples': 10765632, 'steps': 56070, 'loss/train': 0.22711573541164398} 11/07/2021 05:06:22 - INFO - __main__ - Step 56072: {'lr': 0.0003526318636206244, 'samples': 10765824, 'steps': 56071, 'loss/train': 1.1294758319854736} 11/07/2021 05:06:22 - INFO - __main__ - Step 56073: {'lr': 0.0003526270246520201, 'samples': 10766016, 'steps': 56072, 'loss/train': 1.2055062055587769} 11/07/2021 05:06:22 - INFO - __main__ - Step 56074: {'lr': 0.0003526221856371737, 'samples': 10766208, 'steps': 56073, 'loss/train': 1.413435935974121} 11/07/2021 05:06:23 - INFO - __main__ - Step 56075: {'lr': 0.0003526173465760872, 'samples': 10766400, 'steps': 56074, 'loss/train': 1.345708966255188} 11/07/2021 05:06:24 - INFO - __main__ - Step 56076: {'lr': 0.000352612507468763, 'samples': 10766592, 'steps': 56075, 'loss/train': 1.6085463762283325} 11/07/2021 05:06:24 - INFO - __main__ - Step 56077: {'lr': 0.00035260766831520315, 'samples': 10766784, 'steps': 56076, 'loss/train': 0.28465864062309265} 11/07/2021 05:06:24 - INFO - __main__ - Step 56078: {'lr': 0.0003526028291154099, 'samples': 10766976, 'steps': 56077, 'loss/train': 0.41470253467559814} 11/07/2021 05:06:25 - INFO - __main__ - Step 56079: {'lr': 0.00035259798986938537, 'samples': 10767168, 'steps': 56078, 'loss/train': 1.6819130182266235} 11/07/2021 05:06:25 - INFO - __main__ - Step 56080: {'lr': 0.00035259315057713177, 'samples': 10767360, 'steps': 56079, 'loss/train': 1.3918805122375488} 11/07/2021 05:06:26 - INFO - __main__ - Step 56081: {'lr': 0.0003525883112386513, 'samples': 10767552, 'steps': 56080, 'loss/train': 1.782894253730774} 11/07/2021 05:06:27 - INFO - __main__ - Step 56082: {'lr': 0.00035258347185394606, 'samples': 10767744, 'steps': 56081, 'loss/train': 1.506486177444458} 11/07/2021 05:06:27 - INFO - __main__ - Step 56083: {'lr': 0.00035257863242301834, 'samples': 10767936, 'steps': 56082, 'loss/train': 1.3411083221435547} 11/07/2021 05:06:27 - INFO - __main__ - Step 56084: {'lr': 0.0003525737929458703, 'samples': 10768128, 'steps': 56083, 'loss/train': 1.1377899646759033} 11/07/2021 05:06:28 - INFO - __main__ - Step 56085: {'lr': 0.0003525689534225041, 'samples': 10768320, 'steps': 56084, 'loss/train': 1.076027750968933} 11/07/2021 05:06:29 - INFO - __main__ - Step 56086: {'lr': 0.00035256411385292186, 'samples': 10768512, 'steps': 56085, 'loss/train': 1.695565938949585} 11/07/2021 05:06:29 - INFO - __main__ - Step 56087: {'lr': 0.0003525592742371258, 'samples': 10768704, 'steps': 56086, 'loss/train': 1.4092981815338135} 11/07/2021 05:06:29 - INFO - __main__ - Step 56088: {'lr': 0.0003525544345751182, 'samples': 10768896, 'steps': 56087, 'loss/train': 1.2346290349960327} 11/07/2021 05:06:30 - INFO - __main__ - Step 56089: {'lr': 0.00035254959486690103, 'samples': 10769088, 'steps': 56088, 'loss/train': 1.3800259828567505} 11/07/2021 05:06:30 - INFO - __main__ - Step 56090: {'lr': 0.0003525447551124766, 'samples': 10769280, 'steps': 56089, 'loss/train': 1.3414525985717773} 11/07/2021 05:06:31 - INFO - __main__ - Step 56091: {'lr': 0.0003525399153118472, 'samples': 10769472, 'steps': 56090, 'loss/train': 1.544366478919983} 11/07/2021 05:06:32 - INFO - __main__ - Step 56092: {'lr': 0.00035253507546501484, 'samples': 10769664, 'steps': 56091, 'loss/train': 1.2946702241897583} 11/07/2021 05:06:32 - INFO - __main__ - Step 56093: {'lr': 0.0003525302355719818, 'samples': 10769856, 'steps': 56092, 'loss/train': 1.568245768547058} 11/07/2021 05:06:32 - INFO - __main__ - Step 56094: {'lr': 0.0003525253956327501, 'samples': 10770048, 'steps': 56093, 'loss/train': 1.3921455144882202} 11/07/2021 05:06:33 - INFO - __main__ - Step 56095: {'lr': 0.0003525205556473221, 'samples': 10770240, 'steps': 56094, 'loss/train': 1.4071180820465088} 11/07/2021 05:06:33 - INFO - __main__ - Step 56096: {'lr': 0.0003525157156157, 'samples': 10770432, 'steps': 56095, 'loss/train': 1.4553344249725342} 11/07/2021 05:06:34 - INFO - __main__ - Step 56097: {'lr': 0.00035251087553788584, 'samples': 10770624, 'steps': 56096, 'loss/train': 1.5578862428665161} 11/07/2021 05:06:34 - INFO - __main__ - Step 56098: {'lr': 0.00035250603541388183, 'samples': 10770816, 'steps': 56097, 'loss/train': 0.07733270525932312} 11/07/2021 05:06:35 - INFO - __main__ - Step 56099: {'lr': 0.00035250119524369016, 'samples': 10771008, 'steps': 56098, 'loss/train': 1.6780366897583008} 11/07/2021 05:06:35 - INFO - __main__ - Step 56100: {'lr': 0.00035249635502731315, 'samples': 10771200, 'steps': 56099, 'loss/train': 1.8048059940338135} 11/07/2021 05:06:36 - INFO - __main__ - Step 56101: {'lr': 0.0003524915147647528, 'samples': 10771392, 'steps': 56100, 'loss/train': 1.3484421968460083} 11/07/2021 05:06:37 - INFO - __main__ - Step 56102: {'lr': 0.00035248667445601133, 'samples': 10771584, 'steps': 56101, 'loss/train': 1.3193823099136353} 11/07/2021 05:06:37 - INFO - __main__ - Step 56103: {'lr': 0.00035248183410109096, 'samples': 10771776, 'steps': 56102, 'loss/train': 0.33234304189682007} 11/07/2021 05:06:37 - INFO - __main__ - Step 56104: {'lr': 0.0003524769936999939, 'samples': 10771968, 'steps': 56103, 'loss/train': 1.1785374879837036} 11/07/2021 05:06:38 - INFO - __main__ - Step 56105: {'lr': 0.0003524721532527222, 'samples': 10772160, 'steps': 56104, 'loss/train': 0.7846789360046387} 11/07/2021 05:06:38 - INFO - __main__ - Step 56106: {'lr': 0.0003524673127592782, 'samples': 10772352, 'steps': 56105, 'loss/train': 1.1164886951446533} 11/07/2021 05:06:39 - INFO - __main__ - Step 56107: {'lr': 0.000352462472219664, 'samples': 10772544, 'steps': 56106, 'loss/train': 1.6663761138916016} 11/07/2021 05:06:39 - INFO - __main__ - Step 56108: {'lr': 0.0003524576316338818, 'samples': 10772736, 'steps': 56107, 'loss/train': 0.842548131942749} 11/07/2021 05:06:40 - INFO - __main__ - Step 56109: {'lr': 0.0003524527910019337, 'samples': 10772928, 'steps': 56108, 'loss/train': 1.3627212047576904} 11/07/2021 05:06:40 - INFO - __main__ - Step 56110: {'lr': 0.00035244795032382206, 'samples': 10773120, 'steps': 56109, 'loss/train': 1.4599909782409668} 11/07/2021 05:06:41 - INFO - __main__ - Step 56111: {'lr': 0.00035244310959954886, 'samples': 10773312, 'steps': 56110, 'loss/train': 1.293717861175537} 11/07/2021 05:06:42 - INFO - __main__ - Step 56112: {'lr': 0.0003524382688291164, 'samples': 10773504, 'steps': 56111, 'loss/train': 1.4754217863082886} 11/07/2021 05:06:42 - INFO - __main__ - Step 56113: {'lr': 0.0003524334280125269, 'samples': 10773696, 'steps': 56112, 'loss/train': 0.8247331976890564} 11/07/2021 05:06:42 - INFO - __main__ - Step 56114: {'lr': 0.0003524285871497824, 'samples': 10773888, 'steps': 56113, 'loss/train': 1.2718819379806519} 11/07/2021 05:06:43 - INFO - __main__ - Step 56115: {'lr': 0.0003524237462408852, 'samples': 10774080, 'steps': 56114, 'loss/train': 1.5664552450180054} 11/07/2021 05:06:43 - INFO - __main__ - Step 56116: {'lr': 0.0003524189052858374, 'samples': 10774272, 'steps': 56115, 'loss/train': 1.324585199356079} 11/07/2021 05:06:43 - INFO - __main__ - Step 56117: {'lr': 0.0003524140642846413, 'samples': 10774464, 'steps': 56116, 'loss/train': 0.9505982995033264} 11/07/2021 05:06:44 - INFO - __main__ - Step 56118: {'lr': 0.0003524092232372989, 'samples': 10774656, 'steps': 56117, 'loss/train': 1.560680627822876} 11/07/2021 05:06:45 - INFO - __main__ - Step 56119: {'lr': 0.00035240438214381253, 'samples': 10774848, 'steps': 56118, 'loss/train': 1.2786351442337036} 11/07/2021 05:06:45 - INFO - __main__ - Step 56120: {'lr': 0.00035239954100418436, 'samples': 10775040, 'steps': 56119, 'loss/train': 1.2073078155517578} 11/07/2021 05:06:45 - INFO - __main__ - Step 56121: {'lr': 0.00035239469981841656, 'samples': 10775232, 'steps': 56120, 'loss/train': 1.5953010320663452} 11/07/2021 05:06:46 - INFO - __main__ - Step 56122: {'lr': 0.0003523898585865112, 'samples': 10775424, 'steps': 56121, 'loss/train': 1.4239513874053955} 11/07/2021 05:06:47 - INFO - __main__ - Step 56123: {'lr': 0.0003523850173084706, 'samples': 10775616, 'steps': 56122, 'loss/train': 1.1879708766937256} 11/07/2021 05:06:47 - INFO - __main__ - Step 56124: {'lr': 0.00035238017598429686, 'samples': 10775808, 'steps': 56123, 'loss/train': 1.3142179250717163} 11/07/2021 05:06:47 - INFO - __main__ - Step 56125: {'lr': 0.0003523753346139922, 'samples': 10776000, 'steps': 56124, 'loss/train': 1.1931241750717163} 11/07/2021 05:06:48 - INFO - __main__ - Step 56126: {'lr': 0.0003523704931975588, 'samples': 10776192, 'steps': 56125, 'loss/train': 1.5110180377960205} 11/07/2021 05:06:48 - INFO - __main__ - Step 56127: {'lr': 0.0003523656517349989, 'samples': 10776384, 'steps': 56126, 'loss/train': 1.1871814727783203} 11/07/2021 05:06:49 - INFO - __main__ - Step 56128: {'lr': 0.0003523608102263145, 'samples': 10776576, 'steps': 56127, 'loss/train': 1.6022156476974487} 11/07/2021 05:06:50 - INFO - __main__ - Step 56129: {'lr': 0.00035235596867150797, 'samples': 10776768, 'steps': 56128, 'loss/train': 1.578641653060913} 11/07/2021 05:06:50 - INFO - __main__ - Step 56130: {'lr': 0.0003523511270705814, 'samples': 10776960, 'steps': 56129, 'loss/train': 1.1083091497421265} 11/07/2021 05:06:50 - INFO - __main__ - Step 56131: {'lr': 0.000352346285423537, 'samples': 10777152, 'steps': 56130, 'loss/train': 1.749462604522705} 11/07/2021 05:06:51 - INFO - __main__ - Step 56132: {'lr': 0.0003523414437303769, 'samples': 10777344, 'steps': 56131, 'loss/train': 1.507034182548523} 11/07/2021 05:06:52 - INFO - __main__ - Step 56133: {'lr': 0.0003523366019911035, 'samples': 10777536, 'steps': 56132, 'loss/train': 1.1592656373977661} 11/07/2021 05:06:52 - INFO - __main__ - Step 56134: {'lr': 0.00035233176020571863, 'samples': 10777728, 'steps': 56133, 'loss/train': 1.5731182098388672} 11/07/2021 05:06:52 - INFO - __main__ - Step 56135: {'lr': 0.0003523269183742246, 'samples': 10777920, 'steps': 56134, 'loss/train': 0.8437652587890625} 11/07/2021 05:06:53 - INFO - __main__ - Step 56136: {'lr': 0.0003523220764966238, 'samples': 10778112, 'steps': 56135, 'loss/train': 1.5683997869491577} 11/07/2021 05:06:53 - INFO - __main__ - Step 56137: {'lr': 0.00035231723457291816, 'samples': 10778304, 'steps': 56136, 'loss/train': 1.260100245475769} 11/07/2021 05:06:54 - INFO - __main__ - Step 56138: {'lr': 0.00035231239260311, 'samples': 10778496, 'steps': 56137, 'loss/train': 1.6461832523345947} 11/07/2021 05:06:55 - INFO - __main__ - Step 56139: {'lr': 0.0003523075505872014, 'samples': 10778688, 'steps': 56138, 'loss/train': 1.358130931854248} 11/07/2021 05:06:55 - INFO - __main__ - Step 56140: {'lr': 0.00035230270852519465, 'samples': 10778880, 'steps': 56139, 'loss/train': 1.5266367197036743} 11/07/2021 05:06:56 - INFO - __main__ - Step 56141: {'lr': 0.00035229786641709183, 'samples': 10779072, 'steps': 56140, 'loss/train': 1.1808722019195557} 11/07/2021 05:06:56 - INFO - __main__ - Step 56142: {'lr': 0.00035229302426289524, 'samples': 10779264, 'steps': 56141, 'loss/train': 0.1280687153339386} 11/07/2021 05:06:57 - INFO - __main__ - Step 56143: {'lr': 0.00035228818206260693, 'samples': 10779456, 'steps': 56142, 'loss/train': 1.2151098251342773} 11/07/2021 05:06:57 - INFO - __main__ - Step 56144: {'lr': 0.00035228333981622914, 'samples': 10779648, 'steps': 56143, 'loss/train': 1.38080894947052} 11/07/2021 05:06:58 - INFO - __main__ - Step 56145: {'lr': 0.0003522784975237641, 'samples': 10779840, 'steps': 56144, 'loss/train': 1.393256425857544} 11/07/2021 05:06:58 - INFO - __main__ - Step 56146: {'lr': 0.00035227365518521387, 'samples': 10780032, 'steps': 56145, 'loss/train': 1.5399317741394043} 11/07/2021 05:06:58 - INFO - __main__ - Step 56147: {'lr': 0.00035226881280058084, 'samples': 10780224, 'steps': 56146, 'loss/train': 1.4262508153915405} 11/07/2021 05:07:00 - INFO - __main__ - Step 56148: {'lr': 0.00035226397036986694, 'samples': 10780416, 'steps': 56147, 'loss/train': 1.5341914892196655} 11/07/2021 05:07:00 - INFO - __main__ - Step 56149: {'lr': 0.0003522591278930745, 'samples': 10780608, 'steps': 56148, 'loss/train': 1.5461366176605225} 11/07/2021 05:07:01 - INFO - __main__ - Step 56150: {'lr': 0.0003522542853702057, 'samples': 10780800, 'steps': 56149, 'loss/train': 1.8080079555511475} 11/07/2021 05:07:01 - INFO - __main__ - Step 56151: {'lr': 0.0003522494428012627, 'samples': 10780992, 'steps': 56150, 'loss/train': 1.4833968877792358} 11/07/2021 05:07:01 - INFO - __main__ - Step 56152: {'lr': 0.0003522446001862476, 'samples': 10781184, 'steps': 56151, 'loss/train': 1.6682887077331543} 11/07/2021 05:07:02 - INFO - __main__ - Step 56153: {'lr': 0.00035223975752516273, 'samples': 10781376, 'steps': 56152, 'loss/train': 1.2973426580429077} 11/07/2021 05:07:02 - INFO - __main__ - Step 56154: {'lr': 0.0003522349148180103, 'samples': 10781568, 'steps': 56153, 'loss/train': 0.6574332118034363} 11/07/2021 05:07:03 - INFO - __main__ - Step 56155: {'lr': 0.00035223007206479226, 'samples': 10781760, 'steps': 56154, 'loss/train': 1.3152788877487183} 11/07/2021 05:07:03 - INFO - __main__ - Step 56156: {'lr': 0.00035222522926551094, 'samples': 10781952, 'steps': 56155, 'loss/train': 1.5688040256500244} 11/07/2021 05:07:04 - INFO - __main__ - Step 56157: {'lr': 0.0003522203864201685, 'samples': 10782144, 'steps': 56156, 'loss/train': 1.280598521232605} 11/07/2021 05:07:04 - INFO - __main__ - Step 56158: {'lr': 0.00035221554352876715, 'samples': 10782336, 'steps': 56157, 'loss/train': 1.8648959398269653} 11/07/2021 05:07:04 - INFO - __main__ - Step 56159: {'lr': 0.00035221070059130913, 'samples': 10782528, 'steps': 56158, 'loss/train': 1.2692580223083496} 11/07/2021 05:07:05 - INFO - __main__ - Step 56160: {'lr': 0.0003522058576077965, 'samples': 10782720, 'steps': 56159, 'loss/train': 1.649580955505371} 11/07/2021 05:07:06 - INFO - __main__ - Step 56161: {'lr': 0.00035220101457823143, 'samples': 10782912, 'steps': 56160, 'loss/train': 1.2421942949295044} 11/07/2021 05:07:06 - INFO - __main__ - Step 56162: {'lr': 0.0003521961715026162, 'samples': 10783104, 'steps': 56161, 'loss/train': 1.789180040359497} 11/07/2021 05:07:07 - INFO - __main__ - Step 56163: {'lr': 0.0003521913283809529, 'samples': 10783296, 'steps': 56162, 'loss/train': 1.1353594064712524} 11/07/2021 05:07:07 - INFO - __main__ - Step 56164: {'lr': 0.00035218648521324387, 'samples': 10783488, 'steps': 56163, 'loss/train': 1.6117337942123413} 11/07/2021 05:07:08 - INFO - __main__ - Step 56165: {'lr': 0.0003521816419994911, 'samples': 10783680, 'steps': 56164, 'loss/train': 0.9769127368927002} 11/07/2021 05:07:08 - INFO - __main__ - Step 56166: {'lr': 0.0003521767987396969, 'samples': 10783872, 'steps': 56165, 'loss/train': 1.6191201210021973} 11/07/2021 05:07:09 - INFO - __main__ - Step 56167: {'lr': 0.00035217195543386345, 'samples': 10784064, 'steps': 56166, 'loss/train': 1.4446793794631958} 11/07/2021 05:07:09 - INFO - __main__ - Step 56168: {'lr': 0.0003521671120819928, 'samples': 10784256, 'steps': 56167, 'loss/train': 1.4271960258483887} 11/07/2021 05:07:09 - INFO - __main__ - Step 56169: {'lr': 0.0003521622686840873, 'samples': 10784448, 'steps': 56168, 'loss/train': 0.05827701464295387} 11/07/2021 05:07:10 - INFO - __main__ - Step 56170: {'lr': 0.000352157425240149, 'samples': 10784640, 'steps': 56169, 'loss/train': 1.6075183153152466} 11/07/2021 05:07:11 - INFO - __main__ - Step 56171: {'lr': 0.00035215258175018015, 'samples': 10784832, 'steps': 56170, 'loss/train': 1.5340136289596558} 11/07/2021 05:07:11 - INFO - __main__ - Step 56172: {'lr': 0.00035214773821418295, 'samples': 10785024, 'steps': 56171, 'loss/train': 1.220860242843628} 11/07/2021 05:07:11 - INFO - __main__ - Step 56173: {'lr': 0.00035214289463215954, 'samples': 10785216, 'steps': 56172, 'loss/train': 1.3255250453948975} 11/07/2021 05:07:12 - INFO - __main__ - Step 56174: {'lr': 0.00035213805100411217, 'samples': 10785408, 'steps': 56173, 'loss/train': 1.3527663946151733} 11/07/2021 05:07:13 - INFO - __main__ - Step 56175: {'lr': 0.00035213320733004297, 'samples': 10785600, 'steps': 56174, 'loss/train': 1.3792458772659302} 11/07/2021 05:07:13 - INFO - __main__ - Step 56176: {'lr': 0.00035212836360995405, 'samples': 10785792, 'steps': 56175, 'loss/train': 1.4815794229507446} 11/07/2021 05:07:14 - INFO - __main__ - Step 56177: {'lr': 0.0003521235198438477, 'samples': 10785984, 'steps': 56176, 'loss/train': 1.5355722904205322} 11/07/2021 05:07:14 - INFO - __main__ - Step 56178: {'lr': 0.000352118676031726, 'samples': 10786176, 'steps': 56177, 'loss/train': 1.8288989067077637} 11/07/2021 05:07:14 - INFO - __main__ - Step 56179: {'lr': 0.0003521138321735913, 'samples': 10786368, 'steps': 56178, 'loss/train': 1.0465120077133179} 11/07/2021 05:07:15 - INFO - __main__ - Step 56180: {'lr': 0.0003521089882694456, 'samples': 10786560, 'steps': 56179, 'loss/train': 1.408577799797058} 11/07/2021 05:07:16 - INFO - __main__ - Step 56181: {'lr': 0.0003521041443192913, 'samples': 10786752, 'steps': 56180, 'loss/train': 1.7961503267288208} 11/07/2021 05:07:16 - INFO - __main__ - Step 56182: {'lr': 0.00035209930032313033, 'samples': 10786944, 'steps': 56181, 'loss/train': 1.2707103490829468} 11/07/2021 05:07:16 - INFO - __main__ - Step 56183: {'lr': 0.000352094456280965, 'samples': 10787136, 'steps': 56182, 'loss/train': 1.4756022691726685} 11/07/2021 05:07:17 - INFO - __main__ - Step 56184: {'lr': 0.0003520896121927975, 'samples': 10787328, 'steps': 56183, 'loss/train': 1.4517836570739746} 11/07/2021 05:07:17 - INFO - __main__ - Step 56185: {'lr': 0.00035208476805863, 'samples': 10787520, 'steps': 56184, 'loss/train': 1.2510775327682495} 11/07/2021 05:07:18 - INFO - __main__ - Step 56186: {'lr': 0.00035207992387846466, 'samples': 10787712, 'steps': 56185, 'loss/train': 1.7733508348464966} 11/07/2021 05:07:19 - INFO - __main__ - Step 56187: {'lr': 0.0003520750796523037, 'samples': 10787904, 'steps': 56186, 'loss/train': 1.5007990598678589} 11/07/2021 05:07:19 - INFO - __main__ - Step 56188: {'lr': 0.0003520702353801493, 'samples': 10788096, 'steps': 56187, 'loss/train': 1.7201709747314453} 11/07/2021 05:07:19 - INFO - __main__ - Step 56189: {'lr': 0.0003520653910620036, 'samples': 10788288, 'steps': 56188, 'loss/train': 1.6831419467926025} 11/07/2021 05:07:20 - INFO - __main__ - Step 56190: {'lr': 0.0003520605466978688, 'samples': 10788480, 'steps': 56189, 'loss/train': 1.3703277111053467} 11/07/2021 05:07:20 - INFO - __main__ - Step 56191: {'lr': 0.00035205570228774715, 'samples': 10788672, 'steps': 56190, 'loss/train': 1.1786553859710693} 11/07/2021 05:07:21 - INFO - __main__ - Step 56192: {'lr': 0.0003520508578316407, 'samples': 10788864, 'steps': 56191, 'loss/train': 0.9848109483718872} 11/07/2021 05:07:21 - INFO - __main__ - Step 56193: {'lr': 0.0003520460133295518, 'samples': 10789056, 'steps': 56192, 'loss/train': 0.5717485547065735} 11/07/2021 05:07:22 - INFO - __main__ - Step 56194: {'lr': 0.0003520411687814825, 'samples': 10789248, 'steps': 56193, 'loss/train': 0.6043590903282166} 11/07/2021 05:07:22 - INFO - __main__ - Step 56195: {'lr': 0.000352036324187435, 'samples': 10789440, 'steps': 56194, 'loss/train': 1.6420927047729492} 11/07/2021 05:07:22 - INFO - __main__ - Step 56196: {'lr': 0.0003520314795474115, 'samples': 10789632, 'steps': 56195, 'loss/train': 1.801336646080017} 11/07/2021 05:07:23 - INFO - __main__ - Step 56197: {'lr': 0.00035202663486141417, 'samples': 10789824, 'steps': 56196, 'loss/train': 1.2203136682510376} 11/07/2021 05:07:24 - INFO - __main__ - Step 56198: {'lr': 0.00035202179012944527, 'samples': 10790016, 'steps': 56197, 'loss/train': 1.20452082157135} 11/07/2021 05:07:24 - INFO - __main__ - Step 56199: {'lr': 0.0003520169453515069, 'samples': 10790208, 'steps': 56198, 'loss/train': 1.4479241371154785} 11/07/2021 05:07:25 - INFO - __main__ - Step 56200: {'lr': 0.00035201210052760123, 'samples': 10790400, 'steps': 56199, 'loss/train': 1.5330617427825928} 11/07/2021 05:07:25 - INFO - __main__ - Step 56201: {'lr': 0.0003520072556577306, 'samples': 10790592, 'steps': 56200, 'loss/train': 1.4841582775115967} 11/07/2021 05:07:26 - INFO - __main__ - Step 56202: {'lr': 0.000352002410741897, 'samples': 10790784, 'steps': 56201, 'loss/train': 1.36457359790802} 11/07/2021 05:07:26 - INFO - __main__ - Step 56203: {'lr': 0.00035199756578010267, 'samples': 10790976, 'steps': 56202, 'loss/train': 1.483276128768921} 11/07/2021 05:07:27 - INFO - __main__ - Step 56204: {'lr': 0.0003519927207723498, 'samples': 10791168, 'steps': 56203, 'loss/train': 4.350569725036621} 11/07/2021 05:07:27 - INFO - __main__ - Step 56205: {'lr': 0.00035198787571864067, 'samples': 10791360, 'steps': 56204, 'loss/train': 1.2586426734924316} 11/07/2021 05:07:27 - INFO - __main__ - Step 56206: {'lr': 0.0003519830306189773, 'samples': 10791552, 'steps': 56205, 'loss/train': 1.167547345161438} 11/07/2021 05:07:28 - INFO - __main__ - Step 56207: {'lr': 0.000351978185473362, 'samples': 10791744, 'steps': 56206, 'loss/train': 2.1130430698394775} 11/07/2021 05:07:29 - INFO - __main__ - Step 56208: {'lr': 0.0003519733402817968, 'samples': 10791936, 'steps': 56207, 'loss/train': 1.4643226861953735} 11/07/2021 05:07:29 - INFO - __main__ - Step 56209: {'lr': 0.0003519684950442841, 'samples': 10792128, 'steps': 56208, 'loss/train': 1.52162504196167} 11/07/2021 05:07:29 - INFO - __main__ - Step 56210: {'lr': 0.00035196364976082593, 'samples': 10792320, 'steps': 56209, 'loss/train': 1.4099293947219849} 11/07/2021 05:07:30 - INFO - __main__ - Step 56211: {'lr': 0.0003519588044314245, 'samples': 10792512, 'steps': 56210, 'loss/train': 1.165474534034729} 11/07/2021 05:07:30 - INFO - __main__ - Step 56212: {'lr': 0.000351953959056082, 'samples': 10792704, 'steps': 56211, 'loss/train': 1.2444922924041748} 11/07/2021 05:07:31 - INFO - __main__ - Step 56213: {'lr': 0.0003519491136348006, 'samples': 10792896, 'steps': 56212, 'loss/train': 1.6387226581573486} 11/07/2021 05:07:32 - INFO - __main__ - Step 56214: {'lr': 0.0003519442681675826, 'samples': 10793088, 'steps': 56213, 'loss/train': 1.2214232683181763} 11/07/2021 05:07:32 - INFO - __main__ - Step 56215: {'lr': 0.00035193942265443, 'samples': 10793280, 'steps': 56214, 'loss/train': 0.08244670182466507} 11/07/2021 05:07:32 - INFO - __main__ - Step 56216: {'lr': 0.0003519345770953452, 'samples': 10793472, 'steps': 56215, 'loss/train': 1.309668779373169} 11/07/2021 05:07:33 - INFO - __main__ - Step 56217: {'lr': 0.00035192973149033007, 'samples': 10793664, 'steps': 56216, 'loss/train': 1.4655251502990723} 11/07/2021 05:07:34 - INFO - __main__ - Step 56218: {'lr': 0.0003519248858393871, 'samples': 10793856, 'steps': 56217, 'loss/train': 1.0780271291732788} 11/07/2021 05:07:34 - INFO - __main__ - Step 56219: {'lr': 0.0003519200401425183, 'samples': 10794048, 'steps': 56218, 'loss/train': 1.5708234310150146} 11/07/2021 05:07:34 - INFO - __main__ - Step 56220: {'lr': 0.0003519151943997259, 'samples': 10794240, 'steps': 56219, 'loss/train': 1.6604366302490234} 11/07/2021 05:07:35 - INFO - __main__ - Step 56221: {'lr': 0.0003519103486110121, 'samples': 10794432, 'steps': 56220, 'loss/train': 1.2300477027893066} 11/07/2021 05:07:35 - INFO - __main__ - Step 56222: {'lr': 0.0003519055027763791, 'samples': 10794624, 'steps': 56221, 'loss/train': 1.3391817808151245} 11/07/2021 05:07:36 - INFO - __main__ - Step 56223: {'lr': 0.00035190065689582895, 'samples': 10794816, 'steps': 56222, 'loss/train': 1.343144178390503} 11/07/2021 05:07:36 - INFO - __main__ - Step 56224: {'lr': 0.00035189581096936395, 'samples': 10795008, 'steps': 56223, 'loss/train': 1.6980105638504028} 11/07/2021 05:07:37 - INFO - __main__ - Step 56225: {'lr': 0.0003518909649969864, 'samples': 10795200, 'steps': 56224, 'loss/train': 1.4061853885650635} 11/07/2021 05:07:37 - INFO - __main__ - Step 56226: {'lr': 0.00035188611897869824, 'samples': 10795392, 'steps': 56225, 'loss/train': 1.4199899435043335} 11/07/2021 05:07:37 - INFO - __main__ - Step 56227: {'lr': 0.00035188127291450183, 'samples': 10795584, 'steps': 56226, 'loss/train': 1.4231295585632324} 11/07/2021 05:07:39 - INFO - __main__ - Step 56228: {'lr': 0.00035187642680439927, 'samples': 10795776, 'steps': 56227, 'loss/train': 1.2420918941497803} 11/07/2021 05:07:39 - INFO - __main__ - Step 56229: {'lr': 0.0003518715806483928, 'samples': 10795968, 'steps': 56228, 'loss/train': 1.7138904333114624} 11/07/2021 05:07:39 - INFO - __main__ - Step 56230: {'lr': 0.0003518667344464845, 'samples': 10796160, 'steps': 56229, 'loss/train': 1.467958927154541} 11/07/2021 05:07:40 - INFO - __main__ - Step 56231: {'lr': 0.00035186188819867663, 'samples': 10796352, 'steps': 56230, 'loss/train': 1.091019630432129} 11/07/2021 05:07:40 - INFO - __main__ - Step 56232: {'lr': 0.00035185704190497137, 'samples': 10796544, 'steps': 56231, 'loss/train': 0.6361316442489624} 11/07/2021 05:07:41 - INFO - __main__ - Step 56233: {'lr': 0.0003518521955653709, 'samples': 10796736, 'steps': 56232, 'loss/train': 1.327677607536316} 11/07/2021 05:07:41 - INFO - __main__ - Step 56234: {'lr': 0.0003518473491798774, 'samples': 10796928, 'steps': 56233, 'loss/train': 1.552869439125061} 11/07/2021 05:07:42 - INFO - __main__ - Step 56235: {'lr': 0.00035184250274849306, 'samples': 10797120, 'steps': 56234, 'loss/train': 1.4740413427352905} 11/07/2021 05:07:42 - INFO - __main__ - Step 56236: {'lr': 0.0003518376562712201, 'samples': 10797312, 'steps': 56235, 'loss/train': 1.5292969942092896} 11/07/2021 05:07:42 - INFO - __main__ - Step 56237: {'lr': 0.00035183280974806065, 'samples': 10797504, 'steps': 56236, 'loss/train': 1.5130194425582886} 11/07/2021 05:07:43 - INFO - __main__ - Step 56238: {'lr': 0.0003518279631790169, 'samples': 10797696, 'steps': 56237, 'loss/train': 1.3904306888580322} 11/07/2021 05:07:44 - INFO - __main__ - Step 56239: {'lr': 0.000351823116564091, 'samples': 10797888, 'steps': 56238, 'loss/train': 0.7780100703239441} 11/07/2021 05:07:44 - INFO - __main__ - Step 56240: {'lr': 0.0003518182699032852, 'samples': 10798080, 'steps': 56239, 'loss/train': 1.3004306554794312} 11/07/2021 05:07:44 - INFO - __main__ - Step 56241: {'lr': 0.0003518134231966017, 'samples': 10798272, 'steps': 56240, 'loss/train': 1.8354072570800781} 11/07/2021 05:07:45 - INFO - __main__ - Step 56242: {'lr': 0.0003518085764440426, 'samples': 10798464, 'steps': 56241, 'loss/train': 1.4560760259628296} 11/07/2021 05:07:46 - INFO - __main__ - Step 56243: {'lr': 0.00035180372964561013, 'samples': 10798656, 'steps': 56242, 'loss/train': 1.2428865432739258} 11/07/2021 05:07:46 - INFO - __main__ - Step 56244: {'lr': 0.00035179888280130646, 'samples': 10798848, 'steps': 56243, 'loss/train': 2.055394411087036} 11/07/2021 05:07:47 - INFO - __main__ - Step 56245: {'lr': 0.00035179403591113377, 'samples': 10799040, 'steps': 56244, 'loss/train': 1.4973167181015015} 11/07/2021 05:07:47 - INFO - __main__ - Step 56246: {'lr': 0.0003517891889750943, 'samples': 10799232, 'steps': 56245, 'loss/train': 1.0132272243499756} 11/07/2021 05:07:47 - INFO - __main__ - Step 56247: {'lr': 0.0003517843419931902, 'samples': 10799424, 'steps': 56246, 'loss/train': 1.3247162103652954} 11/07/2021 05:07:48 - INFO - __main__ - Step 56248: {'lr': 0.0003517794949654236, 'samples': 10799616, 'steps': 56247, 'loss/train': 1.4394179582595825} 11/07/2021 05:07:49 - INFO - __main__ - Step 56249: {'lr': 0.00035177464789179675, 'samples': 10799808, 'steps': 56248, 'loss/train': 1.0051782131195068} 11/07/2021 05:07:49 - INFO - __main__ - Step 56250: {'lr': 0.0003517698007723118, 'samples': 10800000, 'steps': 56249, 'loss/train': 0.5667877197265625} 11/07/2021 05:07:49 - INFO - __main__ - Step 56251: {'lr': 0.00035176495360697096, 'samples': 10800192, 'steps': 56250, 'loss/train': 1.3332971334457397} 11/07/2021 05:07:50 - INFO - __main__ - Step 56252: {'lr': 0.0003517601063957764, 'samples': 10800384, 'steps': 56251, 'loss/train': 1.1086777448654175} 11/07/2021 05:07:51 - INFO - __main__ - Step 56253: {'lr': 0.0003517552591387303, 'samples': 10800576, 'steps': 56252, 'loss/train': 1.2256686687469482} 11/07/2021 05:07:51 - INFO - __main__ - Step 56254: {'lr': 0.0003517504118358349, 'samples': 10800768, 'steps': 56253, 'loss/train': 0.9534670114517212} 11/07/2021 05:07:52 - INFO - __main__ - Step 56255: {'lr': 0.0003517455644870923, 'samples': 10800960, 'steps': 56254, 'loss/train': 1.9628669023513794} 11/07/2021 05:07:52 - INFO - __main__ - Step 56256: {'lr': 0.00035174071709250475, 'samples': 10801152, 'steps': 56255, 'loss/train': 1.3649989366531372} 11/07/2021 05:07:52 - INFO - __main__ - Step 56257: {'lr': 0.00035173586965207436, 'samples': 10801344, 'steps': 56256, 'loss/train': 0.8693755269050598} 11/07/2021 05:07:53 - INFO - __main__ - Step 56258: {'lr': 0.0003517310221658033, 'samples': 10801536, 'steps': 56257, 'loss/train': 1.0500729084014893} 11/07/2021 05:07:54 - INFO - __main__ - Step 56259: {'lr': 0.00035172617463369397, 'samples': 10801728, 'steps': 56258, 'loss/train': 1.4382660388946533} 11/07/2021 05:07:54 - INFO - __main__ - Step 56260: {'lr': 0.0003517213270557482, 'samples': 10801920, 'steps': 56259, 'loss/train': 1.8214768171310425} 11/07/2021 05:07:54 - INFO - __main__ - Step 56261: {'lr': 0.00035171647943196854, 'samples': 10802112, 'steps': 56260, 'loss/train': 1.5306284427642822} 11/07/2021 05:07:55 - INFO - __main__ - Step 56262: {'lr': 0.00035171163176235694, 'samples': 10802304, 'steps': 56261, 'loss/train': 1.4659202098846436} 11/07/2021 05:07:55 - INFO - __main__ - Step 56263: {'lr': 0.00035170678404691563, 'samples': 10802496, 'steps': 56262, 'loss/train': 1.4727448225021362} 11/07/2021 05:07:55 - INFO - __main__ - Step 56264: {'lr': 0.00035170193628564683, 'samples': 10802688, 'steps': 56263, 'loss/train': 1.5459659099578857} 11/07/2021 05:07:57 - INFO - __main__ - Step 56265: {'lr': 0.0003516970884785527, 'samples': 10802880, 'steps': 56264, 'loss/train': 1.6038678884506226} 11/07/2021 05:07:57 - INFO - __main__ - Step 56266: {'lr': 0.00035169224062563543, 'samples': 10803072, 'steps': 56265, 'loss/train': 1.2403324842453003} 11/07/2021 05:07:57 - INFO - __main__ - Step 56267: {'lr': 0.0003516873927268972, 'samples': 10803264, 'steps': 56266, 'loss/train': 1.416727900505066} 11/07/2021 05:07:58 - INFO - __main__ - Step 56268: {'lr': 0.0003516825447823403, 'samples': 10803456, 'steps': 56267, 'loss/train': 1.4993882179260254} 11/07/2021 05:07:58 - INFO - __main__ - Step 56269: {'lr': 0.0003516776967919667, 'samples': 10803648, 'steps': 56268, 'loss/train': 0.9666994214057922} 11/07/2021 05:07:59 - INFO - __main__ - Step 56270: {'lr': 0.0003516728487557787, 'samples': 10803840, 'steps': 56269, 'loss/train': 2.310224771499634} 11/07/2021 05:08:00 - INFO - __main__ - Step 56271: {'lr': 0.00035166800067377855, 'samples': 10804032, 'steps': 56270, 'loss/train': 1.0139963626861572} 11/07/2021 05:08:00 - INFO - __main__ - Step 56272: {'lr': 0.00035166315254596826, 'samples': 10804224, 'steps': 56271, 'loss/train': 1.0522022247314453} 11/07/2021 05:08:00 - INFO - __main__ - Step 56273: {'lr': 0.0003516583043723502, 'samples': 10804416, 'steps': 56272, 'loss/train': 1.325087308883667} 11/07/2021 05:08:01 - INFO - __main__ - Step 56274: {'lr': 0.0003516534561529264, 'samples': 10804608, 'steps': 56273, 'loss/train': 1.4703959226608276} 11/07/2021 05:08:01 - INFO - __main__ - Step 56275: {'lr': 0.00035164860788769925, 'samples': 10804800, 'steps': 56274, 'loss/train': 2.523444175720215} 11/07/2021 05:08:02 - INFO - __main__ - Step 56276: {'lr': 0.0003516437595766708, 'samples': 10804992, 'steps': 56275, 'loss/train': 1.155405044555664} 11/07/2021 05:08:03 - INFO - __main__ - Step 56277: {'lr': 0.00035163891121984316, 'samples': 10805184, 'steps': 56276, 'loss/train': 1.262868046760559} 11/07/2021 05:08:03 - INFO - __main__ - Step 56278: {'lr': 0.0003516340628172186, 'samples': 10805376, 'steps': 56277, 'loss/train': 0.8349221348762512} 11/07/2021 05:08:03 - INFO - __main__ - Step 56279: {'lr': 0.0003516292143687993, 'samples': 10805568, 'steps': 56278, 'loss/train': 1.7897318601608276} 11/07/2021 05:08:04 - INFO - __main__ - Step 56280: {'lr': 0.00035162436587458744, 'samples': 10805760, 'steps': 56279, 'loss/train': 0.9219061732292175} 11/07/2021 05:08:04 - INFO - __main__ - Step 56281: {'lr': 0.0003516195173345853, 'samples': 10805952, 'steps': 56280, 'loss/train': 1.3079655170440674} 11/07/2021 05:08:05 - INFO - __main__ - Step 56282: {'lr': 0.0003516146687487949, 'samples': 10806144, 'steps': 56281, 'loss/train': 0.2008887082338333} 11/07/2021 05:08:05 - INFO - __main__ - Step 56283: {'lr': 0.0003516098201172185, 'samples': 10806336, 'steps': 56282, 'loss/train': 1.6396210193634033} 11/07/2021 05:08:06 - INFO - __main__ - Step 56284: {'lr': 0.00035160497143985823, 'samples': 10806528, 'steps': 56283, 'loss/train': 1.391641616821289} 11/07/2021 05:08:06 - INFO - __main__ - Step 56285: {'lr': 0.0003516001227167164, 'samples': 10806720, 'steps': 56284, 'loss/train': 1.7935125827789307} 11/07/2021 05:08:07 - INFO - __main__ - Step 56286: {'lr': 0.0003515952739477951, 'samples': 10806912, 'steps': 56285, 'loss/train': 1.2115530967712402} 11/07/2021 05:08:08 - INFO - __main__ - Step 56287: {'lr': 0.0003515904251330965, 'samples': 10807104, 'steps': 56286, 'loss/train': 1.4226312637329102} 11/07/2021 05:08:08 - INFO - __main__ - Step 56288: {'lr': 0.00035158557627262295, 'samples': 10807296, 'steps': 56287, 'loss/train': 1.071256160736084} 11/07/2021 05:08:08 - INFO - __main__ - Step 56289: {'lr': 0.00035158072736637643, 'samples': 10807488, 'steps': 56288, 'loss/train': 1.5754724740982056} 11/07/2021 05:08:09 - INFO - __main__ - Step 56290: {'lr': 0.0003515758784143592, 'samples': 10807680, 'steps': 56289, 'loss/train': 2.0535340309143066} 11/07/2021 05:08:09 - INFO - __main__ - Step 56291: {'lr': 0.00035157102941657336, 'samples': 10807872, 'steps': 56290, 'loss/train': 1.3907785415649414} 11/07/2021 05:08:10 - INFO - __main__ - Step 56292: {'lr': 0.0003515661803730213, 'samples': 10808064, 'steps': 56291, 'loss/train': 1.1638755798339844} 11/07/2021 05:08:10 - INFO - __main__ - Step 56293: {'lr': 0.000351561331283705, 'samples': 10808256, 'steps': 56292, 'loss/train': 1.2919719219207764} 11/07/2021 05:08:11 - INFO - __main__ - Step 56294: {'lr': 0.0003515564821486268, 'samples': 10808448, 'steps': 56293, 'loss/train': 1.6496127843856812} 11/07/2021 05:08:11 - INFO - __main__ - Step 56295: {'lr': 0.00035155163296778883, 'samples': 10808640, 'steps': 56294, 'loss/train': 1.5217833518981934} 11/07/2021 05:08:11 - INFO - __main__ - Step 56296: {'lr': 0.0003515467837411932, 'samples': 10808832, 'steps': 56295, 'loss/train': 1.1412055492401123} 11/07/2021 05:08:12 - INFO - __main__ - Step 56297: {'lr': 0.0003515419344688422, 'samples': 10809024, 'steps': 56296, 'loss/train': 0.5805878043174744} 11/07/2021 05:08:13 - INFO - __main__ - Step 56298: {'lr': 0.00035153708515073793, 'samples': 10809216, 'steps': 56297, 'loss/train': 1.531552791595459} 11/07/2021 05:08:13 - INFO - __main__ - Step 56299: {'lr': 0.00035153223578688263, 'samples': 10809408, 'steps': 56298, 'loss/train': 0.08228524029254913} 11/07/2021 05:08:14 - INFO - __main__ - Step 56300: {'lr': 0.0003515273863772785, 'samples': 10809600, 'steps': 56299, 'loss/train': 1.5238845348358154} 11/07/2021 05:08:14 - INFO - __main__ - Step 56301: {'lr': 0.00035152253692192765, 'samples': 10809792, 'steps': 56300, 'loss/train': 1.3679463863372803} 11/07/2021 05:08:15 - INFO - __main__ - Step 56302: {'lr': 0.0003515176874208324, 'samples': 10809984, 'steps': 56301, 'loss/train': 1.4546902179718018} 11/07/2021 05:08:15 - INFO - __main__ - Step 56303: {'lr': 0.0003515128378739948, 'samples': 10810176, 'steps': 56302, 'loss/train': 1.1979960203170776} 11/07/2021 05:08:16 - INFO - __main__ - Step 56304: {'lr': 0.0003515079882814171, 'samples': 10810368, 'steps': 56303, 'loss/train': 1.2700610160827637} 11/07/2021 05:08:16 - INFO - __main__ - Step 56305: {'lr': 0.00035150313864310137, 'samples': 10810560, 'steps': 56304, 'loss/train': 1.5382988452911377} 11/07/2021 05:08:16 - INFO - __main__ - Step 56306: {'lr': 0.00035149828895904994, 'samples': 10810752, 'steps': 56305, 'loss/train': 1.3243350982666016} 11/07/2021 05:08:17 - INFO - __main__ - Step 56307: {'lr': 0.00035149343922926497, 'samples': 10810944, 'steps': 56306, 'loss/train': 1.6069839000701904} 11/07/2021 05:08:18 - INFO - __main__ - Step 56308: {'lr': 0.0003514885894537486, 'samples': 10811136, 'steps': 56307, 'loss/train': 1.2906118631362915} 11/07/2021 05:08:18 - INFO - __main__ - Step 56309: {'lr': 0.00035148373963250307, 'samples': 10811328, 'steps': 56308, 'loss/train': 1.5618865489959717} 11/07/2021 05:08:18 - INFO - __main__ - Step 56310: {'lr': 0.0003514788897655305, 'samples': 10811520, 'steps': 56309, 'loss/train': 0.7043637633323669} 11/07/2021 05:08:19 - INFO - __main__ - Step 56311: {'lr': 0.0003514740398528331, 'samples': 10811712, 'steps': 56310, 'loss/train': 0.9927510619163513} 11/07/2021 05:08:19 - INFO - __main__ - Step 56312: {'lr': 0.0003514691898944131, 'samples': 10811904, 'steps': 56311, 'loss/train': 1.4340620040893555} 11/07/2021 05:08:20 - INFO - __main__ - Step 56313: {'lr': 0.0003514643398902727, 'samples': 10812096, 'steps': 56312, 'loss/train': 1.3973286151885986} 11/07/2021 05:08:20 - INFO - __main__ - Step 56314: {'lr': 0.00035145948984041393, 'samples': 10812288, 'steps': 56313, 'loss/train': 1.5491747856140137} 11/07/2021 05:08:21 - INFO - __main__ - Step 56315: {'lr': 0.00035145463974483915, 'samples': 10812480, 'steps': 56314, 'loss/train': 1.3057949542999268} 11/07/2021 05:08:21 - INFO - __main__ - Step 56316: {'lr': 0.00035144978960355045, 'samples': 10812672, 'steps': 56315, 'loss/train': 1.265416145324707} 11/07/2021 05:08:22 - INFO - __main__ - Step 56317: {'lr': 0.00035144493941655, 'samples': 10812864, 'steps': 56316, 'loss/train': 1.378902792930603} 11/07/2021 05:08:23 - INFO - __main__ - Step 56318: {'lr': 0.00035144008918384006, 'samples': 10813056, 'steps': 56317, 'loss/train': 0.857494592666626} 11/07/2021 05:08:23 - INFO - __main__ - Step 56319: {'lr': 0.0003514352389054228, 'samples': 10813248, 'steps': 56318, 'loss/train': 1.1948238611221313} 11/07/2021 05:08:23 - INFO - __main__ - Step 56320: {'lr': 0.00035143038858130034, 'samples': 10813440, 'steps': 56319, 'loss/train': 1.2425715923309326} 11/07/2021 05:08:24 - INFO - __main__ - Step 56321: {'lr': 0.00035142553821147494, 'samples': 10813632, 'steps': 56320, 'loss/train': 1.6433122158050537} 11/07/2021 05:08:24 - INFO - __main__ - Step 56322: {'lr': 0.00035142068779594885, 'samples': 10813824, 'steps': 56321, 'loss/train': 2.380887031555176} 11/07/2021 05:08:25 - INFO - __main__ - Step 56323: {'lr': 0.00035141583733472407, 'samples': 10814016, 'steps': 56322, 'loss/train': 1.6240476369857788} 11/07/2021 05:08:26 - INFO - __main__ - Step 56324: {'lr': 0.0003514109868278028, 'samples': 10814208, 'steps': 56323, 'loss/train': 0.8962580561637878} 11/07/2021 05:08:26 - INFO - __main__ - Step 56325: {'lr': 0.0003514061362751874, 'samples': 10814400, 'steps': 56324, 'loss/train': 1.420181393623352} 11/07/2021 05:08:26 - INFO - __main__ - Step 56326: {'lr': 0.0003514012856768799, 'samples': 10814592, 'steps': 56325, 'loss/train': 1.265038251876831} 11/07/2021 05:08:27 - INFO - __main__ - Step 56327: {'lr': 0.0003513964350328826, 'samples': 10814784, 'steps': 56326, 'loss/train': 1.5866719484329224} 11/07/2021 05:08:27 - INFO - __main__ - Step 56328: {'lr': 0.0003513915843431977, 'samples': 10814976, 'steps': 56327, 'loss/train': 1.5536901950836182} 11/07/2021 05:08:28 - INFO - __main__ - Step 56329: {'lr': 0.0003513867336078272, 'samples': 10815168, 'steps': 56328, 'loss/train': 1.133144736289978} 11/07/2021 05:08:28 - INFO - __main__ - Step 56330: {'lr': 0.00035138188282677344, 'samples': 10815360, 'steps': 56329, 'loss/train': 1.1731499433517456} 11/07/2021 05:08:29 - INFO - __main__ - Step 56331: {'lr': 0.00035137703200003857, 'samples': 10815552, 'steps': 56330, 'loss/train': 1.340118408203125} 11/07/2021 05:08:29 - INFO - __main__ - Step 56332: {'lr': 0.00035137218112762475, 'samples': 10815744, 'steps': 56331, 'loss/train': 2.1245827674865723} 11/07/2021 05:08:30 - INFO - __main__ - Step 56333: {'lr': 0.0003513673302095342, 'samples': 10815936, 'steps': 56332, 'loss/train': 1.8567603826522827} 11/07/2021 05:08:30 - INFO - __main__ - Step 56334: {'lr': 0.0003513624792457691, 'samples': 10816128, 'steps': 56333, 'loss/train': 1.5586357116699219} 11/07/2021 05:08:31 - INFO - __main__ - Step 56335: {'lr': 0.00035135762823633167, 'samples': 10816320, 'steps': 56334, 'loss/train': 1.6616904735565186} 11/07/2021 05:08:31 - INFO - __main__ - Step 56336: {'lr': 0.00035135277718122403, 'samples': 10816512, 'steps': 56335, 'loss/train': 2.636472225189209} 11/07/2021 05:08:31 - INFO - __main__ - Step 56337: {'lr': 0.0003513479260804484, 'samples': 10816704, 'steps': 56336, 'loss/train': 1.2617383003234863} 11/07/2021 05:08:32 - INFO - __main__ - Step 56338: {'lr': 0.0003513430749340069, 'samples': 10816896, 'steps': 56337, 'loss/train': 1.1958850622177124} 11/07/2021 05:08:33 - INFO - __main__ - Step 56339: {'lr': 0.0003513382237419018, 'samples': 10817088, 'steps': 56338, 'loss/train': 1.488937497138977} 11/07/2021 05:08:33 - INFO - __main__ - Step 56340: {'lr': 0.00035133337250413534, 'samples': 10817280, 'steps': 56339, 'loss/train': 1.2550724744796753} 11/07/2021 05:08:33 - INFO - __main__ - Step 56341: {'lr': 0.00035132852122070953, 'samples': 10817472, 'steps': 56340, 'loss/train': 1.411402940750122} 11/07/2021 05:08:34 - INFO - __main__ - Step 56342: {'lr': 0.0003513236698916267, 'samples': 10817664, 'steps': 56341, 'loss/train': 1.3002254962921143} 11/07/2021 05:08:34 - INFO - __main__ - Step 56343: {'lr': 0.00035131881851688896, 'samples': 10817856, 'steps': 56342, 'loss/train': 0.9825994372367859} 11/07/2021 05:08:35 - INFO - __main__ - Step 56344: {'lr': 0.00035131396709649855, 'samples': 10818048, 'steps': 56343, 'loss/train': 1.5272952318191528} 11/07/2021 05:08:36 - INFO - __main__ - Step 56345: {'lr': 0.00035130911563045764, 'samples': 10818240, 'steps': 56344, 'loss/train': 2.537260055541992} 11/07/2021 05:08:36 - INFO - __main__ - Step 56346: {'lr': 0.00035130426411876834, 'samples': 10818432, 'steps': 56345, 'loss/train': 1.7120434045791626} 11/07/2021 05:08:36 - INFO - __main__ - Step 56347: {'lr': 0.00035129941256143295, 'samples': 10818624, 'steps': 56346, 'loss/train': 1.48082435131073} 11/07/2021 05:08:37 - INFO - __main__ - Step 56348: {'lr': 0.0003512945609584536, 'samples': 10818816, 'steps': 56347, 'loss/train': 0.07550205290317535} 11/07/2021 05:08:38 - INFO - __main__ - Step 56349: {'lr': 0.0003512897093098325, 'samples': 10819008, 'steps': 56348, 'loss/train': 1.1048388481140137} 11/07/2021 05:08:38 - INFO - __main__ - Step 56350: {'lr': 0.0003512848576155718, 'samples': 10819200, 'steps': 56349, 'loss/train': 2.077885389328003} 11/07/2021 05:08:38 - INFO - __main__ - Step 56351: {'lr': 0.0003512800058756738, 'samples': 10819392, 'steps': 56350, 'loss/train': 1.0637279748916626} 11/07/2021 05:08:39 - INFO - __main__ - Step 56352: {'lr': 0.00035127515409014046, 'samples': 10819584, 'steps': 56351, 'loss/train': 1.535597324371338} 11/07/2021 05:08:39 - INFO - __main__ - Step 56353: {'lr': 0.00035127030225897413, 'samples': 10819776, 'steps': 56352, 'loss/train': 1.1597362756729126} 11/07/2021 05:08:40 - INFO - __main__ - Step 56354: {'lr': 0.000351265450382177, 'samples': 10819968, 'steps': 56353, 'loss/train': 1.274021863937378} 11/07/2021 05:08:41 - INFO - __main__ - Step 56355: {'lr': 0.0003512605984597512, 'samples': 10820160, 'steps': 56354, 'loss/train': 1.341858983039856} 11/07/2021 05:08:41 - INFO - __main__ - Step 56356: {'lr': 0.00035125574649169894, 'samples': 10820352, 'steps': 56355, 'loss/train': 1.5707485675811768} 11/07/2021 05:08:41 - INFO - __main__ - Step 56357: {'lr': 0.0003512508944780224, 'samples': 10820544, 'steps': 56356, 'loss/train': 1.7568302154541016} 11/07/2021 05:08:42 - INFO - __main__ - Step 56358: {'lr': 0.0003512460424187237, 'samples': 10820736, 'steps': 56357, 'loss/train': 1.2920538187026978} 11/07/2021 05:08:43 - INFO - __main__ - Step 56359: {'lr': 0.00035124119031380526, 'samples': 10820928, 'steps': 56358, 'loss/train': 1.617993712425232} 11/07/2021 05:08:43 - INFO - __main__ - Step 56360: {'lr': 0.000351236338163269, 'samples': 10821120, 'steps': 56359, 'loss/train': 0.6446110606193542} 11/07/2021 05:08:43 - INFO - __main__ - Step 56361: {'lr': 0.00035123148596711716, 'samples': 10821312, 'steps': 56360, 'loss/train': 1.295314908027649} 11/07/2021 05:08:44 - INFO - __main__ - Step 56362: {'lr': 0.0003512266337253521, 'samples': 10821504, 'steps': 56361, 'loss/train': 1.4957098960876465} 11/07/2021 05:08:44 - INFO - __main__ - Step 56363: {'lr': 0.0003512217814379758, 'samples': 10821696, 'steps': 56362, 'loss/train': 1.2495609521865845} 11/07/2021 05:08:45 - INFO - __main__ - Step 56364: {'lr': 0.0003512169291049905, 'samples': 10821888, 'steps': 56363, 'loss/train': 1.2257405519485474} 11/07/2021 05:08:46 - INFO - __main__ - Step 56365: {'lr': 0.0003512120767263985, 'samples': 10822080, 'steps': 56364, 'loss/train': 1.4817039966583252} 11/07/2021 05:08:46 - INFO - __main__ - Step 56366: {'lr': 0.0003512072243022018, 'samples': 10822272, 'steps': 56365, 'loss/train': 1.5123552083969116} 11/07/2021 05:08:46 - INFO - __main__ - Step 56367: {'lr': 0.00035120237183240276, 'samples': 10822464, 'steps': 56366, 'loss/train': 1.4283339977264404} 11/07/2021 05:08:47 - INFO - __main__ - Step 56368: {'lr': 0.00035119751931700344, 'samples': 10822656, 'steps': 56367, 'loss/train': 1.0413459539413452} 11/07/2021 05:08:48 - INFO - __main__ - Step 56369: {'lr': 0.00035119266675600615, 'samples': 10822848, 'steps': 56368, 'loss/train': 1.5881069898605347} 11/07/2021 05:08:48 - INFO - __main__ - Step 56370: {'lr': 0.00035118781414941296, 'samples': 10823040, 'steps': 56369, 'loss/train': 1.1666054725646973} 11/07/2021 05:08:48 - INFO - __main__ - Step 56371: {'lr': 0.00035118296149722614, 'samples': 10823232, 'steps': 56370, 'loss/train': 1.1436271667480469} 11/07/2021 05:08:49 - INFO - __main__ - Step 56372: {'lr': 0.0003511781087994478, 'samples': 10823424, 'steps': 56371, 'loss/train': 1.313245415687561} 11/07/2021 05:08:49 - INFO - __main__ - Step 56373: {'lr': 0.00035117325605608013, 'samples': 10823616, 'steps': 56372, 'loss/train': 1.2382103204727173} 11/07/2021 05:08:49 - INFO - __main__ - Step 56374: {'lr': 0.0003511684032671254, 'samples': 10823808, 'steps': 56373, 'loss/train': 1.6699844598770142} 11/07/2021 05:08:51 - INFO - __main__ - Step 56375: {'lr': 0.0003511635504325857, 'samples': 10824000, 'steps': 56374, 'loss/train': 1.621235966682434} 11/07/2021 05:08:51 - INFO - __main__ - Step 56376: {'lr': 0.0003511586975524634, 'samples': 10824192, 'steps': 56375, 'loss/train': 1.34029221534729} 11/07/2021 05:08:52 - INFO - __main__ - Step 56377: {'lr': 0.0003511538446267604, 'samples': 10824384, 'steps': 56376, 'loss/train': 5.697155952453613} 11/07/2021 05:08:52 - INFO - __main__ - Step 56378: {'lr': 0.00035114899165547916, 'samples': 10824576, 'steps': 56377, 'loss/train': 1.3295198678970337} 11/07/2021 05:08:52 - INFO - __main__ - Step 56379: {'lr': 0.00035114413863862164, 'samples': 10824768, 'steps': 56378, 'loss/train': 1.9706774950027466} 11/07/2021 05:08:53 - INFO - __main__ - Step 56380: {'lr': 0.0003511392855761902, 'samples': 10824960, 'steps': 56379, 'loss/train': 5.348339557647705} 11/07/2021 05:08:54 - INFO - __main__ - Step 56381: {'lr': 0.0003511344324681869, 'samples': 10825152, 'steps': 56380, 'loss/train': 5.333381175994873} 11/07/2021 05:08:54 - INFO - __main__ - Step 56382: {'lr': 0.00035112957931461407, 'samples': 10825344, 'steps': 56381, 'loss/train': 1.400259256362915} 11/07/2021 05:08:54 - INFO - __main__ - Step 56383: {'lr': 0.00035112472611547376, 'samples': 10825536, 'steps': 56382, 'loss/train': 1.722835659980774} 11/07/2021 05:08:55 - INFO - __main__ - Step 56384: {'lr': 0.0003511198728707682, 'samples': 10825728, 'steps': 56383, 'loss/train': 1.4241292476654053} 11/07/2021 05:08:55 - INFO - __main__ - Step 56385: {'lr': 0.0003511150195804996, 'samples': 10825920, 'steps': 56384, 'loss/train': 1.08267343044281} 11/07/2021 05:08:55 - INFO - __main__ - Step 56386: {'lr': 0.00035111016624467007, 'samples': 10826112, 'steps': 56385, 'loss/train': 1.0685404539108276} 11/07/2021 05:08:56 - INFO - __main__ - Step 56387: {'lr': 0.00035110531286328193, 'samples': 10826304, 'steps': 56386, 'loss/train': 1.584810733795166} 11/07/2021 05:08:57 - INFO - __main__ - Step 56388: {'lr': 0.0003511004594363373, 'samples': 10826496, 'steps': 56387, 'loss/train': 1.4250186681747437} 11/07/2021 05:08:57 - INFO - __main__ - Step 56389: {'lr': 0.0003510956059638384, 'samples': 10826688, 'steps': 56388, 'loss/train': 1.1023499965667725} 11/07/2021 05:08:57 - INFO - __main__ - Step 56390: {'lr': 0.0003510907524457873, 'samples': 10826880, 'steps': 56389, 'loss/train': 1.2967067956924438} 11/07/2021 05:08:58 - INFO - __main__ - Step 56391: {'lr': 0.0003510858988821863, 'samples': 10827072, 'steps': 56390, 'loss/train': 1.3536230325698853} 11/07/2021 05:08:59 - INFO - __main__ - Step 56392: {'lr': 0.00035108104527303754, 'samples': 10827264, 'steps': 56391, 'loss/train': 1.3348649740219116} 11/07/2021 05:08:59 - INFO - __main__ - Step 56393: {'lr': 0.0003510761916183432, 'samples': 10827456, 'steps': 56392, 'loss/train': 1.6952288150787354} 11/07/2021 05:09:00 - INFO - __main__ - Step 56394: {'lr': 0.00035107133791810555, 'samples': 10827648, 'steps': 56393, 'loss/train': 1.2137962579727173} 11/07/2021 05:09:00 - INFO - __main__ - Step 56395: {'lr': 0.00035106648417232666, 'samples': 10827840, 'steps': 56394, 'loss/train': 0.9549675583839417} 11/07/2021 05:09:00 - INFO - __main__ - Step 56396: {'lr': 0.0003510616303810088, 'samples': 10828032, 'steps': 56395, 'loss/train': 1.2758619785308838} 11/07/2021 05:09:01 - INFO - __main__ - Step 56397: {'lr': 0.00035105677654415416, 'samples': 10828224, 'steps': 56396, 'loss/train': 0.17087025940418243} 11/07/2021 05:09:02 - INFO - __main__ - Step 56398: {'lr': 0.0003510519226617648, 'samples': 10828416, 'steps': 56397, 'loss/train': 1.5702942609786987} 11/07/2021 05:09:02 - INFO - __main__ - Step 56399: {'lr': 0.00035104706873384305, 'samples': 10828608, 'steps': 56398, 'loss/train': 0.874310314655304} 11/07/2021 05:09:02 - INFO - __main__ - Step 56400: {'lr': 0.0003510422147603911, 'samples': 10828800, 'steps': 56399, 'loss/train': 1.2134251594543457} 11/07/2021 05:09:03 - INFO - __main__ - Step 56401: {'lr': 0.00035103736074141103, 'samples': 10828992, 'steps': 56400, 'loss/train': 1.54283607006073} 11/07/2021 05:09:04 - INFO - __main__ - Step 56402: {'lr': 0.0003510325066769051, 'samples': 10829184, 'steps': 56401, 'loss/train': 1.7283002138137817} 11/07/2021 05:09:04 - INFO - __main__ - Step 56403: {'lr': 0.00035102765256687555, 'samples': 10829376, 'steps': 56402, 'loss/train': 1.5847009420394897} 11/07/2021 05:09:04 - INFO - __main__ - Step 56404: {'lr': 0.0003510227984113244, 'samples': 10829568, 'steps': 56403, 'loss/train': 1.4769731760025024} 11/07/2021 05:09:05 - INFO - __main__ - Step 56405: {'lr': 0.00035101794421025395, 'samples': 10829760, 'steps': 56404, 'loss/train': 0.45974522829055786} 11/07/2021 05:09:05 - INFO - __main__ - Step 56406: {'lr': 0.00035101308996366635, 'samples': 10829952, 'steps': 56405, 'loss/train': 1.4863005876541138} 11/07/2021 05:09:06 - INFO - __main__ - Step 56407: {'lr': 0.00035100823567156385, 'samples': 10830144, 'steps': 56406, 'loss/train': 1.0604636669158936} 11/07/2021 05:09:07 - INFO - __main__ - Step 56408: {'lr': 0.0003510033813339486, 'samples': 10830336, 'steps': 56407, 'loss/train': 1.907752513885498} 11/07/2021 05:09:07 - INFO - __main__ - Step 56409: {'lr': 0.00035099852695082286, 'samples': 10830528, 'steps': 56408, 'loss/train': 1.43251633644104} 11/07/2021 05:09:07 - INFO - __main__ - Step 56410: {'lr': 0.0003509936725221886, 'samples': 10830720, 'steps': 56409, 'loss/train': 1.3085565567016602} 11/07/2021 05:09:08 - INFO - __main__ - Step 56411: {'lr': 0.0003509888180480483, 'samples': 10830912, 'steps': 56410, 'loss/train': 1.3520587682724} 11/07/2021 05:09:09 - INFO - __main__ - Step 56412: {'lr': 0.00035098396352840384, 'samples': 10831104, 'steps': 56411, 'loss/train': 1.17833411693573} 11/07/2021 05:09:09 - INFO - __main__ - Step 56413: {'lr': 0.00035097910896325765, 'samples': 10831296, 'steps': 56412, 'loss/train': 1.0977505445480347} 11/07/2021 05:09:09 - INFO - __main__ - Step 56414: {'lr': 0.0003509742543526118, 'samples': 10831488, 'steps': 56413, 'loss/train': 1.566825032234192} 11/07/2021 05:09:10 - INFO - __main__ - Step 56415: {'lr': 0.00035096939969646854, 'samples': 10831680, 'steps': 56414, 'loss/train': 1.2906500101089478} 11/07/2021 05:09:10 - INFO - __main__ - Step 56416: {'lr': 0.00035096454499483, 'samples': 10831872, 'steps': 56415, 'loss/train': 1.7458585500717163} 11/07/2021 05:09:11 - INFO - __main__ - Step 56417: {'lr': 0.0003509596902476985, 'samples': 10832064, 'steps': 56416, 'loss/train': 1.7231554985046387} 11/07/2021 05:09:11 - INFO - __main__ - Step 56418: {'lr': 0.000350954835455076, 'samples': 10832256, 'steps': 56417, 'loss/train': 1.106741189956665} 11/07/2021 05:09:12 - INFO - __main__ - Step 56419: {'lr': 0.00035094998061696483, 'samples': 10832448, 'steps': 56418, 'loss/train': 1.2603288888931274} 11/07/2021 05:09:12 - INFO - __main__ - Step 56420: {'lr': 0.0003509451257333671, 'samples': 10832640, 'steps': 56419, 'loss/train': 1.025185465812683} 11/07/2021 05:09:13 - INFO - __main__ - Step 56421: {'lr': 0.00035094027080428514, 'samples': 10832832, 'steps': 56420, 'loss/train': 1.6788630485534668} 11/07/2021 05:09:13 - INFO - __main__ - Step 56422: {'lr': 0.00035093541582972105, 'samples': 10833024, 'steps': 56421, 'loss/train': 1.6398028135299683} 11/07/2021 05:09:14 - INFO - __main__ - Step 56423: {'lr': 0.000350930560809677, 'samples': 10833216, 'steps': 56422, 'loss/train': 1.3926358222961426} 11/07/2021 05:09:14 - INFO - __main__ - Step 56424: {'lr': 0.0003509257057441552, 'samples': 10833408, 'steps': 56423, 'loss/train': 2.116044759750366} 11/07/2021 05:09:14 - INFO - __main__ - Step 56425: {'lr': 0.00035092085063315783, 'samples': 10833600, 'steps': 56424, 'loss/train': 1.327752709388733} 11/07/2021 05:09:15 - INFO - __main__ - Step 56426: {'lr': 0.00035091599547668707, 'samples': 10833792, 'steps': 56425, 'loss/train': 1.8574451208114624} 11/07/2021 05:09:16 - INFO - __main__ - Step 56427: {'lr': 0.00035091114027474514, 'samples': 10833984, 'steps': 56426, 'loss/train': 1.6053627729415894} 11/07/2021 05:09:16 - INFO - __main__ - Step 56428: {'lr': 0.0003509062850273342, 'samples': 10834176, 'steps': 56427, 'loss/train': 1.2255128622055054} 11/07/2021 05:09:16 - INFO - __main__ - Step 56429: {'lr': 0.0003509014297344565, 'samples': 10834368, 'steps': 56428, 'loss/train': 1.2436903715133667} 11/07/2021 05:09:17 - INFO - __main__ - Step 56430: {'lr': 0.0003508965743961141, 'samples': 10834560, 'steps': 56429, 'loss/train': 1.6321382522583008} 11/07/2021 05:09:17 - INFO - __main__ - Step 56431: {'lr': 0.00035089171901230926, 'samples': 10834752, 'steps': 56430, 'loss/train': 1.8084721565246582} 11/07/2021 05:09:18 - INFO - __main__ - Step 56432: {'lr': 0.0003508868635830442, 'samples': 10834944, 'steps': 56431, 'loss/train': 1.4079978466033936} 11/07/2021 05:09:19 - INFO - __main__ - Step 56433: {'lr': 0.00035088200810832104, 'samples': 10835136, 'steps': 56432, 'loss/train': 1.562827467918396} 11/07/2021 05:09:19 - INFO - __main__ - Step 56434: {'lr': 0.00035087715258814203, 'samples': 10835328, 'steps': 56433, 'loss/train': 1.4147230386734009} 11/07/2021 05:09:19 - INFO - __main__ - Step 56435: {'lr': 0.00035087229702250936, 'samples': 10835520, 'steps': 56434, 'loss/train': 1.3920665979385376} 11/07/2021 05:09:20 - INFO - __main__ - Step 56436: {'lr': 0.00035086744141142514, 'samples': 10835712, 'steps': 56435, 'loss/train': 1.5755733251571655} 11/07/2021 05:09:21 - INFO - __main__ - Step 56437: {'lr': 0.0003508625857548916, 'samples': 10835904, 'steps': 56436, 'loss/train': 1.351709246635437} 11/07/2021 05:09:21 - INFO - __main__ - Step 56438: {'lr': 0.000350857730052911, 'samples': 10836096, 'steps': 56437, 'loss/train': 1.2981946468353271} 11/07/2021 05:09:21 - INFO - __main__ - Step 56439: {'lr': 0.0003508528743054854, 'samples': 10836288, 'steps': 56438, 'loss/train': 1.1369142532348633} 11/07/2021 05:09:22 - INFO - __main__ - Step 56440: {'lr': 0.00035084801851261707, 'samples': 10836480, 'steps': 56439, 'loss/train': 0.850168764591217} 11/07/2021 05:09:22 - INFO - __main__ - Step 56441: {'lr': 0.00035084316267430815, 'samples': 10836672, 'steps': 56440, 'loss/train': 1.2398172616958618} 11/07/2021 05:09:23 - INFO - __main__ - Step 56442: {'lr': 0.0003508383067905609, 'samples': 10836864, 'steps': 56441, 'loss/train': 1.5632760524749756} 11/07/2021 05:09:24 - INFO - __main__ - Step 56443: {'lr': 0.0003508334508613775, 'samples': 10837056, 'steps': 56442, 'loss/train': 1.3541842699050903} 11/07/2021 05:09:24 - INFO - __main__ - Step 56444: {'lr': 0.00035082859488676005, 'samples': 10837248, 'steps': 56443, 'loss/train': 1.5581953525543213} 11/07/2021 05:09:24 - INFO - __main__ - Step 56445: {'lr': 0.0003508237388667108, 'samples': 10837440, 'steps': 56444, 'loss/train': 1.1634643077850342} 11/07/2021 05:09:25 - INFO - __main__ - Step 56446: {'lr': 0.00035081888280123194, 'samples': 10837632, 'steps': 56445, 'loss/train': 1.3987843990325928} 11/07/2021 05:09:25 - INFO - __main__ - Step 56447: {'lr': 0.0003508140266903256, 'samples': 10837824, 'steps': 56446, 'loss/train': 1.2198574542999268} 11/07/2021 05:09:26 - INFO - __main__ - Step 56448: {'lr': 0.0003508091705339941, 'samples': 10838016, 'steps': 56447, 'loss/train': 1.7012450695037842} 11/07/2021 05:09:27 - INFO - __main__ - Step 56449: {'lr': 0.00035080431433223946, 'samples': 10838208, 'steps': 56448, 'loss/train': 1.41193425655365} 11/07/2021 05:09:27 - INFO - __main__ - Step 56450: {'lr': 0.000350799458085064, 'samples': 10838400, 'steps': 56449, 'loss/train': 0.7745003700256348} 11/07/2021 05:09:27 - INFO - __main__ - Step 56451: {'lr': 0.00035079460179246984, 'samples': 10838592, 'steps': 56450, 'loss/train': 1.8149513006210327} 11/07/2021 05:09:28 - INFO - __main__ - Step 56452: {'lr': 0.0003507897454544592, 'samples': 10838784, 'steps': 56451, 'loss/train': 1.5369768142700195} 11/07/2021 05:09:29 - INFO - __main__ - Step 56453: {'lr': 0.0003507848890710342, 'samples': 10838976, 'steps': 56452, 'loss/train': 1.4429296255111694} 11/07/2021 05:09:29 - INFO - __main__ - Step 56454: {'lr': 0.00035078003264219713, 'samples': 10839168, 'steps': 56453, 'loss/train': 1.3574390411376953} 11/07/2021 05:09:29 - INFO - __main__ - Step 56455: {'lr': 0.0003507751761679502, 'samples': 10839360, 'steps': 56454, 'loss/train': 1.4801967144012451} 11/07/2021 05:09:30 - INFO - __main__ - Step 56456: {'lr': 0.0003507703196482955, 'samples': 10839552, 'steps': 56455, 'loss/train': 1.9251022338867188} 11/07/2021 05:09:30 - INFO - __main__ - Step 56457: {'lr': 0.0003507654630832352, 'samples': 10839744, 'steps': 56456, 'loss/train': 1.4896724224090576} 11/07/2021 05:09:31 - INFO - __main__ - Step 56458: {'lr': 0.0003507606064727715, 'samples': 10839936, 'steps': 56457, 'loss/train': 1.2481746673583984} 11/07/2021 05:09:31 - INFO - __main__ - Step 56459: {'lr': 0.0003507557498169067, 'samples': 10840128, 'steps': 56458, 'loss/train': 1.343714952468872} 11/07/2021 05:09:32 - INFO - __main__ - Step 56460: {'lr': 0.0003507508931156429, 'samples': 10840320, 'steps': 56459, 'loss/train': 1.4095211029052734} 11/07/2021 05:09:32 - INFO - __main__ - Step 56461: {'lr': 0.0003507460363689823, 'samples': 10840512, 'steps': 56460, 'loss/train': 1.118160367012024} 11/07/2021 05:09:33 - INFO - __main__ - Step 56462: {'lr': 0.00035074117957692707, 'samples': 10840704, 'steps': 56461, 'loss/train': 1.6819826364517212} 11/07/2021 05:09:33 - INFO - __main__ - Step 56463: {'lr': 0.0003507363227394795, 'samples': 10840896, 'steps': 56462, 'loss/train': 0.5558393597602844} 11/07/2021 05:09:34 - INFO - __main__ - Step 56464: {'lr': 0.00035073146585664163, 'samples': 10841088, 'steps': 56463, 'loss/train': 1.30038583278656} 11/07/2021 05:09:34 - INFO - __main__ - Step 56465: {'lr': 0.00035072660892841566, 'samples': 10841280, 'steps': 56464, 'loss/train': 1.0113273859024048} 11/07/2021 05:09:35 - INFO - __main__ - Step 56466: {'lr': 0.0003507217519548039, 'samples': 10841472, 'steps': 56465, 'loss/train': 1.5108157396316528} 11/07/2021 05:09:35 - INFO - __main__ - Step 56467: {'lr': 0.00035071689493580845, 'samples': 10841664, 'steps': 56466, 'loss/train': 1.5632741451263428} 11/07/2021 05:09:35 - INFO - __main__ - Step 56468: {'lr': 0.0003507120378714315, 'samples': 10841856, 'steps': 56467, 'loss/train': 1.402575135231018} 11/07/2021 05:09:36 - INFO - __main__ - Step 56469: {'lr': 0.0003507071807616753, 'samples': 10842048, 'steps': 56468, 'loss/train': 1.387196660041809} 11/07/2021 05:09:37 - INFO - __main__ - Step 56470: {'lr': 0.0003507023236065421, 'samples': 10842240, 'steps': 56469, 'loss/train': 1.0838427543640137} 11/07/2021 05:09:37 - INFO - __main__ - Step 56471: {'lr': 0.0003506974664060338, 'samples': 10842432, 'steps': 56470, 'loss/train': 1.2613595724105835} 11/07/2021 05:09:37 - INFO - __main__ - Step 56472: {'lr': 0.00035069260916015287, 'samples': 10842624, 'steps': 56471, 'loss/train': 2.4340803623199463} 11/07/2021 05:09:38 - INFO - __main__ - Step 56473: {'lr': 0.0003506877518689014, 'samples': 10842816, 'steps': 56472, 'loss/train': 1.6305314302444458} 11/07/2021 05:09:39 - INFO - __main__ - Step 56474: {'lr': 0.0003506828945322816, 'samples': 10843008, 'steps': 56473, 'loss/train': 0.9941365718841553} 11/07/2021 05:09:39 - INFO - __main__ - Step 56475: {'lr': 0.0003506780371502956, 'samples': 10843200, 'steps': 56474, 'loss/train': 1.4905797243118286} 11/07/2021 05:09:40 - INFO - __main__ - Step 56476: {'lr': 0.00035067317972294564, 'samples': 10843392, 'steps': 56475, 'loss/train': 1.0587031841278076} 11/07/2021 05:09:40 - INFO - __main__ - Step 56477: {'lr': 0.00035066832225023393, 'samples': 10843584, 'steps': 56476, 'loss/train': 1.3000357151031494} 11/07/2021 05:09:40 - INFO - __main__ - Step 56478: {'lr': 0.0003506634647321626, 'samples': 10843776, 'steps': 56477, 'loss/train': 1.6559810638427734} 11/07/2021 05:09:41 - INFO - __main__ - Step 56479: {'lr': 0.0003506586071687338, 'samples': 10843968, 'steps': 56478, 'loss/train': 1.014644742012024} 11/07/2021 05:09:42 - INFO - __main__ - Step 56480: {'lr': 0.0003506537495599499, 'samples': 10844160, 'steps': 56479, 'loss/train': 0.08396033942699432} 11/07/2021 05:09:42 - INFO - __main__ - Step 56481: {'lr': 0.0003506488919058129, 'samples': 10844352, 'steps': 56480, 'loss/train': 1.287361979484558} 11/07/2021 05:09:42 - INFO - __main__ - Step 56482: {'lr': 0.00035064403420632505, 'samples': 10844544, 'steps': 56481, 'loss/train': 1.5210411548614502} 11/07/2021 05:09:43 - INFO - __main__ - Step 56483: {'lr': 0.0003506391764614887, 'samples': 10844736, 'steps': 56482, 'loss/train': 1.29570734500885} 11/07/2021 05:09:44 - INFO - __main__ - Step 56484: {'lr': 0.00035063431867130576, 'samples': 10844928, 'steps': 56483, 'loss/train': 1.594150424003601} 11/07/2021 05:09:44 - INFO - __main__ - Step 56485: {'lr': 0.00035062946083577853, 'samples': 10845120, 'steps': 56484, 'loss/train': 1.563456654548645} 11/07/2021 05:09:45 - INFO - __main__ - Step 56486: {'lr': 0.00035062460295490926, 'samples': 10845312, 'steps': 56485, 'loss/train': 1.0996873378753662} 11/07/2021 05:09:45 - INFO - __main__ - Step 56487: {'lr': 0.00035061974502870007, 'samples': 10845504, 'steps': 56486, 'loss/train': 1.7642879486083984} 11/07/2021 05:09:45 - INFO - __main__ - Step 56488: {'lr': 0.0003506148870571533, 'samples': 10845696, 'steps': 56487, 'loss/train': 1.9007972478866577} 11/07/2021 05:09:46 - INFO - __main__ - Step 56489: {'lr': 0.00035061002904027084, 'samples': 10845888, 'steps': 56488, 'loss/train': 1.7326982021331787} 11/07/2021 05:09:47 - INFO - __main__ - Step 56490: {'lr': 0.0003506051709780551, 'samples': 10846080, 'steps': 56489, 'loss/train': 1.564028024673462} 11/07/2021 05:09:47 - INFO - __main__ - Step 56491: {'lr': 0.0003506003128705083, 'samples': 10846272, 'steps': 56490, 'loss/train': 0.7779012322425842} 11/07/2021 05:09:47 - INFO - __main__ - Step 56492: {'lr': 0.0003505954547176325, 'samples': 10846464, 'steps': 56491, 'loss/train': 1.4085956811904907} 11/07/2021 05:09:48 - INFO - __main__ - Step 56493: {'lr': 0.00035059059651942995, 'samples': 10846656, 'steps': 56492, 'loss/train': 1.0903558731079102} 11/07/2021 05:09:48 - INFO - __main__ - Step 56494: {'lr': 0.00035058573827590286, 'samples': 10846848, 'steps': 56493, 'loss/train': 1.7710188627243042} 11/07/2021 05:09:50 - INFO - __main__ - Step 56495: {'lr': 0.0003505808799870533, 'samples': 10847040, 'steps': 56494, 'loss/train': 0.8494094014167786} 11/07/2021 05:09:50 - INFO - __main__ - Step 56496: {'lr': 0.0003505760216528836, 'samples': 10847232, 'steps': 56495, 'loss/train': 1.1221890449523926} 11/07/2021 05:09:50 - INFO - __main__ - Step 56497: {'lr': 0.0003505711632733959, 'samples': 10847424, 'steps': 56496, 'loss/train': 1.116998553276062} 11/07/2021 05:09:51 - INFO - __main__ - Step 56498: {'lr': 0.00035056630484859235, 'samples': 10847616, 'steps': 56497, 'loss/train': 1.4646931886672974} 11/07/2021 05:09:51 - INFO - __main__ - Step 56499: {'lr': 0.00035056144637847525, 'samples': 10847808, 'steps': 56498, 'loss/train': 0.979411780834198} 11/07/2021 05:09:51 - INFO - __main__ - Step 56500: {'lr': 0.0003505565878630467, 'samples': 10848000, 'steps': 56499, 'loss/train': 1.0676548480987549} 11/07/2021 05:09:52 - INFO - __main__ - Step 56501: {'lr': 0.0003505517293023088, 'samples': 10848192, 'steps': 56500, 'loss/train': 0.17799589037895203} 11/07/2021 05:09:53 - INFO - __main__ - Step 56502: {'lr': 0.0003505468706962639, 'samples': 10848384, 'steps': 56501, 'loss/train': 0.748681902885437} 11/07/2021 05:09:53 - INFO - __main__ - Step 56503: {'lr': 0.00035054201204491413, 'samples': 10848576, 'steps': 56502, 'loss/train': 1.7646546363830566} 11/07/2021 05:09:53 - INFO - __main__ - Step 56504: {'lr': 0.00035053715334826176, 'samples': 10848768, 'steps': 56503, 'loss/train': 1.495108723640442} 11/07/2021 05:09:54 - INFO - __main__ - Step 56505: {'lr': 0.0003505322946063089, 'samples': 10848960, 'steps': 56504, 'loss/train': 2.6900439262390137} 11/07/2021 05:09:55 - INFO - __main__ - Step 56506: {'lr': 0.0003505274358190576, 'samples': 10849152, 'steps': 56505, 'loss/train': 1.8099464178085327} 11/07/2021 05:09:55 - INFO - __main__ - Step 56507: {'lr': 0.00035052257698651025, 'samples': 10849344, 'steps': 56506, 'loss/train': 1.2279366254806519} 11/07/2021 05:09:56 - INFO - __main__ - Step 56508: {'lr': 0.000350517718108669, 'samples': 10849536, 'steps': 56507, 'loss/train': 1.1620947122573853} 11/07/2021 05:09:56 - INFO - __main__ - Step 56509: {'lr': 0.000350512859185536, 'samples': 10849728, 'steps': 56508, 'loss/train': 1.5747660398483276} 11/07/2021 05:09:56 - INFO - __main__ - Step 56510: {'lr': 0.00035050800021711346, 'samples': 10849920, 'steps': 56509, 'loss/train': 0.7256755828857422} 11/07/2021 05:09:57 - INFO - __main__ - Step 56511: {'lr': 0.00035050314120340357, 'samples': 10850112, 'steps': 56510, 'loss/train': 1.4492433071136475} 11/07/2021 05:09:58 - INFO - __main__ - Step 56512: {'lr': 0.00035049828214440856, 'samples': 10850304, 'steps': 56511, 'loss/train': 2.0569992065429688} 11/07/2021 05:09:58 - INFO - __main__ - Step 56513: {'lr': 0.00035049342304013055, 'samples': 10850496, 'steps': 56512, 'loss/train': 1.1229181289672852} 11/07/2021 05:09:58 - INFO - __main__ - Step 56514: {'lr': 0.0003504885638905717, 'samples': 10850688, 'steps': 56513, 'loss/train': 1.1955887079238892} 11/07/2021 05:09:59 - INFO - __main__ - Step 56515: {'lr': 0.0003504837046957343, 'samples': 10850880, 'steps': 56514, 'loss/train': 1.7722307443618774} 11/07/2021 05:10:00 - INFO - __main__ - Step 56516: {'lr': 0.0003504788454556205, 'samples': 10851072, 'steps': 56515, 'loss/train': 1.2696478366851807} 11/07/2021 05:10:00 - INFO - __main__ - Step 56517: {'lr': 0.00035047398617023246, 'samples': 10851264, 'steps': 56516, 'loss/train': 1.5104753971099854} 11/07/2021 05:10:00 - INFO - __main__ - Step 56518: {'lr': 0.0003504691268395724, 'samples': 10851456, 'steps': 56517, 'loss/train': 1.338602066040039} 11/07/2021 05:10:01 - INFO - __main__ - Step 56519: {'lr': 0.00035046426746364247, 'samples': 10851648, 'steps': 56518, 'loss/train': 1.8242599964141846} 11/07/2021 05:10:01 - INFO - __main__ - Step 56520: {'lr': 0.0003504594080424449, 'samples': 10851840, 'steps': 56519, 'loss/train': 1.4481475353240967} 11/07/2021 05:10:01 - INFO - __main__ - Step 56521: {'lr': 0.00035045454857598194, 'samples': 10852032, 'steps': 56520, 'loss/train': 1.4232033491134644} 11/07/2021 05:10:03 - INFO - __main__ - Step 56522: {'lr': 0.0003504496890642556, 'samples': 10852224, 'steps': 56521, 'loss/train': 2.073962688446045} 11/07/2021 05:10:03 - INFO - __main__ - Step 56523: {'lr': 0.0003504448295072683, 'samples': 10852416, 'steps': 56522, 'loss/train': 1.0623143911361694} 11/07/2021 05:10:03 - INFO - __main__ - Step 56524: {'lr': 0.00035043996990502204, 'samples': 10852608, 'steps': 56523, 'loss/train': 2.2309913635253906} 11/07/2021 05:10:04 - INFO - __main__ - Step 56525: {'lr': 0.00035043511025751906, 'samples': 10852800, 'steps': 56524, 'loss/train': 1.4322583675384521} 11/07/2021 05:10:04 - INFO - __main__ - Step 56526: {'lr': 0.00035043025056476164, 'samples': 10852992, 'steps': 56525, 'loss/train': 1.4538013935089111} 11/07/2021 05:10:05 - INFO - __main__ - Step 56527: {'lr': 0.00035042539082675184, 'samples': 10853184, 'steps': 56526, 'loss/train': 0.9756379723548889} 11/07/2021 05:10:05 - INFO - __main__ - Step 56528: {'lr': 0.00035042053104349195, 'samples': 10853376, 'steps': 56527, 'loss/train': 0.7582591772079468} 11/07/2021 05:10:06 - INFO - __main__ - Step 56529: {'lr': 0.00035041567121498406, 'samples': 10853568, 'steps': 56528, 'loss/train': 1.4369869232177734} 11/07/2021 05:10:06 - INFO - __main__ - Step 56530: {'lr': 0.0003504108113412305, 'samples': 10853760, 'steps': 56529, 'loss/train': 1.3216538429260254} 11/07/2021 05:10:06 - INFO - __main__ - Step 56531: {'lr': 0.0003504059514222333, 'samples': 10853952, 'steps': 56530, 'loss/train': 1.1601468324661255} 11/07/2021 05:10:07 - INFO - __main__ - Step 56532: {'lr': 0.00035040109145799474, 'samples': 10854144, 'steps': 56531, 'loss/train': 1.9203786849975586} 11/07/2021 05:10:08 - INFO - __main__ - Step 56533: {'lr': 0.0003503962314485171, 'samples': 10854336, 'steps': 56532, 'loss/train': 1.3713005781173706} 11/07/2021 05:10:08 - INFO - __main__ - Step 56534: {'lr': 0.00035039137139380235, 'samples': 10854528, 'steps': 56533, 'loss/train': 1.756475567817688} 11/07/2021 05:10:08 - INFO - __main__ - Step 56535: {'lr': 0.0003503865112938528, 'samples': 10854720, 'steps': 56534, 'loss/train': 1.4186993837356567} 11/07/2021 05:10:09 - INFO - __main__ - Step 56536: {'lr': 0.00035038165114867066, 'samples': 10854912, 'steps': 56535, 'loss/train': 0.7980080246925354} 11/07/2021 05:10:10 - INFO - __main__ - Step 56537: {'lr': 0.00035037679095825815, 'samples': 10855104, 'steps': 56536, 'loss/train': 1.6508228778839111} 11/07/2021 05:10:10 - INFO - __main__ - Step 56538: {'lr': 0.00035037193072261734, 'samples': 10855296, 'steps': 56537, 'loss/train': 0.7000250816345215} 11/07/2021 05:10:11 - INFO - __main__ - Step 56539: {'lr': 0.00035036707044175055, 'samples': 10855488, 'steps': 56538, 'loss/train': 1.0380233526229858} 11/07/2021 05:10:11 - INFO - __main__ - Step 56540: {'lr': 0.00035036221011565985, 'samples': 10855680, 'steps': 56539, 'loss/train': 1.4951800107955933} 11/07/2021 05:10:11 - INFO - __main__ - Step 56541: {'lr': 0.00035035734974434745, 'samples': 10855872, 'steps': 56540, 'loss/train': 1.3978549242019653} 11/07/2021 05:10:12 - INFO - __main__ - Step 56542: {'lr': 0.00035035248932781564, 'samples': 10856064, 'steps': 56541, 'loss/train': 1.6059826612472534} 11/07/2021 05:10:13 - INFO - __main__ - Step 56543: {'lr': 0.0003503476288660665, 'samples': 10856256, 'steps': 56542, 'loss/train': 1.4883650541305542} 11/07/2021 05:10:13 - INFO - __main__ - Step 56544: {'lr': 0.0003503427683591024, 'samples': 10856448, 'steps': 56543, 'loss/train': 1.5489416122436523} 11/07/2021 05:10:13 - INFO - __main__ - Step 56545: {'lr': 0.00035033790780692527, 'samples': 10856640, 'steps': 56544, 'loss/train': 0.9137527346611023} 11/07/2021 05:10:14 - INFO - __main__ - Step 56546: {'lr': 0.0003503330472095375, 'samples': 10856832, 'steps': 56545, 'loss/train': 1.1663609743118286} 11/07/2021 05:10:15 - INFO - __main__ - Step 56547: {'lr': 0.0003503281865669411, 'samples': 10857024, 'steps': 56546, 'loss/train': 1.150452971458435} 11/07/2021 05:10:15 - INFO - __main__ - Step 56548: {'lr': 0.00035032332587913844, 'samples': 10857216, 'steps': 56547, 'loss/train': 1.3628579378128052} 11/07/2021 05:10:16 - INFO - __main__ - Step 56549: {'lr': 0.00035031846514613164, 'samples': 10857408, 'steps': 56548, 'loss/train': 0.8619349598884583} 11/07/2021 05:10:16 - INFO - __main__ - Step 56550: {'lr': 0.00035031360436792294, 'samples': 10857600, 'steps': 56549, 'loss/train': 1.1280677318572998} 11/07/2021 05:10:16 - INFO - __main__ - Step 56551: {'lr': 0.00035030874354451434, 'samples': 10857792, 'steps': 56550, 'loss/train': 1.2040126323699951} 11/07/2021 05:10:17 - INFO - __main__ - Step 56552: {'lr': 0.0003503038826759083, 'samples': 10857984, 'steps': 56551, 'loss/train': 1.8396841287612915} 11/07/2021 05:10:18 - INFO - __main__ - Step 56553: {'lr': 0.00035029902176210675, 'samples': 10858176, 'steps': 56552, 'loss/train': 1.0404289960861206} 11/07/2021 05:10:18 - INFO - __main__ - Step 56554: {'lr': 0.0003502941608031121, 'samples': 10858368, 'steps': 56553, 'loss/train': 1.653855562210083} 11/07/2021 05:10:19 - INFO - __main__ - Step 56555: {'lr': 0.00035028929979892645, 'samples': 10858560, 'steps': 56554, 'loss/train': 1.0883740186691284} 11/07/2021 05:10:19 - INFO - __main__ - Step 56556: {'lr': 0.00035028443874955196, 'samples': 10858752, 'steps': 56555, 'loss/train': 1.2312816381454468} 11/07/2021 05:10:19 - INFO - __main__ - Step 56557: {'lr': 0.00035027957765499084, 'samples': 10858944, 'steps': 56556, 'loss/train': 1.2569242715835571} 11/07/2021 05:10:20 - INFO - __main__ - Step 56558: {'lr': 0.00035027471651524533, 'samples': 10859136, 'steps': 56557, 'loss/train': 1.6510467529296875} 11/07/2021 05:10:21 - INFO - __main__ - Step 56559: {'lr': 0.00035026985533031754, 'samples': 10859328, 'steps': 56558, 'loss/train': 1.7253813743591309} 11/07/2021 05:10:21 - INFO - __main__ - Step 56560: {'lr': 0.00035026499410020974, 'samples': 10859520, 'steps': 56559, 'loss/train': 1.6616827249526978} 11/07/2021 05:10:21 - INFO - __main__ - Step 56561: {'lr': 0.00035026013282492404, 'samples': 10859712, 'steps': 56560, 'loss/train': 1.174570918083191} 11/07/2021 05:10:22 - INFO - __main__ - Step 56562: {'lr': 0.0003502552715044627, 'samples': 10859904, 'steps': 56561, 'loss/train': 1.5358526706695557} 11/07/2021 05:10:23 - INFO - __main__ - Step 56563: {'lr': 0.0003502504101388279, 'samples': 10860096, 'steps': 56562, 'loss/train': 1.172845482826233} 11/07/2021 05:10:23 - INFO - __main__ - Step 56564: {'lr': 0.0003502455487280218, 'samples': 10860288, 'steps': 56563, 'loss/train': 1.251356601715088} 11/07/2021 05:10:23 - INFO - __main__ - Step 56565: {'lr': 0.00035024068727204655, 'samples': 10860480, 'steps': 56564, 'loss/train': 1.361531138420105} 11/07/2021 05:10:24 - INFO - __main__ - Step 56566: {'lr': 0.0003502358257709044, 'samples': 10860672, 'steps': 56565, 'loss/train': 1.2823978662490845} 11/07/2021 05:10:24 - INFO - __main__ - Step 56567: {'lr': 0.00035023096422459756, 'samples': 10860864, 'steps': 56566, 'loss/train': 1.017927885055542} 11/07/2021 05:10:25 - INFO - __main__ - Step 56568: {'lr': 0.0003502261026331282, 'samples': 10861056, 'steps': 56567, 'loss/train': 1.5585192441940308} 11/07/2021 05:10:26 - INFO - __main__ - Step 56569: {'lr': 0.0003502212409964985, 'samples': 10861248, 'steps': 56568, 'loss/train': 1.7004319429397583} 11/07/2021 05:10:26 - INFO - __main__ - Step 56570: {'lr': 0.00035021637931471075, 'samples': 10861440, 'steps': 56569, 'loss/train': 1.6190402507781982} 11/07/2021 05:10:26 - INFO - __main__ - Step 56571: {'lr': 0.00035021151758776693, 'samples': 10861632, 'steps': 56570, 'loss/train': 1.1929931640625} 11/07/2021 05:10:27 - INFO - __main__ - Step 56572: {'lr': 0.00035020665581566934, 'samples': 10861824, 'steps': 56571, 'loss/train': 1.413413405418396} 11/07/2021 05:10:28 - INFO - __main__ - Step 56573: {'lr': 0.0003502017939984202, 'samples': 10862016, 'steps': 56572, 'loss/train': 1.2944855690002441} 11/07/2021 05:10:28 - INFO - __main__ - Step 56574: {'lr': 0.0003501969321360217, 'samples': 10862208, 'steps': 56573, 'loss/train': 0.9877818822860718} 11/07/2021 05:10:28 - INFO - __main__ - Step 56575: {'lr': 0.00035019207022847596, 'samples': 10862400, 'steps': 56574, 'loss/train': 1.2913336753845215} 11/07/2021 05:10:29 - INFO - __main__ - Step 56576: {'lr': 0.0003501872082757852, 'samples': 10862592, 'steps': 56575, 'loss/train': 1.257947564125061} 11/07/2021 05:10:29 - INFO - __main__ - Step 56577: {'lr': 0.0003501823462779518, 'samples': 10862784, 'steps': 56576, 'loss/train': 1.541757345199585} 11/07/2021 05:10:29 - INFO - __main__ - Step 56578: {'lr': 0.00035017748423497766, 'samples': 10862976, 'steps': 56577, 'loss/train': 1.3554177284240723} 11/07/2021 05:10:31 - INFO - __main__ - Step 56579: {'lr': 0.00035017262214686505, 'samples': 10863168, 'steps': 56578, 'loss/train': 1.756501317024231} 11/07/2021 05:10:31 - INFO - __main__ - Step 56580: {'lr': 0.00035016776001361625, 'samples': 10863360, 'steps': 56579, 'loss/train': 1.479548692703247} 11/07/2021 05:10:31 - INFO - __main__ - Step 56581: {'lr': 0.00035016289783523335, 'samples': 10863552, 'steps': 56580, 'loss/train': 0.9331073760986328} 11/07/2021 05:10:32 - INFO - __main__ - Step 56582: {'lr': 0.00035015803561171864, 'samples': 10863744, 'steps': 56581, 'loss/train': 1.2737088203430176} 11/07/2021 05:10:32 - INFO - __main__ - Step 56583: {'lr': 0.0003501531733430743, 'samples': 10863936, 'steps': 56582, 'loss/train': 1.0593665838241577} 11/07/2021 05:10:33 - INFO - __main__ - Step 56584: {'lr': 0.00035014831102930246, 'samples': 10864128, 'steps': 56583, 'loss/train': 1.5288864374160767} 11/07/2021 05:10:33 - INFO - __main__ - Step 56585: {'lr': 0.0003501434486704053, 'samples': 10864320, 'steps': 56584, 'loss/train': 1.3636438846588135} 11/07/2021 05:10:34 - INFO - __main__ - Step 56586: {'lr': 0.0003501385862663851, 'samples': 10864512, 'steps': 56585, 'loss/train': 1.6007615327835083} 11/07/2021 05:10:34 - INFO - __main__ - Step 56587: {'lr': 0.00035013372381724397, 'samples': 10864704, 'steps': 56586, 'loss/train': 1.267218828201294} 11/07/2021 05:10:34 - INFO - __main__ - Step 56588: {'lr': 0.00035012886132298413, 'samples': 10864896, 'steps': 56587, 'loss/train': 1.598380208015442} 11/07/2021 05:10:35 - INFO - __main__ - Step 56589: {'lr': 0.0003501239987836078, 'samples': 10865088, 'steps': 56588, 'loss/train': 1.4377321004867554} 11/07/2021 05:10:36 - INFO - __main__ - Step 56590: {'lr': 0.00035011913619911706, 'samples': 10865280, 'steps': 56589, 'loss/train': 1.3788068294525146} 11/07/2021 05:10:36 - INFO - __main__ - Step 56591: {'lr': 0.0003501142735695143, 'samples': 10865472, 'steps': 56590, 'loss/train': 0.7732943296432495} 11/07/2021 05:10:36 - INFO - __main__ - Step 56592: {'lr': 0.0003501094108948015, 'samples': 10865664, 'steps': 56591, 'loss/train': 1.038304328918457} 11/07/2021 05:10:37 - INFO - __main__ - Step 56593: {'lr': 0.000350104548174981, 'samples': 10865856, 'steps': 56592, 'loss/train': 1.422461748123169} 11/07/2021 05:10:38 - INFO - __main__ - Step 56594: {'lr': 0.00035009968541005487, 'samples': 10866048, 'steps': 56593, 'loss/train': 1.973090410232544} 11/07/2021 05:10:38 - INFO - __main__ - Step 56595: {'lr': 0.00035009482260002544, 'samples': 10866240, 'steps': 56594, 'loss/train': 1.0853878259658813} 11/07/2021 05:10:38 - INFO - __main__ - Step 56596: {'lr': 0.00035008995974489477, 'samples': 10866432, 'steps': 56595, 'loss/train': 1.6085518598556519} 11/07/2021 05:10:39 - INFO - __main__ - Step 56597: {'lr': 0.0003500850968446652, 'samples': 10866624, 'steps': 56596, 'loss/train': 1.4332517385482788} 11/07/2021 05:10:39 - INFO - __main__ - Step 56598: {'lr': 0.00035008023389933876, 'samples': 10866816, 'steps': 56597, 'loss/train': 1.1358370780944824} 11/07/2021 05:10:40 - INFO - __main__ - Step 56599: {'lr': 0.00035007537090891766, 'samples': 10867008, 'steps': 56598, 'loss/train': 1.6242140531539917} 11/07/2021 05:10:40 - INFO - __main__ - Step 56600: {'lr': 0.0003500705078734042, 'samples': 10867200, 'steps': 56599, 'loss/train': 1.2923790216445923} 11/07/2021 05:10:41 - INFO - __main__ - Step 56601: {'lr': 0.0003500656447928005, 'samples': 10867392, 'steps': 56600, 'loss/train': 1.4352679252624512} 11/07/2021 05:10:41 - INFO - __main__ - Step 56602: {'lr': 0.00035006078166710877, 'samples': 10867584, 'steps': 56601, 'loss/train': 1.1847213506698608} 11/07/2021 05:10:41 - INFO - __main__ - Step 56603: {'lr': 0.00035005591849633123, 'samples': 10867776, 'steps': 56602, 'loss/train': 1.1905523538589478} 11/07/2021 05:10:42 - INFO - __main__ - Step 56604: {'lr': 0.00035005105528047, 'samples': 10867968, 'steps': 56603, 'loss/train': 1.476470708847046} 11/07/2021 05:10:43 - INFO - __main__ - Step 56605: {'lr': 0.00035004619201952736, 'samples': 10868160, 'steps': 56604, 'loss/train': 1.518057942390442} 11/07/2021 05:10:43 - INFO - __main__ - Step 56606: {'lr': 0.00035004132871350535, 'samples': 10868352, 'steps': 56605, 'loss/train': 1.1097615957260132} 11/07/2021 05:10:44 - INFO - __main__ - Step 56607: {'lr': 0.0003500364653624063, 'samples': 10868544, 'steps': 56606, 'loss/train': 2.138749122619629} 11/07/2021 05:10:44 - INFO - __main__ - Step 56608: {'lr': 0.0003500316019662324, 'samples': 10868736, 'steps': 56607, 'loss/train': 1.003953456878662} 11/07/2021 05:10:45 - INFO - __main__ - Step 56609: {'lr': 0.00035002673852498577, 'samples': 10868928, 'steps': 56608, 'loss/train': 1.2726805210113525} 11/07/2021 05:10:45 - INFO - __main__ - Step 56610: {'lr': 0.0003500218750386687, 'samples': 10869120, 'steps': 56609, 'loss/train': 1.379153847694397} 11/07/2021 05:10:46 - INFO - __main__ - Step 56611: {'lr': 0.0003500170115072833, 'samples': 10869312, 'steps': 56610, 'loss/train': 1.613093376159668} 11/07/2021 05:10:46 - INFO - __main__ - Step 56612: {'lr': 0.00035001214793083167, 'samples': 10869504, 'steps': 56611, 'loss/train': 0.7998729348182678} 11/07/2021 05:10:46 - INFO - __main__ - Step 56613: {'lr': 0.00035000728430931616, 'samples': 10869696, 'steps': 56612, 'loss/train': 1.0273258686065674} 11/07/2021 05:10:48 - INFO - __main__ - Step 56614: {'lr': 0.000350002420642739, 'samples': 10869888, 'steps': 56613, 'loss/train': 1.4873912334442139} 11/07/2021 05:10:48 - INFO - __main__ - Step 56615: {'lr': 0.0003499975569311022, 'samples': 10870080, 'steps': 56614, 'loss/train': 1.6536808013916016} 11/07/2021 05:10:48 - INFO - __main__ - Step 56616: {'lr': 0.00034999269317440804, 'samples': 10870272, 'steps': 56615, 'loss/train': 1.554644227027893} 11/07/2021 05:10:49 - INFO - __main__ - Step 56617: {'lr': 0.0003499878293726588, 'samples': 10870464, 'steps': 56616, 'loss/train': 1.8007792234420776} 11/07/2021 05:10:49 - INFO - __main__ - Step 56618: {'lr': 0.0003499829655258565, 'samples': 10870656, 'steps': 56617, 'loss/train': 1.3087400197982788} 11/07/2021 05:10:50 - INFO - __main__ - Step 56619: {'lr': 0.00034997810163400343, 'samples': 10870848, 'steps': 56618, 'loss/train': 1.185659646987915} 11/07/2021 05:10:51 - INFO - __main__ - Step 56620: {'lr': 0.0003499732376971018, 'samples': 10871040, 'steps': 56619, 'loss/train': 1.4963005781173706} 11/07/2021 05:10:51 - INFO - __main__ - Step 56621: {'lr': 0.0003499683737151538, 'samples': 10871232, 'steps': 56620, 'loss/train': 1.4350378513336182} 11/07/2021 05:10:51 - INFO - __main__ - Step 56622: {'lr': 0.0003499635096881615, 'samples': 10871424, 'steps': 56621, 'loss/train': 1.1455438137054443} 11/07/2021 05:10:52 - INFO - __main__ - Step 56623: {'lr': 0.0003499586456161273, 'samples': 10871616, 'steps': 56622, 'loss/train': 0.8691271543502808} 11/07/2021 05:10:52 - INFO - __main__ - Step 56624: {'lr': 0.0003499537814990532, 'samples': 10871808, 'steps': 56623, 'loss/train': 1.3309210538864136} 11/07/2021 05:10:53 - INFO - __main__ - Step 56625: {'lr': 0.0003499489173369415, 'samples': 10872000, 'steps': 56624, 'loss/train': 1.5248233079910278} 11/07/2021 05:10:53 - INFO - __main__ - Step 56626: {'lr': 0.00034994405312979433, 'samples': 10872192, 'steps': 56625, 'loss/train': 1.2739338874816895} 11/07/2021 05:10:54 - INFO - __main__ - Step 56627: {'lr': 0.00034993918887761386, 'samples': 10872384, 'steps': 56626, 'loss/train': 1.1588736772537231} 11/07/2021 05:10:54 - INFO - __main__ - Step 56628: {'lr': 0.0003499343245804025, 'samples': 10872576, 'steps': 56627, 'loss/train': 1.2341482639312744} 11/07/2021 05:10:54 - INFO - __main__ - Step 56629: {'lr': 0.00034992946023816216, 'samples': 10872768, 'steps': 56628, 'loss/train': 2.0720772743225098} 11/07/2021 05:10:55 - INFO - __main__ - Step 56630: {'lr': 0.00034992459585089515, 'samples': 10872960, 'steps': 56629, 'loss/train': 0.9724142551422119} 11/07/2021 05:10:56 - INFO - __main__ - Step 56631: {'lr': 0.00034991973141860366, 'samples': 10873152, 'steps': 56630, 'loss/train': 1.3173853158950806} 11/07/2021 05:10:56 - INFO - __main__ - Step 56632: {'lr': 0.00034991486694128986, 'samples': 10873344, 'steps': 56631, 'loss/train': 1.7120763063430786} 11/07/2021 05:10:57 - INFO - __main__ - Step 56633: {'lr': 0.000349910002418956, 'samples': 10873536, 'steps': 56632, 'loss/train': 1.405206322669983} 11/07/2021 05:10:57 - INFO - __main__ - Step 56634: {'lr': 0.0003499051378516043, 'samples': 10873728, 'steps': 56633, 'loss/train': 1.1220430135726929} 11/07/2021 05:10:57 - INFO - __main__ - Step 56635: {'lr': 0.0003499002732392368, 'samples': 10873920, 'steps': 56634, 'loss/train': 1.4978506565093994} 11/07/2021 05:10:58 - INFO - __main__ - Step 56636: {'lr': 0.0003498954085818558, 'samples': 10874112, 'steps': 56635, 'loss/train': 1.1060702800750732} 11/07/2021 05:10:59 - INFO - __main__ - Step 56637: {'lr': 0.00034989054387946344, 'samples': 10874304, 'steps': 56636, 'loss/train': 1.8852922916412354} 11/07/2021 05:10:59 - INFO - __main__ - Step 56638: {'lr': 0.000349885679132062, 'samples': 10874496, 'steps': 56637, 'loss/train': 1.8907239437103271} 11/07/2021 05:10:59 - INFO - __main__ - Step 56639: {'lr': 0.00034988081433965355, 'samples': 10874688, 'steps': 56638, 'loss/train': 1.0390472412109375} 11/07/2021 05:11:00 - INFO - __main__ - Step 56640: {'lr': 0.00034987594950224043, 'samples': 10874880, 'steps': 56639, 'loss/train': 0.5694819688796997} 11/07/2021 05:11:01 - INFO - __main__ - Step 56641: {'lr': 0.0003498710846198247, 'samples': 10875072, 'steps': 56640, 'loss/train': 1.0915600061416626} 11/07/2021 05:11:01 - INFO - __main__ - Step 56642: {'lr': 0.0003498662196924086, 'samples': 10875264, 'steps': 56641, 'loss/train': 0.9697006940841675} 11/07/2021 05:11:01 - INFO - __main__ - Step 56643: {'lr': 0.00034986135471999424, 'samples': 10875456, 'steps': 56642, 'loss/train': 1.376035451889038} 11/07/2021 05:11:02 - INFO - __main__ - Step 56644: {'lr': 0.00034985648970258404, 'samples': 10875648, 'steps': 56643, 'loss/train': 1.46683669090271} 11/07/2021 05:11:02 - INFO - __main__ - Step 56645: {'lr': 0.00034985162464018, 'samples': 10875840, 'steps': 56644, 'loss/train': 1.4045829772949219} 11/07/2021 05:11:03 - INFO - __main__ - Step 56646: {'lr': 0.00034984675953278433, 'samples': 10876032, 'steps': 56645, 'loss/train': 1.4965193271636963} 11/07/2021 05:11:03 - INFO - __main__ - Step 56647: {'lr': 0.00034984189438039926, 'samples': 10876224, 'steps': 56646, 'loss/train': 1.2718865871429443} 11/07/2021 05:11:04 - INFO - __main__ - Step 56648: {'lr': 0.00034983702918302696, 'samples': 10876416, 'steps': 56647, 'loss/train': 1.1979362964630127} 11/07/2021 05:11:04 - INFO - __main__ - Step 56649: {'lr': 0.00034983216394066964, 'samples': 10876608, 'steps': 56648, 'loss/train': 1.4846811294555664} 11/07/2021 05:11:05 - INFO - __main__ - Step 56650: {'lr': 0.00034982729865332953, 'samples': 10876800, 'steps': 56649, 'loss/train': 1.5367149114608765} 11/07/2021 05:11:06 - INFO - __main__ - Step 56651: {'lr': 0.0003498224333210087, 'samples': 10876992, 'steps': 56650, 'loss/train': 1.163350224494934} 11/07/2021 05:11:06 - INFO - __main__ - Step 56652: {'lr': 0.0003498175679437095, 'samples': 10877184, 'steps': 56651, 'loss/train': 1.3177233934402466} 11/07/2021 05:11:06 - INFO - __main__ - Step 56653: {'lr': 0.00034981270252143406, 'samples': 10877376, 'steps': 56652, 'loss/train': 1.9447318315505981} 11/07/2021 05:11:07 - INFO - __main__ - Step 56654: {'lr': 0.0003498078370541845, 'samples': 10877568, 'steps': 56653, 'loss/train': 1.667618751525879} 11/07/2021 05:11:07 - INFO - __main__ - Step 56655: {'lr': 0.00034980297154196306, 'samples': 10877760, 'steps': 56654, 'loss/train': 1.3392319679260254} 11/07/2021 05:11:07 - INFO - __main__ - Step 56656: {'lr': 0.0003497981059847719, 'samples': 10877952, 'steps': 56655, 'loss/train': 1.136358380317688} 11/07/2021 05:11:08 - INFO - __main__ - Step 56657: {'lr': 0.00034979324038261327, 'samples': 10878144, 'steps': 56656, 'loss/train': 1.5484172105789185} 11/07/2021 05:11:09 - INFO - __main__ - Step 56658: {'lr': 0.00034978837473548946, 'samples': 10878336, 'steps': 56657, 'loss/train': 0.7881152629852295} 11/07/2021 05:11:09 - INFO - __main__ - Step 56659: {'lr': 0.0003497835090434025, 'samples': 10878528, 'steps': 56658, 'loss/train': 2.1221213340759277} 11/07/2021 05:11:09 - INFO - __main__ - Step 56660: {'lr': 0.00034977864330635455, 'samples': 10878720, 'steps': 56659, 'loss/train': 1.5315827131271362} 11/07/2021 05:11:10 - INFO - __main__ - Step 56661: {'lr': 0.00034977377752434797, 'samples': 10878912, 'steps': 56660, 'loss/train': 0.46029800176620483} 11/07/2021 05:11:11 - INFO - __main__ - Step 56662: {'lr': 0.0003497689116973848, 'samples': 10879104, 'steps': 56661, 'loss/train': 1.756382703781128} 11/07/2021 05:11:11 - INFO - __main__ - Step 56663: {'lr': 0.00034976404582546736, 'samples': 10879296, 'steps': 56662, 'loss/train': 1.5322246551513672} 11/07/2021 05:11:11 - INFO - __main__ - Step 56664: {'lr': 0.00034975917990859773, 'samples': 10879488, 'steps': 56663, 'loss/train': 1.0125632286071777} 11/07/2021 05:11:12 - INFO - __main__ - Step 56665: {'lr': 0.00034975431394677827, 'samples': 10879680, 'steps': 56664, 'loss/train': 1.1704699993133545} 11/07/2021 05:11:12 - INFO - __main__ - Step 56666: {'lr': 0.0003497494479400109, 'samples': 10879872, 'steps': 56665, 'loss/train': 1.2864733934402466} 11/07/2021 05:11:13 - INFO - __main__ - Step 56667: {'lr': 0.00034974458188829805, 'samples': 10880064, 'steps': 56666, 'loss/train': 1.3400623798370361} 11/07/2021 05:11:13 - INFO - __main__ - Step 56668: {'lr': 0.0003497397157916418, 'samples': 10880256, 'steps': 56667, 'loss/train': 1.3853057622909546} 11/07/2021 05:11:14 - INFO - __main__ - Step 56669: {'lr': 0.00034973484965004437, 'samples': 10880448, 'steps': 56668, 'loss/train': 1.6519192457199097} 11/07/2021 05:11:14 - INFO - __main__ - Step 56670: {'lr': 0.0003497299834635079, 'samples': 10880640, 'steps': 56669, 'loss/train': 1.7198807001113892} 11/07/2021 05:11:14 - INFO - __main__ - Step 56671: {'lr': 0.0003497251172320348, 'samples': 10880832, 'steps': 56670, 'loss/train': 1.3893827199935913} 11/07/2021 05:11:16 - INFO - __main__ - Step 56672: {'lr': 0.00034972025095562697, 'samples': 10881024, 'steps': 56671, 'loss/train': 1.578684687614441} 11/07/2021 05:11:16 - INFO - __main__ - Step 56673: {'lr': 0.00034971538463428683, 'samples': 10881216, 'steps': 56672, 'loss/train': 0.8849255442619324} 11/07/2021 05:11:16 - INFO - __main__ - Step 56674: {'lr': 0.0003497105182680164, 'samples': 10881408, 'steps': 56673, 'loss/train': 0.7655577659606934} 11/07/2021 05:11:17 - INFO - __main__ - Step 56675: {'lr': 0.00034970565185681794, 'samples': 10881600, 'steps': 56674, 'loss/train': 1.4446046352386475} 11/07/2021 05:11:17 - INFO - __main__ - Step 56676: {'lr': 0.0003497007854006937, 'samples': 10881792, 'steps': 56675, 'loss/train': 1.8495018482208252} 11/07/2021 05:11:18 - INFO - __main__ - Step 56677: {'lr': 0.0003496959188996458, 'samples': 10881984, 'steps': 56676, 'loss/train': 1.4334365129470825} 11/07/2021 05:11:18 - INFO - __main__ - Step 56678: {'lr': 0.00034969105235367647, 'samples': 10882176, 'steps': 56677, 'loss/train': 1.2246066331863403} 11/07/2021 05:11:19 - INFO - __main__ - Step 56679: {'lr': 0.0003496861857627879, 'samples': 10882368, 'steps': 56678, 'loss/train': 0.0722784623503685} 11/07/2021 05:11:19 - INFO - __main__ - Step 56680: {'lr': 0.0003496813191269822, 'samples': 10882560, 'steps': 56679, 'loss/train': 4.06576681137085} 11/07/2021 05:11:19 - INFO - __main__ - Step 56681: {'lr': 0.0003496764524462617, 'samples': 10882752, 'steps': 56680, 'loss/train': 1.331288456916809} 11/07/2021 05:11:20 - INFO - __main__ - Step 56682: {'lr': 0.00034967158572062854, 'samples': 10882944, 'steps': 56681, 'loss/train': 1.4976335763931274} 11/07/2021 05:11:21 - INFO - __main__ - Step 56683: {'lr': 0.00034966671895008485, 'samples': 10883136, 'steps': 56682, 'loss/train': 1.457160234451294} 11/07/2021 05:11:21 - INFO - __main__ - Step 56684: {'lr': 0.0003496618521346329, 'samples': 10883328, 'steps': 56683, 'loss/train': 1.4467800855636597} 11/07/2021 05:11:22 - INFO - __main__ - Step 56685: {'lr': 0.00034965698527427493, 'samples': 10883520, 'steps': 56684, 'loss/train': 1.098656415939331} 11/07/2021 05:11:22 - INFO - __main__ - Step 56686: {'lr': 0.00034965211836901293, 'samples': 10883712, 'steps': 56685, 'loss/train': 0.9556757807731628} 11/07/2021 05:11:23 - INFO - __main__ - Step 56687: {'lr': 0.00034964725141884936, 'samples': 10883904, 'steps': 56686, 'loss/train': 1.6815333366394043} 11/07/2021 05:11:23 - INFO - __main__ - Step 56688: {'lr': 0.00034964238442378615, 'samples': 10884096, 'steps': 56687, 'loss/train': 1.2224161624908447} 11/07/2021 05:11:24 - INFO - __main__ - Step 56689: {'lr': 0.00034963751738382564, 'samples': 10884288, 'steps': 56688, 'loss/train': 1.469492793083191} 11/07/2021 05:11:24 - INFO - __main__ - Step 56690: {'lr': 0.00034963265029897006, 'samples': 10884480, 'steps': 56689, 'loss/train': 0.7678528428077698} 11/07/2021 05:11:24 - INFO - __main__ - Step 56691: {'lr': 0.00034962778316922156, 'samples': 10884672, 'steps': 56690, 'loss/train': 1.1522170305252075} 11/07/2021 05:11:25 - INFO - __main__ - Step 56692: {'lr': 0.0003496229159945823, 'samples': 10884864, 'steps': 56691, 'loss/train': 0.4534689486026764} 11/07/2021 05:11:26 - INFO - __main__ - Step 56693: {'lr': 0.0003496180487750544, 'samples': 10885056, 'steps': 56692, 'loss/train': 1.801413655281067} 11/07/2021 05:11:26 - INFO - __main__ - Step 56694: {'lr': 0.00034961318151064026, 'samples': 10885248, 'steps': 56693, 'loss/train': 1.435295820236206} 11/07/2021 05:11:26 - INFO - __main__ - Step 56695: {'lr': 0.00034960831420134187, 'samples': 10885440, 'steps': 56694, 'loss/train': 1.5555894374847412} 11/07/2021 05:11:27 - INFO - __main__ - Step 56696: {'lr': 0.0003496034468471616, 'samples': 10885632, 'steps': 56695, 'loss/train': 1.4497580528259277} 11/07/2021 05:11:28 - INFO - __main__ - Step 56697: {'lr': 0.00034959857944810144, 'samples': 10885824, 'steps': 56696, 'loss/train': 1.1744581460952759} 11/07/2021 05:11:28 - INFO - __main__ - Step 56698: {'lr': 0.0003495937120041638, 'samples': 10886016, 'steps': 56697, 'loss/train': 1.328554630279541} 11/07/2021 05:11:29 - INFO - __main__ - Step 56699: {'lr': 0.00034958884451535073, 'samples': 10886208, 'steps': 56698, 'loss/train': 0.9245701432228088} 11/07/2021 05:11:29 - INFO - __main__ - Step 56700: {'lr': 0.00034958397698166445, 'samples': 10886400, 'steps': 56699, 'loss/train': 1.1948953866958618} 11/07/2021 05:11:29 - INFO - __main__ - Step 56701: {'lr': 0.00034957910940310716, 'samples': 10886592, 'steps': 56700, 'loss/train': 1.4919748306274414} 11/07/2021 05:11:30 - INFO - __main__ - Step 56702: {'lr': 0.00034957424177968114, 'samples': 10886784, 'steps': 56701, 'loss/train': 1.282505750656128} 11/07/2021 05:11:31 - INFO - __main__ - Step 56703: {'lr': 0.0003495693741113884, 'samples': 10886976, 'steps': 56702, 'loss/train': 1.3276407718658447} 11/07/2021 05:11:31 - INFO - __main__ - Step 56704: {'lr': 0.00034956450639823125, 'samples': 10887168, 'steps': 56703, 'loss/train': 1.5844906568527222} 11/07/2021 05:11:31 - INFO - __main__ - Step 56705: {'lr': 0.00034955963864021194, 'samples': 10887360, 'steps': 56704, 'loss/train': 1.1592485904693604} 11/07/2021 05:11:32 - INFO - __main__ - Step 56706: {'lr': 0.00034955477083733257, 'samples': 10887552, 'steps': 56705, 'loss/train': 1.3726892471313477} 11/07/2021 05:11:33 - INFO - __main__ - Step 56707: {'lr': 0.0003495499029895953, 'samples': 10887744, 'steps': 56706, 'loss/train': 0.03601754084229469} 11/07/2021 05:11:33 - INFO - __main__ - Step 56708: {'lr': 0.00034954503509700244, 'samples': 10887936, 'steps': 56707, 'loss/train': 1.2998552322387695} 11/07/2021 05:11:33 - INFO - __main__ - Step 56709: {'lr': 0.0003495401671595561, 'samples': 10888128, 'steps': 56708, 'loss/train': 0.6695010662078857} 11/07/2021 05:11:34 - INFO - __main__ - Step 56710: {'lr': 0.0003495352991772585, 'samples': 10888320, 'steps': 56709, 'loss/train': 1.436187744140625} 11/07/2021 05:11:34 - INFO - __main__ - Step 56711: {'lr': 0.0003495304311501118, 'samples': 10888512, 'steps': 56710, 'loss/train': 0.6216771006584167} 11/07/2021 05:11:34 - INFO - __main__ - Step 56712: {'lr': 0.0003495255630781183, 'samples': 10888704, 'steps': 56711, 'loss/train': 1.65614914894104} 11/07/2021 05:11:36 - INFO - __main__ - Step 56713: {'lr': 0.00034952069496128007, 'samples': 10888896, 'steps': 56712, 'loss/train': 1.5235693454742432} 11/07/2021 05:11:36 - INFO - __main__ - Step 56714: {'lr': 0.0003495158267995994, 'samples': 10889088, 'steps': 56713, 'loss/train': 1.5222333669662476} 11/07/2021 05:11:36 - INFO - __main__ - Step 56715: {'lr': 0.0003495109585930784, 'samples': 10889280, 'steps': 56714, 'loss/train': 1.6087963581085205} 11/07/2021 05:11:37 - INFO - __main__ - Step 56716: {'lr': 0.0003495060903417192, 'samples': 10889472, 'steps': 56715, 'loss/train': 1.353843331336975} 11/07/2021 05:11:37 - INFO - __main__ - Step 56717: {'lr': 0.00034950122204552417, 'samples': 10889664, 'steps': 56716, 'loss/train': 1.246916651725769} 11/07/2021 05:11:38 - INFO - __main__ - Step 56718: {'lr': 0.00034949635370449546, 'samples': 10889856, 'steps': 56717, 'loss/train': 1.7605196237564087} 11/07/2021 05:11:38 - INFO - __main__ - Step 56719: {'lr': 0.00034949148531863517, 'samples': 10890048, 'steps': 56718, 'loss/train': 1.7312897443771362} 11/07/2021 05:11:39 - INFO - __main__ - Step 56720: {'lr': 0.0003494866168879456, 'samples': 10890240, 'steps': 56719, 'loss/train': 0.5320704579353333} 11/07/2021 05:11:39 - INFO - __main__ - Step 56721: {'lr': 0.0003494817484124289, 'samples': 10890432, 'steps': 56720, 'loss/train': 1.625646710395813} 11/07/2021 05:11:39 - INFO - __main__ - Step 56722: {'lr': 0.0003494768798920872, 'samples': 10890624, 'steps': 56721, 'loss/train': 1.09530508518219} 11/07/2021 05:11:40 - INFO - __main__ - Step 56723: {'lr': 0.0003494720113269227, 'samples': 10890816, 'steps': 56722, 'loss/train': 1.383299469947815} 11/07/2021 05:11:41 - INFO - __main__ - Step 56724: {'lr': 0.00034946714271693783, 'samples': 10891008, 'steps': 56723, 'loss/train': 1.5900954008102417} 11/07/2021 05:11:41 - INFO - __main__ - Step 56725: {'lr': 0.0003494622740621345, 'samples': 10891200, 'steps': 56724, 'loss/train': 1.601185917854309} 11/07/2021 05:11:41 - INFO - __main__ - Step 56726: {'lr': 0.00034945740536251505, 'samples': 10891392, 'steps': 56725, 'loss/train': 1.5290385484695435} 11/07/2021 05:11:42 - INFO - __main__ - Step 56727: {'lr': 0.0003494525366180815, 'samples': 10891584, 'steps': 56726, 'loss/train': 1.253729224205017} 11/07/2021 05:11:43 - INFO - __main__ - Step 56728: {'lr': 0.0003494476678288363, 'samples': 10891776, 'steps': 56727, 'loss/train': 0.7405962347984314} 11/07/2021 05:11:43 - INFO - __main__ - Step 56729: {'lr': 0.00034944279899478146, 'samples': 10891968, 'steps': 56728, 'loss/train': 1.7755745649337769} 11/07/2021 05:11:44 - INFO - __main__ - Step 56730: {'lr': 0.00034943793011591926, 'samples': 10892160, 'steps': 56729, 'loss/train': 1.3615187406539917} 11/07/2021 05:11:44 - INFO - __main__ - Step 56731: {'lr': 0.0003494330611922518, 'samples': 10892352, 'steps': 56730, 'loss/train': 1.5941202640533447} 11/07/2021 05:11:44 - INFO - __main__ - Step 56732: {'lr': 0.0003494281922237814, 'samples': 10892544, 'steps': 56731, 'loss/train': 1.1688631772994995} 11/07/2021 05:11:45 - INFO - __main__ - Step 56733: {'lr': 0.0003494233232105102, 'samples': 10892736, 'steps': 56732, 'loss/train': 1.40535569190979} 11/07/2021 05:11:46 - INFO - __main__ - Step 56734: {'lr': 0.0003494184541524403, 'samples': 10892928, 'steps': 56733, 'loss/train': 1.4211196899414062} 11/07/2021 05:11:46 - INFO - __main__ - Step 56735: {'lr': 0.0003494135850495741, 'samples': 10893120, 'steps': 56734, 'loss/train': 1.3987970352172852} 11/07/2021 05:11:46 - INFO - __main__ - Step 56736: {'lr': 0.0003494087159019136, 'samples': 10893312, 'steps': 56735, 'loss/train': 1.4669206142425537} 11/07/2021 05:11:47 - INFO - __main__ - Step 56737: {'lr': 0.0003494038467094611, 'samples': 10893504, 'steps': 56736, 'loss/train': 1.8312269449234009} 11/07/2021 05:11:48 - INFO - __main__ - Step 56738: {'lr': 0.00034939897747221873, 'samples': 10893696, 'steps': 56737, 'loss/train': 1.2512260675430298} 11/07/2021 05:11:48 - INFO - __main__ - Step 56739: {'lr': 0.00034939410819018874, 'samples': 10893888, 'steps': 56738, 'loss/train': 1.0880860090255737} 11/07/2021 05:11:48 - INFO - __main__ - Step 56740: {'lr': 0.0003493892388633733, 'samples': 10894080, 'steps': 56739, 'loss/train': 1.3743757009506226} 11/07/2021 05:11:49 - INFO - __main__ - Step 56741: {'lr': 0.0003493843694917745, 'samples': 10894272, 'steps': 56740, 'loss/train': 1.1670089960098267} 11/07/2021 05:11:49 - INFO - __main__ - Step 56742: {'lr': 0.00034937950007539475, 'samples': 10894464, 'steps': 56741, 'loss/train': 1.5157989263534546} 11/07/2021 05:11:50 - INFO - __main__ - Step 56743: {'lr': 0.0003493746306142361, 'samples': 10894656, 'steps': 56742, 'loss/train': 1.471632957458496} 11/07/2021 05:11:51 - INFO - __main__ - Step 56744: {'lr': 0.00034936976110830077, 'samples': 10894848, 'steps': 56743, 'loss/train': 1.2681918144226074} 11/07/2021 05:11:51 - INFO - __main__ - Step 56745: {'lr': 0.000349364891557591, 'samples': 10895040, 'steps': 56744, 'loss/train': 1.4375859498977661} 11/07/2021 05:11:51 - INFO - __main__ - Step 56746: {'lr': 0.00034936002196210895, 'samples': 10895232, 'steps': 56745, 'loss/train': 1.1087936162948608} 11/07/2021 05:11:52 - INFO - __main__ - Step 56747: {'lr': 0.0003493551523218567, 'samples': 10895424, 'steps': 56746, 'loss/train': 1.2363958358764648} 11/07/2021 05:11:52 - INFO - __main__ - Step 56748: {'lr': 0.0003493502826368366, 'samples': 10895616, 'steps': 56747, 'loss/train': 1.2666784524917603} 11/07/2021 05:11:53 - INFO - __main__ - Step 56749: {'lr': 0.0003493454129070508, 'samples': 10895808, 'steps': 56748, 'loss/train': 1.105308175086975} 11/07/2021 05:11:54 - INFO - __main__ - Step 56750: {'lr': 0.0003493405431325015, 'samples': 10896000, 'steps': 56749, 'loss/train': 0.38008758425712585} 11/07/2021 05:11:54 - INFO - __main__ - Step 56751: {'lr': 0.0003493356733131909, 'samples': 10896192, 'steps': 56750, 'loss/train': 1.7625583410263062} 11/07/2021 05:11:54 - INFO - __main__ - Step 56752: {'lr': 0.0003493308034491212, 'samples': 10896384, 'steps': 56751, 'loss/train': 0.9974585771560669} 11/07/2021 05:11:55 - INFO - __main__ - Step 56753: {'lr': 0.00034932593354029454, 'samples': 10896576, 'steps': 56752, 'loss/train': 1.030988335609436} 11/07/2021 05:11:56 - INFO - __main__ - Step 56754: {'lr': 0.00034932106358671314, 'samples': 10896768, 'steps': 56753, 'loss/train': 1.3519301414489746} 11/07/2021 05:11:56 - INFO - __main__ - Step 56755: {'lr': 0.0003493161935883792, 'samples': 10896960, 'steps': 56754, 'loss/train': 1.558119773864746} 11/07/2021 05:11:57 - INFO - __main__ - Step 56756: {'lr': 0.0003493113235452949, 'samples': 10897152, 'steps': 56755, 'loss/train': 1.1805732250213623} 11/07/2021 05:11:57 - INFO - __main__ - Step 56757: {'lr': 0.00034930645345746246, 'samples': 10897344, 'steps': 56756, 'loss/train': 0.9049232602119446} 11/07/2021 05:11:57 - INFO - __main__ - Step 56758: {'lr': 0.0003493015833248841, 'samples': 10897536, 'steps': 56757, 'loss/train': 1.5116055011749268} 11/07/2021 05:11:58 - INFO - __main__ - Step 56759: {'lr': 0.00034929671314756197, 'samples': 10897728, 'steps': 56758, 'loss/train': 1.3431657552719116} 11/07/2021 05:11:59 - INFO - __main__ - Step 56760: {'lr': 0.0003492918429254983, 'samples': 10897920, 'steps': 56759, 'loss/train': 1.3007172346115112} 11/07/2021 05:11:59 - INFO - __main__ - Step 56761: {'lr': 0.00034928697265869515, 'samples': 10898112, 'steps': 56760, 'loss/train': 1.5657540559768677} 11/07/2021 05:11:59 - INFO - __main__ - Step 56762: {'lr': 0.00034928210234715497, 'samples': 10898304, 'steps': 56761, 'loss/train': 1.4907721281051636} 11/07/2021 05:12:00 - INFO - __main__ - Step 56763: {'lr': 0.0003492772319908797, 'samples': 10898496, 'steps': 56762, 'loss/train': 1.4233965873718262} 11/07/2021 05:12:01 - INFO - __main__ - Step 56764: {'lr': 0.0003492723615898716, 'samples': 10898688, 'steps': 56763, 'loss/train': 0.4662286937236786} 11/07/2021 05:12:01 - INFO - __main__ - Step 56765: {'lr': 0.000349267491144133, 'samples': 10898880, 'steps': 56764, 'loss/train': 1.4547139406204224} 11/07/2021 05:12:01 - INFO - __main__ - Step 56766: {'lr': 0.00034926262065366597, 'samples': 10899072, 'steps': 56765, 'loss/train': 1.2869064807891846} 11/07/2021 05:12:02 - INFO - __main__ - Step 56767: {'lr': 0.0003492577501184727, 'samples': 10899264, 'steps': 56766, 'loss/train': 1.4037586450576782} 11/07/2021 05:12:02 - INFO - __main__ - Step 56768: {'lr': 0.0003492528795385556, 'samples': 10899456, 'steps': 56767, 'loss/train': 1.3813951015472412} 11/07/2021 05:12:03 - INFO - __main__ - Step 56769: {'lr': 0.00034924800891391645, 'samples': 10899648, 'steps': 56768, 'loss/train': 0.9622254371643066} 11/07/2021 05:12:03 - INFO - __main__ - Step 56770: {'lr': 0.0003492431382445578, 'samples': 10899840, 'steps': 56769, 'loss/train': 1.509605884552002} 11/07/2021 05:12:04 - INFO - __main__ - Step 56771: {'lr': 0.00034923826753048163, 'samples': 10900032, 'steps': 56770, 'loss/train': 1.247928500175476} 11/07/2021 05:12:04 - INFO - __main__ - Step 56772: {'lr': 0.00034923339677169033, 'samples': 10900224, 'steps': 56771, 'loss/train': 1.6541073322296143} 11/07/2021 05:12:05 - INFO - __main__ - Step 56773: {'lr': 0.000349228525968186, 'samples': 10900416, 'steps': 56772, 'loss/train': 1.0791624784469604} 11/07/2021 05:12:06 - INFO - __main__ - Step 56774: {'lr': 0.0003492236551199707, 'samples': 10900608, 'steps': 56773, 'loss/train': 1.03981351852417} 11/07/2021 05:12:06 - INFO - __main__ - Step 56775: {'lr': 0.0003492187842270469, 'samples': 10900800, 'steps': 56774, 'loss/train': 1.2804206609725952} 11/07/2021 05:12:06 - INFO - __main__ - Step 56776: {'lr': 0.00034921391328941655, 'samples': 10900992, 'steps': 56775, 'loss/train': 1.37317955493927} 11/07/2021 05:12:07 - INFO - __main__ - Step 56777: {'lr': 0.00034920904230708195, 'samples': 10901184, 'steps': 56776, 'loss/train': 1.294616937637329} 11/07/2021 05:12:07 - INFO - __main__ - Step 56778: {'lr': 0.0003492041712800453, 'samples': 10901376, 'steps': 56777, 'loss/train': 1.3605133295059204} 11/07/2021 05:12:08 - INFO - __main__ - Step 56779: {'lr': 0.0003491993002083088, 'samples': 10901568, 'steps': 56778, 'loss/train': 1.0155563354492188} 11/07/2021 05:12:09 - INFO - __main__ - Step 56780: {'lr': 0.00034919442909187465, 'samples': 10901760, 'steps': 56779, 'loss/train': 1.3637478351593018} 11/07/2021 05:12:09 - INFO - __main__ - Step 56781: {'lr': 0.000349189557930745, 'samples': 10901952, 'steps': 56780, 'loss/train': 1.6369366645812988} 11/07/2021 05:12:09 - INFO - __main__ - Step 56782: {'lr': 0.000349184686724922, 'samples': 10902144, 'steps': 56781, 'loss/train': 1.3192890882492065} 11/07/2021 05:12:10 - INFO - __main__ - Step 56783: {'lr': 0.00034917981547440797, 'samples': 10902336, 'steps': 56782, 'loss/train': 1.9538573026657104} 11/07/2021 05:12:11 - INFO - __main__ - Step 56784: {'lr': 0.00034917494417920504, 'samples': 10902528, 'steps': 56783, 'loss/train': 1.6415321826934814} 11/07/2021 05:12:11 - INFO - __main__ - Step 56785: {'lr': 0.0003491700728393154, 'samples': 10902720, 'steps': 56784, 'loss/train': 1.7147635221481323} 11/07/2021 05:12:11 - INFO - __main__ - Step 56786: {'lr': 0.0003491652014547413, 'samples': 10902912, 'steps': 56785, 'loss/train': 1.1865392923355103} 11/07/2021 05:12:12 - INFO - __main__ - Step 56787: {'lr': 0.00034916033002548486, 'samples': 10903104, 'steps': 56786, 'loss/train': 1.6632952690124512} 11/07/2021 05:12:12 - INFO - __main__ - Step 56788: {'lr': 0.00034915545855154827, 'samples': 10903296, 'steps': 56787, 'loss/train': 1.759561538696289} 11/07/2021 05:12:13 - INFO - __main__ - Step 56789: {'lr': 0.00034915058703293377, 'samples': 10903488, 'steps': 56788, 'loss/train': 1.39176607131958} 11/07/2021 05:12:13 - INFO - __main__ - Step 56790: {'lr': 0.0003491457154696436, 'samples': 10903680, 'steps': 56789, 'loss/train': 1.5938973426818848} 11/07/2021 05:12:14 - INFO - __main__ - Step 56791: {'lr': 0.0003491408438616798, 'samples': 10903872, 'steps': 56790, 'loss/train': 1.3176683187484741} 11/07/2021 05:12:14 - INFO - __main__ - Step 56792: {'lr': 0.0003491359722090448, 'samples': 10904064, 'steps': 56791, 'loss/train': 1.5389387607574463} 11/07/2021 05:12:14 - INFO - __main__ - Step 56793: {'lr': 0.00034913110051174056, 'samples': 10904256, 'steps': 56792, 'loss/train': 1.2145445346832275} 11/07/2021 05:12:15 - INFO - __main__ - Step 56794: {'lr': 0.0003491262287697694, 'samples': 10904448, 'steps': 56793, 'loss/train': 1.434438705444336} 11/07/2021 05:12:16 - INFO - __main__ - Step 56795: {'lr': 0.0003491213569831335, 'samples': 10904640, 'steps': 56794, 'loss/train': 1.6607800722122192} 11/07/2021 05:12:16 - INFO - __main__ - Step 56796: {'lr': 0.000349116485151835, 'samples': 10904832, 'steps': 56795, 'loss/train': 1.0760871171951294} 11/07/2021 05:12:16 - INFO - __main__ - Step 56797: {'lr': 0.00034911161327587625, 'samples': 10905024, 'steps': 56796, 'loss/train': 1.4420219659805298} 11/07/2021 05:12:17 - INFO - __main__ - Step 56798: {'lr': 0.00034910674135525926, 'samples': 10905216, 'steps': 56797, 'loss/train': 1.4551119804382324} 11/07/2021 05:12:18 - INFO - __main__ - Step 56799: {'lr': 0.0003491018693899863, 'samples': 10905408, 'steps': 56798, 'loss/train': 0.805732786655426} 11/07/2021 05:12:18 - INFO - __main__ - Step 56800: {'lr': 0.00034909699738005964, 'samples': 10905600, 'steps': 56799, 'loss/train': 1.4538215398788452} 11/07/2021 05:12:18 - INFO - __main__ - Step 56801: {'lr': 0.0003490921253254813, 'samples': 10905792, 'steps': 56800, 'loss/train': 1.2486374378204346} 11/07/2021 05:12:19 - INFO - __main__ - Step 56802: {'lr': 0.00034908725322625365, 'samples': 10905984, 'steps': 56801, 'loss/train': 1.6391202211380005} 11/07/2021 05:12:19 - INFO - __main__ - Step 56803: {'lr': 0.0003490823810823788, 'samples': 10906176, 'steps': 56802, 'loss/train': 1.2304691076278687} 11/07/2021 05:12:20 - INFO - __main__ - Step 56804: {'lr': 0.0003490775088938589, 'samples': 10906368, 'steps': 56803, 'loss/train': 1.4793990850448608} 11/07/2021 05:12:21 - INFO - __main__ - Step 56805: {'lr': 0.00034907263666069624, 'samples': 10906560, 'steps': 56804, 'loss/train': 1.2749805450439453} 11/07/2021 05:12:21 - INFO - __main__ - Step 56806: {'lr': 0.000349067764382893, 'samples': 10906752, 'steps': 56805, 'loss/train': 1.2632070779800415} 11/07/2021 05:12:21 - INFO - __main__ - Step 56807: {'lr': 0.0003490628920604513, 'samples': 10906944, 'steps': 56806, 'loss/train': 1.7383389472961426} 11/07/2021 05:12:22 - INFO - __main__ - Step 56808: {'lr': 0.00034905801969337347, 'samples': 10907136, 'steps': 56807, 'loss/train': 1.5192008018493652} 11/07/2021 05:12:23 - INFO - __main__ - Step 56809: {'lr': 0.0003490531472816616, 'samples': 10907328, 'steps': 56808, 'loss/train': 1.4717756509780884} 11/07/2021 05:12:23 - INFO - __main__ - Step 56810: {'lr': 0.00034904827482531785, 'samples': 10907520, 'steps': 56809, 'loss/train': 0.5518280267715454} 11/07/2021 05:12:23 - INFO - __main__ - Step 56811: {'lr': 0.0003490434023243445, 'samples': 10907712, 'steps': 56810, 'loss/train': 1.182074785232544} 11/07/2021 05:12:24 - INFO - __main__ - Step 56812: {'lr': 0.0003490385297787438, 'samples': 10907904, 'steps': 56811, 'loss/train': 1.262606143951416} 11/07/2021 05:12:24 - INFO - __main__ - Step 56813: {'lr': 0.00034903365718851775, 'samples': 10908096, 'steps': 56812, 'loss/train': 0.9040560722351074} 11/07/2021 05:12:25 - INFO - __main__ - Step 56814: {'lr': 0.00034902878455366876, 'samples': 10908288, 'steps': 56813, 'loss/train': 1.0892393589019775} 11/07/2021 05:12:25 - INFO - __main__ - Step 56815: {'lr': 0.0003490239118741989, 'samples': 10908480, 'steps': 56814, 'loss/train': 1.5409772396087646} 11/07/2021 05:12:26 - INFO - __main__ - Step 56816: {'lr': 0.00034901903915011035, 'samples': 10908672, 'steps': 56815, 'loss/train': 1.5004373788833618} 11/07/2021 05:12:26 - INFO - __main__ - Step 56817: {'lr': 0.0003490141663814054, 'samples': 10908864, 'steps': 56816, 'loss/train': 1.1160506010055542} 11/07/2021 05:12:26 - INFO - __main__ - Step 56818: {'lr': 0.00034900929356808613, 'samples': 10909056, 'steps': 56817, 'loss/train': 1.5083330869674683} 11/07/2021 05:12:28 - INFO - __main__ - Step 56819: {'lr': 0.00034900442071015485, 'samples': 10909248, 'steps': 56818, 'loss/train': 1.148646593093872} 11/07/2021 05:12:28 - INFO - __main__ - Step 56820: {'lr': 0.00034899954780761373, 'samples': 10909440, 'steps': 56819, 'loss/train': 1.3670462369918823} 11/07/2021 05:12:28 - INFO - __main__ - Step 56821: {'lr': 0.00034899467486046486, 'samples': 10909632, 'steps': 56820, 'loss/train': 1.1212408542633057} 11/07/2021 05:12:29 - INFO - __main__ - Step 56822: {'lr': 0.0003489898018687106, 'samples': 10909824, 'steps': 56821, 'loss/train': 1.2553880214691162} 11/07/2021 05:12:29 - INFO - __main__ - Step 56823: {'lr': 0.000348984928832353, 'samples': 10910016, 'steps': 56822, 'loss/train': 1.1198461055755615} 11/07/2021 05:12:30 - INFO - __main__ - Step 56824: {'lr': 0.00034898005575139437, 'samples': 10910208, 'steps': 56823, 'loss/train': 1.5854177474975586} 11/07/2021 05:12:30 - INFO - __main__ - Step 56825: {'lr': 0.00034897518262583683, 'samples': 10910400, 'steps': 56824, 'loss/train': 1.3185606002807617} 11/07/2021 05:12:31 - INFO - __main__ - Step 56826: {'lr': 0.00034897030945568264, 'samples': 10910592, 'steps': 56825, 'loss/train': 1.1018697023391724} 11/07/2021 05:12:31 - INFO - __main__ - Step 56827: {'lr': 0.0003489654362409339, 'samples': 10910784, 'steps': 56826, 'loss/train': 0.7537766098976135} 11/07/2021 05:12:31 - INFO - __main__ - Step 56828: {'lr': 0.00034896056298159287, 'samples': 10910976, 'steps': 56827, 'loss/train': 1.3221607208251953} 11/07/2021 05:12:33 - INFO - __main__ - Step 56829: {'lr': 0.0003489556896776618, 'samples': 10911168, 'steps': 56828, 'loss/train': 0.6373487710952759} 11/07/2021 05:12:33 - INFO - __main__ - Step 56830: {'lr': 0.00034895081632914274, 'samples': 10911360, 'steps': 56829, 'loss/train': 1.5407037734985352} 11/07/2021 05:12:33 - INFO - __main__ - Step 56831: {'lr': 0.000348945942936038, 'samples': 10911552, 'steps': 56830, 'loss/train': 1.3099194765090942} 11/07/2021 05:12:34 - INFO - __main__ - Step 56832: {'lr': 0.0003489410694983497, 'samples': 10911744, 'steps': 56831, 'loss/train': 1.4125101566314697} 11/07/2021 05:12:34 - INFO - __main__ - Step 56833: {'lr': 0.00034893619601608015, 'samples': 10911936, 'steps': 56832, 'loss/train': 1.600282073020935} 11/07/2021 05:12:34 - INFO - __main__ - Step 56834: {'lr': 0.0003489313224892314, 'samples': 10912128, 'steps': 56833, 'loss/train': 1.6528139114379883} 11/07/2021 05:12:35 - INFO - __main__ - Step 56835: {'lr': 0.0003489264489178058, 'samples': 10912320, 'steps': 56834, 'loss/train': 1.6586995124816895} 11/07/2021 05:12:36 - INFO - __main__ - Step 56836: {'lr': 0.00034892157530180546, 'samples': 10912512, 'steps': 56835, 'loss/train': 0.987562894821167} 11/07/2021 05:12:36 - INFO - __main__ - Step 56837: {'lr': 0.0003489167016412326, 'samples': 10912704, 'steps': 56836, 'loss/train': 1.5761076211929321} 11/07/2021 05:12:36 - INFO - __main__ - Step 56838: {'lr': 0.00034891182793608935, 'samples': 10912896, 'steps': 56837, 'loss/train': 1.2682623863220215} 11/07/2021 05:12:37 - INFO - __main__ - Step 56839: {'lr': 0.000348906954186378, 'samples': 10913088, 'steps': 56838, 'loss/train': 2.0079851150512695} 11/07/2021 05:12:38 - INFO - __main__ - Step 56840: {'lr': 0.0003489020803921007, 'samples': 10913280, 'steps': 56839, 'loss/train': 1.529241681098938} 11/07/2021 05:12:38 - INFO - __main__ - Step 56841: {'lr': 0.00034889720655325955, 'samples': 10913472, 'steps': 56840, 'loss/train': 1.412021517753601} 11/07/2021 05:12:38 - INFO - __main__ - Step 56842: {'lr': 0.000348892332669857, 'samples': 10913664, 'steps': 56841, 'loss/train': 1.6484094858169556} 11/07/2021 05:12:39 - INFO - __main__ - Step 56843: {'lr': 0.000348887458741895, 'samples': 10913856, 'steps': 56842, 'loss/train': 1.524601936340332} 11/07/2021 05:12:39 - INFO - __main__ - Step 56844: {'lr': 0.0003488825847693758, 'samples': 10914048, 'steps': 56843, 'loss/train': 1.6736701726913452} 11/07/2021 05:12:40 - INFO - __main__ - Step 56845: {'lr': 0.0003488777107523017, 'samples': 10914240, 'steps': 56844, 'loss/train': 1.7258293628692627} 11/07/2021 05:12:40 - INFO - __main__ - Step 56846: {'lr': 0.0003488728366906748, 'samples': 10914432, 'steps': 56845, 'loss/train': 1.5056270360946655} 11/07/2021 05:12:41 - INFO - __main__ - Step 56847: {'lr': 0.0003488679625844974, 'samples': 10914624, 'steps': 56846, 'loss/train': 1.5097498893737793} 11/07/2021 05:12:41 - INFO - __main__ - Step 56848: {'lr': 0.0003488630884337715, 'samples': 10914816, 'steps': 56847, 'loss/train': 1.4018149375915527} 11/07/2021 05:12:42 - INFO - __main__ - Step 56849: {'lr': 0.0003488582142384995, 'samples': 10915008, 'steps': 56848, 'loss/train': 1.080633521080017} 11/07/2021 05:12:43 - INFO - __main__ - Step 56850: {'lr': 0.00034885333999868344, 'samples': 10915200, 'steps': 56849, 'loss/train': 0.059797029942274094} 11/07/2021 05:12:43 - INFO - __main__ - Step 56851: {'lr': 0.0003488484657143257, 'samples': 10915392, 'steps': 56850, 'loss/train': 1.1406599283218384} 11/07/2021 05:12:43 - INFO - __main__ - Step 56852: {'lr': 0.00034884359138542825, 'samples': 10915584, 'steps': 56851, 'loss/train': 1.5817950963974} 11/07/2021 05:12:44 - INFO - __main__ - Step 56853: {'lr': 0.0003488387170119935, 'samples': 10915776, 'steps': 56852, 'loss/train': 1.3233317136764526} 11/07/2021 05:12:44 - INFO - __main__ - Step 56854: {'lr': 0.0003488338425940235, 'samples': 10915968, 'steps': 56853, 'loss/train': 0.7318007349967957} 11/07/2021 05:12:45 - INFO - __main__ - Step 56855: {'lr': 0.00034882896813152056, 'samples': 10916160, 'steps': 56854, 'loss/train': 1.2505193948745728} 11/07/2021 05:12:46 - INFO - __main__ - Step 56856: {'lr': 0.0003488240936244867, 'samples': 10916352, 'steps': 56855, 'loss/train': 2.2404658794403076} 11/07/2021 05:12:46 - INFO - __main__ - Step 56857: {'lr': 0.0003488192190729243, 'samples': 10916544, 'steps': 56856, 'loss/train': 1.4771580696105957} 11/07/2021 05:12:46 - INFO - __main__ - Step 56858: {'lr': 0.0003488143444768355, 'samples': 10916736, 'steps': 56857, 'loss/train': 1.3698559999465942} 11/07/2021 05:12:47 - INFO - __main__ - Step 56859: {'lr': 0.0003488094698362224, 'samples': 10916928, 'steps': 56858, 'loss/train': 1.3732186555862427} 11/07/2021 05:12:48 - INFO - __main__ - Step 56860: {'lr': 0.00034880459515108735, 'samples': 10917120, 'steps': 56859, 'loss/train': 1.0488653182983398} 11/07/2021 05:12:48 - INFO - __main__ - Step 56861: {'lr': 0.0003487997204214325, 'samples': 10917312, 'steps': 56860, 'loss/train': 1.0818208456039429} 11/07/2021 05:12:48 - INFO - __main__ - Step 56862: {'lr': 0.00034879484564725993, 'samples': 10917504, 'steps': 56861, 'loss/train': 1.702939748764038} 11/07/2021 05:12:49 - INFO - __main__ - Step 56863: {'lr': 0.00034878997082857195, 'samples': 10917696, 'steps': 56862, 'loss/train': 1.3372565507888794} 11/07/2021 05:12:49 - INFO - __main__ - Step 56864: {'lr': 0.0003487850959653708, 'samples': 10917888, 'steps': 56863, 'loss/train': 1.1413240432739258} 11/07/2021 05:12:49 - INFO - __main__ - Step 56865: {'lr': 0.0003487802210576585, 'samples': 10918080, 'steps': 56864, 'loss/train': 1.5832221508026123} 11/07/2021 05:12:51 - INFO - __main__ - Step 56866: {'lr': 0.0003487753461054375, 'samples': 10918272, 'steps': 56865, 'loss/train': 1.3116753101348877} 11/07/2021 05:12:51 - INFO - __main__ - Step 56867: {'lr': 0.00034877047110870975, 'samples': 10918464, 'steps': 56866, 'loss/train': 1.2290250062942505} 11/07/2021 05:12:51 - INFO - __main__ - Step 56868: {'lr': 0.0003487655960674776, 'samples': 10918656, 'steps': 56867, 'loss/train': 1.3250553607940674} 11/07/2021 05:12:52 - INFO - __main__ - Step 56869: {'lr': 0.00034876072098174315, 'samples': 10918848, 'steps': 56868, 'loss/train': 1.3140314817428589} 11/07/2021 05:12:52 - INFO - __main__ - Step 56870: {'lr': 0.00034875584585150864, 'samples': 10919040, 'steps': 56869, 'loss/train': 1.451603651046753} 11/07/2021 05:12:53 - INFO - __main__ - Step 56871: {'lr': 0.0003487509706767763, 'samples': 10919232, 'steps': 56870, 'loss/train': 1.445232629776001} 11/07/2021 05:12:53 - INFO - __main__ - Step 56872: {'lr': 0.00034874609545754826, 'samples': 10919424, 'steps': 56871, 'loss/train': 0.06420867890119553} 11/07/2021 05:12:54 - INFO - __main__ - Step 56873: {'lr': 0.00034874122019382684, 'samples': 10919616, 'steps': 56872, 'loss/train': 1.2108557224273682} 11/07/2021 05:12:54 - INFO - __main__ - Step 56874: {'lr': 0.0003487363448856141, 'samples': 10919808, 'steps': 56873, 'loss/train': 1.455207347869873} 11/07/2021 05:12:55 - INFO - __main__ - Step 56875: {'lr': 0.00034873146953291224, 'samples': 10920000, 'steps': 56874, 'loss/train': 1.3070580959320068} 11/07/2021 05:12:55 - INFO - __main__ - Step 56876: {'lr': 0.0003487265941357236, 'samples': 10920192, 'steps': 56875, 'loss/train': 1.1097195148468018} 11/07/2021 05:12:56 - INFO - __main__ - Step 56877: {'lr': 0.00034872171869405015, 'samples': 10920384, 'steps': 56876, 'loss/train': 1.4130849838256836} 11/07/2021 05:12:56 - INFO - __main__ - Step 56878: {'lr': 0.0003487168432078943, 'samples': 10920576, 'steps': 56877, 'loss/train': 1.3187248706817627} 11/07/2021 05:12:57 - INFO - __main__ - Step 56879: {'lr': 0.0003487119676772582, 'samples': 10920768, 'steps': 56878, 'loss/train': 2.050884246826172} 11/07/2021 05:12:57 - INFO - __main__ - Step 56880: {'lr': 0.00034870709210214397, 'samples': 10920960, 'steps': 56879, 'loss/train': 0.8796589374542236} 11/07/2021 05:12:58 - INFO - __main__ - Step 56881: {'lr': 0.00034870221648255383, 'samples': 10921152, 'steps': 56880, 'loss/train': 1.1586881875991821} 11/07/2021 05:12:58 - INFO - __main__ - Step 56882: {'lr': 0.00034869734081849, 'samples': 10921344, 'steps': 56881, 'loss/train': 1.4075803756713867} 11/07/2021 05:12:59 - INFO - __main__ - Step 56883: {'lr': 0.0003486924651099547, 'samples': 10921536, 'steps': 56882, 'loss/train': 0.9032168984413147} 11/07/2021 05:12:59 - INFO - __main__ - Step 56884: {'lr': 0.00034868758935695, 'samples': 10921728, 'steps': 56883, 'loss/train': 1.346758246421814} 11/07/2021 05:12:59 - INFO - __main__ - Step 56885: {'lr': 0.0003486827135594783, 'samples': 10921920, 'steps': 56884, 'loss/train': 1.114176869392395} 11/07/2021 05:13:00 - INFO - __main__ - Step 56886: {'lr': 0.0003486778377175417, 'samples': 10922112, 'steps': 56885, 'loss/train': 1.0794364213943481} 11/07/2021 05:13:01 - INFO - __main__ - Step 56887: {'lr': 0.00034867296183114236, 'samples': 10922304, 'steps': 56886, 'loss/train': 1.1506068706512451} 11/07/2021 05:13:01 - INFO - __main__ - Step 56888: {'lr': 0.0003486680859002825, 'samples': 10922496, 'steps': 56887, 'loss/train': 1.1032979488372803} 11/07/2021 05:13:01 - INFO - __main__ - Step 56889: {'lr': 0.00034866320992496427, 'samples': 10922688, 'steps': 56888, 'loss/train': 1.5184299945831299} 11/07/2021 05:13:02 - INFO - __main__ - Step 56890: {'lr': 0.00034865833390518996, 'samples': 10922880, 'steps': 56889, 'loss/train': 1.0353816747665405} 11/07/2021 05:13:03 - INFO - __main__ - Step 56891: {'lr': 0.0003486534578409618, 'samples': 10923072, 'steps': 56890, 'loss/train': 1.490349292755127} 11/07/2021 05:13:03 - INFO - __main__ - Step 56892: {'lr': 0.0003486485817322819, 'samples': 10923264, 'steps': 56891, 'loss/train': 1.4208921194076538} 11/07/2021 05:13:03 - INFO - __main__ - Step 56893: {'lr': 0.0003486437055791524, 'samples': 10923456, 'steps': 56892, 'loss/train': 1.3553944826126099} 11/07/2021 05:13:04 - INFO - __main__ - Step 56894: {'lr': 0.00034863882938157553, 'samples': 10923648, 'steps': 56893, 'loss/train': 1.4005050659179688} 11/07/2021 05:13:04 - INFO - __main__ - Step 56895: {'lr': 0.0003486339531395536, 'samples': 10923840, 'steps': 56894, 'loss/train': 1.583556056022644} 11/07/2021 05:13:05 - INFO - __main__ - Step 56896: {'lr': 0.0003486290768530887, 'samples': 10924032, 'steps': 56895, 'loss/train': 1.2132513523101807} 11/07/2021 05:13:06 - INFO - __main__ - Step 56897: {'lr': 0.00034862420052218313, 'samples': 10924224, 'steps': 56896, 'loss/train': 1.1575571298599243} 11/07/2021 05:13:06 - INFO - __main__ - Step 56898: {'lr': 0.00034861932414683897, 'samples': 10924416, 'steps': 56897, 'loss/train': 1.5526455640792847} 11/07/2021 05:13:06 - INFO - __main__ - Step 56899: {'lr': 0.00034861444772705846, 'samples': 10924608, 'steps': 56898, 'loss/train': 1.6995787620544434} 11/07/2021 05:13:07 - INFO - __main__ - Step 56900: {'lr': 0.0003486095712628438, 'samples': 10924800, 'steps': 56899, 'loss/train': 1.4922962188720703} 11/07/2021 05:13:08 - INFO - __main__ - Step 56901: {'lr': 0.00034860469475419723, 'samples': 10924992, 'steps': 56900, 'loss/train': 1.801711082458496} 11/07/2021 05:13:08 - INFO - __main__ - Step 56902: {'lr': 0.00034859981820112084, 'samples': 10925184, 'steps': 56901, 'loss/train': 1.2205525636672974} 11/07/2021 05:13:08 - INFO - __main__ - Step 56903: {'lr': 0.00034859494160361694, 'samples': 10925376, 'steps': 56902, 'loss/train': 1.3201030492782593} 11/07/2021 05:13:09 - INFO - __main__ - Step 56904: {'lr': 0.00034859006496168764, 'samples': 10925568, 'steps': 56903, 'loss/train': 1.0855777263641357} 11/07/2021 05:13:09 - INFO - __main__ - Step 56905: {'lr': 0.0003485851882753352, 'samples': 10925760, 'steps': 56904, 'loss/train': 1.3407800197601318} 11/07/2021 05:13:10 - INFO - __main__ - Step 56906: {'lr': 0.00034858031154456177, 'samples': 10925952, 'steps': 56905, 'loss/train': 0.8941053152084351} 11/07/2021 05:13:11 - INFO - __main__ - Step 56907: {'lr': 0.0003485754347693696, 'samples': 10926144, 'steps': 56906, 'loss/train': 1.1388531923294067} 11/07/2021 05:13:11 - INFO - __main__ - Step 56908: {'lr': 0.0003485705579497609, 'samples': 10926336, 'steps': 56907, 'loss/train': 1.4479374885559082} 11/07/2021 05:13:11 - INFO - __main__ - Step 56909: {'lr': 0.0003485656810857378, 'samples': 10926528, 'steps': 56908, 'loss/train': 1.2756643295288086} 11/07/2021 05:13:12 - INFO - __main__ - Step 56910: {'lr': 0.00034856080417730253, 'samples': 10926720, 'steps': 56909, 'loss/train': 1.2452208995819092} 11/07/2021 05:13:12 - INFO - __main__ - Step 56911: {'lr': 0.0003485559272244572, 'samples': 10926912, 'steps': 56910, 'loss/train': 1.669407606124878} 11/07/2021 05:13:13 - INFO - __main__ - Step 56912: {'lr': 0.0003485510502272042, 'samples': 10927104, 'steps': 56911, 'loss/train': 2.0323190689086914} 11/07/2021 05:13:13 - INFO - __main__ - Step 56913: {'lr': 0.0003485461731855456, 'samples': 10927296, 'steps': 56912, 'loss/train': 1.4654431343078613} 11/07/2021 05:13:14 - INFO - __main__ - Step 56914: {'lr': 0.0003485412960994836, 'samples': 10927488, 'steps': 56913, 'loss/train': 1.3183684349060059} 11/07/2021 05:13:14 - INFO - __main__ - Step 56915: {'lr': 0.0003485364189690203, 'samples': 10927680, 'steps': 56914, 'loss/train': 1.2402470111846924} 11/07/2021 05:13:14 - INFO - __main__ - Step 56916: {'lr': 0.0003485315417941581, 'samples': 10927872, 'steps': 56915, 'loss/train': 1.2380595207214355} 11/07/2021 05:13:15 - INFO - __main__ - Step 56917: {'lr': 0.00034852666457489917, 'samples': 10928064, 'steps': 56916, 'loss/train': 1.0708080530166626} 11/07/2021 05:13:16 - INFO - __main__ - Step 56918: {'lr': 0.00034852178731124557, 'samples': 10928256, 'steps': 56917, 'loss/train': 1.747732162475586} 11/07/2021 05:13:16 - INFO - __main__ - Step 56919: {'lr': 0.00034851691000319963, 'samples': 10928448, 'steps': 56918, 'loss/train': 1.5586053133010864} 11/07/2021 05:13:16 - INFO - __main__ - Step 56920: {'lr': 0.0003485120326507635, 'samples': 10928640, 'steps': 56919, 'loss/train': 0.9507009387016296} 11/07/2021 05:13:17 - INFO - __main__ - Step 56921: {'lr': 0.0003485071552539393, 'samples': 10928832, 'steps': 56920, 'loss/train': 1.2531992197036743} 11/07/2021 05:13:18 - INFO - __main__ - Step 56922: {'lr': 0.0003485022778127293, 'samples': 10929024, 'steps': 56921, 'loss/train': 1.6206005811691284} 11/07/2021 05:13:18 - INFO - __main__ - Step 56923: {'lr': 0.0003484974003271357, 'samples': 10929216, 'steps': 56922, 'loss/train': 1.4933346509933472} 11/07/2021 05:13:19 - INFO - __main__ - Step 56924: {'lr': 0.0003484925227971607, 'samples': 10929408, 'steps': 56923, 'loss/train': 0.821574866771698} 11/07/2021 05:13:19 - INFO - __main__ - Step 56925: {'lr': 0.0003484876452228065, 'samples': 10929600, 'steps': 56924, 'loss/train': 1.5044890642166138} 11/07/2021 05:13:19 - INFO - __main__ - Step 56926: {'lr': 0.00034848276760407525, 'samples': 10929792, 'steps': 56925, 'loss/train': 1.297538161277771} 11/07/2021 05:13:20 - INFO - __main__ - Step 56927: {'lr': 0.0003484778899409693, 'samples': 10929984, 'steps': 56926, 'loss/train': 1.383144736289978} 11/07/2021 05:13:21 - INFO - __main__ - Step 56928: {'lr': 0.0003484730122334906, 'samples': 10930176, 'steps': 56927, 'loss/train': 1.371925711631775} 11/07/2021 05:13:21 - INFO - __main__ - Step 56929: {'lr': 0.00034846813448164153, 'samples': 10930368, 'steps': 56928, 'loss/train': 1.3051584959030151} 11/07/2021 05:13:21 - INFO - __main__ - Step 56930: {'lr': 0.00034846325668542425, 'samples': 10930560, 'steps': 56929, 'loss/train': 1.477121114730835} 11/07/2021 05:13:22 - INFO - __main__ - Step 56931: {'lr': 0.00034845837884484086, 'samples': 10930752, 'steps': 56930, 'loss/train': 0.9537971019744873} 11/07/2021 05:13:23 - INFO - __main__ - Step 56932: {'lr': 0.00034845350095989377, 'samples': 10930944, 'steps': 56931, 'loss/train': 0.9450234770774841} 11/07/2021 05:13:23 - INFO - __main__ - Step 56933: {'lr': 0.000348448623030585, 'samples': 10931136, 'steps': 56932, 'loss/train': 1.4369124174118042} 11/07/2021 05:13:23 - INFO - __main__ - Step 56934: {'lr': 0.00034844374505691686, 'samples': 10931328, 'steps': 56933, 'loss/train': 0.9749782681465149} 11/07/2021 05:13:24 - INFO - __main__ - Step 56935: {'lr': 0.0003484388670388914, 'samples': 10931520, 'steps': 56934, 'loss/train': 1.6764055490493774} 11/07/2021 05:13:24 - INFO - __main__ - Step 56936: {'lr': 0.0003484339889765109, 'samples': 10931712, 'steps': 56935, 'loss/train': 1.48472261428833} 11/07/2021 05:13:25 - INFO - __main__ - Step 56937: {'lr': 0.0003484291108697776, 'samples': 10931904, 'steps': 56936, 'loss/train': 1.3301887512207031} 11/07/2021 05:13:25 - INFO - __main__ - Step 56938: {'lr': 0.0003484242327186936, 'samples': 10932096, 'steps': 56937, 'loss/train': 1.0789657831192017} 11/07/2021 05:13:26 - INFO - __main__ - Step 56939: {'lr': 0.0003484193545232612, 'samples': 10932288, 'steps': 56938, 'loss/train': 1.1669780015945435} 11/07/2021 05:13:26 - INFO - __main__ - Step 56940: {'lr': 0.00034841447628348267, 'samples': 10932480, 'steps': 56939, 'loss/train': 1.3737448453903198} 11/07/2021 05:13:26 - INFO - __main__ - Step 56941: {'lr': 0.00034840959799936, 'samples': 10932672, 'steps': 56940, 'loss/train': 1.536058783531189} 11/07/2021 05:13:27 - INFO - __main__ - Step 56942: {'lr': 0.0003484047196708955, 'samples': 10932864, 'steps': 56941, 'loss/train': 1.282861351966858} 11/07/2021 05:13:28 - INFO - __main__ - Step 56943: {'lr': 0.00034839984129809125, 'samples': 10933056, 'steps': 56942, 'loss/train': 1.2766464948654175} 11/07/2021 05:13:28 - INFO - __main__ - Step 56944: {'lr': 0.00034839496288094964, 'samples': 10933248, 'steps': 56943, 'loss/train': 1.8850005865097046} 11/07/2021 05:13:28 - INFO - __main__ - Step 56945: {'lr': 0.0003483900844194728, 'samples': 10933440, 'steps': 56944, 'loss/train': 1.4289957284927368} 11/07/2021 05:13:29 - INFO - __main__ - Step 56946: {'lr': 0.00034838520591366285, 'samples': 10933632, 'steps': 56945, 'loss/train': 1.6287003755569458} 11/07/2021 05:13:30 - INFO - __main__ - Step 56947: {'lr': 0.0003483803273635221, 'samples': 10933824, 'steps': 56946, 'loss/train': 0.452557772397995} 11/07/2021 05:13:30 - INFO - __main__ - Step 56948: {'lr': 0.0003483754487690527, 'samples': 10934016, 'steps': 56947, 'loss/train': 1.4415332078933716} 11/07/2021 05:13:31 - INFO - __main__ - Step 56949: {'lr': 0.0003483705701302567, 'samples': 10934208, 'steps': 56948, 'loss/train': 1.4060859680175781} 11/07/2021 05:13:31 - INFO - __main__ - Step 56950: {'lr': 0.0003483656914471366, 'samples': 10934400, 'steps': 56949, 'loss/train': 1.652616024017334} 11/07/2021 05:13:31 - INFO - __main__ - Step 56951: {'lr': 0.00034836081271969436, 'samples': 10934592, 'steps': 56950, 'loss/train': 1.4206058979034424} 11/07/2021 05:13:32 - INFO - __main__ - Step 56952: {'lr': 0.0003483559339479323, 'samples': 10934784, 'steps': 56951, 'loss/train': 1.4728578329086304} 11/07/2021 05:13:33 - INFO - __main__ - Step 56953: {'lr': 0.00034835105513185253, 'samples': 10934976, 'steps': 56952, 'loss/train': 1.665027141571045} 11/07/2021 05:13:33 - INFO - __main__ - Step 56954: {'lr': 0.00034834617627145737, 'samples': 10935168, 'steps': 56953, 'loss/train': 1.2767750024795532} 11/07/2021 05:13:33 - INFO - __main__ - Step 56955: {'lr': 0.00034834129736674885, 'samples': 10935360, 'steps': 56954, 'loss/train': 2.0450494289398193} 11/07/2021 05:13:34 - INFO - __main__ - Step 56956: {'lr': 0.0003483364184177293, 'samples': 10935552, 'steps': 56955, 'loss/train': 1.1888480186462402} 11/07/2021 05:13:34 - INFO - __main__ - Step 56957: {'lr': 0.0003483315394244009, 'samples': 10935744, 'steps': 56956, 'loss/train': 1.5797548294067383} 11/07/2021 05:13:35 - INFO - __main__ - Step 56958: {'lr': 0.00034832666038676576, 'samples': 10935936, 'steps': 56957, 'loss/train': 1.4554880857467651} 11/07/2021 05:13:35 - INFO - __main__ - Step 56959: {'lr': 0.0003483217813048262, 'samples': 10936128, 'steps': 56958, 'loss/train': 1.3941136598587036} 11/07/2021 05:13:36 - INFO - __main__ - Step 56960: {'lr': 0.0003483169021785844, 'samples': 10936320, 'steps': 56959, 'loss/train': 2.511237382888794} 11/07/2021 05:13:36 - INFO - __main__ - Step 56961: {'lr': 0.00034831202300804245, 'samples': 10936512, 'steps': 56960, 'loss/train': 1.3640533685684204} 11/07/2021 05:13:36 - INFO - __main__ - Step 56962: {'lr': 0.0003483071437932026, 'samples': 10936704, 'steps': 56961, 'loss/train': 1.2520326375961304} 11/07/2021 05:13:37 - INFO - __main__ - Step 56963: {'lr': 0.0003483022645340671, 'samples': 10936896, 'steps': 56962, 'loss/train': 1.1164780855178833} 11/07/2021 05:13:38 - INFO - __main__ - Step 56964: {'lr': 0.0003482973852306381, 'samples': 10937088, 'steps': 56963, 'loss/train': 1.4276697635650635} 11/07/2021 05:13:38 - INFO - __main__ - Step 56965: {'lr': 0.00034829250588291785, 'samples': 10937280, 'steps': 56964, 'loss/train': 1.3328008651733398} 11/07/2021 05:13:38 - INFO - __main__ - Step 56966: {'lr': 0.00034828762649090843, 'samples': 10937472, 'steps': 56965, 'loss/train': 1.4390015602111816} 11/07/2021 05:13:39 - INFO - __main__ - Step 56967: {'lr': 0.0003482827470546123, 'samples': 10937664, 'steps': 56966, 'loss/train': 1.0958737134933472} 11/07/2021 05:13:40 - INFO - __main__ - Step 56968: {'lr': 0.00034827786757403136, 'samples': 10937856, 'steps': 56967, 'loss/train': 1.070756196975708} 11/07/2021 05:13:40 - INFO - __main__ - Step 56969: {'lr': 0.00034827298804916793, 'samples': 10938048, 'steps': 56968, 'loss/train': 1.1565850973129272} 11/07/2021 05:13:41 - INFO - __main__ - Step 56970: {'lr': 0.00034826810848002416, 'samples': 10938240, 'steps': 56969, 'loss/train': 1.0730781555175781} 11/07/2021 05:13:41 - INFO - __main__ - Step 56971: {'lr': 0.00034826322886660234, 'samples': 10938432, 'steps': 56970, 'loss/train': 1.6203216314315796} 11/07/2021 05:13:41 - INFO - __main__ - Step 56972: {'lr': 0.00034825834920890463, 'samples': 10938624, 'steps': 56971, 'loss/train': 1.631622314453125} 11/07/2021 05:13:42 - INFO - __main__ - Step 56973: {'lr': 0.00034825346950693325, 'samples': 10938816, 'steps': 56972, 'loss/train': 0.08236096799373627} 11/07/2021 05:13:43 - INFO - __main__ - Step 56974: {'lr': 0.00034824858976069043, 'samples': 10939008, 'steps': 56973, 'loss/train': 1.3933860063552856} 11/07/2021 05:13:43 - INFO - __main__ - Step 56975: {'lr': 0.00034824370997017817, 'samples': 10939200, 'steps': 56974, 'loss/train': 1.330894112586975} 11/07/2021 05:13:43 - INFO - __main__ - Step 56976: {'lr': 0.0003482388301353989, 'samples': 10939392, 'steps': 56975, 'loss/train': 1.251906156539917} 11/07/2021 05:13:44 - INFO - __main__ - Step 56977: {'lr': 0.0003482339502563547, 'samples': 10939584, 'steps': 56976, 'loss/train': 1.060488224029541} 11/07/2021 05:13:45 - INFO - __main__ - Step 56978: {'lr': 0.0003482290703330478, 'samples': 10939776, 'steps': 56977, 'loss/train': 1.015341877937317} 11/07/2021 05:13:45 - INFO - __main__ - Step 56979: {'lr': 0.0003482241903654804, 'samples': 10939968, 'steps': 56978, 'loss/train': 1.4945625066757202} 11/07/2021 05:13:45 - INFO - __main__ - Step 56980: {'lr': 0.00034821931035365465, 'samples': 10940160, 'steps': 56979, 'loss/train': 1.133074402809143} 11/07/2021 05:13:46 - INFO - __main__ - Step 56981: {'lr': 0.0003482144302975729, 'samples': 10940352, 'steps': 56980, 'loss/train': 1.8464810848236084} 11/07/2021 05:13:46 - INFO - __main__ - Step 56982: {'lr': 0.0003482095501972372, 'samples': 10940544, 'steps': 56981, 'loss/train': 1.4704564809799194} 11/07/2021 05:13:47 - INFO - __main__ - Step 56983: {'lr': 0.0003482046700526498, 'samples': 10940736, 'steps': 56982, 'loss/train': 1.6985701322555542} 11/07/2021 05:13:47 - INFO - __main__ - Step 56984: {'lr': 0.0003481997898638128, 'samples': 10940928, 'steps': 56983, 'loss/train': 1.6070499420166016} 11/07/2021 05:13:48 - INFO - __main__ - Step 56985: {'lr': 0.0003481949096307285, 'samples': 10941120, 'steps': 56984, 'loss/train': 1.2578685283660889} 11/07/2021 05:13:48 - INFO - __main__ - Step 56986: {'lr': 0.0003481900293533992, 'samples': 10941312, 'steps': 56985, 'loss/train': 1.5594573020935059} 11/07/2021 05:13:48 - INFO - __main__ - Step 56987: {'lr': 0.00034818514903182696, 'samples': 10941504, 'steps': 56986, 'loss/train': 1.0292149782180786} 11/07/2021 05:13:50 - INFO - __main__ - Step 56988: {'lr': 0.000348180268666014, 'samples': 10941696, 'steps': 56987, 'loss/train': 1.2681105136871338} 11/07/2021 05:13:50 - INFO - __main__ - Step 56989: {'lr': 0.00034817538825596253, 'samples': 10941888, 'steps': 56988, 'loss/train': 1.165054202079773} 11/07/2021 05:13:50 - INFO - __main__ - Step 56990: {'lr': 0.0003481705078016747, 'samples': 10942080, 'steps': 56989, 'loss/train': 1.1151914596557617} 11/07/2021 05:13:51 - INFO - __main__ - Step 56991: {'lr': 0.0003481656273031527, 'samples': 10942272, 'steps': 56990, 'loss/train': 1.0866490602493286} 11/07/2021 05:13:51 - INFO - __main__ - Step 56992: {'lr': 0.0003481607467603989, 'samples': 10942464, 'steps': 56991, 'loss/train': 0.044805631041526794} 11/07/2021 05:13:52 - INFO - __main__ - Step 56993: {'lr': 0.00034815586617341533, 'samples': 10942656, 'steps': 56992, 'loss/train': 1.5680711269378662} 11/07/2021 05:13:53 - INFO - __main__ - Step 56994: {'lr': 0.0003481509855422043, 'samples': 10942848, 'steps': 56993, 'loss/train': 1.6193270683288574} 11/07/2021 05:13:53 - INFO - __main__ - Step 56995: {'lr': 0.0003481461048667679, 'samples': 10943040, 'steps': 56994, 'loss/train': 0.08559868484735489} 11/07/2021 05:13:53 - INFO - __main__ - Step 56996: {'lr': 0.00034814122414710837, 'samples': 10943232, 'steps': 56995, 'loss/train': 1.2014765739440918} 11/07/2021 05:13:54 - INFO - __main__ - Step 56997: {'lr': 0.0003481363433832279, 'samples': 10943424, 'steps': 56996, 'loss/train': 1.3474739789962769} 11/07/2021 05:13:55 - INFO - __main__ - Step 56998: {'lr': 0.00034813146257512876, 'samples': 10943616, 'steps': 56997, 'loss/train': 1.2970685958862305} 11/07/2021 05:13:55 - INFO - __main__ - Step 56999: {'lr': 0.0003481265817228131, 'samples': 10943808, 'steps': 56998, 'loss/train': 1.4136161804199219} 11/07/2021 05:13:56 - INFO - __main__ - Step 57000: {'lr': 0.00034812170082628303, 'samples': 10944000, 'steps': 56999, 'loss/train': 1.2367044687271118} 11/07/2021 05:13:56 - INFO - __main__ - Step 57001: {'lr': 0.00034811681988554095, 'samples': 10944192, 'steps': 57000, 'loss/train': 1.0630079507827759} 11/07/2021 05:13:56 - INFO - __main__ - Step 57002: {'lr': 0.0003481119389005889, 'samples': 10944384, 'steps': 57001, 'loss/train': 1.4385117292404175} 11/07/2021 05:13:58 - INFO - __main__ - Step 57003: {'lr': 0.0003481070578714291, 'samples': 10944576, 'steps': 57002, 'loss/train': 0.06309520453214645} 11/07/2021 05:13:58 - INFO - __main__ - Step 57004: {'lr': 0.0003481021767980638, 'samples': 10944768, 'steps': 57003, 'loss/train': 1.4560445547103882} 11/07/2021 05:13:58 - INFO - __main__ - Step 57005: {'lr': 0.00034809729568049513, 'samples': 10944960, 'steps': 57004, 'loss/train': 1.5791494846343994} 11/07/2021 05:13:59 - INFO - __main__ - Step 57006: {'lr': 0.0003480924145187254, 'samples': 10945152, 'steps': 57005, 'loss/train': 1.7447997331619263} 11/07/2021 05:13:59 - INFO - __main__ - Step 57007: {'lr': 0.0003480875333127567, 'samples': 10945344, 'steps': 57006, 'loss/train': 1.5612694025039673} 11/07/2021 05:13:59 - INFO - __main__ - Step 57008: {'lr': 0.0003480826520625913, 'samples': 10945536, 'steps': 57007, 'loss/train': 1.7285468578338623} 11/07/2021 05:14:00 - INFO - __main__ - Step 57009: {'lr': 0.0003480777707682313, 'samples': 10945728, 'steps': 57008, 'loss/train': 0.1764259934425354} 11/07/2021 05:14:01 - INFO - __main__ - Step 57010: {'lr': 0.00034807288942967905, 'samples': 10945920, 'steps': 57009, 'loss/train': 1.6600234508514404} 11/07/2021 05:14:01 - INFO - __main__ - Step 57011: {'lr': 0.0003480680080469366, 'samples': 10946112, 'steps': 57010, 'loss/train': 1.1775566339492798} 11/07/2021 05:14:01 - INFO - __main__ - Step 57012: {'lr': 0.0003480631266200063, 'samples': 10946304, 'steps': 57011, 'loss/train': 1.5337494611740112} 11/07/2021 05:14:02 - INFO - __main__ - Step 57013: {'lr': 0.0003480582451488902, 'samples': 10946496, 'steps': 57012, 'loss/train': 1.730421543121338} 11/07/2021 05:14:03 - INFO - __main__ - Step 57014: {'lr': 0.00034805336363359066, 'samples': 10946688, 'steps': 57013, 'loss/train': 1.591624140739441} 11/07/2021 05:14:03 - INFO - __main__ - Step 57015: {'lr': 0.00034804848207410974, 'samples': 10946880, 'steps': 57014, 'loss/train': 1.498093843460083} 11/07/2021 05:14:04 - INFO - __main__ - Step 57016: {'lr': 0.00034804360047044965, 'samples': 10947072, 'steps': 57015, 'loss/train': 1.7804088592529297} 11/07/2021 05:14:04 - INFO - __main__ - Step 57017: {'lr': 0.0003480387188226126, 'samples': 10947264, 'steps': 57016, 'loss/train': 1.5107934474945068} 11/07/2021 05:14:04 - INFO - __main__ - Step 57018: {'lr': 0.0003480338371306009, 'samples': 10947456, 'steps': 57017, 'loss/train': 1.2875913381576538} 11/07/2021 05:14:05 - INFO - __main__ - Step 57019: {'lr': 0.0003480289553944166, 'samples': 10947648, 'steps': 57018, 'loss/train': 1.4271272420883179} 11/07/2021 05:14:06 - INFO - __main__ - Step 57020: {'lr': 0.000348024073614062, 'samples': 10947840, 'steps': 57019, 'loss/train': 1.0144290924072266} 11/07/2021 05:14:06 - INFO - __main__ - Step 57021: {'lr': 0.0003480191917895393, 'samples': 10948032, 'steps': 57020, 'loss/train': 1.7339799404144287} 11/07/2021 05:14:06 - INFO - __main__ - Step 57022: {'lr': 0.0003480143099208506, 'samples': 10948224, 'steps': 57021, 'loss/train': 1.292941927909851} 11/07/2021 05:14:07 - INFO - __main__ - Step 57023: {'lr': 0.00034800942800799817, 'samples': 10948416, 'steps': 57022, 'loss/train': 1.318372130393982} 11/07/2021 05:14:08 - INFO - __main__ - Step 57024: {'lr': 0.00034800454605098417, 'samples': 10948608, 'steps': 57023, 'loss/train': 0.3450126349925995} 11/07/2021 05:14:08 - INFO - __main__ - Step 57025: {'lr': 0.00034799966404981095, 'samples': 10948800, 'steps': 57024, 'loss/train': 1.3935389518737793} 11/07/2021 05:14:08 - INFO - __main__ - Step 57026: {'lr': 0.00034799478200448056, 'samples': 10948992, 'steps': 57025, 'loss/train': 1.5002765655517578} 11/07/2021 05:14:09 - INFO - __main__ - Step 57027: {'lr': 0.0003479898999149952, 'samples': 10949184, 'steps': 57026, 'loss/train': 1.3063820600509644} 11/07/2021 05:14:09 - INFO - __main__ - Step 57028: {'lr': 0.00034798501778135704, 'samples': 10949376, 'steps': 57027, 'loss/train': 1.380514144897461} 11/07/2021 05:14:10 - INFO - __main__ - Step 57029: {'lr': 0.0003479801356035684, 'samples': 10949568, 'steps': 57028, 'loss/train': 1.4907722473144531} 11/07/2021 05:14:11 - INFO - __main__ - Step 57030: {'lr': 0.0003479752533816315, 'samples': 10949760, 'steps': 57029, 'loss/train': 1.2079057693481445} 11/07/2021 05:14:11 - INFO - __main__ - Step 57031: {'lr': 0.0003479703711155484, 'samples': 10949952, 'steps': 57030, 'loss/train': 1.1674896478652954} 11/07/2021 05:14:11 - INFO - __main__ - Step 57032: {'lr': 0.00034796548880532135, 'samples': 10950144, 'steps': 57031, 'loss/train': 1.5546785593032837} 11/07/2021 05:14:12 - INFO - __main__ - Step 57033: {'lr': 0.0003479606064509526, 'samples': 10950336, 'steps': 57032, 'loss/train': 1.2576812505722046} 11/07/2021 05:14:13 - INFO - __main__ - Step 57034: {'lr': 0.00034795572405244425, 'samples': 10950528, 'steps': 57033, 'loss/train': 1.206067681312561} 11/07/2021 05:14:13 - INFO - __main__ - Step 57035: {'lr': 0.0003479508416097986, 'samples': 10950720, 'steps': 57034, 'loss/train': 1.2598367929458618} 11/07/2021 05:14:13 - INFO - __main__ - Step 57036: {'lr': 0.0003479459591230177, 'samples': 10950912, 'steps': 57035, 'loss/train': 1.8784977197647095} 11/07/2021 05:14:14 - INFO - __main__ - Step 57037: {'lr': 0.0003479410765921041, 'samples': 10951104, 'steps': 57036, 'loss/train': 1.462232232093811} 11/07/2021 05:14:14 - INFO - __main__ - Step 57038: {'lr': 0.0003479361940170596, 'samples': 10951296, 'steps': 57037, 'loss/train': 1.3565127849578857} 11/07/2021 05:14:15 - INFO - __main__ - Step 57039: {'lr': 0.0003479313113978866, 'samples': 10951488, 'steps': 57038, 'loss/train': 1.4616706371307373} 11/07/2021 05:14:15 - INFO - __main__ - Step 57040: {'lr': 0.00034792642873458725, 'samples': 10951680, 'steps': 57039, 'loss/train': 0.4259514808654785} 11/07/2021 05:14:16 - INFO - __main__ - Step 57041: {'lr': 0.00034792154602716376, 'samples': 10951872, 'steps': 57040, 'loss/train': 0.5904067158699036} 11/07/2021 05:14:16 - INFO - __main__ - Step 57042: {'lr': 0.0003479166632756184, 'samples': 10952064, 'steps': 57041, 'loss/train': 1.4015944004058838} 11/07/2021 05:14:16 - INFO - __main__ - Step 57043: {'lr': 0.0003479117804799532, 'samples': 10952256, 'steps': 57042, 'loss/train': 0.7780531048774719} 11/07/2021 05:14:18 - INFO - __main__ - Step 57044: {'lr': 0.00034790689764017046, 'samples': 10952448, 'steps': 57043, 'loss/train': 1.5652540922164917} 11/07/2021 05:14:18 - INFO - __main__ - Step 57045: {'lr': 0.00034790201475627246, 'samples': 10952640, 'steps': 57044, 'loss/train': 1.007209300994873} 11/07/2021 05:14:18 - INFO - __main__ - Step 57046: {'lr': 0.00034789713182826126, 'samples': 10952832, 'steps': 57045, 'loss/train': 1.5317529439926147} 11/07/2021 05:14:19 - INFO - __main__ - Step 57047: {'lr': 0.0003478922488561392, 'samples': 10953024, 'steps': 57046, 'loss/train': 1.1437493562698364} 11/07/2021 05:14:19 - INFO - __main__ - Step 57048: {'lr': 0.0003478873658399084, 'samples': 10953216, 'steps': 57047, 'loss/train': 0.6343510150909424} 11/07/2021 05:14:20 - INFO - __main__ - Step 57049: {'lr': 0.000347882482779571, 'samples': 10953408, 'steps': 57048, 'loss/train': 0.871766984462738} 11/07/2021 05:14:20 - INFO - __main__ - Step 57050: {'lr': 0.00034787759967512923, 'samples': 10953600, 'steps': 57049, 'loss/train': 1.715659260749817} 11/07/2021 05:14:21 - INFO - __main__ - Step 57051: {'lr': 0.00034787271652658534, 'samples': 10953792, 'steps': 57050, 'loss/train': 1.4466074705123901} 11/07/2021 05:14:21 - INFO - __main__ - Step 57052: {'lr': 0.0003478678333339416, 'samples': 10953984, 'steps': 57051, 'loss/train': 1.6088135242462158} 11/07/2021 05:14:21 - INFO - __main__ - Step 57053: {'lr': 0.0003478629500972, 'samples': 10954176, 'steps': 57052, 'loss/train': 1.1456817388534546} 11/07/2021 05:14:22 - INFO - __main__ - Step 57054: {'lr': 0.0003478580668163631, 'samples': 10954368, 'steps': 57053, 'loss/train': 1.8548530340194702} 11/07/2021 05:14:23 - INFO - __main__ - Step 57055: {'lr': 0.0003478531834914326, 'samples': 10954560, 'steps': 57054, 'loss/train': 1.2103031873703003} 11/07/2021 05:14:23 - INFO - __main__ - Step 57056: {'lr': 0.0003478483001224111, 'samples': 10954752, 'steps': 57055, 'loss/train': 1.7151169776916504} 11/07/2021 05:14:23 - INFO - __main__ - Step 57057: {'lr': 0.00034784341670930066, 'samples': 10954944, 'steps': 57056, 'loss/train': 1.577838659286499} 11/07/2021 05:14:24 - INFO - __main__ - Step 57058: {'lr': 0.00034783853325210344, 'samples': 10955136, 'steps': 57057, 'loss/train': 1.2713603973388672} 11/07/2021 05:14:25 - INFO - __main__ - Step 57059: {'lr': 0.0003478336497508217, 'samples': 10955328, 'steps': 57058, 'loss/train': 1.584445595741272} 11/07/2021 05:14:25 - INFO - __main__ - Step 57060: {'lr': 0.0003478287662054576, 'samples': 10955520, 'steps': 57059, 'loss/train': 1.4578912258148193} 11/07/2021 05:14:25 - INFO - __main__ - Step 57061: {'lr': 0.0003478238826160135, 'samples': 10955712, 'steps': 57060, 'loss/train': 1.5543626546859741} 11/07/2021 05:14:26 - INFO - __main__ - Step 57062: {'lr': 0.00034781899898249136, 'samples': 10955904, 'steps': 57061, 'loss/train': 1.0813395977020264} 11/07/2021 05:14:26 - INFO - __main__ - Step 57063: {'lr': 0.0003478141153048935, 'samples': 10956096, 'steps': 57062, 'loss/train': 1.8367377519607544} 11/07/2021 05:14:27 - INFO - __main__ - Step 57064: {'lr': 0.0003478092315832221, 'samples': 10956288, 'steps': 57063, 'loss/train': 1.458410382270813} 11/07/2021 05:14:28 - INFO - __main__ - Step 57065: {'lr': 0.00034780434781747936, 'samples': 10956480, 'steps': 57064, 'loss/train': 1.5763148069381714} 11/07/2021 05:14:28 - INFO - __main__ - Step 57066: {'lr': 0.0003477994640076675, 'samples': 10956672, 'steps': 57065, 'loss/train': 1.4608738422393799} 11/07/2021 05:14:28 - INFO - __main__ - Step 57067: {'lr': 0.00034779458015378874, 'samples': 10956864, 'steps': 57066, 'loss/train': 1.340052843093872} 11/07/2021 05:14:29 - INFO - __main__ - Step 57068: {'lr': 0.00034778969625584523, 'samples': 10957056, 'steps': 57067, 'loss/train': 1.2458049058914185} 11/07/2021 05:14:30 - INFO - __main__ - Step 57069: {'lr': 0.0003477848123138392, 'samples': 10957248, 'steps': 57068, 'loss/train': 1.28065824508667} 11/07/2021 05:14:30 - INFO - __main__ - Step 57070: {'lr': 0.0003477799283277728, 'samples': 10957440, 'steps': 57069, 'loss/train': 0.7120200395584106} 11/07/2021 05:14:30 - INFO - __main__ - Step 57071: {'lr': 0.0003477750442976483, 'samples': 10957632, 'steps': 57070, 'loss/train': 0.8300448656082153} 11/07/2021 05:14:31 - INFO - __main__ - Step 57072: {'lr': 0.0003477701602234679, 'samples': 10957824, 'steps': 57071, 'loss/train': 1.402589201927185} 11/07/2021 05:14:31 - INFO - __main__ - Step 57073: {'lr': 0.00034776527610523377, 'samples': 10958016, 'steps': 57072, 'loss/train': 1.1367018222808838} 11/07/2021 05:14:31 - INFO - __main__ - Step 57074: {'lr': 0.00034776039194294806, 'samples': 10958208, 'steps': 57073, 'loss/train': 1.4829801321029663} 11/07/2021 05:14:32 - INFO - __main__ - Step 57075: {'lr': 0.0003477555077366131, 'samples': 10958400, 'steps': 57074, 'loss/train': 1.6662720441818237} 11/07/2021 05:14:33 - INFO - __main__ - Step 57076: {'lr': 0.000347750623486231, 'samples': 10958592, 'steps': 57075, 'loss/train': 1.5730881690979004} 11/07/2021 05:14:33 - INFO - __main__ - Step 57077: {'lr': 0.00034774573919180396, 'samples': 10958784, 'steps': 57076, 'loss/train': 1.4155805110931396} 11/07/2021 05:14:33 - INFO - __main__ - Step 57078: {'lr': 0.0003477408548533342, 'samples': 10958976, 'steps': 57077, 'loss/train': 1.7398052215576172} 11/07/2021 05:14:34 - INFO - __main__ - Step 57079: {'lr': 0.0003477359704708239, 'samples': 10959168, 'steps': 57078, 'loss/train': 1.2042269706726074} 11/07/2021 05:14:35 - INFO - __main__ - Step 57080: {'lr': 0.00034773108604427527, 'samples': 10959360, 'steps': 57079, 'loss/train': 0.755284309387207} 11/07/2021 05:14:35 - INFO - __main__ - Step 57081: {'lr': 0.0003477262015736906, 'samples': 10959552, 'steps': 57080, 'loss/train': 1.5055509805679321} 11/07/2021 05:14:35 - INFO - __main__ - Step 57082: {'lr': 0.000347721317059072, 'samples': 10959744, 'steps': 57081, 'loss/train': 1.6597017049789429} 11/07/2021 05:14:36 - INFO - __main__ - Step 57083: {'lr': 0.00034771643250042163, 'samples': 10959936, 'steps': 57082, 'loss/train': 1.3603308200836182} 11/07/2021 05:14:36 - INFO - __main__ - Step 57084: {'lr': 0.0003477115478977417, 'samples': 10960128, 'steps': 57083, 'loss/train': 1.0547682046890259} 11/07/2021 05:14:37 - INFO - __main__ - Step 57085: {'lr': 0.0003477066632510346, 'samples': 10960320, 'steps': 57084, 'loss/train': 2.257406234741211} 11/07/2021 05:14:37 - INFO - __main__ - Step 57086: {'lr': 0.00034770177856030223, 'samples': 10960512, 'steps': 57085, 'loss/train': 1.6284343004226685} 11/07/2021 05:14:38 - INFO - __main__ - Step 57087: {'lr': 0.00034769689382554704, 'samples': 10960704, 'steps': 57086, 'loss/train': 1.7002869844436646} 11/07/2021 05:14:38 - INFO - __main__ - Step 57088: {'lr': 0.0003476920090467711, 'samples': 10960896, 'steps': 57087, 'loss/train': 1.0626585483551025} 11/07/2021 05:14:39 - INFO - __main__ - Step 57089: {'lr': 0.0003476871242239767, 'samples': 10961088, 'steps': 57088, 'loss/train': 0.4999178349971771} 11/07/2021 05:14:39 - INFO - __main__ - Step 57090: {'lr': 0.0003476822393571659, 'samples': 10961280, 'steps': 57089, 'loss/train': 1.420108675956726} 11/07/2021 05:14:40 - INFO - __main__ - Step 57091: {'lr': 0.00034767735444634105, 'samples': 10961472, 'steps': 57090, 'loss/train': 1.3181227445602417} 11/07/2021 05:14:40 - INFO - __main__ - Step 57092: {'lr': 0.00034767246949150425, 'samples': 10961664, 'steps': 57091, 'loss/train': 1.2863893508911133} 11/07/2021 05:14:41 - INFO - __main__ - Step 57093: {'lr': 0.0003476675844926578, 'samples': 10961856, 'steps': 57092, 'loss/train': 1.6348007917404175} 11/07/2021 05:14:41 - INFO - __main__ - Step 57094: {'lr': 0.0003476626994498038, 'samples': 10962048, 'steps': 57093, 'loss/train': 1.8778717517852783} 11/07/2021 05:14:42 - INFO - __main__ - Step 57095: {'lr': 0.0003476578143629445, 'samples': 10962240, 'steps': 57094, 'loss/train': 1.701960563659668} 11/07/2021 05:14:42 - INFO - __main__ - Step 57096: {'lr': 0.0003476529292320821, 'samples': 10962432, 'steps': 57095, 'loss/train': 1.5509767532348633} 11/07/2021 05:14:43 - INFO - __main__ - Step 57097: {'lr': 0.00034764804405721885, 'samples': 10962624, 'steps': 57096, 'loss/train': 1.1595499515533447} 11/07/2021 05:14:43 - INFO - __main__ - Step 57098: {'lr': 0.0003476431588383568, 'samples': 10962816, 'steps': 57097, 'loss/train': 0.8355369567871094} 11/07/2021 05:14:43 - INFO - __main__ - Step 57099: {'lr': 0.0003476382735754983, 'samples': 10963008, 'steps': 57098, 'loss/train': 1.2774282693862915} 11/07/2021 05:14:45 - INFO - __main__ - Step 57100: {'lr': 0.00034763338826864556, 'samples': 10963200, 'steps': 57099, 'loss/train': 1.3770716190338135} 11/07/2021 05:14:45 - INFO - __main__ - Step 57101: {'lr': 0.0003476285029178006, 'samples': 10963392, 'steps': 57100, 'loss/train': 1.096310019493103} 11/07/2021 05:14:46 - INFO - __main__ - Step 57102: {'lr': 0.0003476236175229659, 'samples': 10963584, 'steps': 57101, 'loss/train': 1.3786247968673706} 11/07/2021 05:14:46 - INFO - __main__ - Step 57103: {'lr': 0.0003476187320841434, 'samples': 10963776, 'steps': 57102, 'loss/train': 1.2944613695144653} 11/07/2021 05:14:46 - INFO - __main__ - Step 57104: {'lr': 0.0003476138466013354, 'samples': 10963968, 'steps': 57103, 'loss/train': 1.6037278175354004} 11/07/2021 05:14:47 - INFO - __main__ - Step 57105: {'lr': 0.00034760896107454407, 'samples': 10964160, 'steps': 57104, 'loss/train': 0.710163414478302} 11/07/2021 05:14:47 - INFO - __main__ - Step 57106: {'lr': 0.0003476040755037717, 'samples': 10964352, 'steps': 57105, 'loss/train': 2.0947957038879395} 11/07/2021 05:14:48 - INFO - __main__ - Step 57107: {'lr': 0.00034759918988902045, 'samples': 10964544, 'steps': 57106, 'loss/train': 1.7483853101730347} 11/07/2021 05:14:49 - INFO - __main__ - Step 57108: {'lr': 0.00034759430423029255, 'samples': 10964736, 'steps': 57107, 'loss/train': 1.576919674873352} 11/07/2021 05:14:49 - INFO - __main__ - Step 57109: {'lr': 0.0003475894185275901, 'samples': 10964928, 'steps': 57108, 'loss/train': 1.459820032119751} 11/07/2021 05:14:49 - INFO - __main__ - Step 57110: {'lr': 0.00034758453278091537, 'samples': 10965120, 'steps': 57109, 'loss/train': 1.5835328102111816} 11/07/2021 05:14:50 - INFO - __main__ - Step 57111: {'lr': 0.00034757964699027054, 'samples': 10965312, 'steps': 57110, 'loss/train': 1.4095795154571533} 11/07/2021 05:14:51 - INFO - __main__ - Step 57112: {'lr': 0.0003475747611556579, 'samples': 10965504, 'steps': 57111, 'loss/train': 1.0903464555740356} 11/07/2021 05:14:51 - INFO - __main__ - Step 57113: {'lr': 0.0003475698752770795, 'samples': 10965696, 'steps': 57112, 'loss/train': 1.3601086139678955} 11/07/2021 05:14:51 - INFO - __main__ - Step 57114: {'lr': 0.0003475649893545376, 'samples': 10965888, 'steps': 57113, 'loss/train': 1.4585551023483276} 11/07/2021 05:14:52 - INFO - __main__ - Step 57115: {'lr': 0.0003475601033880346, 'samples': 10966080, 'steps': 57114, 'loss/train': 1.3388257026672363} 11/07/2021 05:14:52 - INFO - __main__ - Step 57116: {'lr': 0.00034755521737757237, 'samples': 10966272, 'steps': 57115, 'loss/train': 1.212584376335144} 11/07/2021 05:14:53 - INFO - __main__ - Step 57117: {'lr': 0.0003475503313231533, 'samples': 10966464, 'steps': 57116, 'loss/train': 1.2603864669799805} 11/07/2021 05:14:53 - INFO - __main__ - Step 57118: {'lr': 0.0003475454452247795, 'samples': 10966656, 'steps': 57117, 'loss/train': 1.2596713304519653} 11/07/2021 05:14:54 - INFO - __main__ - Step 57119: {'lr': 0.00034754055908245326, 'samples': 10966848, 'steps': 57118, 'loss/train': 1.3811181783676147} 11/07/2021 05:14:54 - INFO - __main__ - Step 57120: {'lr': 0.0003475356728961767, 'samples': 10967040, 'steps': 57119, 'loss/train': 0.9532086849212646} 11/07/2021 05:14:54 - INFO - __main__ - Step 57121: {'lr': 0.0003475307866659522, 'samples': 10967232, 'steps': 57120, 'loss/train': 1.3746074438095093} 11/07/2021 05:14:56 - INFO - __main__ - Step 57122: {'lr': 0.00034752590039178175, 'samples': 10967424, 'steps': 57121, 'loss/train': 1.6336432695388794} 11/07/2021 05:14:56 - INFO - __main__ - Step 57123: {'lr': 0.00034752101407366763, 'samples': 10967616, 'steps': 57122, 'loss/train': 1.1915686130523682} 11/07/2021 05:14:56 - INFO - __main__ - Step 57124: {'lr': 0.00034751612771161214, 'samples': 10967808, 'steps': 57123, 'loss/train': 1.6695116758346558} 11/07/2021 05:14:57 - INFO - __main__ - Step 57125: {'lr': 0.0003475112413056173, 'samples': 10968000, 'steps': 57124, 'loss/train': 1.7698873281478882} 11/07/2021 05:14:57 - INFO - __main__ - Step 57126: {'lr': 0.0003475063548556854, 'samples': 10968192, 'steps': 57125, 'loss/train': 1.1600044965744019} 11/07/2021 05:14:58 - INFO - __main__ - Step 57127: {'lr': 0.0003475014683618186, 'samples': 10968384, 'steps': 57126, 'loss/train': 1.1192550659179688} 11/07/2021 05:14:58 - INFO - __main__ - Step 57128: {'lr': 0.00034749658182401923, 'samples': 10968576, 'steps': 57127, 'loss/train': 1.7815827131271362} 11/07/2021 05:14:59 - INFO - __main__ - Step 57129: {'lr': 0.00034749169524228937, 'samples': 10968768, 'steps': 57128, 'loss/train': 1.80890953540802} 11/07/2021 05:14:59 - INFO - __main__ - Step 57130: {'lr': 0.0003474868086166312, 'samples': 10968960, 'steps': 57129, 'loss/train': 1.3178973197937012} 11/07/2021 05:14:59 - INFO - __main__ - Step 57131: {'lr': 0.0003474819219470471, 'samples': 10969152, 'steps': 57130, 'loss/train': 1.3103485107421875} 11/07/2021 05:15:01 - INFO - __main__ - Step 57132: {'lr': 0.0003474770352335391, 'samples': 10969344, 'steps': 57131, 'loss/train': 0.8342475295066833} 11/07/2021 05:15:01 - INFO - __main__ - Step 57133: {'lr': 0.00034747214847610943, 'samples': 10969536, 'steps': 57132, 'loss/train': 1.472228765487671} 11/07/2021 05:15:01 - INFO - __main__ - Step 57134: {'lr': 0.00034746726167476027, 'samples': 10969728, 'steps': 57133, 'loss/train': 1.7928417921066284} 11/07/2021 05:15:02 - INFO - __main__ - Step 57135: {'lr': 0.00034746237482949393, 'samples': 10969920, 'steps': 57134, 'loss/train': 1.3290101289749146} 11/07/2021 05:15:02 - INFO - __main__ - Step 57136: {'lr': 0.0003474574879403126, 'samples': 10970112, 'steps': 57135, 'loss/train': 0.19079530239105225} 11/07/2021 05:15:03 - INFO - __main__ - Step 57137: {'lr': 0.0003474526010072183, 'samples': 10970304, 'steps': 57136, 'loss/train': 1.43719482421875} 11/07/2021 05:15:04 - INFO - __main__ - Step 57138: {'lr': 0.0003474477140302134, 'samples': 10970496, 'steps': 57137, 'loss/train': 1.7045570611953735} 11/07/2021 05:15:04 - INFO - __main__ - Step 57139: {'lr': 0.0003474428270093001, 'samples': 10970688, 'steps': 57138, 'loss/train': 1.9332865476608276} 11/07/2021 05:15:04 - INFO - __main__ - Step 57140: {'lr': 0.00034743793994448057, 'samples': 10970880, 'steps': 57139, 'loss/train': 1.4491996765136719} 11/07/2021 05:15:05 - INFO - __main__ - Step 57141: {'lr': 0.000347433052835757, 'samples': 10971072, 'steps': 57140, 'loss/train': 1.4185638427734375} 11/07/2021 05:15:06 - INFO - __main__ - Step 57142: {'lr': 0.00034742816568313165, 'samples': 10971264, 'steps': 57141, 'loss/train': 0.08232451230287552} 11/07/2021 05:15:06 - INFO - __main__ - Step 57143: {'lr': 0.0003474232784866066, 'samples': 10971456, 'steps': 57142, 'loss/train': 1.1138741970062256} 11/07/2021 05:15:06 - INFO - __main__ - Step 57144: {'lr': 0.0003474183912461841, 'samples': 10971648, 'steps': 57143, 'loss/train': 1.3590716123580933} 11/07/2021 05:15:07 - INFO - __main__ - Step 57145: {'lr': 0.00034741350396186646, 'samples': 10971840, 'steps': 57144, 'loss/train': 1.4189975261688232} 11/07/2021 05:15:07 - INFO - __main__ - Step 57146: {'lr': 0.0003474086166336557, 'samples': 10972032, 'steps': 57145, 'loss/train': 1.466606855392456} 11/07/2021 05:15:07 - INFO - __main__ - Step 57147: {'lr': 0.0003474037292615542, 'samples': 10972224, 'steps': 57146, 'loss/train': 1.2200415134429932} 11/07/2021 05:15:09 - INFO - __main__ - Step 57148: {'lr': 0.000347398841845564, 'samples': 10972416, 'steps': 57147, 'loss/train': 1.6551554203033447} 11/07/2021 05:15:09 - INFO - __main__ - Step 57149: {'lr': 0.0003473939543856875, 'samples': 10972608, 'steps': 57148, 'loss/train': 1.3456071615219116} 11/07/2021 05:15:09 - INFO - __main__ - Step 57150: {'lr': 0.00034738906688192673, 'samples': 10972800, 'steps': 57149, 'loss/train': 1.497469186782837} 11/07/2021 05:15:10 - INFO - __main__ - Step 57151: {'lr': 0.0003473841793342839, 'samples': 10972992, 'steps': 57150, 'loss/train': 1.4696048498153687} 11/07/2021 05:15:10 - INFO - __main__ - Step 57152: {'lr': 0.00034737929174276133, 'samples': 10973184, 'steps': 57151, 'loss/train': 1.2160767316818237} 11/07/2021 05:15:11 - INFO - __main__ - Step 57153: {'lr': 0.0003473744041073611, 'samples': 10973376, 'steps': 57152, 'loss/train': 1.4955682754516602} 11/07/2021 05:15:11 - INFO - __main__ - Step 57154: {'lr': 0.0003473695164280855, 'samples': 10973568, 'steps': 57153, 'loss/train': 1.8641687631607056} 11/07/2021 05:15:12 - INFO - __main__ - Step 57155: {'lr': 0.0003473646287049368, 'samples': 10973760, 'steps': 57154, 'loss/train': 1.1599198579788208} 11/07/2021 05:15:12 - INFO - __main__ - Step 57156: {'lr': 0.00034735974093791697, 'samples': 10973952, 'steps': 57155, 'loss/train': 0.8260247707366943} 11/07/2021 05:15:12 - INFO - __main__ - Step 57157: {'lr': 0.00034735485312702835, 'samples': 10974144, 'steps': 57156, 'loss/train': 1.145938515663147} 11/07/2021 05:15:13 - INFO - __main__ - Step 57158: {'lr': 0.00034734996527227313, 'samples': 10974336, 'steps': 57157, 'loss/train': 1.5853530168533325} 11/07/2021 05:15:14 - INFO - __main__ - Step 57159: {'lr': 0.0003473450773736536, 'samples': 10974528, 'steps': 57158, 'loss/train': 1.3499584197998047} 11/07/2021 05:15:14 - INFO - __main__ - Step 57160: {'lr': 0.00034734018943117183, 'samples': 10974720, 'steps': 57159, 'loss/train': 1.316514492034912} 11/07/2021 05:15:14 - INFO - __main__ - Step 57161: {'lr': 0.00034733530144483003, 'samples': 10974912, 'steps': 57160, 'loss/train': 1.3778836727142334} 11/07/2021 05:15:15 - INFO - __main__ - Step 57162: {'lr': 0.0003473304134146305, 'samples': 10975104, 'steps': 57161, 'loss/train': 2.1690492630004883} 11/07/2021 05:15:16 - INFO - __main__ - Step 57163: {'lr': 0.0003473255253405754, 'samples': 10975296, 'steps': 57162, 'loss/train': 1.465073823928833} 11/07/2021 05:15:16 - INFO - __main__ - Step 57164: {'lr': 0.0003473206372226668, 'samples': 10975488, 'steps': 57163, 'loss/train': 0.9232267141342163} 11/07/2021 05:15:16 - INFO - __main__ - Step 57165: {'lr': 0.0003473157490609071, 'samples': 10975680, 'steps': 57164, 'loss/train': 1.0334287881851196} 11/07/2021 05:15:17 - INFO - __main__ - Step 57166: {'lr': 0.0003473108608552985, 'samples': 10975872, 'steps': 57165, 'loss/train': 1.4061572551727295} 11/07/2021 05:15:17 - INFO - __main__ - Step 57167: {'lr': 0.00034730597260584304, 'samples': 10976064, 'steps': 57166, 'loss/train': 1.1580924987792969} 11/07/2021 05:15:18 - INFO - __main__ - Step 57168: {'lr': 0.0003473010843125431, 'samples': 10976256, 'steps': 57167, 'loss/train': 1.3049694299697876} 11/07/2021 05:15:18 - INFO - __main__ - Step 57169: {'lr': 0.0003472961959754007, 'samples': 10976448, 'steps': 57168, 'loss/train': 1.7472347021102905} 11/07/2021 05:15:19 - INFO - __main__ - Step 57170: {'lr': 0.0003472913075944182, 'samples': 10976640, 'steps': 57169, 'loss/train': 1.2301182746887207} 11/07/2021 05:15:19 - INFO - __main__ - Step 57171: {'lr': 0.00034728641916959767, 'samples': 10976832, 'steps': 57170, 'loss/train': 1.4124161005020142} 11/07/2021 05:15:19 - INFO - __main__ - Step 57172: {'lr': 0.00034728153070094143, 'samples': 10977024, 'steps': 57171, 'loss/train': 1.0823720693588257} 11/07/2021 05:15:21 - INFO - __main__ - Step 57173: {'lr': 0.0003472766421884516, 'samples': 10977216, 'steps': 57172, 'loss/train': 1.1575038433074951} 11/07/2021 05:15:21 - INFO - __main__ - Step 57174: {'lr': 0.00034727175363213046, 'samples': 10977408, 'steps': 57173, 'loss/train': 0.38192084431648254} 11/07/2021 05:15:21 - INFO - __main__ - Step 57175: {'lr': 0.0003472668650319801, 'samples': 10977600, 'steps': 57174, 'loss/train': 1.71419095993042} 11/07/2021 05:15:22 - INFO - __main__ - Step 57176: {'lr': 0.0003472619763880029, 'samples': 10977792, 'steps': 57175, 'loss/train': 1.3020073175430298} 11/07/2021 05:15:22 - INFO - __main__ - Step 57177: {'lr': 0.00034725708770020085, 'samples': 10977984, 'steps': 57176, 'loss/train': 1.2572638988494873} 11/07/2021 05:15:23 - INFO - __main__ - Step 57178: {'lr': 0.0003472521989685763, 'samples': 10978176, 'steps': 57177, 'loss/train': 1.4011561870574951} 11/07/2021 05:15:23 - INFO - __main__ - Step 57179: {'lr': 0.00034724731019313145, 'samples': 10978368, 'steps': 57178, 'loss/train': 1.1475541591644287} 11/07/2021 05:15:24 - INFO - __main__ - Step 57180: {'lr': 0.0003472424213738684, 'samples': 10978560, 'steps': 57179, 'loss/train': 1.1881182193756104} 11/07/2021 05:15:24 - INFO - __main__ - Step 57181: {'lr': 0.0003472375325107894, 'samples': 10978752, 'steps': 57180, 'loss/train': 1.7085241079330444} 11/07/2021 05:15:24 - INFO - __main__ - Step 57182: {'lr': 0.00034723264360389674, 'samples': 10978944, 'steps': 57181, 'loss/train': 1.2377347946166992} 11/07/2021 05:15:25 - INFO - __main__ - Step 57183: {'lr': 0.0003472277546531925, 'samples': 10979136, 'steps': 57182, 'loss/train': 1.4597644805908203} 11/07/2021 05:15:26 - INFO - __main__ - Step 57184: {'lr': 0.00034722286565867897, 'samples': 10979328, 'steps': 57183, 'loss/train': 1.2952302694320679} 11/07/2021 05:15:26 - INFO - __main__ - Step 57185: {'lr': 0.00034721797662035824, 'samples': 10979520, 'steps': 57184, 'loss/train': 0.7012817859649658} 11/07/2021 05:15:26 - INFO - __main__ - Step 57186: {'lr': 0.00034721308753823266, 'samples': 10979712, 'steps': 57185, 'loss/train': 1.0798956155776978} 11/07/2021 05:15:27 - INFO - __main__ - Step 57187: {'lr': 0.00034720819841230433, 'samples': 10979904, 'steps': 57186, 'loss/train': 1.4146571159362793} 11/07/2021 05:15:28 - INFO - __main__ - Step 57188: {'lr': 0.0003472033092425755, 'samples': 10980096, 'steps': 57187, 'loss/train': 1.2016068696975708} 11/07/2021 05:15:28 - INFO - __main__ - Step 57189: {'lr': 0.00034719842002904844, 'samples': 10980288, 'steps': 57188, 'loss/train': 1.0058929920196533} 11/07/2021 05:15:29 - INFO - __main__ - Step 57190: {'lr': 0.00034719353077172516, 'samples': 10980480, 'steps': 57189, 'loss/train': 1.4530718326568604} 11/07/2021 05:15:29 - INFO - __main__ - Step 57191: {'lr': 0.00034718864147060803, 'samples': 10980672, 'steps': 57190, 'loss/train': 1.6307939291000366} 11/07/2021 05:15:29 - INFO - __main__ - Step 57192: {'lr': 0.00034718375212569916, 'samples': 10980864, 'steps': 57191, 'loss/train': 1.438184142112732} 11/07/2021 05:15:30 - INFO - __main__ - Step 57193: {'lr': 0.0003471788627370008, 'samples': 10981056, 'steps': 57192, 'loss/train': 1.6724183559417725} 11/07/2021 05:15:31 - INFO - __main__ - Step 57194: {'lr': 0.0003471739733045151, 'samples': 10981248, 'steps': 57193, 'loss/train': 0.655161440372467} 11/07/2021 05:15:31 - INFO - __main__ - Step 57195: {'lr': 0.00034716908382824435, 'samples': 10981440, 'steps': 57194, 'loss/train': 1.3965442180633545} 11/07/2021 05:15:31 - INFO - __main__ - Step 57196: {'lr': 0.0003471641943081908, 'samples': 10981632, 'steps': 57195, 'loss/train': 1.3838998079299927} 11/07/2021 05:15:32 - INFO - __main__ - Step 57197: {'lr': 0.0003471593047443564, 'samples': 10981824, 'steps': 57196, 'loss/train': 1.01152765750885} 11/07/2021 05:15:32 - INFO - __main__ - Step 57198: {'lr': 0.00034715441513674363, 'samples': 10982016, 'steps': 57197, 'loss/train': 1.327661156654358} 11/07/2021 05:15:33 - INFO - __main__ - Step 57199: {'lr': 0.00034714952548535455, 'samples': 10982208, 'steps': 57198, 'loss/train': 0.8171754479408264} 11/07/2021 05:15:34 - INFO - __main__ - Step 57200: {'lr': 0.0003471446357901914, 'samples': 10982400, 'steps': 57199, 'loss/train': 1.1679540872573853} 11/07/2021 05:15:34 - INFO - __main__ - Step 57201: {'lr': 0.0003471397460512563, 'samples': 10982592, 'steps': 57200, 'loss/train': 1.601515531539917} 11/07/2021 05:15:34 - INFO - __main__ - Step 57202: {'lr': 0.0003471348562685517, 'samples': 10982784, 'steps': 57201, 'loss/train': 1.4043967723846436} 11/07/2021 05:15:35 - INFO - __main__ - Step 57203: {'lr': 0.0003471299664420795, 'samples': 10982976, 'steps': 57202, 'loss/train': 0.9429094195365906} 11/07/2021 05:15:36 - INFO - __main__ - Step 57204: {'lr': 0.00034712507657184207, 'samples': 10983168, 'steps': 57203, 'loss/train': 1.6474299430847168} 11/07/2021 05:15:36 - INFO - __main__ - Step 57205: {'lr': 0.00034712018665784155, 'samples': 10983360, 'steps': 57204, 'loss/train': 1.2825877666473389} 11/07/2021 05:15:36 - INFO - __main__ - Step 57206: {'lr': 0.0003471152967000802, 'samples': 10983552, 'steps': 57205, 'loss/train': 1.379895806312561} 11/07/2021 05:15:37 - INFO - __main__ - Step 57207: {'lr': 0.0003471104066985602, 'samples': 10983744, 'steps': 57206, 'loss/train': 1.354645848274231} 11/07/2021 05:15:37 - INFO - __main__ - Step 57208: {'lr': 0.0003471055166532837, 'samples': 10983936, 'steps': 57207, 'loss/train': 1.132021188735962} 11/07/2021 05:15:38 - INFO - __main__ - Step 57209: {'lr': 0.00034710062656425304, 'samples': 10984128, 'steps': 57208, 'loss/train': 0.8864858150482178} 11/07/2021 05:15:38 - INFO - __main__ - Step 57210: {'lr': 0.0003470957364314703, 'samples': 10984320, 'steps': 57209, 'loss/train': 1.4582781791687012} 11/07/2021 05:15:39 - INFO - __main__ - Step 57211: {'lr': 0.0003470908462549377, 'samples': 10984512, 'steps': 57210, 'loss/train': 1.2331348657608032} 11/07/2021 05:15:39 - INFO - __main__ - Step 57212: {'lr': 0.00034708595603465743, 'samples': 10984704, 'steps': 57211, 'loss/train': 0.7234694957733154} 11/07/2021 05:15:39 - INFO - __main__ - Step 57213: {'lr': 0.0003470810657706318, 'samples': 10984896, 'steps': 57212, 'loss/train': 1.3726696968078613} 11/07/2021 05:15:41 - INFO - __main__ - Step 57214: {'lr': 0.0003470761754628629, 'samples': 10985088, 'steps': 57213, 'loss/train': 1.3842049837112427} 11/07/2021 05:15:41 - INFO - __main__ - Step 57215: {'lr': 0.000347071285111353, 'samples': 10985280, 'steps': 57214, 'loss/train': 1.4608348608016968} 11/07/2021 05:15:41 - INFO - __main__ - Step 57216: {'lr': 0.00034706639471610424, 'samples': 10985472, 'steps': 57215, 'loss/train': 0.8314490914344788} 11/07/2021 05:15:42 - INFO - __main__ - Step 57217: {'lr': 0.0003470615042771189, 'samples': 10985664, 'steps': 57216, 'loss/train': 1.5083117485046387} 11/07/2021 05:15:42 - INFO - __main__ - Step 57218: {'lr': 0.00034705661379439914, 'samples': 10985856, 'steps': 57217, 'loss/train': 0.9466352462768555} 11/07/2021 05:15:43 - INFO - __main__ - Step 57219: {'lr': 0.0003470517232679471, 'samples': 10986048, 'steps': 57218, 'loss/train': 1.3346738815307617} 11/07/2021 05:15:43 - INFO - __main__ - Step 57220: {'lr': 0.0003470468326977651, 'samples': 10986240, 'steps': 57219, 'loss/train': 0.9492648243904114} 11/07/2021 05:15:44 - INFO - __main__ - Step 57221: {'lr': 0.0003470419420838553, 'samples': 10986432, 'steps': 57220, 'loss/train': 1.3113689422607422} 11/07/2021 05:15:44 - INFO - __main__ - Step 57222: {'lr': 0.0003470370514262199, 'samples': 10986624, 'steps': 57221, 'loss/train': 1.5532441139221191} 11/07/2021 05:15:44 - INFO - __main__ - Step 57223: {'lr': 0.0003470321607248611, 'samples': 10986816, 'steps': 57222, 'loss/train': 1.6135445833206177} 11/07/2021 05:15:45 - INFO - __main__ - Step 57224: {'lr': 0.0003470272699797811, 'samples': 10987008, 'steps': 57223, 'loss/train': 1.5716966390609741} 11/07/2021 05:15:46 - INFO - __main__ - Step 57225: {'lr': 0.0003470223791909821, 'samples': 10987200, 'steps': 57224, 'loss/train': 1.6298424005508423} 11/07/2021 05:15:46 - INFO - __main__ - Step 57226: {'lr': 0.0003470174883584664, 'samples': 10987392, 'steps': 57225, 'loss/train': 1.3984442949295044} 11/07/2021 05:15:46 - INFO - __main__ - Step 57227: {'lr': 0.00034701259748223595, 'samples': 10987584, 'steps': 57226, 'loss/train': 1.1414973735809326} 11/07/2021 05:15:47 - INFO - __main__ - Step 57228: {'lr': 0.00034700770656229324, 'samples': 10987776, 'steps': 57227, 'loss/train': 1.8051258325576782} 11/07/2021 05:15:47 - INFO - __main__ - Step 57229: {'lr': 0.00034700281559864034, 'samples': 10987968, 'steps': 57228, 'loss/train': 1.304807186126709} 11/07/2021 05:15:48 - INFO - __main__ - Step 57230: {'lr': 0.00034699792459127945, 'samples': 10988160, 'steps': 57229, 'loss/train': 1.508192539215088} 11/07/2021 05:15:48 - INFO - __main__ - Step 57231: {'lr': 0.00034699303354021285, 'samples': 10988352, 'steps': 57230, 'loss/train': 1.1588743925094604} 11/07/2021 05:15:49 - INFO - __main__ - Step 57232: {'lr': 0.0003469881424454426, 'samples': 10988544, 'steps': 57231, 'loss/train': 1.6050984859466553} 11/07/2021 05:15:49 - INFO - __main__ - Step 57233: {'lr': 0.000346983251306971, 'samples': 10988736, 'steps': 57232, 'loss/train': 1.877519965171814} 11/07/2021 05:15:50 - INFO - __main__ - Step 57234: {'lr': 0.0003469783601248002, 'samples': 10988928, 'steps': 57233, 'loss/train': 1.5269972085952759} 11/07/2021 05:15:51 - INFO - __main__ - Step 57235: {'lr': 0.0003469734688989326, 'samples': 10989120, 'steps': 57234, 'loss/train': 1.4625635147094727} 11/07/2021 05:15:51 - INFO - __main__ - Step 57236: {'lr': 0.0003469685776293702, 'samples': 10989312, 'steps': 57235, 'loss/train': 1.305212378501892} 11/07/2021 05:15:51 - INFO - __main__ - Step 57237: {'lr': 0.0003469636863161152, 'samples': 10989504, 'steps': 57236, 'loss/train': 1.4039185047149658} 11/07/2021 05:15:52 - INFO - __main__ - Step 57238: {'lr': 0.0003469587949591698, 'samples': 10989696, 'steps': 57237, 'loss/train': 1.2485064268112183} 11/07/2021 05:15:52 - INFO - __main__ - Step 57239: {'lr': 0.0003469539035585364, 'samples': 10989888, 'steps': 57238, 'loss/train': 1.3334019184112549} 11/07/2021 05:15:53 - INFO - __main__ - Step 57240: {'lr': 0.00034694901211421695, 'samples': 10990080, 'steps': 57239, 'loss/train': 0.6343439221382141} 11/07/2021 05:15:53 - INFO - __main__ - Step 57241: {'lr': 0.00034694412062621384, 'samples': 10990272, 'steps': 57240, 'loss/train': 1.6191811561584473} 11/07/2021 05:15:54 - INFO - __main__ - Step 57242: {'lr': 0.0003469392290945292, 'samples': 10990464, 'steps': 57241, 'loss/train': 0.6367524862289429} 11/07/2021 05:15:54 - INFO - __main__ - Step 57243: {'lr': 0.00034693433751916525, 'samples': 10990656, 'steps': 57242, 'loss/train': 1.3679172992706299} 11/07/2021 05:15:54 - INFO - __main__ - Step 57244: {'lr': 0.0003469294459001242, 'samples': 10990848, 'steps': 57243, 'loss/train': 1.4669983386993408} 11/07/2021 05:15:56 - INFO - __main__ - Step 57245: {'lr': 0.0003469245542374082, 'samples': 10991040, 'steps': 57244, 'loss/train': 1.5077030658721924} 11/07/2021 05:15:56 - INFO - __main__ - Step 57246: {'lr': 0.00034691966253101947, 'samples': 10991232, 'steps': 57245, 'loss/train': 1.504335880279541} 11/07/2021 05:15:56 - INFO - __main__ - Step 57247: {'lr': 0.00034691477078096025, 'samples': 10991424, 'steps': 57246, 'loss/train': 1.3656114339828491} 11/07/2021 05:15:57 - INFO - __main__ - Step 57248: {'lr': 0.0003469098789872327, 'samples': 10991616, 'steps': 57247, 'loss/train': 1.659846305847168} 11/07/2021 05:15:57 - INFO - __main__ - Step 57249: {'lr': 0.0003469049871498392, 'samples': 10991808, 'steps': 57248, 'loss/train': 0.7325455546379089} 11/07/2021 05:15:58 - INFO - __main__ - Step 57250: {'lr': 0.0003469000952687817, 'samples': 10992000, 'steps': 57249, 'loss/train': 3.3924875259399414} 11/07/2021 05:15:59 - INFO - __main__ - Step 57251: {'lr': 0.0003468952033440625, 'samples': 10992192, 'steps': 57250, 'loss/train': 1.5785741806030273} 11/07/2021 05:15:59 - INFO - __main__ - Step 57252: {'lr': 0.00034689031137568384, 'samples': 10992384, 'steps': 57251, 'loss/train': 1.597221851348877} 11/07/2021 05:15:59 - INFO - __main__ - Step 57253: {'lr': 0.0003468854193636479, 'samples': 10992576, 'steps': 57252, 'loss/train': 1.3004043102264404} 11/07/2021 05:16:00 - INFO - __main__ - Step 57254: {'lr': 0.00034688052730795683, 'samples': 10992768, 'steps': 57253, 'loss/train': 1.6935105323791504} 11/07/2021 05:16:01 - INFO - __main__ - Step 57255: {'lr': 0.00034687563520861294, 'samples': 10992960, 'steps': 57254, 'loss/train': 1.0508880615234375} 11/07/2021 05:16:01 - INFO - __main__ - Step 57256: {'lr': 0.0003468707430656184, 'samples': 10993152, 'steps': 57255, 'loss/train': 1.217911958694458} 11/07/2021 05:16:01 - INFO - __main__ - Step 57257: {'lr': 0.00034686585087897537, 'samples': 10993344, 'steps': 57256, 'loss/train': 1.1200474500656128} 11/07/2021 05:16:02 - INFO - __main__ - Step 57258: {'lr': 0.0003468609586486861, 'samples': 10993536, 'steps': 57257, 'loss/train': 1.9295845031738281} 11/07/2021 05:16:02 - INFO - __main__ - Step 57259: {'lr': 0.00034685606637475274, 'samples': 10993728, 'steps': 57258, 'loss/train': 1.585288405418396} 11/07/2021 05:16:03 - INFO - __main__ - Step 57260: {'lr': 0.0003468511740571776, 'samples': 10993920, 'steps': 57259, 'loss/train': 0.9167661666870117} 11/07/2021 05:16:03 - INFO - __main__ - Step 57261: {'lr': 0.00034684628169596277, 'samples': 10994112, 'steps': 57260, 'loss/train': 1.277349829673767} 11/07/2021 05:16:04 - INFO - __main__ - Step 57262: {'lr': 0.0003468413892911105, 'samples': 10994304, 'steps': 57261, 'loss/train': 1.2273873090744019} 11/07/2021 05:16:04 - INFO - __main__ - Step 57263: {'lr': 0.00034683649684262303, 'samples': 10994496, 'steps': 57262, 'loss/train': 1.5148195028305054} 11/07/2021 05:16:05 - INFO - __main__ - Step 57264: {'lr': 0.0003468316043505025, 'samples': 10994688, 'steps': 57263, 'loss/train': 1.4055837392807007} 11/07/2021 05:16:06 - INFO - __main__ - Step 57265: {'lr': 0.00034682671181475113, 'samples': 10994880, 'steps': 57264, 'loss/train': 1.6199654340744019} 11/07/2021 05:16:06 - INFO - __main__ - Step 57266: {'lr': 0.00034682181923537114, 'samples': 10995072, 'steps': 57265, 'loss/train': 1.3538107872009277} 11/07/2021 05:16:06 - INFO - __main__ - Step 57267: {'lr': 0.0003468169266123647, 'samples': 10995264, 'steps': 57266, 'loss/train': 1.1905224323272705} 11/07/2021 05:16:07 - INFO - __main__ - Step 57268: {'lr': 0.0003468120339457341, 'samples': 10995456, 'steps': 57267, 'loss/train': 1.4119234085083008} 11/07/2021 05:16:07 - INFO - __main__ - Step 57269: {'lr': 0.00034680714123548146, 'samples': 10995648, 'steps': 57268, 'loss/train': 1.2339471578598022} 11/07/2021 05:16:07 - INFO - __main__ - Step 57270: {'lr': 0.0003468022484816091, 'samples': 10995840, 'steps': 57269, 'loss/train': 1.278130292892456} 11/07/2021 05:16:08 - INFO - __main__ - Step 57271: {'lr': 0.0003467973556841191, 'samples': 10996032, 'steps': 57270, 'loss/train': 1.2380174398422241} 11/07/2021 05:16:09 - INFO - __main__ - Step 57272: {'lr': 0.00034679246284301365, 'samples': 10996224, 'steps': 57271, 'loss/train': 1.3780403137207031} 11/07/2021 05:16:09 - INFO - __main__ - Step 57273: {'lr': 0.000346787569958295, 'samples': 10996416, 'steps': 57272, 'loss/train': 1.2759188413619995} 11/07/2021 05:16:09 - INFO - __main__ - Step 57274: {'lr': 0.0003467826770299654, 'samples': 10996608, 'steps': 57273, 'loss/train': 1.464417576789856} 11/07/2021 05:16:10 - INFO - __main__ - Step 57275: {'lr': 0.000346777784058027, 'samples': 10996800, 'steps': 57274, 'loss/train': 1.413720965385437} 11/07/2021 05:16:11 - INFO - __main__ - Step 57276: {'lr': 0.0003467728910424821, 'samples': 10996992, 'steps': 57275, 'loss/train': 1.4004560708999634} 11/07/2021 05:16:11 - INFO - __main__ - Step 57277: {'lr': 0.0003467679979833328, 'samples': 10997184, 'steps': 57276, 'loss/train': 1.57266366481781} 11/07/2021 05:16:11 - INFO - __main__ - Step 57278: {'lr': 0.00034676310488058126, 'samples': 10997376, 'steps': 57277, 'loss/train': 1.1300008296966553} 11/07/2021 05:16:12 - INFO - __main__ - Step 57279: {'lr': 0.00034675821173422983, 'samples': 10997568, 'steps': 57278, 'loss/train': 0.7691164612770081} 11/07/2021 05:16:12 - INFO - __main__ - Step 57280: {'lr': 0.0003467533185442806, 'samples': 10997760, 'steps': 57279, 'loss/train': 1.7687597274780273} 11/07/2021 05:16:13 - INFO - __main__ - Step 57281: {'lr': 0.00034674842531073587, 'samples': 10997952, 'steps': 57280, 'loss/train': 1.3839212656021118} 11/07/2021 05:16:14 - INFO - __main__ - Step 57282: {'lr': 0.0003467435320335978, 'samples': 10998144, 'steps': 57281, 'loss/train': 1.2992281913757324} 11/07/2021 05:16:14 - INFO - __main__ - Step 57283: {'lr': 0.00034673863871286854, 'samples': 10998336, 'steps': 57282, 'loss/train': 1.5022506713867188} 11/07/2021 05:16:14 - INFO - __main__ - Step 57284: {'lr': 0.00034673374534855035, 'samples': 10998528, 'steps': 57283, 'loss/train': 1.6627693176269531} 11/07/2021 05:16:15 - INFO - __main__ - Step 57285: {'lr': 0.0003467288519406454, 'samples': 10998720, 'steps': 57284, 'loss/train': 1.5031355619430542} 11/07/2021 05:16:16 - INFO - __main__ - Step 57286: {'lr': 0.00034672395848915594, 'samples': 10998912, 'steps': 57285, 'loss/train': 1.4156684875488281} 11/07/2021 05:16:16 - INFO - __main__ - Step 57287: {'lr': 0.00034671906499408417, 'samples': 10999104, 'steps': 57286, 'loss/train': 1.271929383277893} 11/07/2021 05:16:16 - INFO - __main__ - Step 57288: {'lr': 0.0003467141714554323, 'samples': 10999296, 'steps': 57287, 'loss/train': 1.4576336145401} 11/07/2021 05:16:17 - INFO - __main__ - Step 57289: {'lr': 0.0003467092778732025, 'samples': 10999488, 'steps': 57288, 'loss/train': 1.3043655157089233} 11/07/2021 05:16:17 - INFO - __main__ - Step 57290: {'lr': 0.00034670438424739695, 'samples': 10999680, 'steps': 57289, 'loss/train': 1.2958300113677979} 11/07/2021 05:16:18 - INFO - __main__ - Step 57291: {'lr': 0.000346699490578018, 'samples': 10999872, 'steps': 57290, 'loss/train': 1.3538079261779785} 11/07/2021 05:16:19 - INFO - __main__ - Step 57292: {'lr': 0.00034669459686506766, 'samples': 11000064, 'steps': 57291, 'loss/train': 0.9389482736587524} 11/07/2021 05:16:19 - INFO - __main__ - Step 57293: {'lr': 0.0003466897031085482, 'samples': 11000256, 'steps': 57292, 'loss/train': 1.2801103591918945} 11/07/2021 05:16:19 - INFO - __main__ - Step 57294: {'lr': 0.000346684809308462, 'samples': 11000448, 'steps': 57293, 'loss/train': 1.7676334381103516} 11/07/2021 05:16:20 - INFO - __main__ - Step 57295: {'lr': 0.00034667991546481096, 'samples': 11000640, 'steps': 57294, 'loss/train': 1.8828591108322144} 11/07/2021 05:16:21 - INFO - __main__ - Step 57296: {'lr': 0.0003466750215775975, 'samples': 11000832, 'steps': 57295, 'loss/train': 1.4249674081802368} 11/07/2021 05:16:21 - INFO - __main__ - Step 57297: {'lr': 0.0003466701276468238, 'samples': 11001024, 'steps': 57296, 'loss/train': 1.5335332155227661} 11/07/2021 05:16:21 - INFO - __main__ - Step 57298: {'lr': 0.00034666523367249196, 'samples': 11001216, 'steps': 57297, 'loss/train': 1.4811491966247559} 11/07/2021 05:16:22 - INFO - __main__ - Step 57299: {'lr': 0.0003466603396546043, 'samples': 11001408, 'steps': 57298, 'loss/train': 1.5754340887069702} 11/07/2021 05:16:22 - INFO - __main__ - Step 57300: {'lr': 0.00034665544559316303, 'samples': 11001600, 'steps': 57299, 'loss/train': 1.0971513986587524} 11/07/2021 05:16:23 - INFO - __main__ - Step 57301: {'lr': 0.0003466505514881703, 'samples': 11001792, 'steps': 57300, 'loss/train': 3.5548441410064697} 11/07/2021 05:16:23 - INFO - __main__ - Step 57302: {'lr': 0.00034664565733962823, 'samples': 11001984, 'steps': 57301, 'loss/train': 1.138931155204773} 11/07/2021 05:16:24 - INFO - __main__ - Step 57303: {'lr': 0.0003466407631475392, 'samples': 11002176, 'steps': 57302, 'loss/train': 1.2977111339569092} 11/07/2021 05:16:24 - INFO - __main__ - Step 57304: {'lr': 0.00034663586891190524, 'samples': 11002368, 'steps': 57303, 'loss/train': 1.4463646411895752} 11/07/2021 05:16:24 - INFO - __main__ - Step 57305: {'lr': 0.0003466309746327288, 'samples': 11002560, 'steps': 57304, 'loss/train': 1.1357953548431396} 11/07/2021 05:16:26 - INFO - __main__ - Step 57306: {'lr': 0.0003466260803100118, 'samples': 11002752, 'steps': 57305, 'loss/train': 1.6303383111953735} 11/07/2021 05:16:26 - INFO - __main__ - Step 57307: {'lr': 0.0003466211859437566, 'samples': 11002944, 'steps': 57306, 'loss/train': 1.6040101051330566} 11/07/2021 05:16:27 - INFO - __main__ - Step 57308: {'lr': 0.00034661629153396543, 'samples': 11003136, 'steps': 57307, 'loss/train': 1.4108153581619263} 11/07/2021 05:16:27 - INFO - __main__ - Step 57309: {'lr': 0.00034661139708064043, 'samples': 11003328, 'steps': 57308, 'loss/train': 0.22060233354568481} 11/07/2021 05:16:27 - INFO - __main__ - Step 57310: {'lr': 0.00034660650258378384, 'samples': 11003520, 'steps': 57309, 'loss/train': 1.1796612739562988} 11/07/2021 05:16:28 - INFO - __main__ - Step 57311: {'lr': 0.00034660160804339784, 'samples': 11003712, 'steps': 57310, 'loss/train': 1.29084312915802} 11/07/2021 05:16:29 - INFO - __main__ - Step 57312: {'lr': 0.0003465967134594847, 'samples': 11003904, 'steps': 57311, 'loss/train': 1.0948700904846191} 11/07/2021 05:16:29 - INFO - __main__ - Step 57313: {'lr': 0.0003465918188320465, 'samples': 11004096, 'steps': 57312, 'loss/train': 0.959750235080719} 11/07/2021 05:16:29 - INFO - __main__ - Step 57314: {'lr': 0.0003465869241610855, 'samples': 11004288, 'steps': 57313, 'loss/train': 1.9232094287872314} 11/07/2021 05:16:30 - INFO - __main__ - Step 57315: {'lr': 0.00034658202944660396, 'samples': 11004480, 'steps': 57314, 'loss/train': 1.5287576913833618} 11/07/2021 05:16:30 - INFO - __main__ - Step 57316: {'lr': 0.000346577134688604, 'samples': 11004672, 'steps': 57315, 'loss/train': 1.2158355712890625} 11/07/2021 05:16:31 - INFO - __main__ - Step 57317: {'lr': 0.00034657223988708796, 'samples': 11004864, 'steps': 57316, 'loss/train': 1.6573148965835571} 11/07/2021 05:16:31 - INFO - __main__ - Step 57318: {'lr': 0.0003465673450420579, 'samples': 11005056, 'steps': 57317, 'loss/train': 1.4946718215942383} 11/07/2021 05:16:32 - INFO - __main__ - Step 57319: {'lr': 0.0003465624501535161, 'samples': 11005248, 'steps': 57318, 'loss/train': 1.2168070077896118} 11/07/2021 05:16:32 - INFO - __main__ - Step 57320: {'lr': 0.0003465575552214648, 'samples': 11005440, 'steps': 57319, 'loss/train': 1.6642897129058838} 11/07/2021 05:16:32 - INFO - __main__ - Step 57321: {'lr': 0.00034655266024590604, 'samples': 11005632, 'steps': 57320, 'loss/train': 1.0355887413024902} 11/07/2021 05:16:33 - INFO - __main__ - Step 57322: {'lr': 0.0003465477652268422, 'samples': 11005824, 'steps': 57321, 'loss/train': 1.2140164375305176} 11/07/2021 05:16:34 - INFO - __main__ - Step 57323: {'lr': 0.0003465428701642755, 'samples': 11006016, 'steps': 57322, 'loss/train': 1.493344783782959} 11/07/2021 05:16:34 - INFO - __main__ - Step 57324: {'lr': 0.00034653797505820795, 'samples': 11006208, 'steps': 57323, 'loss/train': 1.6135051250457764} 11/07/2021 05:16:35 - INFO - __main__ - Step 57325: {'lr': 0.000346533079908642, 'samples': 11006400, 'steps': 57324, 'loss/train': 1.1621938943862915} 11/07/2021 05:16:35 - INFO - __main__ - Step 57326: {'lr': 0.0003465281847155796, 'samples': 11006592, 'steps': 57325, 'loss/train': 1.454559087753296} 11/07/2021 05:16:36 - INFO - __main__ - Step 57327: {'lr': 0.00034652328947902317, 'samples': 11006784, 'steps': 57326, 'loss/train': 1.4118322134017944} 11/07/2021 05:16:36 - INFO - __main__ - Step 57328: {'lr': 0.0003465183941989748, 'samples': 11006976, 'steps': 57327, 'loss/train': 0.8050600290298462} 11/07/2021 05:16:37 - INFO - __main__ - Step 57329: {'lr': 0.00034651349887543674, 'samples': 11007168, 'steps': 57328, 'loss/train': 1.7745147943496704} 11/07/2021 05:16:37 - INFO - __main__ - Step 57330: {'lr': 0.00034650860350841125, 'samples': 11007360, 'steps': 57329, 'loss/train': 1.2334222793579102} 11/07/2021 05:16:37 - INFO - __main__ - Step 57331: {'lr': 0.0003465037080979004, 'samples': 11007552, 'steps': 57330, 'loss/train': 1.2701380252838135} 11/07/2021 05:16:38 - INFO - __main__ - Step 57332: {'lr': 0.0003464988126439065, 'samples': 11007744, 'steps': 57331, 'loss/train': 1.4824724197387695} 11/07/2021 05:16:39 - INFO - __main__ - Step 57333: {'lr': 0.0003464939171464317, 'samples': 11007936, 'steps': 57332, 'loss/train': 1.4551417827606201} 11/07/2021 05:16:39 - INFO - __main__ - Step 57334: {'lr': 0.0003464890216054782, 'samples': 11008128, 'steps': 57333, 'loss/train': 1.0542746782302856} 11/07/2021 05:16:39 - INFO - __main__ - Step 57335: {'lr': 0.0003464841260210483, 'samples': 11008320, 'steps': 57334, 'loss/train': 1.463664174079895} 11/07/2021 05:16:40 - INFO - __main__ - Step 57336: {'lr': 0.0003464792303931441, 'samples': 11008512, 'steps': 57335, 'loss/train': 1.3808726072311401} 11/07/2021 05:16:41 - INFO - __main__ - Step 57337: {'lr': 0.0003464743347217679, 'samples': 11008704, 'steps': 57336, 'loss/train': 1.4817286729812622} 11/07/2021 05:16:41 - INFO - __main__ - Step 57338: {'lr': 0.00034646943900692187, 'samples': 11008896, 'steps': 57337, 'loss/train': 1.192395806312561} 11/07/2021 05:16:41 - INFO - __main__ - Step 57339: {'lr': 0.0003464645432486081, 'samples': 11009088, 'steps': 57338, 'loss/train': 1.324694275856018} 11/07/2021 05:16:42 - INFO - __main__ - Step 57340: {'lr': 0.000346459647446829, 'samples': 11009280, 'steps': 57339, 'loss/train': 1.774898648262024} 11/07/2021 05:16:42 - INFO - __main__ - Step 57341: {'lr': 0.0003464547516015866, 'samples': 11009472, 'steps': 57340, 'loss/train': 1.6069287061691284} 11/07/2021 05:16:43 - INFO - __main__ - Step 57342: {'lr': 0.0003464498557128832, 'samples': 11009664, 'steps': 57341, 'loss/train': 5.762038707733154} 11/07/2021 05:16:43 - INFO - __main__ - Step 57343: {'lr': 0.00034644495978072094, 'samples': 11009856, 'steps': 57342, 'loss/train': 1.7801626920700073} 11/07/2021 05:16:44 - INFO - __main__ - Step 57344: {'lr': 0.00034644006380510215, 'samples': 11010048, 'steps': 57343, 'loss/train': 1.6430482864379883} 11/07/2021 05:16:44 - INFO - __main__ - Step 57345: {'lr': 0.0003464351677860289, 'samples': 11010240, 'steps': 57344, 'loss/train': 1.46976637840271} 11/07/2021 05:16:45 - INFO - __main__ - Step 57346: {'lr': 0.00034643027172350345, 'samples': 11010432, 'steps': 57345, 'loss/train': 1.1630475521087646} 11/07/2021 05:16:45 - INFO - __main__ - Step 57347: {'lr': 0.000346425375617528, 'samples': 11010624, 'steps': 57346, 'loss/train': 0.7415656447410583} 11/07/2021 05:16:46 - INFO - __main__ - Step 57348: {'lr': 0.00034642047946810477, 'samples': 11010816, 'steps': 57347, 'loss/train': 1.3568874597549438} 11/07/2021 05:16:46 - INFO - __main__ - Step 57349: {'lr': 0.000346415583275236, 'samples': 11011008, 'steps': 57348, 'loss/train': 1.4117162227630615} 11/07/2021 05:16:47 - INFO - __main__ - Step 57350: {'lr': 0.00034641068703892387, 'samples': 11011200, 'steps': 57349, 'loss/train': 1.1161599159240723} 11/07/2021 05:16:47 - INFO - __main__ - Step 57351: {'lr': 0.00034640579075917053, 'samples': 11011392, 'steps': 57350, 'loss/train': 1.0881834030151367} 11/07/2021 05:16:47 - INFO - __main__ - Step 57352: {'lr': 0.0003464008944359782, 'samples': 11011584, 'steps': 57351, 'loss/train': 1.755181074142456} 11/07/2021 05:16:48 - INFO - __main__ - Step 57353: {'lr': 0.00034639599806934917, 'samples': 11011776, 'steps': 57352, 'loss/train': 1.1692019701004028} 11/07/2021 05:16:49 - INFO - __main__ - Step 57354: {'lr': 0.0003463911016592856, 'samples': 11011968, 'steps': 57353, 'loss/train': 1.4031453132629395} 11/07/2021 05:16:49 - INFO - __main__ - Step 57355: {'lr': 0.0003463862052057896, 'samples': 11012160, 'steps': 57354, 'loss/train': 1.7824376821517944} 11/07/2021 05:16:49 - INFO - __main__ - Step 57356: {'lr': 0.00034638130870886353, 'samples': 11012352, 'steps': 57355, 'loss/train': 1.1180835962295532} 11/07/2021 05:16:50 - INFO - __main__ - Step 57357: {'lr': 0.0003463764121685096, 'samples': 11012544, 'steps': 57356, 'loss/train': 1.5490951538085938} 11/07/2021 05:16:51 - INFO - __main__ - Step 57358: {'lr': 0.0003463715155847298, 'samples': 11012736, 'steps': 57357, 'loss/train': 1.4867855310440063} 11/07/2021 05:16:51 - INFO - __main__ - Step 57359: {'lr': 0.00034636661895752653, 'samples': 11012928, 'steps': 57358, 'loss/train': 1.299237608909607} 11/07/2021 05:16:52 - INFO - __main__ - Step 57360: {'lr': 0.000346361722286902, 'samples': 11013120, 'steps': 57359, 'loss/train': 1.4953867197036743} 11/07/2021 05:16:52 - INFO - __main__ - Step 57361: {'lr': 0.0003463568255728583, 'samples': 11013312, 'steps': 57360, 'loss/train': 1.8217616081237793} 11/07/2021 05:16:52 - INFO - __main__ - Step 57362: {'lr': 0.0003463519288153977, 'samples': 11013504, 'steps': 57361, 'loss/train': 1.428085207939148} 11/07/2021 05:16:53 - INFO - __main__ - Step 57363: {'lr': 0.00034634703201452243, 'samples': 11013696, 'steps': 57362, 'loss/train': 1.5637208223342896} 11/07/2021 05:16:54 - INFO - __main__ - Step 57364: {'lr': 0.00034634213517023473, 'samples': 11013888, 'steps': 57363, 'loss/train': 1.333274245262146} 11/07/2021 05:16:54 - INFO - __main__ - Step 57365: {'lr': 0.0003463372382825367, 'samples': 11014080, 'steps': 57364, 'loss/train': 0.20709101855754852} 11/07/2021 05:16:54 - INFO - __main__ - Step 57366: {'lr': 0.0003463323413514306, 'samples': 11014272, 'steps': 57365, 'loss/train': 1.1538246870040894} 11/07/2021 05:16:55 - INFO - __main__ - Step 57367: {'lr': 0.0003463274443769186, 'samples': 11014464, 'steps': 57366, 'loss/train': 1.3712096214294434} 11/07/2021 05:16:55 - INFO - __main__ - Step 57368: {'lr': 0.000346322547359003, 'samples': 11014656, 'steps': 57367, 'loss/train': 1.385015606880188} 11/07/2021 05:16:56 - INFO - __main__ - Step 57369: {'lr': 0.00034631765029768594, 'samples': 11014848, 'steps': 57368, 'loss/train': 1.4773645401000977} 11/07/2021 05:16:56 - INFO - __main__ - Step 57370: {'lr': 0.0003463127531929696, 'samples': 11015040, 'steps': 57369, 'loss/train': 1.287975788116455} 11/07/2021 05:16:57 - INFO - __main__ - Step 57371: {'lr': 0.0003463078560448562, 'samples': 11015232, 'steps': 57370, 'loss/train': 1.7418196201324463} 11/07/2021 05:16:57 - INFO - __main__ - Step 57372: {'lr': 0.000346302958853348, 'samples': 11015424, 'steps': 57371, 'loss/train': 1.3799270391464233} 11/07/2021 05:16:57 - INFO - __main__ - Step 57373: {'lr': 0.0003462980616184472, 'samples': 11015616, 'steps': 57372, 'loss/train': 1.3816124200820923} 11/07/2021 05:16:58 - INFO - __main__ - Step 57374: {'lr': 0.0003462931643401559, 'samples': 11015808, 'steps': 57373, 'loss/train': 1.3825258016586304} 11/07/2021 05:16:59 - INFO - __main__ - Step 57375: {'lr': 0.00034628826701847644, 'samples': 11016000, 'steps': 57374, 'loss/train': 0.2318015843629837} 11/07/2021 05:16:59 - INFO - __main__ - Step 57376: {'lr': 0.000346283369653411, 'samples': 11016192, 'steps': 57375, 'loss/train': 1.0265570878982544} 11/07/2021 05:17:00 - INFO - __main__ - Step 57377: {'lr': 0.0003462784722449617, 'samples': 11016384, 'steps': 57376, 'loss/train': 1.0038529634475708} 11/07/2021 05:17:00 - INFO - __main__ - Step 57378: {'lr': 0.00034627357479313087, 'samples': 11016576, 'steps': 57377, 'loss/train': 1.564623236656189} 11/07/2021 05:17:02 - INFO - __main__ - Step 57379: {'lr': 0.0003462686772979206, 'samples': 11016768, 'steps': 57378, 'loss/train': 0.4925709664821625} 11/07/2021 05:17:02 - INFO - __main__ - Step 57380: {'lr': 0.00034626377975933314, 'samples': 11016960, 'steps': 57379, 'loss/train': 1.9429572820663452} 11/07/2021 05:17:02 - INFO - __main__ - Step 57381: {'lr': 0.00034625888217737076, 'samples': 11017152, 'steps': 57380, 'loss/train': 1.6184098720550537} 11/07/2021 05:17:03 - INFO - __main__ - Step 57382: {'lr': 0.0003462539845520356, 'samples': 11017344, 'steps': 57381, 'loss/train': 1.3695964813232422} 11/07/2021 05:17:03 - INFO - __main__ - Step 57383: {'lr': 0.0003462490868833298, 'samples': 11017536, 'steps': 57382, 'loss/train': 1.2670353651046753} 11/07/2021 05:17:03 - INFO - __main__ - Step 57384: {'lr': 0.00034624418917125575, 'samples': 11017728, 'steps': 57383, 'loss/train': 0.5824760794639587} 11/07/2021 05:17:05 - INFO - __main__ - Step 57385: {'lr': 0.00034623929141581555, 'samples': 11017920, 'steps': 57384, 'loss/train': 1.0204366445541382} 11/07/2021 05:17:05 - INFO - __main__ - Step 57386: {'lr': 0.0003462343936170114, 'samples': 11018112, 'steps': 57385, 'loss/train': 1.1853617429733276} 11/07/2021 05:17:05 - INFO - __main__ - Step 57387: {'lr': 0.0003462294957748455, 'samples': 11018304, 'steps': 57386, 'loss/train': 1.394788384437561} 11/07/2021 05:17:06 - INFO - __main__ - Step 57388: {'lr': 0.00034622459788932004, 'samples': 11018496, 'steps': 57387, 'loss/train': 1.415103554725647} 11/07/2021 05:17:06 - INFO - __main__ - Step 57389: {'lr': 0.00034621969996043725, 'samples': 11018688, 'steps': 57388, 'loss/train': 1.2047784328460693} 11/07/2021 05:17:07 - INFO - __main__ - Step 57390: {'lr': 0.0003462148019881994, 'samples': 11018880, 'steps': 57389, 'loss/train': 1.1877517700195312} 11/07/2021 05:17:07 - INFO - __main__ - Step 57391: {'lr': 0.0003462099039726087, 'samples': 11019072, 'steps': 57390, 'loss/train': 1.1441655158996582} 11/07/2021 05:17:08 - INFO - __main__ - Step 57392: {'lr': 0.0003462050059136672, 'samples': 11019264, 'steps': 57391, 'loss/train': 1.7648218870162964} 11/07/2021 05:17:08 - INFO - __main__ - Step 57393: {'lr': 0.00034620010781137724, 'samples': 11019456, 'steps': 57392, 'loss/train': 1.5704514980316162} 11/07/2021 05:17:08 - INFO - __main__ - Step 57394: {'lr': 0.000346195209665741, 'samples': 11019648, 'steps': 57393, 'loss/train': 1.6239277124404907} 11/07/2021 05:17:09 - INFO - __main__ - Step 57395: {'lr': 0.0003461903114767607, 'samples': 11019840, 'steps': 57394, 'loss/train': 0.7149443626403809} 11/07/2021 05:17:10 - INFO - __main__ - Step 57396: {'lr': 0.00034618541324443844, 'samples': 11020032, 'steps': 57395, 'loss/train': 1.5315989255905151} 11/07/2021 05:17:10 - INFO - __main__ - Step 57397: {'lr': 0.0003461805149687767, 'samples': 11020224, 'steps': 57396, 'loss/train': 1.3829823732376099} 11/07/2021 05:17:10 - INFO - __main__ - Step 57398: {'lr': 0.0003461756166497773, 'samples': 11020416, 'steps': 57397, 'loss/train': 1.4139355421066284} 11/07/2021 05:17:11 - INFO - __main__ - Step 57399: {'lr': 0.00034617071828744274, 'samples': 11020608, 'steps': 57398, 'loss/train': 1.1383767127990723} 11/07/2021 05:17:12 - INFO - __main__ - Step 57400: {'lr': 0.00034616581988177516, 'samples': 11020800, 'steps': 57399, 'loss/train': 1.0450670719146729} 11/07/2021 05:17:12 - INFO - __main__ - Step 57401: {'lr': 0.00034616092143277674, 'samples': 11020992, 'steps': 57400, 'loss/train': 0.9170766472816467} 11/07/2021 05:17:13 - INFO - __main__ - Step 57402: {'lr': 0.0003461560229404497, 'samples': 11021184, 'steps': 57401, 'loss/train': 2.8225150108337402} 11/07/2021 05:17:13 - INFO - __main__ - Step 57403: {'lr': 0.0003461511244047962, 'samples': 11021376, 'steps': 57402, 'loss/train': 1.3104650974273682} 11/07/2021 05:17:13 - INFO - __main__ - Step 57404: {'lr': 0.0003461462258258185, 'samples': 11021568, 'steps': 57403, 'loss/train': 1.458613634109497} 11/07/2021 05:17:14 - INFO - __main__ - Step 57405: {'lr': 0.00034614132720351884, 'samples': 11021760, 'steps': 57404, 'loss/train': 0.919306218624115} 11/07/2021 05:17:15 - INFO - __main__ - Step 57406: {'lr': 0.00034613642853789927, 'samples': 11021952, 'steps': 57405, 'loss/train': 1.053566813468933} 11/07/2021 05:17:15 - INFO - __main__ - Step 57407: {'lr': 0.00034613152982896224, 'samples': 11022144, 'steps': 57406, 'loss/train': 1.1296557188034058} 11/07/2021 05:17:15 - INFO - __main__ - Step 57408: {'lr': 0.0003461266310767097, 'samples': 11022336, 'steps': 57407, 'loss/train': 1.511297583580017} 11/07/2021 05:17:16 - INFO - __main__ - Step 57409: {'lr': 0.00034612173228114405, 'samples': 11022528, 'steps': 57408, 'loss/train': 1.3894857168197632} 11/07/2021 05:17:17 - INFO - __main__ - Step 57410: {'lr': 0.00034611683344226745, 'samples': 11022720, 'steps': 57409, 'loss/train': 1.1479789018630981} 11/07/2021 05:17:17 - INFO - __main__ - Step 57411: {'lr': 0.0003461119345600821, 'samples': 11022912, 'steps': 57410, 'loss/train': 1.1791572570800781} 11/07/2021 05:17:17 - INFO - __main__ - Step 57412: {'lr': 0.0003461070356345902, 'samples': 11023104, 'steps': 57411, 'loss/train': 1.347753643989563} 11/07/2021 05:17:18 - INFO - __main__ - Step 57413: {'lr': 0.0003461021366657939, 'samples': 11023296, 'steps': 57412, 'loss/train': 1.4777131080627441} 11/07/2021 05:17:18 - INFO - __main__ - Step 57414: {'lr': 0.00034609723765369546, 'samples': 11023488, 'steps': 57413, 'loss/train': 0.8746947646141052} 11/07/2021 05:17:19 - INFO - __main__ - Step 57415: {'lr': 0.00034609233859829707, 'samples': 11023680, 'steps': 57414, 'loss/train': 1.1914379596710205} 11/07/2021 05:17:20 - INFO - __main__ - Step 57416: {'lr': 0.00034608743949960096, 'samples': 11023872, 'steps': 57415, 'loss/train': 0.7074412703514099} 11/07/2021 05:17:20 - INFO - __main__ - Step 57417: {'lr': 0.00034608254035760946, 'samples': 11024064, 'steps': 57416, 'loss/train': 1.6975936889648438} 11/07/2021 05:17:20 - INFO - __main__ - Step 57418: {'lr': 0.0003460776411723245, 'samples': 11024256, 'steps': 57417, 'loss/train': 1.3936597108840942} 11/07/2021 05:17:21 - INFO - __main__ - Step 57419: {'lr': 0.00034607274194374847, 'samples': 11024448, 'steps': 57418, 'loss/train': 1.178491234779358} 11/07/2021 05:17:21 - INFO - __main__ - Step 57420: {'lr': 0.00034606784267188364, 'samples': 11024640, 'steps': 57419, 'loss/train': 1.1270482540130615} 11/07/2021 05:17:22 - INFO - __main__ - Step 57421: {'lr': 0.000346062943356732, 'samples': 11024832, 'steps': 57420, 'loss/train': 1.1398437023162842} 11/07/2021 05:17:22 - INFO - __main__ - Step 57422: {'lr': 0.00034605804399829595, 'samples': 11025024, 'steps': 57421, 'loss/train': 1.578539252281189} 11/07/2021 05:17:23 - INFO - __main__ - Step 57423: {'lr': 0.00034605314459657763, 'samples': 11025216, 'steps': 57422, 'loss/train': 1.3543661832809448} 11/07/2021 05:17:23 - INFO - __main__ - Step 57424: {'lr': 0.00034604824515157916, 'samples': 11025408, 'steps': 57423, 'loss/train': 1.1689528226852417} 11/07/2021 05:17:23 - INFO - __main__ - Step 57425: {'lr': 0.0003460433456633029, 'samples': 11025600, 'steps': 57424, 'loss/train': 1.2760167121887207} 11/07/2021 05:17:25 - INFO - __main__ - Step 57426: {'lr': 0.000346038446131751, 'samples': 11025792, 'steps': 57425, 'loss/train': 1.5373750925064087} 11/07/2021 05:17:25 - INFO - __main__ - Step 57427: {'lr': 0.0003460335465569256, 'samples': 11025984, 'steps': 57426, 'loss/train': 0.09772712737321854} 11/07/2021 05:17:25 - INFO - __main__ - Step 57428: {'lr': 0.0003460286469388291, 'samples': 11026176, 'steps': 57427, 'loss/train': 1.454162836074829} 11/07/2021 05:17:26 - INFO - __main__ - Step 57429: {'lr': 0.0003460237472774634, 'samples': 11026368, 'steps': 57428, 'loss/train': 1.8001880645751953} 11/07/2021 05:17:26 - INFO - __main__ - Step 57430: {'lr': 0.000346018847572831, 'samples': 11026560, 'steps': 57429, 'loss/train': 1.9741418361663818} 11/07/2021 05:17:27 - INFO - __main__ - Step 57431: {'lr': 0.00034601394782493393, 'samples': 11026752, 'steps': 57430, 'loss/train': 1.4095287322998047} 11/07/2021 05:17:28 - INFO - __main__ - Step 57432: {'lr': 0.00034600904803377454, 'samples': 11026944, 'steps': 57431, 'loss/train': 1.6216919422149658} 11/07/2021 05:17:28 - INFO - __main__ - Step 57433: {'lr': 0.0003460041481993549, 'samples': 11027136, 'steps': 57432, 'loss/train': 1.2803337574005127} 11/07/2021 05:17:28 - INFO - __main__ - Step 57434: {'lr': 0.0003459992483216773, 'samples': 11027328, 'steps': 57433, 'loss/train': 1.4107896089553833} 11/07/2021 05:17:29 - INFO - __main__ - Step 57435: {'lr': 0.0003459943484007438, 'samples': 11027520, 'steps': 57434, 'loss/train': 0.09461843967437744} 11/07/2021 05:17:30 - INFO - __main__ - Step 57436: {'lr': 0.0003459894484365568, 'samples': 11027712, 'steps': 57435, 'loss/train': 1.2363675832748413} 11/07/2021 05:17:30 - INFO - __main__ - Step 57437: {'lr': 0.0003459845484291185, 'samples': 11027904, 'steps': 57436, 'loss/train': 1.6568247079849243} 11/07/2021 05:17:30 - INFO - __main__ - Step 57438: {'lr': 0.00034597964837843097, 'samples': 11028096, 'steps': 57437, 'loss/train': 0.8117554187774658} 11/07/2021 05:17:31 - INFO - __main__ - Step 57439: {'lr': 0.00034597474828449646, 'samples': 11028288, 'steps': 57438, 'loss/train': 1.149634599685669} 11/07/2021 05:17:31 - INFO - __main__ - Step 57440: {'lr': 0.00034596984814731736, 'samples': 11028480, 'steps': 57439, 'loss/train': 1.4476789236068726} 11/07/2021 05:17:32 - INFO - __main__ - Step 57441: {'lr': 0.0003459649479668956, 'samples': 11028672, 'steps': 57440, 'loss/train': 1.3072500228881836} 11/07/2021 05:17:33 - INFO - __main__ - Step 57442: {'lr': 0.00034596004774323355, 'samples': 11028864, 'steps': 57441, 'loss/train': 1.5590335130691528} 11/07/2021 05:17:33 - INFO - __main__ - Step 57443: {'lr': 0.0003459551474763334, 'samples': 11029056, 'steps': 57442, 'loss/train': 1.6526044607162476} 11/07/2021 05:17:33 - INFO - __main__ - Step 57444: {'lr': 0.00034595024716619726, 'samples': 11029248, 'steps': 57443, 'loss/train': 1.0462483167648315} 11/07/2021 05:17:34 - INFO - __main__ - Step 57445: {'lr': 0.0003459453468128276, 'samples': 11029440, 'steps': 57444, 'loss/train': 1.490347981452942} 11/07/2021 05:17:35 - INFO - __main__ - Step 57446: {'lr': 0.0003459404464162263, 'samples': 11029632, 'steps': 57445, 'loss/train': 1.1569639444351196} 11/07/2021 05:17:35 - INFO - __main__ - Step 57447: {'lr': 0.0003459355459763957, 'samples': 11029824, 'steps': 57446, 'loss/train': 1.3507537841796875} 11/07/2021 05:17:35 - INFO - __main__ - Step 57448: {'lr': 0.0003459306454933381, 'samples': 11030016, 'steps': 57447, 'loss/train': 1.0689138174057007} 11/07/2021 05:17:36 - INFO - __main__ - Step 57449: {'lr': 0.0003459257449670555, 'samples': 11030208, 'steps': 57448, 'loss/train': 1.300590991973877} 11/07/2021 05:17:36 - INFO - __main__ - Step 57450: {'lr': 0.0003459208443975504, 'samples': 11030400, 'steps': 57449, 'loss/train': 2.5017642974853516} 11/07/2021 05:17:37 - INFO - __main__ - Step 57451: {'lr': 0.00034591594378482484, 'samples': 11030592, 'steps': 57450, 'loss/train': 1.2393287420272827} 11/07/2021 05:17:37 - INFO - __main__ - Step 57452: {'lr': 0.00034591104312888096, 'samples': 11030784, 'steps': 57451, 'loss/train': 2.2869598865509033} 11/07/2021 05:17:38 - INFO - __main__ - Step 57453: {'lr': 0.00034590614242972106, 'samples': 11030976, 'steps': 57452, 'loss/train': 1.5786266326904297} 11/07/2021 05:17:38 - INFO - __main__ - Step 57454: {'lr': 0.00034590124168734735, 'samples': 11031168, 'steps': 57453, 'loss/train': 1.2358040809631348} 11/07/2021 05:17:38 - INFO - __main__ - Step 57455: {'lr': 0.00034589634090176195, 'samples': 11031360, 'steps': 57454, 'loss/train': 1.4128271341323853} 11/07/2021 05:17:39 - INFO - __main__ - Step 57456: {'lr': 0.0003458914400729672, 'samples': 11031552, 'steps': 57455, 'loss/train': 1.3198496103286743} 11/07/2021 05:17:40 - INFO - __main__ - Step 57457: {'lr': 0.00034588653920096524, 'samples': 11031744, 'steps': 57456, 'loss/train': 0.814330518245697} 11/07/2021 05:17:40 - INFO - __main__ - Step 57458: {'lr': 0.00034588163828575837, 'samples': 11031936, 'steps': 57457, 'loss/train': 1.571532130241394} 11/07/2021 05:17:41 - INFO - __main__ - Step 57459: {'lr': 0.0003458767373273486, 'samples': 11032128, 'steps': 57458, 'loss/train': 1.1901860237121582} 11/07/2021 05:17:41 - INFO - __main__ - Step 57460: {'lr': 0.00034587183632573825, 'samples': 11032320, 'steps': 57459, 'loss/train': 1.4375938177108765} 11/07/2021 05:17:41 - INFO - __main__ - Step 57461: {'lr': 0.00034586693528092954, 'samples': 11032512, 'steps': 57460, 'loss/train': 1.5013798475265503} 11/07/2021 05:17:42 - INFO - __main__ - Step 57462: {'lr': 0.0003458620341929247, 'samples': 11032704, 'steps': 57461, 'loss/train': 0.18268518149852753} 11/07/2021 05:17:43 - INFO - __main__ - Step 57463: {'lr': 0.0003458571330617259, 'samples': 11032896, 'steps': 57462, 'loss/train': 1.1528363227844238} 11/07/2021 05:17:43 - INFO - __main__ - Step 57464: {'lr': 0.00034585223188733535, 'samples': 11033088, 'steps': 57463, 'loss/train': 2.0035576820373535} 11/07/2021 05:17:43 - INFO - __main__ - Step 57465: {'lr': 0.0003458473306697553, 'samples': 11033280, 'steps': 57464, 'loss/train': 1.2071844339370728} 11/07/2021 05:17:44 - INFO - __main__ - Step 57466: {'lr': 0.0003458424294089879, 'samples': 11033472, 'steps': 57465, 'loss/train': 1.8875221014022827} 11/07/2021 05:17:45 - INFO - __main__ - Step 57467: {'lr': 0.00034583752810503533, 'samples': 11033664, 'steps': 57466, 'loss/train': 1.198325276374817} 11/07/2021 05:17:45 - INFO - __main__ - Step 57468: {'lr': 0.0003458326267578999, 'samples': 11033856, 'steps': 57467, 'loss/train': 1.4165171384811401} 11/07/2021 05:17:45 - INFO - __main__ - Step 57469: {'lr': 0.0003458277253675837, 'samples': 11034048, 'steps': 57468, 'loss/train': 1.3195197582244873} 11/07/2021 05:17:46 - INFO - __main__ - Step 57470: {'lr': 0.0003458228239340891, 'samples': 11034240, 'steps': 57469, 'loss/train': 1.482831358909607} 11/07/2021 05:17:46 - INFO - __main__ - Step 57471: {'lr': 0.0003458179224574182, 'samples': 11034432, 'steps': 57470, 'loss/train': 0.546582818031311} 11/07/2021 05:17:47 - INFO - __main__ - Step 57472: {'lr': 0.00034581302093757317, 'samples': 11034624, 'steps': 57471, 'loss/train': 1.2985109090805054} 11/07/2021 05:17:48 - INFO - __main__ - Step 57473: {'lr': 0.0003458081193745563, 'samples': 11034816, 'steps': 57472, 'loss/train': 1.4421230554580688} 11/07/2021 05:17:48 - INFO - __main__ - Step 57474: {'lr': 0.00034580321776836974, 'samples': 11035008, 'steps': 57473, 'loss/train': 1.6130874156951904} 11/07/2021 05:17:49 - INFO - __main__ - Step 57475: {'lr': 0.0003457983161190158, 'samples': 11035200, 'steps': 57474, 'loss/train': 1.1718443632125854} 11/07/2021 05:17:49 - INFO - __main__ - Step 57476: {'lr': 0.00034579341442649654, 'samples': 11035392, 'steps': 57475, 'loss/train': 1.7045835256576538} 11/07/2021 05:17:49 - INFO - __main__ - Step 57477: {'lr': 0.00034578851269081426, 'samples': 11035584, 'steps': 57476, 'loss/train': 1.056429386138916} 11/07/2021 05:17:50 - INFO - __main__ - Step 57478: {'lr': 0.0003457836109119712, 'samples': 11035776, 'steps': 57477, 'loss/train': 1.4926329851150513} 11/07/2021 05:17:51 - INFO - __main__ - Step 57479: {'lr': 0.0003457787090899695, 'samples': 11035968, 'steps': 57478, 'loss/train': 1.235234022140503} 11/07/2021 05:17:51 - INFO - __main__ - Step 57480: {'lr': 0.00034577380722481137, 'samples': 11036160, 'steps': 57479, 'loss/train': 1.5416814088821411} 11/07/2021 05:17:51 - INFO - __main__ - Step 57481: {'lr': 0.00034576890531649905, 'samples': 11036352, 'steps': 57480, 'loss/train': 0.5756336450576782} 11/07/2021 05:17:52 - INFO - __main__ - Step 57482: {'lr': 0.0003457640033650348, 'samples': 11036544, 'steps': 57481, 'loss/train': 1.221463680267334} 11/07/2021 05:17:53 - INFO - __main__ - Step 57483: {'lr': 0.00034575910137042064, 'samples': 11036736, 'steps': 57482, 'loss/train': 0.822562575340271} 11/07/2021 05:17:53 - INFO - __main__ - Step 57484: {'lr': 0.000345754199332659, 'samples': 11036928, 'steps': 57483, 'loss/train': 1.3301583528518677} 11/07/2021 05:17:54 - INFO - __main__ - Step 57485: {'lr': 0.00034574929725175203, 'samples': 11037120, 'steps': 57484, 'loss/train': 0.14026418328285217} 11/07/2021 05:17:54 - INFO - __main__ - Step 57486: {'lr': 0.0003457443951277018, 'samples': 11037312, 'steps': 57485, 'loss/train': 1.0388625860214233} 11/07/2021 05:17:54 - INFO - __main__ - Step 57487: {'lr': 0.00034573949296051065, 'samples': 11037504, 'steps': 57486, 'loss/train': 1.5076392889022827} 11/07/2021 05:17:55 - INFO - __main__ - Step 57488: {'lr': 0.0003457345907501808, 'samples': 11037696, 'steps': 57487, 'loss/train': 1.0548630952835083} 11/07/2021 05:17:56 - INFO - __main__ - Step 57489: {'lr': 0.0003457296884967144, 'samples': 11037888, 'steps': 57488, 'loss/train': 0.9093506336212158} 11/07/2021 05:17:56 - INFO - __main__ - Step 57490: {'lr': 0.0003457247862001137, 'samples': 11038080, 'steps': 57489, 'loss/train': 1.5272330045700073} 11/07/2021 05:17:56 - INFO - __main__ - Step 57491: {'lr': 0.0003457198838603809, 'samples': 11038272, 'steps': 57490, 'loss/train': 1.4084599018096924} 11/07/2021 05:17:57 - INFO - __main__ - Step 57492: {'lr': 0.0003457149814775182, 'samples': 11038464, 'steps': 57491, 'loss/train': 0.740615725517273} 11/07/2021 05:17:59 - INFO - __main__ - Step 57493: {'lr': 0.00034571007905152774, 'samples': 11038656, 'steps': 57492, 'loss/train': 1.2653871774673462} 11/07/2021 05:17:59 - INFO - __main__ - Step 57494: {'lr': 0.00034570517658241186, 'samples': 11038848, 'steps': 57493, 'loss/train': 1.5141576528549194} 11/07/2021 05:17:59 - INFO - __main__ - Step 57495: {'lr': 0.00034570027407017264, 'samples': 11039040, 'steps': 57494, 'loss/train': 1.4731392860412598} 11/07/2021 05:18:00 - INFO - __main__ - Step 57496: {'lr': 0.0003456953715148124, 'samples': 11039232, 'steps': 57495, 'loss/train': 1.3965580463409424} 11/07/2021 05:18:00 - INFO - __main__ - Step 57497: {'lr': 0.0003456904689163333, 'samples': 11039424, 'steps': 57496, 'loss/train': 1.3600486516952515} 11/07/2021 05:18:01 - INFO - __main__ - Step 57498: {'lr': 0.0003456855662747376, 'samples': 11039616, 'steps': 57497, 'loss/train': 1.4234179258346558} 11/07/2021 05:18:01 - INFO - __main__ - Step 57499: {'lr': 0.0003456806635900274, 'samples': 11039808, 'steps': 57498, 'loss/train': 0.7817096710205078} 11/07/2021 05:18:02 - INFO - __main__ - Step 57500: {'lr': 0.00034567576086220493, 'samples': 11040000, 'steps': 57499, 'loss/train': 1.321328043937683} 11/07/2021 05:18:02 - INFO - __main__ - Step 57501: {'lr': 0.0003456708580912725, 'samples': 11040192, 'steps': 57500, 'loss/train': 1.4732120037078857} 11/07/2021 05:18:03 - INFO - __main__ - Step 57502: {'lr': 0.0003456659552772322, 'samples': 11040384, 'steps': 57501, 'loss/train': 1.3813444375991821} 11/07/2021 05:18:03 - INFO - __main__ - Step 57503: {'lr': 0.0003456610524200863, 'samples': 11040576, 'steps': 57502, 'loss/train': 1.4180666208267212} 11/07/2021 05:18:03 - INFO - __main__ - Step 57504: {'lr': 0.00034565614951983706, 'samples': 11040768, 'steps': 57503, 'loss/train': 1.3131844997406006} 11/07/2021 05:18:04 - INFO - __main__ - Step 57505: {'lr': 0.00034565124657648665, 'samples': 11040960, 'steps': 57504, 'loss/train': 1.6327060461044312} 11/07/2021 05:18:05 - INFO - __main__ - Step 57506: {'lr': 0.0003456463435900372, 'samples': 11041152, 'steps': 57505, 'loss/train': 1.7291350364685059} 11/07/2021 05:18:05 - INFO - __main__ - Step 57507: {'lr': 0.0003456414405604911, 'samples': 11041344, 'steps': 57506, 'loss/train': 1.431210994720459} 11/07/2021 05:18:05 - INFO - __main__ - Step 57508: {'lr': 0.0003456365374878503, 'samples': 11041536, 'steps': 57507, 'loss/train': 1.4372285604476929} 11/07/2021 05:18:06 - INFO - __main__ - Step 57509: {'lr': 0.00034563163437211717, 'samples': 11041728, 'steps': 57508, 'loss/train': 1.177987813949585} 11/07/2021 05:18:07 - INFO - __main__ - Step 57510: {'lr': 0.000345626731213294, 'samples': 11041920, 'steps': 57509, 'loss/train': 1.9565181732177734} 11/07/2021 05:18:07 - INFO - __main__ - Step 57511: {'lr': 0.00034562182801138277, 'samples': 11042112, 'steps': 57510, 'loss/train': 1.3832385540008545} 11/07/2021 05:18:08 - INFO - __main__ - Step 57512: {'lr': 0.00034561692476638595, 'samples': 11042304, 'steps': 57511, 'loss/train': 1.446972131729126} 11/07/2021 05:18:08 - INFO - __main__ - Step 57513: {'lr': 0.00034561202147830554, 'samples': 11042496, 'steps': 57512, 'loss/train': 1.1404516696929932} 11/07/2021 05:18:08 - INFO - __main__ - Step 57514: {'lr': 0.00034560711814714387, 'samples': 11042688, 'steps': 57513, 'loss/train': 1.307263970375061} 11/07/2021 05:18:10 - INFO - __main__ - Step 57515: {'lr': 0.0003456022147729031, 'samples': 11042880, 'steps': 57514, 'loss/train': 1.6337566375732422} 11/07/2021 05:18:10 - INFO - __main__ - Step 57516: {'lr': 0.00034559731135558536, 'samples': 11043072, 'steps': 57515, 'loss/train': 1.3496122360229492} 11/07/2021 05:18:10 - INFO - __main__ - Step 57517: {'lr': 0.000345592407895193, 'samples': 11043264, 'steps': 57516, 'loss/train': 1.1995166540145874} 11/07/2021 05:18:11 - INFO - __main__ - Step 57518: {'lr': 0.00034558750439172826, 'samples': 11043456, 'steps': 57517, 'loss/train': 0.35492417216300964} 11/07/2021 05:18:11 - INFO - __main__ - Step 57519: {'lr': 0.0003455826008451932, 'samples': 11043648, 'steps': 57518, 'loss/train': 1.1940852403640747} 11/07/2021 05:18:12 - INFO - __main__ - Step 57520: {'lr': 0.00034557769725559014, 'samples': 11043840, 'steps': 57519, 'loss/train': 1.813337802886963} 11/07/2021 05:18:12 - INFO - __main__ - Step 57521: {'lr': 0.00034557279362292117, 'samples': 11044032, 'steps': 57520, 'loss/train': 1.3919909000396729} 11/07/2021 05:18:13 - INFO - __main__ - Step 57522: {'lr': 0.00034556788994718855, 'samples': 11044224, 'steps': 57521, 'loss/train': 1.758271336555481} 11/07/2021 05:18:13 - INFO - __main__ - Step 57523: {'lr': 0.00034556298622839463, 'samples': 11044416, 'steps': 57522, 'loss/train': 1.1725791692733765} 11/07/2021 05:18:13 - INFO - __main__ - Step 57524: {'lr': 0.0003455580824665414, 'samples': 11044608, 'steps': 57523, 'loss/train': 1.4767088890075684} 11/07/2021 05:18:15 - INFO - __main__ - Step 57525: {'lr': 0.0003455531786616313, 'samples': 11044800, 'steps': 57524, 'loss/train': 1.4721578359603882} 11/07/2021 05:18:15 - INFO - __main__ - Step 57526: {'lr': 0.0003455482748136663, 'samples': 11044992, 'steps': 57525, 'loss/train': 0.6027123332023621} 11/07/2021 05:18:15 - INFO - __main__ - Step 57527: {'lr': 0.00034554337092264874, 'samples': 11045184, 'steps': 57526, 'loss/train': 0.8254349231719971} 11/07/2021 05:18:16 - INFO - __main__ - Step 57528: {'lr': 0.00034553846698858083, 'samples': 11045376, 'steps': 57527, 'loss/train': 1.0931326150894165} 11/07/2021 05:18:16 - INFO - __main__ - Step 57529: {'lr': 0.00034553356301146473, 'samples': 11045568, 'steps': 57528, 'loss/train': 1.4469027519226074} 11/07/2021 05:18:17 - INFO - __main__ - Step 57530: {'lr': 0.0003455286589913027, 'samples': 11045760, 'steps': 57529, 'loss/train': 1.4669450521469116} 11/07/2021 05:18:17 - INFO - __main__ - Step 57531: {'lr': 0.0003455237549280969, 'samples': 11045952, 'steps': 57530, 'loss/train': 1.434045433998108} 11/07/2021 05:18:18 - INFO - __main__ - Step 57532: {'lr': 0.0003455188508218496, 'samples': 11046144, 'steps': 57531, 'loss/train': 0.8300938606262207} 11/07/2021 05:18:18 - INFO - __main__ - Step 57533: {'lr': 0.000345513946672563, 'samples': 11046336, 'steps': 57532, 'loss/train': 1.4491974115371704} 11/07/2021 05:18:18 - INFO - __main__ - Step 57534: {'lr': 0.0003455090424802393, 'samples': 11046528, 'steps': 57533, 'loss/train': 1.086493730545044} 11/07/2021 05:18:20 - INFO - __main__ - Step 57535: {'lr': 0.00034550413824488066, 'samples': 11046720, 'steps': 57534, 'loss/train': 1.7283111810684204} 11/07/2021 05:18:20 - INFO - __main__ - Step 57536: {'lr': 0.0003454992339664893, 'samples': 11046912, 'steps': 57535, 'loss/train': 1.2079789638519287} 11/07/2021 05:18:20 - INFO - __main__ - Step 57537: {'lr': 0.00034549432964506755, 'samples': 11047104, 'steps': 57536, 'loss/train': 1.3897489309310913} 11/07/2021 05:18:21 - INFO - __main__ - Step 57538: {'lr': 0.0003454894252806175, 'samples': 11047296, 'steps': 57537, 'loss/train': 1.3062047958374023} 11/07/2021 05:18:21 - INFO - __main__ - Step 57539: {'lr': 0.00034548452087314135, 'samples': 11047488, 'steps': 57538, 'loss/train': 1.1824064254760742} 11/07/2021 05:18:21 - INFO - __main__ - Step 57540: {'lr': 0.0003454796164226414, 'samples': 11047680, 'steps': 57539, 'loss/train': 2.041522979736328} 11/07/2021 05:18:22 - INFO - __main__ - Step 57541: {'lr': 0.00034547471192911973, 'samples': 11047872, 'steps': 57540, 'loss/train': 1.3433326482772827} 11/07/2021 05:18:23 - INFO - __main__ - Step 57542: {'lr': 0.0003454698073925787, 'samples': 11048064, 'steps': 57541, 'loss/train': 1.5176191329956055} 11/07/2021 05:18:23 - INFO - __main__ - Step 57543: {'lr': 0.00034546490281302033, 'samples': 11048256, 'steps': 57542, 'loss/train': 1.421345829963684} 11/07/2021 05:18:23 - INFO - __main__ - Step 57544: {'lr': 0.000345459998190447, 'samples': 11048448, 'steps': 57543, 'loss/train': 1.42875075340271} 11/07/2021 05:18:24 - INFO - __main__ - Step 57545: {'lr': 0.000345455093524861, 'samples': 11048640, 'steps': 57544, 'loss/train': 1.1425549983978271} 11/07/2021 05:18:25 - INFO - __main__ - Step 57546: {'lr': 0.00034545018881626435, 'samples': 11048832, 'steps': 57545, 'loss/train': 0.5950518250465393} 11/07/2021 05:18:25 - INFO - __main__ - Step 57547: {'lr': 0.00034544528406465927, 'samples': 11049024, 'steps': 57546, 'loss/train': 1.4897198677062988} 11/07/2021 05:18:25 - INFO - __main__ - Step 57548: {'lr': 0.000345440379270048, 'samples': 11049216, 'steps': 57547, 'loss/train': 0.9663727879524231} 11/07/2021 05:18:26 - INFO - __main__ - Step 57549: {'lr': 0.0003454354744324328, 'samples': 11049408, 'steps': 57548, 'loss/train': 1.471144199371338} 11/07/2021 05:18:26 - INFO - __main__ - Step 57550: {'lr': 0.00034543056955181584, 'samples': 11049600, 'steps': 57549, 'loss/train': 1.6009119749069214} 11/07/2021 05:18:27 - INFO - __main__ - Step 57551: {'lr': 0.0003454256646281993, 'samples': 11049792, 'steps': 57550, 'loss/train': 1.3860677480697632} 11/07/2021 05:18:28 - INFO - __main__ - Step 57552: {'lr': 0.0003454207596615855, 'samples': 11049984, 'steps': 57551, 'loss/train': 0.10142990201711655} 11/07/2021 05:18:28 - INFO - __main__ - Step 57553: {'lr': 0.00034541585465197653, 'samples': 11050176, 'steps': 57552, 'loss/train': 0.9701891541481018} 11/07/2021 05:18:28 - INFO - __main__ - Step 57554: {'lr': 0.0003454109495993747, 'samples': 11050368, 'steps': 57553, 'loss/train': 1.6230173110961914} 11/07/2021 05:18:29 - INFO - __main__ - Step 57555: {'lr': 0.0003454060445037821, 'samples': 11050560, 'steps': 57554, 'loss/train': 1.0044591426849365} 11/07/2021 05:18:30 - INFO - __main__ - Step 57556: {'lr': 0.0003454011393652011, 'samples': 11050752, 'steps': 57555, 'loss/train': 1.0634393692016602} 11/07/2021 05:18:30 - INFO - __main__ - Step 57557: {'lr': 0.0003453962341836337, 'samples': 11050944, 'steps': 57556, 'loss/train': 1.2863399982452393} 11/07/2021 05:18:30 - INFO - __main__ - Step 57558: {'lr': 0.0003453913289590823, 'samples': 11051136, 'steps': 57557, 'loss/train': 1.1702903509140015} 11/07/2021 05:18:31 - INFO - __main__ - Step 57559: {'lr': 0.00034538642369154907, 'samples': 11051328, 'steps': 57558, 'loss/train': 0.8804982304573059} 11/07/2021 05:18:31 - INFO - __main__ - Step 57560: {'lr': 0.00034538151838103614, 'samples': 11051520, 'steps': 57559, 'loss/train': 1.1462171077728271} 11/07/2021 05:18:32 - INFO - __main__ - Step 57561: {'lr': 0.00034537661302754577, 'samples': 11051712, 'steps': 57560, 'loss/train': 1.5659589767456055} 11/07/2021 05:18:33 - INFO - __main__ - Step 57562: {'lr': 0.00034537170763108017, 'samples': 11051904, 'steps': 57561, 'loss/train': 1.5989017486572266} 11/07/2021 05:18:33 - INFO - __main__ - Step 57563: {'lr': 0.00034536680219164156, 'samples': 11052096, 'steps': 57562, 'loss/train': 0.07066918164491653} 11/07/2021 05:18:33 - INFO - __main__ - Step 57564: {'lr': 0.0003453618967092322, 'samples': 11052288, 'steps': 57563, 'loss/train': 1.3893033266067505} 11/07/2021 05:18:34 - INFO - __main__ - Step 57565: {'lr': 0.00034535699118385413, 'samples': 11052480, 'steps': 57564, 'loss/train': 1.46445894241333} 11/07/2021 05:18:35 - INFO - __main__ - Step 57566: {'lr': 0.00034535208561550974, 'samples': 11052672, 'steps': 57565, 'loss/train': 1.139594554901123} 11/07/2021 05:18:35 - INFO - __main__ - Step 57567: {'lr': 0.00034534718000420113, 'samples': 11052864, 'steps': 57566, 'loss/train': 1.5781636238098145} 11/07/2021 05:18:35 - INFO - __main__ - Step 57568: {'lr': 0.0003453422743499306, 'samples': 11053056, 'steps': 57567, 'loss/train': 1.1728266477584839} 11/07/2021 05:18:36 - INFO - __main__ - Step 57569: {'lr': 0.00034533736865270025, 'samples': 11053248, 'steps': 57568, 'loss/train': 1.7905948162078857} 11/07/2021 05:18:36 - INFO - __main__ - Step 57570: {'lr': 0.0003453324629125124, 'samples': 11053440, 'steps': 57569, 'loss/train': 1.189206600189209} 11/07/2021 05:18:37 - INFO - __main__ - Step 57571: {'lr': 0.00034532755712936926, 'samples': 11053632, 'steps': 57570, 'loss/train': 1.337283968925476} 11/07/2021 05:18:38 - INFO - __main__ - Step 57572: {'lr': 0.0003453226513032729, 'samples': 11053824, 'steps': 57571, 'loss/train': 1.6351426839828491} 11/07/2021 05:18:38 - INFO - __main__ - Step 57573: {'lr': 0.00034531774543422567, 'samples': 11054016, 'steps': 57572, 'loss/train': 1.2395817041397095} 11/07/2021 05:18:38 - INFO - __main__ - Step 57574: {'lr': 0.00034531283952222975, 'samples': 11054208, 'steps': 57573, 'loss/train': 1.7429171800613403} 11/07/2021 05:18:39 - INFO - __main__ - Step 57575: {'lr': 0.00034530793356728727, 'samples': 11054400, 'steps': 57574, 'loss/train': 0.9734483957290649} 11/07/2021 05:18:40 - INFO - __main__ - Step 57576: {'lr': 0.0003453030275694006, 'samples': 11054592, 'steps': 57575, 'loss/train': 1.240654706954956} 11/07/2021 05:18:40 - INFO - __main__ - Step 57577: {'lr': 0.0003452981215285718, 'samples': 11054784, 'steps': 57576, 'loss/train': 1.734090805053711} 11/07/2021 05:18:40 - INFO - __main__ - Step 57578: {'lr': 0.0003452932154448031, 'samples': 11054976, 'steps': 57577, 'loss/train': 1.6185129880905151} 11/07/2021 05:18:41 - INFO - __main__ - Step 57579: {'lr': 0.0003452883093180968, 'samples': 11055168, 'steps': 57578, 'loss/train': 1.7786346673965454} 11/07/2021 05:18:41 - INFO - __main__ - Step 57580: {'lr': 0.0003452834031484551, 'samples': 11055360, 'steps': 57579, 'loss/train': 1.3384336233139038} 11/07/2021 05:18:42 - INFO - __main__ - Step 57581: {'lr': 0.0003452784969358801, 'samples': 11055552, 'steps': 57580, 'loss/train': 1.6141666173934937} 11/07/2021 05:18:42 - INFO - __main__ - Step 57582: {'lr': 0.0003452735906803741, 'samples': 11055744, 'steps': 57581, 'loss/train': 1.5685594081878662} 11/07/2021 05:18:43 - INFO - __main__ - Step 57583: {'lr': 0.0003452686843819393, 'samples': 11055936, 'steps': 57582, 'loss/train': 1.4723563194274902} 11/07/2021 05:18:43 - INFO - __main__ - Step 57584: {'lr': 0.0003452637780405778, 'samples': 11056128, 'steps': 57583, 'loss/train': 0.8280128836631775} 11/07/2021 05:18:43 - INFO - __main__ - Step 57585: {'lr': 0.000345258871656292, 'samples': 11056320, 'steps': 57584, 'loss/train': 1.4360042810440063} 11/07/2021 05:18:45 - INFO - __main__ - Step 57586: {'lr': 0.0003452539652290841, 'samples': 11056512, 'steps': 57585, 'loss/train': 1.7037127017974854} 11/07/2021 05:18:45 - INFO - __main__ - Step 57587: {'lr': 0.00034524905875895614, 'samples': 11056704, 'steps': 57586, 'loss/train': 0.9621776342391968} 11/07/2021 05:18:46 - INFO - __main__ - Step 57588: {'lr': 0.00034524415224591046, 'samples': 11056896, 'steps': 57587, 'loss/train': 1.1307048797607422} 11/07/2021 05:18:46 - INFO - __main__ - Step 57589: {'lr': 0.00034523924568994913, 'samples': 11057088, 'steps': 57588, 'loss/train': 1.5178391933441162} 11/07/2021 05:18:46 - INFO - __main__ - Step 57590: {'lr': 0.00034523433909107454, 'samples': 11057280, 'steps': 57589, 'loss/train': 0.6705071926116943} 11/07/2021 05:18:47 - INFO - __main__ - Step 57591: {'lr': 0.00034522943244928885, 'samples': 11057472, 'steps': 57590, 'loss/train': 1.5455048084259033} 11/07/2021 05:18:48 - INFO - __main__ - Step 57592: {'lr': 0.0003452245257645943, 'samples': 11057664, 'steps': 57591, 'loss/train': 1.3068667650222778} 11/07/2021 05:18:48 - INFO - __main__ - Step 57593: {'lr': 0.00034521961903699296, 'samples': 11057856, 'steps': 57592, 'loss/train': 1.2720032930374146} 11/07/2021 05:18:48 - INFO - __main__ - Step 57594: {'lr': 0.00034521471226648716, 'samples': 11058048, 'steps': 57593, 'loss/train': 1.7495051622390747} 11/07/2021 05:18:49 - INFO - __main__ - Step 57595: {'lr': 0.000345209805453079, 'samples': 11058240, 'steps': 57594, 'loss/train': 1.0688161849975586} 11/07/2021 05:18:49 - INFO - __main__ - Step 57596: {'lr': 0.00034520489859677083, 'samples': 11058432, 'steps': 57595, 'loss/train': 1.3923890590667725} 11/07/2021 05:18:50 - INFO - __main__ - Step 57597: {'lr': 0.0003451999916975648, 'samples': 11058624, 'steps': 57596, 'loss/train': 1.1420775651931763} 11/07/2021 05:18:50 - INFO - __main__ - Step 57598: {'lr': 0.00034519508475546314, 'samples': 11058816, 'steps': 57597, 'loss/train': 1.9673023223876953} 11/07/2021 05:18:51 - INFO - __main__ - Step 57599: {'lr': 0.0003451901777704681, 'samples': 11059008, 'steps': 57598, 'loss/train': 1.1575087308883667} 11/07/2021 05:18:51 - INFO - __main__ - Step 57600: {'lr': 0.00034518527074258175, 'samples': 11059200, 'steps': 57599, 'loss/train': 1.737684726715088} 11/07/2021 05:18:52 - INFO - __main__ - Step 57601: {'lr': 0.00034518036367180637, 'samples': 11059392, 'steps': 57600, 'loss/train': 0.9970178008079529} 11/07/2021 05:18:53 - INFO - __main__ - Step 57602: {'lr': 0.00034517545655814424, 'samples': 11059584, 'steps': 57601, 'loss/train': 0.5880791544914246} 11/07/2021 05:18:53 - INFO - __main__ - Step 57603: {'lr': 0.0003451705494015975, 'samples': 11059776, 'steps': 57602, 'loss/train': 1.185686469078064} 11/07/2021 05:18:53 - INFO - __main__ - Step 57604: {'lr': 0.0003451656422021684, 'samples': 11059968, 'steps': 57603, 'loss/train': 1.592069387435913} 11/07/2021 05:18:54 - INFO - __main__ - Step 57605: {'lr': 0.0003451607349598591, 'samples': 11060160, 'steps': 57604, 'loss/train': 1.4268330335617065} 11/07/2021 05:18:54 - INFO - __main__ - Step 57606: {'lr': 0.0003451558276746719, 'samples': 11060352, 'steps': 57605, 'loss/train': 1.9687608480453491} 11/07/2021 05:18:55 - INFO - __main__ - Step 57607: {'lr': 0.0003451509203466089, 'samples': 11060544, 'steps': 57606, 'loss/train': 1.3777413368225098} 11/07/2021 05:18:56 - INFO - __main__ - Step 57608: {'lr': 0.00034514601297567235, 'samples': 11060736, 'steps': 57607, 'loss/train': 1.1087727546691895} 11/07/2021 05:18:56 - INFO - __main__ - Step 57609: {'lr': 0.00034514110556186446, 'samples': 11060928, 'steps': 57608, 'loss/train': 1.738145112991333} 11/07/2021 05:18:56 - INFO - __main__ - Step 57610: {'lr': 0.0003451361981051875, 'samples': 11061120, 'steps': 57609, 'loss/train': 1.6119134426116943} 11/07/2021 05:18:57 - INFO - __main__ - Step 57611: {'lr': 0.00034513129060564365, 'samples': 11061312, 'steps': 57610, 'loss/train': 0.7874797582626343} 11/07/2021 05:18:58 - INFO - __main__ - Step 57612: {'lr': 0.00034512638306323506, 'samples': 11061504, 'steps': 57611, 'loss/train': 0.8196049928665161} 11/07/2021 05:18:58 - INFO - __main__ - Step 57613: {'lr': 0.000345121475477964, 'samples': 11061696, 'steps': 57612, 'loss/train': 1.529039978981018} 11/07/2021 05:18:58 - INFO - __main__ - Step 57614: {'lr': 0.0003451165678498327, 'samples': 11061888, 'steps': 57613, 'loss/train': 1.3116601705551147} 11/07/2021 05:18:59 - INFO - __main__ - Step 57615: {'lr': 0.00034511166017884334, 'samples': 11062080, 'steps': 57614, 'loss/train': 1.3582158088684082} 11/07/2021 05:18:59 - INFO - __main__ - Step 57616: {'lr': 0.0003451067524649981, 'samples': 11062272, 'steps': 57615, 'loss/train': 0.7368661761283875} 11/07/2021 05:19:00 - INFO - __main__ - Step 57617: {'lr': 0.00034510184470829924, 'samples': 11062464, 'steps': 57616, 'loss/train': 1.6011773347854614} 11/07/2021 05:19:00 - INFO - __main__ - Step 57618: {'lr': 0.000345096936908749, 'samples': 11062656, 'steps': 57617, 'loss/train': 0.5298988819122314} 11/07/2021 05:19:01 - INFO - __main__ - Step 57619: {'lr': 0.0003450920290663495, 'samples': 11062848, 'steps': 57618, 'loss/train': 1.0885212421417236} 11/07/2021 05:19:01 - INFO - __main__ - Step 57620: {'lr': 0.000345087121181103, 'samples': 11063040, 'steps': 57619, 'loss/train': 1.5097908973693848} 11/07/2021 05:19:01 - INFO - __main__ - Step 57621: {'lr': 0.0003450822132530117, 'samples': 11063232, 'steps': 57620, 'loss/train': 1.131121277809143} 11/07/2021 05:19:02 - INFO - __main__ - Step 57622: {'lr': 0.0003450773052820779, 'samples': 11063424, 'steps': 57621, 'loss/train': 0.9881262183189392} 11/07/2021 05:19:03 - INFO - __main__ - Step 57623: {'lr': 0.0003450723972683036, 'samples': 11063616, 'steps': 57622, 'loss/train': 1.750808835029602} 11/07/2021 05:19:03 - INFO - __main__ - Step 57624: {'lr': 0.00034506748921169124, 'samples': 11063808, 'steps': 57623, 'loss/train': 1.2427629232406616} 11/07/2021 05:19:03 - INFO - __main__ - Step 57625: {'lr': 0.00034506258111224294, 'samples': 11064000, 'steps': 57624, 'loss/train': 1.133847713470459} 11/07/2021 05:19:04 - INFO - __main__ - Step 57626: {'lr': 0.00034505767296996086, 'samples': 11064192, 'steps': 57625, 'loss/train': 1.138538122177124} 11/07/2021 05:19:05 - INFO - __main__ - Step 57627: {'lr': 0.0003450527647848473, 'samples': 11064384, 'steps': 57626, 'loss/train': 1.8103456497192383} 11/07/2021 05:19:05 - INFO - __main__ - Step 57628: {'lr': 0.0003450478565569044, 'samples': 11064576, 'steps': 57627, 'loss/train': 1.8080732822418213} 11/07/2021 05:19:06 - INFO - __main__ - Step 57629: {'lr': 0.0003450429482861344, 'samples': 11064768, 'steps': 57628, 'loss/train': 1.4648878574371338} 11/07/2021 05:19:06 - INFO - __main__ - Step 57630: {'lr': 0.0003450380399725396, 'samples': 11064960, 'steps': 57629, 'loss/train': 0.7530885338783264} 11/07/2021 05:19:06 - INFO - __main__ - Step 57631: {'lr': 0.000345033131616122, 'samples': 11065152, 'steps': 57630, 'loss/train': 1.634304404258728} 11/07/2021 05:19:07 - INFO - __main__ - Step 57632: {'lr': 0.000345028223216884, 'samples': 11065344, 'steps': 57631, 'loss/train': 1.7826693058013916} 11/07/2021 05:19:08 - INFO - __main__ - Step 57633: {'lr': 0.0003450233147748278, 'samples': 11065536, 'steps': 57632, 'loss/train': 1.7429877519607544} 11/07/2021 05:19:08 - INFO - __main__ - Step 57634: {'lr': 0.00034501840628995545, 'samples': 11065728, 'steps': 57633, 'loss/train': 1.4022407531738281} 11/07/2021 05:19:08 - INFO - __main__ - Step 57635: {'lr': 0.0003450134977622693, 'samples': 11065920, 'steps': 57634, 'loss/train': 1.4085102081298828} 11/07/2021 05:19:09 - INFO - __main__ - Step 57636: {'lr': 0.0003450085891917716, 'samples': 11066112, 'steps': 57635, 'loss/train': 1.1837139129638672} 11/07/2021 05:19:09 - INFO - __main__ - Step 57637: {'lr': 0.00034500368057846444, 'samples': 11066304, 'steps': 57636, 'loss/train': 1.6431742906570435} 11/07/2021 05:19:10 - INFO - __main__ - Step 57638: {'lr': 0.00034499877192235005, 'samples': 11066496, 'steps': 57637, 'loss/train': 1.51119065284729} 11/07/2021 05:19:11 - INFO - __main__ - Step 57639: {'lr': 0.00034499386322343087, 'samples': 11066688, 'steps': 57638, 'loss/train': 1.151750087738037} 11/07/2021 05:19:11 - INFO - __main__ - Step 57640: {'lr': 0.00034498895448170874, 'samples': 11066880, 'steps': 57639, 'loss/train': 1.271364450454712} 11/07/2021 05:19:11 - INFO - __main__ - Step 57641: {'lr': 0.0003449840456971861, 'samples': 11067072, 'steps': 57640, 'loss/train': 1.3291971683502197} 11/07/2021 05:19:12 - INFO - __main__ - Step 57642: {'lr': 0.0003449791368698651, 'samples': 11067264, 'steps': 57641, 'loss/train': 1.3007110357284546} 11/07/2021 05:19:13 - INFO - __main__ - Step 57643: {'lr': 0.000344974227999748, 'samples': 11067456, 'steps': 57642, 'loss/train': 1.3085644245147705} 11/07/2021 05:19:13 - INFO - __main__ - Step 57644: {'lr': 0.0003449693190868369, 'samples': 11067648, 'steps': 57643, 'loss/train': 1.3239974975585938} 11/07/2021 05:19:13 - INFO - __main__ - Step 57645: {'lr': 0.0003449644101311341, 'samples': 11067840, 'steps': 57644, 'loss/train': 1.547965168952942} 11/07/2021 05:19:14 - INFO - __main__ - Step 57646: {'lr': 0.00034495950113264194, 'samples': 11068032, 'steps': 57645, 'loss/train': 1.2187222242355347} 11/07/2021 05:19:14 - INFO - __main__ - Step 57647: {'lr': 0.0003449545920913624, 'samples': 11068224, 'steps': 57646, 'loss/train': 1.7817707061767578} 11/07/2021 05:19:15 - INFO - __main__ - Step 57648: {'lr': 0.0003449496830072978, 'samples': 11068416, 'steps': 57647, 'loss/train': 1.0762932300567627} 11/07/2021 05:19:15 - INFO - __main__ - Step 57649: {'lr': 0.0003449447738804503, 'samples': 11068608, 'steps': 57648, 'loss/train': 0.5625360608100891} 11/07/2021 05:19:16 - INFO - __main__ - Step 57650: {'lr': 0.00034493986471082215, 'samples': 11068800, 'steps': 57649, 'loss/train': 1.4757633209228516} 11/07/2021 05:19:16 - INFO - __main__ - Step 57651: {'lr': 0.0003449349554984156, 'samples': 11068992, 'steps': 57650, 'loss/train': 1.2945477962493896} 11/07/2021 05:19:16 - INFO - __main__ - Step 57652: {'lr': 0.0003449300462432328, 'samples': 11069184, 'steps': 57651, 'loss/train': 1.314429759979248} 11/07/2021 05:19:17 - INFO - __main__ - Step 57653: {'lr': 0.0003449251369452761, 'samples': 11069376, 'steps': 57652, 'loss/train': 1.6082700490951538} 11/07/2021 05:19:18 - INFO - __main__ - Step 57654: {'lr': 0.00034492022760454743, 'samples': 11069568, 'steps': 57653, 'loss/train': 1.409935474395752} 11/07/2021 05:19:18 - INFO - __main__ - Step 57655: {'lr': 0.00034491531822104923, 'samples': 11069760, 'steps': 57654, 'loss/train': 1.408873438835144} 11/07/2021 05:19:18 - INFO - __main__ - Step 57656: {'lr': 0.00034491040879478364, 'samples': 11069952, 'steps': 57655, 'loss/train': 1.2753372192382812} 11/07/2021 05:19:19 - INFO - __main__ - Step 57657: {'lr': 0.0003449054993257529, 'samples': 11070144, 'steps': 57656, 'loss/train': 1.4293252229690552} 11/07/2021 05:19:20 - INFO - __main__ - Step 57658: {'lr': 0.0003449005898139592, 'samples': 11070336, 'steps': 57657, 'loss/train': 1.550768494606018} 11/07/2021 05:19:20 - INFO - __main__ - Step 57659: {'lr': 0.0003448956802594048, 'samples': 11070528, 'steps': 57658, 'loss/train': 0.6354227662086487} 11/07/2021 05:19:20 - INFO - __main__ - Step 57660: {'lr': 0.00034489077066209185, 'samples': 11070720, 'steps': 57659, 'loss/train': 1.4722617864608765} 11/07/2021 05:19:21 - INFO - __main__ - Step 57661: {'lr': 0.0003448858610220226, 'samples': 11070912, 'steps': 57660, 'loss/train': 1.1395282745361328} 11/07/2021 05:19:21 - INFO - __main__ - Step 57662: {'lr': 0.00034488095133919914, 'samples': 11071104, 'steps': 57661, 'loss/train': 1.1864697933197021} 11/07/2021 05:19:22 - INFO - __main__ - Step 57663: {'lr': 0.0003448760416136239, 'samples': 11071296, 'steps': 57662, 'loss/train': 1.638628602027893} 11/07/2021 05:19:22 - INFO - __main__ - Step 57664: {'lr': 0.00034487113184529896, 'samples': 11071488, 'steps': 57663, 'loss/train': 1.009995937347412} 11/07/2021 05:19:23 - INFO - __main__ - Step 57665: {'lr': 0.0003448662220342265, 'samples': 11071680, 'steps': 57664, 'loss/train': 1.5353615283966064} 11/07/2021 05:19:23 - INFO - __main__ - Step 57666: {'lr': 0.0003448613121804088, 'samples': 11071872, 'steps': 57665, 'loss/train': 1.1033000946044922} 11/07/2021 05:19:23 - INFO - __main__ - Step 57667: {'lr': 0.0003448564022838481, 'samples': 11072064, 'steps': 57666, 'loss/train': 1.711601734161377} 11/07/2021 05:19:25 - INFO - __main__ - Step 57668: {'lr': 0.0003448514923445466, 'samples': 11072256, 'steps': 57667, 'loss/train': 0.8116577863693237} 11/07/2021 05:19:25 - INFO - __main__ - Step 57669: {'lr': 0.00034484658236250636, 'samples': 11072448, 'steps': 57668, 'loss/train': 1.245047926902771} 11/07/2021 05:19:25 - INFO - __main__ - Step 57670: {'lr': 0.0003448416723377298, 'samples': 11072640, 'steps': 57669, 'loss/train': 1.4965254068374634} 11/07/2021 05:19:26 - INFO - __main__ - Step 57671: {'lr': 0.00034483676227021906, 'samples': 11072832, 'steps': 57670, 'loss/train': 1.5550516843795776} 11/07/2021 05:19:26 - INFO - __main__ - Step 57672: {'lr': 0.00034483185215997624, 'samples': 11073024, 'steps': 57671, 'loss/train': 1.3319071531295776} 11/07/2021 05:19:27 - INFO - __main__ - Step 57673: {'lr': 0.00034482694200700377, 'samples': 11073216, 'steps': 57672, 'loss/train': 1.562532901763916} 11/07/2021 05:19:27 - INFO - __main__ - Step 57674: {'lr': 0.00034482203181130365, 'samples': 11073408, 'steps': 57673, 'loss/train': 1.730484962463379} 11/07/2021 05:19:28 - INFO - __main__ - Step 57675: {'lr': 0.00034481712157287826, 'samples': 11073600, 'steps': 57674, 'loss/train': 1.526288628578186} 11/07/2021 05:19:28 - INFO - __main__ - Step 57676: {'lr': 0.00034481221129172967, 'samples': 11073792, 'steps': 57675, 'loss/train': 1.6786359548568726} 11/07/2021 05:19:28 - INFO - __main__ - Step 57677: {'lr': 0.0003448073009678602, 'samples': 11073984, 'steps': 57676, 'loss/train': 1.3379011154174805} 11/07/2021 05:19:29 - INFO - __main__ - Step 57678: {'lr': 0.00034480239060127204, 'samples': 11074176, 'steps': 57677, 'loss/train': 1.4021776914596558} 11/07/2021 05:19:30 - INFO - __main__ - Step 57679: {'lr': 0.00034479748019196734, 'samples': 11074368, 'steps': 57678, 'loss/train': 1.0051445960998535} 11/07/2021 05:19:30 - INFO - __main__ - Step 57680: {'lr': 0.00034479256973994843, 'samples': 11074560, 'steps': 57679, 'loss/train': 0.7099761366844177} 11/07/2021 05:19:30 - INFO - __main__ - Step 57681: {'lr': 0.0003447876592452174, 'samples': 11074752, 'steps': 57680, 'loss/train': 1.466721773147583} 11/07/2021 05:19:31 - INFO - __main__ - Step 57682: {'lr': 0.00034478274870777646, 'samples': 11074944, 'steps': 57681, 'loss/train': 1.3075916767120361} 11/07/2021 05:19:32 - INFO - __main__ - Step 57683: {'lr': 0.00034477783812762795, 'samples': 11075136, 'steps': 57682, 'loss/train': 1.4150981903076172} 11/07/2021 05:19:32 - INFO - __main__ - Step 57684: {'lr': 0.00034477292750477396, 'samples': 11075328, 'steps': 57683, 'loss/train': 1.0988235473632812} 11/07/2021 05:19:33 - INFO - __main__ - Step 57685: {'lr': 0.00034476801683921683, 'samples': 11075520, 'steps': 57684, 'loss/train': 1.3911079168319702} 11/07/2021 05:19:33 - INFO - __main__ - Step 57686: {'lr': 0.00034476310613095867, 'samples': 11075712, 'steps': 57685, 'loss/train': 0.05851629748940468} 11/07/2021 05:19:33 - INFO - __main__ - Step 57687: {'lr': 0.0003447581953800017, 'samples': 11075904, 'steps': 57686, 'loss/train': 1.8319309949874878} 11/07/2021 05:19:34 - INFO - __main__ - Step 57688: {'lr': 0.00034475328458634814, 'samples': 11076096, 'steps': 57687, 'loss/train': 1.9513481855392456} 11/07/2021 05:19:35 - INFO - __main__ - Step 57689: {'lr': 0.00034474837375000016, 'samples': 11076288, 'steps': 57688, 'loss/train': 1.4049649238586426} 11/07/2021 05:19:35 - INFO - __main__ - Step 57690: {'lr': 0.0003447434628709601, 'samples': 11076480, 'steps': 57689, 'loss/train': 2.0631494522094727} 11/07/2021 05:19:35 - INFO - __main__ - Step 57691: {'lr': 0.00034473855194923006, 'samples': 11076672, 'steps': 57690, 'loss/train': 1.1817691326141357} 11/07/2021 05:19:36 - INFO - __main__ - Step 57692: {'lr': 0.0003447336409848124, 'samples': 11076864, 'steps': 57691, 'loss/train': 1.365451693534851} 11/07/2021 05:19:37 - INFO - __main__ - Step 57693: {'lr': 0.0003447287299777091, 'samples': 11077056, 'steps': 57692, 'loss/train': 1.5198798179626465} 11/07/2021 05:19:37 - INFO - __main__ - Step 57694: {'lr': 0.0003447238189279225, 'samples': 11077248, 'steps': 57693, 'loss/train': 1.849091649055481} 11/07/2021 05:19:37 - INFO - __main__ - Step 57695: {'lr': 0.0003447189078354548, 'samples': 11077440, 'steps': 57694, 'loss/train': 1.4110535383224487} 11/07/2021 05:19:38 - INFO - __main__ - Step 57696: {'lr': 0.00034471399670030824, 'samples': 11077632, 'steps': 57695, 'loss/train': 1.1762892007827759} 11/07/2021 05:19:38 - INFO - __main__ - Step 57697: {'lr': 0.00034470908552248504, 'samples': 11077824, 'steps': 57696, 'loss/train': 1.0973211526870728} 11/07/2021 05:19:38 - INFO - __main__ - Step 57698: {'lr': 0.00034470417430198743, 'samples': 11078016, 'steps': 57697, 'loss/train': 1.0146803855895996} 11/07/2021 05:19:40 - INFO - __main__ - Step 57699: {'lr': 0.00034469926303881747, 'samples': 11078208, 'steps': 57698, 'loss/train': 1.2770438194274902} 11/07/2021 05:19:40 - INFO - __main__ - Step 57700: {'lr': 0.0003446943517329776, 'samples': 11078400, 'steps': 57699, 'loss/train': 1.2319482564926147} 11/07/2021 05:19:40 - INFO - __main__ - Step 57701: {'lr': 0.0003446894403844698, 'samples': 11078592, 'steps': 57700, 'loss/train': 1.2217903137207031} 11/07/2021 05:19:41 - INFO - __main__ - Step 57702: {'lr': 0.0003446845289932965, 'samples': 11078784, 'steps': 57701, 'loss/train': 1.4680088758468628} 11/07/2021 05:19:41 - INFO - __main__ - Step 57703: {'lr': 0.0003446796175594598, 'samples': 11078976, 'steps': 57702, 'loss/train': 0.3309529423713684} 11/07/2021 05:19:42 - INFO - __main__ - Step 57704: {'lr': 0.00034467470608296185, 'samples': 11079168, 'steps': 57703, 'loss/train': 1.5323333740234375} 11/07/2021 05:19:42 - INFO - __main__ - Step 57705: {'lr': 0.00034466979456380497, 'samples': 11079360, 'steps': 57704, 'loss/train': 1.2461626529693604} 11/07/2021 05:19:43 - INFO - __main__ - Step 57706: {'lr': 0.0003446648830019914, 'samples': 11079552, 'steps': 57705, 'loss/train': 1.397922158241272} 11/07/2021 05:19:43 - INFO - __main__ - Step 57707: {'lr': 0.00034465997139752327, 'samples': 11079744, 'steps': 57706, 'loss/train': 1.5268123149871826} 11/07/2021 05:19:43 - INFO - __main__ - Step 57708: {'lr': 0.00034465505975040273, 'samples': 11079936, 'steps': 57707, 'loss/train': 0.03303880989551544} 11/07/2021 05:19:44 - INFO - __main__ - Step 57709: {'lr': 0.0003446501480606322, 'samples': 11080128, 'steps': 57708, 'loss/train': 1.4570459127426147} 11/07/2021 05:19:45 - INFO - __main__ - Step 57710: {'lr': 0.0003446452363282137, 'samples': 11080320, 'steps': 57709, 'loss/train': 1.3838202953338623} 11/07/2021 05:19:45 - INFO - __main__ - Step 57711: {'lr': 0.00034464032455314955, 'samples': 11080512, 'steps': 57710, 'loss/train': 1.2123501300811768} 11/07/2021 05:19:45 - INFO - __main__ - Step 57712: {'lr': 0.0003446354127354419, 'samples': 11080704, 'steps': 57711, 'loss/train': 1.649638056755066} 11/07/2021 05:19:46 - INFO - __main__ - Step 57713: {'lr': 0.000344630500875093, 'samples': 11080896, 'steps': 57712, 'loss/train': 1.2207626104354858} 11/07/2021 05:19:47 - INFO - __main__ - Step 57714: {'lr': 0.0003446255889721051, 'samples': 11081088, 'steps': 57713, 'loss/train': 1.644190788269043} 11/07/2021 05:19:47 - INFO - __main__ - Step 57715: {'lr': 0.00034462067702648036, 'samples': 11081280, 'steps': 57714, 'loss/train': 1.2539130449295044} 11/07/2021 05:19:47 - INFO - __main__ - Step 57716: {'lr': 0.000344615765038221, 'samples': 11081472, 'steps': 57715, 'loss/train': 1.253875494003296} 11/07/2021 05:19:48 - INFO - __main__ - Step 57717: {'lr': 0.0003446108530073292, 'samples': 11081664, 'steps': 57716, 'loss/train': 1.1340068578720093} 11/07/2021 05:19:48 - INFO - __main__ - Step 57718: {'lr': 0.0003446059409338072, 'samples': 11081856, 'steps': 57717, 'loss/train': 0.8219293355941772} 11/07/2021 05:19:49 - INFO - __main__ - Step 57719: {'lr': 0.00034460102881765723, 'samples': 11082048, 'steps': 57718, 'loss/train': 1.338723063468933} 11/07/2021 05:19:50 - INFO - __main__ - Step 57720: {'lr': 0.0003445961166588816, 'samples': 11082240, 'steps': 57719, 'loss/train': 1.2686856985092163} 11/07/2021 05:19:50 - INFO - __main__ - Step 57721: {'lr': 0.0003445912044574823, 'samples': 11082432, 'steps': 57720, 'loss/train': 1.2591438293457031} 11/07/2021 05:19:50 - INFO - __main__ - Step 57722: {'lr': 0.00034458629221346173, 'samples': 11082624, 'steps': 57721, 'loss/train': 1.7226529121398926} 11/07/2021 05:19:51 - INFO - __main__ - Step 57723: {'lr': 0.000344581379926822, 'samples': 11082816, 'steps': 57722, 'loss/train': 1.2006354331970215} 11/07/2021 05:19:52 - INFO - __main__ - Step 57724: {'lr': 0.00034457646759756535, 'samples': 11083008, 'steps': 57723, 'loss/train': 1.7096425294876099} 11/07/2021 05:19:52 - INFO - __main__ - Step 57725: {'lr': 0.00034457155522569393, 'samples': 11083200, 'steps': 57724, 'loss/train': 1.5500774383544922} 11/07/2021 05:19:52 - INFO - __main__ - Step 57726: {'lr': 0.00034456664281121017, 'samples': 11083392, 'steps': 57725, 'loss/train': 1.4748237133026123} 11/07/2021 05:19:53 - INFO - __main__ - Step 57727: {'lr': 0.00034456173035411606, 'samples': 11083584, 'steps': 57726, 'loss/train': 1.2925790548324585} 11/07/2021 05:19:53 - INFO - __main__ - Step 57728: {'lr': 0.00034455681785441395, 'samples': 11083776, 'steps': 57727, 'loss/train': 1.6779370307922363} 11/07/2021 05:19:54 - INFO - __main__ - Step 57729: {'lr': 0.00034455190531210595, 'samples': 11083968, 'steps': 57728, 'loss/train': 1.0728123188018799} 11/07/2021 05:19:55 - INFO - __main__ - Step 57730: {'lr': 0.0003445469927271944, 'samples': 11084160, 'steps': 57729, 'loss/train': 1.1827887296676636} 11/07/2021 05:19:55 - INFO - __main__ - Step 57731: {'lr': 0.0003445420800996813, 'samples': 11084352, 'steps': 57730, 'loss/train': 1.102344036102295} 11/07/2021 05:19:55 - INFO - __main__ - Step 57732: {'lr': 0.0003445371674295691, 'samples': 11084544, 'steps': 57731, 'loss/train': 1.6247098445892334} 11/07/2021 05:19:56 - INFO - __main__ - Step 57733: {'lr': 0.0003445322547168599, 'samples': 11084736, 'steps': 57732, 'loss/train': 1.6746764183044434} 11/07/2021 05:19:57 - INFO - __main__ - Step 57734: {'lr': 0.0003445273419615559, 'samples': 11084928, 'steps': 57733, 'loss/train': 1.0753048658370972} 11/07/2021 05:19:57 - INFO - __main__ - Step 57735: {'lr': 0.00034452242916365935, 'samples': 11085120, 'steps': 57734, 'loss/train': 1.6022965908050537} 11/07/2021 05:19:58 - INFO - __main__ - Step 57736: {'lr': 0.0003445175163231724, 'samples': 11085312, 'steps': 57735, 'loss/train': 1.5820335149765015} 11/07/2021 05:19:58 - INFO - __main__ - Step 57737: {'lr': 0.00034451260344009737, 'samples': 11085504, 'steps': 57736, 'loss/train': 1.6642982959747314} 11/07/2021 05:19:58 - INFO - __main__ - Step 57738: {'lr': 0.00034450769051443635, 'samples': 11085696, 'steps': 57737, 'loss/train': 1.0080724954605103} 11/07/2021 05:19:59 - INFO - __main__ - Step 57739: {'lr': 0.0003445027775461917, 'samples': 11085888, 'steps': 57738, 'loss/train': 1.4903745651245117} 11/07/2021 05:20:00 - INFO - __main__ - Step 57740: {'lr': 0.0003444978645353656, 'samples': 11086080, 'steps': 57739, 'loss/train': 0.07775591313838959} 11/07/2021 05:20:00 - INFO - __main__ - Step 57741: {'lr': 0.0003444929514819601, 'samples': 11086272, 'steps': 57740, 'loss/train': 1.0781891345977783} 11/07/2021 05:20:01 - INFO - __main__ - Step 57742: {'lr': 0.00034448803838597766, 'samples': 11086464, 'steps': 57741, 'loss/train': 2.499934673309326} 11/07/2021 05:20:01 - INFO - __main__ - Step 57743: {'lr': 0.00034448312524742027, 'samples': 11086656, 'steps': 57742, 'loss/train': 1.4848999977111816} 11/07/2021 05:20:01 - INFO - __main__ - Step 57744: {'lr': 0.00034447821206629026, 'samples': 11086848, 'steps': 57743, 'loss/train': 1.5241488218307495} 11/07/2021 05:20:02 - INFO - __main__ - Step 57745: {'lr': 0.0003444732988425898, 'samples': 11087040, 'steps': 57744, 'loss/train': 1.348928451538086} 11/07/2021 05:20:03 - INFO - __main__ - Step 57746: {'lr': 0.0003444683855763212, 'samples': 11087232, 'steps': 57745, 'loss/train': 1.2769927978515625} 11/07/2021 05:20:03 - INFO - __main__ - Step 57747: {'lr': 0.0003444634722674866, 'samples': 11087424, 'steps': 57746, 'loss/train': 1.7812557220458984} 11/07/2021 05:20:03 - INFO - __main__ - Step 57748: {'lr': 0.0003444585589160882, 'samples': 11087616, 'steps': 57747, 'loss/train': 1.227088451385498} 11/07/2021 05:20:04 - INFO - __main__ - Step 57749: {'lr': 0.0003444536455221282, 'samples': 11087808, 'steps': 57748, 'loss/train': 1.3041777610778809} 11/07/2021 05:20:04 - INFO - __main__ - Step 57750: {'lr': 0.00034444873208560884, 'samples': 11088000, 'steps': 57749, 'loss/train': 1.9908219575881958} 11/07/2021 05:20:05 - INFO - __main__ - Step 57751: {'lr': 0.00034444381860653233, 'samples': 11088192, 'steps': 57750, 'loss/train': 1.521040678024292} 11/07/2021 05:20:05 - INFO - __main__ - Step 57752: {'lr': 0.00034443890508490093, 'samples': 11088384, 'steps': 57751, 'loss/train': 1.2209283113479614} 11/07/2021 05:20:06 - INFO - __main__ - Step 57753: {'lr': 0.0003444339915207168, 'samples': 11088576, 'steps': 57752, 'loss/train': 1.6212396621704102} 11/07/2021 05:20:06 - INFO - __main__ - Step 57754: {'lr': 0.0003444290779139823, 'samples': 11088768, 'steps': 57753, 'loss/train': 1.6818703413009644} 11/07/2021 05:20:07 - INFO - __main__ - Step 57755: {'lr': 0.00034442416426469936, 'samples': 11088960, 'steps': 57754, 'loss/train': 1.79239022731781} 11/07/2021 05:20:08 - INFO - __main__ - Step 57756: {'lr': 0.0003444192505728704, 'samples': 11089152, 'steps': 57755, 'loss/train': 1.2616841793060303} 11/07/2021 05:20:08 - INFO - __main__ - Step 57757: {'lr': 0.0003444143368384975, 'samples': 11089344, 'steps': 57756, 'loss/train': 1.4383338689804077} 11/07/2021 05:20:08 - INFO - __main__ - Step 57758: {'lr': 0.000344409423061583, 'samples': 11089536, 'steps': 57757, 'loss/train': 1.6081862449645996} 11/07/2021 05:20:09 - INFO - __main__ - Step 57759: {'lr': 0.00034440450924212913, 'samples': 11089728, 'steps': 57758, 'loss/train': 1.5299433469772339} 11/07/2021 05:20:09 - INFO - __main__ - Step 57760: {'lr': 0.00034439959538013805, 'samples': 11089920, 'steps': 57759, 'loss/train': 1.4348293542861938} 11/07/2021 05:20:10 - INFO - __main__ - Step 57761: {'lr': 0.0003443946814756119, 'samples': 11090112, 'steps': 57760, 'loss/train': 1.2355751991271973} 11/07/2021 05:20:10 - INFO - __main__ - Step 57762: {'lr': 0.000344389767528553, 'samples': 11090304, 'steps': 57761, 'loss/train': 0.541526198387146} 11/07/2021 05:20:11 - INFO - __main__ - Step 57763: {'lr': 0.0003443848535389635, 'samples': 11090496, 'steps': 57762, 'loss/train': 1.4673749208450317} 11/07/2021 05:20:11 - INFO - __main__ - Step 57764: {'lr': 0.00034437993950684566, 'samples': 11090688, 'steps': 57763, 'loss/train': 1.3057758808135986} 11/07/2021 05:20:11 - INFO - __main__ - Step 57765: {'lr': 0.00034437502543220166, 'samples': 11090880, 'steps': 57764, 'loss/train': 1.5854928493499756} 11/07/2021 05:20:12 - INFO - __main__ - Step 57766: {'lr': 0.0003443701113150337, 'samples': 11091072, 'steps': 57765, 'loss/train': 0.4549166262149811} 11/07/2021 05:20:13 - INFO - __main__ - Step 57767: {'lr': 0.00034436519715534415, 'samples': 11091264, 'steps': 57766, 'loss/train': 1.3708440065383911} 11/07/2021 05:20:13 - INFO - __main__ - Step 57768: {'lr': 0.00034436028295313503, 'samples': 11091456, 'steps': 57767, 'loss/train': 1.161154866218567} 11/07/2021 05:20:13 - INFO - __main__ - Step 57769: {'lr': 0.00034435536870840855, 'samples': 11091648, 'steps': 57768, 'loss/train': 1.197624683380127} 11/07/2021 05:20:14 - INFO - __main__ - Step 57770: {'lr': 0.0003443504544211671, 'samples': 11091840, 'steps': 57769, 'loss/train': 1.1574995517730713} 11/07/2021 05:20:15 - INFO - __main__ - Step 57771: {'lr': 0.0003443455400914127, 'samples': 11092032, 'steps': 57770, 'loss/train': 1.2135539054870605} 11/07/2021 05:20:15 - INFO - __main__ - Step 57772: {'lr': 0.0003443406257191477, 'samples': 11092224, 'steps': 57771, 'loss/train': 1.3909350633621216} 11/07/2021 05:20:16 - INFO - __main__ - Step 57773: {'lr': 0.0003443357113043743, 'samples': 11092416, 'steps': 57772, 'loss/train': 1.4927105903625488} 11/07/2021 05:20:16 - INFO - __main__ - Step 57774: {'lr': 0.00034433079684709466, 'samples': 11092608, 'steps': 57773, 'loss/train': 1.1110316514968872} 11/07/2021 05:20:16 - INFO - __main__ - Step 57775: {'lr': 0.000344325882347311, 'samples': 11092800, 'steps': 57774, 'loss/train': 1.4093879461288452} 11/07/2021 05:20:17 - INFO - __main__ - Step 57776: {'lr': 0.00034432096780502564, 'samples': 11092992, 'steps': 57775, 'loss/train': 1.352190613746643} 11/07/2021 05:20:18 - INFO - __main__ - Step 57777: {'lr': 0.0003443160532202406, 'samples': 11093184, 'steps': 57776, 'loss/train': 0.969149649143219} 11/07/2021 05:20:18 - INFO - __main__ - Step 57778: {'lr': 0.00034431113859295827, 'samples': 11093376, 'steps': 57777, 'loss/train': 1.5390814542770386} 11/07/2021 05:20:18 - INFO - __main__ - Step 57779: {'lr': 0.00034430622392318073, 'samples': 11093568, 'steps': 57778, 'loss/train': 1.5301694869995117} 11/07/2021 05:20:19 - INFO - __main__ - Step 57780: {'lr': 0.0003443013092109103, 'samples': 11093760, 'steps': 57779, 'loss/train': 1.3802093267440796} 11/07/2021 05:20:20 - INFO - __main__ - Step 57781: {'lr': 0.0003442963944561492, 'samples': 11093952, 'steps': 57780, 'loss/train': 1.5616902112960815} 11/07/2021 05:20:20 - INFO - __main__ - Step 57782: {'lr': 0.0003442914796588995, 'samples': 11094144, 'steps': 57781, 'loss/train': 2.0523765087127686} 11/07/2021 05:20:20 - INFO - __main__ - Step 57783: {'lr': 0.00034428656481916357, 'samples': 11094336, 'steps': 57782, 'loss/train': 0.9525910019874573} 11/07/2021 05:20:21 - INFO - __main__ - Step 57784: {'lr': 0.00034428164993694356, 'samples': 11094528, 'steps': 57783, 'loss/train': 1.1512936353683472} 11/07/2021 05:20:21 - INFO - __main__ - Step 57785: {'lr': 0.0003442767350122417, 'samples': 11094720, 'steps': 57784, 'loss/train': 1.297159194946289} 11/07/2021 05:20:22 - INFO - __main__ - Step 57786: {'lr': 0.0003442718200450602, 'samples': 11094912, 'steps': 57785, 'loss/train': 1.5386817455291748} 11/07/2021 05:20:22 - INFO - __main__ - Step 57787: {'lr': 0.0003442669050354013, 'samples': 11095104, 'steps': 57786, 'loss/train': 1.2106409072875977} 11/07/2021 05:20:23 - INFO - __main__ - Step 57788: {'lr': 0.00034426198998326713, 'samples': 11095296, 'steps': 57787, 'loss/train': 1.3910138607025146} 11/07/2021 05:20:23 - INFO - __main__ - Step 57789: {'lr': 0.00034425707488866, 'samples': 11095488, 'steps': 57788, 'loss/train': 1.5607638359069824} 11/07/2021 05:20:23 - INFO - __main__ - Step 57790: {'lr': 0.0003442521597515821, 'samples': 11095680, 'steps': 57789, 'loss/train': 0.755548894405365} 11/07/2021 05:20:24 - INFO - __main__ - Step 57791: {'lr': 0.00034424724457203553, 'samples': 11095872, 'steps': 57790, 'loss/train': 1.1952698230743408} 11/07/2021 05:20:25 - INFO - __main__ - Step 57792: {'lr': 0.0003442423293500227, 'samples': 11096064, 'steps': 57791, 'loss/train': 1.4297170639038086} 11/07/2021 05:20:25 - INFO - __main__ - Step 57793: {'lr': 0.0003442374140855457, 'samples': 11096256, 'steps': 57792, 'loss/train': 1.6370017528533936} 11/07/2021 05:20:25 - INFO - __main__ - Step 57794: {'lr': 0.00034423249877860683, 'samples': 11096448, 'steps': 57793, 'loss/train': 0.8426752090454102} 11/07/2021 05:20:26 - INFO - __main__ - Step 57795: {'lr': 0.0003442275834292082, 'samples': 11096640, 'steps': 57794, 'loss/train': 1.2816393375396729} 11/07/2021 05:20:27 - INFO - __main__ - Step 57796: {'lr': 0.0003442226680373521, 'samples': 11096832, 'steps': 57795, 'loss/train': 1.638270616531372} 11/07/2021 05:20:27 - INFO - __main__ - Step 57797: {'lr': 0.00034421775260304067, 'samples': 11097024, 'steps': 57796, 'loss/train': 1.542790412902832} 11/07/2021 05:20:28 - INFO - __main__ - Step 57798: {'lr': 0.0003442128371262762, 'samples': 11097216, 'steps': 57797, 'loss/train': 1.4426237344741821} 11/07/2021 05:20:28 - INFO - __main__ - Step 57799: {'lr': 0.00034420792160706087, 'samples': 11097408, 'steps': 57798, 'loss/train': 0.5713027715682983} 11/07/2021 05:20:28 - INFO - __main__ - Step 57800: {'lr': 0.0003442030060453969, 'samples': 11097600, 'steps': 57799, 'loss/train': 1.7395669221878052} 11/07/2021 05:20:29 - INFO - __main__ - Step 57801: {'lr': 0.0003441980904412866, 'samples': 11097792, 'steps': 57800, 'loss/train': 1.0744584798812866} 11/07/2021 05:20:30 - INFO - __main__ - Step 57802: {'lr': 0.000344193174794732, 'samples': 11097984, 'steps': 57801, 'loss/train': 2.3064522743225098} 11/07/2021 05:20:30 - INFO - __main__ - Step 57803: {'lr': 0.00034418825910573545, 'samples': 11098176, 'steps': 57802, 'loss/train': 1.4305062294006348} 11/07/2021 05:20:31 - INFO - __main__ - Step 57804: {'lr': 0.00034418334337429907, 'samples': 11098368, 'steps': 57803, 'loss/train': 0.11569163203239441} 11/07/2021 05:20:31 - INFO - __main__ - Step 57805: {'lr': 0.00034417842760042517, 'samples': 11098560, 'steps': 57804, 'loss/train': 1.8115553855895996} 11/07/2021 05:20:32 - INFO - __main__ - Step 57806: {'lr': 0.0003441735117841159, 'samples': 11098752, 'steps': 57805, 'loss/train': 1.549162745475769} 11/07/2021 05:20:32 - INFO - __main__ - Step 57807: {'lr': 0.0003441685959253736, 'samples': 11098944, 'steps': 57806, 'loss/train': 1.1044977903366089} 11/07/2021 05:20:33 - INFO - __main__ - Step 57808: {'lr': 0.0003441636800242003, 'samples': 11099136, 'steps': 57807, 'loss/train': 1.107480525970459} 11/07/2021 05:20:33 - INFO - __main__ - Step 57809: {'lr': 0.0003441587640805983, 'samples': 11099328, 'steps': 57808, 'loss/train': 1.224845290184021} 11/07/2021 05:20:34 - INFO - __main__ - Step 57810: {'lr': 0.0003441538480945697, 'samples': 11099520, 'steps': 57809, 'loss/train': 1.0226614475250244} 11/07/2021 05:20:34 - INFO - __main__ - Step 57811: {'lr': 0.00034414893206611695, 'samples': 11099712, 'steps': 57810, 'loss/train': 1.5952225923538208} 11/07/2021 05:20:35 - INFO - __main__ - Step 57812: {'lr': 0.0003441440159952422, 'samples': 11099904, 'steps': 57811, 'loss/train': 1.1762220859527588} 11/07/2021 05:20:36 - INFO - __main__ - Step 57813: {'lr': 0.00034413909988194753, 'samples': 11100096, 'steps': 57812, 'loss/train': 1.836982011795044} 11/07/2021 05:20:36 - INFO - __main__ - Step 57814: {'lr': 0.0003441341837262353, 'samples': 11100288, 'steps': 57813, 'loss/train': 0.7086502909660339} 11/07/2021 05:20:36 - INFO - __main__ - Step 57815: {'lr': 0.00034412926752810756, 'samples': 11100480, 'steps': 57814, 'loss/train': 0.8831301331520081} 11/07/2021 05:20:37 - INFO - __main__ - Step 57816: {'lr': 0.0003441243512875667, 'samples': 11100672, 'steps': 57815, 'loss/train': 1.4944149255752563} 11/07/2021 05:20:38 - INFO - __main__ - Step 57817: {'lr': 0.00034411943500461484, 'samples': 11100864, 'steps': 57816, 'loss/train': 1.751255750656128} 11/07/2021 05:20:38 - INFO - __main__ - Step 57818: {'lr': 0.0003441145186792542, 'samples': 11101056, 'steps': 57817, 'loss/train': 1.6801643371582031} 11/07/2021 05:20:38 - INFO - __main__ - Step 57819: {'lr': 0.000344109602311487, 'samples': 11101248, 'steps': 57818, 'loss/train': 1.4103548526763916} 11/07/2021 05:20:39 - INFO - __main__ - Step 57820: {'lr': 0.0003441046859013155, 'samples': 11101440, 'steps': 57819, 'loss/train': 1.3571488857269287} 11/07/2021 05:20:39 - INFO - __main__ - Step 57821: {'lr': 0.00034409976944874186, 'samples': 11101632, 'steps': 57820, 'loss/train': 1.3241790533065796} 11/07/2021 05:20:40 - INFO - __main__ - Step 57822: {'lr': 0.0003440948529537683, 'samples': 11101824, 'steps': 57821, 'loss/train': 1.1724662780761719} 11/07/2021 05:20:40 - INFO - __main__ - Step 57823: {'lr': 0.00034408993641639707, 'samples': 11102016, 'steps': 57822, 'loss/train': 1.1723467111587524} 11/07/2021 05:20:41 - INFO - __main__ - Step 57824: {'lr': 0.0003440850198366304, 'samples': 11102208, 'steps': 57823, 'loss/train': 1.08247971534729} 11/07/2021 05:20:41 - INFO - __main__ - Step 57825: {'lr': 0.0003440801032144704, 'samples': 11102400, 'steps': 57824, 'loss/train': 1.039486050605774} 11/07/2021 05:20:42 - INFO - __main__ - Step 57826: {'lr': 0.00034407518654991945, 'samples': 11102592, 'steps': 57825, 'loss/train': 1.1007189750671387} 11/07/2021 05:20:43 - INFO - __main__ - Step 57827: {'lr': 0.00034407026984297964, 'samples': 11102784, 'steps': 57826, 'loss/train': 1.5922685861587524} 11/07/2021 05:20:43 - INFO - __main__ - Step 57828: {'lr': 0.00034406535309365317, 'samples': 11102976, 'steps': 57827, 'loss/train': 1.230177402496338} 11/07/2021 05:20:43 - INFO - __main__ - Step 57829: {'lr': 0.0003440604363019423, 'samples': 11103168, 'steps': 57828, 'loss/train': 1.265795111656189} 11/07/2021 05:20:44 - INFO - __main__ - Step 57830: {'lr': 0.0003440555194678493, 'samples': 11103360, 'steps': 57829, 'loss/train': 1.4641492366790771} 11/07/2021 05:20:44 - INFO - __main__ - Step 57831: {'lr': 0.0003440506025913763, 'samples': 11103552, 'steps': 57830, 'loss/train': 1.6361924409866333} 11/07/2021 05:20:45 - INFO - __main__ - Step 57832: {'lr': 0.0003440456856725256, 'samples': 11103744, 'steps': 57831, 'loss/train': 1.487054467201233} 11/07/2021 05:20:45 - INFO - __main__ - Step 57833: {'lr': 0.0003440407687112993, 'samples': 11103936, 'steps': 57832, 'loss/train': 1.1568161249160767} 11/07/2021 05:20:46 - INFO - __main__ - Step 57834: {'lr': 0.0003440358517076997, 'samples': 11104128, 'steps': 57833, 'loss/train': 1.13609778881073} 11/07/2021 05:20:46 - INFO - __main__ - Step 57835: {'lr': 0.00034403093466172903, 'samples': 11104320, 'steps': 57834, 'loss/train': 1.3248300552368164} 11/07/2021 05:20:46 - INFO - __main__ - Step 57836: {'lr': 0.00034402601757338946, 'samples': 11104512, 'steps': 57835, 'loss/train': 1.3437557220458984} 11/07/2021 05:20:47 - INFO - __main__ - Step 57837: {'lr': 0.00034402110044268327, 'samples': 11104704, 'steps': 57836, 'loss/train': 1.1121588945388794} 11/07/2021 05:20:48 - INFO - __main__ - Step 57838: {'lr': 0.00034401618326961253, 'samples': 11104896, 'steps': 57837, 'loss/train': 1.7878296375274658} 11/07/2021 05:20:48 - INFO - __main__ - Step 57839: {'lr': 0.0003440112660541795, 'samples': 11105088, 'steps': 57838, 'loss/train': 1.859351396560669} 11/07/2021 05:20:48 - INFO - __main__ - Step 57840: {'lr': 0.0003440063487963866, 'samples': 11105280, 'steps': 57839, 'loss/train': 1.1802302598953247} 11/07/2021 05:20:49 - INFO - __main__ - Step 57841: {'lr': 0.00034400143149623574, 'samples': 11105472, 'steps': 57840, 'loss/train': 1.8758044242858887} 11/07/2021 05:20:50 - INFO - __main__ - Step 57842: {'lr': 0.0003439965141537294, 'samples': 11105664, 'steps': 57841, 'loss/train': 1.8026816844940186} 11/07/2021 05:20:50 - INFO - __main__ - Step 57843: {'lr': 0.00034399159676886965, 'samples': 11105856, 'steps': 57842, 'loss/train': 0.5418456792831421} 11/07/2021 05:20:51 - INFO - __main__ - Step 57844: {'lr': 0.00034398667934165873, 'samples': 11106048, 'steps': 57843, 'loss/train': 0.05089479684829712} 11/07/2021 05:20:51 - INFO - __main__ - Step 57845: {'lr': 0.00034398176187209887, 'samples': 11106240, 'steps': 57844, 'loss/train': 1.1053050756454468} 11/07/2021 05:20:51 - INFO - __main__ - Step 57846: {'lr': 0.0003439768443601923, 'samples': 11106432, 'steps': 57845, 'loss/train': 1.5870858430862427} 11/07/2021 05:20:52 - INFO - __main__ - Step 57847: {'lr': 0.0003439719268059411, 'samples': 11106624, 'steps': 57846, 'loss/train': 1.7755992412567139} 11/07/2021 05:20:53 - INFO - __main__ - Step 57848: {'lr': 0.0003439670092093478, 'samples': 11106816, 'steps': 57847, 'loss/train': 0.8469088673591614} 11/07/2021 05:20:53 - INFO - __main__ - Step 57849: {'lr': 0.00034396209157041424, 'samples': 11107008, 'steps': 57848, 'loss/train': 1.2248122692108154} 11/07/2021 05:20:53 - INFO - __main__ - Step 57850: {'lr': 0.0003439571738891428, 'samples': 11107200, 'steps': 57849, 'loss/train': 0.9531872868537903} 11/07/2021 05:20:54 - INFO - __main__ - Step 57851: {'lr': 0.00034395225616553585, 'samples': 11107392, 'steps': 57850, 'loss/train': 1.5660516023635864} 11/07/2021 05:20:54 - INFO - __main__ - Step 57852: {'lr': 0.00034394733839959534, 'samples': 11107584, 'steps': 57851, 'loss/train': 1.6531728506088257} 11/07/2021 05:20:55 - INFO - __main__ - Step 57853: {'lr': 0.0003439424205913236, 'samples': 11107776, 'steps': 57852, 'loss/train': 1.7772634029388428} 11/07/2021 05:20:56 - INFO - __main__ - Step 57854: {'lr': 0.000343937502740723, 'samples': 11107968, 'steps': 57853, 'loss/train': 1.7304883003234863} 11/07/2021 05:20:56 - INFO - __main__ - Step 57855: {'lr': 0.00034393258484779555, 'samples': 11108160, 'steps': 57854, 'loss/train': 1.537498116493225} 11/07/2021 05:20:56 - INFO - __main__ - Step 57856: {'lr': 0.0003439276669125435, 'samples': 11108352, 'steps': 57855, 'loss/train': 1.427783727645874} 11/07/2021 05:20:57 - INFO - __main__ - Step 57857: {'lr': 0.00034392274893496903, 'samples': 11108544, 'steps': 57856, 'loss/train': 1.1278772354125977} 11/07/2021 05:20:58 - INFO - __main__ - Step 57858: {'lr': 0.0003439178309150745, 'samples': 11108736, 'steps': 57857, 'loss/train': 1.3305517435073853} 11/07/2021 05:20:58 - INFO - __main__ - Step 57859: {'lr': 0.000343912912852862, 'samples': 11108928, 'steps': 57858, 'loss/train': 1.2788364887237549} 11/07/2021 05:20:58 - INFO - __main__ - Step 57860: {'lr': 0.00034390799474833385, 'samples': 11109120, 'steps': 57859, 'loss/train': 0.5940954089164734} 11/07/2021 05:20:59 - INFO - __main__ - Step 57861: {'lr': 0.0003439030766014922, 'samples': 11109312, 'steps': 57860, 'loss/train': 1.5349797010421753} 11/07/2021 05:20:59 - INFO - __main__ - Step 57862: {'lr': 0.0003438981584123392, 'samples': 11109504, 'steps': 57861, 'loss/train': 1.8759766817092896} 11/07/2021 05:21:00 - INFO - __main__ - Step 57863: {'lr': 0.0003438932401808772, 'samples': 11109696, 'steps': 57862, 'loss/train': 0.8557454943656921} 11/07/2021 05:21:01 - INFO - __main__ - Step 57864: {'lr': 0.0003438883219071083, 'samples': 11109888, 'steps': 57863, 'loss/train': 1.0207797288894653} 11/07/2021 05:21:01 - INFO - __main__ - Step 57865: {'lr': 0.00034388340359103485, 'samples': 11110080, 'steps': 57864, 'loss/train': 1.1741703748703003} 11/07/2021 05:21:01 - INFO - __main__ - Step 57866: {'lr': 0.0003438784852326589, 'samples': 11110272, 'steps': 57865, 'loss/train': 0.4235229194164276} 11/07/2021 05:21:02 - INFO - __main__ - Step 57867: {'lr': 0.0003438735668319828, 'samples': 11110464, 'steps': 57866, 'loss/train': 1.6470001935958862} 11/07/2021 05:21:03 - INFO - __main__ - Step 57868: {'lr': 0.00034386864838900877, 'samples': 11110656, 'steps': 57867, 'loss/train': 1.5702577829360962} 11/07/2021 05:21:03 - INFO - __main__ - Step 57869: {'lr': 0.00034386372990373893, 'samples': 11110848, 'steps': 57868, 'loss/train': 1.312443733215332} 11/07/2021 05:21:03 - INFO - __main__ - Step 57870: {'lr': 0.0003438588113761755, 'samples': 11111040, 'steps': 57869, 'loss/train': 0.07350760698318481} 11/07/2021 05:21:04 - INFO - __main__ - Step 57871: {'lr': 0.00034385389280632077, 'samples': 11111232, 'steps': 57870, 'loss/train': 1.0477763414382935} 11/07/2021 05:21:04 - INFO - __main__ - Step 57872: {'lr': 0.00034384897419417694, 'samples': 11111424, 'steps': 57871, 'loss/train': 0.7560630440711975} 11/07/2021 05:21:05 - INFO - __main__ - Step 57873: {'lr': 0.0003438440555397462, 'samples': 11111616, 'steps': 57872, 'loss/train': 1.059000015258789} 11/07/2021 05:21:06 - INFO - __main__ - Step 57874: {'lr': 0.00034383913684303075, 'samples': 11111808, 'steps': 57873, 'loss/train': 1.5997111797332764} 11/07/2021 05:21:06 - INFO - __main__ - Step 57875: {'lr': 0.00034383421810403294, 'samples': 11112000, 'steps': 57874, 'loss/train': 1.3373414278030396} 11/07/2021 05:21:07 - INFO - __main__ - Step 57876: {'lr': 0.00034382929932275476, 'samples': 11112192, 'steps': 57875, 'loss/train': 1.5026295185089111} 11/07/2021 05:21:07 - INFO - __main__ - Step 57877: {'lr': 0.0003438243804991986, 'samples': 11112384, 'steps': 57876, 'loss/train': 1.6113172769546509} 11/07/2021 05:21:08 - INFO - __main__ - Step 57878: {'lr': 0.0003438194616333666, 'samples': 11112576, 'steps': 57877, 'loss/train': 0.31013643741607666} 11/07/2021 05:21:08 - INFO - __main__ - Step 57879: {'lr': 0.00034381454272526096, 'samples': 11112768, 'steps': 57878, 'loss/train': 1.2493027448654175} 11/07/2021 05:21:09 - INFO - __main__ - Step 57880: {'lr': 0.000343809623774884, 'samples': 11112960, 'steps': 57879, 'loss/train': 1.2290037870407104} 11/07/2021 05:21:09 - INFO - __main__ - Step 57881: {'lr': 0.0003438047047822379, 'samples': 11113152, 'steps': 57880, 'loss/train': 1.3986181020736694} 11/07/2021 05:21:09 - INFO - __main__ - Step 57882: {'lr': 0.0003437997857473248, 'samples': 11113344, 'steps': 57881, 'loss/train': 1.0933270454406738} 11/07/2021 05:21:10 - INFO - __main__ - Step 57883: {'lr': 0.0003437948666701469, 'samples': 11113536, 'steps': 57882, 'loss/train': 0.9028867483139038} 11/07/2021 05:21:11 - INFO - __main__ - Step 57884: {'lr': 0.00034378994755070657, 'samples': 11113728, 'steps': 57883, 'loss/train': 1.2105731964111328} 11/07/2021 05:21:11 - INFO - __main__ - Step 57885: {'lr': 0.00034378502838900587, 'samples': 11113920, 'steps': 57884, 'loss/train': 1.6737838983535767} 11/07/2021 05:21:11 - INFO - __main__ - Step 57886: {'lr': 0.00034378010918504714, 'samples': 11114112, 'steps': 57885, 'loss/train': 1.116660714149475} 11/07/2021 05:21:12 - INFO - __main__ - Step 57887: {'lr': 0.0003437751899388325, 'samples': 11114304, 'steps': 57886, 'loss/train': 1.4971961975097656} 11/07/2021 05:21:13 - INFO - __main__ - Step 57888: {'lr': 0.00034377027065036423, 'samples': 11114496, 'steps': 57887, 'loss/train': 1.456732153892517} 11/07/2021 05:21:13 - INFO - __main__ - Step 57889: {'lr': 0.0003437653513196446, 'samples': 11114688, 'steps': 57888, 'loss/train': 1.1172996759414673} 11/07/2021 05:21:14 - INFO - __main__ - Step 57890: {'lr': 0.0003437604319466756, 'samples': 11114880, 'steps': 57889, 'loss/train': 1.5059787034988403} 11/07/2021 05:21:14 - INFO - __main__ - Step 57891: {'lr': 0.0003437555125314597, 'samples': 11115072, 'steps': 57890, 'loss/train': 0.9523252248764038} 11/07/2021 05:21:14 - INFO - __main__ - Step 57892: {'lr': 0.00034375059307399896, 'samples': 11115264, 'steps': 57891, 'loss/train': 1.4915522336959839} 11/07/2021 05:21:15 - INFO - __main__ - Step 57893: {'lr': 0.00034374567357429563, 'samples': 11115456, 'steps': 57892, 'loss/train': 1.5016074180603027} 11/07/2021 05:21:16 - INFO - __main__ - Step 57894: {'lr': 0.000343740754032352, 'samples': 11115648, 'steps': 57893, 'loss/train': 1.9699833393096924} 11/07/2021 05:21:16 - INFO - __main__ - Step 57895: {'lr': 0.00034373583444817024, 'samples': 11115840, 'steps': 57894, 'loss/train': 1.5629128217697144} 11/07/2021 05:21:17 - INFO - __main__ - Step 57896: {'lr': 0.0003437309148217526, 'samples': 11116032, 'steps': 57895, 'loss/train': 1.4169114828109741} 11/07/2021 05:21:17 - INFO - __main__ - Step 57897: {'lr': 0.00034372599515310117, 'samples': 11116224, 'steps': 57896, 'loss/train': 1.3311491012573242} 11/07/2021 05:21:17 - INFO - __main__ - Step 57898: {'lr': 0.00034372107544221824, 'samples': 11116416, 'steps': 57897, 'loss/train': 1.322851538658142} 11/07/2021 05:21:18 - INFO - __main__ - Step 57899: {'lr': 0.00034371615568910607, 'samples': 11116608, 'steps': 57898, 'loss/train': 1.0532596111297607} 11/07/2021 05:21:19 - INFO - __main__ - Step 57900: {'lr': 0.00034371123589376683, 'samples': 11116800, 'steps': 57899, 'loss/train': 1.5819395780563354} 11/07/2021 05:21:19 - INFO - __main__ - Step 57901: {'lr': 0.00034370631605620285, 'samples': 11116992, 'steps': 57900, 'loss/train': 1.5906492471694946} 11/07/2021 05:21:19 - INFO - __main__ - Step 57902: {'lr': 0.0003437013961764162, 'samples': 11117184, 'steps': 57901, 'loss/train': 1.4747021198272705} 11/07/2021 05:21:20 - INFO - __main__ - Step 57903: {'lr': 0.00034369647625440906, 'samples': 11117376, 'steps': 57902, 'loss/train': 1.3367058038711548} 11/07/2021 05:21:21 - INFO - __main__ - Step 57904: {'lr': 0.00034369155629018376, 'samples': 11117568, 'steps': 57903, 'loss/train': 1.4811495542526245} 11/07/2021 05:21:21 - INFO - __main__ - Step 57905: {'lr': 0.00034368663628374255, 'samples': 11117760, 'steps': 57904, 'loss/train': 1.115447759628296} 11/07/2021 05:21:22 - INFO - __main__ - Step 57906: {'lr': 0.0003436817162350876, 'samples': 11117952, 'steps': 57905, 'loss/train': 1.1253905296325684} 11/07/2021 05:21:22 - INFO - __main__ - Step 57907: {'lr': 0.00034367679614422103, 'samples': 11118144, 'steps': 57906, 'loss/train': 1.396866798400879} 11/07/2021 05:21:22 - INFO - __main__ - Step 57908: {'lr': 0.0003436718760111452, 'samples': 11118336, 'steps': 57907, 'loss/train': 1.3769924640655518} 11/07/2021 05:21:23 - INFO - __main__ - Step 57909: {'lr': 0.0003436669558358623, 'samples': 11118528, 'steps': 57908, 'loss/train': 1.8341175317764282} 11/07/2021 05:21:24 - INFO - __main__ - Step 57910: {'lr': 0.00034366203561837446, 'samples': 11118720, 'steps': 57909, 'loss/train': 1.081642508506775} 11/07/2021 05:21:24 - INFO - __main__ - Step 57911: {'lr': 0.00034365711535868396, 'samples': 11118912, 'steps': 57910, 'loss/train': 1.3062201738357544} 11/07/2021 05:21:24 - INFO - __main__ - Step 57912: {'lr': 0.000343652195056793, 'samples': 11119104, 'steps': 57911, 'loss/train': 1.5675636529922485} 11/07/2021 05:21:25 - INFO - __main__ - Step 57913: {'lr': 0.0003436472747127038, 'samples': 11119296, 'steps': 57912, 'loss/train': 1.618319034576416} 11/07/2021 05:21:26 - INFO - __main__ - Step 57914: {'lr': 0.0003436423543264186, 'samples': 11119488, 'steps': 57913, 'loss/train': 1.3111189603805542} 11/07/2021 05:21:26 - INFO - __main__ - Step 57915: {'lr': 0.00034363743389793965, 'samples': 11119680, 'steps': 57914, 'loss/train': 1.3982800245285034} 11/07/2021 05:21:26 - INFO - __main__ - Step 57916: {'lr': 0.0003436325134272691, 'samples': 11119872, 'steps': 57915, 'loss/train': 1.8558919429779053} 11/07/2021 05:21:27 - INFO - __main__ - Step 57917: {'lr': 0.0003436275929144091, 'samples': 11120064, 'steps': 57916, 'loss/train': 0.7404921054840088} 11/07/2021 05:21:27 - INFO - __main__ - Step 57918: {'lr': 0.000343622672359362, 'samples': 11120256, 'steps': 57917, 'loss/train': 1.2649706602096558} 11/07/2021 05:21:28 - INFO - __main__ - Step 57919: {'lr': 0.0003436177517621299, 'samples': 11120448, 'steps': 57918, 'loss/train': 1.645738959312439} 11/07/2021 05:21:29 - INFO - __main__ - Step 57920: {'lr': 0.0003436128311227152, 'samples': 11120640, 'steps': 57919, 'loss/train': 1.0767428874969482} 11/07/2021 05:21:29 - INFO - __main__ - Step 57921: {'lr': 0.00034360791044111996, 'samples': 11120832, 'steps': 57920, 'loss/train': 1.6540600061416626} 11/07/2021 05:21:29 - INFO - __main__ - Step 57922: {'lr': 0.00034360298971734647, 'samples': 11121024, 'steps': 57921, 'loss/train': 0.6858277916908264} 11/07/2021 05:21:30 - INFO - __main__ - Step 57923: {'lr': 0.00034359806895139686, 'samples': 11121216, 'steps': 57922, 'loss/train': 1.1156251430511475} 11/07/2021 05:21:31 - INFO - __main__ - Step 57924: {'lr': 0.0003435931481432735, 'samples': 11121408, 'steps': 57923, 'loss/train': 1.3785035610198975} 11/07/2021 05:21:31 - INFO - __main__ - Step 57925: {'lr': 0.00034358822729297847, 'samples': 11121600, 'steps': 57924, 'loss/train': 2.4129350185394287} 11/07/2021 05:21:31 - INFO - __main__ - Step 57926: {'lr': 0.00034358330640051396, 'samples': 11121792, 'steps': 57925, 'loss/train': 1.0540449619293213} 11/07/2021 05:21:32 - INFO - __main__ - Step 57927: {'lr': 0.0003435783854658823, 'samples': 11121984, 'steps': 57926, 'loss/train': 0.7716109156608582} 11/07/2021 05:21:32 - INFO - __main__ - Step 57928: {'lr': 0.00034357346448908566, 'samples': 11122176, 'steps': 57927, 'loss/train': 1.7647292613983154} 11/07/2021 05:21:32 - INFO - __main__ - Step 57929: {'lr': 0.00034356854347012626, 'samples': 11122368, 'steps': 57928, 'loss/train': 5.723478317260742} 11/07/2021 05:21:34 - INFO - __main__ - Step 57930: {'lr': 0.00034356362240900635, 'samples': 11122560, 'steps': 57929, 'loss/train': 1.1204917430877686} 11/07/2021 05:21:34 - INFO - __main__ - Step 57931: {'lr': 0.0003435587013057281, 'samples': 11122752, 'steps': 57930, 'loss/train': 1.4171696901321411} 11/07/2021 05:21:34 - INFO - __main__ - Step 57932: {'lr': 0.0003435537801602937, 'samples': 11122944, 'steps': 57931, 'loss/train': 0.584794819355011} 11/07/2021 05:21:35 - INFO - __main__ - Step 57933: {'lr': 0.00034354885897270546, 'samples': 11123136, 'steps': 57932, 'loss/train': 0.9820351600646973} 11/07/2021 05:21:35 - INFO - __main__ - Step 57934: {'lr': 0.0003435439377429655, 'samples': 11123328, 'steps': 57933, 'loss/train': 1.352911353111267} 11/07/2021 05:21:36 - INFO - __main__ - Step 57935: {'lr': 0.00034353901647107615, 'samples': 11123520, 'steps': 57934, 'loss/train': 1.4055920839309692} 11/07/2021 05:21:36 - INFO - __main__ - Step 57936: {'lr': 0.0003435340951570395, 'samples': 11123712, 'steps': 57935, 'loss/train': 1.2950204610824585} 11/07/2021 05:21:37 - INFO - __main__ - Step 57937: {'lr': 0.00034352917380085784, 'samples': 11123904, 'steps': 57936, 'loss/train': 1.9416735172271729} 11/07/2021 05:21:37 - INFO - __main__ - Step 57938: {'lr': 0.00034352425240253344, 'samples': 11124096, 'steps': 57937, 'loss/train': 1.0612602233886719} 11/07/2021 05:21:37 - INFO - __main__ - Step 57939: {'lr': 0.0003435193309620684, 'samples': 11124288, 'steps': 57938, 'loss/train': 1.3730577230453491} 11/07/2021 05:21:38 - INFO - __main__ - Step 57940: {'lr': 0.000343514409479465, 'samples': 11124480, 'steps': 57939, 'loss/train': 1.103216290473938} 11/07/2021 05:21:39 - INFO - __main__ - Step 57941: {'lr': 0.00034350948795472543, 'samples': 11124672, 'steps': 57940, 'loss/train': 1.063675880432129} 11/07/2021 05:21:39 - INFO - __main__ - Step 57942: {'lr': 0.000343504566387852, 'samples': 11124864, 'steps': 57941, 'loss/train': 1.0157917737960815} 11/07/2021 05:21:40 - INFO - __main__ - Step 57943: {'lr': 0.0003434996447788468, 'samples': 11125056, 'steps': 57942, 'loss/train': 1.411668062210083} 11/07/2021 05:21:40 - INFO - __main__ - Step 57944: {'lr': 0.0003434947231277121, 'samples': 11125248, 'steps': 57943, 'loss/train': 1.6276054382324219} 11/07/2021 05:21:40 - INFO - __main__ - Step 57945: {'lr': 0.0003434898014344501, 'samples': 11125440, 'steps': 57944, 'loss/train': 1.142353892326355} 11/07/2021 05:21:42 - INFO - __main__ - Step 57946: {'lr': 0.00034348487969906307, 'samples': 11125632, 'steps': 57945, 'loss/train': 2.2820980548858643} 11/07/2021 05:21:42 - INFO - __main__ - Step 57947: {'lr': 0.00034347995792155316, 'samples': 11125824, 'steps': 57946, 'loss/train': 1.163718819618225} 11/07/2021 05:21:42 - INFO - __main__ - Step 57948: {'lr': 0.00034347503610192265, 'samples': 11126016, 'steps': 57947, 'loss/train': 1.7779464721679688} 11/07/2021 05:21:43 - INFO - __main__ - Step 57949: {'lr': 0.0003434701142401738, 'samples': 11126208, 'steps': 57948, 'loss/train': 1.4960453510284424} 11/07/2021 05:21:43 - INFO - __main__ - Step 57950: {'lr': 0.0003434651923363087, 'samples': 11126400, 'steps': 57949, 'loss/train': 0.7504159808158875} 11/07/2021 05:21:44 - INFO - __main__ - Step 57951: {'lr': 0.0003434602703903296, 'samples': 11126592, 'steps': 57950, 'loss/train': 1.6045472621917725} 11/07/2021 05:21:44 - INFO - __main__ - Step 57952: {'lr': 0.0003434553484022388, 'samples': 11126784, 'steps': 57951, 'loss/train': 0.257572740316391} 11/07/2021 05:21:45 - INFO - __main__ - Step 57953: {'lr': 0.0003434504263720384, 'samples': 11126976, 'steps': 57952, 'loss/train': 1.4461199045181274} 11/07/2021 05:21:45 - INFO - __main__ - Step 57954: {'lr': 0.0003434455042997307, 'samples': 11127168, 'steps': 57953, 'loss/train': 1.3689465522766113} 11/07/2021 05:21:45 - INFO - __main__ - Step 57955: {'lr': 0.00034344058218531794, 'samples': 11127360, 'steps': 57954, 'loss/train': 1.4795900583267212} 11/07/2021 05:21:47 - INFO - __main__ - Step 57956: {'lr': 0.0003434356600288023, 'samples': 11127552, 'steps': 57955, 'loss/train': 1.458519458770752} 11/07/2021 05:21:47 - INFO - __main__ - Step 57957: {'lr': 0.00034343073783018593, 'samples': 11127744, 'steps': 57956, 'loss/train': 1.0316636562347412} 11/07/2021 05:21:47 - INFO - __main__ - Step 57958: {'lr': 0.00034342581558947113, 'samples': 11127936, 'steps': 57957, 'loss/train': 1.6033358573913574} 11/07/2021 05:21:48 - INFO - __main__ - Step 57959: {'lr': 0.00034342089330666, 'samples': 11128128, 'steps': 57958, 'loss/train': 1.5310497283935547} 11/07/2021 05:21:48 - INFO - __main__ - Step 57960: {'lr': 0.00034341597098175503, 'samples': 11128320, 'steps': 57959, 'loss/train': 1.3717107772827148} 11/07/2021 05:21:49 - INFO - __main__ - Step 57961: {'lr': 0.0003434110486147582, 'samples': 11128512, 'steps': 57960, 'loss/train': 1.3544975519180298} 11/07/2021 05:21:50 - INFO - __main__ - Step 57962: {'lr': 0.0003434061262056718, 'samples': 11128704, 'steps': 57961, 'loss/train': 1.8260358572006226} 11/07/2021 05:21:50 - INFO - __main__ - Step 57963: {'lr': 0.0003434012037544981, 'samples': 11128896, 'steps': 57962, 'loss/train': 1.7426804304122925} 11/07/2021 05:21:50 - INFO - __main__ - Step 57964: {'lr': 0.0003433962812612391, 'samples': 11129088, 'steps': 57963, 'loss/train': 1.6939276456832886} 11/07/2021 05:21:51 - INFO - __main__ - Step 57965: {'lr': 0.0003433913587258973, 'samples': 11129280, 'steps': 57964, 'loss/train': 0.47931137681007385} 11/07/2021 05:21:51 - INFO - __main__ - Step 57966: {'lr': 0.0003433864361484748, 'samples': 11129472, 'steps': 57965, 'loss/train': 0.7440065145492554} 11/07/2021 05:21:52 - INFO - __main__ - Step 57967: {'lr': 0.00034338151352897376, 'samples': 11129664, 'steps': 57966, 'loss/train': 1.2602134943008423} 11/07/2021 05:21:52 - INFO - __main__ - Step 57968: {'lr': 0.00034337659086739646, 'samples': 11129856, 'steps': 57967, 'loss/train': 1.6563085317611694} 11/07/2021 05:21:53 - INFO - __main__ - Step 57969: {'lr': 0.0003433716681637451, 'samples': 11130048, 'steps': 57968, 'loss/train': 1.4218497276306152} 11/07/2021 05:21:53 - INFO - __main__ - Step 57970: {'lr': 0.0003433667454180219, 'samples': 11130240, 'steps': 57969, 'loss/train': 1.4764411449432373} 11/07/2021 05:21:53 - INFO - __main__ - Step 57971: {'lr': 0.00034336182263022916, 'samples': 11130432, 'steps': 57970, 'loss/train': 1.9545987844467163} 11/07/2021 05:21:55 - INFO - __main__ - Step 57972: {'lr': 0.000343356899800369, 'samples': 11130624, 'steps': 57971, 'loss/train': 1.9532370567321777} 11/07/2021 05:21:55 - INFO - __main__ - Step 57973: {'lr': 0.0003433519769284436, 'samples': 11130816, 'steps': 57972, 'loss/train': 0.43979793787002563} 11/07/2021 05:21:55 - INFO - __main__ - Step 57974: {'lr': 0.00034334705401445527, 'samples': 11131008, 'steps': 57973, 'loss/train': 1.2245796918869019} 11/07/2021 05:21:56 - INFO - __main__ - Step 57975: {'lr': 0.00034334213105840616, 'samples': 11131200, 'steps': 57974, 'loss/train': 1.6564574241638184} 11/07/2021 05:21:56 - INFO - __main__ - Step 57976: {'lr': 0.00034333720806029863, 'samples': 11131392, 'steps': 57975, 'loss/train': 1.4846664667129517} 11/07/2021 05:21:57 - INFO - __main__ - Step 57977: {'lr': 0.00034333228502013473, 'samples': 11131584, 'steps': 57976, 'loss/train': 1.1061686277389526} 11/07/2021 05:21:57 - INFO - __main__ - Step 57978: {'lr': 0.00034332736193791675, 'samples': 11131776, 'steps': 57977, 'loss/train': 1.5870442390441895} 11/07/2021 05:21:58 - INFO - __main__ - Step 57979: {'lr': 0.0003433224388136469, 'samples': 11131968, 'steps': 57978, 'loss/train': 1.3707860708236694} 11/07/2021 05:21:58 - INFO - __main__ - Step 57980: {'lr': 0.0003433175156473274, 'samples': 11132160, 'steps': 57979, 'loss/train': 1.1279058456420898} 11/07/2021 05:21:58 - INFO - __main__ - Step 57981: {'lr': 0.0003433125924389604, 'samples': 11132352, 'steps': 57980, 'loss/train': 1.3661913871765137} 11/07/2021 05:21:59 - INFO - __main__ - Step 57982: {'lr': 0.00034330766918854827, 'samples': 11132544, 'steps': 57981, 'loss/train': 1.43588125705719} 11/07/2021 05:22:00 - INFO - __main__ - Step 57983: {'lr': 0.0003433027458960932, 'samples': 11132736, 'steps': 57982, 'loss/train': 1.4345775842666626} 11/07/2021 05:22:00 - INFO - __main__ - Step 57984: {'lr': 0.00034329782256159724, 'samples': 11132928, 'steps': 57983, 'loss/train': 1.5670498609542847} 11/07/2021 05:22:00 - INFO - __main__ - Step 57985: {'lr': 0.00034329289918506276, 'samples': 11133120, 'steps': 57984, 'loss/train': 0.8577389717102051} 11/07/2021 05:22:01 - INFO - __main__ - Step 57986: {'lr': 0.0003432879757664919, 'samples': 11133312, 'steps': 57985, 'loss/train': 0.9772616028785706} 11/07/2021 05:22:02 - INFO - __main__ - Step 57987: {'lr': 0.00034328305230588694, 'samples': 11133504, 'steps': 57986, 'loss/train': 1.9733684062957764} 11/07/2021 05:22:02 - INFO - __main__ - Step 57988: {'lr': 0.0003432781288032501, 'samples': 11133696, 'steps': 57987, 'loss/train': 1.7668211460113525} 11/07/2021 05:22:03 - INFO - __main__ - Step 57989: {'lr': 0.00034327320525858357, 'samples': 11133888, 'steps': 57988, 'loss/train': 1.6571251153945923} 11/07/2021 05:22:03 - INFO - __main__ - Step 57990: {'lr': 0.00034326828167188957, 'samples': 11134080, 'steps': 57989, 'loss/train': 1.3920128345489502} 11/07/2021 05:22:03 - INFO - __main__ - Step 57991: {'lr': 0.0003432633580431703, 'samples': 11134272, 'steps': 57990, 'loss/train': 1.3259575366973877} 11/07/2021 05:22:04 - INFO - __main__ - Step 57992: {'lr': 0.00034325843437242804, 'samples': 11134464, 'steps': 57991, 'loss/train': 1.3787813186645508} 11/07/2021 05:22:05 - INFO - __main__ - Step 57993: {'lr': 0.0003432535106596649, 'samples': 11134656, 'steps': 57992, 'loss/train': 1.1121562719345093} 11/07/2021 05:22:05 - INFO - __main__ - Step 57994: {'lr': 0.00034324858690488324, 'samples': 11134848, 'steps': 57993, 'loss/train': 0.5191564559936523} 11/07/2021 05:22:05 - INFO - __main__ - Step 57995: {'lr': 0.0003432436631080851, 'samples': 11135040, 'steps': 57994, 'loss/train': 1.6217098236083984} 11/07/2021 05:22:06 - INFO - __main__ - Step 57996: {'lr': 0.00034323873926927296, 'samples': 11135232, 'steps': 57995, 'loss/train': 1.472947359085083} 11/07/2021 05:22:06 - INFO - __main__ - Step 57997: {'lr': 0.00034323381538844884, 'samples': 11135424, 'steps': 57996, 'loss/train': 0.7900754809379578} 11/07/2021 05:22:07 - INFO - __main__ - Step 57998: {'lr': 0.0003432288914656149, 'samples': 11135616, 'steps': 57997, 'loss/train': 1.040427327156067} 11/07/2021 05:22:08 - INFO - __main__ - Step 57999: {'lr': 0.00034322396750077354, 'samples': 11135808, 'steps': 57998, 'loss/train': 1.0184991359710693} 11/07/2021 05:22:08 - INFO - __main__ - Step 58000: {'lr': 0.0003432190434939269, 'samples': 11136000, 'steps': 57999, 'loss/train': 1.4191187620162964} 11/07/2021 05:22:08 - INFO - __main__ - Step 58001: {'lr': 0.0003432141194450772, 'samples': 11136192, 'steps': 58000, 'loss/train': 1.4723877906799316} 11/07/2021 05:22:09 - INFO - __main__ - Step 58002: {'lr': 0.0003432091953542267, 'samples': 11136384, 'steps': 58001, 'loss/train': 1.513293981552124} 11/07/2021 05:22:10 - INFO - __main__ - Step 58003: {'lr': 0.00034320427122137745, 'samples': 11136576, 'steps': 58002, 'loss/train': 1.0501506328582764} 11/07/2021 05:22:10 - INFO - __main__ - Step 58004: {'lr': 0.0003431993470465319, 'samples': 11136768, 'steps': 58003, 'loss/train': 1.3065688610076904} 11/07/2021 05:22:10 - INFO - __main__ - Step 58005: {'lr': 0.00034319442282969206, 'samples': 11136960, 'steps': 58004, 'loss/train': 0.4899785816669464} 11/07/2021 05:22:11 - INFO - __main__ - Step 58006: {'lr': 0.0003431894985708603, 'samples': 11137152, 'steps': 58005, 'loss/train': 1.4666774272918701} 11/07/2021 05:22:11 - INFO - __main__ - Step 58007: {'lr': 0.0003431845742700388, 'samples': 11137344, 'steps': 58006, 'loss/train': 1.3265217542648315} 11/07/2021 05:22:12 - INFO - __main__ - Step 58008: {'lr': 0.00034317964992722975, 'samples': 11137536, 'steps': 58007, 'loss/train': 1.5669384002685547} 11/07/2021 05:22:12 - INFO - __main__ - Step 58009: {'lr': 0.00034317472554243545, 'samples': 11137728, 'steps': 58008, 'loss/train': 0.9563673138618469} 11/07/2021 05:22:13 - INFO - __main__ - Step 58010: {'lr': 0.00034316980111565796, 'samples': 11137920, 'steps': 58009, 'loss/train': 1.1143641471862793} 11/07/2021 05:22:13 - INFO - __main__ - Step 58011: {'lr': 0.00034316487664689974, 'samples': 11138112, 'steps': 58010, 'loss/train': 0.9312244057655334} 11/07/2021 05:22:13 - INFO - __main__ - Step 58012: {'lr': 0.00034315995213616266, 'samples': 11138304, 'steps': 58011, 'loss/train': 1.4314597845077515} 11/07/2021 05:22:14 - INFO - __main__ - Step 58013: {'lr': 0.0003431550275834493, 'samples': 11138496, 'steps': 58012, 'loss/train': 1.2774111032485962} 11/07/2021 05:22:15 - INFO - __main__ - Step 58014: {'lr': 0.0003431501029887617, 'samples': 11138688, 'steps': 58013, 'loss/train': 1.5383182764053345} 11/07/2021 05:22:15 - INFO - __main__ - Step 58015: {'lr': 0.00034314517835210207, 'samples': 11138880, 'steps': 58014, 'loss/train': 1.4472438097000122} 11/07/2021 05:22:16 - INFO - __main__ - Step 58016: {'lr': 0.00034314025367347266, 'samples': 11139072, 'steps': 58015, 'loss/train': 1.0739012956619263} 11/07/2021 05:22:16 - INFO - __main__ - Step 58017: {'lr': 0.00034313532895287574, 'samples': 11139264, 'steps': 58016, 'loss/train': 1.1794275045394897} 11/07/2021 05:22:17 - INFO - __main__ - Step 58018: {'lr': 0.00034313040419031336, 'samples': 11139456, 'steps': 58017, 'loss/train': 1.309901237487793} 11/07/2021 05:22:17 - INFO - __main__ - Step 58019: {'lr': 0.00034312547938578796, 'samples': 11139648, 'steps': 58018, 'loss/train': 1.4783639907836914} 11/07/2021 05:22:18 - INFO - __main__ - Step 58020: {'lr': 0.0003431205545393016, 'samples': 11139840, 'steps': 58019, 'loss/train': 1.6573526859283447} 11/07/2021 05:22:18 - INFO - __main__ - Step 58021: {'lr': 0.00034311562965085664, 'samples': 11140032, 'steps': 58020, 'loss/train': 1.2112683057785034} 11/07/2021 05:22:18 - INFO - __main__ - Step 58022: {'lr': 0.0003431107047204552, 'samples': 11140224, 'steps': 58021, 'loss/train': 1.7318570613861084} 11/07/2021 05:22:19 - INFO - __main__ - Step 58023: {'lr': 0.00034310577974809944, 'samples': 11140416, 'steps': 58022, 'loss/train': 1.3370510339736938} 11/07/2021 05:22:20 - INFO - __main__ - Step 58024: {'lr': 0.0003431008547337917, 'samples': 11140608, 'steps': 58023, 'loss/train': 1.460934042930603} 11/07/2021 05:22:20 - INFO - __main__ - Step 58025: {'lr': 0.0003430959296775341, 'samples': 11140800, 'steps': 58024, 'loss/train': 1.3419275283813477} 11/07/2021 05:22:21 - INFO - __main__ - Step 58026: {'lr': 0.00034309100457932895, 'samples': 11140992, 'steps': 58025, 'loss/train': 1.6616933345794678} 11/07/2021 05:22:21 - INFO - __main__ - Step 58027: {'lr': 0.0003430860794391784, 'samples': 11141184, 'steps': 58026, 'loss/train': 1.6319591999053955} 11/07/2021 05:22:22 - INFO - __main__ - Step 58028: {'lr': 0.00034308115425708477, 'samples': 11141376, 'steps': 58027, 'loss/train': 1.709490418434143} 11/07/2021 05:22:22 - INFO - __main__ - Step 58029: {'lr': 0.0003430762290330501, 'samples': 11141568, 'steps': 58028, 'loss/train': 1.3632644414901733} 11/07/2021 05:22:22 - INFO - __main__ - Step 58030: {'lr': 0.00034307130376707684, 'samples': 11141760, 'steps': 58029, 'loss/train': 1.4385491609573364} 11/07/2021 05:22:23 - INFO - __main__ - Step 58031: {'lr': 0.000343066378459167, 'samples': 11141952, 'steps': 58030, 'loss/train': 1.7479932308197021} 11/07/2021 05:22:23 - INFO - __main__ - Step 58032: {'lr': 0.00034306145310932293, 'samples': 11142144, 'steps': 58031, 'loss/train': 0.8931760787963867} 11/07/2021 05:22:24 - INFO - __main__ - Step 58033: {'lr': 0.0003430565277175468, 'samples': 11142336, 'steps': 58032, 'loss/train': 1.4687832593917847} 11/07/2021 05:22:25 - INFO - __main__ - Step 58034: {'lr': 0.0003430516022838408, 'samples': 11142528, 'steps': 58033, 'loss/train': 1.2242742776870728} 11/07/2021 05:22:25 - INFO - __main__ - Step 58035: {'lr': 0.00034304667680820714, 'samples': 11142720, 'steps': 58034, 'loss/train': 1.1620845794677734} 11/07/2021 05:22:25 - INFO - __main__ - Step 58036: {'lr': 0.0003430417512906482, 'samples': 11142912, 'steps': 58035, 'loss/train': 1.0705658197402954} 11/07/2021 05:22:26 - INFO - __main__ - Step 58037: {'lr': 0.0003430368257311661, 'samples': 11143104, 'steps': 58036, 'loss/train': 1.4266974925994873} 11/07/2021 05:22:26 - INFO - __main__ - Step 58038: {'lr': 0.0003430319001297629, 'samples': 11143296, 'steps': 58037, 'loss/train': 0.85700923204422} 11/07/2021 05:22:27 - INFO - __main__ - Step 58039: {'lr': 0.00034302697448644105, 'samples': 11143488, 'steps': 58038, 'loss/train': 1.4314415454864502} 11/07/2021 05:22:27 - INFO - __main__ - Step 58040: {'lr': 0.00034302204880120267, 'samples': 11143680, 'steps': 58039, 'loss/train': 1.265537142753601} 11/07/2021 05:22:28 - INFO - __main__ - Step 58041: {'lr': 0.00034301712307404996, 'samples': 11143872, 'steps': 58040, 'loss/train': 1.2173185348510742} 11/07/2021 05:22:28 - INFO - __main__ - Step 58042: {'lr': 0.00034301219730498524, 'samples': 11144064, 'steps': 58041, 'loss/train': 1.1953152418136597} 11/07/2021 05:22:28 - INFO - __main__ - Step 58043: {'lr': 0.00034300727149401064, 'samples': 11144256, 'steps': 58042, 'loss/train': 1.4818683862686157} 11/07/2021 05:22:29 - INFO - __main__ - Step 58044: {'lr': 0.00034300234564112837, 'samples': 11144448, 'steps': 58043, 'loss/train': 1.2073689699172974} 11/07/2021 05:22:30 - INFO - __main__ - Step 58045: {'lr': 0.0003429974197463407, 'samples': 11144640, 'steps': 58044, 'loss/train': 1.2426390647888184} 11/07/2021 05:22:30 - INFO - __main__ - Step 58046: {'lr': 0.00034299249380964977, 'samples': 11144832, 'steps': 58045, 'loss/train': 2.0916635990142822} 11/07/2021 05:22:30 - INFO - __main__ - Step 58047: {'lr': 0.0003429875678310579, 'samples': 11145024, 'steps': 58046, 'loss/train': 1.402531623840332} 11/07/2021 05:22:31 - INFO - __main__ - Step 58048: {'lr': 0.0003429826418105673, 'samples': 11145216, 'steps': 58047, 'loss/train': 1.5624163150787354} 11/07/2021 05:22:32 - INFO - __main__ - Step 58049: {'lr': 0.0003429777157481801, 'samples': 11145408, 'steps': 58048, 'loss/train': 1.217465877532959} 11/07/2021 05:22:32 - INFO - __main__ - Step 58050: {'lr': 0.0003429727896438986, 'samples': 11145600, 'steps': 58049, 'loss/train': 1.2060452699661255} 11/07/2021 05:22:32 - INFO - __main__ - Step 58051: {'lr': 0.00034296786349772494, 'samples': 11145792, 'steps': 58050, 'loss/train': 1.6606749296188354} 11/07/2021 05:22:33 - INFO - __main__ - Step 58052: {'lr': 0.0003429629373096615, 'samples': 11145984, 'steps': 58051, 'loss/train': 1.473239779472351} 11/07/2021 05:22:33 - INFO - __main__ - Step 58053: {'lr': 0.0003429580110797103, 'samples': 11146176, 'steps': 58052, 'loss/train': 0.9374370574951172} 11/07/2021 05:22:34 - INFO - __main__ - Step 58054: {'lr': 0.0003429530848078737, 'samples': 11146368, 'steps': 58053, 'loss/train': 1.3914765119552612} 11/07/2021 05:22:34 - INFO - __main__ - Step 58055: {'lr': 0.0003429481584941538, 'samples': 11146560, 'steps': 58054, 'loss/train': 1.4052411317825317} 11/07/2021 05:22:35 - INFO - __main__ - Step 58056: {'lr': 0.0003429432321385531, 'samples': 11146752, 'steps': 58055, 'loss/train': 1.6026771068572998} 11/07/2021 05:22:35 - INFO - __main__ - Step 58057: {'lr': 0.00034293830574107345, 'samples': 11146944, 'steps': 58056, 'loss/train': 1.494822382926941} 11/07/2021 05:22:36 - INFO - __main__ - Step 58058: {'lr': 0.0003429333793017173, 'samples': 11147136, 'steps': 58057, 'loss/train': 0.8916078209877014} 11/07/2021 05:22:37 - INFO - __main__ - Step 58059: {'lr': 0.00034292845282048667, 'samples': 11147328, 'steps': 58058, 'loss/train': 1.2187190055847168} 11/07/2021 05:22:37 - INFO - __main__ - Step 58060: {'lr': 0.00034292352629738406, 'samples': 11147520, 'steps': 58059, 'loss/train': 0.9712035655975342} 11/07/2021 05:22:37 - INFO - __main__ - Step 58061: {'lr': 0.00034291859973241146, 'samples': 11147712, 'steps': 58060, 'loss/train': 0.9141193628311157} 11/07/2021 05:22:38 - INFO - __main__ - Step 58062: {'lr': 0.0003429136731255712, 'samples': 11147904, 'steps': 58061, 'loss/train': 1.4545624256134033} 11/07/2021 05:22:38 - INFO - __main__ - Step 58063: {'lr': 0.0003429087464768655, 'samples': 11148096, 'steps': 58062, 'loss/train': 1.2438966035842896} 11/07/2021 05:22:39 - INFO - __main__ - Step 58064: {'lr': 0.00034290381978629655, 'samples': 11148288, 'steps': 58063, 'loss/train': 1.4399888515472412} 11/07/2021 05:22:39 - INFO - __main__ - Step 58065: {'lr': 0.00034289889305386654, 'samples': 11148480, 'steps': 58064, 'loss/train': 1.438434362411499} 11/07/2021 05:22:40 - INFO - __main__ - Step 58066: {'lr': 0.0003428939662795777, 'samples': 11148672, 'steps': 58065, 'loss/train': 1.2491456270217896} 11/07/2021 05:22:40 - INFO - __main__ - Step 58067: {'lr': 0.0003428890394634323, 'samples': 11148864, 'steps': 58066, 'loss/train': 1.3859294652938843} 11/07/2021 05:22:40 - INFO - __main__ - Step 58068: {'lr': 0.0003428841126054326, 'samples': 11149056, 'steps': 58067, 'loss/train': 1.482081413269043} 11/07/2021 05:22:42 - INFO - __main__ - Step 58069: {'lr': 0.0003428791857055806, 'samples': 11149248, 'steps': 58068, 'loss/train': 1.2527201175689697} 11/07/2021 05:22:42 - INFO - __main__ - Step 58070: {'lr': 0.0003428742587638788, 'samples': 11149440, 'steps': 58069, 'loss/train': 0.22418178617954254} 11/07/2021 05:22:42 - INFO - __main__ - Step 58071: {'lr': 0.0003428693317803293, 'samples': 11149632, 'steps': 58070, 'loss/train': 1.7594146728515625} 11/07/2021 05:22:43 - INFO - __main__ - Step 58072: {'lr': 0.00034286440475493423, 'samples': 11149824, 'steps': 58071, 'loss/train': 1.462848424911499} 11/07/2021 05:22:43 - INFO - __main__ - Step 58073: {'lr': 0.0003428594776876959, 'samples': 11150016, 'steps': 58072, 'loss/train': 1.330108880996704} 11/07/2021 05:22:44 - INFO - __main__ - Step 58074: {'lr': 0.0003428545505786166, 'samples': 11150208, 'steps': 58073, 'loss/train': 1.428093433380127} 11/07/2021 05:22:44 - INFO - __main__ - Step 58075: {'lr': 0.0003428496234276984, 'samples': 11150400, 'steps': 58074, 'loss/train': 1.7277567386627197} 11/07/2021 05:22:45 - INFO - __main__ - Step 58076: {'lr': 0.0003428446962349437, 'samples': 11150592, 'steps': 58075, 'loss/train': 1.0090595483779907} 11/07/2021 05:22:45 - INFO - __main__ - Step 58077: {'lr': 0.0003428397690003545, 'samples': 11150784, 'steps': 58076, 'loss/train': 1.258705496788025} 11/07/2021 05:22:45 - INFO - __main__ - Step 58078: {'lr': 0.00034283484172393315, 'samples': 11150976, 'steps': 58077, 'loss/train': 1.2631856203079224} 11/07/2021 05:22:47 - INFO - __main__ - Step 58079: {'lr': 0.0003428299144056818, 'samples': 11151168, 'steps': 58078, 'loss/train': 1.2879300117492676} 11/07/2021 05:22:47 - INFO - __main__ - Step 58080: {'lr': 0.00034282498704560284, 'samples': 11151360, 'steps': 58079, 'loss/train': 1.1683639287948608} 11/07/2021 05:22:47 - INFO - __main__ - Step 58081: {'lr': 0.0003428200596436983, 'samples': 11151552, 'steps': 58080, 'loss/train': 1.3141639232635498} 11/07/2021 05:22:48 - INFO - __main__ - Step 58082: {'lr': 0.00034281513219997054, 'samples': 11151744, 'steps': 58081, 'loss/train': 1.1433618068695068} 11/07/2021 05:22:48 - INFO - __main__ - Step 58083: {'lr': 0.0003428102047144217, 'samples': 11151936, 'steps': 58082, 'loss/train': 1.0904260873794556} 11/07/2021 05:22:48 - INFO - __main__ - Step 58084: {'lr': 0.00034280527718705397, 'samples': 11152128, 'steps': 58083, 'loss/train': 1.9198188781738281} 11/07/2021 05:22:50 - INFO - __main__ - Step 58085: {'lr': 0.0003428003496178696, 'samples': 11152320, 'steps': 58084, 'loss/train': 1.529852032661438} 11/07/2021 05:22:50 - INFO - __main__ - Step 58086: {'lr': 0.00034279542200687087, 'samples': 11152512, 'steps': 58085, 'loss/train': 1.5440113544464111} 11/07/2021 05:22:50 - INFO - __main__ - Step 58087: {'lr': 0.0003427904943540599, 'samples': 11152704, 'steps': 58086, 'loss/train': 1.61406672000885} 11/07/2021 05:22:51 - INFO - __main__ - Step 58088: {'lr': 0.000342785566659439, 'samples': 11152896, 'steps': 58087, 'loss/train': 1.048874855041504} 11/07/2021 05:22:52 - INFO - __main__ - Step 58089: {'lr': 0.00034278063892301036, 'samples': 11153088, 'steps': 58088, 'loss/train': 0.7622469067573547} 11/07/2021 05:22:52 - INFO - __main__ - Step 58090: {'lr': 0.00034277571114477623, 'samples': 11153280, 'steps': 58089, 'loss/train': 0.9306669235229492} 11/07/2021 05:22:52 - INFO - __main__ - Step 58091: {'lr': 0.0003427707833247388, 'samples': 11153472, 'steps': 58090, 'loss/train': 1.3199565410614014} 11/07/2021 05:22:53 - INFO - __main__ - Step 58092: {'lr': 0.0003427658554629002, 'samples': 11153664, 'steps': 58091, 'loss/train': 1.4994642734527588} 11/07/2021 05:22:53 - INFO - __main__ - Step 58093: {'lr': 0.00034276092755926275, 'samples': 11153856, 'steps': 58092, 'loss/train': 1.5093677043914795} 11/07/2021 05:22:54 - INFO - __main__ - Step 58094: {'lr': 0.0003427559996138287, 'samples': 11154048, 'steps': 58093, 'loss/train': 2.099083662033081} 11/07/2021 05:22:55 - INFO - __main__ - Step 58095: {'lr': 0.00034275107162660024, 'samples': 11154240, 'steps': 58094, 'loss/train': 1.3972128629684448} 11/07/2021 05:22:55 - INFO - __main__ - Step 58096: {'lr': 0.0003427461435975796, 'samples': 11154432, 'steps': 58095, 'loss/train': 1.568275809288025} 11/07/2021 05:22:56 - INFO - __main__ - Step 58097: {'lr': 0.0003427412155267688, 'samples': 11154624, 'steps': 58096, 'loss/train': 1.3446893692016602} 11/07/2021 05:22:56 - INFO - __main__ - Step 58098: {'lr': 0.00034273628741417043, 'samples': 11154816, 'steps': 58097, 'loss/train': 1.7160266637802124} 11/07/2021 05:22:56 - INFO - __main__ - Step 58099: {'lr': 0.0003427313592597865, 'samples': 11155008, 'steps': 58098, 'loss/train': 1.2969858646392822} 11/07/2021 05:22:57 - INFO - __main__ - Step 58100: {'lr': 0.00034272643106361916, 'samples': 11155200, 'steps': 58099, 'loss/train': 1.284481406211853} 11/07/2021 05:22:58 - INFO - __main__ - Step 58101: {'lr': 0.00034272150282567084, 'samples': 11155392, 'steps': 58100, 'loss/train': 1.4279965162277222} 11/07/2021 05:22:58 - INFO - __main__ - Step 58102: {'lr': 0.00034271657454594355, 'samples': 11155584, 'steps': 58101, 'loss/train': 1.6952944993972778} 11/07/2021 05:22:58 - INFO - __main__ - Step 58103: {'lr': 0.0003427116462244396, 'samples': 11155776, 'steps': 58102, 'loss/train': 1.5189787149429321} 11/07/2021 05:22:59 - INFO - __main__ - Step 58104: {'lr': 0.00034270671786116127, 'samples': 11155968, 'steps': 58103, 'loss/train': 1.3295122385025024} 11/07/2021 05:22:59 - INFO - __main__ - Step 58105: {'lr': 0.00034270178945611067, 'samples': 11156160, 'steps': 58104, 'loss/train': 1.4877725839614868} 11/07/2021 05:23:00 - INFO - __main__ - Step 58106: {'lr': 0.00034269686100929015, 'samples': 11156352, 'steps': 58105, 'loss/train': 1.7005438804626465} 11/07/2021 05:23:00 - INFO - __main__ - Step 58107: {'lr': 0.0003426919325207018, 'samples': 11156544, 'steps': 58106, 'loss/train': 1.6490744352340698} 11/07/2021 05:23:01 - INFO - __main__ - Step 58108: {'lr': 0.0003426870039903479, 'samples': 11156736, 'steps': 58107, 'loss/train': 1.5698260068893433} 11/07/2021 05:23:01 - INFO - __main__ - Step 58109: {'lr': 0.00034268207541823066, 'samples': 11156928, 'steps': 58108, 'loss/train': 1.6872550249099731} 11/07/2021 05:23:01 - INFO - __main__ - Step 58110: {'lr': 0.0003426771468043523, 'samples': 11157120, 'steps': 58109, 'loss/train': 2.1855580806732178} 11/07/2021 05:23:03 - INFO - __main__ - Step 58111: {'lr': 0.00034267221814871505, 'samples': 11157312, 'steps': 58110, 'loss/train': 1.670572280883789} 11/07/2021 05:23:03 - INFO - __main__ - Step 58112: {'lr': 0.0003426672894513212, 'samples': 11157504, 'steps': 58111, 'loss/train': 1.4893500804901123} 11/07/2021 05:23:03 - INFO - __main__ - Step 58113: {'lr': 0.00034266236071217284, 'samples': 11157696, 'steps': 58112, 'loss/train': 1.5400220155715942} 11/07/2021 05:23:04 - INFO - __main__ - Step 58114: {'lr': 0.00034265743193127217, 'samples': 11157888, 'steps': 58113, 'loss/train': 1.7129415273666382} 11/07/2021 05:23:04 - INFO - __main__ - Step 58115: {'lr': 0.00034265250310862164, 'samples': 11158080, 'steps': 58114, 'loss/train': 1.34105384349823} 11/07/2021 05:23:05 - INFO - __main__ - Step 58116: {'lr': 0.0003426475742442232, 'samples': 11158272, 'steps': 58115, 'loss/train': 1.7803500890731812} 11/07/2021 05:23:05 - INFO - __main__ - Step 58117: {'lr': 0.0003426426453380793, 'samples': 11158464, 'steps': 58116, 'loss/train': 1.460178256034851} 11/07/2021 05:23:06 - INFO - __main__ - Step 58118: {'lr': 0.000342637716390192, 'samples': 11158656, 'steps': 58117, 'loss/train': 1.3733336925506592} 11/07/2021 05:23:06 - INFO - __main__ - Step 58119: {'lr': 0.0003426327874005636, 'samples': 11158848, 'steps': 58118, 'loss/train': 1.0330922603607178} 11/07/2021 05:23:06 - INFO - __main__ - Step 58120: {'lr': 0.00034262785836919617, 'samples': 11159040, 'steps': 58119, 'loss/train': 1.5005682706832886} 11/07/2021 05:23:07 - INFO - __main__ - Step 58121: {'lr': 0.00034262292929609217, 'samples': 11159232, 'steps': 58120, 'loss/train': 0.6388111710548401} 11/07/2021 05:23:08 - INFO - __main__ - Step 58122: {'lr': 0.0003426180001812537, 'samples': 11159424, 'steps': 58121, 'loss/train': 0.8463659286499023} 11/07/2021 05:23:08 - INFO - __main__ - Step 58123: {'lr': 0.000342613071024683, 'samples': 11159616, 'steps': 58122, 'loss/train': 0.7747419476509094} 11/07/2021 05:23:08 - INFO - __main__ - Step 58124: {'lr': 0.0003426081418263823, 'samples': 11159808, 'steps': 58123, 'loss/train': 1.5754873752593994} 11/07/2021 05:23:09 - INFO - __main__ - Step 58125: {'lr': 0.00034260321258635377, 'samples': 11160000, 'steps': 58124, 'loss/train': 0.8849248886108398} 11/07/2021 05:23:09 - INFO - __main__ - Step 58126: {'lr': 0.0003425982833045996, 'samples': 11160192, 'steps': 58125, 'loss/train': 1.6146043539047241} 11/07/2021 05:23:10 - INFO - __main__ - Step 58127: {'lr': 0.0003425933539811221, 'samples': 11160384, 'steps': 58126, 'loss/train': 1.31283438205719} 11/07/2021 05:23:11 - INFO - __main__ - Step 58128: {'lr': 0.0003425884246159235, 'samples': 11160576, 'steps': 58127, 'loss/train': 1.4597915410995483} 11/07/2021 05:23:11 - INFO - __main__ - Step 58129: {'lr': 0.00034258349520900595, 'samples': 11160768, 'steps': 58128, 'loss/train': 1.814573049545288} 11/07/2021 05:23:11 - INFO - __main__ - Step 58130: {'lr': 0.0003425785657603718, 'samples': 11160960, 'steps': 58129, 'loss/train': 1.185489535331726} 11/07/2021 05:23:12 - INFO - __main__ - Step 58131: {'lr': 0.0003425736362700231, 'samples': 11161152, 'steps': 58130, 'loss/train': 1.199737310409546} 11/07/2021 05:23:13 - INFO - __main__ - Step 58132: {'lr': 0.00034256870673796217, 'samples': 11161344, 'steps': 58131, 'loss/train': 1.3147960901260376} 11/07/2021 05:23:13 - INFO - __main__ - Step 58133: {'lr': 0.0003425637771641911, 'samples': 11161536, 'steps': 58132, 'loss/train': 1.3148967027664185} 11/07/2021 05:23:13 - INFO - __main__ - Step 58134: {'lr': 0.00034255884754871233, 'samples': 11161728, 'steps': 58133, 'loss/train': 1.83072030544281} 11/07/2021 05:23:14 - INFO - __main__ - Step 58135: {'lr': 0.000342553917891528, 'samples': 11161920, 'steps': 58134, 'loss/train': 1.1407413482666016} 11/07/2021 05:23:14 - INFO - __main__ - Step 58136: {'lr': 0.0003425489881926402, 'samples': 11162112, 'steps': 58135, 'loss/train': 2.0290093421936035} 11/07/2021 05:23:15 - INFO - __main__ - Step 58137: {'lr': 0.0003425440584520514, 'samples': 11162304, 'steps': 58136, 'loss/train': 1.5655841827392578} 11/07/2021 05:23:15 - INFO - __main__ - Step 58138: {'lr': 0.00034253912866976353, 'samples': 11162496, 'steps': 58137, 'loss/train': 1.5796418190002441} 11/07/2021 05:23:16 - INFO - __main__ - Step 58139: {'lr': 0.000342534198845779, 'samples': 11162688, 'steps': 58138, 'loss/train': 1.5528148412704468} 11/07/2021 05:23:16 - INFO - __main__ - Step 58140: {'lr': 0.0003425292689801, 'samples': 11162880, 'steps': 58139, 'loss/train': 1.3445820808410645} 11/07/2021 05:23:16 - INFO - __main__ - Step 58141: {'lr': 0.00034252433907272875, 'samples': 11163072, 'steps': 58140, 'loss/train': 1.4922312498092651} 11/07/2021 05:23:18 - INFO - __main__ - Step 58142: {'lr': 0.0003425194091236674, 'samples': 11163264, 'steps': 58141, 'loss/train': 1.2161821126937866} 11/07/2021 05:23:18 - INFO - __main__ - Step 58143: {'lr': 0.0003425144791329183, 'samples': 11163456, 'steps': 58142, 'loss/train': 1.1257246732711792} 11/07/2021 05:23:18 - INFO - __main__ - Step 58144: {'lr': 0.00034250954910048357, 'samples': 11163648, 'steps': 58143, 'loss/train': 1.5302990674972534} 11/07/2021 05:23:19 - INFO - __main__ - Step 58145: {'lr': 0.0003425046190263655, 'samples': 11163840, 'steps': 58144, 'loss/train': 1.6194462776184082} 11/07/2021 05:23:19 - INFO - __main__ - Step 58146: {'lr': 0.00034249968891056625, 'samples': 11164032, 'steps': 58145, 'loss/train': 1.2853859663009644} 11/07/2021 05:23:20 - INFO - __main__ - Step 58147: {'lr': 0.00034249475875308813, 'samples': 11164224, 'steps': 58146, 'loss/train': 1.7340830564498901} 11/07/2021 05:23:20 - INFO - __main__ - Step 58148: {'lr': 0.00034248982855393317, 'samples': 11164416, 'steps': 58147, 'loss/train': 1.2193033695220947} 11/07/2021 05:23:21 - INFO - __main__ - Step 58149: {'lr': 0.0003424848983131038, 'samples': 11164608, 'steps': 58148, 'loss/train': 1.3887706995010376} 11/07/2021 05:23:21 - INFO - __main__ - Step 58150: {'lr': 0.0003424799680306022, 'samples': 11164800, 'steps': 58149, 'loss/train': 0.44565680623054504} 11/07/2021 05:23:21 - INFO - __main__ - Step 58151: {'lr': 0.0003424750377064305, 'samples': 11164992, 'steps': 58150, 'loss/train': 1.300445556640625} 11/07/2021 05:23:22 - INFO - __main__ - Step 58152: {'lr': 0.000342470107340591, 'samples': 11165184, 'steps': 58151, 'loss/train': 1.3930526971817017} 11/07/2021 05:23:23 - INFO - __main__ - Step 58153: {'lr': 0.0003424651769330859, 'samples': 11165376, 'steps': 58152, 'loss/train': 1.7067426443099976} 11/07/2021 05:23:23 - INFO - __main__ - Step 58154: {'lr': 0.0003424602464839173, 'samples': 11165568, 'steps': 58153, 'loss/train': 0.9218025207519531} 11/07/2021 05:23:24 - INFO - __main__ - Step 58155: {'lr': 0.0003424553159930877, 'samples': 11165760, 'steps': 58154, 'loss/train': 1.6319758892059326} 11/07/2021 05:23:24 - INFO - __main__ - Step 58156: {'lr': 0.00034245038546059904, 'samples': 11165952, 'steps': 58155, 'loss/train': 1.5520331859588623} 11/07/2021 05:23:24 - INFO - __main__ - Step 58157: {'lr': 0.0003424454548864538, 'samples': 11166144, 'steps': 58156, 'loss/train': 1.245050072669983} 11/07/2021 05:23:25 - INFO - __main__ - Step 58158: {'lr': 0.00034244052427065397, 'samples': 11166336, 'steps': 58157, 'loss/train': 1.8263821601867676} 11/07/2021 05:23:26 - INFO - __main__ - Step 58159: {'lr': 0.00034243559361320187, 'samples': 11166528, 'steps': 58158, 'loss/train': 3.1753933429718018} 11/07/2021 05:23:26 - INFO - __main__ - Step 58160: {'lr': 0.00034243066291409977, 'samples': 11166720, 'steps': 58159, 'loss/train': 1.89552903175354} 11/07/2021 05:23:26 - INFO - __main__ - Step 58161: {'lr': 0.0003424257321733497, 'samples': 11166912, 'steps': 58160, 'loss/train': 1.6489208936691284} 11/07/2021 05:23:27 - INFO - __main__ - Step 58162: {'lr': 0.00034242080139095416, 'samples': 11167104, 'steps': 58161, 'loss/train': 1.6356033086776733} 11/07/2021 05:23:28 - INFO - __main__ - Step 58163: {'lr': 0.0003424158705669152, 'samples': 11167296, 'steps': 58162, 'loss/train': 1.5166915655136108} 11/07/2021 05:23:28 - INFO - __main__ - Step 58164: {'lr': 0.0003424109397012351, 'samples': 11167488, 'steps': 58163, 'loss/train': 1.524658441543579} 11/07/2021 05:23:28 - INFO - __main__ - Step 58165: {'lr': 0.000342406008793916, 'samples': 11167680, 'steps': 58164, 'loss/train': 1.2651325464248657} 11/07/2021 05:23:29 - INFO - __main__ - Step 58166: {'lr': 0.00034240107784496023, 'samples': 11167872, 'steps': 58165, 'loss/train': 1.3366292715072632} 11/07/2021 05:23:29 - INFO - __main__ - Step 58167: {'lr': 0.00034239614685436994, 'samples': 11168064, 'steps': 58166, 'loss/train': 1.7601878643035889} 11/07/2021 05:23:30 - INFO - __main__ - Step 58168: {'lr': 0.0003423912158221473, 'samples': 11168256, 'steps': 58167, 'loss/train': 1.148177981376648} 11/07/2021 05:23:31 - INFO - __main__ - Step 58169: {'lr': 0.0003423862847482947, 'samples': 11168448, 'steps': 58168, 'loss/train': 0.06079781800508499} 11/07/2021 05:23:31 - INFO - __main__ - Step 58170: {'lr': 0.0003423813536328143, 'samples': 11168640, 'steps': 58169, 'loss/train': 1.6206070184707642} 11/07/2021 05:23:31 - INFO - __main__ - Step 58171: {'lr': 0.00034237642247570815, 'samples': 11168832, 'steps': 58170, 'loss/train': 1.9088107347488403} 11/07/2021 05:23:32 - INFO - __main__ - Step 58172: {'lr': 0.0003423714912769787, 'samples': 11169024, 'steps': 58171, 'loss/train': 1.0276298522949219} 11/07/2021 05:23:32 - INFO - __main__ - Step 58173: {'lr': 0.000342366560036628, 'samples': 11169216, 'steps': 58172, 'loss/train': 1.4985283613204956} 11/07/2021 05:23:33 - INFO - __main__ - Step 58174: {'lr': 0.0003423616287546585, 'samples': 11169408, 'steps': 58173, 'loss/train': 1.5972003936767578} 11/07/2021 05:23:33 - INFO - __main__ - Step 58175: {'lr': 0.00034235669743107214, 'samples': 11169600, 'steps': 58174, 'loss/train': 1.5434128046035767} 11/07/2021 05:23:34 - INFO - __main__ - Step 58176: {'lr': 0.0003423517660658713, 'samples': 11169792, 'steps': 58175, 'loss/train': 0.8475615978240967} 11/07/2021 05:23:34 - INFO - __main__ - Step 58177: {'lr': 0.0003423468346590583, 'samples': 11169984, 'steps': 58176, 'loss/train': 1.3889529705047607} 11/07/2021 05:23:34 - INFO - __main__ - Step 58178: {'lr': 0.00034234190321063516, 'samples': 11170176, 'steps': 58177, 'loss/train': 1.2486295700073242} 11/07/2021 05:23:36 - INFO - __main__ - Step 58179: {'lr': 0.00034233697172060415, 'samples': 11170368, 'steps': 58178, 'loss/train': 1.5169450044631958} 11/07/2021 05:23:36 - INFO - __main__ - Step 58180: {'lr': 0.00034233204018896754, 'samples': 11170560, 'steps': 58179, 'loss/train': 1.3016443252563477} 11/07/2021 05:23:37 - INFO - __main__ - Step 58181: {'lr': 0.00034232710861572754, 'samples': 11170752, 'steps': 58180, 'loss/train': 0.5447980761528015} 11/07/2021 05:23:37 - INFO - __main__ - Step 58182: {'lr': 0.0003423221770008864, 'samples': 11170944, 'steps': 58181, 'loss/train': 1.4304301738739014} 11/07/2021 05:23:37 - INFO - __main__ - Step 58183: {'lr': 0.0003423172453444462, 'samples': 11171136, 'steps': 58182, 'loss/train': 1.4617197513580322} 11/07/2021 05:23:38 - INFO - __main__ - Step 58184: {'lr': 0.00034231231364640946, 'samples': 11171328, 'steps': 58183, 'loss/train': 1.0902422666549683} 11/07/2021 05:23:39 - INFO - __main__ - Step 58185: {'lr': 0.0003423073819067781, 'samples': 11171520, 'steps': 58184, 'loss/train': 1.501054048538208} 11/07/2021 05:23:39 - INFO - __main__ - Step 58186: {'lr': 0.00034230245012555445, 'samples': 11171712, 'steps': 58185, 'loss/train': 0.9810585379600525} 11/07/2021 05:23:39 - INFO - __main__ - Step 58187: {'lr': 0.00034229751830274077, 'samples': 11171904, 'steps': 58186, 'loss/train': 1.3098068237304688} 11/07/2021 05:23:40 - INFO - __main__ - Step 58188: {'lr': 0.0003422925864383392, 'samples': 11172096, 'steps': 58187, 'loss/train': 1.1051206588745117} 11/07/2021 05:23:41 - INFO - __main__ - Step 58189: {'lr': 0.00034228765453235213, 'samples': 11172288, 'steps': 58188, 'loss/train': 1.5620949268341064} 11/07/2021 05:23:41 - INFO - __main__ - Step 58190: {'lr': 0.0003422827225847816, 'samples': 11172480, 'steps': 58189, 'loss/train': 1.223131537437439} 11/07/2021 05:23:41 - INFO - __main__ - Step 58191: {'lr': 0.0003422777905956299, 'samples': 11172672, 'steps': 58190, 'loss/train': 1.2403318881988525} 11/07/2021 05:23:42 - INFO - __main__ - Step 58192: {'lr': 0.0003422728585648992, 'samples': 11172864, 'steps': 58191, 'loss/train': 1.8370261192321777} 11/07/2021 05:23:42 - INFO - __main__ - Step 58193: {'lr': 0.00034226792649259184, 'samples': 11173056, 'steps': 58192, 'loss/train': 1.4494093656539917} 11/07/2021 05:23:43 - INFO - __main__ - Step 58194: {'lr': 0.00034226299437870993, 'samples': 11173248, 'steps': 58193, 'loss/train': 1.1527293920516968} 11/07/2021 05:23:44 - INFO - __main__ - Step 58195: {'lr': 0.0003422580622232558, 'samples': 11173440, 'steps': 58194, 'loss/train': 1.4890687465667725} 11/07/2021 05:23:44 - INFO - __main__ - Step 58196: {'lr': 0.0003422531300262316, 'samples': 11173632, 'steps': 58195, 'loss/train': 1.525909662246704} 11/07/2021 05:23:44 - INFO - __main__ - Step 58197: {'lr': 0.00034224819778763953, 'samples': 11173824, 'steps': 58196, 'loss/train': 1.4138767719268799} 11/07/2021 05:23:45 - INFO - __main__ - Step 58198: {'lr': 0.0003422432655074819, 'samples': 11174016, 'steps': 58197, 'loss/train': 1.4295481443405151} 11/07/2021 05:23:45 - INFO - __main__ - Step 58199: {'lr': 0.0003422383331857608, 'samples': 11174208, 'steps': 58198, 'loss/train': 1.5134727954864502} 11/07/2021 05:23:46 - INFO - __main__ - Step 58200: {'lr': 0.00034223340082247856, 'samples': 11174400, 'steps': 58199, 'loss/train': 2.110922336578369} 11/07/2021 05:23:46 - INFO - __main__ - Step 58201: {'lr': 0.0003422284684176374, 'samples': 11174592, 'steps': 58200, 'loss/train': 1.659425973892212} 11/07/2021 05:23:47 - INFO - __main__ - Step 58202: {'lr': 0.00034222353597123946, 'samples': 11174784, 'steps': 58201, 'loss/train': 1.4581602811813354} 11/07/2021 05:23:47 - INFO - __main__ - Step 58203: {'lr': 0.00034221860348328703, 'samples': 11174976, 'steps': 58202, 'loss/train': 1.4331773519515991} 11/07/2021 05:23:47 - INFO - __main__ - Step 58204: {'lr': 0.0003422136709537824, 'samples': 11175168, 'steps': 58203, 'loss/train': 1.7570655345916748} 11/07/2021 05:23:48 - INFO - __main__ - Step 58205: {'lr': 0.00034220873838272767, 'samples': 11175360, 'steps': 58204, 'loss/train': 1.2941737174987793} 11/07/2021 05:23:49 - INFO - __main__ - Step 58206: {'lr': 0.00034220380577012506, 'samples': 11175552, 'steps': 58205, 'loss/train': 1.502335548400879} 11/07/2021 05:23:49 - INFO - __main__ - Step 58207: {'lr': 0.00034219887311597686, 'samples': 11175744, 'steps': 58206, 'loss/train': 1.3283244371414185} 11/07/2021 05:23:49 - INFO - __main__ - Step 58208: {'lr': 0.0003421939404202853, 'samples': 11175936, 'steps': 58207, 'loss/train': 1.7608331441879272} 11/07/2021 05:23:50 - INFO - __main__ - Step 58209: {'lr': 0.0003421890076830525, 'samples': 11176128, 'steps': 58208, 'loss/train': 1.7661632299423218} 11/07/2021 05:23:51 - INFO - __main__ - Step 58210: {'lr': 0.00034218407490428085, 'samples': 11176320, 'steps': 58209, 'loss/train': 1.0454431772232056} 11/07/2021 05:23:51 - INFO - __main__ - Step 58211: {'lr': 0.0003421791420839724, 'samples': 11176512, 'steps': 58210, 'loss/train': 1.4422022104263306} 11/07/2021 05:23:52 - INFO - __main__ - Step 58212: {'lr': 0.00034217420922212947, 'samples': 11176704, 'steps': 58211, 'loss/train': 1.4320636987686157} 11/07/2021 05:23:52 - INFO - __main__ - Step 58213: {'lr': 0.0003421692763187543, 'samples': 11176896, 'steps': 58212, 'loss/train': 1.5016170740127563} 11/07/2021 05:23:52 - INFO - __main__ - Step 58214: {'lr': 0.00034216434337384905, 'samples': 11177088, 'steps': 58213, 'loss/train': 1.1252275705337524} 11/07/2021 05:23:53 - INFO - __main__ - Step 58215: {'lr': 0.000342159410387416, 'samples': 11177280, 'steps': 58214, 'loss/train': 1.9015330076217651} 11/07/2021 05:23:54 - INFO - __main__ - Step 58216: {'lr': 0.0003421544773594573, 'samples': 11177472, 'steps': 58215, 'loss/train': 1.9007363319396973} 11/07/2021 05:23:54 - INFO - __main__ - Step 58217: {'lr': 0.0003421495442899753, 'samples': 11177664, 'steps': 58216, 'loss/train': 1.3868484497070312} 11/07/2021 05:23:54 - INFO - __main__ - Step 58218: {'lr': 0.0003421446111789721, 'samples': 11177856, 'steps': 58217, 'loss/train': 2.0489532947540283} 11/07/2021 05:23:55 - INFO - __main__ - Step 58219: {'lr': 0.00034213967802644986, 'samples': 11178048, 'steps': 58218, 'loss/train': 1.278505802154541} 11/07/2021 05:23:56 - INFO - __main__ - Step 58220: {'lr': 0.000342134744832411, 'samples': 11178240, 'steps': 58219, 'loss/train': 1.5730618238449097} 11/07/2021 05:23:56 - INFO - __main__ - Step 58221: {'lr': 0.0003421298115968576, 'samples': 11178432, 'steps': 58220, 'loss/train': 2.0743625164031982} 11/07/2021 05:23:57 - INFO - __main__ - Step 58222: {'lr': 0.0003421248783197919, 'samples': 11178624, 'steps': 58221, 'loss/train': 1.1445319652557373} 11/07/2021 05:23:57 - INFO - __main__ - Step 58223: {'lr': 0.0003421199450012162, 'samples': 11178816, 'steps': 58222, 'loss/train': 1.6939197778701782} 11/07/2021 05:23:57 - INFO - __main__ - Step 58224: {'lr': 0.00034211501164113276, 'samples': 11179008, 'steps': 58223, 'loss/train': 1.6914745569229126} 11/07/2021 05:23:58 - INFO - __main__ - Step 58225: {'lr': 0.0003421100782395436, 'samples': 11179200, 'steps': 58224, 'loss/train': 1.6748243570327759} 11/07/2021 05:23:59 - INFO - __main__ - Step 58226: {'lr': 0.000342105144796451, 'samples': 11179392, 'steps': 58225, 'loss/train': 1.6589128971099854} 11/07/2021 05:23:59 - INFO - __main__ - Step 58227: {'lr': 0.0003421002113118574, 'samples': 11179584, 'steps': 58226, 'loss/train': 1.6836780309677124} 11/07/2021 05:23:59 - INFO - __main__ - Step 58228: {'lr': 0.00034209527778576477, 'samples': 11179776, 'steps': 58227, 'loss/train': 1.0992323160171509} 11/07/2021 05:24:00 - INFO - __main__ - Step 58229: {'lr': 0.0003420903442181755, 'samples': 11179968, 'steps': 58228, 'loss/train': 1.1824052333831787} 11/07/2021 05:24:00 - INFO - __main__ - Step 58230: {'lr': 0.0003420854106090917, 'samples': 11180160, 'steps': 58229, 'loss/train': 1.4953030347824097} 11/07/2021 05:24:01 - INFO - __main__ - Step 58231: {'lr': 0.00034208047695851563, 'samples': 11180352, 'steps': 58230, 'loss/train': 1.0680210590362549} 11/07/2021 05:24:01 - INFO - __main__ - Step 58232: {'lr': 0.0003420755432664495, 'samples': 11180544, 'steps': 58231, 'loss/train': 1.3415437936782837} 11/07/2021 05:24:02 - INFO - __main__ - Step 58233: {'lr': 0.0003420706095328956, 'samples': 11180736, 'steps': 58232, 'loss/train': 1.3045610189437866} 11/07/2021 05:24:02 - INFO - __main__ - Step 58234: {'lr': 0.0003420656757578561, 'samples': 11180928, 'steps': 58233, 'loss/train': 1.4776599407196045} 11/07/2021 05:24:02 - INFO - __main__ - Step 58235: {'lr': 0.00034206074194133323, 'samples': 11181120, 'steps': 58234, 'loss/train': 1.7522863149642944} 11/07/2021 05:24:04 - INFO - __main__ - Step 58236: {'lr': 0.00034205580808332916, 'samples': 11181312, 'steps': 58235, 'loss/train': 1.7411361932754517} 11/07/2021 05:24:04 - INFO - __main__ - Step 58237: {'lr': 0.0003420508741838462, 'samples': 11181504, 'steps': 58236, 'loss/train': 1.7615766525268555} 11/07/2021 05:24:04 - INFO - __main__ - Step 58238: {'lr': 0.0003420459402428865, 'samples': 11181696, 'steps': 58237, 'loss/train': 1.363251805305481} 11/07/2021 05:24:05 - INFO - __main__ - Step 58239: {'lr': 0.00034204100626045235, 'samples': 11181888, 'steps': 58238, 'loss/train': 1.4591209888458252} 11/07/2021 05:24:05 - INFO - __main__ - Step 58240: {'lr': 0.00034203607223654594, 'samples': 11182080, 'steps': 58239, 'loss/train': 1.4060337543487549} 11/07/2021 05:24:06 - INFO - __main__ - Step 58241: {'lr': 0.00034203113817116957, 'samples': 11182272, 'steps': 58240, 'loss/train': 0.6312339901924133} 11/07/2021 05:24:06 - INFO - __main__ - Step 58242: {'lr': 0.0003420262040643253, 'samples': 11182464, 'steps': 58241, 'loss/train': 1.2740561962127686} 11/07/2021 05:24:07 - INFO - __main__ - Step 58243: {'lr': 0.0003420212699160154, 'samples': 11182656, 'steps': 58242, 'loss/train': 1.509224534034729} 11/07/2021 05:24:07 - INFO - __main__ - Step 58244: {'lr': 0.00034201633572624216, 'samples': 11182848, 'steps': 58243, 'loss/train': 1.5922558307647705} 11/07/2021 05:24:07 - INFO - __main__ - Step 58245: {'lr': 0.00034201140149500784, 'samples': 11183040, 'steps': 58244, 'loss/train': 1.5326972007751465} 11/07/2021 05:24:08 - INFO - __main__ - Step 58246: {'lr': 0.0003420064672223146, 'samples': 11183232, 'steps': 58245, 'loss/train': 1.5039807558059692} 11/07/2021 05:24:09 - INFO - __main__ - Step 58247: {'lr': 0.0003420015329081647, 'samples': 11183424, 'steps': 58246, 'loss/train': 1.651671051979065} 11/07/2021 05:24:09 - INFO - __main__ - Step 58248: {'lr': 0.00034199659855256023, 'samples': 11183616, 'steps': 58247, 'loss/train': 1.4716250896453857} 11/07/2021 05:24:09 - INFO - __main__ - Step 58249: {'lr': 0.00034199166415550353, 'samples': 11183808, 'steps': 58248, 'loss/train': 1.0323662757873535} 11/07/2021 05:24:10 - INFO - __main__ - Step 58250: {'lr': 0.0003419867297169968, 'samples': 11184000, 'steps': 58249, 'loss/train': 1.5270559787750244} 11/07/2021 05:24:10 - INFO - __main__ - Step 58251: {'lr': 0.00034198179523704233, 'samples': 11184192, 'steps': 58250, 'loss/train': 1.4787980318069458} 11/07/2021 05:24:11 - INFO - __main__ - Step 58252: {'lr': 0.0003419768607156423, 'samples': 11184384, 'steps': 58251, 'loss/train': 1.374104380607605} 11/07/2021 05:24:12 - INFO - __main__ - Step 58253: {'lr': 0.0003419719261527988, 'samples': 11184576, 'steps': 58252, 'loss/train': 1.5750901699066162} 11/07/2021 05:24:12 - INFO - __main__ - Step 58254: {'lr': 0.0003419669915485142, 'samples': 11184768, 'steps': 58253, 'loss/train': 1.468117117881775} 11/07/2021 05:24:12 - INFO - __main__ - Step 58255: {'lr': 0.00034196205690279076, 'samples': 11184960, 'steps': 58254, 'loss/train': 1.0962551832199097} 11/07/2021 05:24:13 - INFO - __main__ - Step 58256: {'lr': 0.00034195712221563057, 'samples': 11185152, 'steps': 58255, 'loss/train': 1.4864016771316528} 11/07/2021 05:24:14 - INFO - __main__ - Step 58257: {'lr': 0.00034195218748703596, 'samples': 11185344, 'steps': 58256, 'loss/train': 1.6173681020736694} 11/07/2021 05:24:14 - INFO - __main__ - Step 58258: {'lr': 0.00034194725271700915, 'samples': 11185536, 'steps': 58257, 'loss/train': 1.283270001411438} 11/07/2021 05:24:14 - INFO - __main__ - Step 58259: {'lr': 0.0003419423179055523, 'samples': 11185728, 'steps': 58258, 'loss/train': 0.7892928719520569} 11/07/2021 05:24:15 - INFO - __main__ - Step 58260: {'lr': 0.0003419373830526676, 'samples': 11185920, 'steps': 58259, 'loss/train': 1.3765954971313477} 11/07/2021 05:24:15 - INFO - __main__ - Step 58261: {'lr': 0.0003419324481583574, 'samples': 11186112, 'steps': 58260, 'loss/train': 1.7961735725402832} 11/07/2021 05:24:16 - INFO - __main__ - Step 58262: {'lr': 0.00034192751322262375, 'samples': 11186304, 'steps': 58261, 'loss/train': 1.456167221069336} 11/07/2021 05:24:16 - INFO - __main__ - Step 58263: {'lr': 0.0003419225782454691, 'samples': 11186496, 'steps': 58262, 'loss/train': 1.5626648664474487} 11/07/2021 05:24:17 - INFO - __main__ - Step 58264: {'lr': 0.00034191764322689553, 'samples': 11186688, 'steps': 58263, 'loss/train': 1.3266459703445435} 11/07/2021 05:24:17 - INFO - __main__ - Step 58265: {'lr': 0.00034191270816690526, 'samples': 11186880, 'steps': 58264, 'loss/train': 1.2717241048812866} 11/07/2021 05:24:17 - INFO - __main__ - Step 58266: {'lr': 0.0003419077730655006, 'samples': 11187072, 'steps': 58265, 'loss/train': 1.8614730834960938} 11/07/2021 05:24:18 - INFO - __main__ - Step 58267: {'lr': 0.00034190283792268365, 'samples': 11187264, 'steps': 58266, 'loss/train': 1.3185659646987915} 11/07/2021 05:24:19 - INFO - __main__ - Step 58268: {'lr': 0.0003418979027384567, 'samples': 11187456, 'steps': 58267, 'loss/train': 1.5168559551239014} 11/07/2021 05:24:19 - INFO - __main__ - Step 58269: {'lr': 0.00034189296751282203, 'samples': 11187648, 'steps': 58268, 'loss/train': 1.7645275592803955} 11/07/2021 05:24:20 - INFO - __main__ - Step 58270: {'lr': 0.0003418880322457817, 'samples': 11187840, 'steps': 58269, 'loss/train': 1.0933140516281128} 11/07/2021 05:24:20 - INFO - __main__ - Step 58271: {'lr': 0.0003418830969373382, 'samples': 11188032, 'steps': 58270, 'loss/train': 1.7920719385147095} 11/07/2021 05:24:20 - INFO - __main__ - Step 58272: {'lr': 0.00034187816158749354, 'samples': 11188224, 'steps': 58271, 'loss/train': 0.6926540732383728} 11/07/2021 05:24:21 - INFO - __main__ - Step 58273: {'lr': 0.00034187322619624996, 'samples': 11188416, 'steps': 58272, 'loss/train': 1.59482741355896} 11/07/2021 05:24:22 - INFO - __main__ - Step 58274: {'lr': 0.0003418682907636097, 'samples': 11188608, 'steps': 58273, 'loss/train': 1.5063611268997192} 11/07/2021 05:24:22 - INFO - __main__ - Step 58275: {'lr': 0.000341863355289575, 'samples': 11188800, 'steps': 58274, 'loss/train': 1.543867826461792} 11/07/2021 05:24:22 - INFO - __main__ - Step 58276: {'lr': 0.0003418584197741481, 'samples': 11188992, 'steps': 58275, 'loss/train': 1.5246760845184326} 11/07/2021 05:24:23 - INFO - __main__ - Step 58277: {'lr': 0.00034185348421733125, 'samples': 11189184, 'steps': 58276, 'loss/train': 1.4306000471115112} 11/07/2021 05:24:24 - INFO - __main__ - Step 58278: {'lr': 0.0003418485486191267, 'samples': 11189376, 'steps': 58277, 'loss/train': 1.5192270278930664} 11/07/2021 05:24:24 - INFO - __main__ - Step 58279: {'lr': 0.0003418436129795365, 'samples': 11189568, 'steps': 58278, 'loss/train': 1.6188006401062012} 11/07/2021 05:24:24 - INFO - __main__ - Step 58280: {'lr': 0.000341838677298563, 'samples': 11189760, 'steps': 58279, 'loss/train': 1.1027847528457642} 11/07/2021 05:24:25 - INFO - __main__ - Step 58281: {'lr': 0.00034183374157620847, 'samples': 11189952, 'steps': 58280, 'loss/train': 1.3313562870025635} 11/07/2021 05:24:25 - INFO - __main__ - Step 58282: {'lr': 0.000341828805812475, 'samples': 11190144, 'steps': 58281, 'loss/train': 1.4884618520736694} 11/07/2021 05:24:26 - INFO - __main__ - Step 58283: {'lr': 0.0003418238700073649, 'samples': 11190336, 'steps': 58282, 'loss/train': 1.4345486164093018} 11/07/2021 05:24:26 - INFO - __main__ - Step 58284: {'lr': 0.0003418189341608804, 'samples': 11190528, 'steps': 58283, 'loss/train': 1.6635472774505615} 11/07/2021 05:24:27 - INFO - __main__ - Step 58285: {'lr': 0.0003418139982730237, 'samples': 11190720, 'steps': 58284, 'loss/train': 1.0728845596313477} 11/07/2021 05:24:27 - INFO - __main__ - Step 58286: {'lr': 0.0003418090623437971, 'samples': 11190912, 'steps': 58285, 'loss/train': 1.3892258405685425} 11/07/2021 05:24:27 - INFO - __main__ - Step 58287: {'lr': 0.00034180412637320267, 'samples': 11191104, 'steps': 58286, 'loss/train': 0.9549578428268433} 11/07/2021 05:24:28 - INFO - __main__ - Step 58288: {'lr': 0.0003417991903612427, 'samples': 11191296, 'steps': 58287, 'loss/train': 1.3454868793487549} 11/07/2021 05:24:29 - INFO - __main__ - Step 58289: {'lr': 0.0003417942543079195, 'samples': 11191488, 'steps': 58288, 'loss/train': 1.5178730487823486} 11/07/2021 05:24:29 - INFO - __main__ - Step 58290: {'lr': 0.00034178931821323517, 'samples': 11191680, 'steps': 58289, 'loss/train': 1.457436203956604} 11/07/2021 05:24:30 - INFO - __main__ - Step 58291: {'lr': 0.0003417843820771921, 'samples': 11191872, 'steps': 58290, 'loss/train': 1.162864089012146} 11/07/2021 05:24:30 - INFO - __main__ - Step 58292: {'lr': 0.00034177944589979225, 'samples': 11192064, 'steps': 58291, 'loss/train': 1.137751817703247} 11/07/2021 05:24:30 - INFO - __main__ - Step 58293: {'lr': 0.0003417745096810381, 'samples': 11192256, 'steps': 58292, 'loss/train': 1.3616067171096802} 11/07/2021 05:24:31 - INFO - __main__ - Step 58294: {'lr': 0.00034176957342093174, 'samples': 11192448, 'steps': 58293, 'loss/train': 1.521599292755127} 11/07/2021 05:24:32 - INFO - __main__ - Step 58295: {'lr': 0.0003417646371194754, 'samples': 11192640, 'steps': 58294, 'loss/train': 1.423263430595398} 11/07/2021 05:24:32 - INFO - __main__ - Step 58296: {'lr': 0.00034175970077667136, 'samples': 11192832, 'steps': 58295, 'loss/train': 1.610305905342102} 11/07/2021 05:24:32 - INFO - __main__ - Step 58297: {'lr': 0.00034175476439252177, 'samples': 11193024, 'steps': 58296, 'loss/train': 1.8208974599838257} 11/07/2021 05:24:33 - INFO - __main__ - Step 58298: {'lr': 0.00034174982796702895, 'samples': 11193216, 'steps': 58297, 'loss/train': 1.125916600227356} 11/07/2021 05:24:34 - INFO - __main__ - Step 58299: {'lr': 0.00034174489150019506, 'samples': 11193408, 'steps': 58298, 'loss/train': 1.8532830476760864} 11/07/2021 05:24:34 - INFO - __main__ - Step 58300: {'lr': 0.0003417399549920224, 'samples': 11193600, 'steps': 58299, 'loss/train': 1.7013945579528809} 11/07/2021 05:24:35 - INFO - __main__ - Step 58301: {'lr': 0.00034173501844251305, 'samples': 11193792, 'steps': 58300, 'loss/train': 1.583761215209961} 11/07/2021 05:24:35 - INFO - __main__ - Step 58302: {'lr': 0.0003417300818516693, 'samples': 11193984, 'steps': 58301, 'loss/train': 1.596790075302124} 11/07/2021 05:24:35 - INFO - __main__ - Step 58303: {'lr': 0.00034172514521949336, 'samples': 11194176, 'steps': 58302, 'loss/train': 1.9282759428024292} 11/07/2021 05:24:36 - INFO - __main__ - Step 58304: {'lr': 0.0003417202085459876, 'samples': 11194368, 'steps': 58303, 'loss/train': 2.0007357597351074} 11/07/2021 05:24:36 - INFO - __main__ - Step 58305: {'lr': 0.00034171527183115413, 'samples': 11194560, 'steps': 58304, 'loss/train': 1.590613842010498} 11/07/2021 05:24:37 - INFO - __main__ - Step 58306: {'lr': 0.0003417103350749951, 'samples': 11194752, 'steps': 58305, 'loss/train': 0.5088095664978027} 11/07/2021 05:24:37 - INFO - __main__ - Step 58307: {'lr': 0.00034170539827751284, 'samples': 11194944, 'steps': 58306, 'loss/train': 1.890604019165039} 11/07/2021 05:24:38 - INFO - __main__ - Step 58308: {'lr': 0.0003417004614387095, 'samples': 11195136, 'steps': 58307, 'loss/train': 1.2942551374435425} 11/07/2021 05:24:38 - INFO - __main__ - Step 58309: {'lr': 0.0003416955245585874, 'samples': 11195328, 'steps': 58308, 'loss/train': 0.6271637678146362} 11/07/2021 05:24:39 - INFO - __main__ - Step 58310: {'lr': 0.00034169058763714865, 'samples': 11195520, 'steps': 58309, 'loss/train': 2.029214859008789} 11/07/2021 05:24:39 - INFO - __main__ - Step 58311: {'lr': 0.0003416856506743956, 'samples': 11195712, 'steps': 58310, 'loss/train': 1.5270578861236572} 11/07/2021 05:24:40 - INFO - __main__ - Step 58312: {'lr': 0.00034168071367033043, 'samples': 11195904, 'steps': 58311, 'loss/train': 1.3987207412719727} 11/07/2021 05:24:40 - INFO - __main__ - Step 58313: {'lr': 0.0003416757766249553, 'samples': 11196096, 'steps': 58312, 'loss/train': 1.6209709644317627} 11/07/2021 05:24:40 - INFO - __main__ - Step 58314: {'lr': 0.0003416708395382725, 'samples': 11196288, 'steps': 58313, 'loss/train': 1.304992437362671} 11/07/2021 05:24:41 - INFO - __main__ - Step 58315: {'lr': 0.00034166590241028425, 'samples': 11196480, 'steps': 58314, 'loss/train': 1.3999801874160767} 11/07/2021 05:24:42 - INFO - __main__ - Step 58316: {'lr': 0.00034166096524099264, 'samples': 11196672, 'steps': 58315, 'loss/train': 1.3416448831558228} 11/07/2021 05:24:42 - INFO - __main__ - Step 58317: {'lr': 0.00034165602803040013, 'samples': 11196864, 'steps': 58316, 'loss/train': 0.8841568827629089} 11/07/2021 05:24:42 - INFO - __main__ - Step 58318: {'lr': 0.00034165109077850884, 'samples': 11197056, 'steps': 58317, 'loss/train': 1.7505240440368652} 11/07/2021 05:24:43 - INFO - __main__ - Step 58319: {'lr': 0.00034164615348532094, 'samples': 11197248, 'steps': 58318, 'loss/train': 1.6183518171310425} 11/07/2021 05:24:44 - INFO - __main__ - Step 58320: {'lr': 0.0003416412161508387, 'samples': 11197440, 'steps': 58319, 'loss/train': 1.959389328956604} 11/07/2021 05:24:44 - INFO - __main__ - Step 58321: {'lr': 0.0003416362787750643, 'samples': 11197632, 'steps': 58320, 'loss/train': 1.081329107284546} 11/07/2021 05:24:44 - INFO - __main__ - Step 58322: {'lr': 0.00034163134135800004, 'samples': 11197824, 'steps': 58321, 'loss/train': 1.529457688331604} 11/07/2021 05:24:45 - INFO - __main__ - Step 58323: {'lr': 0.00034162640389964814, 'samples': 11198016, 'steps': 58322, 'loss/train': 1.1754611730575562} 11/07/2021 05:24:45 - INFO - __main__ - Step 58324: {'lr': 0.0003416214664000108, 'samples': 11198208, 'steps': 58323, 'loss/train': 1.4142574071884155} 11/07/2021 05:24:46 - INFO - __main__ - Step 58325: {'lr': 0.00034161652885909025, 'samples': 11198400, 'steps': 58324, 'loss/train': 1.1935970783233643} 11/07/2021 05:24:47 - INFO - __main__ - Step 58326: {'lr': 0.0003416115912768887, 'samples': 11198592, 'steps': 58325, 'loss/train': 1.3951812982559204} 11/07/2021 05:24:47 - INFO - __main__ - Step 58327: {'lr': 0.0003416066536534083, 'samples': 11198784, 'steps': 58326, 'loss/train': 0.9472230672836304} 11/07/2021 05:24:47 - INFO - __main__ - Step 58328: {'lr': 0.0003416017159886514, 'samples': 11198976, 'steps': 58327, 'loss/train': 0.9106329083442688} 11/07/2021 05:24:48 - INFO - __main__ - Step 58329: {'lr': 0.0003415967782826202, 'samples': 11199168, 'steps': 58328, 'loss/train': 0.9734601974487305} 11/07/2021 05:24:49 - INFO - __main__ - Step 58330: {'lr': 0.0003415918405353169, 'samples': 11199360, 'steps': 58329, 'loss/train': 0.8111278414726257} 11/07/2021 05:24:49 - INFO - __main__ - Step 58331: {'lr': 0.0003415869027467437, 'samples': 11199552, 'steps': 58330, 'loss/train': 1.3817026615142822} 11/07/2021 05:24:50 - INFO - __main__ - Step 58332: {'lr': 0.000341581964916903, 'samples': 11199744, 'steps': 58331, 'loss/train': 2.877706527709961} 11/07/2021 05:24:50 - INFO - __main__ - Step 58333: {'lr': 0.00034157702704579667, 'samples': 11199936, 'steps': 58332, 'loss/train': 1.1789413690567017} 11/07/2021 05:24:50 - INFO - __main__ - Step 58334: {'lr': 0.00034157208913342726, 'samples': 11200128, 'steps': 58333, 'loss/train': 1.6798967123031616} 11/07/2021 05:24:51 - INFO - __main__ - Step 58335: {'lr': 0.00034156715117979685, 'samples': 11200320, 'steps': 58334, 'loss/train': 1.5536830425262451} 11/07/2021 05:24:52 - INFO - __main__ - Step 58336: {'lr': 0.00034156221318490767, 'samples': 11200512, 'steps': 58335, 'loss/train': 1.310221791267395} 11/07/2021 05:24:52 - INFO - __main__ - Step 58337: {'lr': 0.000341557275148762, 'samples': 11200704, 'steps': 58336, 'loss/train': 1.6157326698303223} 11/07/2021 05:24:52 - INFO - __main__ - Step 58338: {'lr': 0.0003415523370713621, 'samples': 11200896, 'steps': 58337, 'loss/train': 1.3565797805786133} 11/07/2021 05:24:53 - INFO - __main__ - Step 58339: {'lr': 0.00034154739895271005, 'samples': 11201088, 'steps': 58338, 'loss/train': 1.564904808998108} 11/07/2021 05:24:54 - INFO - __main__ - Step 58340: {'lr': 0.00034154246079280817, 'samples': 11201280, 'steps': 58339, 'loss/train': 1.5634486675262451} 11/07/2021 05:24:54 - INFO - __main__ - Step 58341: {'lr': 0.0003415375225916586, 'samples': 11201472, 'steps': 58340, 'loss/train': 1.7276787757873535} 11/07/2021 05:24:55 - INFO - __main__ - Step 58342: {'lr': 0.0003415325843492637, 'samples': 11201664, 'steps': 58341, 'loss/train': 1.4368934631347656} 11/07/2021 05:24:55 - INFO - __main__ - Step 58343: {'lr': 0.00034152764606562564, 'samples': 11201856, 'steps': 58342, 'loss/train': 1.7426483631134033} 11/07/2021 05:24:55 - INFO - __main__ - Step 58344: {'lr': 0.0003415227077407466, 'samples': 11202048, 'steps': 58343, 'loss/train': 2.328885555267334} 11/07/2021 05:24:56 - INFO - __main__ - Step 58345: {'lr': 0.00034151776937462895, 'samples': 11202240, 'steps': 58344, 'loss/train': 1.487534761428833} 11/07/2021 05:24:57 - INFO - __main__ - Step 58346: {'lr': 0.0003415128309672747, 'samples': 11202432, 'steps': 58345, 'loss/train': 1.504127860069275} 11/07/2021 05:24:57 - INFO - __main__ - Step 58347: {'lr': 0.0003415078925186862, 'samples': 11202624, 'steps': 58346, 'loss/train': 1.621259331703186} 11/07/2021 05:24:57 - INFO - __main__ - Step 58348: {'lr': 0.00034150295402886566, 'samples': 11202816, 'steps': 58347, 'loss/train': 1.4061305522918701} 11/07/2021 05:24:58 - INFO - __main__ - Step 58349: {'lr': 0.0003414980154978153, 'samples': 11203008, 'steps': 58348, 'loss/train': 1.718531847000122} 11/07/2021 05:24:58 - INFO - __main__ - Step 58350: {'lr': 0.00034149307692553734, 'samples': 11203200, 'steps': 58349, 'loss/train': 1.1638323068618774} 11/07/2021 05:24:59 - INFO - __main__ - Step 58351: {'lr': 0.000341488138312034, 'samples': 11203392, 'steps': 58350, 'loss/train': 1.4727329015731812} 11/07/2021 05:24:59 - INFO - __main__ - Step 58352: {'lr': 0.00034148319965730757, 'samples': 11203584, 'steps': 58351, 'loss/train': 1.5973743200302124} 11/07/2021 05:25:00 - INFO - __main__ - Step 58353: {'lr': 0.0003414782609613602, 'samples': 11203776, 'steps': 58352, 'loss/train': 1.1058869361877441} 11/07/2021 05:25:00 - INFO - __main__ - Step 58354: {'lr': 0.0003414733222241941, 'samples': 11203968, 'steps': 58353, 'loss/train': 1.8483293056488037} 11/07/2021 05:25:00 - INFO - __main__ - Step 58355: {'lr': 0.00034146838344581155, 'samples': 11204160, 'steps': 58354, 'loss/train': 1.3488630056381226} 11/07/2021 05:25:01 - INFO - __main__ - Step 58356: {'lr': 0.00034146344462621477, 'samples': 11204352, 'steps': 58355, 'loss/train': 1.172558307647705} 11/07/2021 05:25:02 - INFO - __main__ - Step 58357: {'lr': 0.00034145850576540595, 'samples': 11204544, 'steps': 58356, 'loss/train': 1.4694602489471436} 11/07/2021 05:25:02 - INFO - __main__ - Step 58358: {'lr': 0.00034145356686338736, 'samples': 11204736, 'steps': 58357, 'loss/train': 1.4156039953231812} 11/07/2021 05:25:02 - INFO - __main__ - Step 58359: {'lr': 0.00034144862792016123, 'samples': 11204928, 'steps': 58358, 'loss/train': 1.4292247295379639} 11/07/2021 05:25:03 - INFO - __main__ - Step 58360: {'lr': 0.00034144368893572973, 'samples': 11205120, 'steps': 58359, 'loss/train': 0.8911574482917786} 11/07/2021 05:25:04 - INFO - __main__ - Step 58361: {'lr': 0.00034143874991009513, 'samples': 11205312, 'steps': 58360, 'loss/train': 1.5435965061187744} 11/07/2021 05:25:04 - INFO - __main__ - Step 58362: {'lr': 0.0003414338108432596, 'samples': 11205504, 'steps': 58361, 'loss/train': 1.7827941179275513} 11/07/2021 05:25:05 - INFO - __main__ - Step 58363: {'lr': 0.0003414288717352254, 'samples': 11205696, 'steps': 58362, 'loss/train': 1.3278061151504517} 11/07/2021 05:25:05 - INFO - __main__ - Step 58364: {'lr': 0.00034142393258599485, 'samples': 11205888, 'steps': 58363, 'loss/train': 2.042924642562866} 11/07/2021 05:25:05 - INFO - __main__ - Step 58365: {'lr': 0.00034141899339557003, 'samples': 11206080, 'steps': 58364, 'loss/train': 0.7355446219444275} 11/07/2021 05:25:06 - INFO - __main__ - Step 58366: {'lr': 0.0003414140541639532, 'samples': 11206272, 'steps': 58365, 'loss/train': 1.257541537284851} 11/07/2021 05:25:07 - INFO - __main__ - Step 58367: {'lr': 0.0003414091148911466, 'samples': 11206464, 'steps': 58366, 'loss/train': 1.490370750427246} 11/07/2021 05:25:07 - INFO - __main__ - Step 58368: {'lr': 0.00034140417557715255, 'samples': 11206656, 'steps': 58367, 'loss/train': 1.6572939157485962} 11/07/2021 05:25:07 - INFO - __main__ - Step 58369: {'lr': 0.0003413992362219731, 'samples': 11206848, 'steps': 58368, 'loss/train': 1.513646125793457} 11/07/2021 05:25:08 - INFO - __main__ - Step 58370: {'lr': 0.0003413942968256106, 'samples': 11207040, 'steps': 58369, 'loss/train': 1.2397743463516235} 11/07/2021 05:25:09 - INFO - __main__ - Step 58371: {'lr': 0.00034138935738806727, 'samples': 11207232, 'steps': 58370, 'loss/train': 1.7745050191879272} 11/07/2021 05:25:09 - INFO - __main__ - Step 58372: {'lr': 0.0003413844179093453, 'samples': 11207424, 'steps': 58371, 'loss/train': 1.7837297916412354} 11/07/2021 05:25:09 - INFO - __main__ - Step 58373: {'lr': 0.0003413794783894468, 'samples': 11207616, 'steps': 58372, 'loss/train': 1.6627751588821411} 11/07/2021 05:25:10 - INFO - __main__ - Step 58374: {'lr': 0.0003413745388283742, 'samples': 11207808, 'steps': 58373, 'loss/train': 1.1967734098434448} 11/07/2021 05:25:10 - INFO - __main__ - Step 58375: {'lr': 0.00034136959922612977, 'samples': 11208000, 'steps': 58374, 'loss/train': 1.3841203451156616} 11/07/2021 05:25:10 - INFO - __main__ - Step 58376: {'lr': 0.00034136465958271546, 'samples': 11208192, 'steps': 58375, 'loss/train': 1.028459072113037} 11/07/2021 05:25:11 - INFO - __main__ - Step 58377: {'lr': 0.00034135971989813363, 'samples': 11208384, 'steps': 58376, 'loss/train': 1.339853286743164} 11/07/2021 05:25:12 - INFO - __main__ - Step 58378: {'lr': 0.0003413547801723866, 'samples': 11208576, 'steps': 58377, 'loss/train': 1.0697388648986816} 11/07/2021 05:25:12 - INFO - __main__ - Step 58379: {'lr': 0.00034134984040547645, 'samples': 11208768, 'steps': 58378, 'loss/train': 1.5722737312316895} 11/07/2021 05:25:13 - INFO - __main__ - Step 58380: {'lr': 0.0003413449005974055, 'samples': 11208960, 'steps': 58379, 'loss/train': 1.5390830039978027} 11/07/2021 05:25:13 - INFO - __main__ - Step 58381: {'lr': 0.00034133996074817597, 'samples': 11209152, 'steps': 58380, 'loss/train': 1.4299684762954712} 11/07/2021 05:25:14 - INFO - __main__ - Step 58382: {'lr': 0.00034133502085779006, 'samples': 11209344, 'steps': 58381, 'loss/train': 1.7696834802627563} 11/07/2021 05:25:14 - INFO - __main__ - Step 58383: {'lr': 0.00034133008092624995, 'samples': 11209536, 'steps': 58382, 'loss/train': 1.5855791568756104} 11/07/2021 05:25:15 - INFO - __main__ - Step 58384: {'lr': 0.0003413251409535579, 'samples': 11209728, 'steps': 58383, 'loss/train': 1.531802773475647} 11/07/2021 05:25:15 - INFO - __main__ - Step 58385: {'lr': 0.0003413202009397163, 'samples': 11209920, 'steps': 58384, 'loss/train': 1.2706406116485596} 11/07/2021 05:25:16 - INFO - __main__ - Step 58386: {'lr': 0.0003413152608847271, 'samples': 11210112, 'steps': 58385, 'loss/train': 0.9080643653869629} 11/07/2021 05:25:17 - INFO - __main__ - Step 58387: {'lr': 0.0003413103207885927, 'samples': 11210304, 'steps': 58386, 'loss/train': 1.0522125959396362} 11/07/2021 05:25:17 - INFO - __main__ - Step 58388: {'lr': 0.00034130538065131524, 'samples': 11210496, 'steps': 58387, 'loss/train': 1.045793890953064} 11/07/2021 05:25:17 - INFO - __main__ - Step 58389: {'lr': 0.000341300440472897, 'samples': 11210688, 'steps': 58388, 'loss/train': 1.702425479888916} 11/07/2021 05:25:18 - INFO - __main__ - Step 58390: {'lr': 0.00034129550025334014, 'samples': 11210880, 'steps': 58389, 'loss/train': 1.2906677722930908} 11/07/2021 05:25:18 - INFO - __main__ - Step 58391: {'lr': 0.00034129055999264704, 'samples': 11211072, 'steps': 58390, 'loss/train': 1.4244874715805054} 11/07/2021 05:25:19 - INFO - __main__ - Step 58392: {'lr': 0.0003412856196908198, 'samples': 11211264, 'steps': 58391, 'loss/train': 1.1508504152297974} 11/07/2021 05:25:19 - INFO - __main__ - Step 58393: {'lr': 0.00034128067934786064, 'samples': 11211456, 'steps': 58392, 'loss/train': 1.6703565120697021} 11/07/2021 05:25:20 - INFO - __main__ - Step 58394: {'lr': 0.0003412757389637718, 'samples': 11211648, 'steps': 58393, 'loss/train': 1.8441166877746582} 11/07/2021 05:25:20 - INFO - __main__ - Step 58395: {'lr': 0.00034127079853855545, 'samples': 11211840, 'steps': 58394, 'loss/train': 1.4765968322753906} 11/07/2021 05:25:20 - INFO - __main__ - Step 58396: {'lr': 0.00034126585807221397, 'samples': 11212032, 'steps': 58395, 'loss/train': 2.049560546875} 11/07/2021 05:25:21 - INFO - __main__ - Step 58397: {'lr': 0.0003412609175647495, 'samples': 11212224, 'steps': 58396, 'loss/train': 1.4930609464645386} 11/07/2021 05:25:22 - INFO - __main__ - Step 58398: {'lr': 0.0003412559770161643, 'samples': 11212416, 'steps': 58397, 'loss/train': 1.710405945777893} 11/07/2021 05:25:22 - INFO - __main__ - Step 58399: {'lr': 0.0003412510364264606, 'samples': 11212608, 'steps': 58398, 'loss/train': 1.7443203926086426} 11/07/2021 05:25:23 - INFO - __main__ - Step 58400: {'lr': 0.0003412460957956405, 'samples': 11212800, 'steps': 58399, 'loss/train': 2.1306657791137695} 11/07/2021 05:25:23 - INFO - __main__ - Step 58401: {'lr': 0.00034124115512370636, 'samples': 11212992, 'steps': 58400, 'loss/train': 1.4786614179611206} 11/07/2021 05:25:23 - INFO - __main__ - Step 58402: {'lr': 0.0003412362144106603, 'samples': 11213184, 'steps': 58401, 'loss/train': 1.1887379884719849} 11/07/2021 05:25:24 - INFO - __main__ - Step 58403: {'lr': 0.00034123127365650463, 'samples': 11213376, 'steps': 58402, 'loss/train': 1.3642297983169556} 11/07/2021 05:25:25 - INFO - __main__ - Step 58404: {'lr': 0.0003412263328612416, 'samples': 11213568, 'steps': 58403, 'loss/train': 1.8511093854904175} 11/07/2021 05:25:25 - INFO - __main__ - Step 58405: {'lr': 0.00034122139202487334, 'samples': 11213760, 'steps': 58404, 'loss/train': 1.587825059890747} 11/07/2021 05:25:25 - INFO - __main__ - Step 58406: {'lr': 0.00034121645114740224, 'samples': 11213952, 'steps': 58405, 'loss/train': 1.443624496459961} 11/07/2021 05:25:26 - INFO - __main__ - Step 58407: {'lr': 0.00034121151022883033, 'samples': 11214144, 'steps': 58406, 'loss/train': 1.4731860160827637} 11/07/2021 05:25:26 - INFO - __main__ - Step 58408: {'lr': 0.00034120656926915995, 'samples': 11214336, 'steps': 58407, 'loss/train': 1.2097426652908325} 11/07/2021 05:25:27 - INFO - __main__ - Step 58409: {'lr': 0.0003412016282683932, 'samples': 11214528, 'steps': 58408, 'loss/train': 1.3646695613861084} 11/07/2021 05:25:28 - INFO - __main__ - Step 58410: {'lr': 0.0003411966872265325, 'samples': 11214720, 'steps': 58409, 'loss/train': 1.5305606126785278} 11/07/2021 05:25:28 - INFO - __main__ - Step 58411: {'lr': 0.00034119174614357994, 'samples': 11214912, 'steps': 58410, 'loss/train': 1.4746593236923218} 11/07/2021 05:25:28 - INFO - __main__ - Step 58412: {'lr': 0.00034118680501953784, 'samples': 11215104, 'steps': 58411, 'loss/train': 1.6140037775039673} 11/07/2021 05:25:29 - INFO - __main__ - Step 58413: {'lr': 0.00034118186385440833, 'samples': 11215296, 'steps': 58412, 'loss/train': 1.4457831382751465} 11/07/2021 05:25:30 - INFO - __main__ - Step 58414: {'lr': 0.00034117692264819374, 'samples': 11215488, 'steps': 58413, 'loss/train': 0.9754199385643005} 11/07/2021 05:25:30 - INFO - __main__ - Step 58415: {'lr': 0.0003411719814008961, 'samples': 11215680, 'steps': 58414, 'loss/train': 1.1841983795166016} 11/07/2021 05:25:30 - INFO - __main__ - Step 58416: {'lr': 0.0003411670401125179, 'samples': 11215872, 'steps': 58415, 'loss/train': 1.4019126892089844} 11/07/2021 05:25:31 - INFO - __main__ - Step 58417: {'lr': 0.00034116209878306116, 'samples': 11216064, 'steps': 58416, 'loss/train': 1.2930548191070557} 11/07/2021 05:25:31 - INFO - __main__ - Step 58418: {'lr': 0.00034115715741252824, 'samples': 11216256, 'steps': 58417, 'loss/train': 1.7811188697814941} 11/07/2021 05:25:32 - INFO - __main__ - Step 58419: {'lr': 0.0003411522160009213, 'samples': 11216448, 'steps': 58418, 'loss/train': 1.5482412576675415} 11/07/2021 05:25:32 - INFO - __main__ - Step 58420: {'lr': 0.00034114727454824257, 'samples': 11216640, 'steps': 58419, 'loss/train': 1.6688786745071411} 11/07/2021 05:25:33 - INFO - __main__ - Step 58421: {'lr': 0.00034114233305449426, 'samples': 11216832, 'steps': 58420, 'loss/train': 1.3504208326339722} 11/07/2021 05:25:33 - INFO - __main__ - Step 58422: {'lr': 0.00034113739151967864, 'samples': 11217024, 'steps': 58421, 'loss/train': 1.7886126041412354} 11/07/2021 05:25:33 - INFO - __main__ - Step 58423: {'lr': 0.00034113244994379794, 'samples': 11217216, 'steps': 58422, 'loss/train': 1.0460597276687622} 11/07/2021 05:25:35 - INFO - __main__ - Step 58424: {'lr': 0.00034112750832685434, 'samples': 11217408, 'steps': 58423, 'loss/train': 1.436403751373291} 11/07/2021 05:25:35 - INFO - __main__ - Step 58425: {'lr': 0.0003411225666688501, 'samples': 11217600, 'steps': 58424, 'loss/train': 2.3052642345428467} 11/07/2021 05:25:35 - INFO - __main__ - Step 58426: {'lr': 0.0003411176249697875, 'samples': 11217792, 'steps': 58425, 'loss/train': 1.1679171323776245} 11/07/2021 05:25:36 - INFO - __main__ - Step 58427: {'lr': 0.0003411126832296686, 'samples': 11217984, 'steps': 58426, 'loss/train': 1.7709113359451294} 11/07/2021 05:25:36 - INFO - __main__ - Step 58428: {'lr': 0.00034110774144849575, 'samples': 11218176, 'steps': 58427, 'loss/train': 1.904578447341919} 11/07/2021 05:25:37 - INFO - __main__ - Step 58429: {'lr': 0.00034110279962627115, 'samples': 11218368, 'steps': 58428, 'loss/train': 0.8743123412132263} 11/07/2021 05:25:38 - INFO - __main__ - Step 58430: {'lr': 0.0003410978577629971, 'samples': 11218560, 'steps': 58429, 'loss/train': 2.0276880264282227} 11/07/2021 05:25:38 - INFO - __main__ - Step 58431: {'lr': 0.0003410929158586757, 'samples': 11218752, 'steps': 58430, 'loss/train': 0.9819093942642212} 11/07/2021 05:25:38 - INFO - __main__ - Step 58432: {'lr': 0.0003410879739133093, 'samples': 11218944, 'steps': 58431, 'loss/train': 1.8218625783920288} 11/07/2021 05:25:39 - INFO - __main__ - Step 58433: {'lr': 0.00034108303192690003, 'samples': 11219136, 'steps': 58432, 'loss/train': 2.471243381500244} 11/07/2021 05:25:40 - INFO - __main__ - Step 58434: {'lr': 0.0003410780898994501, 'samples': 11219328, 'steps': 58433, 'loss/train': 1.8734984397888184} 11/07/2021 05:25:40 - INFO - __main__ - Step 58435: {'lr': 0.00034107314783096183, 'samples': 11219520, 'steps': 58434, 'loss/train': 1.4329745769500732} 11/07/2021 05:25:41 - INFO - __main__ - Step 58436: {'lr': 0.0003410682057214374, 'samples': 11219712, 'steps': 58435, 'loss/train': 0.6696116924285889} 11/07/2021 05:25:41 - INFO - __main__ - Step 58437: {'lr': 0.00034106326357087905, 'samples': 11219904, 'steps': 58436, 'loss/train': 1.388970971107483} 11/07/2021 05:25:41 - INFO - __main__ - Step 58438: {'lr': 0.000341058321379289, 'samples': 11220096, 'steps': 58437, 'loss/train': 1.5257045030593872} 11/07/2021 05:25:42 - INFO - __main__ - Step 58439: {'lr': 0.0003410533791466695, 'samples': 11220288, 'steps': 58438, 'loss/train': 1.4143130779266357} 11/07/2021 05:25:43 - INFO - __main__ - Step 58440: {'lr': 0.0003410484368730227, 'samples': 11220480, 'steps': 58439, 'loss/train': 1.3513290882110596} 11/07/2021 05:25:43 - INFO - __main__ - Step 58441: {'lr': 0.00034104349455835094, 'samples': 11220672, 'steps': 58440, 'loss/train': 0.796983003616333} 11/07/2021 05:25:43 - INFO - __main__ - Step 58442: {'lr': 0.0003410385522026563, 'samples': 11220864, 'steps': 58441, 'loss/train': 1.6240644454956055} 11/07/2021 05:25:44 - INFO - __main__ - Step 58443: {'lr': 0.0003410336098059412, 'samples': 11221056, 'steps': 58442, 'loss/train': 0.7649230360984802} 11/07/2021 05:25:44 - INFO - __main__ - Step 58444: {'lr': 0.0003410286673682077, 'samples': 11221248, 'steps': 58443, 'loss/train': 1.2710825204849243} 11/07/2021 05:25:45 - INFO - __main__ - Step 58445: {'lr': 0.0003410237248894581, 'samples': 11221440, 'steps': 58444, 'loss/train': 1.3829941749572754} 11/07/2021 05:25:45 - INFO - __main__ - Step 58446: {'lr': 0.00034101878236969464, 'samples': 11221632, 'steps': 58445, 'loss/train': 2.1312994956970215} 11/07/2021 05:25:46 - INFO - __main__ - Step 58447: {'lr': 0.0003410138398089195, 'samples': 11221824, 'steps': 58446, 'loss/train': 1.766077995300293} 11/07/2021 05:25:46 - INFO - __main__ - Step 58448: {'lr': 0.0003410088972071349, 'samples': 11222016, 'steps': 58447, 'loss/train': 1.6206458806991577} 11/07/2021 05:25:47 - INFO - __main__ - Step 58449: {'lr': 0.0003410039545643431, 'samples': 11222208, 'steps': 58448, 'loss/train': 1.5594794750213623} 11/07/2021 05:25:48 - INFO - __main__ - Step 58450: {'lr': 0.0003409990118805463, 'samples': 11222400, 'steps': 58449, 'loss/train': 1.4494882822036743} 11/07/2021 05:25:48 - INFO - __main__ - Step 58451: {'lr': 0.0003409940691557468, 'samples': 11222592, 'steps': 58450, 'loss/train': 1.6536080837249756} 11/07/2021 05:25:48 - INFO - __main__ - Step 58452: {'lr': 0.0003409891263899467, 'samples': 11222784, 'steps': 58451, 'loss/train': 1.4545527696609497} 11/07/2021 05:25:49 - INFO - __main__ - Step 58453: {'lr': 0.0003409841835831484, 'samples': 11222976, 'steps': 58452, 'loss/train': 1.3133459091186523} 11/07/2021 05:25:49 - INFO - __main__ - Step 58454: {'lr': 0.000340979240735354, 'samples': 11223168, 'steps': 58453, 'loss/train': 1.4204963445663452} 11/07/2021 05:25:49 - INFO - __main__ - Step 58455: {'lr': 0.00034097429784656574, 'samples': 11223360, 'steps': 58454, 'loss/train': 1.6007115840911865} 11/07/2021 05:25:50 - INFO - __main__ - Step 58456: {'lr': 0.00034096935491678595, 'samples': 11223552, 'steps': 58455, 'loss/train': 1.6012096405029297} 11/07/2021 05:25:51 - INFO - __main__ - Step 58457: {'lr': 0.0003409644119460166, 'samples': 11223744, 'steps': 58456, 'loss/train': 1.7056620121002197} 11/07/2021 05:25:51 - INFO - __main__ - Step 58458: {'lr': 0.00034095946893426024, 'samples': 11223936, 'steps': 58457, 'loss/train': 0.8120853900909424} 11/07/2021 05:25:51 - INFO - __main__ - Step 58459: {'lr': 0.0003409545258815189, 'samples': 11224128, 'steps': 58458, 'loss/train': 1.0179810523986816} 11/07/2021 05:25:52 - INFO - __main__ - Step 58460: {'lr': 0.00034094958278779486, 'samples': 11224320, 'steps': 58459, 'loss/train': 1.7241183519363403} 11/07/2021 05:25:53 - INFO - __main__ - Step 58461: {'lr': 0.00034094463965309035, 'samples': 11224512, 'steps': 58460, 'loss/train': 1.5440194606781006} 11/07/2021 05:25:53 - INFO - __main__ - Step 58462: {'lr': 0.00034093969647740755, 'samples': 11224704, 'steps': 58461, 'loss/train': 0.582253098487854} 11/07/2021 05:25:54 - INFO - __main__ - Step 58463: {'lr': 0.00034093475326074874, 'samples': 11224896, 'steps': 58462, 'loss/train': 1.6896227598190308} 11/07/2021 05:25:54 - INFO - __main__ - Step 58464: {'lr': 0.00034092981000311614, 'samples': 11225088, 'steps': 58463, 'loss/train': 1.6384936571121216} 11/07/2021 05:25:54 - INFO - __main__ - Step 58465: {'lr': 0.00034092486670451197, 'samples': 11225280, 'steps': 58464, 'loss/train': 0.5751922130584717} 11/07/2021 05:25:55 - INFO - __main__ - Step 58466: {'lr': 0.0003409199233649385, 'samples': 11225472, 'steps': 58465, 'loss/train': 0.8946777582168579} 11/07/2021 05:25:56 - INFO - __main__ - Step 58467: {'lr': 0.0003409149799843979, 'samples': 11225664, 'steps': 58466, 'loss/train': 1.3113489151000977} 11/07/2021 05:25:56 - INFO - __main__ - Step 58468: {'lr': 0.00034091003656289235, 'samples': 11225856, 'steps': 58467, 'loss/train': 1.8885964155197144} 11/07/2021 05:25:56 - INFO - __main__ - Step 58469: {'lr': 0.00034090509310042414, 'samples': 11226048, 'steps': 58468, 'loss/train': 1.6570892333984375} 11/07/2021 05:25:57 - INFO - __main__ - Step 58470: {'lr': 0.00034090014959699554, 'samples': 11226240, 'steps': 58469, 'loss/train': 1.4603313207626343} 11/07/2021 05:25:59 - INFO - __main__ - Step 58471: {'lr': 0.0003408952060526087, 'samples': 11226432, 'steps': 58470, 'loss/train': 1.2665852308273315} 11/07/2021 05:25:59 - INFO - __main__ - Step 58472: {'lr': 0.00034089026246726596, 'samples': 11226624, 'steps': 58471, 'loss/train': 1.663528561592102} 11/07/2021 05:26:00 - INFO - __main__ - Step 58473: {'lr': 0.00034088531884096944, 'samples': 11226816, 'steps': 58472, 'loss/train': 1.2827757596969604} 11/07/2021 05:26:00 - INFO - __main__ - Step 58474: {'lr': 0.0003408803751737214, 'samples': 11227008, 'steps': 58473, 'loss/train': 1.6015524864196777} 11/07/2021 05:26:00 - INFO - __main__ - Step 58475: {'lr': 0.00034087543146552404, 'samples': 11227200, 'steps': 58474, 'loss/train': 1.0194486379623413} 11/07/2021 05:26:01 - INFO - __main__ - Step 58476: {'lr': 0.0003408704877163796, 'samples': 11227392, 'steps': 58475, 'loss/train': 1.4756325483322144} 11/07/2021 05:26:01 - INFO - __main__ - Step 58477: {'lr': 0.00034086554392629033, 'samples': 11227584, 'steps': 58476, 'loss/train': 0.9150516986846924} 11/07/2021 05:26:01 - INFO - __main__ - Step 58478: {'lr': 0.00034086060009525844, 'samples': 11227776, 'steps': 58477, 'loss/train': 0.882000207901001} 11/07/2021 05:26:02 - INFO - __main__ - Step 58479: {'lr': 0.0003408556562232862, 'samples': 11227968, 'steps': 58478, 'loss/train': 1.1304144859313965} 11/07/2021 05:26:03 - INFO - __main__ - Step 58480: {'lr': 0.00034085071231037585, 'samples': 11228160, 'steps': 58479, 'loss/train': 1.2820016145706177} 11/07/2021 05:26:03 - INFO - __main__ - Step 58481: {'lr': 0.0003408457683565295, 'samples': 11228352, 'steps': 58480, 'loss/train': 1.3621982336044312} 11/07/2021 05:26:03 - INFO - __main__ - Step 58482: {'lr': 0.00034084082436174946, 'samples': 11228544, 'steps': 58481, 'loss/train': 1.0864423513412476} 11/07/2021 05:26:04 - INFO - __main__ - Step 58483: {'lr': 0.0003408358803260379, 'samples': 11228736, 'steps': 58482, 'loss/train': 1.4017298221588135} 11/07/2021 05:26:04 - INFO - __main__ - Step 58484: {'lr': 0.00034083093624939716, 'samples': 11228928, 'steps': 58483, 'loss/train': 1.4749741554260254} 11/07/2021 05:26:05 - INFO - __main__ - Step 58485: {'lr': 0.00034082599213182933, 'samples': 11229120, 'steps': 58484, 'loss/train': 1.3955676555633545} 11/07/2021 05:26:05 - INFO - __main__ - Step 58486: {'lr': 0.0003408210479733368, 'samples': 11229312, 'steps': 58485, 'loss/train': 1.6664291620254517} 11/07/2021 05:26:06 - INFO - __main__ - Step 58487: {'lr': 0.0003408161037739217, 'samples': 11229504, 'steps': 58486, 'loss/train': 1.0070327520370483} 11/07/2021 05:26:06 - INFO - __main__ - Step 58488: {'lr': 0.0003408111595335862, 'samples': 11229696, 'steps': 58487, 'loss/train': 1.5712045431137085} 11/07/2021 05:26:06 - INFO - __main__ - Step 58489: {'lr': 0.00034080621525233264, 'samples': 11229888, 'steps': 58488, 'loss/train': 1.3353732824325562} 11/07/2021 05:26:08 - INFO - __main__ - Step 58490: {'lr': 0.0003408012709301632, 'samples': 11230080, 'steps': 58489, 'loss/train': 1.323580026626587} 11/07/2021 05:26:08 - INFO - __main__ - Step 58491: {'lr': 0.00034079632656708005, 'samples': 11230272, 'steps': 58490, 'loss/train': 1.2277162075042725} 11/07/2021 05:26:08 - INFO - __main__ - Step 58492: {'lr': 0.00034079138216308553, 'samples': 11230464, 'steps': 58491, 'loss/train': 1.6625176668167114} 11/07/2021 05:26:09 - INFO - __main__ - Step 58493: {'lr': 0.00034078643771818184, 'samples': 11230656, 'steps': 58492, 'loss/train': 1.116775393486023} 11/07/2021 05:26:09 - INFO - __main__ - Step 58494: {'lr': 0.00034078149323237114, 'samples': 11230848, 'steps': 58493, 'loss/train': 1.5711719989776611} 11/07/2021 05:26:10 - INFO - __main__ - Step 58495: {'lr': 0.00034077654870565566, 'samples': 11231040, 'steps': 58494, 'loss/train': 1.6955015659332275} 11/07/2021 05:26:10 - INFO - __main__ - Step 58496: {'lr': 0.00034077160413803774, 'samples': 11231232, 'steps': 58495, 'loss/train': 1.8912805318832397} 11/07/2021 05:26:11 - INFO - __main__ - Step 58497: {'lr': 0.0003407666595295195, 'samples': 11231424, 'steps': 58496, 'loss/train': 2.1351826190948486} 11/07/2021 05:26:11 - INFO - __main__ - Step 58498: {'lr': 0.0003407617148801033, 'samples': 11231616, 'steps': 58497, 'loss/train': 1.5520429611206055} 11/07/2021 05:26:11 - INFO - __main__ - Step 58499: {'lr': 0.0003407567701897911, 'samples': 11231808, 'steps': 58498, 'loss/train': 1.2427788972854614} 11/07/2021 05:26:12 - INFO - __main__ - Step 58500: {'lr': 0.0003407518254585854, 'samples': 11232000, 'steps': 58499, 'loss/train': 1.039374828338623} 11/07/2021 05:26:13 - INFO - __main__ - Step 58501: {'lr': 0.0003407468806864883, 'samples': 11232192, 'steps': 58500, 'loss/train': 1.1369225978851318} 11/07/2021 05:26:13 - INFO - __main__ - Step 58502: {'lr': 0.0003407419358735021, 'samples': 11232384, 'steps': 58501, 'loss/train': 1.0871407985687256} 11/07/2021 05:26:13 - INFO - __main__ - Step 58503: {'lr': 0.0003407369910196289, 'samples': 11232576, 'steps': 58502, 'loss/train': 1.1347147226333618} 11/07/2021 05:26:14 - INFO - __main__ - Step 58504: {'lr': 0.0003407320461248711, 'samples': 11232768, 'steps': 58503, 'loss/train': 1.7463244199752808} 11/07/2021 05:26:15 - INFO - __main__ - Step 58505: {'lr': 0.00034072710118923086, 'samples': 11232960, 'steps': 58504, 'loss/train': 1.2943141460418701} 11/07/2021 05:26:15 - INFO - __main__ - Step 58506: {'lr': 0.0003407221562127103, 'samples': 11233152, 'steps': 58505, 'loss/train': 1.6389752626419067} 11/07/2021 05:26:15 - INFO - __main__ - Step 58507: {'lr': 0.0003407172111953117, 'samples': 11233344, 'steps': 58506, 'loss/train': 1.3285409212112427} 11/07/2021 05:26:16 - INFO - __main__ - Step 58508: {'lr': 0.00034071226613703744, 'samples': 11233536, 'steps': 58507, 'loss/train': 1.5552165508270264} 11/07/2021 05:26:16 - INFO - __main__ - Step 58509: {'lr': 0.0003407073210378897, 'samples': 11233728, 'steps': 58508, 'loss/train': 1.6005282402038574} 11/07/2021 05:26:17 - INFO - __main__ - Step 58510: {'lr': 0.00034070237589787047, 'samples': 11233920, 'steps': 58509, 'loss/train': 1.6873674392700195} 11/07/2021 05:26:18 - INFO - __main__ - Step 58511: {'lr': 0.00034069743071698215, 'samples': 11234112, 'steps': 58510, 'loss/train': 1.2324090003967285} 11/07/2021 05:26:18 - INFO - __main__ - Step 58512: {'lr': 0.000340692485495227, 'samples': 11234304, 'steps': 58511, 'loss/train': 1.2363803386688232} 11/07/2021 05:26:18 - INFO - __main__ - Step 58513: {'lr': 0.0003406875402326073, 'samples': 11234496, 'steps': 58512, 'loss/train': 1.2501964569091797} 11/07/2021 05:26:19 - INFO - __main__ - Step 58514: {'lr': 0.00034068259492912514, 'samples': 11234688, 'steps': 58513, 'loss/train': 1.5018353462219238} 11/07/2021 05:26:19 - INFO - __main__ - Step 58515: {'lr': 0.00034067764958478283, 'samples': 11234880, 'steps': 58514, 'loss/train': 1.2529922723770142} 11/07/2021 05:26:20 - INFO - __main__ - Step 58516: {'lr': 0.0003406727041995825, 'samples': 11235072, 'steps': 58515, 'loss/train': 1.340499997138977} 11/07/2021 05:26:20 - INFO - __main__ - Step 58517: {'lr': 0.00034066775877352644, 'samples': 11235264, 'steps': 58516, 'loss/train': 2.417407751083374} 11/07/2021 05:26:21 - INFO - __main__ - Step 58518: {'lr': 0.00034066281330661697, 'samples': 11235456, 'steps': 58517, 'loss/train': 1.1057500839233398} 11/07/2021 05:26:21 - INFO - __main__ - Step 58519: {'lr': 0.0003406578677988562, 'samples': 11235648, 'steps': 58518, 'loss/train': 1.3503586053848267} 11/07/2021 05:26:21 - INFO - __main__ - Step 58520: {'lr': 0.00034065292225024643, 'samples': 11235840, 'steps': 58519, 'loss/train': 1.4391413927078247} 11/07/2021 05:26:22 - INFO - __main__ - Step 58521: {'lr': 0.0003406479766607898, 'samples': 11236032, 'steps': 58520, 'loss/train': 1.568670392036438} 11/07/2021 05:26:23 - INFO - __main__ - Step 58522: {'lr': 0.00034064303103048863, 'samples': 11236224, 'steps': 58521, 'loss/train': 1.5107706785202026} 11/07/2021 05:26:23 - INFO - __main__ - Step 58523: {'lr': 0.000340638085359345, 'samples': 11236416, 'steps': 58522, 'loss/train': 1.541881799697876} 11/07/2021 05:26:23 - INFO - __main__ - Step 58524: {'lr': 0.00034063313964736135, 'samples': 11236608, 'steps': 58523, 'loss/train': 1.410552740097046} 11/07/2021 05:26:24 - INFO - __main__ - Step 58525: {'lr': 0.0003406281938945398, 'samples': 11236800, 'steps': 58524, 'loss/train': 1.5633823871612549} 11/07/2021 05:26:25 - INFO - __main__ - Step 58526: {'lr': 0.0003406232481008825, 'samples': 11236992, 'steps': 58525, 'loss/train': 1.2339831590652466} 11/07/2021 05:26:25 - INFO - __main__ - Step 58527: {'lr': 0.0003406183022663919, 'samples': 11237184, 'steps': 58526, 'loss/train': 1.4524608850479126} 11/07/2021 05:26:26 - INFO - __main__ - Step 58528: {'lr': 0.00034061335639107006, 'samples': 11237376, 'steps': 58527, 'loss/train': 1.383617877960205} 11/07/2021 05:26:26 - INFO - __main__ - Step 58529: {'lr': 0.0003406084104749192, 'samples': 11237568, 'steps': 58528, 'loss/train': 1.4862842559814453} 11/07/2021 05:26:26 - INFO - __main__ - Step 58530: {'lr': 0.00034060346451794156, 'samples': 11237760, 'steps': 58529, 'loss/train': 1.6672933101654053} 11/07/2021 05:26:27 - INFO - __main__ - Step 58531: {'lr': 0.0003405985185201394, 'samples': 11237952, 'steps': 58530, 'loss/train': 1.3401219844818115} 11/07/2021 05:26:28 - INFO - __main__ - Step 58532: {'lr': 0.000340593572481515, 'samples': 11238144, 'steps': 58531, 'loss/train': 1.2921035289764404} 11/07/2021 05:26:28 - INFO - __main__ - Step 58533: {'lr': 0.0003405886264020706, 'samples': 11238336, 'steps': 58532, 'loss/train': 1.5293786525726318} 11/07/2021 05:26:28 - INFO - __main__ - Step 58534: {'lr': 0.0003405836802818082, 'samples': 11238528, 'steps': 58533, 'loss/train': 0.5165374279022217} 11/07/2021 05:26:29 - INFO - __main__ - Step 58535: {'lr': 0.00034057873412073026, 'samples': 11238720, 'steps': 58534, 'loss/train': 1.6103554964065552} 11/07/2021 05:26:30 - INFO - __main__ - Step 58536: {'lr': 0.0003405737879188389, 'samples': 11238912, 'steps': 58535, 'loss/train': 1.6690616607666016} 11/07/2021 05:26:30 - INFO - __main__ - Step 58537: {'lr': 0.0003405688416761364, 'samples': 11239104, 'steps': 58536, 'loss/train': 1.1031736135482788} 11/07/2021 05:26:30 - INFO - __main__ - Step 58538: {'lr': 0.00034056389539262506, 'samples': 11239296, 'steps': 58537, 'loss/train': 0.9342900514602661} 11/07/2021 05:26:31 - INFO - __main__ - Step 58539: {'lr': 0.000340558949068307, 'samples': 11239488, 'steps': 58538, 'loss/train': 1.6302703619003296} 11/07/2021 05:26:31 - INFO - __main__ - Step 58540: {'lr': 0.0003405540027031845, 'samples': 11239680, 'steps': 58539, 'loss/train': 1.5585918426513672} 11/07/2021 05:26:32 - INFO - __main__ - Step 58541: {'lr': 0.00034054905629725965, 'samples': 11239872, 'steps': 58540, 'loss/train': 0.9911255836486816} 11/07/2021 05:26:33 - INFO - __main__ - Step 58542: {'lr': 0.00034054410985053483, 'samples': 11240064, 'steps': 58541, 'loss/train': 1.3347887992858887} 11/07/2021 05:26:33 - INFO - __main__ - Step 58543: {'lr': 0.00034053916336301225, 'samples': 11240256, 'steps': 58542, 'loss/train': 1.497240662574768} 11/07/2021 05:26:33 - INFO - __main__ - Step 58544: {'lr': 0.00034053421683469416, 'samples': 11240448, 'steps': 58543, 'loss/train': 0.7741641402244568} 11/07/2021 05:26:34 - INFO - __main__ - Step 58545: {'lr': 0.00034052927026558265, 'samples': 11240640, 'steps': 58544, 'loss/train': 1.7934575080871582} 11/07/2021 05:26:35 - INFO - __main__ - Step 58546: {'lr': 0.00034052432365568015, 'samples': 11240832, 'steps': 58545, 'loss/train': 1.5654975175857544} 11/07/2021 05:26:35 - INFO - __main__ - Step 58547: {'lr': 0.0003405193770049888, 'samples': 11241024, 'steps': 58546, 'loss/train': 1.5136960744857788} 11/07/2021 05:26:35 - INFO - __main__ - Step 58548: {'lr': 0.0003405144303135108, 'samples': 11241216, 'steps': 58547, 'loss/train': 1.2865959405899048} 11/07/2021 05:26:36 - INFO - __main__ - Step 58549: {'lr': 0.00034050948358124836, 'samples': 11241408, 'steps': 58548, 'loss/train': 0.9187461137771606} 11/07/2021 05:26:36 - INFO - __main__ - Step 58550: {'lr': 0.00034050453680820373, 'samples': 11241600, 'steps': 58549, 'loss/train': 1.1021056175231934} 11/07/2021 05:26:37 - INFO - __main__ - Step 58551: {'lr': 0.0003404995899943791, 'samples': 11241792, 'steps': 58550, 'loss/train': 1.6945784091949463} 11/07/2021 05:26:37 - INFO - __main__ - Step 58552: {'lr': 0.00034049464313977684, 'samples': 11241984, 'steps': 58551, 'loss/train': 1.5661370754241943} 11/07/2021 05:26:38 - INFO - __main__ - Step 58553: {'lr': 0.0003404896962443991, 'samples': 11242176, 'steps': 58552, 'loss/train': 1.3892261981964111} 11/07/2021 05:26:38 - INFO - __main__ - Step 58554: {'lr': 0.0003404847493082481, 'samples': 11242368, 'steps': 58553, 'loss/train': 1.0884599685668945} 11/07/2021 05:26:38 - INFO - __main__ - Step 58555: {'lr': 0.000340479802331326, 'samples': 11242560, 'steps': 58554, 'loss/train': 1.1438909769058228} 11/07/2021 05:26:39 - INFO - __main__ - Step 58556: {'lr': 0.0003404748553136351, 'samples': 11242752, 'steps': 58555, 'loss/train': 1.5107654333114624} 11/07/2021 05:26:40 - INFO - __main__ - Step 58557: {'lr': 0.00034046990825517765, 'samples': 11242944, 'steps': 58556, 'loss/train': 1.5175063610076904} 11/07/2021 05:26:40 - INFO - __main__ - Step 58558: {'lr': 0.0003404649611559559, 'samples': 11243136, 'steps': 58557, 'loss/train': 1.0964844226837158} 11/07/2021 05:26:41 - INFO - __main__ - Step 58559: {'lr': 0.0003404600140159719, 'samples': 11243328, 'steps': 58558, 'loss/train': 1.0572022199630737} 11/07/2021 05:26:41 - INFO - __main__ - Step 58560: {'lr': 0.0003404550668352282, 'samples': 11243520, 'steps': 58559, 'loss/train': 1.190142035484314} 11/07/2021 05:26:41 - INFO - __main__ - Step 58561: {'lr': 0.00034045011961372676, 'samples': 11243712, 'steps': 58560, 'loss/train': 1.0817368030548096} 11/07/2021 05:26:42 - INFO - __main__ - Step 58562: {'lr': 0.0003404451723514699, 'samples': 11243904, 'steps': 58561, 'loss/train': 1.7233643531799316} 11/07/2021 05:26:43 - INFO - __main__ - Step 58563: {'lr': 0.00034044022504845986, 'samples': 11244096, 'steps': 58562, 'loss/train': 1.3554329872131348} 11/07/2021 05:26:43 - INFO - __main__ - Step 58564: {'lr': 0.00034043527770469874, 'samples': 11244288, 'steps': 58563, 'loss/train': 1.447244644165039} 11/07/2021 05:26:43 - INFO - __main__ - Step 58565: {'lr': 0.00034043033032018897, 'samples': 11244480, 'steps': 58564, 'loss/train': 1.2000093460083008} 11/07/2021 05:26:44 - INFO - __main__ - Step 58566: {'lr': 0.00034042538289493266, 'samples': 11244672, 'steps': 58565, 'loss/train': 1.7884174585342407} 11/07/2021 05:26:45 - INFO - __main__ - Step 58567: {'lr': 0.00034042043542893214, 'samples': 11244864, 'steps': 58566, 'loss/train': 0.39903920888900757} 11/07/2021 05:26:45 - INFO - __main__ - Step 58568: {'lr': 0.0003404154879221895, 'samples': 11245056, 'steps': 58567, 'loss/train': 0.9793217778205872} 11/07/2021 05:26:46 - INFO - __main__ - Step 58569: {'lr': 0.00034041054037470703, 'samples': 11245248, 'steps': 58568, 'loss/train': 1.5822800397872925} 11/07/2021 05:26:46 - INFO - __main__ - Step 58570: {'lr': 0.00034040559278648695, 'samples': 11245440, 'steps': 58569, 'loss/train': 2.0370893478393555} 11/07/2021 05:26:46 - INFO - __main__ - Step 58571: {'lr': 0.00034040064515753154, 'samples': 11245632, 'steps': 58570, 'loss/train': 1.6098743677139282} 11/07/2021 05:26:47 - INFO - __main__ - Step 58572: {'lr': 0.000340395697487843, 'samples': 11245824, 'steps': 58571, 'loss/train': 1.3138717412948608} 11/07/2021 05:26:48 - INFO - __main__ - Step 58573: {'lr': 0.00034039074977742356, 'samples': 11246016, 'steps': 58572, 'loss/train': 0.9125752449035645} 11/07/2021 05:26:48 - INFO - __main__ - Step 58574: {'lr': 0.00034038580202627543, 'samples': 11246208, 'steps': 58573, 'loss/train': 1.0024455785751343} 11/07/2021 05:26:49 - INFO - __main__ - Step 58575: {'lr': 0.0003403808542344009, 'samples': 11246400, 'steps': 58574, 'loss/train': 1.3768975734710693} 11/07/2021 05:26:49 - INFO - __main__ - Step 58576: {'lr': 0.00034037590640180205, 'samples': 11246592, 'steps': 58575, 'loss/train': 1.6127654314041138} 11/07/2021 05:26:49 - INFO - __main__ - Step 58577: {'lr': 0.00034037095852848125, 'samples': 11246784, 'steps': 58576, 'loss/train': 0.5709400177001953} 11/07/2021 05:26:51 - INFO - __main__ - Step 58578: {'lr': 0.00034036601061444074, 'samples': 11246976, 'steps': 58577, 'loss/train': 1.6432132720947266} 11/07/2021 05:26:51 - INFO - __main__ - Step 58579: {'lr': 0.00034036106265968263, 'samples': 11247168, 'steps': 58578, 'loss/train': 1.5485974550247192} 11/07/2021 05:26:51 - INFO - __main__ - Step 58580: {'lr': 0.00034035611466420927, 'samples': 11247360, 'steps': 58579, 'loss/train': 2.261461019515991} 11/07/2021 05:26:52 - INFO - __main__ - Step 58581: {'lr': 0.00034035116662802287, 'samples': 11247552, 'steps': 58580, 'loss/train': 1.7287871837615967} 11/07/2021 05:26:52 - INFO - __main__ - Step 58582: {'lr': 0.0003403462185511256, 'samples': 11247744, 'steps': 58581, 'loss/train': 1.6491888761520386} 11/07/2021 05:26:53 - INFO - __main__ - Step 58583: {'lr': 0.0003403412704335196, 'samples': 11247936, 'steps': 58582, 'loss/train': 2.915558338165283} 11/07/2021 05:26:54 - INFO - __main__ - Step 58584: {'lr': 0.0003403363222752074, 'samples': 11248128, 'steps': 58583, 'loss/train': 1.3191951513290405} 11/07/2021 05:26:54 - INFO - __main__ - Step 58585: {'lr': 0.0003403313740761909, 'samples': 11248320, 'steps': 58584, 'loss/train': 1.1483147144317627} 11/07/2021 05:26:54 - INFO - __main__ - Step 58586: {'lr': 0.00034032642583647254, 'samples': 11248512, 'steps': 58585, 'loss/train': 0.8842759132385254} 11/07/2021 05:26:55 - INFO - __main__ - Step 58587: {'lr': 0.0003403214775560545, 'samples': 11248704, 'steps': 58586, 'loss/train': 1.4254627227783203} 11/07/2021 05:26:56 - INFO - __main__ - Step 58588: {'lr': 0.000340316529234939, 'samples': 11248896, 'steps': 58587, 'loss/train': 1.6152998208999634} 11/07/2021 05:26:56 - INFO - __main__ - Step 58589: {'lr': 0.00034031158087312823, 'samples': 11249088, 'steps': 58588, 'loss/train': 1.426893949508667} 11/07/2021 05:26:56 - INFO - __main__ - Step 58590: {'lr': 0.0003403066324706245, 'samples': 11249280, 'steps': 58589, 'loss/train': 1.4237428903579712} 11/07/2021 05:26:57 - INFO - __main__ - Step 58591: {'lr': 0.00034030168402742996, 'samples': 11249472, 'steps': 58590, 'loss/train': 1.5858216285705566} 11/07/2021 05:26:57 - INFO - __main__ - Step 58592: {'lr': 0.0003402967355435469, 'samples': 11249664, 'steps': 58591, 'loss/train': 2.0844335556030273} 11/07/2021 05:26:58 - INFO - __main__ - Step 58593: {'lr': 0.00034029178701897744, 'samples': 11249856, 'steps': 58592, 'loss/train': 1.3354320526123047} 11/07/2021 05:26:58 - INFO - __main__ - Step 58594: {'lr': 0.00034028683845372407, 'samples': 11250048, 'steps': 58593, 'loss/train': 1.568726897239685} 11/07/2021 05:26:59 - INFO - __main__ - Step 58595: {'lr': 0.00034028188984778867, 'samples': 11250240, 'steps': 58594, 'loss/train': 1.2948017120361328} 11/07/2021 05:26:59 - INFO - __main__ - Step 58596: {'lr': 0.0003402769412011737, 'samples': 11250432, 'steps': 58595, 'loss/train': 1.8752646446228027} 11/07/2021 05:26:59 - INFO - __main__ - Step 58597: {'lr': 0.00034027199251388137, 'samples': 11250624, 'steps': 58596, 'loss/train': 1.518560528755188} 11/07/2021 05:27:00 - INFO - __main__ - Step 58598: {'lr': 0.0003402670437859138, 'samples': 11250816, 'steps': 58597, 'loss/train': 1.5436047315597534} 11/07/2021 05:27:01 - INFO - __main__ - Step 58599: {'lr': 0.0003402620950172733, 'samples': 11251008, 'steps': 58598, 'loss/train': 1.5812362432479858} 11/07/2021 05:27:01 - INFO - __main__ - Step 58600: {'lr': 0.00034025714620796225, 'samples': 11251200, 'steps': 58599, 'loss/train': 1.1292755603790283} 11/07/2021 05:27:02 - INFO - __main__ - Step 58601: {'lr': 0.0003402521973579826, 'samples': 11251392, 'steps': 58600, 'loss/train': 1.4832994937896729} 11/07/2021 05:27:02 - INFO - __main__ - Step 58602: {'lr': 0.00034024724846733667, 'samples': 11251584, 'steps': 58601, 'loss/train': 1.6459180116653442} 11/07/2021 05:27:03 - INFO - __main__ - Step 58603: {'lr': 0.0003402422995360268, 'samples': 11251776, 'steps': 58602, 'loss/train': 1.2370795011520386} 11/07/2021 05:27:03 - INFO - __main__ - Step 58604: {'lr': 0.00034023735056405507, 'samples': 11251968, 'steps': 58603, 'loss/train': 1.4220401048660278} 11/07/2021 05:27:04 - INFO - __main__ - Step 58605: {'lr': 0.00034023240155142383, 'samples': 11252160, 'steps': 58604, 'loss/train': 1.5945957899093628} 11/07/2021 05:27:04 - INFO - __main__ - Step 58606: {'lr': 0.00034022745249813523, 'samples': 11252352, 'steps': 58605, 'loss/train': 1.4449801445007324} 11/07/2021 05:27:04 - INFO - __main__ - Step 58607: {'lr': 0.0003402225034041916, 'samples': 11252544, 'steps': 58606, 'loss/train': 1.0904531478881836} 11/07/2021 05:27:05 - INFO - __main__ - Step 58608: {'lr': 0.000340217554269595, 'samples': 11252736, 'steps': 58607, 'loss/train': 1.895560622215271} 11/07/2021 05:27:06 - INFO - __main__ - Step 58609: {'lr': 0.00034021260509434784, 'samples': 11252928, 'steps': 58608, 'loss/train': 1.7363203763961792} 11/07/2021 05:27:06 - INFO - __main__ - Step 58610: {'lr': 0.0003402076558784522, 'samples': 11253120, 'steps': 58609, 'loss/train': 0.9608591198921204} 11/07/2021 05:27:06 - INFO - __main__ - Step 58611: {'lr': 0.00034020270662191046, 'samples': 11253312, 'steps': 58610, 'loss/train': 1.9516892433166504} 11/07/2021 05:27:07 - INFO - __main__ - Step 58612: {'lr': 0.00034019775732472467, 'samples': 11253504, 'steps': 58611, 'loss/train': 1.3757413625717163} 11/07/2021 05:27:07 - INFO - __main__ - Step 58613: {'lr': 0.0003401928079868973, 'samples': 11253696, 'steps': 58612, 'loss/train': 1.1356873512268066} 11/07/2021 05:27:08 - INFO - __main__ - Step 58614: {'lr': 0.0003401878586084304, 'samples': 11253888, 'steps': 58613, 'loss/train': 1.5409858226776123} 11/07/2021 05:27:08 - INFO - __main__ - Step 58615: {'lr': 0.0003401829091893262, 'samples': 11254080, 'steps': 58614, 'loss/train': 1.800312876701355} 11/07/2021 05:27:09 - INFO - __main__ - Step 58616: {'lr': 0.000340177959729587, 'samples': 11254272, 'steps': 58615, 'loss/train': 1.058474063873291} 11/07/2021 05:27:09 - INFO - __main__ - Step 58617: {'lr': 0.000340173010229215, 'samples': 11254464, 'steps': 58616, 'loss/train': 1.5124789476394653} 11/07/2021 05:27:09 - INFO - __main__ - Step 58618: {'lr': 0.0003401680606882124, 'samples': 11254656, 'steps': 58617, 'loss/train': 1.286466360092163} 11/07/2021 05:27:11 - INFO - __main__ - Step 58619: {'lr': 0.0003401631111065815, 'samples': 11254848, 'steps': 58618, 'loss/train': 1.4471435546875} 11/07/2021 05:27:11 - INFO - __main__ - Step 58620: {'lr': 0.0003401581614843244, 'samples': 11255040, 'steps': 58619, 'loss/train': 1.771852731704712} 11/07/2021 05:27:11 - INFO - __main__ - Step 58621: {'lr': 0.00034015321182144357, 'samples': 11255232, 'steps': 58620, 'loss/train': 1.1483772993087769} 11/07/2021 05:27:12 - INFO - __main__ - Step 58622: {'lr': 0.00034014826211794104, 'samples': 11255424, 'steps': 58621, 'loss/train': 1.7596633434295654} 11/07/2021 05:27:12 - INFO - __main__ - Step 58623: {'lr': 0.0003401433123738191, 'samples': 11255616, 'steps': 58622, 'loss/train': 1.252971887588501} 11/07/2021 05:27:12 - INFO - __main__ - Step 58624: {'lr': 0.00034013836258907994, 'samples': 11255808, 'steps': 58623, 'loss/train': 1.7156128883361816} 11/07/2021 05:27:13 - INFO - __main__ - Step 58625: {'lr': 0.0003401334127637258, 'samples': 11256000, 'steps': 58624, 'loss/train': 1.6319975852966309} 11/07/2021 05:27:14 - INFO - __main__ - Step 58626: {'lr': 0.000340128462897759, 'samples': 11256192, 'steps': 58625, 'loss/train': 0.9937305450439453} 11/07/2021 05:27:14 - INFO - __main__ - Step 58627: {'lr': 0.0003401235129911817, 'samples': 11256384, 'steps': 58626, 'loss/train': 1.6375449895858765} 11/07/2021 05:27:14 - INFO - __main__ - Step 58628: {'lr': 0.0003401185630439961, 'samples': 11256576, 'steps': 58627, 'loss/train': 1.6889230012893677} 11/07/2021 05:27:15 - INFO - __main__ - Step 58629: {'lr': 0.0003401136130562045, 'samples': 11256768, 'steps': 58628, 'loss/train': 1.2075716257095337} 11/07/2021 05:27:16 - INFO - __main__ - Step 58630: {'lr': 0.0003401086630278091, 'samples': 11256960, 'steps': 58629, 'loss/train': 1.2659410238265991} 11/07/2021 05:27:16 - INFO - __main__ - Step 58631: {'lr': 0.00034010371295881207, 'samples': 11257152, 'steps': 58630, 'loss/train': 1.9190737009048462} 11/07/2021 05:27:16 - INFO - __main__ - Step 58632: {'lr': 0.00034009876284921576, 'samples': 11257344, 'steps': 58631, 'loss/train': 1.6319178342819214} 11/07/2021 05:27:17 - INFO - __main__ - Step 58633: {'lr': 0.00034009381269902236, 'samples': 11257536, 'steps': 58632, 'loss/train': 1.64049232006073} 11/07/2021 05:27:17 - INFO - __main__ - Step 58634: {'lr': 0.000340088862508234, 'samples': 11257728, 'steps': 58633, 'loss/train': 0.9824512004852295} 11/07/2021 05:27:18 - INFO - __main__ - Step 58635: {'lr': 0.00034008391227685305, 'samples': 11257920, 'steps': 58634, 'loss/train': 1.4872363805770874} 11/07/2021 05:27:18 - INFO - __main__ - Step 58636: {'lr': 0.00034007896200488163, 'samples': 11258112, 'steps': 58635, 'loss/train': 1.282004475593567} 11/07/2021 05:27:19 - INFO - __main__ - Step 58637: {'lr': 0.0003400740116923221, 'samples': 11258304, 'steps': 58636, 'loss/train': 1.1979315280914307} 11/07/2021 05:27:19 - INFO - __main__ - Step 58638: {'lr': 0.00034006906133917655, 'samples': 11258496, 'steps': 58637, 'loss/train': 1.8014849424362183} 11/07/2021 05:27:20 - INFO - __main__ - Step 58639: {'lr': 0.0003400641109454473, 'samples': 11258688, 'steps': 58638, 'loss/train': 1.707274317741394} 11/07/2021 05:27:21 - INFO - __main__ - Step 58640: {'lr': 0.0003400591605111364, 'samples': 11258880, 'steps': 58639, 'loss/train': 1.3305047750473022} 11/07/2021 05:27:21 - INFO - __main__ - Step 58641: {'lr': 0.0003400542100362464, 'samples': 11259072, 'steps': 58640, 'loss/train': 1.6776947975158691} 11/07/2021 05:27:21 - INFO - __main__ - Step 58642: {'lr': 0.0003400492595207793, 'samples': 11259264, 'steps': 58641, 'loss/train': 1.1673380136489868} 11/07/2021 05:27:22 - INFO - __main__ - Step 58643: {'lr': 0.00034004430896473743, 'samples': 11259456, 'steps': 58642, 'loss/train': 1.5140269994735718} 11/07/2021 05:27:22 - INFO - __main__ - Step 58644: {'lr': 0.000340039358368123, 'samples': 11259648, 'steps': 58643, 'loss/train': 1.3781821727752686} 11/07/2021 05:27:22 - INFO - __main__ - Step 58645: {'lr': 0.00034003440773093817, 'samples': 11259840, 'steps': 58644, 'loss/train': 1.7038558721542358} 11/07/2021 05:27:24 - INFO - __main__ - Step 58646: {'lr': 0.0003400294570531852, 'samples': 11260032, 'steps': 58645, 'loss/train': 0.6562949419021606} 11/07/2021 05:27:24 - INFO - __main__ - Step 58647: {'lr': 0.0003400245063348664, 'samples': 11260224, 'steps': 58646, 'loss/train': 1.6678837537765503} 11/07/2021 05:27:24 - INFO - __main__ - Step 58648: {'lr': 0.000340019555575984, 'samples': 11260416, 'steps': 58647, 'loss/train': 1.872270941734314} 11/07/2021 05:27:25 - INFO - __main__ - Step 58649: {'lr': 0.00034001460477654013, 'samples': 11260608, 'steps': 58648, 'loss/train': 1.8131369352340698} 11/07/2021 05:27:25 - INFO - __main__ - Step 58650: {'lr': 0.00034000965393653703, 'samples': 11260800, 'steps': 58649, 'loss/train': 1.475797414779663} 11/07/2021 05:27:26 - INFO - __main__ - Step 58651: {'lr': 0.00034000470305597697, 'samples': 11260992, 'steps': 58650, 'loss/train': 1.2250440120697021} 11/07/2021 05:27:26 - INFO - __main__ - Step 58652: {'lr': 0.0003399997521348622, 'samples': 11261184, 'steps': 58651, 'loss/train': 1.5154170989990234} 11/07/2021 05:27:27 - INFO - __main__ - Step 58653: {'lr': 0.00033999480117319494, 'samples': 11261376, 'steps': 58652, 'loss/train': 1.315185308456421} 11/07/2021 05:27:27 - INFO - __main__ - Step 58654: {'lr': 0.0003399898501709774, 'samples': 11261568, 'steps': 58653, 'loss/train': 1.6343590021133423} 11/07/2021 05:27:27 - INFO - __main__ - Step 58655: {'lr': 0.00033998489912821187, 'samples': 11261760, 'steps': 58654, 'loss/train': 0.7975795865058899} 11/07/2021 05:27:28 - INFO - __main__ - Step 58656: {'lr': 0.00033997994804490047, 'samples': 11261952, 'steps': 58655, 'loss/train': 1.5882381200790405} 11/07/2021 05:27:29 - INFO - __main__ - Step 58657: {'lr': 0.0003399749969210455, 'samples': 11262144, 'steps': 58656, 'loss/train': 1.467847466468811} 11/07/2021 05:27:29 - INFO - __main__ - Step 58658: {'lr': 0.0003399700457566492, 'samples': 11262336, 'steps': 58657, 'loss/train': 1.6790921688079834} 11/07/2021 05:27:30 - INFO - __main__ - Step 58659: {'lr': 0.00033996509455171375, 'samples': 11262528, 'steps': 58658, 'loss/train': 1.463994026184082} 11/07/2021 05:27:30 - INFO - __main__ - Step 58660: {'lr': 0.0003399601433062415, 'samples': 11262720, 'steps': 58659, 'loss/train': 1.6266387701034546} 11/07/2021 05:27:30 - INFO - __main__ - Step 58661: {'lr': 0.00033995519202023453, 'samples': 11262912, 'steps': 58660, 'loss/train': 1.5797277688980103} 11/07/2021 05:27:31 - INFO - __main__ - Step 58662: {'lr': 0.00033995024069369517, 'samples': 11263104, 'steps': 58661, 'loss/train': 1.788406491279602} 11/07/2021 05:27:32 - INFO - __main__ - Step 58663: {'lr': 0.0003399452893266256, 'samples': 11263296, 'steps': 58662, 'loss/train': 1.2432971000671387} 11/07/2021 05:27:32 - INFO - __main__ - Step 58664: {'lr': 0.000339940337919028, 'samples': 11263488, 'steps': 58663, 'loss/train': 1.4610928297042847} 11/07/2021 05:27:32 - INFO - __main__ - Step 58665: {'lr': 0.0003399353864709048, 'samples': 11263680, 'steps': 58664, 'loss/train': 1.2338680028915405} 11/07/2021 05:27:33 - INFO - __main__ - Step 58666: {'lr': 0.000339930434982258, 'samples': 11263872, 'steps': 58665, 'loss/train': 1.294327735900879} 11/07/2021 05:27:34 - INFO - __main__ - Step 58667: {'lr': 0.00033992548345309, 'samples': 11264064, 'steps': 58666, 'loss/train': 1.6519348621368408} 11/07/2021 05:27:34 - INFO - __main__ - Step 58668: {'lr': 0.000339920531883403, 'samples': 11264256, 'steps': 58667, 'loss/train': 1.3834352493286133} 11/07/2021 05:27:34 - INFO - __main__ - Step 58669: {'lr': 0.0003399155802731991, 'samples': 11264448, 'steps': 58668, 'loss/train': 1.2895777225494385} 11/07/2021 05:27:35 - INFO - __main__ - Step 58670: {'lr': 0.0003399106286224807, 'samples': 11264640, 'steps': 58669, 'loss/train': 1.0402488708496094} 11/07/2021 05:27:35 - INFO - __main__ - Step 58671: {'lr': 0.0003399056769312499, 'samples': 11264832, 'steps': 58670, 'loss/train': 1.2308584451675415} 11/07/2021 05:27:36 - INFO - __main__ - Step 58672: {'lr': 0.000339900725199509, 'samples': 11265024, 'steps': 58671, 'loss/train': 1.3313902616500854} 11/07/2021 05:27:37 - INFO - __main__ - Step 58673: {'lr': 0.0003398957734272602, 'samples': 11265216, 'steps': 58672, 'loss/train': 1.8147152662277222} 11/07/2021 05:27:37 - INFO - __main__ - Step 58674: {'lr': 0.00033989082161450584, 'samples': 11265408, 'steps': 58673, 'loss/train': 1.1897612810134888} 11/07/2021 05:27:37 - INFO - __main__ - Step 58675: {'lr': 0.000339885869761248, 'samples': 11265600, 'steps': 58674, 'loss/train': 3.0839974880218506} 11/07/2021 05:27:38 - INFO - __main__ - Step 58676: {'lr': 0.000339880917867489, 'samples': 11265792, 'steps': 58675, 'loss/train': 1.751492977142334} 11/07/2021 05:27:39 - INFO - __main__ - Step 58677: {'lr': 0.00033987596593323103, 'samples': 11265984, 'steps': 58676, 'loss/train': 1.605707049369812} 11/07/2021 05:27:39 - INFO - __main__ - Step 58678: {'lr': 0.00033987101395847636, 'samples': 11266176, 'steps': 58677, 'loss/train': 1.7508043050765991} 11/07/2021 05:27:40 - INFO - __main__ - Step 58679: {'lr': 0.00033986606194322716, 'samples': 11266368, 'steps': 58678, 'loss/train': 1.5047215223312378} 11/07/2021 05:27:40 - INFO - __main__ - Step 58680: {'lr': 0.00033986110988748567, 'samples': 11266560, 'steps': 58679, 'loss/train': 1.4525678157806396} 11/07/2021 05:27:40 - INFO - __main__ - Step 58681: {'lr': 0.00033985615779125427, 'samples': 11266752, 'steps': 58680, 'loss/train': 1.2383021116256714} 11/07/2021 05:27:41 - INFO - __main__ - Step 58682: {'lr': 0.00033985120565453497, 'samples': 11266944, 'steps': 58681, 'loss/train': 1.4485909938812256} 11/07/2021 05:27:42 - INFO - __main__ - Step 58683: {'lr': 0.00033984625347733015, 'samples': 11267136, 'steps': 58682, 'loss/train': 1.7783548831939697} 11/07/2021 05:27:42 - INFO - __main__ - Step 58684: {'lr': 0.000339841301259642, 'samples': 11267328, 'steps': 58683, 'loss/train': 1.653826117515564} 11/07/2021 05:27:42 - INFO - __main__ - Step 58685: {'lr': 0.0003398363490014727, 'samples': 11267520, 'steps': 58684, 'loss/train': 1.6136869192123413} 11/07/2021 05:27:43 - INFO - __main__ - Step 58686: {'lr': 0.0003398313967028245, 'samples': 11267712, 'steps': 58685, 'loss/train': 1.4425431489944458} 11/07/2021 05:27:43 - INFO - __main__ - Step 58687: {'lr': 0.00033982644436369975, 'samples': 11267904, 'steps': 58686, 'loss/train': 1.4149539470672607} 11/07/2021 05:27:44 - INFO - __main__ - Step 58688: {'lr': 0.00033982149198410057, 'samples': 11268096, 'steps': 58687, 'loss/train': 1.8159171342849731} 11/07/2021 05:27:44 - INFO - __main__ - Step 58689: {'lr': 0.0003398165395640292, 'samples': 11268288, 'steps': 58688, 'loss/train': 1.5644479990005493} 11/07/2021 05:27:45 - INFO - __main__ - Step 58690: {'lr': 0.00033981158710348787, 'samples': 11268480, 'steps': 58689, 'loss/train': 1.5640636682510376} 11/07/2021 05:27:45 - INFO - __main__ - Step 58691: {'lr': 0.0003398066346024788, 'samples': 11268672, 'steps': 58690, 'loss/train': 1.2179890871047974} 11/07/2021 05:27:45 - INFO - __main__ - Step 58692: {'lr': 0.0003398016820610043, 'samples': 11268864, 'steps': 58691, 'loss/train': 1.3885226249694824} 11/07/2021 05:27:46 - INFO - __main__ - Step 58693: {'lr': 0.00033979672947906646, 'samples': 11269056, 'steps': 58692, 'loss/train': 1.5098356008529663} 11/07/2021 05:27:47 - INFO - __main__ - Step 58694: {'lr': 0.0003397917768566677, 'samples': 11269248, 'steps': 58693, 'loss/train': 1.6539536714553833} 11/07/2021 05:27:47 - INFO - __main__ - Step 58695: {'lr': 0.0003397868241938101, 'samples': 11269440, 'steps': 58694, 'loss/train': 1.2934186458587646} 11/07/2021 05:27:47 - INFO - __main__ - Step 58696: {'lr': 0.00033978187149049597, 'samples': 11269632, 'steps': 58695, 'loss/train': 1.6360303163528442} 11/07/2021 05:27:48 - INFO - __main__ - Step 58697: {'lr': 0.0003397769187467275, 'samples': 11269824, 'steps': 58696, 'loss/train': 1.910764455795288} 11/07/2021 05:27:49 - INFO - __main__ - Step 58698: {'lr': 0.0003397719659625069, 'samples': 11270016, 'steps': 58697, 'loss/train': 1.394666314125061} 11/07/2021 05:27:49 - INFO - __main__ - Step 58699: {'lr': 0.0003397670131378365, 'samples': 11270208, 'steps': 58698, 'loss/train': 1.1060389280319214} 11/07/2021 05:27:50 - INFO - __main__ - Step 58700: {'lr': 0.0003397620602727184, 'samples': 11270400, 'steps': 58699, 'loss/train': 1.065557837486267} 11/07/2021 05:27:50 - INFO - __main__ - Step 58701: {'lr': 0.00033975710736715504, 'samples': 11270592, 'steps': 58700, 'loss/train': 2.4465560913085938} 11/07/2021 05:27:50 - INFO - __main__ - Step 58702: {'lr': 0.00033975215442114836, 'samples': 11270784, 'steps': 58701, 'loss/train': 1.1995782852172852} 11/07/2021 05:27:51 - INFO - __main__ - Step 58703: {'lr': 0.00033974720143470084, 'samples': 11270976, 'steps': 58702, 'loss/train': 1.7081576585769653} 11/07/2021 05:27:52 - INFO - __main__ - Step 58704: {'lr': 0.00033974224840781453, 'samples': 11271168, 'steps': 58703, 'loss/train': 1.3235079050064087} 11/07/2021 05:27:52 - INFO - __main__ - Step 58705: {'lr': 0.0003397372953404918, 'samples': 11271360, 'steps': 58704, 'loss/train': 1.443793535232544} 11/07/2021 05:27:52 - INFO - __main__ - Step 58706: {'lr': 0.0003397323422327348, 'samples': 11271552, 'steps': 58705, 'loss/train': 0.9678148031234741} 11/07/2021 05:27:53 - INFO - __main__ - Step 58707: {'lr': 0.0003397273890845458, 'samples': 11271744, 'steps': 58706, 'loss/train': 1.5049041509628296} 11/07/2021 05:27:54 - INFO - __main__ - Step 58708: {'lr': 0.0003397224358959271, 'samples': 11271936, 'steps': 58707, 'loss/train': 1.630176305770874} 11/07/2021 05:27:54 - INFO - __main__ - Step 58709: {'lr': 0.0003397174826668808, 'samples': 11272128, 'steps': 58708, 'loss/train': 1.4808259010314941} 11/07/2021 05:27:54 - INFO - __main__ - Step 58710: {'lr': 0.00033971252939740915, 'samples': 11272320, 'steps': 58709, 'loss/train': 1.1939408779144287} 11/07/2021 05:27:55 - INFO - __main__ - Step 58711: {'lr': 0.00033970757608751446, 'samples': 11272512, 'steps': 58710, 'loss/train': 1.1783385276794434} 11/07/2021 05:27:55 - INFO - __main__ - Step 58712: {'lr': 0.0003397026227371989, 'samples': 11272704, 'steps': 58711, 'loss/train': 1.3982597589492798} 11/07/2021 05:27:56 - INFO - __main__ - Step 58713: {'lr': 0.0003396976693464647, 'samples': 11272896, 'steps': 58712, 'loss/train': 1.225518822669983} 11/07/2021 05:27:57 - INFO - __main__ - Step 58714: {'lr': 0.0003396927159153141, 'samples': 11273088, 'steps': 58713, 'loss/train': 2.111703872680664} 11/07/2021 05:27:57 - INFO - __main__ - Step 58715: {'lr': 0.0003396877624437495, 'samples': 11273280, 'steps': 58714, 'loss/train': 1.869307041168213} 11/07/2021 05:27:57 - INFO - __main__ - Step 58716: {'lr': 0.0003396828089317728, 'samples': 11273472, 'steps': 58715, 'loss/train': 1.187577247619629} 11/07/2021 05:27:58 - INFO - __main__ - Step 58717: {'lr': 0.0003396778553793865, 'samples': 11273664, 'steps': 58716, 'loss/train': 1.9779194593429565} 11/07/2021 05:27:58 - INFO - __main__ - Step 58718: {'lr': 0.00033967290178659273, 'samples': 11273856, 'steps': 58717, 'loss/train': 1.1186343431472778} 11/07/2021 05:27:59 - INFO - __main__ - Step 58719: {'lr': 0.0003396679481533937, 'samples': 11274048, 'steps': 58718, 'loss/train': 1.6696780920028687} 11/07/2021 05:27:59 - INFO - __main__ - Step 58720: {'lr': 0.0003396629944797917, 'samples': 11274240, 'steps': 58719, 'loss/train': 1.3711998462677002} 11/07/2021 05:28:00 - INFO - __main__ - Step 58721: {'lr': 0.0003396580407657889, 'samples': 11274432, 'steps': 58720, 'loss/train': 1.3863716125488281} 11/07/2021 05:28:00 - INFO - __main__ - Step 58722: {'lr': 0.0003396530870113877, 'samples': 11274624, 'steps': 58721, 'loss/train': 1.036597490310669} 11/07/2021 05:28:00 - INFO - __main__ - Step 58723: {'lr': 0.0003396481332165901, 'samples': 11274816, 'steps': 58722, 'loss/train': 1.3757930994033813} 11/07/2021 05:28:01 - INFO - __main__ - Step 58724: {'lr': 0.00033964317938139845, 'samples': 11275008, 'steps': 58723, 'loss/train': 1.1915156841278076} 11/07/2021 05:28:02 - INFO - __main__ - Step 58725: {'lr': 0.00033963822550581494, 'samples': 11275200, 'steps': 58724, 'loss/train': 1.6021989583969116} 11/07/2021 05:28:02 - INFO - __main__ - Step 58726: {'lr': 0.0003396332715898418, 'samples': 11275392, 'steps': 58725, 'loss/train': 1.5126571655273438} 11/07/2021 05:28:02 - INFO - __main__ - Step 58727: {'lr': 0.00033962831763348133, 'samples': 11275584, 'steps': 58726, 'loss/train': 1.6027170419692993} 11/07/2021 05:28:03 - INFO - __main__ - Step 58728: {'lr': 0.00033962336363673585, 'samples': 11275776, 'steps': 58727, 'loss/train': 1.2824872732162476} 11/07/2021 05:28:04 - INFO - __main__ - Step 58729: {'lr': 0.00033961840959960735, 'samples': 11275968, 'steps': 58728, 'loss/train': 1.62641441822052} 11/07/2021 05:28:04 - INFO - __main__ - Step 58730: {'lr': 0.0003396134555220982, 'samples': 11276160, 'steps': 58729, 'loss/train': 1.0455607175827026} 11/07/2021 05:28:05 - INFO - __main__ - Step 58731: {'lr': 0.0003396085014042105, 'samples': 11276352, 'steps': 58730, 'loss/train': 2.2827987670898438} 11/07/2021 05:28:05 - INFO - __main__ - Step 58732: {'lr': 0.00033960354724594665, 'samples': 11276544, 'steps': 58731, 'loss/train': 1.1796807050704956} 11/07/2021 05:28:05 - INFO - __main__ - Step 58733: {'lr': 0.0003395985930473089, 'samples': 11276736, 'steps': 58732, 'loss/train': 1.3315881490707397} 11/07/2021 05:28:06 - INFO - __main__ - Step 58734: {'lr': 0.00033959363880829935, 'samples': 11276928, 'steps': 58733, 'loss/train': 0.8745399713516235} 11/07/2021 05:28:07 - INFO - __main__ - Step 58735: {'lr': 0.00033958868452892035, 'samples': 11277120, 'steps': 58734, 'loss/train': 1.1936465501785278} 11/07/2021 05:28:07 - INFO - __main__ - Step 58736: {'lr': 0.000339583730209174, 'samples': 11277312, 'steps': 58735, 'loss/train': 1.7472937107086182} 11/07/2021 05:28:07 - INFO - __main__ - Step 58737: {'lr': 0.0003395787758490626, 'samples': 11277504, 'steps': 58736, 'loss/train': 1.8023500442504883} 11/07/2021 05:28:08 - INFO - __main__ - Step 58738: {'lr': 0.0003395738214485884, 'samples': 11277696, 'steps': 58737, 'loss/train': 1.734580397605896} 11/07/2021 05:28:08 - INFO - __main__ - Step 58739: {'lr': 0.0003395688670077536, 'samples': 11277888, 'steps': 58738, 'loss/train': 1.074816107749939} 11/07/2021 05:28:09 - INFO - __main__ - Step 58740: {'lr': 0.0003395639125265605, 'samples': 11278080, 'steps': 58739, 'loss/train': 1.2014753818511963} 11/07/2021 05:28:10 - INFO - __main__ - Step 58741: {'lr': 0.00033955895800501126, 'samples': 11278272, 'steps': 58740, 'loss/train': 1.6427022218704224} 11/07/2021 05:28:10 - INFO - __main__ - Step 58742: {'lr': 0.0003395540034431082, 'samples': 11278464, 'steps': 58741, 'loss/train': 1.4095505475997925} 11/07/2021 05:28:10 - INFO - __main__ - Step 58743: {'lr': 0.0003395490488408534, 'samples': 11278656, 'steps': 58742, 'loss/train': 1.2669202089309692} 11/07/2021 05:28:11 - INFO - __main__ - Step 58744: {'lr': 0.00033954409419824924, 'samples': 11278848, 'steps': 58743, 'loss/train': 1.5797938108444214} 11/07/2021 05:28:12 - INFO - __main__ - Step 58745: {'lr': 0.0003395391395152978, 'samples': 11279040, 'steps': 58744, 'loss/train': 1.3473795652389526} 11/07/2021 05:28:12 - INFO - __main__ - Step 58746: {'lr': 0.0003395341847920015, 'samples': 11279232, 'steps': 58745, 'loss/train': 1.8958699703216553} 11/07/2021 05:28:12 - INFO - __main__ - Step 58747: {'lr': 0.00033952923002836244, 'samples': 11279424, 'steps': 58746, 'loss/train': 1.3172633647918701} 11/07/2021 05:28:13 - INFO - __main__ - Step 58748: {'lr': 0.0003395242752243829, 'samples': 11279616, 'steps': 58747, 'loss/train': 1.4201595783233643} 11/07/2021 05:28:13 - INFO - __main__ - Step 58749: {'lr': 0.00033951932038006513, 'samples': 11279808, 'steps': 58748, 'loss/train': 1.725259780883789} 11/07/2021 05:28:14 - INFO - __main__ - Step 58750: {'lr': 0.00033951436549541124, 'samples': 11280000, 'steps': 58749, 'loss/train': 1.2477748394012451} 11/07/2021 05:28:15 - INFO - __main__ - Step 58751: {'lr': 0.0003395094105704236, 'samples': 11280192, 'steps': 58750, 'loss/train': 2.0528318881988525} 11/07/2021 05:28:15 - INFO - __main__ - Step 58752: {'lr': 0.00033950445560510445, 'samples': 11280384, 'steps': 58751, 'loss/train': 1.4452126026153564} 11/07/2021 05:28:15 - INFO - __main__ - Step 58753: {'lr': 0.00033949950059945593, 'samples': 11280576, 'steps': 58752, 'loss/train': 1.1500120162963867} 11/07/2021 05:28:16 - INFO - __main__ - Step 58754: {'lr': 0.00033949454555348035, 'samples': 11280768, 'steps': 58753, 'loss/train': 1.5976330041885376} 11/07/2021 05:28:17 - INFO - __main__ - Step 58755: {'lr': 0.0003394895904671799, 'samples': 11280960, 'steps': 58754, 'loss/train': 1.6130250692367554} 11/07/2021 05:28:17 - INFO - __main__ - Step 58756: {'lr': 0.00033948463534055683, 'samples': 11281152, 'steps': 58755, 'loss/train': 1.5885192155838013} 11/07/2021 05:28:17 - INFO - __main__ - Step 58757: {'lr': 0.0003394796801736133, 'samples': 11281344, 'steps': 58756, 'loss/train': 4.94766902923584} 11/07/2021 05:28:18 - INFO - __main__ - Step 58758: {'lr': 0.0003394747249663517, 'samples': 11281536, 'steps': 58757, 'loss/train': 1.3459274768829346} 11/07/2021 05:28:18 - INFO - __main__ - Step 58759: {'lr': 0.0003394697697187741, 'samples': 11281728, 'steps': 58758, 'loss/train': 1.4971816539764404} 11/07/2021 05:28:19 - INFO - __main__ - Step 58760: {'lr': 0.00033946481443088286, 'samples': 11281920, 'steps': 58759, 'loss/train': 1.644182801246643} 11/07/2021 05:28:19 - INFO - __main__ - Step 58761: {'lr': 0.00033945985910268007, 'samples': 11282112, 'steps': 58760, 'loss/train': 1.4899896383285522} 11/07/2021 05:28:20 - INFO - __main__ - Step 58762: {'lr': 0.0003394549037341681, 'samples': 11282304, 'steps': 58761, 'loss/train': 1.335142970085144} 11/07/2021 05:28:20 - INFO - __main__ - Step 58763: {'lr': 0.00033944994832534915, 'samples': 11282496, 'steps': 58762, 'loss/train': 0.785372257232666} 11/07/2021 05:28:21 - INFO - __main__ - Step 58764: {'lr': 0.0003394449928762254, 'samples': 11282688, 'steps': 58763, 'loss/train': 1.4672189950942993} 11/07/2021 05:28:21 - INFO - __main__ - Step 58765: {'lr': 0.0003394400373867991, 'samples': 11282880, 'steps': 58764, 'loss/train': 1.681606411933899} 11/07/2021 05:28:22 - INFO - __main__ - Step 58766: {'lr': 0.00033943508185707257, 'samples': 11283072, 'steps': 58765, 'loss/train': 0.9648610949516296} 11/07/2021 05:28:22 - INFO - __main__ - Step 58767: {'lr': 0.0003394301262870479, 'samples': 11283264, 'steps': 58766, 'loss/train': 1.4518312215805054} 11/07/2021 05:28:23 - INFO - __main__ - Step 58768: {'lr': 0.00033942517067672744, 'samples': 11283456, 'steps': 58767, 'loss/train': 1.7672967910766602} 11/07/2021 05:28:23 - INFO - __main__ - Step 58769: {'lr': 0.00033942021502611334, 'samples': 11283648, 'steps': 58768, 'loss/train': 1.2715245485305786} 11/07/2021 05:28:23 - INFO - __main__ - Step 58770: {'lr': 0.0003394152593352079, 'samples': 11283840, 'steps': 58769, 'loss/train': 1.7486598491668701} 11/07/2021 05:28:25 - INFO - __main__ - Step 58771: {'lr': 0.0003394103036040133, 'samples': 11284032, 'steps': 58770, 'loss/train': 1.5962742567062378} 11/07/2021 05:28:25 - INFO - __main__ - Step 58772: {'lr': 0.00033940534783253185, 'samples': 11284224, 'steps': 58771, 'loss/train': 1.1509003639221191} 11/07/2021 05:28:25 - INFO - __main__ - Step 58773: {'lr': 0.00033940039202076574, 'samples': 11284416, 'steps': 58772, 'loss/train': 1.5305585861206055} 11/07/2021 05:28:26 - INFO - __main__ - Step 58774: {'lr': 0.0003393954361687172, 'samples': 11284608, 'steps': 58773, 'loss/train': 1.8478020429611206} 11/07/2021 05:28:26 - INFO - __main__ - Step 58775: {'lr': 0.0003393904802763883, 'samples': 11284800, 'steps': 58774, 'loss/train': 1.253279685974121} 11/07/2021 05:28:27 - INFO - __main__ - Step 58776: {'lr': 0.00033938552434378155, 'samples': 11284992, 'steps': 58775, 'loss/train': 1.685002088546753} 11/07/2021 05:28:27 - INFO - __main__ - Step 58777: {'lr': 0.00033938056837089903, 'samples': 11285184, 'steps': 58776, 'loss/train': 0.8082150816917419} 11/07/2021 05:28:28 - INFO - __main__ - Step 58778: {'lr': 0.00033937561235774307, 'samples': 11285376, 'steps': 58777, 'loss/train': 1.5093408823013306} 11/07/2021 05:28:28 - INFO - __main__ - Step 58779: {'lr': 0.00033937065630431577, 'samples': 11285568, 'steps': 58778, 'loss/train': 1.5160646438598633} 11/07/2021 05:28:28 - INFO - __main__ - Step 58780: {'lr': 0.00033936570021061947, 'samples': 11285760, 'steps': 58779, 'loss/train': 1.2714723348617554} 11/07/2021 05:28:29 - INFO - __main__ - Step 58781: {'lr': 0.0003393607440766563, 'samples': 11285952, 'steps': 58780, 'loss/train': 1.4022120237350464} 11/07/2021 05:28:30 - INFO - __main__ - Step 58782: {'lr': 0.0003393557879024286, 'samples': 11286144, 'steps': 58781, 'loss/train': 1.0381698608398438} 11/07/2021 05:28:30 - INFO - __main__ - Step 58783: {'lr': 0.00033935083168793855, 'samples': 11286336, 'steps': 58782, 'loss/train': 1.210511326789856} 11/07/2021 05:28:30 - INFO - __main__ - Step 58784: {'lr': 0.00033934587543318846, 'samples': 11286528, 'steps': 58783, 'loss/train': 1.061765432357788} 11/07/2021 05:28:31 - INFO - __main__ - Step 58785: {'lr': 0.00033934091913818043, 'samples': 11286720, 'steps': 58784, 'loss/train': 1.9269541501998901} 11/07/2021 05:28:32 - INFO - __main__ - Step 58786: {'lr': 0.0003393359628029168, 'samples': 11286912, 'steps': 58785, 'loss/train': 1.7752442359924316} 11/07/2021 05:28:32 - INFO - __main__ - Step 58787: {'lr': 0.0003393310064273997, 'samples': 11287104, 'steps': 58786, 'loss/train': 1.9661850929260254} 11/07/2021 05:28:32 - INFO - __main__ - Step 58788: {'lr': 0.0003393260500116315, 'samples': 11287296, 'steps': 58787, 'loss/train': 1.5297590494155884} 11/07/2021 05:28:33 - INFO - __main__ - Step 58789: {'lr': 0.0003393210935556143, 'samples': 11287488, 'steps': 58788, 'loss/train': 1.4794427156448364} 11/07/2021 05:28:33 - INFO - __main__ - Step 58790: {'lr': 0.00033931613705935046, 'samples': 11287680, 'steps': 58789, 'loss/train': 1.4250047206878662} 11/07/2021 05:28:34 - INFO - __main__ - Step 58791: {'lr': 0.000339311180522842, 'samples': 11287872, 'steps': 58790, 'loss/train': 1.4402660131454468} 11/07/2021 05:28:34 - INFO - __main__ - Step 58792: {'lr': 0.00033930622394609143, 'samples': 11288064, 'steps': 58791, 'loss/train': 1.288271188735962} 11/07/2021 05:28:35 - INFO - __main__ - Step 58793: {'lr': 0.00033930126732910083, 'samples': 11288256, 'steps': 58792, 'loss/train': 1.3805677890777588} 11/07/2021 05:28:35 - INFO - __main__ - Step 58794: {'lr': 0.0003392963106718725, 'samples': 11288448, 'steps': 58793, 'loss/train': 1.5357708930969238} 11/07/2021 05:28:35 - INFO - __main__ - Step 58795: {'lr': 0.00033929135397440857, 'samples': 11288640, 'steps': 58794, 'loss/train': 0.8117380142211914} 11/07/2021 05:28:36 - INFO - __main__ - Step 58796: {'lr': 0.0003392863972367114, 'samples': 11288832, 'steps': 58795, 'loss/train': 1.312749981880188} 11/07/2021 05:28:37 - INFO - __main__ - Step 58797: {'lr': 0.0003392814404587831, 'samples': 11289024, 'steps': 58796, 'loss/train': 1.5030581951141357} 11/07/2021 05:28:37 - INFO - __main__ - Step 58798: {'lr': 0.00033927648364062593, 'samples': 11289216, 'steps': 58797, 'loss/train': 1.2318079471588135} 11/07/2021 05:28:37 - INFO - __main__ - Step 58799: {'lr': 0.00033927152678224216, 'samples': 11289408, 'steps': 58798, 'loss/train': 1.2372905015945435} 11/07/2021 05:28:38 - INFO - __main__ - Step 58800: {'lr': 0.00033926656988363406, 'samples': 11289600, 'steps': 58799, 'loss/train': 0.903574526309967} 11/07/2021 05:28:39 - INFO - __main__ - Step 58801: {'lr': 0.00033926161294480384, 'samples': 11289792, 'steps': 58800, 'loss/train': 1.9908605813980103} 11/07/2021 05:28:39 - INFO - __main__ - Step 58802: {'lr': 0.00033925665596575374, 'samples': 11289984, 'steps': 58801, 'loss/train': 1.9544095993041992} 11/07/2021 05:28:40 - INFO - __main__ - Step 58803: {'lr': 0.00033925169894648586, 'samples': 11290176, 'steps': 58802, 'loss/train': 1.858659029006958} 11/07/2021 05:28:40 - INFO - __main__ - Step 58804: {'lr': 0.0003392467418870026, 'samples': 11290368, 'steps': 58803, 'loss/train': 1.8051953315734863} 11/07/2021 05:28:40 - INFO - __main__ - Step 58805: {'lr': 0.0003392417847873061, 'samples': 11290560, 'steps': 58804, 'loss/train': 1.6961629390716553} 11/07/2021 05:28:41 - INFO - __main__ - Step 58806: {'lr': 0.00033923682764739867, 'samples': 11290752, 'steps': 58805, 'loss/train': 1.0150158405303955} 11/07/2021 05:28:42 - INFO - __main__ - Step 58807: {'lr': 0.0003392318704672825, 'samples': 11290944, 'steps': 58806, 'loss/train': 1.145486831665039} 11/07/2021 05:28:42 - INFO - __main__ - Step 58808: {'lr': 0.00033922691324695975, 'samples': 11291136, 'steps': 58807, 'loss/train': 1.3180099725723267} 11/07/2021 05:28:42 - INFO - __main__ - Step 58809: {'lr': 0.00033922195598643293, 'samples': 11291328, 'steps': 58808, 'loss/train': 1.2919305562973022} 11/07/2021 05:28:43 - INFO - __main__ - Step 58810: {'lr': 0.0003392169986857039, 'samples': 11291520, 'steps': 58809, 'loss/train': 1.4855728149414062} 11/07/2021 05:28:43 - INFO - __main__ - Step 58811: {'lr': 0.0003392120413447751, 'samples': 11291712, 'steps': 58810, 'loss/train': 1.737269401550293} 11/07/2021 05:28:44 - INFO - __main__ - Step 58812: {'lr': 0.0003392070839636487, 'samples': 11291904, 'steps': 58811, 'loss/train': 1.4681472778320312} 11/07/2021 05:28:44 - INFO - __main__ - Step 58813: {'lr': 0.000339202126542327, 'samples': 11292096, 'steps': 58812, 'loss/train': 1.1999744176864624} 11/07/2021 05:28:45 - INFO - __main__ - Step 58814: {'lr': 0.00033919716908081224, 'samples': 11292288, 'steps': 58813, 'loss/train': 1.0629862546920776} 11/07/2021 05:28:45 - INFO - __main__ - Step 58815: {'lr': 0.0003391922115791065, 'samples': 11292480, 'steps': 58814, 'loss/train': 1.4069360494613647} 11/07/2021 05:28:46 - INFO - __main__ - Step 58816: {'lr': 0.0003391872540372123, 'samples': 11292672, 'steps': 58815, 'loss/train': 1.1592175960540771} 11/07/2021 05:28:47 - INFO - __main__ - Step 58817: {'lr': 0.00033918229645513154, 'samples': 11292864, 'steps': 58816, 'loss/train': 1.3755083084106445} 11/07/2021 05:28:47 - INFO - __main__ - Step 58818: {'lr': 0.0003391773388328667, 'samples': 11293056, 'steps': 58817, 'loss/train': 1.0568621158599854} 11/07/2021 05:28:47 - INFO - __main__ - Step 58819: {'lr': 0.0003391723811704199, 'samples': 11293248, 'steps': 58818, 'loss/train': 2.00492000579834} 11/07/2021 05:28:48 - INFO - __main__ - Step 58820: {'lr': 0.0003391674234677934, 'samples': 11293440, 'steps': 58819, 'loss/train': 1.5136122703552246} 11/07/2021 05:28:48 - INFO - __main__ - Step 58821: {'lr': 0.0003391624657249894, 'samples': 11293632, 'steps': 58820, 'loss/train': 1.2482664585113525} 11/07/2021 05:28:49 - INFO - __main__ - Step 58822: {'lr': 0.0003391575079420102, 'samples': 11293824, 'steps': 58821, 'loss/train': 1.6979384422302246} 11/07/2021 05:28:50 - INFO - __main__ - Step 58823: {'lr': 0.00033915255011885803, 'samples': 11294016, 'steps': 58822, 'loss/train': 1.4658546447753906} 11/07/2021 05:28:50 - INFO - __main__ - Step 58824: {'lr': 0.000339147592255535, 'samples': 11294208, 'steps': 58823, 'loss/train': 0.37755846977233887} 11/07/2021 05:28:50 - INFO - __main__ - Step 58825: {'lr': 0.00033914263435204356, 'samples': 11294400, 'steps': 58824, 'loss/train': 1.4825007915496826} 11/07/2021 05:28:51 - INFO - __main__ - Step 58826: {'lr': 0.0003391376764083858, 'samples': 11294592, 'steps': 58825, 'loss/train': 1.3988101482391357} 11/07/2021 05:28:52 - INFO - __main__ - Step 58827: {'lr': 0.00033913271842456394, 'samples': 11294784, 'steps': 58826, 'loss/train': 1.2597273588180542} 11/07/2021 05:28:52 - INFO - __main__ - Step 58828: {'lr': 0.0003391277604005802, 'samples': 11294976, 'steps': 58827, 'loss/train': 1.4019348621368408} 11/07/2021 05:28:52 - INFO - __main__ - Step 58829: {'lr': 0.00033912280233643706, 'samples': 11295168, 'steps': 58828, 'loss/train': 1.0812780857086182} 11/07/2021 05:28:53 - INFO - __main__ - Step 58830: {'lr': 0.00033911784423213645, 'samples': 11295360, 'steps': 58829, 'loss/train': 1.0912331342697144} 11/07/2021 05:28:53 - INFO - __main__ - Step 58831: {'lr': 0.00033911288608768063, 'samples': 11295552, 'steps': 58830, 'loss/train': 1.7023320198059082} 11/07/2021 05:28:54 - INFO - __main__ - Step 58832: {'lr': 0.000339107927903072, 'samples': 11295744, 'steps': 58831, 'loss/train': 1.2451249361038208} 11/07/2021 05:28:55 - INFO - __main__ - Step 58833: {'lr': 0.00033910296967831267, 'samples': 11295936, 'steps': 58832, 'loss/train': 1.6281076669692993} 11/07/2021 05:28:55 - INFO - __main__ - Step 58834: {'lr': 0.00033909801141340497, 'samples': 11296128, 'steps': 58833, 'loss/train': 1.525278925895691} 11/07/2021 05:28:55 - INFO - __main__ - Step 58835: {'lr': 0.00033909305310835105, 'samples': 11296320, 'steps': 58834, 'loss/train': 3.20565128326416} 11/07/2021 05:28:56 - INFO - __main__ - Step 58836: {'lr': 0.00033908809476315325, 'samples': 11296512, 'steps': 58835, 'loss/train': 0.47229987382888794} 11/07/2021 05:28:56 - INFO - __main__ - Step 58837: {'lr': 0.0003390831363778136, 'samples': 11296704, 'steps': 58836, 'loss/train': 1.7827147245407104} 11/07/2021 05:28:57 - INFO - __main__ - Step 58838: {'lr': 0.00033907817795233454, 'samples': 11296896, 'steps': 58837, 'loss/train': 1.7282392978668213} 11/07/2021 05:28:57 - INFO - __main__ - Step 58839: {'lr': 0.0003390732194867182, 'samples': 11297088, 'steps': 58838, 'loss/train': 1.4342314004898071} 11/07/2021 05:28:58 - INFO - __main__ - Step 58840: {'lr': 0.00033906826098096686, 'samples': 11297280, 'steps': 58839, 'loss/train': 1.4809050559997559} 11/07/2021 05:28:58 - INFO - __main__ - Step 58841: {'lr': 0.0003390633024350827, 'samples': 11297472, 'steps': 58840, 'loss/train': 1.2768079042434692} 11/07/2021 05:28:59 - INFO - __main__ - Step 58842: {'lr': 0.000339058343849068, 'samples': 11297664, 'steps': 58841, 'loss/train': 1.1015459299087524} 11/07/2021 05:29:00 - INFO - __main__ - Step 58843: {'lr': 0.00033905338522292514, 'samples': 11297856, 'steps': 58842, 'loss/train': 1.8206262588500977} 11/07/2021 05:29:00 - INFO - __main__ - Step 58844: {'lr': 0.00033904842655665604, 'samples': 11298048, 'steps': 58843, 'loss/train': 1.1822291612625122} 11/07/2021 05:29:00 - INFO - __main__ - Step 58845: {'lr': 0.00033904346785026306, 'samples': 11298240, 'steps': 58844, 'loss/train': 1.5804479122161865} 11/07/2021 05:29:01 - INFO - __main__ - Step 58846: {'lr': 0.0003390385091037486, 'samples': 11298432, 'steps': 58845, 'loss/train': 0.22993512451648712} 11/07/2021 05:29:01 - INFO - __main__ - Step 58847: {'lr': 0.0003390335503171146, 'samples': 11298624, 'steps': 58846, 'loss/train': 1.4215235710144043} 11/07/2021 05:29:02 - INFO - __main__ - Step 58848: {'lr': 0.0003390285914903636, 'samples': 11298816, 'steps': 58847, 'loss/train': 1.3228076696395874} 11/07/2021 05:29:02 - INFO - __main__ - Step 58849: {'lr': 0.0003390236326234977, 'samples': 11299008, 'steps': 58848, 'loss/train': 1.8202475309371948} 11/07/2021 05:29:03 - INFO - __main__ - Step 58850: {'lr': 0.000339018673716519, 'samples': 11299200, 'steps': 58849, 'loss/train': 1.0453081130981445} 11/07/2021 05:29:03 - INFO - __main__ - Step 58851: {'lr': 0.0003390137147694299, 'samples': 11299392, 'steps': 58850, 'loss/train': 1.7012423276901245} 11/07/2021 05:29:03 - INFO - __main__ - Step 58852: {'lr': 0.0003390087557822326, 'samples': 11299584, 'steps': 58851, 'loss/train': 1.2014970779418945} 11/07/2021 05:29:05 - INFO - __main__ - Step 58853: {'lr': 0.00033900379675492933, 'samples': 11299776, 'steps': 58852, 'loss/train': 1.6210826635360718} 11/07/2021 05:29:05 - INFO - __main__ - Step 58854: {'lr': 0.00033899883768752234, 'samples': 11299968, 'steps': 58853, 'loss/train': 0.15507566928863525} 11/07/2021 05:29:05 - INFO - __main__ - Step 58855: {'lr': 0.00033899387858001386, 'samples': 11300160, 'steps': 58854, 'loss/train': 1.646939754486084} 11/07/2021 05:29:06 - INFO - __main__ - Step 58856: {'lr': 0.0003389889194324061, 'samples': 11300352, 'steps': 58855, 'loss/train': 1.5895023345947266} 11/07/2021 05:29:06 - INFO - __main__ - Step 58857: {'lr': 0.0003389839602447013, 'samples': 11300544, 'steps': 58856, 'loss/train': 1.5853663682937622} 11/07/2021 05:29:07 - INFO - __main__ - Step 58858: {'lr': 0.0003389790010169017, 'samples': 11300736, 'steps': 58857, 'loss/train': 1.2243292331695557} 11/07/2021 05:29:07 - INFO - __main__ - Step 58859: {'lr': 0.00033897404174900955, 'samples': 11300928, 'steps': 58858, 'loss/train': 1.537453532218933} 11/07/2021 05:29:08 - INFO - __main__ - Step 58860: {'lr': 0.000338969082441027, 'samples': 11301120, 'steps': 58859, 'loss/train': 1.625871181488037} 11/07/2021 05:29:08 - INFO - __main__ - Step 58861: {'lr': 0.00033896412309295643, 'samples': 11301312, 'steps': 58860, 'loss/train': 1.698410153388977} 11/07/2021 05:29:08 - INFO - __main__ - Step 58862: {'lr': 0.00033895916370479994, 'samples': 11301504, 'steps': 58861, 'loss/train': 1.4595767259597778} 11/07/2021 05:29:09 - INFO - __main__ - Step 58863: {'lr': 0.00033895420427655995, 'samples': 11301696, 'steps': 58862, 'loss/train': 1.8079336881637573} 11/07/2021 05:29:10 - INFO - __main__ - Step 58864: {'lr': 0.0003389492448082384, 'samples': 11301888, 'steps': 58863, 'loss/train': 1.5572354793548584} 11/07/2021 05:29:10 - INFO - __main__ - Step 58865: {'lr': 0.0003389442852998378, 'samples': 11302080, 'steps': 58864, 'loss/train': 0.9346509575843811} 11/07/2021 05:29:10 - INFO - __main__ - Step 58866: {'lr': 0.0003389393257513602, 'samples': 11302272, 'steps': 58865, 'loss/train': 1.4819879531860352} 11/07/2021 05:29:11 - INFO - __main__ - Step 58867: {'lr': 0.00033893436616280796, 'samples': 11302464, 'steps': 58866, 'loss/train': 1.6216272115707397} 11/07/2021 05:29:11 - INFO - __main__ - Step 58868: {'lr': 0.0003389294065341833, 'samples': 11302656, 'steps': 58867, 'loss/train': 1.5425323247909546} 11/07/2021 05:29:12 - INFO - __main__ - Step 58869: {'lr': 0.0003389244468654884, 'samples': 11302848, 'steps': 58868, 'loss/train': 1.6218205690383911} 11/07/2021 05:29:13 - INFO - __main__ - Step 58870: {'lr': 0.0003389194871567255, 'samples': 11303040, 'steps': 58869, 'loss/train': 1.5652034282684326} 11/07/2021 05:29:13 - INFO - __main__ - Step 58871: {'lr': 0.00033891452740789687, 'samples': 11303232, 'steps': 58870, 'loss/train': 1.0977128744125366} 11/07/2021 05:29:13 - INFO - __main__ - Step 58872: {'lr': 0.0003389095676190047, 'samples': 11303424, 'steps': 58871, 'loss/train': 1.8839526176452637} 11/07/2021 05:29:14 - INFO - __main__ - Step 58873: {'lr': 0.00033890460779005126, 'samples': 11303616, 'steps': 58872, 'loss/train': 0.8943089246749878} 11/07/2021 05:29:15 - INFO - __main__ - Step 58874: {'lr': 0.0003388996479210388, 'samples': 11303808, 'steps': 58873, 'loss/train': 1.4530586004257202} 11/07/2021 05:29:15 - INFO - __main__ - Step 58875: {'lr': 0.0003388946880119695, 'samples': 11304000, 'steps': 58874, 'loss/train': 1.6234910488128662} 11/07/2021 05:29:15 - INFO - __main__ - Step 58876: {'lr': 0.0003388897280628457, 'samples': 11304192, 'steps': 58875, 'loss/train': 1.7356258630752563} 11/07/2021 05:29:16 - INFO - __main__ - Step 58877: {'lr': 0.00033888476807366946, 'samples': 11304384, 'steps': 58876, 'loss/train': 1.4881302118301392} 11/07/2021 05:29:16 - INFO - __main__ - Step 58878: {'lr': 0.00033887980804444314, 'samples': 11304576, 'steps': 58877, 'loss/train': 2.574214220046997} 11/07/2021 05:29:17 - INFO - __main__ - Step 58879: {'lr': 0.00033887484797516895, 'samples': 11304768, 'steps': 58878, 'loss/train': 1.7607316970825195} 11/07/2021 05:29:18 - INFO - __main__ - Step 58880: {'lr': 0.00033886988786584914, 'samples': 11304960, 'steps': 58879, 'loss/train': 1.4334523677825928} 11/07/2021 05:29:18 - INFO - __main__ - Step 58881: {'lr': 0.0003388649277164859, 'samples': 11305152, 'steps': 58880, 'loss/train': 1.0518630743026733} 11/07/2021 05:29:18 - INFO - __main__ - Step 58882: {'lr': 0.0003388599675270815, 'samples': 11305344, 'steps': 58881, 'loss/train': 1.4622739553451538} 11/07/2021 05:29:19 - INFO - __main__ - Step 58883: {'lr': 0.00033885500729763824, 'samples': 11305536, 'steps': 58882, 'loss/train': 1.1759334802627563} 11/07/2021 05:29:19 - INFO - __main__ - Step 58884: {'lr': 0.00033885004702815825, 'samples': 11305728, 'steps': 58883, 'loss/train': 1.3764007091522217} 11/07/2021 05:29:20 - INFO - __main__ - Step 58885: {'lr': 0.00033884508671864377, 'samples': 11305920, 'steps': 58884, 'loss/train': 2.0678844451904297} 11/07/2021 05:29:20 - INFO - __main__ - Step 58886: {'lr': 0.0003388401263690971, 'samples': 11306112, 'steps': 58885, 'loss/train': 1.4492619037628174} 11/07/2021 05:29:21 - INFO - __main__ - Step 58887: {'lr': 0.00033883516597952033, 'samples': 11306304, 'steps': 58886, 'loss/train': 1.3749444484710693} 11/07/2021 05:29:21 - INFO - __main__ - Step 58888: {'lr': 0.00033883020554991594, 'samples': 11306496, 'steps': 58887, 'loss/train': 1.6271275281906128} 11/07/2021 05:29:21 - INFO - __main__ - Step 58889: {'lr': 0.000338825245080286, 'samples': 11306688, 'steps': 58888, 'loss/train': 1.6742115020751953} 11/07/2021 05:29:22 - INFO - __main__ - Step 58890: {'lr': 0.0003388202845706328, 'samples': 11306880, 'steps': 58889, 'loss/train': 1.6185718774795532} 11/07/2021 05:29:23 - INFO - __main__ - Step 58891: {'lr': 0.0003388153240209585, 'samples': 11307072, 'steps': 58890, 'loss/train': 1.3241333961486816} 11/07/2021 05:29:23 - INFO - __main__ - Step 58892: {'lr': 0.0003388103634312654, 'samples': 11307264, 'steps': 58891, 'loss/train': 0.517514169216156} 11/07/2021 05:29:23 - INFO - __main__ - Step 58893: {'lr': 0.0003388054028015557, 'samples': 11307456, 'steps': 58892, 'loss/train': 1.2490746974945068} 11/07/2021 05:29:24 - INFO - __main__ - Step 58894: {'lr': 0.00033880044213183163, 'samples': 11307648, 'steps': 58893, 'loss/train': 1.1121854782104492} 11/07/2021 05:29:25 - INFO - __main__ - Step 58895: {'lr': 0.00033879548142209546, 'samples': 11307840, 'steps': 58894, 'loss/train': 1.2886314392089844} 11/07/2021 05:29:25 - INFO - __main__ - Step 58896: {'lr': 0.0003387905206723496, 'samples': 11308032, 'steps': 58895, 'loss/train': 1.453001856803894} 11/07/2021 05:29:25 - INFO - __main__ - Step 58897: {'lr': 0.00033878555988259583, 'samples': 11308224, 'steps': 58896, 'loss/train': 1.716269612312317} 11/07/2021 05:29:26 - INFO - __main__ - Step 58898: {'lr': 0.0003387805990528368, 'samples': 11308416, 'steps': 58897, 'loss/train': 1.284360408782959} 11/07/2021 05:29:26 - INFO - __main__ - Step 58899: {'lr': 0.0003387756381830746, 'samples': 11308608, 'steps': 58898, 'loss/train': 1.240405559539795} 11/07/2021 05:29:27 - INFO - __main__ - Step 58900: {'lr': 0.00033877067727331145, 'samples': 11308800, 'steps': 58899, 'loss/train': 1.0199034214019775} 11/07/2021 05:29:27 - INFO - __main__ - Step 58901: {'lr': 0.00033876571632354956, 'samples': 11308992, 'steps': 58900, 'loss/train': 1.4096803665161133} 11/07/2021 05:29:28 - INFO - __main__ - Step 58902: {'lr': 0.0003387607553337913, 'samples': 11309184, 'steps': 58901, 'loss/train': 1.220066785812378} 11/07/2021 05:29:28 - INFO - __main__ - Step 58903: {'lr': 0.00033875579430403877, 'samples': 11309376, 'steps': 58902, 'loss/train': 1.6530935764312744} 11/07/2021 05:29:29 - INFO - __main__ - Step 58904: {'lr': 0.00033875083323429425, 'samples': 11309568, 'steps': 58903, 'loss/train': 1.5301777124404907} 11/07/2021 05:29:30 - INFO - __main__ - Step 58905: {'lr': 0.0003387458721245599, 'samples': 11309760, 'steps': 58904, 'loss/train': 1.2797760963439941} 11/07/2021 05:29:30 - INFO - __main__ - Step 58906: {'lr': 0.0003387409109748381, 'samples': 11309952, 'steps': 58905, 'loss/train': 1.5946542024612427} 11/07/2021 05:29:30 - INFO - __main__ - Step 58907: {'lr': 0.0003387359497851311, 'samples': 11310144, 'steps': 58906, 'loss/train': 1.783267855644226} 11/07/2021 05:29:31 - INFO - __main__ - Step 58908: {'lr': 0.00033873098855544093, 'samples': 11310336, 'steps': 58907, 'loss/train': 1.5928608179092407} 11/07/2021 05:29:31 - INFO - __main__ - Step 58909: {'lr': 0.00033872602728576997, 'samples': 11310528, 'steps': 58908, 'loss/train': 1.4806081056594849} 11/07/2021 05:29:32 - INFO - __main__ - Step 58910: {'lr': 0.0003387210659761204, 'samples': 11310720, 'steps': 58909, 'loss/train': 1.586470365524292} 11/07/2021 05:29:32 - INFO - __main__ - Step 58911: {'lr': 0.00033871610462649456, 'samples': 11310912, 'steps': 58910, 'loss/train': 1.1386555433273315} 11/07/2021 05:29:33 - INFO - __main__ - Step 58912: {'lr': 0.00033871114323689457, 'samples': 11311104, 'steps': 58911, 'loss/train': 1.499022126197815} 11/07/2021 05:29:33 - INFO - __main__ - Step 58913: {'lr': 0.0003387061818073227, 'samples': 11311296, 'steps': 58912, 'loss/train': 1.3436602354049683} 11/07/2021 05:29:33 - INFO - __main__ - Step 58914: {'lr': 0.00033870122033778123, 'samples': 11311488, 'steps': 58913, 'loss/train': 1.4523167610168457} 11/07/2021 05:29:34 - INFO - __main__ - Step 58915: {'lr': 0.00033869625882827233, 'samples': 11311680, 'steps': 58914, 'loss/train': 1.2665297985076904} 11/07/2021 05:29:35 - INFO - __main__ - Step 58916: {'lr': 0.00033869129727879827, 'samples': 11311872, 'steps': 58915, 'loss/train': 1.5188469886779785} 11/07/2021 05:29:35 - INFO - __main__ - Step 58917: {'lr': 0.0003386863356893612, 'samples': 11312064, 'steps': 58916, 'loss/train': 1.2079638242721558} 11/07/2021 05:29:36 - INFO - __main__ - Step 58918: {'lr': 0.00033868137405996363, 'samples': 11312256, 'steps': 58917, 'loss/train': 1.0151643753051758} 11/07/2021 05:29:36 - INFO - __main__ - Step 58919: {'lr': 0.0003386764123906075, 'samples': 11312448, 'steps': 58918, 'loss/train': 1.3737975358963013} 11/07/2021 05:29:36 - INFO - __main__ - Step 58920: {'lr': 0.00033867145068129515, 'samples': 11312640, 'steps': 58919, 'loss/train': 1.2140005826950073} 11/07/2021 05:29:37 - INFO - __main__ - Step 58921: {'lr': 0.0003386664889320287, 'samples': 11312832, 'steps': 58920, 'loss/train': 1.5475101470947266} 11/07/2021 05:29:38 - INFO - __main__ - Step 58922: {'lr': 0.0003386615271428106, 'samples': 11313024, 'steps': 58921, 'loss/train': 1.5680527687072754} 11/07/2021 05:29:38 - INFO - __main__ - Step 58923: {'lr': 0.000338656565313643, 'samples': 11313216, 'steps': 58922, 'loss/train': 0.8294448852539062} 11/07/2021 05:29:38 - INFO - __main__ - Step 58924: {'lr': 0.0003386516034445281, 'samples': 11313408, 'steps': 58923, 'loss/train': 1.2829402685165405} 11/07/2021 05:29:39 - INFO - __main__ - Step 58925: {'lr': 0.0003386466415354682, 'samples': 11313600, 'steps': 58924, 'loss/train': 1.3706296682357788} 11/07/2021 05:29:40 - INFO - __main__ - Step 58926: {'lr': 0.00033864167958646543, 'samples': 11313792, 'steps': 58925, 'loss/train': 1.8204190731048584} 11/07/2021 05:29:40 - INFO - __main__ - Step 58927: {'lr': 0.00033863671759752206, 'samples': 11313984, 'steps': 58926, 'loss/train': 1.4258006811141968} 11/07/2021 05:29:40 - INFO - __main__ - Step 58928: {'lr': 0.0003386317555686404, 'samples': 11314176, 'steps': 58927, 'loss/train': 1.1298774480819702} 11/07/2021 05:29:41 - INFO - __main__ - Step 58929: {'lr': 0.0003386267934998226, 'samples': 11314368, 'steps': 58928, 'loss/train': 1.2495743036270142} 11/07/2021 05:29:41 - INFO - __main__ - Step 58930: {'lr': 0.00033862183139107106, 'samples': 11314560, 'steps': 58929, 'loss/train': 1.1795114278793335} 11/07/2021 05:29:41 - INFO - __main__ - Step 58931: {'lr': 0.0003386168692423878, 'samples': 11314752, 'steps': 58930, 'loss/train': 1.692266821861267} 11/07/2021 05:29:43 - INFO - __main__ - Step 58932: {'lr': 0.0003386119070537751, 'samples': 11314944, 'steps': 58931, 'loss/train': 1.6516591310501099} 11/07/2021 05:29:43 - INFO - __main__ - Step 58933: {'lr': 0.0003386069448252353, 'samples': 11315136, 'steps': 58932, 'loss/train': 1.7982105016708374} 11/07/2021 05:29:43 - INFO - __main__ - Step 58934: {'lr': 0.00033860198255677054, 'samples': 11315328, 'steps': 58933, 'loss/train': 1.8225808143615723} 11/07/2021 05:29:44 - INFO - __main__ - Step 58935: {'lr': 0.0003385970202483831, 'samples': 11315520, 'steps': 58934, 'loss/train': 0.9737790822982788} 11/07/2021 05:29:44 - INFO - __main__ - Step 58936: {'lr': 0.0003385920579000752, 'samples': 11315712, 'steps': 58935, 'loss/train': 1.3228092193603516} 11/07/2021 05:29:45 - INFO - __main__ - Step 58937: {'lr': 0.0003385870955118492, 'samples': 11315904, 'steps': 58936, 'loss/train': 1.6627357006072998} 11/07/2021 05:29:45 - INFO - __main__ - Step 58938: {'lr': 0.0003385821330837071, 'samples': 11316096, 'steps': 58937, 'loss/train': 1.279924988746643} 11/07/2021 05:29:46 - INFO - __main__ - Step 58939: {'lr': 0.0003385771706156513, 'samples': 11316288, 'steps': 58938, 'loss/train': 2.1263813972473145} 11/07/2021 05:29:46 - INFO - __main__ - Step 58940: {'lr': 0.00033857220810768395, 'samples': 11316480, 'steps': 58939, 'loss/train': 1.8346434831619263} 11/07/2021 05:29:46 - INFO - __main__ - Step 58941: {'lr': 0.00033856724555980736, 'samples': 11316672, 'steps': 58940, 'loss/train': 1.4390860795974731} 11/07/2021 05:29:47 - INFO - __main__ - Step 58942: {'lr': 0.00033856228297202373, 'samples': 11316864, 'steps': 58941, 'loss/train': 0.9148387312889099} 11/07/2021 05:29:48 - INFO - __main__ - Step 58943: {'lr': 0.0003385573203443354, 'samples': 11317056, 'steps': 58942, 'loss/train': 1.4333701133728027} 11/07/2021 05:29:48 - INFO - __main__ - Step 58944: {'lr': 0.0003385523576767444, 'samples': 11317248, 'steps': 58943, 'loss/train': 1.6836155652999878} 11/07/2021 05:29:48 - INFO - __main__ - Step 58945: {'lr': 0.0003385473949692531, 'samples': 11317440, 'steps': 58944, 'loss/train': 1.46369206905365} 11/07/2021 05:29:49 - INFO - __main__ - Step 58946: {'lr': 0.0003385424322218637, 'samples': 11317632, 'steps': 58945, 'loss/train': 1.2269657850265503} 11/07/2021 05:29:50 - INFO - __main__ - Step 58947: {'lr': 0.0003385374694345784, 'samples': 11317824, 'steps': 58946, 'loss/train': 1.8922995328903198} 11/07/2021 05:29:50 - INFO - __main__ - Step 58948: {'lr': 0.00033853250660739954, 'samples': 11318016, 'steps': 58947, 'loss/train': 1.7637218236923218} 11/07/2021 05:29:51 - INFO - __main__ - Step 58949: {'lr': 0.00033852754374032927, 'samples': 11318208, 'steps': 58948, 'loss/train': 1.931194543838501} 11/07/2021 05:29:51 - INFO - __main__ - Step 58950: {'lr': 0.00033852258083336996, 'samples': 11318400, 'steps': 58949, 'loss/train': 1.311229944229126} 11/07/2021 05:29:51 - INFO - __main__ - Step 58951: {'lr': 0.0003385176178865236, 'samples': 11318592, 'steps': 58950, 'loss/train': 1.9002926349639893} 11/07/2021 05:29:52 - INFO - __main__ - Step 58952: {'lr': 0.00033851265489979267, 'samples': 11318784, 'steps': 58951, 'loss/train': 1.1847608089447021} 11/07/2021 05:29:53 - INFO - __main__ - Step 58953: {'lr': 0.00033850769187317923, 'samples': 11318976, 'steps': 58952, 'loss/train': 1.194960117340088} 11/07/2021 05:29:53 - INFO - __main__ - Step 58954: {'lr': 0.00033850272880668565, 'samples': 11319168, 'steps': 58953, 'loss/train': 1.077144742012024} 11/07/2021 05:29:53 - INFO - __main__ - Step 58955: {'lr': 0.000338497765700314, 'samples': 11319360, 'steps': 58954, 'loss/train': 1.4415621757507324} 11/07/2021 05:29:54 - INFO - __main__ - Step 58956: {'lr': 0.00033849280255406674, 'samples': 11319552, 'steps': 58955, 'loss/train': 1.3159661293029785} 11/07/2021 05:29:55 - INFO - __main__ - Step 58957: {'lr': 0.000338487839367946, 'samples': 11319744, 'steps': 58956, 'loss/train': 1.506406545639038} 11/07/2021 05:29:55 - INFO - __main__ - Step 58958: {'lr': 0.00033848287614195394, 'samples': 11319936, 'steps': 58957, 'loss/train': 1.6731747388839722} 11/07/2021 05:29:55 - INFO - __main__ - Step 58959: {'lr': 0.00033847791287609287, 'samples': 11320128, 'steps': 58958, 'loss/train': 1.0841224193572998} 11/07/2021 05:29:56 - INFO - __main__ - Step 58960: {'lr': 0.00033847294957036503, 'samples': 11320320, 'steps': 58959, 'loss/train': 1.33205246925354} 11/07/2021 05:29:56 - INFO - __main__ - Step 58961: {'lr': 0.0003384679862247726, 'samples': 11320512, 'steps': 58960, 'loss/train': 0.9158821105957031} 11/07/2021 05:29:57 - INFO - __main__ - Step 58962: {'lr': 0.0003384630228393179, 'samples': 11320704, 'steps': 58961, 'loss/train': 0.9067651033401489} 11/07/2021 05:29:58 - INFO - __main__ - Step 58963: {'lr': 0.0003384580594140031, 'samples': 11320896, 'steps': 58962, 'loss/train': 1.540272831916809} 11/07/2021 05:29:58 - INFO - __main__ - Step 58964: {'lr': 0.00033845309594883054, 'samples': 11321088, 'steps': 58963, 'loss/train': 1.5439705848693848} 11/07/2021 05:29:58 - INFO - __main__ - Step 58965: {'lr': 0.0003384481324438023, 'samples': 11321280, 'steps': 58964, 'loss/train': 1.2622952461242676} 11/07/2021 05:29:59 - INFO - __main__ - Step 58966: {'lr': 0.00033844316889892074, 'samples': 11321472, 'steps': 58965, 'loss/train': 1.2017014026641846} 11/07/2021 05:30:00 - INFO - __main__ - Step 58967: {'lr': 0.000338438205314188, 'samples': 11321664, 'steps': 58966, 'loss/train': 1.4321461915969849} 11/07/2021 05:30:00 - INFO - __main__ - Step 58968: {'lr': 0.00033843324168960644, 'samples': 11321856, 'steps': 58967, 'loss/train': 1.3554356098175049} 11/07/2021 05:30:00 - INFO - __main__ - Step 58969: {'lr': 0.0003384282780251782, 'samples': 11322048, 'steps': 58968, 'loss/train': 1.3642568588256836} 11/07/2021 05:30:01 - INFO - __main__ - Step 58970: {'lr': 0.0003384233143209056, 'samples': 11322240, 'steps': 58969, 'loss/train': 1.3941218852996826} 11/07/2021 05:30:01 - INFO - __main__ - Step 58971: {'lr': 0.0003384183505767907, 'samples': 11322432, 'steps': 58970, 'loss/train': 1.179962158203125} 11/07/2021 05:30:01 - INFO - __main__ - Step 58972: {'lr': 0.0003384133867928359, 'samples': 11322624, 'steps': 58971, 'loss/train': 1.2346917390823364} 11/07/2021 05:30:02 - INFO - __main__ - Step 58973: {'lr': 0.0003384084229690434, 'samples': 11322816, 'steps': 58972, 'loss/train': 1.7636467218399048} 11/07/2021 05:30:03 - INFO - __main__ - Step 58974: {'lr': 0.0003384034591054154, 'samples': 11323008, 'steps': 58973, 'loss/train': 1.005846381187439} 11/07/2021 05:30:03 - INFO - __main__ - Step 58975: {'lr': 0.0003383984952019542, 'samples': 11323200, 'steps': 58974, 'loss/train': 1.4979660511016846} 11/07/2021 05:30:03 - INFO - __main__ - Step 58976: {'lr': 0.00033839353125866194, 'samples': 11323392, 'steps': 58975, 'loss/train': 1.52721107006073} 11/07/2021 05:30:04 - INFO - __main__ - Step 58977: {'lr': 0.00033838856727554106, 'samples': 11323584, 'steps': 58976, 'loss/train': 1.274423599243164} 11/07/2021 05:30:05 - INFO - __main__ - Step 58978: {'lr': 0.00033838360325259354, 'samples': 11323776, 'steps': 58977, 'loss/train': 0.9597126841545105} 11/07/2021 05:30:05 - INFO - __main__ - Step 58979: {'lr': 0.00033837863918982175, 'samples': 11323968, 'steps': 58978, 'loss/train': 1.0574196577072144} 11/07/2021 05:30:05 - INFO - __main__ - Step 58980: {'lr': 0.0003383736750872279, 'samples': 11324160, 'steps': 58979, 'loss/train': 1.7264882326126099} 11/07/2021 05:30:06 - INFO - __main__ - Step 58981: {'lr': 0.00033836871094481433, 'samples': 11324352, 'steps': 58980, 'loss/train': 1.2755435705184937} 11/07/2021 05:30:06 - INFO - __main__ - Step 58982: {'lr': 0.0003383637467625831, 'samples': 11324544, 'steps': 58981, 'loss/train': 1.6883162260055542} 11/07/2021 05:30:07 - INFO - __main__ - Step 58983: {'lr': 0.00033835878254053647, 'samples': 11324736, 'steps': 58982, 'loss/train': 1.2053639888763428} 11/07/2021 05:30:07 - INFO - __main__ - Step 58984: {'lr': 0.00033835381827867686, 'samples': 11324928, 'steps': 58983, 'loss/train': 1.5472948551177979} 11/07/2021 05:30:08 - INFO - __main__ - Step 58985: {'lr': 0.00033834885397700633, 'samples': 11325120, 'steps': 58984, 'loss/train': 1.3101754188537598} 11/07/2021 05:30:08 - INFO - __main__ - Step 58986: {'lr': 0.00033834388963552715, 'samples': 11325312, 'steps': 58985, 'loss/train': 1.4082560539245605} 11/07/2021 05:30:09 - INFO - __main__ - Step 58987: {'lr': 0.0003383389252542416, 'samples': 11325504, 'steps': 58986, 'loss/train': 1.0450400114059448} 11/07/2021 05:30:10 - INFO - __main__ - Step 58988: {'lr': 0.0003383339608331519, 'samples': 11325696, 'steps': 58987, 'loss/train': 1.324913501739502} 11/07/2021 05:30:10 - INFO - __main__ - Step 58989: {'lr': 0.00033832899637226024, 'samples': 11325888, 'steps': 58988, 'loss/train': 1.4061994552612305} 11/07/2021 05:30:10 - INFO - __main__ - Step 58990: {'lr': 0.0003383240318715689, 'samples': 11326080, 'steps': 58989, 'loss/train': 1.8055460453033447} 11/07/2021 05:30:11 - INFO - __main__ - Step 58991: {'lr': 0.0003383190673310802, 'samples': 11326272, 'steps': 58990, 'loss/train': 1.3656814098358154} 11/07/2021 05:30:11 - INFO - __main__ - Step 58992: {'lr': 0.0003383141027507962, 'samples': 11326464, 'steps': 58991, 'loss/train': 1.1166648864746094} 11/07/2021 05:30:12 - INFO - __main__ - Step 58993: {'lr': 0.0003383091381307193, 'samples': 11326656, 'steps': 58992, 'loss/train': 1.576315999031067} 11/07/2021 05:30:12 - INFO - __main__ - Step 58994: {'lr': 0.0003383041734708516, 'samples': 11326848, 'steps': 58993, 'loss/train': 1.656105637550354} 11/07/2021 05:30:13 - INFO - __main__ - Step 58995: {'lr': 0.0003382992087711954, 'samples': 11327040, 'steps': 58994, 'loss/train': 1.2767597436904907} 11/07/2021 05:30:13 - INFO - __main__ - Step 58996: {'lr': 0.00033829424403175297, 'samples': 11327232, 'steps': 58995, 'loss/train': 0.18723994493484497} 11/07/2021 05:30:13 - INFO - __main__ - Step 58997: {'lr': 0.00033828927925252657, 'samples': 11327424, 'steps': 58996, 'loss/train': 1.48435378074646} 11/07/2021 05:30:14 - INFO - __main__ - Step 58998: {'lr': 0.0003382843144335183, 'samples': 11327616, 'steps': 58997, 'loss/train': 1.5496824979782104} 11/07/2021 05:30:15 - INFO - __main__ - Step 58999: {'lr': 0.0003382793495747305, 'samples': 11327808, 'steps': 58998, 'loss/train': 1.3105796575546265} 11/07/2021 05:30:15 - INFO - __main__ - Step 59000: {'lr': 0.0003382743846761654, 'samples': 11328000, 'steps': 58999, 'loss/train': 1.9641363620758057} 11/07/2021 05:30:15 - INFO - __main__ - Step 59001: {'lr': 0.0003382694197378252, 'samples': 11328192, 'steps': 59000, 'loss/train': 0.6021393537521362} 11/07/2021 05:30:16 - INFO - __main__ - Step 59002: {'lr': 0.00033826445475971216, 'samples': 11328384, 'steps': 59001, 'loss/train': 1.301620602607727} 11/07/2021 05:30:16 - INFO - __main__ - Step 59003: {'lr': 0.0003382594897418285, 'samples': 11328576, 'steps': 59002, 'loss/train': 1.00771963596344} 11/07/2021 05:30:17 - INFO - __main__ - Step 59004: {'lr': 0.0003382545246841766, 'samples': 11328768, 'steps': 59003, 'loss/train': 1.8510748147964478} 11/07/2021 05:30:18 - INFO - __main__ - Step 59005: {'lr': 0.00033824955958675843, 'samples': 11328960, 'steps': 59004, 'loss/train': 0.6068124175071716} 11/07/2021 05:30:18 - INFO - __main__ - Step 59006: {'lr': 0.00033824459444957645, 'samples': 11329152, 'steps': 59005, 'loss/train': 1.2799570560455322} 11/07/2021 05:30:18 - INFO - __main__ - Step 59007: {'lr': 0.0003382396292726328, 'samples': 11329344, 'steps': 59006, 'loss/train': 1.6679545640945435} 11/07/2021 05:30:19 - INFO - __main__ - Step 59008: {'lr': 0.00033823466405592974, 'samples': 11329536, 'steps': 59007, 'loss/train': 1.59792160987854} 11/07/2021 05:30:20 - INFO - __main__ - Step 59009: {'lr': 0.00033822969879946947, 'samples': 11329728, 'steps': 59008, 'loss/train': 1.542983889579773} 11/07/2021 05:30:20 - INFO - __main__ - Step 59010: {'lr': 0.0003382247335032542, 'samples': 11329920, 'steps': 59009, 'loss/train': 1.476171851158142} 11/07/2021 05:30:20 - INFO - __main__ - Step 59011: {'lr': 0.0003382197681672864, 'samples': 11330112, 'steps': 59010, 'loss/train': 0.20667408406734467} 11/07/2021 05:30:21 - INFO - __main__ - Step 59012: {'lr': 0.000338214802791568, 'samples': 11330304, 'steps': 59011, 'loss/train': 1.7993354797363281} 11/07/2021 05:30:21 - INFO - __main__ - Step 59013: {'lr': 0.00033820983737610147, 'samples': 11330496, 'steps': 59012, 'loss/train': 1.5575295686721802} 11/07/2021 05:30:22 - INFO - __main__ - Step 59014: {'lr': 0.00033820487192088883, 'samples': 11330688, 'steps': 59013, 'loss/train': 1.5250554084777832} 11/07/2021 05:30:23 - INFO - __main__ - Step 59015: {'lr': 0.0003381999064259325, 'samples': 11330880, 'steps': 59014, 'loss/train': 1.177808165550232} 11/07/2021 05:30:23 - INFO - __main__ - Step 59016: {'lr': 0.00033819494089123466, 'samples': 11331072, 'steps': 59015, 'loss/train': 1.3401293754577637} 11/07/2021 05:30:23 - INFO - __main__ - Step 59017: {'lr': 0.00033818997531679756, 'samples': 11331264, 'steps': 59016, 'loss/train': 0.44552716612815857} 11/07/2021 05:30:24 - INFO - __main__ - Step 59018: {'lr': 0.0003381850097026234, 'samples': 11331456, 'steps': 59017, 'loss/train': 1.4919835329055786} 11/07/2021 05:30:25 - INFO - __main__ - Step 59019: {'lr': 0.0003381800440487144, 'samples': 11331648, 'steps': 59018, 'loss/train': 1.3549867868423462} 11/07/2021 05:30:25 - INFO - __main__ - Step 59020: {'lr': 0.00033817507835507283, 'samples': 11331840, 'steps': 59019, 'loss/train': 1.3828877210617065} 11/07/2021 05:30:25 - INFO - __main__ - Step 59021: {'lr': 0.00033817011262170097, 'samples': 11332032, 'steps': 59020, 'loss/train': 1.3331918716430664} 11/07/2021 05:30:26 - INFO - __main__ - Step 59022: {'lr': 0.000338165146848601, 'samples': 11332224, 'steps': 59021, 'loss/train': 1.3515325784683228} 11/07/2021 05:30:26 - INFO - __main__ - Step 59023: {'lr': 0.0003381601810357752, 'samples': 11332416, 'steps': 59022, 'loss/train': 1.296303391456604} 11/07/2021 05:30:27 - INFO - __main__ - Step 59024: {'lr': 0.00033815521518322576, 'samples': 11332608, 'steps': 59023, 'loss/train': 1.0616830587387085} 11/07/2021 05:30:27 - INFO - __main__ - Step 59025: {'lr': 0.00033815024929095496, 'samples': 11332800, 'steps': 59024, 'loss/train': 1.3384326696395874} 11/07/2021 05:30:28 - INFO - __main__ - Step 59026: {'lr': 0.000338145283358965, 'samples': 11332992, 'steps': 59025, 'loss/train': 1.268005132675171} 11/07/2021 05:30:28 - INFO - __main__ - Step 59027: {'lr': 0.0003381403173872581, 'samples': 11333184, 'steps': 59026, 'loss/train': 0.9674777984619141} 11/07/2021 05:30:29 - INFO - __main__ - Step 59028: {'lr': 0.00033813535137583656, 'samples': 11333376, 'steps': 59027, 'loss/train': 1.3868147134780884} 11/07/2021 05:30:29 - INFO - __main__ - Step 59029: {'lr': 0.0003381303853247026, 'samples': 11333568, 'steps': 59028, 'loss/train': 1.801058292388916} 11/07/2021 05:30:30 - INFO - __main__ - Step 59030: {'lr': 0.0003381254192338585, 'samples': 11333760, 'steps': 59029, 'loss/train': 1.4851211309432983} 11/07/2021 05:30:30 - INFO - __main__ - Step 59031: {'lr': 0.00033812045310330636, 'samples': 11333952, 'steps': 59030, 'loss/train': 0.4006422758102417} 11/07/2021 05:30:31 - INFO - __main__ - Step 59032: {'lr': 0.0003381154869330485, 'samples': 11334144, 'steps': 59031, 'loss/train': 0.8292285799980164} 11/07/2021 05:30:31 - INFO - __main__ - Step 59033: {'lr': 0.00033811052072308724, 'samples': 11334336, 'steps': 59032, 'loss/train': 1.8231481313705444} 11/07/2021 05:30:31 - INFO - __main__ - Step 59034: {'lr': 0.0003381055544734247, 'samples': 11334528, 'steps': 59033, 'loss/train': 1.622146725654602} 11/07/2021 05:30:32 - INFO - __main__ - Step 59035: {'lr': 0.00033810058818406307, 'samples': 11334720, 'steps': 59034, 'loss/train': 1.0690467357635498} 11/07/2021 05:30:33 - INFO - __main__ - Step 59036: {'lr': 0.0003380956218550049, 'samples': 11334912, 'steps': 59035, 'loss/train': 1.2485395669937134} 11/07/2021 05:30:33 - INFO - __main__ - Step 59037: {'lr': 0.000338090655486252, 'samples': 11335104, 'steps': 59036, 'loss/train': 1.5792678594589233} 11/07/2021 05:30:33 - INFO - __main__ - Step 59038: {'lr': 0.00033808568907780687, 'samples': 11335296, 'steps': 59037, 'loss/train': 1.1977735757827759} 11/07/2021 05:30:34 - INFO - __main__ - Step 59039: {'lr': 0.00033808072262967164, 'samples': 11335488, 'steps': 59038, 'loss/train': 1.333406686782837} 11/07/2021 05:30:35 - INFO - __main__ - Step 59040: {'lr': 0.00033807575614184864, 'samples': 11335680, 'steps': 59039, 'loss/train': 1.4398648738861084} 11/07/2021 05:30:35 - INFO - __main__ - Step 59041: {'lr': 0.0003380707896143401, 'samples': 11335872, 'steps': 59040, 'loss/train': 1.0140451192855835} 11/07/2021 05:30:36 - INFO - __main__ - Step 59042: {'lr': 0.0003380658230471482, 'samples': 11336064, 'steps': 59041, 'loss/train': 1.574953317642212} 11/07/2021 05:30:36 - INFO - __main__ - Step 59043: {'lr': 0.0003380608564402752, 'samples': 11336256, 'steps': 59042, 'loss/train': 1.8984801769256592} 11/07/2021 05:30:36 - INFO - __main__ - Step 59044: {'lr': 0.0003380558897937233, 'samples': 11336448, 'steps': 59043, 'loss/train': 1.3492224216461182} 11/07/2021 05:30:37 - INFO - __main__ - Step 59045: {'lr': 0.0003380509231074948, 'samples': 11336640, 'steps': 59044, 'loss/train': 1.5896109342575073} 11/07/2021 05:30:38 - INFO - __main__ - Step 59046: {'lr': 0.0003380459563815919, 'samples': 11336832, 'steps': 59045, 'loss/train': 1.0907433032989502} 11/07/2021 05:30:38 - INFO - __main__ - Step 59047: {'lr': 0.0003380409896160169, 'samples': 11337024, 'steps': 59046, 'loss/train': 1.303443193435669} 11/07/2021 05:30:38 - INFO - __main__ - Step 59048: {'lr': 0.00033803602281077194, 'samples': 11337216, 'steps': 59047, 'loss/train': 1.4601738452911377} 11/07/2021 05:30:39 - INFO - __main__ - Step 59049: {'lr': 0.0003380310559658593, 'samples': 11337408, 'steps': 59048, 'loss/train': 1.196200966835022} 11/07/2021 05:30:40 - INFO - __main__ - Step 59050: {'lr': 0.00033802608908128126, 'samples': 11337600, 'steps': 59049, 'loss/train': 1.0969765186309814} 11/07/2021 05:30:40 - INFO - __main__ - Step 59051: {'lr': 0.00033802112215704, 'samples': 11337792, 'steps': 59050, 'loss/train': 1.753443717956543} 11/07/2021 05:30:40 - INFO - __main__ - Step 59052: {'lr': 0.0003380161551931378, 'samples': 11337984, 'steps': 59051, 'loss/train': 1.0757197141647339} 11/07/2021 05:30:41 - INFO - __main__ - Step 59053: {'lr': 0.00033801118818957686, 'samples': 11338176, 'steps': 59052, 'loss/train': 1.2961983680725098} 11/07/2021 05:30:41 - INFO - __main__ - Step 59054: {'lr': 0.00033800622114635943, 'samples': 11338368, 'steps': 59053, 'loss/train': 1.1351161003112793} 11/07/2021 05:30:42 - INFO - __main__ - Step 59055: {'lr': 0.0003380012540634878, 'samples': 11338560, 'steps': 59054, 'loss/train': 1.7041480541229248} 11/07/2021 05:30:42 - INFO - __main__ - Step 59056: {'lr': 0.00033799628694096407, 'samples': 11338752, 'steps': 59055, 'loss/train': 1.5331214666366577} 11/07/2021 05:30:43 - INFO - __main__ - Step 59057: {'lr': 0.0003379913197787907, 'samples': 11338944, 'steps': 59056, 'loss/train': 1.6381770372390747} 11/07/2021 05:30:43 - INFO - __main__ - Step 59058: {'lr': 0.00033798635257696976, 'samples': 11339136, 'steps': 59057, 'loss/train': 0.44634076952934265} 11/07/2021 05:30:43 - INFO - __main__ - Step 59059: {'lr': 0.0003379813853355034, 'samples': 11339328, 'steps': 59058, 'loss/train': 1.2636781930923462} 11/07/2021 05:30:45 - INFO - __main__ - Step 59060: {'lr': 0.0003379764180543941, 'samples': 11339520, 'steps': 59059, 'loss/train': 1.602419376373291} 11/07/2021 05:30:45 - INFO - __main__ - Step 59061: {'lr': 0.000337971450733644, 'samples': 11339712, 'steps': 59060, 'loss/train': 1.3686432838439941} 11/07/2021 05:30:45 - INFO - __main__ - Step 59062: {'lr': 0.00033796648337325525, 'samples': 11339904, 'steps': 59061, 'loss/train': 0.6409185528755188} 11/07/2021 05:30:46 - INFO - __main__ - Step 59063: {'lr': 0.0003379615159732302, 'samples': 11340096, 'steps': 59062, 'loss/train': 1.3427462577819824} 11/07/2021 05:30:46 - INFO - __main__ - Step 59064: {'lr': 0.00033795654853357104, 'samples': 11340288, 'steps': 59063, 'loss/train': 1.4081156253814697} 11/07/2021 05:30:47 - INFO - __main__ - Step 59065: {'lr': 0.00033795158105428, 'samples': 11340480, 'steps': 59064, 'loss/train': 1.1225422620773315} 11/07/2021 05:30:47 - INFO - __main__ - Step 59066: {'lr': 0.0003379466135353594, 'samples': 11340672, 'steps': 59065, 'loss/train': 1.297689437866211} 11/07/2021 05:30:48 - INFO - __main__ - Step 59067: {'lr': 0.0003379416459768114, 'samples': 11340864, 'steps': 59066, 'loss/train': 1.8279924392700195} 11/07/2021 05:30:48 - INFO - __main__ - Step 59068: {'lr': 0.00033793667837863815, 'samples': 11341056, 'steps': 59067, 'loss/train': 1.3760242462158203} 11/07/2021 05:30:48 - INFO - __main__ - Step 59069: {'lr': 0.0003379317107408421, 'samples': 11341248, 'steps': 59068, 'loss/train': 1.5519757270812988} 11/07/2021 05:30:49 - INFO - __main__ - Step 59070: {'lr': 0.0003379267430634253, 'samples': 11341440, 'steps': 59069, 'loss/train': 1.4017212390899658} 11/07/2021 05:30:50 - INFO - __main__ - Step 59071: {'lr': 0.00033792177534639015, 'samples': 11341632, 'steps': 59070, 'loss/train': 1.1278568506240845} 11/07/2021 05:30:50 - INFO - __main__ - Step 59072: {'lr': 0.00033791680758973874, 'samples': 11341824, 'steps': 59071, 'loss/train': 1.3602644205093384} 11/07/2021 05:30:51 - INFO - __main__ - Step 59073: {'lr': 0.0003379118397934734, 'samples': 11342016, 'steps': 59072, 'loss/train': 5.711589813232422} 11/07/2021 05:30:51 - INFO - __main__ - Step 59074: {'lr': 0.00033790687195759636, 'samples': 11342208, 'steps': 59073, 'loss/train': 1.1390502452850342} 11/07/2021 05:30:51 - INFO - __main__ - Step 59075: {'lr': 0.00033790190408210973, 'samples': 11342400, 'steps': 59074, 'loss/train': 1.4508646726608276} 11/07/2021 05:30:52 - INFO - __main__ - Step 59076: {'lr': 0.000337896936167016, 'samples': 11342592, 'steps': 59075, 'loss/train': 1.3428006172180176} 11/07/2021 05:30:53 - INFO - __main__ - Step 59077: {'lr': 0.00033789196821231717, 'samples': 11342784, 'steps': 59076, 'loss/train': 1.171927809715271} 11/07/2021 05:30:53 - INFO - __main__ - Step 59078: {'lr': 0.00033788700021801564, 'samples': 11342976, 'steps': 59077, 'loss/train': 1.3192864656448364} 11/07/2021 05:30:53 - INFO - __main__ - Step 59079: {'lr': 0.00033788203218411357, 'samples': 11343168, 'steps': 59078, 'loss/train': 1.6850391626358032} 11/07/2021 05:30:54 - INFO - __main__ - Step 59080: {'lr': 0.0003378770641106132, 'samples': 11343360, 'steps': 59079, 'loss/train': 1.1569558382034302} 11/07/2021 05:30:55 - INFO - __main__ - Step 59081: {'lr': 0.00033787209599751676, 'samples': 11343552, 'steps': 59080, 'loss/train': 1.3070340156555176} 11/07/2021 05:30:55 - INFO - __main__ - Step 59082: {'lr': 0.0003378671278448265, 'samples': 11343744, 'steps': 59081, 'loss/train': 1.6367872953414917} 11/07/2021 05:30:56 - INFO - __main__ - Step 59083: {'lr': 0.00033786215965254474, 'samples': 11343936, 'steps': 59082, 'loss/train': 0.9003011584281921} 11/07/2021 05:30:56 - INFO - __main__ - Step 59084: {'lr': 0.00033785719142067364, 'samples': 11344128, 'steps': 59083, 'loss/train': 1.2933342456817627} 11/07/2021 05:30:56 - INFO - __main__ - Step 59085: {'lr': 0.0003378522231492154, 'samples': 11344320, 'steps': 59084, 'loss/train': 1.5280778408050537} 11/07/2021 05:30:57 - INFO - __main__ - Step 59086: {'lr': 0.0003378472548381723, 'samples': 11344512, 'steps': 59085, 'loss/train': 1.450378656387329} 11/07/2021 05:30:58 - INFO - __main__ - Step 59087: {'lr': 0.0003378422864875466, 'samples': 11344704, 'steps': 59086, 'loss/train': 1.1537388563156128} 11/07/2021 05:30:58 - INFO - __main__ - Step 59088: {'lr': 0.0003378373180973405, 'samples': 11344896, 'steps': 59087, 'loss/train': 1.1885035037994385} 11/07/2021 05:30:58 - INFO - __main__ - Step 59089: {'lr': 0.0003378323496675563, 'samples': 11345088, 'steps': 59088, 'loss/train': 1.5443912744522095} 11/07/2021 05:30:59 - INFO - __main__ - Step 59090: {'lr': 0.0003378273811981961, 'samples': 11345280, 'steps': 59089, 'loss/train': 1.4305788278579712} 11/07/2021 05:30:59 - INFO - __main__ - Step 59091: {'lr': 0.00033782241268926237, 'samples': 11345472, 'steps': 59090, 'loss/train': 1.343381404876709} 11/07/2021 05:31:00 - INFO - __main__ - Step 59092: {'lr': 0.00033781744414075723, 'samples': 11345664, 'steps': 59091, 'loss/train': 1.6727418899536133} 11/07/2021 05:31:00 - INFO - __main__ - Step 59093: {'lr': 0.0003378124755526828, 'samples': 11345856, 'steps': 59092, 'loss/train': 1.842724323272705} 11/07/2021 05:31:01 - INFO - __main__ - Step 59094: {'lr': 0.0003378075069250414, 'samples': 11346048, 'steps': 59093, 'loss/train': 1.3655223846435547} 11/07/2021 05:31:01 - INFO - __main__ - Step 59095: {'lr': 0.00033780253825783533, 'samples': 11346240, 'steps': 59094, 'loss/train': 1.0376087427139282} 11/07/2021 05:31:02 - INFO - __main__ - Step 59096: {'lr': 0.0003377975695510668, 'samples': 11346432, 'steps': 59095, 'loss/train': 1.179636001586914} 11/07/2021 05:31:03 - INFO - __main__ - Step 59097: {'lr': 0.0003377926008047381, 'samples': 11346624, 'steps': 59096, 'loss/train': 1.4483702182769775} 11/07/2021 05:31:03 - INFO - __main__ - Step 59098: {'lr': 0.0003377876320188514, 'samples': 11346816, 'steps': 59097, 'loss/train': 1.4322985410690308} 11/07/2021 05:31:03 - INFO - __main__ - Step 59099: {'lr': 0.0003377826631934089, 'samples': 11347008, 'steps': 59098, 'loss/train': 1.5368103981018066} 11/07/2021 05:31:04 - INFO - __main__ - Step 59100: {'lr': 0.0003377776943284129, 'samples': 11347200, 'steps': 59099, 'loss/train': 1.2427552938461304} 11/07/2021 05:31:04 - INFO - __main__ - Step 59101: {'lr': 0.00033777272542386564, 'samples': 11347392, 'steps': 59100, 'loss/train': 0.9969795346260071} 11/07/2021 05:31:05 - INFO - __main__ - Step 59102: {'lr': 0.0003377677564797693, 'samples': 11347584, 'steps': 59101, 'loss/train': 1.987402319908142} 11/07/2021 05:31:05 - INFO - __main__ - Step 59103: {'lr': 0.00033776278749612617, 'samples': 11347776, 'steps': 59102, 'loss/train': 1.5692646503448486} 11/07/2021 05:31:06 - INFO - __main__ - Step 59104: {'lr': 0.00033775781847293846, 'samples': 11347968, 'steps': 59103, 'loss/train': 0.8707871437072754} 11/07/2021 05:31:06 - INFO - __main__ - Step 59105: {'lr': 0.00033775284941020854, 'samples': 11348160, 'steps': 59104, 'loss/train': 0.0916455090045929} 11/07/2021 05:31:06 - INFO - __main__ - Step 59106: {'lr': 0.0003377478803079385, 'samples': 11348352, 'steps': 59105, 'loss/train': 1.0275636911392212} 11/07/2021 05:31:07 - INFO - __main__ - Step 59107: {'lr': 0.00033774291116613054, 'samples': 11348544, 'steps': 59106, 'loss/train': 1.5135724544525146} 11/07/2021 05:31:08 - INFO - __main__ - Step 59108: {'lr': 0.000337737941984787, 'samples': 11348736, 'steps': 59107, 'loss/train': 1.7488468885421753} 11/07/2021 05:31:08 - INFO - __main__ - Step 59109: {'lr': 0.00033773297276391015, 'samples': 11348928, 'steps': 59108, 'loss/train': 1.8064600229263306} 11/07/2021 05:31:09 - INFO - __main__ - Step 59110: {'lr': 0.00033772800350350215, 'samples': 11349120, 'steps': 59109, 'loss/train': 1.6492990255355835} 11/07/2021 05:31:09 - INFO - __main__ - Step 59111: {'lr': 0.0003377230342035653, 'samples': 11349312, 'steps': 59110, 'loss/train': 1.475067138671875} 11/07/2021 05:31:09 - INFO - __main__ - Step 59112: {'lr': 0.00033771806486410176, 'samples': 11349504, 'steps': 59111, 'loss/train': 1.3603543043136597} 11/07/2021 05:31:10 - INFO - __main__ - Step 59113: {'lr': 0.0003377130954851138, 'samples': 11349696, 'steps': 59112, 'loss/train': 1.298151969909668} 11/07/2021 05:31:11 - INFO - __main__ - Step 59114: {'lr': 0.0003377081260666037, 'samples': 11349888, 'steps': 59113, 'loss/train': 2.65168833732605} 11/07/2021 05:31:11 - INFO - __main__ - Step 59115: {'lr': 0.00033770315660857367, 'samples': 11350080, 'steps': 59114, 'loss/train': 1.5206751823425293} 11/07/2021 05:31:11 - INFO - __main__ - Step 59116: {'lr': 0.00033769818711102594, 'samples': 11350272, 'steps': 59115, 'loss/train': 1.7334089279174805} 11/07/2021 05:31:12 - INFO - __main__ - Step 59117: {'lr': 0.0003376932175739628, 'samples': 11350464, 'steps': 59116, 'loss/train': 1.4664421081542969} 11/07/2021 05:31:13 - INFO - __main__ - Step 59118: {'lr': 0.00033768824799738646, 'samples': 11350656, 'steps': 59117, 'loss/train': 1.2359026670455933} 11/07/2021 05:31:13 - INFO - __main__ - Step 59119: {'lr': 0.0003376832783812991, 'samples': 11350848, 'steps': 59118, 'loss/train': 1.3090336322784424} 11/07/2021 05:31:13 - INFO - __main__ - Step 59120: {'lr': 0.000337678308725703, 'samples': 11351040, 'steps': 59119, 'loss/train': 1.4215831756591797} 11/07/2021 05:31:14 - INFO - __main__ - Step 59121: {'lr': 0.0003376733390306004, 'samples': 11351232, 'steps': 59120, 'loss/train': 1.9562761783599854} 11/07/2021 05:31:14 - INFO - __main__ - Step 59122: {'lr': 0.00033766836929599353, 'samples': 11351424, 'steps': 59121, 'loss/train': 1.4182853698730469} 11/07/2021 05:31:15 - INFO - __main__ - Step 59123: {'lr': 0.00033766339952188474, 'samples': 11351616, 'steps': 59122, 'loss/train': 0.6436710953712463} 11/07/2021 05:31:16 - INFO - __main__ - Step 59124: {'lr': 0.0003376584297082761, 'samples': 11351808, 'steps': 59123, 'loss/train': 1.6214978694915771} 11/07/2021 05:31:16 - INFO - __main__ - Step 59125: {'lr': 0.00033765345985517, 'samples': 11352000, 'steps': 59124, 'loss/train': 1.309025526046753} 11/07/2021 05:31:16 - INFO - __main__ - Step 59126: {'lr': 0.0003376484899625685, 'samples': 11352192, 'steps': 59125, 'loss/train': 1.4658406972885132} 11/07/2021 05:31:17 - INFO - __main__ - Step 59127: {'lr': 0.00033764352003047397, 'samples': 11352384, 'steps': 59126, 'loss/train': 1.7807698249816895} 11/07/2021 05:31:17 - INFO - __main__ - Step 59128: {'lr': 0.00033763855005888865, 'samples': 11352576, 'steps': 59127, 'loss/train': 1.494262456893921} 11/07/2021 05:31:18 - INFO - __main__ - Step 59129: {'lr': 0.00033763358004781474, 'samples': 11352768, 'steps': 59128, 'loss/train': 0.5428523421287537} 11/07/2021 05:31:18 - INFO - __main__ - Step 59130: {'lr': 0.00033762860999725456, 'samples': 11352960, 'steps': 59129, 'loss/train': 1.6502593755722046} 11/07/2021 05:31:19 - INFO - __main__ - Step 59131: {'lr': 0.0003376236399072101, 'samples': 11353152, 'steps': 59130, 'loss/train': 1.2911310195922852} 11/07/2021 05:31:19 - INFO - __main__ - Step 59132: {'lr': 0.000337618669777684, 'samples': 11353344, 'steps': 59131, 'loss/train': 1.6087331771850586} 11/07/2021 05:31:19 - INFO - __main__ - Step 59133: {'lr': 0.0003376136996086782, 'samples': 11353536, 'steps': 59132, 'loss/train': 1.5826588869094849} 11/07/2021 05:31:21 - INFO - __main__ - Step 59134: {'lr': 0.00033760872940019496, 'samples': 11353728, 'steps': 59133, 'loss/train': 1.4108322858810425} 11/07/2021 05:31:21 - INFO - __main__ - Step 59135: {'lr': 0.00033760375915223664, 'samples': 11353920, 'steps': 59134, 'loss/train': 1.2831298112869263} 11/07/2021 05:31:21 - INFO - __main__ - Step 59136: {'lr': 0.00033759878886480534, 'samples': 11354112, 'steps': 59135, 'loss/train': 1.5412404537200928} 11/07/2021 05:31:22 - INFO - __main__ - Step 59137: {'lr': 0.00033759381853790344, 'samples': 11354304, 'steps': 59136, 'loss/train': 1.5404393672943115} 11/07/2021 05:31:22 - INFO - __main__ - Step 59138: {'lr': 0.0003375888481715331, 'samples': 11354496, 'steps': 59137, 'loss/train': 0.9630423784255981} 11/07/2021 05:31:23 - INFO - __main__ - Step 59139: {'lr': 0.0003375838777656966, 'samples': 11354688, 'steps': 59138, 'loss/train': 1.5023905038833618} 11/07/2021 05:31:23 - INFO - __main__ - Step 59140: {'lr': 0.00033757890732039617, 'samples': 11354880, 'steps': 59139, 'loss/train': 1.1749709844589233} 11/07/2021 05:31:24 - INFO - __main__ - Step 59141: {'lr': 0.000337573936835634, 'samples': 11355072, 'steps': 59140, 'loss/train': 1.4763453006744385} 11/07/2021 05:31:24 - INFO - __main__ - Step 59142: {'lr': 0.0003375689663114123, 'samples': 11355264, 'steps': 59141, 'loss/train': 1.478682041168213} 11/07/2021 05:31:24 - INFO - __main__ - Step 59143: {'lr': 0.00033756399574773343, 'samples': 11355456, 'steps': 59142, 'loss/train': 1.1878386735916138} 11/07/2021 05:31:25 - INFO - __main__ - Step 59144: {'lr': 0.00033755902514459964, 'samples': 11355648, 'steps': 59143, 'loss/train': 1.773189663887024} 11/07/2021 05:31:26 - INFO - __main__ - Step 59145: {'lr': 0.0003375540545020131, 'samples': 11355840, 'steps': 59144, 'loss/train': 1.5469133853912354} 11/07/2021 05:31:26 - INFO - __main__ - Step 59146: {'lr': 0.00033754908381997595, 'samples': 11356032, 'steps': 59145, 'loss/train': 1.284981608390808} 11/07/2021 05:31:26 - INFO - __main__ - Step 59147: {'lr': 0.00033754411309849065, 'samples': 11356224, 'steps': 59146, 'loss/train': 1.224838137626648} 11/07/2021 05:31:27 - INFO - __main__ - Step 59148: {'lr': 0.0003375391423375592, 'samples': 11356416, 'steps': 59147, 'loss/train': 1.4976599216461182} 11/07/2021 05:31:28 - INFO - __main__ - Step 59149: {'lr': 0.00033753417153718405, 'samples': 11356608, 'steps': 59148, 'loss/train': 1.3317590951919556} 11/07/2021 05:31:28 - INFO - __main__ - Step 59150: {'lr': 0.0003375292006973673, 'samples': 11356800, 'steps': 59149, 'loss/train': 1.0247522592544556} 11/07/2021 05:31:29 - INFO - __main__ - Step 59151: {'lr': 0.0003375242298181113, 'samples': 11356992, 'steps': 59150, 'loss/train': 0.06391794979572296} 11/07/2021 05:31:29 - INFO - __main__ - Step 59152: {'lr': 0.0003375192588994183, 'samples': 11357184, 'steps': 59151, 'loss/train': 1.327622890472412} 11/07/2021 05:31:29 - INFO - __main__ - Step 59153: {'lr': 0.0003375142879412903, 'samples': 11357376, 'steps': 59152, 'loss/train': 1.6796025037765503} 11/07/2021 05:31:30 - INFO - __main__ - Step 59154: {'lr': 0.0003375093169437298, 'samples': 11357568, 'steps': 59153, 'loss/train': 1.3426998853683472} 11/07/2021 05:31:31 - INFO - __main__ - Step 59155: {'lr': 0.00033750434590673893, 'samples': 11357760, 'steps': 59154, 'loss/train': 1.3947049379348755} 11/07/2021 05:31:31 - INFO - __main__ - Step 59156: {'lr': 0.00033749937483031994, 'samples': 11357952, 'steps': 59155, 'loss/train': 1.1491570472717285} 11/07/2021 05:31:31 - INFO - __main__ - Step 59157: {'lr': 0.00033749440371447513, 'samples': 11358144, 'steps': 59156, 'loss/train': 1.46199631690979} 11/07/2021 05:31:32 - INFO - __main__ - Step 59158: {'lr': 0.00033748943255920667, 'samples': 11358336, 'steps': 59157, 'loss/train': 1.4713354110717773} 11/07/2021 05:31:33 - INFO - __main__ - Step 59159: {'lr': 0.00033748446136451683, 'samples': 11358528, 'steps': 59158, 'loss/train': 1.4283231496810913} 11/07/2021 05:31:33 - INFO - __main__ - Step 59160: {'lr': 0.00033747949013040784, 'samples': 11358720, 'steps': 59159, 'loss/train': 1.6834089756011963} 11/07/2021 05:31:34 - INFO - __main__ - Step 59161: {'lr': 0.000337474518856882, 'samples': 11358912, 'steps': 59160, 'loss/train': 1.418248176574707} 11/07/2021 05:31:34 - INFO - __main__ - Step 59162: {'lr': 0.0003374695475439413, 'samples': 11359104, 'steps': 59161, 'loss/train': 1.3154340982437134} 11/07/2021 05:31:34 - INFO - __main__ - Step 59163: {'lr': 0.0003374645761915883, 'samples': 11359296, 'steps': 59162, 'loss/train': 1.5452814102172852} 11/07/2021 05:31:35 - INFO - __main__ - Step 59164: {'lr': 0.00033745960479982515, 'samples': 11359488, 'steps': 59163, 'loss/train': 1.2473890781402588} 11/07/2021 05:31:36 - INFO - __main__ - Step 59165: {'lr': 0.00033745463336865407, 'samples': 11359680, 'steps': 59164, 'loss/train': 1.9435663223266602} 11/07/2021 05:31:36 - INFO - __main__ - Step 59166: {'lr': 0.0003374496618980772, 'samples': 11359872, 'steps': 59165, 'loss/train': 1.3976154327392578} 11/07/2021 05:31:36 - INFO - __main__ - Step 59167: {'lr': 0.0003374446903880969, 'samples': 11360064, 'steps': 59166, 'loss/train': 1.1069480180740356} 11/07/2021 05:31:37 - INFO - __main__ - Step 59168: {'lr': 0.0003374397188387153, 'samples': 11360256, 'steps': 59167, 'loss/train': 0.9928202629089355} 11/07/2021 05:31:38 - INFO - __main__ - Step 59169: {'lr': 0.0003374347472499348, 'samples': 11360448, 'steps': 59168, 'loss/train': 1.838614821434021} 11/07/2021 05:31:38 - INFO - __main__ - Step 59170: {'lr': 0.00033742977562175756, 'samples': 11360640, 'steps': 59169, 'loss/train': 1.390904426574707} 11/07/2021 05:31:39 - INFO - __main__ - Step 59171: {'lr': 0.00033742480395418574, 'samples': 11360832, 'steps': 59170, 'loss/train': 0.5628034472465515} 11/07/2021 05:31:39 - INFO - __main__ - Step 59172: {'lr': 0.0003374198322472217, 'samples': 11361024, 'steps': 59171, 'loss/train': 1.525457739830017} 11/07/2021 05:31:39 - INFO - __main__ - Step 59173: {'lr': 0.00033741486050086763, 'samples': 11361216, 'steps': 59172, 'loss/train': 1.7237218618392944} 11/07/2021 05:31:40 - INFO - __main__ - Step 59174: {'lr': 0.00033740988871512574, 'samples': 11361408, 'steps': 59173, 'loss/train': 1.40801203250885} 11/07/2021 05:31:41 - INFO - __main__ - Step 59175: {'lr': 0.0003374049168899983, 'samples': 11361600, 'steps': 59174, 'loss/train': 1.4984530210494995} 11/07/2021 05:31:41 - INFO - __main__ - Step 59176: {'lr': 0.00033739994502548766, 'samples': 11361792, 'steps': 59175, 'loss/train': 1.8369193077087402} 11/07/2021 05:31:41 - INFO - __main__ - Step 59177: {'lr': 0.0003373949731215958, 'samples': 11361984, 'steps': 59176, 'loss/train': 1.3676249980926514} 11/07/2021 05:31:42 - INFO - __main__ - Step 59178: {'lr': 0.0003373900011783252, 'samples': 11362176, 'steps': 59177, 'loss/train': 1.2269113063812256} 11/07/2021 05:31:43 - INFO - __main__ - Step 59179: {'lr': 0.000337385029195678, 'samples': 11362368, 'steps': 59178, 'loss/train': 1.5693867206573486} 11/07/2021 05:31:43 - INFO - __main__ - Step 59180: {'lr': 0.00033738005717365646, 'samples': 11362560, 'steps': 59179, 'loss/train': 1.3788961172103882} 11/07/2021 05:31:43 - INFO - __main__ - Step 59181: {'lr': 0.00033737508511226283, 'samples': 11362752, 'steps': 59180, 'loss/train': 1.3437258005142212} 11/07/2021 05:31:44 - INFO - __main__ - Step 59182: {'lr': 0.00033737011301149933, 'samples': 11362944, 'steps': 59181, 'loss/train': 1.7837194204330444} 11/07/2021 05:31:44 - INFO - __main__ - Step 59183: {'lr': 0.0003373651408713682, 'samples': 11363136, 'steps': 59182, 'loss/train': 1.8440576791763306} 11/07/2021 05:31:44 - INFO - __main__ - Step 59184: {'lr': 0.00033736016869187165, 'samples': 11363328, 'steps': 59183, 'loss/train': 1.4563645124435425} 11/07/2021 05:31:45 - INFO - __main__ - Step 59185: {'lr': 0.0003373551964730119, 'samples': 11363520, 'steps': 59184, 'loss/train': 1.5352476835250854} 11/07/2021 05:31:46 - INFO - __main__ - Step 59186: {'lr': 0.00033735022421479136, 'samples': 11363712, 'steps': 59185, 'loss/train': 1.0495479106903076} 11/07/2021 05:31:46 - INFO - __main__ - Step 59187: {'lr': 0.00033734525191721215, 'samples': 11363904, 'steps': 59186, 'loss/train': 1.6984713077545166} 11/07/2021 05:31:47 - INFO - __main__ - Step 59188: {'lr': 0.00033734027958027646, 'samples': 11364096, 'steps': 59187, 'loss/train': 1.4865436553955078} 11/07/2021 05:31:47 - INFO - __main__ - Step 59189: {'lr': 0.00033733530720398666, 'samples': 11364288, 'steps': 59188, 'loss/train': 1.7580646276474} 11/07/2021 05:31:48 - INFO - __main__ - Step 59190: {'lr': 0.00033733033478834483, 'samples': 11364480, 'steps': 59189, 'loss/train': 1.0829137563705444} 11/07/2021 05:31:48 - INFO - __main__ - Step 59191: {'lr': 0.00033732536233335334, 'samples': 11364672, 'steps': 59190, 'loss/train': 1.535908579826355} 11/07/2021 05:31:49 - INFO - __main__ - Step 59192: {'lr': 0.0003373203898390145, 'samples': 11364864, 'steps': 59191, 'loss/train': 1.6308199167251587} 11/07/2021 05:31:49 - INFO - __main__ - Step 59193: {'lr': 0.0003373154173053303, 'samples': 11365056, 'steps': 59192, 'loss/train': 1.6149977445602417} 11/07/2021 05:31:49 - INFO - __main__ - Step 59194: {'lr': 0.0003373104447323031, 'samples': 11365248, 'steps': 59193, 'loss/train': 1.3431729078292847} 11/07/2021 05:31:50 - INFO - __main__ - Step 59195: {'lr': 0.00033730547211993525, 'samples': 11365440, 'steps': 59194, 'loss/train': 1.1915044784545898} 11/07/2021 05:31:51 - INFO - __main__ - Step 59196: {'lr': 0.00033730049946822883, 'samples': 11365632, 'steps': 59195, 'loss/train': 1.6732431650161743} 11/07/2021 05:31:51 - INFO - __main__ - Step 59197: {'lr': 0.0003372955267771862, 'samples': 11365824, 'steps': 59196, 'loss/train': 1.5167385339736938} 11/07/2021 05:31:51 - INFO - __main__ - Step 59198: {'lr': 0.00033729055404680953, 'samples': 11366016, 'steps': 59197, 'loss/train': 1.1936794519424438} 11/07/2021 05:31:52 - INFO - __main__ - Step 59199: {'lr': 0.00033728558127710115, 'samples': 11366208, 'steps': 59198, 'loss/train': 1.2849947214126587} 11/07/2021 05:31:53 - INFO - __main__ - Step 59200: {'lr': 0.0003372806084680632, 'samples': 11366400, 'steps': 59199, 'loss/train': 1.5322794914245605} 11/07/2021 05:31:53 - INFO - __main__ - Step 59201: {'lr': 0.0003372756356196979, 'samples': 11366592, 'steps': 59200, 'loss/train': 1.5394585132598877} 11/07/2021 05:31:54 - INFO - __main__ - Step 59202: {'lr': 0.0003372706627320076, 'samples': 11366784, 'steps': 59201, 'loss/train': 1.2204450368881226} 11/07/2021 05:31:54 - INFO - __main__ - Step 59203: {'lr': 0.0003372656898049944, 'samples': 11366976, 'steps': 59202, 'loss/train': 1.5385537147521973} 11/07/2021 05:31:54 - INFO - __main__ - Step 59204: {'lr': 0.0003372607168386607, 'samples': 11367168, 'steps': 59203, 'loss/train': 1.9647362232208252} 11/07/2021 05:31:55 - INFO - __main__ - Step 59205: {'lr': 0.00033725574383300865, 'samples': 11367360, 'steps': 59204, 'loss/train': 1.1615335941314697} 11/07/2021 05:31:56 - INFO - __main__ - Step 59206: {'lr': 0.0003372507707880406, 'samples': 11367552, 'steps': 59205, 'loss/train': 1.3706697225570679} 11/07/2021 05:31:56 - INFO - __main__ - Step 59207: {'lr': 0.0003372457977037586, 'samples': 11367744, 'steps': 59206, 'loss/train': 1.6664659976959229} 11/07/2021 05:31:56 - INFO - __main__ - Step 59208: {'lr': 0.000337240824580165, 'samples': 11367936, 'steps': 59207, 'loss/train': 1.6000585556030273} 11/07/2021 05:31:57 - INFO - __main__ - Step 59209: {'lr': 0.00033723585141726196, 'samples': 11368128, 'steps': 59208, 'loss/train': 1.851723074913025} 11/07/2021 05:31:58 - INFO - __main__ - Step 59210: {'lr': 0.0003372308782150519, 'samples': 11368320, 'steps': 59209, 'loss/train': 0.06964382529258728} 11/07/2021 05:31:58 - INFO - __main__ - Step 59211: {'lr': 0.0003372259049735369, 'samples': 11368512, 'steps': 59210, 'loss/train': 1.5304545164108276} 11/07/2021 05:31:59 - INFO - __main__ - Step 59212: {'lr': 0.00033722093169271934, 'samples': 11368704, 'steps': 59211, 'loss/train': 1.4610363245010376} 11/07/2021 05:31:59 - INFO - __main__ - Step 59213: {'lr': 0.00033721595837260125, 'samples': 11368896, 'steps': 59212, 'loss/train': 1.2715320587158203} 11/07/2021 05:31:59 - INFO - __main__ - Step 59214: {'lr': 0.00033721098501318506, 'samples': 11369088, 'steps': 59213, 'loss/train': 1.2266206741333008} 11/07/2021 05:32:00 - INFO - __main__ - Step 59215: {'lr': 0.00033720601161447294, 'samples': 11369280, 'steps': 59214, 'loss/train': 1.958136796951294} 11/07/2021 05:32:01 - INFO - __main__ - Step 59216: {'lr': 0.0003372010381764671, 'samples': 11369472, 'steps': 59215, 'loss/train': 1.6986103057861328} 11/07/2021 05:32:01 - INFO - __main__ - Step 59217: {'lr': 0.00033719606469916985, 'samples': 11369664, 'steps': 59216, 'loss/train': 1.4668993949890137} 11/07/2021 05:32:01 - INFO - __main__ - Step 59218: {'lr': 0.0003371910911825834, 'samples': 11369856, 'steps': 59217, 'loss/train': 2.378601312637329} 11/07/2021 05:32:02 - INFO - __main__ - Step 59219: {'lr': 0.00033718611762671003, 'samples': 11370048, 'steps': 59218, 'loss/train': 1.9074461460113525} 11/07/2021 05:32:02 - INFO - __main__ - Step 59220: {'lr': 0.0003371811440315519, 'samples': 11370240, 'steps': 59219, 'loss/train': 1.6932467222213745} 11/07/2021 05:32:03 - INFO - __main__ - Step 59221: {'lr': 0.0003371761703971113, 'samples': 11370432, 'steps': 59220, 'loss/train': 1.7494654655456543} 11/07/2021 05:32:03 - INFO - __main__ - Step 59222: {'lr': 0.0003371711967233905, 'samples': 11370624, 'steps': 59221, 'loss/train': 1.582100749015808} 11/07/2021 05:32:04 - INFO - __main__ - Step 59223: {'lr': 0.00033716622301039164, 'samples': 11370816, 'steps': 59222, 'loss/train': 1.5961817502975464} 11/07/2021 05:32:04 - INFO - __main__ - Step 59224: {'lr': 0.000337161249258117, 'samples': 11371008, 'steps': 59223, 'loss/train': 0.8752351403236389} 11/07/2021 05:32:05 - INFO - __main__ - Step 59225: {'lr': 0.0003371562754665689, 'samples': 11371200, 'steps': 59224, 'loss/train': 1.7890576124191284} 11/07/2021 05:32:05 - INFO - __main__ - Step 59226: {'lr': 0.0003371513016357496, 'samples': 11371392, 'steps': 59225, 'loss/train': 1.5614650249481201} 11/07/2021 05:32:06 - INFO - __main__ - Step 59227: {'lr': 0.0003371463277656611, 'samples': 11371584, 'steps': 59226, 'loss/train': 1.256270170211792} 11/07/2021 05:32:06 - INFO - __main__ - Step 59228: {'lr': 0.00033714135385630597, 'samples': 11371776, 'steps': 59227, 'loss/train': 1.6196098327636719} 11/07/2021 05:32:07 - INFO - __main__ - Step 59229: {'lr': 0.0003371363799076862, 'samples': 11371968, 'steps': 59228, 'loss/train': 1.5933330059051514} 11/07/2021 05:32:07 - INFO - __main__ - Step 59230: {'lr': 0.00033713140591980407, 'samples': 11372160, 'steps': 59229, 'loss/train': 1.7267251014709473} 11/07/2021 05:32:07 - INFO - __main__ - Step 59231: {'lr': 0.00033712643189266197, 'samples': 11372352, 'steps': 59230, 'loss/train': 1.4047694206237793} 11/07/2021 05:32:08 - INFO - __main__ - Step 59232: {'lr': 0.00033712145782626205, 'samples': 11372544, 'steps': 59231, 'loss/train': 1.4500529766082764} 11/07/2021 05:32:09 - INFO - __main__ - Step 59233: {'lr': 0.0003371164837206065, 'samples': 11372736, 'steps': 59232, 'loss/train': 1.6374399662017822} 11/07/2021 05:32:09 - INFO - __main__ - Step 59234: {'lr': 0.00033711150957569763, 'samples': 11372928, 'steps': 59233, 'loss/train': 1.4036811590194702} 11/07/2021 05:32:09 - INFO - __main__ - Step 59235: {'lr': 0.00033710653539153763, 'samples': 11373120, 'steps': 59234, 'loss/train': 1.540671467781067} 11/07/2021 05:32:10 - INFO - __main__ - Step 59236: {'lr': 0.0003371015611681288, 'samples': 11373312, 'steps': 59235, 'loss/train': 1.4164243936538696} 11/07/2021 05:32:11 - INFO - __main__ - Step 59237: {'lr': 0.0003370965869054733, 'samples': 11373504, 'steps': 59236, 'loss/train': 0.0903257355093956} 11/07/2021 05:32:11 - INFO - __main__ - Step 59238: {'lr': 0.0003370916126035735, 'samples': 11373696, 'steps': 59237, 'loss/train': 1.7022573947906494} 11/07/2021 05:32:11 - INFO - __main__ - Step 59239: {'lr': 0.0003370866382624315, 'samples': 11373888, 'steps': 59238, 'loss/train': 1.0951852798461914} 11/07/2021 05:32:12 - INFO - __main__ - Step 59240: {'lr': 0.00033708166388204963, 'samples': 11374080, 'steps': 59239, 'loss/train': 1.3774245977401733} 11/07/2021 05:32:12 - INFO - __main__ - Step 59241: {'lr': 0.0003370766894624301, 'samples': 11374272, 'steps': 59240, 'loss/train': 1.5817006826400757} 11/07/2021 05:32:13 - INFO - __main__ - Step 59242: {'lr': 0.00033707171500357516, 'samples': 11374464, 'steps': 59241, 'loss/train': 1.5087922811508179} 11/07/2021 05:32:14 - INFO - __main__ - Step 59243: {'lr': 0.000337066740505487, 'samples': 11374656, 'steps': 59242, 'loss/train': 2.273547410964966} 11/07/2021 05:32:14 - INFO - __main__ - Step 59244: {'lr': 0.00033706176596816795, 'samples': 11374848, 'steps': 59243, 'loss/train': 1.559385895729065} 11/07/2021 05:32:15 - INFO - __main__ - Step 59245: {'lr': 0.0003370567913916203, 'samples': 11375040, 'steps': 59244, 'loss/train': 1.7910820245742798} 11/07/2021 05:32:15 - INFO - __main__ - Step 59246: {'lr': 0.0003370518167758461, 'samples': 11375232, 'steps': 59245, 'loss/train': 1.7979004383087158} 11/07/2021 05:32:15 - INFO - __main__ - Step 59247: {'lr': 0.00033704684212084774, 'samples': 11375424, 'steps': 59246, 'loss/train': 1.5317434072494507} 11/07/2021 05:32:16 - INFO - __main__ - Step 59248: {'lr': 0.0003370418674266273, 'samples': 11375616, 'steps': 59247, 'loss/train': 1.1110860109329224} 11/07/2021 05:32:17 - INFO - __main__ - Step 59249: {'lr': 0.00033703689269318725, 'samples': 11375808, 'steps': 59248, 'loss/train': 1.7913658618927002} 11/07/2021 05:32:17 - INFO - __main__ - Step 59250: {'lr': 0.00033703191792052974, 'samples': 11376000, 'steps': 59249, 'loss/train': 1.4231507778167725} 11/07/2021 05:32:17 - INFO - __main__ - Step 59251: {'lr': 0.00033702694310865696, 'samples': 11376192, 'steps': 59250, 'loss/train': 1.6132559776306152} 11/07/2021 05:32:18 - INFO - __main__ - Step 59252: {'lr': 0.00033702196825757114, 'samples': 11376384, 'steps': 59251, 'loss/train': 1.4366536140441895} 11/07/2021 05:32:19 - INFO - __main__ - Step 59253: {'lr': 0.00033701699336727465, 'samples': 11376576, 'steps': 59252, 'loss/train': 1.3314954042434692} 11/07/2021 05:32:19 - INFO - __main__ - Step 59254: {'lr': 0.00033701201843776957, 'samples': 11376768, 'steps': 59253, 'loss/train': 1.6341077089309692} 11/07/2021 05:32:19 - INFO - __main__ - Step 59255: {'lr': 0.0003370070434690583, 'samples': 11376960, 'steps': 59254, 'loss/train': 1.0928893089294434} 11/07/2021 05:32:20 - INFO - __main__ - Step 59256: {'lr': 0.0003370020684611429, 'samples': 11377152, 'steps': 59255, 'loss/train': 1.1169596910476685} 11/07/2021 05:32:20 - INFO - __main__ - Step 59257: {'lr': 0.0003369970934140257, 'samples': 11377344, 'steps': 59256, 'loss/train': 1.2681905031204224} 11/07/2021 05:32:22 - INFO - __main__ - Step 59258: {'lr': 0.00033699211832770906, 'samples': 11377536, 'steps': 59257, 'loss/train': 1.5688832998275757} 11/07/2021 05:32:22 - INFO - __main__ - Step 59259: {'lr': 0.000336987143202195, 'samples': 11377728, 'steps': 59258, 'loss/train': 1.8637508153915405} 11/07/2021 05:32:22 - INFO - __main__ - Step 59260: {'lr': 0.000336982168037486, 'samples': 11377920, 'steps': 59259, 'loss/train': 1.576521635055542} 11/07/2021 05:32:23 - INFO - __main__ - Step 59261: {'lr': 0.0003369771928335841, 'samples': 11378112, 'steps': 59260, 'loss/train': 1.802741527557373} 11/07/2021 05:32:23 - INFO - __main__ - Step 59262: {'lr': 0.00033697221759049163, 'samples': 11378304, 'steps': 59261, 'loss/train': 1.3999284505844116} 11/07/2021 05:32:23 - INFO - __main__ - Step 59263: {'lr': 0.0003369672423082108, 'samples': 11378496, 'steps': 59262, 'loss/train': 1.166931390762329} 11/07/2021 05:32:24 - INFO - __main__ - Step 59264: {'lr': 0.00033696226698674386, 'samples': 11378688, 'steps': 59263, 'loss/train': 3.1869373321533203} 11/07/2021 05:32:25 - INFO - __main__ - Step 59265: {'lr': 0.0003369572916260931, 'samples': 11378880, 'steps': 59264, 'loss/train': 0.38798093795776367} 11/07/2021 05:32:25 - INFO - __main__ - Step 59266: {'lr': 0.0003369523162262608, 'samples': 11379072, 'steps': 59265, 'loss/train': 1.6349838972091675} 11/07/2021 05:32:26 - INFO - __main__ - Step 59267: {'lr': 0.00033694734078724904, 'samples': 11379264, 'steps': 59266, 'loss/train': 1.0898051261901855} 11/07/2021 05:32:26 - INFO - __main__ - Step 59268: {'lr': 0.00033694236530906014, 'samples': 11379456, 'steps': 59267, 'loss/train': 1.725724458694458} 11/07/2021 05:32:26 - INFO - __main__ - Step 59269: {'lr': 0.00033693738979169636, 'samples': 11379648, 'steps': 59268, 'loss/train': 1.2573586702346802} 11/07/2021 05:32:28 - INFO - __main__ - Step 59270: {'lr': 0.0003369324142351599, 'samples': 11379840, 'steps': 59269, 'loss/train': 0.08163659274578094} 11/07/2021 05:32:28 - INFO - __main__ - Step 59271: {'lr': 0.0003369274386394531, 'samples': 11380032, 'steps': 59270, 'loss/train': 1.5909442901611328} 11/07/2021 05:32:28 - INFO - __main__ - Step 59272: {'lr': 0.0003369224630045781, 'samples': 11380224, 'steps': 59271, 'loss/train': 1.4559556245803833} 11/07/2021 05:32:29 - INFO - __main__ - Step 59273: {'lr': 0.0003369174873305373, 'samples': 11380416, 'steps': 59272, 'loss/train': 1.5236685276031494} 11/07/2021 05:32:29 - INFO - __main__ - Step 59274: {'lr': 0.0003369125116173327, 'samples': 11380608, 'steps': 59273, 'loss/train': 1.7690908908843994} 11/07/2021 05:32:30 - INFO - __main__ - Step 59275: {'lr': 0.00033690753586496666, 'samples': 11380800, 'steps': 59274, 'loss/train': 1.5433930158615112} 11/07/2021 05:32:30 - INFO - __main__ - Step 59276: {'lr': 0.00033690256007344144, 'samples': 11380992, 'steps': 59275, 'loss/train': 1.5260899066925049} 11/07/2021 05:32:31 - INFO - __main__ - Step 59277: {'lr': 0.0003368975842427592, 'samples': 11381184, 'steps': 59276, 'loss/train': 1.5196844339370728} 11/07/2021 05:32:31 - INFO - __main__ - Step 59278: {'lr': 0.00033689260837292234, 'samples': 11381376, 'steps': 59277, 'loss/train': 1.7671854496002197} 11/07/2021 05:32:31 - INFO - __main__ - Step 59279: {'lr': 0.000336887632463933, 'samples': 11381568, 'steps': 59278, 'loss/train': 2.06708025932312} 11/07/2021 05:32:33 - INFO - __main__ - Step 59280: {'lr': 0.00033688265651579354, 'samples': 11381760, 'steps': 59279, 'loss/train': 1.5717920064926147} 11/07/2021 05:32:33 - INFO - __main__ - Step 59281: {'lr': 0.0003368776805285059, 'samples': 11381952, 'steps': 59280, 'loss/train': 1.348695158958435} 11/07/2021 05:32:33 - INFO - __main__ - Step 59282: {'lr': 0.0003368727045020726, 'samples': 11382144, 'steps': 59281, 'loss/train': 1.636444091796875} 11/07/2021 05:32:34 - INFO - __main__ - Step 59283: {'lr': 0.00033686772843649583, 'samples': 11382336, 'steps': 59282, 'loss/train': 1.4910632371902466} 11/07/2021 05:32:34 - INFO - __main__ - Step 59284: {'lr': 0.00033686275233177777, 'samples': 11382528, 'steps': 59283, 'loss/train': 1.386777639389038} 11/07/2021 05:32:34 - INFO - __main__ - Step 59285: {'lr': 0.00033685777618792066, 'samples': 11382720, 'steps': 59284, 'loss/train': 1.2180293798446655} 11/07/2021 05:32:36 - INFO - __main__ - Step 59286: {'lr': 0.0003368528000049269, 'samples': 11382912, 'steps': 59285, 'loss/train': 0.7874096632003784} 11/07/2021 05:32:36 - INFO - __main__ - Step 59287: {'lr': 0.00033684782378279847, 'samples': 11383104, 'steps': 59286, 'loss/train': 1.564002275466919} 11/07/2021 05:32:36 - INFO - __main__ - Step 59288: {'lr': 0.0003368428475215378, 'samples': 11383296, 'steps': 59287, 'loss/train': 1.4068653583526611} 11/07/2021 05:32:37 - INFO - __main__ - Step 59289: {'lr': 0.00033683787122114713, 'samples': 11383488, 'steps': 59288, 'loss/train': 1.694163203239441} 11/07/2021 05:32:37 - INFO - __main__ - Step 59290: {'lr': 0.0003368328948816286, 'samples': 11383680, 'steps': 59289, 'loss/train': 1.6267966032028198} 11/07/2021 05:32:38 - INFO - __main__ - Step 59291: {'lr': 0.0003368279185029845, 'samples': 11383872, 'steps': 59290, 'loss/train': 1.035529375076294} 11/07/2021 05:32:38 - INFO - __main__ - Step 59292: {'lr': 0.0003368229420852171, 'samples': 11384064, 'steps': 59291, 'loss/train': 1.2174526453018188} 11/07/2021 05:32:39 - INFO - __main__ - Step 59293: {'lr': 0.00033681796562832865, 'samples': 11384256, 'steps': 59292, 'loss/train': 1.5012623071670532} 11/07/2021 05:32:39 - INFO - __main__ - Step 59294: {'lr': 0.0003368129891323213, 'samples': 11384448, 'steps': 59293, 'loss/train': 0.7952998876571655} 11/07/2021 05:32:39 - INFO - __main__ - Step 59295: {'lr': 0.0003368080125971974, 'samples': 11384640, 'steps': 59294, 'loss/train': 1.5874347686767578} 11/07/2021 05:32:41 - INFO - __main__ - Step 59296: {'lr': 0.00033680303602295913, 'samples': 11384832, 'steps': 59295, 'loss/train': 1.1530067920684814} 11/07/2021 05:32:41 - INFO - __main__ - Step 59297: {'lr': 0.00033679805940960877, 'samples': 11385024, 'steps': 59296, 'loss/train': 1.5158717632293701} 11/07/2021 05:32:41 - INFO - __main__ - Step 59298: {'lr': 0.0003367930827571485, 'samples': 11385216, 'steps': 59297, 'loss/train': 1.4791208505630493} 11/07/2021 05:32:42 - INFO - __main__ - Step 59299: {'lr': 0.00033678810606558077, 'samples': 11385408, 'steps': 59298, 'loss/train': 1.3377068042755127} 11/07/2021 05:32:42 - INFO - __main__ - Step 59300: {'lr': 0.00033678312933490753, 'samples': 11385600, 'steps': 59299, 'loss/train': 1.330207347869873} 11/07/2021 05:32:43 - INFO - __main__ - Step 59301: {'lr': 0.00033677815256513114, 'samples': 11385792, 'steps': 59300, 'loss/train': 1.4737035036087036} 11/07/2021 05:32:43 - INFO - __main__ - Step 59302: {'lr': 0.0003367731757562538, 'samples': 11385984, 'steps': 59301, 'loss/train': 1.0610145330429077} 11/07/2021 05:32:44 - INFO - __main__ - Step 59303: {'lr': 0.0003367681989082779, 'samples': 11386176, 'steps': 59302, 'loss/train': 1.8311078548431396} 11/07/2021 05:32:44 - INFO - __main__ - Step 59304: {'lr': 0.0003367632220212056, 'samples': 11386368, 'steps': 59303, 'loss/train': 1.2422478199005127} 11/07/2021 05:32:44 - INFO - __main__ - Step 59305: {'lr': 0.0003367582450950391, 'samples': 11386560, 'steps': 59304, 'loss/train': 1.722801685333252} 11/07/2021 05:32:45 - INFO - __main__ - Step 59306: {'lr': 0.0003367532681297807, 'samples': 11386752, 'steps': 59305, 'loss/train': 1.5947935581207275} 11/07/2021 05:32:46 - INFO - __main__ - Step 59307: {'lr': 0.0003367482911254325, 'samples': 11386944, 'steps': 59306, 'loss/train': 1.0572227239608765} 11/07/2021 05:32:46 - INFO - __main__ - Step 59308: {'lr': 0.000336743314081997, 'samples': 11387136, 'steps': 59307, 'loss/train': 1.228071689605713} 11/07/2021 05:32:46 - INFO - __main__ - Step 59309: {'lr': 0.0003367383369994762, 'samples': 11387328, 'steps': 59308, 'loss/train': 1.5417029857635498} 11/07/2021 05:32:47 - INFO - __main__ - Step 59310: {'lr': 0.0003367333598778725, 'samples': 11387520, 'steps': 59309, 'loss/train': 1.4989715814590454} 11/07/2021 05:32:48 - INFO - __main__ - Step 59311: {'lr': 0.0003367283827171881, 'samples': 11387712, 'steps': 59310, 'loss/train': 1.4128915071487427} 11/07/2021 05:32:48 - INFO - __main__ - Step 59312: {'lr': 0.0003367234055174252, 'samples': 11387904, 'steps': 59311, 'loss/train': 0.8536984920501709} 11/07/2021 05:32:49 - INFO - __main__ - Step 59313: {'lr': 0.00033671842827858605, 'samples': 11388096, 'steps': 59312, 'loss/train': 1.2938698530197144} 11/07/2021 05:32:49 - INFO - __main__ - Step 59314: {'lr': 0.000336713451000673, 'samples': 11388288, 'steps': 59313, 'loss/train': 0.43849876523017883} 11/07/2021 05:32:49 - INFO - __main__ - Step 59315: {'lr': 0.00033670847368368805, 'samples': 11388480, 'steps': 59314, 'loss/train': 1.4717206954956055} 11/07/2021 05:32:50 - INFO - __main__ - Step 59316: {'lr': 0.00033670349632763377, 'samples': 11388672, 'steps': 59315, 'loss/train': 1.661509394645691} 11/07/2021 05:32:51 - INFO - __main__ - Step 59317: {'lr': 0.0003366985189325121, 'samples': 11388864, 'steps': 59316, 'loss/train': 2.1882638931274414} 11/07/2021 05:32:51 - INFO - __main__ - Step 59318: {'lr': 0.00033669354149832556, 'samples': 11389056, 'steps': 59317, 'loss/train': 1.328861951828003} 11/07/2021 05:32:52 - INFO - __main__ - Step 59319: {'lr': 0.0003366885640250761, 'samples': 11389248, 'steps': 59318, 'loss/train': 1.3705939054489136} 11/07/2021 05:32:52 - INFO - __main__ - Step 59320: {'lr': 0.00033668358651276614, 'samples': 11389440, 'steps': 59319, 'loss/train': 1.318787693977356} 11/07/2021 05:32:52 - INFO - __main__ - Step 59321: {'lr': 0.000336678608961398, 'samples': 11389632, 'steps': 59320, 'loss/train': 0.34411898255348206} 11/07/2021 05:32:53 - INFO - __main__ - Step 59322: {'lr': 0.00033667363137097374, 'samples': 11389824, 'steps': 59321, 'loss/train': 1.7547041177749634} 11/07/2021 05:32:54 - INFO - __main__ - Step 59323: {'lr': 0.0003366686537414957, 'samples': 11390016, 'steps': 59322, 'loss/train': 1.3817046880722046} 11/07/2021 05:32:54 - INFO - __main__ - Step 59324: {'lr': 0.00033666367607296607, 'samples': 11390208, 'steps': 59323, 'loss/train': 1.3879252672195435} 11/07/2021 05:32:54 - INFO - __main__ - Step 59325: {'lr': 0.0003366586983653871, 'samples': 11390400, 'steps': 59324, 'loss/train': 1.6638904809951782} 11/07/2021 05:32:55 - INFO - __main__ - Step 59326: {'lr': 0.0003366537206187611, 'samples': 11390592, 'steps': 59325, 'loss/train': 2.0016846656799316} 11/07/2021 05:32:56 - INFO - __main__ - Step 59327: {'lr': 0.0003366487428330903, 'samples': 11390784, 'steps': 59326, 'loss/train': 1.3757736682891846} 11/07/2021 05:32:56 - INFO - __main__ - Step 59328: {'lr': 0.0003366437650083768, 'samples': 11390976, 'steps': 59327, 'loss/train': 1.6433217525482178} 11/07/2021 05:32:57 - INFO - __main__ - Step 59329: {'lr': 0.0003366387871446231, 'samples': 11391168, 'steps': 59328, 'loss/train': 1.3558849096298218} 11/07/2021 05:32:57 - INFO - __main__ - Step 59330: {'lr': 0.00033663380924183123, 'samples': 11391360, 'steps': 59329, 'loss/train': 1.678908109664917} 11/07/2021 05:32:57 - INFO - __main__ - Step 59331: {'lr': 0.0003366288313000035, 'samples': 11391552, 'steps': 59330, 'loss/train': 0.7552284598350525} 11/07/2021 05:32:58 - INFO - __main__ - Step 59332: {'lr': 0.00033662385331914216, 'samples': 11391744, 'steps': 59331, 'loss/train': 0.9863568544387817} 11/07/2021 05:32:59 - INFO - __main__ - Step 59333: {'lr': 0.0003366188752992495, 'samples': 11391936, 'steps': 59332, 'loss/train': 1.226762294769287} 11/07/2021 05:32:59 - INFO - __main__ - Step 59334: {'lr': 0.00033661389724032765, 'samples': 11392128, 'steps': 59333, 'loss/train': 1.5386178493499756} 11/07/2021 05:32:59 - INFO - __main__ - Step 59335: {'lr': 0.0003366089191423789, 'samples': 11392320, 'steps': 59334, 'loss/train': 1.459900140762329} 11/07/2021 05:33:00 - INFO - __main__ - Step 59336: {'lr': 0.00033660394100540553, 'samples': 11392512, 'steps': 59335, 'loss/train': 1.12985360622406} 11/07/2021 05:33:00 - INFO - __main__ - Step 59337: {'lr': 0.00033659896282940975, 'samples': 11392704, 'steps': 59336, 'loss/train': 1.3907475471496582} 11/07/2021 05:33:01 - INFO - __main__ - Step 59338: {'lr': 0.0003365939846143938, 'samples': 11392896, 'steps': 59337, 'loss/train': 1.3147752285003662} 11/07/2021 05:33:02 - INFO - __main__ - Step 59339: {'lr': 0.00033658900636036, 'samples': 11393088, 'steps': 59338, 'loss/train': 1.8776735067367554} 11/07/2021 05:33:02 - INFO - __main__ - Step 59340: {'lr': 0.00033658402806731054, 'samples': 11393280, 'steps': 59339, 'loss/train': 1.6488615274429321} 11/07/2021 05:33:02 - INFO - __main__ - Step 59341: {'lr': 0.00033657904973524754, 'samples': 11393472, 'steps': 59340, 'loss/train': 1.708038330078125} 11/07/2021 05:33:03 - INFO - __main__ - Step 59342: {'lr': 0.00033657407136417343, 'samples': 11393664, 'steps': 59341, 'loss/train': 1.3855866193771362} 11/07/2021 05:33:04 - INFO - __main__ - Step 59343: {'lr': 0.0003365690929540904, 'samples': 11393856, 'steps': 59342, 'loss/train': 1.318496823310852} 11/07/2021 05:33:04 - INFO - __main__ - Step 59344: {'lr': 0.0003365641145050006, 'samples': 11394048, 'steps': 59343, 'loss/train': 1.5625256299972534} 11/07/2021 05:33:04 - INFO - __main__ - Step 59345: {'lr': 0.0003365591360169064, 'samples': 11394240, 'steps': 59344, 'loss/train': 1.9332079887390137} 11/07/2021 05:33:05 - INFO - __main__ - Step 59346: {'lr': 0.00033655415748981, 'samples': 11394432, 'steps': 59345, 'loss/train': 1.0971368551254272} 11/07/2021 05:33:05 - INFO - __main__ - Step 59347: {'lr': 0.00033654917892371363, 'samples': 11394624, 'steps': 59346, 'loss/train': 1.528804898262024} 11/07/2021 05:33:06 - INFO - __main__ - Step 59348: {'lr': 0.00033654420031861953, 'samples': 11394816, 'steps': 59347, 'loss/train': 1.6762912273406982} 11/07/2021 05:33:06 - INFO - __main__ - Step 59349: {'lr': 0.0003365392216745299, 'samples': 11395008, 'steps': 59348, 'loss/train': 1.4819291830062866} 11/07/2021 05:33:07 - INFO - __main__ - Step 59350: {'lr': 0.0003365342429914471, 'samples': 11395200, 'steps': 59349, 'loss/train': 0.7258336544036865} 11/07/2021 05:33:07 - INFO - __main__ - Step 59351: {'lr': 0.0003365292642693733, 'samples': 11395392, 'steps': 59350, 'loss/train': 1.3492298126220703} 11/07/2021 05:33:07 - INFO - __main__ - Step 59352: {'lr': 0.0003365242855083107, 'samples': 11395584, 'steps': 59351, 'loss/train': 1.3703134059906006} 11/07/2021 05:33:09 - INFO - __main__ - Step 59353: {'lr': 0.00033651930670826157, 'samples': 11395776, 'steps': 59352, 'loss/train': 1.5654878616333008} 11/07/2021 05:33:09 - INFO - __main__ - Step 59354: {'lr': 0.0003365143278692283, 'samples': 11395968, 'steps': 59353, 'loss/train': 1.1283808946609497} 11/07/2021 05:33:09 - INFO - __main__ - Step 59355: {'lr': 0.0003365093489912129, 'samples': 11396160, 'steps': 59354, 'loss/train': 1.5003823041915894} 11/07/2021 05:33:10 - INFO - __main__ - Step 59356: {'lr': 0.00033650437007421775, 'samples': 11396352, 'steps': 59355, 'loss/train': 0.9820141196250916} 11/07/2021 05:33:10 - INFO - __main__ - Step 59357: {'lr': 0.0003364993911182451, 'samples': 11396544, 'steps': 59356, 'loss/train': 1.8636622428894043} 11/07/2021 05:33:11 - INFO - __main__ - Step 59358: {'lr': 0.0003364944121232971, 'samples': 11396736, 'steps': 59357, 'loss/train': 1.063173532485962} 11/07/2021 05:33:11 - INFO - __main__ - Step 59359: {'lr': 0.0003364894330893761, 'samples': 11396928, 'steps': 59358, 'loss/train': 1.4838443994522095} 11/07/2021 05:33:12 - INFO - __main__ - Step 59360: {'lr': 0.0003364844540164843, 'samples': 11397120, 'steps': 59359, 'loss/train': 1.0383440256118774} 11/07/2021 05:33:12 - INFO - __main__ - Step 59361: {'lr': 0.00033647947490462386, 'samples': 11397312, 'steps': 59360, 'loss/train': 1.0313713550567627} 11/07/2021 05:33:12 - INFO - __main__ - Step 59362: {'lr': 0.0003364744957537972, 'samples': 11397504, 'steps': 59361, 'loss/train': 0.9415063858032227} 11/07/2021 05:33:13 - INFO - __main__ - Step 59363: {'lr': 0.00033646951656400635, 'samples': 11397696, 'steps': 59362, 'loss/train': 1.3204634189605713} 11/07/2021 05:33:14 - INFO - __main__ - Step 59364: {'lr': 0.0003364645373352538, 'samples': 11397888, 'steps': 59363, 'loss/train': 1.4480024576187134} 11/07/2021 05:33:15 - INFO - __main__ - Step 59365: {'lr': 0.00033645955806754156, 'samples': 11398080, 'steps': 59364, 'loss/train': 1.6689462661743164} 11/07/2021 05:33:15 - INFO - __main__ - Step 59366: {'lr': 0.00033645457876087205, 'samples': 11398272, 'steps': 59365, 'loss/train': 1.1205670833587646} 11/07/2021 05:33:15 - INFO - __main__ - Step 59367: {'lr': 0.0003364495994152474, 'samples': 11398464, 'steps': 59366, 'loss/train': 0.1568594127893448} 11/07/2021 05:33:16 - INFO - __main__ - Step 59368: {'lr': 0.00033644462003066996, 'samples': 11398656, 'steps': 59367, 'loss/train': 1.8210835456848145} 11/07/2021 05:33:17 - INFO - __main__ - Step 59369: {'lr': 0.00033643964060714183, 'samples': 11398848, 'steps': 59368, 'loss/train': 2.4489545822143555} 11/07/2021 05:33:18 - INFO - __main__ - Step 59370: {'lr': 0.00033643466114466537, 'samples': 11399040, 'steps': 59369, 'loss/train': 1.9088011980056763} 11/07/2021 05:33:18 - INFO - __main__ - Step 59371: {'lr': 0.0003364296816432428, 'samples': 11399232, 'steps': 59370, 'loss/train': 2.0525920391082764} 11/07/2021 05:33:18 - INFO - __main__ - Step 59372: {'lr': 0.0003364247021028763, 'samples': 11399424, 'steps': 59371, 'loss/train': 1.4817010164260864} 11/07/2021 05:33:19 - INFO - __main__ - Step 59373: {'lr': 0.0003364197225235682, 'samples': 11399616, 'steps': 59372, 'loss/train': 1.2891323566436768} 11/07/2021 05:33:19 - INFO - __main__ - Step 59374: {'lr': 0.0003364147429053207, 'samples': 11399808, 'steps': 59373, 'loss/train': 1.1993621587753296} 11/07/2021 05:33:20 - INFO - __main__ - Step 59375: {'lr': 0.00033640976324813605, 'samples': 11400000, 'steps': 59374, 'loss/train': 0.8058171272277832} 11/07/2021 05:33:20 - INFO - __main__ - Step 59376: {'lr': 0.00033640478355201646, 'samples': 11400192, 'steps': 59375, 'loss/train': 1.5635567903518677} 11/07/2021 05:33:21 - INFO - __main__ - Step 59377: {'lr': 0.00033639980381696425, 'samples': 11400384, 'steps': 59376, 'loss/train': 1.3642702102661133} 11/07/2021 05:33:21 - INFO - __main__ - Step 59378: {'lr': 0.0003363948240429816, 'samples': 11400576, 'steps': 59377, 'loss/train': 1.5507620573043823} 11/07/2021 05:33:21 - INFO - __main__ - Step 59379: {'lr': 0.0003363898442300708, 'samples': 11400768, 'steps': 59378, 'loss/train': 1.1319968700408936} 11/07/2021 05:33:22 - INFO - __main__ - Step 59380: {'lr': 0.0003363848643782341, 'samples': 11400960, 'steps': 59379, 'loss/train': 1.0250396728515625} 11/07/2021 05:33:23 - INFO - __main__ - Step 59381: {'lr': 0.00033637988448747365, 'samples': 11401152, 'steps': 59380, 'loss/train': 0.688690721988678} 11/07/2021 05:33:23 - INFO - __main__ - Step 59382: {'lr': 0.00033637490455779175, 'samples': 11401344, 'steps': 59381, 'loss/train': 1.5102119445800781} 11/07/2021 05:33:23 - INFO - __main__ - Step 59383: {'lr': 0.0003363699245891907, 'samples': 11401536, 'steps': 59382, 'loss/train': 1.6710865497589111} 11/07/2021 05:33:24 - INFO - __main__ - Step 59384: {'lr': 0.00033636494458167267, 'samples': 11401728, 'steps': 59383, 'loss/train': 0.4653959572315216} 11/07/2021 05:33:24 - INFO - __main__ - Step 59385: {'lr': 0.00033635996453523987, 'samples': 11401920, 'steps': 59384, 'loss/train': 1.7350064516067505} 11/07/2021 05:33:25 - INFO - __main__ - Step 59386: {'lr': 0.0003363549844498947, 'samples': 11402112, 'steps': 59385, 'loss/train': 1.4382284879684448} 11/07/2021 05:33:26 - INFO - __main__ - Step 59387: {'lr': 0.00033635000432563926, 'samples': 11402304, 'steps': 59386, 'loss/train': 0.8000993132591248} 11/07/2021 05:33:26 - INFO - __main__ - Step 59388: {'lr': 0.0003363450241624759, 'samples': 11402496, 'steps': 59387, 'loss/train': 1.293431282043457} 11/07/2021 05:33:26 - INFO - __main__ - Step 59389: {'lr': 0.00033634004396040673, 'samples': 11402688, 'steps': 59388, 'loss/train': 1.5316147804260254} 11/07/2021 05:33:27 - INFO - __main__ - Step 59390: {'lr': 0.0003363350637194341, 'samples': 11402880, 'steps': 59389, 'loss/train': 1.2599658966064453} 11/07/2021 05:33:28 - INFO - __main__ - Step 59391: {'lr': 0.0003363300834395602, 'samples': 11403072, 'steps': 59390, 'loss/train': 1.994467854499817} 11/07/2021 05:33:28 - INFO - __main__ - Step 59392: {'lr': 0.0003363251031207873, 'samples': 11403264, 'steps': 59391, 'loss/train': 1.5025508403778076} 11/07/2021 05:33:28 - INFO - __main__ - Step 59393: {'lr': 0.00033632012276311763, 'samples': 11403456, 'steps': 59392, 'loss/train': 1.3745367527008057} 11/07/2021 05:33:29 - INFO - __main__ - Step 59394: {'lr': 0.00033631514236655345, 'samples': 11403648, 'steps': 59393, 'loss/train': 2.2225964069366455} 11/07/2021 05:33:29 - INFO - __main__ - Step 59395: {'lr': 0.00033631016193109704, 'samples': 11403840, 'steps': 59394, 'loss/train': 1.3247920274734497} 11/07/2021 05:33:30 - INFO - __main__ - Step 59396: {'lr': 0.00033630518145675057, 'samples': 11404032, 'steps': 59395, 'loss/train': 1.5393571853637695} 11/07/2021 05:33:30 - INFO - __main__ - Step 59397: {'lr': 0.0003363002009435163, 'samples': 11404224, 'steps': 59396, 'loss/train': 1.3608369827270508} 11/07/2021 05:33:31 - INFO - __main__ - Step 59398: {'lr': 0.00033629522039139656, 'samples': 11404416, 'steps': 59397, 'loss/train': 1.306970238685608} 11/07/2021 05:33:31 - INFO - __main__ - Step 59399: {'lr': 0.00033629023980039346, 'samples': 11404608, 'steps': 59398, 'loss/train': 1.4013481140136719} 11/07/2021 05:33:32 - INFO - __main__ - Step 59400: {'lr': 0.00033628525917050935, 'samples': 11404800, 'steps': 59399, 'loss/train': 0.9148676991462708} 11/07/2021 05:33:32 - INFO - __main__ - Step 59401: {'lr': 0.0003362802785017464, 'samples': 11404992, 'steps': 59400, 'loss/train': 1.1169158220291138} 11/07/2021 05:33:33 - INFO - __main__ - Step 59402: {'lr': 0.00033627529779410695, 'samples': 11405184, 'steps': 59401, 'loss/train': 1.1892825365066528} 11/07/2021 05:33:33 - INFO - __main__ - Step 59403: {'lr': 0.0003362703170475931, 'samples': 11405376, 'steps': 59402, 'loss/train': 1.4537960290908813} 11/07/2021 05:33:34 - INFO - __main__ - Step 59404: {'lr': 0.00033626533626220724, 'samples': 11405568, 'steps': 59403, 'loss/train': 1.6124074459075928} 11/07/2021 05:33:34 - INFO - __main__ - Step 59405: {'lr': 0.0003362603554379515, 'samples': 11405760, 'steps': 59404, 'loss/train': 1.7371422052383423} 11/07/2021 05:33:34 - INFO - __main__ - Step 59406: {'lr': 0.0003362553745748281, 'samples': 11405952, 'steps': 59405, 'loss/train': 1.1758849620819092} 11/07/2021 05:33:36 - INFO - __main__ - Step 59407: {'lr': 0.00033625039367283957, 'samples': 11406144, 'steps': 59406, 'loss/train': 1.7903153896331787} 11/07/2021 05:33:36 - INFO - __main__ - Step 59408: {'lr': 0.00033624541273198785, 'samples': 11406336, 'steps': 59407, 'loss/train': 1.4561078548431396} 11/07/2021 05:33:36 - INFO - __main__ - Step 59409: {'lr': 0.0003362404317522752, 'samples': 11406528, 'steps': 59408, 'loss/train': 1.1592931747436523} 11/07/2021 05:33:37 - INFO - __main__ - Step 59410: {'lr': 0.000336235450733704, 'samples': 11406720, 'steps': 59409, 'loss/train': 1.768731713294983} 11/07/2021 05:33:37 - INFO - __main__ - Step 59411: {'lr': 0.00033623046967627647, 'samples': 11406912, 'steps': 59410, 'loss/train': 1.1054468154907227} 11/07/2021 05:33:38 - INFO - __main__ - Step 59412: {'lr': 0.00033622548857999477, 'samples': 11407104, 'steps': 59411, 'loss/train': 2.166207790374756} 11/07/2021 05:33:38 - INFO - __main__ - Step 59413: {'lr': 0.00033622050744486117, 'samples': 11407296, 'steps': 59412, 'loss/train': 1.1738817691802979} 11/07/2021 05:33:39 - INFO - __main__ - Step 59414: {'lr': 0.000336215526270878, 'samples': 11407488, 'steps': 59413, 'loss/train': 1.745837926864624} 11/07/2021 05:33:39 - INFO - __main__ - Step 59415: {'lr': 0.00033621054505804745, 'samples': 11407680, 'steps': 59414, 'loss/train': 1.676180124282837} 11/07/2021 05:33:40 - INFO - __main__ - Step 59416: {'lr': 0.0003362055638063717, 'samples': 11407872, 'steps': 59415, 'loss/train': 1.809029221534729} 11/07/2021 05:33:40 - INFO - __main__ - Step 59417: {'lr': 0.00033620058251585314, 'samples': 11408064, 'steps': 59416, 'loss/train': 0.9265207648277283} 11/07/2021 05:33:41 - INFO - __main__ - Step 59418: {'lr': 0.00033619560118649383, 'samples': 11408256, 'steps': 59417, 'loss/train': 1.4864901304244995} 11/07/2021 05:33:41 - INFO - __main__ - Step 59419: {'lr': 0.0003361906198182961, 'samples': 11408448, 'steps': 59418, 'loss/train': 1.5386531352996826} 11/07/2021 05:33:42 - INFO - __main__ - Step 59420: {'lr': 0.0003361856384112623, 'samples': 11408640, 'steps': 59419, 'loss/train': 1.6746221780776978} 11/07/2021 05:33:42 - INFO - __main__ - Step 59421: {'lr': 0.00033618065696539457, 'samples': 11408832, 'steps': 59420, 'loss/train': 1.4639761447906494} 11/07/2021 05:33:42 - INFO - __main__ - Step 59422: {'lr': 0.00033617567548069517, 'samples': 11409024, 'steps': 59421, 'loss/train': 0.8389047980308533} 11/07/2021 05:33:43 - INFO - __main__ - Step 59423: {'lr': 0.00033617069395716626, 'samples': 11409216, 'steps': 59422, 'loss/train': 1.4436959028244019} 11/07/2021 05:33:44 - INFO - __main__ - Step 59424: {'lr': 0.0003361657123948103, 'samples': 11409408, 'steps': 59423, 'loss/train': 1.450130581855774} 11/07/2021 05:33:44 - INFO - __main__ - Step 59425: {'lr': 0.00033616073079362923, 'samples': 11409600, 'steps': 59424, 'loss/train': 1.641268253326416} 11/07/2021 05:33:44 - INFO - __main__ - Step 59426: {'lr': 0.00033615574915362556, 'samples': 11409792, 'steps': 59425, 'loss/train': 1.3853462934494019} 11/07/2021 05:33:45 - INFO - __main__ - Step 59427: {'lr': 0.0003361507674748015, 'samples': 11409984, 'steps': 59426, 'loss/train': 1.1510624885559082} 11/07/2021 05:33:46 - INFO - __main__ - Step 59428: {'lr': 0.00033614578575715914, 'samples': 11410176, 'steps': 59427, 'loss/train': 1.7111918926239014} 11/07/2021 05:33:46 - INFO - __main__ - Step 59429: {'lr': 0.0003361408040007008, 'samples': 11410368, 'steps': 59428, 'loss/train': 1.0449142456054688} 11/07/2021 05:33:46 - INFO - __main__ - Step 59430: {'lr': 0.00033613582220542884, 'samples': 11410560, 'steps': 59429, 'loss/train': 1.3424932956695557} 11/07/2021 05:33:47 - INFO - __main__ - Step 59431: {'lr': 0.00033613084037134534, 'samples': 11410752, 'steps': 59430, 'loss/train': 1.526412010192871} 11/07/2021 05:33:47 - INFO - __main__ - Step 59432: {'lr': 0.00033612585849845256, 'samples': 11410944, 'steps': 59431, 'loss/train': 1.7384799718856812} 11/07/2021 05:33:48 - INFO - __main__ - Step 59433: {'lr': 0.00033612087658675287, 'samples': 11411136, 'steps': 59432, 'loss/train': 1.1954818964004517} 11/07/2021 05:33:49 - INFO - __main__ - Step 59434: {'lr': 0.0003361158946362485, 'samples': 11411328, 'steps': 59433, 'loss/train': 1.625868320465088} 11/07/2021 05:33:49 - INFO - __main__ - Step 59435: {'lr': 0.00033611091264694156, 'samples': 11411520, 'steps': 59434, 'loss/train': 0.24433954060077667} 11/07/2021 05:33:49 - INFO - __main__ - Step 59436: {'lr': 0.0003361059306188344, 'samples': 11411712, 'steps': 59435, 'loss/train': 1.3391706943511963} 11/07/2021 05:33:50 - INFO - __main__ - Step 59437: {'lr': 0.0003361009485519292, 'samples': 11411904, 'steps': 59436, 'loss/train': 0.8222254514694214} 11/07/2021 05:33:51 - INFO - __main__ - Step 59438: {'lr': 0.0003360959664462282, 'samples': 11412096, 'steps': 59437, 'loss/train': 1.3256622552871704} 11/07/2021 05:33:51 - INFO - __main__ - Step 59439: {'lr': 0.0003360909843017338, 'samples': 11412288, 'steps': 59438, 'loss/train': 1.7633172273635864} 11/07/2021 05:33:52 - INFO - __main__ - Step 59440: {'lr': 0.0003360860021184481, 'samples': 11412480, 'steps': 59439, 'loss/train': 1.453478217124939} 11/07/2021 05:33:52 - INFO - __main__ - Step 59441: {'lr': 0.0003360810198963733, 'samples': 11412672, 'steps': 59440, 'loss/train': 1.3727055788040161} 11/07/2021 05:33:52 - INFO - __main__ - Step 59442: {'lr': 0.0003360760376355118, 'samples': 11412864, 'steps': 59441, 'loss/train': 1.7543045282363892} 11/07/2021 05:33:53 - INFO - __main__ - Step 59443: {'lr': 0.00033607105533586573, 'samples': 11413056, 'steps': 59442, 'loss/train': 1.4854484796524048} 11/07/2021 05:33:54 - INFO - __main__ - Step 59444: {'lr': 0.0003360660729974374, 'samples': 11413248, 'steps': 59443, 'loss/train': 1.4913972616195679} 11/07/2021 05:33:54 - INFO - __main__ - Step 59445: {'lr': 0.00033606109062022906, 'samples': 11413440, 'steps': 59444, 'loss/train': 0.49917852878570557} 11/07/2021 05:33:54 - INFO - __main__ - Step 59446: {'lr': 0.0003360561082042428, 'samples': 11413632, 'steps': 59445, 'loss/train': 1.382643699645996} 11/07/2021 05:33:55 - INFO - __main__ - Step 59447: {'lr': 0.00033605112574948106, 'samples': 11413824, 'steps': 59446, 'loss/train': 1.754879117012024} 11/07/2021 05:33:56 - INFO - __main__ - Step 59448: {'lr': 0.000336046143255946, 'samples': 11414016, 'steps': 59447, 'loss/train': 0.3582473695278168} 11/07/2021 05:33:56 - INFO - __main__ - Step 59449: {'lr': 0.0003360411607236399, 'samples': 11414208, 'steps': 59448, 'loss/train': 1.2509737014770508} 11/07/2021 05:33:57 - INFO - __main__ - Step 59450: {'lr': 0.0003360361781525649, 'samples': 11414400, 'steps': 59449, 'loss/train': 1.4894764423370361} 11/07/2021 05:33:57 - INFO - __main__ - Step 59451: {'lr': 0.00033603119554272343, 'samples': 11414592, 'steps': 59450, 'loss/train': 1.4741876125335693} 11/07/2021 05:33:57 - INFO - __main__ - Step 59452: {'lr': 0.0003360262128941176, 'samples': 11414784, 'steps': 59451, 'loss/train': 1.1519569158554077} 11/07/2021 05:33:58 - INFO - __main__ - Step 59453: {'lr': 0.00033602123020674965, 'samples': 11414976, 'steps': 59452, 'loss/train': 1.4954636096954346} 11/07/2021 05:33:59 - INFO - __main__ - Step 59454: {'lr': 0.0003360162474806219, 'samples': 11415168, 'steps': 59453, 'loss/train': 1.335933804512024} 11/07/2021 05:33:59 - INFO - __main__ - Step 59455: {'lr': 0.0003360112647157366, 'samples': 11415360, 'steps': 59454, 'loss/train': 0.730075478553772} 11/07/2021 05:33:59 - INFO - __main__ - Step 59456: {'lr': 0.0003360062819120958, 'samples': 11415552, 'steps': 59455, 'loss/train': 1.5944722890853882} 11/07/2021 05:34:00 - INFO - __main__ - Step 59457: {'lr': 0.000336001299069702, 'samples': 11415744, 'steps': 59456, 'loss/train': 1.1954927444458008} 11/07/2021 05:34:00 - INFO - __main__ - Step 59458: {'lr': 0.0003359963161885573, 'samples': 11415936, 'steps': 59457, 'loss/train': 1.7233270406723022} 11/07/2021 05:34:01 - INFO - __main__ - Step 59459: {'lr': 0.000335991333268664, 'samples': 11416128, 'steps': 59458, 'loss/train': 1.4710155725479126} 11/07/2021 05:34:02 - INFO - __main__ - Step 59460: {'lr': 0.0003359863503100244, 'samples': 11416320, 'steps': 59459, 'loss/train': 1.1441378593444824} 11/07/2021 05:34:02 - INFO - __main__ - Step 59461: {'lr': 0.0003359813673126406, 'samples': 11416512, 'steps': 59460, 'loss/train': 1.2662721872329712} 11/07/2021 05:34:02 - INFO - __main__ - Step 59462: {'lr': 0.000335976384276515, 'samples': 11416704, 'steps': 59461, 'loss/train': 1.6323987245559692} 11/07/2021 05:34:03 - INFO - __main__ - Step 59463: {'lr': 0.0003359714012016497, 'samples': 11416896, 'steps': 59462, 'loss/train': 1.9028390645980835} 11/07/2021 05:34:04 - INFO - __main__ - Step 59464: {'lr': 0.000335966418088047, 'samples': 11417088, 'steps': 59463, 'loss/train': 0.28088605403900146} 11/07/2021 05:34:04 - INFO - __main__ - Step 59465: {'lr': 0.0003359614349357092, 'samples': 11417280, 'steps': 59464, 'loss/train': 1.1726419925689697} 11/07/2021 05:34:04 - INFO - __main__ - Step 59466: {'lr': 0.00033595645174463843, 'samples': 11417472, 'steps': 59465, 'loss/train': 1.3974151611328125} 11/07/2021 05:34:05 - INFO - __main__ - Step 59467: {'lr': 0.0003359514685148371, 'samples': 11417664, 'steps': 59466, 'loss/train': 1.5635926723480225} 11/07/2021 05:34:05 - INFO - __main__ - Step 59468: {'lr': 0.0003359464852463074, 'samples': 11417856, 'steps': 59467, 'loss/train': 0.8446754217147827} 11/07/2021 05:34:06 - INFO - __main__ - Step 59469: {'lr': 0.00033594150193905144, 'samples': 11418048, 'steps': 59468, 'loss/train': 1.2928434610366821} 11/07/2021 05:34:06 - INFO - __main__ - Step 59470: {'lr': 0.0003359365185930716, 'samples': 11418240, 'steps': 59469, 'loss/train': 1.5282357931137085} 11/07/2021 05:34:07 - INFO - __main__ - Step 59471: {'lr': 0.00033593153520837006, 'samples': 11418432, 'steps': 59470, 'loss/train': 1.4693994522094727} 11/07/2021 05:34:07 - INFO - __main__ - Step 59472: {'lr': 0.0003359265517849491, 'samples': 11418624, 'steps': 59471, 'loss/train': 1.5970423221588135} 11/07/2021 05:34:08 - INFO - __main__ - Step 59473: {'lr': 0.000335921568322811, 'samples': 11418816, 'steps': 59472, 'loss/train': 1.3727434873580933} 11/07/2021 05:34:09 - INFO - __main__ - Step 59474: {'lr': 0.00033591658482195796, 'samples': 11419008, 'steps': 59473, 'loss/train': 1.4155181646347046} 11/07/2021 05:34:09 - INFO - __main__ - Step 59475: {'lr': 0.0003359116012823923, 'samples': 11419200, 'steps': 59474, 'loss/train': 1.4483219385147095} 11/07/2021 05:34:10 - INFO - __main__ - Step 59476: {'lr': 0.0003359066177041161, 'samples': 11419392, 'steps': 59475, 'loss/train': 1.1662133932113647} 11/07/2021 05:34:10 - INFO - __main__ - Step 59477: {'lr': 0.0003359016340871317, 'samples': 11419584, 'steps': 59476, 'loss/train': 2.426244020462036} 11/07/2021 05:34:10 - INFO - __main__ - Step 59478: {'lr': 0.0003358966504314414, 'samples': 11419776, 'steps': 59477, 'loss/train': 1.0725919008255005} 11/07/2021 05:34:11 - INFO - __main__ - Step 59479: {'lr': 0.00033589166673704735, 'samples': 11419968, 'steps': 59478, 'loss/train': 0.5581346750259399} 11/07/2021 05:34:12 - INFO - __main__ - Step 59480: {'lr': 0.0003358866830039519, 'samples': 11420160, 'steps': 59479, 'loss/train': 1.908988118171692} 11/07/2021 05:34:12 - INFO - __main__ - Step 59481: {'lr': 0.0003358816992321572, 'samples': 11420352, 'steps': 59480, 'loss/train': 0.7702450752258301} 11/07/2021 05:34:12 - INFO - __main__ - Step 59482: {'lr': 0.0003358767154216655, 'samples': 11420544, 'steps': 59481, 'loss/train': 1.5035277605056763} 11/07/2021 05:34:13 - INFO - __main__ - Step 59483: {'lr': 0.00033587173157247915, 'samples': 11420736, 'steps': 59482, 'loss/train': 1.2740265130996704} 11/07/2021 05:34:13 - INFO - __main__ - Step 59484: {'lr': 0.00033586674768460025, 'samples': 11420928, 'steps': 59483, 'loss/train': 1.1467206478118896} 11/07/2021 05:34:14 - INFO - __main__ - Step 59485: {'lr': 0.0003358617637580311, 'samples': 11421120, 'steps': 59484, 'loss/train': 1.1890020370483398} 11/07/2021 05:34:14 - INFO - __main__ - Step 59486: {'lr': 0.00033585677979277407, 'samples': 11421312, 'steps': 59485, 'loss/train': 1.4078155755996704} 11/07/2021 05:34:15 - INFO - __main__ - Step 59487: {'lr': 0.00033585179578883123, 'samples': 11421504, 'steps': 59486, 'loss/train': 1.1714547872543335} 11/07/2021 05:34:15 - INFO - __main__ - Step 59488: {'lr': 0.00033584681174620497, 'samples': 11421696, 'steps': 59487, 'loss/train': 1.5593702793121338} 11/07/2021 05:34:15 - INFO - __main__ - Step 59489: {'lr': 0.00033584182766489736, 'samples': 11421888, 'steps': 59488, 'loss/train': 1.2370959520339966} 11/07/2021 05:34:17 - INFO - __main__ - Step 59490: {'lr': 0.0003358368435449108, 'samples': 11422080, 'steps': 59489, 'loss/train': 1.6344785690307617} 11/07/2021 05:34:17 - INFO - __main__ - Step 59491: {'lr': 0.0003358318593862474, 'samples': 11422272, 'steps': 59490, 'loss/train': 1.0064419507980347} 11/07/2021 05:34:17 - INFO - __main__ - Step 59492: {'lr': 0.0003358268751889096, 'samples': 11422464, 'steps': 59491, 'loss/train': 1.6573501825332642} 11/07/2021 05:34:18 - INFO - __main__ - Step 59493: {'lr': 0.0003358218909528995, 'samples': 11422656, 'steps': 59492, 'loss/train': 1.7153048515319824} 11/07/2021 05:34:18 - INFO - __main__ - Step 59494: {'lr': 0.00033581690667821933, 'samples': 11422848, 'steps': 59493, 'loss/train': 1.686608910560608} 11/07/2021 05:34:19 - INFO - __main__ - Step 59495: {'lr': 0.00033581192236487153, 'samples': 11423040, 'steps': 59494, 'loss/train': 1.7390670776367188} 11/07/2021 05:34:19 - INFO - __main__ - Step 59496: {'lr': 0.00033580693801285805, 'samples': 11423232, 'steps': 59495, 'loss/train': 1.3494672775268555} 11/07/2021 05:34:20 - INFO - __main__ - Step 59497: {'lr': 0.0003358019536221814, 'samples': 11423424, 'steps': 59496, 'loss/train': 1.100712537765503} 11/07/2021 05:34:20 - INFO - __main__ - Step 59498: {'lr': 0.00033579696919284357, 'samples': 11423616, 'steps': 59497, 'loss/train': 1.6030791997909546} 11/07/2021 05:34:20 - INFO - __main__ - Step 59499: {'lr': 0.00033579198472484707, 'samples': 11423808, 'steps': 59498, 'loss/train': 1.0974394083023071} 11/07/2021 05:34:21 - INFO - __main__ - Step 59500: {'lr': 0.000335787000218194, 'samples': 11424000, 'steps': 59499, 'loss/train': 1.6031955480575562} 11/07/2021 05:34:22 - INFO - __main__ - Step 59501: {'lr': 0.0003357820156728866, 'samples': 11424192, 'steps': 59500, 'loss/train': 0.9540095925331116} 11/07/2021 05:34:22 - INFO - __main__ - Step 59502: {'lr': 0.0003357770310889272, 'samples': 11424384, 'steps': 59501, 'loss/train': 1.1483746767044067} 11/07/2021 05:34:23 - INFO - __main__ - Step 59503: {'lr': 0.0003357720464663179, 'samples': 11424576, 'steps': 59502, 'loss/train': 0.8712205290794373} 11/07/2021 05:34:23 - INFO - __main__ - Step 59504: {'lr': 0.0003357670618050611, 'samples': 11424768, 'steps': 59503, 'loss/train': 1.2671444416046143} 11/07/2021 05:34:24 - INFO - __main__ - Step 59505: {'lr': 0.000335762077105159, 'samples': 11424960, 'steps': 59504, 'loss/train': 2.224961042404175} 11/07/2021 05:34:25 - INFO - __main__ - Step 59506: {'lr': 0.0003357570923666138, 'samples': 11425152, 'steps': 59505, 'loss/train': 2.035517930984497} 11/07/2021 05:34:25 - INFO - __main__ - Step 59507: {'lr': 0.0003357521075894278, 'samples': 11425344, 'steps': 59506, 'loss/train': 1.5490002632141113} 11/07/2021 05:34:25 - INFO - __main__ - Step 59508: {'lr': 0.00033574712277360325, 'samples': 11425536, 'steps': 59507, 'loss/train': 1.7147842645645142} 11/07/2021 05:34:26 - INFO - __main__ - Step 59509: {'lr': 0.00033574213791914235, 'samples': 11425728, 'steps': 59508, 'loss/train': 0.10911858081817627} 11/07/2021 05:34:27 - INFO - __main__ - Step 59510: {'lr': 0.00033573715302604736, 'samples': 11425920, 'steps': 59509, 'loss/train': 1.6349005699157715} 11/07/2021 05:34:27 - INFO - __main__ - Step 59511: {'lr': 0.0003357321680943205, 'samples': 11426112, 'steps': 59510, 'loss/train': 1.656241774559021} 11/07/2021 05:34:27 - INFO - __main__ - Step 59512: {'lr': 0.00033572718312396404, 'samples': 11426304, 'steps': 59511, 'loss/train': 1.4408379793167114} 11/07/2021 05:34:28 - INFO - __main__ - Step 59513: {'lr': 0.0003357221981149803, 'samples': 11426496, 'steps': 59512, 'loss/train': 1.3564294576644897} 11/07/2021 05:34:28 - INFO - __main__ - Step 59514: {'lr': 0.0003357172130673714, 'samples': 11426688, 'steps': 59513, 'loss/train': 1.758360743522644} 11/07/2021 05:34:29 - INFO - __main__ - Step 59515: {'lr': 0.00033571222798113977, 'samples': 11426880, 'steps': 59514, 'loss/train': 1.6956180334091187} 11/07/2021 05:34:29 - INFO - __main__ - Step 59516: {'lr': 0.0003357072428562874, 'samples': 11427072, 'steps': 59515, 'loss/train': 1.64950692653656} 11/07/2021 05:34:30 - INFO - __main__ - Step 59517: {'lr': 0.0003357022576928167, 'samples': 11427264, 'steps': 59516, 'loss/train': 1.238654613494873} 11/07/2021 05:34:30 - INFO - __main__ - Step 59518: {'lr': 0.0003356972724907299, 'samples': 11427456, 'steps': 59517, 'loss/train': 1.1894230842590332} 11/07/2021 05:34:30 - INFO - __main__ - Step 59519: {'lr': 0.0003356922872500292, 'samples': 11427648, 'steps': 59518, 'loss/train': 1.35769522190094} 11/07/2021 05:34:31 - INFO - __main__ - Step 59520: {'lr': 0.0003356873019707169, 'samples': 11427840, 'steps': 59519, 'loss/train': 1.5582650899887085} 11/07/2021 05:34:32 - INFO - __main__ - Step 59521: {'lr': 0.0003356823166527952, 'samples': 11428032, 'steps': 59520, 'loss/train': 1.5470657348632812} 11/07/2021 05:34:32 - INFO - __main__ - Step 59522: {'lr': 0.00033567733129626645, 'samples': 11428224, 'steps': 59521, 'loss/train': 1.1028765439987183} 11/07/2021 05:34:33 - INFO - __main__ - Step 59523: {'lr': 0.00033567234590113274, 'samples': 11428416, 'steps': 59522, 'loss/train': 1.334879994392395} 11/07/2021 05:34:33 - INFO - __main__ - Step 59524: {'lr': 0.00033566736046739643, 'samples': 11428608, 'steps': 59523, 'loss/train': 1.5988858938217163} 11/07/2021 05:34:33 - INFO - __main__ - Step 59525: {'lr': 0.0003356623749950597, 'samples': 11428800, 'steps': 59524, 'loss/train': 1.3079609870910645} 11/07/2021 05:34:34 - INFO - __main__ - Step 59526: {'lr': 0.0003356573894841248, 'samples': 11428992, 'steps': 59525, 'loss/train': 1.1951123476028442} 11/07/2021 05:34:34 - INFO - __main__ - Step 59527: {'lr': 0.0003356524039345941, 'samples': 11429184, 'steps': 59526, 'loss/train': 1.3614507913589478} 11/07/2021 05:34:35 - INFO - __main__ - Step 59528: {'lr': 0.00033564741834646967, 'samples': 11429376, 'steps': 59527, 'loss/train': 1.3990288972854614} 11/07/2021 05:34:35 - INFO - __main__ - Step 59529: {'lr': 0.0003356424327197539, 'samples': 11429568, 'steps': 59528, 'loss/train': 1.2418543100357056} 11/07/2021 05:34:36 - INFO - __main__ - Step 59530: {'lr': 0.00033563744705444886, 'samples': 11429760, 'steps': 59529, 'loss/train': 1.0261796712875366} 11/07/2021 05:34:37 - INFO - __main__ - Step 59531: {'lr': 0.000335632461350557, 'samples': 11429952, 'steps': 59530, 'loss/train': 1.293760061264038} 11/07/2021 05:34:37 - INFO - __main__ - Step 59532: {'lr': 0.00033562747560808044, 'samples': 11430144, 'steps': 59531, 'loss/train': 1.2552067041397095} 11/07/2021 05:34:37 - INFO - __main__ - Step 59533: {'lr': 0.00033562248982702144, 'samples': 11430336, 'steps': 59532, 'loss/train': 1.2043218612670898} 11/07/2021 05:34:38 - INFO - __main__ - Step 59534: {'lr': 0.0003356175040073823, 'samples': 11430528, 'steps': 59533, 'loss/train': 1.0706124305725098} 11/07/2021 05:34:38 - INFO - __main__ - Step 59535: {'lr': 0.0003356125181491653, 'samples': 11430720, 'steps': 59534, 'loss/train': 1.805811882019043} 11/07/2021 05:34:39 - INFO - __main__ - Step 59536: {'lr': 0.0003356075322523725, 'samples': 11430912, 'steps': 59535, 'loss/train': 1.2499884366989136} 11/07/2021 05:34:39 - INFO - __main__ - Step 59537: {'lr': 0.00033560254631700634, 'samples': 11431104, 'steps': 59536, 'loss/train': 1.4227045774459839} 11/07/2021 05:34:40 - INFO - __main__ - Step 59538: {'lr': 0.0003355975603430689, 'samples': 11431296, 'steps': 59537, 'loss/train': 1.4031585454940796} 11/07/2021 05:34:40 - INFO - __main__ - Step 59539: {'lr': 0.0003355925743305626, 'samples': 11431488, 'steps': 59538, 'loss/train': 0.8719915747642517} 11/07/2021 05:34:40 - INFO - __main__ - Step 59540: {'lr': 0.0003355875882794896, 'samples': 11431680, 'steps': 59539, 'loss/train': 1.7507216930389404} 11/07/2021 05:34:41 - INFO - __main__ - Step 59541: {'lr': 0.00033558260218985214, 'samples': 11431872, 'steps': 59540, 'loss/train': 1.453427791595459} 11/07/2021 05:34:42 - INFO - __main__ - Step 59542: {'lr': 0.00033557761606165253, 'samples': 11432064, 'steps': 59541, 'loss/train': 0.9681723117828369} 11/07/2021 05:34:42 - INFO - __main__ - Step 59543: {'lr': 0.00033557262989489294, 'samples': 11432256, 'steps': 59542, 'loss/train': 1.5702011585235596} 11/07/2021 05:34:43 - INFO - __main__ - Step 59544: {'lr': 0.0003355676436895756, 'samples': 11432448, 'steps': 59543, 'loss/train': 1.639803409576416} 11/07/2021 05:34:43 - INFO - __main__ - Step 59545: {'lr': 0.0003355626574457029, 'samples': 11432640, 'steps': 59544, 'loss/train': 1.6117422580718994} 11/07/2021 05:34:44 - INFO - __main__ - Step 59546: {'lr': 0.00033555767116327686, 'samples': 11432832, 'steps': 59545, 'loss/train': 1.8059355020523071} 11/07/2021 05:34:44 - INFO - __main__ - Step 59547: {'lr': 0.00033555268484229987, 'samples': 11433024, 'steps': 59546, 'loss/train': 1.1739661693572998} 11/07/2021 05:34:45 - INFO - __main__ - Step 59548: {'lr': 0.0003355476984827743, 'samples': 11433216, 'steps': 59547, 'loss/train': 1.4571211338043213} 11/07/2021 05:34:45 - INFO - __main__ - Step 59549: {'lr': 0.0003355427120847021, 'samples': 11433408, 'steps': 59548, 'loss/train': 1.7085621356964111} 11/07/2021 05:34:45 - INFO - __main__ - Step 59550: {'lr': 0.0003355377256480858, 'samples': 11433600, 'steps': 59549, 'loss/train': 1.1984351873397827} 11/07/2021 05:34:46 - INFO - __main__ - Step 59551: {'lr': 0.00033553273917292744, 'samples': 11433792, 'steps': 59550, 'loss/train': 1.194305181503296} 11/07/2021 05:34:47 - INFO - __main__ - Step 59552: {'lr': 0.0003355277526592293, 'samples': 11433984, 'steps': 59551, 'loss/train': 1.199188232421875} 11/07/2021 05:34:47 - INFO - __main__ - Step 59553: {'lr': 0.00033552276610699375, 'samples': 11434176, 'steps': 59552, 'loss/train': 1.4422672986984253} 11/07/2021 05:34:47 - INFO - __main__ - Step 59554: {'lr': 0.00033551777951622297, 'samples': 11434368, 'steps': 59553, 'loss/train': 1.435340404510498} 11/07/2021 05:34:48 - INFO - __main__ - Step 59555: {'lr': 0.0003355127928869192, 'samples': 11434560, 'steps': 59554, 'loss/train': 1.4675737619400024} 11/07/2021 05:34:49 - INFO - __main__ - Step 59556: {'lr': 0.0003355078062190847, 'samples': 11434752, 'steps': 59555, 'loss/train': 0.8876587748527527} 11/07/2021 05:34:49 - INFO - __main__ - Step 59557: {'lr': 0.00033550281951272163, 'samples': 11434944, 'steps': 59556, 'loss/train': 1.6628034114837646} 11/07/2021 05:34:50 - INFO - __main__ - Step 59558: {'lr': 0.0003354978327678323, 'samples': 11435136, 'steps': 59557, 'loss/train': 1.1888867616653442} 11/07/2021 05:34:50 - INFO - __main__ - Step 59559: {'lr': 0.00033549284598441897, 'samples': 11435328, 'steps': 59558, 'loss/train': 1.5983210802078247} 11/07/2021 05:34:50 - INFO - __main__ - Step 59560: {'lr': 0.0003354878591624839, 'samples': 11435520, 'steps': 59559, 'loss/train': 1.2895994186401367} 11/07/2021 05:34:51 - INFO - __main__ - Step 59561: {'lr': 0.0003354828723020294, 'samples': 11435712, 'steps': 59560, 'loss/train': 1.5360382795333862} 11/07/2021 05:34:52 - INFO - __main__ - Step 59562: {'lr': 0.0003354778854030576, 'samples': 11435904, 'steps': 59561, 'loss/train': 1.4257394075393677} 11/07/2021 05:34:52 - INFO - __main__ - Step 59563: {'lr': 0.0003354728984655708, 'samples': 11436096, 'steps': 59562, 'loss/train': 1.2033840417861938} 11/07/2021 05:34:52 - INFO - __main__ - Step 59564: {'lr': 0.0003354679114895711, 'samples': 11436288, 'steps': 59563, 'loss/train': 1.349900722503662} 11/07/2021 05:34:53 - INFO - __main__ - Step 59565: {'lr': 0.000335462924475061, 'samples': 11436480, 'steps': 59564, 'loss/train': 0.9771448373794556} 11/07/2021 05:34:54 - INFO - __main__ - Step 59566: {'lr': 0.00033545793742204255, 'samples': 11436672, 'steps': 59565, 'loss/train': 1.3499360084533691} 11/07/2021 05:34:54 - INFO - __main__ - Step 59567: {'lr': 0.00033545295033051814, 'samples': 11436864, 'steps': 59566, 'loss/train': 1.3435114622116089} 11/07/2021 05:34:54 - INFO - __main__ - Step 59568: {'lr': 0.00033544796320048996, 'samples': 11437056, 'steps': 59567, 'loss/train': 1.2455247640609741} 11/07/2021 05:34:55 - INFO - __main__ - Step 59569: {'lr': 0.0003354429760319602, 'samples': 11437248, 'steps': 59568, 'loss/train': 1.316182017326355} 11/07/2021 05:34:55 - INFO - __main__ - Step 59570: {'lr': 0.00033543798882493123, 'samples': 11437440, 'steps': 59569, 'loss/train': 0.7195746302604675} 11/07/2021 05:34:55 - INFO - __main__ - Step 59571: {'lr': 0.0003354330015794051, 'samples': 11437632, 'steps': 59570, 'loss/train': 1.4050893783569336} 11/07/2021 05:34:56 - INFO - __main__ - Step 59572: {'lr': 0.00033542801429538424, 'samples': 11437824, 'steps': 59571, 'loss/train': 1.6291940212249756} 11/07/2021 05:34:57 - INFO - __main__ - Step 59573: {'lr': 0.0003354230269728709, 'samples': 11438016, 'steps': 59572, 'loss/train': 1.7440979480743408} 11/07/2021 05:34:57 - INFO - __main__ - Step 59574: {'lr': 0.0003354180396118671, 'samples': 11438208, 'steps': 59573, 'loss/train': 1.2694685459136963} 11/07/2021 05:34:58 - INFO - __main__ - Step 59575: {'lr': 0.0003354130522123754, 'samples': 11438400, 'steps': 59574, 'loss/train': 1.6482088565826416} 11/07/2021 05:34:59 - INFO - __main__ - Step 59576: {'lr': 0.0003354080647743978, 'samples': 11438592, 'steps': 59575, 'loss/train': 1.1161706447601318} 11/07/2021 05:34:59 - INFO - __main__ - Step 59577: {'lr': 0.0003354030772979367, 'samples': 11438784, 'steps': 59576, 'loss/train': 1.1914986371994019} 11/07/2021 05:34:59 - INFO - __main__ - Step 59578: {'lr': 0.00033539808978299423, 'samples': 11438976, 'steps': 59577, 'loss/train': 1.3084155321121216} 11/07/2021 05:35:00 - INFO - __main__ - Step 59579: {'lr': 0.0003353931022295728, 'samples': 11439168, 'steps': 59578, 'loss/train': 2.5183029174804688} 11/07/2021 05:35:00 - INFO - __main__ - Step 59580: {'lr': 0.0003353881146376745, 'samples': 11439360, 'steps': 59579, 'loss/train': 1.261258840560913} 11/07/2021 05:35:00 - INFO - __main__ - Step 59581: {'lr': 0.0003353831270073016, 'samples': 11439552, 'steps': 59580, 'loss/train': 0.11749255657196045} 11/07/2021 05:35:02 - INFO - __main__ - Step 59582: {'lr': 0.0003353781393384564, 'samples': 11439744, 'steps': 59581, 'loss/train': 1.3081586360931396} 11/07/2021 05:35:02 - INFO - __main__ - Step 59583: {'lr': 0.0003353731516311411, 'samples': 11439936, 'steps': 59582, 'loss/train': 0.18103912472724915} 11/07/2021 05:35:03 - INFO - __main__ - Step 59584: {'lr': 0.00033536816388535814, 'samples': 11440128, 'steps': 59583, 'loss/train': 1.3822505474090576} 11/07/2021 05:35:03 - INFO - __main__ - Step 59585: {'lr': 0.0003353631761011094, 'samples': 11440320, 'steps': 59584, 'loss/train': 1.2396767139434814} 11/07/2021 05:35:03 - INFO - __main__ - Step 59586: {'lr': 0.00033535818827839744, 'samples': 11440512, 'steps': 59585, 'loss/train': 1.3074694871902466} 11/07/2021 05:35:05 - INFO - __main__ - Step 59587: {'lr': 0.0003353532004172244, 'samples': 11440704, 'steps': 59586, 'loss/train': 1.5764880180358887} 11/07/2021 05:35:05 - INFO - __main__ - Step 59588: {'lr': 0.00033534821251759246, 'samples': 11440896, 'steps': 59587, 'loss/train': 0.9902772307395935} 11/07/2021 05:35:05 - INFO - __main__ - Step 59589: {'lr': 0.00033534322457950396, 'samples': 11441088, 'steps': 59588, 'loss/train': 1.802744746208191} 11/07/2021 05:35:06 - INFO - __main__ - Step 59590: {'lr': 0.00033533823660296115, 'samples': 11441280, 'steps': 59589, 'loss/train': 1.0557581186294556} 11/07/2021 05:35:06 - INFO - __main__ - Step 59591: {'lr': 0.00033533324858796623, 'samples': 11441472, 'steps': 59590, 'loss/train': 1.397140622138977} 11/07/2021 05:35:07 - INFO - __main__ - Step 59592: {'lr': 0.00033532826053452145, 'samples': 11441664, 'steps': 59591, 'loss/train': 0.15451475977897644} 11/07/2021 05:35:07 - INFO - __main__ - Step 59593: {'lr': 0.00033532327244262906, 'samples': 11441856, 'steps': 59592, 'loss/train': 1.3034030199050903} 11/07/2021 05:35:08 - INFO - __main__ - Step 59594: {'lr': 0.0003353182843122913, 'samples': 11442048, 'steps': 59593, 'loss/train': 1.8378833532333374} 11/07/2021 05:35:08 - INFO - __main__ - Step 59595: {'lr': 0.0003353132961435105, 'samples': 11442240, 'steps': 59594, 'loss/train': 1.328703761100769} 11/07/2021 05:35:08 - INFO - __main__ - Step 59596: {'lr': 0.00033530830793628886, 'samples': 11442432, 'steps': 59595, 'loss/train': 1.4518554210662842} 11/07/2021 05:35:10 - INFO - __main__ - Step 59597: {'lr': 0.00033530331969062853, 'samples': 11442624, 'steps': 59596, 'loss/train': 1.4871795177459717} 11/07/2021 05:35:10 - INFO - __main__ - Step 59598: {'lr': 0.00033529833140653187, 'samples': 11442816, 'steps': 59597, 'loss/train': 1.5746878385543823} 11/07/2021 05:35:10 - INFO - __main__ - Step 59599: {'lr': 0.0003352933430840011, 'samples': 11443008, 'steps': 59598, 'loss/train': 0.10077402740716934} 11/07/2021 05:35:11 - INFO - __main__ - Step 59600: {'lr': 0.0003352883547230385, 'samples': 11443200, 'steps': 59599, 'loss/train': 1.2615989446640015} 11/07/2021 05:35:11 - INFO - __main__ - Step 59601: {'lr': 0.00033528336632364624, 'samples': 11443392, 'steps': 59600, 'loss/train': 1.404905080795288} 11/07/2021 05:35:12 - INFO - __main__ - Step 59602: {'lr': 0.00033527837788582663, 'samples': 11443584, 'steps': 59601, 'loss/train': 0.9862393736839294} 11/07/2021 05:35:12 - INFO - __main__ - Step 59603: {'lr': 0.00033527338940958197, 'samples': 11443776, 'steps': 59602, 'loss/train': 1.3578568696975708} 11/07/2021 05:35:13 - INFO - __main__ - Step 59604: {'lr': 0.00033526840089491433, 'samples': 11443968, 'steps': 59603, 'loss/train': 1.252523422241211} 11/07/2021 05:35:13 - INFO - __main__ - Step 59605: {'lr': 0.00033526341234182613, 'samples': 11444160, 'steps': 59604, 'loss/train': 1.8080463409423828} 11/07/2021 05:35:14 - INFO - __main__ - Step 59606: {'lr': 0.00033525842375031946, 'samples': 11444352, 'steps': 59605, 'loss/train': 1.431477427482605} 11/07/2021 05:35:15 - INFO - __main__ - Step 59607: {'lr': 0.00033525343512039673, 'samples': 11444544, 'steps': 59606, 'loss/train': 1.2721199989318848} 11/07/2021 05:35:15 - INFO - __main__ - Step 59608: {'lr': 0.0003352484464520601, 'samples': 11444736, 'steps': 59607, 'loss/train': 0.9697229862213135} 11/07/2021 05:35:15 - INFO - __main__ - Step 59609: {'lr': 0.0003352434577453119, 'samples': 11444928, 'steps': 59608, 'loss/train': 1.3642055988311768} 11/07/2021 05:35:16 - INFO - __main__ - Step 59610: {'lr': 0.00033523846900015427, 'samples': 11445120, 'steps': 59609, 'loss/train': 1.4536397457122803} 11/07/2021 05:35:16 - INFO - __main__ - Step 59611: {'lr': 0.00033523348021658947, 'samples': 11445312, 'steps': 59610, 'loss/train': 1.5773667097091675} 11/07/2021 05:35:17 - INFO - __main__ - Step 59612: {'lr': 0.00033522849139461973, 'samples': 11445504, 'steps': 59611, 'loss/train': 1.2811578512191772} 11/07/2021 05:35:18 - INFO - __main__ - Step 59613: {'lr': 0.0003352235025342475, 'samples': 11445696, 'steps': 59612, 'loss/train': 1.6257545948028564} 11/07/2021 05:35:18 - INFO - __main__ - Step 59614: {'lr': 0.00033521851363547473, 'samples': 11445888, 'steps': 59613, 'loss/train': 1.0231727361679077} 11/07/2021 05:35:18 - INFO - __main__ - Step 59615: {'lr': 0.0003352135246983039, 'samples': 11446080, 'steps': 59614, 'loss/train': 1.655920386314392} 11/07/2021 05:35:19 - INFO - __main__ - Step 59616: {'lr': 0.0003352085357227372, 'samples': 11446272, 'steps': 59615, 'loss/train': 1.7154494524002075} 11/07/2021 05:35:19 - INFO - __main__ - Step 59617: {'lr': 0.00033520354670877673, 'samples': 11446464, 'steps': 59616, 'loss/train': 1.5022478103637695} 11/07/2021 05:35:20 - INFO - __main__ - Step 59618: {'lr': 0.00033519855765642493, 'samples': 11446656, 'steps': 59617, 'loss/train': 1.3677871227264404} 11/07/2021 05:35:21 - INFO - __main__ - Step 59619: {'lr': 0.00033519356856568397, 'samples': 11446848, 'steps': 59618, 'loss/train': 1.5715504884719849} 11/07/2021 05:35:21 - INFO - __main__ - Step 59620: {'lr': 0.00033518857943655607, 'samples': 11447040, 'steps': 59619, 'loss/train': 1.1762737035751343} 11/07/2021 05:35:21 - INFO - __main__ - Step 59621: {'lr': 0.00033518359026904357, 'samples': 11447232, 'steps': 59620, 'loss/train': 1.3297492265701294} 11/07/2021 05:35:22 - INFO - __main__ - Step 59622: {'lr': 0.00033517860106314863, 'samples': 11447424, 'steps': 59621, 'loss/train': 0.13469816744327545} 11/07/2021 05:35:23 - INFO - __main__ - Step 59623: {'lr': 0.00033517361181887353, 'samples': 11447616, 'steps': 59622, 'loss/train': 1.3062771558761597} 11/07/2021 05:35:23 - INFO - __main__ - Step 59624: {'lr': 0.0003351686225362205, 'samples': 11447808, 'steps': 59623, 'loss/train': 1.686296820640564} 11/07/2021 05:35:23 - INFO - __main__ - Step 59625: {'lr': 0.00033516363321519185, 'samples': 11448000, 'steps': 59624, 'loss/train': 1.2708745002746582} 11/07/2021 05:35:24 - INFO - __main__ - Step 59626: {'lr': 0.0003351586438557897, 'samples': 11448192, 'steps': 59625, 'loss/train': 1.0592166185379028} 11/07/2021 05:35:24 - INFO - __main__ - Step 59627: {'lr': 0.00033515365445801635, 'samples': 11448384, 'steps': 59626, 'loss/train': 1.2782230377197266} 11/07/2021 05:35:25 - INFO - __main__ - Step 59628: {'lr': 0.00033514866502187417, 'samples': 11448576, 'steps': 59627, 'loss/train': 0.7311087846755981} 11/07/2021 05:35:26 - INFO - __main__ - Step 59629: {'lr': 0.0003351436755473654, 'samples': 11448768, 'steps': 59628, 'loss/train': 1.175594449043274} 11/07/2021 05:35:26 - INFO - __main__ - Step 59630: {'lr': 0.00033513868603449203, 'samples': 11448960, 'steps': 59629, 'loss/train': 1.260723352432251} 11/07/2021 05:35:26 - INFO - __main__ - Step 59631: {'lr': 0.00033513369648325653, 'samples': 11449152, 'steps': 59630, 'loss/train': 1.033092737197876} 11/07/2021 05:35:27 - INFO - __main__ - Step 59632: {'lr': 0.00033512870689366114, 'samples': 11449344, 'steps': 59631, 'loss/train': 1.294837474822998} 11/07/2021 05:35:27 - INFO - __main__ - Step 59633: {'lr': 0.0003351237172657081, 'samples': 11449536, 'steps': 59632, 'loss/train': 1.1664807796478271} 11/07/2021 05:35:28 - INFO - __main__ - Step 59634: {'lr': 0.00033511872759939954, 'samples': 11449728, 'steps': 59633, 'loss/train': 1.6271213293075562} 11/07/2021 05:35:28 - INFO - __main__ - Step 59635: {'lr': 0.0003351137378947378, 'samples': 11449920, 'steps': 59634, 'loss/train': 1.7285903692245483} 11/07/2021 05:35:29 - INFO - __main__ - Step 59636: {'lr': 0.00033510874815172523, 'samples': 11450112, 'steps': 59635, 'loss/train': 0.9194626212120056} 11/07/2021 05:35:29 - INFO - __main__ - Step 59637: {'lr': 0.00033510375837036386, 'samples': 11450304, 'steps': 59636, 'loss/train': 1.944482445716858} 11/07/2021 05:35:29 - INFO - __main__ - Step 59638: {'lr': 0.0003350987685506561, 'samples': 11450496, 'steps': 59637, 'loss/train': 1.3527714014053345} 11/07/2021 05:35:31 - INFO - __main__ - Step 59639: {'lr': 0.0003350937786926041, 'samples': 11450688, 'steps': 59638, 'loss/train': 1.75435471534729} 11/07/2021 05:35:31 - INFO - __main__ - Step 59640: {'lr': 0.0003350887887962102, 'samples': 11450880, 'steps': 59639, 'loss/train': 1.2585010528564453} 11/07/2021 05:35:31 - INFO - __main__ - Step 59641: {'lr': 0.00033508379886147655, 'samples': 11451072, 'steps': 59640, 'loss/train': 1.3809635639190674} 11/07/2021 05:35:32 - INFO - __main__ - Step 59642: {'lr': 0.00033507880888840547, 'samples': 11451264, 'steps': 59641, 'loss/train': 1.4172636270523071} 11/07/2021 05:35:32 - INFO - __main__ - Step 59643: {'lr': 0.00033507381887699927, 'samples': 11451456, 'steps': 59642, 'loss/train': 1.5943095684051514} 11/07/2021 05:35:33 - INFO - __main__ - Step 59644: {'lr': 0.0003350688288272601, 'samples': 11451648, 'steps': 59643, 'loss/train': 0.06723592430353165} 11/07/2021 05:35:34 - INFO - __main__ - Step 59645: {'lr': 0.00033506383873919016, 'samples': 11451840, 'steps': 59644, 'loss/train': 1.517072319984436} 11/07/2021 05:35:34 - INFO - __main__ - Step 59646: {'lr': 0.0003350588486127918, 'samples': 11452032, 'steps': 59645, 'loss/train': 0.43619540333747864} 11/07/2021 05:35:34 - INFO - __main__ - Step 59647: {'lr': 0.0003350538584480672, 'samples': 11452224, 'steps': 59646, 'loss/train': 1.0386179685592651} 11/07/2021 05:35:35 - INFO - __main__ - Step 59648: {'lr': 0.0003350488682450187, 'samples': 11452416, 'steps': 59647, 'loss/train': 2.321737051010132} 11/07/2021 05:35:36 - INFO - __main__ - Step 59649: {'lr': 0.00033504387800364856, 'samples': 11452608, 'steps': 59648, 'loss/train': 1.195865511894226} 11/07/2021 05:35:36 - INFO - __main__ - Step 59650: {'lr': 0.00033503888772395886, 'samples': 11452800, 'steps': 59649, 'loss/train': 2.187549591064453} 11/07/2021 05:35:37 - INFO - __main__ - Step 59651: {'lr': 0.0003350338974059519, 'samples': 11452992, 'steps': 59650, 'loss/train': 1.7402242422103882} 11/07/2021 05:35:37 - INFO - __main__ - Step 59652: {'lr': 0.0003350289070496301, 'samples': 11453184, 'steps': 59651, 'loss/train': 1.593725323677063} 11/07/2021 05:35:37 - INFO - __main__ - Step 59653: {'lr': 0.0003350239166549955, 'samples': 11453376, 'steps': 59652, 'loss/train': 1.6116315126419067} 11/07/2021 05:35:38 - INFO - __main__ - Step 59654: {'lr': 0.0003350189262220504, 'samples': 11453568, 'steps': 59653, 'loss/train': 1.701385736465454} 11/07/2021 05:35:39 - INFO - __main__ - Step 59655: {'lr': 0.0003350139357507972, 'samples': 11453760, 'steps': 59654, 'loss/train': 1.0990419387817383} 11/07/2021 05:35:39 - INFO - __main__ - Step 59656: {'lr': 0.00033500894524123796, 'samples': 11453952, 'steps': 59655, 'loss/train': 1.5260257720947266} 11/07/2021 05:35:39 - INFO - __main__ - Step 59657: {'lr': 0.0003350039546933751, 'samples': 11454144, 'steps': 59656, 'loss/train': 1.3701422214508057} 11/07/2021 05:35:40 - INFO - __main__ - Step 59658: {'lr': 0.00033499896410721066, 'samples': 11454336, 'steps': 59657, 'loss/train': 1.5399134159088135} 11/07/2021 05:35:40 - INFO - __main__ - Step 59659: {'lr': 0.000334993973482747, 'samples': 11454528, 'steps': 59658, 'loss/train': 0.7803982496261597} 11/07/2021 05:35:41 - INFO - __main__ - Step 59660: {'lr': 0.0003349889828199864, 'samples': 11454720, 'steps': 59659, 'loss/train': 1.2550472021102905} 11/07/2021 05:35:41 - INFO - __main__ - Step 59661: {'lr': 0.000334983992118931, 'samples': 11454912, 'steps': 59660, 'loss/train': 1.2362215518951416} 11/07/2021 05:35:42 - INFO - __main__ - Step 59662: {'lr': 0.00033497900137958325, 'samples': 11455104, 'steps': 59661, 'loss/train': 1.4623883962631226} 11/07/2021 05:35:42 - INFO - __main__ - Step 59663: {'lr': 0.00033497401060194525, 'samples': 11455296, 'steps': 59662, 'loss/train': 1.6779707670211792} 11/07/2021 05:35:42 - INFO - __main__ - Step 59664: {'lr': 0.00033496901978601924, 'samples': 11455488, 'steps': 59663, 'loss/train': 1.2709845304489136} 11/07/2021 05:35:44 - INFO - __main__ - Step 59665: {'lr': 0.0003349640289318075, 'samples': 11455680, 'steps': 59664, 'loss/train': 1.0757722854614258} 11/07/2021 05:35:44 - INFO - __main__ - Step 59666: {'lr': 0.0003349590380393123, 'samples': 11455872, 'steps': 59665, 'loss/train': 0.8987736105918884} 11/07/2021 05:35:44 - INFO - __main__ - Step 59667: {'lr': 0.0003349540471085358, 'samples': 11456064, 'steps': 59666, 'loss/train': 0.9906978607177734} 11/07/2021 05:35:45 - INFO - __main__ - Step 59668: {'lr': 0.00033494905613948035, 'samples': 11456256, 'steps': 59667, 'loss/train': 1.535966396331787} 11/07/2021 05:35:45 - INFO - __main__ - Step 59669: {'lr': 0.00033494406513214826, 'samples': 11456448, 'steps': 59668, 'loss/train': 1.8228522539138794} 11/07/2021 05:35:46 - INFO - __main__ - Step 59670: {'lr': 0.0003349390740865416, 'samples': 11456640, 'steps': 59669, 'loss/train': 1.481791377067566} 11/07/2021 05:35:46 - INFO - __main__ - Step 59671: {'lr': 0.0003349340830026627, 'samples': 11456832, 'steps': 59670, 'loss/train': 0.6913386583328247} 11/07/2021 05:35:47 - INFO - __main__ - Step 59672: {'lr': 0.0003349290918805138, 'samples': 11457024, 'steps': 59671, 'loss/train': 0.9363782405853271} 11/07/2021 05:35:47 - INFO - __main__ - Step 59673: {'lr': 0.0003349241007200972, 'samples': 11457216, 'steps': 59672, 'loss/train': 1.3432589769363403} 11/07/2021 05:35:48 - INFO - __main__ - Step 59674: {'lr': 0.0003349191095214151, 'samples': 11457408, 'steps': 59673, 'loss/train': 1.3509225845336914} 11/07/2021 05:35:49 - INFO - __main__ - Step 59675: {'lr': 0.00033491411828446974, 'samples': 11457600, 'steps': 59674, 'loss/train': 1.407131314277649} 11/07/2021 05:35:49 - INFO - __main__ - Step 59676: {'lr': 0.00033490912700926345, 'samples': 11457792, 'steps': 59675, 'loss/train': 1.3684331178665161} 11/07/2021 05:35:49 - INFO - __main__ - Step 59677: {'lr': 0.00033490413569579837, 'samples': 11457984, 'steps': 59676, 'loss/train': 1.229500651359558} 11/07/2021 05:35:50 - INFO - __main__ - Step 59678: {'lr': 0.00033489914434407683, 'samples': 11458176, 'steps': 59677, 'loss/train': 1.3191747665405273} 11/07/2021 05:35:50 - INFO - __main__ - Step 59679: {'lr': 0.00033489415295410096, 'samples': 11458368, 'steps': 59678, 'loss/train': 1.3091589212417603} 11/07/2021 05:35:51 - INFO - __main__ - Step 59680: {'lr': 0.0003348891615258732, 'samples': 11458560, 'steps': 59679, 'loss/train': 1.3246545791625977} 11/07/2021 05:35:52 - INFO - __main__ - Step 59681: {'lr': 0.0003348841700593956, 'samples': 11458752, 'steps': 59680, 'loss/train': 0.14527994394302368} 11/07/2021 05:35:52 - INFO - __main__ - Step 59682: {'lr': 0.00033487917855467056, 'samples': 11458944, 'steps': 59681, 'loss/train': 1.6974310874938965} 11/07/2021 05:35:52 - INFO - __main__ - Step 59683: {'lr': 0.0003348741870117003, 'samples': 11459136, 'steps': 59682, 'loss/train': 1.2273606061935425} 11/07/2021 05:35:53 - INFO - __main__ - Step 59684: {'lr': 0.000334869195430487, 'samples': 11459328, 'steps': 59683, 'loss/train': 1.6151094436645508} 11/07/2021 05:35:54 - INFO - __main__ - Step 59685: {'lr': 0.0003348642038110329, 'samples': 11459520, 'steps': 59684, 'loss/train': 1.2376044988632202} 11/07/2021 05:35:54 - INFO - __main__ - Step 59686: {'lr': 0.0003348592121533404, 'samples': 11459712, 'steps': 59685, 'loss/train': 1.2281603813171387} 11/07/2021 05:35:54 - INFO - __main__ - Step 59687: {'lr': 0.00033485422045741154, 'samples': 11459904, 'steps': 59686, 'loss/train': 1.5677322149276733} 11/07/2021 05:35:55 - INFO - __main__ - Step 59688: {'lr': 0.00033484922872324875, 'samples': 11460096, 'steps': 59687, 'loss/train': 1.1358038187026978} 11/07/2021 05:35:55 - INFO - __main__ - Step 59689: {'lr': 0.0003348442369508542, 'samples': 11460288, 'steps': 59688, 'loss/train': 1.6422333717346191} 11/07/2021 05:35:55 - INFO - __main__ - Step 59690: {'lr': 0.0003348392451402302, 'samples': 11460480, 'steps': 59689, 'loss/train': 1.289177656173706} 11/07/2021 05:35:56 - INFO - __main__ - Step 59691: {'lr': 0.00033483425329137886, 'samples': 11460672, 'steps': 59690, 'loss/train': 1.566542148590088} 11/07/2021 05:35:57 - INFO - __main__ - Step 59692: {'lr': 0.00033482926140430253, 'samples': 11460864, 'steps': 59691, 'loss/train': 1.6093891859054565} 11/07/2021 05:35:57 - INFO - __main__ - Step 59693: {'lr': 0.00033482426947900346, 'samples': 11461056, 'steps': 59692, 'loss/train': 1.5558502674102783} 11/07/2021 05:35:57 - INFO - __main__ - Step 59694: {'lr': 0.0003348192775154839, 'samples': 11461248, 'steps': 59693, 'loss/train': 1.9984288215637207} 11/07/2021 05:35:58 - INFO - __main__ - Step 59695: {'lr': 0.000334814285513746, 'samples': 11461440, 'steps': 59694, 'loss/train': 1.259392261505127} 11/07/2021 05:35:59 - INFO - __main__ - Step 59696: {'lr': 0.0003348092934737922, 'samples': 11461632, 'steps': 59695, 'loss/train': 1.1630260944366455} 11/07/2021 05:35:59 - INFO - __main__ - Step 59697: {'lr': 0.00033480430139562456, 'samples': 11461824, 'steps': 59696, 'loss/train': 0.7738115191459656} 11/07/2021 05:35:59 - INFO - __main__ - Step 59698: {'lr': 0.00033479930927924543, 'samples': 11462016, 'steps': 59697, 'loss/train': 1.451158046722412} 11/07/2021 05:36:00 - INFO - __main__ - Step 59699: {'lr': 0.000334794317124657, 'samples': 11462208, 'steps': 59698, 'loss/train': 1.1373018026351929} 11/07/2021 05:36:00 - INFO - __main__ - Step 59700: {'lr': 0.00033478932493186163, 'samples': 11462400, 'steps': 59699, 'loss/train': 1.2634257078170776} 11/07/2021 05:36:01 - INFO - __main__ - Step 59701: {'lr': 0.0003347843327008615, 'samples': 11462592, 'steps': 59700, 'loss/train': 1.2875890731811523} 11/07/2021 05:36:02 - INFO - __main__ - Step 59702: {'lr': 0.0003347793404316589, 'samples': 11462784, 'steps': 59701, 'loss/train': 1.8182542324066162} 11/07/2021 05:36:02 - INFO - __main__ - Step 59703: {'lr': 0.00033477434812425596, 'samples': 11462976, 'steps': 59702, 'loss/train': 0.7015295624732971} 11/07/2021 05:36:02 - INFO - __main__ - Step 59704: {'lr': 0.00033476935577865497, 'samples': 11463168, 'steps': 59703, 'loss/train': 1.2705014944076538} 11/07/2021 05:36:03 - INFO - __main__ - Step 59705: {'lr': 0.0003347643633948583, 'samples': 11463360, 'steps': 59704, 'loss/train': 0.9029524922370911} 11/07/2021 05:36:04 - INFO - __main__ - Step 59706: {'lr': 0.00033475937097286805, 'samples': 11463552, 'steps': 59705, 'loss/train': 1.3314130306243896} 11/07/2021 05:36:04 - INFO - __main__ - Step 59707: {'lr': 0.00033475437851268657, 'samples': 11463744, 'steps': 59706, 'loss/train': 0.4221527874469757} 11/07/2021 05:36:04 - INFO - __main__ - Step 59708: {'lr': 0.0003347493860143161, 'samples': 11463936, 'steps': 59707, 'loss/train': 1.3227791786193848} 11/07/2021 05:36:05 - INFO - __main__ - Step 59709: {'lr': 0.0003347443934777589, 'samples': 11464128, 'steps': 59708, 'loss/train': 1.4634250402450562} 11/07/2021 05:36:05 - INFO - __main__ - Step 59710: {'lr': 0.0003347394009030171, 'samples': 11464320, 'steps': 59709, 'loss/train': 1.2356635332107544} 11/07/2021 05:36:06 - INFO - __main__ - Step 59711: {'lr': 0.00033473440829009303, 'samples': 11464512, 'steps': 59710, 'loss/train': 1.7734612226486206} 11/07/2021 05:36:06 - INFO - __main__ - Step 59712: {'lr': 0.00033472941563898897, 'samples': 11464704, 'steps': 59711, 'loss/train': 1.698994517326355} 11/07/2021 05:36:07 - INFO - __main__ - Step 59713: {'lr': 0.00033472442294970716, 'samples': 11464896, 'steps': 59712, 'loss/train': 1.3743351697921753} 11/07/2021 05:36:07 - INFO - __main__ - Step 59714: {'lr': 0.00033471943022224984, 'samples': 11465088, 'steps': 59713, 'loss/train': 1.2542681694030762} 11/07/2021 05:36:08 - INFO - __main__ - Step 59715: {'lr': 0.0003347144374566192, 'samples': 11465280, 'steps': 59714, 'loss/train': 1.762437105178833} 11/07/2021 05:36:09 - INFO - __main__ - Step 59716: {'lr': 0.00033470944465281753, 'samples': 11465472, 'steps': 59715, 'loss/train': 1.3514084815979004} 11/07/2021 05:36:09 - INFO - __main__ - Step 59717: {'lr': 0.00033470445181084716, 'samples': 11465664, 'steps': 59716, 'loss/train': 1.3770685195922852} 11/07/2021 05:36:09 - INFO - __main__ - Step 59718: {'lr': 0.0003346994589307102, 'samples': 11465856, 'steps': 59717, 'loss/train': 1.4346739053726196} 11/07/2021 05:36:10 - INFO - __main__ - Step 59719: {'lr': 0.00033469446601240907, 'samples': 11466048, 'steps': 59718, 'loss/train': 1.4784272909164429} 11/07/2021 05:36:10 - INFO - __main__ - Step 59720: {'lr': 0.00033468947305594586, 'samples': 11466240, 'steps': 59719, 'loss/train': 0.5607002377510071} 11/07/2021 05:36:11 - INFO - __main__ - Step 59721: {'lr': 0.0003346844800613229, 'samples': 11466432, 'steps': 59720, 'loss/train': 1.163059949874878} 11/07/2021 05:36:11 - INFO - __main__ - Step 59722: {'lr': 0.00033467948702854233, 'samples': 11466624, 'steps': 59721, 'loss/train': 1.3342009782791138} 11/07/2021 05:36:12 - INFO - __main__ - Step 59723: {'lr': 0.00033467449395760656, 'samples': 11466816, 'steps': 59722, 'loss/train': 1.665111780166626} 11/07/2021 05:36:12 - INFO - __main__ - Step 59724: {'lr': 0.0003346695008485179, 'samples': 11467008, 'steps': 59723, 'loss/train': 1.8015192747116089} 11/07/2021 05:36:13 - INFO - __main__ - Step 59725: {'lr': 0.00033466450770127824, 'samples': 11467200, 'steps': 59724, 'loss/train': 1.6405117511749268} 11/07/2021 05:36:13 - INFO - __main__ - Step 59726: {'lr': 0.0003346595145158902, 'samples': 11467392, 'steps': 59725, 'loss/train': 0.12103454768657684} 11/07/2021 05:36:14 - INFO - __main__ - Step 59727: {'lr': 0.00033465452129235584, 'samples': 11467584, 'steps': 59726, 'loss/train': 2.292454957962036} 11/07/2021 05:36:14 - INFO - __main__ - Step 59728: {'lr': 0.00033464952803067746, 'samples': 11467776, 'steps': 59727, 'loss/train': 1.4133906364440918} 11/07/2021 05:36:15 - INFO - __main__ - Step 59729: {'lr': 0.0003346445347308573, 'samples': 11467968, 'steps': 59728, 'loss/train': 1.6800386905670166} 11/07/2021 05:36:15 - INFO - __main__ - Step 59730: {'lr': 0.0003346395413928977, 'samples': 11468160, 'steps': 59729, 'loss/train': 1.228143334388733} 11/07/2021 05:36:16 - INFO - __main__ - Step 59731: {'lr': 0.0003346345480168007, 'samples': 11468352, 'steps': 59730, 'loss/train': 1.40749990940094} 11/07/2021 05:36:16 - INFO - __main__ - Step 59732: {'lr': 0.00033462955460256876, 'samples': 11468544, 'steps': 59731, 'loss/train': 1.4132229089736938} 11/07/2021 05:36:17 - INFO - __main__ - Step 59733: {'lr': 0.00033462456115020405, 'samples': 11468736, 'steps': 59732, 'loss/train': 1.8561713695526123} 11/07/2021 05:36:17 - INFO - __main__ - Step 59734: {'lr': 0.0003346195676597088, 'samples': 11468928, 'steps': 59733, 'loss/train': 1.4148104190826416} 11/07/2021 05:36:17 - INFO - __main__ - Step 59735: {'lr': 0.00033461457413108524, 'samples': 11469120, 'steps': 59734, 'loss/train': 1.4156752824783325} 11/07/2021 05:36:18 - INFO - __main__ - Step 59736: {'lr': 0.00033460958056433574, 'samples': 11469312, 'steps': 59735, 'loss/train': 1.3613808155059814} 11/07/2021 05:36:19 - INFO - __main__ - Step 59737: {'lr': 0.00033460458695946244, 'samples': 11469504, 'steps': 59736, 'loss/train': 1.300649881362915} 11/07/2021 05:36:19 - INFO - __main__ - Step 59738: {'lr': 0.0003345995933164676, 'samples': 11469696, 'steps': 59737, 'loss/train': 1.3083850145339966} 11/07/2021 05:36:19 - INFO - __main__ - Step 59739: {'lr': 0.0003345945996353535, 'samples': 11469888, 'steps': 59738, 'loss/train': 1.737492561340332} 11/07/2021 05:36:20 - INFO - __main__ - Step 59740: {'lr': 0.0003345896059161224, 'samples': 11470080, 'steps': 59739, 'loss/train': 1.0234007835388184} 11/07/2021 05:36:20 - INFO - __main__ - Step 59741: {'lr': 0.00033458461215877644, 'samples': 11470272, 'steps': 59740, 'loss/train': 1.4739450216293335} 11/07/2021 05:36:21 - INFO - __main__ - Step 59742: {'lr': 0.000334579618363318, 'samples': 11470464, 'steps': 59741, 'loss/train': 1.3406312465667725} 11/07/2021 05:36:21 - INFO - __main__ - Step 59743: {'lr': 0.0003345746245297494, 'samples': 11470656, 'steps': 59742, 'loss/train': 0.7481667399406433} 11/07/2021 05:36:22 - INFO - __main__ - Step 59744: {'lr': 0.00033456963065807264, 'samples': 11470848, 'steps': 59743, 'loss/train': 1.0869067907333374} 11/07/2021 05:36:22 - INFO - __main__ - Step 59745: {'lr': 0.0003345646367482902, 'samples': 11471040, 'steps': 59744, 'loss/train': 1.4960869550704956} 11/07/2021 05:36:22 - INFO - __main__ - Step 59746: {'lr': 0.00033455964280040417, 'samples': 11471232, 'steps': 59745, 'loss/train': 1.6462482213974} 11/07/2021 05:36:24 - INFO - __main__ - Step 59747: {'lr': 0.0003345546488144169, 'samples': 11471424, 'steps': 59746, 'loss/train': 1.2318847179412842} 11/07/2021 05:36:24 - INFO - __main__ - Step 59748: {'lr': 0.0003345496547903306, 'samples': 11471616, 'steps': 59747, 'loss/train': 1.2419209480285645} 11/07/2021 05:36:24 - INFO - __main__ - Step 59749: {'lr': 0.0003345446607281475, 'samples': 11471808, 'steps': 59748, 'loss/train': 0.16311971843242645} 11/07/2021 05:36:25 - INFO - __main__ - Step 59750: {'lr': 0.00033453966662786995, 'samples': 11472000, 'steps': 59749, 'loss/train': 1.2368395328521729} 11/07/2021 05:36:25 - INFO - __main__ - Step 59751: {'lr': 0.0003345346724895001, 'samples': 11472192, 'steps': 59750, 'loss/train': 1.3162906169891357} 11/07/2021 05:36:26 - INFO - __main__ - Step 59752: {'lr': 0.0003345296783130402, 'samples': 11472384, 'steps': 59751, 'loss/train': 1.7774964570999146} 11/07/2021 05:36:26 - INFO - __main__ - Step 59753: {'lr': 0.0003345246840984926, 'samples': 11472576, 'steps': 59752, 'loss/train': 1.8524725437164307} 11/07/2021 05:36:27 - INFO - __main__ - Step 59754: {'lr': 0.0003345196898458594, 'samples': 11472768, 'steps': 59753, 'loss/train': 1.4533371925354004} 11/07/2021 05:36:27 - INFO - __main__ - Step 59755: {'lr': 0.00033451469555514294, 'samples': 11472960, 'steps': 59754, 'loss/train': 1.2159703969955444} 11/07/2021 05:36:27 - INFO - __main__ - Step 59756: {'lr': 0.0003345097012263456, 'samples': 11473152, 'steps': 59755, 'loss/train': 1.1995917558670044} 11/07/2021 05:36:29 - INFO - __main__ - Step 59757: {'lr': 0.0003345047068594694, 'samples': 11473344, 'steps': 59756, 'loss/train': 1.2755703926086426} 11/07/2021 05:36:29 - INFO - __main__ - Step 59758: {'lr': 0.0003344997124545166, 'samples': 11473536, 'steps': 59757, 'loss/train': 1.3318367004394531} 11/07/2021 05:36:29 - INFO - __main__ - Step 59759: {'lr': 0.00033449471801148963, 'samples': 11473728, 'steps': 59758, 'loss/train': 2.1833062171936035} 11/07/2021 05:36:30 - INFO - __main__ - Step 59760: {'lr': 0.00033448972353039065, 'samples': 11473920, 'steps': 59759, 'loss/train': 1.5835164785385132} 11/07/2021 05:36:30 - INFO - __main__ - Step 59761: {'lr': 0.00033448472901122185, 'samples': 11474112, 'steps': 59760, 'loss/train': 1.106222152709961} 11/07/2021 05:36:31 - INFO - __main__ - Step 59762: {'lr': 0.0003344797344539855, 'samples': 11474304, 'steps': 59761, 'loss/train': 1.3897550106048584} 11/07/2021 05:36:31 - INFO - __main__ - Step 59763: {'lr': 0.000334474739858684, 'samples': 11474496, 'steps': 59762, 'loss/train': 0.879180371761322} 11/07/2021 05:36:32 - INFO - __main__ - Step 59764: {'lr': 0.0003344697452253195, 'samples': 11474688, 'steps': 59763, 'loss/train': 1.3495070934295654} 11/07/2021 05:36:32 - INFO - __main__ - Step 59765: {'lr': 0.00033446475055389413, 'samples': 11474880, 'steps': 59764, 'loss/train': 1.703831434249878} 11/07/2021 05:36:32 - INFO - __main__ - Step 59766: {'lr': 0.00033445975584441023, 'samples': 11475072, 'steps': 59765, 'loss/train': 1.5049314498901367} 11/07/2021 05:36:33 - INFO - __main__ - Step 59767: {'lr': 0.00033445476109687013, 'samples': 11475264, 'steps': 59766, 'loss/train': 1.7458064556121826} 11/07/2021 05:36:34 - INFO - __main__ - Step 59768: {'lr': 0.000334449766311276, 'samples': 11475456, 'steps': 59767, 'loss/train': 1.2554872035980225} 11/07/2021 05:36:34 - INFO - __main__ - Step 59769: {'lr': 0.00033444477148763006, 'samples': 11475648, 'steps': 59768, 'loss/train': 0.7513704895973206} 11/07/2021 05:36:34 - INFO - __main__ - Step 59770: {'lr': 0.0003344397766259348, 'samples': 11475840, 'steps': 59769, 'loss/train': 1.1398470401763916} 11/07/2021 05:36:35 - INFO - __main__ - Step 59771: {'lr': 0.0003344347817261921, 'samples': 11476032, 'steps': 59770, 'loss/train': 1.2013721466064453} 11/07/2021 05:36:35 - INFO - __main__ - Step 59772: {'lr': 0.0003344297867884044, 'samples': 11476224, 'steps': 59771, 'loss/train': 0.7768096327781677} 11/07/2021 05:36:36 - INFO - __main__ - Step 59773: {'lr': 0.000334424791812574, 'samples': 11476416, 'steps': 59772, 'loss/train': 1.5392483472824097} 11/07/2021 05:36:37 - INFO - __main__ - Step 59774: {'lr': 0.00033441979679870305, 'samples': 11476608, 'steps': 59773, 'loss/train': 1.5798436403274536} 11/07/2021 05:36:37 - INFO - __main__ - Step 59775: {'lr': 0.00033441480174679385, 'samples': 11476800, 'steps': 59774, 'loss/train': 1.1530206203460693} 11/07/2021 05:36:37 - INFO - __main__ - Step 59776: {'lr': 0.00033440980665684866, 'samples': 11476992, 'steps': 59775, 'loss/train': 1.7026039361953735} 11/07/2021 05:36:38 - INFO - __main__ - Step 59777: {'lr': 0.00033440481152886977, 'samples': 11477184, 'steps': 59776, 'loss/train': 2.033388376235962} 11/07/2021 05:36:39 - INFO - __main__ - Step 59778: {'lr': 0.00033439981636285935, 'samples': 11477376, 'steps': 59777, 'loss/train': 0.8706176280975342} 11/07/2021 05:36:39 - INFO - __main__ - Step 59779: {'lr': 0.0003343948211588196, 'samples': 11477568, 'steps': 59778, 'loss/train': 1.307417869567871} 11/07/2021 05:36:39 - INFO - __main__ - Step 59780: {'lr': 0.00033438982591675284, 'samples': 11477760, 'steps': 59779, 'loss/train': 0.745901346206665} 11/07/2021 05:36:40 - INFO - __main__ - Step 59781: {'lr': 0.00033438483063666136, 'samples': 11477952, 'steps': 59780, 'loss/train': 1.1092472076416016} 11/07/2021 05:36:40 - INFO - __main__ - Step 59782: {'lr': 0.0003343798353185474, 'samples': 11478144, 'steps': 59781, 'loss/train': 1.6200780868530273} 11/07/2021 05:36:41 - INFO - __main__ - Step 59783: {'lr': 0.0003343748399624131, 'samples': 11478336, 'steps': 59782, 'loss/train': 0.8787214756011963} 11/07/2021 05:36:42 - INFO - __main__ - Step 59784: {'lr': 0.00033436984456826097, 'samples': 11478528, 'steps': 59783, 'loss/train': 1.573609709739685} 11/07/2021 05:36:42 - INFO - __main__ - Step 59785: {'lr': 0.000334364849136093, 'samples': 11478720, 'steps': 59784, 'loss/train': 1.411788821220398} 11/07/2021 05:36:42 - INFO - __main__ - Step 59786: {'lr': 0.0003343598536659115, 'samples': 11478912, 'steps': 59785, 'loss/train': 1.3767591714859009} 11/07/2021 05:36:43 - INFO - __main__ - Step 59787: {'lr': 0.00033435485815771875, 'samples': 11479104, 'steps': 59786, 'loss/train': 1.462869644165039} 11/07/2021 05:36:44 - INFO - __main__ - Step 59788: {'lr': 0.00033434986261151705, 'samples': 11479296, 'steps': 59787, 'loss/train': 1.5607069730758667} 11/07/2021 05:36:44 - INFO - __main__ - Step 59789: {'lr': 0.0003343448670273086, 'samples': 11479488, 'steps': 59788, 'loss/train': 1.6731963157653809} 11/07/2021 05:36:44 - INFO - __main__ - Step 59790: {'lr': 0.00033433987140509566, 'samples': 11479680, 'steps': 59789, 'loss/train': 0.9215992093086243} 11/07/2021 05:36:45 - INFO - __main__ - Step 59791: {'lr': 0.0003343348757448804, 'samples': 11479872, 'steps': 59790, 'loss/train': 1.42835533618927} 11/07/2021 05:36:45 - INFO - __main__ - Step 59792: {'lr': 0.0003343298800466652, 'samples': 11480064, 'steps': 59791, 'loss/train': 0.590241551399231} 11/07/2021 05:36:46 - INFO - __main__ - Step 59793: {'lr': 0.0003343248843104523, 'samples': 11480256, 'steps': 59792, 'loss/train': 1.6024614572525024} 11/07/2021 05:36:46 - INFO - __main__ - Step 59794: {'lr': 0.00033431988853624384, 'samples': 11480448, 'steps': 59793, 'loss/train': 1.5865528583526611} 11/07/2021 05:36:47 - INFO - __main__ - Step 59795: {'lr': 0.00033431489272404215, 'samples': 11480640, 'steps': 59794, 'loss/train': 1.3040661811828613} 11/07/2021 05:36:47 - INFO - __main__ - Step 59796: {'lr': 0.0003343098968738495, 'samples': 11480832, 'steps': 59795, 'loss/train': 1.4046781063079834} 11/07/2021 05:36:47 - INFO - __main__ - Step 59797: {'lr': 0.00033430490098566813, 'samples': 11481024, 'steps': 59796, 'loss/train': 1.5291950702667236} 11/07/2021 05:36:48 - INFO - __main__ - Step 59798: {'lr': 0.00033429990505950025, 'samples': 11481216, 'steps': 59797, 'loss/train': 1.2005763053894043} 11/07/2021 05:36:49 - INFO - __main__ - Step 59799: {'lr': 0.0003342949090953481, 'samples': 11481408, 'steps': 59798, 'loss/train': 1.9645662307739258} 11/07/2021 05:36:49 - INFO - __main__ - Step 59800: {'lr': 0.000334289913093214, 'samples': 11481600, 'steps': 59799, 'loss/train': 1.816665768623352} 11/07/2021 05:36:49 - INFO - __main__ - Step 59801: {'lr': 0.0003342849170531001, 'samples': 11481792, 'steps': 59800, 'loss/train': 1.27961266040802} 11/07/2021 05:36:50 - INFO - __main__ - Step 59802: {'lr': 0.00033427992097500876, 'samples': 11481984, 'steps': 59801, 'loss/train': 1.76497220993042} 11/07/2021 05:36:51 - INFO - __main__ - Step 59803: {'lr': 0.00033427492485894216, 'samples': 11482176, 'steps': 59802, 'loss/train': 1.549760103225708} 11/07/2021 05:36:51 - INFO - __main__ - Step 59804: {'lr': 0.0003342699287049027, 'samples': 11482368, 'steps': 59803, 'loss/train': 1.2277553081512451} 11/07/2021 05:36:51 - INFO - __main__ - Step 59805: {'lr': 0.0003342649325128924, 'samples': 11482560, 'steps': 59804, 'loss/train': 1.6004462242126465} 11/07/2021 05:36:52 - INFO - __main__ - Step 59806: {'lr': 0.00033425993628291367, 'samples': 11482752, 'steps': 59805, 'loss/train': 1.6093926429748535} 11/07/2021 05:36:52 - INFO - __main__ - Step 59807: {'lr': 0.0003342549400149687, 'samples': 11482944, 'steps': 59806, 'loss/train': 0.9378297924995422} 11/07/2021 05:36:53 - INFO - __main__ - Step 59808: {'lr': 0.0003342499437090597, 'samples': 11483136, 'steps': 59807, 'loss/train': 1.5748845338821411} 11/07/2021 05:36:54 - INFO - __main__ - Step 59809: {'lr': 0.000334244947365189, 'samples': 11483328, 'steps': 59808, 'loss/train': 1.2187085151672363} 11/07/2021 05:36:54 - INFO - __main__ - Step 59810: {'lr': 0.00033423995098335886, 'samples': 11483520, 'steps': 59809, 'loss/train': 1.9742186069488525} 11/07/2021 05:36:54 - INFO - __main__ - Step 59811: {'lr': 0.00033423495456357156, 'samples': 11483712, 'steps': 59810, 'loss/train': 1.3039319515228271} 11/07/2021 05:36:55 - INFO - __main__ - Step 59812: {'lr': 0.00033422995810582917, 'samples': 11483904, 'steps': 59811, 'loss/train': 1.3863568305969238} 11/07/2021 05:36:56 - INFO - __main__ - Step 59813: {'lr': 0.0003342249616101341, 'samples': 11484096, 'steps': 59812, 'loss/train': 1.435603380203247} 11/07/2021 05:36:56 - INFO - __main__ - Step 59814: {'lr': 0.0003342199650764886, 'samples': 11484288, 'steps': 59813, 'loss/train': 1.6086353063583374} 11/07/2021 05:36:56 - INFO - __main__ - Step 59815: {'lr': 0.0003342149685048949, 'samples': 11484480, 'steps': 59814, 'loss/train': 1.496732234954834} 11/07/2021 05:36:57 - INFO - __main__ - Step 59816: {'lr': 0.0003342099718953551, 'samples': 11484672, 'steps': 59815, 'loss/train': 1.3912758827209473} 11/07/2021 05:36:57 - INFO - __main__ - Step 59817: {'lr': 0.00033420497524787177, 'samples': 11484864, 'steps': 59816, 'loss/train': 1.4049148559570312} 11/07/2021 05:36:58 - INFO - __main__ - Step 59818: {'lr': 0.0003341999785624468, 'samples': 11485056, 'steps': 59817, 'loss/train': 1.3746356964111328} 11/07/2021 05:36:58 - INFO - __main__ - Step 59819: {'lr': 0.0003341949818390827, 'samples': 11485248, 'steps': 59818, 'loss/train': 0.8529001474380493} 11/07/2021 05:36:59 - INFO - __main__ - Step 59820: {'lr': 0.00033418998507778164, 'samples': 11485440, 'steps': 59819, 'loss/train': 1.4577114582061768} 11/07/2021 05:36:59 - INFO - __main__ - Step 59821: {'lr': 0.00033418498827854587, 'samples': 11485632, 'steps': 59820, 'loss/train': 1.522295355796814} 11/07/2021 05:36:59 - INFO - __main__ - Step 59822: {'lr': 0.0003341799914413776, 'samples': 11485824, 'steps': 59821, 'loss/train': 1.6530967950820923} 11/07/2021 05:37:01 - INFO - __main__ - Step 59823: {'lr': 0.0003341749945662792, 'samples': 11486016, 'steps': 59822, 'loss/train': 1.5433629751205444} 11/07/2021 05:37:01 - INFO - __main__ - Step 59824: {'lr': 0.00033416999765325286, 'samples': 11486208, 'steps': 59823, 'loss/train': 1.4311084747314453} 11/07/2021 05:37:01 - INFO - __main__ - Step 59825: {'lr': 0.0003341650007023008, 'samples': 11486400, 'steps': 59824, 'loss/train': 1.3693621158599854} 11/07/2021 05:37:02 - INFO - __main__ - Step 59826: {'lr': 0.0003341600037134252, 'samples': 11486592, 'steps': 59825, 'loss/train': 1.2301878929138184} 11/07/2021 05:37:02 - INFO - __main__ - Step 59827: {'lr': 0.00033415500668662845, 'samples': 11486784, 'steps': 59826, 'loss/train': 1.6107709407806396} 11/07/2021 05:37:02 - INFO - __main__ - Step 59828: {'lr': 0.00033415000962191277, 'samples': 11486976, 'steps': 59827, 'loss/train': 1.178155779838562} 11/07/2021 05:37:03 - INFO - __main__ - Step 59829: {'lr': 0.0003341450125192804, 'samples': 11487168, 'steps': 59828, 'loss/train': 1.3227167129516602} 11/07/2021 05:37:04 - INFO - __main__ - Step 59830: {'lr': 0.0003341400153787336, 'samples': 11487360, 'steps': 59829, 'loss/train': 1.284824013710022} 11/07/2021 05:37:04 - INFO - __main__ - Step 59831: {'lr': 0.00033413501820027456, 'samples': 11487552, 'steps': 59830, 'loss/train': 1.378790259361267} 11/07/2021 05:37:04 - INFO - __main__ - Step 59832: {'lr': 0.00033413002098390567, 'samples': 11487744, 'steps': 59831, 'loss/train': 0.932711124420166} 11/07/2021 05:37:05 - INFO - __main__ - Step 59833: {'lr': 0.00033412502372962894, 'samples': 11487936, 'steps': 59832, 'loss/train': 1.0881677865982056} 11/07/2021 05:37:06 - INFO - __main__ - Step 59834: {'lr': 0.0003341200264374469, 'samples': 11488128, 'steps': 59833, 'loss/train': 1.1093249320983887} 11/07/2021 05:37:06 - INFO - __main__ - Step 59835: {'lr': 0.0003341150291073616, 'samples': 11488320, 'steps': 59834, 'loss/train': 1.4371024370193481} 11/07/2021 05:37:06 - INFO - __main__ - Step 59836: {'lr': 0.0003341100317393754, 'samples': 11488512, 'steps': 59835, 'loss/train': 1.6734120845794678} 11/07/2021 05:37:07 - INFO - __main__ - Step 59837: {'lr': 0.00033410503433349055, 'samples': 11488704, 'steps': 59836, 'loss/train': 1.3474565744400024} 11/07/2021 05:37:07 - INFO - __main__ - Step 59838: {'lr': 0.00033410003688970927, 'samples': 11488896, 'steps': 59837, 'loss/train': 3.1182456016540527} 11/07/2021 05:37:08 - INFO - __main__ - Step 59839: {'lr': 0.0003340950394080337, 'samples': 11489088, 'steps': 59838, 'loss/train': 1.438690423965454} 11/07/2021 05:37:09 - INFO - __main__ - Step 59840: {'lr': 0.0003340900418884663, 'samples': 11489280, 'steps': 59839, 'loss/train': 1.6547439098358154} 11/07/2021 05:37:09 - INFO - __main__ - Step 59841: {'lr': 0.00033408504433100916, 'samples': 11489472, 'steps': 59840, 'loss/train': 1.5318498611450195} 11/07/2021 05:37:09 - INFO - __main__ - Step 59842: {'lr': 0.0003340800467356647, 'samples': 11489664, 'steps': 59841, 'loss/train': 0.697304368019104} 11/07/2021 05:37:10 - INFO - __main__ - Step 59843: {'lr': 0.00033407504910243504, 'samples': 11489856, 'steps': 59842, 'loss/train': 1.1265060901641846} 11/07/2021 05:37:10 - INFO - __main__ - Step 59844: {'lr': 0.0003340700514313224, 'samples': 11490048, 'steps': 59843, 'loss/train': 1.5800620317459106} 11/07/2021 05:37:11 - INFO - __main__ - Step 59845: {'lr': 0.0003340650537223291, 'samples': 11490240, 'steps': 59844, 'loss/train': 1.636988878250122} 11/07/2021 05:37:11 - INFO - __main__ - Step 59846: {'lr': 0.0003340600559754574, 'samples': 11490432, 'steps': 59845, 'loss/train': 2.1109626293182373} 11/07/2021 05:37:12 - INFO - __main__ - Step 59847: {'lr': 0.0003340550581907095, 'samples': 11490624, 'steps': 59846, 'loss/train': 1.8847805261611938} 11/07/2021 05:37:12 - INFO - __main__ - Step 59848: {'lr': 0.0003340500603680878, 'samples': 11490816, 'steps': 59847, 'loss/train': 1.22675359249115} 11/07/2021 05:37:12 - INFO - __main__ - Step 59849: {'lr': 0.00033404506250759436, 'samples': 11491008, 'steps': 59848, 'loss/train': 1.5579403638839722} 11/07/2021 05:37:13 - INFO - __main__ - Step 59850: {'lr': 0.0003340400646092315, 'samples': 11491200, 'steps': 59849, 'loss/train': 1.0249091386795044} 11/07/2021 05:37:14 - INFO - __main__ - Step 59851: {'lr': 0.0003340350666730015, 'samples': 11491392, 'steps': 59850, 'loss/train': 1.398000717163086} 11/07/2021 05:37:14 - INFO - __main__ - Step 59852: {'lr': 0.0003340300686989066, 'samples': 11491584, 'steps': 59851, 'loss/train': 1.31829035282135} 11/07/2021 05:37:15 - INFO - __main__ - Step 59853: {'lr': 0.0003340250706869491, 'samples': 11491776, 'steps': 59852, 'loss/train': 1.0441431999206543} 11/07/2021 05:37:15 - INFO - __main__ - Step 59854: {'lr': 0.00033402007263713115, 'samples': 11491968, 'steps': 59853, 'loss/train': 1.3283215761184692} 11/07/2021 05:37:16 - INFO - __main__ - Step 59855: {'lr': 0.000334015074549455, 'samples': 11492160, 'steps': 59854, 'loss/train': 1.2623116970062256} 11/07/2021 05:37:16 - INFO - __main__ - Step 59856: {'lr': 0.000334010076423923, 'samples': 11492352, 'steps': 59855, 'loss/train': 1.3793851137161255} 11/07/2021 05:37:17 - INFO - __main__ - Step 59857: {'lr': 0.00033400507826053733, 'samples': 11492544, 'steps': 59856, 'loss/train': 1.8009610176086426} 11/07/2021 05:37:17 - INFO - __main__ - Step 59858: {'lr': 0.0003340000800593004, 'samples': 11492736, 'steps': 59857, 'loss/train': 0.09556188434362411} 11/07/2021 05:37:17 - INFO - __main__ - Step 59859: {'lr': 0.0003339950818202142, 'samples': 11492928, 'steps': 59858, 'loss/train': 1.0893874168395996} 11/07/2021 05:37:18 - INFO - __main__ - Step 59860: {'lr': 0.00033399008354328106, 'samples': 11493120, 'steps': 59859, 'loss/train': 1.4364213943481445} 11/07/2021 05:37:19 - INFO - __main__ - Step 59861: {'lr': 0.0003339850852285034, 'samples': 11493312, 'steps': 59860, 'loss/train': 1.5607722997665405} 11/07/2021 05:37:19 - INFO - __main__ - Step 59862: {'lr': 0.00033398008687588333, 'samples': 11493504, 'steps': 59861, 'loss/train': 1.3319916725158691} 11/07/2021 05:37:19 - INFO - __main__ - Step 59863: {'lr': 0.00033397508848542306, 'samples': 11493696, 'steps': 59862, 'loss/train': 1.4987218379974365} 11/07/2021 05:37:20 - INFO - __main__ - Step 59864: {'lr': 0.000333970090057125, 'samples': 11493888, 'steps': 59863, 'loss/train': 1.1037565469741821} 11/07/2021 05:37:21 - INFO - __main__ - Step 59865: {'lr': 0.00033396509159099133, 'samples': 11494080, 'steps': 59864, 'loss/train': 1.326676845550537} 11/07/2021 05:37:21 - INFO - __main__ - Step 59866: {'lr': 0.00033396009308702426, 'samples': 11494272, 'steps': 59865, 'loss/train': 1.9202079772949219} 11/07/2021 05:37:22 - INFO - __main__ - Step 59867: {'lr': 0.000333955094545226, 'samples': 11494464, 'steps': 59866, 'loss/train': 1.259358286857605} 11/07/2021 05:37:22 - INFO - __main__ - Step 59868: {'lr': 0.00033395009596559887, 'samples': 11494656, 'steps': 59867, 'loss/train': 1.3198281526565552} 11/07/2021 05:37:22 - INFO - __main__ - Step 59869: {'lr': 0.00033394509734814516, 'samples': 11494848, 'steps': 59868, 'loss/train': 1.099117636680603} 11/07/2021 05:37:23 - INFO - __main__ - Step 59870: {'lr': 0.0003339400986928671, 'samples': 11495040, 'steps': 59869, 'loss/train': 1.5058186054229736} 11/07/2021 05:37:24 - INFO - __main__ - Step 59871: {'lr': 0.000333935099999767, 'samples': 11495232, 'steps': 59870, 'loss/train': 1.140355110168457} 11/07/2021 05:37:24 - INFO - __main__ - Step 59872: {'lr': 0.00033393010126884696, 'samples': 11495424, 'steps': 59871, 'loss/train': 2.3761146068573} 11/07/2021 05:37:24 - INFO - __main__ - Step 59873: {'lr': 0.00033392510250010926, 'samples': 11495616, 'steps': 59872, 'loss/train': 1.3409818410873413} 11/07/2021 05:37:25 - INFO - __main__ - Step 59874: {'lr': 0.00033392010369355627, 'samples': 11495808, 'steps': 59873, 'loss/train': 1.6499170064926147} 11/07/2021 05:37:25 - INFO - __main__ - Step 59875: {'lr': 0.00033391510484919015, 'samples': 11496000, 'steps': 59874, 'loss/train': 0.8844736814498901} 11/07/2021 05:37:26 - INFO - __main__ - Step 59876: {'lr': 0.00033391010596701314, 'samples': 11496192, 'steps': 59875, 'loss/train': 1.285159945487976} 11/07/2021 05:37:27 - INFO - __main__ - Step 59877: {'lr': 0.0003339051070470276, 'samples': 11496384, 'steps': 59876, 'loss/train': 2.0157523155212402} 11/07/2021 05:37:27 - INFO - __main__ - Step 59878: {'lr': 0.00033390010808923573, 'samples': 11496576, 'steps': 59877, 'loss/train': 1.7896143198013306} 11/07/2021 05:37:27 - INFO - __main__ - Step 59879: {'lr': 0.00033389510909363974, 'samples': 11496768, 'steps': 59878, 'loss/train': 1.4786338806152344} 11/07/2021 05:37:28 - INFO - __main__ - Step 59880: {'lr': 0.00033389011006024183, 'samples': 11496960, 'steps': 59879, 'loss/train': 1.577348232269287} 11/07/2021 05:37:28 - INFO - __main__ - Step 59881: {'lr': 0.0003338851109890444, 'samples': 11497152, 'steps': 59880, 'loss/train': 0.7143099904060364} 11/07/2021 05:37:29 - INFO - __main__ - Step 59882: {'lr': 0.00033388011188004965, 'samples': 11497344, 'steps': 59881, 'loss/train': 1.6632063388824463} 11/07/2021 05:37:29 - INFO - __main__ - Step 59883: {'lr': 0.00033387511273325976, 'samples': 11497536, 'steps': 59882, 'loss/train': 1.4234802722930908} 11/07/2021 05:37:30 - INFO - __main__ - Step 59884: {'lr': 0.0003338701135486771, 'samples': 11497728, 'steps': 59883, 'loss/train': 2.193471908569336} 11/07/2021 05:37:30 - INFO - __main__ - Step 59885: {'lr': 0.0003338651143263038, 'samples': 11497920, 'steps': 59884, 'loss/train': 1.4223037958145142} 11/07/2021 05:37:31 - INFO - __main__ - Step 59886: {'lr': 0.0003338601150661423, 'samples': 11498112, 'steps': 59885, 'loss/train': 0.952494740486145} 11/07/2021 05:37:31 - INFO - __main__ - Step 59887: {'lr': 0.0003338551157681946, 'samples': 11498304, 'steps': 59886, 'loss/train': 1.3072333335876465} 11/07/2021 05:37:32 - INFO - __main__ - Step 59888: {'lr': 0.00033385011643246313, 'samples': 11498496, 'steps': 59887, 'loss/train': 0.570316731929779} 11/07/2021 05:37:32 - INFO - __main__ - Step 59889: {'lr': 0.00033384511705895003, 'samples': 11498688, 'steps': 59888, 'loss/train': 1.7107895612716675} 11/07/2021 05:37:33 - INFO - __main__ - Step 59890: {'lr': 0.00033384011764765764, 'samples': 11498880, 'steps': 59889, 'loss/train': 1.3058995008468628} 11/07/2021 05:37:33 - INFO - __main__ - Step 59891: {'lr': 0.0003338351181985882, 'samples': 11499072, 'steps': 59890, 'loss/train': 1.1536494493484497} 11/07/2021 05:37:33 - INFO - __main__ - Step 59892: {'lr': 0.000333830118711744, 'samples': 11499264, 'steps': 59891, 'loss/train': 1.3105016946792603} 11/07/2021 05:37:34 - INFO - __main__ - Step 59893: {'lr': 0.00033382511918712723, 'samples': 11499456, 'steps': 59892, 'loss/train': 1.7648695707321167} 11/07/2021 05:37:35 - INFO - __main__ - Step 59894: {'lr': 0.00033382011962474004, 'samples': 11499648, 'steps': 59893, 'loss/train': 1.2767893075942993} 11/07/2021 05:37:35 - INFO - __main__ - Step 59895: {'lr': 0.0003338151200245849, 'samples': 11499840, 'steps': 59894, 'loss/train': 1.6090312004089355} 11/07/2021 05:37:35 - INFO - __main__ - Step 59896: {'lr': 0.000333810120386664, 'samples': 11500032, 'steps': 59895, 'loss/train': 1.5503777265548706} 11/07/2021 05:37:36 - INFO - __main__ - Step 59897: {'lr': 0.00033380512071097947, 'samples': 11500224, 'steps': 59896, 'loss/train': 1.195521593093872} 11/07/2021 05:37:37 - INFO - __main__ - Step 59898: {'lr': 0.00033380012099753364, 'samples': 11500416, 'steps': 59897, 'loss/train': 1.4737271070480347} 11/07/2021 05:37:37 - INFO - __main__ - Step 59899: {'lr': 0.00033379512124632885, 'samples': 11500608, 'steps': 59898, 'loss/train': 1.5121731758117676} 11/07/2021 05:37:38 - INFO - __main__ - Step 59900: {'lr': 0.0003337901214573672, 'samples': 11500800, 'steps': 59899, 'loss/train': 1.6407800912857056} 11/07/2021 05:37:38 - INFO - __main__ - Step 59901: {'lr': 0.000333785121630651, 'samples': 11500992, 'steps': 59900, 'loss/train': 1.5833220481872559} 11/07/2021 05:37:38 - INFO - __main__ - Step 59902: {'lr': 0.0003337801217661826, 'samples': 11501184, 'steps': 59901, 'loss/train': 1.5912500619888306} 11/07/2021 05:37:40 - INFO - __main__ - Step 59903: {'lr': 0.0003337751218639641, 'samples': 11501376, 'steps': 59902, 'loss/train': 1.6189994812011719} 11/07/2021 05:37:40 - INFO - __main__ - Step 59904: {'lr': 0.0003337701219239978, 'samples': 11501568, 'steps': 59903, 'loss/train': 1.056644082069397} 11/07/2021 05:37:40 - INFO - __main__ - Step 59905: {'lr': 0.00033376512194628605, 'samples': 11501760, 'steps': 59904, 'loss/train': 1.5313489437103271} 11/07/2021 05:37:41 - INFO - __main__ - Step 59906: {'lr': 0.000333760121930831, 'samples': 11501952, 'steps': 59905, 'loss/train': 1.311906099319458} 11/07/2021 05:37:41 - INFO - __main__ - Step 59907: {'lr': 0.0003337551218776349, 'samples': 11502144, 'steps': 59906, 'loss/train': 1.4480106830596924} 11/07/2021 05:37:42 - INFO - __main__ - Step 59908: {'lr': 0.0003337501217867001, 'samples': 11502336, 'steps': 59907, 'loss/train': 0.07434665411710739} 11/07/2021 05:37:42 - INFO - __main__ - Step 59909: {'lr': 0.00033374512165802874, 'samples': 11502528, 'steps': 59908, 'loss/train': 1.4887604713439941} 11/07/2021 05:37:43 - INFO - __main__ - Step 59910: {'lr': 0.00033374012149162314, 'samples': 11502720, 'steps': 59909, 'loss/train': 1.267632007598877} 11/07/2021 05:37:43 - INFO - __main__ - Step 59911: {'lr': 0.0003337351212874856, 'samples': 11502912, 'steps': 59910, 'loss/train': 1.438001036643982} 11/07/2021 05:37:43 - INFO - __main__ - Step 59912: {'lr': 0.00033373012104561815, 'samples': 11503104, 'steps': 59911, 'loss/train': 1.8732619285583496} 11/07/2021 05:37:44 - INFO - __main__ - Step 59913: {'lr': 0.0003337251207660233, 'samples': 11503296, 'steps': 59912, 'loss/train': 1.391196608543396} 11/07/2021 05:37:45 - INFO - __main__ - Step 59914: {'lr': 0.00033372012044870317, 'samples': 11503488, 'steps': 59913, 'loss/train': 1.5033156871795654} 11/07/2021 05:37:45 - INFO - __main__ - Step 59915: {'lr': 0.00033371512009366006, 'samples': 11503680, 'steps': 59914, 'loss/train': 1.3974770307540894} 11/07/2021 05:37:45 - INFO - __main__ - Step 59916: {'lr': 0.0003337101197008962, 'samples': 11503872, 'steps': 59915, 'loss/train': 1.5923370122909546} 11/07/2021 05:37:46 - INFO - __main__ - Step 59917: {'lr': 0.00033370511927041386, 'samples': 11504064, 'steps': 59916, 'loss/train': 1.885499358177185} 11/07/2021 05:37:47 - INFO - __main__ - Step 59918: {'lr': 0.0003337001188022153, 'samples': 11504256, 'steps': 59917, 'loss/train': 1.192004919052124} 11/07/2021 05:37:47 - INFO - __main__ - Step 59919: {'lr': 0.0003336951182963027, 'samples': 11504448, 'steps': 59918, 'loss/train': 1.1246466636657715} 11/07/2021 05:37:48 - INFO - __main__ - Step 59920: {'lr': 0.0003336901177526784, 'samples': 11504640, 'steps': 59919, 'loss/train': 1.2675156593322754} 11/07/2021 05:37:48 - INFO - __main__ - Step 59921: {'lr': 0.0003336851171713447, 'samples': 11504832, 'steps': 59920, 'loss/train': 1.1757452487945557} 11/07/2021 05:37:48 - INFO - __main__ - Step 59922: {'lr': 0.00033368011655230366, 'samples': 11505024, 'steps': 59921, 'loss/train': 1.2680805921554565} 11/07/2021 05:37:49 - INFO - __main__ - Step 59923: {'lr': 0.0003336751158955577, 'samples': 11505216, 'steps': 59922, 'loss/train': 1.0946545600891113} 11/07/2021 05:37:50 - INFO - __main__ - Step 59924: {'lr': 0.00033367011520110906, 'samples': 11505408, 'steps': 59923, 'loss/train': 1.6121041774749756} 11/07/2021 05:37:50 - INFO - __main__ - Step 59925: {'lr': 0.00033366511446896, 'samples': 11505600, 'steps': 59924, 'loss/train': 1.5153909921646118} 11/07/2021 05:37:50 - INFO - __main__ - Step 59926: {'lr': 0.0003336601136991126, 'samples': 11505792, 'steps': 59925, 'loss/train': 0.9676465392112732} 11/07/2021 05:37:51 - INFO - __main__ - Step 59927: {'lr': 0.0003336551128915693, 'samples': 11505984, 'steps': 59926, 'loss/train': 1.4178122282028198} 11/07/2021 05:37:51 - INFO - __main__ - Step 59928: {'lr': 0.00033365011204633234, 'samples': 11506176, 'steps': 59927, 'loss/train': 1.9650468826293945} 11/07/2021 05:37:52 - INFO - __main__ - Step 59929: {'lr': 0.0003336451111634038, 'samples': 11506368, 'steps': 59928, 'loss/train': 0.666279673576355} 11/07/2021 05:37:53 - INFO - __main__ - Step 59930: {'lr': 0.00033364011024278616, 'samples': 11506560, 'steps': 59929, 'loss/train': 1.2978079319000244} 11/07/2021 05:37:53 - INFO - __main__ - Step 59931: {'lr': 0.0003336351092844816, 'samples': 11506752, 'steps': 59930, 'loss/train': 1.436763882637024} 11/07/2021 05:37:53 - INFO - __main__ - Step 59932: {'lr': 0.0003336301082884924, 'samples': 11506944, 'steps': 59931, 'loss/train': 1.1624747514724731} 11/07/2021 05:37:54 - INFO - __main__ - Step 59933: {'lr': 0.00033362510725482063, 'samples': 11507136, 'steps': 59932, 'loss/train': 1.6544556617736816} 11/07/2021 05:37:55 - INFO - __main__ - Step 59934: {'lr': 0.0003336201061834687, 'samples': 11507328, 'steps': 59933, 'loss/train': 1.6201013326644897} 11/07/2021 05:37:55 - INFO - __main__ - Step 59935: {'lr': 0.0003336151050744389, 'samples': 11507520, 'steps': 59934, 'loss/train': 1.6059478521347046} 11/07/2021 05:37:55 - INFO - __main__ - Step 59936: {'lr': 0.00033361010392773336, 'samples': 11507712, 'steps': 59935, 'loss/train': 1.707397222518921} 11/07/2021 05:37:56 - INFO - __main__ - Step 59937: {'lr': 0.0003336051027433544, 'samples': 11507904, 'steps': 59936, 'loss/train': 0.16855694353580475} 11/07/2021 05:37:56 - INFO - __main__ - Step 59938: {'lr': 0.00033360010152130436, 'samples': 11508096, 'steps': 59937, 'loss/train': 1.3239763975143433} 11/07/2021 05:37:57 - INFO - __main__ - Step 59939: {'lr': 0.00033359510026158534, 'samples': 11508288, 'steps': 59938, 'loss/train': 1.1314833164215088} 11/07/2021 05:37:58 - INFO - __main__ - Step 59940: {'lr': 0.00033359009896419966, 'samples': 11508480, 'steps': 59939, 'loss/train': 1.8635438680648804} 11/07/2021 05:37:58 - INFO - __main__ - Step 59941: {'lr': 0.00033358509762914957, 'samples': 11508672, 'steps': 59940, 'loss/train': 1.7521969079971313} 11/07/2021 05:37:59 - INFO - __main__ - Step 59942: {'lr': 0.0003335800962564374, 'samples': 11508864, 'steps': 59941, 'loss/train': 0.7541856169700623} 11/07/2021 05:37:59 - INFO - __main__ - Step 59943: {'lr': 0.0003335750948460652, 'samples': 11509056, 'steps': 59942, 'loss/train': 1.1702239513397217} 11/07/2021 05:37:59 - INFO - __main__ - Step 59944: {'lr': 0.0003335700933980354, 'samples': 11509248, 'steps': 59943, 'loss/train': 1.7087632417678833} 11/07/2021 05:38:00 - INFO - __main__ - Step 59945: {'lr': 0.0003335650919123503, 'samples': 11509440, 'steps': 59944, 'loss/train': 1.7990319728851318} 11/07/2021 05:38:01 - INFO - __main__ - Step 59946: {'lr': 0.0003335600903890119, 'samples': 11509632, 'steps': 59945, 'loss/train': 1.8563764095306396} 11/07/2021 05:38:01 - INFO - __main__ - Step 59947: {'lr': 0.0003335550888280227, 'samples': 11509824, 'steps': 59946, 'loss/train': 1.5483802556991577} 11/07/2021 05:38:01 - INFO - __main__ - Step 59948: {'lr': 0.00033355008722938485, 'samples': 11510016, 'steps': 59947, 'loss/train': 0.8849313855171204} 11/07/2021 05:38:02 - INFO - __main__ - Step 59949: {'lr': 0.0003335450855931006, 'samples': 11510208, 'steps': 59948, 'loss/train': 1.6003526449203491} 11/07/2021 05:38:03 - INFO - __main__ - Step 59950: {'lr': 0.00033354008391917224, 'samples': 11510400, 'steps': 59949, 'loss/train': 1.5374506711959839} 11/07/2021 05:38:03 - INFO - __main__ - Step 59951: {'lr': 0.00033353508220760204, 'samples': 11510592, 'steps': 59950, 'loss/train': 1.5414222478866577} 11/07/2021 05:38:03 - INFO - __main__ - Step 59952: {'lr': 0.00033353008045839224, 'samples': 11510784, 'steps': 59951, 'loss/train': 1.1849223375320435} 11/07/2021 05:38:04 - INFO - __main__ - Step 59953: {'lr': 0.000333525078671545, 'samples': 11510976, 'steps': 59952, 'loss/train': 0.8675971031188965} 11/07/2021 05:38:04 - INFO - __main__ - Step 59954: {'lr': 0.0003335200768470627, 'samples': 11511168, 'steps': 59953, 'loss/train': 0.0594203881919384} 11/07/2021 05:38:05 - INFO - __main__ - Step 59955: {'lr': 0.0003335150749849475, 'samples': 11511360, 'steps': 59954, 'loss/train': 1.1955112218856812} 11/07/2021 05:38:05 - INFO - __main__ - Step 59956: {'lr': 0.0003335100730852017, 'samples': 11511552, 'steps': 59955, 'loss/train': 1.3597326278686523} 11/07/2021 05:38:06 - INFO - __main__ - Step 59957: {'lr': 0.0003335050711478276, 'samples': 11511744, 'steps': 59956, 'loss/train': 1.4074528217315674} 11/07/2021 05:38:06 - INFO - __main__ - Step 59958: {'lr': 0.0003335000691728273, 'samples': 11511936, 'steps': 59957, 'loss/train': 1.0630269050598145} 11/07/2021 05:38:07 - INFO - __main__ - Step 59959: {'lr': 0.0003334950671602033, 'samples': 11512128, 'steps': 59958, 'loss/train': 1.3670204877853394} 11/07/2021 05:38:08 - INFO - __main__ - Step 59960: {'lr': 0.00033349006510995766, 'samples': 11512320, 'steps': 59959, 'loss/train': 1.0078097581863403} 11/07/2021 05:38:08 - INFO - __main__ - Step 59961: {'lr': 0.00033348506302209265, 'samples': 11512512, 'steps': 59960, 'loss/train': 1.460436224937439} 11/07/2021 05:38:08 - INFO - __main__ - Step 59962: {'lr': 0.00033348006089661055, 'samples': 11512704, 'steps': 59961, 'loss/train': 1.4430351257324219} 11/07/2021 05:38:09 - INFO - __main__ - Step 59963: {'lr': 0.0003334750587335136, 'samples': 11512896, 'steps': 59962, 'loss/train': 1.1543771028518677} 11/07/2021 05:38:09 - INFO - __main__ - Step 59964: {'lr': 0.00033347005653280414, 'samples': 11513088, 'steps': 59963, 'loss/train': 1.6782102584838867} 11/07/2021 05:38:09 - INFO - __main__ - Step 59965: {'lr': 0.0003334650542944844, 'samples': 11513280, 'steps': 59964, 'loss/train': 1.6661664247512817} 11/07/2021 05:38:10 - INFO - __main__ - Step 59966: {'lr': 0.00033346005201855656, 'samples': 11513472, 'steps': 59965, 'loss/train': 1.1616204977035522} 11/07/2021 05:38:11 - INFO - __main__ - Step 59967: {'lr': 0.00033345504970502284, 'samples': 11513664, 'steps': 59966, 'loss/train': 0.9831913709640503} 11/07/2021 05:38:11 - INFO - __main__ - Step 59968: {'lr': 0.0003334500473538856, 'samples': 11513856, 'steps': 59967, 'loss/train': 1.378980278968811} 11/07/2021 05:38:11 - INFO - __main__ - Step 59969: {'lr': 0.00033344504496514703, 'samples': 11514048, 'steps': 59968, 'loss/train': 1.2002038955688477} 11/07/2021 05:38:12 - INFO - __main__ - Step 59970: {'lr': 0.0003334400425388095, 'samples': 11514240, 'steps': 59969, 'loss/train': 1.185613989830017} 11/07/2021 05:38:13 - INFO - __main__ - Step 59971: {'lr': 0.00033343504007487515, 'samples': 11514432, 'steps': 59970, 'loss/train': 1.4303959608078003} 11/07/2021 05:38:13 - INFO - __main__ - Step 59972: {'lr': 0.00033343003757334625, 'samples': 11514624, 'steps': 59971, 'loss/train': 1.2609548568725586} 11/07/2021 05:38:14 - INFO - __main__ - Step 59973: {'lr': 0.000333425035034225, 'samples': 11514816, 'steps': 59972, 'loss/train': 1.406761884689331} 11/07/2021 05:38:14 - INFO - __main__ - Step 59974: {'lr': 0.00033342003245751374, 'samples': 11515008, 'steps': 59973, 'loss/train': 1.5263012647628784} 11/07/2021 05:38:14 - INFO - __main__ - Step 59975: {'lr': 0.0003334150298432147, 'samples': 11515200, 'steps': 59974, 'loss/train': 1.870259404182434} 11/07/2021 05:38:15 - INFO - __main__ - Step 59976: {'lr': 0.00033341002719133016, 'samples': 11515392, 'steps': 59975, 'loss/train': 1.5945241451263428} 11/07/2021 05:38:16 - INFO - __main__ - Step 59977: {'lr': 0.0003334050245018624, 'samples': 11515584, 'steps': 59976, 'loss/train': 1.0896097421646118} 11/07/2021 05:38:16 - INFO - __main__ - Step 59978: {'lr': 0.00033340002177481353, 'samples': 11515776, 'steps': 59977, 'loss/train': 1.4379078149795532} 11/07/2021 05:38:16 - INFO - __main__ - Step 59979: {'lr': 0.00033339501901018595, 'samples': 11515968, 'steps': 59978, 'loss/train': 1.8286679983139038} 11/07/2021 05:38:17 - INFO - __main__ - Step 59980: {'lr': 0.0003333900162079818, 'samples': 11516160, 'steps': 59979, 'loss/train': 1.4866057634353638} 11/07/2021 05:38:18 - INFO - __main__ - Step 59981: {'lr': 0.00033338501336820347, 'samples': 11516352, 'steps': 59980, 'loss/train': 1.0197144746780396} 11/07/2021 05:38:18 - INFO - __main__ - Step 59982: {'lr': 0.0003333800104908531, 'samples': 11516544, 'steps': 59981, 'loss/train': 1.3618757724761963} 11/07/2021 05:38:18 - INFO - __main__ - Step 59983: {'lr': 0.00033337500757593306, 'samples': 11516736, 'steps': 59982, 'loss/train': 1.2537387609481812} 11/07/2021 05:38:19 - INFO - __main__ - Step 59984: {'lr': 0.0003333700046234454, 'samples': 11516928, 'steps': 59983, 'loss/train': 0.8415775299072266} 11/07/2021 05:38:19 - INFO - __main__ - Step 59985: {'lr': 0.00033336500163339255, 'samples': 11517120, 'steps': 59984, 'loss/train': 1.416278600692749} 11/07/2021 05:38:20 - INFO - __main__ - Step 59986: {'lr': 0.00033335999860577677, 'samples': 11517312, 'steps': 59985, 'loss/train': 1.417700171470642} 11/07/2021 05:38:21 - INFO - __main__ - Step 59987: {'lr': 0.0003333549955406002, 'samples': 11517504, 'steps': 59986, 'loss/train': 1.4902400970458984} 11/07/2021 05:38:21 - INFO - __main__ - Step 59988: {'lr': 0.0003333499924378652, 'samples': 11517696, 'steps': 59987, 'loss/train': 1.3031017780303955} 11/07/2021 05:38:21 - INFO - __main__ - Step 59989: {'lr': 0.00033334498929757394, 'samples': 11517888, 'steps': 59988, 'loss/train': 1.293977975845337} 11/07/2021 05:38:22 - INFO - __main__ - Step 59990: {'lr': 0.0003333399861197287, 'samples': 11518080, 'steps': 59989, 'loss/train': 0.5613684058189392} 11/07/2021 05:38:23 - INFO - __main__ - Step 59991: {'lr': 0.00033333498290433184, 'samples': 11518272, 'steps': 59990, 'loss/train': 1.4361369609832764} 11/07/2021 05:38:23 - INFO - __main__ - Step 59992: {'lr': 0.00033332997965138545, 'samples': 11518464, 'steps': 59991, 'loss/train': 1.5548150539398193} 11/07/2021 05:38:24 - INFO - __main__ - Step 59993: {'lr': 0.0003333249763608919, 'samples': 11518656, 'steps': 59992, 'loss/train': 1.6068527698516846} 11/07/2021 05:38:24 - INFO - __main__ - Step 59994: {'lr': 0.00033331997303285334, 'samples': 11518848, 'steps': 59993, 'loss/train': 0.06748134642839432} 11/07/2021 05:38:24 - INFO - __main__ - Step 59995: {'lr': 0.00033331496966727207, 'samples': 11519040, 'steps': 59994, 'loss/train': 1.4003791809082031} 11/07/2021 05:38:25 - INFO - __main__ - Step 59996: {'lr': 0.00033330996626415046, 'samples': 11519232, 'steps': 59995, 'loss/train': 1.4762942790985107} 11/07/2021 05:38:26 - INFO - __main__ - Step 59997: {'lr': 0.0003333049628234906, 'samples': 11519424, 'steps': 59996, 'loss/train': 1.3110620975494385} 11/07/2021 05:38:26 - INFO - __main__ - Step 59998: {'lr': 0.0003332999593452948, 'samples': 11519616, 'steps': 59997, 'loss/train': 1.3451272249221802} 11/07/2021 05:38:27 - INFO - __main__ - Step 59999: {'lr': 0.0003332949558295654, 'samples': 11519808, 'steps': 59998, 'loss/train': 1.0439751148223877} 11/07/2021 05:38:27 - INFO - __main__ - Step 60000: {'lr': 0.0003332899522763045, 'samples': 11520000, 'steps': 59999, 'loss/train': 1.3764021396636963} 11/07/2021 05:38:27 - INFO - __main__ - Evaluating and saving model checkpoint 11/07/2021 05:41:40 - INFO - __main__ - Step 60000: {'loss/eval': 1.3816074132919312, 'perplexity': 3.9812960624694824} 11/07/2021 05:41:56 - WARNING - huggingface_hub.repository - Several commits (4) will be pushed upstream. 11/07/2021 05:41:56 - WARNING - huggingface_hub.repository - The progress bars may be unreliable. 11/07/2021 05:42:18 - WARNING - huggingface_hub.repository - To https://huggingface.co/lvwerra/codeparrot-small 5ed0776..a409272 proud-haze-135 -> proud-haze-135 11/07/2021 05:42:19 - INFO - __main__ - Step 60001: {'lr': 0.0003332849486855144, 'samples': 11520192, 'steps': 60000, 'loss/train': 1.4027130603790283} 11/07/2021 05:42:20 - INFO - __main__ - Step 60002: {'lr': 0.0003332799450571975, 'samples': 11520384, 'steps': 60001, 'loss/train': 1.1730762720108032} 11/07/2021 05:42:20 - INFO - __main__ - Step 60003: {'lr': 0.0003332749413913558, 'samples': 11520576, 'steps': 60002, 'loss/train': 1.4289072751998901} 11/07/2021 05:42:20 - INFO - __main__ - Step 60004: {'lr': 0.0003332699376879918, 'samples': 11520768, 'steps': 60003, 'loss/train': 1.4739397764205933} 11/07/2021 05:42:21 - INFO - __main__ - Step 60005: {'lr': 0.00033326493394710764, 'samples': 11520960, 'steps': 60004, 'loss/train': 1.6397807598114014} 11/07/2021 05:42:22 - INFO - __main__ - Step 60006: {'lr': 0.0003332599301687056, 'samples': 11521152, 'steps': 60005, 'loss/train': 1.3413503170013428} 11/07/2021 05:42:22 - INFO - __main__ - Step 60007: {'lr': 0.0003332549263527879, 'samples': 11521344, 'steps': 60006, 'loss/train': 1.267769455909729} 11/07/2021 05:42:23 - INFO - __main__ - Step 60008: {'lr': 0.00033324992249935683, 'samples': 11521536, 'steps': 60007, 'loss/train': 1.0894927978515625} 11/07/2021 05:42:23 - INFO - __main__ - Step 60009: {'lr': 0.00033324491860841455, 'samples': 11521728, 'steps': 60008, 'loss/train': 1.3656480312347412} 11/07/2021 05:42:23 - INFO - __main__ - Step 60010: {'lr': 0.0003332399146799635, 'samples': 11521920, 'steps': 60009, 'loss/train': 1.2945382595062256} 11/07/2021 05:42:24 - INFO - __main__ - Step 60011: {'lr': 0.00033323491071400574, 'samples': 11522112, 'steps': 60010, 'loss/train': 0.7060404419898987} 11/07/2021 05:42:25 - INFO - __main__ - Step 60012: {'lr': 0.0003332299067105437, 'samples': 11522304, 'steps': 60011, 'loss/train': 0.04345306381583214} 11/07/2021 05:42:25 - INFO - __main__ - Step 60013: {'lr': 0.0003332249026695795, 'samples': 11522496, 'steps': 60012, 'loss/train': 1.8770544528961182} 11/07/2021 05:42:25 - INFO - __main__ - Step 60014: {'lr': 0.00033321989859111547, 'samples': 11522688, 'steps': 60013, 'loss/train': 1.2019213438034058} 11/07/2021 05:42:26 - INFO - __main__ - Step 60015: {'lr': 0.0003332148944751538, 'samples': 11522880, 'steps': 60014, 'loss/train': 1.11774742603302} 11/07/2021 05:42:26 - INFO - __main__ - Step 60016: {'lr': 0.0003332098903216968, 'samples': 11523072, 'steps': 60015, 'loss/train': 1.1836717128753662} 11/07/2021 05:42:27 - INFO - __main__ - Step 60017: {'lr': 0.00033320488613074666, 'samples': 11523264, 'steps': 60016, 'loss/train': 1.7324599027633667} 11/07/2021 05:42:27 - INFO - __main__ - Step 60018: {'lr': 0.00033319988190230575, 'samples': 11523456, 'steps': 60017, 'loss/train': 1.2083885669708252} 11/07/2021 05:42:28 - INFO - __main__ - Step 60019: {'lr': 0.00033319487763637626, 'samples': 11523648, 'steps': 60018, 'loss/train': 1.2822444438934326} 11/07/2021 05:42:28 - INFO - __main__ - Step 60020: {'lr': 0.00033318987333296043, 'samples': 11523840, 'steps': 60019, 'loss/train': 1.3512039184570312} 11/07/2021 05:42:29 - INFO - __main__ - Step 60021: {'lr': 0.0003331848689920605, 'samples': 11524032, 'steps': 60020, 'loss/train': 1.2660363912582397} 11/07/2021 05:42:29 - INFO - __main__ - Step 60022: {'lr': 0.0003331798646136788, 'samples': 11524224, 'steps': 60021, 'loss/train': 1.1435778141021729} 11/07/2021 05:42:30 - INFO - __main__ - Step 60023: {'lr': 0.00033317486019781743, 'samples': 11524416, 'steps': 60022, 'loss/train': 1.2764756679534912} 11/07/2021 05:42:30 - INFO - __main__ - Step 60024: {'lr': 0.0003331698557444788, 'samples': 11524608, 'steps': 60023, 'loss/train': 1.1031367778778076} 11/07/2021 05:42:30 - INFO - __main__ - Step 60025: {'lr': 0.00033316485125366516, 'samples': 11524800, 'steps': 60024, 'loss/train': 1.538691520690918} 11/07/2021 05:42:31 - INFO - __main__ - Step 60026: {'lr': 0.00033315984672537875, 'samples': 11524992, 'steps': 60025, 'loss/train': 1.4740277528762817} 11/07/2021 05:42:32 - INFO - __main__ - Step 60027: {'lr': 0.00033315484215962177, 'samples': 11525184, 'steps': 60026, 'loss/train': 1.3683785200119019} 11/07/2021 05:42:32 - INFO - __main__ - Step 60028: {'lr': 0.00033314983755639645, 'samples': 11525376, 'steps': 60027, 'loss/train': 1.7538186311721802} 11/07/2021 05:42:33 - INFO - __main__ - Step 60029: {'lr': 0.00033314483291570506, 'samples': 11525568, 'steps': 60028, 'loss/train': 1.5800448656082153} 11/07/2021 05:42:33 - INFO - __main__ - Step 60030: {'lr': 0.00033313982823755003, 'samples': 11525760, 'steps': 60029, 'loss/train': 0.9814277291297913} 11/07/2021 05:42:33 - INFO - __main__ - Step 60031: {'lr': 0.0003331348235219334, 'samples': 11525952, 'steps': 60030, 'loss/train': 1.55951988697052} 11/07/2021 05:42:35 - INFO - __main__ - Step 60032: {'lr': 0.0003331298187688575, 'samples': 11526144, 'steps': 60031, 'loss/train': 1.5238792896270752} 11/07/2021 05:42:35 - INFO - __main__ - Step 60033: {'lr': 0.0003331248139783246, 'samples': 11526336, 'steps': 60032, 'loss/train': 1.5183178186416626} 11/07/2021 05:42:35 - INFO - __main__ - Step 60034: {'lr': 0.0003331198091503369, 'samples': 11526528, 'steps': 60033, 'loss/train': 1.3602252006530762} 11/07/2021 05:42:36 - INFO - __main__ - Step 60035: {'lr': 0.00033311480428489674, 'samples': 11526720, 'steps': 60034, 'loss/train': 1.3004679679870605} 11/07/2021 05:42:36 - INFO - __main__ - Step 60036: {'lr': 0.0003331097993820063, 'samples': 11526912, 'steps': 60035, 'loss/train': 1.6742260456085205} 11/07/2021 05:42:36 - INFO - __main__ - Step 60037: {'lr': 0.0003331047944416679, 'samples': 11527104, 'steps': 60036, 'loss/train': 1.8161835670471191} 11/07/2021 05:42:37 - INFO - __main__ - Step 60038: {'lr': 0.00033309978946388376, 'samples': 11527296, 'steps': 60037, 'loss/train': 0.1785723865032196} 11/07/2021 05:42:38 - INFO - __main__ - Step 60039: {'lr': 0.00033309478444865613, 'samples': 11527488, 'steps': 60038, 'loss/train': 1.3815536499023438} 11/07/2021 05:42:38 - INFO - __main__ - Step 60040: {'lr': 0.00033308977939598727, 'samples': 11527680, 'steps': 60039, 'loss/train': 1.4509578943252563} 11/07/2021 05:42:38 - INFO - __main__ - Step 60041: {'lr': 0.0003330847743058795, 'samples': 11527872, 'steps': 60040, 'loss/train': 0.9885734915733337} 11/07/2021 05:42:39 - INFO - __main__ - Step 60042: {'lr': 0.00033307976917833486, 'samples': 11528064, 'steps': 60041, 'loss/train': 1.3995981216430664} 11/07/2021 05:42:40 - INFO - __main__ - Step 60043: {'lr': 0.0003330747640133558, 'samples': 11528256, 'steps': 60042, 'loss/train': 1.852107048034668} 11/07/2021 05:42:40 - INFO - __main__ - Step 60044: {'lr': 0.00033306975881094465, 'samples': 11528448, 'steps': 60043, 'loss/train': 1.2878859043121338} 11/07/2021 05:42:40 - INFO - __main__ - Step 60045: {'lr': 0.00033306475357110346, 'samples': 11528640, 'steps': 60044, 'loss/train': 1.3943895101547241} 11/07/2021 05:42:41 - INFO - __main__ - Step 60046: {'lr': 0.00033305974829383464, 'samples': 11528832, 'steps': 60045, 'loss/train': 1.4130584001541138} 11/07/2021 05:42:41 - INFO - __main__ - Step 60047: {'lr': 0.0003330547429791403, 'samples': 11529024, 'steps': 60046, 'loss/train': 0.7879865169525146} 11/07/2021 05:42:42 - INFO - __main__ - Step 60048: {'lr': 0.00033304973762702286, 'samples': 11529216, 'steps': 60047, 'loss/train': 1.8848154544830322} 11/07/2021 05:42:43 - INFO - __main__ - Step 60049: {'lr': 0.00033304473223748436, 'samples': 11529408, 'steps': 60048, 'loss/train': 1.2965070009231567} 11/07/2021 05:42:43 - INFO - __main__ - Step 60050: {'lr': 0.0003330397268105273, 'samples': 11529600, 'steps': 60049, 'loss/train': 5.723517894744873} 11/07/2021 05:42:43 - INFO - __main__ - Step 60051: {'lr': 0.00033303472134615377, 'samples': 11529792, 'steps': 60050, 'loss/train': 0.8310794234275818} 11/07/2021 05:42:44 - INFO - __main__ - Step 60052: {'lr': 0.00033302971584436603, 'samples': 11529984, 'steps': 60051, 'loss/train': 1.2260273694992065} 11/07/2021 05:42:45 - INFO - __main__ - Step 60053: {'lr': 0.00033302471030516653, 'samples': 11530176, 'steps': 60052, 'loss/train': 1.8872122764587402} 11/07/2021 05:42:45 - INFO - __main__ - Step 60054: {'lr': 0.00033301970472855724, 'samples': 11530368, 'steps': 60053, 'loss/train': 1.3108330965042114} 11/07/2021 05:42:46 - INFO - __main__ - Step 60055: {'lr': 0.00033301469911454064, 'samples': 11530560, 'steps': 60054, 'loss/train': 1.489721417427063} 11/07/2021 05:42:46 - INFO - __main__ - Step 60056: {'lr': 0.00033300969346311885, 'samples': 11530752, 'steps': 60055, 'loss/train': 0.9486181139945984} 11/07/2021 05:42:46 - INFO - __main__ - Step 60057: {'lr': 0.00033300468777429414, 'samples': 11530944, 'steps': 60056, 'loss/train': 1.1696879863739014} 11/07/2021 05:42:47 - INFO - __main__ - Step 60058: {'lr': 0.00033299968204806885, 'samples': 11531136, 'steps': 60057, 'loss/train': 1.6790357828140259} 11/07/2021 05:42:48 - INFO - __main__ - Step 60059: {'lr': 0.0003329946762844452, 'samples': 11531328, 'steps': 60058, 'loss/train': 1.4899635314941406} 11/07/2021 05:42:48 - INFO - __main__ - Step 60060: {'lr': 0.00033298967048342535, 'samples': 11531520, 'steps': 60059, 'loss/train': 1.420759677886963} 11/07/2021 05:42:48 - INFO - __main__ - Step 60061: {'lr': 0.0003329846646450117, 'samples': 11531712, 'steps': 60060, 'loss/train': 1.6790883541107178} 11/07/2021 05:42:49 - INFO - __main__ - Step 60062: {'lr': 0.00033297965876920646, 'samples': 11531904, 'steps': 60061, 'loss/train': 1.4663532972335815} 11/07/2021 05:42:50 - INFO - __main__ - Step 60063: {'lr': 0.0003329746528560118, 'samples': 11532096, 'steps': 60062, 'loss/train': 0.7939538359642029} 11/07/2021 05:42:50 - INFO - __main__ - Step 60064: {'lr': 0.00033296964690543007, 'samples': 11532288, 'steps': 60063, 'loss/train': 1.3513762950897217} 11/07/2021 05:42:51 - INFO - __main__ - Step 60065: {'lr': 0.00033296464091746346, 'samples': 11532480, 'steps': 60064, 'loss/train': 1.7557631731033325} 11/07/2021 05:42:51 - INFO - __main__ - Step 60066: {'lr': 0.0003329596348921144, 'samples': 11532672, 'steps': 60065, 'loss/train': 1.2613897323608398} 11/07/2021 05:42:51 - INFO - __main__ - Step 60067: {'lr': 0.0003329546288293849, 'samples': 11532864, 'steps': 60066, 'loss/train': 0.6459363698959351} 11/07/2021 05:42:52 - INFO - __main__ - Step 60068: {'lr': 0.0003329496227292773, 'samples': 11533056, 'steps': 60067, 'loss/train': 1.1494110822677612} 11/07/2021 05:42:53 - INFO - __main__ - Step 60069: {'lr': 0.0003329446165917939, 'samples': 11533248, 'steps': 60068, 'loss/train': 1.0682427883148193} 11/07/2021 05:42:53 - INFO - __main__ - Step 60070: {'lr': 0.00033293961041693697, 'samples': 11533440, 'steps': 60069, 'loss/train': 1.7688817977905273} 11/07/2021 05:42:53 - INFO - __main__ - Step 60071: {'lr': 0.00033293460420470873, 'samples': 11533632, 'steps': 60070, 'loss/train': 1.0781092643737793} 11/07/2021 05:42:54 - INFO - __main__ - Step 60072: {'lr': 0.0003329295979551114, 'samples': 11533824, 'steps': 60071, 'loss/train': 1.3361777067184448} 11/07/2021 05:42:54 - INFO - __main__ - Step 60073: {'lr': 0.0003329245916681473, 'samples': 11534016, 'steps': 60072, 'loss/train': 1.2227041721343994} 11/07/2021 05:42:55 - INFO - __main__ - Step 60074: {'lr': 0.00033291958534381865, 'samples': 11534208, 'steps': 60073, 'loss/train': 1.4153375625610352} 11/07/2021 05:42:55 - INFO - __main__ - Step 60075: {'lr': 0.0003329145789821277, 'samples': 11534400, 'steps': 60074, 'loss/train': 1.5498137474060059} 11/07/2021 05:42:56 - INFO - __main__ - Step 60076: {'lr': 0.00033290957258307676, 'samples': 11534592, 'steps': 60075, 'loss/train': 1.4215513467788696} 11/07/2021 05:42:56 - INFO - __main__ - Step 60077: {'lr': 0.00033290456614666804, 'samples': 11534784, 'steps': 60076, 'loss/train': 1.2427122592926025} 11/07/2021 05:42:56 - INFO - __main__ - Step 60078: {'lr': 0.0003328995596729038, 'samples': 11534976, 'steps': 60077, 'loss/train': 1.5530147552490234} 11/07/2021 05:42:58 - INFO - __main__ - Step 60079: {'lr': 0.00033289455316178626, 'samples': 11535168, 'steps': 60078, 'loss/train': 1.3919967412948608} 11/07/2021 05:42:59 - INFO - __main__ - Step 60080: {'lr': 0.00033288954661331776, 'samples': 11535360, 'steps': 60079, 'loss/train': 1.2633476257324219} 11/07/2021 05:42:59 - INFO - __main__ - Step 60081: {'lr': 0.00033288454002750045, 'samples': 11535552, 'steps': 60080, 'loss/train': 1.618464469909668} 11/07/2021 05:42:59 - INFO - __main__ - Step 60082: {'lr': 0.0003328795334043367, 'samples': 11535744, 'steps': 60081, 'loss/train': 0.5591421127319336} 11/07/2021 05:43:00 - INFO - __main__ - Step 60083: {'lr': 0.00033287452674382866, 'samples': 11535936, 'steps': 60082, 'loss/train': 1.253922462463379} 11/07/2021 05:43:00 - INFO - __main__ - Step 60084: {'lr': 0.0003328695200459787, 'samples': 11536128, 'steps': 60083, 'loss/train': 1.1268606185913086} 11/07/2021 05:43:00 - INFO - __main__ - Step 60085: {'lr': 0.00033286451331078894, 'samples': 11536320, 'steps': 60084, 'loss/train': 1.7629292011260986} 11/07/2021 05:43:01 - INFO - __main__ - Step 60086: {'lr': 0.0003328595065382618, 'samples': 11536512, 'steps': 60085, 'loss/train': 1.7556977272033691} 11/07/2021 05:43:02 - INFO - __main__ - Step 60087: {'lr': 0.00033285449972839944, 'samples': 11536704, 'steps': 60086, 'loss/train': 0.9680095314979553} 11/07/2021 05:43:02 - INFO - __main__ - Step 60088: {'lr': 0.00033284949288120403, 'samples': 11536896, 'steps': 60087, 'loss/train': 1.3183473348617554} 11/07/2021 05:43:03 - INFO - __main__ - Step 60089: {'lr': 0.00033284448599667796, 'samples': 11537088, 'steps': 60088, 'loss/train': 1.4026401042938232} 11/07/2021 05:43:03 - INFO - __main__ - Step 60090: {'lr': 0.0003328394790748234, 'samples': 11537280, 'steps': 60089, 'loss/train': 1.5954433679580688} 11/07/2021 05:43:04 - INFO - __main__ - Step 60091: {'lr': 0.0003328344721156427, 'samples': 11537472, 'steps': 60090, 'loss/train': 1.3260072469711304} 11/07/2021 05:43:04 - INFO - __main__ - Step 60092: {'lr': 0.00033282946511913806, 'samples': 11537664, 'steps': 60091, 'loss/train': 1.452695369720459} 11/07/2021 05:43:05 - INFO - __main__ - Step 60093: {'lr': 0.0003328244580853118, 'samples': 11537856, 'steps': 60092, 'loss/train': 1.6910144090652466} 11/07/2021 05:43:05 - INFO - __main__ - Step 60094: {'lr': 0.00033281945101416605, 'samples': 11538048, 'steps': 60093, 'loss/train': 1.383185625076294} 11/07/2021 05:43:05 - INFO - __main__ - Step 60095: {'lr': 0.00033281444390570317, 'samples': 11538240, 'steps': 60094, 'loss/train': 1.1259127855300903} 11/07/2021 05:43:06 - INFO - __main__ - Step 60096: {'lr': 0.0003328094367599253, 'samples': 11538432, 'steps': 60095, 'loss/train': 1.6830825805664062} 11/07/2021 05:43:07 - INFO - __main__ - Step 60097: {'lr': 0.0003328044295768349, 'samples': 11538624, 'steps': 60096, 'loss/train': 0.8142919540405273} 11/07/2021 05:43:07 - INFO - __main__ - Step 60098: {'lr': 0.00033279942235643395, 'samples': 11538816, 'steps': 60097, 'loss/train': 1.8182958364486694} 11/07/2021 05:43:07 - INFO - __main__ - Step 60099: {'lr': 0.00033279441509872495, 'samples': 11539008, 'steps': 60098, 'loss/train': 1.364698886871338} 11/07/2021 05:43:08 - INFO - __main__ - Step 60100: {'lr': 0.0003327894078037101, 'samples': 11539200, 'steps': 60099, 'loss/train': 1.6317778825759888} 11/07/2021 05:43:08 - INFO - __main__ - Step 60101: {'lr': 0.0003327844004713916, 'samples': 11539392, 'steps': 60100, 'loss/train': 1.1237514019012451} 11/07/2021 05:43:09 - INFO - __main__ - Step 60102: {'lr': 0.0003327793931017716, 'samples': 11539584, 'steps': 60101, 'loss/train': 2.4172372817993164} 11/07/2021 05:43:10 - INFO - __main__ - Step 60103: {'lr': 0.0003327743856948526, 'samples': 11539776, 'steps': 60102, 'loss/train': 1.2120815515518188} 11/07/2021 05:43:10 - INFO - __main__ - Step 60104: {'lr': 0.00033276937825063677, 'samples': 11539968, 'steps': 60103, 'loss/train': 0.29472219944000244} 11/07/2021 05:43:11 - INFO - __main__ - Step 60105: {'lr': 0.0003327643707691263, 'samples': 11540160, 'steps': 60104, 'loss/train': 1.3919295072555542} 11/07/2021 05:43:11 - INFO - __main__ - Step 60106: {'lr': 0.00033275936325032345, 'samples': 11540352, 'steps': 60105, 'loss/train': 1.3413450717926025} 11/07/2021 05:43:12 - INFO - __main__ - Step 60107: {'lr': 0.0003327543556942305, 'samples': 11540544, 'steps': 60106, 'loss/train': 1.4700926542282104} 11/07/2021 05:43:12 - INFO - __main__ - Step 60108: {'lr': 0.00033274934810084976, 'samples': 11540736, 'steps': 60107, 'loss/train': 1.7066904306411743} 11/07/2021 05:43:13 - INFO - __main__ - Step 60109: {'lr': 0.0003327443404701834, 'samples': 11540928, 'steps': 60108, 'loss/train': 1.4894722700119019} 11/07/2021 05:43:13 - INFO - __main__ - Step 60110: {'lr': 0.0003327393328022337, 'samples': 11541120, 'steps': 60109, 'loss/train': 1.1582971811294556} 11/07/2021 05:43:14 - INFO - __main__ - Step 60111: {'lr': 0.0003327343250970031, 'samples': 11541312, 'steps': 60110, 'loss/train': 1.1014586687088013} 11/07/2021 05:43:14 - INFO - __main__ - Step 60112: {'lr': 0.0003327293173544935, 'samples': 11541504, 'steps': 60111, 'loss/train': 0.7368154525756836} 11/07/2021 05:43:15 - INFO - __main__ - Step 60113: {'lr': 0.00033272430957470746, 'samples': 11541696, 'steps': 60112, 'loss/train': 0.9518160223960876} 11/07/2021 05:43:15 - INFO - __main__ - Step 60114: {'lr': 0.000332719301757647, 'samples': 11541888, 'steps': 60113, 'loss/train': 1.8234716653823853} 11/07/2021 05:43:16 - INFO - __main__ - Step 60115: {'lr': 0.00033271429390331457, 'samples': 11542080, 'steps': 60114, 'loss/train': 1.3615777492523193} 11/07/2021 05:43:16 - INFO - __main__ - Step 60116: {'lr': 0.0003327092860117124, 'samples': 11542272, 'steps': 60115, 'loss/train': 1.6237295866012573} 11/07/2021 05:43:17 - INFO - __main__ - Step 60117: {'lr': 0.00033270427808284263, 'samples': 11542464, 'steps': 60116, 'loss/train': 1.4205503463745117} 11/07/2021 05:43:17 - INFO - __main__ - Step 60118: {'lr': 0.00033269927011670764, 'samples': 11542656, 'steps': 60117, 'loss/train': 1.4304492473602295} 11/07/2021 05:43:18 - INFO - __main__ - Step 60119: {'lr': 0.0003326942621133096, 'samples': 11542848, 'steps': 60118, 'loss/train': 1.344322919845581} 11/07/2021 05:43:18 - INFO - __main__ - Step 60120: {'lr': 0.00033268925407265083, 'samples': 11543040, 'steps': 60119, 'loss/train': 1.4176051616668701} 11/07/2021 05:43:18 - INFO - __main__ - Step 60121: {'lr': 0.0003326842459947335, 'samples': 11543232, 'steps': 60120, 'loss/train': 0.057131148874759674} 11/07/2021 05:43:19 - INFO - __main__ - Step 60122: {'lr': 0.00033267923787956, 'samples': 11543424, 'steps': 60121, 'loss/train': 1.6833255290985107} 11/07/2021 05:43:20 - INFO - __main__ - Step 60123: {'lr': 0.0003326742297271325, 'samples': 11543616, 'steps': 60122, 'loss/train': 1.4128553867340088} 11/07/2021 05:43:20 - INFO - __main__ - Step 60124: {'lr': 0.0003326692215374532, 'samples': 11543808, 'steps': 60123, 'loss/train': 1.2498970031738281} 11/07/2021 05:43:21 - INFO - __main__ - Step 60125: {'lr': 0.0003326642133105245, 'samples': 11544000, 'steps': 60124, 'loss/train': 1.0101922750473022} 11/07/2021 05:43:21 - INFO - __main__ - Step 60126: {'lr': 0.0003326592050463485, 'samples': 11544192, 'steps': 60125, 'loss/train': 2.382988214492798} 11/07/2021 05:43:22 - INFO - __main__ - Step 60127: {'lr': 0.00033265419674492763, 'samples': 11544384, 'steps': 60126, 'loss/train': 1.5429669618606567} 11/07/2021 05:43:22 - INFO - __main__ - Step 60128: {'lr': 0.000332649188406264, 'samples': 11544576, 'steps': 60127, 'loss/train': 1.2830042839050293} 11/07/2021 05:43:23 - INFO - __main__ - Step 60129: {'lr': 0.00033264418003035997, 'samples': 11544768, 'steps': 60128, 'loss/train': 1.5531110763549805} 11/07/2021 05:43:23 - INFO - __main__ - Step 60130: {'lr': 0.0003326391716172177, 'samples': 11544960, 'steps': 60129, 'loss/train': 1.2202798128128052} 11/07/2021 05:43:23 - INFO - __main__ - Step 60131: {'lr': 0.00033263416316683947, 'samples': 11545152, 'steps': 60130, 'loss/train': 1.815140962600708} 11/07/2021 05:43:24 - INFO - __main__ - Step 60132: {'lr': 0.0003326291546792276, 'samples': 11545344, 'steps': 60131, 'loss/train': 0.9622929096221924} 11/07/2021 05:43:25 - INFO - __main__ - Step 60133: {'lr': 0.00033262414615438434, 'samples': 11545536, 'steps': 60132, 'loss/train': 0.7201207876205444} 11/07/2021 05:43:25 - INFO - __main__ - Step 60134: {'lr': 0.0003326191375923119, 'samples': 11545728, 'steps': 60133, 'loss/train': 1.2248847484588623} 11/07/2021 05:43:25 - INFO - __main__ - Step 60135: {'lr': 0.00033261412899301246, 'samples': 11545920, 'steps': 60134, 'loss/train': 1.3969788551330566} 11/07/2021 05:43:26 - INFO - __main__ - Step 60136: {'lr': 0.0003326091203564885, 'samples': 11546112, 'steps': 60135, 'loss/train': 1.0070995092391968} 11/07/2021 05:43:26 - INFO - __main__ - Step 60137: {'lr': 0.00033260411168274206, 'samples': 11546304, 'steps': 60136, 'loss/train': 1.187082290649414} 11/07/2021 05:43:27 - INFO - __main__ - Step 60138: {'lr': 0.00033259910297177547, 'samples': 11546496, 'steps': 60137, 'loss/train': 1.251821756362915} 11/07/2021 05:43:28 - INFO - __main__ - Step 60139: {'lr': 0.00033259409422359103, 'samples': 11546688, 'steps': 60138, 'loss/train': 1.230056643486023} 11/07/2021 05:43:28 - INFO - __main__ - Step 60140: {'lr': 0.000332589085438191, 'samples': 11546880, 'steps': 60139, 'loss/train': 0.8535072803497314} 11/07/2021 05:43:28 - INFO - __main__ - Step 60141: {'lr': 0.0003325840766155776, 'samples': 11547072, 'steps': 60140, 'loss/train': 1.0840997695922852} 11/07/2021 05:43:29 - INFO - __main__ - Step 60142: {'lr': 0.00033257906775575305, 'samples': 11547264, 'steps': 60141, 'loss/train': 1.5496935844421387} 11/07/2021 05:43:30 - INFO - __main__ - Step 60143: {'lr': 0.00033257405885871963, 'samples': 11547456, 'steps': 60142, 'loss/train': 1.8011113405227661} 11/07/2021 05:43:30 - INFO - __main__ - Step 60144: {'lr': 0.00033256904992447965, 'samples': 11547648, 'steps': 60143, 'loss/train': 0.2006329596042633} 11/07/2021 05:43:31 - INFO - __main__ - Step 60145: {'lr': 0.00033256404095303527, 'samples': 11547840, 'steps': 60144, 'loss/train': 1.3201642036437988} 11/07/2021 05:43:31 - INFO - __main__ - Step 60146: {'lr': 0.0003325590319443889, 'samples': 11548032, 'steps': 60145, 'loss/train': 1.1438864469528198} 11/07/2021 05:43:32 - INFO - __main__ - Step 60147: {'lr': 0.0003325540228985427, 'samples': 11548224, 'steps': 60146, 'loss/train': 1.7037405967712402} 11/07/2021 05:43:32 - INFO - __main__ - Step 60148: {'lr': 0.00033254901381549884, 'samples': 11548416, 'steps': 60147, 'loss/train': 1.545479416847229} 11/07/2021 05:43:33 - INFO - __main__ - Step 60149: {'lr': 0.00033254400469525974, 'samples': 11548608, 'steps': 60148, 'loss/train': 1.7037698030471802} 11/07/2021 05:43:33 - INFO - __main__ - Step 60150: {'lr': 0.0003325389955378276, 'samples': 11548800, 'steps': 60149, 'loss/train': 0.6792606115341187} 11/07/2021 05:43:33 - INFO - __main__ - Step 60151: {'lr': 0.0003325339863432046, 'samples': 11548992, 'steps': 60150, 'loss/train': 1.6015790700912476} 11/07/2021 05:43:34 - INFO - __main__ - Step 60152: {'lr': 0.00033252897711139306, 'samples': 11549184, 'steps': 60151, 'loss/train': 1.6868928670883179} 11/07/2021 05:43:35 - INFO - __main__ - Step 60153: {'lr': 0.00033252396784239535, 'samples': 11549376, 'steps': 60152, 'loss/train': 1.2827423810958862} 11/07/2021 05:43:35 - INFO - __main__ - Step 60154: {'lr': 0.0003325189585362135, 'samples': 11549568, 'steps': 60153, 'loss/train': 1.4003520011901855} 11/07/2021 05:43:35 - INFO - __main__ - Step 60155: {'lr': 0.0003325139491928499, 'samples': 11549760, 'steps': 60154, 'loss/train': 1.542554497718811} 11/07/2021 05:43:36 - INFO - __main__ - Step 60156: {'lr': 0.0003325089398123068, 'samples': 11549952, 'steps': 60155, 'loss/train': 1.2335885763168335} 11/07/2021 05:43:37 - INFO - __main__ - Step 60157: {'lr': 0.0003325039303945864, 'samples': 11550144, 'steps': 60156, 'loss/train': 1.2563700675964355} 11/07/2021 05:43:37 - INFO - __main__ - Step 60158: {'lr': 0.000332498920939691, 'samples': 11550336, 'steps': 60157, 'loss/train': 1.7701759338378906} 11/07/2021 05:43:37 - INFO - __main__ - Step 60159: {'lr': 0.000332493911447623, 'samples': 11550528, 'steps': 60158, 'loss/train': 0.8544160723686218} 11/07/2021 05:43:38 - INFO - __main__ - Step 60160: {'lr': 0.0003324889019183844, 'samples': 11550720, 'steps': 60159, 'loss/train': 1.1591763496398926} 11/07/2021 05:43:38 - INFO - __main__ - Step 60161: {'lr': 0.00033248389235197764, 'samples': 11550912, 'steps': 60160, 'loss/train': 1.493994116783142} 11/07/2021 05:43:39 - INFO - __main__ - Step 60162: {'lr': 0.00033247888274840485, 'samples': 11551104, 'steps': 60161, 'loss/train': 1.8042875528335571} 11/07/2021 05:43:40 - INFO - __main__ - Step 60163: {'lr': 0.0003324738731076683, 'samples': 11551296, 'steps': 60162, 'loss/train': 1.0496973991394043} 11/07/2021 05:43:40 - INFO - __main__ - Step 60164: {'lr': 0.0003324688634297704, 'samples': 11551488, 'steps': 60163, 'loss/train': 0.523000955581665} 11/07/2021 05:43:40 - INFO - __main__ - Step 60165: {'lr': 0.0003324638537147132, 'samples': 11551680, 'steps': 60164, 'loss/train': 1.390426754951477} 11/07/2021 05:43:41 - INFO - __main__ - Step 60166: {'lr': 0.00033245884396249916, 'samples': 11551872, 'steps': 60165, 'loss/train': 1.7877496480941772} 11/07/2021 05:43:41 - INFO - __main__ - Step 60167: {'lr': 0.0003324538341731304, 'samples': 11552064, 'steps': 60166, 'loss/train': 1.2512640953063965} 11/07/2021 05:43:42 - INFO - __main__ - Step 60168: {'lr': 0.0003324488243466092, 'samples': 11552256, 'steps': 60167, 'loss/train': 1.449568510055542} 11/07/2021 05:43:42 - INFO - __main__ - Step 60169: {'lr': 0.0003324438144829379, 'samples': 11552448, 'steps': 60168, 'loss/train': 1.214891791343689} 11/07/2021 05:43:43 - INFO - __main__ - Step 60170: {'lr': 0.0003324388045821186, 'samples': 11552640, 'steps': 60169, 'loss/train': 1.3670734167099} 11/07/2021 05:43:43 - INFO - __main__ - Step 60171: {'lr': 0.0003324337946441537, 'samples': 11552832, 'steps': 60170, 'loss/train': 1.1034481525421143} 11/07/2021 05:43:43 - INFO - __main__ - Step 60172: {'lr': 0.00033242878466904535, 'samples': 11553024, 'steps': 60171, 'loss/train': 0.9929606318473816} 11/07/2021 05:43:45 - INFO - __main__ - Step 60173: {'lr': 0.00033242377465679583, 'samples': 11553216, 'steps': 60172, 'loss/train': 1.6893740892410278} 11/07/2021 05:43:45 - INFO - __main__ - Step 60174: {'lr': 0.0003324187646074076, 'samples': 11553408, 'steps': 60173, 'loss/train': 1.3800173997879028} 11/07/2021 05:43:45 - INFO - __main__ - Step 60175: {'lr': 0.0003324137545208826, 'samples': 11553600, 'steps': 60174, 'loss/train': 1.354891061782837} 11/07/2021 05:43:46 - INFO - __main__ - Step 60176: {'lr': 0.0003324087443972233, 'samples': 11553792, 'steps': 60175, 'loss/train': 1.3293216228485107} 11/07/2021 05:43:46 - INFO - __main__ - Step 60177: {'lr': 0.0003324037342364319, 'samples': 11553984, 'steps': 60176, 'loss/train': 1.267116904258728} 11/07/2021 05:43:47 - INFO - __main__ - Step 60178: {'lr': 0.0003323987240385106, 'samples': 11554176, 'steps': 60177, 'loss/train': 1.09414541721344} 11/07/2021 05:43:47 - INFO - __main__ - Step 60179: {'lr': 0.00033239371380346165, 'samples': 11554368, 'steps': 60178, 'loss/train': 1.282373309135437} 11/07/2021 05:43:48 - INFO - __main__ - Step 60180: {'lr': 0.0003323887035312875, 'samples': 11554560, 'steps': 60179, 'loss/train': 0.9317958950996399} 11/07/2021 05:43:48 - INFO - __main__ - Step 60181: {'lr': 0.0003323836932219902, 'samples': 11554752, 'steps': 60180, 'loss/train': 1.0857734680175781} 11/07/2021 05:43:48 - INFO - __main__ - Step 60182: {'lr': 0.0003323786828755721, 'samples': 11554944, 'steps': 60181, 'loss/train': 1.3090232610702515} 11/07/2021 05:43:49 - INFO - __main__ - Step 60183: {'lr': 0.00033237367249203543, 'samples': 11555136, 'steps': 60182, 'loss/train': 1.329426646232605} 11/07/2021 05:43:50 - INFO - __main__ - Step 60184: {'lr': 0.0003323686620713824, 'samples': 11555328, 'steps': 60183, 'loss/train': 1.1482723951339722} 11/07/2021 05:43:50 - INFO - __main__ - Step 60185: {'lr': 0.00033236365161361535, 'samples': 11555520, 'steps': 60184, 'loss/train': 1.3021589517593384} 11/07/2021 05:43:50 - INFO - __main__ - Step 60186: {'lr': 0.00033235864111873654, 'samples': 11555712, 'steps': 60185, 'loss/train': 1.534145474433899} 11/07/2021 05:43:51 - INFO - __main__ - Step 60187: {'lr': 0.00033235363058674826, 'samples': 11555904, 'steps': 60186, 'loss/train': 1.6366760730743408} 11/07/2021 05:43:51 - INFO - __main__ - Step 60188: {'lr': 0.0003323486200176526, 'samples': 11556096, 'steps': 60187, 'loss/train': 1.575746774673462} 11/07/2021 05:43:52 - INFO - __main__ - Step 60189: {'lr': 0.000332343609411452, 'samples': 11556288, 'steps': 60188, 'loss/train': 1.628832459449768} 11/07/2021 05:43:52 - INFO - __main__ - Step 60190: {'lr': 0.00033233859876814856, 'samples': 11556480, 'steps': 60189, 'loss/train': 1.4792842864990234} 11/07/2021 05:43:53 - INFO - __main__ - Step 60191: {'lr': 0.0003323335880877446, 'samples': 11556672, 'steps': 60190, 'loss/train': 1.4913865327835083} 11/07/2021 05:43:53 - INFO - __main__ - Step 60192: {'lr': 0.00033232857737024244, 'samples': 11556864, 'steps': 60191, 'loss/train': 1.416801929473877} 11/07/2021 05:43:54 - INFO - __main__ - Step 60193: {'lr': 0.00033232356661564436, 'samples': 11557056, 'steps': 60192, 'loss/train': 1.5051209926605225} 11/07/2021 05:43:55 - INFO - __main__ - Step 60194: {'lr': 0.00033231855582395247, 'samples': 11557248, 'steps': 60193, 'loss/train': 1.3351315259933472} 11/07/2021 05:43:55 - INFO - __main__ - Step 60195: {'lr': 0.00033231354499516915, 'samples': 11557440, 'steps': 60194, 'loss/train': 1.9947597980499268} 11/07/2021 05:43:55 - INFO - __main__ - Step 60196: {'lr': 0.00033230853412929664, 'samples': 11557632, 'steps': 60195, 'loss/train': 1.7530628442764282} 11/07/2021 05:43:56 - INFO - __main__ - Step 60197: {'lr': 0.00033230352322633703, 'samples': 11557824, 'steps': 60196, 'loss/train': 1.3776448965072632} 11/07/2021 05:43:56 - INFO - __main__ - Step 60198: {'lr': 0.0003322985122862929, 'samples': 11558016, 'steps': 60197, 'loss/train': 1.218187928199768} 11/07/2021 05:43:57 - INFO - __main__ - Step 60199: {'lr': 0.00033229350130916627, 'samples': 11558208, 'steps': 60198, 'loss/train': 1.507169246673584} 11/07/2021 05:43:58 - INFO - __main__ - Step 60200: {'lr': 0.0003322884902949594, 'samples': 11558400, 'steps': 60199, 'loss/train': 1.1210753917694092} 11/07/2021 05:43:58 - INFO - __main__ - Step 60201: {'lr': 0.0003322834792436747, 'samples': 11558592, 'steps': 60200, 'loss/train': 1.3480879068374634} 11/07/2021 05:43:58 - INFO - __main__ - Step 60202: {'lr': 0.00033227846815531424, 'samples': 11558784, 'steps': 60201, 'loss/train': 1.1348947286605835} 11/07/2021 05:43:59 - INFO - __main__ - Step 60203: {'lr': 0.0003322734570298804, 'samples': 11558976, 'steps': 60202, 'loss/train': 1.3035460710525513} 11/07/2021 05:44:00 - INFO - __main__ - Step 60204: {'lr': 0.00033226844586737545, 'samples': 11559168, 'steps': 60203, 'loss/train': 1.833981990814209} 11/07/2021 05:44:00 - INFO - __main__ - Step 60205: {'lr': 0.00033226343466780155, 'samples': 11559360, 'steps': 60204, 'loss/train': 1.6156854629516602} 11/07/2021 05:44:00 - INFO - __main__ - Step 60206: {'lr': 0.0003322584234311611, 'samples': 11559552, 'steps': 60205, 'loss/train': 1.4758983850479126} 11/07/2021 05:44:01 - INFO - __main__ - Step 60207: {'lr': 0.0003322534121574562, 'samples': 11559744, 'steps': 60206, 'loss/train': 1.37001371383667} 11/07/2021 05:44:01 - INFO - __main__ - Step 60208: {'lr': 0.0003322484008466892, 'samples': 11559936, 'steps': 60207, 'loss/train': 1.0512351989746094} 11/07/2021 05:44:02 - INFO - __main__ - Step 60209: {'lr': 0.00033224338949886233, 'samples': 11560128, 'steps': 60208, 'loss/train': 1.4253219366073608} 11/07/2021 05:44:03 - INFO - __main__ - Step 60210: {'lr': 0.0003322383781139779, 'samples': 11560320, 'steps': 60209, 'loss/train': 0.05440700426697731} 11/07/2021 05:44:03 - INFO - __main__ - Step 60211: {'lr': 0.0003322333666920381, 'samples': 11560512, 'steps': 60210, 'loss/train': 1.1338646411895752} 11/07/2021 05:44:03 - INFO - __main__ - Step 60212: {'lr': 0.0003322283552330452, 'samples': 11560704, 'steps': 60211, 'loss/train': 0.9653365015983582} 11/07/2021 05:44:04 - INFO - __main__ - Step 60213: {'lr': 0.00033222334373700146, 'samples': 11560896, 'steps': 60212, 'loss/train': 1.4488608837127686} 11/07/2021 05:44:04 - INFO - __main__ - Step 60214: {'lr': 0.00033221833220390925, 'samples': 11561088, 'steps': 60213, 'loss/train': 2.4441239833831787} 11/07/2021 05:44:05 - INFO - __main__ - Step 60215: {'lr': 0.00033221332063377066, 'samples': 11561280, 'steps': 60214, 'loss/train': 0.5568519234657288} 11/07/2021 05:44:06 - INFO - __main__ - Step 60216: {'lr': 0.0003322083090265879, 'samples': 11561472, 'steps': 60215, 'loss/train': 1.5247681140899658} 11/07/2021 05:44:06 - INFO - __main__ - Step 60217: {'lr': 0.0003322032973823635, 'samples': 11561664, 'steps': 60216, 'loss/train': 1.316370964050293} 11/07/2021 05:44:06 - INFO - __main__ - Step 60218: {'lr': 0.0003321982857010995, 'samples': 11561856, 'steps': 60217, 'loss/train': 1.5131691694259644} 11/07/2021 05:44:07 - INFO - __main__ - Step 60219: {'lr': 0.00033219327398279825, 'samples': 11562048, 'steps': 60218, 'loss/train': 1.0950616598129272} 11/07/2021 05:44:08 - INFO - __main__ - Step 60220: {'lr': 0.00033218826222746194, 'samples': 11562240, 'steps': 60219, 'loss/train': 1.7482783794403076} 11/07/2021 05:44:08 - INFO - __main__ - Step 60221: {'lr': 0.00033218325043509297, 'samples': 11562432, 'steps': 60220, 'loss/train': 0.9750014543533325} 11/07/2021 05:44:08 - INFO - __main__ - Step 60222: {'lr': 0.0003321782386056934, 'samples': 11562624, 'steps': 60221, 'loss/train': 1.3724541664123535} 11/07/2021 05:44:09 - INFO - __main__ - Step 60223: {'lr': 0.0003321732267392656, 'samples': 11562816, 'steps': 60222, 'loss/train': 1.4684392213821411} 11/07/2021 05:44:09 - INFO - __main__ - Step 60224: {'lr': 0.0003321682148358118, 'samples': 11563008, 'steps': 60223, 'loss/train': 1.3060544729232788} 11/07/2021 05:44:09 - INFO - __main__ - Step 60225: {'lr': 0.0003321632028953343, 'samples': 11563200, 'steps': 60224, 'loss/train': 1.7968882322311401} 11/07/2021 05:44:11 - INFO - __main__ - Step 60226: {'lr': 0.0003321581909178353, 'samples': 11563392, 'steps': 60225, 'loss/train': 5.3611836433410645} 11/07/2021 05:44:11 - INFO - __main__ - Step 60227: {'lr': 0.0003321531789033171, 'samples': 11563584, 'steps': 60226, 'loss/train': 1.1560535430908203} 11/07/2021 05:44:11 - INFO - __main__ - Step 60228: {'lr': 0.00033214816685178195, 'samples': 11563776, 'steps': 60227, 'loss/train': 1.748153805732727} 11/07/2021 05:44:12 - INFO - __main__ - Step 60229: {'lr': 0.0003321431547632321, 'samples': 11563968, 'steps': 60228, 'loss/train': 1.0872668027877808} 11/07/2021 05:44:12 - INFO - __main__ - Step 60230: {'lr': 0.00033213814263766985, 'samples': 11564160, 'steps': 60229, 'loss/train': 1.5244708061218262} 11/07/2021 05:44:13 - INFO - __main__ - Step 60231: {'lr': 0.0003321331304750973, 'samples': 11564352, 'steps': 60230, 'loss/train': 1.4618442058563232} 11/07/2021 05:44:13 - INFO - __main__ - Step 60232: {'lr': 0.00033212811827551693, 'samples': 11564544, 'steps': 60231, 'loss/train': 1.330047845840454} 11/07/2021 05:44:14 - INFO - __main__ - Step 60233: {'lr': 0.00033212310603893087, 'samples': 11564736, 'steps': 60232, 'loss/train': 1.4405914545059204} 11/07/2021 05:44:14 - INFO - __main__ - Step 60234: {'lr': 0.0003321180937653415, 'samples': 11564928, 'steps': 60233, 'loss/train': 1.3131108283996582} 11/07/2021 05:44:14 - INFO - __main__ - Step 60235: {'lr': 0.0003321130814547508, 'samples': 11565120, 'steps': 60234, 'loss/train': 1.4534627199172974} 11/07/2021 05:44:15 - INFO - __main__ - Step 60236: {'lr': 0.00033210806910716136, 'samples': 11565312, 'steps': 60235, 'loss/train': 1.42470121383667} 11/07/2021 05:44:16 - INFO - __main__ - Step 60237: {'lr': 0.00033210305672257525, 'samples': 11565504, 'steps': 60236, 'loss/train': 1.232730507850647} 11/07/2021 05:44:16 - INFO - __main__ - Step 60238: {'lr': 0.0003320980443009947, 'samples': 11565696, 'steps': 60237, 'loss/train': 1.3475693464279175} 11/07/2021 05:44:16 - INFO - __main__ - Step 60239: {'lr': 0.00033209303184242214, 'samples': 11565888, 'steps': 60238, 'loss/train': 1.4086356163024902} 11/07/2021 05:44:17 - INFO - __main__ - Step 60240: {'lr': 0.00033208801934685975, 'samples': 11566080, 'steps': 60239, 'loss/train': 1.3684958219528198} 11/07/2021 05:44:18 - INFO - __main__ - Step 60241: {'lr': 0.00033208300681430964, 'samples': 11566272, 'steps': 60240, 'loss/train': 1.3690298795700073} 11/07/2021 05:44:18 - INFO - __main__ - Step 60242: {'lr': 0.00033207799424477425, 'samples': 11566464, 'steps': 60241, 'loss/train': 1.9028096199035645} 11/07/2021 05:44:18 - INFO - __main__ - Step 60243: {'lr': 0.0003320729816382558, 'samples': 11566656, 'steps': 60242, 'loss/train': 1.3465632200241089} 11/07/2021 05:44:19 - INFO - __main__ - Step 60244: {'lr': 0.0003320679689947565, 'samples': 11566848, 'steps': 60243, 'loss/train': 0.9780444502830505} 11/07/2021 05:44:19 - INFO - __main__ - Step 60245: {'lr': 0.0003320629563142787, 'samples': 11567040, 'steps': 60244, 'loss/train': 1.5218417644500732} 11/07/2021 05:44:20 - INFO - __main__ - Step 60246: {'lr': 0.00033205794359682456, 'samples': 11567232, 'steps': 60245, 'loss/train': 1.240847110748291} 11/07/2021 05:44:20 - INFO - __main__ - Step 60247: {'lr': 0.0003320529308423963, 'samples': 11567424, 'steps': 60246, 'loss/train': 1.1041347980499268} 11/07/2021 05:44:21 - INFO - __main__ - Step 60248: {'lr': 0.00033204791805099636, 'samples': 11567616, 'steps': 60247, 'loss/train': 1.478561282157898} 11/07/2021 05:44:21 - INFO - __main__ - Step 60249: {'lr': 0.00033204290522262684, 'samples': 11567808, 'steps': 60248, 'loss/train': 1.462732195854187} 11/07/2021 05:44:22 - INFO - __main__ - Step 60250: {'lr': 0.0003320378923572901, 'samples': 11568000, 'steps': 60249, 'loss/train': 1.5104708671569824} 11/07/2021 05:44:22 - INFO - __main__ - Step 60251: {'lr': 0.0003320328794549884, 'samples': 11568192, 'steps': 60250, 'loss/train': 1.607977032661438} 11/07/2021 05:44:23 - INFO - __main__ - Step 60252: {'lr': 0.0003320278665157238, 'samples': 11568384, 'steps': 60251, 'loss/train': 1.1915283203125} 11/07/2021 05:44:23 - INFO - __main__ - Step 60253: {'lr': 0.0003320228535394988, 'samples': 11568576, 'steps': 60252, 'loss/train': 0.4284207224845886} 11/07/2021 05:44:24 - INFO - __main__ - Step 60254: {'lr': 0.0003320178405263156, 'samples': 11568768, 'steps': 60253, 'loss/train': 1.5843852758407593} 11/07/2021 05:44:24 - INFO - __main__ - Step 60255: {'lr': 0.00033201282747617636, 'samples': 11568960, 'steps': 60254, 'loss/train': 1.2781689167022705} 11/07/2021 05:44:25 - INFO - __main__ - Step 60256: {'lr': 0.00033200781438908345, 'samples': 11569152, 'steps': 60255, 'loss/train': 1.4429857730865479} 11/07/2021 05:44:25 - INFO - __main__ - Step 60257: {'lr': 0.00033200280126503904, 'samples': 11569344, 'steps': 60256, 'loss/train': 1.8916230201721191} 11/07/2021 05:44:26 - INFO - __main__ - Step 60258: {'lr': 0.00033199778810404546, 'samples': 11569536, 'steps': 60257, 'loss/train': 1.2465578317642212} 11/07/2021 05:44:26 - INFO - __main__ - Step 60259: {'lr': 0.0003319927749061049, 'samples': 11569728, 'steps': 60258, 'loss/train': 1.4934141635894775} 11/07/2021 05:44:26 - INFO - __main__ - Step 60260: {'lr': 0.0003319877616712197, 'samples': 11569920, 'steps': 60259, 'loss/train': 1.7052654027938843} 11/07/2021 05:44:27 - INFO - __main__ - Step 60261: {'lr': 0.0003319827483993921, 'samples': 11570112, 'steps': 60260, 'loss/train': 1.5795328617095947} 11/07/2021 05:44:28 - INFO - __main__ - Step 60262: {'lr': 0.00033197773509062434, 'samples': 11570304, 'steps': 60261, 'loss/train': 1.430423378944397} 11/07/2021 05:44:28 - INFO - __main__ - Step 60263: {'lr': 0.00033197272174491864, 'samples': 11570496, 'steps': 60262, 'loss/train': 1.6539891958236694} 11/07/2021 05:44:29 - INFO - __main__ - Step 60264: {'lr': 0.0003319677083622773, 'samples': 11570688, 'steps': 60263, 'loss/train': 1.349233627319336} 11/07/2021 05:44:29 - INFO - __main__ - Step 60265: {'lr': 0.0003319626949427026, 'samples': 11570880, 'steps': 60264, 'loss/train': 1.2653108835220337} 11/07/2021 05:44:30 - INFO - __main__ - Step 60266: {'lr': 0.00033195768148619676, 'samples': 11571072, 'steps': 60265, 'loss/train': 0.0857623815536499} 11/07/2021 05:44:31 - INFO - __main__ - Step 60267: {'lr': 0.000331952667992762, 'samples': 11571264, 'steps': 60266, 'loss/train': 1.8026045560836792} 11/07/2021 05:44:31 - INFO - __main__ - Step 60268: {'lr': 0.0003319476544624007, 'samples': 11571456, 'steps': 60267, 'loss/train': 1.1682244539260864} 11/07/2021 05:44:31 - INFO - __main__ - Step 60269: {'lr': 0.0003319426408951151, 'samples': 11571648, 'steps': 60268, 'loss/train': 1.3311861753463745} 11/07/2021 05:44:32 - INFO - __main__ - Step 60270: {'lr': 0.0003319376272909073, 'samples': 11571840, 'steps': 60269, 'loss/train': 1.6249704360961914} 11/07/2021 05:44:32 - INFO - __main__ - Step 60271: {'lr': 0.0003319326136497797, 'samples': 11572032, 'steps': 60270, 'loss/train': 1.4199638366699219} 11/07/2021 05:44:33 - INFO - __main__ - Step 60272: {'lr': 0.00033192759997173455, 'samples': 11572224, 'steps': 60271, 'loss/train': 1.3483589887619019} 11/07/2021 05:44:33 - INFO - __main__ - Step 60273: {'lr': 0.0003319225862567741, 'samples': 11572416, 'steps': 60272, 'loss/train': 1.455997109413147} 11/07/2021 05:44:34 - INFO - __main__ - Step 60274: {'lr': 0.0003319175725049006, 'samples': 11572608, 'steps': 60273, 'loss/train': 1.1764227151870728} 11/07/2021 05:44:34 - INFO - __main__ - Step 60275: {'lr': 0.00033191255871611625, 'samples': 11572800, 'steps': 60274, 'loss/train': 1.2343151569366455} 11/07/2021 05:44:35 - INFO - __main__ - Step 60276: {'lr': 0.0003319075448904234, 'samples': 11572992, 'steps': 60275, 'loss/train': 1.4214507341384888} 11/07/2021 05:44:35 - INFO - __main__ - Step 60277: {'lr': 0.00033190253102782433, 'samples': 11573184, 'steps': 60276, 'loss/train': 1.5615508556365967} 11/07/2021 05:44:36 - INFO - __main__ - Step 60278: {'lr': 0.0003318975171283212, 'samples': 11573376, 'steps': 60277, 'loss/train': 0.8586135506629944} 11/07/2021 05:44:36 - INFO - __main__ - Step 60279: {'lr': 0.0003318925031919162, 'samples': 11573568, 'steps': 60278, 'loss/train': 1.3413286209106445} 11/07/2021 05:44:37 - INFO - __main__ - Step 60280: {'lr': 0.00033188748921861186, 'samples': 11573760, 'steps': 60279, 'loss/train': 1.3240867853164673} 11/07/2021 05:44:37 - INFO - __main__ - Step 60281: {'lr': 0.00033188247520841025, 'samples': 11573952, 'steps': 60280, 'loss/train': 1.8146029710769653} 11/07/2021 05:44:38 - INFO - __main__ - Step 60282: {'lr': 0.0003318774611613136, 'samples': 11574144, 'steps': 60281, 'loss/train': 1.811106562614441} 11/07/2021 05:44:38 - INFO - __main__ - Step 60283: {'lr': 0.00033187244707732425, 'samples': 11574336, 'steps': 60282, 'loss/train': 1.2580935955047607} 11/07/2021 05:44:39 - INFO - __main__ - Step 60284: {'lr': 0.00033186743295644447, 'samples': 11574528, 'steps': 60283, 'loss/train': 1.0723472833633423} 11/07/2021 05:44:39 - INFO - __main__ - Step 60285: {'lr': 0.00033186241879867644, 'samples': 11574720, 'steps': 60284, 'loss/train': 1.6085362434387207} 11/07/2021 05:44:39 - INFO - __main__ - Step 60286: {'lr': 0.00033185740460402245, 'samples': 11574912, 'steps': 60285, 'loss/train': 1.4193038940429688} 11/07/2021 05:44:40 - INFO - __main__ - Step 60287: {'lr': 0.0003318523903724849, 'samples': 11575104, 'steps': 60286, 'loss/train': 1.3279359340667725} 11/07/2021 05:44:41 - INFO - __main__ - Step 60288: {'lr': 0.00033184737610406583, 'samples': 11575296, 'steps': 60287, 'loss/train': 1.5117497444152832} 11/07/2021 05:44:41 - INFO - __main__ - Step 60289: {'lr': 0.00033184236179876765, 'samples': 11575488, 'steps': 60288, 'loss/train': 0.9239439368247986} 11/07/2021 05:44:41 - INFO - __main__ - Step 60290: {'lr': 0.0003318373474565925, 'samples': 11575680, 'steps': 60289, 'loss/train': 1.1956406831741333} 11/07/2021 05:44:42 - INFO - __main__ - Step 60291: {'lr': 0.0003318323330775427, 'samples': 11575872, 'steps': 60290, 'loss/train': 1.2792822122573853} 11/07/2021 05:44:43 - INFO - __main__ - Step 60292: {'lr': 0.00033182731866162056, 'samples': 11576064, 'steps': 60291, 'loss/train': 1.0465173721313477} 11/07/2021 05:44:43 - INFO - __main__ - Step 60293: {'lr': 0.00033182230420882833, 'samples': 11576256, 'steps': 60292, 'loss/train': 0.8956973552703857} 11/07/2021 05:44:44 - INFO - __main__ - Step 60294: {'lr': 0.00033181728971916813, 'samples': 11576448, 'steps': 60293, 'loss/train': 1.19414484500885} 11/07/2021 05:44:44 - INFO - __main__ - Step 60295: {'lr': 0.0003318122751926424, 'samples': 11576640, 'steps': 60294, 'loss/train': 0.7102293372154236} 11/07/2021 05:44:44 - INFO - __main__ - Step 60296: {'lr': 0.0003318072606292533, 'samples': 11576832, 'steps': 60295, 'loss/train': 0.849144458770752} 11/07/2021 05:44:45 - INFO - __main__ - Step 60297: {'lr': 0.0003318022460290031, 'samples': 11577024, 'steps': 60296, 'loss/train': 1.1355880498886108} 11/07/2021 05:44:46 - INFO - __main__ - Step 60298: {'lr': 0.00033179723139189403, 'samples': 11577216, 'steps': 60297, 'loss/train': 0.94053715467453} 11/07/2021 05:44:46 - INFO - __main__ - Step 60299: {'lr': 0.00033179221671792846, 'samples': 11577408, 'steps': 60298, 'loss/train': 1.364589810371399} 11/07/2021 05:44:46 - INFO - __main__ - Step 60300: {'lr': 0.0003317872020071085, 'samples': 11577600, 'steps': 60299, 'loss/train': 0.9318669438362122} 11/07/2021 05:44:47 - INFO - __main__ - Step 60301: {'lr': 0.00033178218725943666, 'samples': 11577792, 'steps': 60300, 'loss/train': 1.2909047603607178} 11/07/2021 05:44:48 - INFO - __main__ - Step 60302: {'lr': 0.0003317771724749149, 'samples': 11577984, 'steps': 60301, 'loss/train': 1.2135589122772217} 11/07/2021 05:44:48 - INFO - __main__ - Step 60303: {'lr': 0.0003317721576535456, 'samples': 11578176, 'steps': 60302, 'loss/train': 1.4646660089492798} 11/07/2021 05:44:48 - INFO - __main__ - Step 60304: {'lr': 0.00033176714279533107, 'samples': 11578368, 'steps': 60303, 'loss/train': 1.1077381372451782} 11/07/2021 05:44:49 - INFO - __main__ - Step 60305: {'lr': 0.0003317621279002734, 'samples': 11578560, 'steps': 60304, 'loss/train': 1.5224531888961792} 11/07/2021 05:44:49 - INFO - __main__ - Step 60306: {'lr': 0.0003317571129683751, 'samples': 11578752, 'steps': 60305, 'loss/train': 1.3471980094909668} 11/07/2021 05:44:50 - INFO - __main__ - Step 60307: {'lr': 0.0003317520979996383, 'samples': 11578944, 'steps': 60306, 'loss/train': 0.7186444401741028} 11/07/2021 05:44:51 - INFO - __main__ - Step 60308: {'lr': 0.0003317470829940653, 'samples': 11579136, 'steps': 60307, 'loss/train': 1.7448220252990723} 11/07/2021 05:44:51 - INFO - __main__ - Step 60309: {'lr': 0.0003317420679516583, 'samples': 11579328, 'steps': 60308, 'loss/train': 0.973672091960907} 11/07/2021 05:44:52 - INFO - __main__ - Step 60310: {'lr': 0.0003317370528724195, 'samples': 11579520, 'steps': 60309, 'loss/train': 1.31278395652771} 11/07/2021 05:44:52 - INFO - __main__ - Step 60311: {'lr': 0.0003317320377563514, 'samples': 11579712, 'steps': 60310, 'loss/train': 1.1477553844451904} 11/07/2021 05:44:53 - INFO - __main__ - Step 60312: {'lr': 0.0003317270226034559, 'samples': 11579904, 'steps': 60311, 'loss/train': 1.2974623441696167} 11/07/2021 05:44:53 - INFO - __main__ - Step 60313: {'lr': 0.0003317220074137356, 'samples': 11580096, 'steps': 60312, 'loss/train': 0.6148391962051392} 11/07/2021 05:44:54 - INFO - __main__ - Step 60314: {'lr': 0.00033171699218719267, 'samples': 11580288, 'steps': 60313, 'loss/train': 2.774914264678955} 11/07/2021 05:44:54 - INFO - __main__ - Step 60315: {'lr': 0.00033171197692382926, 'samples': 11580480, 'steps': 60314, 'loss/train': 0.911057710647583} 11/07/2021 05:44:54 - INFO - __main__ - Step 60316: {'lr': 0.00033170696162364765, 'samples': 11580672, 'steps': 60315, 'loss/train': 1.5811431407928467} 11/07/2021 05:44:55 - INFO - __main__ - Step 60317: {'lr': 0.00033170194628665017, 'samples': 11580864, 'steps': 60316, 'loss/train': 1.626639723777771} 11/07/2021 05:44:56 - INFO - __main__ - Step 60318: {'lr': 0.0003316969309128391, 'samples': 11581056, 'steps': 60317, 'loss/train': 1.4853150844573975} 11/07/2021 05:44:56 - INFO - __main__ - Step 60319: {'lr': 0.00033169191550221663, 'samples': 11581248, 'steps': 60318, 'loss/train': 1.4956469535827637} 11/07/2021 05:44:56 - INFO - __main__ - Step 60320: {'lr': 0.000331686900054785, 'samples': 11581440, 'steps': 60319, 'loss/train': 1.3181567192077637} 11/07/2021 05:44:57 - INFO - __main__ - Step 60321: {'lr': 0.00033168188457054654, 'samples': 11581632, 'steps': 60320, 'loss/train': 1.239157795906067} 11/07/2021 05:44:57 - INFO - __main__ - Step 60322: {'lr': 0.00033167686904950357, 'samples': 11581824, 'steps': 60321, 'loss/train': 0.19734248518943787} 11/07/2021 05:44:58 - INFO - __main__ - Step 60323: {'lr': 0.00033167185349165817, 'samples': 11582016, 'steps': 60322, 'loss/train': 1.5673600435256958} 11/07/2021 05:44:59 - INFO - __main__ - Step 60324: {'lr': 0.00033166683789701267, 'samples': 11582208, 'steps': 60323, 'loss/train': 1.3493220806121826} 11/07/2021 05:44:59 - INFO - __main__ - Step 60325: {'lr': 0.0003316618222655694, 'samples': 11582400, 'steps': 60324, 'loss/train': 1.2684649229049683} 11/07/2021 05:44:59 - INFO - __main__ - Step 60326: {'lr': 0.00033165680659733054, 'samples': 11582592, 'steps': 60325, 'loss/train': 1.5344562530517578} 11/07/2021 05:45:00 - INFO - __main__ - Step 60327: {'lr': 0.00033165179089229846, 'samples': 11582784, 'steps': 60326, 'loss/train': 1.2550421953201294} 11/07/2021 05:45:00 - INFO - __main__ - Step 60328: {'lr': 0.00033164677515047533, 'samples': 11582976, 'steps': 60327, 'loss/train': 0.20303970575332642} 11/07/2021 05:45:01 - INFO - __main__ - Step 60329: {'lr': 0.0003316417593718634, 'samples': 11583168, 'steps': 60328, 'loss/train': 1.3137410879135132} 11/07/2021 05:45:01 - INFO - __main__ - Step 60330: {'lr': 0.0003316367435564649, 'samples': 11583360, 'steps': 60329, 'loss/train': 1.0781980752944946} 11/07/2021 05:45:02 - INFO - __main__ - Step 60331: {'lr': 0.0003316317277042822, 'samples': 11583552, 'steps': 60330, 'loss/train': 1.7377347946166992} 11/07/2021 05:45:02 - INFO - __main__ - Step 60332: {'lr': 0.0003316267118153175, 'samples': 11583744, 'steps': 60331, 'loss/train': 1.2611678838729858} 11/07/2021 05:45:02 - INFO - __main__ - Step 60333: {'lr': 0.00033162169588957295, 'samples': 11583936, 'steps': 60332, 'loss/train': 1.0639727115631104} 11/07/2021 05:45:04 - INFO - __main__ - Step 60334: {'lr': 0.00033161667992705104, 'samples': 11584128, 'steps': 60333, 'loss/train': 1.2822020053863525} 11/07/2021 05:45:04 - INFO - __main__ - Step 60335: {'lr': 0.0003316116639277539, 'samples': 11584320, 'steps': 60334, 'loss/train': 1.499685287475586} 11/07/2021 05:45:04 - INFO - __main__ - Step 60336: {'lr': 0.00033160664789168385, 'samples': 11584512, 'steps': 60335, 'loss/train': 1.3909335136413574} 11/07/2021 05:45:05 - INFO - __main__ - Step 60337: {'lr': 0.00033160163181884307, 'samples': 11584704, 'steps': 60336, 'loss/train': 1.5050562620162964} 11/07/2021 05:45:05 - INFO - __main__ - Step 60338: {'lr': 0.00033159661570923384, 'samples': 11584896, 'steps': 60337, 'loss/train': 1.6542142629623413} 11/07/2021 05:45:06 - INFO - __main__ - Step 60339: {'lr': 0.0003315915995628584, 'samples': 11585088, 'steps': 60338, 'loss/train': 1.3440932035446167} 11/07/2021 05:45:06 - INFO - __main__ - Step 60340: {'lr': 0.000331586583379719, 'samples': 11585280, 'steps': 60339, 'loss/train': 1.0555334091186523} 11/07/2021 05:45:07 - INFO - __main__ - Step 60341: {'lr': 0.0003315815671598181, 'samples': 11585472, 'steps': 60340, 'loss/train': 1.2632547616958618} 11/07/2021 05:45:07 - INFO - __main__ - Step 60342: {'lr': 0.00033157655090315777, 'samples': 11585664, 'steps': 60341, 'loss/train': 1.4036873579025269} 11/07/2021 05:45:07 - INFO - __main__ - Step 60343: {'lr': 0.0003315715346097402, 'samples': 11585856, 'steps': 60342, 'loss/train': 1.8107974529266357} 11/07/2021 05:45:08 - INFO - __main__ - Step 60344: {'lr': 0.0003315665182795678, 'samples': 11586048, 'steps': 60343, 'loss/train': 1.180841088294983} 11/07/2021 05:45:09 - INFO - __main__ - Step 60345: {'lr': 0.00033156150191264276, 'samples': 11586240, 'steps': 60344, 'loss/train': 1.525046467781067} 11/07/2021 05:45:09 - INFO - __main__ - Step 60346: {'lr': 0.00033155648550896744, 'samples': 11586432, 'steps': 60345, 'loss/train': 1.4833077192306519} 11/07/2021 05:45:09 - INFO - __main__ - Step 60347: {'lr': 0.000331551469068544, 'samples': 11586624, 'steps': 60346, 'loss/train': 1.4567911624908447} 11/07/2021 05:45:10 - INFO - __main__ - Step 60348: {'lr': 0.00033154645259137475, 'samples': 11586816, 'steps': 60347, 'loss/train': 1.5256929397583008} 11/07/2021 05:45:11 - INFO - __main__ - Step 60349: {'lr': 0.0003315414360774619, 'samples': 11587008, 'steps': 60348, 'loss/train': 1.482256293296814} 11/07/2021 05:45:11 - INFO - __main__ - Step 60350: {'lr': 0.00033153641952680767, 'samples': 11587200, 'steps': 60349, 'loss/train': 1.4682725667953491} 11/07/2021 05:45:12 - INFO - __main__ - Step 60351: {'lr': 0.00033153140293941445, 'samples': 11587392, 'steps': 60350, 'loss/train': 0.8860106468200684} 11/07/2021 05:45:12 - INFO - __main__ - Step 60352: {'lr': 0.00033152638631528446, 'samples': 11587584, 'steps': 60351, 'loss/train': 1.4941452741622925} 11/07/2021 05:45:12 - INFO - __main__ - Step 60353: {'lr': 0.0003315213696544199, 'samples': 11587776, 'steps': 60352, 'loss/train': 1.3626902103424072} 11/07/2021 05:45:13 - INFO - __main__ - Step 60354: {'lr': 0.00033151635295682307, 'samples': 11587968, 'steps': 60353, 'loss/train': 1.616555094718933} 11/07/2021 05:45:14 - INFO - __main__ - Step 60355: {'lr': 0.0003315113362224963, 'samples': 11588160, 'steps': 60354, 'loss/train': 1.1568067073822021} 11/07/2021 05:45:14 - INFO - __main__ - Step 60356: {'lr': 0.0003315063194514417, 'samples': 11588352, 'steps': 60355, 'loss/train': 0.5721087455749512} 11/07/2021 05:45:14 - INFO - __main__ - Step 60357: {'lr': 0.00033150130264366165, 'samples': 11588544, 'steps': 60356, 'loss/train': 0.8112773895263672} 11/07/2021 05:45:15 - INFO - __main__ - Step 60358: {'lr': 0.00033149628579915835, 'samples': 11588736, 'steps': 60357, 'loss/train': 1.7352149486541748} 11/07/2021 05:45:15 - INFO - __main__ - Step 60359: {'lr': 0.0003314912689179341, 'samples': 11588928, 'steps': 60358, 'loss/train': 2.028284788131714} 11/07/2021 05:45:16 - INFO - __main__ - Step 60360: {'lr': 0.0003314862519999911, 'samples': 11589120, 'steps': 60359, 'loss/train': 1.4436801671981812} 11/07/2021 05:45:17 - INFO - __main__ - Step 60361: {'lr': 0.0003314812350453317, 'samples': 11589312, 'steps': 60360, 'loss/train': 1.6026908159255981} 11/07/2021 05:45:17 - INFO - __main__ - Step 60362: {'lr': 0.0003314762180539581, 'samples': 11589504, 'steps': 60361, 'loss/train': 1.2008447647094727} 11/07/2021 05:45:17 - INFO - __main__ - Step 60363: {'lr': 0.00033147120102587256, 'samples': 11589696, 'steps': 60362, 'loss/train': 1.8036283254623413} 11/07/2021 05:45:18 - INFO - __main__ - Step 60364: {'lr': 0.00033146618396107737, 'samples': 11589888, 'steps': 60363, 'loss/train': 1.3074967861175537} 11/07/2021 05:45:19 - INFO - __main__ - Step 60365: {'lr': 0.00033146116685957473, 'samples': 11590080, 'steps': 60364, 'loss/train': 0.13003472983837128} 11/07/2021 05:45:19 - INFO - __main__ - Step 60366: {'lr': 0.00033145614972136697, 'samples': 11590272, 'steps': 60365, 'loss/train': 1.4109644889831543} 11/07/2021 05:45:20 - INFO - __main__ - Step 60367: {'lr': 0.0003314511325464563, 'samples': 11590464, 'steps': 60366, 'loss/train': 1.773784875869751} 11/07/2021 05:45:20 - INFO - __main__ - Step 60368: {'lr': 0.0003314461153348451, 'samples': 11590656, 'steps': 60367, 'loss/train': 1.3438807725906372} 11/07/2021 05:45:20 - INFO - __main__ - Step 60369: {'lr': 0.0003314410980865355, 'samples': 11590848, 'steps': 60368, 'loss/train': 1.5360952615737915} 11/07/2021 05:45:21 - INFO - __main__ - Step 60370: {'lr': 0.00033143608080152975, 'samples': 11591040, 'steps': 60369, 'loss/train': 1.3308426141738892} 11/07/2021 05:45:22 - INFO - __main__ - Step 60371: {'lr': 0.0003314310634798302, 'samples': 11591232, 'steps': 60370, 'loss/train': 1.5121368169784546} 11/07/2021 05:45:22 - INFO - __main__ - Step 60372: {'lr': 0.00033142604612143903, 'samples': 11591424, 'steps': 60371, 'loss/train': 1.768896222114563} 11/07/2021 05:45:22 - INFO - __main__ - Step 60373: {'lr': 0.0003314210287263586, 'samples': 11591616, 'steps': 60372, 'loss/train': 1.6109230518341064} 11/07/2021 05:45:23 - INFO - __main__ - Step 60374: {'lr': 0.0003314160112945911, 'samples': 11591808, 'steps': 60373, 'loss/train': 1.5864044427871704} 11/07/2021 05:45:24 - INFO - __main__ - Step 60375: {'lr': 0.00033141099382613876, 'samples': 11592000, 'steps': 60374, 'loss/train': 1.4085843563079834} 11/07/2021 05:45:24 - INFO - __main__ - Step 60376: {'lr': 0.00033140597632100386, 'samples': 11592192, 'steps': 60375, 'loss/train': 1.1991398334503174} 11/07/2021 05:45:24 - INFO - __main__ - Step 60377: {'lr': 0.0003314009587791887, 'samples': 11592384, 'steps': 60376, 'loss/train': 0.8106988072395325} 11/07/2021 05:45:25 - INFO - __main__ - Step 60378: {'lr': 0.0003313959412006956, 'samples': 11592576, 'steps': 60377, 'loss/train': 1.144124984741211} 11/07/2021 05:45:25 - INFO - __main__ - Step 60379: {'lr': 0.00033139092358552667, 'samples': 11592768, 'steps': 60378, 'loss/train': 1.1977169513702393} 11/07/2021 05:45:26 - INFO - __main__ - Step 60380: {'lr': 0.00033138590593368437, 'samples': 11592960, 'steps': 60379, 'loss/train': 1.6603484153747559} 11/07/2021 05:45:27 - INFO - __main__ - Step 60381: {'lr': 0.00033138088824517066, 'samples': 11593152, 'steps': 60380, 'loss/train': 1.0656213760375977} 11/07/2021 05:45:27 - INFO - __main__ - Step 60382: {'lr': 0.0003313758705199881, 'samples': 11593344, 'steps': 60381, 'loss/train': 0.8614035248756409} 11/07/2021 05:45:27 - INFO - __main__ - Step 60383: {'lr': 0.00033137085275813873, 'samples': 11593536, 'steps': 60382, 'loss/train': 0.8455507159233093} 11/07/2021 05:45:28 - INFO - __main__ - Step 60384: {'lr': 0.00033136583495962496, 'samples': 11593728, 'steps': 60383, 'loss/train': 1.6019415855407715} 11/07/2021 05:45:28 - INFO - __main__ - Step 60385: {'lr': 0.00033136081712444905, 'samples': 11593920, 'steps': 60384, 'loss/train': 2.0023818016052246} 11/07/2021 05:45:29 - INFO - __main__ - Step 60386: {'lr': 0.0003313557992526132, 'samples': 11594112, 'steps': 60385, 'loss/train': 1.2212547063827515} 11/07/2021 05:45:29 - INFO - __main__ - Step 60387: {'lr': 0.00033135078134411956, 'samples': 11594304, 'steps': 60386, 'loss/train': 1.4593546390533447} 11/07/2021 05:45:30 - INFO - __main__ - Step 60388: {'lr': 0.0003313457633989706, 'samples': 11594496, 'steps': 60387, 'loss/train': 1.4312816858291626} 11/07/2021 05:45:30 - INFO - __main__ - Step 60389: {'lr': 0.00033134074541716854, 'samples': 11594688, 'steps': 60388, 'loss/train': 1.6268479824066162} 11/07/2021 05:45:30 - INFO - __main__ - Step 60390: {'lr': 0.00033133572739871546, 'samples': 11594880, 'steps': 60389, 'loss/train': 1.43461012840271} 11/07/2021 05:45:31 - INFO - __main__ - Step 60391: {'lr': 0.0003313307093436139, 'samples': 11595072, 'steps': 60390, 'loss/train': 1.3008288145065308} 11/07/2021 05:45:32 - INFO - __main__ - Step 60392: {'lr': 0.00033132569125186596, 'samples': 11595264, 'steps': 60391, 'loss/train': 1.018583059310913} 11/07/2021 05:45:32 - INFO - __main__ - Step 60393: {'lr': 0.00033132067312347386, 'samples': 11595456, 'steps': 60392, 'loss/train': 1.4265658855438232} 11/07/2021 05:45:32 - INFO - __main__ - Step 60394: {'lr': 0.0003313156549584399, 'samples': 11595648, 'steps': 60393, 'loss/train': 1.50850248336792} 11/07/2021 05:45:33 - INFO - __main__ - Step 60395: {'lr': 0.0003313106367567664, 'samples': 11595840, 'steps': 60394, 'loss/train': 1.281469702720642} 11/07/2021 05:45:34 - INFO - __main__ - Step 60396: {'lr': 0.00033130561851845564, 'samples': 11596032, 'steps': 60395, 'loss/train': 1.4847666025161743} 11/07/2021 05:45:34 - INFO - __main__ - Step 60397: {'lr': 0.0003313006002435097, 'samples': 11596224, 'steps': 60396, 'loss/train': 1.2876263856887817} 11/07/2021 05:45:34 - INFO - __main__ - Step 60398: {'lr': 0.00033129558193193103, 'samples': 11596416, 'steps': 60397, 'loss/train': 0.1523812711238861} 11/07/2021 05:45:35 - INFO - __main__ - Step 60399: {'lr': 0.0003312905635837218, 'samples': 11596608, 'steps': 60398, 'loss/train': 1.2893315553665161} 11/07/2021 05:45:35 - INFO - __main__ - Step 60400: {'lr': 0.00033128554519888437, 'samples': 11596800, 'steps': 60399, 'loss/train': 1.6972821950912476} 11/07/2021 05:45:36 - INFO - __main__ - Step 60401: {'lr': 0.0003312805267774209, 'samples': 11596992, 'steps': 60400, 'loss/train': 0.9182083010673523} 11/07/2021 05:45:37 - INFO - __main__ - Step 60402: {'lr': 0.0003312755083193337, 'samples': 11597184, 'steps': 60401, 'loss/train': 0.8457756638526917} 11/07/2021 05:45:37 - INFO - __main__ - Step 60403: {'lr': 0.0003312704898246249, 'samples': 11597376, 'steps': 60402, 'loss/train': 1.2087161540985107} 11/07/2021 05:45:37 - INFO - __main__ - Step 60404: {'lr': 0.00033126547129329694, 'samples': 11597568, 'steps': 60403, 'loss/train': 0.6992038488388062} 11/07/2021 05:45:38 - INFO - __main__ - Step 60405: {'lr': 0.000331260452725352, 'samples': 11597760, 'steps': 60404, 'loss/train': 1.202842354774475} 11/07/2021 05:45:39 - INFO - __main__ - Step 60406: {'lr': 0.0003312554341207924, 'samples': 11597952, 'steps': 60405, 'loss/train': 1.6577576398849487} 11/07/2021 05:45:39 - INFO - __main__ - Step 60407: {'lr': 0.0003312504154796203, 'samples': 11598144, 'steps': 60406, 'loss/train': 2.1166443824768066} 11/07/2021 05:45:39 - INFO - __main__ - Step 60408: {'lr': 0.000331245396801838, 'samples': 11598336, 'steps': 60407, 'loss/train': 1.43145751953125} 11/07/2021 05:45:40 - INFO - __main__ - Step 60409: {'lr': 0.0003312403780874479, 'samples': 11598528, 'steps': 60408, 'loss/train': 1.4576280117034912} 11/07/2021 05:45:40 - INFO - __main__ - Step 60410: {'lr': 0.000331235359336452, 'samples': 11598720, 'steps': 60409, 'loss/train': 1.1086785793304443} 11/07/2021 05:45:41 - INFO - __main__ - Step 60411: {'lr': 0.00033123034054885275, 'samples': 11598912, 'steps': 60410, 'loss/train': 1.1411659717559814} 11/07/2021 05:45:41 - INFO - __main__ - Step 60412: {'lr': 0.0003312253217246524, 'samples': 11599104, 'steps': 60411, 'loss/train': 1.6032564640045166} 11/07/2021 05:45:42 - INFO - __main__ - Step 60413: {'lr': 0.0003312203028638531, 'samples': 11599296, 'steps': 60412, 'loss/train': 1.157576084136963} 11/07/2021 05:45:42 - INFO - __main__ - Step 60414: {'lr': 0.0003312152839664572, 'samples': 11599488, 'steps': 60413, 'loss/train': 1.1573214530944824} 11/07/2021 05:45:43 - INFO - __main__ - Step 60415: {'lr': 0.00033121026503246697, 'samples': 11599680, 'steps': 60414, 'loss/train': 1.1228677034378052} 11/07/2021 05:45:44 - INFO - __main__ - Step 60416: {'lr': 0.0003312052460618847, 'samples': 11599872, 'steps': 60415, 'loss/train': 1.5968266725540161} 11/07/2021 05:45:44 - INFO - __main__ - Step 60417: {'lr': 0.0003312002270547125, 'samples': 11600064, 'steps': 60416, 'loss/train': 1.2464699745178223} 11/07/2021 05:45:44 - INFO - __main__ - Step 60418: {'lr': 0.0003311952080109528, 'samples': 11600256, 'steps': 60417, 'loss/train': 1.3546085357666016} 11/07/2021 05:45:45 - INFO - __main__ - Step 60419: {'lr': 0.00033119018893060774, 'samples': 11600448, 'steps': 60418, 'loss/train': 1.1661535501480103} 11/07/2021 05:45:45 - INFO - __main__ - Step 60420: {'lr': 0.0003311851698136797, 'samples': 11600640, 'steps': 60419, 'loss/train': 1.293436884880066} 11/07/2021 05:45:45 - INFO - __main__ - Step 60421: {'lr': 0.00033118015066017085, 'samples': 11600832, 'steps': 60420, 'loss/train': 1.4185272455215454} 11/07/2021 05:45:47 - INFO - __main__ - Step 60422: {'lr': 0.0003311751314700835, 'samples': 11601024, 'steps': 60421, 'loss/train': 0.9404990673065186} 11/07/2021 05:45:47 - INFO - __main__ - Step 60423: {'lr': 0.0003311701122434198, 'samples': 11601216, 'steps': 60422, 'loss/train': 1.164953351020813} 11/07/2021 05:45:47 - INFO - __main__ - Step 60424: {'lr': 0.00033116509298018217, 'samples': 11601408, 'steps': 60423, 'loss/train': 1.353729486465454} 11/07/2021 05:45:48 - INFO - __main__ - Step 60425: {'lr': 0.0003311600736803728, 'samples': 11601600, 'steps': 60424, 'loss/train': 1.114314317703247} 11/07/2021 05:45:48 - INFO - __main__ - Step 60426: {'lr': 0.0003311550543439939, 'samples': 11601792, 'steps': 60425, 'loss/train': 1.550822138786316} 11/07/2021 05:45:49 - INFO - __main__ - Step 60427: {'lr': 0.00033115003497104787, 'samples': 11601984, 'steps': 60426, 'loss/train': 1.3000882863998413} 11/07/2021 05:45:49 - INFO - __main__ - Step 60428: {'lr': 0.00033114501556153673, 'samples': 11602176, 'steps': 60427, 'loss/train': 1.2256699800491333} 11/07/2021 05:45:50 - INFO - __main__ - Step 60429: {'lr': 0.0003311399961154631, 'samples': 11602368, 'steps': 60428, 'loss/train': 1.636409878730774} 11/07/2021 05:45:50 - INFO - __main__ - Step 60430: {'lr': 0.00033113497663282893, 'samples': 11602560, 'steps': 60429, 'loss/train': 0.8849931359291077} 11/07/2021 05:45:50 - INFO - __main__ - Step 60431: {'lr': 0.00033112995711363666, 'samples': 11602752, 'steps': 60430, 'loss/train': 1.2163029909133911} 11/07/2021 05:45:51 - INFO - __main__ - Step 60432: {'lr': 0.0003311249375578884, 'samples': 11602944, 'steps': 60431, 'loss/train': 1.3822362422943115} 11/07/2021 05:45:52 - INFO - __main__ - Step 60433: {'lr': 0.0003311199179655865, 'samples': 11603136, 'steps': 60432, 'loss/train': 1.0989490747451782} 11/07/2021 05:45:52 - INFO - __main__ - Step 60434: {'lr': 0.00033111489833673326, 'samples': 11603328, 'steps': 60433, 'loss/train': 1.3215237855911255} 11/07/2021 05:45:53 - INFO - __main__ - Step 60435: {'lr': 0.00033110987867133085, 'samples': 11603520, 'steps': 60434, 'loss/train': 1.512205958366394} 11/07/2021 05:45:53 - INFO - __main__ - Step 60436: {'lr': 0.0003311048589693817, 'samples': 11603712, 'steps': 60435, 'loss/train': 1.5882304906845093} 11/07/2021 05:45:54 - INFO - __main__ - Step 60437: {'lr': 0.0003310998392308878, 'samples': 11603904, 'steps': 60436, 'loss/train': 1.346817135810852} 11/07/2021 05:45:54 - INFO - __main__ - Step 60438: {'lr': 0.00033109481945585163, 'samples': 11604096, 'steps': 60437, 'loss/train': 1.1953644752502441} 11/07/2021 05:45:55 - INFO - __main__ - Step 60439: {'lr': 0.0003310897996442754, 'samples': 11604288, 'steps': 60438, 'loss/train': 1.3942230939865112} 11/07/2021 05:45:55 - INFO - __main__ - Step 60440: {'lr': 0.0003310847797961613, 'samples': 11604480, 'steps': 60439, 'loss/train': 1.8244459629058838} 11/07/2021 05:45:55 - INFO - __main__ - Step 60441: {'lr': 0.0003310797599115117, 'samples': 11604672, 'steps': 60440, 'loss/train': 1.0840210914611816} 11/07/2021 05:45:56 - INFO - __main__ - Step 60442: {'lr': 0.0003310747399903288, 'samples': 11604864, 'steps': 60441, 'loss/train': 1.5264474153518677} 11/07/2021 05:45:57 - INFO - __main__ - Step 60443: {'lr': 0.00033106972003261494, 'samples': 11605056, 'steps': 60442, 'loss/train': 1.5136007070541382} 11/07/2021 05:45:57 - INFO - __main__ - Step 60444: {'lr': 0.00033106470003837227, 'samples': 11605248, 'steps': 60443, 'loss/train': 1.2287436723709106} 11/07/2021 05:45:57 - INFO - __main__ - Step 60445: {'lr': 0.000331059680007603, 'samples': 11605440, 'steps': 60444, 'loss/train': 1.2876503467559814} 11/07/2021 05:45:58 - INFO - __main__ - Step 60446: {'lr': 0.0003310546599403096, 'samples': 11605632, 'steps': 60445, 'loss/train': 1.3455021381378174} 11/07/2021 05:45:59 - INFO - __main__ - Step 60447: {'lr': 0.00033104963983649415, 'samples': 11605824, 'steps': 60446, 'loss/train': 1.4982023239135742} 11/07/2021 05:45:59 - INFO - __main__ - Step 60448: {'lr': 0.0003310446196961591, 'samples': 11606016, 'steps': 60447, 'loss/train': 1.5876092910766602} 11/07/2021 05:45:59 - INFO - __main__ - Step 60449: {'lr': 0.0003310395995193065, 'samples': 11606208, 'steps': 60448, 'loss/train': 1.3300775289535522} 11/07/2021 05:46:00 - INFO - __main__ - Step 60450: {'lr': 0.00033103457930593874, 'samples': 11606400, 'steps': 60449, 'loss/train': 1.3081071376800537} 11/07/2021 05:46:00 - INFO - __main__ - Step 60451: {'lr': 0.000331029559056058, 'samples': 11606592, 'steps': 60450, 'loss/train': 1.231108546257019} 11/07/2021 05:46:01 - INFO - __main__ - Step 60452: {'lr': 0.0003310245387696666, 'samples': 11606784, 'steps': 60451, 'loss/train': 1.4177879095077515} 11/07/2021 05:46:01 - INFO - __main__ - Step 60453: {'lr': 0.0003310195184467668, 'samples': 11606976, 'steps': 60452, 'loss/train': 1.3387709856033325} 11/07/2021 05:46:02 - INFO - __main__ - Step 60454: {'lr': 0.0003310144980873609, 'samples': 11607168, 'steps': 60453, 'loss/train': 1.3432488441467285} 11/07/2021 05:46:02 - INFO - __main__ - Step 60455: {'lr': 0.00033100947769145107, 'samples': 11607360, 'steps': 60454, 'loss/train': 1.3807772397994995} 11/07/2021 05:46:03 - INFO - __main__ - Step 60456: {'lr': 0.0003310044572590397, 'samples': 11607552, 'steps': 60455, 'loss/train': 1.472807765007019} 11/07/2021 05:46:03 - INFO - __main__ - Step 60457: {'lr': 0.0003309994367901289, 'samples': 11607744, 'steps': 60456, 'loss/train': 1.4208564758300781} 11/07/2021 05:46:04 - INFO - __main__ - Step 60458: {'lr': 0.000330994416284721, 'samples': 11607936, 'steps': 60457, 'loss/train': 1.665175437927246} 11/07/2021 05:46:04 - INFO - __main__ - Step 60459: {'lr': 0.0003309893957428183, 'samples': 11608128, 'steps': 60458, 'loss/train': 1.2103618383407593} 11/07/2021 05:46:05 - INFO - __main__ - Step 60460: {'lr': 0.00033098437516442295, 'samples': 11608320, 'steps': 60459, 'loss/train': 1.0908466577529907} 11/07/2021 05:46:05 - INFO - __main__ - Step 60461: {'lr': 0.00033097935454953737, 'samples': 11608512, 'steps': 60460, 'loss/train': 1.9695872068405151} 11/07/2021 05:46:06 - INFO - __main__ - Step 60462: {'lr': 0.00033097433389816367, 'samples': 11608704, 'steps': 60461, 'loss/train': 1.2414299249649048} 11/07/2021 05:46:06 - INFO - __main__ - Step 60463: {'lr': 0.00033096931321030434, 'samples': 11608896, 'steps': 60462, 'loss/train': 1.3011271953582764} 11/07/2021 05:46:07 - INFO - __main__ - Step 60464: {'lr': 0.00033096429248596134, 'samples': 11609088, 'steps': 60463, 'loss/train': 1.3683841228485107} 11/07/2021 05:46:07 - INFO - __main__ - Step 60465: {'lr': 0.0003309592717251371, 'samples': 11609280, 'steps': 60464, 'loss/train': 1.3523870706558228} 11/07/2021 05:46:07 - INFO - __main__ - Step 60466: {'lr': 0.00033095425092783385, 'samples': 11609472, 'steps': 60465, 'loss/train': 0.09235043078660965} 11/07/2021 05:46:08 - INFO - __main__ - Step 60467: {'lr': 0.0003309492300940539, 'samples': 11609664, 'steps': 60466, 'loss/train': 1.3416205644607544} 11/07/2021 05:46:09 - INFO - __main__ - Step 60468: {'lr': 0.0003309442092237995, 'samples': 11609856, 'steps': 60467, 'loss/train': 0.6195552945137024} 11/07/2021 05:46:09 - INFO - __main__ - Step 60469: {'lr': 0.0003309391883170729, 'samples': 11610048, 'steps': 60468, 'loss/train': 1.1269935369491577} 11/07/2021 05:46:09 - INFO - __main__ - Step 60470: {'lr': 0.0003309341673738763, 'samples': 11610240, 'steps': 60469, 'loss/train': 1.5088574886322021} 11/07/2021 05:46:10 - INFO - __main__ - Step 60471: {'lr': 0.000330929146394212, 'samples': 11610432, 'steps': 60470, 'loss/train': 1.4399762153625488} 11/07/2021 05:46:10 - INFO - __main__ - Step 60472: {'lr': 0.0003309241253780823, 'samples': 11610624, 'steps': 60471, 'loss/train': 1.3345330953598022} 11/07/2021 05:46:11 - INFO - __main__ - Step 60473: {'lr': 0.00033091910432548943, 'samples': 11610816, 'steps': 60472, 'loss/train': 0.9177373647689819} 11/07/2021 05:46:11 - INFO - __main__ - Step 60474: {'lr': 0.00033091408323643567, 'samples': 11611008, 'steps': 60473, 'loss/train': 1.3507461547851562} 11/07/2021 05:46:12 - INFO - __main__ - Step 60475: {'lr': 0.00033090906211092323, 'samples': 11611200, 'steps': 60474, 'loss/train': 1.188789963722229} 11/07/2021 05:46:12 - INFO - __main__ - Step 60476: {'lr': 0.00033090404094895454, 'samples': 11611392, 'steps': 60475, 'loss/train': 1.4171885251998901} 11/07/2021 05:46:13 - INFO - __main__ - Step 60477: {'lr': 0.0003308990197505316, 'samples': 11611584, 'steps': 60476, 'loss/train': 1.2103404998779297} 11/07/2021 05:46:13 - INFO - __main__ - Step 60478: {'lr': 0.0003308939985156569, 'samples': 11611776, 'steps': 60477, 'loss/train': 1.5409365892410278} 11/07/2021 05:46:14 - INFO - __main__ - Step 60479: {'lr': 0.00033088897724433254, 'samples': 11611968, 'steps': 60478, 'loss/train': 1.5020544528961182} 11/07/2021 05:46:14 - INFO - __main__ - Step 60480: {'lr': 0.0003308839559365609, 'samples': 11612160, 'steps': 60479, 'loss/train': 1.2332615852355957} 11/07/2021 05:46:15 - INFO - __main__ - Step 60481: {'lr': 0.0003308789345923442, 'samples': 11612352, 'steps': 60480, 'loss/train': 1.4904967546463013} 11/07/2021 05:46:15 - INFO - __main__ - Step 60482: {'lr': 0.0003308739132116847, 'samples': 11612544, 'steps': 60481, 'loss/train': 1.570164680480957} 11/07/2021 05:46:16 - INFO - __main__ - Step 60483: {'lr': 0.0003308688917945847, 'samples': 11612736, 'steps': 60482, 'loss/train': 1.664711594581604} 11/07/2021 05:46:16 - INFO - __main__ - Step 60484: {'lr': 0.00033086387034104634, 'samples': 11612928, 'steps': 60483, 'loss/train': 1.0946649312973022} 11/07/2021 05:46:17 - INFO - __main__ - Step 60485: {'lr': 0.00033085884885107196, 'samples': 11613120, 'steps': 60484, 'loss/train': 1.709930658340454} 11/07/2021 05:46:17 - INFO - __main__ - Step 60486: {'lr': 0.0003308538273246639, 'samples': 11613312, 'steps': 60485, 'loss/train': 1.459619164466858} 11/07/2021 05:46:17 - INFO - __main__ - Step 60487: {'lr': 0.0003308488057618243, 'samples': 11613504, 'steps': 60486, 'loss/train': 0.6794403195381165} 11/07/2021 05:46:18 - INFO - __main__ - Step 60488: {'lr': 0.0003308437841625555, 'samples': 11613696, 'steps': 60487, 'loss/train': 1.4986510276794434} 11/07/2021 05:46:19 - INFO - __main__ - Step 60489: {'lr': 0.00033083876252685976, 'samples': 11613888, 'steps': 60488, 'loss/train': 1.230689287185669} 11/07/2021 05:46:19 - INFO - __main__ - Step 60490: {'lr': 0.0003308337408547393, 'samples': 11614080, 'steps': 60489, 'loss/train': 1.6394184827804565} 11/07/2021 05:46:19 - INFO - __main__ - Step 60491: {'lr': 0.00033082871914619645, 'samples': 11614272, 'steps': 60490, 'loss/train': 1.0770151615142822} 11/07/2021 05:46:20 - INFO - __main__ - Step 60492: {'lr': 0.00033082369740123333, 'samples': 11614464, 'steps': 60491, 'loss/train': 1.1978358030319214} 11/07/2021 05:46:21 - INFO - __main__ - Step 60493: {'lr': 0.00033081867561985236, 'samples': 11614656, 'steps': 60492, 'loss/train': 1.4656721353530884} 11/07/2021 05:46:21 - INFO - __main__ - Step 60494: {'lr': 0.00033081365380205574, 'samples': 11614848, 'steps': 60493, 'loss/train': 1.214465856552124} 11/07/2021 05:46:21 - INFO - __main__ - Step 60495: {'lr': 0.0003308086319478457, 'samples': 11615040, 'steps': 60494, 'loss/train': 1.2922340631484985} 11/07/2021 05:46:22 - INFO - __main__ - Step 60496: {'lr': 0.0003308036100572246, 'samples': 11615232, 'steps': 60495, 'loss/train': 0.7649991512298584} 11/07/2021 05:46:22 - INFO - __main__ - Step 60497: {'lr': 0.00033079858813019465, 'samples': 11615424, 'steps': 60496, 'loss/train': 1.6473861932754517} 11/07/2021 05:46:23 - INFO - __main__ - Step 60498: {'lr': 0.000330793566166758, 'samples': 11615616, 'steps': 60497, 'loss/train': 1.1830865144729614} 11/07/2021 05:46:23 - INFO - __main__ - Step 60499: {'lr': 0.0003307885441669171, 'samples': 11615808, 'steps': 60498, 'loss/train': 1.39444100856781} 11/07/2021 05:46:24 - INFO - __main__ - Step 60500: {'lr': 0.0003307835221306741, 'samples': 11616000, 'steps': 60499, 'loss/train': 1.3697834014892578} 11/07/2021 05:46:24 - INFO - __main__ - Step 60501: {'lr': 0.0003307785000580313, 'samples': 11616192, 'steps': 60500, 'loss/train': 1.6147236824035645} 11/07/2021 05:46:24 - INFO - __main__ - Step 60502: {'lr': 0.00033077347794899096, 'samples': 11616384, 'steps': 60501, 'loss/train': 1.1470057964324951} 11/07/2021 05:46:25 - INFO - __main__ - Step 60503: {'lr': 0.00033076845580355533, 'samples': 11616576, 'steps': 60502, 'loss/train': 1.4067966938018799} 11/07/2021 05:46:26 - INFO - __main__ - Step 60504: {'lr': 0.00033076343362172666, 'samples': 11616768, 'steps': 60503, 'loss/train': 1.1807045936584473} 11/07/2021 05:46:26 - INFO - __main__ - Step 60505: {'lr': 0.00033075841140350724, 'samples': 11616960, 'steps': 60504, 'loss/train': 1.243285894393921} 11/07/2021 05:46:27 - INFO - __main__ - Step 60506: {'lr': 0.00033075338914889934, 'samples': 11617152, 'steps': 60505, 'loss/train': 0.978061318397522} 11/07/2021 05:46:27 - INFO - __main__ - Step 60507: {'lr': 0.00033074836685790523, 'samples': 11617344, 'steps': 60506, 'loss/train': 1.232883095741272} 11/07/2021 05:46:28 - INFO - __main__ - Step 60508: {'lr': 0.0003307433445305271, 'samples': 11617536, 'steps': 60507, 'loss/train': 1.4723087549209595} 11/07/2021 05:46:28 - INFO - __main__ - Step 60509: {'lr': 0.0003307383221667673, 'samples': 11617728, 'steps': 60508, 'loss/train': 1.3440356254577637} 11/07/2021 05:46:29 - INFO - __main__ - Step 60510: {'lr': 0.00033073329976662807, 'samples': 11617920, 'steps': 60509, 'loss/train': 1.2128040790557861} 11/07/2021 05:46:29 - INFO - __main__ - Step 60511: {'lr': 0.00033072827733011164, 'samples': 11618112, 'steps': 60510, 'loss/train': 1.0282177925109863} 11/07/2021 05:46:29 - INFO - __main__ - Step 60512: {'lr': 0.0003307232548572203, 'samples': 11618304, 'steps': 60511, 'loss/train': 1.2984400987625122} 11/07/2021 05:46:30 - INFO - __main__ - Step 60513: {'lr': 0.0003307182323479563, 'samples': 11618496, 'steps': 60512, 'loss/train': 2.0061848163604736} 11/07/2021 05:46:31 - INFO - __main__ - Step 60514: {'lr': 0.0003307132098023219, 'samples': 11618688, 'steps': 60513, 'loss/train': 0.42296141386032104} 11/07/2021 05:46:31 - INFO - __main__ - Step 60515: {'lr': 0.00033070818722031936, 'samples': 11618880, 'steps': 60514, 'loss/train': 1.3688561916351318} 11/07/2021 05:46:31 - INFO - __main__ - Step 60516: {'lr': 0.00033070316460195106, 'samples': 11619072, 'steps': 60515, 'loss/train': 1.6524235010147095} 11/07/2021 05:46:32 - INFO - __main__ - Step 60517: {'lr': 0.00033069814194721905, 'samples': 11619264, 'steps': 60516, 'loss/train': 1.4624488353729248} 11/07/2021 05:46:33 - INFO - __main__ - Step 60518: {'lr': 0.0003306931192561257, 'samples': 11619456, 'steps': 60517, 'loss/train': 1.4170624017715454} 11/07/2021 05:46:33 - INFO - __main__ - Step 60519: {'lr': 0.0003306880965286734, 'samples': 11619648, 'steps': 60518, 'loss/train': 1.1470211744308472} 11/07/2021 05:46:33 - INFO - __main__ - Step 60520: {'lr': 0.0003306830737648642, 'samples': 11619840, 'steps': 60519, 'loss/train': 1.4575327634811401} 11/07/2021 05:46:34 - INFO - __main__ - Step 60521: {'lr': 0.0003306780509647004, 'samples': 11620032, 'steps': 60520, 'loss/train': 1.3133435249328613} 11/07/2021 05:46:34 - INFO - __main__ - Step 60522: {'lr': 0.0003306730281281843, 'samples': 11620224, 'steps': 60521, 'loss/train': 1.526803970336914} 11/07/2021 05:46:35 - INFO - __main__ - Step 60523: {'lr': 0.00033066800525531826, 'samples': 11620416, 'steps': 60522, 'loss/train': 1.5734599828720093} 11/07/2021 05:46:35 - INFO - __main__ - Step 60524: {'lr': 0.0003306629823461045, 'samples': 11620608, 'steps': 60523, 'loss/train': 0.7263179421424866} 11/07/2021 05:46:36 - INFO - __main__ - Step 60525: {'lr': 0.0003306579594005452, 'samples': 11620800, 'steps': 60524, 'loss/train': 1.3870697021484375} 11/07/2021 05:46:36 - INFO - __main__ - Step 60526: {'lr': 0.0003306529364186426, 'samples': 11620992, 'steps': 60525, 'loss/train': 1.1966873407363892} 11/07/2021 05:46:37 - INFO - __main__ - Step 60527: {'lr': 0.00033064791340039915, 'samples': 11621184, 'steps': 60526, 'loss/train': 1.10444974899292} 11/07/2021 05:46:37 - INFO - __main__ - Step 60528: {'lr': 0.0003306428903458169, 'samples': 11621376, 'steps': 60527, 'loss/train': 1.3444327116012573} 11/07/2021 05:46:38 - INFO - __main__ - Step 60529: {'lr': 0.0003306378672548982, 'samples': 11621568, 'steps': 60528, 'loss/train': 1.3391139507293701} 11/07/2021 05:46:38 - INFO - __main__ - Step 60530: {'lr': 0.0003306328441276454, 'samples': 11621760, 'steps': 60529, 'loss/train': 1.583579421043396} 11/07/2021 05:46:39 - INFO - __main__ - Step 60531: {'lr': 0.0003306278209640607, 'samples': 11621952, 'steps': 60530, 'loss/train': 0.6949528455734253} 11/07/2021 05:46:39 - INFO - __main__ - Step 60532: {'lr': 0.0003306227977641463, 'samples': 11622144, 'steps': 60531, 'loss/train': 1.2163158655166626} 11/07/2021 05:46:39 - INFO - __main__ - Step 60533: {'lr': 0.0003306177745279045, 'samples': 11622336, 'steps': 60532, 'loss/train': 1.397707223892212} 11/07/2021 05:46:40 - INFO - __main__ - Step 60534: {'lr': 0.0003306127512553375, 'samples': 11622528, 'steps': 60533, 'loss/train': 1.631386637687683} 11/07/2021 05:46:41 - INFO - __main__ - Step 60535: {'lr': 0.00033060772794644776, 'samples': 11622720, 'steps': 60534, 'loss/train': 1.0709989070892334} 11/07/2021 05:46:41 - INFO - __main__ - Step 60536: {'lr': 0.00033060270460123737, 'samples': 11622912, 'steps': 60535, 'loss/train': 1.7698630094528198} 11/07/2021 05:46:41 - INFO - __main__ - Step 60537: {'lr': 0.0003305976812197087, 'samples': 11623104, 'steps': 60536, 'loss/train': 1.9117897748947144} 11/07/2021 05:46:42 - INFO - __main__ - Step 60538: {'lr': 0.00033059265780186386, 'samples': 11623296, 'steps': 60537, 'loss/train': 0.9508830904960632} 11/07/2021 05:46:43 - INFO - __main__ - Step 60539: {'lr': 0.00033058763434770536, 'samples': 11623488, 'steps': 60538, 'loss/train': 1.0077407360076904} 11/07/2021 05:46:43 - INFO - __main__ - Step 60540: {'lr': 0.0003305826108572352, 'samples': 11623680, 'steps': 60539, 'loss/train': 1.5150368213653564} 11/07/2021 05:46:44 - INFO - __main__ - Step 60541: {'lr': 0.00033057758733045573, 'samples': 11623872, 'steps': 60540, 'loss/train': 1.3964483737945557} 11/07/2021 05:46:44 - INFO - __main__ - Step 60542: {'lr': 0.0003305725637673693, 'samples': 11624064, 'steps': 60541, 'loss/train': 1.6991616487503052} 11/07/2021 05:46:44 - INFO - __main__ - Step 60543: {'lr': 0.00033056754016797814, 'samples': 11624256, 'steps': 60542, 'loss/train': 1.2141392230987549} 11/07/2021 05:46:45 - INFO - __main__ - Step 60544: {'lr': 0.00033056251653228446, 'samples': 11624448, 'steps': 60543, 'loss/train': 0.0622902512550354} 11/07/2021 05:46:46 - INFO - __main__ - Step 60545: {'lr': 0.00033055749286029054, 'samples': 11624640, 'steps': 60544, 'loss/train': 1.4211654663085938} 11/07/2021 05:46:46 - INFO - __main__ - Step 60546: {'lr': 0.0003305524691519987, 'samples': 11624832, 'steps': 60545, 'loss/train': 1.6176471710205078} 11/07/2021 05:46:46 - INFO - __main__ - Step 60547: {'lr': 0.0003305474454074111, 'samples': 11625024, 'steps': 60546, 'loss/train': 0.9996992945671082} 11/07/2021 05:46:47 - INFO - __main__ - Step 60548: {'lr': 0.0003305424216265301, 'samples': 11625216, 'steps': 60547, 'loss/train': 0.9403320550918579} 11/07/2021 05:46:48 - INFO - __main__ - Step 60549: {'lr': 0.0003305373978093579, 'samples': 11625408, 'steps': 60548, 'loss/train': 1.3911980390548706} 11/07/2021 05:46:48 - INFO - __main__ - Step 60550: {'lr': 0.0003305323739558969, 'samples': 11625600, 'steps': 60549, 'loss/train': 1.2299684286117554} 11/07/2021 05:46:49 - INFO - __main__ - Step 60551: {'lr': 0.0003305273500661491, 'samples': 11625792, 'steps': 60550, 'loss/train': 1.5531128644943237} 11/07/2021 05:46:49 - INFO - __main__ - Step 60552: {'lr': 0.000330522326140117, 'samples': 11625984, 'steps': 60551, 'loss/train': 0.1190563440322876} 11/07/2021 05:46:49 - INFO - __main__ - Step 60553: {'lr': 0.00033051730217780275, 'samples': 11626176, 'steps': 60552, 'loss/train': 1.5374178886413574} 11/07/2021 05:46:50 - INFO - __main__ - Step 60554: {'lr': 0.00033051227817920865, 'samples': 11626368, 'steps': 60553, 'loss/train': 1.2859631776809692} 11/07/2021 05:46:51 - INFO - __main__ - Step 60555: {'lr': 0.000330507254144337, 'samples': 11626560, 'steps': 60554, 'loss/train': 1.298138976097107} 11/07/2021 05:46:51 - INFO - __main__ - Step 60556: {'lr': 0.00033050223007319, 'samples': 11626752, 'steps': 60555, 'loss/train': 1.380121111869812} 11/07/2021 05:46:51 - INFO - __main__ - Step 60557: {'lr': 0.00033049720596576996, 'samples': 11626944, 'steps': 60556, 'loss/train': 1.2393879890441895} 11/07/2021 05:46:52 - INFO - __main__ - Step 60558: {'lr': 0.0003304921818220791, 'samples': 11627136, 'steps': 60557, 'loss/train': 1.2624696493148804} 11/07/2021 05:46:53 - INFO - __main__ - Step 60559: {'lr': 0.00033048715764211965, 'samples': 11627328, 'steps': 60558, 'loss/train': 1.1988160610198975} 11/07/2021 05:46:53 - INFO - __main__ - Step 60560: {'lr': 0.00033048213342589403, 'samples': 11627520, 'steps': 60559, 'loss/train': 1.4029486179351807} 11/07/2021 05:46:53 - INFO - __main__ - Step 60561: {'lr': 0.0003304771091734043, 'samples': 11627712, 'steps': 60560, 'loss/train': 1.0740954875946045} 11/07/2021 05:46:54 - INFO - __main__ - Step 60562: {'lr': 0.00033047208488465286, 'samples': 11627904, 'steps': 60561, 'loss/train': 1.487207055091858} 11/07/2021 05:46:54 - INFO - __main__ - Step 60563: {'lr': 0.00033046706055964197, 'samples': 11628096, 'steps': 60562, 'loss/train': 1.5586096048355103} 11/07/2021 05:46:55 - INFO - __main__ - Step 60564: {'lr': 0.0003304620361983739, 'samples': 11628288, 'steps': 60563, 'loss/train': 1.276833415031433} 11/07/2021 05:46:55 - INFO - __main__ - Step 60565: {'lr': 0.00033045701180085086, 'samples': 11628480, 'steps': 60564, 'loss/train': 1.6744836568832397} 11/07/2021 05:46:56 - INFO - __main__ - Step 60566: {'lr': 0.00033045198736707503, 'samples': 11628672, 'steps': 60565, 'loss/train': 1.6981033086776733} 11/07/2021 05:46:56 - INFO - __main__ - Step 60567: {'lr': 0.0003304469628970489, 'samples': 11628864, 'steps': 60566, 'loss/train': 0.8988810181617737} 11/07/2021 05:46:57 - INFO - __main__ - Step 60568: {'lr': 0.00033044193839077454, 'samples': 11629056, 'steps': 60567, 'loss/train': 1.5413932800292969} 11/07/2021 05:46:57 - INFO - __main__ - Step 60569: {'lr': 0.0003304369138482543, 'samples': 11629248, 'steps': 60568, 'loss/train': 1.3961039781570435} 11/07/2021 05:46:58 - INFO - __main__ - Step 60570: {'lr': 0.00033043188926949046, 'samples': 11629440, 'steps': 60569, 'loss/train': 1.0392414331436157} 11/07/2021 05:46:58 - INFO - __main__ - Step 60571: {'lr': 0.00033042686465448526, 'samples': 11629632, 'steps': 60570, 'loss/train': 0.07148535549640656} 11/07/2021 05:46:59 - INFO - __main__ - Step 60572: {'lr': 0.00033042184000324086, 'samples': 11629824, 'steps': 60571, 'loss/train': 1.1729272603988647} 11/07/2021 05:46:59 - INFO - __main__ - Step 60573: {'lr': 0.00033041681531575966, 'samples': 11630016, 'steps': 60572, 'loss/train': 1.4384188652038574} 11/07/2021 05:46:59 - INFO - __main__ - Step 60574: {'lr': 0.0003304117905920439, 'samples': 11630208, 'steps': 60573, 'loss/train': 1.2048555612564087} 11/07/2021 05:47:01 - INFO - __main__ - Step 60575: {'lr': 0.0003304067658320958, 'samples': 11630400, 'steps': 60574, 'loss/train': 1.4673428535461426} 11/07/2021 05:47:01 - INFO - __main__ - Step 60576: {'lr': 0.0003304017410359177, 'samples': 11630592, 'steps': 60575, 'loss/train': 1.2894229888916016} 11/07/2021 05:47:01 - INFO - __main__ - Step 60577: {'lr': 0.00033039671620351186, 'samples': 11630784, 'steps': 60576, 'loss/train': 1.159155249595642} 11/07/2021 05:47:02 - INFO - __main__ - Step 60578: {'lr': 0.00033039169133488043, 'samples': 11630976, 'steps': 60577, 'loss/train': 1.1955130100250244} 11/07/2021 05:47:02 - INFO - __main__ - Step 60579: {'lr': 0.00033038666643002575, 'samples': 11631168, 'steps': 60578, 'loss/train': 1.142136573791504} 11/07/2021 05:47:03 - INFO - __main__ - Step 60580: {'lr': 0.0003303816414889501, 'samples': 11631360, 'steps': 60579, 'loss/train': 1.7106280326843262} 11/07/2021 05:47:03 - INFO - __main__ - Step 60581: {'lr': 0.0003303766165116557, 'samples': 11631552, 'steps': 60580, 'loss/train': 1.328818917274475} 11/07/2021 05:47:04 - INFO - __main__ - Step 60582: {'lr': 0.00033037159149814483, 'samples': 11631744, 'steps': 60581, 'loss/train': 1.497675895690918} 11/07/2021 05:47:04 - INFO - __main__ - Step 60583: {'lr': 0.00033036656644841976, 'samples': 11631936, 'steps': 60582, 'loss/train': 1.0084922313690186} 11/07/2021 05:47:04 - INFO - __main__ - Step 60584: {'lr': 0.0003303615413624828, 'samples': 11632128, 'steps': 60583, 'loss/train': 1.4408111572265625} 11/07/2021 05:47:05 - INFO - __main__ - Step 60585: {'lr': 0.00033035651624033614, 'samples': 11632320, 'steps': 60584, 'loss/train': 1.0659106969833374} 11/07/2021 05:47:06 - INFO - __main__ - Step 60586: {'lr': 0.00033035149108198204, 'samples': 11632512, 'steps': 60585, 'loss/train': 1.4918583631515503} 11/07/2021 05:47:06 - INFO - __main__ - Step 60587: {'lr': 0.00033034646588742285, 'samples': 11632704, 'steps': 60586, 'loss/train': 1.179693579673767} 11/07/2021 05:47:06 - INFO - __main__ - Step 60588: {'lr': 0.00033034144065666074, 'samples': 11632896, 'steps': 60587, 'loss/train': 2.067915916442871} 11/07/2021 05:47:07 - INFO - __main__ - Step 60589: {'lr': 0.00033033641538969804, 'samples': 11633088, 'steps': 60588, 'loss/train': 1.194901704788208} 11/07/2021 05:47:08 - INFO - __main__ - Step 60590: {'lr': 0.000330331390086537, 'samples': 11633280, 'steps': 60589, 'loss/train': 1.5048609972000122} 11/07/2021 05:47:08 - INFO - __main__ - Step 60591: {'lr': 0.0003303263647471799, 'samples': 11633472, 'steps': 60590, 'loss/train': 1.6762449741363525} 11/07/2021 05:47:09 - INFO - __main__ - Step 60592: {'lr': 0.00033032133937162895, 'samples': 11633664, 'steps': 60591, 'loss/train': 1.4446053504943848} 11/07/2021 05:47:09 - INFO - __main__ - Step 60593: {'lr': 0.00033031631395988645, 'samples': 11633856, 'steps': 60592, 'loss/train': 1.4192105531692505} 11/07/2021 05:47:09 - INFO - __main__ - Step 60594: {'lr': 0.0003303112885119546, 'samples': 11634048, 'steps': 60593, 'loss/train': 1.7279887199401855} 11/07/2021 05:47:10 - INFO - __main__ - Step 60595: {'lr': 0.0003303062630278357, 'samples': 11634240, 'steps': 60594, 'loss/train': 1.6831245422363281} 11/07/2021 05:47:11 - INFO - __main__ - Step 60596: {'lr': 0.00033030123750753216, 'samples': 11634432, 'steps': 60595, 'loss/train': 1.213637113571167} 11/07/2021 05:47:11 - INFO - __main__ - Step 60597: {'lr': 0.00033029621195104607, 'samples': 11634624, 'steps': 60596, 'loss/train': 1.3287698030471802} 11/07/2021 05:47:11 - INFO - __main__ - Step 60598: {'lr': 0.0003302911863583798, 'samples': 11634816, 'steps': 60597, 'loss/train': 1.2982192039489746} 11/07/2021 05:47:12 - INFO - __main__ - Step 60599: {'lr': 0.0003302861607295355, 'samples': 11635008, 'steps': 60598, 'loss/train': 1.3942272663116455} 11/07/2021 05:47:13 - INFO - __main__ - Step 60600: {'lr': 0.0003302811350645155, 'samples': 11635200, 'steps': 60599, 'loss/train': 1.2280123233795166} 11/07/2021 05:47:13 - INFO - __main__ - Step 60601: {'lr': 0.000330276109363322, 'samples': 11635392, 'steps': 60600, 'loss/train': 1.5997462272644043} 11/07/2021 05:47:13 - INFO - __main__ - Step 60602: {'lr': 0.0003302710836259574, 'samples': 11635584, 'steps': 60601, 'loss/train': 1.2864990234375} 11/07/2021 05:47:14 - INFO - __main__ - Step 60603: {'lr': 0.00033026605785242387, 'samples': 11635776, 'steps': 60602, 'loss/train': 1.1384344100952148} 11/07/2021 05:47:14 - INFO - __main__ - Step 60604: {'lr': 0.0003302610320427237, 'samples': 11635968, 'steps': 60603, 'loss/train': 1.0575647354125977} 11/07/2021 05:47:15 - INFO - __main__ - Step 60605: {'lr': 0.0003302560061968591, 'samples': 11636160, 'steps': 60604, 'loss/train': 1.0888789892196655} 11/07/2021 05:47:15 - INFO - __main__ - Step 60606: {'lr': 0.0003302509803148325, 'samples': 11636352, 'steps': 60605, 'loss/train': 0.7133286595344543} 11/07/2021 05:47:16 - INFO - __main__ - Step 60607: {'lr': 0.000330245954396646, 'samples': 11636544, 'steps': 60606, 'loss/train': 1.0684980154037476} 11/07/2021 05:47:16 - INFO - __main__ - Step 60608: {'lr': 0.0003302409284423018, 'samples': 11636736, 'steps': 60607, 'loss/train': 1.6473618745803833} 11/07/2021 05:47:16 - INFO - __main__ - Step 60609: {'lr': 0.00033023590245180237, 'samples': 11636928, 'steps': 60608, 'loss/train': 1.3427810668945312} 11/07/2021 05:47:18 - INFO - __main__ - Step 60610: {'lr': 0.0003302308764251499, 'samples': 11637120, 'steps': 60609, 'loss/train': 1.1121400594711304} 11/07/2021 05:47:18 - INFO - __main__ - Step 60611: {'lr': 0.0003302258503623466, 'samples': 11637312, 'steps': 60610, 'loss/train': 1.2629863023757935} 11/07/2021 05:47:18 - INFO - __main__ - Step 60612: {'lr': 0.0003302208242633948, 'samples': 11637504, 'steps': 60611, 'loss/train': 1.285487174987793} 11/07/2021 05:47:19 - INFO - __main__ - Step 60613: {'lr': 0.00033021579812829666, 'samples': 11637696, 'steps': 60612, 'loss/train': 1.341538667678833} 11/07/2021 05:47:19 - INFO - __main__ - Step 60614: {'lr': 0.0003302107719570546, 'samples': 11637888, 'steps': 60613, 'loss/train': 1.2091463804244995} 11/07/2021 05:47:21 - INFO - __main__ - Step 60615: {'lr': 0.0003302057457496707, 'samples': 11638080, 'steps': 60614, 'loss/train': 1.4588520526885986} 11/07/2021 05:47:21 - INFO - __main__ - Step 60616: {'lr': 0.0003302007195061474, 'samples': 11638272, 'steps': 60615, 'loss/train': 1.1496399641036987} 11/07/2021 05:47:21 - INFO - __main__ - Step 60617: {'lr': 0.00033019569322648693, 'samples': 11638464, 'steps': 60616, 'loss/train': 1.255807638168335} 11/07/2021 05:47:22 - INFO - __main__ - Step 60618: {'lr': 0.0003301906669106915, 'samples': 11638656, 'steps': 60617, 'loss/train': 1.5705156326293945} 11/07/2021 05:47:22 - INFO - __main__ - Step 60619: {'lr': 0.0003301856405587634, 'samples': 11638848, 'steps': 60618, 'loss/train': 1.6443169116973877} 11/07/2021 05:47:22 - INFO - __main__ - Step 60620: {'lr': 0.0003301806141707048, 'samples': 11639040, 'steps': 60619, 'loss/train': 1.7913302183151245} 11/07/2021 05:47:23 - INFO - __main__ - Step 60621: {'lr': 0.0003301755877465181, 'samples': 11639232, 'steps': 60620, 'loss/train': 1.776828408241272} 11/07/2021 05:47:24 - INFO - __main__ - Step 60622: {'lr': 0.0003301705612862055, 'samples': 11639424, 'steps': 60621, 'loss/train': 1.0684175491333008} 11/07/2021 05:47:24 - INFO - __main__ - Step 60623: {'lr': 0.0003301655347897694, 'samples': 11639616, 'steps': 60622, 'loss/train': 1.179228663444519} 11/07/2021 05:47:24 - INFO - __main__ - Step 60624: {'lr': 0.0003301605082572119, 'samples': 11639808, 'steps': 60623, 'loss/train': 1.192246913909912} 11/07/2021 05:47:25 - INFO - __main__ - Step 60625: {'lr': 0.0003301554816885352, 'samples': 11640000, 'steps': 60624, 'loss/train': 0.6861558556556702} 11/07/2021 05:47:25 - INFO - __main__ - Step 60626: {'lr': 0.00033015045508374177, 'samples': 11640192, 'steps': 60625, 'loss/train': 0.9431832432746887} 11/07/2021 05:47:26 - INFO - __main__ - Step 60627: {'lr': 0.00033014542844283373, 'samples': 11640384, 'steps': 60626, 'loss/train': 1.4092527627944946} 11/07/2021 05:47:26 - INFO - __main__ - Step 60628: {'lr': 0.00033014040176581347, 'samples': 11640576, 'steps': 60627, 'loss/train': 1.176310658454895} 11/07/2021 05:47:27 - INFO - __main__ - Step 60629: {'lr': 0.0003301353750526831, 'samples': 11640768, 'steps': 60628, 'loss/train': 1.29683256149292} 11/07/2021 05:47:27 - INFO - __main__ - Step 60630: {'lr': 0.000330130348303445, 'samples': 11640960, 'steps': 60629, 'loss/train': 1.387449026107788} 11/07/2021 05:47:27 - INFO - __main__ - Step 60631: {'lr': 0.00033012532151810144, 'samples': 11641152, 'steps': 60630, 'loss/train': 1.432706594467163} 11/07/2021 05:47:29 - INFO - __main__ - Step 60632: {'lr': 0.0003301202946966546, 'samples': 11641344, 'steps': 60631, 'loss/train': 1.1304993629455566} 11/07/2021 05:47:29 - INFO - __main__ - Step 60633: {'lr': 0.0003301152678391068, 'samples': 11641536, 'steps': 60632, 'loss/train': 1.2297613620758057} 11/07/2021 05:47:29 - INFO - __main__ - Step 60634: {'lr': 0.00033011024094546025, 'samples': 11641728, 'steps': 60633, 'loss/train': 1.4637088775634766} 11/07/2021 05:47:30 - INFO - __main__ - Step 60635: {'lr': 0.00033010521401571734, 'samples': 11641920, 'steps': 60634, 'loss/train': 1.4055501222610474} 11/07/2021 05:47:30 - INFO - __main__ - Step 60636: {'lr': 0.0003301001870498802, 'samples': 11642112, 'steps': 60635, 'loss/train': 1.5108671188354492} 11/07/2021 05:47:31 - INFO - __main__ - Step 60637: {'lr': 0.00033009516004795127, 'samples': 11642304, 'steps': 60636, 'loss/train': 1.2170085906982422} 11/07/2021 05:47:32 - INFO - __main__ - Step 60638: {'lr': 0.0003300901330099326, 'samples': 11642496, 'steps': 60637, 'loss/train': 0.7308080196380615} 11/07/2021 05:47:32 - INFO - __main__ - Step 60639: {'lr': 0.0003300851059358265, 'samples': 11642688, 'steps': 60638, 'loss/train': 1.6531659364700317} 11/07/2021 05:47:32 - INFO - __main__ - Step 60640: {'lr': 0.0003300800788256354, 'samples': 11642880, 'steps': 60639, 'loss/train': 1.2142397165298462} 11/07/2021 05:47:33 - INFO - __main__ - Step 60641: {'lr': 0.00033007505167936135, 'samples': 11643072, 'steps': 60640, 'loss/train': 0.7283065915107727} 11/07/2021 05:47:33 - INFO - __main__ - Step 60642: {'lr': 0.0003300700244970068, 'samples': 11643264, 'steps': 60641, 'loss/train': 1.3169159889221191} 11/07/2021 05:47:34 - INFO - __main__ - Step 60643: {'lr': 0.00033006499727857393, 'samples': 11643456, 'steps': 60642, 'loss/train': 0.07941319048404694} 11/07/2021 05:47:35 - INFO - __main__ - Step 60644: {'lr': 0.000330059970024065, 'samples': 11643648, 'steps': 60643, 'loss/train': 1.162714958190918} 11/07/2021 05:47:35 - INFO - __main__ - Step 60645: {'lr': 0.00033005494273348224, 'samples': 11643840, 'steps': 60644, 'loss/train': 5.611440181732178} 11/07/2021 05:47:35 - INFO - __main__ - Step 60646: {'lr': 0.00033004991540682793, 'samples': 11644032, 'steps': 60645, 'loss/train': 1.3779680728912354} 11/07/2021 05:47:36 - INFO - __main__ - Step 60647: {'lr': 0.00033004488804410444, 'samples': 11644224, 'steps': 60646, 'loss/train': 1.7804577350616455} 11/07/2021 05:47:36 - INFO - __main__ - Step 60648: {'lr': 0.000330039860645314, 'samples': 11644416, 'steps': 60647, 'loss/train': 1.6821194887161255} 11/07/2021 05:47:37 - INFO - __main__ - Step 60649: {'lr': 0.00033003483321045874, 'samples': 11644608, 'steps': 60648, 'loss/train': 0.5718071460723877} 11/07/2021 05:47:37 - INFO - __main__ - Step 60650: {'lr': 0.000330029805739541, 'samples': 11644800, 'steps': 60649, 'loss/train': 1.2007300853729248} 11/07/2021 05:47:38 - INFO - __main__ - Step 60651: {'lr': 0.0003300247782325631, 'samples': 11644992, 'steps': 60650, 'loss/train': 1.3830184936523438} 11/07/2021 05:47:38 - INFO - __main__ - Step 60652: {'lr': 0.0003300197506895273, 'samples': 11645184, 'steps': 60651, 'loss/train': 1.3559999465942383} 11/07/2021 05:47:38 - INFO - __main__ - Step 60653: {'lr': 0.0003300147231104358, 'samples': 11645376, 'steps': 60652, 'loss/train': 1.4350225925445557} 11/07/2021 05:47:39 - INFO - __main__ - Step 60654: {'lr': 0.0003300096954952909, 'samples': 11645568, 'steps': 60653, 'loss/train': 0.8601046204566956} 11/07/2021 05:47:40 - INFO - __main__ - Step 60655: {'lr': 0.00033000466784409487, 'samples': 11645760, 'steps': 60654, 'loss/train': 1.31141996383667} 11/07/2021 05:47:40 - INFO - __main__ - Step 60656: {'lr': 0.00032999964015685004, 'samples': 11645952, 'steps': 60655, 'loss/train': 0.5687170028686523} 11/07/2021 05:47:41 - INFO - __main__ - Step 60657: {'lr': 0.0003299946124335585, 'samples': 11646144, 'steps': 60656, 'loss/train': 1.2618039846420288} 11/07/2021 05:47:41 - INFO - __main__ - Step 60658: {'lr': 0.0003299895846742227, 'samples': 11646336, 'steps': 60657, 'loss/train': 0.8939794898033142} 11/07/2021 05:47:42 - INFO - __main__ - Step 60659: {'lr': 0.0003299845568788448, 'samples': 11646528, 'steps': 60658, 'loss/train': 1.4081119298934937} 11/07/2021 05:47:42 - INFO - __main__ - Step 60660: {'lr': 0.0003299795290474271, 'samples': 11646720, 'steps': 60659, 'loss/train': 1.2664859294891357} 11/07/2021 05:47:42 - INFO - __main__ - Step 60661: {'lr': 0.00032997450117997184, 'samples': 11646912, 'steps': 60660, 'loss/train': 1.4706214666366577} 11/07/2021 05:47:43 - INFO - __main__ - Step 60662: {'lr': 0.0003299694732764813, 'samples': 11647104, 'steps': 60661, 'loss/train': 0.7092648148536682} 11/07/2021 05:47:43 - INFO - __main__ - Step 60663: {'lr': 0.00032996444533695777, 'samples': 11647296, 'steps': 60662, 'loss/train': 1.4878045320510864} 11/07/2021 05:47:44 - INFO - __main__ - Step 60664: {'lr': 0.00032995941736140347, 'samples': 11647488, 'steps': 60663, 'loss/train': 1.189684271812439} 11/07/2021 05:47:45 - INFO - __main__ - Step 60665: {'lr': 0.00032995438934982075, 'samples': 11647680, 'steps': 60664, 'loss/train': 1.6056480407714844} 11/07/2021 05:47:45 - INFO - __main__ - Step 60666: {'lr': 0.00032994936130221174, 'samples': 11647872, 'steps': 60665, 'loss/train': 1.6511934995651245} 11/07/2021 05:47:45 - INFO - __main__ - Step 60667: {'lr': 0.00032994433321857885, 'samples': 11648064, 'steps': 60666, 'loss/train': 1.2506403923034668} 11/07/2021 05:47:46 - INFO - __main__ - Step 60668: {'lr': 0.0003299393050989242, 'samples': 11648256, 'steps': 60667, 'loss/train': 1.7383122444152832} 11/07/2021 05:47:47 - INFO - __main__ - Step 60669: {'lr': 0.00032993427694325017, 'samples': 11648448, 'steps': 60668, 'loss/train': 1.6635938882827759} 11/07/2021 05:47:47 - INFO - __main__ - Step 60670: {'lr': 0.000329929248751559, 'samples': 11648640, 'steps': 60669, 'loss/train': 1.4398576021194458} 11/07/2021 05:47:48 - INFO - __main__ - Step 60671: {'lr': 0.00032992422052385297, 'samples': 11648832, 'steps': 60670, 'loss/train': 0.48403576016426086} 11/07/2021 05:47:48 - INFO - __main__ - Step 60672: {'lr': 0.00032991919226013427, 'samples': 11649024, 'steps': 60671, 'loss/train': 0.8784247040748596} 11/07/2021 05:47:48 - INFO - __main__ - Step 60673: {'lr': 0.00032991416396040526, 'samples': 11649216, 'steps': 60672, 'loss/train': 0.10399379581212997} 11/07/2021 05:47:49 - INFO - __main__ - Step 60674: {'lr': 0.00032990913562466805, 'samples': 11649408, 'steps': 60673, 'loss/train': 1.675089955329895} 11/07/2021 05:47:50 - INFO - __main__ - Step 60675: {'lr': 0.00032990410725292513, 'samples': 11649600, 'steps': 60674, 'loss/train': 1.0215145349502563} 11/07/2021 05:47:50 - INFO - __main__ - Step 60676: {'lr': 0.00032989907884517863, 'samples': 11649792, 'steps': 60675, 'loss/train': 0.07518627494573593} 11/07/2021 05:47:50 - INFO - __main__ - Step 60677: {'lr': 0.0003298940504014308, 'samples': 11649984, 'steps': 60676, 'loss/train': 1.3465046882629395} 11/07/2021 05:47:51 - INFO - __main__ - Step 60678: {'lr': 0.000329889021921684, 'samples': 11650176, 'steps': 60677, 'loss/train': 1.119543433189392} 11/07/2021 05:47:51 - INFO - __main__ - Step 60679: {'lr': 0.00032988399340594046, 'samples': 11650368, 'steps': 60678, 'loss/train': 1.567456841468811} 11/07/2021 05:47:52 - INFO - __main__ - Step 60680: {'lr': 0.0003298789648542023, 'samples': 11650560, 'steps': 60679, 'loss/train': 1.307152271270752} 11/07/2021 05:47:53 - INFO - __main__ - Step 60681: {'lr': 0.000329873936266472, 'samples': 11650752, 'steps': 60680, 'loss/train': 1.3369818925857544} 11/07/2021 05:47:53 - INFO - __main__ - Step 60682: {'lr': 0.00032986890764275174, 'samples': 11650944, 'steps': 60681, 'loss/train': 1.215666651725769} 11/07/2021 05:47:53 - INFO - __main__ - Step 60683: {'lr': 0.00032986387898304375, 'samples': 11651136, 'steps': 60682, 'loss/train': 1.0280569791793823} 11/07/2021 05:47:54 - INFO - __main__ - Step 60684: {'lr': 0.00032985885028735033, 'samples': 11651328, 'steps': 60683, 'loss/train': 1.1442331075668335} 11/07/2021 05:47:55 - INFO - __main__ - Step 60685: {'lr': 0.00032985382155567377, 'samples': 11651520, 'steps': 60684, 'loss/train': 1.6198537349700928} 11/07/2021 05:47:55 - INFO - __main__ - Step 60686: {'lr': 0.0003298487927880163, 'samples': 11651712, 'steps': 60685, 'loss/train': 1.5489678382873535} 11/07/2021 05:47:56 - INFO - __main__ - Step 60687: {'lr': 0.00032984376398438023, 'samples': 11651904, 'steps': 60686, 'loss/train': 6.284025192260742} 11/07/2021 05:47:56 - INFO - __main__ - Step 60688: {'lr': 0.00032983873514476776, 'samples': 11652096, 'steps': 60687, 'loss/train': 1.2292230129241943} 11/07/2021 05:47:56 - INFO - __main__ - Step 60689: {'lr': 0.0003298337062691812, 'samples': 11652288, 'steps': 60688, 'loss/train': 1.8296973705291748} 11/07/2021 05:47:57 - INFO - __main__ - Step 60690: {'lr': 0.00032982867735762274, 'samples': 11652480, 'steps': 60689, 'loss/train': 0.28794270753860474} 11/07/2021 05:47:58 - INFO - __main__ - Step 60691: {'lr': 0.0003298236484100948, 'samples': 11652672, 'steps': 60690, 'loss/train': 1.2104394435882568} 11/07/2021 05:47:58 - INFO - __main__ - Step 60692: {'lr': 0.00032981861942659954, 'samples': 11652864, 'steps': 60691, 'loss/train': 1.6643383502960205} 11/07/2021 05:47:59 - INFO - __main__ - Step 60693: {'lr': 0.00032981359040713923, 'samples': 11653056, 'steps': 60692, 'loss/train': 1.440932273864746} 11/07/2021 05:47:59 - INFO - __main__ - Step 60694: {'lr': 0.0003298085613517161, 'samples': 11653248, 'steps': 60693, 'loss/train': 0.9736169576644897} 11/07/2021 05:48:00 - INFO - __main__ - Step 60695: {'lr': 0.0003298035322603324, 'samples': 11653440, 'steps': 60694, 'loss/train': 1.6957916021347046} 11/07/2021 05:48:00 - INFO - __main__ - Step 60696: {'lr': 0.00032979850313299064, 'samples': 11653632, 'steps': 60695, 'loss/train': 1.6791720390319824} 11/07/2021 05:48:01 - INFO - __main__ - Step 60697: {'lr': 0.0003297934739696928, 'samples': 11653824, 'steps': 60696, 'loss/train': 1.369613528251648} 11/07/2021 05:48:01 - INFO - __main__ - Step 60698: {'lr': 0.00032978844477044136, 'samples': 11654016, 'steps': 60697, 'loss/train': 1.1992104053497314} 11/07/2021 05:48:01 - INFO - __main__ - Step 60699: {'lr': 0.0003297834155352383, 'samples': 11654208, 'steps': 60698, 'loss/train': 1.04888117313385} 11/07/2021 05:48:02 - INFO - __main__ - Step 60700: {'lr': 0.00032977838626408617, 'samples': 11654400, 'steps': 60699, 'loss/train': 1.277007818222046} 11/07/2021 05:48:03 - INFO - __main__ - Step 60701: {'lr': 0.00032977335695698714, 'samples': 11654592, 'steps': 60700, 'loss/train': 1.8611299991607666} 11/07/2021 05:48:03 - INFO - __main__ - Step 60702: {'lr': 0.00032976832761394344, 'samples': 11654784, 'steps': 60701, 'loss/train': 1.7086050510406494} 11/07/2021 05:48:03 - INFO - __main__ - Step 60703: {'lr': 0.0003297632982349573, 'samples': 11654976, 'steps': 60702, 'loss/train': 1.193851113319397} 11/07/2021 05:48:04 - INFO - __main__ - Step 60704: {'lr': 0.0003297582688200311, 'samples': 11655168, 'steps': 60703, 'loss/train': 1.5073415040969849} 11/07/2021 05:48:04 - INFO - __main__ - Step 60705: {'lr': 0.0003297532393691672, 'samples': 11655360, 'steps': 60704, 'loss/train': 1.4802409410476685} 11/07/2021 05:48:05 - INFO - __main__ - Step 60706: {'lr': 0.00032974820988236755, 'samples': 11655552, 'steps': 60705, 'loss/train': 1.2700610160827637} 11/07/2021 05:48:05 - INFO - __main__ - Step 60707: {'lr': 0.00032974318035963463, 'samples': 11655744, 'steps': 60706, 'loss/train': 1.3991612195968628} 11/07/2021 05:48:06 - INFO - __main__ - Step 60708: {'lr': 0.00032973815080097066, 'samples': 11655936, 'steps': 60707, 'loss/train': 1.2612824440002441} 11/07/2021 05:48:06 - INFO - __main__ - Step 60709: {'lr': 0.0003297331212063779, 'samples': 11656128, 'steps': 60708, 'loss/train': 1.1863231658935547} 11/07/2021 05:48:07 - INFO - __main__ - Step 60710: {'lr': 0.00032972809157585866, 'samples': 11656320, 'steps': 60709, 'loss/train': 1.657922625541687} 11/07/2021 05:48:08 - INFO - __main__ - Step 60711: {'lr': 0.0003297230619094151, 'samples': 11656512, 'steps': 60710, 'loss/train': 1.3497611284255981} 11/07/2021 05:48:08 - INFO - __main__ - Step 60712: {'lr': 0.00032971803220704964, 'samples': 11656704, 'steps': 60711, 'loss/train': 1.3468565940856934} 11/07/2021 05:48:08 - INFO - __main__ - Step 60713: {'lr': 0.00032971300246876443, 'samples': 11656896, 'steps': 60712, 'loss/train': 1.3525850772857666} 11/07/2021 05:48:09 - INFO - __main__ - Step 60714: {'lr': 0.00032970797269456177, 'samples': 11657088, 'steps': 60713, 'loss/train': 1.3074458837509155} 11/07/2021 05:48:09 - INFO - __main__ - Step 60715: {'lr': 0.00032970294288444394, 'samples': 11657280, 'steps': 60714, 'loss/train': 0.9867516756057739} 11/07/2021 05:48:10 - INFO - __main__ - Step 60716: {'lr': 0.00032969791303841316, 'samples': 11657472, 'steps': 60715, 'loss/train': 1.2417187690734863} 11/07/2021 05:48:10 - INFO - __main__ - Step 60717: {'lr': 0.00032969288315647176, 'samples': 11657664, 'steps': 60716, 'loss/train': 1.4247527122497559} 11/07/2021 05:48:11 - INFO - __main__ - Step 60718: {'lr': 0.00032968785323862207, 'samples': 11657856, 'steps': 60717, 'loss/train': 1.8826614618301392} 11/07/2021 05:48:11 - INFO - __main__ - Step 60719: {'lr': 0.0003296828232848661, 'samples': 11658048, 'steps': 60718, 'loss/train': 1.2042641639709473} 11/07/2021 05:48:11 - INFO - __main__ - Step 60720: {'lr': 0.0003296777932952064, 'samples': 11658240, 'steps': 60719, 'loss/train': 1.5452167987823486} 11/07/2021 05:48:12 - INFO - __main__ - Step 60721: {'lr': 0.000329672763269645, 'samples': 11658432, 'steps': 60720, 'loss/train': 1.541505217552185} 11/07/2021 05:48:13 - INFO - __main__ - Step 60722: {'lr': 0.00032966773320818434, 'samples': 11658624, 'steps': 60721, 'loss/train': 1.504671335220337} 11/07/2021 05:48:13 - INFO - __main__ - Step 60723: {'lr': 0.00032966270311082666, 'samples': 11658816, 'steps': 60722, 'loss/train': 1.824981927871704} 11/07/2021 05:48:13 - INFO - __main__ - Step 60724: {'lr': 0.0003296576729775741, 'samples': 11659008, 'steps': 60723, 'loss/train': 1.8621501922607422} 11/07/2021 05:48:14 - INFO - __main__ - Step 60725: {'lr': 0.00032965264280842915, 'samples': 11659200, 'steps': 60724, 'loss/train': 1.6492481231689453} 11/07/2021 05:48:15 - INFO - __main__ - Step 60726: {'lr': 0.00032964761260339387, 'samples': 11659392, 'steps': 60725, 'loss/train': 1.1449464559555054} 11/07/2021 05:48:15 - INFO - __main__ - Step 60727: {'lr': 0.00032964258236247064, 'samples': 11659584, 'steps': 60726, 'loss/train': 1.0660771131515503} 11/07/2021 05:48:16 - INFO - __main__ - Step 60728: {'lr': 0.00032963755208566167, 'samples': 11659776, 'steps': 60727, 'loss/train': 1.293587327003479} 11/07/2021 05:48:16 - INFO - __main__ - Step 60729: {'lr': 0.0003296325217729692, 'samples': 11659968, 'steps': 60728, 'loss/train': 1.539455771446228} 11/07/2021 05:48:17 - INFO - __main__ - Step 60730: {'lr': 0.0003296274914243956, 'samples': 11660160, 'steps': 60729, 'loss/train': 0.11940598487854004} 11/07/2021 05:48:18 - INFO - __main__ - Step 60731: {'lr': 0.0003296224610399431, 'samples': 11660352, 'steps': 60730, 'loss/train': 1.3583306074142456} 11/07/2021 05:48:18 - INFO - __main__ - Step 60732: {'lr': 0.00032961743061961395, 'samples': 11660544, 'steps': 60731, 'loss/train': 1.7204253673553467} 11/07/2021 05:48:18 - INFO - __main__ - Step 60733: {'lr': 0.0003296124001634104, 'samples': 11660736, 'steps': 60732, 'loss/train': 0.9321410059928894} 11/07/2021 05:48:19 - INFO - __main__ - Step 60734: {'lr': 0.0003296073696713347, 'samples': 11660928, 'steps': 60733, 'loss/train': 1.4577077627182007} 11/07/2021 05:48:19 - INFO - __main__ - Step 60735: {'lr': 0.0003296023391433892, 'samples': 11661120, 'steps': 60734, 'loss/train': 1.7334606647491455} 11/07/2021 05:48:20 - INFO - __main__ - Step 60736: {'lr': 0.00032959730857957606, 'samples': 11661312, 'steps': 60735, 'loss/train': 1.1991406679153442} 11/07/2021 05:48:20 - INFO - __main__ - Step 60737: {'lr': 0.0003295922779798976, 'samples': 11661504, 'steps': 60736, 'loss/train': 1.3567078113555908} 11/07/2021 05:48:21 - INFO - __main__ - Step 60738: {'lr': 0.00032958724734435615, 'samples': 11661696, 'steps': 60737, 'loss/train': 1.2413519620895386} 11/07/2021 05:48:21 - INFO - __main__ - Step 60739: {'lr': 0.00032958221667295386, 'samples': 11661888, 'steps': 60738, 'loss/train': 1.5157809257507324} 11/07/2021 05:48:21 - INFO - __main__ - Step 60740: {'lr': 0.0003295771859656931, 'samples': 11662080, 'steps': 60739, 'loss/train': 1.394585371017456} 11/07/2021 05:48:22 - INFO - __main__ - Step 60741: {'lr': 0.000329572155222576, 'samples': 11662272, 'steps': 60740, 'loss/train': 1.8004993200302124} 11/07/2021 05:48:23 - INFO - __main__ - Step 60742: {'lr': 0.000329567124443605, 'samples': 11662464, 'steps': 60741, 'loss/train': 1.3596118688583374} 11/07/2021 05:48:23 - INFO - __main__ - Step 60743: {'lr': 0.0003295620936287822, 'samples': 11662656, 'steps': 60742, 'loss/train': 1.3198894262313843} 11/07/2021 05:48:23 - INFO - __main__ - Step 60744: {'lr': 0.00032955706277811004, 'samples': 11662848, 'steps': 60743, 'loss/train': 1.1721166372299194} 11/07/2021 05:48:24 - INFO - __main__ - Step 60745: {'lr': 0.00032955203189159065, 'samples': 11663040, 'steps': 60744, 'loss/train': 1.3579411506652832} 11/07/2021 05:48:25 - INFO - __main__ - Step 60746: {'lr': 0.00032954700096922635, 'samples': 11663232, 'steps': 60745, 'loss/train': 1.3705973625183105} 11/07/2021 05:48:25 - INFO - __main__ - Step 60747: {'lr': 0.00032954197001101935, 'samples': 11663424, 'steps': 60746, 'loss/train': 1.167642593383789} 11/07/2021 05:48:26 - INFO - __main__ - Step 60748: {'lr': 0.000329536939016972, 'samples': 11663616, 'steps': 60747, 'loss/train': 1.3932781219482422} 11/07/2021 05:48:26 - INFO - __main__ - Step 60749: {'lr': 0.0003295319079870866, 'samples': 11663808, 'steps': 60748, 'loss/train': 1.272137999534607} 11/07/2021 05:48:26 - INFO - __main__ - Step 60750: {'lr': 0.0003295268769213653, 'samples': 11664000, 'steps': 60749, 'loss/train': 1.538712501525879} 11/07/2021 05:48:27 - INFO - __main__ - Step 60751: {'lr': 0.0003295218458198104, 'samples': 11664192, 'steps': 60750, 'loss/train': 1.065285325050354} 11/07/2021 05:48:28 - INFO - __main__ - Step 60752: {'lr': 0.00032951681468242424, 'samples': 11664384, 'steps': 60751, 'loss/train': 1.5054277181625366} 11/07/2021 05:48:28 - INFO - __main__ - Step 60753: {'lr': 0.00032951178350920895, 'samples': 11664576, 'steps': 60752, 'loss/train': 0.9820560216903687} 11/07/2021 05:48:28 - INFO - __main__ - Step 60754: {'lr': 0.0003295067523001669, 'samples': 11664768, 'steps': 60753, 'loss/train': 1.215604543685913} 11/07/2021 05:48:29 - INFO - __main__ - Step 60755: {'lr': 0.0003295017210553003, 'samples': 11664960, 'steps': 60754, 'loss/train': 1.103384256362915} 11/07/2021 05:48:29 - INFO - __main__ - Step 60756: {'lr': 0.0003294966897746115, 'samples': 11665152, 'steps': 60755, 'loss/train': 1.1796678304672241} 11/07/2021 05:48:30 - INFO - __main__ - Step 60757: {'lr': 0.0003294916584581027, 'samples': 11665344, 'steps': 60756, 'loss/train': 1.6232643127441406} 11/07/2021 05:48:30 - INFO - __main__ - Step 60758: {'lr': 0.00032948662710577625, 'samples': 11665536, 'steps': 60757, 'loss/train': 2.174596071243286} 11/07/2021 05:48:31 - INFO - __main__ - Step 60759: {'lr': 0.0003294815957176343, 'samples': 11665728, 'steps': 60758, 'loss/train': 1.3142354488372803} 11/07/2021 05:48:31 - INFO - __main__ - Step 60760: {'lr': 0.00032947656429367915, 'samples': 11665920, 'steps': 60759, 'loss/train': 1.1102142333984375} 11/07/2021 05:48:32 - INFO - __main__ - Step 60761: {'lr': 0.00032947153283391313, 'samples': 11666112, 'steps': 60760, 'loss/train': 1.7375808954238892} 11/07/2021 05:48:33 - INFO - __main__ - Step 60762: {'lr': 0.00032946650133833846, 'samples': 11666304, 'steps': 60761, 'loss/train': 0.5720475912094116} 11/07/2021 05:48:33 - INFO - __main__ - Step 60763: {'lr': 0.00032946146980695736, 'samples': 11666496, 'steps': 60762, 'loss/train': 0.8699394464492798} 11/07/2021 05:48:33 - INFO - __main__ - Step 60764: {'lr': 0.00032945643823977216, 'samples': 11666688, 'steps': 60763, 'loss/train': 1.1901174783706665} 11/07/2021 05:48:34 - INFO - __main__ - Step 60765: {'lr': 0.0003294514066367852, 'samples': 11666880, 'steps': 60764, 'loss/train': 1.4037927389144897} 11/07/2021 05:48:34 - INFO - __main__ - Step 60766: {'lr': 0.0003294463749979986, 'samples': 11667072, 'steps': 60765, 'loss/train': 1.6341971158981323} 11/07/2021 05:48:36 - INFO - __main__ - Step 60767: {'lr': 0.00032944134332341465, 'samples': 11667264, 'steps': 60766, 'loss/train': 1.599009394645691} 11/07/2021 05:48:36 - INFO - __main__ - Step 60768: {'lr': 0.0003294363116130357, 'samples': 11667456, 'steps': 60767, 'loss/train': 1.398998498916626} 11/07/2021 05:48:36 - INFO - __main__ - Step 60769: {'lr': 0.00032943127986686393, 'samples': 11667648, 'steps': 60768, 'loss/train': 0.4687707722187042} 11/07/2021 05:48:37 - INFO - __main__ - Step 60770: {'lr': 0.0003294262480849017, 'samples': 11667840, 'steps': 60769, 'loss/train': 1.1392436027526855} 11/07/2021 05:48:37 - INFO - __main__ - Step 60771: {'lr': 0.0003294212162671512, 'samples': 11668032, 'steps': 60770, 'loss/train': 1.4771801233291626} 11/07/2021 05:48:38 - INFO - __main__ - Step 60772: {'lr': 0.00032941618441361477, 'samples': 11668224, 'steps': 60771, 'loss/train': 0.9016726016998291} 11/07/2021 05:48:38 - INFO - __main__ - Step 60773: {'lr': 0.0003294111525242946, 'samples': 11668416, 'steps': 60772, 'loss/train': 1.062914490699768} 11/07/2021 05:48:39 - INFO - __main__ - Step 60774: {'lr': 0.000329406120599193, 'samples': 11668608, 'steps': 60773, 'loss/train': 1.1237372159957886} 11/07/2021 05:48:39 - INFO - __main__ - Step 60775: {'lr': 0.0003294010886383122, 'samples': 11668800, 'steps': 60774, 'loss/train': 1.2759335041046143} 11/07/2021 05:48:39 - INFO - __main__ - Step 60776: {'lr': 0.0003293960566416545, 'samples': 11668992, 'steps': 60775, 'loss/train': 1.1923125982284546} 11/07/2021 05:48:40 - INFO - __main__ - Step 60777: {'lr': 0.00032939102460922227, 'samples': 11669184, 'steps': 60776, 'loss/train': 1.4919078350067139} 11/07/2021 05:48:41 - INFO - __main__ - Step 60778: {'lr': 0.00032938599254101755, 'samples': 11669376, 'steps': 60777, 'loss/train': 1.7334095239639282} 11/07/2021 05:48:41 - INFO - __main__ - Step 60779: {'lr': 0.0003293809604370427, 'samples': 11669568, 'steps': 60778, 'loss/train': 1.3436882495880127} 11/07/2021 05:48:42 - INFO - __main__ - Step 60780: {'lr': 0.0003293759282973001, 'samples': 11669760, 'steps': 60779, 'loss/train': 5.806026458740234} 11/07/2021 05:48:42 - INFO - __main__ - Step 60781: {'lr': 0.0003293708961217919, 'samples': 11669952, 'steps': 60780, 'loss/train': 1.7960362434387207} 11/07/2021 05:48:43 - INFO - __main__ - Step 60782: {'lr': 0.00032936586391052035, 'samples': 11670144, 'steps': 60781, 'loss/train': 1.141554594039917} 11/07/2021 05:48:43 - INFO - __main__ - Step 60783: {'lr': 0.0003293608316634879, 'samples': 11670336, 'steps': 60782, 'loss/train': 1.095795750617981} 11/07/2021 05:48:44 - INFO - __main__ - Step 60784: {'lr': 0.0003293557993806966, 'samples': 11670528, 'steps': 60783, 'loss/train': 1.5137826204299927} 11/07/2021 05:48:44 - INFO - __main__ - Step 60785: {'lr': 0.0003293507670621488, 'samples': 11670720, 'steps': 60784, 'loss/train': 1.2012248039245605} 11/07/2021 05:48:44 - INFO - __main__ - Step 60786: {'lr': 0.00032934573470784674, 'samples': 11670912, 'steps': 60785, 'loss/train': 1.7050749063491821} 11/07/2021 05:48:45 - INFO - __main__ - Step 60787: {'lr': 0.00032934070231779275, 'samples': 11671104, 'steps': 60786, 'loss/train': 1.384711503982544} 11/07/2021 05:48:46 - INFO - __main__ - Step 60788: {'lr': 0.0003293356698919891, 'samples': 11671296, 'steps': 60787, 'loss/train': 1.21917724609375} 11/07/2021 05:48:46 - INFO - __main__ - Step 60789: {'lr': 0.000329330637430438, 'samples': 11671488, 'steps': 60788, 'loss/train': 1.6019275188446045} 11/07/2021 05:48:47 - INFO - __main__ - Step 60790: {'lr': 0.00032932560493314166, 'samples': 11671680, 'steps': 60789, 'loss/train': 4.096528053283691} 11/07/2021 05:48:47 - INFO - __main__ - Step 60791: {'lr': 0.0003293205724001025, 'samples': 11671872, 'steps': 60790, 'loss/train': 1.6543517112731934} 11/07/2021 05:48:47 - INFO - __main__ - Step 60792: {'lr': 0.00032931553983132266, 'samples': 11672064, 'steps': 60791, 'loss/train': 1.0663360357284546} 11/07/2021 05:48:48 - INFO - __main__ - Step 60793: {'lr': 0.00032931050722680453, 'samples': 11672256, 'steps': 60792, 'loss/train': 1.2937300205230713} 11/07/2021 05:48:49 - INFO - __main__ - Step 60794: {'lr': 0.00032930547458655035, 'samples': 11672448, 'steps': 60793, 'loss/train': 1.565666913986206} 11/07/2021 05:48:49 - INFO - __main__ - Step 60795: {'lr': 0.00032930044191056227, 'samples': 11672640, 'steps': 60794, 'loss/train': 1.1504673957824707} 11/07/2021 05:48:49 - INFO - __main__ - Step 60796: {'lr': 0.0003292954091988426, 'samples': 11672832, 'steps': 60795, 'loss/train': 1.1843305826187134} 11/07/2021 05:48:50 - INFO - __main__ - Step 60797: {'lr': 0.0003292903764513937, 'samples': 11673024, 'steps': 60796, 'loss/train': 2.790372848510742} 11/07/2021 05:48:50 - INFO - __main__ - Step 60798: {'lr': 0.0003292853436682177, 'samples': 11673216, 'steps': 60797, 'loss/train': 1.4339756965637207} 11/07/2021 05:48:51 - INFO - __main__ - Step 60799: {'lr': 0.0003292803108493171, 'samples': 11673408, 'steps': 60798, 'loss/train': 1.4392610788345337} 11/07/2021 05:48:52 - INFO - __main__ - Step 60800: {'lr': 0.0003292752779946939, 'samples': 11673600, 'steps': 60799, 'loss/train': 1.2790781259536743} 11/07/2021 05:48:52 - INFO - __main__ - Step 60801: {'lr': 0.00032927024510435055, 'samples': 11673792, 'steps': 60800, 'loss/train': 1.354580283164978} 11/07/2021 05:48:52 - INFO - __main__ - Step 60802: {'lr': 0.0003292652121782892, 'samples': 11673984, 'steps': 60801, 'loss/train': 1.2318165302276611} 11/07/2021 05:48:53 - INFO - __main__ - Step 60803: {'lr': 0.0003292601792165122, 'samples': 11674176, 'steps': 60802, 'loss/train': 1.8341155052185059} 11/07/2021 05:48:54 - INFO - __main__ - Step 60804: {'lr': 0.00032925514621902173, 'samples': 11674368, 'steps': 60803, 'loss/train': 1.482343316078186} 11/07/2021 05:48:54 - INFO - __main__ - Step 60805: {'lr': 0.0003292501131858201, 'samples': 11674560, 'steps': 60804, 'loss/train': 0.5775644183158875} 11/07/2021 05:48:54 - INFO - __main__ - Step 60806: {'lr': 0.0003292450801169097, 'samples': 11674752, 'steps': 60805, 'loss/train': 1.343030571937561} 11/07/2021 05:48:55 - INFO - __main__ - Step 60807: {'lr': 0.00032924004701229267, 'samples': 11674944, 'steps': 60806, 'loss/train': 1.3336561918258667} 11/07/2021 05:48:55 - INFO - __main__ - Step 60808: {'lr': 0.00032923501387197127, 'samples': 11675136, 'steps': 60807, 'loss/train': 1.3868985176086426} 11/07/2021 05:48:56 - INFO - __main__ - Step 60809: {'lr': 0.00032922998069594774, 'samples': 11675328, 'steps': 60808, 'loss/train': 1.567786455154419} 11/07/2021 05:48:56 - INFO - __main__ - Step 60810: {'lr': 0.0003292249474842244, 'samples': 11675520, 'steps': 60809, 'loss/train': 1.3836430311203003} 11/07/2021 05:48:57 - INFO - __main__ - Step 60811: {'lr': 0.00032921991423680356, 'samples': 11675712, 'steps': 60810, 'loss/train': 1.2382856607437134} 11/07/2021 05:48:57 - INFO - __main__ - Step 60812: {'lr': 0.0003292148809536876, 'samples': 11675904, 'steps': 60811, 'loss/train': 1.5415695905685425} 11/07/2021 05:48:57 - INFO - __main__ - Step 60813: {'lr': 0.0003292098476348784, 'samples': 11676096, 'steps': 60812, 'loss/train': 1.2859939336776733} 11/07/2021 05:48:59 - INFO - __main__ - Step 60814: {'lr': 0.00032920481428037857, 'samples': 11676288, 'steps': 60813, 'loss/train': 1.8356401920318604} 11/07/2021 05:48:59 - INFO - __main__ - Step 60815: {'lr': 0.00032919978089019026, 'samples': 11676480, 'steps': 60814, 'loss/train': 1.531577229499817} 11/07/2021 05:48:59 - INFO - __main__ - Step 60816: {'lr': 0.00032919474746431575, 'samples': 11676672, 'steps': 60815, 'loss/train': 1.6791971921920776} 11/07/2021 05:49:00 - INFO - __main__ - Step 60817: {'lr': 0.00032918971400275733, 'samples': 11676864, 'steps': 60816, 'loss/train': 1.2713189125061035} 11/07/2021 05:49:00 - INFO - __main__ - Step 60818: {'lr': 0.0003291846805055172, 'samples': 11677056, 'steps': 60817, 'loss/train': 1.5244804620742798} 11/07/2021 05:49:01 - INFO - __main__ - Step 60819: {'lr': 0.0003291796469725977, 'samples': 11677248, 'steps': 60818, 'loss/train': 1.200952410697937} 11/07/2021 05:49:01 - INFO - __main__ - Step 60820: {'lr': 0.0003291746134040011, 'samples': 11677440, 'steps': 60819, 'loss/train': 1.237603783607483} 11/07/2021 05:49:02 - INFO - __main__ - Step 60821: {'lr': 0.00032916957979972964, 'samples': 11677632, 'steps': 60820, 'loss/train': 1.0285141468048096} 11/07/2021 05:49:02 - INFO - __main__ - Step 60822: {'lr': 0.00032916454615978554, 'samples': 11677824, 'steps': 60821, 'loss/train': 1.2525105476379395} 11/07/2021 05:49:02 - INFO - __main__ - Step 60823: {'lr': 0.00032915951248417113, 'samples': 11678016, 'steps': 60822, 'loss/train': 1.5177724361419678} 11/07/2021 05:49:03 - INFO - __main__ - Step 60824: {'lr': 0.0003291544787728887, 'samples': 11678208, 'steps': 60823, 'loss/train': 1.6468875408172607} 11/07/2021 05:49:04 - INFO - __main__ - Step 60825: {'lr': 0.00032914944502594046, 'samples': 11678400, 'steps': 60824, 'loss/train': 1.3263120651245117} 11/07/2021 05:49:04 - INFO - __main__ - Step 60826: {'lr': 0.00032914441124332874, 'samples': 11678592, 'steps': 60825, 'loss/train': 1.786871075630188} 11/07/2021 05:49:05 - INFO - __main__ - Step 60827: {'lr': 0.0003291393774250557, 'samples': 11678784, 'steps': 60826, 'loss/train': 1.3893988132476807} 11/07/2021 05:49:05 - INFO - __main__ - Step 60828: {'lr': 0.0003291343435711237, 'samples': 11678976, 'steps': 60827, 'loss/train': 1.1714072227478027} 11/07/2021 05:49:05 - INFO - __main__ - Step 60829: {'lr': 0.000329129309681535, 'samples': 11679168, 'steps': 60828, 'loss/train': 1.5040473937988281} 11/07/2021 05:49:07 - INFO - __main__ - Step 60830: {'lr': 0.0003291242757562919, 'samples': 11679360, 'steps': 60829, 'loss/train': 1.3606387376785278} 11/07/2021 05:49:07 - INFO - __main__ - Step 60831: {'lr': 0.00032911924179539653, 'samples': 11679552, 'steps': 60830, 'loss/train': 1.2878488302230835} 11/07/2021 05:49:08 - INFO - __main__ - Step 60832: {'lr': 0.00032911420779885135, 'samples': 11679744, 'steps': 60831, 'loss/train': 1.34402334690094} 11/07/2021 05:49:08 - INFO - __main__ - Step 60833: {'lr': 0.00032910917376665846, 'samples': 11679936, 'steps': 60832, 'loss/train': 0.10699727386236191} 11/07/2021 05:49:09 - INFO - __main__ - Step 60834: {'lr': 0.0003291041396988202, 'samples': 11680128, 'steps': 60833, 'loss/train': 1.4673844575881958} 11/07/2021 05:49:09 - INFO - __main__ - Step 60835: {'lr': 0.00032909910559533886, 'samples': 11680320, 'steps': 60834, 'loss/train': 1.543282389640808} 11/07/2021 05:49:09 - INFO - __main__ - Step 60836: {'lr': 0.00032909407145621664, 'samples': 11680512, 'steps': 60835, 'loss/train': 1.5532816648483276} 11/07/2021 05:49:10 - INFO - __main__ - Step 60837: {'lr': 0.0003290890372814559, 'samples': 11680704, 'steps': 60836, 'loss/train': 1.6544691324234009} 11/07/2021 05:49:11 - INFO - __main__ - Step 60838: {'lr': 0.0003290840030710588, 'samples': 11680896, 'steps': 60837, 'loss/train': 1.170867681503296} 11/07/2021 05:49:11 - INFO - __main__ - Step 60839: {'lr': 0.00032907896882502775, 'samples': 11681088, 'steps': 60838, 'loss/train': 1.418349266052246} 11/07/2021 05:49:11 - INFO - __main__ - Step 60840: {'lr': 0.00032907393454336493, 'samples': 11681280, 'steps': 60839, 'loss/train': 1.5169421434402466} 11/07/2021 05:49:12 - INFO - __main__ - Step 60841: {'lr': 0.0003290689002260726, 'samples': 11681472, 'steps': 60840, 'loss/train': 0.8501607179641724} 11/07/2021 05:49:13 - INFO - __main__ - Step 60842: {'lr': 0.00032906386587315295, 'samples': 11681664, 'steps': 60841, 'loss/train': 1.3683832883834839} 11/07/2021 05:49:13 - INFO - __main__ - Step 60843: {'lr': 0.00032905883148460845, 'samples': 11681856, 'steps': 60842, 'loss/train': 0.9722559452056885} 11/07/2021 05:49:13 - INFO - __main__ - Step 60844: {'lr': 0.0003290537970604412, 'samples': 11682048, 'steps': 60843, 'loss/train': 0.8485066294670105} 11/07/2021 05:49:14 - INFO - __main__ - Step 60845: {'lr': 0.00032904876260065355, 'samples': 11682240, 'steps': 60844, 'loss/train': 1.705854892730713} 11/07/2021 05:49:14 - INFO - __main__ - Step 60846: {'lr': 0.0003290437281052478, 'samples': 11682432, 'steps': 60845, 'loss/train': 1.2524421215057373} 11/07/2021 05:49:15 - INFO - __main__ - Step 60847: {'lr': 0.00032903869357422613, 'samples': 11682624, 'steps': 60846, 'loss/train': 1.145171046257019} 11/07/2021 05:49:15 - INFO - __main__ - Step 60848: {'lr': 0.0003290336590075908, 'samples': 11682816, 'steps': 60847, 'loss/train': 1.4955649375915527} 11/07/2021 05:49:16 - INFO - __main__ - Step 60849: {'lr': 0.00032902862440534414, 'samples': 11683008, 'steps': 60848, 'loss/train': 1.524688959121704} 11/07/2021 05:49:16 - INFO - __main__ - Step 60850: {'lr': 0.00032902358976748844, 'samples': 11683200, 'steps': 60849, 'loss/train': 1.5933032035827637} 11/07/2021 05:49:16 - INFO - __main__ - Step 60851: {'lr': 0.0003290185550940259, 'samples': 11683392, 'steps': 60850, 'loss/train': 1.5746548175811768} 11/07/2021 05:49:18 - INFO - __main__ - Step 60852: {'lr': 0.0003290135203849588, 'samples': 11683584, 'steps': 60851, 'loss/train': 1.497358798980713} 11/07/2021 05:49:18 - INFO - __main__ - Step 60853: {'lr': 0.00032900848564028953, 'samples': 11683776, 'steps': 60852, 'loss/train': 1.3079599142074585} 11/07/2021 05:49:18 - INFO - __main__ - Step 60854: {'lr': 0.00032900345086002013, 'samples': 11683968, 'steps': 60853, 'loss/train': 1.663913369178772} 11/07/2021 05:49:19 - INFO - __main__ - Step 60855: {'lr': 0.00032899841604415306, 'samples': 11684160, 'steps': 60854, 'loss/train': 1.504226565361023} 11/07/2021 05:49:19 - INFO - __main__ - Step 60856: {'lr': 0.0003289933811926905, 'samples': 11684352, 'steps': 60855, 'loss/train': 1.3720711469650269} 11/07/2021 05:49:20 - INFO - __main__ - Step 60857: {'lr': 0.0003289883463056347, 'samples': 11684544, 'steps': 60856, 'loss/train': 2.003519296646118} 11/07/2021 05:49:20 - INFO - __main__ - Step 60858: {'lr': 0.0003289833113829881, 'samples': 11684736, 'steps': 60857, 'loss/train': 1.2619646787643433} 11/07/2021 05:49:21 - INFO - __main__ - Step 60859: {'lr': 0.0003289782764247528, 'samples': 11684928, 'steps': 60858, 'loss/train': 1.2730093002319336} 11/07/2021 05:49:21 - INFO - __main__ - Step 60860: {'lr': 0.000328973241430931, 'samples': 11685120, 'steps': 60859, 'loss/train': 1.4986016750335693} 11/07/2021 05:49:21 - INFO - __main__ - Step 60861: {'lr': 0.0003289682064015251, 'samples': 11685312, 'steps': 60860, 'loss/train': 1.258947491645813} 11/07/2021 05:49:22 - INFO - __main__ - Step 60862: {'lr': 0.0003289631713365374, 'samples': 11685504, 'steps': 60861, 'loss/train': 1.988419771194458} 11/07/2021 05:49:23 - INFO - __main__ - Step 60863: {'lr': 0.00032895813623597017, 'samples': 11685696, 'steps': 60862, 'loss/train': 1.4329723119735718} 11/07/2021 05:49:23 - INFO - __main__ - Step 60864: {'lr': 0.0003289531010998255, 'samples': 11685888, 'steps': 60863, 'loss/train': 1.4903864860534668} 11/07/2021 05:49:24 - INFO - __main__ - Step 60865: {'lr': 0.0003289480659281058, 'samples': 11686080, 'steps': 60864, 'loss/train': 1.7138718366622925} 11/07/2021 05:49:24 - INFO - __main__ - Step 60866: {'lr': 0.0003289430307208134, 'samples': 11686272, 'steps': 60865, 'loss/train': 5.734623432159424} 11/07/2021 05:49:24 - INFO - __main__ - Step 60867: {'lr': 0.00032893799547795046, 'samples': 11686464, 'steps': 60866, 'loss/train': 1.3461408615112305} 11/07/2021 05:49:25 - INFO - __main__ - Step 60868: {'lr': 0.0003289329601995192, 'samples': 11686656, 'steps': 60867, 'loss/train': 1.3386622667312622} 11/07/2021 05:49:26 - INFO - __main__ - Step 60869: {'lr': 0.00032892792488552203, 'samples': 11686848, 'steps': 60868, 'loss/train': 1.6153637170791626} 11/07/2021 05:49:26 - INFO - __main__ - Step 60870: {'lr': 0.00032892288953596116, 'samples': 11687040, 'steps': 60869, 'loss/train': 0.29348981380462646} 11/07/2021 05:49:26 - INFO - __main__ - Step 60871: {'lr': 0.00032891785415083884, 'samples': 11687232, 'steps': 60870, 'loss/train': 1.676207184791565} 11/07/2021 05:49:27 - INFO - __main__ - Step 60872: {'lr': 0.00032891281873015734, 'samples': 11687424, 'steps': 60871, 'loss/train': 1.318395972251892} 11/07/2021 05:49:27 - INFO - __main__ - Step 60873: {'lr': 0.000328907783273919, 'samples': 11687616, 'steps': 60872, 'loss/train': 1.820590615272522} 11/07/2021 05:49:28 - INFO - __main__ - Step 60874: {'lr': 0.000328902747782126, 'samples': 11687808, 'steps': 60873, 'loss/train': 1.2372708320617676} 11/07/2021 05:49:28 - INFO - __main__ - Step 60875: {'lr': 0.0003288977122547806, 'samples': 11688000, 'steps': 60874, 'loss/train': 1.4369685649871826} 11/07/2021 05:49:29 - INFO - __main__ - Step 60876: {'lr': 0.00032889267669188515, 'samples': 11688192, 'steps': 60875, 'loss/train': 1.2627582550048828} 11/07/2021 05:49:29 - INFO - __main__ - Step 60877: {'lr': 0.0003288876410934418, 'samples': 11688384, 'steps': 60876, 'loss/train': 1.730799674987793} 11/07/2021 05:49:30 - INFO - __main__ - Step 60878: {'lr': 0.000328882605459453, 'samples': 11688576, 'steps': 60877, 'loss/train': 1.7355254888534546} 11/07/2021 05:49:31 - INFO - __main__ - Step 60879: {'lr': 0.0003288775697899209, 'samples': 11688768, 'steps': 60878, 'loss/train': 1.9340633153915405} 11/07/2021 05:49:31 - INFO - __main__ - Step 60880: {'lr': 0.00032887253408484776, 'samples': 11688960, 'steps': 60879, 'loss/train': 2.0717720985412598} 11/07/2021 05:49:32 - INFO - __main__ - Step 60881: {'lr': 0.0003288674983442358, 'samples': 11689152, 'steps': 60880, 'loss/train': 1.6929211616516113} 11/07/2021 05:49:32 - INFO - __main__ - Step 60882: {'lr': 0.0003288624625680875, 'samples': 11689344, 'steps': 60881, 'loss/train': 0.575290858745575} 11/07/2021 05:49:32 - INFO - __main__ - Step 60883: {'lr': 0.0003288574267564049, 'samples': 11689536, 'steps': 60882, 'loss/train': 1.4447029829025269} 11/07/2021 05:49:33 - INFO - __main__ - Step 60884: {'lr': 0.0003288523909091904, 'samples': 11689728, 'steps': 60883, 'loss/train': 1.5660972595214844} 11/07/2021 05:49:34 - INFO - __main__ - Step 60885: {'lr': 0.0003288473550264462, 'samples': 11689920, 'steps': 60884, 'loss/train': 1.4460694789886475} 11/07/2021 05:49:34 - INFO - __main__ - Step 60886: {'lr': 0.00032884231910817465, 'samples': 11690112, 'steps': 60885, 'loss/train': 1.607729434967041} 11/07/2021 05:49:34 - INFO - __main__ - Step 60887: {'lr': 0.0003288372831543779, 'samples': 11690304, 'steps': 60886, 'loss/train': 1.566941738128662} 11/07/2021 05:49:35 - INFO - __main__ - Step 60888: {'lr': 0.0003288322471650583, 'samples': 11690496, 'steps': 60887, 'loss/train': 1.2857362031936646} 11/07/2021 05:49:36 - INFO - __main__ - Step 60889: {'lr': 0.0003288272111402181, 'samples': 11690688, 'steps': 60888, 'loss/train': 1.3125181198120117} 11/07/2021 05:49:36 - INFO - __main__ - Step 60890: {'lr': 0.0003288221750798596, 'samples': 11690880, 'steps': 60889, 'loss/train': 0.8595288395881653} 11/07/2021 05:49:36 - INFO - __main__ - Step 60891: {'lr': 0.0003288171389839851, 'samples': 11691072, 'steps': 60890, 'loss/train': 1.5488319396972656} 11/07/2021 05:49:37 - INFO - __main__ - Step 60892: {'lr': 0.0003288121028525967, 'samples': 11691264, 'steps': 60891, 'loss/train': 1.348591923713684} 11/07/2021 05:49:37 - INFO - __main__ - Step 60893: {'lr': 0.0003288070666856969, 'samples': 11691456, 'steps': 60892, 'loss/train': 1.237944483757019} 11/07/2021 05:49:38 - INFO - __main__ - Step 60894: {'lr': 0.00032880203048328777, 'samples': 11691648, 'steps': 60893, 'loss/train': 2.102830171585083} 11/07/2021 05:49:39 - INFO - __main__ - Step 60895: {'lr': 0.0003287969942453717, 'samples': 11691840, 'steps': 60894, 'loss/train': 1.4096628427505493} 11/07/2021 05:49:39 - INFO - __main__ - Step 60896: {'lr': 0.0003287919579719509, 'samples': 11692032, 'steps': 60895, 'loss/train': 1.3991302251815796} 11/07/2021 05:49:39 - INFO - __main__ - Step 60897: {'lr': 0.00032878692166302766, 'samples': 11692224, 'steps': 60896, 'loss/train': 1.9141570329666138} 11/07/2021 05:49:40 - INFO - __main__ - Step 60898: {'lr': 0.0003287818853186042, 'samples': 11692416, 'steps': 60897, 'loss/train': 0.920104444026947} 11/07/2021 05:49:40 - INFO - __main__ - Step 60899: {'lr': 0.0003287768489386829, 'samples': 11692608, 'steps': 60898, 'loss/train': 1.5053181648254395} 11/07/2021 05:49:41 - INFO - __main__ - Step 60900: {'lr': 0.000328771812523266, 'samples': 11692800, 'steps': 60899, 'loss/train': 1.4130250215530396} 11/07/2021 05:49:41 - INFO - __main__ - Step 60901: {'lr': 0.00032876677607235566, 'samples': 11692992, 'steps': 60900, 'loss/train': 1.0249323844909668} 11/07/2021 05:49:42 - INFO - __main__ - Step 60902: {'lr': 0.0003287617395859543, 'samples': 11693184, 'steps': 60901, 'loss/train': 1.9679993391036987} 11/07/2021 05:49:42 - INFO - __main__ - Step 60903: {'lr': 0.00032875670306406403, 'samples': 11693376, 'steps': 60902, 'loss/train': 1.4490913152694702} 11/07/2021 05:49:42 - INFO - __main__ - Step 60904: {'lr': 0.00032875166650668725, 'samples': 11693568, 'steps': 60903, 'loss/train': 1.246215581893921} 11/07/2021 05:49:44 - INFO - __main__ - Step 60905: {'lr': 0.0003287466299138262, 'samples': 11693760, 'steps': 60904, 'loss/train': 1.7815371751785278} 11/07/2021 05:49:44 - INFO - __main__ - Step 60906: {'lr': 0.00032874159328548315, 'samples': 11693952, 'steps': 60905, 'loss/train': 1.7777659893035889} 11/07/2021 05:49:44 - INFO - __main__ - Step 60907: {'lr': 0.0003287365566216603, 'samples': 11694144, 'steps': 60906, 'loss/train': 1.575698971748352} 11/07/2021 05:49:45 - INFO - __main__ - Step 60908: {'lr': 0.00032873151992236, 'samples': 11694336, 'steps': 60907, 'loss/train': 1.436080813407898} 11/07/2021 05:49:45 - INFO - __main__ - Step 60909: {'lr': 0.00032872648318758445, 'samples': 11694528, 'steps': 60908, 'loss/train': 0.9839154481887817} 11/07/2021 05:49:46 - INFO - __main__ - Step 60910: {'lr': 0.000328721446417336, 'samples': 11694720, 'steps': 60909, 'loss/train': 1.5517046451568604} 11/07/2021 05:49:46 - INFO - __main__ - Step 60911: {'lr': 0.00032871640961161687, 'samples': 11694912, 'steps': 60910, 'loss/train': 1.1343740224838257} 11/07/2021 05:49:47 - INFO - __main__ - Step 60912: {'lr': 0.0003287113727704294, 'samples': 11695104, 'steps': 60911, 'loss/train': 1.0799760818481445} 11/07/2021 05:49:47 - INFO - __main__ - Step 60913: {'lr': 0.00032870633589377575, 'samples': 11695296, 'steps': 60912, 'loss/train': 1.3780189752578735} 11/07/2021 05:49:47 - INFO - __main__ - Step 60914: {'lr': 0.00032870129898165826, 'samples': 11695488, 'steps': 60913, 'loss/train': 0.9872614145278931} 11/07/2021 05:49:48 - INFO - __main__ - Step 60915: {'lr': 0.00032869626203407907, 'samples': 11695680, 'steps': 60914, 'loss/train': 1.6188406944274902} 11/07/2021 05:49:49 - INFO - __main__ - Step 60916: {'lr': 0.00032869122505104067, 'samples': 11695872, 'steps': 60915, 'loss/train': 1.3201267719268799} 11/07/2021 05:49:49 - INFO - __main__ - Step 60917: {'lr': 0.0003286861880325452, 'samples': 11696064, 'steps': 60916, 'loss/train': 1.1589133739471436} 11/07/2021 05:49:49 - INFO - __main__ - Step 60918: {'lr': 0.00032868115097859496, 'samples': 11696256, 'steps': 60917, 'loss/train': 1.3187493085861206} 11/07/2021 05:49:50 - INFO - __main__ - Step 60919: {'lr': 0.00032867611388919215, 'samples': 11696448, 'steps': 60918, 'loss/train': 0.9139127135276794} 11/07/2021 05:49:51 - INFO - __main__ - Step 60920: {'lr': 0.0003286710767643392, 'samples': 11696640, 'steps': 60919, 'loss/train': 1.4804084300994873} 11/07/2021 05:49:51 - INFO - __main__ - Step 60921: {'lr': 0.0003286660396040382, 'samples': 11696832, 'steps': 60920, 'loss/train': 1.5790210962295532} 11/07/2021 05:49:51 - INFO - __main__ - Step 60922: {'lr': 0.0003286610024082915, 'samples': 11697024, 'steps': 60921, 'loss/train': 1.5043166875839233} 11/07/2021 05:49:52 - INFO - __main__ - Step 60923: {'lr': 0.0003286559651771014, 'samples': 11697216, 'steps': 60922, 'loss/train': 1.1254929304122925} 11/07/2021 05:49:52 - INFO - __main__ - Step 60924: {'lr': 0.00032865092791047013, 'samples': 11697408, 'steps': 60923, 'loss/train': 1.5920418500900269} 11/07/2021 05:49:53 - INFO - __main__ - Step 60925: {'lr': 0.0003286458906083999, 'samples': 11697600, 'steps': 60924, 'loss/train': 1.523051381111145} 11/07/2021 05:49:54 - INFO - __main__ - Step 60926: {'lr': 0.0003286408532708931, 'samples': 11697792, 'steps': 60925, 'loss/train': 1.314002275466919} 11/07/2021 05:49:54 - INFO - __main__ - Step 60927: {'lr': 0.00032863581589795193, 'samples': 11697984, 'steps': 60926, 'loss/train': 1.8102748394012451} 11/07/2021 05:49:54 - INFO - __main__ - Step 60928: {'lr': 0.00032863077848957874, 'samples': 11698176, 'steps': 60927, 'loss/train': 1.5697373151779175} 11/07/2021 05:49:55 - INFO - __main__ - Step 60929: {'lr': 0.00032862574104577567, 'samples': 11698368, 'steps': 60928, 'loss/train': 1.4349370002746582} 11/07/2021 05:49:55 - INFO - __main__ - Step 60930: {'lr': 0.00032862070356654504, 'samples': 11698560, 'steps': 60929, 'loss/train': 1.5551376342773438} 11/07/2021 05:49:56 - INFO - __main__ - Step 60931: {'lr': 0.00032861566605188914, 'samples': 11698752, 'steps': 60930, 'loss/train': 1.466261863708496} 11/07/2021 05:49:56 - INFO - __main__ - Step 60932: {'lr': 0.00032861062850181023, 'samples': 11698944, 'steps': 60931, 'loss/train': 1.1526209115982056} 11/07/2021 05:49:57 - INFO - __main__ - Step 60933: {'lr': 0.00032860559091631066, 'samples': 11699136, 'steps': 60932, 'loss/train': 1.5189297199249268} 11/07/2021 05:49:57 - INFO - __main__ - Step 60934: {'lr': 0.0003286005532953926, 'samples': 11699328, 'steps': 60933, 'loss/train': 1.3256059885025024} 11/07/2021 05:49:57 - INFO - __main__ - Step 60935: {'lr': 0.00032859551563905825, 'samples': 11699520, 'steps': 60934, 'loss/train': 1.5294737815856934} 11/07/2021 05:49:58 - INFO - __main__ - Step 60936: {'lr': 0.00032859047794731, 'samples': 11699712, 'steps': 60935, 'loss/train': 1.5895602703094482} 11/07/2021 05:49:59 - INFO - __main__ - Step 60937: {'lr': 0.00032858544022015015, 'samples': 11699904, 'steps': 60936, 'loss/train': 1.127139925956726} 11/07/2021 05:49:59 - INFO - __main__ - Step 60938: {'lr': 0.0003285804024575809, 'samples': 11700096, 'steps': 60937, 'loss/train': 1.7358180284500122} 11/07/2021 05:49:59 - INFO - __main__ - Step 60939: {'lr': 0.0003285753646596045, 'samples': 11700288, 'steps': 60938, 'loss/train': 1.3762918710708618} 11/07/2021 05:50:00 - INFO - __main__ - Step 60940: {'lr': 0.00032857032682622335, 'samples': 11700480, 'steps': 60939, 'loss/train': 1.2725944519042969} 11/07/2021 05:50:01 - INFO - __main__ - Step 60941: {'lr': 0.00032856528895743953, 'samples': 11700672, 'steps': 60940, 'loss/train': 1.3119075298309326} 11/07/2021 05:50:02 - INFO - __main__ - Step 60942: {'lr': 0.00032856025105325537, 'samples': 11700864, 'steps': 60941, 'loss/train': 1.3656136989593506} 11/07/2021 05:50:02 - INFO - __main__ - Step 60943: {'lr': 0.00032855521311367326, 'samples': 11701056, 'steps': 60942, 'loss/train': 1.329738736152649} 11/07/2021 05:50:02 - INFO - __main__ - Step 60944: {'lr': 0.00032855017513869537, 'samples': 11701248, 'steps': 60943, 'loss/train': 1.2539479732513428} 11/07/2021 05:50:03 - INFO - __main__ - Step 60945: {'lr': 0.0003285451371283239, 'samples': 11701440, 'steps': 60944, 'loss/train': 0.6535127758979797} 11/07/2021 05:50:03 - INFO - __main__ - Step 60946: {'lr': 0.00032854009908256127, 'samples': 11701632, 'steps': 60945, 'loss/train': 1.1424142122268677} 11/07/2021 05:50:04 - INFO - __main__ - Step 60947: {'lr': 0.00032853506100140973, 'samples': 11701824, 'steps': 60946, 'loss/train': 1.6103777885437012} 11/07/2021 05:50:04 - INFO - __main__ - Step 60948: {'lr': 0.00032853002288487146, 'samples': 11702016, 'steps': 60947, 'loss/train': 1.7086386680603027} 11/07/2021 05:50:05 - INFO - __main__ - Step 60949: {'lr': 0.00032852498473294874, 'samples': 11702208, 'steps': 60948, 'loss/train': 1.1817028522491455} 11/07/2021 05:50:05 - INFO - __main__ - Step 60950: {'lr': 0.0003285199465456439, 'samples': 11702400, 'steps': 60949, 'loss/train': 1.1594256162643433} 11/07/2021 05:50:05 - INFO - __main__ - Step 60951: {'lr': 0.0003285149083229592, 'samples': 11702592, 'steps': 60950, 'loss/train': 1.2299762964248657} 11/07/2021 05:50:06 - INFO - __main__ - Step 60952: {'lr': 0.00032850987006489686, 'samples': 11702784, 'steps': 60951, 'loss/train': 1.6137765645980835} 11/07/2021 05:50:07 - INFO - __main__ - Step 60953: {'lr': 0.00032850483177145924, 'samples': 11702976, 'steps': 60952, 'loss/train': 1.5926047563552856} 11/07/2021 05:50:07 - INFO - __main__ - Step 60954: {'lr': 0.00032849979344264844, 'samples': 11703168, 'steps': 60953, 'loss/train': 1.2599759101867676} 11/07/2021 05:50:07 - INFO - __main__ - Step 60955: {'lr': 0.00032849475507846696, 'samples': 11703360, 'steps': 60954, 'loss/train': 1.4531519412994385} 11/07/2021 05:50:08 - INFO - __main__ - Step 60956: {'lr': 0.0003284897166789169, 'samples': 11703552, 'steps': 60955, 'loss/train': 1.5889521837234497} 11/07/2021 05:50:09 - INFO - __main__ - Step 60957: {'lr': 0.0003284846782440006, 'samples': 11703744, 'steps': 60956, 'loss/train': 1.7075833082199097} 11/07/2021 05:50:09 - INFO - __main__ - Step 60958: {'lr': 0.0003284796397737203, 'samples': 11703936, 'steps': 60957, 'loss/train': 1.6474119424819946} 11/07/2021 05:50:10 - INFO - __main__ - Step 60959: {'lr': 0.0003284746012680783, 'samples': 11704128, 'steps': 60958, 'loss/train': 1.5012398958206177} 11/07/2021 05:50:10 - INFO - __main__ - Step 60960: {'lr': 0.0003284695627270769, 'samples': 11704320, 'steps': 60959, 'loss/train': 1.3086237907409668} 11/07/2021 05:50:10 - INFO - __main__ - Step 60961: {'lr': 0.00032846452415071826, 'samples': 11704512, 'steps': 60960, 'loss/train': 1.5910253524780273} 11/07/2021 05:50:11 - INFO - __main__ - Step 60962: {'lr': 0.00032845948553900475, 'samples': 11704704, 'steps': 60961, 'loss/train': 1.5576115846633911} 11/07/2021 05:50:12 - INFO - __main__ - Step 60963: {'lr': 0.0003284544468919386, 'samples': 11704896, 'steps': 60962, 'loss/train': 1.6046710014343262} 11/07/2021 05:50:12 - INFO - __main__ - Step 60964: {'lr': 0.0003284494082095221, 'samples': 11705088, 'steps': 60963, 'loss/train': 1.515358328819275} 11/07/2021 05:50:12 - INFO - __main__ - Step 60965: {'lr': 0.00032844436949175745, 'samples': 11705280, 'steps': 60964, 'loss/train': 1.2024003267288208} 11/07/2021 05:50:13 - INFO - __main__ - Step 60966: {'lr': 0.00032843933073864695, 'samples': 11705472, 'steps': 60965, 'loss/train': 1.6858707666397095} 11/07/2021 05:50:14 - INFO - __main__ - Step 60967: {'lr': 0.00032843429195019303, 'samples': 11705664, 'steps': 60966, 'loss/train': 1.7407084703445435} 11/07/2021 05:50:15 - INFO - __main__ - Step 60968: {'lr': 0.00032842925312639775, 'samples': 11705856, 'steps': 60967, 'loss/train': 1.646317481994629} 11/07/2021 05:50:15 - INFO - __main__ - Step 60969: {'lr': 0.0003284242142672635, 'samples': 11706048, 'steps': 60968, 'loss/train': 1.4023878574371338} 11/07/2021 05:50:15 - INFO - __main__ - Step 60970: {'lr': 0.00032841917537279245, 'samples': 11706240, 'steps': 60969, 'loss/train': 1.1254688501358032} 11/07/2021 05:50:16 - INFO - __main__ - Step 60971: {'lr': 0.00032841413644298697, 'samples': 11706432, 'steps': 60970, 'loss/train': 1.7579162120819092} 11/07/2021 05:50:16 - INFO - __main__ - Step 60972: {'lr': 0.00032840909747784924, 'samples': 11706624, 'steps': 60971, 'loss/train': 1.8975846767425537} 11/07/2021 05:50:16 - INFO - __main__ - Step 60973: {'lr': 0.00032840405847738165, 'samples': 11706816, 'steps': 60972, 'loss/train': 1.3754252195358276} 11/07/2021 05:50:17 - INFO - __main__ - Step 60974: {'lr': 0.0003283990194415864, 'samples': 11707008, 'steps': 60973, 'loss/train': 1.0830473899841309} 11/07/2021 05:50:18 - INFO - __main__ - Step 60975: {'lr': 0.0003283939803704657, 'samples': 11707200, 'steps': 60974, 'loss/train': 1.545719027519226} 11/07/2021 05:50:18 - INFO - __main__ - Step 60976: {'lr': 0.0003283889412640219, 'samples': 11707392, 'steps': 60975, 'loss/train': 1.7423877716064453} 11/07/2021 05:50:18 - INFO - __main__ - Step 60977: {'lr': 0.0003283839021222573, 'samples': 11707584, 'steps': 60976, 'loss/train': 1.581856369972229} 11/07/2021 05:50:19 - INFO - __main__ - Step 60978: {'lr': 0.000328378862945174, 'samples': 11707776, 'steps': 60977, 'loss/train': 1.4505118131637573} 11/07/2021 05:50:20 - INFO - __main__ - Step 60979: {'lr': 0.0003283738237327745, 'samples': 11707968, 'steps': 60978, 'loss/train': 0.9981711506843567} 11/07/2021 05:50:20 - INFO - __main__ - Step 60980: {'lr': 0.000328368784485061, 'samples': 11708160, 'steps': 60979, 'loss/train': 1.6145329475402832} 11/07/2021 05:50:20 - INFO - __main__ - Step 60981: {'lr': 0.00032836374520203574, 'samples': 11708352, 'steps': 60980, 'loss/train': 0.5990775227546692} 11/07/2021 05:50:21 - INFO - __main__ - Step 60982: {'lr': 0.0003283587058837009, 'samples': 11708544, 'steps': 60981, 'loss/train': 1.2665526866912842} 11/07/2021 05:50:21 - INFO - __main__ - Step 60983: {'lr': 0.0003283536665300588, 'samples': 11708736, 'steps': 60982, 'loss/train': 1.2701187133789062} 11/07/2021 05:50:23 - INFO - __main__ - Step 60984: {'lr': 0.00032834862714111184, 'samples': 11708928, 'steps': 60983, 'loss/train': 1.7815805673599243} 11/07/2021 05:50:23 - INFO - __main__ - Step 60985: {'lr': 0.0003283435877168622, 'samples': 11709120, 'steps': 60984, 'loss/train': 0.09350957721471786} 11/07/2021 05:50:23 - INFO - __main__ - Step 60986: {'lr': 0.00032833854825731207, 'samples': 11709312, 'steps': 60985, 'loss/train': 1.3603200912475586} 11/07/2021 05:50:24 - INFO - __main__ - Step 60987: {'lr': 0.00032833350876246395, 'samples': 11709504, 'steps': 60986, 'loss/train': 1.3743747472763062} 11/07/2021 05:50:24 - INFO - __main__ - Step 60988: {'lr': 0.0003283284692323198, 'samples': 11709696, 'steps': 60987, 'loss/train': 1.5840610265731812} 11/07/2021 05:50:25 - INFO - __main__ - Step 60989: {'lr': 0.0003283234296668821, 'samples': 11709888, 'steps': 60988, 'loss/train': 1.4733576774597168} 11/07/2021 05:50:25 - INFO - __main__ - Step 60990: {'lr': 0.00032831839006615307, 'samples': 11710080, 'steps': 60989, 'loss/train': 1.3523863554000854} 11/07/2021 05:50:26 - INFO - __main__ - Step 60991: {'lr': 0.000328313350430135, 'samples': 11710272, 'steps': 60990, 'loss/train': 1.568184494972229} 11/07/2021 05:50:26 - INFO - __main__ - Step 60992: {'lr': 0.0003283083107588301, 'samples': 11710464, 'steps': 60991, 'loss/train': 1.1772278547286987} 11/07/2021 05:50:26 - INFO - __main__ - Step 60993: {'lr': 0.0003283032710522407, 'samples': 11710656, 'steps': 60992, 'loss/train': 1.4237229824066162} 11/07/2021 05:50:28 - INFO - __main__ - Step 60994: {'lr': 0.0003282982313103691, 'samples': 11710848, 'steps': 60993, 'loss/train': 1.361504077911377} 11/07/2021 05:50:28 - INFO - __main__ - Step 60995: {'lr': 0.0003282931915332175, 'samples': 11711040, 'steps': 60994, 'loss/train': 1.5041236877441406} 11/07/2021 05:50:28 - INFO - __main__ - Step 60996: {'lr': 0.0003282881517207882, 'samples': 11711232, 'steps': 60995, 'loss/train': 0.43361717462539673} 11/07/2021 05:50:29 - INFO - __main__ - Step 60997: {'lr': 0.00032828311187308346, 'samples': 11711424, 'steps': 60996, 'loss/train': 1.9876699447631836} 11/07/2021 05:50:29 - INFO - __main__ - Step 60998: {'lr': 0.00032827807199010554, 'samples': 11711616, 'steps': 60997, 'loss/train': 1.9234117269515991} 11/07/2021 05:50:29 - INFO - __main__ - Step 60999: {'lr': 0.00032827303207185675, 'samples': 11711808, 'steps': 60998, 'loss/train': 1.502403974533081} 11/07/2021 05:50:30 - INFO - __main__ - Step 61000: {'lr': 0.00032826799211833934, 'samples': 11712000, 'steps': 60999, 'loss/train': 1.4910343885421753} 11/07/2021 05:50:31 - INFO - __main__ - Step 61001: {'lr': 0.0003282629521295556, 'samples': 11712192, 'steps': 61000, 'loss/train': 1.5350884199142456} 11/07/2021 05:50:31 - INFO - __main__ - Step 61002: {'lr': 0.00032825791210550775, 'samples': 11712384, 'steps': 61001, 'loss/train': 1.6233097314834595} 11/07/2021 05:50:31 - INFO - __main__ - Step 61003: {'lr': 0.00032825287204619807, 'samples': 11712576, 'steps': 61002, 'loss/train': 0.9833868145942688} 11/07/2021 05:50:32 - INFO - __main__ - Step 61004: {'lr': 0.0003282478319516289, 'samples': 11712768, 'steps': 61003, 'loss/train': 1.0935187339782715} 11/07/2021 05:50:33 - INFO - __main__ - Step 61005: {'lr': 0.00032824279182180243, 'samples': 11712960, 'steps': 61004, 'loss/train': 1.507974624633789} 11/07/2021 05:50:33 - INFO - __main__ - Step 61006: {'lr': 0.00032823775165672096, 'samples': 11713152, 'steps': 61005, 'loss/train': 1.4777790307998657} 11/07/2021 05:50:33 - INFO - __main__ - Step 61007: {'lr': 0.0003282327114563869, 'samples': 11713344, 'steps': 61006, 'loss/train': 0.6834260821342468} 11/07/2021 05:50:34 - INFO - __main__ - Step 61008: {'lr': 0.0003282276712208022, 'samples': 11713536, 'steps': 61007, 'loss/train': 1.6291919946670532} 11/07/2021 05:50:34 - INFO - __main__ - Step 61009: {'lr': 0.0003282226309499694, 'samples': 11713728, 'steps': 61008, 'loss/train': 1.5804270505905151} 11/07/2021 05:50:35 - INFO - __main__ - Step 61010: {'lr': 0.0003282175906438907, 'samples': 11713920, 'steps': 61009, 'loss/train': 2.0539088249206543} 11/07/2021 05:50:36 - INFO - __main__ - Step 61011: {'lr': 0.00032821255030256836, 'samples': 11714112, 'steps': 61010, 'loss/train': 1.29142165184021} 11/07/2021 05:50:36 - INFO - __main__ - Step 61012: {'lr': 0.00032820750992600464, 'samples': 11714304, 'steps': 61011, 'loss/train': 1.480143427848816} 11/07/2021 05:50:36 - INFO - __main__ - Step 61013: {'lr': 0.0003282024695142018, 'samples': 11714496, 'steps': 61012, 'loss/train': 1.317313313484192} 11/07/2021 05:50:37 - INFO - __main__ - Step 61014: {'lr': 0.0003281974290671622, 'samples': 11714688, 'steps': 61013, 'loss/train': 1.4092847108840942} 11/07/2021 05:50:37 - INFO - __main__ - Step 61015: {'lr': 0.000328192388584888, 'samples': 11714880, 'steps': 61014, 'loss/train': 1.549198031425476} 11/07/2021 05:50:38 - INFO - __main__ - Step 61016: {'lr': 0.00032818734806738147, 'samples': 11715072, 'steps': 61015, 'loss/train': 1.314173698425293} 11/07/2021 05:50:38 - INFO - __main__ - Step 61017: {'lr': 0.00032818230751464493, 'samples': 11715264, 'steps': 61016, 'loss/train': 1.880759835243225} 11/07/2021 05:50:39 - INFO - __main__ - Step 61018: {'lr': 0.0003281772669266807, 'samples': 11715456, 'steps': 61017, 'loss/train': 1.2192219495773315} 11/07/2021 05:50:39 - INFO - __main__ - Step 61019: {'lr': 0.00032817222630349103, 'samples': 11715648, 'steps': 61018, 'loss/train': 1.0569159984588623} 11/07/2021 05:50:39 - INFO - __main__ - Step 61020: {'lr': 0.00032816718564507806, 'samples': 11715840, 'steps': 61019, 'loss/train': 2.9799513816833496} 11/07/2021 05:50:40 - INFO - __main__ - Step 61021: {'lr': 0.0003281621449514443, 'samples': 11716032, 'steps': 61020, 'loss/train': 1.5769379138946533} 11/07/2021 05:50:41 - INFO - __main__ - Step 61022: {'lr': 0.0003281571042225918, 'samples': 11716224, 'steps': 61021, 'loss/train': 0.3695509731769562} 11/07/2021 05:50:41 - INFO - __main__ - Step 61023: {'lr': 0.0003281520634585229, 'samples': 11716416, 'steps': 61022, 'loss/train': 0.7960950136184692} 11/07/2021 05:50:41 - INFO - __main__ - Step 61024: {'lr': 0.0003281470226592399, 'samples': 11716608, 'steps': 61023, 'loss/train': 1.4980852603912354} 11/07/2021 05:50:42 - INFO - __main__ - Step 61025: {'lr': 0.0003281419818247451, 'samples': 11716800, 'steps': 61024, 'loss/train': 1.8101065158843994} 11/07/2021 05:50:43 - INFO - __main__ - Step 61026: {'lr': 0.00032813694095504064, 'samples': 11716992, 'steps': 61025, 'loss/train': 1.6509958505630493} 11/07/2021 05:50:43 - INFO - __main__ - Step 61027: {'lr': 0.000328131900050129, 'samples': 11717184, 'steps': 61026, 'loss/train': 0.789555549621582} 11/07/2021 05:50:44 - INFO - __main__ - Step 61028: {'lr': 0.0003281268591100123, 'samples': 11717376, 'steps': 61027, 'loss/train': 1.5360044240951538} 11/07/2021 05:50:44 - INFO - __main__ - Step 61029: {'lr': 0.00032812181813469276, 'samples': 11717568, 'steps': 61028, 'loss/train': 1.9363083839416504} 11/07/2021 05:50:44 - INFO - __main__ - Step 61030: {'lr': 0.0003281167771241728, 'samples': 11717760, 'steps': 61029, 'loss/train': 1.7857050895690918} 11/07/2021 05:50:45 - INFO - __main__ - Step 61031: {'lr': 0.00032811173607845455, 'samples': 11717952, 'steps': 61030, 'loss/train': 1.2539656162261963} 11/07/2021 05:50:46 - INFO - __main__ - Step 61032: {'lr': 0.0003281066949975404, 'samples': 11718144, 'steps': 61031, 'loss/train': 0.551241397857666} 11/07/2021 05:50:46 - INFO - __main__ - Step 61033: {'lr': 0.00032810165388143264, 'samples': 11718336, 'steps': 61032, 'loss/train': 1.083627462387085} 11/07/2021 05:50:47 - INFO - __main__ - Step 61034: {'lr': 0.00032809661273013345, 'samples': 11718528, 'steps': 61033, 'loss/train': 1.4929090738296509} 11/07/2021 05:50:47 - INFO - __main__ - Step 61035: {'lr': 0.0003280915715436451, 'samples': 11718720, 'steps': 61034, 'loss/train': 1.2530862092971802} 11/07/2021 05:50:47 - INFO - __main__ - Step 61036: {'lr': 0.00032808653032196993, 'samples': 11718912, 'steps': 61035, 'loss/train': 1.6660953760147095} 11/07/2021 05:50:48 - INFO - __main__ - Step 61037: {'lr': 0.00032808148906511017, 'samples': 11719104, 'steps': 61036, 'loss/train': 1.1134589910507202} 11/07/2021 05:50:49 - INFO - __main__ - Step 61038: {'lr': 0.00032807644777306804, 'samples': 11719296, 'steps': 61037, 'loss/train': 1.4202388525009155} 11/07/2021 05:50:49 - INFO - __main__ - Step 61039: {'lr': 0.00032807140644584593, 'samples': 11719488, 'steps': 61038, 'loss/train': 1.2152388095855713} 11/07/2021 05:50:49 - INFO - __main__ - Step 61040: {'lr': 0.000328066365083446, 'samples': 11719680, 'steps': 61039, 'loss/train': 1.6370117664337158} 11/07/2021 05:50:50 - INFO - __main__ - Step 61041: {'lr': 0.0003280613236858707, 'samples': 11719872, 'steps': 61040, 'loss/train': 1.3871208429336548} 11/07/2021 05:50:51 - INFO - __main__ - Step 61042: {'lr': 0.000328056282253122, 'samples': 11720064, 'steps': 61041, 'loss/train': 1.781936526298523} 11/07/2021 05:50:51 - INFO - __main__ - Step 61043: {'lr': 0.0003280512407852024, 'samples': 11720256, 'steps': 61042, 'loss/train': 2.106290578842163} 11/07/2021 05:50:52 - INFO - __main__ - Step 61044: {'lr': 0.00032804619928211416, 'samples': 11720448, 'steps': 61043, 'loss/train': 1.2413381338119507} 11/07/2021 05:50:52 - INFO - __main__ - Step 61045: {'lr': 0.0003280411577438595, 'samples': 11720640, 'steps': 61044, 'loss/train': 0.8452399969100952} 11/07/2021 05:50:52 - INFO - __main__ - Step 61046: {'lr': 0.00032803611617044065, 'samples': 11720832, 'steps': 61045, 'loss/train': 1.6051359176635742} 11/07/2021 05:50:53 - INFO - __main__ - Step 61047: {'lr': 0.00032803107456186, 'samples': 11721024, 'steps': 61046, 'loss/train': 1.3182430267333984} 11/07/2021 05:50:54 - INFO - __main__ - Step 61048: {'lr': 0.00032802603291811965, 'samples': 11721216, 'steps': 61047, 'loss/train': 1.4768226146697998} 11/07/2021 05:50:54 - INFO - __main__ - Step 61049: {'lr': 0.00032802099123922204, 'samples': 11721408, 'steps': 61048, 'loss/train': 0.9101802110671997} 11/07/2021 05:50:54 - INFO - __main__ - Step 61050: {'lr': 0.00032801594952516934, 'samples': 11721600, 'steps': 61049, 'loss/train': 1.6374058723449707} 11/07/2021 05:50:55 - INFO - __main__ - Step 61051: {'lr': 0.0003280109077759639, 'samples': 11721792, 'steps': 61050, 'loss/train': 1.23479163646698} 11/07/2021 05:50:56 - INFO - __main__ - Step 61052: {'lr': 0.000328005865991608, 'samples': 11721984, 'steps': 61051, 'loss/train': 1.8454961776733398} 11/07/2021 05:50:56 - INFO - __main__ - Step 61053: {'lr': 0.0003280008241721038, 'samples': 11722176, 'steps': 61052, 'loss/train': 1.49979829788208} 11/07/2021 05:50:57 - INFO - __main__ - Step 61054: {'lr': 0.00032799578231745353, 'samples': 11722368, 'steps': 61053, 'loss/train': 1.3875283002853394} 11/07/2021 05:50:57 - INFO - __main__ - Step 61055: {'lr': 0.0003279907404276596, 'samples': 11722560, 'steps': 61054, 'loss/train': 2.0466017723083496} 11/07/2021 05:50:57 - INFO - __main__ - Step 61056: {'lr': 0.00032798569850272434, 'samples': 11722752, 'steps': 61055, 'loss/train': 1.3644413948059082} 11/07/2021 05:50:58 - INFO - __main__ - Step 61057: {'lr': 0.00032798065654264996, 'samples': 11722944, 'steps': 61056, 'loss/train': 1.7765916585922241} 11/07/2021 05:50:59 - INFO - __main__ - Step 61058: {'lr': 0.00032797561454743864, 'samples': 11723136, 'steps': 61057, 'loss/train': 1.1016849279403687} 11/07/2021 05:50:59 - INFO - __main__ - Step 61059: {'lr': 0.00032797057251709267, 'samples': 11723328, 'steps': 61058, 'loss/train': 1.0619457960128784} 11/07/2021 05:50:59 - INFO - __main__ - Step 61060: {'lr': 0.0003279655304516144, 'samples': 11723520, 'steps': 61059, 'loss/train': 1.5456308126449585} 11/07/2021 05:51:00 - INFO - __main__ - Step 61061: {'lr': 0.00032796048835100603, 'samples': 11723712, 'steps': 61060, 'loss/train': 1.696075677871704} 11/07/2021 05:51:00 - INFO - __main__ - Step 61062: {'lr': 0.00032795544621527, 'samples': 11723904, 'steps': 61061, 'loss/train': 1.8837459087371826} 11/07/2021 05:51:01 - INFO - __main__ - Step 61063: {'lr': 0.0003279504040444083, 'samples': 11724096, 'steps': 61062, 'loss/train': 1.5717778205871582} 11/07/2021 05:51:01 - INFO - __main__ - Step 61064: {'lr': 0.0003279453618384234, 'samples': 11724288, 'steps': 61063, 'loss/train': 2.987701892852783} 11/07/2021 05:51:02 - INFO - __main__ - Step 61065: {'lr': 0.0003279403195973175, 'samples': 11724480, 'steps': 61064, 'loss/train': 1.8139675855636597} 11/07/2021 05:51:02 - INFO - __main__ - Step 61066: {'lr': 0.0003279352773210929, 'samples': 11724672, 'steps': 61065, 'loss/train': 1.2584967613220215} 11/07/2021 05:51:02 - INFO - __main__ - Step 61067: {'lr': 0.0003279302350097519, 'samples': 11724864, 'steps': 61066, 'loss/train': 1.164832592010498} 11/07/2021 05:51:04 - INFO - __main__ - Step 61068: {'lr': 0.00032792519266329674, 'samples': 11725056, 'steps': 61067, 'loss/train': 1.580984354019165} 11/07/2021 05:51:04 - INFO - __main__ - Step 61069: {'lr': 0.00032792015028172965, 'samples': 11725248, 'steps': 61068, 'loss/train': 1.4725831747055054} 11/07/2021 05:51:04 - INFO - __main__ - Step 61070: {'lr': 0.00032791510786505296, 'samples': 11725440, 'steps': 61069, 'loss/train': 0.8660678267478943} 11/07/2021 05:51:05 - INFO - __main__ - Step 61071: {'lr': 0.00032791006541326893, 'samples': 11725632, 'steps': 61070, 'loss/train': 1.5362060070037842} 11/07/2021 05:51:05 - INFO - __main__ - Step 61072: {'lr': 0.0003279050229263798, 'samples': 11725824, 'steps': 61071, 'loss/train': 1.2969363927841187} 11/07/2021 05:51:06 - INFO - __main__ - Step 61073: {'lr': 0.0003278999804043879, 'samples': 11726016, 'steps': 61072, 'loss/train': 1.6157944202423096} 11/07/2021 05:51:06 - INFO - __main__ - Step 61074: {'lr': 0.0003278949378472955, 'samples': 11726208, 'steps': 61073, 'loss/train': 1.332826852798462} 11/07/2021 05:51:07 - INFO - __main__ - Step 61075: {'lr': 0.0003278898952551048, 'samples': 11726400, 'steps': 61074, 'loss/train': 1.519913911819458} 11/07/2021 05:51:07 - INFO - __main__ - Step 61076: {'lr': 0.0003278848526278181, 'samples': 11726592, 'steps': 61075, 'loss/train': 1.7936556339263916} 11/07/2021 05:51:07 - INFO - __main__ - Step 61077: {'lr': 0.0003278798099654377, 'samples': 11726784, 'steps': 61076, 'loss/train': 1.518728256225586} 11/07/2021 05:51:08 - INFO - __main__ - Step 61078: {'lr': 0.0003278747672679659, 'samples': 11726976, 'steps': 61077, 'loss/train': 1.2895351648330688} 11/07/2021 05:51:09 - INFO - __main__ - Step 61079: {'lr': 0.00032786972453540487, 'samples': 11727168, 'steps': 61078, 'loss/train': 1.466309905052185} 11/07/2021 05:51:09 - INFO - __main__ - Step 61080: {'lr': 0.00032786468176775697, 'samples': 11727360, 'steps': 61079, 'loss/train': 1.2341852188110352} 11/07/2021 05:51:09 - INFO - __main__ - Step 61081: {'lr': 0.00032785963896502445, 'samples': 11727552, 'steps': 61080, 'loss/train': 0.8937261700630188} 11/07/2021 05:51:10 - INFO - __main__ - Step 61082: {'lr': 0.0003278545961272096, 'samples': 11727744, 'steps': 61081, 'loss/train': 1.9857218265533447} 11/07/2021 05:51:11 - INFO - __main__ - Step 61083: {'lr': 0.00032784955325431466, 'samples': 11727936, 'steps': 61082, 'loss/train': 1.5603748559951782} 11/07/2021 05:51:11 - INFO - __main__ - Step 61084: {'lr': 0.0003278445103463419, 'samples': 11728128, 'steps': 61083, 'loss/train': 1.7255642414093018} 11/07/2021 05:51:11 - INFO - __main__ - Step 61085: {'lr': 0.00032783946740329355, 'samples': 11728320, 'steps': 61084, 'loss/train': 1.4104775190353394} 11/07/2021 05:51:12 - INFO - __main__ - Step 61086: {'lr': 0.00032783442442517203, 'samples': 11728512, 'steps': 61085, 'loss/train': 1.660438060760498} 11/07/2021 05:51:12 - INFO - __main__ - Step 61087: {'lr': 0.0003278293814119795, 'samples': 11728704, 'steps': 61086, 'loss/train': 0.778331995010376} 11/07/2021 05:51:13 - INFO - __main__ - Step 61088: {'lr': 0.0003278243383637182, 'samples': 11728896, 'steps': 61087, 'loss/train': 1.1552221775054932} 11/07/2021 05:51:14 - INFO - __main__ - Step 61089: {'lr': 0.0003278192952803905, 'samples': 11729088, 'steps': 61088, 'loss/train': 0.6114634275436401} 11/07/2021 05:51:14 - INFO - __main__ - Step 61090: {'lr': 0.00032781425216199864, 'samples': 11729280, 'steps': 61089, 'loss/train': 1.5190335512161255} 11/07/2021 05:51:14 - INFO - __main__ - Step 61091: {'lr': 0.0003278092090085448, 'samples': 11729472, 'steps': 61090, 'loss/train': 1.3329195976257324} 11/07/2021 05:51:15 - INFO - __main__ - Step 61092: {'lr': 0.00032780416582003143, 'samples': 11729664, 'steps': 61091, 'loss/train': 2.287088632583618} 11/07/2021 05:51:16 - INFO - __main__ - Step 61093: {'lr': 0.0003277991225964606, 'samples': 11729856, 'steps': 61092, 'loss/train': 1.2586085796356201} 11/07/2021 05:51:16 - INFO - __main__ - Step 61094: {'lr': 0.00032779407933783476, 'samples': 11730048, 'steps': 61093, 'loss/train': 1.249376654624939} 11/07/2021 05:51:16 - INFO - __main__ - Step 61095: {'lr': 0.0003277890360441561, 'samples': 11730240, 'steps': 61094, 'loss/train': 1.0457830429077148} 11/07/2021 05:51:17 - INFO - __main__ - Step 61096: {'lr': 0.0003277839927154269, 'samples': 11730432, 'steps': 61095, 'loss/train': 1.6110353469848633} 11/07/2021 05:51:17 - INFO - __main__ - Step 61097: {'lr': 0.0003277789493516494, 'samples': 11730624, 'steps': 61096, 'loss/train': 1.315926194190979} 11/07/2021 05:51:17 - INFO - __main__ - Step 61098: {'lr': 0.00032777390595282595, 'samples': 11730816, 'steps': 61097, 'loss/train': 1.1741043329238892} 11/07/2021 05:51:19 - INFO - __main__ - Step 61099: {'lr': 0.00032776886251895874, 'samples': 11731008, 'steps': 61098, 'loss/train': 1.6774158477783203} 11/07/2021 05:51:19 - INFO - __main__ - Step 61100: {'lr': 0.0003277638190500501, 'samples': 11731200, 'steps': 61099, 'loss/train': 1.6008009910583496} 11/07/2021 05:51:19 - INFO - __main__ - Step 61101: {'lr': 0.0003277587755461023, 'samples': 11731392, 'steps': 61100, 'loss/train': 1.2453557252883911} 11/07/2021 05:51:20 - INFO - __main__ - Step 61102: {'lr': 0.0003277537320071176, 'samples': 11731584, 'steps': 61101, 'loss/train': 1.1743148565292358} 11/07/2021 05:51:20 - INFO - __main__ - Step 61103: {'lr': 0.00032774868843309823, 'samples': 11731776, 'steps': 61102, 'loss/train': 1.0753929615020752} 11/07/2021 05:51:21 - INFO - __main__ - Step 61104: {'lr': 0.0003277436448240465, 'samples': 11731968, 'steps': 61103, 'loss/train': 0.7344729900360107} 11/07/2021 05:51:21 - INFO - __main__ - Step 61105: {'lr': 0.00032773860117996475, 'samples': 11732160, 'steps': 61104, 'loss/train': 1.2093058824539185} 11/07/2021 05:51:22 - INFO - __main__ - Step 61106: {'lr': 0.0003277335575008551, 'samples': 11732352, 'steps': 61105, 'loss/train': 1.5719099044799805} 11/07/2021 05:51:22 - INFO - __main__ - Step 61107: {'lr': 0.00032772851378672, 'samples': 11732544, 'steps': 61106, 'loss/train': 0.970406711101532} 11/07/2021 05:51:22 - INFO - __main__ - Step 61108: {'lr': 0.00032772347003756153, 'samples': 11732736, 'steps': 61107, 'loss/train': 1.6089681386947632} 11/07/2021 05:51:23 - INFO - __main__ - Step 61109: {'lr': 0.0003277184262533821, 'samples': 11732928, 'steps': 61108, 'loss/train': 0.5141808986663818} 11/07/2021 05:51:24 - INFO - __main__ - Step 61110: {'lr': 0.00032771338243418397, 'samples': 11733120, 'steps': 61109, 'loss/train': 1.494165301322937} 11/07/2021 05:51:24 - INFO - __main__ - Step 61111: {'lr': 0.0003277083385799694, 'samples': 11733312, 'steps': 61110, 'loss/train': 1.0327638387680054} 11/07/2021 05:51:25 - INFO - __main__ - Step 61112: {'lr': 0.0003277032946907406, 'samples': 11733504, 'steps': 61111, 'loss/train': 1.7614773511886597} 11/07/2021 05:51:25 - INFO - __main__ - Step 61113: {'lr': 0.0003276982507664999, 'samples': 11733696, 'steps': 61112, 'loss/train': 1.390594720840454} 11/07/2021 05:51:26 - INFO - __main__ - Step 61114: {'lr': 0.00032769320680724954, 'samples': 11733888, 'steps': 61113, 'loss/train': 1.03023099899292} 11/07/2021 05:51:26 - INFO - __main__ - Step 61115: {'lr': 0.00032768816281299195, 'samples': 11734080, 'steps': 61114, 'loss/train': 1.3134782314300537} 11/07/2021 05:51:27 - INFO - __main__ - Step 61116: {'lr': 0.0003276831187837292, 'samples': 11734272, 'steps': 61115, 'loss/train': 1.4178789854049683} 11/07/2021 05:51:27 - INFO - __main__ - Step 61117: {'lr': 0.00032767807471946366, 'samples': 11734464, 'steps': 61116, 'loss/train': 1.3784812688827515} 11/07/2021 05:51:27 - INFO - __main__ - Step 61118: {'lr': 0.00032767303062019746, 'samples': 11734656, 'steps': 61117, 'loss/train': 1.6506515741348267} 11/07/2021 05:51:29 - INFO - __main__ - Step 61119: {'lr': 0.0003276679864859331, 'samples': 11734848, 'steps': 61118, 'loss/train': 1.2997311353683472} 11/07/2021 05:51:29 - INFO - __main__ - Step 61120: {'lr': 0.0003276629423166727, 'samples': 11735040, 'steps': 61119, 'loss/train': 1.195374608039856} 11/07/2021 05:51:29 - INFO - __main__ - Step 61121: {'lr': 0.00032765789811241866, 'samples': 11735232, 'steps': 61120, 'loss/train': 1.1335614919662476} 11/07/2021 05:51:30 - INFO - __main__ - Step 61122: {'lr': 0.0003276528538731731, 'samples': 11735424, 'steps': 61121, 'loss/train': 1.4917290210723877} 11/07/2021 05:51:30 - INFO - __main__ - Step 61123: {'lr': 0.0003276478095989384, 'samples': 11735616, 'steps': 61122, 'loss/train': 1.07723867893219} 11/07/2021 05:51:31 - INFO - __main__ - Step 61124: {'lr': 0.0003276427652897167, 'samples': 11735808, 'steps': 61123, 'loss/train': 1.4772253036499023} 11/07/2021 05:51:31 - INFO - __main__ - Step 61125: {'lr': 0.0003276377209455104, 'samples': 11736000, 'steps': 61124, 'loss/train': 1.6047300100326538} 11/07/2021 05:51:32 - INFO - __main__ - Step 61126: {'lr': 0.0003276326765663218, 'samples': 11736192, 'steps': 61125, 'loss/train': 1.2581044435501099} 11/07/2021 05:51:32 - INFO - __main__ - Step 61127: {'lr': 0.0003276276321521531, 'samples': 11736384, 'steps': 61126, 'loss/train': 1.323181390762329} 11/07/2021 05:51:32 - INFO - __main__ - Step 61128: {'lr': 0.00032762258770300656, 'samples': 11736576, 'steps': 61127, 'loss/train': 1.4986112117767334} 11/07/2021 05:51:33 - INFO - __main__ - Step 61129: {'lr': 0.0003276175432188845, 'samples': 11736768, 'steps': 61128, 'loss/train': 1.1463905572891235} 11/07/2021 05:51:34 - INFO - __main__ - Step 61130: {'lr': 0.00032761249869978917, 'samples': 11736960, 'steps': 61129, 'loss/train': 1.6076515913009644} 11/07/2021 05:51:34 - INFO - __main__ - Step 61131: {'lr': 0.00032760745414572287, 'samples': 11737152, 'steps': 61130, 'loss/train': 1.0312691926956177} 11/07/2021 05:51:35 - INFO - __main__ - Step 61132: {'lr': 0.0003276024095566878, 'samples': 11737344, 'steps': 61131, 'loss/train': 1.1000157594680786} 11/07/2021 05:51:35 - INFO - __main__ - Step 61133: {'lr': 0.0003275973649326863, 'samples': 11737536, 'steps': 61132, 'loss/train': 1.6720885038375854} 11/07/2021 05:51:35 - INFO - __main__ - Step 61134: {'lr': 0.0003275923202737206, 'samples': 11737728, 'steps': 61133, 'loss/train': 1.4818214178085327} 11/07/2021 05:51:36 - INFO - __main__ - Step 61135: {'lr': 0.00032758727557979304, 'samples': 11737920, 'steps': 61134, 'loss/train': 1.8701986074447632} 11/07/2021 05:51:37 - INFO - __main__ - Step 61136: {'lr': 0.00032758223085090586, 'samples': 11738112, 'steps': 61135, 'loss/train': 1.2974063158035278} 11/07/2021 05:51:37 - INFO - __main__ - Step 61137: {'lr': 0.0003275771860870613, 'samples': 11738304, 'steps': 61136, 'loss/train': 1.0082355737686157} 11/07/2021 05:51:37 - INFO - __main__ - Step 61138: {'lr': 0.0003275721412882616, 'samples': 11738496, 'steps': 61137, 'loss/train': 1.4605848789215088} 11/07/2021 05:51:38 - INFO - __main__ - Step 61139: {'lr': 0.00032756709645450916, 'samples': 11738688, 'steps': 61138, 'loss/train': 1.4013506174087524} 11/07/2021 05:51:38 - INFO - __main__ - Step 61140: {'lr': 0.00032756205158580615, 'samples': 11738880, 'steps': 61139, 'loss/train': 1.3920090198516846} 11/07/2021 05:51:39 - INFO - __main__ - Step 61141: {'lr': 0.00032755700668215496, 'samples': 11739072, 'steps': 61140, 'loss/train': 1.5448421239852905} 11/07/2021 05:51:40 - INFO - __main__ - Step 61142: {'lr': 0.0003275519617435577, 'samples': 11739264, 'steps': 61141, 'loss/train': 1.5818395614624023} 11/07/2021 05:51:40 - INFO - __main__ - Step 61143: {'lr': 0.00032754691677001674, 'samples': 11739456, 'steps': 61142, 'loss/train': 1.6005518436431885} 11/07/2021 05:51:40 - INFO - __main__ - Step 61144: {'lr': 0.0003275418717615343, 'samples': 11739648, 'steps': 61143, 'loss/train': 1.545937418937683} 11/07/2021 05:51:41 - INFO - __main__ - Step 61145: {'lr': 0.00032753682671811277, 'samples': 11739840, 'steps': 61144, 'loss/train': 1.2838823795318604} 11/07/2021 05:51:43 - INFO - __main__ - Step 61146: {'lr': 0.00032753178163975427, 'samples': 11740032, 'steps': 61145, 'loss/train': 1.5442448854446411} 11/07/2021 05:51:43 - INFO - __main__ - Step 61147: {'lr': 0.00032752673652646115, 'samples': 11740224, 'steps': 61146, 'loss/train': 1.7692872285842896} 11/07/2021 05:51:44 - INFO - __main__ - Step 61148: {'lr': 0.00032752169137823575, 'samples': 11740416, 'steps': 61147, 'loss/train': 1.9901727437973022} 11/07/2021 05:51:44 - INFO - __main__ - Step 61149: {'lr': 0.0003275166461950802, 'samples': 11740608, 'steps': 61148, 'loss/train': 1.7491075992584229} 11/07/2021 05:51:44 - INFO - __main__ - Step 61150: {'lr': 0.0003275116009769969, 'samples': 11740800, 'steps': 61149, 'loss/train': 1.7617838382720947} 11/07/2021 05:51:45 - INFO - __main__ - Step 61151: {'lr': 0.000327506555723988, 'samples': 11740992, 'steps': 61150, 'loss/train': 1.5644147396087646} 11/07/2021 05:51:45 - INFO - __main__ - Step 61152: {'lr': 0.00032750151043605584, 'samples': 11741184, 'steps': 61151, 'loss/train': 1.5719618797302246} 11/07/2021 05:51:46 - INFO - __main__ - Step 61153: {'lr': 0.00032749646511320276, 'samples': 11741376, 'steps': 61152, 'loss/train': 1.8954986333847046} 11/07/2021 05:51:46 - INFO - __main__ - Step 61154: {'lr': 0.00032749141975543095, 'samples': 11741568, 'steps': 61153, 'loss/train': 1.387721061706543} 11/07/2021 05:51:47 - INFO - __main__ - Step 61155: {'lr': 0.0003274863743627427, 'samples': 11741760, 'steps': 61154, 'loss/train': 0.5132010579109192} 11/07/2021 05:51:47 - INFO - __main__ - Step 61156: {'lr': 0.00032748132893514027, 'samples': 11741952, 'steps': 61155, 'loss/train': 0.8893358111381531} 11/07/2021 05:51:47 - INFO - __main__ - Step 61157: {'lr': 0.00032747628347262595, 'samples': 11742144, 'steps': 61156, 'loss/train': 0.17172792553901672} 11/07/2021 05:51:48 - INFO - __main__ - Step 61158: {'lr': 0.00032747123797520207, 'samples': 11742336, 'steps': 61157, 'loss/train': 1.3810263872146606} 11/07/2021 05:51:49 - INFO - __main__ - Step 61159: {'lr': 0.0003274661924428707, 'samples': 11742528, 'steps': 61158, 'loss/train': 1.3487558364868164} 11/07/2021 05:51:49 - INFO - __main__ - Step 61160: {'lr': 0.0003274611468756344, 'samples': 11742720, 'steps': 61159, 'loss/train': 0.9172453880310059} 11/07/2021 05:51:49 - INFO - __main__ - Step 61161: {'lr': 0.00032745610127349524, 'samples': 11742912, 'steps': 61160, 'loss/train': 1.6749958992004395} 11/07/2021 05:51:50 - INFO - __main__ - Step 61162: {'lr': 0.0003274510556364556, 'samples': 11743104, 'steps': 61161, 'loss/train': 1.7863643169403076} 11/07/2021 05:51:51 - INFO - __main__ - Step 61163: {'lr': 0.00032744600996451766, 'samples': 11743296, 'steps': 61162, 'loss/train': 1.2058573961257935} 11/07/2021 05:51:51 - INFO - __main__ - Step 61164: {'lr': 0.00032744096425768376, 'samples': 11743488, 'steps': 61163, 'loss/train': 1.5258495807647705} 11/07/2021 05:51:51 - INFO - __main__ - Step 61165: {'lr': 0.0003274359185159562, 'samples': 11743680, 'steps': 61164, 'loss/train': 1.905347466468811} 11/07/2021 05:51:52 - INFO - __main__ - Step 61166: {'lr': 0.00032743087273933715, 'samples': 11743872, 'steps': 61165, 'loss/train': 1.167897343635559} 11/07/2021 05:51:52 - INFO - __main__ - Step 61167: {'lr': 0.00032742582692782895, 'samples': 11744064, 'steps': 61166, 'loss/train': 1.7359015941619873} 11/07/2021 05:51:53 - INFO - __main__ - Step 61168: {'lr': 0.00032742078108143394, 'samples': 11744256, 'steps': 61167, 'loss/train': 0.8394058346748352} 11/07/2021 05:51:53 - INFO - __main__ - Step 61169: {'lr': 0.0003274157352001543, 'samples': 11744448, 'steps': 61168, 'loss/train': 1.4408408403396606} 11/07/2021 05:51:54 - INFO - __main__ - Step 61170: {'lr': 0.0003274106892839923, 'samples': 11744640, 'steps': 61169, 'loss/train': 1.0708322525024414} 11/07/2021 05:51:54 - INFO - __main__ - Step 61171: {'lr': 0.00032740564333295013, 'samples': 11744832, 'steps': 61170, 'loss/train': 1.2573456764221191} 11/07/2021 05:51:54 - INFO - __main__ - Step 61172: {'lr': 0.00032740059734703034, 'samples': 11745024, 'steps': 61171, 'loss/train': 1.449235439300537} 11/07/2021 05:51:55 - INFO - __main__ - Step 61173: {'lr': 0.0003273955513262349, 'samples': 11745216, 'steps': 61172, 'loss/train': 2.3243517875671387} 11/07/2021 05:51:56 - INFO - __main__ - Step 61174: {'lr': 0.0003273905052705663, 'samples': 11745408, 'steps': 61173, 'loss/train': 1.156581997871399} 11/07/2021 05:51:56 - INFO - __main__ - Step 61175: {'lr': 0.0003273854591800267, 'samples': 11745600, 'steps': 61174, 'loss/train': 1.1363582611083984} 11/07/2021 05:51:57 - INFO - __main__ - Step 61176: {'lr': 0.00032738041305461845, 'samples': 11745792, 'steps': 61175, 'loss/train': 1.1491389274597168} 11/07/2021 05:51:57 - INFO - __main__ - Step 61177: {'lr': 0.00032737536689434377, 'samples': 11745984, 'steps': 61176, 'loss/train': 1.6266882419586182} 11/07/2021 05:51:57 - INFO - __main__ - Step 61178: {'lr': 0.00032737032069920494, 'samples': 11746176, 'steps': 61177, 'loss/train': 1.1691802740097046} 11/07/2021 05:51:58 - INFO - __main__ - Step 61179: {'lr': 0.0003273652744692042, 'samples': 11746368, 'steps': 61178, 'loss/train': 1.693306565284729} 11/07/2021 05:51:59 - INFO - __main__ - Step 61180: {'lr': 0.0003273602282043439, 'samples': 11746560, 'steps': 61179, 'loss/train': 1.6026204824447632} 11/07/2021 05:51:59 - INFO - __main__ - Step 61181: {'lr': 0.0003273551819046263, 'samples': 11746752, 'steps': 61180, 'loss/train': 1.6590989828109741} 11/07/2021 05:51:59 - INFO - __main__ - Step 61182: {'lr': 0.00032735013557005357, 'samples': 11746944, 'steps': 61181, 'loss/train': 1.4133268594741821} 11/07/2021 05:52:00 - INFO - __main__ - Step 61183: {'lr': 0.00032734508920062805, 'samples': 11747136, 'steps': 61182, 'loss/train': 1.3779863119125366} 11/07/2021 05:52:01 - INFO - __main__ - Step 61184: {'lr': 0.0003273400427963521, 'samples': 11747328, 'steps': 61183, 'loss/train': 1.4191943407058716} 11/07/2021 05:52:01 - INFO - __main__ - Step 61185: {'lr': 0.0003273349963572279, 'samples': 11747520, 'steps': 61184, 'loss/train': 1.3021948337554932} 11/07/2021 05:52:02 - INFO - __main__ - Step 61186: {'lr': 0.0003273299498832578, 'samples': 11747712, 'steps': 61185, 'loss/train': 0.6304476261138916} 11/07/2021 05:52:02 - INFO - __main__ - Step 61187: {'lr': 0.00032732490337444387, 'samples': 11747904, 'steps': 61186, 'loss/train': 1.7254788875579834} 11/07/2021 05:52:02 - INFO - __main__ - Step 61188: {'lr': 0.0003273198568307886, 'samples': 11748096, 'steps': 61187, 'loss/train': 1.10112726688385} 11/07/2021 05:52:04 - INFO - __main__ - Step 61189: {'lr': 0.0003273148102522943, 'samples': 11748288, 'steps': 61188, 'loss/train': 1.7098734378814697} 11/07/2021 05:52:04 - INFO - __main__ - Step 61190: {'lr': 0.00032730976363896296, 'samples': 11748480, 'steps': 61189, 'loss/train': 1.4145262241363525} 11/07/2021 05:52:05 - INFO - __main__ - Step 61191: {'lr': 0.00032730471699079724, 'samples': 11748672, 'steps': 61190, 'loss/train': 1.4901962280273438} 11/07/2021 05:52:05 - INFO - __main__ - Step 61192: {'lr': 0.00032729967030779904, 'samples': 11748864, 'steps': 61191, 'loss/train': 1.4331060647964478} 11/07/2021 05:52:05 - INFO - __main__ - Step 61193: {'lr': 0.00032729462358997084, 'samples': 11749056, 'steps': 61192, 'loss/train': 1.8083152770996094} 11/07/2021 05:52:06 - INFO - __main__ - Step 61194: {'lr': 0.0003272895768373149, 'samples': 11749248, 'steps': 61193, 'loss/train': 1.4056133031845093} 11/07/2021 05:52:07 - INFO - __main__ - Step 61195: {'lr': 0.0003272845300498335, 'samples': 11749440, 'steps': 61194, 'loss/train': 1.475929856300354} 11/07/2021 05:52:07 - INFO - __main__ - Step 61196: {'lr': 0.00032727948322752883, 'samples': 11749632, 'steps': 61195, 'loss/train': 1.6179981231689453} 11/07/2021 05:52:07 - INFO - __main__ - Step 61197: {'lr': 0.0003272744363704032, 'samples': 11749824, 'steps': 61196, 'loss/train': 1.1152403354644775} 11/07/2021 05:52:08 - INFO - __main__ - Step 61198: {'lr': 0.00032726938947845897, 'samples': 11750016, 'steps': 61197, 'loss/train': 1.4277780055999756} 11/07/2021 05:52:08 - INFO - __main__ - Step 61199: {'lr': 0.0003272643425516983, 'samples': 11750208, 'steps': 61198, 'loss/train': 1.1091493368148804} 11/07/2021 05:52:08 - INFO - __main__ - Step 61200: {'lr': 0.0003272592955901235, 'samples': 11750400, 'steps': 61199, 'loss/train': 1.1875914335250854} 11/07/2021 05:52:09 - INFO - __main__ - Step 61201: {'lr': 0.00032725424859373687, 'samples': 11750592, 'steps': 61200, 'loss/train': 1.7485542297363281} 11/07/2021 05:52:10 - INFO - __main__ - Step 61202: {'lr': 0.00032724920156254074, 'samples': 11750784, 'steps': 61201, 'loss/train': 1.4588688611984253} 11/07/2021 05:52:10 - INFO - __main__ - Step 61203: {'lr': 0.0003272441544965372, 'samples': 11750976, 'steps': 61202, 'loss/train': 1.943355917930603} 11/07/2021 05:52:11 - INFO - __main__ - Step 61204: {'lr': 0.0003272391073957287, 'samples': 11751168, 'steps': 61203, 'loss/train': 1.155621886253357} 11/07/2021 05:52:11 - INFO - __main__ - Step 61205: {'lr': 0.00032723406026011735, 'samples': 11751360, 'steps': 61204, 'loss/train': 0.9988946318626404} 11/07/2021 05:52:12 - INFO - __main__ - Step 61206: {'lr': 0.00032722901308970565, 'samples': 11751552, 'steps': 61205, 'loss/train': 1.7233763933181763} 11/07/2021 05:52:12 - INFO - __main__ - Step 61207: {'lr': 0.00032722396588449567, 'samples': 11751744, 'steps': 61206, 'loss/train': 1.3720651865005493} 11/07/2021 05:52:13 - INFO - __main__ - Step 61208: {'lr': 0.00032721891864448985, 'samples': 11751936, 'steps': 61207, 'loss/train': 2.1764490604400635} 11/07/2021 05:52:13 - INFO - __main__ - Step 61209: {'lr': 0.00032721387136969035, 'samples': 11752128, 'steps': 61208, 'loss/train': 1.4794026613235474} 11/07/2021 05:52:13 - INFO - __main__ - Step 61210: {'lr': 0.0003272088240600994, 'samples': 11752320, 'steps': 61209, 'loss/train': 1.6463526487350464} 11/07/2021 05:52:14 - INFO - __main__ - Step 61211: {'lr': 0.0003272037767157194, 'samples': 11752512, 'steps': 61210, 'loss/train': 1.53522527217865} 11/07/2021 05:52:15 - INFO - __main__ - Step 61212: {'lr': 0.00032719872933655253, 'samples': 11752704, 'steps': 61211, 'loss/train': 1.2181434631347656} 11/07/2021 05:52:15 - INFO - __main__ - Step 61213: {'lr': 0.0003271936819226011, 'samples': 11752896, 'steps': 61212, 'loss/train': 1.2257877588272095} 11/07/2021 05:52:15 - INFO - __main__ - Step 61214: {'lr': 0.00032718863447386745, 'samples': 11753088, 'steps': 61213, 'loss/train': 1.581288456916809} 11/07/2021 05:52:16 - INFO - __main__ - Step 61215: {'lr': 0.0003271835869903537, 'samples': 11753280, 'steps': 61214, 'loss/train': 1.6884077787399292} 11/07/2021 05:52:16 - INFO - __main__ - Step 61216: {'lr': 0.0003271785394720623, 'samples': 11753472, 'steps': 61215, 'loss/train': 1.1614298820495605} 11/07/2021 05:52:17 - INFO - __main__ - Step 61217: {'lr': 0.0003271734919189955, 'samples': 11753664, 'steps': 61216, 'loss/train': 1.5793207883834839} 11/07/2021 05:52:18 - INFO - __main__ - Step 61218: {'lr': 0.0003271684443311554, 'samples': 11753856, 'steps': 61217, 'loss/train': 1.4562331438064575} 11/07/2021 05:52:18 - INFO - __main__ - Step 61219: {'lr': 0.0003271633967085444, 'samples': 11754048, 'steps': 61218, 'loss/train': 1.5454373359680176} 11/07/2021 05:52:18 - INFO - __main__ - Step 61220: {'lr': 0.00032715834905116474, 'samples': 11754240, 'steps': 61219, 'loss/train': 1.4990508556365967} 11/07/2021 05:52:19 - INFO - __main__ - Step 61221: {'lr': 0.0003271533013590188, 'samples': 11754432, 'steps': 61220, 'loss/train': 1.5320392847061157} 11/07/2021 05:52:20 - INFO - __main__ - Step 61222: {'lr': 0.0003271482536321088, 'samples': 11754624, 'steps': 61221, 'loss/train': 1.5097732543945312} 11/07/2021 05:52:20 - INFO - __main__ - Step 61223: {'lr': 0.00032714320587043686, 'samples': 11754816, 'steps': 61222, 'loss/train': 1.6652705669403076} 11/07/2021 05:52:20 - INFO - __main__ - Step 61224: {'lr': 0.0003271381580740055, 'samples': 11755008, 'steps': 61223, 'loss/train': 0.8475200533866882} 11/07/2021 05:52:21 - INFO - __main__ - Step 61225: {'lr': 0.0003271331102428168, 'samples': 11755200, 'steps': 61224, 'loss/train': 1.3985636234283447} 11/07/2021 05:52:21 - INFO - __main__ - Step 61226: {'lr': 0.0003271280623768731, 'samples': 11755392, 'steps': 61225, 'loss/train': 1.3999381065368652} 11/07/2021 05:52:22 - INFO - __main__ - Step 61227: {'lr': 0.00032712301447617673, 'samples': 11755584, 'steps': 61226, 'loss/train': 1.2994848489761353} 11/07/2021 05:52:22 - INFO - __main__ - Step 61228: {'lr': 0.0003271179665407299, 'samples': 11755776, 'steps': 61227, 'loss/train': 1.5134246349334717} 11/07/2021 05:52:23 - INFO - __main__ - Step 61229: {'lr': 0.0003271129185705349, 'samples': 11755968, 'steps': 61228, 'loss/train': 0.8397372961044312} 11/07/2021 05:52:23 - INFO - __main__ - Step 61230: {'lr': 0.00032710787056559404, 'samples': 11756160, 'steps': 61229, 'loss/train': 1.2083983421325684} 11/07/2021 05:52:24 - INFO - __main__ - Step 61231: {'lr': 0.00032710282252590954, 'samples': 11756352, 'steps': 61230, 'loss/train': 1.106205701828003} 11/07/2021 05:52:25 - INFO - __main__ - Step 61232: {'lr': 0.00032709777445148367, 'samples': 11756544, 'steps': 61231, 'loss/train': 1.4570974111557007} 11/07/2021 05:52:25 - INFO - __main__ - Step 61233: {'lr': 0.0003270927263423188, 'samples': 11756736, 'steps': 61232, 'loss/train': 1.3369232416152954} 11/07/2021 05:52:26 - INFO - __main__ - Step 61234: {'lr': 0.0003270876781984171, 'samples': 11756928, 'steps': 61233, 'loss/train': 0.9349343180656433} 11/07/2021 05:52:26 - INFO - __main__ - Step 61235: {'lr': 0.0003270826300197809, 'samples': 11757120, 'steps': 61234, 'loss/train': 1.218192219734192} 11/07/2021 05:52:27 - INFO - __main__ - Step 61236: {'lr': 0.00032707758180641245, 'samples': 11757312, 'steps': 61235, 'loss/train': 1.5850309133529663} 11/07/2021 05:52:27 - INFO - __main__ - Step 61237: {'lr': 0.000327072533558314, 'samples': 11757504, 'steps': 61236, 'loss/train': 1.6114580631256104} 11/07/2021 05:52:27 - INFO - __main__ - Step 61238: {'lr': 0.00032706748527548793, 'samples': 11757696, 'steps': 61237, 'loss/train': 1.70370614528656} 11/07/2021 05:52:28 - INFO - __main__ - Step 61239: {'lr': 0.00032706243695793634, 'samples': 11757888, 'steps': 61238, 'loss/train': 1.106522798538208} 11/07/2021 05:52:29 - INFO - __main__ - Step 61240: {'lr': 0.00032705738860566166, 'samples': 11758080, 'steps': 61239, 'loss/train': 1.194445252418518} 11/07/2021 05:52:29 - INFO - __main__ - Step 61241: {'lr': 0.0003270523402186661, 'samples': 11758272, 'steps': 61240, 'loss/train': 1.3703862428665161} 11/07/2021 05:52:29 - INFO - __main__ - Step 61242: {'lr': 0.0003270472917969519, 'samples': 11758464, 'steps': 61241, 'loss/train': 1.450257658958435} 11/07/2021 05:52:30 - INFO - __main__ - Step 61243: {'lr': 0.0003270422433405215, 'samples': 11758656, 'steps': 61242, 'loss/train': 1.731853723526001} 11/07/2021 05:52:31 - INFO - __main__ - Step 61244: {'lr': 0.000327037194849377, 'samples': 11758848, 'steps': 61243, 'loss/train': 1.435577154159546} 11/07/2021 05:52:31 - INFO - __main__ - Step 61245: {'lr': 0.0003270321463235207, 'samples': 11759040, 'steps': 61244, 'loss/train': 1.1576025485992432} 11/07/2021 05:52:31 - INFO - __main__ - Step 61246: {'lr': 0.00032702709776295493, 'samples': 11759232, 'steps': 61245, 'loss/train': 1.6844067573547363} 11/07/2021 05:52:32 - INFO - __main__ - Step 61247: {'lr': 0.00032702204916768186, 'samples': 11759424, 'steps': 61246, 'loss/train': 1.4276758432388306} 11/07/2021 05:52:32 - INFO - __main__ - Step 61248: {'lr': 0.00032701700053770386, 'samples': 11759616, 'steps': 61247, 'loss/train': 1.45592200756073} 11/07/2021 05:52:33 - INFO - __main__ - Step 61249: {'lr': 0.00032701195187302337, 'samples': 11759808, 'steps': 61248, 'loss/train': 1.5428460836410522} 11/07/2021 05:52:33 - INFO - __main__ - Step 61250: {'lr': 0.0003270069031736423, 'samples': 11760000, 'steps': 61249, 'loss/train': 1.7563533782958984} 11/07/2021 05:52:34 - INFO - __main__ - Step 61251: {'lr': 0.00032700185443956315, 'samples': 11760192, 'steps': 61250, 'loss/train': 1.6011223793029785} 11/07/2021 05:52:34 - INFO - __main__ - Step 61252: {'lr': 0.00032699680567078814, 'samples': 11760384, 'steps': 61251, 'loss/train': 2.078875780105591} 11/07/2021 05:52:35 - INFO - __main__ - Step 61253: {'lr': 0.0003269917568673196, 'samples': 11760576, 'steps': 61252, 'loss/train': 1.274832010269165} 11/07/2021 05:52:36 - INFO - __main__ - Step 61254: {'lr': 0.0003269867080291597, 'samples': 11760768, 'steps': 61253, 'loss/train': 2.552259683609009} 11/07/2021 05:52:36 - INFO - __main__ - Step 61255: {'lr': 0.0003269816591563108, 'samples': 11760960, 'steps': 61254, 'loss/train': 1.6558302640914917} 11/07/2021 05:52:36 - INFO - __main__ - Step 61256: {'lr': 0.0003269766102487752, 'samples': 11761152, 'steps': 61255, 'loss/train': 1.571105718612671} 11/07/2021 05:52:37 - INFO - __main__ - Step 61257: {'lr': 0.00032697156130655507, 'samples': 11761344, 'steps': 61256, 'loss/train': 1.167958378791809} 11/07/2021 05:52:37 - INFO - __main__ - Step 61258: {'lr': 0.0003269665123296528, 'samples': 11761536, 'steps': 61257, 'loss/train': 1.34224534034729} 11/07/2021 05:52:37 - INFO - __main__ - Step 61259: {'lr': 0.0003269614633180705, 'samples': 11761728, 'steps': 61258, 'loss/train': 0.788581907749176} 11/07/2021 05:52:39 - INFO - __main__ - Step 61260: {'lr': 0.00032695641427181064, 'samples': 11761920, 'steps': 61259, 'loss/train': 1.3740125894546509} 11/07/2021 05:52:39 - INFO - __main__ - Step 61261: {'lr': 0.00032695136519087545, 'samples': 11762112, 'steps': 61260, 'loss/train': 1.601479411125183} 11/07/2021 05:52:39 - INFO - __main__ - Step 61262: {'lr': 0.00032694631607526703, 'samples': 11762304, 'steps': 61261, 'loss/train': 1.4173800945281982} 11/07/2021 05:52:40 - INFO - __main__ - Step 61263: {'lr': 0.00032694126692498794, 'samples': 11762496, 'steps': 61262, 'loss/train': 1.9521383047103882} 11/07/2021 05:52:40 - INFO - __main__ - Step 61264: {'lr': 0.00032693621774004025, 'samples': 11762688, 'steps': 61263, 'loss/train': 1.4073188304901123} 11/07/2021 05:52:41 - INFO - __main__ - Step 61265: {'lr': 0.0003269311685204262, 'samples': 11762880, 'steps': 61264, 'loss/train': 1.299781322479248} 11/07/2021 05:52:41 - INFO - __main__ - Step 61266: {'lr': 0.00032692611926614823, 'samples': 11763072, 'steps': 61265, 'loss/train': 1.4022568464279175} 11/07/2021 05:52:42 - INFO - __main__ - Step 61267: {'lr': 0.00032692106997720847, 'samples': 11763264, 'steps': 61266, 'loss/train': 1.3272984027862549} 11/07/2021 05:52:42 - INFO - __main__ - Step 61268: {'lr': 0.0003269160206536093, 'samples': 11763456, 'steps': 61267, 'loss/train': 2.150545597076416} 11/07/2021 05:52:42 - INFO - __main__ - Step 61269: {'lr': 0.0003269109712953531, 'samples': 11763648, 'steps': 61268, 'loss/train': 0.895738959312439} 11/07/2021 05:52:44 - INFO - __main__ - Step 61270: {'lr': 0.0003269059219024418, 'samples': 11763840, 'steps': 61269, 'loss/train': 1.5356467962265015} 11/07/2021 05:52:44 - INFO - __main__ - Step 61271: {'lr': 0.00032690087247487797, 'samples': 11764032, 'steps': 61270, 'loss/train': 1.8067290782928467} 11/07/2021 05:52:44 - INFO - __main__ - Step 61272: {'lr': 0.0003268958230126637, 'samples': 11764224, 'steps': 61271, 'loss/train': 0.5211678147315979} 11/07/2021 05:52:45 - INFO - __main__ - Step 61273: {'lr': 0.00032689077351580147, 'samples': 11764416, 'steps': 61272, 'loss/train': 1.1155176162719727} 11/07/2021 05:52:45 - INFO - __main__ - Step 61274: {'lr': 0.00032688572398429337, 'samples': 11764608, 'steps': 61273, 'loss/train': 1.4223790168762207} 11/07/2021 05:52:46 - INFO - __main__ - Step 61275: {'lr': 0.0003268806744181418, 'samples': 11764800, 'steps': 61274, 'loss/train': 2.14035701751709} 11/07/2021 05:52:46 - INFO - __main__ - Step 61276: {'lr': 0.0003268756248173491, 'samples': 11764992, 'steps': 61275, 'loss/train': 1.237484335899353} 11/07/2021 05:52:47 - INFO - __main__ - Step 61277: {'lr': 0.0003268705751819173, 'samples': 11765184, 'steps': 61276, 'loss/train': 1.824066162109375} 11/07/2021 05:52:47 - INFO - __main__ - Step 61278: {'lr': 0.00032686552551184874, 'samples': 11765376, 'steps': 61277, 'loss/train': 1.352165937423706} 11/07/2021 05:52:47 - INFO - __main__ - Step 61279: {'lr': 0.00032686047580714585, 'samples': 11765568, 'steps': 61278, 'loss/train': 1.2672141790390015} 11/07/2021 05:52:48 - INFO - __main__ - Step 61280: {'lr': 0.0003268554260678108, 'samples': 11765760, 'steps': 61279, 'loss/train': 1.5162378549575806} 11/07/2021 05:52:49 - INFO - __main__ - Step 61281: {'lr': 0.00032685037629384586, 'samples': 11765952, 'steps': 61280, 'loss/train': 1.2649943828582764} 11/07/2021 05:52:49 - INFO - __main__ - Step 61282: {'lr': 0.0003268453264852533, 'samples': 11766144, 'steps': 61281, 'loss/train': 0.998900830745697} 11/07/2021 05:52:50 - INFO - __main__ - Step 61283: {'lr': 0.0003268402766420355, 'samples': 11766336, 'steps': 61282, 'loss/train': 1.2809820175170898} 11/07/2021 05:52:50 - INFO - __main__ - Step 61284: {'lr': 0.00032683522676419465, 'samples': 11766528, 'steps': 61283, 'loss/train': 1.3413070440292358} 11/07/2021 05:52:50 - INFO - __main__ - Step 61285: {'lr': 0.000326830176851733, 'samples': 11766720, 'steps': 61284, 'loss/train': 1.6412798166275024} 11/07/2021 05:52:51 - INFO - __main__ - Step 61286: {'lr': 0.00032682512690465284, 'samples': 11766912, 'steps': 61285, 'loss/train': 1.5056488513946533} 11/07/2021 05:52:52 - INFO - __main__ - Step 61287: {'lr': 0.00032682007692295647, 'samples': 11767104, 'steps': 61286, 'loss/train': 1.862308382987976} 11/07/2021 05:52:52 - INFO - __main__ - Step 61288: {'lr': 0.0003268150269066462, 'samples': 11767296, 'steps': 61287, 'loss/train': 1.194726586341858} 11/07/2021 05:52:53 - INFO - __main__ - Step 61289: {'lr': 0.0003268099768557242, 'samples': 11767488, 'steps': 61288, 'loss/train': 1.5180402994155884} 11/07/2021 05:52:53 - INFO - __main__ - Step 61290: {'lr': 0.00032680492677019285, 'samples': 11767680, 'steps': 61289, 'loss/train': 1.4870150089263916} 11/07/2021 05:52:54 - INFO - __main__ - Step 61291: {'lr': 0.0003267998766500544, 'samples': 11767872, 'steps': 61290, 'loss/train': 1.5506267547607422} 11/07/2021 05:52:54 - INFO - __main__ - Step 61292: {'lr': 0.00032679482649531104, 'samples': 11768064, 'steps': 61291, 'loss/train': 1.7387115955352783} 11/07/2021 05:52:55 - INFO - __main__ - Step 61293: {'lr': 0.00032678977630596517, 'samples': 11768256, 'steps': 61292, 'loss/train': 1.728071928024292} 11/07/2021 05:52:55 - INFO - __main__ - Step 61294: {'lr': 0.00032678472608201905, 'samples': 11768448, 'steps': 61293, 'loss/train': 1.7021468877792358} 11/07/2021 05:52:56 - INFO - __main__ - Step 61295: {'lr': 0.00032677967582347484, 'samples': 11768640, 'steps': 61294, 'loss/train': 1.3504661321640015} 11/07/2021 05:52:56 - INFO - __main__ - Step 61296: {'lr': 0.000326774625530335, 'samples': 11768832, 'steps': 61295, 'loss/train': 1.2014474868774414} 11/07/2021 05:52:57 - INFO - __main__ - Step 61297: {'lr': 0.00032676957520260156, 'samples': 11769024, 'steps': 61296, 'loss/train': 0.7606809139251709} 11/07/2021 05:52:57 - INFO - __main__ - Step 61298: {'lr': 0.00032676452484027704, 'samples': 11769216, 'steps': 61297, 'loss/train': 1.4283581972122192} 11/07/2021 05:52:58 - INFO - __main__ - Step 61299: {'lr': 0.0003267594744433636, 'samples': 11769408, 'steps': 61298, 'loss/train': 1.353149175643921} 11/07/2021 05:52:58 - INFO - __main__ - Step 61300: {'lr': 0.00032675442401186344, 'samples': 11769600, 'steps': 61299, 'loss/train': 1.257101058959961} 11/07/2021 05:52:58 - INFO - __main__ - Step 61301: {'lr': 0.000326749373545779, 'samples': 11769792, 'steps': 61300, 'loss/train': 1.4521551132202148} 11/07/2021 05:52:59 - INFO - __main__ - Step 61302: {'lr': 0.00032674432304511243, 'samples': 11769984, 'steps': 61301, 'loss/train': 1.1812132596969604} 11/07/2021 05:53:00 - INFO - __main__ - Step 61303: {'lr': 0.0003267392725098661, 'samples': 11770176, 'steps': 61302, 'loss/train': 1.7228306531906128} 11/07/2021 05:53:00 - INFO - __main__ - Step 61304: {'lr': 0.0003267342219400422, 'samples': 11770368, 'steps': 61303, 'loss/train': 0.13332369923591614} 11/07/2021 05:53:01 - INFO - __main__ - Step 61305: {'lr': 0.00032672917133564304, 'samples': 11770560, 'steps': 61304, 'loss/train': 1.0790495872497559} 11/07/2021 05:53:01 - INFO - __main__ - Step 61306: {'lr': 0.00032672412069667094, 'samples': 11770752, 'steps': 61305, 'loss/train': 1.0943893194198608} 11/07/2021 05:53:02 - INFO - __main__ - Step 61307: {'lr': 0.00032671907002312814, 'samples': 11770944, 'steps': 61306, 'loss/train': 1.5806915760040283} 11/07/2021 05:53:02 - INFO - __main__ - Step 61308: {'lr': 0.0003267140193150169, 'samples': 11771136, 'steps': 61307, 'loss/train': 0.8717992305755615} 11/07/2021 05:53:03 - INFO - __main__ - Step 61309: {'lr': 0.0003267089685723395, 'samples': 11771328, 'steps': 61308, 'loss/train': 1.2429490089416504} 11/07/2021 05:53:03 - INFO - __main__ - Step 61310: {'lr': 0.00032670391779509824, 'samples': 11771520, 'steps': 61309, 'loss/train': 1.407184362411499} 11/07/2021 05:53:03 - INFO - __main__ - Step 61311: {'lr': 0.0003266988669832953, 'samples': 11771712, 'steps': 61310, 'loss/train': 1.5074046850204468} 11/07/2021 05:53:04 - INFO - __main__ - Step 61312: {'lr': 0.00032669381613693307, 'samples': 11771904, 'steps': 61311, 'loss/train': 1.880023717880249} 11/07/2021 05:53:05 - INFO - __main__ - Step 61313: {'lr': 0.00032668876525601383, 'samples': 11772096, 'steps': 61312, 'loss/train': 3.5788331031799316} 11/07/2021 05:53:05 - INFO - __main__ - Step 61314: {'lr': 0.00032668371434053977, 'samples': 11772288, 'steps': 61313, 'loss/train': 2.0937397480010986} 11/07/2021 05:53:06 - INFO - __main__ - Step 61315: {'lr': 0.00032667866339051326, 'samples': 11772480, 'steps': 61314, 'loss/train': 1.6478530168533325} 11/07/2021 05:53:06 - INFO - __main__ - Step 61316: {'lr': 0.0003266736124059365, 'samples': 11772672, 'steps': 61315, 'loss/train': 2.126032829284668} 11/07/2021 05:53:06 - INFO - __main__ - Step 61317: {'lr': 0.0003266685613868118, 'samples': 11772864, 'steps': 61316, 'loss/train': 1.5299557447433472} 11/07/2021 05:53:07 - INFO - __main__ - Step 61318: {'lr': 0.0003266635103331414, 'samples': 11773056, 'steps': 61317, 'loss/train': 1.5470257997512817} 11/07/2021 05:53:08 - INFO - __main__ - Step 61319: {'lr': 0.00032665845924492764, 'samples': 11773248, 'steps': 61318, 'loss/train': 1.5980430841445923} 11/07/2021 05:53:08 - INFO - __main__ - Step 61320: {'lr': 0.0003266534081221728, 'samples': 11773440, 'steps': 61319, 'loss/train': 2.0668272972106934} 11/07/2021 05:53:08 - INFO - __main__ - Step 61321: {'lr': 0.00032664835696487906, 'samples': 11773632, 'steps': 61320, 'loss/train': 1.7213462591171265} 11/07/2021 05:53:09 - INFO - __main__ - Step 61322: {'lr': 0.00032664330577304875, 'samples': 11773824, 'steps': 61321, 'loss/train': 1.7935516834259033} 11/07/2021 05:53:10 - INFO - __main__ - Step 61323: {'lr': 0.00032663825454668416, 'samples': 11774016, 'steps': 61322, 'loss/train': 1.3711440563201904} 11/07/2021 05:53:10 - INFO - __main__ - Step 61324: {'lr': 0.0003266332032857875, 'samples': 11774208, 'steps': 61323, 'loss/train': 0.9474714398384094} 11/07/2021 05:53:10 - INFO - __main__ - Step 61325: {'lr': 0.0003266281519903612, 'samples': 11774400, 'steps': 61324, 'loss/train': 1.6421631574630737} 11/07/2021 05:53:11 - INFO - __main__ - Step 61326: {'lr': 0.0003266231006604074, 'samples': 11774592, 'steps': 61325, 'loss/train': 1.2108572721481323} 11/07/2021 05:53:11 - INFO - __main__ - Step 61327: {'lr': 0.00032661804929592843, 'samples': 11774784, 'steps': 61326, 'loss/train': 1.6689257621765137} 11/07/2021 05:53:12 - INFO - __main__ - Step 61328: {'lr': 0.0003266129978969265, 'samples': 11774976, 'steps': 61327, 'loss/train': 1.4198055267333984} 11/07/2021 05:53:13 - INFO - __main__ - Step 61329: {'lr': 0.000326607946463404, 'samples': 11775168, 'steps': 61328, 'loss/train': 1.592890977859497} 11/07/2021 05:53:13 - INFO - __main__ - Step 61330: {'lr': 0.00032660289499536303, 'samples': 11775360, 'steps': 61329, 'loss/train': 1.4091731309890747} 11/07/2021 05:53:13 - INFO - __main__ - Step 61331: {'lr': 0.00032659784349280607, 'samples': 11775552, 'steps': 61330, 'loss/train': 1.4928501844406128} 11/07/2021 05:53:14 - INFO - __main__ - Step 61332: {'lr': 0.0003265927919557353, 'samples': 11775744, 'steps': 61331, 'loss/train': 1.2701103687286377} 11/07/2021 05:53:15 - INFO - __main__ - Step 61333: {'lr': 0.000326587740384153, 'samples': 11775936, 'steps': 61332, 'loss/train': 2.1046884059906006} 11/07/2021 05:53:15 - INFO - __main__ - Step 61334: {'lr': 0.0003265826887780614, 'samples': 11776128, 'steps': 61333, 'loss/train': 1.6680314540863037} 11/07/2021 05:53:16 - INFO - __main__ - Step 61335: {'lr': 0.00032657763713746284, 'samples': 11776320, 'steps': 61334, 'loss/train': 1.9651881456375122} 11/07/2021 05:53:16 - INFO - __main__ - Step 61336: {'lr': 0.0003265725854623596, 'samples': 11776512, 'steps': 61335, 'loss/train': 0.9767792224884033} 11/07/2021 05:53:16 - INFO - __main__ - Step 61337: {'lr': 0.00032656753375275396, 'samples': 11776704, 'steps': 61336, 'loss/train': 1.6291636228561401} 11/07/2021 05:53:17 - INFO - __main__ - Step 61338: {'lr': 0.00032656248200864813, 'samples': 11776896, 'steps': 61337, 'loss/train': 1.4194518327713013} 11/07/2021 05:53:18 - INFO - __main__ - Step 61339: {'lr': 0.0003265574302300444, 'samples': 11777088, 'steps': 61338, 'loss/train': 1.8283817768096924} 11/07/2021 05:53:18 - INFO - __main__ - Step 61340: {'lr': 0.0003265523784169451, 'samples': 11777280, 'steps': 61339, 'loss/train': 2.130239486694336} 11/07/2021 05:53:19 - INFO - __main__ - Step 61341: {'lr': 0.0003265473265693525, 'samples': 11777472, 'steps': 61340, 'loss/train': 1.5171703100204468} 11/07/2021 05:53:19 - INFO - __main__ - Step 61342: {'lr': 0.00032654227468726884, 'samples': 11777664, 'steps': 61341, 'loss/train': 1.7059507369995117} 11/07/2021 05:53:19 - INFO - __main__ - Step 61343: {'lr': 0.00032653722277069643, 'samples': 11777856, 'steps': 61342, 'loss/train': 0.06909545511007309} 11/07/2021 05:53:20 - INFO - __main__ - Step 61344: {'lr': 0.00032653217081963755, 'samples': 11778048, 'steps': 61343, 'loss/train': 1.583729863166809} 11/07/2021 05:53:21 - INFO - __main__ - Step 61345: {'lr': 0.0003265271188340944, 'samples': 11778240, 'steps': 61344, 'loss/train': 1.285346508026123} 11/07/2021 05:53:21 - INFO - __main__ - Step 61346: {'lr': 0.0003265220668140693, 'samples': 11778432, 'steps': 61345, 'loss/train': 1.3908618688583374} 11/07/2021 05:53:21 - INFO - __main__ - Step 61347: {'lr': 0.0003265170147595646, 'samples': 11778624, 'steps': 61346, 'loss/train': 1.443522572517395} 11/07/2021 05:53:22 - INFO - __main__ - Step 61348: {'lr': 0.00032651196267058244, 'samples': 11778816, 'steps': 61347, 'loss/train': 1.4675703048706055} 11/07/2021 05:53:23 - INFO - __main__ - Step 61349: {'lr': 0.00032650691054712523, 'samples': 11779008, 'steps': 61348, 'loss/train': 1.6273024082183838} 11/07/2021 05:53:23 - INFO - __main__ - Step 61350: {'lr': 0.00032650185838919516, 'samples': 11779200, 'steps': 61349, 'loss/train': 1.3863720893859863} 11/07/2021 05:53:24 - INFO - __main__ - Step 61351: {'lr': 0.00032649680619679456, 'samples': 11779392, 'steps': 61350, 'loss/train': 1.4186891317367554} 11/07/2021 05:53:24 - INFO - __main__ - Step 61352: {'lr': 0.00032649175396992565, 'samples': 11779584, 'steps': 61351, 'loss/train': 2.030136823654175} 11/07/2021 05:53:24 - INFO - __main__ - Step 61353: {'lr': 0.0003264867017085907, 'samples': 11779776, 'steps': 61352, 'loss/train': 2.2038733959198} 11/07/2021 05:53:25 - INFO - __main__ - Step 61354: {'lr': 0.0003264816494127921, 'samples': 11779968, 'steps': 61353, 'loss/train': 1.4440791606903076} 11/07/2021 05:53:26 - INFO - __main__ - Step 61355: {'lr': 0.000326476597082532, 'samples': 11780160, 'steps': 61354, 'loss/train': 1.4485981464385986} 11/07/2021 05:53:26 - INFO - __main__ - Step 61356: {'lr': 0.0003264715447178127, 'samples': 11780352, 'steps': 61355, 'loss/train': 1.655215859413147} 11/07/2021 05:53:26 - INFO - __main__ - Step 61357: {'lr': 0.0003264664923186366, 'samples': 11780544, 'steps': 61356, 'loss/train': 1.532602071762085} 11/07/2021 05:53:27 - INFO - __main__ - Step 61358: {'lr': 0.0003264614398850058, 'samples': 11780736, 'steps': 61357, 'loss/train': 1.313807487487793} 11/07/2021 05:53:27 - INFO - __main__ - Step 61359: {'lr': 0.0003264563874169227, 'samples': 11780928, 'steps': 61358, 'loss/train': 1.4197273254394531} 11/07/2021 05:53:28 - INFO - __main__ - Step 61360: {'lr': 0.00032645133491438947, 'samples': 11781120, 'steps': 61359, 'loss/train': 0.9482593536376953} 11/07/2021 05:53:28 - INFO - __main__ - Step 61361: {'lr': 0.0003264462823774085, 'samples': 11781312, 'steps': 61360, 'loss/train': 1.4741480350494385} 11/07/2021 05:53:29 - INFO - __main__ - Step 61362: {'lr': 0.000326441229805982, 'samples': 11781504, 'steps': 61361, 'loss/train': 1.3037137985229492} 11/07/2021 05:53:29 - INFO - __main__ - Step 61363: {'lr': 0.00032643617720011227, 'samples': 11781696, 'steps': 61362, 'loss/train': 1.6840412616729736} 11/07/2021 05:53:30 - INFO - __main__ - Step 61364: {'lr': 0.0003264311245598016, 'samples': 11781888, 'steps': 61363, 'loss/train': 0.8109203577041626} 11/07/2021 05:53:31 - INFO - __main__ - Step 61365: {'lr': 0.0003264260718850522, 'samples': 11782080, 'steps': 61364, 'loss/train': 1.4557477235794067} 11/07/2021 05:53:31 - INFO - __main__ - Step 61366: {'lr': 0.00032642101917586643, 'samples': 11782272, 'steps': 61365, 'loss/train': 1.5860857963562012} 11/07/2021 05:53:31 - INFO - __main__ - Step 61367: {'lr': 0.00032641596643224644, 'samples': 11782464, 'steps': 61366, 'loss/train': 0.9690595269203186} 11/07/2021 05:53:32 - INFO - __main__ - Step 61368: {'lr': 0.0003264109136541947, 'samples': 11782656, 'steps': 61367, 'loss/train': 1.6686527729034424} 11/07/2021 05:53:32 - INFO - __main__ - Step 61369: {'lr': 0.00032640586084171333, 'samples': 11782848, 'steps': 61368, 'loss/train': 1.5772621631622314} 11/07/2021 05:53:33 - INFO - __main__ - Step 61370: {'lr': 0.0003264008079948047, 'samples': 11783040, 'steps': 61369, 'loss/train': 1.616234540939331} 11/07/2021 05:53:33 - INFO - __main__ - Step 61371: {'lr': 0.000326395755113471, 'samples': 11783232, 'steps': 61370, 'loss/train': 1.021813154220581} 11/07/2021 05:53:34 - INFO - __main__ - Step 61372: {'lr': 0.00032639070219771455, 'samples': 11783424, 'steps': 61371, 'loss/train': 1.623144268989563} 11/07/2021 05:53:34 - INFO - __main__ - Step 61373: {'lr': 0.0003263856492475377, 'samples': 11783616, 'steps': 61372, 'loss/train': 1.2814526557922363} 11/07/2021 05:53:34 - INFO - __main__ - Step 61374: {'lr': 0.00032638059626294253, 'samples': 11783808, 'steps': 61373, 'loss/train': 1.4019120931625366} 11/07/2021 05:53:35 - INFO - __main__ - Step 61375: {'lr': 0.0003263755432439315, 'samples': 11784000, 'steps': 61374, 'loss/train': 1.9236347675323486} 11/07/2021 05:53:36 - INFO - __main__ - Step 61376: {'lr': 0.00032637049019050687, 'samples': 11784192, 'steps': 61375, 'loss/train': 1.5798259973526} 11/07/2021 05:53:36 - INFO - __main__ - Step 61377: {'lr': 0.00032636543710267085, 'samples': 11784384, 'steps': 61376, 'loss/train': 1.3039302825927734} 11/07/2021 05:53:36 - INFO - __main__ - Step 61378: {'lr': 0.00032636038398042573, 'samples': 11784576, 'steps': 61377, 'loss/train': 1.6459133625030518} 11/07/2021 05:53:37 - INFO - __main__ - Step 61379: {'lr': 0.0003263553308237738, 'samples': 11784768, 'steps': 61378, 'loss/train': 1.53175950050354} 11/07/2021 05:53:38 - INFO - __main__ - Step 61380: {'lr': 0.00032635027763271737, 'samples': 11784960, 'steps': 61379, 'loss/train': 0.29972559213638306} 11/07/2021 05:53:38 - INFO - __main__ - Step 61381: {'lr': 0.00032634522440725864, 'samples': 11785152, 'steps': 61380, 'loss/train': 1.397218108177185} 11/07/2021 05:53:39 - INFO - __main__ - Step 61382: {'lr': 0.00032634017114739996, 'samples': 11785344, 'steps': 61381, 'loss/train': 0.9051415324211121} 11/07/2021 05:53:39 - INFO - __main__ - Step 61383: {'lr': 0.0003263351178531435, 'samples': 11785536, 'steps': 61382, 'loss/train': 0.1861395537853241} 11/07/2021 05:53:39 - INFO - __main__ - Step 61384: {'lr': 0.00032633006452449176, 'samples': 11785728, 'steps': 61383, 'loss/train': 1.3627625703811646} 11/07/2021 05:53:40 - INFO - __main__ - Step 61385: {'lr': 0.00032632501116144674, 'samples': 11785920, 'steps': 61384, 'loss/train': 0.9604775905609131} 11/07/2021 05:53:41 - INFO - __main__ - Step 61386: {'lr': 0.0003263199577640109, 'samples': 11786112, 'steps': 61385, 'loss/train': 1.5813976526260376} 11/07/2021 05:53:41 - INFO - __main__ - Step 61387: {'lr': 0.00032631490433218647, 'samples': 11786304, 'steps': 61386, 'loss/train': 1.5147321224212646} 11/07/2021 05:53:41 - INFO - __main__ - Step 61388: {'lr': 0.0003263098508659757, 'samples': 11786496, 'steps': 61387, 'loss/train': 2.1994996070861816} 11/07/2021 05:53:42 - INFO - __main__ - Step 61389: {'lr': 0.0003263047973653809, 'samples': 11786688, 'steps': 61388, 'loss/train': 1.4807060956954956} 11/07/2021 05:53:42 - INFO - __main__ - Step 61390: {'lr': 0.0003262997438304044, 'samples': 11786880, 'steps': 61389, 'loss/train': 1.4167277812957764} 11/07/2021 05:53:43 - INFO - __main__ - Step 61391: {'lr': 0.0003262946902610483, 'samples': 11787072, 'steps': 61390, 'loss/train': 1.6013931035995483} 11/07/2021 05:53:43 - INFO - __main__ - Step 61392: {'lr': 0.00032628963665731504, 'samples': 11787264, 'steps': 61391, 'loss/train': 1.2740676403045654} 11/07/2021 05:53:44 - INFO - __main__ - Step 61393: {'lr': 0.00032628458301920684, 'samples': 11787456, 'steps': 61392, 'loss/train': 1.9979413747787476} 11/07/2021 05:53:44 - INFO - __main__ - Step 61394: {'lr': 0.000326279529346726, 'samples': 11787648, 'steps': 61393, 'loss/train': 1.6604276895523071} 11/07/2021 05:53:44 - INFO - __main__ - Step 61395: {'lr': 0.0003262744756398748, 'samples': 11787840, 'steps': 61394, 'loss/train': 1.6483005285263062} 11/07/2021 05:53:46 - INFO - __main__ - Step 61396: {'lr': 0.0003262694218986554, 'samples': 11788032, 'steps': 61395, 'loss/train': 1.5815706253051758} 11/07/2021 05:53:46 - INFO - __main__ - Step 61397: {'lr': 0.0003262643681230703, 'samples': 11788224, 'steps': 61396, 'loss/train': 0.931351363658905} 11/07/2021 05:53:46 - INFO - __main__ - Step 61398: {'lr': 0.0003262593143131216, 'samples': 11788416, 'steps': 61397, 'loss/train': 1.057442545890808} 11/07/2021 05:53:47 - INFO - __main__ - Step 61399: {'lr': 0.0003262542604688116, 'samples': 11788608, 'steps': 61398, 'loss/train': 1.248337984085083} 11/07/2021 05:53:47 - INFO - __main__ - Step 61400: {'lr': 0.00032624920659014264, 'samples': 11788800, 'steps': 61399, 'loss/train': 2.5246410369873047} 11/07/2021 05:53:48 - INFO - __main__ - Step 61401: {'lr': 0.00032624415267711694, 'samples': 11788992, 'steps': 61400, 'loss/train': 1.3805303573608398} 11/07/2021 05:53:48 - INFO - __main__ - Step 61402: {'lr': 0.00032623909872973677, 'samples': 11789184, 'steps': 61401, 'loss/train': 1.6853001117706299} 11/07/2021 05:53:49 - INFO - __main__ - Step 61403: {'lr': 0.00032623404474800457, 'samples': 11789376, 'steps': 61402, 'loss/train': 1.5147709846496582} 11/07/2021 05:53:49 - INFO - __main__ - Step 61404: {'lr': 0.0003262289907319224, 'samples': 11789568, 'steps': 61403, 'loss/train': 1.6801886558532715} 11/07/2021 05:53:49 - INFO - __main__ - Step 61405: {'lr': 0.0003262239366814926, 'samples': 11789760, 'steps': 61404, 'loss/train': 1.1277427673339844} 11/07/2021 05:53:50 - INFO - __main__ - Step 61406: {'lr': 0.0003262188825967175, 'samples': 11789952, 'steps': 61405, 'loss/train': 1.4626790285110474} 11/07/2021 05:53:51 - INFO - __main__ - Step 61407: {'lr': 0.00032621382847759935, 'samples': 11790144, 'steps': 61406, 'loss/train': 1.0720701217651367} 11/07/2021 05:53:51 - INFO - __main__ - Step 61408: {'lr': 0.00032620877432414043, 'samples': 11790336, 'steps': 61407, 'loss/train': 1.9682276248931885} 11/07/2021 05:53:52 - INFO - __main__ - Step 61409: {'lr': 0.000326203720136343, 'samples': 11790528, 'steps': 61408, 'loss/train': 1.5422592163085938} 11/07/2021 05:53:52 - INFO - __main__ - Step 61410: {'lr': 0.00032619866591420934, 'samples': 11790720, 'steps': 61409, 'loss/train': 1.351910948753357} 11/07/2021 05:53:53 - INFO - __main__ - Step 61411: {'lr': 0.0003261936116577418, 'samples': 11790912, 'steps': 61410, 'loss/train': 1.13876473903656} 11/07/2021 05:53:53 - INFO - __main__ - Step 61412: {'lr': 0.0003261885573669425, 'samples': 11791104, 'steps': 61411, 'loss/train': 1.2936224937438965} 11/07/2021 05:53:54 - INFO - __main__ - Step 61413: {'lr': 0.0003261835030418139, 'samples': 11791296, 'steps': 61412, 'loss/train': 1.5423035621643066} 11/07/2021 05:53:54 - INFO - __main__ - Step 61414: {'lr': 0.0003261784486823581, 'samples': 11791488, 'steps': 61413, 'loss/train': 1.3736850023269653} 11/07/2021 05:53:54 - INFO - __main__ - Step 61415: {'lr': 0.0003261733942885775, 'samples': 11791680, 'steps': 61414, 'loss/train': 0.3291564881801605} 11/07/2021 05:53:55 - INFO - __main__ - Step 61416: {'lr': 0.00032616833986047434, 'samples': 11791872, 'steps': 61415, 'loss/train': 1.8306245803833008} 11/07/2021 05:53:56 - INFO - __main__ - Step 61417: {'lr': 0.000326163285398051, 'samples': 11792064, 'steps': 61416, 'loss/train': 1.5760759115219116} 11/07/2021 05:53:56 - INFO - __main__ - Step 61418: {'lr': 0.0003261582309013095, 'samples': 11792256, 'steps': 61417, 'loss/train': 0.752622127532959} 11/07/2021 05:53:56 - INFO - __main__ - Step 61419: {'lr': 0.00032615317637025237, 'samples': 11792448, 'steps': 61418, 'loss/train': 1.216813087463379} 11/07/2021 05:53:57 - INFO - __main__ - Step 61420: {'lr': 0.00032614812180488173, 'samples': 11792640, 'steps': 61419, 'loss/train': 1.634286880493164} 11/07/2021 05:53:57 - INFO - __main__ - Step 61421: {'lr': 0.0003261430672052, 'samples': 11792832, 'steps': 61420, 'loss/train': 1.25361967086792} 11/07/2021 05:53:58 - INFO - __main__ - Step 61422: {'lr': 0.00032613801257120933, 'samples': 11793024, 'steps': 61421, 'loss/train': 1.5829486846923828} 11/07/2021 05:53:59 - INFO - __main__ - Step 61423: {'lr': 0.0003261329579029121, 'samples': 11793216, 'steps': 61422, 'loss/train': 0.9598292708396912} 11/07/2021 05:53:59 - INFO - __main__ - Step 61424: {'lr': 0.0003261279032003105, 'samples': 11793408, 'steps': 61423, 'loss/train': 1.51951265335083} 11/07/2021 05:53:59 - INFO - __main__ - Step 61425: {'lr': 0.0003261228484634068, 'samples': 11793600, 'steps': 61424, 'loss/train': 1.343214750289917} 11/07/2021 05:54:00 - INFO - __main__ - Step 61426: {'lr': 0.0003261177936922034, 'samples': 11793792, 'steps': 61425, 'loss/train': 1.4703766107559204} 11/07/2021 05:54:01 - INFO - __main__ - Step 61427: {'lr': 0.0003261127388867024, 'samples': 11793984, 'steps': 61426, 'loss/train': 1.2007951736450195} 11/07/2021 05:54:01 - INFO - __main__ - Step 61428: {'lr': 0.0003261076840469062, 'samples': 11794176, 'steps': 61427, 'loss/train': 1.2797375917434692} 11/07/2021 05:54:01 - INFO - __main__ - Step 61429: {'lr': 0.0003261026291728171, 'samples': 11794368, 'steps': 61428, 'loss/train': 1.701186180114746} 11/07/2021 05:54:02 - INFO - __main__ - Step 61430: {'lr': 0.0003260975742644373, 'samples': 11794560, 'steps': 61429, 'loss/train': 0.6782390475273132} 11/07/2021 05:54:02 - INFO - __main__ - Step 61431: {'lr': 0.0003260925193217692, 'samples': 11794752, 'steps': 61430, 'loss/train': 3.8179023265838623} 11/07/2021 05:54:03 - INFO - __main__ - Step 61432: {'lr': 0.00032608746434481485, 'samples': 11794944, 'steps': 61431, 'loss/train': 1.5631794929504395} 11/07/2021 05:54:04 - INFO - __main__ - Step 61433: {'lr': 0.0003260824093335767, 'samples': 11795136, 'steps': 61432, 'loss/train': 1.5096896886825562} 11/07/2021 05:54:04 - INFO - __main__ - Step 61434: {'lr': 0.00032607735428805704, 'samples': 11795328, 'steps': 61433, 'loss/train': 1.6055477857589722} 11/07/2021 05:54:04 - INFO - __main__ - Step 61435: {'lr': 0.00032607229920825806, 'samples': 11795520, 'steps': 61434, 'loss/train': 0.606212854385376} 11/07/2021 05:54:05 - INFO - __main__ - Step 61436: {'lr': 0.000326067244094182, 'samples': 11795712, 'steps': 61435, 'loss/train': 1.6238412857055664} 11/07/2021 05:54:05 - INFO - __main__ - Step 61437: {'lr': 0.0003260621889458314, 'samples': 11795904, 'steps': 61436, 'loss/train': 0.8438888788223267} 11/07/2021 05:54:06 - INFO - __main__ - Step 61438: {'lr': 0.00032605713376320823, 'samples': 11796096, 'steps': 61437, 'loss/train': 1.3050416707992554} 11/07/2021 05:54:06 - INFO - __main__ - Step 61439: {'lr': 0.00032605207854631487, 'samples': 11796288, 'steps': 61438, 'loss/train': 1.1117010116577148} 11/07/2021 05:54:07 - INFO - __main__ - Step 61440: {'lr': 0.00032604702329515367, 'samples': 11796480, 'steps': 61439, 'loss/train': 1.5093955993652344} 11/07/2021 05:54:07 - INFO - __main__ - Step 61441: {'lr': 0.0003260419680097268, 'samples': 11796672, 'steps': 61440, 'loss/train': 0.8973720669746399} 11/07/2021 05:54:07 - INFO - __main__ - Step 61442: {'lr': 0.0003260369126900366, 'samples': 11796864, 'steps': 61441, 'loss/train': 1.3227843046188354} 11/07/2021 05:54:08 - INFO - __main__ - Step 61443: {'lr': 0.0003260318573360854, 'samples': 11797056, 'steps': 61442, 'loss/train': 1.5139319896697998} 11/07/2021 05:54:09 - INFO - __main__ - Step 61444: {'lr': 0.00032602680194787544, 'samples': 11797248, 'steps': 61443, 'loss/train': 1.1931483745574951} 11/07/2021 05:54:09 - INFO - __main__ - Step 61445: {'lr': 0.0003260217465254089, 'samples': 11797440, 'steps': 61444, 'loss/train': 1.5017277002334595} 11/07/2021 05:54:09 - INFO - __main__ - Step 61446: {'lr': 0.00032601669106868816, 'samples': 11797632, 'steps': 61445, 'loss/train': 1.2555017471313477} 11/07/2021 05:54:10 - INFO - __main__ - Step 61447: {'lr': 0.0003260116355777154, 'samples': 11797824, 'steps': 61446, 'loss/train': 1.2493174076080322} 11/07/2021 05:54:11 - INFO - __main__ - Step 61448: {'lr': 0.00032600658005249307, 'samples': 11798016, 'steps': 61447, 'loss/train': 1.8332664966583252} 11/07/2021 05:54:11 - INFO - __main__ - Step 61449: {'lr': 0.00032600152449302337, 'samples': 11798208, 'steps': 61448, 'loss/train': 1.5016850233078003} 11/07/2021 05:54:12 - INFO - __main__ - Step 61450: {'lr': 0.00032599646889930843, 'samples': 11798400, 'steps': 61449, 'loss/train': 1.5256508588790894} 11/07/2021 05:54:12 - INFO - __main__ - Step 61451: {'lr': 0.0003259914132713507, 'samples': 11798592, 'steps': 61450, 'loss/train': 1.2444126605987549} 11/07/2021 05:54:12 - INFO - __main__ - Step 61452: {'lr': 0.00032598635760915253, 'samples': 11798784, 'steps': 61451, 'loss/train': 1.3724397420883179} 11/07/2021 05:54:13 - INFO - __main__ - Step 61453: {'lr': 0.0003259813019127159, 'samples': 11798976, 'steps': 61452, 'loss/train': 1.007129430770874} 11/07/2021 05:54:14 - INFO - __main__ - Step 61454: {'lr': 0.00032597624618204335, 'samples': 11799168, 'steps': 61453, 'loss/train': 1.470384955406189} 11/07/2021 05:54:14 - INFO - __main__ - Step 61455: {'lr': 0.0003259711904171372, 'samples': 11799360, 'steps': 61454, 'loss/train': 1.6323463916778564} 11/07/2021 05:54:14 - INFO - __main__ - Step 61456: {'lr': 0.00032596613461799944, 'samples': 11799552, 'steps': 61455, 'loss/train': 1.3535388708114624} 11/07/2021 05:54:15 - INFO - __main__ - Step 61457: {'lr': 0.00032596107878463256, 'samples': 11799744, 'steps': 61456, 'loss/train': 1.415780782699585} 11/07/2021 05:54:16 - INFO - __main__ - Step 61458: {'lr': 0.00032595602291703873, 'samples': 11799936, 'steps': 61457, 'loss/train': 0.9384996891021729} 11/07/2021 05:54:16 - INFO - __main__ - Step 61459: {'lr': 0.0003259509670152204, 'samples': 11800128, 'steps': 61458, 'loss/train': 1.2650721073150635} 11/07/2021 05:54:16 - INFO - __main__ - Step 61460: {'lr': 0.0003259459110791797, 'samples': 11800320, 'steps': 61459, 'loss/train': 1.453385591506958} 11/07/2021 05:54:17 - INFO - __main__ - Step 61461: {'lr': 0.00032594085510891894, 'samples': 11800512, 'steps': 61460, 'loss/train': 1.1821001768112183} 11/07/2021 05:54:17 - INFO - __main__ - Step 61462: {'lr': 0.0003259357991044404, 'samples': 11800704, 'steps': 61461, 'loss/train': 1.5935136079788208} 11/07/2021 05:54:18 - INFO - __main__ - Step 61463: {'lr': 0.00032593074306574635, 'samples': 11800896, 'steps': 61462, 'loss/train': 1.3866955041885376} 11/07/2021 05:54:18 - INFO - __main__ - Step 61464: {'lr': 0.00032592568699283905, 'samples': 11801088, 'steps': 61463, 'loss/train': 1.36967933177948} 11/07/2021 05:54:19 - INFO - __main__ - Step 61465: {'lr': 0.0003259206308857208, 'samples': 11801280, 'steps': 61464, 'loss/train': 0.6925641894340515} 11/07/2021 05:54:19 - INFO - __main__ - Step 61466: {'lr': 0.0003259155747443939, 'samples': 11801472, 'steps': 61465, 'loss/train': 1.294487476348877} 11/07/2021 05:54:20 - INFO - __main__ - Step 61467: {'lr': 0.00032591051856886065, 'samples': 11801664, 'steps': 61466, 'loss/train': 1.5789685249328613} 11/07/2021 05:54:20 - INFO - __main__ - Step 61468: {'lr': 0.00032590546235912335, 'samples': 11801856, 'steps': 61467, 'loss/train': 0.9701781868934631} 11/07/2021 05:54:21 - INFO - __main__ - Step 61469: {'lr': 0.0003259004061151841, 'samples': 11802048, 'steps': 61468, 'loss/train': 1.1676175594329834} 11/07/2021 05:54:21 - INFO - __main__ - Step 61470: {'lr': 0.00032589534983704533, 'samples': 11802240, 'steps': 61469, 'loss/train': 1.2782318592071533} 11/07/2021 05:54:22 - INFO - __main__ - Step 61471: {'lr': 0.0003258902935247093, 'samples': 11802432, 'steps': 61470, 'loss/train': 1.5714412927627563} 11/07/2021 05:54:22 - INFO - __main__ - Step 61472: {'lr': 0.0003258852371781783, 'samples': 11802624, 'steps': 61471, 'loss/train': 1.6010366678237915} 11/07/2021 05:54:23 - INFO - __main__ - Step 61473: {'lr': 0.0003258801807974545, 'samples': 11802816, 'steps': 61472, 'loss/train': 1.4471787214279175} 11/07/2021 05:54:23 - INFO - __main__ - Step 61474: {'lr': 0.00032587512438254034, 'samples': 11803008, 'steps': 61473, 'loss/train': 1.4005281925201416} 11/07/2021 05:54:24 - INFO - __main__ - Step 61475: {'lr': 0.000325870067933438, 'samples': 11803200, 'steps': 61474, 'loss/train': 1.5009253025054932} 11/07/2021 05:54:24 - INFO - __main__ - Step 61476: {'lr': 0.0003258650114501498, 'samples': 11803392, 'steps': 61475, 'loss/train': 1.1562955379486084} 11/07/2021 05:54:24 - INFO - __main__ - Step 61477: {'lr': 0.000325859954932678, 'samples': 11803584, 'steps': 61476, 'loss/train': 1.40598464012146} 11/07/2021 05:54:25 - INFO - __main__ - Step 61478: {'lr': 0.00032585489838102483, 'samples': 11803776, 'steps': 61477, 'loss/train': 1.3726388216018677} 11/07/2021 05:54:26 - INFO - __main__ - Step 61479: {'lr': 0.0003258498417951926, 'samples': 11803968, 'steps': 61478, 'loss/train': 1.1818774938583374} 11/07/2021 05:54:26 - INFO - __main__ - Step 61480: {'lr': 0.00032584478517518365, 'samples': 11804160, 'steps': 61479, 'loss/train': 1.4754964113235474} 11/07/2021 05:54:26 - INFO - __main__ - Step 61481: {'lr': 0.00032583972852100017, 'samples': 11804352, 'steps': 61480, 'loss/train': 1.3834129571914673} 11/07/2021 05:54:27 - INFO - __main__ - Step 61482: {'lr': 0.0003258346718326445, 'samples': 11804544, 'steps': 61481, 'loss/train': 1.4961880445480347} 11/07/2021 05:54:27 - INFO - __main__ - Step 61483: {'lr': 0.0003258296151101189, 'samples': 11804736, 'steps': 61482, 'loss/train': 1.222324013710022} 11/07/2021 05:54:28 - INFO - __main__ - Step 61484: {'lr': 0.0003258245583534256, 'samples': 11804928, 'steps': 61483, 'loss/train': 1.8639585971832275} 11/07/2021 05:54:29 - INFO - __main__ - Step 61485: {'lr': 0.00032581950156256707, 'samples': 11805120, 'steps': 61484, 'loss/train': 1.0577083826065063} 11/07/2021 05:54:29 - INFO - __main__ - Step 61486: {'lr': 0.0003258144447375453, 'samples': 11805312, 'steps': 61485, 'loss/train': 1.563603401184082} 11/07/2021 05:54:29 - INFO - __main__ - Step 61487: {'lr': 0.00032580938787836277, 'samples': 11805504, 'steps': 61486, 'loss/train': 1.353276014328003} 11/07/2021 05:54:30 - INFO - __main__ - Step 61488: {'lr': 0.0003258043309850217, 'samples': 11805696, 'steps': 61487, 'loss/train': 1.1624282598495483} 11/07/2021 05:54:31 - INFO - __main__ - Step 61489: {'lr': 0.0003257992740575243, 'samples': 11805888, 'steps': 61488, 'loss/train': 1.5955121517181396} 11/07/2021 05:54:31 - INFO - __main__ - Step 61490: {'lr': 0.000325794217095873, 'samples': 11806080, 'steps': 61489, 'loss/train': 2.2852206230163574} 11/07/2021 05:54:31 - INFO - __main__ - Step 61491: {'lr': 0.00032578916010006997, 'samples': 11806272, 'steps': 61490, 'loss/train': 1.775505542755127} 11/07/2021 05:54:32 - INFO - __main__ - Step 61492: {'lr': 0.0003257841030701175, 'samples': 11806464, 'steps': 61491, 'loss/train': 1.1599000692367554} 11/07/2021 05:54:32 - INFO - __main__ - Step 61493: {'lr': 0.0003257790460060179, 'samples': 11806656, 'steps': 61492, 'loss/train': 1.6968505382537842} 11/07/2021 05:54:33 - INFO - __main__ - Step 61494: {'lr': 0.0003257739889077734, 'samples': 11806848, 'steps': 61493, 'loss/train': 1.2472717761993408} 11/07/2021 05:54:33 - INFO - __main__ - Step 61495: {'lr': 0.0003257689317753863, 'samples': 11807040, 'steps': 61494, 'loss/train': 1.6707364320755005} 11/07/2021 05:54:34 - INFO - __main__ - Step 61496: {'lr': 0.00032576387460885893, 'samples': 11807232, 'steps': 61495, 'loss/train': 1.1498141288757324} 11/07/2021 05:54:34 - INFO - __main__ - Step 61497: {'lr': 0.00032575881740819353, 'samples': 11807424, 'steps': 61496, 'loss/train': 1.4076035022735596} 11/07/2021 05:54:34 - INFO - __main__ - Step 61498: {'lr': 0.00032575376017339236, 'samples': 11807616, 'steps': 61497, 'loss/train': 1.4537957906723022} 11/07/2021 05:54:36 - INFO - __main__ - Step 61499: {'lr': 0.00032574870290445773, 'samples': 11807808, 'steps': 61498, 'loss/train': 1.2137858867645264} 11/07/2021 05:54:36 - INFO - __main__ - Step 61500: {'lr': 0.0003257436456013919, 'samples': 11808000, 'steps': 61499, 'loss/train': 1.0287110805511475} 11/07/2021 05:54:37 - INFO - __main__ - Step 61501: {'lr': 0.0003257385882641971, 'samples': 11808192, 'steps': 61500, 'loss/train': 0.8828456997871399} 11/07/2021 05:54:37 - INFO - __main__ - Step 61502: {'lr': 0.0003257335308928757, 'samples': 11808384, 'steps': 61501, 'loss/train': 1.2600120306015015} 11/07/2021 05:54:37 - INFO - __main__ - Step 61503: {'lr': 0.00032572847348742994, 'samples': 11808576, 'steps': 61502, 'loss/train': 1.3331743478775024} 11/07/2021 05:54:38 - INFO - __main__ - Step 61504: {'lr': 0.0003257234160478621, 'samples': 11808768, 'steps': 61503, 'loss/train': 1.79777991771698} 11/07/2021 05:54:38 - INFO - __main__ - Step 61505: {'lr': 0.0003257183585741745, 'samples': 11808960, 'steps': 61504, 'loss/train': 1.487135410308838} 11/07/2021 05:54:39 - INFO - __main__ - Step 61506: {'lr': 0.0003257133010663693, 'samples': 11809152, 'steps': 61505, 'loss/train': 1.9170905351638794} 11/07/2021 05:54:39 - INFO - __main__ - Step 61507: {'lr': 0.0003257082435244489, 'samples': 11809344, 'steps': 61506, 'loss/train': 1.2994619607925415} 11/07/2021 05:54:40 - INFO - __main__ - Step 61508: {'lr': 0.0003257031859484155, 'samples': 11809536, 'steps': 61507, 'loss/train': 1.5882923603057861} 11/07/2021 05:54:40 - INFO - __main__ - Step 61509: {'lr': 0.00032569812833827146, 'samples': 11809728, 'steps': 61508, 'loss/train': 1.2078595161437988} 11/07/2021 05:54:40 - INFO - __main__ - Step 61510: {'lr': 0.000325693070694019, 'samples': 11809920, 'steps': 61509, 'loss/train': 0.8185383081436157} 11/07/2021 05:54:41 - INFO - __main__ - Step 61511: {'lr': 0.0003256880130156604, 'samples': 11810112, 'steps': 61510, 'loss/train': 1.5797677040100098} 11/07/2021 05:54:42 - INFO - __main__ - Step 61512: {'lr': 0.0003256829553031979, 'samples': 11810304, 'steps': 61511, 'loss/train': 1.4536304473876953} 11/07/2021 05:54:42 - INFO - __main__ - Step 61513: {'lr': 0.0003256778975566339, 'samples': 11810496, 'steps': 61512, 'loss/train': 1.3820080757141113} 11/07/2021 05:54:42 - INFO - __main__ - Step 61514: {'lr': 0.00032567283977597055, 'samples': 11810688, 'steps': 61513, 'loss/train': 0.8406897783279419} 11/07/2021 05:54:43 - INFO - __main__ - Step 61515: {'lr': 0.0003256677819612102, 'samples': 11810880, 'steps': 61514, 'loss/train': 1.4375447034835815} 11/07/2021 05:54:44 - INFO - __main__ - Step 61516: {'lr': 0.00032566272411235515, 'samples': 11811072, 'steps': 61515, 'loss/train': 0.883335292339325} 11/07/2021 05:54:44 - INFO - __main__ - Step 61517: {'lr': 0.0003256576662294076, 'samples': 11811264, 'steps': 61516, 'loss/train': 1.4180454015731812} 11/07/2021 05:54:45 - INFO - __main__ - Step 61518: {'lr': 0.00032565260831237, 'samples': 11811456, 'steps': 61517, 'loss/train': 1.5043305158615112} 11/07/2021 05:54:45 - INFO - __main__ - Step 61519: {'lr': 0.0003256475503612444, 'samples': 11811648, 'steps': 61518, 'loss/train': 1.3079458475112915} 11/07/2021 05:54:45 - INFO - __main__ - Step 61520: {'lr': 0.0003256424923760332, 'samples': 11811840, 'steps': 61519, 'loss/train': 1.5205203294754028} 11/07/2021 05:54:46 - INFO - __main__ - Step 61521: {'lr': 0.00032563743435673855, 'samples': 11812032, 'steps': 61520, 'loss/train': 1.2639096975326538} 11/07/2021 05:54:47 - INFO - __main__ - Step 61522: {'lr': 0.00032563237630336294, 'samples': 11812224, 'steps': 61521, 'loss/train': 1.8247222900390625} 11/07/2021 05:54:47 - INFO - __main__ - Step 61523: {'lr': 0.00032562731821590853, 'samples': 11812416, 'steps': 61522, 'loss/train': 1.3942615985870361} 11/07/2021 05:54:47 - INFO - __main__ - Step 61524: {'lr': 0.00032562226009437764, 'samples': 11812608, 'steps': 61523, 'loss/train': 1.3061853647232056} 11/07/2021 05:54:48 - INFO - __main__ - Step 61525: {'lr': 0.00032561720193877256, 'samples': 11812800, 'steps': 61524, 'loss/train': 1.480621337890625} 11/07/2021 05:54:48 - INFO - __main__ - Step 61526: {'lr': 0.0003256121437490955, 'samples': 11812992, 'steps': 61525, 'loss/train': 1.720937967300415} 11/07/2021 05:54:49 - INFO - __main__ - Step 61527: {'lr': 0.00032560708552534874, 'samples': 11813184, 'steps': 61526, 'loss/train': 0.9781678915023804} 11/07/2021 05:54:49 - INFO - __main__ - Step 61528: {'lr': 0.0003256020272675346, 'samples': 11813376, 'steps': 61527, 'loss/train': 1.1798361539840698} 11/07/2021 05:54:50 - INFO - __main__ - Step 61529: {'lr': 0.0003255969689756554, 'samples': 11813568, 'steps': 61528, 'loss/train': 1.3623031377792358} 11/07/2021 05:54:50 - INFO - __main__ - Step 61530: {'lr': 0.00032559191064971326, 'samples': 11813760, 'steps': 61529, 'loss/train': 1.8805787563323975} 11/07/2021 05:54:50 - INFO - __main__ - Step 61531: {'lr': 0.0003255868522897107, 'samples': 11813952, 'steps': 61530, 'loss/train': 1.69122314453125} 11/07/2021 05:54:51 - INFO - __main__ - Step 61532: {'lr': 0.0003255817938956498, 'samples': 11814144, 'steps': 61531, 'loss/train': 1.6923480033874512} 11/07/2021 05:54:52 - INFO - __main__ - Step 61533: {'lr': 0.00032557673546753296, 'samples': 11814336, 'steps': 61532, 'loss/train': 1.4964898824691772} 11/07/2021 05:54:52 - INFO - __main__ - Step 61534: {'lr': 0.0003255716770053624, 'samples': 11814528, 'steps': 61533, 'loss/train': 1.8765034675598145} 11/07/2021 05:54:53 - INFO - __main__ - Step 61535: {'lr': 0.0003255666185091404, 'samples': 11814720, 'steps': 61534, 'loss/train': 1.5332255363464355} 11/07/2021 05:54:53 - INFO - __main__ - Step 61536: {'lr': 0.0003255615599788692, 'samples': 11814912, 'steps': 61535, 'loss/train': 1.640250325202942} 11/07/2021 05:54:54 - INFO - __main__ - Step 61537: {'lr': 0.00032555650141455117, 'samples': 11815104, 'steps': 61536, 'loss/train': 1.510709524154663} 11/07/2021 05:54:54 - INFO - __main__ - Step 61538: {'lr': 0.0003255514428161886, 'samples': 11815296, 'steps': 61537, 'loss/train': 1.523738980293274} 11/07/2021 05:54:55 - INFO - __main__ - Step 61539: {'lr': 0.0003255463841837837, 'samples': 11815488, 'steps': 61538, 'loss/train': 1.4540036916732788} 11/07/2021 05:54:55 - INFO - __main__ - Step 61540: {'lr': 0.00032554132551733866, 'samples': 11815680, 'steps': 61539, 'loss/train': 1.4357317686080933} 11/07/2021 05:54:55 - INFO - __main__ - Step 61541: {'lr': 0.00032553626681685596, 'samples': 11815872, 'steps': 61540, 'loss/train': 0.7762305736541748} 11/07/2021 05:54:56 - INFO - __main__ - Step 61542: {'lr': 0.0003255312080823377, 'samples': 11816064, 'steps': 61541, 'loss/train': 1.3285143375396729} 11/07/2021 05:54:57 - INFO - __main__ - Step 61543: {'lr': 0.0003255261493137863, 'samples': 11816256, 'steps': 61542, 'loss/train': 1.5476784706115723} 11/07/2021 05:54:57 - INFO - __main__ - Step 61544: {'lr': 0.000325521090511204, 'samples': 11816448, 'steps': 61543, 'loss/train': 1.6470131874084473} 11/07/2021 05:54:57 - INFO - __main__ - Step 61545: {'lr': 0.0003255160316745931, 'samples': 11816640, 'steps': 61544, 'loss/train': 0.8971713185310364} 11/07/2021 05:54:58 - INFO - __main__ - Step 61546: {'lr': 0.00032551097280395576, 'samples': 11816832, 'steps': 61545, 'loss/train': 1.3870933055877686} 11/07/2021 05:54:59 - INFO - __main__ - Step 61547: {'lr': 0.00032550591389929437, 'samples': 11817024, 'steps': 61546, 'loss/train': 0.8673347234725952} 11/07/2021 05:54:59 - INFO - __main__ - Step 61548: {'lr': 0.0003255008549606111, 'samples': 11817216, 'steps': 61547, 'loss/train': 1.1631065607070923} 11/07/2021 05:54:59 - INFO - __main__ - Step 61549: {'lr': 0.0003254957959879084, 'samples': 11817408, 'steps': 61548, 'loss/train': 1.6665390729904175} 11/07/2021 05:55:00 - INFO - __main__ - Step 61550: {'lr': 0.0003254907369811885, 'samples': 11817600, 'steps': 61549, 'loss/train': 1.7706317901611328} 11/07/2021 05:55:00 - INFO - __main__ - Step 61551: {'lr': 0.00032548567794045354, 'samples': 11817792, 'steps': 61550, 'loss/train': 1.2463479042053223} 11/07/2021 05:55:01 - INFO - __main__ - Step 61552: {'lr': 0.000325480618865706, 'samples': 11817984, 'steps': 61551, 'loss/train': 1.4745067358016968} 11/07/2021 05:55:02 - INFO - __main__ - Step 61553: {'lr': 0.00032547555975694797, 'samples': 11818176, 'steps': 61552, 'loss/train': 2.213452100753784} 11/07/2021 05:55:02 - INFO - __main__ - Step 61554: {'lr': 0.0003254705006141818, 'samples': 11818368, 'steps': 61553, 'loss/train': 1.845282793045044} 11/07/2021 05:55:02 - INFO - __main__ - Step 61555: {'lr': 0.00032546544143740983, 'samples': 11818560, 'steps': 61554, 'loss/train': 1.5940386056900024} 11/07/2021 05:55:03 - INFO - __main__ - Step 61556: {'lr': 0.0003254603822266343, 'samples': 11818752, 'steps': 61555, 'loss/train': 1.3468687534332275} 11/07/2021 05:55:03 - INFO - __main__ - Step 61557: {'lr': 0.0003254553229818575, 'samples': 11818944, 'steps': 61556, 'loss/train': 1.8215885162353516} 11/07/2021 05:55:04 - INFO - __main__ - Step 61558: {'lr': 0.00032545026370308175, 'samples': 11819136, 'steps': 61557, 'loss/train': 0.31067413091659546} 11/07/2021 05:55:04 - INFO - __main__ - Step 61559: {'lr': 0.00032544520439030915, 'samples': 11819328, 'steps': 61558, 'loss/train': 1.6353020668029785} 11/07/2021 05:55:05 - INFO - __main__ - Step 61560: {'lr': 0.00032544014504354215, 'samples': 11819520, 'steps': 61559, 'loss/train': 1.081398844718933} 11/07/2021 05:55:05 - INFO - __main__ - Step 61561: {'lr': 0.000325435085662783, 'samples': 11819712, 'steps': 61560, 'loss/train': 1.3697625398635864} 11/07/2021 05:55:06 - INFO - __main__ - Step 61562: {'lr': 0.000325430026248034, 'samples': 11819904, 'steps': 61561, 'loss/train': 1.9082508087158203} 11/07/2021 05:55:07 - INFO - __main__ - Step 61563: {'lr': 0.00032542496679929735, 'samples': 11820096, 'steps': 61562, 'loss/train': 1.3935414552688599} 11/07/2021 05:55:07 - INFO - __main__ - Step 61564: {'lr': 0.00032541990731657536, 'samples': 11820288, 'steps': 61563, 'loss/train': 1.3700449466705322} 11/07/2021 05:55:07 - INFO - __main__ - Step 61565: {'lr': 0.00032541484779987034, 'samples': 11820480, 'steps': 61564, 'loss/train': 1.4314204454421997} 11/07/2021 05:55:08 - INFO - __main__ - Step 61566: {'lr': 0.00032540978824918454, 'samples': 11820672, 'steps': 61565, 'loss/train': 1.2623803615570068} 11/07/2021 05:55:08 - INFO - __main__ - Step 61567: {'lr': 0.0003254047286645203, 'samples': 11820864, 'steps': 61566, 'loss/train': 1.612963318824768} 11/07/2021 05:55:09 - INFO - __main__ - Step 61568: {'lr': 0.0003253996690458798, 'samples': 11821056, 'steps': 61567, 'loss/train': 1.5843113660812378} 11/07/2021 05:55:09 - INFO - __main__ - Step 61569: {'lr': 0.00032539460939326535, 'samples': 11821248, 'steps': 61568, 'loss/train': 1.2013710737228394} 11/07/2021 05:55:10 - INFO - __main__ - Step 61570: {'lr': 0.00032538954970667936, 'samples': 11821440, 'steps': 61569, 'loss/train': 1.9235506057739258} 11/07/2021 05:55:10 - INFO - __main__ - Step 61571: {'lr': 0.0003253844899861239, 'samples': 11821632, 'steps': 61570, 'loss/train': 0.2325337678194046} 11/07/2021 05:55:10 - INFO - __main__ - Step 61572: {'lr': 0.0003253794302316014, 'samples': 11821824, 'steps': 61571, 'loss/train': 1.5029551982879639} 11/07/2021 05:55:11 - INFO - __main__ - Step 61573: {'lr': 0.00032537437044311414, 'samples': 11822016, 'steps': 61572, 'loss/train': 1.816683053970337} 11/07/2021 05:55:12 - INFO - __main__ - Step 61574: {'lr': 0.0003253693106206643, 'samples': 11822208, 'steps': 61573, 'loss/train': 2.017103910446167} 11/07/2021 05:55:12 - INFO - __main__ - Step 61575: {'lr': 0.0003253642507642541, 'samples': 11822400, 'steps': 61574, 'loss/train': 1.1707075834274292} 11/07/2021 05:55:12 - INFO - __main__ - Step 61576: {'lr': 0.0003253591908738861, 'samples': 11822592, 'steps': 61575, 'loss/train': 1.2937357425689697} 11/07/2021 05:55:13 - INFO - __main__ - Step 61577: {'lr': 0.00032535413094956237, 'samples': 11822784, 'steps': 61576, 'loss/train': 0.8252028822898865} 11/07/2021 05:55:14 - INFO - __main__ - Step 61578: {'lr': 0.0003253490709912852, 'samples': 11822976, 'steps': 61577, 'loss/train': 1.6062746047973633} 11/07/2021 05:55:14 - INFO - __main__ - Step 61579: {'lr': 0.0003253440109990569, 'samples': 11823168, 'steps': 61578, 'loss/train': 1.2613424062728882} 11/07/2021 05:55:15 - INFO - __main__ - Step 61580: {'lr': 0.0003253389509728798, 'samples': 11823360, 'steps': 61579, 'loss/train': 1.6685161590576172} 11/07/2021 05:55:15 - INFO - __main__ - Step 61581: {'lr': 0.0003253338909127561, 'samples': 11823552, 'steps': 61580, 'loss/train': 1.4771311283111572} 11/07/2021 05:55:15 - INFO - __main__ - Step 61582: {'lr': 0.00032532883081868804, 'samples': 11823744, 'steps': 61581, 'loss/train': 1.8704391717910767} 11/07/2021 05:55:16 - INFO - __main__ - Step 61583: {'lr': 0.000325323770690678, 'samples': 11823936, 'steps': 61582, 'loss/train': 1.1638740301132202} 11/07/2021 05:55:17 - INFO - __main__ - Step 61584: {'lr': 0.00032531871052872836, 'samples': 11824128, 'steps': 61583, 'loss/train': 0.2791132628917694} 11/07/2021 05:55:17 - INFO - __main__ - Step 61585: {'lr': 0.00032531365033284116, 'samples': 11824320, 'steps': 61584, 'loss/train': 1.348402738571167} 11/07/2021 05:55:18 - INFO - __main__ - Step 61586: {'lr': 0.0003253085901030188, 'samples': 11824512, 'steps': 61585, 'loss/train': 1.7301862239837646} 11/07/2021 05:55:18 - INFO - __main__ - Step 61587: {'lr': 0.0003253035298392636, 'samples': 11824704, 'steps': 61586, 'loss/train': 1.3266973495483398} 11/07/2021 05:55:18 - INFO - __main__ - Step 61588: {'lr': 0.0003252984695415777, 'samples': 11824896, 'steps': 61587, 'loss/train': 1.5498160123825073} 11/07/2021 05:55:19 - INFO - __main__ - Step 61589: {'lr': 0.0003252934092099636, 'samples': 11825088, 'steps': 61588, 'loss/train': 1.5619803667068481} 11/07/2021 05:55:20 - INFO - __main__ - Step 61590: {'lr': 0.00032528834884442337, 'samples': 11825280, 'steps': 61589, 'loss/train': 1.6173477172851562} 11/07/2021 05:55:20 - INFO - __main__ - Step 61591: {'lr': 0.0003252832884449594, 'samples': 11825472, 'steps': 61590, 'loss/train': 1.4104039669036865} 11/07/2021 05:55:21 - INFO - __main__ - Step 61592: {'lr': 0.00032527822801157384, 'samples': 11825664, 'steps': 61591, 'loss/train': 4.624558448791504} 11/07/2021 05:55:21 - INFO - __main__ - Step 61593: {'lr': 0.00032527316754426915, 'samples': 11825856, 'steps': 61592, 'loss/train': 1.5679292678833008} 11/07/2021 05:55:21 - INFO - __main__ - Step 61594: {'lr': 0.0003252681070430476, 'samples': 11826048, 'steps': 61593, 'loss/train': 1.4815891981124878} 11/07/2021 05:55:23 - INFO - __main__ - Step 61595: {'lr': 0.00032526304650791135, 'samples': 11826240, 'steps': 61594, 'loss/train': 1.5397474765777588} 11/07/2021 05:55:24 - INFO - __main__ - Step 61596: {'lr': 0.0003252579859388627, 'samples': 11826432, 'steps': 61595, 'loss/train': 1.2073768377304077} 11/07/2021 05:55:24 - INFO - __main__ - Step 61597: {'lr': 0.000325252925335904, 'samples': 11826624, 'steps': 61596, 'loss/train': 1.8232322931289673} 11/07/2021 05:55:24 - INFO - __main__ - Step 61598: {'lr': 0.00032524786469903744, 'samples': 11826816, 'steps': 61597, 'loss/train': 0.18981899321079254} 11/07/2021 05:55:25 - INFO - __main__ - Step 61599: {'lr': 0.0003252428040282654, 'samples': 11827008, 'steps': 61598, 'loss/train': 0.9941990971565247} 11/07/2021 05:55:25 - INFO - __main__ - Step 61600: {'lr': 0.00032523774332359016, 'samples': 11827200, 'steps': 61599, 'loss/train': 2.028055429458618} 11/07/2021 05:55:25 - INFO - __main__ - Step 61601: {'lr': 0.00032523268258501385, 'samples': 11827392, 'steps': 61600, 'loss/train': 1.9085824489593506} 11/07/2021 05:55:26 - INFO - __main__ - Step 61602: {'lr': 0.0003252276218125389, 'samples': 11827584, 'steps': 61601, 'loss/train': 1.8336129188537598} 11/07/2021 05:55:27 - INFO - __main__ - Step 61603: {'lr': 0.00032522256100616753, 'samples': 11827776, 'steps': 61602, 'loss/train': 1.8297333717346191} 11/07/2021 05:55:27 - INFO - __main__ - Step 61604: {'lr': 0.00032521750016590206, 'samples': 11827968, 'steps': 61603, 'loss/train': 1.012547492980957} 11/07/2021 05:55:27 - INFO - __main__ - Step 61605: {'lr': 0.0003252124392917447, 'samples': 11828160, 'steps': 61604, 'loss/train': 1.8527491092681885} 11/07/2021 05:55:28 - INFO - __main__ - Step 61606: {'lr': 0.00032520737838369785, 'samples': 11828352, 'steps': 61605, 'loss/train': 1.4119595289230347} 11/07/2021 05:55:28 - INFO - __main__ - Step 61607: {'lr': 0.0003252023174417637, 'samples': 11828544, 'steps': 61606, 'loss/train': 1.42026948928833} 11/07/2021 05:55:29 - INFO - __main__ - Step 61608: {'lr': 0.0003251972564659445, 'samples': 11828736, 'steps': 61607, 'loss/train': 0.9984369874000549} 11/07/2021 05:55:30 - INFO - __main__ - Step 61609: {'lr': 0.0003251921954562426, 'samples': 11828928, 'steps': 61608, 'loss/train': 1.859770655632019} 11/07/2021 05:55:30 - INFO - __main__ - Step 61610: {'lr': 0.00032518713441266026, 'samples': 11829120, 'steps': 61609, 'loss/train': 1.1963284015655518} 11/07/2021 05:55:30 - INFO - __main__ - Step 61611: {'lr': 0.0003251820733351997, 'samples': 11829312, 'steps': 61610, 'loss/train': 1.397135853767395} 11/07/2021 05:55:31 - INFO - __main__ - Step 61612: {'lr': 0.0003251770122238634, 'samples': 11829504, 'steps': 61611, 'loss/train': 1.4002445936203003} 11/07/2021 05:55:32 - INFO - __main__ - Step 61613: {'lr': 0.0003251719510786534, 'samples': 11829696, 'steps': 61612, 'loss/train': 1.9908074140548706} 11/07/2021 05:55:32 - INFO - __main__ - Step 61614: {'lr': 0.0003251668898995721, 'samples': 11829888, 'steps': 61613, 'loss/train': 2.5899808406829834} 11/07/2021 05:55:32 - INFO - __main__ - Step 61615: {'lr': 0.0003251618286866217, 'samples': 11830080, 'steps': 61614, 'loss/train': 1.749487042427063} 11/07/2021 05:55:33 - INFO - __main__ - Step 61616: {'lr': 0.0003251567674398046, 'samples': 11830272, 'steps': 61615, 'loss/train': 1.4478895664215088} 11/07/2021 05:55:33 - INFO - __main__ - Step 61617: {'lr': 0.00032515170615912296, 'samples': 11830464, 'steps': 61616, 'loss/train': 1.4570165872573853} 11/07/2021 05:55:34 - INFO - __main__ - Step 61618: {'lr': 0.00032514664484457916, 'samples': 11830656, 'steps': 61617, 'loss/train': 1.593237042427063} 11/07/2021 05:55:35 - INFO - __main__ - Step 61619: {'lr': 0.0003251415834961755, 'samples': 11830848, 'steps': 61618, 'loss/train': 1.2972723245620728} 11/07/2021 05:55:35 - INFO - __main__ - Step 61620: {'lr': 0.0003251365221139142, 'samples': 11831040, 'steps': 61619, 'loss/train': 1.3628705739974976} 11/07/2021 05:55:35 - INFO - __main__ - Step 61621: {'lr': 0.0003251314606977975, 'samples': 11831232, 'steps': 61620, 'loss/train': 1.630635142326355} 11/07/2021 05:55:36 - INFO - __main__ - Step 61622: {'lr': 0.0003251263992478277, 'samples': 11831424, 'steps': 61621, 'loss/train': 1.096143126487732} 11/07/2021 05:55:36 - INFO - __main__ - Step 61623: {'lr': 0.0003251213377640071, 'samples': 11831616, 'steps': 61622, 'loss/train': 1.6300262212753296} 11/07/2021 05:55:37 - INFO - __main__ - Step 61624: {'lr': 0.000325116276246338, 'samples': 11831808, 'steps': 61623, 'loss/train': 0.37938064336776733} 11/07/2021 05:55:37 - INFO - __main__ - Step 61625: {'lr': 0.00032511121469482263, 'samples': 11832000, 'steps': 61624, 'loss/train': 1.813296914100647} 11/07/2021 05:55:38 - INFO - __main__ - Step 61626: {'lr': 0.0003251061531094634, 'samples': 11832192, 'steps': 61625, 'loss/train': 1.3614808320999146} 11/07/2021 05:55:38 - INFO - __main__ - Step 61627: {'lr': 0.00032510109149026247, 'samples': 11832384, 'steps': 61626, 'loss/train': 1.440901279449463} 11/07/2021 05:55:38 - INFO - __main__ - Step 61628: {'lr': 0.0003250960298372221, 'samples': 11832576, 'steps': 61627, 'loss/train': 1.7482993602752686} 11/07/2021 05:55:39 - INFO - __main__ - Step 61629: {'lr': 0.0003250909681503446, 'samples': 11832768, 'steps': 61628, 'loss/train': 1.3079259395599365} 11/07/2021 05:55:40 - INFO - __main__ - Step 61630: {'lr': 0.00032508590642963233, 'samples': 11832960, 'steps': 61629, 'loss/train': 0.5356094241142273} 11/07/2021 05:55:40 - INFO - __main__ - Step 61631: {'lr': 0.00032508084467508747, 'samples': 11833152, 'steps': 61630, 'loss/train': 1.718245029449463} 11/07/2021 05:55:40 - INFO - __main__ - Step 61632: {'lr': 0.0003250757828867124, 'samples': 11833344, 'steps': 61631, 'loss/train': 1.2456375360488892} 11/07/2021 05:55:41 - INFO - __main__ - Step 61633: {'lr': 0.0003250707210645093, 'samples': 11833536, 'steps': 61632, 'loss/train': 1.7365440130233765} 11/07/2021 05:55:42 - INFO - __main__ - Step 61634: {'lr': 0.0003250656592084805, 'samples': 11833728, 'steps': 61633, 'loss/train': 1.8623408079147339} 11/07/2021 05:55:42 - INFO - __main__ - Step 61635: {'lr': 0.00032506059731862827, 'samples': 11833920, 'steps': 61634, 'loss/train': 1.1969873905181885} 11/07/2021 05:55:43 - INFO - __main__ - Step 61636: {'lr': 0.0003250555353949548, 'samples': 11834112, 'steps': 61635, 'loss/train': 1.5889657735824585} 11/07/2021 05:55:43 - INFO - __main__ - Step 61637: {'lr': 0.0003250504734374626, 'samples': 11834304, 'steps': 61636, 'loss/train': 1.2956634759902954} 11/07/2021 05:55:43 - INFO - __main__ - Step 61638: {'lr': 0.0003250454114461537, 'samples': 11834496, 'steps': 61637, 'loss/train': 1.2128545045852661} 11/07/2021 05:55:44 - INFO - __main__ - Step 61639: {'lr': 0.0003250403494210306, 'samples': 11834688, 'steps': 61638, 'loss/train': 1.7697961330413818} 11/07/2021 05:55:45 - INFO - __main__ - Step 61640: {'lr': 0.00032503528736209543, 'samples': 11834880, 'steps': 61639, 'loss/train': 1.3518706560134888} 11/07/2021 05:55:45 - INFO - __main__ - Step 61641: {'lr': 0.00032503022526935056, 'samples': 11835072, 'steps': 61640, 'loss/train': 1.5176104307174683} 11/07/2021 05:55:45 - INFO - __main__ - Step 61642: {'lr': 0.00032502516314279815, 'samples': 11835264, 'steps': 61641, 'loss/train': 1.5259063243865967} 11/07/2021 05:55:46 - INFO - __main__ - Step 61643: {'lr': 0.0003250201009824406, 'samples': 11835456, 'steps': 61642, 'loss/train': 1.5473954677581787} 11/07/2021 05:55:47 - INFO - __main__ - Step 61644: {'lr': 0.00032501503878828016, 'samples': 11835648, 'steps': 61643, 'loss/train': 1.4872665405273438} 11/07/2021 05:55:47 - INFO - __main__ - Step 61645: {'lr': 0.00032500997656031907, 'samples': 11835840, 'steps': 61644, 'loss/train': 1.3022844791412354} 11/07/2021 05:55:47 - INFO - __main__ - Step 61646: {'lr': 0.0003250049142985597, 'samples': 11836032, 'steps': 61645, 'loss/train': 1.4299184083938599} 11/07/2021 05:55:48 - INFO - __main__ - Step 61647: {'lr': 0.0003249998520030042, 'samples': 11836224, 'steps': 61646, 'loss/train': 1.0759062767028809} 11/07/2021 05:55:48 - INFO - __main__ - Step 61648: {'lr': 0.00032499478967365497, 'samples': 11836416, 'steps': 61647, 'loss/train': 1.485964059829712} 11/07/2021 05:55:48 - INFO - __main__ - Step 61649: {'lr': 0.00032498972731051425, 'samples': 11836608, 'steps': 61648, 'loss/train': 1.0703984498977661} 11/07/2021 05:55:49 - INFO - __main__ - Step 61650: {'lr': 0.00032498466491358427, 'samples': 11836800, 'steps': 61649, 'loss/train': 1.0376168489456177} 11/07/2021 05:55:50 - INFO - __main__ - Step 61651: {'lr': 0.0003249796024828674, 'samples': 11836992, 'steps': 61650, 'loss/train': 1.5688486099243164} 11/07/2021 05:55:50 - INFO - __main__ - Step 61652: {'lr': 0.00032497454001836586, 'samples': 11837184, 'steps': 61651, 'loss/train': 1.5336010456085205} 11/07/2021 05:55:50 - INFO - __main__ - Step 61653: {'lr': 0.000324969477520082, 'samples': 11837376, 'steps': 61652, 'loss/train': 1.4611262083053589} 11/07/2021 05:55:51 - INFO - __main__ - Step 61654: {'lr': 0.000324964414988018, 'samples': 11837568, 'steps': 61653, 'loss/train': 1.6185957193374634} 11/07/2021 05:55:52 - INFO - __main__ - Step 61655: {'lr': 0.0003249593524221762, 'samples': 11837760, 'steps': 61654, 'loss/train': 1.7738850116729736} 11/07/2021 05:55:52 - INFO - __main__ - Step 61656: {'lr': 0.0003249542898225588, 'samples': 11837952, 'steps': 61655, 'loss/train': 1.4637606143951416} 11/07/2021 05:55:53 - INFO - __main__ - Step 61657: {'lr': 0.00032494922718916824, 'samples': 11838144, 'steps': 61656, 'loss/train': 1.2157018184661865} 11/07/2021 05:55:53 - INFO - __main__ - Step 61658: {'lr': 0.0003249441645220067, 'samples': 11838336, 'steps': 61657, 'loss/train': 1.577221155166626} 11/07/2021 05:55:53 - INFO - __main__ - Step 61659: {'lr': 0.0003249391018210765, 'samples': 11838528, 'steps': 61658, 'loss/train': 1.3288838863372803} 11/07/2021 05:55:54 - INFO - __main__ - Step 61660: {'lr': 0.0003249340390863799, 'samples': 11838720, 'steps': 61659, 'loss/train': 1.0145045518875122} 11/07/2021 05:55:55 - INFO - __main__ - Step 61661: {'lr': 0.00032492897631791913, 'samples': 11838912, 'steps': 61660, 'loss/train': 1.6778185367584229} 11/07/2021 05:55:55 - INFO - __main__ - Step 61662: {'lr': 0.0003249239135156965, 'samples': 11839104, 'steps': 61661, 'loss/train': 1.5032259225845337} 11/07/2021 05:55:55 - INFO - __main__ - Step 61663: {'lr': 0.0003249188506797144, 'samples': 11839296, 'steps': 61662, 'loss/train': 1.0572091341018677} 11/07/2021 05:55:56 - INFO - __main__ - Step 61664: {'lr': 0.00032491378780997494, 'samples': 11839488, 'steps': 61663, 'loss/train': 1.4259939193725586} 11/07/2021 05:55:57 - INFO - __main__ - Step 61665: {'lr': 0.0003249087249064805, 'samples': 11839680, 'steps': 61664, 'loss/train': 1.1903914213180542} 11/07/2021 05:55:57 - INFO - __main__ - Step 61666: {'lr': 0.00032490366196923336, 'samples': 11839872, 'steps': 61665, 'loss/train': 1.4405509233474731} 11/07/2021 05:55:58 - INFO - __main__ - Step 61667: {'lr': 0.00032489859899823584, 'samples': 11840064, 'steps': 61666, 'loss/train': 1.5418877601623535} 11/07/2021 05:55:58 - INFO - __main__ - Step 61668: {'lr': 0.0003248935359934901, 'samples': 11840256, 'steps': 61667, 'loss/train': 1.566874623298645} 11/07/2021 05:55:58 - INFO - __main__ - Step 61669: {'lr': 0.00032488847295499847, 'samples': 11840448, 'steps': 61668, 'loss/train': 1.4466632604599} 11/07/2021 05:55:59 - INFO - __main__ - Step 61670: {'lr': 0.0003248834098827633, 'samples': 11840640, 'steps': 61669, 'loss/train': 1.3673452138900757} 11/07/2021 05:56:00 - INFO - __main__ - Step 61671: {'lr': 0.0003248783467767867, 'samples': 11840832, 'steps': 61670, 'loss/train': 1.7298191785812378} 11/07/2021 05:56:00 - INFO - __main__ - Step 61672: {'lr': 0.00032487328363707123, 'samples': 11841024, 'steps': 61671, 'loss/train': 1.2929913997650146} 11/07/2021 05:56:00 - INFO - __main__ - Step 61673: {'lr': 0.00032486822046361895, 'samples': 11841216, 'steps': 61672, 'loss/train': 1.81553053855896} 11/07/2021 05:56:01 - INFO - __main__ - Step 61674: {'lr': 0.0003248631572564322, 'samples': 11841408, 'steps': 61673, 'loss/train': 1.8455238342285156} 11/07/2021 05:56:02 - INFO - __main__ - Step 61675: {'lr': 0.0003248580940155133, 'samples': 11841600, 'steps': 61674, 'loss/train': 1.362914800643921} 11/07/2021 05:56:02 - INFO - __main__ - Step 61676: {'lr': 0.0003248530307408645, 'samples': 11841792, 'steps': 61675, 'loss/train': 0.7790258526802063} 11/07/2021 05:56:02 - INFO - __main__ - Step 61677: {'lr': 0.00032484796743248803, 'samples': 11841984, 'steps': 61676, 'loss/train': 0.9085867404937744} 11/07/2021 05:56:03 - INFO - __main__ - Step 61678: {'lr': 0.00032484290409038626, 'samples': 11842176, 'steps': 61677, 'loss/train': 1.602375864982605} 11/07/2021 05:56:03 - INFO - __main__ - Step 61679: {'lr': 0.00032483784071456146, 'samples': 11842368, 'steps': 61678, 'loss/train': 1.037764549255371} 11/07/2021 05:56:04 - INFO - __main__ - Step 61680: {'lr': 0.0003248327773050158, 'samples': 11842560, 'steps': 61679, 'loss/train': 1.5360796451568604} 11/07/2021 05:56:05 - INFO - __main__ - Step 61681: {'lr': 0.0003248277138617517, 'samples': 11842752, 'steps': 61680, 'loss/train': 1.2887235879898071} 11/07/2021 05:56:05 - INFO - __main__ - Step 61682: {'lr': 0.0003248226503847714, 'samples': 11842944, 'steps': 61681, 'loss/train': 1.7370150089263916} 11/07/2021 05:56:05 - INFO - __main__ - Step 61683: {'lr': 0.0003248175868740771, 'samples': 11843136, 'steps': 61682, 'loss/train': 1.4277887344360352} 11/07/2021 05:56:06 - INFO - __main__ - Step 61684: {'lr': 0.0003248125233296712, 'samples': 11843328, 'steps': 61683, 'loss/train': 1.7126696109771729} 11/07/2021 05:56:06 - INFO - __main__ - Step 61685: {'lr': 0.0003248074597515559, 'samples': 11843520, 'steps': 61684, 'loss/train': 1.3835041522979736} 11/07/2021 05:56:07 - INFO - __main__ - Step 61686: {'lr': 0.0003248023961397336, 'samples': 11843712, 'steps': 61685, 'loss/train': 1.631285309791565} 11/07/2021 05:56:07 - INFO - __main__ - Step 61687: {'lr': 0.0003247973324942064, 'samples': 11843904, 'steps': 61686, 'loss/train': 1.3369096517562866} 11/07/2021 05:56:08 - INFO - __main__ - Step 61688: {'lr': 0.0003247922688149767, 'samples': 11844096, 'steps': 61687, 'loss/train': 1.6173938512802124} 11/07/2021 05:56:08 - INFO - __main__ - Step 61689: {'lr': 0.0003247872051020468, 'samples': 11844288, 'steps': 61688, 'loss/train': 1.6016994714736938} 11/07/2021 05:56:08 - INFO - __main__ - Step 61690: {'lr': 0.0003247821413554188, 'samples': 11844480, 'steps': 61689, 'loss/train': 0.9484822750091553} 11/07/2021 05:56:10 - INFO - __main__ - Step 61691: {'lr': 0.00032477707757509527, 'samples': 11844672, 'steps': 61690, 'loss/train': 1.2473747730255127} 11/07/2021 05:56:10 - INFO - __main__ - Step 61692: {'lr': 0.0003247720137610783, 'samples': 11844864, 'steps': 61691, 'loss/train': 1.5042186975479126} 11/07/2021 05:56:10 - INFO - __main__ - Step 61693: {'lr': 0.0003247669499133702, 'samples': 11845056, 'steps': 61692, 'loss/train': 1.2142223119735718} 11/07/2021 05:56:11 - INFO - __main__ - Step 61694: {'lr': 0.00032476188603197334, 'samples': 11845248, 'steps': 61693, 'loss/train': 1.1783576011657715} 11/07/2021 05:56:11 - INFO - __main__ - Step 61695: {'lr': 0.00032475682211688986, 'samples': 11845440, 'steps': 61694, 'loss/train': 1.5230088233947754} 11/07/2021 05:56:12 - INFO - __main__ - Step 61696: {'lr': 0.00032475175816812206, 'samples': 11845632, 'steps': 61695, 'loss/train': 1.7840255498886108} 11/07/2021 05:56:12 - INFO - __main__ - Step 61697: {'lr': 0.0003247466941856724, 'samples': 11845824, 'steps': 61696, 'loss/train': 1.6666908264160156} 11/07/2021 05:56:13 - INFO - __main__ - Step 61698: {'lr': 0.00032474163016954293, 'samples': 11846016, 'steps': 61697, 'loss/train': 1.537450909614563} 11/07/2021 05:56:13 - INFO - __main__ - Step 61699: {'lr': 0.00032473656611973605, 'samples': 11846208, 'steps': 61698, 'loss/train': 1.4414680004119873} 11/07/2021 05:56:13 - INFO - __main__ - Step 61700: {'lr': 0.00032473150203625407, 'samples': 11846400, 'steps': 61699, 'loss/train': 1.5020642280578613} 11/07/2021 05:56:14 - INFO - __main__ - Step 61701: {'lr': 0.0003247264379190992, 'samples': 11846592, 'steps': 61700, 'loss/train': 1.6326870918273926} 11/07/2021 05:56:15 - INFO - __main__ - Step 61702: {'lr': 0.00032472137376827375, 'samples': 11846784, 'steps': 61701, 'loss/train': 1.161419153213501} 11/07/2021 05:56:15 - INFO - __main__ - Step 61703: {'lr': 0.00032471630958378, 'samples': 11846976, 'steps': 61702, 'loss/train': 1.3674455881118774} 11/07/2021 05:56:15 - INFO - __main__ - Step 61704: {'lr': 0.0003247112453656202, 'samples': 11847168, 'steps': 61703, 'loss/train': 1.6483865976333618} 11/07/2021 05:56:16 - INFO - __main__ - Step 61705: {'lr': 0.0003247061811137967, 'samples': 11847360, 'steps': 61704, 'loss/train': 1.7933553457260132} 11/07/2021 05:56:17 - INFO - __main__ - Step 61706: {'lr': 0.00032470111682831183, 'samples': 11847552, 'steps': 61705, 'loss/train': 1.3871276378631592} 11/07/2021 05:56:17 - INFO - __main__ - Step 61707: {'lr': 0.00032469605250916766, 'samples': 11847744, 'steps': 61706, 'loss/train': 1.4312708377838135} 11/07/2021 05:56:18 - INFO - __main__ - Step 61708: {'lr': 0.00032469098815636667, 'samples': 11847936, 'steps': 61707, 'loss/train': 1.6899466514587402} 11/07/2021 05:56:18 - INFO - __main__ - Step 61709: {'lr': 0.000324685923769911, 'samples': 11848128, 'steps': 61708, 'loss/train': 1.3158061504364014} 11/07/2021 05:56:18 - INFO - __main__ - Step 61710: {'lr': 0.00032468085934980306, 'samples': 11848320, 'steps': 61709, 'loss/train': 1.4132249355316162} 11/07/2021 05:56:19 - INFO - __main__ - Step 61711: {'lr': 0.0003246757948960451, 'samples': 11848512, 'steps': 61710, 'loss/train': 1.1601734161376953} 11/07/2021 05:56:20 - INFO - __main__ - Step 61712: {'lr': 0.00032467073040863943, 'samples': 11848704, 'steps': 61711, 'loss/train': 1.5337822437286377} 11/07/2021 05:56:20 - INFO - __main__ - Step 61713: {'lr': 0.00032466566588758815, 'samples': 11848896, 'steps': 61712, 'loss/train': 1.6022971868515015} 11/07/2021 05:56:21 - INFO - __main__ - Step 61714: {'lr': 0.00032466060133289374, 'samples': 11849088, 'steps': 61713, 'loss/train': 2.0359065532684326} 11/07/2021 05:56:21 - INFO - __main__ - Step 61715: {'lr': 0.0003246555367445584, 'samples': 11849280, 'steps': 61714, 'loss/train': 3.2962281703948975} 11/07/2021 05:56:21 - INFO - __main__ - Step 61716: {'lr': 0.0003246504721225844, 'samples': 11849472, 'steps': 61715, 'loss/train': 1.3707195520401} 11/07/2021 05:56:22 - INFO - __main__ - Step 61717: {'lr': 0.00032464540746697415, 'samples': 11849664, 'steps': 61716, 'loss/train': 1.536264419555664} 11/07/2021 05:56:23 - INFO - __main__ - Step 61718: {'lr': 0.00032464034277772977, 'samples': 11849856, 'steps': 61717, 'loss/train': 1.5213849544525146} 11/07/2021 05:56:23 - INFO - __main__ - Step 61719: {'lr': 0.0003246352780548536, 'samples': 11850048, 'steps': 61718, 'loss/train': 1.3210241794586182} 11/07/2021 05:56:24 - INFO - __main__ - Step 61720: {'lr': 0.0003246302132983479, 'samples': 11850240, 'steps': 61719, 'loss/train': 1.7763680219650269} 11/07/2021 05:56:24 - INFO - __main__ - Step 61721: {'lr': 0.000324625148508215, 'samples': 11850432, 'steps': 61720, 'loss/train': 1.26631498336792} 11/07/2021 05:56:24 - INFO - __main__ - Step 61722: {'lr': 0.00032462008368445717, 'samples': 11850624, 'steps': 61721, 'loss/train': 1.2398202419281006} 11/07/2021 05:56:26 - INFO - __main__ - Step 61723: {'lr': 0.00032461501882707667, 'samples': 11850816, 'steps': 61722, 'loss/train': 1.1354326009750366} 11/07/2021 05:56:26 - INFO - __main__ - Step 61724: {'lr': 0.0003246099539360758, 'samples': 11851008, 'steps': 61723, 'loss/train': 1.6111080646514893} 11/07/2021 05:56:26 - INFO - __main__ - Step 61725: {'lr': 0.0003246048890114568, 'samples': 11851200, 'steps': 61724, 'loss/train': 2.1003098487854004} 11/07/2021 05:56:27 - INFO - __main__ - Step 61726: {'lr': 0.00032459982405322205, 'samples': 11851392, 'steps': 61725, 'loss/train': 1.47649347782135} 11/07/2021 05:56:27 - INFO - __main__ - Step 61727: {'lr': 0.0003245947590613737, 'samples': 11851584, 'steps': 61726, 'loss/train': 1.5608826875686646} 11/07/2021 05:56:27 - INFO - __main__ - Step 61728: {'lr': 0.00032458969403591415, 'samples': 11851776, 'steps': 61727, 'loss/train': 1.3991248607635498} 11/07/2021 05:56:29 - INFO - __main__ - Step 61729: {'lr': 0.00032458462897684564, 'samples': 11851968, 'steps': 61728, 'loss/train': 1.2988120317459106} 11/07/2021 05:56:29 - INFO - __main__ - Step 61730: {'lr': 0.00032457956388417045, 'samples': 11852160, 'steps': 61729, 'loss/train': 1.142595887184143} 11/07/2021 05:56:29 - INFO - __main__ - Step 61731: {'lr': 0.00032457449875789084, 'samples': 11852352, 'steps': 61730, 'loss/train': 1.5439296960830688} 11/07/2021 05:56:30 - INFO - __main__ - Step 61732: {'lr': 0.0003245694335980091, 'samples': 11852544, 'steps': 61731, 'loss/train': 0.7509870529174805} 11/07/2021 05:56:30 - INFO - __main__ - Step 61733: {'lr': 0.00032456436840452754, 'samples': 11852736, 'steps': 61732, 'loss/train': 1.2246531248092651} 11/07/2021 05:56:31 - INFO - __main__ - Step 61734: {'lr': 0.00032455930317744846, 'samples': 11852928, 'steps': 61733, 'loss/train': 1.4776289463043213} 11/07/2021 05:56:32 - INFO - __main__ - Step 61735: {'lr': 0.0003245542379167741, 'samples': 11853120, 'steps': 61734, 'loss/train': 1.7374354600906372} 11/07/2021 05:56:32 - INFO - __main__ - Step 61736: {'lr': 0.0003245491726225067, 'samples': 11853312, 'steps': 61735, 'loss/train': 2.0906143188476562} 11/07/2021 05:56:32 - INFO - __main__ - Step 61737: {'lr': 0.00032454410729464855, 'samples': 11853504, 'steps': 61736, 'loss/train': 1.7867194414138794} 11/07/2021 05:56:33 - INFO - __main__ - Step 61738: {'lr': 0.00032453904193320207, 'samples': 11853696, 'steps': 61737, 'loss/train': 1.903996467590332} 11/07/2021 05:56:33 - INFO - __main__ - Step 61739: {'lr': 0.0003245339765381694, 'samples': 11853888, 'steps': 61738, 'loss/train': 1.356618046760559} 11/07/2021 05:56:34 - INFO - __main__ - Step 61740: {'lr': 0.00032452891110955296, 'samples': 11854080, 'steps': 61739, 'loss/train': 0.8299818634986877} 11/07/2021 05:56:34 - INFO - __main__ - Step 61741: {'lr': 0.0003245238456473549, 'samples': 11854272, 'steps': 61740, 'loss/train': 1.8407310247421265} 11/07/2021 05:56:35 - INFO - __main__ - Step 61742: {'lr': 0.0003245187801515775, 'samples': 11854464, 'steps': 61741, 'loss/train': 1.6757076978683472} 11/07/2021 05:56:35 - INFO - __main__ - Step 61743: {'lr': 0.00032451371462222307, 'samples': 11854656, 'steps': 61742, 'loss/train': 0.6669234037399292} 11/07/2021 05:56:35 - INFO - __main__ - Step 61744: {'lr': 0.00032450864905929393, 'samples': 11854848, 'steps': 61743, 'loss/train': 1.6236460208892822} 11/07/2021 05:56:36 - INFO - __main__ - Step 61745: {'lr': 0.00032450358346279237, 'samples': 11855040, 'steps': 61744, 'loss/train': 1.543822169303894} 11/07/2021 05:56:37 - INFO - __main__ - Step 61746: {'lr': 0.0003244985178327206, 'samples': 11855232, 'steps': 61745, 'loss/train': 1.2841920852661133} 11/07/2021 05:56:37 - INFO - __main__ - Step 61747: {'lr': 0.00032449345216908107, 'samples': 11855424, 'steps': 61746, 'loss/train': 2.712681770324707} 11/07/2021 05:56:37 - INFO - __main__ - Step 61748: {'lr': 0.0003244883864718758, 'samples': 11855616, 'steps': 61747, 'loss/train': 1.2513415813446045} 11/07/2021 05:56:38 - INFO - __main__ - Step 61749: {'lr': 0.00032448332074110726, 'samples': 11855808, 'steps': 61748, 'loss/train': 1.0415215492248535} 11/07/2021 05:56:39 - INFO - __main__ - Step 61750: {'lr': 0.0003244782549767777, 'samples': 11856000, 'steps': 61749, 'loss/train': 1.0644752979278564} 11/07/2021 05:56:39 - INFO - __main__ - Step 61751: {'lr': 0.00032447318917888933, 'samples': 11856192, 'steps': 61750, 'loss/train': 1.3255069255828857} 11/07/2021 05:56:40 - INFO - __main__ - Step 61752: {'lr': 0.0003244681233474446, 'samples': 11856384, 'steps': 61751, 'loss/train': 1.9664037227630615} 11/07/2021 05:56:40 - INFO - __main__ - Step 61753: {'lr': 0.00032446305748244566, 'samples': 11856576, 'steps': 61752, 'loss/train': 1.4596322774887085} 11/07/2021 05:56:40 - INFO - __main__ - Step 61754: {'lr': 0.0003244579915838947, 'samples': 11856768, 'steps': 61753, 'loss/train': 1.5203322172164917} 11/07/2021 05:56:41 - INFO - __main__ - Step 61755: {'lr': 0.0003244529256517942, 'samples': 11856960, 'steps': 61754, 'loss/train': 1.4791004657745361} 11/07/2021 05:56:42 - INFO - __main__ - Step 61756: {'lr': 0.0003244478596861464, 'samples': 11857152, 'steps': 61755, 'loss/train': 1.1074382066726685} 11/07/2021 05:56:42 - INFO - __main__ - Step 61757: {'lr': 0.00032444279368695343, 'samples': 11857344, 'steps': 61756, 'loss/train': 1.3437477350234985} 11/07/2021 05:56:42 - INFO - __main__ - Step 61758: {'lr': 0.00032443772765421776, 'samples': 11857536, 'steps': 61757, 'loss/train': 1.348388433456421} 11/07/2021 05:56:43 - INFO - __main__ - Step 61759: {'lr': 0.0003244326615879416, 'samples': 11857728, 'steps': 61758, 'loss/train': 0.9336689114570618} 11/07/2021 05:56:43 - INFO - __main__ - Step 61760: {'lr': 0.0003244275954881273, 'samples': 11857920, 'steps': 61759, 'loss/train': 1.3251368999481201} 11/07/2021 05:56:44 - INFO - __main__ - Step 61761: {'lr': 0.00032442252935477696, 'samples': 11858112, 'steps': 61760, 'loss/train': 1.1870735883712769} 11/07/2021 05:56:45 - INFO - __main__ - Step 61762: {'lr': 0.000324417463187893, 'samples': 11858304, 'steps': 61761, 'loss/train': 1.5616074800491333} 11/07/2021 05:56:45 - INFO - __main__ - Step 61763: {'lr': 0.00032441239698747766, 'samples': 11858496, 'steps': 61762, 'loss/train': 1.3802059888839722} 11/07/2021 05:56:45 - INFO - __main__ - Step 61764: {'lr': 0.0003244073307535333, 'samples': 11858688, 'steps': 61763, 'loss/train': 1.524458885192871} 11/07/2021 05:56:46 - INFO - __main__ - Step 61765: {'lr': 0.00032440226448606207, 'samples': 11858880, 'steps': 61764, 'loss/train': 1.9080713987350464} 11/07/2021 05:56:47 - INFO - __main__ - Step 61766: {'lr': 0.0003243971981850664, 'samples': 11859072, 'steps': 61765, 'loss/train': 1.4792211055755615} 11/07/2021 05:56:47 - INFO - __main__ - Step 61767: {'lr': 0.0003243921318505485, 'samples': 11859264, 'steps': 61766, 'loss/train': 1.4061397314071655} 11/07/2021 05:56:47 - INFO - __main__ - Step 61768: {'lr': 0.0003243870654825106, 'samples': 11859456, 'steps': 61767, 'loss/train': 1.5265042781829834} 11/07/2021 05:56:48 - INFO - __main__ - Step 61769: {'lr': 0.0003243819990809551, 'samples': 11859648, 'steps': 61768, 'loss/train': 1.365269422531128} 11/07/2021 05:56:48 - INFO - __main__ - Step 61770: {'lr': 0.0003243769326458842, 'samples': 11859840, 'steps': 61769, 'loss/train': 1.6682672500610352} 11/07/2021 05:56:49 - INFO - __main__ - Step 61771: {'lr': 0.00032437186617730013, 'samples': 11860032, 'steps': 61770, 'loss/train': 1.7197140455245972} 11/07/2021 05:56:49 - INFO - __main__ - Step 61772: {'lr': 0.0003243667996752053, 'samples': 11860224, 'steps': 61771, 'loss/train': 1.2528765201568604} 11/07/2021 05:56:50 - INFO - __main__ - Step 61773: {'lr': 0.00032436173313960193, 'samples': 11860416, 'steps': 61772, 'loss/train': 1.240363597869873} 11/07/2021 05:56:50 - INFO - __main__ - Step 61774: {'lr': 0.00032435666657049236, 'samples': 11860608, 'steps': 61773, 'loss/train': 1.5028364658355713} 11/07/2021 05:56:50 - INFO - __main__ - Step 61775: {'lr': 0.0003243515999678788, 'samples': 11860800, 'steps': 61774, 'loss/train': 0.8720287680625916} 11/07/2021 05:56:52 - INFO - __main__ - Step 61776: {'lr': 0.0003243465333317635, 'samples': 11860992, 'steps': 61775, 'loss/train': 1.6989129781723022} 11/07/2021 05:56:52 - INFO - __main__ - Step 61777: {'lr': 0.0003243414666621489, 'samples': 11861184, 'steps': 61776, 'loss/train': 1.5199521780014038} 11/07/2021 05:56:52 - INFO - __main__ - Step 61778: {'lr': 0.0003243363999590371, 'samples': 11861376, 'steps': 61777, 'loss/train': 1.0361982583999634} 11/07/2021 05:56:53 - INFO - __main__ - Step 61779: {'lr': 0.00032433133322243047, 'samples': 11861568, 'steps': 61778, 'loss/train': 1.5172899961471558} 11/07/2021 05:56:53 - INFO - __main__ - Step 61780: {'lr': 0.00032432626645233133, 'samples': 11861760, 'steps': 61779, 'loss/train': 1.497607946395874} 11/07/2021 05:56:54 - INFO - __main__ - Step 61781: {'lr': 0.0003243211996487419, 'samples': 11861952, 'steps': 61780, 'loss/train': 0.9178078174591064} 11/07/2021 05:56:54 - INFO - __main__ - Step 61782: {'lr': 0.00032431613281166445, 'samples': 11862144, 'steps': 61781, 'loss/train': 1.4077749252319336} 11/07/2021 05:56:55 - INFO - __main__ - Step 61783: {'lr': 0.0003243110659411013, 'samples': 11862336, 'steps': 61782, 'loss/train': 1.741990089416504} 11/07/2021 05:56:55 - INFO - __main__ - Step 61784: {'lr': 0.0003243059990370548, 'samples': 11862528, 'steps': 61783, 'loss/train': 1.4728020429611206} 11/07/2021 05:56:55 - INFO - __main__ - Step 61785: {'lr': 0.0003243009320995271, 'samples': 11862720, 'steps': 61784, 'loss/train': 1.562947392463684} 11/07/2021 05:56:56 - INFO - __main__ - Step 61786: {'lr': 0.0003242958651285206, 'samples': 11862912, 'steps': 61785, 'loss/train': 1.415725588798523} 11/07/2021 05:56:57 - INFO - __main__ - Step 61787: {'lr': 0.0003242907981240375, 'samples': 11863104, 'steps': 61786, 'loss/train': 1.1760563850402832} 11/07/2021 05:56:57 - INFO - __main__ - Step 61788: {'lr': 0.00032428573108608013, 'samples': 11863296, 'steps': 61787, 'loss/train': 1.3077471256256104} 11/07/2021 05:56:58 - INFO - __main__ - Step 61789: {'lr': 0.00032428066401465075, 'samples': 11863488, 'steps': 61788, 'loss/train': 1.7661689519882202} 11/07/2021 05:56:58 - INFO - __main__ - Step 61790: {'lr': 0.0003242755969097516, 'samples': 11863680, 'steps': 61789, 'loss/train': 1.1031032800674438} 11/07/2021 05:56:59 - INFO - __main__ - Step 61791: {'lr': 0.00032427052977138506, 'samples': 11863872, 'steps': 61790, 'loss/train': 1.2959251403808594} 11/07/2021 05:56:59 - INFO - __main__ - Step 61792: {'lr': 0.0003242654625995533, 'samples': 11864064, 'steps': 61791, 'loss/train': 1.2092527151107788} 11/07/2021 05:57:00 - INFO - __main__ - Step 61793: {'lr': 0.0003242603953942587, 'samples': 11864256, 'steps': 61792, 'loss/train': 1.3166017532348633} 11/07/2021 05:57:00 - INFO - __main__ - Step 61794: {'lr': 0.0003242553281555036, 'samples': 11864448, 'steps': 61793, 'loss/train': 1.5001777410507202} 11/07/2021 05:57:00 - INFO - __main__ - Step 61795: {'lr': 0.0003242502608832901, 'samples': 11864640, 'steps': 61794, 'loss/train': 1.0128358602523804} 11/07/2021 05:57:01 - INFO - __main__ - Step 61796: {'lr': 0.0003242451935776206, 'samples': 11864832, 'steps': 61795, 'loss/train': 1.4378232955932617} 11/07/2021 05:57:02 - INFO - __main__ - Step 61797: {'lr': 0.0003242401262384974, 'samples': 11865024, 'steps': 61796, 'loss/train': 1.5651167631149292} 11/07/2021 05:57:02 - INFO - __main__ - Step 61798: {'lr': 0.0003242350588659227, 'samples': 11865216, 'steps': 61797, 'loss/train': 1.327396273612976} 11/07/2021 05:57:03 - INFO - __main__ - Step 61799: {'lr': 0.00032422999145989887, 'samples': 11865408, 'steps': 61798, 'loss/train': 1.854842185974121} 11/07/2021 05:57:03 - INFO - __main__ - Step 61800: {'lr': 0.0003242249240204281, 'samples': 11865600, 'steps': 61799, 'loss/train': 1.4890387058258057} 11/07/2021 05:57:03 - INFO - __main__ - Step 61801: {'lr': 0.00032421985654751276, 'samples': 11865792, 'steps': 61800, 'loss/train': 1.7489291429519653} 11/07/2021 05:57:04 - INFO - __main__ - Step 61802: {'lr': 0.0003242147890411551, 'samples': 11865984, 'steps': 61801, 'loss/train': 2.061201572418213} 11/07/2021 05:57:05 - INFO - __main__ - Step 61803: {'lr': 0.00032420972150135736, 'samples': 11866176, 'steps': 61802, 'loss/train': 1.7351831197738647} 11/07/2021 05:57:05 - INFO - __main__ - Step 61804: {'lr': 0.00032420465392812186, 'samples': 11866368, 'steps': 61803, 'loss/train': 1.3871090412139893} 11/07/2021 05:57:05 - INFO - __main__ - Step 61805: {'lr': 0.0003241995863214509, 'samples': 11866560, 'steps': 61804, 'loss/train': 1.1448936462402344} 11/07/2021 05:57:06 - INFO - __main__ - Step 61806: {'lr': 0.00032419451868134677, 'samples': 11866752, 'steps': 61805, 'loss/train': 1.4936537742614746} 11/07/2021 05:57:06 - INFO - __main__ - Step 61807: {'lr': 0.0003241894510078118, 'samples': 11866944, 'steps': 61806, 'loss/train': 1.283284306526184} 11/07/2021 05:57:07 - INFO - __main__ - Step 61808: {'lr': 0.0003241843833008481, 'samples': 11867136, 'steps': 61807, 'loss/train': 1.6622568368911743} 11/07/2021 05:57:07 - INFO - __main__ - Step 61809: {'lr': 0.0003241793155604581, 'samples': 11867328, 'steps': 61808, 'loss/train': 1.1293435096740723} 11/07/2021 05:57:08 - INFO - __main__ - Step 61810: {'lr': 0.00032417424778664406, 'samples': 11867520, 'steps': 61809, 'loss/train': 1.9085335731506348} 11/07/2021 05:57:08 - INFO - __main__ - Step 61811: {'lr': 0.00032416917997940824, 'samples': 11867712, 'steps': 61810, 'loss/train': 1.1425395011901855} 11/07/2021 05:57:09 - INFO - __main__ - Step 61812: {'lr': 0.0003241641121387529, 'samples': 11867904, 'steps': 61811, 'loss/train': 1.4201077222824097} 11/07/2021 05:57:09 - INFO - __main__ - Step 61813: {'lr': 0.0003241590442646804, 'samples': 11868096, 'steps': 61812, 'loss/train': 1.6736416816711426} 11/07/2021 05:57:10 - INFO - __main__ - Step 61814: {'lr': 0.000324153976357193, 'samples': 11868288, 'steps': 61813, 'loss/train': 0.9233061075210571} 11/07/2021 05:57:10 - INFO - __main__ - Step 61815: {'lr': 0.0003241489084162929, 'samples': 11868480, 'steps': 61814, 'loss/train': 1.2684361934661865} 11/07/2021 05:57:11 - INFO - __main__ - Step 61816: {'lr': 0.0003241438404419825, 'samples': 11868672, 'steps': 61815, 'loss/train': 1.4449368715286255} 11/07/2021 05:57:11 - INFO - __main__ - Step 61817: {'lr': 0.000324138772434264, 'samples': 11868864, 'steps': 61816, 'loss/train': 0.8983046412467957} 11/07/2021 05:57:11 - INFO - __main__ - Step 61818: {'lr': 0.00032413370439313973, 'samples': 11869056, 'steps': 61817, 'loss/train': 1.7072992324829102} 11/07/2021 05:57:12 - INFO - __main__ - Step 61819: {'lr': 0.00032412863631861187, 'samples': 11869248, 'steps': 61818, 'loss/train': 1.149965763092041} 11/07/2021 05:57:13 - INFO - __main__ - Step 61820: {'lr': 0.0003241235682106829, 'samples': 11869440, 'steps': 61819, 'loss/train': 1.7514861822128296} 11/07/2021 05:57:13 - INFO - __main__ - Step 61821: {'lr': 0.000324118500069355, 'samples': 11869632, 'steps': 61820, 'loss/train': 1.0768152475357056} 11/07/2021 05:57:13 - INFO - __main__ - Step 61822: {'lr': 0.00032411343189463036, 'samples': 11869824, 'steps': 61821, 'loss/train': 1.4332852363586426} 11/07/2021 05:57:14 - INFO - __main__ - Step 61823: {'lr': 0.00032410836368651144, 'samples': 11870016, 'steps': 61822, 'loss/train': 1.7719522714614868} 11/07/2021 05:57:15 - INFO - __main__ - Step 61824: {'lr': 0.00032410329544500034, 'samples': 11870208, 'steps': 61823, 'loss/train': 1.6602250337600708} 11/07/2021 05:57:15 - INFO - __main__ - Step 61825: {'lr': 0.0003240982271700995, 'samples': 11870400, 'steps': 61824, 'loss/train': 1.4560860395431519} 11/07/2021 05:57:15 - INFO - __main__ - Step 61826: {'lr': 0.00032409315886181115, 'samples': 11870592, 'steps': 61825, 'loss/train': 2.208475112915039} 11/07/2021 05:57:16 - INFO - __main__ - Step 61827: {'lr': 0.00032408809052013755, 'samples': 11870784, 'steps': 61826, 'loss/train': 1.1845368146896362} 11/07/2021 05:57:16 - INFO - __main__ - Step 61828: {'lr': 0.000324083022145081, 'samples': 11870976, 'steps': 61827, 'loss/train': 1.3441156148910522} 11/07/2021 05:57:17 - INFO - __main__ - Step 61829: {'lr': 0.0003240779537366438, 'samples': 11871168, 'steps': 61828, 'loss/train': 1.2383091449737549} 11/07/2021 05:57:18 - INFO - __main__ - Step 61830: {'lr': 0.0003240728852948281, 'samples': 11871360, 'steps': 61829, 'loss/train': 1.546064853668213} 11/07/2021 05:57:18 - INFO - __main__ - Step 61831: {'lr': 0.0003240678168196365, 'samples': 11871552, 'steps': 61830, 'loss/train': 0.8571770787239075} 11/07/2021 05:57:18 - INFO - __main__ - Step 61832: {'lr': 0.00032406274831107095, 'samples': 11871744, 'steps': 61831, 'loss/train': 1.2920833826065063} 11/07/2021 05:57:19 - INFO - __main__ - Step 61833: {'lr': 0.0003240576797691339, 'samples': 11871936, 'steps': 61832, 'loss/train': 1.9337031841278076} 11/07/2021 05:57:19 - INFO - __main__ - Step 61834: {'lr': 0.0003240526111938276, 'samples': 11872128, 'steps': 61833, 'loss/train': 1.1460736989974976} 11/07/2021 05:57:20 - INFO - __main__ - Step 61835: {'lr': 0.0003240475425851543, 'samples': 11872320, 'steps': 61834, 'loss/train': 1.6275171041488647} 11/07/2021 05:57:20 - INFO - __main__ - Step 61836: {'lr': 0.00032404247394311644, 'samples': 11872512, 'steps': 61835, 'loss/train': 1.4822568893432617} 11/07/2021 05:57:21 - INFO - __main__ - Step 61837: {'lr': 0.0003240374052677161, 'samples': 11872704, 'steps': 61836, 'loss/train': 1.4401177167892456} 11/07/2021 05:57:21 - INFO - __main__ - Step 61838: {'lr': 0.0003240323365589556, 'samples': 11872896, 'steps': 61837, 'loss/train': 1.5256555080413818} 11/07/2021 05:57:21 - INFO - __main__ - Step 61839: {'lr': 0.00032402726781683734, 'samples': 11873088, 'steps': 61838, 'loss/train': 1.0582515001296997} 11/07/2021 05:57:22 - INFO - __main__ - Step 61840: {'lr': 0.0003240221990413635, 'samples': 11873280, 'steps': 61839, 'loss/train': 1.3080247640609741} 11/07/2021 05:57:23 - INFO - __main__ - Step 61841: {'lr': 0.0003240171302325364, 'samples': 11873472, 'steps': 61840, 'loss/train': 1.4246902465820312} 11/07/2021 05:57:23 - INFO - __main__ - Step 61842: {'lr': 0.0003240120613903584, 'samples': 11873664, 'steps': 61841, 'loss/train': 1.4921460151672363} 11/07/2021 05:57:23 - INFO - __main__ - Step 61843: {'lr': 0.0003240069925148316, 'samples': 11873856, 'steps': 61842, 'loss/train': 1.623861312866211} 11/07/2021 05:57:24 - INFO - __main__ - Step 61844: {'lr': 0.0003240019236059585, 'samples': 11874048, 'steps': 61843, 'loss/train': 1.175253987312317} 11/07/2021 05:57:25 - INFO - __main__ - Step 61845: {'lr': 0.0003239968546637412, 'samples': 11874240, 'steps': 61844, 'loss/train': 1.5230365991592407} 11/07/2021 05:57:25 - INFO - __main__ - Step 61846: {'lr': 0.00032399178568818203, 'samples': 11874432, 'steps': 61845, 'loss/train': 1.3172128200531006} 11/07/2021 05:57:26 - INFO - __main__ - Step 61847: {'lr': 0.00032398671667928337, 'samples': 11874624, 'steps': 61846, 'loss/train': 0.12329617887735367} 11/07/2021 05:57:26 - INFO - __main__ - Step 61848: {'lr': 0.0003239816476370474, 'samples': 11874816, 'steps': 61847, 'loss/train': 1.383508563041687} 11/07/2021 05:57:26 - INFO - __main__ - Step 61849: {'lr': 0.0003239765785614765, 'samples': 11875008, 'steps': 61848, 'loss/train': 1.55354642868042} 11/07/2021 05:57:27 - INFO - __main__ - Step 61850: {'lr': 0.0003239715094525728, 'samples': 11875200, 'steps': 61849, 'loss/train': 2.198378801345825} 11/07/2021 05:57:28 - INFO - __main__ - Step 61851: {'lr': 0.0003239664403103387, 'samples': 11875392, 'steps': 61850, 'loss/train': 1.359033226966858} 11/07/2021 05:57:28 - INFO - __main__ - Step 61852: {'lr': 0.0003239613711347766, 'samples': 11875584, 'steps': 61851, 'loss/train': 1.9286397695541382} 11/07/2021 05:57:28 - INFO - __main__ - Step 61853: {'lr': 0.00032395630192588856, 'samples': 11875776, 'steps': 61852, 'loss/train': 1.7284924983978271} 11/07/2021 05:57:29 - INFO - __main__ - Step 61854: {'lr': 0.00032395123268367685, 'samples': 11875968, 'steps': 61853, 'loss/train': 1.1633837223052979} 11/07/2021 05:57:30 - INFO - __main__ - Step 61855: {'lr': 0.000323946163408144, 'samples': 11876160, 'steps': 61854, 'loss/train': 1.1887677907943726} 11/07/2021 05:57:30 - INFO - __main__ - Step 61856: {'lr': 0.00032394109409929206, 'samples': 11876352, 'steps': 61855, 'loss/train': 1.6292109489440918} 11/07/2021 05:57:31 - INFO - __main__ - Step 61857: {'lr': 0.0003239360247571234, 'samples': 11876544, 'steps': 61856, 'loss/train': 1.7362862825393677} 11/07/2021 05:57:31 - INFO - __main__ - Step 61858: {'lr': 0.0003239309553816404, 'samples': 11876736, 'steps': 61857, 'loss/train': 1.2697372436523438} 11/07/2021 05:57:31 - INFO - __main__ - Step 61859: {'lr': 0.0003239258859728452, 'samples': 11876928, 'steps': 61858, 'loss/train': 1.314550757408142} 11/07/2021 05:57:32 - INFO - __main__ - Step 61860: {'lr': 0.0003239208165307401, 'samples': 11877120, 'steps': 61859, 'loss/train': 1.4843074083328247} 11/07/2021 05:57:33 - INFO - __main__ - Step 61861: {'lr': 0.00032391574705532746, 'samples': 11877312, 'steps': 61860, 'loss/train': 0.6689080595970154} 11/07/2021 05:57:33 - INFO - __main__ - Step 61862: {'lr': 0.0003239106775466095, 'samples': 11877504, 'steps': 61861, 'loss/train': 1.7361575365066528} 11/07/2021 05:57:33 - INFO - __main__ - Step 61863: {'lr': 0.00032390560800458855, 'samples': 11877696, 'steps': 61862, 'loss/train': 1.8188105821609497} 11/07/2021 05:57:34 - INFO - __main__ - Step 61864: {'lr': 0.00032390053842926684, 'samples': 11877888, 'steps': 61863, 'loss/train': 1.653065800666809} 11/07/2021 05:57:34 - INFO - __main__ - Step 61865: {'lr': 0.00032389546882064673, 'samples': 11878080, 'steps': 61864, 'loss/train': 1.7758830785751343} 11/07/2021 05:57:35 - INFO - __main__ - Step 61866: {'lr': 0.0003238903991787304, 'samples': 11878272, 'steps': 61865, 'loss/train': 1.702692985534668} 11/07/2021 05:57:35 - INFO - __main__ - Step 61867: {'lr': 0.0003238853295035203, 'samples': 11878464, 'steps': 61866, 'loss/train': 1.3264950513839722} 11/07/2021 05:57:36 - INFO - __main__ - Step 61868: {'lr': 0.0003238802597950186, 'samples': 11878656, 'steps': 61867, 'loss/train': 1.6209849119186401} 11/07/2021 05:57:36 - INFO - __main__ - Step 61869: {'lr': 0.0003238751900532275, 'samples': 11878848, 'steps': 61868, 'loss/train': 1.9027670621871948} 11/07/2021 05:57:36 - INFO - __main__ - Step 61870: {'lr': 0.00032387012027814945, 'samples': 11879040, 'steps': 61869, 'loss/train': 1.4053469896316528} 11/07/2021 05:57:38 - INFO - __main__ - Step 61871: {'lr': 0.00032386505046978667, 'samples': 11879232, 'steps': 61870, 'loss/train': 1.1881669759750366} 11/07/2021 05:57:38 - INFO - __main__ - Step 61872: {'lr': 0.00032385998062814137, 'samples': 11879424, 'steps': 61871, 'loss/train': 1.3447136878967285} 11/07/2021 05:57:38 - INFO - __main__ - Step 61873: {'lr': 0.00032385491075321595, 'samples': 11879616, 'steps': 61872, 'loss/train': 1.5920741558074951} 11/07/2021 05:57:39 - INFO - __main__ - Step 61874: {'lr': 0.00032384984084501267, 'samples': 11879808, 'steps': 61873, 'loss/train': 1.1583985090255737} 11/07/2021 05:57:39 - INFO - __main__ - Step 61875: {'lr': 0.00032384477090353377, 'samples': 11880000, 'steps': 61874, 'loss/train': 1.2242507934570312} 11/07/2021 05:57:39 - INFO - __main__ - Step 61876: {'lr': 0.0003238397009287815, 'samples': 11880192, 'steps': 61875, 'loss/train': 1.4628705978393555} 11/07/2021 05:57:40 - INFO - __main__ - Step 61877: {'lr': 0.00032383463092075824, 'samples': 11880384, 'steps': 61876, 'loss/train': 1.1985481977462769} 11/07/2021 05:57:41 - INFO - __main__ - Step 61878: {'lr': 0.0003238295608794662, 'samples': 11880576, 'steps': 61877, 'loss/train': 1.7491861581802368} 11/07/2021 05:57:41 - INFO - __main__ - Step 61879: {'lr': 0.0003238244908049078, 'samples': 11880768, 'steps': 61878, 'loss/train': 1.466186761856079} 11/07/2021 05:57:41 - INFO - __main__ - Step 61880: {'lr': 0.0003238194206970851, 'samples': 11880960, 'steps': 61879, 'loss/train': 1.8479321002960205} 11/07/2021 05:57:42 - INFO - __main__ - Step 61881: {'lr': 0.0003238143505560007, 'samples': 11881152, 'steps': 61880, 'loss/train': 0.8473256230354309} 11/07/2021 05:57:43 - INFO - __main__ - Step 61882: {'lr': 0.0003238092803816565, 'samples': 11881344, 'steps': 61881, 'loss/train': 1.6602250337600708} 11/07/2021 05:57:43 - INFO - __main__ - Step 61883: {'lr': 0.00032380421017405504, 'samples': 11881536, 'steps': 61882, 'loss/train': 1.2556284666061401} 11/07/2021 05:57:43 - INFO - __main__ - Step 61884: {'lr': 0.00032379913993319854, 'samples': 11881728, 'steps': 61883, 'loss/train': 1.1970146894454956} 11/07/2021 05:57:44 - INFO - __main__ - Step 61885: {'lr': 0.0003237940696590893, 'samples': 11881920, 'steps': 61884, 'loss/train': 1.3891535997390747} 11/07/2021 05:57:44 - INFO - __main__ - Step 61886: {'lr': 0.00032378899935172955, 'samples': 11882112, 'steps': 61885, 'loss/train': 1.2977911233901978} 11/07/2021 05:57:45 - INFO - __main__ - Step 61887: {'lr': 0.0003237839290111216, 'samples': 11882304, 'steps': 61886, 'loss/train': 1.4083435535430908} 11/07/2021 05:57:45 - INFO - __main__ - Step 61888: {'lr': 0.0003237788586372679, 'samples': 11882496, 'steps': 61887, 'loss/train': 1.4357545375823975} 11/07/2021 05:57:46 - INFO - __main__ - Step 61889: {'lr': 0.00032377378823017044, 'samples': 11882688, 'steps': 61888, 'loss/train': 1.495382308959961} 11/07/2021 05:57:46 - INFO - __main__ - Step 61890: {'lr': 0.0003237687177898317, 'samples': 11882880, 'steps': 61889, 'loss/train': 1.3782849311828613} 11/07/2021 05:57:47 - INFO - __main__ - Step 61891: {'lr': 0.0003237636473162539, 'samples': 11883072, 'steps': 61890, 'loss/train': 1.5736939907073975} 11/07/2021 05:57:48 - INFO - __main__ - Step 61892: {'lr': 0.0003237585768094393, 'samples': 11883264, 'steps': 61891, 'loss/train': 1.1663696765899658} 11/07/2021 05:57:48 - INFO - __main__ - Step 61893: {'lr': 0.00032375350626939026, 'samples': 11883456, 'steps': 61892, 'loss/train': 1.4058462381362915} 11/07/2021 05:57:48 - INFO - __main__ - Step 61894: {'lr': 0.0003237484356961091, 'samples': 11883648, 'steps': 61893, 'loss/train': 1.4834306240081787} 11/07/2021 05:57:49 - INFO - __main__ - Step 61895: {'lr': 0.00032374336508959796, 'samples': 11883840, 'steps': 61894, 'loss/train': 1.5994411706924438} 11/07/2021 05:57:49 - INFO - __main__ - Step 61896: {'lr': 0.0003237382944498592, 'samples': 11884032, 'steps': 61895, 'loss/train': 1.7316782474517822} 11/07/2021 05:57:49 - INFO - __main__ - Step 61897: {'lr': 0.0003237332237768951, 'samples': 11884224, 'steps': 61896, 'loss/train': 2.058790683746338} 11/07/2021 05:57:51 - INFO - __main__ - Step 61898: {'lr': 0.000323728153070708, 'samples': 11884416, 'steps': 61897, 'loss/train': 1.4940078258514404} 11/07/2021 05:57:52 - INFO - __main__ - Step 61899: {'lr': 0.0003237230823313, 'samples': 11884608, 'steps': 61898, 'loss/train': 1.5305954217910767} 11/07/2021 05:57:52 - INFO - __main__ - Step 61900: {'lr': 0.00032371801155867363, 'samples': 11884800, 'steps': 61899, 'loss/train': 2.1137497425079346} 11/07/2021 05:57:52 - INFO - __main__ - Step 61901: {'lr': 0.0003237129407528311, 'samples': 11884992, 'steps': 61900, 'loss/train': 1.781248927116394} 11/07/2021 05:57:53 - INFO - __main__ - Step 61902: {'lr': 0.00032370786991377454, 'samples': 11885184, 'steps': 61901, 'loss/train': 1.7152830362319946} 11/07/2021 05:57:53 - INFO - __main__ - Step 61903: {'lr': 0.0003237027990415064, 'samples': 11885376, 'steps': 61902, 'loss/train': 1.736863613128662} 11/07/2021 05:57:53 - INFO - __main__ - Step 61904: {'lr': 0.0003236977281360289, 'samples': 11885568, 'steps': 61903, 'loss/train': 1.7097084522247314} 11/07/2021 05:57:54 - INFO - __main__ - Step 61905: {'lr': 0.0003236926571973444, 'samples': 11885760, 'steps': 61904, 'loss/train': 1.584526777267456} 11/07/2021 05:57:55 - INFO - __main__ - Step 61906: {'lr': 0.000323687586225455, 'samples': 11885952, 'steps': 61905, 'loss/train': 1.4457571506500244} 11/07/2021 05:57:55 - INFO - __main__ - Step 61907: {'lr': 0.0003236825152203632, 'samples': 11886144, 'steps': 61906, 'loss/train': 1.7083287239074707} 11/07/2021 05:57:55 - INFO - __main__ - Step 61908: {'lr': 0.0003236774441820713, 'samples': 11886336, 'steps': 61907, 'loss/train': 1.7431252002716064} 11/07/2021 05:57:56 - INFO - __main__ - Step 61909: {'lr': 0.00032367237311058133, 'samples': 11886528, 'steps': 61908, 'loss/train': 1.5368584394454956} 11/07/2021 05:57:57 - INFO - __main__ - Step 61910: {'lr': 0.0003236673020058958, 'samples': 11886720, 'steps': 61909, 'loss/train': 1.1235260963439941} 11/07/2021 05:57:57 - INFO - __main__ - Step 61911: {'lr': 0.0003236622308680168, 'samples': 11886912, 'steps': 61910, 'loss/train': 1.2605373859405518} 11/07/2021 05:57:58 - INFO - __main__ - Step 61912: {'lr': 0.0003236571596969469, 'samples': 11887104, 'steps': 61911, 'loss/train': 1.5974717140197754} 11/07/2021 05:57:58 - INFO - __main__ - Step 61913: {'lr': 0.0003236520884926881, 'samples': 11887296, 'steps': 61912, 'loss/train': 0.9960759878158569} 11/07/2021 05:57:58 - INFO - __main__ - Step 61914: {'lr': 0.00032364701725524285, 'samples': 11887488, 'steps': 61913, 'loss/train': 1.5156807899475098} 11/07/2021 05:58:00 - INFO - __main__ - Step 61915: {'lr': 0.00032364194598461345, 'samples': 11887680, 'steps': 61914, 'loss/train': 1.3956077098846436} 11/07/2021 05:58:00 - INFO - __main__ - Step 61916: {'lr': 0.00032363687468080205, 'samples': 11887872, 'steps': 61915, 'loss/train': 1.381554126739502} 11/07/2021 05:58:00 - INFO - __main__ - Step 61917: {'lr': 0.000323631803343811, 'samples': 11888064, 'steps': 61916, 'loss/train': 1.2477246522903442} 11/07/2021 05:58:01 - INFO - __main__ - Step 61918: {'lr': 0.0003236267319736426, 'samples': 11888256, 'steps': 61917, 'loss/train': 2.3861846923828125} 11/07/2021 05:58:01 - INFO - __main__ - Step 61919: {'lr': 0.00032362166057029915, 'samples': 11888448, 'steps': 61918, 'loss/train': 1.961648941040039} 11/07/2021 05:58:02 - INFO - __main__ - Step 61920: {'lr': 0.0003236165891337829, 'samples': 11888640, 'steps': 61919, 'loss/train': 1.0538597106933594} 11/07/2021 05:58:02 - INFO - __main__ - Step 61921: {'lr': 0.00032361151766409623, 'samples': 11888832, 'steps': 61920, 'loss/train': 1.6152410507202148} 11/07/2021 05:58:03 - INFO - __main__ - Step 61922: {'lr': 0.0003236064461612413, 'samples': 11889024, 'steps': 61921, 'loss/train': 0.959873378276825} 11/07/2021 05:58:03 - INFO - __main__ - Step 61923: {'lr': 0.00032360137462522046, 'samples': 11889216, 'steps': 61922, 'loss/train': 1.572746753692627} 11/07/2021 05:58:03 - INFO - __main__ - Step 61924: {'lr': 0.0003235963030560359, 'samples': 11889408, 'steps': 61923, 'loss/train': 0.9194735288619995} 11/07/2021 05:58:04 - INFO - __main__ - Step 61925: {'lr': 0.00032359123145369, 'samples': 11889600, 'steps': 61924, 'loss/train': 1.3601068258285522} 11/07/2021 05:58:05 - INFO - __main__ - Step 61926: {'lr': 0.00032358615981818505, 'samples': 11889792, 'steps': 61925, 'loss/train': 1.266107201576233} 11/07/2021 05:58:05 - INFO - __main__ - Step 61927: {'lr': 0.0003235810881495233, 'samples': 11889984, 'steps': 61926, 'loss/train': 1.3247363567352295} 11/07/2021 05:58:05 - INFO - __main__ - Step 61928: {'lr': 0.00032357601644770714, 'samples': 11890176, 'steps': 61927, 'loss/train': 1.5525723695755005} 11/07/2021 05:58:06 - INFO - __main__ - Step 61929: {'lr': 0.0003235709447127386, 'samples': 11890368, 'steps': 61928, 'loss/train': 1.7488627433776855} 11/07/2021 05:58:06 - INFO - __main__ - Step 61930: {'lr': 0.00032356587294462023, 'samples': 11890560, 'steps': 61929, 'loss/train': 1.4525083303451538} 11/07/2021 05:58:07 - INFO - __main__ - Step 61931: {'lr': 0.00032356080114335416, 'samples': 11890752, 'steps': 61930, 'loss/train': 1.5392303466796875} 11/07/2021 05:58:08 - INFO - __main__ - Step 61932: {'lr': 0.0003235557293089428, 'samples': 11890944, 'steps': 61931, 'loss/train': 1.5483129024505615} 11/07/2021 05:58:08 - INFO - __main__ - Step 61933: {'lr': 0.00032355065744138836, 'samples': 11891136, 'steps': 61932, 'loss/train': 2.0889017581939697} 11/07/2021 05:58:08 - INFO - __main__ - Step 61934: {'lr': 0.00032354558554069303, 'samples': 11891328, 'steps': 61933, 'loss/train': 2.0167336463928223} 11/07/2021 05:58:09 - INFO - __main__ - Step 61935: {'lr': 0.00032354051360685934, 'samples': 11891520, 'steps': 61934, 'loss/train': 1.441537618637085} 11/07/2021 05:58:10 - INFO - __main__ - Step 61936: {'lr': 0.0003235354416398893, 'samples': 11891712, 'steps': 61935, 'loss/train': 5.813534736633301} 11/07/2021 05:58:10 - INFO - __main__ - Step 61937: {'lr': 0.0003235303696397854, 'samples': 11891904, 'steps': 61936, 'loss/train': 1.461822748184204} 11/07/2021 05:58:11 - INFO - __main__ - Step 61938: {'lr': 0.0003235252976065498, 'samples': 11892096, 'steps': 61937, 'loss/train': 1.184041976928711} 11/07/2021 05:58:11 - INFO - __main__ - Step 61939: {'lr': 0.00032352022554018483, 'samples': 11892288, 'steps': 61938, 'loss/train': 1.4853055477142334} 11/07/2021 05:58:11 - INFO - __main__ - Step 61940: {'lr': 0.00032351515344069285, 'samples': 11892480, 'steps': 61939, 'loss/train': 1.6828155517578125} 11/07/2021 05:58:12 - INFO - __main__ - Step 61941: {'lr': 0.000323510081308076, 'samples': 11892672, 'steps': 61940, 'loss/train': 1.0645782947540283} 11/07/2021 05:58:13 - INFO - __main__ - Step 61942: {'lr': 0.0003235050091423367, 'samples': 11892864, 'steps': 61941, 'loss/train': 1.232614517211914} 11/07/2021 05:58:13 - INFO - __main__ - Step 61943: {'lr': 0.0003234999369434771, 'samples': 11893056, 'steps': 61942, 'loss/train': 1.4772956371307373} 11/07/2021 05:58:13 - INFO - __main__ - Step 61944: {'lr': 0.00032349486471149963, 'samples': 11893248, 'steps': 61943, 'loss/train': 1.7297793626785278} 11/07/2021 05:58:14 - INFO - __main__ - Step 61945: {'lr': 0.0003234897924464065, 'samples': 11893440, 'steps': 61944, 'loss/train': 0.5714331865310669} 11/07/2021 05:58:14 - INFO - __main__ - Step 61946: {'lr': 0.00032348472014819994, 'samples': 11893632, 'steps': 61945, 'loss/train': 1.7703527212142944} 11/07/2021 05:58:15 - INFO - __main__ - Step 61947: {'lr': 0.0003234796478168824, 'samples': 11893824, 'steps': 61946, 'loss/train': 1.378594994544983} 11/07/2021 05:58:15 - INFO - __main__ - Step 61948: {'lr': 0.00032347457545245606, 'samples': 11894016, 'steps': 61947, 'loss/train': 1.0633728504180908} 11/07/2021 05:58:16 - INFO - __main__ - Step 61949: {'lr': 0.0003234695030549232, 'samples': 11894208, 'steps': 61948, 'loss/train': 0.43849238753318787} 11/07/2021 05:58:16 - INFO - __main__ - Step 61950: {'lr': 0.00032346443062428605, 'samples': 11894400, 'steps': 61949, 'loss/train': 1.5281093120574951} 11/07/2021 05:58:17 - INFO - __main__ - Step 61951: {'lr': 0.000323459358160547, 'samples': 11894592, 'steps': 61950, 'loss/train': 1.6779379844665527} 11/07/2021 05:58:18 - INFO - __main__ - Step 61952: {'lr': 0.0003234542856637083, 'samples': 11894784, 'steps': 61951, 'loss/train': 1.0336118936538696} 11/07/2021 05:58:18 - INFO - __main__ - Step 61953: {'lr': 0.0003234492131337722, 'samples': 11894976, 'steps': 61952, 'loss/train': 0.9002169370651245} 11/07/2021 05:58:18 - INFO - __main__ - Step 61954: {'lr': 0.000323444140570741, 'samples': 11895168, 'steps': 61953, 'loss/train': 1.5202224254608154} 11/07/2021 05:58:19 - INFO - __main__ - Step 61955: {'lr': 0.00032343906797461716, 'samples': 11895360, 'steps': 61954, 'loss/train': 1.3035728931427002} 11/07/2021 05:58:19 - INFO - __main__ - Step 61956: {'lr': 0.00032343399534540265, 'samples': 11895552, 'steps': 61955, 'loss/train': 1.4672057628631592} 11/07/2021 05:58:20 - INFO - __main__ - Step 61957: {'lr': 0.00032342892268309996, 'samples': 11895744, 'steps': 61956, 'loss/train': 1.7767953872680664} 11/07/2021 05:58:20 - INFO - __main__ - Step 61958: {'lr': 0.00032342384998771133, 'samples': 11895936, 'steps': 61957, 'loss/train': 1.63362455368042} 11/07/2021 05:58:21 - INFO - __main__ - Step 61959: {'lr': 0.000323418777259239, 'samples': 11896128, 'steps': 61958, 'loss/train': 1.8503328561782837} 11/07/2021 05:58:21 - INFO - __main__ - Step 61960: {'lr': 0.0003234137044976854, 'samples': 11896320, 'steps': 61959, 'loss/train': 1.2303320169448853} 11/07/2021 05:58:21 - INFO - __main__ - Step 61961: {'lr': 0.0003234086317030526, 'samples': 11896512, 'steps': 61960, 'loss/train': 1.613657832145691} 11/07/2021 05:58:22 - INFO - __main__ - Step 61962: {'lr': 0.00032340355887534313, 'samples': 11896704, 'steps': 61961, 'loss/train': 1.2385461330413818} 11/07/2021 05:58:23 - INFO - __main__ - Step 61963: {'lr': 0.00032339848601455913, 'samples': 11896896, 'steps': 61962, 'loss/train': 1.2594621181488037} 11/07/2021 05:58:23 - INFO - __main__ - Step 61964: {'lr': 0.0003233934131207028, 'samples': 11897088, 'steps': 61963, 'loss/train': 1.578066110610962} 11/07/2021 05:58:23 - INFO - __main__ - Step 61965: {'lr': 0.0003233883401937766, 'samples': 11897280, 'steps': 61964, 'loss/train': 1.1574463844299316} 11/07/2021 05:58:24 - INFO - __main__ - Step 61966: {'lr': 0.00032338326723378274, 'samples': 11897472, 'steps': 61965, 'loss/train': 1.5419903993606567} 11/07/2021 05:58:25 - INFO - __main__ - Step 61967: {'lr': 0.00032337819424072353, 'samples': 11897664, 'steps': 61966, 'loss/train': 1.354822039604187} 11/07/2021 05:58:26 - INFO - __main__ - Step 61968: {'lr': 0.00032337312121460125, 'samples': 11897856, 'steps': 61967, 'loss/train': 1.4387123584747314} 11/07/2021 05:58:26 - INFO - __main__ - Step 61969: {'lr': 0.00032336804815541817, 'samples': 11898048, 'steps': 61968, 'loss/train': 1.8532602787017822} 11/07/2021 05:58:26 - INFO - __main__ - Step 61970: {'lr': 0.0003233629750631765, 'samples': 11898240, 'steps': 61969, 'loss/train': 1.8000972270965576} 11/07/2021 05:58:27 - INFO - __main__ - Step 61971: {'lr': 0.0003233579019378787, 'samples': 11898432, 'steps': 61970, 'loss/train': 1.7329866886138916} 11/07/2021 05:58:27 - INFO - __main__ - Step 61972: {'lr': 0.0003233528287795269, 'samples': 11898624, 'steps': 61971, 'loss/train': 1.5369231700897217} 11/07/2021 05:58:28 - INFO - __main__ - Step 61973: {'lr': 0.00032334775558812346, 'samples': 11898816, 'steps': 61972, 'loss/train': 1.1939178705215454} 11/07/2021 05:58:28 - INFO - __main__ - Step 61974: {'lr': 0.0003233426823636706, 'samples': 11899008, 'steps': 61973, 'loss/train': 1.3880362510681152} 11/07/2021 05:58:29 - INFO - __main__ - Step 61975: {'lr': 0.0003233376091061708, 'samples': 11899200, 'steps': 61974, 'loss/train': 1.0212767124176025} 11/07/2021 05:58:29 - INFO - __main__ - Step 61976: {'lr': 0.00032333253581562615, 'samples': 11899392, 'steps': 61975, 'loss/train': 1.1388977766036987} 11/07/2021 05:58:29 - INFO - __main__ - Step 61977: {'lr': 0.0003233274624920389, 'samples': 11899584, 'steps': 61976, 'loss/train': 1.636702537536621} 11/07/2021 05:58:30 - INFO - __main__ - Step 61978: {'lr': 0.0003233223891354116, 'samples': 11899776, 'steps': 61977, 'loss/train': 1.2343858480453491} 11/07/2021 05:58:31 - INFO - __main__ - Step 61979: {'lr': 0.00032331731574574617, 'samples': 11899968, 'steps': 61978, 'loss/train': 1.2654637098312378} 11/07/2021 05:58:31 - INFO - __main__ - Step 61980: {'lr': 0.00032331224232304517, 'samples': 11900160, 'steps': 61979, 'loss/train': 1.4161187410354614} 11/07/2021 05:58:31 - INFO - __main__ - Step 61981: {'lr': 0.00032330716886731087, 'samples': 11900352, 'steps': 61980, 'loss/train': 1.1601057052612305} 11/07/2021 05:58:32 - INFO - __main__ - Step 61982: {'lr': 0.0003233020953785454, 'samples': 11900544, 'steps': 61981, 'loss/train': 1.5647937059402466} 11/07/2021 05:58:32 - INFO - __main__ - Step 61983: {'lr': 0.00032329702185675117, 'samples': 11900736, 'steps': 61982, 'loss/train': 1.45183265209198} 11/07/2021 05:58:33 - INFO - __main__ - Step 61984: {'lr': 0.00032329194830193044, 'samples': 11900928, 'steps': 61983, 'loss/train': 1.2516897916793823} 11/07/2021 05:58:34 - INFO - __main__ - Step 61985: {'lr': 0.00032328687471408545, 'samples': 11901120, 'steps': 61984, 'loss/train': 0.9546338319778442} 11/07/2021 05:58:34 - INFO - __main__ - Step 61986: {'lr': 0.0003232818010932186, 'samples': 11901312, 'steps': 61985, 'loss/train': 1.4930994510650635} 11/07/2021 05:58:34 - INFO - __main__ - Step 61987: {'lr': 0.0003232767274393321, 'samples': 11901504, 'steps': 61986, 'loss/train': 1.813164234161377} 11/07/2021 05:58:35 - INFO - __main__ - Step 61988: {'lr': 0.0003232716537524282, 'samples': 11901696, 'steps': 61987, 'loss/train': 1.9347676038742065} 11/07/2021 05:58:36 - INFO - __main__ - Step 61989: {'lr': 0.00032326658003250917, 'samples': 11901888, 'steps': 61988, 'loss/train': 1.8708559274673462} 11/07/2021 05:58:36 - INFO - __main__ - Step 61990: {'lr': 0.0003232615062795774, 'samples': 11902080, 'steps': 61989, 'loss/train': 1.7254104614257812} 11/07/2021 05:58:37 - INFO - __main__ - Step 61991: {'lr': 0.0003232564324936351, 'samples': 11902272, 'steps': 61990, 'loss/train': 1.6419090032577515} 11/07/2021 05:58:37 - INFO - __main__ - Step 61992: {'lr': 0.0003232513586746847, 'samples': 11902464, 'steps': 61991, 'loss/train': 1.3409444093704224} 11/07/2021 05:58:37 - INFO - __main__ - Step 61993: {'lr': 0.00032324628482272824, 'samples': 11902656, 'steps': 61992, 'loss/train': 1.3349100351333618} 11/07/2021 05:58:38 - INFO - __main__ - Step 61994: {'lr': 0.00032324121093776817, 'samples': 11902848, 'steps': 61993, 'loss/train': 1.5028674602508545} 11/07/2021 05:58:39 - INFO - __main__ - Step 61995: {'lr': 0.0003232361370198067, 'samples': 11903040, 'steps': 61994, 'loss/train': 2.314544200897217} 11/07/2021 05:58:39 - INFO - __main__ - Step 61996: {'lr': 0.0003232310630688462, 'samples': 11903232, 'steps': 61995, 'loss/train': 1.7559916973114014} 11/07/2021 05:58:39 - INFO - __main__ - Step 61997: {'lr': 0.00032322598908488887, 'samples': 11903424, 'steps': 61996, 'loss/train': 1.0925874710083008} 11/07/2021 05:58:40 - INFO - __main__ - Step 61998: {'lr': 0.00032322091506793715, 'samples': 11903616, 'steps': 61997, 'loss/train': 1.1896063089370728} 11/07/2021 05:58:41 - INFO - __main__ - Step 61999: {'lr': 0.00032321584101799316, 'samples': 11903808, 'steps': 61998, 'loss/train': 1.0083476305007935} 11/07/2021 05:58:41 - INFO - __main__ - Step 62000: {'lr': 0.0003232107669350592, 'samples': 11904000, 'steps': 61999, 'loss/train': 0.6351624727249146} 11/07/2021 05:58:41 - INFO - __main__ - Step 62001: {'lr': 0.0003232056928191376, 'samples': 11904192, 'steps': 62000, 'loss/train': 1.0270509719848633} 11/07/2021 05:58:42 - INFO - __main__ - Step 62002: {'lr': 0.00032320061867023066, 'samples': 11904384, 'steps': 62001, 'loss/train': 1.284279704093933} 11/07/2021 05:58:42 - INFO - __main__ - Step 62003: {'lr': 0.0003231955444883407, 'samples': 11904576, 'steps': 62002, 'loss/train': 1.6004997491836548} 11/07/2021 05:58:43 - INFO - __main__ - Step 62004: {'lr': 0.0003231904702734699, 'samples': 11904768, 'steps': 62003, 'loss/train': 1.0223402976989746} 11/07/2021 05:58:44 - INFO - __main__ - Step 62005: {'lr': 0.00032318539602562064, 'samples': 11904960, 'steps': 62004, 'loss/train': 1.610092043876648} 11/07/2021 05:58:44 - INFO - __main__ - Step 62006: {'lr': 0.00032318032174479515, 'samples': 11905152, 'steps': 62005, 'loss/train': 1.3842592239379883} 11/07/2021 05:58:44 - INFO - __main__ - Step 62007: {'lr': 0.0003231752474309957, 'samples': 11905344, 'steps': 62006, 'loss/train': 1.7968509197235107} 11/07/2021 05:58:45 - INFO - __main__ - Step 62008: {'lr': 0.00032317017308422464, 'samples': 11905536, 'steps': 62007, 'loss/train': 1.6465976238250732} 11/07/2021 05:58:45 - INFO - __main__ - Step 62009: {'lr': 0.0003231650987044843, 'samples': 11905728, 'steps': 62008, 'loss/train': 1.6330751180648804} 11/07/2021 05:58:46 - INFO - __main__ - Step 62010: {'lr': 0.00032316002429177683, 'samples': 11905920, 'steps': 62009, 'loss/train': 1.7054857015609741} 11/07/2021 05:58:46 - INFO - __main__ - Step 62011: {'lr': 0.00032315494984610463, 'samples': 11906112, 'steps': 62010, 'loss/train': 1.3888680934906006} 11/07/2021 05:58:47 - INFO - __main__ - Step 62012: {'lr': 0.0003231498753674698, 'samples': 11906304, 'steps': 62011, 'loss/train': 1.3335429430007935} 11/07/2021 05:58:47 - INFO - __main__ - Step 62013: {'lr': 0.00032314480085587487, 'samples': 11906496, 'steps': 62012, 'loss/train': 1.4598748683929443} 11/07/2021 05:58:47 - INFO - __main__ - Step 62014: {'lr': 0.00032313972631132197, 'samples': 11906688, 'steps': 62013, 'loss/train': 1.560803771018982} 11/07/2021 05:58:48 - INFO - __main__ - Step 62015: {'lr': 0.0003231346517338135, 'samples': 11906880, 'steps': 62014, 'loss/train': 1.380338430404663} 11/07/2021 05:58:49 - INFO - __main__ - Step 62016: {'lr': 0.00032312957712335173, 'samples': 11907072, 'steps': 62015, 'loss/train': 1.2676122188568115} 11/07/2021 05:58:49 - INFO - __main__ - Step 62017: {'lr': 0.0003231245024799388, 'samples': 11907264, 'steps': 62016, 'loss/train': 1.7665510177612305} 11/07/2021 05:58:49 - INFO - __main__ - Step 62018: {'lr': 0.00032311942780357714, 'samples': 11907456, 'steps': 62017, 'loss/train': 1.6583725214004517} 11/07/2021 05:58:50 - INFO - __main__ - Step 62019: {'lr': 0.00032311435309426894, 'samples': 11907648, 'steps': 62018, 'loss/train': 1.5119041204452515} 11/07/2021 05:58:51 - INFO - __main__ - Step 62020: {'lr': 0.00032310927835201665, 'samples': 11907840, 'steps': 62019, 'loss/train': 1.2792350053787231} 11/07/2021 05:58:51 - INFO - __main__ - Step 62021: {'lr': 0.00032310420357682234, 'samples': 11908032, 'steps': 62020, 'loss/train': 1.8435345888137817} 11/07/2021 05:58:52 - INFO - __main__ - Step 62022: {'lr': 0.0003230991287686885, 'samples': 11908224, 'steps': 62021, 'loss/train': 1.983917236328125} 11/07/2021 05:58:52 - INFO - __main__ - Step 62023: {'lr': 0.00032309405392761726, 'samples': 11908416, 'steps': 62022, 'loss/train': 1.4740469455718994} 11/07/2021 05:58:52 - INFO - __main__ - Step 62024: {'lr': 0.00032308897905361094, 'samples': 11908608, 'steps': 62023, 'loss/train': 0.9116451740264893} 11/07/2021 05:58:53 - INFO - __main__ - Step 62025: {'lr': 0.00032308390414667186, 'samples': 11908800, 'steps': 62024, 'loss/train': 1.2507102489471436} 11/07/2021 05:58:54 - INFO - __main__ - Step 62026: {'lr': 0.00032307882920680237, 'samples': 11908992, 'steps': 62025, 'loss/train': 1.321568489074707} 11/07/2021 05:58:54 - INFO - __main__ - Step 62027: {'lr': 0.0003230737542340046, 'samples': 11909184, 'steps': 62026, 'loss/train': 1.5347850322723389} 11/07/2021 05:58:54 - INFO - __main__ - Step 62028: {'lr': 0.00032306867922828096, 'samples': 11909376, 'steps': 62027, 'loss/train': 1.1870124340057373} 11/07/2021 05:58:55 - INFO - __main__ - Step 62029: {'lr': 0.00032306360418963377, 'samples': 11909568, 'steps': 62028, 'loss/train': 1.35513174533844} 11/07/2021 05:58:55 - INFO - __main__ - Step 62030: {'lr': 0.0003230585291180652, 'samples': 11909760, 'steps': 62029, 'loss/train': 0.8412436246871948} 11/07/2021 05:58:56 - INFO - __main__ - Step 62031: {'lr': 0.00032305345401357756, 'samples': 11909952, 'steps': 62030, 'loss/train': 1.6801143884658813} 11/07/2021 05:58:57 - INFO - __main__ - Step 62032: {'lr': 0.00032304837887617315, 'samples': 11910144, 'steps': 62031, 'loss/train': 1.1237632036209106} 11/07/2021 05:58:57 - INFO - __main__ - Step 62033: {'lr': 0.0003230433037058543, 'samples': 11910336, 'steps': 62032, 'loss/train': 1.2535407543182373} 11/07/2021 05:58:57 - INFO - __main__ - Step 62034: {'lr': 0.00032303822850262323, 'samples': 11910528, 'steps': 62033, 'loss/train': 1.5598143339157104} 11/07/2021 05:58:58 - INFO - __main__ - Step 62035: {'lr': 0.0003230331532664823, 'samples': 11910720, 'steps': 62034, 'loss/train': 1.0798410177230835} 11/07/2021 05:58:59 - INFO - __main__ - Step 62036: {'lr': 0.00032302807799743376, 'samples': 11910912, 'steps': 62035, 'loss/train': 1.230085849761963} 11/07/2021 05:58:59 - INFO - __main__ - Step 62037: {'lr': 0.0003230230026954799, 'samples': 11911104, 'steps': 62036, 'loss/train': 1.615645170211792} 11/07/2021 05:58:59 - INFO - __main__ - Step 62038: {'lr': 0.00032301792736062296, 'samples': 11911296, 'steps': 62037, 'loss/train': 1.2320530414581299} 11/07/2021 05:59:00 - INFO - __main__ - Step 62039: {'lr': 0.00032301285199286527, 'samples': 11911488, 'steps': 62038, 'loss/train': 1.607682466506958} 11/07/2021 05:59:00 - INFO - __main__ - Step 62040: {'lr': 0.00032300777659220915, 'samples': 11911680, 'steps': 62039, 'loss/train': 1.0978400707244873} 11/07/2021 05:59:01 - INFO - __main__ - Step 62041: {'lr': 0.0003230027011586568, 'samples': 11911872, 'steps': 62040, 'loss/train': 1.1216998100280762} 11/07/2021 05:59:02 - INFO - __main__ - Step 62042: {'lr': 0.0003229976256922107, 'samples': 11912064, 'steps': 62041, 'loss/train': 1.280750036239624} 11/07/2021 05:59:02 - INFO - __main__ - Step 62043: {'lr': 0.0003229925501928729, 'samples': 11912256, 'steps': 62042, 'loss/train': 1.9677507877349854} 11/07/2021 05:59:02 - INFO - __main__ - Step 62044: {'lr': 0.0003229874746606457, 'samples': 11912448, 'steps': 62043, 'loss/train': 1.3165696859359741} 11/07/2021 05:59:03 - INFO - __main__ - Step 62045: {'lr': 0.00032298239909553156, 'samples': 11912640, 'steps': 62044, 'loss/train': 1.3943947553634644} 11/07/2021 05:59:03 - INFO - __main__ - Step 62046: {'lr': 0.0003229773234975327, 'samples': 11912832, 'steps': 62045, 'loss/train': 1.3836567401885986} 11/07/2021 05:59:04 - INFO - __main__ - Step 62047: {'lr': 0.0003229722478666513, 'samples': 11913024, 'steps': 62046, 'loss/train': 1.3229278326034546} 11/07/2021 05:59:04 - INFO - __main__ - Step 62048: {'lr': 0.0003229671722028898, 'samples': 11913216, 'steps': 62047, 'loss/train': 1.189399242401123} 11/07/2021 05:59:05 - INFO - __main__ - Step 62049: {'lr': 0.0003229620965062504, 'samples': 11913408, 'steps': 62048, 'loss/train': 1.4496954679489136} 11/07/2021 05:59:05 - INFO - __main__ - Step 62050: {'lr': 0.0003229570207767354, 'samples': 11913600, 'steps': 62049, 'loss/train': 1.295222520828247} 11/07/2021 05:59:05 - INFO - __main__ - Step 62051: {'lr': 0.0003229519450143471, 'samples': 11913792, 'steps': 62050, 'loss/train': 1.4727783203125} 11/07/2021 05:59:06 - INFO - __main__ - Step 62052: {'lr': 0.0003229468692190878, 'samples': 11913984, 'steps': 62051, 'loss/train': 1.5647094249725342} 11/07/2021 05:59:07 - INFO - __main__ - Step 62053: {'lr': 0.0003229417933909597, 'samples': 11914176, 'steps': 62052, 'loss/train': 1.5530482530593872} 11/07/2021 05:59:07 - INFO - __main__ - Step 62054: {'lr': 0.0003229367175299652, 'samples': 11914368, 'steps': 62053, 'loss/train': 1.4071860313415527} 11/07/2021 05:59:07 - INFO - __main__ - Step 62055: {'lr': 0.0003229316416361065, 'samples': 11914560, 'steps': 62054, 'loss/train': 1.5348197221755981} 11/07/2021 05:59:08 - INFO - __main__ - Step 62056: {'lr': 0.00032292656570938604, 'samples': 11914752, 'steps': 62055, 'loss/train': 1.6185989379882812} 11/07/2021 05:59:09 - INFO - __main__ - Step 62057: {'lr': 0.0003229214897498059, 'samples': 11914944, 'steps': 62056, 'loss/train': 1.3745324611663818} 11/07/2021 05:59:09 - INFO - __main__ - Step 62058: {'lr': 0.00032291641375736845, 'samples': 11915136, 'steps': 62057, 'loss/train': 1.388189435005188} 11/07/2021 05:59:10 - INFO - __main__ - Step 62059: {'lr': 0.00032291133773207603, 'samples': 11915328, 'steps': 62058, 'loss/train': 1.354575276374817} 11/07/2021 05:59:10 - INFO - __main__ - Step 62060: {'lr': 0.00032290626167393087, 'samples': 11915520, 'steps': 62059, 'loss/train': 2.402904748916626} 11/07/2021 05:59:10 - INFO - __main__ - Step 62061: {'lr': 0.00032290118558293525, 'samples': 11915712, 'steps': 62060, 'loss/train': 1.561341404914856} 11/07/2021 05:59:11 - INFO - __main__ - Step 62062: {'lr': 0.0003228961094590915, 'samples': 11915904, 'steps': 62061, 'loss/train': 1.5637913942337036} 11/07/2021 05:59:12 - INFO - __main__ - Step 62063: {'lr': 0.0003228910333024019, 'samples': 11916096, 'steps': 62062, 'loss/train': 0.8823919296264648} 11/07/2021 05:59:12 - INFO - __main__ - Step 62064: {'lr': 0.0003228859571128688, 'samples': 11916288, 'steps': 62063, 'loss/train': 1.5480313301086426} 11/07/2021 05:59:12 - INFO - __main__ - Step 62065: {'lr': 0.0003228808808904943, 'samples': 11916480, 'steps': 62064, 'loss/train': 1.4205304384231567} 11/07/2021 05:59:13 - INFO - __main__ - Step 62066: {'lr': 0.0003228758046352808, 'samples': 11916672, 'steps': 62065, 'loss/train': 1.773432731628418} 11/07/2021 05:59:13 - INFO - __main__ - Step 62067: {'lr': 0.0003228707283472306, 'samples': 11916864, 'steps': 62066, 'loss/train': 1.6218236684799194} 11/07/2021 05:59:14 - INFO - __main__ - Step 62068: {'lr': 0.000322865652026346, 'samples': 11917056, 'steps': 62067, 'loss/train': 1.4488346576690674} 11/07/2021 05:59:15 - INFO - __main__ - Step 62069: {'lr': 0.0003228605756726293, 'samples': 11917248, 'steps': 62068, 'loss/train': 1.6866248846054077} 11/07/2021 05:59:15 - INFO - __main__ - Step 62070: {'lr': 0.00032285549928608273, 'samples': 11917440, 'steps': 62069, 'loss/train': 2.5494933128356934} 11/07/2021 05:59:15 - INFO - __main__ - Step 62071: {'lr': 0.00032285042286670857, 'samples': 11917632, 'steps': 62070, 'loss/train': 1.6425361633300781} 11/07/2021 05:59:16 - INFO - __main__ - Step 62072: {'lr': 0.00032284534641450916, 'samples': 11917824, 'steps': 62071, 'loss/train': 1.562300443649292} 11/07/2021 05:59:16 - INFO - __main__ - Step 62073: {'lr': 0.00032284026992948666, 'samples': 11918016, 'steps': 62072, 'loss/train': 1.3726646900177002} 11/07/2021 05:59:17 - INFO - __main__ - Step 62074: {'lr': 0.0003228351934116436, 'samples': 11918208, 'steps': 62073, 'loss/train': 1.7623752355575562} 11/07/2021 05:59:18 - INFO - __main__ - Step 62075: {'lr': 0.000322830116860982, 'samples': 11918400, 'steps': 62074, 'loss/train': 1.0923092365264893} 11/07/2021 05:59:18 - INFO - __main__ - Step 62076: {'lr': 0.00032282504027750437, 'samples': 11918592, 'steps': 62075, 'loss/train': 2.5915541648864746} 11/07/2021 05:59:18 - INFO - __main__ - Step 62077: {'lr': 0.00032281996366121285, 'samples': 11918784, 'steps': 62076, 'loss/train': 1.3945598602294922} 11/07/2021 05:59:19 - INFO - __main__ - Step 62078: {'lr': 0.0003228148870121098, 'samples': 11918976, 'steps': 62077, 'loss/train': 1.9103196859359741} 11/07/2021 05:59:20 - INFO - __main__ - Step 62079: {'lr': 0.00032280981033019744, 'samples': 11919168, 'steps': 62078, 'loss/train': 1.4244332313537598} 11/07/2021 05:59:20 - INFO - __main__ - Step 62080: {'lr': 0.0003228047336154782, 'samples': 11919360, 'steps': 62079, 'loss/train': 1.5648815631866455} 11/07/2021 05:59:20 - INFO - __main__ - Step 62081: {'lr': 0.0003227996568679542, 'samples': 11919552, 'steps': 62080, 'loss/train': 1.6451821327209473} 11/07/2021 05:59:21 - INFO - __main__ - Step 62082: {'lr': 0.0003227945800876278, 'samples': 11919744, 'steps': 62081, 'loss/train': 1.625777006149292} 11/07/2021 05:59:21 - INFO - __main__ - Step 62083: {'lr': 0.0003227895032745013, 'samples': 11919936, 'steps': 62082, 'loss/train': 1.061529517173767} 11/07/2021 05:59:22 - INFO - __main__ - Step 62084: {'lr': 0.00032278442642857697, 'samples': 11920128, 'steps': 62083, 'loss/train': 1.0246723890304565} 11/07/2021 05:59:22 - INFO - __main__ - Step 62085: {'lr': 0.0003227793495498571, 'samples': 11920320, 'steps': 62084, 'loss/train': 1.450440526008606} 11/07/2021 05:59:23 - INFO - __main__ - Step 62086: {'lr': 0.000322774272638344, 'samples': 11920512, 'steps': 62085, 'loss/train': 1.6799771785736084} 11/07/2021 05:59:23 - INFO - __main__ - Step 62087: {'lr': 0.00032276919569403984, 'samples': 11920704, 'steps': 62086, 'loss/train': 0.7835772633552551} 11/07/2021 05:59:23 - INFO - __main__ - Step 62088: {'lr': 0.0003227641187169471, 'samples': 11920896, 'steps': 62087, 'loss/train': 1.4831663370132446} 11/07/2021 05:59:25 - INFO - __main__ - Step 62089: {'lr': 0.0003227590417070679, 'samples': 11921088, 'steps': 62088, 'loss/train': 1.709035873413086} 11/07/2021 05:59:25 - INFO - __main__ - Step 62090: {'lr': 0.0003227539646644048, 'samples': 11921280, 'steps': 62089, 'loss/train': 1.7957364320755005} 11/07/2021 05:59:25 - INFO - __main__ - Step 62091: {'lr': 0.00032274888758895967, 'samples': 11921472, 'steps': 62090, 'loss/train': 1.7591421604156494} 11/07/2021 05:59:26 - INFO - __main__ - Step 62092: {'lr': 0.00032274381048073505, 'samples': 11921664, 'steps': 62091, 'loss/train': 0.780302882194519} 11/07/2021 05:59:26 - INFO - __main__ - Step 62093: {'lr': 0.0003227387333397332, 'samples': 11921856, 'steps': 62092, 'loss/train': 1.3854526281356812} 11/07/2021 05:59:28 - INFO - __main__ - Step 62094: {'lr': 0.0003227336561659564, 'samples': 11922048, 'steps': 62093, 'loss/train': 1.477522373199463} 11/07/2021 05:59:28 - INFO - __main__ - Step 62095: {'lr': 0.000322728578959407, 'samples': 11922240, 'steps': 62094, 'loss/train': 1.668030023574829} 11/07/2021 05:59:29 - INFO - __main__ - Step 62096: {'lr': 0.0003227235017200872, 'samples': 11922432, 'steps': 62095, 'loss/train': 1.903845191001892} 11/07/2021 05:59:29 - INFO - __main__ - Step 62097: {'lr': 0.00032271842444799926, 'samples': 11922624, 'steps': 62096, 'loss/train': 1.8416446447372437} 11/07/2021 05:59:30 - INFO - __main__ - Step 62098: {'lr': 0.0003227133471431455, 'samples': 11922816, 'steps': 62097, 'loss/train': 1.8415181636810303} 11/07/2021 05:59:30 - INFO - __main__ - Step 62099: {'lr': 0.0003227082698055283, 'samples': 11923008, 'steps': 62098, 'loss/train': 0.9389302134513855} 11/07/2021 05:59:30 - INFO - __main__ - Step 62100: {'lr': 0.0003227031924351499, 'samples': 11923200, 'steps': 62099, 'loss/train': 1.285951018333435} 11/07/2021 05:59:31 - INFO - __main__ - Step 62101: {'lr': 0.00032269811503201246, 'samples': 11923392, 'steps': 62100, 'loss/train': 1.039847493171692} 11/07/2021 05:59:32 - INFO - __main__ - Step 62102: {'lr': 0.0003226930375961185, 'samples': 11923584, 'steps': 62101, 'loss/train': 1.3505854606628418} 11/07/2021 05:59:32 - INFO - __main__ - Step 62103: {'lr': 0.0003226879601274701, 'samples': 11923776, 'steps': 62102, 'loss/train': 1.396170735359192} 11/07/2021 05:59:32 - INFO - __main__ - Step 62104: {'lr': 0.0003226828826260696, 'samples': 11923968, 'steps': 62103, 'loss/train': 1.794682264328003} 11/07/2021 05:59:33 - INFO - __main__ - Step 62105: {'lr': 0.00032267780509191935, 'samples': 11924160, 'steps': 62104, 'loss/train': 1.3331573009490967} 11/07/2021 05:59:33 - INFO - __main__ - Step 62106: {'lr': 0.0003226727275250216, 'samples': 11924352, 'steps': 62105, 'loss/train': 1.497399926185608} 11/07/2021 05:59:34 - INFO - __main__ - Step 62107: {'lr': 0.0003226676499253786, 'samples': 11924544, 'steps': 62106, 'loss/train': 1.421413779258728} 11/07/2021 05:59:34 - INFO - __main__ - Step 62108: {'lr': 0.0003226625722929927, 'samples': 11924736, 'steps': 62107, 'loss/train': 1.5419210195541382} 11/07/2021 05:59:35 - INFO - __main__ - Step 62109: {'lr': 0.0003226574946278662, 'samples': 11924928, 'steps': 62108, 'loss/train': 1.0239914655685425} 11/07/2021 05:59:35 - INFO - __main__ - Step 62110: {'lr': 0.0003226524169300014, 'samples': 11925120, 'steps': 62109, 'loss/train': 1.421985387802124} 11/07/2021 05:59:35 - INFO - __main__ - Step 62111: {'lr': 0.00032264733919940046, 'samples': 11925312, 'steps': 62110, 'loss/train': 1.4355427026748657} 11/07/2021 05:59:36 - INFO - __main__ - Step 62112: {'lr': 0.00032264226143606577, 'samples': 11925504, 'steps': 62111, 'loss/train': 1.3821020126342773} 11/07/2021 05:59:37 - INFO - __main__ - Step 62113: {'lr': 0.0003226371836399996, 'samples': 11925696, 'steps': 62112, 'loss/train': 1.5289630889892578} 11/07/2021 05:59:37 - INFO - __main__ - Step 62114: {'lr': 0.00032263210581120425, 'samples': 11925888, 'steps': 62113, 'loss/train': 1.2788571119308472} 11/07/2021 05:59:38 - INFO - __main__ - Step 62115: {'lr': 0.000322627027949682, 'samples': 11926080, 'steps': 62114, 'loss/train': 1.3617141246795654} 11/07/2021 05:59:38 - INFO - __main__ - Step 62116: {'lr': 0.0003226219500554351, 'samples': 11926272, 'steps': 62115, 'loss/train': 1.3758994340896606} 11/07/2021 05:59:39 - INFO - __main__ - Step 62117: {'lr': 0.0003226168721284659, 'samples': 11926464, 'steps': 62116, 'loss/train': 1.6157293319702148} 11/07/2021 05:59:39 - INFO - __main__ - Step 62118: {'lr': 0.00032261179416877663, 'samples': 11926656, 'steps': 62117, 'loss/train': 0.9893162846565247} 11/07/2021 05:59:40 - INFO - __main__ - Step 62119: {'lr': 0.0003226067161763696, 'samples': 11926848, 'steps': 62118, 'loss/train': 1.4717767238616943} 11/07/2021 05:59:40 - INFO - __main__ - Step 62120: {'lr': 0.0003226016381512471, 'samples': 11927040, 'steps': 62119, 'loss/train': 1.5229684114456177} 11/07/2021 05:59:40 - INFO - __main__ - Step 62121: {'lr': 0.0003225965600934115, 'samples': 11927232, 'steps': 62120, 'loss/train': 1.3061890602111816} 11/07/2021 05:59:41 - INFO - __main__ - Step 62122: {'lr': 0.0003225914820028649, 'samples': 11927424, 'steps': 62121, 'loss/train': 1.661460280418396} 11/07/2021 05:59:42 - INFO - __main__ - Step 62123: {'lr': 0.0003225864038796098, 'samples': 11927616, 'steps': 62122, 'loss/train': 1.26640784740448} 11/07/2021 05:59:42 - INFO - __main__ - Step 62124: {'lr': 0.0003225813257236483, 'samples': 11927808, 'steps': 62123, 'loss/train': 1.5461128950119019} 11/07/2021 05:59:42 - INFO - __main__ - Step 62125: {'lr': 0.00032257624753498284, 'samples': 11928000, 'steps': 62124, 'loss/train': 1.250343680381775} 11/07/2021 05:59:43 - INFO - __main__ - Step 62126: {'lr': 0.00032257116931361555, 'samples': 11928192, 'steps': 62125, 'loss/train': 1.7326388359069824} 11/07/2021 05:59:43 - INFO - __main__ - Step 62127: {'lr': 0.00032256609105954894, 'samples': 11928384, 'steps': 62126, 'loss/train': 1.4259109497070312} 11/07/2021 05:59:44 - INFO - __main__ - Step 62128: {'lr': 0.0003225610127727851, 'samples': 11928576, 'steps': 62127, 'loss/train': 1.8567981719970703} 11/07/2021 05:59:45 - INFO - __main__ - Step 62129: {'lr': 0.00032255593445332644, 'samples': 11928768, 'steps': 62128, 'loss/train': 1.5566751956939697} 11/07/2021 05:59:45 - INFO - __main__ - Step 62130: {'lr': 0.0003225508561011751, 'samples': 11928960, 'steps': 62129, 'loss/train': 1.5098305940628052} 11/07/2021 05:59:45 - INFO - __main__ - Step 62131: {'lr': 0.0003225457777163335, 'samples': 11929152, 'steps': 62130, 'loss/train': 1.1909325122833252} 11/07/2021 05:59:46 - INFO - __main__ - Step 62132: {'lr': 0.00032254069929880393, 'samples': 11929344, 'steps': 62131, 'loss/train': 1.2939033508300781} 11/07/2021 05:59:46 - INFO - __main__ - Step 62133: {'lr': 0.0003225356208485886, 'samples': 11929536, 'steps': 62132, 'loss/train': 1.8755450248718262} 11/07/2021 05:59:47 - INFO - __main__ - Step 62134: {'lr': 0.00032253054236568987, 'samples': 11929728, 'steps': 62133, 'loss/train': 1.1956738233566284} 11/07/2021 05:59:48 - INFO - __main__ - Step 62135: {'lr': 0.00032252546385010995, 'samples': 11929920, 'steps': 62134, 'loss/train': 0.6145014762878418} 11/07/2021 05:59:48 - INFO - __main__ - Step 62136: {'lr': 0.0003225203853018512, 'samples': 11930112, 'steps': 62135, 'loss/train': 1.5339617729187012} 11/07/2021 05:59:48 - INFO - __main__ - Step 62137: {'lr': 0.00032251530672091597, 'samples': 11930304, 'steps': 62136, 'loss/train': 1.1743263006210327} 11/07/2021 05:59:49 - INFO - __main__ - Step 62138: {'lr': 0.00032251022810730635, 'samples': 11930496, 'steps': 62137, 'loss/train': 1.5870106220245361} 11/07/2021 05:59:50 - INFO - __main__ - Step 62139: {'lr': 0.0003225051494610248, 'samples': 11930688, 'steps': 62138, 'loss/train': 1.7674230337142944} 11/07/2021 05:59:50 - INFO - __main__ - Step 62140: {'lr': 0.00032250007078207343, 'samples': 11930880, 'steps': 62139, 'loss/train': 1.8535687923431396} 11/07/2021 05:59:50 - INFO - __main__ - Step 62141: {'lr': 0.00032249499207045475, 'samples': 11931072, 'steps': 62140, 'loss/train': 1.0144108533859253} 11/07/2021 05:59:51 - INFO - __main__ - Step 62142: {'lr': 0.00032248991332617095, 'samples': 11931264, 'steps': 62141, 'loss/train': 1.653807282447815} 11/07/2021 05:59:51 - INFO - __main__ - Step 62143: {'lr': 0.0003224848345492243, 'samples': 11931456, 'steps': 62142, 'loss/train': 1.636649250984192} 11/07/2021 05:59:52 - INFO - __main__ - Step 62144: {'lr': 0.0003224797557396171, 'samples': 11931648, 'steps': 62143, 'loss/train': 1.2844284772872925} 11/07/2021 05:59:52 - INFO - __main__ - Step 62145: {'lr': 0.00032247467689735165, 'samples': 11931840, 'steps': 62144, 'loss/train': 1.4100286960601807} 11/07/2021 05:59:53 - INFO - __main__ - Step 62146: {'lr': 0.0003224695980224302, 'samples': 11932032, 'steps': 62145, 'loss/train': 1.4362330436706543} 11/07/2021 05:59:53 - INFO - __main__ - Step 62147: {'lr': 0.00032246451911485506, 'samples': 11932224, 'steps': 62146, 'loss/train': 1.4630874395370483} 11/07/2021 05:59:54 - INFO - __main__ - Step 62148: {'lr': 0.00032245944017462856, 'samples': 11932416, 'steps': 62147, 'loss/train': 1.6102347373962402} 11/07/2021 05:59:55 - INFO - __main__ - Step 62149: {'lr': 0.00032245436120175293, 'samples': 11932608, 'steps': 62148, 'loss/train': 1.3772767782211304} 11/07/2021 05:59:56 - INFO - __main__ - Step 62150: {'lr': 0.00032244928219623056, 'samples': 11932800, 'steps': 62149, 'loss/train': 1.532142996788025} 11/07/2021 05:59:56 - INFO - __main__ - Step 62151: {'lr': 0.0003224442031580636, 'samples': 11932992, 'steps': 62150, 'loss/train': 1.6583315134048462} 11/07/2021 05:59:56 - INFO - __main__ - Step 62152: {'lr': 0.00032243912408725435, 'samples': 11933184, 'steps': 62151, 'loss/train': 1.5270122289657593} 11/07/2021 05:59:57 - INFO - __main__ - Step 62153: {'lr': 0.00032243404498380517, 'samples': 11933376, 'steps': 62152, 'loss/train': 1.3804429769515991} 11/07/2021 05:59:57 - INFO - __main__ - Step 62154: {'lr': 0.00032242896584771836, 'samples': 11933568, 'steps': 62153, 'loss/train': 2.0568268299102783} 11/07/2021 05:59:57 - INFO - __main__ - Step 62155: {'lr': 0.00032242388667899614, 'samples': 11933760, 'steps': 62154, 'loss/train': 3.2064177989959717} 11/07/2021 05:59:58 - INFO - __main__ - Step 62156: {'lr': 0.00032241880747764084, 'samples': 11933952, 'steps': 62155, 'loss/train': 1.7309250831604004} 11/07/2021 05:59:59 - INFO - __main__ - Step 62157: {'lr': 0.00032241372824365485, 'samples': 11934144, 'steps': 62156, 'loss/train': 2.0724711418151855} 11/07/2021 05:59:59 - INFO - __main__ - Step 62158: {'lr': 0.00032240864897704023, 'samples': 11934336, 'steps': 62157, 'loss/train': 1.576859951019287} 11/07/2021 05:59:59 - INFO - __main__ - Step 62159: {'lr': 0.0003224035696777994, 'samples': 11934528, 'steps': 62158, 'loss/train': 1.448672890663147} 11/07/2021 06:00:00 - INFO - __main__ - Step 62160: {'lr': 0.0003223984903459347, 'samples': 11934720, 'steps': 62159, 'loss/train': 1.7515509128570557} 11/07/2021 06:00:01 - INFO - __main__ - Step 62161: {'lr': 0.0003223934109814483, 'samples': 11934912, 'steps': 62160, 'loss/train': 1.2422280311584473} 11/07/2021 06:00:01 - INFO - __main__ - Step 62162: {'lr': 0.00032238833158434256, 'samples': 11935104, 'steps': 62161, 'loss/train': 1.557267189025879} 11/07/2021 06:00:02 - INFO - __main__ - Step 62163: {'lr': 0.0003223832521546198, 'samples': 11935296, 'steps': 62162, 'loss/train': 1.7055078744888306} 11/07/2021 06:00:02 - INFO - __main__ - Step 62164: {'lr': 0.00032237817269228225, 'samples': 11935488, 'steps': 62163, 'loss/train': 1.799223780632019} 11/07/2021 06:00:02 - INFO - __main__ - Step 62165: {'lr': 0.0003223730931973322, 'samples': 11935680, 'steps': 62164, 'loss/train': 2.1427528858184814} 11/07/2021 06:00:03 - INFO - __main__ - Step 62166: {'lr': 0.0003223680136697719, 'samples': 11935872, 'steps': 62165, 'loss/train': 1.524526834487915} 11/07/2021 06:00:04 - INFO - __main__ - Step 62167: {'lr': 0.0003223629341096037, 'samples': 11936064, 'steps': 62166, 'loss/train': 1.9467236995697021} 11/07/2021 06:00:04 - INFO - __main__ - Step 62168: {'lr': 0.0003223578545168299, 'samples': 11936256, 'steps': 62167, 'loss/train': 1.4575132131576538} 11/07/2021 06:00:04 - INFO - __main__ - Step 62169: {'lr': 0.0003223527748914528, 'samples': 11936448, 'steps': 62168, 'loss/train': 2.0065438747406006} 11/07/2021 06:00:05 - INFO - __main__ - Step 62170: {'lr': 0.0003223476952334747, 'samples': 11936640, 'steps': 62169, 'loss/train': 1.4502456188201904} 11/07/2021 06:00:05 - INFO - __main__ - Step 62171: {'lr': 0.0003223426155428977, 'samples': 11936832, 'steps': 62170, 'loss/train': 1.2023667097091675} 11/07/2021 06:00:06 - INFO - __main__ - Step 62172: {'lr': 0.0003223375358197244, 'samples': 11937024, 'steps': 62171, 'loss/train': 1.5157071352005005} 11/07/2021 06:00:07 - INFO - __main__ - Step 62173: {'lr': 0.00032233245606395677, 'samples': 11937216, 'steps': 62172, 'loss/train': 1.896994948387146} 11/07/2021 06:00:07 - INFO - __main__ - Step 62174: {'lr': 0.00032232737627559734, 'samples': 11937408, 'steps': 62173, 'loss/train': 1.429111123085022} 11/07/2021 06:00:07 - INFO - __main__ - Step 62175: {'lr': 0.00032232229645464826, 'samples': 11937600, 'steps': 62174, 'loss/train': 1.776868224143982} 11/07/2021 06:00:08 - INFO - __main__ - Step 62176: {'lr': 0.0003223172166011119, 'samples': 11937792, 'steps': 62175, 'loss/train': 0.8143520355224609} 11/07/2021 06:00:09 - INFO - __main__ - Step 62177: {'lr': 0.00032231213671499057, 'samples': 11937984, 'steps': 62176, 'loss/train': 1.8354641199111938} 11/07/2021 06:00:09 - INFO - __main__ - Step 62178: {'lr': 0.0003223070567962864, 'samples': 11938176, 'steps': 62177, 'loss/train': 2.7669684886932373} 11/07/2021 06:00:09 - INFO - __main__ - Step 62179: {'lr': 0.00032230197684500185, 'samples': 11938368, 'steps': 62178, 'loss/train': 1.4976438283920288} 11/07/2021 06:00:10 - INFO - __main__ - Step 62180: {'lr': 0.0003222968968611391, 'samples': 11938560, 'steps': 62179, 'loss/train': 1.6665229797363281} 11/07/2021 06:00:10 - INFO - __main__ - Step 62181: {'lr': 0.00032229181684470054, 'samples': 11938752, 'steps': 62180, 'loss/train': 1.3118454217910767} 11/07/2021 06:00:11 - INFO - __main__ - Step 62182: {'lr': 0.00032228673679568834, 'samples': 11938944, 'steps': 62181, 'loss/train': 1.8235739469528198} 11/07/2021 06:00:11 - INFO - __main__ - Step 62183: {'lr': 0.00032228165671410486, 'samples': 11939136, 'steps': 62182, 'loss/train': 0.683228075504303} 11/07/2021 06:00:12 - INFO - __main__ - Step 62184: {'lr': 0.00032227657659995244, 'samples': 11939328, 'steps': 62183, 'loss/train': 1.3874784708023071} 11/07/2021 06:00:12 - INFO - __main__ - Step 62185: {'lr': 0.0003222714964532333, 'samples': 11939520, 'steps': 62184, 'loss/train': 1.469804048538208} 11/07/2021 06:00:13 - INFO - __main__ - Step 62186: {'lr': 0.0003222664162739497, 'samples': 11939712, 'steps': 62185, 'loss/train': 1.20376455783844} 11/07/2021 06:00:13 - INFO - __main__ - Step 62187: {'lr': 0.000322261336062104, 'samples': 11939904, 'steps': 62186, 'loss/train': 1.4525421857833862} 11/07/2021 06:00:14 - INFO - __main__ - Step 62188: {'lr': 0.00032225625581769844, 'samples': 11940096, 'steps': 62187, 'loss/train': 1.2404707670211792} 11/07/2021 06:00:14 - INFO - __main__ - Step 62189: {'lr': 0.0003222511755407353, 'samples': 11940288, 'steps': 62188, 'loss/train': 1.2399481534957886} 11/07/2021 06:00:15 - INFO - __main__ - Step 62190: {'lr': 0.000322246095231217, 'samples': 11940480, 'steps': 62189, 'loss/train': 1.3816452026367188} 11/07/2021 06:00:15 - INFO - __main__ - Step 62191: {'lr': 0.00032224101488914566, 'samples': 11940672, 'steps': 62190, 'loss/train': 0.352179616689682} 11/07/2021 06:00:15 - INFO - __main__ - Step 62192: {'lr': 0.0003222359345145236, 'samples': 11940864, 'steps': 62191, 'loss/train': 1.6278325319290161} 11/07/2021 06:00:16 - INFO - __main__ - Step 62193: {'lr': 0.00032223085410735316, 'samples': 11941056, 'steps': 62192, 'loss/train': 1.3308560848236084} 11/07/2021 06:00:17 - INFO - __main__ - Step 62194: {'lr': 0.0003222257736676366, 'samples': 11941248, 'steps': 62193, 'loss/train': 1.5505620241165161} 11/07/2021 06:00:17 - INFO - __main__ - Step 62195: {'lr': 0.0003222206931953762, 'samples': 11941440, 'steps': 62194, 'loss/train': 1.4487247467041016} 11/07/2021 06:00:17 - INFO - __main__ - Step 62196: {'lr': 0.0003222156126905743, 'samples': 11941632, 'steps': 62195, 'loss/train': 1.5724740028381348} 11/07/2021 06:00:18 - INFO - __main__ - Step 62197: {'lr': 0.0003222105321532333, 'samples': 11941824, 'steps': 62196, 'loss/train': 1.5784674882888794} 11/07/2021 06:00:19 - INFO - __main__ - Step 62198: {'lr': 0.0003222054515833551, 'samples': 11942016, 'steps': 62197, 'loss/train': 1.7332839965820312} 11/07/2021 06:00:19 - INFO - __main__ - Step 62199: {'lr': 0.0003222003709809424, 'samples': 11942208, 'steps': 62198, 'loss/train': 1.0498515367507935} 11/07/2021 06:00:20 - INFO - __main__ - Step 62200: {'lr': 0.00032219529034599725, 'samples': 11942400, 'steps': 62199, 'loss/train': 1.4764364957809448} 11/07/2021 06:00:20 - INFO - __main__ - Step 62201: {'lr': 0.000322190209678522, 'samples': 11942592, 'steps': 62200, 'loss/train': 1.5207912921905518} 11/07/2021 06:00:20 - INFO - __main__ - Step 62202: {'lr': 0.00032218512897851906, 'samples': 11942784, 'steps': 62201, 'loss/train': 1.3907841444015503} 11/07/2021 06:00:21 - INFO - __main__ - Step 62203: {'lr': 0.00032218004824599057, 'samples': 11942976, 'steps': 62202, 'loss/train': 1.467451572418213} 11/07/2021 06:00:22 - INFO - __main__ - Step 62204: {'lr': 0.0003221749674809389, 'samples': 11943168, 'steps': 62203, 'loss/train': 1.4183214902877808} 11/07/2021 06:00:22 - INFO - __main__ - Step 62205: {'lr': 0.00032216988668336624, 'samples': 11943360, 'steps': 62204, 'loss/train': 1.5991114377975464} 11/07/2021 06:00:22 - INFO - __main__ - Step 62206: {'lr': 0.0003221648058532749, 'samples': 11943552, 'steps': 62205, 'loss/train': 1.561563491821289} 11/07/2021 06:00:23 - INFO - __main__ - Step 62207: {'lr': 0.00032215972499066725, 'samples': 11943744, 'steps': 62206, 'loss/train': 1.7333014011383057} 11/07/2021 06:00:24 - INFO - __main__ - Step 62208: {'lr': 0.00032215464409554557, 'samples': 11943936, 'steps': 62207, 'loss/train': 1.7598669528961182} 11/07/2021 06:00:24 - INFO - __main__ - Step 62209: {'lr': 0.00032214956316791213, 'samples': 11944128, 'steps': 62208, 'loss/train': 1.4627665281295776} 11/07/2021 06:00:24 - INFO - __main__ - Step 62210: {'lr': 0.00032214448220776917, 'samples': 11944320, 'steps': 62209, 'loss/train': 1.1402077674865723} 11/07/2021 06:00:25 - INFO - __main__ - Step 62211: {'lr': 0.0003221394012151191, 'samples': 11944512, 'steps': 62210, 'loss/train': 1.5884208679199219} 11/07/2021 06:00:25 - INFO - __main__ - Step 62212: {'lr': 0.000322134320189964, 'samples': 11944704, 'steps': 62211, 'loss/train': 1.7792161703109741} 11/07/2021 06:00:26 - INFO - __main__ - Step 62213: {'lr': 0.0003221292391323064, 'samples': 11944896, 'steps': 62212, 'loss/train': 1.7669247388839722} 11/07/2021 06:00:26 - INFO - __main__ - Step 62214: {'lr': 0.00032212415804214845, 'samples': 11945088, 'steps': 62213, 'loss/train': 1.5140823125839233} 11/07/2021 06:00:27 - INFO - __main__ - Step 62215: {'lr': 0.00032211907691949237, 'samples': 11945280, 'steps': 62214, 'loss/train': 1.7081862688064575} 11/07/2021 06:00:27 - INFO - __main__ - Step 62216: {'lr': 0.0003221139957643406, 'samples': 11945472, 'steps': 62215, 'loss/train': 1.8853799104690552} 11/07/2021 06:00:28 - INFO - __main__ - Step 62217: {'lr': 0.00032210891457669556, 'samples': 11945664, 'steps': 62216, 'loss/train': 1.5169610977172852} 11/07/2021 06:00:29 - INFO - __main__ - Step 62218: {'lr': 0.0003221038333565591, 'samples': 11945856, 'steps': 62217, 'loss/train': 1.4632009267807007} 11/07/2021 06:00:29 - INFO - __main__ - Step 62219: {'lr': 0.0003220987521039339, 'samples': 11946048, 'steps': 62218, 'loss/train': 1.516819715499878} 11/07/2021 06:00:29 - INFO - __main__ - Step 62220: {'lr': 0.00032209367081882206, 'samples': 11946240, 'steps': 62219, 'loss/train': 1.3271254301071167} 11/07/2021 06:00:30 - INFO - __main__ - Step 62221: {'lr': 0.000322088589501226, 'samples': 11946432, 'steps': 62220, 'loss/train': 1.2813365459442139} 11/07/2021 06:00:30 - INFO - __main__ - Step 62222: {'lr': 0.00032208350815114787, 'samples': 11946624, 'steps': 62221, 'loss/train': 1.7972697019577026} 11/07/2021 06:00:30 - INFO - __main__ - Step 62223: {'lr': 0.00032207842676859, 'samples': 11946816, 'steps': 62222, 'loss/train': 1.3801666498184204} 11/07/2021 06:00:32 - INFO - __main__ - Step 62224: {'lr': 0.0003220733453535548, 'samples': 11947008, 'steps': 62223, 'loss/train': 1.0154740810394287} 11/07/2021 06:00:32 - INFO - __main__ - Step 62225: {'lr': 0.0003220682639060444, 'samples': 11947200, 'steps': 62224, 'loss/train': 1.3241146802902222} 11/07/2021 06:00:33 - INFO - __main__ - Step 62226: {'lr': 0.00032206318242606116, 'samples': 11947392, 'steps': 62225, 'loss/train': 0.4414569139480591} 11/07/2021 06:00:33 - INFO - __main__ - Step 62227: {'lr': 0.00032205810091360734, 'samples': 11947584, 'steps': 62226, 'loss/train': 0.9429555535316467} 11/07/2021 06:00:33 - INFO - __main__ - Step 62228: {'lr': 0.00032205301936868525, 'samples': 11947776, 'steps': 62227, 'loss/train': 1.3356339931488037} 11/07/2021 06:00:35 - INFO - __main__ - Step 62229: {'lr': 0.00032204793779129715, 'samples': 11947968, 'steps': 62228, 'loss/train': 1.9647600650787354} 11/07/2021 06:00:35 - INFO - __main__ - Step 62230: {'lr': 0.00032204285618144543, 'samples': 11948160, 'steps': 62229, 'loss/train': 1.8121614456176758} 11/07/2021 06:00:35 - INFO - __main__ - Step 62231: {'lr': 0.0003220377745391323, 'samples': 11948352, 'steps': 62230, 'loss/train': 1.7827095985412598} 11/07/2021 06:00:36 - INFO - __main__ - Step 62232: {'lr': 0.00032203269286436005, 'samples': 11948544, 'steps': 62231, 'loss/train': 1.4822990894317627} 11/07/2021 06:00:36 - INFO - __main__ - Step 62233: {'lr': 0.000322027611157131, 'samples': 11948736, 'steps': 62232, 'loss/train': 1.2873740196228027} 11/07/2021 06:00:36 - INFO - __main__ - Step 62234: {'lr': 0.00032202252941744737, 'samples': 11948928, 'steps': 62233, 'loss/train': 1.4082040786743164} 11/07/2021 06:00:38 - INFO - __main__ - Step 62235: {'lr': 0.00032201744764531157, 'samples': 11949120, 'steps': 62234, 'loss/train': 1.5601204633712769} 11/07/2021 06:00:38 - INFO - __main__ - Step 62236: {'lr': 0.00032201236584072576, 'samples': 11949312, 'steps': 62235, 'loss/train': 1.8611047267913818} 11/07/2021 06:00:38 - INFO - __main__ - Step 62237: {'lr': 0.00032200728400369233, 'samples': 11949504, 'steps': 62236, 'loss/train': 1.254782795906067} 11/07/2021 06:00:39 - INFO - __main__ - Step 62238: {'lr': 0.0003220022021342135, 'samples': 11949696, 'steps': 62237, 'loss/train': 1.4337131977081299} 11/07/2021 06:00:39 - INFO - __main__ - Step 62239: {'lr': 0.00032199712023229154, 'samples': 11949888, 'steps': 62238, 'loss/train': 1.0762704610824585} 11/07/2021 06:00:40 - INFO - __main__ - Step 62240: {'lr': 0.0003219920382979289, 'samples': 11950080, 'steps': 62239, 'loss/train': 1.4533580541610718} 11/07/2021 06:00:40 - INFO - __main__ - Step 62241: {'lr': 0.0003219869563311277, 'samples': 11950272, 'steps': 62240, 'loss/train': 1.7883936166763306} 11/07/2021 06:00:41 - INFO - __main__ - Step 62242: {'lr': 0.00032198187433189025, 'samples': 11950464, 'steps': 62241, 'loss/train': 1.5666615962982178} 11/07/2021 06:00:41 - INFO - __main__ - Step 62243: {'lr': 0.00032197679230021894, 'samples': 11950656, 'steps': 62242, 'loss/train': 1.4360456466674805} 11/07/2021 06:00:41 - INFO - __main__ - Step 62244: {'lr': 0.000321971710236116, 'samples': 11950848, 'steps': 62243, 'loss/train': 1.2677775621414185} 11/07/2021 06:00:42 - INFO - __main__ - Step 62245: {'lr': 0.00032196662813958367, 'samples': 11951040, 'steps': 62244, 'loss/train': 1.2663824558258057} 11/07/2021 06:00:43 - INFO - __main__ - Step 62246: {'lr': 0.0003219615460106243, 'samples': 11951232, 'steps': 62245, 'loss/train': 1.5417560338974} 11/07/2021 06:00:43 - INFO - __main__ - Step 62247: {'lr': 0.0003219564638492402, 'samples': 11951424, 'steps': 62246, 'loss/train': 1.3199471235275269} 11/07/2021 06:00:43 - INFO - __main__ - Step 62248: {'lr': 0.0003219513816554336, 'samples': 11951616, 'steps': 62247, 'loss/train': 1.800362467765808} 11/07/2021 06:00:44 - INFO - __main__ - Step 62249: {'lr': 0.00032194629942920684, 'samples': 11951808, 'steps': 62248, 'loss/train': 1.1387205123901367} 11/07/2021 06:00:45 - INFO - __main__ - Step 62250: {'lr': 0.0003219412171705622, 'samples': 11952000, 'steps': 62249, 'loss/train': 1.6221137046813965} 11/07/2021 06:00:45 - INFO - __main__ - Step 62251: {'lr': 0.0003219361348795019, 'samples': 11952192, 'steps': 62250, 'loss/train': 1.429417610168457} 11/07/2021 06:00:46 - INFO - __main__ - Step 62252: {'lr': 0.00032193105255602834, 'samples': 11952384, 'steps': 62251, 'loss/train': 1.2793004512786865} 11/07/2021 06:00:46 - INFO - __main__ - Step 62253: {'lr': 0.00032192597020014367, 'samples': 11952576, 'steps': 62252, 'loss/train': 1.476012945175171} 11/07/2021 06:00:46 - INFO - __main__ - Step 62254: {'lr': 0.00032192088781185036, 'samples': 11952768, 'steps': 62253, 'loss/train': 1.4012712240219116} 11/07/2021 06:00:47 - INFO - __main__ - Step 62255: {'lr': 0.0003219158053911506, 'samples': 11952960, 'steps': 62254, 'loss/train': 1.5862665176391602} 11/07/2021 06:00:48 - INFO - __main__ - Step 62256: {'lr': 0.0003219107229380467, 'samples': 11953152, 'steps': 62255, 'loss/train': 1.6648285388946533} 11/07/2021 06:00:48 - INFO - __main__ - Step 62257: {'lr': 0.00032190564045254087, 'samples': 11953344, 'steps': 62256, 'loss/train': 1.3629717826843262} 11/07/2021 06:00:48 - INFO - __main__ - Step 62258: {'lr': 0.0003219005579346355, 'samples': 11953536, 'steps': 62257, 'loss/train': 1.4822603464126587} 11/07/2021 06:00:49 - INFO - __main__ - Step 62259: {'lr': 0.0003218954753843329, 'samples': 11953728, 'steps': 62258, 'loss/train': 1.2832096815109253} 11/07/2021 06:00:49 - INFO - __main__ - Step 62260: {'lr': 0.0003218903928016352, 'samples': 11953920, 'steps': 62259, 'loss/train': 1.801827311515808} 11/07/2021 06:00:50 - INFO - __main__ - Step 62261: {'lr': 0.00032188531018654496, 'samples': 11954112, 'steps': 62260, 'loss/train': 1.1506352424621582} 11/07/2021 06:00:51 - INFO - __main__ - Step 62262: {'lr': 0.0003218802275390642, 'samples': 11954304, 'steps': 62261, 'loss/train': 1.327998399734497} 11/07/2021 06:00:51 - INFO - __main__ - Step 62263: {'lr': 0.00032187514485919534, 'samples': 11954496, 'steps': 62262, 'loss/train': 0.6937238574028015} 11/07/2021 06:00:51 - INFO - __main__ - Step 62264: {'lr': 0.00032187006214694057, 'samples': 11954688, 'steps': 62263, 'loss/train': 1.4406298398971558} 11/07/2021 06:00:52 - INFO - __main__ - Step 62265: {'lr': 0.00032186497940230236, 'samples': 11954880, 'steps': 62264, 'loss/train': 1.3937427997589111} 11/07/2021 06:00:53 - INFO - __main__ - Step 62266: {'lr': 0.00032185989662528294, 'samples': 11955072, 'steps': 62265, 'loss/train': 1.5524076223373413} 11/07/2021 06:00:53 - INFO - __main__ - Step 62267: {'lr': 0.0003218548138158844, 'samples': 11955264, 'steps': 62266, 'loss/train': 1.084521770477295} 11/07/2021 06:00:53 - INFO - __main__ - Step 62268: {'lr': 0.0003218497309741093, 'samples': 11955456, 'steps': 62267, 'loss/train': 5.596930027008057} 11/07/2021 06:00:54 - INFO - __main__ - Step 62269: {'lr': 0.00032184464809995977, 'samples': 11955648, 'steps': 62268, 'loss/train': 1.5097105503082275} 11/07/2021 06:00:54 - INFO - __main__ - Step 62270: {'lr': 0.00032183956519343815, 'samples': 11955840, 'steps': 62269, 'loss/train': 1.2526121139526367} 11/07/2021 06:00:55 - INFO - __main__ - Step 62271: {'lr': 0.00032183448225454674, 'samples': 11956032, 'steps': 62270, 'loss/train': 1.0813474655151367} 11/07/2021 06:00:56 - INFO - __main__ - Step 62272: {'lr': 0.0003218293992832879, 'samples': 11956224, 'steps': 62271, 'loss/train': 1.5682425498962402} 11/07/2021 06:00:56 - INFO - __main__ - Step 62273: {'lr': 0.0003218243162796638, 'samples': 11956416, 'steps': 62272, 'loss/train': 1.9536798000335693} 11/07/2021 06:00:56 - INFO - __main__ - Step 62274: {'lr': 0.00032181923324367675, 'samples': 11956608, 'steps': 62273, 'loss/train': 1.4467296600341797} 11/07/2021 06:00:57 - INFO - __main__ - Step 62275: {'lr': 0.000321814150175329, 'samples': 11956800, 'steps': 62274, 'loss/train': 1.5461552143096924} 11/07/2021 06:00:57 - INFO - __main__ - Step 62276: {'lr': 0.000321809067074623, 'samples': 11956992, 'steps': 62275, 'loss/train': 1.2083916664123535} 11/07/2021 06:00:58 - INFO - __main__ - Step 62277: {'lr': 0.00032180398394156083, 'samples': 11957184, 'steps': 62276, 'loss/train': 1.6954283714294434} 11/07/2021 06:00:58 - INFO - __main__ - Step 62278: {'lr': 0.00032179890077614506, 'samples': 11957376, 'steps': 62277, 'loss/train': 1.2788339853286743} 11/07/2021 06:00:59 - INFO - __main__ - Step 62279: {'lr': 0.00032179381757837773, 'samples': 11957568, 'steps': 62278, 'loss/train': 1.2684471607208252} 11/07/2021 06:00:59 - INFO - __main__ - Step 62280: {'lr': 0.00032178873434826117, 'samples': 11957760, 'steps': 62279, 'loss/train': 1.3235187530517578} 11/07/2021 06:01:00 - INFO - __main__ - Step 62281: {'lr': 0.00032178365108579776, 'samples': 11957952, 'steps': 62280, 'loss/train': 2.181682825088501} 11/07/2021 06:01:00 - INFO - __main__ - Step 62282: {'lr': 0.0003217785677909897, 'samples': 11958144, 'steps': 62281, 'loss/train': 1.7529807090759277} 11/07/2021 06:01:01 - INFO - __main__ - Step 62283: {'lr': 0.00032177348446383935, 'samples': 11958336, 'steps': 62282, 'loss/train': 1.902503490447998} 11/07/2021 06:01:01 - INFO - __main__ - Step 62284: {'lr': 0.000321768401104349, 'samples': 11958528, 'steps': 62283, 'loss/train': 0.9813767671585083} 11/07/2021 06:01:01 - INFO - __main__ - Step 62285: {'lr': 0.0003217633177125209, 'samples': 11958720, 'steps': 62284, 'loss/train': 1.1798211336135864} 11/07/2021 06:01:02 - INFO - __main__ - Step 62286: {'lr': 0.0003217582342883574, 'samples': 11958912, 'steps': 62285, 'loss/train': 1.5203356742858887} 11/07/2021 06:01:03 - INFO - __main__ - Step 62287: {'lr': 0.0003217531508318607, 'samples': 11959104, 'steps': 62286, 'loss/train': 1.8967620134353638} 11/07/2021 06:01:03 - INFO - __main__ - Step 62288: {'lr': 0.00032174806734303307, 'samples': 11959296, 'steps': 62287, 'loss/train': 1.0010137557983398} 11/07/2021 06:01:04 - INFO - __main__ - Step 62289: {'lr': 0.00032174298382187696, 'samples': 11959488, 'steps': 62288, 'loss/train': 0.8161570429801941} 11/07/2021 06:01:04 - INFO - __main__ - Step 62290: {'lr': 0.00032173790026839455, 'samples': 11959680, 'steps': 62289, 'loss/train': 1.264140009880066} 11/07/2021 06:01:04 - INFO - __main__ - Step 62291: {'lr': 0.0003217328166825882, 'samples': 11959872, 'steps': 62290, 'loss/train': 0.46105247735977173} 11/07/2021 06:01:05 - INFO - __main__ - Step 62292: {'lr': 0.00032172773306446005, 'samples': 11960064, 'steps': 62291, 'loss/train': 1.001259207725525} 11/07/2021 06:01:06 - INFO - __main__ - Step 62293: {'lr': 0.0003217226494140125, 'samples': 11960256, 'steps': 62292, 'loss/train': 1.3950756788253784} 11/07/2021 06:01:06 - INFO - __main__ - Step 62294: {'lr': 0.0003217175657312479, 'samples': 11960448, 'steps': 62293, 'loss/train': 1.3553504943847656} 11/07/2021 06:01:06 - INFO - __main__ - Step 62295: {'lr': 0.00032171248201616845, 'samples': 11960640, 'steps': 62294, 'loss/train': 0.6372998952865601} 11/07/2021 06:01:07 - INFO - __main__ - Step 62296: {'lr': 0.0003217073982687764, 'samples': 11960832, 'steps': 62295, 'loss/train': 1.7139816284179688} 11/07/2021 06:01:08 - INFO - __main__ - Step 62297: {'lr': 0.00032170231448907415, 'samples': 11961024, 'steps': 62296, 'loss/train': 1.3011888265609741} 11/07/2021 06:01:08 - INFO - __main__ - Step 62298: {'lr': 0.000321697230677064, 'samples': 11961216, 'steps': 62297, 'loss/train': 1.5688661336898804} 11/07/2021 06:01:08 - INFO - __main__ - Step 62299: {'lr': 0.00032169214683274816, 'samples': 11961408, 'steps': 62298, 'loss/train': 1.5491338968276978} 11/07/2021 06:01:09 - INFO - __main__ - Step 62300: {'lr': 0.00032168706295612894, 'samples': 11961600, 'steps': 62299, 'loss/train': 1.191232681274414} 11/07/2021 06:01:09 - INFO - __main__ - Step 62301: {'lr': 0.0003216819790472085, 'samples': 11961792, 'steps': 62300, 'loss/train': 1.2156633138656616} 11/07/2021 06:01:10 - INFO - __main__ - Step 62302: {'lr': 0.0003216768951059894, 'samples': 11961984, 'steps': 62301, 'loss/train': 0.9970032572746277} 11/07/2021 06:01:11 - INFO - __main__ - Step 62303: {'lr': 0.0003216718111324738, 'samples': 11962176, 'steps': 62302, 'loss/train': 1.6318626403808594} 11/07/2021 06:01:11 - INFO - __main__ - Step 62304: {'lr': 0.00032166672712666397, 'samples': 11962368, 'steps': 62303, 'loss/train': 1.2841179370880127} 11/07/2021 06:01:11 - INFO - __main__ - Step 62305: {'lr': 0.0003216616430885622, 'samples': 11962560, 'steps': 62304, 'loss/train': 1.3508998155593872} 11/07/2021 06:01:12 - INFO - __main__ - Step 62306: {'lr': 0.0003216565590181708, 'samples': 11962752, 'steps': 62305, 'loss/train': 1.0414454936981201} 11/07/2021 06:01:13 - INFO - __main__ - Step 62307: {'lr': 0.0003216514749154921, 'samples': 11962944, 'steps': 62306, 'loss/train': 1.4238158464431763} 11/07/2021 06:01:13 - INFO - __main__ - Step 62308: {'lr': 0.0003216463907805283, 'samples': 11963136, 'steps': 62307, 'loss/train': 1.3521634340286255} 11/07/2021 06:01:13 - INFO - __main__ - Step 62309: {'lr': 0.0003216413066132818, 'samples': 11963328, 'steps': 62308, 'loss/train': 1.2107762098312378} 11/07/2021 06:01:14 - INFO - __main__ - Step 62310: {'lr': 0.00032163622241375477, 'samples': 11963520, 'steps': 62309, 'loss/train': 1.222054123878479} 11/07/2021 06:01:14 - INFO - __main__ - Step 62311: {'lr': 0.0003216311381819496, 'samples': 11963712, 'steps': 62310, 'loss/train': 1.2332404851913452} 11/07/2021 06:01:14 - INFO - __main__ - Step 62312: {'lr': 0.00032162605391786853, 'samples': 11963904, 'steps': 62311, 'loss/train': 1.8746639490127563} 11/07/2021 06:01:15 - INFO - __main__ - Step 62313: {'lr': 0.0003216209696215139, 'samples': 11964096, 'steps': 62312, 'loss/train': 2.024770498275757} 11/07/2021 06:01:16 - INFO - __main__ - Step 62314: {'lr': 0.0003216158852928879, 'samples': 11964288, 'steps': 62313, 'loss/train': 1.9051958322525024} 11/07/2021 06:01:16 - INFO - __main__ - Step 62315: {'lr': 0.00032161080093199293, 'samples': 11964480, 'steps': 62314, 'loss/train': 1.4154891967773438} 11/07/2021 06:01:16 - INFO - __main__ - Step 62316: {'lr': 0.0003216057165388312, 'samples': 11964672, 'steps': 62315, 'loss/train': 0.9609681367874146} 11/07/2021 06:01:17 - INFO - __main__ - Step 62317: {'lr': 0.0003216006321134051, 'samples': 11964864, 'steps': 62316, 'loss/train': 1.6904473304748535} 11/07/2021 06:01:18 - INFO - __main__ - Step 62318: {'lr': 0.0003215955476557169, 'samples': 11965056, 'steps': 62317, 'loss/train': 1.8222298622131348} 11/07/2021 06:01:18 - INFO - __main__ - Step 62319: {'lr': 0.0003215904631657687, 'samples': 11965248, 'steps': 62318, 'loss/train': 1.5291094779968262} 11/07/2021 06:01:19 - INFO - __main__ - Step 62320: {'lr': 0.00032158537864356306, 'samples': 11965440, 'steps': 62319, 'loss/train': 1.4574264287948608} 11/07/2021 06:01:19 - INFO - __main__ - Step 62321: {'lr': 0.0003215802940891021, 'samples': 11965632, 'steps': 62320, 'loss/train': 1.1403807401657104} 11/07/2021 06:01:19 - INFO - __main__ - Step 62322: {'lr': 0.00032157520950238814, 'samples': 11965824, 'steps': 62321, 'loss/train': 1.2880257368087769} 11/07/2021 06:01:20 - INFO - __main__ - Step 62323: {'lr': 0.00032157012488342356, 'samples': 11966016, 'steps': 62322, 'loss/train': 1.641842007637024} 11/07/2021 06:01:21 - INFO - __main__ - Step 62324: {'lr': 0.0003215650402322106, 'samples': 11966208, 'steps': 62323, 'loss/train': 1.3787118196487427} 11/07/2021 06:01:21 - INFO - __main__ - Step 62325: {'lr': 0.0003215599555487515, 'samples': 11966400, 'steps': 62324, 'loss/train': 1.133534550666809} 11/07/2021 06:01:21 - INFO - __main__ - Step 62326: {'lr': 0.00032155487083304857, 'samples': 11966592, 'steps': 62325, 'loss/train': 1.988978385925293} 11/07/2021 06:01:22 - INFO - __main__ - Step 62327: {'lr': 0.00032154978608510415, 'samples': 11966784, 'steps': 62326, 'loss/train': 1.3291963338851929} 11/07/2021 06:01:23 - INFO - __main__ - Step 62328: {'lr': 0.0003215447013049205, 'samples': 11966976, 'steps': 62327, 'loss/train': 0.9387235045433044} 11/07/2021 06:01:24 - INFO - __main__ - Step 62329: {'lr': 0.00032153961649249987, 'samples': 11967168, 'steps': 62328, 'loss/train': 1.3177895545959473} 11/07/2021 06:01:24 - INFO - __main__ - Step 62330: {'lr': 0.0003215345316478446, 'samples': 11967360, 'steps': 62329, 'loss/train': 1.092084288597107} 11/07/2021 06:01:25 - INFO - __main__ - Step 62331: {'lr': 0.00032152944677095696, 'samples': 11967552, 'steps': 62330, 'loss/train': 1.0656335353851318} 11/07/2021 06:01:25 - INFO - __main__ - Step 62332: {'lr': 0.0003215243618618394, 'samples': 11967744, 'steps': 62331, 'loss/train': 1.8815562725067139} 11/07/2021 06:01:25 - INFO - __main__ - Step 62333: {'lr': 0.00032151927692049395, 'samples': 11967936, 'steps': 62332, 'loss/train': 1.5611562728881836} 11/07/2021 06:01:26 - INFO - __main__ - Step 62334: {'lr': 0.000321514191946923, 'samples': 11968128, 'steps': 62333, 'loss/train': 1.7134298086166382} 11/07/2021 06:01:27 - INFO - __main__ - Step 62335: {'lr': 0.0003215091069411289, 'samples': 11968320, 'steps': 62334, 'loss/train': 1.4284589290618896} 11/07/2021 06:01:27 - INFO - __main__ - Step 62336: {'lr': 0.00032150402190311383, 'samples': 11968512, 'steps': 62335, 'loss/train': 0.10895877331495285} 11/07/2021 06:01:27 - INFO - __main__ - Step 62337: {'lr': 0.00032149893683288024, 'samples': 11968704, 'steps': 62336, 'loss/train': 1.9488602876663208} 11/07/2021 06:01:28 - INFO - __main__ - Step 62338: {'lr': 0.00032149385173043033, 'samples': 11968896, 'steps': 62337, 'loss/train': 1.5233485698699951} 11/07/2021 06:01:28 - INFO - __main__ - Step 62339: {'lr': 0.0003214887665957663, 'samples': 11969088, 'steps': 62338, 'loss/train': 1.257074236869812} 11/07/2021 06:01:29 - INFO - __main__ - Step 62340: {'lr': 0.0003214836814288906, 'samples': 11969280, 'steps': 62339, 'loss/train': 1.4289500713348389} 11/07/2021 06:01:29 - INFO - __main__ - Step 62341: {'lr': 0.0003214785962298055, 'samples': 11969472, 'steps': 62340, 'loss/train': 1.297006368637085} 11/07/2021 06:01:30 - INFO - __main__ - Step 62342: {'lr': 0.0003214735109985131, 'samples': 11969664, 'steps': 62341, 'loss/train': 1.2540606260299683} 11/07/2021 06:01:30 - INFO - __main__ - Step 62343: {'lr': 0.000321468425735016, 'samples': 11969856, 'steps': 62342, 'loss/train': 1.3769084215164185} 11/07/2021 06:01:30 - INFO - __main__ - Step 62344: {'lr': 0.00032146334043931625, 'samples': 11970048, 'steps': 62343, 'loss/train': 1.943401575088501} 11/07/2021 06:01:31 - INFO - __main__ - Step 62345: {'lr': 0.00032145825511141626, 'samples': 11970240, 'steps': 62344, 'loss/train': 1.3622338771820068} 11/07/2021 06:01:32 - INFO - __main__ - Step 62346: {'lr': 0.0003214531697513183, 'samples': 11970432, 'steps': 62345, 'loss/train': 1.5287635326385498} 11/07/2021 06:01:32 - INFO - __main__ - Step 62347: {'lr': 0.00032144808435902454, 'samples': 11970624, 'steps': 62346, 'loss/train': 1.4852794408798218} 11/07/2021 06:01:33 - INFO - __main__ - Step 62348: {'lr': 0.00032144299893453743, 'samples': 11970816, 'steps': 62347, 'loss/train': 1.1693280935287476} 11/07/2021 06:01:33 - INFO - __main__ - Step 62349: {'lr': 0.0003214379134778592, 'samples': 11971008, 'steps': 62348, 'loss/train': 1.6231582164764404} 11/07/2021 06:01:34 - INFO - __main__ - Step 62350: {'lr': 0.0003214328279889922, 'samples': 11971200, 'steps': 62349, 'loss/train': 1.090929627418518} 11/07/2021 06:01:34 - INFO - __main__ - Step 62351: {'lr': 0.0003214277424679386, 'samples': 11971392, 'steps': 62350, 'loss/train': 1.244305968284607} 11/07/2021 06:01:35 - INFO - __main__ - Step 62352: {'lr': 0.00032142265691470083, 'samples': 11971584, 'steps': 62351, 'loss/train': 1.6756391525268555} 11/07/2021 06:01:35 - INFO - __main__ - Step 62353: {'lr': 0.00032141757132928114, 'samples': 11971776, 'steps': 62352, 'loss/train': 1.592952847480774} 11/07/2021 06:01:35 - INFO - __main__ - Step 62354: {'lr': 0.0003214124857116817, 'samples': 11971968, 'steps': 62353, 'loss/train': 1.4001635313034058} 11/07/2021 06:01:36 - INFO - __main__ - Step 62355: {'lr': 0.00032140740006190494, 'samples': 11972160, 'steps': 62354, 'loss/train': 1.2248793840408325} 11/07/2021 06:01:37 - INFO - __main__ - Step 62356: {'lr': 0.00032140231437995304, 'samples': 11972352, 'steps': 62355, 'loss/train': 2.0128517150878906} 11/07/2021 06:01:37 - INFO - __main__ - Step 62357: {'lr': 0.0003213972286658284, 'samples': 11972544, 'steps': 62356, 'loss/train': 1.0471782684326172} 11/07/2021 06:01:37 - INFO - __main__ - Step 62358: {'lr': 0.0003213921429195334, 'samples': 11972736, 'steps': 62357, 'loss/train': 1.9435913562774658} 11/07/2021 06:01:38 - INFO - __main__ - Step 62359: {'lr': 0.0003213870571410701, 'samples': 11972928, 'steps': 62358, 'loss/train': 1.3619399070739746} 11/07/2021 06:01:38 - INFO - __main__ - Step 62360: {'lr': 0.00032138197133044086, 'samples': 11973120, 'steps': 62359, 'loss/train': 1.5199415683746338} 11/07/2021 06:01:39 - INFO - __main__ - Step 62361: {'lr': 0.000321376885487648, 'samples': 11973312, 'steps': 62360, 'loss/train': 1.2292982339859009} 11/07/2021 06:01:39 - INFO - __main__ - Step 62362: {'lr': 0.00032137179961269386, 'samples': 11973504, 'steps': 62361, 'loss/train': 1.1265146732330322} 11/07/2021 06:01:40 - INFO - __main__ - Step 62363: {'lr': 0.0003213667137055807, 'samples': 11973696, 'steps': 62362, 'loss/train': 1.4637807607650757} 11/07/2021 06:01:40 - INFO - __main__ - Step 62364: {'lr': 0.0003213616277663107, 'samples': 11973888, 'steps': 62363, 'loss/train': 2.0051817893981934} 11/07/2021 06:01:40 - INFO - __main__ - Step 62365: {'lr': 0.00032135654179488637, 'samples': 11974080, 'steps': 62364, 'loss/train': 1.4825255870819092} 11/07/2021 06:01:41 - INFO - __main__ - Step 62366: {'lr': 0.00032135145579130985, 'samples': 11974272, 'steps': 62365, 'loss/train': 1.9909164905548096} 11/07/2021 06:01:42 - INFO - __main__ - Step 62367: {'lr': 0.00032134636975558343, 'samples': 11974464, 'steps': 62366, 'loss/train': 1.4592262506484985} 11/07/2021 06:01:42 - INFO - __main__ - Step 62368: {'lr': 0.0003213412836877095, 'samples': 11974656, 'steps': 62367, 'loss/train': 1.4790903329849243} 11/07/2021 06:01:42 - INFO - __main__ - Step 62369: {'lr': 0.0003213361975876902, 'samples': 11974848, 'steps': 62368, 'loss/train': 0.3617219924926758} 11/07/2021 06:01:43 - INFO - __main__ - Step 62370: {'lr': 0.00032133111145552797, 'samples': 11975040, 'steps': 62369, 'loss/train': 1.818790078163147} 11/07/2021 06:01:44 - INFO - __main__ - Step 62371: {'lr': 0.000321326025291225, 'samples': 11975232, 'steps': 62370, 'loss/train': 1.642632246017456} 11/07/2021 06:01:44 - INFO - __main__ - Step 62372: {'lr': 0.0003213209390947837, 'samples': 11975424, 'steps': 62371, 'loss/train': 1.468485951423645} 11/07/2021 06:01:45 - INFO - __main__ - Step 62373: {'lr': 0.00032131585286620623, 'samples': 11975616, 'steps': 62372, 'loss/train': 1.6108511686325073} 11/07/2021 06:01:45 - INFO - __main__ - Step 62374: {'lr': 0.00032131076660549496, 'samples': 11975808, 'steps': 62373, 'loss/train': 1.8044074773788452} 11/07/2021 06:01:45 - INFO - __main__ - Step 62375: {'lr': 0.00032130568031265216, 'samples': 11976000, 'steps': 62374, 'loss/train': 1.5075205564498901} 11/07/2021 06:01:46 - INFO - __main__ - Step 62376: {'lr': 0.00032130059398768006, 'samples': 11976192, 'steps': 62375, 'loss/train': 1.4473289251327515} 11/07/2021 06:01:47 - INFO - __main__ - Step 62377: {'lr': 0.00032129550763058105, 'samples': 11976384, 'steps': 62376, 'loss/train': 1.05477774143219} 11/07/2021 06:01:47 - INFO - __main__ - Step 62378: {'lr': 0.00032129042124135745, 'samples': 11976576, 'steps': 62377, 'loss/train': 1.613659381866455} 11/07/2021 06:01:47 - INFO - __main__ - Step 62379: {'lr': 0.00032128533482001144, 'samples': 11976768, 'steps': 62378, 'loss/train': 1.1217000484466553} 11/07/2021 06:01:48 - INFO - __main__ - Step 62380: {'lr': 0.00032128024836654533, 'samples': 11976960, 'steps': 62379, 'loss/train': 1.2609049081802368} 11/07/2021 06:01:49 - INFO - __main__ - Step 62381: {'lr': 0.00032127516188096153, 'samples': 11977152, 'steps': 62380, 'loss/train': 1.2486363649368286} 11/07/2021 06:01:49 - INFO - __main__ - Step 62382: {'lr': 0.00032127007536326215, 'samples': 11977344, 'steps': 62381, 'loss/train': 1.3987350463867188} 11/07/2021 06:01:49 - INFO - __main__ - Step 62383: {'lr': 0.00032126498881344956, 'samples': 11977536, 'steps': 62382, 'loss/train': 1.0004364252090454} 11/07/2021 06:01:50 - INFO - __main__ - Step 62384: {'lr': 0.0003212599022315261, 'samples': 11977728, 'steps': 62383, 'loss/train': 1.544081211090088} 11/07/2021 06:01:50 - INFO - __main__ - Step 62385: {'lr': 0.00032125481561749405, 'samples': 11977920, 'steps': 62384, 'loss/train': 1.4494726657867432} 11/07/2021 06:01:50 - INFO - __main__ - Step 62386: {'lr': 0.0003212497289713556, 'samples': 11978112, 'steps': 62385, 'loss/train': 1.6052567958831787} 11/07/2021 06:01:52 - INFO - __main__ - Step 62387: {'lr': 0.0003212446422931132, 'samples': 11978304, 'steps': 62386, 'loss/train': 1.2276850938796997} 11/07/2021 06:01:52 - INFO - __main__ - Step 62388: {'lr': 0.00032123955558276905, 'samples': 11978496, 'steps': 62387, 'loss/train': 1.255285620689392} 11/07/2021 06:01:52 - INFO - __main__ - Step 62389: {'lr': 0.0003212344688403255, 'samples': 11978688, 'steps': 62388, 'loss/train': 1.3982292413711548} 11/07/2021 06:01:53 - INFO - __main__ - Step 62390: {'lr': 0.0003212293820657848, 'samples': 11978880, 'steps': 62389, 'loss/train': 1.5056167840957642} 11/07/2021 06:01:53 - INFO - __main__ - Step 62391: {'lr': 0.0003212242952591491, 'samples': 11979072, 'steps': 62390, 'loss/train': 1.2538071870803833} 11/07/2021 06:01:54 - INFO - __main__ - Step 62392: {'lr': 0.000321219208420421, 'samples': 11979264, 'steps': 62391, 'loss/train': 1.1903376579284668} 11/07/2021 06:01:54 - INFO - __main__ - Step 62393: {'lr': 0.0003212141215496025, 'samples': 11979456, 'steps': 62392, 'loss/train': 1.7909162044525146} 11/07/2021 06:01:55 - INFO - __main__ - Step 62394: {'lr': 0.00032120903464669603, 'samples': 11979648, 'steps': 62393, 'loss/train': 1.257682204246521} 11/07/2021 06:01:55 - INFO - __main__ - Step 62395: {'lr': 0.0003212039477117039, 'samples': 11979840, 'steps': 62394, 'loss/train': 1.2452243566513062} 11/07/2021 06:01:55 - INFO - __main__ - Step 62396: {'lr': 0.0003211988607446284, 'samples': 11980032, 'steps': 62395, 'loss/train': 1.7398910522460938} 11/07/2021 06:01:57 - INFO - __main__ - Step 62397: {'lr': 0.0003211937737454718, 'samples': 11980224, 'steps': 62396, 'loss/train': 1.6822526454925537} 11/07/2021 06:01:57 - INFO - __main__ - Step 62398: {'lr': 0.0003211886867142363, 'samples': 11980416, 'steps': 62397, 'loss/train': 0.6014876961708069} 11/07/2021 06:01:57 - INFO - __main__ - Step 62399: {'lr': 0.00032118359965092424, 'samples': 11980608, 'steps': 62398, 'loss/train': 1.156036376953125} 11/07/2021 06:01:58 - INFO - __main__ - Step 62400: {'lr': 0.00032117851255553803, 'samples': 11980800, 'steps': 62399, 'loss/train': 1.4908323287963867} 11/07/2021 06:01:58 - INFO - __main__ - Step 62401: {'lr': 0.0003211734254280799, 'samples': 11980992, 'steps': 62400, 'loss/train': 1.3126240968704224} 11/07/2021 06:01:59 - INFO - __main__ - Step 62402: {'lr': 0.00032116833826855215, 'samples': 11981184, 'steps': 62401, 'loss/train': 0.904496967792511} 11/07/2021 06:02:00 - INFO - __main__ - Step 62403: {'lr': 0.000321163251076957, 'samples': 11981376, 'steps': 62402, 'loss/train': 1.2968679666519165} 11/07/2021 06:02:00 - INFO - __main__ - Step 62404: {'lr': 0.00032115816385329675, 'samples': 11981568, 'steps': 62403, 'loss/train': 1.4300117492675781} 11/07/2021 06:02:00 - INFO - __main__ - Step 62405: {'lr': 0.00032115307659757374, 'samples': 11981760, 'steps': 62404, 'loss/train': 1.5802758932113647} 11/07/2021 06:02:01 - INFO - __main__ - Step 62406: {'lr': 0.0003211479893097903, 'samples': 11981952, 'steps': 62405, 'loss/train': 1.476452112197876} 11/07/2021 06:02:01 - INFO - __main__ - Step 62407: {'lr': 0.00032114290198994867, 'samples': 11982144, 'steps': 62406, 'loss/train': 1.6660008430480957} 11/07/2021 06:02:03 - INFO - __main__ - Step 62408: {'lr': 0.0003211378146380511, 'samples': 11982336, 'steps': 62407, 'loss/train': 1.4348708391189575} 11/07/2021 06:02:03 - INFO - __main__ - Step 62409: {'lr': 0.0003211327272541, 'samples': 11982528, 'steps': 62408, 'loss/train': 1.8010343313217163} 11/07/2021 06:02:04 - INFO - __main__ - Step 62410: {'lr': 0.00032112763983809753, 'samples': 11982720, 'steps': 62409, 'loss/train': 1.3811204433441162} 11/07/2021 06:02:04 - INFO - __main__ - Step 62411: {'lr': 0.000321122552390046, 'samples': 11982912, 'steps': 62410, 'loss/train': 1.1752146482467651} 11/07/2021 06:02:04 - INFO - __main__ - Step 62412: {'lr': 0.0003211174649099479, 'samples': 11983104, 'steps': 62411, 'loss/train': 1.8879623413085938} 11/07/2021 06:02:05 - INFO - __main__ - Step 62413: {'lr': 0.0003211123773978052, 'samples': 11983296, 'steps': 62412, 'loss/train': 1.272231936454773} 11/07/2021 06:02:06 - INFO - __main__ - Step 62414: {'lr': 0.00032110728985362044, 'samples': 11983488, 'steps': 62413, 'loss/train': 1.6675348281860352} 11/07/2021 06:02:06 - INFO - __main__ - Step 62415: {'lr': 0.0003211022022773958, 'samples': 11983680, 'steps': 62414, 'loss/train': 1.466812014579773} 11/07/2021 06:02:06 - INFO - __main__ - Step 62416: {'lr': 0.0003210971146691336, 'samples': 11983872, 'steps': 62415, 'loss/train': 1.6694635152816772} 11/07/2021 06:02:07 - INFO - __main__ - Step 62417: {'lr': 0.0003210920270288362, 'samples': 11984064, 'steps': 62416, 'loss/train': 1.1894623041152954} 11/07/2021 06:02:07 - INFO - __main__ - Step 62418: {'lr': 0.00032108693935650577, 'samples': 11984256, 'steps': 62417, 'loss/train': 1.3704510927200317} 11/07/2021 06:02:08 - INFO - __main__ - Step 62419: {'lr': 0.0003210818516521447, 'samples': 11984448, 'steps': 62418, 'loss/train': 1.824023723602295} 11/07/2021 06:02:08 - INFO - __main__ - Step 62420: {'lr': 0.00032107676391575525, 'samples': 11984640, 'steps': 62419, 'loss/train': 1.6543632745742798} 11/07/2021 06:02:09 - INFO - __main__ - Step 62421: {'lr': 0.0003210716761473397, 'samples': 11984832, 'steps': 62420, 'loss/train': 0.8174358606338501} 11/07/2021 06:02:09 - INFO - __main__ - Step 62422: {'lr': 0.0003210665883469003, 'samples': 11985024, 'steps': 62421, 'loss/train': 1.2469074726104736} 11/07/2021 06:02:10 - INFO - __main__ - Step 62423: {'lr': 0.0003210615005144394, 'samples': 11985216, 'steps': 62422, 'loss/train': 1.533103585243225} 11/07/2021 06:02:11 - INFO - __main__ - Step 62424: {'lr': 0.00032105641264995935, 'samples': 11985408, 'steps': 62423, 'loss/train': 1.4389373064041138} 11/07/2021 06:02:11 - INFO - __main__ - Step 62425: {'lr': 0.00032105132475346233, 'samples': 11985600, 'steps': 62424, 'loss/train': 1.325593113899231} 11/07/2021 06:02:12 - INFO - __main__ - Step 62426: {'lr': 0.0003210462368249507, 'samples': 11985792, 'steps': 62425, 'loss/train': 1.4844963550567627} 11/07/2021 06:02:12 - INFO - __main__ - Step 62427: {'lr': 0.0003210411488644267, 'samples': 11985984, 'steps': 62426, 'loss/train': 1.7086414098739624} 11/07/2021 06:02:12 - INFO - __main__ - Step 62428: {'lr': 0.00032103606087189267, 'samples': 11986176, 'steps': 62427, 'loss/train': 1.5240494012832642} 11/07/2021 06:02:14 - INFO - __main__ - Step 62429: {'lr': 0.0003210309728473509, 'samples': 11986368, 'steps': 62428, 'loss/train': 1.5011428594589233} 11/07/2021 06:02:14 - INFO - __main__ - Step 62430: {'lr': 0.0003210258847908036, 'samples': 11986560, 'steps': 62429, 'loss/train': 1.578031301498413} 11/07/2021 06:02:15 - INFO - __main__ - Step 62431: {'lr': 0.00032102079670225325, 'samples': 11986752, 'steps': 62430, 'loss/train': 1.7203574180603027} 11/07/2021 06:02:15 - INFO - __main__ - Step 62432: {'lr': 0.00032101570858170196, 'samples': 11986944, 'steps': 62431, 'loss/train': 1.7779955863952637} 11/07/2021 06:02:15 - INFO - __main__ - Step 62433: {'lr': 0.0003210106204291521, 'samples': 11987136, 'steps': 62432, 'loss/train': 1.2332617044448853} 11/07/2021 06:02:16 - INFO - __main__ - Step 62434: {'lr': 0.00032100553224460594, 'samples': 11987328, 'steps': 62433, 'loss/train': 0.9742845892906189} 11/07/2021 06:02:16 - INFO - __main__ - Step 62435: {'lr': 0.00032100044402806583, 'samples': 11987520, 'steps': 62434, 'loss/train': 1.416128396987915} 11/07/2021 06:02:17 - INFO - __main__ - Step 62436: {'lr': 0.00032099535577953395, 'samples': 11987712, 'steps': 62435, 'loss/train': 0.8876898884773254} 11/07/2021 06:02:17 - INFO - __main__ - Step 62437: {'lr': 0.0003209902674990127, 'samples': 11987904, 'steps': 62436, 'loss/train': 1.2622498273849487} 11/07/2021 06:02:18 - INFO - __main__ - Step 62438: {'lr': 0.00032098517918650426, 'samples': 11988096, 'steps': 62437, 'loss/train': 1.515376329421997} 11/07/2021 06:02:18 - INFO - __main__ - Step 62439: {'lr': 0.0003209800908420111, 'samples': 11988288, 'steps': 62438, 'loss/train': 1.3629449605941772} 11/07/2021 06:02:18 - INFO - __main__ - Step 62440: {'lr': 0.00032097500246553535, 'samples': 11988480, 'steps': 62439, 'loss/train': 1.5179250240325928} 11/07/2021 06:02:19 - INFO - __main__ - Step 62441: {'lr': 0.00032096991405707937, 'samples': 11988672, 'steps': 62440, 'loss/train': 1.7064610719680786} 11/07/2021 06:02:20 - INFO - __main__ - Step 62442: {'lr': 0.00032096482561664544, 'samples': 11988864, 'steps': 62441, 'loss/train': 1.5190913677215576} 11/07/2021 06:02:20 - INFO - __main__ - Step 62443: {'lr': 0.00032095973714423584, 'samples': 11989056, 'steps': 62442, 'loss/train': 1.6264219284057617} 11/07/2021 06:02:20 - INFO - __main__ - Step 62444: {'lr': 0.00032095464863985285, 'samples': 11989248, 'steps': 62443, 'loss/train': 1.7164746522903442} 11/07/2021 06:02:21 - INFO - __main__ - Step 62445: {'lr': 0.00032094956010349885, 'samples': 11989440, 'steps': 62444, 'loss/train': 1.625199794769287} 11/07/2021 06:02:21 - INFO - __main__ - Step 62446: {'lr': 0.00032094447153517607, 'samples': 11989632, 'steps': 62445, 'loss/train': 1.3359289169311523} 11/07/2021 06:02:22 - INFO - __main__ - Step 62447: {'lr': 0.0003209393829348868, 'samples': 11989824, 'steps': 62446, 'loss/train': 1.4685285091400146} 11/07/2021 06:02:23 - INFO - __main__ - Step 62448: {'lr': 0.0003209342943026333, 'samples': 11990016, 'steps': 62447, 'loss/train': 1.0337048768997192} 11/07/2021 06:02:23 - INFO - __main__ - Step 62449: {'lr': 0.00032092920563841793, 'samples': 11990208, 'steps': 62448, 'loss/train': 1.6090806722640991} 11/07/2021 06:02:23 - INFO - __main__ - Step 62450: {'lr': 0.00032092411694224294, 'samples': 11990400, 'steps': 62449, 'loss/train': 2.1596853733062744} 11/07/2021 06:02:24 - INFO - __main__ - Step 62451: {'lr': 0.0003209190282141106, 'samples': 11990592, 'steps': 62450, 'loss/train': 0.9066995978355408} 11/07/2021 06:02:25 - INFO - __main__ - Step 62452: {'lr': 0.0003209139394540233, 'samples': 11990784, 'steps': 62451, 'loss/train': 1.5285738706588745} 11/07/2021 06:02:25 - INFO - __main__ - Step 62453: {'lr': 0.00032090885066198336, 'samples': 11990976, 'steps': 62452, 'loss/train': 1.5639458894729614} 11/07/2021 06:02:25 - INFO - __main__ - Step 62454: {'lr': 0.00032090376183799285, 'samples': 11991168, 'steps': 62453, 'loss/train': 1.4905081987380981} 11/07/2021 06:02:26 - INFO - __main__ - Step 62455: {'lr': 0.0003208986729820542, 'samples': 11991360, 'steps': 62454, 'loss/train': 1.302152156829834} 11/07/2021 06:02:26 - INFO - __main__ - Step 62456: {'lr': 0.0003208935840941697, 'samples': 11991552, 'steps': 62455, 'loss/train': 1.7421486377716064} 11/07/2021 06:02:27 - INFO - __main__ - Step 62457: {'lr': 0.0003208884951743417, 'samples': 11991744, 'steps': 62456, 'loss/train': 1.5326241254806519} 11/07/2021 06:02:27 - INFO - __main__ - Step 62458: {'lr': 0.00032088340622257245, 'samples': 11991936, 'steps': 62457, 'loss/train': 1.6364829540252686} 11/07/2021 06:02:28 - INFO - __main__ - Step 62459: {'lr': 0.0003208783172388642, 'samples': 11992128, 'steps': 62458, 'loss/train': 1.3357620239257812} 11/07/2021 06:02:28 - INFO - __main__ - Step 62460: {'lr': 0.0003208732282232193, 'samples': 11992320, 'steps': 62459, 'loss/train': 1.1460192203521729} 11/07/2021 06:02:28 - INFO - __main__ - Step 62461: {'lr': 0.00032086813917563996, 'samples': 11992512, 'steps': 62460, 'loss/train': 1.452422857284546} 11/07/2021 06:02:30 - INFO - __main__ - Step 62462: {'lr': 0.0003208630500961286, 'samples': 11992704, 'steps': 62461, 'loss/train': 1.1765297651290894} 11/07/2021 06:02:30 - INFO - __main__ - Step 62463: {'lr': 0.0003208579609846874, 'samples': 11992896, 'steps': 62462, 'loss/train': 1.650329828262329} 11/07/2021 06:02:30 - INFO - __main__ - Step 62464: {'lr': 0.00032085287184131865, 'samples': 11993088, 'steps': 62463, 'loss/train': 1.715993881225586} 11/07/2021 06:02:31 - INFO - __main__ - Step 62465: {'lr': 0.0003208477826660248, 'samples': 11993280, 'steps': 62464, 'loss/train': 1.2238794565200806} 11/07/2021 06:02:31 - INFO - __main__ - Step 62466: {'lr': 0.000320842693458808, 'samples': 11993472, 'steps': 62465, 'loss/train': 1.4740071296691895} 11/07/2021 06:02:32 - INFO - __main__ - Step 62467: {'lr': 0.00032083760421967053, 'samples': 11993664, 'steps': 62466, 'loss/train': 1.913862943649292} 11/07/2021 06:02:32 - INFO - __main__ - Step 62468: {'lr': 0.00032083251494861474, 'samples': 11993856, 'steps': 62467, 'loss/train': 1.63801908493042} 11/07/2021 06:02:33 - INFO - __main__ - Step 62469: {'lr': 0.00032082742564564296, 'samples': 11994048, 'steps': 62468, 'loss/train': 1.554450273513794} 11/07/2021 06:02:33 - INFO - __main__ - Step 62470: {'lr': 0.0003208223363107573, 'samples': 11994240, 'steps': 62469, 'loss/train': 1.4857127666473389} 11/07/2021 06:02:34 - INFO - __main__ - Step 62471: {'lr': 0.00032081724694396033, 'samples': 11994432, 'steps': 62470, 'loss/train': 1.1134861707687378} 11/07/2021 06:02:34 - INFO - __main__ - Step 62472: {'lr': 0.0003208121575452541, 'samples': 11994624, 'steps': 62471, 'loss/train': 1.3760361671447754} 11/07/2021 06:02:35 - INFO - __main__ - Step 62473: {'lr': 0.0003208070681146411, 'samples': 11994816, 'steps': 62472, 'loss/train': 1.397686243057251} 11/07/2021 06:02:35 - INFO - __main__ - Step 62474: {'lr': 0.00032080197865212354, 'samples': 11995008, 'steps': 62473, 'loss/train': 1.285237193107605} 11/07/2021 06:02:35 - INFO - __main__ - Step 62475: {'lr': 0.0003207968891577036, 'samples': 11995200, 'steps': 62474, 'loss/train': 1.3300914764404297} 11/07/2021 06:02:36 - INFO - __main__ - Step 62476: {'lr': 0.00032079179963138374, 'samples': 11995392, 'steps': 62475, 'loss/train': 1.4584017992019653} 11/07/2021 06:02:36 - INFO - __main__ - Step 62477: {'lr': 0.0003207867100731661, 'samples': 11995584, 'steps': 62476, 'loss/train': 1.4901098012924194} 11/07/2021 06:02:37 - INFO - __main__ - Step 62478: {'lr': 0.00032078162048305314, 'samples': 11995776, 'steps': 62477, 'loss/train': 1.2473688125610352} 11/07/2021 06:02:38 - INFO - __main__ - Step 62479: {'lr': 0.0003207765308610471, 'samples': 11995968, 'steps': 62478, 'loss/train': 1.2273480892181396} 11/07/2021 06:02:38 - INFO - __main__ - Step 62480: {'lr': 0.00032077144120715026, 'samples': 11996160, 'steps': 62479, 'loss/train': 1.7620359659194946} 11/07/2021 06:02:38 - INFO - __main__ - Step 62481: {'lr': 0.0003207663515213648, 'samples': 11996352, 'steps': 62480, 'loss/train': 1.5368489027023315} 11/07/2021 06:02:39 - INFO - __main__ - Step 62482: {'lr': 0.0003207612618036932, 'samples': 11996544, 'steps': 62481, 'loss/train': 1.691999912261963} 11/07/2021 06:02:40 - INFO - __main__ - Step 62483: {'lr': 0.0003207561720541376, 'samples': 11996736, 'steps': 62482, 'loss/train': 1.4539132118225098} 11/07/2021 06:02:40 - INFO - __main__ - Step 62484: {'lr': 0.0003207510822727004, 'samples': 11996928, 'steps': 62483, 'loss/train': 0.9805406928062439} 11/07/2021 06:02:40 - INFO - __main__ - Step 62485: {'lr': 0.0003207459924593839, 'samples': 11997120, 'steps': 62484, 'loss/train': 1.7054483890533447} 11/07/2021 06:02:41 - INFO - __main__ - Step 62486: {'lr': 0.0003207409026141903, 'samples': 11997312, 'steps': 62485, 'loss/train': 1.6597075462341309} 11/07/2021 06:02:41 - INFO - __main__ - Step 62487: {'lr': 0.0003207358127371219, 'samples': 11997504, 'steps': 62486, 'loss/train': 1.9365460872650146} 11/07/2021 06:02:42 - INFO - __main__ - Step 62488: {'lr': 0.00032073072282818107, 'samples': 11997696, 'steps': 62487, 'loss/train': 1.6961088180541992} 11/07/2021 06:02:43 - INFO - __main__ - Step 62489: {'lr': 0.00032072563288737006, 'samples': 11997888, 'steps': 62488, 'loss/train': 1.3870824575424194} 11/07/2021 06:02:43 - INFO - __main__ - Step 62490: {'lr': 0.00032072054291469116, 'samples': 11998080, 'steps': 62489, 'loss/train': 1.412424921989441} 11/07/2021 06:02:43 - INFO - __main__ - Step 62491: {'lr': 0.0003207154529101467, 'samples': 11998272, 'steps': 62490, 'loss/train': 1.5783405303955078} 11/07/2021 06:02:44 - INFO - __main__ - Step 62492: {'lr': 0.0003207103628737389, 'samples': 11998464, 'steps': 62491, 'loss/train': 1.3173037767410278} 11/07/2021 06:02:45 - INFO - __main__ - Step 62493: {'lr': 0.00032070527280547023, 'samples': 11998656, 'steps': 62492, 'loss/train': 1.4297562837600708} 11/07/2021 06:02:45 - INFO - __main__ - Step 62494: {'lr': 0.00032070018270534276, 'samples': 11998848, 'steps': 62493, 'loss/train': 2.224841594696045} 11/07/2021 06:02:45 - INFO - __main__ - Step 62495: {'lr': 0.0003206950925733589, 'samples': 11999040, 'steps': 62494, 'loss/train': 1.588693618774414} 11/07/2021 06:02:46 - INFO - __main__ - Step 62496: {'lr': 0.0003206900024095208, 'samples': 11999232, 'steps': 62495, 'loss/train': 1.8224881887435913} 11/07/2021 06:02:46 - INFO - __main__ - Step 62497: {'lr': 0.00032068491221383106, 'samples': 11999424, 'steps': 62496, 'loss/train': 1.4474643468856812} 11/07/2021 06:02:47 - INFO - __main__ - Step 62498: {'lr': 0.0003206798219862917, 'samples': 11999616, 'steps': 62497, 'loss/train': 1.5218315124511719} 11/07/2021 06:02:48 - INFO - __main__ - Step 62499: {'lr': 0.0003206747317269051, 'samples': 11999808, 'steps': 62498, 'loss/train': 1.738558053970337} 11/07/2021 06:02:48 - INFO - __main__ - Step 62500: {'lr': 0.0003206696414356736, 'samples': 12000000, 'steps': 62499, 'loss/train': 1.2298485040664673} 11/07/2021 06:02:48 - INFO - __main__ - Step 62501: {'lr': 0.0003206645511125995, 'samples': 12000192, 'steps': 62500, 'loss/train': 1.4853605031967163} 11/07/2021 06:02:49 - INFO - __main__ - Step 62502: {'lr': 0.00032065946075768493, 'samples': 12000384, 'steps': 62501, 'loss/train': 1.3411840200424194} 11/07/2021 06:02:49 - INFO - __main__ - Step 62503: {'lr': 0.0003206543703709323, 'samples': 12000576, 'steps': 62502, 'loss/train': 1.8073434829711914} 11/07/2021 06:02:50 - INFO - __main__ - Step 62504: {'lr': 0.00032064927995234397, 'samples': 12000768, 'steps': 62503, 'loss/train': 1.0487048625946045} 11/07/2021 06:02:50 - INFO - __main__ - Step 62505: {'lr': 0.0003206441895019221, 'samples': 12000960, 'steps': 62504, 'loss/train': 1.2646498680114746} 11/07/2021 06:02:51 - INFO - __main__ - Step 62506: {'lr': 0.0003206390990196691, 'samples': 12001152, 'steps': 62505, 'loss/train': 1.0387916564941406} 11/07/2021 06:02:51 - INFO - __main__ - Step 62507: {'lr': 0.0003206340085055872, 'samples': 12001344, 'steps': 62506, 'loss/train': 1.238993525505066} 11/07/2021 06:02:51 - INFO - __main__ - Step 62508: {'lr': 0.0003206289179596787, 'samples': 12001536, 'steps': 62507, 'loss/train': 1.612460970878601} 11/07/2021 06:02:53 - INFO - __main__ - Step 62509: {'lr': 0.00032062382738194586, 'samples': 12001728, 'steps': 62508, 'loss/train': 1.2764756679534912} 11/07/2021 06:02:53 - INFO - __main__ - Step 62510: {'lr': 0.0003206187367723911, 'samples': 12001920, 'steps': 62509, 'loss/train': 1.5293302536010742} 11/07/2021 06:02:53 - INFO - __main__ - Step 62511: {'lr': 0.0003206136461310165, 'samples': 12002112, 'steps': 62510, 'loss/train': 1.7813631296157837} 11/07/2021 06:02:54 - INFO - __main__ - Step 62512: {'lr': 0.0003206085554578246, 'samples': 12002304, 'steps': 62511, 'loss/train': 1.3717366456985474} 11/07/2021 06:02:54 - INFO - __main__ - Step 62513: {'lr': 0.00032060346475281754, 'samples': 12002496, 'steps': 62512, 'loss/train': 1.6237064599990845} 11/07/2021 06:02:55 - INFO - __main__ - Step 62514: {'lr': 0.0003205983740159976, 'samples': 12002688, 'steps': 62513, 'loss/train': 1.6506487131118774} 11/07/2021 06:02:55 - INFO - __main__ - Step 62515: {'lr': 0.00032059328324736717, 'samples': 12002880, 'steps': 62514, 'loss/train': 1.383272647857666} 11/07/2021 06:02:56 - INFO - __main__ - Step 62516: {'lr': 0.00032058819244692847, 'samples': 12003072, 'steps': 62515, 'loss/train': 1.2880539894104004} 11/07/2021 06:02:56 - INFO - __main__ - Step 62517: {'lr': 0.00032058310161468383, 'samples': 12003264, 'steps': 62516, 'loss/train': 1.2734917402267456} 11/07/2021 06:02:56 - INFO - __main__ - Step 62518: {'lr': 0.0003205780107506356, 'samples': 12003456, 'steps': 62517, 'loss/train': 1.742875099182129} 11/07/2021 06:02:57 - INFO - __main__ - Step 62519: {'lr': 0.00032057291985478596, 'samples': 12003648, 'steps': 62518, 'loss/train': 1.662589192390442} 11/07/2021 06:02:58 - INFO - __main__ - Step 62520: {'lr': 0.0003205678289271372, 'samples': 12003840, 'steps': 62519, 'loss/train': 1.4764641523361206} 11/07/2021 06:02:58 - INFO - __main__ - Step 62521: {'lr': 0.00032056273796769177, 'samples': 12004032, 'steps': 62520, 'loss/train': 1.5118993520736694} 11/07/2021 06:02:58 - INFO - __main__ - Step 62522: {'lr': 0.00032055764697645176, 'samples': 12004224, 'steps': 62521, 'loss/train': 1.4304637908935547} 11/07/2021 06:02:59 - INFO - __main__ - Step 62523: {'lr': 0.0003205525559534196, 'samples': 12004416, 'steps': 62522, 'loss/train': 1.5470013618469238} 11/07/2021 06:02:59 - INFO - __main__ - Step 62524: {'lr': 0.00032054746489859756, 'samples': 12004608, 'steps': 62523, 'loss/train': 1.2027415037155151} 11/07/2021 06:03:01 - INFO - __main__ - Step 62525: {'lr': 0.0003205423738119879, 'samples': 12004800, 'steps': 62524, 'loss/train': 1.645536184310913} 11/07/2021 06:03:01 - INFO - __main__ - Step 62526: {'lr': 0.00032053728269359295, 'samples': 12004992, 'steps': 62525, 'loss/train': 1.3825663328170776} 11/07/2021 06:03:01 - INFO - __main__ - Step 62527: {'lr': 0.00032053219154341497, 'samples': 12005184, 'steps': 62526, 'loss/train': 1.1126614809036255} 11/07/2021 06:03:02 - INFO - __main__ - Step 62528: {'lr': 0.00032052710036145626, 'samples': 12005376, 'steps': 62527, 'loss/train': 1.0449708700180054} 11/07/2021 06:03:02 - INFO - __main__ - Step 62529: {'lr': 0.0003205220091477191, 'samples': 12005568, 'steps': 62528, 'loss/train': 0.8979393839836121} 11/07/2021 06:03:02 - INFO - __main__ - Step 62530: {'lr': 0.0003205169179022059, 'samples': 12005760, 'steps': 62529, 'loss/train': 1.1279497146606445} 11/07/2021 06:03:04 - INFO - __main__ - Step 62531: {'lr': 0.00032051182662491885, 'samples': 12005952, 'steps': 62530, 'loss/train': 1.3504050970077515} 11/07/2021 06:03:04 - INFO - __main__ - Step 62532: {'lr': 0.00032050673531586025, 'samples': 12006144, 'steps': 62531, 'loss/train': 1.482869267463684} 11/07/2021 06:03:05 - INFO - __main__ - Step 62533: {'lr': 0.0003205016439750323, 'samples': 12006336, 'steps': 62532, 'loss/train': 2.044969081878662} 11/07/2021 06:03:05 - INFO - __main__ - Step 62534: {'lr': 0.0003204965526024375, 'samples': 12006528, 'steps': 62533, 'loss/train': 0.9794402122497559} 11/07/2021 06:03:05 - INFO - __main__ - Step 62535: {'lr': 0.00032049146119807816, 'samples': 12006720, 'steps': 62534, 'loss/train': 1.2806293964385986} 11/07/2021 06:03:06 - INFO - __main__ - Step 62536: {'lr': 0.0003204863697619563, 'samples': 12006912, 'steps': 62535, 'loss/train': 1.1370335817337036} 11/07/2021 06:03:07 - INFO - __main__ - Step 62537: {'lr': 0.0003204812782940744, 'samples': 12007104, 'steps': 62536, 'loss/train': 0.866182804107666} 11/07/2021 06:03:07 - INFO - __main__ - Step 62538: {'lr': 0.00032047618679443467, 'samples': 12007296, 'steps': 62537, 'loss/train': 1.513248324394226} 11/07/2021 06:03:08 - INFO - __main__ - Step 62539: {'lr': 0.00032047109526303944, 'samples': 12007488, 'steps': 62538, 'loss/train': 1.3523597717285156} 11/07/2021 06:03:08 - INFO - __main__ - Step 62540: {'lr': 0.0003204660036998911, 'samples': 12007680, 'steps': 62539, 'loss/train': 1.2142537832260132} 11/07/2021 06:03:08 - INFO - __main__ - Step 62541: {'lr': 0.0003204609121049919, 'samples': 12007872, 'steps': 62540, 'loss/train': 1.4751675128936768} 11/07/2021 06:03:09 - INFO - __main__ - Step 62542: {'lr': 0.00032045582047834405, 'samples': 12008064, 'steps': 62541, 'loss/train': 1.5533076524734497} 11/07/2021 06:03:10 - INFO - __main__ - Step 62543: {'lr': 0.00032045072881994993, 'samples': 12008256, 'steps': 62542, 'loss/train': 1.2877624034881592} 11/07/2021 06:03:10 - INFO - __main__ - Step 62544: {'lr': 0.00032044563712981173, 'samples': 12008448, 'steps': 62543, 'loss/train': 0.9442176222801208} 11/07/2021 06:03:10 - INFO - __main__ - Step 62545: {'lr': 0.00032044054540793183, 'samples': 12008640, 'steps': 62544, 'loss/train': 1.6460802555084229} 11/07/2021 06:03:11 - INFO - __main__ - Step 62546: {'lr': 0.00032043545365431246, 'samples': 12008832, 'steps': 62545, 'loss/train': 1.7990704774856567} 11/07/2021 06:03:11 - INFO - __main__ - Step 62547: {'lr': 0.00032043036186895615, 'samples': 12009024, 'steps': 62546, 'loss/train': 1.491529107093811} 11/07/2021 06:03:12 - INFO - __main__ - Step 62548: {'lr': 0.0003204252700518648, 'samples': 12009216, 'steps': 62547, 'loss/train': 1.4219372272491455} 11/07/2021 06:03:12 - INFO - __main__ - Step 62549: {'lr': 0.00032042017820304105, 'samples': 12009408, 'steps': 62548, 'loss/train': 1.7159372568130493} 11/07/2021 06:03:13 - INFO - __main__ - Step 62550: {'lr': 0.000320415086322487, 'samples': 12009600, 'steps': 62549, 'loss/train': 1.5540374517440796} 11/07/2021 06:03:13 - INFO - __main__ - Step 62551: {'lr': 0.00032040999441020497, 'samples': 12009792, 'steps': 62550, 'loss/train': 1.2234042882919312} 11/07/2021 06:03:13 - INFO - __main__ - Step 62552: {'lr': 0.00032040490246619725, 'samples': 12009984, 'steps': 62551, 'loss/train': 0.9070450067520142} 11/07/2021 06:03:14 - INFO - __main__ - Step 62553: {'lr': 0.0003203998104904663, 'samples': 12010176, 'steps': 62552, 'loss/train': 0.7004640102386475} 11/07/2021 06:03:15 - INFO - __main__ - Step 62554: {'lr': 0.0003203947184830142, 'samples': 12010368, 'steps': 62553, 'loss/train': 1.7067278623580933} 11/07/2021 06:03:15 - INFO - __main__ - Step 62555: {'lr': 0.0003203896264438433, 'samples': 12010560, 'steps': 62554, 'loss/train': 1.3608046770095825} 11/07/2021 06:03:15 - INFO - __main__ - Step 62556: {'lr': 0.00032038453437295593, 'samples': 12010752, 'steps': 62555, 'loss/train': 1.1645599603652954} 11/07/2021 06:03:16 - INFO - __main__ - Step 62557: {'lr': 0.00032037944227035443, 'samples': 12010944, 'steps': 62556, 'loss/train': 1.2873722314834595} 11/07/2021 06:03:17 - INFO - __main__ - Step 62558: {'lr': 0.000320374350136041, 'samples': 12011136, 'steps': 62557, 'loss/train': 1.2238942384719849} 11/07/2021 06:03:17 - INFO - __main__ - Step 62559: {'lr': 0.00032036925797001794, 'samples': 12011328, 'steps': 62558, 'loss/train': 1.212776780128479} 11/07/2021 06:03:17 - INFO - __main__ - Step 62560: {'lr': 0.00032036416577228767, 'samples': 12011520, 'steps': 62559, 'loss/train': 1.6321191787719727} 11/07/2021 06:03:18 - INFO - __main__ - Step 62561: {'lr': 0.0003203590735428523, 'samples': 12011712, 'steps': 62560, 'loss/train': 0.5809390544891357} 11/07/2021 06:03:18 - INFO - __main__ - Step 62562: {'lr': 0.0003203539812817143, 'samples': 12011904, 'steps': 62561, 'loss/train': 1.380789041519165} 11/07/2021 06:03:19 - INFO - __main__ - Step 62563: {'lr': 0.0003203488889888758, 'samples': 12012096, 'steps': 62562, 'loss/train': 1.5878074169158936} 11/07/2021 06:03:20 - INFO - __main__ - Step 62564: {'lr': 0.0003203437966643392, 'samples': 12012288, 'steps': 62563, 'loss/train': 1.7706053256988525} 11/07/2021 06:03:20 - INFO - __main__ - Step 62565: {'lr': 0.00032033870430810677, 'samples': 12012480, 'steps': 62564, 'loss/train': 1.149591088294983} 11/07/2021 06:03:20 - INFO - __main__ - Step 62566: {'lr': 0.0003203336119201808, 'samples': 12012672, 'steps': 62565, 'loss/train': 1.3068805932998657} 11/07/2021 06:03:21 - INFO - __main__ - Step 62567: {'lr': 0.00032032851950056376, 'samples': 12012864, 'steps': 62566, 'loss/train': 0.8859485387802124} 11/07/2021 06:03:22 - INFO - __main__ - Step 62568: {'lr': 0.0003203234270492575, 'samples': 12013056, 'steps': 62567, 'loss/train': 0.8570880889892578} 11/07/2021 06:03:22 - INFO - __main__ - Step 62569: {'lr': 0.0003203183345662648, 'samples': 12013248, 'steps': 62568, 'loss/train': 1.4416708946228027} 11/07/2021 06:03:23 - INFO - __main__ - Step 62570: {'lr': 0.0003203132420515876, 'samples': 12013440, 'steps': 62569, 'loss/train': 1.3884958028793335} 11/07/2021 06:03:23 - INFO - __main__ - Step 62571: {'lr': 0.0003203081495052284, 'samples': 12013632, 'steps': 62570, 'loss/train': 1.4366120100021362} 11/07/2021 06:03:23 - INFO - __main__ - Step 62572: {'lr': 0.00032030305692718944, 'samples': 12013824, 'steps': 62571, 'loss/train': 1.2171107530593872} 11/07/2021 06:03:24 - INFO - __main__ - Step 62573: {'lr': 0.000320297964317473, 'samples': 12014016, 'steps': 62572, 'loss/train': 1.4773526191711426} 11/07/2021 06:03:25 - INFO - __main__ - Step 62574: {'lr': 0.0003202928716760814, 'samples': 12014208, 'steps': 62573, 'loss/train': 1.5549575090408325} 11/07/2021 06:03:25 - INFO - __main__ - Step 62575: {'lr': 0.0003202877790030169, 'samples': 12014400, 'steps': 62574, 'loss/train': 1.6596471071243286} 11/07/2021 06:03:25 - INFO - __main__ - Step 62576: {'lr': 0.00032028268629828184, 'samples': 12014592, 'steps': 62575, 'loss/train': 1.6134611368179321} 11/07/2021 06:03:26 - INFO - __main__ - Step 62577: {'lr': 0.0003202775935618784, 'samples': 12014784, 'steps': 62576, 'loss/train': 1.5370959043502808} 11/07/2021 06:03:26 - INFO - __main__ - Step 62578: {'lr': 0.000320272500793809, 'samples': 12014976, 'steps': 62577, 'loss/train': 1.1561040878295898} 11/07/2021 06:03:27 - INFO - __main__ - Step 62579: {'lr': 0.0003202674079940759, 'samples': 12015168, 'steps': 62578, 'loss/train': 1.7069735527038574} 11/07/2021 06:03:28 - INFO - __main__ - Step 62580: {'lr': 0.00032026231516268147, 'samples': 12015360, 'steps': 62579, 'loss/train': 1.673508644104004} 11/07/2021 06:03:28 - INFO - __main__ - Step 62581: {'lr': 0.0003202572222996278, 'samples': 12015552, 'steps': 62580, 'loss/train': 1.3847873210906982} 11/07/2021 06:03:28 - INFO - __main__ - Step 62582: {'lr': 0.0003202521294049174, 'samples': 12015744, 'steps': 62581, 'loss/train': 1.3386253118515015} 11/07/2021 06:03:29 - INFO - __main__ - Step 62583: {'lr': 0.0003202470364785524, 'samples': 12015936, 'steps': 62582, 'loss/train': 0.9756842851638794} 11/07/2021 06:03:30 - INFO - __main__ - Step 62584: {'lr': 0.0003202419435205352, 'samples': 12016128, 'steps': 62583, 'loss/train': 0.9317203164100647} 11/07/2021 06:03:30 - INFO - __main__ - Step 62585: {'lr': 0.0003202368505308681, 'samples': 12016320, 'steps': 62584, 'loss/train': 1.6625269651412964} 11/07/2021 06:03:30 - INFO - __main__ - Step 62586: {'lr': 0.0003202317575095533, 'samples': 12016512, 'steps': 62585, 'loss/train': 1.5217801332473755} 11/07/2021 06:03:31 - INFO - __main__ - Step 62587: {'lr': 0.0003202266644565932, 'samples': 12016704, 'steps': 62586, 'loss/train': 1.3391423225402832} 11/07/2021 06:03:31 - INFO - __main__ - Step 62588: {'lr': 0.00032022157137199, 'samples': 12016896, 'steps': 62587, 'loss/train': 1.721959114074707} 11/07/2021 06:03:32 - INFO - __main__ - Step 62589: {'lr': 0.0003202164782557461, 'samples': 12017088, 'steps': 62588, 'loss/train': 1.577879548072815} 11/07/2021 06:03:33 - INFO - __main__ - Step 62590: {'lr': 0.0003202113851078637, 'samples': 12017280, 'steps': 62589, 'loss/train': 1.335235834121704} 11/07/2021 06:03:33 - INFO - __main__ - Step 62591: {'lr': 0.0003202062919283452, 'samples': 12017472, 'steps': 62590, 'loss/train': 1.0915756225585938} 11/07/2021 06:03:33 - INFO - __main__ - Step 62592: {'lr': 0.00032020119871719276, 'samples': 12017664, 'steps': 62591, 'loss/train': 1.5226856470108032} 11/07/2021 06:03:34 - INFO - __main__ - Step 62593: {'lr': 0.00032019610547440874, 'samples': 12017856, 'steps': 62592, 'loss/train': 1.6386942863464355} 11/07/2021 06:03:34 - INFO - __main__ - Step 62594: {'lr': 0.0003201910121999955, 'samples': 12018048, 'steps': 62593, 'loss/train': 3.1110341548919678} 11/07/2021 06:03:35 - INFO - __main__ - Step 62595: {'lr': 0.0003201859188939552, 'samples': 12018240, 'steps': 62594, 'loss/train': 1.675277829170227} 11/07/2021 06:03:35 - INFO - __main__ - Step 62596: {'lr': 0.00032018082555629025, 'samples': 12018432, 'steps': 62595, 'loss/train': 1.7365220785140991} 11/07/2021 06:03:36 - INFO - __main__ - Step 62597: {'lr': 0.0003201757321870029, 'samples': 12018624, 'steps': 62596, 'loss/train': 1.4035617113113403} 11/07/2021 06:03:36 - INFO - __main__ - Step 62598: {'lr': 0.0003201706387860954, 'samples': 12018816, 'steps': 62597, 'loss/train': 1.6051220893859863} 11/07/2021 06:03:36 - INFO - __main__ - Step 62599: {'lr': 0.00032016554535357016, 'samples': 12019008, 'steps': 62598, 'loss/train': 1.3190909624099731} 11/07/2021 06:03:37 - INFO - __main__ - Step 62600: {'lr': 0.00032016045188942946, 'samples': 12019200, 'steps': 62599, 'loss/train': 1.2726839780807495} 11/07/2021 06:03:38 - INFO - __main__ - Step 62601: {'lr': 0.00032015535839367544, 'samples': 12019392, 'steps': 62600, 'loss/train': 1.9633264541625977} 11/07/2021 06:03:38 - INFO - __main__ - Step 62602: {'lr': 0.0003201502648663105, 'samples': 12019584, 'steps': 62601, 'loss/train': 1.7215160131454468} 11/07/2021 06:03:38 - INFO - __main__ - Step 62603: {'lr': 0.00032014517130733695, 'samples': 12019776, 'steps': 62602, 'loss/train': 1.469570279121399} 11/07/2021 06:03:39 - INFO - __main__ - Step 62604: {'lr': 0.0003201400777167571, 'samples': 12019968, 'steps': 62603, 'loss/train': 1.7642886638641357} 11/07/2021 06:03:40 - INFO - __main__ - Step 62605: {'lr': 0.00032013498409457316, 'samples': 12020160, 'steps': 62604, 'loss/train': 1.50009024143219} 11/07/2021 06:03:40 - INFO - __main__ - Step 62606: {'lr': 0.00032012989044078745, 'samples': 12020352, 'steps': 62605, 'loss/train': 0.6350144147872925} 11/07/2021 06:03:41 - INFO - __main__ - Step 62607: {'lr': 0.0003201247967554024, 'samples': 12020544, 'steps': 62606, 'loss/train': 1.370793104171753} 11/07/2021 06:03:41 - INFO - __main__ - Step 62608: {'lr': 0.0003201197030384201, 'samples': 12020736, 'steps': 62607, 'loss/train': 1.6469072103500366} 11/07/2021 06:03:41 - INFO - __main__ - Step 62609: {'lr': 0.00032011460928984306, 'samples': 12020928, 'steps': 62608, 'loss/train': 1.2905105352401733} 11/07/2021 06:03:42 - INFO - __main__ - Step 62610: {'lr': 0.00032010951550967337, 'samples': 12021120, 'steps': 62609, 'loss/train': 1.7103897333145142} 11/07/2021 06:03:43 - INFO - __main__ - Step 62611: {'lr': 0.00032010442169791344, 'samples': 12021312, 'steps': 62610, 'loss/train': 1.4599632024765015} 11/07/2021 06:03:43 - INFO - __main__ - Step 62612: {'lr': 0.0003200993278545655, 'samples': 12021504, 'steps': 62611, 'loss/train': 1.635070562362671} 11/07/2021 06:03:43 - INFO - __main__ - Step 62613: {'lr': 0.0003200942339796319, 'samples': 12021696, 'steps': 62612, 'loss/train': 1.4792155027389526} 11/07/2021 06:03:44 - INFO - __main__ - Step 62614: {'lr': 0.000320089140073115, 'samples': 12021888, 'steps': 62613, 'loss/train': 1.400504231452942} 11/07/2021 06:03:45 - INFO - __main__ - Step 62615: {'lr': 0.00032008404613501697, 'samples': 12022080, 'steps': 62614, 'loss/train': 1.4532432556152344} 11/07/2021 06:03:45 - INFO - __main__ - Step 62616: {'lr': 0.0003200789521653401, 'samples': 12022272, 'steps': 62615, 'loss/train': 1.532862663269043} 11/07/2021 06:03:45 - INFO - __main__ - Step 62617: {'lr': 0.00032007385816408676, 'samples': 12022464, 'steps': 62616, 'loss/train': 1.0251758098602295} 11/07/2021 06:03:46 - INFO - __main__ - Step 62618: {'lr': 0.00032006876413125926, 'samples': 12022656, 'steps': 62617, 'loss/train': 1.050552487373352} 11/07/2021 06:03:46 - INFO - __main__ - Step 62619: {'lr': 0.0003200636700668598, 'samples': 12022848, 'steps': 62618, 'loss/train': 1.331942081451416} 11/07/2021 06:03:47 - INFO - __main__ - Step 62620: {'lr': 0.00032005857597089074, 'samples': 12023040, 'steps': 62619, 'loss/train': 1.1915242671966553} 11/07/2021 06:03:47 - INFO - __main__ - Step 62621: {'lr': 0.00032005348184335443, 'samples': 12023232, 'steps': 62620, 'loss/train': 1.5625697374343872} 11/07/2021 06:03:48 - INFO - __main__ - Step 62622: {'lr': 0.00032004838768425305, 'samples': 12023424, 'steps': 62621, 'loss/train': 1.2616782188415527} 11/07/2021 06:03:48 - INFO - __main__ - Step 62623: {'lr': 0.00032004329349358897, 'samples': 12023616, 'steps': 62622, 'loss/train': 1.5851287841796875} 11/07/2021 06:03:49 - INFO - __main__ - Step 62624: {'lr': 0.0003200381992713644, 'samples': 12023808, 'steps': 62623, 'loss/train': 1.5535544157028198} 11/07/2021 06:03:50 - INFO - __main__ - Step 62625: {'lr': 0.00032003310501758177, 'samples': 12024000, 'steps': 62624, 'loss/train': 1.3308470249176025} 11/07/2021 06:03:50 - INFO - __main__ - Step 62626: {'lr': 0.00032002801073224325, 'samples': 12024192, 'steps': 62625, 'loss/train': 1.7714858055114746} 11/07/2021 06:03:50 - INFO - __main__ - Step 62627: {'lr': 0.00032002291641535126, 'samples': 12024384, 'steps': 62626, 'loss/train': 1.3204787969589233} 11/07/2021 06:03:51 - INFO - __main__ - Step 62628: {'lr': 0.000320017822066908, 'samples': 12024576, 'steps': 62627, 'loss/train': 1.3606059551239014} 11/07/2021 06:03:51 - INFO - __main__ - Step 62629: {'lr': 0.00032001272768691577, 'samples': 12024768, 'steps': 62628, 'loss/train': 1.2951653003692627} 11/07/2021 06:03:52 - INFO - __main__ - Step 62630: {'lr': 0.00032000763327537683, 'samples': 12024960, 'steps': 62629, 'loss/train': 1.2710437774658203} 11/07/2021 06:03:52 - INFO - __main__ - Step 62631: {'lr': 0.00032000253883229357, 'samples': 12025152, 'steps': 62630, 'loss/train': 1.3489089012145996} 11/07/2021 06:03:53 - INFO - __main__ - Step 62632: {'lr': 0.0003199974443576683, 'samples': 12025344, 'steps': 62631, 'loss/train': 1.1418787240982056} 11/07/2021 06:03:53 - INFO - __main__ - Step 62633: {'lr': 0.00031999234985150314, 'samples': 12025536, 'steps': 62632, 'loss/train': 1.0987135171890259} 11/07/2021 06:03:53 - INFO - __main__ - Step 62634: {'lr': 0.0003199872553138007, 'samples': 12025728, 'steps': 62633, 'loss/train': 1.559191346168518} 11/07/2021 06:03:54 - INFO - __main__ - Step 62635: {'lr': 0.00031998216074456296, 'samples': 12025920, 'steps': 62634, 'loss/train': 1.8639273643493652} 11/07/2021 06:03:55 - INFO - __main__ - Step 62636: {'lr': 0.00031997706614379236, 'samples': 12026112, 'steps': 62635, 'loss/train': 1.4559524059295654} 11/07/2021 06:03:55 - INFO - __main__ - Step 62637: {'lr': 0.00031997197151149116, 'samples': 12026304, 'steps': 62636, 'loss/train': 1.5474399328231812} 11/07/2021 06:03:56 - INFO - __main__ - Step 62638: {'lr': 0.0003199668768476617, 'samples': 12026496, 'steps': 62637, 'loss/train': 1.5071251392364502} 11/07/2021 06:03:56 - INFO - __main__ - Step 62639: {'lr': 0.0003199617821523062, 'samples': 12026688, 'steps': 62638, 'loss/train': 0.9290895462036133} 11/07/2021 06:03:56 - INFO - __main__ - Step 62640: {'lr': 0.000319956687425427, 'samples': 12026880, 'steps': 62639, 'loss/train': 1.1499719619750977} 11/07/2021 06:03:57 - INFO - __main__ - Step 62641: {'lr': 0.00031995159266702647, 'samples': 12027072, 'steps': 62640, 'loss/train': 1.454851508140564} 11/07/2021 06:03:58 - INFO - __main__ - Step 62642: {'lr': 0.0003199464978771067, 'samples': 12027264, 'steps': 62641, 'loss/train': 1.2020326852798462} 11/07/2021 06:03:58 - INFO - __main__ - Step 62643: {'lr': 0.0003199414030556702, 'samples': 12027456, 'steps': 62642, 'loss/train': 1.5142760276794434} 11/07/2021 06:03:58 - INFO - __main__ - Step 62644: {'lr': 0.00031993630820271925, 'samples': 12027648, 'steps': 62643, 'loss/train': 0.6910327076911926} 11/07/2021 06:03:59 - INFO - __main__ - Step 62645: {'lr': 0.000319931213318256, 'samples': 12027840, 'steps': 62644, 'loss/train': 1.4935252666473389} 11/07/2021 06:04:00 - INFO - __main__ - Step 62646: {'lr': 0.0003199261184022828, 'samples': 12028032, 'steps': 62645, 'loss/train': 1.7279657125473022} 11/07/2021 06:04:00 - INFO - __main__ - Step 62647: {'lr': 0.000319921023454802, 'samples': 12028224, 'steps': 62646, 'loss/train': 1.349311113357544} 11/07/2021 06:04:00 - INFO - __main__ - Step 62648: {'lr': 0.0003199159284758159, 'samples': 12028416, 'steps': 62647, 'loss/train': 1.6510727405548096} 11/07/2021 06:04:01 - INFO - __main__ - Step 62649: {'lr': 0.0003199108334653267, 'samples': 12028608, 'steps': 62648, 'loss/train': 1.2208062410354614} 11/07/2021 06:04:01 - INFO - __main__ - Step 62650: {'lr': 0.0003199057384233368, 'samples': 12028800, 'steps': 62649, 'loss/train': 0.6792144775390625} 11/07/2021 06:04:02 - INFO - __main__ - Step 62651: {'lr': 0.0003199006433498484, 'samples': 12028992, 'steps': 62650, 'loss/train': 1.4753657579421997} 11/07/2021 06:04:02 - INFO - __main__ - Step 62652: {'lr': 0.0003198955482448639, 'samples': 12029184, 'steps': 62651, 'loss/train': 1.5258936882019043} 11/07/2021 06:04:03 - INFO - __main__ - Step 62653: {'lr': 0.0003198904531083856, 'samples': 12029376, 'steps': 62652, 'loss/train': 1.0058056116104126} 11/07/2021 06:04:03 - INFO - __main__ - Step 62654: {'lr': 0.0003198853579404157, 'samples': 12029568, 'steps': 62653, 'loss/train': 1.350157618522644} 11/07/2021 06:04:03 - INFO - __main__ - Step 62655: {'lr': 0.0003198802627409565, 'samples': 12029760, 'steps': 62654, 'loss/train': 1.1061919927597046} 11/07/2021 06:04:04 - INFO - __main__ - Step 62656: {'lr': 0.0003198751675100103, 'samples': 12029952, 'steps': 62655, 'loss/train': 1.7942076921463013} 11/07/2021 06:04:05 - INFO - __main__ - Step 62657: {'lr': 0.0003198700722475795, 'samples': 12030144, 'steps': 62656, 'loss/train': 1.2547370195388794} 11/07/2021 06:04:05 - INFO - __main__ - Step 62658: {'lr': 0.00031986497695366624, 'samples': 12030336, 'steps': 62657, 'loss/train': 1.169823169708252} 11/07/2021 06:04:05 - INFO - __main__ - Step 62659: {'lr': 0.000319859881628273, 'samples': 12030528, 'steps': 62658, 'loss/train': 1.3539986610412598} 11/07/2021 06:04:06 - INFO - __main__ - Step 62660: {'lr': 0.0003198547862714019, 'samples': 12030720, 'steps': 62659, 'loss/train': 1.62205970287323} 11/07/2021 06:04:07 - INFO - __main__ - Step 62661: {'lr': 0.0003198496908830554, 'samples': 12030912, 'steps': 62660, 'loss/train': 1.0933656692504883} 11/07/2021 06:04:07 - INFO - __main__ - Step 62662: {'lr': 0.00031984459546323564, 'samples': 12031104, 'steps': 62661, 'loss/train': 1.43789803981781} 11/07/2021 06:04:08 - INFO - __main__ - Step 62663: {'lr': 0.000319839500011945, 'samples': 12031296, 'steps': 62662, 'loss/train': 1.1594839096069336} 11/07/2021 06:04:08 - INFO - __main__ - Step 62664: {'lr': 0.0003198344045291857, 'samples': 12031488, 'steps': 62663, 'loss/train': 1.0476890802383423} 11/07/2021 06:04:08 - INFO - __main__ - Step 62665: {'lr': 0.00031982930901496015, 'samples': 12031680, 'steps': 62664, 'loss/train': 1.6713236570358276} 11/07/2021 06:04:09 - INFO - __main__ - Step 62666: {'lr': 0.0003198242134692706, 'samples': 12031872, 'steps': 62665, 'loss/train': 1.4561036825180054} 11/07/2021 06:04:10 - INFO - __main__ - Step 62667: {'lr': 0.0003198191178921193, 'samples': 12032064, 'steps': 62666, 'loss/train': 1.7016842365264893} 11/07/2021 06:04:10 - INFO - __main__ - Step 62668: {'lr': 0.00031981402228350867, 'samples': 12032256, 'steps': 62667, 'loss/train': 1.483222484588623} 11/07/2021 06:04:10 - INFO - __main__ - Step 62669: {'lr': 0.00031980892664344084, 'samples': 12032448, 'steps': 62668, 'loss/train': 1.6233973503112793} 11/07/2021 06:04:11 - INFO - __main__ - Step 62670: {'lr': 0.0003198038309719182, 'samples': 12032640, 'steps': 62669, 'loss/train': 0.7984271049499512} 11/07/2021 06:04:11 - INFO - __main__ - Step 62671: {'lr': 0.000319798735268943, 'samples': 12032832, 'steps': 62670, 'loss/train': 1.442606806755066} 11/07/2021 06:04:12 - INFO - __main__ - Step 62672: {'lr': 0.00031979363953451765, 'samples': 12033024, 'steps': 62671, 'loss/train': 1.436437964439392} 11/07/2021 06:04:13 - INFO - __main__ - Step 62673: {'lr': 0.00031978854376864426, 'samples': 12033216, 'steps': 62672, 'loss/train': 0.7068847417831421} 11/07/2021 06:04:13 - INFO - __main__ - Step 62674: {'lr': 0.00031978344797132526, 'samples': 12033408, 'steps': 62673, 'loss/train': 1.479882001876831} 11/07/2021 06:04:13 - INFO - __main__ - Step 62675: {'lr': 0.000319778352142563, 'samples': 12033600, 'steps': 62674, 'loss/train': 1.5137968063354492} 11/07/2021 06:04:14 - INFO - __main__ - Step 62676: {'lr': 0.00031977325628235957, 'samples': 12033792, 'steps': 62675, 'loss/train': 1.234453797340393} 11/07/2021 06:04:15 - INFO - __main__ - Step 62677: {'lr': 0.0003197681603907174, 'samples': 12033984, 'steps': 62676, 'loss/train': 2.338395118713379} 11/07/2021 06:04:15 - INFO - __main__ - Step 62678: {'lr': 0.0003197630644676389, 'samples': 12034176, 'steps': 62677, 'loss/train': 1.450269341468811} 11/07/2021 06:04:15 - INFO - __main__ - Step 62679: {'lr': 0.0003197579685131261, 'samples': 12034368, 'steps': 62678, 'loss/train': 1.4716039896011353} 11/07/2021 06:04:16 - INFO - __main__ - Step 62680: {'lr': 0.0003197528725271815, 'samples': 12034560, 'steps': 62679, 'loss/train': 1.8092190027236938} 11/07/2021 06:04:16 - INFO - __main__ - Step 62681: {'lr': 0.00031974777650980735, 'samples': 12034752, 'steps': 62680, 'loss/train': 1.0038436651229858} 11/07/2021 06:04:17 - INFO - __main__ - Step 62682: {'lr': 0.00031974268046100593, 'samples': 12034944, 'steps': 62681, 'loss/train': 1.3514845371246338} 11/07/2021 06:04:18 - INFO - __main__ - Step 62683: {'lr': 0.0003197375843807795, 'samples': 12035136, 'steps': 62682, 'loss/train': 1.5164755582809448} 11/07/2021 06:04:18 - INFO - __main__ - Step 62684: {'lr': 0.00031973248826913035, 'samples': 12035328, 'steps': 62683, 'loss/train': 1.2570487260818481} 11/07/2021 06:04:18 - INFO - __main__ - Step 62685: {'lr': 0.0003197273921260609, 'samples': 12035520, 'steps': 62684, 'loss/train': 0.8190792202949524} 11/07/2021 06:04:19 - INFO - __main__ - Step 62686: {'lr': 0.0003197222959515733, 'samples': 12035712, 'steps': 62685, 'loss/train': 1.5686802864074707} 11/07/2021 06:04:20 - INFO - __main__ - Step 62687: {'lr': 0.00031971719974566994, 'samples': 12035904, 'steps': 62686, 'loss/train': 1.0731760263442993} 11/07/2021 06:04:20 - INFO - __main__ - Step 62688: {'lr': 0.00031971210350835314, 'samples': 12036096, 'steps': 62687, 'loss/train': 1.1709024906158447} 11/07/2021 06:04:21 - INFO - __main__ - Step 62689: {'lr': 0.00031970700723962504, 'samples': 12036288, 'steps': 62688, 'loss/train': 0.550015389919281} 11/07/2021 06:04:21 - INFO - __main__ - Step 62690: {'lr': 0.0003197019109394881, 'samples': 12036480, 'steps': 62689, 'loss/train': 0.12864889204502106} 11/07/2021 06:04:21 - INFO - __main__ - Step 62691: {'lr': 0.00031969681460794453, 'samples': 12036672, 'steps': 62690, 'loss/train': 1.3492085933685303} 11/07/2021 06:04:22 - INFO - __main__ - Step 62692: {'lr': 0.00031969171824499667, 'samples': 12036864, 'steps': 62691, 'loss/train': 1.0030395984649658} 11/07/2021 06:04:23 - INFO - __main__ - Step 62693: {'lr': 0.00031968662185064673, 'samples': 12037056, 'steps': 62692, 'loss/train': 1.1825387477874756} 11/07/2021 06:04:23 - INFO - __main__ - Step 62694: {'lr': 0.00031968152542489716, 'samples': 12037248, 'steps': 62693, 'loss/train': 1.5348248481750488} 11/07/2021 06:04:24 - INFO - __main__ - Step 62695: {'lr': 0.0003196764289677502, 'samples': 12037440, 'steps': 62694, 'loss/train': 1.782454252243042} 11/07/2021 06:04:24 - INFO - __main__ - Step 62696: {'lr': 0.000319671332479208, 'samples': 12037632, 'steps': 62695, 'loss/train': 1.664305329322815} 11/07/2021 06:04:24 - INFO - __main__ - Step 62697: {'lr': 0.00031966623595927303, 'samples': 12037824, 'steps': 62696, 'loss/train': 1.255210518836975} 11/07/2021 06:04:25 - INFO - __main__ - Step 62698: {'lr': 0.0003196611394079475, 'samples': 12038016, 'steps': 62697, 'loss/train': 1.4708527326583862} 11/07/2021 06:04:26 - INFO - __main__ - Step 62699: {'lr': 0.00031965604282523373, 'samples': 12038208, 'steps': 62698, 'loss/train': 1.1854840517044067} 11/07/2021 06:04:26 - INFO - __main__ - Step 62700: {'lr': 0.00031965094621113407, 'samples': 12038400, 'steps': 62699, 'loss/train': 1.7140016555786133} 11/07/2021 06:04:26 - INFO - __main__ - Step 62701: {'lr': 0.0003196458495656508, 'samples': 12038592, 'steps': 62700, 'loss/train': 1.5162678956985474} 11/07/2021 06:04:27 - INFO - __main__ - Step 62702: {'lr': 0.00031964075288878614, 'samples': 12038784, 'steps': 62701, 'loss/train': 1.3241618871688843} 11/07/2021 06:04:28 - INFO - __main__ - Step 62703: {'lr': 0.00031963565618054244, 'samples': 12038976, 'steps': 62702, 'loss/train': 1.8253673315048218} 11/07/2021 06:04:28 - INFO - __main__ - Step 62704: {'lr': 0.0003196305594409219, 'samples': 12039168, 'steps': 62703, 'loss/train': 1.5152649879455566} 11/07/2021 06:04:29 - INFO - __main__ - Step 62705: {'lr': 0.000319625462669927, 'samples': 12039360, 'steps': 62704, 'loss/train': 0.5592816472053528} 11/07/2021 06:04:29 - INFO - __main__ - Step 62706: {'lr': 0.00031962036586755994, 'samples': 12039552, 'steps': 62705, 'loss/train': 0.9880719780921936} 11/07/2021 06:04:29 - INFO - __main__ - Step 62707: {'lr': 0.000319615269033823, 'samples': 12039744, 'steps': 62706, 'loss/train': 1.3332786560058594} 11/07/2021 06:04:30 - INFO - __main__ - Step 62708: {'lr': 0.00031961017216871853, 'samples': 12039936, 'steps': 62707, 'loss/train': 1.3389471769332886} 11/07/2021 06:04:31 - INFO - __main__ - Step 62709: {'lr': 0.0003196050752722487, 'samples': 12040128, 'steps': 62708, 'loss/train': 3.3720107078552246} 11/07/2021 06:04:31 - INFO - __main__ - Step 62710: {'lr': 0.00031959997834441595, 'samples': 12040320, 'steps': 62709, 'loss/train': 1.37107515335083} 11/07/2021 06:04:31 - INFO - __main__ - Step 62711: {'lr': 0.00031959488138522254, 'samples': 12040512, 'steps': 62710, 'loss/train': 1.5134437084197998} 11/07/2021 06:04:32 - INFO - __main__ - Step 62712: {'lr': 0.0003195897843946707, 'samples': 12040704, 'steps': 62711, 'loss/train': 1.4119442701339722} 11/07/2021 06:04:32 - INFO - __main__ - Step 62713: {'lr': 0.0003195846873727628, 'samples': 12040896, 'steps': 62712, 'loss/train': 1.608046054840088} 11/07/2021 06:04:33 - INFO - __main__ - Step 62714: {'lr': 0.00031957959031950114, 'samples': 12041088, 'steps': 62713, 'loss/train': 1.6858094930648804} 11/07/2021 06:04:34 - INFO - __main__ - Step 62715: {'lr': 0.00031957449323488803, 'samples': 12041280, 'steps': 62714, 'loss/train': 1.032314658164978} 11/07/2021 06:04:34 - INFO - __main__ - Step 62716: {'lr': 0.00031956939611892565, 'samples': 12041472, 'steps': 62715, 'loss/train': 1.5113379955291748} 11/07/2021 06:04:34 - INFO - __main__ - Step 62717: {'lr': 0.0003195642989716164, 'samples': 12041664, 'steps': 62716, 'loss/train': 1.1849489212036133} 11/07/2021 06:04:35 - INFO - __main__ - Step 62718: {'lr': 0.0003195592017929625, 'samples': 12041856, 'steps': 62717, 'loss/train': 1.4050066471099854} 11/07/2021 06:04:36 - INFO - __main__ - Step 62719: {'lr': 0.00031955410458296636, 'samples': 12042048, 'steps': 62718, 'loss/train': 1.2454042434692383} 11/07/2021 06:04:36 - INFO - __main__ - Step 62720: {'lr': 0.00031954900734163015, 'samples': 12042240, 'steps': 62719, 'loss/train': 1.582647442817688} 11/07/2021 06:04:36 - INFO - __main__ - Step 62721: {'lr': 0.0003195439100689563, 'samples': 12042432, 'steps': 62720, 'loss/train': 1.0886907577514648} 11/07/2021 06:04:37 - INFO - __main__ - Step 62722: {'lr': 0.00031953881276494705, 'samples': 12042624, 'steps': 62721, 'loss/train': 1.4682846069335938} 11/07/2021 06:04:37 - INFO - __main__ - Step 62723: {'lr': 0.00031953371542960466, 'samples': 12042816, 'steps': 62722, 'loss/train': 1.1626001596450806} 11/07/2021 06:04:38 - INFO - __main__ - Step 62724: {'lr': 0.0003195286180629314, 'samples': 12043008, 'steps': 62723, 'loss/train': 1.7747900485992432} 11/07/2021 06:04:38 - INFO - __main__ - Step 62725: {'lr': 0.0003195235206649297, 'samples': 12043200, 'steps': 62724, 'loss/train': 1.6025872230529785} 11/07/2021 06:04:39 - INFO - __main__ - Step 62726: {'lr': 0.0003195184232356017, 'samples': 12043392, 'steps': 62725, 'loss/train': 1.954345464706421} 11/07/2021 06:04:39 - INFO - __main__ - Step 62727: {'lr': 0.00031951332577494977, 'samples': 12043584, 'steps': 62726, 'loss/train': 1.7068994045257568} 11/07/2021 06:04:39 - INFO - __main__ - Step 62728: {'lr': 0.0003195082282829763, 'samples': 12043776, 'steps': 62727, 'loss/train': 1.6482192277908325} 11/07/2021 06:04:41 - INFO - __main__ - Step 62729: {'lr': 0.0003195031307596834, 'samples': 12043968, 'steps': 62728, 'loss/train': 1.4585254192352295} 11/07/2021 06:04:41 - INFO - __main__ - Step 62730: {'lr': 0.00031949803320507355, 'samples': 12044160, 'steps': 62729, 'loss/train': 1.7501304149627686} 11/07/2021 06:04:41 - INFO - __main__ - Step 62731: {'lr': 0.0003194929356191489, 'samples': 12044352, 'steps': 62730, 'loss/train': 1.306194543838501} 11/07/2021 06:04:42 - INFO - __main__ - Step 62732: {'lr': 0.00031948783800191176, 'samples': 12044544, 'steps': 62731, 'loss/train': 1.5864689350128174} 11/07/2021 06:04:42 - INFO - __main__ - Step 62733: {'lr': 0.00031948274035336455, 'samples': 12044736, 'steps': 62732, 'loss/train': 1.2477009296417236} 11/07/2021 06:04:43 - INFO - __main__ - Step 62734: {'lr': 0.00031947764267350944, 'samples': 12044928, 'steps': 62733, 'loss/train': 1.7200684547424316} 11/07/2021 06:04:43 - INFO - __main__ - Step 62735: {'lr': 0.00031947254496234885, 'samples': 12045120, 'steps': 62734, 'loss/train': 1.2488747835159302} 11/07/2021 06:04:44 - INFO - __main__ - Step 62736: {'lr': 0.00031946744721988497, 'samples': 12045312, 'steps': 62735, 'loss/train': 1.6390701532363892} 11/07/2021 06:04:44 - INFO - __main__ - Step 62737: {'lr': 0.00031946234944612006, 'samples': 12045504, 'steps': 62736, 'loss/train': 1.2818548679351807} 11/07/2021 06:04:44 - INFO - __main__ - Step 62738: {'lr': 0.00031945725164105656, 'samples': 12045696, 'steps': 62737, 'loss/train': 1.8401554822921753} 11/07/2021 06:04:45 - INFO - __main__ - Step 62739: {'lr': 0.00031945215380469664, 'samples': 12045888, 'steps': 62738, 'loss/train': 0.7181070446968079} 11/07/2021 06:04:46 - INFO - __main__ - Step 62740: {'lr': 0.0003194470559370427, 'samples': 12046080, 'steps': 62739, 'loss/train': 1.9320555925369263} 11/07/2021 06:04:46 - INFO - __main__ - Step 62741: {'lr': 0.00031944195803809694, 'samples': 12046272, 'steps': 62740, 'loss/train': 5.807243347167969} 11/07/2021 06:04:47 - INFO - __main__ - Step 62742: {'lr': 0.00031943686010786176, 'samples': 12046464, 'steps': 62741, 'loss/train': 1.5501166582107544} 11/07/2021 06:04:47 - INFO - __main__ - Step 62743: {'lr': 0.0003194317621463394, 'samples': 12046656, 'steps': 62742, 'loss/train': 1.3100804090499878} 11/07/2021 06:04:47 - INFO - __main__ - Step 62744: {'lr': 0.0003194266641535322, 'samples': 12046848, 'steps': 62743, 'loss/train': 1.1554694175720215} 11/07/2021 06:04:48 - INFO - __main__ - Step 62745: {'lr': 0.0003194215661294423, 'samples': 12047040, 'steps': 62744, 'loss/train': 1.6617426872253418} 11/07/2021 06:04:49 - INFO - __main__ - Step 62746: {'lr': 0.00031941646807407217, 'samples': 12047232, 'steps': 62745, 'loss/train': 1.1735862493515015} 11/07/2021 06:04:49 - INFO - __main__ - Step 62747: {'lr': 0.000319411369987424, 'samples': 12047424, 'steps': 62746, 'loss/train': 1.7436447143554688} 11/07/2021 06:04:49 - INFO - __main__ - Step 62748: {'lr': 0.00031940627186950027, 'samples': 12047616, 'steps': 62747, 'loss/train': 1.4927548170089722} 11/07/2021 06:04:50 - INFO - __main__ - Step 62749: {'lr': 0.00031940117372030304, 'samples': 12047808, 'steps': 62748, 'loss/train': 1.8997554779052734} 11/07/2021 06:04:51 - INFO - __main__ - Step 62750: {'lr': 0.00031939607553983475, 'samples': 12048000, 'steps': 62749, 'loss/train': 1.0926433801651} 11/07/2021 06:04:51 - INFO - __main__ - Step 62751: {'lr': 0.00031939097732809765, 'samples': 12048192, 'steps': 62750, 'loss/train': 1.5546214580535889} 11/07/2021 06:04:51 - INFO - __main__ - Step 62752: {'lr': 0.000319385879085094, 'samples': 12048384, 'steps': 62751, 'loss/train': 1.3821102380752563} 11/07/2021 06:04:52 - INFO - __main__ - Step 62753: {'lr': 0.0003193807808108262, 'samples': 12048576, 'steps': 62752, 'loss/train': 1.0645699501037598} 11/07/2021 06:04:52 - INFO - __main__ - Step 62754: {'lr': 0.00031937568250529647, 'samples': 12048768, 'steps': 62753, 'loss/train': 0.751480221748352} 11/07/2021 06:04:53 - INFO - __main__ - Step 62755: {'lr': 0.00031937058416850716, 'samples': 12048960, 'steps': 62754, 'loss/train': 1.6059151887893677} 11/07/2021 06:04:53 - INFO - __main__ - Step 62756: {'lr': 0.00031936548580046046, 'samples': 12049152, 'steps': 62755, 'loss/train': 1.4997224807739258} 11/07/2021 06:04:54 - INFO - __main__ - Step 62757: {'lr': 0.0003193603874011588, 'samples': 12049344, 'steps': 62756, 'loss/train': 1.559220314025879} 11/07/2021 06:04:54 - INFO - __main__ - Step 62758: {'lr': 0.0003193552889706044, 'samples': 12049536, 'steps': 62757, 'loss/train': 1.629244327545166} 11/07/2021 06:04:55 - INFO - __main__ - Step 62759: {'lr': 0.0003193501905087996, 'samples': 12049728, 'steps': 62758, 'loss/train': 1.7782288789749146} 11/07/2021 06:04:55 - INFO - __main__ - Step 62760: {'lr': 0.0003193450920157467, 'samples': 12049920, 'steps': 62759, 'loss/train': 1.3906774520874023} 11/07/2021 06:04:56 - INFO - __main__ - Step 62761: {'lr': 0.0003193399934914479, 'samples': 12050112, 'steps': 62760, 'loss/train': 1.6222286224365234} 11/07/2021 06:04:56 - INFO - __main__ - Step 62762: {'lr': 0.0003193348949359056, 'samples': 12050304, 'steps': 62761, 'loss/train': 1.4147886037826538} 11/07/2021 06:04:57 - INFO - __main__ - Step 62763: {'lr': 0.0003193297963491221, 'samples': 12050496, 'steps': 62762, 'loss/train': 1.9671435356140137} 11/07/2021 06:04:57 - INFO - __main__ - Step 62764: {'lr': 0.00031932469773109963, 'samples': 12050688, 'steps': 62763, 'loss/train': 1.5566567182540894} 11/07/2021 06:04:57 - INFO - __main__ - Step 62765: {'lr': 0.0003193195990818405, 'samples': 12050880, 'steps': 62764, 'loss/train': 1.0800551176071167} 11/07/2021 06:04:59 - INFO - __main__ - Step 62766: {'lr': 0.00031931450040134705, 'samples': 12051072, 'steps': 62765, 'loss/train': 1.2862639427185059} 11/07/2021 06:04:59 - INFO - __main__ - Step 62767: {'lr': 0.00031930940168962155, 'samples': 12051264, 'steps': 62766, 'loss/train': 1.8367111682891846} 11/07/2021 06:04:59 - INFO - __main__ - Step 62768: {'lr': 0.00031930430294666636, 'samples': 12051456, 'steps': 62767, 'loss/train': 2.0855631828308105} 11/07/2021 06:05:00 - INFO - __main__ - Step 62769: {'lr': 0.00031929920417248366, 'samples': 12051648, 'steps': 62768, 'loss/train': 1.5210955142974854} 11/07/2021 06:05:00 - INFO - __main__ - Step 62770: {'lr': 0.0003192941053670758, 'samples': 12051840, 'steps': 62769, 'loss/train': 1.4959686994552612} 11/07/2021 06:05:00 - INFO - __main__ - Step 62771: {'lr': 0.00031928900653044513, 'samples': 12052032, 'steps': 62770, 'loss/train': 1.3294947147369385} 11/07/2021 06:05:02 - INFO - __main__ - Step 62772: {'lr': 0.00031928390766259386, 'samples': 12052224, 'steps': 62771, 'loss/train': 1.5084253549575806} 11/07/2021 06:05:02 - INFO - __main__ - Step 62773: {'lr': 0.00031927880876352435, 'samples': 12052416, 'steps': 62772, 'loss/train': 1.6528867483139038} 11/07/2021 06:05:02 - INFO - __main__ - Step 62774: {'lr': 0.0003192737098332388, 'samples': 12052608, 'steps': 62773, 'loss/train': 1.4084581136703491} 11/07/2021 06:05:03 - INFO - __main__ - Step 62775: {'lr': 0.00031926861087173974, 'samples': 12052800, 'steps': 62774, 'loss/train': 0.7405214905738831} 11/07/2021 06:05:03 - INFO - __main__ - Step 62776: {'lr': 0.00031926351187902926, 'samples': 12052992, 'steps': 62775, 'loss/train': 1.6125173568725586} 11/07/2021 06:05:03 - INFO - __main__ - Step 62777: {'lr': 0.00031925841285510964, 'samples': 12053184, 'steps': 62776, 'loss/train': 1.0178664922714233} 11/07/2021 06:05:04 - INFO - __main__ - Step 62778: {'lr': 0.0003192533137999833, 'samples': 12053376, 'steps': 62777, 'loss/train': 1.2676581144332886} 11/07/2021 06:05:05 - INFO - __main__ - Step 62779: {'lr': 0.0003192482147136525, 'samples': 12053568, 'steps': 62778, 'loss/train': 1.469295859336853} 11/07/2021 06:05:05 - INFO - __main__ - Step 62780: {'lr': 0.00031924311559611946, 'samples': 12053760, 'steps': 62779, 'loss/train': 1.8677239418029785} 11/07/2021 06:05:05 - INFO - __main__ - Step 62781: {'lr': 0.0003192380164473866, 'samples': 12053952, 'steps': 62780, 'loss/train': 1.348706841468811} 11/07/2021 06:05:06 - INFO - __main__ - Step 62782: {'lr': 0.0003192329172674562, 'samples': 12054144, 'steps': 62781, 'loss/train': 1.3065993785858154} 11/07/2021 06:05:07 - INFO - __main__ - Step 62783: {'lr': 0.0003192278180563304, 'samples': 12054336, 'steps': 62782, 'loss/train': 1.3841580152511597} 11/07/2021 06:05:07 - INFO - __main__ - Step 62784: {'lr': 0.00031922271881401165, 'samples': 12054528, 'steps': 62783, 'loss/train': 1.2542434930801392} 11/07/2021 06:05:07 - INFO - __main__ - Step 62785: {'lr': 0.0003192176195405023, 'samples': 12054720, 'steps': 62784, 'loss/train': 1.6238476037979126} 11/07/2021 06:05:08 - INFO - __main__ - Step 62786: {'lr': 0.00031921252023580445, 'samples': 12054912, 'steps': 62785, 'loss/train': 1.5787123441696167} 11/07/2021 06:05:08 - INFO - __main__ - Step 62787: {'lr': 0.00031920742089992056, 'samples': 12055104, 'steps': 62786, 'loss/train': 1.3160008192062378} 11/07/2021 06:05:10 - INFO - __main__ - Step 62788: {'lr': 0.0003192023215328529, 'samples': 12055296, 'steps': 62787, 'loss/train': 1.45685613155365} 11/07/2021 06:05:10 - INFO - __main__ - Step 62789: {'lr': 0.0003191972221346037, 'samples': 12055488, 'steps': 62788, 'loss/train': 1.769276738166809} 11/07/2021 06:05:10 - INFO - __main__ - Step 62790: {'lr': 0.0003191921227051753, 'samples': 12055680, 'steps': 62789, 'loss/train': 1.7501845359802246} 11/07/2021 06:05:11 - INFO - __main__ - Step 62791: {'lr': 0.0003191870232445699, 'samples': 12055872, 'steps': 62790, 'loss/train': 1.7350530624389648} 11/07/2021 06:05:11 - INFO - __main__ - Step 62792: {'lr': 0.00031918192375279006, 'samples': 12056064, 'steps': 62791, 'loss/train': 1.6111891269683838} 11/07/2021 06:05:11 - INFO - __main__ - Step 62793: {'lr': 0.00031917682422983787, 'samples': 12056256, 'steps': 62792, 'loss/train': 1.795663595199585} 11/07/2021 06:05:12 - INFO - __main__ - Step 62794: {'lr': 0.00031917172467571563, 'samples': 12056448, 'steps': 62793, 'loss/train': 1.5335144996643066} 11/07/2021 06:05:13 - INFO - __main__ - Step 62795: {'lr': 0.0003191666250904257, 'samples': 12056640, 'steps': 62794, 'loss/train': 1.6222795248031616} 11/07/2021 06:05:13 - INFO - __main__ - Step 62796: {'lr': 0.0003191615254739703, 'samples': 12056832, 'steps': 62795, 'loss/train': 1.6779783964157104} 11/07/2021 06:05:13 - INFO - __main__ - Step 62797: {'lr': 0.00031915642582635185, 'samples': 12057024, 'steps': 62796, 'loss/train': 1.5173181295394897} 11/07/2021 06:05:14 - INFO - __main__ - Step 62798: {'lr': 0.0003191513261475726, 'samples': 12057216, 'steps': 62797, 'loss/train': 1.39989173412323} 11/07/2021 06:05:15 - INFO - __main__ - Step 62799: {'lr': 0.0003191462264376348, 'samples': 12057408, 'steps': 62798, 'loss/train': 1.2754218578338623} 11/07/2021 06:05:15 - INFO - __main__ - Step 62800: {'lr': 0.0003191411266965408, 'samples': 12057600, 'steps': 62799, 'loss/train': 1.0048216581344604} 11/07/2021 06:05:16 - INFO - __main__ - Step 62801: {'lr': 0.0003191360269242928, 'samples': 12057792, 'steps': 62800, 'loss/train': 1.6289876699447632} 11/07/2021 06:05:16 - INFO - __main__ - Step 62802: {'lr': 0.0003191309271208932, 'samples': 12057984, 'steps': 62801, 'loss/train': 1.4402602910995483} 11/07/2021 06:05:16 - INFO - __main__ - Step 62803: {'lr': 0.0003191258272863443, 'samples': 12058176, 'steps': 62802, 'loss/train': 1.295466661453247} 11/07/2021 06:05:17 - INFO - __main__ - Step 62804: {'lr': 0.0003191207274206484, 'samples': 12058368, 'steps': 62803, 'loss/train': 1.2740763425827026} 11/07/2021 06:05:18 - INFO - __main__ - Step 62805: {'lr': 0.00031911562752380773, 'samples': 12058560, 'steps': 62804, 'loss/train': 1.3469212055206299} 11/07/2021 06:05:18 - INFO - __main__ - Step 62806: {'lr': 0.0003191105275958246, 'samples': 12058752, 'steps': 62805, 'loss/train': 1.6076288223266602} 11/07/2021 06:05:18 - INFO - __main__ - Step 62807: {'lr': 0.00031910542763670136, 'samples': 12058944, 'steps': 62806, 'loss/train': 1.1623677015304565} 11/07/2021 06:05:19 - INFO - __main__ - Step 62808: {'lr': 0.00031910032764644026, 'samples': 12059136, 'steps': 62807, 'loss/train': 1.7273036241531372} 11/07/2021 06:05:19 - INFO - __main__ - Step 62809: {'lr': 0.0003190952276250437, 'samples': 12059328, 'steps': 62808, 'loss/train': 1.5818376541137695} 11/07/2021 06:05:20 - INFO - __main__ - Step 62810: {'lr': 0.00031909012757251376, 'samples': 12059520, 'steps': 62809, 'loss/train': 1.3998363018035889} 11/07/2021 06:05:21 - INFO - __main__ - Step 62811: {'lr': 0.000319085027488853, 'samples': 12059712, 'steps': 62810, 'loss/train': 1.303147554397583} 11/07/2021 06:05:21 - INFO - __main__ - Step 62812: {'lr': 0.0003190799273740635, 'samples': 12059904, 'steps': 62811, 'loss/train': 1.9011322259902954} 11/07/2021 06:05:21 - INFO - __main__ - Step 62813: {'lr': 0.00031907482722814766, 'samples': 12060096, 'steps': 62812, 'loss/train': 1.0940624475479126} 11/07/2021 06:05:22 - INFO - __main__ - Step 62814: {'lr': 0.0003190697270511078, 'samples': 12060288, 'steps': 62813, 'loss/train': 5.792829990386963} 11/07/2021 06:05:22 - INFO - __main__ - Step 62815: {'lr': 0.0003190646268429462, 'samples': 12060480, 'steps': 62814, 'loss/train': 1.5821784734725952} 11/07/2021 06:05:23 - INFO - __main__ - Step 62816: {'lr': 0.00031905952660366514, 'samples': 12060672, 'steps': 62815, 'loss/train': 1.8324247598648071} 11/07/2021 06:05:23 - INFO - __main__ - Step 62817: {'lr': 0.0003190544263332669, 'samples': 12060864, 'steps': 62816, 'loss/train': 1.5756969451904297} 11/07/2021 06:05:24 - INFO - __main__ - Step 62818: {'lr': 0.00031904932603175386, 'samples': 12061056, 'steps': 62817, 'loss/train': 1.7203505039215088} 11/07/2021 06:05:24 - INFO - __main__ - Step 62819: {'lr': 0.00031904422569912816, 'samples': 12061248, 'steps': 62818, 'loss/train': 1.6765302419662476} 11/07/2021 06:05:25 - INFO - __main__ - Step 62820: {'lr': 0.00031903912533539226, 'samples': 12061440, 'steps': 62819, 'loss/train': 1.616826057434082} 11/07/2021 06:05:26 - INFO - __main__ - Step 62821: {'lr': 0.0003190340249405484, 'samples': 12061632, 'steps': 62820, 'loss/train': 1.403430461883545} 11/07/2021 06:05:26 - INFO - __main__ - Step 62822: {'lr': 0.00031902892451459884, 'samples': 12061824, 'steps': 62821, 'loss/train': 1.3646094799041748} 11/07/2021 06:05:26 - INFO - __main__ - Step 62823: {'lr': 0.000319023824057546, 'samples': 12062016, 'steps': 62822, 'loss/train': 1.6588906049728394} 11/07/2021 06:05:27 - INFO - __main__ - Step 62824: {'lr': 0.00031901872356939197, 'samples': 12062208, 'steps': 62823, 'loss/train': 1.3457950353622437} 11/07/2021 06:05:27 - INFO - __main__ - Step 62825: {'lr': 0.00031901362305013925, 'samples': 12062400, 'steps': 62824, 'loss/train': 1.4459965229034424} 11/07/2021 06:05:27 - INFO - __main__ - Step 62826: {'lr': 0.00031900852249979004, 'samples': 12062592, 'steps': 62825, 'loss/train': 1.7885419130325317} 11/07/2021 06:05:29 - INFO - __main__ - Step 62827: {'lr': 0.00031900342191834656, 'samples': 12062784, 'steps': 62826, 'loss/train': 2.0018813610076904} 11/07/2021 06:05:29 - INFO - __main__ - Step 62828: {'lr': 0.0003189983213058113, 'samples': 12062976, 'steps': 62827, 'loss/train': 1.1798408031463623} 11/07/2021 06:05:29 - INFO - __main__ - Step 62829: {'lr': 0.0003189932206621865, 'samples': 12063168, 'steps': 62828, 'loss/train': 1.69442617893219} 11/07/2021 06:05:30 - INFO - __main__ - Step 62830: {'lr': 0.00031898811998747436, 'samples': 12063360, 'steps': 62829, 'loss/train': 1.4957584142684937} 11/07/2021 06:05:30 - INFO - __main__ - Step 62831: {'lr': 0.0003189830192816772, 'samples': 12063552, 'steps': 62830, 'loss/train': 1.353190541267395} 11/07/2021 06:05:31 - INFO - __main__ - Step 62832: {'lr': 0.0003189779185447974, 'samples': 12063744, 'steps': 62831, 'loss/train': 1.1701005697250366} 11/07/2021 06:05:31 - INFO - __main__ - Step 62833: {'lr': 0.0003189728177768372, 'samples': 12063936, 'steps': 62832, 'loss/train': 1.413456678390503} 11/07/2021 06:05:32 - INFO - __main__ - Step 62834: {'lr': 0.00031896771697779893, 'samples': 12064128, 'steps': 62833, 'loss/train': 1.51386296749115} 11/07/2021 06:05:32 - INFO - __main__ - Step 62835: {'lr': 0.00031896261614768485, 'samples': 12064320, 'steps': 62834, 'loss/train': 1.3551669120788574} 11/07/2021 06:05:32 - INFO - __main__ - Step 62836: {'lr': 0.00031895751528649737, 'samples': 12064512, 'steps': 62835, 'loss/train': 2.143911838531494} 11/07/2021 06:05:34 - INFO - __main__ - Step 62837: {'lr': 0.0003189524143942386, 'samples': 12064704, 'steps': 62836, 'loss/train': 1.3425195217132568} 11/07/2021 06:05:34 - INFO - __main__ - Step 62838: {'lr': 0.00031894731347091094, 'samples': 12064896, 'steps': 62837, 'loss/train': 1.6632822751998901} 11/07/2021 06:05:34 - INFO - __main__ - Step 62839: {'lr': 0.00031894221251651666, 'samples': 12065088, 'steps': 62838, 'loss/train': 1.703442096710205} 11/07/2021 06:05:35 - INFO - __main__ - Step 62840: {'lr': 0.00031893711153105814, 'samples': 12065280, 'steps': 62839, 'loss/train': 1.0265618562698364} 11/07/2021 06:05:35 - INFO - __main__ - Step 62841: {'lr': 0.00031893201051453755, 'samples': 12065472, 'steps': 62840, 'loss/train': 1.2318131923675537} 11/07/2021 06:05:36 - INFO - __main__ - Step 62842: {'lr': 0.0003189269094669574, 'samples': 12065664, 'steps': 62841, 'loss/train': 1.3999874591827393} 11/07/2021 06:05:36 - INFO - __main__ - Step 62843: {'lr': 0.0003189218083883197, 'samples': 12065856, 'steps': 62842, 'loss/train': 1.3861138820648193} 11/07/2021 06:05:37 - INFO - __main__ - Step 62844: {'lr': 0.00031891670727862703, 'samples': 12066048, 'steps': 62843, 'loss/train': 0.5687077045440674} 11/07/2021 06:05:37 - INFO - __main__ - Step 62845: {'lr': 0.0003189116061378815, 'samples': 12066240, 'steps': 62844, 'loss/train': 1.6193588972091675} 11/07/2021 06:05:37 - INFO - __main__ - Step 62846: {'lr': 0.0003189065049660854, 'samples': 12066432, 'steps': 62845, 'loss/train': 1.4318444728851318} 11/07/2021 06:05:38 - INFO - __main__ - Step 62847: {'lr': 0.00031890140376324117, 'samples': 12066624, 'steps': 62846, 'loss/train': 1.3463820219039917} 11/07/2021 06:05:39 - INFO - __main__ - Step 62848: {'lr': 0.00031889630252935095, 'samples': 12066816, 'steps': 62847, 'loss/train': 1.4363994598388672} 11/07/2021 06:05:40 - INFO - __main__ - Step 62849: {'lr': 0.0003188912012644172, 'samples': 12067008, 'steps': 62848, 'loss/train': 1.118097186088562} 11/07/2021 06:05:40 - INFO - __main__ - Step 62850: {'lr': 0.00031888609996844216, 'samples': 12067200, 'steps': 62849, 'loss/train': 1.507352352142334} 11/07/2021 06:05:40 - INFO - __main__ - Step 62851: {'lr': 0.0003188809986414281, 'samples': 12067392, 'steps': 62850, 'loss/train': 1.4152538776397705} 11/07/2021 06:05:41 - INFO - __main__ - Step 62852: {'lr': 0.0003188758972833772, 'samples': 12067584, 'steps': 62851, 'loss/train': 1.748613953590393} 11/07/2021 06:05:42 - INFO - __main__ - Step 62853: {'lr': 0.00031887079589429195, 'samples': 12067776, 'steps': 62852, 'loss/train': 1.2790157794952393} 11/07/2021 06:05:42 - INFO - __main__ - Step 62854: {'lr': 0.00031886569447417456, 'samples': 12067968, 'steps': 62853, 'loss/train': 1.3689525127410889} 11/07/2021 06:05:42 - INFO - __main__ - Step 62855: {'lr': 0.0003188605930230274, 'samples': 12068160, 'steps': 62854, 'loss/train': 1.283076286315918} 11/07/2021 06:05:43 - INFO - __main__ - Step 62856: {'lr': 0.00031885549154085283, 'samples': 12068352, 'steps': 62855, 'loss/train': 1.3265925645828247} 11/07/2021 06:05:43 - INFO - __main__ - Step 62857: {'lr': 0.0003188503900276529, 'samples': 12068544, 'steps': 62856, 'loss/train': 1.6590124368667603} 11/07/2021 06:05:44 - INFO - __main__ - Step 62858: {'lr': 0.00031884528848342996, 'samples': 12068736, 'steps': 62857, 'loss/train': 1.5741198062896729} 11/07/2021 06:05:45 - INFO - __main__ - Step 62859: {'lr': 0.0003188401869081865, 'samples': 12068928, 'steps': 62858, 'loss/train': 1.6346012353897095} 11/07/2021 06:05:45 - INFO - __main__ - Step 62860: {'lr': 0.0003188350853019247, 'samples': 12069120, 'steps': 62859, 'loss/train': 1.4630694389343262} 11/07/2021 06:05:45 - INFO - __main__ - Step 62861: {'lr': 0.0003188299836646469, 'samples': 12069312, 'steps': 62860, 'loss/train': 1.4791243076324463} 11/07/2021 06:05:46 - INFO - __main__ - Step 62862: {'lr': 0.00031882488199635534, 'samples': 12069504, 'steps': 62861, 'loss/train': 1.8343448638916016} 11/07/2021 06:05:47 - INFO - __main__ - Step 62863: {'lr': 0.0003188197802970524, 'samples': 12069696, 'steps': 62862, 'loss/train': 1.5424829721450806} 11/07/2021 06:05:47 - INFO - __main__ - Step 62864: {'lr': 0.0003188146785667403, 'samples': 12069888, 'steps': 62863, 'loss/train': 1.3602969646453857} 11/07/2021 06:05:48 - INFO - __main__ - Step 62865: {'lr': 0.0003188095768054214, 'samples': 12070080, 'steps': 62864, 'loss/train': 0.9886502027511597} 11/07/2021 06:05:48 - INFO - __main__ - Step 62866: {'lr': 0.00031880447501309787, 'samples': 12070272, 'steps': 62865, 'loss/train': 1.5733585357666016} 11/07/2021 06:05:48 - INFO - __main__ - Step 62867: {'lr': 0.00031879937318977214, 'samples': 12070464, 'steps': 62866, 'loss/train': 1.4942855834960938} 11/07/2021 06:05:49 - INFO - __main__ - Step 62868: {'lr': 0.0003187942713354465, 'samples': 12070656, 'steps': 62867, 'loss/train': 1.27978515625} 11/07/2021 06:05:50 - INFO - __main__ - Step 62869: {'lr': 0.00031878916945012324, 'samples': 12070848, 'steps': 62868, 'loss/train': 2.9820356369018555} 11/07/2021 06:05:50 - INFO - __main__ - Step 62870: {'lr': 0.0003187840675338047, 'samples': 12071040, 'steps': 62869, 'loss/train': 1.7443702220916748} 11/07/2021 06:05:50 - INFO - __main__ - Step 62871: {'lr': 0.000318778965586493, 'samples': 12071232, 'steps': 62870, 'loss/train': 1.7695730924606323} 11/07/2021 06:05:51 - INFO - __main__ - Step 62872: {'lr': 0.0003187738636081906, 'samples': 12071424, 'steps': 62871, 'loss/train': 0.773431122303009} 11/07/2021 06:05:51 - INFO - __main__ - Step 62873: {'lr': 0.00031876876159889976, 'samples': 12071616, 'steps': 62872, 'loss/train': 1.6220970153808594} 11/07/2021 06:05:52 - INFO - __main__ - Step 62874: {'lr': 0.00031876365955862273, 'samples': 12071808, 'steps': 62873, 'loss/train': 1.5574958324432373} 11/07/2021 06:05:52 - INFO - __main__ - Step 62875: {'lr': 0.0003187585574873619, 'samples': 12072000, 'steps': 62874, 'loss/train': 1.6196461915969849} 11/07/2021 06:05:53 - INFO - __main__ - Step 62876: {'lr': 0.00031875345538511955, 'samples': 12072192, 'steps': 62875, 'loss/train': 1.6090736389160156} 11/07/2021 06:05:53 - INFO - __main__ - Step 62877: {'lr': 0.000318748353251898, 'samples': 12072384, 'steps': 62876, 'loss/train': 1.2574700117111206} 11/07/2021 06:05:53 - INFO - __main__ - Step 62878: {'lr': 0.00031874325108769943, 'samples': 12072576, 'steps': 62877, 'loss/train': 1.4264516830444336} 11/07/2021 06:05:54 - INFO - __main__ - Step 62879: {'lr': 0.0003187381488925262, 'samples': 12072768, 'steps': 62878, 'loss/train': 1.451191782951355} 11/07/2021 06:05:55 - INFO - __main__ - Step 62880: {'lr': 0.0003187330466663806, 'samples': 12072960, 'steps': 62879, 'loss/train': 1.2697819471359253} 11/07/2021 06:05:55 - INFO - __main__ - Step 62881: {'lr': 0.000318727944409265, 'samples': 12073152, 'steps': 62880, 'loss/train': 1.6496285200119019} 11/07/2021 06:05:56 - INFO - __main__ - Step 62882: {'lr': 0.0003187228421211816, 'samples': 12073344, 'steps': 62881, 'loss/train': 1.1297719478607178} 11/07/2021 06:05:56 - INFO - __main__ - Step 62883: {'lr': 0.00031871773980213285, 'samples': 12073536, 'steps': 62882, 'loss/train': 1.6314855813980103} 11/07/2021 06:05:56 - INFO - __main__ - Step 62884: {'lr': 0.0003187126374521209, 'samples': 12073728, 'steps': 62883, 'loss/train': 1.523177981376648} 11/07/2021 06:05:57 - INFO - __main__ - Step 62885: {'lr': 0.00031870753507114803, 'samples': 12073920, 'steps': 62884, 'loss/train': 1.2236073017120361} 11/07/2021 06:05:58 - INFO - __main__ - Step 62886: {'lr': 0.0003187024326592167, 'samples': 12074112, 'steps': 62885, 'loss/train': 1.5062845945358276} 11/07/2021 06:05:58 - INFO - __main__ - Step 62887: {'lr': 0.000318697330216329, 'samples': 12074304, 'steps': 62886, 'loss/train': 1.5799016952514648} 11/07/2021 06:05:58 - INFO - __main__ - Step 62888: {'lr': 0.0003186922277424874, 'samples': 12074496, 'steps': 62887, 'loss/train': 1.3866292238235474} 11/07/2021 06:05:59 - INFO - __main__ - Step 62889: {'lr': 0.00031868712523769425, 'samples': 12074688, 'steps': 62888, 'loss/train': 1.5198187828063965} 11/07/2021 06:06:00 - INFO - __main__ - Step 62890: {'lr': 0.00031868202270195163, 'samples': 12074880, 'steps': 62889, 'loss/train': 1.3592770099639893} 11/07/2021 06:06:00 - INFO - __main__ - Step 62891: {'lr': 0.0003186769201352619, 'samples': 12075072, 'steps': 62890, 'loss/train': 1.1795319318771362} 11/07/2021 06:06:00 - INFO - __main__ - Step 62892: {'lr': 0.0003186718175376275, 'samples': 12075264, 'steps': 62891, 'loss/train': 1.3074475526809692} 11/07/2021 06:06:01 - INFO - __main__ - Step 62893: {'lr': 0.0003186667149090506, 'samples': 12075456, 'steps': 62892, 'loss/train': 1.4222569465637207} 11/07/2021 06:06:01 - INFO - __main__ - Step 62894: {'lr': 0.00031866161224953355, 'samples': 12075648, 'steps': 62893, 'loss/train': 0.2956567704677582} 11/07/2021 06:06:02 - INFO - __main__ - Step 62895: {'lr': 0.0003186565095590786, 'samples': 12075840, 'steps': 62894, 'loss/train': 1.2765796184539795} 11/07/2021 06:06:02 - INFO - __main__ - Step 62896: {'lr': 0.0003186514068376882, 'samples': 12076032, 'steps': 62895, 'loss/train': 1.4188268184661865} 11/07/2021 06:06:03 - INFO - __main__ - Step 62897: {'lr': 0.00031864630408536443, 'samples': 12076224, 'steps': 62896, 'loss/train': 1.9744813442230225} 11/07/2021 06:06:03 - INFO - __main__ - Step 62898: {'lr': 0.00031864120130210973, 'samples': 12076416, 'steps': 62897, 'loss/train': 1.2005726099014282} 11/07/2021 06:06:03 - INFO - __main__ - Step 62899: {'lr': 0.00031863609848792633, 'samples': 12076608, 'steps': 62898, 'loss/train': 1.2968778610229492} 11/07/2021 06:06:05 - INFO - __main__ - Step 62900: {'lr': 0.0003186309956428166, 'samples': 12076800, 'steps': 62899, 'loss/train': 1.371687412261963} 11/07/2021 06:06:06 - INFO - __main__ - Step 62901: {'lr': 0.00031862589276678276, 'samples': 12076992, 'steps': 62900, 'loss/train': 1.4554340839385986} 11/07/2021 06:06:06 - INFO - __main__ - Step 62902: {'lr': 0.00031862078985982716, 'samples': 12077184, 'steps': 62901, 'loss/train': 3.637113571166992} 11/07/2021 06:06:06 - INFO - __main__ - Step 62903: {'lr': 0.0003186156869219522, 'samples': 12077376, 'steps': 62902, 'loss/train': 2.1794629096984863} 11/07/2021 06:06:07 - INFO - __main__ - Step 62904: {'lr': 0.00031861058395316, 'samples': 12077568, 'steps': 62903, 'loss/train': 1.4042840003967285} 11/07/2021 06:06:07 - INFO - __main__ - Step 62905: {'lr': 0.00031860548095345286, 'samples': 12077760, 'steps': 62904, 'loss/train': 1.2945623397827148} 11/07/2021 06:06:08 - INFO - __main__ - Step 62906: {'lr': 0.0003186003779228332, 'samples': 12077952, 'steps': 62905, 'loss/train': 1.1992005109786987} 11/07/2021 06:06:08 - INFO - __main__ - Step 62907: {'lr': 0.0003185952748613033, 'samples': 12078144, 'steps': 62906, 'loss/train': 1.2004752159118652} 11/07/2021 06:06:09 - INFO - __main__ - Step 62908: {'lr': 0.0003185901717688654, 'samples': 12078336, 'steps': 62907, 'loss/train': 0.991555392742157} 11/07/2021 06:06:09 - INFO - __main__ - Step 62909: {'lr': 0.0003185850686455218, 'samples': 12078528, 'steps': 62908, 'loss/train': 1.4151902198791504} 11/07/2021 06:06:10 - INFO - __main__ - Step 62910: {'lr': 0.00031857996549127486, 'samples': 12078720, 'steps': 62909, 'loss/train': 1.1773803234100342} 11/07/2021 06:06:10 - INFO - __main__ - Step 62911: {'lr': 0.00031857486230612686, 'samples': 12078912, 'steps': 62910, 'loss/train': 1.8096431493759155} 11/07/2021 06:06:11 - INFO - __main__ - Step 62912: {'lr': 0.00031856975909008007, 'samples': 12079104, 'steps': 62911, 'loss/train': 1.7013487815856934} 11/07/2021 06:06:11 - INFO - __main__ - Step 62913: {'lr': 0.00031856465584313676, 'samples': 12079296, 'steps': 62912, 'loss/train': 1.644162654876709} 11/07/2021 06:06:12 - INFO - __main__ - Step 62914: {'lr': 0.00031855955256529934, 'samples': 12079488, 'steps': 62913, 'loss/train': 1.4057717323303223} 11/07/2021 06:06:12 - INFO - __main__ - Step 62915: {'lr': 0.00031855444925656996, 'samples': 12079680, 'steps': 62914, 'loss/train': 1.141312599182129} 11/07/2021 06:06:13 - INFO - __main__ - Step 62916: {'lr': 0.0003185493459169511, 'samples': 12079872, 'steps': 62915, 'loss/train': 1.4969638586044312} 11/07/2021 06:06:13 - INFO - __main__ - Step 62917: {'lr': 0.00031854424254644493, 'samples': 12080064, 'steps': 62916, 'loss/train': 1.3744603395462036} 11/07/2021 06:06:14 - INFO - __main__ - Step 62918: {'lr': 0.0003185391391450538, 'samples': 12080256, 'steps': 62917, 'loss/train': 0.8720232248306274} 11/07/2021 06:06:14 - INFO - __main__ - Step 62919: {'lr': 0.00031853403571277994, 'samples': 12080448, 'steps': 62918, 'loss/train': 1.3518121242523193} 11/07/2021 06:06:14 - INFO - __main__ - Step 62920: {'lr': 0.0003185289322496257, 'samples': 12080640, 'steps': 62919, 'loss/train': 1.2877914905548096} 11/07/2021 06:06:15 - INFO - __main__ - Step 62921: {'lr': 0.0003185238287555934, 'samples': 12080832, 'steps': 62920, 'loss/train': 1.7885470390319824} 11/07/2021 06:06:16 - INFO - __main__ - Step 62922: {'lr': 0.00031851872523068535, 'samples': 12081024, 'steps': 62921, 'loss/train': 1.3362536430358887} 11/07/2021 06:06:16 - INFO - __main__ - Step 62923: {'lr': 0.0003185136216749038, 'samples': 12081216, 'steps': 62922, 'loss/train': 1.8386307954788208} 11/07/2021 06:06:16 - INFO - __main__ - Step 62924: {'lr': 0.00031850851808825107, 'samples': 12081408, 'steps': 62923, 'loss/train': 1.5057613849639893} 11/07/2021 06:06:17 - INFO - __main__ - Step 62925: {'lr': 0.0003185034144707294, 'samples': 12081600, 'steps': 62924, 'loss/train': 1.1500089168548584} 11/07/2021 06:06:17 - INFO - __main__ - Step 62926: {'lr': 0.00031849831082234124, 'samples': 12081792, 'steps': 62925, 'loss/train': 1.8149229288101196} 11/07/2021 06:06:18 - INFO - __main__ - Step 62927: {'lr': 0.0003184932071430888, 'samples': 12081984, 'steps': 62926, 'loss/train': 1.3818577527999878} 11/07/2021 06:06:18 - INFO - __main__ - Step 62928: {'lr': 0.00031848810343297433, 'samples': 12082176, 'steps': 62927, 'loss/train': 1.6038484573364258} 11/07/2021 06:06:19 - INFO - __main__ - Step 62929: {'lr': 0.0003184829996920002, 'samples': 12082368, 'steps': 62928, 'loss/train': 1.3935163021087646} 11/07/2021 06:06:19 - INFO - __main__ - Step 62930: {'lr': 0.0003184778959201687, 'samples': 12082560, 'steps': 62929, 'loss/train': 1.081667184829712} 11/07/2021 06:06:19 - INFO - __main__ - Step 62931: {'lr': 0.00031847279211748205, 'samples': 12082752, 'steps': 62930, 'loss/train': 1.1576759815216064} 11/07/2021 06:06:21 - INFO - __main__ - Step 62932: {'lr': 0.00031846768828394266, 'samples': 12082944, 'steps': 62931, 'loss/train': 1.6442691087722778} 11/07/2021 06:06:21 - INFO - __main__ - Step 62933: {'lr': 0.00031846258441955283, 'samples': 12083136, 'steps': 62932, 'loss/train': 1.7579379081726074} 11/07/2021 06:06:21 - INFO - __main__ - Step 62934: {'lr': 0.0003184574805243148, 'samples': 12083328, 'steps': 62933, 'loss/train': 1.4538865089416504} 11/07/2021 06:06:22 - INFO - __main__ - Step 62935: {'lr': 0.0003184523765982308, 'samples': 12083520, 'steps': 62934, 'loss/train': 2.084916591644287} 11/07/2021 06:06:22 - INFO - __main__ - Step 62936: {'lr': 0.0003184472726413032, 'samples': 12083712, 'steps': 62935, 'loss/train': 1.6213289499282837} 11/07/2021 06:06:23 - INFO - __main__ - Step 62937: {'lr': 0.00031844216865353444, 'samples': 12083904, 'steps': 62936, 'loss/train': 1.4477144479751587} 11/07/2021 06:06:23 - INFO - __main__ - Step 62938: {'lr': 0.0003184370646349267, 'samples': 12084096, 'steps': 62937, 'loss/train': 1.6457769870758057} 11/07/2021 06:06:24 - INFO - __main__ - Step 62939: {'lr': 0.0003184319605854822, 'samples': 12084288, 'steps': 62938, 'loss/train': 1.7711384296417236} 11/07/2021 06:06:24 - INFO - __main__ - Step 62940: {'lr': 0.0003184268565052033, 'samples': 12084480, 'steps': 62939, 'loss/train': 0.8980353474617004} 11/07/2021 06:06:24 - INFO - __main__ - Step 62941: {'lr': 0.00031842175239409233, 'samples': 12084672, 'steps': 62940, 'loss/train': 1.3622828722000122} 11/07/2021 06:06:25 - INFO - __main__ - Step 62942: {'lr': 0.00031841664825215163, 'samples': 12084864, 'steps': 62941, 'loss/train': 0.8932542204856873} 11/07/2021 06:06:26 - INFO - __main__ - Step 62943: {'lr': 0.0003184115440793834, 'samples': 12085056, 'steps': 62942, 'loss/train': 1.8229278326034546} 11/07/2021 06:06:26 - INFO - __main__ - Step 62944: {'lr': 0.00031840643987579, 'samples': 12085248, 'steps': 62943, 'loss/train': 1.0217106342315674} 11/07/2021 06:06:26 - INFO - __main__ - Step 62945: {'lr': 0.0003184013356413737, 'samples': 12085440, 'steps': 62944, 'loss/train': 1.432934284210205} 11/07/2021 06:06:27 - INFO - __main__ - Step 62946: {'lr': 0.0003183962313761368, 'samples': 12085632, 'steps': 62945, 'loss/train': 1.5045151710510254} 11/07/2021 06:06:28 - INFO - __main__ - Step 62947: {'lr': 0.0003183911270800816, 'samples': 12085824, 'steps': 62946, 'loss/train': 1.1433345079421997} 11/07/2021 06:06:28 - INFO - __main__ - Step 62948: {'lr': 0.00031838602275321043, 'samples': 12086016, 'steps': 62947, 'loss/train': 1.588500738143921} 11/07/2021 06:06:29 - INFO - __main__ - Step 62949: {'lr': 0.00031838091839552564, 'samples': 12086208, 'steps': 62948, 'loss/train': 1.0964946746826172} 11/07/2021 06:06:29 - INFO - __main__ - Step 62950: {'lr': 0.0003183758140070294, 'samples': 12086400, 'steps': 62949, 'loss/train': 1.549712061882019} 11/07/2021 06:06:29 - INFO - __main__ - Step 62951: {'lr': 0.0003183707095877241, 'samples': 12086592, 'steps': 62950, 'loss/train': 1.7639520168304443} 11/07/2021 06:06:30 - INFO - __main__ - Step 62952: {'lr': 0.000318365605137612, 'samples': 12086784, 'steps': 62951, 'loss/train': 1.5123714208602905} 11/07/2021 06:06:31 - INFO - __main__ - Step 62953: {'lr': 0.00031836050065669536, 'samples': 12086976, 'steps': 62952, 'loss/train': 1.3155418634414673} 11/07/2021 06:06:31 - INFO - __main__ - Step 62954: {'lr': 0.00031835539614497656, 'samples': 12087168, 'steps': 62953, 'loss/train': 2.0044779777526855} 11/07/2021 06:06:32 - INFO - __main__ - Step 62955: {'lr': 0.00031835029160245785, 'samples': 12087360, 'steps': 62954, 'loss/train': 1.3633402585983276} 11/07/2021 06:06:32 - INFO - __main__ - Step 62956: {'lr': 0.0003183451870291416, 'samples': 12087552, 'steps': 62955, 'loss/train': 1.1600128412246704} 11/07/2021 06:06:32 - INFO - __main__ - Step 62957: {'lr': 0.00031834008242503014, 'samples': 12087744, 'steps': 62956, 'loss/train': 1.4359625577926636} 11/07/2021 06:06:33 - INFO - __main__ - Step 62958: {'lr': 0.0003183349777901256, 'samples': 12087936, 'steps': 62957, 'loss/train': 1.6359970569610596} 11/07/2021 06:06:34 - INFO - __main__ - Step 62959: {'lr': 0.0003183298731244304, 'samples': 12088128, 'steps': 62958, 'loss/train': 1.7613108158111572} 11/07/2021 06:06:34 - INFO - __main__ - Step 62960: {'lr': 0.0003183247684279468, 'samples': 12088320, 'steps': 62959, 'loss/train': 1.4047390222549438} 11/07/2021 06:06:34 - INFO - __main__ - Step 62961: {'lr': 0.0003183196637006771, 'samples': 12088512, 'steps': 62960, 'loss/train': 0.9882500767707825} 11/07/2021 06:06:35 - INFO - __main__ - Step 62962: {'lr': 0.0003183145589426236, 'samples': 12088704, 'steps': 62961, 'loss/train': 1.5768455266952515} 11/07/2021 06:06:36 - INFO - __main__ - Step 62963: {'lr': 0.0003183094541537887, 'samples': 12088896, 'steps': 62962, 'loss/train': 1.0958646535873413} 11/07/2021 06:06:36 - INFO - __main__ - Step 62964: {'lr': 0.0003183043493341746, 'samples': 12089088, 'steps': 62963, 'loss/train': 0.23138955235481262} 11/07/2021 06:06:36 - INFO - __main__ - Step 62965: {'lr': 0.0003182992444837835, 'samples': 12089280, 'steps': 62964, 'loss/train': 1.3828896284103394} 11/07/2021 06:06:37 - INFO - __main__ - Step 62966: {'lr': 0.0003182941396026179, 'samples': 12089472, 'steps': 62965, 'loss/train': 0.31182658672332764} 11/07/2021 06:06:37 - INFO - __main__ - Step 62967: {'lr': 0.00031828903469068, 'samples': 12089664, 'steps': 62966, 'loss/train': 1.12623131275177} 11/07/2021 06:06:38 - INFO - __main__ - Step 62968: {'lr': 0.0003182839297479721, 'samples': 12089856, 'steps': 62967, 'loss/train': 1.9121274948120117} 11/07/2021 06:06:39 - INFO - __main__ - Step 62969: {'lr': 0.00031827882477449655, 'samples': 12090048, 'steps': 62968, 'loss/train': 0.5431030988693237} 11/07/2021 06:06:39 - INFO - __main__ - Step 62970: {'lr': 0.0003182737197702556, 'samples': 12090240, 'steps': 62969, 'loss/train': 1.2795430421829224} 11/07/2021 06:06:39 - INFO - __main__ - Step 62971: {'lr': 0.00031826861473525155, 'samples': 12090432, 'steps': 62970, 'loss/train': 1.7520716190338135} 11/07/2021 06:06:40 - INFO - __main__ - Step 62972: {'lr': 0.0003182635096694867, 'samples': 12090624, 'steps': 62971, 'loss/train': 1.174762487411499} 11/07/2021 06:06:40 - INFO - __main__ - Step 62973: {'lr': 0.0003182584045729634, 'samples': 12090816, 'steps': 62972, 'loss/train': 1.136440396308899} 11/07/2021 06:06:41 - INFO - __main__ - Step 62974: {'lr': 0.0003182532994456839, 'samples': 12091008, 'steps': 62973, 'loss/train': 1.159808874130249} 11/07/2021 06:06:42 - INFO - __main__ - Step 62975: {'lr': 0.0003182481942876505, 'samples': 12091200, 'steps': 62974, 'loss/train': 1.6123169660568237} 11/07/2021 06:06:42 - INFO - __main__ - Step 62976: {'lr': 0.00031824308909886556, 'samples': 12091392, 'steps': 62975, 'loss/train': 1.6030811071395874} 11/07/2021 06:06:42 - INFO - __main__ - Step 62977: {'lr': 0.00031823798387933133, 'samples': 12091584, 'steps': 62976, 'loss/train': 1.5259240865707397} 11/07/2021 06:06:43 - INFO - __main__ - Step 62978: {'lr': 0.00031823287862905016, 'samples': 12091776, 'steps': 62977, 'loss/train': 1.473885178565979} 11/07/2021 06:06:44 - INFO - __main__ - Step 62979: {'lr': 0.0003182277733480242, 'samples': 12091968, 'steps': 62978, 'loss/train': 1.318306803703308} 11/07/2021 06:06:44 - INFO - __main__ - Step 62980: {'lr': 0.0003182226680362559, 'samples': 12092160, 'steps': 62979, 'loss/train': 1.0462079048156738} 11/07/2021 06:06:45 - INFO - __main__ - Step 62981: {'lr': 0.00031821756269374753, 'samples': 12092352, 'steps': 62980, 'loss/train': 0.40875178575515747} 11/07/2021 06:06:45 - INFO - __main__ - Step 62982: {'lr': 0.00031821245732050136, 'samples': 12092544, 'steps': 62981, 'loss/train': 0.20984575152397156} 11/07/2021 06:06:45 - INFO - __main__ - Step 62983: {'lr': 0.0003182073519165197, 'samples': 12092736, 'steps': 62982, 'loss/train': 1.2318884134292603} 11/07/2021 06:06:47 - INFO - __main__ - Step 62984: {'lr': 0.000318202246481805, 'samples': 12092928, 'steps': 62983, 'loss/train': 1.0008355379104614} 11/07/2021 06:06:47 - INFO - __main__ - Step 62985: {'lr': 0.0003181971410163593, 'samples': 12093120, 'steps': 62984, 'loss/train': 1.7314125299453735} 11/07/2021 06:06:47 - INFO - __main__ - Step 62986: {'lr': 0.000318192035520185, 'samples': 12093312, 'steps': 62985, 'loss/train': 0.9711993336677551} 11/07/2021 06:06:48 - INFO - __main__ - Step 62987: {'lr': 0.0003181869299932844, 'samples': 12093504, 'steps': 62986, 'loss/train': 1.233894944190979} 11/07/2021 06:06:48 - INFO - __main__ - Step 62988: {'lr': 0.0003181818244356599, 'samples': 12093696, 'steps': 62987, 'loss/train': 1.2984248399734497} 11/07/2021 06:06:49 - INFO - __main__ - Step 62989: {'lr': 0.0003181767188473137, 'samples': 12093888, 'steps': 62988, 'loss/train': 1.5789108276367188} 11/07/2021 06:06:49 - INFO - __main__ - Step 62990: {'lr': 0.00031817161322824814, 'samples': 12094080, 'steps': 62989, 'loss/train': 1.7289677858352661} 11/07/2021 06:06:50 - INFO - __main__ - Step 62991: {'lr': 0.0003181665075784654, 'samples': 12094272, 'steps': 62990, 'loss/train': 1.2297018766403198} 11/07/2021 06:06:50 - INFO - __main__ - Step 62992: {'lr': 0.00031816140189796805, 'samples': 12094464, 'steps': 62991, 'loss/train': 1.4172545671463013} 11/07/2021 06:06:51 - INFO - __main__ - Step 62993: {'lr': 0.0003181562961867581, 'samples': 12094656, 'steps': 62992, 'loss/train': 1.7168022394180298} 11/07/2021 06:06:51 - INFO - __main__ - Step 62994: {'lr': 0.000318151190444838, 'samples': 12094848, 'steps': 62993, 'loss/train': 1.4341708421707153} 11/07/2021 06:06:52 - INFO - __main__ - Step 62995: {'lr': 0.00031814608467221005, 'samples': 12095040, 'steps': 62994, 'loss/train': 1.3349941968917847} 11/07/2021 06:06:53 - INFO - __main__ - Step 62996: {'lr': 0.0003181409788688765, 'samples': 12095232, 'steps': 62995, 'loss/train': 1.2072203159332275} 11/07/2021 06:06:53 - INFO - __main__ - Step 62997: {'lr': 0.0003181358730348397, 'samples': 12095424, 'steps': 62996, 'loss/train': 1.4630333185195923} 11/07/2021 06:06:53 - INFO - __main__ - Step 62998: {'lr': 0.00031813076717010193, 'samples': 12095616, 'steps': 62997, 'loss/train': 1.6724992990493774} 11/07/2021 06:06:54 - INFO - __main__ - Step 62999: {'lr': 0.00031812566127466545, 'samples': 12095808, 'steps': 62998, 'loss/train': 1.7105885744094849} 11/07/2021 06:06:54 - INFO - __main__ - Step 63000: {'lr': 0.00031812055534853265, 'samples': 12096000, 'steps': 62999, 'loss/train': 1.686954140663147} 11/07/2021 06:06:55 - INFO - __main__ - Step 63001: {'lr': 0.0003181154493917057, 'samples': 12096192, 'steps': 63000, 'loss/train': 2.1113829612731934} 11/07/2021 06:06:56 - INFO - __main__ - Step 63002: {'lr': 0.00031811034340418706, 'samples': 12096384, 'steps': 63001, 'loss/train': 1.480530023574829} 11/07/2021 06:06:56 - INFO - __main__ - Step 63003: {'lr': 0.00031810523738597893, 'samples': 12096576, 'steps': 63002, 'loss/train': 1.1180436611175537} 11/07/2021 06:06:56 - INFO - __main__ - Step 63004: {'lr': 0.0003181001313370836, 'samples': 12096768, 'steps': 63003, 'loss/train': 1.7363084554672241} 11/07/2021 06:06:57 - INFO - __main__ - Step 63005: {'lr': 0.00031809502525750346, 'samples': 12096960, 'steps': 63004, 'loss/train': 1.370914340019226} 11/07/2021 06:06:57 - INFO - __main__ - Step 63006: {'lr': 0.0003180899191472407, 'samples': 12097152, 'steps': 63005, 'loss/train': 1.246711254119873} 11/07/2021 06:06:57 - INFO - __main__ - Step 63007: {'lr': 0.00031808481300629765, 'samples': 12097344, 'steps': 63006, 'loss/train': 1.4653093814849854} 11/07/2021 06:06:59 - INFO - __main__ - Step 63008: {'lr': 0.0003180797068346767, 'samples': 12097536, 'steps': 63007, 'loss/train': 0.08288740366697311} 11/07/2021 06:06:59 - INFO - __main__ - Step 63009: {'lr': 0.00031807460063238005, 'samples': 12097728, 'steps': 63008, 'loss/train': 1.3861093521118164} 11/07/2021 06:06:59 - INFO - __main__ - Step 63010: {'lr': 0.00031806949439941006, 'samples': 12097920, 'steps': 63009, 'loss/train': 1.408057689666748} 11/07/2021 06:07:00 - INFO - __main__ - Step 63011: {'lr': 0.000318064388135769, 'samples': 12098112, 'steps': 63010, 'loss/train': 1.5917364358901978} 11/07/2021 06:07:00 - INFO - __main__ - Step 63012: {'lr': 0.00031805928184145917, 'samples': 12098304, 'steps': 63011, 'loss/train': 1.4174437522888184} 11/07/2021 06:07:00 - INFO - __main__ - Step 63013: {'lr': 0.00031805417551648287, 'samples': 12098496, 'steps': 63012, 'loss/train': 1.7917883396148682} 11/07/2021 06:07:01 - INFO - __main__ - Step 63014: {'lr': 0.00031804906916084235, 'samples': 12098688, 'steps': 63013, 'loss/train': 5.927755355834961} 11/07/2021 06:07:02 - INFO - __main__ - Step 63015: {'lr': 0.00031804396277454005, 'samples': 12098880, 'steps': 63014, 'loss/train': 1.2406262159347534} 11/07/2021 06:07:02 - INFO - __main__ - Step 63016: {'lr': 0.0003180388563575782, 'samples': 12099072, 'steps': 63015, 'loss/train': 1.499419093132019} 11/07/2021 06:07:02 - INFO - __main__ - Step 63017: {'lr': 0.0003180337499099591, 'samples': 12099264, 'steps': 63016, 'loss/train': 1.5423909425735474} 11/07/2021 06:07:03 - INFO - __main__ - Step 63018: {'lr': 0.000318028643431685, 'samples': 12099456, 'steps': 63017, 'loss/train': 1.3199553489685059} 11/07/2021 06:07:04 - INFO - __main__ - Step 63019: {'lr': 0.0003180235369227582, 'samples': 12099648, 'steps': 63018, 'loss/train': 1.7403062582015991} 11/07/2021 06:07:04 - INFO - __main__ - Step 63020: {'lr': 0.0003180184303831811, 'samples': 12099840, 'steps': 63019, 'loss/train': 1.4302176237106323} 11/07/2021 06:07:05 - INFO - __main__ - Step 63021: {'lr': 0.0003180133238129559, 'samples': 12100032, 'steps': 63020, 'loss/train': 1.3424339294433594} 11/07/2021 06:07:05 - INFO - __main__ - Step 63022: {'lr': 0.000318008217212085, 'samples': 12100224, 'steps': 63021, 'loss/train': 0.9812575578689575} 11/07/2021 06:07:05 - INFO - __main__ - Step 63023: {'lr': 0.00031800311058057066, 'samples': 12100416, 'steps': 63022, 'loss/train': 1.595754861831665} 11/07/2021 06:07:06 - INFO - __main__ - Step 63024: {'lr': 0.0003179980039184152, 'samples': 12100608, 'steps': 63023, 'loss/train': 1.4596023559570312} 11/07/2021 06:07:07 - INFO - __main__ - Step 63025: {'lr': 0.00031799289722562075, 'samples': 12100800, 'steps': 63024, 'loss/train': 1.0298255681991577} 11/07/2021 06:07:07 - INFO - __main__ - Step 63026: {'lr': 0.00031798779050218985, 'samples': 12100992, 'steps': 63025, 'loss/train': 1.2846009731292725} 11/07/2021 06:07:07 - INFO - __main__ - Step 63027: {'lr': 0.00031798268374812465, 'samples': 12101184, 'steps': 63026, 'loss/train': 1.4373846054077148} 11/07/2021 06:07:08 - INFO - __main__ - Step 63028: {'lr': 0.00031797757696342755, 'samples': 12101376, 'steps': 63027, 'loss/train': 1.1676311492919922} 11/07/2021 06:07:08 - INFO - __main__ - Step 63029: {'lr': 0.0003179724701481007, 'samples': 12101568, 'steps': 63028, 'loss/train': 1.9477519989013672} 11/07/2021 06:07:09 - INFO - __main__ - Step 63030: {'lr': 0.0003179673633021466, 'samples': 12101760, 'steps': 63029, 'loss/train': 1.6473277807235718} 11/07/2021 06:07:09 - INFO - __main__ - Step 63031: {'lr': 0.00031796225642556755, 'samples': 12101952, 'steps': 63030, 'loss/train': 1.4235305786132812} 11/07/2021 06:07:10 - INFO - __main__ - Step 63032: {'lr': 0.0003179571495183656, 'samples': 12102144, 'steps': 63031, 'loss/train': 1.809062123298645} 11/07/2021 06:07:10 - INFO - __main__ - Step 63033: {'lr': 0.00031795204258054324, 'samples': 12102336, 'steps': 63032, 'loss/train': 1.4413336515426636} 11/07/2021 06:07:11 - INFO - __main__ - Step 63034: {'lr': 0.00031794693561210276, 'samples': 12102528, 'steps': 63033, 'loss/train': 1.1605788469314575} 11/07/2021 06:07:12 - INFO - __main__ - Step 63035: {'lr': 0.00031794182861304637, 'samples': 12102720, 'steps': 63034, 'loss/train': 0.6967434287071228} 11/07/2021 06:07:12 - INFO - __main__ - Step 63036: {'lr': 0.0003179367215833765, 'samples': 12102912, 'steps': 63035, 'loss/train': 1.681667685508728} 11/07/2021 06:07:12 - INFO - __main__ - Step 63037: {'lr': 0.00031793161452309547, 'samples': 12103104, 'steps': 63036, 'loss/train': 1.186102271080017} 11/07/2021 06:07:13 - INFO - __main__ - Step 63038: {'lr': 0.0003179265074322054, 'samples': 12103296, 'steps': 63037, 'loss/train': 1.4839084148406982} 11/07/2021 06:07:13 - INFO - __main__ - Step 63039: {'lr': 0.0003179214003107087, 'samples': 12103488, 'steps': 63038, 'loss/train': 1.171899676322937} 11/07/2021 06:07:14 - INFO - __main__ - Step 63040: {'lr': 0.0003179162931586077, 'samples': 12103680, 'steps': 63039, 'loss/train': 1.4078123569488525} 11/07/2021 06:07:14 - INFO - __main__ - Step 63041: {'lr': 0.00031791118597590464, 'samples': 12103872, 'steps': 63040, 'loss/train': 1.5161457061767578} 11/07/2021 06:07:15 - INFO - __main__ - Step 63042: {'lr': 0.00031790607876260187, 'samples': 12104064, 'steps': 63041, 'loss/train': 1.2884504795074463} 11/07/2021 06:07:15 - INFO - __main__ - Step 63043: {'lr': 0.0003179009715187016, 'samples': 12104256, 'steps': 63042, 'loss/train': 1.2852283716201782} 11/07/2021 06:07:15 - INFO - __main__ - Step 63044: {'lr': 0.00031789586424420637, 'samples': 12104448, 'steps': 63043, 'loss/train': 1.026237964630127} 11/07/2021 06:07:16 - INFO - __main__ - Step 63045: {'lr': 0.0003178907569391182, 'samples': 12104640, 'steps': 63044, 'loss/train': 1.4672752618789673} 11/07/2021 06:07:18 - INFO - __main__ - Step 63046: {'lr': 0.00031788564960343946, 'samples': 12104832, 'steps': 63045, 'loss/train': 0.892769992351532} 11/07/2021 06:07:18 - INFO - __main__ - Step 63047: {'lr': 0.0003178805422371725, 'samples': 12105024, 'steps': 63046, 'loss/train': 1.4738463163375854} 11/07/2021 06:07:18 - INFO - __main__ - Step 63048: {'lr': 0.0003178754348403197, 'samples': 12105216, 'steps': 63047, 'loss/train': 0.9283245801925659} 11/07/2021 06:07:19 - INFO - __main__ - Step 63049: {'lr': 0.00031787032741288315, 'samples': 12105408, 'steps': 63048, 'loss/train': 1.8145825862884521} 11/07/2021 06:07:19 - INFO - __main__ - Step 63050: {'lr': 0.0003178652199548653, 'samples': 12105600, 'steps': 63049, 'loss/train': 1.403896450996399} 11/07/2021 06:07:19 - INFO - __main__ - Step 63051: {'lr': 0.00031786011246626855, 'samples': 12105792, 'steps': 63050, 'loss/train': 1.868774652481079} 11/07/2021 06:07:20 - INFO - __main__ - Step 63052: {'lr': 0.000317855004947095, 'samples': 12105984, 'steps': 63051, 'loss/train': 1.8284270763397217} 11/07/2021 06:07:21 - INFO - __main__ - Step 63053: {'lr': 0.00031784989739734706, 'samples': 12106176, 'steps': 63052, 'loss/train': 1.7557520866394043} 11/07/2021 06:07:21 - INFO - __main__ - Step 63054: {'lr': 0.000317844789817027, 'samples': 12106368, 'steps': 63053, 'loss/train': 1.4404797554016113} 11/07/2021 06:07:21 - INFO - __main__ - Step 63055: {'lr': 0.0003178396822061371, 'samples': 12106560, 'steps': 63054, 'loss/train': 1.655177354812622} 11/07/2021 06:07:22 - INFO - __main__ - Step 63056: {'lr': 0.0003178345745646797, 'samples': 12106752, 'steps': 63055, 'loss/train': 1.6835720539093018} 11/07/2021 06:07:22 - INFO - __main__ - Step 63057: {'lr': 0.00031782946689265713, 'samples': 12106944, 'steps': 63056, 'loss/train': 1.422389268875122} 11/07/2021 06:07:23 - INFO - __main__ - Step 63058: {'lr': 0.0003178243591900716, 'samples': 12107136, 'steps': 63057, 'loss/train': 1.1540732383728027} 11/07/2021 06:07:23 - INFO - __main__ - Step 63059: {'lr': 0.0003178192514569255, 'samples': 12107328, 'steps': 63058, 'loss/train': 1.1486173868179321} 11/07/2021 06:07:24 - INFO - __main__ - Step 63060: {'lr': 0.000317814143693221, 'samples': 12107520, 'steps': 63059, 'loss/train': 1.2418451309204102} 11/07/2021 06:07:24 - INFO - __main__ - Step 63061: {'lr': 0.00031780903589896057, 'samples': 12107712, 'steps': 63060, 'loss/train': 1.3251705169677734} 11/07/2021 06:07:25 - INFO - __main__ - Step 63062: {'lr': 0.0003178039280741464, 'samples': 12107904, 'steps': 63061, 'loss/train': 1.4622974395751953} 11/07/2021 06:07:26 - INFO - __main__ - Step 63063: {'lr': 0.00031779882021878086, 'samples': 12108096, 'steps': 63062, 'loss/train': 0.7434900403022766} 11/07/2021 06:07:26 - INFO - __main__ - Step 63064: {'lr': 0.00031779371233286617, 'samples': 12108288, 'steps': 63063, 'loss/train': 1.6530544757843018} 11/07/2021 06:07:27 - INFO - __main__ - Step 63065: {'lr': 0.00031778860441640473, 'samples': 12108480, 'steps': 63064, 'loss/train': 1.7289760112762451} 11/07/2021 06:07:27 - INFO - __main__ - Step 63066: {'lr': 0.00031778349646939877, 'samples': 12108672, 'steps': 63065, 'loss/train': 1.5689198970794678} 11/07/2021 06:07:27 - INFO - __main__ - Step 63067: {'lr': 0.0003177783884918506, 'samples': 12108864, 'steps': 63066, 'loss/train': 1.17428719997406} 11/07/2021 06:07:28 - INFO - __main__ - Step 63068: {'lr': 0.0003177732804837626, 'samples': 12109056, 'steps': 63067, 'loss/train': 2.15928053855896} 11/07/2021 06:07:29 - INFO - __main__ - Step 63069: {'lr': 0.0003177681724451369, 'samples': 12109248, 'steps': 63068, 'loss/train': 1.0739952325820923} 11/07/2021 06:07:29 - INFO - __main__ - Step 63070: {'lr': 0.00031776306437597594, 'samples': 12109440, 'steps': 63069, 'loss/train': 1.8155490159988403} 11/07/2021 06:07:29 - INFO - __main__ - Step 63071: {'lr': 0.000317757956276282, 'samples': 12109632, 'steps': 63070, 'loss/train': 1.7201696634292603} 11/07/2021 06:07:30 - INFO - __main__ - Step 63072: {'lr': 0.00031775284814605743, 'samples': 12109824, 'steps': 63071, 'loss/train': 1.3833786249160767} 11/07/2021 06:07:30 - INFO - __main__ - Step 63073: {'lr': 0.00031774773998530443, 'samples': 12110016, 'steps': 63072, 'loss/train': 1.7025986909866333} 11/07/2021 06:07:31 - INFO - __main__ - Step 63074: {'lr': 0.00031774263179402533, 'samples': 12110208, 'steps': 63073, 'loss/train': 1.7014565467834473} 11/07/2021 06:07:31 - INFO - __main__ - Step 63075: {'lr': 0.0003177375235722225, 'samples': 12110400, 'steps': 63074, 'loss/train': 1.1378681659698486} 11/07/2021 06:07:32 - INFO - __main__ - Step 63076: {'lr': 0.00031773241531989803, 'samples': 12110592, 'steps': 63075, 'loss/train': 1.7004752159118652} 11/07/2021 06:07:32 - INFO - __main__ - Step 63077: {'lr': 0.00031772730703705454, 'samples': 12110784, 'steps': 63076, 'loss/train': 1.6542659997940063} 11/07/2021 06:07:32 - INFO - __main__ - Step 63078: {'lr': 0.0003177221987236941, 'samples': 12110976, 'steps': 63077, 'loss/train': 1.567537784576416} 11/07/2021 06:07:33 - INFO - __main__ - Step 63079: {'lr': 0.0003177170903798191, 'samples': 12111168, 'steps': 63078, 'loss/train': 1.71661376953125} 11/07/2021 06:07:34 - INFO - __main__ - Step 63080: {'lr': 0.0003177119820054318, 'samples': 12111360, 'steps': 63079, 'loss/train': 1.1212530136108398} 11/07/2021 06:07:34 - INFO - __main__ - Step 63081: {'lr': 0.0003177068736005346, 'samples': 12111552, 'steps': 63080, 'loss/train': 1.4297432899475098} 11/07/2021 06:07:35 - INFO - __main__ - Step 63082: {'lr': 0.00031770176516512965, 'samples': 12111744, 'steps': 63081, 'loss/train': 1.575315237045288} 11/07/2021 06:07:35 - INFO - __main__ - Step 63083: {'lr': 0.0003176966566992193, 'samples': 12111936, 'steps': 63082, 'loss/train': 1.0940203666687012} 11/07/2021 06:07:35 - INFO - __main__ - Step 63084: {'lr': 0.00031769154820280606, 'samples': 12112128, 'steps': 63083, 'loss/train': 1.6096394062042236} 11/07/2021 06:07:37 - INFO - __main__ - Step 63085: {'lr': 0.0003176864396758919, 'samples': 12112320, 'steps': 63084, 'loss/train': 1.157027006149292} 11/07/2021 06:07:38 - INFO - __main__ - Step 63086: {'lr': 0.0003176813311184793, 'samples': 12112512, 'steps': 63085, 'loss/train': 0.9442289471626282} 11/07/2021 06:07:38 - INFO - __main__ - Step 63087: {'lr': 0.0003176762225305705, 'samples': 12112704, 'steps': 63086, 'loss/train': 1.5392539501190186} 11/07/2021 06:07:38 - INFO - __main__ - Step 63088: {'lr': 0.0003176711139121679, 'samples': 12112896, 'steps': 63087, 'loss/train': 1.5293116569519043} 11/07/2021 06:07:39 - INFO - __main__ - Step 63089: {'lr': 0.00031766600526327373, 'samples': 12113088, 'steps': 63088, 'loss/train': 0.8766775727272034} 11/07/2021 06:07:39 - INFO - __main__ - Step 63090: {'lr': 0.00031766089658389024, 'samples': 12113280, 'steps': 63089, 'loss/train': 0.7651761174201965} 11/07/2021 06:07:39 - INFO - __main__ - Step 63091: {'lr': 0.00031765578787401995, 'samples': 12113472, 'steps': 63090, 'loss/train': 0.7930426597595215} 11/07/2021 06:07:40 - INFO - __main__ - Step 63092: {'lr': 0.00031765067913366483, 'samples': 12113664, 'steps': 63091, 'loss/train': 1.0052293539047241} 11/07/2021 06:07:41 - INFO - __main__ - Step 63093: {'lr': 0.0003176455703628274, 'samples': 12113856, 'steps': 63092, 'loss/train': 1.705410122871399} 11/07/2021 06:07:41 - INFO - __main__ - Step 63094: {'lr': 0.00031764046156151, 'samples': 12114048, 'steps': 63093, 'loss/train': 1.9972124099731445} 11/07/2021 06:07:41 - INFO - __main__ - Step 63095: {'lr': 0.00031763535272971477, 'samples': 12114240, 'steps': 63094, 'loss/train': 2.968980073928833} 11/07/2021 06:07:42 - INFO - __main__ - Step 63096: {'lr': 0.0003176302438674441, 'samples': 12114432, 'steps': 63095, 'loss/train': 1.6399117708206177} 11/07/2021 06:07:43 - INFO - __main__ - Step 63097: {'lr': 0.00031762513497470034, 'samples': 12114624, 'steps': 63096, 'loss/train': 1.3070074319839478} 11/07/2021 06:07:43 - INFO - __main__ - Step 63098: {'lr': 0.00031762002605148574, 'samples': 12114816, 'steps': 63097, 'loss/train': 1.6677085161209106} 11/07/2021 06:07:43 - INFO - __main__ - Step 63099: {'lr': 0.00031761491709780256, 'samples': 12115008, 'steps': 63098, 'loss/train': 1.583078145980835} 11/07/2021 06:07:44 - INFO - __main__ - Step 63100: {'lr': 0.00031760980811365314, 'samples': 12115200, 'steps': 63099, 'loss/train': 1.7419310808181763} 11/07/2021 06:07:44 - INFO - __main__ - Step 63101: {'lr': 0.00031760469909903976, 'samples': 12115392, 'steps': 63100, 'loss/train': 1.2330833673477173} 11/07/2021 06:07:45 - INFO - __main__ - Step 63102: {'lr': 0.0003175995900539648, 'samples': 12115584, 'steps': 63101, 'loss/train': 1.1224735975265503} 11/07/2021 06:07:46 - INFO - __main__ - Step 63103: {'lr': 0.00031759448097843046, 'samples': 12115776, 'steps': 63102, 'loss/train': 2.446258783340454} 11/07/2021 06:07:46 - INFO - __main__ - Step 63104: {'lr': 0.00031758937187243916, 'samples': 12115968, 'steps': 63103, 'loss/train': 1.5499900579452515} 11/07/2021 06:07:46 - INFO - __main__ - Step 63105: {'lr': 0.0003175842627359931, 'samples': 12116160, 'steps': 63104, 'loss/train': 1.1715272665023804} 11/07/2021 06:07:47 - INFO - __main__ - Step 63106: {'lr': 0.00031757915356909463, 'samples': 12116352, 'steps': 63105, 'loss/train': 1.559904932975769} 11/07/2021 06:07:47 - INFO - __main__ - Step 63107: {'lr': 0.00031757404437174596, 'samples': 12116544, 'steps': 63106, 'loss/train': 0.36213722825050354} 11/07/2021 06:07:48 - INFO - __main__ - Step 63108: {'lr': 0.00031756893514394953, 'samples': 12116736, 'steps': 63107, 'loss/train': 1.4017750024795532} 11/07/2021 06:07:48 - INFO - __main__ - Step 63109: {'lr': 0.0003175638258857075, 'samples': 12116928, 'steps': 63108, 'loss/train': 1.8122090101242065} 11/07/2021 06:07:49 - INFO - __main__ - Step 63110: {'lr': 0.00031755871659702235, 'samples': 12117120, 'steps': 63109, 'loss/train': 1.6397557258605957} 11/07/2021 06:07:49 - INFO - __main__ - Step 63111: {'lr': 0.0003175536072778963, 'samples': 12117312, 'steps': 63110, 'loss/train': 1.394418478012085} 11/07/2021 06:07:50 - INFO - __main__ - Step 63112: {'lr': 0.0003175484979283316, 'samples': 12117504, 'steps': 63111, 'loss/train': 1.6863934993743896} 11/07/2021 06:07:50 - INFO - __main__ - Step 63113: {'lr': 0.00031754338854833055, 'samples': 12117696, 'steps': 63112, 'loss/train': 1.449765920639038} 11/07/2021 06:07:51 - INFO - __main__ - Step 63114: {'lr': 0.0003175382791378955, 'samples': 12117888, 'steps': 63113, 'loss/train': 1.2403554916381836} 11/07/2021 06:07:51 - INFO - __main__ - Step 63115: {'lr': 0.0003175331696970288, 'samples': 12118080, 'steps': 63114, 'loss/train': 1.1292684078216553} 11/07/2021 06:07:52 - INFO - __main__ - Step 63116: {'lr': 0.0003175280602257327, 'samples': 12118272, 'steps': 63115, 'loss/train': 1.2015715837478638} 11/07/2021 06:07:52 - INFO - __main__ - Step 63117: {'lr': 0.0003175229507240094, 'samples': 12118464, 'steps': 63116, 'loss/train': 1.2396279573440552} 11/07/2021 06:07:52 - INFO - __main__ - Step 63118: {'lr': 0.0003175178411918614, 'samples': 12118656, 'steps': 63117, 'loss/train': 1.8457748889923096} 11/07/2021 06:07:53 - INFO - __main__ - Step 63119: {'lr': 0.00031751273162929083, 'samples': 12118848, 'steps': 63118, 'loss/train': 0.992904543876648} 11/07/2021 06:07:54 - INFO - __main__ - Step 63120: {'lr': 0.00031750762203630015, 'samples': 12119040, 'steps': 63119, 'loss/train': 1.3842920064926147} 11/07/2021 06:07:54 - INFO - __main__ - Step 63121: {'lr': 0.00031750251241289147, 'samples': 12119232, 'steps': 63120, 'loss/train': 1.3559246063232422} 11/07/2021 06:07:54 - INFO - __main__ - Step 63122: {'lr': 0.0003174974027590672, 'samples': 12119424, 'steps': 63121, 'loss/train': 1.526852011680603} 11/07/2021 06:07:55 - INFO - __main__ - Step 63123: {'lr': 0.00031749229307482976, 'samples': 12119616, 'steps': 63122, 'loss/train': 1.4535211324691772} 11/07/2021 06:07:56 - INFO - __main__ - Step 63124: {'lr': 0.00031748718336018124, 'samples': 12119808, 'steps': 63123, 'loss/train': 1.2073591947555542} 11/07/2021 06:07:56 - INFO - __main__ - Step 63125: {'lr': 0.00031748207361512415, 'samples': 12120000, 'steps': 63124, 'loss/train': 1.6813476085662842} 11/07/2021 06:07:56 - INFO - __main__ - Step 63126: {'lr': 0.00031747696383966056, 'samples': 12120192, 'steps': 63125, 'loss/train': 0.8896179795265198} 11/07/2021 06:07:57 - INFO - __main__ - Step 63127: {'lr': 0.0003174718540337929, 'samples': 12120384, 'steps': 63126, 'loss/train': 1.571856141090393} 11/07/2021 06:07:57 - INFO - __main__ - Step 63128: {'lr': 0.0003174667441975235, 'samples': 12120576, 'steps': 63127, 'loss/train': 1.5230246782302856} 11/07/2021 06:07:58 - INFO - __main__ - Step 63129: {'lr': 0.0003174616343308546, 'samples': 12120768, 'steps': 63128, 'loss/train': 1.5079621076583862} 11/07/2021 06:07:59 - INFO - __main__ - Step 63130: {'lr': 0.0003174565244337886, 'samples': 12120960, 'steps': 63129, 'loss/train': 1.49240243434906} 11/07/2021 06:07:59 - INFO - __main__ - Step 63131: {'lr': 0.0003174514145063277, 'samples': 12121152, 'steps': 63130, 'loss/train': 1.6270167827606201} 11/07/2021 06:07:59 - INFO - __main__ - Step 63132: {'lr': 0.00031744630454847415, 'samples': 12121344, 'steps': 63131, 'loss/train': 1.4224827289581299} 11/07/2021 06:08:00 - INFO - __main__ - Step 63133: {'lr': 0.0003174411945602304, 'samples': 12121536, 'steps': 63132, 'loss/train': 1.5181938409805298} 11/07/2021 06:08:01 - INFO - __main__ - Step 63134: {'lr': 0.00031743608454159864, 'samples': 12121728, 'steps': 63133, 'loss/train': 1.0754095315933228} 11/07/2021 06:08:01 - INFO - __main__ - Step 63135: {'lr': 0.0003174309744925813, 'samples': 12121920, 'steps': 63134, 'loss/train': 1.3103073835372925} 11/07/2021 06:08:01 - INFO - __main__ - Step 63136: {'lr': 0.00031742586441318055, 'samples': 12122112, 'steps': 63135, 'loss/train': 1.4320391416549683} 11/07/2021 06:08:02 - INFO - __main__ - Step 63137: {'lr': 0.0003174207543033988, 'samples': 12122304, 'steps': 63136, 'loss/train': 1.068374752998352} 11/07/2021 06:08:02 - INFO - __main__ - Step 63138: {'lr': 0.0003174156441632383, 'samples': 12122496, 'steps': 63137, 'loss/train': 1.6047292947769165} 11/07/2021 06:08:03 - INFO - __main__ - Step 63139: {'lr': 0.0003174105339927013, 'samples': 12122688, 'steps': 63138, 'loss/train': 2.612107038497925} 11/07/2021 06:08:04 - INFO - __main__ - Step 63140: {'lr': 0.00031740542379179017, 'samples': 12122880, 'steps': 63139, 'loss/train': 2.2865381240844727} 11/07/2021 06:08:04 - INFO - __main__ - Step 63141: {'lr': 0.00031740031356050717, 'samples': 12123072, 'steps': 63140, 'loss/train': 1.4802366495132446} 11/07/2021 06:08:04 - INFO - __main__ - Step 63142: {'lr': 0.00031739520329885463, 'samples': 12123264, 'steps': 63141, 'loss/train': 1.2248705625534058} 11/07/2021 06:08:05 - INFO - __main__ - Step 63143: {'lr': 0.00031739009300683484, 'samples': 12123456, 'steps': 63142, 'loss/train': 1.651851773262024} 11/07/2021 06:08:06 - INFO - __main__ - Step 63144: {'lr': 0.00031738498268445023, 'samples': 12123648, 'steps': 63143, 'loss/train': 1.3091552257537842} 11/07/2021 06:08:06 - INFO - __main__ - Step 63145: {'lr': 0.0003173798723317029, 'samples': 12123840, 'steps': 63144, 'loss/train': 1.7981802225112915} 11/07/2021 06:08:06 - INFO - __main__ - Step 63146: {'lr': 0.00031737476194859524, 'samples': 12124032, 'steps': 63145, 'loss/train': 1.2422637939453125} 11/07/2021 06:08:07 - INFO - __main__ - Step 63147: {'lr': 0.0003173696515351295, 'samples': 12124224, 'steps': 63146, 'loss/train': 1.7273184061050415} 11/07/2021 06:08:07 - INFO - __main__ - Step 63148: {'lr': 0.00031736454109130815, 'samples': 12124416, 'steps': 63147, 'loss/train': 2.44171404838562} 11/07/2021 06:08:07 - INFO - __main__ - Step 63149: {'lr': 0.0003173594306171333, 'samples': 12124608, 'steps': 63148, 'loss/train': 1.4580343961715698} 11/07/2021 06:08:08 - INFO - __main__ - Step 63150: {'lr': 0.0003173543201126073, 'samples': 12124800, 'steps': 63149, 'loss/train': 1.6344165802001953} 11/07/2021 06:08:09 - INFO - __main__ - Step 63151: {'lr': 0.0003173492095777326, 'samples': 12124992, 'steps': 63150, 'loss/train': 1.3508014678955078} 11/07/2021 06:08:09 - INFO - __main__ - Step 63152: {'lr': 0.0003173440990125113, 'samples': 12125184, 'steps': 63151, 'loss/train': 1.3953615427017212} 11/07/2021 06:08:10 - INFO - __main__ - Step 63153: {'lr': 0.0003173389884169458, 'samples': 12125376, 'steps': 63152, 'loss/train': 1.5016578435897827} 11/07/2021 06:08:10 - INFO - __main__ - Step 63154: {'lr': 0.0003173338777910384, 'samples': 12125568, 'steps': 63153, 'loss/train': 1.6186538934707642} 11/07/2021 06:08:11 - INFO - __main__ - Step 63155: {'lr': 0.0003173287671347914, 'samples': 12125760, 'steps': 63154, 'loss/train': 1.4554203748703003} 11/07/2021 06:08:11 - INFO - __main__ - Step 63156: {'lr': 0.00031732365644820704, 'samples': 12125952, 'steps': 63155, 'loss/train': 1.635951042175293} 11/07/2021 06:08:12 - INFO - __main__ - Step 63157: {'lr': 0.0003173185457312877, 'samples': 12126144, 'steps': 63156, 'loss/train': 1.2848457098007202} 11/07/2021 06:08:12 - INFO - __main__ - Step 63158: {'lr': 0.00031731343498403577, 'samples': 12126336, 'steps': 63157, 'loss/train': 1.4302533864974976} 11/07/2021 06:08:12 - INFO - __main__ - Step 63159: {'lr': 0.0003173083242064534, 'samples': 12126528, 'steps': 63158, 'loss/train': 1.5026944875717163} 11/07/2021 06:08:13 - INFO - __main__ - Step 63160: {'lr': 0.0003173032133985428, 'samples': 12126720, 'steps': 63159, 'loss/train': 1.4908734560012817} 11/07/2021 06:08:14 - INFO - __main__ - Step 63161: {'lr': 0.00031729810256030653, 'samples': 12126912, 'steps': 63160, 'loss/train': 1.5590217113494873} 11/07/2021 06:08:14 - INFO - __main__ - Step 63162: {'lr': 0.00031729299169174673, 'samples': 12127104, 'steps': 63161, 'loss/train': 1.1842931509017944} 11/07/2021 06:08:14 - INFO - __main__ - Step 63163: {'lr': 0.0003172878807928658, 'samples': 12127296, 'steps': 63162, 'loss/train': 1.4058233499526978} 11/07/2021 06:08:15 - INFO - __main__ - Step 63164: {'lr': 0.00031728276986366593, 'samples': 12127488, 'steps': 63163, 'loss/train': 1.0332566499710083} 11/07/2021 06:08:16 - INFO - __main__ - Step 63165: {'lr': 0.0003172776589041496, 'samples': 12127680, 'steps': 63164, 'loss/train': 1.659403920173645} 11/07/2021 06:08:16 - INFO - __main__ - Step 63166: {'lr': 0.00031727254791431885, 'samples': 12127872, 'steps': 63165, 'loss/train': 1.5310786962509155} 11/07/2021 06:08:17 - INFO - __main__ - Step 63167: {'lr': 0.0003172674368941762, 'samples': 12128064, 'steps': 63166, 'loss/train': 1.687441349029541} 11/07/2021 06:08:17 - INFO - __main__ - Step 63168: {'lr': 0.0003172623258437238, 'samples': 12128256, 'steps': 63167, 'loss/train': 0.872977614402771} 11/07/2021 06:08:17 - INFO - __main__ - Step 63169: {'lr': 0.00031725721476296413, 'samples': 12128448, 'steps': 63168, 'loss/train': 1.6834124326705933} 11/07/2021 06:08:18 - INFO - __main__ - Step 63170: {'lr': 0.00031725210365189936, 'samples': 12128640, 'steps': 63169, 'loss/train': 1.6399822235107422} 11/07/2021 06:08:19 - INFO - __main__ - Step 63171: {'lr': 0.00031724699251053185, 'samples': 12128832, 'steps': 63170, 'loss/train': 1.3494538068771362} 11/07/2021 06:08:19 - INFO - __main__ - Step 63172: {'lr': 0.0003172418813388639, 'samples': 12129024, 'steps': 63171, 'loss/train': 1.0503318309783936} 11/07/2021 06:08:19 - INFO - __main__ - Step 63173: {'lr': 0.00031723677013689776, 'samples': 12129216, 'steps': 63172, 'loss/train': 0.9210957288742065} 11/07/2021 06:08:20 - INFO - __main__ - Step 63174: {'lr': 0.0003172316589046358, 'samples': 12129408, 'steps': 63173, 'loss/train': 1.1490209102630615} 11/07/2021 06:08:20 - INFO - __main__ - Step 63175: {'lr': 0.00031722654764208027, 'samples': 12129600, 'steps': 63174, 'loss/train': 1.484758734703064} 11/07/2021 06:08:21 - INFO - __main__ - Step 63176: {'lr': 0.00031722143634923346, 'samples': 12129792, 'steps': 63175, 'loss/train': 1.1538735628128052} 11/07/2021 06:08:21 - INFO - __main__ - Step 63177: {'lr': 0.0003172163250260977, 'samples': 12129984, 'steps': 63176, 'loss/train': 1.843886137008667} 11/07/2021 06:08:22 - INFO - __main__ - Step 63178: {'lr': 0.00031721121367267533, 'samples': 12130176, 'steps': 63177, 'loss/train': 1.8387632369995117} 11/07/2021 06:08:22 - INFO - __main__ - Step 63179: {'lr': 0.0003172061022889687, 'samples': 12130368, 'steps': 63178, 'loss/train': 1.4776829481124878} 11/07/2021 06:08:23 - INFO - __main__ - Step 63180: {'lr': 0.00031720099087497995, 'samples': 12130560, 'steps': 63179, 'loss/train': 1.5016491413116455} 11/07/2021 06:08:23 - INFO - __main__ - Step 63181: {'lr': 0.0003171958794307115, 'samples': 12130752, 'steps': 63180, 'loss/train': 1.3737914562225342} 11/07/2021 06:08:24 - INFO - __main__ - Step 63182: {'lr': 0.00031719076795616564, 'samples': 12130944, 'steps': 63181, 'loss/train': 1.4239604473114014} 11/07/2021 06:08:24 - INFO - __main__ - Step 63183: {'lr': 0.00031718565645134456, 'samples': 12131136, 'steps': 63182, 'loss/train': 1.5110260248184204} 11/07/2021 06:08:25 - INFO - __main__ - Step 63184: {'lr': 0.00031718054491625076, 'samples': 12131328, 'steps': 63183, 'loss/train': 1.4322487115859985} 11/07/2021 06:08:25 - INFO - __main__ - Step 63185: {'lr': 0.0003171754333508864, 'samples': 12131520, 'steps': 63184, 'loss/train': 1.7661365270614624} 11/07/2021 06:08:25 - INFO - __main__ - Step 63186: {'lr': 0.0003171703217552539, 'samples': 12131712, 'steps': 63185, 'loss/train': 1.3991198539733887} 11/07/2021 06:08:26 - INFO - __main__ - Step 63187: {'lr': 0.0003171652101293554, 'samples': 12131904, 'steps': 63186, 'loss/train': 1.3023326396942139} 11/07/2021 06:08:27 - INFO - __main__ - Step 63188: {'lr': 0.00031716009847319334, 'samples': 12132096, 'steps': 63187, 'loss/train': 1.1005672216415405} 11/07/2021 06:08:27 - INFO - __main__ - Step 63189: {'lr': 0.0003171549867867699, 'samples': 12132288, 'steps': 63188, 'loss/train': 1.9319616556167603} 11/07/2021 06:08:27 - INFO - __main__ - Step 63190: {'lr': 0.00031714987507008754, 'samples': 12132480, 'steps': 63189, 'loss/train': 1.6513354778289795} 11/07/2021 06:08:28 - INFO - __main__ - Step 63191: {'lr': 0.0003171447633231485, 'samples': 12132672, 'steps': 63190, 'loss/train': 1.4909493923187256} 11/07/2021 06:08:29 - INFO - __main__ - Step 63192: {'lr': 0.000317139651545955, 'samples': 12132864, 'steps': 63191, 'loss/train': 0.798714280128479} 11/07/2021 06:08:29 - INFO - __main__ - Step 63193: {'lr': 0.0003171345397385095, 'samples': 12133056, 'steps': 63192, 'loss/train': 1.4433954954147339} 11/07/2021 06:08:29 - INFO - __main__ - Step 63194: {'lr': 0.0003171294279008141, 'samples': 12133248, 'steps': 63193, 'loss/train': 1.5596977472305298} 11/07/2021 06:08:30 - INFO - __main__ - Step 63195: {'lr': 0.00031712431603287127, 'samples': 12133440, 'steps': 63194, 'loss/train': 1.7019500732421875} 11/07/2021 06:08:30 - INFO - __main__ - Step 63196: {'lr': 0.0003171192041346833, 'samples': 12133632, 'steps': 63195, 'loss/train': 1.5884125232696533} 11/07/2021 06:08:31 - INFO - __main__ - Step 63197: {'lr': 0.00031711409220625236, 'samples': 12133824, 'steps': 63196, 'loss/train': 1.4113277196884155} 11/07/2021 06:08:32 - INFO - __main__ - Step 63198: {'lr': 0.0003171089802475809, 'samples': 12134016, 'steps': 63197, 'loss/train': 1.410969853401184} 11/07/2021 06:08:32 - INFO - __main__ - Step 63199: {'lr': 0.0003171038682586712, 'samples': 12134208, 'steps': 63198, 'loss/train': 0.14110176265239716} 11/07/2021 06:08:33 - INFO - __main__ - Step 63200: {'lr': 0.00031709875623952546, 'samples': 12134400, 'steps': 63199, 'loss/train': 1.7233704328536987} 11/07/2021 06:08:33 - INFO - __main__ - Step 63201: {'lr': 0.0003170936441901461, 'samples': 12134592, 'steps': 63200, 'loss/train': 1.2098278999328613} 11/07/2021 06:08:33 - INFO - __main__ - Step 63202: {'lr': 0.0003170885321105354, 'samples': 12134784, 'steps': 63201, 'loss/train': 1.351232886314392} 11/07/2021 06:08:34 - INFO - __main__ - Step 63203: {'lr': 0.0003170834200006956, 'samples': 12134976, 'steps': 63202, 'loss/train': 1.2788013219833374} 11/07/2021 06:08:35 - INFO - __main__ - Step 63204: {'lr': 0.000317078307860629, 'samples': 12135168, 'steps': 63203, 'loss/train': 1.4612175226211548} 11/07/2021 06:08:35 - INFO - __main__ - Step 63205: {'lr': 0.00031707319569033803, 'samples': 12135360, 'steps': 63204, 'loss/train': 1.2600510120391846} 11/07/2021 06:08:35 - INFO - __main__ - Step 63206: {'lr': 0.00031706808348982486, 'samples': 12135552, 'steps': 63205, 'loss/train': 1.3870478868484497} 11/07/2021 06:08:36 - INFO - __main__ - Step 63207: {'lr': 0.00031706297125909193, 'samples': 12135744, 'steps': 63206, 'loss/train': 1.0089713335037231} 11/07/2021 06:08:37 - INFO - __main__ - Step 63208: {'lr': 0.0003170578589981414, 'samples': 12135936, 'steps': 63207, 'loss/train': 1.5225330591201782} 11/07/2021 06:08:37 - INFO - __main__ - Step 63209: {'lr': 0.00031705274670697567, 'samples': 12136128, 'steps': 63208, 'loss/train': 1.7948685884475708} 11/07/2021 06:08:38 - INFO - __main__ - Step 63210: {'lr': 0.00031704763438559694, 'samples': 12136320, 'steps': 63209, 'loss/train': 1.6601728200912476} 11/07/2021 06:08:38 - INFO - __main__ - Step 63211: {'lr': 0.0003170425220340076, 'samples': 12136512, 'steps': 63210, 'loss/train': 1.3167130947113037} 11/07/2021 06:08:38 - INFO - __main__ - Step 63212: {'lr': 0.00031703740965221, 'samples': 12136704, 'steps': 63211, 'loss/train': 1.6505823135375977} 11/07/2021 06:08:39 - INFO - __main__ - Step 63213: {'lr': 0.0003170322972402063, 'samples': 12136896, 'steps': 63212, 'loss/train': 1.2266247272491455} 11/07/2021 06:08:40 - INFO - __main__ - Step 63214: {'lr': 0.0003170271847979989, 'samples': 12137088, 'steps': 63213, 'loss/train': 1.7205129861831665} 11/07/2021 06:08:40 - INFO - __main__ - Step 63215: {'lr': 0.0003170220723255901, 'samples': 12137280, 'steps': 63214, 'loss/train': 1.6366275548934937} 11/07/2021 06:08:40 - INFO - __main__ - Step 63216: {'lr': 0.00031701695982298215, 'samples': 12137472, 'steps': 63215, 'loss/train': 1.372025728225708} 11/07/2021 06:08:41 - INFO - __main__ - Step 63217: {'lr': 0.00031701184729017744, 'samples': 12137664, 'steps': 63216, 'loss/train': 1.2927111387252808} 11/07/2021 06:08:41 - INFO - __main__ - Step 63218: {'lr': 0.00031700673472717823, 'samples': 12137856, 'steps': 63217, 'loss/train': 0.4540724754333496} 11/07/2021 06:08:42 - INFO - __main__ - Step 63219: {'lr': 0.0003170016221339869, 'samples': 12138048, 'steps': 63218, 'loss/train': 1.464347243309021} 11/07/2021 06:08:42 - INFO - __main__ - Step 63220: {'lr': 0.00031699650951060547, 'samples': 12138240, 'steps': 63219, 'loss/train': 1.1028984785079956} 11/07/2021 06:08:43 - INFO - __main__ - Step 63221: {'lr': 0.0003169913968570366, 'samples': 12138432, 'steps': 63220, 'loss/train': 1.7722779512405396} 11/07/2021 06:08:43 - INFO - __main__ - Step 63222: {'lr': 0.00031698628417328235, 'samples': 12138624, 'steps': 63221, 'loss/train': 1.610006332397461} 11/07/2021 06:08:43 - INFO - __main__ - Step 63223: {'lr': 0.00031698117145934513, 'samples': 12138816, 'steps': 63222, 'loss/train': 1.2499324083328247} 11/07/2021 06:08:45 - INFO - __main__ - Step 63224: {'lr': 0.0003169760587152273, 'samples': 12139008, 'steps': 63223, 'loss/train': 1.3780732154846191} 11/07/2021 06:08:45 - INFO - __main__ - Step 63225: {'lr': 0.000316970945940931, 'samples': 12139200, 'steps': 63224, 'loss/train': 1.7504724264144897} 11/07/2021 06:08:45 - INFO - __main__ - Step 63226: {'lr': 0.0003169658331364587, 'samples': 12139392, 'steps': 63225, 'loss/train': 1.4076112508773804} 11/07/2021 06:08:46 - INFO - __main__ - Step 63227: {'lr': 0.00031696072030181264, 'samples': 12139584, 'steps': 63226, 'loss/train': 1.7026584148406982} 11/07/2021 06:08:46 - INFO - __main__ - Step 63228: {'lr': 0.000316955607436995, 'samples': 12139776, 'steps': 63227, 'loss/train': 2.0076682567596436} 11/07/2021 06:08:47 - INFO - __main__ - Step 63229: {'lr': 0.0003169504945420083, 'samples': 12139968, 'steps': 63228, 'loss/train': 2.9392809867858887} 11/07/2021 06:08:47 - INFO - __main__ - Step 63230: {'lr': 0.0003169453816168547, 'samples': 12140160, 'steps': 63229, 'loss/train': 1.4516372680664062} 11/07/2021 06:08:48 - INFO - __main__ - Step 63231: {'lr': 0.0003169402686615365, 'samples': 12140352, 'steps': 63230, 'loss/train': 1.242875576019287} 11/07/2021 06:08:48 - INFO - __main__ - Step 63232: {'lr': 0.0003169351556760562, 'samples': 12140544, 'steps': 63231, 'loss/train': 1.4050648212432861} 11/07/2021 06:08:48 - INFO - __main__ - Step 63233: {'lr': 0.0003169300426604158, 'samples': 12140736, 'steps': 63232, 'loss/train': 1.0349814891815186} 11/07/2021 06:08:49 - INFO - __main__ - Step 63234: {'lr': 0.0003169249296146178, 'samples': 12140928, 'steps': 63233, 'loss/train': 0.9917479753494263} 11/07/2021 06:08:50 - INFO - __main__ - Step 63235: {'lr': 0.00031691981653866446, 'samples': 12141120, 'steps': 63234, 'loss/train': 1.5265969038009644} 11/07/2021 06:08:50 - INFO - __main__ - Step 63236: {'lr': 0.00031691470343255814, 'samples': 12141312, 'steps': 63235, 'loss/train': 1.4351654052734375} 11/07/2021 06:08:50 - INFO - __main__ - Step 63237: {'lr': 0.000316909590296301, 'samples': 12141504, 'steps': 63236, 'loss/train': 1.304352879524231} 11/07/2021 06:08:51 - INFO - __main__ - Step 63238: {'lr': 0.00031690447712989545, 'samples': 12141696, 'steps': 63237, 'loss/train': 1.4280191659927368} 11/07/2021 06:08:52 - INFO - __main__ - Step 63239: {'lr': 0.00031689936393334385, 'samples': 12141888, 'steps': 63238, 'loss/train': 1.455592393875122} 11/07/2021 06:08:52 - INFO - __main__ - Step 63240: {'lr': 0.00031689425070664833, 'samples': 12142080, 'steps': 63239, 'loss/train': 1.6116493940353394} 11/07/2021 06:08:53 - INFO - __main__ - Step 63241: {'lr': 0.00031688913744981135, 'samples': 12142272, 'steps': 63240, 'loss/train': 1.2503787279129028} 11/07/2021 06:08:53 - INFO - __main__ - Step 63242: {'lr': 0.0003168840241628351, 'samples': 12142464, 'steps': 63241, 'loss/train': 1.5166751146316528} 11/07/2021 06:08:53 - INFO - __main__ - Step 63243: {'lr': 0.000316878910845722, 'samples': 12142656, 'steps': 63242, 'loss/train': 1.546694278717041} 11/07/2021 06:08:54 - INFO - __main__ - Step 63244: {'lr': 0.0003168737974984743, 'samples': 12142848, 'steps': 63243, 'loss/train': 1.3764374256134033} 11/07/2021 06:08:55 - INFO - __main__ - Step 63245: {'lr': 0.0003168686841210943, 'samples': 12143040, 'steps': 63244, 'loss/train': 1.0762114524841309} 11/07/2021 06:08:55 - INFO - __main__ - Step 63246: {'lr': 0.0003168635707135842, 'samples': 12143232, 'steps': 63245, 'loss/train': 1.2528473138809204} 11/07/2021 06:08:55 - INFO - __main__ - Step 63247: {'lr': 0.00031685845727594654, 'samples': 12143424, 'steps': 63246, 'loss/train': 1.5424680709838867} 11/07/2021 06:08:56 - INFO - __main__ - Step 63248: {'lr': 0.00031685334380818344, 'samples': 12143616, 'steps': 63247, 'loss/train': 1.2899528741836548} 11/07/2021 06:08:56 - INFO - __main__ - Step 63249: {'lr': 0.0003168482303102972, 'samples': 12143808, 'steps': 63248, 'loss/train': 1.5462632179260254} 11/07/2021 06:08:57 - INFO - __main__ - Step 63250: {'lr': 0.0003168431167822903, 'samples': 12144000, 'steps': 63249, 'loss/train': 1.3214713335037231} 11/07/2021 06:08:58 - INFO - __main__ - Step 63251: {'lr': 0.0003168380032241648, 'samples': 12144192, 'steps': 63250, 'loss/train': 1.2005935907363892} 11/07/2021 06:08:58 - INFO - __main__ - Step 63252: {'lr': 0.0003168328896359232, 'samples': 12144384, 'steps': 63251, 'loss/train': 1.0492750406265259} 11/07/2021 06:08:58 - INFO - __main__ - Step 63253: {'lr': 0.00031682777601756774, 'samples': 12144576, 'steps': 63252, 'loss/train': 1.5833330154418945} 11/07/2021 06:08:59 - INFO - __main__ - Step 63254: {'lr': 0.0003168226623691006, 'samples': 12144768, 'steps': 63253, 'loss/train': 1.8120852708816528} 11/07/2021 06:08:59 - INFO - __main__ - Step 63255: {'lr': 0.00031681754869052433, 'samples': 12144960, 'steps': 63254, 'loss/train': 1.8042423725128174} 11/07/2021 06:09:00 - INFO - __main__ - Step 63256: {'lr': 0.00031681243498184105, 'samples': 12145152, 'steps': 63255, 'loss/train': 0.7937582731246948} 11/07/2021 06:09:00 - INFO - __main__ - Step 63257: {'lr': 0.0003168073212430531, 'samples': 12145344, 'steps': 63256, 'loss/train': 1.811558485031128} 11/07/2021 06:09:01 - INFO - __main__ - Step 63258: {'lr': 0.00031680220747416283, 'samples': 12145536, 'steps': 63257, 'loss/train': 0.9601514339447021} 11/07/2021 06:09:01 - INFO - __main__ - Step 63259: {'lr': 0.00031679709367517255, 'samples': 12145728, 'steps': 63258, 'loss/train': 1.4245036840438843} 11/07/2021 06:09:01 - INFO - __main__ - Step 63260: {'lr': 0.0003167919798460845, 'samples': 12145920, 'steps': 63259, 'loss/train': 1.2403985261917114} 11/07/2021 06:09:02 - INFO - __main__ - Step 63261: {'lr': 0.000316786865986901, 'samples': 12146112, 'steps': 63260, 'loss/train': 1.308061122894287} 11/07/2021 06:09:03 - INFO - __main__ - Step 63262: {'lr': 0.0003167817520976244, 'samples': 12146304, 'steps': 63261, 'loss/train': 1.34512197971344} 11/07/2021 06:09:03 - INFO - __main__ - Step 63263: {'lr': 0.00031677663817825693, 'samples': 12146496, 'steps': 63262, 'loss/train': 1.575966477394104} 11/07/2021 06:09:03 - INFO - __main__ - Step 63264: {'lr': 0.000316771524228801, 'samples': 12146688, 'steps': 63263, 'loss/train': 1.619935154914856} 11/07/2021 06:09:04 - INFO - __main__ - Step 63265: {'lr': 0.00031676641024925873, 'samples': 12146880, 'steps': 63264, 'loss/train': 1.2140400409698486} 11/07/2021 06:09:05 - INFO - __main__ - Step 63266: {'lr': 0.0003167612962396327, 'samples': 12147072, 'steps': 63265, 'loss/train': 1.2032262086868286} 11/07/2021 06:09:05 - INFO - __main__ - Step 63267: {'lr': 0.000316756182199925, 'samples': 12147264, 'steps': 63266, 'loss/train': 1.9103201627731323} 11/07/2021 06:09:06 - INFO - __main__ - Step 63268: {'lr': 0.0003167510681301379, 'samples': 12147456, 'steps': 63267, 'loss/train': 1.4679080247879028} 11/07/2021 06:09:06 - INFO - __main__ - Step 63269: {'lr': 0.0003167459540302739, 'samples': 12147648, 'steps': 63268, 'loss/train': 1.012247085571289} 11/07/2021 06:09:06 - INFO - __main__ - Step 63270: {'lr': 0.0003167408399003352, 'samples': 12147840, 'steps': 63269, 'loss/train': 1.6189519166946411} 11/07/2021 06:09:07 - INFO - __main__ - Step 63271: {'lr': 0.0003167357257403241, 'samples': 12148032, 'steps': 63270, 'loss/train': 1.5429028272628784} 11/07/2021 06:09:08 - INFO - __main__ - Step 63272: {'lr': 0.00031673061155024283, 'samples': 12148224, 'steps': 63271, 'loss/train': 1.3142999410629272} 11/07/2021 06:09:08 - INFO - __main__ - Step 63273: {'lr': 0.00031672549733009395, 'samples': 12148416, 'steps': 63272, 'loss/train': 1.852387547492981} 11/07/2021 06:09:09 - INFO - __main__ - Step 63274: {'lr': 0.00031672038307987944, 'samples': 12148608, 'steps': 63273, 'loss/train': 1.533747673034668} 11/07/2021 06:09:09 - INFO - __main__ - Step 63275: {'lr': 0.00031671526879960185, 'samples': 12148800, 'steps': 63274, 'loss/train': 1.4117803573608398} 11/07/2021 06:09:09 - INFO - __main__ - Step 63276: {'lr': 0.00031671015448926334, 'samples': 12148992, 'steps': 63275, 'loss/train': 0.8289645910263062} 11/07/2021 06:09:11 - INFO - __main__ - Step 63277: {'lr': 0.0003167050401488662, 'samples': 12149184, 'steps': 63276, 'loss/train': 0.46141818165779114} 11/07/2021 06:09:11 - INFO - __main__ - Step 63278: {'lr': 0.0003166999257784129, 'samples': 12149376, 'steps': 63277, 'loss/train': 1.4786579608917236} 11/07/2021 06:09:11 - INFO - __main__ - Step 63279: {'lr': 0.00031669481137790563, 'samples': 12149568, 'steps': 63278, 'loss/train': 1.7042946815490723} 11/07/2021 06:09:12 - INFO - __main__ - Step 63280: {'lr': 0.00031668969694734667, 'samples': 12149760, 'steps': 63279, 'loss/train': 1.505806565284729} 11/07/2021 06:09:12 - INFO - __main__ - Step 63281: {'lr': 0.0003166845824867384, 'samples': 12149952, 'steps': 63280, 'loss/train': 1.3867721557617188} 11/07/2021 06:09:13 - INFO - __main__ - Step 63282: {'lr': 0.00031667946799608307, 'samples': 12150144, 'steps': 63281, 'loss/train': 1.6306571960449219} 11/07/2021 06:09:13 - INFO - __main__ - Step 63283: {'lr': 0.00031667435347538294, 'samples': 12150336, 'steps': 63282, 'loss/train': 1.6030339002609253} 11/07/2021 06:09:14 - INFO - __main__ - Step 63284: {'lr': 0.0003166692389246404, 'samples': 12150528, 'steps': 63283, 'loss/train': 1.722259283065796} 11/07/2021 06:09:14 - INFO - __main__ - Step 63285: {'lr': 0.0003166641243438578, 'samples': 12150720, 'steps': 63284, 'loss/train': 1.50814950466156} 11/07/2021 06:09:14 - INFO - __main__ - Step 63286: {'lr': 0.00031665900973303735, 'samples': 12150912, 'steps': 63285, 'loss/train': 1.201569676399231} 11/07/2021 06:09:15 - INFO - __main__ - Step 63287: {'lr': 0.00031665389509218133, 'samples': 12151104, 'steps': 63286, 'loss/train': 0.9050660729408264} 11/07/2021 06:09:16 - INFO - __main__ - Step 63288: {'lr': 0.00031664878042129215, 'samples': 12151296, 'steps': 63287, 'loss/train': 1.395282506942749} 11/07/2021 06:09:16 - INFO - __main__ - Step 63289: {'lr': 0.00031664366572037203, 'samples': 12151488, 'steps': 63288, 'loss/train': 1.6593084335327148} 11/07/2021 06:09:16 - INFO - __main__ - Step 63290: {'lr': 0.0003166385509894233, 'samples': 12151680, 'steps': 63289, 'loss/train': 1.6567044258117676} 11/07/2021 06:09:17 - INFO - __main__ - Step 63291: {'lr': 0.00031663343622844825, 'samples': 12151872, 'steps': 63290, 'loss/train': 1.3964787721633911} 11/07/2021 06:09:17 - INFO - __main__ - Step 63292: {'lr': 0.00031662832143744925, 'samples': 12152064, 'steps': 63291, 'loss/train': 1.378088355064392} 11/07/2021 06:09:18 - INFO - __main__ - Step 63293: {'lr': 0.00031662320661642854, 'samples': 12152256, 'steps': 63292, 'loss/train': 1.4356642961502075} 11/07/2021 06:09:18 - INFO - __main__ - Step 63294: {'lr': 0.00031661809176538843, 'samples': 12152448, 'steps': 63293, 'loss/train': 1.7050373554229736} 11/07/2021 06:09:19 - INFO - __main__ - Step 63295: {'lr': 0.0003166129768843312, 'samples': 12152640, 'steps': 63294, 'loss/train': 1.4932609796524048} 11/07/2021 06:09:19 - INFO - __main__ - Step 63296: {'lr': 0.00031660786197325926, 'samples': 12152832, 'steps': 63295, 'loss/train': 1.703467845916748} 11/07/2021 06:09:20 - INFO - __main__ - Step 63297: {'lr': 0.0003166027470321748, 'samples': 12153024, 'steps': 63296, 'loss/train': 1.3965703248977661} 11/07/2021 06:09:21 - INFO - __main__ - Step 63298: {'lr': 0.0003165976320610802, 'samples': 12153216, 'steps': 63297, 'loss/train': 1.6870492696762085} 11/07/2021 06:09:21 - INFO - __main__ - Step 63299: {'lr': 0.00031659251705997766, 'samples': 12153408, 'steps': 63298, 'loss/train': 0.9811890721321106} 11/07/2021 06:09:21 - INFO - __main__ - Step 63300: {'lr': 0.0003165874020288697, 'samples': 12153600, 'steps': 63299, 'loss/train': 0.21358953416347504} 11/07/2021 06:09:22 - INFO - __main__ - Step 63301: {'lr': 0.00031658228696775835, 'samples': 12153792, 'steps': 63300, 'loss/train': 1.496970295906067} 11/07/2021 06:09:22 - INFO - __main__ - Step 63302: {'lr': 0.0003165771718766461, 'samples': 12153984, 'steps': 63301, 'loss/train': 0.9168739914894104} 11/07/2021 06:09:23 - INFO - __main__ - Step 63303: {'lr': 0.0003165720567555352, 'samples': 12154176, 'steps': 63302, 'loss/train': 1.8247389793395996} 11/07/2021 06:09:23 - INFO - __main__ - Step 63304: {'lr': 0.00031656694160442795, 'samples': 12154368, 'steps': 63303, 'loss/train': 1.6279079914093018} 11/07/2021 06:09:24 - INFO - __main__ - Step 63305: {'lr': 0.00031656182642332667, 'samples': 12154560, 'steps': 63304, 'loss/train': 1.4412732124328613} 11/07/2021 06:09:24 - INFO - __main__ - Step 63306: {'lr': 0.0003165567112122337, 'samples': 12154752, 'steps': 63305, 'loss/train': 1.4022495746612549} 11/07/2021 06:09:25 - INFO - __main__ - Step 63307: {'lr': 0.0003165515959711513, 'samples': 12154944, 'steps': 63306, 'loss/train': 1.4918062686920166} 11/07/2021 06:09:26 - INFO - __main__ - Step 63308: {'lr': 0.00031654648070008175, 'samples': 12155136, 'steps': 63307, 'loss/train': 1.6337021589279175} 11/07/2021 06:09:26 - INFO - __main__ - Step 63309: {'lr': 0.0003165413653990273, 'samples': 12155328, 'steps': 63308, 'loss/train': 1.1688199043273926} 11/07/2021 06:09:26 - INFO - __main__ - Step 63310: {'lr': 0.0003165362500679905, 'samples': 12155520, 'steps': 63309, 'loss/train': 1.404693841934204} 11/07/2021 06:09:27 - INFO - __main__ - Step 63311: {'lr': 0.0003165311347069734, 'samples': 12155712, 'steps': 63310, 'loss/train': 1.891250729560852} 11/07/2021 06:09:27 - INFO - __main__ - Step 63312: {'lr': 0.00031652601931597837, 'samples': 12155904, 'steps': 63311, 'loss/train': 1.3162856101989746} 11/07/2021 06:09:28 - INFO - __main__ - Step 63313: {'lr': 0.00031652090389500776, 'samples': 12156096, 'steps': 63312, 'loss/train': 1.6793127059936523} 11/07/2021 06:09:28 - INFO - __main__ - Step 63314: {'lr': 0.0003165157884440639, 'samples': 12156288, 'steps': 63313, 'loss/train': 1.217315912246704} 11/07/2021 06:09:29 - INFO - __main__ - Step 63315: {'lr': 0.000316510672963149, 'samples': 12156480, 'steps': 63314, 'loss/train': 1.3735637664794922} 11/07/2021 06:09:29 - INFO - __main__ - Step 63316: {'lr': 0.00031650555745226547, 'samples': 12156672, 'steps': 63315, 'loss/train': 1.190039873123169} 11/07/2021 06:09:29 - INFO - __main__ - Step 63317: {'lr': 0.00031650044191141555, 'samples': 12156864, 'steps': 63316, 'loss/train': 0.9881426095962524} 11/07/2021 06:09:30 - INFO - __main__ - Step 63318: {'lr': 0.00031649532634060154, 'samples': 12157056, 'steps': 63317, 'loss/train': 1.521949291229248} 11/07/2021 06:09:31 - INFO - __main__ - Step 63319: {'lr': 0.0003164902107398257, 'samples': 12157248, 'steps': 63318, 'loss/train': 1.534589171409607} 11/07/2021 06:09:31 - INFO - __main__ - Step 63320: {'lr': 0.0003164850951090905, 'samples': 12157440, 'steps': 63319, 'loss/train': 0.5227958559989929} 11/07/2021 06:09:31 - INFO - __main__ - Step 63321: {'lr': 0.00031647997944839814, 'samples': 12157632, 'steps': 63320, 'loss/train': 1.5157705545425415} 11/07/2021 06:09:32 - INFO - __main__ - Step 63322: {'lr': 0.0003164748637577509, 'samples': 12157824, 'steps': 63321, 'loss/train': 1.3654530048370361} 11/07/2021 06:09:33 - INFO - __main__ - Step 63323: {'lr': 0.00031646974803715104, 'samples': 12158016, 'steps': 63322, 'loss/train': 1.5284191370010376} 11/07/2021 06:09:33 - INFO - __main__ - Step 63324: {'lr': 0.000316464632286601, 'samples': 12158208, 'steps': 63323, 'loss/train': 1.9541243314743042} 11/07/2021 06:09:34 - INFO - __main__ - Step 63325: {'lr': 0.000316459516506103, 'samples': 12158400, 'steps': 63324, 'loss/train': 2.17706561088562} 11/07/2021 06:09:34 - INFO - __main__ - Step 63326: {'lr': 0.00031645440069565946, 'samples': 12158592, 'steps': 63325, 'loss/train': 1.370567798614502} 11/07/2021 06:09:34 - INFO - __main__ - Step 63327: {'lr': 0.0003164492848552725, 'samples': 12158784, 'steps': 63326, 'loss/train': 1.4034243822097778} 11/07/2021 06:09:35 - INFO - __main__ - Step 63328: {'lr': 0.00031644416898494456, 'samples': 12158976, 'steps': 63327, 'loss/train': 2.0898783206939697} 11/07/2021 06:09:36 - INFO - __main__ - Step 63329: {'lr': 0.00031643905308467783, 'samples': 12159168, 'steps': 63328, 'loss/train': 1.0967957973480225} 11/07/2021 06:09:36 - INFO - __main__ - Step 63330: {'lr': 0.0003164339371544748, 'samples': 12159360, 'steps': 63329, 'loss/train': 1.6074188947677612} 11/07/2021 06:09:36 - INFO - __main__ - Step 63331: {'lr': 0.0003164288211943376, 'samples': 12159552, 'steps': 63330, 'loss/train': 1.1600793600082397} 11/07/2021 06:09:37 - INFO - __main__ - Step 63332: {'lr': 0.0003164237052042686, 'samples': 12159744, 'steps': 63331, 'loss/train': 1.0306875705718994} 11/07/2021 06:09:37 - INFO - __main__ - Step 63333: {'lr': 0.00031641858918427006, 'samples': 12159936, 'steps': 63332, 'loss/train': 1.9872030019760132} 11/07/2021 06:09:38 - INFO - __main__ - Step 63334: {'lr': 0.00031641347313434446, 'samples': 12160128, 'steps': 63333, 'loss/train': 1.1026113033294678} 11/07/2021 06:09:38 - INFO - __main__ - Step 63335: {'lr': 0.00031640835705449384, 'samples': 12160320, 'steps': 63334, 'loss/train': 1.2963074445724487} 11/07/2021 06:09:39 - INFO - __main__ - Step 63336: {'lr': 0.0003164032409447207, 'samples': 12160512, 'steps': 63335, 'loss/train': 1.5570805072784424} 11/07/2021 06:09:39 - INFO - __main__ - Step 63337: {'lr': 0.0003163981248050273, 'samples': 12160704, 'steps': 63336, 'loss/train': 1.8954963684082031} 11/07/2021 06:09:39 - INFO - __main__ - Step 63338: {'lr': 0.0003163930086354159, 'samples': 12160896, 'steps': 63337, 'loss/train': 0.7896292805671692} 11/07/2021 06:09:40 - INFO - __main__ - Step 63339: {'lr': 0.00031638789243588876, 'samples': 12161088, 'steps': 63338, 'loss/train': 1.629398226737976} 11/07/2021 06:09:41 - INFO - __main__ - Step 63340: {'lr': 0.0003163827762064484, 'samples': 12161280, 'steps': 63339, 'loss/train': 1.6168503761291504} 11/07/2021 06:09:41 - INFO - __main__ - Step 63341: {'lr': 0.0003163776599470969, 'samples': 12161472, 'steps': 63340, 'loss/train': 1.3514294624328613} 11/07/2021 06:09:42 - INFO - __main__ - Step 63342: {'lr': 0.00031637254365783667, 'samples': 12161664, 'steps': 63341, 'loss/train': 1.1914820671081543} 11/07/2021 06:09:42 - INFO - __main__ - Step 63343: {'lr': 0.00031636742733867, 'samples': 12161856, 'steps': 63342, 'loss/train': 1.4012451171875} 11/07/2021 06:09:43 - INFO - __main__ - Step 63344: {'lr': 0.00031636231098959924, 'samples': 12162048, 'steps': 63343, 'loss/train': 1.7631449699401855} 11/07/2021 06:09:43 - INFO - __main__ - Step 63345: {'lr': 0.0003163571946106265, 'samples': 12162240, 'steps': 63344, 'loss/train': 1.3246086835861206} 11/07/2021 06:09:44 - INFO - __main__ - Step 63346: {'lr': 0.00031635207820175437, 'samples': 12162432, 'steps': 63345, 'loss/train': 1.5614906549453735} 11/07/2021 06:09:44 - INFO - __main__ - Step 63347: {'lr': 0.000316346961762985, 'samples': 12162624, 'steps': 63346, 'loss/train': 1.159961462020874} 11/07/2021 06:09:44 - INFO - __main__ - Step 63348: {'lr': 0.0003163418452943207, 'samples': 12162816, 'steps': 63347, 'loss/train': 1.664660096168518} 11/07/2021 06:09:45 - INFO - __main__ - Step 63349: {'lr': 0.00031633672879576377, 'samples': 12163008, 'steps': 63348, 'loss/train': 1.2994552850723267} 11/07/2021 06:09:46 - INFO - __main__ - Step 63350: {'lr': 0.00031633161226731654, 'samples': 12163200, 'steps': 63349, 'loss/train': 1.0720208883285522} 11/07/2021 06:09:46 - INFO - __main__ - Step 63351: {'lr': 0.0003163264957089813, 'samples': 12163392, 'steps': 63350, 'loss/train': 1.5205823183059692} 11/07/2021 06:09:46 - INFO - __main__ - Step 63352: {'lr': 0.0003163213791207604, 'samples': 12163584, 'steps': 63351, 'loss/train': 1.1667022705078125} 11/07/2021 06:09:47 - INFO - __main__ - Step 63353: {'lr': 0.0003163162625026561, 'samples': 12163776, 'steps': 63352, 'loss/train': 1.5080126523971558} 11/07/2021 06:09:48 - INFO - __main__ - Step 63354: {'lr': 0.0003163111458546707, 'samples': 12163968, 'steps': 63353, 'loss/train': 1.5152760744094849} 11/07/2021 06:09:48 - INFO - __main__ - Step 63355: {'lr': 0.0003163060291768065, 'samples': 12164160, 'steps': 63354, 'loss/train': 0.941566526889801} 11/07/2021 06:09:48 - INFO - __main__ - Step 63356: {'lr': 0.00031630091246906585, 'samples': 12164352, 'steps': 63355, 'loss/train': 1.3876854181289673} 11/07/2021 06:09:49 - INFO - __main__ - Step 63357: {'lr': 0.000316295795731451, 'samples': 12164544, 'steps': 63356, 'loss/train': 1.2669808864593506} 11/07/2021 06:09:49 - INFO - __main__ - Step 63358: {'lr': 0.0003162906789639643, 'samples': 12164736, 'steps': 63357, 'loss/train': 1.5011337995529175} 11/07/2021 06:09:50 - INFO - __main__ - Step 63359: {'lr': 0.00031628556216660805, 'samples': 12164928, 'steps': 63358, 'loss/train': 1.7308405637741089} 11/07/2021 06:09:51 - INFO - __main__ - Step 63360: {'lr': 0.0003162804453393846, 'samples': 12165120, 'steps': 63359, 'loss/train': 1.034052848815918} 11/07/2021 06:09:51 - INFO - __main__ - Step 63361: {'lr': 0.0003162753284822962, 'samples': 12165312, 'steps': 63360, 'loss/train': 1.3659526109695435} 11/07/2021 06:09:51 - INFO - __main__ - Step 63362: {'lr': 0.0003162702115953451, 'samples': 12165504, 'steps': 63361, 'loss/train': 1.217204213142395} 11/07/2021 06:09:52 - INFO - __main__ - Step 63363: {'lr': 0.00031626509467853366, 'samples': 12165696, 'steps': 63362, 'loss/train': 1.4287961721420288} 11/07/2021 06:09:52 - INFO - __main__ - Step 63364: {'lr': 0.0003162599777318642, 'samples': 12165888, 'steps': 63363, 'loss/train': 1.5051320791244507} 11/07/2021 06:09:53 - INFO - __main__ - Step 63365: {'lr': 0.00031625486075533905, 'samples': 12166080, 'steps': 63364, 'loss/train': 1.5933514833450317} 11/07/2021 06:09:53 - INFO - __main__ - Step 63366: {'lr': 0.0003162497437489604, 'samples': 12166272, 'steps': 63365, 'loss/train': 1.552978277206421} 11/07/2021 06:09:54 - INFO - __main__ - Step 63367: {'lr': 0.0003162446267127308, 'samples': 12166464, 'steps': 63366, 'loss/train': 1.4843875169754028} 11/07/2021 06:09:54 - INFO - __main__ - Step 63368: {'lr': 0.00031623950964665225, 'samples': 12166656, 'steps': 63367, 'loss/train': 1.3519343137741089} 11/07/2021 06:09:54 - INFO - __main__ - Step 63369: {'lr': 0.00031623439255072726, 'samples': 12166848, 'steps': 63368, 'loss/train': 0.7859129309654236} 11/07/2021 06:09:55 - INFO - __main__ - Step 63370: {'lr': 0.000316229275424958, 'samples': 12167040, 'steps': 63369, 'loss/train': 1.1740882396697998} 11/07/2021 06:09:56 - INFO - __main__ - Step 63371: {'lr': 0.00031622415826934694, 'samples': 12167232, 'steps': 63370, 'loss/train': 1.8439863920211792} 11/07/2021 06:09:56 - INFO - __main__ - Step 63372: {'lr': 0.0003162190410838963, 'samples': 12167424, 'steps': 63371, 'loss/train': 1.3022757768630981} 11/07/2021 06:09:56 - INFO - __main__ - Step 63373: {'lr': 0.00031621392386860833, 'samples': 12167616, 'steps': 63372, 'loss/train': 1.310835361480713} 11/07/2021 06:09:57 - INFO - __main__ - Step 63374: {'lr': 0.00031620880662348546, 'samples': 12167808, 'steps': 63373, 'loss/train': 1.7579938173294067} 11/07/2021 06:09:58 - INFO - __main__ - Step 63375: {'lr': 0.00031620368934852985, 'samples': 12168000, 'steps': 63374, 'loss/train': 1.7128256559371948} 11/07/2021 06:09:58 - INFO - __main__ - Step 63376: {'lr': 0.0003161985720437439, 'samples': 12168192, 'steps': 63375, 'loss/train': 1.0540295839309692} 11/07/2021 06:09:58 - INFO - __main__ - Step 63377: {'lr': 0.0003161934547091299, 'samples': 12168384, 'steps': 63376, 'loss/train': 1.2955199480056763} 11/07/2021 06:09:59 - INFO - __main__ - Step 63378: {'lr': 0.0003161883373446901, 'samples': 12168576, 'steps': 63377, 'loss/train': 1.2888885736465454} 11/07/2021 06:09:59 - INFO - __main__ - Step 63379: {'lr': 0.0003161832199504269, 'samples': 12168768, 'steps': 63378, 'loss/train': 1.5784484148025513} 11/07/2021 06:10:00 - INFO - __main__ - Step 63380: {'lr': 0.0003161781025263426, 'samples': 12168960, 'steps': 63379, 'loss/train': 1.7263455390930176} 11/07/2021 06:10:01 - INFO - __main__ - Step 63381: {'lr': 0.0003161729850724394, 'samples': 12169152, 'steps': 63380, 'loss/train': 1.2715622186660767} 11/07/2021 06:10:01 - INFO - __main__ - Step 63382: {'lr': 0.00031616786758871974, 'samples': 12169344, 'steps': 63381, 'loss/train': 0.7897055745124817} 11/07/2021 06:10:01 - INFO - __main__ - Step 63383: {'lr': 0.0003161627500751858, 'samples': 12169536, 'steps': 63382, 'loss/train': 1.3962452411651611} 11/07/2021 06:10:02 - INFO - __main__ - Step 63384: {'lr': 0.00031615763253183996, 'samples': 12169728, 'steps': 63383, 'loss/train': 1.1344273090362549} 11/07/2021 06:10:03 - INFO - __main__ - Step 63385: {'lr': 0.0003161525149586845, 'samples': 12169920, 'steps': 63384, 'loss/train': 1.2184386253356934} 11/07/2021 06:10:03 - INFO - __main__ - Step 63386: {'lr': 0.0003161473973557218, 'samples': 12170112, 'steps': 63385, 'loss/train': 1.097032070159912} 11/07/2021 06:10:03 - INFO - __main__ - Step 63387: {'lr': 0.00031614227972295405, 'samples': 12170304, 'steps': 63386, 'loss/train': 1.5628775358200073} 11/07/2021 06:10:04 - INFO - __main__ - Step 63388: {'lr': 0.0003161371620603837, 'samples': 12170496, 'steps': 63387, 'loss/train': 1.280206561088562} 11/07/2021 06:10:04 - INFO - __main__ - Step 63389: {'lr': 0.00031613204436801285, 'samples': 12170688, 'steps': 63388, 'loss/train': 1.317950963973999} 11/07/2021 06:10:04 - INFO - __main__ - Step 63390: {'lr': 0.00031612692664584395, 'samples': 12170880, 'steps': 63389, 'loss/train': 1.0373796224594116} 11/07/2021 06:10:05 - INFO - __main__ - Step 63391: {'lr': 0.0003161218088938793, 'samples': 12171072, 'steps': 63390, 'loss/train': 1.592924952507019} 11/07/2021 06:10:06 - INFO - __main__ - Step 63392: {'lr': 0.00031611669111212117, 'samples': 12171264, 'steps': 63391, 'loss/train': 1.5184216499328613} 11/07/2021 06:10:06 - INFO - __main__ - Step 63393: {'lr': 0.00031611157330057183, 'samples': 12171456, 'steps': 63392, 'loss/train': 1.1802539825439453} 11/07/2021 06:10:06 - INFO - __main__ - Step 63394: {'lr': 0.0003161064554592337, 'samples': 12171648, 'steps': 63393, 'loss/train': 1.0108134746551514} 11/07/2021 06:10:07 - INFO - __main__ - Step 63395: {'lr': 0.00031610133758810905, 'samples': 12171840, 'steps': 63394, 'loss/train': 1.6150267124176025} 11/07/2021 06:10:08 - INFO - __main__ - Step 63396: {'lr': 0.0003160962196872001, 'samples': 12172032, 'steps': 63395, 'loss/train': 0.8368946313858032} 11/07/2021 06:10:08 - INFO - __main__ - Step 63397: {'lr': 0.00031609110175650926, 'samples': 12172224, 'steps': 63396, 'loss/train': 1.1027264595031738} 11/07/2021 06:10:08 - INFO - __main__ - Step 63398: {'lr': 0.0003160859837960387, 'samples': 12172416, 'steps': 63397, 'loss/train': 1.4078596830368042} 11/07/2021 06:10:09 - INFO - __main__ - Step 63399: {'lr': 0.0003160808658057909, 'samples': 12172608, 'steps': 63398, 'loss/train': 1.4386380910873413} 11/07/2021 06:10:09 - INFO - __main__ - Step 63400: {'lr': 0.00031607574778576807, 'samples': 12172800, 'steps': 63399, 'loss/train': 1.2782206535339355} 11/07/2021 06:10:10 - INFO - __main__ - Step 63401: {'lr': 0.0003160706297359725, 'samples': 12172992, 'steps': 63400, 'loss/train': 1.056853175163269} 11/07/2021 06:10:11 - INFO - __main__ - Step 63402: {'lr': 0.0003160655116564065, 'samples': 12173184, 'steps': 63401, 'loss/train': 1.3855323791503906} 11/07/2021 06:10:11 - INFO - __main__ - Step 63403: {'lr': 0.00031606039354707243, 'samples': 12173376, 'steps': 63402, 'loss/train': 1.3348041772842407} 11/07/2021 06:10:12 - INFO - __main__ - Step 63404: {'lr': 0.00031605527540797256, 'samples': 12173568, 'steps': 63403, 'loss/train': 1.4096312522888184} 11/07/2021 06:10:12 - INFO - __main__ - Step 63405: {'lr': 0.0003160501572391092, 'samples': 12173760, 'steps': 63404, 'loss/train': 1.3921446800231934} 11/07/2021 06:10:13 - INFO - __main__ - Step 63406: {'lr': 0.0003160450390404847, 'samples': 12173952, 'steps': 63405, 'loss/train': 0.8954759240150452} 11/07/2021 06:10:13 - INFO - __main__ - Step 63407: {'lr': 0.0003160399208121013, 'samples': 12174144, 'steps': 63406, 'loss/train': 1.5013645887374878} 11/07/2021 06:10:14 - INFO - __main__ - Step 63408: {'lr': 0.0003160348025539613, 'samples': 12174336, 'steps': 63407, 'loss/train': 1.5932530164718628} 11/07/2021 06:10:14 - INFO - __main__ - Step 63409: {'lr': 0.0003160296842660671, 'samples': 12174528, 'steps': 63408, 'loss/train': 0.8674557209014893} 11/07/2021 06:10:14 - INFO - __main__ - Step 63410: {'lr': 0.00031602456594842087, 'samples': 12174720, 'steps': 63409, 'loss/train': 1.0870306491851807} 11/07/2021 06:10:16 - INFO - __main__ - Step 63411: {'lr': 0.0003160194476010251, 'samples': 12174912, 'steps': 63410, 'loss/train': 1.1535037755966187} 11/07/2021 06:10:16 - INFO - __main__ - Step 63412: {'lr': 0.00031601432922388187, 'samples': 12175104, 'steps': 63411, 'loss/train': 1.8951451778411865} 11/07/2021 06:10:16 - INFO - __main__ - Step 63413: {'lr': 0.00031600921081699365, 'samples': 12175296, 'steps': 63412, 'loss/train': 1.4286848306655884} 11/07/2021 06:10:17 - INFO - __main__ - Step 63414: {'lr': 0.0003160040923803627, 'samples': 12175488, 'steps': 63413, 'loss/train': 1.3640401363372803} 11/07/2021 06:10:17 - INFO - __main__ - Step 63415: {'lr': 0.00031599897391399134, 'samples': 12175680, 'steps': 63414, 'loss/train': 1.2432786226272583} 11/07/2021 06:10:18 - INFO - __main__ - Step 63416: {'lr': 0.00031599385541788186, 'samples': 12175872, 'steps': 63415, 'loss/train': 1.7839120626449585} 11/07/2021 06:10:18 - INFO - __main__ - Step 63417: {'lr': 0.0003159887368920365, 'samples': 12176064, 'steps': 63416, 'loss/train': 1.219434142112732} 11/07/2021 06:10:19 - INFO - __main__ - Step 63418: {'lr': 0.00031598361833645765, 'samples': 12176256, 'steps': 63417, 'loss/train': 1.9833210706710815} 11/07/2021 06:10:19 - INFO - __main__ - Step 63419: {'lr': 0.0003159784997511476, 'samples': 12176448, 'steps': 63418, 'loss/train': 1.1644161939620972} 11/07/2021 06:10:19 - INFO - __main__ - Step 63420: {'lr': 0.0003159733811361087, 'samples': 12176640, 'steps': 63419, 'loss/train': 1.3531197309494019} 11/07/2021 06:10:20 - INFO - __main__ - Step 63421: {'lr': 0.00031596826249134324, 'samples': 12176832, 'steps': 63420, 'loss/train': 0.5067185759544373} 11/07/2021 06:10:21 - INFO - __main__ - Step 63422: {'lr': 0.00031596314381685344, 'samples': 12177024, 'steps': 63421, 'loss/train': 1.1595700979232788} 11/07/2021 06:10:22 - INFO - __main__ - Step 63423: {'lr': 0.0003159580251126417, 'samples': 12177216, 'steps': 63422, 'loss/train': 1.4732309579849243} 11/07/2021 06:10:22 - INFO - __main__ - Step 63424: {'lr': 0.00031595290637871024, 'samples': 12177408, 'steps': 63423, 'loss/train': 2.8518855571746826} 11/07/2021 06:10:22 - INFO - __main__ - Step 63425: {'lr': 0.0003159477876150615, 'samples': 12177600, 'steps': 63424, 'loss/train': 1.849096417427063} 11/07/2021 06:10:23 - INFO - __main__ - Step 63426: {'lr': 0.00031594266882169756, 'samples': 12177792, 'steps': 63425, 'loss/train': 1.867957353591919} 11/07/2021 06:10:23 - INFO - __main__ - Step 63427: {'lr': 0.00031593754999862105, 'samples': 12177984, 'steps': 63426, 'loss/train': 1.5339882373809814} 11/07/2021 06:10:24 - INFO - __main__ - Step 63428: {'lr': 0.00031593243114583404, 'samples': 12178176, 'steps': 63427, 'loss/train': 1.486051321029663} 11/07/2021 06:10:24 - INFO - __main__ - Step 63429: {'lr': 0.0003159273122633388, 'samples': 12178368, 'steps': 63428, 'loss/train': 1.2975986003875732} 11/07/2021 06:10:25 - INFO - __main__ - Step 63430: {'lr': 0.00031592219335113784, 'samples': 12178560, 'steps': 63429, 'loss/train': 1.346978783607483} 11/07/2021 06:10:25 - INFO - __main__ - Step 63431: {'lr': 0.0003159170744092333, 'samples': 12178752, 'steps': 63430, 'loss/train': 1.4224967956542969} 11/07/2021 06:10:25 - INFO - __main__ - Step 63432: {'lr': 0.0003159119554376275, 'samples': 12178944, 'steps': 63431, 'loss/train': 1.4995206594467163} 11/07/2021 06:10:26 - INFO - __main__ - Step 63433: {'lr': 0.0003159068364363229, 'samples': 12179136, 'steps': 63432, 'loss/train': 1.0408614873886108} 11/07/2021 06:10:27 - INFO - __main__ - Step 63434: {'lr': 0.0003159017174053217, 'samples': 12179328, 'steps': 63433, 'loss/train': 1.112278938293457} 11/07/2021 06:10:27 - INFO - __main__ - Step 63435: {'lr': 0.00031589659834462615, 'samples': 12179520, 'steps': 63434, 'loss/train': 1.3181244134902954} 11/07/2021 06:10:27 - INFO - __main__ - Step 63436: {'lr': 0.00031589147925423856, 'samples': 12179712, 'steps': 63435, 'loss/train': 1.269150733947754} 11/07/2021 06:10:28 - INFO - __main__ - Step 63437: {'lr': 0.00031588636013416135, 'samples': 12179904, 'steps': 63436, 'loss/train': 1.9419015645980835} 11/07/2021 06:10:28 - INFO - __main__ - Step 63438: {'lr': 0.0003158812409843967, 'samples': 12180096, 'steps': 63437, 'loss/train': 1.132951021194458} 11/07/2021 06:10:29 - INFO - __main__ - Step 63439: {'lr': 0.000315876121804947, 'samples': 12180288, 'steps': 63438, 'loss/train': 1.3250256776809692} 11/07/2021 06:10:29 - INFO - __main__ - Step 63440: {'lr': 0.0003158710025958146, 'samples': 12180480, 'steps': 63439, 'loss/train': 1.1986637115478516} 11/07/2021 06:10:30 - INFO - __main__ - Step 63441: {'lr': 0.0003158658833570017, 'samples': 12180672, 'steps': 63440, 'loss/train': 1.112156629562378} 11/07/2021 06:10:30 - INFO - __main__ - Step 63442: {'lr': 0.00031586076408851067, 'samples': 12180864, 'steps': 63441, 'loss/train': 1.3525608777999878} 11/07/2021 06:10:31 - INFO - __main__ - Step 63443: {'lr': 0.00031585564479034376, 'samples': 12181056, 'steps': 63442, 'loss/train': 1.362454891204834} 11/07/2021 06:10:32 - INFO - __main__ - Step 63444: {'lr': 0.0003158505254625034, 'samples': 12181248, 'steps': 63443, 'loss/train': 1.3906463384628296} 11/07/2021 06:10:32 - INFO - __main__ - Step 63445: {'lr': 0.0003158454061049917, 'samples': 12181440, 'steps': 63444, 'loss/train': 1.6062864065170288} 11/07/2021 06:10:32 - INFO - __main__ - Step 63446: {'lr': 0.00031584028671781107, 'samples': 12181632, 'steps': 63445, 'loss/train': 1.625989556312561} 11/07/2021 06:10:33 - INFO - __main__ - Step 63447: {'lr': 0.000315835167300964, 'samples': 12181824, 'steps': 63446, 'loss/train': 0.9491329789161682} 11/07/2021 06:10:33 - INFO - __main__ - Step 63448: {'lr': 0.0003158300478544524, 'samples': 12182016, 'steps': 63447, 'loss/train': 1.154571533203125} 11/07/2021 06:10:34 - INFO - __main__ - Step 63449: {'lr': 0.0003158249283782789, 'samples': 12182208, 'steps': 63448, 'loss/train': 1.6843020915985107} 11/07/2021 06:10:34 - INFO - __main__ - Step 63450: {'lr': 0.00031581980887244565, 'samples': 12182400, 'steps': 63449, 'loss/train': 1.4193636178970337} 11/07/2021 06:10:35 - INFO - __main__ - Step 63451: {'lr': 0.00031581468933695507, 'samples': 12182592, 'steps': 63450, 'loss/train': 1.7323213815689087} 11/07/2021 06:10:35 - INFO - __main__ - Step 63452: {'lr': 0.0003158095697718094, 'samples': 12182784, 'steps': 63451, 'loss/train': 1.3951255083084106} 11/07/2021 06:10:35 - INFO - __main__ - Step 63453: {'lr': 0.00031580445017701094, 'samples': 12182976, 'steps': 63452, 'loss/train': 1.7713255882263184} 11/07/2021 06:10:37 - INFO - __main__ - Step 63454: {'lr': 0.00031579933055256206, 'samples': 12183168, 'steps': 63453, 'loss/train': 1.4213266372680664} 11/07/2021 06:10:37 - INFO - __main__ - Step 63455: {'lr': 0.0003157942108984649, 'samples': 12183360, 'steps': 63454, 'loss/train': 1.6531875133514404} 11/07/2021 06:10:37 - INFO - __main__ - Step 63456: {'lr': 0.000315789091214722, 'samples': 12183552, 'steps': 63455, 'loss/train': 1.5109175443649292} 11/07/2021 06:10:38 - INFO - __main__ - Step 63457: {'lr': 0.00031578397150133547, 'samples': 12183744, 'steps': 63456, 'loss/train': 1.4989370107650757} 11/07/2021 06:10:38 - INFO - __main__ - Step 63458: {'lr': 0.0003157788517583077, 'samples': 12183936, 'steps': 63457, 'loss/train': 1.3570330142974854} 11/07/2021 06:10:38 - INFO - __main__ - Step 63459: {'lr': 0.0003157737319856411, 'samples': 12184128, 'steps': 63458, 'loss/train': 1.4584674835205078} 11/07/2021 06:10:39 - INFO - __main__ - Step 63460: {'lr': 0.00031576861218333773, 'samples': 12184320, 'steps': 63459, 'loss/train': 1.2474781274795532} 11/07/2021 06:10:40 - INFO - __main__ - Step 63461: {'lr': 0.0003157634923514001, 'samples': 12184512, 'steps': 63460, 'loss/train': 1.6628044843673706} 11/07/2021 06:10:40 - INFO - __main__ - Step 63462: {'lr': 0.00031575837248983045, 'samples': 12184704, 'steps': 63461, 'loss/train': 1.377041220664978} 11/07/2021 06:10:40 - INFO - __main__ - Step 63463: {'lr': 0.00031575325259863114, 'samples': 12184896, 'steps': 63462, 'loss/train': 1.3037155866622925} 11/07/2021 06:10:41 - INFO - __main__ - Step 63464: {'lr': 0.0003157481326778043, 'samples': 12185088, 'steps': 63463, 'loss/train': 1.1675523519515991} 11/07/2021 06:10:42 - INFO - __main__ - Step 63465: {'lr': 0.00031574301272735254, 'samples': 12185280, 'steps': 63464, 'loss/train': 1.8717883825302124} 11/07/2021 06:10:42 - INFO - __main__ - Step 63466: {'lr': 0.0003157378927472779, 'samples': 12185472, 'steps': 63465, 'loss/train': 1.5642437934875488} 11/07/2021 06:10:43 - INFO - __main__ - Step 63467: {'lr': 0.00031573277273758284, 'samples': 12185664, 'steps': 63466, 'loss/train': 5.896816730499268} 11/07/2021 06:10:43 - INFO - __main__ - Step 63468: {'lr': 0.00031572765269826953, 'samples': 12185856, 'steps': 63467, 'loss/train': 1.35538649559021} 11/07/2021 06:10:43 - INFO - __main__ - Step 63469: {'lr': 0.00031572253262934037, 'samples': 12186048, 'steps': 63468, 'loss/train': 1.152369499206543} 11/07/2021 06:10:44 - INFO - __main__ - Step 63470: {'lr': 0.0003157174125307977, 'samples': 12186240, 'steps': 63469, 'loss/train': 1.90871262550354} 11/07/2021 06:10:45 - INFO - __main__ - Step 63471: {'lr': 0.0003157122924026437, 'samples': 12186432, 'steps': 63470, 'loss/train': 1.7425318956375122} 11/07/2021 06:10:45 - INFO - __main__ - Step 63472: {'lr': 0.00031570717224488077, 'samples': 12186624, 'steps': 63471, 'loss/train': 1.2911161184310913} 11/07/2021 06:10:45 - INFO - __main__ - Step 63473: {'lr': 0.00031570205205751125, 'samples': 12186816, 'steps': 63472, 'loss/train': 1.0681676864624023} 11/07/2021 06:10:46 - INFO - __main__ - Step 63474: {'lr': 0.00031569693184053737, 'samples': 12187008, 'steps': 63473, 'loss/train': 1.3242188692092896} 11/07/2021 06:10:46 - INFO - __main__ - Step 63475: {'lr': 0.0003156918115939614, 'samples': 12187200, 'steps': 63474, 'loss/train': 1.524174451828003} 11/07/2021 06:10:47 - INFO - __main__ - Step 63476: {'lr': 0.00031568669131778587, 'samples': 12187392, 'steps': 63475, 'loss/train': 1.4743164777755737} 11/07/2021 06:10:47 - INFO - __main__ - Step 63477: {'lr': 0.00031568157101201285, 'samples': 12187584, 'steps': 63476, 'loss/train': 1.3958302736282349} 11/07/2021 06:10:48 - INFO - __main__ - Step 63478: {'lr': 0.00031567645067664474, 'samples': 12187776, 'steps': 63477, 'loss/train': 1.4827399253845215} 11/07/2021 06:10:48 - INFO - __main__ - Step 63479: {'lr': 0.0003156713303116838, 'samples': 12187968, 'steps': 63478, 'loss/train': 1.5677341222763062} 11/07/2021 06:10:48 - INFO - __main__ - Step 63480: {'lr': 0.0003156662099171324, 'samples': 12188160, 'steps': 63479, 'loss/train': 1.7530694007873535} 11/07/2021 06:10:50 - INFO - __main__ - Step 63481: {'lr': 0.00031566108949299284, 'samples': 12188352, 'steps': 63480, 'loss/train': 1.7828270196914673} 11/07/2021 06:10:50 - INFO - __main__ - Step 63482: {'lr': 0.00031565596903926737, 'samples': 12188544, 'steps': 63481, 'loss/train': 1.254654049873352} 11/07/2021 06:10:50 - INFO - __main__ - Step 63483: {'lr': 0.00031565084855595825, 'samples': 12188736, 'steps': 63482, 'loss/train': 2.0601515769958496} 11/07/2021 06:10:51 - INFO - __main__ - Step 63484: {'lr': 0.00031564572804306803, 'samples': 12188928, 'steps': 63483, 'loss/train': 0.9551054835319519} 11/07/2021 06:10:51 - INFO - __main__ - Step 63485: {'lr': 0.00031564060750059877, 'samples': 12189120, 'steps': 63484, 'loss/train': 1.3809465169906616} 11/07/2021 06:10:51 - INFO - __main__ - Step 63486: {'lr': 0.0003156354869285528, 'samples': 12189312, 'steps': 63485, 'loss/train': 1.2333961725234985} 11/07/2021 06:10:52 - INFO - __main__ - Step 63487: {'lr': 0.0003156303663269326, 'samples': 12189504, 'steps': 63486, 'loss/train': 1.7833454608917236} 11/07/2021 06:10:53 - INFO - __main__ - Step 63488: {'lr': 0.00031562524569574043, 'samples': 12189696, 'steps': 63487, 'loss/train': 1.5089420080184937} 11/07/2021 06:10:53 - INFO - __main__ - Step 63489: {'lr': 0.0003156201250349784, 'samples': 12189888, 'steps': 63488, 'loss/train': 1.192374348640442} 11/07/2021 06:10:53 - INFO - __main__ - Step 63490: {'lr': 0.00031561500434464904, 'samples': 12190080, 'steps': 63489, 'loss/train': 1.3422991037368774} 11/07/2021 06:10:54 - INFO - __main__ - Step 63491: {'lr': 0.00031560988362475454, 'samples': 12190272, 'steps': 63490, 'loss/train': 1.0031070709228516} 11/07/2021 06:10:55 - INFO - __main__ - Step 63492: {'lr': 0.00031560476287529715, 'samples': 12190464, 'steps': 63491, 'loss/train': 1.4682241678237915} 11/07/2021 06:10:55 - INFO - __main__ - Step 63493: {'lr': 0.00031559964209627937, 'samples': 12190656, 'steps': 63492, 'loss/train': 1.776482105255127} 11/07/2021 06:10:56 - INFO - __main__ - Step 63494: {'lr': 0.00031559452128770337, 'samples': 12190848, 'steps': 63493, 'loss/train': 1.7526189088821411} 11/07/2021 06:10:56 - INFO - __main__ - Step 63495: {'lr': 0.0003155894004495716, 'samples': 12191040, 'steps': 63494, 'loss/train': 0.4794616103172302} 11/07/2021 06:10:56 - INFO - __main__ - Step 63496: {'lr': 0.0003155842795818861, 'samples': 12191232, 'steps': 63495, 'loss/train': 1.1431442499160767} 11/07/2021 06:10:57 - INFO - __main__ - Step 63497: {'lr': 0.00031557915868464943, 'samples': 12191424, 'steps': 63496, 'loss/train': 1.5880749225616455} 11/07/2021 06:10:58 - INFO - __main__ - Step 63498: {'lr': 0.00031557403775786373, 'samples': 12191616, 'steps': 63497, 'loss/train': 1.0505249500274658} 11/07/2021 06:10:58 - INFO - __main__ - Step 63499: {'lr': 0.00031556891680153146, 'samples': 12191808, 'steps': 63498, 'loss/train': 1.8781708478927612} 11/07/2021 06:10:59 - INFO - __main__ - Step 63500: {'lr': 0.00031556379581565474, 'samples': 12192000, 'steps': 63499, 'loss/train': 1.408470869064331} 11/07/2021 06:10:59 - INFO - __main__ - Step 63501: {'lr': 0.00031555867480023616, 'samples': 12192192, 'steps': 63500, 'loss/train': 1.5167676210403442} 11/07/2021 06:10:59 - INFO - __main__ - Step 63502: {'lr': 0.00031555355375527774, 'samples': 12192384, 'steps': 63501, 'loss/train': 1.692413330078125} 11/07/2021 06:11:00 - INFO - __main__ - Step 63503: {'lr': 0.00031554843268078185, 'samples': 12192576, 'steps': 63502, 'loss/train': 1.5161224603652954} 11/07/2021 06:11:01 - INFO - __main__ - Step 63504: {'lr': 0.00031554331157675094, 'samples': 12192768, 'steps': 63503, 'loss/train': 1.3664014339447021} 11/07/2021 06:11:01 - INFO - __main__ - Step 63505: {'lr': 0.0003155381904431872, 'samples': 12192960, 'steps': 63504, 'loss/train': 1.6219927072525024} 11/07/2021 06:11:01 - INFO - __main__ - Step 63506: {'lr': 0.0003155330692800929, 'samples': 12193152, 'steps': 63505, 'loss/train': 1.5314934253692627} 11/07/2021 06:11:02 - INFO - __main__ - Step 63507: {'lr': 0.0003155279480874705, 'samples': 12193344, 'steps': 63506, 'loss/train': 1.358770728111267} 11/07/2021 06:11:03 - INFO - __main__ - Step 63508: {'lr': 0.0003155228268653222, 'samples': 12193536, 'steps': 63507, 'loss/train': 0.9858087301254272} 11/07/2021 06:11:03 - INFO - __main__ - Step 63509: {'lr': 0.00031551770561365027, 'samples': 12193728, 'steps': 63508, 'loss/train': 1.237775444984436} 11/07/2021 06:11:03 - INFO - __main__ - Step 63510: {'lr': 0.0003155125843324571, 'samples': 12193920, 'steps': 63509, 'loss/train': 1.4457612037658691} 11/07/2021 06:11:04 - INFO - __main__ - Step 63511: {'lr': 0.000315507463021745, 'samples': 12194112, 'steps': 63510, 'loss/train': 1.1473102569580078} 11/07/2021 06:11:04 - INFO - __main__ - Step 63512: {'lr': 0.0003155023416815162, 'samples': 12194304, 'steps': 63511, 'loss/train': 1.7757632732391357} 11/07/2021 06:11:05 - INFO - __main__ - Step 63513: {'lr': 0.0003154972203117731, 'samples': 12194496, 'steps': 63512, 'loss/train': 1.7134867906570435} 11/07/2021 06:11:05 - INFO - __main__ - Step 63514: {'lr': 0.00031549209891251794, 'samples': 12194688, 'steps': 63513, 'loss/train': 1.4333361387252808} 11/07/2021 06:11:06 - INFO - __main__ - Step 63515: {'lr': 0.0003154869774837531, 'samples': 12194880, 'steps': 63514, 'loss/train': 1.8366435766220093} 11/07/2021 06:11:06 - INFO - __main__ - Step 63516: {'lr': 0.0003154818560254808, 'samples': 12195072, 'steps': 63515, 'loss/train': 1.0698198080062866} 11/07/2021 06:11:07 - INFO - __main__ - Step 63517: {'lr': 0.00031547673453770337, 'samples': 12195264, 'steps': 63516, 'loss/train': 1.584882140159607} 11/07/2021 06:11:07 - INFO - __main__ - Step 63518: {'lr': 0.00031547161302042316, 'samples': 12195456, 'steps': 63517, 'loss/train': 0.823932945728302} 11/07/2021 06:11:08 - INFO - __main__ - Step 63519: {'lr': 0.00031546649147364236, 'samples': 12195648, 'steps': 63518, 'loss/train': 1.3774389028549194} 11/07/2021 06:11:08 - INFO - __main__ - Step 63520: {'lr': 0.0003154613698973635, 'samples': 12195840, 'steps': 63519, 'loss/train': 1.4698961973190308} 11/07/2021 06:11:09 - INFO - __main__ - Step 63521: {'lr': 0.0003154562482915887, 'samples': 12196032, 'steps': 63520, 'loss/train': 1.490541934967041} 11/07/2021 06:11:09 - INFO - __main__ - Step 63522: {'lr': 0.00031545112665632037, 'samples': 12196224, 'steps': 63521, 'loss/train': 1.6818863153457642} 11/07/2021 06:11:09 - INFO - __main__ - Step 63523: {'lr': 0.00031544600499156076, 'samples': 12196416, 'steps': 63522, 'loss/train': 1.3312374353408813} 11/07/2021 06:11:10 - INFO - __main__ - Step 63524: {'lr': 0.00031544088329731214, 'samples': 12196608, 'steps': 63523, 'loss/train': 1.3348139524459839} 11/07/2021 06:11:11 - INFO - __main__ - Step 63525: {'lr': 0.00031543576157357686, 'samples': 12196800, 'steps': 63524, 'loss/train': 1.660157561302185} 11/07/2021 06:11:11 - INFO - __main__ - Step 63526: {'lr': 0.00031543063982035724, 'samples': 12196992, 'steps': 63525, 'loss/train': 1.5402826070785522} 11/07/2021 06:11:11 - INFO - __main__ - Step 63527: {'lr': 0.0003154255180376556, 'samples': 12197184, 'steps': 63526, 'loss/train': 1.0925259590148926} 11/07/2021 06:11:12 - INFO - __main__ - Step 63528: {'lr': 0.00031542039622547426, 'samples': 12197376, 'steps': 63527, 'loss/train': 1.574507474899292} 11/07/2021 06:11:13 - INFO - __main__ - Step 63529: {'lr': 0.0003154152743838155, 'samples': 12197568, 'steps': 63528, 'loss/train': 1.5038747787475586} 11/07/2021 06:11:13 - INFO - __main__ - Step 63530: {'lr': 0.0003154101525126816, 'samples': 12197760, 'steps': 63529, 'loss/train': 1.3178468942642212} 11/07/2021 06:11:14 - INFO - __main__ - Step 63531: {'lr': 0.0003154050306120749, 'samples': 12197952, 'steps': 63530, 'loss/train': 0.6582124829292297} 11/07/2021 06:11:14 - INFO - __main__ - Step 63532: {'lr': 0.0003153999086819977, 'samples': 12198144, 'steps': 63531, 'loss/train': 1.2580722570419312} 11/07/2021 06:11:14 - INFO - __main__ - Step 63533: {'lr': 0.00031539478672245225, 'samples': 12198336, 'steps': 63532, 'loss/train': 1.924052357673645} 11/07/2021 06:11:15 - INFO - __main__ - Step 63534: {'lr': 0.000315389664733441, 'samples': 12198528, 'steps': 63533, 'loss/train': 1.6368000507354736} 11/07/2021 06:11:16 - INFO - __main__ - Step 63535: {'lr': 0.0003153845427149662, 'samples': 12198720, 'steps': 63534, 'loss/train': 1.62251877784729} 11/07/2021 06:11:16 - INFO - __main__ - Step 63536: {'lr': 0.0003153794206670301, 'samples': 12198912, 'steps': 63535, 'loss/train': 1.590144395828247} 11/07/2021 06:11:16 - INFO - __main__ - Step 63537: {'lr': 0.000315374298589635, 'samples': 12199104, 'steps': 63536, 'loss/train': 0.8089854717254639} 11/07/2021 06:11:17 - INFO - __main__ - Step 63538: {'lr': 0.00031536917648278327, 'samples': 12199296, 'steps': 63537, 'loss/train': 1.7614320516586304} 11/07/2021 06:11:17 - INFO - __main__ - Step 63539: {'lr': 0.0003153640543464772, 'samples': 12199488, 'steps': 63538, 'loss/train': 1.2092760801315308} 11/07/2021 06:11:18 - INFO - __main__ - Step 63540: {'lr': 0.0003153589321807191, 'samples': 12199680, 'steps': 63539, 'loss/train': 1.3696832656860352} 11/07/2021 06:11:19 - INFO - __main__ - Step 63541: {'lr': 0.00031535380998551127, 'samples': 12199872, 'steps': 63540, 'loss/train': 1.3958102464675903} 11/07/2021 06:11:19 - INFO - __main__ - Step 63542: {'lr': 0.00031534868776085615, 'samples': 12200064, 'steps': 63541, 'loss/train': 1.3377269506454468} 11/07/2021 06:11:19 - INFO - __main__ - Step 63543: {'lr': 0.00031534356550675573, 'samples': 12200256, 'steps': 63542, 'loss/train': 1.4581636190414429} 11/07/2021 06:11:20 - INFO - __main__ - Step 63544: {'lr': 0.0003153384432232126, 'samples': 12200448, 'steps': 63543, 'loss/train': 1.6855357885360718} 11/07/2021 06:11:21 - INFO - __main__ - Step 63545: {'lr': 0.00031533332091022894, 'samples': 12200640, 'steps': 63544, 'loss/train': 1.8071836233139038} 11/07/2021 06:11:21 - INFO - __main__ - Step 63546: {'lr': 0.0003153281985678071, 'samples': 12200832, 'steps': 63545, 'loss/train': 1.6201304197311401} 11/07/2021 06:11:21 - INFO - __main__ - Step 63547: {'lr': 0.00031532307619594935, 'samples': 12201024, 'steps': 63546, 'loss/train': 1.5531151294708252} 11/07/2021 06:11:22 - INFO - __main__ - Step 63548: {'lr': 0.0003153179537946581, 'samples': 12201216, 'steps': 63547, 'loss/train': 1.2381559610366821} 11/07/2021 06:11:22 - INFO - __main__ - Step 63549: {'lr': 0.0003153128313639356, 'samples': 12201408, 'steps': 63548, 'loss/train': 1.1411452293395996} 11/07/2021 06:11:23 - INFO - __main__ - Step 63550: {'lr': 0.00031530770890378406, 'samples': 12201600, 'steps': 63549, 'loss/train': 1.5581775903701782} 11/07/2021 06:11:23 - INFO - __main__ - Step 63551: {'lr': 0.00031530258641420593, 'samples': 12201792, 'steps': 63550, 'loss/train': 1.5502650737762451} 11/07/2021 06:11:24 - INFO - __main__ - Step 63552: {'lr': 0.0003152974638952034, 'samples': 12201984, 'steps': 63551, 'loss/train': 0.946287214756012} 11/07/2021 06:11:24 - INFO - __main__ - Step 63553: {'lr': 0.0003152923413467789, 'samples': 12202176, 'steps': 63552, 'loss/train': 1.6280981302261353} 11/07/2021 06:11:24 - INFO - __main__ - Step 63554: {'lr': 0.0003152872187689347, 'samples': 12202368, 'steps': 63553, 'loss/train': 1.4066399335861206} 11/07/2021 06:11:26 - INFO - __main__ - Step 63555: {'lr': 0.0003152820961616731, 'samples': 12202560, 'steps': 63554, 'loss/train': 1.8144488334655762} 11/07/2021 06:11:26 - INFO - __main__ - Step 63556: {'lr': 0.00031527697352499637, 'samples': 12202752, 'steps': 63555, 'loss/train': 1.4270957708358765} 11/07/2021 06:11:26 - INFO - __main__ - Step 63557: {'lr': 0.00031527185085890677, 'samples': 12202944, 'steps': 63556, 'loss/train': 1.2306573390960693} 11/07/2021 06:11:27 - INFO - __main__ - Step 63558: {'lr': 0.0003152667281634067, 'samples': 12203136, 'steps': 63557, 'loss/train': 0.24882304668426514} 11/07/2021 06:11:27 - INFO - __main__ - Step 63559: {'lr': 0.00031526160543849855, 'samples': 12203328, 'steps': 63558, 'loss/train': 1.1916000843048096} 11/07/2021 06:11:28 - INFO - __main__ - Step 63560: {'lr': 0.0003152564826841844, 'samples': 12203520, 'steps': 63559, 'loss/train': 1.5640099048614502} 11/07/2021 06:11:29 - INFO - __main__ - Step 63561: {'lr': 0.0003152513599004667, 'samples': 12203712, 'steps': 63560, 'loss/train': 1.9893447160720825} 11/07/2021 06:11:29 - INFO - __main__ - Step 63562: {'lr': 0.0003152462370873479, 'samples': 12203904, 'steps': 63561, 'loss/train': 1.378453254699707} 11/07/2021 06:11:29 - INFO - __main__ - Step 63563: {'lr': 0.00031524111424483, 'samples': 12204096, 'steps': 63562, 'loss/train': 1.4607787132263184} 11/07/2021 06:11:30 - INFO - __main__ - Step 63564: {'lr': 0.00031523599137291554, 'samples': 12204288, 'steps': 63563, 'loss/train': 1.5709941387176514} 11/07/2021 06:11:30 - INFO - __main__ - Step 63565: {'lr': 0.0003152308684716067, 'samples': 12204480, 'steps': 63564, 'loss/train': 1.9657527208328247} 11/07/2021 06:11:31 - INFO - __main__ - Step 63566: {'lr': 0.00031522574554090584, 'samples': 12204672, 'steps': 63565, 'loss/train': 1.7270009517669678} 11/07/2021 06:11:31 - INFO - __main__ - Step 63567: {'lr': 0.00031522062258081525, 'samples': 12204864, 'steps': 63566, 'loss/train': 1.4754637479782104} 11/07/2021 06:11:32 - INFO - __main__ - Step 63568: {'lr': 0.0003152154995913373, 'samples': 12205056, 'steps': 63567, 'loss/train': 1.1645697355270386} 11/07/2021 06:11:32 - INFO - __main__ - Step 63569: {'lr': 0.0003152103765724743, 'samples': 12205248, 'steps': 63568, 'loss/train': 1.567056655883789} 11/07/2021 06:11:32 - INFO - __main__ - Step 63570: {'lr': 0.0003152052535242284, 'samples': 12205440, 'steps': 63569, 'loss/train': 1.431926965713501} 11/07/2021 06:11:34 - INFO - __main__ - Step 63571: {'lr': 0.00031520013044660205, 'samples': 12205632, 'steps': 63570, 'loss/train': 1.4367835521697998} 11/07/2021 06:11:34 - INFO - __main__ - Step 63572: {'lr': 0.0003151950073395975, 'samples': 12205824, 'steps': 63571, 'loss/train': 1.3793284893035889} 11/07/2021 06:11:34 - INFO - __main__ - Step 63573: {'lr': 0.00031518988420321716, 'samples': 12206016, 'steps': 63572, 'loss/train': 1.3469127416610718} 11/07/2021 06:11:35 - INFO - __main__ - Step 63574: {'lr': 0.0003151847610374632, 'samples': 12206208, 'steps': 63573, 'loss/train': 1.0975996255874634} 11/07/2021 06:11:35 - INFO - __main__ - Step 63575: {'lr': 0.00031517963784233804, 'samples': 12206400, 'steps': 63574, 'loss/train': 0.30244603753089905} 11/07/2021 06:11:36 - INFO - __main__ - Step 63576: {'lr': 0.000315174514617844, 'samples': 12206592, 'steps': 63575, 'loss/train': 1.5931717157363892} 11/07/2021 06:11:36 - INFO - __main__ - Step 63577: {'lr': 0.00031516939136398323, 'samples': 12206784, 'steps': 63576, 'loss/train': 0.5598853230476379} 11/07/2021 06:11:37 - INFO - __main__ - Step 63578: {'lr': 0.0003151642680807581, 'samples': 12206976, 'steps': 63577, 'loss/train': 1.4442839622497559} 11/07/2021 06:11:37 - INFO - __main__ - Step 63579: {'lr': 0.00031515914476817105, 'samples': 12207168, 'steps': 63578, 'loss/train': 1.2477236986160278} 11/07/2021 06:11:37 - INFO - __main__ - Step 63580: {'lr': 0.00031515402142622424, 'samples': 12207360, 'steps': 63579, 'loss/train': 1.4740936756134033} 11/07/2021 06:11:39 - INFO - __main__ - Step 63581: {'lr': 0.00031514889805492005, 'samples': 12207552, 'steps': 63580, 'loss/train': 1.2168738842010498} 11/07/2021 06:11:39 - INFO - __main__ - Step 63582: {'lr': 0.0003151437746542608, 'samples': 12207744, 'steps': 63581, 'loss/train': 1.6373648643493652} 11/07/2021 06:11:40 - INFO - __main__ - Step 63583: {'lr': 0.00031513865122424875, 'samples': 12207936, 'steps': 63582, 'loss/train': 3.3067452907562256} 11/07/2021 06:11:40 - INFO - __main__ - Step 63584: {'lr': 0.00031513352776488626, 'samples': 12208128, 'steps': 63583, 'loss/train': 2.044935941696167} 11/07/2021 06:11:40 - INFO - __main__ - Step 63585: {'lr': 0.0003151284042761755, 'samples': 12208320, 'steps': 63584, 'loss/train': 1.4573237895965576} 11/07/2021 06:11:41 - INFO - __main__ - Step 63586: {'lr': 0.00031512328075811895, 'samples': 12208512, 'steps': 63585, 'loss/train': 1.6663295030593872} 11/07/2021 06:11:41 - INFO - __main__ - Step 63587: {'lr': 0.0003151181572107189, 'samples': 12208704, 'steps': 63586, 'loss/train': 1.2064708471298218} 11/07/2021 06:11:42 - INFO - __main__ - Step 63588: {'lr': 0.0003151130336339776, 'samples': 12208896, 'steps': 63587, 'loss/train': 0.7885711193084717} 11/07/2021 06:11:42 - INFO - __main__ - Step 63589: {'lr': 0.00031510791002789735, 'samples': 12209088, 'steps': 63588, 'loss/train': 1.5703097581863403} 11/07/2021 06:11:43 - INFO - __main__ - Step 63590: {'lr': 0.0003151027863924805, 'samples': 12209280, 'steps': 63589, 'loss/train': 1.31941556930542} 11/07/2021 06:11:43 - INFO - __main__ - Step 63591: {'lr': 0.00031509766272772927, 'samples': 12209472, 'steps': 63590, 'loss/train': 1.6543264389038086} 11/07/2021 06:11:43 - INFO - __main__ - Step 63592: {'lr': 0.0003150925390336461, 'samples': 12209664, 'steps': 63591, 'loss/train': 1.4779689311981201} 11/07/2021 06:11:44 - INFO - __main__ - Step 63593: {'lr': 0.0003150874153102332, 'samples': 12209856, 'steps': 63592, 'loss/train': 1.425304889678955} 11/07/2021 06:11:45 - INFO - __main__ - Step 63594: {'lr': 0.00031508229155749294, 'samples': 12210048, 'steps': 63593, 'loss/train': 1.2093089818954468} 11/07/2021 06:11:45 - INFO - __main__ - Step 63595: {'lr': 0.0003150771677754276, 'samples': 12210240, 'steps': 63594, 'loss/train': 1.13663649559021} 11/07/2021 06:11:45 - INFO - __main__ - Step 63596: {'lr': 0.00031507204396403956, 'samples': 12210432, 'steps': 63595, 'loss/train': 1.7628862857818604} 11/07/2021 06:11:46 - INFO - __main__ - Step 63597: {'lr': 0.00031506692012333096, 'samples': 12210624, 'steps': 63596, 'loss/train': 1.414615273475647} 11/07/2021 06:11:46 - INFO - __main__ - Step 63598: {'lr': 0.00031506179625330423, 'samples': 12210816, 'steps': 63597, 'loss/train': 1.9403865337371826} 11/07/2021 06:11:47 - INFO - __main__ - Step 63599: {'lr': 0.00031505667235396176, 'samples': 12211008, 'steps': 63598, 'loss/train': 1.5062757730484009} 11/07/2021 06:11:48 - INFO - __main__ - Step 63600: {'lr': 0.0003150515484253056, 'samples': 12211200, 'steps': 63599, 'loss/train': 1.005183458328247} 11/07/2021 06:11:48 - INFO - __main__ - Step 63601: {'lr': 0.00031504642446733826, 'samples': 12211392, 'steps': 63600, 'loss/train': 1.0469269752502441} 11/07/2021 06:11:48 - INFO - __main__ - Step 63602: {'lr': 0.00031504130048006206, 'samples': 12211584, 'steps': 63601, 'loss/train': 1.6610376834869385} 11/07/2021 06:11:49 - INFO - __main__ - Step 63603: {'lr': 0.00031503617646347923, 'samples': 12211776, 'steps': 63602, 'loss/train': 1.7525477409362793} 11/07/2021 06:11:50 - INFO - __main__ - Step 63604: {'lr': 0.00031503105241759204, 'samples': 12211968, 'steps': 63603, 'loss/train': 1.418697714805603} 11/07/2021 06:11:50 - INFO - __main__ - Step 63605: {'lr': 0.000315025928342403, 'samples': 12212160, 'steps': 63604, 'loss/train': 1.6644628047943115} 11/07/2021 06:11:50 - INFO - __main__ - Step 63606: {'lr': 0.00031502080423791417, 'samples': 12212352, 'steps': 63605, 'loss/train': 5.817831039428711} 11/07/2021 06:11:51 - INFO - __main__ - Step 63607: {'lr': 0.000315015680104128, 'samples': 12212544, 'steps': 63606, 'loss/train': 1.6763979196548462} 11/07/2021 06:11:51 - INFO - __main__ - Step 63608: {'lr': 0.0003150105559410468, 'samples': 12212736, 'steps': 63607, 'loss/train': 1.3046973943710327} 11/07/2021 06:11:52 - INFO - __main__ - Step 63609: {'lr': 0.00031500543174867277, 'samples': 12212928, 'steps': 63608, 'loss/train': 1.541710376739502} 11/07/2021 06:11:53 - INFO - __main__ - Step 63610: {'lr': 0.0003150003075270084, 'samples': 12213120, 'steps': 63609, 'loss/train': 1.1046491861343384} 11/07/2021 06:11:53 - INFO - __main__ - Step 63611: {'lr': 0.00031499518327605583, 'samples': 12213312, 'steps': 63610, 'loss/train': 1.6506779193878174} 11/07/2021 06:11:53 - INFO - __main__ - Step 63612: {'lr': 0.0003149900589958174, 'samples': 12213504, 'steps': 63611, 'loss/train': 1.2448698282241821} 11/07/2021 06:11:54 - INFO - __main__ - Step 63613: {'lr': 0.00031498493468629546, 'samples': 12213696, 'steps': 63612, 'loss/train': 0.5770947933197021} 11/07/2021 06:11:55 - INFO - __main__ - Step 63614: {'lr': 0.00031497981034749235, 'samples': 12213888, 'steps': 63613, 'loss/train': 1.409685492515564} 11/07/2021 06:11:55 - INFO - __main__ - Step 63615: {'lr': 0.0003149746859794103, 'samples': 12214080, 'steps': 63614, 'loss/train': 1.9366803169250488} 11/07/2021 06:11:55 - INFO - __main__ - Step 63616: {'lr': 0.00031496956158205176, 'samples': 12214272, 'steps': 63615, 'loss/train': 0.8019841909408569} 11/07/2021 06:11:56 - INFO - __main__ - Step 63617: {'lr': 0.00031496443715541884, 'samples': 12214464, 'steps': 63616, 'loss/train': 1.4916391372680664} 11/07/2021 06:11:56 - INFO - __main__ - Step 63618: {'lr': 0.0003149593126995139, 'samples': 12214656, 'steps': 63617, 'loss/train': 0.7912411689758301} 11/07/2021 06:11:57 - INFO - __main__ - Step 63619: {'lr': 0.0003149541882143394, 'samples': 12214848, 'steps': 63618, 'loss/train': 1.4290804862976074} 11/07/2021 06:11:58 - INFO - __main__ - Step 63620: {'lr': 0.0003149490636998975, 'samples': 12215040, 'steps': 63619, 'loss/train': 1.4817118644714355} 11/07/2021 06:11:58 - INFO - __main__ - Step 63621: {'lr': 0.00031494393915619057, 'samples': 12215232, 'steps': 63620, 'loss/train': 1.5871628522872925} 11/07/2021 06:11:58 - INFO - __main__ - Step 63622: {'lr': 0.000314938814583221, 'samples': 12215424, 'steps': 63621, 'loss/train': 1.5339829921722412} 11/07/2021 06:11:59 - INFO - __main__ - Step 63623: {'lr': 0.00031493368998099084, 'samples': 12215616, 'steps': 63622, 'loss/train': 1.3918378353118896} 11/07/2021 06:11:59 - INFO - __main__ - Step 63624: {'lr': 0.00031492856534950264, 'samples': 12215808, 'steps': 63623, 'loss/train': 0.761618435382843} 11/07/2021 06:11:59 - INFO - __main__ - Step 63625: {'lr': 0.0003149234406887586, 'samples': 12216000, 'steps': 63624, 'loss/train': 1.6459662914276123} 11/07/2021 06:12:01 - INFO - __main__ - Step 63626: {'lr': 0.0003149183159987611, 'samples': 12216192, 'steps': 63625, 'loss/train': 1.0607693195343018} 11/07/2021 06:12:01 - INFO - __main__ - Step 63627: {'lr': 0.00031491319127951236, 'samples': 12216384, 'steps': 63626, 'loss/train': 1.117624282836914} 11/07/2021 06:12:01 - INFO - __main__ - Step 63628: {'lr': 0.0003149080665310148, 'samples': 12216576, 'steps': 63627, 'loss/train': 1.650597333908081} 11/07/2021 06:12:02 - INFO - __main__ - Step 63629: {'lr': 0.0003149029417532706, 'samples': 12216768, 'steps': 63628, 'loss/train': 5.205667972564697} 11/07/2021 06:12:02 - INFO - __main__ - Step 63630: {'lr': 0.0003148978169462822, 'samples': 12216960, 'steps': 63629, 'loss/train': 1.6056212186813354} 11/07/2021 06:12:03 - INFO - __main__ - Step 63631: {'lr': 0.00031489269211005177, 'samples': 12217152, 'steps': 63630, 'loss/train': 1.4431899785995483} 11/07/2021 06:12:03 - INFO - __main__ - Step 63632: {'lr': 0.00031488756724458173, 'samples': 12217344, 'steps': 63631, 'loss/train': 1.2649383544921875} 11/07/2021 06:12:04 - INFO - __main__ - Step 63633: {'lr': 0.0003148824423498744, 'samples': 12217536, 'steps': 63632, 'loss/train': 1.541593313217163} 11/07/2021 06:12:04 - INFO - __main__ - Step 63634: {'lr': 0.000314877317425932, 'samples': 12217728, 'steps': 63633, 'loss/train': 1.6123497486114502} 11/07/2021 06:12:04 - INFO - __main__ - Step 63635: {'lr': 0.0003148721924727568, 'samples': 12217920, 'steps': 63634, 'loss/train': 1.6263617277145386} 11/07/2021 06:12:05 - INFO - __main__ - Step 63636: {'lr': 0.00031486706749035134, 'samples': 12218112, 'steps': 63635, 'loss/train': 1.396032691001892} 11/07/2021 06:12:06 - INFO - __main__ - Step 63637: {'lr': 0.0003148619424787177, 'samples': 12218304, 'steps': 63636, 'loss/train': 1.2115333080291748} 11/07/2021 06:12:06 - INFO - __main__ - Step 63638: {'lr': 0.0003148568174378583, 'samples': 12218496, 'steps': 63637, 'loss/train': 1.532669186592102} 11/07/2021 06:12:06 - INFO - __main__ - Step 63639: {'lr': 0.0003148516923677754, 'samples': 12218688, 'steps': 63638, 'loss/train': 1.322365641593933} 11/07/2021 06:12:07 - INFO - __main__ - Step 63640: {'lr': 0.00031484656726847127, 'samples': 12218880, 'steps': 63639, 'loss/train': 0.7425395846366882} 11/07/2021 06:12:08 - INFO - __main__ - Step 63641: {'lr': 0.00031484144213994835, 'samples': 12219072, 'steps': 63640, 'loss/train': 1.0824750661849976} 11/07/2021 06:12:08 - INFO - __main__ - Step 63642: {'lr': 0.00031483631698220896, 'samples': 12219264, 'steps': 63641, 'loss/train': 1.5362910032272339} 11/07/2021 06:12:08 - INFO - __main__ - Step 63643: {'lr': 0.0003148311917952552, 'samples': 12219456, 'steps': 63642, 'loss/train': 1.8431518077850342} 11/07/2021 06:12:09 - INFO - __main__ - Step 63644: {'lr': 0.0003148260665790895, 'samples': 12219648, 'steps': 63643, 'loss/train': 1.600023627281189} 11/07/2021 06:12:09 - INFO - __main__ - Step 63645: {'lr': 0.0003148209413337142, 'samples': 12219840, 'steps': 63644, 'loss/train': 1.596527099609375} 11/07/2021 06:12:09 - INFO - __main__ - Step 63646: {'lr': 0.00031481581605913154, 'samples': 12220032, 'steps': 63645, 'loss/train': 1.3351709842681885} 11/07/2021 06:12:11 - INFO - __main__ - Step 63647: {'lr': 0.000314810690755344, 'samples': 12220224, 'steps': 63646, 'loss/train': 1.450132131576538} 11/07/2021 06:12:11 - INFO - __main__ - Step 63648: {'lr': 0.00031480556542235366, 'samples': 12220416, 'steps': 63647, 'loss/train': 2.2361457347869873} 11/07/2021 06:12:11 - INFO - __main__ - Step 63649: {'lr': 0.000314800440060163, 'samples': 12220608, 'steps': 63648, 'loss/train': 1.5123553276062012} 11/07/2021 06:12:12 - INFO - __main__ - Step 63650: {'lr': 0.0003147953146687742, 'samples': 12220800, 'steps': 63649, 'loss/train': 1.5257267951965332} 11/07/2021 06:12:12 - INFO - __main__ - Step 63651: {'lr': 0.00031479018924818967, 'samples': 12220992, 'steps': 63650, 'loss/train': 1.753623366355896} 11/07/2021 06:12:13 - INFO - __main__ - Step 63652: {'lr': 0.00031478506379841164, 'samples': 12221184, 'steps': 63651, 'loss/train': 0.6699872016906738} 11/07/2021 06:12:14 - INFO - __main__ - Step 63653: {'lr': 0.0003147799383194425, 'samples': 12221376, 'steps': 63652, 'loss/train': 1.2376106977462769} 11/07/2021 06:12:14 - INFO - __main__ - Step 63654: {'lr': 0.0003147748128112845, 'samples': 12221568, 'steps': 63653, 'loss/train': 0.8958832621574402} 11/07/2021 06:12:14 - INFO - __main__ - Step 63655: {'lr': 0.00031476968727393997, 'samples': 12221760, 'steps': 63654, 'loss/train': 0.18516960740089417} 11/07/2021 06:12:15 - INFO - __main__ - Step 63656: {'lr': 0.00031476456170741125, 'samples': 12221952, 'steps': 63655, 'loss/train': 1.3259886503219604} 11/07/2021 06:12:15 - INFO - __main__ - Step 63657: {'lr': 0.0003147594361117006, 'samples': 12222144, 'steps': 63656, 'loss/train': 1.559755563735962} 11/07/2021 06:12:16 - INFO - __main__ - Step 63658: {'lr': 0.0003147543104868103, 'samples': 12222336, 'steps': 63657, 'loss/train': 1.7092628479003906} 11/07/2021 06:12:16 - INFO - __main__ - Step 63659: {'lr': 0.0003147491848327427, 'samples': 12222528, 'steps': 63658, 'loss/train': 1.5438791513442993} 11/07/2021 06:12:17 - INFO - __main__ - Step 63660: {'lr': 0.00031474405914950023, 'samples': 12222720, 'steps': 63659, 'loss/train': 1.3905956745147705} 11/07/2021 06:12:17 - INFO - __main__ - Step 63661: {'lr': 0.00031473893343708496, 'samples': 12222912, 'steps': 63660, 'loss/train': 1.3133800029754639} 11/07/2021 06:12:17 - INFO - __main__ - Step 63662: {'lr': 0.00031473380769549944, 'samples': 12223104, 'steps': 63661, 'loss/train': 1.1495500802993774} 11/07/2021 06:12:19 - INFO - __main__ - Step 63663: {'lr': 0.0003147286819247458, 'samples': 12223296, 'steps': 63662, 'loss/train': 1.334654688835144} 11/07/2021 06:12:19 - INFO - __main__ - Step 63664: {'lr': 0.00031472355612482646, 'samples': 12223488, 'steps': 63663, 'loss/train': 1.2506400346755981} 11/07/2021 06:12:19 - INFO - __main__ - Step 63665: {'lr': 0.0003147184302957436, 'samples': 12223680, 'steps': 63664, 'loss/train': 1.7598986625671387} 11/07/2021 06:12:20 - INFO - __main__ - Step 63666: {'lr': 0.00031471330443749967, 'samples': 12223872, 'steps': 63665, 'loss/train': 1.5150237083435059} 11/07/2021 06:12:20 - INFO - __main__ - Step 63667: {'lr': 0.00031470817855009693, 'samples': 12224064, 'steps': 63666, 'loss/train': 1.3173795938491821} 11/07/2021 06:12:21 - INFO - __main__ - Step 63668: {'lr': 0.0003147030526335376, 'samples': 12224256, 'steps': 63667, 'loss/train': 1.649468183517456} 11/07/2021 06:12:21 - INFO - __main__ - Step 63669: {'lr': 0.0003146979266878242, 'samples': 12224448, 'steps': 63668, 'loss/train': 0.39336392283439636} 11/07/2021 06:12:22 - INFO - __main__ - Step 63670: {'lr': 0.00031469280071295887, 'samples': 12224640, 'steps': 63669, 'loss/train': 1.42099928855896} 11/07/2021 06:12:22 - INFO - __main__ - Step 63671: {'lr': 0.00031468767470894395, 'samples': 12224832, 'steps': 63670, 'loss/train': 1.5035680532455444} 11/07/2021 06:12:23 - INFO - __main__ - Step 63672: {'lr': 0.0003146825486757817, 'samples': 12225024, 'steps': 63671, 'loss/train': 1.7050800323486328} 11/07/2021 06:12:24 - INFO - __main__ - Step 63673: {'lr': 0.00031467742261347457, 'samples': 12225216, 'steps': 63672, 'loss/train': 1.2219147682189941} 11/07/2021 06:12:24 - INFO - __main__ - Step 63674: {'lr': 0.00031467229652202476, 'samples': 12225408, 'steps': 63673, 'loss/train': 1.4953210353851318} 11/07/2021 06:12:24 - INFO - __main__ - Step 63675: {'lr': 0.00031466717040143464, 'samples': 12225600, 'steps': 63674, 'loss/train': 0.34416744112968445} 11/07/2021 06:12:25 - INFO - __main__ - Step 63676: {'lr': 0.0003146620442517065, 'samples': 12225792, 'steps': 63675, 'loss/train': 1.3838837146759033} 11/07/2021 06:12:25 - INFO - __main__ - Step 63677: {'lr': 0.0003146569180728426, 'samples': 12225984, 'steps': 63676, 'loss/train': 1.158333659172058} 11/07/2021 06:12:26 - INFO - __main__ - Step 63678: {'lr': 0.0003146517918648453, 'samples': 12226176, 'steps': 63677, 'loss/train': 1.6816290616989136} 11/07/2021 06:12:26 - INFO - __main__ - Step 63679: {'lr': 0.00031464666562771687, 'samples': 12226368, 'steps': 63678, 'loss/train': 1.4635900259017944} 11/07/2021 06:12:27 - INFO - __main__ - Step 63680: {'lr': 0.0003146415393614597, 'samples': 12226560, 'steps': 63679, 'loss/train': 1.0986214876174927} 11/07/2021 06:12:27 - INFO - __main__ - Step 63681: {'lr': 0.00031463641306607605, 'samples': 12226752, 'steps': 63680, 'loss/train': 1.30622136592865} 11/07/2021 06:12:28 - INFO - __main__ - Step 63682: {'lr': 0.00031463128674156816, 'samples': 12226944, 'steps': 63681, 'loss/train': 1.46089506149292} 11/07/2021 06:12:28 - INFO - __main__ - Step 63683: {'lr': 0.00031462616038793853, 'samples': 12227136, 'steps': 63682, 'loss/train': 1.2605233192443848} 11/07/2021 06:12:29 - INFO - __main__ - Step 63684: {'lr': 0.00031462103400518924, 'samples': 12227328, 'steps': 63683, 'loss/train': 1.403699278831482} 11/07/2021 06:12:29 - INFO - __main__ - Step 63685: {'lr': 0.0003146159075933228, 'samples': 12227520, 'steps': 63684, 'loss/train': 1.4912350177764893} 11/07/2021 06:12:30 - INFO - __main__ - Step 63686: {'lr': 0.0003146107811523413, 'samples': 12227712, 'steps': 63685, 'loss/train': 1.6222110986709595} 11/07/2021 06:12:30 - INFO - __main__ - Step 63687: {'lr': 0.00031460565468224735, 'samples': 12227904, 'steps': 63686, 'loss/train': 0.5956270098686218} 11/07/2021 06:12:30 - INFO - __main__ - Step 63688: {'lr': 0.000314600528183043, 'samples': 12228096, 'steps': 63687, 'loss/train': 1.4927442073822021} 11/07/2021 06:12:31 - INFO - __main__ - Step 63689: {'lr': 0.00031459540165473067, 'samples': 12228288, 'steps': 63688, 'loss/train': 1.595251441001892} 11/07/2021 06:12:32 - INFO - __main__ - Step 63690: {'lr': 0.00031459027509731256, 'samples': 12228480, 'steps': 63689, 'loss/train': 1.413708209991455} 11/07/2021 06:12:32 - INFO - __main__ - Step 63691: {'lr': 0.0003145851485107911, 'samples': 12228672, 'steps': 63690, 'loss/train': 1.2932485342025757} 11/07/2021 06:12:32 - INFO - __main__ - Step 63692: {'lr': 0.00031458002189516863, 'samples': 12228864, 'steps': 63691, 'loss/train': 1.2793400287628174} 11/07/2021 06:12:33 - INFO - __main__ - Step 63693: {'lr': 0.00031457489525044737, 'samples': 12229056, 'steps': 63692, 'loss/train': 1.4053237438201904} 11/07/2021 06:12:34 - INFO - __main__ - Step 63694: {'lr': 0.00031456976857662964, 'samples': 12229248, 'steps': 63693, 'loss/train': 1.041388750076294} 11/07/2021 06:12:34 - INFO - __main__ - Step 63695: {'lr': 0.0003145646418737178, 'samples': 12229440, 'steps': 63694, 'loss/train': 1.694385051727295} 11/07/2021 06:12:34 - INFO - __main__ - Step 63696: {'lr': 0.0003145595151417142, 'samples': 12229632, 'steps': 63695, 'loss/train': 1.4375271797180176} 11/07/2021 06:12:35 - INFO - __main__ - Step 63697: {'lr': 0.00031455438838062094, 'samples': 12229824, 'steps': 63696, 'loss/train': 1.2743048667907715} 11/07/2021 06:12:35 - INFO - __main__ - Step 63698: {'lr': 0.0003145492615904405, 'samples': 12230016, 'steps': 63697, 'loss/train': 1.3462414741516113} 11/07/2021 06:12:36 - INFO - __main__ - Step 63699: {'lr': 0.0003145441347711752, 'samples': 12230208, 'steps': 63698, 'loss/train': 1.5041486024856567} 11/07/2021 06:12:36 - INFO - __main__ - Step 63700: {'lr': 0.0003145390079228273, 'samples': 12230400, 'steps': 63699, 'loss/train': 1.4222933053970337} 11/07/2021 06:12:37 - INFO - __main__ - Step 63701: {'lr': 0.0003145338810453991, 'samples': 12230592, 'steps': 63700, 'loss/train': 1.0801745653152466} 11/07/2021 06:12:37 - INFO - __main__ - Step 63702: {'lr': 0.00031452875413889294, 'samples': 12230784, 'steps': 63701, 'loss/train': 1.0051101446151733} 11/07/2021 06:12:37 - INFO - __main__ - Step 63703: {'lr': 0.0003145236272033112, 'samples': 12230976, 'steps': 63702, 'loss/train': 1.1215540170669556} 11/07/2021 06:12:39 - INFO - __main__ - Step 63704: {'lr': 0.00031451850023865596, 'samples': 12231168, 'steps': 63703, 'loss/train': 2.0430121421813965} 11/07/2021 06:12:39 - INFO - __main__ - Step 63705: {'lr': 0.0003145133732449298, 'samples': 12231360, 'steps': 63704, 'loss/train': 1.616100549697876} 11/07/2021 06:12:39 - INFO - __main__ - Step 63706: {'lr': 0.0003145082462221348, 'samples': 12231552, 'steps': 63705, 'loss/train': 1.2408947944641113} 11/07/2021 06:12:40 - INFO - __main__ - Step 63707: {'lr': 0.00031450311917027347, 'samples': 12231744, 'steps': 63706, 'loss/train': 1.3917851448059082} 11/07/2021 06:12:40 - INFO - __main__ - Step 63708: {'lr': 0.00031449799208934796, 'samples': 12231936, 'steps': 63707, 'loss/train': 1.3247909545898438} 11/07/2021 06:12:40 - INFO - __main__ - Step 63709: {'lr': 0.0003144928649793607, 'samples': 12232128, 'steps': 63708, 'loss/train': 1.3391718864440918} 11/07/2021 06:12:41 - INFO - __main__ - Step 63710: {'lr': 0.000314487737840314, 'samples': 12232320, 'steps': 63709, 'loss/train': 1.1587707996368408} 11/07/2021 06:12:42 - INFO - __main__ - Step 63711: {'lr': 0.00031448261067221, 'samples': 12232512, 'steps': 63710, 'loss/train': 1.3059202432632446} 11/07/2021 06:12:42 - INFO - __main__ - Step 63712: {'lr': 0.00031447748347505124, 'samples': 12232704, 'steps': 63711, 'loss/train': 1.7122573852539062} 11/07/2021 06:12:42 - INFO - __main__ - Step 63713: {'lr': 0.00031447235624883983, 'samples': 12232896, 'steps': 63712, 'loss/train': 1.7460346221923828} 11/07/2021 06:12:43 - INFO - __main__ - Step 63714: {'lr': 0.0003144672289935782, 'samples': 12233088, 'steps': 63713, 'loss/train': 0.8189389705657959} 11/07/2021 06:12:44 - INFO - __main__ - Step 63715: {'lr': 0.00031446210170926866, 'samples': 12233280, 'steps': 63714, 'loss/train': 1.3711313009262085} 11/07/2021 06:12:44 - INFO - __main__ - Step 63716: {'lr': 0.00031445697439591347, 'samples': 12233472, 'steps': 63715, 'loss/train': 1.3928627967834473} 11/07/2021 06:12:45 - INFO - __main__ - Step 63717: {'lr': 0.000314451847053515, 'samples': 12233664, 'steps': 63716, 'loss/train': 1.009372353553772} 11/07/2021 06:12:45 - INFO - __main__ - Step 63718: {'lr': 0.00031444671968207545, 'samples': 12233856, 'steps': 63717, 'loss/train': 1.8116745948791504} 11/07/2021 06:12:45 - INFO - __main__ - Step 63719: {'lr': 0.00031444159228159724, 'samples': 12234048, 'steps': 63718, 'loss/train': 1.567450761795044} 11/07/2021 06:12:46 - INFO - __main__ - Step 63720: {'lr': 0.0003144364648520827, 'samples': 12234240, 'steps': 63719, 'loss/train': 0.9239063262939453} 11/07/2021 06:12:47 - INFO - __main__ - Step 63721: {'lr': 0.00031443133739353395, 'samples': 12234432, 'steps': 63720, 'loss/train': 0.9643616080284119} 11/07/2021 06:12:47 - INFO - __main__ - Step 63722: {'lr': 0.0003144262099059535, 'samples': 12234624, 'steps': 63721, 'loss/train': 1.5364936590194702} 11/07/2021 06:12:47 - INFO - __main__ - Step 63723: {'lr': 0.0003144210823893436, 'samples': 12234816, 'steps': 63722, 'loss/train': 1.7541552782058716} 11/07/2021 06:12:48 - INFO - __main__ - Step 63724: {'lr': 0.0003144159548437066, 'samples': 12235008, 'steps': 63723, 'loss/train': 1.5813205242156982} 11/07/2021 06:12:49 - INFO - __main__ - Step 63725: {'lr': 0.00031441082726904476, 'samples': 12235200, 'steps': 63724, 'loss/train': 1.6406899690628052} 11/07/2021 06:12:49 - INFO - __main__ - Step 63726: {'lr': 0.0003144056996653603, 'samples': 12235392, 'steps': 63725, 'loss/train': 1.2718735933303833} 11/07/2021 06:12:49 - INFO - __main__ - Step 63727: {'lr': 0.0003144005720326557, 'samples': 12235584, 'steps': 63726, 'loss/train': 2.104417562484741} 11/07/2021 06:12:50 - INFO - __main__ - Step 63728: {'lr': 0.00031439544437093325, 'samples': 12235776, 'steps': 63727, 'loss/train': 1.3495579957962036} 11/07/2021 06:12:50 - INFO - __main__ - Step 63729: {'lr': 0.00031439031668019515, 'samples': 12235968, 'steps': 63728, 'loss/train': 1.0877494812011719} 11/07/2021 06:12:51 - INFO - __main__ - Step 63730: {'lr': 0.00031438518896044373, 'samples': 12236160, 'steps': 63729, 'loss/train': 1.3216586112976074} 11/07/2021 06:12:52 - INFO - __main__ - Step 63731: {'lr': 0.00031438006121168135, 'samples': 12236352, 'steps': 63730, 'loss/train': 1.516033411026001} 11/07/2021 06:12:52 - INFO - __main__ - Step 63732: {'lr': 0.00031437493343391027, 'samples': 12236544, 'steps': 63731, 'loss/train': 1.3553940057754517} 11/07/2021 06:12:52 - INFO - __main__ - Step 63733: {'lr': 0.00031436980562713293, 'samples': 12236736, 'steps': 63732, 'loss/train': 1.5686146020889282} 11/07/2021 06:12:53 - INFO - __main__ - Step 63734: {'lr': 0.0003143646777913515, 'samples': 12236928, 'steps': 63733, 'loss/train': 1.5746148824691772} 11/07/2021 06:12:54 - INFO - __main__ - Step 63735: {'lr': 0.00031435954992656837, 'samples': 12237120, 'steps': 63734, 'loss/train': 1.1590605974197388} 11/07/2021 06:12:54 - INFO - __main__ - Step 63736: {'lr': 0.00031435442203278576, 'samples': 12237312, 'steps': 63735, 'loss/train': 1.1354097127914429} 11/07/2021 06:12:54 - INFO - __main__ - Step 63737: {'lr': 0.00031434929411000605, 'samples': 12237504, 'steps': 63736, 'loss/train': 1.0960760116577148} 11/07/2021 06:12:55 - INFO - __main__ - Step 63738: {'lr': 0.0003143441661582316, 'samples': 12237696, 'steps': 63737, 'loss/train': 1.3779433965682983} 11/07/2021 06:12:55 - INFO - __main__ - Step 63739: {'lr': 0.0003143390381774647, 'samples': 12237888, 'steps': 63738, 'loss/train': 1.1069951057434082} 11/07/2021 06:12:55 - INFO - __main__ - Step 63740: {'lr': 0.0003143339101677075, 'samples': 12238080, 'steps': 63739, 'loss/train': 1.2740741968154907} 11/07/2021 06:12:56 - INFO - __main__ - Step 63741: {'lr': 0.0003143287821289625, 'samples': 12238272, 'steps': 63740, 'loss/train': 1.4812133312225342} 11/07/2021 06:12:57 - INFO - __main__ - Step 63742: {'lr': 0.0003143236540612319, 'samples': 12238464, 'steps': 63741, 'loss/train': 1.3161389827728271} 11/07/2021 06:12:57 - INFO - __main__ - Step 63743: {'lr': 0.0003143185259645181, 'samples': 12238656, 'steps': 63742, 'loss/train': 1.404428482055664} 11/07/2021 06:12:57 - INFO - __main__ - Step 63744: {'lr': 0.0003143133978388234, 'samples': 12238848, 'steps': 63743, 'loss/train': 0.9945654273033142} 11/07/2021 06:12:58 - INFO - __main__ - Step 63745: {'lr': 0.00031430826968414997, 'samples': 12239040, 'steps': 63744, 'loss/train': 1.5371944904327393} 11/07/2021 06:12:59 - INFO - __main__ - Step 63746: {'lr': 0.0003143031415005003, 'samples': 12239232, 'steps': 63745, 'loss/train': 1.2803683280944824} 11/07/2021 06:12:59 - INFO - __main__ - Step 63747: {'lr': 0.0003142980132878766, 'samples': 12239424, 'steps': 63746, 'loss/train': 1.2708312273025513} 11/07/2021 06:13:00 - INFO - __main__ - Step 63748: {'lr': 0.0003142928850462812, 'samples': 12239616, 'steps': 63747, 'loss/train': 1.5820856094360352} 11/07/2021 06:13:00 - INFO - __main__ - Step 63749: {'lr': 0.00031428775677571643, 'samples': 12239808, 'steps': 63748, 'loss/train': 1.0497758388519287} 11/07/2021 06:13:00 - INFO - __main__ - Step 63750: {'lr': 0.0003142826284761846, 'samples': 12240000, 'steps': 63749, 'loss/train': 1.3138014078140259} 11/07/2021 06:13:01 - INFO - __main__ - Step 63751: {'lr': 0.00031427750014768804, 'samples': 12240192, 'steps': 63750, 'loss/train': 1.4371124505996704} 11/07/2021 06:13:02 - INFO - __main__ - Step 63752: {'lr': 0.00031427237179022896, 'samples': 12240384, 'steps': 63751, 'loss/train': 1.520728349685669} 11/07/2021 06:13:02 - INFO - __main__ - Step 63753: {'lr': 0.00031426724340380977, 'samples': 12240576, 'steps': 63752, 'loss/train': 1.277466058731079} 11/07/2021 06:13:02 - INFO - __main__ - Step 63754: {'lr': 0.0003142621149884327, 'samples': 12240768, 'steps': 63753, 'loss/train': 1.7164884805679321} 11/07/2021 06:13:03 - INFO - __main__ - Step 63755: {'lr': 0.00031425698654410016, 'samples': 12240960, 'steps': 63754, 'loss/train': 1.4934465885162354} 11/07/2021 06:13:04 - INFO - __main__ - Step 63756: {'lr': 0.0003142518580708144, 'samples': 12241152, 'steps': 63755, 'loss/train': 1.5137640237808228} 11/07/2021 06:13:04 - INFO - __main__ - Step 63757: {'lr': 0.0003142467295685778, 'samples': 12241344, 'steps': 63756, 'loss/train': 0.7192648649215698} 11/07/2021 06:13:04 - INFO - __main__ - Step 63758: {'lr': 0.00031424160103739264, 'samples': 12241536, 'steps': 63757, 'loss/train': 1.1078375577926636} 11/07/2021 06:13:05 - INFO - __main__ - Step 63759: {'lr': 0.0003142364724772611, 'samples': 12241728, 'steps': 63758, 'loss/train': 1.4112651348114014} 11/07/2021 06:13:05 - INFO - __main__ - Step 63760: {'lr': 0.00031423134388818566, 'samples': 12241920, 'steps': 63759, 'loss/train': 1.9082887172698975} 11/07/2021 06:13:06 - INFO - __main__ - Step 63761: {'lr': 0.00031422621527016847, 'samples': 12242112, 'steps': 63760, 'loss/train': 0.8465831279754639} 11/07/2021 06:13:07 - INFO - __main__ - Step 63762: {'lr': 0.000314221086623212, 'samples': 12242304, 'steps': 63761, 'loss/train': 1.086564302444458} 11/07/2021 06:13:07 - INFO - __main__ - Step 63763: {'lr': 0.0003142159579473186, 'samples': 12242496, 'steps': 63762, 'loss/train': 1.5879054069519043} 11/07/2021 06:13:07 - INFO - __main__ - Step 63764: {'lr': 0.0003142108292424904, 'samples': 12242688, 'steps': 63763, 'loss/train': 1.150031328201294} 11/07/2021 06:13:08 - INFO - __main__ - Step 63765: {'lr': 0.00031420570050872976, 'samples': 12242880, 'steps': 63764, 'loss/train': 1.3304767608642578} 11/07/2021 06:13:08 - INFO - __main__ - Step 63766: {'lr': 0.00031420057174603907, 'samples': 12243072, 'steps': 63765, 'loss/train': 1.9064077138900757} 11/07/2021 06:13:09 - INFO - __main__ - Step 63767: {'lr': 0.00031419544295442056, 'samples': 12243264, 'steps': 63766, 'loss/train': 1.4721193313598633} 11/07/2021 06:13:09 - INFO - __main__ - Step 63768: {'lr': 0.00031419031413387657, 'samples': 12243456, 'steps': 63767, 'loss/train': 1.9137777090072632} 11/07/2021 06:13:10 - INFO - __main__ - Step 63769: {'lr': 0.0003141851852844094, 'samples': 12243648, 'steps': 63768, 'loss/train': 0.9342829585075378} 11/07/2021 06:13:10 - INFO - __main__ - Step 63770: {'lr': 0.00031418005640602146, 'samples': 12243840, 'steps': 63769, 'loss/train': 1.2957144975662231} 11/07/2021 06:13:10 - INFO - __main__ - Step 63771: {'lr': 0.0003141749274987149, 'samples': 12244032, 'steps': 63770, 'loss/train': 0.1390659660100937} 11/07/2021 06:13:12 - INFO - __main__ - Step 63772: {'lr': 0.00031416979856249217, 'samples': 12244224, 'steps': 63771, 'loss/train': 1.2877081632614136} 11/07/2021 06:13:12 - INFO - __main__ - Step 63773: {'lr': 0.00031416466959735545, 'samples': 12244416, 'steps': 63772, 'loss/train': 1.4031295776367188} 11/07/2021 06:13:13 - INFO - __main__ - Step 63774: {'lr': 0.0003141595406033071, 'samples': 12244608, 'steps': 63773, 'loss/train': 0.14602501690387726} 11/07/2021 06:13:13 - INFO - __main__ - Step 63775: {'lr': 0.00031415441158034953, 'samples': 12244800, 'steps': 63774, 'loss/train': 1.5537039041519165} 11/07/2021 06:13:13 - INFO - __main__ - Step 63776: {'lr': 0.00031414928252848493, 'samples': 12244992, 'steps': 63775, 'loss/train': 1.2725657224655151} 11/07/2021 06:13:14 - INFO - __main__ - Step 63777: {'lr': 0.0003141441534477157, 'samples': 12245184, 'steps': 63776, 'loss/train': 1.75302255153656} 11/07/2021 06:13:15 - INFO - __main__ - Step 63778: {'lr': 0.00031413902433804407, 'samples': 12245376, 'steps': 63777, 'loss/train': 1.6923080682754517} 11/07/2021 06:13:15 - INFO - __main__ - Step 63779: {'lr': 0.0003141338951994724, 'samples': 12245568, 'steps': 63778, 'loss/train': 1.4368083477020264} 11/07/2021 06:13:15 - INFO - __main__ - Step 63780: {'lr': 0.00031412876603200297, 'samples': 12245760, 'steps': 63779, 'loss/train': 1.1779040098190308} 11/07/2021 06:13:16 - INFO - __main__ - Step 63781: {'lr': 0.0003141236368356381, 'samples': 12245952, 'steps': 63780, 'loss/train': 1.3694730997085571} 11/07/2021 06:13:16 - INFO - __main__ - Step 63782: {'lr': 0.00031411850761038006, 'samples': 12246144, 'steps': 63781, 'loss/train': 1.1291414499282837} 11/07/2021 06:13:17 - INFO - __main__ - Step 63783: {'lr': 0.0003141133783562313, 'samples': 12246336, 'steps': 63782, 'loss/train': 1.5462322235107422} 11/07/2021 06:13:17 - INFO - __main__ - Step 63784: {'lr': 0.0003141082490731941, 'samples': 12246528, 'steps': 63783, 'loss/train': 1.4994969367980957} 11/07/2021 06:13:18 - INFO - __main__ - Step 63785: {'lr': 0.0003141031197612706, 'samples': 12246720, 'steps': 63784, 'loss/train': 1.3516660928726196} 11/07/2021 06:13:18 - INFO - __main__ - Step 63786: {'lr': 0.0003140979904204632, 'samples': 12246912, 'steps': 63785, 'loss/train': 1.3235321044921875} 11/07/2021 06:13:18 - INFO - __main__ - Step 63787: {'lr': 0.0003140928610507743, 'samples': 12247104, 'steps': 63786, 'loss/train': 1.472931981086731} 11/07/2021 06:13:20 - INFO - __main__ - Step 63788: {'lr': 0.0003140877316522061, 'samples': 12247296, 'steps': 63787, 'loss/train': 1.3921698331832886} 11/07/2021 06:13:20 - INFO - __main__ - Step 63789: {'lr': 0.000314082602224761, 'samples': 12247488, 'steps': 63788, 'loss/train': 1.2941868305206299} 11/07/2021 06:13:20 - INFO - __main__ - Step 63790: {'lr': 0.00031407747276844127, 'samples': 12247680, 'steps': 63789, 'loss/train': 0.8889561295509338} 11/07/2021 06:13:21 - INFO - __main__ - Step 63791: {'lr': 0.0003140723432832492, 'samples': 12247872, 'steps': 63790, 'loss/train': 0.8192839026451111} 11/07/2021 06:13:21 - INFO - __main__ - Step 63792: {'lr': 0.0003140672137691871, 'samples': 12248064, 'steps': 63791, 'loss/train': 1.2762593030929565} 11/07/2021 06:13:22 - INFO - __main__ - Step 63793: {'lr': 0.0003140620842262573, 'samples': 12248256, 'steps': 63792, 'loss/train': 1.623866081237793} 11/07/2021 06:13:22 - INFO - __main__ - Step 63794: {'lr': 0.00031405695465446215, 'samples': 12248448, 'steps': 63793, 'loss/train': 1.2883126735687256} 11/07/2021 06:13:23 - INFO - __main__ - Step 63795: {'lr': 0.0003140518250538039, 'samples': 12248640, 'steps': 63794, 'loss/train': 1.4411123991012573} 11/07/2021 06:13:23 - INFO - __main__ - Step 63796: {'lr': 0.0003140466954242849, 'samples': 12248832, 'steps': 63795, 'loss/train': 1.0851913690567017} 11/07/2021 06:13:23 - INFO - __main__ - Step 63797: {'lr': 0.00031404156576590747, 'samples': 12249024, 'steps': 63796, 'loss/train': 1.160710096359253} 11/07/2021 06:13:24 - INFO - __main__ - Step 63798: {'lr': 0.0003140364360786739, 'samples': 12249216, 'steps': 63797, 'loss/train': 1.4154531955718994} 11/07/2021 06:13:25 - INFO - __main__ - Step 63799: {'lr': 0.0003140313063625865, 'samples': 12249408, 'steps': 63798, 'loss/train': 1.8704577684402466} 11/07/2021 06:13:25 - INFO - __main__ - Step 63800: {'lr': 0.0003140261766176475, 'samples': 12249600, 'steps': 63799, 'loss/train': 0.8622133135795593} 11/07/2021 06:13:26 - INFO - __main__ - Step 63801: {'lr': 0.00031402104684385935, 'samples': 12249792, 'steps': 63800, 'loss/train': 1.477581262588501} 11/07/2021 06:13:26 - INFO - __main__ - Step 63802: {'lr': 0.00031401591704122427, 'samples': 12249984, 'steps': 63801, 'loss/train': 0.7765588164329529} 11/07/2021 06:13:27 - INFO - __main__ - Step 63803: {'lr': 0.00031401078720974464, 'samples': 12250176, 'steps': 63802, 'loss/train': 1.267633080482483} 11/07/2021 06:13:27 - INFO - __main__ - Step 63804: {'lr': 0.0003140056573494228, 'samples': 12250368, 'steps': 63803, 'loss/train': 1.3937236070632935} 11/07/2021 06:13:28 - INFO - __main__ - Step 63805: {'lr': 0.0003140005274602609, 'samples': 12250560, 'steps': 63804, 'loss/train': 1.1157236099243164} 11/07/2021 06:13:28 - INFO - __main__ - Step 63806: {'lr': 0.0003139953975422614, 'samples': 12250752, 'steps': 63805, 'loss/train': 2.541715621948242} 11/07/2021 06:13:28 - INFO - __main__ - Step 63807: {'lr': 0.00031399026759542655, 'samples': 12250944, 'steps': 63806, 'loss/train': 1.7783548831939697} 11/07/2021 06:13:29 - INFO - __main__ - Step 63808: {'lr': 0.00031398513761975866, 'samples': 12251136, 'steps': 63807, 'loss/train': 1.323073148727417} 11/07/2021 06:13:30 - INFO - __main__ - Step 63809: {'lr': 0.00031398000761526004, 'samples': 12251328, 'steps': 63808, 'loss/train': 0.8394582271575928} 11/07/2021 06:13:30 - INFO - __main__ - Step 63810: {'lr': 0.0003139748775819331, 'samples': 12251520, 'steps': 63809, 'loss/train': 1.1562730073928833} 11/07/2021 06:13:30 - INFO - __main__ - Step 63811: {'lr': 0.00031396974751977995, 'samples': 12251712, 'steps': 63810, 'loss/train': 1.1972589492797852} 11/07/2021 06:13:31 - INFO - __main__ - Step 63812: {'lr': 0.0003139646174288031, 'samples': 12251904, 'steps': 63811, 'loss/train': 1.1447075605392456} 11/07/2021 06:13:31 - INFO - __main__ - Step 63813: {'lr': 0.0003139594873090047, 'samples': 12252096, 'steps': 63812, 'loss/train': 1.2245370149612427} 11/07/2021 06:13:32 - INFO - __main__ - Step 63814: {'lr': 0.0003139543571603872, 'samples': 12252288, 'steps': 63813, 'loss/train': 1.4437758922576904} 11/07/2021 06:13:33 - INFO - __main__ - Step 63815: {'lr': 0.0003139492269829529, 'samples': 12252480, 'steps': 63814, 'loss/train': 0.7305995225906372} 11/07/2021 06:13:33 - INFO - __main__ - Step 63816: {'lr': 0.000313944096776704, 'samples': 12252672, 'steps': 63815, 'loss/train': 1.4332391023635864} 11/07/2021 06:13:33 - INFO - __main__ - Step 63817: {'lr': 0.0003139389665416429, 'samples': 12252864, 'steps': 63816, 'loss/train': 1.2958354949951172} 11/07/2021 06:13:34 - INFO - __main__ - Step 63818: {'lr': 0.0003139338362777719, 'samples': 12253056, 'steps': 63817, 'loss/train': 1.0651663541793823} 11/07/2021 06:13:35 - INFO - __main__ - Step 63819: {'lr': 0.00031392870598509324, 'samples': 12253248, 'steps': 63818, 'loss/train': 1.6734437942504883} 11/07/2021 06:13:35 - INFO - __main__ - Step 63820: {'lr': 0.00031392357566360936, 'samples': 12253440, 'steps': 63819, 'loss/train': 1.410559058189392} 11/07/2021 06:13:35 - INFO - __main__ - Step 63821: {'lr': 0.0003139184453133224, 'samples': 12253632, 'steps': 63820, 'loss/train': 0.9960004091262817} 11/07/2021 06:13:36 - INFO - __main__ - Step 63822: {'lr': 0.00031391331493423486, 'samples': 12253824, 'steps': 63821, 'loss/train': 1.5641285181045532} 11/07/2021 06:13:36 - INFO - __main__ - Step 63823: {'lr': 0.00031390818452634896, 'samples': 12254016, 'steps': 63822, 'loss/train': 1.8550728559494019} 11/07/2021 06:13:37 - INFO - __main__ - Step 63824: {'lr': 0.0003139030540896671, 'samples': 12254208, 'steps': 63823, 'loss/train': 1.555353045463562} 11/07/2021 06:13:37 - INFO - __main__ - Step 63825: {'lr': 0.0003138979236241914, 'samples': 12254400, 'steps': 63824, 'loss/train': 1.5102028846740723} 11/07/2021 06:13:38 - INFO - __main__ - Step 63826: {'lr': 0.0003138927931299243, 'samples': 12254592, 'steps': 63825, 'loss/train': 1.4938808679580688} 11/07/2021 06:13:38 - INFO - __main__ - Step 63827: {'lr': 0.0003138876626068681, 'samples': 12254784, 'steps': 63826, 'loss/train': 0.7583309412002563} 11/07/2021 06:13:38 - INFO - __main__ - Step 63828: {'lr': 0.000313882532055025, 'samples': 12254976, 'steps': 63827, 'loss/train': 1.6018754243850708} 11/07/2021 06:13:39 - INFO - __main__ - Step 63829: {'lr': 0.00031387740147439757, 'samples': 12255168, 'steps': 63828, 'loss/train': 2.122490406036377} 11/07/2021 06:13:40 - INFO - __main__ - Step 63830: {'lr': 0.0003138722708649879, 'samples': 12255360, 'steps': 63829, 'loss/train': 1.1518162488937378} 11/07/2021 06:13:40 - INFO - __main__ - Step 63831: {'lr': 0.00031386714022679844, 'samples': 12255552, 'steps': 63830, 'loss/train': 1.054744005203247} 11/07/2021 06:13:40 - INFO - __main__ - Step 63832: {'lr': 0.0003138620095598314, 'samples': 12255744, 'steps': 63831, 'loss/train': 2.2133686542510986} 11/07/2021 06:13:41 - INFO - __main__ - Step 63833: {'lr': 0.0003138568788640891, 'samples': 12255936, 'steps': 63832, 'loss/train': 1.2415919303894043} 11/07/2021 06:13:42 - INFO - __main__ - Step 63834: {'lr': 0.00031385174813957387, 'samples': 12256128, 'steps': 63833, 'loss/train': 1.0585044622421265} 11/07/2021 06:13:42 - INFO - __main__ - Step 63835: {'lr': 0.00031384661738628804, 'samples': 12256320, 'steps': 63834, 'loss/train': 1.4224621057510376} 11/07/2021 06:13:42 - INFO - __main__ - Step 63836: {'lr': 0.0003138414866042339, 'samples': 12256512, 'steps': 63835, 'loss/train': 1.3205275535583496} 11/07/2021 06:13:43 - INFO - __main__ - Step 63837: {'lr': 0.0003138363557934138, 'samples': 12256704, 'steps': 63836, 'loss/train': 1.5867118835449219} 11/07/2021 06:13:43 - INFO - __main__ - Step 63838: {'lr': 0.00031383122495382996, 'samples': 12256896, 'steps': 63837, 'loss/train': 1.5123367309570312} 11/07/2021 06:13:44 - INFO - __main__ - Step 63839: {'lr': 0.00031382609408548486, 'samples': 12257088, 'steps': 63838, 'loss/train': 1.1572967767715454} 11/07/2021 06:13:45 - INFO - __main__ - Step 63840: {'lr': 0.0003138209631883806, 'samples': 12257280, 'steps': 63839, 'loss/train': 1.3332960605621338} 11/07/2021 06:13:45 - INFO - __main__ - Step 63841: {'lr': 0.00031381583226251965, 'samples': 12257472, 'steps': 63840, 'loss/train': 1.491525650024414} 11/07/2021 06:13:45 - INFO - __main__ - Step 63842: {'lr': 0.00031381070130790425, 'samples': 12257664, 'steps': 63841, 'loss/train': 1.329023838043213} 11/07/2021 06:13:46 - INFO - __main__ - Step 63843: {'lr': 0.0003138055703245368, 'samples': 12257856, 'steps': 63842, 'loss/train': 1.399357557296753} 11/07/2021 06:13:46 - INFO - __main__ - Step 63844: {'lr': 0.0003138004393124195, 'samples': 12258048, 'steps': 63843, 'loss/train': 0.8555728793144226} 11/07/2021 06:13:47 - INFO - __main__ - Step 63845: {'lr': 0.00031379530827155467, 'samples': 12258240, 'steps': 63844, 'loss/train': 1.40887451171875} 11/07/2021 06:13:47 - INFO - __main__ - Step 63846: {'lr': 0.0003137901772019447, 'samples': 12258432, 'steps': 63845, 'loss/train': 1.3632392883300781} 11/07/2021 06:13:48 - INFO - __main__ - Step 63847: {'lr': 0.00031378504610359183, 'samples': 12258624, 'steps': 63846, 'loss/train': 1.3869996070861816} 11/07/2021 06:13:48 - INFO - __main__ - Step 63848: {'lr': 0.0003137799149764984, 'samples': 12258816, 'steps': 63847, 'loss/train': 1.4269963502883911} 11/07/2021 06:13:48 - INFO - __main__ - Step 63849: {'lr': 0.00031377478382066675, 'samples': 12259008, 'steps': 63848, 'loss/train': 1.5223387479782104} 11/07/2021 06:13:49 - INFO - __main__ - Step 63850: {'lr': 0.0003137696526360991, 'samples': 12259200, 'steps': 63849, 'loss/train': 1.6533526182174683} 11/07/2021 06:13:50 - INFO - __main__ - Step 63851: {'lr': 0.00031376452142279796, 'samples': 12259392, 'steps': 63850, 'loss/train': 0.519260823726654} 11/07/2021 06:13:50 - INFO - __main__ - Step 63852: {'lr': 0.0003137593901807655, 'samples': 12259584, 'steps': 63851, 'loss/train': 2.0309336185455322} 11/07/2021 06:13:50 - INFO - __main__ - Step 63853: {'lr': 0.0003137542589100039, 'samples': 12259776, 'steps': 63852, 'loss/train': 1.4636965990066528} 11/07/2021 06:13:51 - INFO - __main__ - Step 63854: {'lr': 0.00031374912761051574, 'samples': 12259968, 'steps': 63853, 'loss/train': 1.0297911167144775} 11/07/2021 06:13:52 - INFO - __main__ - Step 63855: {'lr': 0.00031374399628230314, 'samples': 12260160, 'steps': 63854, 'loss/train': 1.4428566694259644} 11/07/2021 06:13:52 - INFO - __main__ - Step 63856: {'lr': 0.0003137388649253685, 'samples': 12260352, 'steps': 63855, 'loss/train': 0.7090169787406921} 11/07/2021 06:13:53 - INFO - __main__ - Step 63857: {'lr': 0.0003137337335397141, 'samples': 12260544, 'steps': 63856, 'loss/train': 1.4936705827713013} 11/07/2021 06:13:53 - INFO - __main__ - Step 63858: {'lr': 0.0003137286021253423, 'samples': 12260736, 'steps': 63857, 'loss/train': 0.7548542618751526} 11/07/2021 06:13:53 - INFO - __main__ - Step 63859: {'lr': 0.0003137234706822554, 'samples': 12260928, 'steps': 63858, 'loss/train': 1.1887898445129395} 11/07/2021 06:13:54 - INFO - __main__ - Step 63860: {'lr': 0.0003137183392104556, 'samples': 12261120, 'steps': 63859, 'loss/train': 1.5677504539489746} 11/07/2021 06:13:55 - INFO - __main__ - Step 63861: {'lr': 0.00031371320770994535, 'samples': 12261312, 'steps': 63860, 'loss/train': 1.7803324460983276} 11/07/2021 06:13:55 - INFO - __main__ - Step 63862: {'lr': 0.00031370807618072693, 'samples': 12261504, 'steps': 63861, 'loss/train': 1.4310604333877563} 11/07/2021 06:13:55 - INFO - __main__ - Step 63863: {'lr': 0.00031370294462280257, 'samples': 12261696, 'steps': 63862, 'loss/train': 1.5725879669189453} 11/07/2021 06:13:56 - INFO - __main__ - Step 63864: {'lr': 0.0003136978130361747, 'samples': 12261888, 'steps': 63863, 'loss/train': 1.366296648979187} 11/07/2021 06:13:57 - INFO - __main__ - Step 63865: {'lr': 0.00031369268142084555, 'samples': 12262080, 'steps': 63864, 'loss/train': 1.7536742687225342} 11/07/2021 06:13:57 - INFO - __main__ - Step 63866: {'lr': 0.00031368754977681744, 'samples': 12262272, 'steps': 63865, 'loss/train': 1.199036955833435} 11/07/2021 06:13:58 - INFO - __main__ - Step 63867: {'lr': 0.00031368241810409277, 'samples': 12262464, 'steps': 63866, 'loss/train': 0.6258852481842041} 11/07/2021 06:13:58 - INFO - __main__ - Step 63868: {'lr': 0.00031367728640267377, 'samples': 12262656, 'steps': 63867, 'loss/train': 1.5628023147583008} 11/07/2021 06:13:58 - INFO - __main__ - Step 63869: {'lr': 0.0003136721546725627, 'samples': 12262848, 'steps': 63868, 'loss/train': 0.7620623111724854} 11/07/2021 06:13:59 - INFO - __main__ - Step 63870: {'lr': 0.00031366702291376204, 'samples': 12263040, 'steps': 63869, 'loss/train': 1.1925634145736694} 11/07/2021 06:14:00 - INFO - __main__ - Step 63871: {'lr': 0.0003136618911262739, 'samples': 12263232, 'steps': 63870, 'loss/train': 1.3214553594589233} 11/07/2021 06:14:00 - INFO - __main__ - Step 63872: {'lr': 0.00031365675931010074, 'samples': 12263424, 'steps': 63871, 'loss/train': 1.603002667427063} 11/07/2021 06:14:00 - INFO - __main__ - Step 63873: {'lr': 0.0003136516274652449, 'samples': 12263616, 'steps': 63872, 'loss/train': 0.8334980010986328} 11/07/2021 06:14:01 - INFO - __main__ - Step 63874: {'lr': 0.00031364649559170857, 'samples': 12263808, 'steps': 63873, 'loss/train': 1.228448748588562} 11/07/2021 06:14:02 - INFO - __main__ - Step 63875: {'lr': 0.000313641363689494, 'samples': 12264000, 'steps': 63874, 'loss/train': 1.251383662223816} 11/07/2021 06:14:02 - INFO - __main__ - Step 63876: {'lr': 0.00031363623175860374, 'samples': 12264192, 'steps': 63875, 'loss/train': 2.076368570327759} 11/07/2021 06:14:02 - INFO - __main__ - Step 63877: {'lr': 0.00031363109979903994, 'samples': 12264384, 'steps': 63876, 'loss/train': 1.399755835533142} 11/07/2021 06:14:03 - INFO - __main__ - Step 63878: {'lr': 0.00031362596781080496, 'samples': 12264576, 'steps': 63877, 'loss/train': 1.3208478689193726} 11/07/2021 06:14:03 - INFO - __main__ - Step 63879: {'lr': 0.0003136208357939011, 'samples': 12264768, 'steps': 63878, 'loss/train': 1.9533166885375977} 11/07/2021 06:14:04 - INFO - __main__ - Step 63880: {'lr': 0.00031361570374833066, 'samples': 12264960, 'steps': 63879, 'loss/train': 1.5322751998901367} 11/07/2021 06:14:05 - INFO - __main__ - Step 63881: {'lr': 0.00031361057167409595, 'samples': 12265152, 'steps': 63880, 'loss/train': 1.0557042360305786} 11/07/2021 06:14:05 - INFO - __main__ - Step 63882: {'lr': 0.0003136054395711993, 'samples': 12265344, 'steps': 63881, 'loss/train': 1.4018189907073975} 11/07/2021 06:14:05 - INFO - __main__ - Step 63883: {'lr': 0.000313600307439643, 'samples': 12265536, 'steps': 63882, 'loss/train': 1.5879380702972412} 11/07/2021 06:14:06 - INFO - __main__ - Step 63884: {'lr': 0.0003135951752794295, 'samples': 12265728, 'steps': 63883, 'loss/train': 1.3048466444015503} 11/07/2021 06:14:07 - INFO - __main__ - Step 63885: {'lr': 0.0003135900430905609, 'samples': 12265920, 'steps': 63884, 'loss/train': 0.6768122911453247} 11/07/2021 06:14:07 - INFO - __main__ - Step 63886: {'lr': 0.0003135849108730396, 'samples': 12266112, 'steps': 63885, 'loss/train': 1.3993808031082153} 11/07/2021 06:14:07 - INFO - __main__ - Step 63887: {'lr': 0.0003135797786268679, 'samples': 12266304, 'steps': 63886, 'loss/train': 1.376622200012207} 11/07/2021 06:14:08 - INFO - __main__ - Step 63888: {'lr': 0.00031357464635204817, 'samples': 12266496, 'steps': 63887, 'loss/train': 1.269747257232666} 11/07/2021 06:14:08 - INFO - __main__ - Step 63889: {'lr': 0.0003135695140485827, 'samples': 12266688, 'steps': 63888, 'loss/train': 1.1362382173538208} 11/07/2021 06:14:09 - INFO - __main__ - Step 63890: {'lr': 0.00031356438171647376, 'samples': 12266880, 'steps': 63889, 'loss/train': 1.19915771484375} 11/07/2021 06:14:09 - INFO - __main__ - Step 63891: {'lr': 0.00031355924935572377, 'samples': 12267072, 'steps': 63890, 'loss/train': 1.1249141693115234} 11/07/2021 06:14:10 - INFO - __main__ - Step 63892: {'lr': 0.0003135541169663349, 'samples': 12267264, 'steps': 63891, 'loss/train': 1.2392725944519043} 11/07/2021 06:14:10 - INFO - __main__ - Step 63893: {'lr': 0.0003135489845483095, 'samples': 12267456, 'steps': 63892, 'loss/train': 1.63540518283844} 11/07/2021 06:14:10 - INFO - __main__ - Step 63894: {'lr': 0.00031354385210164993, 'samples': 12267648, 'steps': 63893, 'loss/train': 1.5515879392623901} 11/07/2021 06:14:11 - INFO - __main__ - Step 63895: {'lr': 0.0003135387196263585, 'samples': 12267840, 'steps': 63894, 'loss/train': 1.4445161819458008} 11/07/2021 06:14:12 - INFO - __main__ - Step 63896: {'lr': 0.0003135335871224375, 'samples': 12268032, 'steps': 63895, 'loss/train': 1.5368703603744507} 11/07/2021 06:14:12 - INFO - __main__ - Step 63897: {'lr': 0.0003135284545898892, 'samples': 12268224, 'steps': 63896, 'loss/train': 1.4389433860778809} 11/07/2021 06:14:13 - INFO - __main__ - Step 63898: {'lr': 0.00031352332202871604, 'samples': 12268416, 'steps': 63897, 'loss/train': 1.0929152965545654} 11/07/2021 06:14:13 - INFO - __main__ - Step 63899: {'lr': 0.00031351818943892016, 'samples': 12268608, 'steps': 63898, 'loss/train': 1.491763949394226} 11/07/2021 06:14:13 - INFO - __main__ - Step 63900: {'lr': 0.000313513056820504, 'samples': 12268800, 'steps': 63899, 'loss/train': 1.810036301612854} 11/07/2021 06:14:14 - INFO - __main__ - Step 63901: {'lr': 0.0003135079241734698, 'samples': 12268992, 'steps': 63900, 'loss/train': 1.4243932962417603} 11/07/2021 06:14:15 - INFO - __main__ - Step 63902: {'lr': 0.00031350279149782004, 'samples': 12269184, 'steps': 63901, 'loss/train': 1.2681301832199097} 11/07/2021 06:14:15 - INFO - __main__ - Step 63903: {'lr': 0.00031349765879355675, 'samples': 12269376, 'steps': 63902, 'loss/train': 1.2460395097732544} 11/07/2021 06:14:15 - INFO - __main__ - Step 63904: {'lr': 0.00031349252606068244, 'samples': 12269568, 'steps': 63903, 'loss/train': 2.0019915103912354} 11/07/2021 06:14:16 - INFO - __main__ - Step 63905: {'lr': 0.0003134873932991995, 'samples': 12269760, 'steps': 63904, 'loss/train': 1.4287101030349731} 11/07/2021 06:14:17 - INFO - __main__ - Step 63906: {'lr': 0.00031348226050911, 'samples': 12269952, 'steps': 63905, 'loss/train': 0.6402405500411987} 11/07/2021 06:14:17 - INFO - __main__ - Step 63907: {'lr': 0.00031347712769041634, 'samples': 12270144, 'steps': 63906, 'loss/train': 0.7244023084640503} 11/07/2021 06:14:17 - INFO - __main__ - Step 63908: {'lr': 0.0003134719948431209, 'samples': 12270336, 'steps': 63907, 'loss/train': 1.5232720375061035} 11/07/2021 06:14:18 - INFO - __main__ - Step 63909: {'lr': 0.00031346686196722604, 'samples': 12270528, 'steps': 63908, 'loss/train': 1.023444652557373} 11/07/2021 06:14:18 - INFO - __main__ - Step 63910: {'lr': 0.0003134617290627339, 'samples': 12270720, 'steps': 63909, 'loss/train': 1.0642585754394531} 11/07/2021 06:14:20 - INFO - __main__ - Step 63911: {'lr': 0.00031345659612964694, 'samples': 12270912, 'steps': 63910, 'loss/train': 1.3665122985839844} 11/07/2021 06:14:20 - INFO - __main__ - Step 63912: {'lr': 0.0003134514631679674, 'samples': 12271104, 'steps': 63911, 'loss/train': 1.281935453414917} 11/07/2021 06:14:20 - INFO - __main__ - Step 63913: {'lr': 0.00031344633017769757, 'samples': 12271296, 'steps': 63912, 'loss/train': 1.717185378074646} 11/07/2021 06:14:21 - INFO - __main__ - Step 63914: {'lr': 0.00031344119715883984, 'samples': 12271488, 'steps': 63913, 'loss/train': 1.6928479671478271} 11/07/2021 06:14:21 - INFO - __main__ - Step 63915: {'lr': 0.0003134360641113965, 'samples': 12271680, 'steps': 63914, 'loss/train': 1.761587142944336} 11/07/2021 06:14:22 - INFO - __main__ - Step 63916: {'lr': 0.0003134309310353698, 'samples': 12271872, 'steps': 63915, 'loss/train': 1.534303903579712} 11/07/2021 06:14:22 - INFO - __main__ - Step 63917: {'lr': 0.0003134257979307621, 'samples': 12272064, 'steps': 63916, 'loss/train': 1.3704549074172974} 11/07/2021 06:14:23 - INFO - __main__ - Step 63918: {'lr': 0.0003134206647975758, 'samples': 12272256, 'steps': 63917, 'loss/train': 2.072396993637085} 11/07/2021 06:14:23 - INFO - __main__ - Step 63919: {'lr': 0.00031341553163581306, 'samples': 12272448, 'steps': 63918, 'loss/train': 1.5404181480407715} 11/07/2021 06:14:24 - INFO - __main__ - Step 63920: {'lr': 0.00031341039844547623, 'samples': 12272640, 'steps': 63919, 'loss/train': 1.2841166257858276} 11/07/2021 06:14:24 - INFO - __main__ - Step 63921: {'lr': 0.00031340526522656765, 'samples': 12272832, 'steps': 63920, 'loss/train': 1.3127849102020264} 11/07/2021 06:14:24 - INFO - __main__ - Step 63922: {'lr': 0.0003134001319790897, 'samples': 12273024, 'steps': 63921, 'loss/train': 1.074154019355774} 11/07/2021 06:14:25 - INFO - __main__ - Step 63923: {'lr': 0.0003133949987030446, 'samples': 12273216, 'steps': 63922, 'loss/train': 0.8222047686576843} 11/07/2021 06:14:26 - INFO - __main__ - Step 63924: {'lr': 0.0003133898653984347, 'samples': 12273408, 'steps': 63923, 'loss/train': 0.7967286109924316} 11/07/2021 06:14:26 - INFO - __main__ - Step 63925: {'lr': 0.0003133847320652623, 'samples': 12273600, 'steps': 63924, 'loss/train': 1.3612934350967407} 11/07/2021 06:14:26 - INFO - __main__ - Step 63926: {'lr': 0.0003133795987035297, 'samples': 12273792, 'steps': 63925, 'loss/train': 1.220097303390503} 11/07/2021 06:14:27 - INFO - __main__ - Step 63927: {'lr': 0.0003133744653132393, 'samples': 12273984, 'steps': 63926, 'loss/train': 0.9675995111465454} 11/07/2021 06:14:28 - INFO - __main__ - Step 63928: {'lr': 0.00031336933189439324, 'samples': 12274176, 'steps': 63927, 'loss/train': 0.8973199725151062} 11/07/2021 06:14:28 - INFO - __main__ - Step 63929: {'lr': 0.00031336419844699403, 'samples': 12274368, 'steps': 63928, 'loss/train': 0.9846889972686768} 11/07/2021 06:14:28 - INFO - __main__ - Step 63930: {'lr': 0.0003133590649710438, 'samples': 12274560, 'steps': 63929, 'loss/train': 1.3060462474822998} 11/07/2021 06:14:29 - INFO - __main__ - Step 63931: {'lr': 0.00031335393146654506, 'samples': 12274752, 'steps': 63930, 'loss/train': 1.3243286609649658} 11/07/2021 06:14:29 - INFO - __main__ - Step 63932: {'lr': 0.00031334879793349995, 'samples': 12274944, 'steps': 63931, 'loss/train': 1.1085684299468994} 11/07/2021 06:14:31 - INFO - __main__ - Step 63933: {'lr': 0.00031334366437191084, 'samples': 12275136, 'steps': 63932, 'loss/train': 1.758329153060913} 11/07/2021 06:14:32 - INFO - __main__ - Step 63934: {'lr': 0.0003133385307817801, 'samples': 12275328, 'steps': 63933, 'loss/train': 1.8444701433181763} 11/07/2021 06:14:32 - INFO - __main__ - Step 63935: {'lr': 0.0003133333971631099, 'samples': 12275520, 'steps': 63934, 'loss/train': 1.2291074991226196} 11/07/2021 06:14:32 - INFO - __main__ - Step 63936: {'lr': 0.00031332826351590276, 'samples': 12275712, 'steps': 63935, 'loss/train': 1.8874682188034058} 11/07/2021 06:14:33 - INFO - __main__ - Step 63937: {'lr': 0.0003133231298401608, 'samples': 12275904, 'steps': 63936, 'loss/train': 0.8093202114105225} 11/07/2021 06:14:33 - INFO - __main__ - Step 63938: {'lr': 0.00031331799613588653, 'samples': 12276096, 'steps': 63937, 'loss/train': 4.136759281158447} 11/07/2021 06:14:34 - INFO - __main__ - Step 63939: {'lr': 0.00031331286240308205, 'samples': 12276288, 'steps': 63938, 'loss/train': 1.5248805284500122} 11/07/2021 06:14:34 - INFO - __main__ - Step 63940: {'lr': 0.0003133077286417498, 'samples': 12276480, 'steps': 63939, 'loss/train': 1.2784357070922852} 11/07/2021 06:14:35 - INFO - __main__ - Step 63941: {'lr': 0.00031330259485189203, 'samples': 12276672, 'steps': 63940, 'loss/train': 1.9252866506576538} 11/07/2021 06:14:35 - INFO - __main__ - Step 63942: {'lr': 0.0003132974610335111, 'samples': 12276864, 'steps': 63941, 'loss/train': 1.0025928020477295} 11/07/2021 06:14:36 - INFO - __main__ - Step 63943: {'lr': 0.0003132923271866093, 'samples': 12277056, 'steps': 63942, 'loss/train': 1.2830535173416138} 11/07/2021 06:14:36 - INFO - __main__ - Step 63944: {'lr': 0.000313287193311189, 'samples': 12277248, 'steps': 63943, 'loss/train': 1.124131441116333} 11/07/2021 06:14:37 - INFO - __main__ - Step 63945: {'lr': 0.0003132820594072525, 'samples': 12277440, 'steps': 63944, 'loss/train': 1.9107012748718262} 11/07/2021 06:14:37 - INFO - __main__ - Step 63946: {'lr': 0.000313276925474802, 'samples': 12277632, 'steps': 63945, 'loss/train': 1.3358672857284546} 11/07/2021 06:14:37 - INFO - __main__ - Step 63947: {'lr': 0.0003132717915138399, 'samples': 12277824, 'steps': 63946, 'loss/train': 1.1135520935058594} 11/07/2021 06:14:38 - INFO - __main__ - Step 63948: {'lr': 0.00031326665752436854, 'samples': 12278016, 'steps': 63947, 'loss/train': 2.06260347366333} 11/07/2021 06:14:39 - INFO - __main__ - Step 63949: {'lr': 0.00031326152350639016, 'samples': 12278208, 'steps': 63948, 'loss/train': 1.4723964929580688} 11/07/2021 06:14:39 - INFO - __main__ - Step 63950: {'lr': 0.0003132563894599071, 'samples': 12278400, 'steps': 63949, 'loss/train': 1.512219786643982} 11/07/2021 06:14:39 - INFO - __main__ - Step 63951: {'lr': 0.0003132512553849218, 'samples': 12278592, 'steps': 63950, 'loss/train': 1.543177604675293} 11/07/2021 06:14:40 - INFO - __main__ - Step 63952: {'lr': 0.0003132461212814364, 'samples': 12278784, 'steps': 63951, 'loss/train': 1.3445006608963013} 11/07/2021 06:14:41 - INFO - __main__ - Step 63953: {'lr': 0.0003132409871494533, 'samples': 12278976, 'steps': 63952, 'loss/train': 1.444547176361084} 11/07/2021 06:14:41 - INFO - __main__ - Step 63954: {'lr': 0.00031323585298897473, 'samples': 12279168, 'steps': 63953, 'loss/train': 1.5216925144195557} 11/07/2021 06:14:42 - INFO - __main__ - Step 63955: {'lr': 0.00031323071880000303, 'samples': 12279360, 'steps': 63954, 'loss/train': 1.3089152574539185} 11/07/2021 06:14:42 - INFO - __main__ - Step 63956: {'lr': 0.00031322558458254056, 'samples': 12279552, 'steps': 63955, 'loss/train': 1.6487388610839844} 11/07/2021 06:14:42 - INFO - __main__ - Step 63957: {'lr': 0.0003132204503365897, 'samples': 12279744, 'steps': 63956, 'loss/train': 1.1012299060821533} 11/07/2021 06:14:43 - INFO - __main__ - Step 63958: {'lr': 0.0003132153160621526, 'samples': 12279936, 'steps': 63957, 'loss/train': 1.17348051071167} 11/07/2021 06:14:44 - INFO - __main__ - Step 63959: {'lr': 0.0003132101817592317, 'samples': 12280128, 'steps': 63958, 'loss/train': 0.9191347360610962} 11/07/2021 06:14:44 - INFO - __main__ - Step 63960: {'lr': 0.0003132050474278293, 'samples': 12280320, 'steps': 63959, 'loss/train': 1.7066293954849243} 11/07/2021 06:14:44 - INFO - __main__ - Step 63961: {'lr': 0.0003131999130679476, 'samples': 12280512, 'steps': 63960, 'loss/train': 1.5879324674606323} 11/07/2021 06:14:45 - INFO - __main__ - Step 63962: {'lr': 0.000313194778679589, 'samples': 12280704, 'steps': 63961, 'loss/train': 0.07873307913541794} 11/07/2021 06:14:46 - INFO - __main__ - Step 63963: {'lr': 0.00031318964426275584, 'samples': 12280896, 'steps': 63962, 'loss/train': 1.059015154838562} 11/07/2021 06:14:46 - INFO - __main__ - Step 63964: {'lr': 0.0003131845098174504, 'samples': 12281088, 'steps': 63963, 'loss/train': 1.44731605052948} 11/07/2021 06:14:46 - INFO - __main__ - Step 63965: {'lr': 0.000313179375343675, 'samples': 12281280, 'steps': 63964, 'loss/train': 1.4651949405670166} 11/07/2021 06:14:47 - INFO - __main__ - Step 63966: {'lr': 0.00031317424084143197, 'samples': 12281472, 'steps': 63965, 'loss/train': 1.2210609912872314} 11/07/2021 06:14:47 - INFO - __main__ - Step 63967: {'lr': 0.00031316910631072354, 'samples': 12281664, 'steps': 63966, 'loss/train': 1.5908663272857666} 11/07/2021 06:14:48 - INFO - __main__ - Step 63968: {'lr': 0.00031316397175155215, 'samples': 12281856, 'steps': 63967, 'loss/train': 2.0071260929107666} 11/07/2021 06:14:48 - INFO - __main__ - Step 63969: {'lr': 0.00031315883716392, 'samples': 12282048, 'steps': 63968, 'loss/train': 1.5013117790222168} 11/07/2021 06:14:49 - INFO - __main__ - Step 63970: {'lr': 0.0003131537025478294, 'samples': 12282240, 'steps': 63969, 'loss/train': 1.2297369241714478} 11/07/2021 06:14:49 - INFO - __main__ - Step 63971: {'lr': 0.00031314856790328285, 'samples': 12282432, 'steps': 63970, 'loss/train': 1.5040333271026611} 11/07/2021 06:14:49 - INFO - __main__ - Step 63972: {'lr': 0.0003131434332302825, 'samples': 12282624, 'steps': 63971, 'loss/train': 1.7694451808929443} 11/07/2021 06:14:51 - INFO - __main__ - Step 63973: {'lr': 0.00031313829852883064, 'samples': 12282816, 'steps': 63972, 'loss/train': 1.3919657468795776} 11/07/2021 06:14:51 - INFO - __main__ - Step 63974: {'lr': 0.00031313316379892966, 'samples': 12283008, 'steps': 63973, 'loss/train': 1.3827494382858276} 11/07/2021 06:14:51 - INFO - __main__ - Step 63975: {'lr': 0.0003131280290405818, 'samples': 12283200, 'steps': 63974, 'loss/train': 1.3160192966461182} 11/07/2021 06:14:52 - INFO - __main__ - Step 63976: {'lr': 0.0003131228942537895, 'samples': 12283392, 'steps': 63975, 'loss/train': 1.4409501552581787} 11/07/2021 06:14:52 - INFO - __main__ - Step 63977: {'lr': 0.0003131177594385549, 'samples': 12283584, 'steps': 63976, 'loss/train': 0.940735936164856} 11/07/2021 06:14:53 - INFO - __main__ - Step 63978: {'lr': 0.00031311262459488053, 'samples': 12283776, 'steps': 63977, 'loss/train': 1.6007927656173706} 11/07/2021 06:14:53 - INFO - __main__ - Step 63979: {'lr': 0.0003131074897227686, 'samples': 12283968, 'steps': 63978, 'loss/train': 1.7437801361083984} 11/07/2021 06:14:54 - INFO - __main__ - Step 63980: {'lr': 0.00031310235482222124, 'samples': 12284160, 'steps': 63979, 'loss/train': 1.4285296201705933} 11/07/2021 06:14:54 - INFO - __main__ - Step 63981: {'lr': 0.00031309721989324107, 'samples': 12284352, 'steps': 63980, 'loss/train': 1.321335792541504} 11/07/2021 06:14:54 - INFO - __main__ - Step 63982: {'lr': 0.00031309208493583024, 'samples': 12284544, 'steps': 63981, 'loss/train': 1.2528060674667358} 11/07/2021 06:14:55 - INFO - __main__ - Step 63983: {'lr': 0.0003130869499499911, 'samples': 12284736, 'steps': 63982, 'loss/train': 1.690329670906067} 11/07/2021 06:14:56 - INFO - __main__ - Step 63984: {'lr': 0.0003130818149357259, 'samples': 12284928, 'steps': 63983, 'loss/train': 1.5027294158935547} 11/07/2021 06:14:56 - INFO - __main__ - Step 63985: {'lr': 0.00031307667989303713, 'samples': 12285120, 'steps': 63984, 'loss/train': 1.8524481058120728} 11/07/2021 06:14:57 - INFO - __main__ - Step 63986: {'lr': 0.00031307154482192683, 'samples': 12285312, 'steps': 63985, 'loss/train': 1.3047363758087158} 11/07/2021 06:14:57 - INFO - __main__ - Step 63987: {'lr': 0.00031306640972239753, 'samples': 12285504, 'steps': 63986, 'loss/train': 1.3555500507354736} 11/07/2021 06:14:58 - INFO - __main__ - Step 63988: {'lr': 0.0003130612745944515, 'samples': 12285696, 'steps': 63987, 'loss/train': 1.4343135356903076} 11/07/2021 06:14:58 - INFO - __main__ - Step 63989: {'lr': 0.000313056139438091, 'samples': 12285888, 'steps': 63988, 'loss/train': 2.2581465244293213} 11/07/2021 06:14:59 - INFO - __main__ - Step 63990: {'lr': 0.0003130510042533184, 'samples': 12286080, 'steps': 63989, 'loss/train': 1.5876951217651367} 11/07/2021 06:14:59 - INFO - __main__ - Step 63991: {'lr': 0.000313045869040136, 'samples': 12286272, 'steps': 63990, 'loss/train': 1.3944653272628784} 11/07/2021 06:14:59 - INFO - __main__ - Step 63992: {'lr': 0.00031304073379854607, 'samples': 12286464, 'steps': 63991, 'loss/train': 1.5414968729019165} 11/07/2021 06:15:00 - INFO - __main__ - Step 63993: {'lr': 0.00031303559852855097, 'samples': 12286656, 'steps': 63992, 'loss/train': 1.0431463718414307} 11/07/2021 06:15:01 - INFO - __main__ - Step 63994: {'lr': 0.00031303046323015297, 'samples': 12286848, 'steps': 63993, 'loss/train': 1.5748188495635986} 11/07/2021 06:15:01 - INFO - __main__ - Step 63995: {'lr': 0.00031302532790335446, 'samples': 12287040, 'steps': 63994, 'loss/train': 1.5859589576721191} 11/07/2021 06:15:01 - INFO - __main__ - Step 63996: {'lr': 0.0003130201925481577, 'samples': 12287232, 'steps': 63995, 'loss/train': 1.519286036491394} 11/07/2021 06:15:02 - INFO - __main__ - Step 63997: {'lr': 0.00031301505716456506, 'samples': 12287424, 'steps': 63996, 'loss/train': 1.0861650705337524} 11/07/2021 06:15:03 - INFO - __main__ - Step 63998: {'lr': 0.0003130099217525788, 'samples': 12287616, 'steps': 63997, 'loss/train': 0.972798228263855} 11/07/2021 06:15:03 - INFO - __main__ - Step 63999: {'lr': 0.00031300478631220114, 'samples': 12287808, 'steps': 63998, 'loss/train': 5.783071517944336} 11/07/2021 06:15:04 - INFO - __main__ - Step 64000: {'lr': 0.00031299965084343454, 'samples': 12288000, 'steps': 63999, 'loss/train': 1.7441192865371704} 11/07/2021 06:15:04 - INFO - __main__ - Step 64001: {'lr': 0.0003129945153462813, 'samples': 12288192, 'steps': 64000, 'loss/train': 1.4463145732879639} 11/07/2021 06:15:04 - INFO - __main__ - Step 64002: {'lr': 0.0003129893798207437, 'samples': 12288384, 'steps': 64001, 'loss/train': 1.4306249618530273} 11/07/2021 06:15:05 - INFO - __main__ - Step 64003: {'lr': 0.0003129842442668241, 'samples': 12288576, 'steps': 64002, 'loss/train': 1.2653406858444214} 11/07/2021 06:15:06 - INFO - __main__ - Step 64004: {'lr': 0.00031297910868452466, 'samples': 12288768, 'steps': 64003, 'loss/train': 1.4888859987258911} 11/07/2021 06:15:06 - INFO - __main__ - Step 64005: {'lr': 0.00031297397307384787, 'samples': 12288960, 'steps': 64004, 'loss/train': 2.019622802734375} 11/07/2021 06:15:06 - INFO - __main__ - Step 64006: {'lr': 0.000312968837434796, 'samples': 12289152, 'steps': 64005, 'loss/train': 1.4567288160324097} 11/07/2021 06:15:07 - INFO - __main__ - Step 64007: {'lr': 0.0003129637017673713, 'samples': 12289344, 'steps': 64006, 'loss/train': 1.592963695526123} 11/07/2021 06:15:07 - INFO - __main__ - Step 64008: {'lr': 0.0003129585660715762, 'samples': 12289536, 'steps': 64007, 'loss/train': 1.4150066375732422} 11/07/2021 06:15:08 - INFO - __main__ - Step 64009: {'lr': 0.00031295343034741285, 'samples': 12289728, 'steps': 64008, 'loss/train': 0.9153542518615723} 11/07/2021 06:15:08 - INFO - __main__ - Step 64010: {'lr': 0.0003129482945948837, 'samples': 12289920, 'steps': 64009, 'loss/train': 0.9372593760490417} 11/07/2021 06:15:09 - INFO - __main__ - Step 64011: {'lr': 0.00031294315881399097, 'samples': 12290112, 'steps': 64010, 'loss/train': 1.0255075693130493} 11/07/2021 06:15:09 - INFO - __main__ - Step 64012: {'lr': 0.0003129380230047371, 'samples': 12290304, 'steps': 64011, 'loss/train': 1.494992971420288} 11/07/2021 06:15:10 - INFO - __main__ - Step 64013: {'lr': 0.0003129328871671243, 'samples': 12290496, 'steps': 64012, 'loss/train': 1.4129475355148315} 11/07/2021 06:15:11 - INFO - __main__ - Step 64014: {'lr': 0.0003129277513011549, 'samples': 12290688, 'steps': 64013, 'loss/train': 1.4286003112792969} 11/07/2021 06:15:11 - INFO - __main__ - Step 64015: {'lr': 0.00031292261540683127, 'samples': 12290880, 'steps': 64014, 'loss/train': 1.6541752815246582} 11/07/2021 06:15:11 - INFO - __main__ - Step 64016: {'lr': 0.0003129174794841556, 'samples': 12291072, 'steps': 64015, 'loss/train': 1.031126618385315} 11/07/2021 06:15:12 - INFO - __main__ - Step 64017: {'lr': 0.00031291234353313037, 'samples': 12291264, 'steps': 64016, 'loss/train': 1.4695231914520264} 11/07/2021 06:15:12 - INFO - __main__ - Step 64018: {'lr': 0.00031290720755375773, 'samples': 12291456, 'steps': 64017, 'loss/train': 1.59310781955719} 11/07/2021 06:15:13 - INFO - __main__ - Step 64019: {'lr': 0.0003129020715460402, 'samples': 12291648, 'steps': 64018, 'loss/train': 1.0715389251708984} 11/07/2021 06:15:13 - INFO - __main__ - Step 64020: {'lr': 0.0003128969355099798, 'samples': 12291840, 'steps': 64019, 'loss/train': 1.549756646156311} 11/07/2021 06:15:14 - INFO - __main__ - Step 64021: {'lr': 0.0003128917994455791, 'samples': 12292032, 'steps': 64020, 'loss/train': 1.244370698928833} 11/07/2021 06:15:14 - INFO - __main__ - Step 64022: {'lr': 0.00031288666335284034, 'samples': 12292224, 'steps': 64021, 'loss/train': 1.3517982959747314} 11/07/2021 06:15:14 - INFO - __main__ - Step 64023: {'lr': 0.0003128815272317658, 'samples': 12292416, 'steps': 64022, 'loss/train': 1.6809731721878052} 11/07/2021 06:15:15 - INFO - __main__ - Step 64024: {'lr': 0.00031287639108235776, 'samples': 12292608, 'steps': 64023, 'loss/train': 1.4908372163772583} 11/07/2021 06:15:16 - INFO - __main__ - Step 64025: {'lr': 0.0003128712549046187, 'samples': 12292800, 'steps': 64024, 'loss/train': 1.4967737197875977} 11/07/2021 06:15:16 - INFO - __main__ - Step 64026: {'lr': 0.00031286611869855074, 'samples': 12292992, 'steps': 64025, 'loss/train': 1.8189202547073364} 11/07/2021 06:15:16 - INFO - __main__ - Step 64027: {'lr': 0.0003128609824641563, 'samples': 12293184, 'steps': 64026, 'loss/train': 1.1862003803253174} 11/07/2021 06:15:17 - INFO - __main__ - Step 64028: {'lr': 0.00031285584620143766, 'samples': 12293376, 'steps': 64027, 'loss/train': 1.347568392753601} 11/07/2021 06:15:17 - INFO - __main__ - Step 64029: {'lr': 0.0003128507099103971, 'samples': 12293568, 'steps': 64028, 'loss/train': 1.6064363718032837} 11/07/2021 06:15:18 - INFO - __main__ - Step 64030: {'lr': 0.00031284557359103704, 'samples': 12293760, 'steps': 64029, 'loss/train': 1.3139156103134155} 11/07/2021 06:15:19 - INFO - __main__ - Step 64031: {'lr': 0.00031284043724335973, 'samples': 12293952, 'steps': 64030, 'loss/train': 1.4028544425964355} 11/07/2021 06:15:19 - INFO - __main__ - Step 64032: {'lr': 0.00031283530086736756, 'samples': 12294144, 'steps': 64031, 'loss/train': 0.8042995929718018} 11/07/2021 06:15:19 - INFO - __main__ - Step 64033: {'lr': 0.0003128301644630627, 'samples': 12294336, 'steps': 64032, 'loss/train': 1.5992908477783203} 11/07/2021 06:15:20 - INFO - __main__ - Step 64034: {'lr': 0.0003128250280304475, 'samples': 12294528, 'steps': 64033, 'loss/train': 1.4924005270004272} 11/07/2021 06:15:21 - INFO - __main__ - Step 64035: {'lr': 0.00031281989156952436, 'samples': 12294720, 'steps': 64034, 'loss/train': 1.3388534784317017} 11/07/2021 06:15:21 - INFO - __main__ - Step 64036: {'lr': 0.0003128147550802955, 'samples': 12294912, 'steps': 64035, 'loss/train': 0.6217591166496277} 11/07/2021 06:15:21 - INFO - __main__ - Step 64037: {'lr': 0.0003128096185627633, 'samples': 12295104, 'steps': 64036, 'loss/train': 1.4665635824203491} 11/07/2021 06:15:22 - INFO - __main__ - Step 64038: {'lr': 0.0003128044820169301, 'samples': 12295296, 'steps': 64037, 'loss/train': 1.5091201066970825} 11/07/2021 06:15:22 - INFO - __main__ - Step 64039: {'lr': 0.00031279934544279817, 'samples': 12295488, 'steps': 64038, 'loss/train': 1.2834922075271606} 11/07/2021 06:15:23 - INFO - __main__ - Step 64040: {'lr': 0.0003127942088403698, 'samples': 12295680, 'steps': 64039, 'loss/train': 0.8581381440162659} 11/07/2021 06:15:23 - INFO - __main__ - Step 64041: {'lr': 0.0003127890722096473, 'samples': 12295872, 'steps': 64040, 'loss/train': 0.9223619103431702} 11/07/2021 06:15:24 - INFO - __main__ - Step 64042: {'lr': 0.000312783935550633, 'samples': 12296064, 'steps': 64041, 'loss/train': 1.4232999086380005} 11/07/2021 06:15:24 - INFO - __main__ - Step 64043: {'lr': 0.00031277879886332927, 'samples': 12296256, 'steps': 64042, 'loss/train': 1.6098583936691284} 11/07/2021 06:15:24 - INFO - __main__ - Step 64044: {'lr': 0.0003127736621477384, 'samples': 12296448, 'steps': 64043, 'loss/train': 0.8783707022666931} 11/07/2021 06:15:26 - INFO - __main__ - Step 64045: {'lr': 0.0003127685254038626, 'samples': 12296640, 'steps': 64044, 'loss/train': 1.5434707403182983} 11/07/2021 06:15:26 - INFO - __main__ - Step 64046: {'lr': 0.0003127633886317044, 'samples': 12296832, 'steps': 64045, 'loss/train': 1.682008147239685} 11/07/2021 06:15:26 - INFO - __main__ - Step 64047: {'lr': 0.0003127582518312659, 'samples': 12297024, 'steps': 64046, 'loss/train': 1.607465147972107} 11/07/2021 06:15:27 - INFO - __main__ - Step 64048: {'lr': 0.00031275311500254956, 'samples': 12297216, 'steps': 64047, 'loss/train': 0.852258563041687} 11/07/2021 06:15:27 - INFO - __main__ - Step 64049: {'lr': 0.00031274797814555754, 'samples': 12297408, 'steps': 64048, 'loss/train': 1.079520344734192} 11/07/2021 06:15:28 - INFO - __main__ - Step 64050: {'lr': 0.0003127428412602923, 'samples': 12297600, 'steps': 64049, 'loss/train': 1.1874958276748657} 11/07/2021 06:15:28 - INFO - __main__ - Step 64051: {'lr': 0.0003127377043467561, 'samples': 12297792, 'steps': 64050, 'loss/train': 1.0664887428283691} 11/07/2021 06:15:29 - INFO - __main__ - Step 64052: {'lr': 0.00031273256740495134, 'samples': 12297984, 'steps': 64051, 'loss/train': 1.6149077415466309} 11/07/2021 06:15:29 - INFO - __main__ - Step 64053: {'lr': 0.0003127274304348802, 'samples': 12298176, 'steps': 64052, 'loss/train': 1.4503227472305298} 11/07/2021 06:15:29 - INFO - __main__ - Step 64054: {'lr': 0.00031272229343654495, 'samples': 12298368, 'steps': 64053, 'loss/train': 1.0073800086975098} 11/07/2021 06:15:30 - INFO - __main__ - Step 64055: {'lr': 0.0003127171564099481, 'samples': 12298560, 'steps': 64054, 'loss/train': 1.4124358892440796} 11/07/2021 06:15:31 - INFO - __main__ - Step 64056: {'lr': 0.0003127120193550918, 'samples': 12298752, 'steps': 64055, 'loss/train': 1.3513495922088623} 11/07/2021 06:15:31 - INFO - __main__ - Step 64057: {'lr': 0.0003127068822719785, 'samples': 12298944, 'steps': 64056, 'loss/train': 1.5585713386535645} 11/07/2021 06:15:31 - INFO - __main__ - Step 64058: {'lr': 0.0003127017451606104, 'samples': 12299136, 'steps': 64057, 'loss/train': 1.2221976518630981} 11/07/2021 06:15:32 - INFO - __main__ - Step 64059: {'lr': 0.00031269660802098995, 'samples': 12299328, 'steps': 64058, 'loss/train': 1.4776496887207031} 11/07/2021 06:15:33 - INFO - __main__ - Step 64060: {'lr': 0.0003126914708531193, 'samples': 12299520, 'steps': 64059, 'loss/train': 1.3234013319015503} 11/07/2021 06:15:33 - INFO - __main__ - Step 64061: {'lr': 0.00031268633365700085, 'samples': 12299712, 'steps': 64060, 'loss/train': 1.2869466543197632} 11/07/2021 06:15:33 - INFO - __main__ - Step 64062: {'lr': 0.00031268119643263685, 'samples': 12299904, 'steps': 64061, 'loss/train': 1.6940871477127075} 11/07/2021 06:15:34 - INFO - __main__ - Step 64063: {'lr': 0.0003126760591800297, 'samples': 12300096, 'steps': 64062, 'loss/train': 1.2405636310577393} 11/07/2021 06:15:34 - INFO - __main__ - Step 64064: {'lr': 0.0003126709218991818, 'samples': 12300288, 'steps': 64063, 'loss/train': 1.3161622285842896} 11/07/2021 06:15:35 - INFO - __main__ - Step 64065: {'lr': 0.0003126657845900952, 'samples': 12300480, 'steps': 64064, 'loss/train': 1.2942014932632446} 11/07/2021 06:15:35 - INFO - __main__ - Step 64066: {'lr': 0.0003126606472527725, 'samples': 12300672, 'steps': 64065, 'loss/train': 1.3680108785629272} 11/07/2021 06:15:36 - INFO - __main__ - Step 64067: {'lr': 0.0003126555098872158, 'samples': 12300864, 'steps': 64066, 'loss/train': 1.2452858686447144} 11/07/2021 06:15:36 - INFO - __main__ - Step 64068: {'lr': 0.00031265037249342747, 'samples': 12301056, 'steps': 64067, 'loss/train': 1.2632673978805542} 11/07/2021 06:15:36 - INFO - __main__ - Step 64069: {'lr': 0.00031264523507140983, 'samples': 12301248, 'steps': 64068, 'loss/train': 1.4042387008666992} 11/07/2021 06:15:37 - INFO - __main__ - Step 64070: {'lr': 0.0003126400976211653, 'samples': 12301440, 'steps': 64069, 'loss/train': 1.3600008487701416} 11/07/2021 06:15:38 - INFO - __main__ - Step 64071: {'lr': 0.00031263496014269604, 'samples': 12301632, 'steps': 64070, 'loss/train': 1.5129388570785522} 11/07/2021 06:15:38 - INFO - __main__ - Step 64072: {'lr': 0.0003126298226360045, 'samples': 12301824, 'steps': 64071, 'loss/train': 1.3591758012771606} 11/07/2021 06:15:39 - INFO - __main__ - Step 64073: {'lr': 0.0003126246851010929, 'samples': 12302016, 'steps': 64072, 'loss/train': 1.5613411664962769} 11/07/2021 06:15:39 - INFO - __main__ - Step 64074: {'lr': 0.0003126195475379636, 'samples': 12302208, 'steps': 64073, 'loss/train': 1.4969468116760254} 11/07/2021 06:15:39 - INFO - __main__ - Step 64075: {'lr': 0.0003126144099466188, 'samples': 12302400, 'steps': 64074, 'loss/train': 1.3806339502334595} 11/07/2021 06:15:40 - INFO - __main__ - Step 64076: {'lr': 0.00031260927232706106, 'samples': 12302592, 'steps': 64075, 'loss/train': 1.3953781127929688} 11/07/2021 06:15:41 - INFO - __main__ - Step 64077: {'lr': 0.0003126041346792924, 'samples': 12302784, 'steps': 64076, 'loss/train': 1.8344213962554932} 11/07/2021 06:15:41 - INFO - __main__ - Step 64078: {'lr': 0.0003125989970033154, 'samples': 12302976, 'steps': 64077, 'loss/train': 1.276596188545227} 11/07/2021 06:15:41 - INFO - __main__ - Step 64079: {'lr': 0.00031259385929913224, 'samples': 12303168, 'steps': 64078, 'loss/train': 1.5150048732757568} 11/07/2021 06:15:42 - INFO - __main__ - Step 64080: {'lr': 0.00031258872156674525, 'samples': 12303360, 'steps': 64079, 'loss/train': 1.1794086694717407} 11/07/2021 06:15:43 - INFO - __main__ - Step 64081: {'lr': 0.0003125835838061567, 'samples': 12303552, 'steps': 64080, 'loss/train': 1.8497092723846436} 11/07/2021 06:15:43 - INFO - __main__ - Step 64082: {'lr': 0.00031257844601736897, 'samples': 12303744, 'steps': 64081, 'loss/train': 0.3923610746860504} 11/07/2021 06:15:44 - INFO - __main__ - Step 64083: {'lr': 0.00031257330820038434, 'samples': 12303936, 'steps': 64082, 'loss/train': 1.3765385150909424} 11/07/2021 06:15:44 - INFO - __main__ - Step 64084: {'lr': 0.0003125681703552052, 'samples': 12304128, 'steps': 64083, 'loss/train': 1.754288911819458} 11/07/2021 06:15:45 - INFO - __main__ - Step 64085: {'lr': 0.0003125630324818337, 'samples': 12304320, 'steps': 64084, 'loss/train': 1.488644003868103} 11/07/2021 06:15:46 - INFO - __main__ - Step 64086: {'lr': 0.0003125578945802724, 'samples': 12304512, 'steps': 64085, 'loss/train': 0.7568609714508057} 11/07/2021 06:15:46 - INFO - __main__ - Step 64087: {'lr': 0.0003125527566505234, 'samples': 12304704, 'steps': 64086, 'loss/train': 1.6661568880081177} 11/07/2021 06:15:46 - INFO - __main__ - Step 64088: {'lr': 0.0003125476186925891, 'samples': 12304896, 'steps': 64087, 'loss/train': 1.553022861480713} 11/07/2021 06:15:47 - INFO - __main__ - Step 64089: {'lr': 0.0003125424807064718, 'samples': 12305088, 'steps': 64088, 'loss/train': 0.7519490718841553} 11/07/2021 06:15:47 - INFO - __main__ - Step 64090: {'lr': 0.0003125373426921739, 'samples': 12305280, 'steps': 64089, 'loss/train': 1.0248383283615112} 11/07/2021 06:15:48 - INFO - __main__ - Step 64091: {'lr': 0.00031253220464969755, 'samples': 12305472, 'steps': 64090, 'loss/train': 1.7381852865219116} 11/07/2021 06:15:48 - INFO - __main__ - Step 64092: {'lr': 0.00031252706657904517, 'samples': 12305664, 'steps': 64091, 'loss/train': 1.2648321390151978} 11/07/2021 06:15:49 - INFO - __main__ - Step 64093: {'lr': 0.00031252192848021915, 'samples': 12305856, 'steps': 64092, 'loss/train': 1.2112482786178589} 11/07/2021 06:15:49 - INFO - __main__ - Step 64094: {'lr': 0.0003125167903532216, 'samples': 12306048, 'steps': 64093, 'loss/train': 0.7190549373626709} 11/07/2021 06:15:49 - INFO - __main__ - Step 64095: {'lr': 0.000312511652198055, 'samples': 12306240, 'steps': 64094, 'loss/train': 1.2794370651245117} 11/07/2021 06:15:50 - INFO - __main__ - Step 64096: {'lr': 0.00031250651401472157, 'samples': 12306432, 'steps': 64095, 'loss/train': 1.3927499055862427} 11/07/2021 06:15:51 - INFO - __main__ - Step 64097: {'lr': 0.0003125013758032237, 'samples': 12306624, 'steps': 64096, 'loss/train': 1.6875765323638916} 11/07/2021 06:15:51 - INFO - __main__ - Step 64098: {'lr': 0.00031249623756356365, 'samples': 12306816, 'steps': 64097, 'loss/train': 1.27635657787323} 11/07/2021 06:15:51 - INFO - __main__ - Step 64099: {'lr': 0.0003124910992957438, 'samples': 12307008, 'steps': 64098, 'loss/train': 1.5924144983291626} 11/07/2021 06:15:52 - INFO - __main__ - Step 64100: {'lr': 0.00031248596099976646, 'samples': 12307200, 'steps': 64099, 'loss/train': 0.8727297782897949} 11/07/2021 06:15:53 - INFO - __main__ - Step 64101: {'lr': 0.00031248082267563385, 'samples': 12307392, 'steps': 64100, 'loss/train': 1.4662224054336548} 11/07/2021 06:15:53 - INFO - __main__ - Step 64102: {'lr': 0.0003124756843233483, 'samples': 12307584, 'steps': 64101, 'loss/train': 1.228266716003418} 11/07/2021 06:15:53 - INFO - __main__ - Step 64103: {'lr': 0.00031247054594291226, 'samples': 12307776, 'steps': 64102, 'loss/train': 1.10092031955719} 11/07/2021 06:15:54 - INFO - __main__ - Step 64104: {'lr': 0.00031246540753432795, 'samples': 12307968, 'steps': 64103, 'loss/train': 1.1292716264724731} 11/07/2021 06:15:54 - INFO - __main__ - Step 64105: {'lr': 0.00031246026909759764, 'samples': 12308160, 'steps': 64104, 'loss/train': 1.2395440340042114} 11/07/2021 06:15:55 - INFO - __main__ - Step 64106: {'lr': 0.0003124551306327237, 'samples': 12308352, 'steps': 64105, 'loss/train': 1.4687221050262451} 11/07/2021 06:15:55 - INFO - __main__ - Step 64107: {'lr': 0.00031244999213970846, 'samples': 12308544, 'steps': 64106, 'loss/train': 0.7414936423301697} 11/07/2021 06:15:56 - INFO - __main__ - Step 64108: {'lr': 0.00031244485361855425, 'samples': 12308736, 'steps': 64107, 'loss/train': 1.2571576833724976} 11/07/2021 06:15:56 - INFO - __main__ - Step 64109: {'lr': 0.0003124397150692633, 'samples': 12308928, 'steps': 64108, 'loss/train': 1.5074971914291382} 11/07/2021 06:15:56 - INFO - __main__ - Step 64110: {'lr': 0.00031243457649183804, 'samples': 12309120, 'steps': 64109, 'loss/train': 1.1240174770355225} 11/07/2021 06:15:58 - INFO - __main__ - Step 64111: {'lr': 0.00031242943788628065, 'samples': 12309312, 'steps': 64110, 'loss/train': 1.3171608448028564} 11/07/2021 06:15:58 - INFO - __main__ - Step 64112: {'lr': 0.0003124242992525935, 'samples': 12309504, 'steps': 64111, 'loss/train': 1.5546694993972778} 11/07/2021 06:15:58 - INFO - __main__ - Step 64113: {'lr': 0.0003124191605907791, 'samples': 12309696, 'steps': 64112, 'loss/train': 1.0158270597457886} 11/07/2021 06:15:59 - INFO - __main__ - Step 64114: {'lr': 0.0003124140219008394, 'samples': 12309888, 'steps': 64113, 'loss/train': 1.1451634168624878} 11/07/2021 06:15:59 - INFO - __main__ - Step 64115: {'lr': 0.000312408883182777, 'samples': 12310080, 'steps': 64114, 'loss/train': 2.131730079650879} 11/07/2021 06:16:00 - INFO - __main__ - Step 64116: {'lr': 0.000312403744436594, 'samples': 12310272, 'steps': 64115, 'loss/train': 1.1038317680358887} 11/07/2021 06:16:00 - INFO - __main__ - Step 64117: {'lr': 0.000312398605662293, 'samples': 12310464, 'steps': 64116, 'loss/train': 1.5048938989639282} 11/07/2021 06:16:01 - INFO - __main__ - Step 64118: {'lr': 0.000312393466859876, 'samples': 12310656, 'steps': 64117, 'loss/train': 1.5400490760803223} 11/07/2021 06:16:01 - INFO - __main__ - Step 64119: {'lr': 0.0003123883280293456, 'samples': 12310848, 'steps': 64118, 'loss/train': 1.3192428350448608} 11/07/2021 06:16:01 - INFO - __main__ - Step 64120: {'lr': 0.00031238318917070396, 'samples': 12311040, 'steps': 64119, 'loss/train': 0.9653067588806152} 11/07/2021 06:16:03 - INFO - __main__ - Step 64121: {'lr': 0.00031237805028395336, 'samples': 12311232, 'steps': 64120, 'loss/train': 1.3508750200271606} 11/07/2021 06:16:03 - INFO - __main__ - Step 64122: {'lr': 0.0003123729113690962, 'samples': 12311424, 'steps': 64121, 'loss/train': 1.4877525568008423} 11/07/2021 06:16:03 - INFO - __main__ - Step 64123: {'lr': 0.00031236777242613475, 'samples': 12311616, 'steps': 64122, 'loss/train': 1.4431371688842773} 11/07/2021 06:16:04 - INFO - __main__ - Step 64124: {'lr': 0.00031236263345507133, 'samples': 12311808, 'steps': 64123, 'loss/train': 0.9599971175193787} 11/07/2021 06:16:04 - INFO - __main__ - Step 64125: {'lr': 0.0003123574944559083, 'samples': 12312000, 'steps': 64124, 'loss/train': 0.15993663668632507} 11/07/2021 06:16:04 - INFO - __main__ - Step 64126: {'lr': 0.000312352355428648, 'samples': 12312192, 'steps': 64125, 'loss/train': 0.868851363658905} 11/07/2021 06:16:05 - INFO - __main__ - Step 64127: {'lr': 0.0003123472163732926, 'samples': 12312384, 'steps': 64126, 'loss/train': 1.2845275402069092} 11/07/2021 06:16:06 - INFO - __main__ - Step 64128: {'lr': 0.0003123420772898445, 'samples': 12312576, 'steps': 64127, 'loss/train': 1.281809687614441} 11/07/2021 06:16:06 - INFO - __main__ - Step 64129: {'lr': 0.0003123369381783061, 'samples': 12312768, 'steps': 64128, 'loss/train': 1.2398498058319092} 11/07/2021 06:16:06 - INFO - __main__ - Step 64130: {'lr': 0.00031233179903867957, 'samples': 12312960, 'steps': 64129, 'loss/train': 1.7114229202270508} 11/07/2021 06:16:07 - INFO - __main__ - Step 64131: {'lr': 0.0003123266598709674, 'samples': 12313152, 'steps': 64130, 'loss/train': 1.5799322128295898} 11/07/2021 06:16:08 - INFO - __main__ - Step 64132: {'lr': 0.0003123215206751717, 'samples': 12313344, 'steps': 64131, 'loss/train': 0.7202985286712646} 11/07/2021 06:16:08 - INFO - __main__ - Step 64133: {'lr': 0.0003123163814512949, 'samples': 12313536, 'steps': 64132, 'loss/train': 1.3449627161026} 11/07/2021 06:16:08 - INFO - __main__ - Step 64134: {'lr': 0.0003123112421993393, 'samples': 12313728, 'steps': 64133, 'loss/train': 1.2769033908843994} 11/07/2021 06:16:09 - INFO - __main__ - Step 64135: {'lr': 0.00031230610291930723, 'samples': 12313920, 'steps': 64134, 'loss/train': 1.0160717964172363} 11/07/2021 06:16:09 - INFO - __main__ - Step 64136: {'lr': 0.000312300963611201, 'samples': 12314112, 'steps': 64135, 'loss/train': 2.016728162765503} 11/07/2021 06:16:10 - INFO - __main__ - Step 64137: {'lr': 0.0003122958242750229, 'samples': 12314304, 'steps': 64136, 'loss/train': 1.1337741613388062} 11/07/2021 06:16:11 - INFO - __main__ - Step 64138: {'lr': 0.0003122906849107753, 'samples': 12314496, 'steps': 64137, 'loss/train': 1.5534226894378662} 11/07/2021 06:16:11 - INFO - __main__ - Step 64139: {'lr': 0.00031228554551846046, 'samples': 12314688, 'steps': 64138, 'loss/train': 2.2538840770721436} 11/07/2021 06:16:11 - INFO - __main__ - Step 64140: {'lr': 0.00031228040609808063, 'samples': 12314880, 'steps': 64139, 'loss/train': 1.412512183189392} 11/07/2021 06:16:12 - INFO - __main__ - Step 64141: {'lr': 0.0003122752666496383, 'samples': 12315072, 'steps': 64140, 'loss/train': 1.0895251035690308} 11/07/2021 06:16:13 - INFO - __main__ - Step 64142: {'lr': 0.0003122701271731357, 'samples': 12315264, 'steps': 64141, 'loss/train': 1.5661265850067139} 11/07/2021 06:16:13 - INFO - __main__ - Step 64143: {'lr': 0.0003122649876685751, 'samples': 12315456, 'steps': 64142, 'loss/train': 1.6051491498947144} 11/07/2021 06:16:13 - INFO - __main__ - Step 64144: {'lr': 0.0003122598481359589, 'samples': 12315648, 'steps': 64143, 'loss/train': 0.9112840294837952} 11/07/2021 06:16:14 - INFO - __main__ - Step 64145: {'lr': 0.0003122547085752893, 'samples': 12315840, 'steps': 64144, 'loss/train': 1.069115400314331} 11/07/2021 06:16:14 - INFO - __main__ - Step 64146: {'lr': 0.00031224956898656876, 'samples': 12316032, 'steps': 64145, 'loss/train': 0.9950532913208008} 11/07/2021 06:16:15 - INFO - __main__ - Step 64147: {'lr': 0.00031224442936979947, 'samples': 12316224, 'steps': 64146, 'loss/train': 1.4277267456054688} 11/07/2021 06:16:15 - INFO - __main__ - Step 64148: {'lr': 0.0003122392897249839, 'samples': 12316416, 'steps': 64147, 'loss/train': 1.5683037042617798} 11/07/2021 06:16:16 - INFO - __main__ - Step 64149: {'lr': 0.0003122341500521242, 'samples': 12316608, 'steps': 64148, 'loss/train': 1.3600257635116577} 11/07/2021 06:16:16 - INFO - __main__ - Step 64150: {'lr': 0.0003122290103512227, 'samples': 12316800, 'steps': 64149, 'loss/train': 0.7640442252159119} 11/07/2021 06:16:16 - INFO - __main__ - Step 64151: {'lr': 0.0003122238706222818, 'samples': 12316992, 'steps': 64150, 'loss/train': 1.8079698085784912} 11/07/2021 06:16:18 - INFO - __main__ - Step 64152: {'lr': 0.0003122187308653038, 'samples': 12317184, 'steps': 64151, 'loss/train': 1.390527606010437} 11/07/2021 06:16:18 - INFO - __main__ - Step 64153: {'lr': 0.00031221359108029104, 'samples': 12317376, 'steps': 64152, 'loss/train': 1.4483915567398071} 11/07/2021 06:16:18 - INFO - __main__ - Step 64154: {'lr': 0.00031220845126724576, 'samples': 12317568, 'steps': 64153, 'loss/train': 1.4849127531051636} 11/07/2021 06:16:19 - INFO - __main__ - Step 64155: {'lr': 0.0003122033114261703, 'samples': 12317760, 'steps': 64154, 'loss/train': 1.6765470504760742} 11/07/2021 06:16:19 - INFO - __main__ - Step 64156: {'lr': 0.00031219817155706697, 'samples': 12317952, 'steps': 64155, 'loss/train': 1.799055814743042} 11/07/2021 06:16:20 - INFO - __main__ - Step 64157: {'lr': 0.0003121930316599381, 'samples': 12318144, 'steps': 64156, 'loss/train': 1.4235163927078247} 11/07/2021 06:16:20 - INFO - __main__ - Step 64158: {'lr': 0.00031218789173478607, 'samples': 12318336, 'steps': 64157, 'loss/train': 1.148787260055542} 11/07/2021 06:16:21 - INFO - __main__ - Step 64159: {'lr': 0.0003121827517816131, 'samples': 12318528, 'steps': 64158, 'loss/train': 1.3954594135284424} 11/07/2021 06:16:21 - INFO - __main__ - Step 64160: {'lr': 0.0003121776118004216, 'samples': 12318720, 'steps': 64159, 'loss/train': 0.7205312848091125} 11/07/2021 06:16:21 - INFO - __main__ - Step 64161: {'lr': 0.0003121724717912138, 'samples': 12318912, 'steps': 64160, 'loss/train': 1.6731314659118652} 11/07/2021 06:16:22 - INFO - __main__ - Step 64162: {'lr': 0.000312167331753992, 'samples': 12319104, 'steps': 64161, 'loss/train': 0.9881372451782227} 11/07/2021 06:16:23 - INFO - __main__ - Step 64163: {'lr': 0.00031216219168875856, 'samples': 12319296, 'steps': 64162, 'loss/train': 1.5324808359146118} 11/07/2021 06:16:23 - INFO - __main__ - Step 64164: {'lr': 0.00031215705159551576, 'samples': 12319488, 'steps': 64163, 'loss/train': 1.3317862749099731} 11/07/2021 06:16:23 - INFO - __main__ - Step 64165: {'lr': 0.000312151911474266, 'samples': 12319680, 'steps': 64164, 'loss/train': 1.4047881364822388} 11/07/2021 06:16:24 - INFO - __main__ - Step 64166: {'lr': 0.0003121467713250116, 'samples': 12319872, 'steps': 64165, 'loss/train': 1.3795809745788574} 11/07/2021 06:16:25 - INFO - __main__ - Step 64167: {'lr': 0.00031214163114775477, 'samples': 12320064, 'steps': 64166, 'loss/train': 1.4744377136230469} 11/07/2021 06:16:25 - INFO - __main__ - Step 64168: {'lr': 0.00031213649094249783, 'samples': 12320256, 'steps': 64167, 'loss/train': 2.2181577682495117} 11/07/2021 06:16:25 - INFO - __main__ - Step 64169: {'lr': 0.0003121313507092433, 'samples': 12320448, 'steps': 64168, 'loss/train': 1.4910545349121094} 11/07/2021 06:16:26 - INFO - __main__ - Step 64170: {'lr': 0.00031212621044799315, 'samples': 12320640, 'steps': 64169, 'loss/train': 1.543310284614563} 11/07/2021 06:16:26 - INFO - __main__ - Step 64171: {'lr': 0.00031212107015875, 'samples': 12320832, 'steps': 64170, 'loss/train': 1.1147645711898804} 11/07/2021 06:16:27 - INFO - __main__ - Step 64172: {'lr': 0.00031211592984151603, 'samples': 12321024, 'steps': 64171, 'loss/train': 1.3329957723617554} 11/07/2021 06:16:27 - INFO - __main__ - Step 64173: {'lr': 0.00031211078949629364, 'samples': 12321216, 'steps': 64172, 'loss/train': 1.283996820449829} 11/07/2021 06:16:28 - INFO - __main__ - Step 64174: {'lr': 0.00031210564912308506, 'samples': 12321408, 'steps': 64173, 'loss/train': 1.2919654846191406} 11/07/2021 06:16:28 - INFO - __main__ - Step 64175: {'lr': 0.00031210050872189257, 'samples': 12321600, 'steps': 64174, 'loss/train': 1.1967499256134033} 11/07/2021 06:16:29 - INFO - __main__ - Step 64176: {'lr': 0.00031209536829271856, 'samples': 12321792, 'steps': 64175, 'loss/train': 1.156166911125183} 11/07/2021 06:16:30 - INFO - __main__ - Step 64177: {'lr': 0.00031209022783556536, 'samples': 12321984, 'steps': 64176, 'loss/train': 1.1853877305984497} 11/07/2021 06:16:30 - INFO - __main__ - Step 64178: {'lr': 0.0003120850873504353, 'samples': 12322176, 'steps': 64177, 'loss/train': 1.430586576461792} 11/07/2021 06:16:30 - INFO - __main__ - Step 64179: {'lr': 0.00031207994683733054, 'samples': 12322368, 'steps': 64178, 'loss/train': 1.3461705446243286} 11/07/2021 06:16:31 - INFO - __main__ - Step 64180: {'lr': 0.0003120748062962537, 'samples': 12322560, 'steps': 64179, 'loss/train': 1.2734184265136719} 11/07/2021 06:16:31 - INFO - __main__ - Step 64181: {'lr': 0.00031206966572720676, 'samples': 12322752, 'steps': 64180, 'loss/train': 1.5446808338165283} 11/07/2021 06:16:31 - INFO - __main__ - Step 64182: {'lr': 0.00031206452513019223, 'samples': 12322944, 'steps': 64181, 'loss/train': 1.4371540546417236} 11/07/2021 06:16:32 - INFO - __main__ - Step 64183: {'lr': 0.0003120593845052124, 'samples': 12323136, 'steps': 64182, 'loss/train': 1.577638030052185} 11/07/2021 06:16:33 - INFO - __main__ - Step 64184: {'lr': 0.0003120542438522695, 'samples': 12323328, 'steps': 64183, 'loss/train': 1.6035656929016113} 11/07/2021 06:16:33 - INFO - __main__ - Step 64185: {'lr': 0.000312049103171366, 'samples': 12323520, 'steps': 64184, 'loss/train': 1.7676893472671509} 11/07/2021 06:16:33 - INFO - __main__ - Step 64186: {'lr': 0.00031204396246250403, 'samples': 12323712, 'steps': 64185, 'loss/train': 1.483948826789856} 11/07/2021 06:16:34 - INFO - __main__ - Step 64187: {'lr': 0.00031203882172568614, 'samples': 12323904, 'steps': 64186, 'loss/train': 1.1854757070541382} 11/07/2021 06:16:35 - INFO - __main__ - Step 64188: {'lr': 0.0003120336809609144, 'samples': 12324096, 'steps': 64187, 'loss/train': 1.5967278480529785} 11/07/2021 06:16:35 - INFO - __main__ - Step 64189: {'lr': 0.0003120285401681913, 'samples': 12324288, 'steps': 64188, 'loss/train': 0.8577131032943726} 11/07/2021 06:16:36 - INFO - __main__ - Step 64190: {'lr': 0.0003120233993475191, 'samples': 12324480, 'steps': 64189, 'loss/train': 1.673332691192627} 11/07/2021 06:16:36 - INFO - __main__ - Step 64191: {'lr': 0.00031201825849890013, 'samples': 12324672, 'steps': 64190, 'loss/train': 1.8055919408798218} 11/07/2021 06:16:36 - INFO - __main__ - Step 64192: {'lr': 0.00031201311762233666, 'samples': 12324864, 'steps': 64191, 'loss/train': 1.4511024951934814} 11/07/2021 06:16:37 - INFO - __main__ - Step 64193: {'lr': 0.000312007976717831, 'samples': 12325056, 'steps': 64192, 'loss/train': 1.3506723642349243} 11/07/2021 06:16:38 - INFO - __main__ - Step 64194: {'lr': 0.0003120028357853856, 'samples': 12325248, 'steps': 64193, 'loss/train': 1.1766330003738403} 11/07/2021 06:16:38 - INFO - __main__ - Step 64195: {'lr': 0.0003119976948250026, 'samples': 12325440, 'steps': 64194, 'loss/train': 0.493897408246994} 11/07/2021 06:16:39 - INFO - __main__ - Step 64196: {'lr': 0.0003119925538366844, 'samples': 12325632, 'steps': 64195, 'loss/train': 1.3095437288284302} 11/07/2021 06:16:39 - INFO - __main__ - Step 64197: {'lr': 0.00031198741282043333, 'samples': 12325824, 'steps': 64196, 'loss/train': 1.5274325609207153} 11/07/2021 06:16:39 - INFO - __main__ - Step 64198: {'lr': 0.0003119822717762517, 'samples': 12326016, 'steps': 64197, 'loss/train': 0.39127317070961} 11/07/2021 06:16:40 - INFO - __main__ - Step 64199: {'lr': 0.0003119771307041418, 'samples': 12326208, 'steps': 64198, 'loss/train': 0.9421136379241943} 11/07/2021 06:16:41 - INFO - __main__ - Step 64200: {'lr': 0.000311971989604106, 'samples': 12326400, 'steps': 64199, 'loss/train': 0.9558649063110352} 11/07/2021 06:16:41 - INFO - __main__ - Step 64201: {'lr': 0.00031196684847614655, 'samples': 12326592, 'steps': 64200, 'loss/train': 1.2375202178955078} 11/07/2021 06:16:41 - INFO - __main__ - Step 64202: {'lr': 0.00031196170732026576, 'samples': 12326784, 'steps': 64201, 'loss/train': 1.6410737037658691} 11/07/2021 06:16:42 - INFO - __main__ - Step 64203: {'lr': 0.00031195656613646595, 'samples': 12326976, 'steps': 64202, 'loss/train': 1.0684564113616943} 11/07/2021 06:16:43 - INFO - __main__ - Step 64204: {'lr': 0.00031195142492474956, 'samples': 12327168, 'steps': 64203, 'loss/train': 1.2294678688049316} 11/07/2021 06:16:43 - INFO - __main__ - Step 64205: {'lr': 0.00031194628368511876, 'samples': 12327360, 'steps': 64204, 'loss/train': 0.5304142236709595} 11/07/2021 06:16:43 - INFO - __main__ - Step 64206: {'lr': 0.00031194114241757593, 'samples': 12327552, 'steps': 64205, 'loss/train': 1.6338419914245605} 11/07/2021 06:16:44 - INFO - __main__ - Step 64207: {'lr': 0.0003119360011221234, 'samples': 12327744, 'steps': 64206, 'loss/train': 1.3286248445510864} 11/07/2021 06:16:44 - INFO - __main__ - Step 64208: {'lr': 0.00031193085979876347, 'samples': 12327936, 'steps': 64207, 'loss/train': 1.591288685798645} 11/07/2021 06:16:45 - INFO - __main__ - Step 64209: {'lr': 0.0003119257184474984, 'samples': 12328128, 'steps': 64208, 'loss/train': 1.509394645690918} 11/07/2021 06:16:45 - INFO - __main__ - Step 64210: {'lr': 0.00031192057706833055, 'samples': 12328320, 'steps': 64209, 'loss/train': 0.8525235652923584} 11/07/2021 06:16:46 - INFO - __main__ - Step 64211: {'lr': 0.0003119154356612623, 'samples': 12328512, 'steps': 64210, 'loss/train': 1.694977879524231} 11/07/2021 06:16:46 - INFO - __main__ - Step 64212: {'lr': 0.0003119102942262959, 'samples': 12328704, 'steps': 64211, 'loss/train': 1.468130350112915} 11/07/2021 06:16:46 - INFO - __main__ - Step 64213: {'lr': 0.0003119051527634336, 'samples': 12328896, 'steps': 64212, 'loss/train': 1.0284817218780518} 11/07/2021 06:16:47 - INFO - __main__ - Step 64214: {'lr': 0.00031190001127267793, 'samples': 12329088, 'steps': 64213, 'loss/train': 1.3142497539520264} 11/07/2021 06:16:48 - INFO - __main__ - Step 64215: {'lr': 0.00031189486975403096, 'samples': 12329280, 'steps': 64214, 'loss/train': 0.6928154230117798} 11/07/2021 06:16:48 - INFO - __main__ - Step 64216: {'lr': 0.00031188972820749515, 'samples': 12329472, 'steps': 64215, 'loss/train': 1.5191947221755981} 11/07/2021 06:16:49 - INFO - __main__ - Step 64217: {'lr': 0.0003118845866330728, 'samples': 12329664, 'steps': 64216, 'loss/train': 1.6469529867172241} 11/07/2021 06:16:49 - INFO - __main__ - Step 64218: {'lr': 0.0003118794450307662, 'samples': 12329856, 'steps': 64217, 'loss/train': 1.4505600929260254} 11/07/2021 06:16:50 - INFO - __main__ - Step 64219: {'lr': 0.0003118743034005776, 'samples': 12330048, 'steps': 64218, 'loss/train': 1.4539436101913452} 11/07/2021 06:16:50 - INFO - __main__ - Step 64220: {'lr': 0.0003118691617425095, 'samples': 12330240, 'steps': 64219, 'loss/train': 1.7922450304031372} 11/07/2021 06:16:51 - INFO - __main__ - Step 64221: {'lr': 0.0003118640200565641, 'samples': 12330432, 'steps': 64220, 'loss/train': 1.5437250137329102} 11/07/2021 06:16:51 - INFO - __main__ - Step 64222: {'lr': 0.00031185887834274373, 'samples': 12330624, 'steps': 64221, 'loss/train': 1.222664475440979} 11/07/2021 06:16:51 - INFO - __main__ - Step 64223: {'lr': 0.0003118537366010507, 'samples': 12330816, 'steps': 64222, 'loss/train': 1.5318642854690552} 11/07/2021 06:16:52 - INFO - __main__ - Step 64224: {'lr': 0.00031184859483148733, 'samples': 12331008, 'steps': 64223, 'loss/train': 1.1026078462600708} 11/07/2021 06:16:53 - INFO - __main__ - Step 64225: {'lr': 0.00031184345303405587, 'samples': 12331200, 'steps': 64224, 'loss/train': 1.5124138593673706} 11/07/2021 06:16:53 - INFO - __main__ - Step 64226: {'lr': 0.00031183831120875873, 'samples': 12331392, 'steps': 64225, 'loss/train': 1.348199725151062} 11/07/2021 06:16:53 - INFO - __main__ - Step 64227: {'lr': 0.0003118331693555983, 'samples': 12331584, 'steps': 64226, 'loss/train': 0.7947556972503662} 11/07/2021 06:16:54 - INFO - __main__ - Step 64228: {'lr': 0.00031182802747457665, 'samples': 12331776, 'steps': 64227, 'loss/train': 1.8721392154693604} 11/07/2021 06:16:54 - INFO - __main__ - Step 64229: {'lr': 0.00031182288556569636, 'samples': 12331968, 'steps': 64228, 'loss/train': 1.2050702571868896} 11/07/2021 06:16:55 - INFO - __main__ - Step 64230: {'lr': 0.0003118177436289596, 'samples': 12332160, 'steps': 64229, 'loss/train': 1.7704325914382935} 11/07/2021 06:16:56 - INFO - __main__ - Step 64231: {'lr': 0.0003118126016643686, 'samples': 12332352, 'steps': 64230, 'loss/train': 0.9311701059341431} 11/07/2021 06:16:56 - INFO - __main__ - Step 64232: {'lr': 0.00031180745967192595, 'samples': 12332544, 'steps': 64231, 'loss/train': 1.1535274982452393} 11/07/2021 06:16:56 - INFO - __main__ - Step 64233: {'lr': 0.00031180231765163375, 'samples': 12332736, 'steps': 64232, 'loss/train': 1.53884756565094} 11/07/2021 06:16:57 - INFO - __main__ - Step 64234: {'lr': 0.00031179717560349447, 'samples': 12332928, 'steps': 64233, 'loss/train': 1.0164769887924194} 11/07/2021 06:16:58 - INFO - __main__ - Step 64235: {'lr': 0.0003117920335275102, 'samples': 12333120, 'steps': 64234, 'loss/train': 1.3719590902328491} 11/07/2021 06:16:58 - INFO - __main__ - Step 64236: {'lr': 0.0003117868914236835, 'samples': 12333312, 'steps': 64235, 'loss/train': 1.3505131006240845} 11/07/2021 06:16:58 - INFO - __main__ - Step 64237: {'lr': 0.0003117817492920165, 'samples': 12333504, 'steps': 64236, 'loss/train': 1.5246117115020752} 11/07/2021 06:16:59 - INFO - __main__ - Step 64238: {'lr': 0.0003117766071325117, 'samples': 12333696, 'steps': 64237, 'loss/train': 1.5024970769882202} 11/07/2021 06:16:59 - INFO - __main__ - Step 64239: {'lr': 0.00031177146494517114, 'samples': 12333888, 'steps': 64238, 'loss/train': 0.8757146596908569} 11/07/2021 06:17:00 - INFO - __main__ - Step 64240: {'lr': 0.00031176632272999745, 'samples': 12334080, 'steps': 64239, 'loss/train': 0.5329570174217224} 11/07/2021 06:17:01 - INFO - __main__ - Step 64241: {'lr': 0.00031176118048699284, 'samples': 12334272, 'steps': 64240, 'loss/train': 1.1874568462371826} 11/07/2021 06:17:01 - INFO - __main__ - Step 64242: {'lr': 0.0003117560382161595, 'samples': 12334464, 'steps': 64241, 'loss/train': 1.356635332107544} 11/07/2021 06:17:01 - INFO - __main__ - Step 64243: {'lr': 0.0003117508959174998, 'samples': 12334656, 'steps': 64242, 'loss/train': 1.5432541370391846} 11/07/2021 06:17:02 - INFO - __main__ - Step 64244: {'lr': 0.0003117457535910162, 'samples': 12334848, 'steps': 64243, 'loss/train': 1.1046733856201172} 11/07/2021 06:17:02 - INFO - __main__ - Step 64245: {'lr': 0.0003117406112367109, 'samples': 12335040, 'steps': 64244, 'loss/train': 1.2513538599014282} 11/07/2021 06:17:03 - INFO - __main__ - Step 64246: {'lr': 0.00031173546885458623, 'samples': 12335232, 'steps': 64245, 'loss/train': 1.0698587894439697} 11/07/2021 06:17:03 - INFO - __main__ - Step 64247: {'lr': 0.00031173032644464456, 'samples': 12335424, 'steps': 64246, 'loss/train': 1.3360543251037598} 11/07/2021 06:17:04 - INFO - __main__ - Step 64248: {'lr': 0.000311725184006888, 'samples': 12335616, 'steps': 64247, 'loss/train': 1.1789697408676147} 11/07/2021 06:17:04 - INFO - __main__ - Step 64249: {'lr': 0.0003117200415413192, 'samples': 12335808, 'steps': 64248, 'loss/train': 1.3316409587860107} 11/07/2021 06:17:05 - INFO - __main__ - Step 64250: {'lr': 0.0003117148990479402, 'samples': 12336000, 'steps': 64249, 'loss/train': 1.3065505027770996} 11/07/2021 06:17:05 - INFO - __main__ - Step 64251: {'lr': 0.0003117097565267534, 'samples': 12336192, 'steps': 64250, 'loss/train': 1.2357457876205444} 11/07/2021 06:17:06 - INFO - __main__ - Step 64252: {'lr': 0.00031170461397776115, 'samples': 12336384, 'steps': 64251, 'loss/train': 1.088868260383606} 11/07/2021 06:17:06 - INFO - __main__ - Step 64253: {'lr': 0.0003116994714009658, 'samples': 12336576, 'steps': 64252, 'loss/train': 1.0208780765533447} 11/07/2021 06:17:07 - INFO - __main__ - Step 64254: {'lr': 0.0003116943287963697, 'samples': 12336768, 'steps': 64253, 'loss/train': 1.2344292402267456} 11/07/2021 06:17:07 - INFO - __main__ - Step 64255: {'lr': 0.00031168918616397495, 'samples': 12336960, 'steps': 64254, 'loss/train': 1.197978138923645} 11/07/2021 06:17:08 - INFO - __main__ - Step 64256: {'lr': 0.000311684043503784, 'samples': 12337152, 'steps': 64255, 'loss/train': 1.6368237733840942} 11/07/2021 06:17:08 - INFO - __main__ - Step 64257: {'lr': 0.00031167890081579925, 'samples': 12337344, 'steps': 64256, 'loss/train': 0.7683295607566833} 11/07/2021 06:17:09 - INFO - __main__ - Step 64258: {'lr': 0.0003116737581000229, 'samples': 12337536, 'steps': 64257, 'loss/train': 0.6215443015098572} 11/07/2021 06:17:09 - INFO - __main__ - Step 64259: {'lr': 0.0003116686153564573, 'samples': 12337728, 'steps': 64258, 'loss/train': 1.4435133934020996} 11/07/2021 06:17:09 - INFO - __main__ - Step 64260: {'lr': 0.0003116634725851048, 'samples': 12337920, 'steps': 64259, 'loss/train': 1.0766360759735107} 11/07/2021 06:17:11 - INFO - __main__ - Step 64261: {'lr': 0.0003116583297859677, 'samples': 12338112, 'steps': 64260, 'loss/train': 1.1157187223434448} 11/07/2021 06:17:11 - INFO - __main__ - Step 64262: {'lr': 0.00031165318695904824, 'samples': 12338304, 'steps': 64261, 'loss/train': 1.3763169050216675} 11/07/2021 06:17:11 - INFO - __main__ - Step 64263: {'lr': 0.0003116480441043489, 'samples': 12338496, 'steps': 64262, 'loss/train': 0.8172218203544617} 11/07/2021 06:17:12 - INFO - __main__ - Step 64264: {'lr': 0.0003116429012218718, 'samples': 12338688, 'steps': 64263, 'loss/train': 1.3743083477020264} 11/07/2021 06:17:12 - INFO - __main__ - Step 64265: {'lr': 0.00031163775831161947, 'samples': 12338880, 'steps': 64264, 'loss/train': 1.3380141258239746} 11/07/2021 06:17:12 - INFO - __main__ - Step 64266: {'lr': 0.00031163261537359404, 'samples': 12339072, 'steps': 64265, 'loss/train': 0.9835327863693237} 11/07/2021 06:17:13 - INFO - __main__ - Step 64267: {'lr': 0.0003116274724077979, 'samples': 12339264, 'steps': 64266, 'loss/train': 1.0069528818130493} 11/07/2021 06:17:14 - INFO - __main__ - Step 64268: {'lr': 0.0003116223294142334, 'samples': 12339456, 'steps': 64267, 'loss/train': 1.3630282878875732} 11/07/2021 06:17:14 - INFO - __main__ - Step 64269: {'lr': 0.00031161718639290283, 'samples': 12339648, 'steps': 64268, 'loss/train': 0.9935979247093201} 11/07/2021 06:17:14 - INFO - __main__ - Step 64270: {'lr': 0.0003116120433438085, 'samples': 12339840, 'steps': 64269, 'loss/train': 1.5819684267044067} 11/07/2021 06:17:15 - INFO - __main__ - Step 64271: {'lr': 0.00031160690026695275, 'samples': 12340032, 'steps': 64270, 'loss/train': 1.4344286918640137} 11/07/2021 06:17:16 - INFO - __main__ - Step 64272: {'lr': 0.00031160175716233793, 'samples': 12340224, 'steps': 64271, 'loss/train': 1.5819156169891357} 11/07/2021 06:17:16 - INFO - __main__ - Step 64273: {'lr': 0.00031159661402996617, 'samples': 12340416, 'steps': 64272, 'loss/train': 1.2389897108078003} 11/07/2021 06:17:17 - INFO - __main__ - Step 64274: {'lr': 0.00031159147086984003, 'samples': 12340608, 'steps': 64273, 'loss/train': 0.5517760515213013} 11/07/2021 06:17:17 - INFO - __main__ - Step 64275: {'lr': 0.0003115863276819617, 'samples': 12340800, 'steps': 64274, 'loss/train': 1.2823106050491333} 11/07/2021 06:17:17 - INFO - __main__ - Step 64276: {'lr': 0.00031158118446633355, 'samples': 12340992, 'steps': 64275, 'loss/train': 1.4322761297225952} 11/07/2021 06:17:18 - INFO - __main__ - Step 64277: {'lr': 0.0003115760412229578, 'samples': 12341184, 'steps': 64276, 'loss/train': 1.331775426864624} 11/07/2021 06:17:19 - INFO - __main__ - Step 64278: {'lr': 0.0003115708979518369, 'samples': 12341376, 'steps': 64277, 'loss/train': 3.9784915447235107} 11/07/2021 06:17:19 - INFO - __main__ - Step 64279: {'lr': 0.00031156575465297306, 'samples': 12341568, 'steps': 64278, 'loss/train': 1.6208183765411377} 11/07/2021 06:17:19 - INFO - __main__ - Step 64280: {'lr': 0.00031156061132636866, 'samples': 12341760, 'steps': 64279, 'loss/train': 1.222280502319336} 11/07/2021 06:17:20 - INFO - __main__ - Step 64281: {'lr': 0.00031155546797202597, 'samples': 12341952, 'steps': 64280, 'loss/train': 1.4298837184906006} 11/07/2021 06:17:20 - INFO - __main__ - Step 64282: {'lr': 0.0003115503245899474, 'samples': 12342144, 'steps': 64281, 'loss/train': 0.23977617919445038} 11/07/2021 06:17:21 - INFO - __main__ - Step 64283: {'lr': 0.0003115451811801351, 'samples': 12342336, 'steps': 64282, 'loss/train': 1.2323404550552368} 11/07/2021 06:17:22 - INFO - __main__ - Step 64284: {'lr': 0.0003115400377425916, 'samples': 12342528, 'steps': 64283, 'loss/train': 1.3595176935195923} 11/07/2021 06:17:22 - INFO - __main__ - Step 64285: {'lr': 0.00031153489427731906, 'samples': 12342720, 'steps': 64284, 'loss/train': 1.5330004692077637} 11/07/2021 06:17:22 - INFO - __main__ - Step 64286: {'lr': 0.0003115297507843198, 'samples': 12342912, 'steps': 64285, 'loss/train': 0.6413747072219849} 11/07/2021 06:17:23 - INFO - __main__ - Step 64287: {'lr': 0.00031152460726359627, 'samples': 12343104, 'steps': 64286, 'loss/train': 2.23771595954895} 11/07/2021 06:17:24 - INFO - __main__ - Step 64288: {'lr': 0.0003115194637151507, 'samples': 12343296, 'steps': 64287, 'loss/train': 1.7800078392028809} 11/07/2021 06:17:24 - INFO - __main__ - Step 64289: {'lr': 0.00031151432013898535, 'samples': 12343488, 'steps': 64288, 'loss/train': 1.1448698043823242} 11/07/2021 06:17:24 - INFO - __main__ - Step 64290: {'lr': 0.00031150917653510263, 'samples': 12343680, 'steps': 64289, 'loss/train': 1.1926072835922241} 11/07/2021 06:17:25 - INFO - __main__ - Step 64291: {'lr': 0.00031150403290350484, 'samples': 12343872, 'steps': 64290, 'loss/train': 1.3091577291488647} 11/07/2021 06:17:25 - INFO - __main__ - Step 64292: {'lr': 0.00031149888924419424, 'samples': 12344064, 'steps': 64291, 'loss/train': 1.6209492683410645} 11/07/2021 06:17:26 - INFO - __main__ - Step 64293: {'lr': 0.00031149374555717316, 'samples': 12344256, 'steps': 64292, 'loss/train': 0.7459496259689331} 11/07/2021 06:17:26 - INFO - __main__ - Step 64294: {'lr': 0.00031148860184244406, 'samples': 12344448, 'steps': 64293, 'loss/train': 1.3861970901489258} 11/07/2021 06:17:27 - INFO - __main__ - Step 64295: {'lr': 0.00031148345810000903, 'samples': 12344640, 'steps': 64294, 'loss/train': 1.551000952720642} 11/07/2021 06:17:27 - INFO - __main__ - Step 64296: {'lr': 0.0003114783143298706, 'samples': 12344832, 'steps': 64295, 'loss/train': 1.6217788457870483} 11/07/2021 06:17:27 - INFO - __main__ - Step 64297: {'lr': 0.00031147317053203087, 'samples': 12345024, 'steps': 64296, 'loss/train': 1.8015739917755127} 11/07/2021 06:17:28 - INFO - __main__ - Step 64298: {'lr': 0.0003114680267064924, 'samples': 12345216, 'steps': 64297, 'loss/train': 1.5419975519180298} 11/07/2021 06:17:29 - INFO - __main__ - Step 64299: {'lr': 0.0003114628828532573, 'samples': 12345408, 'steps': 64298, 'loss/train': 0.48436081409454346} 11/07/2021 06:17:29 - INFO - __main__ - Step 64300: {'lr': 0.000311457738972328, 'samples': 12345600, 'steps': 64299, 'loss/train': 1.0275404453277588} 11/07/2021 06:17:30 - INFO - __main__ - Step 64301: {'lr': 0.00031145259506370685, 'samples': 12345792, 'steps': 64300, 'loss/train': 1.636919379234314} 11/07/2021 06:17:30 - INFO - __main__ - Step 64302: {'lr': 0.00031144745112739603, 'samples': 12345984, 'steps': 64301, 'loss/train': 0.8199701309204102} 11/07/2021 06:17:31 - INFO - __main__ - Step 64303: {'lr': 0.00031144230716339795, 'samples': 12346176, 'steps': 64302, 'loss/train': 1.1399554014205933} 11/07/2021 06:17:31 - INFO - __main__ - Step 64304: {'lr': 0.00031143716317171493, 'samples': 12346368, 'steps': 64303, 'loss/train': 1.4326950311660767} 11/07/2021 06:17:32 - INFO - __main__ - Step 64305: {'lr': 0.00031143201915234924, 'samples': 12346560, 'steps': 64304, 'loss/train': 1.4060949087142944} 11/07/2021 06:17:32 - INFO - __main__ - Step 64306: {'lr': 0.0003114268751053033, 'samples': 12346752, 'steps': 64305, 'loss/train': 1.194818377494812} 11/07/2021 06:17:32 - INFO - __main__ - Step 64307: {'lr': 0.0003114217310305793, 'samples': 12346944, 'steps': 64306, 'loss/train': 1.3100347518920898} 11/07/2021 06:17:33 - INFO - __main__ - Step 64308: {'lr': 0.00031141658692817963, 'samples': 12347136, 'steps': 64307, 'loss/train': 1.4129728078842163} 11/07/2021 06:17:34 - INFO - __main__ - Step 64309: {'lr': 0.0003114114427981066, 'samples': 12347328, 'steps': 64308, 'loss/train': 1.3629199266433716} 11/07/2021 06:17:34 - INFO - __main__ - Step 64310: {'lr': 0.0003114062986403625, 'samples': 12347520, 'steps': 64309, 'loss/train': 1.007581114768982} 11/07/2021 06:17:34 - INFO - __main__ - Step 64311: {'lr': 0.0003114011544549497, 'samples': 12347712, 'steps': 64310, 'loss/train': 1.4361693859100342} 11/07/2021 06:17:35 - INFO - __main__ - Step 64312: {'lr': 0.0003113960102418705, 'samples': 12347904, 'steps': 64311, 'loss/train': 1.144338846206665} 11/07/2021 06:17:36 - INFO - __main__ - Step 64313: {'lr': 0.00031139086600112713, 'samples': 12348096, 'steps': 64312, 'loss/train': 1.1781221628189087} 11/07/2021 06:17:36 - INFO - __main__ - Step 64314: {'lr': 0.00031138572173272205, 'samples': 12348288, 'steps': 64313, 'loss/train': 1.3557771444320679} 11/07/2021 06:17:37 - INFO - __main__ - Step 64315: {'lr': 0.00031138057743665756, 'samples': 12348480, 'steps': 64314, 'loss/train': 1.022281527519226} 11/07/2021 06:17:37 - INFO - __main__ - Step 64316: {'lr': 0.0003113754331129359, 'samples': 12348672, 'steps': 64315, 'loss/train': 1.5188144445419312} 11/07/2021 06:17:37 - INFO - __main__ - Step 64317: {'lr': 0.0003113702887615593, 'samples': 12348864, 'steps': 64316, 'loss/train': 0.9748349189758301} 11/07/2021 06:17:38 - INFO - __main__ - Step 64318: {'lr': 0.00031136514438253026, 'samples': 12349056, 'steps': 64317, 'loss/train': 1.4513623714447021} 11/07/2021 06:17:39 - INFO - __main__ - Step 64319: {'lr': 0.0003113599999758511, 'samples': 12349248, 'steps': 64318, 'loss/train': 1.7605866193771362} 11/07/2021 06:17:39 - INFO - __main__ - Step 64320: {'lr': 0.000311354855541524, 'samples': 12349440, 'steps': 64319, 'loss/train': 1.5175087451934814} 11/07/2021 06:17:39 - INFO - __main__ - Step 64321: {'lr': 0.0003113497110795514, 'samples': 12349632, 'steps': 64320, 'loss/train': 1.2316827774047852} 11/07/2021 06:17:40 - INFO - __main__ - Step 64322: {'lr': 0.0003113445665899355, 'samples': 12349824, 'steps': 64321, 'loss/train': 1.2040547132492065} 11/07/2021 06:17:40 - INFO - __main__ - Step 64323: {'lr': 0.0003113394220726787, 'samples': 12350016, 'steps': 64322, 'loss/train': 1.4681057929992676} 11/07/2021 06:17:41 - INFO - __main__ - Step 64324: {'lr': 0.0003113342775277834, 'samples': 12350208, 'steps': 64323, 'loss/train': 0.7932981848716736} 11/07/2021 06:17:42 - INFO - __main__ - Step 64325: {'lr': 0.0003113291329552517, 'samples': 12350400, 'steps': 64324, 'loss/train': 1.3885477781295776} 11/07/2021 06:17:42 - INFO - __main__ - Step 64326: {'lr': 0.00031132398835508605, 'samples': 12350592, 'steps': 64325, 'loss/train': 1.4027165174484253} 11/07/2021 06:17:42 - INFO - __main__ - Step 64327: {'lr': 0.0003113188437272888, 'samples': 12350784, 'steps': 64326, 'loss/train': 1.490628957748413} 11/07/2021 06:17:43 - INFO - __main__ - Step 64328: {'lr': 0.00031131369907186227, 'samples': 12350976, 'steps': 64327, 'loss/train': 1.1036334037780762} 11/07/2021 06:17:44 - INFO - __main__ - Step 64329: {'lr': 0.00031130855438880867, 'samples': 12351168, 'steps': 64328, 'loss/train': 1.681276559829712} 11/07/2021 06:17:44 - INFO - __main__ - Step 64330: {'lr': 0.00031130340967813037, 'samples': 12351360, 'steps': 64329, 'loss/train': 1.7307283878326416} 11/07/2021 06:17:44 - INFO - __main__ - Step 64331: {'lr': 0.00031129826493982973, 'samples': 12351552, 'steps': 64330, 'loss/train': 1.2643094062805176} 11/07/2021 06:17:45 - INFO - __main__ - Step 64332: {'lr': 0.000311293120173909, 'samples': 12351744, 'steps': 64331, 'loss/train': 1.298771619796753} 11/07/2021 06:17:45 - INFO - __main__ - Step 64333: {'lr': 0.0003112879753803706, 'samples': 12351936, 'steps': 64332, 'loss/train': 1.7791659832000732} 11/07/2021 06:17:45 - INFO - __main__ - Step 64334: {'lr': 0.0003112828305592167, 'samples': 12352128, 'steps': 64333, 'loss/train': 1.67943274974823} 11/07/2021 06:17:46 - INFO - __main__ - Step 64335: {'lr': 0.0003112776857104498, 'samples': 12352320, 'steps': 64334, 'loss/train': 1.2794233560562134} 11/07/2021 06:17:47 - INFO - __main__ - Step 64336: {'lr': 0.0003112725408340721, 'samples': 12352512, 'steps': 64335, 'loss/train': 1.1784336566925049} 11/07/2021 06:17:47 - INFO - __main__ - Step 64337: {'lr': 0.00031126739593008586, 'samples': 12352704, 'steps': 64336, 'loss/train': 1.4858239889144897} 11/07/2021 06:17:47 - INFO - __main__ - Step 64338: {'lr': 0.00031126225099849356, 'samples': 12352896, 'steps': 64337, 'loss/train': 1.235020399093628} 11/07/2021 06:17:48 - INFO - __main__ - Step 64339: {'lr': 0.00031125710603929736, 'samples': 12353088, 'steps': 64338, 'loss/train': 1.868591070175171} 11/07/2021 06:17:49 - INFO - __main__ - Step 64340: {'lr': 0.0003112519610524997, 'samples': 12353280, 'steps': 64339, 'loss/train': 1.771009087562561} 11/07/2021 06:17:50 - INFO - __main__ - Step 64341: {'lr': 0.00031124681603810286, 'samples': 12353472, 'steps': 64340, 'loss/train': 1.2972071170806885} 11/07/2021 06:17:50 - INFO - __main__ - Step 64342: {'lr': 0.0003112416709961092, 'samples': 12353664, 'steps': 64341, 'loss/train': 0.8883042931556702} 11/07/2021 06:17:50 - INFO - __main__ - Step 64343: {'lr': 0.00031123652592652087, 'samples': 12353856, 'steps': 64342, 'loss/train': 1.5561474561691284} 11/07/2021 06:17:51 - INFO - __main__ - Step 64344: {'lr': 0.0003112313808293403, 'samples': 12354048, 'steps': 64343, 'loss/train': 1.2033318281173706} 11/07/2021 06:17:52 - INFO - __main__ - Step 64345: {'lr': 0.0003112262357045699, 'samples': 12354240, 'steps': 64344, 'loss/train': 1.0624504089355469} 11/07/2021 06:17:52 - INFO - __main__ - Step 64346: {'lr': 0.00031122109055221187, 'samples': 12354432, 'steps': 64345, 'loss/train': 1.5139741897583008} 11/07/2021 06:17:52 - INFO - __main__ - Step 64347: {'lr': 0.0003112159453722686, 'samples': 12354624, 'steps': 64346, 'loss/train': 1.7889249324798584} 11/07/2021 06:17:53 - INFO - __main__ - Step 64348: {'lr': 0.0003112108001647423, 'samples': 12354816, 'steps': 64347, 'loss/train': 2.12790584564209} 11/07/2021 06:17:53 - INFO - __main__ - Step 64349: {'lr': 0.0003112056549296354, 'samples': 12355008, 'steps': 64348, 'loss/train': 1.181300401687622} 11/07/2021 06:17:55 - INFO - __main__ - Step 64350: {'lr': 0.0003112005096669502, 'samples': 12355200, 'steps': 64349, 'loss/train': 1.6800435781478882} 11/07/2021 06:17:55 - INFO - __main__ - Step 64351: {'lr': 0.000311195364376689, 'samples': 12355392, 'steps': 64350, 'loss/train': 1.1087945699691772} 11/07/2021 06:17:56 - INFO - __main__ - Step 64352: {'lr': 0.00031119021905885404, 'samples': 12355584, 'steps': 64351, 'loss/train': 1.2518583536148071} 11/07/2021 06:17:56 - INFO - __main__ - Step 64353: {'lr': 0.00031118507371344774, 'samples': 12355776, 'steps': 64352, 'loss/train': 1.2802201509475708} 11/07/2021 06:17:56 - INFO - __main__ - Step 64354: {'lr': 0.00031117992834047244, 'samples': 12355968, 'steps': 64353, 'loss/train': 0.6498432755470276} 11/07/2021 06:17:57 - INFO - __main__ - Step 64355: {'lr': 0.0003111747829399304, 'samples': 12356160, 'steps': 64354, 'loss/train': 0.4742223620414734} 11/07/2021 06:17:58 - INFO - __main__ - Step 64356: {'lr': 0.0003111696375118239, 'samples': 12356352, 'steps': 64355, 'loss/train': 1.4606832265853882} 11/07/2021 06:17:58 - INFO - __main__ - Step 64357: {'lr': 0.0003111644920561553, 'samples': 12356544, 'steps': 64356, 'loss/train': 1.1803609132766724} 11/07/2021 06:17:58 - INFO - __main__ - Step 64358: {'lr': 0.0003111593465729269, 'samples': 12356736, 'steps': 64357, 'loss/train': 1.2122007608413696} 11/07/2021 06:17:59 - INFO - __main__ - Step 64359: {'lr': 0.0003111542010621411, 'samples': 12356928, 'steps': 64358, 'loss/train': 1.7763116359710693} 11/07/2021 06:17:59 - INFO - __main__ - Step 64360: {'lr': 0.00031114905552380017, 'samples': 12357120, 'steps': 64359, 'loss/train': 1.068136215209961} 11/07/2021 06:18:00 - INFO - __main__ - Step 64361: {'lr': 0.0003111439099579064, 'samples': 12357312, 'steps': 64360, 'loss/train': 1.5673366785049438} 11/07/2021 06:18:00 - INFO - __main__ - Step 64362: {'lr': 0.00031113876436446216, 'samples': 12357504, 'steps': 64361, 'loss/train': 1.2773340940475464} 11/07/2021 06:18:01 - INFO - __main__ - Step 64363: {'lr': 0.00031113361874346966, 'samples': 12357696, 'steps': 64362, 'loss/train': 1.071543574333191} 11/07/2021 06:18:01 - INFO - __main__ - Step 64364: {'lr': 0.0003111284730949314, 'samples': 12357888, 'steps': 64363, 'loss/train': 1.5846366882324219} 11/07/2021 06:18:01 - INFO - __main__ - Step 64365: {'lr': 0.0003111233274188495, 'samples': 12358080, 'steps': 64364, 'loss/train': 1.4363102912902832} 11/07/2021 06:18:03 - INFO - __main__ - Step 64366: {'lr': 0.0003111181817152264, 'samples': 12358272, 'steps': 64365, 'loss/train': 1.2070220708847046} 11/07/2021 06:18:03 - INFO - __main__ - Step 64367: {'lr': 0.0003111130359840644, 'samples': 12358464, 'steps': 64366, 'loss/train': 1.1227355003356934} 11/07/2021 06:18:03 - INFO - __main__ - Step 64368: {'lr': 0.0003111078902253658, 'samples': 12358656, 'steps': 64367, 'loss/train': 1.5641512870788574} 11/07/2021 06:18:04 - INFO - __main__ - Step 64369: {'lr': 0.00031110274443913295, 'samples': 12358848, 'steps': 64368, 'loss/train': 0.18496084213256836} 11/07/2021 06:18:04 - INFO - __main__ - Step 64370: {'lr': 0.0003110975986253681, 'samples': 12359040, 'steps': 64369, 'loss/train': 1.5299335718154907} 11/07/2021 06:18:05 - INFO - __main__ - Step 64371: {'lr': 0.0003110924527840736, 'samples': 12359232, 'steps': 64370, 'loss/train': 1.24351966381073} 11/07/2021 06:18:06 - INFO - __main__ - Step 64372: {'lr': 0.0003110873069152518, 'samples': 12359424, 'steps': 64371, 'loss/train': 1.377604365348816} 11/07/2021 06:18:06 - INFO - __main__ - Step 64373: {'lr': 0.0003110821610189051, 'samples': 12359616, 'steps': 64372, 'loss/train': 1.6086970567703247} 11/07/2021 06:18:06 - INFO - __main__ - Step 64374: {'lr': 0.0003110770150950356, 'samples': 12359808, 'steps': 64373, 'loss/train': 0.9112154841423035} 11/07/2021 06:18:07 - INFO - __main__ - Step 64375: {'lr': 0.00031107186914364584, 'samples': 12360000, 'steps': 64374, 'loss/train': 1.243470311164856} 11/07/2021 06:18:07 - INFO - __main__ - Step 64376: {'lr': 0.000311066723164738, 'samples': 12360192, 'steps': 64375, 'loss/train': 0.5697004199028015} 11/07/2021 06:18:08 - INFO - __main__ - Step 64377: {'lr': 0.0003110615771583144, 'samples': 12360384, 'steps': 64376, 'loss/train': 1.861837387084961} 11/07/2021 06:18:08 - INFO - __main__ - Step 64378: {'lr': 0.00031105643112437745, 'samples': 12360576, 'steps': 64377, 'loss/train': 1.5706568956375122} 11/07/2021 06:18:09 - INFO - __main__ - Step 64379: {'lr': 0.00031105128506292933, 'samples': 12360768, 'steps': 64378, 'loss/train': 1.4514089822769165} 11/07/2021 06:18:09 - INFO - __main__ - Step 64380: {'lr': 0.0003110461389739725, 'samples': 12360960, 'steps': 64379, 'loss/train': 1.7342199087142944} 11/07/2021 06:18:09 - INFO - __main__ - Step 64381: {'lr': 0.0003110409928575092, 'samples': 12361152, 'steps': 64380, 'loss/train': 1.434861421585083} 11/07/2021 06:18:10 - INFO - __main__ - Step 64382: {'lr': 0.0003110358467135418, 'samples': 12361344, 'steps': 64381, 'loss/train': 1.3462953567504883} 11/07/2021 06:18:11 - INFO - __main__ - Step 64383: {'lr': 0.0003110307005420726, 'samples': 12361536, 'steps': 64382, 'loss/train': 1.0036394596099854} 11/07/2021 06:18:11 - INFO - __main__ - Step 64384: {'lr': 0.00031102555434310385, 'samples': 12361728, 'steps': 64383, 'loss/train': 1.2485069036483765} 11/07/2021 06:18:12 - INFO - __main__ - Step 64385: {'lr': 0.00031102040811663794, 'samples': 12361920, 'steps': 64384, 'loss/train': 1.163333535194397} 11/07/2021 06:18:12 - INFO - __main__ - Step 64386: {'lr': 0.0003110152618626772, 'samples': 12362112, 'steps': 64385, 'loss/train': 0.9540652632713318} 11/07/2021 06:18:13 - INFO - __main__ - Step 64387: {'lr': 0.0003110101155812239, 'samples': 12362304, 'steps': 64386, 'loss/train': 1.413712978363037} 11/07/2021 06:18:13 - INFO - __main__ - Step 64388: {'lr': 0.00031100496927228047, 'samples': 12362496, 'steps': 64387, 'loss/train': 1.3105300664901733} 11/07/2021 06:18:14 - INFO - __main__ - Step 64389: {'lr': 0.00031099982293584903, 'samples': 12362688, 'steps': 64388, 'loss/train': 1.42280912399292} 11/07/2021 06:18:14 - INFO - __main__ - Step 64390: {'lr': 0.0003109946765719321, 'samples': 12362880, 'steps': 64389, 'loss/train': 1.441165804862976} 11/07/2021 06:18:14 - INFO - __main__ - Step 64391: {'lr': 0.00031098953018053187, 'samples': 12363072, 'steps': 64390, 'loss/train': 1.448914885520935} 11/07/2021 06:18:15 - INFO - __main__ - Step 64392: {'lr': 0.00031098438376165065, 'samples': 12363264, 'steps': 64391, 'loss/train': 1.4370944499969482} 11/07/2021 06:18:16 - INFO - __main__ - Step 64393: {'lr': 0.00031097923731529086, 'samples': 12363456, 'steps': 64392, 'loss/train': 1.4750510454177856} 11/07/2021 06:18:16 - INFO - __main__ - Step 64394: {'lr': 0.0003109740908414548, 'samples': 12363648, 'steps': 64393, 'loss/train': 0.9855761528015137} 11/07/2021 06:18:16 - INFO - __main__ - Step 64395: {'lr': 0.0003109689443401447, 'samples': 12363840, 'steps': 64394, 'loss/train': 1.4487109184265137} 11/07/2021 06:18:17 - INFO - __main__ - Step 64396: {'lr': 0.00031096379781136296, 'samples': 12364032, 'steps': 64395, 'loss/train': 1.4569756984710693} 11/07/2021 06:18:18 - INFO - __main__ - Step 64397: {'lr': 0.00031095865125511186, 'samples': 12364224, 'steps': 64396, 'loss/train': 1.50651216506958} 11/07/2021 06:18:18 - INFO - __main__ - Step 64398: {'lr': 0.0003109535046713937, 'samples': 12364416, 'steps': 64397, 'loss/train': 2.0018274784088135} 11/07/2021 06:18:18 - INFO - __main__ - Step 64399: {'lr': 0.0003109483580602109, 'samples': 12364608, 'steps': 64398, 'loss/train': 0.4213200807571411} 11/07/2021 06:18:19 - INFO - __main__ - Step 64400: {'lr': 0.00031094321142156574, 'samples': 12364800, 'steps': 64399, 'loss/train': 1.4000825881958008} 11/07/2021 06:18:19 - INFO - __main__ - Step 64401: {'lr': 0.00031093806475546046, 'samples': 12364992, 'steps': 64400, 'loss/train': 1.7811331748962402} 11/07/2021 06:18:20 - INFO - __main__ - Step 64402: {'lr': 0.0003109329180618974, 'samples': 12365184, 'steps': 64401, 'loss/train': 1.1759394407272339} 11/07/2021 06:18:21 - INFO - __main__ - Step 64403: {'lr': 0.00031092777134087893, 'samples': 12365376, 'steps': 64402, 'loss/train': 1.2734906673431396} 11/07/2021 06:18:21 - INFO - __main__ - Step 64404: {'lr': 0.0003109226245924073, 'samples': 12365568, 'steps': 64403, 'loss/train': 1.875203013420105} 11/07/2021 06:18:21 - INFO - __main__ - Step 64405: {'lr': 0.00031091747781648496, 'samples': 12365760, 'steps': 64404, 'loss/train': 0.891119658946991} 11/07/2021 06:18:22 - INFO - __main__ - Step 64406: {'lr': 0.00031091233101311405, 'samples': 12365952, 'steps': 64405, 'loss/train': 1.5407934188842773} 11/07/2021 06:18:22 - INFO - __main__ - Step 64407: {'lr': 0.0003109071841822971, 'samples': 12366144, 'steps': 64406, 'loss/train': 1.2352184057235718} 11/07/2021 06:18:23 - INFO - __main__ - Step 64408: {'lr': 0.0003109020373240362, 'samples': 12366336, 'steps': 64407, 'loss/train': 0.08685826510190964} 11/07/2021 06:18:23 - INFO - __main__ - Step 64409: {'lr': 0.0003108968904383338, 'samples': 12366528, 'steps': 64408, 'loss/train': 1.0818636417388916} 11/07/2021 06:18:24 - INFO - __main__ - Step 64410: {'lr': 0.00031089174352519225, 'samples': 12366720, 'steps': 64409, 'loss/train': 2.058584213256836} 11/07/2021 06:18:24 - INFO - __main__ - Step 64411: {'lr': 0.0003108865965846138, 'samples': 12366912, 'steps': 64410, 'loss/train': 1.477808952331543} 11/07/2021 06:18:25 - INFO - __main__ - Step 64412: {'lr': 0.00031088144961660083, 'samples': 12367104, 'steps': 64411, 'loss/train': 1.229239821434021} 11/07/2021 06:18:26 - INFO - __main__ - Step 64413: {'lr': 0.00031087630262115553, 'samples': 12367296, 'steps': 64412, 'loss/train': 1.2397093772888184} 11/07/2021 06:18:26 - INFO - __main__ - Step 64414: {'lr': 0.0003108711555982804, 'samples': 12367488, 'steps': 64413, 'loss/train': 1.6826647520065308} 11/07/2021 06:18:27 - INFO - __main__ - Step 64415: {'lr': 0.00031086600854797757, 'samples': 12367680, 'steps': 64414, 'loss/train': 1.4892154932022095} 11/07/2021 06:18:27 - INFO - __main__ - Step 64416: {'lr': 0.00031086086147024956, 'samples': 12367872, 'steps': 64415, 'loss/train': 1.2949227094650269} 11/07/2021 06:18:27 - INFO - __main__ - Step 64417: {'lr': 0.0003108557143650985, 'samples': 12368064, 'steps': 64416, 'loss/train': 1.3253108263015747} 11/07/2021 06:18:28 - INFO - __main__ - Step 64418: {'lr': 0.00031085056723252684, 'samples': 12368256, 'steps': 64417, 'loss/train': 0.2481679469347} 11/07/2021 06:18:29 - INFO - __main__ - Step 64419: {'lr': 0.0003108454200725368, 'samples': 12368448, 'steps': 64418, 'loss/train': 1.499453067779541} 11/07/2021 06:18:29 - INFO - __main__ - Step 64420: {'lr': 0.00031084027288513083, 'samples': 12368640, 'steps': 64419, 'loss/train': 2.056342363357544} 11/07/2021 06:18:30 - INFO - __main__ - Step 64421: {'lr': 0.0003108351256703111, 'samples': 12368832, 'steps': 64420, 'loss/train': 1.2817491292953491} 11/07/2021 06:18:30 - INFO - __main__ - Step 64422: {'lr': 0.0003108299784280801, 'samples': 12369024, 'steps': 64421, 'loss/train': 2.03193998336792} 11/07/2021 06:18:31 - INFO - __main__ - Step 64423: {'lr': 0.00031082483115843994, 'samples': 12369216, 'steps': 64422, 'loss/train': 1.5422592163085938} 11/07/2021 06:18:31 - INFO - __main__ - Step 64424: {'lr': 0.00031081968386139307, 'samples': 12369408, 'steps': 64423, 'loss/train': 1.5845035314559937} 11/07/2021 06:18:32 - INFO - __main__ - Step 64425: {'lr': 0.00031081453653694185, 'samples': 12369600, 'steps': 64424, 'loss/train': 1.6159271001815796} 11/07/2021 06:18:32 - INFO - __main__ - Step 64426: {'lr': 0.0003108093891850885, 'samples': 12369792, 'steps': 64425, 'loss/train': 1.7154778242111206} 11/07/2021 06:18:32 - INFO - __main__ - Step 64427: {'lr': 0.0003108042418058353, 'samples': 12369984, 'steps': 64426, 'loss/train': 1.3465951681137085} 11/07/2021 06:18:33 - INFO - __main__ - Step 64428: {'lr': 0.00031079909439918476, 'samples': 12370176, 'steps': 64427, 'loss/train': 1.5196363925933838} 11/07/2021 06:18:34 - INFO - __main__ - Step 64429: {'lr': 0.00031079394696513913, 'samples': 12370368, 'steps': 64428, 'loss/train': 1.2057528495788574} 11/07/2021 06:18:34 - INFO - __main__ - Step 64430: {'lr': 0.0003107887995037006, 'samples': 12370560, 'steps': 64429, 'loss/train': 1.631566047668457} 11/07/2021 06:18:34 - INFO - __main__ - Step 64431: {'lr': 0.0003107836520148716, 'samples': 12370752, 'steps': 64430, 'loss/train': 1.3158948421478271} 11/07/2021 06:18:35 - INFO - __main__ - Step 64432: {'lr': 0.00031077850449865433, 'samples': 12370944, 'steps': 64431, 'loss/train': 1.509680151939392} 11/07/2021 06:18:35 - INFO - __main__ - Step 64433: {'lr': 0.00031077335695505127, 'samples': 12371136, 'steps': 64432, 'loss/train': 1.4541646242141724} 11/07/2021 06:18:37 - INFO - __main__ - Step 64434: {'lr': 0.00031076820938406467, 'samples': 12371328, 'steps': 64433, 'loss/train': 1.7763025760650635} 11/07/2021 06:18:37 - INFO - __main__ - Step 64435: {'lr': 0.0003107630617856969, 'samples': 12371520, 'steps': 64434, 'loss/train': 1.2408638000488281} 11/07/2021 06:18:38 - INFO - __main__ - Step 64436: {'lr': 0.00031075791415995026, 'samples': 12371712, 'steps': 64435, 'loss/train': 1.4531288146972656} 11/07/2021 06:18:38 - INFO - __main__ - Step 64437: {'lr': 0.00031075276650682695, 'samples': 12371904, 'steps': 64436, 'loss/train': 0.28325310349464417} 11/07/2021 06:18:38 - INFO - __main__ - Step 64438: {'lr': 0.0003107476188263294, 'samples': 12372096, 'steps': 64437, 'loss/train': 1.3635197877883911} 11/07/2021 06:18:39 - INFO - __main__ - Step 64439: {'lr': 0.0003107424711184599, 'samples': 12372288, 'steps': 64438, 'loss/train': 1.7801979780197144} 11/07/2021 06:18:40 - INFO - __main__ - Step 64440: {'lr': 0.0003107373233832208, 'samples': 12372480, 'steps': 64439, 'loss/train': 1.2626007795333862} 11/07/2021 06:18:40 - INFO - __main__ - Step 64441: {'lr': 0.0003107321756206144, 'samples': 12372672, 'steps': 64440, 'loss/train': 1.449162483215332} 11/07/2021 06:18:41 - INFO - __main__ - Step 64442: {'lr': 0.00031072702783064307, 'samples': 12372864, 'steps': 64441, 'loss/train': 0.2914009094238281} 11/07/2021 06:18:41 - INFO - __main__ - Step 64443: {'lr': 0.00031072188001330905, 'samples': 12373056, 'steps': 64442, 'loss/train': 1.0604513883590698} 11/07/2021 06:18:42 - INFO - __main__ - Step 64444: {'lr': 0.00031071673216861463, 'samples': 12373248, 'steps': 64443, 'loss/train': 0.8078391551971436} 11/07/2021 06:18:42 - INFO - __main__ - Step 64445: {'lr': 0.0003107115842965622, 'samples': 12373440, 'steps': 64444, 'loss/train': 1.425939917564392} 11/07/2021 06:18:43 - INFO - __main__ - Step 64446: {'lr': 0.0003107064363971541, 'samples': 12373632, 'steps': 64445, 'loss/train': 1.4942128658294678} 11/07/2021 06:18:43 - INFO - __main__ - Step 64447: {'lr': 0.00031070128847039257, 'samples': 12373824, 'steps': 64446, 'loss/train': 1.2673029899597168} 11/07/2021 06:18:43 - INFO - __main__ - Step 64448: {'lr': 0.00031069614051628004, 'samples': 12374016, 'steps': 64447, 'loss/train': 1.4099483489990234} 11/07/2021 06:18:44 - INFO - __main__ - Step 64449: {'lr': 0.00031069099253481873, 'samples': 12374208, 'steps': 64448, 'loss/train': 1.3104630708694458} 11/07/2021 06:18:45 - INFO - __main__ - Step 64450: {'lr': 0.000310685844526011, 'samples': 12374400, 'steps': 64449, 'loss/train': 1.1597422361373901} 11/07/2021 06:18:45 - INFO - __main__ - Step 64451: {'lr': 0.0003106806964898592, 'samples': 12374592, 'steps': 64450, 'loss/train': 2.283325433731079} 11/07/2021 06:18:45 - INFO - __main__ - Step 64452: {'lr': 0.0003106755484263656, 'samples': 12374784, 'steps': 64451, 'loss/train': 0.9882016777992249} 11/07/2021 06:18:46 - INFO - __main__ - Step 64453: {'lr': 0.00031067040033553244, 'samples': 12374976, 'steps': 64452, 'loss/train': 1.8944851160049438} 11/07/2021 06:18:47 - INFO - __main__ - Step 64454: {'lr': 0.00031066525221736224, 'samples': 12375168, 'steps': 64453, 'loss/train': 1.205318570137024} 11/07/2021 06:18:47 - INFO - __main__ - Step 64455: {'lr': 0.0003106601040718572, 'samples': 12375360, 'steps': 64454, 'loss/train': 1.5682514905929565} 11/07/2021 06:18:47 - INFO - __main__ - Step 64456: {'lr': 0.00031065495589901966, 'samples': 12375552, 'steps': 64455, 'loss/train': 1.4449398517608643} 11/07/2021 06:18:48 - INFO - __main__ - Step 64457: {'lr': 0.0003106498076988519, 'samples': 12375744, 'steps': 64456, 'loss/train': 1.4835516214370728} 11/07/2021 06:18:48 - INFO - __main__ - Step 64458: {'lr': 0.00031064465947135627, 'samples': 12375936, 'steps': 64457, 'loss/train': 1.5826473236083984} 11/07/2021 06:18:49 - INFO - __main__ - Step 64459: {'lr': 0.0003106395112165351, 'samples': 12376128, 'steps': 64458, 'loss/train': 1.2903674840927124} 11/07/2021 06:18:50 - INFO - __main__ - Step 64460: {'lr': 0.00031063436293439066, 'samples': 12376320, 'steps': 64459, 'loss/train': 1.4661521911621094} 11/07/2021 06:18:50 - INFO - __main__ - Step 64461: {'lr': 0.0003106292146249254, 'samples': 12376512, 'steps': 64460, 'loss/train': 1.0102683305740356} 11/07/2021 06:18:50 - INFO - __main__ - Step 64462: {'lr': 0.0003106240662881415, 'samples': 12376704, 'steps': 64461, 'loss/train': 1.766306757926941} 11/07/2021 06:18:51 - INFO - __main__ - Step 64463: {'lr': 0.0003106189179240414, 'samples': 12376896, 'steps': 64462, 'loss/train': 1.6149543523788452} 11/07/2021 06:18:52 - INFO - __main__ - Step 64464: {'lr': 0.0003106137695326273, 'samples': 12377088, 'steps': 64463, 'loss/train': 1.3194544315338135} 11/07/2021 06:18:52 - INFO - __main__ - Step 64465: {'lr': 0.00031060862111390155, 'samples': 12377280, 'steps': 64464, 'loss/train': 0.39759689569473267} 11/07/2021 06:18:52 - INFO - __main__ - Step 64466: {'lr': 0.0003106034726678665, 'samples': 12377472, 'steps': 64465, 'loss/train': 1.3463854789733887} 11/07/2021 06:18:53 - INFO - __main__ - Step 64467: {'lr': 0.00031059832419452445, 'samples': 12377664, 'steps': 64466, 'loss/train': 1.2978808879852295} 11/07/2021 06:18:53 - INFO - __main__ - Step 64468: {'lr': 0.0003105931756938777, 'samples': 12377856, 'steps': 64467, 'loss/train': 1.3648529052734375} 11/07/2021 06:18:54 - INFO - __main__ - Step 64469: {'lr': 0.00031058802716592873, 'samples': 12378048, 'steps': 64468, 'loss/train': 1.7899971008300781} 11/07/2021 06:18:54 - INFO - __main__ - Step 64470: {'lr': 0.0003105828786106796, 'samples': 12378240, 'steps': 64469, 'loss/train': 1.2836068868637085} 11/07/2021 06:18:55 - INFO - __main__ - Step 64471: {'lr': 0.00031057773002813276, 'samples': 12378432, 'steps': 64470, 'loss/train': 1.337862491607666} 11/07/2021 06:18:55 - INFO - __main__ - Step 64472: {'lr': 0.0003105725814182906, 'samples': 12378624, 'steps': 64471, 'loss/train': 1.3629939556121826} 11/07/2021 06:18:56 - INFO - __main__ - Step 64473: {'lr': 0.00031056743278115535, 'samples': 12378816, 'steps': 64472, 'loss/train': 1.563594102859497} 11/07/2021 06:18:56 - INFO - __main__ - Step 64474: {'lr': 0.00031056228411672934, 'samples': 12379008, 'steps': 64473, 'loss/train': 1.251267671585083} 11/07/2021 06:18:57 - INFO - __main__ - Step 64475: {'lr': 0.00031055713542501483, 'samples': 12379200, 'steps': 64474, 'loss/train': 1.5049184560775757} 11/07/2021 06:18:57 - INFO - __main__ - Step 64476: {'lr': 0.00031055198670601437, 'samples': 12379392, 'steps': 64475, 'loss/train': 1.5835996866226196} 11/07/2021 06:18:58 - INFO - __main__ - Step 64477: {'lr': 0.00031054683795973007, 'samples': 12379584, 'steps': 64476, 'loss/train': 1.4745808839797974} 11/07/2021 06:18:58 - INFO - __main__ - Step 64478: {'lr': 0.0003105416891861642, 'samples': 12379776, 'steps': 64477, 'loss/train': 1.7997328042984009} 11/07/2021 06:18:59 - INFO - __main__ - Step 64479: {'lr': 0.00031053654038531927, 'samples': 12379968, 'steps': 64478, 'loss/train': 1.4978597164154053} 11/07/2021 06:18:59 - INFO - __main__ - Step 64480: {'lr': 0.00031053139155719743, 'samples': 12380160, 'steps': 64479, 'loss/train': 1.2528880834579468} 11/07/2021 06:19:00 - INFO - __main__ - Step 64481: {'lr': 0.00031052624270180114, 'samples': 12380352, 'steps': 64480, 'loss/train': 1.5805383920669556} 11/07/2021 06:19:00 - INFO - __main__ - Step 64482: {'lr': 0.0003105210938191326, 'samples': 12380544, 'steps': 64481, 'loss/train': 2.0169026851654053} 11/07/2021 06:19:00 - INFO - __main__ - Step 64483: {'lr': 0.0003105159449091943, 'samples': 12380736, 'steps': 64482, 'loss/train': 1.3407580852508545} 11/07/2021 06:19:01 - INFO - __main__ - Step 64484: {'lr': 0.0003105107959719884, 'samples': 12380928, 'steps': 64483, 'loss/train': 1.2766185998916626} 11/07/2021 06:19:02 - INFO - __main__ - Step 64485: {'lr': 0.0003105056470075172, 'samples': 12381120, 'steps': 64484, 'loss/train': 1.417485237121582} 11/07/2021 06:19:02 - INFO - __main__ - Step 64486: {'lr': 0.0003105004980157832, 'samples': 12381312, 'steps': 64485, 'loss/train': 1.4567251205444336} 11/07/2021 06:19:03 - INFO - __main__ - Step 64487: {'lr': 0.0003104953489967885, 'samples': 12381504, 'steps': 64486, 'loss/train': 0.5102068781852722} 11/07/2021 06:19:03 - INFO - __main__ - Step 64488: {'lr': 0.0003104901999505356, 'samples': 12381696, 'steps': 64487, 'loss/train': 0.1770712435245514} 11/07/2021 06:19:03 - INFO - __main__ - Step 64489: {'lr': 0.0003104850508770267, 'samples': 12381888, 'steps': 64488, 'loss/train': 1.749599814414978} 11/07/2021 06:19:04 - INFO - __main__ - Step 64490: {'lr': 0.00031047990177626424, 'samples': 12382080, 'steps': 64489, 'loss/train': 1.0720235109329224} 11/07/2021 06:19:05 - INFO - __main__ - Step 64491: {'lr': 0.0003104747526482504, 'samples': 12382272, 'steps': 64490, 'loss/train': 1.3082921504974365} 11/07/2021 06:19:05 - INFO - __main__ - Step 64492: {'lr': 0.0003104696034929876, 'samples': 12382464, 'steps': 64491, 'loss/train': 1.2733675241470337} 11/07/2021 06:19:05 - INFO - __main__ - Step 64493: {'lr': 0.0003104644543104781, 'samples': 12382656, 'steps': 64492, 'loss/train': 1.2923500537872314} 11/07/2021 06:19:06 - INFO - __main__ - Step 64494: {'lr': 0.00031045930510072427, 'samples': 12382848, 'steps': 64493, 'loss/train': 1.3991916179656982} 11/07/2021 06:19:07 - INFO - __main__ - Step 64495: {'lr': 0.00031045415586372844, 'samples': 12383040, 'steps': 64494, 'loss/train': 1.6813673973083496} 11/07/2021 06:19:07 - INFO - __main__ - Step 64496: {'lr': 0.00031044900659949295, 'samples': 12383232, 'steps': 64495, 'loss/train': 1.6035929918289185} 11/07/2021 06:19:08 - INFO - __main__ - Step 64497: {'lr': 0.0003104438573080199, 'samples': 12383424, 'steps': 64496, 'loss/train': 2.2722790241241455} 11/07/2021 06:19:08 - INFO - __main__ - Step 64498: {'lr': 0.00031043870798931194, 'samples': 12383616, 'steps': 64497, 'loss/train': 1.4281727075576782} 11/07/2021 06:19:08 - INFO - __main__ - Step 64499: {'lr': 0.00031043355864337113, 'samples': 12383808, 'steps': 64498, 'loss/train': 1.5345244407653809} 11/07/2021 06:19:09 - INFO - __main__ - Step 64500: {'lr': 0.00031042840927019994, 'samples': 12384000, 'steps': 64499, 'loss/train': 1.5098878145217896} 11/07/2021 06:19:10 - INFO - __main__ - Step 64501: {'lr': 0.00031042325986980064, 'samples': 12384192, 'steps': 64500, 'loss/train': 1.3364113569259644} 11/07/2021 06:19:10 - INFO - __main__ - Step 64502: {'lr': 0.0003104181104421755, 'samples': 12384384, 'steps': 64501, 'loss/train': 1.3845711946487427} 11/07/2021 06:19:10 - INFO - __main__ - Step 64503: {'lr': 0.000310412960987327, 'samples': 12384576, 'steps': 64502, 'loss/train': 1.5961384773254395} 11/07/2021 06:19:11 - INFO - __main__ - Step 64504: {'lr': 0.00031040781150525726, 'samples': 12384768, 'steps': 64503, 'loss/train': 1.529564619064331} 11/07/2021 06:19:12 - INFO - __main__ - Step 64505: {'lr': 0.0003104026619959687, 'samples': 12384960, 'steps': 64504, 'loss/train': 1.158545732498169} 11/07/2021 06:19:12 - INFO - __main__ - Step 64506: {'lr': 0.00031039751245946366, 'samples': 12385152, 'steps': 64505, 'loss/train': 1.5224494934082031} 11/07/2021 06:19:12 - INFO - __main__ - Step 64507: {'lr': 0.0003103923628957444, 'samples': 12385344, 'steps': 64506, 'loss/train': 1.1658382415771484} 11/07/2021 06:19:13 - INFO - __main__ - Step 64508: {'lr': 0.00031038721330481334, 'samples': 12385536, 'steps': 64507, 'loss/train': 1.4235256910324097} 11/07/2021 06:19:13 - INFO - __main__ - Step 64509: {'lr': 0.00031038206368667263, 'samples': 12385728, 'steps': 64508, 'loss/train': 0.8109691143035889} 11/07/2021 06:19:14 - INFO - __main__ - Step 64510: {'lr': 0.00031037691404132484, 'samples': 12385920, 'steps': 64509, 'loss/train': 1.203777551651001} 11/07/2021 06:19:15 - INFO - __main__ - Step 64511: {'lr': 0.0003103717643687721, 'samples': 12386112, 'steps': 64510, 'loss/train': 1.4260594844818115} 11/07/2021 06:19:15 - INFO - __main__ - Step 64512: {'lr': 0.00031036661466901666, 'samples': 12386304, 'steps': 64511, 'loss/train': 1.342487096786499} 11/07/2021 06:19:15 - INFO - __main__ - Step 64513: {'lr': 0.000310361464942061, 'samples': 12386496, 'steps': 64512, 'loss/train': 1.406256079673767} 11/07/2021 06:19:16 - INFO - __main__ - Step 64514: {'lr': 0.0003103563151879075, 'samples': 12386688, 'steps': 64513, 'loss/train': 1.120910406112671} 11/07/2021 06:19:16 - INFO - __main__ - Step 64515: {'lr': 0.00031035116540655824, 'samples': 12386880, 'steps': 64514, 'loss/train': 1.5858618021011353} 11/07/2021 06:19:17 - INFO - __main__ - Step 64516: {'lr': 0.0003103460155980158, 'samples': 12387072, 'steps': 64515, 'loss/train': 1.2597289085388184} 11/07/2021 06:19:17 - INFO - __main__ - Step 64517: {'lr': 0.00031034086576228227, 'samples': 12387264, 'steps': 64516, 'loss/train': 1.1812268495559692} 11/07/2021 06:19:18 - INFO - __main__ - Step 64518: {'lr': 0.00031033571589936015, 'samples': 12387456, 'steps': 64517, 'loss/train': 1.5780065059661865} 11/07/2021 06:19:18 - INFO - __main__ - Step 64519: {'lr': 0.0003103305660092516, 'samples': 12387648, 'steps': 64518, 'loss/train': 0.7617230415344238} 11/07/2021 06:19:18 - INFO - __main__ - Step 64520: {'lr': 0.0003103254160919591, 'samples': 12387840, 'steps': 64519, 'loss/train': 1.072746753692627} 11/07/2021 06:19:19 - INFO - __main__ - Step 64521: {'lr': 0.00031032026614748485, 'samples': 12388032, 'steps': 64520, 'loss/train': 1.2127937078475952} 11/07/2021 06:19:20 - INFO - __main__ - Step 64522: {'lr': 0.0003103151161758313, 'samples': 12388224, 'steps': 64521, 'loss/train': 1.8593577146530151} 11/07/2021 06:19:20 - INFO - __main__ - Step 64523: {'lr': 0.0003103099661770007, 'samples': 12388416, 'steps': 64522, 'loss/train': 1.234528660774231} 11/07/2021 06:19:20 - INFO - __main__ - Step 64524: {'lr': 0.00031030481615099527, 'samples': 12388608, 'steps': 64523, 'loss/train': 1.0186386108398438} 11/07/2021 06:19:21 - INFO - __main__ - Step 64525: {'lr': 0.00031029966609781747, 'samples': 12388800, 'steps': 64524, 'loss/train': 1.3962953090667725} 11/07/2021 06:19:22 - INFO - __main__ - Step 64526: {'lr': 0.0003102945160174695, 'samples': 12388992, 'steps': 64525, 'loss/train': 1.1185367107391357} 11/07/2021 06:19:22 - INFO - __main__ - Step 64527: {'lr': 0.0003102893659099538, 'samples': 12389184, 'steps': 64526, 'loss/train': 1.8594810962677002} 11/07/2021 06:19:23 - INFO - __main__ - Step 64528: {'lr': 0.0003102842157752727, 'samples': 12389376, 'steps': 64527, 'loss/train': 1.2847012281417847} 11/07/2021 06:19:23 - INFO - __main__ - Step 64529: {'lr': 0.0003102790656134284, 'samples': 12389568, 'steps': 64528, 'loss/train': 1.3959579467773438} 11/07/2021 06:19:23 - INFO - __main__ - Step 64530: {'lr': 0.0003102739154244233, 'samples': 12389760, 'steps': 64529, 'loss/train': 1.5774818658828735} 11/07/2021 06:19:24 - INFO - __main__ - Step 64531: {'lr': 0.0003102687652082597, 'samples': 12389952, 'steps': 64530, 'loss/train': 1.1206705570220947} 11/07/2021 06:19:25 - INFO - __main__ - Step 64532: {'lr': 0.0003102636149649399, 'samples': 12390144, 'steps': 64531, 'loss/train': 1.270370602607727} 11/07/2021 06:19:25 - INFO - __main__ - Step 64533: {'lr': 0.0003102584646944662, 'samples': 12390336, 'steps': 64532, 'loss/train': 1.4927480220794678} 11/07/2021 06:19:25 - INFO - __main__ - Step 64534: {'lr': 0.0003102533143968411, 'samples': 12390528, 'steps': 64533, 'loss/train': 0.7434923052787781} 11/07/2021 06:19:26 - INFO - __main__ - Step 64535: {'lr': 0.00031024816407206675, 'samples': 12390720, 'steps': 64534, 'loss/train': 1.5198426246643066} 11/07/2021 06:19:27 - INFO - __main__ - Step 64536: {'lr': 0.00031024301372014544, 'samples': 12390912, 'steps': 64535, 'loss/train': 1.7624022960662842} 11/07/2021 06:19:27 - INFO - __main__ - Step 64537: {'lr': 0.0003102378633410796, 'samples': 12391104, 'steps': 64536, 'loss/train': 1.4790247678756714} 11/07/2021 06:19:27 - INFO - __main__ - Step 64538: {'lr': 0.0003102327129348715, 'samples': 12391296, 'steps': 64537, 'loss/train': 1.6028329133987427} 11/07/2021 06:19:28 - INFO - __main__ - Step 64539: {'lr': 0.00031022756250152344, 'samples': 12391488, 'steps': 64538, 'loss/train': 1.3564287424087524} 11/07/2021 06:19:28 - INFO - __main__ - Step 64540: {'lr': 0.00031022241204103787, 'samples': 12391680, 'steps': 64539, 'loss/train': 1.2411730289459229} 11/07/2021 06:19:29 - INFO - __main__ - Step 64541: {'lr': 0.0003102172615534169, 'samples': 12391872, 'steps': 64540, 'loss/train': 1.3840043544769287} 11/07/2021 06:19:29 - INFO - __main__ - Step 64542: {'lr': 0.000310212111038663, 'samples': 12392064, 'steps': 64541, 'loss/train': 1.4428012371063232} 11/07/2021 06:19:30 - INFO - __main__ - Step 64543: {'lr': 0.00031020696049677846, 'samples': 12392256, 'steps': 64542, 'loss/train': 0.9364198446273804} 11/07/2021 06:19:30 - INFO - __main__ - Step 64544: {'lr': 0.0003102018099277656, 'samples': 12392448, 'steps': 64543, 'loss/train': 0.6283526420593262} 11/07/2021 06:19:31 - INFO - __main__ - Step 64545: {'lr': 0.0003101966593316267, 'samples': 12392640, 'steps': 64544, 'loss/train': 1.9443082809448242} 11/07/2021 06:19:31 - INFO - __main__ - Step 64546: {'lr': 0.00031019150870836414, 'samples': 12392832, 'steps': 64545, 'loss/train': 1.307923674583435} 11/07/2021 06:19:32 - INFO - __main__ - Step 64547: {'lr': 0.00031018635805798024, 'samples': 12393024, 'steps': 64546, 'loss/train': 0.6283186674118042} 11/07/2021 06:19:32 - INFO - __main__ - Step 64548: {'lr': 0.00031018120738047724, 'samples': 12393216, 'steps': 64547, 'loss/train': 1.143422245979309} 11/07/2021 06:19:33 - INFO - __main__ - Step 64549: {'lr': 0.00031017605667585754, 'samples': 12393408, 'steps': 64548, 'loss/train': 1.6108050346374512} 11/07/2021 06:19:33 - INFO - __main__ - Step 64550: {'lr': 0.0003101709059441234, 'samples': 12393600, 'steps': 64549, 'loss/train': 1.3961673974990845} 11/07/2021 06:19:34 - INFO - __main__ - Step 64551: {'lr': 0.00031016575518527726, 'samples': 12393792, 'steps': 64550, 'loss/train': 1.1469461917877197} 11/07/2021 06:19:34 - INFO - __main__ - Step 64552: {'lr': 0.0003101606043993213, 'samples': 12393984, 'steps': 64551, 'loss/train': 1.5289113521575928} 11/07/2021 06:19:35 - INFO - __main__ - Step 64553: {'lr': 0.0003101554535862579, 'samples': 12394176, 'steps': 64552, 'loss/train': 1.6134998798370361} 11/07/2021 06:19:35 - INFO - __main__ - Step 64554: {'lr': 0.0003101503027460894, 'samples': 12394368, 'steps': 64553, 'loss/train': 1.55754554271698} 11/07/2021 06:19:35 - INFO - __main__ - Step 64555: {'lr': 0.00031014515187881807, 'samples': 12394560, 'steps': 64554, 'loss/train': 1.5281881093978882} 11/07/2021 06:19:36 - INFO - __main__ - Step 64556: {'lr': 0.00031014000098444634, 'samples': 12394752, 'steps': 64555, 'loss/train': 1.498491883277893} 11/07/2021 06:19:37 - INFO - __main__ - Step 64557: {'lr': 0.00031013485006297644, 'samples': 12394944, 'steps': 64556, 'loss/train': 0.9949484467506409} 11/07/2021 06:19:37 - INFO - __main__ - Step 64558: {'lr': 0.00031012969911441065, 'samples': 12395136, 'steps': 64557, 'loss/train': 1.3057302236557007} 11/07/2021 06:19:37 - INFO - __main__ - Step 64559: {'lr': 0.00031012454813875135, 'samples': 12395328, 'steps': 64558, 'loss/train': 1.3553308248519897} 11/07/2021 06:19:38 - INFO - __main__ - Step 64560: {'lr': 0.0003101193971360009, 'samples': 12395520, 'steps': 64559, 'loss/train': 0.15921320021152496} 11/07/2021 06:19:38 - INFO - __main__ - Step 64561: {'lr': 0.0003101142461061615, 'samples': 12395712, 'steps': 64560, 'loss/train': 1.4483376741409302} 11/07/2021 06:19:39 - INFO - __main__ - Step 64562: {'lr': 0.00031010909504923555, 'samples': 12395904, 'steps': 64561, 'loss/train': 1.2970129251480103} 11/07/2021 06:19:40 - INFO - __main__ - Step 64563: {'lr': 0.00031010394396522553, 'samples': 12396096, 'steps': 64562, 'loss/train': 1.6322357654571533} 11/07/2021 06:19:40 - INFO - __main__ - Step 64564: {'lr': 0.00031009879285413345, 'samples': 12396288, 'steps': 64563, 'loss/train': 0.7702165246009827} 11/07/2021 06:19:40 - INFO - __main__ - Step 64565: {'lr': 0.00031009364171596184, 'samples': 12396480, 'steps': 64564, 'loss/train': 1.5590054988861084} 11/07/2021 06:19:41 - INFO - __main__ - Step 64566: {'lr': 0.00031008849055071293, 'samples': 12396672, 'steps': 64565, 'loss/train': 1.1740214824676514} 11/07/2021 06:19:42 - INFO - __main__ - Step 64567: {'lr': 0.00031008333935838905, 'samples': 12396864, 'steps': 64566, 'loss/train': 1.6948432922363281} 11/07/2021 06:19:42 - INFO - __main__ - Step 64568: {'lr': 0.0003100781881389926, 'samples': 12397056, 'steps': 64567, 'loss/train': 1.3072279691696167} 11/07/2021 06:19:42 - INFO - __main__ - Step 64569: {'lr': 0.00031007303689252583, 'samples': 12397248, 'steps': 64568, 'loss/train': 1.510092854499817} 11/07/2021 06:19:43 - INFO - __main__ - Step 64570: {'lr': 0.0003100678856189911, 'samples': 12397440, 'steps': 64569, 'loss/train': 1.2564352750778198} 11/07/2021 06:19:43 - INFO - __main__ - Step 64571: {'lr': 0.00031006273431839065, 'samples': 12397632, 'steps': 64570, 'loss/train': 1.599938988685608} 11/07/2021 06:19:44 - INFO - __main__ - Step 64572: {'lr': 0.00031005758299072685, 'samples': 12397824, 'steps': 64571, 'loss/train': 1.6229968070983887} 11/07/2021 06:19:44 - INFO - __main__ - Step 64573: {'lr': 0.00031005243163600207, 'samples': 12398016, 'steps': 64572, 'loss/train': 1.0405230522155762} 11/07/2021 06:19:45 - INFO - __main__ - Step 64574: {'lr': 0.0003100472802542186, 'samples': 12398208, 'steps': 64573, 'loss/train': 1.2883553504943848} 11/07/2021 06:19:45 - INFO - __main__ - Step 64575: {'lr': 0.0003100421288453787, 'samples': 12398400, 'steps': 64574, 'loss/train': 1.892146348953247} 11/07/2021 06:19:45 - INFO - __main__ - Step 64576: {'lr': 0.00031003697740948475, 'samples': 12398592, 'steps': 64575, 'loss/train': 1.2676140069961548} 11/07/2021 06:19:47 - INFO - __main__ - Step 64577: {'lr': 0.0003100318259465392, 'samples': 12398784, 'steps': 64576, 'loss/train': 1.6583062410354614} 11/07/2021 06:19:47 - INFO - __main__ - Step 64578: {'lr': 0.0003100266744565441, 'samples': 12398976, 'steps': 64577, 'loss/train': 1.395870566368103} 11/07/2021 06:19:47 - INFO - __main__ - Step 64579: {'lr': 0.00031002152293950193, 'samples': 12399168, 'steps': 64578, 'loss/train': 1.4247742891311646} 11/07/2021 06:19:48 - INFO - __main__ - Step 64580: {'lr': 0.000310016371395415, 'samples': 12399360, 'steps': 64579, 'loss/train': 1.6699081659317017} 11/07/2021 06:19:48 - INFO - __main__ - Step 64581: {'lr': 0.0003100112198242856, 'samples': 12399552, 'steps': 64580, 'loss/train': 1.2034149169921875} 11/07/2021 06:19:49 - INFO - __main__ - Step 64582: {'lr': 0.0003100060682261161, 'samples': 12399744, 'steps': 64581, 'loss/train': 1.2238582372665405} 11/07/2021 06:19:50 - INFO - __main__ - Step 64583: {'lr': 0.0003100009166009087, 'samples': 12399936, 'steps': 64582, 'loss/train': 1.3124338388442993} 11/07/2021 06:19:50 - INFO - __main__ - Step 64584: {'lr': 0.000309995764948666, 'samples': 12400128, 'steps': 64583, 'loss/train': 1.2176457643508911} 11/07/2021 06:19:50 - INFO - __main__ - Step 64585: {'lr': 0.00030999061326939, 'samples': 12400320, 'steps': 64584, 'loss/train': 0.9340316653251648} 11/07/2021 06:19:51 - INFO - __main__ - Step 64586: {'lr': 0.00030998546156308314, 'samples': 12400512, 'steps': 64585, 'loss/train': 0.14729942381381989} 11/07/2021 06:19:52 - INFO - __main__ - Step 64587: {'lr': 0.00030998030982974786, 'samples': 12400704, 'steps': 64586, 'loss/train': 1.6995962858200073} 11/07/2021 06:19:52 - INFO - __main__ - Step 64588: {'lr': 0.00030997515806938623, 'samples': 12400896, 'steps': 64587, 'loss/train': 2.580594301223755} 11/07/2021 06:19:52 - INFO - __main__ - Step 64589: {'lr': 0.0003099700062820008, 'samples': 12401088, 'steps': 64588, 'loss/train': 1.8716599941253662} 11/07/2021 06:19:53 - INFO - __main__ - Step 64590: {'lr': 0.0003099648544675939, 'samples': 12401280, 'steps': 64589, 'loss/train': 1.1471978425979614} 11/07/2021 06:19:53 - INFO - __main__ - Step 64591: {'lr': 0.0003099597026261677, 'samples': 12401472, 'steps': 64590, 'loss/train': 1.2542815208435059} 11/07/2021 06:19:54 - INFO - __main__ - Step 64592: {'lr': 0.0003099545507577245, 'samples': 12401664, 'steps': 64591, 'loss/train': 1.3937844038009644} 11/07/2021 06:19:55 - INFO - __main__ - Step 64593: {'lr': 0.00030994939886226674, 'samples': 12401856, 'steps': 64592, 'loss/train': 1.5915076732635498} 11/07/2021 06:19:55 - INFO - __main__ - Step 64594: {'lr': 0.0003099442469397967, 'samples': 12402048, 'steps': 64593, 'loss/train': 1.5810540914535522} 11/07/2021 06:19:55 - INFO - __main__ - Step 64595: {'lr': 0.0003099390949903168, 'samples': 12402240, 'steps': 64594, 'loss/train': 1.4533730745315552} 11/07/2021 06:19:56 - INFO - __main__ - Step 64596: {'lr': 0.00030993394301382916, 'samples': 12402432, 'steps': 64595, 'loss/train': 1.1989021301269531} 11/07/2021 06:19:56 - INFO - __main__ - Step 64597: {'lr': 0.00030992879101033634, 'samples': 12402624, 'steps': 64596, 'loss/train': 1.0566502809524536} 11/07/2021 06:19:57 - INFO - __main__ - Step 64598: {'lr': 0.00030992363897984043, 'samples': 12402816, 'steps': 64597, 'loss/train': 2.0005621910095215} 11/07/2021 06:19:57 - INFO - __main__ - Step 64599: {'lr': 0.00030991848692234387, 'samples': 12403008, 'steps': 64598, 'loss/train': 1.2011584043502808} 11/07/2021 06:19:58 - INFO - __main__ - Step 64600: {'lr': 0.00030991333483784895, 'samples': 12403200, 'steps': 64599, 'loss/train': 1.3270423412322998} 11/07/2021 06:19:58 - INFO - __main__ - Step 64601: {'lr': 0.000309908182726358, 'samples': 12403392, 'steps': 64600, 'loss/train': 1.2672487497329712} 11/07/2021 06:19:58 - INFO - __main__ - Step 64602: {'lr': 0.0003099030305878733, 'samples': 12403584, 'steps': 64601, 'loss/train': 1.4815977811813354} 11/07/2021 06:20:00 - INFO - __main__ - Step 64603: {'lr': 0.0003098978784223974, 'samples': 12403776, 'steps': 64602, 'loss/train': 1.5565108060836792} 11/07/2021 06:20:00 - INFO - __main__ - Step 64604: {'lr': 0.0003098927262299323, 'samples': 12403968, 'steps': 64603, 'loss/train': 1.292404294013977} 11/07/2021 06:20:00 - INFO - __main__ - Step 64605: {'lr': 0.0003098875740104805, 'samples': 12404160, 'steps': 64604, 'loss/train': 1.1197301149368286} 11/07/2021 06:20:01 - INFO - __main__ - Step 64606: {'lr': 0.00030988242176404425, 'samples': 12404352, 'steps': 64605, 'loss/train': 1.3953737020492554} 11/07/2021 06:20:01 - INFO - __main__ - Step 64607: {'lr': 0.00030987726949062596, 'samples': 12404544, 'steps': 64606, 'loss/train': 1.7192785739898682} 11/07/2021 06:20:02 - INFO - __main__ - Step 64608: {'lr': 0.00030987211719022784, 'samples': 12404736, 'steps': 64607, 'loss/train': 1.5412546396255493} 11/07/2021 06:20:02 - INFO - __main__ - Step 64609: {'lr': 0.00030986696486285227, 'samples': 12404928, 'steps': 64608, 'loss/train': 1.3460536003112793} 11/07/2021 06:20:03 - INFO - __main__ - Step 64610: {'lr': 0.00030986181250850165, 'samples': 12405120, 'steps': 64609, 'loss/train': 1.2490969896316528} 11/07/2021 06:20:03 - INFO - __main__ - Step 64611: {'lr': 0.00030985666012717814, 'samples': 12405312, 'steps': 64610, 'loss/train': 1.483860969543457} 11/07/2021 06:20:03 - INFO - __main__ - Step 64612: {'lr': 0.00030985150771888417, 'samples': 12405504, 'steps': 64611, 'loss/train': 1.4664640426635742} 11/07/2021 06:20:04 - INFO - __main__ - Step 64613: {'lr': 0.000309846355283622, 'samples': 12405696, 'steps': 64612, 'loss/train': 1.0578322410583496} 11/07/2021 06:20:05 - INFO - __main__ - Step 64614: {'lr': 0.000309841202821394, 'samples': 12405888, 'steps': 64613, 'loss/train': 1.5503655672073364} 11/07/2021 06:20:05 - INFO - __main__ - Step 64615: {'lr': 0.00030983605033220246, 'samples': 12406080, 'steps': 64614, 'loss/train': 1.579046368598938} 11/07/2021 06:20:05 - INFO - __main__ - Step 64616: {'lr': 0.0003098308978160498, 'samples': 12406272, 'steps': 64615, 'loss/train': 1.5327309370040894} 11/07/2021 06:20:06 - INFO - __main__ - Step 64617: {'lr': 0.0003098257452729382, 'samples': 12406464, 'steps': 64616, 'loss/train': 2.0569469928741455} 11/07/2021 06:20:07 - INFO - __main__ - Step 64618: {'lr': 0.00030982059270287006, 'samples': 12406656, 'steps': 64617, 'loss/train': 1.5048158168792725} 11/07/2021 06:20:07 - INFO - __main__ - Step 64619: {'lr': 0.00030981544010584767, 'samples': 12406848, 'steps': 64618, 'loss/train': 1.5536043643951416} 11/07/2021 06:20:08 - INFO - __main__ - Step 64620: {'lr': 0.0003098102874818734, 'samples': 12407040, 'steps': 64619, 'loss/train': 1.3544255495071411} 11/07/2021 06:20:08 - INFO - __main__ - Step 64621: {'lr': 0.0003098051348309495, 'samples': 12407232, 'steps': 64620, 'loss/train': 1.3201606273651123} 11/07/2021 06:20:08 - INFO - __main__ - Step 64622: {'lr': 0.0003097999821530783, 'samples': 12407424, 'steps': 64621, 'loss/train': 1.4889898300170898} 11/07/2021 06:20:09 - INFO - __main__ - Step 64623: {'lr': 0.0003097948294482622, 'samples': 12407616, 'steps': 64622, 'loss/train': 9.102642059326172} 11/07/2021 06:20:10 - INFO - __main__ - Step 64624: {'lr': 0.0003097896767165035, 'samples': 12407808, 'steps': 64623, 'loss/train': 1.1474658250808716} 11/07/2021 06:20:10 - INFO - __main__ - Step 64625: {'lr': 0.00030978452395780446, 'samples': 12408000, 'steps': 64624, 'loss/train': 1.1975020170211792} 11/07/2021 06:20:11 - INFO - __main__ - Step 64626: {'lr': 0.0003097793711721674, 'samples': 12408192, 'steps': 64625, 'loss/train': 1.2241607904434204} 11/07/2021 06:20:11 - INFO - __main__ - Step 64627: {'lr': 0.00030977421835959475, 'samples': 12408384, 'steps': 64626, 'loss/train': 1.340155005455017} 11/07/2021 06:20:11 - INFO - __main__ - Step 64628: {'lr': 0.0003097690655200887, 'samples': 12408576, 'steps': 64627, 'loss/train': 1.4539899826049805} 11/07/2021 06:20:13 - INFO - __main__ - Step 64629: {'lr': 0.0003097639126536516, 'samples': 12408768, 'steps': 64628, 'loss/train': 1.5408672094345093} 11/07/2021 06:20:13 - INFO - __main__ - Step 64630: {'lr': 0.00030975875976028586, 'samples': 12408960, 'steps': 64629, 'loss/train': 1.4350770711898804} 11/07/2021 06:20:13 - INFO - __main__ - Step 64631: {'lr': 0.0003097536068399938, 'samples': 12409152, 'steps': 64630, 'loss/train': 1.4715436697006226} 11/07/2021 06:20:14 - INFO - __main__ - Step 64632: {'lr': 0.00030974845389277763, 'samples': 12409344, 'steps': 64631, 'loss/train': 1.8157991170883179} 11/07/2021 06:20:14 - INFO - __main__ - Step 64633: {'lr': 0.00030974330091863974, 'samples': 12409536, 'steps': 64632, 'loss/train': 1.3551527261734009} 11/07/2021 06:20:15 - INFO - __main__ - Step 64634: {'lr': 0.00030973814791758237, 'samples': 12409728, 'steps': 64633, 'loss/train': 1.5451852083206177} 11/07/2021 06:20:15 - INFO - __main__ - Step 64635: {'lr': 0.000309732994889608, 'samples': 12409920, 'steps': 64634, 'loss/train': 1.609869360923767} 11/07/2021 06:20:16 - INFO - __main__ - Step 64636: {'lr': 0.0003097278418347188, 'samples': 12410112, 'steps': 64635, 'loss/train': 1.622768759727478} 11/07/2021 06:20:16 - INFO - __main__ - Step 64637: {'lr': 0.00030972268875291723, 'samples': 12410304, 'steps': 64636, 'loss/train': 1.505347728729248} 11/07/2021 06:20:16 - INFO - __main__ - Step 64638: {'lr': 0.0003097175356442055, 'samples': 12410496, 'steps': 64637, 'loss/train': 0.981511652469635} 11/07/2021 06:20:18 - INFO - __main__ - Step 64639: {'lr': 0.00030971238250858597, 'samples': 12410688, 'steps': 64638, 'loss/train': 1.3734675645828247} 11/07/2021 06:20:18 - INFO - __main__ - Step 64640: {'lr': 0.00030970722934606096, 'samples': 12410880, 'steps': 64639, 'loss/train': 1.1693629026412964} 11/07/2021 06:20:18 - INFO - __main__ - Step 64641: {'lr': 0.0003097020761566328, 'samples': 12411072, 'steps': 64640, 'loss/train': 0.08377185463905334} 11/07/2021 06:20:19 - INFO - __main__ - Step 64642: {'lr': 0.00030969692294030376, 'samples': 12411264, 'steps': 64641, 'loss/train': 1.4866790771484375} 11/07/2021 06:20:19 - INFO - __main__ - Step 64643: {'lr': 0.0003096917696970762, 'samples': 12411456, 'steps': 64642, 'loss/train': 1.630178689956665} 11/07/2021 06:20:20 - INFO - __main__ - Step 64644: {'lr': 0.00030968661642695255, 'samples': 12411648, 'steps': 64643, 'loss/train': 0.9758046865463257} 11/07/2021 06:20:21 - INFO - __main__ - Step 64645: {'lr': 0.00030968146312993503, 'samples': 12411840, 'steps': 64644, 'loss/train': 2.1064963340759277} 11/07/2021 06:20:21 - INFO - __main__ - Step 64646: {'lr': 0.0003096763098060259, 'samples': 12412032, 'steps': 64645, 'loss/train': 0.15061385929584503} 11/07/2021 06:20:21 - INFO - __main__ - Step 64647: {'lr': 0.00030967115645522754, 'samples': 12412224, 'steps': 64646, 'loss/train': 1.3265577554702759} 11/07/2021 06:20:22 - INFO - __main__ - Step 64648: {'lr': 0.0003096660030775423, 'samples': 12412416, 'steps': 64647, 'loss/train': 1.6801804304122925} 11/07/2021 06:20:22 - INFO - __main__ - Step 64649: {'lr': 0.0003096608496729724, 'samples': 12412608, 'steps': 64648, 'loss/train': 1.4591580629348755} 11/07/2021 06:20:23 - INFO - __main__ - Step 64650: {'lr': 0.00030965569624152037, 'samples': 12412800, 'steps': 64649, 'loss/train': 1.7962939739227295} 11/07/2021 06:20:23 - INFO - __main__ - Step 64651: {'lr': 0.00030965054278318837, 'samples': 12412992, 'steps': 64650, 'loss/train': 1.15523362159729} 11/07/2021 06:20:24 - INFO - __main__ - Step 64652: {'lr': 0.0003096453892979787, 'samples': 12413184, 'steps': 64651, 'loss/train': 1.3451582193374634} 11/07/2021 06:20:24 - INFO - __main__ - Step 64653: {'lr': 0.00030964023578589376, 'samples': 12413376, 'steps': 64652, 'loss/train': 1.8558392524719238} 11/07/2021 06:20:24 - INFO - __main__ - Step 64654: {'lr': 0.0003096350822469359, 'samples': 12413568, 'steps': 64653, 'loss/train': 1.110063076019287} 11/07/2021 06:20:25 - INFO - __main__ - Step 64655: {'lr': 0.00030962992868110734, 'samples': 12413760, 'steps': 64654, 'loss/train': 1.0178323984146118} 11/07/2021 06:20:26 - INFO - __main__ - Step 64656: {'lr': 0.0003096247750884105, 'samples': 12413952, 'steps': 64655, 'loss/train': 1.3924986124038696} 11/07/2021 06:20:26 - INFO - __main__ - Step 64657: {'lr': 0.00030961962146884765, 'samples': 12414144, 'steps': 64656, 'loss/train': 1.5906540155410767} 11/07/2021 06:20:26 - INFO - __main__ - Step 64658: {'lr': 0.0003096144678224211, 'samples': 12414336, 'steps': 64657, 'loss/train': 1.230492115020752} 11/07/2021 06:20:27 - INFO - __main__ - Step 64659: {'lr': 0.0003096093141491331, 'samples': 12414528, 'steps': 64658, 'loss/train': 1.5624099969863892} 11/07/2021 06:20:28 - INFO - __main__ - Step 64660: {'lr': 0.0003096041604489862, 'samples': 12414720, 'steps': 64659, 'loss/train': 1.1960197687149048} 11/07/2021 06:20:28 - INFO - __main__ - Step 64661: {'lr': 0.0003095990067219825, 'samples': 12414912, 'steps': 64660, 'loss/train': 1.7637509107589722} 11/07/2021 06:20:29 - INFO - __main__ - Step 64662: {'lr': 0.0003095938529681244, 'samples': 12415104, 'steps': 64661, 'loss/train': 1.117611289024353} 11/07/2021 06:20:29 - INFO - __main__ - Step 64663: {'lr': 0.0003095886991874143, 'samples': 12415296, 'steps': 64662, 'loss/train': 1.4197202920913696} 11/07/2021 06:20:29 - INFO - __main__ - Step 64664: {'lr': 0.00030958354537985444, 'samples': 12415488, 'steps': 64663, 'loss/train': 0.9076763391494751} 11/07/2021 06:20:31 - INFO - __main__ - Step 64665: {'lr': 0.00030957839154544713, 'samples': 12415680, 'steps': 64664, 'loss/train': 0.788428008556366} 11/07/2021 06:20:31 - INFO - __main__ - Step 64666: {'lr': 0.00030957323768419475, 'samples': 12415872, 'steps': 64665, 'loss/train': 1.5631747245788574} 11/07/2021 06:20:31 - INFO - __main__ - Step 64667: {'lr': 0.0003095680837960996, 'samples': 12416064, 'steps': 64666, 'loss/train': 1.4696147441864014} 11/07/2021 06:20:32 - INFO - __main__ - Step 64668: {'lr': 0.0003095629298811639, 'samples': 12416256, 'steps': 64667, 'loss/train': 5.7630934715271} 11/07/2021 06:20:32 - INFO - __main__ - Step 64669: {'lr': 0.0003095577759393902, 'samples': 12416448, 'steps': 64668, 'loss/train': 1.379296064376831} 11/07/2021 06:20:32 - INFO - __main__ - Step 64670: {'lr': 0.00030955262197078054, 'samples': 12416640, 'steps': 64669, 'loss/train': 1.6072453260421753} 11/07/2021 06:20:33 - INFO - __main__ - Step 64671: {'lr': 0.00030954746797533743, 'samples': 12416832, 'steps': 64670, 'loss/train': 1.4030202627182007} 11/07/2021 06:20:34 - INFO - __main__ - Step 64672: {'lr': 0.00030954231395306314, 'samples': 12417024, 'steps': 64671, 'loss/train': 1.6681654453277588} 11/07/2021 06:20:34 - INFO - __main__ - Step 64673: {'lr': 0.00030953715990396006, 'samples': 12417216, 'steps': 64672, 'loss/train': 1.3014835119247437} 11/07/2021 06:20:34 - INFO - __main__ - Step 64674: {'lr': 0.0003095320058280305, 'samples': 12417408, 'steps': 64673, 'loss/train': 0.6014445424079895} 11/07/2021 06:20:35 - INFO - __main__ - Step 64675: {'lr': 0.0003095268517252766, 'samples': 12417600, 'steps': 64674, 'loss/train': 1.5554429292678833} 11/07/2021 06:20:37 - INFO - __main__ - Step 64676: {'lr': 0.00030952169759570087, 'samples': 12417792, 'steps': 64675, 'loss/train': 1.0236736536026} 11/07/2021 06:20:37 - INFO - __main__ - Step 64677: {'lr': 0.00030951654343930557, 'samples': 12417984, 'steps': 64676, 'loss/train': 1.09987473487854} 11/07/2021 06:20:37 - INFO - __main__ - Step 64678: {'lr': 0.00030951138925609307, 'samples': 12418176, 'steps': 64677, 'loss/train': 1.5684386491775513} 11/07/2021 06:20:38 - INFO - __main__ - Step 64679: {'lr': 0.00030950623504606565, 'samples': 12418368, 'steps': 64678, 'loss/train': 1.8039933443069458} 11/07/2021 06:20:38 - INFO - __main__ - Step 64680: {'lr': 0.0003095010808092257, 'samples': 12418560, 'steps': 64679, 'loss/train': 1.7626852989196777} 11/07/2021 06:20:39 - INFO - __main__ - Step 64681: {'lr': 0.00030949592654557536, 'samples': 12418752, 'steps': 64680, 'loss/train': 0.9846799969673157} 11/07/2021 06:20:39 - INFO - __main__ - Step 64682: {'lr': 0.0003094907722551171, 'samples': 12418944, 'steps': 64681, 'loss/train': 1.799678087234497} 11/07/2021 06:20:40 - INFO - __main__ - Step 64683: {'lr': 0.00030948561793785325, 'samples': 12419136, 'steps': 64682, 'loss/train': 1.071650743484497} 11/07/2021 06:20:40 - INFO - __main__ - Step 64684: {'lr': 0.0003094804635937861, 'samples': 12419328, 'steps': 64683, 'loss/train': 1.3632618188858032} 11/07/2021 06:20:41 - INFO - __main__ - Step 64685: {'lr': 0.000309475309222918, 'samples': 12419520, 'steps': 64684, 'loss/train': 0.8740959763526917} 11/07/2021 06:20:41 - INFO - __main__ - Step 64686: {'lr': 0.0003094701548252512, 'samples': 12419712, 'steps': 64685, 'loss/train': 1.2610913515090942} 11/07/2021 06:20:41 - INFO - __main__ - Step 64687: {'lr': 0.00030946500040078805, 'samples': 12419904, 'steps': 64686, 'loss/train': 1.0280588865280151} 11/07/2021 06:20:42 - INFO - __main__ - Step 64688: {'lr': 0.0003094598459495309, 'samples': 12420096, 'steps': 64687, 'loss/train': 1.707588791847229} 11/07/2021 06:20:43 - INFO - __main__ - Step 64689: {'lr': 0.0003094546914714821, 'samples': 12420288, 'steps': 64688, 'loss/train': 1.1789630651474} 11/07/2021 06:20:43 - INFO - __main__ - Step 64690: {'lr': 0.00030944953696664384, 'samples': 12420480, 'steps': 64689, 'loss/train': 1.152160406112671} 11/07/2021 06:20:43 - INFO - __main__ - Step 64691: {'lr': 0.00030944438243501863, 'samples': 12420672, 'steps': 64690, 'loss/train': 0.6818108558654785} 11/07/2021 06:20:44 - INFO - __main__ - Step 64692: {'lr': 0.00030943922787660864, 'samples': 12420864, 'steps': 64691, 'loss/train': 1.3788914680480957} 11/07/2021 06:20:44 - INFO - __main__ - Step 64693: {'lr': 0.0003094340732914163, 'samples': 12421056, 'steps': 64692, 'loss/train': 1.3455688953399658} 11/07/2021 06:20:45 - INFO - __main__ - Step 64694: {'lr': 0.00030942891867944387, 'samples': 12421248, 'steps': 64693, 'loss/train': 1.2225260734558105} 11/07/2021 06:20:46 - INFO - __main__ - Step 64695: {'lr': 0.0003094237640406937, 'samples': 12421440, 'steps': 64694, 'loss/train': 1.1823158264160156} 11/07/2021 06:20:46 - INFO - __main__ - Step 64696: {'lr': 0.000309418609375168, 'samples': 12421632, 'steps': 64695, 'loss/train': 1.2562732696533203} 11/07/2021 06:20:46 - INFO - __main__ - Step 64697: {'lr': 0.0003094134546828693, 'samples': 12421824, 'steps': 64696, 'loss/train': 1.2488420009613037} 11/07/2021 06:20:47 - INFO - __main__ - Step 64698: {'lr': 0.00030940829996379984, 'samples': 12422016, 'steps': 64697, 'loss/train': 1.2618948221206665} 11/07/2021 06:20:48 - INFO - __main__ - Step 64699: {'lr': 0.0003094031452179618, 'samples': 12422208, 'steps': 64698, 'loss/train': 5.791396617889404} 11/07/2021 06:20:48 - INFO - __main__ - Step 64700: {'lr': 0.0003093979904453577, 'samples': 12422400, 'steps': 64699, 'loss/train': 1.2669965028762817} 11/07/2021 06:20:48 - INFO - __main__ - Step 64701: {'lr': 0.00030939283564598976, 'samples': 12422592, 'steps': 64700, 'loss/train': 1.2739782333374023} 11/07/2021 06:20:49 - INFO - __main__ - Step 64702: {'lr': 0.0003093876808198603, 'samples': 12422784, 'steps': 64701, 'loss/train': 1.1718430519104004} 11/07/2021 06:20:49 - INFO - __main__ - Step 64703: {'lr': 0.0003093825259669717, 'samples': 12422976, 'steps': 64702, 'loss/train': 1.6826294660568237} 11/07/2021 06:20:50 - INFO - __main__ - Step 64704: {'lr': 0.00030937737108732623, 'samples': 12423168, 'steps': 64703, 'loss/train': 1.0242314338684082} 11/07/2021 06:20:50 - INFO - __main__ - Step 64705: {'lr': 0.00030937221618092633, 'samples': 12423360, 'steps': 64704, 'loss/train': 1.2611570358276367} 11/07/2021 06:20:51 - INFO - __main__ - Step 64706: {'lr': 0.00030936706124777406, 'samples': 12423552, 'steps': 64705, 'loss/train': 1.8013652563095093} 11/07/2021 06:20:51 - INFO - __main__ - Step 64707: {'lr': 0.00030936190628787203, 'samples': 12423744, 'steps': 64706, 'loss/train': 1.1112252473831177} 11/07/2021 06:20:52 - INFO - __main__ - Step 64708: {'lr': 0.00030935675130122235, 'samples': 12423936, 'steps': 64707, 'loss/train': 1.3938384056091309} 11/07/2021 06:20:52 - INFO - __main__ - Step 64709: {'lr': 0.0003093515962878275, 'samples': 12424128, 'steps': 64708, 'loss/train': 1.5017311573028564} 11/07/2021 06:20:53 - INFO - __main__ - Step 64710: {'lr': 0.00030934644124768976, 'samples': 12424320, 'steps': 64709, 'loss/train': 1.6358330249786377} 11/07/2021 06:20:54 - INFO - __main__ - Step 64711: {'lr': 0.00030934128618081134, 'samples': 12424512, 'steps': 64710, 'loss/train': 1.1875364780426025} 11/07/2021 06:20:54 - INFO - __main__ - Step 64712: {'lr': 0.00030933613108719476, 'samples': 12424704, 'steps': 64711, 'loss/train': 0.8283331990242004} 11/07/2021 06:20:54 - INFO - __main__ - Step 64713: {'lr': 0.0003093309759668422, 'samples': 12424896, 'steps': 64712, 'loss/train': 1.3879083395004272} 11/07/2021 06:20:55 - INFO - __main__ - Step 64714: {'lr': 0.00030932582081975597, 'samples': 12425088, 'steps': 64713, 'loss/train': 0.9251329898834229} 11/07/2021 06:20:56 - INFO - __main__ - Step 64715: {'lr': 0.0003093206656459384, 'samples': 12425280, 'steps': 64714, 'loss/train': 0.11130845546722412} 11/07/2021 06:20:56 - INFO - __main__ - Step 64716: {'lr': 0.00030931551044539196, 'samples': 12425472, 'steps': 64715, 'loss/train': 1.5658456087112427} 11/07/2021 06:20:56 - INFO - __main__ - Step 64717: {'lr': 0.0003093103552181188, 'samples': 12425664, 'steps': 64716, 'loss/train': 1.2518538236618042} 11/07/2021 06:20:57 - INFO - __main__ - Step 64718: {'lr': 0.0003093051999641214, 'samples': 12425856, 'steps': 64717, 'loss/train': 1.3927249908447266} 11/07/2021 06:20:57 - INFO - __main__ - Step 64719: {'lr': 0.00030930004468340187, 'samples': 12426048, 'steps': 64718, 'loss/train': 1.3434505462646484} 11/07/2021 06:20:58 - INFO - __main__ - Step 64720: {'lr': 0.00030929488937596274, 'samples': 12426240, 'steps': 64719, 'loss/train': 1.3865703344345093} 11/07/2021 06:20:58 - INFO - __main__ - Step 64721: {'lr': 0.0003092897340418062, 'samples': 12426432, 'steps': 64720, 'loss/train': 0.9534475803375244} 11/07/2021 06:20:59 - INFO - __main__ - Step 64722: {'lr': 0.0003092845786809346, 'samples': 12426624, 'steps': 64721, 'loss/train': 0.5085859894752502} 11/07/2021 06:20:59 - INFO - __main__ - Step 64723: {'lr': 0.0003092794232933503, 'samples': 12426816, 'steps': 64722, 'loss/train': 1.780808687210083} 11/07/2021 06:21:00 - INFO - __main__ - Step 64724: {'lr': 0.00030927426787905564, 'samples': 12427008, 'steps': 64723, 'loss/train': 1.3705735206604004} 11/07/2021 06:21:01 - INFO - __main__ - Step 64725: {'lr': 0.000309269112438053, 'samples': 12427200, 'steps': 64724, 'loss/train': 1.648853063583374} 11/07/2021 06:21:01 - INFO - __main__ - Step 64726: {'lr': 0.0003092639569703445, 'samples': 12427392, 'steps': 64725, 'loss/train': 1.529177188873291} 11/07/2021 06:21:01 - INFO - __main__ - Step 64727: {'lr': 0.0003092588014759325, 'samples': 12427584, 'steps': 64726, 'loss/train': 1.1308536529541016} 11/07/2021 06:21:02 - INFO - __main__ - Step 64728: {'lr': 0.00030925364595481953, 'samples': 12427776, 'steps': 64727, 'loss/train': 1.4135950803756714} 11/07/2021 06:21:02 - INFO - __main__ - Step 64729: {'lr': 0.00030924849040700773, 'samples': 12427968, 'steps': 64728, 'loss/train': 0.9666653275489807} 11/07/2021 06:21:02 - INFO - __main__ - Step 64730: {'lr': 0.0003092433348324995, 'samples': 12428160, 'steps': 64729, 'loss/train': 1.3004740476608276} 11/07/2021 06:21:04 - INFO - __main__ - Step 64731: {'lr': 0.00030923817923129716, 'samples': 12428352, 'steps': 64730, 'loss/train': 1.8030682802200317} 11/07/2021 06:21:05 - INFO - __main__ - Step 64732: {'lr': 0.00030923302360340294, 'samples': 12428544, 'steps': 64731, 'loss/train': 1.3462437391281128} 11/07/2021 06:21:05 - INFO - __main__ - Step 64733: {'lr': 0.0003092278679488193, 'samples': 12428736, 'steps': 64732, 'loss/train': 1.175147294998169} 11/07/2021 06:21:05 - INFO - __main__ - Step 64734: {'lr': 0.0003092227122675484, 'samples': 12428928, 'steps': 64733, 'loss/train': 0.1521930992603302} 11/07/2021 06:21:06 - INFO - __main__ - Step 64735: {'lr': 0.0003092175565595927, 'samples': 12429120, 'steps': 64734, 'loss/train': 0.12361344695091248} 11/07/2021 06:21:07 - INFO - __main__ - Step 64736: {'lr': 0.0003092124008249545, 'samples': 12429312, 'steps': 64735, 'loss/train': 0.9511817097663879} 11/07/2021 06:21:07 - INFO - __main__ - Step 64737: {'lr': 0.00030920724506363614, 'samples': 12429504, 'steps': 64736, 'loss/train': 1.153037667274475} 11/07/2021 06:21:07 - INFO - __main__ - Step 64738: {'lr': 0.0003092020892756399, 'samples': 12429696, 'steps': 64737, 'loss/train': 1.3899072408676147} 11/07/2021 06:21:08 - INFO - __main__ - Step 64739: {'lr': 0.0003091969334609681, 'samples': 12429888, 'steps': 64738, 'loss/train': 1.2514845132827759} 11/07/2021 06:21:08 - INFO - __main__ - Step 64740: {'lr': 0.00030919177761962305, 'samples': 12430080, 'steps': 64739, 'loss/train': 0.7131841778755188} 11/07/2021 06:21:09 - INFO - __main__ - Step 64741: {'lr': 0.0003091866217516071, 'samples': 12430272, 'steps': 64740, 'loss/train': 0.8252730369567871} 11/07/2021 06:21:09 - INFO - __main__ - Step 64742: {'lr': 0.0003091814658569226, 'samples': 12430464, 'steps': 64741, 'loss/train': 2.147662878036499} 11/07/2021 06:21:10 - INFO - __main__ - Step 64743: {'lr': 0.00030917630993557176, 'samples': 12430656, 'steps': 64742, 'loss/train': 1.393196702003479} 11/07/2021 06:21:10 - INFO - __main__ - Step 64744: {'lr': 0.0003091711539875571, 'samples': 12430848, 'steps': 64743, 'loss/train': 1.4075959920883179} 11/07/2021 06:21:10 - INFO - __main__ - Step 64745: {'lr': 0.0003091659980128808, 'samples': 12431040, 'steps': 64744, 'loss/train': 1.3961005210876465} 11/07/2021 06:21:11 - INFO - __main__ - Step 64746: {'lr': 0.00030916084201154523, 'samples': 12431232, 'steps': 64745, 'loss/train': 1.5435726642608643} 11/07/2021 06:21:12 - INFO - __main__ - Step 64747: {'lr': 0.00030915568598355265, 'samples': 12431424, 'steps': 64746, 'loss/train': 1.4521009922027588} 11/07/2021 06:21:12 - INFO - __main__ - Step 64748: {'lr': 0.00030915052992890545, 'samples': 12431616, 'steps': 64747, 'loss/train': 1.3225255012512207} 11/07/2021 06:21:13 - INFO - __main__ - Step 64749: {'lr': 0.00030914537384760596, 'samples': 12431808, 'steps': 64748, 'loss/train': 1.2773340940475464} 11/07/2021 06:21:13 - INFO - __main__ - Step 64750: {'lr': 0.0003091402177396564, 'samples': 12432000, 'steps': 64749, 'loss/train': 0.20051749050617218} 11/07/2021 06:21:13 - INFO - __main__ - Step 64751: {'lr': 0.0003091350616050592, 'samples': 12432192, 'steps': 64750, 'loss/train': 1.9668525457382202} 11/07/2021 06:21:14 - INFO - __main__ - Step 64752: {'lr': 0.00030912990544381677, 'samples': 12432384, 'steps': 64751, 'loss/train': 1.5039644241333008} 11/07/2021 06:21:15 - INFO - __main__ - Step 64753: {'lr': 0.0003091247492559312, 'samples': 12432576, 'steps': 64752, 'loss/train': 1.7138373851776123} 11/07/2021 06:21:15 - INFO - __main__ - Step 64754: {'lr': 0.0003091195930414049, 'samples': 12432768, 'steps': 64753, 'loss/train': 1.2270245552062988} 11/07/2021 06:21:15 - INFO - __main__ - Step 64755: {'lr': 0.00030911443680024033, 'samples': 12432960, 'steps': 64754, 'loss/train': 1.69589102268219} 11/07/2021 06:21:16 - INFO - __main__ - Step 64756: {'lr': 0.00030910928053243963, 'samples': 12433152, 'steps': 64755, 'loss/train': 1.305094599723816} 11/07/2021 06:21:17 - INFO - __main__ - Step 64757: {'lr': 0.00030910412423800523, 'samples': 12433344, 'steps': 64756, 'loss/train': 1.6126759052276611} 11/07/2021 06:21:17 - INFO - __main__ - Step 64758: {'lr': 0.00030909896791693947, 'samples': 12433536, 'steps': 64757, 'loss/train': 1.6780248880386353} 11/07/2021 06:21:17 - INFO - __main__ - Step 64759: {'lr': 0.00030909381156924456, 'samples': 12433728, 'steps': 64758, 'loss/train': 1.5091902017593384} 11/07/2021 06:21:18 - INFO - __main__ - Step 64760: {'lr': 0.0003090886551949229, 'samples': 12433920, 'steps': 64759, 'loss/train': 1.3441435098648071} 11/07/2021 06:21:18 - INFO - __main__ - Step 64761: {'lr': 0.0003090834987939768, 'samples': 12434112, 'steps': 64760, 'loss/train': 1.519511103630066} 11/07/2021 06:21:19 - INFO - __main__ - Step 64762: {'lr': 0.00030907834236640856, 'samples': 12434304, 'steps': 64761, 'loss/train': 1.5836818218231201} 11/07/2021 06:21:20 - INFO - __main__ - Step 64763: {'lr': 0.00030907318591222056, 'samples': 12434496, 'steps': 64762, 'loss/train': 1.6075648069381714} 11/07/2021 06:21:20 - INFO - __main__ - Step 64764: {'lr': 0.0003090680294314151, 'samples': 12434688, 'steps': 64763, 'loss/train': 1.2624013423919678} 11/07/2021 06:21:20 - INFO - __main__ - Step 64765: {'lr': 0.00030906287292399457, 'samples': 12434880, 'steps': 64764, 'loss/train': 1.373499870300293} 11/07/2021 06:21:21 - INFO - __main__ - Step 64766: {'lr': 0.0003090577163899611, 'samples': 12435072, 'steps': 64765, 'loss/train': 1.8060362339019775} 11/07/2021 06:21:22 - INFO - __main__ - Step 64767: {'lr': 0.00030905255982931716, 'samples': 12435264, 'steps': 64766, 'loss/train': 0.1155720055103302} 11/07/2021 06:21:22 - INFO - __main__ - Step 64768: {'lr': 0.0003090474032420651, 'samples': 12435456, 'steps': 64767, 'loss/train': 1.4035189151763916} 11/07/2021 06:21:23 - INFO - __main__ - Step 64769: {'lr': 0.00030904224662820716, 'samples': 12435648, 'steps': 64768, 'loss/train': 1.4494366645812988} 11/07/2021 06:21:23 - INFO - __main__ - Step 64770: {'lr': 0.00030903708998774573, 'samples': 12435840, 'steps': 64769, 'loss/train': 0.9973698854446411} 11/07/2021 06:21:23 - INFO - __main__ - Step 64771: {'lr': 0.00030903193332068303, 'samples': 12436032, 'steps': 64770, 'loss/train': 1.3705912828445435} 11/07/2021 06:21:25 - INFO - __main__ - Step 64772: {'lr': 0.0003090267766270215, 'samples': 12436224, 'steps': 64771, 'loss/train': 1.680672526359558} 11/07/2021 06:21:25 - INFO - __main__ - Step 64773: {'lr': 0.00030902161990676344, 'samples': 12436416, 'steps': 64772, 'loss/train': 1.1254994869232178} 11/07/2021 06:21:25 - INFO - __main__ - Step 64774: {'lr': 0.00030901646315991104, 'samples': 12436608, 'steps': 64773, 'loss/train': 0.4444729685783386} 11/07/2021 06:21:26 - INFO - __main__ - Step 64775: {'lr': 0.00030901130638646686, 'samples': 12436800, 'steps': 64774, 'loss/train': 1.7578927278518677} 11/07/2021 06:21:26 - INFO - __main__ - Step 64776: {'lr': 0.00030900614958643305, 'samples': 12436992, 'steps': 64775, 'loss/train': 1.729585886001587} 11/07/2021 06:21:26 - INFO - __main__ - Step 64777: {'lr': 0.00030900099275981194, 'samples': 12437184, 'steps': 64776, 'loss/train': 1.6513299942016602} 11/07/2021 06:21:27 - INFO - __main__ - Step 64778: {'lr': 0.000308995835906606, 'samples': 12437376, 'steps': 64777, 'loss/train': 1.5330910682678223} 11/07/2021 06:21:28 - INFO - __main__ - Step 64779: {'lr': 0.00030899067902681734, 'samples': 12437568, 'steps': 64778, 'loss/train': 1.6851475238800049} 11/07/2021 06:21:28 - INFO - __main__ - Step 64780: {'lr': 0.0003089855221204484, 'samples': 12437760, 'steps': 64779, 'loss/train': 1.2228647470474243} 11/07/2021 06:21:28 - INFO - __main__ - Step 64781: {'lr': 0.0003089803651875015, 'samples': 12437952, 'steps': 64780, 'loss/train': 1.5288466215133667} 11/07/2021 06:21:29 - INFO - __main__ - Step 64782: {'lr': 0.000308975208227979, 'samples': 12438144, 'steps': 64781, 'loss/train': 1.1898071765899658} 11/07/2021 06:21:30 - INFO - __main__ - Step 64783: {'lr': 0.0003089700512418831, 'samples': 12438336, 'steps': 64782, 'loss/train': 1.3588953018188477} 11/07/2021 06:21:30 - INFO - __main__ - Step 64784: {'lr': 0.00030896489422921623, 'samples': 12438528, 'steps': 64783, 'loss/train': 1.3011415004730225} 11/07/2021 06:21:31 - INFO - __main__ - Step 64785: {'lr': 0.00030895973718998075, 'samples': 12438720, 'steps': 64784, 'loss/train': 1.4825669527053833} 11/07/2021 06:21:31 - INFO - __main__ - Step 64786: {'lr': 0.00030895458012417896, 'samples': 12438912, 'steps': 64785, 'loss/train': 1.2494871616363525} 11/07/2021 06:21:31 - INFO - __main__ - Step 64787: {'lr': 0.000308949423031813, 'samples': 12439104, 'steps': 64786, 'loss/train': 1.3616538047790527} 11/07/2021 06:21:32 - INFO - __main__ - Step 64788: {'lr': 0.0003089442659128854, 'samples': 12439296, 'steps': 64787, 'loss/train': 1.4683374166488647} 11/07/2021 06:21:33 - INFO - __main__ - Step 64789: {'lr': 0.00030893910876739845, 'samples': 12439488, 'steps': 64788, 'loss/train': 0.963502049446106} 11/07/2021 06:21:33 - INFO - __main__ - Step 64790: {'lr': 0.00030893395159535444, 'samples': 12439680, 'steps': 64789, 'loss/train': 1.540330410003662} 11/07/2021 06:21:33 - INFO - __main__ - Step 64791: {'lr': 0.0003089287943967557, 'samples': 12439872, 'steps': 64790, 'loss/train': 1.3155933618545532} 11/07/2021 06:21:34 - INFO - __main__ - Step 64792: {'lr': 0.00030892363717160455, 'samples': 12440064, 'steps': 64791, 'loss/train': 1.6471589803695679} 11/07/2021 06:21:35 - INFO - __main__ - Step 64793: {'lr': 0.00030891847991990334, 'samples': 12440256, 'steps': 64792, 'loss/train': 1.2958241701126099} 11/07/2021 06:21:35 - INFO - __main__ - Step 64794: {'lr': 0.00030891332264165435, 'samples': 12440448, 'steps': 64793, 'loss/train': 1.552553653717041} 11/07/2021 06:21:35 - INFO - __main__ - Step 64795: {'lr': 0.0003089081653368599, 'samples': 12440640, 'steps': 64794, 'loss/train': 1.4694684743881226} 11/07/2021 06:21:36 - INFO - __main__ - Step 64796: {'lr': 0.00030890300800552237, 'samples': 12440832, 'steps': 64795, 'loss/train': 1.3936485052108765} 11/07/2021 06:21:36 - INFO - __main__ - Step 64797: {'lr': 0.00030889785064764405, 'samples': 12441024, 'steps': 64796, 'loss/train': 1.4169387817382812} 11/07/2021 06:21:37 - INFO - __main__ - Step 64798: {'lr': 0.00030889269326322727, 'samples': 12441216, 'steps': 64797, 'loss/train': 0.9973545670509338} 11/07/2021 06:21:37 - INFO - __main__ - Step 64799: {'lr': 0.0003088875358522744, 'samples': 12441408, 'steps': 64798, 'loss/train': 1.069615364074707} 11/07/2021 06:21:38 - INFO - __main__ - Step 64800: {'lr': 0.00030888237841478764, 'samples': 12441600, 'steps': 64799, 'loss/train': 1.1841773986816406} 11/07/2021 06:21:38 - INFO - __main__ - Step 64801: {'lr': 0.0003088772209507694, 'samples': 12441792, 'steps': 64800, 'loss/train': 1.3479523658752441} 11/07/2021 06:21:38 - INFO - __main__ - Step 64802: {'lr': 0.000308872063460222, 'samples': 12441984, 'steps': 64801, 'loss/train': 1.2408267259597778} 11/07/2021 06:21:39 - INFO - __main__ - Step 64803: {'lr': 0.0003088669059431478, 'samples': 12442176, 'steps': 64802, 'loss/train': 1.7272355556488037} 11/07/2021 06:21:40 - INFO - __main__ - Step 64804: {'lr': 0.000308861748399549, 'samples': 12442368, 'steps': 64803, 'loss/train': 1.4060934782028198} 11/07/2021 06:21:40 - INFO - __main__ - Step 64805: {'lr': 0.00030885659082942806, 'samples': 12442560, 'steps': 64804, 'loss/train': 1.101048469543457} 11/07/2021 06:21:40 - INFO - __main__ - Step 64806: {'lr': 0.00030885143323278717, 'samples': 12442752, 'steps': 64805, 'loss/train': 1.6129337549209595} 11/07/2021 06:21:41 - INFO - __main__ - Step 64807: {'lr': 0.00030884627560962886, 'samples': 12442944, 'steps': 64806, 'loss/train': 1.3819926977157593} 11/07/2021 06:21:42 - INFO - __main__ - Step 64808: {'lr': 0.00030884111795995525, 'samples': 12443136, 'steps': 64807, 'loss/train': 1.2896251678466797} 11/07/2021 06:21:43 - INFO - __main__ - Step 64809: {'lr': 0.0003088359602837688, 'samples': 12443328, 'steps': 64808, 'loss/train': 1.264945387840271} 11/07/2021 06:21:43 - INFO - __main__ - Step 64810: {'lr': 0.0003088308025810717, 'samples': 12443520, 'steps': 64809, 'loss/train': 1.6344773769378662} 11/07/2021 06:21:43 - INFO - __main__ - Step 64811: {'lr': 0.0003088256448518664, 'samples': 12443712, 'steps': 64810, 'loss/train': 1.1079655885696411} 11/07/2021 06:21:44 - INFO - __main__ - Step 64812: {'lr': 0.00030882048709615515, 'samples': 12443904, 'steps': 64811, 'loss/train': 1.153310775756836} 11/07/2021 06:21:44 - INFO - __main__ - Step 64813: {'lr': 0.00030881532931394026, 'samples': 12444096, 'steps': 64812, 'loss/train': 1.5462653636932373} 11/07/2021 06:21:45 - INFO - __main__ - Step 64814: {'lr': 0.00030881017150522416, 'samples': 12444288, 'steps': 64813, 'loss/train': 0.43516916036605835} 11/07/2021 06:21:45 - INFO - __main__ - Step 64815: {'lr': 0.0003088050136700091, 'samples': 12444480, 'steps': 64814, 'loss/train': 1.2297500371932983} 11/07/2021 06:21:46 - INFO - __main__ - Step 64816: {'lr': 0.00030879985580829734, 'samples': 12444672, 'steps': 64815, 'loss/train': 1.252029299736023} 11/07/2021 06:21:46 - INFO - __main__ - Step 64817: {'lr': 0.0003087946979200913, 'samples': 12444864, 'steps': 64816, 'loss/train': 0.6851214170455933} 11/07/2021 06:21:46 - INFO - __main__ - Step 64818: {'lr': 0.0003087895400053933, 'samples': 12445056, 'steps': 64817, 'loss/train': 1.1707615852355957} 11/07/2021 06:21:47 - INFO - __main__ - Step 64819: {'lr': 0.0003087843820642057, 'samples': 12445248, 'steps': 64818, 'loss/train': 1.2018386125564575} 11/07/2021 06:21:48 - INFO - __main__ - Step 64820: {'lr': 0.00030877922409653063, 'samples': 12445440, 'steps': 64819, 'loss/train': 1.5277125835418701} 11/07/2021 06:21:48 - INFO - __main__ - Step 64821: {'lr': 0.0003087740661023706, 'samples': 12445632, 'steps': 64820, 'loss/train': 0.3694879710674286} 11/07/2021 06:21:49 - INFO - __main__ - Step 64822: {'lr': 0.0003087689080817279, 'samples': 12445824, 'steps': 64821, 'loss/train': 0.8739466667175293} 11/07/2021 06:21:49 - INFO - __main__ - Step 64823: {'lr': 0.0003087637500346048, 'samples': 12446016, 'steps': 64822, 'loss/train': 1.2477155923843384} 11/07/2021 06:21:50 - INFO - __main__ - Step 64824: {'lr': 0.0003087585919610037, 'samples': 12446208, 'steps': 64823, 'loss/train': 0.6469504833221436} 11/07/2021 06:21:50 - INFO - __main__ - Step 64825: {'lr': 0.0003087534338609269, 'samples': 12446400, 'steps': 64824, 'loss/train': 1.3115787506103516} 11/07/2021 06:21:51 - INFO - __main__ - Step 64826: {'lr': 0.0003087482757343767, 'samples': 12446592, 'steps': 64825, 'loss/train': 1.5024675130844116} 11/07/2021 06:21:51 - INFO - __main__ - Step 64827: {'lr': 0.00030874311758135535, 'samples': 12446784, 'steps': 64826, 'loss/train': 1.377240777015686} 11/07/2021 06:21:51 - INFO - __main__ - Step 64828: {'lr': 0.0003087379594018653, 'samples': 12446976, 'steps': 64827, 'loss/train': 0.9131162166595459} 11/07/2021 06:21:53 - INFO - __main__ - Step 64829: {'lr': 0.0003087328011959089, 'samples': 12447168, 'steps': 64828, 'loss/train': 0.5752022862434387} 11/07/2021 06:21:53 - INFO - __main__ - Step 64830: {'lr': 0.0003087276429634884, 'samples': 12447360, 'steps': 64829, 'loss/train': 2.08362078666687} 11/07/2021 06:21:53 - INFO - __main__ - Step 64831: {'lr': 0.0003087224847046061, 'samples': 12447552, 'steps': 64830, 'loss/train': 2.8437318801879883} 11/07/2021 06:21:54 - INFO - __main__ - Step 64832: {'lr': 0.0003087173264192643, 'samples': 12447744, 'steps': 64831, 'loss/train': 1.8131057024002075} 11/07/2021 06:21:54 - INFO - __main__ - Step 64833: {'lr': 0.00030871216810746544, 'samples': 12447936, 'steps': 64832, 'loss/train': 0.8744111061096191} 11/07/2021 06:21:54 - INFO - __main__ - Step 64834: {'lr': 0.0003087070097692118, 'samples': 12448128, 'steps': 64833, 'loss/train': 0.9457012414932251} 11/07/2021 06:21:55 - INFO - __main__ - Step 64835: {'lr': 0.00030870185140450564, 'samples': 12448320, 'steps': 64834, 'loss/train': 0.9038562178611755} 11/07/2021 06:21:56 - INFO - __main__ - Step 64836: {'lr': 0.00030869669301334936, 'samples': 12448512, 'steps': 64835, 'loss/train': 1.470394492149353} 11/07/2021 06:21:56 - INFO - __main__ - Step 64837: {'lr': 0.0003086915345957452, 'samples': 12448704, 'steps': 64836, 'loss/train': 1.6446168422698975} 11/07/2021 06:21:56 - INFO - __main__ - Step 64838: {'lr': 0.0003086863761516956, 'samples': 12448896, 'steps': 64837, 'loss/train': 1.3548681735992432} 11/07/2021 06:21:57 - INFO - __main__ - Step 64839: {'lr': 0.0003086812176812028, 'samples': 12449088, 'steps': 64838, 'loss/train': 1.3981322050094604} 11/07/2021 06:21:58 - INFO - __main__ - Step 64840: {'lr': 0.00030867605918426916, 'samples': 12449280, 'steps': 64839, 'loss/train': 1.1039109230041504} 11/07/2021 06:21:58 - INFO - __main__ - Step 64841: {'lr': 0.000308670900660897, 'samples': 12449472, 'steps': 64840, 'loss/train': 1.6446537971496582} 11/07/2021 06:21:58 - INFO - __main__ - Step 64842: {'lr': 0.00030866574211108863, 'samples': 12449664, 'steps': 64841, 'loss/train': 1.4737292528152466} 11/07/2021 06:21:59 - INFO - __main__ - Step 64843: {'lr': 0.0003086605835348464, 'samples': 12449856, 'steps': 64842, 'loss/train': 1.143540620803833} 11/07/2021 06:21:59 - INFO - __main__ - Step 64844: {'lr': 0.0003086554249321726, 'samples': 12450048, 'steps': 64843, 'loss/train': 1.0953729152679443} 11/07/2021 06:22:00 - INFO - __main__ - Step 64845: {'lr': 0.00030865026630306954, 'samples': 12450240, 'steps': 64844, 'loss/train': 1.4300092458724976} 11/07/2021 06:22:01 - INFO - __main__ - Step 64846: {'lr': 0.0003086451076475396, 'samples': 12450432, 'steps': 64845, 'loss/train': 1.186070203781128} 11/07/2021 06:22:01 - INFO - __main__ - Step 64847: {'lr': 0.00030863994896558513, 'samples': 12450624, 'steps': 64846, 'loss/train': 1.0530965328216553} 11/07/2021 06:22:01 - INFO - __main__ - Step 64848: {'lr': 0.0003086347902572083, 'samples': 12450816, 'steps': 64847, 'loss/train': 1.910246729850769} 11/07/2021 06:22:02 - INFO - __main__ - Step 64849: {'lr': 0.0003086296315224116, 'samples': 12451008, 'steps': 64848, 'loss/train': 1.4429287910461426} 11/07/2021 06:22:03 - INFO - __main__ - Step 64850: {'lr': 0.00030862447276119734, 'samples': 12451200, 'steps': 64849, 'loss/train': 1.153667688369751} 11/07/2021 06:22:03 - INFO - __main__ - Step 64851: {'lr': 0.0003086193139735677, 'samples': 12451392, 'steps': 64850, 'loss/train': 1.5782700777053833} 11/07/2021 06:22:03 - INFO - __main__ - Step 64852: {'lr': 0.00030861415515952517, 'samples': 12451584, 'steps': 64851, 'loss/train': 1.3089630603790283} 11/07/2021 06:22:04 - INFO - __main__ - Step 64853: {'lr': 0.000308608996319072, 'samples': 12451776, 'steps': 64852, 'loss/train': 1.3650670051574707} 11/07/2021 06:22:04 - INFO - __main__ - Step 64854: {'lr': 0.0003086038374522105, 'samples': 12451968, 'steps': 64853, 'loss/train': 1.069608211517334} 11/07/2021 06:22:05 - INFO - __main__ - Step 64855: {'lr': 0.00030859867855894296, 'samples': 12452160, 'steps': 64854, 'loss/train': 0.8994445204734802} 11/07/2021 06:22:06 - INFO - __main__ - Step 64856: {'lr': 0.00030859351963927184, 'samples': 12452352, 'steps': 64855, 'loss/train': 1.2551093101501465} 11/07/2021 06:22:06 - INFO - __main__ - Step 64857: {'lr': 0.00030858836069319937, 'samples': 12452544, 'steps': 64856, 'loss/train': 1.2338429689407349} 11/07/2021 06:22:06 - INFO - __main__ - Step 64858: {'lr': 0.00030858320172072787, 'samples': 12452736, 'steps': 64857, 'loss/train': 1.1929552555084229} 11/07/2021 06:22:07 - INFO - __main__ - Step 64859: {'lr': 0.00030857804272185974, 'samples': 12452928, 'steps': 64858, 'loss/train': 1.1777817010879517} 11/07/2021 06:22:08 - INFO - __main__ - Step 64860: {'lr': 0.0003085728836965972, 'samples': 12453120, 'steps': 64859, 'loss/train': 1.377216100692749} 11/07/2021 06:22:08 - INFO - __main__ - Step 64861: {'lr': 0.0003085677246449426, 'samples': 12453312, 'steps': 64860, 'loss/train': 1.482625961303711} 11/07/2021 06:22:08 - INFO - __main__ - Step 64862: {'lr': 0.00030856256556689835, 'samples': 12453504, 'steps': 64861, 'loss/train': 1.509861707687378} 11/07/2021 06:22:09 - INFO - __main__ - Step 64863: {'lr': 0.0003085574064624666, 'samples': 12453696, 'steps': 64862, 'loss/train': 1.274176001548767} 11/07/2021 06:22:09 - INFO - __main__ - Step 64864: {'lr': 0.00030855224733164987, 'samples': 12453888, 'steps': 64863, 'loss/train': 1.5217761993408203} 11/07/2021 06:22:10 - INFO - __main__ - Step 64865: {'lr': 0.0003085470881744504, 'samples': 12454080, 'steps': 64864, 'loss/train': 1.0419563055038452} 11/07/2021 06:22:10 - INFO - __main__ - Step 64866: {'lr': 0.0003085419289908705, 'samples': 12454272, 'steps': 64865, 'loss/train': 1.4580577611923218} 11/07/2021 06:22:11 - INFO - __main__ - Step 64867: {'lr': 0.00030853676978091256, 'samples': 12454464, 'steps': 64866, 'loss/train': 0.9171990156173706} 11/07/2021 06:22:11 - INFO - __main__ - Step 64868: {'lr': 0.0003085316105445788, 'samples': 12454656, 'steps': 64867, 'loss/train': 1.8204642534255981} 11/07/2021 06:22:12 - INFO - __main__ - Step 64869: {'lr': 0.00030852645128187157, 'samples': 12454848, 'steps': 64868, 'loss/train': 1.29697847366333} 11/07/2021 06:22:12 - INFO - __main__ - Step 64870: {'lr': 0.00030852129199279325, 'samples': 12455040, 'steps': 64869, 'loss/train': 1.4719280004501343} 11/07/2021 06:22:13 - INFO - __main__ - Step 64871: {'lr': 0.0003085161326773461, 'samples': 12455232, 'steps': 64870, 'loss/train': 1.0850064754486084} 11/07/2021 06:22:13 - INFO - __main__ - Step 64872: {'lr': 0.0003085109733355326, 'samples': 12455424, 'steps': 64871, 'loss/train': 1.8461496829986572} 11/07/2021 06:22:14 - INFO - __main__ - Step 64873: {'lr': 0.00030850581396735493, 'samples': 12455616, 'steps': 64872, 'loss/train': 0.6675298810005188} 11/07/2021 06:22:14 - INFO - __main__ - Step 64874: {'lr': 0.0003085006545728154, 'samples': 12455808, 'steps': 64873, 'loss/train': 1.700971245765686} 11/07/2021 06:22:14 - INFO - __main__ - Step 64875: {'lr': 0.00030849549515191637, 'samples': 12456000, 'steps': 64874, 'loss/train': 1.0358401536941528} 11/07/2021 06:22:15 - INFO - __main__ - Step 64876: {'lr': 0.00030849033570466017, 'samples': 12456192, 'steps': 64875, 'loss/train': 1.282575249671936} 11/07/2021 06:22:16 - INFO - __main__ - Step 64877: {'lr': 0.0003084851762310492, 'samples': 12456384, 'steps': 64876, 'loss/train': 1.266755223274231} 11/07/2021 06:22:16 - INFO - __main__ - Step 64878: {'lr': 0.0003084800167310856, 'samples': 12456576, 'steps': 64877, 'loss/train': 1.5955100059509277} 11/07/2021 06:22:16 - INFO - __main__ - Step 64879: {'lr': 0.00030847485720477194, 'samples': 12456768, 'steps': 64878, 'loss/train': 1.7814500331878662} 11/07/2021 06:22:17 - INFO - __main__ - Step 64880: {'lr': 0.0003084696976521103, 'samples': 12456960, 'steps': 64879, 'loss/train': 1.3262460231781006} 11/07/2021 06:22:18 - INFO - __main__ - Step 64881: {'lr': 0.00030846453807310316, 'samples': 12457152, 'steps': 64880, 'loss/train': 1.4625030755996704} 11/07/2021 06:22:18 - INFO - __main__ - Step 64882: {'lr': 0.0003084593784677527, 'samples': 12457344, 'steps': 64881, 'loss/train': 1.090683102607727} 11/07/2021 06:22:19 - INFO - __main__ - Step 64883: {'lr': 0.0003084542188360615, 'samples': 12457536, 'steps': 64882, 'loss/train': 1.6445178985595703} 11/07/2021 06:22:19 - INFO - __main__ - Step 64884: {'lr': 0.0003084490591780317, 'samples': 12457728, 'steps': 64883, 'loss/train': 1.9947587251663208} 11/07/2021 06:22:19 - INFO - __main__ - Step 64885: {'lr': 0.0003084438994936656, 'samples': 12457920, 'steps': 64884, 'loss/train': 0.12400373816490173} 11/07/2021 06:22:20 - INFO - __main__ - Step 64886: {'lr': 0.00030843873978296564, 'samples': 12458112, 'steps': 64885, 'loss/train': 1.6013234853744507} 11/07/2021 06:22:21 - INFO - __main__ - Step 64887: {'lr': 0.000308433580045934, 'samples': 12458304, 'steps': 64886, 'loss/train': 1.4025827646255493} 11/07/2021 06:22:21 - INFO - __main__ - Step 64888: {'lr': 0.0003084284202825732, 'samples': 12458496, 'steps': 64887, 'loss/train': 0.8709974884986877} 11/07/2021 06:22:21 - INFO - __main__ - Step 64889: {'lr': 0.0003084232604928854, 'samples': 12458688, 'steps': 64888, 'loss/train': 1.6246225833892822} 11/07/2021 06:22:22 - INFO - __main__ - Step 64890: {'lr': 0.0003084181006768729, 'samples': 12458880, 'steps': 64889, 'loss/train': 1.3899285793304443} 11/07/2021 06:22:23 - INFO - __main__ - Step 64891: {'lr': 0.0003084129408345382, 'samples': 12459072, 'steps': 64890, 'loss/train': 1.2756571769714355} 11/07/2021 06:22:23 - INFO - __main__ - Step 64892: {'lr': 0.0003084077809658835, 'samples': 12459264, 'steps': 64891, 'loss/train': 0.7576113939285278} 11/07/2021 06:22:23 - INFO - __main__ - Step 64893: {'lr': 0.0003084026210709112, 'samples': 12459456, 'steps': 64892, 'loss/train': 1.4553937911987305} 11/07/2021 06:22:24 - INFO - __main__ - Step 64894: {'lr': 0.00030839746114962356, 'samples': 12459648, 'steps': 64893, 'loss/train': 1.258760929107666} 11/07/2021 06:22:24 - INFO - __main__ - Step 64895: {'lr': 0.00030839230120202296, 'samples': 12459840, 'steps': 64894, 'loss/train': 1.8875389099121094} 11/07/2021 06:22:25 - INFO - __main__ - Step 64896: {'lr': 0.00030838714122811164, 'samples': 12460032, 'steps': 64895, 'loss/train': 1.6142040491104126} 11/07/2021 06:22:26 - INFO - __main__ - Step 64897: {'lr': 0.00030838198122789195, 'samples': 12460224, 'steps': 64896, 'loss/train': 1.4058127403259277} 11/07/2021 06:22:26 - INFO - __main__ - Step 64898: {'lr': 0.00030837682120136626, 'samples': 12460416, 'steps': 64897, 'loss/train': 1.4270634651184082} 11/07/2021 06:22:26 - INFO - __main__ - Step 64899: {'lr': 0.00030837166114853695, 'samples': 12460608, 'steps': 64898, 'loss/train': 1.3601850271224976} 11/07/2021 06:22:27 - INFO - __main__ - Step 64900: {'lr': 0.00030836650106940615, 'samples': 12460800, 'steps': 64899, 'loss/train': 0.08229967206716537} 11/07/2021 06:22:28 - INFO - __main__ - Step 64901: {'lr': 0.0003083613409639764, 'samples': 12460992, 'steps': 64900, 'loss/train': 1.3285950422286987} 11/07/2021 06:22:28 - INFO - __main__ - Step 64902: {'lr': 0.00030835618083224986, 'samples': 12461184, 'steps': 64901, 'loss/train': 1.4271875619888306} 11/07/2021 06:22:28 - INFO - __main__ - Step 64903: {'lr': 0.00030835102067422893, 'samples': 12461376, 'steps': 64902, 'loss/train': 1.5733044147491455} 11/07/2021 06:22:29 - INFO - __main__ - Step 64904: {'lr': 0.000308345860489916, 'samples': 12461568, 'steps': 64903, 'loss/train': 1.4636361598968506} 11/07/2021 06:22:29 - INFO - __main__ - Step 64905: {'lr': 0.00030834070027931326, 'samples': 12461760, 'steps': 64904, 'loss/train': 1.6133853197097778} 11/07/2021 06:22:30 - INFO - __main__ - Step 64906: {'lr': 0.00030833554004242313, 'samples': 12461952, 'steps': 64905, 'loss/train': 1.6911547183990479} 11/07/2021 06:22:31 - INFO - __main__ - Step 64907: {'lr': 0.0003083303797792479, 'samples': 12462144, 'steps': 64906, 'loss/train': 1.3386733531951904} 11/07/2021 06:22:31 - INFO - __main__ - Step 64908: {'lr': 0.0003083252194897899, 'samples': 12462336, 'steps': 64907, 'loss/train': 1.4799779653549194} 11/07/2021 06:22:31 - INFO - __main__ - Step 64909: {'lr': 0.00030832005917405146, 'samples': 12462528, 'steps': 64908, 'loss/train': 1.3564401865005493} 11/07/2021 06:22:32 - INFO - __main__ - Step 64910: {'lr': 0.0003083148988320349, 'samples': 12462720, 'steps': 64909, 'loss/train': 1.4773845672607422} 11/07/2021 06:22:32 - INFO - __main__ - Step 64911: {'lr': 0.00030830973846374257, 'samples': 12462912, 'steps': 64910, 'loss/train': 1.4078583717346191} 11/07/2021 06:22:33 - INFO - __main__ - Step 64912: {'lr': 0.00030830457806917664, 'samples': 12463104, 'steps': 64911, 'loss/train': 0.33250531554222107} 11/07/2021 06:22:34 - INFO - __main__ - Step 64913: {'lr': 0.0003082994176483398, 'samples': 12463296, 'steps': 64912, 'loss/train': 0.9110197424888611} 11/07/2021 06:22:34 - INFO - __main__ - Step 64914: {'lr': 0.00030829425720123397, 'samples': 12463488, 'steps': 64913, 'loss/train': 1.1871041059494019} 11/07/2021 06:22:34 - INFO - __main__ - Step 64915: {'lr': 0.0003082890967278617, 'samples': 12463680, 'steps': 64914, 'loss/train': 1.7483094930648804} 11/07/2021 06:22:35 - INFO - __main__ - Step 64916: {'lr': 0.0003082839362282253, 'samples': 12463872, 'steps': 64915, 'loss/train': 1.5058246850967407} 11/07/2021 06:22:36 - INFO - __main__ - Step 64917: {'lr': 0.0003082787757023269, 'samples': 12464064, 'steps': 64916, 'loss/train': 1.5922476053237915} 11/07/2021 06:22:36 - INFO - __main__ - Step 64918: {'lr': 0.0003082736151501691, 'samples': 12464256, 'steps': 64917, 'loss/train': 1.5824519395828247} 11/07/2021 06:22:36 - INFO - __main__ - Step 64919: {'lr': 0.0003082684545717541, 'samples': 12464448, 'steps': 64918, 'loss/train': 2.0863356590270996} 11/07/2021 06:22:37 - INFO - __main__ - Step 64920: {'lr': 0.0003082632939670843, 'samples': 12464640, 'steps': 64919, 'loss/train': 1.3743306398391724} 11/07/2021 06:22:37 - INFO - __main__ - Step 64921: {'lr': 0.0003082581333361619, 'samples': 12464832, 'steps': 64920, 'loss/train': 1.7153172492980957} 11/07/2021 06:22:38 - INFO - __main__ - Step 64922: {'lr': 0.0003082529726789893, 'samples': 12465024, 'steps': 64921, 'loss/train': 1.023331642150879} 11/07/2021 06:22:38 - INFO - __main__ - Step 64923: {'lr': 0.0003082478119955687, 'samples': 12465216, 'steps': 64922, 'loss/train': 1.2438290119171143} 11/07/2021 06:22:39 - INFO - __main__ - Step 64924: {'lr': 0.00030824265128590267, 'samples': 12465408, 'steps': 64923, 'loss/train': 1.535783290863037} 11/07/2021 06:22:39 - INFO - __main__ - Step 64925: {'lr': 0.00030823749054999336, 'samples': 12465600, 'steps': 64924, 'loss/train': 1.6045563220977783} 11/07/2021 06:22:39 - INFO - __main__ - Step 64926: {'lr': 0.00030823232978784317, 'samples': 12465792, 'steps': 64925, 'loss/train': 1.4273911714553833} 11/07/2021 06:22:40 - INFO - __main__ - Step 64927: {'lr': 0.00030822716899945435, 'samples': 12465984, 'steps': 64926, 'loss/train': 1.2376922369003296} 11/07/2021 06:22:41 - INFO - __main__ - Step 64928: {'lr': 0.00030822200818482926, 'samples': 12466176, 'steps': 64927, 'loss/train': 1.3988325595855713} 11/07/2021 06:22:42 - INFO - __main__ - Step 64929: {'lr': 0.0003082168473439702, 'samples': 12466368, 'steps': 64928, 'loss/train': 1.307599425315857} 11/07/2021 06:22:42 - INFO - __main__ - Step 64930: {'lr': 0.0003082116864768796, 'samples': 12466560, 'steps': 64929, 'loss/train': 1.274249792098999} 11/07/2021 06:22:42 - INFO - __main__ - Step 64931: {'lr': 0.00030820652558355963, 'samples': 12466752, 'steps': 64930, 'loss/train': 0.8555542826652527} 11/07/2021 06:22:43 - INFO - __main__ - Step 64932: {'lr': 0.00030820136466401277, 'samples': 12466944, 'steps': 64931, 'loss/train': 0.10376926511526108} 11/07/2021 06:22:44 - INFO - __main__ - Step 64933: {'lr': 0.0003081962037182413, 'samples': 12467136, 'steps': 64932, 'loss/train': 1.184641718864441} 11/07/2021 06:22:44 - INFO - __main__ - Step 64934: {'lr': 0.00030819104274624744, 'samples': 12467328, 'steps': 64933, 'loss/train': 1.2663947343826294} 11/07/2021 06:22:44 - INFO - __main__ - Step 64935: {'lr': 0.0003081858817480336, 'samples': 12467520, 'steps': 64934, 'loss/train': 1.3211075067520142} 11/07/2021 06:22:45 - INFO - __main__ - Step 64936: {'lr': 0.0003081807207236021, 'samples': 12467712, 'steps': 64935, 'loss/train': 1.5821092128753662} 11/07/2021 06:22:45 - INFO - __main__ - Step 64937: {'lr': 0.00030817555967295533, 'samples': 12467904, 'steps': 64936, 'loss/train': 1.4466710090637207} 11/07/2021 06:22:46 - INFO - __main__ - Step 64938: {'lr': 0.0003081703985960955, 'samples': 12468096, 'steps': 64937, 'loss/train': 0.9324856996536255} 11/07/2021 06:22:47 - INFO - __main__ - Step 64939: {'lr': 0.000308165237493025, 'samples': 12468288, 'steps': 64938, 'loss/train': 1.5516644716262817} 11/07/2021 06:22:47 - INFO - __main__ - Step 64940: {'lr': 0.0003081600763637461, 'samples': 12468480, 'steps': 64939, 'loss/train': 1.2542940378189087} 11/07/2021 06:22:47 - INFO - __main__ - Step 64941: {'lr': 0.0003081549152082612, 'samples': 12468672, 'steps': 64940, 'loss/train': 1.4450514316558838} 11/07/2021 06:22:48 - INFO - __main__ - Step 64942: {'lr': 0.0003081497540265726, 'samples': 12468864, 'steps': 64941, 'loss/train': 1.6832855939865112} 11/07/2021 06:22:48 - INFO - __main__ - Step 64943: {'lr': 0.0003081445928186827, 'samples': 12469056, 'steps': 64942, 'loss/train': 1.369161605834961} 11/07/2021 06:22:49 - INFO - __main__ - Step 64944: {'lr': 0.0003081394315845936, 'samples': 12469248, 'steps': 64943, 'loss/train': 1.0901702642440796} 11/07/2021 06:22:49 - INFO - __main__ - Step 64945: {'lr': 0.0003081342703243078, 'samples': 12469440, 'steps': 64944, 'loss/train': 1.0987261533737183} 11/07/2021 06:22:50 - INFO - __main__ - Step 64946: {'lr': 0.0003081291090378276, 'samples': 12469632, 'steps': 64945, 'loss/train': 1.848440408706665} 11/07/2021 06:22:50 - INFO - __main__ - Step 64947: {'lr': 0.00030812394772515534, 'samples': 12469824, 'steps': 64946, 'loss/train': 1.8363455533981323} 11/07/2021 06:22:51 - INFO - __main__ - Step 64948: {'lr': 0.0003081187863862934, 'samples': 12470016, 'steps': 64947, 'loss/train': 1.3199695348739624} 11/07/2021 06:22:52 - INFO - __main__ - Step 64949: {'lr': 0.00030811362502124396, 'samples': 12470208, 'steps': 64948, 'loss/train': 0.8893134593963623} 11/07/2021 06:22:52 - INFO - __main__ - Step 64950: {'lr': 0.0003081084636300094, 'samples': 12470400, 'steps': 64949, 'loss/train': 2.8123888969421387} 11/07/2021 06:22:52 - INFO - __main__ - Step 64951: {'lr': 0.0003081033022125921, 'samples': 12470592, 'steps': 64950, 'loss/train': 0.8366165161132812} 11/07/2021 06:22:53 - INFO - __main__ - Step 64952: {'lr': 0.0003080981407689943, 'samples': 12470784, 'steps': 64951, 'loss/train': 1.6579667329788208} 11/07/2021 06:22:53 - INFO - __main__ - Step 64953: {'lr': 0.00030809297929921837, 'samples': 12470976, 'steps': 64952, 'loss/train': 1.8838595151901245} 11/07/2021 06:22:54 - INFO - __main__ - Step 64954: {'lr': 0.00030808781780326675, 'samples': 12471168, 'steps': 64953, 'loss/train': 1.3017679452896118} 11/07/2021 06:22:55 - INFO - __main__ - Step 64955: {'lr': 0.0003080826562811415, 'samples': 12471360, 'steps': 64954, 'loss/train': 1.5480241775512695} 11/07/2021 06:22:55 - INFO - __main__ - Step 64956: {'lr': 0.0003080774947328452, 'samples': 12471552, 'steps': 64955, 'loss/train': 1.390790343284607} 11/07/2021 06:22:55 - INFO - __main__ - Step 64957: {'lr': 0.00030807233315838006, 'samples': 12471744, 'steps': 64956, 'loss/train': 1.188361644744873} 11/07/2021 06:22:56 - INFO - __main__ - Step 64958: {'lr': 0.0003080671715577484, 'samples': 12471936, 'steps': 64957, 'loss/train': 1.8559495210647583} 11/07/2021 06:22:56 - INFO - __main__ - Step 64959: {'lr': 0.00030806200993095255, 'samples': 12472128, 'steps': 64958, 'loss/train': 1.48601233959198} 11/07/2021 06:22:57 - INFO - __main__ - Step 64960: {'lr': 0.00030805684827799496, 'samples': 12472320, 'steps': 64959, 'loss/train': 0.692216694355011} 11/07/2021 06:22:57 - INFO - __main__ - Step 64961: {'lr': 0.0003080516865988778, 'samples': 12472512, 'steps': 64960, 'loss/train': 0.8408593535423279} 11/07/2021 06:22:58 - INFO - __main__ - Step 64962: {'lr': 0.00030804652489360343, 'samples': 12472704, 'steps': 64961, 'loss/train': 1.1331177949905396} 11/07/2021 06:22:58 - INFO - __main__ - Step 64963: {'lr': 0.0003080413631621741, 'samples': 12472896, 'steps': 64962, 'loss/train': 1.974977731704712} 11/07/2021 06:22:58 - INFO - __main__ - Step 64964: {'lr': 0.0003080362014045923, 'samples': 12473088, 'steps': 64963, 'loss/train': 0.7693413496017456} 11/07/2021 06:22:59 - INFO - __main__ - Step 64965: {'lr': 0.0003080310396208603, 'samples': 12473280, 'steps': 64964, 'loss/train': 1.5753437280654907} 11/07/2021 06:23:00 - INFO - __main__ - Step 64966: {'lr': 0.00030802587781098045, 'samples': 12473472, 'steps': 64965, 'loss/train': 1.0975028276443481} 11/07/2021 06:23:00 - INFO - __main__ - Step 64967: {'lr': 0.000308020715974955, 'samples': 12473664, 'steps': 64966, 'loss/train': 1.3841232061386108} 11/07/2021 06:23:00 - INFO - __main__ - Step 64968: {'lr': 0.00030801555411278633, 'samples': 12473856, 'steps': 64967, 'loss/train': 1.1676355600357056} 11/07/2021 06:23:01 - INFO - __main__ - Step 64969: {'lr': 0.0003080103922244767, 'samples': 12474048, 'steps': 64968, 'loss/train': 1.497896671295166} 11/07/2021 06:23:02 - INFO - __main__ - Step 64970: {'lr': 0.00030800523031002846, 'samples': 12474240, 'steps': 64969, 'loss/train': 1.7277567386627197} 11/07/2021 06:23:02 - INFO - __main__ - Step 64971: {'lr': 0.00030800006836944406, 'samples': 12474432, 'steps': 64970, 'loss/train': 1.1092153787612915} 11/07/2021 06:23:03 - INFO - __main__ - Step 64972: {'lr': 0.00030799490640272563, 'samples': 12474624, 'steps': 64971, 'loss/train': 1.3474578857421875} 11/07/2021 06:23:03 - INFO - __main__ - Step 64973: {'lr': 0.00030798974440987564, 'samples': 12474816, 'steps': 64972, 'loss/train': 1.441540241241455} 11/07/2021 06:23:03 - INFO - __main__ - Step 64974: {'lr': 0.0003079845823908964, 'samples': 12475008, 'steps': 64973, 'loss/train': 1.5054665803909302} 11/07/2021 06:23:04 - INFO - __main__ - Step 64975: {'lr': 0.00030797942034579013, 'samples': 12475200, 'steps': 64974, 'loss/train': 1.6688684225082397} 11/07/2021 06:23:05 - INFO - __main__ - Step 64976: {'lr': 0.0003079742582745592, 'samples': 12475392, 'steps': 64975, 'loss/train': 1.3917348384857178} 11/07/2021 06:23:05 - INFO - __main__ - Step 64977: {'lr': 0.000307969096177206, 'samples': 12475584, 'steps': 64976, 'loss/train': 1.2355527877807617} 11/07/2021 06:23:05 - INFO - __main__ - Step 64978: {'lr': 0.00030796393405373287, 'samples': 12475776, 'steps': 64977, 'loss/train': 1.2895933389663696} 11/07/2021 06:23:06 - INFO - __main__ - Step 64979: {'lr': 0.000307958771904142, 'samples': 12475968, 'steps': 64978, 'loss/train': 1.0904091596603394} 11/07/2021 06:23:06 - INFO - __main__ - Step 64980: {'lr': 0.00030795360972843595, 'samples': 12476160, 'steps': 64979, 'loss/train': 1.4901905059814453} 11/07/2021 06:23:07 - INFO - __main__ - Step 64981: {'lr': 0.0003079484475266168, 'samples': 12476352, 'steps': 64980, 'loss/train': 1.3001978397369385} 11/07/2021 06:23:07 - INFO - __main__ - Step 64982: {'lr': 0.00030794328529868694, 'samples': 12476544, 'steps': 64981, 'loss/train': 1.3847415447235107} 11/07/2021 06:23:08 - INFO - __main__ - Step 64983: {'lr': 0.00030793812304464875, 'samples': 12476736, 'steps': 64982, 'loss/train': 1.419884443283081} 11/07/2021 06:23:08 - INFO - __main__ - Step 64984: {'lr': 0.00030793296076450454, 'samples': 12476928, 'steps': 64983, 'loss/train': 1.5790928602218628} 11/07/2021 06:23:08 - INFO - __main__ - Step 64985: {'lr': 0.00030792779845825665, 'samples': 12477120, 'steps': 64984, 'loss/train': 1.2135502099990845} 11/07/2021 06:23:09 - INFO - __main__ - Step 64986: {'lr': 0.00030792263612590734, 'samples': 12477312, 'steps': 64985, 'loss/train': 1.641514539718628} 11/07/2021 06:23:10 - INFO - __main__ - Step 64987: {'lr': 0.0003079174737674591, 'samples': 12477504, 'steps': 64986, 'loss/train': 0.5223718285560608} 11/07/2021 06:23:10 - INFO - __main__ - Step 64988: {'lr': 0.00030791231138291406, 'samples': 12477696, 'steps': 64987, 'loss/train': 1.6744320392608643} 11/07/2021 06:23:10 - INFO - __main__ - Step 64989: {'lr': 0.00030790714897227457, 'samples': 12477888, 'steps': 64988, 'loss/train': 1.6686522960662842} 11/07/2021 06:23:11 - INFO - __main__ - Step 64990: {'lr': 0.00030790198653554305, 'samples': 12478080, 'steps': 64989, 'loss/train': 0.08793967962265015} 11/07/2021 06:23:12 - INFO - __main__ - Step 64991: {'lr': 0.00030789682407272184, 'samples': 12478272, 'steps': 64990, 'loss/train': 1.5556352138519287} 11/07/2021 06:23:12 - INFO - __main__ - Step 64992: {'lr': 0.00030789166158381315, 'samples': 12478464, 'steps': 64991, 'loss/train': 1.1068272590637207} 11/07/2021 06:23:13 - INFO - __main__ - Step 64993: {'lr': 0.0003078864990688194, 'samples': 12478656, 'steps': 64992, 'loss/train': 0.8587809205055237} 11/07/2021 06:23:13 - INFO - __main__ - Step 64994: {'lr': 0.000307881336527743, 'samples': 12478848, 'steps': 64993, 'loss/train': 1.278476595878601} 11/07/2021 06:23:13 - INFO - __main__ - Step 64995: {'lr': 0.00030787617396058596, 'samples': 12479040, 'steps': 64994, 'loss/train': 1.3320491313934326} 11/07/2021 06:23:14 - INFO - __main__ - Step 64996: {'lr': 0.00030787101136735094, 'samples': 12479232, 'steps': 64995, 'loss/train': 1.3644126653671265} 11/07/2021 06:23:15 - INFO - __main__ - Step 64997: {'lr': 0.00030786584874804005, 'samples': 12479424, 'steps': 64996, 'loss/train': 1.1394450664520264} 11/07/2021 06:23:15 - INFO - __main__ - Step 64998: {'lr': 0.0003078606861026558, 'samples': 12479616, 'steps': 64997, 'loss/train': 1.352457880973816} 11/07/2021 06:23:15 - INFO - __main__ - Step 64999: {'lr': 0.00030785552343120035, 'samples': 12479808, 'steps': 64998, 'loss/train': 1.7293787002563477} 11/07/2021 06:23:16 - INFO - __main__ - Step 65000: {'lr': 0.00030785036073367614, 'samples': 12480000, 'steps': 64999, 'loss/train': 1.5831491947174072} 11/07/2021 06:23:17 - INFO - __main__ - Step 65001: {'lr': 0.00030784519801008544, 'samples': 12480192, 'steps': 65000, 'loss/train': 3.019192695617676} 11/07/2021 06:23:17 - INFO - __main__ - Step 65002: {'lr': 0.0003078400352604305, 'samples': 12480384, 'steps': 65001, 'loss/train': 1.2507374286651611} 11/07/2021 06:23:17 - INFO - __main__ - Step 65003: {'lr': 0.0003078348724847138, 'samples': 12480576, 'steps': 65002, 'loss/train': 1.4952727556228638} 11/07/2021 06:23:18 - INFO - __main__ - Step 65004: {'lr': 0.0003078297096829376, 'samples': 12480768, 'steps': 65003, 'loss/train': 1.1903066635131836} 11/07/2021 06:23:18 - INFO - __main__ - Step 65005: {'lr': 0.0003078245468551042, 'samples': 12480960, 'steps': 65004, 'loss/train': 1.4824901819229126} 11/07/2021 06:23:19 - INFO - __main__ - Step 65006: {'lr': 0.000307819384001216, 'samples': 12481152, 'steps': 65005, 'loss/train': 1.350128412246704} 11/07/2021 06:23:20 - INFO - __main__ - Step 65007: {'lr': 0.0003078142211212753, 'samples': 12481344, 'steps': 65006, 'loss/train': 1.5840262174606323} 11/07/2021 06:23:20 - INFO - __main__ - Step 65008: {'lr': 0.00030780905821528435, 'samples': 12481536, 'steps': 65007, 'loss/train': 1.2398840188980103} 11/07/2021 06:23:20 - INFO - __main__ - Step 65009: {'lr': 0.00030780389528324554, 'samples': 12481728, 'steps': 65008, 'loss/train': 1.1628297567367554} 11/07/2021 06:23:21 - INFO - __main__ - Step 65010: {'lr': 0.00030779873232516115, 'samples': 12481920, 'steps': 65009, 'loss/train': 1.386281967163086} 11/07/2021 06:23:21 - INFO - __main__ - Step 65011: {'lr': 0.00030779356934103357, 'samples': 12482112, 'steps': 65010, 'loss/train': 1.382953405380249} 11/07/2021 06:23:22 - INFO - __main__ - Step 65012: {'lr': 0.00030778840633086514, 'samples': 12482304, 'steps': 65011, 'loss/train': 1.6681556701660156} 11/07/2021 06:23:22 - INFO - __main__ - Step 65013: {'lr': 0.0003077832432946581, 'samples': 12482496, 'steps': 65012, 'loss/train': 1.261975646018982} 11/07/2021 06:23:23 - INFO - __main__ - Step 65014: {'lr': 0.0003077780802324149, 'samples': 12482688, 'steps': 65013, 'loss/train': 0.946709394454956} 11/07/2021 06:23:23 - INFO - __main__ - Step 65015: {'lr': 0.0003077729171441377, 'samples': 12482880, 'steps': 65014, 'loss/train': 0.9239485859870911} 11/07/2021 06:23:23 - INFO - __main__ - Step 65016: {'lr': 0.00030776775402982894, 'samples': 12483072, 'steps': 65015, 'loss/train': 1.611318588256836} 11/07/2021 06:23:24 - INFO - __main__ - Step 65017: {'lr': 0.00030776259088949087, 'samples': 12483264, 'steps': 65016, 'loss/train': 2.368335008621216} 11/07/2021 06:23:25 - INFO - __main__ - Step 65018: {'lr': 0.00030775742772312593, 'samples': 12483456, 'steps': 65017, 'loss/train': 1.8014577627182007} 11/07/2021 06:23:25 - INFO - __main__ - Step 65019: {'lr': 0.00030775226453073635, 'samples': 12483648, 'steps': 65018, 'loss/train': 1.6361191272735596} 11/07/2021 06:23:25 - INFO - __main__ - Step 65020: {'lr': 0.0003077471013123246, 'samples': 12483840, 'steps': 65019, 'loss/train': 1.1594651937484741} 11/07/2021 06:23:26 - INFO - __main__ - Step 65021: {'lr': 0.0003077419380678927, 'samples': 12484032, 'steps': 65020, 'loss/train': 1.0148288011550903} 11/07/2021 06:23:27 - INFO - __main__ - Step 65022: {'lr': 0.00030773677479744335, 'samples': 12484224, 'steps': 65021, 'loss/train': 1.3405160903930664} 11/07/2021 06:23:27 - INFO - __main__ - Step 65023: {'lr': 0.0003077316115009786, 'samples': 12484416, 'steps': 65022, 'loss/train': 1.3257759809494019} 11/07/2021 06:23:27 - INFO - __main__ - Step 65024: {'lr': 0.0003077264481785009, 'samples': 12484608, 'steps': 65023, 'loss/train': 1.1679352521896362} 11/07/2021 06:23:28 - INFO - __main__ - Step 65025: {'lr': 0.0003077212848300126, 'samples': 12484800, 'steps': 65024, 'loss/train': 1.5388263463974} 11/07/2021 06:23:28 - INFO - __main__ - Step 65026: {'lr': 0.0003077161214555159, 'samples': 12484992, 'steps': 65025, 'loss/train': 1.4412955045700073} 11/07/2021 06:23:29 - INFO - __main__ - Step 65027: {'lr': 0.0003077109580550133, 'samples': 12485184, 'steps': 65026, 'loss/train': 1.7540967464447021} 11/07/2021 06:23:29 - INFO - __main__ - Step 65028: {'lr': 0.000307705794628507, 'samples': 12485376, 'steps': 65027, 'loss/train': 1.4970470666885376} 11/07/2021 06:23:30 - INFO - __main__ - Step 65029: {'lr': 0.0003077006311759993, 'samples': 12485568, 'steps': 65028, 'loss/train': 1.4506590366363525} 11/07/2021 06:23:30 - INFO - __main__ - Step 65030: {'lr': 0.00030769546769749263, 'samples': 12485760, 'steps': 65029, 'loss/train': 1.3949140310287476} 11/07/2021 06:23:31 - INFO - __main__ - Step 65031: {'lr': 0.00030769030419298927, 'samples': 12485952, 'steps': 65030, 'loss/train': 0.8584029078483582} 11/07/2021 06:23:32 - INFO - __main__ - Step 65032: {'lr': 0.00030768514066249156, 'samples': 12486144, 'steps': 65031, 'loss/train': 0.7948538661003113} 11/07/2021 06:23:32 - INFO - __main__ - Step 65033: {'lr': 0.0003076799771060018, 'samples': 12486336, 'steps': 65032, 'loss/train': 1.663269281387329} 11/07/2021 06:23:33 - INFO - __main__ - Step 65034: {'lr': 0.0003076748135235224, 'samples': 12486528, 'steps': 65033, 'loss/train': 1.3858355283737183} 11/07/2021 06:23:33 - INFO - __main__ - Step 65035: {'lr': 0.00030766964991505553, 'samples': 12486720, 'steps': 65034, 'loss/train': 1.1428765058517456} 11/07/2021 06:23:33 - INFO - __main__ - Step 65036: {'lr': 0.0003076644862806036, 'samples': 12486912, 'steps': 65035, 'loss/train': 1.4838552474975586} 11/07/2021 06:23:34 - INFO - __main__ - Step 65037: {'lr': 0.00030765932262016897, 'samples': 12487104, 'steps': 65036, 'loss/train': 0.5055636167526245} 11/07/2021 06:23:35 - INFO - __main__ - Step 65038: {'lr': 0.00030765415893375394, 'samples': 12487296, 'steps': 65037, 'loss/train': 1.732251524925232} 11/07/2021 06:23:35 - INFO - __main__ - Step 65039: {'lr': 0.0003076489952213609, 'samples': 12487488, 'steps': 65038, 'loss/train': 1.742485761642456} 11/07/2021 06:23:36 - INFO - __main__ - Step 65040: {'lr': 0.00030764383148299196, 'samples': 12487680, 'steps': 65039, 'loss/train': 1.7821813821792603} 11/07/2021 06:23:36 - INFO - __main__ - Step 65041: {'lr': 0.0003076386677186498, 'samples': 12487872, 'steps': 65040, 'loss/train': 0.4163075089454651} 11/07/2021 06:23:37 - INFO - __main__ - Step 65042: {'lr': 0.00030763350392833637, 'samples': 12488064, 'steps': 65041, 'loss/train': 1.2494051456451416} 11/07/2021 06:23:37 - INFO - __main__ - Step 65043: {'lr': 0.00030762834011205425, 'samples': 12488256, 'steps': 65042, 'loss/train': 1.4053919315338135} 11/07/2021 06:23:38 - INFO - __main__ - Step 65044: {'lr': 0.0003076231762698057, 'samples': 12488448, 'steps': 65043, 'loss/train': 1.1474379301071167} 11/07/2021 06:23:38 - INFO - __main__ - Step 65045: {'lr': 0.000307618012401593, 'samples': 12488640, 'steps': 65044, 'loss/train': 1.5112004280090332} 11/07/2021 06:23:38 - INFO - __main__ - Step 65046: {'lr': 0.0003076128485074185, 'samples': 12488832, 'steps': 65045, 'loss/train': 1.3529356718063354} 11/07/2021 06:23:39 - INFO - __main__ - Step 65047: {'lr': 0.0003076076845872846, 'samples': 12489024, 'steps': 65046, 'loss/train': 1.6586639881134033} 11/07/2021 06:23:40 - INFO - __main__ - Step 65048: {'lr': 0.00030760252064119354, 'samples': 12489216, 'steps': 65047, 'loss/train': 1.2501670122146606} 11/07/2021 06:23:40 - INFO - __main__ - Step 65049: {'lr': 0.00030759735666914767, 'samples': 12489408, 'steps': 65048, 'loss/train': 1.2043490409851074} 11/07/2021 06:23:40 - INFO - __main__ - Step 65050: {'lr': 0.0003075921926711493, 'samples': 12489600, 'steps': 65049, 'loss/train': 1.3611836433410645} 11/07/2021 06:23:41 - INFO - __main__ - Step 65051: {'lr': 0.0003075870286472008, 'samples': 12489792, 'steps': 65050, 'loss/train': 1.4605910778045654} 11/07/2021 06:23:42 - INFO - __main__ - Step 65052: {'lr': 0.0003075818645973044, 'samples': 12489984, 'steps': 65051, 'loss/train': 1.4080681800842285} 11/07/2021 06:23:42 - INFO - __main__ - Step 65053: {'lr': 0.00030757670052146256, 'samples': 12490176, 'steps': 65052, 'loss/train': 1.5863890647888184} 11/07/2021 06:23:42 - INFO - __main__ - Step 65054: {'lr': 0.0003075715364196776, 'samples': 12490368, 'steps': 65053, 'loss/train': 1.5852479934692383} 11/07/2021 06:23:43 - INFO - __main__ - Step 65055: {'lr': 0.00030756637229195177, 'samples': 12490560, 'steps': 65054, 'loss/train': 0.7820081114768982} 11/07/2021 06:23:43 - INFO - __main__ - Step 65056: {'lr': 0.0003075612081382874, 'samples': 12490752, 'steps': 65055, 'loss/train': 1.2790805101394653} 11/07/2021 06:23:44 - INFO - __main__ - Step 65057: {'lr': 0.00030755604395868685, 'samples': 12490944, 'steps': 65056, 'loss/train': 1.618144154548645} 11/07/2021 06:23:45 - INFO - __main__ - Step 65058: {'lr': 0.0003075508797531524, 'samples': 12491136, 'steps': 65057, 'loss/train': 1.4835259914398193} 11/07/2021 06:23:45 - INFO - __main__ - Step 65059: {'lr': 0.00030754571552168644, 'samples': 12491328, 'steps': 65058, 'loss/train': 1.6149746179580688} 11/07/2021 06:23:45 - INFO - __main__ - Step 65060: {'lr': 0.00030754055126429124, 'samples': 12491520, 'steps': 65059, 'loss/train': 1.488415241241455} 11/07/2021 06:23:46 - INFO - __main__ - Step 65061: {'lr': 0.00030753538698096924, 'samples': 12491712, 'steps': 65060, 'loss/train': 1.775986909866333} 11/07/2021 06:23:46 - INFO - __main__ - Step 65062: {'lr': 0.0003075302226717226, 'samples': 12491904, 'steps': 65061, 'loss/train': 1.5213794708251953} 11/07/2021 06:23:47 - INFO - __main__ - Step 65063: {'lr': 0.00030752505833655375, 'samples': 12492096, 'steps': 65062, 'loss/train': 1.3572715520858765} 11/07/2021 06:23:47 - INFO - __main__ - Step 65064: {'lr': 0.00030751989397546497, 'samples': 12492288, 'steps': 65063, 'loss/train': 1.1969444751739502} 11/07/2021 06:23:48 - INFO - __main__ - Step 65065: {'lr': 0.0003075147295884586, 'samples': 12492480, 'steps': 65064, 'loss/train': 0.9565674662590027} 11/07/2021 06:23:48 - INFO - __main__ - Step 65066: {'lr': 0.0003075095651755371, 'samples': 12492672, 'steps': 65065, 'loss/train': 1.3564871549606323} 11/07/2021 06:23:48 - INFO - __main__ - Step 65067: {'lr': 0.0003075044007367026, 'samples': 12492864, 'steps': 65066, 'loss/train': 1.1479103565216064} 11/07/2021 06:23:49 - INFO - __main__ - Step 65068: {'lr': 0.0003074992362719575, 'samples': 12493056, 'steps': 65067, 'loss/train': 1.4479570388793945} 11/07/2021 06:23:50 - INFO - __main__ - Step 65069: {'lr': 0.0003074940717813041, 'samples': 12493248, 'steps': 65068, 'loss/train': 1.776086688041687} 11/07/2021 06:23:50 - INFO - __main__ - Step 65070: {'lr': 0.00030748890726474474, 'samples': 12493440, 'steps': 65069, 'loss/train': 2.185962200164795} 11/07/2021 06:23:50 - INFO - __main__ - Step 65071: {'lr': 0.00030748374272228184, 'samples': 12493632, 'steps': 65070, 'loss/train': 1.3290791511535645} 11/07/2021 06:23:51 - INFO - __main__ - Step 65072: {'lr': 0.00030747857815391767, 'samples': 12493824, 'steps': 65071, 'loss/train': 1.2355061769485474} 11/07/2021 06:23:52 - INFO - __main__ - Step 65073: {'lr': 0.0003074734135596545, 'samples': 12494016, 'steps': 65072, 'loss/train': 1.0353158712387085} 11/07/2021 06:23:52 - INFO - __main__ - Step 65074: {'lr': 0.0003074682489394947, 'samples': 12494208, 'steps': 65073, 'loss/train': 1.4304317235946655} 11/07/2021 06:23:52 - INFO - __main__ - Step 65075: {'lr': 0.00030746308429344056, 'samples': 12494400, 'steps': 65074, 'loss/train': 0.9964219927787781} 11/07/2021 06:23:53 - INFO - __main__ - Step 65076: {'lr': 0.0003074579196214945, 'samples': 12494592, 'steps': 65075, 'loss/train': 1.323668360710144} 11/07/2021 06:23:53 - INFO - __main__ - Step 65077: {'lr': 0.00030745275492365874, 'samples': 12494784, 'steps': 65076, 'loss/train': 1.3614304065704346} 11/07/2021 06:23:54 - INFO - __main__ - Step 65078: {'lr': 0.0003074475901999357, 'samples': 12494976, 'steps': 65077, 'loss/train': 1.1536173820495605} 11/07/2021 06:23:54 - INFO - __main__ - Step 65079: {'lr': 0.00030744242545032764, 'samples': 12495168, 'steps': 65078, 'loss/train': 1.0120314359664917} 11/07/2021 06:23:55 - INFO - __main__ - Step 65080: {'lr': 0.0003074372606748369, 'samples': 12495360, 'steps': 65079, 'loss/train': 1.6718409061431885} 11/07/2021 06:23:55 - INFO - __main__ - Step 65081: {'lr': 0.0003074320958734658, 'samples': 12495552, 'steps': 65080, 'loss/train': 1.3258658647537231} 11/07/2021 06:23:56 - INFO - __main__ - Step 65082: {'lr': 0.0003074269310462167, 'samples': 12495744, 'steps': 65081, 'loss/train': 1.404412865638733} 11/07/2021 06:23:57 - INFO - __main__ - Step 65083: {'lr': 0.000307421766193092, 'samples': 12495936, 'steps': 65082, 'loss/train': 1.2720184326171875} 11/07/2021 06:23:57 - INFO - __main__ - Step 65084: {'lr': 0.0003074166013140938, 'samples': 12496128, 'steps': 65083, 'loss/train': 1.218847632408142} 11/07/2021 06:23:57 - INFO - __main__ - Step 65085: {'lr': 0.0003074114364092246, 'samples': 12496320, 'steps': 65084, 'loss/train': 1.1346498727798462} 11/07/2021 06:23:58 - INFO - __main__ - Step 65086: {'lr': 0.0003074062714784867, 'samples': 12496512, 'steps': 65085, 'loss/train': 1.4233808517456055} 11/07/2021 06:23:58 - INFO - __main__ - Step 65087: {'lr': 0.00030740110652188247, 'samples': 12496704, 'steps': 65086, 'loss/train': 1.5674620866775513} 11/07/2021 06:23:59 - INFO - __main__ - Step 65088: {'lr': 0.0003073959415394142, 'samples': 12496896, 'steps': 65087, 'loss/train': 1.3665738105773926} 11/07/2021 06:23:59 - INFO - __main__ - Step 65089: {'lr': 0.0003073907765310841, 'samples': 12497088, 'steps': 65088, 'loss/train': 0.9511145353317261} 11/07/2021 06:24:00 - INFO - __main__ - Step 65090: {'lr': 0.0003073856114968947, 'samples': 12497280, 'steps': 65089, 'loss/train': 1.4055358171463013} 11/07/2021 06:24:00 - INFO - __main__ - Step 65091: {'lr': 0.00030738044643684816, 'samples': 12497472, 'steps': 65090, 'loss/train': 1.4544185400009155} 11/07/2021 06:24:00 - INFO - __main__ - Step 65092: {'lr': 0.0003073752813509469, 'samples': 12497664, 'steps': 65091, 'loss/train': 1.2798799276351929} 11/07/2021 06:24:01 - INFO - __main__ - Step 65093: {'lr': 0.0003073701162391932, 'samples': 12497856, 'steps': 65092, 'loss/train': 1.5943646430969238} 11/07/2021 06:24:02 - INFO - __main__ - Step 65094: {'lr': 0.0003073649511015895, 'samples': 12498048, 'steps': 65093, 'loss/train': 1.2289012670516968} 11/07/2021 06:24:02 - INFO - __main__ - Step 65095: {'lr': 0.00030735978593813797, 'samples': 12498240, 'steps': 65094, 'loss/train': 1.287078619003296} 11/07/2021 06:24:03 - INFO - __main__ - Step 65096: {'lr': 0.00030735462074884097, 'samples': 12498432, 'steps': 65095, 'loss/train': 1.3137612342834473} 11/07/2021 06:24:03 - INFO - __main__ - Step 65097: {'lr': 0.00030734945553370093, 'samples': 12498624, 'steps': 65096, 'loss/train': 1.199890375137329} 11/07/2021 06:24:04 - INFO - __main__ - Step 65098: {'lr': 0.00030734429029272, 'samples': 12498816, 'steps': 65097, 'loss/train': 1.4651317596435547} 11/07/2021 06:24:04 - INFO - __main__ - Step 65099: {'lr': 0.0003073391250259007, 'samples': 12499008, 'steps': 65098, 'loss/train': 1.5230333805084229} 11/07/2021 06:24:05 - INFO - __main__ - Step 65100: {'lr': 0.0003073339597332453, 'samples': 12499200, 'steps': 65099, 'loss/train': 1.4286811351776123} 11/07/2021 06:24:05 - INFO - __main__ - Step 65101: {'lr': 0.00030732879441475614, 'samples': 12499392, 'steps': 65100, 'loss/train': 1.032585620880127} 11/07/2021 06:24:05 - INFO - __main__ - Step 65102: {'lr': 0.0003073236290704354, 'samples': 12499584, 'steps': 65101, 'loss/train': 1.6358826160430908} 11/07/2021 06:24:06 - INFO - __main__ - Step 65103: {'lr': 0.0003073184637002856, 'samples': 12499776, 'steps': 65102, 'loss/train': 1.7500327825546265} 11/07/2021 06:24:07 - INFO - __main__ - Step 65104: {'lr': 0.0003073132983043089, 'samples': 12499968, 'steps': 65103, 'loss/train': 1.107629656791687} 11/07/2021 06:24:07 - INFO - __main__ - Step 65105: {'lr': 0.0003073081328825078, 'samples': 12500160, 'steps': 65104, 'loss/train': 1.181552767753601} 11/07/2021 06:24:07 - INFO - __main__ - Step 65106: {'lr': 0.0003073029674348845, 'samples': 12500352, 'steps': 65105, 'loss/train': 2.1216142177581787} 11/07/2021 06:24:08 - INFO - __main__ - Step 65107: {'lr': 0.00030729780196144137, 'samples': 12500544, 'steps': 65106, 'loss/train': 1.2853692770004272} 11/07/2021 06:24:08 - INFO - __main__ - Step 65108: {'lr': 0.0003072926364621807, 'samples': 12500736, 'steps': 65107, 'loss/train': 1.3068504333496094} 11/07/2021 06:24:09 - INFO - __main__ - Step 65109: {'lr': 0.0003072874709371049, 'samples': 12500928, 'steps': 65108, 'loss/train': 1.4986859560012817} 11/07/2021 06:24:10 - INFO - __main__ - Step 65110: {'lr': 0.0003072823053862163, 'samples': 12501120, 'steps': 65109, 'loss/train': 1.4985475540161133} 11/07/2021 06:24:10 - INFO - __main__ - Step 65111: {'lr': 0.00030727713980951705, 'samples': 12501312, 'steps': 65110, 'loss/train': 1.3169059753417969} 11/07/2021 06:24:10 - INFO - __main__ - Step 65112: {'lr': 0.0003072719742070097, 'samples': 12501504, 'steps': 65111, 'loss/train': 1.1762031316757202} 11/07/2021 06:24:11 - INFO - __main__ - Step 65113: {'lr': 0.0003072668085786964, 'samples': 12501696, 'steps': 65112, 'loss/train': 1.0948702096939087} 11/07/2021 06:24:12 - INFO - __main__ - Step 65114: {'lr': 0.0003072616429245796, 'samples': 12501888, 'steps': 65113, 'loss/train': 0.9881782531738281} 11/07/2021 06:24:12 - INFO - __main__ - Step 65115: {'lr': 0.00030725647724466165, 'samples': 12502080, 'steps': 65114, 'loss/train': 1.7179502248764038} 11/07/2021 06:24:12 - INFO - __main__ - Step 65116: {'lr': 0.00030725131153894474, 'samples': 12502272, 'steps': 65115, 'loss/train': 1.3713270425796509} 11/07/2021 06:24:13 - INFO - __main__ - Step 65117: {'lr': 0.00030724614580743135, 'samples': 12502464, 'steps': 65116, 'loss/train': 1.6210484504699707} 11/07/2021 06:24:13 - INFO - __main__ - Step 65118: {'lr': 0.00030724098005012365, 'samples': 12502656, 'steps': 65117, 'loss/train': 1.4259990453720093} 11/07/2021 06:24:14 - INFO - __main__ - Step 65119: {'lr': 0.00030723581426702403, 'samples': 12502848, 'steps': 65118, 'loss/train': 1.1499935388565063} 11/07/2021 06:24:15 - INFO - __main__ - Step 65120: {'lr': 0.00030723064845813487, 'samples': 12503040, 'steps': 65119, 'loss/train': 1.1093627214431763} 11/07/2021 06:24:15 - INFO - __main__ - Step 65121: {'lr': 0.0003072254826234585, 'samples': 12503232, 'steps': 65120, 'loss/train': 0.9715672731399536} 11/07/2021 06:24:15 - INFO - __main__ - Step 65122: {'lr': 0.00030722031676299716, 'samples': 12503424, 'steps': 65121, 'loss/train': 1.2209464311599731} 11/07/2021 06:24:16 - INFO - __main__ - Step 65123: {'lr': 0.00030721515087675326, 'samples': 12503616, 'steps': 65122, 'loss/train': 1.5252859592437744} 11/07/2021 06:24:17 - INFO - __main__ - Step 65124: {'lr': 0.00030720998496472905, 'samples': 12503808, 'steps': 65123, 'loss/train': 1.332614541053772} 11/07/2021 06:24:17 - INFO - __main__ - Step 65125: {'lr': 0.000307204819026927, 'samples': 12504000, 'steps': 65124, 'loss/train': 0.6861950755119324} 11/07/2021 06:24:17 - INFO - __main__ - Step 65126: {'lr': 0.00030719965306334925, 'samples': 12504192, 'steps': 65125, 'loss/train': 1.609683871269226} 11/07/2021 06:24:18 - INFO - __main__ - Step 65127: {'lr': 0.0003071944870739982, 'samples': 12504384, 'steps': 65126, 'loss/train': 1.6389867067337036} 11/07/2021 06:24:18 - INFO - __main__ - Step 65128: {'lr': 0.0003071893210588763, 'samples': 12504576, 'steps': 65127, 'loss/train': 1.8838413953781128} 11/07/2021 06:24:19 - INFO - __main__ - Step 65129: {'lr': 0.00030718415501798576, 'samples': 12504768, 'steps': 65128, 'loss/train': 1.1104137897491455} 11/07/2021 06:24:19 - INFO - __main__ - Step 65130: {'lr': 0.00030717898895132883, 'samples': 12504960, 'steps': 65129, 'loss/train': 1.6028951406478882} 11/07/2021 06:24:20 - INFO - __main__ - Step 65131: {'lr': 0.000307173822858908, 'samples': 12505152, 'steps': 65130, 'loss/train': 0.5327861905097961} 11/07/2021 06:24:20 - INFO - __main__ - Step 65132: {'lr': 0.00030716865674072547, 'samples': 12505344, 'steps': 65131, 'loss/train': 1.4965598583221436} 11/07/2021 06:24:20 - INFO - __main__ - Step 65133: {'lr': 0.0003071634905967837, 'samples': 12505536, 'steps': 65132, 'loss/train': 1.559753656387329} 11/07/2021 06:24:21 - INFO - __main__ - Step 65134: {'lr': 0.00030715832442708484, 'samples': 12505728, 'steps': 65133, 'loss/train': 1.545695424079895} 11/07/2021 06:24:22 - INFO - __main__ - Step 65135: {'lr': 0.00030715315823163147, 'samples': 12505920, 'steps': 65134, 'loss/train': 1.0530073642730713} 11/07/2021 06:24:22 - INFO - __main__ - Step 65136: {'lr': 0.00030714799201042565, 'samples': 12506112, 'steps': 65135, 'loss/train': 1.170634150505066} 11/07/2021 06:24:22 - INFO - __main__ - Step 65137: {'lr': 0.00030714282576346986, 'samples': 12506304, 'steps': 65136, 'loss/train': 1.7498496770858765} 11/07/2021 06:24:23 - INFO - __main__ - Step 65138: {'lr': 0.0003071376594907664, 'samples': 12506496, 'steps': 65137, 'loss/train': 0.25893399119377136} 11/07/2021 06:24:23 - INFO - __main__ - Step 65139: {'lr': 0.00030713249319231755, 'samples': 12506688, 'steps': 65138, 'loss/train': 1.1980488300323486} 11/07/2021 06:24:24 - INFO - __main__ - Step 65140: {'lr': 0.00030712732686812575, 'samples': 12506880, 'steps': 65139, 'loss/train': 1.3273106813430786} 11/07/2021 06:24:25 - INFO - __main__ - Step 65141: {'lr': 0.0003071221605181933, 'samples': 12507072, 'steps': 65140, 'loss/train': 1.4141018390655518} 11/07/2021 06:24:25 - INFO - __main__ - Step 65142: {'lr': 0.0003071169941425224, 'samples': 12507264, 'steps': 65141, 'loss/train': 1.2667698860168457} 11/07/2021 06:24:25 - INFO - __main__ - Step 65143: {'lr': 0.00030711182774111544, 'samples': 12507456, 'steps': 65142, 'loss/train': 1.693641185760498} 11/07/2021 06:24:26 - INFO - __main__ - Step 65144: {'lr': 0.0003071066613139748, 'samples': 12507648, 'steps': 65143, 'loss/train': 1.3284976482391357} 11/07/2021 06:24:27 - INFO - __main__ - Step 65145: {'lr': 0.0003071014948611028, 'samples': 12507840, 'steps': 65144, 'loss/train': 1.425578236579895} 11/07/2021 06:24:27 - INFO - __main__ - Step 65146: {'lr': 0.0003070963283825017, 'samples': 12508032, 'steps': 65145, 'loss/train': 1.4960638284683228} 11/07/2021 06:24:27 - INFO - __main__ - Step 65147: {'lr': 0.00030709116187817396, 'samples': 12508224, 'steps': 65146, 'loss/train': 1.540917992591858} 11/07/2021 06:24:28 - INFO - __main__ - Step 65148: {'lr': 0.0003070859953481218, 'samples': 12508416, 'steps': 65147, 'loss/train': 0.06255259364843369} 11/07/2021 06:24:28 - INFO - __main__ - Step 65149: {'lr': 0.00030708082879234757, 'samples': 12508608, 'steps': 65148, 'loss/train': 0.9299478530883789} 11/07/2021 06:24:29 - INFO - __main__ - Step 65150: {'lr': 0.00030707566221085356, 'samples': 12508800, 'steps': 65149, 'loss/train': 1.3429279327392578} 11/07/2021 06:24:30 - INFO - __main__ - Step 65151: {'lr': 0.00030707049560364216, 'samples': 12508992, 'steps': 65150, 'loss/train': 1.4740707874298096} 11/07/2021 06:24:30 - INFO - __main__ - Step 65152: {'lr': 0.0003070653289707156, 'samples': 12509184, 'steps': 65151, 'loss/train': 1.4512189626693726} 11/07/2021 06:24:30 - INFO - __main__ - Step 65153: {'lr': 0.00030706016231207633, 'samples': 12509376, 'steps': 65152, 'loss/train': 1.4313772916793823} 11/07/2021 06:24:31 - INFO - __main__ - Step 65154: {'lr': 0.00030705499562772666, 'samples': 12509568, 'steps': 65153, 'loss/train': 0.918407142162323} 11/07/2021 06:24:32 - INFO - __main__ - Step 65155: {'lr': 0.000307049828917669, 'samples': 12509760, 'steps': 65154, 'loss/train': 1.5733401775360107} 11/07/2021 06:24:32 - INFO - __main__ - Step 65156: {'lr': 0.0003070446621819054, 'samples': 12509952, 'steps': 65155, 'loss/train': 1.1799265146255493} 11/07/2021 06:24:32 - INFO - __main__ - Step 65157: {'lr': 0.0003070394954204384, 'samples': 12510144, 'steps': 65156, 'loss/train': 0.3806449770927429} 11/07/2021 06:24:33 - INFO - __main__ - Step 65158: {'lr': 0.0003070343286332703, 'samples': 12510336, 'steps': 65157, 'loss/train': 0.7651152610778809} 11/07/2021 06:24:33 - INFO - __main__ - Step 65159: {'lr': 0.0003070291618204034, 'samples': 12510528, 'steps': 65158, 'loss/train': 1.1811665296554565} 11/07/2021 06:24:34 - INFO - __main__ - Step 65160: {'lr': 0.00030702399498184005, 'samples': 12510720, 'steps': 65159, 'loss/train': 1.2937517166137695} 11/07/2021 06:24:34 - INFO - __main__ - Step 65161: {'lr': 0.00030701882811758253, 'samples': 12510912, 'steps': 65160, 'loss/train': 1.6488434076309204} 11/07/2021 06:24:35 - INFO - __main__ - Step 65162: {'lr': 0.00030701366122763327, 'samples': 12511104, 'steps': 65161, 'loss/train': 1.4999668598175049} 11/07/2021 06:24:35 - INFO - __main__ - Step 65163: {'lr': 0.00030700849431199444, 'samples': 12511296, 'steps': 65162, 'loss/train': 1.055387258529663} 11/07/2021 06:24:35 - INFO - __main__ - Step 65164: {'lr': 0.0003070033273706685, 'samples': 12511488, 'steps': 65163, 'loss/train': 1.4091752767562866} 11/07/2021 06:24:37 - INFO - __main__ - Step 65165: {'lr': 0.0003069981604036578, 'samples': 12511680, 'steps': 65164, 'loss/train': 1.8099805116653442} 11/07/2021 06:24:37 - INFO - __main__ - Step 65166: {'lr': 0.00030699299341096456, 'samples': 12511872, 'steps': 65165, 'loss/train': 1.8354662656784058} 11/07/2021 06:24:37 - INFO - __main__ - Step 65167: {'lr': 0.0003069878263925912, 'samples': 12512064, 'steps': 65166, 'loss/train': 1.4942902326583862} 11/07/2021 06:24:38 - INFO - __main__ - Step 65168: {'lr': 0.00030698265934854, 'samples': 12512256, 'steps': 65167, 'loss/train': 1.803676724433899} 11/07/2021 06:24:38 - INFO - __main__ - Step 65169: {'lr': 0.0003069774922788132, 'samples': 12512448, 'steps': 65168, 'loss/train': 0.08837371319532394} 11/07/2021 06:24:40 - INFO - __main__ - Step 65170: {'lr': 0.0003069723251834133, 'samples': 12512640, 'steps': 65169, 'loss/train': 1.3107999563217163} 11/07/2021 06:24:40 - INFO - __main__ - Step 65171: {'lr': 0.00030696715806234257, 'samples': 12512832, 'steps': 65170, 'loss/train': 1.7187751531600952} 11/07/2021 06:24:40 - INFO - __main__ - Step 65172: {'lr': 0.0003069619909156032, 'samples': 12513024, 'steps': 65171, 'loss/train': 1.2522332668304443} 11/07/2021 06:24:41 - INFO - __main__ - Step 65173: {'lr': 0.0003069568237431978, 'samples': 12513216, 'steps': 65172, 'loss/train': 1.5482831001281738} 11/07/2021 06:24:41 - INFO - __main__ - Step 65174: {'lr': 0.0003069516565451284, 'samples': 12513408, 'steps': 65173, 'loss/train': 1.1986669301986694} 11/07/2021 06:24:42 - INFO - __main__ - Step 65175: {'lr': 0.0003069464893213976, 'samples': 12513600, 'steps': 65174, 'loss/train': 1.7511003017425537} 11/07/2021 06:24:42 - INFO - __main__ - Step 65176: {'lr': 0.0003069413220720075, 'samples': 12513792, 'steps': 65175, 'loss/train': 1.4603962898254395} 11/07/2021 06:24:43 - INFO - __main__ - Step 65177: {'lr': 0.00030693615479696046, 'samples': 12513984, 'steps': 65176, 'loss/train': 1.799436330795288} 11/07/2021 06:24:44 - INFO - __main__ - Step 65178: {'lr': 0.00030693098749625894, 'samples': 12514176, 'steps': 65177, 'loss/train': 1.6282914876937866} 11/07/2021 06:24:44 - INFO - __main__ - Step 65179: {'lr': 0.0003069258201699052, 'samples': 12514368, 'steps': 65178, 'loss/train': 0.14459623396396637} 11/07/2021 06:24:44 - INFO - __main__ - Step 65180: {'lr': 0.00030692065281790154, 'samples': 12514560, 'steps': 65179, 'loss/train': 1.3443652391433716} 11/07/2021 06:24:45 - INFO - __main__ - Step 65181: {'lr': 0.0003069154854402503, 'samples': 12514752, 'steps': 65180, 'loss/train': 1.4866318702697754} 11/07/2021 06:24:46 - INFO - __main__ - Step 65182: {'lr': 0.0003069103180369539, 'samples': 12514944, 'steps': 65181, 'loss/train': 1.2028367519378662} 11/07/2021 06:24:46 - INFO - __main__ - Step 65183: {'lr': 0.0003069051506080145, 'samples': 12515136, 'steps': 65182, 'loss/train': 1.4546797275543213} 11/07/2021 06:24:46 - INFO - __main__ - Step 65184: {'lr': 0.0003068999831534346, 'samples': 12515328, 'steps': 65183, 'loss/train': 1.394660234451294} 11/07/2021 06:24:47 - INFO - __main__ - Step 65185: {'lr': 0.00030689481567321635, 'samples': 12515520, 'steps': 65184, 'loss/train': 1.7795518636703491} 11/07/2021 06:24:47 - INFO - __main__ - Step 65186: {'lr': 0.0003068896481673622, 'samples': 12515712, 'steps': 65185, 'loss/train': 1.4744658470153809} 11/07/2021 06:24:48 - INFO - __main__ - Step 65187: {'lr': 0.00030688448063587447, 'samples': 12515904, 'steps': 65186, 'loss/train': 1.456432580947876} 11/07/2021 06:24:48 - INFO - __main__ - Step 65188: {'lr': 0.0003068793130787555, 'samples': 12516096, 'steps': 65187, 'loss/train': 1.7065198421478271} 11/07/2021 06:24:49 - INFO - __main__ - Step 65189: {'lr': 0.00030687414549600755, 'samples': 12516288, 'steps': 65188, 'loss/train': 0.6425004005432129} 11/07/2021 06:24:49 - INFO - __main__ - Step 65190: {'lr': 0.00030686897788763303, 'samples': 12516480, 'steps': 65189, 'loss/train': 1.80988347530365} 11/07/2021 06:24:50 - INFO - __main__ - Step 65191: {'lr': 0.0003068638102536342, 'samples': 12516672, 'steps': 65190, 'loss/train': 1.155943751335144} 11/07/2021 06:24:51 - INFO - __main__ - Step 65192: {'lr': 0.00030685864259401334, 'samples': 12516864, 'steps': 65191, 'loss/train': 1.758609652519226} 11/07/2021 06:24:51 - INFO - __main__ - Step 65193: {'lr': 0.00030685347490877295, 'samples': 12517056, 'steps': 65192, 'loss/train': 1.3404260873794556} 11/07/2021 06:24:51 - INFO - __main__ - Step 65194: {'lr': 0.00030684830719791525, 'samples': 12517248, 'steps': 65193, 'loss/train': 1.4326629638671875} 11/07/2021 06:24:52 - INFO - __main__ - Step 65195: {'lr': 0.0003068431394614426, 'samples': 12517440, 'steps': 65194, 'loss/train': 1.949111819267273} 11/07/2021 06:24:52 - INFO - __main__ - Step 65196: {'lr': 0.0003068379716993573, 'samples': 12517632, 'steps': 65195, 'loss/train': 1.0439518690109253} 11/07/2021 06:24:53 - INFO - __main__ - Step 65197: {'lr': 0.0003068328039116616, 'samples': 12517824, 'steps': 65196, 'loss/train': 1.0408331155776978} 11/07/2021 06:24:53 - INFO - __main__ - Step 65198: {'lr': 0.00030682763609835793, 'samples': 12518016, 'steps': 65197, 'loss/train': 1.0721874237060547} 11/07/2021 06:24:54 - INFO - __main__ - Step 65199: {'lr': 0.0003068224682594487, 'samples': 12518208, 'steps': 65198, 'loss/train': 1.0904020071029663} 11/07/2021 06:24:54 - INFO - __main__ - Step 65200: {'lr': 0.0003068173003949361, 'samples': 12518400, 'steps': 65199, 'loss/train': 1.017926573753357} 11/07/2021 06:24:54 - INFO - __main__ - Step 65201: {'lr': 0.00030681213250482255, 'samples': 12518592, 'steps': 65200, 'loss/train': 1.0569989681243896} 11/07/2021 06:24:56 - INFO - __main__ - Step 65202: {'lr': 0.0003068069645891102, 'samples': 12518784, 'steps': 65201, 'loss/train': 1.2262241840362549} 11/07/2021 06:24:56 - INFO - __main__ - Step 65203: {'lr': 0.0003068017966478016, 'samples': 12518976, 'steps': 65202, 'loss/train': 1.3618454933166504} 11/07/2021 06:24:56 - INFO - __main__ - Step 65204: {'lr': 0.000306796628680899, 'samples': 12519168, 'steps': 65203, 'loss/train': 1.2356011867523193} 11/07/2021 06:24:57 - INFO - __main__ - Step 65205: {'lr': 0.00030679146068840463, 'samples': 12519360, 'steps': 65204, 'loss/train': 1.0793384313583374} 11/07/2021 06:24:57 - INFO - __main__ - Step 65206: {'lr': 0.00030678629267032106, 'samples': 12519552, 'steps': 65205, 'loss/train': 1.2174307107925415} 11/07/2021 06:24:57 - INFO - __main__ - Step 65207: {'lr': 0.0003067811246266503, 'samples': 12519744, 'steps': 65206, 'loss/train': 1.3354328870773315} 11/07/2021 06:24:59 - INFO - __main__ - Step 65208: {'lr': 0.00030677595655739494, 'samples': 12519936, 'steps': 65207, 'loss/train': 1.2641942501068115} 11/07/2021 06:24:59 - INFO - __main__ - Step 65209: {'lr': 0.0003067707884625571, 'samples': 12520128, 'steps': 65208, 'loss/train': 1.708766222000122} 11/07/2021 06:24:59 - INFO - __main__ - Step 65210: {'lr': 0.00030676562034213933, 'samples': 12520320, 'steps': 65209, 'loss/train': 1.3566724061965942} 11/07/2021 06:25:00 - INFO - __main__ - Step 65211: {'lr': 0.0003067604521961438, 'samples': 12520512, 'steps': 65210, 'loss/train': 1.6201975345611572} 11/07/2021 06:25:00 - INFO - __main__ - Step 65212: {'lr': 0.00030675528402457293, 'samples': 12520704, 'steps': 65211, 'loss/train': 0.6128956079483032} 11/07/2021 06:25:01 - INFO - __main__ - Step 65213: {'lr': 0.000306750115827429, 'samples': 12520896, 'steps': 65212, 'loss/train': 1.3590779304504395} 11/07/2021 06:25:01 - INFO - __main__ - Step 65214: {'lr': 0.0003067449476047143, 'samples': 12521088, 'steps': 65213, 'loss/train': 1.5877410173416138} 11/07/2021 06:25:02 - INFO - __main__ - Step 65215: {'lr': 0.00030673977935643116, 'samples': 12521280, 'steps': 65214, 'loss/train': 1.3268773555755615} 11/07/2021 06:25:02 - INFO - __main__ - Step 65216: {'lr': 0.00030673461108258207, 'samples': 12521472, 'steps': 65215, 'loss/train': 1.3913111686706543} 11/07/2021 06:25:02 - INFO - __main__ - Step 65217: {'lr': 0.0003067294427831692, 'samples': 12521664, 'steps': 65216, 'loss/train': 1.4161643981933594} 11/07/2021 06:25:03 - INFO - __main__ - Step 65218: {'lr': 0.00030672427445819486, 'samples': 12521856, 'steps': 65217, 'loss/train': 1.3136537075042725} 11/07/2021 06:25:04 - INFO - __main__ - Step 65219: {'lr': 0.00030671910610766145, 'samples': 12522048, 'steps': 65218, 'loss/train': 1.4916890859603882} 11/07/2021 06:25:04 - INFO - __main__ - Step 65220: {'lr': 0.0003067139377315713, 'samples': 12522240, 'steps': 65219, 'loss/train': 1.0302430391311646} 11/07/2021 06:25:04 - INFO - __main__ - Step 65221: {'lr': 0.00030670876932992674, 'samples': 12522432, 'steps': 65220, 'loss/train': 1.1383991241455078} 11/07/2021 06:25:05 - INFO - __main__ - Step 65222: {'lr': 0.0003067036009027301, 'samples': 12522624, 'steps': 65221, 'loss/train': 1.4917129278182983} 11/07/2021 06:25:06 - INFO - __main__ - Step 65223: {'lr': 0.0003066984324499837, 'samples': 12522816, 'steps': 65222, 'loss/train': 1.5316007137298584} 11/07/2021 06:25:06 - INFO - __main__ - Step 65224: {'lr': 0.0003066932639716898, 'samples': 12523008, 'steps': 65223, 'loss/train': 1.2381497621536255} 11/07/2021 06:25:07 - INFO - __main__ - Step 65225: {'lr': 0.0003066880954678508, 'samples': 12523200, 'steps': 65224, 'loss/train': 1.192180871963501} 11/07/2021 06:25:07 - INFO - __main__ - Step 65226: {'lr': 0.00030668292693846903, 'samples': 12523392, 'steps': 65225, 'loss/train': 1.2678412199020386} 11/07/2021 06:25:07 - INFO - __main__ - Step 65227: {'lr': 0.0003066777583835468, 'samples': 12523584, 'steps': 65226, 'loss/train': 1.374587893486023} 11/07/2021 06:25:08 - INFO - __main__ - Step 65228: {'lr': 0.0003066725898030865, 'samples': 12523776, 'steps': 65227, 'loss/train': 1.6138473749160767} 11/07/2021 06:25:09 - INFO - __main__ - Step 65229: {'lr': 0.0003066674211970904, 'samples': 12523968, 'steps': 65228, 'loss/train': 1.2600966691970825} 11/07/2021 06:25:09 - INFO - __main__ - Step 65230: {'lr': 0.0003066622525655608, 'samples': 12524160, 'steps': 65229, 'loss/train': 1.895508885383606} 11/07/2021 06:25:09 - INFO - __main__ - Step 65231: {'lr': 0.00030665708390850005, 'samples': 12524352, 'steps': 65230, 'loss/train': 1.0162614583969116} 11/07/2021 06:25:10 - INFO - __main__ - Step 65232: {'lr': 0.00030665191522591054, 'samples': 12524544, 'steps': 65231, 'loss/train': 1.263337254524231} 11/07/2021 06:25:11 - INFO - __main__ - Step 65233: {'lr': 0.0003066467465177945, 'samples': 12524736, 'steps': 65232, 'loss/train': 1.2253398895263672} 11/07/2021 06:25:11 - INFO - __main__ - Step 65234: {'lr': 0.0003066415777841543, 'samples': 12524928, 'steps': 65233, 'loss/train': 1.3942275047302246} 11/07/2021 06:25:12 - INFO - __main__ - Step 65235: {'lr': 0.0003066364090249923, 'samples': 12525120, 'steps': 65234, 'loss/train': 1.3984483480453491} 11/07/2021 06:25:12 - INFO - __main__ - Step 65236: {'lr': 0.00030663124024031085, 'samples': 12525312, 'steps': 65235, 'loss/train': 1.146461844444275} 11/07/2021 06:25:12 - INFO - __main__ - Step 65237: {'lr': 0.0003066260714301122, 'samples': 12525504, 'steps': 65236, 'loss/train': 1.3390684127807617} 11/07/2021 06:25:13 - INFO - __main__ - Step 65238: {'lr': 0.0003066209025943987, 'samples': 12525696, 'steps': 65237, 'loss/train': 1.1870280504226685} 11/07/2021 06:25:14 - INFO - __main__ - Step 65239: {'lr': 0.00030661573373317273, 'samples': 12525888, 'steps': 65238, 'loss/train': 0.9293753504753113} 11/07/2021 06:25:14 - INFO - __main__ - Step 65240: {'lr': 0.00030661056484643657, 'samples': 12526080, 'steps': 65239, 'loss/train': 1.3787378072738647} 11/07/2021 06:25:14 - INFO - __main__ - Step 65241: {'lr': 0.00030660539593419255, 'samples': 12526272, 'steps': 65240, 'loss/train': 1.38843834400177} 11/07/2021 06:25:15 - INFO - __main__ - Step 65242: {'lr': 0.0003066002269964431, 'samples': 12526464, 'steps': 65241, 'loss/train': 1.3540962934494019} 11/07/2021 06:25:16 - INFO - __main__ - Step 65243: {'lr': 0.0003065950580331904, 'samples': 12526656, 'steps': 65242, 'loss/train': 1.6879535913467407} 11/07/2021 06:25:16 - INFO - __main__ - Step 65244: {'lr': 0.00030658988904443677, 'samples': 12526848, 'steps': 65243, 'loss/train': 1.1856584548950195} 11/07/2021 06:25:16 - INFO - __main__ - Step 65245: {'lr': 0.00030658472003018466, 'samples': 12527040, 'steps': 65244, 'loss/train': 1.2542388439178467} 11/07/2021 06:25:17 - INFO - __main__ - Step 65246: {'lr': 0.00030657955099043635, 'samples': 12527232, 'steps': 65245, 'loss/train': 0.440455824136734} 11/07/2021 06:25:17 - INFO - __main__ - Step 65247: {'lr': 0.00030657438192519416, 'samples': 12527424, 'steps': 65246, 'loss/train': 1.434252381324768} 11/07/2021 06:25:18 - INFO - __main__ - Step 65248: {'lr': 0.0003065692128344604, 'samples': 12527616, 'steps': 65247, 'loss/train': 1.0341593027114868} 11/07/2021 06:25:19 - INFO - __main__ - Step 65249: {'lr': 0.00030656404371823753, 'samples': 12527808, 'steps': 65248, 'loss/train': 1.3738213777542114} 11/07/2021 06:25:19 - INFO - __main__ - Step 65250: {'lr': 0.0003065588745765277, 'samples': 12528000, 'steps': 65249, 'loss/train': 1.4690800905227661} 11/07/2021 06:25:19 - INFO - __main__ - Step 65251: {'lr': 0.0003065537054093333, 'samples': 12528192, 'steps': 65250, 'loss/train': 1.526599645614624} 11/07/2021 06:25:20 - INFO - __main__ - Step 65252: {'lr': 0.00030654853621665665, 'samples': 12528384, 'steps': 65251, 'loss/train': 0.9194687008857727} 11/07/2021 06:25:21 - INFO - __main__ - Step 65253: {'lr': 0.0003065433669985002, 'samples': 12528576, 'steps': 65252, 'loss/train': 1.5901665687561035} 11/07/2021 06:25:21 - INFO - __main__ - Step 65254: {'lr': 0.0003065381977548661, 'samples': 12528768, 'steps': 65253, 'loss/train': 1.3446277379989624} 11/07/2021 06:25:21 - INFO - __main__ - Step 65255: {'lr': 0.00030653302848575683, 'samples': 12528960, 'steps': 65254, 'loss/train': 1.208847165107727} 11/07/2021 06:25:22 - INFO - __main__ - Step 65256: {'lr': 0.00030652785919117466, 'samples': 12529152, 'steps': 65255, 'loss/train': 1.3390421867370605} 11/07/2021 06:25:22 - INFO - __main__ - Step 65257: {'lr': 0.0003065226898711218, 'samples': 12529344, 'steps': 65256, 'loss/train': 0.622122049331665} 11/07/2021 06:25:23 - INFO - __main__ - Step 65258: {'lr': 0.0003065175205256008, 'samples': 12529536, 'steps': 65257, 'loss/train': 1.4989283084869385} 11/07/2021 06:25:24 - INFO - __main__ - Step 65259: {'lr': 0.0003065123511546138, 'samples': 12529728, 'steps': 65258, 'loss/train': 1.4907664060592651} 11/07/2021 06:25:24 - INFO - __main__ - Step 65260: {'lr': 0.0003065071817581632, 'samples': 12529920, 'steps': 65259, 'loss/train': 0.9137238264083862} 11/07/2021 06:25:24 - INFO - __main__ - Step 65261: {'lr': 0.0003065020123362514, 'samples': 12530112, 'steps': 65260, 'loss/train': 1.1303925514221191} 11/07/2021 06:25:25 - INFO - __main__ - Step 65262: {'lr': 0.0003064968428888806, 'samples': 12530304, 'steps': 65261, 'loss/train': 1.0812450647354126} 11/07/2021 06:25:26 - INFO - __main__ - Step 65263: {'lr': 0.0003064916734160532, 'samples': 12530496, 'steps': 65262, 'loss/train': 1.3640908002853394} 11/07/2021 06:25:26 - INFO - __main__ - Step 65264: {'lr': 0.0003064865039177716, 'samples': 12530688, 'steps': 65263, 'loss/train': 1.15481698513031} 11/07/2021 06:25:27 - INFO - __main__ - Step 65265: {'lr': 0.00030648133439403795, 'samples': 12530880, 'steps': 65264, 'loss/train': 1.1069289445877075} 11/07/2021 06:25:27 - INFO - __main__ - Step 65266: {'lr': 0.00030647616484485475, 'samples': 12531072, 'steps': 65265, 'loss/train': 1.3950742483139038} 11/07/2021 06:25:27 - INFO - __main__ - Step 65267: {'lr': 0.00030647099527022424, 'samples': 12531264, 'steps': 65266, 'loss/train': 1.3846884965896606} 11/07/2021 06:25:28 - INFO - __main__ - Step 65268: {'lr': 0.0003064658256701488, 'samples': 12531456, 'steps': 65267, 'loss/train': 0.06464345753192902} 11/07/2021 06:25:29 - INFO - __main__ - Step 65269: {'lr': 0.0003064606560446308, 'samples': 12531648, 'steps': 65268, 'loss/train': 1.3369436264038086} 11/07/2021 06:25:29 - INFO - __main__ - Step 65270: {'lr': 0.0003064554863936723, 'samples': 12531840, 'steps': 65269, 'loss/train': 1.0447789430618286} 11/07/2021 06:25:29 - INFO - __main__ - Step 65271: {'lr': 0.000306450316717276, 'samples': 12532032, 'steps': 65270, 'loss/train': 1.525952935218811} 11/07/2021 06:25:30 - INFO - __main__ - Step 65272: {'lr': 0.00030644514701544395, 'samples': 12532224, 'steps': 65271, 'loss/train': 1.2215602397918701} 11/07/2021 06:25:30 - INFO - __main__ - Step 65273: {'lr': 0.00030643997728817864, 'samples': 12532416, 'steps': 65272, 'loss/train': 1.0906411409378052} 11/07/2021 06:25:31 - INFO - __main__ - Step 65274: {'lr': 0.0003064348075354823, 'samples': 12532608, 'steps': 65273, 'loss/train': 1.3487950563430786} 11/07/2021 06:25:31 - INFO - __main__ - Step 65275: {'lr': 0.00030642963775735733, 'samples': 12532800, 'steps': 65274, 'loss/train': 1.2626148462295532} 11/07/2021 06:25:32 - INFO - __main__ - Step 65276: {'lr': 0.00030642446795380615, 'samples': 12532992, 'steps': 65275, 'loss/train': 1.6189236640930176} 11/07/2021 06:25:32 - INFO - __main__ - Step 65277: {'lr': 0.0003064192981248308, 'samples': 12533184, 'steps': 65276, 'loss/train': 0.9955286979675293} 11/07/2021 06:25:33 - INFO - __main__ - Step 65278: {'lr': 0.0003064141282704339, 'samples': 12533376, 'steps': 65277, 'loss/train': 0.4586583375930786} 11/07/2021 06:25:34 - INFO - __main__ - Step 65279: {'lr': 0.0003064089583906176, 'samples': 12533568, 'steps': 65278, 'loss/train': 1.6662291288375854} 11/07/2021 06:25:34 - INFO - __main__ - Step 65280: {'lr': 0.0003064037884853843, 'samples': 12533760, 'steps': 65279, 'loss/train': 1.2878652811050415} 11/07/2021 06:25:34 - INFO - __main__ - Step 65281: {'lr': 0.00030639861855473634, 'samples': 12533952, 'steps': 65280, 'loss/train': 0.9062223434448242} 11/07/2021 06:25:35 - INFO - __main__ - Step 65282: {'lr': 0.000306393448598676, 'samples': 12534144, 'steps': 65281, 'loss/train': 1.170034408569336} 11/07/2021 06:25:35 - INFO - __main__ - Step 65283: {'lr': 0.00030638827861720574, 'samples': 12534336, 'steps': 65282, 'loss/train': 1.4481512308120728} 11/07/2021 06:25:36 - INFO - __main__ - Step 65284: {'lr': 0.00030638310861032773, 'samples': 12534528, 'steps': 65283, 'loss/train': 1.4988969564437866} 11/07/2021 06:25:36 - INFO - __main__ - Step 65285: {'lr': 0.00030637793857804437, 'samples': 12534720, 'steps': 65284, 'loss/train': 1.7949358224868774} 11/07/2021 06:25:37 - INFO - __main__ - Step 65286: {'lr': 0.00030637276852035793, 'samples': 12534912, 'steps': 65285, 'loss/train': 1.2390375137329102} 11/07/2021 06:25:37 - INFO - __main__ - Step 65287: {'lr': 0.00030636759843727086, 'samples': 12535104, 'steps': 65286, 'loss/train': 1.5138137340545654} 11/07/2021 06:25:37 - INFO - __main__ - Step 65288: {'lr': 0.0003063624283287854, 'samples': 12535296, 'steps': 65287, 'loss/train': 1.3260064125061035} 11/07/2021 06:25:39 - INFO - __main__ - Step 65289: {'lr': 0.0003063572581949039, 'samples': 12535488, 'steps': 65288, 'loss/train': 1.1325165033340454} 11/07/2021 06:25:39 - INFO - __main__ - Step 65290: {'lr': 0.00030635208803562867, 'samples': 12535680, 'steps': 65289, 'loss/train': 0.673934817314148} 11/07/2021 06:25:39 - INFO - __main__ - Step 65291: {'lr': 0.0003063469178509621, 'samples': 12535872, 'steps': 65290, 'loss/train': 1.6352676153182983} 11/07/2021 06:25:40 - INFO - __main__ - Step 65292: {'lr': 0.00030634174764090645, 'samples': 12536064, 'steps': 65291, 'loss/train': 0.674453854560852} 11/07/2021 06:25:40 - INFO - __main__ - Step 65293: {'lr': 0.00030633657740546403, 'samples': 12536256, 'steps': 65292, 'loss/train': 1.543519139289856} 11/07/2021 06:25:41 - INFO - __main__ - Step 65294: {'lr': 0.00030633140714463725, 'samples': 12536448, 'steps': 65293, 'loss/train': 1.8140647411346436} 11/07/2021 06:25:41 - INFO - __main__ - Step 65295: {'lr': 0.0003063262368584284, 'samples': 12536640, 'steps': 65294, 'loss/train': 0.9432654976844788} 11/07/2021 06:25:42 - INFO - __main__ - Step 65296: {'lr': 0.0003063210665468399, 'samples': 12536832, 'steps': 65295, 'loss/train': 1.594377040863037} 11/07/2021 06:25:42 - INFO - __main__ - Step 65297: {'lr': 0.00030631589620987393, 'samples': 12537024, 'steps': 65296, 'loss/train': 1.8406291007995605} 11/07/2021 06:25:42 - INFO - __main__ - Step 65298: {'lr': 0.0003063107258475329, 'samples': 12537216, 'steps': 65297, 'loss/train': 1.1904022693634033} 11/07/2021 06:25:44 - INFO - __main__ - Step 65299: {'lr': 0.0003063055554598191, 'samples': 12537408, 'steps': 65298, 'loss/train': 1.2325977087020874} 11/07/2021 06:25:44 - INFO - __main__ - Step 65300: {'lr': 0.0003063003850467349, 'samples': 12537600, 'steps': 65299, 'loss/train': 0.7672238349914551} 11/07/2021 06:25:44 - INFO - __main__ - Step 65301: {'lr': 0.0003062952146082826, 'samples': 12537792, 'steps': 65300, 'loss/train': 1.4220865964889526} 11/07/2021 06:25:45 - INFO - __main__ - Step 65302: {'lr': 0.00030629004414446453, 'samples': 12537984, 'steps': 65301, 'loss/train': 0.9292018413543701} 11/07/2021 06:25:45 - INFO - __main__ - Step 65303: {'lr': 0.00030628487365528314, 'samples': 12538176, 'steps': 65302, 'loss/train': 1.5103529691696167} 11/07/2021 06:25:46 - INFO - __main__ - Step 65304: {'lr': 0.0003062797031407406, 'samples': 12538368, 'steps': 65303, 'loss/train': 1.3974852561950684} 11/07/2021 06:25:46 - INFO - __main__ - Step 65305: {'lr': 0.0003062745326008393, 'samples': 12538560, 'steps': 65304, 'loss/train': 1.5416008234024048} 11/07/2021 06:25:47 - INFO - __main__ - Step 65306: {'lr': 0.0003062693620355815, 'samples': 12538752, 'steps': 65305, 'loss/train': 1.8038392066955566} 11/07/2021 06:25:47 - INFO - __main__ - Step 65307: {'lr': 0.00030626419144496957, 'samples': 12538944, 'steps': 65306, 'loss/train': 1.7893794775009155} 11/07/2021 06:25:47 - INFO - __main__ - Step 65308: {'lr': 0.0003062590208290059, 'samples': 12539136, 'steps': 65307, 'loss/train': 1.3844192028045654} 11/07/2021 06:25:48 - INFO - __main__ - Step 65309: {'lr': 0.00030625385018769285, 'samples': 12539328, 'steps': 65308, 'loss/train': 1.5625331401824951} 11/07/2021 06:25:49 - INFO - __main__ - Step 65310: {'lr': 0.0003062486795210327, 'samples': 12539520, 'steps': 65309, 'loss/train': 1.4057537317276} 11/07/2021 06:25:49 - INFO - __main__ - Step 65311: {'lr': 0.0003062435088290277, 'samples': 12539712, 'steps': 65310, 'loss/train': 1.1580102443695068} 11/07/2021 06:25:50 - INFO - __main__ - Step 65312: {'lr': 0.0003062383381116802, 'samples': 12539904, 'steps': 65311, 'loss/train': 1.1063733100891113} 11/07/2021 06:25:50 - INFO - __main__ - Step 65313: {'lr': 0.00030623316736899263, 'samples': 12540096, 'steps': 65312, 'loss/train': 1.284899115562439} 11/07/2021 06:25:50 - INFO - __main__ - Step 65314: {'lr': 0.00030622799660096723, 'samples': 12540288, 'steps': 65313, 'loss/train': 0.10301804542541504} 11/07/2021 06:25:52 - INFO - __main__ - Step 65315: {'lr': 0.0003062228258076064, 'samples': 12540480, 'steps': 65314, 'loss/train': 2.376560926437378} 11/07/2021 06:25:52 - INFO - __main__ - Step 65316: {'lr': 0.00030621765498891246, 'samples': 12540672, 'steps': 65315, 'loss/train': 1.310238242149353} 11/07/2021 06:25:52 - INFO - __main__ - Step 65317: {'lr': 0.0003062124841448877, 'samples': 12540864, 'steps': 65316, 'loss/train': 1.3661901950836182} 11/07/2021 06:25:53 - INFO - __main__ - Step 65318: {'lr': 0.00030620731327553444, 'samples': 12541056, 'steps': 65317, 'loss/train': 1.5427223443984985} 11/07/2021 06:25:53 - INFO - __main__ - Step 65319: {'lr': 0.000306202142380855, 'samples': 12541248, 'steps': 65318, 'loss/train': 0.6957752704620361} 11/07/2021 06:25:54 - INFO - __main__ - Step 65320: {'lr': 0.0003061969714608517, 'samples': 12541440, 'steps': 65319, 'loss/train': 0.672153890132904} 11/07/2021 06:25:54 - INFO - __main__ - Step 65321: {'lr': 0.00030619180051552695, 'samples': 12541632, 'steps': 65320, 'loss/train': 1.4652976989746094} 11/07/2021 06:25:55 - INFO - __main__ - Step 65322: {'lr': 0.00030618662954488314, 'samples': 12541824, 'steps': 65321, 'loss/train': 1.5017162561416626} 11/07/2021 06:25:55 - INFO - __main__ - Step 65323: {'lr': 0.00030618145854892245, 'samples': 12542016, 'steps': 65322, 'loss/train': 1.671249270439148} 11/07/2021 06:25:55 - INFO - __main__ - Step 65324: {'lr': 0.00030617628752764727, 'samples': 12542208, 'steps': 65323, 'loss/train': 1.3790513277053833} 11/07/2021 06:25:56 - INFO - __main__ - Step 65325: {'lr': 0.0003061711164810598, 'samples': 12542400, 'steps': 65324, 'loss/train': 1.0591002702713013} 11/07/2021 06:25:57 - INFO - __main__ - Step 65326: {'lr': 0.00030616594540916264, 'samples': 12542592, 'steps': 65325, 'loss/train': 0.13060006499290466} 11/07/2021 06:25:57 - INFO - __main__ - Step 65327: {'lr': 0.0003061607743119579, 'samples': 12542784, 'steps': 65326, 'loss/train': 1.5515828132629395} 11/07/2021 06:25:57 - INFO - __main__ - Step 65328: {'lr': 0.000306155603189448, 'samples': 12542976, 'steps': 65327, 'loss/train': 1.350666880607605} 11/07/2021 06:25:58 - INFO - __main__ - Step 65329: {'lr': 0.0003061504320416352, 'samples': 12543168, 'steps': 65328, 'loss/train': 1.3022546768188477} 11/07/2021 06:25:59 - INFO - __main__ - Step 65330: {'lr': 0.000306145260868522, 'samples': 12543360, 'steps': 65329, 'loss/train': 1.5600100755691528} 11/07/2021 06:25:59 - INFO - __main__ - Step 65331: {'lr': 0.0003061400896701105, 'samples': 12543552, 'steps': 65330, 'loss/train': 1.4711719751358032} 11/07/2021 06:26:00 - INFO - __main__ - Step 65332: {'lr': 0.00030613491844640325, 'samples': 12543744, 'steps': 65331, 'loss/train': 1.3107181787490845} 11/07/2021 06:26:00 - INFO - __main__ - Step 65333: {'lr': 0.0003061297471974024, 'samples': 12543936, 'steps': 65332, 'loss/train': 0.7266539931297302} 11/07/2021 06:26:00 - INFO - __main__ - Step 65334: {'lr': 0.0003061245759231103, 'samples': 12544128, 'steps': 65333, 'loss/train': 0.9353621602058411} 11/07/2021 06:26:01 - INFO - __main__ - Step 65335: {'lr': 0.0003061194046235295, 'samples': 12544320, 'steps': 65334, 'loss/train': 1.2545524835586548} 11/07/2021 06:26:02 - INFO - __main__ - Step 65336: {'lr': 0.00030611423329866204, 'samples': 12544512, 'steps': 65335, 'loss/train': 1.292264699935913} 11/07/2021 06:26:02 - INFO - __main__ - Step 65337: {'lr': 0.0003061090619485104, 'samples': 12544704, 'steps': 65336, 'loss/train': 1.1664252281188965} 11/07/2021 06:26:02 - INFO - __main__ - Step 65338: {'lr': 0.0003061038905730769, 'samples': 12544896, 'steps': 65337, 'loss/train': 1.4863401651382446} 11/07/2021 06:26:03 - INFO - __main__ - Step 65339: {'lr': 0.00030609871917236373, 'samples': 12545088, 'steps': 65338, 'loss/train': 1.643831729888916} 11/07/2021 06:26:04 - INFO - __main__ - Step 65340: {'lr': 0.00030609354774637344, 'samples': 12545280, 'steps': 65339, 'loss/train': 1.286373496055603} 11/07/2021 06:26:04 - INFO - __main__ - Step 65341: {'lr': 0.00030608837629510834, 'samples': 12545472, 'steps': 65340, 'loss/train': 1.7023710012435913} 11/07/2021 06:26:04 - INFO - __main__ - Step 65342: {'lr': 0.00030608320481857054, 'samples': 12545664, 'steps': 65341, 'loss/train': 1.4312047958374023} 11/07/2021 06:26:05 - INFO - __main__ - Step 65343: {'lr': 0.0003060780333167626, 'samples': 12545856, 'steps': 65342, 'loss/train': 1.5016206502914429} 11/07/2021 06:26:05 - INFO - __main__ - Step 65344: {'lr': 0.00030607286178968677, 'samples': 12546048, 'steps': 65343, 'loss/train': 1.0163733959197998} 11/07/2021 06:26:06 - INFO - __main__ - Step 65345: {'lr': 0.00030606769023734534, 'samples': 12546240, 'steps': 65344, 'loss/train': 1.255252480506897} 11/07/2021 06:26:07 - INFO - __main__ - Step 65346: {'lr': 0.00030606251865974066, 'samples': 12546432, 'steps': 65345, 'loss/train': 1.469844102859497} 11/07/2021 06:26:07 - INFO - __main__ - Step 65347: {'lr': 0.0003060573470568751, 'samples': 12546624, 'steps': 65346, 'loss/train': 1.4484995603561401} 11/07/2021 06:26:07 - INFO - __main__ - Step 65348: {'lr': 0.00030605217542875097, 'samples': 12546816, 'steps': 65347, 'loss/train': 1.5174674987792969} 11/07/2021 06:26:08 - INFO - __main__ - Step 65349: {'lr': 0.0003060470037753705, 'samples': 12547008, 'steps': 65348, 'loss/train': 1.7742592096328735} 11/07/2021 06:26:08 - INFO - __main__ - Step 65350: {'lr': 0.00030604183209673625, 'samples': 12547200, 'steps': 65349, 'loss/train': 0.5503422617912292} 11/07/2021 06:26:09 - INFO - __main__ - Step 65351: {'lr': 0.0003060366603928504, 'samples': 12547392, 'steps': 65350, 'loss/train': 0.10929877310991287} 11/07/2021 06:26:10 - INFO - __main__ - Step 65352: {'lr': 0.00030603148866371524, 'samples': 12547584, 'steps': 65351, 'loss/train': 1.2644262313842773} 11/07/2021 06:26:10 - INFO - __main__ - Step 65353: {'lr': 0.0003060263169093332, 'samples': 12547776, 'steps': 65352, 'loss/train': 1.4725334644317627} 11/07/2021 06:26:10 - INFO - __main__ - Step 65354: {'lr': 0.0003060211451297065, 'samples': 12547968, 'steps': 65353, 'loss/train': 1.4502850770950317} 11/07/2021 06:26:11 - INFO - __main__ - Step 65355: {'lr': 0.00030601597332483753, 'samples': 12548160, 'steps': 65354, 'loss/train': 1.620972990989685} 11/07/2021 06:26:12 - INFO - __main__ - Step 65356: {'lr': 0.0003060108014947287, 'samples': 12548352, 'steps': 65355, 'loss/train': 1.425174593925476} 11/07/2021 06:26:12 - INFO - __main__ - Step 65357: {'lr': 0.0003060056296393823, 'samples': 12548544, 'steps': 65356, 'loss/train': 1.1491528749465942} 11/07/2021 06:26:12 - INFO - __main__ - Step 65358: {'lr': 0.00030600045775880055, 'samples': 12548736, 'steps': 65357, 'loss/train': 1.1318130493164062} 11/07/2021 06:26:13 - INFO - __main__ - Step 65359: {'lr': 0.00030599528585298585, 'samples': 12548928, 'steps': 65358, 'loss/train': 1.7956703901290894} 11/07/2021 06:26:13 - INFO - __main__ - Step 65360: {'lr': 0.00030599011392194053, 'samples': 12549120, 'steps': 65359, 'loss/train': 1.3804296255111694} 11/07/2021 06:26:14 - INFO - __main__ - Step 65361: {'lr': 0.000305984941965667, 'samples': 12549312, 'steps': 65360, 'loss/train': 1.613926887512207} 11/07/2021 06:26:15 - INFO - __main__ - Step 65362: {'lr': 0.0003059797699841674, 'samples': 12549504, 'steps': 65361, 'loss/train': 1.4630143642425537} 11/07/2021 06:26:15 - INFO - __main__ - Step 65363: {'lr': 0.00030597459797744434, 'samples': 12549696, 'steps': 65362, 'loss/train': 1.526277780532837} 11/07/2021 06:26:15 - INFO - __main__ - Step 65364: {'lr': 0.0003059694259454999, 'samples': 12549888, 'steps': 65363, 'loss/train': 1.6788201332092285} 11/07/2021 06:26:16 - INFO - __main__ - Step 65365: {'lr': 0.00030596425388833656, 'samples': 12550080, 'steps': 65364, 'loss/train': 1.4097588062286377} 11/07/2021 06:26:16 - INFO - __main__ - Step 65366: {'lr': 0.0003059590818059565, 'samples': 12550272, 'steps': 65365, 'loss/train': 1.2689449787139893} 11/07/2021 06:26:17 - INFO - __main__ - Step 65367: {'lr': 0.0003059539096983622, 'samples': 12550464, 'steps': 65366, 'loss/train': 1.5787996053695679} 11/07/2021 06:26:17 - INFO - __main__ - Step 65368: {'lr': 0.00030594873756555584, 'samples': 12550656, 'steps': 65367, 'loss/train': 1.88141930103302} 11/07/2021 06:26:18 - INFO - __main__ - Step 65369: {'lr': 0.00030594356540753994, 'samples': 12550848, 'steps': 65368, 'loss/train': 0.9840074181556702} 11/07/2021 06:26:18 - INFO - __main__ - Step 65370: {'lr': 0.0003059383932243168, 'samples': 12551040, 'steps': 65369, 'loss/train': 1.4921737909317017} 11/07/2021 06:26:19 - INFO - __main__ - Step 65371: {'lr': 0.0003059332210158886, 'samples': 12551232, 'steps': 65370, 'loss/train': 1.3470547199249268} 11/07/2021 06:26:20 - INFO - __main__ - Step 65372: {'lr': 0.00030592804878225765, 'samples': 12551424, 'steps': 65371, 'loss/train': 1.3032782077789307} 11/07/2021 06:26:20 - INFO - __main__ - Step 65373: {'lr': 0.00030592287652342646, 'samples': 12551616, 'steps': 65372, 'loss/train': 1.3700975179672241} 11/07/2021 06:26:20 - INFO - __main__ - Step 65374: {'lr': 0.0003059177042393974, 'samples': 12551808, 'steps': 65373, 'loss/train': 1.6109315156936646} 11/07/2021 06:26:21 - INFO - __main__ - Step 65375: {'lr': 0.0003059125319301725, 'samples': 12552000, 'steps': 65374, 'loss/train': 1.3162117004394531} 11/07/2021 06:26:21 - INFO - __main__ - Step 65376: {'lr': 0.0003059073595957544, 'samples': 12552192, 'steps': 65375, 'loss/train': 1.6721082925796509} 11/07/2021 06:26:22 - INFO - __main__ - Step 65377: {'lr': 0.00030590218723614525, 'samples': 12552384, 'steps': 65376, 'loss/train': 1.5048508644104004} 11/07/2021 06:26:23 - INFO - __main__ - Step 65378: {'lr': 0.0003058970148513475, 'samples': 12552576, 'steps': 65377, 'loss/train': 1.6767280101776123} 11/07/2021 06:26:23 - INFO - __main__ - Step 65379: {'lr': 0.0003058918424413634, 'samples': 12552768, 'steps': 65378, 'loss/train': 1.4977778196334839} 11/07/2021 06:26:23 - INFO - __main__ - Step 65380: {'lr': 0.0003058866700061952, 'samples': 12552960, 'steps': 65379, 'loss/train': 0.8864845633506775} 11/07/2021 06:26:24 - INFO - __main__ - Step 65381: {'lr': 0.00030588149754584543, 'samples': 12553152, 'steps': 65380, 'loss/train': 0.9508771300315857} 11/07/2021 06:26:25 - INFO - __main__ - Step 65382: {'lr': 0.00030587632506031624, 'samples': 12553344, 'steps': 65381, 'loss/train': 1.4901223182678223} 11/07/2021 06:26:25 - INFO - __main__ - Step 65383: {'lr': 0.0003058711525496102, 'samples': 12553536, 'steps': 65382, 'loss/train': 1.1886242628097534} 11/07/2021 06:26:25 - INFO - __main__ - Step 65384: {'lr': 0.00030586598001372935, 'samples': 12553728, 'steps': 65383, 'loss/train': 1.5547964572906494} 11/07/2021 06:26:26 - INFO - __main__ - Step 65385: {'lr': 0.0003058608074526762, 'samples': 12553920, 'steps': 65384, 'loss/train': 1.4220030307769775} 11/07/2021 06:26:26 - INFO - __main__ - Step 65386: {'lr': 0.000305855634866453, 'samples': 12554112, 'steps': 65385, 'loss/train': 1.360510230064392} 11/07/2021 06:26:27 - INFO - __main__ - Step 65387: {'lr': 0.00030585046225506206, 'samples': 12554304, 'steps': 65386, 'loss/train': 1.786932110786438} 11/07/2021 06:26:28 - INFO - __main__ - Step 65388: {'lr': 0.00030584528961850584, 'samples': 12554496, 'steps': 65387, 'loss/train': 1.3931899070739746} 11/07/2021 06:26:28 - INFO - __main__ - Step 65389: {'lr': 0.0003058401169567865, 'samples': 12554688, 'steps': 65388, 'loss/train': 1.688538908958435} 11/07/2021 06:26:28 - INFO - __main__ - Step 65390: {'lr': 0.0003058349442699067, 'samples': 12554880, 'steps': 65389, 'loss/train': 0.39479324221611023} 11/07/2021 06:26:29 - INFO - __main__ - Step 65391: {'lr': 0.00030582977155786835, 'samples': 12555072, 'steps': 65390, 'loss/train': 1.378877878189087} 11/07/2021 06:26:29 - INFO - __main__ - Step 65392: {'lr': 0.000305824598820674, 'samples': 12555264, 'steps': 65391, 'loss/train': 1.2278141975402832} 11/07/2021 06:26:31 - INFO - __main__ - Step 65393: {'lr': 0.0003058194260583259, 'samples': 12555456, 'steps': 65392, 'loss/train': 1.105849027633667} 11/07/2021 06:26:31 - INFO - __main__ - Step 65394: {'lr': 0.00030581425327082647, 'samples': 12555648, 'steps': 65393, 'loss/train': 1.8815479278564453} 11/07/2021 06:26:31 - INFO - __main__ - Step 65395: {'lr': 0.000305809080458178, 'samples': 12555840, 'steps': 65394, 'loss/train': 1.7891905307769775} 11/07/2021 06:26:32 - INFO - __main__ - Step 65396: {'lr': 0.00030580390762038277, 'samples': 12556032, 'steps': 65395, 'loss/train': 1.3263767957687378} 11/07/2021 06:26:32 - INFO - __main__ - Step 65397: {'lr': 0.0003057987347574433, 'samples': 12556224, 'steps': 65396, 'loss/train': 1.1058096885681152} 11/07/2021 06:26:32 - INFO - __main__ - Step 65398: {'lr': 0.00030579356186936164, 'samples': 12556416, 'steps': 65397, 'loss/train': 1.4431127309799194} 11/07/2021 06:26:34 - INFO - __main__ - Step 65399: {'lr': 0.00030578838895614033, 'samples': 12556608, 'steps': 65398, 'loss/train': 1.790960431098938} 11/07/2021 06:26:34 - INFO - __main__ - Step 65400: {'lr': 0.0003057832160177816, 'samples': 12556800, 'steps': 65399, 'loss/train': 1.2297438383102417} 11/07/2021 06:26:34 - INFO - __main__ - Step 65401: {'lr': 0.0003057780430542878, 'samples': 12556992, 'steps': 65400, 'loss/train': 1.082533836364746} 11/07/2021 06:26:35 - INFO - __main__ - Step 65402: {'lr': 0.00030577287006566134, 'samples': 12557184, 'steps': 65401, 'loss/train': 0.04597718268632889} 11/07/2021 06:26:35 - INFO - __main__ - Step 65403: {'lr': 0.00030576769705190445, 'samples': 12557376, 'steps': 65402, 'loss/train': 0.9942106008529663} 11/07/2021 06:26:35 - INFO - __main__ - Step 65404: {'lr': 0.0003057625240130195, 'samples': 12557568, 'steps': 65403, 'loss/train': 1.3398869037628174} 11/07/2021 06:26:36 - INFO - __main__ - Step 65405: {'lr': 0.0003057573509490088, 'samples': 12557760, 'steps': 65404, 'loss/train': 1.8214473724365234} 11/07/2021 06:26:37 - INFO - __main__ - Step 65406: {'lr': 0.00030575217785987473, 'samples': 12557952, 'steps': 65405, 'loss/train': 1.6190582513809204} 11/07/2021 06:26:37 - INFO - __main__ - Step 65407: {'lr': 0.00030574700474561957, 'samples': 12558144, 'steps': 65406, 'loss/train': 1.4322973489761353} 11/07/2021 06:26:37 - INFO - __main__ - Step 65408: {'lr': 0.0003057418316062456, 'samples': 12558336, 'steps': 65407, 'loss/train': 1.366652011871338} 11/07/2021 06:26:38 - INFO - __main__ - Step 65409: {'lr': 0.0003057366584417553, 'samples': 12558528, 'steps': 65408, 'loss/train': 1.4598286151885986} 11/07/2021 06:26:39 - INFO - __main__ - Step 65410: {'lr': 0.000305731485252151, 'samples': 12558720, 'steps': 65409, 'loss/train': 1.933537244796753} 11/07/2021 06:26:39 - INFO - __main__ - Step 65411: {'lr': 0.0003057263120374348, 'samples': 12558912, 'steps': 65410, 'loss/train': 1.2295336723327637} 11/07/2021 06:26:40 - INFO - __main__ - Step 65412: {'lr': 0.00030572113879760927, 'samples': 12559104, 'steps': 65411, 'loss/train': 0.724977433681488} 11/07/2021 06:26:40 - INFO - __main__ - Step 65413: {'lr': 0.0003057159655326766, 'samples': 12559296, 'steps': 65412, 'loss/train': 1.5012683868408203} 11/07/2021 06:26:40 - INFO - __main__ - Step 65414: {'lr': 0.0003057107922426392, 'samples': 12559488, 'steps': 65413, 'loss/train': 1.4906957149505615} 11/07/2021 06:26:41 - INFO - __main__ - Step 65415: {'lr': 0.00030570561892749945, 'samples': 12559680, 'steps': 65414, 'loss/train': 1.3957499265670776} 11/07/2021 06:26:42 - INFO - __main__ - Step 65416: {'lr': 0.00030570044558725953, 'samples': 12559872, 'steps': 65415, 'loss/train': 1.613613486289978} 11/07/2021 06:26:42 - INFO - __main__ - Step 65417: {'lr': 0.00030569527222192185, 'samples': 12560064, 'steps': 65416, 'loss/train': 1.6947405338287354} 11/07/2021 06:26:42 - INFO - __main__ - Step 65418: {'lr': 0.00030569009883148874, 'samples': 12560256, 'steps': 65417, 'loss/train': 1.3531324863433838} 11/07/2021 06:26:43 - INFO - __main__ - Step 65419: {'lr': 0.0003056849254159625, 'samples': 12560448, 'steps': 65418, 'loss/train': 1.6068952083587646} 11/07/2021 06:26:44 - INFO - __main__ - Step 65420: {'lr': 0.0003056797519753456, 'samples': 12560640, 'steps': 65419, 'loss/train': 0.532558262348175} 11/07/2021 06:26:44 - INFO - __main__ - Step 65421: {'lr': 0.0003056745785096402, 'samples': 12560832, 'steps': 65420, 'loss/train': 1.2660366296768188} 11/07/2021 06:26:45 - INFO - __main__ - Step 65422: {'lr': 0.00030566940501884865, 'samples': 12561024, 'steps': 65421, 'loss/train': 1.4270589351654053} 11/07/2021 06:26:45 - INFO - __main__ - Step 65423: {'lr': 0.00030566423150297335, 'samples': 12561216, 'steps': 65422, 'loss/train': 1.1242048740386963} 11/07/2021 06:26:45 - INFO - __main__ - Step 65424: {'lr': 0.00030565905796201665, 'samples': 12561408, 'steps': 65423, 'loss/train': 1.266587257385254} 11/07/2021 06:26:46 - INFO - __main__ - Step 65425: {'lr': 0.00030565388439598084, 'samples': 12561600, 'steps': 65424, 'loss/train': 0.11042986810207367} 11/07/2021 06:26:47 - INFO - __main__ - Step 65426: {'lr': 0.00030564871080486825, 'samples': 12561792, 'steps': 65425, 'loss/train': 1.274327278137207} 11/07/2021 06:26:47 - INFO - __main__ - Step 65427: {'lr': 0.0003056435371886811, 'samples': 12561984, 'steps': 65426, 'loss/train': 1.092172622680664} 11/07/2021 06:26:47 - INFO - __main__ - Step 65428: {'lr': 0.00030563836354742193, 'samples': 12562176, 'steps': 65427, 'loss/train': 0.5655176043510437} 11/07/2021 06:26:48 - INFO - __main__ - Step 65429: {'lr': 0.000305633189881093, 'samples': 12562368, 'steps': 65428, 'loss/train': 1.0904723405838013} 11/07/2021 06:26:49 - INFO - __main__ - Step 65430: {'lr': 0.0003056280161896965, 'samples': 12562560, 'steps': 65429, 'loss/train': 1.2971993684768677} 11/07/2021 06:26:49 - INFO - __main__ - Step 65431: {'lr': 0.00030562284247323497, 'samples': 12562752, 'steps': 65430, 'loss/train': 1.5035005807876587} 11/07/2021 06:26:50 - INFO - __main__ - Step 65432: {'lr': 0.0003056176687317106, 'samples': 12562944, 'steps': 65431, 'loss/train': 1.5377336740493774} 11/07/2021 06:26:50 - INFO - __main__ - Step 65433: {'lr': 0.00030561249496512577, 'samples': 12563136, 'steps': 65432, 'loss/train': 1.6248669624328613} 11/07/2021 06:26:50 - INFO - __main__ - Step 65434: {'lr': 0.00030560732117348283, 'samples': 12563328, 'steps': 65433, 'loss/train': 1.4574501514434814} 11/07/2021 06:26:51 - INFO - __main__ - Step 65435: {'lr': 0.00030560214735678403, 'samples': 12563520, 'steps': 65434, 'loss/train': 1.6956439018249512} 11/07/2021 06:26:52 - INFO - __main__ - Step 65436: {'lr': 0.00030559697351503187, 'samples': 12563712, 'steps': 65435, 'loss/train': 1.2988120317459106} 11/07/2021 06:26:52 - INFO - __main__ - Step 65437: {'lr': 0.0003055917996482285, 'samples': 12563904, 'steps': 65436, 'loss/train': 1.25798761844635} 11/07/2021 06:26:52 - INFO - __main__ - Step 65438: {'lr': 0.00030558662575637635, 'samples': 12564096, 'steps': 65437, 'loss/train': 1.259636402130127} 11/07/2021 06:26:53 - INFO - __main__ - Step 65439: {'lr': 0.0003055814518394777, 'samples': 12564288, 'steps': 65438, 'loss/train': 1.4605661630630493} 11/07/2021 06:26:54 - INFO - __main__ - Step 65440: {'lr': 0.0003055762778975349, 'samples': 12564480, 'steps': 65439, 'loss/train': 1.560842752456665} 11/07/2021 06:26:54 - INFO - __main__ - Step 65441: {'lr': 0.0003055711039305503, 'samples': 12564672, 'steps': 65440, 'loss/train': 0.7228081226348877} 11/07/2021 06:26:54 - INFO - __main__ - Step 65442: {'lr': 0.0003055659299385262, 'samples': 12564864, 'steps': 65441, 'loss/train': 1.7169851064682007} 11/07/2021 06:26:55 - INFO - __main__ - Step 65443: {'lr': 0.00030556075592146493, 'samples': 12565056, 'steps': 65442, 'loss/train': 1.3450257778167725} 11/07/2021 06:26:55 - INFO - __main__ - Step 65444: {'lr': 0.00030555558187936896, 'samples': 12565248, 'steps': 65443, 'loss/train': 1.2382526397705078} 11/07/2021 06:26:56 - INFO - __main__ - Step 65445: {'lr': 0.00030555040781224044, 'samples': 12565440, 'steps': 65444, 'loss/train': 1.344701886177063} 11/07/2021 06:26:57 - INFO - __main__ - Step 65446: {'lr': 0.0003055452337200817, 'samples': 12565632, 'steps': 65445, 'loss/train': 1.373091697692871} 11/07/2021 06:26:57 - INFO - __main__ - Step 65447: {'lr': 0.00030554005960289513, 'samples': 12565824, 'steps': 65446, 'loss/train': 1.2128839492797852} 11/07/2021 06:26:57 - INFO - __main__ - Step 65448: {'lr': 0.0003055348854606831, 'samples': 12566016, 'steps': 65447, 'loss/train': 1.2320806980133057} 11/07/2021 06:26:58 - INFO - __main__ - Step 65449: {'lr': 0.000305529711293448, 'samples': 12566208, 'steps': 65448, 'loss/train': 1.1446666717529297} 11/07/2021 06:26:58 - INFO - __main__ - Step 65450: {'lr': 0.0003055245371011919, 'samples': 12566400, 'steps': 65449, 'loss/train': 1.6462717056274414} 11/07/2021 06:26:59 - INFO - __main__ - Step 65451: {'lr': 0.00030551936288391744, 'samples': 12566592, 'steps': 65450, 'loss/train': 1.5427770614624023} 11/07/2021 06:27:00 - INFO - __main__ - Step 65452: {'lr': 0.0003055141886416268, 'samples': 12566784, 'steps': 65451, 'loss/train': 0.8661101460456848} 11/07/2021 06:27:00 - INFO - __main__ - Step 65453: {'lr': 0.0003055090143743223, 'samples': 12566976, 'steps': 65452, 'loss/train': 1.0707815885543823} 11/07/2021 06:27:00 - INFO - __main__ - Step 65454: {'lr': 0.00030550384008200623, 'samples': 12567168, 'steps': 65453, 'loss/train': 1.794528841972351} 11/07/2021 06:27:01 - INFO - __main__ - Step 65455: {'lr': 0.00030549866576468104, 'samples': 12567360, 'steps': 65454, 'loss/train': 1.9410419464111328} 11/07/2021 06:27:01 - INFO - __main__ - Step 65456: {'lr': 0.000305493491422349, 'samples': 12567552, 'steps': 65455, 'loss/train': 1.3247363567352295} 11/07/2021 06:27:02 - INFO - __main__ - Step 65457: {'lr': 0.0003054883170550125, 'samples': 12567744, 'steps': 65456, 'loss/train': 1.157758116722107} 11/07/2021 06:27:02 - INFO - __main__ - Step 65458: {'lr': 0.0003054831426626737, 'samples': 12567936, 'steps': 65457, 'loss/train': 1.0881963968276978} 11/07/2021 06:27:03 - INFO - __main__ - Step 65459: {'lr': 0.00030547796824533516, 'samples': 12568128, 'steps': 65458, 'loss/train': 0.3237638771533966} 11/07/2021 06:27:03 - INFO - __main__ - Step 65460: {'lr': 0.0003054727938029991, 'samples': 12568320, 'steps': 65459, 'loss/train': 1.220163345336914} 11/07/2021 06:27:03 - INFO - __main__ - Step 65461: {'lr': 0.0003054676193356678, 'samples': 12568512, 'steps': 65460, 'loss/train': 1.4308332204818726} 11/07/2021 06:27:04 - INFO - __main__ - Step 65462: {'lr': 0.00030546244484334364, 'samples': 12568704, 'steps': 65461, 'loss/train': 1.7021028995513916} 11/07/2021 06:27:05 - INFO - __main__ - Step 65463: {'lr': 0.000305457270326029, 'samples': 12568896, 'steps': 65462, 'loss/train': 1.55368971824646} 11/07/2021 06:27:05 - INFO - __main__ - Step 65464: {'lr': 0.00030545209578372617, 'samples': 12569088, 'steps': 65463, 'loss/train': 0.7483497858047485} 11/07/2021 06:27:06 - INFO - __main__ - Step 65465: {'lr': 0.00030544692121643746, 'samples': 12569280, 'steps': 65464, 'loss/train': 0.2766513228416443} 11/07/2021 06:27:06 - INFO - __main__ - Step 65466: {'lr': 0.00030544174662416526, 'samples': 12569472, 'steps': 65465, 'loss/train': 1.5296518802642822} 11/07/2021 06:27:07 - INFO - __main__ - Step 65467: {'lr': 0.0003054365720069118, 'samples': 12569664, 'steps': 65466, 'loss/train': 1.785591721534729} 11/07/2021 06:27:07 - INFO - __main__ - Step 65468: {'lr': 0.0003054313973646795, 'samples': 12569856, 'steps': 65467, 'loss/train': 0.9967355132102966} 11/07/2021 06:27:08 - INFO - __main__ - Step 65469: {'lr': 0.0003054262226974708, 'samples': 12570048, 'steps': 65468, 'loss/train': 1.529024362564087} 11/07/2021 06:27:08 - INFO - __main__ - Step 65470: {'lr': 0.0003054210480052877, 'samples': 12570240, 'steps': 65469, 'loss/train': 1.0163606405258179} 11/07/2021 06:27:08 - INFO - __main__ - Step 65471: {'lr': 0.0003054158732881328, 'samples': 12570432, 'steps': 65470, 'loss/train': 1.2930325269699097} 11/07/2021 06:27:10 - INFO - __main__ - Step 65472: {'lr': 0.0003054106985460084, 'samples': 12570624, 'steps': 65471, 'loss/train': 1.2481393814086914} 11/07/2021 06:27:10 - INFO - __main__ - Step 65473: {'lr': 0.00030540552377891674, 'samples': 12570816, 'steps': 65472, 'loss/train': 1.3565151691436768} 11/07/2021 06:27:10 - INFO - __main__ - Step 65474: {'lr': 0.00030540034898686024, 'samples': 12571008, 'steps': 65473, 'loss/train': 0.9360339045524597} 11/07/2021 06:27:11 - INFO - __main__ - Step 65475: {'lr': 0.00030539517416984123, 'samples': 12571200, 'steps': 65474, 'loss/train': 1.2349683046340942} 11/07/2021 06:27:11 - INFO - __main__ - Step 65476: {'lr': 0.000305389999327862, 'samples': 12571392, 'steps': 65475, 'loss/train': 0.6391121745109558} 11/07/2021 06:27:11 - INFO - __main__ - Step 65477: {'lr': 0.0003053848244609248, 'samples': 12571584, 'steps': 65476, 'loss/train': 1.5572962760925293} 11/07/2021 06:27:12 - INFO - __main__ - Step 65478: {'lr': 0.0003053796495690321, 'samples': 12571776, 'steps': 65477, 'loss/train': 1.3992671966552734} 11/07/2021 06:27:13 - INFO - __main__ - Step 65479: {'lr': 0.00030537447465218623, 'samples': 12571968, 'steps': 65478, 'loss/train': 1.5643736124038696} 11/07/2021 06:27:13 - INFO - __main__ - Step 65480: {'lr': 0.00030536929971038953, 'samples': 12572160, 'steps': 65479, 'loss/train': 1.1206386089324951} 11/07/2021 06:27:13 - INFO - __main__ - Step 65481: {'lr': 0.00030536412474364415, 'samples': 12572352, 'steps': 65480, 'loss/train': 1.6401299238204956} 11/07/2021 06:27:14 - INFO - __main__ - Step 65482: {'lr': 0.0003053589497519526, 'samples': 12572544, 'steps': 65481, 'loss/train': 1.5834903717041016} 11/07/2021 06:27:15 - INFO - __main__ - Step 65483: {'lr': 0.0003053537747353171, 'samples': 12572736, 'steps': 65482, 'loss/train': 1.0578091144561768} 11/07/2021 06:27:15 - INFO - __main__ - Step 65484: {'lr': 0.00030534859969374013, 'samples': 12572928, 'steps': 65483, 'loss/train': 0.6155405640602112} 11/07/2021 06:27:16 - INFO - __main__ - Step 65485: {'lr': 0.00030534342462722387, 'samples': 12573120, 'steps': 65484, 'loss/train': 1.9000731706619263} 11/07/2021 06:27:16 - INFO - __main__ - Step 65486: {'lr': 0.00030533824953577084, 'samples': 12573312, 'steps': 65485, 'loss/train': 1.2497289180755615} 11/07/2021 06:27:16 - INFO - __main__ - Step 65487: {'lr': 0.0003053330744193831, 'samples': 12573504, 'steps': 65486, 'loss/train': 1.3706153631210327} 11/07/2021 06:27:17 - INFO - __main__ - Step 65488: {'lr': 0.0003053278992780632, 'samples': 12573696, 'steps': 65487, 'loss/train': 1.3235225677490234} 11/07/2021 06:27:18 - INFO - __main__ - Step 65489: {'lr': 0.0003053227241118134, 'samples': 12573888, 'steps': 65488, 'loss/train': 1.5836257934570312} 11/07/2021 06:27:18 - INFO - __main__ - Step 65490: {'lr': 0.000305317548920636, 'samples': 12574080, 'steps': 65489, 'loss/train': 1.403812289237976} 11/07/2021 06:27:18 - INFO - __main__ - Step 65491: {'lr': 0.0003053123737045335, 'samples': 12574272, 'steps': 65490, 'loss/train': 1.2586967945098877} 11/07/2021 06:27:19 - INFO - __main__ - Step 65492: {'lr': 0.0003053071984635079, 'samples': 12574464, 'steps': 65491, 'loss/train': 1.2716727256774902} 11/07/2021 06:27:20 - INFO - __main__ - Step 65493: {'lr': 0.00030530202319756184, 'samples': 12574656, 'steps': 65492, 'loss/train': 1.5441497564315796} 11/07/2021 06:27:20 - INFO - __main__ - Step 65494: {'lr': 0.0003052968479066975, 'samples': 12574848, 'steps': 65493, 'loss/train': 1.296195149421692} 11/07/2021 06:27:21 - INFO - __main__ - Step 65495: {'lr': 0.0003052916725909173, 'samples': 12575040, 'steps': 65494, 'loss/train': 1.294801950454712} 11/07/2021 06:27:21 - INFO - __main__ - Step 65496: {'lr': 0.00030528649725022346, 'samples': 12575232, 'steps': 65495, 'loss/train': 1.457506775856018} 11/07/2021 06:27:21 - INFO - __main__ - Step 65497: {'lr': 0.0003052813218846184, 'samples': 12575424, 'steps': 65496, 'loss/train': 1.0814287662506104} 11/07/2021 06:27:22 - INFO - __main__ - Step 65498: {'lr': 0.0003052761464941045, 'samples': 12575616, 'steps': 65497, 'loss/train': 1.4690442085266113} 11/07/2021 06:27:23 - INFO - __main__ - Step 65499: {'lr': 0.00030527097107868395, 'samples': 12575808, 'steps': 65498, 'loss/train': 1.1190063953399658} 11/07/2021 06:27:23 - INFO - __main__ - Step 65500: {'lr': 0.00030526579563835916, 'samples': 12576000, 'steps': 65499, 'loss/train': 1.248207926750183} 11/07/2021 06:27:23 - INFO - __main__ - Step 65501: {'lr': 0.0003052606201731325, 'samples': 12576192, 'steps': 65500, 'loss/train': 1.166585922241211} 11/07/2021 06:27:24 - INFO - __main__ - Step 65502: {'lr': 0.0003052554446830062, 'samples': 12576384, 'steps': 65501, 'loss/train': 1.452914834022522} 11/07/2021 06:27:25 - INFO - __main__ - Step 65503: {'lr': 0.00030525026916798263, 'samples': 12576576, 'steps': 65502, 'loss/train': 1.2384341955184937} 11/07/2021 06:27:25 - INFO - __main__ - Step 65504: {'lr': 0.00030524509362806423, 'samples': 12576768, 'steps': 65503, 'loss/train': 1.730370283126831} 11/07/2021 06:27:25 - INFO - __main__ - Step 65505: {'lr': 0.00030523991806325325, 'samples': 12576960, 'steps': 65504, 'loss/train': 1.5426692962646484} 11/07/2021 06:27:26 - INFO - __main__ - Step 65506: {'lr': 0.0003052347424735519, 'samples': 12577152, 'steps': 65505, 'loss/train': 1.1872366666793823} 11/07/2021 06:27:26 - INFO - __main__ - Step 65507: {'lr': 0.00030522956685896267, 'samples': 12577344, 'steps': 65506, 'loss/train': 0.8867643475532532} 11/07/2021 06:27:27 - INFO - __main__ - Step 65508: {'lr': 0.0003052243912194879, 'samples': 12577536, 'steps': 65507, 'loss/train': 1.249651312828064} 11/07/2021 06:27:28 - INFO - __main__ - Step 65509: {'lr': 0.0003052192155551298, 'samples': 12577728, 'steps': 65508, 'loss/train': 1.5658690929412842} 11/07/2021 06:27:28 - INFO - __main__ - Step 65510: {'lr': 0.00030521403986589086, 'samples': 12577920, 'steps': 65509, 'loss/train': 1.430597186088562} 11/07/2021 06:27:28 - INFO - __main__ - Step 65511: {'lr': 0.0003052088641517733, 'samples': 12578112, 'steps': 65510, 'loss/train': 1.484337329864502} 11/07/2021 06:27:29 - INFO - __main__ - Step 65512: {'lr': 0.00030520368841277946, 'samples': 12578304, 'steps': 65511, 'loss/train': 1.126431941986084} 11/07/2021 06:27:29 - INFO - __main__ - Step 65513: {'lr': 0.00030519851264891167, 'samples': 12578496, 'steps': 65512, 'loss/train': 0.7217588424682617} 11/07/2021 06:27:30 - INFO - __main__ - Step 65514: {'lr': 0.0003051933368601723, 'samples': 12578688, 'steps': 65513, 'loss/train': 1.366166591644287} 11/07/2021 06:27:30 - INFO - __main__ - Step 65515: {'lr': 0.00030518816104656364, 'samples': 12578880, 'steps': 65514, 'loss/train': 1.13165283203125} 11/07/2021 06:27:31 - INFO - __main__ - Step 65516: {'lr': 0.00030518298520808805, 'samples': 12579072, 'steps': 65515, 'loss/train': 0.7698614597320557} 11/07/2021 06:27:31 - INFO - __main__ - Step 65517: {'lr': 0.0003051778093447479, 'samples': 12579264, 'steps': 65516, 'loss/train': 1.490037441253662} 11/07/2021 06:27:31 - INFO - __main__ - Step 65518: {'lr': 0.0003051726334565455, 'samples': 12579456, 'steps': 65517, 'loss/train': 1.8840373754501343} 11/07/2021 06:27:32 - INFO - __main__ - Step 65519: {'lr': 0.00030516745754348315, 'samples': 12579648, 'steps': 65518, 'loss/train': 1.6560660600662231} 11/07/2021 06:27:33 - INFO - __main__ - Step 65520: {'lr': 0.00030516228160556313, 'samples': 12579840, 'steps': 65519, 'loss/train': 1.7389663457870483} 11/07/2021 06:27:33 - INFO - __main__ - Step 65521: {'lr': 0.0003051571056427879, 'samples': 12580032, 'steps': 65520, 'loss/train': 1.559924840927124} 11/07/2021 06:27:33 - INFO - __main__ - Step 65522: {'lr': 0.0003051519296551597, 'samples': 12580224, 'steps': 65521, 'loss/train': 1.2276692390441895} 11/07/2021 06:27:34 - INFO - __main__ - Step 65523: {'lr': 0.0003051467536426809, 'samples': 12580416, 'steps': 65522, 'loss/train': 1.2414748668670654} 11/07/2021 06:27:35 - INFO - __main__ - Step 65524: {'lr': 0.0003051415776053538, 'samples': 12580608, 'steps': 65523, 'loss/train': 1.38613760471344} 11/07/2021 06:27:35 - INFO - __main__ - Step 65525: {'lr': 0.00030513640154318077, 'samples': 12580800, 'steps': 65524, 'loss/train': 1.1205756664276123} 11/07/2021 06:27:35 - INFO - __main__ - Step 65526: {'lr': 0.00030513122545616414, 'samples': 12580992, 'steps': 65525, 'loss/train': 1.233292579650879} 11/07/2021 06:27:36 - INFO - __main__ - Step 65527: {'lr': 0.0003051260493443062, 'samples': 12581184, 'steps': 65526, 'loss/train': 1.7069995403289795} 11/07/2021 06:27:36 - INFO - __main__ - Step 65528: {'lr': 0.00030512087320760933, 'samples': 12581376, 'steps': 65527, 'loss/train': 1.213829755783081} 11/07/2021 06:27:37 - INFO - __main__ - Step 65529: {'lr': 0.00030511569704607587, 'samples': 12581568, 'steps': 65528, 'loss/train': 1.3195809125900269} 11/07/2021 06:27:38 - INFO - __main__ - Step 65530: {'lr': 0.0003051105208597081, 'samples': 12581760, 'steps': 65529, 'loss/train': 1.162340760231018} 11/07/2021 06:27:38 - INFO - __main__ - Step 65531: {'lr': 0.0003051053446485084, 'samples': 12581952, 'steps': 65530, 'loss/train': 1.3744709491729736} 11/07/2021 06:27:38 - INFO - __main__ - Step 65532: {'lr': 0.0003051001684124791, 'samples': 12582144, 'steps': 65531, 'loss/train': 1.1749736070632935} 11/07/2021 06:27:39 - INFO - __main__ - Step 65533: {'lr': 0.00030509499215162247, 'samples': 12582336, 'steps': 65532, 'loss/train': 1.6029016971588135} 11/07/2021 06:27:40 - INFO - __main__ - Step 65534: {'lr': 0.0003050898158659409, 'samples': 12582528, 'steps': 65533, 'loss/train': 1.4045921564102173} 11/07/2021 06:27:40 - INFO - __main__ - Step 65535: {'lr': 0.00030508463955543667, 'samples': 12582720, 'steps': 65534, 'loss/train': 1.4536796808242798} 11/07/2021 06:27:41 - INFO - __main__ - Step 65536: {'lr': 0.0003050794632201122, 'samples': 12582912, 'steps': 65535, 'loss/train': 1.434549331665039} 11/07/2021 06:27:41 - INFO - __main__ - Step 65537: {'lr': 0.0003050742868599698, 'samples': 12583104, 'steps': 65536, 'loss/train': 1.3568854331970215} 11/07/2021 06:27:41 - INFO - __main__ - Step 65538: {'lr': 0.0003050691104750117, 'samples': 12583296, 'steps': 65537, 'loss/train': 1.749549150466919} 11/07/2021 06:27:43 - INFO - __main__ - Step 65539: {'lr': 0.0003050639340652404, 'samples': 12583488, 'steps': 65538, 'loss/train': 0.09941280633211136} 11/07/2021 06:27:43 - INFO - __main__ - Step 65540: {'lr': 0.0003050587576306581, 'samples': 12583680, 'steps': 65539, 'loss/train': 1.1594775915145874} 11/07/2021 06:27:43 - INFO - __main__ - Step 65541: {'lr': 0.00030505358117126715, 'samples': 12583872, 'steps': 65540, 'loss/train': 1.1687350273132324} 11/07/2021 06:27:44 - INFO - __main__ - Step 65542: {'lr': 0.0003050484046870699, 'samples': 12584064, 'steps': 65541, 'loss/train': 0.0760815367102623} 11/07/2021 06:27:44 - INFO - __main__ - Step 65543: {'lr': 0.00030504322817806874, 'samples': 12584256, 'steps': 65542, 'loss/train': 1.592501163482666} 11/07/2021 06:27:44 - INFO - __main__ - Step 65544: {'lr': 0.0003050380516442659, 'samples': 12584448, 'steps': 65543, 'loss/train': 1.7572723627090454} 11/07/2021 06:27:45 - INFO - __main__ - Step 65545: {'lr': 0.0003050328750856638, 'samples': 12584640, 'steps': 65544, 'loss/train': 1.4844245910644531} 11/07/2021 06:27:46 - INFO - __main__ - Step 65546: {'lr': 0.00030502769850226474, 'samples': 12584832, 'steps': 65545, 'loss/train': 1.137190818786621} 11/07/2021 06:27:46 - INFO - __main__ - Step 65547: {'lr': 0.000305022521894071, 'samples': 12585024, 'steps': 65546, 'loss/train': 1.3138365745544434} 11/07/2021 06:27:46 - INFO - __main__ - Step 65548: {'lr': 0.000305017345261085, 'samples': 12585216, 'steps': 65547, 'loss/train': 1.0521143674850464} 11/07/2021 06:27:47 - INFO - __main__ - Step 65549: {'lr': 0.000305012168603309, 'samples': 12585408, 'steps': 65548, 'loss/train': 0.9366679191589355} 11/07/2021 06:27:48 - INFO - __main__ - Step 65550: {'lr': 0.0003050069919207454, 'samples': 12585600, 'steps': 65549, 'loss/train': 1.593947410583496} 11/07/2021 06:27:48 - INFO - __main__ - Step 65551: {'lr': 0.00030500181521339646, 'samples': 12585792, 'steps': 65550, 'loss/train': 1.4851933717727661} 11/07/2021 06:27:49 - INFO - __main__ - Step 65552: {'lr': 0.00030499663848126464, 'samples': 12585984, 'steps': 65551, 'loss/train': 0.2746724784374237} 11/07/2021 06:27:49 - INFO - __main__ - Step 65553: {'lr': 0.0003049914617243521, 'samples': 12586176, 'steps': 65552, 'loss/train': 1.520794153213501} 11/07/2021 06:27:49 - INFO - __main__ - Step 65554: {'lr': 0.0003049862849426613, 'samples': 12586368, 'steps': 65553, 'loss/train': 0.9803227782249451} 11/07/2021 06:27:51 - INFO - __main__ - Step 65555: {'lr': 0.00030498110813619446, 'samples': 12586560, 'steps': 65554, 'loss/train': 1.903782844543457} 11/07/2021 06:27:51 - INFO - __main__ - Step 65556: {'lr': 0.000304975931304954, 'samples': 12586752, 'steps': 65555, 'loss/train': 1.65304696559906} 11/07/2021 06:27:51 - INFO - __main__ - Step 65557: {'lr': 0.0003049707544489423, 'samples': 12586944, 'steps': 65556, 'loss/train': 1.4051228761672974} 11/07/2021 06:27:52 - INFO - __main__ - Step 65558: {'lr': 0.0003049655775681616, 'samples': 12587136, 'steps': 65557, 'loss/train': 0.5467556118965149} 11/07/2021 06:27:52 - INFO - __main__ - Step 65559: {'lr': 0.0003049604006626142, 'samples': 12587328, 'steps': 65558, 'loss/train': 1.3950742483139038} 11/07/2021 06:27:53 - INFO - __main__ - Step 65560: {'lr': 0.0003049552237323026, 'samples': 12587520, 'steps': 65559, 'loss/train': 1.0854837894439697} 11/07/2021 06:27:53 - INFO - __main__ - Step 65561: {'lr': 0.0003049500467772289, 'samples': 12587712, 'steps': 65560, 'loss/train': 0.10237952321767807} 11/07/2021 06:27:54 - INFO - __main__ - Step 65562: {'lr': 0.0003049448697973956, 'samples': 12587904, 'steps': 65561, 'loss/train': 0.8399044275283813} 11/07/2021 06:27:54 - INFO - __main__ - Step 65563: {'lr': 0.00030493969279280506, 'samples': 12588096, 'steps': 65562, 'loss/train': 0.0728563442826271} 11/07/2021 06:27:54 - INFO - __main__ - Step 65564: {'lr': 0.0003049345157634594, 'samples': 12588288, 'steps': 65563, 'loss/train': 1.733435869216919} 11/07/2021 06:27:55 - INFO - __main__ - Step 65565: {'lr': 0.0003049293387093613, 'samples': 12588480, 'steps': 65564, 'loss/train': 0.9054592847824097} 11/07/2021 06:27:56 - INFO - __main__ - Step 65566: {'lr': 0.0003049241616305127, 'samples': 12588672, 'steps': 65565, 'loss/train': 1.3979660272598267} 11/07/2021 06:27:56 - INFO - __main__ - Step 65567: {'lr': 0.00030491898452691626, 'samples': 12588864, 'steps': 65566, 'loss/train': 0.03952913358807564} 11/07/2021 06:27:56 - INFO - __main__ - Step 65568: {'lr': 0.000304913807398574, 'samples': 12589056, 'steps': 65567, 'loss/train': 0.7711038589477539} 11/07/2021 06:27:57 - INFO - __main__ - Step 65569: {'lr': 0.0003049086302454886, 'samples': 12589248, 'steps': 65568, 'loss/train': 1.2664644718170166} 11/07/2021 06:27:58 - INFO - __main__ - Step 65570: {'lr': 0.0003049034530676621, 'samples': 12589440, 'steps': 65569, 'loss/train': 1.8325954675674438} 11/07/2021 06:27:58 - INFO - __main__ - Step 65571: {'lr': 0.000304898275865097, 'samples': 12589632, 'steps': 65570, 'loss/train': 0.06120570749044418} 11/07/2021 06:27:59 - INFO - __main__ - Step 65572: {'lr': 0.0003048930986377956, 'samples': 12589824, 'steps': 65571, 'loss/train': 1.4061169624328613} 11/07/2021 06:27:59 - INFO - __main__ - Step 65573: {'lr': 0.0003048879213857602, 'samples': 12590016, 'steps': 65572, 'loss/train': 1.2108420133590698} 11/07/2021 06:27:59 - INFO - __main__ - Step 65574: {'lr': 0.0003048827441089932, 'samples': 12590208, 'steps': 65573, 'loss/train': 1.933737874031067} 11/07/2021 06:28:00 - INFO - __main__ - Step 65575: {'lr': 0.0003048775668074968, 'samples': 12590400, 'steps': 65574, 'loss/train': 1.2875927686691284} 11/07/2021 06:28:01 - INFO - __main__ - Step 65576: {'lr': 0.00030487238948127344, 'samples': 12590592, 'steps': 65575, 'loss/train': 0.9589722752571106} 11/07/2021 06:28:01 - INFO - __main__ - Step 65577: {'lr': 0.0003048672121303254, 'samples': 12590784, 'steps': 65576, 'loss/train': 1.323085904121399} 11/07/2021 06:28:01 - INFO - __main__ - Step 65578: {'lr': 0.00030486203475465514, 'samples': 12590976, 'steps': 65577, 'loss/train': 1.2950907945632935} 11/07/2021 06:28:02 - INFO - __main__ - Step 65579: {'lr': 0.00030485685735426484, 'samples': 12591168, 'steps': 65578, 'loss/train': 1.4272518157958984} 11/07/2021 06:28:03 - INFO - __main__ - Step 65580: {'lr': 0.00030485167992915684, 'samples': 12591360, 'steps': 65579, 'loss/train': 1.1242353916168213} 11/07/2021 06:28:03 - INFO - __main__ - Step 65581: {'lr': 0.00030484650247933353, 'samples': 12591552, 'steps': 65580, 'loss/train': 1.1682605743408203} 11/07/2021 06:28:04 - INFO - __main__ - Step 65582: {'lr': 0.0003048413250047973, 'samples': 12591744, 'steps': 65581, 'loss/train': 1.2567076683044434} 11/07/2021 06:28:04 - INFO - __main__ - Step 65583: {'lr': 0.0003048361475055503, 'samples': 12591936, 'steps': 65582, 'loss/train': 1.4060285091400146} 11/07/2021 06:28:04 - INFO - __main__ - Step 65584: {'lr': 0.0003048309699815951, 'samples': 12592128, 'steps': 65583, 'loss/train': 1.5101253986358643} 11/07/2021 06:28:05 - INFO - __main__ - Step 65585: {'lr': 0.0003048257924329339, 'samples': 12592320, 'steps': 65584, 'loss/train': 1.825730800628662} 11/07/2021 06:28:06 - INFO - __main__ - Step 65586: {'lr': 0.00030482061485956905, 'samples': 12592512, 'steps': 65585, 'loss/train': 1.4834364652633667} 11/07/2021 06:28:06 - INFO - __main__ - Step 65587: {'lr': 0.0003048154372615028, 'samples': 12592704, 'steps': 65586, 'loss/train': 1.508509874343872} 11/07/2021 06:28:07 - INFO - __main__ - Step 65588: {'lr': 0.0003048102596387375, 'samples': 12592896, 'steps': 65587, 'loss/train': 1.5619813203811646} 11/07/2021 06:28:07 - INFO - __main__ - Step 65589: {'lr': 0.0003048050819912757, 'samples': 12593088, 'steps': 65588, 'loss/train': 1.4431226253509521} 11/07/2021 06:28:07 - INFO - __main__ - Step 65590: {'lr': 0.0003047999043191195, 'samples': 12593280, 'steps': 65589, 'loss/train': 1.2195205688476562} 11/07/2021 06:28:08 - INFO - __main__ - Step 65591: {'lr': 0.0003047947266222713, 'samples': 12593472, 'steps': 65590, 'loss/train': 1.561431646347046} 11/07/2021 06:28:09 - INFO - __main__ - Step 65592: {'lr': 0.00030478954890073354, 'samples': 12593664, 'steps': 65591, 'loss/train': 1.428887963294983} 11/07/2021 06:28:09 - INFO - __main__ - Step 65593: {'lr': 0.00030478437115450833, 'samples': 12593856, 'steps': 65592, 'loss/train': 1.6858370304107666} 11/07/2021 06:28:09 - INFO - __main__ - Step 65594: {'lr': 0.0003047791933835982, 'samples': 12594048, 'steps': 65593, 'loss/train': 1.5664355754852295} 11/07/2021 06:28:10 - INFO - __main__ - Step 65595: {'lr': 0.0003047740155880054, 'samples': 12594240, 'steps': 65594, 'loss/train': 1.728022575378418} 11/07/2021 06:28:11 - INFO - __main__ - Step 65596: {'lr': 0.0003047688377677322, 'samples': 12594432, 'steps': 65595, 'loss/train': 1.8372187614440918} 11/07/2021 06:28:11 - INFO - __main__ - Step 65597: {'lr': 0.0003047636599227811, 'samples': 12594624, 'steps': 65596, 'loss/train': 1.389586329460144} 11/07/2021 06:28:12 - INFO - __main__ - Step 65598: {'lr': 0.0003047584820531543, 'samples': 12594816, 'steps': 65597, 'loss/train': 0.9858003258705139} 11/07/2021 06:28:12 - INFO - __main__ - Step 65599: {'lr': 0.0003047533041588542, 'samples': 12595008, 'steps': 65598, 'loss/train': 0.8387219905853271} 11/07/2021 06:28:12 - INFO - __main__ - Step 65600: {'lr': 0.00030474812623988305, 'samples': 12595200, 'steps': 65599, 'loss/train': 1.8356655836105347} 11/07/2021 06:28:13 - INFO - __main__ - Step 65601: {'lr': 0.0003047429482962433, 'samples': 12595392, 'steps': 65600, 'loss/train': 1.553713321685791} 11/07/2021 06:28:14 - INFO - __main__ - Step 65602: {'lr': 0.0003047377703279372, 'samples': 12595584, 'steps': 65601, 'loss/train': 0.9021466970443726} 11/07/2021 06:28:14 - INFO - __main__ - Step 65603: {'lr': 0.0003047325923349671, 'samples': 12595776, 'steps': 65602, 'loss/train': 1.2048827409744263} 11/07/2021 06:28:14 - INFO - __main__ - Step 65604: {'lr': 0.00030472741431733535, 'samples': 12595968, 'steps': 65603, 'loss/train': 1.2595187425613403} 11/07/2021 06:28:15 - INFO - __main__ - Step 65605: {'lr': 0.00030472223627504424, 'samples': 12596160, 'steps': 65604, 'loss/train': 1.3759377002716064} 11/07/2021 06:28:15 - INFO - __main__ - Step 65606: {'lr': 0.0003047170582080962, 'samples': 12596352, 'steps': 65605, 'loss/train': 1.4050021171569824} 11/07/2021 06:28:16 - INFO - __main__ - Step 65607: {'lr': 0.0003047118801164934, 'samples': 12596544, 'steps': 65606, 'loss/train': 1.6933757066726685} 11/07/2021 06:28:17 - INFO - __main__ - Step 65608: {'lr': 0.00030470670200023834, 'samples': 12596736, 'steps': 65607, 'loss/train': 1.6173207759857178} 11/07/2021 06:28:17 - INFO - __main__ - Step 65609: {'lr': 0.0003047015238593333, 'samples': 12596928, 'steps': 65608, 'loss/train': 0.7659761309623718} 11/07/2021 06:28:17 - INFO - __main__ - Step 65610: {'lr': 0.0003046963456937806, 'samples': 12597120, 'steps': 65609, 'loss/train': 1.240500569343567} 11/07/2021 06:28:18 - INFO - __main__ - Step 65611: {'lr': 0.0003046911675035825, 'samples': 12597312, 'steps': 65610, 'loss/train': 1.3220716714859009} 11/07/2021 06:28:19 - INFO - __main__ - Step 65612: {'lr': 0.0003046859892887415, 'samples': 12597504, 'steps': 65611, 'loss/train': 1.288973331451416} 11/07/2021 06:28:19 - INFO - __main__ - Step 65613: {'lr': 0.0003046808110492597, 'samples': 12597696, 'steps': 65612, 'loss/train': 1.5127731561660767} 11/07/2021 06:28:20 - INFO - __main__ - Step 65614: {'lr': 0.0003046756327851397, 'samples': 12597888, 'steps': 65613, 'loss/train': 0.6069688200950623} 11/07/2021 06:28:20 - INFO - __main__ - Step 65615: {'lr': 0.00030467045449638367, 'samples': 12598080, 'steps': 65614, 'loss/train': 1.2952351570129395} 11/07/2021 06:28:20 - INFO - __main__ - Step 65616: {'lr': 0.0003046652761829939, 'samples': 12598272, 'steps': 65615, 'loss/train': 0.9536972045898438} 11/07/2021 06:28:21 - INFO - __main__ - Step 65617: {'lr': 0.0003046600978449729, 'samples': 12598464, 'steps': 65616, 'loss/train': 1.6187760829925537} 11/07/2021 06:28:22 - INFO - __main__ - Step 65618: {'lr': 0.0003046549194823228, 'samples': 12598656, 'steps': 65617, 'loss/train': 1.1827566623687744} 11/07/2021 06:28:22 - INFO - __main__ - Step 65619: {'lr': 0.0003046497410950461, 'samples': 12598848, 'steps': 65618, 'loss/train': 2.5074522495269775} 11/07/2021 06:28:22 - INFO - __main__ - Step 65620: {'lr': 0.00030464456268314516, 'samples': 12599040, 'steps': 65619, 'loss/train': 1.1571707725524902} 11/07/2021 06:28:23 - INFO - __main__ - Step 65621: {'lr': 0.00030463938424662215, 'samples': 12599232, 'steps': 65620, 'loss/train': 0.9855992197990417} 11/07/2021 06:28:24 - INFO - __main__ - Step 65622: {'lr': 0.0003046342057854794, 'samples': 12599424, 'steps': 65621, 'loss/train': 0.7627511024475098} 11/07/2021 06:28:24 - INFO - __main__ - Step 65623: {'lr': 0.0003046290272997194, 'samples': 12599616, 'steps': 65622, 'loss/train': 1.0113838911056519} 11/07/2021 06:28:25 - INFO - __main__ - Step 65624: {'lr': 0.0003046238487893443, 'samples': 12599808, 'steps': 65623, 'loss/train': 1.6245102882385254} 11/07/2021 06:28:25 - INFO - __main__ - Step 65625: {'lr': 0.00030461867025435667, 'samples': 12600000, 'steps': 65624, 'loss/train': 1.5250658988952637} 11/07/2021 06:28:25 - INFO - __main__ - Step 65626: {'lr': 0.0003046134916947587, 'samples': 12600192, 'steps': 65625, 'loss/train': 0.9535965919494629} 11/07/2021 06:28:26 - INFO - __main__ - Step 65627: {'lr': 0.0003046083131105527, 'samples': 12600384, 'steps': 65626, 'loss/train': 1.3669618368148804} 11/07/2021 06:28:27 - INFO - __main__ - Step 65628: {'lr': 0.00030460313450174104, 'samples': 12600576, 'steps': 65627, 'loss/train': 1.1695663928985596} 11/07/2021 06:28:27 - INFO - __main__ - Step 65629: {'lr': 0.000304597955868326, 'samples': 12600768, 'steps': 65628, 'loss/train': 1.0336097478866577} 11/07/2021 06:28:28 - INFO - __main__ - Step 65630: {'lr': 0.00030459277721031, 'samples': 12600960, 'steps': 65629, 'loss/train': 0.27926093339920044} 11/07/2021 06:28:28 - INFO - __main__ - Step 65631: {'lr': 0.00030458759852769533, 'samples': 12601152, 'steps': 65630, 'loss/train': 1.509860873222351} 11/07/2021 06:28:28 - INFO - __main__ - Step 65632: {'lr': 0.0003045824198204844, 'samples': 12601344, 'steps': 65631, 'loss/train': 0.7387316823005676} 11/07/2021 06:28:29 - INFO - __main__ - Step 65633: {'lr': 0.0003045772410886794, 'samples': 12601536, 'steps': 65632, 'loss/train': 1.6349905729293823} 11/07/2021 06:28:30 - INFO - __main__ - Step 65634: {'lr': 0.00030457206233228275, 'samples': 12601728, 'steps': 65633, 'loss/train': 1.1224538087844849} 11/07/2021 06:28:30 - INFO - __main__ - Step 65635: {'lr': 0.0003045668835512967, 'samples': 12601920, 'steps': 65634, 'loss/train': 1.4509427547454834} 11/07/2021 06:28:30 - INFO - __main__ - Step 65636: {'lr': 0.0003045617047457238, 'samples': 12602112, 'steps': 65635, 'loss/train': 1.3628233671188354} 11/07/2021 06:28:31 - INFO - __main__ - Step 65637: {'lr': 0.00030455652591556613, 'samples': 12602304, 'steps': 65636, 'loss/train': 1.5381624698638916} 11/07/2021 06:28:32 - INFO - __main__ - Step 65638: {'lr': 0.00030455134706082617, 'samples': 12602496, 'steps': 65637, 'loss/train': 2.09863018989563} 11/07/2021 06:28:32 - INFO - __main__ - Step 65639: {'lr': 0.00030454616818150626, 'samples': 12602688, 'steps': 65638, 'loss/train': 0.9482951164245605} 11/07/2021 06:28:33 - INFO - __main__ - Step 65640: {'lr': 0.0003045409892776086, 'samples': 12602880, 'steps': 65639, 'loss/train': 1.2770614624023438} 11/07/2021 06:28:33 - INFO - __main__ - Step 65641: {'lr': 0.0003045358103491357, 'samples': 12603072, 'steps': 65640, 'loss/train': 1.2394671440124512} 11/07/2021 06:28:33 - INFO - __main__ - Step 65642: {'lr': 0.0003045306313960897, 'samples': 12603264, 'steps': 65641, 'loss/train': 1.4781864881515503} 11/07/2021 06:28:34 - INFO - __main__ - Step 65643: {'lr': 0.0003045254524184731, 'samples': 12603456, 'steps': 65642, 'loss/train': 1.2860795259475708} 11/07/2021 06:28:35 - INFO - __main__ - Step 65644: {'lr': 0.00030452027341628816, 'samples': 12603648, 'steps': 65643, 'loss/train': 1.24008309841156} 11/07/2021 06:28:35 - INFO - __main__ - Step 65645: {'lr': 0.00030451509438953725, 'samples': 12603840, 'steps': 65644, 'loss/train': 1.1488949060440063} 11/07/2021 06:28:35 - INFO - __main__ - Step 65646: {'lr': 0.0003045099153382227, 'samples': 12604032, 'steps': 65645, 'loss/train': 0.0700165405869484} 11/07/2021 06:28:36 - INFO - __main__ - Step 65647: {'lr': 0.00030450473626234675, 'samples': 12604224, 'steps': 65646, 'loss/train': 1.419309377670288} 11/07/2021 06:28:37 - INFO - __main__ - Step 65648: {'lr': 0.00030449955716191184, 'samples': 12604416, 'steps': 65647, 'loss/train': 1.3248001337051392} 11/07/2021 06:28:37 - INFO - __main__ - Step 65649: {'lr': 0.00030449437803692033, 'samples': 12604608, 'steps': 65648, 'loss/train': 0.9109021425247192} 11/07/2021 06:28:37 - INFO - __main__ - Step 65650: {'lr': 0.0003044891988873744, 'samples': 12604800, 'steps': 65649, 'loss/train': 1.1158266067504883} 11/07/2021 06:28:38 - INFO - __main__ - Step 65651: {'lr': 0.00030448401971327647, 'samples': 12604992, 'steps': 65650, 'loss/train': 1.8320544958114624} 11/07/2021 06:28:38 - INFO - __main__ - Step 65652: {'lr': 0.000304478840514629, 'samples': 12605184, 'steps': 65651, 'loss/train': 1.6635764837265015} 11/07/2021 06:28:39 - INFO - __main__ - Step 65653: {'lr': 0.00030447366129143414, 'samples': 12605376, 'steps': 65652, 'loss/train': 1.9885799884796143} 11/07/2021 06:28:40 - INFO - __main__ - Step 65654: {'lr': 0.00030446848204369425, 'samples': 12605568, 'steps': 65653, 'loss/train': 1.4816867113113403} 11/07/2021 06:28:40 - INFO - __main__ - Step 65655: {'lr': 0.00030446330277141177, 'samples': 12605760, 'steps': 65654, 'loss/train': 1.4055112600326538} 11/07/2021 06:28:40 - INFO - __main__ - Step 65656: {'lr': 0.0003044581234745889, 'samples': 12605952, 'steps': 65655, 'loss/train': 1.8317155838012695} 11/07/2021 06:28:41 - INFO - __main__ - Step 65657: {'lr': 0.00030445294415322807, 'samples': 12606144, 'steps': 65656, 'loss/train': 1.4625667333602905} 11/07/2021 06:28:42 - INFO - __main__ - Step 65658: {'lr': 0.00030444776480733157, 'samples': 12606336, 'steps': 65657, 'loss/train': 1.0272966623306274} 11/07/2021 06:28:42 - INFO - __main__ - Step 65659: {'lr': 0.0003044425854369018, 'samples': 12606528, 'steps': 65658, 'loss/train': 1.2787208557128906} 11/07/2021 06:28:43 - INFO - __main__ - Step 65660: {'lr': 0.00030443740604194097, 'samples': 12606720, 'steps': 65659, 'loss/train': 1.3842177391052246} 11/07/2021 06:28:43 - INFO - __main__ - Step 65661: {'lr': 0.00030443222662245153, 'samples': 12606912, 'steps': 65660, 'loss/train': 1.345412254333496} 11/07/2021 06:28:43 - INFO - __main__ - Step 65662: {'lr': 0.00030442704717843576, 'samples': 12607104, 'steps': 65661, 'loss/train': 0.11614946275949478} 11/07/2021 06:28:44 - INFO - __main__ - Step 65663: {'lr': 0.000304421867709896, 'samples': 12607296, 'steps': 65662, 'loss/train': 1.3791049718856812} 11/07/2021 06:28:45 - INFO - __main__ - Step 65664: {'lr': 0.00030441668821683455, 'samples': 12607488, 'steps': 65663, 'loss/train': 1.7203598022460938} 11/07/2021 06:28:45 - INFO - __main__ - Step 65665: {'lr': 0.0003044115086992538, 'samples': 12607680, 'steps': 65664, 'loss/train': 1.5867562294006348} 11/07/2021 06:28:45 - INFO - __main__ - Step 65666: {'lr': 0.00030440632915715613, 'samples': 12607872, 'steps': 65665, 'loss/train': 1.3691120147705078} 11/07/2021 06:28:46 - INFO - __main__ - Step 65667: {'lr': 0.00030440114959054377, 'samples': 12608064, 'steps': 65666, 'loss/train': 1.8424015045166016} 11/07/2021 06:28:47 - INFO - __main__ - Step 65668: {'lr': 0.00030439596999941906, 'samples': 12608256, 'steps': 65667, 'loss/train': 1.3747832775115967} 11/07/2021 06:28:47 - INFO - __main__ - Step 65669: {'lr': 0.0003043907903837844, 'samples': 12608448, 'steps': 65668, 'loss/train': 1.2567352056503296} 11/07/2021 06:28:48 - INFO - __main__ - Step 65670: {'lr': 0.00030438561074364203, 'samples': 12608640, 'steps': 65669, 'loss/train': 1.461404800415039} 11/07/2021 06:28:48 - INFO - __main__ - Step 65671: {'lr': 0.00030438043107899437, 'samples': 12608832, 'steps': 65670, 'loss/train': 1.429835319519043} 11/07/2021 06:28:48 - INFO - __main__ - Step 65672: {'lr': 0.00030437525138984374, 'samples': 12609024, 'steps': 65671, 'loss/train': 1.302844762802124} 11/07/2021 06:28:49 - INFO - __main__ - Step 65673: {'lr': 0.00030437007167619253, 'samples': 12609216, 'steps': 65672, 'loss/train': 1.1681697368621826} 11/07/2021 06:28:50 - INFO - __main__ - Step 65674: {'lr': 0.00030436489193804296, 'samples': 12609408, 'steps': 65673, 'loss/train': 2.309459686279297} 11/07/2021 06:28:50 - INFO - __main__ - Step 65675: {'lr': 0.00030435971217539735, 'samples': 12609600, 'steps': 65674, 'loss/train': 0.9715363383293152} 11/07/2021 06:28:51 - INFO - __main__ - Step 65676: {'lr': 0.0003043545323882581, 'samples': 12609792, 'steps': 65675, 'loss/train': 0.6866092681884766} 11/07/2021 06:28:51 - INFO - __main__ - Step 65677: {'lr': 0.00030434935257662754, 'samples': 12609984, 'steps': 65676, 'loss/train': 1.3769540786743164} 11/07/2021 06:28:51 - INFO - __main__ - Step 65678: {'lr': 0.00030434417274050805, 'samples': 12610176, 'steps': 65677, 'loss/train': 1.5892438888549805} 11/07/2021 06:28:52 - INFO - __main__ - Step 65679: {'lr': 0.00030433899287990197, 'samples': 12610368, 'steps': 65678, 'loss/train': 1.4642549753189087} 11/07/2021 06:28:53 - INFO - __main__ - Step 65680: {'lr': 0.00030433381299481145, 'samples': 12610560, 'steps': 65679, 'loss/train': 1.73553466796875} 11/07/2021 06:28:53 - INFO - __main__ - Step 65681: {'lr': 0.000304328633085239, 'samples': 12610752, 'steps': 65680, 'loss/train': 1.5190372467041016} 11/07/2021 06:28:53 - INFO - __main__ - Step 65682: {'lr': 0.00030432345315118694, 'samples': 12610944, 'steps': 65681, 'loss/train': 1.4882855415344238} 11/07/2021 06:28:54 - INFO - __main__ - Step 65683: {'lr': 0.0003043182731926575, 'samples': 12611136, 'steps': 65682, 'loss/train': 0.7382122874259949} 11/07/2021 06:28:55 - INFO - __main__ - Step 65684: {'lr': 0.0003043130932096531, 'samples': 12611328, 'steps': 65683, 'loss/train': 0.931722104549408} 11/07/2021 06:28:55 - INFO - __main__ - Step 65685: {'lr': 0.0003043079132021761, 'samples': 12611520, 'steps': 65684, 'loss/train': 1.4197587966918945} 11/07/2021 06:28:56 - INFO - __main__ - Step 65686: {'lr': 0.0003043027331702288, 'samples': 12611712, 'steps': 65685, 'loss/train': 1.3056069612503052} 11/07/2021 06:28:56 - INFO - __main__ - Step 65687: {'lr': 0.00030429755311381346, 'samples': 12611904, 'steps': 65686, 'loss/train': 1.4685925245285034} 11/07/2021 06:28:56 - INFO - __main__ - Step 65688: {'lr': 0.00030429237303293257, 'samples': 12612096, 'steps': 65687, 'loss/train': 1.253088116645813} 11/07/2021 06:28:57 - INFO - __main__ - Step 65689: {'lr': 0.0003042871929275883, 'samples': 12612288, 'steps': 65688, 'loss/train': 1.102798581123352} 11/07/2021 06:28:58 - INFO - __main__ - Step 65690: {'lr': 0.0003042820127977831, 'samples': 12612480, 'steps': 65689, 'loss/train': 1.5659631490707397} 11/07/2021 06:28:58 - INFO - __main__ - Step 65691: {'lr': 0.0003042768326435192, 'samples': 12612672, 'steps': 65690, 'loss/train': 1.5314563512802124} 11/07/2021 06:28:58 - INFO - __main__ - Step 65692: {'lr': 0.00030427165246479904, 'samples': 12612864, 'steps': 65691, 'loss/train': 0.5198411345481873} 11/07/2021 06:28:59 - INFO - __main__ - Step 65693: {'lr': 0.00030426647226162497, 'samples': 12613056, 'steps': 65692, 'loss/train': 1.5474625825881958} 11/07/2021 06:29:00 - INFO - __main__ - Step 65694: {'lr': 0.00030426129203399915, 'samples': 12613248, 'steps': 65693, 'loss/train': 1.3653181791305542} 11/07/2021 06:29:00 - INFO - __main__ - Step 65695: {'lr': 0.0003042561117819241, 'samples': 12613440, 'steps': 65694, 'loss/train': 0.8984522223472595} 11/07/2021 06:29:00 - INFO - __main__ - Step 65696: {'lr': 0.00030425093150540205, 'samples': 12613632, 'steps': 65695, 'loss/train': 1.2004122734069824} 11/07/2021 06:29:01 - INFO - __main__ - Step 65697: {'lr': 0.0003042457512044354, 'samples': 12613824, 'steps': 65696, 'loss/train': 0.6820573210716248} 11/07/2021 06:29:01 - INFO - __main__ - Step 65698: {'lr': 0.0003042405708790264, 'samples': 12614016, 'steps': 65697, 'loss/train': 1.5488746166229248} 11/07/2021 06:29:01 - INFO - __main__ - Step 65699: {'lr': 0.00030423539052917755, 'samples': 12614208, 'steps': 65698, 'loss/train': 1.5532512664794922} 11/07/2021 06:29:02 - INFO - __main__ - Step 65700: {'lr': 0.00030423021015489095, 'samples': 12614400, 'steps': 65699, 'loss/train': 1.4740041494369507} 11/07/2021 06:29:03 - INFO - __main__ - Step 65701: {'lr': 0.00030422502975616914, 'samples': 12614592, 'steps': 65700, 'loss/train': 1.4233269691467285} 11/07/2021 06:29:03 - INFO - __main__ - Step 65702: {'lr': 0.0003042198493330143, 'samples': 12614784, 'steps': 65701, 'loss/train': 1.0966424942016602} 11/07/2021 06:29:03 - INFO - __main__ - Step 65703: {'lr': 0.0003042146688854288, 'samples': 12614976, 'steps': 65702, 'loss/train': 1.5056899785995483} 11/07/2021 06:29:04 - INFO - __main__ - Step 65704: {'lr': 0.0003042094884134151, 'samples': 12615168, 'steps': 65703, 'loss/train': 1.1387884616851807} 11/07/2021 06:29:05 - INFO - __main__ - Step 65705: {'lr': 0.0003042043079169754, 'samples': 12615360, 'steps': 65704, 'loss/train': 1.5540584325790405} 11/07/2021 06:29:05 - INFO - __main__ - Step 65706: {'lr': 0.0003041991273961121, 'samples': 12615552, 'steps': 65705, 'loss/train': 1.507927656173706} 11/07/2021 06:29:06 - INFO - __main__ - Step 65707: {'lr': 0.0003041939468508275, 'samples': 12615744, 'steps': 65706, 'loss/train': 1.0803898572921753} 11/07/2021 06:29:06 - INFO - __main__ - Step 65708: {'lr': 0.0003041887662811239, 'samples': 12615936, 'steps': 65707, 'loss/train': 1.4016867876052856} 11/07/2021 06:29:07 - INFO - __main__ - Step 65709: {'lr': 0.00030418358568700375, 'samples': 12616128, 'steps': 65708, 'loss/train': 0.8045868873596191} 11/07/2021 06:29:07 - INFO - __main__ - Step 65710: {'lr': 0.0003041784050684693, 'samples': 12616320, 'steps': 65709, 'loss/train': 1.6202858686447144} 11/07/2021 06:29:08 - INFO - __main__ - Step 65711: {'lr': 0.0003041732244255228, 'samples': 12616512, 'steps': 65710, 'loss/train': 1.2213151454925537} 11/07/2021 06:29:08 - INFO - __main__ - Step 65712: {'lr': 0.00030416804375816675, 'samples': 12616704, 'steps': 65711, 'loss/train': 1.2595828771591187} 11/07/2021 06:29:08 - INFO - __main__ - Step 65713: {'lr': 0.0003041628630664035, 'samples': 12616896, 'steps': 65712, 'loss/train': 1.2705177068710327} 11/07/2021 06:29:09 - INFO - __main__ - Step 65714: {'lr': 0.00030415768235023523, 'samples': 12617088, 'steps': 65713, 'loss/train': 1.42369544506073} 11/07/2021 06:29:10 - INFO - __main__ - Step 65715: {'lr': 0.0003041525016096643, 'samples': 12617280, 'steps': 65714, 'loss/train': 1.7619435787200928} 11/07/2021 06:29:10 - INFO - __main__ - Step 65716: {'lr': 0.0003041473208446931, 'samples': 12617472, 'steps': 65715, 'loss/train': 1.68000066280365} 11/07/2021 06:29:11 - INFO - __main__ - Step 65717: {'lr': 0.000304142140055324, 'samples': 12617664, 'steps': 65716, 'loss/train': 1.328873634338379} 11/07/2021 06:29:11 - INFO - __main__ - Step 65718: {'lr': 0.0003041369592415592, 'samples': 12617856, 'steps': 65717, 'loss/train': 0.9986335635185242} 11/07/2021 06:29:11 - INFO - __main__ - Step 65719: {'lr': 0.0003041317784034012, 'samples': 12618048, 'steps': 65718, 'loss/train': 0.043089985847473145} 11/07/2021 06:29:12 - INFO - __main__ - Step 65720: {'lr': 0.00030412659754085224, 'samples': 12618240, 'steps': 65719, 'loss/train': 1.3661606311798096} 11/07/2021 06:29:13 - INFO - __main__ - Step 65721: {'lr': 0.0003041214166539147, 'samples': 12618432, 'steps': 65720, 'loss/train': 1.7410013675689697} 11/07/2021 06:29:13 - INFO - __main__ - Step 65722: {'lr': 0.00030411623574259087, 'samples': 12618624, 'steps': 65721, 'loss/train': 1.1985992193222046} 11/07/2021 06:29:13 - INFO - __main__ - Step 65723: {'lr': 0.0003041110548068831, 'samples': 12618816, 'steps': 65722, 'loss/train': 1.232016921043396} 11/07/2021 06:29:14 - INFO - __main__ - Step 65724: {'lr': 0.0003041058738467937, 'samples': 12619008, 'steps': 65723, 'loss/train': 0.9085338711738586} 11/07/2021 06:29:15 - INFO - __main__ - Step 65725: {'lr': 0.000304100692862325, 'samples': 12619200, 'steps': 65724, 'loss/train': 1.6491734981536865} 11/07/2021 06:29:15 - INFO - __main__ - Step 65726: {'lr': 0.00030409551185347946, 'samples': 12619392, 'steps': 65725, 'loss/train': 1.1091291904449463} 11/07/2021 06:29:15 - INFO - __main__ - Step 65727: {'lr': 0.00030409033082025923, 'samples': 12619584, 'steps': 65726, 'loss/train': 1.377829909324646} 11/07/2021 06:29:16 - INFO - __main__ - Step 65728: {'lr': 0.00030408514976266673, 'samples': 12619776, 'steps': 65727, 'loss/train': 1.2632659673690796} 11/07/2021 06:29:16 - INFO - __main__ - Step 65729: {'lr': 0.0003040799686807043, 'samples': 12619968, 'steps': 65728, 'loss/train': 1.3212463855743408} 11/07/2021 06:29:17 - INFO - __main__ - Step 65730: {'lr': 0.0003040747875743743, 'samples': 12620160, 'steps': 65729, 'loss/train': 1.0176191329956055} 11/07/2021 06:29:17 - INFO - __main__ - Step 65731: {'lr': 0.00030406960644367904, 'samples': 12620352, 'steps': 65730, 'loss/train': 1.3617185354232788} 11/07/2021 06:29:18 - INFO - __main__ - Step 65732: {'lr': 0.00030406442528862083, 'samples': 12620544, 'steps': 65731, 'loss/train': 1.386955976486206} 11/07/2021 06:29:18 - INFO - __main__ - Step 65733: {'lr': 0.00030405924410920206, 'samples': 12620736, 'steps': 65732, 'loss/train': 0.6317892670631409} 11/07/2021 06:29:19 - INFO - __main__ - Step 65734: {'lr': 0.00030405406290542496, 'samples': 12620928, 'steps': 65733, 'loss/train': 1.4691007137298584} 11/07/2021 06:29:19 - INFO - __main__ - Step 65735: {'lr': 0.000304048881677292, 'samples': 12621120, 'steps': 65734, 'loss/train': 0.8576750755310059} 11/07/2021 06:29:20 - INFO - __main__ - Step 65736: {'lr': 0.0003040437004248054, 'samples': 12621312, 'steps': 65735, 'loss/train': 1.3385124206542969} 11/07/2021 06:29:20 - INFO - __main__ - Step 65737: {'lr': 0.0003040385191479675, 'samples': 12621504, 'steps': 65736, 'loss/train': 1.3965781927108765} 11/07/2021 06:29:21 - INFO - __main__ - Step 65738: {'lr': 0.0003040333378467808, 'samples': 12621696, 'steps': 65737, 'loss/train': 1.2832612991333008} 11/07/2021 06:29:21 - INFO - __main__ - Step 65739: {'lr': 0.0003040281565212475, 'samples': 12621888, 'steps': 65738, 'loss/train': 1.7271196842193604} 11/07/2021 06:29:22 - INFO - __main__ - Step 65740: {'lr': 0.0003040229751713699, 'samples': 12622080, 'steps': 65739, 'loss/train': 1.0887385606765747} 11/07/2021 06:29:22 - INFO - __main__ - Step 65741: {'lr': 0.00030401779379715037, 'samples': 12622272, 'steps': 65740, 'loss/train': 0.9279345870018005} 11/07/2021 06:29:23 - INFO - __main__ - Step 65742: {'lr': 0.00030401261239859124, 'samples': 12622464, 'steps': 65741, 'loss/train': 1.338363766670227} 11/07/2021 06:29:23 - INFO - __main__ - Step 65743: {'lr': 0.0003040074309756949, 'samples': 12622656, 'steps': 65742, 'loss/train': 0.754964292049408} 11/07/2021 06:29:23 - INFO - __main__ - Step 65744: {'lr': 0.0003040022495284637, 'samples': 12622848, 'steps': 65743, 'loss/train': 1.3916265964508057} 11/07/2021 06:29:25 - INFO - __main__ - Step 65745: {'lr': 0.0003039970680568998, 'samples': 12623040, 'steps': 65744, 'loss/train': 1.7641143798828125} 11/07/2021 06:29:25 - INFO - __main__ - Step 65746: {'lr': 0.00030399188656100574, 'samples': 12623232, 'steps': 65745, 'loss/train': 1.4687491655349731} 11/07/2021 06:29:26 - INFO - __main__ - Step 65747: {'lr': 0.0003039867050407837, 'samples': 12623424, 'steps': 65746, 'loss/train': 1.366152048110962} 11/07/2021 06:29:26 - INFO - __main__ - Step 65748: {'lr': 0.0003039815234962361, 'samples': 12623616, 'steps': 65747, 'loss/train': 1.117135763168335} 11/07/2021 06:29:26 - INFO - __main__ - Step 65749: {'lr': 0.00030397634192736535, 'samples': 12623808, 'steps': 65748, 'loss/train': 1.0333952903747559} 11/07/2021 06:29:27 - INFO - __main__ - Step 65750: {'lr': 0.0003039711603341736, 'samples': 12624000, 'steps': 65749, 'loss/train': 0.8384309411048889} 11/07/2021 06:29:28 - INFO - __main__ - Step 65751: {'lr': 0.00030396597871666333, 'samples': 12624192, 'steps': 65750, 'loss/train': 1.5665656328201294} 11/07/2021 06:29:28 - INFO - __main__ - Step 65752: {'lr': 0.0003039607970748368, 'samples': 12624384, 'steps': 65751, 'loss/train': 1.3373414278030396} 11/07/2021 06:29:28 - INFO - __main__ - Step 65753: {'lr': 0.0003039556154086963, 'samples': 12624576, 'steps': 65752, 'loss/train': 1.291377067565918} 11/07/2021 06:29:29 - INFO - __main__ - Step 65754: {'lr': 0.00030395043371824425, 'samples': 12624768, 'steps': 65753, 'loss/train': 1.2556719779968262} 11/07/2021 06:29:29 - INFO - __main__ - Step 65755: {'lr': 0.0003039452520034831, 'samples': 12624960, 'steps': 65754, 'loss/train': 1.1447139978408813} 11/07/2021 06:29:29 - INFO - __main__ - Step 65756: {'lr': 0.00030394007026441494, 'samples': 12625152, 'steps': 65755, 'loss/train': 1.1842752695083618} 11/07/2021 06:29:30 - INFO - __main__ - Step 65757: {'lr': 0.0003039348885010422, 'samples': 12625344, 'steps': 65756, 'loss/train': 0.37929531931877136} 11/07/2021 06:29:31 - INFO - __main__ - Step 65758: {'lr': 0.0003039297067133673, 'samples': 12625536, 'steps': 65757, 'loss/train': 1.406220555305481} 11/07/2021 06:29:31 - INFO - __main__ - Step 65759: {'lr': 0.00030392452490139244, 'samples': 12625728, 'steps': 65758, 'loss/train': 1.6937315464019775} 11/07/2021 06:29:31 - INFO - __main__ - Step 65760: {'lr': 0.0003039193430651201, 'samples': 12625920, 'steps': 65759, 'loss/train': 1.7432665824890137} 11/07/2021 06:29:32 - INFO - __main__ - Step 65761: {'lr': 0.0003039141612045525, 'samples': 12626112, 'steps': 65760, 'loss/train': 1.531729817390442} 11/07/2021 06:29:33 - INFO - __main__ - Step 65762: {'lr': 0.000303908979319692, 'samples': 12626304, 'steps': 65761, 'loss/train': 1.1679106950759888} 11/07/2021 06:29:33 - INFO - __main__ - Step 65763: {'lr': 0.0003039037974105409, 'samples': 12626496, 'steps': 65762, 'loss/train': 1.1353424787521362} 11/07/2021 06:29:33 - INFO - __main__ - Step 65764: {'lr': 0.0003038986154771016, 'samples': 12626688, 'steps': 65763, 'loss/train': 1.6213105916976929} 11/07/2021 06:29:34 - INFO - __main__ - Step 65765: {'lr': 0.0003038934335193765, 'samples': 12626880, 'steps': 65764, 'loss/train': 1.127847671508789} 11/07/2021 06:29:34 - INFO - __main__ - Step 65766: {'lr': 0.00030388825153736775, 'samples': 12627072, 'steps': 65765, 'loss/train': 1.6453256607055664} 11/07/2021 06:29:35 - INFO - __main__ - Step 65767: {'lr': 0.0003038830695310779, 'samples': 12627264, 'steps': 65766, 'loss/train': 1.0724252462387085} 11/07/2021 06:29:36 - INFO - __main__ - Step 65768: {'lr': 0.0003038778875005091, 'samples': 12627456, 'steps': 65767, 'loss/train': 1.0447502136230469} 11/07/2021 06:29:36 - INFO - __main__ - Step 65769: {'lr': 0.00030387270544566375, 'samples': 12627648, 'steps': 65768, 'loss/train': 0.48964399099349976} 11/07/2021 06:29:36 - INFO - __main__ - Step 65770: {'lr': 0.00030386752336654415, 'samples': 12627840, 'steps': 65769, 'loss/train': 0.7276933193206787} 11/07/2021 06:29:37 - INFO - __main__ - Step 65771: {'lr': 0.00030386234126315273, 'samples': 12628032, 'steps': 65770, 'loss/train': 0.2259787619113922} 11/07/2021 06:29:38 - INFO - __main__ - Step 65772: {'lr': 0.00030385715913549177, 'samples': 12628224, 'steps': 65771, 'loss/train': 1.5375685691833496} 11/07/2021 06:29:38 - INFO - __main__ - Step 65773: {'lr': 0.00030385197698356366, 'samples': 12628416, 'steps': 65772, 'loss/train': 1.0291926860809326} 11/07/2021 06:29:38 - INFO - __main__ - Step 65774: {'lr': 0.0003038467948073706, 'samples': 12628608, 'steps': 65773, 'loss/train': 1.2926836013793945} 11/07/2021 06:29:39 - INFO - __main__ - Step 65775: {'lr': 0.000303841612606915, 'samples': 12628800, 'steps': 65774, 'loss/train': 1.139314889907837} 11/07/2021 06:29:39 - INFO - __main__ - Step 65776: {'lr': 0.0003038364303821992, 'samples': 12628992, 'steps': 65775, 'loss/train': 1.0417550802230835} 11/07/2021 06:29:39 - INFO - __main__ - Step 65777: {'lr': 0.00030383124813322557, 'samples': 12629184, 'steps': 65776, 'loss/train': 1.2811909914016724} 11/07/2021 06:29:41 - INFO - __main__ - Step 65778: {'lr': 0.00030382606585999637, 'samples': 12629376, 'steps': 65777, 'loss/train': 2.1986868381500244} 11/07/2021 06:29:41 - INFO - __main__ - Step 65779: {'lr': 0.000303820883562514, 'samples': 12629568, 'steps': 65778, 'loss/train': 1.0189275741577148} 11/07/2021 06:29:41 - INFO - __main__ - Step 65780: {'lr': 0.00030381570124078086, 'samples': 12629760, 'steps': 65779, 'loss/train': 1.2666953802108765} 11/07/2021 06:29:42 - INFO - __main__ - Step 65781: {'lr': 0.00030381051889479904, 'samples': 12629952, 'steps': 65780, 'loss/train': 1.007005214691162} 11/07/2021 06:29:42 - INFO - __main__ - Step 65782: {'lr': 0.0003038053365245711, 'samples': 12630144, 'steps': 65781, 'loss/train': 1.3395113945007324} 11/07/2021 06:29:43 - INFO - __main__ - Step 65783: {'lr': 0.0003038001541300993, 'samples': 12630336, 'steps': 65782, 'loss/train': 0.059600308537483215} 11/07/2021 06:29:43 - INFO - __main__ - Step 65784: {'lr': 0.00030379497171138597, 'samples': 12630528, 'steps': 65783, 'loss/train': 1.4612953662872314} 11/07/2021 06:29:44 - INFO - __main__ - Step 65785: {'lr': 0.0003037897892684335, 'samples': 12630720, 'steps': 65784, 'loss/train': 1.5611944198608398} 11/07/2021 06:29:44 - INFO - __main__ - Step 65786: {'lr': 0.00030378460680124416, 'samples': 12630912, 'steps': 65785, 'loss/train': 1.4650487899780273} 11/07/2021 06:29:45 - INFO - __main__ - Step 65787: {'lr': 0.0003037794243098203, 'samples': 12631104, 'steps': 65786, 'loss/train': 1.3068972826004028} 11/07/2021 06:29:45 - INFO - __main__ - Step 65788: {'lr': 0.00030377424179416426, 'samples': 12631296, 'steps': 65787, 'loss/train': 1.3522133827209473} 11/07/2021 06:29:46 - INFO - __main__ - Step 65789: {'lr': 0.0003037690592542784, 'samples': 12631488, 'steps': 65788, 'loss/train': 1.7363780736923218} 11/07/2021 06:29:46 - INFO - __main__ - Step 65790: {'lr': 0.000303763876690165, 'samples': 12631680, 'steps': 65789, 'loss/train': 1.6071703433990479} 11/07/2021 06:29:47 - INFO - __main__ - Step 65791: {'lr': 0.00030375869410182636, 'samples': 12631872, 'steps': 65790, 'loss/train': 1.5317007303237915} 11/07/2021 06:29:47 - INFO - __main__ - Step 65792: {'lr': 0.000303753511489265, 'samples': 12632064, 'steps': 65791, 'loss/train': 0.8449847102165222} 11/07/2021 06:29:48 - INFO - __main__ - Step 65793: {'lr': 0.0003037483288524831, 'samples': 12632256, 'steps': 65792, 'loss/train': 1.0865534543991089} 11/07/2021 06:29:48 - INFO - __main__ - Step 65794: {'lr': 0.00030374314619148305, 'samples': 12632448, 'steps': 65793, 'loss/train': 1.5427426099777222} 11/07/2021 06:29:49 - INFO - __main__ - Step 65795: {'lr': 0.00030373796350626717, 'samples': 12632640, 'steps': 65794, 'loss/train': 1.5296263694763184} 11/07/2021 06:29:49 - INFO - __main__ - Step 65796: {'lr': 0.00030373278079683775, 'samples': 12632832, 'steps': 65795, 'loss/train': 1.5530946254730225} 11/07/2021 06:29:49 - INFO - __main__ - Step 65797: {'lr': 0.00030372759806319717, 'samples': 12633024, 'steps': 65796, 'loss/train': 0.5782380700111389} 11/07/2021 06:29:50 - INFO - __main__ - Step 65798: {'lr': 0.00030372241530534776, 'samples': 12633216, 'steps': 65797, 'loss/train': 1.6271296739578247} 11/07/2021 06:29:51 - INFO - __main__ - Step 65799: {'lr': 0.00030371723252329186, 'samples': 12633408, 'steps': 65798, 'loss/train': 1.8431646823883057} 11/07/2021 06:29:51 - INFO - __main__ - Step 65800: {'lr': 0.00030371204971703185, 'samples': 12633600, 'steps': 65799, 'loss/train': 1.3964025974273682} 11/07/2021 06:29:51 - INFO - __main__ - Step 65801: {'lr': 0.00030370686688657, 'samples': 12633792, 'steps': 65800, 'loss/train': 1.6670098304748535} 11/07/2021 06:29:52 - INFO - __main__ - Step 65802: {'lr': 0.00030370168403190867, 'samples': 12633984, 'steps': 65801, 'loss/train': 1.1498667001724243} 11/07/2021 06:29:53 - INFO - __main__ - Step 65803: {'lr': 0.00030369650115305016, 'samples': 12634176, 'steps': 65802, 'loss/train': 1.7127219438552856} 11/07/2021 06:29:53 - INFO - __main__ - Step 65804: {'lr': 0.00030369131824999686, 'samples': 12634368, 'steps': 65803, 'loss/train': 1.5298892259597778} 11/07/2021 06:29:54 - INFO - __main__ - Step 65805: {'lr': 0.000303686135322751, 'samples': 12634560, 'steps': 65804, 'loss/train': 1.277737021446228} 11/07/2021 06:29:54 - INFO - __main__ - Step 65806: {'lr': 0.0003036809523713151, 'samples': 12634752, 'steps': 65805, 'loss/train': 1.3471204042434692} 11/07/2021 06:29:54 - INFO - __main__ - Step 65807: {'lr': 0.0003036757693956914, 'samples': 12634944, 'steps': 65806, 'loss/train': 1.338641881942749} 11/07/2021 06:29:55 - INFO - __main__ - Step 65808: {'lr': 0.0003036705863958822, 'samples': 12635136, 'steps': 65807, 'loss/train': 1.3735355138778687} 11/07/2021 06:29:56 - INFO - __main__ - Step 65809: {'lr': 0.0003036654033718898, 'samples': 12635328, 'steps': 65808, 'loss/train': 1.204513669013977} 11/07/2021 06:29:56 - INFO - __main__ - Step 65810: {'lr': 0.00030366022032371666, 'samples': 12635520, 'steps': 65809, 'loss/train': 1.522355079650879} 11/07/2021 06:29:56 - INFO - __main__ - Step 65811: {'lr': 0.00030365503725136503, 'samples': 12635712, 'steps': 65810, 'loss/train': 0.9637055993080139} 11/07/2021 06:29:57 - INFO - __main__ - Step 65812: {'lr': 0.00030364985415483727, 'samples': 12635904, 'steps': 65811, 'loss/train': 1.5280184745788574} 11/07/2021 06:29:58 - INFO - __main__ - Step 65813: {'lr': 0.0003036446710341357, 'samples': 12636096, 'steps': 65812, 'loss/train': 1.558131456375122} 11/07/2021 06:29:58 - INFO - __main__ - Step 65814: {'lr': 0.0003036394878892627, 'samples': 12636288, 'steps': 65813, 'loss/train': 1.5909292697906494} 11/07/2021 06:29:59 - INFO - __main__ - Step 65815: {'lr': 0.0003036343047202206, 'samples': 12636480, 'steps': 65814, 'loss/train': 1.5338749885559082} 11/07/2021 06:29:59 - INFO - __main__ - Step 65816: {'lr': 0.00030362912152701163, 'samples': 12636672, 'steps': 65815, 'loss/train': 0.11059756577014923} 11/07/2021 06:29:59 - INFO - __main__ - Step 65817: {'lr': 0.00030362393830963826, 'samples': 12636864, 'steps': 65816, 'loss/train': 1.0801671743392944} 11/07/2021 06:30:00 - INFO - __main__ - Step 65818: {'lr': 0.00030361875506810273, 'samples': 12637056, 'steps': 65817, 'loss/train': 1.454890251159668} 11/07/2021 06:30:01 - INFO - __main__ - Step 65819: {'lr': 0.00030361357180240745, 'samples': 12637248, 'steps': 65818, 'loss/train': 1.057037591934204} 11/07/2021 06:30:01 - INFO - __main__ - Step 65820: {'lr': 0.0003036083885125547, 'samples': 12637440, 'steps': 65819, 'loss/train': 1.775730013847351} 11/07/2021 06:30:02 - INFO - __main__ - Step 65821: {'lr': 0.0003036032051985469, 'samples': 12637632, 'steps': 65820, 'loss/train': 1.5734775066375732} 11/07/2021 06:30:02 - INFO - __main__ - Step 65822: {'lr': 0.00030359802186038625, 'samples': 12637824, 'steps': 65821, 'loss/train': 1.2608145475387573} 11/07/2021 06:30:03 - INFO - __main__ - Step 65823: {'lr': 0.00030359283849807516, 'samples': 12638016, 'steps': 65822, 'loss/train': 1.1906715631484985} 11/07/2021 06:30:03 - INFO - __main__ - Step 65824: {'lr': 0.000303587655111616, 'samples': 12638208, 'steps': 65823, 'loss/train': 0.6400460004806519} 11/07/2021 06:30:04 - INFO - __main__ - Step 65825: {'lr': 0.00030358247170101104, 'samples': 12638400, 'steps': 65824, 'loss/train': 1.316996693611145} 11/07/2021 06:30:04 - INFO - __main__ - Step 65826: {'lr': 0.00030357728826626266, 'samples': 12638592, 'steps': 65825, 'loss/train': 1.6961294412612915} 11/07/2021 06:30:04 - INFO - __main__ - Step 65827: {'lr': 0.00030357210480737323, 'samples': 12638784, 'steps': 65826, 'loss/train': 1.2053287029266357} 11/07/2021 06:30:05 - INFO - __main__ - Step 65828: {'lr': 0.000303566921324345, 'samples': 12638976, 'steps': 65827, 'loss/train': 1.3278650045394897} 11/07/2021 06:30:06 - INFO - __main__ - Step 65829: {'lr': 0.00030356173781718033, 'samples': 12639168, 'steps': 65828, 'loss/train': 1.5704584121704102} 11/07/2021 06:30:06 - INFO - __main__ - Step 65830: {'lr': 0.0003035565542858816, 'samples': 12639360, 'steps': 65829, 'loss/train': 0.5059735774993896} 11/07/2021 06:30:07 - INFO - __main__ - Step 65831: {'lr': 0.00030355137073045105, 'samples': 12639552, 'steps': 65830, 'loss/train': 0.39020460844039917} 11/07/2021 06:30:07 - INFO - __main__ - Step 65832: {'lr': 0.0003035461871508911, 'samples': 12639744, 'steps': 65831, 'loss/train': 1.4428070783615112} 11/07/2021 06:30:07 - INFO - __main__ - Step 65833: {'lr': 0.00030354100354720403, 'samples': 12639936, 'steps': 65832, 'loss/train': 1.570949673652649} 11/07/2021 06:30:08 - INFO - __main__ - Step 65834: {'lr': 0.0003035358199193923, 'samples': 12640128, 'steps': 65833, 'loss/train': 1.1679518222808838} 11/07/2021 06:30:09 - INFO - __main__ - Step 65835: {'lr': 0.00030353063626745814, 'samples': 12640320, 'steps': 65834, 'loss/train': 1.3524140119552612} 11/07/2021 06:30:09 - INFO - __main__ - Step 65836: {'lr': 0.0003035254525914038, 'samples': 12640512, 'steps': 65835, 'loss/train': 0.9156963229179382} 11/07/2021 06:30:09 - INFO - __main__ - Step 65837: {'lr': 0.00030352026889123187, 'samples': 12640704, 'steps': 65836, 'loss/train': 2.0584347248077393} 11/07/2021 06:30:10 - INFO - __main__ - Step 65838: {'lr': 0.00030351508516694443, 'samples': 12640896, 'steps': 65837, 'loss/train': 0.6592512726783752} 11/07/2021 06:30:11 - INFO - __main__ - Step 65839: {'lr': 0.0003035099014185439, 'samples': 12641088, 'steps': 65838, 'loss/train': 1.1467190980911255} 11/07/2021 06:30:11 - INFO - __main__ - Step 65840: {'lr': 0.0003035047176460327, 'samples': 12641280, 'steps': 65839, 'loss/train': 1.5238995552062988} 11/07/2021 06:30:11 - INFO - __main__ - Step 65841: {'lr': 0.00030349953384941307, 'samples': 12641472, 'steps': 65840, 'loss/train': 0.9646552205085754} 11/07/2021 06:30:12 - INFO - __main__ - Step 65842: {'lr': 0.0003034943500286874, 'samples': 12641664, 'steps': 65841, 'loss/train': 1.4175161123275757} 11/07/2021 06:30:12 - INFO - __main__ - Step 65843: {'lr': 0.00030348916618385796, 'samples': 12641856, 'steps': 65842, 'loss/train': 1.5005053281784058} 11/07/2021 06:30:13 - INFO - __main__ - Step 65844: {'lr': 0.0003034839823149271, 'samples': 12642048, 'steps': 65843, 'loss/train': 1.5186415910720825} 11/07/2021 06:30:13 - INFO - __main__ - Step 65845: {'lr': 0.0003034787984218973, 'samples': 12642240, 'steps': 65844, 'loss/train': 0.750397264957428} 11/07/2021 06:30:14 - INFO - __main__ - Step 65846: {'lr': 0.0003034736145047707, 'samples': 12642432, 'steps': 65845, 'loss/train': 1.4713785648345947} 11/07/2021 06:30:14 - INFO - __main__ - Step 65847: {'lr': 0.0003034684305635497, 'samples': 12642624, 'steps': 65846, 'loss/train': 1.2911324501037598} 11/07/2021 06:30:15 - INFO - __main__ - Step 65848: {'lr': 0.0003034632465982367, 'samples': 12642816, 'steps': 65847, 'loss/train': 0.9099041819572449} 11/07/2021 06:30:16 - INFO - __main__ - Step 65849: {'lr': 0.00030345806260883396, 'samples': 12643008, 'steps': 65848, 'loss/train': 1.4377126693725586} 11/07/2021 06:30:16 - INFO - __main__ - Step 65850: {'lr': 0.00030345287859534384, 'samples': 12643200, 'steps': 65849, 'loss/train': 2.775050401687622} 11/07/2021 06:30:16 - INFO - __main__ - Step 65851: {'lr': 0.00030344769455776865, 'samples': 12643392, 'steps': 65850, 'loss/train': 1.2428561449050903} 11/07/2021 06:30:17 - INFO - __main__ - Step 65852: {'lr': 0.00030344251049611084, 'samples': 12643584, 'steps': 65851, 'loss/train': 0.8317819237709045} 11/07/2021 06:30:17 - INFO - __main__ - Step 65853: {'lr': 0.0003034373264103725, 'samples': 12643776, 'steps': 65852, 'loss/train': 1.468685269355774} 11/07/2021 06:30:18 - INFO - __main__ - Step 65854: {'lr': 0.00030343214230055634, 'samples': 12643968, 'steps': 65853, 'loss/train': 1.2669477462768555} 11/07/2021 06:30:18 - INFO - __main__ - Step 65855: {'lr': 0.0003034269581666643, 'samples': 12644160, 'steps': 65854, 'loss/train': 1.2286049127578735} 11/07/2021 06:30:19 - INFO - __main__ - Step 65856: {'lr': 0.00030342177400869905, 'samples': 12644352, 'steps': 65855, 'loss/train': 2.099937677383423} 11/07/2021 06:30:19 - INFO - __main__ - Step 65857: {'lr': 0.00030341658982666265, 'samples': 12644544, 'steps': 65856, 'loss/train': 1.707685947418213} 11/07/2021 06:30:19 - INFO - __main__ - Step 65858: {'lr': 0.00030341140562055755, 'samples': 12644736, 'steps': 65857, 'loss/train': 0.9288512468338013} 11/07/2021 06:30:20 - INFO - __main__ - Step 65859: {'lr': 0.00030340622139038616, 'samples': 12644928, 'steps': 65858, 'loss/train': 1.3154274225234985} 11/07/2021 06:30:21 - INFO - __main__ - Step 65860: {'lr': 0.0003034010371361507, 'samples': 12645120, 'steps': 65859, 'loss/train': 1.3837310075759888} 11/07/2021 06:30:21 - INFO - __main__ - Step 65861: {'lr': 0.00030339585285785365, 'samples': 12645312, 'steps': 65860, 'loss/train': 1.2816532850265503} 11/07/2021 06:30:22 - INFO - __main__ - Step 65862: {'lr': 0.0003033906685554972, 'samples': 12645504, 'steps': 65861, 'loss/train': 1.4769387245178223} 11/07/2021 06:30:22 - INFO - __main__ - Step 65863: {'lr': 0.00030338548422908373, 'samples': 12645696, 'steps': 65862, 'loss/train': 1.0040477514266968} 11/07/2021 06:30:22 - INFO - __main__ - Step 65864: {'lr': 0.0003033802998786156, 'samples': 12645888, 'steps': 65863, 'loss/train': 2.0564188957214355} 11/07/2021 06:30:23 - INFO - __main__ - Step 65865: {'lr': 0.0003033751155040951, 'samples': 12646080, 'steps': 65864, 'loss/train': 1.7397898435592651} 11/07/2021 06:30:24 - INFO - __main__ - Step 65866: {'lr': 0.00030336993110552455, 'samples': 12646272, 'steps': 65865, 'loss/train': 1.318013310432434} 11/07/2021 06:30:24 - INFO - __main__ - Step 65867: {'lr': 0.00030336474668290645, 'samples': 12646464, 'steps': 65866, 'loss/train': 1.4348700046539307} 11/07/2021 06:30:24 - INFO - __main__ - Step 65868: {'lr': 0.00030335956223624303, 'samples': 12646656, 'steps': 65867, 'loss/train': 1.2991455793380737} 11/07/2021 06:30:25 - INFO - __main__ - Step 65869: {'lr': 0.0003033543777655365, 'samples': 12646848, 'steps': 65868, 'loss/train': 1.545075535774231} 11/07/2021 06:30:25 - INFO - __main__ - Step 65870: {'lr': 0.00030334919327078936, 'samples': 12647040, 'steps': 65869, 'loss/train': 1.7339979410171509} 11/07/2021 06:30:26 - INFO - __main__ - Step 65871: {'lr': 0.0003033440087520039, 'samples': 12647232, 'steps': 65870, 'loss/train': 1.095917820930481} 11/07/2021 06:30:26 - INFO - __main__ - Step 65872: {'lr': 0.0003033388242091824, 'samples': 12647424, 'steps': 65871, 'loss/train': 1.2071040868759155} 11/07/2021 06:30:27 - INFO - __main__ - Step 65873: {'lr': 0.00030333363964232736, 'samples': 12647616, 'steps': 65872, 'loss/train': 1.6243252754211426} 11/07/2021 06:30:27 - INFO - __main__ - Step 65874: {'lr': 0.000303328455051441, 'samples': 12647808, 'steps': 65873, 'loss/train': 1.2549638748168945} 11/07/2021 06:30:27 - INFO - __main__ - Step 65875: {'lr': 0.00030332327043652553, 'samples': 12648000, 'steps': 65874, 'loss/train': 1.3456430435180664} 11/07/2021 06:30:28 - INFO - __main__ - Step 65876: {'lr': 0.0003033180857975835, 'samples': 12648192, 'steps': 65875, 'loss/train': 1.4951846599578857} 11/07/2021 06:30:29 - INFO - __main__ - Step 65877: {'lr': 0.00030331290113461715, 'samples': 12648384, 'steps': 65876, 'loss/train': 1.8474799394607544} 11/07/2021 06:30:29 - INFO - __main__ - Step 65878: {'lr': 0.00030330771644762887, 'samples': 12648576, 'steps': 65877, 'loss/train': 1.357438564300537} 11/07/2021 06:30:29 - INFO - __main__ - Step 65879: {'lr': 0.0003033025317366209, 'samples': 12648768, 'steps': 65878, 'loss/train': 1.3117824792861938} 11/07/2021 06:30:30 - INFO - __main__ - Step 65880: {'lr': 0.00030329734700159565, 'samples': 12648960, 'steps': 65879, 'loss/train': 0.7578467726707458} 11/07/2021 06:30:31 - INFO - __main__ - Step 65881: {'lr': 0.00030329216224255547, 'samples': 12649152, 'steps': 65880, 'loss/train': 1.475385308265686} 11/07/2021 06:30:31 - INFO - __main__ - Step 65882: {'lr': 0.0003032869774595026, 'samples': 12649344, 'steps': 65881, 'loss/train': 1.1342262029647827} 11/07/2021 06:30:32 - INFO - __main__ - Step 65883: {'lr': 0.0003032817926524395, 'samples': 12649536, 'steps': 65882, 'loss/train': 1.5247929096221924} 11/07/2021 06:30:32 - INFO - __main__ - Step 65884: {'lr': 0.00030327660782136843, 'samples': 12649728, 'steps': 65883, 'loss/train': 1.143919825553894} 11/07/2021 06:30:32 - INFO - __main__ - Step 65885: {'lr': 0.00030327142296629174, 'samples': 12649920, 'steps': 65884, 'loss/train': 0.8594958186149597} 11/07/2021 06:30:33 - INFO - __main__ - Step 65886: {'lr': 0.0003032662380872118, 'samples': 12650112, 'steps': 65885, 'loss/train': 1.476389765739441} 11/07/2021 06:30:34 - INFO - __main__ - Step 65887: {'lr': 0.00030326105318413086, 'samples': 12650304, 'steps': 65886, 'loss/train': 1.536088228225708} 11/07/2021 06:30:34 - INFO - __main__ - Step 65888: {'lr': 0.00030325586825705127, 'samples': 12650496, 'steps': 65887, 'loss/train': 1.424874186515808} 11/07/2021 06:30:34 - INFO - __main__ - Step 65889: {'lr': 0.0003032506833059755, 'samples': 12650688, 'steps': 65888, 'loss/train': 1.4630671739578247} 11/07/2021 06:30:35 - INFO - __main__ - Step 65890: {'lr': 0.00030324549833090573, 'samples': 12650880, 'steps': 65889, 'loss/train': 1.5434296131134033} 11/07/2021 06:30:36 - INFO - __main__ - Step 65891: {'lr': 0.00030324031333184444, 'samples': 12651072, 'steps': 65890, 'loss/train': 1.6599342823028564} 11/07/2021 06:30:36 - INFO - __main__ - Step 65892: {'lr': 0.00030323512830879377, 'samples': 12651264, 'steps': 65891, 'loss/train': 1.2865430116653442} 11/07/2021 06:30:36 - INFO - __main__ - Step 65893: {'lr': 0.00030322994326175627, 'samples': 12651456, 'steps': 65892, 'loss/train': 1.2366771697998047} 11/07/2021 06:30:37 - INFO - __main__ - Step 65894: {'lr': 0.0003032247581907342, 'samples': 12651648, 'steps': 65893, 'loss/train': 0.8318915367126465} 11/07/2021 06:30:37 - INFO - __main__ - Step 65895: {'lr': 0.0003032195730957298, 'samples': 12651840, 'steps': 65894, 'loss/train': 1.584991455078125} 11/07/2021 06:30:38 - INFO - __main__ - Step 65896: {'lr': 0.0003032143879767455, 'samples': 12652032, 'steps': 65895, 'loss/train': 1.4227981567382812} 11/07/2021 06:30:39 - INFO - __main__ - Step 65897: {'lr': 0.0003032092028337836, 'samples': 12652224, 'steps': 65896, 'loss/train': 1.7478755712509155} 11/07/2021 06:30:39 - INFO - __main__ - Step 65898: {'lr': 0.00030320401766684645, 'samples': 12652416, 'steps': 65897, 'loss/train': 1.5471925735473633} 11/07/2021 06:30:39 - INFO - __main__ - Step 65899: {'lr': 0.00030319883247593646, 'samples': 12652608, 'steps': 65898, 'loss/train': 1.2189738750457764} 11/07/2021 06:30:40 - INFO - __main__ - Step 65900: {'lr': 0.00030319364726105584, 'samples': 12652800, 'steps': 65899, 'loss/train': 1.3892449140548706} 11/07/2021 06:30:40 - INFO - __main__ - Step 65901: {'lr': 0.0003031884620222071, 'samples': 12652992, 'steps': 65900, 'loss/train': 0.5630688071250916} 11/07/2021 06:30:41 - INFO - __main__ - Step 65902: {'lr': 0.00030318327675939226, 'samples': 12653184, 'steps': 65901, 'loss/train': 1.614032506942749} 11/07/2021 06:30:41 - INFO - __main__ - Step 65903: {'lr': 0.000303178091472614, 'samples': 12653376, 'steps': 65902, 'loss/train': 1.2171766757965088} 11/07/2021 06:30:42 - INFO - __main__ - Step 65904: {'lr': 0.0003031729061618744, 'samples': 12653568, 'steps': 65903, 'loss/train': 1.5402904748916626} 11/07/2021 06:30:42 - INFO - __main__ - Step 65905: {'lr': 0.000303167720827176, 'samples': 12653760, 'steps': 65904, 'loss/train': 1.412523865699768} 11/07/2021 06:30:42 - INFO - __main__ - Step 65906: {'lr': 0.000303162535468521, 'samples': 12653952, 'steps': 65905, 'loss/train': 1.6023902893066406} 11/07/2021 06:30:43 - INFO - __main__ - Step 65907: {'lr': 0.00030315735008591184, 'samples': 12654144, 'steps': 65906, 'loss/train': 1.5538760423660278} 11/07/2021 06:30:44 - INFO - __main__ - Step 65908: {'lr': 0.00030315216467935083, 'samples': 12654336, 'steps': 65907, 'loss/train': 1.6558178663253784} 11/07/2021 06:30:44 - INFO - __main__ - Step 65909: {'lr': 0.0003031469792488402, 'samples': 12654528, 'steps': 65908, 'loss/train': 1.7403451204299927} 11/07/2021 06:30:44 - INFO - __main__ - Step 65910: {'lr': 0.00030314179379438227, 'samples': 12654720, 'steps': 65909, 'loss/train': 1.3885524272918701} 11/07/2021 06:30:45 - INFO - __main__ - Step 65911: {'lr': 0.0003031366083159796, 'samples': 12654912, 'steps': 65910, 'loss/train': 1.2204060554504395} 11/07/2021 06:30:46 - INFO - __main__ - Step 65912: {'lr': 0.00030313142281363436, 'samples': 12655104, 'steps': 65911, 'loss/train': 1.5216124057769775} 11/07/2021 06:30:46 - INFO - __main__ - Step 65913: {'lr': 0.0003031262372873489, 'samples': 12655296, 'steps': 65912, 'loss/train': 1.7177996635437012} 11/07/2021 06:30:46 - INFO - __main__ - Step 65914: {'lr': 0.00030312105173712554, 'samples': 12655488, 'steps': 65913, 'loss/train': 1.6352758407592773} 11/07/2021 06:30:47 - INFO - __main__ - Step 65915: {'lr': 0.00030311586616296683, 'samples': 12655680, 'steps': 65914, 'loss/train': 1.0177290439605713} 11/07/2021 06:30:47 - INFO - __main__ - Step 65916: {'lr': 0.0003031106805648748, 'samples': 12655872, 'steps': 65915, 'loss/train': 0.857221245765686} 11/07/2021 06:30:48 - INFO - __main__ - Step 65917: {'lr': 0.0003031054949428519, 'samples': 12656064, 'steps': 65916, 'loss/train': 1.392675518989563} 11/07/2021 06:30:49 - INFO - __main__ - Step 65918: {'lr': 0.0003031003092969005, 'samples': 12656256, 'steps': 65917, 'loss/train': 1.844696044921875} 11/07/2021 06:30:49 - INFO - __main__ - Step 65919: {'lr': 0.0003030951236270229, 'samples': 12656448, 'steps': 65918, 'loss/train': 1.6020987033843994} 11/07/2021 06:30:49 - INFO - __main__ - Step 65920: {'lr': 0.00030308993793322147, 'samples': 12656640, 'steps': 65919, 'loss/train': 1.7786922454833984} 11/07/2021 06:30:50 - INFO - __main__ - Step 65921: {'lr': 0.0003030847522154986, 'samples': 12656832, 'steps': 65920, 'loss/train': 1.36826753616333} 11/07/2021 06:30:51 - INFO - __main__ - Step 65922: {'lr': 0.00030307956647385653, 'samples': 12657024, 'steps': 65921, 'loss/train': 1.7280840873718262} 11/07/2021 06:30:51 - INFO - __main__ - Step 65923: {'lr': 0.00030307438070829764, 'samples': 12657216, 'steps': 65922, 'loss/train': 1.3443225622177124} 11/07/2021 06:30:52 - INFO - __main__ - Step 65924: {'lr': 0.0003030691949188242, 'samples': 12657408, 'steps': 65923, 'loss/train': 1.1909345388412476} 11/07/2021 06:30:52 - INFO - __main__ - Step 65925: {'lr': 0.0003030640091054386, 'samples': 12657600, 'steps': 65924, 'loss/train': 1.5364887714385986} 11/07/2021 06:30:53 - INFO - __main__ - Step 65926: {'lr': 0.00030305882326814315, 'samples': 12657792, 'steps': 65925, 'loss/train': 1.6382373571395874} 11/07/2021 06:30:53 - INFO - __main__ - Step 65927: {'lr': 0.00030305363740694023, 'samples': 12657984, 'steps': 65926, 'loss/train': 0.8814562559127808} 11/07/2021 06:30:54 - INFO - __main__ - Step 65928: {'lr': 0.0003030484515218323, 'samples': 12658176, 'steps': 65927, 'loss/train': 1.2474056482315063} 11/07/2021 06:30:54 - INFO - __main__ - Step 65929: {'lr': 0.0003030432656128214, 'samples': 12658368, 'steps': 65928, 'loss/train': 1.4179925918579102} 11/07/2021 06:30:55 - INFO - __main__ - Step 65930: {'lr': 0.00030303807967991007, 'samples': 12658560, 'steps': 65929, 'loss/train': 1.6114710569381714} 11/07/2021 06:30:55 - INFO - __main__ - Step 65931: {'lr': 0.00030303289372310063, 'samples': 12658752, 'steps': 65930, 'loss/train': 1.3754929304122925} 11/07/2021 06:30:55 - INFO - __main__ - Step 65932: {'lr': 0.00030302770774239527, 'samples': 12658944, 'steps': 65931, 'loss/train': 1.1369659900665283} 11/07/2021 06:30:56 - INFO - __main__ - Step 65933: {'lr': 0.00030302252173779653, 'samples': 12659136, 'steps': 65932, 'loss/train': 1.5189467668533325} 11/07/2021 06:30:57 - INFO - __main__ - Step 65934: {'lr': 0.0003030173357093067, 'samples': 12659328, 'steps': 65933, 'loss/train': 1.3327853679656982} 11/07/2021 06:30:57 - INFO - __main__ - Step 65935: {'lr': 0.0003030121496569281, 'samples': 12659520, 'steps': 65934, 'loss/train': 1.4102084636688232} 11/07/2021 06:30:57 - INFO - __main__ - Step 65936: {'lr': 0.00030300696358066294, 'samples': 12659712, 'steps': 65935, 'loss/train': 1.2232686281204224} 11/07/2021 06:30:58 - INFO - __main__ - Step 65937: {'lr': 0.00030300177748051373, 'samples': 12659904, 'steps': 65936, 'loss/train': 1.093085765838623} 11/07/2021 06:30:59 - INFO - __main__ - Step 65938: {'lr': 0.00030299659135648265, 'samples': 12660096, 'steps': 65937, 'loss/train': 1.2306736707687378} 11/07/2021 06:30:59 - INFO - __main__ - Step 65939: {'lr': 0.00030299140520857217, 'samples': 12660288, 'steps': 65938, 'loss/train': 1.0700452327728271} 11/07/2021 06:30:59 - INFO - __main__ - Step 65940: {'lr': 0.0003029862190367846, 'samples': 12660480, 'steps': 65939, 'loss/train': 1.3946728706359863} 11/07/2021 06:31:00 - INFO - __main__ - Step 65941: {'lr': 0.00030298103284112226, 'samples': 12660672, 'steps': 65940, 'loss/train': 0.8837518095970154} 11/07/2021 06:31:00 - INFO - __main__ - Step 65942: {'lr': 0.0003029758466215875, 'samples': 12660864, 'steps': 65941, 'loss/train': 1.6922825574874878} 11/07/2021 06:31:01 - INFO - __main__ - Step 65943: {'lr': 0.0003029706603781826, 'samples': 12661056, 'steps': 65942, 'loss/train': 1.2934892177581787} 11/07/2021 06:31:02 - INFO - __main__ - Step 65944: {'lr': 0.0003029654741109099, 'samples': 12661248, 'steps': 65943, 'loss/train': 2.247894048690796} 11/07/2021 06:31:02 - INFO - __main__ - Step 65945: {'lr': 0.0003029602878197719, 'samples': 12661440, 'steps': 65944, 'loss/train': 0.941990315914154} 11/07/2021 06:31:02 - INFO - __main__ - Step 65946: {'lr': 0.00030295510150477067, 'samples': 12661632, 'steps': 65945, 'loss/train': 1.36214017868042} 11/07/2021 06:31:03 - INFO - __main__ - Step 65947: {'lr': 0.00030294991516590877, 'samples': 12661824, 'steps': 65946, 'loss/train': 1.1888724565505981} 11/07/2021 06:31:04 - INFO - __main__ - Step 65948: {'lr': 0.00030294472880318846, 'samples': 12662016, 'steps': 65947, 'loss/train': 0.8436814546585083} 11/07/2021 06:31:04 - INFO - __main__ - Step 65949: {'lr': 0.000302939542416612, 'samples': 12662208, 'steps': 65948, 'loss/train': 1.1591527462005615} 11/07/2021 06:31:04 - INFO - __main__ - Step 65950: {'lr': 0.00030293435600618193, 'samples': 12662400, 'steps': 65949, 'loss/train': 0.7238063812255859} 11/07/2021 06:31:05 - INFO - __main__ - Step 65951: {'lr': 0.0003029291695719003, 'samples': 12662592, 'steps': 65950, 'loss/train': 0.7992502450942993} 11/07/2021 06:31:05 - INFO - __main__ - Step 65952: {'lr': 0.0003029239831137697, 'samples': 12662784, 'steps': 65951, 'loss/train': 1.248275637626648} 11/07/2021 06:31:06 - INFO - __main__ - Step 65953: {'lr': 0.00030291879663179233, 'samples': 12662976, 'steps': 65952, 'loss/train': 2.730868339538574} 11/07/2021 06:31:06 - INFO - __main__ - Step 65954: {'lr': 0.00030291361012597056, 'samples': 12663168, 'steps': 65953, 'loss/train': 1.841261386871338} 11/07/2021 06:31:07 - INFO - __main__ - Step 65955: {'lr': 0.0003029084235963068, 'samples': 12663360, 'steps': 65954, 'loss/train': 1.3878883123397827} 11/07/2021 06:31:07 - INFO - __main__ - Step 65956: {'lr': 0.00030290323704280334, 'samples': 12663552, 'steps': 65955, 'loss/train': 1.568084716796875} 11/07/2021 06:31:07 - INFO - __main__ - Step 65957: {'lr': 0.0003028980504654624, 'samples': 12663744, 'steps': 65956, 'loss/train': 1.0922099351882935} 11/07/2021 06:31:08 - INFO - __main__ - Step 65958: {'lr': 0.00030289286386428645, 'samples': 12663936, 'steps': 65957, 'loss/train': 1.2588309049606323} 11/07/2021 06:31:09 - INFO - __main__ - Step 65959: {'lr': 0.0003028876772392778, 'samples': 12664128, 'steps': 65958, 'loss/train': 1.2955604791641235} 11/07/2021 06:31:09 - INFO - __main__ - Step 65960: {'lr': 0.00030288249059043875, 'samples': 12664320, 'steps': 65959, 'loss/train': 1.2774229049682617} 11/07/2021 06:31:10 - INFO - __main__ - Step 65961: {'lr': 0.0003028773039177717, 'samples': 12664512, 'steps': 65960, 'loss/train': 1.0228570699691772} 11/07/2021 06:31:10 - INFO - __main__ - Step 65962: {'lr': 0.00030287211722127894, 'samples': 12664704, 'steps': 65961, 'loss/train': 1.3336848020553589} 11/07/2021 06:31:10 - INFO - __main__ - Step 65963: {'lr': 0.0003028669305009628, 'samples': 12664896, 'steps': 65962, 'loss/train': 1.4917720556259155} 11/07/2021 06:31:11 - INFO - __main__ - Step 65964: {'lr': 0.0003028617437568257, 'samples': 12665088, 'steps': 65963, 'loss/train': 1.410605549812317} 11/07/2021 06:31:12 - INFO - __main__ - Step 65965: {'lr': 0.0003028565569888699, 'samples': 12665280, 'steps': 65964, 'loss/train': 1.4664692878723145} 11/07/2021 06:31:12 - INFO - __main__ - Step 65966: {'lr': 0.00030285137019709767, 'samples': 12665472, 'steps': 65965, 'loss/train': 1.5625295639038086} 11/07/2021 06:31:12 - INFO - __main__ - Step 65967: {'lr': 0.0003028461833815115, 'samples': 12665664, 'steps': 65966, 'loss/train': 1.1447852849960327} 11/07/2021 06:31:13 - INFO - __main__ - Step 65968: {'lr': 0.00030284099654211366, 'samples': 12665856, 'steps': 65967, 'loss/train': 1.5597010850906372} 11/07/2021 06:31:14 - INFO - __main__ - Step 65969: {'lr': 0.00030283580967890644, 'samples': 12666048, 'steps': 65968, 'loss/train': 1.2995058298110962} 11/07/2021 06:31:14 - INFO - __main__ - Step 65970: {'lr': 0.0003028306227918922, 'samples': 12666240, 'steps': 65969, 'loss/train': 1.3013124465942383} 11/07/2021 06:31:14 - INFO - __main__ - Step 65971: {'lr': 0.00030282543588107337, 'samples': 12666432, 'steps': 65970, 'loss/train': 1.2262954711914062} 11/07/2021 06:31:15 - INFO - __main__ - Step 65972: {'lr': 0.00030282024894645213, 'samples': 12666624, 'steps': 65971, 'loss/train': 1.6392652988433838} 11/07/2021 06:31:15 - INFO - __main__ - Step 65973: {'lr': 0.000302815061988031, 'samples': 12666816, 'steps': 65972, 'loss/train': 1.5046273469924927} 11/07/2021 06:31:17 - INFO - __main__ - Step 65974: {'lr': 0.00030280987500581213, 'samples': 12667008, 'steps': 65973, 'loss/train': 2.0618746280670166} 11/07/2021 06:31:17 - INFO - __main__ - Step 65975: {'lr': 0.000302804687999798, 'samples': 12667200, 'steps': 65974, 'loss/train': 1.0882097482681274} 11/07/2021 06:31:17 - INFO - __main__ - Step 65976: {'lr': 0.00030279950096999094, 'samples': 12667392, 'steps': 65975, 'loss/train': 1.1423341035842896} 11/07/2021 06:31:18 - INFO - __main__ - Step 65977: {'lr': 0.0003027943139163931, 'samples': 12667584, 'steps': 65976, 'loss/train': 1.32948899269104} 11/07/2021 06:31:18 - INFO - __main__ - Step 65978: {'lr': 0.00030278912683900705, 'samples': 12667776, 'steps': 65977, 'loss/train': 1.4452431201934814} 11/07/2021 06:31:19 - INFO - __main__ - Step 65979: {'lr': 0.000302783939737835, 'samples': 12667968, 'steps': 65978, 'loss/train': 1.4601083993911743} 11/07/2021 06:31:19 - INFO - __main__ - Step 65980: {'lr': 0.0003027787526128794, 'samples': 12668160, 'steps': 65979, 'loss/train': 1.1131024360656738} 11/07/2021 06:31:20 - INFO - __main__ - Step 65981: {'lr': 0.0003027735654641424, 'samples': 12668352, 'steps': 65980, 'loss/train': 1.2655224800109863} 11/07/2021 06:31:20 - INFO - __main__ - Step 65982: {'lr': 0.0003027683782916265, 'samples': 12668544, 'steps': 65981, 'loss/train': 1.382132649421692} 11/07/2021 06:31:20 - INFO - __main__ - Step 65983: {'lr': 0.000302763191095334, 'samples': 12668736, 'steps': 65982, 'loss/train': 1.4616587162017822} 11/07/2021 06:31:21 - INFO - __main__ - Step 65984: {'lr': 0.0003027580038752672, 'samples': 12668928, 'steps': 65983, 'loss/train': 1.4586434364318848} 11/07/2021 06:31:22 - INFO - __main__ - Step 65985: {'lr': 0.00030275281663142843, 'samples': 12669120, 'steps': 65984, 'loss/train': 1.4357807636260986} 11/07/2021 06:31:22 - INFO - __main__ - Step 65986: {'lr': 0.00030274762936382003, 'samples': 12669312, 'steps': 65985, 'loss/train': 1.5252043008804321} 11/07/2021 06:31:23 - INFO - __main__ - Step 65987: {'lr': 0.00030274244207244446, 'samples': 12669504, 'steps': 65986, 'loss/train': 1.7549092769622803} 11/07/2021 06:31:23 - INFO - __main__ - Step 65988: {'lr': 0.00030273725475730393, 'samples': 12669696, 'steps': 65987, 'loss/train': 1.9255229234695435} 11/07/2021 06:31:24 - INFO - __main__ - Step 65989: {'lr': 0.00030273206741840083, 'samples': 12669888, 'steps': 65988, 'loss/train': 1.7338685989379883} 11/07/2021 06:31:24 - INFO - __main__ - Step 65990: {'lr': 0.0003027268800557374, 'samples': 12670080, 'steps': 65989, 'loss/train': 1.6383705139160156} 11/07/2021 06:31:25 - INFO - __main__ - Step 65991: {'lr': 0.00030272169266931605, 'samples': 12670272, 'steps': 65990, 'loss/train': 1.6055433750152588} 11/07/2021 06:31:25 - INFO - __main__ - Step 65992: {'lr': 0.0003027165052591391, 'samples': 12670464, 'steps': 65991, 'loss/train': 1.3893085718154907} 11/07/2021 06:31:25 - INFO - __main__ - Step 65993: {'lr': 0.000302711317825209, 'samples': 12670656, 'steps': 65992, 'loss/train': 1.0460020303726196} 11/07/2021 06:31:26 - INFO - __main__ - Step 65994: {'lr': 0.00030270613036752794, 'samples': 12670848, 'steps': 65993, 'loss/train': 1.5399891138076782} 11/07/2021 06:31:27 - INFO - __main__ - Step 65995: {'lr': 0.0003027009428860984, 'samples': 12671040, 'steps': 65994, 'loss/train': 1.8592840433120728} 11/07/2021 06:31:27 - INFO - __main__ - Step 65996: {'lr': 0.00030269575538092254, 'samples': 12671232, 'steps': 65995, 'loss/train': 1.264800786972046} 11/07/2021 06:31:27 - INFO - __main__ - Step 65997: {'lr': 0.00030269056785200277, 'samples': 12671424, 'steps': 65996, 'loss/train': 1.2388521432876587} 11/07/2021 06:31:28 - INFO - __main__ - Step 65998: {'lr': 0.00030268538029934146, 'samples': 12671616, 'steps': 65997, 'loss/train': 1.4318079948425293} 11/07/2021 06:31:28 - INFO - __main__ - Step 65999: {'lr': 0.0003026801927229409, 'samples': 12671808, 'steps': 65998, 'loss/train': 1.5289021730422974} 11/07/2021 06:31:29 - INFO - __main__ - Step 66000: {'lr': 0.0003026750051228035, 'samples': 12672000, 'steps': 65999, 'loss/train': 1.166120171546936} 11/07/2021 06:31:29 - INFO - __main__ - Step 66001: {'lr': 0.0003026698174989316, 'samples': 12672192, 'steps': 66000, 'loss/train': 1.311511516571045} 11/07/2021 06:31:30 - INFO - __main__ - Step 66002: {'lr': 0.0003026646298513274, 'samples': 12672384, 'steps': 66001, 'loss/train': 1.3958816528320312} 11/07/2021 06:31:30 - INFO - __main__ - Step 66003: {'lr': 0.0003026594421799934, 'samples': 12672576, 'steps': 66002, 'loss/train': 1.4459030628204346} 11/07/2021 06:31:30 - INFO - __main__ - Step 66004: {'lr': 0.00030265425448493185, 'samples': 12672768, 'steps': 66003, 'loss/train': 1.7882519960403442} 11/07/2021 06:31:31 - INFO - __main__ - Step 66005: {'lr': 0.0003026490667661451, 'samples': 12672960, 'steps': 66004, 'loss/train': 1.314634919166565} 11/07/2021 06:31:32 - INFO - __main__ - Step 66006: {'lr': 0.0003026438790236355, 'samples': 12673152, 'steps': 66005, 'loss/train': 1.435366153717041} 11/07/2021 06:31:32 - INFO - __main__ - Step 66007: {'lr': 0.0003026386912574054, 'samples': 12673344, 'steps': 66006, 'loss/train': 1.0256205797195435} 11/07/2021 06:31:32 - INFO - __main__ - Step 66008: {'lr': 0.0003026335034674571, 'samples': 12673536, 'steps': 66007, 'loss/train': 1.425282597541809} 11/07/2021 06:31:33 - INFO - __main__ - Step 66009: {'lr': 0.0003026283156537929, 'samples': 12673728, 'steps': 66008, 'loss/train': 0.8238840103149414} 11/07/2021 06:31:34 - INFO - __main__ - Step 66010: {'lr': 0.00030262312781641524, 'samples': 12673920, 'steps': 66009, 'loss/train': 1.3888885974884033} 11/07/2021 06:31:34 - INFO - __main__ - Step 66011: {'lr': 0.0003026179399553264, 'samples': 12674112, 'steps': 66010, 'loss/train': 1.053246259689331} 11/07/2021 06:31:35 - INFO - __main__ - Step 66012: {'lr': 0.0003026127520705288, 'samples': 12674304, 'steps': 66011, 'loss/train': 1.2123934030532837} 11/07/2021 06:31:35 - INFO - __main__ - Step 66013: {'lr': 0.00030260756416202464, 'samples': 12674496, 'steps': 66012, 'loss/train': 1.7311400175094604} 11/07/2021 06:31:35 - INFO - __main__ - Step 66014: {'lr': 0.0003026023762298163, 'samples': 12674688, 'steps': 66013, 'loss/train': 1.5190168619155884} 11/07/2021 06:31:36 - INFO - __main__ - Step 66015: {'lr': 0.00030259718827390617, 'samples': 12674880, 'steps': 66014, 'loss/train': 1.4490175247192383} 11/07/2021 06:31:37 - INFO - __main__ - Step 66016: {'lr': 0.00030259200029429656, 'samples': 12675072, 'steps': 66015, 'loss/train': 1.7390228509902954} 11/07/2021 06:31:37 - INFO - __main__ - Step 66017: {'lr': 0.00030258681229098977, 'samples': 12675264, 'steps': 66016, 'loss/train': 1.5305286645889282} 11/07/2021 06:31:37 - INFO - __main__ - Step 66018: {'lr': 0.0003025816242639883, 'samples': 12675456, 'steps': 66017, 'loss/train': 1.3228628635406494} 11/07/2021 06:31:38 - INFO - __main__ - Step 66019: {'lr': 0.0003025764362132942, 'samples': 12675648, 'steps': 66018, 'loss/train': 0.9877188205718994} 11/07/2021 06:31:39 - INFO - __main__ - Step 66020: {'lr': 0.0003025712481389101, 'samples': 12675840, 'steps': 66019, 'loss/train': 1.2110549211502075} 11/07/2021 06:31:40 - INFO - __main__ - Step 66021: {'lr': 0.00030256606004083807, 'samples': 12676032, 'steps': 66020, 'loss/train': 1.4498167037963867} 11/07/2021 06:31:40 - INFO - __main__ - Step 66022: {'lr': 0.00030256087191908067, 'samples': 12676224, 'steps': 66021, 'loss/train': 0.7578029036521912} 11/07/2021 06:31:40 - INFO - __main__ - Step 66023: {'lr': 0.00030255568377364017, 'samples': 12676416, 'steps': 66022, 'loss/train': 1.762236475944519} 11/07/2021 06:31:41 - INFO - __main__ - Step 66024: {'lr': 0.00030255049560451886, 'samples': 12676608, 'steps': 66023, 'loss/train': 1.7195353507995605} 11/07/2021 06:31:41 - INFO - __main__ - Step 66025: {'lr': 0.00030254530741171917, 'samples': 12676800, 'steps': 66024, 'loss/train': 1.7410240173339844} 11/07/2021 06:31:42 - INFO - __main__ - Step 66026: {'lr': 0.00030254011919524326, 'samples': 12676992, 'steps': 66025, 'loss/train': 1.6930688619613647} 11/07/2021 06:31:43 - INFO - __main__ - Step 66027: {'lr': 0.00030253493095509364, 'samples': 12677184, 'steps': 66026, 'loss/train': 1.3641654253005981} 11/07/2021 06:31:43 - INFO - __main__ - Step 66028: {'lr': 0.0003025297426912726, 'samples': 12677376, 'steps': 66027, 'loss/train': 1.2875826358795166} 11/07/2021 06:31:44 - INFO - __main__ - Step 66029: {'lr': 0.00030252455440378246, 'samples': 12677568, 'steps': 66028, 'loss/train': 0.8563156723976135} 11/07/2021 06:31:44 - INFO - __main__ - Step 66030: {'lr': 0.0003025193660926255, 'samples': 12677760, 'steps': 66029, 'loss/train': 1.6033872365951538} 11/07/2021 06:31:44 - INFO - __main__ - Step 66031: {'lr': 0.0003025141777578043, 'samples': 12677952, 'steps': 66030, 'loss/train': 1.4264047145843506} 11/07/2021 06:31:45 - INFO - __main__ - Step 66032: {'lr': 0.0003025089893993209, 'samples': 12678144, 'steps': 66031, 'loss/train': 1.5003989934921265} 11/07/2021 06:31:46 - INFO - __main__ - Step 66033: {'lr': 0.00030250380101717775, 'samples': 12678336, 'steps': 66032, 'loss/train': 1.1638858318328857} 11/07/2021 06:31:46 - INFO - __main__ - Step 66034: {'lr': 0.00030249861261137716, 'samples': 12678528, 'steps': 66033, 'loss/train': 1.9387015104293823} 11/07/2021 06:31:46 - INFO - __main__ - Step 66035: {'lr': 0.00030249342418192155, 'samples': 12678720, 'steps': 66034, 'loss/train': 0.7397319078445435} 11/07/2021 06:31:47 - INFO - __main__ - Step 66036: {'lr': 0.00030248823572881327, 'samples': 12678912, 'steps': 66035, 'loss/train': 1.3758482933044434} 11/07/2021 06:31:47 - INFO - __main__ - Step 66037: {'lr': 0.0003024830472520546, 'samples': 12679104, 'steps': 66036, 'loss/train': 1.382337212562561} 11/07/2021 06:31:48 - INFO - __main__ - Step 66038: {'lr': 0.0003024778587516478, 'samples': 12679296, 'steps': 66037, 'loss/train': 1.8034965991973877} 11/07/2021 06:31:48 - INFO - __main__ - Step 66039: {'lr': 0.0003024726702275953, 'samples': 12679488, 'steps': 66038, 'loss/train': 1.1101720333099365} 11/07/2021 06:31:49 - INFO - __main__ - Step 66040: {'lr': 0.0003024674816798995, 'samples': 12679680, 'steps': 66039, 'loss/train': 1.0828388929367065} 11/07/2021 06:31:49 - INFO - __main__ - Step 66041: {'lr': 0.0003024622931085626, 'samples': 12679872, 'steps': 66040, 'loss/train': 0.6200581192970276} 11/07/2021 06:31:50 - INFO - __main__ - Step 66042: {'lr': 0.000302457104513587, 'samples': 12680064, 'steps': 66041, 'loss/train': 1.2737438678741455} 11/07/2021 06:31:51 - INFO - __main__ - Step 66043: {'lr': 0.000302451915894975, 'samples': 12680256, 'steps': 66042, 'loss/train': 1.6150233745574951} 11/07/2021 06:31:51 - INFO - __main__ - Step 66044: {'lr': 0.00030244672725272906, 'samples': 12680448, 'steps': 66043, 'loss/train': 1.4744157791137695} 11/07/2021 06:31:51 - INFO - __main__ - Step 66045: {'lr': 0.00030244153858685136, 'samples': 12680640, 'steps': 66044, 'loss/train': 1.1516748666763306} 11/07/2021 06:31:52 - INFO - __main__ - Step 66046: {'lr': 0.0003024363498973444, 'samples': 12680832, 'steps': 66045, 'loss/train': 1.084799885749817} 11/07/2021 06:31:52 - INFO - __main__ - Step 66047: {'lr': 0.0003024311611842103, 'samples': 12681024, 'steps': 66046, 'loss/train': 0.8824546933174133} 11/07/2021 06:31:53 - INFO - __main__ - Step 66048: {'lr': 0.0003024259724474516, 'samples': 12681216, 'steps': 66047, 'loss/train': 1.2395120859146118} 11/07/2021 06:31:53 - INFO - __main__ - Step 66049: {'lr': 0.0003024207836870706, 'samples': 12681408, 'steps': 66048, 'loss/train': 1.58646821975708} 11/07/2021 06:31:54 - INFO - __main__ - Step 66050: {'lr': 0.00030241559490306957, 'samples': 12681600, 'steps': 66049, 'loss/train': 1.4991322755813599} 11/07/2021 06:31:54 - INFO - __main__ - Step 66051: {'lr': 0.0003024104060954509, 'samples': 12681792, 'steps': 66050, 'loss/train': 1.199262261390686} 11/07/2021 06:31:54 - INFO - __main__ - Step 66052: {'lr': 0.0003024052172642169, 'samples': 12681984, 'steps': 66051, 'loss/train': 0.849306583404541} 11/07/2021 06:31:55 - INFO - __main__ - Step 66053: {'lr': 0.00030240002840936994, 'samples': 12682176, 'steps': 66052, 'loss/train': 1.8362423181533813} 11/07/2021 06:31:56 - INFO - __main__ - Step 66054: {'lr': 0.0003023948395309123, 'samples': 12682368, 'steps': 66053, 'loss/train': 1.1649852991104126} 11/07/2021 06:31:56 - INFO - __main__ - Step 66055: {'lr': 0.00030238965062884634, 'samples': 12682560, 'steps': 66054, 'loss/train': 1.3500866889953613} 11/07/2021 06:31:56 - INFO - __main__ - Step 66056: {'lr': 0.00030238446170317444, 'samples': 12682752, 'steps': 66055, 'loss/train': 1.2988277673721313} 11/07/2021 06:31:57 - INFO - __main__ - Step 66057: {'lr': 0.0003023792727538989, 'samples': 12682944, 'steps': 66056, 'loss/train': 1.4603646993637085} 11/07/2021 06:31:58 - INFO - __main__ - Step 66058: {'lr': 0.0003023740837810221, 'samples': 12683136, 'steps': 66057, 'loss/train': 2.0789899826049805} 11/07/2021 06:31:58 - INFO - __main__ - Step 66059: {'lr': 0.00030236889478454633, 'samples': 12683328, 'steps': 66058, 'loss/train': 1.6552274227142334} 11/07/2021 06:31:58 - INFO - __main__ - Step 66060: {'lr': 0.0003023637057644739, 'samples': 12683520, 'steps': 66059, 'loss/train': 1.4530048370361328} 11/07/2021 06:31:59 - INFO - __main__ - Step 66061: {'lr': 0.0003023585167208072, 'samples': 12683712, 'steps': 66060, 'loss/train': 1.7590138912200928} 11/07/2021 06:31:59 - INFO - __main__ - Step 66062: {'lr': 0.0003023533276535486, 'samples': 12683904, 'steps': 66061, 'loss/train': 1.291744351387024} 11/07/2021 06:32:00 - INFO - __main__ - Step 66063: {'lr': 0.00030234813856270046, 'samples': 12684096, 'steps': 66062, 'loss/train': 1.3551322221755981} 11/07/2021 06:32:00 - INFO - __main__ - Step 66064: {'lr': 0.0003023429494482649, 'samples': 12684288, 'steps': 66063, 'loss/train': 1.162684440612793} 11/07/2021 06:32:01 - INFO - __main__ - Step 66065: {'lr': 0.0003023377603102445, 'samples': 12684480, 'steps': 66064, 'loss/train': 1.6207313537597656} 11/07/2021 06:32:01 - INFO - __main__ - Step 66066: {'lr': 0.00030233257114864156, 'samples': 12684672, 'steps': 66065, 'loss/train': 1.2845720052719116} 11/07/2021 06:32:02 - INFO - __main__ - Step 66067: {'lr': 0.0003023273819634583, 'samples': 12684864, 'steps': 66066, 'loss/train': 1.3460216522216797} 11/07/2021 06:32:03 - INFO - __main__ - Step 66068: {'lr': 0.00030232219275469713, 'samples': 12685056, 'steps': 66067, 'loss/train': 0.7987724542617798} 11/07/2021 06:32:03 - INFO - __main__ - Step 66069: {'lr': 0.00030231700352236044, 'samples': 12685248, 'steps': 66068, 'loss/train': 1.530179738998413} 11/07/2021 06:32:03 - INFO - __main__ - Step 66070: {'lr': 0.0003023118142664505, 'samples': 12685440, 'steps': 66069, 'loss/train': 1.1934905052185059} 11/07/2021 06:32:04 - INFO - __main__ - Step 66071: {'lr': 0.0003023066249869696, 'samples': 12685632, 'steps': 66070, 'loss/train': 1.4146476984024048} 11/07/2021 06:32:04 - INFO - __main__ - Step 66072: {'lr': 0.0003023014356839202, 'samples': 12685824, 'steps': 66071, 'loss/train': 1.515790581703186} 11/07/2021 06:32:05 - INFO - __main__ - Step 66073: {'lr': 0.0003022962463573046, 'samples': 12686016, 'steps': 66072, 'loss/train': 0.773702085018158} 11/07/2021 06:32:05 - INFO - __main__ - Step 66074: {'lr': 0.000302291057007125, 'samples': 12686208, 'steps': 66073, 'loss/train': 1.5538198947906494} 11/07/2021 06:32:06 - INFO - __main__ - Step 66075: {'lr': 0.00030228586763338393, 'samples': 12686400, 'steps': 66074, 'loss/train': 0.9768549799919128} 11/07/2021 06:32:06 - INFO - __main__ - Step 66076: {'lr': 0.00030228067823608376, 'samples': 12686592, 'steps': 66075, 'loss/train': 1.4014859199523926} 11/07/2021 06:32:06 - INFO - __main__ - Step 66077: {'lr': 0.0003022754888152266, 'samples': 12686784, 'steps': 66076, 'loss/train': 1.204777479171753} 11/07/2021 06:32:08 - INFO - __main__ - Step 66078: {'lr': 0.00030227029937081497, 'samples': 12686976, 'steps': 66077, 'loss/train': 1.4185173511505127} 11/07/2021 06:32:08 - INFO - __main__ - Step 66079: {'lr': 0.00030226510990285105, 'samples': 12687168, 'steps': 66078, 'loss/train': 1.0866520404815674} 11/07/2021 06:32:08 - INFO - __main__ - Step 66080: {'lr': 0.00030225992041133735, 'samples': 12687360, 'steps': 66079, 'loss/train': 1.3827886581420898} 11/07/2021 06:32:09 - INFO - __main__ - Step 66081: {'lr': 0.00030225473089627613, 'samples': 12687552, 'steps': 66080, 'loss/train': 1.31856369972229} 11/07/2021 06:32:09 - INFO - __main__ - Step 66082: {'lr': 0.0003022495413576697, 'samples': 12687744, 'steps': 66081, 'loss/train': 1.470003604888916} 11/07/2021 06:32:10 - INFO - __main__ - Step 66083: {'lr': 0.00030224435179552057, 'samples': 12687936, 'steps': 66082, 'loss/train': 1.395112156867981} 11/07/2021 06:32:11 - INFO - __main__ - Step 66084: {'lr': 0.00030223916220983084, 'samples': 12688128, 'steps': 66083, 'loss/train': 1.295493483543396} 11/07/2021 06:32:11 - INFO - __main__ - Step 66085: {'lr': 0.0003022339726006029, 'samples': 12688320, 'steps': 66084, 'loss/train': 1.4121266603469849} 11/07/2021 06:32:11 - INFO - __main__ - Step 66086: {'lr': 0.00030222878296783925, 'samples': 12688512, 'steps': 66085, 'loss/train': 1.1479909420013428} 11/07/2021 06:32:12 - INFO - __main__ - Step 66087: {'lr': 0.00030222359331154205, 'samples': 12688704, 'steps': 66086, 'loss/train': 1.4092377424240112} 11/07/2021 06:32:13 - INFO - __main__ - Step 66088: {'lr': 0.0003022184036317137, 'samples': 12688896, 'steps': 66087, 'loss/train': 0.08901222795248032} 11/07/2021 06:32:13 - INFO - __main__ - Step 66089: {'lr': 0.0003022132139283566, 'samples': 12689088, 'steps': 66088, 'loss/train': 1.5572547912597656} 11/07/2021 06:32:13 - INFO - __main__ - Step 66090: {'lr': 0.00030220802420147296, 'samples': 12689280, 'steps': 66089, 'loss/train': 1.2584506273269653} 11/07/2021 06:32:14 - INFO - __main__ - Step 66091: {'lr': 0.0003022028344510652, 'samples': 12689472, 'steps': 66090, 'loss/train': 1.5193743705749512} 11/07/2021 06:32:14 - INFO - __main__ - Step 66092: {'lr': 0.00030219764467713566, 'samples': 12689664, 'steps': 66091, 'loss/train': 1.4935814142227173} 11/07/2021 06:32:15 - INFO - __main__ - Step 66093: {'lr': 0.00030219245487968666, 'samples': 12689856, 'steps': 66092, 'loss/train': 1.5579750537872314} 11/07/2021 06:32:16 - INFO - __main__ - Step 66094: {'lr': 0.00030218726505872056, 'samples': 12690048, 'steps': 66093, 'loss/train': 1.4117143154144287} 11/07/2021 06:32:16 - INFO - __main__ - Step 66095: {'lr': 0.0003021820752142397, 'samples': 12690240, 'steps': 66094, 'loss/train': 1.2228713035583496} 11/07/2021 06:32:16 - INFO - __main__ - Step 66096: {'lr': 0.00030217688534624643, 'samples': 12690432, 'steps': 66095, 'loss/train': 1.3296047449111938} 11/07/2021 06:32:17 - INFO - __main__ - Step 66097: {'lr': 0.000302171695454743, 'samples': 12690624, 'steps': 66096, 'loss/train': 1.5043150186538696} 11/07/2021 06:32:17 - INFO - __main__ - Step 66098: {'lr': 0.0003021665055397318, 'samples': 12690816, 'steps': 66097, 'loss/train': 1.6270967721939087} 11/07/2021 06:32:18 - INFO - __main__ - Step 66099: {'lr': 0.0003021613156012152, 'samples': 12691008, 'steps': 66098, 'loss/train': 0.37339168787002563} 11/07/2021 06:32:18 - INFO - __main__ - Step 66100: {'lr': 0.00030215612563919554, 'samples': 12691200, 'steps': 66099, 'loss/train': 1.4297350645065308} 11/07/2021 06:32:19 - INFO - __main__ - Step 66101: {'lr': 0.0003021509356536751, 'samples': 12691392, 'steps': 66100, 'loss/train': 1.349469542503357} 11/07/2021 06:32:19 - INFO - __main__ - Step 66102: {'lr': 0.00030214574564465624, 'samples': 12691584, 'steps': 66101, 'loss/train': 1.4836211204528809} 11/07/2021 06:32:20 - INFO - __main__ - Step 66103: {'lr': 0.00030214055561214137, 'samples': 12691776, 'steps': 66102, 'loss/train': 1.4207823276519775} 11/07/2021 06:32:21 - INFO - __main__ - Step 66104: {'lr': 0.00030213536555613276, 'samples': 12691968, 'steps': 66103, 'loss/train': 1.4728041887283325} 11/07/2021 06:32:21 - INFO - __main__ - Step 66105: {'lr': 0.0003021301754766327, 'samples': 12692160, 'steps': 66104, 'loss/train': 5.477502346038818} 11/07/2021 06:32:21 - INFO - __main__ - Step 66106: {'lr': 0.00030212498537364365, 'samples': 12692352, 'steps': 66105, 'loss/train': 1.210676670074463} 11/07/2021 06:32:22 - INFO - __main__ - Step 66107: {'lr': 0.0003021197952471678, 'samples': 12692544, 'steps': 66106, 'loss/train': 1.3684641122817993} 11/07/2021 06:32:22 - INFO - __main__ - Step 66108: {'lr': 0.00030211460509720767, 'samples': 12692736, 'steps': 66107, 'loss/train': 1.4260414838790894} 11/07/2021 06:32:23 - INFO - __main__ - Step 66109: {'lr': 0.00030210941492376543, 'samples': 12692928, 'steps': 66108, 'loss/train': 1.3657807111740112} 11/07/2021 06:32:23 - INFO - __main__ - Step 66110: {'lr': 0.00030210422472684356, 'samples': 12693120, 'steps': 66109, 'loss/train': 1.3700734376907349} 11/07/2021 06:32:24 - INFO - __main__ - Step 66111: {'lr': 0.0003020990345064443, 'samples': 12693312, 'steps': 66110, 'loss/train': 1.0680466890335083} 11/07/2021 06:32:24 - INFO - __main__ - Step 66112: {'lr': 0.00030209384426257003, 'samples': 12693504, 'steps': 66111, 'loss/train': 1.4327889680862427} 11/07/2021 06:32:24 - INFO - __main__ - Step 66113: {'lr': 0.00030208865399522305, 'samples': 12693696, 'steps': 66112, 'loss/train': 1.4351911544799805} 11/07/2021 06:32:25 - INFO - __main__ - Step 66114: {'lr': 0.0003020834637044057, 'samples': 12693888, 'steps': 66113, 'loss/train': 1.608832597732544} 11/07/2021 06:32:26 - INFO - __main__ - Step 66115: {'lr': 0.0003020782733901204, 'samples': 12694080, 'steps': 66114, 'loss/train': 1.1910721063613892} 11/07/2021 06:32:26 - INFO - __main__ - Step 66116: {'lr': 0.0003020730830523695, 'samples': 12694272, 'steps': 66115, 'loss/train': 1.4546208381652832} 11/07/2021 06:32:26 - INFO - __main__ - Step 66117: {'lr': 0.00030206789269115515, 'samples': 12694464, 'steps': 66116, 'loss/train': 1.4267754554748535} 11/07/2021 06:32:27 - INFO - __main__ - Step 66118: {'lr': 0.00030206270230647987, 'samples': 12694656, 'steps': 66117, 'loss/train': 1.1376086473464966} 11/07/2021 06:32:27 - INFO - __main__ - Step 66119: {'lr': 0.0003020575118983459, 'samples': 12694848, 'steps': 66118, 'loss/train': 1.2418519258499146} 11/07/2021 06:32:28 - INFO - __main__ - Step 66120: {'lr': 0.00030205232146675564, 'samples': 12695040, 'steps': 66119, 'loss/train': 1.818841576576233} 11/07/2021 06:32:29 - INFO - __main__ - Step 66121: {'lr': 0.0003020471310117114, 'samples': 12695232, 'steps': 66120, 'loss/train': 1.4295326471328735} 11/07/2021 06:32:29 - INFO - __main__ - Step 66122: {'lr': 0.00030204194053321556, 'samples': 12695424, 'steps': 66121, 'loss/train': 0.9624068737030029} 11/07/2021 06:32:29 - INFO - __main__ - Step 66123: {'lr': 0.00030203675003127043, 'samples': 12695616, 'steps': 66122, 'loss/train': 1.850511908531189} 11/07/2021 06:32:30 - INFO - __main__ - Step 66124: {'lr': 0.0003020315595058783, 'samples': 12695808, 'steps': 66123, 'loss/train': 1.5325511693954468} 11/07/2021 06:32:31 - INFO - __main__ - Step 66125: {'lr': 0.00030202636895704157, 'samples': 12696000, 'steps': 66124, 'loss/train': 1.5649058818817139} 11/07/2021 06:32:31 - INFO - __main__ - Step 66126: {'lr': 0.0003020211783847625, 'samples': 12696192, 'steps': 66125, 'loss/train': 1.1009212732315063} 11/07/2021 06:32:31 - INFO - __main__ - Step 66127: {'lr': 0.00030201598778904353, 'samples': 12696384, 'steps': 66126, 'loss/train': 1.536102294921875} 11/07/2021 06:32:32 - INFO - __main__ - Step 66128: {'lr': 0.000302010797169887, 'samples': 12696576, 'steps': 66127, 'loss/train': 1.2469087839126587} 11/07/2021 06:32:32 - INFO - __main__ - Step 66129: {'lr': 0.0003020056065272951, 'samples': 12696768, 'steps': 66128, 'loss/train': 1.5016626119613647} 11/07/2021 06:32:33 - INFO - __main__ - Step 66130: {'lr': 0.00030200041586127046, 'samples': 12696960, 'steps': 66129, 'loss/train': 0.9326152801513672} 11/07/2021 06:32:33 - INFO - __main__ - Step 66131: {'lr': 0.0003019952251718151, 'samples': 12697152, 'steps': 66130, 'loss/train': 0.9984504580497742} 11/07/2021 06:32:34 - INFO - __main__ - Step 66132: {'lr': 0.0003019900344589315, 'samples': 12697344, 'steps': 66131, 'loss/train': 1.6687335968017578} 11/07/2021 06:32:34 - INFO - __main__ - Step 66133: {'lr': 0.0003019848437226221, 'samples': 12697536, 'steps': 66132, 'loss/train': 1.5387459993362427} 11/07/2021 06:32:34 - INFO - __main__ - Step 66134: {'lr': 0.00030197965296288896, 'samples': 12697728, 'steps': 66133, 'loss/train': 1.5455348491668701} 11/07/2021 06:32:36 - INFO - __main__ - Step 66135: {'lr': 0.00030197446217973474, 'samples': 12697920, 'steps': 66134, 'loss/train': 0.8604190945625305} 11/07/2021 06:32:36 - INFO - __main__ - Step 66136: {'lr': 0.0003019692713731616, 'samples': 12698112, 'steps': 66135, 'loss/train': 1.994323968887329} 11/07/2021 06:32:36 - INFO - __main__ - Step 66137: {'lr': 0.00030196408054317185, 'samples': 12698304, 'steps': 66136, 'loss/train': 1.596811294555664} 11/07/2021 06:32:37 - INFO - __main__ - Step 66138: {'lr': 0.00030195888968976794, 'samples': 12698496, 'steps': 66137, 'loss/train': 1.462005376815796} 11/07/2021 06:32:37 - INFO - __main__ - Step 66139: {'lr': 0.0003019536988129521, 'samples': 12698688, 'steps': 66138, 'loss/train': 1.1523295640945435} 11/07/2021 06:32:38 - INFO - __main__ - Step 66140: {'lr': 0.00030194850791272676, 'samples': 12698880, 'steps': 66139, 'loss/train': 0.3556951582431793} 11/07/2021 06:32:38 - INFO - __main__ - Step 66141: {'lr': 0.00030194331698909425, 'samples': 12699072, 'steps': 66140, 'loss/train': 1.4058557748794556} 11/07/2021 06:32:39 - INFO - __main__ - Step 66142: {'lr': 0.00030193812604205686, 'samples': 12699264, 'steps': 66141, 'loss/train': 1.267403483390808} 11/07/2021 06:32:39 - INFO - __main__ - Step 66143: {'lr': 0.00030193293507161696, 'samples': 12699456, 'steps': 66142, 'loss/train': 1.125701904296875} 11/07/2021 06:32:39 - INFO - __main__ - Step 66144: {'lr': 0.00030192774407777683, 'samples': 12699648, 'steps': 66143, 'loss/train': 1.3942259550094604} 11/07/2021 06:32:40 - INFO - __main__ - Step 66145: {'lr': 0.0003019225530605389, 'samples': 12699840, 'steps': 66144, 'loss/train': 0.8815711736679077} 11/07/2021 06:32:41 - INFO - __main__ - Step 66146: {'lr': 0.00030191736201990544, 'samples': 12700032, 'steps': 66145, 'loss/train': 1.3402565717697144} 11/07/2021 06:32:41 - INFO - __main__ - Step 66147: {'lr': 0.0003019121709558789, 'samples': 12700224, 'steps': 66146, 'loss/train': 1.5071115493774414} 11/07/2021 06:32:41 - INFO - __main__ - Step 66148: {'lr': 0.0003019069798684615, 'samples': 12700416, 'steps': 66147, 'loss/train': 1.5298112630844116} 11/07/2021 06:32:42 - INFO - __main__ - Step 66149: {'lr': 0.0003019017887576556, 'samples': 12700608, 'steps': 66148, 'loss/train': 0.7928662896156311} 11/07/2021 06:32:43 - INFO - __main__ - Step 66150: {'lr': 0.0003018965976234635, 'samples': 12700800, 'steps': 66149, 'loss/train': 1.5948011875152588} 11/07/2021 06:32:43 - INFO - __main__ - Step 66151: {'lr': 0.00030189140646588763, 'samples': 12700992, 'steps': 66150, 'loss/train': 0.7972404360771179} 11/07/2021 06:32:44 - INFO - __main__ - Step 66152: {'lr': 0.00030188621528493036, 'samples': 12701184, 'steps': 66151, 'loss/train': 1.2893505096435547} 11/07/2021 06:32:44 - INFO - __main__ - Step 66153: {'lr': 0.0003018810240805939, 'samples': 12701376, 'steps': 66152, 'loss/train': 1.8215950727462769} 11/07/2021 06:32:44 - INFO - __main__ - Step 66154: {'lr': 0.0003018758328528807, 'samples': 12701568, 'steps': 66153, 'loss/train': 1.1627811193466187} 11/07/2021 06:32:45 - INFO - __main__ - Step 66155: {'lr': 0.00030187064160179294, 'samples': 12701760, 'steps': 66154, 'loss/train': 1.520495891571045} 11/07/2021 06:32:46 - INFO - __main__ - Step 66156: {'lr': 0.00030186545032733316, 'samples': 12701952, 'steps': 66155, 'loss/train': 1.5524673461914062} 11/07/2021 06:32:46 - INFO - __main__ - Step 66157: {'lr': 0.0003018602590295036, 'samples': 12702144, 'steps': 66156, 'loss/train': 1.0705009698867798} 11/07/2021 06:32:46 - INFO - __main__ - Step 66158: {'lr': 0.00030185506770830664, 'samples': 12702336, 'steps': 66157, 'loss/train': 1.3895667791366577} 11/07/2021 06:32:47 - INFO - __main__ - Step 66159: {'lr': 0.0003018498763637445, 'samples': 12702528, 'steps': 66158, 'loss/train': 1.0781030654907227} 11/07/2021 06:32:47 - INFO - __main__ - Step 66160: {'lr': 0.0003018446849958196, 'samples': 12702720, 'steps': 66159, 'loss/train': 1.3008923530578613} 11/07/2021 06:32:48 - INFO - __main__ - Step 66161: {'lr': 0.0003018394936045344, 'samples': 12702912, 'steps': 66160, 'loss/train': 1.5877728462219238} 11/07/2021 06:32:48 - INFO - __main__ - Step 66162: {'lr': 0.00030183430218989107, 'samples': 12703104, 'steps': 66161, 'loss/train': 1.581357717514038} 11/07/2021 06:32:49 - INFO - __main__ - Step 66163: {'lr': 0.000301829110751892, 'samples': 12703296, 'steps': 66162, 'loss/train': 1.634331464767456} 11/07/2021 06:32:49 - INFO - __main__ - Step 66164: {'lr': 0.0003018239192905395, 'samples': 12703488, 'steps': 66163, 'loss/train': 1.2112176418304443} 11/07/2021 06:32:49 - INFO - __main__ - Step 66165: {'lr': 0.000301818727805836, 'samples': 12703680, 'steps': 66164, 'loss/train': 1.5666568279266357} 11/07/2021 06:32:50 - INFO - __main__ - Step 66166: {'lr': 0.0003018135362977837, 'samples': 12703872, 'steps': 66165, 'loss/train': 1.4542078971862793} 11/07/2021 06:32:51 - INFO - __main__ - Step 66167: {'lr': 0.00030180834476638507, 'samples': 12704064, 'steps': 66166, 'loss/train': 1.2997652292251587} 11/07/2021 06:32:51 - INFO - __main__ - Step 66168: {'lr': 0.0003018031532116424, 'samples': 12704256, 'steps': 66167, 'loss/train': 1.3479423522949219} 11/07/2021 06:32:51 - INFO - __main__ - Step 66169: {'lr': 0.000301797961633558, 'samples': 12704448, 'steps': 66168, 'loss/train': 1.3177669048309326} 11/07/2021 06:32:52 - INFO - __main__ - Step 66170: {'lr': 0.0003017927700321343, 'samples': 12704640, 'steps': 66169, 'loss/train': 1.5064001083374023} 11/07/2021 06:32:53 - INFO - __main__ - Step 66171: {'lr': 0.0003017875784073735, 'samples': 12704832, 'steps': 66170, 'loss/train': 1.2311620712280273} 11/07/2021 06:32:53 - INFO - __main__ - Step 66172: {'lr': 0.0003017823867592781, 'samples': 12705024, 'steps': 66171, 'loss/train': 0.35690027475357056} 11/07/2021 06:32:54 - INFO - __main__ - Step 66173: {'lr': 0.00030177719508785026, 'samples': 12705216, 'steps': 66172, 'loss/train': 1.7703450918197632} 11/07/2021 06:32:54 - INFO - __main__ - Step 66174: {'lr': 0.0003017720033930925, 'samples': 12705408, 'steps': 66173, 'loss/train': 1.4842348098754883} 11/07/2021 06:32:54 - INFO - __main__ - Step 66175: {'lr': 0.000301766811675007, 'samples': 12705600, 'steps': 66174, 'loss/train': 1.2898615598678589} 11/07/2021 06:32:55 - INFO - __main__ - Step 66176: {'lr': 0.00030176161993359626, 'samples': 12705792, 'steps': 66175, 'loss/train': 1.6308811902999878} 11/07/2021 06:32:56 - INFO - __main__ - Step 66177: {'lr': 0.0003017564281688625, 'samples': 12705984, 'steps': 66176, 'loss/train': 0.7652762532234192} 11/07/2021 06:32:56 - INFO - __main__ - Step 66178: {'lr': 0.0003017512363808081, 'samples': 12706176, 'steps': 66177, 'loss/train': 1.6740591526031494} 11/07/2021 06:32:56 - INFO - __main__ - Step 66179: {'lr': 0.0003017460445694353, 'samples': 12706368, 'steps': 66178, 'loss/train': 1.4080028533935547} 11/07/2021 06:32:57 - INFO - __main__ - Step 66180: {'lr': 0.00030174085273474663, 'samples': 12706560, 'steps': 66179, 'loss/train': 4.601407051086426} 11/07/2021 06:32:57 - INFO - __main__ - Step 66181: {'lr': 0.0003017356608767443, 'samples': 12706752, 'steps': 66180, 'loss/train': 1.2413469552993774} 11/07/2021 06:32:58 - INFO - __main__ - Step 66182: {'lr': 0.00030173046899543065, 'samples': 12706944, 'steps': 66181, 'loss/train': 1.3662645816802979} 11/07/2021 06:32:59 - INFO - __main__ - Step 66183: {'lr': 0.0003017252770908081, 'samples': 12707136, 'steps': 66182, 'loss/train': 1.2059928178787231} 11/07/2021 06:32:59 - INFO - __main__ - Step 66184: {'lr': 0.000301720085162879, 'samples': 12707328, 'steps': 66183, 'loss/train': 1.6556265354156494} 11/07/2021 06:32:59 - INFO - __main__ - Step 66185: {'lr': 0.00030171489321164545, 'samples': 12707520, 'steps': 66184, 'loss/train': 1.4139301776885986} 11/07/2021 06:33:00 - INFO - __main__ - Step 66186: {'lr': 0.00030170970123711004, 'samples': 12707712, 'steps': 66185, 'loss/train': 1.267896056175232} 11/07/2021 06:33:01 - INFO - __main__ - Step 66187: {'lr': 0.0003017045092392751, 'samples': 12707904, 'steps': 66186, 'loss/train': 1.237272024154663} 11/07/2021 06:33:01 - INFO - __main__ - Step 66188: {'lr': 0.00030169931721814287, 'samples': 12708096, 'steps': 66187, 'loss/train': 1.3574597835540771} 11/07/2021 06:33:01 - INFO - __main__ - Step 66189: {'lr': 0.0003016941251737157, 'samples': 12708288, 'steps': 66188, 'loss/train': 1.178155779838562} 11/07/2021 06:33:02 - INFO - __main__ - Step 66190: {'lr': 0.000301688933105996, 'samples': 12708480, 'steps': 66189, 'loss/train': 1.4059526920318604} 11/07/2021 06:33:02 - INFO - __main__ - Step 66191: {'lr': 0.00030168374101498604, 'samples': 12708672, 'steps': 66190, 'loss/train': 1.1377241611480713} 11/07/2021 06:33:03 - INFO - __main__ - Step 66192: {'lr': 0.0003016785489006882, 'samples': 12708864, 'steps': 66191, 'loss/train': 2.012834310531616} 11/07/2021 06:33:04 - INFO - __main__ - Step 66193: {'lr': 0.00030167335676310476, 'samples': 12709056, 'steps': 66192, 'loss/train': 1.8285329341888428} 11/07/2021 06:33:04 - INFO - __main__ - Step 66194: {'lr': 0.0003016681646022381, 'samples': 12709248, 'steps': 66193, 'loss/train': 1.3275129795074463} 11/07/2021 06:33:04 - INFO - __main__ - Step 66195: {'lr': 0.0003016629724180906, 'samples': 12709440, 'steps': 66194, 'loss/train': 1.0555791854858398} 11/07/2021 06:33:05 - INFO - __main__ - Step 66196: {'lr': 0.0003016577802106645, 'samples': 12709632, 'steps': 66195, 'loss/train': 1.147147297859192} 11/07/2021 06:33:06 - INFO - __main__ - Step 66197: {'lr': 0.00030165258797996237, 'samples': 12709824, 'steps': 66196, 'loss/train': 1.5471503734588623} 11/07/2021 06:33:06 - INFO - __main__ - Step 66198: {'lr': 0.00030164739572598626, 'samples': 12710016, 'steps': 66197, 'loss/train': 1.6465815305709839} 11/07/2021 06:33:06 - INFO - __main__ - Step 66199: {'lr': 0.0003016422034487386, 'samples': 12710208, 'steps': 66198, 'loss/train': 1.55559504032135} 11/07/2021 06:33:07 - INFO - __main__ - Step 66200: {'lr': 0.0003016370111482218, 'samples': 12710400, 'steps': 66199, 'loss/train': 0.817954957485199} 11/07/2021 06:33:07 - INFO - __main__ - Step 66201: {'lr': 0.0003016318188244381, 'samples': 12710592, 'steps': 66200, 'loss/train': 2.052607536315918} 11/07/2021 06:33:07 - INFO - __main__ - Step 66202: {'lr': 0.00030162662647738997, 'samples': 12710784, 'steps': 66201, 'loss/train': 0.8889533281326294} 11/07/2021 06:33:08 - INFO - __main__ - Step 66203: {'lr': 0.0003016214341070797, 'samples': 12710976, 'steps': 66202, 'loss/train': 1.4663881063461304} 11/07/2021 06:33:09 - INFO - __main__ - Step 66204: {'lr': 0.0003016162417135096, 'samples': 12711168, 'steps': 66203, 'loss/train': 1.0164583921432495} 11/07/2021 06:33:09 - INFO - __main__ - Step 66205: {'lr': 0.000301611049296682, 'samples': 12711360, 'steps': 66204, 'loss/train': 5.714252471923828} 11/07/2021 06:33:10 - INFO - __main__ - Step 66206: {'lr': 0.0003016058568565993, 'samples': 12711552, 'steps': 66205, 'loss/train': 1.2828541994094849} 11/07/2021 06:33:10 - INFO - __main__ - Step 66207: {'lr': 0.00030160066439326367, 'samples': 12711744, 'steps': 66206, 'loss/train': 1.4414817094802856} 11/07/2021 06:33:11 - INFO - __main__ - Step 66208: {'lr': 0.0003015954719066776, 'samples': 12711936, 'steps': 66207, 'loss/train': 1.4036071300506592} 11/07/2021 06:33:11 - INFO - __main__ - Step 66209: {'lr': 0.00030159027939684346, 'samples': 12712128, 'steps': 66208, 'loss/train': 1.3366219997406006} 11/07/2021 06:33:12 - INFO - __main__ - Step 66210: {'lr': 0.0003015850868637636, 'samples': 12712320, 'steps': 66209, 'loss/train': 1.6157299280166626} 11/07/2021 06:33:12 - INFO - __main__ - Step 66211: {'lr': 0.00030157989430744023, 'samples': 12712512, 'steps': 66210, 'loss/train': 1.5904408693313599} 11/07/2021 06:33:12 - INFO - __main__ - Step 66212: {'lr': 0.0003015747017278757, 'samples': 12712704, 'steps': 66211, 'loss/train': 1.4374349117279053} 11/07/2021 06:33:13 - INFO - __main__ - Step 66213: {'lr': 0.00030156950912507246, 'samples': 12712896, 'steps': 66212, 'loss/train': 1.4889440536499023} 11/07/2021 06:33:14 - INFO - __main__ - Step 66214: {'lr': 0.0003015643164990328, 'samples': 12713088, 'steps': 66213, 'loss/train': 1.2248388528823853} 11/07/2021 06:33:14 - INFO - __main__ - Step 66215: {'lr': 0.0003015591238497591, 'samples': 12713280, 'steps': 66214, 'loss/train': 1.361736536026001} 11/07/2021 06:33:14 - INFO - __main__ - Step 66216: {'lr': 0.00030155393117725355, 'samples': 12713472, 'steps': 66215, 'loss/train': 1.5740033388137817} 11/07/2021 06:33:15 - INFO - __main__ - Step 66217: {'lr': 0.00030154873848151873, 'samples': 12713664, 'steps': 66216, 'loss/train': 0.7496118545532227} 11/07/2021 06:33:16 - INFO - __main__ - Step 66218: {'lr': 0.0003015435457625567, 'samples': 12713856, 'steps': 66217, 'loss/train': 1.1563725471496582} 11/07/2021 06:33:16 - INFO - __main__ - Step 66219: {'lr': 0.00030153835302037, 'samples': 12714048, 'steps': 66218, 'loss/train': 0.864608645439148} 11/07/2021 06:33:17 - INFO - __main__ - Step 66220: {'lr': 0.00030153316025496093, 'samples': 12714240, 'steps': 66219, 'loss/train': 1.4641318321228027} 11/07/2021 06:33:17 - INFO - __main__ - Step 66221: {'lr': 0.0003015279674663318, 'samples': 12714432, 'steps': 66220, 'loss/train': 1.2474156618118286} 11/07/2021 06:33:17 - INFO - __main__ - Step 66222: {'lr': 0.00030152277465448496, 'samples': 12714624, 'steps': 66221, 'loss/train': 1.744721531867981} 11/07/2021 06:33:18 - INFO - __main__ - Step 66223: {'lr': 0.0003015175818194227, 'samples': 12714816, 'steps': 66222, 'loss/train': 1.7917207479476929} 11/07/2021 06:33:19 - INFO - __main__ - Step 66224: {'lr': 0.00030151238896114756, 'samples': 12715008, 'steps': 66223, 'loss/train': 1.0168606042861938} 11/07/2021 06:33:19 - INFO - __main__ - Step 66225: {'lr': 0.00030150719607966163, 'samples': 12715200, 'steps': 66224, 'loss/train': 1.1561100482940674} 11/07/2021 06:33:19 - INFO - __main__ - Step 66226: {'lr': 0.0003015020031749674, 'samples': 12715392, 'steps': 66225, 'loss/train': 1.5368092060089111} 11/07/2021 06:33:20 - INFO - __main__ - Step 66227: {'lr': 0.0003014968102470671, 'samples': 12715584, 'steps': 66226, 'loss/train': 0.14851365983486176} 11/07/2021 06:33:20 - INFO - __main__ - Step 66228: {'lr': 0.00030149161729596313, 'samples': 12715776, 'steps': 66227, 'loss/train': 1.1681921482086182} 11/07/2021 06:33:22 - INFO - __main__ - Step 66229: {'lr': 0.00030148642432165784, 'samples': 12715968, 'steps': 66228, 'loss/train': 1.2614736557006836} 11/07/2021 06:33:22 - INFO - __main__ - Step 66230: {'lr': 0.0003014812313241536, 'samples': 12716160, 'steps': 66229, 'loss/train': 1.700527548789978} 11/07/2021 06:33:22 - INFO - __main__ - Step 66231: {'lr': 0.00030147603830345276, 'samples': 12716352, 'steps': 66230, 'loss/train': 0.10398164391517639} 11/07/2021 06:33:23 - INFO - __main__ - Step 66232: {'lr': 0.0003014708452595575, 'samples': 12716544, 'steps': 66231, 'loss/train': 1.1002004146575928} 11/07/2021 06:33:23 - INFO - __main__ - Step 66233: {'lr': 0.00030146565219247033, 'samples': 12716736, 'steps': 66232, 'loss/train': 0.9450284838676453} 11/07/2021 06:33:24 - INFO - __main__ - Step 66234: {'lr': 0.0003014604591021936, 'samples': 12716928, 'steps': 66233, 'loss/train': 1.3022915124893188} 11/07/2021 06:33:24 - INFO - __main__ - Step 66235: {'lr': 0.0003014552659887294, 'samples': 12717120, 'steps': 66234, 'loss/train': 1.553784728050232} 11/07/2021 06:33:25 - INFO - __main__ - Step 66236: {'lr': 0.00030145007285208036, 'samples': 12717312, 'steps': 66235, 'loss/train': 1.6126375198364258} 11/07/2021 06:33:25 - INFO - __main__ - Step 66237: {'lr': 0.0003014448796922488, 'samples': 12717504, 'steps': 66236, 'loss/train': 1.4158909320831299} 11/07/2021 06:33:25 - INFO - __main__ - Step 66238: {'lr': 0.0003014396865092368, 'samples': 12717696, 'steps': 66237, 'loss/train': 1.2601286172866821} 11/07/2021 06:33:26 - INFO - __main__ - Step 66239: {'lr': 0.00030143449330304696, 'samples': 12717888, 'steps': 66238, 'loss/train': 1.1970356702804565} 11/07/2021 06:33:27 - INFO - __main__ - Step 66240: {'lr': 0.00030142930007368154, 'samples': 12718080, 'steps': 66239, 'loss/train': 1.0553947687149048} 11/07/2021 06:33:27 - INFO - __main__ - Step 66241: {'lr': 0.0003014241068211428, 'samples': 12718272, 'steps': 66240, 'loss/train': 1.7730993032455444} 11/07/2021 06:33:27 - INFO - __main__ - Step 66242: {'lr': 0.0003014189135454332, 'samples': 12718464, 'steps': 66241, 'loss/train': 1.973456859588623} 11/07/2021 06:33:28 - INFO - __main__ - Step 66243: {'lr': 0.000301413720246555, 'samples': 12718656, 'steps': 66242, 'loss/train': 0.6819082498550415} 11/07/2021 06:33:29 - INFO - __main__ - Step 66244: {'lr': 0.00030140852692451067, 'samples': 12718848, 'steps': 66243, 'loss/train': 1.366848349571228} 11/07/2021 06:33:29 - INFO - __main__ - Step 66245: {'lr': 0.00030140333357930237, 'samples': 12719040, 'steps': 66244, 'loss/train': 1.4902690649032593} 11/07/2021 06:33:30 - INFO - __main__ - Step 66246: {'lr': 0.0003013981402109325, 'samples': 12719232, 'steps': 66245, 'loss/train': 1.5027234554290771} 11/07/2021 06:33:30 - INFO - __main__ - Step 66247: {'lr': 0.00030139294681940347, 'samples': 12719424, 'steps': 66246, 'loss/train': 0.051249608397483826} 11/07/2021 06:33:30 - INFO - __main__ - Step 66248: {'lr': 0.00030138775340471754, 'samples': 12719616, 'steps': 66247, 'loss/train': 1.5245825052261353} 11/07/2021 06:33:31 - INFO - __main__ - Step 66249: {'lr': 0.00030138255996687706, 'samples': 12719808, 'steps': 66248, 'loss/train': 1.3866603374481201} 11/07/2021 06:33:32 - INFO - __main__ - Step 66250: {'lr': 0.0003013773665058844, 'samples': 12720000, 'steps': 66249, 'loss/train': 1.0463069677352905} 11/07/2021 06:33:32 - INFO - __main__ - Step 66251: {'lr': 0.000301372173021742, 'samples': 12720192, 'steps': 66250, 'loss/train': 0.9268507957458496} 11/07/2021 06:33:32 - INFO - __main__ - Step 66252: {'lr': 0.00030136697951445204, 'samples': 12720384, 'steps': 66251, 'loss/train': 1.6443283557891846} 11/07/2021 06:33:33 - INFO - __main__ - Step 66253: {'lr': 0.00030136178598401685, 'samples': 12720576, 'steps': 66252, 'loss/train': 0.9858477115631104} 11/07/2021 06:33:34 - INFO - __main__ - Step 66254: {'lr': 0.0003013565924304388, 'samples': 12720768, 'steps': 66253, 'loss/train': 1.5977133512496948} 11/07/2021 06:33:34 - INFO - __main__ - Step 66255: {'lr': 0.0003013513988537204, 'samples': 12720960, 'steps': 66254, 'loss/train': 1.4857988357543945} 11/07/2021 06:33:35 - INFO - __main__ - Step 66256: {'lr': 0.00030134620525386373, 'samples': 12721152, 'steps': 66255, 'loss/train': 0.8796291351318359} 11/07/2021 06:33:35 - INFO - __main__ - Step 66257: {'lr': 0.00030134101163087134, 'samples': 12721344, 'steps': 66256, 'loss/train': 1.8347536325454712} 11/07/2021 06:33:35 - INFO - __main__ - Step 66258: {'lr': 0.0003013358179847455, 'samples': 12721536, 'steps': 66257, 'loss/train': 0.05049759894609451} 11/07/2021 06:33:36 - INFO - __main__ - Step 66259: {'lr': 0.0003013306243154884, 'samples': 12721728, 'steps': 66258, 'loss/train': 1.7372595071792603} 11/07/2021 06:33:37 - INFO - __main__ - Step 66260: {'lr': 0.00030132543062310257, 'samples': 12721920, 'steps': 66259, 'loss/train': 0.7852805852890015} 11/07/2021 06:33:37 - INFO - __main__ - Step 66261: {'lr': 0.0003013202369075904, 'samples': 12722112, 'steps': 66260, 'loss/train': 0.7420227527618408} 11/07/2021 06:33:37 - INFO - __main__ - Step 66262: {'lr': 0.00030131504316895395, 'samples': 12722304, 'steps': 66261, 'loss/train': 0.7690791487693787} 11/07/2021 06:33:38 - INFO - __main__ - Step 66263: {'lr': 0.0003013098494071958, 'samples': 12722496, 'steps': 66262, 'loss/train': 0.8684605360031128} 11/07/2021 06:33:38 - INFO - __main__ - Step 66264: {'lr': 0.0003013046556223183, 'samples': 12722688, 'steps': 66263, 'loss/train': 1.191339135169983} 11/07/2021 06:33:39 - INFO - __main__ - Step 66265: {'lr': 0.00030129946181432364, 'samples': 12722880, 'steps': 66264, 'loss/train': 1.1495366096496582} 11/07/2021 06:33:39 - INFO - __main__ - Step 66266: {'lr': 0.00030129426798321425, 'samples': 12723072, 'steps': 66265, 'loss/train': 0.4775000810623169} 11/07/2021 06:33:40 - INFO - __main__ - Step 66267: {'lr': 0.00030128907412899244, 'samples': 12723264, 'steps': 66266, 'loss/train': 1.6882132291793823} 11/07/2021 06:33:40 - INFO - __main__ - Step 66268: {'lr': 0.0003012838802516606, 'samples': 12723456, 'steps': 66267, 'loss/train': 1.5383883714675903} 11/07/2021 06:33:40 - INFO - __main__ - Step 66269: {'lr': 0.00030127868635122096, 'samples': 12723648, 'steps': 66268, 'loss/train': 0.9380224347114563} 11/07/2021 06:33:42 - INFO - __main__ - Step 66270: {'lr': 0.00030127349242767607, 'samples': 12723840, 'steps': 66269, 'loss/train': 1.310180902481079} 11/07/2021 06:33:42 - INFO - __main__ - Step 66271: {'lr': 0.000301268298481028, 'samples': 12724032, 'steps': 66270, 'loss/train': 1.0228312015533447} 11/07/2021 06:33:42 - INFO - __main__ - Step 66272: {'lr': 0.0003012631045112793, 'samples': 12724224, 'steps': 66271, 'loss/train': 0.9921030402183533} 11/07/2021 06:33:43 - INFO - __main__ - Step 66273: {'lr': 0.0003012579105184322, 'samples': 12724416, 'steps': 66272, 'loss/train': 1.520680546760559} 11/07/2021 06:33:43 - INFO - __main__ - Step 66274: {'lr': 0.0003012527165024891, 'samples': 12724608, 'steps': 66273, 'loss/train': 1.529322862625122} 11/07/2021 06:33:44 - INFO - __main__ - Step 66275: {'lr': 0.0003012475224634523, 'samples': 12724800, 'steps': 66274, 'loss/train': 1.2175441980361938} 11/07/2021 06:33:44 - INFO - __main__ - Step 66276: {'lr': 0.0003012423284013242, 'samples': 12724992, 'steps': 66275, 'loss/train': 1.0858664512634277} 11/07/2021 06:33:45 - INFO - __main__ - Step 66277: {'lr': 0.00030123713431610705, 'samples': 12725184, 'steps': 66276, 'loss/train': 1.3240222930908203} 11/07/2021 06:33:45 - INFO - __main__ - Step 66278: {'lr': 0.00030123194020780327, 'samples': 12725376, 'steps': 66277, 'loss/train': 1.3425776958465576} 11/07/2021 06:33:45 - INFO - __main__ - Step 66279: {'lr': 0.00030122674607641514, 'samples': 12725568, 'steps': 66278, 'loss/train': 1.1732101440429688} 11/07/2021 06:33:46 - INFO - __main__ - Step 66280: {'lr': 0.000301221551921945, 'samples': 12725760, 'steps': 66279, 'loss/train': 1.0688154697418213} 11/07/2021 06:33:47 - INFO - __main__ - Step 66281: {'lr': 0.00030121635774439534, 'samples': 12725952, 'steps': 66280, 'loss/train': 1.2119218111038208} 11/07/2021 06:33:47 - INFO - __main__ - Step 66282: {'lr': 0.0003012111635437683, 'samples': 12726144, 'steps': 66281, 'loss/train': 0.7646006941795349} 11/07/2021 06:33:48 - INFO - __main__ - Step 66283: {'lr': 0.0003012059693200663, 'samples': 12726336, 'steps': 66282, 'loss/train': 1.7622190713882446} 11/07/2021 06:33:48 - INFO - __main__ - Step 66284: {'lr': 0.00030120077507329163, 'samples': 12726528, 'steps': 66283, 'loss/train': 1.3922927379608154} 11/07/2021 06:33:48 - INFO - __main__ - Step 66285: {'lr': 0.0003011955808034467, 'samples': 12726720, 'steps': 66284, 'loss/train': 1.3611963987350464} 11/07/2021 06:33:49 - INFO - __main__ - Step 66286: {'lr': 0.0003011903865105339, 'samples': 12726912, 'steps': 66285, 'loss/train': 1.6107890605926514} 11/07/2021 06:33:50 - INFO - __main__ - Step 66287: {'lr': 0.0003011851921945555, 'samples': 12727104, 'steps': 66286, 'loss/train': 1.5119935274124146} 11/07/2021 06:33:50 - INFO - __main__ - Step 66288: {'lr': 0.00030117999785551376, 'samples': 12727296, 'steps': 66287, 'loss/train': 1.5399515628814697} 11/07/2021 06:33:50 - INFO - __main__ - Step 66289: {'lr': 0.00030117480349341116, 'samples': 12727488, 'steps': 66288, 'loss/train': 1.0585014820098877} 11/07/2021 06:33:51 - INFO - __main__ - Step 66290: {'lr': 0.00030116960910824995, 'samples': 12727680, 'steps': 66289, 'loss/train': 0.9811328649520874} 11/07/2021 06:33:52 - INFO - __main__ - Step 66291: {'lr': 0.00030116441470003254, 'samples': 12727872, 'steps': 66290, 'loss/train': 1.8236507177352905} 11/07/2021 06:33:52 - INFO - __main__ - Step 66292: {'lr': 0.00030115922026876125, 'samples': 12728064, 'steps': 66291, 'loss/train': 1.247738242149353} 11/07/2021 06:33:52 - INFO - __main__ - Step 66293: {'lr': 0.00030115402581443835, 'samples': 12728256, 'steps': 66292, 'loss/train': 1.2401412725448608} 11/07/2021 06:33:53 - INFO - __main__ - Step 66294: {'lr': 0.0003011488313370663, 'samples': 12728448, 'steps': 66293, 'loss/train': 4.14809513092041} 11/07/2021 06:33:53 - INFO - __main__ - Step 66295: {'lr': 0.0003011436368366473, 'samples': 12728640, 'steps': 66294, 'loss/train': 1.3098433017730713} 11/07/2021 06:33:54 - INFO - __main__ - Step 66296: {'lr': 0.00030113844231318375, 'samples': 12728832, 'steps': 66295, 'loss/train': 1.3093377351760864} 11/07/2021 06:33:54 - INFO - __main__ - Step 66297: {'lr': 0.00030113324776667803, 'samples': 12729024, 'steps': 66296, 'loss/train': 1.5484081506729126} 11/07/2021 06:33:55 - INFO - __main__ - Step 66298: {'lr': 0.0003011280531971326, 'samples': 12729216, 'steps': 66297, 'loss/train': 1.594380497932434} 11/07/2021 06:33:55 - INFO - __main__ - Step 66299: {'lr': 0.0003011228586045495, 'samples': 12729408, 'steps': 66298, 'loss/train': 1.5846751928329468} 11/07/2021 06:33:55 - INFO - __main__ - Step 66300: {'lr': 0.00030111766398893127, 'samples': 12729600, 'steps': 66299, 'loss/train': 1.1599481105804443} 11/07/2021 06:33:57 - INFO - __main__ - Step 66301: {'lr': 0.0003011124693502802, 'samples': 12729792, 'steps': 66300, 'loss/train': 1.6559373140335083} 11/07/2021 06:33:57 - INFO - __main__ - Step 66302: {'lr': 0.00030110727468859864, 'samples': 12729984, 'steps': 66301, 'loss/train': 1.7291558980941772} 11/07/2021 06:33:57 - INFO - __main__ - Step 66303: {'lr': 0.00030110208000388896, 'samples': 12730176, 'steps': 66302, 'loss/train': 1.0608025789260864} 11/07/2021 06:33:58 - INFO - __main__ - Step 66304: {'lr': 0.0003010968852961535, 'samples': 12730368, 'steps': 66303, 'loss/train': 1.760735034942627} 11/07/2021 06:33:58 - INFO - __main__ - Step 66305: {'lr': 0.0003010916905653945, 'samples': 12730560, 'steps': 66304, 'loss/train': 1.3020204305648804} 11/07/2021 06:33:59 - INFO - __main__ - Step 66306: {'lr': 0.0003010864958116144, 'samples': 12730752, 'steps': 66305, 'loss/train': 0.7436755895614624} 11/07/2021 06:34:00 - INFO - __main__ - Step 66307: {'lr': 0.00030108130103481554, 'samples': 12730944, 'steps': 66306, 'loss/train': 1.4106642007827759} 11/07/2021 06:34:00 - INFO - __main__ - Step 66308: {'lr': 0.00030107610623500013, 'samples': 12731136, 'steps': 66307, 'loss/train': 1.5002694129943848} 11/07/2021 06:34:00 - INFO - __main__ - Step 66309: {'lr': 0.0003010709114121707, 'samples': 12731328, 'steps': 66308, 'loss/train': 1.767829179763794} 11/07/2021 06:34:01 - INFO - __main__ - Step 66310: {'lr': 0.0003010657165663295, 'samples': 12731520, 'steps': 66309, 'loss/train': 1.1225128173828125} 11/07/2021 06:34:02 - INFO - __main__ - Step 66311: {'lr': 0.00030106052169747886, 'samples': 12731712, 'steps': 66310, 'loss/train': 0.860564112663269} 11/07/2021 06:34:02 - INFO - __main__ - Step 66312: {'lr': 0.0003010553268056212, 'samples': 12731904, 'steps': 66311, 'loss/train': 0.4648149609565735} 11/07/2021 06:34:02 - INFO - __main__ - Step 66313: {'lr': 0.0003010501318907587, 'samples': 12732096, 'steps': 66312, 'loss/train': 0.04741385579109192} 11/07/2021 06:34:03 - INFO - __main__ - Step 66314: {'lr': 0.0003010449369528939, 'samples': 12732288, 'steps': 66313, 'loss/train': 1.7687499523162842} 11/07/2021 06:34:03 - INFO - __main__ - Step 66315: {'lr': 0.0003010397419920289, 'samples': 12732480, 'steps': 66314, 'loss/train': 0.784953236579895} 11/07/2021 06:34:04 - INFO - __main__ - Step 66316: {'lr': 0.0003010345470081663, 'samples': 12732672, 'steps': 66315, 'loss/train': 1.3809489011764526} 11/07/2021 06:34:04 - INFO - __main__ - Step 66317: {'lr': 0.0003010293520013083, 'samples': 12732864, 'steps': 66316, 'loss/train': 1.6020259857177734} 11/07/2021 06:34:05 - INFO - __main__ - Step 66318: {'lr': 0.00030102415697145726, 'samples': 12733056, 'steps': 66317, 'loss/train': 1.2462317943572998} 11/07/2021 06:34:05 - INFO - __main__ - Step 66319: {'lr': 0.0003010189619186155, 'samples': 12733248, 'steps': 66318, 'loss/train': 0.9910619258880615} 11/07/2021 06:34:05 - INFO - __main__ - Step 66320: {'lr': 0.0003010137668427854, 'samples': 12733440, 'steps': 66319, 'loss/train': 1.5168834924697876} 11/07/2021 06:34:07 - INFO - __main__ - Step 66321: {'lr': 0.00030100857174396924, 'samples': 12733632, 'steps': 66320, 'loss/train': 1.3829407691955566} 11/07/2021 06:34:07 - INFO - __main__ - Step 66322: {'lr': 0.0003010033766221694, 'samples': 12733824, 'steps': 66321, 'loss/train': 1.5154014825820923} 11/07/2021 06:34:08 - INFO - __main__ - Step 66323: {'lr': 0.00030099818147738826, 'samples': 12734016, 'steps': 66322, 'loss/train': 0.5619611144065857} 11/07/2021 06:34:08 - INFO - __main__ - Step 66324: {'lr': 0.00030099298630962813, 'samples': 12734208, 'steps': 66323, 'loss/train': 0.5387817025184631} 11/07/2021 06:34:08 - INFO - __main__ - Step 66325: {'lr': 0.0003009877911188914, 'samples': 12734400, 'steps': 66324, 'loss/train': 0.8011993169784546} 11/07/2021 06:34:09 - INFO - __main__ - Step 66326: {'lr': 0.0003009825959051803, 'samples': 12734592, 'steps': 66325, 'loss/train': 1.172276258468628} 11/07/2021 06:34:10 - INFO - __main__ - Step 66327: {'lr': 0.0003009774006684972, 'samples': 12734784, 'steps': 66326, 'loss/train': 1.4076952934265137} 11/07/2021 06:34:10 - INFO - __main__ - Step 66328: {'lr': 0.0003009722054088445, 'samples': 12734976, 'steps': 66327, 'loss/train': 1.4647339582443237} 11/07/2021 06:34:11 - INFO - __main__ - Step 66329: {'lr': 0.00030096701012622453, 'samples': 12735168, 'steps': 66328, 'loss/train': 0.8802450299263} 11/07/2021 06:34:11 - INFO - __main__ - Step 66330: {'lr': 0.0003009618148206396, 'samples': 12735360, 'steps': 66329, 'loss/train': 1.4735438823699951} 11/07/2021 06:34:11 - INFO - __main__ - Step 66331: {'lr': 0.0003009566194920921, 'samples': 12735552, 'steps': 66330, 'loss/train': 1.1487783193588257} 11/07/2021 06:34:12 - INFO - __main__ - Step 66332: {'lr': 0.0003009514241405843, 'samples': 12735744, 'steps': 66331, 'loss/train': 1.5365599393844604} 11/07/2021 06:34:13 - INFO - __main__ - Step 66333: {'lr': 0.00030094622876611853, 'samples': 12735936, 'steps': 66332, 'loss/train': 1.5994325876235962} 11/07/2021 06:34:13 - INFO - __main__ - Step 66334: {'lr': 0.00030094103336869723, 'samples': 12736128, 'steps': 66333, 'loss/train': 1.3584208488464355} 11/07/2021 06:34:13 - INFO - __main__ - Step 66335: {'lr': 0.0003009358379483227, 'samples': 12736320, 'steps': 66334, 'loss/train': 1.5823190212249756} 11/07/2021 06:34:14 - INFO - __main__ - Step 66336: {'lr': 0.0003009306425049972, 'samples': 12736512, 'steps': 66335, 'loss/train': 1.5797573328018188} 11/07/2021 06:34:15 - INFO - __main__ - Step 66337: {'lr': 0.00030092544703872316, 'samples': 12736704, 'steps': 66336, 'loss/train': 1.2382409572601318} 11/07/2021 06:34:15 - INFO - __main__ - Step 66338: {'lr': 0.000300920251549503, 'samples': 12736896, 'steps': 66337, 'loss/train': 1.4329423904418945} 11/07/2021 06:34:16 - INFO - __main__ - Step 66339: {'lr': 0.0003009150560373388, 'samples': 12737088, 'steps': 66338, 'loss/train': 1.1073436737060547} 11/07/2021 06:34:16 - INFO - __main__ - Step 66340: {'lr': 0.00030090986050223314, 'samples': 12737280, 'steps': 66339, 'loss/train': 1.487412691116333} 11/07/2021 06:34:16 - INFO - __main__ - Step 66341: {'lr': 0.00030090466494418826, 'samples': 12737472, 'steps': 66340, 'loss/train': 1.3531047105789185} 11/07/2021 06:34:17 - INFO - __main__ - Step 66342: {'lr': 0.00030089946936320654, 'samples': 12737664, 'steps': 66341, 'loss/train': 1.1915382146835327} 11/07/2021 06:34:18 - INFO - __main__ - Step 66343: {'lr': 0.0003008942737592903, 'samples': 12737856, 'steps': 66342, 'loss/train': 1.7370036840438843} 11/07/2021 06:34:18 - INFO - __main__ - Step 66344: {'lr': 0.0003008890781324419, 'samples': 12738048, 'steps': 66343, 'loss/train': 0.7978464961051941} 11/07/2021 06:34:18 - INFO - __main__ - Step 66345: {'lr': 0.00030088388248266366, 'samples': 12738240, 'steps': 66344, 'loss/train': 1.3977705240249634} 11/07/2021 06:34:19 - INFO - __main__ - Step 66346: {'lr': 0.00030087868680995795, 'samples': 12738432, 'steps': 66345, 'loss/train': 1.6669083833694458} 11/07/2021 06:34:19 - INFO - __main__ - Step 66347: {'lr': 0.00030087349111432705, 'samples': 12738624, 'steps': 66346, 'loss/train': 1.5853362083435059} 11/07/2021 06:34:20 - INFO - __main__ - Step 66348: {'lr': 0.00030086829539577336, 'samples': 12738816, 'steps': 66347, 'loss/train': 1.8417489528656006} 11/07/2021 06:34:20 - INFO - __main__ - Step 66349: {'lr': 0.0003008630996542992, 'samples': 12739008, 'steps': 66348, 'loss/train': 5.745266437530518} 11/07/2021 06:34:21 - INFO - __main__ - Step 66350: {'lr': 0.0003008579038899069, 'samples': 12739200, 'steps': 66349, 'loss/train': 1.1894187927246094} 11/07/2021 06:34:21 - INFO - __main__ - Step 66351: {'lr': 0.0003008527081025988, 'samples': 12739392, 'steps': 66350, 'loss/train': 1.181612253189087} 11/07/2021 06:34:22 - INFO - __main__ - Step 66352: {'lr': 0.00030084751229237733, 'samples': 12739584, 'steps': 66351, 'loss/train': 1.305056095123291} 11/07/2021 06:34:22 - INFO - __main__ - Step 66353: {'lr': 0.0003008423164592447, 'samples': 12739776, 'steps': 66352, 'loss/train': 1.5993620157241821} 11/07/2021 06:34:23 - INFO - __main__ - Step 66354: {'lr': 0.0003008371206032033, 'samples': 12739968, 'steps': 66353, 'loss/train': 1.4853211641311646} 11/07/2021 06:34:23 - INFO - __main__ - Step 66355: {'lr': 0.00030083192472425544, 'samples': 12740160, 'steps': 66354, 'loss/train': 1.3711051940917969} 11/07/2021 06:34:24 - INFO - __main__ - Step 66356: {'lr': 0.0003008267288224036, 'samples': 12740352, 'steps': 66355, 'loss/train': 1.302602767944336} 11/07/2021 06:34:24 - INFO - __main__ - Step 66357: {'lr': 0.0003008215328976499, 'samples': 12740544, 'steps': 66356, 'loss/train': 0.9855397939682007} 11/07/2021 06:34:24 - INFO - __main__ - Step 66358: {'lr': 0.00030081633694999696, 'samples': 12740736, 'steps': 66357, 'loss/train': 0.8939595222473145} 11/07/2021 06:34:27 - INFO - __main__ - Step 66359: {'lr': 0.0003008111409794468, 'samples': 12740928, 'steps': 66358, 'loss/train': 1.3550819158554077} 11/07/2021 06:34:27 - INFO - __main__ - Step 66360: {'lr': 0.00030080594498600206, 'samples': 12741120, 'steps': 66359, 'loss/train': 1.6358691453933716} 11/07/2021 06:34:28 - INFO - __main__ - Step 66361: {'lr': 0.00030080074896966487, 'samples': 12741312, 'steps': 66360, 'loss/train': 1.4113481044769287} 11/07/2021 06:34:28 - INFO - __main__ - Step 66362: {'lr': 0.0003007955529304376, 'samples': 12741504, 'steps': 66361, 'loss/train': 2.0597312450408936} 11/07/2021 06:34:28 - INFO - __main__ - Step 66363: {'lr': 0.00030079035686832276, 'samples': 12741696, 'steps': 66362, 'loss/train': 1.8299421072006226} 11/07/2021 06:34:29 - INFO - __main__ - Step 66364: {'lr': 0.00030078516078332245, 'samples': 12741888, 'steps': 66363, 'loss/train': 1.7863110303878784} 11/07/2021 06:34:29 - INFO - __main__ - Step 66365: {'lr': 0.00030077996467543924, 'samples': 12742080, 'steps': 66364, 'loss/train': 0.8672454953193665} 11/07/2021 06:34:29 - INFO - __main__ - Step 66366: {'lr': 0.0003007747685446753, 'samples': 12742272, 'steps': 66365, 'loss/train': 1.7702008485794067} 11/07/2021 06:34:30 - INFO - __main__ - Step 66367: {'lr': 0.00030076957239103306, 'samples': 12742464, 'steps': 66366, 'loss/train': 1.7702382802963257} 11/07/2021 06:34:31 - INFO - __main__ - Step 66368: {'lr': 0.00030076437621451475, 'samples': 12742656, 'steps': 66367, 'loss/train': 1.652742624282837} 11/07/2021 06:34:31 - INFO - __main__ - Step 66369: {'lr': 0.00030075918001512287, 'samples': 12742848, 'steps': 66368, 'loss/train': 1.3833401203155518} 11/07/2021 06:34:31 - INFO - __main__ - Step 66370: {'lr': 0.0003007539837928597, 'samples': 12743040, 'steps': 66369, 'loss/train': 1.5250346660614014} 11/07/2021 06:34:32 - INFO - __main__ - Step 66371: {'lr': 0.0003007487875477275, 'samples': 12743232, 'steps': 66370, 'loss/train': 1.3821173906326294} 11/07/2021 06:34:33 - INFO - __main__ - Step 66372: {'lr': 0.00030074359127972876, 'samples': 12743424, 'steps': 66371, 'loss/train': 1.7858225107192993} 11/07/2021 06:34:33 - INFO - __main__ - Step 66373: {'lr': 0.00030073839498886566, 'samples': 12743616, 'steps': 66372, 'loss/train': 1.5881478786468506} 11/07/2021 06:34:34 - INFO - __main__ - Step 66374: {'lr': 0.0003007331986751407, 'samples': 12743808, 'steps': 66373, 'loss/train': 1.0782166719436646} 11/07/2021 06:34:34 - INFO - __main__ - Step 66375: {'lr': 0.00030072800233855605, 'samples': 12744000, 'steps': 66374, 'loss/train': 1.2705281972885132} 11/07/2021 06:34:34 - INFO - __main__ - Step 66376: {'lr': 0.00030072280597911424, 'samples': 12744192, 'steps': 66375, 'loss/train': 2.0557868480682373} 11/07/2021 06:34:35 - INFO - __main__ - Step 66377: {'lr': 0.0003007176095968175, 'samples': 12744384, 'steps': 66376, 'loss/train': 1.8654026985168457} 11/07/2021 06:34:36 - INFO - __main__ - Step 66378: {'lr': 0.0003007124131916682, 'samples': 12744576, 'steps': 66377, 'loss/train': 1.5593841075897217} 11/07/2021 06:34:36 - INFO - __main__ - Step 66379: {'lr': 0.0003007072167636686, 'samples': 12744768, 'steps': 66378, 'loss/train': 1.2987421751022339} 11/07/2021 06:34:36 - INFO - __main__ - Step 66380: {'lr': 0.0003007020203128211, 'samples': 12744960, 'steps': 66379, 'loss/train': 1.2571394443511963} 11/07/2021 06:34:37 - INFO - __main__ - Step 66381: {'lr': 0.0003006968238391281, 'samples': 12745152, 'steps': 66380, 'loss/train': 1.672746181488037} 11/07/2021 06:34:38 - INFO - __main__ - Step 66382: {'lr': 0.00030069162734259195, 'samples': 12745344, 'steps': 66381, 'loss/train': 0.929642379283905} 11/07/2021 06:34:38 - INFO - __main__ - Step 66383: {'lr': 0.0003006864308232148, 'samples': 12745536, 'steps': 66382, 'loss/train': 1.939717411994934} 11/07/2021 06:34:39 - INFO - __main__ - Step 66384: {'lr': 0.00030068123428099924, 'samples': 12745728, 'steps': 66383, 'loss/train': 1.4837837219238281} 11/07/2021 06:34:39 - INFO - __main__ - Step 66385: {'lr': 0.0003006760377159475, 'samples': 12745920, 'steps': 66384, 'loss/train': 1.1217055320739746} 11/07/2021 06:34:39 - INFO - __main__ - Step 66386: {'lr': 0.00030067084112806185, 'samples': 12746112, 'steps': 66385, 'loss/train': 1.628525733947754} 11/07/2021 06:34:40 - INFO - __main__ - Step 66387: {'lr': 0.00030066564451734475, 'samples': 12746304, 'steps': 66386, 'loss/train': 1.3481296300888062} 11/07/2021 06:34:40 - INFO - __main__ - Step 66388: {'lr': 0.0003006604478837984, 'samples': 12746496, 'steps': 66387, 'loss/train': 1.0647903680801392} 11/07/2021 06:34:41 - INFO - __main__ - Step 66389: {'lr': 0.00030065525122742535, 'samples': 12746688, 'steps': 66388, 'loss/train': 1.5520217418670654} 11/07/2021 06:34:42 - INFO - __main__ - Step 66390: {'lr': 0.0003006500545482278, 'samples': 12746880, 'steps': 66389, 'loss/train': 1.302832007408142} 11/07/2021 06:34:42 - INFO - __main__ - Step 66391: {'lr': 0.0003006448578462081, 'samples': 12747072, 'steps': 66390, 'loss/train': 1.7191356420516968} 11/07/2021 06:34:42 - INFO - __main__ - Step 66392: {'lr': 0.00030063966112136865, 'samples': 12747264, 'steps': 66391, 'loss/train': 1.223104476928711} 11/07/2021 06:34:43 - INFO - __main__ - Step 66393: {'lr': 0.00030063446437371167, 'samples': 12747456, 'steps': 66392, 'loss/train': 1.8049123287200928} 11/07/2021 06:34:44 - INFO - __main__ - Step 66394: {'lr': 0.0003006292676032396, 'samples': 12747648, 'steps': 66393, 'loss/train': 0.25066444277763367} 11/07/2021 06:34:44 - INFO - __main__ - Step 66395: {'lr': 0.0003006240708099548, 'samples': 12747840, 'steps': 66394, 'loss/train': 0.6681715250015259} 11/07/2021 06:34:44 - INFO - __main__ - Step 66396: {'lr': 0.00030061887399385954, 'samples': 12748032, 'steps': 66395, 'loss/train': 2.1010284423828125} 11/07/2021 06:34:45 - INFO - __main__ - Step 66397: {'lr': 0.00030061367715495627, 'samples': 12748224, 'steps': 66396, 'loss/train': 0.5003383755683899} 11/07/2021 06:34:45 - INFO - __main__ - Step 66398: {'lr': 0.0003006084802932472, 'samples': 12748416, 'steps': 66397, 'loss/train': 1.2009634971618652} 11/07/2021 06:34:45 - INFO - __main__ - Step 66399: {'lr': 0.0003006032834087347, 'samples': 12748608, 'steps': 66398, 'loss/train': 1.5120054483413696} 11/07/2021 06:34:47 - INFO - __main__ - Step 66400: {'lr': 0.00030059808650142116, 'samples': 12748800, 'steps': 66399, 'loss/train': 1.4923722743988037} 11/07/2021 06:34:47 - INFO - __main__ - Step 66401: {'lr': 0.00030059288957130895, 'samples': 12748992, 'steps': 66400, 'loss/train': 1.0279885530471802} 11/07/2021 06:34:47 - INFO - __main__ - Step 66402: {'lr': 0.0003005876926184003, 'samples': 12749184, 'steps': 66401, 'loss/train': 1.373974323272705} 11/07/2021 06:34:48 - INFO - __main__ - Step 66403: {'lr': 0.00030058249564269765, 'samples': 12749376, 'steps': 66402, 'loss/train': 1.451093316078186} 11/07/2021 06:34:48 - INFO - __main__ - Step 66404: {'lr': 0.0003005772986442033, 'samples': 12749568, 'steps': 66403, 'loss/train': 1.7448621988296509} 11/07/2021 06:34:49 - INFO - __main__ - Step 66405: {'lr': 0.00030057210162291964, 'samples': 12749760, 'steps': 66404, 'loss/train': 1.9317957162857056} 11/07/2021 06:34:49 - INFO - __main__ - Step 66406: {'lr': 0.00030056690457884894, 'samples': 12749952, 'steps': 66405, 'loss/train': 1.0412652492523193} 11/07/2021 06:34:50 - INFO - __main__ - Step 66407: {'lr': 0.00030056170751199357, 'samples': 12750144, 'steps': 66406, 'loss/train': 0.9898883104324341} 11/07/2021 06:34:50 - INFO - __main__ - Step 66408: {'lr': 0.00030055651042235586, 'samples': 12750336, 'steps': 66407, 'loss/train': 1.3887337446212769} 11/07/2021 06:34:50 - INFO - __main__ - Step 66409: {'lr': 0.0003005513133099382, 'samples': 12750528, 'steps': 66408, 'loss/train': 1.0155344009399414} 11/07/2021 06:34:52 - INFO - __main__ - Step 66410: {'lr': 0.0003005461161747429, 'samples': 12750720, 'steps': 66409, 'loss/train': 1.3881829977035522} 11/07/2021 06:34:52 - INFO - __main__ - Step 66411: {'lr': 0.00030054091901677226, 'samples': 12750912, 'steps': 66410, 'loss/train': 1.581893801689148} 11/07/2021 06:34:52 - INFO - __main__ - Step 66412: {'lr': 0.00030053572183602866, 'samples': 12751104, 'steps': 66411, 'loss/train': 0.7832050323486328} 11/07/2021 06:34:53 - INFO - __main__ - Step 66413: {'lr': 0.00030053052463251443, 'samples': 12751296, 'steps': 66412, 'loss/train': 1.4011684656143188} 11/07/2021 06:34:53 - INFO - __main__ - Step 66414: {'lr': 0.000300525327406232, 'samples': 12751488, 'steps': 66413, 'loss/train': 1.4199122190475464} 11/07/2021 06:34:54 - INFO - __main__ - Step 66415: {'lr': 0.0003005201301571836, 'samples': 12751680, 'steps': 66414, 'loss/train': 0.05534379556775093} 11/07/2021 06:34:54 - INFO - __main__ - Step 66416: {'lr': 0.00030051493288537164, 'samples': 12751872, 'steps': 66415, 'loss/train': 1.7302234172821045} 11/07/2021 06:34:55 - INFO - __main__ - Step 66417: {'lr': 0.0003005097355907984, 'samples': 12752064, 'steps': 66416, 'loss/train': 1.3360222578048706} 11/07/2021 06:34:55 - INFO - __main__ - Step 66418: {'lr': 0.00030050453827346627, 'samples': 12752256, 'steps': 66417, 'loss/train': 1.598020076751709} 11/07/2021 06:34:55 - INFO - __main__ - Step 66419: {'lr': 0.0003004993409333775, 'samples': 12752448, 'steps': 66418, 'loss/train': 0.9319589138031006} 11/07/2021 06:34:56 - INFO - __main__ - Step 66420: {'lr': 0.0003004941435705346, 'samples': 12752640, 'steps': 66419, 'loss/train': 1.408770203590393} 11/07/2021 06:34:57 - INFO - __main__ - Step 66421: {'lr': 0.00030048894618493977, 'samples': 12752832, 'steps': 66420, 'loss/train': 1.794650912284851} 11/07/2021 06:34:57 - INFO - __main__ - Step 66422: {'lr': 0.0003004837487765954, 'samples': 12753024, 'steps': 66421, 'loss/train': 1.2745815515518188} 11/07/2021 06:34:57 - INFO - __main__ - Step 66423: {'lr': 0.00030047855134550383, 'samples': 12753216, 'steps': 66422, 'loss/train': 1.5514518022537231} 11/07/2021 06:34:58 - INFO - __main__ - Step 66424: {'lr': 0.00030047335389166743, 'samples': 12753408, 'steps': 66423, 'loss/train': 1.6519418954849243} 11/07/2021 06:34:59 - INFO - __main__ - Step 66425: {'lr': 0.00030046815641508853, 'samples': 12753600, 'steps': 66424, 'loss/train': 1.4577373266220093} 11/07/2021 06:34:59 - INFO - __main__ - Step 66426: {'lr': 0.0003004629589157694, 'samples': 12753792, 'steps': 66425, 'loss/train': 1.6781562566757202} 11/07/2021 06:34:59 - INFO - __main__ - Step 66427: {'lr': 0.0003004577613937125, 'samples': 12753984, 'steps': 66426, 'loss/train': 1.3605917692184448} 11/07/2021 06:35:00 - INFO - __main__ - Step 66428: {'lr': 0.00030045256384892007, 'samples': 12754176, 'steps': 66427, 'loss/train': 1.5632530450820923} 11/07/2021 06:35:00 - INFO - __main__ - Step 66429: {'lr': 0.00030044736628139445, 'samples': 12754368, 'steps': 66428, 'loss/train': 1.3104162216186523} 11/07/2021 06:35:01 - INFO - __main__ - Step 66430: {'lr': 0.0003004421686911381, 'samples': 12754560, 'steps': 66429, 'loss/train': 1.1378188133239746} 11/07/2021 06:35:01 - INFO - __main__ - Step 66431: {'lr': 0.0003004369710781533, 'samples': 12754752, 'steps': 66430, 'loss/train': 1.1872493028640747} 11/07/2021 06:35:02 - INFO - __main__ - Step 66432: {'lr': 0.00030043177344244235, 'samples': 12754944, 'steps': 66431, 'loss/train': 1.3594497442245483} 11/07/2021 06:35:02 - INFO - __main__ - Step 66433: {'lr': 0.0003004265757840076, 'samples': 12755136, 'steps': 66432, 'loss/train': 1.149265170097351} 11/07/2021 06:35:03 - INFO - __main__ - Step 66434: {'lr': 0.0003004213781028514, 'samples': 12755328, 'steps': 66433, 'loss/train': 1.2163373231887817} 11/07/2021 06:35:04 - INFO - __main__ - Step 66435: {'lr': 0.00030041618039897616, 'samples': 12755520, 'steps': 66434, 'loss/train': 1.9244475364685059} 11/07/2021 06:35:04 - INFO - __main__ - Step 66436: {'lr': 0.0003004109826723841, 'samples': 12755712, 'steps': 66435, 'loss/train': 1.0519063472747803} 11/07/2021 06:35:04 - INFO - __main__ - Step 66437: {'lr': 0.00030040578492307766, 'samples': 12755904, 'steps': 66436, 'loss/train': 1.0051522254943848} 11/07/2021 06:35:05 - INFO - __main__ - Step 66438: {'lr': 0.00030040058715105915, 'samples': 12756096, 'steps': 66437, 'loss/train': 0.8880148530006409} 11/07/2021 06:35:05 - INFO - __main__ - Step 66439: {'lr': 0.000300395389356331, 'samples': 12756288, 'steps': 66438, 'loss/train': 1.3298498392105103} 11/07/2021 06:35:06 - INFO - __main__ - Step 66440: {'lr': 0.00030039019153889536, 'samples': 12756480, 'steps': 66439, 'loss/train': 0.9069047570228577} 11/07/2021 06:35:06 - INFO - __main__ - Step 66441: {'lr': 0.00030038499369875474, 'samples': 12756672, 'steps': 66440, 'loss/train': 1.541292667388916} 11/07/2021 06:35:07 - INFO - __main__ - Step 66442: {'lr': 0.00030037979583591136, 'samples': 12756864, 'steps': 66441, 'loss/train': 1.391114592552185} 11/07/2021 06:35:07 - INFO - __main__ - Step 66443: {'lr': 0.0003003745979503676, 'samples': 12757056, 'steps': 66442, 'loss/train': 1.387724757194519} 11/07/2021 06:35:08 - INFO - __main__ - Step 66444: {'lr': 0.0003003694000421259, 'samples': 12757248, 'steps': 66443, 'loss/train': 1.7527414560317993} 11/07/2021 06:35:08 - INFO - __main__ - Step 66445: {'lr': 0.0003003642021111885, 'samples': 12757440, 'steps': 66444, 'loss/train': 1.7695266008377075} 11/07/2021 06:35:09 - INFO - __main__ - Step 66446: {'lr': 0.0003003590041575578, 'samples': 12757632, 'steps': 66445, 'loss/train': 1.1333246231079102} 11/07/2021 06:35:10 - INFO - __main__ - Step 66447: {'lr': 0.00030035380618123603, 'samples': 12757824, 'steps': 66446, 'loss/train': 1.450875997543335} 11/07/2021 06:35:10 - INFO - __main__ - Step 66448: {'lr': 0.00030034860818222564, 'samples': 12758016, 'steps': 66447, 'loss/train': 1.3751925230026245} 11/07/2021 06:35:10 - INFO - __main__ - Step 66449: {'lr': 0.000300343410160529, 'samples': 12758208, 'steps': 66448, 'loss/train': 1.158181071281433} 11/07/2021 06:35:11 - INFO - __main__ - Step 66450: {'lr': 0.0003003382121161483, 'samples': 12758400, 'steps': 66449, 'loss/train': 0.1001257672905922} 11/07/2021 06:35:12 - INFO - __main__ - Step 66451: {'lr': 0.000300333014049086, 'samples': 12758592, 'steps': 66450, 'loss/train': 1.4357441663742065} 11/07/2021 06:35:12 - INFO - __main__ - Step 66452: {'lr': 0.00030032781595934455, 'samples': 12758784, 'steps': 66451, 'loss/train': 1.4537526369094849} 11/07/2021 06:35:12 - INFO - __main__ - Step 66453: {'lr': 0.0003003226178469261, 'samples': 12758976, 'steps': 66452, 'loss/train': 0.6968719363212585} 11/07/2021 06:35:13 - INFO - __main__ - Step 66454: {'lr': 0.000300317419711833, 'samples': 12759168, 'steps': 66453, 'loss/train': 1.922614336013794} 11/07/2021 06:35:13 - INFO - __main__ - Step 66455: {'lr': 0.00030031222155406763, 'samples': 12759360, 'steps': 66454, 'loss/train': 1.389673113822937} 11/07/2021 06:35:13 - INFO - __main__ - Step 66456: {'lr': 0.0003003070233736324, 'samples': 12759552, 'steps': 66455, 'loss/train': 1.2452986240386963} 11/07/2021 06:35:14 - INFO - __main__ - Step 66457: {'lr': 0.00030030182517052956, 'samples': 12759744, 'steps': 66456, 'loss/train': 1.1820541620254517} 11/07/2021 06:35:15 - INFO - __main__ - Step 66458: {'lr': 0.0003002966269447615, 'samples': 12759936, 'steps': 66457, 'loss/train': 1.3216201066970825} 11/07/2021 06:35:15 - INFO - __main__ - Step 66459: {'lr': 0.00030029142869633066, 'samples': 12760128, 'steps': 66458, 'loss/train': 1.6012029647827148} 11/07/2021 06:35:15 - INFO - __main__ - Step 66460: {'lr': 0.0003002862304252392, 'samples': 12760320, 'steps': 66459, 'loss/train': 1.3498961925506592} 11/07/2021 06:35:16 - INFO - __main__ - Step 66461: {'lr': 0.0003002810321314895, 'samples': 12760512, 'steps': 66460, 'loss/train': 1.0835996866226196} 11/07/2021 06:35:17 - INFO - __main__ - Step 66462: {'lr': 0.00030027583381508395, 'samples': 12760704, 'steps': 66461, 'loss/train': 1.3220826387405396} 11/07/2021 06:35:17 - INFO - __main__ - Step 66463: {'lr': 0.0003002706354760249, 'samples': 12760896, 'steps': 66462, 'loss/train': 1.1198945045471191} 11/07/2021 06:35:18 - INFO - __main__ - Step 66464: {'lr': 0.0003002654371143147, 'samples': 12761088, 'steps': 66463, 'loss/train': 1.1133533716201782} 11/07/2021 06:35:18 - INFO - __main__ - Step 66465: {'lr': 0.0003002602387299557, 'samples': 12761280, 'steps': 66464, 'loss/train': 1.7230342626571655} 11/07/2021 06:35:18 - INFO - __main__ - Step 66466: {'lr': 0.00030025504032295014, 'samples': 12761472, 'steps': 66465, 'loss/train': 1.53044855594635} 11/07/2021 06:35:19 - INFO - __main__ - Step 66467: {'lr': 0.0003002498418933005, 'samples': 12761664, 'steps': 66466, 'loss/train': 1.5893867015838623} 11/07/2021 06:35:20 - INFO - __main__ - Step 66468: {'lr': 0.000300244643441009, 'samples': 12761856, 'steps': 66467, 'loss/train': 1.3961753845214844} 11/07/2021 06:35:20 - INFO - __main__ - Step 66469: {'lr': 0.0003002394449660781, 'samples': 12762048, 'steps': 66468, 'loss/train': 1.5602270364761353} 11/07/2021 06:35:20 - INFO - __main__ - Step 66470: {'lr': 0.00030023424646851, 'samples': 12762240, 'steps': 66469, 'loss/train': 1.6813372373580933} 11/07/2021 06:35:21 - INFO - __main__ - Step 66471: {'lr': 0.00030022904794830716, 'samples': 12762432, 'steps': 66470, 'loss/train': 1.3525004386901855} 11/07/2021 06:35:22 - INFO - __main__ - Step 66472: {'lr': 0.00030022384940547186, 'samples': 12762624, 'steps': 66471, 'loss/train': 0.09978263825178146} 11/07/2021 06:35:22 - INFO - __main__ - Step 66473: {'lr': 0.0003002186508400066, 'samples': 12762816, 'steps': 66472, 'loss/train': 1.2042571306228638} 11/07/2021 06:35:23 - INFO - __main__ - Step 66474: {'lr': 0.0003002134522519135, 'samples': 12763008, 'steps': 66473, 'loss/train': 5.399660110473633} 11/07/2021 06:35:23 - INFO - __main__ - Step 66475: {'lr': 0.00030020825364119496, 'samples': 12763200, 'steps': 66474, 'loss/train': 1.322930097579956} 11/07/2021 06:35:23 - INFO - __main__ - Step 66476: {'lr': 0.0003002030550078534, 'samples': 12763392, 'steps': 66475, 'loss/train': 1.705296277999878} 11/07/2021 06:35:24 - INFO - __main__ - Step 66477: {'lr': 0.0003001978563518911, 'samples': 12763584, 'steps': 66476, 'loss/train': 1.6787378787994385} 11/07/2021 06:35:25 - INFO - __main__ - Step 66478: {'lr': 0.0003001926576733104, 'samples': 12763776, 'steps': 66477, 'loss/train': 1.557503581047058} 11/07/2021 06:35:25 - INFO - __main__ - Step 66479: {'lr': 0.00030018745897211367, 'samples': 12763968, 'steps': 66478, 'loss/train': 0.910333514213562} 11/07/2021 06:35:26 - INFO - __main__ - Step 66480: {'lr': 0.0003001822602483033, 'samples': 12764160, 'steps': 66479, 'loss/train': 1.4036084413528442} 11/07/2021 06:35:26 - INFO - __main__ - Step 66481: {'lr': 0.0003001770615018815, 'samples': 12764352, 'steps': 66480, 'loss/train': 3.0230703353881836} 11/07/2021 06:35:26 - INFO - __main__ - Step 66482: {'lr': 0.0003001718627328507, 'samples': 12764544, 'steps': 66481, 'loss/train': 1.3377385139465332} 11/07/2021 06:35:27 - INFO - __main__ - Step 66483: {'lr': 0.0003001666639412133, 'samples': 12764736, 'steps': 66482, 'loss/train': 1.4664568901062012} 11/07/2021 06:35:28 - INFO - __main__ - Step 66484: {'lr': 0.0003001614651269715, 'samples': 12764928, 'steps': 66483, 'loss/train': 1.4603981971740723} 11/07/2021 06:35:28 - INFO - __main__ - Step 66485: {'lr': 0.00030015626629012774, 'samples': 12765120, 'steps': 66484, 'loss/train': 1.5789284706115723} 11/07/2021 06:35:28 - INFO - __main__ - Step 66486: {'lr': 0.00030015106743068443, 'samples': 12765312, 'steps': 66485, 'loss/train': 1.2661681175231934} 11/07/2021 06:35:29 - INFO - __main__ - Step 66487: {'lr': 0.00030014586854864374, 'samples': 12765504, 'steps': 66486, 'loss/train': 0.9853333234786987} 11/07/2021 06:35:29 - INFO - __main__ - Step 66488: {'lr': 0.0003001406696440081, 'samples': 12765696, 'steps': 66487, 'loss/train': 2.0446629524230957} 11/07/2021 06:35:30 - INFO - __main__ - Step 66489: {'lr': 0.00030013547071677983, 'samples': 12765888, 'steps': 66488, 'loss/train': 1.9378362894058228} 11/07/2021 06:35:31 - INFO - __main__ - Step 66490: {'lr': 0.0003001302717669613, 'samples': 12766080, 'steps': 66489, 'loss/train': 0.296688050031662} 11/07/2021 06:35:31 - INFO - __main__ - Step 66491: {'lr': 0.0003001250727945549, 'samples': 12766272, 'steps': 66490, 'loss/train': 1.3709726333618164} 11/07/2021 06:35:32 - INFO - __main__ - Step 66492: {'lr': 0.0003001198737995628, 'samples': 12766464, 'steps': 66491, 'loss/train': 1.3700075149536133} 11/07/2021 06:35:32 - INFO - __main__ - Step 66493: {'lr': 0.00030011467478198764, 'samples': 12766656, 'steps': 66492, 'loss/train': 1.6565260887145996} 11/07/2021 06:35:33 - INFO - __main__ - Step 66494: {'lr': 0.00030010947574183146, 'samples': 12766848, 'steps': 66493, 'loss/train': 1.720242977142334} 11/07/2021 06:35:33 - INFO - __main__ - Step 66495: {'lr': 0.00030010427667909666, 'samples': 12767040, 'steps': 66494, 'loss/train': 1.274006724357605} 11/07/2021 06:35:34 - INFO - __main__ - Step 66496: {'lr': 0.00030009907759378574, 'samples': 12767232, 'steps': 66495, 'loss/train': 1.4504820108413696} 11/07/2021 06:35:34 - INFO - __main__ - Step 66497: {'lr': 0.0003000938784859009, 'samples': 12767424, 'steps': 66496, 'loss/train': 0.9307864904403687} 11/07/2021 06:35:34 - INFO - __main__ - Step 66498: {'lr': 0.00030008867935544457, 'samples': 12767616, 'steps': 66497, 'loss/train': 1.708404302597046} 11/07/2021 06:35:35 - INFO - __main__ - Step 66499: {'lr': 0.0003000834802024191, 'samples': 12767808, 'steps': 66498, 'loss/train': 0.8875890970230103} 11/07/2021 06:35:36 - INFO - __main__ - Step 66500: {'lr': 0.0003000782810268267, 'samples': 12768000, 'steps': 66499, 'loss/train': 1.4397369623184204} 11/07/2021 06:35:36 - INFO - __main__ - Step 66501: {'lr': 0.0003000730818286698, 'samples': 12768192, 'steps': 66500, 'loss/train': 0.490379273891449} 11/07/2021 06:35:37 - INFO - __main__ - Step 66502: {'lr': 0.0003000678826079508, 'samples': 12768384, 'steps': 66501, 'loss/train': 1.6283694505691528} 11/07/2021 06:35:37 - INFO - __main__ - Step 66503: {'lr': 0.00030006268336467195, 'samples': 12768576, 'steps': 66502, 'loss/train': 1.101263403892517} 11/07/2021 06:35:37 - INFO - __main__ - Step 66504: {'lr': 0.0003000574840988357, 'samples': 12768768, 'steps': 66503, 'loss/train': 1.6629441976547241} 11/07/2021 06:35:38 - INFO - __main__ - Step 66505: {'lr': 0.00030005228481044414, 'samples': 12768960, 'steps': 66504, 'loss/train': 1.559578537940979} 11/07/2021 06:35:39 - INFO - __main__ - Step 66506: {'lr': 0.0003000470854995, 'samples': 12769152, 'steps': 66505, 'loss/train': 0.9264603853225708} 11/07/2021 06:35:39 - INFO - __main__ - Step 66507: {'lr': 0.0003000418861660053, 'samples': 12769344, 'steps': 66506, 'loss/train': 1.1376709938049316} 11/07/2021 06:35:39 - INFO - __main__ - Step 66508: {'lr': 0.0003000366868099625, 'samples': 12769536, 'steps': 66507, 'loss/train': 0.8077117204666138} 11/07/2021 06:35:40 - INFO - __main__ - Step 66509: {'lr': 0.000300031487431374, 'samples': 12769728, 'steps': 66508, 'loss/train': 1.5601677894592285} 11/07/2021 06:35:41 - INFO - __main__ - Step 66510: {'lr': 0.000300026288030242, 'samples': 12769920, 'steps': 66509, 'loss/train': 1.87398362159729} 11/07/2021 06:35:41 - INFO - __main__ - Step 66511: {'lr': 0.00030002108860656895, 'samples': 12770112, 'steps': 66510, 'loss/train': 1.0365922451019287} 11/07/2021 06:35:41 - INFO - __main__ - Step 66512: {'lr': 0.0003000158891603572, 'samples': 12770304, 'steps': 66511, 'loss/train': 1.6755969524383545} 11/07/2021 06:35:42 - INFO - __main__ - Step 66513: {'lr': 0.00030001068969160913, 'samples': 12770496, 'steps': 66512, 'loss/train': 1.4222900867462158} 11/07/2021 06:35:42 - INFO - __main__ - Step 66514: {'lr': 0.0003000054902003269, 'samples': 12770688, 'steps': 66513, 'loss/train': 1.504391074180603} 11/07/2021 06:35:43 - INFO - __main__ - Step 66515: {'lr': 0.00030000029068651303, 'samples': 12770880, 'steps': 66514, 'loss/train': 1.423309087753296} 11/07/2021 06:35:44 - INFO - __main__ - Step 66516: {'lr': 0.00029999509115016977, 'samples': 12771072, 'steps': 66515, 'loss/train': 1.4119130373001099} 11/07/2021 06:35:44 - INFO - __main__ - Step 66517: {'lr': 0.00029998989159129945, 'samples': 12771264, 'steps': 66516, 'loss/train': 1.4071619510650635} 11/07/2021 06:35:44 - INFO - __main__ - Step 66518: {'lr': 0.0002999846920099045, 'samples': 12771456, 'steps': 66517, 'loss/train': 1.4315712451934814} 11/07/2021 06:35:45 - INFO - __main__ - Step 66519: {'lr': 0.0002999794924059872, 'samples': 12771648, 'steps': 66518, 'loss/train': 1.4701224565505981} 11/07/2021 06:35:46 - INFO - __main__ - Step 66520: {'lr': 0.00029997429277955, 'samples': 12771840, 'steps': 66519, 'loss/train': 1.3891555070877075} 11/07/2021 06:35:46 - INFO - __main__ - Step 66521: {'lr': 0.0002999690931305951, 'samples': 12772032, 'steps': 66520, 'loss/train': 1.1938488483428955} 11/07/2021 06:35:46 - INFO - __main__ - Step 66522: {'lr': 0.00029996389345912487, 'samples': 12772224, 'steps': 66521, 'loss/train': 1.860782265663147} 11/07/2021 06:35:47 - INFO - __main__ - Step 66523: {'lr': 0.0002999586937651417, 'samples': 12772416, 'steps': 66522, 'loss/train': 0.9962335824966431} 11/07/2021 06:35:47 - INFO - __main__ - Step 66524: {'lr': 0.0002999534940486479, 'samples': 12772608, 'steps': 66523, 'loss/train': 1.6398015022277832} 11/07/2021 06:35:48 - INFO - __main__ - Step 66525: {'lr': 0.00029994829430964585, 'samples': 12772800, 'steps': 66524, 'loss/train': 1.6307404041290283} 11/07/2021 06:35:49 - INFO - __main__ - Step 66526: {'lr': 0.00029994309454813787, 'samples': 12772992, 'steps': 66525, 'loss/train': 1.5788381099700928} 11/07/2021 06:35:49 - INFO - __main__ - Step 66527: {'lr': 0.0002999378947641263, 'samples': 12773184, 'steps': 66526, 'loss/train': 0.38842490315437317} 11/07/2021 06:35:49 - INFO - __main__ - Step 66528: {'lr': 0.00029993269495761347, 'samples': 12773376, 'steps': 66527, 'loss/train': 2.418938159942627} 11/07/2021 06:35:50 - INFO - __main__ - Step 66529: {'lr': 0.0002999274951286017, 'samples': 12773568, 'steps': 66528, 'loss/train': 1.6499388217926025} 11/07/2021 06:35:50 - INFO - __main__ - Step 66530: {'lr': 0.00029992229527709346, 'samples': 12773760, 'steps': 66529, 'loss/train': 1.5943024158477783} 11/07/2021 06:35:51 - INFO - __main__ - Step 66531: {'lr': 0.000299917095403091, 'samples': 12773952, 'steps': 66530, 'loss/train': 1.7574938535690308} 11/07/2021 06:35:51 - INFO - __main__ - Step 66532: {'lr': 0.0002999118955065966, 'samples': 12774144, 'steps': 66531, 'loss/train': 1.3951570987701416} 11/07/2021 06:35:52 - INFO - __main__ - Step 66533: {'lr': 0.00029990669558761275, 'samples': 12774336, 'steps': 66532, 'loss/train': 1.6999765634536743} 11/07/2021 06:35:52 - INFO - __main__ - Step 66534: {'lr': 0.00029990149564614163, 'samples': 12774528, 'steps': 66533, 'loss/train': 1.4811875820159912} 11/07/2021 06:35:53 - INFO - __main__ - Step 66535: {'lr': 0.0002998962956821857, 'samples': 12774720, 'steps': 66534, 'loss/train': 1.310868263244629} 11/07/2021 06:35:53 - INFO - __main__ - Step 66536: {'lr': 0.0002998910956957472, 'samples': 12774912, 'steps': 66535, 'loss/train': 1.527773141860962} 11/07/2021 06:35:54 - INFO - __main__ - Step 66537: {'lr': 0.0002998858956868287, 'samples': 12775104, 'steps': 66536, 'loss/train': 1.4420627355575562} 11/07/2021 06:35:54 - INFO - __main__ - Step 66538: {'lr': 0.0002998806956554322, 'samples': 12775296, 'steps': 66537, 'loss/train': 1.6613962650299072} 11/07/2021 06:35:55 - INFO - __main__ - Step 66539: {'lr': 0.0002998754956015604, 'samples': 12775488, 'steps': 66538, 'loss/train': 1.502551794052124} 11/07/2021 06:35:55 - INFO - __main__ - Step 66540: {'lr': 0.0002998702955252154, 'samples': 12775680, 'steps': 66539, 'loss/train': 1.5565112829208374} 11/07/2021 06:35:56 - INFO - __main__ - Step 66541: {'lr': 0.00029986509542639955, 'samples': 12775872, 'steps': 66540, 'loss/train': 1.6070361137390137} 11/07/2021 06:35:57 - INFO - __main__ - Step 66542: {'lr': 0.00029985989530511534, 'samples': 12776064, 'steps': 66541, 'loss/train': 0.886452317237854} 11/07/2021 06:35:57 - INFO - __main__ - Step 66543: {'lr': 0.000299854695161365, 'samples': 12776256, 'steps': 66542, 'loss/train': 1.8088955879211426} 11/07/2021 06:35:57 - INFO - __main__ - Step 66544: {'lr': 0.00029984949499515097, 'samples': 12776448, 'steps': 66543, 'loss/train': 1.4069100618362427} 11/07/2021 06:35:58 - INFO - __main__ - Step 66545: {'lr': 0.00029984429480647547, 'samples': 12776640, 'steps': 66544, 'loss/train': 1.0144082307815552} 11/07/2021 06:35:59 - INFO - __main__ - Step 66546: {'lr': 0.0002998390945953409, 'samples': 12776832, 'steps': 66545, 'loss/train': 0.1167265772819519} 11/07/2021 06:35:59 - INFO - __main__ - Step 66547: {'lr': 0.0002998338943617496, 'samples': 12777024, 'steps': 66546, 'loss/train': 0.9750322103500366} 11/07/2021 06:35:59 - INFO - __main__ - Step 66548: {'lr': 0.0002998286941057038, 'samples': 12777216, 'steps': 66547, 'loss/train': 1.3699275255203247} 11/07/2021 06:36:00 - INFO - __main__ - Step 66549: {'lr': 0.00029982349382720613, 'samples': 12777408, 'steps': 66548, 'loss/train': 1.469891905784607} 11/07/2021 06:36:00 - INFO - __main__ - Step 66550: {'lr': 0.00029981829352625873, 'samples': 12777600, 'steps': 66549, 'loss/train': 1.881303071975708} 11/07/2021 06:36:01 - INFO - __main__ - Step 66551: {'lr': 0.000299813093202864, 'samples': 12777792, 'steps': 66550, 'loss/train': 1.8165940046310425} 11/07/2021 06:36:02 - INFO - __main__ - Step 66552: {'lr': 0.0002998078928570241, 'samples': 12777984, 'steps': 66551, 'loss/train': 0.8949316143989563} 11/07/2021 06:36:02 - INFO - __main__ - Step 66553: {'lr': 0.0002998026924887417, 'samples': 12778176, 'steps': 66552, 'loss/train': 1.7926430702209473} 11/07/2021 06:36:02 - INFO - __main__ - Step 66554: {'lr': 0.00029979749209801894, 'samples': 12778368, 'steps': 66553, 'loss/train': 1.4109517335891724} 11/07/2021 06:36:03 - INFO - __main__ - Step 66555: {'lr': 0.00029979229168485824, 'samples': 12778560, 'steps': 66554, 'loss/train': 1.7995214462280273} 11/07/2021 06:36:04 - INFO - __main__ - Step 66556: {'lr': 0.00029978709124926176, 'samples': 12778752, 'steps': 66555, 'loss/train': 1.1849400997161865} 11/07/2021 06:36:04 - INFO - __main__ - Step 66557: {'lr': 0.00029978189079123206, 'samples': 12778944, 'steps': 66556, 'loss/train': 1.301964282989502} 11/07/2021 06:36:04 - INFO - __main__ - Step 66558: {'lr': 0.0002997766903107714, 'samples': 12779136, 'steps': 66557, 'loss/train': 1.2365516424179077} 11/07/2021 06:36:05 - INFO - __main__ - Step 66559: {'lr': 0.00029977148980788213, 'samples': 12779328, 'steps': 66558, 'loss/train': 1.5096442699432373} 11/07/2021 06:36:05 - INFO - __main__ - Step 66560: {'lr': 0.0002997662892825666, 'samples': 12779520, 'steps': 66559, 'loss/train': 1.3052815198898315} 11/07/2021 06:36:06 - INFO - __main__ - Step 66561: {'lr': 0.0002997610887348272, 'samples': 12779712, 'steps': 66560, 'loss/train': 1.3549045324325562} 11/07/2021 06:36:06 - INFO - __main__ - Step 66562: {'lr': 0.0002997558881646662, 'samples': 12779904, 'steps': 66561, 'loss/train': 1.369839072227478} 11/07/2021 06:36:07 - INFO - __main__ - Step 66563: {'lr': 0.00029975068757208596, 'samples': 12780096, 'steps': 66562, 'loss/train': 1.2545422315597534} 11/07/2021 06:36:07 - INFO - __main__ - Step 66564: {'lr': 0.00029974548695708877, 'samples': 12780288, 'steps': 66563, 'loss/train': 0.9931343197822571} 11/07/2021 06:36:08 - INFO - __main__ - Step 66565: {'lr': 0.0002997402863196771, 'samples': 12780480, 'steps': 66564, 'loss/train': 1.3242884874343872} 11/07/2021 06:36:09 - INFO - __main__ - Step 66566: {'lr': 0.00029973508565985316, 'samples': 12780672, 'steps': 66565, 'loss/train': 1.4173847436904907} 11/07/2021 06:36:09 - INFO - __main__ - Step 66567: {'lr': 0.00029972988497761944, 'samples': 12780864, 'steps': 66566, 'loss/train': 1.5954455137252808} 11/07/2021 06:36:09 - INFO - __main__ - Step 66568: {'lr': 0.00029972468427297814, 'samples': 12781056, 'steps': 66567, 'loss/train': 1.0396016836166382} 11/07/2021 06:36:10 - INFO - __main__ - Step 66569: {'lr': 0.0002997194835459317, 'samples': 12781248, 'steps': 66568, 'loss/train': 1.4876033067703247} 11/07/2021 06:36:10 - INFO - __main__ - Step 66570: {'lr': 0.0002997142827964824, 'samples': 12781440, 'steps': 66569, 'loss/train': 1.3188257217407227} 11/07/2021 06:36:10 - INFO - __main__ - Step 66571: {'lr': 0.0002997090820246326, 'samples': 12781632, 'steps': 66570, 'loss/train': 1.7010259628295898} 11/07/2021 06:36:11 - INFO - __main__ - Step 66572: {'lr': 0.0002997038812303847, 'samples': 12781824, 'steps': 66571, 'loss/train': 1.7384356260299683} 11/07/2021 06:36:12 - INFO - __main__ - Step 66573: {'lr': 0.00029969868041374096, 'samples': 12782016, 'steps': 66572, 'loss/train': 1.2563496828079224} 11/07/2021 06:36:12 - INFO - __main__ - Step 66574: {'lr': 0.00029969347957470375, 'samples': 12782208, 'steps': 66573, 'loss/train': 1.5731861591339111} 11/07/2021 06:36:12 - INFO - __main__ - Step 66575: {'lr': 0.0002996882787132755, 'samples': 12782400, 'steps': 66574, 'loss/train': 1.1826473474502563} 11/07/2021 06:36:13 - INFO - __main__ - Step 66576: {'lr': 0.00029968307782945834, 'samples': 12782592, 'steps': 66575, 'loss/train': 1.324588656425476} 11/07/2021 06:36:14 - INFO - __main__ - Step 66577: {'lr': 0.00029967787692325486, 'samples': 12782784, 'steps': 66576, 'loss/train': 1.6832858324050903} 11/07/2021 06:36:14 - INFO - __main__ - Step 66578: {'lr': 0.0002996726759946673, 'samples': 12782976, 'steps': 66577, 'loss/train': 1.270734190940857} 11/07/2021 06:36:14 - INFO - __main__ - Step 66579: {'lr': 0.00029966747504369794, 'samples': 12783168, 'steps': 66578, 'loss/train': 1.2658019065856934} 11/07/2021 06:36:15 - INFO - __main__ - Step 66580: {'lr': 0.0002996622740703492, 'samples': 12783360, 'steps': 66579, 'loss/train': 1.2948060035705566} 11/07/2021 06:36:15 - INFO - __main__ - Step 66581: {'lr': 0.0002996570730746235, 'samples': 12783552, 'steps': 66580, 'loss/train': 1.0227888822555542} 11/07/2021 06:36:17 - INFO - __main__ - Step 66582: {'lr': 0.000299651872056523, 'samples': 12783744, 'steps': 66581, 'loss/train': 1.2743855714797974} 11/07/2021 06:36:17 - INFO - __main__ - Step 66583: {'lr': 0.0002996466710160501, 'samples': 12783936, 'steps': 66582, 'loss/train': 1.1227153539657593} 11/07/2021 06:36:17 - INFO - __main__ - Step 66584: {'lr': 0.0002996414699532072, 'samples': 12784128, 'steps': 66583, 'loss/train': 1.4747897386550903} 11/07/2021 06:36:18 - INFO - __main__ - Step 66585: {'lr': 0.00029963626886799665, 'samples': 12784320, 'steps': 66584, 'loss/train': 0.08756952732801437} 11/07/2021 06:36:18 - INFO - __main__ - Step 66586: {'lr': 0.0002996310677604208, 'samples': 12784512, 'steps': 66585, 'loss/train': 1.6547976732254028} 11/07/2021 06:36:19 - INFO - __main__ - Step 66587: {'lr': 0.00029962586663048193, 'samples': 12784704, 'steps': 66586, 'loss/train': 1.2268229722976685} 11/07/2021 06:36:20 - INFO - __main__ - Step 66588: {'lr': 0.00029962066547818233, 'samples': 12784896, 'steps': 66587, 'loss/train': 1.3614012002944946} 11/07/2021 06:36:20 - INFO - __main__ - Step 66589: {'lr': 0.0002996154643035245, 'samples': 12785088, 'steps': 66588, 'loss/train': 1.538271427154541} 11/07/2021 06:36:20 - INFO - __main__ - Step 66590: {'lr': 0.00029961026310651066, 'samples': 12785280, 'steps': 66589, 'loss/train': 1.4002732038497925} 11/07/2021 06:36:21 - INFO - __main__ - Step 66591: {'lr': 0.0002996050618871432, 'samples': 12785472, 'steps': 66590, 'loss/train': 1.1478115320205688} 11/07/2021 06:36:21 - INFO - __main__ - Step 66592: {'lr': 0.0002995998606454245, 'samples': 12785664, 'steps': 66591, 'loss/train': 1.4335908889770508} 11/07/2021 06:36:22 - INFO - __main__ - Step 66593: {'lr': 0.0002995946593813569, 'samples': 12785856, 'steps': 66592, 'loss/train': 0.6588929891586304} 11/07/2021 06:36:22 - INFO - __main__ - Step 66594: {'lr': 0.0002995894580949427, 'samples': 12786048, 'steps': 66593, 'loss/train': 1.3150365352630615} 11/07/2021 06:36:23 - INFO - __main__ - Step 66595: {'lr': 0.0002995842567861842, 'samples': 12786240, 'steps': 66594, 'loss/train': 1.4212453365325928} 11/07/2021 06:36:23 - INFO - __main__ - Step 66596: {'lr': 0.00029957905545508384, 'samples': 12786432, 'steps': 66595, 'loss/train': 1.2989754676818848} 11/07/2021 06:36:24 - INFO - __main__ - Step 66597: {'lr': 0.0002995738541016439, 'samples': 12786624, 'steps': 66596, 'loss/train': 1.5981863737106323} 11/07/2021 06:36:24 - INFO - __main__ - Step 66598: {'lr': 0.00029956865272586674, 'samples': 12786816, 'steps': 66597, 'loss/train': 1.3289105892181396} 11/07/2021 06:36:25 - INFO - __main__ - Step 66599: {'lr': 0.0002995634513277547, 'samples': 12787008, 'steps': 66598, 'loss/train': 0.6275550723075867} 11/07/2021 06:36:25 - INFO - __main__ - Step 66600: {'lr': 0.00029955824990731024, 'samples': 12787200, 'steps': 66599, 'loss/train': 1.2694706916809082} 11/07/2021 06:36:26 - INFO - __main__ - Step 66601: {'lr': 0.00029955304846453554, 'samples': 12787392, 'steps': 66600, 'loss/train': 1.751028299331665} 11/07/2021 06:36:26 - INFO - __main__ - Step 66602: {'lr': 0.00029954784699943294, 'samples': 12787584, 'steps': 66601, 'loss/train': 1.4270350933074951} 11/07/2021 06:36:27 - INFO - __main__ - Step 66603: {'lr': 0.0002995426455120049, 'samples': 12787776, 'steps': 66602, 'loss/train': 1.9400126934051514} 11/07/2021 06:36:27 - INFO - __main__ - Step 66604: {'lr': 0.00029953744400225364, 'samples': 12787968, 'steps': 66603, 'loss/train': 1.212218999862671} 11/07/2021 06:36:28 - INFO - __main__ - Step 66605: {'lr': 0.0002995322424701816, 'samples': 12788160, 'steps': 66604, 'loss/train': 1.4173436164855957} 11/07/2021 06:36:28 - INFO - __main__ - Step 66606: {'lr': 0.00029952704091579116, 'samples': 12788352, 'steps': 66605, 'loss/train': 0.22499550879001617} 11/07/2021 06:36:28 - INFO - __main__ - Step 66607: {'lr': 0.00029952183933908464, 'samples': 12788544, 'steps': 66606, 'loss/train': 1.5637128353118896} 11/07/2021 06:36:29 - INFO - __main__ - Step 66608: {'lr': 0.0002995166377400642, 'samples': 12788736, 'steps': 66607, 'loss/train': 1.1366767883300781} 11/07/2021 06:36:30 - INFO - __main__ - Step 66609: {'lr': 0.0002995114361187324, 'samples': 12788928, 'steps': 66608, 'loss/train': 1.4263811111450195} 11/07/2021 06:36:30 - INFO - __main__ - Step 66610: {'lr': 0.00029950623447509147, 'samples': 12789120, 'steps': 66609, 'loss/train': 1.3754113912582397} 11/07/2021 06:36:30 - INFO - __main__ - Step 66611: {'lr': 0.00029950103280914383, 'samples': 12789312, 'steps': 66610, 'loss/train': 1.7086067199707031} 11/07/2021 06:36:31 - INFO - __main__ - Step 66612: {'lr': 0.00029949583112089177, 'samples': 12789504, 'steps': 66611, 'loss/train': 1.2000460624694824} 11/07/2021 06:36:32 - INFO - __main__ - Step 66613: {'lr': 0.00029949062941033767, 'samples': 12789696, 'steps': 66612, 'loss/train': 1.1605932712554932} 11/07/2021 06:36:32 - INFO - __main__ - Step 66614: {'lr': 0.00029948542767748386, 'samples': 12789888, 'steps': 66613, 'loss/train': 1.5405899286270142} 11/07/2021 06:36:32 - INFO - __main__ - Step 66615: {'lr': 0.0002994802259223327, 'samples': 12790080, 'steps': 66614, 'loss/train': 1.2864160537719727} 11/07/2021 06:36:33 - INFO - __main__ - Step 66616: {'lr': 0.00029947502414488645, 'samples': 12790272, 'steps': 66615, 'loss/train': 1.8200397491455078} 11/07/2021 06:36:33 - INFO - __main__ - Step 66617: {'lr': 0.00029946982234514756, 'samples': 12790464, 'steps': 66616, 'loss/train': 1.2538164854049683} 11/07/2021 06:36:34 - INFO - __main__ - Step 66618: {'lr': 0.00029946462052311834, 'samples': 12790656, 'steps': 66617, 'loss/train': 1.1426197290420532} 11/07/2021 06:36:35 - INFO - __main__ - Step 66619: {'lr': 0.0002994594186788011, 'samples': 12790848, 'steps': 66618, 'loss/train': 1.2103958129882812} 11/07/2021 06:36:35 - INFO - __main__ - Step 66620: {'lr': 0.00029945421681219824, 'samples': 12791040, 'steps': 66619, 'loss/train': 1.6401814222335815} 11/07/2021 06:36:35 - INFO - __main__ - Step 66621: {'lr': 0.00029944901492331207, 'samples': 12791232, 'steps': 66620, 'loss/train': 1.2487449645996094} 11/07/2021 06:36:36 - INFO - __main__ - Step 66622: {'lr': 0.0002994438130121449, 'samples': 12791424, 'steps': 66621, 'loss/train': 1.6116596460342407} 11/07/2021 06:36:37 - INFO - __main__ - Step 66623: {'lr': 0.0002994386110786991, 'samples': 12791616, 'steps': 66622, 'loss/train': 1.3472295999526978} 11/07/2021 06:36:37 - INFO - __main__ - Step 66624: {'lr': 0.0002994334091229771, 'samples': 12791808, 'steps': 66623, 'loss/train': 1.2425020933151245} 11/07/2021 06:36:37 - INFO - __main__ - Step 66625: {'lr': 0.0002994282071449811, 'samples': 12792000, 'steps': 66624, 'loss/train': 1.2170974016189575} 11/07/2021 06:36:38 - INFO - __main__ - Step 66626: {'lr': 0.00029942300514471354, 'samples': 12792192, 'steps': 66625, 'loss/train': 1.4998339414596558} 11/07/2021 06:36:38 - INFO - __main__ - Step 66627: {'lr': 0.00029941780312217674, 'samples': 12792384, 'steps': 66626, 'loss/train': 1.1002341508865356} 11/07/2021 06:36:39 - INFO - __main__ - Step 66628: {'lr': 0.0002994126010773731, 'samples': 12792576, 'steps': 66627, 'loss/train': 1.2171169519424438} 11/07/2021 06:36:39 - INFO - __main__ - Step 66629: {'lr': 0.0002994073990103048, 'samples': 12792768, 'steps': 66628, 'loss/train': 1.437870740890503} 11/07/2021 06:36:40 - INFO - __main__ - Step 66630: {'lr': 0.0002994021969209743, 'samples': 12792960, 'steps': 66629, 'loss/train': 1.5274627208709717} 11/07/2021 06:36:40 - INFO - __main__ - Step 66631: {'lr': 0.0002993969948093839, 'samples': 12793152, 'steps': 66630, 'loss/train': 1.6669002771377563} 11/07/2021 06:36:40 - INFO - __main__ - Step 66632: {'lr': 0.0002993917926755361, 'samples': 12793344, 'steps': 66631, 'loss/train': 1.6844826936721802} 11/07/2021 06:36:42 - INFO - __main__ - Step 66633: {'lr': 0.000299386590519433, 'samples': 12793536, 'steps': 66632, 'loss/train': 1.7176611423492432} 11/07/2021 06:36:42 - INFO - __main__ - Step 66634: {'lr': 0.0002993813883410772, 'samples': 12793728, 'steps': 66633, 'loss/train': 1.3366267681121826} 11/07/2021 06:36:42 - INFO - __main__ - Step 66635: {'lr': 0.0002993761861404708, 'samples': 12793920, 'steps': 66634, 'loss/train': 0.6345985531806946} 11/07/2021 06:36:43 - INFO - __main__ - Step 66636: {'lr': 0.0002993709839176163, 'samples': 12794112, 'steps': 66635, 'loss/train': 1.2000610828399658} 11/07/2021 06:36:43 - INFO - __main__ - Step 66637: {'lr': 0.00029936578167251594, 'samples': 12794304, 'steps': 66636, 'loss/train': 1.6508160829544067} 11/07/2021 06:36:44 - INFO - __main__ - Step 66638: {'lr': 0.00029936057940517215, 'samples': 12794496, 'steps': 66637, 'loss/train': 1.283996343612671} 11/07/2021 06:36:44 - INFO - __main__ - Step 66639: {'lr': 0.00029935537711558725, 'samples': 12794688, 'steps': 66638, 'loss/train': 2.014195203781128} 11/07/2021 06:36:45 - INFO - __main__ - Step 66640: {'lr': 0.00029935017480376357, 'samples': 12794880, 'steps': 66639, 'loss/train': 1.6264197826385498} 11/07/2021 06:36:45 - INFO - __main__ - Step 66641: {'lr': 0.00029934497246970356, 'samples': 12795072, 'steps': 66640, 'loss/train': 1.6216342449188232} 11/07/2021 06:36:45 - INFO - __main__ - Step 66642: {'lr': 0.0002993397701134093, 'samples': 12795264, 'steps': 66641, 'loss/train': 0.8352605104446411} 11/07/2021 06:36:46 - INFO - __main__ - Step 66643: {'lr': 0.0002993345677348834, 'samples': 12795456, 'steps': 66642, 'loss/train': 1.6806817054748535} 11/07/2021 06:36:47 - INFO - __main__ - Step 66644: {'lr': 0.00029932936533412806, 'samples': 12795648, 'steps': 66643, 'loss/train': 1.536901593208313} 11/07/2021 06:36:47 - INFO - __main__ - Step 66645: {'lr': 0.00029932416291114574, 'samples': 12795840, 'steps': 66644, 'loss/train': 2.069255828857422} 11/07/2021 06:36:47 - INFO - __main__ - Step 66646: {'lr': 0.00029931896046593863, 'samples': 12796032, 'steps': 66645, 'loss/train': 1.263771891593933} 11/07/2021 06:36:48 - INFO - __main__ - Step 66647: {'lr': 0.00029931375799850923, 'samples': 12796224, 'steps': 66646, 'loss/train': 1.4258438348770142} 11/07/2021 06:36:48 - INFO - __main__ - Step 66648: {'lr': 0.0002993085555088598, 'samples': 12796416, 'steps': 66647, 'loss/train': 1.154746413230896} 11/07/2021 06:36:49 - INFO - __main__ - Step 66649: {'lr': 0.0002993033529969927, 'samples': 12796608, 'steps': 66648, 'loss/train': 2.0846140384674072} 11/07/2021 06:36:50 - INFO - __main__ - Step 66650: {'lr': 0.0002992981504629102, 'samples': 12796800, 'steps': 66649, 'loss/train': 1.2221615314483643} 11/07/2021 06:36:50 - INFO - __main__ - Step 66651: {'lr': 0.00029929294790661474, 'samples': 12796992, 'steps': 66650, 'loss/train': 1.6330254077911377} 11/07/2021 06:36:50 - INFO - __main__ - Step 66652: {'lr': 0.00029928774532810866, 'samples': 12797184, 'steps': 66651, 'loss/train': 1.2836331129074097} 11/07/2021 06:36:51 - INFO - __main__ - Step 66653: {'lr': 0.00029928254272739433, 'samples': 12797376, 'steps': 66652, 'loss/train': 1.5296533107757568} 11/07/2021 06:36:52 - INFO - __main__ - Step 66654: {'lr': 0.000299277340104474, 'samples': 12797568, 'steps': 66653, 'loss/train': 1.502974033355713} 11/07/2021 06:36:52 - INFO - __main__ - Step 66655: {'lr': 0.00029927213745935, 'samples': 12797760, 'steps': 66654, 'loss/train': 0.8159122467041016} 11/07/2021 06:36:52 - INFO - __main__ - Step 66656: {'lr': 0.00029926693479202484, 'samples': 12797952, 'steps': 66655, 'loss/train': 1.4643993377685547} 11/07/2021 06:36:53 - INFO - __main__ - Step 66657: {'lr': 0.0002992617321025007, 'samples': 12798144, 'steps': 66656, 'loss/train': 1.2506818771362305} 11/07/2021 06:36:53 - INFO - __main__ - Step 66658: {'lr': 0.00029925652939078, 'samples': 12798336, 'steps': 66657, 'loss/train': 1.3919063806533813} 11/07/2021 06:36:54 - INFO - __main__ - Step 66659: {'lr': 0.0002992513266568651, 'samples': 12798528, 'steps': 66658, 'loss/train': 1.155918002128601} 11/07/2021 06:36:54 - INFO - __main__ - Step 66660: {'lr': 0.00029924612390075817, 'samples': 12798720, 'steps': 66659, 'loss/train': 1.2491143941879272} 11/07/2021 06:36:55 - INFO - __main__ - Step 66661: {'lr': 0.0002992409211224619, 'samples': 12798912, 'steps': 66660, 'loss/train': 1.4071589708328247} 11/07/2021 06:36:55 - INFO - __main__ - Step 66662: {'lr': 0.00029923571832197825, 'samples': 12799104, 'steps': 66661, 'loss/train': 1.441361904144287} 11/07/2021 06:36:55 - INFO - __main__ - Step 66663: {'lr': 0.00029923051549930984, 'samples': 12799296, 'steps': 66662, 'loss/train': 1.315014123916626} 11/07/2021 06:36:56 - INFO - __main__ - Step 66664: {'lr': 0.0002992253126544589, 'samples': 12799488, 'steps': 66663, 'loss/train': 1.5070483684539795} 11/07/2021 06:36:57 - INFO - __main__ - Step 66665: {'lr': 0.0002992201097874278, 'samples': 12799680, 'steps': 66664, 'loss/train': 1.5447150468826294} 11/07/2021 06:36:57 - INFO - __main__ - Step 66666: {'lr': 0.0002992149068982189, 'samples': 12799872, 'steps': 66665, 'loss/train': 1.3013135194778442} 11/07/2021 06:36:57 - INFO - __main__ - Step 66667: {'lr': 0.0002992097039868346, 'samples': 12800064, 'steps': 66666, 'loss/train': 2.3599631786346436} 11/07/2021 06:36:58 - INFO - __main__ - Step 66668: {'lr': 0.000299204501053277, 'samples': 12800256, 'steps': 66667, 'loss/train': 1.5717421770095825} 11/07/2021 06:36:59 - INFO - __main__ - Step 66669: {'lr': 0.00029919929809754865, 'samples': 12800448, 'steps': 66668, 'loss/train': 1.3610479831695557} 11/07/2021 06:36:59 - INFO - __main__ - Step 66670: {'lr': 0.0002991940951196519, 'samples': 12800640, 'steps': 66669, 'loss/train': 1.3340564966201782} 11/07/2021 06:36:59 - INFO - __main__ - Step 66671: {'lr': 0.000299188892119589, 'samples': 12800832, 'steps': 66670, 'loss/train': 1.4163403511047363} 11/07/2021 06:37:00 - INFO - __main__ - Step 66672: {'lr': 0.00029918368909736235, 'samples': 12801024, 'steps': 66671, 'loss/train': 1.5787616968154907} 11/07/2021 06:37:00 - INFO - __main__ - Step 66673: {'lr': 0.0002991784860529744, 'samples': 12801216, 'steps': 66672, 'loss/train': 1.24003267288208} 11/07/2021 06:37:02 - INFO - __main__ - Step 66674: {'lr': 0.00029917328298642733, 'samples': 12801408, 'steps': 66673, 'loss/train': 0.9171093106269836} 11/07/2021 06:37:02 - INFO - __main__ - Step 66675: {'lr': 0.0002991680798977234, 'samples': 12801600, 'steps': 66674, 'loss/train': 1.0253745317459106} 11/07/2021 06:37:03 - INFO - __main__ - Step 66676: {'lr': 0.0002991628767868653, 'samples': 12801792, 'steps': 66675, 'loss/train': 1.6671199798583984} 11/07/2021 06:37:03 - INFO - __main__ - Step 66677: {'lr': 0.000299157673653855, 'samples': 12801984, 'steps': 66676, 'loss/train': 1.273770809173584} 11/07/2021 06:37:03 - INFO - __main__ - Step 66678: {'lr': 0.0002991524704986951, 'samples': 12802176, 'steps': 66677, 'loss/train': 3.9614017009735107} 11/07/2021 06:37:04 - INFO - __main__ - Step 66679: {'lr': 0.0002991472673213879, 'samples': 12802368, 'steps': 66678, 'loss/train': 3.8400394916534424} 11/07/2021 06:37:05 - INFO - __main__ - Step 66680: {'lr': 0.0002991420641219356, 'samples': 12802560, 'steps': 66679, 'loss/train': 1.231750726699829} 11/07/2021 06:37:05 - INFO - __main__ - Step 66681: {'lr': 0.00029913686090034063, 'samples': 12802752, 'steps': 66680, 'loss/train': 1.808229923248291} 11/07/2021 06:37:05 - INFO - __main__ - Step 66682: {'lr': 0.0002991316576566054, 'samples': 12802944, 'steps': 66681, 'loss/train': 1.4643852710723877} 11/07/2021 06:37:06 - INFO - __main__ - Step 66683: {'lr': 0.0002991264543907322, 'samples': 12803136, 'steps': 66682, 'loss/train': 1.079337239265442} 11/07/2021 06:37:06 - INFO - __main__ - Step 66684: {'lr': 0.0002991212511027234, 'samples': 12803328, 'steps': 66683, 'loss/train': 1.4099936485290527} 11/07/2021 06:37:06 - INFO - __main__ - Step 66685: {'lr': 0.0002991160477925813, 'samples': 12803520, 'steps': 66684, 'loss/train': 1.1532272100448608} 11/07/2021 06:37:07 - INFO - __main__ - Step 66686: {'lr': 0.00029911084446030827, 'samples': 12803712, 'steps': 66685, 'loss/train': 1.7582215070724487} 11/07/2021 06:37:08 - INFO - __main__ - Step 66687: {'lr': 0.0002991056411059067, 'samples': 12803904, 'steps': 66686, 'loss/train': 0.9156785011291504} 11/07/2021 06:37:08 - INFO - __main__ - Step 66688: {'lr': 0.0002991004377293788, 'samples': 12804096, 'steps': 66687, 'loss/train': 1.7741464376449585} 11/07/2021 06:37:08 - INFO - __main__ - Step 66689: {'lr': 0.000299095234330727, 'samples': 12804288, 'steps': 66688, 'loss/train': 1.4970356225967407} 11/07/2021 06:37:09 - INFO - __main__ - Step 66690: {'lr': 0.0002990900309099537, 'samples': 12804480, 'steps': 66689, 'loss/train': 1.419610619544983} 11/07/2021 06:37:10 - INFO - __main__ - Step 66691: {'lr': 0.00029908482746706115, 'samples': 12804672, 'steps': 66690, 'loss/train': 1.4580905437469482} 11/07/2021 06:37:10 - INFO - __main__ - Step 66692: {'lr': 0.00029907962400205175, 'samples': 12804864, 'steps': 66691, 'loss/train': 1.480798363685608} 11/07/2021 06:37:11 - INFO - __main__ - Step 66693: {'lr': 0.0002990744205149278, 'samples': 12805056, 'steps': 66692, 'loss/train': 1.357069730758667} 11/07/2021 06:37:11 - INFO - __main__ - Step 66694: {'lr': 0.00029906921700569174, 'samples': 12805248, 'steps': 66693, 'loss/train': 2.1000499725341797} 11/07/2021 06:37:11 - INFO - __main__ - Step 66695: {'lr': 0.00029906401347434586, 'samples': 12805440, 'steps': 66694, 'loss/train': 1.3660266399383545} 11/07/2021 06:37:13 - INFO - __main__ - Step 66696: {'lr': 0.0002990588099208924, 'samples': 12805632, 'steps': 66695, 'loss/train': 1.735975980758667} 11/07/2021 06:37:13 - INFO - __main__ - Step 66697: {'lr': 0.00029905360634533383, 'samples': 12805824, 'steps': 66696, 'loss/train': 1.4672845602035522} 11/07/2021 06:37:13 - INFO - __main__ - Step 66698: {'lr': 0.00029904840274767245, 'samples': 12806016, 'steps': 66697, 'loss/train': 1.2606635093688965} 11/07/2021 06:37:14 - INFO - __main__ - Step 66699: {'lr': 0.0002990431991279107, 'samples': 12806208, 'steps': 66698, 'loss/train': 1.0220121145248413} 11/07/2021 06:37:14 - INFO - __main__ - Step 66700: {'lr': 0.00029903799548605073, 'samples': 12806400, 'steps': 66699, 'loss/train': 0.2708311378955841} 11/07/2021 06:37:15 - INFO - __main__ - Step 66701: {'lr': 0.0002990327918220951, 'samples': 12806592, 'steps': 66700, 'loss/train': 1.5403228998184204} 11/07/2021 06:37:15 - INFO - __main__ - Step 66702: {'lr': 0.000299027588136046, 'samples': 12806784, 'steps': 66701, 'loss/train': 1.302499771118164} 11/07/2021 06:37:16 - INFO - __main__ - Step 66703: {'lr': 0.0002990223844279058, 'samples': 12806976, 'steps': 66702, 'loss/train': 1.8688122034072876} 11/07/2021 06:37:16 - INFO - __main__ - Step 66704: {'lr': 0.00029901718069767693, 'samples': 12807168, 'steps': 66703, 'loss/train': 1.2601574659347534} 11/07/2021 06:37:16 - INFO - __main__ - Step 66705: {'lr': 0.0002990119769453616, 'samples': 12807360, 'steps': 66704, 'loss/train': 1.669143557548523} 11/07/2021 06:37:17 - INFO - __main__ - Step 66706: {'lr': 0.00029900677317096225, 'samples': 12807552, 'steps': 66705, 'loss/train': 1.5028924942016602} 11/07/2021 06:37:18 - INFO - __main__ - Step 66707: {'lr': 0.0002990015693744812, 'samples': 12807744, 'steps': 66706, 'loss/train': 1.1219958066940308} 11/07/2021 06:37:18 - INFO - __main__ - Step 66708: {'lr': 0.00029899636555592087, 'samples': 12807936, 'steps': 66707, 'loss/train': 1.1235257387161255} 11/07/2021 06:37:18 - INFO - __main__ - Step 66709: {'lr': 0.0002989911617152835, 'samples': 12808128, 'steps': 66708, 'loss/train': 1.376554250717163} 11/07/2021 06:37:19 - INFO - __main__ - Step 66710: {'lr': 0.00029898595785257144, 'samples': 12808320, 'steps': 66709, 'loss/train': 1.666764497756958} 11/07/2021 06:37:20 - INFO - __main__ - Step 66711: {'lr': 0.0002989807539677871, 'samples': 12808512, 'steps': 66710, 'loss/train': 1.4726967811584473} 11/07/2021 06:37:20 - INFO - __main__ - Step 66712: {'lr': 0.00029897555006093266, 'samples': 12808704, 'steps': 66711, 'loss/train': 1.1139496564865112} 11/07/2021 06:37:20 - INFO - __main__ - Step 66713: {'lr': 0.0002989703461320107, 'samples': 12808896, 'steps': 66712, 'loss/train': 1.4790623188018799} 11/07/2021 06:37:21 - INFO - __main__ - Step 66714: {'lr': 0.0002989651421810235, 'samples': 12809088, 'steps': 66713, 'loss/train': 1.4524261951446533} 11/07/2021 06:37:21 - INFO - __main__ - Step 66715: {'lr': 0.00029895993820797334, 'samples': 12809280, 'steps': 66714, 'loss/train': 0.9966402053833008} 11/07/2021 06:37:22 - INFO - __main__ - Step 66716: {'lr': 0.0002989547342128626, 'samples': 12809472, 'steps': 66715, 'loss/train': 1.3880884647369385} 11/07/2021 06:37:23 - INFO - __main__ - Step 66717: {'lr': 0.0002989495301956935, 'samples': 12809664, 'steps': 66716, 'loss/train': 1.4738116264343262} 11/07/2021 06:37:23 - INFO - __main__ - Step 66718: {'lr': 0.00029894432615646863, 'samples': 12809856, 'steps': 66717, 'loss/train': 1.5079081058502197} 11/07/2021 06:37:23 - INFO - __main__ - Step 66719: {'lr': 0.0002989391220951901, 'samples': 12810048, 'steps': 66718, 'loss/train': 1.6964551210403442} 11/07/2021 06:37:24 - INFO - __main__ - Step 66720: {'lr': 0.0002989339180118604, 'samples': 12810240, 'steps': 66719, 'loss/train': 1.2050819396972656} 11/07/2021 06:37:24 - INFO - __main__ - Step 66721: {'lr': 0.0002989287139064819, 'samples': 12810432, 'steps': 66720, 'loss/train': 1.4247959852218628} 11/07/2021 06:37:25 - INFO - __main__ - Step 66722: {'lr': 0.0002989235097790568, 'samples': 12810624, 'steps': 66721, 'loss/train': 0.8229020833969116} 11/07/2021 06:37:25 - INFO - __main__ - Step 66723: {'lr': 0.0002989183056295875, 'samples': 12810816, 'steps': 66722, 'loss/train': 1.7214657068252563} 11/07/2021 06:37:26 - INFO - __main__ - Step 66724: {'lr': 0.00029891310145807636, 'samples': 12811008, 'steps': 66723, 'loss/train': 1.5530720949172974} 11/07/2021 06:37:26 - INFO - __main__ - Step 66725: {'lr': 0.00029890789726452576, 'samples': 12811200, 'steps': 66724, 'loss/train': 1.3859883546829224} 11/07/2021 06:37:26 - INFO - __main__ - Step 66726: {'lr': 0.00029890269304893804, 'samples': 12811392, 'steps': 66725, 'loss/train': 5.78005313873291} 11/07/2021 06:37:27 - INFO - __main__ - Step 66727: {'lr': 0.0002988974888113155, 'samples': 12811584, 'steps': 66726, 'loss/train': 1.1997411251068115} 11/07/2021 06:37:28 - INFO - __main__ - Step 66728: {'lr': 0.00029889228455166054, 'samples': 12811776, 'steps': 66727, 'loss/train': 1.6522181034088135} 11/07/2021 06:37:28 - INFO - __main__ - Step 66729: {'lr': 0.00029888708026997547, 'samples': 12811968, 'steps': 66728, 'loss/train': 1.972354769706726} 11/07/2021 06:37:29 - INFO - __main__ - Step 66730: {'lr': 0.0002988818759662626, 'samples': 12812160, 'steps': 66729, 'loss/train': 0.9652682542800903} 11/07/2021 06:37:29 - INFO - __main__ - Step 66731: {'lr': 0.0002988766716405243, 'samples': 12812352, 'steps': 66730, 'loss/train': 1.1850467920303345} 11/07/2021 06:37:29 - INFO - __main__ - Step 66732: {'lr': 0.00029887146729276295, 'samples': 12812544, 'steps': 66731, 'loss/train': 1.701433539390564} 11/07/2021 06:37:30 - INFO - __main__ - Step 66733: {'lr': 0.00029886626292298087, 'samples': 12812736, 'steps': 66732, 'loss/train': 1.2749086618423462} 11/07/2021 06:37:31 - INFO - __main__ - Step 66734: {'lr': 0.0002988610585311804, 'samples': 12812928, 'steps': 66733, 'loss/train': 0.9505759477615356} 11/07/2021 06:37:31 - INFO - __main__ - Step 66735: {'lr': 0.0002988558541173639, 'samples': 12813120, 'steps': 66734, 'loss/train': 1.4040273427963257} 11/07/2021 06:37:31 - INFO - __main__ - Step 66736: {'lr': 0.0002988506496815337, 'samples': 12813312, 'steps': 66735, 'loss/train': 1.42183518409729} 11/07/2021 06:37:32 - INFO - __main__ - Step 66737: {'lr': 0.00029884544522369217, 'samples': 12813504, 'steps': 66736, 'loss/train': 1.3288339376449585} 11/07/2021 06:37:33 - INFO - __main__ - Step 66738: {'lr': 0.00029884024074384156, 'samples': 12813696, 'steps': 66737, 'loss/train': 0.9100958704948425} 11/07/2021 06:37:33 - INFO - __main__ - Step 66739: {'lr': 0.00029883503624198436, 'samples': 12813888, 'steps': 66738, 'loss/train': 1.2472515106201172} 11/07/2021 06:37:33 - INFO - __main__ - Step 66740: {'lr': 0.00029882983171812283, 'samples': 12814080, 'steps': 66739, 'loss/train': 1.5223662853240967} 11/07/2021 06:37:34 - INFO - __main__ - Step 66741: {'lr': 0.0002988246271722594, 'samples': 12814272, 'steps': 66740, 'loss/train': 1.2772796154022217} 11/07/2021 06:37:34 - INFO - __main__ - Step 66742: {'lr': 0.0002988194226043963, 'samples': 12814464, 'steps': 66741, 'loss/train': 1.1423511505126953} 11/07/2021 06:37:35 - INFO - __main__ - Step 66743: {'lr': 0.0002988142180145359, 'samples': 12814656, 'steps': 66742, 'loss/train': 1.6041680574417114} 11/07/2021 06:37:35 - INFO - __main__ - Step 66744: {'lr': 0.00029880901340268053, 'samples': 12814848, 'steps': 66743, 'loss/train': 1.2186695337295532} 11/07/2021 06:37:36 - INFO - __main__ - Step 66745: {'lr': 0.0002988038087688326, 'samples': 12815040, 'steps': 66744, 'loss/train': 1.964593529701233} 11/07/2021 06:37:36 - INFO - __main__ - Step 66746: {'lr': 0.0002987986041129944, 'samples': 12815232, 'steps': 66745, 'loss/train': 1.5128560066223145} 11/07/2021 06:37:37 - INFO - __main__ - Step 66747: {'lr': 0.00029879339943516837, 'samples': 12815424, 'steps': 66746, 'loss/train': 1.5840238332748413} 11/07/2021 06:37:38 - INFO - __main__ - Step 66748: {'lr': 0.00029878819473535677, 'samples': 12815616, 'steps': 66747, 'loss/train': 1.4661997556686401} 11/07/2021 06:37:38 - INFO - __main__ - Step 66749: {'lr': 0.00029878299001356195, 'samples': 12815808, 'steps': 66748, 'loss/train': 1.4429296255111694} 11/07/2021 06:37:38 - INFO - __main__ - Step 66750: {'lr': 0.0002987777852697863, 'samples': 12816000, 'steps': 66749, 'loss/train': 1.508004903793335} 11/07/2021 06:37:39 - INFO - __main__ - Step 66751: {'lr': 0.0002987725805040321, 'samples': 12816192, 'steps': 66750, 'loss/train': 1.5701874494552612} 11/07/2021 06:37:39 - INFO - __main__ - Step 66752: {'lr': 0.0002987673757163017, 'samples': 12816384, 'steps': 66751, 'loss/train': 1.3562641143798828} 11/07/2021 06:37:40 - INFO - __main__ - Step 66753: {'lr': 0.0002987621709065975, 'samples': 12816576, 'steps': 66752, 'loss/train': 0.8534059524536133} 11/07/2021 06:37:40 - INFO - __main__ - Step 66754: {'lr': 0.0002987569660749218, 'samples': 12816768, 'steps': 66753, 'loss/train': 1.6957381963729858} 11/07/2021 06:37:41 - INFO - __main__ - Step 66755: {'lr': 0.0002987517612212771, 'samples': 12816960, 'steps': 66754, 'loss/train': 1.3278379440307617} 11/07/2021 06:37:41 - INFO - __main__ - Step 66756: {'lr': 0.00029874655634566546, 'samples': 12817152, 'steps': 66755, 'loss/train': 1.688977599143982} 11/07/2021 06:37:41 - INFO - __main__ - Step 66757: {'lr': 0.0002987413514480893, 'samples': 12817344, 'steps': 66756, 'loss/train': 1.0279124975204468} 11/07/2021 06:37:42 - INFO - __main__ - Step 66758: {'lr': 0.0002987361465285512, 'samples': 12817536, 'steps': 66757, 'loss/train': 1.5862793922424316} 11/07/2021 06:37:43 - INFO - __main__ - Step 66759: {'lr': 0.00029873094158705326, 'samples': 12817728, 'steps': 66758, 'loss/train': 1.3464899063110352} 11/07/2021 06:37:43 - INFO - __main__ - Step 66760: {'lr': 0.00029872573662359796, 'samples': 12817920, 'steps': 66759, 'loss/train': 0.8787490725517273} 11/07/2021 06:37:43 - INFO - __main__ - Step 66761: {'lr': 0.00029872053163818756, 'samples': 12818112, 'steps': 66760, 'loss/train': 0.5798304677009583} 11/07/2021 06:37:44 - INFO - __main__ - Step 66762: {'lr': 0.0002987153266308245, 'samples': 12818304, 'steps': 66761, 'loss/train': 1.4357562065124512} 11/07/2021 06:37:44 - INFO - __main__ - Step 66763: {'lr': 0.000298710121601511, 'samples': 12818496, 'steps': 66762, 'loss/train': 0.9775696396827698} 11/07/2021 06:37:45 - INFO - __main__ - Step 66764: {'lr': 0.0002987049165502495, 'samples': 12818688, 'steps': 66763, 'loss/train': 1.6725903749465942} 11/07/2021 06:37:46 - INFO - __main__ - Step 66765: {'lr': 0.0002986997114770423, 'samples': 12818880, 'steps': 66764, 'loss/train': 1.4881025552749634} 11/07/2021 06:37:46 - INFO - __main__ - Step 66766: {'lr': 0.0002986945063818918, 'samples': 12819072, 'steps': 66765, 'loss/train': 0.9914194345474243} 11/07/2021 06:37:46 - INFO - __main__ - Step 66767: {'lr': 0.0002986893012648002, 'samples': 12819264, 'steps': 66766, 'loss/train': 1.6973949670791626} 11/07/2021 06:37:47 - INFO - __main__ - Step 66768: {'lr': 0.00029868409612577007, 'samples': 12819456, 'steps': 66767, 'loss/train': 1.3088346719741821} 11/07/2021 06:37:48 - INFO - __main__ - Step 66769: {'lr': 0.0002986788909648036, 'samples': 12819648, 'steps': 66768, 'loss/train': 0.6884896755218506} 11/07/2021 06:37:48 - INFO - __main__ - Step 66770: {'lr': 0.00029867368578190317, 'samples': 12819840, 'steps': 66769, 'loss/train': 1.2965028285980225} 11/07/2021 06:37:48 - INFO - __main__ - Step 66771: {'lr': 0.0002986684805770711, 'samples': 12820032, 'steps': 66770, 'loss/train': 1.8383502960205078} 11/07/2021 06:37:49 - INFO - __main__ - Step 66772: {'lr': 0.0002986632753503098, 'samples': 12820224, 'steps': 66771, 'loss/train': 1.7567877769470215} 11/07/2021 06:37:49 - INFO - __main__ - Step 66773: {'lr': 0.00029865807010162154, 'samples': 12820416, 'steps': 66772, 'loss/train': 1.7771296501159668} 11/07/2021 06:37:50 - INFO - __main__ - Step 66774: {'lr': 0.0002986528648310087, 'samples': 12820608, 'steps': 66773, 'loss/train': 1.4823932647705078} 11/07/2021 06:37:50 - INFO - __main__ - Step 66775: {'lr': 0.0002986476595384738, 'samples': 12820800, 'steps': 66774, 'loss/train': 1.494023323059082} 11/07/2021 06:37:51 - INFO - __main__ - Step 66776: {'lr': 0.0002986424542240188, 'samples': 12820992, 'steps': 66775, 'loss/train': 1.2817413806915283} 11/07/2021 06:37:51 - INFO - __main__ - Step 66777: {'lr': 0.00029863724888764637, 'samples': 12821184, 'steps': 66776, 'loss/train': 1.4801435470581055} 11/07/2021 06:37:51 - INFO - __main__ - Step 66778: {'lr': 0.0002986320435293587, 'samples': 12821376, 'steps': 66777, 'loss/train': 1.575609564781189} 11/07/2021 06:37:53 - INFO - __main__ - Step 66779: {'lr': 0.0002986268381491582, 'samples': 12821568, 'steps': 66778, 'loss/train': 2.5653038024902344} 11/07/2021 06:37:53 - INFO - __main__ - Step 66780: {'lr': 0.0002986216327470472, 'samples': 12821760, 'steps': 66779, 'loss/train': 1.1783537864685059} 11/07/2021 06:37:53 - INFO - __main__ - Step 66781: {'lr': 0.000298616427323028, 'samples': 12821952, 'steps': 66780, 'loss/train': 0.6038593053817749} 11/07/2021 06:37:54 - INFO - __main__ - Step 66782: {'lr': 0.0002986112218771031, 'samples': 12822144, 'steps': 66781, 'loss/train': 1.5662717819213867} 11/07/2021 06:37:54 - INFO - __main__ - Step 66783: {'lr': 0.00029860601640927464, 'samples': 12822336, 'steps': 66782, 'loss/train': 1.4646527767181396} 11/07/2021 06:37:55 - INFO - __main__ - Step 66784: {'lr': 0.00029860081091954505, 'samples': 12822528, 'steps': 66783, 'loss/train': 1.4430545568466187} 11/07/2021 06:37:55 - INFO - __main__ - Step 66785: {'lr': 0.0002985956054079167, 'samples': 12822720, 'steps': 66784, 'loss/train': 1.3270846605300903} 11/07/2021 06:37:56 - INFO - __main__ - Step 66786: {'lr': 0.00029859039987439195, 'samples': 12822912, 'steps': 66785, 'loss/train': 1.2650504112243652} 11/07/2021 06:37:56 - INFO - __main__ - Step 66787: {'lr': 0.00029858519431897305, 'samples': 12823104, 'steps': 66786, 'loss/train': 1.4957919120788574} 11/07/2021 06:37:56 - INFO - __main__ - Step 66788: {'lr': 0.00029857998874166253, 'samples': 12823296, 'steps': 66787, 'loss/train': 1.0199611186981201} 11/07/2021 06:37:57 - INFO - __main__ - Step 66789: {'lr': 0.00029857478314246257, 'samples': 12823488, 'steps': 66788, 'loss/train': 1.5078045129776} 11/07/2021 06:37:58 - INFO - __main__ - Step 66790: {'lr': 0.0002985695775213755, 'samples': 12823680, 'steps': 66789, 'loss/train': 2.096320867538452} 11/07/2021 06:37:58 - INFO - __main__ - Step 66791: {'lr': 0.00029856437187840375, 'samples': 12823872, 'steps': 66790, 'loss/train': 1.8949998617172241} 11/07/2021 06:37:58 - INFO - __main__ - Step 66792: {'lr': 0.00029855916621354965, 'samples': 12824064, 'steps': 66791, 'loss/train': 1.4871115684509277} 11/07/2021 06:37:59 - INFO - __main__ - Step 66793: {'lr': 0.00029855396052681554, 'samples': 12824256, 'steps': 66792, 'loss/train': 1.14815354347229} 11/07/2021 06:38:00 - INFO - __main__ - Step 66794: {'lr': 0.00029854875481820375, 'samples': 12824448, 'steps': 66793, 'loss/train': 1.32339608669281} 11/07/2021 06:38:00 - INFO - __main__ - Step 66795: {'lr': 0.0002985435490877168, 'samples': 12824640, 'steps': 66794, 'loss/train': 1.3935388326644897} 11/07/2021 06:38:00 - INFO - __main__ - Step 66796: {'lr': 0.00029853834333535667, 'samples': 12824832, 'steps': 66795, 'loss/train': 1.6923490762710571} 11/07/2021 06:38:01 - INFO - __main__ - Step 66797: {'lr': 0.000298533137561126, 'samples': 12825024, 'steps': 66796, 'loss/train': 1.6044889688491821} 11/07/2021 06:38:01 - INFO - __main__ - Step 66798: {'lr': 0.00029852793176502704, 'samples': 12825216, 'steps': 66797, 'loss/train': 1.4463905096054077} 11/07/2021 06:38:02 - INFO - __main__ - Step 66799: {'lr': 0.0002985227259470621, 'samples': 12825408, 'steps': 66798, 'loss/train': 1.359729528427124} 11/07/2021 06:38:03 - INFO - __main__ - Step 66800: {'lr': 0.00029851752010723353, 'samples': 12825600, 'steps': 66799, 'loss/train': 2.1115899085998535} 11/07/2021 06:38:03 - INFO - __main__ - Step 66801: {'lr': 0.00029851231424554383, 'samples': 12825792, 'steps': 66800, 'loss/train': 1.0728291273117065} 11/07/2021 06:38:03 - INFO - __main__ - Step 66802: {'lr': 0.00029850710836199526, 'samples': 12825984, 'steps': 66801, 'loss/train': 1.314618706703186} 11/07/2021 06:38:04 - INFO - __main__ - Step 66803: {'lr': 0.00029850190245659, 'samples': 12826176, 'steps': 66802, 'loss/train': 1.0482720136642456} 11/07/2021 06:38:05 - INFO - __main__ - Step 66804: {'lr': 0.0002984966965293306, 'samples': 12826368, 'steps': 66803, 'loss/train': 1.9775843620300293} 11/07/2021 06:38:05 - INFO - __main__ - Step 66805: {'lr': 0.0002984914905802193, 'samples': 12826560, 'steps': 66804, 'loss/train': 1.374058723449707} 11/07/2021 06:38:05 - INFO - __main__ - Step 66806: {'lr': 0.0002984862846092585, 'samples': 12826752, 'steps': 66805, 'loss/train': 1.3550735712051392} 11/07/2021 06:38:06 - INFO - __main__ - Step 66807: {'lr': 0.0002984810786164505, 'samples': 12826944, 'steps': 66806, 'loss/train': 1.3779624700546265} 11/07/2021 06:38:06 - INFO - __main__ - Step 66808: {'lr': 0.00029847587260179776, 'samples': 12827136, 'steps': 66807, 'loss/train': 0.7480705976486206} 11/07/2021 06:38:06 - INFO - __main__ - Step 66809: {'lr': 0.0002984706665653025, 'samples': 12827328, 'steps': 66808, 'loss/train': 1.4802272319793701} 11/07/2021 06:38:08 - INFO - __main__ - Step 66810: {'lr': 0.0002984654605069671, 'samples': 12827520, 'steps': 66809, 'loss/train': 1.0153435468673706} 11/07/2021 06:38:08 - INFO - __main__ - Step 66811: {'lr': 0.00029846025442679394, 'samples': 12827712, 'steps': 66810, 'loss/train': 1.4833327531814575} 11/07/2021 06:38:08 - INFO - __main__ - Step 66812: {'lr': 0.00029845504832478524, 'samples': 12827904, 'steps': 66811, 'loss/train': 1.653140902519226} 11/07/2021 06:38:09 - INFO - __main__ - Step 66813: {'lr': 0.0002984498422009436, 'samples': 12828096, 'steps': 66812, 'loss/train': 1.9291290044784546} 11/07/2021 06:38:09 - INFO - __main__ - Step 66814: {'lr': 0.00029844463605527104, 'samples': 12828288, 'steps': 66813, 'loss/train': 1.2025952339172363} 11/07/2021 06:38:10 - INFO - __main__ - Step 66815: {'lr': 0.0002984394298877702, 'samples': 12828480, 'steps': 66814, 'loss/train': 1.5045051574707031} 11/07/2021 06:38:10 - INFO - __main__ - Step 66816: {'lr': 0.0002984342236984432, 'samples': 12828672, 'steps': 66815, 'loss/train': 1.7253719568252563} 11/07/2021 06:38:11 - INFO - __main__ - Step 66817: {'lr': 0.00029842901748729255, 'samples': 12828864, 'steps': 66816, 'loss/train': 1.642874002456665} 11/07/2021 06:38:11 - INFO - __main__ - Step 66818: {'lr': 0.0002984238112543205, 'samples': 12829056, 'steps': 66817, 'loss/train': 1.5155950784683228} 11/07/2021 06:38:11 - INFO - __main__ - Step 66819: {'lr': 0.0002984186049995295, 'samples': 12829248, 'steps': 66818, 'loss/train': 1.77615225315094} 11/07/2021 06:38:12 - INFO - __main__ - Step 66820: {'lr': 0.0002984133987229218, 'samples': 12829440, 'steps': 66819, 'loss/train': 1.180159330368042} 11/07/2021 06:38:13 - INFO - __main__ - Step 66821: {'lr': 0.0002984081924244997, 'samples': 12829632, 'steps': 66820, 'loss/train': 1.4455320835113525} 11/07/2021 06:38:13 - INFO - __main__ - Step 66822: {'lr': 0.00029840298610426565, 'samples': 12829824, 'steps': 66821, 'loss/train': 1.5049480199813843} 11/07/2021 06:38:13 - INFO - __main__ - Step 66823: {'lr': 0.00029839777976222196, 'samples': 12830016, 'steps': 66822, 'loss/train': 1.6011271476745605} 11/07/2021 06:38:14 - INFO - __main__ - Step 66824: {'lr': 0.0002983925733983711, 'samples': 12830208, 'steps': 66823, 'loss/train': 1.123693823814392} 11/07/2021 06:38:15 - INFO - __main__ - Step 66825: {'lr': 0.00029838736701271514, 'samples': 12830400, 'steps': 66824, 'loss/train': 1.7048213481903076} 11/07/2021 06:38:15 - INFO - __main__ - Step 66826: {'lr': 0.00029838216060525656, 'samples': 12830592, 'steps': 66825, 'loss/train': 1.6973764896392822} 11/07/2021 06:38:15 - INFO - __main__ - Step 66827: {'lr': 0.0002983769541759978, 'samples': 12830784, 'steps': 66826, 'loss/train': 1.4285223484039307} 11/07/2021 06:38:16 - INFO - __main__ - Step 66828: {'lr': 0.00029837174772494107, 'samples': 12830976, 'steps': 66827, 'loss/train': 1.5583018064498901} 11/07/2021 06:38:16 - INFO - __main__ - Step 66829: {'lr': 0.0002983665412520888, 'samples': 12831168, 'steps': 66828, 'loss/train': 1.0996744632720947} 11/07/2021 06:38:17 - INFO - __main__ - Step 66830: {'lr': 0.0002983613347574434, 'samples': 12831360, 'steps': 66829, 'loss/train': 1.0818766355514526} 11/07/2021 06:38:18 - INFO - __main__ - Step 66831: {'lr': 0.00029835612824100706, 'samples': 12831552, 'steps': 66830, 'loss/train': 1.4051878452301025} 11/07/2021 06:38:18 - INFO - __main__ - Step 66832: {'lr': 0.0002983509217027822, 'samples': 12831744, 'steps': 66831, 'loss/train': 1.4165452718734741} 11/07/2021 06:38:18 - INFO - __main__ - Step 66833: {'lr': 0.00029834571514277116, 'samples': 12831936, 'steps': 66832, 'loss/train': 1.0807420015335083} 11/07/2021 06:38:19 - INFO - __main__ - Step 66834: {'lr': 0.0002983405085609763, 'samples': 12832128, 'steps': 66833, 'loss/train': 1.6299808025360107} 11/07/2021 06:38:20 - INFO - __main__ - Step 66835: {'lr': 0.0002983353019573999, 'samples': 12832320, 'steps': 66834, 'loss/train': 1.2102845907211304} 11/07/2021 06:38:20 - INFO - __main__ - Step 66836: {'lr': 0.0002983300953320445, 'samples': 12832512, 'steps': 66835, 'loss/train': 1.2998665571212769} 11/07/2021 06:38:20 - INFO - __main__ - Step 66837: {'lr': 0.00029832488868491216, 'samples': 12832704, 'steps': 66836, 'loss/train': 1.6509366035461426} 11/07/2021 06:38:21 - INFO - __main__ - Step 66838: {'lr': 0.0002983196820160054, 'samples': 12832896, 'steps': 66837, 'loss/train': 1.6138354539871216} 11/07/2021 06:38:21 - INFO - __main__ - Step 66839: {'lr': 0.0002983144753253265, 'samples': 12833088, 'steps': 66838, 'loss/train': 1.570172667503357} 11/07/2021 06:38:22 - INFO - __main__ - Step 66840: {'lr': 0.0002983092686128779, 'samples': 12833280, 'steps': 66839, 'loss/train': 1.7665467262268066} 11/07/2021 06:38:22 - INFO - __main__ - Step 66841: {'lr': 0.00029830406187866186, 'samples': 12833472, 'steps': 66840, 'loss/train': 1.3683182001113892} 11/07/2021 06:38:23 - INFO - __main__ - Step 66842: {'lr': 0.00029829885512268084, 'samples': 12833664, 'steps': 66841, 'loss/train': 0.3093379735946655} 11/07/2021 06:38:23 - INFO - __main__ - Step 66843: {'lr': 0.000298293648344937, 'samples': 12833856, 'steps': 66842, 'loss/train': 1.363093614578247} 11/07/2021 06:38:23 - INFO - __main__ - Step 66844: {'lr': 0.0002982884415454328, 'samples': 12834048, 'steps': 66843, 'loss/train': 0.9937983751296997} 11/07/2021 06:38:24 - INFO - __main__ - Step 66845: {'lr': 0.00029828323472417065, 'samples': 12834240, 'steps': 66844, 'loss/train': 0.7615651488304138} 11/07/2021 06:38:25 - INFO - __main__ - Step 66846: {'lr': 0.00029827802788115276, 'samples': 12834432, 'steps': 66845, 'loss/train': 1.6342850923538208} 11/07/2021 06:38:25 - INFO - __main__ - Step 66847: {'lr': 0.00029827282101638154, 'samples': 12834624, 'steps': 66846, 'loss/train': 0.404573917388916} 11/07/2021 06:38:26 - INFO - __main__ - Step 66848: {'lr': 0.00029826761412985933, 'samples': 12834816, 'steps': 66847, 'loss/train': 1.6392889022827148} 11/07/2021 06:38:26 - INFO - __main__ - Step 66849: {'lr': 0.00029826240722158847, 'samples': 12835008, 'steps': 66848, 'loss/train': 1.7063122987747192} 11/07/2021 06:38:26 - INFO - __main__ - Step 66850: {'lr': 0.0002982572002915713, 'samples': 12835200, 'steps': 66849, 'loss/train': 1.0623157024383545} 11/07/2021 06:38:27 - INFO - __main__ - Step 66851: {'lr': 0.00029825199333981023, 'samples': 12835392, 'steps': 66850, 'loss/train': 1.3209699392318726} 11/07/2021 06:38:28 - INFO - __main__ - Step 66852: {'lr': 0.0002982467863663075, 'samples': 12835584, 'steps': 66851, 'loss/train': 1.580957055091858} 11/07/2021 06:38:28 - INFO - __main__ - Step 66853: {'lr': 0.00029824157937106553, 'samples': 12835776, 'steps': 66852, 'loss/train': 1.0455963611602783} 11/07/2021 06:38:28 - INFO - __main__ - Step 66854: {'lr': 0.0002982363723540867, 'samples': 12835968, 'steps': 66853, 'loss/train': 1.2406388521194458} 11/07/2021 06:38:29 - INFO - __main__ - Step 66855: {'lr': 0.00029823116531537325, 'samples': 12836160, 'steps': 66854, 'loss/train': 1.0157272815704346} 11/07/2021 06:38:30 - INFO - __main__ - Step 66856: {'lr': 0.00029822595825492766, 'samples': 12836352, 'steps': 66855, 'loss/train': 1.2167943716049194} 11/07/2021 06:38:30 - INFO - __main__ - Step 66857: {'lr': 0.0002982207511727522, 'samples': 12836544, 'steps': 66856, 'loss/train': 1.5537426471710205} 11/07/2021 06:38:30 - INFO - __main__ - Step 66858: {'lr': 0.0002982155440688491, 'samples': 12836736, 'steps': 66857, 'loss/train': 1.3024166822433472} 11/07/2021 06:38:31 - INFO - __main__ - Step 66859: {'lr': 0.00029821033694322086, 'samples': 12836928, 'steps': 66858, 'loss/train': 1.438707709312439} 11/07/2021 06:38:31 - INFO - __main__ - Step 66860: {'lr': 0.00029820512979586975, 'samples': 12837120, 'steps': 66859, 'loss/train': 1.0099835395812988} 11/07/2021 06:38:33 - INFO - __main__ - Step 66861: {'lr': 0.00029819992262679817, 'samples': 12837312, 'steps': 66860, 'loss/train': 1.7837340831756592} 11/07/2021 06:38:33 - INFO - __main__ - Step 66862: {'lr': 0.00029819471543600856, 'samples': 12837504, 'steps': 66861, 'loss/train': 1.3094764947891235} 11/07/2021 06:38:33 - INFO - __main__ - Step 66863: {'lr': 0.000298189508223503, 'samples': 12837696, 'steps': 66862, 'loss/train': 1.4650862216949463} 11/07/2021 06:38:34 - INFO - __main__ - Step 66864: {'lr': 0.0002981843009892841, 'samples': 12837888, 'steps': 66863, 'loss/train': 1.2200987339019775} 11/07/2021 06:38:34 - INFO - __main__ - Step 66865: {'lr': 0.00029817909373335407, 'samples': 12838080, 'steps': 66864, 'loss/train': 0.6177035570144653} 11/07/2021 06:38:34 - INFO - __main__ - Step 66866: {'lr': 0.0002981738864557153, 'samples': 12838272, 'steps': 66865, 'loss/train': 1.269160270690918} 11/07/2021 06:38:36 - INFO - __main__ - Step 66867: {'lr': 0.0002981686791563701, 'samples': 12838464, 'steps': 66866, 'loss/train': 0.14408384263515472} 11/07/2021 06:38:36 - INFO - __main__ - Step 66868: {'lr': 0.00029816347183532076, 'samples': 12838656, 'steps': 66867, 'loss/train': 1.33181631565094} 11/07/2021 06:38:36 - INFO - __main__ - Step 66869: {'lr': 0.00029815826449256985, 'samples': 12838848, 'steps': 66868, 'loss/train': 1.610174536705017} 11/07/2021 06:38:37 - INFO - __main__ - Step 66870: {'lr': 0.00029815305712811946, 'samples': 12839040, 'steps': 66869, 'loss/train': 1.2089568376541138} 11/07/2021 06:38:37 - INFO - __main__ - Step 66871: {'lr': 0.0002981478497419721, 'samples': 12839232, 'steps': 66870, 'loss/train': 1.4313249588012695} 11/07/2021 06:38:38 - INFO - __main__ - Step 66872: {'lr': 0.00029814264233413, 'samples': 12839424, 'steps': 66871, 'loss/train': 1.3109145164489746} 11/07/2021 06:38:38 - INFO - __main__ - Step 66873: {'lr': 0.00029813743490459565, 'samples': 12839616, 'steps': 66872, 'loss/train': 1.4413795471191406} 11/07/2021 06:38:39 - INFO - __main__ - Step 66874: {'lr': 0.00029813222745337124, 'samples': 12839808, 'steps': 66873, 'loss/train': 1.2799980640411377} 11/07/2021 06:38:39 - INFO - __main__ - Step 66875: {'lr': 0.0002981270199804592, 'samples': 12840000, 'steps': 66874, 'loss/train': 1.5270018577575684} 11/07/2021 06:38:40 - INFO - __main__ - Step 66876: {'lr': 0.00029812181248586194, 'samples': 12840192, 'steps': 66875, 'loss/train': 1.071372628211975} 11/07/2021 06:38:40 - INFO - __main__ - Step 66877: {'lr': 0.0002981166049695817, 'samples': 12840384, 'steps': 66876, 'loss/train': 1.4729077816009521} 11/07/2021 06:38:41 - INFO - __main__ - Step 66878: {'lr': 0.00029811139743162086, 'samples': 12840576, 'steps': 66877, 'loss/train': 1.4798557758331299} 11/07/2021 06:38:42 - INFO - __main__ - Step 66879: {'lr': 0.0002981061898719817, 'samples': 12840768, 'steps': 66878, 'loss/train': 1.6336976289749146} 11/07/2021 06:38:42 - INFO - __main__ - Step 66880: {'lr': 0.00029810098229066676, 'samples': 12840960, 'steps': 66879, 'loss/train': 0.9286589026451111} 11/07/2021 06:38:42 - INFO - __main__ - Step 66881: {'lr': 0.0002980957746876781, 'samples': 12841152, 'steps': 66880, 'loss/train': 1.5103578567504883} 11/07/2021 06:38:43 - INFO - __main__ - Step 66882: {'lr': 0.00029809056706301833, 'samples': 12841344, 'steps': 66881, 'loss/train': 1.6573008298873901} 11/07/2021 06:38:43 - INFO - __main__ - Step 66883: {'lr': 0.00029808535941668973, 'samples': 12841536, 'steps': 66882, 'loss/train': 1.8566683530807495} 11/07/2021 06:38:43 - INFO - __main__ - Step 66884: {'lr': 0.0002980801517486945, 'samples': 12841728, 'steps': 66883, 'loss/train': 1.8027366399765015} 11/07/2021 06:38:45 - INFO - __main__ - Step 66885: {'lr': 0.00029807494405903516, 'samples': 12841920, 'steps': 66884, 'loss/train': 0.936206579208374} 11/07/2021 06:38:45 - INFO - __main__ - Step 66886: {'lr': 0.000298069736347714, 'samples': 12842112, 'steps': 66885, 'loss/train': 1.5021564960479736} 11/07/2021 06:38:45 - INFO - __main__ - Step 66887: {'lr': 0.0002980645286147333, 'samples': 12842304, 'steps': 66886, 'loss/train': 2.0276877880096436} 11/07/2021 06:38:46 - INFO - __main__ - Step 66888: {'lr': 0.00029805932086009553, 'samples': 12842496, 'steps': 66887, 'loss/train': 1.1761277914047241} 11/07/2021 06:38:46 - INFO - __main__ - Step 66889: {'lr': 0.00029805411308380297, 'samples': 12842688, 'steps': 66888, 'loss/train': 1.254209280014038} 11/07/2021 06:38:48 - INFO - __main__ - Step 66890: {'lr': 0.0002980489052858579, 'samples': 12842880, 'steps': 66889, 'loss/train': 1.4902147054672241} 11/07/2021 06:38:48 - INFO - __main__ - Step 66891: {'lr': 0.0002980436974662628, 'samples': 12843072, 'steps': 66890, 'loss/train': 0.9828837513923645} 11/07/2021 06:38:48 - INFO - __main__ - Step 66892: {'lr': 0.0002980384896250199, 'samples': 12843264, 'steps': 66891, 'loss/train': 1.5215646028518677} 11/07/2021 06:38:49 - INFO - __main__ - Step 66893: {'lr': 0.0002980332817621317, 'samples': 12843456, 'steps': 66892, 'loss/train': 1.9449959993362427} 11/07/2021 06:38:49 - INFO - __main__ - Step 66894: {'lr': 0.0002980280738776003, 'samples': 12843648, 'steps': 66893, 'loss/train': 1.7029489278793335} 11/07/2021 06:38:49 - INFO - __main__ - Step 66895: {'lr': 0.0002980228659714283, 'samples': 12843840, 'steps': 66894, 'loss/train': 1.478158950805664} 11/07/2021 06:38:50 - INFO - __main__ - Step 66896: {'lr': 0.00029801765804361794, 'samples': 12844032, 'steps': 66895, 'loss/train': 0.7928040623664856} 11/07/2021 06:38:51 - INFO - __main__ - Step 66897: {'lr': 0.0002980124500941716, 'samples': 12844224, 'steps': 66896, 'loss/train': 1.4936224222183228} 11/07/2021 06:38:51 - INFO - __main__ - Step 66898: {'lr': 0.0002980072421230914, 'samples': 12844416, 'steps': 66897, 'loss/train': 1.4886605739593506} 11/07/2021 06:38:51 - INFO - __main__ - Step 66899: {'lr': 0.00029800203413038, 'samples': 12844608, 'steps': 66898, 'loss/train': 1.325537085533142} 11/07/2021 06:38:52 - INFO - __main__ - Step 66900: {'lr': 0.00029799682611603964, 'samples': 12844800, 'steps': 66899, 'loss/train': 1.838220477104187} 11/07/2021 06:38:53 - INFO - __main__ - Step 66901: {'lr': 0.00029799161808007264, 'samples': 12844992, 'steps': 66900, 'loss/train': 1.5058858394622803} 11/07/2021 06:38:53 - INFO - __main__ - Step 66902: {'lr': 0.0002979864100224813, 'samples': 12845184, 'steps': 66901, 'loss/train': 1.5068614482879639} 11/07/2021 06:38:53 - INFO - __main__ - Step 66903: {'lr': 0.0002979812019432681, 'samples': 12845376, 'steps': 66902, 'loss/train': 1.055471658706665} 11/07/2021 06:38:54 - INFO - __main__ - Step 66904: {'lr': 0.0002979759938424353, 'samples': 12845568, 'steps': 66903, 'loss/train': 1.328389048576355} 11/07/2021 06:38:54 - INFO - __main__ - Step 66905: {'lr': 0.00029797078571998527, 'samples': 12845760, 'steps': 66904, 'loss/train': 1.738703966140747} 11/07/2021 06:38:55 - INFO - __main__ - Step 66906: {'lr': 0.0002979655775759202, 'samples': 12845952, 'steps': 66905, 'loss/train': 1.465796709060669} 11/07/2021 06:38:56 - INFO - __main__ - Step 66907: {'lr': 0.00029796036941024274, 'samples': 12846144, 'steps': 66906, 'loss/train': 1.2369256019592285} 11/07/2021 06:38:56 - INFO - __main__ - Step 66908: {'lr': 0.000297955161222955, 'samples': 12846336, 'steps': 66907, 'loss/train': 1.4455788135528564} 11/07/2021 06:38:56 - INFO - __main__ - Step 66909: {'lr': 0.00029794995301405953, 'samples': 12846528, 'steps': 66908, 'loss/train': 1.5637747049331665} 11/07/2021 06:38:57 - INFO - __main__ - Step 66910: {'lr': 0.0002979447447835584, 'samples': 12846720, 'steps': 66909, 'loss/train': 1.360649824142456} 11/07/2021 06:38:57 - INFO - __main__ - Step 66911: {'lr': 0.00029793953653145424, 'samples': 12846912, 'steps': 66910, 'loss/train': 1.3279404640197754} 11/07/2021 06:38:58 - INFO - __main__ - Step 66912: {'lr': 0.00029793432825774913, 'samples': 12847104, 'steps': 66911, 'loss/train': 1.4519059658050537} 11/07/2021 06:38:58 - INFO - __main__ - Step 66913: {'lr': 0.0002979291199624456, 'samples': 12847296, 'steps': 66912, 'loss/train': 1.316642165184021} 11/07/2021 06:38:59 - INFO - __main__ - Step 66914: {'lr': 0.000297923911645546, 'samples': 12847488, 'steps': 66913, 'loss/train': 1.9718916416168213} 11/07/2021 06:38:59 - INFO - __main__ - Step 66915: {'lr': 0.00029791870330705256, 'samples': 12847680, 'steps': 66914, 'loss/train': 0.2852233052253723} 11/07/2021 06:38:59 - INFO - __main__ - Step 66916: {'lr': 0.0002979134949469677, 'samples': 12847872, 'steps': 66915, 'loss/train': 1.4380649328231812} 11/07/2021 06:39:00 - INFO - __main__ - Step 66917: {'lr': 0.00029790828656529384, 'samples': 12848064, 'steps': 66916, 'loss/train': 1.2586188316345215} 11/07/2021 06:39:01 - INFO - __main__ - Step 66918: {'lr': 0.0002979030781620332, 'samples': 12848256, 'steps': 66917, 'loss/train': 1.3651821613311768} 11/07/2021 06:39:01 - INFO - __main__ - Step 66919: {'lr': 0.00029789786973718807, 'samples': 12848448, 'steps': 66918, 'loss/train': 1.2751590013504028} 11/07/2021 06:39:01 - INFO - __main__ - Step 66920: {'lr': 0.000297892661290761, 'samples': 12848640, 'steps': 66919, 'loss/train': 1.4696381092071533} 11/07/2021 06:39:02 - INFO - __main__ - Step 66921: {'lr': 0.0002978874528227542, 'samples': 12848832, 'steps': 66920, 'loss/train': 0.785591721534729} 11/07/2021 06:39:03 - INFO - __main__ - Step 66922: {'lr': 0.0002978822443331701, 'samples': 12849024, 'steps': 66921, 'loss/train': 1.3189053535461426} 11/07/2021 06:39:03 - INFO - __main__ - Step 66923: {'lr': 0.000297877035822011, 'samples': 12849216, 'steps': 66922, 'loss/train': 1.360253930091858} 11/07/2021 06:39:04 - INFO - __main__ - Step 66924: {'lr': 0.0002978718272892792, 'samples': 12849408, 'steps': 66923, 'loss/train': 1.4389848709106445} 11/07/2021 06:39:04 - INFO - __main__ - Step 66925: {'lr': 0.00029786661873497714, 'samples': 12849600, 'steps': 66924, 'loss/train': 1.4933536052703857} 11/07/2021 06:39:04 - INFO - __main__ - Step 66926: {'lr': 0.00029786141015910705, 'samples': 12849792, 'steps': 66925, 'loss/train': 1.2595512866973877} 11/07/2021 06:39:05 - INFO - __main__ - Step 66927: {'lr': 0.00029785620156167137, 'samples': 12849984, 'steps': 66926, 'loss/train': 1.1449699401855469} 11/07/2021 06:39:06 - INFO - __main__ - Step 66928: {'lr': 0.0002978509929426724, 'samples': 12850176, 'steps': 66927, 'loss/train': 1.341023325920105} 11/07/2021 06:39:06 - INFO - __main__ - Step 66929: {'lr': 0.00029784578430211255, 'samples': 12850368, 'steps': 66928, 'loss/train': 1.242306113243103} 11/07/2021 06:39:06 - INFO - __main__ - Step 66930: {'lr': 0.0002978405756399942, 'samples': 12850560, 'steps': 66929, 'loss/train': 1.3540148735046387} 11/07/2021 06:39:07 - INFO - __main__ - Step 66931: {'lr': 0.00029783536695631954, 'samples': 12850752, 'steps': 66930, 'loss/train': 1.4597740173339844} 11/07/2021 06:39:08 - INFO - __main__ - Step 66932: {'lr': 0.000297830158251091, 'samples': 12850944, 'steps': 66931, 'loss/train': 1.1700499057769775} 11/07/2021 06:39:08 - INFO - __main__ - Step 66933: {'lr': 0.00029782494952431093, 'samples': 12851136, 'steps': 66932, 'loss/train': 1.4936296939849854} 11/07/2021 06:39:08 - INFO - __main__ - Step 66934: {'lr': 0.00029781974077598174, 'samples': 12851328, 'steps': 66933, 'loss/train': 1.6771787405014038} 11/07/2021 06:39:09 - INFO - __main__ - Step 66935: {'lr': 0.00029781453200610565, 'samples': 12851520, 'steps': 66934, 'loss/train': 1.7840930223464966} 11/07/2021 06:39:09 - INFO - __main__ - Step 66936: {'lr': 0.0002978093232146851, 'samples': 12851712, 'steps': 66935, 'loss/train': 1.1696276664733887} 11/07/2021 06:39:09 - INFO - __main__ - Step 66937: {'lr': 0.0002978041144017224, 'samples': 12851904, 'steps': 66936, 'loss/train': 1.4268096685409546} 11/07/2021 06:39:11 - INFO - __main__ - Step 66938: {'lr': 0.0002977989055672199, 'samples': 12852096, 'steps': 66937, 'loss/train': 2.0771682262420654} 11/07/2021 06:39:11 - INFO - __main__ - Step 66939: {'lr': 0.0002977936967111799, 'samples': 12852288, 'steps': 66938, 'loss/train': 1.5936490297317505} 11/07/2021 06:39:11 - INFO - __main__ - Step 66940: {'lr': 0.00029778848783360484, 'samples': 12852480, 'steps': 66939, 'loss/train': 1.4432034492492676} 11/07/2021 06:39:12 - INFO - __main__ - Step 66941: {'lr': 0.000297783278934497, 'samples': 12852672, 'steps': 66940, 'loss/train': 1.54490327835083} 11/07/2021 06:39:12 - INFO - __main__ - Step 66942: {'lr': 0.0002977780700138588, 'samples': 12852864, 'steps': 66941, 'loss/train': 1.3605393171310425} 11/07/2021 06:39:13 - INFO - __main__ - Step 66943: {'lr': 0.00029777286107169254, 'samples': 12853056, 'steps': 66942, 'loss/train': 1.4009031057357788} 11/07/2021 06:39:13 - INFO - __main__ - Step 66944: {'lr': 0.00029776765210800057, 'samples': 12853248, 'steps': 66943, 'loss/train': 1.181050419807434} 11/07/2021 06:39:14 - INFO - __main__ - Step 66945: {'lr': 0.0002977624431227852, 'samples': 12853440, 'steps': 66944, 'loss/train': 1.2142736911773682} 11/07/2021 06:39:14 - INFO - __main__ - Step 66946: {'lr': 0.00029775723411604876, 'samples': 12853632, 'steps': 66945, 'loss/train': 1.0470037460327148} 11/07/2021 06:39:14 - INFO - __main__ - Step 66947: {'lr': 0.0002977520250877937, 'samples': 12853824, 'steps': 66946, 'loss/train': 0.7729756236076355} 11/07/2021 06:39:15 - INFO - __main__ - Step 66948: {'lr': 0.0002977468160380224, 'samples': 12854016, 'steps': 66947, 'loss/train': 1.7640089988708496} 11/07/2021 06:39:16 - INFO - __main__ - Step 66949: {'lr': 0.00029774160696673704, 'samples': 12854208, 'steps': 66948, 'loss/train': 1.5957211256027222} 11/07/2021 06:39:16 - INFO - __main__ - Step 66950: {'lr': 0.00029773639787394, 'samples': 12854400, 'steps': 66949, 'loss/train': 1.686535358428955} 11/07/2021 06:39:17 - INFO - __main__ - Step 66951: {'lr': 0.0002977311887596337, 'samples': 12854592, 'steps': 66950, 'loss/train': 1.6551313400268555} 11/07/2021 06:39:17 - INFO - __main__ - Step 66952: {'lr': 0.0002977259796238205, 'samples': 12854784, 'steps': 66951, 'loss/train': 1.5048328638076782} 11/07/2021 06:39:18 - INFO - __main__ - Step 66953: {'lr': 0.00029772077046650273, 'samples': 12854976, 'steps': 66952, 'loss/train': 1.3543171882629395} 11/07/2021 06:39:18 - INFO - __main__ - Step 66954: {'lr': 0.00029771556128768266, 'samples': 12855168, 'steps': 66953, 'loss/train': 1.1700007915496826} 11/07/2021 06:39:19 - INFO - __main__ - Step 66955: {'lr': 0.00029771035208736276, 'samples': 12855360, 'steps': 66954, 'loss/train': 1.6650967597961426} 11/07/2021 06:39:19 - INFO - __main__ - Step 66956: {'lr': 0.00029770514286554524, 'samples': 12855552, 'steps': 66955, 'loss/train': 1.2106488943099976} 11/07/2021 06:39:19 - INFO - __main__ - Step 66957: {'lr': 0.0002976999336222326, 'samples': 12855744, 'steps': 66956, 'loss/train': 0.8247320652008057} 11/07/2021 06:39:20 - INFO - __main__ - Step 66958: {'lr': 0.000297694724357427, 'samples': 12855936, 'steps': 66957, 'loss/train': 1.1804513931274414} 11/07/2021 06:39:21 - INFO - __main__ - Step 66959: {'lr': 0.000297689515071131, 'samples': 12856128, 'steps': 66958, 'loss/train': 1.4928487539291382} 11/07/2021 06:39:21 - INFO - __main__ - Step 66960: {'lr': 0.00029768430576334676, 'samples': 12856320, 'steps': 66959, 'loss/train': 1.1935933828353882} 11/07/2021 06:39:21 - INFO - __main__ - Step 66961: {'lr': 0.0002976790964340768, 'samples': 12856512, 'steps': 66960, 'loss/train': 1.4265247583389282} 11/07/2021 06:39:22 - INFO - __main__ - Step 66962: {'lr': 0.00029767388708332323, 'samples': 12856704, 'steps': 66961, 'loss/train': 1.634390115737915} 11/07/2021 06:39:23 - INFO - __main__ - Step 66963: {'lr': 0.00029766867771108865, 'samples': 12856896, 'steps': 66962, 'loss/train': 1.6523627042770386} 11/07/2021 06:39:23 - INFO - __main__ - Step 66964: {'lr': 0.00029766346831737526, 'samples': 12857088, 'steps': 66963, 'loss/train': 1.503721833229065} 11/07/2021 06:39:23 - INFO - __main__ - Step 66965: {'lr': 0.0002976582589021855, 'samples': 12857280, 'steps': 66964, 'loss/train': 1.1395463943481445} 11/07/2021 06:39:24 - INFO - __main__ - Step 66966: {'lr': 0.0002976530494655216, 'samples': 12857472, 'steps': 66965, 'loss/train': 1.7678818702697754} 11/07/2021 06:39:24 - INFO - __main__ - Step 66967: {'lr': 0.000297647840007386, 'samples': 12857664, 'steps': 66966, 'loss/train': 1.836700677871704} 11/07/2021 06:39:25 - INFO - __main__ - Step 66968: {'lr': 0.000297642630527781, 'samples': 12857856, 'steps': 66967, 'loss/train': 1.4653239250183105} 11/07/2021 06:39:26 - INFO - __main__ - Step 66969: {'lr': 0.000297637421026709, 'samples': 12858048, 'steps': 66968, 'loss/train': 1.7425637245178223} 11/07/2021 06:39:26 - INFO - __main__ - Step 66970: {'lr': 0.0002976322115041723, 'samples': 12858240, 'steps': 66969, 'loss/train': 0.8802806735038757} 11/07/2021 06:39:26 - INFO - __main__ - Step 66971: {'lr': 0.00029762700196017325, 'samples': 12858432, 'steps': 66970, 'loss/train': 1.2041141986846924} 11/07/2021 06:39:27 - INFO - __main__ - Step 66972: {'lr': 0.0002976217923947142, 'samples': 12858624, 'steps': 66971, 'loss/train': 1.4949822425842285} 11/07/2021 06:39:27 - INFO - __main__ - Step 66973: {'lr': 0.0002976165828077975, 'samples': 12858816, 'steps': 66972, 'loss/train': 1.5712357759475708} 11/07/2021 06:39:28 - INFO - __main__ - Step 66974: {'lr': 0.0002976113731994255, 'samples': 12859008, 'steps': 66973, 'loss/train': 1.1531301736831665} 11/07/2021 06:39:28 - INFO - __main__ - Step 66975: {'lr': 0.0002976061635696006, 'samples': 12859200, 'steps': 66974, 'loss/train': 1.5073195695877075} 11/07/2021 06:39:29 - INFO - __main__ - Step 66976: {'lr': 0.00029760095391832505, 'samples': 12859392, 'steps': 66975, 'loss/train': 1.6333152055740356} 11/07/2021 06:39:29 - INFO - __main__ - Step 66977: {'lr': 0.00029759574424560134, 'samples': 12859584, 'steps': 66976, 'loss/train': 0.9771480560302734} 11/07/2021 06:39:29 - INFO - __main__ - Step 66978: {'lr': 0.0002975905345514316, 'samples': 12859776, 'steps': 66977, 'loss/train': 0.8987215757369995} 11/07/2021 06:39:31 - INFO - __main__ - Step 66979: {'lr': 0.00029758532483581835, 'samples': 12859968, 'steps': 66978, 'loss/train': 1.6353718042373657} 11/07/2021 06:39:31 - INFO - __main__ - Step 66980: {'lr': 0.00029758011509876383, 'samples': 12860160, 'steps': 66979, 'loss/train': 1.5596027374267578} 11/07/2021 06:39:31 - INFO - __main__ - Step 66981: {'lr': 0.00029757490534027046, 'samples': 12860352, 'steps': 66980, 'loss/train': 1.4445384740829468} 11/07/2021 06:39:32 - INFO - __main__ - Step 66982: {'lr': 0.00029756969556034063, 'samples': 12860544, 'steps': 66981, 'loss/train': 1.1100434064865112} 11/07/2021 06:39:32 - INFO - __main__ - Step 66983: {'lr': 0.00029756448575897666, 'samples': 12860736, 'steps': 66982, 'loss/train': 1.9028998613357544} 11/07/2021 06:39:33 - INFO - __main__ - Step 66984: {'lr': 0.00029755927593618083, 'samples': 12860928, 'steps': 66983, 'loss/train': 1.5562161207199097} 11/07/2021 06:39:33 - INFO - __main__ - Step 66985: {'lr': 0.0002975540660919555, 'samples': 12861120, 'steps': 66984, 'loss/train': 1.6550071239471436} 11/07/2021 06:39:34 - INFO - __main__ - Step 66986: {'lr': 0.000297548856226303, 'samples': 12861312, 'steps': 66985, 'loss/train': 1.287642240524292} 11/07/2021 06:39:34 - INFO - __main__ - Step 66987: {'lr': 0.0002975436463392258, 'samples': 12861504, 'steps': 66986, 'loss/train': 1.1941375732421875} 11/07/2021 06:39:34 - INFO - __main__ - Step 66988: {'lr': 0.0002975384364307261, 'samples': 12861696, 'steps': 66987, 'loss/train': 1.1110267639160156} 11/07/2021 06:39:35 - INFO - __main__ - Step 66989: {'lr': 0.00029753322650080634, 'samples': 12861888, 'steps': 66988, 'loss/train': 1.4144443273544312} 11/07/2021 06:39:36 - INFO - __main__ - Step 66990: {'lr': 0.00029752801654946886, 'samples': 12862080, 'steps': 66989, 'loss/train': 1.4310657978057861} 11/07/2021 06:39:36 - INFO - __main__ - Step 66991: {'lr': 0.000297522806576716, 'samples': 12862272, 'steps': 66990, 'loss/train': 0.7089109420776367} 11/07/2021 06:39:37 - INFO - __main__ - Step 66992: {'lr': 0.0002975175965825501, 'samples': 12862464, 'steps': 66991, 'loss/train': 1.2573726177215576} 11/07/2021 06:39:37 - INFO - __main__ - Step 66993: {'lr': 0.0002975123865669734, 'samples': 12862656, 'steps': 66992, 'loss/train': 1.3483322858810425} 11/07/2021 06:39:37 - INFO - __main__ - Step 66994: {'lr': 0.00029750717652998846, 'samples': 12862848, 'steps': 66993, 'loss/train': 1.1555858850479126} 11/07/2021 06:39:38 - INFO - __main__ - Step 66995: {'lr': 0.00029750196647159745, 'samples': 12863040, 'steps': 66994, 'loss/train': 1.2040671110153198} 11/07/2021 06:39:39 - INFO - __main__ - Step 66996: {'lr': 0.00029749675639180283, 'samples': 12863232, 'steps': 66995, 'loss/train': 1.29051673412323} 11/07/2021 06:39:39 - INFO - __main__ - Step 66997: {'lr': 0.0002974915462906069, 'samples': 12863424, 'steps': 66996, 'loss/train': 1.0926471948623657} 11/07/2021 06:39:39 - INFO - __main__ - Step 66998: {'lr': 0.00029748633616801206, 'samples': 12863616, 'steps': 66997, 'loss/train': 2.5142412185668945} 11/07/2021 06:39:40 - INFO - __main__ - Step 66999: {'lr': 0.00029748112602402053, 'samples': 12863808, 'steps': 66998, 'loss/train': 1.24008309841156} 11/07/2021 06:39:41 - INFO - __main__ - Step 67000: {'lr': 0.00029747591585863476, 'samples': 12864000, 'steps': 66999, 'loss/train': 1.4771854877471924} 11/07/2021 06:39:41 - INFO - __main__ - Step 67001: {'lr': 0.0002974707056718571, 'samples': 12864192, 'steps': 67000, 'loss/train': 1.4096732139587402} 11/07/2021 06:39:42 - INFO - __main__ - Step 67002: {'lr': 0.00029746549546368984, 'samples': 12864384, 'steps': 67001, 'loss/train': 1.4177114963531494} 11/07/2021 06:39:42 - INFO - __main__ - Step 67003: {'lr': 0.0002974602852341354, 'samples': 12864576, 'steps': 67002, 'loss/train': 1.85921049118042} 11/07/2021 06:39:42 - INFO - __main__ - Step 67004: {'lr': 0.00029745507498319605, 'samples': 12864768, 'steps': 67003, 'loss/train': 1.4495524168014526} 11/07/2021 06:39:43 - INFO - __main__ - Step 67005: {'lr': 0.0002974498647108742, 'samples': 12864960, 'steps': 67004, 'loss/train': 0.9126071929931641} 11/07/2021 06:39:44 - INFO - __main__ - Step 67006: {'lr': 0.00029744465441717215, 'samples': 12865152, 'steps': 67005, 'loss/train': 1.3722254037857056} 11/07/2021 06:39:44 - INFO - __main__ - Step 67007: {'lr': 0.00029743944410209227, 'samples': 12865344, 'steps': 67006, 'loss/train': 1.6932756900787354} 11/07/2021 06:39:44 - INFO - __main__ - Step 67008: {'lr': 0.00029743423376563696, 'samples': 12865536, 'steps': 67007, 'loss/train': 0.18909813463687897} 11/07/2021 06:39:45 - INFO - __main__ - Step 67009: {'lr': 0.00029742902340780845, 'samples': 12865728, 'steps': 67008, 'loss/train': 1.6135867834091187} 11/07/2021 06:39:46 - INFO - __main__ - Step 67010: {'lr': 0.00029742381302860923, 'samples': 12865920, 'steps': 67009, 'loss/train': 1.0165722370147705} 11/07/2021 06:39:46 - INFO - __main__ - Step 67011: {'lr': 0.0002974186026280415, 'samples': 12866112, 'steps': 67010, 'loss/train': 1.8511149883270264} 11/07/2021 06:39:46 - INFO - __main__ - Step 67012: {'lr': 0.0002974133922061077, 'samples': 12866304, 'steps': 67011, 'loss/train': 1.7038981914520264} 11/07/2021 06:39:47 - INFO - __main__ - Step 67013: {'lr': 0.00029740818176281013, 'samples': 12866496, 'steps': 67012, 'loss/train': 1.3993419408798218} 11/07/2021 06:39:47 - INFO - __main__ - Step 67014: {'lr': 0.0002974029712981512, 'samples': 12866688, 'steps': 67013, 'loss/train': 1.0877140760421753} 11/07/2021 06:39:48 - INFO - __main__ - Step 67015: {'lr': 0.0002973977608121332, 'samples': 12866880, 'steps': 67014, 'loss/train': 1.478947401046753} 11/07/2021 06:39:49 - INFO - __main__ - Step 67016: {'lr': 0.0002973925503047585, 'samples': 12867072, 'steps': 67015, 'loss/train': 1.751969814300537} 11/07/2021 06:39:49 - INFO - __main__ - Step 67017: {'lr': 0.00029738733977602955, 'samples': 12867264, 'steps': 67016, 'loss/train': 1.8304120302200317} 11/07/2021 06:39:49 - INFO - __main__ - Step 67018: {'lr': 0.0002973821292259485, 'samples': 12867456, 'steps': 67017, 'loss/train': 1.523973822593689} 11/07/2021 06:39:50 - INFO - __main__ - Step 67019: {'lr': 0.0002973769186545178, 'samples': 12867648, 'steps': 67018, 'loss/train': 1.6114330291748047} 11/07/2021 06:39:50 - INFO - __main__ - Step 67020: {'lr': 0.0002973717080617398, 'samples': 12867840, 'steps': 67019, 'loss/train': 1.2827070951461792} 11/07/2021 06:39:51 - INFO - __main__ - Step 67021: {'lr': 0.00029736649744761687, 'samples': 12868032, 'steps': 67020, 'loss/train': 0.9334750771522522} 11/07/2021 06:39:51 - INFO - __main__ - Step 67022: {'lr': 0.00029736128681215123, 'samples': 12868224, 'steps': 67021, 'loss/train': 1.554347038269043} 11/07/2021 06:39:52 - INFO - __main__ - Step 67023: {'lr': 0.00029735607615534535, 'samples': 12868416, 'steps': 67022, 'loss/train': 1.6144084930419922} 11/07/2021 06:39:52 - INFO - __main__ - Step 67024: {'lr': 0.00029735086547720167, 'samples': 12868608, 'steps': 67023, 'loss/train': 1.609191656112671} 11/07/2021 06:39:52 - INFO - __main__ - Step 67025: {'lr': 0.00029734565477772235, 'samples': 12868800, 'steps': 67024, 'loss/train': 1.4912182092666626} 11/07/2021 06:39:53 - INFO - __main__ - Step 67026: {'lr': 0.0002973404440569098, 'samples': 12868992, 'steps': 67025, 'loss/train': 1.228990912437439} 11/07/2021 06:39:54 - INFO - __main__ - Step 67027: {'lr': 0.00029733523331476635, 'samples': 12869184, 'steps': 67026, 'loss/train': 1.118418574333191} 11/07/2021 06:39:54 - INFO - __main__ - Step 67028: {'lr': 0.00029733002255129444, 'samples': 12869376, 'steps': 67027, 'loss/train': 1.585336446762085} 11/07/2021 06:39:55 - INFO - __main__ - Step 67029: {'lr': 0.00029732481176649627, 'samples': 12869568, 'steps': 67028, 'loss/train': 1.4133071899414062} 11/07/2021 06:39:55 - INFO - __main__ - Step 67030: {'lr': 0.00029731960096037434, 'samples': 12869760, 'steps': 67029, 'loss/train': 1.4086648225784302} 11/07/2021 06:39:56 - INFO - __main__ - Step 67031: {'lr': 0.0002973143901329309, 'samples': 12869952, 'steps': 67030, 'loss/train': 1.4288843870162964} 11/07/2021 06:39:56 - INFO - __main__ - Step 67032: {'lr': 0.00029730917928416834, 'samples': 12870144, 'steps': 67031, 'loss/train': 1.5943485498428345} 11/07/2021 06:39:57 - INFO - __main__ - Step 67033: {'lr': 0.00029730396841408895, 'samples': 12870336, 'steps': 67032, 'loss/train': 1.365109920501709} 11/07/2021 06:39:57 - INFO - __main__ - Step 67034: {'lr': 0.0002972987575226952, 'samples': 12870528, 'steps': 67033, 'loss/train': 1.0118077993392944} 11/07/2021 06:39:57 - INFO - __main__ - Step 67035: {'lr': 0.00029729354660998933, 'samples': 12870720, 'steps': 67034, 'loss/train': 1.4872190952301025} 11/07/2021 06:39:58 - INFO - __main__ - Step 67036: {'lr': 0.0002972883356759736, 'samples': 12870912, 'steps': 67035, 'loss/train': 0.4085468351840973} 11/07/2021 06:39:59 - INFO - __main__ - Step 67037: {'lr': 0.00029728312472065066, 'samples': 12871104, 'steps': 67036, 'loss/train': 1.2623963356018066} 11/07/2021 06:39:59 - INFO - __main__ - Step 67038: {'lr': 0.0002972779137440226, 'samples': 12871296, 'steps': 67037, 'loss/train': 1.121044635772705} 11/07/2021 06:39:59 - INFO - __main__ - Step 67039: {'lr': 0.0002972727027460918, 'samples': 12871488, 'steps': 67038, 'loss/train': 1.566612958908081} 11/07/2021 06:40:00 - INFO - __main__ - Step 67040: {'lr': 0.00029726749172686066, 'samples': 12871680, 'steps': 67039, 'loss/train': 1.220629334449768} 11/07/2021 06:40:02 - INFO - __main__ - Step 67041: {'lr': 0.00029726228068633156, 'samples': 12871872, 'steps': 67040, 'loss/train': 1.258463978767395} 11/07/2021 06:40:02 - INFO - __main__ - Step 67042: {'lr': 0.0002972570696245068, 'samples': 12872064, 'steps': 67041, 'loss/train': 2.057068109512329} 11/07/2021 06:40:02 - INFO - __main__ - Step 67043: {'lr': 0.0002972518585413887, 'samples': 12872256, 'steps': 67042, 'loss/train': 1.4139922857284546} 11/07/2021 06:40:03 - INFO - __main__ - Step 67044: {'lr': 0.0002972466474369797, 'samples': 12872448, 'steps': 67043, 'loss/train': 0.561575174331665} 11/07/2021 06:40:03 - INFO - __main__ - Step 67045: {'lr': 0.00029724143631128203, 'samples': 12872640, 'steps': 67044, 'loss/train': 0.5022367835044861} 11/07/2021 06:40:04 - INFO - __main__ - Step 67046: {'lr': 0.0002972362251642981, 'samples': 12872832, 'steps': 67045, 'loss/train': 0.7999610304832458} 11/07/2021 06:40:04 - INFO - __main__ - Step 67047: {'lr': 0.0002972310139960303, 'samples': 12873024, 'steps': 67046, 'loss/train': 1.8552603721618652} 11/07/2021 06:40:04 - INFO - __main__ - Step 67048: {'lr': 0.0002972258028064809, 'samples': 12873216, 'steps': 67047, 'loss/train': 1.5871509313583374} 11/07/2021 06:40:05 - INFO - __main__ - Step 67049: {'lr': 0.0002972205915956523, 'samples': 12873408, 'steps': 67048, 'loss/train': 1.216645359992981} 11/07/2021 06:40:05 - INFO - __main__ - Step 67050: {'lr': 0.0002972153803635468, 'samples': 12873600, 'steps': 67049, 'loss/train': 1.4869681596755981} 11/07/2021 06:40:06 - INFO - __main__ - Step 67051: {'lr': 0.00029721016911016685, 'samples': 12873792, 'steps': 67050, 'loss/train': 1.5569891929626465} 11/07/2021 06:40:06 - INFO - __main__ - Step 67052: {'lr': 0.00029720495783551465, 'samples': 12873984, 'steps': 67051, 'loss/train': 1.7700432538986206} 11/07/2021 06:40:07 - INFO - __main__ - Step 67053: {'lr': 0.0002971997465395926, 'samples': 12874176, 'steps': 67052, 'loss/train': 1.3065577745437622} 11/07/2021 06:40:08 - INFO - __main__ - Step 67054: {'lr': 0.00029719453522240316, 'samples': 12874368, 'steps': 67053, 'loss/train': 1.6069320440292358} 11/07/2021 06:40:08 - INFO - __main__ - Step 67055: {'lr': 0.00029718932388394853, 'samples': 12874560, 'steps': 67054, 'loss/train': 0.13815519213676453} 11/07/2021 06:40:09 - INFO - __main__ - Step 67056: {'lr': 0.0002971841125242312, 'samples': 12874752, 'steps': 67055, 'loss/train': 1.5455348491668701} 11/07/2021 06:40:09 - INFO - __main__ - Step 67057: {'lr': 0.0002971789011432534, 'samples': 12874944, 'steps': 67056, 'loss/train': 1.1024731397628784} 11/07/2021 06:40:09 - INFO - __main__ - Step 67058: {'lr': 0.0002971736897410174, 'samples': 12875136, 'steps': 67057, 'loss/train': 1.2483396530151367} 11/07/2021 06:40:10 - INFO - __main__ - Step 67059: {'lr': 0.0002971684783175258, 'samples': 12875328, 'steps': 67058, 'loss/train': 1.2752326726913452} 11/07/2021 06:40:11 - INFO - __main__ - Step 67060: {'lr': 0.0002971632668727808, 'samples': 12875520, 'steps': 67059, 'loss/train': 1.6183030605316162} 11/07/2021 06:40:11 - INFO - __main__ - Step 67061: {'lr': 0.0002971580554067847, 'samples': 12875712, 'steps': 67060, 'loss/train': 1.091696858406067} 11/07/2021 06:40:11 - INFO - __main__ - Step 67062: {'lr': 0.0002971528439195399, 'samples': 12875904, 'steps': 67061, 'loss/train': 1.752444863319397} 11/07/2021 06:40:12 - INFO - __main__ - Step 67063: {'lr': 0.0002971476324110488, 'samples': 12876096, 'steps': 67062, 'loss/train': 1.3162178993225098} 11/07/2021 06:40:13 - INFO - __main__ - Step 67064: {'lr': 0.0002971424208813137, 'samples': 12876288, 'steps': 67063, 'loss/train': 0.7254301905632019} 11/07/2021 06:40:13 - INFO - __main__ - Step 67065: {'lr': 0.00029713720933033697, 'samples': 12876480, 'steps': 67064, 'loss/train': 1.3384662866592407} 11/07/2021 06:40:13 - INFO - __main__ - Step 67066: {'lr': 0.00029713199775812093, 'samples': 12876672, 'steps': 67065, 'loss/train': 1.5109120607376099} 11/07/2021 06:40:14 - INFO - __main__ - Step 67067: {'lr': 0.0002971267861646679, 'samples': 12876864, 'steps': 67066, 'loss/train': 1.4875375032424927} 11/07/2021 06:40:14 - INFO - __main__ - Step 67068: {'lr': 0.0002971215745499803, 'samples': 12877056, 'steps': 67067, 'loss/train': 1.4645947217941284} 11/07/2021 06:40:15 - INFO - __main__ - Step 67069: {'lr': 0.0002971163629140604, 'samples': 12877248, 'steps': 67068, 'loss/train': 1.5053728818893433} 11/07/2021 06:40:16 - INFO - __main__ - Step 67070: {'lr': 0.00029711115125691066, 'samples': 12877440, 'steps': 67069, 'loss/train': 0.9668812155723572} 11/07/2021 06:40:16 - INFO - __main__ - Step 67071: {'lr': 0.0002971059395785334, 'samples': 12877632, 'steps': 67070, 'loss/train': 2.160822868347168} 11/07/2021 06:40:16 - INFO - __main__ - Step 67072: {'lr': 0.0002971007278789308, 'samples': 12877824, 'steps': 67071, 'loss/train': 1.3169044256210327} 11/07/2021 06:40:17 - INFO - __main__ - Step 67073: {'lr': 0.00029709551615810545, 'samples': 12878016, 'steps': 67072, 'loss/train': 0.8280175924301147} 11/07/2021 06:40:17 - INFO - __main__ - Step 67074: {'lr': 0.00029709030441605954, 'samples': 12878208, 'steps': 67073, 'loss/train': 0.852584719657898} 11/07/2021 06:40:18 - INFO - __main__ - Step 67075: {'lr': 0.0002970850926527954, 'samples': 12878400, 'steps': 67074, 'loss/train': 1.0725582838058472} 11/07/2021 06:40:18 - INFO - __main__ - Step 67076: {'lr': 0.0002970798808683156, 'samples': 12878592, 'steps': 67075, 'loss/train': 0.741349995136261} 11/07/2021 06:40:19 - INFO - __main__ - Step 67077: {'lr': 0.00029707466906262224, 'samples': 12878784, 'steps': 67076, 'loss/train': 1.2384543418884277} 11/07/2021 06:40:19 - INFO - __main__ - Step 67078: {'lr': 0.0002970694572357178, 'samples': 12878976, 'steps': 67077, 'loss/train': 1.2456824779510498} 11/07/2021 06:40:19 - INFO - __main__ - Step 67079: {'lr': 0.00029706424538760454, 'samples': 12879168, 'steps': 67078, 'loss/train': 1.463236689567566} 11/07/2021 06:40:20 - INFO - __main__ - Step 67080: {'lr': 0.00029705903351828484, 'samples': 12879360, 'steps': 67079, 'loss/train': 1.626608967781067} 11/07/2021 06:40:21 - INFO - __main__ - Step 67081: {'lr': 0.0002970538216277611, 'samples': 12879552, 'steps': 67080, 'loss/train': 1.7500168085098267} 11/07/2021 06:40:21 - INFO - __main__ - Step 67082: {'lr': 0.00029704860971603564, 'samples': 12879744, 'steps': 67081, 'loss/train': 1.140771508216858} 11/07/2021 06:40:21 - INFO - __main__ - Step 67083: {'lr': 0.0002970433977831108, 'samples': 12879936, 'steps': 67082, 'loss/train': 1.4952352046966553} 11/07/2021 06:40:22 - INFO - __main__ - Step 67084: {'lr': 0.0002970381858289889, 'samples': 12880128, 'steps': 67083, 'loss/train': 1.0424442291259766} 11/07/2021 06:40:23 - INFO - __main__ - Step 67085: {'lr': 0.0002970329738536723, 'samples': 12880320, 'steps': 67084, 'loss/train': 1.6111092567443848} 11/07/2021 06:40:23 - INFO - __main__ - Step 67086: {'lr': 0.00029702776185716346, 'samples': 12880512, 'steps': 67085, 'loss/train': 1.255607008934021} 11/07/2021 06:40:24 - INFO - __main__ - Step 67087: {'lr': 0.0002970225498394646, 'samples': 12880704, 'steps': 67086, 'loss/train': 1.5427439212799072} 11/07/2021 06:40:24 - INFO - __main__ - Step 67088: {'lr': 0.00029701733780057815, 'samples': 12880896, 'steps': 67087, 'loss/train': 1.1500351428985596} 11/07/2021 06:40:24 - INFO - __main__ - Step 67089: {'lr': 0.00029701212574050637, 'samples': 12881088, 'steps': 67088, 'loss/train': 1.2216719388961792} 11/07/2021 06:40:25 - INFO - __main__ - Step 67090: {'lr': 0.0002970069136592516, 'samples': 12881280, 'steps': 67089, 'loss/train': 0.9370459318161011} 11/07/2021 06:40:26 - INFO - __main__ - Step 67091: {'lr': 0.00029700170155681625, 'samples': 12881472, 'steps': 67090, 'loss/train': 1.2694952487945557} 11/07/2021 06:40:26 - INFO - __main__ - Step 67092: {'lr': 0.0002969964894332027, 'samples': 12881664, 'steps': 67091, 'loss/train': 1.2587406635284424} 11/07/2021 06:40:26 - INFO - __main__ - Step 67093: {'lr': 0.0002969912772884133, 'samples': 12881856, 'steps': 67092, 'loss/train': 1.2633686065673828} 11/07/2021 06:40:27 - INFO - __main__ - Step 67094: {'lr': 0.0002969860651224503, 'samples': 12882048, 'steps': 67093, 'loss/train': 1.5318396091461182} 11/07/2021 06:40:28 - INFO - __main__ - Step 67095: {'lr': 0.0002969808529353161, 'samples': 12882240, 'steps': 67094, 'loss/train': 1.5568805932998657} 11/07/2021 06:40:28 - INFO - __main__ - Step 67096: {'lr': 0.000296975640727013, 'samples': 12882432, 'steps': 67095, 'loss/train': 1.4741166830062866} 11/07/2021 06:40:28 - INFO - __main__ - Step 67097: {'lr': 0.00029697042849754346, 'samples': 12882624, 'steps': 67096, 'loss/train': 2.140047311782837} 11/07/2021 06:40:29 - INFO - __main__ - Step 67098: {'lr': 0.0002969652162469098, 'samples': 12882816, 'steps': 67097, 'loss/train': 1.3472776412963867} 11/07/2021 06:40:29 - INFO - __main__ - Step 67099: {'lr': 0.0002969600039751143, 'samples': 12883008, 'steps': 67098, 'loss/train': 1.1960976123809814} 11/07/2021 06:40:30 - INFO - __main__ - Step 67100: {'lr': 0.0002969547916821593, 'samples': 12883200, 'steps': 67099, 'loss/train': 0.6779386401176453} 11/07/2021 06:40:31 - INFO - __main__ - Step 67101: {'lr': 0.00029694957936804726, 'samples': 12883392, 'steps': 67100, 'loss/train': 2.004533052444458} 11/07/2021 06:40:31 - INFO - __main__ - Step 67102: {'lr': 0.0002969443670327805, 'samples': 12883584, 'steps': 67101, 'loss/train': 3.8656744956970215} 11/07/2021 06:40:31 - INFO - __main__ - Step 67103: {'lr': 0.0002969391546763612, 'samples': 12883776, 'steps': 67102, 'loss/train': 1.687469482421875} 11/07/2021 06:40:32 - INFO - __main__ - Step 67104: {'lr': 0.000296933942298792, 'samples': 12883968, 'steps': 67103, 'loss/train': 1.7009698152542114} 11/07/2021 06:40:32 - INFO - __main__ - Step 67105: {'lr': 0.000296928729900075, 'samples': 12884160, 'steps': 67104, 'loss/train': 1.6817033290863037} 11/07/2021 06:40:33 - INFO - __main__ - Step 67106: {'lr': 0.0002969235174802127, 'samples': 12884352, 'steps': 67105, 'loss/train': 1.4158658981323242} 11/07/2021 06:40:33 - INFO - __main__ - Step 67107: {'lr': 0.0002969183050392073, 'samples': 12884544, 'steps': 67106, 'loss/train': 1.2722418308258057} 11/07/2021 06:40:34 - INFO - __main__ - Step 67108: {'lr': 0.0002969130925770613, 'samples': 12884736, 'steps': 67107, 'loss/train': 1.0107451677322388} 11/07/2021 06:40:34 - INFO - __main__ - Step 67109: {'lr': 0.00029690788009377694, 'samples': 12884928, 'steps': 67108, 'loss/train': 1.1001718044281006} 11/07/2021 06:40:34 - INFO - __main__ - Step 67110: {'lr': 0.0002969026675893566, 'samples': 12885120, 'steps': 67109, 'loss/train': 0.7463164925575256} 11/07/2021 06:40:35 - INFO - __main__ - Step 67111: {'lr': 0.00029689745506380273, 'samples': 12885312, 'steps': 67110, 'loss/train': 1.0158376693725586} 11/07/2021 06:40:36 - INFO - __main__ - Step 67112: {'lr': 0.00029689224251711754, 'samples': 12885504, 'steps': 67111, 'loss/train': 1.303270697593689} 11/07/2021 06:40:36 - INFO - __main__ - Step 67113: {'lr': 0.0002968870299493034, 'samples': 12885696, 'steps': 67112, 'loss/train': 1.3065375089645386} 11/07/2021 06:40:36 - INFO - __main__ - Step 67114: {'lr': 0.00029688181736036275, 'samples': 12885888, 'steps': 67113, 'loss/train': 1.5816646814346313} 11/07/2021 06:40:37 - INFO - __main__ - Step 67115: {'lr': 0.0002968766047502978, 'samples': 12886080, 'steps': 67114, 'loss/train': 1.5420101881027222} 11/07/2021 06:40:38 - INFO - __main__ - Step 67116: {'lr': 0.00029687139211911104, 'samples': 12886272, 'steps': 67115, 'loss/train': 0.4222649931907654} 11/07/2021 06:40:38 - INFO - __main__ - Step 67117: {'lr': 0.0002968661794668047, 'samples': 12886464, 'steps': 67116, 'loss/train': 1.5341094732284546} 11/07/2021 06:40:39 - INFO - __main__ - Step 67118: {'lr': 0.0002968609667933813, 'samples': 12886656, 'steps': 67117, 'loss/train': 1.416428804397583} 11/07/2021 06:40:39 - INFO - __main__ - Step 67119: {'lr': 0.0002968557540988429, 'samples': 12886848, 'steps': 67118, 'loss/train': 1.2048025131225586} 11/07/2021 06:40:39 - INFO - __main__ - Step 67120: {'lr': 0.0002968505413831921, 'samples': 12887040, 'steps': 67119, 'loss/train': 0.3003278970718384} 11/07/2021 06:40:40 - INFO - __main__ - Step 67121: {'lr': 0.0002968453286464312, 'samples': 12887232, 'steps': 67120, 'loss/train': 3.2875659465789795} 11/07/2021 06:40:41 - INFO - __main__ - Step 67122: {'lr': 0.00029684011588856246, 'samples': 12887424, 'steps': 67121, 'loss/train': 1.3200675249099731} 11/07/2021 06:40:41 - INFO - __main__ - Step 67123: {'lr': 0.0002968349031095883, 'samples': 12887616, 'steps': 67122, 'loss/train': 1.4620637893676758} 11/07/2021 06:40:42 - INFO - __main__ - Step 67124: {'lr': 0.0002968296903095111, 'samples': 12887808, 'steps': 67123, 'loss/train': 1.8753186464309692} 11/07/2021 06:40:42 - INFO - __main__ - Step 67125: {'lr': 0.00029682447748833316, 'samples': 12888000, 'steps': 67124, 'loss/train': 1.4511147737503052} 11/07/2021 06:40:43 - INFO - __main__ - Step 67126: {'lr': 0.0002968192646460568, 'samples': 12888192, 'steps': 67125, 'loss/train': 1.5618705749511719} 11/07/2021 06:40:43 - INFO - __main__ - Step 67127: {'lr': 0.0002968140517826844, 'samples': 12888384, 'steps': 67126, 'loss/train': 1.1391730308532715} 11/07/2021 06:40:44 - INFO - __main__ - Step 67128: {'lr': 0.00029680883889821833, 'samples': 12888576, 'steps': 67127, 'loss/train': 2.0291786193847656} 11/07/2021 06:40:44 - INFO - __main__ - Step 67129: {'lr': 0.0002968036259926609, 'samples': 12888768, 'steps': 67128, 'loss/train': 1.1465319395065308} 11/07/2021 06:40:44 - INFO - __main__ - Step 67130: {'lr': 0.00029679841306601447, 'samples': 12888960, 'steps': 67129, 'loss/train': 1.7818641662597656} 11/07/2021 06:40:45 - INFO - __main__ - Step 67131: {'lr': 0.0002967932001182815, 'samples': 12889152, 'steps': 67130, 'loss/train': 1.3583564758300781} 11/07/2021 06:40:46 - INFO - __main__ - Step 67132: {'lr': 0.0002967879871494641, 'samples': 12889344, 'steps': 67131, 'loss/train': 0.965801477432251} 11/07/2021 06:40:46 - INFO - __main__ - Step 67133: {'lr': 0.00029678277415956484, 'samples': 12889536, 'steps': 67132, 'loss/train': 1.4921562671661377} 11/07/2021 06:40:46 - INFO - __main__ - Step 67134: {'lr': 0.0002967775611485859, 'samples': 12889728, 'steps': 67133, 'loss/train': 1.178891897201538} 11/07/2021 06:40:47 - INFO - __main__ - Step 67135: {'lr': 0.00029677234811652974, 'samples': 12889920, 'steps': 67134, 'loss/train': 1.296730399131775} 11/07/2021 06:40:47 - INFO - __main__ - Step 67136: {'lr': 0.00029676713506339875, 'samples': 12890112, 'steps': 67135, 'loss/train': 1.1068975925445557} 11/07/2021 06:40:48 - INFO - __main__ - Step 67137: {'lr': 0.00029676192198919516, 'samples': 12890304, 'steps': 67136, 'loss/train': 0.30180278420448303} 11/07/2021 06:40:49 - INFO - __main__ - Step 67138: {'lr': 0.00029675670889392144, 'samples': 12890496, 'steps': 67137, 'loss/train': 4.865960121154785} 11/07/2021 06:40:49 - INFO - __main__ - Step 67139: {'lr': 0.00029675149577757973, 'samples': 12890688, 'steps': 67138, 'loss/train': 5.068638324737549} 11/07/2021 06:40:49 - INFO - __main__ - Step 67140: {'lr': 0.0002967462826401726, 'samples': 12890880, 'steps': 67139, 'loss/train': 1.6736924648284912} 11/07/2021 06:40:50 - INFO - __main__ - Step 67141: {'lr': 0.00029674106948170234, 'samples': 12891072, 'steps': 67140, 'loss/train': 0.8406650424003601} 11/07/2021 06:40:50 - INFO - __main__ - Step 67142: {'lr': 0.0002967358563021712, 'samples': 12891264, 'steps': 67141, 'loss/train': 1.5773099660873413} 11/07/2021 06:40:51 - INFO - __main__ - Step 67143: {'lr': 0.00029673064310158163, 'samples': 12891456, 'steps': 67142, 'loss/train': 1.290140151977539} 11/07/2021 06:40:51 - INFO - __main__ - Step 67144: {'lr': 0.000296725429879936, 'samples': 12891648, 'steps': 67143, 'loss/train': 1.7093560695648193} 11/07/2021 06:40:52 - INFO - __main__ - Step 67145: {'lr': 0.0002967202166372366, 'samples': 12891840, 'steps': 67144, 'loss/train': 1.4479166269302368} 11/07/2021 06:40:52 - INFO - __main__ - Step 67146: {'lr': 0.00029671500337348576, 'samples': 12892032, 'steps': 67145, 'loss/train': 1.6327474117279053} 11/07/2021 06:40:53 - INFO - __main__ - Step 67147: {'lr': 0.00029670979008868586, 'samples': 12892224, 'steps': 67146, 'loss/train': 1.4773674011230469} 11/07/2021 06:40:53 - INFO - __main__ - Step 67148: {'lr': 0.0002967045767828393, 'samples': 12892416, 'steps': 67147, 'loss/train': 1.439872145652771} 11/07/2021 06:40:54 - INFO - __main__ - Step 67149: {'lr': 0.0002966993634559483, 'samples': 12892608, 'steps': 67148, 'loss/train': 1.304653286933899} 11/07/2021 06:40:54 - INFO - __main__ - Step 67150: {'lr': 0.0002966941501080154, 'samples': 12892800, 'steps': 67149, 'loss/train': 1.4943532943725586} 11/07/2021 06:40:55 - INFO - __main__ - Step 67151: {'lr': 0.00029668893673904275, 'samples': 12892992, 'steps': 67150, 'loss/train': 1.1438225507736206} 11/07/2021 06:40:55 - INFO - __main__ - Step 67152: {'lr': 0.0002966837233490328, 'samples': 12893184, 'steps': 67151, 'loss/train': 1.9694452285766602} 11/07/2021 06:40:56 - INFO - __main__ - Step 67153: {'lr': 0.0002966785099379879, 'samples': 12893376, 'steps': 67152, 'loss/train': 1.4582796096801758} 11/07/2021 06:40:56 - INFO - __main__ - Step 67154: {'lr': 0.00029667329650591033, 'samples': 12893568, 'steps': 67153, 'loss/train': 1.446618676185608} 11/07/2021 06:40:57 - INFO - __main__ - Step 67155: {'lr': 0.0002966680830528026, 'samples': 12893760, 'steps': 67154, 'loss/train': 1.5348787307739258} 11/07/2021 06:40:57 - INFO - __main__ - Step 67156: {'lr': 0.00029666286957866683, 'samples': 12893952, 'steps': 67155, 'loss/train': 1.5219265222549438} 11/07/2021 06:40:57 - INFO - __main__ - Step 67157: {'lr': 0.00029665765608350553, 'samples': 12894144, 'steps': 67156, 'loss/train': 0.8382376432418823} 11/07/2021 06:40:58 - INFO - __main__ - Step 67158: {'lr': 0.00029665244256732107, 'samples': 12894336, 'steps': 67157, 'loss/train': 1.1435331106185913} 11/07/2021 06:40:59 - INFO - __main__ - Step 67159: {'lr': 0.0002966472290301157, 'samples': 12894528, 'steps': 67158, 'loss/train': 1.6649951934814453} 11/07/2021 06:40:59 - INFO - __main__ - Step 67160: {'lr': 0.0002966420154718918, 'samples': 12894720, 'steps': 67159, 'loss/train': 1.4424943923950195} 11/07/2021 06:40:59 - INFO - __main__ - Step 67161: {'lr': 0.00029663680189265175, 'samples': 12894912, 'steps': 67160, 'loss/train': 1.4450571537017822} 11/07/2021 06:41:00 - INFO - __main__ - Step 67162: {'lr': 0.0002966315882923978, 'samples': 12895104, 'steps': 67161, 'loss/train': 1.3436044454574585} 11/07/2021 06:41:00 - INFO - __main__ - Step 67163: {'lr': 0.0002966263746711325, 'samples': 12895296, 'steps': 67162, 'loss/train': 1.6888748407363892} 11/07/2021 06:41:01 - INFO - __main__ - Step 67164: {'lr': 0.00029662116102885795, 'samples': 12895488, 'steps': 67163, 'loss/train': 1.0388453006744385} 11/07/2021 06:41:01 - INFO - __main__ - Step 67165: {'lr': 0.00029661594736557674, 'samples': 12895680, 'steps': 67164, 'loss/train': 1.759570598602295} 11/07/2021 06:41:02 - INFO - __main__ - Step 67166: {'lr': 0.00029661073368129106, 'samples': 12895872, 'steps': 67165, 'loss/train': 1.4399693012237549} 11/07/2021 06:41:02 - INFO - __main__ - Step 67167: {'lr': 0.00029660551997600325, 'samples': 12896064, 'steps': 67166, 'loss/train': 1.3128219842910767} 11/07/2021 06:41:03 - INFO - __main__ - Step 67168: {'lr': 0.00029660030624971574, 'samples': 12896256, 'steps': 67167, 'loss/train': 1.5974327325820923} 11/07/2021 06:41:04 - INFO - __main__ - Step 67169: {'lr': 0.0002965950925024308, 'samples': 12896448, 'steps': 67168, 'loss/train': 1.3574655055999756} 11/07/2021 06:41:04 - INFO - __main__ - Step 67170: {'lr': 0.0002965898787341509, 'samples': 12896640, 'steps': 67169, 'loss/train': 1.4396729469299316} 11/07/2021 06:41:04 - INFO - __main__ - Step 67171: {'lr': 0.00029658466494487837, 'samples': 12896832, 'steps': 67170, 'loss/train': 1.0462881326675415} 11/07/2021 06:41:05 - INFO - __main__ - Step 67172: {'lr': 0.0002965794511346155, 'samples': 12897024, 'steps': 67171, 'loss/train': 1.4523100852966309} 11/07/2021 06:41:05 - INFO - __main__ - Step 67173: {'lr': 0.0002965742373033646, 'samples': 12897216, 'steps': 67172, 'loss/train': 1.2295817136764526} 11/07/2021 06:41:06 - INFO - __main__ - Step 67174: {'lr': 0.00029656902345112803, 'samples': 12897408, 'steps': 67173, 'loss/train': 1.5333560705184937} 11/07/2021 06:41:06 - INFO - __main__ - Step 67175: {'lr': 0.0002965638095779082, 'samples': 12897600, 'steps': 67174, 'loss/train': 1.602988362312317} 11/07/2021 06:41:07 - INFO - __main__ - Step 67176: {'lr': 0.0002965585956837075, 'samples': 12897792, 'steps': 67175, 'loss/train': 1.123441219329834} 11/07/2021 06:41:07 - INFO - __main__ - Step 67177: {'lr': 0.0002965533817685281, 'samples': 12897984, 'steps': 67176, 'loss/train': 1.2390795946121216} 11/07/2021 06:41:07 - INFO - __main__ - Step 67178: {'lr': 0.0002965481678323726, 'samples': 12898176, 'steps': 67177, 'loss/train': 1.4082040786743164} 11/07/2021 06:41:08 - INFO - __main__ - Step 67179: {'lr': 0.0002965429538752431, 'samples': 12898368, 'steps': 67178, 'loss/train': 1.0656118392944336} 11/07/2021 06:41:09 - INFO - __main__ - Step 67180: {'lr': 0.00029653773989714213, 'samples': 12898560, 'steps': 67179, 'loss/train': 1.475760579109192} 11/07/2021 06:41:09 - INFO - __main__ - Step 67181: {'lr': 0.0002965325258980719, 'samples': 12898752, 'steps': 67180, 'loss/train': 1.405152678489685} 11/07/2021 06:41:10 - INFO - __main__ - Step 67182: {'lr': 0.0002965273118780349, 'samples': 12898944, 'steps': 67181, 'loss/train': 1.1664353609085083} 11/07/2021 06:41:10 - INFO - __main__ - Step 67183: {'lr': 0.00029652209783703336, 'samples': 12899136, 'steps': 67182, 'loss/train': 1.329433560371399} 11/07/2021 06:41:11 - INFO - __main__ - Step 67184: {'lr': 0.00029651688377506976, 'samples': 12899328, 'steps': 67183, 'loss/train': 1.6894463300704956} 11/07/2021 06:41:11 - INFO - __main__ - Step 67185: {'lr': 0.0002965116696921463, 'samples': 12899520, 'steps': 67184, 'loss/train': 0.9597010612487793} 11/07/2021 06:41:12 - INFO - __main__ - Step 67186: {'lr': 0.00029650645558826545, 'samples': 12899712, 'steps': 67185, 'loss/train': 1.1005011796951294} 11/07/2021 06:41:12 - INFO - __main__ - Step 67187: {'lr': 0.0002965012414634295, 'samples': 12899904, 'steps': 67186, 'loss/train': 1.417244553565979} 11/07/2021 06:41:12 - INFO - __main__ - Step 67188: {'lr': 0.00029649602731764076, 'samples': 12900096, 'steps': 67187, 'loss/train': 1.6728702783584595} 11/07/2021 06:41:13 - INFO - __main__ - Step 67189: {'lr': 0.00029649081315090165, 'samples': 12900288, 'steps': 67188, 'loss/train': 1.3391772508621216} 11/07/2021 06:41:14 - INFO - __main__ - Step 67190: {'lr': 0.00029648559896321445, 'samples': 12900480, 'steps': 67189, 'loss/train': 1.460227608680725} 11/07/2021 06:41:14 - INFO - __main__ - Step 67191: {'lr': 0.0002964803847545816, 'samples': 12900672, 'steps': 67190, 'loss/train': 1.6664870977401733} 11/07/2021 06:41:14 - INFO - __main__ - Step 67192: {'lr': 0.0002964751705250055, 'samples': 12900864, 'steps': 67191, 'loss/train': 1.8463743925094604} 11/07/2021 06:41:15 - INFO - __main__ - Step 67193: {'lr': 0.0002964699562744883, 'samples': 12901056, 'steps': 67192, 'loss/train': 1.2780705690383911} 11/07/2021 06:41:15 - INFO - __main__ - Step 67194: {'lr': 0.00029646474200303245, 'samples': 12901248, 'steps': 67193, 'loss/train': 1.8666656017303467} 11/07/2021 06:41:16 - INFO - __main__ - Step 67195: {'lr': 0.0002964595277106403, 'samples': 12901440, 'steps': 67194, 'loss/train': 1.2514108419418335} 11/07/2021 06:41:16 - INFO - __main__ - Step 67196: {'lr': 0.00029645431339731426, 'samples': 12901632, 'steps': 67195, 'loss/train': 1.2197933197021484} 11/07/2021 06:41:17 - INFO - __main__ - Step 67197: {'lr': 0.0002964490990630566, 'samples': 12901824, 'steps': 67196, 'loss/train': 1.5168050527572632} 11/07/2021 06:41:17 - INFO - __main__ - Step 67198: {'lr': 0.0002964438847078697, 'samples': 12902016, 'steps': 67197, 'loss/train': 1.105163812637329} 11/07/2021 06:41:17 - INFO - __main__ - Step 67199: {'lr': 0.0002964386703317559, 'samples': 12902208, 'steps': 67198, 'loss/train': 1.2078237533569336} 11/07/2021 06:41:18 - INFO - __main__ - Step 67200: {'lr': 0.0002964334559347175, 'samples': 12902400, 'steps': 67199, 'loss/train': 1.4502612352371216} 11/07/2021 06:41:19 - INFO - __main__ - Step 67201: {'lr': 0.000296428241516757, 'samples': 12902592, 'steps': 67200, 'loss/train': 1.0929640531539917} 11/07/2021 06:41:19 - INFO - __main__ - Step 67202: {'lr': 0.0002964230270778766, 'samples': 12902784, 'steps': 67201, 'loss/train': 1.4083642959594727} 11/07/2021 06:41:19 - INFO - __main__ - Step 67203: {'lr': 0.00029641781261807867, 'samples': 12902976, 'steps': 67202, 'loss/train': 1.3141698837280273} 11/07/2021 06:41:20 - INFO - __main__ - Step 67204: {'lr': 0.0002964125981373656, 'samples': 12903168, 'steps': 67203, 'loss/train': 0.9727628231048584} 11/07/2021 06:41:21 - INFO - __main__ - Step 67205: {'lr': 0.0002964073836357398, 'samples': 12903360, 'steps': 67204, 'loss/train': 1.1226649284362793} 11/07/2021 06:41:21 - INFO - __main__ - Step 67206: {'lr': 0.0002964021691132035, 'samples': 12903552, 'steps': 67205, 'loss/train': 1.375679850578308} 11/07/2021 06:41:22 - INFO - __main__ - Step 67207: {'lr': 0.00029639695456975905, 'samples': 12903744, 'steps': 67206, 'loss/train': 1.4306637048721313} 11/07/2021 06:41:22 - INFO - __main__ - Step 67208: {'lr': 0.0002963917400054089, 'samples': 12903936, 'steps': 67207, 'loss/train': 1.5436302423477173} 11/07/2021 06:41:22 - INFO - __main__ - Step 67209: {'lr': 0.0002963865254201553, 'samples': 12904128, 'steps': 67208, 'loss/train': 1.230542778968811} 11/07/2021 06:41:23 - INFO - __main__ - Step 67210: {'lr': 0.0002963813108140007, 'samples': 12904320, 'steps': 67209, 'loss/train': 1.15364670753479} 11/07/2021 06:41:24 - INFO - __main__ - Step 67211: {'lr': 0.00029637609618694745, 'samples': 12904512, 'steps': 67210, 'loss/train': 1.2815937995910645} 11/07/2021 06:41:24 - INFO - __main__ - Step 67212: {'lr': 0.0002963708815389978, 'samples': 12904704, 'steps': 67211, 'loss/train': 1.393286108970642} 11/07/2021 06:41:24 - INFO - __main__ - Step 67213: {'lr': 0.0002963656668701541, 'samples': 12904896, 'steps': 67212, 'loss/train': 1.4326157569885254} 11/07/2021 06:41:25 - INFO - __main__ - Step 67214: {'lr': 0.0002963604521804187, 'samples': 12905088, 'steps': 67213, 'loss/train': 0.7845842242240906} 11/07/2021 06:41:26 - INFO - __main__ - Step 67215: {'lr': 0.0002963552374697941, 'samples': 12905280, 'steps': 67214, 'loss/train': 1.5302273035049438} 11/07/2021 06:41:26 - INFO - __main__ - Step 67216: {'lr': 0.0002963500227382826, 'samples': 12905472, 'steps': 67215, 'loss/train': 1.1767405271530151} 11/07/2021 06:41:26 - INFO - __main__ - Step 67217: {'lr': 0.00029634480798588635, 'samples': 12905664, 'steps': 67216, 'loss/train': 1.5475831031799316} 11/07/2021 06:41:27 - INFO - __main__ - Step 67218: {'lr': 0.00029633959321260795, 'samples': 12905856, 'steps': 67217, 'loss/train': 1.439038872718811} 11/07/2021 06:41:27 - INFO - __main__ - Step 67219: {'lr': 0.00029633437841844956, 'samples': 12906048, 'steps': 67218, 'loss/train': 1.5945141315460205} 11/07/2021 06:41:28 - INFO - __main__ - Step 67220: {'lr': 0.00029632916360341366, 'samples': 12906240, 'steps': 67219, 'loss/train': 1.0888452529907227} 11/07/2021 06:41:29 - INFO - __main__ - Step 67221: {'lr': 0.0002963239487675025, 'samples': 12906432, 'steps': 67220, 'loss/train': 1.3889251947402954} 11/07/2021 06:41:29 - INFO - __main__ - Step 67222: {'lr': 0.0002963187339107186, 'samples': 12906624, 'steps': 67221, 'loss/train': 1.1052953004837036} 11/07/2021 06:41:29 - INFO - __main__ - Step 67223: {'lr': 0.0002963135190330641, 'samples': 12906816, 'steps': 67222, 'loss/train': 1.3654420375823975} 11/07/2021 06:41:30 - INFO - __main__ - Step 67224: {'lr': 0.00029630830413454145, 'samples': 12907008, 'steps': 67223, 'loss/train': 2.7563424110412598} 11/07/2021 06:41:31 - INFO - __main__ - Step 67225: {'lr': 0.00029630308921515305, 'samples': 12907200, 'steps': 67224, 'loss/train': 1.0720373392105103} 11/07/2021 06:41:31 - INFO - __main__ - Step 67226: {'lr': 0.0002962978742749011, 'samples': 12907392, 'steps': 67225, 'loss/train': 2.3472447395324707} 11/07/2021 06:41:31 - INFO - __main__ - Step 67227: {'lr': 0.00029629265931378816, 'samples': 12907584, 'steps': 67226, 'loss/train': 1.7221472263336182} 11/07/2021 06:41:32 - INFO - __main__ - Step 67228: {'lr': 0.00029628744433181635, 'samples': 12907776, 'steps': 67227, 'loss/train': 1.3805298805236816} 11/07/2021 06:41:32 - INFO - __main__ - Step 67229: {'lr': 0.0002962822293289882, 'samples': 12907968, 'steps': 67228, 'loss/train': 1.3620731830596924} 11/07/2021 06:41:32 - INFO - __main__ - Step 67230: {'lr': 0.00029627701430530597, 'samples': 12908160, 'steps': 67229, 'loss/train': 1.4415963888168335} 11/07/2021 06:41:33 - INFO - __main__ - Step 67231: {'lr': 0.000296271799260772, 'samples': 12908352, 'steps': 67230, 'loss/train': 0.8085143566131592} 11/07/2021 06:41:34 - INFO - __main__ - Step 67232: {'lr': 0.00029626658419538873, 'samples': 12908544, 'steps': 67231, 'loss/train': 1.427819013595581} 11/07/2021 06:41:34 - INFO - __main__ - Step 67233: {'lr': 0.00029626136910915847, 'samples': 12908736, 'steps': 67232, 'loss/train': 1.3961968421936035} 11/07/2021 06:41:34 - INFO - __main__ - Step 67234: {'lr': 0.0002962561540020835, 'samples': 12908928, 'steps': 67233, 'loss/train': 1.192178726196289} 11/07/2021 06:41:35 - INFO - __main__ - Step 67235: {'lr': 0.0002962509388741662, 'samples': 12909120, 'steps': 67234, 'loss/train': 1.2412205934524536} 11/07/2021 06:41:36 - INFO - __main__ - Step 67236: {'lr': 0.0002962457237254089, 'samples': 12909312, 'steps': 67235, 'loss/train': 1.3112506866455078} 11/07/2021 06:41:36 - INFO - __main__ - Step 67237: {'lr': 0.0002962405085558141, 'samples': 12909504, 'steps': 67236, 'loss/train': 1.1215780973434448} 11/07/2021 06:41:37 - INFO - __main__ - Step 67238: {'lr': 0.00029623529336538396, 'samples': 12909696, 'steps': 67237, 'loss/train': 1.6788434982299805} 11/07/2021 06:41:37 - INFO - __main__ - Step 67239: {'lr': 0.000296230078154121, 'samples': 12909888, 'steps': 67238, 'loss/train': 1.202149748802185} 11/07/2021 06:41:37 - INFO - __main__ - Step 67240: {'lr': 0.00029622486292202744, 'samples': 12910080, 'steps': 67239, 'loss/train': 1.5410884618759155} 11/07/2021 06:41:38 - INFO - __main__ - Step 67241: {'lr': 0.00029621964766910565, 'samples': 12910272, 'steps': 67240, 'loss/train': 1.4024691581726074} 11/07/2021 06:41:39 - INFO - __main__ - Step 67242: {'lr': 0.000296214432395358, 'samples': 12910464, 'steps': 67241, 'loss/train': 1.5753002166748047} 11/07/2021 06:41:39 - INFO - __main__ - Step 67243: {'lr': 0.00029620921710078686, 'samples': 12910656, 'steps': 67242, 'loss/train': 1.3172495365142822} 11/07/2021 06:41:39 - INFO - __main__ - Step 67244: {'lr': 0.00029620400178539453, 'samples': 12910848, 'steps': 67243, 'loss/train': 2.038566827774048} 11/07/2021 06:41:40 - INFO - __main__ - Step 67245: {'lr': 0.00029619878644918335, 'samples': 12911040, 'steps': 67244, 'loss/train': 1.3442645072937012} 11/07/2021 06:41:41 - INFO - __main__ - Step 67246: {'lr': 0.0002961935710921558, 'samples': 12911232, 'steps': 67245, 'loss/train': 1.2002062797546387} 11/07/2021 06:41:41 - INFO - __main__ - Step 67247: {'lr': 0.00029618835571431414, 'samples': 12911424, 'steps': 67246, 'loss/train': 1.1728612184524536} 11/07/2021 06:41:41 - INFO - __main__ - Step 67248: {'lr': 0.00029618314031566067, 'samples': 12911616, 'steps': 67247, 'loss/train': 1.6586601734161377} 11/07/2021 06:41:42 - INFO - __main__ - Step 67249: {'lr': 0.0002961779248961978, 'samples': 12911808, 'steps': 67248, 'loss/train': 1.7538111209869385} 11/07/2021 06:41:42 - INFO - __main__ - Step 67250: {'lr': 0.0002961727094559279, 'samples': 12912000, 'steps': 67249, 'loss/train': 1.819661021232605} 11/07/2021 06:41:43 - INFO - __main__ - Step 67251: {'lr': 0.00029616749399485323, 'samples': 12912192, 'steps': 67250, 'loss/train': 1.3368213176727295} 11/07/2021 06:41:43 - INFO - __main__ - Step 67252: {'lr': 0.0002961622785129763, 'samples': 12912384, 'steps': 67251, 'loss/train': 1.2311097383499146} 11/07/2021 06:41:44 - INFO - __main__ - Step 67253: {'lr': 0.00029615706301029925, 'samples': 12912576, 'steps': 67252, 'loss/train': 1.0393738746643066} 11/07/2021 06:41:44 - INFO - __main__ - Step 67254: {'lr': 0.00029615184748682456, 'samples': 12912768, 'steps': 67253, 'loss/train': 1.470147967338562} 11/07/2021 06:41:44 - INFO - __main__ - Step 67255: {'lr': 0.0002961466319425546, 'samples': 12912960, 'steps': 67254, 'loss/train': 1.4511679410934448} 11/07/2021 06:41:45 - INFO - __main__ - Step 67256: {'lr': 0.00029614141637749166, 'samples': 12913152, 'steps': 67255, 'loss/train': 1.540543556213379} 11/07/2021 06:41:46 - INFO - __main__ - Step 67257: {'lr': 0.00029613620079163805, 'samples': 12913344, 'steps': 67256, 'loss/train': 1.5947380065917969} 11/07/2021 06:41:46 - INFO - __main__ - Step 67258: {'lr': 0.00029613098518499627, 'samples': 12913536, 'steps': 67257, 'loss/train': 1.6932909488677979} 11/07/2021 06:41:46 - INFO - __main__ - Step 67259: {'lr': 0.0002961257695575686, 'samples': 12913728, 'steps': 67258, 'loss/train': 1.5311635732650757} 11/07/2021 06:41:47 - INFO - __main__ - Step 67260: {'lr': 0.0002961205539093573, 'samples': 12913920, 'steps': 67259, 'loss/train': 1.4215649366378784} 11/07/2021 06:41:47 - INFO - __main__ - Step 67261: {'lr': 0.0002961153382403648, 'samples': 12914112, 'steps': 67260, 'loss/train': 1.2219722270965576} 11/07/2021 06:41:48 - INFO - __main__ - Step 67262: {'lr': 0.00029611012255059346, 'samples': 12914304, 'steps': 67261, 'loss/train': 1.3862011432647705} 11/07/2021 06:41:49 - INFO - __main__ - Step 67263: {'lr': 0.0002961049068400456, 'samples': 12914496, 'steps': 67262, 'loss/train': 1.219513177871704} 11/07/2021 06:41:49 - INFO - __main__ - Step 67264: {'lr': 0.0002960996911087236, 'samples': 12914688, 'steps': 67263, 'loss/train': 1.6240166425704956} 11/07/2021 06:41:49 - INFO - __main__ - Step 67265: {'lr': 0.0002960944753566297, 'samples': 12914880, 'steps': 67264, 'loss/train': 2.1673576831817627} 11/07/2021 06:41:50 - INFO - __main__ - Step 67266: {'lr': 0.00029608925958376646, 'samples': 12915072, 'steps': 67265, 'loss/train': 1.2457538843154907} 11/07/2021 06:41:51 - INFO - __main__ - Step 67267: {'lr': 0.0002960840437901361, 'samples': 12915264, 'steps': 67266, 'loss/train': 1.6734851598739624} 11/07/2021 06:41:51 - INFO - __main__ - Step 67268: {'lr': 0.00029607882797574094, 'samples': 12915456, 'steps': 67267, 'loss/train': 2.0549368858337402} 11/07/2021 06:41:51 - INFO - __main__ - Step 67269: {'lr': 0.0002960736121405834, 'samples': 12915648, 'steps': 67268, 'loss/train': 1.8574023246765137} 11/07/2021 06:41:52 - INFO - __main__ - Step 67270: {'lr': 0.0002960683962846657, 'samples': 12915840, 'steps': 67269, 'loss/train': 1.3372911214828491} 11/07/2021 06:41:52 - INFO - __main__ - Step 67271: {'lr': 0.0002960631804079904, 'samples': 12916032, 'steps': 67270, 'loss/train': 1.6742159128189087} 11/07/2021 06:41:53 - INFO - __main__ - Step 67272: {'lr': 0.0002960579645105597, 'samples': 12916224, 'steps': 67271, 'loss/train': 0.9313632249832153} 11/07/2021 06:41:53 - INFO - __main__ - Step 67273: {'lr': 0.000296052748592376, 'samples': 12916416, 'steps': 67272, 'loss/train': 1.357178807258606} 11/07/2021 06:41:54 - INFO - __main__ - Step 67274: {'lr': 0.00029604753265344166, 'samples': 12916608, 'steps': 67273, 'loss/train': 1.3693057298660278} 11/07/2021 06:41:54 - INFO - __main__ - Step 67275: {'lr': 0.00029604231669375905, 'samples': 12916800, 'steps': 67274, 'loss/train': 1.69057035446167} 11/07/2021 06:41:54 - INFO - __main__ - Step 67276: {'lr': 0.00029603710071333033, 'samples': 12916992, 'steps': 67275, 'loss/train': 1.5522629022598267} 11/07/2021 06:41:55 - INFO - __main__ - Step 67277: {'lr': 0.0002960318847121581, 'samples': 12917184, 'steps': 67276, 'loss/train': 1.1994460821151733} 11/07/2021 06:41:56 - INFO - __main__ - Step 67278: {'lr': 0.00029602666869024463, 'samples': 12917376, 'steps': 67277, 'loss/train': 1.6545547246932983} 11/07/2021 06:41:56 - INFO - __main__ - Step 67279: {'lr': 0.0002960214526475923, 'samples': 12917568, 'steps': 67278, 'loss/train': 1.1809592247009277} 11/07/2021 06:41:56 - INFO - __main__ - Step 67280: {'lr': 0.00029601623658420337, 'samples': 12917760, 'steps': 67279, 'loss/train': 1.692123293876648} 11/07/2021 06:41:57 - INFO - __main__ - Step 67281: {'lr': 0.00029601102050008014, 'samples': 12917952, 'steps': 67280, 'loss/train': 1.5538049936294556} 11/07/2021 06:41:58 - INFO - __main__ - Step 67282: {'lr': 0.0002960058043952252, 'samples': 12918144, 'steps': 67281, 'loss/train': 1.46695077419281} 11/07/2021 06:41:58 - INFO - __main__ - Step 67283: {'lr': 0.00029600058826964067, 'samples': 12918336, 'steps': 67282, 'loss/train': 1.793686032295227} 11/07/2021 06:41:59 - INFO - __main__ - Step 67284: {'lr': 0.00029599537212332896, 'samples': 12918528, 'steps': 67283, 'loss/train': 1.779178500175476} 11/07/2021 06:41:59 - INFO - __main__ - Step 67285: {'lr': 0.00029599015595629247, 'samples': 12918720, 'steps': 67284, 'loss/train': 1.6840277910232544} 11/07/2021 06:41:59 - INFO - __main__ - Step 67286: {'lr': 0.00029598493976853356, 'samples': 12918912, 'steps': 67285, 'loss/train': 1.3315792083740234} 11/07/2021 06:42:00 - INFO - __main__ - Step 67287: {'lr': 0.0002959797235600545, 'samples': 12919104, 'steps': 67286, 'loss/train': 1.5449284315109253} 11/07/2021 06:42:01 - INFO - __main__ - Step 67288: {'lr': 0.0002959745073308577, 'samples': 12919296, 'steps': 67287, 'loss/train': 1.3196460008621216} 11/07/2021 06:42:01 - INFO - __main__ - Step 67289: {'lr': 0.0002959692910809456, 'samples': 12919488, 'steps': 67288, 'loss/train': 1.0435367822647095} 11/07/2021 06:42:02 - INFO - __main__ - Step 67290: {'lr': 0.0002959640748103203, 'samples': 12919680, 'steps': 67289, 'loss/train': 1.6259011030197144} 11/07/2021 06:42:02 - INFO - __main__ - Step 67291: {'lr': 0.00029595885851898434, 'samples': 12919872, 'steps': 67290, 'loss/train': 1.0763046741485596} 11/07/2021 06:42:03 - INFO - __main__ - Step 67292: {'lr': 0.00029595364220694003, 'samples': 12920064, 'steps': 67291, 'loss/train': 1.7945410013198853} 11/07/2021 06:42:03 - INFO - __main__ - Step 67293: {'lr': 0.0002959484258741898, 'samples': 12920256, 'steps': 67292, 'loss/train': 1.2260922193527222} 11/07/2021 06:42:04 - INFO - __main__ - Step 67294: {'lr': 0.00029594320952073584, 'samples': 12920448, 'steps': 67293, 'loss/train': 1.4928926229476929} 11/07/2021 06:42:04 - INFO - __main__ - Step 67295: {'lr': 0.00029593799314658057, 'samples': 12920640, 'steps': 67294, 'loss/train': 1.6437500715255737} 11/07/2021 06:42:05 - INFO - __main__ - Step 67296: {'lr': 0.00029593277675172636, 'samples': 12920832, 'steps': 67295, 'loss/train': 1.8065413236618042} 11/07/2021 06:42:05 - INFO - __main__ - Step 67297: {'lr': 0.0002959275603361755, 'samples': 12921024, 'steps': 67296, 'loss/train': 1.5035165548324585} 11/07/2021 06:42:05 - INFO - __main__ - Step 67298: {'lr': 0.00029592234389993045, 'samples': 12921216, 'steps': 67297, 'loss/train': 1.8658696413040161} 11/07/2021 06:42:06 - INFO - __main__ - Step 67299: {'lr': 0.0002959171274429936, 'samples': 12921408, 'steps': 67298, 'loss/train': 1.060956597328186} 11/07/2021 06:42:07 - INFO - __main__ - Step 67300: {'lr': 0.00029591191096536704, 'samples': 12921600, 'steps': 67299, 'loss/train': 1.4089699983596802} 11/07/2021 06:42:07 - INFO - __main__ - Step 67301: {'lr': 0.00029590669446705333, 'samples': 12921792, 'steps': 67300, 'loss/train': 1.1785938739776611} 11/07/2021 06:42:07 - INFO - __main__ - Step 67302: {'lr': 0.0002959014779480548, 'samples': 12921984, 'steps': 67301, 'loss/train': 1.0798202753067017} 11/07/2021 06:42:08 - INFO - __main__ - Step 67303: {'lr': 0.0002958962614083737, 'samples': 12922176, 'steps': 67302, 'loss/train': 1.4871125221252441} 11/07/2021 06:42:09 - INFO - __main__ - Step 67304: {'lr': 0.00029589104484801257, 'samples': 12922368, 'steps': 67303, 'loss/train': 1.338145136833191} 11/07/2021 06:42:09 - INFO - __main__ - Step 67305: {'lr': 0.0002958858282669735, 'samples': 12922560, 'steps': 67304, 'loss/train': 0.6897908449172974} 11/07/2021 06:42:10 - INFO - __main__ - Step 67306: {'lr': 0.0002958806116652591, 'samples': 12922752, 'steps': 67305, 'loss/train': 5.640976428985596} 11/07/2021 06:42:10 - INFO - __main__ - Step 67307: {'lr': 0.0002958753950428716, 'samples': 12922944, 'steps': 67306, 'loss/train': 1.1117682456970215} 11/07/2021 06:42:10 - INFO - __main__ - Step 67308: {'lr': 0.00029587017839981326, 'samples': 12923136, 'steps': 67307, 'loss/train': 1.626805305480957} 11/07/2021 06:42:11 - INFO - __main__ - Step 67309: {'lr': 0.0002958649617360866, 'samples': 12923328, 'steps': 67308, 'loss/train': 1.4719481468200684} 11/07/2021 06:42:12 - INFO - __main__ - Step 67310: {'lr': 0.0002958597450516939, 'samples': 12923520, 'steps': 67309, 'loss/train': 1.667314887046814} 11/07/2021 06:42:12 - INFO - __main__ - Step 67311: {'lr': 0.00029585452834663745, 'samples': 12923712, 'steps': 67310, 'loss/train': 1.0082054138183594} 11/07/2021 06:42:12 - INFO - __main__ - Step 67312: {'lr': 0.0002958493116209197, 'samples': 12923904, 'steps': 67311, 'loss/train': 2.1368331909179688} 11/07/2021 06:42:13 - INFO - __main__ - Step 67313: {'lr': 0.000295844094874543, 'samples': 12924096, 'steps': 67312, 'loss/train': 1.8352975845336914} 11/07/2021 06:42:14 - INFO - __main__ - Step 67314: {'lr': 0.0002958388781075096, 'samples': 12924288, 'steps': 67313, 'loss/train': 1.4177677631378174} 11/07/2021 06:42:14 - INFO - __main__ - Step 67315: {'lr': 0.00029583366131982194, 'samples': 12924480, 'steps': 67314, 'loss/train': 0.6808567047119141} 11/07/2021 06:42:14 - INFO - __main__ - Step 67316: {'lr': 0.0002958284445114823, 'samples': 12924672, 'steps': 67315, 'loss/train': 1.473549723625183} 11/07/2021 06:42:15 - INFO - __main__ - Step 67317: {'lr': 0.0002958232276824931, 'samples': 12924864, 'steps': 67316, 'loss/train': 1.5330766439437866} 11/07/2021 06:42:15 - INFO - __main__ - Step 67318: {'lr': 0.00029581801083285663, 'samples': 12925056, 'steps': 67317, 'loss/train': 1.312854290008545} 11/07/2021 06:42:15 - INFO - __main__ - Step 67319: {'lr': 0.00029581279396257527, 'samples': 12925248, 'steps': 67318, 'loss/train': 1.7543154954910278} 11/07/2021 06:42:16 - INFO - __main__ - Step 67320: {'lr': 0.00029580757707165146, 'samples': 12925440, 'steps': 67319, 'loss/train': 1.1033283472061157} 11/07/2021 06:42:17 - INFO - __main__ - Step 67321: {'lr': 0.00029580236016008737, 'samples': 12925632, 'steps': 67320, 'loss/train': 1.5729373693466187} 11/07/2021 06:42:17 - INFO - __main__ - Step 67322: {'lr': 0.0002957971432278855, 'samples': 12925824, 'steps': 67321, 'loss/train': 0.9444909691810608} 11/07/2021 06:42:17 - INFO - __main__ - Step 67323: {'lr': 0.0002957919262750481, 'samples': 12926016, 'steps': 67322, 'loss/train': 1.1255041360855103} 11/07/2021 06:42:18 - INFO - __main__ - Step 67324: {'lr': 0.0002957867093015775, 'samples': 12926208, 'steps': 67323, 'loss/train': 1.3447721004486084} 11/07/2021 06:42:19 - INFO - __main__ - Step 67325: {'lr': 0.0002957814923074762, 'samples': 12926400, 'steps': 67324, 'loss/train': 1.4823297262191772} 11/07/2021 06:42:19 - INFO - __main__ - Step 67326: {'lr': 0.00029577627529274653, 'samples': 12926592, 'steps': 67325, 'loss/train': 0.9225636124610901} 11/07/2021 06:42:20 - INFO - __main__ - Step 67327: {'lr': 0.0002957710582573907, 'samples': 12926784, 'steps': 67326, 'loss/train': 0.8526886105537415} 11/07/2021 06:42:20 - INFO - __main__ - Step 67328: {'lr': 0.0002957658412014111, 'samples': 12926976, 'steps': 67327, 'loss/train': 0.7508828639984131} 11/07/2021 06:42:20 - INFO - __main__ - Step 67329: {'lr': 0.0002957606241248102, 'samples': 12927168, 'steps': 67328, 'loss/train': 1.3756091594696045} 11/07/2021 06:42:21 - INFO - __main__ - Step 67330: {'lr': 0.0002957554070275902, 'samples': 12927360, 'steps': 67329, 'loss/train': 1.7690590620040894} 11/07/2021 06:42:22 - INFO - __main__ - Step 67331: {'lr': 0.00029575018990975356, 'samples': 12927552, 'steps': 67330, 'loss/train': 1.322121500968933} 11/07/2021 06:42:22 - INFO - __main__ - Step 67332: {'lr': 0.0002957449727713026, 'samples': 12927744, 'steps': 67331, 'loss/train': 1.2896018028259277} 11/07/2021 06:42:22 - INFO - __main__ - Step 67333: {'lr': 0.00029573975561223966, 'samples': 12927936, 'steps': 67332, 'loss/train': 1.6944011449813843} 11/07/2021 06:42:23 - INFO - __main__ - Step 67334: {'lr': 0.00029573453843256706, 'samples': 12928128, 'steps': 67333, 'loss/train': 0.9138181805610657} 11/07/2021 06:42:24 - INFO - __main__ - Step 67335: {'lr': 0.0002957293212322872, 'samples': 12928320, 'steps': 67334, 'loss/train': 1.5745400190353394} 11/07/2021 06:42:24 - INFO - __main__ - Step 67336: {'lr': 0.0002957241040114024, 'samples': 12928512, 'steps': 67335, 'loss/train': 1.4459081888198853} 11/07/2021 06:42:24 - INFO - __main__ - Step 67337: {'lr': 0.000295718886769915, 'samples': 12928704, 'steps': 67336, 'loss/train': 0.8469685912132263} 11/07/2021 06:42:25 - INFO - __main__ - Step 67338: {'lr': 0.0002957136695078274, 'samples': 12928896, 'steps': 67337, 'loss/train': 1.331802487373352} 11/07/2021 06:42:25 - INFO - __main__ - Step 67339: {'lr': 0.00029570845222514193, 'samples': 12929088, 'steps': 67338, 'loss/train': 1.4813008308410645} 11/07/2021 06:42:27 - INFO - __main__ - Step 67340: {'lr': 0.000295703234921861, 'samples': 12929280, 'steps': 67339, 'loss/train': 0.9031254053115845} 11/07/2021 06:42:27 - INFO - __main__ - Step 67341: {'lr': 0.0002956980175979868, 'samples': 12929472, 'steps': 67340, 'loss/train': 1.1657679080963135} 11/07/2021 06:42:27 - INFO - __main__ - Step 67342: {'lr': 0.00029569280025352183, 'samples': 12929664, 'steps': 67341, 'loss/train': 0.29837098717689514} 11/07/2021 06:42:28 - INFO - __main__ - Step 67343: {'lr': 0.0002956875828884684, 'samples': 12929856, 'steps': 67342, 'loss/train': 1.4493162631988525} 11/07/2021 06:42:28 - INFO - __main__ - Step 67344: {'lr': 0.00029568236550282876, 'samples': 12930048, 'steps': 67343, 'loss/train': 1.3838576078414917} 11/07/2021 06:42:29 - INFO - __main__ - Step 67345: {'lr': 0.0002956771480966055, 'samples': 12930240, 'steps': 67344, 'loss/train': 1.206146001815796} 11/07/2021 06:42:30 - INFO - __main__ - Step 67346: {'lr': 0.00029567193066980073, 'samples': 12930432, 'steps': 67345, 'loss/train': 1.3937331438064575} 11/07/2021 06:42:30 - INFO - __main__ - Step 67347: {'lr': 0.0002956667132224169, 'samples': 12930624, 'steps': 67346, 'loss/train': 1.358098030090332} 11/07/2021 06:42:30 - INFO - __main__ - Step 67348: {'lr': 0.0002956614957544563, 'samples': 12930816, 'steps': 67347, 'loss/train': 1.6929548978805542} 11/07/2021 06:42:31 - INFO - __main__ - Step 67349: {'lr': 0.00029565627826592147, 'samples': 12931008, 'steps': 67348, 'loss/train': 1.2317918539047241} 11/07/2021 06:42:32 - INFO - __main__ - Step 67350: {'lr': 0.00029565106075681453, 'samples': 12931200, 'steps': 67349, 'loss/train': 1.5595216751098633} 11/07/2021 06:42:32 - INFO - __main__ - Step 67351: {'lr': 0.00029564584322713794, 'samples': 12931392, 'steps': 67350, 'loss/train': 1.5252774953842163} 11/07/2021 06:42:32 - INFO - __main__ - Step 67352: {'lr': 0.00029564062567689404, 'samples': 12931584, 'steps': 67351, 'loss/train': 1.2886625528335571} 11/07/2021 06:42:33 - INFO - __main__ - Step 67353: {'lr': 0.0002956354081060852, 'samples': 12931776, 'steps': 67352, 'loss/train': 0.5774863958358765} 11/07/2021 06:42:33 - INFO - __main__ - Step 67354: {'lr': 0.0002956301905147137, 'samples': 12931968, 'steps': 67353, 'loss/train': 1.1435787677764893} 11/07/2021 06:42:33 - INFO - __main__ - Step 67355: {'lr': 0.00029562497290278197, 'samples': 12932160, 'steps': 67354, 'loss/train': 1.3419047594070435} 11/07/2021 06:42:34 - INFO - __main__ - Step 67356: {'lr': 0.0002956197552702924, 'samples': 12932352, 'steps': 67355, 'loss/train': 1.13185715675354} 11/07/2021 06:42:35 - INFO - __main__ - Step 67357: {'lr': 0.00029561453761724714, 'samples': 12932544, 'steps': 67356, 'loss/train': 1.004485011100769} 11/07/2021 06:42:35 - INFO - __main__ - Step 67358: {'lr': 0.00029560931994364873, 'samples': 12932736, 'steps': 67357, 'loss/train': 1.1058672666549683} 11/07/2021 06:42:35 - INFO - __main__ - Step 67359: {'lr': 0.00029560410224949954, 'samples': 12932928, 'steps': 67358, 'loss/train': 0.9884098172187805} 11/07/2021 06:42:36 - INFO - __main__ - Step 67360: {'lr': 0.00029559888453480174, 'samples': 12933120, 'steps': 67359, 'loss/train': 1.305698275566101} 11/07/2021 06:42:37 - INFO - __main__ - Step 67361: {'lr': 0.0002955936667995578, 'samples': 12933312, 'steps': 67360, 'loss/train': 1.221314549446106} 11/07/2021 06:42:37 - INFO - __main__ - Step 67362: {'lr': 0.00029558844904377016, 'samples': 12933504, 'steps': 67361, 'loss/train': 1.5256016254425049} 11/07/2021 06:42:37 - INFO - __main__ - Step 67363: {'lr': 0.0002955832312674409, 'samples': 12933696, 'steps': 67362, 'loss/train': 1.5568184852600098} 11/07/2021 06:42:38 - INFO - __main__ - Step 67364: {'lr': 0.00029557801347057265, 'samples': 12933888, 'steps': 67363, 'loss/train': 1.5593881607055664} 11/07/2021 06:42:38 - INFO - __main__ - Step 67365: {'lr': 0.0002955727956531676, 'samples': 12934080, 'steps': 67364, 'loss/train': 1.5833358764648438} 11/07/2021 06:42:40 - INFO - __main__ - Step 67366: {'lr': 0.00029556757781522817, 'samples': 12934272, 'steps': 67365, 'loss/train': 3.811782121658325} 11/07/2021 06:42:40 - INFO - __main__ - Step 67367: {'lr': 0.0002955623599567568, 'samples': 12934464, 'steps': 67366, 'loss/train': 2.035360336303711} 11/07/2021 06:42:40 - INFO - __main__ - Step 67368: {'lr': 0.0002955571420777556, 'samples': 12934656, 'steps': 67367, 'loss/train': 0.7842379212379456} 11/07/2021 06:42:41 - INFO - __main__ - Step 67369: {'lr': 0.0002955519241782271, 'samples': 12934848, 'steps': 67368, 'loss/train': 1.2278193235397339} 11/07/2021 06:42:41 - INFO - __main__ - Step 67370: {'lr': 0.00029554670625817357, 'samples': 12935040, 'steps': 67369, 'loss/train': 0.3931356966495514} 11/07/2021 06:42:42 - INFO - __main__ - Step 67371: {'lr': 0.0002955414883175974, 'samples': 12935232, 'steps': 67370, 'loss/train': 1.3525505065917969} 11/07/2021 06:42:42 - INFO - __main__ - Step 67372: {'lr': 0.00029553627035650096, 'samples': 12935424, 'steps': 67371, 'loss/train': 1.5124000310897827} 11/07/2021 06:42:43 - INFO - __main__ - Step 67373: {'lr': 0.00029553105237488663, 'samples': 12935616, 'steps': 67372, 'loss/train': 1.065120816230774} 11/07/2021 06:42:43 - INFO - __main__ - Step 67374: {'lr': 0.00029552583437275664, 'samples': 12935808, 'steps': 67373, 'loss/train': 0.9148622155189514} 11/07/2021 06:42:43 - INFO - __main__ - Step 67375: {'lr': 0.0002955206163501134, 'samples': 12936000, 'steps': 67374, 'loss/train': 1.490599513053894} 11/07/2021 06:42:44 - INFO - __main__ - Step 67376: {'lr': 0.00029551539830695935, 'samples': 12936192, 'steps': 67375, 'loss/train': 1.4792438745498657} 11/07/2021 06:42:45 - INFO - __main__ - Step 67377: {'lr': 0.00029551018024329666, 'samples': 12936384, 'steps': 67376, 'loss/train': 1.2318346500396729} 11/07/2021 06:42:45 - INFO - __main__ - Step 67378: {'lr': 0.00029550496215912785, 'samples': 12936576, 'steps': 67377, 'loss/train': 1.1884477138519287} 11/07/2021 06:42:46 - INFO - __main__ - Step 67379: {'lr': 0.0002954997440544552, 'samples': 12936768, 'steps': 67378, 'loss/train': 0.21065281331539154} 11/07/2021 06:42:46 - INFO - __main__ - Step 67380: {'lr': 0.0002954945259292811, 'samples': 12936960, 'steps': 67379, 'loss/train': 0.73481684923172} 11/07/2021 06:42:46 - INFO - __main__ - Step 67381: {'lr': 0.0002954893077836078, 'samples': 12937152, 'steps': 67380, 'loss/train': 0.9731177091598511} 11/07/2021 06:42:47 - INFO - __main__ - Step 67382: {'lr': 0.00029548408961743776, 'samples': 12937344, 'steps': 67381, 'loss/train': 1.7043912410736084} 11/07/2021 06:42:48 - INFO - __main__ - Step 67383: {'lr': 0.0002954788714307733, 'samples': 12937536, 'steps': 67382, 'loss/train': 1.8508274555206299} 11/07/2021 06:42:48 - INFO - __main__ - Step 67384: {'lr': 0.0002954736532236167, 'samples': 12937728, 'steps': 67383, 'loss/train': 0.9512622952461243} 11/07/2021 06:42:48 - INFO - __main__ - Step 67385: {'lr': 0.00029546843499597046, 'samples': 12937920, 'steps': 67384, 'loss/train': 1.5365334749221802} 11/07/2021 06:42:49 - INFO - __main__ - Step 67386: {'lr': 0.00029546321674783684, 'samples': 12938112, 'steps': 67385, 'loss/train': 1.279881477355957} 11/07/2021 06:42:50 - INFO - __main__ - Step 67387: {'lr': 0.0002954579984792182, 'samples': 12938304, 'steps': 67386, 'loss/train': 1.5805137157440186} 11/07/2021 06:42:50 - INFO - __main__ - Step 67388: {'lr': 0.0002954527801901168, 'samples': 12938496, 'steps': 67387, 'loss/train': 1.646398663520813} 11/07/2021 06:42:50 - INFO - __main__ - Step 67389: {'lr': 0.0002954475618805351, 'samples': 12938688, 'steps': 67388, 'loss/train': 1.5290743112564087} 11/07/2021 06:42:51 - INFO - __main__ - Step 67390: {'lr': 0.0002954423435504755, 'samples': 12938880, 'steps': 67389, 'loss/train': 1.4177465438842773} 11/07/2021 06:42:51 - INFO - __main__ - Step 67391: {'lr': 0.0002954371251999402, 'samples': 12939072, 'steps': 67390, 'loss/train': 1.0672544240951538} 11/07/2021 06:42:53 - INFO - __main__ - Step 67392: {'lr': 0.0002954319068289317, 'samples': 12939264, 'steps': 67391, 'loss/train': 0.38116517663002014} 11/07/2021 06:42:53 - INFO - __main__ - Step 67393: {'lr': 0.0002954266884374523, 'samples': 12939456, 'steps': 67392, 'loss/train': 1.194083333015442} 11/07/2021 06:42:53 - INFO - __main__ - Step 67394: {'lr': 0.0002954214700255043, 'samples': 12939648, 'steps': 67393, 'loss/train': 1.1810450553894043} 11/07/2021 06:42:54 - INFO - __main__ - Step 67395: {'lr': 0.00029541625159309006, 'samples': 12939840, 'steps': 67394, 'loss/train': 1.482737421989441} 11/07/2021 06:42:54 - INFO - __main__ - Step 67396: {'lr': 0.00029541103314021196, 'samples': 12940032, 'steps': 67395, 'loss/train': 1.1120525598526} 11/07/2021 06:42:55 - INFO - __main__ - Step 67397: {'lr': 0.0002954058146668723, 'samples': 12940224, 'steps': 67396, 'loss/train': 1.8000848293304443} 11/07/2021 06:42:56 - INFO - __main__ - Step 67398: {'lr': 0.00029540059617307355, 'samples': 12940416, 'steps': 67397, 'loss/train': 1.0670831203460693} 11/07/2021 06:42:56 - INFO - __main__ - Step 67399: {'lr': 0.000295395377658818, 'samples': 12940608, 'steps': 67398, 'loss/train': 1.3228427171707153} 11/07/2021 06:42:56 - INFO - __main__ - Step 67400: {'lr': 0.00029539015912410807, 'samples': 12940800, 'steps': 67399, 'loss/train': 1.8061679601669312} 11/07/2021 06:42:57 - INFO - __main__ - Step 67401: {'lr': 0.00029538494056894596, 'samples': 12940992, 'steps': 67400, 'loss/train': 1.118715524673462} 11/07/2021 06:42:57 - INFO - __main__ - Step 67402: {'lr': 0.000295379721993334, 'samples': 12941184, 'steps': 67401, 'loss/train': 1.3329960107803345} 11/07/2021 06:42:57 - INFO - __main__ - Step 67403: {'lr': 0.0002953745033972747, 'samples': 12941376, 'steps': 67402, 'loss/train': 1.133286476135254} 11/07/2021 06:42:58 - INFO - __main__ - Step 67404: {'lr': 0.0002953692847807704, 'samples': 12941568, 'steps': 67403, 'loss/train': 0.7876971960067749} 11/07/2021 06:42:59 - INFO - __main__ - Step 67405: {'lr': 0.0002953640661438234, 'samples': 12941760, 'steps': 67404, 'loss/train': 1.3462058305740356} 11/07/2021 06:42:59 - INFO - __main__ - Step 67406: {'lr': 0.00029535884748643597, 'samples': 12941952, 'steps': 67405, 'loss/train': 1.1049505472183228} 11/07/2021 06:42:59 - INFO - __main__ - Step 67407: {'lr': 0.00029535362880861064, 'samples': 12942144, 'steps': 67406, 'loss/train': 1.4122358560562134} 11/07/2021 06:43:00 - INFO - __main__ - Step 67408: {'lr': 0.0002953484101103496, 'samples': 12942336, 'steps': 67407, 'loss/train': 1.4089535474777222} 11/07/2021 06:43:01 - INFO - __main__ - Step 67409: {'lr': 0.0002953431913916553, 'samples': 12942528, 'steps': 67408, 'loss/train': 1.7333693504333496} 11/07/2021 06:43:01 - INFO - __main__ - Step 67410: {'lr': 0.00029533797265253003, 'samples': 12942720, 'steps': 67409, 'loss/train': 1.4364216327667236} 11/07/2021 06:43:01 - INFO - __main__ - Step 67411: {'lr': 0.00029533275389297613, 'samples': 12942912, 'steps': 67410, 'loss/train': 1.671748399734497} 11/07/2021 06:43:02 - INFO - __main__ - Step 67412: {'lr': 0.0002953275351129961, 'samples': 12943104, 'steps': 67411, 'loss/train': 1.5405405759811401} 11/07/2021 06:43:02 - INFO - __main__ - Step 67413: {'lr': 0.0002953223163125921, 'samples': 12943296, 'steps': 67412, 'loss/train': 1.4192684888839722} 11/07/2021 06:43:03 - INFO - __main__ - Step 67414: {'lr': 0.00029531709749176663, 'samples': 12943488, 'steps': 67413, 'loss/train': 1.3226354122161865} 11/07/2021 06:43:04 - INFO - __main__ - Step 67415: {'lr': 0.0002953118786505219, 'samples': 12943680, 'steps': 67414, 'loss/train': 0.5796250104904175} 11/07/2021 06:43:04 - INFO - __main__ - Step 67416: {'lr': 0.0002953066597888604, 'samples': 12943872, 'steps': 67415, 'loss/train': 1.548559546470642} 11/07/2021 06:43:04 - INFO - __main__ - Step 67417: {'lr': 0.0002953014409067844, 'samples': 12944064, 'steps': 67416, 'loss/train': 2.0174481868743896} 11/07/2021 06:43:05 - INFO - __main__ - Step 67418: {'lr': 0.0002952962220042962, 'samples': 12944256, 'steps': 67417, 'loss/train': 1.6063886880874634} 11/07/2021 06:43:06 - INFO - __main__ - Step 67419: {'lr': 0.0002952910030813983, 'samples': 12944448, 'steps': 67418, 'loss/train': 1.7090305089950562} 11/07/2021 06:43:06 - INFO - __main__ - Step 67420: {'lr': 0.000295285784138093, 'samples': 12944640, 'steps': 67419, 'loss/train': 1.1530746221542358} 11/07/2021 06:43:06 - INFO - __main__ - Step 67421: {'lr': 0.0002952805651743826, 'samples': 12944832, 'steps': 67420, 'loss/train': 1.5024399757385254} 11/07/2021 06:43:07 - INFO - __main__ - Step 67422: {'lr': 0.0002952753461902694, 'samples': 12945024, 'steps': 67421, 'loss/train': 0.9165745377540588} 11/07/2021 06:43:07 - INFO - __main__ - Step 67423: {'lr': 0.00029527012718575583, 'samples': 12945216, 'steps': 67422, 'loss/train': 1.663557529449463} 11/07/2021 06:43:08 - INFO - __main__ - Step 67424: {'lr': 0.00029526490816084427, 'samples': 12945408, 'steps': 67423, 'loss/train': 0.8265384435653687} 11/07/2021 06:43:08 - INFO - __main__ - Step 67425: {'lr': 0.00029525968911553707, 'samples': 12945600, 'steps': 67424, 'loss/train': 1.5440428256988525} 11/07/2021 06:43:09 - INFO - __main__ - Step 67426: {'lr': 0.00029525447004983657, 'samples': 12945792, 'steps': 67425, 'loss/train': 1.6390275955200195} 11/07/2021 06:43:09 - INFO - __main__ - Step 67427: {'lr': 0.0002952492509637451, 'samples': 12945984, 'steps': 67426, 'loss/train': 2.0134694576263428} 11/07/2021 06:43:09 - INFO - __main__ - Step 67428: {'lr': 0.000295244031857265, 'samples': 12946176, 'steps': 67427, 'loss/train': 1.4456619024276733} 11/07/2021 06:43:10 - INFO - __main__ - Step 67429: {'lr': 0.0002952388127303986, 'samples': 12946368, 'steps': 67428, 'loss/train': 1.0816978216171265} 11/07/2021 06:43:11 - INFO - __main__ - Step 67430: {'lr': 0.00029523359358314834, 'samples': 12946560, 'steps': 67429, 'loss/train': 1.5764751434326172} 11/07/2021 06:43:11 - INFO - __main__ - Step 67431: {'lr': 0.00029522837441551647, 'samples': 12946752, 'steps': 67430, 'loss/train': 1.6572699546813965} 11/07/2021 06:43:11 - INFO - __main__ - Step 67432: {'lr': 0.00029522315522750544, 'samples': 12946944, 'steps': 67431, 'loss/train': 1.368952751159668} 11/07/2021 06:43:12 - INFO - __main__ - Step 67433: {'lr': 0.0002952179360191175, 'samples': 12947136, 'steps': 67432, 'loss/train': 1.5698882341384888} 11/07/2021 06:43:12 - INFO - __main__ - Step 67434: {'lr': 0.00029521271679035514, 'samples': 12947328, 'steps': 67433, 'loss/train': 1.1715247631072998} 11/07/2021 06:43:13 - INFO - __main__ - Step 67435: {'lr': 0.00029520749754122054, 'samples': 12947520, 'steps': 67434, 'loss/train': 1.1618435382843018} 11/07/2021 06:43:14 - INFO - __main__ - Step 67436: {'lr': 0.0002952022782717162, 'samples': 12947712, 'steps': 67435, 'loss/train': 1.6000049114227295} 11/07/2021 06:43:14 - INFO - __main__ - Step 67437: {'lr': 0.0002951970589818444, 'samples': 12947904, 'steps': 67436, 'loss/train': 1.2519629001617432} 11/07/2021 06:43:14 - INFO - __main__ - Step 67438: {'lr': 0.00029519183967160746, 'samples': 12948096, 'steps': 67437, 'loss/train': 1.666115164756775} 11/07/2021 06:43:15 - INFO - __main__ - Step 67439: {'lr': 0.0002951866203410078, 'samples': 12948288, 'steps': 67438, 'loss/train': 1.4239304065704346} 11/07/2021 06:43:16 - INFO - __main__ - Step 67440: {'lr': 0.00029518140099004774, 'samples': 12948480, 'steps': 67439, 'loss/train': 1.405548334121704} 11/07/2021 06:43:16 - INFO - __main__ - Step 67441: {'lr': 0.00029517618161872973, 'samples': 12948672, 'steps': 67440, 'loss/train': 1.325614333152771} 11/07/2021 06:43:16 - INFO - __main__ - Step 67442: {'lr': 0.00029517096222705594, 'samples': 12948864, 'steps': 67441, 'loss/train': 1.170404076576233} 11/07/2021 06:43:17 - INFO - __main__ - Step 67443: {'lr': 0.00029516574281502884, 'samples': 12949056, 'steps': 67442, 'loss/train': 1.1908903121948242} 11/07/2021 06:43:17 - INFO - __main__ - Step 67444: {'lr': 0.0002951605233826507, 'samples': 12949248, 'steps': 67443, 'loss/train': 1.5878806114196777} 11/07/2021 06:43:18 - INFO - __main__ - Step 67445: {'lr': 0.00029515530392992394, 'samples': 12949440, 'steps': 67444, 'loss/train': 1.5288742780685425} 11/07/2021 06:43:18 - INFO - __main__ - Step 67446: {'lr': 0.00029515008445685096, 'samples': 12949632, 'steps': 67445, 'loss/train': 1.3055917024612427} 11/07/2021 06:43:19 - INFO - __main__ - Step 67447: {'lr': 0.000295144864963434, 'samples': 12949824, 'steps': 67446, 'loss/train': 1.6456325054168701} 11/07/2021 06:43:19 - INFO - __main__ - Step 67448: {'lr': 0.00029513964544967546, 'samples': 12950016, 'steps': 67447, 'loss/train': 1.3091834783554077} 11/07/2021 06:43:19 - INFO - __main__ - Step 67449: {'lr': 0.0002951344259155777, 'samples': 12950208, 'steps': 67448, 'loss/train': 1.2913283109664917} 11/07/2021 06:43:20 - INFO - __main__ - Step 67450: {'lr': 0.00029512920636114306, 'samples': 12950400, 'steps': 67449, 'loss/train': 1.213201880455017} 11/07/2021 06:43:21 - INFO - __main__ - Step 67451: {'lr': 0.00029512398678637386, 'samples': 12950592, 'steps': 67450, 'loss/train': 1.2953380346298218} 11/07/2021 06:43:21 - INFO - __main__ - Step 67452: {'lr': 0.0002951187671912725, 'samples': 12950784, 'steps': 67451, 'loss/train': 1.3883144855499268} 11/07/2021 06:43:21 - INFO - __main__ - Step 67453: {'lr': 0.00029511354757584134, 'samples': 12950976, 'steps': 67452, 'loss/train': 2.19447922706604} 11/07/2021 06:43:22 - INFO - __main__ - Step 67454: {'lr': 0.0002951083279400828, 'samples': 12951168, 'steps': 67453, 'loss/train': 1.3947129249572754} 11/07/2021 06:43:23 - INFO - __main__ - Step 67455: {'lr': 0.0002951031082839991, 'samples': 12951360, 'steps': 67454, 'loss/train': 1.586583137512207} 11/07/2021 06:43:23 - INFO - __main__ - Step 67456: {'lr': 0.0002950978886075926, 'samples': 12951552, 'steps': 67455, 'loss/train': 1.398451805114746} 11/07/2021 06:43:24 - INFO - __main__ - Step 67457: {'lr': 0.0002950926689108656, 'samples': 12951744, 'steps': 67456, 'loss/train': 1.2047621011734009} 11/07/2021 06:43:24 - INFO - __main__ - Step 67458: {'lr': 0.0002950874491938206, 'samples': 12951936, 'steps': 67457, 'loss/train': 2.0279834270477295} 11/07/2021 06:43:24 - INFO - __main__ - Step 67459: {'lr': 0.00029508222945645997, 'samples': 12952128, 'steps': 67458, 'loss/train': 1.1615384817123413} 11/07/2021 06:43:25 - INFO - __main__ - Step 67460: {'lr': 0.00029507700969878586, 'samples': 12952320, 'steps': 67459, 'loss/train': 1.1559696197509766} 11/07/2021 06:43:26 - INFO - __main__ - Step 67461: {'lr': 0.00029507178992080086, 'samples': 12952512, 'steps': 67460, 'loss/train': 0.9605089426040649} 11/07/2021 06:43:26 - INFO - __main__ - Step 67462: {'lr': 0.00029506657012250717, 'samples': 12952704, 'steps': 67461, 'loss/train': 1.2496997117996216} 11/07/2021 06:43:26 - INFO - __main__ - Step 67463: {'lr': 0.0002950613503039072, 'samples': 12952896, 'steps': 67462, 'loss/train': 1.1750344038009644} 11/07/2021 06:43:27 - INFO - __main__ - Step 67464: {'lr': 0.00029505613046500325, 'samples': 12953088, 'steps': 67463, 'loss/train': 1.6086345911026} 11/07/2021 06:43:28 - INFO - __main__ - Step 67465: {'lr': 0.0002950509106057976, 'samples': 12953280, 'steps': 67464, 'loss/train': 1.174880862236023} 11/07/2021 06:43:28 - INFO - __main__ - Step 67466: {'lr': 0.00029504569072629286, 'samples': 12953472, 'steps': 67465, 'loss/train': 1.4847548007965088} 11/07/2021 06:43:28 - INFO - __main__ - Step 67467: {'lr': 0.00029504047082649123, 'samples': 12953664, 'steps': 67466, 'loss/train': 0.8160943984985352} 11/07/2021 06:43:29 - INFO - __main__ - Step 67468: {'lr': 0.00029503525090639497, 'samples': 12953856, 'steps': 67467, 'loss/train': 1.5392935276031494} 11/07/2021 06:43:29 - INFO - __main__ - Step 67469: {'lr': 0.00029503003096600656, 'samples': 12954048, 'steps': 67468, 'loss/train': 1.7581582069396973} 11/07/2021 06:43:30 - INFO - __main__ - Step 67470: {'lr': 0.0002950248110053283, 'samples': 12954240, 'steps': 67469, 'loss/train': 1.4514210224151611} 11/07/2021 06:43:30 - INFO - __main__ - Step 67471: {'lr': 0.0002950195910243625, 'samples': 12954432, 'steps': 67470, 'loss/train': 0.8634672164916992} 11/07/2021 06:43:31 - INFO - __main__ - Step 67472: {'lr': 0.00029501437102311167, 'samples': 12954624, 'steps': 67471, 'loss/train': 1.3448046445846558} 11/07/2021 06:43:31 - INFO - __main__ - Step 67473: {'lr': 0.000295009151001578, 'samples': 12954816, 'steps': 67472, 'loss/train': 1.7162307500839233} 11/07/2021 06:43:32 - INFO - __main__ - Step 67474: {'lr': 0.000295003930959764, 'samples': 12955008, 'steps': 67473, 'loss/train': 1.3113059997558594} 11/07/2021 06:43:32 - INFO - __main__ - Step 67475: {'lr': 0.0002949987108976718, 'samples': 12955200, 'steps': 67474, 'loss/train': 1.2126057147979736} 11/07/2021 06:43:33 - INFO - __main__ - Step 67476: {'lr': 0.0002949934908153039, 'samples': 12955392, 'steps': 67475, 'loss/train': 1.4358679056167603} 11/07/2021 06:43:33 - INFO - __main__ - Step 67477: {'lr': 0.00029498827071266267, 'samples': 12955584, 'steps': 67476, 'loss/train': 1.4136987924575806} 11/07/2021 06:43:34 - INFO - __main__ - Step 67478: {'lr': 0.0002949830505897504, 'samples': 12955776, 'steps': 67477, 'loss/train': 1.2723720073699951} 11/07/2021 06:43:34 - INFO - __main__ - Step 67479: {'lr': 0.0002949778304465694, 'samples': 12955968, 'steps': 67478, 'loss/train': 1.5279046297073364} 11/07/2021 06:43:34 - INFO - __main__ - Step 67480: {'lr': 0.00029497261028312217, 'samples': 12956160, 'steps': 67479, 'loss/train': 1.0189787149429321} 11/07/2021 06:43:35 - INFO - __main__ - Step 67481: {'lr': 0.000294967390099411, 'samples': 12956352, 'steps': 67480, 'loss/train': 1.152927279472351} 11/07/2021 06:43:36 - INFO - __main__ - Step 67482: {'lr': 0.0002949621698954381, 'samples': 12956544, 'steps': 67481, 'loss/train': 1.5267388820648193} 11/07/2021 06:43:36 - INFO - __main__ - Step 67483: {'lr': 0.000294956949671206, 'samples': 12956736, 'steps': 67482, 'loss/train': 1.3274261951446533} 11/07/2021 06:43:36 - INFO - __main__ - Step 67484: {'lr': 0.000294951729426717, 'samples': 12956928, 'steps': 67483, 'loss/train': 0.3306242525577545} 11/07/2021 06:43:37 - INFO - __main__ - Step 67485: {'lr': 0.00029494650916197347, 'samples': 12957120, 'steps': 67484, 'loss/train': 1.780938744544983} 11/07/2021 06:43:38 - INFO - __main__ - Step 67486: {'lr': 0.0002949412888769777, 'samples': 12957312, 'steps': 67485, 'loss/train': 1.0722769498825073} 11/07/2021 06:43:38 - INFO - __main__ - Step 67487: {'lr': 0.0002949360685717321, 'samples': 12957504, 'steps': 67486, 'loss/train': 1.5581068992614746} 11/07/2021 06:43:39 - INFO - __main__ - Step 67488: {'lr': 0.0002949308482462389, 'samples': 12957696, 'steps': 67487, 'loss/train': 1.300032615661621} 11/07/2021 06:43:39 - INFO - __main__ - Step 67489: {'lr': 0.0002949256279005007, 'samples': 12957888, 'steps': 67488, 'loss/train': 1.2360568046569824} 11/07/2021 06:43:39 - INFO - __main__ - Step 67490: {'lr': 0.00029492040753451964, 'samples': 12958080, 'steps': 67489, 'loss/train': 1.4948294162750244} 11/07/2021 06:43:40 - INFO - __main__ - Step 67491: {'lr': 0.0002949151871482982, 'samples': 12958272, 'steps': 67490, 'loss/train': 1.6379096508026123} 11/07/2021 06:43:41 - INFO - __main__ - Step 67492: {'lr': 0.0002949099667418386, 'samples': 12958464, 'steps': 67491, 'loss/train': 1.61167573928833} 11/07/2021 06:43:41 - INFO - __main__ - Step 67493: {'lr': 0.0002949047463151432, 'samples': 12958656, 'steps': 67492, 'loss/train': 1.6648660898208618} 11/07/2021 06:43:41 - INFO - __main__ - Step 67494: {'lr': 0.0002948995258682145, 'samples': 12958848, 'steps': 67493, 'loss/train': 1.7950859069824219} 11/07/2021 06:43:42 - INFO - __main__ - Step 67495: {'lr': 0.0002948943054010548, 'samples': 12959040, 'steps': 67494, 'loss/train': 1.2048146724700928} 11/07/2021 06:43:42 - INFO - __main__ - Step 67496: {'lr': 0.0002948890849136664, 'samples': 12959232, 'steps': 67495, 'loss/train': 1.3873467445373535} 11/07/2021 06:43:43 - INFO - __main__ - Step 67497: {'lr': 0.00029488386440605164, 'samples': 12959424, 'steps': 67496, 'loss/train': 1.3942391872406006} 11/07/2021 06:43:43 - INFO - __main__ - Step 67498: {'lr': 0.0002948786438782129, 'samples': 12959616, 'steps': 67497, 'loss/train': 1.5589406490325928} 11/07/2021 06:43:44 - INFO - __main__ - Step 67499: {'lr': 0.00029487342333015253, 'samples': 12959808, 'steps': 67498, 'loss/train': 0.9639248847961426} 11/07/2021 06:43:44 - INFO - __main__ - Step 67500: {'lr': 0.0002948682027618729, 'samples': 12960000, 'steps': 67499, 'loss/train': 1.773863673210144} 11/07/2021 06:43:44 - INFO - __main__ - Step 67501: {'lr': 0.0002948629821733764, 'samples': 12960192, 'steps': 67500, 'loss/train': 2.0050768852233887} 11/07/2021 06:43:45 - INFO - __main__ - Step 67502: {'lr': 0.00029485776156466527, 'samples': 12960384, 'steps': 67501, 'loss/train': 1.5918501615524292} 11/07/2021 06:43:46 - INFO - __main__ - Step 67503: {'lr': 0.0002948525409357419, 'samples': 12960576, 'steps': 67502, 'loss/train': 1.3485249280929565} 11/07/2021 06:43:46 - INFO - __main__ - Step 67504: {'lr': 0.0002948473202866087, 'samples': 12960768, 'steps': 67503, 'loss/train': 1.337234616279602} 11/07/2021 06:43:47 - INFO - __main__ - Step 67505: {'lr': 0.000294842099617268, 'samples': 12960960, 'steps': 67504, 'loss/train': 1.617037296295166} 11/07/2021 06:43:47 - INFO - __main__ - Step 67506: {'lr': 0.00029483687892772214, 'samples': 12961152, 'steps': 67505, 'loss/train': 1.3751220703125} 11/07/2021 06:43:48 - INFO - __main__ - Step 67507: {'lr': 0.0002948316582179734, 'samples': 12961344, 'steps': 67506, 'loss/train': 1.131048321723938} 11/07/2021 06:43:48 - INFO - __main__ - Step 67508: {'lr': 0.00029482643748802436, 'samples': 12961536, 'steps': 67507, 'loss/train': 2.0153675079345703} 11/07/2021 06:43:49 - INFO - __main__ - Step 67509: {'lr': 0.00029482121673787717, 'samples': 12961728, 'steps': 67508, 'loss/train': 1.0668286085128784} 11/07/2021 06:43:49 - INFO - __main__ - Step 67510: {'lr': 0.00029481599596753417, 'samples': 12961920, 'steps': 67509, 'loss/train': 1.2364434003829956} 11/07/2021 06:43:49 - INFO - __main__ - Step 67511: {'lr': 0.0002948107751769978, 'samples': 12962112, 'steps': 67510, 'loss/train': 1.7720775604248047} 11/07/2021 06:43:50 - INFO - __main__ - Step 67512: {'lr': 0.00029480555436627037, 'samples': 12962304, 'steps': 67511, 'loss/train': 1.0474278926849365} 11/07/2021 06:43:51 - INFO - __main__ - Step 67513: {'lr': 0.00029480033353535424, 'samples': 12962496, 'steps': 67512, 'loss/train': 1.4412171840667725} 11/07/2021 06:43:51 - INFO - __main__ - Step 67514: {'lr': 0.00029479511268425183, 'samples': 12962688, 'steps': 67513, 'loss/train': 1.2462096214294434} 11/07/2021 06:43:52 - INFO - __main__ - Step 67515: {'lr': 0.0002947898918129654, 'samples': 12962880, 'steps': 67514, 'loss/train': 1.4359855651855469} 11/07/2021 06:43:52 - INFO - __main__ - Step 67516: {'lr': 0.00029478467092149737, 'samples': 12963072, 'steps': 67515, 'loss/train': 1.366926908493042} 11/07/2021 06:43:52 - INFO - __main__ - Step 67517: {'lr': 0.00029477945000984997, 'samples': 12963264, 'steps': 67516, 'loss/train': 1.2104990482330322} 11/07/2021 06:43:53 - INFO - __main__ - Step 67518: {'lr': 0.0002947742290780257, 'samples': 12963456, 'steps': 67517, 'loss/train': 1.3257981538772583} 11/07/2021 06:43:54 - INFO - __main__ - Step 67519: {'lr': 0.0002947690081260269, 'samples': 12963648, 'steps': 67518, 'loss/train': 1.793552041053772} 11/07/2021 06:43:54 - INFO - __main__ - Step 67520: {'lr': 0.0002947637871538558, 'samples': 12963840, 'steps': 67519, 'loss/train': 1.9090970754623413} 11/07/2021 06:43:54 - INFO - __main__ - Step 67521: {'lr': 0.00029475856616151486, 'samples': 12964032, 'steps': 67520, 'loss/train': 1.4569811820983887} 11/07/2021 06:43:55 - INFO - __main__ - Step 67522: {'lr': 0.00029475334514900636, 'samples': 12964224, 'steps': 67521, 'loss/train': 1.4351983070373535} 11/07/2021 06:43:56 - INFO - __main__ - Step 67523: {'lr': 0.0002947481241163327, 'samples': 12964416, 'steps': 67522, 'loss/train': 1.5374178886413574} 11/07/2021 06:43:56 - INFO - __main__ - Step 67524: {'lr': 0.0002947429030634963, 'samples': 12964608, 'steps': 67523, 'loss/train': 1.3977113962173462} 11/07/2021 06:43:56 - INFO - __main__ - Step 67525: {'lr': 0.0002947376819904994, 'samples': 12964800, 'steps': 67524, 'loss/train': 1.4908876419067383} 11/07/2021 06:43:57 - INFO - __main__ - Step 67526: {'lr': 0.00029473246089734435, 'samples': 12964992, 'steps': 67525, 'loss/train': 1.3118125200271606} 11/07/2021 06:43:57 - INFO - __main__ - Step 67527: {'lr': 0.00029472723978403356, 'samples': 12965184, 'steps': 67526, 'loss/train': 1.3787897825241089} 11/07/2021 06:43:57 - INFO - __main__ - Step 67528: {'lr': 0.0002947220186505694, 'samples': 12965376, 'steps': 67527, 'loss/train': 0.6916679739952087} 11/07/2021 06:43:58 - INFO - __main__ - Step 67529: {'lr': 0.0002947167974969542, 'samples': 12965568, 'steps': 67528, 'loss/train': 1.251089096069336} 11/07/2021 06:43:59 - INFO - __main__ - Step 67530: {'lr': 0.00029471157632319025, 'samples': 12965760, 'steps': 67529, 'loss/train': 1.1353669166564941} 11/07/2021 06:43:59 - INFO - __main__ - Step 67531: {'lr': 0.00029470635512928, 'samples': 12965952, 'steps': 67530, 'loss/train': 0.9455889463424683} 11/07/2021 06:44:00 - INFO - __main__ - Step 67532: {'lr': 0.00029470113391522567, 'samples': 12966144, 'steps': 67531, 'loss/train': 1.5763945579528809} 11/07/2021 06:44:00 - INFO - __main__ - Step 67533: {'lr': 0.0002946959126810298, 'samples': 12966336, 'steps': 67532, 'loss/train': 0.8402243256568909} 11/07/2021 06:44:01 - INFO - __main__ - Step 67534: {'lr': 0.00029469069142669456, 'samples': 12966528, 'steps': 67533, 'loss/train': 1.7858144044876099} 11/07/2021 06:44:01 - INFO - __main__ - Step 67535: {'lr': 0.0002946854701522225, 'samples': 12966720, 'steps': 67534, 'loss/train': 1.9126217365264893} 11/07/2021 06:44:02 - INFO - __main__ - Step 67536: {'lr': 0.00029468024885761574, 'samples': 12966912, 'steps': 67535, 'loss/train': 1.3904287815093994} 11/07/2021 06:44:02 - INFO - __main__ - Step 67537: {'lr': 0.00029467502754287677, 'samples': 12967104, 'steps': 67536, 'loss/train': 1.8435444831848145} 11/07/2021 06:44:02 - INFO - __main__ - Step 67538: {'lr': 0.00029466980620800797, 'samples': 12967296, 'steps': 67537, 'loss/train': 1.2992589473724365} 11/07/2021 06:44:03 - INFO - __main__ - Step 67539: {'lr': 0.0002946645848530116, 'samples': 12967488, 'steps': 67538, 'loss/train': 1.3533433675765991} 11/07/2021 06:44:04 - INFO - __main__ - Step 67540: {'lr': 0.00029465936347789005, 'samples': 12967680, 'steps': 67539, 'loss/train': 1.35379958152771} 11/07/2021 06:44:04 - INFO - __main__ - Step 67541: {'lr': 0.00029465414208264577, 'samples': 12967872, 'steps': 67540, 'loss/train': 0.7499917149543762} 11/07/2021 06:44:04 - INFO - __main__ - Step 67542: {'lr': 0.0002946489206672809, 'samples': 12968064, 'steps': 67541, 'loss/train': 1.060860276222229} 11/07/2021 06:44:05 - INFO - __main__ - Step 67543: {'lr': 0.00029464369923179804, 'samples': 12968256, 'steps': 67542, 'loss/train': 0.8851248025894165} 11/07/2021 06:44:05 - INFO - __main__ - Step 67544: {'lr': 0.00029463847777619936, 'samples': 12968448, 'steps': 67543, 'loss/train': 1.375417947769165} 11/07/2021 06:44:06 - INFO - __main__ - Step 67545: {'lr': 0.0002946332563004872, 'samples': 12968640, 'steps': 67544, 'loss/train': 1.195459008216858} 11/07/2021 06:44:07 - INFO - __main__ - Step 67546: {'lr': 0.00029462803480466405, 'samples': 12968832, 'steps': 67545, 'loss/train': 1.2593134641647339} 11/07/2021 06:44:07 - INFO - __main__ - Step 67547: {'lr': 0.0002946228132887322, 'samples': 12969024, 'steps': 67546, 'loss/train': 1.580300211906433} 11/07/2021 06:44:07 - INFO - __main__ - Step 67548: {'lr': 0.00029461759175269405, 'samples': 12969216, 'steps': 67547, 'loss/train': 1.8604092597961426} 11/07/2021 06:44:08 - INFO - __main__ - Step 67549: {'lr': 0.0002946123701965518, 'samples': 12969408, 'steps': 67548, 'loss/train': 1.474204421043396} 11/07/2021 06:44:09 - INFO - __main__ - Step 67550: {'lr': 0.0002946071486203079, 'samples': 12969600, 'steps': 67549, 'loss/train': 1.8708479404449463} 11/07/2021 06:44:09 - INFO - __main__ - Step 67551: {'lr': 0.0002946019270239648, 'samples': 12969792, 'steps': 67550, 'loss/train': 1.411010503768921} 11/07/2021 06:44:09 - INFO - __main__ - Step 67552: {'lr': 0.0002945967054075247, 'samples': 12969984, 'steps': 67551, 'loss/train': 1.180923581123352} 11/07/2021 06:44:10 - INFO - __main__ - Step 67553: {'lr': 0.00029459148377099, 'samples': 12970176, 'steps': 67552, 'loss/train': 1.0919218063354492} 11/07/2021 06:44:10 - INFO - __main__ - Step 67554: {'lr': 0.0002945862621143631, 'samples': 12970368, 'steps': 67553, 'loss/train': 1.6161205768585205} 11/07/2021 06:44:11 - INFO - __main__ - Step 67555: {'lr': 0.0002945810404376463, 'samples': 12970560, 'steps': 67554, 'loss/train': 1.054028868675232} 11/07/2021 06:44:12 - INFO - __main__ - Step 67556: {'lr': 0.000294575818740842, 'samples': 12970752, 'steps': 67555, 'loss/train': 0.746548593044281} 11/07/2021 06:44:12 - INFO - __main__ - Step 67557: {'lr': 0.0002945705970239525, 'samples': 12970944, 'steps': 67556, 'loss/train': 1.2051230669021606} 11/07/2021 06:44:12 - INFO - __main__ - Step 67558: {'lr': 0.0002945653752869802, 'samples': 12971136, 'steps': 67557, 'loss/train': 1.5524015426635742} 11/07/2021 06:44:13 - INFO - __main__ - Step 67559: {'lr': 0.0002945601535299274, 'samples': 12971328, 'steps': 67558, 'loss/train': 1.4036673307418823} 11/07/2021 06:44:14 - INFO - __main__ - Step 67560: {'lr': 0.0002945549317527965, 'samples': 12971520, 'steps': 67559, 'loss/train': 1.1313481330871582} 11/07/2021 06:44:14 - INFO - __main__ - Step 67561: {'lr': 0.0002945497099555898, 'samples': 12971712, 'steps': 67560, 'loss/train': 1.3439197540283203} 11/07/2021 06:44:14 - INFO - __main__ - Step 67562: {'lr': 0.00029454448813830977, 'samples': 12971904, 'steps': 67561, 'loss/train': 0.6227300763130188} 11/07/2021 06:44:15 - INFO - __main__ - Step 67563: {'lr': 0.0002945392663009586, 'samples': 12972096, 'steps': 67562, 'loss/train': 1.5734262466430664} 11/07/2021 06:44:15 - INFO - __main__ - Step 67564: {'lr': 0.00029453404444353874, 'samples': 12972288, 'steps': 67563, 'loss/train': 1.4377262592315674} 11/07/2021 06:44:16 - INFO - __main__ - Step 67565: {'lr': 0.0002945288225660525, 'samples': 12972480, 'steps': 67564, 'loss/train': 0.8138275146484375} 11/07/2021 06:44:16 - INFO - __main__ - Step 67566: {'lr': 0.00029452360066850234, 'samples': 12972672, 'steps': 67565, 'loss/train': 1.3299431800842285} 11/07/2021 06:44:17 - INFO - __main__ - Step 67567: {'lr': 0.0002945183787508905, 'samples': 12972864, 'steps': 67566, 'loss/train': 1.5751683712005615} 11/07/2021 06:44:17 - INFO - __main__ - Step 67568: {'lr': 0.0002945131568132194, 'samples': 12973056, 'steps': 67567, 'loss/train': 0.6837634444236755} 11/07/2021 06:44:18 - INFO - __main__ - Step 67569: {'lr': 0.00029450793485549125, 'samples': 12973248, 'steps': 67568, 'loss/train': 1.444915533065796} 11/07/2021 06:44:18 - INFO - __main__ - Step 67570: {'lr': 0.00029450271287770856, 'samples': 12973440, 'steps': 67569, 'loss/train': 1.0432664155960083} 11/07/2021 06:44:19 - INFO - __main__ - Step 67571: {'lr': 0.0002944974908798737, 'samples': 12973632, 'steps': 67570, 'loss/train': 1.4782037734985352} 11/07/2021 06:44:19 - INFO - __main__ - Step 67572: {'lr': 0.00029449226886198886, 'samples': 12973824, 'steps': 67571, 'loss/train': 1.4105771780014038} 11/07/2021 06:44:20 - INFO - __main__ - Step 67573: {'lr': 0.0002944870468240566, 'samples': 12974016, 'steps': 67572, 'loss/train': 1.306642770767212} 11/07/2021 06:44:20 - INFO - __main__ - Step 67574: {'lr': 0.00029448182476607903, 'samples': 12974208, 'steps': 67573, 'loss/train': 1.3808480501174927} 11/07/2021 06:44:20 - INFO - __main__ - Step 67575: {'lr': 0.00029447660268805875, 'samples': 12974400, 'steps': 67574, 'loss/train': 1.422202706336975} 11/07/2021 06:44:21 - INFO - __main__ - Step 67576: {'lr': 0.000294471380589998, 'samples': 12974592, 'steps': 67575, 'loss/train': 1.3623710870742798} 11/07/2021 06:44:22 - INFO - __main__ - Step 67577: {'lr': 0.0002944661584718991, 'samples': 12974784, 'steps': 67576, 'loss/train': 1.3710355758666992} 11/07/2021 06:44:22 - INFO - __main__ - Step 67578: {'lr': 0.00029446093633376434, 'samples': 12974976, 'steps': 67577, 'loss/train': 1.6406587362289429} 11/07/2021 06:44:22 - INFO - __main__ - Step 67579: {'lr': 0.00029445571417559626, 'samples': 12975168, 'steps': 67578, 'loss/train': 1.4526121616363525} 11/07/2021 06:44:23 - INFO - __main__ - Step 67580: {'lr': 0.0002944504919973971, 'samples': 12975360, 'steps': 67579, 'loss/train': 1.3253157138824463} 11/07/2021 06:44:24 - INFO - __main__ - Step 67581: {'lr': 0.00029444526979916923, 'samples': 12975552, 'steps': 67580, 'loss/train': 0.9096948504447937} 11/07/2021 06:44:24 - INFO - __main__ - Step 67582: {'lr': 0.0002944400475809151, 'samples': 12975744, 'steps': 67581, 'loss/train': 1.6154801845550537} 11/07/2021 06:44:25 - INFO - __main__ - Step 67583: {'lr': 0.0002944348253426369, 'samples': 12975936, 'steps': 67582, 'loss/train': 2.060927391052246} 11/07/2021 06:44:25 - INFO - __main__ - Step 67584: {'lr': 0.00029442960308433705, 'samples': 12976128, 'steps': 67583, 'loss/train': 1.4453072547912598} 11/07/2021 06:44:25 - INFO - __main__ - Step 67585: {'lr': 0.00029442438080601785, 'samples': 12976320, 'steps': 67584, 'loss/train': 1.1835463047027588} 11/07/2021 06:44:26 - INFO - __main__ - Step 67586: {'lr': 0.0002944191585076817, 'samples': 12976512, 'steps': 67585, 'loss/train': 1.1700879335403442} 11/07/2021 06:44:27 - INFO - __main__ - Step 67587: {'lr': 0.0002944139361893311, 'samples': 12976704, 'steps': 67586, 'loss/train': 1.6819411516189575} 11/07/2021 06:44:27 - INFO - __main__ - Step 67588: {'lr': 0.0002944087138509682, 'samples': 12976896, 'steps': 67587, 'loss/train': 1.4356629848480225} 11/07/2021 06:44:27 - INFO - __main__ - Step 67589: {'lr': 0.0002944034914925954, 'samples': 12977088, 'steps': 67588, 'loss/train': 1.4323147535324097} 11/07/2021 06:44:28 - INFO - __main__ - Step 67590: {'lr': 0.0002943982691142151, 'samples': 12977280, 'steps': 67589, 'loss/train': 0.882689356803894} 11/07/2021 06:44:29 - INFO - __main__ - Step 67591: {'lr': 0.0002943930467158296, 'samples': 12977472, 'steps': 67590, 'loss/train': 1.3341749906539917} 11/07/2021 06:44:29 - INFO - __main__ - Step 67592: {'lr': 0.00029438782429744124, 'samples': 12977664, 'steps': 67591, 'loss/train': 0.9791046380996704} 11/07/2021 06:44:29 - INFO - __main__ - Step 67593: {'lr': 0.00029438260185905255, 'samples': 12977856, 'steps': 67592, 'loss/train': 1.1314771175384521} 11/07/2021 06:44:30 - INFO - __main__ - Step 67594: {'lr': 0.00029437737940066563, 'samples': 12978048, 'steps': 67593, 'loss/train': 1.546276330947876} 11/07/2021 06:44:30 - INFO - __main__ - Step 67595: {'lr': 0.000294372156922283, 'samples': 12978240, 'steps': 67594, 'loss/train': 1.3178824186325073} 11/07/2021 06:44:30 - INFO - __main__ - Step 67596: {'lr': 0.0002943669344239069, 'samples': 12978432, 'steps': 67595, 'loss/train': 1.6986409425735474} 11/07/2021 06:44:31 - INFO - __main__ - Step 67597: {'lr': 0.00029436171190553976, 'samples': 12978624, 'steps': 67596, 'loss/train': 1.7780816555023193} 11/07/2021 06:44:32 - INFO - __main__ - Step 67598: {'lr': 0.00029435648936718394, 'samples': 12978816, 'steps': 67597, 'loss/train': 1.0091850757598877} 11/07/2021 06:44:32 - INFO - __main__ - Step 67599: {'lr': 0.0002943512668088417, 'samples': 12979008, 'steps': 67598, 'loss/train': 1.7304810285568237} 11/07/2021 06:44:33 - INFO - __main__ - Step 67600: {'lr': 0.0002943460442305156, 'samples': 12979200, 'steps': 67599, 'loss/train': 1.5329622030258179} 11/07/2021 06:44:33 - INFO - __main__ - Step 67601: {'lr': 0.0002943408216322077, 'samples': 12979392, 'steps': 67600, 'loss/train': 1.3480570316314697} 11/07/2021 06:44:34 - INFO - __main__ - Step 67602: {'lr': 0.00029433559901392067, 'samples': 12979584, 'steps': 67601, 'loss/train': 1.2191061973571777} 11/07/2021 06:44:34 - INFO - __main__ - Step 67603: {'lr': 0.00029433037637565664, 'samples': 12979776, 'steps': 67602, 'loss/train': 1.6961568593978882} 11/07/2021 06:44:35 - INFO - __main__ - Step 67604: {'lr': 0.000294325153717418, 'samples': 12979968, 'steps': 67603, 'loss/train': 1.380624532699585} 11/07/2021 06:44:35 - INFO - __main__ - Step 67605: {'lr': 0.00029431993103920713, 'samples': 12980160, 'steps': 67604, 'loss/train': 1.4717713594436646} 11/07/2021 06:44:35 - INFO - __main__ - Step 67606: {'lr': 0.00029431470834102635, 'samples': 12980352, 'steps': 67605, 'loss/train': 1.2092136144638062} 11/07/2021 06:44:36 - INFO - __main__ - Step 67607: {'lr': 0.00029430948562287815, 'samples': 12980544, 'steps': 67606, 'loss/train': 1.1289968490600586} 11/07/2021 06:44:37 - INFO - __main__ - Step 67608: {'lr': 0.00029430426288476464, 'samples': 12980736, 'steps': 67607, 'loss/train': 1.837908387184143} 11/07/2021 06:44:37 - INFO - __main__ - Step 67609: {'lr': 0.00029429904012668847, 'samples': 12980928, 'steps': 67608, 'loss/train': 1.199660062789917} 11/07/2021 06:44:37 - INFO - __main__ - Step 67610: {'lr': 0.00029429381734865176, 'samples': 12981120, 'steps': 67609, 'loss/train': 0.9489189982414246} 11/07/2021 06:44:38 - INFO - __main__ - Step 67611: {'lr': 0.00029428859455065694, 'samples': 12981312, 'steps': 67610, 'loss/train': 2.4099719524383545} 11/07/2021 06:44:39 - INFO - __main__ - Step 67612: {'lr': 0.00029428337173270636, 'samples': 12981504, 'steps': 67611, 'loss/train': 1.1757968664169312} 11/07/2021 06:44:39 - INFO - __main__ - Step 67613: {'lr': 0.0002942781488948024, 'samples': 12981696, 'steps': 67612, 'loss/train': 1.3482118844985962} 11/07/2021 06:44:39 - INFO - __main__ - Step 67614: {'lr': 0.0002942729260369473, 'samples': 12981888, 'steps': 67613, 'loss/train': 1.559475302696228} 11/07/2021 06:44:40 - INFO - __main__ - Step 67615: {'lr': 0.0002942677031591436, 'samples': 12982080, 'steps': 67614, 'loss/train': 1.265631079673767} 11/07/2021 06:44:40 - INFO - __main__ - Step 67616: {'lr': 0.00029426248026139353, 'samples': 12982272, 'steps': 67615, 'loss/train': 1.4808224439620972} 11/07/2021 06:44:41 - INFO - __main__ - Step 67617: {'lr': 0.00029425725734369944, 'samples': 12982464, 'steps': 67616, 'loss/train': 1.5285698175430298} 11/07/2021 06:44:42 - INFO - __main__ - Step 67618: {'lr': 0.0002942520344060637, 'samples': 12982656, 'steps': 67617, 'loss/train': 1.58915114402771} 11/07/2021 06:44:42 - INFO - __main__ - Step 67619: {'lr': 0.0002942468114484888, 'samples': 12982848, 'steps': 67618, 'loss/train': 0.8407416343688965} 11/07/2021 06:44:42 - INFO - __main__ - Step 67620: {'lr': 0.00029424158847097685, 'samples': 12983040, 'steps': 67619, 'loss/train': 1.3533486127853394} 11/07/2021 06:44:43 - INFO - __main__ - Step 67621: {'lr': 0.00029423636547353037, 'samples': 12983232, 'steps': 67620, 'loss/train': 0.09938500076532364} 11/07/2021 06:44:44 - INFO - __main__ - Step 67622: {'lr': 0.0002942311424561517, 'samples': 12983424, 'steps': 67621, 'loss/train': 1.2855315208435059} 11/07/2021 06:44:44 - INFO - __main__ - Step 67623: {'lr': 0.0002942259194188431, 'samples': 12983616, 'steps': 67622, 'loss/train': 1.0791648626327515} 11/07/2021 06:44:44 - INFO - __main__ - Step 67624: {'lr': 0.000294220696361607, 'samples': 12983808, 'steps': 67623, 'loss/train': 1.6347969770431519} 11/07/2021 06:44:45 - INFO - __main__ - Step 67625: {'lr': 0.0002942154732844458, 'samples': 12984000, 'steps': 67624, 'loss/train': 1.4128385782241821} 11/07/2021 06:44:45 - INFO - __main__ - Step 67626: {'lr': 0.00029421025018736165, 'samples': 12984192, 'steps': 67625, 'loss/train': 2.471226930618286} 11/07/2021 06:44:46 - INFO - __main__ - Step 67627: {'lr': 0.0002942050270703571, 'samples': 12984384, 'steps': 67626, 'loss/train': 1.4135394096374512} 11/07/2021 06:44:46 - INFO - __main__ - Step 67628: {'lr': 0.0002941998039334345, 'samples': 12984576, 'steps': 67627, 'loss/train': 1.7182193994522095} 11/07/2021 06:44:47 - INFO - __main__ - Step 67629: {'lr': 0.00029419458077659604, 'samples': 12984768, 'steps': 67628, 'loss/train': 1.322967529296875} 11/07/2021 06:44:47 - INFO - __main__ - Step 67630: {'lr': 0.0002941893575998443, 'samples': 12984960, 'steps': 67629, 'loss/train': 0.8378026485443115} 11/07/2021 06:44:48 - INFO - __main__ - Step 67631: {'lr': 0.00029418413440318147, 'samples': 12985152, 'steps': 67630, 'loss/train': 1.3750754594802856} 11/07/2021 06:44:48 - INFO - __main__ - Step 67632: {'lr': 0.00029417891118661, 'samples': 12985344, 'steps': 67631, 'loss/train': 1.374665379524231} 11/07/2021 06:44:49 - INFO - __main__ - Step 67633: {'lr': 0.0002941736879501321, 'samples': 12985536, 'steps': 67632, 'loss/train': 1.5098227262496948} 11/07/2021 06:44:49 - INFO - __main__ - Step 67634: {'lr': 0.00029416846469375026, 'samples': 12985728, 'steps': 67633, 'loss/train': 1.5644854307174683} 11/07/2021 06:44:50 - INFO - __main__ - Step 67635: {'lr': 0.0002941632414174668, 'samples': 12985920, 'steps': 67634, 'loss/train': 1.174215316772461} 11/07/2021 06:44:50 - INFO - __main__ - Step 67636: {'lr': 0.00029415801812128413, 'samples': 12986112, 'steps': 67635, 'loss/train': 1.7602753639221191} 11/07/2021 06:44:50 - INFO - __main__ - Step 67637: {'lr': 0.00029415279480520445, 'samples': 12986304, 'steps': 67636, 'loss/train': 1.3348424434661865} 11/07/2021 06:44:51 - INFO - __main__ - Step 67638: {'lr': 0.0002941475714692302, 'samples': 12986496, 'steps': 67637, 'loss/train': 0.9119597673416138} 11/07/2021 06:44:52 - INFO - __main__ - Step 67639: {'lr': 0.00029414234811336377, 'samples': 12986688, 'steps': 67638, 'loss/train': 1.3865532875061035} 11/07/2021 06:44:52 - INFO - __main__ - Step 67640: {'lr': 0.00029413712473760743, 'samples': 12986880, 'steps': 67639, 'loss/train': 2.130936622619629} 11/07/2021 06:44:52 - INFO - __main__ - Step 67641: {'lr': 0.0002941319013419637, 'samples': 12987072, 'steps': 67640, 'loss/train': 1.2819790840148926} 11/07/2021 06:44:53 - INFO - __main__ - Step 67642: {'lr': 0.00029412667792643474, 'samples': 12987264, 'steps': 67641, 'loss/train': 0.9998139142990112} 11/07/2021 06:44:54 - INFO - __main__ - Step 67643: {'lr': 0.00029412145449102294, 'samples': 12987456, 'steps': 67642, 'loss/train': 1.7249863147735596} 11/07/2021 06:44:54 - INFO - __main__ - Step 67644: {'lr': 0.0002941162310357307, 'samples': 12987648, 'steps': 67643, 'loss/train': 1.4128715991973877} 11/07/2021 06:44:55 - INFO - __main__ - Step 67645: {'lr': 0.0002941110075605604, 'samples': 12987840, 'steps': 67644, 'loss/train': 1.4006412029266357} 11/07/2021 06:44:55 - INFO - __main__ - Step 67646: {'lr': 0.00029410578406551435, 'samples': 12988032, 'steps': 67645, 'loss/train': 1.5767829418182373} 11/07/2021 06:44:55 - INFO - __main__ - Step 67647: {'lr': 0.0002941005605505949, 'samples': 12988224, 'steps': 67646, 'loss/train': 1.5303258895874023} 11/07/2021 06:44:56 - INFO - __main__ - Step 67648: {'lr': 0.0002940953370158045, 'samples': 12988416, 'steps': 67647, 'loss/train': 1.5220983028411865} 11/07/2021 06:44:57 - INFO - __main__ - Step 67649: {'lr': 0.00029409011346114537, 'samples': 12988608, 'steps': 67648, 'loss/train': 1.4435571432113647} 11/07/2021 06:44:57 - INFO - __main__ - Step 67650: {'lr': 0.0002940848898866199, 'samples': 12988800, 'steps': 67649, 'loss/train': 1.2332923412322998} 11/07/2021 06:44:58 - INFO - __main__ - Step 67651: {'lr': 0.00029407966629223047, 'samples': 12988992, 'steps': 67650, 'loss/train': 1.4737575054168701} 11/07/2021 06:44:58 - INFO - __main__ - Step 67652: {'lr': 0.0002940744426779794, 'samples': 12989184, 'steps': 67651, 'loss/train': 1.8472868204116821} 11/07/2021 06:44:58 - INFO - __main__ - Step 67653: {'lr': 0.0002940692190438691, 'samples': 12989376, 'steps': 67652, 'loss/train': 1.1598325967788696} 11/07/2021 06:44:59 - INFO - __main__ - Step 67654: {'lr': 0.00029406399538990186, 'samples': 12989568, 'steps': 67653, 'loss/train': 1.6936798095703125} 11/07/2021 06:45:00 - INFO - __main__ - Step 67655: {'lr': 0.00029405877171608007, 'samples': 12989760, 'steps': 67654, 'loss/train': 1.5669478178024292} 11/07/2021 06:45:00 - INFO - __main__ - Step 67656: {'lr': 0.0002940535480224061, 'samples': 12989952, 'steps': 67655, 'loss/train': 1.6236473321914673} 11/07/2021 06:45:00 - INFO - __main__ - Step 67657: {'lr': 0.0002940483243088823, 'samples': 12990144, 'steps': 67656, 'loss/train': 1.159584403038025} 11/07/2021 06:45:01 - INFO - __main__ - Step 67658: {'lr': 0.00029404310057551094, 'samples': 12990336, 'steps': 67657, 'loss/train': 0.7321416735649109} 11/07/2021 06:45:02 - INFO - __main__ - Step 67659: {'lr': 0.00029403787682229444, 'samples': 12990528, 'steps': 67658, 'loss/train': 1.3308852910995483} 11/07/2021 06:45:02 - INFO - __main__ - Step 67660: {'lr': 0.0002940326530492352, 'samples': 12990720, 'steps': 67659, 'loss/train': 0.9327400922775269} 11/07/2021 06:45:02 - INFO - __main__ - Step 67661: {'lr': 0.00029402742925633554, 'samples': 12990912, 'steps': 67660, 'loss/train': 1.5856022834777832} 11/07/2021 06:45:03 - INFO - __main__ - Step 67662: {'lr': 0.00029402220544359775, 'samples': 12991104, 'steps': 67661, 'loss/train': 1.8226215839385986} 11/07/2021 06:45:03 - INFO - __main__ - Step 67663: {'lr': 0.00029401698161102426, 'samples': 12991296, 'steps': 67662, 'loss/train': 1.697083592414856} 11/07/2021 06:45:04 - INFO - __main__ - Step 67664: {'lr': 0.00029401175775861736, 'samples': 12991488, 'steps': 67663, 'loss/train': 1.0526163578033447} 11/07/2021 06:45:04 - INFO - __main__ - Step 67665: {'lr': 0.00029400653388637947, 'samples': 12991680, 'steps': 67664, 'loss/train': 1.4246126413345337} 11/07/2021 06:45:05 - INFO - __main__ - Step 67666: {'lr': 0.00029400130999431294, 'samples': 12991872, 'steps': 67665, 'loss/train': 1.2303959131240845} 11/07/2021 06:45:05 - INFO - __main__ - Step 67667: {'lr': 0.0002939960860824201, 'samples': 12992064, 'steps': 67666, 'loss/train': 1.1356874704360962} 11/07/2021 06:45:06 - INFO - __main__ - Step 67668: {'lr': 0.00029399086215070326, 'samples': 12992256, 'steps': 67667, 'loss/train': 1.268603801727295} 11/07/2021 06:45:07 - INFO - __main__ - Step 67669: {'lr': 0.0002939856381991649, 'samples': 12992448, 'steps': 67668, 'loss/train': 1.4393104314804077} 11/07/2021 06:45:07 - INFO - __main__ - Step 67670: {'lr': 0.00029398041422780717, 'samples': 12992640, 'steps': 67669, 'loss/train': 1.305525302886963} 11/07/2021 06:45:07 - INFO - __main__ - Step 67671: {'lr': 0.0002939751902366326, 'samples': 12992832, 'steps': 67670, 'loss/train': 1.39747154712677} 11/07/2021 06:45:08 - INFO - __main__ - Step 67672: {'lr': 0.00029396996622564343, 'samples': 12993024, 'steps': 67671, 'loss/train': 1.762677550315857} 11/07/2021 06:45:08 - INFO - __main__ - Step 67673: {'lr': 0.00029396474219484217, 'samples': 12993216, 'steps': 67672, 'loss/train': 2.110948324203491} 11/07/2021 06:45:08 - INFO - __main__ - Step 67674: {'lr': 0.000293959518144231, 'samples': 12993408, 'steps': 67673, 'loss/train': 1.6394819021224976} 11/07/2021 06:45:10 - INFO - __main__ - Step 67675: {'lr': 0.00029395429407381236, 'samples': 12993600, 'steps': 67674, 'loss/train': 1.4628498554229736} 11/07/2021 06:45:10 - INFO - __main__ - Step 67676: {'lr': 0.0002939490699835887, 'samples': 12993792, 'steps': 67675, 'loss/train': 1.5600382089614868} 11/07/2021 06:45:10 - INFO - __main__ - Step 67677: {'lr': 0.0002939438458735622, 'samples': 12993984, 'steps': 67676, 'loss/train': 1.786300539970398} 11/07/2021 06:45:11 - INFO - __main__ - Step 67678: {'lr': 0.0002939386217437352, 'samples': 12994176, 'steps': 67677, 'loss/train': 1.2516683340072632} 11/07/2021 06:45:11 - INFO - __main__ - Step 67679: {'lr': 0.0002939333975941102, 'samples': 12994368, 'steps': 67678, 'loss/train': 1.2404353618621826} 11/07/2021 06:45:12 - INFO - __main__ - Step 67680: {'lr': 0.0002939281734246895, 'samples': 12994560, 'steps': 67679, 'loss/train': 1.8552244901657104} 11/07/2021 06:45:12 - INFO - __main__ - Step 67681: {'lr': 0.0002939229492354754, 'samples': 12994752, 'steps': 67680, 'loss/train': 1.4990710020065308} 11/07/2021 06:45:13 - INFO - __main__ - Step 67682: {'lr': 0.00029391772502647027, 'samples': 12994944, 'steps': 67681, 'loss/train': 1.046330213546753} 11/07/2021 06:45:13 - INFO - __main__ - Step 67683: {'lr': 0.0002939125007976766, 'samples': 12995136, 'steps': 67682, 'loss/train': 0.12831997871398926} 11/07/2021 06:45:13 - INFO - __main__ - Step 67684: {'lr': 0.0002939072765490966, 'samples': 12995328, 'steps': 67683, 'loss/train': 1.2401344776153564} 11/07/2021 06:45:14 - INFO - __main__ - Step 67685: {'lr': 0.00029390205228073266, 'samples': 12995520, 'steps': 67684, 'loss/train': 1.2251803874969482} 11/07/2021 06:45:15 - INFO - __main__ - Step 67686: {'lr': 0.0002938968279925871, 'samples': 12995712, 'steps': 67685, 'loss/train': 1.6785444021224976} 11/07/2021 06:45:15 - INFO - __main__ - Step 67687: {'lr': 0.00029389160368466227, 'samples': 12995904, 'steps': 67686, 'loss/train': 1.7713451385498047} 11/07/2021 06:45:15 - INFO - __main__ - Step 67688: {'lr': 0.0002938863793569606, 'samples': 12996096, 'steps': 67687, 'loss/train': 1.535368800163269} 11/07/2021 06:45:16 - INFO - __main__ - Step 67689: {'lr': 0.0002938811550094845, 'samples': 12996288, 'steps': 67688, 'loss/train': 1.718270182609558} 11/07/2021 06:45:17 - INFO - __main__ - Step 67690: {'lr': 0.00029387593064223615, 'samples': 12996480, 'steps': 67689, 'loss/train': 1.27736234664917} 11/07/2021 06:45:17 - INFO - __main__ - Step 67691: {'lr': 0.00029387070625521794, 'samples': 12996672, 'steps': 67690, 'loss/train': 1.7141504287719727} 11/07/2021 06:45:17 - INFO - __main__ - Step 67692: {'lr': 0.00029386548184843234, 'samples': 12996864, 'steps': 67691, 'loss/train': 1.4791574478149414} 11/07/2021 06:45:18 - INFO - __main__ - Step 67693: {'lr': 0.00029386025742188156, 'samples': 12997056, 'steps': 67692, 'loss/train': 1.587140679359436} 11/07/2021 06:45:18 - INFO - __main__ - Step 67694: {'lr': 0.00029385503297556806, 'samples': 12997248, 'steps': 67693, 'loss/train': 1.5859369039535522} 11/07/2021 06:45:20 - INFO - __main__ - Step 67695: {'lr': 0.00029384980850949416, 'samples': 12997440, 'steps': 67694, 'loss/train': 1.5437389612197876} 11/07/2021 06:45:20 - INFO - __main__ - Step 67696: {'lr': 0.0002938445840236622, 'samples': 12997632, 'steps': 67695, 'loss/train': 0.8446937799453735} 11/07/2021 06:45:20 - INFO - __main__ - Step 67697: {'lr': 0.0002938393595180746, 'samples': 12997824, 'steps': 67696, 'loss/train': 1.8493813276290894} 11/07/2021 06:45:21 - INFO - __main__ - Step 67698: {'lr': 0.0002938341349927336, 'samples': 12998016, 'steps': 67697, 'loss/train': 0.6476123332977295} 11/07/2021 06:45:21 - INFO - __main__ - Step 67699: {'lr': 0.00029382891044764164, 'samples': 12998208, 'steps': 67698, 'loss/train': 2.2428066730499268} 11/07/2021 06:45:21 - INFO - __main__ - Step 67700: {'lr': 0.000293823685882801, 'samples': 12998400, 'steps': 67699, 'loss/train': 1.475335955619812} 11/07/2021 06:45:22 - INFO - __main__ - Step 67701: {'lr': 0.00029381846129821414, 'samples': 12998592, 'steps': 67700, 'loss/train': 1.5356658697128296} 11/07/2021 06:45:23 - INFO - __main__ - Step 67702: {'lr': 0.0002938132366938833, 'samples': 12998784, 'steps': 67701, 'loss/train': 1.100490689277649} 11/07/2021 06:45:23 - INFO - __main__ - Step 67703: {'lr': 0.00029380801206981103, 'samples': 12998976, 'steps': 67702, 'loss/train': 1.2266888618469238} 11/07/2021 06:45:23 - INFO - __main__ - Step 67704: {'lr': 0.0002938027874259994, 'samples': 12999168, 'steps': 67703, 'loss/train': 1.428887963294983} 11/07/2021 06:45:24 - INFO - __main__ - Step 67705: {'lr': 0.000293797562762451, 'samples': 12999360, 'steps': 67704, 'loss/train': 1.131108045578003} 11/07/2021 06:45:25 - INFO - __main__ - Step 67706: {'lr': 0.00029379233807916804, 'samples': 12999552, 'steps': 67705, 'loss/train': 1.7322767972946167} 11/07/2021 06:45:25 - INFO - __main__ - Step 67707: {'lr': 0.0002937871133761529, 'samples': 12999744, 'steps': 67706, 'loss/train': 1.7032352685928345} 11/07/2021 06:45:26 - INFO - __main__ - Step 67708: {'lr': 0.00029378188865340803, 'samples': 12999936, 'steps': 67707, 'loss/train': 1.4463920593261719} 11/07/2021 06:45:26 - INFO - __main__ - Step 67709: {'lr': 0.0002937766639109357, 'samples': 13000128, 'steps': 67708, 'loss/train': 1.2764612436294556} 11/07/2021 06:45:26 - INFO - __main__ - Step 67710: {'lr': 0.00029377143914873833, 'samples': 13000320, 'steps': 67709, 'loss/train': 1.2835370302200317} 11/07/2021 06:45:27 - INFO - __main__ - Step 67711: {'lr': 0.0002937662143668182, 'samples': 13000512, 'steps': 67710, 'loss/train': 1.2280932664871216} 11/07/2021 06:45:28 - INFO - __main__ - Step 67712: {'lr': 0.0002937609895651776, 'samples': 13000704, 'steps': 67711, 'loss/train': 1.0433353185653687} 11/07/2021 06:45:28 - INFO - __main__ - Step 67713: {'lr': 0.00029375576474381903, 'samples': 13000896, 'steps': 67712, 'loss/train': 1.5165393352508545} 11/07/2021 06:45:28 - INFO - __main__ - Step 67714: {'lr': 0.00029375053990274476, 'samples': 13001088, 'steps': 67713, 'loss/train': 1.5346800088882446} 11/07/2021 06:45:29 - INFO - __main__ - Step 67715: {'lr': 0.00029374531504195724, 'samples': 13001280, 'steps': 67714, 'loss/train': 1.5782533884048462} 11/07/2021 06:45:29 - INFO - __main__ - Step 67716: {'lr': 0.0002937400901614588, 'samples': 13001472, 'steps': 67715, 'loss/train': 1.2486505508422852} 11/07/2021 06:45:30 - INFO - __main__ - Step 67717: {'lr': 0.00029373486526125157, 'samples': 13001664, 'steps': 67716, 'loss/train': 1.3265060186386108} 11/07/2021 06:45:30 - INFO - __main__ - Step 67718: {'lr': 0.0002937296403413382, 'samples': 13001856, 'steps': 67717, 'loss/train': 1.155900478363037} 11/07/2021 06:45:31 - INFO - __main__ - Step 67719: {'lr': 0.0002937244154017209, 'samples': 13002048, 'steps': 67718, 'loss/train': 1.07576322555542} 11/07/2021 06:45:31 - INFO - __main__ - Step 67720: {'lr': 0.00029371919044240204, 'samples': 13002240, 'steps': 67719, 'loss/train': 1.5691618919372559} 11/07/2021 06:45:31 - INFO - __main__ - Step 67721: {'lr': 0.000293713965463384, 'samples': 13002432, 'steps': 67720, 'loss/train': 1.5718516111373901} 11/07/2021 06:45:33 - INFO - __main__ - Step 67722: {'lr': 0.00029370874046466913, 'samples': 13002624, 'steps': 67721, 'loss/train': 1.3987618684768677} 11/07/2021 06:45:33 - INFO - __main__ - Step 67723: {'lr': 0.0002937035154462598, 'samples': 13002816, 'steps': 67722, 'loss/train': 1.4912370443344116} 11/07/2021 06:45:33 - INFO - __main__ - Step 67724: {'lr': 0.0002936982904081583, 'samples': 13003008, 'steps': 67723, 'loss/train': 1.4317913055419922} 11/07/2021 06:45:34 - INFO - __main__ - Step 67725: {'lr': 0.0002936930653503671, 'samples': 13003200, 'steps': 67724, 'loss/train': 0.48017990589141846} 11/07/2021 06:45:34 - INFO - __main__ - Step 67726: {'lr': 0.00029368784027288843, 'samples': 13003392, 'steps': 67725, 'loss/train': 0.7985760569572449} 11/07/2021 06:45:35 - INFO - __main__ - Step 67727: {'lr': 0.0002936826151757246, 'samples': 13003584, 'steps': 67726, 'loss/train': 0.35224592685699463} 11/07/2021 06:45:35 - INFO - __main__ - Step 67728: {'lr': 0.00029367739005887816, 'samples': 13003776, 'steps': 67727, 'loss/train': 1.611600399017334} 11/07/2021 06:45:36 - INFO - __main__ - Step 67729: {'lr': 0.00029367216492235136, 'samples': 13003968, 'steps': 67728, 'loss/train': 1.4997483491897583} 11/07/2021 06:45:36 - INFO - __main__ - Step 67730: {'lr': 0.00029366693976614656, 'samples': 13004160, 'steps': 67729, 'loss/train': 1.4235684871673584} 11/07/2021 06:45:36 - INFO - __main__ - Step 67731: {'lr': 0.00029366171459026616, 'samples': 13004352, 'steps': 67730, 'loss/train': 1.2596096992492676} 11/07/2021 06:45:38 - INFO - __main__ - Step 67732: {'lr': 0.00029365648939471236, 'samples': 13004544, 'steps': 67731, 'loss/train': 1.2938616275787354} 11/07/2021 06:45:38 - INFO - __main__ - Step 67733: {'lr': 0.0002936512641794876, 'samples': 13004736, 'steps': 67732, 'loss/train': 1.047756552696228} 11/07/2021 06:45:38 - INFO - __main__ - Step 67734: {'lr': 0.00029364603894459435, 'samples': 13004928, 'steps': 67733, 'loss/train': 0.2215581089258194} 11/07/2021 06:45:39 - INFO - __main__ - Step 67735: {'lr': 0.0002936408136900348, 'samples': 13005120, 'steps': 67734, 'loss/train': 1.5335664749145508} 11/07/2021 06:45:39 - INFO - __main__ - Step 67736: {'lr': 0.00029363558841581145, 'samples': 13005312, 'steps': 67735, 'loss/train': 1.4119343757629395} 11/07/2021 06:45:40 - INFO - __main__ - Step 67737: {'lr': 0.00029363036312192654, 'samples': 13005504, 'steps': 67736, 'loss/train': 1.4730050563812256} 11/07/2021 06:45:40 - INFO - __main__ - Step 67738: {'lr': 0.0002936251378083824, 'samples': 13005696, 'steps': 67737, 'loss/train': 1.794578194618225} 11/07/2021 06:45:41 - INFO - __main__ - Step 67739: {'lr': 0.00029361991247518147, 'samples': 13005888, 'steps': 67738, 'loss/train': 1.668533205986023} 11/07/2021 06:45:41 - INFO - __main__ - Step 67740: {'lr': 0.00029361468712232614, 'samples': 13006080, 'steps': 67739, 'loss/train': 1.5462231636047363} 11/07/2021 06:45:41 - INFO - __main__ - Step 67741: {'lr': 0.0002936094617498187, 'samples': 13006272, 'steps': 67740, 'loss/train': 1.2444697618484497} 11/07/2021 06:45:42 - INFO - __main__ - Step 67742: {'lr': 0.0002936042363576614, 'samples': 13006464, 'steps': 67741, 'loss/train': 0.6438389420509338} 11/07/2021 06:45:43 - INFO - __main__ - Step 67743: {'lr': 0.00029359901094585687, 'samples': 13006656, 'steps': 67742, 'loss/train': 1.7128174304962158} 11/07/2021 06:45:43 - INFO - __main__ - Step 67744: {'lr': 0.00029359378551440724, 'samples': 13006848, 'steps': 67743, 'loss/train': 1.3808714151382446} 11/07/2021 06:45:43 - INFO - __main__ - Step 67745: {'lr': 0.00029358856006331485, 'samples': 13007040, 'steps': 67744, 'loss/train': 1.2894682884216309} 11/07/2021 06:45:44 - INFO - __main__ - Step 67746: {'lr': 0.0002935833345925822, 'samples': 13007232, 'steps': 67745, 'loss/train': 1.4605473279953003} 11/07/2021 06:45:44 - INFO - __main__ - Step 67747: {'lr': 0.00029357810910221155, 'samples': 13007424, 'steps': 67746, 'loss/train': 1.7108898162841797} 11/07/2021 06:45:45 - INFO - __main__ - Step 67748: {'lr': 0.0002935728835922053, 'samples': 13007616, 'steps': 67747, 'loss/train': 1.1714744567871094} 11/07/2021 06:45:46 - INFO - __main__ - Step 67749: {'lr': 0.00029356765806256576, 'samples': 13007808, 'steps': 67748, 'loss/train': 1.3265000581741333} 11/07/2021 06:45:46 - INFO - __main__ - Step 67750: {'lr': 0.0002935624325132953, 'samples': 13008000, 'steps': 67749, 'loss/train': 1.398056149482727} 11/07/2021 06:45:46 - INFO - __main__ - Step 67751: {'lr': 0.00029355720694439625, 'samples': 13008192, 'steps': 67750, 'loss/train': 0.3532044589519501} 11/07/2021 06:45:47 - INFO - __main__ - Step 67752: {'lr': 0.00029355198135587105, 'samples': 13008384, 'steps': 67751, 'loss/train': 1.4595260620117188} 11/07/2021 06:45:48 - INFO - __main__ - Step 67753: {'lr': 0.00029354675574772194, 'samples': 13008576, 'steps': 67752, 'loss/train': 0.9040224552154541} 11/07/2021 06:45:48 - INFO - __main__ - Step 67754: {'lr': 0.00029354153011995144, 'samples': 13008768, 'steps': 67753, 'loss/train': 1.5326719284057617} 11/07/2021 06:45:48 - INFO - __main__ - Step 67755: {'lr': 0.0002935363044725617, 'samples': 13008960, 'steps': 67754, 'loss/train': 1.6250100135803223} 11/07/2021 06:45:49 - INFO - __main__ - Step 67756: {'lr': 0.00029353107880555516, 'samples': 13009152, 'steps': 67755, 'loss/train': 1.5251877307891846} 11/07/2021 06:45:49 - INFO - __main__ - Step 67757: {'lr': 0.00029352585311893427, 'samples': 13009344, 'steps': 67756, 'loss/train': 1.1718829870224} 11/07/2021 06:45:50 - INFO - __main__ - Step 67758: {'lr': 0.00029352062741270124, 'samples': 13009536, 'steps': 67757, 'loss/train': 0.4499034285545349} 11/07/2021 06:45:50 - INFO - __main__ - Step 67759: {'lr': 0.0002935154016868585, 'samples': 13009728, 'steps': 67758, 'loss/train': 1.7893010377883911} 11/07/2021 06:45:51 - INFO - __main__ - Step 67760: {'lr': 0.00029351017594140844, 'samples': 13009920, 'steps': 67759, 'loss/train': 1.1738927364349365} 11/07/2021 06:45:51 - INFO - __main__ - Step 67761: {'lr': 0.00029350495017635333, 'samples': 13010112, 'steps': 67760, 'loss/train': 1.4492186307907104} 11/07/2021 06:45:52 - INFO - __main__ - Step 67762: {'lr': 0.0002934997243916955, 'samples': 13010304, 'steps': 67761, 'loss/train': 1.6720632314682007} 11/07/2021 06:45:52 - INFO - __main__ - Step 67763: {'lr': 0.00029349449858743744, 'samples': 13010496, 'steps': 67762, 'loss/train': 1.674884557723999} 11/07/2021 06:45:53 - INFO - __main__ - Step 67764: {'lr': 0.0002934892727635814, 'samples': 13010688, 'steps': 67763, 'loss/train': 1.4176491498947144} 11/07/2021 06:45:53 - INFO - __main__ - Step 67765: {'lr': 0.00029348404692012983, 'samples': 13010880, 'steps': 67764, 'loss/train': 1.6738696098327637} 11/07/2021 06:45:54 - INFO - __main__ - Step 67766: {'lr': 0.00029347882105708496, 'samples': 13011072, 'steps': 67765, 'loss/train': 1.3215084075927734} 11/07/2021 06:45:54 - INFO - __main__ - Step 67767: {'lr': 0.00029347359517444915, 'samples': 13011264, 'steps': 67766, 'loss/train': 1.4288619756698608} 11/07/2021 06:45:55 - INFO - __main__ - Step 67768: {'lr': 0.0002934683692722249, 'samples': 13011456, 'steps': 67767, 'loss/train': 1.528983473777771} 11/07/2021 06:45:55 - INFO - __main__ - Step 67769: {'lr': 0.0002934631433504145, 'samples': 13011648, 'steps': 67768, 'loss/train': 1.3176019191741943} 11/07/2021 06:45:56 - INFO - __main__ - Step 67770: {'lr': 0.0002934579174090202, 'samples': 13011840, 'steps': 67769, 'loss/train': 5.8181610107421875} 11/07/2021 06:45:56 - INFO - __main__ - Step 67771: {'lr': 0.0002934526914480444, 'samples': 13012032, 'steps': 67770, 'loss/train': 1.4424809217453003} 11/07/2021 06:45:56 - INFO - __main__ - Step 67772: {'lr': 0.0002934474654674896, 'samples': 13012224, 'steps': 67771, 'loss/train': 1.5504717826843262} 11/07/2021 06:45:57 - INFO - __main__ - Step 67773: {'lr': 0.00029344223946735793, 'samples': 13012416, 'steps': 67772, 'loss/train': 1.3328442573547363} 11/07/2021 06:45:58 - INFO - __main__ - Step 67774: {'lr': 0.00029343701344765197, 'samples': 13012608, 'steps': 67773, 'loss/train': 1.7158020734786987} 11/07/2021 06:45:58 - INFO - __main__ - Step 67775: {'lr': 0.00029343178740837383, 'samples': 13012800, 'steps': 67774, 'loss/train': 1.3643943071365356} 11/07/2021 06:45:58 - INFO - __main__ - Step 67776: {'lr': 0.00029342656134952606, 'samples': 13012992, 'steps': 67775, 'loss/train': 1.5098317861557007} 11/07/2021 06:45:59 - INFO - __main__ - Step 67777: {'lr': 0.00029342133527111104, 'samples': 13013184, 'steps': 67776, 'loss/train': 1.380436658859253} 11/07/2021 06:45:59 - INFO - __main__ - Step 67778: {'lr': 0.00029341610917313094, 'samples': 13013376, 'steps': 67777, 'loss/train': 1.506555438041687} 11/07/2021 06:46:00 - INFO - __main__ - Step 67779: {'lr': 0.0002934108830555882, 'samples': 13013568, 'steps': 67778, 'loss/train': 1.5171773433685303} 11/07/2021 06:46:00 - INFO - __main__ - Step 67780: {'lr': 0.0002934056569184852, 'samples': 13013760, 'steps': 67779, 'loss/train': 1.6307318210601807} 11/07/2021 06:46:01 - INFO - __main__ - Step 67781: {'lr': 0.0002934004307618243, 'samples': 13013952, 'steps': 67780, 'loss/train': 3.893327474594116} 11/07/2021 06:46:01 - INFO - __main__ - Step 67782: {'lr': 0.0002933952045856078, 'samples': 13014144, 'steps': 67781, 'loss/train': 1.586611032485962} 11/07/2021 06:46:02 - INFO - __main__ - Step 67783: {'lr': 0.00029338997838983824, 'samples': 13014336, 'steps': 67782, 'loss/train': 1.3460667133331299} 11/07/2021 06:46:03 - INFO - __main__ - Step 67784: {'lr': 0.00029338475217451765, 'samples': 13014528, 'steps': 67783, 'loss/train': 1.6142607927322388} 11/07/2021 06:46:03 - INFO - __main__ - Step 67785: {'lr': 0.00029337952593964863, 'samples': 13014720, 'steps': 67784, 'loss/train': 1.5154666900634766} 11/07/2021 06:46:04 - INFO - __main__ - Step 67786: {'lr': 0.00029337429968523344, 'samples': 13014912, 'steps': 67785, 'loss/train': 1.4412298202514648} 11/07/2021 06:46:04 - INFO - __main__ - Step 67787: {'lr': 0.00029336907341127443, 'samples': 13015104, 'steps': 67786, 'loss/train': 1.3858187198638916} 11/07/2021 06:46:04 - INFO - __main__ - Step 67788: {'lr': 0.00029336384711777403, 'samples': 13015296, 'steps': 67787, 'loss/train': 1.364585041999817} 11/07/2021 06:46:05 - INFO - __main__ - Step 67789: {'lr': 0.0002933586208047345, 'samples': 13015488, 'steps': 67788, 'loss/train': 0.45335227251052856} 11/07/2021 06:46:06 - INFO - __main__ - Step 67790: {'lr': 0.0002933533944721584, 'samples': 13015680, 'steps': 67789, 'loss/train': 1.6483101844787598} 11/07/2021 06:46:06 - INFO - __main__ - Step 67791: {'lr': 0.0002933481681200478, 'samples': 13015872, 'steps': 67790, 'loss/train': 1.5478086471557617} 11/07/2021 06:46:07 - INFO - __main__ - Step 67792: {'lr': 0.0002933429417484052, 'samples': 13016064, 'steps': 67791, 'loss/train': 1.682565689086914} 11/07/2021 06:46:07 - INFO - __main__ - Step 67793: {'lr': 0.00029333771535723294, 'samples': 13016256, 'steps': 67792, 'loss/train': 0.9552404880523682} 11/07/2021 06:46:07 - INFO - __main__ - Step 67794: {'lr': 0.00029333248894653337, 'samples': 13016448, 'steps': 67793, 'loss/train': 1.6322975158691406} 11/07/2021 06:46:08 - INFO - __main__ - Step 67795: {'lr': 0.0002933272625163088, 'samples': 13016640, 'steps': 67794, 'loss/train': 1.8149187564849854} 11/07/2021 06:46:09 - INFO - __main__ - Step 67796: {'lr': 0.00029332203606656173, 'samples': 13016832, 'steps': 67795, 'loss/train': 1.4042128324508667} 11/07/2021 06:46:09 - INFO - __main__ - Step 67797: {'lr': 0.0002933168095972944, 'samples': 13017024, 'steps': 67796, 'loss/train': 1.453286051750183} 11/07/2021 06:46:09 - INFO - __main__ - Step 67798: {'lr': 0.00029331158310850916, 'samples': 13017216, 'steps': 67797, 'loss/train': 1.0225595235824585} 11/07/2021 06:46:10 - INFO - __main__ - Step 67799: {'lr': 0.00029330635660020836, 'samples': 13017408, 'steps': 67798, 'loss/train': 1.3090651035308838} 11/07/2021 06:46:11 - INFO - __main__ - Step 67800: {'lr': 0.00029330113007239447, 'samples': 13017600, 'steps': 67799, 'loss/train': 1.0399457216262817} 11/07/2021 06:46:11 - INFO - __main__ - Step 67801: {'lr': 0.0002932959035250697, 'samples': 13017792, 'steps': 67800, 'loss/train': 1.5757427215576172} 11/07/2021 06:46:11 - INFO - __main__ - Step 67802: {'lr': 0.0002932906769582364, 'samples': 13017984, 'steps': 67801, 'loss/train': 1.8805345296859741} 11/07/2021 06:46:12 - INFO - __main__ - Step 67803: {'lr': 0.00029328545037189707, 'samples': 13018176, 'steps': 67802, 'loss/train': 1.4377795457839966} 11/07/2021 06:46:12 - INFO - __main__ - Step 67804: {'lr': 0.000293280223766054, 'samples': 13018368, 'steps': 67803, 'loss/train': 1.385378122329712} 11/07/2021 06:46:13 - INFO - __main__ - Step 67805: {'lr': 0.0002932749971407095, 'samples': 13018560, 'steps': 67804, 'loss/train': 1.7448780536651611} 11/07/2021 06:46:14 - INFO - __main__ - Step 67806: {'lr': 0.000293269770495866, 'samples': 13018752, 'steps': 67805, 'loss/train': 1.7952548265457153} 11/07/2021 06:46:14 - INFO - __main__ - Step 67807: {'lr': 0.0002932645438315257, 'samples': 13018944, 'steps': 67806, 'loss/train': 0.8646383881568909} 11/07/2021 06:46:14 - INFO - __main__ - Step 67808: {'lr': 0.00029325931714769117, 'samples': 13019136, 'steps': 67807, 'loss/train': 1.612160325050354} 11/07/2021 06:46:15 - INFO - __main__ - Step 67809: {'lr': 0.00029325409044436457, 'samples': 13019328, 'steps': 67808, 'loss/train': 1.6077452898025513} 11/07/2021 06:46:16 - INFO - __main__ - Step 67810: {'lr': 0.00029324886372154846, 'samples': 13019520, 'steps': 67809, 'loss/train': 1.789527416229248} 11/07/2021 06:46:16 - INFO - __main__ - Step 67811: {'lr': 0.000293243636979245, 'samples': 13019712, 'steps': 67810, 'loss/train': 1.1569552421569824} 11/07/2021 06:46:16 - INFO - __main__ - Step 67812: {'lr': 0.0002932384102174566, 'samples': 13019904, 'steps': 67811, 'loss/train': 1.1858153343200684} 11/07/2021 06:46:17 - INFO - __main__ - Step 67813: {'lr': 0.00029323318343618573, 'samples': 13020096, 'steps': 67812, 'loss/train': 1.3310446739196777} 11/07/2021 06:46:17 - INFO - __main__ - Step 67814: {'lr': 0.00029322795663543457, 'samples': 13020288, 'steps': 67813, 'loss/train': 1.7281789779663086} 11/07/2021 06:46:17 - INFO - __main__ - Step 67815: {'lr': 0.0002932227298152056, 'samples': 13020480, 'steps': 67814, 'loss/train': 1.492798089981079} 11/07/2021 06:46:19 - INFO - __main__ - Step 67816: {'lr': 0.0002932175029755011, 'samples': 13020672, 'steps': 67815, 'loss/train': 1.278679370880127} 11/07/2021 06:46:19 - INFO - __main__ - Step 67817: {'lr': 0.0002932122761163235, 'samples': 13020864, 'steps': 67816, 'loss/train': 0.25564059615135193} 11/07/2021 06:46:19 - INFO - __main__ - Step 67818: {'lr': 0.0002932070492376751, 'samples': 13021056, 'steps': 67817, 'loss/train': 1.5026172399520874} 11/07/2021 06:46:20 - INFO - __main__ - Step 67819: {'lr': 0.00029320182233955825, 'samples': 13021248, 'steps': 67818, 'loss/train': 2.5258336067199707} 11/07/2021 06:46:20 - INFO - __main__ - Step 67820: {'lr': 0.00029319659542197536, 'samples': 13021440, 'steps': 67819, 'loss/train': 1.9028445482254028} 11/07/2021 06:46:21 - INFO - __main__ - Step 67821: {'lr': 0.0002931913684849287, 'samples': 13021632, 'steps': 67820, 'loss/train': 1.3017374277114868} 11/07/2021 06:46:22 - INFO - __main__ - Step 67822: {'lr': 0.00029318614152842073, 'samples': 13021824, 'steps': 67821, 'loss/train': 1.7306828498840332} 11/07/2021 06:46:22 - INFO - __main__ - Step 67823: {'lr': 0.0002931809145524537, 'samples': 13022016, 'steps': 67822, 'loss/train': 1.330524206161499} 11/07/2021 06:46:22 - INFO - __main__ - Step 67824: {'lr': 0.0002931756875570301, 'samples': 13022208, 'steps': 67823, 'loss/train': 0.9200707674026489} 11/07/2021 06:46:23 - INFO - __main__ - Step 67825: {'lr': 0.0002931704605421522, 'samples': 13022400, 'steps': 67824, 'loss/train': 1.4799758195877075} 11/07/2021 06:46:24 - INFO - __main__ - Step 67826: {'lr': 0.00029316523350782225, 'samples': 13022592, 'steps': 67825, 'loss/train': 1.141540765762329} 11/07/2021 06:46:24 - INFO - __main__ - Step 67827: {'lr': 0.0002931600064540428, 'samples': 13022784, 'steps': 67826, 'loss/train': 1.4677979946136475} 11/07/2021 06:46:24 - INFO - __main__ - Step 67828: {'lr': 0.0002931547793808161, 'samples': 13022976, 'steps': 67827, 'loss/train': 1.5646471977233887} 11/07/2021 06:46:25 - INFO - __main__ - Step 67829: {'lr': 0.0002931495522881445, 'samples': 13023168, 'steps': 67828, 'loss/train': 1.5731840133666992} 11/07/2021 06:46:25 - INFO - __main__ - Step 67830: {'lr': 0.00029314432517603043, 'samples': 13023360, 'steps': 67829, 'loss/train': 1.2360169887542725} 11/07/2021 06:46:26 - INFO - __main__ - Step 67831: {'lr': 0.0002931390980444761, 'samples': 13023552, 'steps': 67830, 'loss/train': 1.7597064971923828} 11/07/2021 06:46:26 - INFO - __main__ - Step 67832: {'lr': 0.000293133870893484, 'samples': 13023744, 'steps': 67831, 'loss/train': 1.8679454326629639} 11/07/2021 06:46:27 - INFO - __main__ - Step 67833: {'lr': 0.0002931286437230565, 'samples': 13023936, 'steps': 67832, 'loss/train': 1.776475191116333} 11/07/2021 06:46:27 - INFO - __main__ - Step 67834: {'lr': 0.0002931234165331958, 'samples': 13024128, 'steps': 67833, 'loss/train': 1.7404301166534424} 11/07/2021 06:46:28 - INFO - __main__ - Step 67835: {'lr': 0.0002931181893239044, 'samples': 13024320, 'steps': 67834, 'loss/train': 1.6701480150222778} 11/07/2021 06:46:28 - INFO - __main__ - Step 67836: {'lr': 0.0002931129620951846, 'samples': 13024512, 'steps': 67835, 'loss/train': 1.6823033094406128} 11/07/2021 06:46:29 - INFO - __main__ - Step 67837: {'lr': 0.0002931077348470388, 'samples': 13024704, 'steps': 67836, 'loss/train': 1.3984627723693848} 11/07/2021 06:46:29 - INFO - __main__ - Step 67838: {'lr': 0.00029310250757946934, 'samples': 13024896, 'steps': 67837, 'loss/train': 1.5243784189224243} 11/07/2021 06:46:30 - INFO - __main__ - Step 67839: {'lr': 0.0002930972802924785, 'samples': 13025088, 'steps': 67838, 'loss/train': 1.6794235706329346} 11/07/2021 06:46:30 - INFO - __main__ - Step 67840: {'lr': 0.00029309205298606866, 'samples': 13025280, 'steps': 67839, 'loss/train': 1.2660934925079346} 11/07/2021 06:46:30 - INFO - __main__ - Step 67841: {'lr': 0.00029308682566024224, 'samples': 13025472, 'steps': 67840, 'loss/train': 1.8421576023101807} 11/07/2021 06:46:31 - INFO - __main__ - Step 67842: {'lr': 0.0002930815983150016, 'samples': 13025664, 'steps': 67841, 'loss/train': 1.107167363166809} 11/07/2021 06:46:32 - INFO - __main__ - Step 67843: {'lr': 0.000293076370950349, 'samples': 13025856, 'steps': 67842, 'loss/train': 1.2882875204086304} 11/07/2021 06:46:32 - INFO - __main__ - Step 67844: {'lr': 0.00029307114356628695, 'samples': 13026048, 'steps': 67843, 'loss/train': 1.0883562564849854} 11/07/2021 06:46:32 - INFO - __main__ - Step 67845: {'lr': 0.0002930659161628176, 'samples': 13026240, 'steps': 67844, 'loss/train': 1.480782151222229} 11/07/2021 06:46:33 - INFO - __main__ - Step 67846: {'lr': 0.0002930606887399435, 'samples': 13026432, 'steps': 67845, 'loss/train': 1.5581554174423218} 11/07/2021 06:46:34 - INFO - __main__ - Step 67847: {'lr': 0.0002930554612976668, 'samples': 13026624, 'steps': 67846, 'loss/train': 0.9068315029144287} 11/07/2021 06:46:35 - INFO - __main__ - Step 67848: {'lr': 0.00029305023383599006, 'samples': 13026816, 'steps': 67847, 'loss/train': 1.5751036405563354} 11/07/2021 06:46:35 - INFO - __main__ - Step 67849: {'lr': 0.0002930450063549155, 'samples': 13027008, 'steps': 67848, 'loss/train': 1.1436244249343872} 11/07/2021 06:46:35 - INFO - __main__ - Step 67850: {'lr': 0.00029303977885444555, 'samples': 13027200, 'steps': 67849, 'loss/train': 1.6524689197540283} 11/07/2021 06:46:36 - INFO - __main__ - Step 67851: {'lr': 0.00029303455133458255, 'samples': 13027392, 'steps': 67850, 'loss/train': 1.7561218738555908} 11/07/2021 06:46:36 - INFO - __main__ - Step 67852: {'lr': 0.00029302932379532886, 'samples': 13027584, 'steps': 67851, 'loss/train': 1.5696805715560913} 11/07/2021 06:46:37 - INFO - __main__ - Step 67853: {'lr': 0.0002930240962366868, 'samples': 13027776, 'steps': 67852, 'loss/train': 1.2358317375183105} 11/07/2021 06:46:37 - INFO - __main__ - Step 67854: {'lr': 0.0002930188686586587, 'samples': 13027968, 'steps': 67853, 'loss/train': 1.4505199193954468} 11/07/2021 06:46:38 - INFO - __main__ - Step 67855: {'lr': 0.00029301364106124706, 'samples': 13028160, 'steps': 67854, 'loss/train': 1.4544733762741089} 11/07/2021 06:46:38 - INFO - __main__ - Step 67856: {'lr': 0.00029300841344445406, 'samples': 13028352, 'steps': 67855, 'loss/train': 1.4710739850997925} 11/07/2021 06:46:38 - INFO - __main__ - Step 67857: {'lr': 0.0002930031858082822, 'samples': 13028544, 'steps': 67856, 'loss/train': 1.2039042711257935} 11/07/2021 06:46:39 - INFO - __main__ - Step 67858: {'lr': 0.0002929979581527337, 'samples': 13028736, 'steps': 67857, 'loss/train': 1.4268791675567627} 11/07/2021 06:46:40 - INFO - __main__ - Step 67859: {'lr': 0.000292992730477811, 'samples': 13028928, 'steps': 67858, 'loss/train': 0.7209809422492981} 11/07/2021 06:46:40 - INFO - __main__ - Step 67860: {'lr': 0.00029298750278351646, 'samples': 13029120, 'steps': 67859, 'loss/train': 1.5948559045791626} 11/07/2021 06:46:41 - INFO - __main__ - Step 67861: {'lr': 0.0002929822750698524, 'samples': 13029312, 'steps': 67860, 'loss/train': 1.5644826889038086} 11/07/2021 06:46:41 - INFO - __main__ - Step 67862: {'lr': 0.0002929770473368212, 'samples': 13029504, 'steps': 67861, 'loss/train': 1.4290188550949097} 11/07/2021 06:46:41 - INFO - __main__ - Step 67863: {'lr': 0.00029297181958442517, 'samples': 13029696, 'steps': 67862, 'loss/train': 1.4936413764953613} 11/07/2021 06:46:42 - INFO - __main__ - Step 67864: {'lr': 0.00029296659181266677, 'samples': 13029888, 'steps': 67863, 'loss/train': 1.6023504734039307} 11/07/2021 06:46:43 - INFO - __main__ - Step 67865: {'lr': 0.0002929613640215482, 'samples': 13030080, 'steps': 67864, 'loss/train': 1.1260650157928467} 11/07/2021 06:46:43 - INFO - __main__ - Step 67866: {'lr': 0.00029295613621107197, 'samples': 13030272, 'steps': 67865, 'loss/train': 0.9479037523269653} 11/07/2021 06:46:43 - INFO - __main__ - Step 67867: {'lr': 0.00029295090838124034, 'samples': 13030464, 'steps': 67866, 'loss/train': 1.6932101249694824} 11/07/2021 06:46:44 - INFO - __main__ - Step 67868: {'lr': 0.00029294568053205564, 'samples': 13030656, 'steps': 67867, 'loss/train': 1.2533730268478394} 11/07/2021 06:46:45 - INFO - __main__ - Step 67869: {'lr': 0.0002929404526635204, 'samples': 13030848, 'steps': 67868, 'loss/train': 1.3720861673355103} 11/07/2021 06:46:45 - INFO - __main__ - Step 67870: {'lr': 0.00029293522477563677, 'samples': 13031040, 'steps': 67869, 'loss/train': 1.501493215560913} 11/07/2021 06:46:45 - INFO - __main__ - Step 67871: {'lr': 0.00029292999686840725, 'samples': 13031232, 'steps': 67870, 'loss/train': 1.7031155824661255} 11/07/2021 06:46:46 - INFO - __main__ - Step 67872: {'lr': 0.0002929247689418341, 'samples': 13031424, 'steps': 67871, 'loss/train': 1.5803680419921875} 11/07/2021 06:46:46 - INFO - __main__ - Step 67873: {'lr': 0.0002929195409959197, 'samples': 13031616, 'steps': 67872, 'loss/train': 0.9790428280830383} 11/07/2021 06:46:48 - INFO - __main__ - Step 67874: {'lr': 0.0002929143130306664, 'samples': 13031808, 'steps': 67873, 'loss/train': 1.5084224939346313} 11/07/2021 06:46:48 - INFO - __main__ - Step 67875: {'lr': 0.0002929090850460766, 'samples': 13032000, 'steps': 67874, 'loss/train': 1.391578197479248} 11/07/2021 06:46:48 - INFO - __main__ - Step 67876: {'lr': 0.0002929038570421526, 'samples': 13032192, 'steps': 67875, 'loss/train': 0.6951577663421631} 11/07/2021 06:46:49 - INFO - __main__ - Step 67877: {'lr': 0.0002928986290188969, 'samples': 13032384, 'steps': 67876, 'loss/train': 1.371014952659607} 11/07/2021 06:46:49 - INFO - __main__ - Step 67878: {'lr': 0.00029289340097631163, 'samples': 13032576, 'steps': 67877, 'loss/train': 1.0593816041946411} 11/07/2021 06:46:50 - INFO - __main__ - Step 67879: {'lr': 0.00029288817291439926, 'samples': 13032768, 'steps': 67878, 'loss/train': 1.3792310953140259} 11/07/2021 06:46:50 - INFO - __main__ - Step 67880: {'lr': 0.0002928829448331622, 'samples': 13032960, 'steps': 67879, 'loss/train': 1.6226941347122192} 11/07/2021 06:46:51 - INFO - __main__ - Step 67881: {'lr': 0.00029287771673260267, 'samples': 13033152, 'steps': 67880, 'loss/train': 1.7930266857147217} 11/07/2021 06:46:51 - INFO - __main__ - Step 67882: {'lr': 0.00029287248861272316, 'samples': 13033344, 'steps': 67881, 'loss/train': 1.6593440771102905} 11/07/2021 06:46:52 - INFO - __main__ - Step 67883: {'lr': 0.000292867260473526, 'samples': 13033536, 'steps': 67882, 'loss/train': 1.2487818002700806} 11/07/2021 06:46:52 - INFO - __main__ - Step 67884: {'lr': 0.0002928620323150134, 'samples': 13033728, 'steps': 67883, 'loss/train': 1.0659472942352295} 11/07/2021 06:46:53 - INFO - __main__ - Step 67885: {'lr': 0.0002928568041371879, 'samples': 13033920, 'steps': 67884, 'loss/train': 1.1508749723434448} 11/07/2021 06:46:53 - INFO - __main__ - Step 67886: {'lr': 0.00029285157594005173, 'samples': 13034112, 'steps': 67885, 'loss/train': 0.3812425434589386} 11/07/2021 06:46:54 - INFO - __main__ - Step 67887: {'lr': 0.00029284634772360743, 'samples': 13034304, 'steps': 67886, 'loss/train': 1.4919415712356567} 11/07/2021 06:46:54 - INFO - __main__ - Step 67888: {'lr': 0.0002928411194878571, 'samples': 13034496, 'steps': 67887, 'loss/train': 1.5720371007919312} 11/07/2021 06:46:54 - INFO - __main__ - Step 67889: {'lr': 0.0002928358912328033, 'samples': 13034688, 'steps': 67888, 'loss/train': 0.9751690030097961} 11/07/2021 06:46:56 - INFO - __main__ - Step 67890: {'lr': 0.0002928306629584483, 'samples': 13034880, 'steps': 67889, 'loss/train': 1.2985564470291138} 11/07/2021 06:46:56 - INFO - __main__ - Step 67891: {'lr': 0.00029282543466479437, 'samples': 13035072, 'steps': 67890, 'loss/train': 1.8491538763046265} 11/07/2021 06:46:56 - INFO - __main__ - Step 67892: {'lr': 0.00029282020635184404, 'samples': 13035264, 'steps': 67891, 'loss/train': 1.5685296058654785} 11/07/2021 06:46:57 - INFO - __main__ - Step 67893: {'lr': 0.00029281497801959957, 'samples': 13035456, 'steps': 67892, 'loss/train': 1.4465429782867432} 11/07/2021 06:46:57 - INFO - __main__ - Step 67894: {'lr': 0.0002928097496680634, 'samples': 13035648, 'steps': 67893, 'loss/train': 1.5115705728530884} 11/07/2021 06:46:58 - INFO - __main__ - Step 67895: {'lr': 0.0002928045212972377, 'samples': 13035840, 'steps': 67894, 'loss/train': 1.4614897966384888} 11/07/2021 06:46:58 - INFO - __main__ - Step 67896: {'lr': 0.00029279929290712504, 'samples': 13036032, 'steps': 67895, 'loss/train': 1.5754462480545044} 11/07/2021 06:46:59 - INFO - __main__ - Step 67897: {'lr': 0.0002927940644977276, 'samples': 13036224, 'steps': 67896, 'loss/train': 1.8583605289459229} 11/07/2021 06:46:59 - INFO - __main__ - Step 67898: {'lr': 0.0002927888360690478, 'samples': 13036416, 'steps': 67897, 'loss/train': 1.3727294206619263} 11/07/2021 06:46:59 - INFO - __main__ - Step 67899: {'lr': 0.0002927836076210881, 'samples': 13036608, 'steps': 67898, 'loss/train': 0.9963122606277466} 11/07/2021 06:47:00 - INFO - __main__ - Step 67900: {'lr': 0.0002927783791538508, 'samples': 13036800, 'steps': 67899, 'loss/train': 1.5893665552139282} 11/07/2021 06:47:01 - INFO - __main__ - Step 67901: {'lr': 0.0002927731506673381, 'samples': 13036992, 'steps': 67900, 'loss/train': 1.4355107545852661} 11/07/2021 06:47:01 - INFO - __main__ - Step 67902: {'lr': 0.00029276792216155256, 'samples': 13037184, 'steps': 67901, 'loss/train': 0.629015326499939} 11/07/2021 06:47:02 - INFO - __main__ - Step 67903: {'lr': 0.0002927626936364964, 'samples': 13037376, 'steps': 67902, 'loss/train': 1.302185297012329} 11/07/2021 06:47:02 - INFO - __main__ - Step 67904: {'lr': 0.00029275746509217207, 'samples': 13037568, 'steps': 67903, 'loss/train': 0.5309227108955383} 11/07/2021 06:47:03 - INFO - __main__ - Step 67905: {'lr': 0.0002927522365285819, 'samples': 13037760, 'steps': 67904, 'loss/train': 1.1485505104064941} 11/07/2021 06:47:03 - INFO - __main__ - Step 67906: {'lr': 0.00029274700794572816, 'samples': 13037952, 'steps': 67905, 'loss/train': 0.792292594909668} 11/07/2021 06:47:04 - INFO - __main__ - Step 67907: {'lr': 0.00029274177934361336, 'samples': 13038144, 'steps': 67906, 'loss/train': 1.5345567464828491} 11/07/2021 06:47:04 - INFO - __main__ - Step 67908: {'lr': 0.0002927365507222397, 'samples': 13038336, 'steps': 67907, 'loss/train': 1.411915898323059} 11/07/2021 06:47:04 - INFO - __main__ - Step 67909: {'lr': 0.0002927313220816096, 'samples': 13038528, 'steps': 67908, 'loss/train': 1.4048173427581787} 11/07/2021 06:47:05 - INFO - __main__ - Step 67910: {'lr': 0.00029272609342172553, 'samples': 13038720, 'steps': 67909, 'loss/train': 0.751419186592102} 11/07/2021 06:47:06 - INFO - __main__ - Step 67911: {'lr': 0.0002927208647425897, 'samples': 13038912, 'steps': 67910, 'loss/train': 1.1893341541290283} 11/07/2021 06:47:06 - INFO - __main__ - Step 67912: {'lr': 0.0002927156360442045, 'samples': 13039104, 'steps': 67911, 'loss/train': 1.1181118488311768} 11/07/2021 06:47:06 - INFO - __main__ - Step 67913: {'lr': 0.0002927104073265722, 'samples': 13039296, 'steps': 67912, 'loss/train': 1.2994173765182495} 11/07/2021 06:47:07 - INFO - __main__ - Step 67914: {'lr': 0.00029270517858969537, 'samples': 13039488, 'steps': 67913, 'loss/train': 1.4515304565429688} 11/07/2021 06:47:07 - INFO - __main__ - Step 67915: {'lr': 0.00029269994983357616, 'samples': 13039680, 'steps': 67914, 'loss/train': 1.689203143119812} 11/07/2021 06:47:08 - INFO - __main__ - Step 67916: {'lr': 0.00029269472105821707, 'samples': 13039872, 'steps': 67915, 'loss/train': 0.7949196696281433} 11/07/2021 06:47:09 - INFO - __main__ - Step 67917: {'lr': 0.0002926894922636204, 'samples': 13040064, 'steps': 67916, 'loss/train': 1.5518381595611572} 11/07/2021 06:47:09 - INFO - __main__ - Step 67918: {'lr': 0.00029268426344978855, 'samples': 13040256, 'steps': 67917, 'loss/train': 1.2979285717010498} 11/07/2021 06:47:09 - INFO - __main__ - Step 67919: {'lr': 0.0002926790346167237, 'samples': 13040448, 'steps': 67918, 'loss/train': 1.1756420135498047} 11/07/2021 06:47:10 - INFO - __main__ - Step 67920: {'lr': 0.0002926738057644284, 'samples': 13040640, 'steps': 67919, 'loss/train': 0.714340090751648} 11/07/2021 06:47:11 - INFO - __main__ - Step 67921: {'lr': 0.00029266857689290497, 'samples': 13040832, 'steps': 67920, 'loss/train': 1.6180168390274048} 11/07/2021 06:47:11 - INFO - __main__ - Step 67922: {'lr': 0.0002926633480021557, 'samples': 13041024, 'steps': 67921, 'loss/train': 1.4029033184051514} 11/07/2021 06:47:11 - INFO - __main__ - Step 67923: {'lr': 0.000292658119092183, 'samples': 13041216, 'steps': 67922, 'loss/train': 1.462997555732727} 11/07/2021 06:47:12 - INFO - __main__ - Step 67924: {'lr': 0.0002926528901629892, 'samples': 13041408, 'steps': 67923, 'loss/train': 1.3229644298553467} 11/07/2021 06:47:12 - INFO - __main__ - Step 67925: {'lr': 0.0002926476612145767, 'samples': 13041600, 'steps': 67924, 'loss/train': 1.6283477544784546} 11/07/2021 06:47:13 - INFO - __main__ - Step 67926: {'lr': 0.0002926424322469478, 'samples': 13041792, 'steps': 67925, 'loss/train': 1.7449398040771484} 11/07/2021 06:47:13 - INFO - __main__ - Step 67927: {'lr': 0.00029263720326010487, 'samples': 13041984, 'steps': 67926, 'loss/train': 1.5174771547317505} 11/07/2021 06:47:14 - INFO - __main__ - Step 67928: {'lr': 0.0002926319742540503, 'samples': 13042176, 'steps': 67927, 'loss/train': 1.4693808555603027} 11/07/2021 06:47:14 - INFO - __main__ - Step 67929: {'lr': 0.00029262674522878633, 'samples': 13042368, 'steps': 67928, 'loss/train': 1.589760422706604} 11/07/2021 06:47:15 - INFO - __main__ - Step 67930: {'lr': 0.00029262151618431547, 'samples': 13042560, 'steps': 67929, 'loss/train': 1.5376039743423462} 11/07/2021 06:47:16 - INFO - __main__ - Step 67931: {'lr': 0.0002926162871206401, 'samples': 13042752, 'steps': 67930, 'loss/train': 1.6650491952896118} 11/07/2021 06:47:16 - INFO - __main__ - Step 67932: {'lr': 0.0002926110580377624, 'samples': 13042944, 'steps': 67931, 'loss/train': 1.1447241306304932} 11/07/2021 06:47:16 - INFO - __main__ - Step 67933: {'lr': 0.0002926058289356848, 'samples': 13043136, 'steps': 67932, 'loss/train': 1.5742160081863403} 11/07/2021 06:47:17 - INFO - __main__ - Step 67934: {'lr': 0.0002926005998144097, 'samples': 13043328, 'steps': 67933, 'loss/train': 1.0379713773727417} 11/07/2021 06:47:17 - INFO - __main__ - Step 67935: {'lr': 0.0002925953706739394, 'samples': 13043520, 'steps': 67934, 'loss/train': 1.4961963891983032} 11/07/2021 06:47:19 - INFO - __main__ - Step 67936: {'lr': 0.0002925901415142763, 'samples': 13043712, 'steps': 67935, 'loss/train': 1.5262168645858765} 11/07/2021 06:47:19 - INFO - __main__ - Step 67937: {'lr': 0.00029258491233542273, 'samples': 13043904, 'steps': 67936, 'loss/train': 1.8781893253326416} 11/07/2021 06:47:19 - INFO - __main__ - Step 67938: {'lr': 0.0002925796831373811, 'samples': 13044096, 'steps': 67937, 'loss/train': 0.9241393208503723} 11/07/2021 06:47:20 - INFO - __main__ - Step 67939: {'lr': 0.00029257445392015367, 'samples': 13044288, 'steps': 67938, 'loss/train': 1.6536798477172852} 11/07/2021 06:47:20 - INFO - __main__ - Step 67940: {'lr': 0.00029256922468374287, 'samples': 13044480, 'steps': 67939, 'loss/train': 1.5441327095031738} 11/07/2021 06:47:21 - INFO - __main__ - Step 67941: {'lr': 0.000292563995428151, 'samples': 13044672, 'steps': 67940, 'loss/train': 1.8898353576660156} 11/07/2021 06:47:22 - INFO - __main__ - Step 67942: {'lr': 0.00029255876615338043, 'samples': 13044864, 'steps': 67941, 'loss/train': 1.4778002500534058} 11/07/2021 06:47:22 - INFO - __main__ - Step 67943: {'lr': 0.0002925535368594336, 'samples': 13045056, 'steps': 67942, 'loss/train': 0.9024084210395813} 11/07/2021 06:47:22 - INFO - __main__ - Step 67944: {'lr': 0.0002925483075463128, 'samples': 13045248, 'steps': 67943, 'loss/train': 1.5870898962020874} 11/07/2021 06:47:23 - INFO - __main__ - Step 67945: {'lr': 0.0002925430782140204, 'samples': 13045440, 'steps': 67944, 'loss/train': 1.2967140674591064} 11/07/2021 06:47:23 - INFO - __main__ - Step 67946: {'lr': 0.00029253784886255874, 'samples': 13045632, 'steps': 67945, 'loss/train': 1.3751264810562134} 11/07/2021 06:47:24 - INFO - __main__ - Step 67947: {'lr': 0.00029253261949193016, 'samples': 13045824, 'steps': 67946, 'loss/train': 1.783624291419983} 11/07/2021 06:47:24 - INFO - __main__ - Step 67948: {'lr': 0.000292527390102137, 'samples': 13046016, 'steps': 67947, 'loss/train': 1.352480411529541} 11/07/2021 06:47:25 - INFO - __main__ - Step 67949: {'lr': 0.0002925221606931817, 'samples': 13046208, 'steps': 67948, 'loss/train': 1.6577391624450684} 11/07/2021 06:47:25 - INFO - __main__ - Step 67950: {'lr': 0.0002925169312650666, 'samples': 13046400, 'steps': 67949, 'loss/train': 1.1330437660217285} 11/07/2021 06:47:25 - INFO - __main__ - Step 67951: {'lr': 0.000292511701817794, 'samples': 13046592, 'steps': 67950, 'loss/train': 1.7287474870681763} 11/07/2021 06:47:26 - INFO - __main__ - Step 67952: {'lr': 0.0002925064723513663, 'samples': 13046784, 'steps': 67951, 'loss/train': 1.1041522026062012} 11/07/2021 06:47:27 - INFO - __main__ - Step 67953: {'lr': 0.00029250124286578583, 'samples': 13046976, 'steps': 67952, 'loss/train': 1.483371376991272} 11/07/2021 06:47:27 - INFO - __main__ - Step 67954: {'lr': 0.00029249601336105494, 'samples': 13047168, 'steps': 67953, 'loss/train': 1.3948744535446167} 11/07/2021 06:47:27 - INFO - __main__ - Step 67955: {'lr': 0.00029249078383717595, 'samples': 13047360, 'steps': 67954, 'loss/train': 1.264193058013916} 11/07/2021 06:47:28 - INFO - __main__ - Step 67956: {'lr': 0.00029248555429415137, 'samples': 13047552, 'steps': 67955, 'loss/train': 1.2603790760040283} 11/07/2021 06:47:28 - INFO - __main__ - Step 67957: {'lr': 0.0002924803247319834, 'samples': 13047744, 'steps': 67956, 'loss/train': 1.4931252002716064} 11/07/2021 06:47:29 - INFO - __main__ - Step 67958: {'lr': 0.0002924750951506745, 'samples': 13047936, 'steps': 67957, 'loss/train': 1.2927435636520386} 11/07/2021 06:47:29 - INFO - __main__ - Step 67959: {'lr': 0.00029246986555022693, 'samples': 13048128, 'steps': 67958, 'loss/train': 1.4113887548446655} 11/07/2021 06:47:30 - INFO - __main__ - Step 67960: {'lr': 0.0002924646359306431, 'samples': 13048320, 'steps': 67959, 'loss/train': 1.3916600942611694} 11/07/2021 06:47:30 - INFO - __main__ - Step 67961: {'lr': 0.00029245940629192536, 'samples': 13048512, 'steps': 67960, 'loss/train': 1.6258999109268188} 11/07/2021 06:47:31 - INFO - __main__ - Step 67962: {'lr': 0.000292454176634076, 'samples': 13048704, 'steps': 67961, 'loss/train': 1.3527666330337524} 11/07/2021 06:47:32 - INFO - __main__ - Step 67963: {'lr': 0.00029244894695709754, 'samples': 13048896, 'steps': 67962, 'loss/train': 1.5713657140731812} 11/07/2021 06:47:32 - INFO - __main__ - Step 67964: {'lr': 0.0002924437172609922, 'samples': 13049088, 'steps': 67963, 'loss/train': 1.5963985919952393} 11/07/2021 06:47:32 - INFO - __main__ - Step 67965: {'lr': 0.0002924384875457624, 'samples': 13049280, 'steps': 67964, 'loss/train': 1.2620959281921387} 11/07/2021 06:47:33 - INFO - __main__ - Step 67966: {'lr': 0.0002924332578114105, 'samples': 13049472, 'steps': 67965, 'loss/train': 1.2762980461120605} 11/07/2021 06:47:33 - INFO - __main__ - Step 67967: {'lr': 0.0002924280280579388, 'samples': 13049664, 'steps': 67966, 'loss/train': 1.6653708219528198} 11/07/2021 06:47:34 - INFO - __main__ - Step 67968: {'lr': 0.00029242279828534963, 'samples': 13049856, 'steps': 67967, 'loss/train': 1.6869205236434937} 11/07/2021 06:47:34 - INFO - __main__ - Step 67969: {'lr': 0.00029241756849364544, 'samples': 13050048, 'steps': 67968, 'loss/train': 1.0092411041259766} 11/07/2021 06:47:35 - INFO - __main__ - Step 67970: {'lr': 0.00029241233868282856, 'samples': 13050240, 'steps': 67969, 'loss/train': 1.2237409353256226} 11/07/2021 06:47:35 - INFO - __main__ - Step 67971: {'lr': 0.00029240710885290136, 'samples': 13050432, 'steps': 67970, 'loss/train': 1.379517674446106} 11/07/2021 06:47:36 - INFO - __main__ - Step 67972: {'lr': 0.0002924018790038662, 'samples': 13050624, 'steps': 67971, 'loss/train': 1.5181320905685425} 11/07/2021 06:47:37 - INFO - __main__ - Step 67973: {'lr': 0.00029239664913572526, 'samples': 13050816, 'steps': 67972, 'loss/train': 1.1558784246444702} 11/07/2021 06:47:37 - INFO - __main__ - Step 67974: {'lr': 0.0002923914192484811, 'samples': 13051008, 'steps': 67973, 'loss/train': 0.9117531180381775} 11/07/2021 06:47:37 - INFO - __main__ - Step 67975: {'lr': 0.00029238618934213605, 'samples': 13051200, 'steps': 67974, 'loss/train': 1.3389110565185547} 11/07/2021 06:47:38 - INFO - __main__ - Step 67976: {'lr': 0.0002923809594166925, 'samples': 13051392, 'steps': 67975, 'loss/train': 1.227181315422058} 11/07/2021 06:47:38 - INFO - __main__ - Step 67977: {'lr': 0.00029237572947215265, 'samples': 13051584, 'steps': 67976, 'loss/train': 0.24813126027584076} 11/07/2021 06:47:39 - INFO - __main__ - Step 67978: {'lr': 0.00029237049950851904, 'samples': 13051776, 'steps': 67977, 'loss/train': 1.363016128540039} 11/07/2021 06:47:40 - INFO - __main__ - Step 67979: {'lr': 0.0002923652695257938, 'samples': 13051968, 'steps': 67978, 'loss/train': 1.3491395711898804} 11/07/2021 06:47:40 - INFO - __main__ - Step 67980: {'lr': 0.00029236003952397955, 'samples': 13052160, 'steps': 67979, 'loss/train': 1.448309302330017} 11/07/2021 06:47:40 - INFO - __main__ - Step 67981: {'lr': 0.0002923548095030785, 'samples': 13052352, 'steps': 67980, 'loss/train': 1.4190105199813843} 11/07/2021 06:47:41 - INFO - __main__ - Step 67982: {'lr': 0.0002923495794630929, 'samples': 13052544, 'steps': 67981, 'loss/train': 1.3138569593429565} 11/07/2021 06:47:42 - INFO - __main__ - Step 67983: {'lr': 0.0002923443494040254, 'samples': 13052736, 'steps': 67982, 'loss/train': 1.478877067565918} 11/07/2021 06:47:42 - INFO - __main__ - Step 67984: {'lr': 0.0002923391193258781, 'samples': 13052928, 'steps': 67983, 'loss/train': 1.2409542798995972} 11/07/2021 06:47:43 - INFO - __main__ - Step 67985: {'lr': 0.00029233388922865353, 'samples': 13053120, 'steps': 67984, 'loss/train': 1.2753809690475464} 11/07/2021 06:47:43 - INFO - __main__ - Step 67986: {'lr': 0.00029232865911235384, 'samples': 13053312, 'steps': 67985, 'loss/train': 1.2233995199203491} 11/07/2021 06:47:43 - INFO - __main__ - Step 67987: {'lr': 0.00029232342897698164, 'samples': 13053504, 'steps': 67986, 'loss/train': 1.5952048301696777} 11/07/2021 06:47:44 - INFO - __main__ - Step 67988: {'lr': 0.000292318198822539, 'samples': 13053696, 'steps': 67987, 'loss/train': 1.2954946756362915} 11/07/2021 06:47:45 - INFO - __main__ - Step 67989: {'lr': 0.0002923129686490286, 'samples': 13053888, 'steps': 67988, 'loss/train': 1.4916826486587524} 11/07/2021 06:47:45 - INFO - __main__ - Step 67990: {'lr': 0.00029230773845645246, 'samples': 13054080, 'steps': 67989, 'loss/train': 1.7603504657745361} 11/07/2021 06:47:45 - INFO - __main__ - Step 67991: {'lr': 0.0002923025082448132, 'samples': 13054272, 'steps': 67990, 'loss/train': 1.3967450857162476} 11/07/2021 06:47:46 - INFO - __main__ - Step 67992: {'lr': 0.00029229727801411315, 'samples': 13054464, 'steps': 67991, 'loss/train': 1.4297387599945068} 11/07/2021 06:47:46 - INFO - __main__ - Step 67993: {'lr': 0.00029229204776435447, 'samples': 13054656, 'steps': 67992, 'loss/train': 1.4179093837738037} 11/07/2021 06:47:47 - INFO - __main__ - Step 67994: {'lr': 0.0002922868174955397, 'samples': 13054848, 'steps': 67993, 'loss/train': 1.7806140184402466} 11/07/2021 06:47:47 - INFO - __main__ - Step 67995: {'lr': 0.0002922815872076712, 'samples': 13055040, 'steps': 67994, 'loss/train': 0.2957956790924072} 11/07/2021 06:47:48 - INFO - __main__ - Step 67996: {'lr': 0.00029227635690075115, 'samples': 13055232, 'steps': 67995, 'loss/train': 1.8342403173446655} 11/07/2021 06:47:48 - INFO - __main__ - Step 67997: {'lr': 0.0002922711265747821, 'samples': 13055424, 'steps': 67996, 'loss/train': 1.6499074697494507} 11/07/2021 06:47:48 - INFO - __main__ - Step 67998: {'lr': 0.0002922658962297663, 'samples': 13055616, 'steps': 67997, 'loss/train': 1.4335070848464966} 11/07/2021 06:47:50 - INFO - __main__ - Step 67999: {'lr': 0.0002922606658657062, 'samples': 13055808, 'steps': 67998, 'loss/train': 1.1446151733398438} 11/07/2021 06:47:50 - INFO - __main__ - Step 68000: {'lr': 0.0002922554354826041, 'samples': 13056000, 'steps': 67999, 'loss/train': 1.5709551572799683} 11/07/2021 06:47:50 - INFO - __main__ - Step 68001: {'lr': 0.0002922502050804623, 'samples': 13056192, 'steps': 68000, 'loss/train': 1.3486835956573486} 11/07/2021 06:47:51 - INFO - __main__ - Step 68002: {'lr': 0.0002922449746592832, 'samples': 13056384, 'steps': 68001, 'loss/train': 1.6175698041915894} 11/07/2021 06:47:51 - INFO - __main__ - Step 68003: {'lr': 0.0002922397442190692, 'samples': 13056576, 'steps': 68002, 'loss/train': 1.7200117111206055} 11/07/2021 06:47:52 - INFO - __main__ - Step 68004: {'lr': 0.00029223451375982255, 'samples': 13056768, 'steps': 68003, 'loss/train': 1.6782969236373901} 11/07/2021 06:47:52 - INFO - __main__ - Step 68005: {'lr': 0.0002922292832815458, 'samples': 13056960, 'steps': 68004, 'loss/train': 1.053714632987976} 11/07/2021 06:47:53 - INFO - __main__ - Step 68006: {'lr': 0.0002922240527842411, 'samples': 13057152, 'steps': 68005, 'loss/train': 1.4585663080215454} 11/07/2021 06:47:53 - INFO - __main__ - Step 68007: {'lr': 0.0002922188222679109, 'samples': 13057344, 'steps': 68006, 'loss/train': 1.7315226793289185} 11/07/2021 06:47:53 - INFO - __main__ - Step 68008: {'lr': 0.0002922135917325576, 'samples': 13057536, 'steps': 68007, 'loss/train': 5.7356438636779785} 11/07/2021 06:47:54 - INFO - __main__ - Step 68009: {'lr': 0.00029220836117818346, 'samples': 13057728, 'steps': 68008, 'loss/train': 0.8429231643676758} 11/07/2021 06:47:55 - INFO - __main__ - Step 68010: {'lr': 0.00029220313060479087, 'samples': 13057920, 'steps': 68009, 'loss/train': 1.1137572526931763} 11/07/2021 06:47:55 - INFO - __main__ - Step 68011: {'lr': 0.00029219790001238223, 'samples': 13058112, 'steps': 68010, 'loss/train': 1.7888160943984985} 11/07/2021 06:47:56 - INFO - __main__ - Step 68012: {'lr': 0.0002921926694009599, 'samples': 13058304, 'steps': 68011, 'loss/train': 1.0811158418655396} 11/07/2021 06:47:56 - INFO - __main__ - Step 68013: {'lr': 0.00029218743877052616, 'samples': 13058496, 'steps': 68012, 'loss/train': 1.161612629890442} 11/07/2021 06:47:56 - INFO - __main__ - Step 68014: {'lr': 0.00029218220812108345, 'samples': 13058688, 'steps': 68013, 'loss/train': 1.6271628141403198} 11/07/2021 06:47:57 - INFO - __main__ - Step 68015: {'lr': 0.000292176977452634, 'samples': 13058880, 'steps': 68014, 'loss/train': 1.3414416313171387} 11/07/2021 06:47:58 - INFO - __main__ - Step 68016: {'lr': 0.0002921717467651804, 'samples': 13059072, 'steps': 68015, 'loss/train': 1.6333184242248535} 11/07/2021 06:47:58 - INFO - __main__ - Step 68017: {'lr': 0.0002921665160587248, 'samples': 13059264, 'steps': 68016, 'loss/train': 0.6936393976211548} 11/07/2021 06:47:58 - INFO - __main__ - Step 68018: {'lr': 0.0002921612853332696, 'samples': 13059456, 'steps': 68017, 'loss/train': 2.0724897384643555} 11/07/2021 06:47:59 - INFO - __main__ - Step 68019: {'lr': 0.0002921560545888171, 'samples': 13059648, 'steps': 68018, 'loss/train': 1.0212653875350952} 11/07/2021 06:48:00 - INFO - __main__ - Step 68020: {'lr': 0.0002921508238253698, 'samples': 13059840, 'steps': 68019, 'loss/train': 1.2553805112838745} 11/07/2021 06:48:00 - INFO - __main__ - Step 68021: {'lr': 0.00029214559304293003, 'samples': 13060032, 'steps': 68020, 'loss/train': 1.2623250484466553} 11/07/2021 06:48:00 - INFO - __main__ - Step 68022: {'lr': 0.0002921403622415, 'samples': 13060224, 'steps': 68021, 'loss/train': 1.8434959650039673} 11/07/2021 06:48:01 - INFO - __main__ - Step 68023: {'lr': 0.00029213513142108236, 'samples': 13060416, 'steps': 68022, 'loss/train': 1.538432002067566} 11/07/2021 06:48:01 - INFO - __main__ - Step 68024: {'lr': 0.00029212990058167913, 'samples': 13060608, 'steps': 68023, 'loss/train': 1.0893725156784058} 11/07/2021 06:48:01 - INFO - __main__ - Step 68025: {'lr': 0.0002921246697232928, 'samples': 13060800, 'steps': 68024, 'loss/train': 1.3605868816375732} 11/07/2021 06:48:03 - INFO - __main__ - Step 68026: {'lr': 0.0002921194388459258, 'samples': 13060992, 'steps': 68025, 'loss/train': 1.2855455875396729} 11/07/2021 06:48:03 - INFO - __main__ - Step 68027: {'lr': 0.0002921142079495804, 'samples': 13061184, 'steps': 68026, 'loss/train': 1.3791016340255737} 11/07/2021 06:48:03 - INFO - __main__ - Step 68028: {'lr': 0.00029210897703425907, 'samples': 13061376, 'steps': 68027, 'loss/train': 1.2919573783874512} 11/07/2021 06:48:04 - INFO - __main__ - Step 68029: {'lr': 0.00029210374609996403, 'samples': 13061568, 'steps': 68028, 'loss/train': 1.4071853160858154} 11/07/2021 06:48:04 - INFO - __main__ - Step 68030: {'lr': 0.00029209851514669773, 'samples': 13061760, 'steps': 68029, 'loss/train': 1.6110366582870483} 11/07/2021 06:48:05 - INFO - __main__ - Step 68031: {'lr': 0.0002920932841744624, 'samples': 13061952, 'steps': 68030, 'loss/train': 1.4114595651626587} 11/07/2021 06:48:05 - INFO - __main__ - Step 68032: {'lr': 0.00029208805318326056, 'samples': 13062144, 'steps': 68031, 'loss/train': 2.059664726257324} 11/07/2021 06:48:06 - INFO - __main__ - Step 68033: {'lr': 0.00029208282217309446, 'samples': 13062336, 'steps': 68032, 'loss/train': 1.4595097303390503} 11/07/2021 06:48:06 - INFO - __main__ - Step 68034: {'lr': 0.00029207759114396653, 'samples': 13062528, 'steps': 68033, 'loss/train': 1.2013440132141113} 11/07/2021 06:48:06 - INFO - __main__ - Step 68035: {'lr': 0.000292072360095879, 'samples': 13062720, 'steps': 68034, 'loss/train': 1.7085713148117065} 11/07/2021 06:48:07 - INFO - __main__ - Step 68036: {'lr': 0.00029206712902883435, 'samples': 13062912, 'steps': 68035, 'loss/train': 1.3825970888137817} 11/07/2021 06:48:08 - INFO - __main__ - Step 68037: {'lr': 0.0002920618979428349, 'samples': 13063104, 'steps': 68036, 'loss/train': 1.4482835531234741} 11/07/2021 06:48:08 - INFO - __main__ - Step 68038: {'lr': 0.00029205666683788305, 'samples': 13063296, 'steps': 68037, 'loss/train': 1.8533695936203003} 11/07/2021 06:48:08 - INFO - __main__ - Step 68039: {'lr': 0.0002920514357139811, 'samples': 13063488, 'steps': 68038, 'loss/train': 1.0822328329086304} 11/07/2021 06:48:09 - INFO - __main__ - Step 68040: {'lr': 0.0002920462045711315, 'samples': 13063680, 'steps': 68039, 'loss/train': 1.5805364847183228} 11/07/2021 06:48:10 - INFO - __main__ - Step 68041: {'lr': 0.0002920409734093364, 'samples': 13063872, 'steps': 68040, 'loss/train': 1.8195055723190308} 11/07/2021 06:48:10 - INFO - __main__ - Step 68042: {'lr': 0.0002920357422285983, 'samples': 13064064, 'steps': 68041, 'loss/train': 1.2998055219650269} 11/07/2021 06:48:10 - INFO - __main__ - Step 68043: {'lr': 0.0002920305110289195, 'samples': 13064256, 'steps': 68042, 'loss/train': 1.5851820707321167} 11/07/2021 06:48:11 - INFO - __main__ - Step 68044: {'lr': 0.00029202527981030254, 'samples': 13064448, 'steps': 68043, 'loss/train': 1.084903597831726} 11/07/2021 06:48:11 - INFO - __main__ - Step 68045: {'lr': 0.00029202004857274954, 'samples': 13064640, 'steps': 68044, 'loss/train': 1.3909945487976074} 11/07/2021 06:48:12 - INFO - __main__ - Step 68046: {'lr': 0.000292014817316263, 'samples': 13064832, 'steps': 68045, 'loss/train': 1.446393609046936} 11/07/2021 06:48:12 - INFO - __main__ - Step 68047: {'lr': 0.0002920095860408452, 'samples': 13065024, 'steps': 68046, 'loss/train': 1.5862444639205933} 11/07/2021 06:48:13 - INFO - __main__ - Step 68048: {'lr': 0.00029200435474649857, 'samples': 13065216, 'steps': 68047, 'loss/train': 0.8843222856521606} 11/07/2021 06:48:13 - INFO - __main__ - Step 68049: {'lr': 0.00029199912343322537, 'samples': 13065408, 'steps': 68048, 'loss/train': 1.1107633113861084} 11/07/2021 06:48:14 - INFO - __main__ - Step 68050: {'lr': 0.0002919938921010281, 'samples': 13065600, 'steps': 68049, 'loss/train': 1.4369546175003052} 11/07/2021 06:48:15 - INFO - __main__ - Step 68051: {'lr': 0.0002919886607499089, 'samples': 13065792, 'steps': 68050, 'loss/train': 1.5312848091125488} 11/07/2021 06:48:15 - INFO - __main__ - Step 68052: {'lr': 0.00029198342937987036, 'samples': 13065984, 'steps': 68051, 'loss/train': 0.3557504713535309} 11/07/2021 06:48:15 - INFO - __main__ - Step 68053: {'lr': 0.00029197819799091476, 'samples': 13066176, 'steps': 68052, 'loss/train': 0.8202228546142578} 11/07/2021 06:48:16 - INFO - __main__ - Step 68054: {'lr': 0.00029197296658304433, 'samples': 13066368, 'steps': 68053, 'loss/train': 0.8976806402206421} 11/07/2021 06:48:16 - INFO - __main__ - Step 68055: {'lr': 0.00029196773515626157, 'samples': 13066560, 'steps': 68054, 'loss/train': 1.7020180225372314} 11/07/2021 06:48:17 - INFO - __main__ - Step 68056: {'lr': 0.00029196250371056875, 'samples': 13066752, 'steps': 68055, 'loss/train': 1.3405808210372925} 11/07/2021 06:48:18 - INFO - __main__ - Step 68057: {'lr': 0.00029195727224596836, 'samples': 13066944, 'steps': 68056, 'loss/train': 1.3786362409591675} 11/07/2021 06:48:18 - INFO - __main__ - Step 68058: {'lr': 0.00029195204076246263, 'samples': 13067136, 'steps': 68057, 'loss/train': 1.4522910118103027} 11/07/2021 06:48:18 - INFO - __main__ - Step 68059: {'lr': 0.000291946809260054, 'samples': 13067328, 'steps': 68058, 'loss/train': 1.6722952127456665} 11/07/2021 06:48:19 - INFO - __main__ - Step 68060: {'lr': 0.00029194157773874475, 'samples': 13067520, 'steps': 68059, 'loss/train': 1.4218744039535522} 11/07/2021 06:48:20 - INFO - __main__ - Step 68061: {'lr': 0.00029193634619853725, 'samples': 13067712, 'steps': 68060, 'loss/train': 1.4195605516433716} 11/07/2021 06:48:20 - INFO - __main__ - Step 68062: {'lr': 0.0002919311146394339, 'samples': 13067904, 'steps': 68061, 'loss/train': 1.4338382482528687} 11/07/2021 06:48:20 - INFO - __main__ - Step 68063: {'lr': 0.000291925883061437, 'samples': 13068096, 'steps': 68062, 'loss/train': 1.702924370765686} 11/07/2021 06:48:21 - INFO - __main__ - Step 68064: {'lr': 0.000291920651464549, 'samples': 13068288, 'steps': 68063, 'loss/train': 1.5139628648757935} 11/07/2021 06:48:21 - INFO - __main__ - Step 68065: {'lr': 0.0002919154198487722, 'samples': 13068480, 'steps': 68064, 'loss/train': 1.3311439752578735} 11/07/2021 06:48:22 - INFO - __main__ - Step 68066: {'lr': 0.0002919101882141089, 'samples': 13068672, 'steps': 68065, 'loss/train': 1.3313993215560913} 11/07/2021 06:48:22 - INFO - __main__ - Step 68067: {'lr': 0.0002919049565605616, 'samples': 13068864, 'steps': 68066, 'loss/train': 1.1038885116577148} 11/07/2021 06:48:23 - INFO - __main__ - Step 68068: {'lr': 0.0002918997248881325, 'samples': 13069056, 'steps': 68067, 'loss/train': 1.2299628257751465} 11/07/2021 06:48:23 - INFO - __main__ - Step 68069: {'lr': 0.00029189449319682405, 'samples': 13069248, 'steps': 68068, 'loss/train': 0.737657904624939} 11/07/2021 06:48:23 - INFO - __main__ - Step 68070: {'lr': 0.0002918892614866386, 'samples': 13069440, 'steps': 68069, 'loss/train': 1.4079958200454712} 11/07/2021 06:48:24 - INFO - __main__ - Step 68071: {'lr': 0.0002918840297575785, 'samples': 13069632, 'steps': 68070, 'loss/train': 1.1759923696517944} 11/07/2021 06:48:25 - INFO - __main__ - Step 68072: {'lr': 0.00029187879800964613, 'samples': 13069824, 'steps': 68071, 'loss/train': 1.1299690008163452} 11/07/2021 06:48:25 - INFO - __main__ - Step 68073: {'lr': 0.0002918735662428438, 'samples': 13070016, 'steps': 68072, 'loss/train': 1.3441193103790283} 11/07/2021 06:48:25 - INFO - __main__ - Step 68074: {'lr': 0.0002918683344571738, 'samples': 13070208, 'steps': 68073, 'loss/train': 1.154359221458435} 11/07/2021 06:48:26 - INFO - __main__ - Step 68075: {'lr': 0.0002918631026526387, 'samples': 13070400, 'steps': 68074, 'loss/train': 0.8819845914840698} 11/07/2021 06:48:26 - INFO - __main__ - Step 68076: {'lr': 0.00029185787082924066, 'samples': 13070592, 'steps': 68075, 'loss/train': 1.6025937795639038} 11/07/2021 06:48:27 - INFO - __main__ - Step 68077: {'lr': 0.0002918526389869821, 'samples': 13070784, 'steps': 68076, 'loss/train': 1.3263936042785645} 11/07/2021 06:48:28 - INFO - __main__ - Step 68078: {'lr': 0.0002918474071258654, 'samples': 13070976, 'steps': 68077, 'loss/train': 1.359310507774353} 11/07/2021 06:48:28 - INFO - __main__ - Step 68079: {'lr': 0.00029184217524589294, 'samples': 13071168, 'steps': 68078, 'loss/train': 0.43868303298950195} 11/07/2021 06:48:28 - INFO - __main__ - Step 68080: {'lr': 0.0002918369433470671, 'samples': 13071360, 'steps': 68079, 'loss/train': 1.5420781373977661} 11/07/2021 06:48:29 - INFO - __main__ - Step 68081: {'lr': 0.00029183171142939, 'samples': 13071552, 'steps': 68080, 'loss/train': 1.5007929801940918} 11/07/2021 06:48:30 - INFO - __main__ - Step 68082: {'lr': 0.00029182647949286427, 'samples': 13071744, 'steps': 68081, 'loss/train': 1.8024473190307617} 11/07/2021 06:48:30 - INFO - __main__ - Step 68083: {'lr': 0.0002918212475374922, 'samples': 13071936, 'steps': 68082, 'loss/train': 0.712893545627594} 11/07/2021 06:48:30 - INFO - __main__ - Step 68084: {'lr': 0.00029181601556327606, 'samples': 13072128, 'steps': 68083, 'loss/train': 1.2755922079086304} 11/07/2021 06:48:31 - INFO - __main__ - Step 68085: {'lr': 0.00029181078357021835, 'samples': 13072320, 'steps': 68084, 'loss/train': 1.2744536399841309} 11/07/2021 06:48:31 - INFO - __main__ - Step 68086: {'lr': 0.00029180555155832133, 'samples': 13072512, 'steps': 68085, 'loss/train': 1.4612269401550293} 11/07/2021 06:48:32 - INFO - __main__ - Step 68087: {'lr': 0.0002918003195275873, 'samples': 13072704, 'steps': 68086, 'loss/train': 1.044560194015503} 11/07/2021 06:48:32 - INFO - __main__ - Step 68088: {'lr': 0.0002917950874780188, 'samples': 13072896, 'steps': 68087, 'loss/train': 0.9986424446105957} 11/07/2021 06:48:33 - INFO - __main__ - Step 68089: {'lr': 0.000291789855409618, 'samples': 13073088, 'steps': 68088, 'loss/train': 1.4255791902542114} 11/07/2021 06:48:33 - INFO - __main__ - Step 68090: {'lr': 0.0002917846233223873, 'samples': 13073280, 'steps': 68089, 'loss/train': 1.244908332824707} 11/07/2021 06:48:33 - INFO - __main__ - Step 68091: {'lr': 0.0002917793912163292, 'samples': 13073472, 'steps': 68090, 'loss/train': 0.9613392949104309} 11/07/2021 06:48:34 - INFO - __main__ - Step 68092: {'lr': 0.0002917741590914458, 'samples': 13073664, 'steps': 68091, 'loss/train': 1.3920594453811646} 11/07/2021 06:48:35 - INFO - __main__ - Step 68093: {'lr': 0.00029176892694773984, 'samples': 13073856, 'steps': 68092, 'loss/train': 1.5784510374069214} 11/07/2021 06:48:35 - INFO - __main__ - Step 68094: {'lr': 0.00029176369478521325, 'samples': 13074048, 'steps': 68093, 'loss/train': 1.5861666202545166} 11/07/2021 06:48:36 - INFO - __main__ - Step 68095: {'lr': 0.00029175846260386866, 'samples': 13074240, 'steps': 68094, 'loss/train': 1.7889729738235474} 11/07/2021 06:48:36 - INFO - __main__ - Step 68096: {'lr': 0.00029175323040370833, 'samples': 13074432, 'steps': 68095, 'loss/train': 1.250440239906311} 11/07/2021 06:48:36 - INFO - __main__ - Step 68097: {'lr': 0.00029174799818473465, 'samples': 13074624, 'steps': 68096, 'loss/train': 1.2360121011734009} 11/07/2021 06:48:37 - INFO - __main__ - Step 68098: {'lr': 0.0002917427659469499, 'samples': 13074816, 'steps': 68097, 'loss/train': 0.9204027056694031} 11/07/2021 06:48:37 - INFO - __main__ - Step 68099: {'lr': 0.00029173753369035664, 'samples': 13075008, 'steps': 68098, 'loss/train': 1.5362008810043335} 11/07/2021 06:48:38 - INFO - __main__ - Step 68100: {'lr': 0.00029173230141495707, 'samples': 13075200, 'steps': 68099, 'loss/train': 1.720665454864502} 11/07/2021 06:48:38 - INFO - __main__ - Step 68101: {'lr': 0.0002917270691207535, 'samples': 13075392, 'steps': 68100, 'loss/train': 1.3655016422271729} 11/07/2021 06:48:39 - INFO - __main__ - Step 68102: {'lr': 0.0002917218368077483, 'samples': 13075584, 'steps': 68101, 'loss/train': 2.8102054595947266} 11/07/2021 06:48:40 - INFO - __main__ - Step 68103: {'lr': 0.00029171660447594393, 'samples': 13075776, 'steps': 68102, 'loss/train': 1.2692357301712036} 11/07/2021 06:48:40 - INFO - __main__ - Step 68104: {'lr': 0.00029171137212534275, 'samples': 13075968, 'steps': 68103, 'loss/train': 1.0552630424499512} 11/07/2021 06:48:40 - INFO - __main__ - Step 68105: {'lr': 0.0002917061397559471, 'samples': 13076160, 'steps': 68104, 'loss/train': 1.4483797550201416} 11/07/2021 06:48:41 - INFO - __main__ - Step 68106: {'lr': 0.00029170090736775926, 'samples': 13076352, 'steps': 68105, 'loss/train': 1.8255032300949097} 11/07/2021 06:48:41 - INFO - __main__ - Step 68107: {'lr': 0.0002916956749607816, 'samples': 13076544, 'steps': 68106, 'loss/train': 1.6824232339859009} 11/07/2021 06:48:42 - INFO - __main__ - Step 68108: {'lr': 0.00029169044253501655, 'samples': 13076736, 'steps': 68107, 'loss/train': 1.6576777696609497} 11/07/2021 06:48:42 - INFO - __main__ - Step 68109: {'lr': 0.0002916852100904664, 'samples': 13076928, 'steps': 68108, 'loss/train': 1.5415595769882202} 11/07/2021 06:48:43 - INFO - __main__ - Step 68110: {'lr': 0.00029167997762713353, 'samples': 13077120, 'steps': 68109, 'loss/train': 1.1686615943908691} 11/07/2021 06:48:43 - INFO - __main__ - Step 68111: {'lr': 0.00029167474514502035, 'samples': 13077312, 'steps': 68110, 'loss/train': 1.3679016828536987} 11/07/2021 06:48:43 - INFO - __main__ - Step 68112: {'lr': 0.0002916695126441292, 'samples': 13077504, 'steps': 68111, 'loss/train': 1.6378307342529297} 11/07/2021 06:48:44 - INFO - __main__ - Step 68113: {'lr': 0.0002916642801244624, 'samples': 13077696, 'steps': 68112, 'loss/train': 1.4338172674179077} 11/07/2021 06:48:45 - INFO - __main__ - Step 68114: {'lr': 0.00029165904758602225, 'samples': 13077888, 'steps': 68113, 'loss/train': 1.5692328214645386} 11/07/2021 06:48:45 - INFO - __main__ - Step 68115: {'lr': 0.0002916538150288112, 'samples': 13078080, 'steps': 68114, 'loss/train': 1.3801062107086182} 11/07/2021 06:48:45 - INFO - __main__ - Step 68116: {'lr': 0.0002916485824528316, 'samples': 13078272, 'steps': 68115, 'loss/train': 1.8759005069732666} 11/07/2021 06:48:46 - INFO - __main__ - Step 68117: {'lr': 0.00029164334985808577, 'samples': 13078464, 'steps': 68116, 'loss/train': 0.8295671939849854} 11/07/2021 06:48:47 - INFO - __main__ - Step 68118: {'lr': 0.0002916381172445761, 'samples': 13078656, 'steps': 68117, 'loss/train': 1.7659796476364136} 11/07/2021 06:48:47 - INFO - __main__ - Step 68119: {'lr': 0.00029163288461230496, 'samples': 13078848, 'steps': 68118, 'loss/train': 1.5676058530807495} 11/07/2021 06:48:48 - INFO - __main__ - Step 68120: {'lr': 0.0002916276519612747, 'samples': 13079040, 'steps': 68119, 'loss/train': 1.0648003816604614} 11/07/2021 06:48:48 - INFO - __main__ - Step 68121: {'lr': 0.00029162241929148766, 'samples': 13079232, 'steps': 68120, 'loss/train': 1.2830592393875122} 11/07/2021 06:48:48 - INFO - __main__ - Step 68122: {'lr': 0.00029161718660294613, 'samples': 13079424, 'steps': 68121, 'loss/train': 1.592630386352539} 11/07/2021 06:48:49 - INFO - __main__ - Step 68123: {'lr': 0.00029161195389565257, 'samples': 13079616, 'steps': 68122, 'loss/train': 0.9572741389274597} 11/07/2021 06:48:50 - INFO - __main__ - Step 68124: {'lr': 0.0002916067211696093, 'samples': 13079808, 'steps': 68123, 'loss/train': 1.5240716934204102} 11/07/2021 06:48:50 - INFO - __main__ - Step 68125: {'lr': 0.0002916014884248187, 'samples': 13080000, 'steps': 68124, 'loss/train': 0.8478837609291077} 11/07/2021 06:48:50 - INFO - __main__ - Step 68126: {'lr': 0.0002915962556612832, 'samples': 13080192, 'steps': 68125, 'loss/train': 1.661412239074707} 11/07/2021 06:48:51 - INFO - __main__ - Step 68127: {'lr': 0.0002915910228790049, 'samples': 13080384, 'steps': 68126, 'loss/train': 1.6150540113449097} 11/07/2021 06:48:51 - INFO - __main__ - Step 68128: {'lr': 0.0002915857900779864, 'samples': 13080576, 'steps': 68127, 'loss/train': 1.2570607662200928} 11/07/2021 06:48:52 - INFO - __main__ - Step 68129: {'lr': 0.00029158055725823, 'samples': 13080768, 'steps': 68128, 'loss/train': 1.442226529121399} 11/07/2021 06:48:53 - INFO - __main__ - Step 68130: {'lr': 0.000291575324419738, 'samples': 13080960, 'steps': 68129, 'loss/train': 0.8583510518074036} 11/07/2021 06:48:53 - INFO - __main__ - Step 68131: {'lr': 0.00029157009156251284, 'samples': 13081152, 'steps': 68130, 'loss/train': 0.8335437178611755} 11/07/2021 06:48:53 - INFO - __main__ - Step 68132: {'lr': 0.0002915648586865569, 'samples': 13081344, 'steps': 68131, 'loss/train': 1.234636664390564} 11/07/2021 06:48:54 - INFO - __main__ - Step 68133: {'lr': 0.0002915596257918724, 'samples': 13081536, 'steps': 68132, 'loss/train': 1.3213733434677124} 11/07/2021 06:48:55 - INFO - __main__ - Step 68134: {'lr': 0.00029155439287846177, 'samples': 13081728, 'steps': 68133, 'loss/train': 1.8021981716156006} 11/07/2021 06:48:55 - INFO - __main__ - Step 68135: {'lr': 0.00029154915994632734, 'samples': 13081920, 'steps': 68134, 'loss/train': 1.3035051822662354} 11/07/2021 06:48:56 - INFO - __main__ - Step 68136: {'lr': 0.00029154392699547155, 'samples': 13082112, 'steps': 68135, 'loss/train': 1.099106788635254} 11/07/2021 06:48:56 - INFO - __main__ - Step 68137: {'lr': 0.00029153869402589674, 'samples': 13082304, 'steps': 68136, 'loss/train': 1.2588967084884644} 11/07/2021 06:48:56 - INFO - __main__ - Step 68138: {'lr': 0.00029153346103760514, 'samples': 13082496, 'steps': 68137, 'loss/train': 1.5030261278152466} 11/07/2021 06:48:57 - INFO - __main__ - Step 68139: {'lr': 0.0002915282280305993, 'samples': 13082688, 'steps': 68138, 'loss/train': 1.3214619159698486} 11/07/2021 06:48:58 - INFO - __main__ - Step 68140: {'lr': 0.00029152299500488144, 'samples': 13082880, 'steps': 68139, 'loss/train': 1.1522941589355469} 11/07/2021 06:48:58 - INFO - __main__ - Step 68141: {'lr': 0.00029151776196045397, 'samples': 13083072, 'steps': 68140, 'loss/train': 1.8078041076660156} 11/07/2021 06:48:58 - INFO - __main__ - Step 68142: {'lr': 0.00029151252889731923, 'samples': 13083264, 'steps': 68141, 'loss/train': 1.372719168663025} 11/07/2021 06:48:59 - INFO - __main__ - Step 68143: {'lr': 0.0002915072958154795, 'samples': 13083456, 'steps': 68142, 'loss/train': 1.3903707265853882} 11/07/2021 06:48:59 - INFO - __main__ - Step 68144: {'lr': 0.0002915020627149373, 'samples': 13083648, 'steps': 68143, 'loss/train': 1.0719314813613892} 11/07/2021 06:49:00 - INFO - __main__ - Step 68145: {'lr': 0.00029149682959569496, 'samples': 13083840, 'steps': 68144, 'loss/train': 2.1324493885040283} 11/07/2021 06:49:01 - INFO - __main__ - Step 68146: {'lr': 0.00029149159645775483, 'samples': 13084032, 'steps': 68145, 'loss/train': 1.7165281772613525} 11/07/2021 06:49:01 - INFO - __main__ - Step 68147: {'lr': 0.0002914863633011191, 'samples': 13084224, 'steps': 68146, 'loss/train': 1.412397027015686} 11/07/2021 06:49:01 - INFO - __main__ - Step 68148: {'lr': 0.00029148113012579025, 'samples': 13084416, 'steps': 68147, 'loss/train': 0.8411949872970581} 11/07/2021 06:49:02 - INFO - __main__ - Step 68149: {'lr': 0.0002914758969317707, 'samples': 13084608, 'steps': 68148, 'loss/train': 1.7156939506530762} 11/07/2021 06:49:03 - INFO - __main__ - Step 68150: {'lr': 0.00029147066371906273, 'samples': 13084800, 'steps': 68149, 'loss/train': 1.0617287158966064} 11/07/2021 06:49:03 - INFO - __main__ - Step 68151: {'lr': 0.0002914654304876687, 'samples': 13084992, 'steps': 68150, 'loss/train': 1.5807886123657227} 11/07/2021 06:49:03 - INFO - __main__ - Step 68152: {'lr': 0.0002914601972375911, 'samples': 13085184, 'steps': 68151, 'loss/train': 1.6624988317489624} 11/07/2021 06:49:04 - INFO - __main__ - Step 68153: {'lr': 0.0002914549639688321, 'samples': 13085376, 'steps': 68152, 'loss/train': 1.4467017650604248} 11/07/2021 06:49:04 - INFO - __main__ - Step 68154: {'lr': 0.0002914497306813941, 'samples': 13085568, 'steps': 68153, 'loss/train': 1.188105583190918} 11/07/2021 06:49:05 - INFO - __main__ - Step 68155: {'lr': 0.0002914444973752795, 'samples': 13085760, 'steps': 68154, 'loss/train': 1.7155725955963135} 11/07/2021 06:49:05 - INFO - __main__ - Step 68156: {'lr': 0.0002914392640504907, 'samples': 13085952, 'steps': 68155, 'loss/train': 1.9305859804153442} 11/07/2021 06:49:06 - INFO - __main__ - Step 68157: {'lr': 0.00029143403070702994, 'samples': 13086144, 'steps': 68156, 'loss/train': 1.7001938819885254} 11/07/2021 06:49:06 - INFO - __main__ - Step 68158: {'lr': 0.0002914287973448997, 'samples': 13086336, 'steps': 68157, 'loss/train': 1.1809884309768677} 11/07/2021 06:49:06 - INFO - __main__ - Step 68159: {'lr': 0.00029142356396410227, 'samples': 13086528, 'steps': 68158, 'loss/train': 1.2738173007965088} 11/07/2021 06:49:07 - INFO - __main__ - Step 68160: {'lr': 0.00029141833056463995, 'samples': 13086720, 'steps': 68159, 'loss/train': 1.1251691579818726} 11/07/2021 06:49:08 - INFO - __main__ - Step 68161: {'lr': 0.00029141309714651525, 'samples': 13086912, 'steps': 68160, 'loss/train': 1.677585482597351} 11/07/2021 06:49:08 - INFO - __main__ - Step 68162: {'lr': 0.0002914078637097305, 'samples': 13087104, 'steps': 68161, 'loss/train': 1.5248414278030396} 11/07/2021 06:49:09 - INFO - __main__ - Step 68163: {'lr': 0.00029140263025428785, 'samples': 13087296, 'steps': 68162, 'loss/train': 1.8680981397628784} 11/07/2021 06:49:09 - INFO - __main__ - Step 68164: {'lr': 0.00029139739678018996, 'samples': 13087488, 'steps': 68163, 'loss/train': 1.0967706441879272} 11/07/2021 06:49:10 - INFO - __main__ - Step 68165: {'lr': 0.0002913921632874389, 'samples': 13087680, 'steps': 68164, 'loss/train': 0.8898423314094543} 11/07/2021 06:49:10 - INFO - __main__ - Step 68166: {'lr': 0.00029138692977603734, 'samples': 13087872, 'steps': 68165, 'loss/train': 1.744472861289978} 11/07/2021 06:49:11 - INFO - __main__ - Step 68167: {'lr': 0.0002913816962459873, 'samples': 13088064, 'steps': 68166, 'loss/train': 1.6884469985961914} 11/07/2021 06:49:11 - INFO - __main__ - Step 68168: {'lr': 0.00029137646269729143, 'samples': 13088256, 'steps': 68167, 'loss/train': 1.2714956998825073} 11/07/2021 06:49:11 - INFO - __main__ - Step 68169: {'lr': 0.0002913712291299519, 'samples': 13088448, 'steps': 68168, 'loss/train': 1.2399570941925049} 11/07/2021 06:49:12 - INFO - __main__ - Step 68170: {'lr': 0.0002913659955439711, 'samples': 13088640, 'steps': 68169, 'loss/train': 1.4693984985351562} 11/07/2021 06:49:13 - INFO - __main__ - Step 68171: {'lr': 0.0002913607619393515, 'samples': 13088832, 'steps': 68170, 'loss/train': 1.6042896509170532} 11/07/2021 06:49:13 - INFO - __main__ - Step 68172: {'lr': 0.00029135552831609533, 'samples': 13089024, 'steps': 68171, 'loss/train': 1.3957781791687012} 11/07/2021 06:49:13 - INFO - __main__ - Step 68173: {'lr': 0.0002913502946742051, 'samples': 13089216, 'steps': 68172, 'loss/train': 1.3288214206695557} 11/07/2021 06:49:14 - INFO - __main__ - Step 68174: {'lr': 0.00029134506101368297, 'samples': 13089408, 'steps': 68173, 'loss/train': 1.5000534057617188} 11/07/2021 06:49:14 - INFO - __main__ - Step 68175: {'lr': 0.0002913398273345314, 'samples': 13089600, 'steps': 68174, 'loss/train': 1.1355421543121338} 11/07/2021 06:49:15 - INFO - __main__ - Step 68176: {'lr': 0.00029133459363675274, 'samples': 13089792, 'steps': 68175, 'loss/train': 1.2041559219360352} 11/07/2021 06:49:15 - INFO - __main__ - Step 68177: {'lr': 0.0002913293599203494, 'samples': 13089984, 'steps': 68176, 'loss/train': 1.9036033153533936} 11/07/2021 06:49:16 - INFO - __main__ - Step 68178: {'lr': 0.00029132412618532356, 'samples': 13090176, 'steps': 68177, 'loss/train': 1.4889378547668457} 11/07/2021 06:49:16 - INFO - __main__ - Step 68179: {'lr': 0.0002913188924316778, 'samples': 13090368, 'steps': 68178, 'loss/train': 1.4152034521102905} 11/07/2021 06:49:16 - INFO - __main__ - Step 68180: {'lr': 0.0002913136586594144, 'samples': 13090560, 'steps': 68179, 'loss/train': 1.4040188789367676} 11/07/2021 06:49:17 - INFO - __main__ - Step 68181: {'lr': 0.0002913084248685357, 'samples': 13090752, 'steps': 68180, 'loss/train': 1.7153314352035522} 11/07/2021 06:49:18 - INFO - __main__ - Step 68182: {'lr': 0.000291303191059044, 'samples': 13090944, 'steps': 68181, 'loss/train': 1.7221741676330566} 11/07/2021 06:49:18 - INFO - __main__ - Step 68183: {'lr': 0.00029129795723094174, 'samples': 13091136, 'steps': 68182, 'loss/train': 1.9297444820404053} 11/07/2021 06:49:19 - INFO - __main__ - Step 68184: {'lr': 0.0002912927233842313, 'samples': 13091328, 'steps': 68183, 'loss/train': 1.386856198310852} 11/07/2021 06:49:19 - INFO - __main__ - Step 68185: {'lr': 0.000291287489518915, 'samples': 13091520, 'steps': 68184, 'loss/train': 1.687288522720337} 11/07/2021 06:49:20 - INFO - __main__ - Step 68186: {'lr': 0.0002912822556349951, 'samples': 13091712, 'steps': 68185, 'loss/train': 1.3477195501327515} 11/07/2021 06:49:20 - INFO - __main__ - Step 68187: {'lr': 0.00029127702173247416, 'samples': 13091904, 'steps': 68186, 'loss/train': 1.1901947259902954} 11/07/2021 06:49:21 - INFO - __main__ - Step 68188: {'lr': 0.0002912717878113544, 'samples': 13092096, 'steps': 68187, 'loss/train': 1.319311261177063} 11/07/2021 06:49:21 - INFO - __main__ - Step 68189: {'lr': 0.0002912665538716382, 'samples': 13092288, 'steps': 68188, 'loss/train': 1.1729542016983032} 11/07/2021 06:49:21 - INFO - __main__ - Step 68190: {'lr': 0.00029126131991332794, 'samples': 13092480, 'steps': 68189, 'loss/train': 1.4320168495178223} 11/07/2021 06:49:22 - INFO - __main__ - Step 68191: {'lr': 0.00029125608593642594, 'samples': 13092672, 'steps': 68190, 'loss/train': 1.609071969985962} 11/07/2021 06:49:23 - INFO - __main__ - Step 68192: {'lr': 0.0002912508519409346, 'samples': 13092864, 'steps': 68191, 'loss/train': 1.70625638961792} 11/07/2021 06:49:23 - INFO - __main__ - Step 68193: {'lr': 0.00029124561792685626, 'samples': 13093056, 'steps': 68192, 'loss/train': 0.9892821311950684} 11/07/2021 06:49:23 - INFO - __main__ - Step 68194: {'lr': 0.00029124038389419325, 'samples': 13093248, 'steps': 68193, 'loss/train': 1.380111575126648} 11/07/2021 06:49:24 - INFO - __main__ - Step 68195: {'lr': 0.00029123514984294804, 'samples': 13093440, 'steps': 68194, 'loss/train': 1.037576675415039} 11/07/2021 06:49:25 - INFO - __main__ - Step 68196: {'lr': 0.00029122991577312286, 'samples': 13093632, 'steps': 68195, 'loss/train': 0.8297615051269531} 11/07/2021 06:49:25 - INFO - __main__ - Step 68197: {'lr': 0.0002912246816847201, 'samples': 13093824, 'steps': 68196, 'loss/train': 1.3232370615005493} 11/07/2021 06:49:25 - INFO - __main__ - Step 68198: {'lr': 0.0002912194475777422, 'samples': 13094016, 'steps': 68197, 'loss/train': 1.4384106397628784} 11/07/2021 06:49:26 - INFO - __main__ - Step 68199: {'lr': 0.00029121421345219134, 'samples': 13094208, 'steps': 68198, 'loss/train': 0.9521325826644897} 11/07/2021 06:49:26 - INFO - __main__ - Step 68200: {'lr': 0.0002912089793080701, 'samples': 13094400, 'steps': 68199, 'loss/train': 1.1259582042694092} 11/07/2021 06:49:27 - INFO - __main__ - Step 68201: {'lr': 0.0002912037451453807, 'samples': 13094592, 'steps': 68200, 'loss/train': 1.3176496028900146} 11/07/2021 06:49:28 - INFO - __main__ - Step 68202: {'lr': 0.00029119851096412545, 'samples': 13094784, 'steps': 68201, 'loss/train': 2.089259624481201} 11/07/2021 06:49:28 - INFO - __main__ - Step 68203: {'lr': 0.00029119327676430687, 'samples': 13094976, 'steps': 68202, 'loss/train': 1.5250415802001953} 11/07/2021 06:49:28 - INFO - __main__ - Step 68204: {'lr': 0.0002911880425459272, 'samples': 13095168, 'steps': 68203, 'loss/train': 1.0607032775878906} 11/07/2021 06:49:29 - INFO - __main__ - Step 68205: {'lr': 0.0002911828083089889, 'samples': 13095360, 'steps': 68204, 'loss/train': 1.258955478668213} 11/07/2021 06:49:30 - INFO - __main__ - Step 68206: {'lr': 0.00029117757405349413, 'samples': 13095552, 'steps': 68205, 'loss/train': 1.6178526878356934} 11/07/2021 06:49:30 - INFO - __main__ - Step 68207: {'lr': 0.00029117233977944554, 'samples': 13095744, 'steps': 68206, 'loss/train': 1.6667149066925049} 11/07/2021 06:49:30 - INFO - __main__ - Step 68208: {'lr': 0.0002911671054868452, 'samples': 13095936, 'steps': 68207, 'loss/train': 1.2556581497192383} 11/07/2021 06:49:31 - INFO - __main__ - Step 68209: {'lr': 0.00029116187117569567, 'samples': 13096128, 'steps': 68208, 'loss/train': 1.4218934774398804} 11/07/2021 06:49:31 - INFO - __main__ - Step 68210: {'lr': 0.0002911566368459992, 'samples': 13096320, 'steps': 68209, 'loss/train': 1.395958662033081} 11/07/2021 06:49:31 - INFO - __main__ - Step 68211: {'lr': 0.0002911514024977582, 'samples': 13096512, 'steps': 68210, 'loss/train': 1.2826099395751953} 11/07/2021 06:49:32 - INFO - __main__ - Step 68212: {'lr': 0.000291146168130975, 'samples': 13096704, 'steps': 68211, 'loss/train': 1.3243491649627686} 11/07/2021 06:49:33 - INFO - __main__ - Step 68213: {'lr': 0.000291140933745652, 'samples': 13096896, 'steps': 68212, 'loss/train': 0.7599703073501587} 11/07/2021 06:49:33 - INFO - __main__ - Step 68214: {'lr': 0.0002911356993417915, 'samples': 13097088, 'steps': 68213, 'loss/train': 1.591809630393982} 11/07/2021 06:49:33 - INFO - __main__ - Step 68215: {'lr': 0.00029113046491939585, 'samples': 13097280, 'steps': 68214, 'loss/train': 1.4953416585922241} 11/07/2021 06:49:34 - INFO - __main__ - Step 68216: {'lr': 0.00029112523047846757, 'samples': 13097472, 'steps': 68215, 'loss/train': 1.6375960111618042} 11/07/2021 06:49:35 - INFO - __main__ - Step 68217: {'lr': 0.0002911199960190088, 'samples': 13097664, 'steps': 68216, 'loss/train': 1.2399197816848755} 11/07/2021 06:49:36 - INFO - __main__ - Step 68218: {'lr': 0.000291114761541022, 'samples': 13097856, 'steps': 68217, 'loss/train': 1.0874875783920288} 11/07/2021 06:49:36 - INFO - __main__ - Step 68219: {'lr': 0.00029110952704450955, 'samples': 13098048, 'steps': 68218, 'loss/train': 1.7893357276916504} 11/07/2021 06:49:36 - INFO - __main__ - Step 68220: {'lr': 0.00029110429252947377, 'samples': 13098240, 'steps': 68219, 'loss/train': 1.032353162765503} 11/07/2021 06:49:37 - INFO - __main__ - Step 68221: {'lr': 0.00029109905799591706, 'samples': 13098432, 'steps': 68220, 'loss/train': 2.3242993354797363} 11/07/2021 06:49:38 - INFO - __main__ - Step 68222: {'lr': 0.00029109382344384173, 'samples': 13098624, 'steps': 68221, 'loss/train': 1.6429455280303955} 11/07/2021 06:49:38 - INFO - __main__ - Step 68223: {'lr': 0.00029108858887325013, 'samples': 13098816, 'steps': 68222, 'loss/train': 1.1825287342071533} 11/07/2021 06:49:38 - INFO - __main__ - Step 68224: {'lr': 0.00029108335428414464, 'samples': 13099008, 'steps': 68223, 'loss/train': 1.8321518898010254} 11/07/2021 06:49:39 - INFO - __main__ - Step 68225: {'lr': 0.00029107811967652765, 'samples': 13099200, 'steps': 68224, 'loss/train': 0.7662699818611145} 11/07/2021 06:49:39 - INFO - __main__ - Step 68226: {'lr': 0.0002910728850504015, 'samples': 13099392, 'steps': 68225, 'loss/train': 0.18282726407051086} 11/07/2021 06:49:40 - INFO - __main__ - Step 68227: {'lr': 0.0002910676504057686, 'samples': 13099584, 'steps': 68226, 'loss/train': 1.3760473728179932} 11/07/2021 06:49:41 - INFO - __main__ - Step 68228: {'lr': 0.00029106241574263116, 'samples': 13099776, 'steps': 68227, 'loss/train': 1.415497064590454} 11/07/2021 06:49:41 - INFO - __main__ - Step 68229: {'lr': 0.0002910571810609916, 'samples': 13099968, 'steps': 68228, 'loss/train': 0.11058886349201202} 11/07/2021 06:49:41 - INFO - __main__ - Step 68230: {'lr': 0.0002910519463608524, 'samples': 13100160, 'steps': 68229, 'loss/train': 1.3259438276290894} 11/07/2021 06:49:42 - INFO - __main__ - Step 68231: {'lr': 0.00029104671164221574, 'samples': 13100352, 'steps': 68230, 'loss/train': 1.4960342645645142} 11/07/2021 06:49:43 - INFO - __main__ - Step 68232: {'lr': 0.0002910414769050841, 'samples': 13100544, 'steps': 68231, 'loss/train': 0.08011938631534576} 11/07/2021 06:49:43 - INFO - __main__ - Step 68233: {'lr': 0.0002910362421494598, 'samples': 13100736, 'steps': 68232, 'loss/train': 1.422576665878296} 11/07/2021 06:49:43 - INFO - __main__ - Step 68234: {'lr': 0.00029103100737534526, 'samples': 13100928, 'steps': 68233, 'loss/train': 1.2636744976043701} 11/07/2021 06:49:44 - INFO - __main__ - Step 68235: {'lr': 0.0002910257725827428, 'samples': 13101120, 'steps': 68234, 'loss/train': 1.52620267868042} 11/07/2021 06:49:44 - INFO - __main__ - Step 68236: {'lr': 0.00029102053777165464, 'samples': 13101312, 'steps': 68235, 'loss/train': 1.9026994705200195} 11/07/2021 06:49:45 - INFO - __main__ - Step 68237: {'lr': 0.00029101530294208336, 'samples': 13101504, 'steps': 68236, 'loss/train': 1.7687504291534424} 11/07/2021 06:49:46 - INFO - __main__ - Step 68238: {'lr': 0.00029101006809403114, 'samples': 13101696, 'steps': 68237, 'loss/train': 1.9396713972091675} 11/07/2021 06:49:46 - INFO - __main__ - Step 68239: {'lr': 0.00029100483322750043, 'samples': 13101888, 'steps': 68238, 'loss/train': 1.4138219356536865} 11/07/2021 06:49:46 - INFO - __main__ - Step 68240: {'lr': 0.00029099959834249356, 'samples': 13102080, 'steps': 68239, 'loss/train': 1.4143165349960327} 11/07/2021 06:49:47 - INFO - __main__ - Step 68241: {'lr': 0.00029099436343901303, 'samples': 13102272, 'steps': 68240, 'loss/train': 1.006747841835022} 11/07/2021 06:49:47 - INFO - __main__ - Step 68242: {'lr': 0.00029098912851706094, 'samples': 13102464, 'steps': 68241, 'loss/train': 1.5267326831817627} 11/07/2021 06:49:48 - INFO - __main__ - Step 68243: {'lr': 0.00029098389357663985, 'samples': 13102656, 'steps': 68242, 'loss/train': 1.0927807092666626} 11/07/2021 06:49:49 - INFO - __main__ - Step 68244: {'lr': 0.000290978658617752, 'samples': 13102848, 'steps': 68243, 'loss/train': 1.103101134300232} 11/07/2021 06:49:49 - INFO - __main__ - Step 68245: {'lr': 0.0002909734236403998, 'samples': 13103040, 'steps': 68244, 'loss/train': 1.4800076484680176} 11/07/2021 06:49:49 - INFO - __main__ - Step 68246: {'lr': 0.00029096818864458564, 'samples': 13103232, 'steps': 68245, 'loss/train': 1.6889880895614624} 11/07/2021 06:49:50 - INFO - __main__ - Step 68247: {'lr': 0.0002909629536303119, 'samples': 13103424, 'steps': 68246, 'loss/train': 1.0974656343460083} 11/07/2021 06:49:51 - INFO - __main__ - Step 68248: {'lr': 0.0002909577185975808, 'samples': 13103616, 'steps': 68247, 'loss/train': 1.3227789402008057} 11/07/2021 06:49:51 - INFO - __main__ - Step 68249: {'lr': 0.0002909524835463948, 'samples': 13103808, 'steps': 68248, 'loss/train': 1.194120168685913} 11/07/2021 06:49:51 - INFO - __main__ - Step 68250: {'lr': 0.00029094724847675627, 'samples': 13104000, 'steps': 68249, 'loss/train': 0.8920380473136902} 11/07/2021 06:49:52 - INFO - __main__ - Step 68251: {'lr': 0.0002909420133886675, 'samples': 13104192, 'steps': 68250, 'loss/train': 1.3221021890640259} 11/07/2021 06:49:52 - INFO - __main__ - Step 68252: {'lr': 0.0002909367782821309, 'samples': 13104384, 'steps': 68251, 'loss/train': 1.8148714303970337} 11/07/2021 06:49:53 - INFO - __main__ - Step 68253: {'lr': 0.00029093154315714884, 'samples': 13104576, 'steps': 68252, 'loss/train': 1.577372431755066} 11/07/2021 06:49:53 - INFO - __main__ - Step 68254: {'lr': 0.0002909263080137237, 'samples': 13104768, 'steps': 68253, 'loss/train': 1.7524852752685547} 11/07/2021 06:49:54 - INFO - __main__ - Step 68255: {'lr': 0.0002909210728518577, 'samples': 13104960, 'steps': 68254, 'loss/train': 1.5617554187774658} 11/07/2021 06:49:54 - INFO - __main__ - Step 68256: {'lr': 0.0002909158376715533, 'samples': 13105152, 'steps': 68255, 'loss/train': 1.4251855611801147} 11/07/2021 06:49:54 - INFO - __main__ - Step 68257: {'lr': 0.0002909106024728129, 'samples': 13105344, 'steps': 68256, 'loss/train': 0.6699118614196777} 11/07/2021 06:49:55 - INFO - __main__ - Step 68258: {'lr': 0.0002909053672556388, 'samples': 13105536, 'steps': 68257, 'loss/train': 0.5041144490242004} 11/07/2021 06:49:56 - INFO - __main__ - Step 68259: {'lr': 0.0002909001320200334, 'samples': 13105728, 'steps': 68258, 'loss/train': 1.8746187686920166} 11/07/2021 06:49:56 - INFO - __main__ - Step 68260: {'lr': 0.000290894896765999, 'samples': 13105920, 'steps': 68259, 'loss/train': 1.7287912368774414} 11/07/2021 06:49:56 - INFO - __main__ - Step 68261: {'lr': 0.00029088966149353807, 'samples': 13106112, 'steps': 68260, 'loss/train': 1.4757781028747559} 11/07/2021 06:49:57 - INFO - __main__ - Step 68262: {'lr': 0.0002908844262026528, 'samples': 13106304, 'steps': 68261, 'loss/train': 1.685481309890747} 11/07/2021 06:49:58 - INFO - __main__ - Step 68263: {'lr': 0.00029087919089334564, 'samples': 13106496, 'steps': 68262, 'loss/train': 0.7524610757827759} 11/07/2021 06:49:58 - INFO - __main__ - Step 68264: {'lr': 0.00029087395556561896, 'samples': 13106688, 'steps': 68263, 'loss/train': 1.3191889524459839} 11/07/2021 06:49:58 - INFO - __main__ - Step 68265: {'lr': 0.00029086872021947516, 'samples': 13106880, 'steps': 68264, 'loss/train': 1.4682047367095947} 11/07/2021 06:49:59 - INFO - __main__ - Step 68266: {'lr': 0.0002908634848549165, 'samples': 13107072, 'steps': 68265, 'loss/train': 0.9452065229415894} 11/07/2021 06:49:59 - INFO - __main__ - Step 68267: {'lr': 0.0002908582494719454, 'samples': 13107264, 'steps': 68266, 'loss/train': 0.5451973080635071} 11/07/2021 06:50:00 - INFO - __main__ - Step 68268: {'lr': 0.0002908530140705642, 'samples': 13107456, 'steps': 68267, 'loss/train': 0.8184689879417419} 11/07/2021 06:50:01 - INFO - __main__ - Step 68269: {'lr': 0.0002908477786507752, 'samples': 13107648, 'steps': 68268, 'loss/train': 1.5661739110946655} 11/07/2021 06:50:01 - INFO - __main__ - Step 68270: {'lr': 0.00029084254321258085, 'samples': 13107840, 'steps': 68269, 'loss/train': 1.0677062273025513} 11/07/2021 06:50:01 - INFO - __main__ - Step 68271: {'lr': 0.0002908373077559836, 'samples': 13108032, 'steps': 68270, 'loss/train': 1.6203348636627197} 11/07/2021 06:50:02 - INFO - __main__ - Step 68272: {'lr': 0.00029083207228098554, 'samples': 13108224, 'steps': 68271, 'loss/train': 1.976218342781067} 11/07/2021 06:50:03 - INFO - __main__ - Step 68273: {'lr': 0.0002908268367875892, 'samples': 13108416, 'steps': 68272, 'loss/train': 0.9720513224601746} 11/07/2021 06:50:03 - INFO - __main__ - Step 68274: {'lr': 0.000290821601275797, 'samples': 13108608, 'steps': 68273, 'loss/train': 0.3213220238685608} 11/07/2021 06:50:03 - INFO - __main__ - Step 68275: {'lr': 0.00029081636574561115, 'samples': 13108800, 'steps': 68274, 'loss/train': 1.5000417232513428} 11/07/2021 06:50:04 - INFO - __main__ - Step 68276: {'lr': 0.00029081113019703407, 'samples': 13108992, 'steps': 68275, 'loss/train': 1.1907516717910767} 11/07/2021 06:50:04 - INFO - __main__ - Step 68277: {'lr': 0.0002908058946300681, 'samples': 13109184, 'steps': 68276, 'loss/train': 1.4488155841827393} 11/07/2021 06:50:04 - INFO - __main__ - Step 68278: {'lr': 0.0002908006590447157, 'samples': 13109376, 'steps': 68277, 'loss/train': 0.12669456005096436} 11/07/2021 06:50:06 - INFO - __main__ - Step 68279: {'lr': 0.00029079542344097916, 'samples': 13109568, 'steps': 68278, 'loss/train': 1.8278450965881348} 11/07/2021 06:50:06 - INFO - __main__ - Step 68280: {'lr': 0.0002907901878188608, 'samples': 13109760, 'steps': 68279, 'loss/train': 1.575079321861267} 11/07/2021 06:50:06 - INFO - __main__ - Step 68281: {'lr': 0.000290784952178363, 'samples': 13109952, 'steps': 68280, 'loss/train': 1.4167966842651367} 11/07/2021 06:50:07 - INFO - __main__ - Step 68282: {'lr': 0.0002907797165194881, 'samples': 13110144, 'steps': 68281, 'loss/train': 1.2735719680786133} 11/07/2021 06:50:07 - INFO - __main__ - Step 68283: {'lr': 0.0002907744808422386, 'samples': 13110336, 'steps': 68282, 'loss/train': 1.2664971351623535} 11/07/2021 06:50:08 - INFO - __main__ - Step 68284: {'lr': 0.0002907692451466166, 'samples': 13110528, 'steps': 68283, 'loss/train': 1.2821389436721802} 11/07/2021 06:50:08 - INFO - __main__ - Step 68285: {'lr': 0.00029076400943262465, 'samples': 13110720, 'steps': 68284, 'loss/train': 1.20022714138031} 11/07/2021 06:50:09 - INFO - __main__ - Step 68286: {'lr': 0.00029075877370026516, 'samples': 13110912, 'steps': 68285, 'loss/train': 1.119512677192688} 11/07/2021 06:50:09 - INFO - __main__ - Step 68287: {'lr': 0.00029075353794954037, 'samples': 13111104, 'steps': 68286, 'loss/train': 1.4529776573181152} 11/07/2021 06:50:10 - INFO - __main__ - Step 68288: {'lr': 0.00029074830218045255, 'samples': 13111296, 'steps': 68287, 'loss/train': 1.1496108770370483} 11/07/2021 06:50:11 - INFO - __main__ - Step 68289: {'lr': 0.00029074306639300426, 'samples': 13111488, 'steps': 68288, 'loss/train': 1.039878487586975} 11/07/2021 06:50:11 - INFO - __main__ - Step 68290: {'lr': 0.00029073783058719777, 'samples': 13111680, 'steps': 68289, 'loss/train': 1.6210541725158691} 11/07/2021 06:50:11 - INFO - __main__ - Step 68291: {'lr': 0.00029073259476303546, 'samples': 13111872, 'steps': 68290, 'loss/train': 3.119511604309082} 11/07/2021 06:50:12 - INFO - __main__ - Step 68292: {'lr': 0.00029072735892051967, 'samples': 13112064, 'steps': 68291, 'loss/train': 0.6537272334098816} 11/07/2021 06:50:12 - INFO - __main__ - Step 68293: {'lr': 0.0002907221230596527, 'samples': 13112256, 'steps': 68292, 'loss/train': 1.445918321609497} 11/07/2021 06:50:13 - INFO - __main__ - Step 68294: {'lr': 0.00029071688718043697, 'samples': 13112448, 'steps': 68293, 'loss/train': 0.9763742089271545} 11/07/2021 06:50:13 - INFO - __main__ - Step 68295: {'lr': 0.00029071165128287494, 'samples': 13112640, 'steps': 68294, 'loss/train': 0.30311155319213867} 11/07/2021 06:50:14 - INFO - __main__ - Step 68296: {'lr': 0.00029070641536696874, 'samples': 13112832, 'steps': 68295, 'loss/train': 1.030411720275879} 11/07/2021 06:50:14 - INFO - __main__ - Step 68297: {'lr': 0.00029070117943272094, 'samples': 13113024, 'steps': 68296, 'loss/train': 1.655228853225708} 11/07/2021 06:50:14 - INFO - __main__ - Step 68298: {'lr': 0.00029069594348013386, 'samples': 13113216, 'steps': 68297, 'loss/train': 0.5943419933319092} 11/07/2021 06:50:16 - INFO - __main__ - Step 68299: {'lr': 0.00029069070750920966, 'samples': 13113408, 'steps': 68298, 'loss/train': 2.203680992126465} 11/07/2021 06:50:16 - INFO - __main__ - Step 68300: {'lr': 0.000290685471519951, 'samples': 13113600, 'steps': 68299, 'loss/train': 1.6403111219406128} 11/07/2021 06:50:17 - INFO - __main__ - Step 68301: {'lr': 0.00029068023551236, 'samples': 13113792, 'steps': 68300, 'loss/train': 1.4567750692367554} 11/07/2021 06:50:17 - INFO - __main__ - Step 68302: {'lr': 0.00029067499948643924, 'samples': 13113984, 'steps': 68301, 'loss/train': 1.8289284706115723} 11/07/2021 06:50:17 - INFO - __main__ - Step 68303: {'lr': 0.00029066976344219083, 'samples': 13114176, 'steps': 68302, 'loss/train': 2.0489280223846436} 11/07/2021 06:50:18 - INFO - __main__ - Step 68304: {'lr': 0.0002906645273796173, 'samples': 13114368, 'steps': 68303, 'loss/train': 2.4383907318115234} 11/07/2021 06:50:19 - INFO - __main__ - Step 68305: {'lr': 0.00029065929129872095, 'samples': 13114560, 'steps': 68304, 'loss/train': 1.4501837491989136} 11/07/2021 06:50:19 - INFO - __main__ - Step 68306: {'lr': 0.0002906540551995041, 'samples': 13114752, 'steps': 68305, 'loss/train': 1.4676204919815063} 11/07/2021 06:50:20 - INFO - __main__ - Step 68307: {'lr': 0.0002906488190819692, 'samples': 13114944, 'steps': 68306, 'loss/train': 1.5233500003814697} 11/07/2021 06:50:20 - INFO - __main__ - Step 68308: {'lr': 0.00029064358294611867, 'samples': 13115136, 'steps': 68307, 'loss/train': 1.571499228477478} 11/07/2021 06:50:20 - INFO - __main__ - Step 68309: {'lr': 0.00029063834679195465, 'samples': 13115328, 'steps': 68308, 'loss/train': 2.193169116973877} 11/07/2021 06:50:22 - INFO - __main__ - Step 68310: {'lr': 0.00029063311061947966, 'samples': 13115520, 'steps': 68309, 'loss/train': 1.5610278844833374} 11/07/2021 06:50:22 - INFO - __main__ - Step 68311: {'lr': 0.00029062787442869596, 'samples': 13115712, 'steps': 68310, 'loss/train': 1.41033935546875} 11/07/2021 06:50:22 - INFO - __main__ - Step 68312: {'lr': 0.00029062263821960605, 'samples': 13115904, 'steps': 68311, 'loss/train': 1.6679213047027588} 11/07/2021 06:50:23 - INFO - __main__ - Step 68313: {'lr': 0.00029061740199221215, 'samples': 13116096, 'steps': 68312, 'loss/train': 1.398289680480957} 11/07/2021 06:50:23 - INFO - __main__ - Step 68314: {'lr': 0.0002906121657465167, 'samples': 13116288, 'steps': 68313, 'loss/train': 1.5835660696029663} 11/07/2021 06:50:24 - INFO - __main__ - Step 68315: {'lr': 0.00029060692948252204, 'samples': 13116480, 'steps': 68314, 'loss/train': 1.278542160987854} 11/07/2021 06:50:24 - INFO - __main__ - Step 68316: {'lr': 0.0002906016932002305, 'samples': 13116672, 'steps': 68315, 'loss/train': 1.471867322921753} 11/07/2021 06:50:25 - INFO - __main__ - Step 68317: {'lr': 0.0002905964568996445, 'samples': 13116864, 'steps': 68316, 'loss/train': 2.3276844024658203} 11/07/2021 06:50:25 - INFO - __main__ - Step 68318: {'lr': 0.0002905912205807663, 'samples': 13117056, 'steps': 68317, 'loss/train': 1.6781389713287354} 11/07/2021 06:50:25 - INFO - __main__ - Step 68319: {'lr': 0.0002905859842435984, 'samples': 13117248, 'steps': 68318, 'loss/train': 1.7858524322509766} 11/07/2021 06:50:26 - INFO - __main__ - Step 68320: {'lr': 0.00029058074788814304, 'samples': 13117440, 'steps': 68319, 'loss/train': 1.37648606300354} 11/07/2021 06:50:27 - INFO - __main__ - Step 68321: {'lr': 0.00029057551151440267, 'samples': 13117632, 'steps': 68320, 'loss/train': 1.2051650285720825} 11/07/2021 06:50:27 - INFO - __main__ - Step 68322: {'lr': 0.00029057027512237955, 'samples': 13117824, 'steps': 68321, 'loss/train': 1.565006136894226} 11/07/2021 06:50:28 - INFO - __main__ - Step 68323: {'lr': 0.0002905650387120761, 'samples': 13118016, 'steps': 68322, 'loss/train': 1.0542129278182983} 11/07/2021 06:50:28 - INFO - __main__ - Step 68324: {'lr': 0.0002905598022834946, 'samples': 13118208, 'steps': 68323, 'loss/train': 1.6936713457107544} 11/07/2021 06:50:28 - INFO - __main__ - Step 68325: {'lr': 0.0002905545658366375, 'samples': 13118400, 'steps': 68324, 'loss/train': 1.2891348600387573} 11/07/2021 06:50:29 - INFO - __main__ - Step 68326: {'lr': 0.00029054932937150725, 'samples': 13118592, 'steps': 68325, 'loss/train': 1.4937114715576172} 11/07/2021 06:50:30 - INFO - __main__ - Step 68327: {'lr': 0.000290544092888106, 'samples': 13118784, 'steps': 68326, 'loss/train': 1.4305936098098755} 11/07/2021 06:50:30 - INFO - __main__ - Step 68328: {'lr': 0.0002905388563864363, 'samples': 13118976, 'steps': 68327, 'loss/train': 1.2263129949569702} 11/07/2021 06:50:30 - INFO - __main__ - Step 68329: {'lr': 0.00029053361986650035, 'samples': 13119168, 'steps': 68328, 'loss/train': 1.4837483167648315} 11/07/2021 06:50:31 - INFO - __main__ - Step 68330: {'lr': 0.00029052838332830055, 'samples': 13119360, 'steps': 68329, 'loss/train': 1.1079318523406982} 11/07/2021 06:50:31 - INFO - __main__ - Step 68331: {'lr': 0.0002905231467718393, 'samples': 13119552, 'steps': 68330, 'loss/train': 1.6463984251022339} 11/07/2021 06:50:32 - INFO - __main__ - Step 68332: {'lr': 0.00029051791019711897, 'samples': 13119744, 'steps': 68331, 'loss/train': 1.4312185049057007} 11/07/2021 06:50:32 - INFO - __main__ - Step 68333: {'lr': 0.00029051267360414185, 'samples': 13119936, 'steps': 68332, 'loss/train': 0.6980203986167908} 11/07/2021 06:50:33 - INFO - __main__ - Step 68334: {'lr': 0.00029050743699291035, 'samples': 13120128, 'steps': 68333, 'loss/train': 1.5538336038589478} 11/07/2021 06:50:33 - INFO - __main__ - Step 68335: {'lr': 0.00029050220036342696, 'samples': 13120320, 'steps': 68334, 'loss/train': 1.7769526243209839} 11/07/2021 06:50:33 - INFO - __main__ - Step 68336: {'lr': 0.0002904969637156938, 'samples': 13120512, 'steps': 68335, 'loss/train': 1.7576878070831299} 11/07/2021 06:50:34 - INFO - __main__ - Step 68337: {'lr': 0.00029049172704971333, 'samples': 13120704, 'steps': 68336, 'loss/train': 0.9747135043144226} 11/07/2021 06:50:35 - INFO - __main__ - Step 68338: {'lr': 0.0002904864903654879, 'samples': 13120896, 'steps': 68337, 'loss/train': 1.4783376455307007} 11/07/2021 06:50:35 - INFO - __main__ - Step 68339: {'lr': 0.0002904812536630199, 'samples': 13121088, 'steps': 68338, 'loss/train': 1.670639157295227} 11/07/2021 06:50:35 - INFO - __main__ - Step 68340: {'lr': 0.0002904760169423116, 'samples': 13121280, 'steps': 68339, 'loss/train': 1.450356125831604} 11/07/2021 06:50:36 - INFO - __main__ - Step 68341: {'lr': 0.0002904707802033656, 'samples': 13121472, 'steps': 68340, 'loss/train': 1.7658379077911377} 11/07/2021 06:50:37 - INFO - __main__ - Step 68342: {'lr': 0.000290465543446184, 'samples': 13121664, 'steps': 68341, 'loss/train': 1.0881530046463013} 11/07/2021 06:50:37 - INFO - __main__ - Step 68343: {'lr': 0.00029046030667076916, 'samples': 13121856, 'steps': 68342, 'loss/train': 1.4643042087554932} 11/07/2021 06:50:38 - INFO - __main__ - Step 68344: {'lr': 0.0002904550698771237, 'samples': 13122048, 'steps': 68343, 'loss/train': 1.763522744178772} 11/07/2021 06:50:38 - INFO - __main__ - Step 68345: {'lr': 0.0002904498330652496, 'samples': 13122240, 'steps': 68344, 'loss/train': 1.3697962760925293} 11/07/2021 06:50:38 - INFO - __main__ - Step 68346: {'lr': 0.0002904445962351496, 'samples': 13122432, 'steps': 68345, 'loss/train': 2.1640138626098633} 11/07/2021 06:50:39 - INFO - __main__ - Step 68347: {'lr': 0.00029043935938682583, 'samples': 13122624, 'steps': 68346, 'loss/train': 1.751818060874939} 11/07/2021 06:50:40 - INFO - __main__ - Step 68348: {'lr': 0.00029043412252028076, 'samples': 13122816, 'steps': 68347, 'loss/train': 1.6343302726745605} 11/07/2021 06:50:40 - INFO - __main__ - Step 68349: {'lr': 0.00029042888563551666, 'samples': 13123008, 'steps': 68348, 'loss/train': 1.7377183437347412} 11/07/2021 06:50:40 - INFO - __main__ - Step 68350: {'lr': 0.0002904236487325359, 'samples': 13123200, 'steps': 68349, 'loss/train': 1.4015158414840698} 11/07/2021 06:50:41 - INFO - __main__ - Step 68351: {'lr': 0.00029041841181134086, 'samples': 13123392, 'steps': 68350, 'loss/train': 1.3306204080581665} 11/07/2021 06:50:41 - INFO - __main__ - Step 68352: {'lr': 0.000290413174871934, 'samples': 13123584, 'steps': 68351, 'loss/train': 1.394282579421997} 11/07/2021 06:50:42 - INFO - __main__ - Step 68353: {'lr': 0.00029040793791431746, 'samples': 13123776, 'steps': 68352, 'loss/train': 1.2977263927459717} 11/07/2021 06:50:42 - INFO - __main__ - Step 68354: {'lr': 0.0002904027009384938, 'samples': 13123968, 'steps': 68353, 'loss/train': 1.3106837272644043} 11/07/2021 06:50:43 - INFO - __main__ - Step 68355: {'lr': 0.0002903974639444654, 'samples': 13124160, 'steps': 68354, 'loss/train': 1.2380667924880981} 11/07/2021 06:50:43 - INFO - __main__ - Step 68356: {'lr': 0.0002903922269322344, 'samples': 13124352, 'steps': 68355, 'loss/train': 1.912227749824524} 11/07/2021 06:50:44 - INFO - __main__ - Step 68357: {'lr': 0.0002903869899018033, 'samples': 13124544, 'steps': 68356, 'loss/train': 1.3231970071792603} 11/07/2021 06:50:45 - INFO - __main__ - Step 68358: {'lr': 0.0002903817528531744, 'samples': 13124736, 'steps': 68357, 'loss/train': 1.710659384727478} 11/07/2021 06:50:45 - INFO - __main__ - Step 68359: {'lr': 0.00029037651578635017, 'samples': 13124928, 'steps': 68358, 'loss/train': 0.9256336092948914} 11/07/2021 06:50:45 - INFO - __main__ - Step 68360: {'lr': 0.0002903712787013329, 'samples': 13125120, 'steps': 68359, 'loss/train': 1.4941877126693726} 11/07/2021 06:50:46 - INFO - __main__ - Step 68361: {'lr': 0.000290366041598125, 'samples': 13125312, 'steps': 68360, 'loss/train': 1.0233972072601318} 11/07/2021 06:50:46 - INFO - __main__ - Step 68362: {'lr': 0.00029036080447672875, 'samples': 13125504, 'steps': 68361, 'loss/train': 1.6296031475067139} 11/07/2021 06:50:47 - INFO - __main__ - Step 68363: {'lr': 0.0002903555673371465, 'samples': 13125696, 'steps': 68362, 'loss/train': 1.4717885255813599} 11/07/2021 06:50:47 - INFO - __main__ - Step 68364: {'lr': 0.00029035033017938067, 'samples': 13125888, 'steps': 68363, 'loss/train': 1.1890318393707275} 11/07/2021 06:50:48 - INFO - __main__ - Step 68365: {'lr': 0.0002903450930034336, 'samples': 13126080, 'steps': 68364, 'loss/train': 0.9339287281036377} 11/07/2021 06:50:48 - INFO - __main__ - Step 68366: {'lr': 0.00029033985580930767, 'samples': 13126272, 'steps': 68365, 'loss/train': 1.743006706237793} 11/07/2021 06:50:48 - INFO - __main__ - Step 68367: {'lr': 0.0002903346185970052, 'samples': 13126464, 'steps': 68366, 'loss/train': 1.0003883838653564} 11/07/2021 06:50:49 - INFO - __main__ - Step 68368: {'lr': 0.0002903293813665287, 'samples': 13126656, 'steps': 68367, 'loss/train': 0.9687442779541016} 11/07/2021 06:50:50 - INFO - __main__ - Step 68369: {'lr': 0.0002903241441178803, 'samples': 13126848, 'steps': 68368, 'loss/train': 0.9306632876396179} 11/07/2021 06:50:50 - INFO - __main__ - Step 68370: {'lr': 0.0002903189068510624, 'samples': 13127040, 'steps': 68369, 'loss/train': 0.8953437805175781} 11/07/2021 06:50:50 - INFO - __main__ - Step 68371: {'lr': 0.00029031366956607755, 'samples': 13127232, 'steps': 68370, 'loss/train': 1.1945736408233643} 11/07/2021 06:50:51 - INFO - __main__ - Step 68372: {'lr': 0.00029030843226292784, 'samples': 13127424, 'steps': 68371, 'loss/train': 1.506036639213562} 11/07/2021 06:50:51 - INFO - __main__ - Step 68373: {'lr': 0.0002903031949416159, 'samples': 13127616, 'steps': 68372, 'loss/train': 0.8467341065406799} 11/07/2021 06:50:52 - INFO - __main__ - Step 68374: {'lr': 0.0002902979576021439, 'samples': 13127808, 'steps': 68373, 'loss/train': 0.9681167006492615} 11/07/2021 06:50:53 - INFO - __main__ - Step 68375: {'lr': 0.0002902927202445143, 'samples': 13128000, 'steps': 68374, 'loss/train': 1.6655389070510864} 11/07/2021 06:50:53 - INFO - __main__ - Step 68376: {'lr': 0.0002902874828687294, 'samples': 13128192, 'steps': 68375, 'loss/train': 1.5621092319488525} 11/07/2021 06:50:53 - INFO - __main__ - Step 68377: {'lr': 0.0002902822454747916, 'samples': 13128384, 'steps': 68376, 'loss/train': 1.5150530338287354} 11/07/2021 06:50:54 - INFO - __main__ - Step 68378: {'lr': 0.0002902770080627032, 'samples': 13128576, 'steps': 68377, 'loss/train': 1.5701018571853638} 11/07/2021 06:50:55 - INFO - __main__ - Step 68379: {'lr': 0.0002902717706324666, 'samples': 13128768, 'steps': 68378, 'loss/train': 0.5546554327011108} 11/07/2021 06:50:55 - INFO - __main__ - Step 68380: {'lr': 0.0002902665331840842, 'samples': 13128960, 'steps': 68379, 'loss/train': 1.5163359642028809} 11/07/2021 06:50:55 - INFO - __main__ - Step 68381: {'lr': 0.0002902612957175583, 'samples': 13129152, 'steps': 68380, 'loss/train': 1.2377536296844482} 11/07/2021 06:50:56 - INFO - __main__ - Step 68382: {'lr': 0.0002902560582328913, 'samples': 13129344, 'steps': 68381, 'loss/train': 1.6811840534210205} 11/07/2021 06:50:56 - INFO - __main__ - Step 68383: {'lr': 0.0002902508207300856, 'samples': 13129536, 'steps': 68382, 'loss/train': 1.182804822921753} 11/07/2021 06:50:57 - INFO - __main__ - Step 68384: {'lr': 0.00029024558320914337, 'samples': 13129728, 'steps': 68383, 'loss/train': 0.5662233829498291} 11/07/2021 06:50:58 - INFO - __main__ - Step 68385: {'lr': 0.0002902403456700672, 'samples': 13129920, 'steps': 68384, 'loss/train': 1.444581389427185} 11/07/2021 06:50:58 - INFO - __main__ - Step 68386: {'lr': 0.00029023510811285923, 'samples': 13130112, 'steps': 68385, 'loss/train': 1.122300148010254} 11/07/2021 06:50:58 - INFO - __main__ - Step 68387: {'lr': 0.00029022987053752204, 'samples': 13130304, 'steps': 68386, 'loss/train': 1.371978998184204} 11/07/2021 06:50:59 - INFO - __main__ - Step 68388: {'lr': 0.00029022463294405796, 'samples': 13130496, 'steps': 68387, 'loss/train': 1.3577163219451904} 11/07/2021 06:51:00 - INFO - __main__ - Step 68389: {'lr': 0.00029021939533246916, 'samples': 13130688, 'steps': 68388, 'loss/train': 0.6848203539848328} 11/07/2021 06:51:00 - INFO - __main__ - Step 68390: {'lr': 0.00029021415770275814, 'samples': 13130880, 'steps': 68389, 'loss/train': 1.7163819074630737} 11/07/2021 06:51:00 - INFO - __main__ - Step 68391: {'lr': 0.0002902089200549273, 'samples': 13131072, 'steps': 68390, 'loss/train': 1.4812957048416138} 11/07/2021 06:51:01 - INFO - __main__ - Step 68392: {'lr': 0.0002902036823889789, 'samples': 13131264, 'steps': 68391, 'loss/train': 1.113471508026123} 11/07/2021 06:51:01 - INFO - __main__ - Step 68393: {'lr': 0.0002901984447049153, 'samples': 13131456, 'steps': 68392, 'loss/train': 1.0378623008728027} 11/07/2021 06:51:02 - INFO - __main__ - Step 68394: {'lr': 0.00029019320700273896, 'samples': 13131648, 'steps': 68393, 'loss/train': 1.7288631200790405} 11/07/2021 06:51:03 - INFO - __main__ - Step 68395: {'lr': 0.00029018796928245217, 'samples': 13131840, 'steps': 68394, 'loss/train': 1.0903762578964233} 11/07/2021 06:51:03 - INFO - __main__ - Step 68396: {'lr': 0.00029018273154405726, 'samples': 13132032, 'steps': 68395, 'loss/train': 1.3769478797912598} 11/07/2021 06:51:03 - INFO - __main__ - Step 68397: {'lr': 0.0002901774937875567, 'samples': 13132224, 'steps': 68396, 'loss/train': 1.2131785154342651} 11/07/2021 06:51:04 - INFO - __main__ - Step 68398: {'lr': 0.0002901722560129527, 'samples': 13132416, 'steps': 68397, 'loss/train': 1.8367029428482056} 11/07/2021 06:51:04 - INFO - __main__ - Step 68399: {'lr': 0.00029016701822024777, 'samples': 13132608, 'steps': 68398, 'loss/train': 1.1347662210464478} 11/07/2021 06:51:05 - INFO - __main__ - Step 68400: {'lr': 0.0002901617804094442, 'samples': 13132800, 'steps': 68399, 'loss/train': 1.4996942281723022} 11/07/2021 06:51:06 - INFO - __main__ - Step 68401: {'lr': 0.0002901565425805443, 'samples': 13132992, 'steps': 68400, 'loss/train': 0.9973898530006409} 11/07/2021 06:51:06 - INFO - __main__ - Step 68402: {'lr': 0.00029015130473355056, 'samples': 13133184, 'steps': 68401, 'loss/train': 1.0577818155288696} 11/07/2021 06:51:06 - INFO - __main__ - Step 68403: {'lr': 0.0002901460668684652, 'samples': 13133376, 'steps': 68402, 'loss/train': 1.4116963148117065} 11/07/2021 06:51:07 - INFO - __main__ - Step 68404: {'lr': 0.00029014082898529066, 'samples': 13133568, 'steps': 68403, 'loss/train': 2.027585506439209} 11/07/2021 06:51:08 - INFO - __main__ - Step 68405: {'lr': 0.0002901355910840293, 'samples': 13133760, 'steps': 68404, 'loss/train': 1.3669211864471436} 11/07/2021 06:51:08 - INFO - __main__ - Step 68406: {'lr': 0.0002901303531646834, 'samples': 13133952, 'steps': 68405, 'loss/train': 1.7643327713012695} 11/07/2021 06:51:08 - INFO - __main__ - Step 68407: {'lr': 0.00029012511522725544, 'samples': 13134144, 'steps': 68406, 'loss/train': 1.737592339515686} 11/07/2021 06:51:09 - INFO - __main__ - Step 68408: {'lr': 0.00029011987727174774, 'samples': 13134336, 'steps': 68407, 'loss/train': 1.3749315738677979} 11/07/2021 06:51:09 - INFO - __main__ - Step 68409: {'lr': 0.0002901146392981626, 'samples': 13134528, 'steps': 68408, 'loss/train': 1.4257880449295044} 11/07/2021 06:51:10 - INFO - __main__ - Step 68410: {'lr': 0.00029010940130650244, 'samples': 13134720, 'steps': 68409, 'loss/train': 1.7244784832000732} 11/07/2021 06:51:10 - INFO - __main__ - Step 68411: {'lr': 0.00029010416329676957, 'samples': 13134912, 'steps': 68410, 'loss/train': 1.8633055686950684} 11/07/2021 06:51:11 - INFO - __main__ - Step 68412: {'lr': 0.0002900989252689664, 'samples': 13135104, 'steps': 68411, 'loss/train': 1.1055731773376465} 11/07/2021 06:51:11 - INFO - __main__ - Step 68413: {'lr': 0.0002900936872230953, 'samples': 13135296, 'steps': 68412, 'loss/train': 0.7651512026786804} 11/07/2021 06:51:11 - INFO - __main__ - Step 68414: {'lr': 0.0002900884491591586, 'samples': 13135488, 'steps': 68413, 'loss/train': 1.7121444940567017} 11/07/2021 06:51:13 - INFO - __main__ - Step 68415: {'lr': 0.00029008321107715863, 'samples': 13135680, 'steps': 68414, 'loss/train': 1.412271499633789} 11/07/2021 06:51:13 - INFO - __main__ - Step 68416: {'lr': 0.00029007797297709784, 'samples': 13135872, 'steps': 68415, 'loss/train': 1.2432653903961182} 11/07/2021 06:51:13 - INFO - __main__ - Step 68417: {'lr': 0.00029007273485897846, 'samples': 13136064, 'steps': 68416, 'loss/train': 1.004776120185852} 11/07/2021 06:51:14 - INFO - __main__ - Step 68418: {'lr': 0.0002900674967228029, 'samples': 13136256, 'steps': 68417, 'loss/train': 1.4823297262191772} 11/07/2021 06:51:14 - INFO - __main__ - Step 68419: {'lr': 0.0002900622585685736, 'samples': 13136448, 'steps': 68418, 'loss/train': 1.1333390474319458} 11/07/2021 06:51:14 - INFO - __main__ - Step 68420: {'lr': 0.0002900570203962929, 'samples': 13136640, 'steps': 68419, 'loss/train': 1.6300430297851562} 11/07/2021 06:51:15 - INFO - __main__ - Step 68421: {'lr': 0.00029005178220596313, 'samples': 13136832, 'steps': 68420, 'loss/train': 1.6409834623336792} 11/07/2021 06:51:16 - INFO - __main__ - Step 68422: {'lr': 0.0002900465439975866, 'samples': 13137024, 'steps': 68421, 'loss/train': 0.9311343431472778} 11/07/2021 06:51:16 - INFO - __main__ - Step 68423: {'lr': 0.0002900413057711657, 'samples': 13137216, 'steps': 68422, 'loss/train': 1.6925649642944336} 11/07/2021 06:51:16 - INFO - __main__ - Step 68424: {'lr': 0.0002900360675267028, 'samples': 13137408, 'steps': 68423, 'loss/train': 1.2509633302688599} 11/07/2021 06:51:17 - INFO - __main__ - Step 68425: {'lr': 0.0002900308292642003, 'samples': 13137600, 'steps': 68424, 'loss/train': 1.3501322269439697} 11/07/2021 06:51:18 - INFO - __main__ - Step 68426: {'lr': 0.00029002559098366057, 'samples': 13137792, 'steps': 68425, 'loss/train': 1.6997222900390625} 11/07/2021 06:51:18 - INFO - __main__ - Step 68427: {'lr': 0.0002900203526850859, 'samples': 13137984, 'steps': 68426, 'loss/train': 1.0984742641448975} 11/07/2021 06:51:18 - INFO - __main__ - Step 68428: {'lr': 0.00029001511436847863, 'samples': 13138176, 'steps': 68427, 'loss/train': 1.1270266771316528} 11/07/2021 06:51:19 - INFO - __main__ - Step 68429: {'lr': 0.00029000987603384115, 'samples': 13138368, 'steps': 68428, 'loss/train': 1.7692044973373413} 11/07/2021 06:51:19 - INFO - __main__ - Step 68430: {'lr': 0.0002900046376811759, 'samples': 13138560, 'steps': 68429, 'loss/train': 1.3577048778533936} 11/07/2021 06:51:20 - INFO - __main__ - Step 68431: {'lr': 0.0002899993993104852, 'samples': 13138752, 'steps': 68430, 'loss/train': 1.3094487190246582} 11/07/2021 06:51:21 - INFO - __main__ - Step 68432: {'lr': 0.0002899941609217713, 'samples': 13138944, 'steps': 68431, 'loss/train': 1.3037089109420776} 11/07/2021 06:51:21 - INFO - __main__ - Step 68433: {'lr': 0.0002899889225150367, 'samples': 13139136, 'steps': 68432, 'loss/train': 1.5487947463989258} 11/07/2021 06:51:21 - INFO - __main__ - Step 68434: {'lr': 0.0002899836840902837, 'samples': 13139328, 'steps': 68433, 'loss/train': 1.5582205057144165} 11/07/2021 06:51:22 - INFO - __main__ - Step 68435: {'lr': 0.00028997844564751464, 'samples': 13139520, 'steps': 68434, 'loss/train': 0.9678363800048828} 11/07/2021 06:51:23 - INFO - __main__ - Step 68436: {'lr': 0.0002899732071867319, 'samples': 13139712, 'steps': 68435, 'loss/train': 1.4843837022781372} 11/07/2021 06:51:23 - INFO - __main__ - Step 68437: {'lr': 0.00028996796870793795, 'samples': 13139904, 'steps': 68436, 'loss/train': 1.2347352504730225} 11/07/2021 06:51:24 - INFO - __main__ - Step 68438: {'lr': 0.000289962730211135, 'samples': 13140096, 'steps': 68437, 'loss/train': 1.4181746244430542} 11/07/2021 06:51:24 - INFO - __main__ - Step 68439: {'lr': 0.00028995749169632545, 'samples': 13140288, 'steps': 68438, 'loss/train': 1.8149032592773438} 11/07/2021 06:51:24 - INFO - __main__ - Step 68440: {'lr': 0.00028995225316351164, 'samples': 13140480, 'steps': 68439, 'loss/train': 1.6549956798553467} 11/07/2021 06:51:25 - INFO - __main__ - Step 68441: {'lr': 0.00028994701461269596, 'samples': 13140672, 'steps': 68440, 'loss/train': 1.3404275178909302} 11/07/2021 06:51:26 - INFO - __main__ - Step 68442: {'lr': 0.00028994177604388084, 'samples': 13140864, 'steps': 68441, 'loss/train': 1.5823131799697876} 11/07/2021 06:51:26 - INFO - __main__ - Step 68443: {'lr': 0.00028993653745706857, 'samples': 13141056, 'steps': 68442, 'loss/train': 1.052336573600769} 11/07/2021 06:51:26 - INFO - __main__ - Step 68444: {'lr': 0.00028993129885226146, 'samples': 13141248, 'steps': 68443, 'loss/train': 0.8681807518005371} 11/07/2021 06:51:27 - INFO - __main__ - Step 68445: {'lr': 0.0002899260602294619, 'samples': 13141440, 'steps': 68444, 'loss/train': 1.511012315750122} 11/07/2021 06:51:28 - INFO - __main__ - Step 68446: {'lr': 0.00028992082158867236, 'samples': 13141632, 'steps': 68445, 'loss/train': 0.9733325242996216} 11/07/2021 06:51:28 - INFO - __main__ - Step 68447: {'lr': 0.000289915582929895, 'samples': 13141824, 'steps': 68446, 'loss/train': 1.382521152496338} 11/07/2021 06:51:28 - INFO - __main__ - Step 68448: {'lr': 0.00028991034425313234, 'samples': 13142016, 'steps': 68447, 'loss/train': 1.536354899406433} 11/07/2021 06:51:29 - INFO - __main__ - Step 68449: {'lr': 0.00028990510555838676, 'samples': 13142208, 'steps': 68448, 'loss/train': 1.7751996517181396} 11/07/2021 06:51:29 - INFO - __main__ - Step 68450: {'lr': 0.0002898998668456605, 'samples': 13142400, 'steps': 68449, 'loss/train': 1.2773467302322388} 11/07/2021 06:51:30 - INFO - __main__ - Step 68451: {'lr': 0.000289894628114956, 'samples': 13142592, 'steps': 68450, 'loss/train': 1.5260272026062012} 11/07/2021 06:51:31 - INFO - __main__ - Step 68452: {'lr': 0.0002898893893662756, 'samples': 13142784, 'steps': 68451, 'loss/train': 1.433614730834961} 11/07/2021 06:51:31 - INFO - __main__ - Step 68453: {'lr': 0.0002898841505996216, 'samples': 13142976, 'steps': 68452, 'loss/train': 1.9448894262313843} 11/07/2021 06:51:31 - INFO - __main__ - Step 68454: {'lr': 0.0002898789118149964, 'samples': 13143168, 'steps': 68453, 'loss/train': 1.4063912630081177} 11/07/2021 06:51:32 - INFO - __main__ - Step 68455: {'lr': 0.0002898736730124025, 'samples': 13143360, 'steps': 68454, 'loss/train': 1.4076200723648071} 11/07/2021 06:51:32 - INFO - __main__ - Step 68456: {'lr': 0.00028986843419184213, 'samples': 13143552, 'steps': 68455, 'loss/train': 1.2965714931488037} 11/07/2021 06:51:33 - INFO - __main__ - Step 68457: {'lr': 0.0002898631953533176, 'samples': 13143744, 'steps': 68456, 'loss/train': 1.403812289237976} 11/07/2021 06:51:33 - INFO - __main__ - Step 68458: {'lr': 0.00028985795649683126, 'samples': 13143936, 'steps': 68457, 'loss/train': 1.1866055727005005} 11/07/2021 06:51:34 - INFO - __main__ - Step 68459: {'lr': 0.0002898527176223856, 'samples': 13144128, 'steps': 68458, 'loss/train': 1.700981855392456} 11/07/2021 06:51:34 - INFO - __main__ - Step 68460: {'lr': 0.00028984747872998293, 'samples': 13144320, 'steps': 68459, 'loss/train': 1.2266649007797241} 11/07/2021 06:51:34 - INFO - __main__ - Step 68461: {'lr': 0.0002898422398196256, 'samples': 13144512, 'steps': 68460, 'loss/train': 1.0364898443222046} 11/07/2021 06:51:35 - INFO - __main__ - Step 68462: {'lr': 0.00028983700089131603, 'samples': 13144704, 'steps': 68461, 'loss/train': 1.1307865381240845} 11/07/2021 06:51:36 - INFO - __main__ - Step 68463: {'lr': 0.00028983176194505647, 'samples': 13144896, 'steps': 68462, 'loss/train': 1.1857506036758423} 11/07/2021 06:51:36 - INFO - __main__ - Step 68464: {'lr': 0.00028982652298084925, 'samples': 13145088, 'steps': 68463, 'loss/train': 1.8484725952148438} 11/07/2021 06:51:36 - INFO - __main__ - Step 68465: {'lr': 0.0002898212839986969, 'samples': 13145280, 'steps': 68464, 'loss/train': 0.5628888607025146} 11/07/2021 06:51:37 - INFO - __main__ - Step 68466: {'lr': 0.0002898160449986017, 'samples': 13145472, 'steps': 68465, 'loss/train': 1.366417646408081} 11/07/2021 06:51:38 - INFO - __main__ - Step 68467: {'lr': 0.00028981080598056597, 'samples': 13145664, 'steps': 68466, 'loss/train': 1.2942912578582764} 11/07/2021 06:51:38 - INFO - __main__ - Step 68468: {'lr': 0.00028980556694459215, 'samples': 13145856, 'steps': 68467, 'loss/train': 1.4696247577667236} 11/07/2021 06:51:39 - INFO - __main__ - Step 68469: {'lr': 0.00028980032789068254, 'samples': 13146048, 'steps': 68468, 'loss/train': 1.5575535297393799} 11/07/2021 06:51:39 - INFO - __main__ - Step 68470: {'lr': 0.00028979508881883946, 'samples': 13146240, 'steps': 68469, 'loss/train': 1.457610011100769} 11/07/2021 06:51:39 - INFO - __main__ - Step 68471: {'lr': 0.0002897898497290654, 'samples': 13146432, 'steps': 68470, 'loss/train': 1.7250171899795532} 11/07/2021 06:51:40 - INFO - __main__ - Step 68472: {'lr': 0.0002897846106213626, 'samples': 13146624, 'steps': 68471, 'loss/train': 1.5141150951385498} 11/07/2021 06:51:41 - INFO - __main__ - Step 68473: {'lr': 0.0002897793714957335, 'samples': 13146816, 'steps': 68472, 'loss/train': 1.5518889427185059} 11/07/2021 06:51:41 - INFO - __main__ - Step 68474: {'lr': 0.0002897741323521804, 'samples': 13147008, 'steps': 68473, 'loss/train': 1.9812042713165283} 11/07/2021 06:51:41 - INFO - __main__ - Step 68475: {'lr': 0.00028976889319070573, 'samples': 13147200, 'steps': 68474, 'loss/train': 1.3605916500091553} 11/07/2021 06:51:42 - INFO - __main__ - Step 68476: {'lr': 0.0002897636540113118, 'samples': 13147392, 'steps': 68475, 'loss/train': 1.3924505710601807} 11/07/2021 06:51:43 - INFO - __main__ - Step 68477: {'lr': 0.00028975841481400095, 'samples': 13147584, 'steps': 68476, 'loss/train': 1.3619931936264038} 11/07/2021 06:51:43 - INFO - __main__ - Step 68478: {'lr': 0.0002897531755987756, 'samples': 13147776, 'steps': 68477, 'loss/train': 1.5151163339614868} 11/07/2021 06:51:43 - INFO - __main__ - Step 68479: {'lr': 0.00028974793636563805, 'samples': 13147968, 'steps': 68478, 'loss/train': 1.0703946352005005} 11/07/2021 06:51:44 - INFO - __main__ - Step 68480: {'lr': 0.0002897426971145907, 'samples': 13148160, 'steps': 68479, 'loss/train': 1.8396652936935425} 11/07/2021 06:51:44 - INFO - __main__ - Step 68481: {'lr': 0.00028973745784563595, 'samples': 13148352, 'steps': 68480, 'loss/train': 1.443814992904663} 11/07/2021 06:51:45 - INFO - __main__ - Step 68482: {'lr': 0.00028973221855877607, 'samples': 13148544, 'steps': 68481, 'loss/train': 1.5705029964447021} 11/07/2021 06:51:45 - INFO - __main__ - Step 68483: {'lr': 0.0002897269792540135, 'samples': 13148736, 'steps': 68482, 'loss/train': 1.5697309970855713} 11/07/2021 06:51:46 - INFO - __main__ - Step 68484: {'lr': 0.0002897217399313505, 'samples': 13148928, 'steps': 68483, 'loss/train': 1.3257819414138794} 11/07/2021 06:51:46 - INFO - __main__ - Step 68485: {'lr': 0.00028971650059078955, 'samples': 13149120, 'steps': 68484, 'loss/train': 1.2374948263168335} 11/07/2021 06:51:47 - INFO - __main__ - Step 68486: {'lr': 0.00028971126123233297, 'samples': 13149312, 'steps': 68485, 'loss/train': 1.545937418937683} 11/07/2021 06:51:47 - INFO - __main__ - Step 68487: {'lr': 0.0002897060218559831, 'samples': 13149504, 'steps': 68486, 'loss/train': 1.3673526048660278} 11/07/2021 06:51:48 - INFO - __main__ - Step 68488: {'lr': 0.0002897007824617423, 'samples': 13149696, 'steps': 68487, 'loss/train': 1.2379512786865234} 11/07/2021 06:51:48 - INFO - __main__ - Step 68489: {'lr': 0.000289695543049613, 'samples': 13149888, 'steps': 68488, 'loss/train': 1.4833898544311523} 11/07/2021 06:51:49 - INFO - __main__ - Step 68490: {'lr': 0.0002896903036195974, 'samples': 13150080, 'steps': 68489, 'loss/train': 1.7637228965759277} 11/07/2021 06:51:49 - INFO - __main__ - Step 68491: {'lr': 0.000289685064171698, 'samples': 13150272, 'steps': 68490, 'loss/train': 1.3853679895401} 11/07/2021 06:51:49 - INFO - __main__ - Step 68492: {'lr': 0.00028967982470591715, 'samples': 13150464, 'steps': 68491, 'loss/train': 2.043489456176758} 11/07/2021 06:51:50 - INFO - __main__ - Step 68493: {'lr': 0.00028967458522225707, 'samples': 13150656, 'steps': 68492, 'loss/train': 1.1591657400131226} 11/07/2021 06:51:51 - INFO - __main__ - Step 68494: {'lr': 0.00028966934572072033, 'samples': 13150848, 'steps': 68493, 'loss/train': 1.529007911682129} 11/07/2021 06:51:51 - INFO - __main__ - Step 68495: {'lr': 0.0002896641062013092, 'samples': 13151040, 'steps': 68494, 'loss/train': 1.3931657075881958} 11/07/2021 06:51:51 - INFO - __main__ - Step 68496: {'lr': 0.00028965886666402606, 'samples': 13151232, 'steps': 68495, 'loss/train': 1.3691283464431763} 11/07/2021 06:51:52 - INFO - __main__ - Step 68497: {'lr': 0.0002896536271088732, 'samples': 13151424, 'steps': 68496, 'loss/train': 1.2257202863693237} 11/07/2021 06:51:53 - INFO - __main__ - Step 68498: {'lr': 0.00028964838753585306, 'samples': 13151616, 'steps': 68497, 'loss/train': 1.338841438293457} 11/07/2021 06:51:53 - INFO - __main__ - Step 68499: {'lr': 0.0002896431479449679, 'samples': 13151808, 'steps': 68498, 'loss/train': 1.8101718425750732} 11/07/2021 06:51:53 - INFO - __main__ - Step 68500: {'lr': 0.00028963790833622024, 'samples': 13152000, 'steps': 68499, 'loss/train': 0.7986083030700684} 11/07/2021 06:51:54 - INFO - __main__ - Step 68501: {'lr': 0.00028963266870961227, 'samples': 13152192, 'steps': 68500, 'loss/train': 1.033830165863037} 11/07/2021 06:51:54 - INFO - __main__ - Step 68502: {'lr': 0.00028962742906514646, 'samples': 13152384, 'steps': 68501, 'loss/train': 1.1887805461883545} 11/07/2021 06:51:55 - INFO - __main__ - Step 68503: {'lr': 0.0002896221894028252, 'samples': 13152576, 'steps': 68502, 'loss/train': 1.2000361680984497} 11/07/2021 06:51:55 - INFO - __main__ - Step 68504: {'lr': 0.00028961694972265076, 'samples': 13152768, 'steps': 68503, 'loss/train': 1.2702646255493164} 11/07/2021 06:51:56 - INFO - __main__ - Step 68505: {'lr': 0.0002896117100246254, 'samples': 13152960, 'steps': 68504, 'loss/train': 0.8816903829574585} 11/07/2021 06:51:56 - INFO - __main__ - Step 68506: {'lr': 0.0002896064703087518, 'samples': 13153152, 'steps': 68505, 'loss/train': 1.220678448677063} 11/07/2021 06:51:56 - INFO - __main__ - Step 68507: {'lr': 0.000289601230575032, 'samples': 13153344, 'steps': 68506, 'loss/train': 1.507650375366211} 11/07/2021 06:51:58 - INFO - __main__ - Step 68508: {'lr': 0.0002895959908234686, 'samples': 13153536, 'steps': 68507, 'loss/train': 1.428027868270874} 11/07/2021 06:51:58 - INFO - __main__ - Step 68509: {'lr': 0.00028959075105406383, 'samples': 13153728, 'steps': 68508, 'loss/train': 1.1192058324813843} 11/07/2021 06:51:58 - INFO - __main__ - Step 68510: {'lr': 0.0002895855112668201, 'samples': 13153920, 'steps': 68509, 'loss/train': 1.374442458152771} 11/07/2021 06:51:59 - INFO - __main__ - Step 68511: {'lr': 0.0002895802714617397, 'samples': 13154112, 'steps': 68510, 'loss/train': 1.5347217321395874} 11/07/2021 06:51:59 - INFO - __main__ - Step 68512: {'lr': 0.00028957503163882506, 'samples': 13154304, 'steps': 68511, 'loss/train': 1.6570401191711426} 11/07/2021 06:52:00 - INFO - __main__ - Step 68513: {'lr': 0.0002895697917980785, 'samples': 13154496, 'steps': 68512, 'loss/train': 1.3697264194488525} 11/07/2021 06:52:00 - INFO - __main__ - Step 68514: {'lr': 0.00028956455193950237, 'samples': 13154688, 'steps': 68513, 'loss/train': 1.184667706489563} 11/07/2021 06:52:01 - INFO - __main__ - Step 68515: {'lr': 0.00028955931206309915, 'samples': 13154880, 'steps': 68514, 'loss/train': 1.2657729387283325} 11/07/2021 06:52:01 - INFO - __main__ - Step 68516: {'lr': 0.0002895540721688711, 'samples': 13155072, 'steps': 68515, 'loss/train': 1.7466089725494385} 11/07/2021 06:52:01 - INFO - __main__ - Step 68517: {'lr': 0.0002895488322568206, 'samples': 13155264, 'steps': 68516, 'loss/train': 1.4141874313354492} 11/07/2021 06:52:02 - INFO - __main__ - Step 68518: {'lr': 0.00028954359232694993, 'samples': 13155456, 'steps': 68517, 'loss/train': 2.1865060329437256} 11/07/2021 06:52:03 - INFO - __main__ - Step 68519: {'lr': 0.00028953835237926156, 'samples': 13155648, 'steps': 68518, 'loss/train': 1.278867483139038} 11/07/2021 06:52:03 - INFO - __main__ - Step 68520: {'lr': 0.00028953311241375785, 'samples': 13155840, 'steps': 68519, 'loss/train': 1.1280168294906616} 11/07/2021 06:52:04 - INFO - __main__ - Step 68521: {'lr': 0.0002895278724304411, 'samples': 13156032, 'steps': 68520, 'loss/train': 1.5326554775238037} 11/07/2021 06:52:04 - INFO - __main__ - Step 68522: {'lr': 0.0002895226324293137, 'samples': 13156224, 'steps': 68521, 'loss/train': 1.0102843046188354} 11/07/2021 06:52:04 - INFO - __main__ - Step 68523: {'lr': 0.0002895173924103781, 'samples': 13156416, 'steps': 68522, 'loss/train': 1.2469130754470825} 11/07/2021 06:52:05 - INFO - __main__ - Step 68524: {'lr': 0.0002895121523736365, 'samples': 13156608, 'steps': 68523, 'loss/train': 1.4628396034240723} 11/07/2021 06:52:06 - INFO - __main__ - Step 68525: {'lr': 0.00028950691231909134, 'samples': 13156800, 'steps': 68524, 'loss/train': 1.1051979064941406} 11/07/2021 06:52:06 - INFO - __main__ - Step 68526: {'lr': 0.00028950167224674493, 'samples': 13156992, 'steps': 68525, 'loss/train': 1.4229503870010376} 11/07/2021 06:52:06 - INFO - __main__ - Step 68527: {'lr': 0.0002894964321565997, 'samples': 13157184, 'steps': 68526, 'loss/train': 1.1575002670288086} 11/07/2021 06:52:07 - INFO - __main__ - Step 68528: {'lr': 0.00028949119204865797, 'samples': 13157376, 'steps': 68527, 'loss/train': 1.2970683574676514} 11/07/2021 06:52:08 - INFO - __main__ - Step 68529: {'lr': 0.00028948595192292213, 'samples': 13157568, 'steps': 68528, 'loss/train': 1.2550990581512451} 11/07/2021 06:52:08 - INFO - __main__ - Step 68530: {'lr': 0.0002894807117793946, 'samples': 13157760, 'steps': 68529, 'loss/train': 1.3013359308242798} 11/07/2021 06:52:08 - INFO - __main__ - Step 68531: {'lr': 0.00028947547161807763, 'samples': 13157952, 'steps': 68530, 'loss/train': 0.8174132108688354} 11/07/2021 06:52:09 - INFO - __main__ - Step 68532: {'lr': 0.0002894702314389736, 'samples': 13158144, 'steps': 68531, 'loss/train': 1.7758294343948364} 11/07/2021 06:52:09 - INFO - __main__ - Step 68533: {'lr': 0.0002894649912420849, 'samples': 13158336, 'steps': 68532, 'loss/train': 1.383978247642517} 11/07/2021 06:52:11 - INFO - __main__ - Step 68534: {'lr': 0.0002894597510274139, 'samples': 13158528, 'steps': 68533, 'loss/train': 1.7964705228805542} 11/07/2021 06:52:11 - INFO - __main__ - Step 68535: {'lr': 0.00028945451079496294, 'samples': 13158720, 'steps': 68534, 'loss/train': 1.5951370000839233} 11/07/2021 06:52:11 - INFO - __main__ - Step 68536: {'lr': 0.0002894492705447344, 'samples': 13158912, 'steps': 68535, 'loss/train': 1.3448270559310913} 11/07/2021 06:52:12 - INFO - __main__ - Step 68537: {'lr': 0.0002894440302767306, 'samples': 13159104, 'steps': 68536, 'loss/train': 1.5190708637237549} 11/07/2021 06:52:12 - INFO - __main__ - Step 68538: {'lr': 0.000289438789990954, 'samples': 13159296, 'steps': 68537, 'loss/train': 2.0826213359832764} 11/07/2021 06:52:12 - INFO - __main__ - Step 68539: {'lr': 0.0002894335496874068, 'samples': 13159488, 'steps': 68538, 'loss/train': 1.4595119953155518} 11/07/2021 06:52:14 - INFO - __main__ - Step 68540: {'lr': 0.00028942830936609144, 'samples': 13159680, 'steps': 68539, 'loss/train': 1.6055976152420044} 11/07/2021 06:52:14 - INFO - __main__ - Step 68541: {'lr': 0.0002894230690270103, 'samples': 13159872, 'steps': 68540, 'loss/train': 0.10222981870174408} 11/07/2021 06:52:15 - INFO - __main__ - Step 68542: {'lr': 0.00028941782867016573, 'samples': 13160064, 'steps': 68541, 'loss/train': 0.9475650787353516} 11/07/2021 06:52:15 - INFO - __main__ - Step 68543: {'lr': 0.00028941258829556023, 'samples': 13160256, 'steps': 68542, 'loss/train': 1.384341835975647} 11/07/2021 06:52:15 - INFO - __main__ - Step 68544: {'lr': 0.0002894073479031959, 'samples': 13160448, 'steps': 68543, 'loss/train': 1.4221872091293335} 11/07/2021 06:52:16 - INFO - __main__ - Step 68545: {'lr': 0.0002894021074930752, 'samples': 13160640, 'steps': 68544, 'loss/train': 0.9279745221138} 11/07/2021 06:52:17 - INFO - __main__ - Step 68546: {'lr': 0.0002893968670652006, 'samples': 13160832, 'steps': 68545, 'loss/train': 1.1903822422027588} 11/07/2021 06:52:17 - INFO - __main__ - Step 68547: {'lr': 0.0002893916266195744, 'samples': 13161024, 'steps': 68546, 'loss/train': 1.8736354112625122} 11/07/2021 06:52:17 - INFO - __main__ - Step 68548: {'lr': 0.00028938638615619885, 'samples': 13161216, 'steps': 68547, 'loss/train': 1.1777520179748535} 11/07/2021 06:52:18 - INFO - __main__ - Step 68549: {'lr': 0.00028938114567507645, 'samples': 13161408, 'steps': 68548, 'loss/train': 1.0446878671646118} 11/07/2021 06:52:19 - INFO - __main__ - Step 68550: {'lr': 0.0002893759051762095, 'samples': 13161600, 'steps': 68549, 'loss/train': 1.3430054187774658} 11/07/2021 06:52:19 - INFO - __main__ - Step 68551: {'lr': 0.00028937066465960036, 'samples': 13161792, 'steps': 68550, 'loss/train': 1.2925267219543457} 11/07/2021 06:52:20 - INFO - __main__ - Step 68552: {'lr': 0.00028936542412525144, 'samples': 13161984, 'steps': 68551, 'loss/train': 1.5448753833770752} 11/07/2021 06:52:20 - INFO - __main__ - Step 68553: {'lr': 0.0002893601835731651, 'samples': 13162176, 'steps': 68552, 'loss/train': 1.3711864948272705} 11/07/2021 06:52:20 - INFO - __main__ - Step 68554: {'lr': 0.0002893549430033435, 'samples': 13162368, 'steps': 68553, 'loss/train': 2.3029096126556396} 11/07/2021 06:52:21 - INFO - __main__ - Step 68555: {'lr': 0.0002893497024157894, 'samples': 13162560, 'steps': 68554, 'loss/train': 1.5351048707962036} 11/07/2021 06:52:22 - INFO - __main__ - Step 68556: {'lr': 0.0002893444618105048, 'samples': 13162752, 'steps': 68555, 'loss/train': 0.20463217794895172} 11/07/2021 06:52:22 - INFO - __main__ - Step 68557: {'lr': 0.0002893392211874922, 'samples': 13162944, 'steps': 68556, 'loss/train': 1.0872955322265625} 11/07/2021 06:52:22 - INFO - __main__ - Step 68558: {'lr': 0.000289333980546754, 'samples': 13163136, 'steps': 68557, 'loss/train': 1.3457558155059814} 11/07/2021 06:52:23 - INFO - __main__ - Step 68559: {'lr': 0.00028932873988829244, 'samples': 13163328, 'steps': 68558, 'loss/train': 1.3973735570907593} 11/07/2021 06:52:23 - INFO - __main__ - Step 68560: {'lr': 0.00028932349921211004, 'samples': 13163520, 'steps': 68559, 'loss/train': 0.2949492037296295} 11/07/2021 06:52:24 - INFO - __main__ - Step 68561: {'lr': 0.000289318258518209, 'samples': 13163712, 'steps': 68560, 'loss/train': 1.494545578956604} 11/07/2021 06:52:25 - INFO - __main__ - Step 68562: {'lr': 0.00028931301780659184, 'samples': 13163904, 'steps': 68561, 'loss/train': 0.18235088884830475} 11/07/2021 06:52:25 - INFO - __main__ - Step 68563: {'lr': 0.0002893077770772608, 'samples': 13164096, 'steps': 68562, 'loss/train': 1.57148015499115} 11/07/2021 06:52:25 - INFO - __main__ - Step 68564: {'lr': 0.00028930253633021826, 'samples': 13164288, 'steps': 68563, 'loss/train': 1.5318199396133423} 11/07/2021 06:52:26 - INFO - __main__ - Step 68565: {'lr': 0.0002892972955654666, 'samples': 13164480, 'steps': 68564, 'loss/train': 0.6695629358291626} 11/07/2021 06:52:27 - INFO - __main__ - Step 68566: {'lr': 0.0002892920547830083, 'samples': 13164672, 'steps': 68565, 'loss/train': 1.164725422859192} 11/07/2021 06:52:27 - INFO - __main__ - Step 68567: {'lr': 0.0002892868139828455, 'samples': 13164864, 'steps': 68566, 'loss/train': 1.5269228219985962} 11/07/2021 06:52:27 - INFO - __main__ - Step 68568: {'lr': 0.00028928157316498066, 'samples': 13165056, 'steps': 68567, 'loss/train': 1.8205015659332275} 11/07/2021 06:52:28 - INFO - __main__ - Step 68569: {'lr': 0.0002892763323294162, 'samples': 13165248, 'steps': 68568, 'loss/train': 1.7416096925735474} 11/07/2021 06:52:28 - INFO - __main__ - Step 68570: {'lr': 0.00028927109147615436, 'samples': 13165440, 'steps': 68569, 'loss/train': 1.5282951593399048} 11/07/2021 06:52:29 - INFO - __main__ - Step 68571: {'lr': 0.0002892658506051977, 'samples': 13165632, 'steps': 68570, 'loss/train': 1.5587043762207031} 11/07/2021 06:52:30 - INFO - __main__ - Step 68572: {'lr': 0.0002892606097165483, 'samples': 13165824, 'steps': 68571, 'loss/train': 1.2393132448196411} 11/07/2021 06:52:30 - INFO - __main__ - Step 68573: {'lr': 0.00028925536881020875, 'samples': 13166016, 'steps': 68572, 'loss/train': 1.6842550039291382} 11/07/2021 06:52:30 - INFO - __main__ - Step 68574: {'lr': 0.0002892501278861813, 'samples': 13166208, 'steps': 68573, 'loss/train': 5.248936176300049} 11/07/2021 06:52:31 - INFO - __main__ - Step 68575: {'lr': 0.0002892448869444684, 'samples': 13166400, 'steps': 68574, 'loss/train': 1.447106957435608} 11/07/2021 06:52:31 - INFO - __main__ - Step 68576: {'lr': 0.00028923964598507235, 'samples': 13166592, 'steps': 68575, 'loss/train': 1.2530078887939453} 11/07/2021 06:52:31 - INFO - __main__ - Step 68577: {'lr': 0.0002892344050079956, 'samples': 13166784, 'steps': 68576, 'loss/train': 1.5531680583953857} 11/07/2021 06:52:32 - INFO - __main__ - Step 68578: {'lr': 0.0002892291640132403, 'samples': 13166976, 'steps': 68577, 'loss/train': 1.3544365167617798} 11/07/2021 06:52:33 - INFO - __main__ - Step 68579: {'lr': 0.00028922392300080894, 'samples': 13167168, 'steps': 68578, 'loss/train': 1.3869770765304565} 11/07/2021 06:52:33 - INFO - __main__ - Step 68580: {'lr': 0.00028921868197070397, 'samples': 13167360, 'steps': 68579, 'loss/train': 1.5916635990142822} 11/07/2021 06:52:34 - INFO - __main__ - Step 68581: {'lr': 0.00028921344092292764, 'samples': 13167552, 'steps': 68580, 'loss/train': 1.6355372667312622} 11/07/2021 06:52:34 - INFO - __main__ - Step 68582: {'lr': 0.0002892081998574823, 'samples': 13167744, 'steps': 68581, 'loss/train': 1.3367629051208496} 11/07/2021 06:52:35 - INFO - __main__ - Step 68583: {'lr': 0.0002892029587743704, 'samples': 13167936, 'steps': 68582, 'loss/train': 1.4195725917816162} 11/07/2021 06:52:35 - INFO - __main__ - Step 68584: {'lr': 0.00028919771767359426, 'samples': 13168128, 'steps': 68583, 'loss/train': 1.3855684995651245} 11/07/2021 06:52:35 - INFO - __main__ - Step 68585: {'lr': 0.0002891924765551562, 'samples': 13168320, 'steps': 68584, 'loss/train': 1.3464781045913696} 11/07/2021 06:52:36 - INFO - __main__ - Step 68586: {'lr': 0.0002891872354190586, 'samples': 13168512, 'steps': 68585, 'loss/train': 1.559922218322754} 11/07/2021 06:52:36 - INFO - __main__ - Step 68587: {'lr': 0.00028918199426530383, 'samples': 13168704, 'steps': 68586, 'loss/train': 1.7323822975158691} 11/07/2021 06:52:37 - INFO - __main__ - Step 68588: {'lr': 0.0002891767530938943, 'samples': 13168896, 'steps': 68587, 'loss/train': 1.5237305164337158} 11/07/2021 06:52:37 - INFO - __main__ - Step 68589: {'lr': 0.0002891715119048323, 'samples': 13169088, 'steps': 68588, 'loss/train': 1.4886772632598877} 11/07/2021 06:52:38 - INFO - __main__ - Step 68590: {'lr': 0.00028916627069812027, 'samples': 13169280, 'steps': 68589, 'loss/train': 1.3872697353363037} 11/07/2021 06:52:38 - INFO - __main__ - Step 68591: {'lr': 0.0002891610294737605, 'samples': 13169472, 'steps': 68590, 'loss/train': 0.9731568098068237} 11/07/2021 06:52:39 - INFO - __main__ - Step 68592: {'lr': 0.0002891557882317553, 'samples': 13169664, 'steps': 68591, 'loss/train': 1.1904104948043823} 11/07/2021 06:52:40 - INFO - __main__ - Step 68593: {'lr': 0.0002891505469721072, 'samples': 13169856, 'steps': 68592, 'loss/train': 1.4256020784378052} 11/07/2021 06:52:40 - INFO - __main__ - Step 68594: {'lr': 0.00028914530569481845, 'samples': 13170048, 'steps': 68593, 'loss/train': 0.684812068939209} 11/07/2021 06:52:40 - INFO - __main__ - Step 68595: {'lr': 0.00028914006439989136, 'samples': 13170240, 'steps': 68594, 'loss/train': 0.9781807065010071} 11/07/2021 06:52:41 - INFO - __main__ - Step 68596: {'lr': 0.0002891348230873284, 'samples': 13170432, 'steps': 68595, 'loss/train': 0.7420249581336975} 11/07/2021 06:52:41 - INFO - __main__ - Step 68597: {'lr': 0.000289129581757132, 'samples': 13170624, 'steps': 68596, 'loss/train': 0.8935926556587219} 11/07/2021 06:52:42 - INFO - __main__ - Step 68598: {'lr': 0.0002891243404093043, 'samples': 13170816, 'steps': 68597, 'loss/train': 1.497607707977295} 11/07/2021 06:52:42 - INFO - __main__ - Step 68599: {'lr': 0.0002891190990438478, 'samples': 13171008, 'steps': 68598, 'loss/train': 1.2739931344985962} 11/07/2021 06:52:43 - INFO - __main__ - Step 68600: {'lr': 0.0002891138576607648, 'samples': 13171200, 'steps': 68599, 'loss/train': 1.615509033203125} 11/07/2021 06:52:43 - INFO - __main__ - Step 68601: {'lr': 0.00028910861626005774, 'samples': 13171392, 'steps': 68600, 'loss/train': 1.5483686923980713} 11/07/2021 06:52:43 - INFO - __main__ - Step 68602: {'lr': 0.0002891033748417289, 'samples': 13171584, 'steps': 68601, 'loss/train': 1.2640442848205566} 11/07/2021 06:52:44 - INFO - __main__ - Step 68603: {'lr': 0.00028909813340578073, 'samples': 13171776, 'steps': 68602, 'loss/train': 1.442826271057129} 11/07/2021 06:52:45 - INFO - __main__ - Step 68604: {'lr': 0.0002890928919522156, 'samples': 13171968, 'steps': 68603, 'loss/train': 1.5252280235290527} 11/07/2021 06:52:45 - INFO - __main__ - Step 68605: {'lr': 0.0002890876504810357, 'samples': 13172160, 'steps': 68604, 'loss/train': 1.6068822145462036} 11/07/2021 06:52:45 - INFO - __main__ - Step 68606: {'lr': 0.0002890824089922436, 'samples': 13172352, 'steps': 68605, 'loss/train': 1.3848859071731567} 11/07/2021 06:52:46 - INFO - __main__ - Step 68607: {'lr': 0.0002890771674858415, 'samples': 13172544, 'steps': 68606, 'loss/train': 1.2241086959838867} 11/07/2021 06:52:47 - INFO - __main__ - Step 68608: {'lr': 0.00028907192596183185, 'samples': 13172736, 'steps': 68607, 'loss/train': 1.0867764949798584} 11/07/2021 06:52:47 - INFO - __main__ - Step 68609: {'lr': 0.000289066684420217, 'samples': 13172928, 'steps': 68608, 'loss/train': 1.4165880680084229} 11/07/2021 06:52:47 - INFO - __main__ - Step 68610: {'lr': 0.00028906144286099935, 'samples': 13173120, 'steps': 68609, 'loss/train': 1.6138248443603516} 11/07/2021 06:52:48 - INFO - __main__ - Step 68611: {'lr': 0.00028905620128418115, 'samples': 13173312, 'steps': 68610, 'loss/train': 1.5033382177352905} 11/07/2021 06:52:48 - INFO - __main__ - Step 68612: {'lr': 0.00028905095968976484, 'samples': 13173504, 'steps': 68611, 'loss/train': 1.5508837699890137} 11/07/2021 06:52:49 - INFO - __main__ - Step 68613: {'lr': 0.0002890457180777528, 'samples': 13173696, 'steps': 68612, 'loss/train': 1.1082264184951782} 11/07/2021 06:52:50 - INFO - __main__ - Step 68614: {'lr': 0.0002890404764481473, 'samples': 13173888, 'steps': 68613, 'loss/train': 1.4693334102630615} 11/07/2021 06:52:50 - INFO - __main__ - Step 68615: {'lr': 0.00028903523480095086, 'samples': 13174080, 'steps': 68614, 'loss/train': 1.207532286643982} 11/07/2021 06:52:50 - INFO - __main__ - Step 68616: {'lr': 0.00028902999313616565, 'samples': 13174272, 'steps': 68615, 'loss/train': 1.4219458103179932} 11/07/2021 06:52:51 - INFO - __main__ - Step 68617: {'lr': 0.0002890247514537942, 'samples': 13174464, 'steps': 68616, 'loss/train': 1.6656476259231567} 11/07/2021 06:52:51 - INFO - __main__ - Step 68618: {'lr': 0.0002890195097538388, 'samples': 13174656, 'steps': 68617, 'loss/train': 2.1112773418426514} 11/07/2021 06:52:52 - INFO - __main__ - Step 68619: {'lr': 0.0002890142680363017, 'samples': 13174848, 'steps': 68618, 'loss/train': 1.4690083265304565} 11/07/2021 06:52:52 - INFO - __main__ - Step 68620: {'lr': 0.00028900902630118547, 'samples': 13175040, 'steps': 68619, 'loss/train': 1.564023733139038} 11/07/2021 06:52:53 - INFO - __main__ - Step 68621: {'lr': 0.00028900378454849233, 'samples': 13175232, 'steps': 68620, 'loss/train': 1.684840440750122} 11/07/2021 06:52:53 - INFO - __main__ - Step 68622: {'lr': 0.00028899854277822476, 'samples': 13175424, 'steps': 68621, 'loss/train': 1.501598596572876} 11/07/2021 06:52:53 - INFO - __main__ - Step 68623: {'lr': 0.00028899330099038494, 'samples': 13175616, 'steps': 68622, 'loss/train': 1.5257139205932617} 11/07/2021 06:52:54 - INFO - __main__ - Step 68624: {'lr': 0.0002889880591849755, 'samples': 13175808, 'steps': 68623, 'loss/train': 1.6255637407302856} 11/07/2021 06:52:55 - INFO - __main__ - Step 68625: {'lr': 0.00028898281736199847, 'samples': 13176000, 'steps': 68624, 'loss/train': 1.6578673124313354} 11/07/2021 06:52:55 - INFO - __main__ - Step 68626: {'lr': 0.0002889775755214565, 'samples': 13176192, 'steps': 68625, 'loss/train': 1.9258816242218018} 11/07/2021 06:52:55 - INFO - __main__ - Step 68627: {'lr': 0.0002889723336633518, 'samples': 13176384, 'steps': 68626, 'loss/train': 1.4810529947280884} 11/07/2021 06:52:56 - INFO - __main__ - Step 68628: {'lr': 0.00028896709178768677, 'samples': 13176576, 'steps': 68627, 'loss/train': 1.5147489309310913} 11/07/2021 06:52:57 - INFO - __main__ - Step 68629: {'lr': 0.00028896184989446374, 'samples': 13176768, 'steps': 68628, 'loss/train': 2.945028066635132} 11/07/2021 06:52:57 - INFO - __main__ - Step 68630: {'lr': 0.0002889566079836852, 'samples': 13176960, 'steps': 68629, 'loss/train': 1.546177864074707} 11/07/2021 06:52:58 - INFO - __main__ - Step 68631: {'lr': 0.00028895136605535326, 'samples': 13177152, 'steps': 68630, 'loss/train': 1.1701334714889526} 11/07/2021 06:52:58 - INFO - __main__ - Step 68632: {'lr': 0.0002889461241094705, 'samples': 13177344, 'steps': 68631, 'loss/train': 0.6706123948097229} 11/07/2021 06:52:58 - INFO - __main__ - Step 68633: {'lr': 0.0002889408821460393, 'samples': 13177536, 'steps': 68632, 'loss/train': 1.5615003108978271} 11/07/2021 06:52:59 - INFO - __main__ - Step 68634: {'lr': 0.0002889356401650618, 'samples': 13177728, 'steps': 68633, 'loss/train': 1.6094282865524292} 11/07/2021 06:53:00 - INFO - __main__ - Step 68635: {'lr': 0.0002889303981665406, 'samples': 13177920, 'steps': 68634, 'loss/train': 0.552882969379425} 11/07/2021 06:53:00 - INFO - __main__ - Step 68636: {'lr': 0.0002889251561504779, 'samples': 13178112, 'steps': 68635, 'loss/train': 1.5402191877365112} 11/07/2021 06:53:00 - INFO - __main__ - Step 68637: {'lr': 0.0002889199141168762, 'samples': 13178304, 'steps': 68636, 'loss/train': 1.3362559080123901} 11/07/2021 06:53:01 - INFO - __main__ - Step 68638: {'lr': 0.00028891467206573773, 'samples': 13178496, 'steps': 68637, 'loss/train': 1.1978731155395508} 11/07/2021 06:53:02 - INFO - __main__ - Step 68639: {'lr': 0.0002889094299970649, 'samples': 13178688, 'steps': 68638, 'loss/train': 1.2871108055114746} 11/07/2021 06:53:02 - INFO - __main__ - Step 68640: {'lr': 0.00028890418791086014, 'samples': 13178880, 'steps': 68639, 'loss/train': 1.4690566062927246} 11/07/2021 06:53:02 - INFO - __main__ - Step 68641: {'lr': 0.0002888989458071257, 'samples': 13179072, 'steps': 68640, 'loss/train': 1.1232426166534424} 11/07/2021 06:53:03 - INFO - __main__ - Step 68642: {'lr': 0.000288893703685864, 'samples': 13179264, 'steps': 68641, 'loss/train': 1.6624882221221924} 11/07/2021 06:53:03 - INFO - __main__ - Step 68643: {'lr': 0.0002888884615470774, 'samples': 13179456, 'steps': 68642, 'loss/train': 1.3234843015670776} 11/07/2021 06:53:04 - INFO - __main__ - Step 68644: {'lr': 0.00028888321939076833, 'samples': 13179648, 'steps': 68643, 'loss/train': 1.8954886198043823} 11/07/2021 06:53:04 - INFO - __main__ - Step 68645: {'lr': 0.00028887797721693903, 'samples': 13179840, 'steps': 68644, 'loss/train': 1.2630764245986938} 11/07/2021 06:53:05 - INFO - __main__ - Step 68646: {'lr': 0.0002888727350255919, 'samples': 13180032, 'steps': 68645, 'loss/train': 1.2586029767990112} 11/07/2021 06:53:05 - INFO - __main__ - Step 68647: {'lr': 0.0002888674928167293, 'samples': 13180224, 'steps': 68646, 'loss/train': 0.9970539808273315} 11/07/2021 06:53:05 - INFO - __main__ - Step 68648: {'lr': 0.00028886225059035367, 'samples': 13180416, 'steps': 68647, 'loss/train': 1.7346444129943848} 11/07/2021 06:53:06 - INFO - __main__ - Step 68649: {'lr': 0.00028885700834646724, 'samples': 13180608, 'steps': 68648, 'loss/train': 0.873485267162323} 11/07/2021 06:53:07 - INFO - __main__ - Step 68650: {'lr': 0.00028885176608507246, 'samples': 13180800, 'steps': 68649, 'loss/train': 1.1988645792007446} 11/07/2021 06:53:07 - INFO - __main__ - Step 68651: {'lr': 0.0002888465238061717, 'samples': 13180992, 'steps': 68650, 'loss/train': 1.3051223754882812} 11/07/2021 06:53:08 - INFO - __main__ - Step 68652: {'lr': 0.0002888412815097673, 'samples': 13181184, 'steps': 68651, 'loss/train': 1.3747987747192383} 11/07/2021 06:53:08 - INFO - __main__ - Step 68653: {'lr': 0.0002888360391958616, 'samples': 13181376, 'steps': 68652, 'loss/train': 0.6782568097114563} 11/07/2021 06:53:08 - INFO - __main__ - Step 68654: {'lr': 0.00028883079686445697, 'samples': 13181568, 'steps': 68653, 'loss/train': 1.474259853363037} 11/07/2021 06:53:09 - INFO - __main__ - Step 68655: {'lr': 0.00028882555451555575, 'samples': 13181760, 'steps': 68654, 'loss/train': 1.569907307624817} 11/07/2021 06:53:10 - INFO - __main__ - Step 68656: {'lr': 0.0002888203121491604, 'samples': 13181952, 'steps': 68655, 'loss/train': 1.2499042749404907} 11/07/2021 06:53:10 - INFO - __main__ - Step 68657: {'lr': 0.0002888150697652732, 'samples': 13182144, 'steps': 68656, 'loss/train': 1.3580213785171509} 11/07/2021 06:53:10 - INFO - __main__ - Step 68658: {'lr': 0.00028880982736389653, 'samples': 13182336, 'steps': 68657, 'loss/train': 1.4374773502349854} 11/07/2021 06:53:11 - INFO - __main__ - Step 68659: {'lr': 0.00028880458494503277, 'samples': 13182528, 'steps': 68658, 'loss/train': 1.5532093048095703} 11/07/2021 06:53:12 - INFO - __main__ - Step 68660: {'lr': 0.0002887993425086842, 'samples': 13182720, 'steps': 68659, 'loss/train': 1.6056257486343384} 11/07/2021 06:53:12 - INFO - __main__ - Step 68661: {'lr': 0.0002887941000548533, 'samples': 13182912, 'steps': 68660, 'loss/train': 1.4871885776519775} 11/07/2021 06:53:12 - INFO - __main__ - Step 68662: {'lr': 0.0002887888575835423, 'samples': 13183104, 'steps': 68661, 'loss/train': 1.5413475036621094} 11/07/2021 06:53:13 - INFO - __main__ - Step 68663: {'lr': 0.0002887836150947537, 'samples': 13183296, 'steps': 68662, 'loss/train': 1.8570820093154907} 11/07/2021 06:53:13 - INFO - __main__ - Step 68664: {'lr': 0.0002887783725884898, 'samples': 13183488, 'steps': 68663, 'loss/train': 1.2113217115402222} 11/07/2021 06:53:14 - INFO - __main__ - Step 68665: {'lr': 0.000288773130064753, 'samples': 13183680, 'steps': 68664, 'loss/train': 1.511551856994629} 11/07/2021 06:53:14 - INFO - __main__ - Step 68666: {'lr': 0.00028876788752354554, 'samples': 13183872, 'steps': 68665, 'loss/train': 1.690229058265686} 11/07/2021 06:53:15 - INFO - __main__ - Step 68667: {'lr': 0.00028876264496486995, 'samples': 13184064, 'steps': 68666, 'loss/train': 1.6244231462478638} 11/07/2021 06:53:15 - INFO - __main__ - Step 68668: {'lr': 0.00028875740238872846, 'samples': 13184256, 'steps': 68667, 'loss/train': 1.3442434072494507} 11/07/2021 06:53:15 - INFO - __main__ - Step 68669: {'lr': 0.0002887521597951235, 'samples': 13184448, 'steps': 68668, 'loss/train': 0.9774678945541382} 11/07/2021 06:53:17 - INFO - __main__ - Step 68670: {'lr': 0.00028874691718405737, 'samples': 13184640, 'steps': 68669, 'loss/train': 1.1435712575912476} 11/07/2021 06:53:17 - INFO - __main__ - Step 68671: {'lr': 0.0002887416745555326, 'samples': 13184832, 'steps': 68670, 'loss/train': 1.4447605609893799} 11/07/2021 06:53:17 - INFO - __main__ - Step 68672: {'lr': 0.00028873643190955136, 'samples': 13185024, 'steps': 68671, 'loss/train': 3.1397199630737305} 11/07/2021 06:53:18 - INFO - __main__ - Step 68673: {'lr': 0.00028873118924611604, 'samples': 13185216, 'steps': 68672, 'loss/train': 1.452022910118103} 11/07/2021 06:53:18 - INFO - __main__ - Step 68674: {'lr': 0.00028872594656522907, 'samples': 13185408, 'steps': 68673, 'loss/train': 1.2466163635253906} 11/07/2021 06:53:19 - INFO - __main__ - Step 68675: {'lr': 0.00028872070386689274, 'samples': 13185600, 'steps': 68674, 'loss/train': 1.5625542402267456} 11/07/2021 06:53:19 - INFO - __main__ - Step 68676: {'lr': 0.00028871546115110953, 'samples': 13185792, 'steps': 68675, 'loss/train': 1.4401350021362305} 11/07/2021 06:53:20 - INFO - __main__ - Step 68677: {'lr': 0.00028871021841788173, 'samples': 13185984, 'steps': 68676, 'loss/train': 1.4187161922454834} 11/07/2021 06:53:20 - INFO - __main__ - Step 68678: {'lr': 0.0002887049756672117, 'samples': 13186176, 'steps': 68677, 'loss/train': 1.5487710237503052} 11/07/2021 06:53:20 - INFO - __main__ - Step 68679: {'lr': 0.00028869973289910177, 'samples': 13186368, 'steps': 68678, 'loss/train': 1.5529298782348633} 11/07/2021 06:53:21 - INFO - __main__ - Step 68680: {'lr': 0.0002886944901135544, 'samples': 13186560, 'steps': 68679, 'loss/train': 1.2850680351257324} 11/07/2021 06:53:22 - INFO - __main__ - Step 68681: {'lr': 0.0002886892473105718, 'samples': 13186752, 'steps': 68680, 'loss/train': 1.2167309522628784} 11/07/2021 06:53:22 - INFO - __main__ - Step 68682: {'lr': 0.0002886840044901564, 'samples': 13186944, 'steps': 68681, 'loss/train': 0.7142318487167358} 11/07/2021 06:53:22 - INFO - __main__ - Step 68683: {'lr': 0.00028867876165231067, 'samples': 13187136, 'steps': 68682, 'loss/train': 1.378138780593872} 11/07/2021 06:53:23 - INFO - __main__ - Step 68684: {'lr': 0.00028867351879703694, 'samples': 13187328, 'steps': 68683, 'loss/train': 1.518328070640564} 11/07/2021 06:53:23 - INFO - __main__ - Step 68685: {'lr': 0.0002886682759243374, 'samples': 13187520, 'steps': 68684, 'loss/train': 1.5086697340011597} 11/07/2021 06:53:24 - INFO - __main__ - Step 68686: {'lr': 0.0002886630330342146, 'samples': 13187712, 'steps': 68685, 'loss/train': 1.22567880153656} 11/07/2021 06:53:24 - INFO - __main__ - Step 68687: {'lr': 0.0002886577901266708, 'samples': 13187904, 'steps': 68686, 'loss/train': 1.812575101852417} 11/07/2021 06:53:25 - INFO - __main__ - Step 68688: {'lr': 0.0002886525472017084, 'samples': 13188096, 'steps': 68687, 'loss/train': 1.506011962890625} 11/07/2021 06:53:25 - INFO - __main__ - Step 68689: {'lr': 0.0002886473042593298, 'samples': 13188288, 'steps': 68688, 'loss/train': 1.349891185760498} 11/07/2021 06:53:26 - INFO - __main__ - Step 68690: {'lr': 0.0002886420612995373, 'samples': 13188480, 'steps': 68689, 'loss/train': 1.24039626121521} 11/07/2021 06:53:27 - INFO - __main__ - Step 68691: {'lr': 0.00028863681832233323, 'samples': 13188672, 'steps': 68690, 'loss/train': 1.600056767463684} 11/07/2021 06:53:27 - INFO - __main__ - Step 68692: {'lr': 0.00028863157532772006, 'samples': 13188864, 'steps': 68691, 'loss/train': 1.5467177629470825} 11/07/2021 06:53:27 - INFO - __main__ - Step 68693: {'lr': 0.00028862633231570013, 'samples': 13189056, 'steps': 68692, 'loss/train': 1.281374216079712} 11/07/2021 06:53:28 - INFO - __main__ - Step 68694: {'lr': 0.0002886210892862757, 'samples': 13189248, 'steps': 68693, 'loss/train': 1.7246155738830566} 11/07/2021 06:53:28 - INFO - __main__ - Step 68695: {'lr': 0.00028861584623944927, 'samples': 13189440, 'steps': 68694, 'loss/train': 1.3158059120178223} 11/07/2021 06:53:29 - INFO - __main__ - Step 68696: {'lr': 0.0002886106031752231, 'samples': 13189632, 'steps': 68695, 'loss/train': 0.906191349029541} 11/07/2021 06:53:29 - INFO - __main__ - Step 68697: {'lr': 0.00028860536009359957, 'samples': 13189824, 'steps': 68696, 'loss/train': 1.7008417844772339} 11/07/2021 06:53:30 - INFO - __main__ - Step 68698: {'lr': 0.00028860011699458104, 'samples': 13190016, 'steps': 68697, 'loss/train': 1.171958088874817} 11/07/2021 06:53:30 - INFO - __main__ - Step 68699: {'lr': 0.0002885948738781699, 'samples': 13190208, 'steps': 68698, 'loss/train': 0.2517613172531128} 11/07/2021 06:53:30 - INFO - __main__ - Step 68700: {'lr': 0.00028858963074436864, 'samples': 13190400, 'steps': 68699, 'loss/train': 1.3526908159255981} 11/07/2021 06:53:31 - INFO - __main__ - Step 68701: {'lr': 0.0002885843875931793, 'samples': 13190592, 'steps': 68700, 'loss/train': 1.5504770278930664} 11/07/2021 06:53:32 - INFO - __main__ - Step 68702: {'lr': 0.0002885791444246045, 'samples': 13190784, 'steps': 68701, 'loss/train': 3.4030418395996094} 11/07/2021 06:53:32 - INFO - __main__ - Step 68703: {'lr': 0.00028857390123864657, 'samples': 13190976, 'steps': 68702, 'loss/train': 1.468091607093811} 11/07/2021 06:53:33 - INFO - __main__ - Step 68704: {'lr': 0.0002885686580353078, 'samples': 13191168, 'steps': 68703, 'loss/train': 1.4066365957260132} 11/07/2021 06:53:33 - INFO - __main__ - Step 68705: {'lr': 0.00028856341481459064, 'samples': 13191360, 'steps': 68704, 'loss/train': 1.0208022594451904} 11/07/2021 06:53:34 - INFO - __main__ - Step 68706: {'lr': 0.0002885581715764973, 'samples': 13191552, 'steps': 68705, 'loss/train': 1.430219054222107} 11/07/2021 06:53:34 - INFO - __main__ - Step 68707: {'lr': 0.00028855292832103037, 'samples': 13191744, 'steps': 68706, 'loss/train': 1.70212984085083} 11/07/2021 06:53:35 - INFO - __main__ - Step 68708: {'lr': 0.00028854768504819195, 'samples': 13191936, 'steps': 68707, 'loss/train': 1.489814043045044} 11/07/2021 06:53:35 - INFO - __main__ - Step 68709: {'lr': 0.0002885424417579846, 'samples': 13192128, 'steps': 68708, 'loss/train': 2.3311526775360107} 11/07/2021 06:53:35 - INFO - __main__ - Step 68710: {'lr': 0.0002885371984504107, 'samples': 13192320, 'steps': 68709, 'loss/train': 1.607001543045044} 11/07/2021 06:53:36 - INFO - __main__ - Step 68711: {'lr': 0.0002885319551254725, 'samples': 13192512, 'steps': 68710, 'loss/train': 0.9068493247032166} 11/07/2021 06:53:37 - INFO - __main__ - Step 68712: {'lr': 0.00028852671178317233, 'samples': 13192704, 'steps': 68711, 'loss/train': 1.7448569536209106} 11/07/2021 06:53:37 - INFO - __main__ - Step 68713: {'lr': 0.00028852146842351257, 'samples': 13192896, 'steps': 68712, 'loss/train': 1.4623125791549683} 11/07/2021 06:53:37 - INFO - __main__ - Step 68714: {'lr': 0.0002885162250464957, 'samples': 13193088, 'steps': 68713, 'loss/train': 1.3421516418457031} 11/07/2021 06:53:38 - INFO - __main__ - Step 68715: {'lr': 0.000288510981652124, 'samples': 13193280, 'steps': 68714, 'loss/train': 1.1508461236953735} 11/07/2021 06:53:38 - INFO - __main__ - Step 68716: {'lr': 0.0002885057382403999, 'samples': 13193472, 'steps': 68715, 'loss/train': 1.6587640047073364} 11/07/2021 06:53:39 - INFO - __main__ - Step 68717: {'lr': 0.0002885004948113256, 'samples': 13193664, 'steps': 68716, 'loss/train': 1.3073647022247314} 11/07/2021 06:53:40 - INFO - __main__ - Step 68718: {'lr': 0.0002884952513649037, 'samples': 13193856, 'steps': 68717, 'loss/train': 1.4667097330093384} 11/07/2021 06:53:40 - INFO - __main__ - Step 68719: {'lr': 0.00028849000790113637, 'samples': 13194048, 'steps': 68718, 'loss/train': 1.3513203859329224} 11/07/2021 06:53:40 - INFO - __main__ - Step 68720: {'lr': 0.000288484764420026, 'samples': 13194240, 'steps': 68719, 'loss/train': 1.3984423875808716} 11/07/2021 06:53:41 - INFO - __main__ - Step 68721: {'lr': 0.0002884795209215751, 'samples': 13194432, 'steps': 68720, 'loss/train': 1.0205692052841187} 11/07/2021 06:53:42 - INFO - __main__ - Step 68722: {'lr': 0.0002884742774057858, 'samples': 13194624, 'steps': 68721, 'loss/train': 1.694258451461792} 11/07/2021 06:53:42 - INFO - __main__ - Step 68723: {'lr': 0.00028846903387266066, 'samples': 13194816, 'steps': 68722, 'loss/train': 1.3675493001937866} 11/07/2021 06:53:42 - INFO - __main__ - Step 68724: {'lr': 0.0002884637903222019, 'samples': 13195008, 'steps': 68723, 'loss/train': 1.0592072010040283} 11/07/2021 06:53:43 - INFO - __main__ - Step 68725: {'lr': 0.0002884585467544121, 'samples': 13195200, 'steps': 68724, 'loss/train': 1.2994358539581299} 11/07/2021 06:53:43 - INFO - __main__ - Step 68726: {'lr': 0.0002884533031692933, 'samples': 13195392, 'steps': 68725, 'loss/train': 2.0725536346435547} 11/07/2021 06:53:46 - INFO - __main__ - Step 68727: {'lr': 0.0002884480595668481, 'samples': 13195584, 'steps': 68726, 'loss/train': 1.380529761314392} 11/07/2021 06:53:46 - INFO - __main__ - Step 68728: {'lr': 0.00028844281594707876, 'samples': 13195776, 'steps': 68727, 'loss/train': 1.2343940734863281} 11/07/2021 06:53:46 - INFO - __main__ - Step 68729: {'lr': 0.00028843757230998776, 'samples': 13195968, 'steps': 68728, 'loss/train': 1.418465256690979} 11/07/2021 06:53:47 - INFO - __main__ - Step 68730: {'lr': 0.00028843232865557734, 'samples': 13196160, 'steps': 68729, 'loss/train': 1.3689936399459839} 11/07/2021 06:53:47 - INFO - __main__ - Step 68731: {'lr': 0.00028842708498384994, 'samples': 13196352, 'steps': 68730, 'loss/train': 1.3399474620819092} 11/07/2021 06:53:47 - INFO - __main__ - Step 68732: {'lr': 0.0002884218412948078, 'samples': 13196544, 'steps': 68731, 'loss/train': 1.4285093545913696} 11/07/2021 06:53:48 - INFO - __main__ - Step 68733: {'lr': 0.00028841659758845344, 'samples': 13196736, 'steps': 68732, 'loss/train': 1.8496149778366089} 11/07/2021 06:53:48 - INFO - __main__ - Step 68734: {'lr': 0.0002884113538647891, 'samples': 13196928, 'steps': 68733, 'loss/train': 1.8533610105514526} 11/07/2021 06:53:48 - INFO - __main__ - Step 68735: {'lr': 0.0002884061101238173, 'samples': 13197120, 'steps': 68734, 'loss/train': 1.7943593263626099} 11/07/2021 06:53:50 - INFO - __main__ - Step 68736: {'lr': 0.0002884008663655402, 'samples': 13197312, 'steps': 68735, 'loss/train': 0.7853092551231384} 11/07/2021 06:53:50 - INFO - __main__ - Step 68737: {'lr': 0.00028839562258996026, 'samples': 13197504, 'steps': 68736, 'loss/train': 1.5404702425003052} 11/07/2021 06:53:50 - INFO - __main__ - Step 68738: {'lr': 0.00028839037879708, 'samples': 13197696, 'steps': 68737, 'loss/train': 1.4736005067825317} 11/07/2021 06:53:51 - INFO - __main__ - Step 68739: {'lr': 0.00028838513498690143, 'samples': 13197888, 'steps': 68738, 'loss/train': 2.019211530685425} 11/07/2021 06:53:51 - INFO - __main__ - Step 68740: {'lr': 0.0002883798911594272, 'samples': 13198080, 'steps': 68739, 'loss/train': 0.8476840853691101} 11/07/2021 06:53:52 - INFO - __main__ - Step 68741: {'lr': 0.00028837464731465954, 'samples': 13198272, 'steps': 68740, 'loss/train': 1.140662431716919} 11/07/2021 06:53:52 - INFO - __main__ - Step 68742: {'lr': 0.00028836940345260093, 'samples': 13198464, 'steps': 68741, 'loss/train': 1.876542329788208} 11/07/2021 06:53:53 - INFO - __main__ - Step 68743: {'lr': 0.0002883641595732536, 'samples': 13198656, 'steps': 68742, 'loss/train': 1.4159777164459229} 11/07/2021 06:53:53 - INFO - __main__ - Step 68744: {'lr': 0.00028835891567662, 'samples': 13198848, 'steps': 68743, 'loss/train': 1.5047733783721924} 11/07/2021 06:53:53 - INFO - __main__ - Step 68745: {'lr': 0.0002883536717627025, 'samples': 13199040, 'steps': 68744, 'loss/train': 1.6188526153564453} 11/07/2021 06:53:54 - INFO - __main__ - Step 68746: {'lr': 0.0002883484278315033, 'samples': 13199232, 'steps': 68745, 'loss/train': 1.3095821142196655} 11/07/2021 06:53:55 - INFO - __main__ - Step 68747: {'lr': 0.00028834318388302506, 'samples': 13199424, 'steps': 68746, 'loss/train': 1.694445252418518} 11/07/2021 06:53:56 - INFO - __main__ - Step 68748: {'lr': 0.0002883379399172699, 'samples': 13199616, 'steps': 68747, 'loss/train': 2.6598801612854004} 11/07/2021 06:53:56 - INFO - __main__ - Step 68749: {'lr': 0.00028833269593424017, 'samples': 13199808, 'steps': 68748, 'loss/train': 2.9452710151672363} 11/07/2021 06:53:56 - INFO - __main__ - Step 68750: {'lr': 0.0002883274519339384, 'samples': 13200000, 'steps': 68749, 'loss/train': 1.59113609790802} 11/07/2021 06:53:57 - INFO - __main__ - Step 68751: {'lr': 0.0002883222079163669, 'samples': 13200192, 'steps': 68750, 'loss/train': 0.76740962266922} 11/07/2021 06:53:58 - INFO - __main__ - Step 68752: {'lr': 0.000288316963881528, 'samples': 13200384, 'steps': 68751, 'loss/train': 1.492805004119873} 11/07/2021 06:53:58 - INFO - __main__ - Step 68753: {'lr': 0.00028831171982942396, 'samples': 13200576, 'steps': 68752, 'loss/train': 1.639355182647705} 11/07/2021 06:53:58 - INFO - __main__ - Step 68754: {'lr': 0.00028830647576005733, 'samples': 13200768, 'steps': 68753, 'loss/train': 1.8954436779022217} 11/07/2021 06:53:59 - INFO - __main__ - Step 68755: {'lr': 0.00028830123167343036, 'samples': 13200960, 'steps': 68754, 'loss/train': 2.180112361907959} 11/07/2021 06:53:59 - INFO - __main__ - Step 68756: {'lr': 0.0002882959875695455, 'samples': 13201152, 'steps': 68755, 'loss/train': 1.4466708898544312} 11/07/2021 06:54:00 - INFO - __main__ - Step 68757: {'lr': 0.000288290743448405, 'samples': 13201344, 'steps': 68756, 'loss/train': 2.798267364501953} 11/07/2021 06:54:01 - INFO - __main__ - Step 68758: {'lr': 0.00028828549931001136, 'samples': 13201536, 'steps': 68757, 'loss/train': 1.8358933925628662} 11/07/2021 06:54:01 - INFO - __main__ - Step 68759: {'lr': 0.00028828025515436684, 'samples': 13201728, 'steps': 68758, 'loss/train': 1.7481921911239624} 11/07/2021 06:54:01 - INFO - __main__ - Step 68760: {'lr': 0.0002882750109814738, 'samples': 13201920, 'steps': 68759, 'loss/train': 1.4378331899642944} 11/07/2021 06:54:02 - INFO - __main__ - Step 68761: {'lr': 0.0002882697667913346, 'samples': 13202112, 'steps': 68760, 'loss/train': 1.4361995458602905} 11/07/2021 06:54:03 - INFO - __main__ - Step 68762: {'lr': 0.0002882645225839517, 'samples': 13202304, 'steps': 68761, 'loss/train': 1.3460475206375122} 11/07/2021 06:54:03 - INFO - __main__ - Step 68763: {'lr': 0.0002882592783593273, 'samples': 13202496, 'steps': 68762, 'loss/train': 3.2686145305633545} 11/07/2021 06:54:03 - INFO - __main__ - Step 68764: {'lr': 0.00028825403411746395, 'samples': 13202688, 'steps': 68763, 'loss/train': 1.1555014848709106} 11/07/2021 06:54:04 - INFO - __main__ - Step 68765: {'lr': 0.00028824878985836394, 'samples': 13202880, 'steps': 68764, 'loss/train': 1.3508121967315674} 11/07/2021 06:54:04 - INFO - __main__ - Step 68766: {'lr': 0.00028824354558202957, 'samples': 13203072, 'steps': 68765, 'loss/train': 1.5100679397583008} 11/07/2021 06:54:04 - INFO - __main__ - Step 68767: {'lr': 0.0002882383012884632, 'samples': 13203264, 'steps': 68766, 'loss/train': 1.2283340692520142} 11/07/2021 06:54:06 - INFO - __main__ - Step 68768: {'lr': 0.0002882330569776673, 'samples': 13203456, 'steps': 68767, 'loss/train': 1.745262622833252} 11/07/2021 06:54:06 - INFO - __main__ - Step 68769: {'lr': 0.0002882278126496442, 'samples': 13203648, 'steps': 68768, 'loss/train': 1.1287219524383545} 11/07/2021 06:54:06 - INFO - __main__ - Step 68770: {'lr': 0.0002882225683043962, 'samples': 13203840, 'steps': 68769, 'loss/train': 1.6779792308807373} 11/07/2021 06:54:07 - INFO - __main__ - Step 68771: {'lr': 0.0002882173239419257, 'samples': 13204032, 'steps': 68770, 'loss/train': 1.3601665496826172} 11/07/2021 06:54:07 - INFO - __main__ - Step 68772: {'lr': 0.0002882120795622351, 'samples': 13204224, 'steps': 68771, 'loss/train': 1.6758509874343872} 11/07/2021 06:54:08 - INFO - __main__ - Step 68773: {'lr': 0.0002882068351653267, 'samples': 13204416, 'steps': 68772, 'loss/train': 1.9556405544281006} 11/07/2021 06:54:08 - INFO - __main__ - Step 68774: {'lr': 0.00028820159075120287, 'samples': 13204608, 'steps': 68773, 'loss/train': 1.8062893152236938} 11/07/2021 06:54:09 - INFO - __main__ - Step 68775: {'lr': 0.000288196346319866, 'samples': 13204800, 'steps': 68774, 'loss/train': 1.4754974842071533} 11/07/2021 06:54:09 - INFO - __main__ - Step 68776: {'lr': 0.0002881911018713185, 'samples': 13204992, 'steps': 68775, 'loss/train': 1.5220545530319214} 11/07/2021 06:54:09 - INFO - __main__ - Step 68777: {'lr': 0.00028818585740556256, 'samples': 13205184, 'steps': 68776, 'loss/train': 1.64780855178833} 11/07/2021 06:54:11 - INFO - __main__ - Step 68778: {'lr': 0.00028818061292260077, 'samples': 13205376, 'steps': 68777, 'loss/train': 1.2730201482772827} 11/07/2021 06:54:11 - INFO - __main__ - Step 68779: {'lr': 0.00028817536842243535, 'samples': 13205568, 'steps': 68778, 'loss/train': 1.4944852590560913} 11/07/2021 06:54:11 - INFO - __main__ - Step 68780: {'lr': 0.0002881701239050687, 'samples': 13205760, 'steps': 68779, 'loss/train': 1.5966839790344238} 11/07/2021 06:54:12 - INFO - __main__ - Step 68781: {'lr': 0.00028816487937050316, 'samples': 13205952, 'steps': 68780, 'loss/train': 1.3402107954025269} 11/07/2021 06:54:12 - INFO - __main__ - Step 68782: {'lr': 0.0002881596348187412, 'samples': 13206144, 'steps': 68781, 'loss/train': 1.5101373195648193} 11/07/2021 06:54:13 - INFO - __main__ - Step 68783: {'lr': 0.00028815439024978495, 'samples': 13206336, 'steps': 68782, 'loss/train': 1.4325319528579712} 11/07/2021 06:54:13 - INFO - __main__ - Step 68784: {'lr': 0.00028814914566363704, 'samples': 13206528, 'steps': 68783, 'loss/train': 1.9134025573730469} 11/07/2021 06:54:14 - INFO - __main__ - Step 68785: {'lr': 0.0002881439010602997, 'samples': 13206720, 'steps': 68784, 'loss/train': 1.0383175611495972} 11/07/2021 06:54:14 - INFO - __main__ - Step 68786: {'lr': 0.00028813865643977527, 'samples': 13206912, 'steps': 68785, 'loss/train': 1.6336859464645386} 11/07/2021 06:54:14 - INFO - __main__ - Step 68787: {'lr': 0.00028813341180206623, 'samples': 13207104, 'steps': 68786, 'loss/train': 1.5880076885223389} 11/07/2021 06:54:15 - INFO - __main__ - Step 68788: {'lr': 0.0002881281671471747, 'samples': 13207296, 'steps': 68787, 'loss/train': 1.5060057640075684} 11/07/2021 06:54:16 - INFO - __main__ - Step 68789: {'lr': 0.0002881229224751033, 'samples': 13207488, 'steps': 68788, 'loss/train': 1.3176006078720093} 11/07/2021 06:54:16 - INFO - __main__ - Step 68790: {'lr': 0.0002881176777858543, 'samples': 13207680, 'steps': 68789, 'loss/train': 2.072941780090332} 11/07/2021 06:54:17 - INFO - __main__ - Step 68791: {'lr': 0.0002881124330794301, 'samples': 13207872, 'steps': 68790, 'loss/train': 1.3428821563720703} 11/07/2021 06:54:17 - INFO - __main__ - Step 68792: {'lr': 0.000288107188355833, 'samples': 13208064, 'steps': 68791, 'loss/train': 1.1934484243392944} 11/07/2021 06:54:17 - INFO - __main__ - Step 68793: {'lr': 0.00028810194361506534, 'samples': 13208256, 'steps': 68792, 'loss/train': 1.1867146492004395} 11/07/2021 06:54:18 - INFO - __main__ - Step 68794: {'lr': 0.0002880966988571296, 'samples': 13208448, 'steps': 68793, 'loss/train': 1.2820014953613281} 11/07/2021 06:54:19 - INFO - __main__ - Step 68795: {'lr': 0.00028809145408202803, 'samples': 13208640, 'steps': 68794, 'loss/train': 1.2156034708023071} 11/07/2021 06:54:19 - INFO - __main__ - Step 68796: {'lr': 0.00028808620928976304, 'samples': 13208832, 'steps': 68795, 'loss/train': 1.8367804288864136} 11/07/2021 06:54:19 - INFO - __main__ - Step 68797: {'lr': 0.00028808096448033703, 'samples': 13209024, 'steps': 68796, 'loss/train': 1.323794960975647} 11/07/2021 06:54:20 - INFO - __main__ - Step 68798: {'lr': 0.00028807571965375233, 'samples': 13209216, 'steps': 68797, 'loss/train': 1.4324678182601929} 11/07/2021 06:54:21 - INFO - __main__ - Step 68799: {'lr': 0.00028807047481001127, 'samples': 13209408, 'steps': 68798, 'loss/train': 1.3734685182571411} 11/07/2021 06:54:21 - INFO - __main__ - Step 68800: {'lr': 0.0002880652299491162, 'samples': 13209600, 'steps': 68799, 'loss/train': 1.4962210655212402} 11/07/2021 06:54:21 - INFO - __main__ - Step 68801: {'lr': 0.00028805998507106954, 'samples': 13209792, 'steps': 68800, 'loss/train': 1.092325210571289} 11/07/2021 06:54:22 - INFO - __main__ - Step 68802: {'lr': 0.00028805474017587376, 'samples': 13209984, 'steps': 68801, 'loss/train': 1.7781310081481934} 11/07/2021 06:54:22 - INFO - __main__ - Step 68803: {'lr': 0.00028804949526353094, 'samples': 13210176, 'steps': 68802, 'loss/train': 1.2796870470046997} 11/07/2021 06:54:23 - INFO - __main__ - Step 68804: {'lr': 0.0002880442503340437, 'samples': 13210368, 'steps': 68803, 'loss/train': 1.7962254285812378} 11/07/2021 06:54:23 - INFO - __main__ - Step 68805: {'lr': 0.0002880390053874143, 'samples': 13210560, 'steps': 68804, 'loss/train': 1.9241870641708374} 11/07/2021 06:54:24 - INFO - __main__ - Step 68806: {'lr': 0.0002880337604236451, 'samples': 13210752, 'steps': 68805, 'loss/train': 0.5275155901908875} 11/07/2021 06:54:24 - INFO - __main__ - Step 68807: {'lr': 0.0002880285154427385, 'samples': 13210944, 'steps': 68806, 'loss/train': 1.386277198791504} 11/07/2021 06:54:24 - INFO - __main__ - Step 68808: {'lr': 0.00028802327044469674, 'samples': 13211136, 'steps': 68807, 'loss/train': 1.433838129043579} 11/07/2021 06:54:25 - INFO - __main__ - Step 68809: {'lr': 0.00028801802542952233, 'samples': 13211328, 'steps': 68808, 'loss/train': 1.1382306814193726} 11/07/2021 06:54:26 - INFO - __main__ - Step 68810: {'lr': 0.0002880127803972176, 'samples': 13211520, 'steps': 68809, 'loss/train': 1.215707778930664} 11/07/2021 06:54:26 - INFO - __main__ - Step 68811: {'lr': 0.00028800753534778483, 'samples': 13211712, 'steps': 68810, 'loss/train': 0.20287396013736725} 11/07/2021 06:54:27 - INFO - __main__ - Step 68812: {'lr': 0.0002880022902812266, 'samples': 13211904, 'steps': 68811, 'loss/train': 1.3575838804244995} 11/07/2021 06:54:27 - INFO - __main__ - Step 68813: {'lr': 0.00028799704519754505, 'samples': 13212096, 'steps': 68812, 'loss/train': 1.2246707677841187} 11/07/2021 06:54:27 - INFO - __main__ - Step 68814: {'lr': 0.0002879918000967426, 'samples': 13212288, 'steps': 68813, 'loss/train': 1.3546993732452393} 11/07/2021 06:54:29 - INFO - __main__ - Step 68815: {'lr': 0.0002879865549788216, 'samples': 13212480, 'steps': 68814, 'loss/train': 1.7236602306365967} 11/07/2021 06:54:29 - INFO - __main__ - Step 68816: {'lr': 0.0002879813098437845, 'samples': 13212672, 'steps': 68815, 'loss/train': 1.4486937522888184} 11/07/2021 06:54:29 - INFO - __main__ - Step 68817: {'lr': 0.00028797606469163357, 'samples': 13212864, 'steps': 68816, 'loss/train': 1.4338408708572388} 11/07/2021 06:54:30 - INFO - __main__ - Step 68818: {'lr': 0.00028797081952237127, 'samples': 13213056, 'steps': 68817, 'loss/train': 1.588638424873352} 11/07/2021 06:54:30 - INFO - __main__ - Step 68819: {'lr': 0.0002879655743359999, 'samples': 13213248, 'steps': 68818, 'loss/train': 1.2221155166625977} 11/07/2021 06:54:31 - INFO - __main__ - Step 68820: {'lr': 0.0002879603291325217, 'samples': 13213440, 'steps': 68819, 'loss/train': 1.8071449995040894} 11/07/2021 06:54:31 - INFO - __main__ - Step 68821: {'lr': 0.0002879550839119393, 'samples': 13213632, 'steps': 68820, 'loss/train': 1.1308106184005737} 11/07/2021 06:54:32 - INFO - __main__ - Step 68822: {'lr': 0.0002879498386742549, 'samples': 13213824, 'steps': 68821, 'loss/train': 1.9170184135437012} 11/07/2021 06:54:32 - INFO - __main__ - Step 68823: {'lr': 0.0002879445934194709, 'samples': 13214016, 'steps': 68822, 'loss/train': 1.3819119930267334} 11/07/2021 06:54:32 - INFO - __main__ - Step 68824: {'lr': 0.0002879393481475896, 'samples': 13214208, 'steps': 68823, 'loss/train': 1.3610137701034546} 11/07/2021 06:54:33 - INFO - __main__ - Step 68825: {'lr': 0.00028793410285861344, 'samples': 13214400, 'steps': 68824, 'loss/train': 1.1006574630737305} 11/07/2021 06:54:34 - INFO - __main__ - Step 68826: {'lr': 0.0002879288575525447, 'samples': 13214592, 'steps': 68825, 'loss/train': 1.894059181213379} 11/07/2021 06:54:34 - INFO - __main__ - Step 68827: {'lr': 0.0002879236122293859, 'samples': 13214784, 'steps': 68826, 'loss/train': 1.2880157232284546} 11/07/2021 06:54:34 - INFO - __main__ - Step 68828: {'lr': 0.00028791836688913926, 'samples': 13214976, 'steps': 68827, 'loss/train': 2.0020172595977783} 11/07/2021 06:54:35 - INFO - __main__ - Step 68829: {'lr': 0.00028791312153180723, 'samples': 13215168, 'steps': 68828, 'loss/train': 1.3246510028839111} 11/07/2021 06:54:36 - INFO - __main__ - Step 68830: {'lr': 0.0002879078761573921, 'samples': 13215360, 'steps': 68829, 'loss/train': 1.0869559049606323} 11/07/2021 06:54:36 - INFO - __main__ - Step 68831: {'lr': 0.00028790263076589626, 'samples': 13215552, 'steps': 68830, 'loss/train': 1.6209595203399658} 11/07/2021 06:54:37 - INFO - __main__ - Step 68832: {'lr': 0.0002878973853573221, 'samples': 13215744, 'steps': 68831, 'loss/train': 1.745853304862976} 11/07/2021 06:54:37 - INFO - __main__ - Step 68833: {'lr': 0.0002878921399316719, 'samples': 13215936, 'steps': 68832, 'loss/train': 1.5663378238677979} 11/07/2021 06:54:37 - INFO - __main__ - Step 68834: {'lr': 0.0002878868944889482, 'samples': 13216128, 'steps': 68833, 'loss/train': 1.201583743095398} 11/07/2021 06:54:38 - INFO - __main__ - Step 68835: {'lr': 0.00028788164902915315, 'samples': 13216320, 'steps': 68834, 'loss/train': 1.2515606880187988} 11/07/2021 06:54:39 - INFO - __main__ - Step 68836: {'lr': 0.00028787640355228925, 'samples': 13216512, 'steps': 68835, 'loss/train': 1.2097516059875488} 11/07/2021 06:54:39 - INFO - __main__ - Step 68837: {'lr': 0.0002878711580583588, 'samples': 13216704, 'steps': 68836, 'loss/train': 1.2614885568618774} 11/07/2021 06:54:39 - INFO - __main__ - Step 68838: {'lr': 0.00028786591254736417, 'samples': 13216896, 'steps': 68837, 'loss/train': 1.6243563890457153} 11/07/2021 06:54:40 - INFO - __main__ - Step 68839: {'lr': 0.0002878606670193078, 'samples': 13217088, 'steps': 68838, 'loss/train': 0.9331453442573547} 11/07/2021 06:54:40 - INFO - __main__ - Step 68840: {'lr': 0.00028785542147419203, 'samples': 13217280, 'steps': 68839, 'loss/train': 1.219509243965149} 11/07/2021 06:54:41 - INFO - __main__ - Step 68841: {'lr': 0.00028785017591201914, 'samples': 13217472, 'steps': 68840, 'loss/train': 2.1272385120391846} 11/07/2021 06:54:41 - INFO - __main__ - Step 68842: {'lr': 0.00028784493033279153, 'samples': 13217664, 'steps': 68841, 'loss/train': 2.0299651622772217} 11/07/2021 06:54:42 - INFO - __main__ - Step 68843: {'lr': 0.00028783968473651154, 'samples': 13217856, 'steps': 68842, 'loss/train': 1.3273167610168457} 11/07/2021 06:54:42 - INFO - __main__ - Step 68844: {'lr': 0.0002878344391231817, 'samples': 13218048, 'steps': 68843, 'loss/train': 1.2992194890975952} 11/07/2021 06:54:42 - INFO - __main__ - Step 68845: {'lr': 0.0002878291934928041, 'samples': 13218240, 'steps': 68844, 'loss/train': 0.9596202373504639} 11/07/2021 06:54:44 - INFO - __main__ - Step 68846: {'lr': 0.00028782394784538143, 'samples': 13218432, 'steps': 68845, 'loss/train': 1.9360885620117188} 11/07/2021 06:54:44 - INFO - __main__ - Step 68847: {'lr': 0.0002878187021809157, 'samples': 13218624, 'steps': 68846, 'loss/train': 1.4440639019012451} 11/07/2021 06:54:44 - INFO - __main__ - Step 68848: {'lr': 0.00028781345649940955, 'samples': 13218816, 'steps': 68847, 'loss/train': 1.4395310878753662} 11/07/2021 06:54:45 - INFO - __main__ - Step 68849: {'lr': 0.0002878082108008652, 'samples': 13219008, 'steps': 68848, 'loss/train': 1.4562822580337524} 11/07/2021 06:54:45 - INFO - __main__ - Step 68850: {'lr': 0.00028780296508528505, 'samples': 13219200, 'steps': 68849, 'loss/train': 1.442258596420288} 11/07/2021 06:54:46 - INFO - __main__ - Step 68851: {'lr': 0.00028779771935267146, 'samples': 13219392, 'steps': 68850, 'loss/train': 1.6587554216384888} 11/07/2021 06:54:46 - INFO - __main__ - Step 68852: {'lr': 0.00028779247360302684, 'samples': 13219584, 'steps': 68851, 'loss/train': 1.7982509136199951} 11/07/2021 06:54:47 - INFO - __main__ - Step 68853: {'lr': 0.0002877872278363535, 'samples': 13219776, 'steps': 68852, 'loss/train': 1.4764078855514526} 11/07/2021 06:54:47 - INFO - __main__ - Step 68854: {'lr': 0.00028778198205265374, 'samples': 13219968, 'steps': 68853, 'loss/train': 1.5343388319015503} 11/07/2021 06:54:48 - INFO - __main__ - Step 68855: {'lr': 0.00028777673625193014, 'samples': 13220160, 'steps': 68854, 'loss/train': 1.0998382568359375} 11/07/2021 06:54:48 - INFO - __main__ - Step 68856: {'lr': 0.00028777149043418483, 'samples': 13220352, 'steps': 68855, 'loss/train': 1.5341941118240356} 11/07/2021 06:54:49 - INFO - __main__ - Step 68857: {'lr': 0.00028776624459942026, 'samples': 13220544, 'steps': 68856, 'loss/train': 1.1346256732940674} 11/07/2021 06:54:49 - INFO - __main__ - Step 68858: {'lr': 0.0002877609987476388, 'samples': 13220736, 'steps': 68857, 'loss/train': 1.4010471105575562} 11/07/2021 06:54:50 - INFO - __main__ - Step 68859: {'lr': 0.0002877557528788429, 'samples': 13220928, 'steps': 68858, 'loss/train': 1.6056302785873413} 11/07/2021 06:54:50 - INFO - __main__ - Step 68860: {'lr': 0.0002877505069930348, 'samples': 13221120, 'steps': 68859, 'loss/train': 1.6038180589675903} 11/07/2021 06:54:51 - INFO - __main__ - Step 68861: {'lr': 0.00028774526109021694, 'samples': 13221312, 'steps': 68860, 'loss/train': 1.3825795650482178} 11/07/2021 06:54:51 - INFO - __main__ - Step 68862: {'lr': 0.00028774001517039156, 'samples': 13221504, 'steps': 68861, 'loss/train': 2.525428533554077} 11/07/2021 06:54:52 - INFO - __main__ - Step 68863: {'lr': 0.0002877347692335612, 'samples': 13221696, 'steps': 68862, 'loss/train': 1.6654268503189087} 11/07/2021 06:54:52 - INFO - __main__ - Step 68864: {'lr': 0.00028772952327972806, 'samples': 13221888, 'steps': 68863, 'loss/train': 1.508238673210144} 11/07/2021 06:54:52 - INFO - __main__ - Step 68865: {'lr': 0.0002877242773088946, 'samples': 13222080, 'steps': 68864, 'loss/train': 1.251715064048767} 11/07/2021 06:54:53 - INFO - __main__ - Step 68866: {'lr': 0.0002877190313210632, 'samples': 13222272, 'steps': 68865, 'loss/train': 1.3308345079421997} 11/07/2021 06:54:54 - INFO - __main__ - Step 68867: {'lr': 0.00028771378531623613, 'samples': 13222464, 'steps': 68866, 'loss/train': 0.9163389205932617} 11/07/2021 06:54:54 - INFO - __main__ - Step 68868: {'lr': 0.0002877085392944159, 'samples': 13222656, 'steps': 68867, 'loss/train': 1.430783987045288} 11/07/2021 06:54:54 - INFO - __main__ - Step 68869: {'lr': 0.0002877032932556047, 'samples': 13222848, 'steps': 68868, 'loss/train': 0.906940758228302} 11/07/2021 06:54:55 - INFO - __main__ - Step 68870: {'lr': 0.00028769804719980496, 'samples': 13223040, 'steps': 68869, 'loss/train': 1.4446220397949219} 11/07/2021 06:54:56 - INFO - __main__ - Step 68871: {'lr': 0.00028769280112701914, 'samples': 13223232, 'steps': 68870, 'loss/train': 1.3984602689743042} 11/07/2021 06:54:56 - INFO - __main__ - Step 68872: {'lr': 0.0002876875550372495, 'samples': 13223424, 'steps': 68871, 'loss/train': 1.5043494701385498} 11/07/2021 06:54:57 - INFO - __main__ - Step 68873: {'lr': 0.0002876823089304984, 'samples': 13223616, 'steps': 68872, 'loss/train': 1.2409067153930664} 11/07/2021 06:54:57 - INFO - __main__ - Step 68874: {'lr': 0.00028767706280676827, 'samples': 13223808, 'steps': 68873, 'loss/train': 1.4935681819915771} 11/07/2021 06:54:57 - INFO - __main__ - Step 68875: {'lr': 0.0002876718166660614, 'samples': 13224000, 'steps': 68874, 'loss/train': 1.425476312637329} 11/07/2021 06:54:58 - INFO - __main__ - Step 68876: {'lr': 0.0002876665705083802, 'samples': 13224192, 'steps': 68875, 'loss/train': 1.749193549156189} 11/07/2021 06:54:59 - INFO - __main__ - Step 68877: {'lr': 0.00028766132433372707, 'samples': 13224384, 'steps': 68876, 'loss/train': 1.367321491241455} 11/07/2021 06:54:59 - INFO - __main__ - Step 68878: {'lr': 0.00028765607814210424, 'samples': 13224576, 'steps': 68877, 'loss/train': 0.9299314618110657} 11/07/2021 06:54:59 - INFO - __main__ - Step 68879: {'lr': 0.0002876508319335143, 'samples': 13224768, 'steps': 68878, 'loss/train': 1.3156849145889282} 11/07/2021 06:55:00 - INFO - __main__ - Step 68880: {'lr': 0.00028764558570795935, 'samples': 13224960, 'steps': 68879, 'loss/train': 2.0534472465515137} 11/07/2021 06:55:00 - INFO - __main__ - Step 68881: {'lr': 0.00028764033946544195, 'samples': 13225152, 'steps': 68880, 'loss/train': 1.2869504690170288} 11/07/2021 06:55:01 - INFO - __main__ - Step 68882: {'lr': 0.00028763509320596433, 'samples': 13225344, 'steps': 68881, 'loss/train': 1.4744547605514526} 11/07/2021 06:55:02 - INFO - __main__ - Step 68883: {'lr': 0.0002876298469295289, 'samples': 13225536, 'steps': 68882, 'loss/train': 0.13853143155574799} 11/07/2021 06:55:02 - INFO - __main__ - Step 68884: {'lr': 0.00028762460063613815, 'samples': 13225728, 'steps': 68883, 'loss/train': 0.4383002519607544} 11/07/2021 06:55:02 - INFO - __main__ - Step 68885: {'lr': 0.0002876193543257942, 'samples': 13225920, 'steps': 68884, 'loss/train': 1.5656626224517822} 11/07/2021 06:55:03 - INFO - __main__ - Step 68886: {'lr': 0.00028761410799849974, 'samples': 13226112, 'steps': 68885, 'loss/train': 1.6157227754592896} 11/07/2021 06:55:04 - INFO - __main__ - Step 68887: {'lr': 0.0002876088616542568, 'samples': 13226304, 'steps': 68886, 'loss/train': 1.7957937717437744} 11/07/2021 06:55:04 - INFO - __main__ - Step 68888: {'lr': 0.00028760361529306795, 'samples': 13226496, 'steps': 68887, 'loss/train': 1.7916768789291382} 11/07/2021 06:55:05 - INFO - __main__ - Step 68889: {'lr': 0.0002875983689149354, 'samples': 13226688, 'steps': 68888, 'loss/train': 1.3981820344924927} 11/07/2021 06:55:05 - INFO - __main__ - Step 68890: {'lr': 0.0002875931225198617, 'samples': 13226880, 'steps': 68889, 'loss/train': 1.493537425994873} 11/07/2021 06:55:05 - INFO - __main__ - Step 68891: {'lr': 0.0002875878761078491, 'samples': 13227072, 'steps': 68890, 'loss/train': 1.0983390808105469} 11/07/2021 06:55:06 - INFO - __main__ - Step 68892: {'lr': 0.00028758262967889994, 'samples': 13227264, 'steps': 68891, 'loss/train': 1.7687371969223022} 11/07/2021 06:55:07 - INFO - __main__ - Step 68893: {'lr': 0.0002875773832330167, 'samples': 13227456, 'steps': 68892, 'loss/train': 1.5718599557876587} 11/07/2021 06:55:07 - INFO - __main__ - Step 68894: {'lr': 0.0002875721367702016, 'samples': 13227648, 'steps': 68893, 'loss/train': 1.5229616165161133} 11/07/2021 06:55:07 - INFO - __main__ - Step 68895: {'lr': 0.00028756689029045714, 'samples': 13227840, 'steps': 68894, 'loss/train': 1.2042248249053955} 11/07/2021 06:55:08 - INFO - __main__ - Step 68896: {'lr': 0.0002875616437937855, 'samples': 13228032, 'steps': 68895, 'loss/train': 1.5609915256500244} 11/07/2021 06:55:09 - INFO - __main__ - Step 68897: {'lr': 0.0002875563972801893, 'samples': 13228224, 'steps': 68896, 'loss/train': 2.101658821105957} 11/07/2021 06:55:09 - INFO - __main__ - Step 68898: {'lr': 0.00028755115074967065, 'samples': 13228416, 'steps': 68897, 'loss/train': 0.11695777624845505} 11/07/2021 06:55:09 - INFO - __main__ - Step 68899: {'lr': 0.00028754590420223213, 'samples': 13228608, 'steps': 68898, 'loss/train': 1.0952425003051758} 11/07/2021 06:55:10 - INFO - __main__ - Step 68900: {'lr': 0.000287540657637876, 'samples': 13228800, 'steps': 68899, 'loss/train': 1.4780181646347046} 11/07/2021 06:55:10 - INFO - __main__ - Step 68901: {'lr': 0.00028753541105660456, 'samples': 13228992, 'steps': 68900, 'loss/train': 1.5206427574157715} 11/07/2021 06:55:11 - INFO - __main__ - Step 68902: {'lr': 0.0002875301644584203, 'samples': 13229184, 'steps': 68901, 'loss/train': 1.6031594276428223} 11/07/2021 06:55:12 - INFO - __main__ - Step 68903: {'lr': 0.0002875249178433255, 'samples': 13229376, 'steps': 68902, 'loss/train': 1.5170459747314453} 11/07/2021 06:55:12 - INFO - __main__ - Step 68904: {'lr': 0.00028751967121132255, 'samples': 13229568, 'steps': 68903, 'loss/train': 1.2033095359802246} 11/07/2021 06:55:12 - INFO - __main__ - Step 68905: {'lr': 0.00028751442456241376, 'samples': 13229760, 'steps': 68904, 'loss/train': 4.606032848358154} 11/07/2021 06:55:13 - INFO - __main__ - Step 68906: {'lr': 0.00028750917789660167, 'samples': 13229952, 'steps': 68905, 'loss/train': 1.1798127889633179} 11/07/2021 06:55:13 - INFO - __main__ - Step 68907: {'lr': 0.0002875039312138885, 'samples': 13230144, 'steps': 68906, 'loss/train': 1.2679096460342407} 11/07/2021 06:55:14 - INFO - __main__ - Step 68908: {'lr': 0.00028749868451427655, 'samples': 13230336, 'steps': 68907, 'loss/train': 1.5889419317245483} 11/07/2021 06:55:14 - INFO - __main__ - Step 68909: {'lr': 0.0002874934377977683, 'samples': 13230528, 'steps': 68908, 'loss/train': 1.464530110359192} 11/07/2021 06:55:15 - INFO - __main__ - Step 68910: {'lr': 0.0002874881910643661, 'samples': 13230720, 'steps': 68909, 'loss/train': 1.4315434694290161} 11/07/2021 06:55:15 - INFO - __main__ - Step 68911: {'lr': 0.00028748294431407234, 'samples': 13230912, 'steps': 68910, 'loss/train': 1.1619889736175537} 11/07/2021 06:55:15 - INFO - __main__ - Step 68912: {'lr': 0.0002874776975468893, 'samples': 13231104, 'steps': 68911, 'loss/train': 1.5657135248184204} 11/07/2021 06:55:16 - INFO - __main__ - Step 68913: {'lr': 0.0002874724507628195, 'samples': 13231296, 'steps': 68912, 'loss/train': 1.8346271514892578} 11/07/2021 06:55:17 - INFO - __main__ - Step 68914: {'lr': 0.00028746720396186505, 'samples': 13231488, 'steps': 68913, 'loss/train': 1.0269865989685059} 11/07/2021 06:55:17 - INFO - __main__ - Step 68915: {'lr': 0.00028746195714402845, 'samples': 13231680, 'steps': 68914, 'loss/train': 1.5843859910964966} 11/07/2021 06:55:17 - INFO - __main__ - Step 68916: {'lr': 0.00028745671030931214, 'samples': 13231872, 'steps': 68915, 'loss/train': 1.3876928091049194} 11/07/2021 06:55:18 - INFO - __main__ - Step 68917: {'lr': 0.00028745146345771837, 'samples': 13232064, 'steps': 68916, 'loss/train': 0.8730106949806213} 11/07/2021 06:55:19 - INFO - __main__ - Step 68918: {'lr': 0.0002874462165892496, 'samples': 13232256, 'steps': 68917, 'loss/train': 1.2460023164749146} 11/07/2021 06:55:19 - INFO - __main__ - Step 68919: {'lr': 0.00028744096970390807, 'samples': 13232448, 'steps': 68918, 'loss/train': 1.4531586170196533} 11/07/2021 06:55:20 - INFO - __main__ - Step 68920: {'lr': 0.00028743572280169626, 'samples': 13232640, 'steps': 68919, 'loss/train': 1.2715351581573486} 11/07/2021 06:55:20 - INFO - __main__ - Step 68921: {'lr': 0.0002874304758826165, 'samples': 13232832, 'steps': 68920, 'loss/train': 1.4546244144439697} 11/07/2021 06:55:20 - INFO - __main__ - Step 68922: {'lr': 0.00028742522894667114, 'samples': 13233024, 'steps': 68921, 'loss/train': 1.240707278251648} 11/07/2021 06:55:21 - INFO - __main__ - Step 68923: {'lr': 0.00028741998199386255, 'samples': 13233216, 'steps': 68922, 'loss/train': 1.6324728727340698} 11/07/2021 06:55:22 - INFO - __main__ - Step 68924: {'lr': 0.0002874147350241931, 'samples': 13233408, 'steps': 68923, 'loss/train': 0.8737909197807312} 11/07/2021 06:55:22 - INFO - __main__ - Step 68925: {'lr': 0.0002874094880376651, 'samples': 13233600, 'steps': 68924, 'loss/train': 1.1036890745162964} 11/07/2021 06:55:22 - INFO - __main__ - Step 68926: {'lr': 0.000287404241034281, 'samples': 13233792, 'steps': 68925, 'loss/train': 1.3615877628326416} 11/07/2021 06:55:23 - INFO - __main__ - Step 68927: {'lr': 0.0002873989940140432, 'samples': 13233984, 'steps': 68926, 'loss/train': 1.2661333084106445} 11/07/2021 06:55:24 - INFO - __main__ - Step 68928: {'lr': 0.00028739374697695386, 'samples': 13234176, 'steps': 68927, 'loss/train': 1.5532397031784058} 11/07/2021 06:55:24 - INFO - __main__ - Step 68929: {'lr': 0.00028738849992301555, 'samples': 13234368, 'steps': 68928, 'loss/train': 1.1162605285644531} 11/07/2021 06:55:24 - INFO - __main__ - Step 68930: {'lr': 0.0002873832528522305, 'samples': 13234560, 'steps': 68929, 'loss/train': 1.9029494524002075} 11/07/2021 06:55:25 - INFO - __main__ - Step 68931: {'lr': 0.00028737800576460117, 'samples': 13234752, 'steps': 68930, 'loss/train': 0.7072430849075317} 11/07/2021 06:55:25 - INFO - __main__ - Step 68932: {'lr': 0.00028737275866012993, 'samples': 13234944, 'steps': 68931, 'loss/train': 1.8631809949874878} 11/07/2021 06:55:26 - INFO - __main__ - Step 68933: {'lr': 0.0002873675115388191, 'samples': 13235136, 'steps': 68932, 'loss/train': 1.4826445579528809} 11/07/2021 06:55:26 - INFO - __main__ - Step 68934: {'lr': 0.000287362264400671, 'samples': 13235328, 'steps': 68933, 'loss/train': 1.3351417779922485} 11/07/2021 06:55:27 - INFO - __main__ - Step 68935: {'lr': 0.000287357017245688, 'samples': 13235520, 'steps': 68934, 'loss/train': 1.5006864070892334} 11/07/2021 06:55:27 - INFO - __main__ - Step 68936: {'lr': 0.00028735177007387254, 'samples': 13235712, 'steps': 68935, 'loss/train': 0.9772597551345825} 11/07/2021 06:55:27 - INFO - __main__ - Step 68937: {'lr': 0.00028734652288522693, 'samples': 13235904, 'steps': 68936, 'loss/train': 1.36562979221344} 11/07/2021 06:55:28 - INFO - __main__ - Step 68938: {'lr': 0.0002873412756797536, 'samples': 13236096, 'steps': 68937, 'loss/train': 1.6722534894943237} 11/07/2021 06:55:29 - INFO - __main__ - Step 68939: {'lr': 0.0002873360284574549, 'samples': 13236288, 'steps': 68938, 'loss/train': 1.5895389318466187} 11/07/2021 06:55:29 - INFO - __main__ - Step 68940: {'lr': 0.0002873307812183331, 'samples': 13236480, 'steps': 68939, 'loss/train': 1.4695757627487183} 11/07/2021 06:55:30 - INFO - __main__ - Step 68941: {'lr': 0.00028732553396239064, 'samples': 13236672, 'steps': 68940, 'loss/train': 1.375083088874817} 11/07/2021 06:55:30 - INFO - __main__ - Step 68942: {'lr': 0.00028732028668962986, 'samples': 13236864, 'steps': 68941, 'loss/train': 1.302897572517395} 11/07/2021 06:55:30 - INFO - __main__ - Step 68943: {'lr': 0.0002873150394000531, 'samples': 13237056, 'steps': 68942, 'loss/train': 1.3504356145858765} 11/07/2021 06:55:31 - INFO - __main__ - Step 68944: {'lr': 0.0002873097920936628, 'samples': 13237248, 'steps': 68943, 'loss/train': 1.151237964630127} 11/07/2021 06:55:32 - INFO - __main__ - Step 68945: {'lr': 0.0002873045447704613, 'samples': 13237440, 'steps': 68944, 'loss/train': 1.372614860534668} 11/07/2021 06:55:32 - INFO - __main__ - Step 68946: {'lr': 0.00028729929743045096, 'samples': 13237632, 'steps': 68945, 'loss/train': 1.5437469482421875} 11/07/2021 06:55:32 - INFO - __main__ - Step 68947: {'lr': 0.00028729405007363415, 'samples': 13237824, 'steps': 68946, 'loss/train': 1.0832746028900146} 11/07/2021 06:55:33 - INFO - __main__ - Step 68948: {'lr': 0.00028728880270001314, 'samples': 13238016, 'steps': 68947, 'loss/train': 1.9998877048492432} 11/07/2021 06:55:34 - INFO - __main__ - Step 68949: {'lr': 0.0002872835553095904, 'samples': 13238208, 'steps': 68948, 'loss/train': 1.1633950471878052} 11/07/2021 06:55:34 - INFO - __main__ - Step 68950: {'lr': 0.00028727830790236823, 'samples': 13238400, 'steps': 68949, 'loss/train': 1.2925668954849243} 11/07/2021 06:55:34 - INFO - __main__ - Step 68951: {'lr': 0.00028727306047834905, 'samples': 13238592, 'steps': 68950, 'loss/train': 1.5009865760803223} 11/07/2021 06:55:35 - INFO - __main__ - Step 68952: {'lr': 0.0002872678130375353, 'samples': 13238784, 'steps': 68951, 'loss/train': 1.0551140308380127} 11/07/2021 06:55:35 - INFO - __main__ - Step 68953: {'lr': 0.0002872625655799291, 'samples': 13238976, 'steps': 68952, 'loss/train': 1.1508533954620361} 11/07/2021 06:55:35 - INFO - __main__ - Step 68954: {'lr': 0.0002872573181055331, 'samples': 13239168, 'steps': 68953, 'loss/train': 1.2506681680679321} 11/07/2021 06:55:37 - INFO - __main__ - Step 68955: {'lr': 0.00028725207061434943, 'samples': 13239360, 'steps': 68954, 'loss/train': 0.9282740354537964} 11/07/2021 06:55:37 - INFO - __main__ - Step 68956: {'lr': 0.0002872468231063806, 'samples': 13239552, 'steps': 68955, 'loss/train': 0.9492714405059814} 11/07/2021 06:55:37 - INFO - __main__ - Step 68957: {'lr': 0.0002872415755816289, 'samples': 13239744, 'steps': 68956, 'loss/train': 1.3837673664093018} 11/07/2021 06:55:38 - INFO - __main__ - Step 68958: {'lr': 0.0002872363280400967, 'samples': 13239936, 'steps': 68957, 'loss/train': 0.9757857918739319} 11/07/2021 06:55:38 - INFO - __main__ - Step 68959: {'lr': 0.0002872310804817865, 'samples': 13240128, 'steps': 68958, 'loss/train': 1.571765422821045} 11/07/2021 06:55:39 - INFO - __main__ - Step 68960: {'lr': 0.0002872258329067005, 'samples': 13240320, 'steps': 68959, 'loss/train': 1.6151721477508545} 11/07/2021 06:55:39 - INFO - __main__ - Step 68961: {'lr': 0.000287220585314841, 'samples': 13240512, 'steps': 68960, 'loss/train': 1.1381922960281372} 11/07/2021 06:55:40 - INFO - __main__ - Step 68962: {'lr': 0.00028721533770621055, 'samples': 13240704, 'steps': 68961, 'loss/train': 1.2090362310409546} 11/07/2021 06:55:40 - INFO - __main__ - Step 68963: {'lr': 0.0002872100900808115, 'samples': 13240896, 'steps': 68962, 'loss/train': 1.3272781372070312} 11/07/2021 06:55:40 - INFO - __main__ - Step 68964: {'lr': 0.0002872048424386461, 'samples': 13241088, 'steps': 68963, 'loss/train': 1.5502249002456665} 11/07/2021 06:55:41 - INFO - __main__ - Step 68965: {'lr': 0.00028719959477971677, 'samples': 13241280, 'steps': 68964, 'loss/train': 1.290400505065918} 11/07/2021 06:55:42 - INFO - __main__ - Step 68966: {'lr': 0.00028719434710402586, 'samples': 13241472, 'steps': 68965, 'loss/train': 1.4054274559020996} 11/07/2021 06:55:42 - INFO - __main__ - Step 68967: {'lr': 0.0002871890994115758, 'samples': 13241664, 'steps': 68966, 'loss/train': 1.047038197517395} 11/07/2021 06:55:42 - INFO - __main__ - Step 68968: {'lr': 0.0002871838517023689, 'samples': 13241856, 'steps': 68967, 'loss/train': 0.5539084076881409} 11/07/2021 06:55:43 - INFO - __main__ - Step 68969: {'lr': 0.0002871786039764075, 'samples': 13242048, 'steps': 68968, 'loss/train': 1.6859019994735718} 11/07/2021 06:55:44 - INFO - __main__ - Step 68970: {'lr': 0.000287173356233694, 'samples': 13242240, 'steps': 68969, 'loss/train': 1.3177416324615479} 11/07/2021 06:55:44 - INFO - __main__ - Step 68971: {'lr': 0.0002871681084742308, 'samples': 13242432, 'steps': 68970, 'loss/train': 1.0091207027435303} 11/07/2021 06:55:44 - INFO - __main__ - Step 68972: {'lr': 0.00028716286069802017, 'samples': 13242624, 'steps': 68971, 'loss/train': 1.1612074375152588} 11/07/2021 06:55:45 - INFO - __main__ - Step 68973: {'lr': 0.00028715761290506455, 'samples': 13242816, 'steps': 68972, 'loss/train': 1.3303627967834473} 11/07/2021 06:55:45 - INFO - __main__ - Step 68974: {'lr': 0.0002871523650953663, 'samples': 13243008, 'steps': 68973, 'loss/train': 0.9490219950675964} 11/07/2021 06:55:46 - INFO - __main__ - Step 68975: {'lr': 0.0002871471172689277, 'samples': 13243200, 'steps': 68974, 'loss/train': 1.4422160387039185} 11/07/2021 06:55:46 - INFO - __main__ - Step 68976: {'lr': 0.00028714186942575126, 'samples': 13243392, 'steps': 68975, 'loss/train': 1.5008268356323242} 11/07/2021 06:55:47 - INFO - __main__ - Step 68977: {'lr': 0.00028713662156583923, 'samples': 13243584, 'steps': 68976, 'loss/train': 1.5090924501419067} 11/07/2021 06:55:47 - INFO - __main__ - Step 68978: {'lr': 0.00028713137368919405, 'samples': 13243776, 'steps': 68977, 'loss/train': 1.0299303531646729} 11/07/2021 06:55:47 - INFO - __main__ - Step 68979: {'lr': 0.000287126125795818, 'samples': 13243968, 'steps': 68978, 'loss/train': 0.9725791811943054} 11/07/2021 06:55:49 - INFO - __main__ - Step 68980: {'lr': 0.00028712087788571353, 'samples': 13244160, 'steps': 68979, 'loss/train': 1.2196526527404785} 11/07/2021 06:55:49 - INFO - __main__ - Step 68981: {'lr': 0.00028711562995888297, 'samples': 13244352, 'steps': 68980, 'loss/train': 1.4525353908538818} 11/07/2021 06:55:49 - INFO - __main__ - Step 68982: {'lr': 0.00028711038201532864, 'samples': 13244544, 'steps': 68981, 'loss/train': 1.533843994140625} 11/07/2021 06:55:50 - INFO - __main__ - Step 68983: {'lr': 0.00028710513405505293, 'samples': 13244736, 'steps': 68982, 'loss/train': 1.7768781185150146} 11/07/2021 06:55:50 - INFO - __main__ - Step 68984: {'lr': 0.0002870998860780583, 'samples': 13244928, 'steps': 68983, 'loss/train': 1.6348001956939697} 11/07/2021 06:55:51 - INFO - __main__ - Step 68985: {'lr': 0.0002870946380843469, 'samples': 13245120, 'steps': 68984, 'loss/train': 1.078094244003296} 11/07/2021 06:55:51 - INFO - __main__ - Step 68986: {'lr': 0.0002870893900739213, 'samples': 13245312, 'steps': 68985, 'loss/train': 1.5722733736038208} 11/07/2021 06:55:52 - INFO - __main__ - Step 68987: {'lr': 0.00028708414204678385, 'samples': 13245504, 'steps': 68986, 'loss/train': 1.1835976839065552} 11/07/2021 06:55:52 - INFO - __main__ - Step 68988: {'lr': 0.0002870788940029368, 'samples': 13245696, 'steps': 68987, 'loss/train': 1.1114223003387451} 11/07/2021 06:55:52 - INFO - __main__ - Step 68989: {'lr': 0.0002870736459423826, 'samples': 13245888, 'steps': 68988, 'loss/train': 0.8857323527336121} 11/07/2021 06:55:53 - INFO - __main__ - Step 68990: {'lr': 0.0002870683978651236, 'samples': 13246080, 'steps': 68989, 'loss/train': 1.7070600986480713} 11/07/2021 06:55:54 - INFO - __main__ - Step 68991: {'lr': 0.00028706314977116205, 'samples': 13246272, 'steps': 68990, 'loss/train': 1.5972468852996826} 11/07/2021 06:55:54 - INFO - __main__ - Step 68992: {'lr': 0.0002870579016605005, 'samples': 13246464, 'steps': 68991, 'loss/train': 1.2631964683532715} 11/07/2021 06:55:55 - INFO - __main__ - Step 68993: {'lr': 0.0002870526535331413, 'samples': 13246656, 'steps': 68992, 'loss/train': 1.2972602844238281} 11/07/2021 06:55:55 - INFO - __main__ - Step 68994: {'lr': 0.00028704740538908663, 'samples': 13246848, 'steps': 68993, 'loss/train': 1.3590611219406128} 11/07/2021 06:55:55 - INFO - __main__ - Step 68995: {'lr': 0.000287042157228339, 'samples': 13247040, 'steps': 68994, 'loss/train': 1.3825527429580688} 11/07/2021 06:55:56 - INFO - __main__ - Step 68996: {'lr': 0.00028703690905090075, 'samples': 13247232, 'steps': 68995, 'loss/train': 1.380897879600525} 11/07/2021 06:55:57 - INFO - __main__ - Step 68997: {'lr': 0.00028703166085677423, 'samples': 13247424, 'steps': 68996, 'loss/train': 1.6823382377624512} 11/07/2021 06:55:57 - INFO - __main__ - Step 68998: {'lr': 0.0002870264126459618, 'samples': 13247616, 'steps': 68997, 'loss/train': 1.450571894645691} 11/07/2021 06:55:57 - INFO - __main__ - Step 68999: {'lr': 0.00028702116441846586, 'samples': 13247808, 'steps': 68998, 'loss/train': 1.1970969438552856} 11/07/2021 06:55:58 - INFO - __main__ - Step 69000: {'lr': 0.0002870159161742888, 'samples': 13248000, 'steps': 68999, 'loss/train': 1.555063009262085} 11/07/2021 06:55:59 - INFO - __main__ - Step 69001: {'lr': 0.00028701066791343287, 'samples': 13248192, 'steps': 69000, 'loss/train': 0.5971619486808777} 11/07/2021 06:56:00 - INFO - __main__ - Step 69002: {'lr': 0.0002870054196359005, 'samples': 13248384, 'steps': 69001, 'loss/train': 0.7935200333595276} 11/07/2021 06:56:00 - INFO - __main__ - Step 69003: {'lr': 0.0002870001713416941, 'samples': 13248576, 'steps': 69002, 'loss/train': 0.5523234009742737} 11/07/2021 06:56:00 - INFO - __main__ - Step 69004: {'lr': 0.00028699492303081606, 'samples': 13248768, 'steps': 69003, 'loss/train': 1.7045564651489258} 11/07/2021 06:56:01 - INFO - __main__ - Step 69005: {'lr': 0.00028698967470326854, 'samples': 13248960, 'steps': 69004, 'loss/train': 1.500350832939148} 11/07/2021 06:56:01 - INFO - __main__ - Step 69006: {'lr': 0.00028698442635905413, 'samples': 13249152, 'steps': 69005, 'loss/train': 1.724902868270874} 11/07/2021 06:56:02 - INFO - __main__ - Step 69007: {'lr': 0.00028697917799817515, 'samples': 13249344, 'steps': 69006, 'loss/train': 1.060078501701355} 11/07/2021 06:56:02 - INFO - __main__ - Step 69008: {'lr': 0.0002869739296206338, 'samples': 13249536, 'steps': 69007, 'loss/train': 1.3363006114959717} 11/07/2021 06:56:03 - INFO - __main__ - Step 69009: {'lr': 0.00028696868122643265, 'samples': 13249728, 'steps': 69008, 'loss/train': 1.3495169878005981} 11/07/2021 06:56:03 - INFO - __main__ - Step 69010: {'lr': 0.00028696343281557396, 'samples': 13249920, 'steps': 69009, 'loss/train': 0.5474924445152283} 11/07/2021 06:56:03 - INFO - __main__ - Step 69011: {'lr': 0.0002869581843880601, 'samples': 13250112, 'steps': 69010, 'loss/train': 1.7163450717926025} 11/07/2021 06:56:04 - INFO - __main__ - Step 69012: {'lr': 0.0002869529359438935, 'samples': 13250304, 'steps': 69011, 'loss/train': 1.9699296951293945} 11/07/2021 06:56:05 - INFO - __main__ - Step 69013: {'lr': 0.00028694768748307645, 'samples': 13250496, 'steps': 69012, 'loss/train': 1.4558416604995728} 11/07/2021 06:56:05 - INFO - __main__ - Step 69014: {'lr': 0.00028694243900561137, 'samples': 13250688, 'steps': 69013, 'loss/train': 3.574247121810913} 11/07/2021 06:56:05 - INFO - __main__ - Step 69015: {'lr': 0.00028693719051150053, 'samples': 13250880, 'steps': 69014, 'loss/train': 1.6216717958450317} 11/07/2021 06:56:06 - INFO - __main__ - Step 69016: {'lr': 0.00028693194200074643, 'samples': 13251072, 'steps': 69015, 'loss/train': 0.34590283036231995} 11/07/2021 06:56:07 - INFO - __main__ - Step 69017: {'lr': 0.00028692669347335134, 'samples': 13251264, 'steps': 69016, 'loss/train': 1.4431560039520264} 11/07/2021 06:56:08 - INFO - __main__ - Step 69018: {'lr': 0.0002869214449293176, 'samples': 13251456, 'steps': 69017, 'loss/train': 1.5473229885101318} 11/07/2021 06:56:08 - INFO - __main__ - Step 69019: {'lr': 0.0002869161963686477, 'samples': 13251648, 'steps': 69018, 'loss/train': 1.376226782798767} 11/07/2021 06:56:08 - INFO - __main__ - Step 69020: {'lr': 0.0002869109477913439, 'samples': 13251840, 'steps': 69019, 'loss/train': 1.8477165699005127} 11/07/2021 06:56:09 - INFO - __main__ - Step 69021: {'lr': 0.00028690569919740864, 'samples': 13252032, 'steps': 69020, 'loss/train': 1.4141840934753418} 11/07/2021 06:56:09 - INFO - __main__ - Step 69022: {'lr': 0.0002869004505868442, 'samples': 13252224, 'steps': 69021, 'loss/train': 1.7129647731781006} 11/07/2021 06:56:10 - INFO - __main__ - Step 69023: {'lr': 0.00028689520195965295, 'samples': 13252416, 'steps': 69022, 'loss/train': 1.6945524215698242} 11/07/2021 06:56:11 - INFO - __main__ - Step 69024: {'lr': 0.0002868899533158373, 'samples': 13252608, 'steps': 69023, 'loss/train': 1.1881041526794434} 11/07/2021 06:56:11 - INFO - __main__ - Step 69025: {'lr': 0.0002868847046553997, 'samples': 13252800, 'steps': 69024, 'loss/train': 0.9607878923416138} 11/07/2021 06:56:11 - INFO - __main__ - Step 69026: {'lr': 0.0002868794559783423, 'samples': 13252992, 'steps': 69025, 'loss/train': 1.5151525735855103} 11/07/2021 06:56:12 - INFO - __main__ - Step 69027: {'lr': 0.0002868742072846677, 'samples': 13253184, 'steps': 69026, 'loss/train': 1.3868385553359985} 11/07/2021 06:56:13 - INFO - __main__ - Step 69028: {'lr': 0.0002868689585743781, 'samples': 13253376, 'steps': 69027, 'loss/train': 1.2734731435775757} 11/07/2021 06:56:13 - INFO - __main__ - Step 69029: {'lr': 0.0002868637098474759, 'samples': 13253568, 'steps': 69028, 'loss/train': 0.6665225028991699} 11/07/2021 06:56:13 - INFO - __main__ - Step 69030: {'lr': 0.00028685846110396347, 'samples': 13253760, 'steps': 69029, 'loss/train': 1.5360394716262817} 11/07/2021 06:56:14 - INFO - __main__ - Step 69031: {'lr': 0.0002868532123438432, 'samples': 13253952, 'steps': 69030, 'loss/train': 1.414385437965393} 11/07/2021 06:56:14 - INFO - __main__ - Step 69032: {'lr': 0.00028684796356711744, 'samples': 13254144, 'steps': 69031, 'loss/train': 1.5060175657272339} 11/07/2021 06:56:15 - INFO - __main__ - Step 69033: {'lr': 0.0002868427147737886, 'samples': 13254336, 'steps': 69032, 'loss/train': 1.3301931619644165} 11/07/2021 06:56:15 - INFO - __main__ - Step 69034: {'lr': 0.000286837465963859, 'samples': 13254528, 'steps': 69033, 'loss/train': 1.700761079788208} 11/07/2021 06:56:16 - INFO - __main__ - Step 69035: {'lr': 0.000286832217137331, 'samples': 13254720, 'steps': 69034, 'loss/train': 1.1522566080093384} 11/07/2021 06:56:16 - INFO - __main__ - Step 69036: {'lr': 0.0002868269682942069, 'samples': 13254912, 'steps': 69035, 'loss/train': 1.5434985160827637} 11/07/2021 06:56:17 - INFO - __main__ - Step 69037: {'lr': 0.0002868217194344891, 'samples': 13255104, 'steps': 69036, 'loss/train': 0.5878632068634033} 11/07/2021 06:56:17 - INFO - __main__ - Step 69038: {'lr': 0.00028681647055818016, 'samples': 13255296, 'steps': 69037, 'loss/train': 1.1717841625213623} 11/07/2021 06:56:18 - INFO - __main__ - Step 69039: {'lr': 0.00028681122166528215, 'samples': 13255488, 'steps': 69038, 'loss/train': 1.4345955848693848} 11/07/2021 06:56:18 - INFO - __main__ - Step 69040: {'lr': 0.00028680597275579774, 'samples': 13255680, 'steps': 69039, 'loss/train': 1.2996506690979004} 11/07/2021 06:56:19 - INFO - __main__ - Step 69041: {'lr': 0.000286800723829729, 'samples': 13255872, 'steps': 69040, 'loss/train': 1.6467556953430176} 11/07/2021 06:56:19 - INFO - __main__ - Step 69042: {'lr': 0.0002867954748870784, 'samples': 13256064, 'steps': 69041, 'loss/train': 1.192861557006836} 11/07/2021 06:56:19 - INFO - __main__ - Step 69043: {'lr': 0.00028679022592784835, 'samples': 13256256, 'steps': 69042, 'loss/train': 1.3712530136108398} 11/07/2021 06:56:20 - INFO - __main__ - Step 69044: {'lr': 0.00028678497695204123, 'samples': 13256448, 'steps': 69043, 'loss/train': 1.4911701679229736} 11/07/2021 06:56:21 - INFO - __main__ - Step 69045: {'lr': 0.0002867797279596593, 'samples': 13256640, 'steps': 69044, 'loss/train': 1.209159255027771} 11/07/2021 06:56:21 - INFO - __main__ - Step 69046: {'lr': 0.00028677447895070505, 'samples': 13256832, 'steps': 69045, 'loss/train': 1.3573557138442993} 11/07/2021 06:56:21 - INFO - __main__ - Step 69047: {'lr': 0.0002867692299251808, 'samples': 13257024, 'steps': 69046, 'loss/train': 1.5711727142333984} 11/07/2021 06:56:22 - INFO - __main__ - Step 69048: {'lr': 0.00028676398088308894, 'samples': 13257216, 'steps': 69047, 'loss/train': 1.1011545658111572} 11/07/2021 06:56:23 - INFO - __main__ - Step 69049: {'lr': 0.0002867587318244317, 'samples': 13257408, 'steps': 69048, 'loss/train': 1.3667947053909302} 11/07/2021 06:56:23 - INFO - __main__ - Step 69050: {'lr': 0.0002867534827492116, 'samples': 13257600, 'steps': 69049, 'loss/train': 1.4563266038894653} 11/07/2021 06:56:23 - INFO - __main__ - Step 69051: {'lr': 0.0002867482336574309, 'samples': 13257792, 'steps': 69050, 'loss/train': 1.109912633895874} 11/07/2021 06:56:24 - INFO - __main__ - Step 69052: {'lr': 0.00028674298454909203, 'samples': 13257984, 'steps': 69051, 'loss/train': 0.9857238531112671} 11/07/2021 06:56:24 - INFO - __main__ - Step 69053: {'lr': 0.00028673773542419736, 'samples': 13258176, 'steps': 69052, 'loss/train': 1.407091498374939} 11/07/2021 06:56:25 - INFO - __main__ - Step 69054: {'lr': 0.00028673248628274925, 'samples': 13258368, 'steps': 69053, 'loss/train': 1.5396255254745483} 11/07/2021 06:56:25 - INFO - __main__ - Step 69055: {'lr': 0.00028672723712475003, 'samples': 13258560, 'steps': 69054, 'loss/train': 1.8101540803909302} 11/07/2021 06:56:26 - INFO - __main__ - Step 69056: {'lr': 0.00028672198795020204, 'samples': 13258752, 'steps': 69055, 'loss/train': 1.3862489461898804} 11/07/2021 06:56:26 - INFO - __main__ - Step 69057: {'lr': 0.0002867167387591077, 'samples': 13258944, 'steps': 69056, 'loss/train': 2.0331928730010986} 11/07/2021 06:56:26 - INFO - __main__ - Step 69058: {'lr': 0.00028671148955146944, 'samples': 13259136, 'steps': 69057, 'loss/train': 1.4418970346450806} 11/07/2021 06:56:27 - INFO - __main__ - Step 69059: {'lr': 0.00028670624032728944, 'samples': 13259328, 'steps': 69058, 'loss/train': 1.4472780227661133} 11/07/2021 06:56:28 - INFO - __main__ - Step 69060: {'lr': 0.0002867009910865702, 'samples': 13259520, 'steps': 69059, 'loss/train': 1.3684754371643066} 11/07/2021 06:56:28 - INFO - __main__ - Step 69061: {'lr': 0.00028669574182931413, 'samples': 13259712, 'steps': 69060, 'loss/train': 1.8470221757888794} 11/07/2021 06:56:29 - INFO - __main__ - Step 69062: {'lr': 0.0002866904925555235, 'samples': 13259904, 'steps': 69061, 'loss/train': 1.2852203845977783} 11/07/2021 06:56:29 - INFO - __main__ - Step 69063: {'lr': 0.0002866852432652007, 'samples': 13260096, 'steps': 69062, 'loss/train': 1.7802534103393555} 11/07/2021 06:56:29 - INFO - __main__ - Step 69064: {'lr': 0.00028667999395834805, 'samples': 13260288, 'steps': 69063, 'loss/train': 1.3557896614074707} 11/07/2021 06:56:31 - INFO - __main__ - Step 69065: {'lr': 0.000286674744634968, 'samples': 13260480, 'steps': 69064, 'loss/train': 1.033657193183899} 11/07/2021 06:56:31 - INFO - __main__ - Step 69066: {'lr': 0.00028666949529506286, 'samples': 13260672, 'steps': 69065, 'loss/train': 1.2385368347167969} 11/07/2021 06:56:31 - INFO - __main__ - Step 69067: {'lr': 0.000286664245938635, 'samples': 13260864, 'steps': 69066, 'loss/train': 1.5927221775054932} 11/07/2021 06:56:32 - INFO - __main__ - Step 69068: {'lr': 0.0002866589965656868, 'samples': 13261056, 'steps': 69067, 'loss/train': 0.8617063760757446} 11/07/2021 06:56:32 - INFO - __main__ - Step 69069: {'lr': 0.0002866537471762207, 'samples': 13261248, 'steps': 69068, 'loss/train': 0.7203162908554077} 11/07/2021 06:56:34 - INFO - __main__ - Step 69070: {'lr': 0.0002866484977702389, 'samples': 13261440, 'steps': 69069, 'loss/train': 1.8268057107925415} 11/07/2021 06:56:34 - INFO - __main__ - Step 69071: {'lr': 0.00028664324834774385, 'samples': 13261632, 'steps': 69070, 'loss/train': 1.4764834642410278} 11/07/2021 06:56:34 - INFO - __main__ - Step 69072: {'lr': 0.00028663799890873797, 'samples': 13261824, 'steps': 69071, 'loss/train': 1.4043766260147095} 11/07/2021 06:56:35 - INFO - __main__ - Step 69073: {'lr': 0.00028663274945322354, 'samples': 13262016, 'steps': 69072, 'loss/train': 1.7864995002746582} 11/07/2021 06:56:35 - INFO - __main__ - Step 69074: {'lr': 0.00028662749998120294, 'samples': 13262208, 'steps': 69073, 'loss/train': 1.7080694437026978} 11/07/2021 06:56:35 - INFO - __main__ - Step 69075: {'lr': 0.0002866222504926786, 'samples': 13262400, 'steps': 69074, 'loss/train': 1.67825448513031} 11/07/2021 06:56:36 - INFO - __main__ - Step 69076: {'lr': 0.00028661700098765285, 'samples': 13262592, 'steps': 69075, 'loss/train': 1.0733956098556519} 11/07/2021 06:56:37 - INFO - __main__ - Step 69077: {'lr': 0.000286611751466128, 'samples': 13262784, 'steps': 69076, 'loss/train': 1.2088364362716675} 11/07/2021 06:56:38 - INFO - __main__ - Step 69078: {'lr': 0.00028660650192810646, 'samples': 13262976, 'steps': 69077, 'loss/train': 1.5185123682022095} 11/07/2021 06:56:38 - INFO - __main__ - Step 69079: {'lr': 0.0002866012523735906, 'samples': 13263168, 'steps': 69078, 'loss/train': 0.8983421325683594} 11/07/2021 06:56:38 - INFO - __main__ - Step 69080: {'lr': 0.0002865960028025828, 'samples': 13263360, 'steps': 69079, 'loss/train': 1.4060081243515015} 11/07/2021 06:56:39 - INFO - __main__ - Step 69081: {'lr': 0.00028659075321508544, 'samples': 13263552, 'steps': 69080, 'loss/train': 0.20737457275390625} 11/07/2021 06:56:40 - INFO - __main__ - Step 69082: {'lr': 0.00028658550361110075, 'samples': 13263744, 'steps': 69081, 'loss/train': 1.587318778038025} 11/07/2021 06:56:40 - INFO - __main__ - Step 69083: {'lr': 0.00028658025399063125, 'samples': 13263936, 'steps': 69082, 'loss/train': 0.740216851234436} 11/07/2021 06:56:41 - INFO - __main__ - Step 69084: {'lr': 0.00028657500435367927, 'samples': 13264128, 'steps': 69083, 'loss/train': 0.8854300379753113} 11/07/2021 06:56:41 - INFO - __main__ - Step 69085: {'lr': 0.0002865697547002471, 'samples': 13264320, 'steps': 69084, 'loss/train': 1.6349873542785645} 11/07/2021 06:56:41 - INFO - __main__ - Step 69086: {'lr': 0.0002865645050303372, 'samples': 13264512, 'steps': 69085, 'loss/train': 2.0956389904022217} 11/07/2021 06:56:42 - INFO - __main__ - Step 69087: {'lr': 0.000286559255343952, 'samples': 13264704, 'steps': 69086, 'loss/train': 1.2000313997268677} 11/07/2021 06:56:43 - INFO - __main__ - Step 69088: {'lr': 0.0002865540056410936, 'samples': 13264896, 'steps': 69087, 'loss/train': 1.5246450901031494} 11/07/2021 06:56:43 - INFO - __main__ - Step 69089: {'lr': 0.0002865487559217646, 'samples': 13265088, 'steps': 69088, 'loss/train': 1.3079895973205566} 11/07/2021 06:56:43 - INFO - __main__ - Step 69090: {'lr': 0.0002865435061859673, 'samples': 13265280, 'steps': 69089, 'loss/train': 1.0507279634475708} 11/07/2021 06:56:44 - INFO - __main__ - Step 69091: {'lr': 0.000286538256433704, 'samples': 13265472, 'steps': 69090, 'loss/train': 1.398301362991333} 11/07/2021 06:56:44 - INFO - __main__ - Step 69092: {'lr': 0.0002865330066649773, 'samples': 13265664, 'steps': 69091, 'loss/train': 1.5465784072875977} 11/07/2021 06:56:45 - INFO - __main__ - Step 69093: {'lr': 0.0002865277568797892, 'samples': 13265856, 'steps': 69092, 'loss/train': 1.6585279703140259} 11/07/2021 06:56:45 - INFO - __main__ - Step 69094: {'lr': 0.0002865225070781423, 'samples': 13266048, 'steps': 69093, 'loss/train': 1.1148093938827515} 11/07/2021 06:56:46 - INFO - __main__ - Step 69095: {'lr': 0.000286517257260039, 'samples': 13266240, 'steps': 69094, 'loss/train': 1.2144172191619873} 11/07/2021 06:56:46 - INFO - __main__ - Step 69096: {'lr': 0.0002865120074254815, 'samples': 13266432, 'steps': 69095, 'loss/train': 0.23669831454753876} 11/07/2021 06:56:46 - INFO - __main__ - Step 69097: {'lr': 0.00028650675757447224, 'samples': 13266624, 'steps': 69096, 'loss/train': 1.588027000427246} 11/07/2021 06:56:47 - INFO - __main__ - Step 69098: {'lr': 0.00028650150770701373, 'samples': 13266816, 'steps': 69097, 'loss/train': 1.6482049226760864} 11/07/2021 06:56:48 - INFO - __main__ - Step 69099: {'lr': 0.00028649625782310804, 'samples': 13267008, 'steps': 69098, 'loss/train': 1.2910990715026855} 11/07/2021 06:56:48 - INFO - __main__ - Step 69100: {'lr': 0.0002864910079227578, 'samples': 13267200, 'steps': 69099, 'loss/train': 1.5384615659713745} 11/07/2021 06:56:48 - INFO - __main__ - Step 69101: {'lr': 0.0002864857580059653, 'samples': 13267392, 'steps': 69100, 'loss/train': 1.208631992340088} 11/07/2021 06:56:49 - INFO - __main__ - Step 69102: {'lr': 0.0002864805080727328, 'samples': 13267584, 'steps': 69101, 'loss/train': 0.971734881401062} 11/07/2021 06:56:50 - INFO - __main__ - Step 69103: {'lr': 0.0002864752581230628, 'samples': 13267776, 'steps': 69102, 'loss/train': 1.186853289604187} 11/07/2021 06:56:50 - INFO - __main__ - Step 69104: {'lr': 0.00028647000815695757, 'samples': 13267968, 'steps': 69103, 'loss/train': 1.642696499824524} 11/07/2021 06:56:51 - INFO - __main__ - Step 69105: {'lr': 0.0002864647581744195, 'samples': 13268160, 'steps': 69104, 'loss/train': 1.5428794622421265} 11/07/2021 06:56:51 - INFO - __main__ - Step 69106: {'lr': 0.0002864595081754511, 'samples': 13268352, 'steps': 69105, 'loss/train': 1.4645195007324219} 11/07/2021 06:56:51 - INFO - __main__ - Step 69107: {'lr': 0.00028645425816005443, 'samples': 13268544, 'steps': 69106, 'loss/train': 1.3274604082107544} 11/07/2021 06:56:52 - INFO - __main__ - Step 69108: {'lr': 0.0002864490081282322, 'samples': 13268736, 'steps': 69107, 'loss/train': 1.3562471866607666} 11/07/2021 06:56:53 - INFO - __main__ - Step 69109: {'lr': 0.00028644375807998653, 'samples': 13268928, 'steps': 69108, 'loss/train': 1.2609643936157227} 11/07/2021 06:56:53 - INFO - __main__ - Step 69110: {'lr': 0.00028643850801531983, 'samples': 13269120, 'steps': 69109, 'loss/train': 1.218287467956543} 11/07/2021 06:56:54 - INFO - __main__ - Step 69111: {'lr': 0.0002864332579342345, 'samples': 13269312, 'steps': 69110, 'loss/train': 1.4411823749542236} 11/07/2021 06:56:54 - INFO - __main__ - Step 69112: {'lr': 0.000286428007836733, 'samples': 13269504, 'steps': 69111, 'loss/train': 1.4750704765319824} 11/07/2021 06:56:55 - INFO - __main__ - Step 69113: {'lr': 0.00028642275772281753, 'samples': 13269696, 'steps': 69112, 'loss/train': 1.1310231685638428} 11/07/2021 06:56:55 - INFO - __main__ - Step 69114: {'lr': 0.0002864175075924906, 'samples': 13269888, 'steps': 69113, 'loss/train': 0.8309020400047302} 11/07/2021 06:56:56 - INFO - __main__ - Step 69115: {'lr': 0.0002864122574457544, 'samples': 13270080, 'steps': 69114, 'loss/train': 1.3652012348175049} 11/07/2021 06:56:56 - INFO - __main__ - Step 69116: {'lr': 0.00028640700728261144, 'samples': 13270272, 'steps': 69115, 'loss/train': 1.654276967048645} 11/07/2021 06:56:56 - INFO - __main__ - Step 69117: {'lr': 0.00028640175710306404, 'samples': 13270464, 'steps': 69116, 'loss/train': 1.2626488208770752} 11/07/2021 06:56:57 - INFO - __main__ - Step 69118: {'lr': 0.00028639650690711455, 'samples': 13270656, 'steps': 69117, 'loss/train': 2.0365028381347656} 11/07/2021 06:56:58 - INFO - __main__ - Step 69119: {'lr': 0.0002863912566947654, 'samples': 13270848, 'steps': 69118, 'loss/train': 1.3372503519058228} 11/07/2021 06:56:58 - INFO - __main__ - Step 69120: {'lr': 0.0002863860064660189, 'samples': 13271040, 'steps': 69119, 'loss/train': 1.4568172693252563} 11/07/2021 06:56:58 - INFO - __main__ - Step 69121: {'lr': 0.00028638075622087745, 'samples': 13271232, 'steps': 69120, 'loss/train': 1.3584502935409546} 11/07/2021 06:56:59 - INFO - __main__ - Step 69122: {'lr': 0.0002863755059593434, 'samples': 13271424, 'steps': 69121, 'loss/train': 1.788694977760315} 11/07/2021 06:56:59 - INFO - __main__ - Step 69123: {'lr': 0.000286370255681419, 'samples': 13271616, 'steps': 69122, 'loss/train': 1.8174537420272827} 11/07/2021 06:57:00 - INFO - __main__ - Step 69124: {'lr': 0.0002863650053871068, 'samples': 13271808, 'steps': 69123, 'loss/train': 1.7934218645095825} 11/07/2021 06:57:00 - INFO - __main__ - Step 69125: {'lr': 0.0002863597550764091, 'samples': 13272000, 'steps': 69124, 'loss/train': 0.4041271507740021} 11/07/2021 06:57:01 - INFO - __main__ - Step 69126: {'lr': 0.0002863545047493282, 'samples': 13272192, 'steps': 69125, 'loss/train': 1.9239490032196045} 11/07/2021 06:57:01 - INFO - __main__ - Step 69127: {'lr': 0.0002863492544058666, 'samples': 13272384, 'steps': 69126, 'loss/train': 1.5626877546310425} 11/07/2021 06:57:01 - INFO - __main__ - Step 69128: {'lr': 0.00028634400404602654, 'samples': 13272576, 'steps': 69127, 'loss/train': 1.5234904289245605} 11/07/2021 06:57:03 - INFO - __main__ - Step 69129: {'lr': 0.00028633875366981045, 'samples': 13272768, 'steps': 69128, 'loss/train': 1.7100430727005005} 11/07/2021 06:57:03 - INFO - __main__ - Step 69130: {'lr': 0.0002863335032772207, 'samples': 13272960, 'steps': 69129, 'loss/train': 1.5069557428359985} 11/07/2021 06:57:04 - INFO - __main__ - Step 69131: {'lr': 0.00028632825286825956, 'samples': 13273152, 'steps': 69130, 'loss/train': 1.3284443616867065} 11/07/2021 06:57:04 - INFO - __main__ - Step 69132: {'lr': 0.00028632300244292954, 'samples': 13273344, 'steps': 69131, 'loss/train': 0.298868328332901} 11/07/2021 06:57:04 - INFO - __main__ - Step 69133: {'lr': 0.0002863177520012329, 'samples': 13273536, 'steps': 69132, 'loss/train': 1.3587360382080078} 11/07/2021 06:57:05 - INFO - __main__ - Step 69134: {'lr': 0.000286312501543172, 'samples': 13273728, 'steps': 69133, 'loss/train': 1.217275619506836} 11/07/2021 06:57:06 - INFO - __main__ - Step 69135: {'lr': 0.0002863072510687493, 'samples': 13273920, 'steps': 69134, 'loss/train': 1.2136849164962769} 11/07/2021 06:57:06 - INFO - __main__ - Step 69136: {'lr': 0.0002863020005779672, 'samples': 13274112, 'steps': 69135, 'loss/train': 1.3615630865097046} 11/07/2021 06:57:06 - INFO - __main__ - Step 69137: {'lr': 0.00028629675007082783, 'samples': 13274304, 'steps': 69136, 'loss/train': 1.756145715713501} 11/07/2021 06:57:07 - INFO - __main__ - Step 69138: {'lr': 0.00028629149954733377, 'samples': 13274496, 'steps': 69137, 'loss/train': 0.9486337900161743} 11/07/2021 06:57:08 - INFO - __main__ - Step 69139: {'lr': 0.0002862862490074873, 'samples': 13274688, 'steps': 69138, 'loss/train': 1.1829469203948975} 11/07/2021 06:57:08 - INFO - __main__ - Step 69140: {'lr': 0.0002862809984512908, 'samples': 13274880, 'steps': 69139, 'loss/train': 1.1300321817398071} 11/07/2021 06:57:08 - INFO - __main__ - Step 69141: {'lr': 0.00028627574787874673, 'samples': 13275072, 'steps': 69140, 'loss/train': 1.4841694831848145} 11/07/2021 06:57:09 - INFO - __main__ - Step 69142: {'lr': 0.0002862704972898573, 'samples': 13275264, 'steps': 69141, 'loss/train': 1.4997339248657227} 11/07/2021 06:57:09 - INFO - __main__ - Step 69143: {'lr': 0.00028626524668462494, 'samples': 13275456, 'steps': 69142, 'loss/train': 1.1033029556274414} 11/07/2021 06:57:10 - INFO - __main__ - Step 69144: {'lr': 0.000286259996063052, 'samples': 13275648, 'steps': 69143, 'loss/train': 1.5563961267471313} 11/07/2021 06:57:11 - INFO - __main__ - Step 69145: {'lr': 0.00028625474542514083, 'samples': 13275840, 'steps': 69144, 'loss/train': 1.7576802968978882} 11/07/2021 06:57:11 - INFO - __main__ - Step 69146: {'lr': 0.0002862494947708939, 'samples': 13276032, 'steps': 69145, 'loss/train': 1.1451499462127686} 11/07/2021 06:57:11 - INFO - __main__ - Step 69147: {'lr': 0.00028624424410031354, 'samples': 13276224, 'steps': 69146, 'loss/train': 0.9687842130661011} 11/07/2021 06:57:12 - INFO - __main__ - Step 69148: {'lr': 0.00028623899341340207, 'samples': 13276416, 'steps': 69147, 'loss/train': 1.3178061246871948} 11/07/2021 06:57:14 - INFO - __main__ - Step 69149: {'lr': 0.0002862337427101618, 'samples': 13276608, 'steps': 69148, 'loss/train': 1.461028814315796} 11/07/2021 06:57:14 - INFO - __main__ - Step 69150: {'lr': 0.0002862284919905952, 'samples': 13276800, 'steps': 69149, 'loss/train': 0.9381246566772461} 11/07/2021 06:57:15 - INFO - __main__ - Step 69151: {'lr': 0.00028622324125470464, 'samples': 13276992, 'steps': 69150, 'loss/train': 1.096596360206604} 11/07/2021 06:57:15 - INFO - __main__ - Step 69152: {'lr': 0.0002862179905024924, 'samples': 13277184, 'steps': 69151, 'loss/train': 1.372957468032837} 11/07/2021 06:57:15 - INFO - __main__ - Step 69153: {'lr': 0.00028621273973396087, 'samples': 13277376, 'steps': 69152, 'loss/train': 0.7879542708396912} 11/07/2021 06:57:16 - INFO - __main__ - Step 69154: {'lr': 0.00028620748894911245, 'samples': 13277568, 'steps': 69153, 'loss/train': 0.17341144382953644} 11/07/2021 06:57:16 - INFO - __main__ - Step 69155: {'lr': 0.00028620223814794954, 'samples': 13277760, 'steps': 69154, 'loss/train': 1.9564558267593384} 11/07/2021 06:57:17 - INFO - __main__ - Step 69156: {'lr': 0.00028619698733047444, 'samples': 13277952, 'steps': 69155, 'loss/train': 1.8816955089569092} 11/07/2021 06:57:17 - INFO - __main__ - Step 69157: {'lr': 0.0002861917364966896, 'samples': 13278144, 'steps': 69156, 'loss/train': 1.816093921661377} 11/07/2021 06:57:18 - INFO - __main__ - Step 69158: {'lr': 0.0002861864856465972, 'samples': 13278336, 'steps': 69157, 'loss/train': 1.181196928024292} 11/07/2021 06:57:18 - INFO - __main__ - Step 69159: {'lr': 0.0002861812347801998, 'samples': 13278528, 'steps': 69158, 'loss/train': 1.0874903202056885} 11/07/2021 06:57:19 - INFO - __main__ - Step 69160: {'lr': 0.00028617598389749966, 'samples': 13278720, 'steps': 69159, 'loss/train': 1.489866018295288} 11/07/2021 06:57:19 - INFO - __main__ - Step 69161: {'lr': 0.0002861707329984992, 'samples': 13278912, 'steps': 69160, 'loss/train': 0.1327560991048813} 11/07/2021 06:57:19 - INFO - __main__ - Step 69162: {'lr': 0.00028616548208320073, 'samples': 13279104, 'steps': 69161, 'loss/train': 1.7163227796554565} 11/07/2021 06:57:20 - INFO - __main__ - Step 69163: {'lr': 0.00028616023115160674, 'samples': 13279296, 'steps': 69162, 'loss/train': 1.406015157699585} 11/07/2021 06:57:21 - INFO - __main__ - Step 69164: {'lr': 0.00028615498020371946, 'samples': 13279488, 'steps': 69163, 'loss/train': 1.2085715532302856} 11/07/2021 06:57:21 - INFO - __main__ - Step 69165: {'lr': 0.00028614972923954123, 'samples': 13279680, 'steps': 69164, 'loss/train': 1.3384249210357666} 11/07/2021 06:57:21 - INFO - __main__ - Step 69166: {'lr': 0.00028614447825907455, 'samples': 13279872, 'steps': 69165, 'loss/train': 1.4200422763824463} 11/07/2021 06:57:22 - INFO - __main__ - Step 69167: {'lr': 0.00028613922726232173, 'samples': 13280064, 'steps': 69166, 'loss/train': 1.1208546161651611} 11/07/2021 06:57:22 - INFO - __main__ - Step 69168: {'lr': 0.0002861339762492852, 'samples': 13280256, 'steps': 69167, 'loss/train': 2.2300522327423096} 11/07/2021 06:57:23 - INFO - __main__ - Step 69169: {'lr': 0.0002861287252199671, 'samples': 13280448, 'steps': 69168, 'loss/train': 1.2687249183654785} 11/07/2021 06:57:24 - INFO - __main__ - Step 69170: {'lr': 0.00028612347417437007, 'samples': 13280640, 'steps': 69169, 'loss/train': 1.3818501234054565} 11/07/2021 06:57:24 - INFO - __main__ - Step 69171: {'lr': 0.00028611822311249633, 'samples': 13280832, 'steps': 69170, 'loss/train': 1.0795342922210693} 11/07/2021 06:57:24 - INFO - __main__ - Step 69172: {'lr': 0.0002861129720343483, 'samples': 13281024, 'steps': 69171, 'loss/train': 1.805603265762329} 11/07/2021 06:57:25 - INFO - __main__ - Step 69173: {'lr': 0.00028610772093992827, 'samples': 13281216, 'steps': 69172, 'loss/train': 2.5302388668060303} 11/07/2021 06:57:26 - INFO - __main__ - Step 69174: {'lr': 0.0002861024698292387, 'samples': 13281408, 'steps': 69173, 'loss/train': 1.390974998474121} 11/07/2021 06:57:26 - INFO - __main__ - Step 69175: {'lr': 0.00028609721870228195, 'samples': 13281600, 'steps': 69174, 'loss/train': 1.3459038734436035} 11/07/2021 06:57:26 - INFO - __main__ - Step 69176: {'lr': 0.0002860919675590603, 'samples': 13281792, 'steps': 69175, 'loss/train': 1.4640882015228271} 11/07/2021 06:57:27 - INFO - __main__ - Step 69177: {'lr': 0.0002860867163995762, 'samples': 13281984, 'steps': 69176, 'loss/train': 1.2412501573562622} 11/07/2021 06:57:27 - INFO - __main__ - Step 69178: {'lr': 0.0002860814652238319, 'samples': 13282176, 'steps': 69177, 'loss/train': 1.8449676036834717} 11/07/2021 06:57:28 - INFO - __main__ - Step 69179: {'lr': 0.0002860762140318299, 'samples': 13282368, 'steps': 69178, 'loss/train': 1.450145959854126} 11/07/2021 06:57:28 - INFO - __main__ - Step 69180: {'lr': 0.0002860709628235725, 'samples': 13282560, 'steps': 69179, 'loss/train': 1.39368736743927} 11/07/2021 06:57:29 - INFO - __main__ - Step 69181: {'lr': 0.00028606571159906207, 'samples': 13282752, 'steps': 69180, 'loss/train': 1.4579803943634033} 11/07/2021 06:57:29 - INFO - __main__ - Step 69182: {'lr': 0.0002860604603583011, 'samples': 13282944, 'steps': 69181, 'loss/train': 1.579431414604187} 11/07/2021 06:57:30 - INFO - __main__ - Step 69183: {'lr': 0.00028605520910129174, 'samples': 13283136, 'steps': 69182, 'loss/train': 1.451494574546814} 11/07/2021 06:57:31 - INFO - __main__ - Step 69184: {'lr': 0.0002860499578280364, 'samples': 13283328, 'steps': 69183, 'loss/train': 1.3747565746307373} 11/07/2021 06:57:32 - INFO - __main__ - Step 69185: {'lr': 0.00028604470653853764, 'samples': 13283520, 'steps': 69184, 'loss/train': 1.4611376523971558} 11/07/2021 06:57:32 - INFO - __main__ - Step 69186: {'lr': 0.0002860394552327976, 'samples': 13283712, 'steps': 69185, 'loss/train': 1.2259118556976318} 11/07/2021 06:57:32 - INFO - __main__ - Step 69187: {'lr': 0.0002860342039108188, 'samples': 13283904, 'steps': 69186, 'loss/train': 1.2597845792770386} 11/07/2021 06:57:33 - INFO - __main__ - Step 69188: {'lr': 0.00028602895257260355, 'samples': 13284096, 'steps': 69187, 'loss/train': 1.5653858184814453} 11/07/2021 06:57:33 - INFO - __main__ - Step 69189: {'lr': 0.0002860237012181541, 'samples': 13284288, 'steps': 69188, 'loss/train': 2.2011094093322754} 11/07/2021 06:57:33 - INFO - __main__ - Step 69190: {'lr': 0.00028601844984747304, 'samples': 13284480, 'steps': 69189, 'loss/train': 1.2245656251907349} 11/07/2021 06:57:34 - INFO - __main__ - Step 69191: {'lr': 0.00028601319846056255, 'samples': 13284672, 'steps': 69190, 'loss/train': 0.03911522775888443} 11/07/2021 06:57:35 - INFO - __main__ - Step 69192: {'lr': 0.0002860079470574251, 'samples': 13284864, 'steps': 69191, 'loss/train': 1.097650170326233} 11/07/2021 06:57:35 - INFO - __main__ - Step 69193: {'lr': 0.00028600269563806304, 'samples': 13285056, 'steps': 69192, 'loss/train': 1.6311590671539307} 11/07/2021 06:57:35 - INFO - __main__ - Step 69194: {'lr': 0.0002859974442024787, 'samples': 13285248, 'steps': 69193, 'loss/train': 1.1777502298355103} 11/07/2021 06:57:36 - INFO - __main__ - Step 69195: {'lr': 0.0002859921927506745, 'samples': 13285440, 'steps': 69194, 'loss/train': 0.5957779288291931} 11/07/2021 06:57:37 - INFO - __main__ - Step 69196: {'lr': 0.00028598694128265274, 'samples': 13285632, 'steps': 69195, 'loss/train': 1.2611949443817139} 11/07/2021 06:57:37 - INFO - __main__ - Step 69197: {'lr': 0.0002859816897984158, 'samples': 13285824, 'steps': 69196, 'loss/train': 1.5686441659927368} 11/07/2021 06:57:37 - INFO - __main__ - Step 69198: {'lr': 0.0002859764382979661, 'samples': 13286016, 'steps': 69197, 'loss/train': 1.3383713960647583} 11/07/2021 06:57:38 - INFO - __main__ - Step 69199: {'lr': 0.00028597118678130596, 'samples': 13286208, 'steps': 69198, 'loss/train': 1.3160333633422852} 11/07/2021 06:57:38 - INFO - __main__ - Step 69200: {'lr': 0.0002859659352484378, 'samples': 13286400, 'steps': 69199, 'loss/train': 2.065624475479126} 11/07/2021 06:57:39 - INFO - __main__ - Step 69201: {'lr': 0.00028596068369936387, 'samples': 13286592, 'steps': 69200, 'loss/train': 1.2424486875534058} 11/07/2021 06:57:39 - INFO - __main__ - Step 69202: {'lr': 0.0002859554321340867, 'samples': 13286784, 'steps': 69201, 'loss/train': 1.9164552688598633} 11/07/2021 06:57:40 - INFO - __main__ - Step 69203: {'lr': 0.0002859501805526085, 'samples': 13286976, 'steps': 69202, 'loss/train': 0.9564657211303711} 11/07/2021 06:57:40 - INFO - __main__ - Step 69204: {'lr': 0.0002859449289549317, 'samples': 13287168, 'steps': 69203, 'loss/train': 1.0610079765319824} 11/07/2021 06:57:41 - INFO - __main__ - Step 69205: {'lr': 0.0002859396773410587, 'samples': 13287360, 'steps': 69204, 'loss/train': 0.8592532873153687} 11/07/2021 06:57:41 - INFO - __main__ - Step 69206: {'lr': 0.0002859344257109918, 'samples': 13287552, 'steps': 69205, 'loss/train': 1.4113154411315918} 11/07/2021 06:57:42 - INFO - __main__ - Step 69207: {'lr': 0.0002859291740647334, 'samples': 13287744, 'steps': 69206, 'loss/train': 1.6470640897750854} 11/07/2021 06:57:42 - INFO - __main__ - Step 69208: {'lr': 0.00028592392240228595, 'samples': 13287936, 'steps': 69207, 'loss/train': 1.5705814361572266} 11/07/2021 06:57:43 - INFO - __main__ - Step 69209: {'lr': 0.00028591867072365166, 'samples': 13288128, 'steps': 69208, 'loss/train': 1.347712755203247} 11/07/2021 06:57:43 - INFO - __main__ - Step 69210: {'lr': 0.000285913419028833, 'samples': 13288320, 'steps': 69209, 'loss/train': 1.8720308542251587} 11/07/2021 06:57:44 - INFO - __main__ - Step 69211: {'lr': 0.0002859081673178323, 'samples': 13288512, 'steps': 69210, 'loss/train': 1.2995609045028687} 11/07/2021 06:57:45 - INFO - __main__ - Step 69212: {'lr': 0.0002859029155906519, 'samples': 13288704, 'steps': 69211, 'loss/train': 1.366962194442749} 11/07/2021 06:57:45 - INFO - __main__ - Step 69213: {'lr': 0.00028589766384729426, 'samples': 13288896, 'steps': 69212, 'loss/train': 0.9533960223197937} 11/07/2021 06:57:45 - INFO - __main__ - Step 69214: {'lr': 0.00028589241208776164, 'samples': 13289088, 'steps': 69213, 'loss/train': 1.0145014524459839} 11/07/2021 06:57:46 - INFO - __main__ - Step 69215: {'lr': 0.0002858871603120565, 'samples': 13289280, 'steps': 69214, 'loss/train': 1.6459301710128784} 11/07/2021 06:57:46 - INFO - __main__ - Step 69216: {'lr': 0.00028588190852018116, 'samples': 13289472, 'steps': 69215, 'loss/train': 1.4111679792404175} 11/07/2021 06:57:47 - INFO - __main__ - Step 69217: {'lr': 0.0002858766567121379, 'samples': 13289664, 'steps': 69216, 'loss/train': 1.495619297027588} 11/07/2021 06:57:47 - INFO - __main__ - Step 69218: {'lr': 0.0002858714048879292, 'samples': 13289856, 'steps': 69217, 'loss/train': 1.2843855619430542} 11/07/2021 06:57:48 - INFO - __main__ - Step 69219: {'lr': 0.0002858661530475575, 'samples': 13290048, 'steps': 69218, 'loss/train': 1.1060690879821777} 11/07/2021 06:57:48 - INFO - __main__ - Step 69220: {'lr': 0.000285860901191025, 'samples': 13290240, 'steps': 69219, 'loss/train': 1.70430326461792} 11/07/2021 06:57:48 - INFO - __main__ - Step 69221: {'lr': 0.00028585564931833413, 'samples': 13290432, 'steps': 69220, 'loss/train': 1.398499846458435} 11/07/2021 06:57:49 - INFO - __main__ - Step 69222: {'lr': 0.00028585039742948725, 'samples': 13290624, 'steps': 69221, 'loss/train': 1.504888653755188} 11/07/2021 06:57:50 - INFO - __main__ - Step 69223: {'lr': 0.0002858451455244867, 'samples': 13290816, 'steps': 69222, 'loss/train': 1.138569712638855} 11/07/2021 06:57:50 - INFO - __main__ - Step 69224: {'lr': 0.00028583989360333496, 'samples': 13291008, 'steps': 69223, 'loss/train': 1.4434324502944946} 11/07/2021 06:57:50 - INFO - __main__ - Step 69225: {'lr': 0.0002858346416660342, 'samples': 13291200, 'steps': 69224, 'loss/train': 0.7798174023628235} 11/07/2021 06:57:51 - INFO - __main__ - Step 69226: {'lr': 0.000285829389712587, 'samples': 13291392, 'steps': 69225, 'loss/train': 2.2788820266723633} 11/07/2021 06:57:51 - INFO - __main__ - Step 69227: {'lr': 0.00028582413774299567, 'samples': 13291584, 'steps': 69226, 'loss/train': 1.2899645566940308} 11/07/2021 06:57:52 - INFO - __main__ - Step 69228: {'lr': 0.0002858188857572624, 'samples': 13291776, 'steps': 69227, 'loss/train': 1.5380631685256958} 11/07/2021 06:57:53 - INFO - __main__ - Step 69229: {'lr': 0.0002858136337553898, 'samples': 13291968, 'steps': 69228, 'loss/train': 1.6994706392288208} 11/07/2021 06:57:53 - INFO - __main__ - Step 69230: {'lr': 0.0002858083817373801, 'samples': 13292160, 'steps': 69229, 'loss/train': 1.7674933671951294} 11/07/2021 06:57:53 - INFO - __main__ - Step 69231: {'lr': 0.0002858031297032357, 'samples': 13292352, 'steps': 69230, 'loss/train': 0.08054393529891968} 11/07/2021 06:57:54 - INFO - __main__ - Step 69232: {'lr': 0.00028579787765295895, 'samples': 13292544, 'steps': 69231, 'loss/train': 1.4416896104812622} 11/07/2021 06:57:55 - INFO - __main__ - Step 69233: {'lr': 0.0002857926255865523, 'samples': 13292736, 'steps': 69232, 'loss/train': 1.1560693979263306} 11/07/2021 06:57:55 - INFO - __main__ - Step 69234: {'lr': 0.0002857873735040179, 'samples': 13292928, 'steps': 69233, 'loss/train': 1.5826441049575806} 11/07/2021 06:57:55 - INFO - __main__ - Step 69235: {'lr': 0.00028578212140535836, 'samples': 13293120, 'steps': 69234, 'loss/train': 1.354468822479248} 11/07/2021 06:57:56 - INFO - __main__ - Step 69236: {'lr': 0.0002857768692905759, 'samples': 13293312, 'steps': 69235, 'loss/train': 1.0921388864517212} 11/07/2021 06:57:56 - INFO - __main__ - Step 69237: {'lr': 0.000285771617159673, 'samples': 13293504, 'steps': 69236, 'loss/train': 1.5721149444580078} 11/07/2021 06:57:57 - INFO - __main__ - Step 69238: {'lr': 0.00028576636501265195, 'samples': 13293696, 'steps': 69237, 'loss/train': 1.398463249206543} 11/07/2021 06:57:57 - INFO - __main__ - Step 69239: {'lr': 0.00028576111284951504, 'samples': 13293888, 'steps': 69238, 'loss/train': 1.5781663656234741} 11/07/2021 06:57:58 - INFO - __main__ - Step 69240: {'lr': 0.0002857558606702648, 'samples': 13294080, 'steps': 69239, 'loss/train': 1.5097819566726685} 11/07/2021 06:57:58 - INFO - __main__ - Step 69241: {'lr': 0.0002857506084749035, 'samples': 13294272, 'steps': 69240, 'loss/train': 1.6039165258407593} 11/07/2021 06:57:59 - INFO - __main__ - Step 69242: {'lr': 0.0002857453562634336, 'samples': 13294464, 'steps': 69241, 'loss/train': 1.4488674402236938} 11/07/2021 06:57:59 - INFO - __main__ - Step 69243: {'lr': 0.00028574010403585733, 'samples': 13294656, 'steps': 69242, 'loss/train': 1.6306350231170654} 11/07/2021 06:58:00 - INFO - __main__ - Step 69244: {'lr': 0.0002857348517921771, 'samples': 13294848, 'steps': 69243, 'loss/train': 0.3590436577796936} 11/07/2021 06:58:00 - INFO - __main__ - Step 69245: {'lr': 0.0002857295995323953, 'samples': 13295040, 'steps': 69244, 'loss/train': 1.2969825267791748} 11/07/2021 06:58:01 - INFO - __main__ - Step 69246: {'lr': 0.0002857243472565143, 'samples': 13295232, 'steps': 69245, 'loss/train': 1.6074599027633667} 11/07/2021 06:58:01 - INFO - __main__ - Step 69247: {'lr': 0.0002857190949645365, 'samples': 13295424, 'steps': 69246, 'loss/train': 1.53848397731781} 11/07/2021 06:58:02 - INFO - __main__ - Step 69248: {'lr': 0.0002857138426564642, 'samples': 13295616, 'steps': 69247, 'loss/train': 1.2085717916488647} 11/07/2021 06:58:02 - INFO - __main__ - Step 69249: {'lr': 0.0002857085903322998, 'samples': 13295808, 'steps': 69248, 'loss/train': 1.362234354019165} 11/07/2021 06:58:03 - INFO - __main__ - Step 69250: {'lr': 0.00028570333799204565, 'samples': 13296000, 'steps': 69249, 'loss/train': 1.4128340482711792} 11/07/2021 06:58:03 - INFO - __main__ - Step 69251: {'lr': 0.0002856980856357041, 'samples': 13296192, 'steps': 69250, 'loss/train': 1.1072179079055786} 11/07/2021 06:58:03 - INFO - __main__ - Step 69252: {'lr': 0.00028569283326327754, 'samples': 13296384, 'steps': 69251, 'loss/train': 1.3975954055786133} 11/07/2021 06:58:05 - INFO - __main__ - Step 69253: {'lr': 0.0002856875808747684, 'samples': 13296576, 'steps': 69252, 'loss/train': 1.4638686180114746} 11/07/2021 06:58:05 - INFO - __main__ - Step 69254: {'lr': 0.00028568232847017895, 'samples': 13296768, 'steps': 69253, 'loss/train': 1.4389312267303467} 11/07/2021 06:58:05 - INFO - __main__ - Step 69255: {'lr': 0.0002856770760495116, 'samples': 13296960, 'steps': 69254, 'loss/train': 2.0541281700134277} 11/07/2021 06:58:06 - INFO - __main__ - Step 69256: {'lr': 0.00028567182361276873, 'samples': 13297152, 'steps': 69255, 'loss/train': 1.1965467929840088} 11/07/2021 06:58:06 - INFO - __main__ - Step 69257: {'lr': 0.0002856665711599526, 'samples': 13297344, 'steps': 69256, 'loss/train': 1.1775065660476685} 11/07/2021 06:58:06 - INFO - __main__ - Step 69258: {'lr': 0.0002856613186910658, 'samples': 13297536, 'steps': 69257, 'loss/train': 1.3250231742858887} 11/07/2021 06:58:07 - INFO - __main__ - Step 69259: {'lr': 0.0002856560662061105, 'samples': 13297728, 'steps': 69258, 'loss/train': 1.2391231060028076} 11/07/2021 06:58:08 - INFO - __main__ - Step 69260: {'lr': 0.0002856508137050891, 'samples': 13297920, 'steps': 69259, 'loss/train': 1.3606767654418945} 11/07/2021 06:58:08 - INFO - __main__ - Step 69261: {'lr': 0.000285645561188004, 'samples': 13298112, 'steps': 69260, 'loss/train': 1.6736522912979126} 11/07/2021 06:58:08 - INFO - __main__ - Step 69262: {'lr': 0.0002856403086548576, 'samples': 13298304, 'steps': 69261, 'loss/train': 1.8544031381607056} 11/07/2021 06:58:10 - INFO - __main__ - Step 69263: {'lr': 0.0002856350561056522, 'samples': 13298496, 'steps': 69262, 'loss/train': 1.9161595106124878} 11/07/2021 06:58:10 - INFO - __main__ - Step 69264: {'lr': 0.0002856298035403902, 'samples': 13298688, 'steps': 69263, 'loss/train': 1.4145827293395996} 11/07/2021 06:58:10 - INFO - __main__ - Step 69265: {'lr': 0.00028562455095907394, 'samples': 13298880, 'steps': 69264, 'loss/train': 1.6013493537902832} 11/07/2021 06:58:11 - INFO - __main__ - Step 69266: {'lr': 0.0002856192983617058, 'samples': 13299072, 'steps': 69265, 'loss/train': 1.3936980962753296} 11/07/2021 06:58:11 - INFO - __main__ - Step 69267: {'lr': 0.0002856140457482882, 'samples': 13299264, 'steps': 69266, 'loss/train': 1.4931960105895996} 11/07/2021 06:58:11 - INFO - __main__ - Step 69268: {'lr': 0.00028560879311882335, 'samples': 13299456, 'steps': 69267, 'loss/train': 0.9447680711746216} 11/07/2021 06:58:12 - INFO - __main__ - Step 69269: {'lr': 0.0002856035404733139, 'samples': 13299648, 'steps': 69268, 'loss/train': 1.8749215602874756} 11/07/2021 06:58:13 - INFO - __main__ - Step 69270: {'lr': 0.00028559828781176197, 'samples': 13299840, 'steps': 69269, 'loss/train': 1.4562842845916748} 11/07/2021 06:58:13 - INFO - __main__ - Step 69271: {'lr': 0.00028559303513416993, 'samples': 13300032, 'steps': 69270, 'loss/train': 1.6873046159744263} 11/07/2021 06:58:13 - INFO - __main__ - Step 69272: {'lr': 0.00028558778244054027, 'samples': 13300224, 'steps': 69271, 'loss/train': 0.9327331781387329} 11/07/2021 06:58:14 - INFO - __main__ - Step 69273: {'lr': 0.00028558252973087537, 'samples': 13300416, 'steps': 69272, 'loss/train': 1.538641333580017} 11/07/2021 06:58:15 - INFO - __main__ - Step 69274: {'lr': 0.00028557727700517744, 'samples': 13300608, 'steps': 69273, 'loss/train': 1.5230000019073486} 11/07/2021 06:58:15 - INFO - __main__ - Step 69275: {'lr': 0.00028557202426344894, 'samples': 13300800, 'steps': 69274, 'loss/train': 1.7548086643218994} 11/07/2021 06:58:15 - INFO - __main__ - Step 69276: {'lr': 0.00028556677150569235, 'samples': 13300992, 'steps': 69275, 'loss/train': 1.5595418214797974} 11/07/2021 06:58:16 - INFO - __main__ - Step 69277: {'lr': 0.0002855615187319098, 'samples': 13301184, 'steps': 69276, 'loss/train': 1.6024545431137085} 11/07/2021 06:58:16 - INFO - __main__ - Step 69278: {'lr': 0.00028555626594210375, 'samples': 13301376, 'steps': 69277, 'loss/train': 1.6350831985473633} 11/07/2021 06:58:18 - INFO - __main__ - Step 69279: {'lr': 0.00028555101313627667, 'samples': 13301568, 'steps': 69278, 'loss/train': 1.615169644355774} 11/07/2021 06:58:18 - INFO - __main__ - Step 69280: {'lr': 0.0002855457603144309, 'samples': 13301760, 'steps': 69279, 'loss/train': 5.747090816497803} 11/07/2021 06:58:19 - INFO - __main__ - Step 69281: {'lr': 0.0002855405074765686, 'samples': 13301952, 'steps': 69280, 'loss/train': 2.0110063552856445} 11/07/2021 06:58:19 - INFO - __main__ - Step 69282: {'lr': 0.00028553525462269246, 'samples': 13302144, 'steps': 69281, 'loss/train': 0.9122362732887268} 11/07/2021 06:58:19 - INFO - __main__ - Step 69283: {'lr': 0.00028553000175280465, 'samples': 13302336, 'steps': 69282, 'loss/train': 1.0500602722167969} 11/07/2021 06:58:20 - INFO - __main__ - Step 69284: {'lr': 0.0002855247488669075, 'samples': 13302528, 'steps': 69283, 'loss/train': 1.1476657390594482} 11/07/2021 06:58:21 - INFO - __main__ - Step 69285: {'lr': 0.00028551949596500347, 'samples': 13302720, 'steps': 69284, 'loss/train': 1.6614024639129639} 11/07/2021 06:58:21 - INFO - __main__ - Step 69286: {'lr': 0.00028551424304709493, 'samples': 13302912, 'steps': 69285, 'loss/train': 1.2637434005737305} 11/07/2021 06:58:21 - INFO - __main__ - Step 69287: {'lr': 0.0002855089901131842, 'samples': 13303104, 'steps': 69286, 'loss/train': 1.289076328277588} 11/07/2021 06:58:22 - INFO - __main__ - Step 69288: {'lr': 0.00028550373716327367, 'samples': 13303296, 'steps': 69287, 'loss/train': 0.8430917263031006} 11/07/2021 06:58:22 - INFO - __main__ - Step 69289: {'lr': 0.0002854984841973657, 'samples': 13303488, 'steps': 69288, 'loss/train': 0.7853161692619324} 11/07/2021 06:58:23 - INFO - __main__ - Step 69290: {'lr': 0.0002854932312154627, 'samples': 13303680, 'steps': 69289, 'loss/train': 2.1617486476898193} 11/07/2021 06:58:24 - INFO - __main__ - Step 69291: {'lr': 0.00028548797821756697, 'samples': 13303872, 'steps': 69290, 'loss/train': 1.3545118570327759} 11/07/2021 06:58:24 - INFO - __main__ - Step 69292: {'lr': 0.00028548272520368084, 'samples': 13304064, 'steps': 69291, 'loss/train': 1.5515484809875488} 11/07/2021 06:58:24 - INFO - __main__ - Step 69293: {'lr': 0.0002854774721738068, 'samples': 13304256, 'steps': 69292, 'loss/train': 1.5498449802398682} 11/07/2021 06:58:25 - INFO - __main__ - Step 69294: {'lr': 0.00028547221912794717, 'samples': 13304448, 'steps': 69293, 'loss/train': 1.0722066164016724} 11/07/2021 06:58:25 - INFO - __main__ - Step 69295: {'lr': 0.0002854669660661043, 'samples': 13304640, 'steps': 69294, 'loss/train': 1.3619647026062012} 11/07/2021 06:58:26 - INFO - __main__ - Step 69296: {'lr': 0.0002854617129882806, 'samples': 13304832, 'steps': 69295, 'loss/train': 0.7313637733459473} 11/07/2021 06:58:26 - INFO - __main__ - Step 69297: {'lr': 0.0002854564598944783, 'samples': 13305024, 'steps': 69296, 'loss/train': 1.7463375329971313} 11/07/2021 06:58:27 - INFO - __main__ - Step 69298: {'lr': 0.0002854512067846999, 'samples': 13305216, 'steps': 69297, 'loss/train': 1.3992342948913574} 11/07/2021 06:58:27 - INFO - __main__ - Step 69299: {'lr': 0.0002854459536589478, 'samples': 13305408, 'steps': 69298, 'loss/train': 1.5097103118896484} 11/07/2021 06:58:27 - INFO - __main__ - Step 69300: {'lr': 0.0002854407005172243, 'samples': 13305600, 'steps': 69299, 'loss/train': 1.4915285110473633} 11/07/2021 06:58:29 - INFO - __main__ - Step 69301: {'lr': 0.0002854354473595317, 'samples': 13305792, 'steps': 69300, 'loss/train': 1.5372670888900757} 11/07/2021 06:58:29 - INFO - __main__ - Step 69302: {'lr': 0.0002854301941858724, 'samples': 13305984, 'steps': 69301, 'loss/train': 1.3633134365081787} 11/07/2021 06:58:29 - INFO - __main__ - Step 69303: {'lr': 0.00028542494099624896, 'samples': 13306176, 'steps': 69302, 'loss/train': 1.5285457372665405} 11/07/2021 06:58:30 - INFO - __main__ - Step 69304: {'lr': 0.0002854196877906635, 'samples': 13306368, 'steps': 69303, 'loss/train': 1.6479406356811523} 11/07/2021 06:58:30 - INFO - __main__ - Step 69305: {'lr': 0.00028541443456911843, 'samples': 13306560, 'steps': 69304, 'loss/train': 1.1048895120620728} 11/07/2021 06:58:31 - INFO - __main__ - Step 69306: {'lr': 0.0002854091813316162, 'samples': 13306752, 'steps': 69305, 'loss/train': 1.3913793563842773} 11/07/2021 06:58:32 - INFO - __main__ - Step 69307: {'lr': 0.0002854039280781591, 'samples': 13306944, 'steps': 69306, 'loss/train': 1.6685882806777954} 11/07/2021 06:58:32 - INFO - __main__ - Step 69308: {'lr': 0.00028539867480874954, 'samples': 13307136, 'steps': 69307, 'loss/train': 1.535331130027771} 11/07/2021 06:58:32 - INFO - __main__ - Step 69309: {'lr': 0.00028539342152339, 'samples': 13307328, 'steps': 69308, 'loss/train': 1.481994390487671} 11/07/2021 06:58:33 - INFO - __main__ - Step 69310: {'lr': 0.0002853881682220826, 'samples': 13307520, 'steps': 69309, 'loss/train': 0.10279141366481781} 11/07/2021 06:58:34 - INFO - __main__ - Step 69311: {'lr': 0.0002853829149048299, 'samples': 13307712, 'steps': 69310, 'loss/train': 1.5733083486557007} 11/07/2021 06:58:34 - INFO - __main__ - Step 69312: {'lr': 0.00028537766157163413, 'samples': 13307904, 'steps': 69311, 'loss/train': 0.1603562831878662} 11/07/2021 06:58:35 - INFO - __main__ - Step 69313: {'lr': 0.00028537240822249784, 'samples': 13308096, 'steps': 69312, 'loss/train': 1.474601149559021} 11/07/2021 06:58:35 - INFO - __main__ - Step 69314: {'lr': 0.0002853671548574232, 'samples': 13308288, 'steps': 69313, 'loss/train': 1.5023865699768066} 11/07/2021 06:58:35 - INFO - __main__ - Step 69315: {'lr': 0.0002853619014764127, 'samples': 13308480, 'steps': 69314, 'loss/train': 1.3153762817382812} 11/07/2021 06:58:36 - INFO - __main__ - Step 69316: {'lr': 0.0002853566480794687, 'samples': 13308672, 'steps': 69315, 'loss/train': 1.2447669506072998} 11/07/2021 06:58:37 - INFO - __main__ - Step 69317: {'lr': 0.00028535139466659355, 'samples': 13308864, 'steps': 69316, 'loss/train': 1.3789061307907104} 11/07/2021 06:58:37 - INFO - __main__ - Step 69318: {'lr': 0.00028534614123778955, 'samples': 13309056, 'steps': 69317, 'loss/train': 1.5582808256149292} 11/07/2021 06:58:37 - INFO - __main__ - Step 69319: {'lr': 0.0002853408877930591, 'samples': 13309248, 'steps': 69318, 'loss/train': 1.4972842931747437} 11/07/2021 06:58:38 - INFO - __main__ - Step 69320: {'lr': 0.0002853356343324047, 'samples': 13309440, 'steps': 69319, 'loss/train': 1.3297532796859741} 11/07/2021 06:58:39 - INFO - __main__ - Step 69321: {'lr': 0.0002853303808558285, 'samples': 13309632, 'steps': 69320, 'loss/train': 1.5361759662628174} 11/07/2021 06:58:39 - INFO - __main__ - Step 69322: {'lr': 0.00028532512736333305, 'samples': 13309824, 'steps': 69321, 'loss/train': 1.377709150314331} 11/07/2021 06:58:40 - INFO - __main__ - Step 69323: {'lr': 0.00028531987385492063, 'samples': 13310016, 'steps': 69322, 'loss/train': 0.9731813669204712} 11/07/2021 06:58:40 - INFO - __main__ - Step 69324: {'lr': 0.0002853146203305936, 'samples': 13310208, 'steps': 69323, 'loss/train': 1.741881251335144} 11/07/2021 06:58:40 - INFO - __main__ - Step 69325: {'lr': 0.00028530936679035436, 'samples': 13310400, 'steps': 69324, 'loss/train': 1.4561582803726196} 11/07/2021 06:58:41 - INFO - __main__ - Step 69326: {'lr': 0.0002853041132342052, 'samples': 13310592, 'steps': 69325, 'loss/train': 1.4781765937805176} 11/07/2021 06:58:42 - INFO - __main__ - Step 69327: {'lr': 0.0002852988596621486, 'samples': 13310784, 'steps': 69326, 'loss/train': 1.429011344909668} 11/07/2021 06:58:42 - INFO - __main__ - Step 69328: {'lr': 0.0002852936060741869, 'samples': 13310976, 'steps': 69327, 'loss/train': 1.2829310894012451} 11/07/2021 06:58:42 - INFO - __main__ - Step 69329: {'lr': 0.00028528835247032243, 'samples': 13311168, 'steps': 69328, 'loss/train': 0.7950383424758911} 11/07/2021 06:58:43 - INFO - __main__ - Step 69330: {'lr': 0.0002852830988505576, 'samples': 13311360, 'steps': 69329, 'loss/train': 2.256078004837036} 11/07/2021 06:58:43 - INFO - __main__ - Step 69331: {'lr': 0.0002852778452148947, 'samples': 13311552, 'steps': 69330, 'loss/train': 1.1507059335708618} 11/07/2021 06:58:44 - INFO - __main__ - Step 69332: {'lr': 0.0002852725915633362, 'samples': 13311744, 'steps': 69331, 'loss/train': 1.3986880779266357} 11/07/2021 06:58:45 - INFO - __main__ - Step 69333: {'lr': 0.00028526733789588436, 'samples': 13311936, 'steps': 69332, 'loss/train': 1.43605375289917} 11/07/2021 06:58:45 - INFO - __main__ - Step 69334: {'lr': 0.0002852620842125416, 'samples': 13312128, 'steps': 69333, 'loss/train': 1.099779486656189} 11/07/2021 06:58:45 - INFO - __main__ - Step 69335: {'lr': 0.00028525683051331037, 'samples': 13312320, 'steps': 69334, 'loss/train': 1.123693585395813} 11/07/2021 06:58:46 - INFO - __main__ - Step 69336: {'lr': 0.0002852515767981929, 'samples': 13312512, 'steps': 69335, 'loss/train': 1.3210476636886597} 11/07/2021 06:58:47 - INFO - __main__ - Step 69337: {'lr': 0.0002852463230671916, 'samples': 13312704, 'steps': 69336, 'loss/train': 1.654752254486084} 11/07/2021 06:58:47 - INFO - __main__ - Step 69338: {'lr': 0.0002852410693203089, 'samples': 13312896, 'steps': 69337, 'loss/train': 1.5029178857803345} 11/07/2021 06:58:47 - INFO - __main__ - Step 69339: {'lr': 0.00028523581555754706, 'samples': 13313088, 'steps': 69338, 'loss/train': 0.8586850762367249} 11/07/2021 06:58:48 - INFO - __main__ - Step 69340: {'lr': 0.0002852305617789085, 'samples': 13313280, 'steps': 69339, 'loss/train': 1.472642421722412} 11/07/2021 06:58:48 - INFO - __main__ - Step 69341: {'lr': 0.00028522530798439564, 'samples': 13313472, 'steps': 69340, 'loss/train': 1.353594183921814} 11/07/2021 06:58:49 - INFO - __main__ - Step 69342: {'lr': 0.00028522005417401075, 'samples': 13313664, 'steps': 69341, 'loss/train': 1.5232470035552979} 11/07/2021 06:58:50 - INFO - __main__ - Step 69343: {'lr': 0.0002852148003477564, 'samples': 13313856, 'steps': 69342, 'loss/train': 0.7349553108215332} 11/07/2021 06:58:50 - INFO - __main__ - Step 69344: {'lr': 0.0002852095465056346, 'samples': 13314048, 'steps': 69343, 'loss/train': 1.1553962230682373} 11/07/2021 06:58:50 - INFO - __main__ - Step 69345: {'lr': 0.00028520429264764803, 'samples': 13314240, 'steps': 69344, 'loss/train': 0.9490324258804321} 11/07/2021 06:58:51 - INFO - __main__ - Step 69346: {'lr': 0.00028519903877379893, 'samples': 13314432, 'steps': 69345, 'loss/train': 1.0498921871185303} 11/07/2021 06:58:52 - INFO - __main__ - Step 69347: {'lr': 0.0002851937848840896, 'samples': 13314624, 'steps': 69346, 'loss/train': 1.2131633758544922} 11/07/2021 06:58:52 - INFO - __main__ - Step 69348: {'lr': 0.0002851885309785227, 'samples': 13314816, 'steps': 69347, 'loss/train': 1.4921082258224487} 11/07/2021 06:58:52 - INFO - __main__ - Step 69349: {'lr': 0.0002851832770571002, 'samples': 13315008, 'steps': 69348, 'loss/train': 1.8089969158172607} 11/07/2021 06:58:53 - INFO - __main__ - Step 69350: {'lr': 0.00028517802311982477, 'samples': 13315200, 'steps': 69349, 'loss/train': 1.9380662441253662} 11/07/2021 06:58:53 - INFO - __main__ - Step 69351: {'lr': 0.0002851727691666986, 'samples': 13315392, 'steps': 69350, 'loss/train': 1.7079898118972778} 11/07/2021 06:58:54 - INFO - __main__ - Step 69352: {'lr': 0.0002851675151977242, 'samples': 13315584, 'steps': 69351, 'loss/train': 1.235461711883545} 11/07/2021 06:58:56 - INFO - __main__ - Step 69353: {'lr': 0.00028516226121290373, 'samples': 13315776, 'steps': 69352, 'loss/train': 1.637725591659546} 11/07/2021 06:58:56 - INFO - __main__ - Step 69354: {'lr': 0.0002851570072122397, 'samples': 13315968, 'steps': 69353, 'loss/train': 1.8360588550567627} 11/07/2021 06:58:56 - INFO - __main__ - Step 69355: {'lr': 0.0002851517531957346, 'samples': 13316160, 'steps': 69354, 'loss/train': 1.3676856756210327} 11/07/2021 06:58:57 - INFO - __main__ - Step 69356: {'lr': 0.00028514649916339065, 'samples': 13316352, 'steps': 69355, 'loss/train': 1.8087894916534424} 11/07/2021 06:58:57 - INFO - __main__ - Step 69357: {'lr': 0.0002851412451152101, 'samples': 13316544, 'steps': 69356, 'loss/train': 1.7678158283233643} 11/07/2021 06:58:57 - INFO - __main__ - Step 69358: {'lr': 0.00028513599105119554, 'samples': 13316736, 'steps': 69357, 'loss/train': 1.7620089054107666} 11/07/2021 06:58:58 - INFO - __main__ - Step 69359: {'lr': 0.0002851307369713492, 'samples': 13316928, 'steps': 69358, 'loss/train': 1.5831705331802368} 11/07/2021 06:58:59 - INFO - __main__ - Step 69360: {'lr': 0.00028512548287567353, 'samples': 13317120, 'steps': 69359, 'loss/train': 1.7530415058135986} 11/07/2021 06:58:59 - INFO - __main__ - Step 69361: {'lr': 0.0002851202287641709, 'samples': 13317312, 'steps': 69360, 'loss/train': 1.4109858274459839} 11/07/2021 06:59:00 - INFO - __main__ - Step 69362: {'lr': 0.00028511497463684356, 'samples': 13317504, 'steps': 69361, 'loss/train': 2.0012314319610596} 11/07/2021 06:59:00 - INFO - __main__ - Step 69363: {'lr': 0.0002851097204936939, 'samples': 13317696, 'steps': 69362, 'loss/train': 1.225242018699646} 11/07/2021 06:59:00 - INFO - __main__ - Step 69364: {'lr': 0.0002851044663347244, 'samples': 13317888, 'steps': 69363, 'loss/train': 1.105672836303711} 11/07/2021 06:59:01 - INFO - __main__ - Step 69365: {'lr': 0.0002850992121599374, 'samples': 13318080, 'steps': 69364, 'loss/train': 0.9637662172317505} 11/07/2021 06:59:02 - INFO - __main__ - Step 69366: {'lr': 0.0002850939579693353, 'samples': 13318272, 'steps': 69365, 'loss/train': 1.5389184951782227} 11/07/2021 06:59:02 - INFO - __main__ - Step 69367: {'lr': 0.0002850887037629203, 'samples': 13318464, 'steps': 69366, 'loss/train': 1.4475488662719727} 11/07/2021 06:59:02 - INFO - __main__ - Step 69368: {'lr': 0.00028508344954069487, 'samples': 13318656, 'steps': 69367, 'loss/train': 1.999205231666565} 11/07/2021 06:59:03 - INFO - __main__ - Step 69369: {'lr': 0.00028507819530266144, 'samples': 13318848, 'steps': 69368, 'loss/train': 1.2742044925689697} 11/07/2021 06:59:04 - INFO - __main__ - Step 69370: {'lr': 0.00028507294104882224, 'samples': 13319040, 'steps': 69369, 'loss/train': 1.6987495422363281} 11/07/2021 06:59:04 - INFO - __main__ - Step 69371: {'lr': 0.00028506768677917976, 'samples': 13319232, 'steps': 69370, 'loss/train': 1.1615947484970093} 11/07/2021 06:59:05 - INFO - __main__ - Step 69372: {'lr': 0.00028506243249373634, 'samples': 13319424, 'steps': 69371, 'loss/train': 1.447059154510498} 11/07/2021 06:59:05 - INFO - __main__ - Step 69373: {'lr': 0.0002850571781924943, 'samples': 13319616, 'steps': 69372, 'loss/train': 1.3193278312683105} 11/07/2021 06:59:05 - INFO - __main__ - Step 69374: {'lr': 0.00028505192387545604, 'samples': 13319808, 'steps': 69373, 'loss/train': 1.753889560699463} 11/07/2021 06:59:06 - INFO - __main__ - Step 69375: {'lr': 0.00028504666954262393, 'samples': 13320000, 'steps': 69374, 'loss/train': 1.4039536714553833} 11/07/2021 06:59:07 - INFO - __main__ - Step 69376: {'lr': 0.00028504141519400037, 'samples': 13320192, 'steps': 69375, 'loss/train': 1.4782894849777222} 11/07/2021 06:59:07 - INFO - __main__ - Step 69377: {'lr': 0.00028503616082958767, 'samples': 13320384, 'steps': 69376, 'loss/train': 2.5725464820861816} 11/07/2021 06:59:07 - INFO - __main__ - Step 69378: {'lr': 0.0002850309064493882, 'samples': 13320576, 'steps': 69377, 'loss/train': 1.9796520471572876} 11/07/2021 06:59:08 - INFO - __main__ - Step 69379: {'lr': 0.00028502565205340433, 'samples': 13320768, 'steps': 69378, 'loss/train': 1.51253080368042} 11/07/2021 06:59:08 - INFO - __main__ - Step 69380: {'lr': 0.0002850203976416384, 'samples': 13320960, 'steps': 69379, 'loss/train': 1.5107839107513428} 11/07/2021 06:59:10 - INFO - __main__ - Step 69381: {'lr': 0.00028501514321409283, 'samples': 13321152, 'steps': 69380, 'loss/train': 0.7972800731658936} 11/07/2021 06:59:10 - INFO - __main__ - Step 69382: {'lr': 0.00028500988877077006, 'samples': 13321344, 'steps': 69381, 'loss/train': 0.591680109500885} 11/07/2021 06:59:10 - INFO - __main__ - Step 69383: {'lr': 0.00028500463431167236, 'samples': 13321536, 'steps': 69382, 'loss/train': 1.5979576110839844} 11/07/2021 06:59:11 - INFO - __main__ - Step 69384: {'lr': 0.00028499937983680207, 'samples': 13321728, 'steps': 69383, 'loss/train': 1.7523329257965088} 11/07/2021 06:59:11 - INFO - __main__ - Step 69385: {'lr': 0.00028499412534616157, 'samples': 13321920, 'steps': 69384, 'loss/train': 1.260127305984497} 11/07/2021 06:59:11 - INFO - __main__ - Step 69386: {'lr': 0.00028498887083975335, 'samples': 13322112, 'steps': 69385, 'loss/train': 1.3774631023406982} 11/07/2021 06:59:12 - INFO - __main__ - Step 69387: {'lr': 0.0002849836163175796, 'samples': 13322304, 'steps': 69386, 'loss/train': 0.39626842737197876} 11/07/2021 06:59:13 - INFO - __main__ - Step 69388: {'lr': 0.0002849783617796428, 'samples': 13322496, 'steps': 69387, 'loss/train': 1.3527296781539917} 11/07/2021 06:59:13 - INFO - __main__ - Step 69389: {'lr': 0.0002849731072259453, 'samples': 13322688, 'steps': 69388, 'loss/train': 1.4630780220031738} 11/07/2021 06:59:14 - INFO - __main__ - Step 69390: {'lr': 0.0002849678526564895, 'samples': 13322880, 'steps': 69389, 'loss/train': 1.5831193923950195} 11/07/2021 06:59:14 - INFO - __main__ - Step 69391: {'lr': 0.00028496259807127766, 'samples': 13323072, 'steps': 69390, 'loss/train': 1.720147728919983} 11/07/2021 06:59:15 - INFO - __main__ - Step 69392: {'lr': 0.0002849573434703122, 'samples': 13323264, 'steps': 69391, 'loss/train': 1.6024327278137207} 11/07/2021 06:59:15 - INFO - __main__ - Step 69393: {'lr': 0.00028495208885359553, 'samples': 13323456, 'steps': 69392, 'loss/train': 1.3966275453567505} 11/07/2021 06:59:16 - INFO - __main__ - Step 69394: {'lr': 0.00028494683422113, 'samples': 13323648, 'steps': 69393, 'loss/train': 1.4451426267623901} 11/07/2021 06:59:16 - INFO - __main__ - Step 69395: {'lr': 0.00028494157957291796, 'samples': 13323840, 'steps': 69394, 'loss/train': 1.555580496788025} 11/07/2021 06:59:16 - INFO - __main__ - Step 69396: {'lr': 0.0002849363249089617, 'samples': 13324032, 'steps': 69395, 'loss/train': 1.648310899734497} 11/07/2021 06:59:17 - INFO - __main__ - Step 69397: {'lr': 0.00028493107022926385, 'samples': 13324224, 'steps': 69396, 'loss/train': 1.3584320545196533} 11/07/2021 06:59:18 - INFO - __main__ - Step 69398: {'lr': 0.00028492581553382645, 'samples': 13324416, 'steps': 69397, 'loss/train': 1.1891801357269287} 11/07/2021 06:59:18 - INFO - __main__ - Step 69399: {'lr': 0.0002849205608226521, 'samples': 13324608, 'steps': 69398, 'loss/train': 1.1007691621780396} 11/07/2021 06:59:18 - INFO - __main__ - Step 69400: {'lr': 0.000284915306095743, 'samples': 13324800, 'steps': 69399, 'loss/train': 1.0141301155090332} 11/07/2021 06:59:19 - INFO - __main__ - Step 69401: {'lr': 0.00028491005135310166, 'samples': 13324992, 'steps': 69400, 'loss/train': 1.154341697692871} 11/07/2021 06:59:19 - INFO - __main__ - Step 69402: {'lr': 0.00028490479659473033, 'samples': 13325184, 'steps': 69401, 'loss/train': 1.217817783355713} 11/07/2021 06:59:20 - INFO - __main__ - Step 69403: {'lr': 0.0002848995418206316, 'samples': 13325376, 'steps': 69402, 'loss/train': 1.6723992824554443} 11/07/2021 06:59:20 - INFO - __main__ - Step 69404: {'lr': 0.00028489428703080754, 'samples': 13325568, 'steps': 69403, 'loss/train': 1.333903193473816} 11/07/2021 06:59:21 - INFO - __main__ - Step 69405: {'lr': 0.00028488903222526063, 'samples': 13325760, 'steps': 69404, 'loss/train': 0.9096896648406982} 11/07/2021 06:59:21 - INFO - __main__ - Step 69406: {'lr': 0.0002848837774039933, 'samples': 13325952, 'steps': 69405, 'loss/train': 1.2957279682159424} 11/07/2021 06:59:21 - INFO - __main__ - Step 69407: {'lr': 0.0002848785225670079, 'samples': 13326144, 'steps': 69406, 'loss/train': 1.2255351543426514} 11/07/2021 06:59:22 - INFO - __main__ - Step 69408: {'lr': 0.00028487326771430677, 'samples': 13326336, 'steps': 69407, 'loss/train': 1.4800611734390259} 11/07/2021 06:59:23 - INFO - __main__ - Step 69409: {'lr': 0.00028486801284589223, 'samples': 13326528, 'steps': 69408, 'loss/train': 1.6229028701782227} 11/07/2021 06:59:23 - INFO - __main__ - Step 69410: {'lr': 0.0002848627579617668, 'samples': 13326720, 'steps': 69409, 'loss/train': 1.36991548538208} 11/07/2021 06:59:24 - INFO - __main__ - Step 69411: {'lr': 0.0002848575030619327, 'samples': 13326912, 'steps': 69410, 'loss/train': 1.7154704332351685} 11/07/2021 06:59:24 - INFO - __main__ - Step 69412: {'lr': 0.0002848522481463923, 'samples': 13327104, 'steps': 69411, 'loss/train': 1.497255802154541} 11/07/2021 06:59:25 - INFO - __main__ - Step 69413: {'lr': 0.00028484699321514804, 'samples': 13327296, 'steps': 69412, 'loss/train': 1.300992488861084} 11/07/2021 06:59:25 - INFO - __main__ - Step 69414: {'lr': 0.0002848417382682023, 'samples': 13327488, 'steps': 69413, 'loss/train': 1.1122655868530273} 11/07/2021 06:59:26 - INFO - __main__ - Step 69415: {'lr': 0.00028483648330555737, 'samples': 13327680, 'steps': 69414, 'loss/train': 1.374622106552124} 11/07/2021 06:59:26 - INFO - __main__ - Step 69416: {'lr': 0.0002848312283272157, 'samples': 13327872, 'steps': 69415, 'loss/train': 1.3780310153961182} 11/07/2021 06:59:26 - INFO - __main__ - Step 69417: {'lr': 0.0002848259733331796, 'samples': 13328064, 'steps': 69416, 'loss/train': 1.3134727478027344} 11/07/2021 06:59:27 - INFO - __main__ - Step 69418: {'lr': 0.0002848207183234515, 'samples': 13328256, 'steps': 69417, 'loss/train': 1.452608346939087} 11/07/2021 06:59:28 - INFO - __main__ - Step 69419: {'lr': 0.0002848154632980337, 'samples': 13328448, 'steps': 69418, 'loss/train': 1.2983453273773193} 11/07/2021 06:59:28 - INFO - __main__ - Step 69420: {'lr': 0.0002848102082569285, 'samples': 13328640, 'steps': 69419, 'loss/train': 2.3577141761779785} 11/07/2021 06:59:28 - INFO - __main__ - Step 69421: {'lr': 0.0002848049532001384, 'samples': 13328832, 'steps': 69420, 'loss/train': 1.8049498796463013} 11/07/2021 06:59:29 - INFO - __main__ - Step 69422: {'lr': 0.0002847996981276657, 'samples': 13329024, 'steps': 69421, 'loss/train': 1.6141338348388672} 11/07/2021 06:59:29 - INFO - __main__ - Step 69423: {'lr': 0.00028479444303951284, 'samples': 13329216, 'steps': 69422, 'loss/train': 1.615761637687683} 11/07/2021 06:59:30 - INFO - __main__ - Step 69424: {'lr': 0.0002847891879356822, 'samples': 13329408, 'steps': 69423, 'loss/train': 1.2792472839355469} 11/07/2021 06:59:31 - INFO - __main__ - Step 69425: {'lr': 0.00028478393281617596, 'samples': 13329600, 'steps': 69424, 'loss/train': 1.509810209274292} 11/07/2021 06:59:31 - INFO - __main__ - Step 69426: {'lr': 0.0002847786776809967, 'samples': 13329792, 'steps': 69425, 'loss/train': 1.610639214515686} 11/07/2021 06:59:31 - INFO - __main__ - Step 69427: {'lr': 0.0002847734225301467, 'samples': 13329984, 'steps': 69426, 'loss/train': 1.4865138530731201} 11/07/2021 06:59:32 - INFO - __main__ - Step 69428: {'lr': 0.0002847681673636283, 'samples': 13330176, 'steps': 69427, 'loss/train': 1.8116259574890137} 11/07/2021 06:59:33 - INFO - __main__ - Step 69429: {'lr': 0.0002847629121814439, 'samples': 13330368, 'steps': 69428, 'loss/train': 1.7716988325119019} 11/07/2021 06:59:33 - INFO - __main__ - Step 69430: {'lr': 0.0002847576569835959, 'samples': 13330560, 'steps': 69429, 'loss/train': 1.433172345161438} 11/07/2021 06:59:33 - INFO - __main__ - Step 69431: {'lr': 0.00028475240177008664, 'samples': 13330752, 'steps': 69430, 'loss/train': 1.3787528276443481} 11/07/2021 06:59:34 - INFO - __main__ - Step 69432: {'lr': 0.0002847471465409184, 'samples': 13330944, 'steps': 69431, 'loss/train': 1.5866791009902954} 11/07/2021 06:59:34 - INFO - __main__ - Step 69433: {'lr': 0.0002847418912960937, 'samples': 13331136, 'steps': 69432, 'loss/train': 1.7595936059951782} 11/07/2021 06:59:35 - INFO - __main__ - Step 69434: {'lr': 0.0002847366360356149, 'samples': 13331328, 'steps': 69433, 'loss/train': 1.5124648809432983} 11/07/2021 06:59:36 - INFO - __main__ - Step 69435: {'lr': 0.00028473138075948425, 'samples': 13331520, 'steps': 69434, 'loss/train': 1.266822099685669} 11/07/2021 06:59:36 - INFO - __main__ - Step 69436: {'lr': 0.0002847261254677041, 'samples': 13331712, 'steps': 69435, 'loss/train': 1.321885347366333} 11/07/2021 06:59:36 - INFO - __main__ - Step 69437: {'lr': 0.00028472087016027703, 'samples': 13331904, 'steps': 69436, 'loss/train': 1.3829954862594604} 11/07/2021 06:59:37 - INFO - __main__ - Step 69438: {'lr': 0.0002847156148372052, 'samples': 13332096, 'steps': 69437, 'loss/train': 1.2724924087524414} 11/07/2021 06:59:38 - INFO - __main__ - Step 69439: {'lr': 0.00028471035949849106, 'samples': 13332288, 'steps': 69438, 'loss/train': 1.3790868520736694} 11/07/2021 06:59:38 - INFO - __main__ - Step 69440: {'lr': 0.00028470510414413695, 'samples': 13332480, 'steps': 69439, 'loss/train': 0.9972846508026123} 11/07/2021 06:59:39 - INFO - __main__ - Step 69441: {'lr': 0.0002846998487741452, 'samples': 13332672, 'steps': 69440, 'loss/train': 1.2708884477615356} 11/07/2021 06:59:39 - INFO - __main__ - Step 69442: {'lr': 0.00028469459338851833, 'samples': 13332864, 'steps': 69441, 'loss/train': 1.0764415264129639} 11/07/2021 06:59:39 - INFO - __main__ - Step 69443: {'lr': 0.0002846893379872586, 'samples': 13333056, 'steps': 69442, 'loss/train': 1.3255633115768433} 11/07/2021 06:59:40 - INFO - __main__ - Step 69444: {'lr': 0.0002846840825703684, 'samples': 13333248, 'steps': 69443, 'loss/train': 1.031990885734558} 11/07/2021 06:59:41 - INFO - __main__ - Step 69445: {'lr': 0.0002846788271378501, 'samples': 13333440, 'steps': 69444, 'loss/train': 1.1940163373947144} 11/07/2021 06:59:41 - INFO - __main__ - Step 69446: {'lr': 0.000284673571689706, 'samples': 13333632, 'steps': 69445, 'loss/train': 1.2799419164657593} 11/07/2021 06:59:41 - INFO - __main__ - Step 69447: {'lr': 0.0002846683162259385, 'samples': 13333824, 'steps': 69446, 'loss/train': 1.7075542211532593} 11/07/2021 06:59:42 - INFO - __main__ - Step 69448: {'lr': 0.00028466306074655004, 'samples': 13334016, 'steps': 69447, 'loss/train': 1.8469188213348389} 11/07/2021 06:59:42 - INFO - __main__ - Step 69449: {'lr': 0.00028465780525154297, 'samples': 13334208, 'steps': 69448, 'loss/train': 1.3771471977233887} 11/07/2021 06:59:42 - INFO - __main__ - Step 69450: {'lr': 0.00028465254974091955, 'samples': 13334400, 'steps': 69449, 'loss/train': 1.5053107738494873} 11/07/2021 06:59:44 - INFO - __main__ - Step 69451: {'lr': 0.00028464729421468225, 'samples': 13334592, 'steps': 69450, 'loss/train': 1.32041597366333} 11/07/2021 06:59:44 - INFO - __main__ - Step 69452: {'lr': 0.0002846420386728334, 'samples': 13334784, 'steps': 69451, 'loss/train': 1.039367914199829} 11/07/2021 06:59:44 - INFO - __main__ - Step 69453: {'lr': 0.00028463678311537545, 'samples': 13334976, 'steps': 69452, 'loss/train': 0.7127858400344849} 11/07/2021 06:59:45 - INFO - __main__ - Step 69454: {'lr': 0.00028463152754231065, 'samples': 13335168, 'steps': 69453, 'loss/train': 1.3262513875961304} 11/07/2021 06:59:45 - INFO - __main__ - Step 69455: {'lr': 0.0002846262719536414, 'samples': 13335360, 'steps': 69454, 'loss/train': 0.9329615831375122} 11/07/2021 06:59:46 - INFO - __main__ - Step 69456: {'lr': 0.00028462101634937014, 'samples': 13335552, 'steps': 69455, 'loss/train': 1.5050126314163208} 11/07/2021 06:59:46 - INFO - __main__ - Step 69457: {'lr': 0.00028461576072949925, 'samples': 13335744, 'steps': 69456, 'loss/train': 1.9099366664886475} 11/07/2021 06:59:47 - INFO - __main__ - Step 69458: {'lr': 0.0002846105050940309, 'samples': 13335936, 'steps': 69457, 'loss/train': 1.726913332939148} 11/07/2021 06:59:47 - INFO - __main__ - Step 69459: {'lr': 0.00028460524944296764, 'samples': 13336128, 'steps': 69458, 'loss/train': 2.0578479766845703} 11/07/2021 06:59:48 - INFO - __main__ - Step 69460: {'lr': 0.00028459999377631175, 'samples': 13336320, 'steps': 69459, 'loss/train': 1.5252329111099243} 11/07/2021 06:59:48 - INFO - __main__ - Step 69461: {'lr': 0.0002845947380940657, 'samples': 13336512, 'steps': 69460, 'loss/train': 1.4309360980987549} 11/07/2021 06:59:49 - INFO - __main__ - Step 69462: {'lr': 0.0002845894823962317, 'samples': 13336704, 'steps': 69461, 'loss/train': 1.9604241847991943} 11/07/2021 06:59:49 - INFO - __main__ - Step 69463: {'lr': 0.0002845842266828123, 'samples': 13336896, 'steps': 69462, 'loss/train': 1.2829490900039673} 11/07/2021 06:59:50 - INFO - __main__ - Step 69464: {'lr': 0.0002845789709538098, 'samples': 13337088, 'steps': 69463, 'loss/train': 1.30887770652771} 11/07/2021 06:59:50 - INFO - __main__ - Step 69465: {'lr': 0.00028457371520922647, 'samples': 13337280, 'steps': 69464, 'loss/train': 1.5828886032104492} 11/07/2021 06:59:51 - INFO - __main__ - Step 69466: {'lr': 0.0002845684594490648, 'samples': 13337472, 'steps': 69465, 'loss/train': 1.3685013055801392} 11/07/2021 06:59:51 - INFO - __main__ - Step 69467: {'lr': 0.0002845632036733271, 'samples': 13337664, 'steps': 69466, 'loss/train': 1.9253170490264893} 11/07/2021 06:59:52 - INFO - __main__ - Step 69468: {'lr': 0.0002845579478820158, 'samples': 13337856, 'steps': 69467, 'loss/train': 1.7379348278045654} 11/07/2021 06:59:52 - INFO - __main__ - Step 69469: {'lr': 0.00028455269207513313, 'samples': 13338048, 'steps': 69468, 'loss/train': 0.9244740009307861} 11/07/2021 06:59:52 - INFO - __main__ - Step 69470: {'lr': 0.0002845474362526816, 'samples': 13338240, 'steps': 69469, 'loss/train': 1.7305673360824585} 11/07/2021 06:59:53 - INFO - __main__ - Step 69471: {'lr': 0.00028454218041466356, 'samples': 13338432, 'steps': 69470, 'loss/train': 1.2667357921600342} 11/07/2021 06:59:54 - INFO - __main__ - Step 69472: {'lr': 0.00028453692456108134, 'samples': 13338624, 'steps': 69471, 'loss/train': 0.16707639396190643} 11/07/2021 06:59:54 - INFO - __main__ - Step 69473: {'lr': 0.00028453166869193725, 'samples': 13338816, 'steps': 69472, 'loss/train': 1.5511664152145386} 11/07/2021 06:59:55 - INFO - __main__ - Step 69474: {'lr': 0.00028452641280723377, 'samples': 13339008, 'steps': 69473, 'loss/train': 1.1916675567626953} 11/07/2021 06:59:55 - INFO - __main__ - Step 69475: {'lr': 0.00028452115690697324, 'samples': 13339200, 'steps': 69474, 'loss/train': 1.483536720275879} 11/07/2021 06:59:56 - INFO - __main__ - Step 69476: {'lr': 0.000284515900991158, 'samples': 13339392, 'steps': 69475, 'loss/train': 0.7474353909492493} 11/07/2021 06:59:56 - INFO - __main__ - Step 69477: {'lr': 0.0002845106450597904, 'samples': 13339584, 'steps': 69476, 'loss/train': 1.1043896675109863} 11/07/2021 06:59:57 - INFO - __main__ - Step 69478: {'lr': 0.0002845053891128729, 'samples': 13339776, 'steps': 69477, 'loss/train': 1.4719339609146118} 11/07/2021 06:59:57 - INFO - __main__ - Step 69479: {'lr': 0.0002845001331504077, 'samples': 13339968, 'steps': 69478, 'loss/train': 1.261014461517334} 11/07/2021 06:59:57 - INFO - __main__ - Step 69480: {'lr': 0.00028449487717239737, 'samples': 13340160, 'steps': 69479, 'loss/train': 1.0647169351577759} 11/07/2021 06:59:58 - INFO - __main__ - Step 69481: {'lr': 0.00028448962117884406, 'samples': 13340352, 'steps': 69480, 'loss/train': 1.353854775428772} 11/07/2021 06:59:59 - INFO - __main__ - Step 69482: {'lr': 0.00028448436516975034, 'samples': 13340544, 'steps': 69481, 'loss/train': 1.3913668394088745} 11/07/2021 06:59:59 - INFO - __main__ - Step 69483: {'lr': 0.00028447910914511853, 'samples': 13340736, 'steps': 69482, 'loss/train': 1.266734004020691} 11/07/2021 06:59:59 - INFO - __main__ - Step 69484: {'lr': 0.0002844738531049509, 'samples': 13340928, 'steps': 69483, 'loss/train': 1.6609755754470825} 11/07/2021 07:00:00 - INFO - __main__ - Step 69485: {'lr': 0.00028446859704925, 'samples': 13341120, 'steps': 69484, 'loss/train': 0.897250235080719} 11/07/2021 07:00:00 - INFO - __main__ - Step 69486: {'lr': 0.00028446334097801795, 'samples': 13341312, 'steps': 69485, 'loss/train': 1.6307495832443237} 11/07/2021 07:00:01 - INFO - __main__ - Step 69487: {'lr': 0.0002844580848912573, 'samples': 13341504, 'steps': 69486, 'loss/train': 1.5133419036865234} 11/07/2021 07:00:02 - INFO - __main__ - Step 69488: {'lr': 0.0002844528287889703, 'samples': 13341696, 'steps': 69487, 'loss/train': 1.2832108736038208} 11/07/2021 07:00:02 - INFO - __main__ - Step 69489: {'lr': 0.0002844475726711595, 'samples': 13341888, 'steps': 69488, 'loss/train': 1.4705612659454346} 11/07/2021 07:00:02 - INFO - __main__ - Step 69490: {'lr': 0.00028444231653782713, 'samples': 13342080, 'steps': 69489, 'loss/train': 1.4464468955993652} 11/07/2021 07:00:03 - INFO - __main__ - Step 69491: {'lr': 0.0002844370603889755, 'samples': 13342272, 'steps': 69490, 'loss/train': 1.4410910606384277} 11/07/2021 07:00:04 - INFO - __main__ - Step 69492: {'lr': 0.0002844318042246072, 'samples': 13342464, 'steps': 69491, 'loss/train': 1.6681523323059082} 11/07/2021 07:00:04 - INFO - __main__ - Step 69493: {'lr': 0.00028442654804472435, 'samples': 13342656, 'steps': 69492, 'loss/train': 1.5299092531204224} 11/07/2021 07:00:05 - INFO - __main__ - Step 69494: {'lr': 0.00028442129184932946, 'samples': 13342848, 'steps': 69493, 'loss/train': 1.4272712469100952} 11/07/2021 07:00:05 - INFO - __main__ - Step 69495: {'lr': 0.00028441603563842495, 'samples': 13343040, 'steps': 69494, 'loss/train': 1.5834254026412964} 11/07/2021 07:00:06 - INFO - __main__ - Step 69496: {'lr': 0.000284410779412013, 'samples': 13343232, 'steps': 69495, 'loss/train': 1.1500484943389893} 11/07/2021 07:00:06 - INFO - __main__ - Step 69497: {'lr': 0.0002844055231700961, 'samples': 13343424, 'steps': 69496, 'loss/train': 0.18231552839279175} 11/07/2021 07:00:07 - INFO - __main__ - Step 69498: {'lr': 0.0002844002669126766, 'samples': 13343616, 'steps': 69497, 'loss/train': 1.4461398124694824} 11/07/2021 07:00:07 - INFO - __main__ - Step 69499: {'lr': 0.0002843950106397569, 'samples': 13343808, 'steps': 69498, 'loss/train': 0.9501680135726929} 11/07/2021 07:00:08 - INFO - __main__ - Step 69500: {'lr': 0.0002843897543513393, 'samples': 13344000, 'steps': 69499, 'loss/train': 1.807453989982605} 11/07/2021 07:00:08 - INFO - __main__ - Step 69501: {'lr': 0.00028438449804742626, 'samples': 13344192, 'steps': 69500, 'loss/train': 1.175445556640625} 11/07/2021 07:00:08 - INFO - __main__ - Step 69502: {'lr': 0.00028437924172802006, 'samples': 13344384, 'steps': 69501, 'loss/train': 1.442716121673584} 11/07/2021 07:00:09 - INFO - __main__ - Step 69503: {'lr': 0.0002843739853931231, 'samples': 13344576, 'steps': 69502, 'loss/train': 1.2830413579940796} 11/07/2021 07:00:10 - INFO - __main__ - Step 69504: {'lr': 0.00028436872904273776, 'samples': 13344768, 'steps': 69503, 'loss/train': 0.893492579460144} 11/07/2021 07:00:10 - INFO - __main__ - Step 69505: {'lr': 0.00028436347267686633, 'samples': 13344960, 'steps': 69504, 'loss/train': 2.3043200969696045} 11/07/2021 07:00:10 - INFO - __main__ - Step 69506: {'lr': 0.0002843582162955114, 'samples': 13345152, 'steps': 69505, 'loss/train': 1.498862385749817} 11/07/2021 07:00:11 - INFO - __main__ - Step 69507: {'lr': 0.0002843529598986751, 'samples': 13345344, 'steps': 69506, 'loss/train': 1.3914151191711426} 11/07/2021 07:00:12 - INFO - __main__ - Step 69508: {'lr': 0.0002843477034863599, 'samples': 13345536, 'steps': 69507, 'loss/train': 1.6607691049575806} 11/07/2021 07:00:12 - INFO - __main__ - Step 69509: {'lr': 0.00028434244705856815, 'samples': 13345728, 'steps': 69508, 'loss/train': 0.5044597387313843} 11/07/2021 07:00:13 - INFO - __main__ - Step 69510: {'lr': 0.00028433719061530215, 'samples': 13345920, 'steps': 69509, 'loss/train': 1.2353368997573853} 11/07/2021 07:00:13 - INFO - __main__ - Step 69511: {'lr': 0.00028433193415656447, 'samples': 13346112, 'steps': 69510, 'loss/train': 1.2917324304580688} 11/07/2021 07:00:13 - INFO - __main__ - Step 69512: {'lr': 0.00028432667768235734, 'samples': 13346304, 'steps': 69511, 'loss/train': 1.9239977598190308} 11/07/2021 07:00:14 - INFO - __main__ - Step 69513: {'lr': 0.0002843214211926831, 'samples': 13346496, 'steps': 69512, 'loss/train': 1.385046124458313} 11/07/2021 07:00:15 - INFO - __main__ - Step 69514: {'lr': 0.0002843161646875441, 'samples': 13346688, 'steps': 69513, 'loss/train': 1.2054096460342407} 11/07/2021 07:00:15 - INFO - __main__ - Step 69515: {'lr': 0.0002843109081669428, 'samples': 13346880, 'steps': 69514, 'loss/train': 1.5308952331542969} 11/07/2021 07:00:15 - INFO - __main__ - Step 69516: {'lr': 0.0002843056516308816, 'samples': 13347072, 'steps': 69515, 'loss/train': 1.5832221508026123} 11/07/2021 07:00:16 - INFO - __main__ - Step 69517: {'lr': 0.00028430039507936275, 'samples': 13347264, 'steps': 69516, 'loss/train': 1.4383820295333862} 11/07/2021 07:00:17 - INFO - __main__ - Step 69518: {'lr': 0.0002842951385123887, 'samples': 13347456, 'steps': 69517, 'loss/train': 1.1877727508544922} 11/07/2021 07:00:17 - INFO - __main__ - Step 69519: {'lr': 0.00028428988192996175, 'samples': 13347648, 'steps': 69518, 'loss/train': 1.5308458805084229} 11/07/2021 07:00:17 - INFO - __main__ - Step 69520: {'lr': 0.00028428462533208434, 'samples': 13347840, 'steps': 69519, 'loss/train': 1.5264285802841187} 11/07/2021 07:00:18 - INFO - __main__ - Step 69521: {'lr': 0.0002842793687187588, 'samples': 13348032, 'steps': 69520, 'loss/train': 1.5017364025115967} 11/07/2021 07:00:18 - INFO - __main__ - Step 69522: {'lr': 0.00028427411208998746, 'samples': 13348224, 'steps': 69521, 'loss/train': 1.0838444232940674} 11/07/2021 07:00:19 - INFO - __main__ - Step 69523: {'lr': 0.00028426885544577277, 'samples': 13348416, 'steps': 69522, 'loss/train': 1.562145471572876} 11/07/2021 07:00:19 - INFO - __main__ - Step 69524: {'lr': 0.0002842635987861171, 'samples': 13348608, 'steps': 69523, 'loss/train': 1.555387258529663} 11/07/2021 07:00:20 - INFO - __main__ - Step 69525: {'lr': 0.0002842583421110227, 'samples': 13348800, 'steps': 69524, 'loss/train': 1.4960672855377197} 11/07/2021 07:00:20 - INFO - __main__ - Step 69526: {'lr': 0.00028425308542049207, 'samples': 13348992, 'steps': 69525, 'loss/train': 1.4941802024841309} 11/07/2021 07:00:20 - INFO - __main__ - Step 69527: {'lr': 0.00028424782871452745, 'samples': 13349184, 'steps': 69526, 'loss/train': 1.3823983669281006} 11/07/2021 07:00:22 - INFO - __main__ - Step 69528: {'lr': 0.00028424257199313144, 'samples': 13349376, 'steps': 69527, 'loss/train': 1.1525033712387085} 11/07/2021 07:00:22 - INFO - __main__ - Step 69529: {'lr': 0.00028423731525630615, 'samples': 13349568, 'steps': 69528, 'loss/train': 1.376878023147583} 11/07/2021 07:00:23 - INFO - __main__ - Step 69530: {'lr': 0.000284232058504054, 'samples': 13349760, 'steps': 69529, 'loss/train': 1.578144907951355} 11/07/2021 07:00:23 - INFO - __main__ - Step 69531: {'lr': 0.0002842268017363776, 'samples': 13349952, 'steps': 69530, 'loss/train': 1.739909291267395} 11/07/2021 07:00:23 - INFO - __main__ - Step 69532: {'lr': 0.000284221544953279, 'samples': 13350144, 'steps': 69531, 'loss/train': 1.3503212928771973} 11/07/2021 07:00:24 - INFO - __main__ - Step 69533: {'lr': 0.0002842162881547607, 'samples': 13350336, 'steps': 69532, 'loss/train': 1.4216479063034058} 11/07/2021 07:00:24 - INFO - __main__ - Step 69534: {'lr': 0.0002842110313408251, 'samples': 13350528, 'steps': 69533, 'loss/train': 1.5198193788528442} 11/07/2021 07:00:25 - INFO - __main__ - Step 69535: {'lr': 0.0002842057745114745, 'samples': 13350720, 'steps': 69534, 'loss/train': 0.8929924368858337} 11/07/2021 07:00:25 - INFO - __main__ - Step 69536: {'lr': 0.00028420051766671133, 'samples': 13350912, 'steps': 69535, 'loss/train': 0.6976807117462158} 11/07/2021 07:00:26 - INFO - __main__ - Step 69537: {'lr': 0.0002841952608065379, 'samples': 13351104, 'steps': 69536, 'loss/train': 1.34248685836792} 11/07/2021 07:00:26 - INFO - __main__ - Step 69538: {'lr': 0.0002841900039309567, 'samples': 13351296, 'steps': 69537, 'loss/train': 1.9001487493515015} 11/07/2021 07:00:26 - INFO - __main__ - Step 69539: {'lr': 0.00028418474703997, 'samples': 13351488, 'steps': 69538, 'loss/train': 1.405310034751892} 11/07/2021 07:00:28 - INFO - __main__ - Step 69540: {'lr': 0.0002841794901335801, 'samples': 13351680, 'steps': 69539, 'loss/train': 1.1971741914749146} 11/07/2021 07:00:28 - INFO - __main__ - Step 69541: {'lr': 0.0002841742332117895, 'samples': 13351872, 'steps': 69540, 'loss/train': 1.3917133808135986} 11/07/2021 07:00:28 - INFO - __main__ - Step 69542: {'lr': 0.0002841689762746005, 'samples': 13352064, 'steps': 69541, 'loss/train': 1.8056408166885376} 11/07/2021 07:00:29 - INFO - __main__ - Step 69543: {'lr': 0.00028416371932201546, 'samples': 13352256, 'steps': 69542, 'loss/train': 1.795831322669983} 11/07/2021 07:00:29 - INFO - __main__ - Step 69544: {'lr': 0.0002841584623540368, 'samples': 13352448, 'steps': 69543, 'loss/train': 0.9098688960075378} 11/07/2021 07:00:29 - INFO - __main__ - Step 69545: {'lr': 0.00028415320537066697, 'samples': 13352640, 'steps': 69544, 'loss/train': 1.3899904489517212} 11/07/2021 07:00:31 - INFO - __main__ - Step 69546: {'lr': 0.0002841479483719081, 'samples': 13352832, 'steps': 69545, 'loss/train': 1.1752156019210815} 11/07/2021 07:00:31 - INFO - __main__ - Step 69547: {'lr': 0.00028414269135776274, 'samples': 13353024, 'steps': 69546, 'loss/train': 1.78446626663208} 11/07/2021 07:00:31 - INFO - __main__ - Step 69548: {'lr': 0.0002841374343282332, 'samples': 13353216, 'steps': 69547, 'loss/train': 1.232306718826294} 11/07/2021 07:00:32 - INFO - __main__ - Step 69549: {'lr': 0.00028413217728332185, 'samples': 13353408, 'steps': 69548, 'loss/train': 1.2978500127792358} 11/07/2021 07:00:32 - INFO - __main__ - Step 69550: {'lr': 0.0002841269202230311, 'samples': 13353600, 'steps': 69549, 'loss/train': 1.2779531478881836} 11/07/2021 07:00:33 - INFO - __main__ - Step 69551: {'lr': 0.0002841216631473633, 'samples': 13353792, 'steps': 69550, 'loss/train': 1.65526282787323} 11/07/2021 07:00:33 - INFO - __main__ - Step 69552: {'lr': 0.00028411640605632073, 'samples': 13353984, 'steps': 69551, 'loss/train': 1.870110034942627} 11/07/2021 07:00:34 - INFO - __main__ - Step 69553: {'lr': 0.0002841111489499059, 'samples': 13354176, 'steps': 69552, 'loss/train': 1.1734062433242798} 11/07/2021 07:00:34 - INFO - __main__ - Step 69554: {'lr': 0.000284105891828121, 'samples': 13354368, 'steps': 69553, 'loss/train': 1.541858196258545} 11/07/2021 07:00:34 - INFO - __main__ - Step 69555: {'lr': 0.0002841006346909686, 'samples': 13354560, 'steps': 69554, 'loss/train': 0.1924009770154953} 11/07/2021 07:00:35 - INFO - __main__ - Step 69556: {'lr': 0.000284095377538451, 'samples': 13354752, 'steps': 69555, 'loss/train': 1.494123101234436} 11/07/2021 07:00:36 - INFO - __main__ - Step 69557: {'lr': 0.00028409012037057047, 'samples': 13354944, 'steps': 69556, 'loss/train': 1.4773873090744019} 11/07/2021 07:00:36 - INFO - __main__ - Step 69558: {'lr': 0.00028408486318732954, 'samples': 13355136, 'steps': 69557, 'loss/train': 1.507440209388733} 11/07/2021 07:00:36 - INFO - __main__ - Step 69559: {'lr': 0.0002840796059887305, 'samples': 13355328, 'steps': 69558, 'loss/train': 1.5395426750183105} 11/07/2021 07:00:37 - INFO - __main__ - Step 69560: {'lr': 0.00028407434877477565, 'samples': 13355520, 'steps': 69559, 'loss/train': 1.7781226634979248} 11/07/2021 07:00:37 - INFO - __main__ - Step 69561: {'lr': 0.00028406909154546746, 'samples': 13355712, 'steps': 69560, 'loss/train': 1.5836024284362793} 11/07/2021 07:00:38 - INFO - __main__ - Step 69562: {'lr': 0.00028406383430080827, 'samples': 13355904, 'steps': 69561, 'loss/train': 1.576811671257019} 11/07/2021 07:00:39 - INFO - __main__ - Step 69563: {'lr': 0.0002840585770408004, 'samples': 13356096, 'steps': 69562, 'loss/train': 1.358682632446289} 11/07/2021 07:00:39 - INFO - __main__ - Step 69564: {'lr': 0.0002840533197654463, 'samples': 13356288, 'steps': 69563, 'loss/train': 1.4455560445785522} 11/07/2021 07:00:39 - INFO - __main__ - Step 69565: {'lr': 0.00028404806247474837, 'samples': 13356480, 'steps': 69564, 'loss/train': 1.4476898908615112} 11/07/2021 07:00:40 - INFO - __main__ - Step 69566: {'lr': 0.00028404280516870886, 'samples': 13356672, 'steps': 69565, 'loss/train': 1.4888988733291626} 11/07/2021 07:00:41 - INFO - __main__ - Step 69567: {'lr': 0.0002840375478473301, 'samples': 13356864, 'steps': 69566, 'loss/train': 5.722508430480957} 11/07/2021 07:00:41 - INFO - __main__ - Step 69568: {'lr': 0.00028403229051061457, 'samples': 13357056, 'steps': 69567, 'loss/train': 1.542909860610962} 11/07/2021 07:00:41 - INFO - __main__ - Step 69569: {'lr': 0.00028402703315856466, 'samples': 13357248, 'steps': 69568, 'loss/train': 1.7328336238861084} 11/07/2021 07:00:42 - INFO - __main__ - Step 69570: {'lr': 0.00028402177579118273, 'samples': 13357440, 'steps': 69569, 'loss/train': 1.4170881509780884} 11/07/2021 07:00:42 - INFO - __main__ - Step 69571: {'lr': 0.00028401651840847104, 'samples': 13357632, 'steps': 69570, 'loss/train': 1.0438237190246582} 11/07/2021 07:00:42 - INFO - __main__ - Step 69572: {'lr': 0.00028401126101043205, 'samples': 13357824, 'steps': 69571, 'loss/train': 1.9168272018432617} 11/07/2021 07:00:44 - INFO - __main__ - Step 69573: {'lr': 0.0002840060035970681, 'samples': 13358016, 'steps': 69572, 'loss/train': 1.2992286682128906} 11/07/2021 07:00:44 - INFO - __main__ - Step 69574: {'lr': 0.0002840007461683816, 'samples': 13358208, 'steps': 69573, 'loss/train': 1.1534667015075684} 11/07/2021 07:00:44 - INFO - __main__ - Step 69575: {'lr': 0.00028399548872437493, 'samples': 13358400, 'steps': 69574, 'loss/train': 1.2833493947982788} 11/07/2021 07:00:45 - INFO - __main__ - Step 69576: {'lr': 0.0002839902312650503, 'samples': 13358592, 'steps': 69575, 'loss/train': 1.4344264268875122} 11/07/2021 07:00:45 - INFO - __main__ - Step 69577: {'lr': 0.00028398497379041027, 'samples': 13358784, 'steps': 69576, 'loss/train': 0.09576896578073502} 11/07/2021 07:00:47 - INFO - __main__ - Step 69578: {'lr': 0.00028397971630045717, 'samples': 13358976, 'steps': 69577, 'loss/train': 1.3806473016738892} 11/07/2021 07:00:47 - INFO - __main__ - Step 69579: {'lr': 0.0002839744587951933, 'samples': 13359168, 'steps': 69578, 'loss/train': 1.4002598524093628} 11/07/2021 07:00:47 - INFO - __main__ - Step 69580: {'lr': 0.00028396920127462107, 'samples': 13359360, 'steps': 69579, 'loss/train': 1.6332323551177979} 11/07/2021 07:00:48 - INFO - __main__ - Step 69581: {'lr': 0.0002839639437387428, 'samples': 13359552, 'steps': 69580, 'loss/train': 0.7248629927635193} 11/07/2021 07:00:48 - INFO - __main__ - Step 69582: {'lr': 0.00028395868618756094, 'samples': 13359744, 'steps': 69581, 'loss/train': 0.15184824168682098} 11/07/2021 07:00:49 - INFO - __main__ - Step 69583: {'lr': 0.00028395342862107774, 'samples': 13359936, 'steps': 69582, 'loss/train': 1.5231870412826538} 11/07/2021 07:00:49 - INFO - __main__ - Step 69584: {'lr': 0.0002839481710392957, 'samples': 13360128, 'steps': 69583, 'loss/train': 1.5768697261810303} 11/07/2021 07:00:50 - INFO - __main__ - Step 69585: {'lr': 0.00028394291344221724, 'samples': 13360320, 'steps': 69584, 'loss/train': 0.8304705619812012} 11/07/2021 07:00:50 - INFO - __main__ - Step 69586: {'lr': 0.00028393765582984454, 'samples': 13360512, 'steps': 69585, 'loss/train': 1.556376576423645} 11/07/2021 07:00:50 - INFO - __main__ - Step 69587: {'lr': 0.00028393239820218003, 'samples': 13360704, 'steps': 69586, 'loss/train': 1.5798017978668213} 11/07/2021 07:00:52 - INFO - __main__ - Step 69588: {'lr': 0.00028392714055922616, 'samples': 13360896, 'steps': 69587, 'loss/train': 1.7354179620742798} 11/07/2021 07:00:52 - INFO - __main__ - Step 69589: {'lr': 0.0002839218829009852, 'samples': 13361088, 'steps': 69588, 'loss/train': 1.3214064836502075} 11/07/2021 07:00:52 - INFO - __main__ - Step 69590: {'lr': 0.00028391662522745954, 'samples': 13361280, 'steps': 69589, 'loss/train': 1.0108016729354858} 11/07/2021 07:00:53 - INFO - __main__ - Step 69591: {'lr': 0.0002839113675386516, 'samples': 13361472, 'steps': 69590, 'loss/train': 1.4592339992523193} 11/07/2021 07:00:53 - INFO - __main__ - Step 69592: {'lr': 0.00028390610983456376, 'samples': 13361664, 'steps': 69591, 'loss/train': 1.6779838800430298} 11/07/2021 07:00:54 - INFO - __main__ - Step 69593: {'lr': 0.00028390085211519835, 'samples': 13361856, 'steps': 69592, 'loss/train': 1.3625363111495972} 11/07/2021 07:00:54 - INFO - __main__ - Step 69594: {'lr': 0.0002838955943805577, 'samples': 13362048, 'steps': 69593, 'loss/train': 0.9646394848823547} 11/07/2021 07:00:55 - INFO - __main__ - Step 69595: {'lr': 0.0002838903366306442, 'samples': 13362240, 'steps': 69594, 'loss/train': 1.2676833868026733} 11/07/2021 07:00:55 - INFO - __main__ - Step 69596: {'lr': 0.0002838850788654603, 'samples': 13362432, 'steps': 69595, 'loss/train': 1.3325022459030151} 11/07/2021 07:00:55 - INFO - __main__ - Step 69597: {'lr': 0.00028387982108500826, 'samples': 13362624, 'steps': 69596, 'loss/train': 1.218185544013977} 11/07/2021 07:00:56 - INFO - __main__ - Step 69598: {'lr': 0.0002838745632892905, 'samples': 13362816, 'steps': 69597, 'loss/train': 1.0263361930847168} 11/07/2021 07:00:57 - INFO - __main__ - Step 69599: {'lr': 0.00028386930547830944, 'samples': 13363008, 'steps': 69598, 'loss/train': 1.4161254167556763} 11/07/2021 07:00:57 - INFO - __main__ - Step 69600: {'lr': 0.0002838640476520673, 'samples': 13363200, 'steps': 69599, 'loss/train': 1.7520554065704346} 11/07/2021 07:00:57 - INFO - __main__ - Step 69601: {'lr': 0.0002838587898105666, 'samples': 13363392, 'steps': 69600, 'loss/train': 1.3152990341186523} 11/07/2021 07:00:58 - INFO - __main__ - Step 69602: {'lr': 0.00028385353195380965, 'samples': 13363584, 'steps': 69601, 'loss/train': 1.5036002397537231} 11/07/2021 07:00:58 - INFO - __main__ - Step 69603: {'lr': 0.0002838482740817988, 'samples': 13363776, 'steps': 69602, 'loss/train': 1.4521002769470215} 11/07/2021 07:00:59 - INFO - __main__ - Step 69604: {'lr': 0.0002838430161945365, 'samples': 13363968, 'steps': 69603, 'loss/train': 0.9280676245689392} 11/07/2021 07:01:00 - INFO - __main__ - Step 69605: {'lr': 0.000283837758292025, 'samples': 13364160, 'steps': 69604, 'loss/train': 1.514127492904663} 11/07/2021 07:01:00 - INFO - __main__ - Step 69606: {'lr': 0.00028383250037426674, 'samples': 13364352, 'steps': 69605, 'loss/train': 0.7809870839118958} 11/07/2021 07:01:00 - INFO - __main__ - Step 69607: {'lr': 0.00028382724244126406, 'samples': 13364544, 'steps': 69606, 'loss/train': 1.041643500328064} 11/07/2021 07:01:01 - INFO - __main__ - Step 69608: {'lr': 0.0002838219844930193, 'samples': 13364736, 'steps': 69607, 'loss/train': 1.4894057512283325} 11/07/2021 07:01:02 - INFO - __main__ - Step 69609: {'lr': 0.000283816726529535, 'samples': 13364928, 'steps': 69608, 'loss/train': 1.8263635635375977} 11/07/2021 07:01:02 - INFO - __main__ - Step 69610: {'lr': 0.0002838114685508133, 'samples': 13365120, 'steps': 69609, 'loss/train': 1.11962890625} 11/07/2021 07:01:02 - INFO - __main__ - Step 69611: {'lr': 0.0002838062105568567, 'samples': 13365312, 'steps': 69610, 'loss/train': 1.3234519958496094} 11/07/2021 07:01:03 - INFO - __main__ - Step 69612: {'lr': 0.00028380095254766766, 'samples': 13365504, 'steps': 69611, 'loss/train': 1.7067478895187378} 11/07/2021 07:01:03 - INFO - __main__ - Step 69613: {'lr': 0.00028379569452324825, 'samples': 13365696, 'steps': 69612, 'loss/train': 1.4017804861068726} 11/07/2021 07:01:04 - INFO - __main__ - Step 69614: {'lr': 0.0002837904364836011, 'samples': 13365888, 'steps': 69613, 'loss/train': 1.3910576105117798} 11/07/2021 07:01:04 - INFO - __main__ - Step 69615: {'lr': 0.00028378517842872855, 'samples': 13366080, 'steps': 69614, 'loss/train': 1.5542138814926147} 11/07/2021 07:01:05 - INFO - __main__ - Step 69616: {'lr': 0.00028377992035863285, 'samples': 13366272, 'steps': 69615, 'loss/train': 1.239464282989502} 11/07/2021 07:01:05 - INFO - __main__ - Step 69617: {'lr': 0.0002837746622733165, 'samples': 13366464, 'steps': 69616, 'loss/train': 1.5045443773269653} 11/07/2021 07:01:06 - INFO - __main__ - Step 69618: {'lr': 0.00028376940417278174, 'samples': 13366656, 'steps': 69617, 'loss/train': 1.639836311340332} 11/07/2021 07:01:07 - INFO - __main__ - Step 69619: {'lr': 0.0002837641460570311, 'samples': 13366848, 'steps': 69618, 'loss/train': 1.5862843990325928} 11/07/2021 07:01:07 - INFO - __main__ - Step 69620: {'lr': 0.00028375888792606677, 'samples': 13367040, 'steps': 69619, 'loss/train': 1.611844539642334} 11/07/2021 07:01:07 - INFO - __main__ - Step 69621: {'lr': 0.00028375362977989125, 'samples': 13367232, 'steps': 69620, 'loss/train': 1.3559651374816895} 11/07/2021 07:01:08 - INFO - __main__ - Step 69622: {'lr': 0.0002837483716185068, 'samples': 13367424, 'steps': 69621, 'loss/train': 1.2764517068862915} 11/07/2021 07:01:08 - INFO - __main__ - Step 69623: {'lr': 0.0002837431134419159, 'samples': 13367616, 'steps': 69622, 'loss/train': 1.246187686920166} 11/07/2021 07:01:09 - INFO - __main__ - Step 69624: {'lr': 0.00028373785525012094, 'samples': 13367808, 'steps': 69623, 'loss/train': 1.2050379514694214} 11/07/2021 07:01:10 - INFO - __main__ - Step 69625: {'lr': 0.00028373259704312417, 'samples': 13368000, 'steps': 69624, 'loss/train': 1.9246450662612915} 11/07/2021 07:01:10 - INFO - __main__ - Step 69626: {'lr': 0.00028372733882092797, 'samples': 13368192, 'steps': 69625, 'loss/train': 1.6943516731262207} 11/07/2021 07:01:10 - INFO - __main__ - Step 69627: {'lr': 0.0002837220805835348, 'samples': 13368384, 'steps': 69626, 'loss/train': 1.029379963874817} 11/07/2021 07:01:11 - INFO - __main__ - Step 69628: {'lr': 0.0002837168223309469, 'samples': 13368576, 'steps': 69627, 'loss/train': 1.5422831773757935} 11/07/2021 07:01:12 - INFO - __main__ - Step 69629: {'lr': 0.0002837115640631668, 'samples': 13368768, 'steps': 69628, 'loss/train': 1.8013521432876587} 11/07/2021 07:01:12 - INFO - __main__ - Step 69630: {'lr': 0.00028370630578019684, 'samples': 13368960, 'steps': 69629, 'loss/train': 1.6415117979049683} 11/07/2021 07:01:12 - INFO - __main__ - Step 69631: {'lr': 0.00028370104748203927, 'samples': 13369152, 'steps': 69630, 'loss/train': 1.65702486038208} 11/07/2021 07:01:13 - INFO - __main__ - Step 69632: {'lr': 0.0002836957891686965, 'samples': 13369344, 'steps': 69631, 'loss/train': 1.6715012788772583} 11/07/2021 07:01:13 - INFO - __main__ - Step 69633: {'lr': 0.00028369053084017094, 'samples': 13369536, 'steps': 69632, 'loss/train': 1.4403592348098755} 11/07/2021 07:01:13 - INFO - __main__ - Step 69634: {'lr': 0.000283685272496465, 'samples': 13369728, 'steps': 69633, 'loss/train': 2.170525550842285} 11/07/2021 07:01:14 - INFO - __main__ - Step 69635: {'lr': 0.000283680014137581, 'samples': 13369920, 'steps': 69634, 'loss/train': 1.693056583404541} 11/07/2021 07:01:15 - INFO - __main__ - Step 69636: {'lr': 0.00028367475576352125, 'samples': 13370112, 'steps': 69635, 'loss/train': 1.3893488645553589} 11/07/2021 07:01:15 - INFO - __main__ - Step 69637: {'lr': 0.00028366949737428814, 'samples': 13370304, 'steps': 69636, 'loss/train': 1.4653552770614624} 11/07/2021 07:01:15 - INFO - __main__ - Step 69638: {'lr': 0.0002836642389698841, 'samples': 13370496, 'steps': 69637, 'loss/train': 1.4772484302520752} 11/07/2021 07:01:16 - INFO - __main__ - Step 69639: {'lr': 0.0002836589805503115, 'samples': 13370688, 'steps': 69638, 'loss/train': 1.7043513059616089} 11/07/2021 07:01:17 - INFO - __main__ - Step 69640: {'lr': 0.0002836537221155727, 'samples': 13370880, 'steps': 69639, 'loss/train': 0.8570964932441711} 11/07/2021 07:01:17 - INFO - __main__ - Step 69641: {'lr': 0.0002836484636656701, 'samples': 13371072, 'steps': 69640, 'loss/train': 1.6847418546676636} 11/07/2021 07:01:18 - INFO - __main__ - Step 69642: {'lr': 0.00028364320520060595, 'samples': 13371264, 'steps': 69641, 'loss/train': 1.2600992918014526} 11/07/2021 07:01:18 - INFO - __main__ - Step 69643: {'lr': 0.0002836379467203827, 'samples': 13371456, 'steps': 69642, 'loss/train': 1.5442591905593872} 11/07/2021 07:01:18 - INFO - __main__ - Step 69644: {'lr': 0.0002836326882250027, 'samples': 13371648, 'steps': 69643, 'loss/train': 1.5512570142745972} 11/07/2021 07:01:21 - INFO - __main__ - Step 69645: {'lr': 0.00028362742971446833, 'samples': 13371840, 'steps': 69644, 'loss/train': 0.660060703754425} 11/07/2021 07:01:21 - INFO - __main__ - Step 69646: {'lr': 0.000283622171188782, 'samples': 13372032, 'steps': 69645, 'loss/train': 1.9059040546417236} 11/07/2021 07:01:21 - INFO - __main__ - Step 69647: {'lr': 0.000283616912647946, 'samples': 13372224, 'steps': 69646, 'loss/train': 1.699963927268982} 11/07/2021 07:01:22 - INFO - __main__ - Step 69648: {'lr': 0.0002836116540919627, 'samples': 13372416, 'steps': 69647, 'loss/train': 1.429447054862976} 11/07/2021 07:01:22 - INFO - __main__ - Step 69649: {'lr': 0.00028360639552083456, 'samples': 13372608, 'steps': 69648, 'loss/train': 1.6337229013442993} 11/07/2021 07:01:22 - INFO - __main__ - Step 69650: {'lr': 0.0002836011369345639, 'samples': 13372800, 'steps': 69649, 'loss/train': 1.2598620653152466} 11/07/2021 07:01:23 - INFO - __main__ - Step 69651: {'lr': 0.00028359587833315305, 'samples': 13372992, 'steps': 69650, 'loss/train': 1.8607358932495117} 11/07/2021 07:01:23 - INFO - __main__ - Step 69652: {'lr': 0.0002835906197166045, 'samples': 13373184, 'steps': 69651, 'loss/train': 1.030529260635376} 11/07/2021 07:01:24 - INFO - __main__ - Step 69653: {'lr': 0.00028358536108492047, 'samples': 13373376, 'steps': 69652, 'loss/train': 1.447892665863037} 11/07/2021 07:01:25 - INFO - __main__ - Step 69654: {'lr': 0.0002835801024381033, 'samples': 13373568, 'steps': 69653, 'loss/train': 2.283487319946289} 11/07/2021 07:01:25 - INFO - __main__ - Step 69655: {'lr': 0.0002835748437761556, 'samples': 13373760, 'steps': 69654, 'loss/train': 1.4294872283935547} 11/07/2021 07:01:25 - INFO - __main__ - Step 69656: {'lr': 0.00028356958509907955, 'samples': 13373952, 'steps': 69655, 'loss/train': 1.2237132787704468} 11/07/2021 07:01:26 - INFO - __main__ - Step 69657: {'lr': 0.0002835643264068776, 'samples': 13374144, 'steps': 69656, 'loss/train': 1.5952813625335693} 11/07/2021 07:01:27 - INFO - __main__ - Step 69658: {'lr': 0.000283559067699552, 'samples': 13374336, 'steps': 69657, 'loss/train': 1.5115691423416138} 11/07/2021 07:01:27 - INFO - __main__ - Step 69659: {'lr': 0.0002835538089771053, 'samples': 13374528, 'steps': 69658, 'loss/train': 1.4096001386642456} 11/07/2021 07:01:27 - INFO - __main__ - Step 69660: {'lr': 0.0002835485502395397, 'samples': 13374720, 'steps': 69659, 'loss/train': 1.39717698097229} 11/07/2021 07:01:28 - INFO - __main__ - Step 69661: {'lr': 0.0002835432914868576, 'samples': 13374912, 'steps': 69660, 'loss/train': 1.497478723526001} 11/07/2021 07:01:28 - INFO - __main__ - Step 69662: {'lr': 0.00028353803271906146, 'samples': 13375104, 'steps': 69661, 'loss/train': 1.2594598531723022} 11/07/2021 07:01:29 - INFO - __main__ - Step 69663: {'lr': 0.00028353277393615363, 'samples': 13375296, 'steps': 69662, 'loss/train': 1.7236250638961792} 11/07/2021 07:01:30 - INFO - __main__ - Step 69664: {'lr': 0.0002835275151381364, 'samples': 13375488, 'steps': 69663, 'loss/train': 1.7804802656173706} 11/07/2021 07:01:30 - INFO - __main__ - Step 69665: {'lr': 0.00028352225632501224, 'samples': 13375680, 'steps': 69664, 'loss/train': 1.294723629951477} 11/07/2021 07:01:30 - INFO - __main__ - Step 69666: {'lr': 0.00028351699749678346, 'samples': 13375872, 'steps': 69665, 'loss/train': 1.255731463432312} 11/07/2021 07:01:31 - INFO - __main__ - Step 69667: {'lr': 0.0002835117386534524, 'samples': 13376064, 'steps': 69666, 'loss/train': 1.6843980550765991} 11/07/2021 07:01:31 - INFO - __main__ - Step 69668: {'lr': 0.00028350647979502147, 'samples': 13376256, 'steps': 69667, 'loss/train': 1.4447741508483887} 11/07/2021 07:01:32 - INFO - __main__ - Step 69669: {'lr': 0.00028350122092149304, 'samples': 13376448, 'steps': 69668, 'loss/train': 1.5507704019546509} 11/07/2021 07:01:32 - INFO - __main__ - Step 69670: {'lr': 0.0002834959620328695, 'samples': 13376640, 'steps': 69669, 'loss/train': 1.238291621208191} 11/07/2021 07:01:33 - INFO - __main__ - Step 69671: {'lr': 0.00028349070312915317, 'samples': 13376832, 'steps': 69670, 'loss/train': 1.7903375625610352} 11/07/2021 07:01:33 - INFO - __main__ - Step 69672: {'lr': 0.0002834854442103465, 'samples': 13377024, 'steps': 69671, 'loss/train': 1.5916814804077148} 11/07/2021 07:01:33 - INFO - __main__ - Step 69673: {'lr': 0.0002834801852764518, 'samples': 13377216, 'steps': 69672, 'loss/train': 1.412153720855713} 11/07/2021 07:01:35 - INFO - __main__ - Step 69674: {'lr': 0.0002834749263274714, 'samples': 13377408, 'steps': 69673, 'loss/train': 1.1209478378295898} 11/07/2021 07:01:35 - INFO - __main__ - Step 69675: {'lr': 0.00028346966736340776, 'samples': 13377600, 'steps': 69674, 'loss/train': 1.5645192861557007} 11/07/2021 07:01:35 - INFO - __main__ - Step 69676: {'lr': 0.00028346440838426313, 'samples': 13377792, 'steps': 69675, 'loss/train': 1.5029433965682983} 11/07/2021 07:01:36 - INFO - __main__ - Step 69677: {'lr': 0.00028345914939003995, 'samples': 13377984, 'steps': 69676, 'loss/train': 1.4738794565200806} 11/07/2021 07:01:36 - INFO - __main__ - Step 69678: {'lr': 0.0002834538903807407, 'samples': 13378176, 'steps': 69677, 'loss/train': 0.9359455108642578} 11/07/2021 07:01:37 - INFO - __main__ - Step 69679: {'lr': 0.0002834486313563676, 'samples': 13378368, 'steps': 69678, 'loss/train': 0.39955785870552063} 11/07/2021 07:01:37 - INFO - __main__ - Step 69680: {'lr': 0.00028344337231692304, 'samples': 13378560, 'steps': 69679, 'loss/train': 1.1930419206619263} 11/07/2021 07:01:38 - INFO - __main__ - Step 69681: {'lr': 0.00028343811326240944, 'samples': 13378752, 'steps': 69680, 'loss/train': 1.4529262781143188} 11/07/2021 07:01:38 - INFO - __main__ - Step 69682: {'lr': 0.00028343285419282907, 'samples': 13378944, 'steps': 69681, 'loss/train': 1.7352741956710815} 11/07/2021 07:01:38 - INFO - __main__ - Step 69683: {'lr': 0.0002834275951081844, 'samples': 13379136, 'steps': 69682, 'loss/train': 1.674020528793335} 11/07/2021 07:01:40 - INFO - __main__ - Step 69684: {'lr': 0.0002834223360084778, 'samples': 13379328, 'steps': 69683, 'loss/train': 1.4655622243881226} 11/07/2021 07:01:40 - INFO - __main__ - Step 69685: {'lr': 0.0002834170768937116, 'samples': 13379520, 'steps': 69684, 'loss/train': 1.507903814315796} 11/07/2021 07:01:41 - INFO - __main__ - Step 69686: {'lr': 0.00028341181776388825, 'samples': 13379712, 'steps': 69685, 'loss/train': 1.4127824306488037} 11/07/2021 07:01:41 - INFO - __main__ - Step 69687: {'lr': 0.00028340655861901, 'samples': 13379904, 'steps': 69686, 'loss/train': 0.9989972710609436} 11/07/2021 07:01:41 - INFO - __main__ - Step 69688: {'lr': 0.00028340129945907924, 'samples': 13380096, 'steps': 69687, 'loss/train': 0.6972829699516296} 11/07/2021 07:01:42 - INFO - __main__ - Step 69689: {'lr': 0.00028339604028409837, 'samples': 13380288, 'steps': 69688, 'loss/train': 1.8250564336776733} 11/07/2021 07:01:43 - INFO - __main__ - Step 69690: {'lr': 0.00028339078109406975, 'samples': 13380480, 'steps': 69689, 'loss/train': 1.304648756980896} 11/07/2021 07:01:43 - INFO - __main__ - Step 69691: {'lr': 0.0002833855218889958, 'samples': 13380672, 'steps': 69690, 'loss/train': 1.871893286705017} 11/07/2021 07:01:43 - INFO - __main__ - Step 69692: {'lr': 0.00028338026266887885, 'samples': 13380864, 'steps': 69691, 'loss/train': 1.1874854564666748} 11/07/2021 07:01:44 - INFO - __main__ - Step 69693: {'lr': 0.00028337500343372123, 'samples': 13381056, 'steps': 69692, 'loss/train': 1.573485255241394} 11/07/2021 07:01:44 - INFO - __main__ - Step 69694: {'lr': 0.0002833697441835254, 'samples': 13381248, 'steps': 69693, 'loss/train': 1.046938180923462} 11/07/2021 07:01:45 - INFO - __main__ - Step 69695: {'lr': 0.00028336448491829365, 'samples': 13381440, 'steps': 69694, 'loss/train': 1.6393135786056519} 11/07/2021 07:01:45 - INFO - __main__ - Step 69696: {'lr': 0.00028335922563802834, 'samples': 13381632, 'steps': 69695, 'loss/train': 1.1796326637268066} 11/07/2021 07:01:46 - INFO - __main__ - Step 69697: {'lr': 0.00028335396634273193, 'samples': 13381824, 'steps': 69696, 'loss/train': 1.592590093612671} 11/07/2021 07:01:46 - INFO - __main__ - Step 69698: {'lr': 0.00028334870703240674, 'samples': 13382016, 'steps': 69697, 'loss/train': 1.073430061340332} 11/07/2021 07:01:47 - INFO - __main__ - Step 69699: {'lr': 0.00028334344770705516, 'samples': 13382208, 'steps': 69698, 'loss/train': 1.3369996547698975} 11/07/2021 07:01:48 - INFO - __main__ - Step 69700: {'lr': 0.00028333818836667946, 'samples': 13382400, 'steps': 69699, 'loss/train': 1.615559458732605} 11/07/2021 07:01:48 - INFO - __main__ - Step 69701: {'lr': 0.00028333292901128215, 'samples': 13382592, 'steps': 69700, 'loss/train': 1.8792952299118042} 11/07/2021 07:01:48 - INFO - __main__ - Step 69702: {'lr': 0.0002833276696408655, 'samples': 13382784, 'steps': 69701, 'loss/train': 1.5619713068008423} 11/07/2021 07:01:49 - INFO - __main__ - Step 69703: {'lr': 0.0002833224102554319, 'samples': 13382976, 'steps': 69702, 'loss/train': 0.9436840415000916} 11/07/2021 07:01:49 - INFO - __main__ - Step 69704: {'lr': 0.0002833171508549838, 'samples': 13383168, 'steps': 69703, 'loss/train': 1.1436896324157715} 11/07/2021 07:01:50 - INFO - __main__ - Step 69705: {'lr': 0.0002833118914395234, 'samples': 13383360, 'steps': 69704, 'loss/train': 1.685064673423767} 11/07/2021 07:01:50 - INFO - __main__ - Step 69706: {'lr': 0.0002833066320090533, 'samples': 13383552, 'steps': 69705, 'loss/train': 1.5223287343978882} 11/07/2021 07:01:51 - INFO - __main__ - Step 69707: {'lr': 0.0002833013725635757, 'samples': 13383744, 'steps': 69706, 'loss/train': 1.0396921634674072} 11/07/2021 07:01:51 - INFO - __main__ - Step 69708: {'lr': 0.000283296113103093, 'samples': 13383936, 'steps': 69707, 'loss/train': 1.438378930091858} 11/07/2021 07:01:51 - INFO - __main__ - Step 69709: {'lr': 0.00028329085362760757, 'samples': 13384128, 'steps': 69708, 'loss/train': 1.4622262716293335} 11/07/2021 07:01:52 - INFO - __main__ - Step 69710: {'lr': 0.00028328559413712186, 'samples': 13384320, 'steps': 69709, 'loss/train': 1.588579535484314} 11/07/2021 07:01:53 - INFO - __main__ - Step 69711: {'lr': 0.0002832803346316381, 'samples': 13384512, 'steps': 69710, 'loss/train': 1.3922277688980103} 11/07/2021 07:01:53 - INFO - __main__ - Step 69712: {'lr': 0.00028327507511115876, 'samples': 13384704, 'steps': 69711, 'loss/train': 0.9794602990150452} 11/07/2021 07:01:54 - INFO - __main__ - Step 69713: {'lr': 0.0002832698155756862, 'samples': 13384896, 'steps': 69712, 'loss/train': 1.5095713138580322} 11/07/2021 07:01:54 - INFO - __main__ - Step 69714: {'lr': 0.00028326455602522275, 'samples': 13385088, 'steps': 69713, 'loss/train': 1.4232604503631592} 11/07/2021 07:01:54 - INFO - __main__ - Step 69715: {'lr': 0.00028325929645977086, 'samples': 13385280, 'steps': 69714, 'loss/train': 1.3985674381256104} 11/07/2021 07:01:55 - INFO - __main__ - Step 69716: {'lr': 0.00028325403687933274, 'samples': 13385472, 'steps': 69715, 'loss/train': 1.066099762916565} 11/07/2021 07:01:56 - INFO - __main__ - Step 69717: {'lr': 0.00028324877728391095, 'samples': 13385664, 'steps': 69716, 'loss/train': 1.5856654644012451} 11/07/2021 07:01:56 - INFO - __main__ - Step 69718: {'lr': 0.00028324351767350776, 'samples': 13385856, 'steps': 69717, 'loss/train': 0.880120575428009} 11/07/2021 07:01:56 - INFO - __main__ - Step 69719: {'lr': 0.00028323825804812557, 'samples': 13386048, 'steps': 69718, 'loss/train': 1.7005553245544434} 11/07/2021 07:01:57 - INFO - __main__ - Step 69720: {'lr': 0.00028323299840776674, 'samples': 13386240, 'steps': 69719, 'loss/train': 1.5140269994735718} 11/07/2021 07:01:58 - INFO - __main__ - Step 69721: {'lr': 0.00028322773875243357, 'samples': 13386432, 'steps': 69720, 'loss/train': 1.7142561674118042} 11/07/2021 07:01:58 - INFO - __main__ - Step 69722: {'lr': 0.0002832224790821285, 'samples': 13386624, 'steps': 69721, 'loss/train': 1.3578474521636963} 11/07/2021 07:01:58 - INFO - __main__ - Step 69723: {'lr': 0.000283217219396854, 'samples': 13386816, 'steps': 69722, 'loss/train': 1.9071615934371948} 11/07/2021 07:01:59 - INFO - __main__ - Step 69724: {'lr': 0.0002832119596966122, 'samples': 13387008, 'steps': 69723, 'loss/train': 1.665316104888916} 11/07/2021 07:01:59 - INFO - __main__ - Step 69725: {'lr': 0.0002832066999814056, 'samples': 13387200, 'steps': 69724, 'loss/train': 1.2593512535095215} 11/07/2021 07:02:00 - INFO - __main__ - Step 69726: {'lr': 0.00028320144025123674, 'samples': 13387392, 'steps': 69725, 'loss/train': 1.1850941181182861} 11/07/2021 07:02:01 - INFO - __main__ - Step 69727: {'lr': 0.00028319618050610766, 'samples': 13387584, 'steps': 69726, 'loss/train': 1.132413387298584} 11/07/2021 07:02:01 - INFO - __main__ - Step 69728: {'lr': 0.000283190920746021, 'samples': 13387776, 'steps': 69727, 'loss/train': 1.36204993724823} 11/07/2021 07:02:01 - INFO - __main__ - Step 69729: {'lr': 0.00028318566097097894, 'samples': 13387968, 'steps': 69728, 'loss/train': 1.5369501113891602} 11/07/2021 07:02:02 - INFO - __main__ - Step 69730: {'lr': 0.00028318040118098395, 'samples': 13388160, 'steps': 69729, 'loss/train': 1.6734263896942139} 11/07/2021 07:02:02 - INFO - __main__ - Step 69731: {'lr': 0.0002831751413760384, 'samples': 13388352, 'steps': 69730, 'loss/train': 1.262420654296875} 11/07/2021 07:02:03 - INFO - __main__ - Step 69732: {'lr': 0.0002831698815561446, 'samples': 13388544, 'steps': 69731, 'loss/train': 1.5938678979873657} 11/07/2021 07:02:03 - INFO - __main__ - Step 69733: {'lr': 0.0002831646217213051, 'samples': 13388736, 'steps': 69732, 'loss/train': 1.3369636535644531} 11/07/2021 07:02:04 - INFO - __main__ - Step 69734: {'lr': 0.000283159361871522, 'samples': 13388928, 'steps': 69733, 'loss/train': 1.2501462697982788} 11/07/2021 07:02:04 - INFO - __main__ - Step 69735: {'lr': 0.0002831541020067978, 'samples': 13389120, 'steps': 69734, 'loss/train': 1.537755012512207} 11/07/2021 07:02:04 - INFO - __main__ - Step 69736: {'lr': 0.00028314884212713495, 'samples': 13389312, 'steps': 69735, 'loss/train': 1.4725497961044312} 11/07/2021 07:02:05 - INFO - __main__ - Step 69737: {'lr': 0.00028314358223253564, 'samples': 13389504, 'steps': 69736, 'loss/train': 1.7943166494369507} 11/07/2021 07:02:06 - INFO - __main__ - Step 69738: {'lr': 0.0002831383223230025, 'samples': 13389696, 'steps': 69737, 'loss/train': 1.2248584032058716} 11/07/2021 07:02:06 - INFO - __main__ - Step 69739: {'lr': 0.0002831330623985376, 'samples': 13389888, 'steps': 69738, 'loss/train': 1.564974308013916} 11/07/2021 07:02:06 - INFO - __main__ - Step 69740: {'lr': 0.00028312780245914356, 'samples': 13390080, 'steps': 69739, 'loss/train': 1.3318883180618286} 11/07/2021 07:02:07 - INFO - __main__ - Step 69741: {'lr': 0.00028312254250482255, 'samples': 13390272, 'steps': 69740, 'loss/train': 1.9062494039535522} 11/07/2021 07:02:08 - INFO - __main__ - Step 69742: {'lr': 0.0002831172825355771, 'samples': 13390464, 'steps': 69741, 'loss/train': 1.5980726480484009} 11/07/2021 07:02:08 - INFO - __main__ - Step 69743: {'lr': 0.00028311202255140944, 'samples': 13390656, 'steps': 69742, 'loss/train': 1.3678659200668335} 11/07/2021 07:02:09 - INFO - __main__ - Step 69744: {'lr': 0.0002831067625523221, 'samples': 13390848, 'steps': 69743, 'loss/train': 0.8537619709968567} 11/07/2021 07:02:09 - INFO - __main__ - Step 69745: {'lr': 0.0002831015025383173, 'samples': 13391040, 'steps': 69744, 'loss/train': 1.7528706789016724} 11/07/2021 07:02:09 - INFO - __main__ - Step 69746: {'lr': 0.00028309624250939753, 'samples': 13391232, 'steps': 69745, 'loss/train': 1.5531495809555054} 11/07/2021 07:02:10 - INFO - __main__ - Step 69747: {'lr': 0.00028309098246556507, 'samples': 13391424, 'steps': 69746, 'loss/train': 2.1459012031555176} 11/07/2021 07:02:11 - INFO - __main__ - Step 69748: {'lr': 0.00028308572240682233, 'samples': 13391616, 'steps': 69747, 'loss/train': 1.7613551616668701} 11/07/2021 07:02:11 - INFO - __main__ - Step 69749: {'lr': 0.00028308046233317165, 'samples': 13391808, 'steps': 69748, 'loss/train': 1.5925954580307007} 11/07/2021 07:02:11 - INFO - __main__ - Step 69750: {'lr': 0.00028307520224461546, 'samples': 13392000, 'steps': 69749, 'loss/train': 1.5023645162582397} 11/07/2021 07:02:12 - INFO - __main__ - Step 69751: {'lr': 0.00028306994214115605, 'samples': 13392192, 'steps': 69750, 'loss/train': 0.6899903416633606} 11/07/2021 07:02:13 - INFO - __main__ - Step 69752: {'lr': 0.00028306468202279585, 'samples': 13392384, 'steps': 69751, 'loss/train': 1.1952555179595947} 11/07/2021 07:02:13 - INFO - __main__ - Step 69753: {'lr': 0.00028305942188953725, 'samples': 13392576, 'steps': 69752, 'loss/train': 1.5418429374694824} 11/07/2021 07:02:13 - INFO - __main__ - Step 69754: {'lr': 0.0002830541617413826, 'samples': 13392768, 'steps': 69753, 'loss/train': 1.3444637060165405} 11/07/2021 07:02:14 - INFO - __main__ - Step 69755: {'lr': 0.00028304890157833417, 'samples': 13392960, 'steps': 69754, 'loss/train': 1.3581031560897827} 11/07/2021 07:02:14 - INFO - __main__ - Step 69756: {'lr': 0.0002830436414003945, 'samples': 13393152, 'steps': 69755, 'loss/train': 1.3761928081512451} 11/07/2021 07:02:15 - INFO - __main__ - Step 69757: {'lr': 0.00028303838120756584, 'samples': 13393344, 'steps': 69756, 'loss/train': 1.1019593477249146} 11/07/2021 07:02:15 - INFO - __main__ - Step 69758: {'lr': 0.0002830331209998506, 'samples': 13393536, 'steps': 69757, 'loss/train': 1.1627528667449951} 11/07/2021 07:02:16 - INFO - __main__ - Step 69759: {'lr': 0.0002830278607772511, 'samples': 13393728, 'steps': 69758, 'loss/train': 1.607186198234558} 11/07/2021 07:02:16 - INFO - __main__ - Step 69760: {'lr': 0.0002830226005397698, 'samples': 13393920, 'steps': 69759, 'loss/train': 1.5068436861038208} 11/07/2021 07:02:17 - INFO - __main__ - Step 69761: {'lr': 0.00028301734028740903, 'samples': 13394112, 'steps': 69760, 'loss/train': 0.8519095182418823} 11/07/2021 07:02:17 - INFO - __main__ - Step 69762: {'lr': 0.0002830120800201712, 'samples': 13394304, 'steps': 69761, 'loss/train': 1.0333678722381592} 11/07/2021 07:02:18 - INFO - __main__ - Step 69763: {'lr': 0.00028300681973805855, 'samples': 13394496, 'steps': 69762, 'loss/train': 1.7830528020858765} 11/07/2021 07:02:18 - INFO - __main__ - Step 69764: {'lr': 0.0002830015594410736, 'samples': 13394688, 'steps': 69763, 'loss/train': 1.542104959487915} 11/07/2021 07:02:19 - INFO - __main__ - Step 69765: {'lr': 0.0002829962991292186, 'samples': 13394880, 'steps': 69764, 'loss/train': 1.8109781742095947} 11/07/2021 07:02:19 - INFO - __main__ - Step 69766: {'lr': 0.000282991038802496, 'samples': 13395072, 'steps': 69765, 'loss/train': 1.138613224029541} 11/07/2021 07:02:19 - INFO - __main__ - Step 69767: {'lr': 0.0002829857784609081, 'samples': 13395264, 'steps': 69766, 'loss/train': 1.371751308441162} 11/07/2021 07:02:21 - INFO - __main__ - Step 69768: {'lr': 0.0002829805181044574, 'samples': 13395456, 'steps': 69767, 'loss/train': 1.6778678894042969} 11/07/2021 07:02:21 - INFO - __main__ - Step 69769: {'lr': 0.0002829752577331462, 'samples': 13395648, 'steps': 69768, 'loss/train': 1.077284812927246} 11/07/2021 07:02:21 - INFO - __main__ - Step 69770: {'lr': 0.0002829699973469768, 'samples': 13395840, 'steps': 69769, 'loss/train': 1.6499450206756592} 11/07/2021 07:02:22 - INFO - __main__ - Step 69771: {'lr': 0.0002829647369459516, 'samples': 13396032, 'steps': 69770, 'loss/train': 1.39198899269104} 11/07/2021 07:02:22 - INFO - __main__ - Step 69772: {'lr': 0.00028295947653007305, 'samples': 13396224, 'steps': 69771, 'loss/train': 1.5466374158859253} 11/07/2021 07:02:22 - INFO - __main__ - Step 69773: {'lr': 0.00028295421609934347, 'samples': 13396416, 'steps': 69772, 'loss/train': 1.762952208518982} 11/07/2021 07:02:23 - INFO - __main__ - Step 69774: {'lr': 0.00028294895565376515, 'samples': 13396608, 'steps': 69773, 'loss/train': 2.048670530319214} 11/07/2021 07:02:24 - INFO - __main__ - Step 69775: {'lr': 0.0002829436951933407, 'samples': 13396800, 'steps': 69774, 'loss/train': 1.6640267372131348} 11/07/2021 07:02:24 - INFO - __main__ - Step 69776: {'lr': 0.00028293843471807224, 'samples': 13396992, 'steps': 69775, 'loss/train': 1.3654206991195679} 11/07/2021 07:02:25 - INFO - __main__ - Step 69777: {'lr': 0.00028293317422796216, 'samples': 13397184, 'steps': 69776, 'loss/train': 1.4607231616973877} 11/07/2021 07:02:25 - INFO - __main__ - Step 69778: {'lr': 0.000282927913723013, 'samples': 13397376, 'steps': 69777, 'loss/train': 1.647329568862915} 11/07/2021 07:02:26 - INFO - __main__ - Step 69779: {'lr': 0.000282922653203227, 'samples': 13397568, 'steps': 69778, 'loss/train': 0.732682466506958} 11/07/2021 07:02:26 - INFO - __main__ - Step 69780: {'lr': 0.00028291739266860655, 'samples': 13397760, 'steps': 69779, 'loss/train': 1.2173486948013306} 11/07/2021 07:02:27 - INFO - __main__ - Step 69781: {'lr': 0.000282912132119154, 'samples': 13397952, 'steps': 69780, 'loss/train': 1.468239426612854} 11/07/2021 07:02:27 - INFO - __main__ - Step 69782: {'lr': 0.0002829068715548718, 'samples': 13398144, 'steps': 69781, 'loss/train': 1.3777908086776733} 11/07/2021 07:02:27 - INFO - __main__ - Step 69783: {'lr': 0.0002829016109757623, 'samples': 13398336, 'steps': 69782, 'loss/train': 1.726395606994629} 11/07/2021 07:02:28 - INFO - __main__ - Step 69784: {'lr': 0.00028289635038182776, 'samples': 13398528, 'steps': 69783, 'loss/train': 1.3033732175827026} 11/07/2021 07:02:29 - INFO - __main__ - Step 69785: {'lr': 0.00028289108977307066, 'samples': 13398720, 'steps': 69784, 'loss/train': 1.7909679412841797} 11/07/2021 07:02:29 - INFO - __main__ - Step 69786: {'lr': 0.00028288582914949334, 'samples': 13398912, 'steps': 69785, 'loss/train': 1.168148159980774} 11/07/2021 07:02:29 - INFO - __main__ - Step 69787: {'lr': 0.0002828805685110982, 'samples': 13399104, 'steps': 69786, 'loss/train': 1.1381988525390625} 11/07/2021 07:02:30 - INFO - __main__ - Step 69788: {'lr': 0.00028287530785788754, 'samples': 13399296, 'steps': 69787, 'loss/train': 1.4175597429275513} 11/07/2021 07:02:30 - INFO - __main__ - Step 69789: {'lr': 0.00028287004718986384, 'samples': 13399488, 'steps': 69788, 'loss/train': 1.5167067050933838} 11/07/2021 07:02:31 - INFO - __main__ - Step 69790: {'lr': 0.00028286478650702934, 'samples': 13399680, 'steps': 69789, 'loss/train': 1.2538902759552002} 11/07/2021 07:02:32 - INFO - __main__ - Step 69791: {'lr': 0.00028285952580938653, 'samples': 13399872, 'steps': 69790, 'loss/train': 1.1983437538146973} 11/07/2021 07:02:32 - INFO - __main__ - Step 69792: {'lr': 0.0002828542650969377, 'samples': 13400064, 'steps': 69791, 'loss/train': 0.98863685131073} 11/07/2021 07:02:32 - INFO - __main__ - Step 69793: {'lr': 0.0002828490043696852, 'samples': 13400256, 'steps': 69792, 'loss/train': 1.5871353149414062} 11/07/2021 07:02:33 - INFO - __main__ - Step 69794: {'lr': 0.00028284374362763155, 'samples': 13400448, 'steps': 69793, 'loss/train': 1.3360881805419922} 11/07/2021 07:02:34 - INFO - __main__ - Step 69795: {'lr': 0.0002828384828707789, 'samples': 13400640, 'steps': 69794, 'loss/train': 1.2965446710586548} 11/07/2021 07:02:34 - INFO - __main__ - Step 69796: {'lr': 0.0002828332220991298, 'samples': 13400832, 'steps': 69795, 'loss/train': 0.6674795746803284} 11/07/2021 07:02:34 - INFO - __main__ - Step 69797: {'lr': 0.0002828279613126865, 'samples': 13401024, 'steps': 69796, 'loss/train': 1.6610136032104492} 11/07/2021 07:02:35 - INFO - __main__ - Step 69798: {'lr': 0.0002828227005114515, 'samples': 13401216, 'steps': 69797, 'loss/train': 1.8865190744400024} 11/07/2021 07:02:35 - INFO - __main__ - Step 69799: {'lr': 0.0002828174396954271, 'samples': 13401408, 'steps': 69798, 'loss/train': 1.6148277521133423} 11/07/2021 07:02:36 - INFO - __main__ - Step 69800: {'lr': 0.0002828121788646156, 'samples': 13401600, 'steps': 69799, 'loss/train': 1.7549197673797607} 11/07/2021 07:02:37 - INFO - __main__ - Step 69801: {'lr': 0.00028280691801901956, 'samples': 13401792, 'steps': 69800, 'loss/train': 0.9679011106491089} 11/07/2021 07:02:37 - INFO - __main__ - Step 69802: {'lr': 0.0002828016571586411, 'samples': 13401984, 'steps': 69801, 'loss/train': 1.2640646696090698} 11/07/2021 07:02:37 - INFO - __main__ - Step 69803: {'lr': 0.00028279639628348273, 'samples': 13402176, 'steps': 69802, 'loss/train': 1.7432588338851929} 11/07/2021 07:02:38 - INFO - __main__ - Step 69804: {'lr': 0.00028279113539354686, 'samples': 13402368, 'steps': 69803, 'loss/train': 0.8704463839530945} 11/07/2021 07:02:39 - INFO - __main__ - Step 69805: {'lr': 0.00028278587448883575, 'samples': 13402560, 'steps': 69804, 'loss/train': 1.0225379467010498} 11/07/2021 07:02:39 - INFO - __main__ - Step 69806: {'lr': 0.0002827806135693519, 'samples': 13402752, 'steps': 69805, 'loss/train': 1.7026162147521973} 11/07/2021 07:02:39 - INFO - __main__ - Step 69807: {'lr': 0.00028277535263509764, 'samples': 13402944, 'steps': 69806, 'loss/train': 1.2980328798294067} 11/07/2021 07:02:40 - INFO - __main__ - Step 69808: {'lr': 0.00028277009168607524, 'samples': 13403136, 'steps': 69807, 'loss/train': 0.5572847723960876} 11/07/2021 07:02:40 - INFO - __main__ - Step 69809: {'lr': 0.00028276483072228715, 'samples': 13403328, 'steps': 69808, 'loss/train': 1.3855544328689575} 11/07/2021 07:02:41 - INFO - __main__ - Step 69810: {'lr': 0.00028275956974373575, 'samples': 13403520, 'steps': 69809, 'loss/train': 1.5266567468643188} 11/07/2021 07:02:42 - INFO - __main__ - Step 69811: {'lr': 0.00028275430875042336, 'samples': 13403712, 'steps': 69810, 'loss/train': 1.421303153038025} 11/07/2021 07:02:42 - INFO - __main__ - Step 69812: {'lr': 0.00028274904774235244, 'samples': 13403904, 'steps': 69811, 'loss/train': 1.4184916019439697} 11/07/2021 07:02:42 - INFO - __main__ - Step 69813: {'lr': 0.0002827437867195252, 'samples': 13404096, 'steps': 69812, 'loss/train': 1.315234899520874} 11/07/2021 07:02:43 - INFO - __main__ - Step 69814: {'lr': 0.00028273852568194425, 'samples': 13404288, 'steps': 69813, 'loss/train': 1.4644705057144165} 11/07/2021 07:02:43 - INFO - __main__ - Step 69815: {'lr': 0.0002827332646296118, 'samples': 13404480, 'steps': 69814, 'loss/train': 1.471459984779358} 11/07/2021 07:02:44 - INFO - __main__ - Step 69816: {'lr': 0.0002827280035625302, 'samples': 13404672, 'steps': 69815, 'loss/train': 1.765279769897461} 11/07/2021 07:02:44 - INFO - __main__ - Step 69817: {'lr': 0.0002827227424807018, 'samples': 13404864, 'steps': 69816, 'loss/train': 1.2458817958831787} 11/07/2021 07:02:45 - INFO - __main__ - Step 69818: {'lr': 0.00028271748138412916, 'samples': 13405056, 'steps': 69817, 'loss/train': 1.2177259922027588} 11/07/2021 07:02:45 - INFO - __main__ - Step 69819: {'lr': 0.0002827122202728145, 'samples': 13405248, 'steps': 69818, 'loss/train': 1.1954164505004883} 11/07/2021 07:02:45 - INFO - __main__ - Step 69820: {'lr': 0.00028270695914676025, 'samples': 13405440, 'steps': 69819, 'loss/train': 1.6178313493728638} 11/07/2021 07:02:47 - INFO - __main__ - Step 69821: {'lr': 0.0002827016980059687, 'samples': 13405632, 'steps': 69820, 'loss/train': 1.4801957607269287} 11/07/2021 07:02:47 - INFO - __main__ - Step 69822: {'lr': 0.0002826964368504422, 'samples': 13405824, 'steps': 69821, 'loss/train': 1.6212036609649658} 11/07/2021 07:02:47 - INFO - __main__ - Step 69823: {'lr': 0.0002826911756801833, 'samples': 13406016, 'steps': 69822, 'loss/train': 1.7165981531143188} 11/07/2021 07:02:48 - INFO - __main__ - Step 69824: {'lr': 0.0002826859144951942, 'samples': 13406208, 'steps': 69823, 'loss/train': 1.2690798044204712} 11/07/2021 07:02:48 - INFO - __main__ - Step 69825: {'lr': 0.00028268065329547734, 'samples': 13406400, 'steps': 69824, 'loss/train': 1.3294720649719238} 11/07/2021 07:02:49 - INFO - __main__ - Step 69826: {'lr': 0.0002826753920810351, 'samples': 13406592, 'steps': 69825, 'loss/train': 1.4494513273239136} 11/07/2021 07:02:49 - INFO - __main__ - Step 69827: {'lr': 0.00028267013085186987, 'samples': 13406784, 'steps': 69826, 'loss/train': 0.7760505676269531} 11/07/2021 07:02:50 - INFO - __main__ - Step 69828: {'lr': 0.00028266486960798395, 'samples': 13406976, 'steps': 69827, 'loss/train': 2.1451005935668945} 11/07/2021 07:02:50 - INFO - __main__ - Step 69829: {'lr': 0.0002826596083493797, 'samples': 13407168, 'steps': 69828, 'loss/train': 1.2539608478546143} 11/07/2021 07:02:50 - INFO - __main__ - Step 69830: {'lr': 0.0002826543470760596, 'samples': 13407360, 'steps': 69829, 'loss/train': 0.989769697189331} 11/07/2021 07:02:51 - INFO - __main__ - Step 69831: {'lr': 0.0002826490857880259, 'samples': 13407552, 'steps': 69830, 'loss/train': 1.2131068706512451} 11/07/2021 07:02:52 - INFO - __main__ - Step 69832: {'lr': 0.00028264382448528106, 'samples': 13407744, 'steps': 69831, 'loss/train': 1.4786767959594727} 11/07/2021 07:02:52 - INFO - __main__ - Step 69833: {'lr': 0.00028263856316782735, 'samples': 13407936, 'steps': 69832, 'loss/train': 1.5443971157073975} 11/07/2021 07:02:52 - INFO - __main__ - Step 69834: {'lr': 0.0002826333018356673, 'samples': 13408128, 'steps': 69833, 'loss/train': 1.4976974725723267} 11/07/2021 07:02:53 - INFO - __main__ - Step 69835: {'lr': 0.00028262804048880317, 'samples': 13408320, 'steps': 69834, 'loss/train': 1.3311767578125} 11/07/2021 07:02:54 - INFO - __main__ - Step 69836: {'lr': 0.00028262277912723734, 'samples': 13408512, 'steps': 69835, 'loss/train': 1.0408663749694824} 11/07/2021 07:02:54 - INFO - __main__ - Step 69837: {'lr': 0.0002826175177509722, 'samples': 13408704, 'steps': 69836, 'loss/train': 2.0504868030548096} 11/07/2021 07:02:54 - INFO - __main__ - Step 69838: {'lr': 0.00028261225636001005, 'samples': 13408896, 'steps': 69837, 'loss/train': 1.3331451416015625} 11/07/2021 07:02:55 - INFO - __main__ - Step 69839: {'lr': 0.0002826069949543533, 'samples': 13409088, 'steps': 69838, 'loss/train': 1.1478097438812256} 11/07/2021 07:02:55 - INFO - __main__ - Step 69840: {'lr': 0.0002826017335340045, 'samples': 13409280, 'steps': 69839, 'loss/train': 1.184165358543396} 11/07/2021 07:02:56 - INFO - __main__ - Step 69841: {'lr': 0.00028259647209896574, 'samples': 13409472, 'steps': 69840, 'loss/train': 1.3897459506988525} 11/07/2021 07:02:56 - INFO - __main__ - Step 69842: {'lr': 0.00028259121064923954, 'samples': 13409664, 'steps': 69841, 'loss/train': 2.2764079570770264} 11/07/2021 07:02:57 - INFO - __main__ - Step 69843: {'lr': 0.0002825859491848282, 'samples': 13409856, 'steps': 69842, 'loss/train': 1.5526026487350464} 11/07/2021 07:02:57 - INFO - __main__ - Step 69844: {'lr': 0.00028258068770573415, 'samples': 13410048, 'steps': 69843, 'loss/train': 1.0821738243103027} 11/07/2021 07:02:58 - INFO - __main__ - Step 69845: {'lr': 0.00028257542621195974, 'samples': 13410240, 'steps': 69844, 'loss/train': 0.9768862724304199} 11/07/2021 07:02:58 - INFO - __main__ - Step 69846: {'lr': 0.0002825701647035074, 'samples': 13410432, 'steps': 69845, 'loss/train': 1.1735522747039795} 11/07/2021 07:02:59 - INFO - __main__ - Step 69847: {'lr': 0.00028256490318037946, 'samples': 13410624, 'steps': 69846, 'loss/train': 2.3598315715789795} 11/07/2021 07:02:59 - INFO - __main__ - Step 69848: {'lr': 0.00028255964164257825, 'samples': 13410816, 'steps': 69847, 'loss/train': 1.1209051609039307} 11/07/2021 07:03:00 - INFO - __main__ - Step 69849: {'lr': 0.00028255438009010616, 'samples': 13411008, 'steps': 69848, 'loss/train': 0.814133882522583} 11/07/2021 07:03:00 - INFO - __main__ - Step 69850: {'lr': 0.0002825491185229655, 'samples': 13411200, 'steps': 69849, 'loss/train': 1.6339199542999268} 11/07/2021 07:03:00 - INFO - __main__ - Step 69851: {'lr': 0.00028254385694115883, 'samples': 13411392, 'steps': 69850, 'loss/train': 0.8603284955024719} 11/07/2021 07:03:01 - INFO - __main__ - Step 69852: {'lr': 0.0002825385953446883, 'samples': 13411584, 'steps': 69851, 'loss/train': 1.1572558879852295} 11/07/2021 07:03:02 - INFO - __main__ - Step 69853: {'lr': 0.0002825333337335564, 'samples': 13411776, 'steps': 69852, 'loss/train': 1.4186362028121948} 11/07/2021 07:03:02 - INFO - __main__ - Step 69854: {'lr': 0.00028252807210776555, 'samples': 13411968, 'steps': 69853, 'loss/train': 0.9275687336921692} 11/07/2021 07:03:02 - INFO - __main__ - Step 69855: {'lr': 0.000282522810467318, 'samples': 13412160, 'steps': 69854, 'loss/train': 1.7185869216918945} 11/07/2021 07:03:03 - INFO - __main__ - Step 69856: {'lr': 0.0002825175488122162, 'samples': 13412352, 'steps': 69855, 'loss/train': 1.3034645318984985} 11/07/2021 07:03:04 - INFO - __main__ - Step 69857: {'lr': 0.0002825122871424625, 'samples': 13412544, 'steps': 69856, 'loss/train': 1.3431260585784912} 11/07/2021 07:03:04 - INFO - __main__ - Step 69858: {'lr': 0.0002825070254580592, 'samples': 13412736, 'steps': 69857, 'loss/train': 1.22487211227417} 11/07/2021 07:03:04 - INFO - __main__ - Step 69859: {'lr': 0.0002825017637590088, 'samples': 13412928, 'steps': 69858, 'loss/train': 1.5800387859344482} 11/07/2021 07:03:05 - INFO - __main__ - Step 69860: {'lr': 0.0002824965020453135, 'samples': 13413120, 'steps': 69859, 'loss/train': 1.3959628343582153} 11/07/2021 07:03:05 - INFO - __main__ - Step 69861: {'lr': 0.0002824912403169759, 'samples': 13413312, 'steps': 69860, 'loss/train': 1.527927279472351} 11/07/2021 07:03:06 - INFO - __main__ - Step 69862: {'lr': 0.0002824859785739982, 'samples': 13413504, 'steps': 69861, 'loss/train': 1.3954154253005981} 11/07/2021 07:03:07 - INFO - __main__ - Step 69863: {'lr': 0.0002824807168163829, 'samples': 13413696, 'steps': 69862, 'loss/train': 1.9153192043304443} 11/07/2021 07:03:07 - INFO - __main__ - Step 69864: {'lr': 0.00028247545504413217, 'samples': 13413888, 'steps': 69863, 'loss/train': 1.6992253065109253} 11/07/2021 07:03:07 - INFO - __main__ - Step 69865: {'lr': 0.0002824701932572485, 'samples': 13414080, 'steps': 69864, 'loss/train': 1.1399428844451904} 11/07/2021 07:03:08 - INFO - __main__ - Step 69866: {'lr': 0.00028246493145573433, 'samples': 13414272, 'steps': 69865, 'loss/train': 1.5830141305923462} 11/07/2021 07:03:09 - INFO - __main__ - Step 69867: {'lr': 0.00028245966963959203, 'samples': 13414464, 'steps': 69866, 'loss/train': 1.7029238939285278} 11/07/2021 07:03:09 - INFO - __main__ - Step 69868: {'lr': 0.00028245440780882373, 'samples': 13414656, 'steps': 69867, 'loss/train': 1.8786396980285645} 11/07/2021 07:03:09 - INFO - __main__ - Step 69869: {'lr': 0.0002824491459634321, 'samples': 13414848, 'steps': 69868, 'loss/train': 1.7029918432235718} 11/07/2021 07:03:10 - INFO - __main__ - Step 69870: {'lr': 0.0002824438841034194, 'samples': 13415040, 'steps': 69869, 'loss/train': 1.6893694400787354} 11/07/2021 07:03:10 - INFO - __main__ - Step 69871: {'lr': 0.00028243862222878784, 'samples': 13415232, 'steps': 69870, 'loss/train': 1.1757209300994873} 11/07/2021 07:03:11 - INFO - __main__ - Step 69872: {'lr': 0.00028243336033954, 'samples': 13415424, 'steps': 69871, 'loss/train': 1.2649699449539185} 11/07/2021 07:03:11 - INFO - __main__ - Step 69873: {'lr': 0.00028242809843567827, 'samples': 13415616, 'steps': 69872, 'loss/train': 1.1464052200317383} 11/07/2021 07:03:12 - INFO - __main__ - Step 69874: {'lr': 0.0002824228365172049, 'samples': 13415808, 'steps': 69873, 'loss/train': 0.9279029369354248} 11/07/2021 07:03:12 - INFO - __main__ - Step 69875: {'lr': 0.00028241757458412234, 'samples': 13416000, 'steps': 69874, 'loss/train': 0.9786774516105652} 11/07/2021 07:03:12 - INFO - __main__ - Step 69876: {'lr': 0.00028241231263643286, 'samples': 13416192, 'steps': 69875, 'loss/train': 1.8149336576461792} 11/07/2021 07:03:13 - INFO - __main__ - Step 69877: {'lr': 0.00028240705067413886, 'samples': 13416384, 'steps': 69876, 'loss/train': 0.9897128939628601} 11/07/2021 07:03:14 - INFO - __main__ - Step 69878: {'lr': 0.0002824017886972428, 'samples': 13416576, 'steps': 69877, 'loss/train': 1.2787513732910156} 11/07/2021 07:03:14 - INFO - __main__ - Step 69879: {'lr': 0.00028239652670574697, 'samples': 13416768, 'steps': 69878, 'loss/train': 1.8864834308624268} 11/07/2021 07:03:15 - INFO - __main__ - Step 69880: {'lr': 0.00028239126469965374, 'samples': 13416960, 'steps': 69879, 'loss/train': 1.8520394563674927} 11/07/2021 07:03:15 - INFO - __main__ - Step 69881: {'lr': 0.00028238600267896564, 'samples': 13417152, 'steps': 69880, 'loss/train': 1.0644419193267822} 11/07/2021 07:03:15 - INFO - __main__ - Step 69882: {'lr': 0.00028238074064368477, 'samples': 13417344, 'steps': 69881, 'loss/train': 1.4619807004928589} 11/07/2021 07:03:16 - INFO - __main__ - Step 69883: {'lr': 0.0002823754785938137, 'samples': 13417536, 'steps': 69882, 'loss/train': 0.9953399300575256} 11/07/2021 07:03:17 - INFO - __main__ - Step 69884: {'lr': 0.00028237021652935466, 'samples': 13417728, 'steps': 69883, 'loss/train': 1.7839189767837524} 11/07/2021 07:03:17 - INFO - __main__ - Step 69885: {'lr': 0.0002823649544503102, 'samples': 13417920, 'steps': 69884, 'loss/train': 1.2013607025146484} 11/07/2021 07:03:17 - INFO - __main__ - Step 69886: {'lr': 0.0002823596923566825, 'samples': 13418112, 'steps': 69885, 'loss/train': 2.151535749435425} 11/07/2021 07:03:18 - INFO - __main__ - Step 69887: {'lr': 0.0002823544302484741, 'samples': 13418304, 'steps': 69886, 'loss/train': 0.7603404521942139} 11/07/2021 07:03:19 - INFO - __main__ - Step 69888: {'lr': 0.0002823491681256873, 'samples': 13418496, 'steps': 69887, 'loss/train': 1.5891913175582886} 11/07/2021 07:03:19 - INFO - __main__ - Step 69889: {'lr': 0.0002823439059883244, 'samples': 13418688, 'steps': 69888, 'loss/train': 1.4614698886871338} 11/07/2021 07:03:20 - INFO - __main__ - Step 69890: {'lr': 0.00028233864383638783, 'samples': 13418880, 'steps': 69889, 'loss/train': 0.28020012378692627} 11/07/2021 07:03:20 - INFO - __main__ - Step 69891: {'lr': 0.00028233338166988, 'samples': 13419072, 'steps': 69890, 'loss/train': 1.3334178924560547} 11/07/2021 07:03:20 - INFO - __main__ - Step 69892: {'lr': 0.0002823281194888032, 'samples': 13419264, 'steps': 69891, 'loss/train': 1.4397729635238647} 11/07/2021 07:03:21 - INFO - __main__ - Step 69893: {'lr': 0.00028232285729315996, 'samples': 13419456, 'steps': 69892, 'loss/train': 1.692454218864441} 11/07/2021 07:03:22 - INFO - __main__ - Step 69894: {'lr': 0.00028231759508295245, 'samples': 13419648, 'steps': 69893, 'loss/train': 1.4821746349334717} 11/07/2021 07:03:22 - INFO - __main__ - Step 69895: {'lr': 0.00028231233285818313, 'samples': 13419840, 'steps': 69894, 'loss/train': 1.5744653940200806} 11/07/2021 07:03:22 - INFO - __main__ - Step 69896: {'lr': 0.0002823070706188544, 'samples': 13420032, 'steps': 69895, 'loss/train': 1.6890723705291748} 11/07/2021 07:03:23 - INFO - __main__ - Step 69897: {'lr': 0.0002823018083649686, 'samples': 13420224, 'steps': 69896, 'loss/train': 1.2592180967330933} 11/07/2021 07:03:24 - INFO - __main__ - Step 69898: {'lr': 0.00028229654609652816, 'samples': 13420416, 'steps': 69897, 'loss/train': 1.5103988647460938} 11/07/2021 07:03:24 - INFO - __main__ - Step 69899: {'lr': 0.0002822912838135353, 'samples': 13420608, 'steps': 69898, 'loss/train': 1.3689414262771606} 11/07/2021 07:03:24 - INFO - __main__ - Step 69900: {'lr': 0.0002822860215159925, 'samples': 13420800, 'steps': 69899, 'loss/train': 1.5509780645370483} 11/07/2021 07:03:25 - INFO - __main__ - Step 69901: {'lr': 0.00028228075920390215, 'samples': 13420992, 'steps': 69900, 'loss/train': 2.217252492904663} 11/07/2021 07:03:25 - INFO - __main__ - Step 69902: {'lr': 0.00028227549687726656, 'samples': 13421184, 'steps': 69901, 'loss/train': 1.3784031867980957} 11/07/2021 07:03:25 - INFO - __main__ - Step 69903: {'lr': 0.00028227023453608813, 'samples': 13421376, 'steps': 69902, 'loss/train': 0.9511266350746155} 11/07/2021 07:03:27 - INFO - __main__ - Step 69904: {'lr': 0.0002822649721803693, 'samples': 13421568, 'steps': 69903, 'loss/train': 1.2863913774490356} 11/07/2021 07:03:27 - INFO - __main__ - Step 69905: {'lr': 0.00028225970981011236, 'samples': 13421760, 'steps': 69904, 'loss/train': 1.8873090744018555} 11/07/2021 07:03:27 - INFO - __main__ - Step 69906: {'lr': 0.00028225444742531957, 'samples': 13421952, 'steps': 69905, 'loss/train': 1.373394250869751} 11/07/2021 07:03:28 - INFO - __main__ - Step 69907: {'lr': 0.0002822491850259935, 'samples': 13422144, 'steps': 69906, 'loss/train': 1.4326156377792358} 11/07/2021 07:03:28 - INFO - __main__ - Step 69908: {'lr': 0.00028224392261213643, 'samples': 13422336, 'steps': 69907, 'loss/train': 1.3921793699264526} 11/07/2021 07:03:29 - INFO - __main__ - Step 69909: {'lr': 0.00028223866018375085, 'samples': 13422528, 'steps': 69908, 'loss/train': 1.4734469652175903} 11/07/2021 07:03:30 - INFO - __main__ - Step 69910: {'lr': 0.0002822333977408389, 'samples': 13422720, 'steps': 69909, 'loss/train': 1.4660813808441162} 11/07/2021 07:03:30 - INFO - __main__ - Step 69911: {'lr': 0.0002822281352834031, 'samples': 13422912, 'steps': 69910, 'loss/train': 1.8473706245422363} 11/07/2021 07:03:30 - INFO - __main__ - Step 69912: {'lr': 0.00028222287281144584, 'samples': 13423104, 'steps': 69911, 'loss/train': 1.7898778915405273} 11/07/2021 07:03:31 - INFO - __main__ - Step 69913: {'lr': 0.0002822176103249694, 'samples': 13423296, 'steps': 69912, 'loss/train': 0.7646274566650391} 11/07/2021 07:03:31 - INFO - __main__ - Step 69914: {'lr': 0.0002822123478239763, 'samples': 13423488, 'steps': 69913, 'loss/train': 1.6253005266189575} 11/07/2021 07:03:32 - INFO - __main__ - Step 69915: {'lr': 0.0002822070853084687, 'samples': 13423680, 'steps': 69914, 'loss/train': 1.5035068988800049} 11/07/2021 07:03:33 - INFO - __main__ - Step 69916: {'lr': 0.00028220182277844915, 'samples': 13423872, 'steps': 69915, 'loss/train': 2.216240406036377} 11/07/2021 07:03:33 - INFO - __main__ - Step 69917: {'lr': 0.00028219656023391993, 'samples': 13424064, 'steps': 69916, 'loss/train': 1.2545379400253296} 11/07/2021 07:03:33 - INFO - __main__ - Step 69918: {'lr': 0.00028219129767488344, 'samples': 13424256, 'steps': 69917, 'loss/train': 1.4628326892852783} 11/07/2021 07:03:34 - INFO - __main__ - Step 69919: {'lr': 0.000282186035101342, 'samples': 13424448, 'steps': 69918, 'loss/train': 1.608262062072754} 11/07/2021 07:03:35 - INFO - __main__ - Step 69920: {'lr': 0.0002821807725132981, 'samples': 13424640, 'steps': 69919, 'loss/train': 1.490914225578308} 11/07/2021 07:03:35 - INFO - __main__ - Step 69921: {'lr': 0.0002821755099107541, 'samples': 13424832, 'steps': 69920, 'loss/train': 1.9919706583023071} 11/07/2021 07:03:35 - INFO - __main__ - Step 69922: {'lr': 0.0002821702472937122, 'samples': 13425024, 'steps': 69921, 'loss/train': 1.578952431678772} 11/07/2021 07:03:36 - INFO - __main__ - Step 69923: {'lr': 0.0002821649846621749, 'samples': 13425216, 'steps': 69922, 'loss/train': 0.979101836681366} 11/07/2021 07:03:36 - INFO - __main__ - Step 69924: {'lr': 0.00028215972201614455, 'samples': 13425408, 'steps': 69923, 'loss/train': 1.5198359489440918} 11/07/2021 07:03:37 - INFO - __main__ - Step 69925: {'lr': 0.0002821544593556235, 'samples': 13425600, 'steps': 69924, 'loss/train': 1.3703346252441406} 11/07/2021 07:03:37 - INFO - __main__ - Step 69926: {'lr': 0.0002821491966806142, 'samples': 13425792, 'steps': 69925, 'loss/train': 1.5762531757354736} 11/07/2021 07:03:38 - INFO - __main__ - Step 69927: {'lr': 0.00028214393399111893, 'samples': 13425984, 'steps': 69926, 'loss/train': 1.4349312782287598} 11/07/2021 07:03:38 - INFO - __main__ - Step 69928: {'lr': 0.0002821386712871402, 'samples': 13426176, 'steps': 69927, 'loss/train': 1.7479784488677979} 11/07/2021 07:03:38 - INFO - __main__ - Step 69929: {'lr': 0.0002821334085686802, 'samples': 13426368, 'steps': 69928, 'loss/train': 1.573347568511963} 11/07/2021 07:03:39 - INFO - __main__ - Step 69930: {'lr': 0.00028212814583574136, 'samples': 13426560, 'steps': 69929, 'loss/train': 1.3165560960769653} 11/07/2021 07:03:40 - INFO - __main__ - Step 69931: {'lr': 0.00028212288308832615, 'samples': 13426752, 'steps': 69930, 'loss/train': 1.418696641921997} 11/07/2021 07:03:40 - INFO - __main__ - Step 69932: {'lr': 0.0002821176203264368, 'samples': 13426944, 'steps': 69931, 'loss/train': 1.67484450340271} 11/07/2021 07:03:40 - INFO - __main__ - Step 69933: {'lr': 0.00028211235755007575, 'samples': 13427136, 'steps': 69932, 'loss/train': 2.055401086807251} 11/07/2021 07:03:41 - INFO - __main__ - Step 69934: {'lr': 0.00028210709475924535, 'samples': 13427328, 'steps': 69933, 'loss/train': 1.4785451889038086} 11/07/2021 07:03:41 - INFO - __main__ - Step 69935: {'lr': 0.00028210183195394805, 'samples': 13427520, 'steps': 69934, 'loss/train': 1.8237885236740112} 11/07/2021 07:03:42 - INFO - __main__ - Step 69936: {'lr': 0.00028209656913418614, 'samples': 13427712, 'steps': 69935, 'loss/train': 1.227683663368225} 11/07/2021 07:03:43 - INFO - __main__ - Step 69937: {'lr': 0.000282091306299962, 'samples': 13427904, 'steps': 69936, 'loss/train': 1.3478010892868042} 11/07/2021 07:03:43 - INFO - __main__ - Step 69938: {'lr': 0.00028208604345127797, 'samples': 13428096, 'steps': 69937, 'loss/train': 1.4306917190551758} 11/07/2021 07:03:43 - INFO - __main__ - Step 69939: {'lr': 0.00028208078058813654, 'samples': 13428288, 'steps': 69938, 'loss/train': 1.4209884405136108} 11/07/2021 07:03:44 - INFO - __main__ - Step 69940: {'lr': 0.00028207551771054, 'samples': 13428480, 'steps': 69939, 'loss/train': 1.5318794250488281} 11/07/2021 07:03:45 - INFO - __main__ - Step 69941: {'lr': 0.0002820702548184907, 'samples': 13428672, 'steps': 69940, 'loss/train': 1.753706693649292} 11/07/2021 07:03:45 - INFO - __main__ - Step 69942: {'lr': 0.000282064991911991, 'samples': 13428864, 'steps': 69941, 'loss/train': 1.3852105140686035} 11/07/2021 07:03:45 - INFO - __main__ - Step 69943: {'lr': 0.0002820597289910434, 'samples': 13429056, 'steps': 69942, 'loss/train': 1.2967956066131592} 11/07/2021 07:03:46 - INFO - __main__ - Step 69944: {'lr': 0.00028205446605565, 'samples': 13429248, 'steps': 69943, 'loss/train': 1.2423663139343262} 11/07/2021 07:03:46 - INFO - __main__ - Step 69945: {'lr': 0.00028204920310581356, 'samples': 13429440, 'steps': 69944, 'loss/train': 1.0805350542068481} 11/07/2021 07:03:47 - INFO - __main__ - Step 69946: {'lr': 0.0002820439401415361, 'samples': 13429632, 'steps': 69945, 'loss/train': 1.356957197189331} 11/07/2021 07:03:47 - INFO - __main__ - Step 69947: {'lr': 0.0002820386771628202, 'samples': 13429824, 'steps': 69946, 'loss/train': 1.2042468786239624} 11/07/2021 07:03:48 - INFO - __main__ - Step 69948: {'lr': 0.00028203341416966824, 'samples': 13430016, 'steps': 69947, 'loss/train': 1.2636767625808716} 11/07/2021 07:03:48 - INFO - __main__ - Step 69949: {'lr': 0.0002820281511620824, 'samples': 13430208, 'steps': 69948, 'loss/train': 1.2179765701293945} 11/07/2021 07:03:48 - INFO - __main__ - Step 69950: {'lr': 0.0002820228881400652, 'samples': 13430400, 'steps': 69949, 'loss/train': 0.8424012064933777} 11/07/2021 07:03:49 - INFO - __main__ - Step 69951: {'lr': 0.000282017625103619, 'samples': 13430592, 'steps': 69950, 'loss/train': 1.6954617500305176} 11/07/2021 07:03:50 - INFO - __main__ - Step 69952: {'lr': 0.0002820123620527462, 'samples': 13430784, 'steps': 69951, 'loss/train': 1.66225004196167} 11/07/2021 07:03:50 - INFO - __main__ - Step 69953: {'lr': 0.000282007098987449, 'samples': 13430976, 'steps': 69952, 'loss/train': 0.9673404693603516} 11/07/2021 07:03:50 - INFO - __main__ - Step 69954: {'lr': 0.00028200183590773, 'samples': 13431168, 'steps': 69953, 'loss/train': 1.2716357707977295} 11/07/2021 07:03:51 - INFO - __main__ - Step 69955: {'lr': 0.00028199657281359144, 'samples': 13431360, 'steps': 69954, 'loss/train': 1.6298788785934448} 11/07/2021 07:03:52 - INFO - __main__ - Step 69956: {'lr': 0.00028199130970503575, 'samples': 13431552, 'steps': 69955, 'loss/train': 1.9077999591827393} 11/07/2021 07:03:52 - INFO - __main__ - Step 69957: {'lr': 0.00028198604658206516, 'samples': 13431744, 'steps': 69956, 'loss/train': 1.0404301881790161} 11/07/2021 07:03:53 - INFO - __main__ - Step 69958: {'lr': 0.0002819807834446822, 'samples': 13431936, 'steps': 69957, 'loss/train': 1.7624942064285278} 11/07/2021 07:03:53 - INFO - __main__ - Step 69959: {'lr': 0.0002819755202928892, 'samples': 13432128, 'steps': 69958, 'loss/train': 1.4160411357879639} 11/07/2021 07:03:53 - INFO - __main__ - Step 69960: {'lr': 0.0002819702571266886, 'samples': 13432320, 'steps': 69959, 'loss/train': 1.2274724245071411} 11/07/2021 07:03:54 - INFO - __main__ - Step 69961: {'lr': 0.0002819649939460826, 'samples': 13432512, 'steps': 69960, 'loss/train': 0.9678301811218262} 11/07/2021 07:03:55 - INFO - __main__ - Step 69962: {'lr': 0.0002819597307510737, 'samples': 13432704, 'steps': 69961, 'loss/train': 0.9544733166694641} 11/07/2021 07:03:55 - INFO - __main__ - Step 69963: {'lr': 0.0002819544675416642, 'samples': 13432896, 'steps': 69962, 'loss/train': 1.1083710193634033} 11/07/2021 07:03:55 - INFO - __main__ - Step 69964: {'lr': 0.0002819492043178566, 'samples': 13433088, 'steps': 69963, 'loss/train': 1.0347108840942383} 11/07/2021 07:03:56 - INFO - __main__ - Step 69965: {'lr': 0.0002819439410796531, 'samples': 13433280, 'steps': 69964, 'loss/train': 1.4657142162322998} 11/07/2021 07:03:56 - INFO - __main__ - Step 69966: {'lr': 0.00028193867782705617, 'samples': 13433472, 'steps': 69965, 'loss/train': 1.1961543560028076} 11/07/2021 07:03:57 - INFO - __main__ - Step 69967: {'lr': 0.0002819334145600682, 'samples': 13433664, 'steps': 69966, 'loss/train': 1.4783226251602173} 11/07/2021 07:03:57 - INFO - __main__ - Step 69968: {'lr': 0.0002819281512786915, 'samples': 13433856, 'steps': 69967, 'loss/train': 1.3113588094711304} 11/07/2021 07:03:58 - INFO - __main__ - Step 69969: {'lr': 0.0002819228879829285, 'samples': 13434048, 'steps': 69968, 'loss/train': 1.7074155807495117} 11/07/2021 07:03:58 - INFO - __main__ - Step 69970: {'lr': 0.00028191762467278146, 'samples': 13434240, 'steps': 69969, 'loss/train': 1.7216426134109497} 11/07/2021 07:03:58 - INFO - __main__ - Step 69971: {'lr': 0.00028191236134825285, 'samples': 13434432, 'steps': 69970, 'loss/train': 1.3566231727600098} 11/07/2021 07:03:59 - INFO - __main__ - Step 69972: {'lr': 0.0002819070980093451, 'samples': 13434624, 'steps': 69971, 'loss/train': 1.5579841136932373} 11/07/2021 07:04:00 - INFO - __main__ - Step 69973: {'lr': 0.0002819018346560604, 'samples': 13434816, 'steps': 69972, 'loss/train': 0.9711529612541199} 11/07/2021 07:04:00 - INFO - __main__ - Step 69974: {'lr': 0.0002818965712884013, 'samples': 13435008, 'steps': 69973, 'loss/train': 0.9961733222007751} 11/07/2021 07:04:00 - INFO - __main__ - Step 69975: {'lr': 0.0002818913079063701, 'samples': 13435200, 'steps': 69974, 'loss/train': 0.9008897542953491} 11/07/2021 07:04:01 - INFO - __main__ - Step 69976: {'lr': 0.00028188604450996913, 'samples': 13435392, 'steps': 69975, 'loss/train': 1.3514758348464966} 11/07/2021 07:04:02 - INFO - __main__ - Step 69977: {'lr': 0.00028188078109920087, 'samples': 13435584, 'steps': 69976, 'loss/train': 1.057550072669983} 11/07/2021 07:04:02 - INFO - __main__ - Step 69978: {'lr': 0.0002818755176740675, 'samples': 13435776, 'steps': 69977, 'loss/train': 1.5660064220428467} 11/07/2021 07:04:02 - INFO - __main__ - Step 69979: {'lr': 0.0002818702542345716, 'samples': 13435968, 'steps': 69978, 'loss/train': 1.019077181816101} 11/07/2021 07:04:03 - INFO - __main__ - Step 69980: {'lr': 0.00028186499078071544, 'samples': 13436160, 'steps': 69979, 'loss/train': 1.0960158109664917} 11/07/2021 07:04:03 - INFO - __main__ - Step 69981: {'lr': 0.0002818597273125014, 'samples': 13436352, 'steps': 69980, 'loss/train': 1.7208386659622192} 11/07/2021 07:04:04 - INFO - __main__ - Step 69982: {'lr': 0.00028185446382993193, 'samples': 13436544, 'steps': 69981, 'loss/train': 1.718794345855713} 11/07/2021 07:04:05 - INFO - __main__ - Step 69983: {'lr': 0.0002818492003330092, 'samples': 13436736, 'steps': 69982, 'loss/train': 1.6090712547302246} 11/07/2021 07:04:05 - INFO - __main__ - Step 69984: {'lr': 0.00028184393682173574, 'samples': 13436928, 'steps': 69983, 'loss/train': 1.7094703912734985} 11/07/2021 07:04:05 - INFO - __main__ - Step 69985: {'lr': 0.000281838673296114, 'samples': 13437120, 'steps': 69984, 'loss/train': 1.7473630905151367} 11/07/2021 07:04:06 - INFO - __main__ - Step 69986: {'lr': 0.0002818334097561461, 'samples': 13437312, 'steps': 69985, 'loss/train': 1.0259631872177124} 11/07/2021 07:04:07 - INFO - __main__ - Step 69987: {'lr': 0.00028182814620183463, 'samples': 13437504, 'steps': 69986, 'loss/train': 1.6230839490890503} 11/07/2021 07:04:07 - INFO - __main__ - Step 69988: {'lr': 0.00028182288263318197, 'samples': 13437696, 'steps': 69987, 'loss/train': 2.101198196411133} 11/07/2021 07:04:08 - INFO - __main__ - Step 69989: {'lr': 0.0002818176190501903, 'samples': 13437888, 'steps': 69988, 'loss/train': 1.099234938621521} 11/07/2021 07:04:08 - INFO - __main__ - Step 69990: {'lr': 0.0002818123554528621, 'samples': 13438080, 'steps': 69989, 'loss/train': 1.522068738937378} 11/07/2021 07:04:08 - INFO - __main__ - Step 69991: {'lr': 0.0002818070918411998, 'samples': 13438272, 'steps': 69990, 'loss/train': 1.4207606315612793} 11/07/2021 07:04:09 - INFO - __main__ - Step 69992: {'lr': 0.00028180182821520565, 'samples': 13438464, 'steps': 69991, 'loss/train': 1.0151127576828003} 11/07/2021 07:04:10 - INFO - __main__ - Step 69993: {'lr': 0.00028179656457488214, 'samples': 13438656, 'steps': 69992, 'loss/train': 1.2911767959594727} 11/07/2021 07:04:10 - INFO - __main__ - Step 69994: {'lr': 0.00028179130092023154, 'samples': 13438848, 'steps': 69993, 'loss/train': 1.3432350158691406} 11/07/2021 07:04:10 - INFO - __main__ - Step 69995: {'lr': 0.0002817860372512564, 'samples': 13439040, 'steps': 69994, 'loss/train': 1.233182430267334} 11/07/2021 07:04:11 - INFO - __main__ - Step 69996: {'lr': 0.00028178077356795885, 'samples': 13439232, 'steps': 69995, 'loss/train': 1.6107655763626099} 11/07/2021 07:04:11 - INFO - __main__ - Step 69997: {'lr': 0.0002817755098703413, 'samples': 13439424, 'steps': 69996, 'loss/train': 1.662895679473877} 11/07/2021 07:04:12 - INFO - __main__ - Step 69998: {'lr': 0.00028177024615840636, 'samples': 13439616, 'steps': 69997, 'loss/train': 1.649576187133789} 11/07/2021 07:04:12 - INFO - __main__ - Step 69999: {'lr': 0.00028176498243215613, 'samples': 13439808, 'steps': 69998, 'loss/train': 2.0694620609283447} 11/07/2021 07:04:13 - INFO - __main__ - Step 70000: {'lr': 0.00028175971869159313, 'samples': 13440000, 'steps': 69999, 'loss/train': 1.0747917890548706} 11/07/2021 07:04:13 - INFO - __main__ - Step 70001: {'lr': 0.0002817544549367197, 'samples': 13440192, 'steps': 70000, 'loss/train': 1.5534011125564575} 11/07/2021 07:04:14 - INFO - __main__ - Step 70002: {'lr': 0.0002817491911675382, 'samples': 13440384, 'steps': 70001, 'loss/train': 1.6216151714324951} 11/07/2021 07:04:15 - INFO - __main__ - Step 70003: {'lr': 0.00028174392738405094, 'samples': 13440576, 'steps': 70002, 'loss/train': 1.3745920658111572} 11/07/2021 07:04:15 - INFO - __main__ - Step 70004: {'lr': 0.00028173866358626045, 'samples': 13440768, 'steps': 70003, 'loss/train': 1.2597569227218628} 11/07/2021 07:04:15 - INFO - __main__ - Step 70005: {'lr': 0.00028173339977416895, 'samples': 13440960, 'steps': 70004, 'loss/train': 1.359643578529358} 11/07/2021 07:04:16 - INFO - __main__ - Step 70006: {'lr': 0.0002817281359477789, 'samples': 13441152, 'steps': 70005, 'loss/train': 1.518772840499878} 11/07/2021 07:04:16 - INFO - __main__ - Step 70007: {'lr': 0.0002817228721070926, 'samples': 13441344, 'steps': 70006, 'loss/train': 1.6748002767562866} 11/07/2021 07:04:17 - INFO - __main__ - Step 70008: {'lr': 0.00028171760825211254, 'samples': 13441536, 'steps': 70007, 'loss/train': 0.40890175104141235} 11/07/2021 07:04:18 - INFO - __main__ - Step 70009: {'lr': 0.0002817123443828409, 'samples': 13441728, 'steps': 70008, 'loss/train': 1.157810926437378} 11/07/2021 07:04:18 - INFO - __main__ - Step 70010: {'lr': 0.0002817070804992803, 'samples': 13441920, 'steps': 70009, 'loss/train': 1.4067635536193848} 11/07/2021 07:04:18 - INFO - __main__ - Step 70011: {'lr': 0.0002817018166014329, 'samples': 13442112, 'steps': 70010, 'loss/train': 1.4998196363449097} 11/07/2021 07:04:19 - INFO - __main__ - Step 70012: {'lr': 0.0002816965526893011, 'samples': 13442304, 'steps': 70011, 'loss/train': 1.8388190269470215} 11/07/2021 07:04:20 - INFO - __main__ - Step 70013: {'lr': 0.0002816912887628874, 'samples': 13442496, 'steps': 70012, 'loss/train': 1.5574853420257568} 11/07/2021 07:04:20 - INFO - __main__ - Step 70014: {'lr': 0.00028168602482219406, 'samples': 13442688, 'steps': 70013, 'loss/train': 1.4235241413116455} 11/07/2021 07:04:20 - INFO - __main__ - Step 70015: {'lr': 0.00028168076086722353, 'samples': 13442880, 'steps': 70014, 'loss/train': 1.1751166582107544} 11/07/2021 07:04:21 - INFO - __main__ - Step 70016: {'lr': 0.0002816754968979781, 'samples': 13443072, 'steps': 70015, 'loss/train': 1.4872796535491943} 11/07/2021 07:04:21 - INFO - __main__ - Step 70017: {'lr': 0.0002816702329144602, 'samples': 13443264, 'steps': 70016, 'loss/train': 1.1145113706588745} 11/07/2021 07:04:21 - INFO - __main__ - Step 70018: {'lr': 0.0002816649689166722, 'samples': 13443456, 'steps': 70017, 'loss/train': 1.366551399230957} 11/07/2021 07:04:22 - INFO - __main__ - Step 70019: {'lr': 0.0002816597049046164, 'samples': 13443648, 'steps': 70018, 'loss/train': 1.7159873247146606} 11/07/2021 07:04:23 - INFO - __main__ - Step 70020: {'lr': 0.00028165444087829524, 'samples': 13443840, 'steps': 70019, 'loss/train': 1.3796002864837646} 11/07/2021 07:04:23 - INFO - __main__ - Step 70021: {'lr': 0.00028164917683771106, 'samples': 13444032, 'steps': 70020, 'loss/train': 1.5536030530929565} 11/07/2021 07:04:23 - INFO - __main__ - Step 70022: {'lr': 0.00028164391278286637, 'samples': 13444224, 'steps': 70021, 'loss/train': 1.5530551671981812} 11/07/2021 07:04:24 - INFO - __main__ - Step 70023: {'lr': 0.00028163864871376333, 'samples': 13444416, 'steps': 70022, 'loss/train': 1.3125108480453491} 11/07/2021 07:04:25 - INFO - __main__ - Step 70024: {'lr': 0.0002816333846304044, 'samples': 13444608, 'steps': 70023, 'loss/train': 1.9813573360443115} 11/07/2021 07:04:25 - INFO - __main__ - Step 70025: {'lr': 0.0002816281205327919, 'samples': 13444800, 'steps': 70024, 'loss/train': 1.0584003925323486} 11/07/2021 07:04:26 - INFO - __main__ - Step 70026: {'lr': 0.00028162285642092835, 'samples': 13444992, 'steps': 70025, 'loss/train': 1.2748953104019165} 11/07/2021 07:04:26 - INFO - __main__ - Step 70027: {'lr': 0.000281617592294816, 'samples': 13445184, 'steps': 70026, 'loss/train': 1.791848063468933} 11/07/2021 07:04:26 - INFO - __main__ - Step 70028: {'lr': 0.00028161232815445726, 'samples': 13445376, 'steps': 70027, 'loss/train': 1.4095613956451416} 11/07/2021 07:04:27 - INFO - __main__ - Step 70029: {'lr': 0.0002816070639998545, 'samples': 13445568, 'steps': 70028, 'loss/train': 1.2042460441589355} 11/07/2021 07:04:28 - INFO - __main__ - Step 70030: {'lr': 0.00028160179983101005, 'samples': 13445760, 'steps': 70029, 'loss/train': 1.5811312198638916} 11/07/2021 07:04:28 - INFO - __main__ - Step 70031: {'lr': 0.0002815965356479263, 'samples': 13445952, 'steps': 70030, 'loss/train': 0.8947386145591736} 11/07/2021 07:04:28 - INFO - __main__ - Step 70032: {'lr': 0.0002815912714506056, 'samples': 13446144, 'steps': 70031, 'loss/train': 1.5774840116500854} 11/07/2021 07:04:29 - INFO - __main__ - Step 70033: {'lr': 0.0002815860072390505, 'samples': 13446336, 'steps': 70032, 'loss/train': 1.4748690128326416} 11/07/2021 07:04:29 - INFO - __main__ - Step 70034: {'lr': 0.0002815807430132632, 'samples': 13446528, 'steps': 70033, 'loss/train': 1.6789054870605469} 11/07/2021 07:04:30 - INFO - __main__ - Step 70035: {'lr': 0.0002815754787732461, 'samples': 13446720, 'steps': 70034, 'loss/train': 1.4309719800949097} 11/07/2021 07:04:30 - INFO - __main__ - Step 70036: {'lr': 0.0002815702145190015, 'samples': 13446912, 'steps': 70035, 'loss/train': 1.3802638053894043} 11/07/2021 07:04:31 - INFO - __main__ - Step 70037: {'lr': 0.00028156495025053184, 'samples': 13447104, 'steps': 70036, 'loss/train': 1.2865850925445557} 11/07/2021 07:04:31 - INFO - __main__ - Step 70038: {'lr': 0.0002815596859678396, 'samples': 13447296, 'steps': 70037, 'loss/train': 1.4873507022857666} 11/07/2021 07:04:31 - INFO - __main__ - Step 70039: {'lr': 0.00028155442167092707, 'samples': 13447488, 'steps': 70038, 'loss/train': 3.8830976486206055} 11/07/2021 07:04:32 - INFO - __main__ - Step 70040: {'lr': 0.0002815491573597965, 'samples': 13447680, 'steps': 70039, 'loss/train': 2.224956512451172} 11/07/2021 07:04:33 - INFO - __main__ - Step 70041: {'lr': 0.0002815438930344504, 'samples': 13447872, 'steps': 70040, 'loss/train': 1.242979884147644} 11/07/2021 07:04:33 - INFO - __main__ - Step 70042: {'lr': 0.0002815386286948911, 'samples': 13448064, 'steps': 70041, 'loss/train': 1.666434645652771} 11/07/2021 07:04:33 - INFO - __main__ - Step 70043: {'lr': 0.00028153336434112096, 'samples': 13448256, 'steps': 70042, 'loss/train': 1.8728207349777222} 11/07/2021 07:04:34 - INFO - __main__ - Step 70044: {'lr': 0.0002815280999731424, 'samples': 13448448, 'steps': 70043, 'loss/train': 1.7922937870025635} 11/07/2021 07:04:35 - INFO - __main__ - Step 70045: {'lr': 0.00028152283559095784, 'samples': 13448640, 'steps': 70044, 'loss/train': 1.4558860063552856} 11/07/2021 07:04:35 - INFO - __main__ - Step 70046: {'lr': 0.0002815175711945695, 'samples': 13448832, 'steps': 70045, 'loss/train': 1.485945701599121} 11/07/2021 07:04:35 - INFO - __main__ - Step 70047: {'lr': 0.0002815123067839798, 'samples': 13449024, 'steps': 70046, 'loss/train': 1.2324637174606323} 11/07/2021 07:04:36 - INFO - __main__ - Step 70048: {'lr': 0.00028150704235919115, 'samples': 13449216, 'steps': 70047, 'loss/train': 1.332908272743225} 11/07/2021 07:04:36 - INFO - __main__ - Step 70049: {'lr': 0.00028150177792020604, 'samples': 13449408, 'steps': 70048, 'loss/train': 1.2488411664962769} 11/07/2021 07:04:37 - INFO - __main__ - Step 70050: {'lr': 0.0002814965134670266, 'samples': 13449600, 'steps': 70049, 'loss/train': 1.3666744232177734} 11/07/2021 07:04:38 - INFO - __main__ - Step 70051: {'lr': 0.0002814912489996553, 'samples': 13449792, 'steps': 70050, 'loss/train': 1.3768681287765503} 11/07/2021 07:04:38 - INFO - __main__ - Step 70052: {'lr': 0.00028148598451809454, 'samples': 13449984, 'steps': 70051, 'loss/train': 1.783030390739441} 11/07/2021 07:04:38 - INFO - __main__ - Step 70053: {'lr': 0.0002814807200223467, 'samples': 13450176, 'steps': 70052, 'loss/train': 1.5161598920822144} 11/07/2021 07:04:39 - INFO - __main__ - Step 70054: {'lr': 0.00028147545551241414, 'samples': 13450368, 'steps': 70053, 'loss/train': 1.536658763885498} 11/07/2021 07:04:39 - INFO - __main__ - Step 70055: {'lr': 0.00028147019098829926, 'samples': 13450560, 'steps': 70054, 'loss/train': 1.2588399648666382} 11/07/2021 07:04:40 - INFO - __main__ - Step 70056: {'lr': 0.0002814649264500044, 'samples': 13450752, 'steps': 70055, 'loss/train': 1.6274981498718262} 11/07/2021 07:04:40 - INFO - __main__ - Step 70057: {'lr': 0.00028145966189753186, 'samples': 13450944, 'steps': 70056, 'loss/train': 1.6995598077774048} 11/07/2021 07:04:41 - INFO - __main__ - Step 70058: {'lr': 0.00028145439733088406, 'samples': 13451136, 'steps': 70057, 'loss/train': 0.46912261843681335} 11/07/2021 07:04:41 - INFO - __main__ - Step 70059: {'lr': 0.00028144913275006346, 'samples': 13451328, 'steps': 70058, 'loss/train': 1.6064571142196655} 11/07/2021 07:04:41 - INFO - __main__ - Step 70060: {'lr': 0.00028144386815507234, 'samples': 13451520, 'steps': 70059, 'loss/train': 1.3197320699691772} 11/07/2021 07:04:43 - INFO - __main__ - Step 70061: {'lr': 0.00028143860354591313, 'samples': 13451712, 'steps': 70060, 'loss/train': 1.9545422792434692} 11/07/2021 07:04:43 - INFO - __main__ - Step 70062: {'lr': 0.00028143333892258817, 'samples': 13451904, 'steps': 70061, 'loss/train': 1.5332826375961304} 11/07/2021 07:04:43 - INFO - __main__ - Step 70063: {'lr': 0.0002814280742850998, 'samples': 13452096, 'steps': 70062, 'loss/train': 1.4998046159744263} 11/07/2021 07:04:44 - INFO - __main__ - Step 70064: {'lr': 0.0002814228096334505, 'samples': 13452288, 'steps': 70063, 'loss/train': 1.546063780784607} 11/07/2021 07:04:44 - INFO - __main__ - Step 70065: {'lr': 0.0002814175449676424, 'samples': 13452480, 'steps': 70064, 'loss/train': 1.2898380756378174} 11/07/2021 07:04:45 - INFO - __main__ - Step 70066: {'lr': 0.0002814122802876782, 'samples': 13452672, 'steps': 70065, 'loss/train': 1.453209638595581} 11/07/2021 07:04:45 - INFO - __main__ - Step 70067: {'lr': 0.00028140701559356004, 'samples': 13452864, 'steps': 70066, 'loss/train': 1.4202722311019897} 11/07/2021 07:04:46 - INFO - __main__ - Step 70068: {'lr': 0.00028140175088529033, 'samples': 13453056, 'steps': 70067, 'loss/train': 1.1303067207336426} 11/07/2021 07:04:46 - INFO - __main__ - Step 70069: {'lr': 0.00028139648616287157, 'samples': 13453248, 'steps': 70068, 'loss/train': 1.5046530961990356} 11/07/2021 07:04:46 - INFO - __main__ - Step 70070: {'lr': 0.000281391221426306, 'samples': 13453440, 'steps': 70069, 'loss/train': 0.645383894443512} 11/07/2021 07:04:47 - INFO - __main__ - Step 70071: {'lr': 0.00028138595667559605, 'samples': 13453632, 'steps': 70070, 'loss/train': 0.8995472192764282} 11/07/2021 07:04:48 - INFO - __main__ - Step 70072: {'lr': 0.000281380691910744, 'samples': 13453824, 'steps': 70071, 'loss/train': 1.7238733768463135} 11/07/2021 07:04:48 - INFO - __main__ - Step 70073: {'lr': 0.00028137542713175227, 'samples': 13454016, 'steps': 70072, 'loss/train': 1.1276856660842896} 11/07/2021 07:04:48 - INFO - __main__ - Step 70074: {'lr': 0.00028137016233862336, 'samples': 13454208, 'steps': 70073, 'loss/train': 0.8503963947296143} 11/07/2021 07:04:49 - INFO - __main__ - Step 70075: {'lr': 0.0002813648975313595, 'samples': 13454400, 'steps': 70074, 'loss/train': 1.6651809215545654} 11/07/2021 07:04:50 - INFO - __main__ - Step 70076: {'lr': 0.0002813596327099631, 'samples': 13454592, 'steps': 70075, 'loss/train': 1.5053411722183228} 11/07/2021 07:04:50 - INFO - __main__ - Step 70077: {'lr': 0.0002813543678744366, 'samples': 13454784, 'steps': 70076, 'loss/train': 1.1973044872283936} 11/07/2021 07:04:51 - INFO - __main__ - Step 70078: {'lr': 0.00028134910302478225, 'samples': 13454976, 'steps': 70077, 'loss/train': 1.8745156526565552} 11/07/2021 07:04:51 - INFO - __main__ - Step 70079: {'lr': 0.0002813438381610024, 'samples': 13455168, 'steps': 70078, 'loss/train': 1.5516690015792847} 11/07/2021 07:04:51 - INFO - __main__ - Step 70080: {'lr': 0.0002813385732830996, 'samples': 13455360, 'steps': 70079, 'loss/train': 1.2321680784225464} 11/07/2021 07:04:52 - INFO - __main__ - Step 70081: {'lr': 0.00028133330839107606, 'samples': 13455552, 'steps': 70080, 'loss/train': 0.7777218818664551} 11/07/2021 07:04:53 - INFO - __main__ - Step 70082: {'lr': 0.0002813280434849343, 'samples': 13455744, 'steps': 70081, 'loss/train': 1.6793484687805176} 11/07/2021 07:04:53 - INFO - __main__ - Step 70083: {'lr': 0.0002813227785646765, 'samples': 13455936, 'steps': 70082, 'loss/train': 1.226076364517212} 11/07/2021 07:04:53 - INFO - __main__ - Step 70084: {'lr': 0.00028131751363030523, 'samples': 13456128, 'steps': 70083, 'loss/train': 2.1532368659973145} 11/07/2021 07:04:54 - INFO - __main__ - Step 70085: {'lr': 0.0002813122486818228, 'samples': 13456320, 'steps': 70084, 'loss/train': 1.475824236869812} 11/07/2021 07:04:54 - INFO - __main__ - Step 70086: {'lr': 0.0002813069837192314, 'samples': 13456512, 'steps': 70085, 'loss/train': 1.6467963457107544} 11/07/2021 07:04:55 - INFO - __main__ - Step 70087: {'lr': 0.0002813017187425336, 'samples': 13456704, 'steps': 70086, 'loss/train': 1.5775514841079712} 11/07/2021 07:04:55 - INFO - __main__ - Step 70088: {'lr': 0.0002812964537517318, 'samples': 13456896, 'steps': 70087, 'loss/train': 1.5040746927261353} 11/07/2021 07:04:56 - INFO - __main__ - Step 70089: {'lr': 0.00028129118874682836, 'samples': 13457088, 'steps': 70088, 'loss/train': 2.025909185409546} 11/07/2021 07:04:56 - INFO - __main__ - Step 70090: {'lr': 0.00028128592372782545, 'samples': 13457280, 'steps': 70089, 'loss/train': 1.7318605184555054} 11/07/2021 07:04:57 - INFO - __main__ - Step 70091: {'lr': 0.0002812806586947257, 'samples': 13457472, 'steps': 70090, 'loss/train': 1.421805739402771} 11/07/2021 07:04:58 - INFO - __main__ - Step 70092: {'lr': 0.0002812753936475313, 'samples': 13457664, 'steps': 70091, 'loss/train': 0.6459929347038269} 11/07/2021 07:04:58 - INFO - __main__ - Step 70093: {'lr': 0.0002812701285862447, 'samples': 13457856, 'steps': 70092, 'loss/train': 1.1272480487823486} 11/07/2021 07:04:59 - INFO - __main__ - Step 70094: {'lr': 0.0002812648635108682, 'samples': 13458048, 'steps': 70093, 'loss/train': 0.24794311821460724} 11/07/2021 07:04:59 - INFO - __main__ - Step 70095: {'lr': 0.0002812595984214043, 'samples': 13458240, 'steps': 70094, 'loss/train': 1.456539273262024} 11/07/2021 07:04:59 - INFO - __main__ - Step 70096: {'lr': 0.0002812543333178554, 'samples': 13458432, 'steps': 70095, 'loss/train': 1.469517707824707} 11/07/2021 07:05:00 - INFO - __main__ - Step 70097: {'lr': 0.00028124906820022364, 'samples': 13458624, 'steps': 70096, 'loss/train': 1.2770909070968628} 11/07/2021 07:05:01 - INFO - __main__ - Step 70098: {'lr': 0.0002812438030685116, 'samples': 13458816, 'steps': 70097, 'loss/train': 0.3209877014160156} 11/07/2021 07:05:01 - INFO - __main__ - Step 70099: {'lr': 0.0002812385379227215, 'samples': 13459008, 'steps': 70098, 'loss/train': 1.3803415298461914} 11/07/2021 07:05:01 - INFO - __main__ - Step 70100: {'lr': 0.00028123327276285585, 'samples': 13459200, 'steps': 70099, 'loss/train': 1.351377248764038} 11/07/2021 07:05:02 - INFO - __main__ - Step 70101: {'lr': 0.00028122800758891703, 'samples': 13459392, 'steps': 70100, 'loss/train': 1.70977783203125} 11/07/2021 07:05:02 - INFO - __main__ - Step 70102: {'lr': 0.00028122274240090727, 'samples': 13459584, 'steps': 70101, 'loss/train': 1.2985137701034546} 11/07/2021 07:05:03 - INFO - __main__ - Step 70103: {'lr': 0.0002812174771988291, 'samples': 13459776, 'steps': 70102, 'loss/train': 1.3318462371826172} 11/07/2021 07:05:03 - INFO - __main__ - Step 70104: {'lr': 0.00028121221198268475, 'samples': 13459968, 'steps': 70103, 'loss/train': 1.0032402276992798} 11/07/2021 07:05:04 - INFO - __main__ - Step 70105: {'lr': 0.0002812069467524767, 'samples': 13460160, 'steps': 70104, 'loss/train': 1.9599453210830688} 11/07/2021 07:05:04 - INFO - __main__ - Step 70106: {'lr': 0.00028120168150820726, 'samples': 13460352, 'steps': 70105, 'loss/train': 1.4664413928985596} 11/07/2021 07:05:04 - INFO - __main__ - Step 70107: {'lr': 0.0002811964162498788, 'samples': 13460544, 'steps': 70106, 'loss/train': 1.2452869415283203} 11/07/2021 07:05:05 - INFO - __main__ - Step 70108: {'lr': 0.00028119115097749377, 'samples': 13460736, 'steps': 70107, 'loss/train': 1.0759719610214233} 11/07/2021 07:05:06 - INFO - __main__ - Step 70109: {'lr': 0.00028118588569105445, 'samples': 13460928, 'steps': 70108, 'loss/train': 1.2960989475250244} 11/07/2021 07:05:06 - INFO - __main__ - Step 70110: {'lr': 0.0002811806203905633, 'samples': 13461120, 'steps': 70109, 'loss/train': 1.0686899423599243} 11/07/2021 07:05:06 - INFO - __main__ - Step 70111: {'lr': 0.0002811753550760226, 'samples': 13461312, 'steps': 70110, 'loss/train': 1.647113561630249} 11/07/2021 07:05:07 - INFO - __main__ - Step 70112: {'lr': 0.00028117008974743476, 'samples': 13461504, 'steps': 70111, 'loss/train': 1.4219120740890503} 11/07/2021 07:05:08 - INFO - __main__ - Step 70113: {'lr': 0.00028116482440480216, 'samples': 13461696, 'steps': 70112, 'loss/train': 1.4109625816345215} 11/07/2021 07:05:08 - INFO - __main__ - Step 70114: {'lr': 0.0002811595590481272, 'samples': 13461888, 'steps': 70113, 'loss/train': 1.6145117282867432} 11/07/2021 07:05:09 - INFO - __main__ - Step 70115: {'lr': 0.0002811542936774122, 'samples': 13462080, 'steps': 70114, 'loss/train': 0.5629064440727234} 11/07/2021 07:05:09 - INFO - __main__ - Step 70116: {'lr': 0.00028114902829265957, 'samples': 13462272, 'steps': 70115, 'loss/train': 1.557202935218811} 11/07/2021 07:05:09 - INFO - __main__ - Step 70117: {'lr': 0.0002811437628938717, 'samples': 13462464, 'steps': 70116, 'loss/train': 1.2486830949783325} 11/07/2021 07:05:10 - INFO - __main__ - Step 70118: {'lr': 0.0002811384974810508, 'samples': 13462656, 'steps': 70117, 'loss/train': 1.5106003284454346} 11/07/2021 07:05:11 - INFO - __main__ - Step 70119: {'lr': 0.0002811332320541995, 'samples': 13462848, 'steps': 70118, 'loss/train': 1.5050218105316162} 11/07/2021 07:05:11 - INFO - __main__ - Step 70120: {'lr': 0.00028112796661332, 'samples': 13463040, 'steps': 70119, 'loss/train': 1.4084579944610596} 11/07/2021 07:05:12 - INFO - __main__ - Step 70121: {'lr': 0.0002811227011584147, 'samples': 13463232, 'steps': 70120, 'loss/train': 1.7831093072891235} 11/07/2021 07:05:12 - INFO - __main__ - Step 70122: {'lr': 0.000281117435689486, 'samples': 13463424, 'steps': 70121, 'loss/train': 1.4440362453460693} 11/07/2021 07:05:12 - INFO - __main__ - Step 70123: {'lr': 0.00028111217020653634, 'samples': 13463616, 'steps': 70122, 'loss/train': 2.113037109375} 11/07/2021 07:05:13 - INFO - __main__ - Step 70124: {'lr': 0.00028110690470956794, 'samples': 13463808, 'steps': 70123, 'loss/train': 1.7452740669250488} 11/07/2021 07:05:14 - INFO - __main__ - Step 70125: {'lr': 0.0002811016391985833, 'samples': 13464000, 'steps': 70124, 'loss/train': 1.3337187767028809} 11/07/2021 07:05:14 - INFO - __main__ - Step 70126: {'lr': 0.0002810963736735847, 'samples': 13464192, 'steps': 70125, 'loss/train': 1.5262748003005981} 11/07/2021 07:05:14 - INFO - __main__ - Step 70127: {'lr': 0.00028109110813457456, 'samples': 13464384, 'steps': 70126, 'loss/train': 1.200858235359192} 11/07/2021 07:05:15 - INFO - __main__ - Step 70128: {'lr': 0.00028108584258155524, 'samples': 13464576, 'steps': 70127, 'loss/train': 1.5168763399124146} 11/07/2021 07:05:16 - INFO - __main__ - Step 70129: {'lr': 0.00028108057701452916, 'samples': 13464768, 'steps': 70128, 'loss/train': 1.4522346258163452} 11/07/2021 07:05:16 - INFO - __main__ - Step 70130: {'lr': 0.0002810753114334986, 'samples': 13464960, 'steps': 70129, 'loss/train': 1.4761627912521362} 11/07/2021 07:05:16 - INFO - __main__ - Step 70131: {'lr': 0.000281070045838466, 'samples': 13465152, 'steps': 70130, 'loss/train': 1.6065784692764282} 11/07/2021 07:05:17 - INFO - __main__ - Step 70132: {'lr': 0.0002810647802294337, 'samples': 13465344, 'steps': 70131, 'loss/train': 0.7785089015960693} 11/07/2021 07:05:17 - INFO - __main__ - Step 70133: {'lr': 0.0002810595146064041, 'samples': 13465536, 'steps': 70132, 'loss/train': 1.3710218667984009} 11/07/2021 07:05:18 - INFO - __main__ - Step 70134: {'lr': 0.0002810542489693796, 'samples': 13465728, 'steps': 70133, 'loss/train': 1.3326586484909058} 11/07/2021 07:05:18 - INFO - __main__ - Step 70135: {'lr': 0.0002810489833183625, 'samples': 13465920, 'steps': 70134, 'loss/train': 0.6417489051818848} 11/07/2021 07:05:19 - INFO - __main__ - Step 70136: {'lr': 0.0002810437176533552, 'samples': 13466112, 'steps': 70135, 'loss/train': 1.8331691026687622} 11/07/2021 07:05:19 - INFO - __main__ - Step 70137: {'lr': 0.0002810384519743601, 'samples': 13466304, 'steps': 70136, 'loss/train': 1.3145228624343872} 11/07/2021 07:05:19 - INFO - __main__ - Step 70138: {'lr': 0.00028103318628137957, 'samples': 13466496, 'steps': 70137, 'loss/train': 1.0434253215789795} 11/07/2021 07:05:20 - INFO - __main__ - Step 70139: {'lr': 0.00028102792057441595, 'samples': 13466688, 'steps': 70138, 'loss/train': 1.4507935047149658} 11/07/2021 07:05:21 - INFO - __main__ - Step 70140: {'lr': 0.0002810226548534716, 'samples': 13466880, 'steps': 70139, 'loss/train': 1.479357123374939} 11/07/2021 07:05:21 - INFO - __main__ - Step 70141: {'lr': 0.0002810173891185489, 'samples': 13467072, 'steps': 70140, 'loss/train': 1.7065706253051758} 11/07/2021 07:05:22 - INFO - __main__ - Step 70142: {'lr': 0.0002810121233696503, 'samples': 13467264, 'steps': 70141, 'loss/train': 1.3569234609603882} 11/07/2021 07:05:22 - INFO - __main__ - Step 70143: {'lr': 0.0002810068576067781, 'samples': 13467456, 'steps': 70142, 'loss/train': 1.2014575004577637} 11/07/2021 07:05:22 - INFO - __main__ - Step 70144: {'lr': 0.00028100159182993474, 'samples': 13467648, 'steps': 70143, 'loss/train': 1.6802690029144287} 11/07/2021 07:05:23 - INFO - __main__ - Step 70145: {'lr': 0.00028099632603912245, 'samples': 13467840, 'steps': 70144, 'loss/train': 1.816648244857788} 11/07/2021 07:05:24 - INFO - __main__ - Step 70146: {'lr': 0.00028099106023434374, 'samples': 13468032, 'steps': 70145, 'loss/train': 1.1273798942565918} 11/07/2021 07:05:24 - INFO - __main__ - Step 70147: {'lr': 0.0002809857944156009, 'samples': 13468224, 'steps': 70146, 'loss/train': 1.3759719133377075} 11/07/2021 07:05:24 - INFO - __main__ - Step 70148: {'lr': 0.00028098052858289643, 'samples': 13468416, 'steps': 70147, 'loss/train': 1.510937213897705} 11/07/2021 07:05:25 - INFO - __main__ - Step 70149: {'lr': 0.00028097526273623255, 'samples': 13468608, 'steps': 70148, 'loss/train': 1.4689348936080933} 11/07/2021 07:05:26 - INFO - __main__ - Step 70150: {'lr': 0.0002809699968756117, 'samples': 13468800, 'steps': 70149, 'loss/train': 0.9380255937576294} 11/07/2021 07:05:26 - INFO - __main__ - Step 70151: {'lr': 0.0002809647310010362, 'samples': 13468992, 'steps': 70150, 'loss/train': 1.156260371208191} 11/07/2021 07:05:26 - INFO - __main__ - Step 70152: {'lr': 0.0002809594651125085, 'samples': 13469184, 'steps': 70151, 'loss/train': 1.7384750843048096} 11/07/2021 07:05:27 - INFO - __main__ - Step 70153: {'lr': 0.00028095419921003094, 'samples': 13469376, 'steps': 70152, 'loss/train': 1.2513140439987183} 11/07/2021 07:05:27 - INFO - __main__ - Step 70154: {'lr': 0.0002809489332936059, 'samples': 13469568, 'steps': 70153, 'loss/train': 1.076149344444275} 11/07/2021 07:05:28 - INFO - __main__ - Step 70155: {'lr': 0.00028094366736323577, 'samples': 13469760, 'steps': 70154, 'loss/train': 1.4948176145553589} 11/07/2021 07:05:29 - INFO - __main__ - Step 70156: {'lr': 0.00028093840141892295, 'samples': 13469952, 'steps': 70155, 'loss/train': 0.5562907457351685} 11/07/2021 07:05:29 - INFO - __main__ - Step 70157: {'lr': 0.0002809331354606697, 'samples': 13470144, 'steps': 70156, 'loss/train': 0.6407517790794373} 11/07/2021 07:05:29 - INFO - __main__ - Step 70158: {'lr': 0.00028092786948847844, 'samples': 13470336, 'steps': 70157, 'loss/train': 0.5516869425773621} 11/07/2021 07:05:30 - INFO - __main__ - Step 70159: {'lr': 0.0002809226035023516, 'samples': 13470528, 'steps': 70158, 'loss/train': 1.2316147089004517} 11/07/2021 07:05:31 - INFO - __main__ - Step 70160: {'lr': 0.00028091733750229146, 'samples': 13470720, 'steps': 70159, 'loss/train': 1.213350534439087} 11/07/2021 07:05:31 - INFO - __main__ - Step 70161: {'lr': 0.00028091207148830044, 'samples': 13470912, 'steps': 70160, 'loss/train': 1.3080586194992065} 11/07/2021 07:05:32 - INFO - __main__ - Step 70162: {'lr': 0.00028090680546038105, 'samples': 13471104, 'steps': 70161, 'loss/train': 1.2234539985656738} 11/07/2021 07:05:32 - INFO - __main__ - Step 70163: {'lr': 0.0002809015394185354, 'samples': 13471296, 'steps': 70162, 'loss/train': 1.4880269765853882} 11/07/2021 07:05:32 - INFO - __main__ - Step 70164: {'lr': 0.000280896273362766, 'samples': 13471488, 'steps': 70163, 'loss/train': 1.4034405946731567} 11/07/2021 07:05:33 - INFO - __main__ - Step 70165: {'lr': 0.0002808910072930753, 'samples': 13471680, 'steps': 70164, 'loss/train': 1.2177385091781616} 11/07/2021 07:05:34 - INFO - __main__ - Step 70166: {'lr': 0.0002808857412094655, 'samples': 13471872, 'steps': 70165, 'loss/train': 1.4271228313446045} 11/07/2021 07:05:34 - INFO - __main__ - Step 70167: {'lr': 0.00028088047511193917, 'samples': 13472064, 'steps': 70166, 'loss/train': 1.8744441270828247} 11/07/2021 07:05:35 - INFO - __main__ - Step 70168: {'lr': 0.0002808752090004985, 'samples': 13472256, 'steps': 70167, 'loss/train': 0.8054586052894592} 11/07/2021 07:05:35 - INFO - __main__ - Step 70169: {'lr': 0.0002808699428751459, 'samples': 13472448, 'steps': 70168, 'loss/train': 1.4343801736831665} 11/07/2021 07:05:35 - INFO - __main__ - Step 70170: {'lr': 0.0002808646767358838, 'samples': 13472640, 'steps': 70169, 'loss/train': 1.2909021377563477} 11/07/2021 07:05:36 - INFO - __main__ - Step 70171: {'lr': 0.00028085941058271453, 'samples': 13472832, 'steps': 70170, 'loss/train': 1.2467658519744873} 11/07/2021 07:05:37 - INFO - __main__ - Step 70172: {'lr': 0.0002808541444156405, 'samples': 13473024, 'steps': 70171, 'loss/train': 1.5316760540008545} 11/07/2021 07:05:37 - INFO - __main__ - Step 70173: {'lr': 0.00028084887823466413, 'samples': 13473216, 'steps': 70172, 'loss/train': 1.2118433713912964} 11/07/2021 07:05:37 - INFO - __main__ - Step 70174: {'lr': 0.0002808436120397877, 'samples': 13473408, 'steps': 70173, 'loss/train': 1.320369005203247} 11/07/2021 07:05:38 - INFO - __main__ - Step 70175: {'lr': 0.0002808383458310136, 'samples': 13473600, 'steps': 70174, 'loss/train': 1.6961201429367065} 11/07/2021 07:05:38 - INFO - __main__ - Step 70176: {'lr': 0.00028083307960834425, 'samples': 13473792, 'steps': 70175, 'loss/train': 1.612949252128601} 11/07/2021 07:05:39 - INFO - __main__ - Step 70177: {'lr': 0.0002808278133717819, 'samples': 13473984, 'steps': 70176, 'loss/train': 1.5961942672729492} 11/07/2021 07:05:39 - INFO - __main__ - Step 70178: {'lr': 0.00028082254712132916, 'samples': 13474176, 'steps': 70177, 'loss/train': 1.5785974264144897} 11/07/2021 07:05:40 - INFO - __main__ - Step 70179: {'lr': 0.00028081728085698816, 'samples': 13474368, 'steps': 70178, 'loss/train': 1.053095817565918} 11/07/2021 07:05:40 - INFO - __main__ - Step 70180: {'lr': 0.0002808120145787614, 'samples': 13474560, 'steps': 70179, 'loss/train': 1.907427191734314} 11/07/2021 07:05:40 - INFO - __main__ - Step 70181: {'lr': 0.0002808067482866512, 'samples': 13474752, 'steps': 70180, 'loss/train': 1.2245315313339233} 11/07/2021 07:05:41 - INFO - __main__ - Step 70182: {'lr': 0.00028080148198065993, 'samples': 13474944, 'steps': 70181, 'loss/train': 1.6653763055801392} 11/07/2021 07:05:42 - INFO - __main__ - Step 70183: {'lr': 0.00028079621566079005, 'samples': 13475136, 'steps': 70182, 'loss/train': 1.8072397708892822} 11/07/2021 07:05:42 - INFO - __main__ - Step 70184: {'lr': 0.00028079094932704384, 'samples': 13475328, 'steps': 70183, 'loss/train': 1.5060405731201172} 11/07/2021 07:05:42 - INFO - __main__ - Step 70185: {'lr': 0.0002807856829794237, 'samples': 13475520, 'steps': 70184, 'loss/train': 1.5942165851593018} 11/07/2021 07:05:43 - INFO - __main__ - Step 70186: {'lr': 0.000280780416617932, 'samples': 13475712, 'steps': 70185, 'loss/train': 1.5203369855880737} 11/07/2021 07:05:44 - INFO - __main__ - Step 70187: {'lr': 0.00028077515024257113, 'samples': 13475904, 'steps': 70186, 'loss/train': 1.701763391494751} 11/07/2021 07:05:44 - INFO - __main__ - Step 70188: {'lr': 0.0002807698838533435, 'samples': 13476096, 'steps': 70187, 'loss/train': 1.0664719343185425} 11/07/2021 07:05:44 - INFO - __main__ - Step 70189: {'lr': 0.00028076461745025127, 'samples': 13476288, 'steps': 70188, 'loss/train': 1.5290882587432861} 11/07/2021 07:05:45 - INFO - __main__ - Step 70190: {'lr': 0.0002807593510332972, 'samples': 13476480, 'steps': 70189, 'loss/train': 0.9088031649589539} 11/07/2021 07:05:45 - INFO - __main__ - Step 70191: {'lr': 0.0002807540846024833, 'samples': 13476672, 'steps': 70190, 'loss/train': 1.2668132781982422} 11/07/2021 07:05:46 - INFO - __main__ - Step 70192: {'lr': 0.0002807488181578121, 'samples': 13476864, 'steps': 70191, 'loss/train': 1.6250380277633667} 11/07/2021 07:05:47 - INFO - __main__ - Step 70193: {'lr': 0.000280743551699286, 'samples': 13477056, 'steps': 70192, 'loss/train': 1.0778205394744873} 11/07/2021 07:05:47 - INFO - __main__ - Step 70194: {'lr': 0.00028073828522690725, 'samples': 13477248, 'steps': 70193, 'loss/train': 1.2957053184509277} 11/07/2021 07:05:47 - INFO - __main__ - Step 70195: {'lr': 0.00028073301874067836, 'samples': 13477440, 'steps': 70194, 'loss/train': 1.256188988685608} 11/07/2021 07:05:48 - INFO - __main__ - Step 70196: {'lr': 0.00028072775224060166, 'samples': 13477632, 'steps': 70195, 'loss/train': 1.7546542882919312} 11/07/2021 07:05:49 - INFO - __main__ - Step 70197: {'lr': 0.00028072248572667954, 'samples': 13477824, 'steps': 70196, 'loss/train': 1.5651007890701294} 11/07/2021 07:05:49 - INFO - __main__ - Step 70198: {'lr': 0.00028071721919891427, 'samples': 13478016, 'steps': 70197, 'loss/train': 1.536921739578247} 11/07/2021 07:05:49 - INFO - __main__ - Step 70199: {'lr': 0.0002807119526573083, 'samples': 13478208, 'steps': 70198, 'loss/train': 1.6171990633010864} 11/07/2021 07:05:50 - INFO - __main__ - Step 70200: {'lr': 0.000280706686101864, 'samples': 13478400, 'steps': 70199, 'loss/train': 1.8583475351333618} 11/07/2021 07:05:50 - INFO - __main__ - Step 70201: {'lr': 0.00028070141953258376, 'samples': 13478592, 'steps': 70200, 'loss/train': 1.1770070791244507} 11/07/2021 07:05:51 - INFO - __main__ - Step 70202: {'lr': 0.0002806961529494699, 'samples': 13478784, 'steps': 70201, 'loss/train': 1.2824177742004395} 11/07/2021 07:05:51 - INFO - __main__ - Step 70203: {'lr': 0.00028069088635252496, 'samples': 13478976, 'steps': 70202, 'loss/train': 1.598665714263916} 11/07/2021 07:05:52 - INFO - __main__ - Step 70204: {'lr': 0.00028068561974175106, 'samples': 13479168, 'steps': 70203, 'loss/train': 1.571709394454956} 11/07/2021 07:05:52 - INFO - __main__ - Step 70205: {'lr': 0.0002806803531171507, 'samples': 13479360, 'steps': 70204, 'loss/train': 1.2022978067398071} 11/07/2021 07:05:52 - INFO - __main__ - Step 70206: {'lr': 0.00028067508647872623, 'samples': 13479552, 'steps': 70205, 'loss/train': 1.5945700407028198} 11/07/2021 07:05:53 - INFO - __main__ - Step 70207: {'lr': 0.0002806698198264801, 'samples': 13479744, 'steps': 70206, 'loss/train': 1.4551738500595093} 11/07/2021 07:05:54 - INFO - __main__ - Step 70208: {'lr': 0.0002806645531604146, 'samples': 13479936, 'steps': 70207, 'loss/train': 1.5660228729248047} 11/07/2021 07:05:54 - INFO - __main__ - Step 70209: {'lr': 0.00028065928648053206, 'samples': 13480128, 'steps': 70208, 'loss/train': 1.357697606086731} 11/07/2021 07:05:54 - INFO - __main__ - Step 70210: {'lr': 0.000280654019786835, 'samples': 13480320, 'steps': 70209, 'loss/train': 1.4971439838409424} 11/07/2021 07:05:55 - INFO - __main__ - Step 70211: {'lr': 0.00028064875307932567, 'samples': 13480512, 'steps': 70210, 'loss/train': 1.1610262393951416} 11/07/2021 07:05:55 - INFO - __main__ - Step 70212: {'lr': 0.0002806434863580065, 'samples': 13480704, 'steps': 70211, 'loss/train': 0.590472936630249} 11/07/2021 07:05:56 - INFO - __main__ - Step 70213: {'lr': 0.0002806382196228799, 'samples': 13480896, 'steps': 70212, 'loss/train': 1.2909220457077026} 11/07/2021 07:05:57 - INFO - __main__ - Step 70214: {'lr': 0.00028063295287394815, 'samples': 13481088, 'steps': 70213, 'loss/train': 1.1714955568313599} 11/07/2021 07:05:57 - INFO - __main__ - Step 70215: {'lr': 0.00028062768611121356, 'samples': 13481280, 'steps': 70214, 'loss/train': 1.2039859294891357} 11/07/2021 07:05:57 - INFO - __main__ - Step 70216: {'lr': 0.00028062241933467875, 'samples': 13481472, 'steps': 70215, 'loss/train': 1.6643041372299194} 11/07/2021 07:05:58 - INFO - __main__ - Step 70217: {'lr': 0.00028061715254434596, 'samples': 13481664, 'steps': 70216, 'loss/train': 1.4992241859436035} 11/07/2021 07:05:59 - INFO - __main__ - Step 70218: {'lr': 0.00028061188574021745, 'samples': 13481856, 'steps': 70217, 'loss/train': 1.4472992420196533} 11/07/2021 07:05:59 - INFO - __main__ - Step 70219: {'lr': 0.00028060661892229577, 'samples': 13482048, 'steps': 70218, 'loss/train': 1.6105271577835083} 11/07/2021 07:05:59 - INFO - __main__ - Step 70220: {'lr': 0.0002806013520905832, 'samples': 13482240, 'steps': 70219, 'loss/train': 1.1890822649002075} 11/07/2021 07:06:00 - INFO - __main__ - Step 70221: {'lr': 0.0002805960852450821, 'samples': 13482432, 'steps': 70220, 'loss/train': 1.1741105318069458} 11/07/2021 07:06:00 - INFO - __main__ - Step 70222: {'lr': 0.0002805908183857949, 'samples': 13482624, 'steps': 70221, 'loss/train': 1.2737280130386353} 11/07/2021 07:06:01 - INFO - __main__ - Step 70223: {'lr': 0.0002805855515127239, 'samples': 13482816, 'steps': 70222, 'loss/train': 1.704629898071289} 11/07/2021 07:06:02 - INFO - __main__ - Step 70224: {'lr': 0.00028058028462587165, 'samples': 13483008, 'steps': 70223, 'loss/train': 1.1140661239624023} 11/07/2021 07:06:02 - INFO - __main__ - Step 70225: {'lr': 0.0002805750177252403, 'samples': 13483200, 'steps': 70224, 'loss/train': 1.2337260246276855} 11/07/2021 07:06:02 - INFO - __main__ - Step 70226: {'lr': 0.0002805697508108323, 'samples': 13483392, 'steps': 70225, 'loss/train': 1.4169114828109741} 11/07/2021 07:06:03 - INFO - __main__ - Step 70227: {'lr': 0.0002805644838826501, 'samples': 13483584, 'steps': 70226, 'loss/train': 1.4990086555480957} 11/07/2021 07:06:04 - INFO - __main__ - Step 70228: {'lr': 0.000280559216940696, 'samples': 13483776, 'steps': 70227, 'loss/train': 1.4534393548965454} 11/07/2021 07:06:04 - INFO - __main__ - Step 70229: {'lr': 0.00028055394998497237, 'samples': 13483968, 'steps': 70228, 'loss/train': 1.4591227769851685} 11/07/2021 07:06:04 - INFO - __main__ - Step 70230: {'lr': 0.00028054868301548167, 'samples': 13484160, 'steps': 70229, 'loss/train': 1.3099806308746338} 11/07/2021 07:06:05 - INFO - __main__ - Step 70231: {'lr': 0.0002805434160322261, 'samples': 13484352, 'steps': 70230, 'loss/train': 1.6145957708358765} 11/07/2021 07:06:05 - INFO - __main__ - Step 70232: {'lr': 0.0002805381490352082, 'samples': 13484544, 'steps': 70231, 'loss/train': 1.66966712474823} 11/07/2021 07:06:05 - INFO - __main__ - Step 70233: {'lr': 0.0002805328820244303, 'samples': 13484736, 'steps': 70232, 'loss/train': 1.3262509107589722} 11/07/2021 07:06:06 - INFO - __main__ - Step 70234: {'lr': 0.00028052761499989463, 'samples': 13484928, 'steps': 70233, 'loss/train': 1.5852785110473633} 11/07/2021 07:06:07 - INFO - __main__ - Step 70235: {'lr': 0.0002805223479616038, 'samples': 13485120, 'steps': 70234, 'loss/train': 1.060062289237976} 11/07/2021 07:06:07 - INFO - __main__ - Step 70236: {'lr': 0.00028051708090956007, 'samples': 13485312, 'steps': 70235, 'loss/train': 1.6813052892684937} 11/07/2021 07:06:07 - INFO - __main__ - Step 70237: {'lr': 0.0002805118138437658, 'samples': 13485504, 'steps': 70236, 'loss/train': 1.4463438987731934} 11/07/2021 07:06:08 - INFO - __main__ - Step 70238: {'lr': 0.0002805065467642234, 'samples': 13485696, 'steps': 70237, 'loss/train': 1.4853531122207642} 11/07/2021 07:06:09 - INFO - __main__ - Step 70239: {'lr': 0.0002805012796709352, 'samples': 13485888, 'steps': 70238, 'loss/train': 1.0485813617706299} 11/07/2021 07:06:09 - INFO - __main__ - Step 70240: {'lr': 0.00028049601256390356, 'samples': 13486080, 'steps': 70239, 'loss/train': 1.5664608478546143} 11/07/2021 07:06:09 - INFO - __main__ - Step 70241: {'lr': 0.00028049074544313094, 'samples': 13486272, 'steps': 70240, 'loss/train': 1.2510805130004883} 11/07/2021 07:06:10 - INFO - __main__ - Step 70242: {'lr': 0.00028048547830861957, 'samples': 13486464, 'steps': 70241, 'loss/train': 1.481245756149292} 11/07/2021 07:06:10 - INFO - __main__ - Step 70243: {'lr': 0.000280480211160372, 'samples': 13486656, 'steps': 70242, 'loss/train': 1.1253571510314941} 11/07/2021 07:06:11 - INFO - __main__ - Step 70244: {'lr': 0.0002804749439983906, 'samples': 13486848, 'steps': 70243, 'loss/train': 0.7923824787139893} 11/07/2021 07:06:11 - INFO - __main__ - Step 70245: {'lr': 0.0002804696768226775, 'samples': 13487040, 'steps': 70244, 'loss/train': 1.429172158241272} 11/07/2021 07:06:12 - INFO - __main__ - Step 70246: {'lr': 0.0002804644096332353, 'samples': 13487232, 'steps': 70245, 'loss/train': 1.7476022243499756} 11/07/2021 07:06:12 - INFO - __main__ - Step 70247: {'lr': 0.00028045914243006627, 'samples': 13487424, 'steps': 70246, 'loss/train': 1.1971712112426758} 11/07/2021 07:06:13 - INFO - __main__ - Step 70248: {'lr': 0.00028045387521317283, 'samples': 13487616, 'steps': 70247, 'loss/train': 1.4380676746368408} 11/07/2021 07:06:14 - INFO - __main__ - Step 70249: {'lr': 0.0002804486079825574, 'samples': 13487808, 'steps': 70248, 'loss/train': 1.0800868272781372} 11/07/2021 07:06:14 - INFO - __main__ - Step 70250: {'lr': 0.00028044334073822226, 'samples': 13488000, 'steps': 70249, 'loss/train': 1.8516641855239868} 11/07/2021 07:06:14 - INFO - __main__ - Step 70251: {'lr': 0.00028043807348016985, 'samples': 13488192, 'steps': 70250, 'loss/train': 2.1653690338134766} 11/07/2021 07:06:15 - INFO - __main__ - Step 70252: {'lr': 0.00028043280620840245, 'samples': 13488384, 'steps': 70251, 'loss/train': 1.3561185598373413} 11/07/2021 07:06:15 - INFO - __main__ - Step 70253: {'lr': 0.00028042753892292254, 'samples': 13488576, 'steps': 70252, 'loss/train': 1.4564026594161987} 11/07/2021 07:06:16 - INFO - __main__ - Step 70254: {'lr': 0.00028042227162373246, 'samples': 13488768, 'steps': 70253, 'loss/train': 1.4093493223190308} 11/07/2021 07:06:16 - INFO - __main__ - Step 70255: {'lr': 0.0002804170043108345, 'samples': 13488960, 'steps': 70254, 'loss/train': 1.0755025148391724} 11/07/2021 07:06:17 - INFO - __main__ - Step 70256: {'lr': 0.0002804117369842312, 'samples': 13489152, 'steps': 70255, 'loss/train': 1.4661965370178223} 11/07/2021 07:06:17 - INFO - __main__ - Step 70257: {'lr': 0.0002804064696439248, 'samples': 13489344, 'steps': 70256, 'loss/train': 1.3796859979629517} 11/07/2021 07:06:17 - INFO - __main__ - Step 70258: {'lr': 0.00028040120228991773, 'samples': 13489536, 'steps': 70257, 'loss/train': 0.9224424958229065} 11/07/2021 07:06:18 - INFO - __main__ - Step 70259: {'lr': 0.0002803959349222123, 'samples': 13489728, 'steps': 70258, 'loss/train': 1.2996469736099243} 11/07/2021 07:06:19 - INFO - __main__ - Step 70260: {'lr': 0.000280390667540811, 'samples': 13489920, 'steps': 70259, 'loss/train': 1.418015480041504} 11/07/2021 07:06:19 - INFO - __main__ - Step 70261: {'lr': 0.00028038540014571606, 'samples': 13490112, 'steps': 70260, 'loss/train': 1.0798014402389526} 11/07/2021 07:06:19 - INFO - __main__ - Step 70262: {'lr': 0.00028038013273692995, 'samples': 13490304, 'steps': 70261, 'loss/train': 1.4568140506744385} 11/07/2021 07:06:20 - INFO - __main__ - Step 70263: {'lr': 0.00028037486531445503, 'samples': 13490496, 'steps': 70262, 'loss/train': 1.2408024072647095} 11/07/2021 07:06:20 - INFO - __main__ - Step 70264: {'lr': 0.00028036959787829373, 'samples': 13490688, 'steps': 70263, 'loss/train': 1.3893675804138184} 11/07/2021 07:06:21 - INFO - __main__ - Step 70265: {'lr': 0.00028036433042844834, 'samples': 13490880, 'steps': 70264, 'loss/train': 1.453956961631775} 11/07/2021 07:06:22 - INFO - __main__ - Step 70266: {'lr': 0.0002803590629649212, 'samples': 13491072, 'steps': 70265, 'loss/train': 0.9403622150421143} 11/07/2021 07:06:22 - INFO - __main__ - Step 70267: {'lr': 0.0002803537954877147, 'samples': 13491264, 'steps': 70266, 'loss/train': 1.0213634967803955} 11/07/2021 07:06:22 - INFO - __main__ - Step 70268: {'lr': 0.0002803485279968313, 'samples': 13491456, 'steps': 70267, 'loss/train': 1.7717117071151733} 11/07/2021 07:06:23 - INFO - __main__ - Step 70269: {'lr': 0.0002803432604922733, 'samples': 13491648, 'steps': 70268, 'loss/train': 1.3801803588867188} 11/07/2021 07:06:24 - INFO - __main__ - Step 70270: {'lr': 0.00028033799297404313, 'samples': 13491840, 'steps': 70269, 'loss/train': 1.5656094551086426} 11/07/2021 07:06:24 - INFO - __main__ - Step 70271: {'lr': 0.00028033272544214315, 'samples': 13492032, 'steps': 70270, 'loss/train': 1.4306217432022095} 11/07/2021 07:06:24 - INFO - __main__ - Step 70272: {'lr': 0.00028032745789657567, 'samples': 13492224, 'steps': 70271, 'loss/train': 1.375247597694397} 11/07/2021 07:06:25 - INFO - __main__ - Step 70273: {'lr': 0.00028032219033734306, 'samples': 13492416, 'steps': 70272, 'loss/train': 1.7582566738128662} 11/07/2021 07:06:25 - INFO - __main__ - Step 70274: {'lr': 0.0002803169227644478, 'samples': 13492608, 'steps': 70273, 'loss/train': 0.9502387642860413} 11/07/2021 07:06:26 - INFO - __main__ - Step 70275: {'lr': 0.0002803116551778922, 'samples': 13492800, 'steps': 70274, 'loss/train': 1.4886629581451416} 11/07/2021 07:06:26 - INFO - __main__ - Step 70276: {'lr': 0.00028030638757767863, 'samples': 13492992, 'steps': 70275, 'loss/train': 1.293895959854126} 11/07/2021 07:06:27 - INFO - __main__ - Step 70277: {'lr': 0.00028030111996380945, 'samples': 13493184, 'steps': 70276, 'loss/train': 1.1791244745254517} 11/07/2021 07:06:27 - INFO - __main__ - Step 70278: {'lr': 0.00028029585233628707, 'samples': 13493376, 'steps': 70277, 'loss/train': 1.743994116783142} 11/07/2021 07:06:28 - INFO - __main__ - Step 70279: {'lr': 0.0002802905846951139, 'samples': 13493568, 'steps': 70278, 'loss/train': 1.2787588834762573} 11/07/2021 07:06:29 - INFO - __main__ - Step 70280: {'lr': 0.00028028531704029215, 'samples': 13493760, 'steps': 70279, 'loss/train': 1.1556227207183838} 11/07/2021 07:06:29 - INFO - __main__ - Step 70281: {'lr': 0.0002802800493718244, 'samples': 13493952, 'steps': 70280, 'loss/train': 1.3866184949874878} 11/07/2021 07:06:29 - INFO - __main__ - Step 70282: {'lr': 0.0002802747816897128, 'samples': 13494144, 'steps': 70281, 'loss/train': 1.6330546140670776} 11/07/2021 07:06:30 - INFO - __main__ - Step 70283: {'lr': 0.00028026951399395995, 'samples': 13494336, 'steps': 70282, 'loss/train': 1.746026635169983} 11/07/2021 07:06:30 - INFO - __main__ - Step 70284: {'lr': 0.00028026424628456816, 'samples': 13494528, 'steps': 70283, 'loss/train': 1.4556728601455688} 11/07/2021 07:06:31 - INFO - __main__ - Step 70285: {'lr': 0.0002802589785615397, 'samples': 13494720, 'steps': 70284, 'loss/train': 1.1091783046722412} 11/07/2021 07:06:31 - INFO - __main__ - Step 70286: {'lr': 0.00028025371082487704, 'samples': 13494912, 'steps': 70285, 'loss/train': 1.170440912246704} 11/07/2021 07:06:32 - INFO - __main__ - Step 70287: {'lr': 0.00028024844307458253, 'samples': 13495104, 'steps': 70286, 'loss/train': 1.7250044345855713} 11/07/2021 07:06:32 - INFO - __main__ - Step 70288: {'lr': 0.00028024317531065847, 'samples': 13495296, 'steps': 70287, 'loss/train': 1.0308109521865845} 11/07/2021 07:06:32 - INFO - __main__ - Step 70289: {'lr': 0.00028023790753310733, 'samples': 13495488, 'steps': 70288, 'loss/train': 1.43985915184021} 11/07/2021 07:06:34 - INFO - __main__ - Step 70290: {'lr': 0.00028023263974193146, 'samples': 13495680, 'steps': 70289, 'loss/train': 1.392869472503662} 11/07/2021 07:06:34 - INFO - __main__ - Step 70291: {'lr': 0.0002802273719371333, 'samples': 13495872, 'steps': 70290, 'loss/train': 1.5459136962890625} 11/07/2021 07:06:34 - INFO - __main__ - Step 70292: {'lr': 0.0002802221041187151, 'samples': 13496064, 'steps': 70291, 'loss/train': 1.640762448310852} 11/07/2021 07:06:35 - INFO - __main__ - Step 70293: {'lr': 0.0002802168362866793, 'samples': 13496256, 'steps': 70292, 'loss/train': 1.1566588878631592} 11/07/2021 07:06:35 - INFO - __main__ - Step 70294: {'lr': 0.00028021156844102823, 'samples': 13496448, 'steps': 70293, 'loss/train': 1.7080742120742798} 11/07/2021 07:06:36 - INFO - __main__ - Step 70295: {'lr': 0.0002802063005817643, 'samples': 13496640, 'steps': 70294, 'loss/train': 1.4126756191253662} 11/07/2021 07:06:37 - INFO - __main__ - Step 70296: {'lr': 0.00028020103270888995, 'samples': 13496832, 'steps': 70295, 'loss/train': 1.5607787370681763} 11/07/2021 07:06:37 - INFO - __main__ - Step 70297: {'lr': 0.0002801957648224074, 'samples': 13497024, 'steps': 70296, 'loss/train': 1.5557453632354736} 11/07/2021 07:06:37 - INFO - __main__ - Step 70298: {'lr': 0.00028019049692231914, 'samples': 13497216, 'steps': 70297, 'loss/train': 0.9186941981315613} 11/07/2021 07:06:38 - INFO - __main__ - Step 70299: {'lr': 0.00028018522900862745, 'samples': 13497408, 'steps': 70298, 'loss/train': 1.5979396104812622} 11/07/2021 07:06:38 - INFO - __main__ - Step 70300: {'lr': 0.0002801799610813348, 'samples': 13497600, 'steps': 70299, 'loss/train': 1.2920544147491455} 11/07/2021 07:06:40 - INFO - __main__ - Step 70301: {'lr': 0.00028017469314044354, 'samples': 13497792, 'steps': 70300, 'loss/train': 1.5856877565383911} 11/07/2021 07:06:40 - INFO - __main__ - Step 70302: {'lr': 0.0002801694251859561, 'samples': 13497984, 'steps': 70301, 'loss/train': 1.1991963386535645} 11/07/2021 07:06:40 - INFO - __main__ - Step 70303: {'lr': 0.00028016415721787463, 'samples': 13498176, 'steps': 70302, 'loss/train': 1.6064494848251343} 11/07/2021 07:06:41 - INFO - __main__ - Step 70304: {'lr': 0.0002801588892362017, 'samples': 13498368, 'steps': 70303, 'loss/train': 1.7612313032150269} 11/07/2021 07:06:41 - INFO - __main__ - Step 70305: {'lr': 0.00028015362124093966, 'samples': 13498560, 'steps': 70304, 'loss/train': 0.7255281209945679} 11/07/2021 07:06:42 - INFO - __main__ - Step 70306: {'lr': 0.00028014835323209085, 'samples': 13498752, 'steps': 70305, 'loss/train': 1.160844087600708} 11/07/2021 07:06:42 - INFO - __main__ - Step 70307: {'lr': 0.00028014308520965775, 'samples': 13498944, 'steps': 70306, 'loss/train': 0.8887119889259338} 11/07/2021 07:06:43 - INFO - __main__ - Step 70308: {'lr': 0.0002801378171736426, 'samples': 13499136, 'steps': 70307, 'loss/train': 0.7149937152862549} 11/07/2021 07:06:43 - INFO - __main__ - Step 70309: {'lr': 0.0002801325491240477, 'samples': 13499328, 'steps': 70308, 'loss/train': 1.2627724409103394} 11/07/2021 07:06:43 - INFO - __main__ - Step 70310: {'lr': 0.00028012728106087566, 'samples': 13499520, 'steps': 70309, 'loss/train': 1.8421307802200317} 11/07/2021 07:06:44 - INFO - __main__ - Step 70311: {'lr': 0.00028012201298412864, 'samples': 13499712, 'steps': 70310, 'loss/train': 1.8571490049362183} 11/07/2021 07:06:45 - INFO - __main__ - Step 70312: {'lr': 0.00028011674489380925, 'samples': 13499904, 'steps': 70311, 'loss/train': 0.7904812097549438} 11/07/2021 07:06:45 - INFO - __main__ - Step 70313: {'lr': 0.00028011147678991955, 'samples': 13500096, 'steps': 70312, 'loss/train': 1.6256507635116577} 11/07/2021 07:06:45 - INFO - __main__ - Step 70314: {'lr': 0.0002801062086724622, 'samples': 13500288, 'steps': 70313, 'loss/train': 1.4525054693222046} 11/07/2021 07:06:46 - INFO - __main__ - Step 70315: {'lr': 0.00028010094054143936, 'samples': 13500480, 'steps': 70314, 'loss/train': 1.328505277633667} 11/07/2021 07:06:47 - INFO - __main__ - Step 70316: {'lr': 0.0002800956723968536, 'samples': 13500672, 'steps': 70315, 'loss/train': 1.23360276222229} 11/07/2021 07:06:47 - INFO - __main__ - Step 70317: {'lr': 0.0002800904042387071, 'samples': 13500864, 'steps': 70316, 'loss/train': 1.0548830032348633} 11/07/2021 07:06:48 - INFO - __main__ - Step 70318: {'lr': 0.0002800851360670024, 'samples': 13501056, 'steps': 70317, 'loss/train': 0.739326000213623} 11/07/2021 07:06:48 - INFO - __main__ - Step 70319: {'lr': 0.0002800798678817418, 'samples': 13501248, 'steps': 70318, 'loss/train': 1.3868708610534668} 11/07/2021 07:06:48 - INFO - __main__ - Step 70320: {'lr': 0.00028007459968292767, 'samples': 13501440, 'steps': 70319, 'loss/train': 1.8774471282958984} 11/07/2021 07:06:49 - INFO - __main__ - Step 70321: {'lr': 0.00028006933147056235, 'samples': 13501632, 'steps': 70320, 'loss/train': 1.7449485063552856} 11/07/2021 07:06:50 - INFO - __main__ - Step 70322: {'lr': 0.0002800640632446483, 'samples': 13501824, 'steps': 70321, 'loss/train': 1.392042875289917} 11/07/2021 07:06:50 - INFO - __main__ - Step 70323: {'lr': 0.00028005879500518784, 'samples': 13502016, 'steps': 70322, 'loss/train': 1.6802746057510376} 11/07/2021 07:06:50 - INFO - __main__ - Step 70324: {'lr': 0.00028005352675218337, 'samples': 13502208, 'steps': 70323, 'loss/train': 1.8649696111679077} 11/07/2021 07:06:51 - INFO - __main__ - Step 70325: {'lr': 0.0002800482584856372, 'samples': 13502400, 'steps': 70324, 'loss/train': 1.7290153503417969} 11/07/2021 07:06:52 - INFO - __main__ - Step 70326: {'lr': 0.00028004299020555176, 'samples': 13502592, 'steps': 70325, 'loss/train': 1.550599455833435} 11/07/2021 07:06:52 - INFO - __main__ - Step 70327: {'lr': 0.0002800377219119294, 'samples': 13502784, 'steps': 70326, 'loss/train': 1.6243053674697876} 11/07/2021 07:06:52 - INFO - __main__ - Step 70328: {'lr': 0.0002800324536047725, 'samples': 13502976, 'steps': 70327, 'loss/train': 1.6143710613250732} 11/07/2021 07:06:53 - INFO - __main__ - Step 70329: {'lr': 0.00028002718528408345, 'samples': 13503168, 'steps': 70328, 'loss/train': 1.5967457294464111} 11/07/2021 07:06:53 - INFO - __main__ - Step 70330: {'lr': 0.0002800219169498646, 'samples': 13503360, 'steps': 70329, 'loss/train': 4.237258434295654} 11/07/2021 07:06:54 - INFO - __main__ - Step 70331: {'lr': 0.0002800166486021184, 'samples': 13503552, 'steps': 70330, 'loss/train': 1.7117185592651367} 11/07/2021 07:06:55 - INFO - __main__ - Step 70332: {'lr': 0.0002800113802408471, 'samples': 13503744, 'steps': 70331, 'loss/train': 1.4682611227035522} 11/07/2021 07:06:55 - INFO - __main__ - Step 70333: {'lr': 0.00028000611186605317, 'samples': 13503936, 'steps': 70332, 'loss/train': 1.4599443674087524} 11/07/2021 07:06:55 - INFO - __main__ - Step 70334: {'lr': 0.0002800008434777389, 'samples': 13504128, 'steps': 70333, 'loss/train': 0.9214239716529846} 11/07/2021 07:06:56 - INFO - __main__ - Step 70335: {'lr': 0.00027999557507590677, 'samples': 13504320, 'steps': 70334, 'loss/train': 1.3970521688461304} 11/07/2021 07:06:56 - INFO - __main__ - Step 70336: {'lr': 0.00027999030666055907, 'samples': 13504512, 'steps': 70335, 'loss/train': 1.5850107669830322} 11/07/2021 07:06:56 - INFO - __main__ - Step 70337: {'lr': 0.0002799850382316982, 'samples': 13504704, 'steps': 70336, 'loss/train': 5.754030227661133} 11/07/2021 07:06:57 - INFO - __main__ - Step 70338: {'lr': 0.0002799797697893266, 'samples': 13504896, 'steps': 70337, 'loss/train': 1.3720394372940063} 11/07/2021 07:06:58 - INFO - __main__ - Step 70339: {'lr': 0.0002799745013334465, 'samples': 13505088, 'steps': 70338, 'loss/train': 1.55092191696167} 11/07/2021 07:06:58 - INFO - __main__ - Step 70340: {'lr': 0.00027996923286406037, 'samples': 13505280, 'steps': 70339, 'loss/train': 1.727339506149292} 11/07/2021 07:06:59 - INFO - __main__ - Step 70341: {'lr': 0.00027996396438117056, 'samples': 13505472, 'steps': 70340, 'loss/train': 1.367047667503357} 11/07/2021 07:06:59 - INFO - __main__ - Step 70342: {'lr': 0.0002799586958847794, 'samples': 13505664, 'steps': 70341, 'loss/train': 1.6188994646072388} 11/07/2021 07:07:00 - INFO - __main__ - Step 70343: {'lr': 0.0002799534273748894, 'samples': 13505856, 'steps': 70342, 'loss/train': 1.3429912328720093} 11/07/2021 07:07:00 - INFO - __main__ - Step 70344: {'lr': 0.00027994815885150283, 'samples': 13506048, 'steps': 70343, 'loss/train': 1.1607600450515747} 11/07/2021 07:07:01 - INFO - __main__ - Step 70345: {'lr': 0.00027994289031462203, 'samples': 13506240, 'steps': 70344, 'loss/train': 0.9097505211830139} 11/07/2021 07:07:01 - INFO - __main__ - Step 70346: {'lr': 0.00027993762176424953, 'samples': 13506432, 'steps': 70345, 'loss/train': 0.9877601861953735} 11/07/2021 07:07:01 - INFO - __main__ - Step 70347: {'lr': 0.0002799323532003875, 'samples': 13506624, 'steps': 70346, 'loss/train': 1.228316307067871} 11/07/2021 07:07:02 - INFO - __main__ - Step 70348: {'lr': 0.00027992708462303847, 'samples': 13506816, 'steps': 70347, 'loss/train': 1.1627976894378662} 11/07/2021 07:07:03 - INFO - __main__ - Step 70349: {'lr': 0.0002799218160322047, 'samples': 13507008, 'steps': 70348, 'loss/train': 1.0100388526916504} 11/07/2021 07:07:03 - INFO - __main__ - Step 70350: {'lr': 0.0002799165474278886, 'samples': 13507200, 'steps': 70349, 'loss/train': 1.3232061862945557} 11/07/2021 07:07:03 - INFO - __main__ - Step 70351: {'lr': 0.0002799112788100927, 'samples': 13507392, 'steps': 70350, 'loss/train': 1.0962305068969727} 11/07/2021 07:07:04 - INFO - __main__ - Step 70352: {'lr': 0.00027990601017881917, 'samples': 13507584, 'steps': 70351, 'loss/train': 1.0376815795898438} 11/07/2021 07:07:04 - INFO - __main__ - Step 70353: {'lr': 0.00027990074153407045, 'samples': 13507776, 'steps': 70352, 'loss/train': 1.3145564794540405} 11/07/2021 07:07:05 - INFO - __main__ - Step 70354: {'lr': 0.0002798954728758489, 'samples': 13507968, 'steps': 70353, 'loss/train': 1.441982388496399} 11/07/2021 07:07:05 - INFO - __main__ - Step 70355: {'lr': 0.00027989020420415687, 'samples': 13508160, 'steps': 70354, 'loss/train': 1.7514740228652954} 11/07/2021 07:07:06 - INFO - __main__ - Step 70356: {'lr': 0.00027988493551899684, 'samples': 13508352, 'steps': 70355, 'loss/train': 0.7540959119796753} 11/07/2021 07:07:06 - INFO - __main__ - Step 70357: {'lr': 0.00027987966682037113, 'samples': 13508544, 'steps': 70356, 'loss/train': 1.2666826248168945} 11/07/2021 07:07:07 - INFO - __main__ - Step 70358: {'lr': 0.0002798743981082821, 'samples': 13508736, 'steps': 70357, 'loss/train': 1.547939419746399} 11/07/2021 07:07:08 - INFO - __main__ - Step 70359: {'lr': 0.00027986912938273215, 'samples': 13508928, 'steps': 70358, 'loss/train': 1.269587516784668} 11/07/2021 07:07:08 - INFO - __main__ - Step 70360: {'lr': 0.00027986386064372354, 'samples': 13509120, 'steps': 70359, 'loss/train': 1.5885868072509766} 11/07/2021 07:07:08 - INFO - __main__ - Step 70361: {'lr': 0.0002798585918912588, 'samples': 13509312, 'steps': 70360, 'loss/train': 1.5522528886795044} 11/07/2021 07:07:09 - INFO - __main__ - Step 70362: {'lr': 0.0002798533231253402, 'samples': 13509504, 'steps': 70361, 'loss/train': 1.4097520112991333} 11/07/2021 07:07:09 - INFO - __main__ - Step 70363: {'lr': 0.0002798480543459702, 'samples': 13509696, 'steps': 70362, 'loss/train': 1.4602395296096802} 11/07/2021 07:07:10 - INFO - __main__ - Step 70364: {'lr': 0.00027984278555315105, 'samples': 13509888, 'steps': 70363, 'loss/train': 0.9544743895530701} 11/07/2021 07:07:10 - INFO - __main__ - Step 70365: {'lr': 0.00027983751674688536, 'samples': 13510080, 'steps': 70364, 'loss/train': 0.9736876487731934} 11/07/2021 07:07:11 - INFO - __main__ - Step 70366: {'lr': 0.0002798322479271752, 'samples': 13510272, 'steps': 70365, 'loss/train': 1.866879940032959} 11/07/2021 07:07:11 - INFO - __main__ - Step 70367: {'lr': 0.0002798269790940231, 'samples': 13510464, 'steps': 70366, 'loss/train': 1.0582116842269897} 11/07/2021 07:07:11 - INFO - __main__ - Step 70368: {'lr': 0.0002798217102474315, 'samples': 13510656, 'steps': 70367, 'loss/train': 1.048388123512268} 11/07/2021 07:07:12 - INFO - __main__ - Step 70369: {'lr': 0.00027981644138740265, 'samples': 13510848, 'steps': 70368, 'loss/train': 1.5402790307998657} 11/07/2021 07:07:13 - INFO - __main__ - Step 70370: {'lr': 0.00027981117251393893, 'samples': 13511040, 'steps': 70369, 'loss/train': 1.5775346755981445} 11/07/2021 07:07:13 - INFO - __main__ - Step 70371: {'lr': 0.00027980590362704276, 'samples': 13511232, 'steps': 70370, 'loss/train': 1.2097922563552856} 11/07/2021 07:07:13 - INFO - __main__ - Step 70372: {'lr': 0.00027980063472671663, 'samples': 13511424, 'steps': 70371, 'loss/train': 1.3351976871490479} 11/07/2021 07:07:14 - INFO - __main__ - Step 70373: {'lr': 0.0002797953658129627, 'samples': 13511616, 'steps': 70372, 'loss/train': 1.6422972679138184} 11/07/2021 07:07:15 - INFO - __main__ - Step 70374: {'lr': 0.00027979009688578344, 'samples': 13511808, 'steps': 70373, 'loss/train': 1.2559103965759277} 11/07/2021 07:07:15 - INFO - __main__ - Step 70375: {'lr': 0.0002797848279451812, 'samples': 13512000, 'steps': 70374, 'loss/train': 1.7757420539855957} 11/07/2021 07:07:16 - INFO - __main__ - Step 70376: {'lr': 0.00027977955899115845, 'samples': 13512192, 'steps': 70375, 'loss/train': 1.0861963033676147} 11/07/2021 07:07:16 - INFO - __main__ - Step 70377: {'lr': 0.00027977429002371744, 'samples': 13512384, 'steps': 70376, 'loss/train': 1.4246472120285034} 11/07/2021 07:07:16 - INFO - __main__ - Step 70378: {'lr': 0.0002797690210428606, 'samples': 13512576, 'steps': 70377, 'loss/train': 1.3178174495697021} 11/07/2021 07:07:17 - INFO - __main__ - Step 70379: {'lr': 0.0002797637520485903, 'samples': 13512768, 'steps': 70378, 'loss/train': 1.4196118116378784} 11/07/2021 07:07:18 - INFO - __main__ - Step 70380: {'lr': 0.00027975848304090894, 'samples': 13512960, 'steps': 70379, 'loss/train': 1.2504663467407227} 11/07/2021 07:07:18 - INFO - __main__ - Step 70381: {'lr': 0.00027975321401981884, 'samples': 13513152, 'steps': 70380, 'loss/train': 1.689044713973999} 11/07/2021 07:07:18 - INFO - __main__ - Step 70382: {'lr': 0.0002797479449853224, 'samples': 13513344, 'steps': 70381, 'loss/train': 1.3685611486434937} 11/07/2021 07:07:19 - INFO - __main__ - Step 70383: {'lr': 0.00027974267593742195, 'samples': 13513536, 'steps': 70382, 'loss/train': 1.230566143989563} 11/07/2021 07:07:19 - INFO - __main__ - Step 70384: {'lr': 0.00027973740687612, 'samples': 13513728, 'steps': 70383, 'loss/train': 1.2814925909042358} 11/07/2021 07:07:20 - INFO - __main__ - Step 70385: {'lr': 0.0002797321378014188, 'samples': 13513920, 'steps': 70384, 'loss/train': 1.1262083053588867} 11/07/2021 07:07:20 - INFO - __main__ - Step 70386: {'lr': 0.00027972686871332073, 'samples': 13514112, 'steps': 70385, 'loss/train': 1.789111852645874} 11/07/2021 07:07:21 - INFO - __main__ - Step 70387: {'lr': 0.00027972159961182826, 'samples': 13514304, 'steps': 70386, 'loss/train': 1.5203688144683838} 11/07/2021 07:07:21 - INFO - __main__ - Step 70388: {'lr': 0.0002797163304969436, 'samples': 13514496, 'steps': 70387, 'loss/train': 1.4223005771636963} 11/07/2021 07:07:21 - INFO - __main__ - Step 70389: {'lr': 0.00027971106136866924, 'samples': 13514688, 'steps': 70388, 'loss/train': 1.237849235534668} 11/07/2021 07:07:23 - INFO - __main__ - Step 70390: {'lr': 0.00027970579222700757, 'samples': 13514880, 'steps': 70389, 'loss/train': 1.5156159400939941} 11/07/2021 07:07:23 - INFO - __main__ - Step 70391: {'lr': 0.00027970052307196093, 'samples': 13515072, 'steps': 70390, 'loss/train': 0.4500690698623657} 11/07/2021 07:07:23 - INFO - __main__ - Step 70392: {'lr': 0.0002796952539035317, 'samples': 13515264, 'steps': 70391, 'loss/train': 1.4815627336502075} 11/07/2021 07:07:24 - INFO - __main__ - Step 70393: {'lr': 0.00027968998472172225, 'samples': 13515456, 'steps': 70392, 'loss/train': 2.062229633331299} 11/07/2021 07:07:24 - INFO - __main__ - Step 70394: {'lr': 0.00027968471552653493, 'samples': 13515648, 'steps': 70393, 'loss/train': 1.6824358701705933} 11/07/2021 07:07:25 - INFO - __main__ - Step 70395: {'lr': 0.00027967944631797207, 'samples': 13515840, 'steps': 70394, 'loss/train': 1.2586818933486938} 11/07/2021 07:07:25 - INFO - __main__ - Step 70396: {'lr': 0.00027967417709603623, 'samples': 13516032, 'steps': 70395, 'loss/train': 1.2680681943893433} 11/07/2021 07:07:26 - INFO - __main__ - Step 70397: {'lr': 0.0002796689078607296, 'samples': 13516224, 'steps': 70396, 'loss/train': 1.2495261430740356} 11/07/2021 07:07:26 - INFO - __main__ - Step 70398: {'lr': 0.0002796636386120546, 'samples': 13516416, 'steps': 70397, 'loss/train': 1.6894919872283936} 11/07/2021 07:07:26 - INFO - __main__ - Step 70399: {'lr': 0.00027965836935001364, 'samples': 13516608, 'steps': 70398, 'loss/train': 1.3067550659179688} 11/07/2021 07:07:27 - INFO - __main__ - Step 70400: {'lr': 0.00027965310007460907, 'samples': 13516800, 'steps': 70399, 'loss/train': 0.82297682762146} 11/07/2021 07:07:28 - INFO - __main__ - Step 70401: {'lr': 0.00027964783078584333, 'samples': 13516992, 'steps': 70400, 'loss/train': 0.6083166003227234} 11/07/2021 07:07:28 - INFO - __main__ - Step 70402: {'lr': 0.00027964256148371865, 'samples': 13517184, 'steps': 70401, 'loss/train': 1.5618491172790527} 11/07/2021 07:07:28 - INFO - __main__ - Step 70403: {'lr': 0.0002796372921682375, 'samples': 13517376, 'steps': 70402, 'loss/train': 0.925121009349823} 11/07/2021 07:07:29 - INFO - __main__ - Step 70404: {'lr': 0.00027963202283940233, 'samples': 13517568, 'steps': 70403, 'loss/train': 1.1370536088943481} 11/07/2021 07:07:29 - INFO - __main__ - Step 70405: {'lr': 0.0002796267534972154, 'samples': 13517760, 'steps': 70404, 'loss/train': 1.365068793296814} 11/07/2021 07:07:30 - INFO - __main__ - Step 70406: {'lr': 0.00027962148414167903, 'samples': 13517952, 'steps': 70405, 'loss/train': 1.5085047483444214} 11/07/2021 07:07:30 - INFO - __main__ - Step 70407: {'lr': 0.00027961621477279574, 'samples': 13518144, 'steps': 70406, 'loss/train': 1.3375440835952759} 11/07/2021 07:07:31 - INFO - __main__ - Step 70408: {'lr': 0.0002796109453905678, 'samples': 13518336, 'steps': 70407, 'loss/train': 1.0572898387908936} 11/07/2021 07:07:31 - INFO - __main__ - Step 70409: {'lr': 0.00027960567599499765, 'samples': 13518528, 'steps': 70408, 'loss/train': 1.6353636980056763} 11/07/2021 07:07:31 - INFO - __main__ - Step 70410: {'lr': 0.0002796004065860876, 'samples': 13518720, 'steps': 70409, 'loss/train': 1.484983205795288} 11/07/2021 07:07:32 - INFO - __main__ - Step 70411: {'lr': 0.0002795951371638402, 'samples': 13518912, 'steps': 70410, 'loss/train': 0.7977431416511536} 11/07/2021 07:07:33 - INFO - __main__ - Step 70412: {'lr': 0.0002795898677282576, 'samples': 13519104, 'steps': 70411, 'loss/train': 1.5254511833190918} 11/07/2021 07:07:33 - INFO - __main__ - Step 70413: {'lr': 0.00027958459827934223, 'samples': 13519296, 'steps': 70412, 'loss/train': 1.3187263011932373} 11/07/2021 07:07:33 - INFO - __main__ - Step 70414: {'lr': 0.0002795793288170965, 'samples': 13519488, 'steps': 70413, 'loss/train': 1.2080920934677124} 11/07/2021 07:07:34 - INFO - __main__ - Step 70415: {'lr': 0.0002795740593415228, 'samples': 13519680, 'steps': 70414, 'loss/train': 1.4325064420700073} 11/07/2021 07:07:35 - INFO - __main__ - Step 70416: {'lr': 0.0002795687898526235, 'samples': 13519872, 'steps': 70415, 'loss/train': 1.423401117324829} 11/07/2021 07:07:35 - INFO - __main__ - Step 70417: {'lr': 0.00027956352035040093, 'samples': 13520064, 'steps': 70416, 'loss/train': 1.3092477321624756} 11/07/2021 07:07:36 - INFO - __main__ - Step 70418: {'lr': 0.0002795582508348575, 'samples': 13520256, 'steps': 70417, 'loss/train': 1.5092692375183105} 11/07/2021 07:07:36 - INFO - __main__ - Step 70419: {'lr': 0.00027955298130599563, 'samples': 13520448, 'steps': 70418, 'loss/train': 1.330193281173706} 11/07/2021 07:07:36 - INFO - __main__ - Step 70420: {'lr': 0.0002795477117638176, 'samples': 13520640, 'steps': 70419, 'loss/train': 1.413239598274231} 11/07/2021 07:07:37 - INFO - __main__ - Step 70421: {'lr': 0.0002795424422083258, 'samples': 13520832, 'steps': 70420, 'loss/train': 0.4052659273147583} 11/07/2021 07:07:38 - INFO - __main__ - Step 70422: {'lr': 0.0002795371726395227, 'samples': 13521024, 'steps': 70421, 'loss/train': 1.4284462928771973} 11/07/2021 07:07:38 - INFO - __main__ - Step 70423: {'lr': 0.00027953190305741055, 'samples': 13521216, 'steps': 70422, 'loss/train': 1.1820026636123657} 11/07/2021 07:07:38 - INFO - __main__ - Step 70424: {'lr': 0.0002795266334619918, 'samples': 13521408, 'steps': 70423, 'loss/train': 2.0019805431365967} 11/07/2021 07:07:39 - INFO - __main__ - Step 70425: {'lr': 0.0002795213638532688, 'samples': 13521600, 'steps': 70424, 'loss/train': 1.3599473237991333} 11/07/2021 07:07:40 - INFO - __main__ - Step 70426: {'lr': 0.00027951609423124395, 'samples': 13521792, 'steps': 70425, 'loss/train': 1.5385489463806152} 11/07/2021 07:07:40 - INFO - __main__ - Step 70427: {'lr': 0.0002795108245959196, 'samples': 13521984, 'steps': 70426, 'loss/train': 1.5591480731964111} 11/07/2021 07:07:41 - INFO - __main__ - Step 70428: {'lr': 0.00027950555494729806, 'samples': 13522176, 'steps': 70427, 'loss/train': 1.5677794218063354} 11/07/2021 07:07:41 - INFO - __main__ - Step 70429: {'lr': 0.00027950028528538187, 'samples': 13522368, 'steps': 70428, 'loss/train': 1.1968350410461426} 11/07/2021 07:07:41 - INFO - __main__ - Step 70430: {'lr': 0.00027949501561017325, 'samples': 13522560, 'steps': 70429, 'loss/train': 1.2059558629989624} 11/07/2021 07:07:42 - INFO - __main__ - Step 70431: {'lr': 0.00027948974592167464, 'samples': 13522752, 'steps': 70430, 'loss/train': 1.4207218885421753} 11/07/2021 07:07:43 - INFO - __main__ - Step 70432: {'lr': 0.00027948447621988843, 'samples': 13522944, 'steps': 70431, 'loss/train': 1.4002174139022827} 11/07/2021 07:07:43 - INFO - __main__ - Step 70433: {'lr': 0.00027947920650481695, 'samples': 13523136, 'steps': 70432, 'loss/train': 1.2926979064941406} 11/07/2021 07:07:43 - INFO - __main__ - Step 70434: {'lr': 0.0002794739367764626, 'samples': 13523328, 'steps': 70433, 'loss/train': 1.056814432144165} 11/07/2021 07:07:44 - INFO - __main__ - Step 70435: {'lr': 0.0002794686670348277, 'samples': 13523520, 'steps': 70434, 'loss/train': 0.5713711977005005} 11/07/2021 07:07:44 - INFO - __main__ - Step 70436: {'lr': 0.0002794633972799148, 'samples': 13523712, 'steps': 70435, 'loss/train': 1.481515645980835} 11/07/2021 07:07:45 - INFO - __main__ - Step 70437: {'lr': 0.000279458127511726, 'samples': 13523904, 'steps': 70436, 'loss/train': 1.535004734992981} 11/07/2021 07:07:45 - INFO - __main__ - Step 70438: {'lr': 0.0002794528577302639, 'samples': 13524096, 'steps': 70437, 'loss/train': 1.4705102443695068} 11/07/2021 07:07:46 - INFO - __main__ - Step 70439: {'lr': 0.00027944758793553077, 'samples': 13524288, 'steps': 70438, 'loss/train': 1.5025629997253418} 11/07/2021 07:07:46 - INFO - __main__ - Step 70440: {'lr': 0.000279442318127529, 'samples': 13524480, 'steps': 70439, 'loss/train': 1.378456473350525} 11/07/2021 07:07:46 - INFO - __main__ - Step 70441: {'lr': 0.00027943704830626107, 'samples': 13524672, 'steps': 70440, 'loss/train': 1.7826459407806396} 11/07/2021 07:07:47 - INFO - __main__ - Step 70442: {'lr': 0.0002794317784717292, 'samples': 13524864, 'steps': 70441, 'loss/train': 1.7617183923721313} 11/07/2021 07:07:48 - INFO - __main__ - Step 70443: {'lr': 0.00027942650862393577, 'samples': 13525056, 'steps': 70442, 'loss/train': 1.376859426498413} 11/07/2021 07:07:48 - INFO - __main__ - Step 70444: {'lr': 0.0002794212387628833, 'samples': 13525248, 'steps': 70443, 'loss/train': 1.5183874368667603} 11/07/2021 07:07:48 - INFO - __main__ - Step 70445: {'lr': 0.00027941596888857395, 'samples': 13525440, 'steps': 70444, 'loss/train': 1.3314268589019775} 11/07/2021 07:07:49 - INFO - __main__ - Step 70446: {'lr': 0.0002794106990010103, 'samples': 13525632, 'steps': 70445, 'loss/train': 0.9634039402008057} 11/07/2021 07:07:50 - INFO - __main__ - Step 70447: {'lr': 0.00027940542910019465, 'samples': 13525824, 'steps': 70446, 'loss/train': 1.162543773651123} 11/07/2021 07:07:50 - INFO - __main__ - Step 70448: {'lr': 0.00027940015918612935, 'samples': 13526016, 'steps': 70447, 'loss/train': 1.6828947067260742} 11/07/2021 07:07:51 - INFO - __main__ - Step 70449: {'lr': 0.00027939488925881684, 'samples': 13526208, 'steps': 70448, 'loss/train': 1.3881641626358032} 11/07/2021 07:07:51 - INFO - __main__ - Step 70450: {'lr': 0.0002793896193182594, 'samples': 13526400, 'steps': 70449, 'loss/train': 0.7258246541023254} 11/07/2021 07:07:51 - INFO - __main__ - Step 70451: {'lr': 0.00027938434936445943, 'samples': 13526592, 'steps': 70450, 'loss/train': 1.7958176136016846} 11/07/2021 07:07:52 - INFO - __main__ - Step 70452: {'lr': 0.0002793790793974194, 'samples': 13526784, 'steps': 70451, 'loss/train': 0.7885095477104187} 11/07/2021 07:07:53 - INFO - __main__ - Step 70453: {'lr': 0.0002793738094171415, 'samples': 13526976, 'steps': 70452, 'loss/train': 1.1944618225097656} 11/07/2021 07:07:53 - INFO - __main__ - Step 70454: {'lr': 0.0002793685394236283, 'samples': 13527168, 'steps': 70453, 'loss/train': 1.631430983543396} 11/07/2021 07:07:53 - INFO - __main__ - Step 70455: {'lr': 0.00027936326941688206, 'samples': 13527360, 'steps': 70454, 'loss/train': 1.1770615577697754} 11/07/2021 07:07:54 - INFO - __main__ - Step 70456: {'lr': 0.00027935799939690523, 'samples': 13527552, 'steps': 70455, 'loss/train': 1.5877102613449097} 11/07/2021 07:07:54 - INFO - __main__ - Step 70457: {'lr': 0.00027935272936370004, 'samples': 13527744, 'steps': 70456, 'loss/train': 1.9822065830230713} 11/07/2021 07:07:55 - INFO - __main__ - Step 70458: {'lr': 0.000279347459317269, 'samples': 13527936, 'steps': 70457, 'loss/train': 1.4630907773971558} 11/07/2021 07:07:55 - INFO - __main__ - Step 70459: {'lr': 0.00027934218925761454, 'samples': 13528128, 'steps': 70458, 'loss/train': 1.3865855932235718} 11/07/2021 07:07:56 - INFO - __main__ - Step 70460: {'lr': 0.00027933691918473883, 'samples': 13528320, 'steps': 70459, 'loss/train': 1.6402121782302856} 11/07/2021 07:07:56 - INFO - __main__ - Step 70461: {'lr': 0.0002793316490986444, 'samples': 13528512, 'steps': 70460, 'loss/train': 1.381775975227356} 11/07/2021 07:07:57 - INFO - __main__ - Step 70462: {'lr': 0.0002793263789993336, 'samples': 13528704, 'steps': 70461, 'loss/train': 1.404442310333252} 11/07/2021 07:07:58 - INFO - __main__ - Step 70463: {'lr': 0.0002793211088868087, 'samples': 13528896, 'steps': 70462, 'loss/train': 0.9728654026985168} 11/07/2021 07:07:58 - INFO - __main__ - Step 70464: {'lr': 0.00027931583876107224, 'samples': 13529088, 'steps': 70463, 'loss/train': 1.5062602758407593} 11/07/2021 07:07:58 - INFO - __main__ - Step 70465: {'lr': 0.0002793105686221265, 'samples': 13529280, 'steps': 70464, 'loss/train': 1.8648451566696167} 11/07/2021 07:07:59 - INFO - __main__ - Step 70466: {'lr': 0.0002793052984699739, 'samples': 13529472, 'steps': 70465, 'loss/train': 1.176483392715454} 11/07/2021 07:07:59 - INFO - __main__ - Step 70467: {'lr': 0.00027930002830461675, 'samples': 13529664, 'steps': 70466, 'loss/train': 1.7008590698242188} 11/07/2021 07:08:00 - INFO - __main__ - Step 70468: {'lr': 0.00027929475812605746, 'samples': 13529856, 'steps': 70467, 'loss/train': 1.3853195905685425} 11/07/2021 07:08:00 - INFO - __main__ - Step 70469: {'lr': 0.00027928948793429844, 'samples': 13530048, 'steps': 70468, 'loss/train': 0.7541078329086304} 11/07/2021 07:08:01 - INFO - __main__ - Step 70470: {'lr': 0.00027928421772934197, 'samples': 13530240, 'steps': 70469, 'loss/train': 1.2332431077957153} 11/07/2021 07:08:01 - INFO - __main__ - Step 70471: {'lr': 0.00027927894751119054, 'samples': 13530432, 'steps': 70470, 'loss/train': 5.753940582275391} 11/07/2021 07:08:02 - INFO - __main__ - Step 70472: {'lr': 0.0002792736772798465, 'samples': 13530624, 'steps': 70471, 'loss/train': 1.3490321636199951} 11/07/2021 07:08:02 - INFO - __main__ - Step 70473: {'lr': 0.0002792684070353121, 'samples': 13530816, 'steps': 70472, 'loss/train': 1.4099597930908203} 11/07/2021 07:08:03 - INFO - __main__ - Step 70474: {'lr': 0.0002792631367775898, 'samples': 13531008, 'steps': 70473, 'loss/train': 1.7529820203781128} 11/07/2021 07:08:03 - INFO - __main__ - Step 70475: {'lr': 0.00027925786650668204, 'samples': 13531200, 'steps': 70474, 'loss/train': 1.1137138605117798} 11/07/2021 07:08:04 - INFO - __main__ - Step 70476: {'lr': 0.00027925259622259106, 'samples': 13531392, 'steps': 70475, 'loss/train': 1.54108464717865} 11/07/2021 07:08:04 - INFO - __main__ - Step 70477: {'lr': 0.00027924732592531944, 'samples': 13531584, 'steps': 70476, 'loss/train': 1.3991305828094482} 11/07/2021 07:08:04 - INFO - __main__ - Step 70478: {'lr': 0.00027924205561486934, 'samples': 13531776, 'steps': 70477, 'loss/train': 1.5062934160232544} 11/07/2021 07:08:05 - INFO - __main__ - Step 70479: {'lr': 0.00027923678529124325, 'samples': 13531968, 'steps': 70478, 'loss/train': 1.2469862699508667} 11/07/2021 07:08:06 - INFO - __main__ - Step 70480: {'lr': 0.00027923151495444346, 'samples': 13532160, 'steps': 70479, 'loss/train': 1.0205128192901611} 11/07/2021 07:08:06 - INFO - __main__ - Step 70481: {'lr': 0.0002792262446044725, 'samples': 13532352, 'steps': 70480, 'loss/train': 1.6164311170578003} 11/07/2021 07:08:06 - INFO - __main__ - Step 70482: {'lr': 0.0002792209742413325, 'samples': 13532544, 'steps': 70481, 'loss/train': 1.3211551904678345} 11/07/2021 07:08:07 - INFO - __main__ - Step 70483: {'lr': 0.0002792157038650261, 'samples': 13532736, 'steps': 70482, 'loss/train': 1.8448338508605957} 11/07/2021 07:08:08 - INFO - __main__ - Step 70484: {'lr': 0.00027921043347555553, 'samples': 13532928, 'steps': 70483, 'loss/train': 1.644320011138916} 11/07/2021 07:08:08 - INFO - __main__ - Step 70485: {'lr': 0.00027920516307292315, 'samples': 13533120, 'steps': 70484, 'loss/train': 0.9643008708953857} 11/07/2021 07:08:08 - INFO - __main__ - Step 70486: {'lr': 0.0002791998926571315, 'samples': 13533312, 'steps': 70485, 'loss/train': 1.4782222509384155} 11/07/2021 07:08:09 - INFO - __main__ - Step 70487: {'lr': 0.0002791946222281827, 'samples': 13533504, 'steps': 70486, 'loss/train': 1.3210663795471191} 11/07/2021 07:08:09 - INFO - __main__ - Step 70488: {'lr': 0.00027918935178607927, 'samples': 13533696, 'steps': 70487, 'loss/train': 1.6522035598754883} 11/07/2021 07:08:10 - INFO - __main__ - Step 70489: {'lr': 0.00027918408133082356, 'samples': 13533888, 'steps': 70488, 'loss/train': 1.4040849208831787} 11/07/2021 07:08:10 - INFO - __main__ - Step 70490: {'lr': 0.00027917881086241805, 'samples': 13534080, 'steps': 70489, 'loss/train': 0.9939002394676208} 11/07/2021 07:08:11 - INFO - __main__ - Step 70491: {'lr': 0.0002791735403808649, 'samples': 13534272, 'steps': 70490, 'loss/train': 1.3906009197235107} 11/07/2021 07:08:11 - INFO - __main__ - Step 70492: {'lr': 0.00027916826988616663, 'samples': 13534464, 'steps': 70491, 'loss/train': 1.103922963142395} 11/07/2021 07:08:11 - INFO - __main__ - Step 70493: {'lr': 0.00027916299937832565, 'samples': 13534656, 'steps': 70492, 'loss/train': 1.6441738605499268} 11/07/2021 07:08:13 - INFO - __main__ - Step 70494: {'lr': 0.00027915772885734425, 'samples': 13534848, 'steps': 70493, 'loss/train': 1.7177938222885132} 11/07/2021 07:08:13 - INFO - __main__ - Step 70495: {'lr': 0.00027915245832322476, 'samples': 13535040, 'steps': 70494, 'loss/train': 1.1700923442840576} 11/07/2021 07:08:14 - INFO - __main__ - Step 70496: {'lr': 0.0002791471877759697, 'samples': 13535232, 'steps': 70495, 'loss/train': 0.40609949827194214} 11/07/2021 07:08:14 - INFO - __main__ - Step 70497: {'lr': 0.0002791419172155814, 'samples': 13535424, 'steps': 70496, 'loss/train': 1.763586163520813} 11/07/2021 07:08:14 - INFO - __main__ - Step 70498: {'lr': 0.0002791366466420621, 'samples': 13535616, 'steps': 70497, 'loss/train': 1.3403496742248535} 11/07/2021 07:08:15 - INFO - __main__ - Step 70499: {'lr': 0.00027913137605541436, 'samples': 13535808, 'steps': 70498, 'loss/train': 1.4544827938079834} 11/07/2021 07:08:16 - INFO - __main__ - Step 70500: {'lr': 0.00027912610545564035, 'samples': 13536000, 'steps': 70499, 'loss/train': 1.457692265510559} 11/07/2021 07:08:16 - INFO - __main__ - Step 70501: {'lr': 0.0002791208348427426, 'samples': 13536192, 'steps': 70500, 'loss/train': 1.083436131477356} 11/07/2021 07:08:16 - INFO - __main__ - Step 70502: {'lr': 0.00027911556421672355, 'samples': 13536384, 'steps': 70501, 'loss/train': 0.28915491700172424} 11/07/2021 07:08:17 - INFO - __main__ - Step 70503: {'lr': 0.0002791102935775854, 'samples': 13536576, 'steps': 70502, 'loss/train': 1.5870354175567627} 11/07/2021 07:08:17 - INFO - __main__ - Step 70504: {'lr': 0.0002791050229253306, 'samples': 13536768, 'steps': 70503, 'loss/train': 1.4620076417922974} 11/07/2021 07:08:18 - INFO - __main__ - Step 70505: {'lr': 0.0002790997522599616, 'samples': 13536960, 'steps': 70504, 'loss/train': 1.4552417993545532} 11/07/2021 07:08:19 - INFO - __main__ - Step 70506: {'lr': 0.00027909448158148066, 'samples': 13537152, 'steps': 70505, 'loss/train': 0.9469470977783203} 11/07/2021 07:08:19 - INFO - __main__ - Step 70507: {'lr': 0.0002790892108898902, 'samples': 13537344, 'steps': 70506, 'loss/train': 1.632530689239502} 11/07/2021 07:08:19 - INFO - __main__ - Step 70508: {'lr': 0.00027908394018519257, 'samples': 13537536, 'steps': 70507, 'loss/train': 1.4218260049819946} 11/07/2021 07:08:20 - INFO - __main__ - Step 70509: {'lr': 0.00027907866946739015, 'samples': 13537728, 'steps': 70508, 'loss/train': 1.8497388362884521} 11/07/2021 07:08:21 - INFO - __main__ - Step 70510: {'lr': 0.00027907339873648536, 'samples': 13537920, 'steps': 70509, 'loss/train': 1.1216464042663574} 11/07/2021 07:08:21 - INFO - __main__ - Step 70511: {'lr': 0.0002790681279924805, 'samples': 13538112, 'steps': 70510, 'loss/train': 1.37663996219635} 11/07/2021 07:08:21 - INFO - __main__ - Step 70512: {'lr': 0.00027906285723537807, 'samples': 13538304, 'steps': 70511, 'loss/train': 2.0285422801971436} 11/07/2021 07:08:22 - INFO - __main__ - Step 70513: {'lr': 0.00027905758646518033, 'samples': 13538496, 'steps': 70512, 'loss/train': 1.4167397022247314} 11/07/2021 07:08:22 - INFO - __main__ - Step 70514: {'lr': 0.00027905231568188966, 'samples': 13538688, 'steps': 70513, 'loss/train': 1.1685248613357544} 11/07/2021 07:08:23 - INFO - __main__ - Step 70515: {'lr': 0.0002790470448855085, 'samples': 13538880, 'steps': 70514, 'loss/train': 1.5812125205993652} 11/07/2021 07:08:23 - INFO - __main__ - Step 70516: {'lr': 0.00027904177407603916, 'samples': 13539072, 'steps': 70515, 'loss/train': 1.6827398538589478} 11/07/2021 07:08:24 - INFO - __main__ - Step 70517: {'lr': 0.00027903650325348405, 'samples': 13539264, 'steps': 70516, 'loss/train': 1.6553329229354858} 11/07/2021 07:08:24 - INFO - __main__ - Step 70518: {'lr': 0.00027903123241784555, 'samples': 13539456, 'steps': 70517, 'loss/train': 1.6173216104507446} 11/07/2021 07:08:24 - INFO - __main__ - Step 70519: {'lr': 0.0002790259615691261, 'samples': 13539648, 'steps': 70518, 'loss/train': 1.1967189311981201} 11/07/2021 07:08:25 - INFO - __main__ - Step 70520: {'lr': 0.00027902069070732786, 'samples': 13539840, 'steps': 70519, 'loss/train': 1.3620891571044922} 11/07/2021 07:08:26 - INFO - __main__ - Step 70521: {'lr': 0.00027901541983245344, 'samples': 13540032, 'steps': 70520, 'loss/train': 1.5961666107177734} 11/07/2021 07:08:26 - INFO - __main__ - Step 70522: {'lr': 0.00027901014894450506, 'samples': 13540224, 'steps': 70521, 'loss/train': 1.0133477449417114} 11/07/2021 07:08:27 - INFO - __main__ - Step 70523: {'lr': 0.00027900487804348516, 'samples': 13540416, 'steps': 70522, 'loss/train': 1.3026834726333618} 11/07/2021 07:08:27 - INFO - __main__ - Step 70524: {'lr': 0.00027899960712939617, 'samples': 13540608, 'steps': 70523, 'loss/train': 1.2399920225143433} 11/07/2021 07:08:28 - INFO - __main__ - Step 70525: {'lr': 0.00027899433620224033, 'samples': 13540800, 'steps': 70524, 'loss/train': 1.7029109001159668} 11/07/2021 07:08:28 - INFO - __main__ - Step 70526: {'lr': 0.0002789890652620202, 'samples': 13540992, 'steps': 70525, 'loss/train': 1.4114327430725098} 11/07/2021 07:08:29 - INFO - __main__ - Step 70527: {'lr': 0.00027898379430873793, 'samples': 13541184, 'steps': 70526, 'loss/train': 1.194228172302246} 11/07/2021 07:08:29 - INFO - __main__ - Step 70528: {'lr': 0.00027897852334239604, 'samples': 13541376, 'steps': 70527, 'loss/train': 1.5930590629577637} 11/07/2021 07:08:29 - INFO - __main__ - Step 70529: {'lr': 0.0002789732523629969, 'samples': 13541568, 'steps': 70528, 'loss/train': 1.1550674438476562} 11/07/2021 07:08:30 - INFO - __main__ - Step 70530: {'lr': 0.0002789679813705428, 'samples': 13541760, 'steps': 70529, 'loss/train': 1.546850323677063} 11/07/2021 07:08:31 - INFO - __main__ - Step 70531: {'lr': 0.0002789627103650362, 'samples': 13541952, 'steps': 70530, 'loss/train': 1.6009258031845093} 11/07/2021 07:08:31 - INFO - __main__ - Step 70532: {'lr': 0.0002789574393464795, 'samples': 13542144, 'steps': 70531, 'loss/train': 2.1406009197235107} 11/07/2021 07:08:31 - INFO - __main__ - Step 70533: {'lr': 0.000278952168314875, 'samples': 13542336, 'steps': 70532, 'loss/train': 1.1898466348648071} 11/07/2021 07:08:32 - INFO - __main__ - Step 70534: {'lr': 0.00027894689727022516, 'samples': 13542528, 'steps': 70533, 'loss/train': 1.4474354982376099} 11/07/2021 07:08:32 - INFO - __main__ - Step 70535: {'lr': 0.0002789416262125322, 'samples': 13542720, 'steps': 70534, 'loss/train': 1.1027252674102783} 11/07/2021 07:08:33 - INFO - __main__ - Step 70536: {'lr': 0.0002789363551417986, 'samples': 13542912, 'steps': 70535, 'loss/train': 1.0231201648712158} 11/07/2021 07:08:34 - INFO - __main__ - Step 70537: {'lr': 0.00027893108405802676, 'samples': 13543104, 'steps': 70536, 'loss/train': 1.4566291570663452} 11/07/2021 07:08:34 - INFO - __main__ - Step 70538: {'lr': 0.0002789258129612189, 'samples': 13543296, 'steps': 70537, 'loss/train': 0.09858386218547821} 11/07/2021 07:08:34 - INFO - __main__ - Step 70539: {'lr': 0.00027892054185137767, 'samples': 13543488, 'steps': 70538, 'loss/train': 0.6774685978889465} 11/07/2021 07:08:35 - INFO - __main__ - Step 70540: {'lr': 0.00027891527072850534, 'samples': 13543680, 'steps': 70539, 'loss/train': 1.0203964710235596} 11/07/2021 07:08:36 - INFO - __main__ - Step 70541: {'lr': 0.0002789099995926041, 'samples': 13543872, 'steps': 70540, 'loss/train': 1.3418933153152466} 11/07/2021 07:08:36 - INFO - __main__ - Step 70542: {'lr': 0.0002789047284436765, 'samples': 13544064, 'steps': 70541, 'loss/train': 0.9420663714408875} 11/07/2021 07:08:36 - INFO - __main__ - Step 70543: {'lr': 0.00027889945728172484, 'samples': 13544256, 'steps': 70542, 'loss/train': 1.058574914932251} 11/07/2021 07:08:37 - INFO - __main__ - Step 70544: {'lr': 0.00027889418610675155, 'samples': 13544448, 'steps': 70543, 'loss/train': 1.496216893196106} 11/07/2021 07:08:37 - INFO - __main__ - Step 70545: {'lr': 0.00027888891491875895, 'samples': 13544640, 'steps': 70544, 'loss/train': 1.6137653589248657} 11/07/2021 07:08:38 - INFO - __main__ - Step 70546: {'lr': 0.0002788836437177495, 'samples': 13544832, 'steps': 70545, 'loss/train': 1.5163904428482056} 11/07/2021 07:08:38 - INFO - __main__ - Step 70547: {'lr': 0.00027887837250372555, 'samples': 13545024, 'steps': 70546, 'loss/train': 1.6370583772659302} 11/07/2021 07:08:39 - INFO - __main__ - Step 70548: {'lr': 0.0002788731012766894, 'samples': 13545216, 'steps': 70547, 'loss/train': 1.0212774276733398} 11/07/2021 07:08:39 - INFO - __main__ - Step 70549: {'lr': 0.0002788678300366435, 'samples': 13545408, 'steps': 70548, 'loss/train': 1.3658273220062256} 11/07/2021 07:08:39 - INFO - __main__ - Step 70550: {'lr': 0.00027886255878359025, 'samples': 13545600, 'steps': 70549, 'loss/train': 1.7129428386688232} 11/07/2021 07:08:41 - INFO - __main__ - Step 70551: {'lr': 0.00027885728751753184, 'samples': 13545792, 'steps': 70550, 'loss/train': 1.3123736381530762} 11/07/2021 07:08:41 - INFO - __main__ - Step 70552: {'lr': 0.0002788520162384709, 'samples': 13545984, 'steps': 70551, 'loss/train': 0.9682538509368896} 11/07/2021 07:08:41 - INFO - __main__ - Step 70553: {'lr': 0.0002788467449464097, 'samples': 13546176, 'steps': 70552, 'loss/train': 0.8099843859672546} 11/07/2021 07:08:42 - INFO - __main__ - Step 70554: {'lr': 0.00027884147364135053, 'samples': 13546368, 'steps': 70553, 'loss/train': 1.3585649728775024} 11/07/2021 07:08:42 - INFO - __main__ - Step 70555: {'lr': 0.0002788362023232959, 'samples': 13546560, 'steps': 70554, 'loss/train': 1.3078289031982422} 11/07/2021 07:08:43 - INFO - __main__ - Step 70556: {'lr': 0.00027883093099224807, 'samples': 13546752, 'steps': 70555, 'loss/train': 1.522147536277771} 11/07/2021 07:08:43 - INFO - __main__ - Step 70557: {'lr': 0.0002788256596482095, 'samples': 13546944, 'steps': 70556, 'loss/train': 1.3330596685409546} 11/07/2021 07:08:44 - INFO - __main__ - Step 70558: {'lr': 0.0002788203882911825, 'samples': 13547136, 'steps': 70557, 'loss/train': 0.9916807413101196} 11/07/2021 07:08:44 - INFO - __main__ - Step 70559: {'lr': 0.0002788151169211695, 'samples': 13547328, 'steps': 70558, 'loss/train': 1.476714015007019} 11/07/2021 07:08:44 - INFO - __main__ - Step 70560: {'lr': 0.0002788098455381728, 'samples': 13547520, 'steps': 70559, 'loss/train': 1.5472453832626343} 11/07/2021 07:08:45 - INFO - __main__ - Step 70561: {'lr': 0.0002788045741421949, 'samples': 13547712, 'steps': 70560, 'loss/train': 1.4303231239318848} 11/07/2021 07:08:46 - INFO - __main__ - Step 70562: {'lr': 0.0002787993027332381, 'samples': 13547904, 'steps': 70561, 'loss/train': 1.2174360752105713} 11/07/2021 07:08:46 - INFO - __main__ - Step 70563: {'lr': 0.00027879403131130475, 'samples': 13548096, 'steps': 70562, 'loss/train': 1.3174700736999512} 11/07/2021 07:08:46 - INFO - __main__ - Step 70564: {'lr': 0.00027878875987639724, 'samples': 13548288, 'steps': 70563, 'loss/train': 1.6516534090042114} 11/07/2021 07:08:47 - INFO - __main__ - Step 70565: {'lr': 0.000278783488428518, 'samples': 13548480, 'steps': 70564, 'loss/train': 1.5013868808746338} 11/07/2021 07:08:48 - INFO - __main__ - Step 70566: {'lr': 0.00027877821696766934, 'samples': 13548672, 'steps': 70565, 'loss/train': 1.3499739170074463} 11/07/2021 07:08:48 - INFO - __main__ - Step 70567: {'lr': 0.00027877294549385367, 'samples': 13548864, 'steps': 70566, 'loss/train': 0.9228886365890503} 11/07/2021 07:08:49 - INFO - __main__ - Step 70568: {'lr': 0.0002787676740070734, 'samples': 13549056, 'steps': 70567, 'loss/train': 1.5236775875091553} 11/07/2021 07:08:49 - INFO - __main__ - Step 70569: {'lr': 0.00027876240250733074, 'samples': 13549248, 'steps': 70568, 'loss/train': 1.2186427116394043} 11/07/2021 07:08:49 - INFO - __main__ - Step 70570: {'lr': 0.0002787571309946283, 'samples': 13549440, 'steps': 70569, 'loss/train': 1.2033042907714844} 11/07/2021 07:08:50 - INFO - __main__ - Step 70571: {'lr': 0.0002787518594689683, 'samples': 13549632, 'steps': 70570, 'loss/train': 1.4816943407058716} 11/07/2021 07:08:51 - INFO - __main__ - Step 70572: {'lr': 0.00027874658793035313, 'samples': 13549824, 'steps': 70571, 'loss/train': 1.1568748950958252} 11/07/2021 07:08:51 - INFO - __main__ - Step 70573: {'lr': 0.0002787413163787852, 'samples': 13550016, 'steps': 70572, 'loss/train': 1.2063486576080322} 11/07/2021 07:08:51 - INFO - __main__ - Step 70574: {'lr': 0.0002787360448142669, 'samples': 13550208, 'steps': 70573, 'loss/train': 1.532394528388977} 11/07/2021 07:08:52 - INFO - __main__ - Step 70575: {'lr': 0.0002787307732368006, 'samples': 13550400, 'steps': 70574, 'loss/train': 1.2683215141296387} 11/07/2021 07:08:52 - INFO - __main__ - Step 70576: {'lr': 0.0002787255016463886, 'samples': 13550592, 'steps': 70575, 'loss/train': 1.3644307851791382} 11/07/2021 07:08:53 - INFO - __main__ - Step 70577: {'lr': 0.0002787202300430334, 'samples': 13550784, 'steps': 70576, 'loss/train': 1.4019757509231567} 11/07/2021 07:08:54 - INFO - __main__ - Step 70578: {'lr': 0.00027871495842673723, 'samples': 13550976, 'steps': 70577, 'loss/train': 1.101379156112671} 11/07/2021 07:08:54 - INFO - __main__ - Step 70579: {'lr': 0.0002787096867975026, 'samples': 13551168, 'steps': 70578, 'loss/train': 1.4642106294631958} 11/07/2021 07:08:54 - INFO - __main__ - Step 70580: {'lr': 0.0002787044151553317, 'samples': 13551360, 'steps': 70579, 'loss/train': 1.6814250946044922} 11/07/2021 07:08:55 - INFO - __main__ - Step 70581: {'lr': 0.00027869914350022725, 'samples': 13551552, 'steps': 70580, 'loss/train': 1.723488688468933} 11/07/2021 07:08:56 - INFO - __main__ - Step 70582: {'lr': 0.0002786938718321913, 'samples': 13551744, 'steps': 70581, 'loss/train': 0.9852248430252075} 11/07/2021 07:08:56 - INFO - __main__ - Step 70583: {'lr': 0.0002786886001512263, 'samples': 13551936, 'steps': 70582, 'loss/train': 1.6963603496551514} 11/07/2021 07:08:56 - INFO - __main__ - Step 70584: {'lr': 0.0002786833284573347, 'samples': 13552128, 'steps': 70583, 'loss/train': 1.7493575811386108} 11/07/2021 07:08:57 - INFO - __main__ - Step 70585: {'lr': 0.0002786780567505188, 'samples': 13552320, 'steps': 70584, 'loss/train': 1.2189490795135498} 11/07/2021 07:08:57 - INFO - __main__ - Step 70586: {'lr': 0.00027867278503078104, 'samples': 13552512, 'steps': 70585, 'loss/train': 1.4387555122375488} 11/07/2021 07:08:58 - INFO - __main__ - Step 70587: {'lr': 0.0002786675132981238, 'samples': 13552704, 'steps': 70586, 'loss/train': 1.2731497287750244} 11/07/2021 07:08:59 - INFO - __main__ - Step 70588: {'lr': 0.0002786622415525494, 'samples': 13552896, 'steps': 70587, 'loss/train': 1.4351741075515747} 11/07/2021 07:08:59 - INFO - __main__ - Step 70589: {'lr': 0.00027865696979406017, 'samples': 13553088, 'steps': 70588, 'loss/train': 1.7151063680648804} 11/07/2021 07:08:59 - INFO - __main__ - Step 70590: {'lr': 0.0002786516980226586, 'samples': 13553280, 'steps': 70589, 'loss/train': 1.3963944911956787} 11/07/2021 07:09:00 - INFO - __main__ - Step 70591: {'lr': 0.00027864642623834704, 'samples': 13553472, 'steps': 70590, 'loss/train': 1.6893831491470337} 11/07/2021 07:09:01 - INFO - __main__ - Step 70592: {'lr': 0.0002786411544411278, 'samples': 13553664, 'steps': 70591, 'loss/train': 1.6053802967071533} 11/07/2021 07:09:01 - INFO - __main__ - Step 70593: {'lr': 0.0002786358826310034, 'samples': 13553856, 'steps': 70592, 'loss/train': 1.530112385749817} 11/07/2021 07:09:01 - INFO - __main__ - Step 70594: {'lr': 0.000278630610807976, 'samples': 13554048, 'steps': 70593, 'loss/train': 0.682702362537384} 11/07/2021 07:09:02 - INFO - __main__ - Step 70595: {'lr': 0.00027862533897204814, 'samples': 13554240, 'steps': 70594, 'loss/train': 1.2324657440185547} 11/07/2021 07:09:02 - INFO - __main__ - Step 70596: {'lr': 0.00027862006712322206, 'samples': 13554432, 'steps': 70595, 'loss/train': 1.5362902879714966} 11/07/2021 07:09:02 - INFO - __main__ - Step 70597: {'lr': 0.0002786147952615003, 'samples': 13554624, 'steps': 70596, 'loss/train': 1.6894762516021729} 11/07/2021 07:09:04 - INFO - __main__ - Step 70598: {'lr': 0.00027860952338688513, 'samples': 13554816, 'steps': 70597, 'loss/train': 1.1273564100265503} 11/07/2021 07:09:04 - INFO - __main__ - Step 70599: {'lr': 0.00027860425149937894, 'samples': 13555008, 'steps': 70598, 'loss/train': 1.428545355796814} 11/07/2021 07:09:04 - INFO - __main__ - Step 70600: {'lr': 0.0002785989795989842, 'samples': 13555200, 'steps': 70599, 'loss/train': 1.7711379528045654} 11/07/2021 07:09:05 - INFO - __main__ - Step 70601: {'lr': 0.0002785937076857031, 'samples': 13555392, 'steps': 70600, 'loss/train': 1.031951665878296} 11/07/2021 07:09:05 - INFO - __main__ - Step 70602: {'lr': 0.0002785884357595382, 'samples': 13555584, 'steps': 70601, 'loss/train': 1.6316430568695068} 11/07/2021 07:09:06 - INFO - __main__ - Step 70603: {'lr': 0.0002785831638204917, 'samples': 13555776, 'steps': 70602, 'loss/train': 1.2252708673477173} 11/07/2021 07:09:07 - INFO - __main__ - Step 70604: {'lr': 0.0002785778918685661, 'samples': 13555968, 'steps': 70603, 'loss/train': 2.8785126209259033} 11/07/2021 07:09:07 - INFO - __main__ - Step 70605: {'lr': 0.0002785726199037638, 'samples': 13556160, 'steps': 70604, 'loss/train': 0.32988330721855164} 11/07/2021 07:09:08 - INFO - __main__ - Step 70606: {'lr': 0.000278567347926087, 'samples': 13556352, 'steps': 70605, 'loss/train': 1.1840134859085083} 11/07/2021 07:09:08 - INFO - __main__ - Step 70607: {'lr': 0.00027856207593553834, 'samples': 13556544, 'steps': 70606, 'loss/train': 1.3914293050765991} 11/07/2021 07:09:09 - INFO - __main__ - Step 70608: {'lr': 0.00027855680393211994, 'samples': 13556736, 'steps': 70607, 'loss/train': 0.45522749423980713} 11/07/2021 07:09:09 - INFO - __main__ - Step 70609: {'lr': 0.00027855153191583433, 'samples': 13556928, 'steps': 70608, 'loss/train': 1.3554887771606445} 11/07/2021 07:09:10 - INFO - __main__ - Step 70610: {'lr': 0.0002785462598866838, 'samples': 13557120, 'steps': 70609, 'loss/train': 0.5543954372406006} 11/07/2021 07:09:10 - INFO - __main__ - Step 70611: {'lr': 0.0002785409878446708, 'samples': 13557312, 'steps': 70610, 'loss/train': 1.4150285720825195} 11/07/2021 07:09:10 - INFO - __main__ - Step 70612: {'lr': 0.00027853571578979766, 'samples': 13557504, 'steps': 70611, 'loss/train': 1.4228429794311523} 11/07/2021 07:09:11 - INFO - __main__ - Step 70613: {'lr': 0.00027853044372206677, 'samples': 13557696, 'steps': 70612, 'loss/train': 1.3056589365005493} 11/07/2021 07:09:12 - INFO - __main__ - Step 70614: {'lr': 0.00027852517164148055, 'samples': 13557888, 'steps': 70613, 'loss/train': 1.0286470651626587} 11/07/2021 07:09:12 - INFO - __main__ - Step 70615: {'lr': 0.0002785198995480413, 'samples': 13558080, 'steps': 70614, 'loss/train': 1.500592827796936} 11/07/2021 07:09:12 - INFO - __main__ - Step 70616: {'lr': 0.0002785146274417514, 'samples': 13558272, 'steps': 70615, 'loss/train': 1.41209077835083} 11/07/2021 07:09:13 - INFO - __main__ - Step 70617: {'lr': 0.0002785093553226132, 'samples': 13558464, 'steps': 70616, 'loss/train': 1.2917201519012451} 11/07/2021 07:09:13 - INFO - __main__ - Step 70618: {'lr': 0.0002785040831906292, 'samples': 13558656, 'steps': 70617, 'loss/train': 1.5135064125061035} 11/07/2021 07:09:14 - INFO - __main__ - Step 70619: {'lr': 0.00027849881104580166, 'samples': 13558848, 'steps': 70618, 'loss/train': 1.4286928176879883} 11/07/2021 07:09:15 - INFO - __main__ - Step 70620: {'lr': 0.00027849353888813306, 'samples': 13559040, 'steps': 70619, 'loss/train': 1.2647939920425415} 11/07/2021 07:09:15 - INFO - __main__ - Step 70621: {'lr': 0.00027848826671762565, 'samples': 13559232, 'steps': 70620, 'loss/train': 2.017130136489868} 11/07/2021 07:09:15 - INFO - __main__ - Step 70622: {'lr': 0.0002784829945342819, 'samples': 13559424, 'steps': 70621, 'loss/train': 1.211207389831543} 11/07/2021 07:09:16 - INFO - __main__ - Step 70623: {'lr': 0.00027847772233810416, 'samples': 13559616, 'steps': 70622, 'loss/train': 1.7866718769073486} 11/07/2021 07:09:16 - INFO - __main__ - Step 70624: {'lr': 0.00027847245012909474, 'samples': 13559808, 'steps': 70623, 'loss/train': 1.6412454843521118} 11/07/2021 07:09:17 - INFO - __main__ - Step 70625: {'lr': 0.0002784671779072561, 'samples': 13560000, 'steps': 70624, 'loss/train': 1.4935907125473022} 11/07/2021 07:09:17 - INFO - __main__ - Step 70626: {'lr': 0.0002784619056725906, 'samples': 13560192, 'steps': 70625, 'loss/train': 1.3818411827087402} 11/07/2021 07:09:18 - INFO - __main__ - Step 70627: {'lr': 0.0002784566334251006, 'samples': 13560384, 'steps': 70626, 'loss/train': 1.5722885131835938} 11/07/2021 07:09:18 - INFO - __main__ - Step 70628: {'lr': 0.00027845136116478854, 'samples': 13560576, 'steps': 70627, 'loss/train': 1.3813481330871582} 11/07/2021 07:09:18 - INFO - __main__ - Step 70629: {'lr': 0.00027844608889165663, 'samples': 13560768, 'steps': 70628, 'loss/train': 1.3232840299606323} 11/07/2021 07:09:19 - INFO - __main__ - Step 70630: {'lr': 0.0002784408166057074, 'samples': 13560960, 'steps': 70629, 'loss/train': 1.6443331241607666} 11/07/2021 07:09:20 - INFO - __main__ - Step 70631: {'lr': 0.00027843554430694316, 'samples': 13561152, 'steps': 70630, 'loss/train': 1.7480677366256714} 11/07/2021 07:09:20 - INFO - __main__ - Step 70632: {'lr': 0.0002784302719953663, 'samples': 13561344, 'steps': 70631, 'loss/train': 0.876837432384491} 11/07/2021 07:09:20 - INFO - __main__ - Step 70633: {'lr': 0.00027842499967097923, 'samples': 13561536, 'steps': 70632, 'loss/train': 1.3471646308898926} 11/07/2021 07:09:21 - INFO - __main__ - Step 70634: {'lr': 0.00027841972733378437, 'samples': 13561728, 'steps': 70633, 'loss/train': 1.1362682580947876} 11/07/2021 07:09:22 - INFO - __main__ - Step 70635: {'lr': 0.0002784144549837839, 'samples': 13561920, 'steps': 70634, 'loss/train': 1.4443871974945068} 11/07/2021 07:09:22 - INFO - __main__ - Step 70636: {'lr': 0.0002784091826209803, 'samples': 13562112, 'steps': 70635, 'loss/train': 0.8691002130508423} 11/07/2021 07:09:22 - INFO - __main__ - Step 70637: {'lr': 0.000278403910245376, 'samples': 13562304, 'steps': 70636, 'loss/train': 1.63344144821167} 11/07/2021 07:09:23 - INFO - __main__ - Step 70638: {'lr': 0.00027839863785697336, 'samples': 13562496, 'steps': 70637, 'loss/train': 1.190106749534607} 11/07/2021 07:09:23 - INFO - __main__ - Step 70639: {'lr': 0.0002783933654557747, 'samples': 13562688, 'steps': 70638, 'loss/train': 1.5888333320617676} 11/07/2021 07:09:24 - INFO - __main__ - Step 70640: {'lr': 0.00027838809304178247, 'samples': 13562880, 'steps': 70639, 'loss/train': 1.3501827716827393} 11/07/2021 07:09:24 - INFO - __main__ - Step 70641: {'lr': 0.000278382820614999, 'samples': 13563072, 'steps': 70640, 'loss/train': 1.5993764400482178} 11/07/2021 07:09:25 - INFO - __main__ - Step 70642: {'lr': 0.0002783775481754266, 'samples': 13563264, 'steps': 70641, 'loss/train': 0.9879317879676819} 11/07/2021 07:09:25 - INFO - __main__ - Step 70643: {'lr': 0.0002783722757230678, 'samples': 13563456, 'steps': 70642, 'loss/train': 1.2351183891296387} 11/07/2021 07:09:25 - INFO - __main__ - Step 70644: {'lr': 0.0002783670032579248, 'samples': 13563648, 'steps': 70643, 'loss/train': 1.6814396381378174} 11/07/2021 07:09:26 - INFO - __main__ - Step 70645: {'lr': 0.0002783617307800001, 'samples': 13563840, 'steps': 70644, 'loss/train': 1.4303150177001953} 11/07/2021 07:09:27 - INFO - __main__ - Step 70646: {'lr': 0.00027835645828929606, 'samples': 13564032, 'steps': 70645, 'loss/train': 1.4317177534103394} 11/07/2021 07:09:27 - INFO - __main__ - Step 70647: {'lr': 0.0002783511857858151, 'samples': 13564224, 'steps': 70646, 'loss/train': 1.3798998594284058} 11/07/2021 07:09:27 - INFO - __main__ - Step 70648: {'lr': 0.0002783459132695594, 'samples': 13564416, 'steps': 70647, 'loss/train': 1.2226121425628662} 11/07/2021 07:09:28 - INFO - __main__ - Step 70649: {'lr': 0.00027834064074053156, 'samples': 13564608, 'steps': 70648, 'loss/train': 1.5492680072784424} 11/07/2021 07:09:29 - INFO - __main__ - Step 70650: {'lr': 0.0002783353681987338, 'samples': 13564800, 'steps': 70649, 'loss/train': 1.5674946308135986} 11/07/2021 07:09:29 - INFO - __main__ - Step 70651: {'lr': 0.0002783300956441686, 'samples': 13564992, 'steps': 70650, 'loss/train': 1.7461271286010742} 11/07/2021 07:09:30 - INFO - __main__ - Step 70652: {'lr': 0.00027832482307683833, 'samples': 13565184, 'steps': 70651, 'loss/train': 1.4913543462753296} 11/07/2021 07:09:30 - INFO - __main__ - Step 70653: {'lr': 0.00027831955049674526, 'samples': 13565376, 'steps': 70652, 'loss/train': 1.3727444410324097} 11/07/2021 07:09:30 - INFO - __main__ - Step 70654: {'lr': 0.0002783142779038919, 'samples': 13565568, 'steps': 70653, 'loss/train': 1.5695013999938965} 11/07/2021 07:09:31 - INFO - __main__ - Step 70655: {'lr': 0.00027830900529828055, 'samples': 13565760, 'steps': 70654, 'loss/train': 1.7432290315628052} 11/07/2021 07:09:32 - INFO - __main__ - Step 70656: {'lr': 0.0002783037326799136, 'samples': 13565952, 'steps': 70655, 'loss/train': 1.5537822246551514} 11/07/2021 07:09:32 - INFO - __main__ - Step 70657: {'lr': 0.0002782984600487934, 'samples': 13566144, 'steps': 70656, 'loss/train': 1.2468711137771606} 11/07/2021 07:09:32 - INFO - __main__ - Step 70658: {'lr': 0.00027829318740492235, 'samples': 13566336, 'steps': 70657, 'loss/train': 1.4477020502090454} 11/07/2021 07:09:33 - INFO - __main__ - Step 70659: {'lr': 0.0002782879147483028, 'samples': 13566528, 'steps': 70658, 'loss/train': 1.4290199279785156} 11/07/2021 07:09:33 - INFO - __main__ - Step 70660: {'lr': 0.0002782826420789372, 'samples': 13566720, 'steps': 70659, 'loss/train': 1.2643589973449707} 11/07/2021 07:09:34 - INFO - __main__ - Step 70661: {'lr': 0.00027827736939682796, 'samples': 13566912, 'steps': 70660, 'loss/train': 1.0949851274490356} 11/07/2021 07:09:34 - INFO - __main__ - Step 70662: {'lr': 0.00027827209670197724, 'samples': 13567104, 'steps': 70661, 'loss/train': 1.439911127090454} 11/07/2021 07:09:35 - INFO - __main__ - Step 70663: {'lr': 0.0002782668239943876, 'samples': 13567296, 'steps': 70662, 'loss/train': 0.9747066497802734} 11/07/2021 07:09:35 - INFO - __main__ - Step 70664: {'lr': 0.0002782615512740613, 'samples': 13567488, 'steps': 70663, 'loss/train': 1.2346113920211792} 11/07/2021 07:09:35 - INFO - __main__ - Step 70665: {'lr': 0.00027825627854100087, 'samples': 13567680, 'steps': 70664, 'loss/train': 1.4695490598678589} 11/07/2021 07:09:36 - INFO - __main__ - Step 70666: {'lr': 0.0002782510057952086, 'samples': 13567872, 'steps': 70665, 'loss/train': 1.7512093782424927} 11/07/2021 07:09:37 - INFO - __main__ - Step 70667: {'lr': 0.00027824573303668684, 'samples': 13568064, 'steps': 70666, 'loss/train': 1.9453309774398804} 11/07/2021 07:09:37 - INFO - __main__ - Step 70668: {'lr': 0.000278240460265438, 'samples': 13568256, 'steps': 70667, 'loss/train': 1.3926864862442017} 11/07/2021 07:09:37 - INFO - __main__ - Step 70669: {'lr': 0.0002782351874814644, 'samples': 13568448, 'steps': 70668, 'loss/train': 1.4433891773223877} 11/07/2021 07:09:38 - INFO - __main__ - Step 70670: {'lr': 0.0002782299146847684, 'samples': 13568640, 'steps': 70669, 'loss/train': 1.2853816747665405} 11/07/2021 07:09:39 - INFO - __main__ - Step 70671: {'lr': 0.00027822464187535255, 'samples': 13568832, 'steps': 70670, 'loss/train': 1.4138010740280151} 11/07/2021 07:09:39 - INFO - __main__ - Step 70672: {'lr': 0.0002782193690532191, 'samples': 13569024, 'steps': 70671, 'loss/train': 1.0955736637115479} 11/07/2021 07:09:39 - INFO - __main__ - Step 70673: {'lr': 0.0002782140962183704, 'samples': 13569216, 'steps': 70672, 'loss/train': 1.5170079469680786} 11/07/2021 07:09:40 - INFO - __main__ - Step 70674: {'lr': 0.00027820882337080893, 'samples': 13569408, 'steps': 70673, 'loss/train': 1.1129891872406006} 11/07/2021 07:09:40 - INFO - __main__ - Step 70675: {'lr': 0.00027820355051053693, 'samples': 13569600, 'steps': 70674, 'loss/train': 1.132887363433838} 11/07/2021 07:09:41 - INFO - __main__ - Step 70676: {'lr': 0.0002781982776375569, 'samples': 13569792, 'steps': 70675, 'loss/train': 1.1577401161193848} 11/07/2021 07:09:42 - INFO - __main__ - Step 70677: {'lr': 0.0002781930047518711, 'samples': 13569984, 'steps': 70676, 'loss/train': 1.1156433820724487} 11/07/2021 07:09:42 - INFO - __main__ - Step 70678: {'lr': 0.000278187731853482, 'samples': 13570176, 'steps': 70677, 'loss/train': 1.6099117994308472} 11/07/2021 07:09:42 - INFO - __main__ - Step 70679: {'lr': 0.00027818245894239193, 'samples': 13570368, 'steps': 70678, 'loss/train': 1.4455037117004395} 11/07/2021 07:09:43 - INFO - __main__ - Step 70680: {'lr': 0.00027817718601860325, 'samples': 13570560, 'steps': 70679, 'loss/train': 1.6893229484558105} 11/07/2021 07:09:44 - INFO - __main__ - Step 70681: {'lr': 0.0002781719130821185, 'samples': 13570752, 'steps': 70680, 'loss/train': 1.6016168594360352} 11/07/2021 07:09:44 - INFO - __main__ - Step 70682: {'lr': 0.0002781666401329398, 'samples': 13570944, 'steps': 70681, 'loss/train': 1.5906577110290527} 11/07/2021 07:09:44 - INFO - __main__ - Step 70683: {'lr': 0.00027816136717106967, 'samples': 13571136, 'steps': 70682, 'loss/train': 1.1905170679092407} 11/07/2021 07:09:45 - INFO - __main__ - Step 70684: {'lr': 0.0002781560941965104, 'samples': 13571328, 'steps': 70683, 'loss/train': 1.0085197687149048} 11/07/2021 07:09:45 - INFO - __main__ - Step 70685: {'lr': 0.0002781508212092645, 'samples': 13571520, 'steps': 70684, 'loss/train': 1.3324549198150635} 11/07/2021 07:09:47 - INFO - __main__ - Step 70686: {'lr': 0.00027814554820933425, 'samples': 13571712, 'steps': 70685, 'loss/train': 1.4570213556289673} 11/07/2021 07:09:47 - INFO - __main__ - Step 70687: {'lr': 0.00027814027519672214, 'samples': 13571904, 'steps': 70686, 'loss/train': 0.08065914362668991} 11/07/2021 07:09:47 - INFO - __main__ - Step 70688: {'lr': 0.00027813500217143035, 'samples': 13572096, 'steps': 70687, 'loss/train': 1.028550148010254} 11/07/2021 07:09:48 - INFO - __main__ - Step 70689: {'lr': 0.0002781297291334614, 'samples': 13572288, 'steps': 70688, 'loss/train': 1.6267510652542114} 11/07/2021 07:09:48 - INFO - __main__ - Step 70690: {'lr': 0.0002781244560828176, 'samples': 13572480, 'steps': 70689, 'loss/train': 1.7705442905426025} 11/07/2021 07:09:49 - INFO - __main__ - Step 70691: {'lr': 0.00027811918301950137, 'samples': 13572672, 'steps': 70690, 'loss/train': 1.1303797960281372} 11/07/2021 07:09:49 - INFO - __main__ - Step 70692: {'lr': 0.00027811390994351504, 'samples': 13572864, 'steps': 70691, 'loss/train': 1.3704404830932617} 11/07/2021 07:09:50 - INFO - __main__ - Step 70693: {'lr': 0.00027810863685486106, 'samples': 13573056, 'steps': 70692, 'loss/train': 1.5599164962768555} 11/07/2021 07:09:50 - INFO - __main__ - Step 70694: {'lr': 0.0002781033637535418, 'samples': 13573248, 'steps': 70693, 'loss/train': 1.5683164596557617} 11/07/2021 07:09:50 - INFO - __main__ - Step 70695: {'lr': 0.00027809809063955956, 'samples': 13573440, 'steps': 70694, 'loss/train': 3.32000470161438} 11/07/2021 07:09:51 - INFO - __main__ - Step 70696: {'lr': 0.0002780928175129167, 'samples': 13573632, 'steps': 70695, 'loss/train': 1.2090742588043213} 11/07/2021 07:09:52 - INFO - __main__ - Step 70697: {'lr': 0.00027808754437361573, 'samples': 13573824, 'steps': 70696, 'loss/train': 1.4242146015167236} 11/07/2021 07:09:52 - INFO - __main__ - Step 70698: {'lr': 0.00027808227122165887, 'samples': 13574016, 'steps': 70697, 'loss/train': 1.6041350364685059} 11/07/2021 07:09:53 - INFO - __main__ - Step 70699: {'lr': 0.00027807699805704867, 'samples': 13574208, 'steps': 70698, 'loss/train': 1.715067982673645} 11/07/2021 07:09:53 - INFO - __main__ - Step 70700: {'lr': 0.00027807172487978734, 'samples': 13574400, 'steps': 70699, 'loss/train': 1.3146439790725708} 11/07/2021 07:09:53 - INFO - __main__ - Step 70701: {'lr': 0.00027806645168987733, 'samples': 13574592, 'steps': 70700, 'loss/train': 1.2579773664474487} 11/07/2021 07:09:54 - INFO - __main__ - Step 70702: {'lr': 0.00027806117848732097, 'samples': 13574784, 'steps': 70701, 'loss/train': 1.5384119749069214} 11/07/2021 07:09:55 - INFO - __main__ - Step 70703: {'lr': 0.00027805590527212075, 'samples': 13574976, 'steps': 70702, 'loss/train': 5.780084133148193} 11/07/2021 07:09:55 - INFO - __main__ - Step 70704: {'lr': 0.00027805063204427896, 'samples': 13575168, 'steps': 70703, 'loss/train': 1.0025701522827148} 11/07/2021 07:09:56 - INFO - __main__ - Step 70705: {'lr': 0.000278045358803798, 'samples': 13575360, 'steps': 70704, 'loss/train': 1.6974632740020752} 11/07/2021 07:09:56 - INFO - __main__ - Step 70706: {'lr': 0.00027804008555068016, 'samples': 13575552, 'steps': 70705, 'loss/train': 1.2348369359970093} 11/07/2021 07:09:56 - INFO - __main__ - Step 70707: {'lr': 0.00027803481228492793, 'samples': 13575744, 'steps': 70706, 'loss/train': 1.7099609375} 11/07/2021 07:09:57 - INFO - __main__ - Step 70708: {'lr': 0.00027802953900654367, 'samples': 13575936, 'steps': 70707, 'loss/train': 1.68367600440979} 11/07/2021 07:09:58 - INFO - __main__ - Step 70709: {'lr': 0.0002780242657155297, 'samples': 13576128, 'steps': 70708, 'loss/train': 1.200756549835205} 11/07/2021 07:09:58 - INFO - __main__ - Step 70710: {'lr': 0.0002780189924118885, 'samples': 13576320, 'steps': 70709, 'loss/train': 0.3517167568206787} 11/07/2021 07:09:58 - INFO - __main__ - Step 70711: {'lr': 0.00027801371909562226, 'samples': 13576512, 'steps': 70710, 'loss/train': 1.3761019706726074} 11/07/2021 07:09:59 - INFO - __main__ - Step 70712: {'lr': 0.0002780084457667336, 'samples': 13576704, 'steps': 70711, 'loss/train': 1.5318166017532349} 11/07/2021 07:09:59 - INFO - __main__ - Step 70713: {'lr': 0.0002780031724252247, 'samples': 13576896, 'steps': 70712, 'loss/train': 1.5372647047042847} 11/07/2021 07:10:00 - INFO - __main__ - Step 70714: {'lr': 0.0002779978990710979, 'samples': 13577088, 'steps': 70713, 'loss/train': 1.6188114881515503} 11/07/2021 07:10:00 - INFO - __main__ - Step 70715: {'lr': 0.0002779926257043558, 'samples': 13577280, 'steps': 70714, 'loss/train': 1.3631532192230225} 11/07/2021 07:10:01 - INFO - __main__ - Step 70716: {'lr': 0.00027798735232500066, 'samples': 13577472, 'steps': 70715, 'loss/train': 1.8005391359329224} 11/07/2021 07:10:01 - INFO - __main__ - Step 70717: {'lr': 0.00027798207893303483, 'samples': 13577664, 'steps': 70716, 'loss/train': 1.4862309694290161} 11/07/2021 07:10:02 - INFO - __main__ - Step 70718: {'lr': 0.00027797680552846065, 'samples': 13577856, 'steps': 70717, 'loss/train': 1.650911569595337} 11/07/2021 07:10:02 - INFO - __main__ - Step 70719: {'lr': 0.0002779715321112806, 'samples': 13578048, 'steps': 70718, 'loss/train': 1.0584200620651245} 11/07/2021 07:10:03 - INFO - __main__ - Step 70720: {'lr': 0.000277966258681497, 'samples': 13578240, 'steps': 70719, 'loss/train': 0.9981743097305298} 11/07/2021 07:10:03 - INFO - __main__ - Step 70721: {'lr': 0.0002779609852391123, 'samples': 13578432, 'steps': 70720, 'loss/train': 1.5896501541137695} 11/07/2021 07:10:04 - INFO - __main__ - Step 70722: {'lr': 0.00027795571178412874, 'samples': 13578624, 'steps': 70721, 'loss/train': 1.5412817001342773} 11/07/2021 07:10:04 - INFO - __main__ - Step 70723: {'lr': 0.0002779504383165488, 'samples': 13578816, 'steps': 70722, 'loss/train': 1.444070816040039} 11/07/2021 07:10:05 - INFO - __main__ - Step 70724: {'lr': 0.0002779451648363748, 'samples': 13579008, 'steps': 70723, 'loss/train': 1.4770317077636719} 11/07/2021 07:10:05 - INFO - __main__ - Step 70725: {'lr': 0.00027793989134360916, 'samples': 13579200, 'steps': 70724, 'loss/train': 1.3430774211883545} 11/07/2021 07:10:06 - INFO - __main__ - Step 70726: {'lr': 0.00027793461783825416, 'samples': 13579392, 'steps': 70725, 'loss/train': 1.7351657152175903} 11/07/2021 07:10:06 - INFO - __main__ - Step 70727: {'lr': 0.00027792934432031234, 'samples': 13579584, 'steps': 70726, 'loss/train': 1.479139804840088} 11/07/2021 07:10:06 - INFO - __main__ - Step 70728: {'lr': 0.000277924070789786, 'samples': 13579776, 'steps': 70727, 'loss/train': 1.4129656553268433} 11/07/2021 07:10:07 - INFO - __main__ - Step 70729: {'lr': 0.00027791879724667747, 'samples': 13579968, 'steps': 70728, 'loss/train': 1.3199594020843506} 11/07/2021 07:10:08 - INFO - __main__ - Step 70730: {'lr': 0.00027791352369098914, 'samples': 13580160, 'steps': 70729, 'loss/train': 1.3415756225585938} 11/07/2021 07:10:08 - INFO - __main__ - Step 70731: {'lr': 0.0002779082501227234, 'samples': 13580352, 'steps': 70730, 'loss/train': 0.9116508960723877} 11/07/2021 07:10:08 - INFO - __main__ - Step 70732: {'lr': 0.0002779029765418827, 'samples': 13580544, 'steps': 70731, 'loss/train': 1.4555904865264893} 11/07/2021 07:10:09 - INFO - __main__ - Step 70733: {'lr': 0.0002778977029484693, 'samples': 13580736, 'steps': 70732, 'loss/train': 0.7715266942977905} 11/07/2021 07:10:10 - INFO - __main__ - Step 70734: {'lr': 0.0002778924293424856, 'samples': 13580928, 'steps': 70733, 'loss/train': 1.2421051263809204} 11/07/2021 07:10:10 - INFO - __main__ - Step 70735: {'lr': 0.00027788715572393406, 'samples': 13581120, 'steps': 70734, 'loss/train': 1.3610775470733643} 11/07/2021 07:10:11 - INFO - __main__ - Step 70736: {'lr': 0.000277881882092817, 'samples': 13581312, 'steps': 70735, 'loss/train': 1.4820523262023926} 11/07/2021 07:10:11 - INFO - __main__ - Step 70737: {'lr': 0.00027787660844913676, 'samples': 13581504, 'steps': 70736, 'loss/train': 1.4029576778411865} 11/07/2021 07:10:11 - INFO - __main__ - Step 70738: {'lr': 0.00027787133479289573, 'samples': 13581696, 'steps': 70737, 'loss/train': 1.2918736934661865} 11/07/2021 07:10:12 - INFO - __main__ - Step 70739: {'lr': 0.00027786606112409633, 'samples': 13581888, 'steps': 70738, 'loss/train': 1.2608716487884521} 11/07/2021 07:10:12 - INFO - __main__ - Step 70740: {'lr': 0.0002778607874427409, 'samples': 13582080, 'steps': 70739, 'loss/train': 1.4104515314102173} 11/07/2021 07:10:13 - INFO - __main__ - Step 70741: {'lr': 0.00027785551374883197, 'samples': 13582272, 'steps': 70740, 'loss/train': 1.6606214046478271} 11/07/2021 07:10:13 - INFO - __main__ - Step 70742: {'lr': 0.00027785024004237156, 'samples': 13582464, 'steps': 70741, 'loss/train': 2.165208578109741} 11/07/2021 07:10:14 - INFO - __main__ - Step 70743: {'lr': 0.0002778449663233624, 'samples': 13582656, 'steps': 70742, 'loss/train': 1.6874990463256836} 11/07/2021 07:10:14 - INFO - __main__ - Step 70744: {'lr': 0.00027783969259180665, 'samples': 13582848, 'steps': 70743, 'loss/train': 1.5688798427581787} 11/07/2021 07:10:14 - INFO - __main__ - Step 70745: {'lr': 0.00027783441884770676, 'samples': 13583040, 'steps': 70744, 'loss/train': 1.1507110595703125} 11/07/2021 07:10:15 - INFO - __main__ - Step 70746: {'lr': 0.00027782914509106514, 'samples': 13583232, 'steps': 70745, 'loss/train': 0.9193888902664185} 11/07/2021 07:10:16 - INFO - __main__ - Step 70747: {'lr': 0.0002778238713218842, 'samples': 13583424, 'steps': 70746, 'loss/train': 1.3129788637161255} 11/07/2021 07:10:16 - INFO - __main__ - Step 70748: {'lr': 0.0002778185975401662, 'samples': 13583616, 'steps': 70747, 'loss/train': 1.6225484609603882} 11/07/2021 07:10:17 - INFO - __main__ - Step 70749: {'lr': 0.00027781332374591356, 'samples': 13583808, 'steps': 70748, 'loss/train': 2.6078555583953857} 11/07/2021 07:10:17 - INFO - __main__ - Step 70750: {'lr': 0.00027780804993912867, 'samples': 13584000, 'steps': 70749, 'loss/train': 1.210442066192627} 11/07/2021 07:10:18 - INFO - __main__ - Step 70751: {'lr': 0.00027780277611981393, 'samples': 13584192, 'steps': 70750, 'loss/train': 1.470720648765564} 11/07/2021 07:10:18 - INFO - __main__ - Step 70752: {'lr': 0.0002777975022879716, 'samples': 13584384, 'steps': 70751, 'loss/train': 1.2813632488250732} 11/07/2021 07:10:19 - INFO - __main__ - Step 70753: {'lr': 0.00027779222844360427, 'samples': 13584576, 'steps': 70752, 'loss/train': 1.577606201171875} 11/07/2021 07:10:19 - INFO - __main__ - Step 70754: {'lr': 0.00027778695458671406, 'samples': 13584768, 'steps': 70753, 'loss/train': 1.5214762687683105} 11/07/2021 07:10:19 - INFO - __main__ - Step 70755: {'lr': 0.0002777816807173036, 'samples': 13584960, 'steps': 70754, 'loss/train': 1.558935523033142} 11/07/2021 07:10:20 - INFO - __main__ - Step 70756: {'lr': 0.0002777764068353751, 'samples': 13585152, 'steps': 70755, 'loss/train': 0.9329754114151001} 11/07/2021 07:10:21 - INFO - __main__ - Step 70757: {'lr': 0.00027777113294093095, 'samples': 13585344, 'steps': 70756, 'loss/train': 1.37664794921875} 11/07/2021 07:10:21 - INFO - __main__ - Step 70758: {'lr': 0.00027776585903397353, 'samples': 13585536, 'steps': 70757, 'loss/train': 1.7193995714187622} 11/07/2021 07:10:21 - INFO - __main__ - Step 70759: {'lr': 0.0002777605851145053, 'samples': 13585728, 'steps': 70758, 'loss/train': 1.0814361572265625} 11/07/2021 07:10:22 - INFO - __main__ - Step 70760: {'lr': 0.00027775531118252856, 'samples': 13585920, 'steps': 70759, 'loss/train': 1.0217117071151733} 11/07/2021 07:10:23 - INFO - __main__ - Step 70761: {'lr': 0.00027775003723804577, 'samples': 13586112, 'steps': 70760, 'loss/train': 1.6838020086288452} 11/07/2021 07:10:23 - INFO - __main__ - Step 70762: {'lr': 0.00027774476328105914, 'samples': 13586304, 'steps': 70761, 'loss/train': 1.2010635137557983} 11/07/2021 07:10:23 - INFO - __main__ - Step 70763: {'lr': 0.0002777394893115712, 'samples': 13586496, 'steps': 70762, 'loss/train': 1.343734860420227} 11/07/2021 07:10:24 - INFO - __main__ - Step 70764: {'lr': 0.00027773421532958426, 'samples': 13586688, 'steps': 70763, 'loss/train': 1.4629162549972534} 11/07/2021 07:10:24 - INFO - __main__ - Step 70765: {'lr': 0.00027772894133510067, 'samples': 13586880, 'steps': 70764, 'loss/train': 1.3324865102767944} 11/07/2021 07:10:25 - INFO - __main__ - Step 70766: {'lr': 0.00027772366732812295, 'samples': 13587072, 'steps': 70765, 'loss/train': 1.8361135721206665} 11/07/2021 07:10:26 - INFO - __main__ - Step 70767: {'lr': 0.0002777183933086532, 'samples': 13587264, 'steps': 70766, 'loss/train': 1.363357663154602} 11/07/2021 07:10:26 - INFO - __main__ - Step 70768: {'lr': 0.00027771311927669417, 'samples': 13587456, 'steps': 70767, 'loss/train': 1.273458480834961} 11/07/2021 07:10:26 - INFO - __main__ - Step 70769: {'lr': 0.00027770784523224794, 'samples': 13587648, 'steps': 70768, 'loss/train': 1.564810037612915} 11/07/2021 07:10:27 - INFO - __main__ - Step 70770: {'lr': 0.000277702571175317, 'samples': 13587840, 'steps': 70769, 'loss/train': 1.3980605602264404} 11/07/2021 07:10:27 - INFO - __main__ - Step 70771: {'lr': 0.0002776972971059037, 'samples': 13588032, 'steps': 70770, 'loss/train': 1.1488498449325562} 11/07/2021 07:10:28 - INFO - __main__ - Step 70772: {'lr': 0.00027769202302401044, 'samples': 13588224, 'steps': 70771, 'loss/train': 1.7874830961227417} 11/07/2021 07:10:28 - INFO - __main__ - Step 70773: {'lr': 0.0002776867489296395, 'samples': 13588416, 'steps': 70772, 'loss/train': 1.7378891706466675} 11/07/2021 07:10:29 - INFO - __main__ - Step 70774: {'lr': 0.00027768147482279344, 'samples': 13588608, 'steps': 70773, 'loss/train': 1.4706732034683228} 11/07/2021 07:10:29 - INFO - __main__ - Step 70775: {'lr': 0.00027767620070347454, 'samples': 13588800, 'steps': 70774, 'loss/train': 1.75840163230896} 11/07/2021 07:10:29 - INFO - __main__ - Step 70776: {'lr': 0.00027767092657168514, 'samples': 13588992, 'steps': 70775, 'loss/train': 0.9327296018600464} 11/07/2021 07:10:30 - INFO - __main__ - Step 70777: {'lr': 0.0002776656524274276, 'samples': 13589184, 'steps': 70776, 'loss/train': 2.153815984725952} 11/07/2021 07:10:31 - INFO - __main__ - Step 70778: {'lr': 0.0002776603782707044, 'samples': 13589376, 'steps': 70777, 'loss/train': 1.484006404876709} 11/07/2021 07:10:31 - INFO - __main__ - Step 70779: {'lr': 0.0002776551041015178, 'samples': 13589568, 'steps': 70778, 'loss/train': 0.7651481628417969} 11/07/2021 07:10:31 - INFO - __main__ - Step 70780: {'lr': 0.00027764982991987033, 'samples': 13589760, 'steps': 70779, 'loss/train': 1.6515743732452393} 11/07/2021 07:10:32 - INFO - __main__ - Step 70781: {'lr': 0.0002776445557257642, 'samples': 13589952, 'steps': 70780, 'loss/train': 1.1292128562927246} 11/07/2021 07:10:33 - INFO - __main__ - Step 70782: {'lr': 0.00027763928151920193, 'samples': 13590144, 'steps': 70781, 'loss/train': 1.2826687097549438} 11/07/2021 07:10:33 - INFO - __main__ - Step 70783: {'lr': 0.00027763400730018576, 'samples': 13590336, 'steps': 70782, 'loss/train': 1.6203932762145996} 11/07/2021 07:10:34 - INFO - __main__ - Step 70784: {'lr': 0.0002776287330687181, 'samples': 13590528, 'steps': 70783, 'loss/train': 1.575171947479248} 11/07/2021 07:10:34 - INFO - __main__ - Step 70785: {'lr': 0.00027762345882480146, 'samples': 13590720, 'steps': 70784, 'loss/train': 1.3728625774383545} 11/07/2021 07:10:34 - INFO - __main__ - Step 70786: {'lr': 0.000277618184568438, 'samples': 13590912, 'steps': 70785, 'loss/train': 1.3956342935562134} 11/07/2021 07:10:36 - INFO - __main__ - Step 70787: {'lr': 0.0002776129102996303, 'samples': 13591104, 'steps': 70786, 'loss/train': 1.6897698640823364} 11/07/2021 07:10:36 - INFO - __main__ - Step 70788: {'lr': 0.0002776076360183807, 'samples': 13591296, 'steps': 70787, 'loss/train': 1.4961271286010742} 11/07/2021 07:10:36 - INFO - __main__ - Step 70789: {'lr': 0.0002776023617246914, 'samples': 13591488, 'steps': 70788, 'loss/train': 1.8119213581085205} 11/07/2021 07:10:37 - INFO - __main__ - Step 70790: {'lr': 0.00027759708741856493, 'samples': 13591680, 'steps': 70789, 'loss/train': 1.893054485321045} 11/07/2021 07:10:37 - INFO - __main__ - Step 70791: {'lr': 0.0002775918131000037, 'samples': 13591872, 'steps': 70790, 'loss/train': 1.5594568252563477} 11/07/2021 07:10:37 - INFO - __main__ - Step 70792: {'lr': 0.00027758653876900995, 'samples': 13592064, 'steps': 70791, 'loss/train': 2.384723424911499} 11/07/2021 07:10:38 - INFO - __main__ - Step 70793: {'lr': 0.0002775812644255862, 'samples': 13592256, 'steps': 70792, 'loss/train': 3.145094394683838} 11/07/2021 07:10:39 - INFO - __main__ - Step 70794: {'lr': 0.00027757599006973465, 'samples': 13592448, 'steps': 70793, 'loss/train': 1.2721518278121948} 11/07/2021 07:10:39 - INFO - __main__ - Step 70795: {'lr': 0.00027757071570145794, 'samples': 13592640, 'steps': 70794, 'loss/train': 0.9711647629737854} 11/07/2021 07:10:39 - INFO - __main__ - Step 70796: {'lr': 0.0002775654413207582, 'samples': 13592832, 'steps': 70795, 'loss/train': 1.1572130918502808} 11/07/2021 07:10:40 - INFO - __main__ - Step 70797: {'lr': 0.00027756016692763794, 'samples': 13593024, 'steps': 70796, 'loss/train': 1.1679341793060303} 11/07/2021 07:10:41 - INFO - __main__ - Step 70798: {'lr': 0.0002775548925220994, 'samples': 13593216, 'steps': 70797, 'loss/train': 1.3693984746932983} 11/07/2021 07:10:41 - INFO - __main__ - Step 70799: {'lr': 0.00027754961810414516, 'samples': 13593408, 'steps': 70798, 'loss/train': 1.599058747291565} 11/07/2021 07:10:41 - INFO - __main__ - Step 70800: {'lr': 0.0002775443436737774, 'samples': 13593600, 'steps': 70799, 'loss/train': 1.5106333494186401} 11/07/2021 07:10:42 - INFO - __main__ - Step 70801: {'lr': 0.00027753906923099863, 'samples': 13593792, 'steps': 70800, 'loss/train': 0.8597078323364258} 11/07/2021 07:10:42 - INFO - __main__ - Step 70802: {'lr': 0.0002775337947758112, 'samples': 13593984, 'steps': 70801, 'loss/train': 1.5813246965408325} 11/07/2021 07:10:43 - INFO - __main__ - Step 70803: {'lr': 0.00027752852030821744, 'samples': 13594176, 'steps': 70802, 'loss/train': 1.215569257736206} 11/07/2021 07:10:44 - INFO - __main__ - Step 70804: {'lr': 0.00027752324582821977, 'samples': 13594368, 'steps': 70803, 'loss/train': 1.4612300395965576} 11/07/2021 07:10:44 - INFO - __main__ - Step 70805: {'lr': 0.0002775179713358205, 'samples': 13594560, 'steps': 70804, 'loss/train': 1.549971103668213} 11/07/2021 07:10:44 - INFO - __main__ - Step 70806: {'lr': 0.0002775126968310221, 'samples': 13594752, 'steps': 70805, 'loss/train': 1.3063522577285767} 11/07/2021 07:10:45 - INFO - __main__ - Step 70807: {'lr': 0.00027750742231382684, 'samples': 13594944, 'steps': 70806, 'loss/train': 1.18684983253479} 11/07/2021 07:10:46 - INFO - __main__ - Step 70808: {'lr': 0.0002775021477842373, 'samples': 13595136, 'steps': 70807, 'loss/train': 0.9732040166854858} 11/07/2021 07:10:46 - INFO - __main__ - Step 70809: {'lr': 0.00027749687324225565, 'samples': 13595328, 'steps': 70808, 'loss/train': 1.0323837995529175} 11/07/2021 07:10:46 - INFO - __main__ - Step 70810: {'lr': 0.0002774915986878843, 'samples': 13595520, 'steps': 70809, 'loss/train': 1.7401589155197144} 11/07/2021 07:10:47 - INFO - __main__ - Step 70811: {'lr': 0.0002774863241211257, 'samples': 13595712, 'steps': 70810, 'loss/train': 1.544440746307373} 11/07/2021 07:10:47 - INFO - __main__ - Step 70812: {'lr': 0.0002774810495419821, 'samples': 13595904, 'steps': 70811, 'loss/train': 1.3379720449447632} 11/07/2021 07:10:48 - INFO - __main__ - Step 70813: {'lr': 0.00027747577495045603, 'samples': 13596096, 'steps': 70812, 'loss/train': 1.455852746963501} 11/07/2021 07:10:48 - INFO - __main__ - Step 70814: {'lr': 0.0002774705003465498, 'samples': 13596288, 'steps': 70813, 'loss/train': 1.718490481376648} 11/07/2021 07:10:49 - INFO - __main__ - Step 70815: {'lr': 0.0002774652257302658, 'samples': 13596480, 'steps': 70814, 'loss/train': 1.0712023973464966} 11/07/2021 07:10:49 - INFO - __main__ - Step 70816: {'lr': 0.00027745995110160635, 'samples': 13596672, 'steps': 70815, 'loss/train': 1.4272147417068481} 11/07/2021 07:10:49 - INFO - __main__ - Step 70817: {'lr': 0.0002774546764605739, 'samples': 13596864, 'steps': 70816, 'loss/train': 1.069909691810608} 11/07/2021 07:10:50 - INFO - __main__ - Step 70818: {'lr': 0.0002774494018071708, 'samples': 13597056, 'steps': 70817, 'loss/train': 1.4176558256149292} 11/07/2021 07:10:51 - INFO - __main__ - Step 70819: {'lr': 0.00027744412714139936, 'samples': 13597248, 'steps': 70818, 'loss/train': 1.3706468343734741} 11/07/2021 07:10:51 - INFO - __main__ - Step 70820: {'lr': 0.0002774388524632621, 'samples': 13597440, 'steps': 70819, 'loss/train': 1.5750672817230225} 11/07/2021 07:10:51 - INFO - __main__ - Step 70821: {'lr': 0.0002774335777727613, 'samples': 13597632, 'steps': 70820, 'loss/train': 1.4771288633346558} 11/07/2021 07:10:52 - INFO - __main__ - Step 70822: {'lr': 0.0002774283030698994, 'samples': 13597824, 'steps': 70821, 'loss/train': 1.5569273233413696} 11/07/2021 07:10:52 - INFO - __main__ - Step 70823: {'lr': 0.00027742302835467863, 'samples': 13598016, 'steps': 70822, 'loss/train': 1.4754000902175903} 11/07/2021 07:10:53 - INFO - __main__ - Step 70824: {'lr': 0.00027741775362710155, 'samples': 13598208, 'steps': 70823, 'loss/train': 1.6979821920394897} 11/07/2021 07:10:54 - INFO - __main__ - Step 70825: {'lr': 0.00027741247888717036, 'samples': 13598400, 'steps': 70824, 'loss/train': 1.483029842376709} 11/07/2021 07:10:54 - INFO - __main__ - Step 70826: {'lr': 0.00027740720413488756, 'samples': 13598592, 'steps': 70825, 'loss/train': 1.5064902305603027} 11/07/2021 07:10:54 - INFO - __main__ - Step 70827: {'lr': 0.00027740192937025554, 'samples': 13598784, 'steps': 70826, 'loss/train': 1.5071929693222046} 11/07/2021 07:10:55 - INFO - __main__ - Step 70828: {'lr': 0.0002773966545932767, 'samples': 13598976, 'steps': 70827, 'loss/train': 1.1448562145233154} 11/07/2021 07:10:56 - INFO - __main__ - Step 70829: {'lr': 0.00027739137980395325, 'samples': 13599168, 'steps': 70828, 'loss/train': 1.4246089458465576} 11/07/2021 07:10:56 - INFO - __main__ - Step 70830: {'lr': 0.0002773861050022876, 'samples': 13599360, 'steps': 70829, 'loss/train': 1.4266833066940308} 11/07/2021 07:10:56 - INFO - __main__ - Step 70831: {'lr': 0.0002773808301882823, 'samples': 13599552, 'steps': 70830, 'loss/train': 1.4436691999435425} 11/07/2021 07:10:57 - INFO - __main__ - Step 70832: {'lr': 0.0002773755553619396, 'samples': 13599744, 'steps': 70831, 'loss/train': 1.5150357484817505} 11/07/2021 07:10:57 - INFO - __main__ - Step 70833: {'lr': 0.00027737028052326183, 'samples': 13599936, 'steps': 70832, 'loss/train': 2.1907618045806885} 11/07/2021 07:10:58 - INFO - __main__ - Step 70834: {'lr': 0.0002773650056722516, 'samples': 13600128, 'steps': 70833, 'loss/train': 1.2962589263916016} 11/07/2021 07:10:59 - INFO - __main__ - Step 70835: {'lr': 0.00027735973080891097, 'samples': 13600320, 'steps': 70834, 'loss/train': 1.0993216037750244} 11/07/2021 07:10:59 - INFO - __main__ - Step 70836: {'lr': 0.00027735445593324255, 'samples': 13600512, 'steps': 70835, 'loss/train': 1.495584487915039} 11/07/2021 07:10:59 - INFO - __main__ - Step 70837: {'lr': 0.0002773491810452486, 'samples': 13600704, 'steps': 70836, 'loss/train': 1.566949725151062} 11/07/2021 07:11:00 - INFO - __main__ - Step 70838: {'lr': 0.0002773439061449315, 'samples': 13600896, 'steps': 70837, 'loss/train': 1.8302123546600342} 11/07/2021 07:11:01 - INFO - __main__ - Step 70839: {'lr': 0.0002773386312322937, 'samples': 13601088, 'steps': 70838, 'loss/train': 1.3197729587554932} 11/07/2021 07:11:01 - INFO - __main__ - Step 70840: {'lr': 0.0002773333563073375, 'samples': 13601280, 'steps': 70839, 'loss/train': 1.5789841413497925} 11/07/2021 07:11:02 - INFO - __main__ - Step 70841: {'lr': 0.0002773280813700654, 'samples': 13601472, 'steps': 70840, 'loss/train': 1.1554509401321411} 11/07/2021 07:11:02 - INFO - __main__ - Step 70842: {'lr': 0.0002773228064204796, 'samples': 13601664, 'steps': 70841, 'loss/train': 1.7769358158111572} 11/07/2021 07:11:02 - INFO - __main__ - Step 70843: {'lr': 0.00027731753145858256, 'samples': 13601856, 'steps': 70842, 'loss/train': 1.396985411643982} 11/07/2021 07:11:03 - INFO - __main__ - Step 70844: {'lr': 0.00027731225648437675, 'samples': 13602048, 'steps': 70843, 'loss/train': 2.8566133975982666} 11/07/2021 07:11:04 - INFO - __main__ - Step 70845: {'lr': 0.0002773069814978644, 'samples': 13602240, 'steps': 70844, 'loss/train': 1.093920111656189} 11/07/2021 07:11:04 - INFO - __main__ - Step 70846: {'lr': 0.0002773017064990479, 'samples': 13602432, 'steps': 70845, 'loss/train': 2.0141384601593018} 11/07/2021 07:11:05 - INFO - __main__ - Step 70847: {'lr': 0.0002772964314879297, 'samples': 13602624, 'steps': 70846, 'loss/train': 1.63483464717865} 11/07/2021 07:11:05 - INFO - __main__ - Step 70848: {'lr': 0.0002772911564645122, 'samples': 13602816, 'steps': 70847, 'loss/train': 1.6422630548477173} 11/07/2021 07:11:05 - INFO - __main__ - Step 70849: {'lr': 0.0002772858814287976, 'samples': 13603008, 'steps': 70848, 'loss/train': 1.5397984981536865} 11/07/2021 07:11:06 - INFO - __main__ - Step 70850: {'lr': 0.0002772806063807886, 'samples': 13603200, 'steps': 70849, 'loss/train': 1.097475290298462} 11/07/2021 07:11:07 - INFO - __main__ - Step 70851: {'lr': 0.00027727533132048727, 'samples': 13603392, 'steps': 70850, 'loss/train': 1.705122709274292} 11/07/2021 07:11:07 - INFO - __main__ - Step 70852: {'lr': 0.0002772700562478961, 'samples': 13603584, 'steps': 70851, 'loss/train': 1.3405554294586182} 11/07/2021 07:11:07 - INFO - __main__ - Step 70853: {'lr': 0.00027726478116301746, 'samples': 13603776, 'steps': 70852, 'loss/train': 1.434833288192749} 11/07/2021 07:11:08 - INFO - __main__ - Step 70854: {'lr': 0.0002772595060658537, 'samples': 13603968, 'steps': 70853, 'loss/train': 1.3377561569213867} 11/07/2021 07:11:08 - INFO - __main__ - Step 70855: {'lr': 0.0002772542309564072, 'samples': 13604160, 'steps': 70854, 'loss/train': 0.9256234169006348} 11/07/2021 07:11:09 - INFO - __main__ - Step 70856: {'lr': 0.0002772489558346805, 'samples': 13604352, 'steps': 70855, 'loss/train': 1.1197384595870972} 11/07/2021 07:11:10 - INFO - __main__ - Step 70857: {'lr': 0.00027724368070067577, 'samples': 13604544, 'steps': 70856, 'loss/train': 1.7131011486053467} 11/07/2021 07:11:10 - INFO - __main__ - Step 70858: {'lr': 0.0002772384055543954, 'samples': 13604736, 'steps': 70857, 'loss/train': 1.0418717861175537} 11/07/2021 07:11:10 - INFO - __main__ - Step 70859: {'lr': 0.0002772331303958419, 'samples': 13604928, 'steps': 70858, 'loss/train': 1.775635004043579} 11/07/2021 07:11:11 - INFO - __main__ - Step 70860: {'lr': 0.0002772278552250176, 'samples': 13605120, 'steps': 70859, 'loss/train': 1.5911098718643188} 11/07/2021 07:11:12 - INFO - __main__ - Step 70861: {'lr': 0.00027722258004192474, 'samples': 13605312, 'steps': 70860, 'loss/train': 1.6606075763702393} 11/07/2021 07:11:12 - INFO - __main__ - Step 70862: {'lr': 0.0002772173048465659, 'samples': 13605504, 'steps': 70861, 'loss/train': 1.6655049324035645} 11/07/2021 07:11:12 - INFO - __main__ - Step 70863: {'lr': 0.0002772120296389433, 'samples': 13605696, 'steps': 70862, 'loss/train': 1.1075948476791382} 11/07/2021 07:11:13 - INFO - __main__ - Step 70864: {'lr': 0.00027720675441905945, 'samples': 13605888, 'steps': 70863, 'loss/train': 1.7514095306396484} 11/07/2021 07:11:13 - INFO - __main__ - Step 70865: {'lr': 0.0002772014791869166, 'samples': 13606080, 'steps': 70864, 'loss/train': 1.3979367017745972} 11/07/2021 07:11:14 - INFO - __main__ - Step 70866: {'lr': 0.0002771962039425172, 'samples': 13606272, 'steps': 70865, 'loss/train': 1.4313197135925293} 11/07/2021 07:11:15 - INFO - __main__ - Step 70867: {'lr': 0.0002771909286858636, 'samples': 13606464, 'steps': 70866, 'loss/train': 1.2459571361541748} 11/07/2021 07:11:15 - INFO - __main__ - Step 70868: {'lr': 0.0002771856534169582, 'samples': 13606656, 'steps': 70867, 'loss/train': 1.2727289199829102} 11/07/2021 07:11:16 - INFO - __main__ - Step 70869: {'lr': 0.0002771803781358034, 'samples': 13606848, 'steps': 70868, 'loss/train': 0.1536039561033249} 11/07/2021 07:11:16 - INFO - __main__ - Step 70870: {'lr': 0.0002771751028424014, 'samples': 13607040, 'steps': 70869, 'loss/train': 1.2978876829147339} 11/07/2021 07:11:16 - INFO - __main__ - Step 70871: {'lr': 0.00027716982753675485, 'samples': 13607232, 'steps': 70870, 'loss/train': 1.352763295173645} 11/07/2021 07:11:17 - INFO - __main__ - Step 70872: {'lr': 0.00027716455221886595, 'samples': 13607424, 'steps': 70871, 'loss/train': 1.2685052156448364} 11/07/2021 07:11:18 - INFO - __main__ - Step 70873: {'lr': 0.00027715927688873717, 'samples': 13607616, 'steps': 70872, 'loss/train': 1.2824088335037231} 11/07/2021 07:11:18 - INFO - __main__ - Step 70874: {'lr': 0.0002771540015463708, 'samples': 13607808, 'steps': 70873, 'loss/train': 1.4337971210479736} 11/07/2021 07:11:19 - INFO - __main__ - Step 70875: {'lr': 0.0002771487261917692, 'samples': 13608000, 'steps': 70874, 'loss/train': 0.3502437472343445} 11/07/2021 07:11:19 - INFO - __main__ - Step 70876: {'lr': 0.00027714345082493493, 'samples': 13608192, 'steps': 70875, 'loss/train': 1.262947678565979} 11/07/2021 07:11:20 - INFO - __main__ - Step 70877: {'lr': 0.00027713817544587014, 'samples': 13608384, 'steps': 70876, 'loss/train': 1.5394175052642822} 11/07/2021 07:11:20 - INFO - __main__ - Step 70878: {'lr': 0.00027713290005457734, 'samples': 13608576, 'steps': 70877, 'loss/train': 1.5276885032653809} 11/07/2021 07:11:21 - INFO - __main__ - Step 70879: {'lr': 0.00027712762465105886, 'samples': 13608768, 'steps': 70878, 'loss/train': 1.3905174732208252} 11/07/2021 07:11:21 - INFO - __main__ - Step 70880: {'lr': 0.0002771223492353171, 'samples': 13608960, 'steps': 70879, 'loss/train': 1.5495288372039795} 11/07/2021 07:11:21 - INFO - __main__ - Step 70881: {'lr': 0.0002771170738073544, 'samples': 13609152, 'steps': 70880, 'loss/train': 1.5751477479934692} 11/07/2021 07:11:22 - INFO - __main__ - Step 70882: {'lr': 0.0002771117983671733, 'samples': 13609344, 'steps': 70881, 'loss/train': 1.7494494915008545} 11/07/2021 07:11:23 - INFO - __main__ - Step 70883: {'lr': 0.0002771065229147759, 'samples': 13609536, 'steps': 70882, 'loss/train': 1.223415493965149} 11/07/2021 07:11:23 - INFO - __main__ - Step 70884: {'lr': 0.0002771012474501647, 'samples': 13609728, 'steps': 70883, 'loss/train': 1.4108664989471436} 11/07/2021 07:11:23 - INFO - __main__ - Step 70885: {'lr': 0.0002770959719733422, 'samples': 13609920, 'steps': 70884, 'loss/train': 1.2523388862609863} 11/07/2021 07:11:24 - INFO - __main__ - Step 70886: {'lr': 0.00027709069648431056, 'samples': 13610112, 'steps': 70885, 'loss/train': 1.4009748697280884} 11/07/2021 07:11:25 - INFO - __main__ - Step 70887: {'lr': 0.0002770854209830724, 'samples': 13610304, 'steps': 70886, 'loss/train': 1.3797515630722046} 11/07/2021 07:11:25 - INFO - __main__ - Step 70888: {'lr': 0.00027708014546962986, 'samples': 13610496, 'steps': 70887, 'loss/train': 5.759355545043945} 11/07/2021 07:11:25 - INFO - __main__ - Step 70889: {'lr': 0.0002770748699439855, 'samples': 13610688, 'steps': 70888, 'loss/train': 1.4690853357315063} 11/07/2021 07:11:26 - INFO - __main__ - Step 70890: {'lr': 0.0002770695944061416, 'samples': 13610880, 'steps': 70889, 'loss/train': 1.4233694076538086} 11/07/2021 07:11:26 - INFO - __main__ - Step 70891: {'lr': 0.00027706431885610053, 'samples': 13611072, 'steps': 70890, 'loss/train': 1.9059330224990845} 11/07/2021 07:11:27 - INFO - __main__ - Step 70892: {'lr': 0.0002770590432938647, 'samples': 13611264, 'steps': 70891, 'loss/train': 1.5079165697097778} 11/07/2021 07:11:28 - INFO - __main__ - Step 70893: {'lr': 0.00027705376771943645, 'samples': 13611456, 'steps': 70892, 'loss/train': 1.5530319213867188} 11/07/2021 07:11:28 - INFO - __main__ - Step 70894: {'lr': 0.00027704849213281823, 'samples': 13611648, 'steps': 70893, 'loss/train': 1.4788427352905273} 11/07/2021 07:11:28 - INFO - __main__ - Step 70895: {'lr': 0.00027704321653401244, 'samples': 13611840, 'steps': 70894, 'loss/train': 1.526125192642212} 11/07/2021 07:11:29 - INFO - __main__ - Step 70896: {'lr': 0.00027703794092302135, 'samples': 13612032, 'steps': 70895, 'loss/train': 1.356782078742981} 11/07/2021 07:11:29 - INFO - __main__ - Step 70897: {'lr': 0.0002770326652998473, 'samples': 13612224, 'steps': 70896, 'loss/train': 1.2296890020370483} 11/07/2021 07:11:30 - INFO - __main__ - Step 70898: {'lr': 0.0002770273896644929, 'samples': 13612416, 'steps': 70897, 'loss/train': 1.4579156637191772} 11/07/2021 07:11:30 - INFO - __main__ - Step 70899: {'lr': 0.00027702211401696024, 'samples': 13612608, 'steps': 70898, 'loss/train': 1.2825143337249756} 11/07/2021 07:11:31 - INFO - __main__ - Step 70900: {'lr': 0.0002770168383572519, 'samples': 13612800, 'steps': 70899, 'loss/train': 1.340142846107483} 11/07/2021 07:11:31 - INFO - __main__ - Step 70901: {'lr': 0.00027701156268537016, 'samples': 13612992, 'steps': 70900, 'loss/train': 0.4394263029098511} 11/07/2021 07:11:31 - INFO - __main__ - Step 70902: {'lr': 0.0002770062870013174, 'samples': 13613184, 'steps': 70901, 'loss/train': 1.341834306716919} 11/07/2021 07:11:33 - INFO - __main__ - Step 70903: {'lr': 0.00027700101130509615, 'samples': 13613376, 'steps': 70902, 'loss/train': 1.724833607673645} 11/07/2021 07:11:33 - INFO - __main__ - Step 70904: {'lr': 0.00027699573559670853, 'samples': 13613568, 'steps': 70903, 'loss/train': 1.241002082824707} 11/07/2021 07:11:33 - INFO - __main__ - Step 70905: {'lr': 0.0002769904598761571, 'samples': 13613760, 'steps': 70904, 'loss/train': 1.7125757932662964} 11/07/2021 07:11:34 - INFO - __main__ - Step 70906: {'lr': 0.00027698518414344414, 'samples': 13613952, 'steps': 70905, 'loss/train': 1.3418973684310913} 11/07/2021 07:11:34 - INFO - __main__ - Step 70907: {'lr': 0.00027697990839857214, 'samples': 13614144, 'steps': 70906, 'loss/train': 1.617421269416809} 11/07/2021 07:11:35 - INFO - __main__ - Step 70908: {'lr': 0.0002769746326415433, 'samples': 13614336, 'steps': 70907, 'loss/train': 1.3237165212631226} 11/07/2021 07:11:36 - INFO - __main__ - Step 70909: {'lr': 0.0002769693568723603, 'samples': 13614528, 'steps': 70908, 'loss/train': 1.5012925863265991} 11/07/2021 07:11:36 - INFO - __main__ - Step 70910: {'lr': 0.00027696408109102516, 'samples': 13614720, 'steps': 70909, 'loss/train': 1.3399527072906494} 11/07/2021 07:11:36 - INFO - __main__ - Step 70911: {'lr': 0.00027695880529754046, 'samples': 13614912, 'steps': 70910, 'loss/train': 1.5068778991699219} 11/07/2021 07:11:37 - INFO - __main__ - Step 70912: {'lr': 0.0002769535294919086, 'samples': 13615104, 'steps': 70911, 'loss/train': 1.64387047290802} 11/07/2021 07:11:37 - INFO - __main__ - Step 70913: {'lr': 0.0002769482536741318, 'samples': 13615296, 'steps': 70912, 'loss/train': 1.6188863515853882} 11/07/2021 07:11:38 - INFO - __main__ - Step 70914: {'lr': 0.0002769429778442126, 'samples': 13615488, 'steps': 70913, 'loss/train': 1.8692851066589355} 11/07/2021 07:11:39 - INFO - __main__ - Step 70915: {'lr': 0.00027693770200215323, 'samples': 13615680, 'steps': 70914, 'loss/train': 1.3185337781906128} 11/07/2021 07:11:39 - INFO - __main__ - Step 70916: {'lr': 0.00027693242614795625, 'samples': 13615872, 'steps': 70915, 'loss/train': 1.2917298078536987} 11/07/2021 07:11:39 - INFO - __main__ - Step 70917: {'lr': 0.0002769271502816239, 'samples': 13616064, 'steps': 70916, 'loss/train': 1.5682700872421265} 11/07/2021 07:11:40 - INFO - __main__ - Step 70918: {'lr': 0.00027692187440315856, 'samples': 13616256, 'steps': 70917, 'loss/train': 1.353767991065979} 11/07/2021 07:11:41 - INFO - __main__ - Step 70919: {'lr': 0.0002769165985125627, 'samples': 13616448, 'steps': 70918, 'loss/train': 1.3526307344436646} 11/07/2021 07:11:41 - INFO - __main__ - Step 70920: {'lr': 0.00027691132260983855, 'samples': 13616640, 'steps': 70919, 'loss/train': 1.5744352340698242} 11/07/2021 07:11:41 - INFO - __main__ - Step 70921: {'lr': 0.0002769060466949886, 'samples': 13616832, 'steps': 70920, 'loss/train': 1.2083989381790161} 11/07/2021 07:11:42 - INFO - __main__ - Step 70922: {'lr': 0.00027690077076801523, 'samples': 13617024, 'steps': 70921, 'loss/train': 1.5060676336288452} 11/07/2021 07:11:42 - INFO - __main__ - Step 70923: {'lr': 0.00027689549482892077, 'samples': 13617216, 'steps': 70922, 'loss/train': 1.7911828756332397} 11/07/2021 07:11:43 - INFO - __main__ - Step 70924: {'lr': 0.00027689021887770764, 'samples': 13617408, 'steps': 70923, 'loss/train': 1.6195955276489258} 11/07/2021 07:11:43 - INFO - __main__ - Step 70925: {'lr': 0.00027688494291437817, 'samples': 13617600, 'steps': 70924, 'loss/train': 0.8282322883605957} 11/07/2021 07:11:44 - INFO - __main__ - Step 70926: {'lr': 0.00027687966693893475, 'samples': 13617792, 'steps': 70925, 'loss/train': 1.4763137102127075} 11/07/2021 07:11:44 - INFO - __main__ - Step 70927: {'lr': 0.0002768743909513798, 'samples': 13617984, 'steps': 70926, 'loss/train': 0.35663241147994995} 11/07/2021 07:11:44 - INFO - __main__ - Step 70928: {'lr': 0.00027686911495171564, 'samples': 13618176, 'steps': 70927, 'loss/train': 0.8945875763893127} 11/07/2021 07:11:45 - INFO - __main__ - Step 70929: {'lr': 0.0002768638389399447, 'samples': 13618368, 'steps': 70928, 'loss/train': 1.5209527015686035} 11/07/2021 07:11:46 - INFO - __main__ - Step 70930: {'lr': 0.00027685856291606933, 'samples': 13618560, 'steps': 70929, 'loss/train': 1.3235963582992554} 11/07/2021 07:11:46 - INFO - __main__ - Step 70931: {'lr': 0.00027685328688009187, 'samples': 13618752, 'steps': 70930, 'loss/train': 0.9927089810371399} 11/07/2021 07:11:46 - INFO - __main__ - Step 70932: {'lr': 0.0002768480108320147, 'samples': 13618944, 'steps': 70931, 'loss/train': 1.5759927034378052} 11/07/2021 07:11:47 - INFO - __main__ - Step 70933: {'lr': 0.0002768427347718403, 'samples': 13619136, 'steps': 70932, 'loss/train': 1.0105780363082886} 11/07/2021 07:11:48 - INFO - __main__ - Step 70934: {'lr': 0.00027683745869957094, 'samples': 13619328, 'steps': 70933, 'loss/train': 1.3825020790100098} 11/07/2021 07:11:48 - INFO - __main__ - Step 70935: {'lr': 0.00027683218261520906, 'samples': 13619520, 'steps': 70934, 'loss/train': 0.6810600757598877} 11/07/2021 07:11:49 - INFO - __main__ - Step 70936: {'lr': 0.000276826906518757, 'samples': 13619712, 'steps': 70935, 'loss/train': 1.6612778902053833} 11/07/2021 07:11:49 - INFO - __main__ - Step 70937: {'lr': 0.00027682163041021715, 'samples': 13619904, 'steps': 70936, 'loss/train': 1.3168389797210693} 11/07/2021 07:11:49 - INFO - __main__ - Step 70938: {'lr': 0.0002768163542895919, 'samples': 13620096, 'steps': 70937, 'loss/train': 1.7175992727279663} 11/07/2021 07:11:50 - INFO - __main__ - Step 70939: {'lr': 0.00027681107815688354, 'samples': 13620288, 'steps': 70938, 'loss/train': 1.5328965187072754} 11/07/2021 07:11:51 - INFO - __main__ - Step 70940: {'lr': 0.0002768058020120946, 'samples': 13620480, 'steps': 70939, 'loss/train': 1.6053578853607178} 11/07/2021 07:11:51 - INFO - __main__ - Step 70941: {'lr': 0.00027680052585522737, 'samples': 13620672, 'steps': 70940, 'loss/train': 0.12172715365886688} 11/07/2021 07:11:52 - INFO - __main__ - Step 70942: {'lr': 0.0002767952496862842, 'samples': 13620864, 'steps': 70941, 'loss/train': 1.328261137008667} 11/07/2021 07:11:52 - INFO - __main__ - Step 70943: {'lr': 0.0002767899735052676, 'samples': 13621056, 'steps': 70942, 'loss/train': 1.4933152198791504} 11/07/2021 07:11:52 - INFO - __main__ - Step 70944: {'lr': 0.00027678469731217976, 'samples': 13621248, 'steps': 70943, 'loss/train': 1.3249025344848633} 11/07/2021 07:11:53 - INFO - __main__ - Step 70945: {'lr': 0.0002767794211070232, 'samples': 13621440, 'steps': 70944, 'loss/train': 1.5616495609283447} 11/07/2021 07:11:54 - INFO - __main__ - Step 70946: {'lr': 0.00027677414488980017, 'samples': 13621632, 'steps': 70945, 'loss/train': 1.378873348236084} 11/07/2021 07:11:54 - INFO - __main__ - Step 70947: {'lr': 0.0002767688686605132, 'samples': 13621824, 'steps': 70946, 'loss/train': 1.3232896327972412} 11/07/2021 07:11:54 - INFO - __main__ - Step 70948: {'lr': 0.0002767635924191645, 'samples': 13622016, 'steps': 70947, 'loss/train': 0.950057864189148} 11/07/2021 07:11:55 - INFO - __main__ - Step 70949: {'lr': 0.00027675831616575666, 'samples': 13622208, 'steps': 70948, 'loss/train': 0.9325686693191528} 11/07/2021 07:11:56 - INFO - __main__ - Step 70950: {'lr': 0.00027675303990029186, 'samples': 13622400, 'steps': 70949, 'loss/train': 0.9676892161369324} 11/07/2021 07:11:56 - INFO - __main__ - Step 70951: {'lr': 0.0002767477636227726, 'samples': 13622592, 'steps': 70950, 'loss/train': 1.7105157375335693} 11/07/2021 07:11:57 - INFO - __main__ - Step 70952: {'lr': 0.00027674248733320115, 'samples': 13622784, 'steps': 70951, 'loss/train': 1.323728084564209} 11/07/2021 07:11:57 - INFO - __main__ - Step 70953: {'lr': 0.00027673721103158, 'samples': 13622976, 'steps': 70952, 'loss/train': 0.6903629302978516} 11/07/2021 07:11:57 - INFO - __main__ - Step 70954: {'lr': 0.0002767319347179115, 'samples': 13623168, 'steps': 70953, 'loss/train': 1.8537580966949463} 11/07/2021 07:11:58 - INFO - __main__ - Step 70955: {'lr': 0.0002767266583921979, 'samples': 13623360, 'steps': 70954, 'loss/train': 2.056684970855713} 11/07/2021 07:11:58 - INFO - __main__ - Step 70956: {'lr': 0.00027672138205444175, 'samples': 13623552, 'steps': 70955, 'loss/train': 1.210081934928894} 11/07/2021 07:11:59 - INFO - __main__ - Step 70957: {'lr': 0.0002767161057046453, 'samples': 13623744, 'steps': 70956, 'loss/train': 0.9634692072868347} 11/07/2021 07:11:59 - INFO - __main__ - Step 70958: {'lr': 0.0002767108293428111, 'samples': 13623936, 'steps': 70957, 'loss/train': 1.272959589958191} 11/07/2021 07:12:00 - INFO - __main__ - Step 70959: {'lr': 0.00027670555296894134, 'samples': 13624128, 'steps': 70958, 'loss/train': 1.5295593738555908} 11/07/2021 07:12:01 - INFO - __main__ - Step 70960: {'lr': 0.00027670027658303843, 'samples': 13624320, 'steps': 70959, 'loss/train': 1.511085867881775} 11/07/2021 07:12:01 - INFO - __main__ - Step 70961: {'lr': 0.00027669500018510484, 'samples': 13624512, 'steps': 70960, 'loss/train': 1.49752676486969} 11/07/2021 07:12:02 - INFO - __main__ - Step 70962: {'lr': 0.00027668972377514295, 'samples': 13624704, 'steps': 70961, 'loss/train': 1.2016971111297607} 11/07/2021 07:12:02 - INFO - __main__ - Step 70963: {'lr': 0.00027668444735315503, 'samples': 13624896, 'steps': 70962, 'loss/train': 1.640584111213684} 11/07/2021 07:12:02 - INFO - __main__ - Step 70964: {'lr': 0.0002766791709191435, 'samples': 13625088, 'steps': 70963, 'loss/train': 0.43915194272994995} 11/07/2021 07:12:03 - INFO - __main__ - Step 70965: {'lr': 0.0002766738944731107, 'samples': 13625280, 'steps': 70964, 'loss/train': 1.219412922859192} 11/07/2021 07:12:04 - INFO - __main__ - Step 70966: {'lr': 0.00027666861801505904, 'samples': 13625472, 'steps': 70965, 'loss/train': 0.48083341121673584} 11/07/2021 07:12:04 - INFO - __main__ - Step 70967: {'lr': 0.000276663341544991, 'samples': 13625664, 'steps': 70966, 'loss/train': 1.3608146905899048} 11/07/2021 07:12:04 - INFO - __main__ - Step 70968: {'lr': 0.0002766580650629089, 'samples': 13625856, 'steps': 70967, 'loss/train': 0.8792392611503601} 11/07/2021 07:12:05 - INFO - __main__ - Step 70969: {'lr': 0.00027665278856881496, 'samples': 13626048, 'steps': 70968, 'loss/train': 1.158103108406067} 11/07/2021 07:12:05 - INFO - __main__ - Step 70970: {'lr': 0.00027664751206271177, 'samples': 13626240, 'steps': 70969, 'loss/train': 1.4205186367034912} 11/07/2021 07:12:06 - INFO - __main__ - Step 70971: {'lr': 0.00027664223554460163, 'samples': 13626432, 'steps': 70970, 'loss/train': 1.7931982278823853} 11/07/2021 07:12:06 - INFO - __main__ - Step 70972: {'lr': 0.0002766369590144869, 'samples': 13626624, 'steps': 70971, 'loss/train': 0.7350602746009827} 11/07/2021 07:12:07 - INFO - __main__ - Step 70973: {'lr': 0.00027663168247236996, 'samples': 13626816, 'steps': 70972, 'loss/train': 1.7557532787322998} 11/07/2021 07:12:07 - INFO - __main__ - Step 70974: {'lr': 0.00027662640591825314, 'samples': 13627008, 'steps': 70973, 'loss/train': 1.769303560256958} 11/07/2021 07:12:08 - INFO - __main__ - Step 70975: {'lr': 0.000276621129352139, 'samples': 13627200, 'steps': 70974, 'loss/train': 1.311364769935608} 11/07/2021 07:12:09 - INFO - __main__ - Step 70976: {'lr': 0.0002766158527740297, 'samples': 13627392, 'steps': 70975, 'loss/train': 1.4112902879714966} 11/07/2021 07:12:09 - INFO - __main__ - Step 70977: {'lr': 0.00027661057618392766, 'samples': 13627584, 'steps': 70976, 'loss/train': 1.2742857933044434} 11/07/2021 07:12:10 - INFO - __main__ - Step 70978: {'lr': 0.00027660529958183533, 'samples': 13627776, 'steps': 70977, 'loss/train': 1.876170039176941} 11/07/2021 07:12:10 - INFO - __main__ - Step 70979: {'lr': 0.00027660002296775514, 'samples': 13627968, 'steps': 70978, 'loss/train': 1.525134801864624} 11/07/2021 07:12:10 - INFO - __main__ - Step 70980: {'lr': 0.00027659474634168937, 'samples': 13628160, 'steps': 70979, 'loss/train': 1.5416964292526245} 11/07/2021 07:12:11 - INFO - __main__ - Step 70981: {'lr': 0.00027658946970364034, 'samples': 13628352, 'steps': 70980, 'loss/train': 0.4218328893184662} 11/07/2021 07:12:12 - INFO - __main__ - Step 70982: {'lr': 0.0002765841930536106, 'samples': 13628544, 'steps': 70981, 'loss/train': 1.0746325254440308} 11/07/2021 07:12:12 - INFO - __main__ - Step 70983: {'lr': 0.0002765789163916024, 'samples': 13628736, 'steps': 70982, 'loss/train': 1.495705485343933} 11/07/2021 07:12:12 - INFO - __main__ - Step 70984: {'lr': 0.0002765736397176182, 'samples': 13628928, 'steps': 70983, 'loss/train': 1.4975239038467407} 11/07/2021 07:12:13 - INFO - __main__ - Step 70985: {'lr': 0.0002765683630316602, 'samples': 13629120, 'steps': 70984, 'loss/train': 1.347715139389038} 11/07/2021 07:12:14 - INFO - __main__ - Step 70986: {'lr': 0.000276563086333731, 'samples': 13629312, 'steps': 70985, 'loss/train': 1.5313959121704102} 11/07/2021 07:12:14 - INFO - __main__ - Step 70987: {'lr': 0.0002765578096238328, 'samples': 13629504, 'steps': 70986, 'loss/train': 1.9417791366577148} 11/07/2021 07:12:15 - INFO - __main__ - Step 70988: {'lr': 0.0002765525329019681, 'samples': 13629696, 'steps': 70987, 'loss/train': 1.4783977270126343} 11/07/2021 07:12:15 - INFO - __main__ - Step 70989: {'lr': 0.0002765472561681393, 'samples': 13629888, 'steps': 70988, 'loss/train': 4.123940944671631} 11/07/2021 07:12:15 - INFO - __main__ - Step 70990: {'lr': 0.0002765419794223487, 'samples': 13630080, 'steps': 70989, 'loss/train': 1.808131456375122} 11/07/2021 07:12:16 - INFO - __main__ - Step 70991: {'lr': 0.0002765367026645987, 'samples': 13630272, 'steps': 70990, 'loss/train': 0.43907609581947327} 11/07/2021 07:12:17 - INFO - __main__ - Step 70992: {'lr': 0.0002765314258948916, 'samples': 13630464, 'steps': 70991, 'loss/train': 1.057617425918579} 11/07/2021 07:12:17 - INFO - __main__ - Step 70993: {'lr': 0.0002765261491132299, 'samples': 13630656, 'steps': 70992, 'loss/train': 1.3513883352279663} 11/07/2021 07:12:17 - INFO - __main__ - Step 70994: {'lr': 0.0002765208723196159, 'samples': 13630848, 'steps': 70993, 'loss/train': 1.4643006324768066} 11/07/2021 07:12:18 - INFO - __main__ - Step 70995: {'lr': 0.000276515595514052, 'samples': 13631040, 'steps': 70994, 'loss/train': 1.1638760566711426} 11/07/2021 07:12:18 - INFO - __main__ - Step 70996: {'lr': 0.00027651031869654056, 'samples': 13631232, 'steps': 70995, 'loss/train': 0.5400805473327637} 11/07/2021 07:12:19 - INFO - __main__ - Step 70997: {'lr': 0.0002765050418670841, 'samples': 13631424, 'steps': 70996, 'loss/train': 1.6999567747116089} 11/07/2021 07:12:19 - INFO - __main__ - Step 70998: {'lr': 0.00027649976502568477, 'samples': 13631616, 'steps': 70997, 'loss/train': 1.778939962387085} 11/07/2021 07:12:20 - INFO - __main__ - Step 70999: {'lr': 0.00027649448817234506, 'samples': 13631808, 'steps': 70998, 'loss/train': 1.2746703624725342} 11/07/2021 07:12:20 - INFO - __main__ - Step 71000: {'lr': 0.00027648921130706737, 'samples': 13632000, 'steps': 70999, 'loss/train': 0.9459729790687561} 11/07/2021 07:12:21 - INFO - __main__ - Step 71001: {'lr': 0.000276483934429854, 'samples': 13632192, 'steps': 71000, 'loss/train': 1.502760887145996} 11/07/2021 07:12:22 - INFO - __main__ - Step 71002: {'lr': 0.00027647865754070746, 'samples': 13632384, 'steps': 71001, 'loss/train': 1.3625224828720093} 11/07/2021 07:12:22 - INFO - __main__ - Step 71003: {'lr': 0.00027647338063963, 'samples': 13632576, 'steps': 71002, 'loss/train': 1.4772975444793701} 11/07/2021 07:12:22 - INFO - __main__ - Step 71004: {'lr': 0.00027646810372662406, 'samples': 13632768, 'steps': 71003, 'loss/train': 1.43500816822052} 11/07/2021 07:12:23 - INFO - __main__ - Step 71005: {'lr': 0.000276462826801692, 'samples': 13632960, 'steps': 71004, 'loss/train': 1.5492775440216064} 11/07/2021 07:12:23 - INFO - __main__ - Step 71006: {'lr': 0.0002764575498648362, 'samples': 13633152, 'steps': 71005, 'loss/train': 1.7136622667312622} 11/07/2021 07:12:24 - INFO - __main__ - Step 71007: {'lr': 0.000276452272916059, 'samples': 13633344, 'steps': 71006, 'loss/train': 1.2247740030288696} 11/07/2021 07:12:24 - INFO - __main__ - Step 71008: {'lr': 0.00027644699595536285, 'samples': 13633536, 'steps': 71007, 'loss/train': 1.6638469696044922} 11/07/2021 07:12:25 - INFO - __main__ - Step 71009: {'lr': 0.00027644171898275006, 'samples': 13633728, 'steps': 71008, 'loss/train': 1.8925082683563232} 11/07/2021 07:12:25 - INFO - __main__ - Step 71010: {'lr': 0.0002764364419982231, 'samples': 13633920, 'steps': 71009, 'loss/train': 1.214255928993225} 11/07/2021 07:12:25 - INFO - __main__ - Step 71011: {'lr': 0.0002764311650017842, 'samples': 13634112, 'steps': 71010, 'loss/train': 1.388029932975769} 11/07/2021 07:12:26 - INFO - __main__ - Step 71012: {'lr': 0.0002764258879934359, 'samples': 13634304, 'steps': 71011, 'loss/train': 1.3780158758163452} 11/07/2021 07:12:27 - INFO - __main__ - Step 71013: {'lr': 0.0002764206109731805, 'samples': 13634496, 'steps': 71012, 'loss/train': 2.3616926670074463} 11/07/2021 07:12:27 - INFO - __main__ - Step 71014: {'lr': 0.0002764153339410203, 'samples': 13634688, 'steps': 71013, 'loss/train': 1.2490679025650024} 11/07/2021 07:12:28 - INFO - __main__ - Step 71015: {'lr': 0.0002764100568969578, 'samples': 13634880, 'steps': 71014, 'loss/train': 1.4707393646240234} 11/07/2021 07:12:28 - INFO - __main__ - Step 71016: {'lr': 0.0002764047798409954, 'samples': 13635072, 'steps': 71015, 'loss/train': 1.3951606750488281} 11/07/2021 07:12:28 - INFO - __main__ - Step 71017: {'lr': 0.00027639950277313533, 'samples': 13635264, 'steps': 71016, 'loss/train': 0.715826690196991} 11/07/2021 07:12:29 - INFO - __main__ - Step 71018: {'lr': 0.0002763942256933801, 'samples': 13635456, 'steps': 71017, 'loss/train': 1.194007158279419} 11/07/2021 07:12:30 - INFO - __main__ - Step 71019: {'lr': 0.000276388948601732, 'samples': 13635648, 'steps': 71018, 'loss/train': 1.4905595779418945} 11/07/2021 07:12:30 - INFO - __main__ - Step 71020: {'lr': 0.0002763836714981935, 'samples': 13635840, 'steps': 71019, 'loss/train': 1.0353513956069946} 11/07/2021 07:12:30 - INFO - __main__ - Step 71021: {'lr': 0.0002763783943827669, 'samples': 13636032, 'steps': 71020, 'loss/train': 1.054975986480713} 11/07/2021 07:12:31 - INFO - __main__ - Step 71022: {'lr': 0.00027637311725545454, 'samples': 13636224, 'steps': 71021, 'loss/train': 1.75421941280365} 11/07/2021 07:12:32 - INFO - __main__ - Step 71023: {'lr': 0.0002763678401162589, 'samples': 13636416, 'steps': 71022, 'loss/train': 1.0491652488708496} 11/07/2021 07:12:32 - INFO - __main__ - Step 71024: {'lr': 0.00027636256296518244, 'samples': 13636608, 'steps': 71023, 'loss/train': 1.0978909730911255} 11/07/2021 07:12:32 - INFO - __main__ - Step 71025: {'lr': 0.0002763572858022273, 'samples': 13636800, 'steps': 71024, 'loss/train': 1.663319706916809} 11/07/2021 07:12:33 - INFO - __main__ - Step 71026: {'lr': 0.00027635200862739594, 'samples': 13636992, 'steps': 71025, 'loss/train': 1.240088939666748} 11/07/2021 07:12:33 - INFO - __main__ - Step 71027: {'lr': 0.00027634673144069085, 'samples': 13637184, 'steps': 71026, 'loss/train': 1.017965316772461} 11/07/2021 07:12:34 - INFO - __main__ - Step 71028: {'lr': 0.0002763414542421143, 'samples': 13637376, 'steps': 71027, 'loss/train': 1.2370821237564087} 11/07/2021 07:12:35 - INFO - __main__ - Step 71029: {'lr': 0.0002763361770316687, 'samples': 13637568, 'steps': 71028, 'loss/train': 1.095389723777771} 11/07/2021 07:12:35 - INFO - __main__ - Step 71030: {'lr': 0.00027633089980935645, 'samples': 13637760, 'steps': 71029, 'loss/train': 2.6628026962280273} 11/07/2021 07:12:35 - INFO - __main__ - Step 71031: {'lr': 0.00027632562257517984, 'samples': 13637952, 'steps': 71030, 'loss/train': 1.0994313955307007} 11/07/2021 07:12:36 - INFO - __main__ - Step 71032: {'lr': 0.00027632034532914135, 'samples': 13638144, 'steps': 71031, 'loss/train': 2.5197744369506836} 11/07/2021 07:12:37 - INFO - __main__ - Step 71033: {'lr': 0.0002763150680712433, 'samples': 13638336, 'steps': 71032, 'loss/train': 1.651861310005188} 11/07/2021 07:12:37 - INFO - __main__ - Step 71034: {'lr': 0.0002763097908014881, 'samples': 13638528, 'steps': 71033, 'loss/train': 1.4917819499969482} 11/07/2021 07:12:37 - INFO - __main__ - Step 71035: {'lr': 0.00027630451351987804, 'samples': 13638720, 'steps': 71034, 'loss/train': 1.3341811895370483} 11/07/2021 07:12:38 - INFO - __main__ - Step 71036: {'lr': 0.0002762992362264157, 'samples': 13638912, 'steps': 71035, 'loss/train': 1.1894361972808838} 11/07/2021 07:12:38 - INFO - __main__ - Step 71037: {'lr': 0.0002762939589211033, 'samples': 13639104, 'steps': 71036, 'loss/train': 1.7016336917877197} 11/07/2021 07:12:38 - INFO - __main__ - Step 71038: {'lr': 0.00027628868160394323, 'samples': 13639296, 'steps': 71037, 'loss/train': 1.5827643871307373} 11/07/2021 07:12:39 - INFO - __main__ - Step 71039: {'lr': 0.00027628340427493785, 'samples': 13639488, 'steps': 71038, 'loss/train': 1.4437466859817505} 11/07/2021 07:12:40 - INFO - __main__ - Step 71040: {'lr': 0.0002762781269340896, 'samples': 13639680, 'steps': 71039, 'loss/train': 1.5108833312988281} 11/07/2021 07:12:40 - INFO - __main__ - Step 71041: {'lr': 0.0002762728495814008, 'samples': 13639872, 'steps': 71040, 'loss/train': 1.2198872566223145} 11/07/2021 07:12:40 - INFO - __main__ - Step 71042: {'lr': 0.00027626757221687394, 'samples': 13640064, 'steps': 71041, 'loss/train': 1.3653879165649414} 11/07/2021 07:12:41 - INFO - __main__ - Step 71043: {'lr': 0.00027626229484051126, 'samples': 13640256, 'steps': 71042, 'loss/train': 1.404168963432312} 11/07/2021 07:12:42 - INFO - __main__ - Step 71044: {'lr': 0.0002762570174523152, 'samples': 13640448, 'steps': 71043, 'loss/train': 2.186885356903076} 11/07/2021 07:12:42 - INFO - __main__ - Step 71045: {'lr': 0.00027625174005228815, 'samples': 13640640, 'steps': 71044, 'loss/train': 1.4896130561828613} 11/07/2021 07:12:42 - INFO - __main__ - Step 71046: {'lr': 0.0002762464626404325, 'samples': 13640832, 'steps': 71045, 'loss/train': 1.381691813468933} 11/07/2021 07:12:43 - INFO - __main__ - Step 71047: {'lr': 0.0002762411852167505, 'samples': 13641024, 'steps': 71046, 'loss/train': 1.653673529624939} 11/07/2021 07:12:43 - INFO - __main__ - Step 71048: {'lr': 0.0002762359077812447, 'samples': 13641216, 'steps': 71047, 'loss/train': 1.4044941663742065} 11/07/2021 07:12:44 - INFO - __main__ - Step 71049: {'lr': 0.00027623063033391736, 'samples': 13641408, 'steps': 71048, 'loss/train': 1.276509165763855} 11/07/2021 07:12:44 - INFO - __main__ - Step 71050: {'lr': 0.00027622535287477097, 'samples': 13641600, 'steps': 71049, 'loss/train': 1.1451396942138672} 11/07/2021 07:12:45 - INFO - __main__ - Step 71051: {'lr': 0.0002762200754038078, 'samples': 13641792, 'steps': 71050, 'loss/train': 1.2376048564910889} 11/07/2021 07:12:45 - INFO - __main__ - Step 71052: {'lr': 0.0002762147979210303, 'samples': 13641984, 'steps': 71051, 'loss/train': 1.5221631526947021} 11/07/2021 07:12:45 - INFO - __main__ - Step 71053: {'lr': 0.00027620952042644074, 'samples': 13642176, 'steps': 71052, 'loss/train': 1.193668246269226} 11/07/2021 07:12:47 - INFO - __main__ - Step 71054: {'lr': 0.00027620424292004167, 'samples': 13642368, 'steps': 71053, 'loss/train': 1.746040940284729} 11/07/2021 07:12:47 - INFO - __main__ - Step 71055: {'lr': 0.00027619896540183526, 'samples': 13642560, 'steps': 71054, 'loss/train': 1.680351734161377} 11/07/2021 07:12:47 - INFO - __main__ - Step 71056: {'lr': 0.0002761936878718241, 'samples': 13642752, 'steps': 71055, 'loss/train': 2.273935317993164} 11/07/2021 07:12:48 - INFO - __main__ - Step 71057: {'lr': 0.00027618841033001044, 'samples': 13642944, 'steps': 71056, 'loss/train': 1.7737926244735718} 11/07/2021 07:12:48 - INFO - __main__ - Step 71058: {'lr': 0.0002761831327763967, 'samples': 13643136, 'steps': 71057, 'loss/train': 1.275244116783142} 11/07/2021 07:12:48 - INFO - __main__ - Step 71059: {'lr': 0.0002761778552109852, 'samples': 13643328, 'steps': 71058, 'loss/train': 1.1517279148101807} 11/07/2021 07:12:49 - INFO - __main__ - Step 71060: {'lr': 0.00027617257763377836, 'samples': 13643520, 'steps': 71059, 'loss/train': 0.7027788162231445} 11/07/2021 07:12:50 - INFO - __main__ - Step 71061: {'lr': 0.0002761673000447786, 'samples': 13643712, 'steps': 71060, 'loss/train': 1.5836125612258911} 11/07/2021 07:12:50 - INFO - __main__ - Step 71062: {'lr': 0.0002761620224439882, 'samples': 13643904, 'steps': 71061, 'loss/train': 1.2547110319137573} 11/07/2021 07:12:50 - INFO - __main__ - Step 71063: {'lr': 0.00027615674483140966, 'samples': 13644096, 'steps': 71062, 'loss/train': 1.8366410732269287} 11/07/2021 07:12:51 - INFO - __main__ - Step 71064: {'lr': 0.00027615146720704533, 'samples': 13644288, 'steps': 71063, 'loss/train': 1.0302658081054688} 11/07/2021 07:12:52 - INFO - __main__ - Step 71065: {'lr': 0.0002761461895708975, 'samples': 13644480, 'steps': 71064, 'loss/train': 1.5463932752609253} 11/07/2021 07:12:52 - INFO - __main__ - Step 71066: {'lr': 0.0002761409119229686, 'samples': 13644672, 'steps': 71065, 'loss/train': 1.1627445220947266} 11/07/2021 07:12:52 - INFO - __main__ - Step 71067: {'lr': 0.000276135634263261, 'samples': 13644864, 'steps': 71066, 'loss/train': 1.5161465406417847} 11/07/2021 07:12:53 - INFO - __main__ - Step 71068: {'lr': 0.000276130356591777, 'samples': 13645056, 'steps': 71067, 'loss/train': 1.5323213338851929} 11/07/2021 07:12:53 - INFO - __main__ - Step 71069: {'lr': 0.0002761250789085192, 'samples': 13645248, 'steps': 71068, 'loss/train': 1.6718050241470337} 11/07/2021 07:12:54 - INFO - __main__ - Step 71070: {'lr': 0.0002761198012134898, 'samples': 13645440, 'steps': 71069, 'loss/train': 1.4402605295181274} 11/07/2021 07:12:55 - INFO - __main__ - Step 71071: {'lr': 0.00027611452350669133, 'samples': 13645632, 'steps': 71070, 'loss/train': 1.2424362897872925} 11/07/2021 07:12:55 - INFO - __main__ - Step 71072: {'lr': 0.00027610924578812593, 'samples': 13645824, 'steps': 71071, 'loss/train': 1.0806355476379395} 11/07/2021 07:12:55 - INFO - __main__ - Step 71073: {'lr': 0.00027610396805779607, 'samples': 13646016, 'steps': 71072, 'loss/train': 1.5792454481124878} 11/07/2021 07:12:56 - INFO - __main__ - Step 71074: {'lr': 0.00027609869031570424, 'samples': 13646208, 'steps': 71073, 'loss/train': 1.4021403789520264} 11/07/2021 07:12:57 - INFO - __main__ - Step 71075: {'lr': 0.0002760934125618527, 'samples': 13646400, 'steps': 71074, 'loss/train': 1.3979742527008057} 11/07/2021 07:12:57 - INFO - __main__ - Step 71076: {'lr': 0.0002760881347962439, 'samples': 13646592, 'steps': 71075, 'loss/train': 1.5283734798431396} 11/07/2021 07:12:57 - INFO - __main__ - Step 71077: {'lr': 0.00027608285701888026, 'samples': 13646784, 'steps': 71076, 'loss/train': 1.5323498249053955} 11/07/2021 07:12:58 - INFO - __main__ - Step 71078: {'lr': 0.00027607757922976393, 'samples': 13646976, 'steps': 71077, 'loss/train': 1.6387848854064941} 11/07/2021 07:12:58 - INFO - __main__ - Step 71079: {'lr': 0.00027607230142889756, 'samples': 13647168, 'steps': 71078, 'loss/train': 1.4092514514923096} 11/07/2021 07:12:58 - INFO - __main__ - Step 71080: {'lr': 0.00027606702361628337, 'samples': 13647360, 'steps': 71079, 'loss/train': 1.235788345336914} 11/07/2021 07:12:59 - INFO - __main__ - Step 71081: {'lr': 0.0002760617457919238, 'samples': 13647552, 'steps': 71080, 'loss/train': 1.9432390928268433} 11/07/2021 07:13:00 - INFO - __main__ - Step 71082: {'lr': 0.0002760564679558212, 'samples': 13647744, 'steps': 71081, 'loss/train': 1.3954497575759888} 11/07/2021 07:13:00 - INFO - __main__ - Step 71083: {'lr': 0.0002760511901079779, 'samples': 13647936, 'steps': 71082, 'loss/train': 1.6244450807571411} 11/07/2021 07:13:00 - INFO - __main__ - Step 71084: {'lr': 0.0002760459122483965, 'samples': 13648128, 'steps': 71083, 'loss/train': 1.6143406629562378} 11/07/2021 07:13:01 - INFO - __main__ - Step 71085: {'lr': 0.00027604063437707905, 'samples': 13648320, 'steps': 71084, 'loss/train': 1.5160905122756958} 11/07/2021 07:13:02 - INFO - __main__ - Step 71086: {'lr': 0.00027603535649402814, 'samples': 13648512, 'steps': 71085, 'loss/train': 1.531201958656311} 11/07/2021 07:13:02 - INFO - __main__ - Step 71087: {'lr': 0.0002760300785992461, 'samples': 13648704, 'steps': 71086, 'loss/train': 1.3326702117919922} 11/07/2021 07:13:03 - INFO - __main__ - Step 71088: {'lr': 0.00027602480069273535, 'samples': 13648896, 'steps': 71087, 'loss/train': 1.7622114419937134} 11/07/2021 07:13:03 - INFO - __main__ - Step 71089: {'lr': 0.00027601952277449813, 'samples': 13649088, 'steps': 71088, 'loss/train': 1.4648821353912354} 11/07/2021 07:13:03 - INFO - __main__ - Step 71090: {'lr': 0.000276014244844537, 'samples': 13649280, 'steps': 71089, 'loss/train': 1.1750984191894531} 11/07/2021 07:13:04 - INFO - __main__ - Step 71091: {'lr': 0.00027600896690285434, 'samples': 13649472, 'steps': 71090, 'loss/train': 0.6221761107444763} 11/07/2021 07:13:05 - INFO - __main__ - Step 71092: {'lr': 0.00027600368894945226, 'samples': 13649664, 'steps': 71091, 'loss/train': 1.412021279335022} 11/07/2021 07:13:05 - INFO - __main__ - Step 71093: {'lr': 0.00027599841098433343, 'samples': 13649856, 'steps': 71092, 'loss/train': 2.024472713470459} 11/07/2021 07:13:05 - INFO - __main__ - Step 71094: {'lr': 0.00027599313300750007, 'samples': 13650048, 'steps': 71093, 'loss/train': 1.2538578510284424} 11/07/2021 07:13:06 - INFO - __main__ - Step 71095: {'lr': 0.00027598785501895456, 'samples': 13650240, 'steps': 71094, 'loss/train': 1.4909164905548096} 11/07/2021 07:13:07 - INFO - __main__ - Step 71096: {'lr': 0.0002759825770186994, 'samples': 13650432, 'steps': 71095, 'loss/train': 1.429497241973877} 11/07/2021 07:13:07 - INFO - __main__ - Step 71097: {'lr': 0.0002759772990067369, 'samples': 13650624, 'steps': 71096, 'loss/train': 1.1649255752563477} 11/07/2021 07:13:07 - INFO - __main__ - Step 71098: {'lr': 0.0002759720209830694, 'samples': 13650816, 'steps': 71097, 'loss/train': 1.4670138359069824} 11/07/2021 07:13:08 - INFO - __main__ - Step 71099: {'lr': 0.0002759667429476993, 'samples': 13651008, 'steps': 71098, 'loss/train': 1.1935758590698242} 11/07/2021 07:13:08 - INFO - __main__ - Step 71100: {'lr': 0.00027596146490062903, 'samples': 13651200, 'steps': 71099, 'loss/train': 1.6938424110412598} 11/07/2021 07:13:09 - INFO - __main__ - Step 71101: {'lr': 0.0002759561868418609, 'samples': 13651392, 'steps': 71100, 'loss/train': 1.2166460752487183} 11/07/2021 07:13:09 - INFO - __main__ - Step 71102: {'lr': 0.0002759509087713973, 'samples': 13651584, 'steps': 71101, 'loss/train': 1.3402879238128662} 11/07/2021 07:13:10 - INFO - __main__ - Step 71103: {'lr': 0.0002759456306892406, 'samples': 13651776, 'steps': 71102, 'loss/train': 1.6574887037277222} 11/07/2021 07:13:10 - INFO - __main__ - Step 71104: {'lr': 0.0002759403525953932, 'samples': 13651968, 'steps': 71103, 'loss/train': 1.327664852142334} 11/07/2021 07:13:10 - INFO - __main__ - Step 71105: {'lr': 0.00027593507448985747, 'samples': 13652160, 'steps': 71104, 'loss/train': 2.0022737979888916} 11/07/2021 07:13:12 - INFO - __main__ - Step 71106: {'lr': 0.00027592979637263587, 'samples': 13652352, 'steps': 71105, 'loss/train': 1.2401671409606934} 11/07/2021 07:13:12 - INFO - __main__ - Step 71107: {'lr': 0.0002759245182437307, 'samples': 13652544, 'steps': 71106, 'loss/train': 1.6912767887115479} 11/07/2021 07:13:12 - INFO - __main__ - Step 71108: {'lr': 0.0002759192401031443, 'samples': 13652736, 'steps': 71107, 'loss/train': 1.4226139783859253} 11/07/2021 07:13:13 - INFO - __main__ - Step 71109: {'lr': 0.0002759139619508791, 'samples': 13652928, 'steps': 71108, 'loss/train': 1.551901936531067} 11/07/2021 07:13:13 - INFO - __main__ - Step 71110: {'lr': 0.00027590868378693745, 'samples': 13653120, 'steps': 71109, 'loss/train': 0.1625504046678543} 11/07/2021 07:13:14 - INFO - __main__ - Step 71111: {'lr': 0.0002759034056113217, 'samples': 13653312, 'steps': 71110, 'loss/train': 1.5160608291625977} 11/07/2021 07:13:15 - INFO - __main__ - Step 71112: {'lr': 0.0002758981274240344, 'samples': 13653504, 'steps': 71111, 'loss/train': 0.2563236653804779} 11/07/2021 07:13:15 - INFO - __main__ - Step 71113: {'lr': 0.00027589284922507776, 'samples': 13653696, 'steps': 71112, 'loss/train': 0.20866094529628754} 11/07/2021 07:13:15 - INFO - __main__ - Step 71114: {'lr': 0.00027588757101445414, 'samples': 13653888, 'steps': 71113, 'loss/train': 1.2528996467590332} 11/07/2021 07:13:16 - INFO - __main__ - Step 71115: {'lr': 0.000275882292792166, 'samples': 13654080, 'steps': 71114, 'loss/train': 1.1815557479858398} 11/07/2021 07:13:17 - INFO - __main__ - Step 71116: {'lr': 0.00027587701455821575, 'samples': 13654272, 'steps': 71115, 'loss/train': 1.4887083768844604} 11/07/2021 07:13:17 - INFO - __main__ - Step 71117: {'lr': 0.00027587173631260563, 'samples': 13654464, 'steps': 71116, 'loss/train': 1.4363927841186523} 11/07/2021 07:13:18 - INFO - __main__ - Step 71118: {'lr': 0.00027586645805533817, 'samples': 13654656, 'steps': 71117, 'loss/train': 1.518601655960083} 11/07/2021 07:13:18 - INFO - __main__ - Step 71119: {'lr': 0.0002758611797864157, 'samples': 13654848, 'steps': 71118, 'loss/train': 1.2362768650054932} 11/07/2021 07:13:18 - INFO - __main__ - Step 71120: {'lr': 0.00027585590150584055, 'samples': 13655040, 'steps': 71119, 'loss/train': 1.429235816001892} 11/07/2021 07:13:19 - INFO - __main__ - Step 71121: {'lr': 0.00027585062321361516, 'samples': 13655232, 'steps': 71120, 'loss/train': 1.7697621583938599} 11/07/2021 07:13:20 - INFO - __main__ - Step 71122: {'lr': 0.0002758453449097418, 'samples': 13655424, 'steps': 71121, 'loss/train': 0.9994570016860962} 11/07/2021 07:13:20 - INFO - __main__ - Step 71123: {'lr': 0.000275840066594223, 'samples': 13655616, 'steps': 71122, 'loss/train': 1.3998806476593018} 11/07/2021 07:13:21 - INFO - __main__ - Step 71124: {'lr': 0.0002758347882670611, 'samples': 13655808, 'steps': 71123, 'loss/train': 1.6954997777938843} 11/07/2021 07:13:21 - INFO - __main__ - Step 71125: {'lr': 0.0002758295099282583, 'samples': 13656000, 'steps': 71124, 'loss/train': 1.0225944519042969} 11/07/2021 07:13:21 - INFO - __main__ - Step 71126: {'lr': 0.00027582423157781723, 'samples': 13656192, 'steps': 71125, 'loss/train': 1.4491859674453735} 11/07/2021 07:13:22 - INFO - __main__ - Step 71127: {'lr': 0.0002758189532157401, 'samples': 13656384, 'steps': 71126, 'loss/train': 1.5483640432357788} 11/07/2021 07:13:23 - INFO - __main__ - Step 71128: {'lr': 0.0002758136748420294, 'samples': 13656576, 'steps': 71127, 'loss/train': 1.4164620637893677} 11/07/2021 07:13:23 - INFO - __main__ - Step 71129: {'lr': 0.0002758083964566874, 'samples': 13656768, 'steps': 71128, 'loss/train': 1.4979195594787598} 11/07/2021 07:13:23 - INFO - __main__ - Step 71130: {'lr': 0.0002758031180597166, 'samples': 13656960, 'steps': 71129, 'loss/train': 1.8485164642333984} 11/07/2021 07:13:24 - INFO - __main__ - Step 71131: {'lr': 0.0002757978396511194, 'samples': 13657152, 'steps': 71130, 'loss/train': 1.3791887760162354} 11/07/2021 07:13:24 - INFO - __main__ - Step 71132: {'lr': 0.00027579256123089793, 'samples': 13657344, 'steps': 71131, 'loss/train': 1.6199860572814941} 11/07/2021 07:13:25 - INFO - __main__ - Step 71133: {'lr': 0.00027578728279905473, 'samples': 13657536, 'steps': 71132, 'loss/train': 1.2211918830871582} 11/07/2021 07:13:25 - INFO - __main__ - Step 71134: {'lr': 0.00027578200435559225, 'samples': 13657728, 'steps': 71133, 'loss/train': 1.308647632598877} 11/07/2021 07:13:26 - INFO - __main__ - Step 71135: {'lr': 0.0002757767259005128, 'samples': 13657920, 'steps': 71134, 'loss/train': 0.7349300980567932} 11/07/2021 07:13:26 - INFO - __main__ - Step 71136: {'lr': 0.00027577144743381863, 'samples': 13658112, 'steps': 71135, 'loss/train': 1.1252607107162476} 11/07/2021 07:13:27 - INFO - __main__ - Step 71137: {'lr': 0.0002757661689555124, 'samples': 13658304, 'steps': 71136, 'loss/train': 0.9165046215057373} 11/07/2021 07:13:28 - INFO - __main__ - Step 71138: {'lr': 0.00027576089046559634, 'samples': 13658496, 'steps': 71137, 'loss/train': 1.6875742673873901} 11/07/2021 07:13:28 - INFO - __main__ - Step 71139: {'lr': 0.0002757556119640727, 'samples': 13658688, 'steps': 71138, 'loss/train': 1.339807152748108} 11/07/2021 07:13:28 - INFO - __main__ - Step 71140: {'lr': 0.000275750333450944, 'samples': 13658880, 'steps': 71139, 'loss/train': 1.3655120134353638} 11/07/2021 07:13:29 - INFO - __main__ - Step 71141: {'lr': 0.00027574505492621265, 'samples': 13659072, 'steps': 71140, 'loss/train': 1.195172667503357} 11/07/2021 07:13:29 - INFO - __main__ - Step 71142: {'lr': 0.000275739776389881, 'samples': 13659264, 'steps': 71141, 'loss/train': 0.8237540125846863} 11/07/2021 07:13:30 - INFO - __main__ - Step 71143: {'lr': 0.00027573449784195134, 'samples': 13659456, 'steps': 71142, 'loss/train': 1.2869925498962402} 11/07/2021 07:13:30 - INFO - __main__ - Step 71144: {'lr': 0.0002757292192824261, 'samples': 13659648, 'steps': 71143, 'loss/train': 1.0667797327041626} 11/07/2021 07:13:31 - INFO - __main__ - Step 71145: {'lr': 0.00027572394071130775, 'samples': 13659840, 'steps': 71144, 'loss/train': 1.1545109748840332} 11/07/2021 07:13:31 - INFO - __main__ - Step 71146: {'lr': 0.0002757186621285985, 'samples': 13660032, 'steps': 71145, 'loss/train': 1.47628915309906} 11/07/2021 07:13:31 - INFO - __main__ - Step 71147: {'lr': 0.00027571338353430086, 'samples': 13660224, 'steps': 71146, 'loss/train': 1.649911880493164} 11/07/2021 07:13:33 - INFO - __main__ - Step 71148: {'lr': 0.0002757081049284172, 'samples': 13660416, 'steps': 71147, 'loss/train': 1.3067210912704468} 11/07/2021 07:13:33 - INFO - __main__ - Step 71149: {'lr': 0.0002757028263109498, 'samples': 13660608, 'steps': 71148, 'loss/train': 1.1677813529968262} 11/07/2021 07:13:33 - INFO - __main__ - Step 71150: {'lr': 0.0002756975476819011, 'samples': 13660800, 'steps': 71149, 'loss/train': 1.5521981716156006} 11/07/2021 07:13:34 - INFO - __main__ - Step 71151: {'lr': 0.0002756922690412736, 'samples': 13660992, 'steps': 71150, 'loss/train': 0.2989867329597473} 11/07/2021 07:13:34 - INFO - __main__ - Step 71152: {'lr': 0.00027568699038906945, 'samples': 13661184, 'steps': 71151, 'loss/train': 1.2076270580291748} 11/07/2021 07:13:35 - INFO - __main__ - Step 71153: {'lr': 0.0002756817117252912, 'samples': 13661376, 'steps': 71152, 'loss/train': 1.3630497455596924} 11/07/2021 07:13:35 - INFO - __main__ - Step 71154: {'lr': 0.0002756764330499411, 'samples': 13661568, 'steps': 71153, 'loss/train': 1.4710662364959717} 11/07/2021 07:13:36 - INFO - __main__ - Step 71155: {'lr': 0.0002756711543630216, 'samples': 13661760, 'steps': 71154, 'loss/train': 1.5740491151809692} 11/07/2021 07:13:36 - INFO - __main__ - Step 71156: {'lr': 0.0002756658756645351, 'samples': 13661952, 'steps': 71155, 'loss/train': 0.8200165033340454} 11/07/2021 07:13:36 - INFO - __main__ - Step 71157: {'lr': 0.00027566059695448395, 'samples': 13662144, 'steps': 71156, 'loss/train': 1.3428208827972412} 11/07/2021 07:13:37 - INFO - __main__ - Step 71158: {'lr': 0.0002756553182328706, 'samples': 13662336, 'steps': 71157, 'loss/train': 1.2982152700424194} 11/07/2021 07:13:38 - INFO - __main__ - Step 71159: {'lr': 0.00027565003949969725, 'samples': 13662528, 'steps': 71158, 'loss/train': 1.42640221118927} 11/07/2021 07:13:38 - INFO - __main__ - Step 71160: {'lr': 0.0002756447607549664, 'samples': 13662720, 'steps': 71159, 'loss/train': 1.2327316999435425} 11/07/2021 07:13:38 - INFO - __main__ - Step 71161: {'lr': 0.0002756394819986805, 'samples': 13662912, 'steps': 71160, 'loss/train': 1.8571149110794067} 11/07/2021 07:13:39 - INFO - __main__ - Step 71162: {'lr': 0.00027563420323084174, 'samples': 13663104, 'steps': 71161, 'loss/train': 1.8095855712890625} 11/07/2021 07:13:39 - INFO - __main__ - Step 71163: {'lr': 0.00027562892445145266, 'samples': 13663296, 'steps': 71162, 'loss/train': 1.6515315771102905} 11/07/2021 07:13:40 - INFO - __main__ - Step 71164: {'lr': 0.00027562364566051557, 'samples': 13663488, 'steps': 71163, 'loss/train': 1.5682203769683838} 11/07/2021 07:13:41 - INFO - __main__ - Step 71165: {'lr': 0.00027561836685803293, 'samples': 13663680, 'steps': 71164, 'loss/train': 1.6738879680633545} 11/07/2021 07:13:41 - INFO - __main__ - Step 71166: {'lr': 0.000275613088044007, 'samples': 13663872, 'steps': 71165, 'loss/train': 1.572826623916626} 11/07/2021 07:13:41 - INFO - __main__ - Step 71167: {'lr': 0.0002756078092184401, 'samples': 13664064, 'steps': 71166, 'loss/train': 1.3971633911132812} 11/07/2021 07:13:42 - INFO - __main__ - Step 71168: {'lr': 0.0002756025303813349, 'samples': 13664256, 'steps': 71167, 'loss/train': 1.2006946802139282} 11/07/2021 07:13:43 - INFO - __main__ - Step 71169: {'lr': 0.0002755972515326934, 'samples': 13664448, 'steps': 71168, 'loss/train': 1.3694031238555908} 11/07/2021 07:13:43 - INFO - __main__ - Step 71170: {'lr': 0.0002755919726725183, 'samples': 13664640, 'steps': 71169, 'loss/train': 1.045654296875} 11/07/2021 07:13:43 - INFO - __main__ - Step 71171: {'lr': 0.0002755866938008119, 'samples': 13664832, 'steps': 71170, 'loss/train': 1.5553407669067383} 11/07/2021 07:13:44 - INFO - __main__ - Step 71172: {'lr': 0.0002755814149175765, 'samples': 13665024, 'steps': 71171, 'loss/train': 1.2498480081558228} 11/07/2021 07:13:44 - INFO - __main__ - Step 71173: {'lr': 0.00027557613602281446, 'samples': 13665216, 'steps': 71172, 'loss/train': 1.3560444116592407} 11/07/2021 07:13:45 - INFO - __main__ - Step 71174: {'lr': 0.0002755708571165282, 'samples': 13665408, 'steps': 71173, 'loss/train': 1.514541506767273} 11/07/2021 07:13:45 - INFO - __main__ - Step 71175: {'lr': 0.0002755655781987201, 'samples': 13665600, 'steps': 71174, 'loss/train': 1.5268187522888184} 11/07/2021 07:13:46 - INFO - __main__ - Step 71176: {'lr': 0.0002755602992693926, 'samples': 13665792, 'steps': 71175, 'loss/train': 1.644734263420105} 11/07/2021 07:13:46 - INFO - __main__ - Step 71177: {'lr': 0.00027555502032854795, 'samples': 13665984, 'steps': 71176, 'loss/train': 1.6064800024032593} 11/07/2021 07:13:46 - INFO - __main__ - Step 71178: {'lr': 0.0002755497413761887, 'samples': 13666176, 'steps': 71177, 'loss/train': 1.2309236526489258} 11/07/2021 07:13:48 - INFO - __main__ - Step 71179: {'lr': 0.00027554446241231706, 'samples': 13666368, 'steps': 71178, 'loss/train': 1.7296139001846313} 11/07/2021 07:13:48 - INFO - __main__ - Step 71180: {'lr': 0.0002755391834369355, 'samples': 13666560, 'steps': 71179, 'loss/train': 1.6015222072601318} 11/07/2021 07:13:49 - INFO - __main__ - Step 71181: {'lr': 0.0002755339044500464, 'samples': 13666752, 'steps': 71180, 'loss/train': 1.5488232374191284} 11/07/2021 07:13:49 - INFO - __main__ - Step 71182: {'lr': 0.0002755286254516521, 'samples': 13666944, 'steps': 71181, 'loss/train': 1.6599831581115723} 11/07/2021 07:13:49 - INFO - __main__ - Step 71183: {'lr': 0.0002755233464417549, 'samples': 13667136, 'steps': 71182, 'loss/train': 1.6913357973098755} 11/07/2021 07:13:50 - INFO - __main__ - Step 71184: {'lr': 0.0002755180674203574, 'samples': 13667328, 'steps': 71183, 'loss/train': 1.7288442850112915} 11/07/2021 07:13:51 - INFO - __main__ - Step 71185: {'lr': 0.00027551278838746187, 'samples': 13667520, 'steps': 71184, 'loss/train': 1.590076208114624} 11/07/2021 07:13:51 - INFO - __main__ - Step 71186: {'lr': 0.00027550750934307057, 'samples': 13667712, 'steps': 71185, 'loss/train': 1.80953049659729} 11/07/2021 07:13:52 - INFO - __main__ - Step 71187: {'lr': 0.00027550223028718603, 'samples': 13667904, 'steps': 71186, 'loss/train': 1.0337787866592407} 11/07/2021 07:13:52 - INFO - __main__ - Step 71188: {'lr': 0.00027549695121981057, 'samples': 13668096, 'steps': 71187, 'loss/train': 1.8380128145217896} 11/07/2021 07:13:52 - INFO - __main__ - Step 71189: {'lr': 0.0002754916721409466, 'samples': 13668288, 'steps': 71188, 'loss/train': 1.6560399532318115} 11/07/2021 07:13:53 - INFO - __main__ - Step 71190: {'lr': 0.00027548639305059644, 'samples': 13668480, 'steps': 71189, 'loss/train': 1.9109901189804077} 11/07/2021 07:13:54 - INFO - __main__ - Step 71191: {'lr': 0.00027548111394876254, 'samples': 13668672, 'steps': 71190, 'loss/train': 1.334922432899475} 11/07/2021 07:13:54 - INFO - __main__ - Step 71192: {'lr': 0.00027547583483544726, 'samples': 13668864, 'steps': 71191, 'loss/train': 1.9113277196884155} 11/07/2021 07:13:54 - INFO - __main__ - Step 71193: {'lr': 0.0002754705557106529, 'samples': 13669056, 'steps': 71192, 'loss/train': 0.670158863067627} 11/07/2021 07:13:55 - INFO - __main__ - Step 71194: {'lr': 0.0002754652765743819, 'samples': 13669248, 'steps': 71193, 'loss/train': 1.1926214694976807} 11/07/2021 07:13:55 - INFO - __main__ - Step 71195: {'lr': 0.0002754599974266367, 'samples': 13669440, 'steps': 71194, 'loss/train': 1.512096643447876} 11/07/2021 07:13:56 - INFO - __main__ - Step 71196: {'lr': 0.0002754547182674195, 'samples': 13669632, 'steps': 71195, 'loss/train': 1.4675980806350708} 11/07/2021 07:13:56 - INFO - __main__ - Step 71197: {'lr': 0.0002754494390967329, 'samples': 13669824, 'steps': 71196, 'loss/train': 1.4596399068832397} 11/07/2021 07:13:57 - INFO - __main__ - Step 71198: {'lr': 0.0002754441599145792, 'samples': 13670016, 'steps': 71197, 'loss/train': 1.8789609670639038} 11/07/2021 07:13:57 - INFO - __main__ - Step 71199: {'lr': 0.00027543888072096076, 'samples': 13670208, 'steps': 71198, 'loss/train': 1.0459246635437012} 11/07/2021 07:13:57 - INFO - __main__ - Step 71200: {'lr': 0.00027543360151587986, 'samples': 13670400, 'steps': 71199, 'loss/train': 1.1710048913955688} 11/07/2021 07:13:59 - INFO - __main__ - Step 71201: {'lr': 0.000275428322299339, 'samples': 13670592, 'steps': 71200, 'loss/train': 1.5217573642730713} 11/07/2021 07:13:59 - INFO - __main__ - Step 71202: {'lr': 0.0002754230430713405, 'samples': 13670784, 'steps': 71201, 'loss/train': 1.1589515209197998} 11/07/2021 07:13:59 - INFO - __main__ - Step 71203: {'lr': 0.00027541776383188687, 'samples': 13670976, 'steps': 71202, 'loss/train': 1.6483842134475708} 11/07/2021 07:14:00 - INFO - __main__ - Step 71204: {'lr': 0.00027541248458098027, 'samples': 13671168, 'steps': 71203, 'loss/train': 1.2828232049942017} 11/07/2021 07:14:00 - INFO - __main__ - Step 71205: {'lr': 0.00027540720531862335, 'samples': 13671360, 'steps': 71204, 'loss/train': 1.5730358362197876} 11/07/2021 07:14:01 - INFO - __main__ - Step 71206: {'lr': 0.00027540192604481824, 'samples': 13671552, 'steps': 71205, 'loss/train': 1.1340441703796387} 11/07/2021 07:14:01 - INFO - __main__ - Step 71207: {'lr': 0.00027539664675956736, 'samples': 13671744, 'steps': 71206, 'loss/train': 1.4331053495407104} 11/07/2021 07:14:02 - INFO - __main__ - Step 71208: {'lr': 0.0002753913674628732, 'samples': 13671936, 'steps': 71207, 'loss/train': 1.1455373764038086} 11/07/2021 07:14:02 - INFO - __main__ - Step 71209: {'lr': 0.0002753860881547381, 'samples': 13672128, 'steps': 71208, 'loss/train': 1.714458703994751} 11/07/2021 07:14:02 - INFO - __main__ - Step 71210: {'lr': 0.0002753808088351644, 'samples': 13672320, 'steps': 71209, 'loss/train': 1.2485015392303467} 11/07/2021 07:14:03 - INFO - __main__ - Step 71211: {'lr': 0.0002753755295041545, 'samples': 13672512, 'steps': 71210, 'loss/train': 1.5392725467681885} 11/07/2021 07:14:04 - INFO - __main__ - Step 71212: {'lr': 0.0002753702501617108, 'samples': 13672704, 'steps': 71211, 'loss/train': 1.2129371166229248} 11/07/2021 07:14:04 - INFO - __main__ - Step 71213: {'lr': 0.0002753649708078357, 'samples': 13672896, 'steps': 71212, 'loss/train': 1.3456979990005493} 11/07/2021 07:14:04 - INFO - __main__ - Step 71214: {'lr': 0.0002753596914425314, 'samples': 13673088, 'steps': 71213, 'loss/train': 1.6356736421585083} 11/07/2021 07:14:05 - INFO - __main__ - Step 71215: {'lr': 0.0002753544120658005, 'samples': 13673280, 'steps': 71214, 'loss/train': 1.2244212627410889} 11/07/2021 07:14:06 - INFO - __main__ - Step 71216: {'lr': 0.0002753491326776453, 'samples': 13673472, 'steps': 71215, 'loss/train': 1.3279415369033813} 11/07/2021 07:14:06 - INFO - __main__ - Step 71217: {'lr': 0.0002753438532780681, 'samples': 13673664, 'steps': 71216, 'loss/train': 1.554599404335022} 11/07/2021 07:14:06 - INFO - __main__ - Step 71218: {'lr': 0.0002753385738670714, 'samples': 13673856, 'steps': 71217, 'loss/train': 0.26164931058883667} 11/07/2021 07:14:07 - INFO - __main__ - Step 71219: {'lr': 0.0002753332944446576, 'samples': 13674048, 'steps': 71218, 'loss/train': 1.152084231376648} 11/07/2021 07:14:07 - INFO - __main__ - Step 71220: {'lr': 0.00027532801501082893, 'samples': 13674240, 'steps': 71219, 'loss/train': 1.5421525239944458} 11/07/2021 07:14:08 - INFO - __main__ - Step 71221: {'lr': 0.00027532273556558787, 'samples': 13674432, 'steps': 71220, 'loss/train': 1.1347700357437134} 11/07/2021 07:14:08 - INFO - __main__ - Step 71222: {'lr': 0.0002753174561089367, 'samples': 13674624, 'steps': 71221, 'loss/train': 0.9958797693252563} 11/07/2021 07:14:09 - INFO - __main__ - Step 71223: {'lr': 0.000275312176640878, 'samples': 13674816, 'steps': 71222, 'loss/train': 1.64528489112854} 11/07/2021 07:14:09 - INFO - __main__ - Step 71224: {'lr': 0.00027530689716141396, 'samples': 13675008, 'steps': 71223, 'loss/train': 1.4085071086883545} 11/07/2021 07:14:09 - INFO - __main__ - Step 71225: {'lr': 0.0002753016176705471, 'samples': 13675200, 'steps': 71224, 'loss/train': 1.3623502254486084} 11/07/2021 07:14:10 - INFO - __main__ - Step 71226: {'lr': 0.0002752963381682796, 'samples': 13675392, 'steps': 71225, 'loss/train': 1.7736294269561768} 11/07/2021 07:14:11 - INFO - __main__ - Step 71227: {'lr': 0.000275291058654614, 'samples': 13675584, 'steps': 71226, 'loss/train': 0.6281478404998779} 11/07/2021 07:14:11 - INFO - __main__ - Step 71228: {'lr': 0.0002752857791295526, 'samples': 13675776, 'steps': 71227, 'loss/train': 1.2476314306259155} 11/07/2021 07:14:12 - INFO - __main__ - Step 71229: {'lr': 0.0002752804995930979, 'samples': 13675968, 'steps': 71228, 'loss/train': 1.3667768239974976} 11/07/2021 07:14:12 - INFO - __main__ - Step 71230: {'lr': 0.0002752752200452521, 'samples': 13676160, 'steps': 71229, 'loss/train': 0.14415650069713593} 11/07/2021 07:14:12 - INFO - __main__ - Step 71231: {'lr': 0.0002752699404860178, 'samples': 13676352, 'steps': 71230, 'loss/train': 0.9738665819168091} 11/07/2021 07:14:13 - INFO - __main__ - Step 71232: {'lr': 0.0002752646609153972, 'samples': 13676544, 'steps': 71231, 'loss/train': 5.6748223304748535} 11/07/2021 07:14:14 - INFO - __main__ - Step 71233: {'lr': 0.00027525938133339273, 'samples': 13676736, 'steps': 71232, 'loss/train': 1.650808572769165} 11/07/2021 07:14:14 - INFO - __main__ - Step 71234: {'lr': 0.0002752541017400068, 'samples': 13676928, 'steps': 71233, 'loss/train': 1.0810346603393555} 11/07/2021 07:14:14 - INFO - __main__ - Step 71235: {'lr': 0.0002752488221352417, 'samples': 13677120, 'steps': 71234, 'loss/train': 1.386172890663147} 11/07/2021 07:14:15 - INFO - __main__ - Step 71236: {'lr': 0.0002752435425190999, 'samples': 13677312, 'steps': 71235, 'loss/train': 1.2144358158111572} 11/07/2021 07:14:16 - INFO - __main__ - Step 71237: {'lr': 0.00027523826289158374, 'samples': 13677504, 'steps': 71236, 'loss/train': 1.8318077325820923} 11/07/2021 07:14:16 - INFO - __main__ - Step 71238: {'lr': 0.0002752329832526956, 'samples': 13677696, 'steps': 71237, 'loss/train': 1.3270611763000488} 11/07/2021 07:14:16 - INFO - __main__ - Step 71239: {'lr': 0.00027522770360243794, 'samples': 13677888, 'steps': 71238, 'loss/train': 1.1862934827804565} 11/07/2021 07:14:17 - INFO - __main__ - Step 71240: {'lr': 0.000275222423940813, 'samples': 13678080, 'steps': 71239, 'loss/train': 1.0797983407974243} 11/07/2021 07:14:17 - INFO - __main__ - Step 71241: {'lr': 0.0002752171442678232, 'samples': 13678272, 'steps': 71240, 'loss/train': 1.3730906248092651} 11/07/2021 07:14:18 - INFO - __main__ - Step 71242: {'lr': 0.00027521186458347104, 'samples': 13678464, 'steps': 71241, 'loss/train': 1.2351354360580444} 11/07/2021 07:14:18 - INFO - __main__ - Step 71243: {'lr': 0.00027520658488775873, 'samples': 13678656, 'steps': 71242, 'loss/train': 1.2118690013885498} 11/07/2021 07:14:19 - INFO - __main__ - Step 71244: {'lr': 0.00027520130518068875, 'samples': 13678848, 'steps': 71243, 'loss/train': 1.10621178150177} 11/07/2021 07:14:19 - INFO - __main__ - Step 71245: {'lr': 0.0002751960254622634, 'samples': 13679040, 'steps': 71244, 'loss/train': 0.7085923552513123} 11/07/2021 07:14:20 - INFO - __main__ - Step 71246: {'lr': 0.0002751907457324851, 'samples': 13679232, 'steps': 71245, 'loss/train': 1.4668402671813965} 11/07/2021 07:14:21 - INFO - __main__ - Step 71247: {'lr': 0.0002751854659913563, 'samples': 13679424, 'steps': 71246, 'loss/train': 1.1975603103637695} 11/07/2021 07:14:21 - INFO - __main__ - Step 71248: {'lr': 0.0002751801862388794, 'samples': 13679616, 'steps': 71247, 'loss/train': 1.6584171056747437} 11/07/2021 07:14:21 - INFO - __main__ - Step 71249: {'lr': 0.0002751749064750566, 'samples': 13679808, 'steps': 71248, 'loss/train': 1.377097487449646} 11/07/2021 07:14:22 - INFO - __main__ - Step 71250: {'lr': 0.0002751696266998903, 'samples': 13680000, 'steps': 71249, 'loss/train': 1.445909023284912} 11/07/2021 07:14:22 - INFO - __main__ - Step 71251: {'lr': 0.00027516434691338305, 'samples': 13680192, 'steps': 71250, 'loss/train': 1.2118644714355469} 11/07/2021 07:14:23 - INFO - __main__ - Step 71252: {'lr': 0.0002751590671155371, 'samples': 13680384, 'steps': 71251, 'loss/train': 1.53481924533844} 11/07/2021 07:14:23 - INFO - __main__ - Step 71253: {'lr': 0.0002751537873063549, 'samples': 13680576, 'steps': 71252, 'loss/train': 1.6973060369491577} 11/07/2021 07:14:24 - INFO - __main__ - Step 71254: {'lr': 0.0002751485074858388, 'samples': 13680768, 'steps': 71253, 'loss/train': 1.4353396892547607} 11/07/2021 07:14:24 - INFO - __main__ - Step 71255: {'lr': 0.00027514322765399114, 'samples': 13680960, 'steps': 71254, 'loss/train': 1.5591257810592651} 11/07/2021 07:14:24 - INFO - __main__ - Step 71256: {'lr': 0.0002751379478108143, 'samples': 13681152, 'steps': 71255, 'loss/train': 1.345511555671692} 11/07/2021 07:14:25 - INFO - __main__ - Step 71257: {'lr': 0.0002751326679563107, 'samples': 13681344, 'steps': 71256, 'loss/train': 1.398862600326538} 11/07/2021 07:14:26 - INFO - __main__ - Step 71258: {'lr': 0.0002751273880904827, 'samples': 13681536, 'steps': 71257, 'loss/train': 1.588982105255127} 11/07/2021 07:14:26 - INFO - __main__ - Step 71259: {'lr': 0.00027512210821333276, 'samples': 13681728, 'steps': 71258, 'loss/train': 1.5447810888290405} 11/07/2021 07:14:27 - INFO - __main__ - Step 71260: {'lr': 0.00027511682832486313, 'samples': 13681920, 'steps': 71259, 'loss/train': 1.7476873397827148} 11/07/2021 07:14:27 - INFO - __main__ - Step 71261: {'lr': 0.0002751115484250762, 'samples': 13682112, 'steps': 71260, 'loss/train': 1.4104074239730835} 11/07/2021 07:14:28 - INFO - __main__ - Step 71262: {'lr': 0.00027510626851397446, 'samples': 13682304, 'steps': 71261, 'loss/train': 0.5439274311065674} 11/07/2021 07:14:28 - INFO - __main__ - Step 71263: {'lr': 0.00027510098859156025, 'samples': 13682496, 'steps': 71262, 'loss/train': 0.8621560335159302} 11/07/2021 07:14:29 - INFO - __main__ - Step 71264: {'lr': 0.00027509570865783586, 'samples': 13682688, 'steps': 71263, 'loss/train': 1.223739743232727} 11/07/2021 07:14:29 - INFO - __main__ - Step 71265: {'lr': 0.0002750904287128037, 'samples': 13682880, 'steps': 71264, 'loss/train': 1.3795528411865234} 11/07/2021 07:14:29 - INFO - __main__ - Step 71266: {'lr': 0.0002750851487564663, 'samples': 13683072, 'steps': 71265, 'loss/train': 1.3831324577331543} 11/07/2021 07:14:30 - INFO - __main__ - Step 71267: {'lr': 0.00027507986878882583, 'samples': 13683264, 'steps': 71266, 'loss/train': 1.2287906408309937} 11/07/2021 07:14:31 - INFO - __main__ - Step 71268: {'lr': 0.0002750745888098848, 'samples': 13683456, 'steps': 71267, 'loss/train': 1.3571794033050537} 11/07/2021 07:14:31 - INFO - __main__ - Step 71269: {'lr': 0.0002750693088196455, 'samples': 13683648, 'steps': 71268, 'loss/train': 1.1986110210418701} 11/07/2021 07:14:31 - INFO - __main__ - Step 71270: {'lr': 0.0002750640288181104, 'samples': 13683840, 'steps': 71269, 'loss/train': 1.4580188989639282} 11/07/2021 07:14:32 - INFO - __main__ - Step 71271: {'lr': 0.0002750587488052818, 'samples': 13684032, 'steps': 71270, 'loss/train': 1.4041759967803955} 11/07/2021 07:14:32 - INFO - __main__ - Step 71272: {'lr': 0.00027505346878116215, 'samples': 13684224, 'steps': 71271, 'loss/train': 1.4243991374969482} 11/07/2021 07:14:33 - INFO - __main__ - Step 71273: {'lr': 0.0002750481887457538, 'samples': 13684416, 'steps': 71272, 'loss/train': 1.1998562812805176} 11/07/2021 07:14:33 - INFO - __main__ - Step 71274: {'lr': 0.00027504290869905906, 'samples': 13684608, 'steps': 71273, 'loss/train': 1.1181451082229614} 11/07/2021 07:14:34 - INFO - __main__ - Step 71275: {'lr': 0.0002750376286410804, 'samples': 13684800, 'steps': 71274, 'loss/train': 1.6393680572509766} 11/07/2021 07:14:34 - INFO - __main__ - Step 71276: {'lr': 0.0002750323485718202, 'samples': 13684992, 'steps': 71275, 'loss/train': 1.6019461154937744} 11/07/2021 07:14:34 - INFO - __main__ - Step 71277: {'lr': 0.0002750270684912808, 'samples': 13685184, 'steps': 71276, 'loss/train': 1.5317611694335938} 11/07/2021 07:14:35 - INFO - __main__ - Step 71278: {'lr': 0.0002750217883994645, 'samples': 13685376, 'steps': 71277, 'loss/train': 0.7202000617980957} 11/07/2021 07:14:36 - INFO - __main__ - Step 71279: {'lr': 0.0002750165082963739, 'samples': 13685568, 'steps': 71278, 'loss/train': 1.423991084098816} 11/07/2021 07:14:36 - INFO - __main__ - Step 71280: {'lr': 0.0002750112281820112, 'samples': 13685760, 'steps': 71279, 'loss/train': 1.3511883020401} 11/07/2021 07:14:36 - INFO - __main__ - Step 71281: {'lr': 0.0002750059480563788, 'samples': 13685952, 'steps': 71280, 'loss/train': 1.4809459447860718} 11/07/2021 07:14:37 - INFO - __main__ - Step 71282: {'lr': 0.00027500066791947913, 'samples': 13686144, 'steps': 71281, 'loss/train': 1.1902567148208618} 11/07/2021 07:14:38 - INFO - __main__ - Step 71283: {'lr': 0.00027499538777131456, 'samples': 13686336, 'steps': 71282, 'loss/train': 1.2181047201156616} 11/07/2021 07:14:39 - INFO - __main__ - Step 71284: {'lr': 0.0002749901076118874, 'samples': 13686528, 'steps': 71283, 'loss/train': 1.2431211471557617} 11/07/2021 07:14:39 - INFO - __main__ - Step 71285: {'lr': 0.0002749848274412001, 'samples': 13686720, 'steps': 71284, 'loss/train': 1.4597340822219849} 11/07/2021 07:14:39 - INFO - __main__ - Step 71286: {'lr': 0.0002749795472592551, 'samples': 13686912, 'steps': 71285, 'loss/train': 1.7652560472488403} 11/07/2021 07:14:40 - INFO - __main__ - Step 71287: {'lr': 0.00027497426706605464, 'samples': 13687104, 'steps': 71286, 'loss/train': 1.8118071556091309} 11/07/2021 07:14:40 - INFO - __main__ - Step 71288: {'lr': 0.0002749689868616012, 'samples': 13687296, 'steps': 71287, 'loss/train': 1.7770894765853882} 11/07/2021 07:14:41 - INFO - __main__ - Step 71289: {'lr': 0.00027496370664589705, 'samples': 13687488, 'steps': 71288, 'loss/train': 1.6161762475967407} 11/07/2021 07:14:41 - INFO - __main__ - Step 71290: {'lr': 0.00027495842641894465, 'samples': 13687680, 'steps': 71289, 'loss/train': 1.3794187307357788} 11/07/2021 07:14:42 - INFO - __main__ - Step 71291: {'lr': 0.0002749531461807464, 'samples': 13687872, 'steps': 71290, 'loss/train': 1.5145400762557983} 11/07/2021 07:14:42 - INFO - __main__ - Step 71292: {'lr': 0.0002749478659313047, 'samples': 13688064, 'steps': 71291, 'loss/train': 1.5211567878723145} 11/07/2021 07:14:42 - INFO - __main__ - Step 71293: {'lr': 0.0002749425856706217, 'samples': 13688256, 'steps': 71292, 'loss/train': 1.6882669925689697} 11/07/2021 07:14:43 - INFO - __main__ - Step 71294: {'lr': 0.00027493730539870014, 'samples': 13688448, 'steps': 71293, 'loss/train': 1.3691319227218628} 11/07/2021 07:14:44 - INFO - __main__ - Step 71295: {'lr': 0.0002749320251155421, 'samples': 13688640, 'steps': 71294, 'loss/train': 1.3087007999420166} 11/07/2021 07:14:44 - INFO - __main__ - Step 71296: {'lr': 0.00027492674482115017, 'samples': 13688832, 'steps': 71295, 'loss/train': 1.5296615362167358} 11/07/2021 07:14:44 - INFO - __main__ - Step 71297: {'lr': 0.00027492146451552654, 'samples': 13689024, 'steps': 71296, 'loss/train': 1.9317114353179932} 11/07/2021 07:14:45 - INFO - __main__ - Step 71298: {'lr': 0.0002749161841986737, 'samples': 13689216, 'steps': 71297, 'loss/train': 1.9395312070846558} 11/07/2021 07:14:46 - INFO - __main__ - Step 71299: {'lr': 0.0002749109038705941, 'samples': 13689408, 'steps': 71298, 'loss/train': 1.1901003122329712} 11/07/2021 07:14:46 - INFO - __main__ - Step 71300: {'lr': 0.00027490562353128995, 'samples': 13689600, 'steps': 71299, 'loss/train': 1.371579647064209} 11/07/2021 07:14:47 - INFO - __main__ - Step 71301: {'lr': 0.0002749003431807637, 'samples': 13689792, 'steps': 71300, 'loss/train': 0.840147078037262} 11/07/2021 07:14:47 - INFO - __main__ - Step 71302: {'lr': 0.00027489506281901777, 'samples': 13689984, 'steps': 71301, 'loss/train': 1.7857561111450195} 11/07/2021 07:14:47 - INFO - __main__ - Step 71303: {'lr': 0.0002748897824460545, 'samples': 13690176, 'steps': 71302, 'loss/train': 1.0043296813964844} 11/07/2021 07:14:48 - INFO - __main__ - Step 71304: {'lr': 0.0002748845020618763, 'samples': 13690368, 'steps': 71303, 'loss/train': 1.3400806188583374} 11/07/2021 07:14:49 - INFO - __main__ - Step 71305: {'lr': 0.00027487922166648547, 'samples': 13690560, 'steps': 71304, 'loss/train': 1.5405648946762085} 11/07/2021 07:14:49 - INFO - __main__ - Step 71306: {'lr': 0.00027487394125988456, 'samples': 13690752, 'steps': 71305, 'loss/train': 0.9905560612678528} 11/07/2021 07:14:49 - INFO - __main__ - Step 71307: {'lr': 0.0002748686608420757, 'samples': 13690944, 'steps': 71306, 'loss/train': 2.131312370300293} 11/07/2021 07:14:50 - INFO - __main__ - Step 71308: {'lr': 0.00027486338041306154, 'samples': 13691136, 'steps': 71307, 'loss/train': 1.092340350151062} 11/07/2021 07:14:50 - INFO - __main__ - Step 71309: {'lr': 0.00027485809997284424, 'samples': 13691328, 'steps': 71308, 'loss/train': 1.225180983543396} 11/07/2021 07:14:51 - INFO - __main__ - Step 71310: {'lr': 0.00027485281952142627, 'samples': 13691520, 'steps': 71309, 'loss/train': 1.4247628450393677} 11/07/2021 07:14:51 - INFO - __main__ - Step 71311: {'lr': 0.00027484753905881, 'samples': 13691712, 'steps': 71310, 'loss/train': 1.6895430088043213} 11/07/2021 07:14:52 - INFO - __main__ - Step 71312: {'lr': 0.0002748422585849978, 'samples': 13691904, 'steps': 71311, 'loss/train': 1.2218685150146484} 11/07/2021 07:14:52 - INFO - __main__ - Step 71313: {'lr': 0.00027483697809999215, 'samples': 13692096, 'steps': 71312, 'loss/train': 1.6329668760299683} 11/07/2021 07:14:52 - INFO - __main__ - Step 71314: {'lr': 0.0002748316976037952, 'samples': 13692288, 'steps': 71313, 'loss/train': 0.952396810054779} 11/07/2021 07:14:53 - INFO - __main__ - Step 71315: {'lr': 0.0002748264170964096, 'samples': 13692480, 'steps': 71314, 'loss/train': 1.4084107875823975} 11/07/2021 07:14:54 - INFO - __main__ - Step 71316: {'lr': 0.00027482113657783754, 'samples': 13692672, 'steps': 71315, 'loss/train': 1.2414281368255615} 11/07/2021 07:14:54 - INFO - __main__ - Step 71317: {'lr': 0.00027481585604808146, 'samples': 13692864, 'steps': 71316, 'loss/train': 1.3736810684204102} 11/07/2021 07:14:55 - INFO - __main__ - Step 71318: {'lr': 0.00027481057550714374, 'samples': 13693056, 'steps': 71317, 'loss/train': 0.46729776263237} 11/07/2021 07:14:55 - INFO - __main__ - Step 71319: {'lr': 0.00027480529495502675, 'samples': 13693248, 'steps': 71318, 'loss/train': 1.3398480415344238} 11/07/2021 07:14:56 - INFO - __main__ - Step 71320: {'lr': 0.00027480001439173293, 'samples': 13693440, 'steps': 71319, 'loss/train': 1.3863580226898193} 11/07/2021 07:14:56 - INFO - __main__ - Step 71321: {'lr': 0.0002747947338172646, 'samples': 13693632, 'steps': 71320, 'loss/train': 1.1052837371826172} 11/07/2021 07:14:57 - INFO - __main__ - Step 71322: {'lr': 0.000274789453231624, 'samples': 13693824, 'steps': 71321, 'loss/train': 0.9178846478462219} 11/07/2021 07:14:57 - INFO - __main__ - Step 71323: {'lr': 0.0002747841726348138, 'samples': 13694016, 'steps': 71322, 'loss/train': 1.5226112604141235} 11/07/2021 07:14:57 - INFO - __main__ - Step 71324: {'lr': 0.0002747788920268362, 'samples': 13694208, 'steps': 71323, 'loss/train': 1.2959723472595215} 11/07/2021 07:14:59 - INFO - __main__ - Step 71325: {'lr': 0.0002747736114076936, 'samples': 13694400, 'steps': 71324, 'loss/train': 1.242644190788269} 11/07/2021 07:14:59 - INFO - __main__ - Step 71326: {'lr': 0.00027476833077738844, 'samples': 13694592, 'steps': 71325, 'loss/train': 1.3505254983901978} 11/07/2021 07:14:59 - INFO - __main__ - Step 71327: {'lr': 0.000274763050135923, 'samples': 13694784, 'steps': 71326, 'loss/train': 1.0647388696670532} 11/07/2021 07:15:00 - INFO - __main__ - Step 71328: {'lr': 0.0002747577694832997, 'samples': 13694976, 'steps': 71327, 'loss/train': 1.1627129316329956} 11/07/2021 07:15:00 - INFO - __main__ - Step 71329: {'lr': 0.00027475248881952095, 'samples': 13695168, 'steps': 71328, 'loss/train': 1.3868937492370605} 11/07/2021 07:15:01 - INFO - __main__ - Step 71330: {'lr': 0.0002747472081445891, 'samples': 13695360, 'steps': 71329, 'loss/train': 1.584923267364502} 11/07/2021 07:15:01 - INFO - __main__ - Step 71331: {'lr': 0.0002747419274585066, 'samples': 13695552, 'steps': 71330, 'loss/train': 0.6930942535400391} 11/07/2021 07:15:02 - INFO - __main__ - Step 71332: {'lr': 0.00027473664676127575, 'samples': 13695744, 'steps': 71331, 'loss/train': 1.2024109363555908} 11/07/2021 07:15:02 - INFO - __main__ - Step 71333: {'lr': 0.00027473136605289894, 'samples': 13695936, 'steps': 71332, 'loss/train': 1.4425878524780273} 11/07/2021 07:15:02 - INFO - __main__ - Step 71334: {'lr': 0.0002747260853333786, 'samples': 13696128, 'steps': 71333, 'loss/train': 1.616387963294983} 11/07/2021 07:15:03 - INFO - __main__ - Step 71335: {'lr': 0.0002747208046027169, 'samples': 13696320, 'steps': 71334, 'loss/train': 1.3550148010253906} 11/07/2021 07:15:04 - INFO - __main__ - Step 71336: {'lr': 0.00027471552386091653, 'samples': 13696512, 'steps': 71335, 'loss/train': 1.4574288129806519} 11/07/2021 07:15:04 - INFO - __main__ - Step 71337: {'lr': 0.0002747102431079797, 'samples': 13696704, 'steps': 71336, 'loss/train': 1.4343287944793701} 11/07/2021 07:15:04 - INFO - __main__ - Step 71338: {'lr': 0.0002747049623439088, 'samples': 13696896, 'steps': 71337, 'loss/train': 1.4061259031295776} 11/07/2021 07:15:05 - INFO - __main__ - Step 71339: {'lr': 0.00027469968156870625, 'samples': 13697088, 'steps': 71338, 'loss/train': 1.4564284086227417} 11/07/2021 07:15:05 - INFO - __main__ - Step 71340: {'lr': 0.0002746944007823744, 'samples': 13697280, 'steps': 71339, 'loss/train': 1.5185664892196655} 11/07/2021 07:15:06 - INFO - __main__ - Step 71341: {'lr': 0.0002746891199849156, 'samples': 13697472, 'steps': 71340, 'loss/train': 1.4897022247314453} 11/07/2021 07:15:07 - INFO - __main__ - Step 71342: {'lr': 0.00027468383917633233, 'samples': 13697664, 'steps': 71341, 'loss/train': 1.7090519666671753} 11/07/2021 07:15:07 - INFO - __main__ - Step 71343: {'lr': 0.00027467855835662687, 'samples': 13697856, 'steps': 71342, 'loss/train': 1.5612808465957642} 11/07/2021 07:15:07 - INFO - __main__ - Step 71344: {'lr': 0.00027467327752580157, 'samples': 13698048, 'steps': 71343, 'loss/train': 0.951160192489624} 11/07/2021 07:15:08 - INFO - __main__ - Step 71345: {'lr': 0.00027466799668385896, 'samples': 13698240, 'steps': 71344, 'loss/train': 1.49055016040802} 11/07/2021 07:15:09 - INFO - __main__ - Step 71346: {'lr': 0.0002746627158308013, 'samples': 13698432, 'steps': 71345, 'loss/train': 1.6014786958694458} 11/07/2021 07:15:09 - INFO - __main__ - Step 71347: {'lr': 0.00027465743496663106, 'samples': 13698624, 'steps': 71346, 'loss/train': 0.8197924494743347} 11/07/2021 07:15:09 - INFO - __main__ - Step 71348: {'lr': 0.0002746521540913505, 'samples': 13698816, 'steps': 71347, 'loss/train': 1.241744875907898} 11/07/2021 07:15:10 - INFO - __main__ - Step 71349: {'lr': 0.00027464687320496203, 'samples': 13699008, 'steps': 71348, 'loss/train': 1.2186650037765503} 11/07/2021 07:15:10 - INFO - __main__ - Step 71350: {'lr': 0.00027464159230746805, 'samples': 13699200, 'steps': 71349, 'loss/train': 1.810796856880188} 11/07/2021 07:15:11 - INFO - __main__ - Step 71351: {'lr': 0.00027463631139887097, 'samples': 13699392, 'steps': 71350, 'loss/train': 2.0005369186401367} 11/07/2021 07:15:12 - INFO - __main__ - Step 71352: {'lr': 0.0002746310304791732, 'samples': 13699584, 'steps': 71351, 'loss/train': 1.5896035432815552} 11/07/2021 07:15:12 - INFO - __main__ - Step 71353: {'lr': 0.00027462574954837705, 'samples': 13699776, 'steps': 71352, 'loss/train': 1.1732096672058105} 11/07/2021 07:15:12 - INFO - __main__ - Step 71354: {'lr': 0.0002746204686064849, 'samples': 13699968, 'steps': 71353, 'loss/train': 1.256343960762024} 11/07/2021 07:15:13 - INFO - __main__ - Step 71355: {'lr': 0.00027461518765349916, 'samples': 13700160, 'steps': 71354, 'loss/train': 1.2337709665298462} 11/07/2021 07:15:13 - INFO - __main__ - Step 71356: {'lr': 0.00027460990668942215, 'samples': 13700352, 'steps': 71355, 'loss/train': 1.7051080465316772} 11/07/2021 07:15:14 - INFO - __main__ - Step 71357: {'lr': 0.0002746046257142563, 'samples': 13700544, 'steps': 71356, 'loss/train': 1.3671813011169434} 11/07/2021 07:15:14 - INFO - __main__ - Step 71358: {'lr': 0.000274599344728004, 'samples': 13700736, 'steps': 71357, 'loss/train': 2.1497795581817627} 11/07/2021 07:15:15 - INFO - __main__ - Step 71359: {'lr': 0.00027459406373066763, 'samples': 13700928, 'steps': 71358, 'loss/train': 1.1068648099899292} 11/07/2021 07:15:15 - INFO - __main__ - Step 71360: {'lr': 0.0002745887827222496, 'samples': 13701120, 'steps': 71359, 'loss/train': 1.3281300067901611} 11/07/2021 07:15:15 - INFO - __main__ - Step 71361: {'lr': 0.0002745835017027522, 'samples': 13701312, 'steps': 71360, 'loss/train': 1.4353829622268677} 11/07/2021 07:15:16 - INFO - __main__ - Step 71362: {'lr': 0.00027457822067217784, 'samples': 13701504, 'steps': 71361, 'loss/train': 1.8424196243286133} 11/07/2021 07:15:17 - INFO - __main__ - Step 71363: {'lr': 0.00027457293963052893, 'samples': 13701696, 'steps': 71362, 'loss/train': 1.5699721574783325} 11/07/2021 07:15:17 - INFO - __main__ - Step 71364: {'lr': 0.0002745676585778078, 'samples': 13701888, 'steps': 71363, 'loss/train': 1.5187504291534424} 11/07/2021 07:15:18 - INFO - __main__ - Step 71365: {'lr': 0.0002745623775140169, 'samples': 13702080, 'steps': 71364, 'loss/train': 1.7420421838760376} 11/07/2021 07:15:18 - INFO - __main__ - Step 71366: {'lr': 0.0002745570964391586, 'samples': 13702272, 'steps': 71365, 'loss/train': 1.300134301185608} 11/07/2021 07:15:19 - INFO - __main__ - Step 71367: {'lr': 0.0002745518153532352, 'samples': 13702464, 'steps': 71366, 'loss/train': 1.5075318813323975} 11/07/2021 07:15:19 - INFO - __main__ - Step 71368: {'lr': 0.00027454653425624913, 'samples': 13702656, 'steps': 71367, 'loss/train': 0.6678598523139954} 11/07/2021 07:15:20 - INFO - __main__ - Step 71369: {'lr': 0.00027454125314820276, 'samples': 13702848, 'steps': 71368, 'loss/train': 0.9889968037605286} 11/07/2021 07:15:20 - INFO - __main__ - Step 71370: {'lr': 0.0002745359720290985, 'samples': 13703040, 'steps': 71369, 'loss/train': 1.7523322105407715} 11/07/2021 07:15:20 - INFO - __main__ - Step 71371: {'lr': 0.0002745306908989388, 'samples': 13703232, 'steps': 71370, 'loss/train': 1.6049296855926514} 11/07/2021 07:15:21 - INFO - __main__ - Step 71372: {'lr': 0.0002745254097577258, 'samples': 13703424, 'steps': 71371, 'loss/train': 1.5478973388671875} 11/07/2021 07:15:22 - INFO - __main__ - Step 71373: {'lr': 0.0002745201286054621, 'samples': 13703616, 'steps': 71372, 'loss/train': 1.3848745822906494} 11/07/2021 07:15:22 - INFO - __main__ - Step 71374: {'lr': 0.00027451484744215, 'samples': 13703808, 'steps': 71373, 'loss/train': 1.0655866861343384} 11/07/2021 07:15:22 - INFO - __main__ - Step 71375: {'lr': 0.00027450956626779186, 'samples': 13704000, 'steps': 71374, 'loss/train': 1.2443259954452515} 11/07/2021 07:15:23 - INFO - __main__ - Step 71376: {'lr': 0.0002745042850823902, 'samples': 13704192, 'steps': 71375, 'loss/train': 1.6070228815078735} 11/07/2021 07:15:24 - INFO - __main__ - Step 71377: {'lr': 0.00027449900388594716, 'samples': 13704384, 'steps': 71376, 'loss/train': 0.5277307629585266} 11/07/2021 07:15:24 - INFO - __main__ - Step 71378: {'lr': 0.0002744937226784653, 'samples': 13704576, 'steps': 71377, 'loss/train': 1.8773730993270874} 11/07/2021 07:15:24 - INFO - __main__ - Step 71379: {'lr': 0.00027448844145994697, 'samples': 13704768, 'steps': 71378, 'loss/train': 1.8165336847305298} 11/07/2021 07:15:25 - INFO - __main__ - Step 71380: {'lr': 0.00027448316023039444, 'samples': 13704960, 'steps': 71379, 'loss/train': 1.2282015085220337} 11/07/2021 07:15:25 - INFO - __main__ - Step 71381: {'lr': 0.00027447787898981027, 'samples': 13705152, 'steps': 71380, 'loss/train': 1.4385569095611572} 11/07/2021 07:15:27 - INFO - __main__ - Step 71382: {'lr': 0.0002744725977381967, 'samples': 13705344, 'steps': 71381, 'loss/train': 1.0971953868865967} 11/07/2021 07:15:27 - INFO - __main__ - Step 71383: {'lr': 0.0002744673164755562, 'samples': 13705536, 'steps': 71382, 'loss/train': 1.185634970664978} 11/07/2021 07:15:28 - INFO - __main__ - Step 71384: {'lr': 0.000274462035201891, 'samples': 13705728, 'steps': 71383, 'loss/train': 1.977835774421692} 11/07/2021 07:15:28 - INFO - __main__ - Step 71385: {'lr': 0.00027445675391720364, 'samples': 13705920, 'steps': 71384, 'loss/train': 1.853558897972107} 11/07/2021 07:15:28 - INFO - __main__ - Step 71386: {'lr': 0.00027445147262149646, 'samples': 13706112, 'steps': 71385, 'loss/train': 1.6707097291946411} 11/07/2021 07:15:29 - INFO - __main__ - Step 71387: {'lr': 0.0002744461913147719, 'samples': 13706304, 'steps': 71386, 'loss/train': 0.7564249634742737} 11/07/2021 07:15:29 - INFO - __main__ - Step 71388: {'lr': 0.00027444090999703214, 'samples': 13706496, 'steps': 71387, 'loss/train': 1.3912098407745361} 11/07/2021 07:15:29 - INFO - __main__ - Step 71389: {'lr': 0.0002744356286682797, 'samples': 13706688, 'steps': 71388, 'loss/train': 1.383608341217041} 11/07/2021 07:15:31 - INFO - __main__ - Step 71390: {'lr': 0.00027443034732851695, 'samples': 13706880, 'steps': 71389, 'loss/train': 1.918353796005249} 11/07/2021 07:15:31 - INFO - __main__ - Step 71391: {'lr': 0.0002744250659777463, 'samples': 13707072, 'steps': 71390, 'loss/train': 1.644439935684204} 11/07/2021 07:15:31 - INFO - __main__ - Step 71392: {'lr': 0.00027441978461597004, 'samples': 13707264, 'steps': 71391, 'loss/train': 1.5158100128173828} 11/07/2021 07:15:32 - INFO - __main__ - Step 71393: {'lr': 0.00027441450324319067, 'samples': 13707456, 'steps': 71392, 'loss/train': 1.3935070037841797} 11/07/2021 07:15:32 - INFO - __main__ - Step 71394: {'lr': 0.0002744092218594105, 'samples': 13707648, 'steps': 71393, 'loss/train': 1.7302964925765991} 11/07/2021 07:15:33 - INFO - __main__ - Step 71395: {'lr': 0.00027440394046463184, 'samples': 13707840, 'steps': 71394, 'loss/train': 1.322864055633545} 11/07/2021 07:15:33 - INFO - __main__ - Step 71396: {'lr': 0.0002743986590588572, 'samples': 13708032, 'steps': 71395, 'loss/train': 1.571046233177185} 11/07/2021 07:15:34 - INFO - __main__ - Step 71397: {'lr': 0.0002743933776420888, 'samples': 13708224, 'steps': 71396, 'loss/train': 1.3234783411026} 11/07/2021 07:15:34 - INFO - __main__ - Step 71398: {'lr': 0.0002743880962143292, 'samples': 13708416, 'steps': 71397, 'loss/train': 1.607382893562317} 11/07/2021 07:15:34 - INFO - __main__ - Step 71399: {'lr': 0.0002743828147755807, 'samples': 13708608, 'steps': 71398, 'loss/train': 1.5959231853485107} 11/07/2021 07:15:35 - INFO - __main__ - Step 71400: {'lr': 0.00027437753332584575, 'samples': 13708800, 'steps': 71399, 'loss/train': 1.5171130895614624} 11/07/2021 07:15:36 - INFO - __main__ - Step 71401: {'lr': 0.00027437225186512657, 'samples': 13708992, 'steps': 71400, 'loss/train': 1.2743990421295166} 11/07/2021 07:15:36 - INFO - __main__ - Step 71402: {'lr': 0.0002743669703934256, 'samples': 13709184, 'steps': 71401, 'loss/train': 1.9541758298873901} 11/07/2021 07:15:36 - INFO - __main__ - Step 71403: {'lr': 0.00027436168891074533, 'samples': 13709376, 'steps': 71402, 'loss/train': 0.8949573040008545} 11/07/2021 07:15:37 - INFO - __main__ - Step 71404: {'lr': 0.000274356407417088, 'samples': 13709568, 'steps': 71403, 'loss/train': 1.6382179260253906} 11/07/2021 07:15:38 - INFO - __main__ - Step 71405: {'lr': 0.00027435112591245607, 'samples': 13709760, 'steps': 71404, 'loss/train': 1.342519998550415} 11/07/2021 07:15:38 - INFO - __main__ - Step 71406: {'lr': 0.0002743458443968519, 'samples': 13709952, 'steps': 71405, 'loss/train': 1.2718549966812134} 11/07/2021 07:15:39 - INFO - __main__ - Step 71407: {'lr': 0.0002743405628702779, 'samples': 13710144, 'steps': 71406, 'loss/train': 1.3163485527038574} 11/07/2021 07:15:39 - INFO - __main__ - Step 71408: {'lr': 0.0002743352813327364, 'samples': 13710336, 'steps': 71407, 'loss/train': 2.078925132751465} 11/07/2021 07:15:39 - INFO - __main__ - Step 71409: {'lr': 0.0002743299997842297, 'samples': 13710528, 'steps': 71408, 'loss/train': 1.896649718284607} 11/07/2021 07:15:40 - INFO - __main__ - Step 71410: {'lr': 0.0002743247182247604, 'samples': 13710720, 'steps': 71409, 'loss/train': 0.9487393498420715} 11/07/2021 07:15:41 - INFO - __main__ - Step 71411: {'lr': 0.0002743194366543307, 'samples': 13710912, 'steps': 71410, 'loss/train': 1.0947136878967285} 11/07/2021 07:15:41 - INFO - __main__ - Step 71412: {'lr': 0.00027431415507294304, 'samples': 13711104, 'steps': 71411, 'loss/train': 1.4967085123062134} 11/07/2021 07:15:41 - INFO - __main__ - Step 71413: {'lr': 0.00027430887348059993, 'samples': 13711296, 'steps': 71412, 'loss/train': 1.3504605293273926} 11/07/2021 07:15:42 - INFO - __main__ - Step 71414: {'lr': 0.00027430359187730345, 'samples': 13711488, 'steps': 71413, 'loss/train': 1.3461675643920898} 11/07/2021 07:15:42 - INFO - __main__ - Step 71415: {'lr': 0.0002742983102630562, 'samples': 13711680, 'steps': 71414, 'loss/train': 1.6683109998703003} 11/07/2021 07:15:43 - INFO - __main__ - Step 71416: {'lr': 0.00027429302863786047, 'samples': 13711872, 'steps': 71415, 'loss/train': 1.3871798515319824} 11/07/2021 07:15:43 - INFO - __main__ - Step 71417: {'lr': 0.0002742877470017187, 'samples': 13712064, 'steps': 71416, 'loss/train': 2.567373514175415} 11/07/2021 07:15:44 - INFO - __main__ - Step 71418: {'lr': 0.00027428246535463323, 'samples': 13712256, 'steps': 71417, 'loss/train': 1.0842399597167969} 11/07/2021 07:15:44 - INFO - __main__ - Step 71419: {'lr': 0.0002742771836966065, 'samples': 13712448, 'steps': 71418, 'loss/train': 1.356150507926941} 11/07/2021 07:15:44 - INFO - __main__ - Step 71420: {'lr': 0.0002742719020276409, 'samples': 13712640, 'steps': 71419, 'loss/train': 1.1895147562026978} 11/07/2021 07:15:46 - INFO - __main__ - Step 71421: {'lr': 0.0002742666203477386, 'samples': 13712832, 'steps': 71420, 'loss/train': 1.0395863056182861} 11/07/2021 07:15:46 - INFO - __main__ - Step 71422: {'lr': 0.0002742613386569023, 'samples': 13713024, 'steps': 71421, 'loss/train': 1.6607599258422852} 11/07/2021 07:15:46 - INFO - __main__ - Step 71423: {'lr': 0.0002742560569551341, 'samples': 13713216, 'steps': 71422, 'loss/train': 1.4358278512954712} 11/07/2021 07:15:47 - INFO - __main__ - Step 71424: {'lr': 0.0002742507752424365, 'samples': 13713408, 'steps': 71423, 'loss/train': 1.4380711317062378} 11/07/2021 07:15:47 - INFO - __main__ - Step 71425: {'lr': 0.0002742454935188119, 'samples': 13713600, 'steps': 71424, 'loss/train': 1.3775399923324585} 11/07/2021 07:15:48 - INFO - __main__ - Step 71426: {'lr': 0.0002742402117842627, 'samples': 13713792, 'steps': 71425, 'loss/train': 1.6085941791534424} 11/07/2021 07:15:48 - INFO - __main__ - Step 71427: {'lr': 0.0002742349300387912, 'samples': 13713984, 'steps': 71426, 'loss/train': 1.4481743574142456} 11/07/2021 07:15:49 - INFO - __main__ - Step 71428: {'lr': 0.0002742296482823998, 'samples': 13714176, 'steps': 71427, 'loss/train': 1.4708694219589233} 11/07/2021 07:15:49 - INFO - __main__ - Step 71429: {'lr': 0.00027422436651509084, 'samples': 13714368, 'steps': 71428, 'loss/train': 2.118413209915161} 11/07/2021 07:15:49 - INFO - __main__ - Step 71430: {'lr': 0.00027421908473686685, 'samples': 13714560, 'steps': 71429, 'loss/train': 1.9580496549606323} 11/07/2021 07:15:50 - INFO - __main__ - Step 71431: {'lr': 0.0002742138029477301, 'samples': 13714752, 'steps': 71430, 'loss/train': 1.1286547183990479} 11/07/2021 07:15:51 - INFO - __main__ - Step 71432: {'lr': 0.0002742085211476829, 'samples': 13714944, 'steps': 71431, 'loss/train': 1.8267995119094849} 11/07/2021 07:15:51 - INFO - __main__ - Step 71433: {'lr': 0.0002742032393367278, 'samples': 13715136, 'steps': 71432, 'loss/train': 1.422378420829773} 11/07/2021 07:15:52 - INFO - __main__ - Step 71434: {'lr': 0.0002741979575148671, 'samples': 13715328, 'steps': 71433, 'loss/train': 1.3178259134292603} 11/07/2021 07:15:52 - INFO - __main__ - Step 71435: {'lr': 0.00027419267568210313, 'samples': 13715520, 'steps': 71434, 'loss/train': 1.2319822311401367} 11/07/2021 07:15:53 - INFO - __main__ - Step 71436: {'lr': 0.0002741873938384383, 'samples': 13715712, 'steps': 71435, 'loss/train': 0.0952538251876831} 11/07/2021 07:15:53 - INFO - __main__ - Step 71437: {'lr': 0.00027418211198387507, 'samples': 13715904, 'steps': 71436, 'loss/train': 1.4723689556121826} 11/07/2021 07:15:54 - INFO - __main__ - Step 71438: {'lr': 0.0002741768301184157, 'samples': 13716096, 'steps': 71437, 'loss/train': 1.2056556940078735} 11/07/2021 07:15:54 - INFO - __main__ - Step 71439: {'lr': 0.0002741715482420626, 'samples': 13716288, 'steps': 71438, 'loss/train': 1.4360625743865967} 11/07/2021 07:15:54 - INFO - __main__ - Step 71440: {'lr': 0.0002741662663548183, 'samples': 13716480, 'steps': 71439, 'loss/train': 1.2290204763412476} 11/07/2021 07:15:55 - INFO - __main__ - Step 71441: {'lr': 0.00027416098445668497, 'samples': 13716672, 'steps': 71440, 'loss/train': 1.3833037614822388} 11/07/2021 07:15:56 - INFO - __main__ - Step 71442: {'lr': 0.00027415570254766506, 'samples': 13716864, 'steps': 71441, 'loss/train': 1.5993765592575073} 11/07/2021 07:15:56 - INFO - __main__ - Step 71443: {'lr': 0.00027415042062776094, 'samples': 13717056, 'steps': 71442, 'loss/train': 1.4908348321914673} 11/07/2021 07:15:57 - INFO - __main__ - Step 71444: {'lr': 0.0002741451386969751, 'samples': 13717248, 'steps': 71443, 'loss/train': 1.3243712186813354} 11/07/2021 07:15:57 - INFO - __main__ - Step 71445: {'lr': 0.0002741398567553097, 'samples': 13717440, 'steps': 71444, 'loss/train': 1.435436487197876} 11/07/2021 07:15:57 - INFO - __main__ - Step 71446: {'lr': 0.00027413457480276733, 'samples': 13717632, 'steps': 71445, 'loss/train': 1.037746787071228} 11/07/2021 07:15:58 - INFO - __main__ - Step 71447: {'lr': 0.00027412929283935033, 'samples': 13717824, 'steps': 71446, 'loss/train': 1.4467499256134033} 11/07/2021 07:15:59 - INFO - __main__ - Step 71448: {'lr': 0.000274124010865061, 'samples': 13718016, 'steps': 71447, 'loss/train': 1.5819350481033325} 11/07/2021 07:15:59 - INFO - __main__ - Step 71449: {'lr': 0.00027411872887990175, 'samples': 13718208, 'steps': 71448, 'loss/train': 1.5322790145874023} 11/07/2021 07:15:59 - INFO - __main__ - Step 71450: {'lr': 0.000274113446883875, 'samples': 13718400, 'steps': 71449, 'loss/train': 0.967171311378479} 11/07/2021 07:16:00 - INFO - __main__ - Step 71451: {'lr': 0.00027410816487698306, 'samples': 13718592, 'steps': 71450, 'loss/train': 1.1777946949005127} 11/07/2021 07:16:01 - INFO - __main__ - Step 71452: {'lr': 0.0002741028828592284, 'samples': 13718784, 'steps': 71451, 'loss/train': 1.6233078241348267} 11/07/2021 07:16:01 - INFO - __main__ - Step 71453: {'lr': 0.00027409760083061335, 'samples': 13718976, 'steps': 71452, 'loss/train': 1.7838478088378906} 11/07/2021 07:16:01 - INFO - __main__ - Step 71454: {'lr': 0.0002740923187911403, 'samples': 13719168, 'steps': 71453, 'loss/train': 1.1441189050674438} 11/07/2021 07:16:02 - INFO - __main__ - Step 71455: {'lr': 0.0002740870367408116, 'samples': 13719360, 'steps': 71454, 'loss/train': 1.2753428220748901} 11/07/2021 07:16:02 - INFO - __main__ - Step 71456: {'lr': 0.0002740817546796297, 'samples': 13719552, 'steps': 71455, 'loss/train': 1.2974886894226074} 11/07/2021 07:16:03 - INFO - __main__ - Step 71457: {'lr': 0.0002740764726075969, 'samples': 13719744, 'steps': 71456, 'loss/train': 1.68586266040802} 11/07/2021 07:16:04 - INFO - __main__ - Step 71458: {'lr': 0.00027407119052471555, 'samples': 13719936, 'steps': 71457, 'loss/train': 1.3841911554336548} 11/07/2021 07:16:04 - INFO - __main__ - Step 71459: {'lr': 0.0002740659084309882, 'samples': 13720128, 'steps': 71458, 'loss/train': 1.2481576204299927} 11/07/2021 07:16:04 - INFO - __main__ - Step 71460: {'lr': 0.000274060626326417, 'samples': 13720320, 'steps': 71459, 'loss/train': 0.12151119112968445} 11/07/2021 07:16:05 - INFO - __main__ - Step 71461: {'lr': 0.0002740553442110046, 'samples': 13720512, 'steps': 71460, 'loss/train': 1.2746490240097046} 11/07/2021 07:16:06 - INFO - __main__ - Step 71462: {'lr': 0.00027405006208475316, 'samples': 13720704, 'steps': 71461, 'loss/train': 1.4728307723999023} 11/07/2021 07:16:06 - INFO - __main__ - Step 71463: {'lr': 0.00027404477994766514, 'samples': 13720896, 'steps': 71462, 'loss/train': 1.3623332977294922} 11/07/2021 07:16:07 - INFO - __main__ - Step 71464: {'lr': 0.00027403949779974284, 'samples': 13721088, 'steps': 71463, 'loss/train': 1.1182880401611328} 11/07/2021 07:16:07 - INFO - __main__ - Step 71465: {'lr': 0.0002740342156409888, 'samples': 13721280, 'steps': 71464, 'loss/train': 1.6431005001068115} 11/07/2021 07:16:07 - INFO - __main__ - Step 71466: {'lr': 0.00027402893347140526, 'samples': 13721472, 'steps': 71465, 'loss/train': 1.1134449243545532} 11/07/2021 07:16:09 - INFO - __main__ - Step 71467: {'lr': 0.00027402365129099474, 'samples': 13721664, 'steps': 71466, 'loss/train': 1.9045418500900269} 11/07/2021 07:16:09 - INFO - __main__ - Step 71468: {'lr': 0.00027401836909975944, 'samples': 13721856, 'steps': 71467, 'loss/train': 1.6549299955368042} 11/07/2021 07:16:09 - INFO - __main__ - Step 71469: {'lr': 0.0002740130868977019, 'samples': 13722048, 'steps': 71468, 'loss/train': 1.3986408710479736} 11/07/2021 07:16:10 - INFO - __main__ - Step 71470: {'lr': 0.0002740078046848244, 'samples': 13722240, 'steps': 71469, 'loss/train': 1.5062975883483887} 11/07/2021 07:16:10 - INFO - __main__ - Step 71471: {'lr': 0.00027400252246112934, 'samples': 13722432, 'steps': 71470, 'loss/train': 0.9942231774330139} 11/07/2021 07:16:10 - INFO - __main__ - Step 71472: {'lr': 0.00027399724022661914, 'samples': 13722624, 'steps': 71471, 'loss/train': 1.2770558595657349} 11/07/2021 07:16:11 - INFO - __main__ - Step 71473: {'lr': 0.00027399195798129614, 'samples': 13722816, 'steps': 71472, 'loss/train': 0.9044551849365234} 11/07/2021 07:16:12 - INFO - __main__ - Step 71474: {'lr': 0.00027398667572516277, 'samples': 13723008, 'steps': 71473, 'loss/train': 1.8345050811767578} 11/07/2021 07:16:12 - INFO - __main__ - Step 71475: {'lr': 0.00027398139345822137, 'samples': 13723200, 'steps': 71474, 'loss/train': 0.8272629976272583} 11/07/2021 07:16:12 - INFO - __main__ - Step 71476: {'lr': 0.00027397611118047427, 'samples': 13723392, 'steps': 71475, 'loss/train': 1.6260886192321777} 11/07/2021 07:16:13 - INFO - __main__ - Step 71477: {'lr': 0.0002739708288919239, 'samples': 13723584, 'steps': 71476, 'loss/train': 1.454353928565979} 11/07/2021 07:16:14 - INFO - __main__ - Step 71478: {'lr': 0.00027396554659257273, 'samples': 13723776, 'steps': 71477, 'loss/train': 1.2934911251068115} 11/07/2021 07:16:14 - INFO - __main__ - Step 71479: {'lr': 0.000273960264282423, 'samples': 13723968, 'steps': 71478, 'loss/train': 1.2226194143295288} 11/07/2021 07:16:15 - INFO - __main__ - Step 71480: {'lr': 0.0002739549819614771, 'samples': 13724160, 'steps': 71479, 'loss/train': 1.2395551204681396} 11/07/2021 07:16:15 - INFO - __main__ - Step 71481: {'lr': 0.00027394969962973756, 'samples': 13724352, 'steps': 71480, 'loss/train': 1.5443366765975952} 11/07/2021 07:16:15 - INFO - __main__ - Step 71482: {'lr': 0.0002739444172872066, 'samples': 13724544, 'steps': 71481, 'loss/train': 1.5558714866638184} 11/07/2021 07:16:16 - INFO - __main__ - Step 71483: {'lr': 0.0002739391349338866, 'samples': 13724736, 'steps': 71482, 'loss/train': 1.293716311454773} 11/07/2021 07:16:17 - INFO - __main__ - Step 71484: {'lr': 0.00027393385256978004, 'samples': 13724928, 'steps': 71483, 'loss/train': 1.334695816040039} 11/07/2021 07:16:17 - INFO - __main__ - Step 71485: {'lr': 0.00027392857019488924, 'samples': 13725120, 'steps': 71484, 'loss/train': 1.6655378341674805} 11/07/2021 07:16:17 - INFO - __main__ - Step 71486: {'lr': 0.00027392328780921664, 'samples': 13725312, 'steps': 71485, 'loss/train': 1.3145780563354492} 11/07/2021 07:16:18 - INFO - __main__ - Step 71487: {'lr': 0.00027391800541276464, 'samples': 13725504, 'steps': 71486, 'loss/train': 1.5563818216323853} 11/07/2021 07:16:19 - INFO - __main__ - Step 71488: {'lr': 0.00027391272300553545, 'samples': 13725696, 'steps': 71487, 'loss/train': 1.5327627658843994} 11/07/2021 07:16:19 - INFO - __main__ - Step 71489: {'lr': 0.0002739074405875315, 'samples': 13725888, 'steps': 71488, 'loss/train': 1.6024706363677979} 11/07/2021 07:16:19 - INFO - __main__ - Step 71490: {'lr': 0.0002739021581587554, 'samples': 13726080, 'steps': 71489, 'loss/train': 1.7251129150390625} 11/07/2021 07:16:20 - INFO - __main__ - Step 71491: {'lr': 0.0002738968757192092, 'samples': 13726272, 'steps': 71490, 'loss/train': 1.7450729608535767} 11/07/2021 07:16:20 - INFO - __main__ - Step 71492: {'lr': 0.00027389159326889545, 'samples': 13726464, 'steps': 71491, 'loss/train': 1.9571141004562378} 11/07/2021 07:16:21 - INFO - __main__ - Step 71493: {'lr': 0.0002738863108078166, 'samples': 13726656, 'steps': 71492, 'loss/train': 1.093753695487976} 11/07/2021 07:16:21 - INFO - __main__ - Step 71494: {'lr': 0.00027388102833597497, 'samples': 13726848, 'steps': 71493, 'loss/train': 1.5271574258804321} 11/07/2021 07:16:22 - INFO - __main__ - Step 71495: {'lr': 0.0002738757458533728, 'samples': 13727040, 'steps': 71494, 'loss/train': 1.7259454727172852} 11/07/2021 07:16:22 - INFO - __main__ - Step 71496: {'lr': 0.00027387046336001264, 'samples': 13727232, 'steps': 71495, 'loss/train': 1.4052928686141968} 11/07/2021 07:16:23 - INFO - __main__ - Step 71497: {'lr': 0.0002738651808558968, 'samples': 13727424, 'steps': 71496, 'loss/train': 1.0230106115341187} 11/07/2021 07:16:24 - INFO - __main__ - Step 71498: {'lr': 0.0002738598983410277, 'samples': 13727616, 'steps': 71497, 'loss/train': 1.3156508207321167} 11/07/2021 07:16:24 - INFO - __main__ - Step 71499: {'lr': 0.0002738546158154077, 'samples': 13727808, 'steps': 71498, 'loss/train': 1.430515170097351} 11/07/2021 07:16:24 - INFO - __main__ - Step 71500: {'lr': 0.00027384933327903924, 'samples': 13728000, 'steps': 71499, 'loss/train': 1.236210584640503} 11/07/2021 07:16:25 - INFO - __main__ - Step 71501: {'lr': 0.00027384405073192455, 'samples': 13728192, 'steps': 71500, 'loss/train': 0.9101112484931946} 11/07/2021 07:16:25 - INFO - __main__ - Step 71502: {'lr': 0.0002738387681740661, 'samples': 13728384, 'steps': 71501, 'loss/train': 2.01837158203125} 11/07/2021 07:16:25 - INFO - __main__ - Step 71503: {'lr': 0.0002738334856054663, 'samples': 13728576, 'steps': 71502, 'loss/train': 1.0450608730316162} 11/07/2021 07:16:26 - INFO - __main__ - Step 71504: {'lr': 0.0002738282030261274, 'samples': 13728768, 'steps': 71503, 'loss/train': 1.7995219230651855} 11/07/2021 07:16:27 - INFO - __main__ - Step 71505: {'lr': 0.00027382292043605204, 'samples': 13728960, 'steps': 71504, 'loss/train': 1.540338158607483} 11/07/2021 07:16:27 - INFO - __main__ - Step 71506: {'lr': 0.0002738176378352424, 'samples': 13729152, 'steps': 71505, 'loss/train': 1.6315492391586304} 11/07/2021 07:16:27 - INFO - __main__ - Step 71507: {'lr': 0.00027381235522370084, 'samples': 13729344, 'steps': 71506, 'loss/train': 1.5987588167190552} 11/07/2021 07:16:28 - INFO - __main__ - Step 71508: {'lr': 0.00027380707260142985, 'samples': 13729536, 'steps': 71507, 'loss/train': 2.1876039505004883} 11/07/2021 07:16:29 - INFO - __main__ - Step 71509: {'lr': 0.0002738017899684317, 'samples': 13729728, 'steps': 71508, 'loss/train': 1.3888273239135742} 11/07/2021 07:16:29 - INFO - __main__ - Step 71510: {'lr': 0.0002737965073247089, 'samples': 13729920, 'steps': 71509, 'loss/train': 1.3789317607879639} 11/07/2021 07:16:29 - INFO - __main__ - Step 71511: {'lr': 0.00027379122467026374, 'samples': 13730112, 'steps': 71510, 'loss/train': 1.5919208526611328} 11/07/2021 07:16:30 - INFO - __main__ - Step 71512: {'lr': 0.0002737859420050986, 'samples': 13730304, 'steps': 71511, 'loss/train': 1.8096576929092407} 11/07/2021 07:16:30 - INFO - __main__ - Step 71513: {'lr': 0.0002737806593292159, 'samples': 13730496, 'steps': 71512, 'loss/train': 1.9668010473251343} 11/07/2021 07:16:31 - INFO - __main__ - Step 71514: {'lr': 0.000273775376642618, 'samples': 13730688, 'steps': 71513, 'loss/train': 1.7009204626083374} 11/07/2021 07:16:32 - INFO - __main__ - Step 71515: {'lr': 0.00027377009394530727, 'samples': 13730880, 'steps': 71514, 'loss/train': 1.588576316833496} 11/07/2021 07:16:32 - INFO - __main__ - Step 71516: {'lr': 0.00027376481123728613, 'samples': 13731072, 'steps': 71515, 'loss/train': 0.6975960731506348} 11/07/2021 07:16:32 - INFO - __main__ - Step 71517: {'lr': 0.0002737595285185569, 'samples': 13731264, 'steps': 71516, 'loss/train': 1.1381394863128662} 11/07/2021 07:16:33 - INFO - __main__ - Step 71518: {'lr': 0.000273754245789122, 'samples': 13731456, 'steps': 71517, 'loss/train': 1.3796803951263428} 11/07/2021 07:16:34 - INFO - __main__ - Step 71519: {'lr': 0.00027374896304898386, 'samples': 13731648, 'steps': 71518, 'loss/train': 1.3886884450912476} 11/07/2021 07:16:34 - INFO - __main__ - Step 71520: {'lr': 0.0002737436802981447, 'samples': 13731840, 'steps': 71519, 'loss/train': 1.508918046951294} 11/07/2021 07:16:34 - INFO - __main__ - Step 71521: {'lr': 0.0002737383975366071, 'samples': 13732032, 'steps': 71520, 'loss/train': 0.11500144749879837} 11/07/2021 07:16:35 - INFO - __main__ - Step 71522: {'lr': 0.0002737331147643733, 'samples': 13732224, 'steps': 71521, 'loss/train': 1.467092514038086} 11/07/2021 07:16:35 - INFO - __main__ - Step 71523: {'lr': 0.00027372783198144574, 'samples': 13732416, 'steps': 71522, 'loss/train': 1.2078487873077393} 11/07/2021 07:16:36 - INFO - __main__ - Step 71524: {'lr': 0.00027372254918782673, 'samples': 13732608, 'steps': 71523, 'loss/train': 1.4625049829483032} 11/07/2021 07:16:37 - INFO - __main__ - Step 71525: {'lr': 0.00027371726638351874, 'samples': 13732800, 'steps': 71524, 'loss/train': 1.6728264093399048} 11/07/2021 07:16:37 - INFO - __main__ - Step 71526: {'lr': 0.0002737119835685241, 'samples': 13732992, 'steps': 71525, 'loss/train': 1.1468278169631958} 11/07/2021 07:16:37 - INFO - __main__ - Step 71527: {'lr': 0.0002737067007428453, 'samples': 13733184, 'steps': 71526, 'loss/train': 1.4115197658538818} 11/07/2021 07:16:38 - INFO - __main__ - Step 71528: {'lr': 0.00027370141790648454, 'samples': 13733376, 'steps': 71527, 'loss/train': 1.547634243965149} 11/07/2021 07:16:38 - INFO - __main__ - Step 71529: {'lr': 0.0002736961350594443, 'samples': 13733568, 'steps': 71528, 'loss/train': 0.8233464956283569} 11/07/2021 07:16:39 - INFO - __main__ - Step 71530: {'lr': 0.000273690852201727, 'samples': 13733760, 'steps': 71529, 'loss/train': 1.223924160003662} 11/07/2021 07:16:39 - INFO - __main__ - Step 71531: {'lr': 0.00027368556933333484, 'samples': 13733952, 'steps': 71530, 'loss/train': 1.369876503944397} 11/07/2021 07:16:40 - INFO - __main__ - Step 71532: {'lr': 0.0002736802864542704, 'samples': 13734144, 'steps': 71531, 'loss/train': 1.5044137239456177} 11/07/2021 07:16:40 - INFO - __main__ - Step 71533: {'lr': 0.000273675003564536, 'samples': 13734336, 'steps': 71532, 'loss/train': 1.355332374572754} 11/07/2021 07:16:41 - INFO - __main__ - Step 71534: {'lr': 0.00027366972066413404, 'samples': 13734528, 'steps': 71533, 'loss/train': 1.5195411443710327} 11/07/2021 07:16:41 - INFO - __main__ - Step 71535: {'lr': 0.00027366443775306683, 'samples': 13734720, 'steps': 71534, 'loss/train': 0.9296714067459106} 11/07/2021 07:16:42 - INFO - __main__ - Step 71536: {'lr': 0.00027365915483133676, 'samples': 13734912, 'steps': 71535, 'loss/train': 1.158231496810913} 11/07/2021 07:16:42 - INFO - __main__ - Step 71537: {'lr': 0.0002736538718989463, 'samples': 13735104, 'steps': 71536, 'loss/train': 1.3698588609695435} 11/07/2021 07:16:43 - INFO - __main__ - Step 71538: {'lr': 0.0002736485889558977, 'samples': 13735296, 'steps': 71537, 'loss/train': 1.1950956583023071} 11/07/2021 07:16:43 - INFO - __main__ - Step 71539: {'lr': 0.00027364330600219343, 'samples': 13735488, 'steps': 71538, 'loss/train': 0.7687745690345764} 11/07/2021 07:16:44 - INFO - __main__ - Step 71540: {'lr': 0.00027363802303783584, 'samples': 13735680, 'steps': 71539, 'loss/train': 1.6337188482284546} 11/07/2021 07:16:45 - INFO - __main__ - Step 71541: {'lr': 0.0002736327400628275, 'samples': 13735872, 'steps': 71540, 'loss/train': 1.3558539152145386} 11/07/2021 07:16:45 - INFO - __main__ - Step 71542: {'lr': 0.0002736274570771704, 'samples': 13736064, 'steps': 71541, 'loss/train': 1.430601954460144} 11/07/2021 07:16:45 - INFO - __main__ - Step 71543: {'lr': 0.0002736221740808672, 'samples': 13736256, 'steps': 71542, 'loss/train': 1.3299572467803955} 11/07/2021 07:16:46 - INFO - __main__ - Step 71544: {'lr': 0.0002736168910739202, 'samples': 13736448, 'steps': 71543, 'loss/train': 0.1079661175608635} 11/07/2021 07:16:46 - INFO - __main__ - Step 71545: {'lr': 0.0002736116080563318, 'samples': 13736640, 'steps': 71544, 'loss/train': 0.5116785168647766} 11/07/2021 07:16:47 - INFO - __main__ - Step 71546: {'lr': 0.00027360632502810433, 'samples': 13736832, 'steps': 71545, 'loss/train': 1.38383150100708} 11/07/2021 07:16:48 - INFO - __main__ - Step 71547: {'lr': 0.0002736010419892403, 'samples': 13737024, 'steps': 71546, 'loss/train': 0.6743972897529602} 11/07/2021 07:16:48 - INFO - __main__ - Step 71548: {'lr': 0.00027359575893974196, 'samples': 13737216, 'steps': 71547, 'loss/train': 1.101887822151184} 11/07/2021 07:16:48 - INFO - __main__ - Step 71549: {'lr': 0.0002735904758796118, 'samples': 13737408, 'steps': 71548, 'loss/train': 1.3761991262435913} 11/07/2021 07:16:49 - INFO - __main__ - Step 71550: {'lr': 0.000273585192808852, 'samples': 13737600, 'steps': 71549, 'loss/train': 1.2604410648345947} 11/07/2021 07:16:50 - INFO - __main__ - Step 71551: {'lr': 0.00027357990972746516, 'samples': 13737792, 'steps': 71550, 'loss/train': 1.744698405265808} 11/07/2021 07:16:51 - INFO - __main__ - Step 71552: {'lr': 0.00027357462663545355, 'samples': 13737984, 'steps': 71551, 'loss/train': 1.6247519254684448} 11/07/2021 07:16:51 - INFO - __main__ - Step 71553: {'lr': 0.0002735693435328196, 'samples': 13738176, 'steps': 71552, 'loss/train': 1.0784869194030762} 11/07/2021 07:16:51 - INFO - __main__ - Step 71554: {'lr': 0.0002735640604195657, 'samples': 13738368, 'steps': 71553, 'loss/train': 1.4915307760238647} 11/07/2021 07:16:52 - INFO - __main__ - Step 71555: {'lr': 0.0002735587772956942, 'samples': 13738560, 'steps': 71554, 'loss/train': 1.1278578042984009} 11/07/2021 07:16:52 - INFO - __main__ - Step 71556: {'lr': 0.0002735534941612074, 'samples': 13738752, 'steps': 71555, 'loss/train': 1.5047868490219116} 11/07/2021 07:16:53 - INFO - __main__ - Step 71557: {'lr': 0.00027354821101610783, 'samples': 13738944, 'steps': 71556, 'loss/train': 1.3356633186340332} 11/07/2021 07:16:53 - INFO - __main__ - Step 71558: {'lr': 0.00027354292786039777, 'samples': 13739136, 'steps': 71557, 'loss/train': 1.3653676509857178} 11/07/2021 07:16:54 - INFO - __main__ - Step 71559: {'lr': 0.0002735376446940796, 'samples': 13739328, 'steps': 71558, 'loss/train': 1.6549502611160278} 11/07/2021 07:16:54 - INFO - __main__ - Step 71560: {'lr': 0.0002735323615171558, 'samples': 13739520, 'steps': 71559, 'loss/train': 1.8861279487609863} 11/07/2021 07:16:54 - INFO - __main__ - Step 71561: {'lr': 0.0002735270783296286, 'samples': 13739712, 'steps': 71560, 'loss/train': 1.0716863870620728} 11/07/2021 07:16:55 - INFO - __main__ - Step 71562: {'lr': 0.00027352179513150056, 'samples': 13739904, 'steps': 71561, 'loss/train': 1.2208638191223145} 11/07/2021 07:16:56 - INFO - __main__ - Step 71563: {'lr': 0.0002735165119227739, 'samples': 13740096, 'steps': 71562, 'loss/train': 1.5898053646087646} 11/07/2021 07:16:56 - INFO - __main__ - Step 71564: {'lr': 0.000273511228703451, 'samples': 13740288, 'steps': 71563, 'loss/train': 4.173259735107422} 11/07/2021 07:16:56 - INFO - __main__ - Step 71565: {'lr': 0.0002735059454735344, 'samples': 13740480, 'steps': 71564, 'loss/train': 1.4175688028335571} 11/07/2021 07:16:57 - INFO - __main__ - Step 71566: {'lr': 0.0002735006622330263, 'samples': 13740672, 'steps': 71565, 'loss/train': 1.6496000289916992} 11/07/2021 07:16:58 - INFO - __main__ - Step 71567: {'lr': 0.00027349537898192924, 'samples': 13740864, 'steps': 71566, 'loss/train': 1.7263489961624146} 11/07/2021 07:16:58 - INFO - __main__ - Step 71568: {'lr': 0.0002734900957202455, 'samples': 13741056, 'steps': 71567, 'loss/train': 1.6284430027008057} 11/07/2021 07:16:58 - INFO - __main__ - Step 71569: {'lr': 0.0002734848124479775, 'samples': 13741248, 'steps': 71568, 'loss/train': 1.2480355501174927} 11/07/2021 07:16:59 - INFO - __main__ - Step 71570: {'lr': 0.00027347952916512765, 'samples': 13741440, 'steps': 71569, 'loss/train': 1.4997702836990356} 11/07/2021 07:16:59 - INFO - __main__ - Step 71571: {'lr': 0.00027347424587169817, 'samples': 13741632, 'steps': 71570, 'loss/train': 1.6163331270217896} 11/07/2021 07:17:00 - INFO - __main__ - Step 71572: {'lr': 0.0002734689625676916, 'samples': 13741824, 'steps': 71571, 'loss/train': 1.5308486223220825} 11/07/2021 07:17:00 - INFO - __main__ - Step 71573: {'lr': 0.00027346367925311035, 'samples': 13742016, 'steps': 71572, 'loss/train': 1.5095274448394775} 11/07/2021 07:17:01 - INFO - __main__ - Step 71574: {'lr': 0.0002734583959279566, 'samples': 13742208, 'steps': 71573, 'loss/train': 1.6620075702667236} 11/07/2021 07:17:01 - INFO - __main__ - Step 71575: {'lr': 0.000273453112592233, 'samples': 13742400, 'steps': 71574, 'loss/train': 1.2935177087783813} 11/07/2021 07:17:01 - INFO - __main__ - Step 71576: {'lr': 0.00027344782924594173, 'samples': 13742592, 'steps': 71575, 'loss/train': 1.0928163528442383} 11/07/2021 07:17:03 - INFO - __main__ - Step 71577: {'lr': 0.0002734425458890852, 'samples': 13742784, 'steps': 71576, 'loss/train': 1.1410901546478271} 11/07/2021 07:17:03 - INFO - __main__ - Step 71578: {'lr': 0.00027343726252166583, 'samples': 13742976, 'steps': 71577, 'loss/train': 1.2070436477661133} 11/07/2021 07:17:03 - INFO - __main__ - Step 71579: {'lr': 0.00027343197914368603, 'samples': 13743168, 'steps': 71578, 'loss/train': 1.4344658851623535} 11/07/2021 07:17:04 - INFO - __main__ - Step 71580: {'lr': 0.0002734266957551481, 'samples': 13743360, 'steps': 71579, 'loss/train': 1.3258605003356934} 11/07/2021 07:17:04 - INFO - __main__ - Step 71581: {'lr': 0.00027342141235605445, 'samples': 13743552, 'steps': 71580, 'loss/train': 1.3607800006866455} 11/07/2021 07:17:05 - INFO - __main__ - Step 71582: {'lr': 0.00027341612894640755, 'samples': 13743744, 'steps': 71581, 'loss/train': 1.8081328868865967} 11/07/2021 07:17:05 - INFO - __main__ - Step 71583: {'lr': 0.0002734108455262097, 'samples': 13743936, 'steps': 71582, 'loss/train': 1.230209469795227} 11/07/2021 07:17:06 - INFO - __main__ - Step 71584: {'lr': 0.00027340556209546317, 'samples': 13744128, 'steps': 71583, 'loss/train': 1.6682757139205933} 11/07/2021 07:17:06 - INFO - __main__ - Step 71585: {'lr': 0.00027340027865417057, 'samples': 13744320, 'steps': 71584, 'loss/train': 1.7429181337356567} 11/07/2021 07:17:06 - INFO - __main__ - Step 71586: {'lr': 0.00027339499520233405, 'samples': 13744512, 'steps': 71585, 'loss/train': 1.4475754499435425} 11/07/2021 07:17:07 - INFO - __main__ - Step 71587: {'lr': 0.0002733897117399562, 'samples': 13744704, 'steps': 71586, 'loss/train': 1.216809868812561} 11/07/2021 07:17:08 - INFO - __main__ - Step 71588: {'lr': 0.0002733844282670393, 'samples': 13744896, 'steps': 71587, 'loss/train': 1.6069141626358032} 11/07/2021 07:17:08 - INFO - __main__ - Step 71589: {'lr': 0.0002733791447835857, 'samples': 13745088, 'steps': 71588, 'loss/train': 1.8305784463882446} 11/07/2021 07:17:08 - INFO - __main__ - Step 71590: {'lr': 0.00027337386128959784, 'samples': 13745280, 'steps': 71589, 'loss/train': 1.2073999643325806} 11/07/2021 07:17:09 - INFO - __main__ - Step 71591: {'lr': 0.00027336857778507804, 'samples': 13745472, 'steps': 71590, 'loss/train': 1.3621364831924438} 11/07/2021 07:17:09 - INFO - __main__ - Step 71592: {'lr': 0.0002733632942700288, 'samples': 13745664, 'steps': 71591, 'loss/train': 1.858102798461914} 11/07/2021 07:17:10 - INFO - __main__ - Step 71593: {'lr': 0.0002733580107444524, 'samples': 13745856, 'steps': 71592, 'loss/train': 1.2631258964538574} 11/07/2021 07:17:11 - INFO - __main__ - Step 71594: {'lr': 0.0002733527272083512, 'samples': 13746048, 'steps': 71593, 'loss/train': 1.728498101234436} 11/07/2021 07:17:11 - INFO - __main__ - Step 71595: {'lr': 0.00027334744366172765, 'samples': 13746240, 'steps': 71594, 'loss/train': 1.4894189834594727} 11/07/2021 07:17:11 - INFO - __main__ - Step 71596: {'lr': 0.0002733421601045841, 'samples': 13746432, 'steps': 71595, 'loss/train': 1.483847737312317} 11/07/2021 07:17:12 - INFO - __main__ - Step 71597: {'lr': 0.0002733368765369229, 'samples': 13746624, 'steps': 71596, 'loss/train': 1.1782060861587524} 11/07/2021 07:17:13 - INFO - __main__ - Step 71598: {'lr': 0.0002733315929587465, 'samples': 13746816, 'steps': 71597, 'loss/train': 1.2424724102020264} 11/07/2021 07:17:13 - INFO - __main__ - Step 71599: {'lr': 0.0002733263093700572, 'samples': 13747008, 'steps': 71598, 'loss/train': 1.3932322263717651} 11/07/2021 07:17:13 - INFO - __main__ - Step 71600: {'lr': 0.00027332102577085743, 'samples': 13747200, 'steps': 71599, 'loss/train': 1.3928223848342896} 11/07/2021 07:17:14 - INFO - __main__ - Step 71601: {'lr': 0.00027331574216114964, 'samples': 13747392, 'steps': 71600, 'loss/train': 1.5340818166732788} 11/07/2021 07:17:14 - INFO - __main__ - Step 71602: {'lr': 0.0002733104585409361, 'samples': 13747584, 'steps': 71601, 'loss/train': 1.1866912841796875} 11/07/2021 07:17:15 - INFO - __main__ - Step 71603: {'lr': 0.00027330517491021923, 'samples': 13747776, 'steps': 71602, 'loss/train': 1.1277612447738647} 11/07/2021 07:17:15 - INFO - __main__ - Step 71604: {'lr': 0.0002732998912690013, 'samples': 13747968, 'steps': 71603, 'loss/train': 1.5194780826568604} 11/07/2021 07:17:16 - INFO - __main__ - Step 71605: {'lr': 0.0002732946076172849, 'samples': 13748160, 'steps': 71604, 'loss/train': 1.143024206161499} 11/07/2021 07:17:16 - INFO - __main__ - Step 71606: {'lr': 0.0002732893239550723, 'samples': 13748352, 'steps': 71605, 'loss/train': 1.068408727645874} 11/07/2021 07:17:16 - INFO - __main__ - Step 71607: {'lr': 0.0002732840402823659, 'samples': 13748544, 'steps': 71606, 'loss/train': 1.5188608169555664} 11/07/2021 07:17:17 - INFO - __main__ - Step 71608: {'lr': 0.00027327875659916815, 'samples': 13748736, 'steps': 71607, 'loss/train': 1.2339462041854858} 11/07/2021 07:17:18 - INFO - __main__ - Step 71609: {'lr': 0.0002732734729054812, 'samples': 13748928, 'steps': 71608, 'loss/train': 1.3798255920410156} 11/07/2021 07:17:18 - INFO - __main__ - Step 71610: {'lr': 0.0002732681892013077, 'samples': 13749120, 'steps': 71609, 'loss/train': 1.9228688478469849} 11/07/2021 07:17:19 - INFO - __main__ - Step 71611: {'lr': 0.0002732629054866498, 'samples': 13749312, 'steps': 71610, 'loss/train': 1.5784392356872559} 11/07/2021 07:17:19 - INFO - __main__ - Step 71612: {'lr': 0.00027325762176151, 'samples': 13749504, 'steps': 71611, 'loss/train': 1.4817358255386353} 11/07/2021 07:17:20 - INFO - __main__ - Step 71613: {'lr': 0.0002732523380258908, 'samples': 13749696, 'steps': 71612, 'loss/train': 0.983402669429779} 11/07/2021 07:17:20 - INFO - __main__ - Step 71614: {'lr': 0.00027324705427979437, 'samples': 13749888, 'steps': 71613, 'loss/train': 1.539167881011963} 11/07/2021 07:17:21 - INFO - __main__ - Step 71615: {'lr': 0.00027324177052322326, 'samples': 13750080, 'steps': 71614, 'loss/train': 1.0996453762054443} 11/07/2021 07:17:21 - INFO - __main__ - Step 71616: {'lr': 0.00027323648675617963, 'samples': 13750272, 'steps': 71615, 'loss/train': 1.5126112699508667} 11/07/2021 07:17:21 - INFO - __main__ - Step 71617: {'lr': 0.0002732312029786661, 'samples': 13750464, 'steps': 71616, 'loss/train': 1.3821830749511719} 11/07/2021 07:17:22 - INFO - __main__ - Step 71618: {'lr': 0.00027322591919068487, 'samples': 13750656, 'steps': 71617, 'loss/train': 1.8640248775482178} 11/07/2021 07:17:23 - INFO - __main__ - Step 71619: {'lr': 0.00027322063539223846, 'samples': 13750848, 'steps': 71618, 'loss/train': 1.4794384241104126} 11/07/2021 07:17:23 - INFO - __main__ - Step 71620: {'lr': 0.0002732153515833291, 'samples': 13751040, 'steps': 71619, 'loss/train': 1.326881766319275} 11/07/2021 07:17:23 - INFO - __main__ - Step 71621: {'lr': 0.00027321006776395934, 'samples': 13751232, 'steps': 71620, 'loss/train': 1.457057237625122} 11/07/2021 07:17:24 - INFO - __main__ - Step 71622: {'lr': 0.00027320478393413157, 'samples': 13751424, 'steps': 71621, 'loss/train': 0.9363170862197876} 11/07/2021 07:17:24 - INFO - __main__ - Step 71623: {'lr': 0.0002731995000938479, 'samples': 13751616, 'steps': 71622, 'loss/train': 1.4642523527145386} 11/07/2021 07:17:25 - INFO - __main__ - Step 71624: {'lr': 0.000273194216243111, 'samples': 13751808, 'steps': 71623, 'loss/train': 1.2044565677642822} 11/07/2021 07:17:26 - INFO - __main__ - Step 71625: {'lr': 0.0002731889323819231, 'samples': 13752000, 'steps': 71624, 'loss/train': 1.5911461114883423} 11/07/2021 07:17:26 - INFO - __main__ - Step 71626: {'lr': 0.0002731836485102866, 'samples': 13752192, 'steps': 71625, 'loss/train': 1.5660420656204224} 11/07/2021 07:17:26 - INFO - __main__ - Step 71627: {'lr': 0.000273178364628204, 'samples': 13752384, 'steps': 71626, 'loss/train': 0.12580645084381104} 11/07/2021 07:17:27 - INFO - __main__ - Step 71628: {'lr': 0.0002731730807356775, 'samples': 13752576, 'steps': 71627, 'loss/train': 1.393892765045166} 11/07/2021 07:17:28 - INFO - __main__ - Step 71629: {'lr': 0.00027316779683270973, 'samples': 13752768, 'steps': 71628, 'loss/train': 1.2622802257537842} 11/07/2021 07:17:28 - INFO - __main__ - Step 71630: {'lr': 0.0002731625129193027, 'samples': 13752960, 'steps': 71629, 'loss/train': 1.3243556022644043} 11/07/2021 07:17:28 - INFO - __main__ - Step 71631: {'lr': 0.00027315722899545915, 'samples': 13753152, 'steps': 71630, 'loss/train': 1.4147522449493408} 11/07/2021 07:17:29 - INFO - __main__ - Step 71632: {'lr': 0.0002731519450611812, 'samples': 13753344, 'steps': 71631, 'loss/train': 1.6158090829849243} 11/07/2021 07:17:29 - INFO - __main__ - Step 71633: {'lr': 0.0002731466611164714, 'samples': 13753536, 'steps': 71632, 'loss/train': 1.2019524574279785} 11/07/2021 07:17:30 - INFO - __main__ - Step 71634: {'lr': 0.0002731413771613321, 'samples': 13753728, 'steps': 71633, 'loss/train': 0.9804544448852539} 11/07/2021 07:17:30 - INFO - __main__ - Step 71635: {'lr': 0.0002731360931957656, 'samples': 13753920, 'steps': 71634, 'loss/train': 1.2195560932159424} 11/07/2021 07:17:31 - INFO - __main__ - Step 71636: {'lr': 0.00027313080921977437, 'samples': 13754112, 'steps': 71635, 'loss/train': 1.356359601020813} 11/07/2021 07:17:31 - INFO - __main__ - Step 71637: {'lr': 0.00027312552523336064, 'samples': 13754304, 'steps': 71636, 'loss/train': 1.247208595275879} 11/07/2021 07:17:31 - INFO - __main__ - Step 71638: {'lr': 0.000273120241236527, 'samples': 13754496, 'steps': 71637, 'loss/train': 1.3565181493759155} 11/07/2021 07:17:32 - INFO - __main__ - Step 71639: {'lr': 0.0002731149572292757, 'samples': 13754688, 'steps': 71638, 'loss/train': 0.9849694967269897} 11/07/2021 07:17:33 - INFO - __main__ - Step 71640: {'lr': 0.0002731096732116093, 'samples': 13754880, 'steps': 71639, 'loss/train': 1.622756004333496} 11/07/2021 07:17:33 - INFO - __main__ - Step 71641: {'lr': 0.0002731043891835299, 'samples': 13755072, 'steps': 71640, 'loss/train': 1.4108327627182007} 11/07/2021 07:17:33 - INFO - __main__ - Step 71642: {'lr': 0.00027309910514504, 'samples': 13755264, 'steps': 71641, 'loss/train': 1.4013400077819824} 11/07/2021 07:17:34 - INFO - __main__ - Step 71643: {'lr': 0.00027309382109614206, 'samples': 13755456, 'steps': 71642, 'loss/train': 1.3463839292526245} 11/07/2021 07:17:34 - INFO - __main__ - Step 71644: {'lr': 0.00027308853703683834, 'samples': 13755648, 'steps': 71643, 'loss/train': 1.2599596977233887} 11/07/2021 07:17:35 - INFO - __main__ - Step 71645: {'lr': 0.0002730832529671314, 'samples': 13755840, 'steps': 71644, 'loss/train': 1.5224957466125488} 11/07/2021 07:17:36 - INFO - __main__ - Step 71646: {'lr': 0.0002730779688870234, 'samples': 13756032, 'steps': 71645, 'loss/train': 1.6508429050445557} 11/07/2021 07:17:36 - INFO - __main__ - Step 71647: {'lr': 0.00027307268479651687, 'samples': 13756224, 'steps': 71646, 'loss/train': 1.6052387952804565} 11/07/2021 07:17:36 - INFO - __main__ - Step 71648: {'lr': 0.0002730674006956141, 'samples': 13756416, 'steps': 71647, 'loss/train': 1.4934430122375488} 11/07/2021 07:17:37 - INFO - __main__ - Step 71649: {'lr': 0.0002730621165843175, 'samples': 13756608, 'steps': 71648, 'loss/train': 1.4263286590576172} 11/07/2021 07:17:38 - INFO - __main__ - Step 71650: {'lr': 0.0002730568324626295, 'samples': 13756800, 'steps': 71649, 'loss/train': 1.3222668170928955} 11/07/2021 07:17:38 - INFO - __main__ - Step 71651: {'lr': 0.0002730515483305525, 'samples': 13756992, 'steps': 71650, 'loss/train': 1.6447111368179321} 11/07/2021 07:17:38 - INFO - __main__ - Step 71652: {'lr': 0.0002730462641880888, 'samples': 13757184, 'steps': 71651, 'loss/train': 0.948126494884491} 11/07/2021 07:17:39 - INFO - __main__ - Step 71653: {'lr': 0.00027304098003524073, 'samples': 13757376, 'steps': 71652, 'loss/train': 1.3134676218032837} 11/07/2021 07:17:39 - INFO - __main__ - Step 71654: {'lr': 0.0002730356958720108, 'samples': 13757568, 'steps': 71653, 'loss/train': 1.5176947116851807} 11/07/2021 07:17:40 - INFO - __main__ - Step 71655: {'lr': 0.0002730304116984014, 'samples': 13757760, 'steps': 71654, 'loss/train': 1.2331862449645996} 11/07/2021 07:17:40 - INFO - __main__ - Step 71656: {'lr': 0.0002730251275144148, 'samples': 13757952, 'steps': 71655, 'loss/train': 1.29258131980896} 11/07/2021 07:17:41 - INFO - __main__ - Step 71657: {'lr': 0.0002730198433200535, 'samples': 13758144, 'steps': 71656, 'loss/train': 1.4952707290649414} 11/07/2021 07:17:41 - INFO - __main__ - Step 71658: {'lr': 0.0002730145591153197, 'samples': 13758336, 'steps': 71657, 'loss/train': 1.2877060174942017} 11/07/2021 07:17:41 - INFO - __main__ - Step 71659: {'lr': 0.00027300927490021593, 'samples': 13758528, 'steps': 71658, 'loss/train': 1.3089258670806885} 11/07/2021 07:17:43 - INFO - __main__ - Step 71660: {'lr': 0.0002730039906747445, 'samples': 13758720, 'steps': 71659, 'loss/train': 1.1100952625274658} 11/07/2021 07:17:43 - INFO - __main__ - Step 71661: {'lr': 0.0002729987064389079, 'samples': 13758912, 'steps': 71660, 'loss/train': 1.4986259937286377} 11/07/2021 07:17:43 - INFO - __main__ - Step 71662: {'lr': 0.00027299342219270844, 'samples': 13759104, 'steps': 71661, 'loss/train': 1.3726941347122192} 11/07/2021 07:17:44 - INFO - __main__ - Step 71663: {'lr': 0.0002729881379361485, 'samples': 13759296, 'steps': 71662, 'loss/train': 1.5203903913497925} 11/07/2021 07:17:44 - INFO - __main__ - Step 71664: {'lr': 0.0002729828536692304, 'samples': 13759488, 'steps': 71663, 'loss/train': 1.5080887079238892} 11/07/2021 07:17:46 - INFO - __main__ - Step 71665: {'lr': 0.00027297756939195664, 'samples': 13759680, 'steps': 71664, 'loss/train': 1.3948955535888672} 11/07/2021 07:17:46 - INFO - __main__ - Step 71666: {'lr': 0.0002729722851043295, 'samples': 13759872, 'steps': 71665, 'loss/train': 1.4946236610412598} 11/07/2021 07:17:46 - INFO - __main__ - Step 71667: {'lr': 0.0002729670008063514, 'samples': 13760064, 'steps': 71666, 'loss/train': 1.6454639434814453} 11/07/2021 07:17:47 - INFO - __main__ - Step 71668: {'lr': 0.0002729617164980247, 'samples': 13760256, 'steps': 71667, 'loss/train': 2.1809325218200684} 11/07/2021 07:17:47 - INFO - __main__ - Step 71669: {'lr': 0.0002729564321793519, 'samples': 13760448, 'steps': 71668, 'loss/train': 1.4294313192367554} 11/07/2021 07:17:47 - INFO - __main__ - Step 71670: {'lr': 0.0002729511478503353, 'samples': 13760640, 'steps': 71669, 'loss/train': 1.0669949054718018} 11/07/2021 07:17:48 - INFO - __main__ - Step 71671: {'lr': 0.0002729458635109771, 'samples': 13760832, 'steps': 71670, 'loss/train': 1.5533499717712402} 11/07/2021 07:17:49 - INFO - __main__ - Step 71672: {'lr': 0.00027294057916127997, 'samples': 13761024, 'steps': 71671, 'loss/train': 1.619570255279541} 11/07/2021 07:17:49 - INFO - __main__ - Step 71673: {'lr': 0.0002729352948012461, 'samples': 13761216, 'steps': 71672, 'loss/train': 1.0027294158935547} 11/07/2021 07:17:49 - INFO - __main__ - Step 71674: {'lr': 0.000272930010430878, 'samples': 13761408, 'steps': 71673, 'loss/train': 1.1875890493392944} 11/07/2021 07:17:50 - INFO - __main__ - Step 71675: {'lr': 0.000272924726050178, 'samples': 13761600, 'steps': 71674, 'loss/train': 1.7553044557571411} 11/07/2021 07:17:51 - INFO - __main__ - Step 71676: {'lr': 0.0002729194416591485, 'samples': 13761792, 'steps': 71675, 'loss/train': 1.5734422206878662} 11/07/2021 07:17:51 - INFO - __main__ - Step 71677: {'lr': 0.00027291415725779177, 'samples': 13761984, 'steps': 71676, 'loss/train': 1.4327696561813354} 11/07/2021 07:17:52 - INFO - __main__ - Step 71678: {'lr': 0.0002729088728461103, 'samples': 13762176, 'steps': 71677, 'loss/train': 1.2686456441879272} 11/07/2021 07:17:52 - INFO - __main__ - Step 71679: {'lr': 0.00027290358842410644, 'samples': 13762368, 'steps': 71678, 'loss/train': 1.1744210720062256} 11/07/2021 07:17:52 - INFO - __main__ - Step 71680: {'lr': 0.00027289830399178264, 'samples': 13762560, 'steps': 71679, 'loss/train': 0.9518386125564575} 11/07/2021 07:17:53 - INFO - __main__ - Step 71681: {'lr': 0.0002728930195491411, 'samples': 13762752, 'steps': 71680, 'loss/train': 1.4578964710235596} 11/07/2021 07:17:54 - INFO - __main__ - Step 71682: {'lr': 0.0002728877350961844, 'samples': 13762944, 'steps': 71681, 'loss/train': 1.480638861656189} 11/07/2021 07:17:54 - INFO - __main__ - Step 71683: {'lr': 0.00027288245063291483, 'samples': 13763136, 'steps': 71682, 'loss/train': 1.0409940481185913} 11/07/2021 07:17:54 - INFO - __main__ - Step 71684: {'lr': 0.00027287716615933476, 'samples': 13763328, 'steps': 71683, 'loss/train': 1.135532259941101} 11/07/2021 07:17:55 - INFO - __main__ - Step 71685: {'lr': 0.0002728718816754466, 'samples': 13763520, 'steps': 71684, 'loss/train': 1.889318585395813} 11/07/2021 07:17:55 - INFO - __main__ - Step 71686: {'lr': 0.0002728665971812527, 'samples': 13763712, 'steps': 71685, 'loss/train': 1.4542580842971802} 11/07/2021 07:17:56 - INFO - __main__ - Step 71687: {'lr': 0.0002728613126767556, 'samples': 13763904, 'steps': 71686, 'loss/train': 1.1703417301177979} 11/07/2021 07:17:56 - INFO - __main__ - Step 71688: {'lr': 0.0002728560281619574, 'samples': 13764096, 'steps': 71687, 'loss/train': 0.5202018022537231} 11/07/2021 07:17:57 - INFO - __main__ - Step 71689: {'lr': 0.0002728507436368607, 'samples': 13764288, 'steps': 71688, 'loss/train': 1.845900297164917} 11/07/2021 07:17:57 - INFO - __main__ - Step 71690: {'lr': 0.00027284545910146775, 'samples': 13764480, 'steps': 71689, 'loss/train': 1.1852245330810547} 11/07/2021 07:17:58 - INFO - __main__ - Step 71691: {'lr': 0.000272840174555781, 'samples': 13764672, 'steps': 71690, 'loss/train': 1.7784628868103027} 11/07/2021 07:17:59 - INFO - __main__ - Step 71692: {'lr': 0.0002728348899998028, 'samples': 13764864, 'steps': 71691, 'loss/train': 1.3134840726852417} 11/07/2021 07:17:59 - INFO - __main__ - Step 71693: {'lr': 0.00027282960543353565, 'samples': 13765056, 'steps': 71692, 'loss/train': 1.7025331258773804} 11/07/2021 07:17:59 - INFO - __main__ - Step 71694: {'lr': 0.00027282432085698173, 'samples': 13765248, 'steps': 71693, 'loss/train': 1.5857707262039185} 11/07/2021 07:18:00 - INFO - __main__ - Step 71695: {'lr': 0.00027281903627014356, 'samples': 13765440, 'steps': 71694, 'loss/train': 1.4552063941955566} 11/07/2021 07:18:00 - INFO - __main__ - Step 71696: {'lr': 0.0002728137516730235, 'samples': 13765632, 'steps': 71695, 'loss/train': 1.4022231101989746} 11/07/2021 07:18:00 - INFO - __main__ - Step 71697: {'lr': 0.0002728084670656239, 'samples': 13765824, 'steps': 71696, 'loss/train': 1.4808449745178223} 11/07/2021 07:18:01 - INFO - __main__ - Step 71698: {'lr': 0.00027280318244794717, 'samples': 13766016, 'steps': 71697, 'loss/train': 1.945274829864502} 11/07/2021 07:18:02 - INFO - __main__ - Step 71699: {'lr': 0.0002727978978199956, 'samples': 13766208, 'steps': 71698, 'loss/train': 1.3672617673873901} 11/07/2021 07:18:02 - INFO - __main__ - Step 71700: {'lr': 0.00027279261318177174, 'samples': 13766400, 'steps': 71699, 'loss/train': 1.4318996667861938} 11/07/2021 07:18:02 - INFO - __main__ - Step 71701: {'lr': 0.0002727873285332778, 'samples': 13766592, 'steps': 71700, 'loss/train': 1.1312397718429565} 11/07/2021 07:18:03 - INFO - __main__ - Step 71702: {'lr': 0.00027278204387451633, 'samples': 13766784, 'steps': 71701, 'loss/train': 1.4049146175384521} 11/07/2021 07:18:04 - INFO - __main__ - Step 71703: {'lr': 0.0002727767592054896, 'samples': 13766976, 'steps': 71702, 'loss/train': 1.490297555923462} 11/07/2021 07:18:04 - INFO - __main__ - Step 71704: {'lr': 0.0002727714745262, 'samples': 13767168, 'steps': 71703, 'loss/train': 1.4116178750991821} 11/07/2021 07:18:05 - INFO - __main__ - Step 71705: {'lr': 0.0002727661898366499, 'samples': 13767360, 'steps': 71704, 'loss/train': 1.0525305271148682} 11/07/2021 07:18:05 - INFO - __main__ - Step 71706: {'lr': 0.00027276090513684176, 'samples': 13767552, 'steps': 71705, 'loss/train': 1.6191496849060059} 11/07/2021 07:18:05 - INFO - __main__ - Step 71707: {'lr': 0.0002727556204267779, 'samples': 13767744, 'steps': 71706, 'loss/train': 1.2976042032241821} 11/07/2021 07:18:06 - INFO - __main__ - Step 71708: {'lr': 0.0002727503357064606, 'samples': 13767936, 'steps': 71707, 'loss/train': 1.291426420211792} 11/07/2021 07:18:07 - INFO - __main__ - Step 71709: {'lr': 0.0002727450509758925, 'samples': 13768128, 'steps': 71708, 'loss/train': 0.8078199625015259} 11/07/2021 07:18:07 - INFO - __main__ - Step 71710: {'lr': 0.0002727397662350758, 'samples': 13768320, 'steps': 71709, 'loss/train': 1.5662643909454346} 11/07/2021 07:18:07 - INFO - __main__ - Step 71711: {'lr': 0.0002727344814840129, 'samples': 13768512, 'steps': 71710, 'loss/train': 1.1877719163894653} 11/07/2021 07:18:08 - INFO - __main__ - Step 71712: {'lr': 0.00027272919672270614, 'samples': 13768704, 'steps': 71711, 'loss/train': 1.711775779724121} 11/07/2021 07:18:09 - INFO - __main__ - Step 71713: {'lr': 0.000272723911951158, 'samples': 13768896, 'steps': 71712, 'loss/train': 1.1634262800216675} 11/07/2021 07:18:09 - INFO - __main__ - Step 71714: {'lr': 0.0002727186271693707, 'samples': 13769088, 'steps': 71713, 'loss/train': 1.181374430656433} 11/07/2021 07:18:09 - INFO - __main__ - Step 71715: {'lr': 0.0002727133423773469, 'samples': 13769280, 'steps': 71714, 'loss/train': 1.5312732458114624} 11/07/2021 07:18:10 - INFO - __main__ - Step 71716: {'lr': 0.0002727080575750888, 'samples': 13769472, 'steps': 71715, 'loss/train': 1.177751064300537} 11/07/2021 07:18:10 - INFO - __main__ - Step 71717: {'lr': 0.00027270277276259875, 'samples': 13769664, 'steps': 71716, 'loss/train': 2.8445985317230225} 11/07/2021 07:18:10 - INFO - __main__ - Step 71718: {'lr': 0.00027269748793987917, 'samples': 13769856, 'steps': 71717, 'loss/train': 2.2430453300476074} 11/07/2021 07:18:11 - INFO - __main__ - Step 71719: {'lr': 0.0002726922031069325, 'samples': 13770048, 'steps': 71718, 'loss/train': 1.6906251907348633} 11/07/2021 07:18:12 - INFO - __main__ - Step 71720: {'lr': 0.000272686918263761, 'samples': 13770240, 'steps': 71719, 'loss/train': 1.5659273862838745} 11/07/2021 07:18:12 - INFO - __main__ - Step 71721: {'lr': 0.00027268163341036717, 'samples': 13770432, 'steps': 71720, 'loss/train': 1.3988863229751587} 11/07/2021 07:18:12 - INFO - __main__ - Step 71722: {'lr': 0.0002726763485467533, 'samples': 13770624, 'steps': 71721, 'loss/train': 1.0068551301956177} 11/07/2021 07:18:13 - INFO - __main__ - Step 71723: {'lr': 0.00027267106367292196, 'samples': 13770816, 'steps': 71722, 'loss/train': 1.5173776149749756} 11/07/2021 07:18:14 - INFO - __main__ - Step 71724: {'lr': 0.0002726657787888753, 'samples': 13771008, 'steps': 71723, 'loss/train': 1.7516247034072876} 11/07/2021 07:18:14 - INFO - __main__ - Step 71725: {'lr': 0.0002726604938946158, 'samples': 13771200, 'steps': 71724, 'loss/train': 1.739363193511963} 11/07/2021 07:18:15 - INFO - __main__ - Step 71726: {'lr': 0.00027265520899014573, 'samples': 13771392, 'steps': 71725, 'loss/train': 1.3269766569137573} 11/07/2021 07:18:15 - INFO - __main__ - Step 71727: {'lr': 0.0002726499240754677, 'samples': 13771584, 'steps': 71726, 'loss/train': 1.5457168817520142} 11/07/2021 07:18:15 - INFO - __main__ - Step 71728: {'lr': 0.0002726446391505839, 'samples': 13771776, 'steps': 71727, 'loss/train': 1.3874201774597168} 11/07/2021 07:18:16 - INFO - __main__ - Step 71729: {'lr': 0.00027263935421549686, 'samples': 13771968, 'steps': 71728, 'loss/train': 1.4463399648666382} 11/07/2021 07:18:17 - INFO - __main__ - Step 71730: {'lr': 0.00027263406927020877, 'samples': 13772160, 'steps': 71729, 'loss/train': 1.414058804512024} 11/07/2021 07:18:17 - INFO - __main__ - Step 71731: {'lr': 0.00027262878431472213, 'samples': 13772352, 'steps': 71730, 'loss/train': 1.8876146078109741} 11/07/2021 07:18:17 - INFO - __main__ - Step 71732: {'lr': 0.0002726234993490393, 'samples': 13772544, 'steps': 71731, 'loss/train': 1.2631381750106812} 11/07/2021 07:18:18 - INFO - __main__ - Step 71733: {'lr': 0.00027261821437316275, 'samples': 13772736, 'steps': 71732, 'loss/train': 1.468881368637085} 11/07/2021 07:18:19 - INFO - __main__ - Step 71734: {'lr': 0.0002726129293870948, 'samples': 13772928, 'steps': 71733, 'loss/train': 2.077279806137085} 11/07/2021 07:18:19 - INFO - __main__ - Step 71735: {'lr': 0.0002726076443908377, 'samples': 13773120, 'steps': 71734, 'loss/train': 1.3273570537567139} 11/07/2021 07:18:19 - INFO - __main__ - Step 71736: {'lr': 0.00027260235938439403, 'samples': 13773312, 'steps': 71735, 'loss/train': 2.5034024715423584} 11/07/2021 07:18:20 - INFO - __main__ - Step 71737: {'lr': 0.00027259707436776603, 'samples': 13773504, 'steps': 71736, 'loss/train': 1.2359970808029175} 11/07/2021 07:18:20 - INFO - __main__ - Step 71738: {'lr': 0.00027259178934095613, 'samples': 13773696, 'steps': 71737, 'loss/train': 1.0055575370788574} 11/07/2021 07:18:21 - INFO - __main__ - Step 71739: {'lr': 0.00027258650430396676, 'samples': 13773888, 'steps': 71738, 'loss/train': 0.8898785710334778} 11/07/2021 07:18:22 - INFO - __main__ - Step 71740: {'lr': 0.00027258121925680025, 'samples': 13774080, 'steps': 71739, 'loss/train': 1.8488940000534058} 11/07/2021 07:18:22 - INFO - __main__ - Step 71741: {'lr': 0.000272575934199459, 'samples': 13774272, 'steps': 71740, 'loss/train': 1.1478852033615112} 11/07/2021 07:18:22 - INFO - __main__ - Step 71742: {'lr': 0.0002725706491319453, 'samples': 13774464, 'steps': 71741, 'loss/train': 1.2810786962509155} 11/07/2021 07:18:23 - INFO - __main__ - Step 71743: {'lr': 0.00027256536405426173, 'samples': 13774656, 'steps': 71742, 'loss/train': 1.883124828338623} 11/07/2021 07:18:23 - INFO - __main__ - Step 71744: {'lr': 0.00027256007896641054, 'samples': 13774848, 'steps': 71743, 'loss/train': 1.4364044666290283} 11/07/2021 07:18:24 - INFO - __main__ - Step 71745: {'lr': 0.0002725547938683941, 'samples': 13775040, 'steps': 71744, 'loss/train': 0.9552123546600342} 11/07/2021 07:18:24 - INFO - __main__ - Step 71746: {'lr': 0.0002725495087602148, 'samples': 13775232, 'steps': 71745, 'loss/train': 0.9292229413986206} 11/07/2021 07:18:25 - INFO - __main__ - Step 71747: {'lr': 0.00027254422364187504, 'samples': 13775424, 'steps': 71746, 'loss/train': 0.8764196038246155} 11/07/2021 07:18:25 - INFO - __main__ - Step 71748: {'lr': 0.0002725389385133772, 'samples': 13775616, 'steps': 71747, 'loss/train': 1.2086821794509888} 11/07/2021 07:18:25 - INFO - __main__ - Step 71749: {'lr': 0.00027253365337472367, 'samples': 13775808, 'steps': 71748, 'loss/train': 1.807279109954834} 11/07/2021 07:18:27 - INFO - __main__ - Step 71750: {'lr': 0.00027252836822591684, 'samples': 13776000, 'steps': 71749, 'loss/train': 1.7606780529022217} 11/07/2021 07:18:27 - INFO - __main__ - Step 71751: {'lr': 0.0002725230830669591, 'samples': 13776192, 'steps': 71750, 'loss/train': 0.6988004446029663} 11/07/2021 07:18:27 - INFO - __main__ - Step 71752: {'lr': 0.0002725177978978527, 'samples': 13776384, 'steps': 71751, 'loss/train': 1.6320033073425293} 11/07/2021 07:18:28 - INFO - __main__ - Step 71753: {'lr': 0.00027251251271860025, 'samples': 13776576, 'steps': 71752, 'loss/train': 1.7664155960083008} 11/07/2021 07:18:28 - INFO - __main__ - Step 71754: {'lr': 0.00027250722752920393, 'samples': 13776768, 'steps': 71753, 'loss/train': 2.0163354873657227} 11/07/2021 07:18:28 - INFO - __main__ - Step 71755: {'lr': 0.00027250194232966626, 'samples': 13776960, 'steps': 71754, 'loss/train': 1.2913800477981567} 11/07/2021 07:18:29 - INFO - __main__ - Step 71756: {'lr': 0.00027249665711998955, 'samples': 13777152, 'steps': 71755, 'loss/train': 1.3880993127822876} 11/07/2021 07:18:30 - INFO - __main__ - Step 71757: {'lr': 0.00027249137190017616, 'samples': 13777344, 'steps': 71756, 'loss/train': 1.8095706701278687} 11/07/2021 07:18:30 - INFO - __main__ - Step 71758: {'lr': 0.0002724860866702285, 'samples': 13777536, 'steps': 71757, 'loss/train': 1.36345636844635} 11/07/2021 07:18:30 - INFO - __main__ - Step 71759: {'lr': 0.000272480801430149, 'samples': 13777728, 'steps': 71758, 'loss/train': 1.6435717344284058} 11/07/2021 07:18:31 - INFO - __main__ - Step 71760: {'lr': 0.00027247551617993993, 'samples': 13777920, 'steps': 71759, 'loss/train': 1.5001981258392334} 11/07/2021 07:18:32 - INFO - __main__ - Step 71761: {'lr': 0.0002724702309196038, 'samples': 13778112, 'steps': 71760, 'loss/train': 1.1209968328475952} 11/07/2021 07:18:32 - INFO - __main__ - Step 71762: {'lr': 0.0002724649456491429, 'samples': 13778304, 'steps': 71761, 'loss/train': 1.3894530534744263} 11/07/2021 07:18:33 - INFO - __main__ - Step 71763: {'lr': 0.00027245966036855965, 'samples': 13778496, 'steps': 71762, 'loss/train': 1.6983643770217896} 11/07/2021 07:18:33 - INFO - __main__ - Step 71764: {'lr': 0.00027245437507785646, 'samples': 13778688, 'steps': 71763, 'loss/train': 1.6263850927352905} 11/07/2021 07:18:33 - INFO - __main__ - Step 71765: {'lr': 0.00027244908977703565, 'samples': 13778880, 'steps': 71764, 'loss/train': 1.645914912223816} 11/07/2021 07:18:34 - INFO - __main__ - Step 71766: {'lr': 0.0002724438044660996, 'samples': 13779072, 'steps': 71765, 'loss/train': 1.4284005165100098} 11/07/2021 07:18:35 - INFO - __main__ - Step 71767: {'lr': 0.00027243851914505074, 'samples': 13779264, 'steps': 71766, 'loss/train': 1.4358717203140259} 11/07/2021 07:18:35 - INFO - __main__ - Step 71768: {'lr': 0.0002724332338138914, 'samples': 13779456, 'steps': 71767, 'loss/train': 1.498349905014038} 11/07/2021 07:18:35 - INFO - __main__ - Step 71769: {'lr': 0.00027242794847262406, 'samples': 13779648, 'steps': 71768, 'loss/train': 1.1646442413330078} 11/07/2021 07:18:36 - INFO - __main__ - Step 71770: {'lr': 0.000272422663121251, 'samples': 13779840, 'steps': 71769, 'loss/train': 1.5667238235473633} 11/07/2021 07:18:37 - INFO - __main__ - Step 71771: {'lr': 0.0002724173777597746, 'samples': 13780032, 'steps': 71770, 'loss/train': 1.7032837867736816} 11/07/2021 07:18:37 - INFO - __main__ - Step 71772: {'lr': 0.00027241209238819733, 'samples': 13780224, 'steps': 71771, 'loss/train': 1.2010703086853027} 11/07/2021 07:18:37 - INFO - __main__ - Step 71773: {'lr': 0.0002724068070065215, 'samples': 13780416, 'steps': 71772, 'loss/train': 1.4387352466583252} 11/07/2021 07:18:38 - INFO - __main__ - Step 71774: {'lr': 0.0002724015216147495, 'samples': 13780608, 'steps': 71773, 'loss/train': 1.540673851966858} 11/07/2021 07:18:38 - INFO - __main__ - Step 71775: {'lr': 0.0002723962362128837, 'samples': 13780800, 'steps': 71774, 'loss/train': 0.7549400329589844} 11/07/2021 07:18:39 - INFO - __main__ - Step 71776: {'lr': 0.0002723909508009265, 'samples': 13780992, 'steps': 71775, 'loss/train': 1.9723349809646606} 11/07/2021 07:18:39 - INFO - __main__ - Step 71777: {'lr': 0.00027238566537888035, 'samples': 13781184, 'steps': 71776, 'loss/train': 1.2264635562896729} 11/07/2021 07:18:40 - INFO - __main__ - Step 71778: {'lr': 0.0002723803799467475, 'samples': 13781376, 'steps': 71777, 'loss/train': 1.2367433309555054} 11/07/2021 07:18:40 - INFO - __main__ - Step 71779: {'lr': 0.0002723750945045304, 'samples': 13781568, 'steps': 71778, 'loss/train': 1.441361427307129} 11/07/2021 07:18:40 - INFO - __main__ - Step 71780: {'lr': 0.00027236980905223147, 'samples': 13781760, 'steps': 71779, 'loss/train': 1.7305471897125244} 11/07/2021 07:18:42 - INFO - __main__ - Step 71781: {'lr': 0.00027236452358985304, 'samples': 13781952, 'steps': 71780, 'loss/train': 1.15060293674469} 11/07/2021 07:18:42 - INFO - __main__ - Step 71782: {'lr': 0.00027235923811739745, 'samples': 13782144, 'steps': 71781, 'loss/train': 1.5328782796859741} 11/07/2021 07:18:42 - INFO - __main__ - Step 71783: {'lr': 0.0002723539526348671, 'samples': 13782336, 'steps': 71782, 'loss/train': 1.3867238759994507} 11/07/2021 07:18:43 - INFO - __main__ - Step 71784: {'lr': 0.0002723486671422645, 'samples': 13782528, 'steps': 71783, 'loss/train': 1.3244807720184326} 11/07/2021 07:18:43 - INFO - __main__ - Step 71785: {'lr': 0.0002723433816395919, 'samples': 13782720, 'steps': 71784, 'loss/train': 1.4559651613235474} 11/07/2021 07:18:43 - INFO - __main__ - Step 71786: {'lr': 0.00027233809612685177, 'samples': 13782912, 'steps': 71785, 'loss/train': 1.5029219388961792} 11/07/2021 07:18:44 - INFO - __main__ - Step 71787: {'lr': 0.00027233281060404636, 'samples': 13783104, 'steps': 71786, 'loss/train': 1.3458538055419922} 11/07/2021 07:18:45 - INFO - __main__ - Step 71788: {'lr': 0.00027232752507117816, 'samples': 13783296, 'steps': 71787, 'loss/train': 1.6528074741363525} 11/07/2021 07:18:45 - INFO - __main__ - Step 71789: {'lr': 0.00027232223952824953, 'samples': 13783488, 'steps': 71788, 'loss/train': 1.2431132793426514} 11/07/2021 07:18:45 - INFO - __main__ - Step 71790: {'lr': 0.0002723169539752628, 'samples': 13783680, 'steps': 71789, 'loss/train': 1.4439797401428223} 11/07/2021 07:18:46 - INFO - __main__ - Step 71791: {'lr': 0.0002723116684122205, 'samples': 13783872, 'steps': 71790, 'loss/train': 1.9575101137161255} 11/07/2021 07:18:47 - INFO - __main__ - Step 71792: {'lr': 0.0002723063828391248, 'samples': 13784064, 'steps': 71791, 'loss/train': 1.0593788623809814} 11/07/2021 07:18:47 - INFO - __main__ - Step 71793: {'lr': 0.00027230109725597825, 'samples': 13784256, 'steps': 71792, 'loss/train': 1.3852258920669556} 11/07/2021 07:18:47 - INFO - __main__ - Step 71794: {'lr': 0.00027229581166278313, 'samples': 13784448, 'steps': 71793, 'loss/train': 1.7264368534088135} 11/07/2021 07:18:48 - INFO - __main__ - Step 71795: {'lr': 0.00027229052605954186, 'samples': 13784640, 'steps': 71794, 'loss/train': 1.3196929693222046} 11/07/2021 07:18:48 - INFO - __main__ - Step 71796: {'lr': 0.0002722852404462568, 'samples': 13784832, 'steps': 71795, 'loss/train': 1.4058696031570435} 11/07/2021 07:18:49 - INFO - __main__ - Step 71797: {'lr': 0.00027227995482293046, 'samples': 13785024, 'steps': 71796, 'loss/train': 1.494918942451477} 11/07/2021 07:18:49 - INFO - __main__ - Step 71798: {'lr': 0.000272274669189565, 'samples': 13785216, 'steps': 71797, 'loss/train': 1.0237503051757812} 11/07/2021 07:18:50 - INFO - __main__ - Step 71799: {'lr': 0.000272269383546163, 'samples': 13785408, 'steps': 71798, 'loss/train': 1.1571391820907593} 11/07/2021 07:18:50 - INFO - __main__ - Step 71800: {'lr': 0.0002722640978927267, 'samples': 13785600, 'steps': 71799, 'loss/train': 1.0003809928894043} 11/07/2021 07:18:51 - INFO - __main__ - Step 71801: {'lr': 0.0002722588122292585, 'samples': 13785792, 'steps': 71800, 'loss/train': 1.6405940055847168} 11/07/2021 07:18:52 - INFO - __main__ - Step 71802: {'lr': 0.00027225352655576093, 'samples': 13785984, 'steps': 71801, 'loss/train': 1.5484620332717896} 11/07/2021 07:18:52 - INFO - __main__ - Step 71803: {'lr': 0.0002722482408722363, 'samples': 13786176, 'steps': 71802, 'loss/train': 1.2634859085083008} 11/07/2021 07:18:52 - INFO - __main__ - Step 71804: {'lr': 0.0002722429551786868, 'samples': 13786368, 'steps': 71803, 'loss/train': 1.0498019456863403} 11/07/2021 07:18:53 - INFO - __main__ - Step 71805: {'lr': 0.0002722376694751151, 'samples': 13786560, 'steps': 71804, 'loss/train': 1.6656548976898193} 11/07/2021 07:18:53 - INFO - __main__ - Step 71806: {'lr': 0.0002722323837615234, 'samples': 13786752, 'steps': 71805, 'loss/train': 1.594114065170288} 11/07/2021 07:18:54 - INFO - __main__ - Step 71807: {'lr': 0.00027222709803791404, 'samples': 13786944, 'steps': 71806, 'loss/train': 1.0168334245681763} 11/07/2021 07:18:54 - INFO - __main__ - Step 71808: {'lr': 0.0002722218123042896, 'samples': 13787136, 'steps': 71807, 'loss/train': 1.4457693099975586} 11/07/2021 07:18:55 - INFO - __main__ - Step 71809: {'lr': 0.0002722165265606523, 'samples': 13787328, 'steps': 71808, 'loss/train': 1.3253625631332397} 11/07/2021 07:18:55 - INFO - __main__ - Step 71810: {'lr': 0.00027221124080700467, 'samples': 13787520, 'steps': 71809, 'loss/train': 1.3362228870391846} 11/07/2021 07:18:56 - INFO - __main__ - Step 71811: {'lr': 0.00027220595504334896, 'samples': 13787712, 'steps': 71810, 'loss/train': 1.3923481702804565} 11/07/2021 07:18:56 - INFO - __main__ - Step 71812: {'lr': 0.0002722006692696875, 'samples': 13787904, 'steps': 71811, 'loss/train': 0.7050343155860901} 11/07/2021 07:18:57 - INFO - __main__ - Step 71813: {'lr': 0.00027219538348602286, 'samples': 13788096, 'steps': 71812, 'loss/train': 1.43928062915802} 11/07/2021 07:18:57 - INFO - __main__ - Step 71814: {'lr': 0.00027219009769235725, 'samples': 13788288, 'steps': 71813, 'loss/train': 1.6575767993927002} 11/07/2021 07:18:58 - INFO - __main__ - Step 71815: {'lr': 0.0002721848118886931, 'samples': 13788480, 'steps': 71814, 'loss/train': 1.6967085599899292} 11/07/2021 07:18:58 - INFO - __main__ - Step 71816: {'lr': 0.0002721795260750329, 'samples': 13788672, 'steps': 71815, 'loss/train': 1.686553955078125} 11/07/2021 07:18:58 - INFO - __main__ - Step 71817: {'lr': 0.000272174240251379, 'samples': 13788864, 'steps': 71816, 'loss/train': 1.7797931432724} 11/07/2021 07:19:00 - INFO - __main__ - Step 71818: {'lr': 0.00027216895441773363, 'samples': 13789056, 'steps': 71817, 'loss/train': 1.4758398532867432} 11/07/2021 07:19:00 - INFO - __main__ - Step 71819: {'lr': 0.0002721636685740993, 'samples': 13789248, 'steps': 71818, 'loss/train': 0.8863992094993591} 11/07/2021 07:19:01 - INFO - __main__ - Step 71820: {'lr': 0.0002721583827204784, 'samples': 13789440, 'steps': 71819, 'loss/train': 1.3654811382293701} 11/07/2021 07:19:01 - INFO - __main__ - Step 71821: {'lr': 0.00027215309685687324, 'samples': 13789632, 'steps': 71820, 'loss/train': 1.916947841644287} 11/07/2021 07:19:01 - INFO - __main__ - Step 71822: {'lr': 0.00027214781098328615, 'samples': 13789824, 'steps': 71821, 'loss/train': 1.7421094179153442} 11/07/2021 07:19:02 - INFO - __main__ - Step 71823: {'lr': 0.0002721425250997197, 'samples': 13790016, 'steps': 71822, 'loss/train': 1.2075815200805664} 11/07/2021 07:19:02 - INFO - __main__ - Step 71824: {'lr': 0.00027213723920617623, 'samples': 13790208, 'steps': 71823, 'loss/train': 2.5777623653411865} 11/07/2021 07:19:03 - INFO - __main__ - Step 71825: {'lr': 0.00027213195330265795, 'samples': 13790400, 'steps': 71824, 'loss/train': 3.0138049125671387} 11/07/2021 07:19:03 - INFO - __main__ - Step 71826: {'lr': 0.00027212666738916734, 'samples': 13790592, 'steps': 71825, 'loss/train': 1.0744127035140991} 11/07/2021 07:19:04 - INFO - __main__ - Step 71827: {'lr': 0.00027212138146570685, 'samples': 13790784, 'steps': 71826, 'loss/train': 1.0147708654403687} 11/07/2021 07:19:04 - INFO - __main__ - Step 71828: {'lr': 0.0002721160955322788, 'samples': 13790976, 'steps': 71827, 'loss/train': 1.4786312580108643} 11/07/2021 07:19:05 - INFO - __main__ - Step 71829: {'lr': 0.00027211080958888556, 'samples': 13791168, 'steps': 71828, 'loss/train': 1.1269276142120361} 11/07/2021 07:19:06 - INFO - __main__ - Step 71830: {'lr': 0.0002721055236355296, 'samples': 13791360, 'steps': 71829, 'loss/train': 1.8186333179473877} 11/07/2021 07:19:06 - INFO - __main__ - Step 71831: {'lr': 0.00027210023767221313, 'samples': 13791552, 'steps': 71830, 'loss/train': 1.5412482023239136} 11/07/2021 07:19:06 - INFO - __main__ - Step 71832: {'lr': 0.00027209495169893875, 'samples': 13791744, 'steps': 71831, 'loss/train': 1.6194852590560913} 11/07/2021 07:19:07 - INFO - __main__ - Step 71833: {'lr': 0.00027208966571570857, 'samples': 13791936, 'steps': 71832, 'loss/train': 1.562589406967163} 11/07/2021 07:19:07 - INFO - __main__ - Step 71834: {'lr': 0.00027208437972252525, 'samples': 13792128, 'steps': 71833, 'loss/train': 1.1818796396255493} 11/07/2021 07:19:07 - INFO - __main__ - Step 71835: {'lr': 0.00027207909371939097, 'samples': 13792320, 'steps': 71834, 'loss/train': 1.286503553390503} 11/07/2021 07:19:09 - INFO - __main__ - Step 71836: {'lr': 0.00027207380770630826, 'samples': 13792512, 'steps': 71835, 'loss/train': 1.3817791938781738} 11/07/2021 07:19:09 - INFO - __main__ - Step 71837: {'lr': 0.00027206852168327946, 'samples': 13792704, 'steps': 71836, 'loss/train': 1.7262473106384277} 11/07/2021 07:19:09 - INFO - __main__ - Step 71838: {'lr': 0.0002720632356503069, 'samples': 13792896, 'steps': 71837, 'loss/train': 1.3961379528045654} 11/07/2021 07:19:10 - INFO - __main__ - Step 71839: {'lr': 0.00027205794960739296, 'samples': 13793088, 'steps': 71838, 'loss/train': 0.49601271748542786} 11/07/2021 07:19:10 - INFO - __main__ - Step 71840: {'lr': 0.00027205266355454, 'samples': 13793280, 'steps': 71839, 'loss/train': 1.637178897857666} 11/07/2021 07:19:11 - INFO - __main__ - Step 71841: {'lr': 0.00027204737749175046, 'samples': 13793472, 'steps': 71840, 'loss/train': 0.9615025520324707} 11/07/2021 07:19:11 - INFO - __main__ - Step 71842: {'lr': 0.00027204209141902676, 'samples': 13793664, 'steps': 71841, 'loss/train': 1.439517855644226} 11/07/2021 07:19:12 - INFO - __main__ - Step 71843: {'lr': 0.0002720368053363712, 'samples': 13793856, 'steps': 71842, 'loss/train': 1.719544768333435} 11/07/2021 07:19:12 - INFO - __main__ - Step 71844: {'lr': 0.00027203151924378626, 'samples': 13794048, 'steps': 71843, 'loss/train': 1.418763518333435} 11/07/2021 07:19:12 - INFO - __main__ - Step 71845: {'lr': 0.0002720262331412742, 'samples': 13794240, 'steps': 71844, 'loss/train': 1.186470866203308} 11/07/2021 07:19:13 - INFO - __main__ - Step 71846: {'lr': 0.0002720209470288375, 'samples': 13794432, 'steps': 71845, 'loss/train': 1.4667854309082031} 11/07/2021 07:19:14 - INFO - __main__ - Step 71847: {'lr': 0.00027201566090647843, 'samples': 13794624, 'steps': 71846, 'loss/train': 1.2136733531951904} 11/07/2021 07:19:14 - INFO - __main__ - Step 71848: {'lr': 0.00027201037477419957, 'samples': 13794816, 'steps': 71847, 'loss/train': 1.1326595544815063} 11/07/2021 07:19:14 - INFO - __main__ - Step 71849: {'lr': 0.000272005088632003, 'samples': 13795008, 'steps': 71848, 'loss/train': 0.7829946279525757} 11/07/2021 07:19:15 - INFO - __main__ - Step 71850: {'lr': 0.0002719998024798915, 'samples': 13795200, 'steps': 71849, 'loss/train': 1.0048640966415405} 11/07/2021 07:19:16 - INFO - __main__ - Step 71851: {'lr': 0.00027199451631786705, 'samples': 13795392, 'steps': 71850, 'loss/train': 1.1411794424057007} 11/07/2021 07:19:16 - INFO - __main__ - Step 71852: {'lr': 0.00027198923014593225, 'samples': 13795584, 'steps': 71851, 'loss/train': 1.866564154624939} 11/07/2021 07:19:16 - INFO - __main__ - Step 71853: {'lr': 0.0002719839439640894, 'samples': 13795776, 'steps': 71852, 'loss/train': 1.3603554964065552} 11/07/2021 07:19:17 - INFO - __main__ - Step 71854: {'lr': 0.00027197865777234097, 'samples': 13795968, 'steps': 71853, 'loss/train': 0.7264984846115112} 11/07/2021 07:19:17 - INFO - __main__ - Step 71855: {'lr': 0.00027197337157068937, 'samples': 13796160, 'steps': 71854, 'loss/train': 1.4848508834838867} 11/07/2021 07:19:18 - INFO - __main__ - Step 71856: {'lr': 0.0002719680853591368, 'samples': 13796352, 'steps': 71855, 'loss/train': 1.3015220165252686} 11/07/2021 07:19:18 - INFO - __main__ - Step 71857: {'lr': 0.00027196279913768587, 'samples': 13796544, 'steps': 71856, 'loss/train': 1.3843196630477905} 11/07/2021 07:19:19 - INFO - __main__ - Step 71858: {'lr': 0.00027195751290633874, 'samples': 13796736, 'steps': 71857, 'loss/train': 1.4419810771942139} 11/07/2021 07:19:19 - INFO - __main__ - Step 71859: {'lr': 0.0002719522266650979, 'samples': 13796928, 'steps': 71858, 'loss/train': 0.5234899520874023} 11/07/2021 07:19:19 - INFO - __main__ - Step 71860: {'lr': 0.00027194694041396574, 'samples': 13797120, 'steps': 71859, 'loss/train': 1.292435884475708} 11/07/2021 07:19:21 - INFO - __main__ - Step 71861: {'lr': 0.0002719416541529446, 'samples': 13797312, 'steps': 71860, 'loss/train': 1.5201945304870605} 11/07/2021 07:19:21 - INFO - __main__ - Step 71862: {'lr': 0.0002719363678820369, 'samples': 13797504, 'steps': 71861, 'loss/train': 1.415274977684021} 11/07/2021 07:19:21 - INFO - __main__ - Step 71863: {'lr': 0.000271931081601245, 'samples': 13797696, 'steps': 71862, 'loss/train': 3.3091330528259277} 11/07/2021 07:19:22 - INFO - __main__ - Step 71864: {'lr': 0.00027192579531057137, 'samples': 13797888, 'steps': 71863, 'loss/train': 1.7635477781295776} 11/07/2021 07:19:22 - INFO - __main__ - Step 71865: {'lr': 0.0002719205090100183, 'samples': 13798080, 'steps': 71864, 'loss/train': 1.329006552696228} 11/07/2021 07:19:23 - INFO - __main__ - Step 71866: {'lr': 0.0002719152226995881, 'samples': 13798272, 'steps': 71865, 'loss/train': 1.015618920326233} 11/07/2021 07:19:23 - INFO - __main__ - Step 71867: {'lr': 0.0002719099363792833, 'samples': 13798464, 'steps': 71866, 'loss/train': 1.3505867719650269} 11/07/2021 07:19:24 - INFO - __main__ - Step 71868: {'lr': 0.0002719046500491062, 'samples': 13798656, 'steps': 71867, 'loss/train': 1.840229868888855} 11/07/2021 07:19:24 - INFO - __main__ - Step 71869: {'lr': 0.0002718993637090592, 'samples': 13798848, 'steps': 71868, 'loss/train': 1.1633129119873047} 11/07/2021 07:19:24 - INFO - __main__ - Step 71870: {'lr': 0.0002718940773591447, 'samples': 13799040, 'steps': 71869, 'loss/train': 1.5994302034378052} 11/07/2021 07:19:25 - INFO - __main__ - Step 71871: {'lr': 0.00027188879099936515, 'samples': 13799232, 'steps': 71870, 'loss/train': 1.6047366857528687} 11/07/2021 07:19:26 - INFO - __main__ - Step 71872: {'lr': 0.0002718835046297227, 'samples': 13799424, 'steps': 71871, 'loss/train': 1.5819426774978638} 11/07/2021 07:19:26 - INFO - __main__ - Step 71873: {'lr': 0.00027187821825021995, 'samples': 13799616, 'steps': 71872, 'loss/train': 2.1467959880828857} 11/07/2021 07:19:26 - INFO - __main__ - Step 71874: {'lr': 0.0002718729318608592, 'samples': 13799808, 'steps': 71873, 'loss/train': 1.4272964000701904} 11/07/2021 07:19:27 - INFO - __main__ - Step 71875: {'lr': 0.0002718676454616428, 'samples': 13800000, 'steps': 71874, 'loss/train': 1.2887356281280518} 11/07/2021 07:19:27 - INFO - __main__ - Step 71876: {'lr': 0.00027186235905257326, 'samples': 13800192, 'steps': 71875, 'loss/train': 1.6287097930908203} 11/07/2021 07:19:28 - INFO - __main__ - Step 71877: {'lr': 0.0002718570726336529, 'samples': 13800384, 'steps': 71876, 'loss/train': 1.5640298128128052} 11/07/2021 07:19:29 - INFO - __main__ - Step 71878: {'lr': 0.00027185178620488406, 'samples': 13800576, 'steps': 71877, 'loss/train': 0.7295761704444885} 11/07/2021 07:19:29 - INFO - __main__ - Step 71879: {'lr': 0.00027184649976626907, 'samples': 13800768, 'steps': 71878, 'loss/train': 1.4187389612197876} 11/07/2021 07:19:29 - INFO - __main__ - Step 71880: {'lr': 0.0002718412133178104, 'samples': 13800960, 'steps': 71879, 'loss/train': 1.4224936962127686} 11/07/2021 07:19:30 - INFO - __main__ - Step 71881: {'lr': 0.0002718359268595105, 'samples': 13801152, 'steps': 71880, 'loss/train': 1.1800979375839233} 11/07/2021 07:19:31 - INFO - __main__ - Step 71882: {'lr': 0.0002718306403913716, 'samples': 13801344, 'steps': 71881, 'loss/train': 0.16164173185825348} 11/07/2021 07:19:31 - INFO - __main__ - Step 71883: {'lr': 0.0002718253539133961, 'samples': 13801536, 'steps': 71882, 'loss/train': 1.0012739896774292} 11/07/2021 07:19:31 - INFO - __main__ - Step 71884: {'lr': 0.0002718200674255865, 'samples': 13801728, 'steps': 71883, 'loss/train': 1.487130045890808} 11/07/2021 07:19:32 - INFO - __main__ - Step 71885: {'lr': 0.00027181478092794514, 'samples': 13801920, 'steps': 71884, 'loss/train': 1.0865854024887085} 11/07/2021 07:19:32 - INFO - __main__ - Step 71886: {'lr': 0.0002718094944204743, 'samples': 13802112, 'steps': 71885, 'loss/train': 1.1371097564697266} 11/07/2021 07:19:33 - INFO - __main__ - Step 71887: {'lr': 0.0002718042079031765, 'samples': 13802304, 'steps': 71886, 'loss/train': 1.389422059059143} 11/07/2021 07:19:33 - INFO - __main__ - Step 71888: {'lr': 0.00027179892137605403, 'samples': 13802496, 'steps': 71887, 'loss/train': 1.102777123451233} 11/07/2021 07:19:34 - INFO - __main__ - Step 71889: {'lr': 0.0002717936348391093, 'samples': 13802688, 'steps': 71888, 'loss/train': 1.3617103099822998} 11/07/2021 07:19:34 - INFO - __main__ - Step 71890: {'lr': 0.00027178834829234475, 'samples': 13802880, 'steps': 71889, 'loss/train': 1.3854864835739136} 11/07/2021 07:19:35 - INFO - __main__ - Step 71891: {'lr': 0.00027178306173576266, 'samples': 13803072, 'steps': 71890, 'loss/train': 1.3519576787948608} 11/07/2021 07:19:35 - INFO - __main__ - Step 71892: {'lr': 0.00027177777516936545, 'samples': 13803264, 'steps': 71891, 'loss/train': 1.7862367630004883} 11/07/2021 07:19:36 - INFO - __main__ - Step 71893: {'lr': 0.0002717724885931555, 'samples': 13803456, 'steps': 71892, 'loss/train': 1.4448678493499756} 11/07/2021 07:19:36 - INFO - __main__ - Step 71894: {'lr': 0.0002717672020071352, 'samples': 13803648, 'steps': 71893, 'loss/train': 1.756799340248108} 11/07/2021 07:19:37 - INFO - __main__ - Step 71895: {'lr': 0.000271761915411307, 'samples': 13803840, 'steps': 71894, 'loss/train': 1.6164072751998901} 11/07/2021 07:19:37 - INFO - __main__ - Step 71896: {'lr': 0.00027175662880567317, 'samples': 13804032, 'steps': 71895, 'loss/train': 1.8916233777999878} 11/07/2021 07:19:37 - INFO - __main__ - Step 71897: {'lr': 0.0002717513421902362, 'samples': 13804224, 'steps': 71896, 'loss/train': 0.8707305192947388} 11/07/2021 07:19:38 - INFO - __main__ - Step 71898: {'lr': 0.0002717460555649983, 'samples': 13804416, 'steps': 71897, 'loss/train': 1.2667864561080933} 11/07/2021 07:19:39 - INFO - __main__ - Step 71899: {'lr': 0.00027174076892996204, 'samples': 13804608, 'steps': 71898, 'loss/train': 1.4700604677200317} 11/07/2021 07:19:39 - INFO - __main__ - Step 71900: {'lr': 0.0002717354822851297, 'samples': 13804800, 'steps': 71899, 'loss/train': 5.703769207000732} 11/07/2021 07:19:39 - INFO - __main__ - Step 71901: {'lr': 0.0002717301956305037, 'samples': 13804992, 'steps': 71900, 'loss/train': 1.3549805879592896} 11/07/2021 07:19:40 - INFO - __main__ - Step 71902: {'lr': 0.00027172490896608636, 'samples': 13805184, 'steps': 71901, 'loss/train': 0.9425691962242126} 11/07/2021 07:19:41 - INFO - __main__ - Step 71903: {'lr': 0.00027171962229188026, 'samples': 13805376, 'steps': 71902, 'loss/train': 1.984231948852539} 11/07/2021 07:19:41 - INFO - __main__ - Step 71904: {'lr': 0.0002717143356078875, 'samples': 13805568, 'steps': 71903, 'loss/train': 1.2951658964157104} 11/07/2021 07:19:42 - INFO - __main__ - Step 71905: {'lr': 0.0002717090489141106, 'samples': 13805760, 'steps': 71904, 'loss/train': 1.0171985626220703} 11/07/2021 07:19:42 - INFO - __main__ - Step 71906: {'lr': 0.00027170376221055193, 'samples': 13805952, 'steps': 71905, 'loss/train': 1.4873056411743164} 11/07/2021 07:19:42 - INFO - __main__ - Step 71907: {'lr': 0.0002716984754972139, 'samples': 13806144, 'steps': 71906, 'loss/train': 0.29305437207221985} 11/07/2021 07:19:43 - INFO - __main__ - Step 71908: {'lr': 0.0002716931887740989, 'samples': 13806336, 'steps': 71907, 'loss/train': 1.5441101789474487} 11/07/2021 07:19:44 - INFO - __main__ - Step 71909: {'lr': 0.0002716879020412093, 'samples': 13806528, 'steps': 71908, 'loss/train': 0.10374259948730469} 11/07/2021 07:19:44 - INFO - __main__ - Step 71910: {'lr': 0.00027168261529854744, 'samples': 13806720, 'steps': 71909, 'loss/train': 1.5174217224121094} 11/07/2021 07:19:44 - INFO - __main__ - Step 71911: {'lr': 0.00027167732854611567, 'samples': 13806912, 'steps': 71910, 'loss/train': 1.4732733964920044} 11/07/2021 07:19:45 - INFO - __main__ - Step 71912: {'lr': 0.0002716720417839165, 'samples': 13807104, 'steps': 71911, 'loss/train': 1.1700587272644043} 11/07/2021 07:19:45 - INFO - __main__ - Step 71913: {'lr': 0.0002716667550119522, 'samples': 13807296, 'steps': 71912, 'loss/train': 1.1622542142868042} 11/07/2021 07:19:46 - INFO - __main__ - Step 71914: {'lr': 0.00027166146823022524, 'samples': 13807488, 'steps': 71913, 'loss/train': 1.7475919723510742} 11/07/2021 07:19:47 - INFO - __main__ - Step 71915: {'lr': 0.00027165618143873795, 'samples': 13807680, 'steps': 71914, 'loss/train': 1.2501615285873413} 11/07/2021 07:19:47 - INFO - __main__ - Step 71916: {'lr': 0.0002716508946374927, 'samples': 13807872, 'steps': 71915, 'loss/train': 1.4468562602996826} 11/07/2021 07:19:47 - INFO - __main__ - Step 71917: {'lr': 0.0002716456078264918, 'samples': 13808064, 'steps': 71916, 'loss/train': 1.2862318754196167} 11/07/2021 07:19:48 - INFO - __main__ - Step 71918: {'lr': 0.00027164032100573785, 'samples': 13808256, 'steps': 71917, 'loss/train': 1.7155340909957886} 11/07/2021 07:19:49 - INFO - __main__ - Step 71919: {'lr': 0.0002716350341752331, 'samples': 13808448, 'steps': 71918, 'loss/train': 1.3860960006713867} 11/07/2021 07:19:49 - INFO - __main__ - Step 71920: {'lr': 0.00027162974733497994, 'samples': 13808640, 'steps': 71919, 'loss/train': 1.2260115146636963} 11/07/2021 07:19:49 - INFO - __main__ - Step 71921: {'lr': 0.0002716244604849807, 'samples': 13808832, 'steps': 71920, 'loss/train': 0.261407732963562} 11/07/2021 07:19:50 - INFO - __main__ - Step 71922: {'lr': 0.0002716191736252378, 'samples': 13809024, 'steps': 71921, 'loss/train': 0.4085228741168976} 11/07/2021 07:19:50 - INFO - __main__ - Step 71923: {'lr': 0.00027161388675575365, 'samples': 13809216, 'steps': 71922, 'loss/train': 1.1583161354064941} 11/07/2021 07:19:51 - INFO - __main__ - Step 71924: {'lr': 0.0002716085998765306, 'samples': 13809408, 'steps': 71923, 'loss/train': 1.5590966939926147} 11/07/2021 07:19:51 - INFO - __main__ - Step 71925: {'lr': 0.00027160331298757117, 'samples': 13809600, 'steps': 71924, 'loss/train': 1.3895847797393799} 11/07/2021 07:19:52 - INFO - __main__ - Step 71926: {'lr': 0.0002715980260888775, 'samples': 13809792, 'steps': 71925, 'loss/train': 4.3891191482543945} 11/07/2021 07:19:52 - INFO - __main__ - Step 71927: {'lr': 0.0002715927391804521, 'samples': 13809984, 'steps': 71926, 'loss/train': 1.0232669115066528} 11/07/2021 07:19:53 - INFO - __main__ - Step 71928: {'lr': 0.00027158745226229744, 'samples': 13810176, 'steps': 71927, 'loss/train': 1.466808795928955} 11/07/2021 07:19:53 - INFO - __main__ - Step 71929: {'lr': 0.0002715821653344157, 'samples': 13810368, 'steps': 71928, 'loss/train': 1.7174477577209473} 11/07/2021 07:19:54 - INFO - __main__ - Step 71930: {'lr': 0.0002715768783968094, 'samples': 13810560, 'steps': 71929, 'loss/train': 1.569154143333435} 11/07/2021 07:19:54 - INFO - __main__ - Step 71931: {'lr': 0.0002715715914494809, 'samples': 13810752, 'steps': 71930, 'loss/train': 1.4820568561553955} 11/07/2021 07:19:55 - INFO - __main__ - Step 71932: {'lr': 0.00027156630449243256, 'samples': 13810944, 'steps': 71931, 'loss/train': 1.365361213684082} 11/07/2021 07:19:55 - INFO - __main__ - Step 71933: {'lr': 0.00027156101752566676, 'samples': 13811136, 'steps': 71932, 'loss/train': 1.1967675685882568} 11/07/2021 07:19:55 - INFO - __main__ - Step 71934: {'lr': 0.0002715557305491859, 'samples': 13811328, 'steps': 71933, 'loss/train': 1.4772173166275024} 11/07/2021 07:19:56 - INFO - __main__ - Step 71935: {'lr': 0.0002715504435629924, 'samples': 13811520, 'steps': 71934, 'loss/train': 1.6169116497039795} 11/07/2021 07:19:57 - INFO - __main__ - Step 71936: {'lr': 0.00027154515656708855, 'samples': 13811712, 'steps': 71935, 'loss/train': 1.502967119216919} 11/07/2021 07:19:57 - INFO - __main__ - Step 71937: {'lr': 0.00027153986956147686, 'samples': 13811904, 'steps': 71936, 'loss/train': 1.391472578048706} 11/07/2021 07:19:57 - INFO - __main__ - Step 71938: {'lr': 0.0002715345825461597, 'samples': 13812096, 'steps': 71937, 'loss/train': 1.389668345451355} 11/07/2021 07:19:58 - INFO - __main__ - Step 71939: {'lr': 0.0002715292955211392, 'samples': 13812288, 'steps': 71938, 'loss/train': 1.8333940505981445} 11/07/2021 07:19:59 - INFO - __main__ - Step 71940: {'lr': 0.000271524008486418, 'samples': 13812480, 'steps': 71939, 'loss/train': 0.6696743369102478} 11/07/2021 07:19:59 - INFO - __main__ - Step 71941: {'lr': 0.0002715187214419985, 'samples': 13812672, 'steps': 71940, 'loss/train': 1.3371471166610718} 11/07/2021 07:20:00 - INFO - __main__ - Step 71942: {'lr': 0.00027151343438788284, 'samples': 13812864, 'steps': 71941, 'loss/train': 1.7423876523971558} 11/07/2021 07:20:00 - INFO - __main__ - Step 71943: {'lr': 0.0002715081473240736, 'samples': 13813056, 'steps': 71942, 'loss/train': 1.338893175125122} 11/07/2021 07:20:00 - INFO - __main__ - Step 71944: {'lr': 0.0002715028602505732, 'samples': 13813248, 'steps': 71943, 'loss/train': 1.124343752861023} 11/07/2021 07:20:01 - INFO - __main__ - Step 71945: {'lr': 0.000271497573167384, 'samples': 13813440, 'steps': 71944, 'loss/train': 1.1108442544937134} 11/07/2021 07:20:02 - INFO - __main__ - Step 71946: {'lr': 0.00027149228607450823, 'samples': 13813632, 'steps': 71945, 'loss/train': 1.7439234256744385} 11/07/2021 07:20:02 - INFO - __main__ - Step 71947: {'lr': 0.00027148699897194833, 'samples': 13813824, 'steps': 71946, 'loss/train': 2.0873124599456787} 11/07/2021 07:20:02 - INFO - __main__ - Step 71948: {'lr': 0.00027148171185970677, 'samples': 13814016, 'steps': 71947, 'loss/train': 1.075976014137268} 11/07/2021 07:20:03 - INFO - __main__ - Step 71949: {'lr': 0.00027147642473778584, 'samples': 13814208, 'steps': 71948, 'loss/train': 1.1846051216125488} 11/07/2021 07:20:03 - INFO - __main__ - Step 71950: {'lr': 0.000271471137606188, 'samples': 13814400, 'steps': 71949, 'loss/train': 1.5739202499389648} 11/07/2021 07:20:04 - INFO - __main__ - Step 71951: {'lr': 0.00027146585046491564, 'samples': 13814592, 'steps': 71950, 'loss/train': 1.5196884870529175} 11/07/2021 07:20:05 - INFO - __main__ - Step 71952: {'lr': 0.00027146056331397105, 'samples': 13814784, 'steps': 71951, 'loss/train': 1.18962824344635} 11/07/2021 07:20:05 - INFO - __main__ - Step 71953: {'lr': 0.0002714552761533566, 'samples': 13814976, 'steps': 71952, 'loss/train': 1.8217041492462158} 11/07/2021 07:20:05 - INFO - __main__ - Step 71954: {'lr': 0.00027144998898307485, 'samples': 13815168, 'steps': 71953, 'loss/train': 1.4626535177230835} 11/07/2021 07:20:06 - INFO - __main__ - Step 71955: {'lr': 0.000271444701803128, 'samples': 13815360, 'steps': 71954, 'loss/train': 1.0254707336425781} 11/07/2021 07:20:07 - INFO - __main__ - Step 71956: {'lr': 0.0002714394146135185, 'samples': 13815552, 'steps': 71955, 'loss/train': 1.5484638214111328} 11/07/2021 07:20:07 - INFO - __main__ - Step 71957: {'lr': 0.0002714341274142488, 'samples': 13815744, 'steps': 71956, 'loss/train': 1.7999593019485474} 11/07/2021 07:20:08 - INFO - __main__ - Step 71958: {'lr': 0.00027142884020532116, 'samples': 13815936, 'steps': 71957, 'loss/train': 1.5082019567489624} 11/07/2021 07:20:08 - INFO - __main__ - Step 71959: {'lr': 0.00027142355298673796, 'samples': 13816128, 'steps': 71958, 'loss/train': 0.2713395059108734} 11/07/2021 07:20:08 - INFO - __main__ - Step 71960: {'lr': 0.0002714182657585017, 'samples': 13816320, 'steps': 71959, 'loss/train': 0.17491403222084045} 11/07/2021 07:20:09 - INFO - __main__ - Step 71961: {'lr': 0.0002714129785206147, 'samples': 13816512, 'steps': 71960, 'loss/train': 1.6141257286071777} 11/07/2021 07:20:10 - INFO - __main__ - Step 71962: {'lr': 0.00027140769127307935, 'samples': 13816704, 'steps': 71961, 'loss/train': 1.0667296648025513} 11/07/2021 07:20:10 - INFO - __main__ - Step 71963: {'lr': 0.000271402404015898, 'samples': 13816896, 'steps': 71962, 'loss/train': 1.664445161819458} 11/07/2021 07:20:10 - INFO - __main__ - Step 71964: {'lr': 0.000271397116749073, 'samples': 13817088, 'steps': 71963, 'loss/train': 1.5067662000656128} 11/07/2021 07:20:11 - INFO - __main__ - Step 71965: {'lr': 0.0002713918294726069, 'samples': 13817280, 'steps': 71964, 'loss/train': 1.619340419769287} 11/07/2021 07:20:11 - INFO - __main__ - Step 71966: {'lr': 0.00027138654218650195, 'samples': 13817472, 'steps': 71965, 'loss/train': 1.5641663074493408} 11/07/2021 07:20:12 - INFO - __main__ - Step 71967: {'lr': 0.0002713812548907605, 'samples': 13817664, 'steps': 71966, 'loss/train': 1.4743423461914062} 11/07/2021 07:20:13 - INFO - __main__ - Step 71968: {'lr': 0.000271375967585385, 'samples': 13817856, 'steps': 71967, 'loss/train': 1.5697886943817139} 11/07/2021 07:20:13 - INFO - __main__ - Step 71969: {'lr': 0.00027137068027037784, 'samples': 13818048, 'steps': 71968, 'loss/train': 1.1672922372817993} 11/07/2021 07:20:13 - INFO - __main__ - Step 71970: {'lr': 0.00027136539294574135, 'samples': 13818240, 'steps': 71969, 'loss/train': 1.2333934307098389} 11/07/2021 07:20:14 - INFO - __main__ - Step 71971: {'lr': 0.00027136010561147806, 'samples': 13818432, 'steps': 71970, 'loss/train': 1.7546418905258179} 11/07/2021 07:20:15 - INFO - __main__ - Step 71972: {'lr': 0.0002713548182675901, 'samples': 13818624, 'steps': 71971, 'loss/train': 1.4494398832321167} 11/07/2021 07:20:15 - INFO - __main__ - Step 71973: {'lr': 0.00027134953091408005, 'samples': 13818816, 'steps': 71972, 'loss/train': 1.359808087348938} 11/07/2021 07:20:15 - INFO - __main__ - Step 71974: {'lr': 0.0002713442435509502, 'samples': 13819008, 'steps': 71973, 'loss/train': 1.5676642656326294} 11/07/2021 07:20:16 - INFO - __main__ - Step 71975: {'lr': 0.000271338956178203, 'samples': 13819200, 'steps': 71974, 'loss/train': 1.59507417678833} 11/07/2021 07:20:16 - INFO - __main__ - Step 71976: {'lr': 0.00027133366879584077, 'samples': 13819392, 'steps': 71975, 'loss/train': 0.6084814667701721} 11/07/2021 07:20:16 - INFO - __main__ - Step 71977: {'lr': 0.0002713283814038659, 'samples': 13819584, 'steps': 71976, 'loss/train': 1.3840899467468262} 11/07/2021 07:20:17 - INFO - __main__ - Step 71978: {'lr': 0.00027132309400228086, 'samples': 13819776, 'steps': 71977, 'loss/train': 1.8094016313552856} 11/07/2021 07:20:18 - INFO - __main__ - Step 71979: {'lr': 0.0002713178065910879, 'samples': 13819968, 'steps': 71978, 'loss/train': 2.0368995666503906} 11/07/2021 07:20:18 - INFO - __main__ - Step 71980: {'lr': 0.0002713125191702895, 'samples': 13820160, 'steps': 71979, 'loss/train': 0.8345212340354919} 11/07/2021 07:20:19 - INFO - __main__ - Step 71981: {'lr': 0.0002713072317398879, 'samples': 13820352, 'steps': 71980, 'loss/train': 1.228306531906128} 11/07/2021 07:20:19 - INFO - __main__ - Step 71982: {'lr': 0.0002713019442998857, 'samples': 13820544, 'steps': 71981, 'loss/train': 1.096817970275879} 11/07/2021 07:20:20 - INFO - __main__ - Step 71983: {'lr': 0.00027129665685028513, 'samples': 13820736, 'steps': 71982, 'loss/train': 1.3760764598846436} 11/07/2021 07:20:20 - INFO - __main__ - Step 71984: {'lr': 0.0002712913693910887, 'samples': 13820928, 'steps': 71983, 'loss/train': 1.8897407054901123} 11/07/2021 07:20:21 - INFO - __main__ - Step 71985: {'lr': 0.0002712860819222986, 'samples': 13821120, 'steps': 71984, 'loss/train': 1.3728581666946411} 11/07/2021 07:20:21 - INFO - __main__ - Step 71986: {'lr': 0.0002712807944439174, 'samples': 13821312, 'steps': 71985, 'loss/train': 1.5250232219696045} 11/07/2021 07:20:21 - INFO - __main__ - Step 71987: {'lr': 0.0002712755069559474, 'samples': 13821504, 'steps': 71986, 'loss/train': 1.4294650554656982} 11/07/2021 07:20:23 - INFO - __main__ - Step 71988: {'lr': 0.0002712702194583909, 'samples': 13821696, 'steps': 71987, 'loss/train': 1.722814917564392} 11/07/2021 07:20:23 - INFO - __main__ - Step 71989: {'lr': 0.0002712649319512504, 'samples': 13821888, 'steps': 71988, 'loss/train': 1.289215326309204} 11/07/2021 07:20:24 - INFO - __main__ - Step 71990: {'lr': 0.0002712596444345283, 'samples': 13822080, 'steps': 71989, 'loss/train': 1.4016990661621094} 11/07/2021 07:20:24 - INFO - __main__ - Step 71991: {'lr': 0.00027125435690822684, 'samples': 13822272, 'steps': 71990, 'loss/train': 1.2959860563278198} 11/07/2021 07:20:25 - INFO - __main__ - Step 71992: {'lr': 0.0002712490693723486, 'samples': 13822464, 'steps': 71991, 'loss/train': 1.8700083494186401} 11/07/2021 07:20:25 - INFO - __main__ - Step 71993: {'lr': 0.0002712437818268958, 'samples': 13822656, 'steps': 71992, 'loss/train': 1.7410151958465576} 11/07/2021 07:20:25 - INFO - __main__ - Step 71994: {'lr': 0.0002712384942718709, 'samples': 13822848, 'steps': 71993, 'loss/train': 1.7882143259048462} 11/07/2021 07:20:26 - INFO - __main__ - Step 71995: {'lr': 0.00027123320670727625, 'samples': 13823040, 'steps': 71994, 'loss/train': 1.0398091077804565} 11/07/2021 07:20:27 - INFO - __main__ - Step 71996: {'lr': 0.0002712279191331142, 'samples': 13823232, 'steps': 71995, 'loss/train': 0.10263266414403915} 11/07/2021 07:20:27 - INFO - __main__ - Step 71997: {'lr': 0.0002712226315493873, 'samples': 13823424, 'steps': 71996, 'loss/train': 1.4143158197402954} 11/07/2021 07:20:27 - INFO - __main__ - Step 71998: {'lr': 0.00027121734395609774, 'samples': 13823616, 'steps': 71997, 'loss/train': 1.695756435394287} 11/07/2021 07:20:28 - INFO - __main__ - Step 71999: {'lr': 0.000271212056353248, 'samples': 13823808, 'steps': 71998, 'loss/train': 1.0205954313278198} 11/07/2021 07:20:29 - INFO - __main__ - Step 72000: {'lr': 0.00027120676874084037, 'samples': 13824000, 'steps': 71999, 'loss/train': 1.3063429594039917} 11/07/2021 07:20:29 - INFO - __main__ - Step 72001: {'lr': 0.0002712014811188773, 'samples': 13824192, 'steps': 72000, 'loss/train': 1.6698278188705444} 11/07/2021 07:20:30 - INFO - __main__ - Step 72002: {'lr': 0.0002711961934873612, 'samples': 13824384, 'steps': 72001, 'loss/train': 0.9344462752342224} 11/07/2021 07:20:30 - INFO - __main__ - Step 72003: {'lr': 0.0002711909058462944, 'samples': 13824576, 'steps': 72002, 'loss/train': 0.3766229450702667} 11/07/2021 07:20:30 - INFO - __main__ - Step 72004: {'lr': 0.00027118561819567934, 'samples': 13824768, 'steps': 72003, 'loss/train': 1.6645859479904175} 11/07/2021 07:20:31 - INFO - __main__ - Step 72005: {'lr': 0.0002711803305355184, 'samples': 13824960, 'steps': 72004, 'loss/train': 1.0579726696014404} 11/07/2021 07:20:32 - INFO - __main__ - Step 72006: {'lr': 0.00027117504286581384, 'samples': 13825152, 'steps': 72005, 'loss/train': 1.0097756385803223} 11/07/2021 07:20:32 - INFO - __main__ - Step 72007: {'lr': 0.0002711697551865682, 'samples': 13825344, 'steps': 72006, 'loss/train': 1.201095700263977} 11/07/2021 07:20:32 - INFO - __main__ - Step 72008: {'lr': 0.00027116446749778377, 'samples': 13825536, 'steps': 72007, 'loss/train': 1.5748845338821411} 11/07/2021 07:20:33 - INFO - __main__ - Step 72009: {'lr': 0.0002711591797994629, 'samples': 13825728, 'steps': 72008, 'loss/train': 1.4089946746826172} 11/07/2021 07:20:34 - INFO - __main__ - Step 72010: {'lr': 0.0002711538920916081, 'samples': 13825920, 'steps': 72009, 'loss/train': 0.7171818017959595} 11/07/2021 07:20:34 - INFO - __main__ - Step 72011: {'lr': 0.00027114860437422165, 'samples': 13826112, 'steps': 72010, 'loss/train': 0.7031287550926208} 11/07/2021 07:20:34 - INFO - __main__ - Step 72012: {'lr': 0.0002711433166473061, 'samples': 13826304, 'steps': 72011, 'loss/train': 1.6219135522842407} 11/07/2021 07:20:35 - INFO - __main__ - Step 72013: {'lr': 0.00027113802891086354, 'samples': 13826496, 'steps': 72012, 'loss/train': 1.7822237014770508} 11/07/2021 07:20:35 - INFO - __main__ - Step 72014: {'lr': 0.00027113274116489654, 'samples': 13826688, 'steps': 72013, 'loss/train': 1.6400471925735474} 11/07/2021 07:20:36 - INFO - __main__ - Step 72015: {'lr': 0.0002711274534094075, 'samples': 13826880, 'steps': 72014, 'loss/train': 1.0571937561035156} 11/07/2021 07:20:37 - INFO - __main__ - Step 72016: {'lr': 0.0002711221656443987, 'samples': 13827072, 'steps': 72015, 'loss/train': 1.697291612625122} 11/07/2021 07:20:37 - INFO - __main__ - Step 72017: {'lr': 0.0002711168778698726, 'samples': 13827264, 'steps': 72016, 'loss/train': 0.11712630838155746} 11/07/2021 07:20:37 - INFO - __main__ - Step 72018: {'lr': 0.0002711115900858316, 'samples': 13827456, 'steps': 72017, 'loss/train': 1.6073005199432373} 11/07/2021 07:20:38 - INFO - __main__ - Step 72019: {'lr': 0.00027110630229227803, 'samples': 13827648, 'steps': 72018, 'loss/train': 1.3891260623931885} 11/07/2021 07:20:39 - INFO - __main__ - Step 72020: {'lr': 0.0002711010144892142, 'samples': 13827840, 'steps': 72019, 'loss/train': 1.71371328830719} 11/07/2021 07:20:40 - INFO - __main__ - Step 72021: {'lr': 0.00027109572667664264, 'samples': 13828032, 'steps': 72020, 'loss/train': 1.5069829225540161} 11/07/2021 07:20:40 - INFO - __main__ - Step 72022: {'lr': 0.0002710904388545656, 'samples': 13828224, 'steps': 72021, 'loss/train': 1.0708646774291992} 11/07/2021 07:20:40 - INFO - __main__ - Step 72023: {'lr': 0.00027108515102298563, 'samples': 13828416, 'steps': 72022, 'loss/train': 0.7747466564178467} 11/07/2021 07:20:41 - INFO - __main__ - Step 72024: {'lr': 0.00027107986318190505, 'samples': 13828608, 'steps': 72023, 'loss/train': 1.3643238544464111} 11/07/2021 07:20:42 - INFO - __main__ - Step 72025: {'lr': 0.0002710745753313262, 'samples': 13828800, 'steps': 72024, 'loss/train': 0.978363037109375} 11/07/2021 07:20:42 - INFO - __main__ - Step 72026: {'lr': 0.00027106928747125137, 'samples': 13828992, 'steps': 72025, 'loss/train': 1.169105052947998} 11/07/2021 07:20:43 - INFO - __main__ - Step 72027: {'lr': 0.0002710639996016831, 'samples': 13829184, 'steps': 72026, 'loss/train': 1.2775607109069824} 11/07/2021 07:20:43 - INFO - __main__ - Step 72028: {'lr': 0.00027105871172262367, 'samples': 13829376, 'steps': 72027, 'loss/train': 1.0204143524169922} 11/07/2021 07:20:43 - INFO - __main__ - Step 72029: {'lr': 0.0002710534238340756, 'samples': 13829568, 'steps': 72028, 'loss/train': 0.9289034605026245} 11/07/2021 07:20:44 - INFO - __main__ - Step 72030: {'lr': 0.0002710481359360411, 'samples': 13829760, 'steps': 72029, 'loss/train': 0.7431296110153198} 11/07/2021 07:20:45 - INFO - __main__ - Step 72031: {'lr': 0.00027104284802852266, 'samples': 13829952, 'steps': 72030, 'loss/train': 0.6950465440750122} 11/07/2021 07:20:45 - INFO - __main__ - Step 72032: {'lr': 0.0002710375601115227, 'samples': 13830144, 'steps': 72031, 'loss/train': 1.0595848560333252} 11/07/2021 07:20:46 - INFO - __main__ - Step 72033: {'lr': 0.00027103227218504343, 'samples': 13830336, 'steps': 72032, 'loss/train': 1.4229867458343506} 11/07/2021 07:20:46 - INFO - __main__ - Step 72034: {'lr': 0.00027102698424908745, 'samples': 13830528, 'steps': 72033, 'loss/train': 0.9515621662139893} 11/07/2021 07:20:46 - INFO - __main__ - Step 72035: {'lr': 0.00027102169630365696, 'samples': 13830720, 'steps': 72034, 'loss/train': 1.219126582145691} 11/07/2021 07:20:47 - INFO - __main__ - Step 72036: {'lr': 0.0002710164083487544, 'samples': 13830912, 'steps': 72035, 'loss/train': 0.9838962554931641} 11/07/2021 07:20:48 - INFO - __main__ - Step 72037: {'lr': 0.0002710111203843823, 'samples': 13831104, 'steps': 72036, 'loss/train': 0.9157952666282654} 11/07/2021 07:20:48 - INFO - __main__ - Step 72038: {'lr': 0.0002710058324105428, 'samples': 13831296, 'steps': 72037, 'loss/train': 1.2705426216125488} 11/07/2021 07:20:48 - INFO - __main__ - Step 72039: {'lr': 0.00027100054442723845, 'samples': 13831488, 'steps': 72038, 'loss/train': 0.8702412843704224} 11/07/2021 07:20:49 - INFO - __main__ - Step 72040: {'lr': 0.00027099525643447153, 'samples': 13831680, 'steps': 72039, 'loss/train': 2.9115352630615234} 11/07/2021 07:20:49 - INFO - __main__ - Step 72041: {'lr': 0.00027098996843224446, 'samples': 13831872, 'steps': 72040, 'loss/train': 1.3667911291122437} 11/07/2021 07:20:50 - INFO - __main__ - Step 72042: {'lr': 0.0002709846804205597, 'samples': 13832064, 'steps': 72041, 'loss/train': 1.264648199081421} 11/07/2021 07:20:50 - INFO - __main__ - Step 72043: {'lr': 0.00027097939239941957, 'samples': 13832256, 'steps': 72042, 'loss/train': 1.5539690256118774} 11/07/2021 07:20:51 - INFO - __main__ - Step 72044: {'lr': 0.00027097410436882635, 'samples': 13832448, 'steps': 72043, 'loss/train': 1.574934720993042} 11/07/2021 07:20:51 - INFO - __main__ - Step 72045: {'lr': 0.00027096881632878263, 'samples': 13832640, 'steps': 72044, 'loss/train': 1.5896806716918945} 11/07/2021 07:20:52 - INFO - __main__ - Step 72046: {'lr': 0.0002709635282792906, 'samples': 13832832, 'steps': 72045, 'loss/train': 1.310776948928833} 11/07/2021 07:20:53 - INFO - __main__ - Step 72047: {'lr': 0.00027095824022035274, 'samples': 13833024, 'steps': 72046, 'loss/train': 1.731630563735962} 11/07/2021 07:20:53 - INFO - __main__ - Step 72048: {'lr': 0.0002709529521519715, 'samples': 13833216, 'steps': 72047, 'loss/train': 1.1781092882156372} 11/07/2021 07:20:53 - INFO - __main__ - Step 72049: {'lr': 0.0002709476640741492, 'samples': 13833408, 'steps': 72048, 'loss/train': 1.9045487642288208} 11/07/2021 07:20:54 - INFO - __main__ - Step 72050: {'lr': 0.0002709423759868881, 'samples': 13833600, 'steps': 72049, 'loss/train': 1.232208013534546} 11/07/2021 07:20:54 - INFO - __main__ - Step 72051: {'lr': 0.0002709370878901907, 'samples': 13833792, 'steps': 72050, 'loss/train': 1.0782406330108643} 11/07/2021 07:20:54 - INFO - __main__ - Step 72052: {'lr': 0.00027093179978405937, 'samples': 13833984, 'steps': 72051, 'loss/train': 1.2037409543991089} 11/07/2021 07:20:55 - INFO - __main__ - Step 72053: {'lr': 0.00027092651166849653, 'samples': 13834176, 'steps': 72052, 'loss/train': 1.3711283206939697} 11/07/2021 07:20:56 - INFO - __main__ - Step 72054: {'lr': 0.0002709212235435046, 'samples': 13834368, 'steps': 72053, 'loss/train': 1.5775240659713745} 11/07/2021 07:20:56 - INFO - __main__ - Step 72055: {'lr': 0.0002709159354090858, 'samples': 13834560, 'steps': 72054, 'loss/train': 1.5940998792648315} 11/07/2021 07:20:56 - INFO - __main__ - Step 72056: {'lr': 0.00027091064726524256, 'samples': 13834752, 'steps': 72055, 'loss/train': 1.3695969581604004} 11/07/2021 07:20:57 - INFO - __main__ - Step 72057: {'lr': 0.00027090535911197735, 'samples': 13834944, 'steps': 72056, 'loss/train': 1.098486304283142} 11/07/2021 07:20:58 - INFO - __main__ - Step 72058: {'lr': 0.0002709000709492925, 'samples': 13835136, 'steps': 72057, 'loss/train': 1.1426507234573364} 11/07/2021 07:20:58 - INFO - __main__ - Step 72059: {'lr': 0.00027089478277719044, 'samples': 13835328, 'steps': 72058, 'loss/train': 0.6672776937484741} 11/07/2021 07:20:58 - INFO - __main__ - Step 72060: {'lr': 0.00027088949459567346, 'samples': 13835520, 'steps': 72059, 'loss/train': 1.1970497369766235} 11/07/2021 07:20:59 - INFO - __main__ - Step 72061: {'lr': 0.00027088420640474404, 'samples': 13835712, 'steps': 72060, 'loss/train': 1.1421318054199219} 11/07/2021 07:20:59 - INFO - __main__ - Step 72062: {'lr': 0.00027087891820440455, 'samples': 13835904, 'steps': 72061, 'loss/train': 1.3115874528884888} 11/07/2021 07:21:00 - INFO - __main__ - Step 72063: {'lr': 0.0002708736299946573, 'samples': 13836096, 'steps': 72062, 'loss/train': 1.6765053272247314} 11/07/2021 07:21:01 - INFO - __main__ - Step 72064: {'lr': 0.0002708683417755046, 'samples': 13836288, 'steps': 72063, 'loss/train': 1.6512682437896729} 11/07/2021 07:21:01 - INFO - __main__ - Step 72065: {'lr': 0.0002708630535469491, 'samples': 13836480, 'steps': 72064, 'loss/train': 0.9572357535362244} 11/07/2021 07:21:01 - INFO - __main__ - Step 72066: {'lr': 0.00027085776530899304, 'samples': 13836672, 'steps': 72065, 'loss/train': 1.3508108854293823} 11/07/2021 07:21:02 - INFO - __main__ - Step 72067: {'lr': 0.0002708524770616387, 'samples': 13836864, 'steps': 72066, 'loss/train': 1.5466021299362183} 11/07/2021 07:21:03 - INFO - __main__ - Step 72068: {'lr': 0.00027084718880488856, 'samples': 13837056, 'steps': 72067, 'loss/train': 0.8806825876235962} 11/07/2021 07:21:03 - INFO - __main__ - Step 72069: {'lr': 0.00027084190053874505, 'samples': 13837248, 'steps': 72068, 'loss/train': 1.2255473136901855} 11/07/2021 07:21:03 - INFO - __main__ - Step 72070: {'lr': 0.0002708366122632105, 'samples': 13837440, 'steps': 72069, 'loss/train': 1.4437963962554932} 11/07/2021 07:21:04 - INFO - __main__ - Step 72071: {'lr': 0.00027083132397828725, 'samples': 13837632, 'steps': 72070, 'loss/train': 1.2410905361175537} 11/07/2021 07:21:04 - INFO - __main__ - Step 72072: {'lr': 0.0002708260356839778, 'samples': 13837824, 'steps': 72071, 'loss/train': 1.0697739124298096} 11/07/2021 07:21:05 - INFO - __main__ - Step 72073: {'lr': 0.0002708207473802844, 'samples': 13838016, 'steps': 72072, 'loss/train': 2.0194103717803955} 11/07/2021 07:21:05 - INFO - __main__ - Step 72074: {'lr': 0.00027081545906720953, 'samples': 13838208, 'steps': 72073, 'loss/train': 0.8481519818305969} 11/07/2021 07:21:06 - INFO - __main__ - Step 72075: {'lr': 0.00027081017074475543, 'samples': 13838400, 'steps': 72074, 'loss/train': 1.2538034915924072} 11/07/2021 07:21:06 - INFO - __main__ - Step 72076: {'lr': 0.00027080488241292466, 'samples': 13838592, 'steps': 72075, 'loss/train': 1.7363513708114624} 11/07/2021 07:21:07 - INFO - __main__ - Step 72077: {'lr': 0.00027079959407171956, 'samples': 13838784, 'steps': 72076, 'loss/train': 1.3203824758529663} 11/07/2021 07:21:08 - INFO - __main__ - Step 72078: {'lr': 0.00027079430572114245, 'samples': 13838976, 'steps': 72077, 'loss/train': 1.845058798789978} 11/07/2021 07:21:08 - INFO - __main__ - Step 72079: {'lr': 0.0002707890173611958, 'samples': 13839168, 'steps': 72078, 'loss/train': 1.7298356294631958} 11/07/2021 07:21:08 - INFO - __main__ - Step 72080: {'lr': 0.0002707837289918819, 'samples': 13839360, 'steps': 72079, 'loss/train': 1.6588529348373413} 11/07/2021 07:21:09 - INFO - __main__ - Step 72081: {'lr': 0.00027077844061320315, 'samples': 13839552, 'steps': 72080, 'loss/train': 1.4516735076904297} 11/07/2021 07:21:09 - INFO - __main__ - Step 72082: {'lr': 0.000270773152225162, 'samples': 13839744, 'steps': 72081, 'loss/train': 1.6789685487747192} 11/07/2021 07:21:09 - INFO - __main__ - Step 72083: {'lr': 0.0002707678638277608, 'samples': 13839936, 'steps': 72082, 'loss/train': 1.2323096990585327} 11/07/2021 07:21:10 - INFO - __main__ - Step 72084: {'lr': 0.0002707625754210018, 'samples': 13840128, 'steps': 72083, 'loss/train': 1.7377467155456543} 11/07/2021 07:21:11 - INFO - __main__ - Step 72085: {'lr': 0.0002707572870048876, 'samples': 13840320, 'steps': 72084, 'loss/train': 1.1149375438690186} 11/07/2021 07:21:11 - INFO - __main__ - Step 72086: {'lr': 0.0002707519985794205, 'samples': 13840512, 'steps': 72085, 'loss/train': 1.2115917205810547} 11/07/2021 07:21:11 - INFO - __main__ - Step 72087: {'lr': 0.0002707467101446029, 'samples': 13840704, 'steps': 72086, 'loss/train': 1.4458470344543457} 11/07/2021 07:21:12 - INFO - __main__ - Step 72088: {'lr': 0.00027074142170043706, 'samples': 13840896, 'steps': 72087, 'loss/train': 1.467828631401062} 11/07/2021 07:21:13 - INFO - __main__ - Step 72089: {'lr': 0.0002707361332469255, 'samples': 13841088, 'steps': 72088, 'loss/train': 1.3602782487869263} 11/07/2021 07:21:13 - INFO - __main__ - Step 72090: {'lr': 0.0002707308447840705, 'samples': 13841280, 'steps': 72089, 'loss/train': 0.6999763250350952} 11/07/2021 07:21:14 - INFO - __main__ - Step 72091: {'lr': 0.0002707255563118746, 'samples': 13841472, 'steps': 72090, 'loss/train': 1.5556395053863525} 11/07/2021 07:21:14 - INFO - __main__ - Step 72092: {'lr': 0.0002707202678303401, 'samples': 13841664, 'steps': 72091, 'loss/train': 1.3719475269317627} 11/07/2021 07:21:14 - INFO - __main__ - Step 72093: {'lr': 0.00027071497933946924, 'samples': 13841856, 'steps': 72092, 'loss/train': 1.456568956375122} 11/07/2021 07:21:16 - INFO - __main__ - Step 72094: {'lr': 0.0002707096908392646, 'samples': 13842048, 'steps': 72093, 'loss/train': 1.3909443616867065} 11/07/2021 07:21:16 - INFO - __main__ - Step 72095: {'lr': 0.0002707044023297285, 'samples': 13842240, 'steps': 72094, 'loss/train': 1.3529574871063232} 11/07/2021 07:21:16 - INFO - __main__ - Step 72096: {'lr': 0.0002706991138108633, 'samples': 13842432, 'steps': 72095, 'loss/train': 1.5070720911026} 11/07/2021 07:21:17 - INFO - __main__ - Step 72097: {'lr': 0.0002706938252826714, 'samples': 13842624, 'steps': 72096, 'loss/train': 1.4220762252807617} 11/07/2021 07:21:17 - INFO - __main__ - Step 72098: {'lr': 0.00027068853674515515, 'samples': 13842816, 'steps': 72097, 'loss/train': 1.2002320289611816} 11/07/2021 07:21:17 - INFO - __main__ - Step 72099: {'lr': 0.00027068324819831707, 'samples': 13843008, 'steps': 72098, 'loss/train': 1.5033400058746338} 11/07/2021 07:21:18 - INFO - __main__ - Step 72100: {'lr': 0.00027067795964215934, 'samples': 13843200, 'steps': 72099, 'loss/train': 1.3463431596755981} 11/07/2021 07:21:19 - INFO - __main__ - Step 72101: {'lr': 0.00027067267107668447, 'samples': 13843392, 'steps': 72100, 'loss/train': 1.6614686250686646} 11/07/2021 07:21:19 - INFO - __main__ - Step 72102: {'lr': 0.00027066738250189484, 'samples': 13843584, 'steps': 72101, 'loss/train': 1.2374134063720703} 11/07/2021 07:21:19 - INFO - __main__ - Step 72103: {'lr': 0.0002706620939177927, 'samples': 13843776, 'steps': 72102, 'loss/train': 1.3809596300125122} 11/07/2021 07:21:20 - INFO - __main__ - Step 72104: {'lr': 0.0002706568053243806, 'samples': 13843968, 'steps': 72103, 'loss/train': 1.784300446510315} 11/07/2021 07:21:21 - INFO - __main__ - Step 72105: {'lr': 0.0002706515167216609, 'samples': 13844160, 'steps': 72104, 'loss/train': 0.9978705048561096} 11/07/2021 07:21:22 - INFO - __main__ - Step 72106: {'lr': 0.000270646228109636, 'samples': 13844352, 'steps': 72105, 'loss/train': 1.5546983480453491} 11/07/2021 07:21:22 - INFO - __main__ - Step 72107: {'lr': 0.00027064093948830816, 'samples': 13844544, 'steps': 72106, 'loss/train': 1.015986442565918} 11/07/2021 07:21:22 - INFO - __main__ - Step 72108: {'lr': 0.0002706356508576798, 'samples': 13844736, 'steps': 72107, 'loss/train': 1.510198712348938} 11/07/2021 07:21:23 - INFO - __main__ - Step 72109: {'lr': 0.00027063036221775335, 'samples': 13844928, 'steps': 72108, 'loss/train': 1.2452939748764038} 11/07/2021 07:21:23 - INFO - __main__ - Step 72110: {'lr': 0.0002706250735685312, 'samples': 13845120, 'steps': 72109, 'loss/train': 1.4328819513320923} 11/07/2021 07:21:24 - INFO - __main__ - Step 72111: {'lr': 0.00027061978491001566, 'samples': 13845312, 'steps': 72110, 'loss/train': 0.22046160697937012} 11/07/2021 07:21:24 - INFO - __main__ - Step 72112: {'lr': 0.0002706144962422092, 'samples': 13845504, 'steps': 72111, 'loss/train': 1.5299347639083862} 11/07/2021 07:21:25 - INFO - __main__ - Step 72113: {'lr': 0.0002706092075651142, 'samples': 13845696, 'steps': 72112, 'loss/train': 1.7320064306259155} 11/07/2021 07:21:25 - INFO - __main__ - Step 72114: {'lr': 0.00027060391887873293, 'samples': 13845888, 'steps': 72113, 'loss/train': 1.4067615270614624} 11/07/2021 07:21:25 - INFO - __main__ - Step 72115: {'lr': 0.00027059863018306793, 'samples': 13846080, 'steps': 72114, 'loss/train': 1.8682799339294434} 11/07/2021 07:21:27 - INFO - __main__ - Step 72116: {'lr': 0.0002705933414781214, 'samples': 13846272, 'steps': 72115, 'loss/train': 1.1047533750534058} 11/07/2021 07:21:27 - INFO - __main__ - Step 72117: {'lr': 0.00027058805276389595, 'samples': 13846464, 'steps': 72116, 'loss/train': 1.4863767623901367} 11/07/2021 07:21:28 - INFO - __main__ - Step 72118: {'lr': 0.0002705827640403938, 'samples': 13846656, 'steps': 72117, 'loss/train': 2.074476718902588} 11/07/2021 07:21:28 - INFO - __main__ - Step 72119: {'lr': 0.0002705774753076174, 'samples': 13846848, 'steps': 72118, 'loss/train': 1.1297250986099243} 11/07/2021 07:21:28 - INFO - __main__ - Step 72120: {'lr': 0.00027057218656556905, 'samples': 13847040, 'steps': 72119, 'loss/train': 0.5183890461921692} 11/07/2021 07:21:29 - INFO - __main__ - Step 72121: {'lr': 0.0002705668978142512, 'samples': 13847232, 'steps': 72120, 'loss/train': 1.341989517211914} 11/07/2021 07:21:30 - INFO - __main__ - Step 72122: {'lr': 0.0002705616090536662, 'samples': 13847424, 'steps': 72121, 'loss/train': 1.2694873809814453} 11/07/2021 07:21:30 - INFO - __main__ - Step 72123: {'lr': 0.0002705563202838165, 'samples': 13847616, 'steps': 72122, 'loss/train': 0.9960101246833801} 11/07/2021 07:21:30 - INFO - __main__ - Step 72124: {'lr': 0.0002705510315047044, 'samples': 13847808, 'steps': 72123, 'loss/train': 1.7626956701278687} 11/07/2021 07:21:31 - INFO - __main__ - Step 72125: {'lr': 0.00027054574271633236, 'samples': 13848000, 'steps': 72124, 'loss/train': 1.7393853664398193} 11/07/2021 07:21:31 - INFO - __main__ - Step 72126: {'lr': 0.00027054045391870275, 'samples': 13848192, 'steps': 72125, 'loss/train': 1.6162110567092896} 11/07/2021 07:21:32 - INFO - __main__ - Step 72127: {'lr': 0.0002705351651118179, 'samples': 13848384, 'steps': 72126, 'loss/train': 1.5160715579986572} 11/07/2021 07:21:32 - INFO - __main__ - Step 72128: {'lr': 0.0002705298762956802, 'samples': 13848576, 'steps': 72127, 'loss/train': 1.2557923793792725} 11/07/2021 07:21:33 - INFO - __main__ - Step 72129: {'lr': 0.00027052458747029204, 'samples': 13848768, 'steps': 72128, 'loss/train': 1.1961584091186523} 11/07/2021 07:21:33 - INFO - __main__ - Step 72130: {'lr': 0.0002705192986356559, 'samples': 13848960, 'steps': 72129, 'loss/train': 1.5448700189590454} 11/07/2021 07:21:34 - INFO - __main__ - Step 72131: {'lr': 0.00027051400979177396, 'samples': 13849152, 'steps': 72130, 'loss/train': 1.6604419946670532} 11/07/2021 07:21:35 - INFO - __main__ - Step 72132: {'lr': 0.0002705087209386488, 'samples': 13849344, 'steps': 72131, 'loss/train': 1.244025707244873} 11/07/2021 07:21:35 - INFO - __main__ - Step 72133: {'lr': 0.0002705034320762828, 'samples': 13849536, 'steps': 72132, 'loss/train': 1.168668508529663} 11/07/2021 07:21:35 - INFO - __main__ - Step 72134: {'lr': 0.0002704981432046782, 'samples': 13849728, 'steps': 72133, 'loss/train': 1.4527937173843384} 11/07/2021 07:21:36 - INFO - __main__ - Step 72135: {'lr': 0.0002704928543238374, 'samples': 13849920, 'steps': 72134, 'loss/train': 1.5947238206863403} 11/07/2021 07:21:36 - INFO - __main__ - Step 72136: {'lr': 0.0002704875654337629, 'samples': 13850112, 'steps': 72135, 'loss/train': 1.1586410999298096} 11/07/2021 07:21:37 - INFO - __main__ - Step 72137: {'lr': 0.00027048227653445696, 'samples': 13850304, 'steps': 72136, 'loss/train': 1.412919044494629} 11/07/2021 07:21:38 - INFO - __main__ - Step 72138: {'lr': 0.00027047698762592203, 'samples': 13850496, 'steps': 72137, 'loss/train': 1.2450486421585083} 11/07/2021 07:21:38 - INFO - __main__ - Step 72139: {'lr': 0.00027047169870816055, 'samples': 13850688, 'steps': 72138, 'loss/train': 1.500587821006775} 11/07/2021 07:21:38 - INFO - __main__ - Step 72140: {'lr': 0.0002704664097811749, 'samples': 13850880, 'steps': 72139, 'loss/train': 0.4547742009162903} 11/07/2021 07:21:39 - INFO - __main__ - Step 72141: {'lr': 0.0002704611208449673, 'samples': 13851072, 'steps': 72140, 'loss/train': 1.4261140823364258} 11/07/2021 07:21:40 - INFO - __main__ - Step 72142: {'lr': 0.00027045583189954015, 'samples': 13851264, 'steps': 72141, 'loss/train': 1.1054242849349976} 11/07/2021 07:21:40 - INFO - __main__ - Step 72143: {'lr': 0.000270450542944896, 'samples': 13851456, 'steps': 72142, 'loss/train': 1.6600079536437988} 11/07/2021 07:21:40 - INFO - __main__ - Step 72144: {'lr': 0.0002704452539810372, 'samples': 13851648, 'steps': 72143, 'loss/train': 1.3017090559005737} 11/07/2021 07:21:41 - INFO - __main__ - Step 72145: {'lr': 0.00027043996500796604, 'samples': 13851840, 'steps': 72144, 'loss/train': 1.5457395315170288} 11/07/2021 07:21:41 - INFO - __main__ - Step 72146: {'lr': 0.00027043467602568493, 'samples': 13852032, 'steps': 72145, 'loss/train': 1.523869276046753} 11/07/2021 07:21:41 - INFO - __main__ - Step 72147: {'lr': 0.00027042938703419634, 'samples': 13852224, 'steps': 72146, 'loss/train': 1.7593357563018799} 11/07/2021 07:21:43 - INFO - __main__ - Step 72148: {'lr': 0.00027042409803350255, 'samples': 13852416, 'steps': 72147, 'loss/train': 1.316400408744812} 11/07/2021 07:21:43 - INFO - __main__ - Step 72149: {'lr': 0.00027041880902360595, 'samples': 13852608, 'steps': 72148, 'loss/train': 0.10492496937513351} 11/07/2021 07:21:44 - INFO - __main__ - Step 72150: {'lr': 0.0002704135200045089, 'samples': 13852800, 'steps': 72149, 'loss/train': 0.17814497649669647} 11/07/2021 07:21:44 - INFO - __main__ - Step 72151: {'lr': 0.00027040823097621393, 'samples': 13852992, 'steps': 72150, 'loss/train': 0.8031061291694641} 11/07/2021 07:21:44 - INFO - __main__ - Step 72152: {'lr': 0.0002704029419387233, 'samples': 13853184, 'steps': 72151, 'loss/train': 1.2044445276260376} 11/07/2021 07:21:45 - INFO - __main__ - Step 72153: {'lr': 0.00027039765289203944, 'samples': 13853376, 'steps': 72152, 'loss/train': 0.9950340390205383} 11/07/2021 07:21:46 - INFO - __main__ - Step 72154: {'lr': 0.00027039236383616464, 'samples': 13853568, 'steps': 72153, 'loss/train': 1.2485169172286987} 11/07/2021 07:21:46 - INFO - __main__ - Step 72155: {'lr': 0.0002703870747711014, 'samples': 13853760, 'steps': 72154, 'loss/train': 1.384201169013977} 11/07/2021 07:21:47 - INFO - __main__ - Step 72156: {'lr': 0.0002703817856968521, 'samples': 13853952, 'steps': 72155, 'loss/train': 1.0052708387374878} 11/07/2021 07:21:47 - INFO - __main__ - Step 72157: {'lr': 0.00027037649661341897, 'samples': 13854144, 'steps': 72156, 'loss/train': 1.256922721862793} 11/07/2021 07:21:48 - INFO - __main__ - Step 72158: {'lr': 0.00027037120752080457, 'samples': 13854336, 'steps': 72157, 'loss/train': 0.2543972432613373} 11/07/2021 07:21:48 - INFO - __main__ - Step 72159: {'lr': 0.0002703659184190112, 'samples': 13854528, 'steps': 72158, 'loss/train': 0.3338029980659485} 11/07/2021 07:21:49 - INFO - __main__ - Step 72160: {'lr': 0.0002703606293080413, 'samples': 13854720, 'steps': 72159, 'loss/train': 1.585300087928772} 11/07/2021 07:21:49 - INFO - __main__ - Step 72161: {'lr': 0.00027035534018789723, 'samples': 13854912, 'steps': 72160, 'loss/train': 1.7478266954421997} 11/07/2021 07:21:49 - INFO - __main__ - Step 72162: {'lr': 0.0002703500510585813, 'samples': 13855104, 'steps': 72161, 'loss/train': 1.1912503242492676} 11/07/2021 07:21:50 - INFO - __main__ - Step 72163: {'lr': 0.00027034476192009597, 'samples': 13855296, 'steps': 72162, 'loss/train': 1.2496072053909302} 11/07/2021 07:21:51 - INFO - __main__ - Step 72164: {'lr': 0.0002703394727724436, 'samples': 13855488, 'steps': 72163, 'loss/train': 1.4669476747512817} 11/07/2021 07:21:51 - INFO - __main__ - Step 72165: {'lr': 0.0002703341836156266, 'samples': 13855680, 'steps': 72164, 'loss/train': 1.3557006120681763} 11/07/2021 07:21:51 - INFO - __main__ - Step 72166: {'lr': 0.00027032889444964734, 'samples': 13855872, 'steps': 72165, 'loss/train': 1.4929249286651611} 11/07/2021 07:21:52 - INFO - __main__ - Step 72167: {'lr': 0.0002703236052745082, 'samples': 13856064, 'steps': 72166, 'loss/train': 1.3429430723190308} 11/07/2021 07:21:52 - INFO - __main__ - Step 72168: {'lr': 0.00027031831609021154, 'samples': 13856256, 'steps': 72167, 'loss/train': 1.8390934467315674} 11/07/2021 07:21:53 - INFO - __main__ - Step 72169: {'lr': 0.00027031302689675967, 'samples': 13856448, 'steps': 72168, 'loss/train': 1.4961583614349365} 11/07/2021 07:21:53 - INFO - __main__ - Step 72170: {'lr': 0.0002703077376941552, 'samples': 13856640, 'steps': 72169, 'loss/train': 1.4084911346435547} 11/07/2021 07:21:54 - INFO - __main__ - Step 72171: {'lr': 0.00027030244848240024, 'samples': 13856832, 'steps': 72170, 'loss/train': 1.4145528078079224} 11/07/2021 07:21:54 - INFO - __main__ - Step 72172: {'lr': 0.0002702971592614975, 'samples': 13857024, 'steps': 72171, 'loss/train': 1.236210823059082} 11/07/2021 07:21:54 - INFO - __main__ - Step 72173: {'lr': 0.00027029187003144904, 'samples': 13857216, 'steps': 72172, 'loss/train': 1.4762929677963257} 11/07/2021 07:21:56 - INFO - __main__ - Step 72174: {'lr': 0.0002702865807922574, 'samples': 13857408, 'steps': 72173, 'loss/train': 1.069431185722351} 11/07/2021 07:21:56 - INFO - __main__ - Step 72175: {'lr': 0.0002702812915439249, 'samples': 13857600, 'steps': 72174, 'loss/train': 1.3994063138961792} 11/07/2021 07:21:56 - INFO - __main__ - Step 72176: {'lr': 0.00027027600228645397, 'samples': 13857792, 'steps': 72175, 'loss/train': 2.023360252380371} 11/07/2021 07:21:57 - INFO - __main__ - Step 72177: {'lr': 0.00027027071301984713, 'samples': 13857984, 'steps': 72176, 'loss/train': 1.4612997770309448} 11/07/2021 07:21:57 - INFO - __main__ - Step 72178: {'lr': 0.00027026542374410643, 'samples': 13858176, 'steps': 72177, 'loss/train': 1.209877610206604} 11/07/2021 07:21:59 - INFO - __main__ - Step 72179: {'lr': 0.0002702601344592346, 'samples': 13858368, 'steps': 72178, 'loss/train': 1.213608741760254} 11/07/2021 07:21:59 - INFO - __main__ - Step 72180: {'lr': 0.00027025484516523374, 'samples': 13858560, 'steps': 72179, 'loss/train': 1.9778211116790771} 11/07/2021 07:22:00 - INFO - __main__ - Step 72181: {'lr': 0.0002702495558621064, 'samples': 13858752, 'steps': 72180, 'loss/train': 1.3351545333862305} 11/07/2021 07:22:00 - INFO - __main__ - Step 72182: {'lr': 0.0002702442665498549, 'samples': 13858944, 'steps': 72181, 'loss/train': 1.7070854902267456} 11/07/2021 07:22:00 - INFO - __main__ - Step 72183: {'lr': 0.00027023897722848174, 'samples': 13859136, 'steps': 72182, 'loss/train': 1.347947359085083} 11/07/2021 07:22:01 - INFO - __main__ - Step 72184: {'lr': 0.00027023368789798915, 'samples': 13859328, 'steps': 72183, 'loss/train': 2.833819627761841} 11/07/2021 07:22:01 - INFO - __main__ - Step 72185: {'lr': 0.00027022839855837957, 'samples': 13859520, 'steps': 72184, 'loss/train': 2.885972261428833} 11/07/2021 07:22:02 - INFO - __main__ - Step 72186: {'lr': 0.00027022310920965536, 'samples': 13859712, 'steps': 72185, 'loss/train': 2.680798053741455} 11/07/2021 07:22:02 - INFO - __main__ - Step 72187: {'lr': 0.0002702178198518189, 'samples': 13859904, 'steps': 72186, 'loss/train': 1.3175063133239746} 11/07/2021 07:22:03 - INFO - __main__ - Step 72188: {'lr': 0.0002702125304848727, 'samples': 13860096, 'steps': 72187, 'loss/train': 1.3798638582229614} 11/07/2021 07:22:03 - INFO - __main__ - Step 72189: {'lr': 0.000270207241108819, 'samples': 13860288, 'steps': 72188, 'loss/train': 1.498036503791809} 11/07/2021 07:22:03 - INFO - __main__ - Step 72190: {'lr': 0.00027020195172366025, 'samples': 13860480, 'steps': 72189, 'loss/train': 1.4644534587860107} 11/07/2021 07:22:04 - INFO - __main__ - Step 72191: {'lr': 0.0002701966623293988, 'samples': 13860672, 'steps': 72190, 'loss/train': 0.8097176551818848} 11/07/2021 07:22:05 - INFO - __main__ - Step 72192: {'lr': 0.00027019137292603703, 'samples': 13860864, 'steps': 72191, 'loss/train': 2.74067759513855} 11/07/2021 07:22:05 - INFO - __main__ - Step 72193: {'lr': 0.0002701860835135773, 'samples': 13861056, 'steps': 72192, 'loss/train': 1.4417890310287476} 11/07/2021 07:22:05 - INFO - __main__ - Step 72194: {'lr': 0.00027018079409202214, 'samples': 13861248, 'steps': 72193, 'loss/train': 1.0681679248809814} 11/07/2021 07:22:06 - INFO - __main__ - Step 72195: {'lr': 0.0002701755046613738, 'samples': 13861440, 'steps': 72194, 'loss/train': 1.4805039167404175} 11/07/2021 07:22:07 - INFO - __main__ - Step 72196: {'lr': 0.0002701702152216346, 'samples': 13861632, 'steps': 72195, 'loss/train': 1.3238468170166016} 11/07/2021 07:22:07 - INFO - __main__ - Step 72197: {'lr': 0.00027016492577280703, 'samples': 13861824, 'steps': 72196, 'loss/train': 1.3519846200942993} 11/07/2021 07:22:08 - INFO - __main__ - Step 72198: {'lr': 0.0002701596363148935, 'samples': 13862016, 'steps': 72197, 'loss/train': 1.3112977743148804} 11/07/2021 07:22:08 - INFO - __main__ - Step 72199: {'lr': 0.0002701543468478963, 'samples': 13862208, 'steps': 72198, 'loss/train': 1.029281497001648} 11/07/2021 07:22:08 - INFO - __main__ - Step 72200: {'lr': 0.000270149057371818, 'samples': 13862400, 'steps': 72199, 'loss/train': 1.7066352367401123} 11/07/2021 07:22:09 - INFO - __main__ - Step 72201: {'lr': 0.0002701437678866607, 'samples': 13862592, 'steps': 72200, 'loss/train': 1.5226260423660278} 11/07/2021 07:22:10 - INFO - __main__ - Step 72202: {'lr': 0.000270138478392427, 'samples': 13862784, 'steps': 72201, 'loss/train': 1.2786506414413452} 11/07/2021 07:22:10 - INFO - __main__ - Step 72203: {'lr': 0.0002701331888891191, 'samples': 13862976, 'steps': 72202, 'loss/train': 1.3493112325668335} 11/07/2021 07:22:11 - INFO - __main__ - Step 72204: {'lr': 0.00027012789937673964, 'samples': 13863168, 'steps': 72203, 'loss/train': 1.1683872938156128} 11/07/2021 07:22:11 - INFO - __main__ - Step 72205: {'lr': 0.0002701226098552908, 'samples': 13863360, 'steps': 72204, 'loss/train': 1.4704241752624512} 11/07/2021 07:22:11 - INFO - __main__ - Step 72206: {'lr': 0.000270117320324775, 'samples': 13863552, 'steps': 72205, 'loss/train': 1.6369669437408447} 11/07/2021 07:22:12 - INFO - __main__ - Step 72207: {'lr': 0.00027011203078519474, 'samples': 13863744, 'steps': 72206, 'loss/train': 1.0497584342956543} 11/07/2021 07:22:13 - INFO - __main__ - Step 72208: {'lr': 0.0002701067412365522, 'samples': 13863936, 'steps': 72207, 'loss/train': 1.4905699491500854} 11/07/2021 07:22:13 - INFO - __main__ - Step 72209: {'lr': 0.00027010145167884994, 'samples': 13864128, 'steps': 72208, 'loss/train': 1.467607021331787} 11/07/2021 07:22:13 - INFO - __main__ - Step 72210: {'lr': 0.0002700961621120902, 'samples': 13864320, 'steps': 72209, 'loss/train': 1.1523956060409546} 11/07/2021 07:22:14 - INFO - __main__ - Step 72211: {'lr': 0.0002700908725362755, 'samples': 13864512, 'steps': 72210, 'loss/train': 1.4060901403427124} 11/07/2021 07:22:14 - INFO - __main__ - Step 72212: {'lr': 0.00027008558295140816, 'samples': 13864704, 'steps': 72211, 'loss/train': 1.386639952659607} 11/07/2021 07:22:15 - INFO - __main__ - Step 72213: {'lr': 0.00027008029335749055, 'samples': 13864896, 'steps': 72212, 'loss/train': 1.4491735696792603} 11/07/2021 07:22:16 - INFO - __main__ - Step 72214: {'lr': 0.0002700750037545251, 'samples': 13865088, 'steps': 72213, 'loss/train': 1.4171723127365112} 11/07/2021 07:22:16 - INFO - __main__ - Step 72215: {'lr': 0.0002700697141425141, 'samples': 13865280, 'steps': 72214, 'loss/train': 1.3536121845245361} 11/07/2021 07:22:16 - INFO - __main__ - Step 72216: {'lr': 0.00027006442452146007, 'samples': 13865472, 'steps': 72215, 'loss/train': 0.7479320168495178} 11/07/2021 07:22:17 - INFO - __main__ - Step 72217: {'lr': 0.0002700591348913653, 'samples': 13865664, 'steps': 72216, 'loss/train': 1.1631628274917603} 11/07/2021 07:22:18 - INFO - __main__ - Step 72218: {'lr': 0.00027005384525223216, 'samples': 13865856, 'steps': 72217, 'loss/train': 0.9014812111854553} 11/07/2021 07:22:18 - INFO - __main__ - Step 72219: {'lr': 0.00027004855560406303, 'samples': 13866048, 'steps': 72218, 'loss/train': 1.6654959917068481} 11/07/2021 07:22:18 - INFO - __main__ - Step 72220: {'lr': 0.0002700432659468605, 'samples': 13866240, 'steps': 72219, 'loss/train': 1.278026819229126} 11/07/2021 07:22:19 - INFO - __main__ - Step 72221: {'lr': 0.00027003797628062664, 'samples': 13866432, 'steps': 72220, 'loss/train': 1.5085651874542236} 11/07/2021 07:22:19 - INFO - __main__ - Step 72222: {'lr': 0.000270032686605364, 'samples': 13866624, 'steps': 72221, 'loss/train': 1.1973530054092407} 11/07/2021 07:22:20 - INFO - __main__ - Step 72223: {'lr': 0.00027002739692107494, 'samples': 13866816, 'steps': 72222, 'loss/train': 1.498713493347168} 11/07/2021 07:22:21 - INFO - __main__ - Step 72224: {'lr': 0.00027002210722776185, 'samples': 13867008, 'steps': 72223, 'loss/train': 1.6383988857269287} 11/07/2021 07:22:21 - INFO - __main__ - Step 72225: {'lr': 0.0002700168175254271, 'samples': 13867200, 'steps': 72224, 'loss/train': 1.3633500337600708} 11/07/2021 07:22:21 - INFO - __main__ - Step 72226: {'lr': 0.00027001152781407306, 'samples': 13867392, 'steps': 72225, 'loss/train': 1.3233537673950195} 11/07/2021 07:22:22 - INFO - __main__ - Step 72227: {'lr': 0.00027000623809370224, 'samples': 13867584, 'steps': 72226, 'loss/train': 1.39150071144104} 11/07/2021 07:22:22 - INFO - __main__ - Step 72228: {'lr': 0.0002700009483643168, 'samples': 13867776, 'steps': 72227, 'loss/train': 1.149099588394165} 11/07/2021 07:22:23 - INFO - __main__ - Step 72229: {'lr': 0.0002699956586259193, 'samples': 13867968, 'steps': 72228, 'loss/train': 1.225111484527588} 11/07/2021 07:22:23 - INFO - __main__ - Step 72230: {'lr': 0.000269990368878512, 'samples': 13868160, 'steps': 72229, 'loss/train': 1.2601072788238525} 11/07/2021 07:22:24 - INFO - __main__ - Step 72231: {'lr': 0.0002699850791220974, 'samples': 13868352, 'steps': 72230, 'loss/train': 1.5583046674728394} 11/07/2021 07:22:24 - INFO - __main__ - Step 72232: {'lr': 0.00026997978935667784, 'samples': 13868544, 'steps': 72231, 'loss/train': 1.6482436656951904} 11/07/2021 07:22:24 - INFO - __main__ - Step 72233: {'lr': 0.0002699744995822557, 'samples': 13868736, 'steps': 72232, 'loss/train': 1.3306657075881958} 11/07/2021 07:22:25 - INFO - __main__ - Step 72234: {'lr': 0.00026996920979883337, 'samples': 13868928, 'steps': 72233, 'loss/train': 1.2129253149032593} 11/07/2021 07:22:26 - INFO - __main__ - Step 72235: {'lr': 0.0002699639200064132, 'samples': 13869120, 'steps': 72234, 'loss/train': 1.3157485723495483} 11/07/2021 07:22:26 - INFO - __main__ - Step 72236: {'lr': 0.00026995863020499755, 'samples': 13869312, 'steps': 72235, 'loss/train': 1.288037657737732} 11/07/2021 07:22:26 - INFO - __main__ - Step 72237: {'lr': 0.0002699533403945889, 'samples': 13869504, 'steps': 72236, 'loss/train': 1.3290889263153076} 11/07/2021 07:22:27 - INFO - __main__ - Step 72238: {'lr': 0.00026994805057518954, 'samples': 13869696, 'steps': 72237, 'loss/train': 1.2798678874969482} 11/07/2021 07:22:28 - INFO - __main__ - Step 72239: {'lr': 0.00026994276074680194, 'samples': 13869888, 'steps': 72238, 'loss/train': 1.4350699186325073} 11/07/2021 07:22:28 - INFO - __main__ - Step 72240: {'lr': 0.0002699374709094285, 'samples': 13870080, 'steps': 72239, 'loss/train': 1.4022358655929565} 11/07/2021 07:22:29 - INFO - __main__ - Step 72241: {'lr': 0.00026993218106307145, 'samples': 13870272, 'steps': 72240, 'loss/train': 1.1422368288040161} 11/07/2021 07:22:29 - INFO - __main__ - Step 72242: {'lr': 0.0002699268912077333, 'samples': 13870464, 'steps': 72241, 'loss/train': 1.50218665599823} 11/07/2021 07:22:29 - INFO - __main__ - Step 72243: {'lr': 0.00026992160134341637, 'samples': 13870656, 'steps': 72242, 'loss/train': 1.4683254957199097} 11/07/2021 07:22:30 - INFO - __main__ - Step 72244: {'lr': 0.00026991631147012306, 'samples': 13870848, 'steps': 72243, 'loss/train': 1.0805302858352661} 11/07/2021 07:22:31 - INFO - __main__ - Step 72245: {'lr': 0.0002699110215878558, 'samples': 13871040, 'steps': 72244, 'loss/train': 1.3586477041244507} 11/07/2021 07:22:31 - INFO - __main__ - Step 72246: {'lr': 0.00026990573169661695, 'samples': 13871232, 'steps': 72245, 'loss/train': 1.6439236402511597} 11/07/2021 07:22:31 - INFO - __main__ - Step 72247: {'lr': 0.0002699004417964089, 'samples': 13871424, 'steps': 72246, 'loss/train': 1.189723253250122} 11/07/2021 07:22:32 - INFO - __main__ - Step 72248: {'lr': 0.000269895151887234, 'samples': 13871616, 'steps': 72247, 'loss/train': 2.0430705547332764} 11/07/2021 07:22:33 - INFO - __main__ - Step 72249: {'lr': 0.00026988986196909467, 'samples': 13871808, 'steps': 72248, 'loss/train': 1.6081311702728271} 11/07/2021 07:22:33 - INFO - __main__ - Step 72250: {'lr': 0.0002698845720419932, 'samples': 13872000, 'steps': 72249, 'loss/train': 1.426196813583374} 11/07/2021 07:22:33 - INFO - __main__ - Step 72251: {'lr': 0.0002698792821059321, 'samples': 13872192, 'steps': 72250, 'loss/train': 1.1646143198013306} 11/07/2021 07:22:34 - INFO - __main__ - Step 72252: {'lr': 0.00026987399216091366, 'samples': 13872384, 'steps': 72251, 'loss/train': 1.5544908046722412} 11/07/2021 07:22:34 - INFO - __main__ - Step 72253: {'lr': 0.00026986870220694037, 'samples': 13872576, 'steps': 72252, 'loss/train': 1.489424228668213} 11/07/2021 07:22:34 - INFO - __main__ - Step 72254: {'lr': 0.00026986341224401455, 'samples': 13872768, 'steps': 72253, 'loss/train': 1.876319408416748} 11/07/2021 07:22:36 - INFO - __main__ - Step 72255: {'lr': 0.0002698581222721386, 'samples': 13872960, 'steps': 72254, 'loss/train': 1.25009286403656} 11/07/2021 07:22:36 - INFO - __main__ - Step 72256: {'lr': 0.0002698528322913148, 'samples': 13873152, 'steps': 72255, 'loss/train': 1.2641199827194214} 11/07/2021 07:22:36 - INFO - __main__ - Step 72257: {'lr': 0.00026984754230154566, 'samples': 13873344, 'steps': 72256, 'loss/train': 1.602243185043335} 11/07/2021 07:22:37 - INFO - __main__ - Step 72258: {'lr': 0.00026984225230283353, 'samples': 13873536, 'steps': 72257, 'loss/train': 1.9140081405639648} 11/07/2021 07:22:37 - INFO - __main__ - Step 72259: {'lr': 0.0002698369622951808, 'samples': 13873728, 'steps': 72258, 'loss/train': 1.7841241359710693} 11/07/2021 07:22:38 - INFO - __main__ - Step 72260: {'lr': 0.00026983167227858984, 'samples': 13873920, 'steps': 72259, 'loss/train': 1.0638529062271118} 11/07/2021 07:22:38 - INFO - __main__ - Step 72261: {'lr': 0.00026982638225306305, 'samples': 13874112, 'steps': 72260, 'loss/train': 1.3902723789215088} 11/07/2021 07:22:39 - INFO - __main__ - Step 72262: {'lr': 0.0002698210922186027, 'samples': 13874304, 'steps': 72261, 'loss/train': 1.2668262720108032} 11/07/2021 07:22:39 - INFO - __main__ - Step 72263: {'lr': 0.0002698158021752114, 'samples': 13874496, 'steps': 72262, 'loss/train': 0.5620172619819641} 11/07/2021 07:22:39 - INFO - __main__ - Step 72264: {'lr': 0.00026981051212289134, 'samples': 13874688, 'steps': 72263, 'loss/train': 1.1457308530807495} 11/07/2021 07:22:40 - INFO - __main__ - Step 72265: {'lr': 0.0002698052220616449, 'samples': 13874880, 'steps': 72264, 'loss/train': 1.6168158054351807} 11/07/2021 07:22:41 - INFO - __main__ - Step 72266: {'lr': 0.0002697999319914747, 'samples': 13875072, 'steps': 72265, 'loss/train': 1.6946322917938232} 11/07/2021 07:22:41 - INFO - __main__ - Step 72267: {'lr': 0.0002697946419123829, 'samples': 13875264, 'steps': 72266, 'loss/train': 0.9080970287322998} 11/07/2021 07:22:41 - INFO - __main__ - Step 72268: {'lr': 0.00026978935182437187, 'samples': 13875456, 'steps': 72267, 'loss/train': 0.6235992908477783} 11/07/2021 07:22:42 - INFO - __main__ - Step 72269: {'lr': 0.0002697840617274441, 'samples': 13875648, 'steps': 72268, 'loss/train': 1.155474305152893} 11/07/2021 07:22:43 - INFO - __main__ - Step 72270: {'lr': 0.00026977877162160193, 'samples': 13875840, 'steps': 72269, 'loss/train': 1.3150174617767334} 11/07/2021 07:22:43 - INFO - __main__ - Step 72271: {'lr': 0.0002697734815068477, 'samples': 13876032, 'steps': 72270, 'loss/train': 1.381999135017395} 11/07/2021 07:22:44 - INFO - __main__ - Step 72272: {'lr': 0.0002697681913831839, 'samples': 13876224, 'steps': 72271, 'loss/train': 1.5717498064041138} 11/07/2021 07:22:44 - INFO - __main__ - Step 72273: {'lr': 0.00026976290125061287, 'samples': 13876416, 'steps': 72272, 'loss/train': 1.4607657194137573} 11/07/2021 07:22:44 - INFO - __main__ - Step 72274: {'lr': 0.00026975761110913706, 'samples': 13876608, 'steps': 72273, 'loss/train': 1.4616016149520874} 11/07/2021 07:22:45 - INFO - __main__ - Step 72275: {'lr': 0.00026975232095875865, 'samples': 13876800, 'steps': 72274, 'loss/train': 1.323350191116333} 11/07/2021 07:22:46 - INFO - __main__ - Step 72276: {'lr': 0.00026974703079948013, 'samples': 13876992, 'steps': 72275, 'loss/train': 1.2456293106079102} 11/07/2021 07:22:46 - INFO - __main__ - Step 72277: {'lr': 0.00026974174063130394, 'samples': 13877184, 'steps': 72276, 'loss/train': 0.9992100596427917} 11/07/2021 07:22:46 - INFO - __main__ - Step 72278: {'lr': 0.00026973645045423253, 'samples': 13877376, 'steps': 72277, 'loss/train': 1.1028269529342651} 11/07/2021 07:22:47 - INFO - __main__ - Step 72279: {'lr': 0.00026973116026826805, 'samples': 13877568, 'steps': 72278, 'loss/train': 1.2763590812683105} 11/07/2021 07:22:48 - INFO - __main__ - Step 72280: {'lr': 0.000269725870073413, 'samples': 13877760, 'steps': 72279, 'loss/train': 1.5645804405212402} 11/07/2021 07:22:48 - INFO - __main__ - Step 72281: {'lr': 0.0002697205798696699, 'samples': 13877952, 'steps': 72280, 'loss/train': 1.5263464450836182} 11/07/2021 07:22:48 - INFO - __main__ - Step 72282: {'lr': 0.00026971528965704094, 'samples': 13878144, 'steps': 72281, 'loss/train': 0.6921829581260681} 11/07/2021 07:22:49 - INFO - __main__ - Step 72283: {'lr': 0.0002697099994355286, 'samples': 13878336, 'steps': 72282, 'loss/train': 0.5675265789031982} 11/07/2021 07:22:49 - INFO - __main__ - Step 72284: {'lr': 0.00026970470920513516, 'samples': 13878528, 'steps': 72283, 'loss/train': 0.9393694400787354} 11/07/2021 07:22:49 - INFO - __main__ - Step 72285: {'lr': 0.0002696994189658632, 'samples': 13878720, 'steps': 72284, 'loss/train': 1.5155227184295654} 11/07/2021 07:22:51 - INFO - __main__ - Step 72286: {'lr': 0.0002696941287177149, 'samples': 13878912, 'steps': 72285, 'loss/train': 0.9142941236495972} 11/07/2021 07:22:51 - INFO - __main__ - Step 72287: {'lr': 0.0002696888384606928, 'samples': 13879104, 'steps': 72286, 'loss/train': 1.279946208000183} 11/07/2021 07:22:51 - INFO - __main__ - Step 72288: {'lr': 0.0002696835481947992, 'samples': 13879296, 'steps': 72287, 'loss/train': 1.4349559545516968} 11/07/2021 07:22:52 - INFO - __main__ - Step 72289: {'lr': 0.00026967825792003643, 'samples': 13879488, 'steps': 72288, 'loss/train': 1.835464358329773} 11/07/2021 07:22:52 - INFO - __main__ - Step 72290: {'lr': 0.00026967296763640697, 'samples': 13879680, 'steps': 72289, 'loss/train': 1.184268832206726} 11/07/2021 07:22:53 - INFO - __main__ - Step 72291: {'lr': 0.0002696676773439132, 'samples': 13879872, 'steps': 72290, 'loss/train': 1.5858168601989746} 11/07/2021 07:22:53 - INFO - __main__ - Step 72292: {'lr': 0.0002696623870425574, 'samples': 13880064, 'steps': 72291, 'loss/train': 1.280076503753662} 11/07/2021 07:22:54 - INFO - __main__ - Step 72293: {'lr': 0.00026965709673234205, 'samples': 13880256, 'steps': 72292, 'loss/train': 1.4942796230316162} 11/07/2021 07:22:54 - INFO - __main__ - Step 72294: {'lr': 0.00026965180641326964, 'samples': 13880448, 'steps': 72293, 'loss/train': 1.6109766960144043} 11/07/2021 07:22:54 - INFO - __main__ - Step 72295: {'lr': 0.00026964651608534233, 'samples': 13880640, 'steps': 72294, 'loss/train': 1.5349963903427124} 11/07/2021 07:22:56 - INFO - __main__ - Step 72296: {'lr': 0.00026964122574856263, 'samples': 13880832, 'steps': 72295, 'loss/train': 1.430383563041687} 11/07/2021 07:22:56 - INFO - __main__ - Step 72297: {'lr': 0.00026963593540293285, 'samples': 13881024, 'steps': 72296, 'loss/train': 1.817386507987976} 11/07/2021 07:22:56 - INFO - __main__ - Step 72298: {'lr': 0.0002696306450484555, 'samples': 13881216, 'steps': 72297, 'loss/train': 1.7570712566375732} 11/07/2021 07:22:57 - INFO - __main__ - Step 72299: {'lr': 0.0002696253546851328, 'samples': 13881408, 'steps': 72298, 'loss/train': 1.2409791946411133} 11/07/2021 07:22:57 - INFO - __main__ - Step 72300: {'lr': 0.00026962006431296726, 'samples': 13881600, 'steps': 72299, 'loss/train': 1.6547940969467163} 11/07/2021 07:22:57 - INFO - __main__ - Step 72301: {'lr': 0.00026961477393196127, 'samples': 13881792, 'steps': 72300, 'loss/train': 1.555829644203186} 11/07/2021 07:22:58 - INFO - __main__ - Step 72302: {'lr': 0.0002696094835421171, 'samples': 13881984, 'steps': 72301, 'loss/train': 1.404901146888733} 11/07/2021 07:22:59 - INFO - __main__ - Step 72303: {'lr': 0.00026960419314343723, 'samples': 13882176, 'steps': 72302, 'loss/train': 1.270935297012329} 11/07/2021 07:22:59 - INFO - __main__ - Step 72304: {'lr': 0.00026959890273592395, 'samples': 13882368, 'steps': 72303, 'loss/train': 1.280813217163086} 11/07/2021 07:22:59 - INFO - __main__ - Step 72305: {'lr': 0.00026959361231957974, 'samples': 13882560, 'steps': 72304, 'loss/train': 1.0684213638305664} 11/07/2021 07:23:00 - INFO - __main__ - Step 72306: {'lr': 0.00026958832189440704, 'samples': 13882752, 'steps': 72305, 'loss/train': 1.5166704654693604} 11/07/2021 07:23:01 - INFO - __main__ - Step 72307: {'lr': 0.00026958303146040806, 'samples': 13882944, 'steps': 72306, 'loss/train': 1.3363001346588135} 11/07/2021 07:23:01 - INFO - __main__ - Step 72308: {'lr': 0.00026957774101758525, 'samples': 13883136, 'steps': 72307, 'loss/train': 1.4429895877838135} 11/07/2021 07:23:01 - INFO - __main__ - Step 72309: {'lr': 0.00026957245056594104, 'samples': 13883328, 'steps': 72308, 'loss/train': 1.202574610710144} 11/07/2021 07:23:02 - INFO - __main__ - Step 72310: {'lr': 0.0002695671601054778, 'samples': 13883520, 'steps': 72309, 'loss/train': 1.5066176652908325} 11/07/2021 07:23:02 - INFO - __main__ - Step 72311: {'lr': 0.0002695618696361979, 'samples': 13883712, 'steps': 72310, 'loss/train': 1.5462393760681152} 11/07/2021 07:23:03 - INFO - __main__ - Step 72312: {'lr': 0.0002695565791581037, 'samples': 13883904, 'steps': 72311, 'loss/train': 1.5074796676635742} 11/07/2021 07:23:04 - INFO - __main__ - Step 72313: {'lr': 0.0002695512886711976, 'samples': 13884096, 'steps': 72312, 'loss/train': 1.357519268989563} 11/07/2021 07:23:04 - INFO - __main__ - Step 72314: {'lr': 0.00026954599817548204, 'samples': 13884288, 'steps': 72313, 'loss/train': 1.5690175294876099} 11/07/2021 07:23:04 - INFO - __main__ - Step 72315: {'lr': 0.0002695407076709593, 'samples': 13884480, 'steps': 72314, 'loss/train': 1.142893671989441} 11/07/2021 07:23:05 - INFO - __main__ - Step 72316: {'lr': 0.00026953541715763184, 'samples': 13884672, 'steps': 72315, 'loss/train': 1.2781633138656616} 11/07/2021 07:23:05 - INFO - __main__ - Step 72317: {'lr': 0.0002695301266355021, 'samples': 13884864, 'steps': 72316, 'loss/train': 1.42661714553833} 11/07/2021 07:23:05 - INFO - __main__ - Step 72318: {'lr': 0.00026952483610457223, 'samples': 13885056, 'steps': 72317, 'loss/train': 1.6657639741897583} 11/07/2021 07:23:07 - INFO - __main__ - Step 72319: {'lr': 0.0002695195455648449, 'samples': 13885248, 'steps': 72318, 'loss/train': 1.527952790260315} 11/07/2021 07:23:07 - INFO - __main__ - Step 72320: {'lr': 0.00026951425501632224, 'samples': 13885440, 'steps': 72319, 'loss/train': 0.6747020483016968} 11/07/2021 07:23:07 - INFO - __main__ - Step 72321: {'lr': 0.00026950896445900686, 'samples': 13885632, 'steps': 72320, 'loss/train': 1.5930856466293335} 11/07/2021 07:23:08 - INFO - __main__ - Step 72322: {'lr': 0.000269503673892901, 'samples': 13885824, 'steps': 72321, 'loss/train': 0.9301367998123169} 11/07/2021 07:23:08 - INFO - __main__ - Step 72323: {'lr': 0.0002694983833180071, 'samples': 13886016, 'steps': 72322, 'loss/train': 1.1291146278381348} 11/07/2021 07:23:09 - INFO - __main__ - Step 72324: {'lr': 0.0002694930927343276, 'samples': 13886208, 'steps': 72323, 'loss/train': 1.0340807437896729} 11/07/2021 07:23:10 - INFO - __main__ - Step 72325: {'lr': 0.0002694878021418647, 'samples': 13886400, 'steps': 72324, 'loss/train': 1.4265918731689453} 11/07/2021 07:23:10 - INFO - __main__ - Step 72326: {'lr': 0.00026948251154062093, 'samples': 13886592, 'steps': 72325, 'loss/train': 1.4471815824508667} 11/07/2021 07:23:10 - INFO - __main__ - Step 72327: {'lr': 0.0002694772209305987, 'samples': 13886784, 'steps': 72326, 'loss/train': 1.1893293857574463} 11/07/2021 07:23:11 - INFO - __main__ - Step 72328: {'lr': 0.0002694719303118003, 'samples': 13886976, 'steps': 72327, 'loss/train': 1.6936746835708618} 11/07/2021 07:23:12 - INFO - __main__ - Step 72329: {'lr': 0.0002694666396842281, 'samples': 13887168, 'steps': 72328, 'loss/train': 1.5914669036865234} 11/07/2021 07:23:12 - INFO - __main__ - Step 72330: {'lr': 0.00026946134904788454, 'samples': 13887360, 'steps': 72329, 'loss/train': 1.62176513671875} 11/07/2021 07:23:12 - INFO - __main__ - Step 72331: {'lr': 0.00026945605840277204, 'samples': 13887552, 'steps': 72330, 'loss/train': 1.5721153020858765} 11/07/2021 07:23:13 - INFO - __main__ - Step 72332: {'lr': 0.0002694507677488929, 'samples': 13887744, 'steps': 72331, 'loss/train': 1.1696053743362427} 11/07/2021 07:23:13 - INFO - __main__ - Step 72333: {'lr': 0.00026944547708624957, 'samples': 13887936, 'steps': 72332, 'loss/train': 1.5928407907485962} 11/07/2021 07:23:14 - INFO - __main__ - Step 72334: {'lr': 0.00026944018641484447, 'samples': 13888128, 'steps': 72333, 'loss/train': 1.6451858282089233} 11/07/2021 07:23:15 - INFO - __main__ - Step 72335: {'lr': 0.0002694348957346798, 'samples': 13888320, 'steps': 72334, 'loss/train': 1.321061372756958} 11/07/2021 07:23:15 - INFO - __main__ - Step 72336: {'lr': 0.00026942960504575814, 'samples': 13888512, 'steps': 72335, 'loss/train': 1.607119083404541} 11/07/2021 07:23:15 - INFO - __main__ - Step 72337: {'lr': 0.0002694243143480818, 'samples': 13888704, 'steps': 72336, 'loss/train': 0.7336689829826355} 11/07/2021 07:23:16 - INFO - __main__ - Step 72338: {'lr': 0.0002694190236416531, 'samples': 13888896, 'steps': 72337, 'loss/train': 1.7712891101837158} 11/07/2021 07:23:16 - INFO - __main__ - Step 72339: {'lr': 0.00026941373292647453, 'samples': 13889088, 'steps': 72338, 'loss/train': 2.02551531791687} 11/07/2021 07:23:17 - INFO - __main__ - Step 72340: {'lr': 0.00026940844220254846, 'samples': 13889280, 'steps': 72339, 'loss/train': 1.4117519855499268} 11/07/2021 07:23:18 - INFO - __main__ - Step 72341: {'lr': 0.00026940315146987726, 'samples': 13889472, 'steps': 72340, 'loss/train': 1.4541548490524292} 11/07/2021 07:23:18 - INFO - __main__ - Step 72342: {'lr': 0.0002693978607284632, 'samples': 13889664, 'steps': 72341, 'loss/train': 1.092153787612915} 11/07/2021 07:23:18 - INFO - __main__ - Step 72343: {'lr': 0.0002693925699783089, 'samples': 13889856, 'steps': 72342, 'loss/train': 1.3944000005722046} 11/07/2021 07:23:19 - INFO - __main__ - Step 72344: {'lr': 0.00026938727921941647, 'samples': 13890048, 'steps': 72343, 'loss/train': 1.5954972505569458} 11/07/2021 07:23:20 - INFO - __main__ - Step 72345: {'lr': 0.0002693819884517885, 'samples': 13890240, 'steps': 72344, 'loss/train': 1.4575368165969849} 11/07/2021 07:23:20 - INFO - __main__ - Step 72346: {'lr': 0.0002693766976754273, 'samples': 13890432, 'steps': 72345, 'loss/train': 1.6160011291503906} 11/07/2021 07:23:20 - INFO - __main__ - Step 72347: {'lr': 0.00026937140689033525, 'samples': 13890624, 'steps': 72346, 'loss/train': 1.3438751697540283} 11/07/2021 07:23:21 - INFO - __main__ - Step 72348: {'lr': 0.0002693661160965147, 'samples': 13890816, 'steps': 72347, 'loss/train': 1.534360408782959} 11/07/2021 07:23:21 - INFO - __main__ - Step 72349: {'lr': 0.00026936082529396816, 'samples': 13891008, 'steps': 72348, 'loss/train': 1.8044207096099854} 11/07/2021 07:23:22 - INFO - __main__ - Step 72350: {'lr': 0.0002693555344826979, 'samples': 13891200, 'steps': 72349, 'loss/train': 0.9859577417373657} 11/07/2021 07:23:22 - INFO - __main__ - Step 72351: {'lr': 0.00026935024366270635, 'samples': 13891392, 'steps': 72350, 'loss/train': 1.550057053565979} 11/07/2021 07:23:23 - INFO - __main__ - Step 72352: {'lr': 0.00026934495283399587, 'samples': 13891584, 'steps': 72351, 'loss/train': 1.2793008089065552} 11/07/2021 07:23:23 - INFO - __main__ - Step 72353: {'lr': 0.0002693396619965688, 'samples': 13891776, 'steps': 72352, 'loss/train': 1.2330052852630615} 11/07/2021 07:23:23 - INFO - __main__ - Step 72354: {'lr': 0.0002693343711504276, 'samples': 13891968, 'steps': 72353, 'loss/train': 0.46184927225112915} 11/07/2021 07:23:24 - INFO - __main__ - Step 72355: {'lr': 0.00026932908029557467, 'samples': 13892160, 'steps': 72354, 'loss/train': 1.4687118530273438} 11/07/2021 07:23:25 - INFO - __main__ - Step 72356: {'lr': 0.00026932378943201235, 'samples': 13892352, 'steps': 72355, 'loss/train': 1.6239213943481445} 11/07/2021 07:23:25 - INFO - __main__ - Step 72357: {'lr': 0.000269318498559743, 'samples': 13892544, 'steps': 72356, 'loss/train': 1.3154746294021606} 11/07/2021 07:23:25 - INFO - __main__ - Step 72358: {'lr': 0.00026931320767876907, 'samples': 13892736, 'steps': 72357, 'loss/train': 1.1281800270080566} 11/07/2021 07:23:26 - INFO - __main__ - Step 72359: {'lr': 0.0002693079167890928, 'samples': 13892928, 'steps': 72358, 'loss/train': 1.4118841886520386} 11/07/2021 07:23:26 - INFO - __main__ - Step 72360: {'lr': 0.0002693026258907168, 'samples': 13893120, 'steps': 72359, 'loss/train': 1.1402521133422852} 11/07/2021 07:23:27 - INFO - __main__ - Step 72361: {'lr': 0.00026929733498364336, 'samples': 13893312, 'steps': 72360, 'loss/train': 1.529834508895874} 11/07/2021 07:23:28 - INFO - __main__ - Step 72362: {'lr': 0.00026929204406787475, 'samples': 13893504, 'steps': 72361, 'loss/train': 1.3774641752243042} 11/07/2021 07:23:28 - INFO - __main__ - Step 72363: {'lr': 0.0002692867531434135, 'samples': 13893696, 'steps': 72362, 'loss/train': 1.5297375917434692} 11/07/2021 07:23:28 - INFO - __main__ - Step 72364: {'lr': 0.0002692814622102619, 'samples': 13893888, 'steps': 72363, 'loss/train': 0.6478986740112305} 11/07/2021 07:23:29 - INFO - __main__ - Step 72365: {'lr': 0.00026927617126842234, 'samples': 13894080, 'steps': 72364, 'loss/train': 0.8943771719932556} 11/07/2021 07:23:30 - INFO - __main__ - Step 72366: {'lr': 0.00026927088031789725, 'samples': 13894272, 'steps': 72365, 'loss/train': 1.528095006942749} 11/07/2021 07:23:30 - INFO - __main__ - Step 72367: {'lr': 0.00026926558935868905, 'samples': 13894464, 'steps': 72366, 'loss/train': 1.32515549659729} 11/07/2021 07:23:30 - INFO - __main__ - Step 72368: {'lr': 0.0002692602983908001, 'samples': 13894656, 'steps': 72367, 'loss/train': 1.1109652519226074} 11/07/2021 07:23:31 - INFO - __main__ - Step 72369: {'lr': 0.0002692550074142326, 'samples': 13894848, 'steps': 72368, 'loss/train': 1.3979120254516602} 11/07/2021 07:23:31 - INFO - __main__ - Step 72370: {'lr': 0.0002692497164289892, 'samples': 13895040, 'steps': 72369, 'loss/train': 1.5364481210708618} 11/07/2021 07:23:32 - INFO - __main__ - Step 72371: {'lr': 0.00026924442543507223, 'samples': 13895232, 'steps': 72370, 'loss/train': 1.663030743598938} 11/07/2021 07:23:33 - INFO - __main__ - Step 72372: {'lr': 0.0002692391344324839, 'samples': 13895424, 'steps': 72371, 'loss/train': 1.6201318502426147} 11/07/2021 07:23:33 - INFO - __main__ - Step 72373: {'lr': 0.00026923384342122676, 'samples': 13895616, 'steps': 72372, 'loss/train': 1.4912710189819336} 11/07/2021 07:23:33 - INFO - __main__ - Step 72374: {'lr': 0.0002692285524013032, 'samples': 13895808, 'steps': 72373, 'loss/train': 1.1143356561660767} 11/07/2021 07:23:34 - INFO - __main__ - Step 72375: {'lr': 0.00026922326137271554, 'samples': 13896000, 'steps': 72374, 'loss/train': 0.28572866320610046} 11/07/2021 07:23:35 - INFO - __main__ - Step 72376: {'lr': 0.0002692179703354661, 'samples': 13896192, 'steps': 72375, 'loss/train': 1.3591059446334839} 11/07/2021 07:23:35 - INFO - __main__ - Step 72377: {'lr': 0.0002692126792895574, 'samples': 13896384, 'steps': 72376, 'loss/train': 1.4764258861541748} 11/07/2021 07:23:35 - INFO - __main__ - Step 72378: {'lr': 0.00026920738823499167, 'samples': 13896576, 'steps': 72377, 'loss/train': 1.506506085395813} 11/07/2021 07:23:36 - INFO - __main__ - Step 72379: {'lr': 0.00026920209717177146, 'samples': 13896768, 'steps': 72378, 'loss/train': 1.7772713899612427} 11/07/2021 07:23:36 - INFO - __main__ - Step 72380: {'lr': 0.0002691968060998991, 'samples': 13896960, 'steps': 72379, 'loss/train': 1.5974366664886475} 11/07/2021 07:23:37 - INFO - __main__ - Step 72381: {'lr': 0.000269191515019377, 'samples': 13897152, 'steps': 72380, 'loss/train': 1.1888227462768555} 11/07/2021 07:23:37 - INFO - __main__ - Step 72382: {'lr': 0.0002691862239302074, 'samples': 13897344, 'steps': 72381, 'loss/train': 1.4632400274276733} 11/07/2021 07:23:38 - INFO - __main__ - Step 72383: {'lr': 0.0002691809328323928, 'samples': 13897536, 'steps': 72382, 'loss/train': 1.473443627357483} 11/07/2021 07:23:38 - INFO - __main__ - Step 72384: {'lr': 0.0002691756417259356, 'samples': 13897728, 'steps': 72383, 'loss/train': 1.716002106666565} 11/07/2021 07:23:38 - INFO - __main__ - Step 72385: {'lr': 0.0002691703506108381, 'samples': 13897920, 'steps': 72384, 'loss/train': 0.9499492645263672} 11/07/2021 07:23:39 - INFO - __main__ - Step 72386: {'lr': 0.0002691650594871028, 'samples': 13898112, 'steps': 72385, 'loss/train': 0.6456695199012756} 11/07/2021 07:23:40 - INFO - __main__ - Step 72387: {'lr': 0.000269159768354732, 'samples': 13898304, 'steps': 72386, 'loss/train': 1.7995036840438843} 11/07/2021 07:23:40 - INFO - __main__ - Step 72388: {'lr': 0.0002691544772137281, 'samples': 13898496, 'steps': 72387, 'loss/train': 0.9023466110229492} 11/07/2021 07:23:40 - INFO - __main__ - Step 72389: {'lr': 0.0002691491860640935, 'samples': 13898688, 'steps': 72388, 'loss/train': 1.3880952596664429} 11/07/2021 07:23:41 - INFO - __main__ - Step 72390: {'lr': 0.0002691438949058306, 'samples': 13898880, 'steps': 72389, 'loss/train': 1.1117550134658813} 11/07/2021 07:23:41 - INFO - __main__ - Step 72391: {'lr': 0.0002691386037389417, 'samples': 13899072, 'steps': 72390, 'loss/train': 1.5602036714553833} 11/07/2021 07:23:43 - INFO - __main__ - Step 72392: {'lr': 0.0002691333125634292, 'samples': 13899264, 'steps': 72391, 'loss/train': 1.4660526514053345} 11/07/2021 07:23:43 - INFO - __main__ - Step 72393: {'lr': 0.0002691280213792956, 'samples': 13899456, 'steps': 72392, 'loss/train': 1.367077350616455} 11/07/2021 07:23:43 - INFO - __main__ - Step 72394: {'lr': 0.0002691227301865432, 'samples': 13899648, 'steps': 72393, 'loss/train': 0.765631914138794} 11/07/2021 07:23:44 - INFO - __main__ - Step 72395: {'lr': 0.00026911743898517436, 'samples': 13899840, 'steps': 72394, 'loss/train': 1.175826072692871} 11/07/2021 07:23:44 - INFO - __main__ - Step 72396: {'lr': 0.00026911214777519156, 'samples': 13900032, 'steps': 72395, 'loss/train': 1.3016319274902344} 11/07/2021 07:23:44 - INFO - __main__ - Step 72397: {'lr': 0.00026910685655659705, 'samples': 13900224, 'steps': 72396, 'loss/train': 1.3310871124267578} 11/07/2021 07:23:45 - INFO - __main__ - Step 72398: {'lr': 0.00026910156532939327, 'samples': 13900416, 'steps': 72397, 'loss/train': 1.544783353805542} 11/07/2021 07:23:46 - INFO - __main__ - Step 72399: {'lr': 0.00026909627409358266, 'samples': 13900608, 'steps': 72398, 'loss/train': 1.3153016567230225} 11/07/2021 07:23:46 - INFO - __main__ - Step 72400: {'lr': 0.0002690909828491676, 'samples': 13900800, 'steps': 72399, 'loss/train': 1.4088802337646484} 11/07/2021 07:23:46 - INFO - __main__ - Step 72401: {'lr': 0.0002690856915961504, 'samples': 13900992, 'steps': 72400, 'loss/train': 1.3697350025177002} 11/07/2021 07:23:47 - INFO - __main__ - Step 72402: {'lr': 0.00026908040033453353, 'samples': 13901184, 'steps': 72401, 'loss/train': 1.6139004230499268} 11/07/2021 07:23:48 - INFO - __main__ - Step 72403: {'lr': 0.0002690751090643193, 'samples': 13901376, 'steps': 72402, 'loss/train': 1.6356942653656006} 11/07/2021 07:23:48 - INFO - __main__ - Step 72404: {'lr': 0.00026906981778551, 'samples': 13901568, 'steps': 72403, 'loss/train': 1.2514721155166626} 11/07/2021 07:23:49 - INFO - __main__ - Step 72405: {'lr': 0.0002690645264981083, 'samples': 13901760, 'steps': 72404, 'loss/train': 1.9122215509414673} 11/07/2021 07:23:49 - INFO - __main__ - Step 72406: {'lr': 0.00026905923520211634, 'samples': 13901952, 'steps': 72405, 'loss/train': 1.8604824542999268} 11/07/2021 07:23:49 - INFO - __main__ - Step 72407: {'lr': 0.0002690539438975366, 'samples': 13902144, 'steps': 72406, 'loss/train': 1.73414146900177} 11/07/2021 07:23:50 - INFO - __main__ - Step 72408: {'lr': 0.0002690486525843715, 'samples': 13902336, 'steps': 72407, 'loss/train': 1.2580565214157104} 11/07/2021 07:23:51 - INFO - __main__ - Step 72409: {'lr': 0.0002690433612626233, 'samples': 13902528, 'steps': 72408, 'loss/train': 1.100620150566101} 11/07/2021 07:23:51 - INFO - __main__ - Step 72410: {'lr': 0.0002690380699322945, 'samples': 13902720, 'steps': 72409, 'loss/train': 1.5221073627471924} 11/07/2021 07:23:51 - INFO - __main__ - Step 72411: {'lr': 0.00026903277859338735, 'samples': 13902912, 'steps': 72410, 'loss/train': 1.6674836874008179} 11/07/2021 07:23:52 - INFO - __main__ - Step 72412: {'lr': 0.00026902748724590435, 'samples': 13903104, 'steps': 72411, 'loss/train': 1.5673837661743164} 11/07/2021 07:23:52 - INFO - __main__ - Step 72413: {'lr': 0.00026902219588984796, 'samples': 13903296, 'steps': 72412, 'loss/train': 1.7807892560958862} 11/07/2021 07:23:53 - INFO - __main__ - Step 72414: {'lr': 0.00026901690452522036, 'samples': 13903488, 'steps': 72413, 'loss/train': 0.8754287958145142} 11/07/2021 07:23:53 - INFO - __main__ - Step 72415: {'lr': 0.0002690116131520241, 'samples': 13903680, 'steps': 72414, 'loss/train': 1.6743340492248535} 11/07/2021 07:23:54 - INFO - __main__ - Step 72416: {'lr': 0.00026900632177026144, 'samples': 13903872, 'steps': 72415, 'loss/train': 1.3976987600326538} 11/07/2021 07:23:54 - INFO - __main__ - Step 72417: {'lr': 0.0002690010303799349, 'samples': 13904064, 'steps': 72416, 'loss/train': 1.7459379434585571} 11/07/2021 07:23:54 - INFO - __main__ - Step 72418: {'lr': 0.0002689957389810467, 'samples': 13904256, 'steps': 72417, 'loss/train': 1.5627552270889282} 11/07/2021 07:23:56 - INFO - __main__ - Step 72419: {'lr': 0.00026899044757359937, 'samples': 13904448, 'steps': 72418, 'loss/train': 1.0312589406967163} 11/07/2021 07:23:56 - INFO - __main__ - Step 72420: {'lr': 0.0002689851561575952, 'samples': 13904640, 'steps': 72419, 'loss/train': 1.265851378440857} 11/07/2021 07:23:56 - INFO - __main__ - Step 72421: {'lr': 0.00026897986473303667, 'samples': 13904832, 'steps': 72420, 'loss/train': 1.2627938985824585} 11/07/2021 07:23:57 - INFO - __main__ - Step 72422: {'lr': 0.0002689745732999261, 'samples': 13905024, 'steps': 72421, 'loss/train': 1.4600110054016113} 11/07/2021 07:23:57 - INFO - __main__ - Step 72423: {'lr': 0.00026896928185826587, 'samples': 13905216, 'steps': 72422, 'loss/train': 2.04185152053833} 11/07/2021 07:23:58 - INFO - __main__ - Step 72424: {'lr': 0.00026896399040805835, 'samples': 13905408, 'steps': 72423, 'loss/train': 1.3512372970581055} 11/07/2021 07:23:58 - INFO - __main__ - Step 72425: {'lr': 0.0002689586989493059, 'samples': 13905600, 'steps': 72424, 'loss/train': 1.6062817573547363} 11/07/2021 07:23:59 - INFO - __main__ - Step 72426: {'lr': 0.000268953407482011, 'samples': 13905792, 'steps': 72425, 'loss/train': 0.9096741676330566} 11/07/2021 07:23:59 - INFO - __main__ - Step 72427: {'lr': 0.00026894811600617605, 'samples': 13905984, 'steps': 72426, 'loss/train': 1.5065635442733765} 11/07/2021 07:23:59 - INFO - __main__ - Step 72428: {'lr': 0.0002689428245218033, 'samples': 13906176, 'steps': 72427, 'loss/train': 1.3690136671066284} 11/07/2021 07:24:00 - INFO - __main__ - Step 72429: {'lr': 0.00026893753302889524, 'samples': 13906368, 'steps': 72428, 'loss/train': 1.678085207939148} 11/07/2021 07:24:01 - INFO - __main__ - Step 72430: {'lr': 0.0002689322415274542, 'samples': 13906560, 'steps': 72429, 'loss/train': 1.6128747463226318} 11/07/2021 07:24:01 - INFO - __main__ - Step 72431: {'lr': 0.00026892695001748255, 'samples': 13906752, 'steps': 72430, 'loss/train': 1.634108066558838} 11/07/2021 07:24:01 - INFO - __main__ - Step 72432: {'lr': 0.00026892165849898275, 'samples': 13906944, 'steps': 72431, 'loss/train': 1.435976266860962} 11/07/2021 07:24:02 - INFO - __main__ - Step 72433: {'lr': 0.0002689163669719572, 'samples': 13907136, 'steps': 72432, 'loss/train': 1.32687246799469} 11/07/2021 07:24:02 - INFO - __main__ - Step 72434: {'lr': 0.00026891107543640814, 'samples': 13907328, 'steps': 72433, 'loss/train': 0.7582547068595886} 11/07/2021 07:24:03 - INFO - __main__ - Step 72435: {'lr': 0.0002689057838923381, 'samples': 13907520, 'steps': 72434, 'loss/train': 1.5709538459777832} 11/07/2021 07:24:04 - INFO - __main__ - Step 72436: {'lr': 0.00026890049233974935, 'samples': 13907712, 'steps': 72435, 'loss/train': 1.5535178184509277} 11/07/2021 07:24:04 - INFO - __main__ - Step 72437: {'lr': 0.0002688952007786443, 'samples': 13907904, 'steps': 72436, 'loss/train': 1.1556810140609741} 11/07/2021 07:24:04 - INFO - __main__ - Step 72438: {'lr': 0.00026888990920902547, 'samples': 13908096, 'steps': 72437, 'loss/train': 1.4525315761566162} 11/07/2021 07:24:05 - INFO - __main__ - Step 72439: {'lr': 0.00026888461763089505, 'samples': 13908288, 'steps': 72438, 'loss/train': 1.1050622463226318} 11/07/2021 07:24:06 - INFO - __main__ - Step 72440: {'lr': 0.00026887932604425553, 'samples': 13908480, 'steps': 72439, 'loss/train': 1.7866710424423218} 11/07/2021 07:24:06 - INFO - __main__ - Step 72441: {'lr': 0.00026887403444910936, 'samples': 13908672, 'steps': 72440, 'loss/train': 1.8418906927108765} 11/07/2021 07:24:06 - INFO - __main__ - Step 72442: {'lr': 0.00026886874284545877, 'samples': 13908864, 'steps': 72441, 'loss/train': 1.3678860664367676} 11/07/2021 07:24:07 - INFO - __main__ - Step 72443: {'lr': 0.0002688634512333062, 'samples': 13909056, 'steps': 72442, 'loss/train': 0.9631813168525696} 11/07/2021 07:24:07 - INFO - __main__ - Step 72444: {'lr': 0.00026885815961265406, 'samples': 13909248, 'steps': 72443, 'loss/train': 1.3086284399032593} 11/07/2021 07:24:08 - INFO - __main__ - Step 72445: {'lr': 0.0002688528679835047, 'samples': 13909440, 'steps': 72444, 'loss/train': 1.3953495025634766} 11/07/2021 07:24:08 - INFO - __main__ - Step 72446: {'lr': 0.00026884757634586064, 'samples': 13909632, 'steps': 72445, 'loss/train': 1.867783784866333} 11/07/2021 07:24:09 - INFO - __main__ - Step 72447: {'lr': 0.0002688422846997241, 'samples': 13909824, 'steps': 72446, 'loss/train': 1.672526240348816} 11/07/2021 07:24:09 - INFO - __main__ - Step 72448: {'lr': 0.00026883699304509743, 'samples': 13910016, 'steps': 72447, 'loss/train': 1.3161230087280273} 11/07/2021 07:24:09 - INFO - __main__ - Step 72449: {'lr': 0.00026883170138198323, 'samples': 13910208, 'steps': 72448, 'loss/train': 1.2264667749404907} 11/07/2021 07:24:10 - INFO - __main__ - Step 72450: {'lr': 0.0002688264097103836, 'samples': 13910400, 'steps': 72449, 'loss/train': 1.7073169946670532} 11/07/2021 07:24:11 - INFO - __main__ - Step 72451: {'lr': 0.0002688211180303013, 'samples': 13910592, 'steps': 72450, 'loss/train': 1.0732033252716064} 11/07/2021 07:24:11 - INFO - __main__ - Step 72452: {'lr': 0.00026881582634173836, 'samples': 13910784, 'steps': 72451, 'loss/train': 2.379196882247925} 11/07/2021 07:24:11 - INFO - __main__ - Step 72453: {'lr': 0.0002688105346446973, 'samples': 13910976, 'steps': 72452, 'loss/train': 1.5634996891021729} 11/07/2021 07:24:12 - INFO - __main__ - Step 72454: {'lr': 0.00026880524293918044, 'samples': 13911168, 'steps': 72453, 'loss/train': 1.6281565427780151} 11/07/2021 07:24:12 - INFO - __main__ - Step 72455: {'lr': 0.0002687999512251903, 'samples': 13911360, 'steps': 72454, 'loss/train': 1.3458439111709595} 11/07/2021 07:24:13 - INFO - __main__ - Step 72456: {'lr': 0.00026879465950272916, 'samples': 13911552, 'steps': 72455, 'loss/train': 1.853585124015808} 11/07/2021 07:24:13 - INFO - __main__ - Step 72457: {'lr': 0.0002687893677717995, 'samples': 13911744, 'steps': 72456, 'loss/train': 1.4029479026794434} 11/07/2021 07:24:14 - INFO - __main__ - Step 72458: {'lr': 0.0002687840760324036, 'samples': 13911936, 'steps': 72457, 'loss/train': 6.15109395980835} 11/07/2021 07:24:14 - INFO - __main__ - Step 72459: {'lr': 0.00026877878428454395, 'samples': 13912128, 'steps': 72458, 'loss/train': 1.6384015083312988} 11/07/2021 07:24:14 - INFO - __main__ - Step 72460: {'lr': 0.00026877349252822283, 'samples': 13912320, 'steps': 72459, 'loss/train': 1.4617292881011963} 11/07/2021 07:24:16 - INFO - __main__ - Step 72461: {'lr': 0.0002687682007634426, 'samples': 13912512, 'steps': 72460, 'loss/train': 1.3251161575317383} 11/07/2021 07:24:16 - INFO - __main__ - Step 72462: {'lr': 0.0002687629089902058, 'samples': 13912704, 'steps': 72461, 'loss/train': 0.7302871346473694} 11/07/2021 07:24:16 - INFO - __main__ - Step 72463: {'lr': 0.00026875761720851466, 'samples': 13912896, 'steps': 72462, 'loss/train': 1.3116360902786255} 11/07/2021 07:24:17 - INFO - __main__ - Step 72464: {'lr': 0.00026875232541837164, 'samples': 13913088, 'steps': 72463, 'loss/train': 1.5056184530258179} 11/07/2021 07:24:17 - INFO - __main__ - Step 72465: {'lr': 0.0002687470336197791, 'samples': 13913280, 'steps': 72464, 'loss/train': 1.1049559116363525} 11/07/2021 07:24:18 - INFO - __main__ - Step 72466: {'lr': 0.0002687417418127394, 'samples': 13913472, 'steps': 72465, 'loss/train': 0.9264216423034668} 11/07/2021 07:24:18 - INFO - __main__ - Step 72467: {'lr': 0.00026873644999725506, 'samples': 13913664, 'steps': 72466, 'loss/train': 1.0930614471435547} 11/07/2021 07:24:19 - INFO - __main__ - Step 72468: {'lr': 0.00026873115817332825, 'samples': 13913856, 'steps': 72467, 'loss/train': 1.3583152294158936} 11/07/2021 07:24:19 - INFO - __main__ - Step 72469: {'lr': 0.00026872586634096163, 'samples': 13914048, 'steps': 72468, 'loss/train': 1.4802862405776978} 11/07/2021 07:24:19 - INFO - __main__ - Step 72470: {'lr': 0.0002687205745001573, 'samples': 13914240, 'steps': 72469, 'loss/train': 1.332694172859192} 11/07/2021 07:24:20 - INFO - __main__ - Step 72471: {'lr': 0.0002687152826509178, 'samples': 13914432, 'steps': 72470, 'loss/train': 0.9303925633430481} 11/07/2021 07:24:21 - INFO - __main__ - Step 72472: {'lr': 0.00026870999079324547, 'samples': 13914624, 'steps': 72471, 'loss/train': 0.5986149907112122} 11/07/2021 07:24:21 - INFO - __main__ - Step 72473: {'lr': 0.0002687046989271427, 'samples': 13914816, 'steps': 72472, 'loss/train': 1.6128629446029663} 11/07/2021 07:24:22 - INFO - __main__ - Step 72474: {'lr': 0.0002686994070526119, 'samples': 13915008, 'steps': 72473, 'loss/train': 1.132859468460083} 11/07/2021 07:24:22 - INFO - __main__ - Step 72475: {'lr': 0.00026869411516965543, 'samples': 13915200, 'steps': 72474, 'loss/train': 1.0866470336914062} 11/07/2021 07:24:22 - INFO - __main__ - Step 72476: {'lr': 0.0002686888232782757, 'samples': 13915392, 'steps': 72475, 'loss/train': 1.1032925844192505} 11/07/2021 07:24:23 - INFO - __main__ - Step 72477: {'lr': 0.00026868353137847505, 'samples': 13915584, 'steps': 72476, 'loss/train': 1.1359399557113647} 11/07/2021 07:24:24 - INFO - __main__ - Step 72478: {'lr': 0.0002686782394702559, 'samples': 13915776, 'steps': 72477, 'loss/train': 1.1402443647384644} 11/07/2021 07:24:24 - INFO - __main__ - Step 72479: {'lr': 0.0002686729475536206, 'samples': 13915968, 'steps': 72478, 'loss/train': 1.2069370746612549} 11/07/2021 07:24:24 - INFO - __main__ - Step 72480: {'lr': 0.0002686676556285716, 'samples': 13916160, 'steps': 72479, 'loss/train': 1.1373592615127563} 11/07/2021 07:24:25 - INFO - __main__ - Step 72481: {'lr': 0.0002686623636951112, 'samples': 13916352, 'steps': 72480, 'loss/train': 1.438489556312561} 11/07/2021 07:24:25 - INFO - __main__ - Step 72482: {'lr': 0.0002686570717532419, 'samples': 13916544, 'steps': 72481, 'loss/train': 2.8523361682891846} 11/07/2021 07:24:26 - INFO - __main__ - Step 72483: {'lr': 0.000268651779802966, 'samples': 13916736, 'steps': 72482, 'loss/train': 1.1464494466781616} 11/07/2021 07:24:27 - INFO - __main__ - Step 72484: {'lr': 0.0002686464878442858, 'samples': 13916928, 'steps': 72483, 'loss/train': 1.5012127161026} 11/07/2021 07:24:27 - INFO - __main__ - Step 72485: {'lr': 0.0002686411958772038, 'samples': 13917120, 'steps': 72484, 'loss/train': 1.6954067945480347} 11/07/2021 07:24:27 - INFO - __main__ - Step 72486: {'lr': 0.00026863590390172244, 'samples': 13917312, 'steps': 72485, 'loss/train': 1.7312731742858887} 11/07/2021 07:24:28 - INFO - __main__ - Step 72487: {'lr': 0.00026863061191784393, 'samples': 13917504, 'steps': 72486, 'loss/train': 1.698258876800537} 11/07/2021 07:24:29 - INFO - __main__ - Step 72488: {'lr': 0.00026862531992557083, 'samples': 13917696, 'steps': 72487, 'loss/train': 0.7926193475723267} 11/07/2021 07:24:29 - INFO - __main__ - Step 72489: {'lr': 0.00026862002792490546, 'samples': 13917888, 'steps': 72488, 'loss/train': 1.6633285284042358} 11/07/2021 07:24:30 - INFO - __main__ - Step 72490: {'lr': 0.00026861473591585015, 'samples': 13918080, 'steps': 72489, 'loss/train': 1.431684970855713} 11/07/2021 07:24:30 - INFO - __main__ - Step 72491: {'lr': 0.00026860944389840735, 'samples': 13918272, 'steps': 72490, 'loss/train': 0.9755832552909851} 11/07/2021 07:24:30 - INFO - __main__ - Step 72492: {'lr': 0.00026860415187257943, 'samples': 13918464, 'steps': 72491, 'loss/train': 1.4687983989715576} 11/07/2021 07:24:31 - INFO - __main__ - Step 72493: {'lr': 0.00026859885983836874, 'samples': 13918656, 'steps': 72492, 'loss/train': 1.202286958694458} 11/07/2021 07:24:32 - INFO - __main__ - Step 72494: {'lr': 0.00026859356779577765, 'samples': 13918848, 'steps': 72493, 'loss/train': 1.3172661066055298} 11/07/2021 07:24:32 - INFO - __main__ - Step 72495: {'lr': 0.00026858827574480866, 'samples': 13919040, 'steps': 72494, 'loss/train': 1.3980494737625122} 11/07/2021 07:24:32 - INFO - __main__ - Step 72496: {'lr': 0.0002685829836854641, 'samples': 13919232, 'steps': 72495, 'loss/train': 1.4536645412445068} 11/07/2021 07:24:33 - INFO - __main__ - Step 72497: {'lr': 0.00026857769161774624, 'samples': 13919424, 'steps': 72496, 'loss/train': 1.3414231538772583} 11/07/2021 07:24:34 - INFO - __main__ - Step 72498: {'lr': 0.00026857239954165764, 'samples': 13919616, 'steps': 72497, 'loss/train': 0.8883165121078491} 11/07/2021 07:24:34 - INFO - __main__ - Step 72499: {'lr': 0.0002685671074572005, 'samples': 13919808, 'steps': 72498, 'loss/train': 1.1668848991394043} 11/07/2021 07:24:34 - INFO - __main__ - Step 72500: {'lr': 0.0002685618153643774, 'samples': 13920000, 'steps': 72499, 'loss/train': 1.0365407466888428} 11/07/2021 07:24:35 - INFO - __main__ - Step 72501: {'lr': 0.0002685565232631906, 'samples': 13920192, 'steps': 72500, 'loss/train': 1.4693527221679688} 11/07/2021 07:24:35 - INFO - __main__ - Step 72502: {'lr': 0.0002685512311536426, 'samples': 13920384, 'steps': 72501, 'loss/train': 1.446653127670288} 11/07/2021 07:24:36 - INFO - __main__ - Step 72503: {'lr': 0.00026854593903573564, 'samples': 13920576, 'steps': 72502, 'loss/train': 1.9191501140594482} 11/07/2021 07:24:37 - INFO - __main__ - Step 72504: {'lr': 0.00026854064690947217, 'samples': 13920768, 'steps': 72503, 'loss/train': 1.527864694595337} 11/07/2021 07:24:37 - INFO - __main__ - Step 72505: {'lr': 0.00026853535477485454, 'samples': 13920960, 'steps': 72504, 'loss/train': 1.607940673828125} 11/07/2021 07:24:37 - INFO - __main__ - Step 72506: {'lr': 0.0002685300626318852, 'samples': 13921152, 'steps': 72505, 'loss/train': 1.6575528383255005} 11/07/2021 07:24:38 - INFO - __main__ - Step 72507: {'lr': 0.00026852477048056647, 'samples': 13921344, 'steps': 72506, 'loss/train': 1.1951234340667725} 11/07/2021 07:24:38 - INFO - __main__ - Step 72508: {'lr': 0.00026851947832090073, 'samples': 13921536, 'steps': 72507, 'loss/train': 1.4023412466049194} 11/07/2021 07:24:39 - INFO - __main__ - Step 72509: {'lr': 0.0002685141861528905, 'samples': 13921728, 'steps': 72508, 'loss/train': 1.656510591506958} 11/07/2021 07:24:39 - INFO - __main__ - Step 72510: {'lr': 0.000268508893976538, 'samples': 13921920, 'steps': 72509, 'loss/train': 1.288915991783142} 11/07/2021 07:24:40 - INFO - __main__ - Step 72511: {'lr': 0.0002685036017918457, 'samples': 13922112, 'steps': 72510, 'loss/train': 1.2778717279434204} 11/07/2021 07:24:40 - INFO - __main__ - Step 72512: {'lr': 0.00026849830959881587, 'samples': 13922304, 'steps': 72511, 'loss/train': 1.2224057912826538} 11/07/2021 07:24:40 - INFO - __main__ - Step 72513: {'lr': 0.00026849301739745107, 'samples': 13922496, 'steps': 72512, 'loss/train': 1.3719875812530518} 11/07/2021 07:24:41 - INFO - __main__ - Step 72514: {'lr': 0.00026848772518775363, 'samples': 13922688, 'steps': 72513, 'loss/train': 1.5132997035980225} 11/07/2021 07:24:42 - INFO - __main__ - Step 72515: {'lr': 0.00026848243296972584, 'samples': 13922880, 'steps': 72514, 'loss/train': 1.655316710472107} 11/07/2021 07:24:42 - INFO - __main__ - Step 72516: {'lr': 0.0002684771407433703, 'samples': 13923072, 'steps': 72515, 'loss/train': 1.3509024381637573} 11/07/2021 07:24:43 - INFO - __main__ - Step 72517: {'lr': 0.00026847184850868904, 'samples': 13923264, 'steps': 72516, 'loss/train': 1.0989288091659546} 11/07/2021 07:24:43 - INFO - __main__ - Step 72518: {'lr': 0.00026846655626568475, 'samples': 13923456, 'steps': 72517, 'loss/train': 1.3323023319244385} 11/07/2021 07:24:44 - INFO - __main__ - Step 72519: {'lr': 0.0002684612640143597, 'samples': 13923648, 'steps': 72518, 'loss/train': 0.9339466094970703} 11/07/2021 07:24:44 - INFO - __main__ - Step 72520: {'lr': 0.00026845597175471626, 'samples': 13923840, 'steps': 72519, 'loss/train': 1.2686220407485962} 11/07/2021 07:24:45 - INFO - __main__ - Step 72521: {'lr': 0.0002684506794867569, 'samples': 13924032, 'steps': 72520, 'loss/train': 1.3618583679199219} 11/07/2021 07:24:45 - INFO - __main__ - Step 72522: {'lr': 0.0002684453872104839, 'samples': 13924224, 'steps': 72521, 'loss/train': 1.5883625745773315} 11/07/2021 07:24:45 - INFO - __main__ - Step 72523: {'lr': 0.00026844009492589977, 'samples': 13924416, 'steps': 72522, 'loss/train': 1.5780678987503052} 11/07/2021 07:24:46 - INFO - __main__ - Step 72524: {'lr': 0.0002684348026330068, 'samples': 13924608, 'steps': 72523, 'loss/train': 1.4852921962738037} 11/07/2021 07:24:47 - INFO - __main__ - Step 72525: {'lr': 0.00026842951033180735, 'samples': 13924800, 'steps': 72524, 'loss/train': 1.272704839706421} 11/07/2021 07:24:47 - INFO - __main__ - Step 72526: {'lr': 0.00026842421802230384, 'samples': 13924992, 'steps': 72525, 'loss/train': 1.4284738302230835} 11/07/2021 07:24:47 - INFO - __main__ - Step 72527: {'lr': 0.00026841892570449866, 'samples': 13925184, 'steps': 72526, 'loss/train': 1.4045451879501343} 11/07/2021 07:24:48 - INFO - __main__ - Step 72528: {'lr': 0.00026841363337839417, 'samples': 13925376, 'steps': 72527, 'loss/train': 1.2337586879730225} 11/07/2021 07:24:48 - INFO - __main__ - Step 72529: {'lr': 0.00026840834104399294, 'samples': 13925568, 'steps': 72528, 'loss/train': 1.7222769260406494} 11/07/2021 07:24:49 - INFO - __main__ - Step 72530: {'lr': 0.0002684030487012971, 'samples': 13925760, 'steps': 72529, 'loss/train': 0.9374347925186157} 11/07/2021 07:24:49 - INFO - __main__ - Step 72531: {'lr': 0.00026839775635030907, 'samples': 13925952, 'steps': 72530, 'loss/train': 1.0962427854537964} 11/07/2021 07:24:50 - INFO - __main__ - Step 72532: {'lr': 0.0002683924639910313, 'samples': 13926144, 'steps': 72531, 'loss/train': 1.767845869064331} 11/07/2021 07:24:50 - INFO - __main__ - Step 72533: {'lr': 0.00026838717162346623, 'samples': 13926336, 'steps': 72532, 'loss/train': 1.6249700784683228} 11/07/2021 07:24:51 - INFO - __main__ - Step 72534: {'lr': 0.00026838187924761617, 'samples': 13926528, 'steps': 72533, 'loss/train': 1.3998818397521973} 11/07/2021 07:24:52 - INFO - __main__ - Step 72535: {'lr': 0.00026837658686348345, 'samples': 13926720, 'steps': 72534, 'loss/train': 1.5638364553451538} 11/07/2021 07:24:52 - INFO - __main__ - Step 72536: {'lr': 0.0002683712944710706, 'samples': 13926912, 'steps': 72535, 'loss/train': 0.7527962923049927} 11/07/2021 07:24:52 - INFO - __main__ - Step 72537: {'lr': 0.0002683660020703799, 'samples': 13927104, 'steps': 72536, 'loss/train': 1.2198460102081299} 11/07/2021 07:24:53 - INFO - __main__ - Step 72538: {'lr': 0.0002683607096614138, 'samples': 13927296, 'steps': 72537, 'loss/train': 1.1737486124038696} 11/07/2021 07:24:53 - INFO - __main__ - Step 72539: {'lr': 0.0002683554172441746, 'samples': 13927488, 'steps': 72538, 'loss/train': 1.567372441291809} 11/07/2021 07:24:54 - INFO - __main__ - Step 72540: {'lr': 0.0002683501248186648, 'samples': 13927680, 'steps': 72539, 'loss/train': 1.1291459798812866} 11/07/2021 07:24:54 - INFO - __main__ - Step 72541: {'lr': 0.0002683448323848866, 'samples': 13927872, 'steps': 72540, 'loss/train': 1.1780321598052979} 11/07/2021 07:24:55 - INFO - __main__ - Step 72542: {'lr': 0.0002683395399428426, 'samples': 13928064, 'steps': 72541, 'loss/train': 1.0757434368133545} 11/07/2021 07:24:55 - INFO - __main__ - Step 72543: {'lr': 0.0002683342474925351, 'samples': 13928256, 'steps': 72542, 'loss/train': 1.7081429958343506} 11/07/2021 07:24:55 - INFO - __main__ - Step 72544: {'lr': 0.00026832895503396643, 'samples': 13928448, 'steps': 72543, 'loss/train': 1.4797818660736084} 11/07/2021 07:24:56 - INFO - __main__ - Step 72545: {'lr': 0.00026832366256713896, 'samples': 13928640, 'steps': 72544, 'loss/train': 1.6070680618286133} 11/07/2021 07:24:57 - INFO - __main__ - Step 72546: {'lr': 0.00026831837009205523, 'samples': 13928832, 'steps': 72545, 'loss/train': 1.8179566860198975} 11/07/2021 07:24:57 - INFO - __main__ - Step 72547: {'lr': 0.0002683130776087174, 'samples': 13929024, 'steps': 72546, 'loss/train': 1.7131249904632568} 11/07/2021 07:24:57 - INFO - __main__ - Step 72548: {'lr': 0.0002683077851171281, 'samples': 13929216, 'steps': 72547, 'loss/train': 1.317906141281128} 11/07/2021 07:24:58 - INFO - __main__ - Step 72549: {'lr': 0.00026830249261728956, 'samples': 13929408, 'steps': 72548, 'loss/train': 0.7801134586334229} 11/07/2021 07:24:59 - INFO - __main__ - Step 72550: {'lr': 0.00026829720010920424, 'samples': 13929600, 'steps': 72549, 'loss/train': 1.2550705671310425} 11/07/2021 07:24:59 - INFO - __main__ - Step 72551: {'lr': 0.00026829190759287443, 'samples': 13929792, 'steps': 72550, 'loss/train': 1.308559775352478} 11/07/2021 07:24:59 - INFO - __main__ - Step 72552: {'lr': 0.00026828661506830256, 'samples': 13929984, 'steps': 72551, 'loss/train': 1.596389651298523} 11/07/2021 07:25:00 - INFO - __main__ - Step 72553: {'lr': 0.00026828132253549103, 'samples': 13930176, 'steps': 72552, 'loss/train': 1.338829755783081} 11/07/2021 07:25:00 - INFO - __main__ - Step 72554: {'lr': 0.0002682760299944422, 'samples': 13930368, 'steps': 72553, 'loss/train': 1.4838206768035889} 11/07/2021 07:25:01 - INFO - __main__ - Step 72555: {'lr': 0.0002682707374451585, 'samples': 13930560, 'steps': 72554, 'loss/train': 1.0010170936584473} 11/07/2021 07:25:02 - INFO - __main__ - Step 72556: {'lr': 0.00026826544488764236, 'samples': 13930752, 'steps': 72555, 'loss/train': 1.204252004623413} 11/07/2021 07:25:02 - INFO - __main__ - Step 72557: {'lr': 0.00026826015232189596, 'samples': 13930944, 'steps': 72556, 'loss/train': 1.5046950578689575} 11/07/2021 07:25:02 - INFO - __main__ - Step 72558: {'lr': 0.0002682548597479219, 'samples': 13931136, 'steps': 72557, 'loss/train': 1.306104302406311} 11/07/2021 07:25:03 - INFO - __main__ - Step 72559: {'lr': 0.00026824956716572245, 'samples': 13931328, 'steps': 72558, 'loss/train': 1.4820377826690674} 11/07/2021 07:25:03 - INFO - __main__ - Step 72560: {'lr': 0.00026824427457530005, 'samples': 13931520, 'steps': 72559, 'loss/train': 1.733892798423767} 11/07/2021 07:25:04 - INFO - __main__ - Step 72561: {'lr': 0.000268238981976657, 'samples': 13931712, 'steps': 72560, 'loss/train': 0.971609354019165} 11/07/2021 07:25:04 - INFO - __main__ - Step 72562: {'lr': 0.00026823368936979583, 'samples': 13931904, 'steps': 72561, 'loss/train': 1.040374755859375} 11/07/2021 07:25:05 - INFO - __main__ - Step 72563: {'lr': 0.00026822839675471884, 'samples': 13932096, 'steps': 72562, 'loss/train': 1.2526631355285645} 11/07/2021 07:25:05 - INFO - __main__ - Step 72564: {'lr': 0.00026822310413142836, 'samples': 13932288, 'steps': 72563, 'loss/train': 1.4306977987289429} 11/07/2021 07:25:05 - INFO - __main__ - Step 72565: {'lr': 0.00026821781149992684, 'samples': 13932480, 'steps': 72564, 'loss/train': 1.0685921907424927} 11/07/2021 07:25:07 - INFO - __main__ - Step 72566: {'lr': 0.0002682125188602167, 'samples': 13932672, 'steps': 72565, 'loss/train': 1.915734052658081} 11/07/2021 07:25:07 - INFO - __main__ - Step 72567: {'lr': 0.0002682072262123002, 'samples': 13932864, 'steps': 72566, 'loss/train': 0.9075606465339661} 11/07/2021 07:25:07 - INFO - __main__ - Step 72568: {'lr': 0.0002682019335561799, 'samples': 13933056, 'steps': 72567, 'loss/train': 1.2191753387451172} 11/07/2021 07:25:08 - INFO - __main__ - Step 72569: {'lr': 0.00026819664089185803, 'samples': 13933248, 'steps': 72568, 'loss/train': 1.5231331586837769} 11/07/2021 07:25:08 - INFO - __main__ - Step 72570: {'lr': 0.000268191348219337, 'samples': 13933440, 'steps': 72569, 'loss/train': 0.996781051158905} 11/07/2021 07:25:09 - INFO - __main__ - Step 72571: {'lr': 0.00026818605553861934, 'samples': 13933632, 'steps': 72570, 'loss/train': 1.9763076305389404} 11/07/2021 07:25:09 - INFO - __main__ - Step 72572: {'lr': 0.0002681807628497072, 'samples': 13933824, 'steps': 72571, 'loss/train': 1.1089622974395752} 11/07/2021 07:25:10 - INFO - __main__ - Step 72573: {'lr': 0.0002681754701526032, 'samples': 13934016, 'steps': 72572, 'loss/train': 1.0428727865219116} 11/07/2021 07:25:10 - INFO - __main__ - Step 72574: {'lr': 0.00026817017744730953, 'samples': 13934208, 'steps': 72573, 'loss/train': 0.9188135266304016} 11/07/2021 07:25:10 - INFO - __main__ - Step 72575: {'lr': 0.0002681648847338287, 'samples': 13934400, 'steps': 72574, 'loss/train': 1.7348133325576782} 11/07/2021 07:25:11 - INFO - __main__ - Step 72576: {'lr': 0.00026815959201216305, 'samples': 13934592, 'steps': 72575, 'loss/train': 1.3450623750686646} 11/07/2021 07:25:12 - INFO - __main__ - Step 72577: {'lr': 0.000268154299282315, 'samples': 13934784, 'steps': 72576, 'loss/train': 1.4470540285110474} 11/07/2021 07:25:12 - INFO - __main__ - Step 72578: {'lr': 0.00026814900654428684, 'samples': 13934976, 'steps': 72577, 'loss/train': 1.6053494215011597} 11/07/2021 07:25:12 - INFO - __main__ - Step 72579: {'lr': 0.000268143713798081, 'samples': 13935168, 'steps': 72578, 'loss/train': 1.38178551197052} 11/07/2021 07:25:13 - INFO - __main__ - Step 72580: {'lr': 0.00026813842104370003, 'samples': 13935360, 'steps': 72579, 'loss/train': 1.3333747386932373} 11/07/2021 07:25:13 - INFO - __main__ - Step 72581: {'lr': 0.000268133128281146, 'samples': 13935552, 'steps': 72580, 'loss/train': 1.4484100341796875} 11/07/2021 07:25:14 - INFO - __main__ - Step 72582: {'lr': 0.00026812783551042154, 'samples': 13935744, 'steps': 72581, 'loss/train': 1.298819899559021} 11/07/2021 07:25:15 - INFO - __main__ - Step 72583: {'lr': 0.0002681225427315289, 'samples': 13935936, 'steps': 72582, 'loss/train': 1.241127610206604} 11/07/2021 07:25:15 - INFO - __main__ - Step 72584: {'lr': 0.00026811724994447056, 'samples': 13936128, 'steps': 72583, 'loss/train': 1.769435167312622} 11/07/2021 07:25:15 - INFO - __main__ - Step 72585: {'lr': 0.00026811195714924893, 'samples': 13936320, 'steps': 72584, 'loss/train': 1.949623703956604} 11/07/2021 07:25:16 - INFO - __main__ - Step 72586: {'lr': 0.0002681066643458663, 'samples': 13936512, 'steps': 72585, 'loss/train': 1.7945188283920288} 11/07/2021 07:25:17 - INFO - __main__ - Step 72587: {'lr': 0.00026810137153432503, 'samples': 13936704, 'steps': 72586, 'loss/train': 1.5485197305679321} 11/07/2021 07:25:17 - INFO - __main__ - Step 72588: {'lr': 0.0002680960787146276, 'samples': 13936896, 'steps': 72587, 'loss/train': 1.7288057804107666} 11/07/2021 07:25:17 - INFO - __main__ - Step 72589: {'lr': 0.0002680907858867763, 'samples': 13937088, 'steps': 72588, 'loss/train': 1.6845053434371948} 11/07/2021 07:25:18 - INFO - __main__ - Step 72590: {'lr': 0.0002680854930507736, 'samples': 13937280, 'steps': 72589, 'loss/train': 1.4406689405441284} 11/07/2021 07:25:18 - INFO - __main__ - Step 72591: {'lr': 0.0002680802002066219, 'samples': 13937472, 'steps': 72590, 'loss/train': 1.4976274967193604} 11/07/2021 07:25:19 - INFO - __main__ - Step 72592: {'lr': 0.00026807490735432355, 'samples': 13937664, 'steps': 72591, 'loss/train': 1.4353972673416138} 11/07/2021 07:25:19 - INFO - __main__ - Step 72593: {'lr': 0.0002680696144938809, 'samples': 13937856, 'steps': 72592, 'loss/train': 2.0788493156433105} 11/07/2021 07:25:20 - INFO - __main__ - Step 72594: {'lr': 0.0002680643216252963, 'samples': 13938048, 'steps': 72593, 'loss/train': 1.2977896928787231} 11/07/2021 07:25:20 - INFO - __main__ - Step 72595: {'lr': 0.0002680590287485722, 'samples': 13938240, 'steps': 72594, 'loss/train': 1.4139748811721802} 11/07/2021 07:25:20 - INFO - __main__ - Step 72596: {'lr': 0.0002680537358637111, 'samples': 13938432, 'steps': 72595, 'loss/train': 1.3091201782226562} 11/07/2021 07:25:21 - INFO - __main__ - Step 72597: {'lr': 0.00026804844297071524, 'samples': 13938624, 'steps': 72596, 'loss/train': 1.2416839599609375} 11/07/2021 07:25:22 - INFO - __main__ - Step 72598: {'lr': 0.00026804315006958695, 'samples': 13938816, 'steps': 72597, 'loss/train': 1.0384931564331055} 11/07/2021 07:25:22 - INFO - __main__ - Step 72599: {'lr': 0.0002680378571603287, 'samples': 13939008, 'steps': 72598, 'loss/train': 1.264803409576416} 11/07/2021 07:25:23 - INFO - __main__ - Step 72600: {'lr': 0.0002680325642429429, 'samples': 13939200, 'steps': 72599, 'loss/train': 1.8298156261444092} 11/07/2021 07:25:23 - INFO - __main__ - Step 72601: {'lr': 0.0002680272713174319, 'samples': 13939392, 'steps': 72600, 'loss/train': 1.915143609046936} 11/07/2021 07:25:23 - INFO - __main__ - Step 72602: {'lr': 0.00026802197838379804, 'samples': 13939584, 'steps': 72601, 'loss/train': 0.7734698057174683} 11/07/2021 07:25:24 - INFO - __main__ - Step 72603: {'lr': 0.0002680166854420439, 'samples': 13939776, 'steps': 72602, 'loss/train': 1.4005502462387085} 11/07/2021 07:25:25 - INFO - __main__ - Step 72604: {'lr': 0.0002680113924921716, 'samples': 13939968, 'steps': 72603, 'loss/train': 1.6130592823028564} 11/07/2021 07:25:25 - INFO - __main__ - Step 72605: {'lr': 0.0002680060995341836, 'samples': 13940160, 'steps': 72604, 'loss/train': 0.934808075428009} 11/07/2021 07:25:25 - INFO - __main__ - Step 72606: {'lr': 0.00026800080656808246, 'samples': 13940352, 'steps': 72605, 'loss/train': 1.4480392932891846} 11/07/2021 07:25:26 - INFO - __main__ - Step 72607: {'lr': 0.0002679955135938704, 'samples': 13940544, 'steps': 72606, 'loss/train': 1.3895111083984375} 11/07/2021 07:25:27 - INFO - __main__ - Step 72608: {'lr': 0.00026799022061154977, 'samples': 13940736, 'steps': 72607, 'loss/train': 1.516844630241394} 11/07/2021 07:25:27 - INFO - __main__ - Step 72609: {'lr': 0.000267984927621123, 'samples': 13940928, 'steps': 72608, 'loss/train': 1.4612932205200195} 11/07/2021 07:25:27 - INFO - __main__ - Step 72610: {'lr': 0.00026797963462259265, 'samples': 13941120, 'steps': 72609, 'loss/train': 1.472011923789978} 11/07/2021 07:25:28 - INFO - __main__ - Step 72611: {'lr': 0.00026797434161596087, 'samples': 13941312, 'steps': 72610, 'loss/train': 1.0328866243362427} 11/07/2021 07:25:28 - INFO - __main__ - Step 72612: {'lr': 0.0002679690486012301, 'samples': 13941504, 'steps': 72611, 'loss/train': 0.6268113851547241} 11/07/2021 07:25:29 - INFO - __main__ - Step 72613: {'lr': 0.0002679637555784028, 'samples': 13941696, 'steps': 72612, 'loss/train': 1.4261201620101929} 11/07/2021 07:25:29 - INFO - __main__ - Step 72614: {'lr': 0.00026795846254748127, 'samples': 13941888, 'steps': 72613, 'loss/train': 1.5938509702682495} 11/07/2021 07:25:30 - INFO - __main__ - Step 72615: {'lr': 0.00026795316950846795, 'samples': 13942080, 'steps': 72614, 'loss/train': 1.478294849395752} 11/07/2021 07:25:30 - INFO - __main__ - Step 72616: {'lr': 0.0002679478764613652, 'samples': 13942272, 'steps': 72615, 'loss/train': 1.5912957191467285} 11/07/2021 07:25:31 - INFO - __main__ - Step 72617: {'lr': 0.0002679425834061755, 'samples': 13942464, 'steps': 72616, 'loss/train': 1.4950710535049438} 11/07/2021 07:25:32 - INFO - __main__ - Step 72618: {'lr': 0.00026793729034290103, 'samples': 13942656, 'steps': 72617, 'loss/train': 1.3243745565414429} 11/07/2021 07:25:32 - INFO - __main__ - Step 72619: {'lr': 0.0002679319972715443, 'samples': 13942848, 'steps': 72618, 'loss/train': 1.5109552145004272} 11/07/2021 07:25:32 - INFO - __main__ - Step 72620: {'lr': 0.00026792670419210777, 'samples': 13943040, 'steps': 72619, 'loss/train': 1.2912553548812866} 11/07/2021 07:25:33 - INFO - __main__ - Step 72621: {'lr': 0.0002679214111045937, 'samples': 13943232, 'steps': 72620, 'loss/train': 2.106344223022461} 11/07/2021 07:25:33 - INFO - __main__ - Step 72622: {'lr': 0.0002679161180090045, 'samples': 13943424, 'steps': 72621, 'loss/train': 1.457303762435913} 11/07/2021 07:25:33 - INFO - __main__ - Step 72623: {'lr': 0.0002679108249053426, 'samples': 13943616, 'steps': 72622, 'loss/train': 1.4743715524673462} 11/07/2021 07:25:35 - INFO - __main__ - Step 72624: {'lr': 0.00026790553179361037, 'samples': 13943808, 'steps': 72623, 'loss/train': 2.150235414505005} 11/07/2021 07:25:35 - INFO - __main__ - Step 72625: {'lr': 0.0002679002386738102, 'samples': 13944000, 'steps': 72624, 'loss/train': 1.4302295446395874} 11/07/2021 07:25:35 - INFO - __main__ - Step 72626: {'lr': 0.0002678949455459444, 'samples': 13944192, 'steps': 72625, 'loss/train': 0.6807823777198792} 11/07/2021 07:25:36 - INFO - __main__ - Step 72627: {'lr': 0.00026788965241001544, 'samples': 13944384, 'steps': 72626, 'loss/train': 1.2159268856048584} 11/07/2021 07:25:36 - INFO - __main__ - Step 72628: {'lr': 0.00026788435926602565, 'samples': 13944576, 'steps': 72627, 'loss/train': 1.643389344215393} 11/07/2021 07:25:37 - INFO - __main__ - Step 72629: {'lr': 0.0002678790661139775, 'samples': 13944768, 'steps': 72628, 'loss/train': 1.7682656049728394} 11/07/2021 07:25:38 - INFO - __main__ - Step 72630: {'lr': 0.00026787377295387334, 'samples': 13944960, 'steps': 72629, 'loss/train': 1.4028786420822144} 11/07/2021 07:25:38 - INFO - __main__ - Step 72631: {'lr': 0.00026786847978571543, 'samples': 13945152, 'steps': 72630, 'loss/train': 1.8767619132995605} 11/07/2021 07:25:38 - INFO - __main__ - Step 72632: {'lr': 0.0002678631866095063, 'samples': 13945344, 'steps': 72631, 'loss/train': 1.2172993421554565} 11/07/2021 07:25:39 - INFO - __main__ - Step 72633: {'lr': 0.0002678578934252483, 'samples': 13945536, 'steps': 72632, 'loss/train': 1.1777138710021973} 11/07/2021 07:25:40 - INFO - __main__ - Step 72634: {'lr': 0.0002678526002329438, 'samples': 13945728, 'steps': 72633, 'loss/train': 1.4167197942733765} 11/07/2021 07:25:40 - INFO - __main__ - Step 72635: {'lr': 0.00026784730703259524, 'samples': 13945920, 'steps': 72634, 'loss/train': 1.7559168338775635} 11/07/2021 07:25:40 - INFO - __main__ - Step 72636: {'lr': 0.0002678420138242049, 'samples': 13946112, 'steps': 72635, 'loss/train': 1.9371525049209595} 11/07/2021 07:25:41 - INFO - __main__ - Step 72637: {'lr': 0.0002678367206077753, 'samples': 13946304, 'steps': 72636, 'loss/train': 1.3979206085205078} 11/07/2021 07:25:41 - INFO - __main__ - Step 72638: {'lr': 0.00026783142738330865, 'samples': 13946496, 'steps': 72637, 'loss/train': 0.7148512005805969} 11/07/2021 07:25:42 - INFO - __main__ - Step 72639: {'lr': 0.0002678261341508075, 'samples': 13946688, 'steps': 72638, 'loss/train': 1.0567002296447754} 11/07/2021 07:25:42 - INFO - __main__ - Step 72640: {'lr': 0.0002678208409102742, 'samples': 13946880, 'steps': 72639, 'loss/train': 1.2417311668395996} 11/07/2021 07:25:43 - INFO - __main__ - Step 72641: {'lr': 0.00026781554766171104, 'samples': 13947072, 'steps': 72640, 'loss/train': 1.399972677230835} 11/07/2021 07:25:43 - INFO - __main__ - Step 72642: {'lr': 0.00026781025440512045, 'samples': 13947264, 'steps': 72641, 'loss/train': 1.4011762142181396} 11/07/2021 07:25:43 - INFO - __main__ - Step 72643: {'lr': 0.0002678049611405049, 'samples': 13947456, 'steps': 72642, 'loss/train': 1.1450165510177612} 11/07/2021 07:25:44 - INFO - __main__ - Step 72644: {'lr': 0.0002677996678678667, 'samples': 13947648, 'steps': 72643, 'loss/train': 1.2095953226089478} 11/07/2021 07:25:45 - INFO - __main__ - Step 72645: {'lr': 0.0002677943745872082, 'samples': 13947840, 'steps': 72644, 'loss/train': 1.7188478708267212} 11/07/2021 07:25:45 - INFO - __main__ - Step 72646: {'lr': 0.00026778908129853187, 'samples': 13948032, 'steps': 72645, 'loss/train': 1.0378897190093994} 11/07/2021 07:25:46 - INFO - __main__ - Step 72647: {'lr': 0.00026778378800184, 'samples': 13948224, 'steps': 72646, 'loss/train': 1.4048717021942139} 11/07/2021 07:25:46 - INFO - __main__ - Step 72648: {'lr': 0.00026777849469713513, 'samples': 13948416, 'steps': 72647, 'loss/train': 1.4201749563217163} 11/07/2021 07:25:47 - INFO - __main__ - Step 72649: {'lr': 0.0002677732013844194, 'samples': 13948608, 'steps': 72648, 'loss/train': 1.1447674036026} 11/07/2021 07:25:47 - INFO - __main__ - Step 72650: {'lr': 0.0002677679080636955, 'samples': 13948800, 'steps': 72649, 'loss/train': 2.571249008178711} 11/07/2021 07:25:48 - INFO - __main__ - Step 72651: {'lr': 0.00026776261473496557, 'samples': 13948992, 'steps': 72650, 'loss/train': 1.6854143142700195} 11/07/2021 07:25:48 - INFO - __main__ - Step 72652: {'lr': 0.00026775732139823206, 'samples': 13949184, 'steps': 72651, 'loss/train': 1.5217723846435547} 11/07/2021 07:25:48 - INFO - __main__ - Step 72653: {'lr': 0.0002677520280534974, 'samples': 13949376, 'steps': 72652, 'loss/train': 1.7155711650848389} 11/07/2021 07:25:49 - INFO - __main__ - Step 72654: {'lr': 0.00026774673470076395, 'samples': 13949568, 'steps': 72653, 'loss/train': 1.8125756978988647} 11/07/2021 07:25:50 - INFO - __main__ - Step 72655: {'lr': 0.00026774144134003407, 'samples': 13949760, 'steps': 72654, 'loss/train': 1.3837733268737793} 11/07/2021 07:25:50 - INFO - __main__ - Step 72656: {'lr': 0.00026773614797131025, 'samples': 13949952, 'steps': 72655, 'loss/train': 1.4278502464294434} 11/07/2021 07:25:50 - INFO - __main__ - Step 72657: {'lr': 0.0002677308545945948, 'samples': 13950144, 'steps': 72656, 'loss/train': 1.3059513568878174} 11/07/2021 07:25:51 - INFO - __main__ - Step 72658: {'lr': 0.00026772556120989, 'samples': 13950336, 'steps': 72657, 'loss/train': 1.539501667022705} 11/07/2021 07:25:51 - INFO - __main__ - Step 72659: {'lr': 0.00026772026781719837, 'samples': 13950528, 'steps': 72658, 'loss/train': 1.5899226665496826} 11/07/2021 07:25:52 - INFO - __main__ - Step 72660: {'lr': 0.00026771497441652225, 'samples': 13950720, 'steps': 72659, 'loss/train': 1.1146423816680908} 11/07/2021 07:25:53 - INFO - __main__ - Step 72661: {'lr': 0.00026770968100786407, 'samples': 13950912, 'steps': 72660, 'loss/train': 1.2477779388427734} 11/07/2021 07:25:53 - INFO - __main__ - Step 72662: {'lr': 0.00026770438759122616, 'samples': 13951104, 'steps': 72661, 'loss/train': 1.4170360565185547} 11/07/2021 07:25:53 - INFO - __main__ - Step 72663: {'lr': 0.0002676990941666109, 'samples': 13951296, 'steps': 72662, 'loss/train': 1.4765942096710205} 11/07/2021 07:25:54 - INFO - __main__ - Step 72664: {'lr': 0.00026769380073402076, 'samples': 13951488, 'steps': 72663, 'loss/train': 1.4882735013961792} 11/07/2021 07:25:55 - INFO - __main__ - Step 72665: {'lr': 0.0002676885072934581, 'samples': 13951680, 'steps': 72664, 'loss/train': 1.599848985671997} 11/07/2021 07:25:55 - INFO - __main__ - Step 72666: {'lr': 0.00026768321384492517, 'samples': 13951872, 'steps': 72665, 'loss/train': 1.3601922988891602} 11/07/2021 07:25:55 - INFO - __main__ - Step 72667: {'lr': 0.00026767792038842446, 'samples': 13952064, 'steps': 72666, 'loss/train': 1.453302264213562} 11/07/2021 07:25:56 - INFO - __main__ - Step 72668: {'lr': 0.00026767262692395843, 'samples': 13952256, 'steps': 72667, 'loss/train': 1.2832262516021729} 11/07/2021 07:25:56 - INFO - __main__ - Step 72669: {'lr': 0.0002676673334515293, 'samples': 13952448, 'steps': 72668, 'loss/train': 1.2915233373641968} 11/07/2021 07:25:57 - INFO - __main__ - Step 72670: {'lr': 0.00026766203997113957, 'samples': 13952640, 'steps': 72669, 'loss/train': 1.3404359817504883} 11/07/2021 07:25:57 - INFO - __main__ - Step 72671: {'lr': 0.0002676567464827917, 'samples': 13952832, 'steps': 72670, 'loss/train': 1.4137177467346191} 11/07/2021 07:25:58 - INFO - __main__ - Step 72672: {'lr': 0.00026765145298648794, 'samples': 13953024, 'steps': 72671, 'loss/train': 1.7876784801483154} 11/07/2021 07:25:58 - INFO - __main__ - Step 72673: {'lr': 0.0002676461594822306, 'samples': 13953216, 'steps': 72672, 'loss/train': 1.2591344118118286} 11/07/2021 07:25:59 - INFO - __main__ - Step 72674: {'lr': 0.00026764086597002223, 'samples': 13953408, 'steps': 72673, 'loss/train': 1.1339951753616333} 11/07/2021 07:25:59 - INFO - __main__ - Step 72675: {'lr': 0.00026763557244986513, 'samples': 13953600, 'steps': 72674, 'loss/train': 1.32393479347229} 11/07/2021 07:26:00 - INFO - __main__ - Step 72676: {'lr': 0.0002676302789217617, 'samples': 13953792, 'steps': 72675, 'loss/train': 1.3300206661224365} 11/07/2021 07:26:00 - INFO - __main__ - Step 72677: {'lr': 0.00026762498538571443, 'samples': 13953984, 'steps': 72676, 'loss/train': 1.6197301149368286} 11/07/2021 07:26:01 - INFO - __main__ - Step 72678: {'lr': 0.0002676196918417256, 'samples': 13954176, 'steps': 72677, 'loss/train': 1.4037482738494873} 11/07/2021 07:26:01 - INFO - __main__ - Step 72679: {'lr': 0.0002676143982897976, 'samples': 13954368, 'steps': 72678, 'loss/train': 1.5613954067230225} 11/07/2021 07:26:01 - INFO - __main__ - Step 72680: {'lr': 0.0002676091047299327, 'samples': 13954560, 'steps': 72679, 'loss/train': 1.936429738998413} 11/07/2021 07:26:03 - INFO - __main__ - Step 72681: {'lr': 0.00026760381116213355, 'samples': 13954752, 'steps': 72680, 'loss/train': 1.5919585227966309} 11/07/2021 07:26:03 - INFO - __main__ - Step 72682: {'lr': 0.00026759851758640236, 'samples': 13954944, 'steps': 72681, 'loss/train': 1.2302850484848022} 11/07/2021 07:26:03 - INFO - __main__ - Step 72683: {'lr': 0.0002675932240027415, 'samples': 13955136, 'steps': 72682, 'loss/train': 1.0721853971481323} 11/07/2021 07:26:04 - INFO - __main__ - Step 72684: {'lr': 0.00026758793041115346, 'samples': 13955328, 'steps': 72683, 'loss/train': 1.564173936843872} 11/07/2021 07:26:04 - INFO - __main__ - Step 72685: {'lr': 0.00026758263681164057, 'samples': 13955520, 'steps': 72684, 'loss/train': 1.2328542470932007} 11/07/2021 07:26:05 - INFO - __main__ - Step 72686: {'lr': 0.0002675773432042052, 'samples': 13955712, 'steps': 72685, 'loss/train': 0.8934274315834045} 11/07/2021 07:26:05 - INFO - __main__ - Step 72687: {'lr': 0.00026757204958884973, 'samples': 13955904, 'steps': 72686, 'loss/train': 1.3041630983352661} 11/07/2021 07:26:06 - INFO - __main__ - Step 72688: {'lr': 0.0002675667559655766, 'samples': 13956096, 'steps': 72687, 'loss/train': 1.5742473602294922} 11/07/2021 07:26:06 - INFO - __main__ - Step 72689: {'lr': 0.00026756146233438815, 'samples': 13956288, 'steps': 72688, 'loss/train': 1.3753228187561035} 11/07/2021 07:26:06 - INFO - __main__ - Step 72690: {'lr': 0.00026755616869528675, 'samples': 13956480, 'steps': 72689, 'loss/train': 1.3721983432769775} 11/07/2021 07:26:07 - INFO - __main__ - Step 72691: {'lr': 0.00026755087504827486, 'samples': 13956672, 'steps': 72690, 'loss/train': 1.519019603729248} 11/07/2021 07:26:08 - INFO - __main__ - Step 72692: {'lr': 0.0002675455813933548, 'samples': 13956864, 'steps': 72691, 'loss/train': 0.7872691750526428} 11/07/2021 07:26:08 - INFO - __main__ - Step 72693: {'lr': 0.00026754028773052894, 'samples': 13957056, 'steps': 72692, 'loss/train': 1.8947665691375732} 11/07/2021 07:26:09 - INFO - __main__ - Step 72694: {'lr': 0.00026753499405979974, 'samples': 13957248, 'steps': 72693, 'loss/train': 1.6095494031906128} 11/07/2021 07:26:09 - INFO - __main__ - Step 72695: {'lr': 0.0002675297003811695, 'samples': 13957440, 'steps': 72694, 'loss/train': 1.1829142570495605} 11/07/2021 07:26:09 - INFO - __main__ - Step 72696: {'lr': 0.0002675244066946407, 'samples': 13957632, 'steps': 72695, 'loss/train': 2.0947113037109375} 11/07/2021 07:26:10 - INFO - __main__ - Step 72697: {'lr': 0.00026751911300021565, 'samples': 13957824, 'steps': 72696, 'loss/train': 1.5224584341049194} 11/07/2021 07:26:11 - INFO - __main__ - Step 72698: {'lr': 0.00026751381929789676, 'samples': 13958016, 'steps': 72697, 'loss/train': 1.32797372341156} 11/07/2021 07:26:11 - INFO - __main__ - Step 72699: {'lr': 0.00026750852558768634, 'samples': 13958208, 'steps': 72698, 'loss/train': 1.731775164604187} 11/07/2021 07:26:11 - INFO - __main__ - Step 72700: {'lr': 0.00026750323186958694, 'samples': 13958400, 'steps': 72699, 'loss/train': 1.8172688484191895} 11/07/2021 07:26:12 - INFO - __main__ - Step 72701: {'lr': 0.0002674979381436008, 'samples': 13958592, 'steps': 72700, 'loss/train': 1.4211548566818237} 11/07/2021 07:26:13 - INFO - __main__ - Step 72702: {'lr': 0.00026749264440973036, 'samples': 13958784, 'steps': 72701, 'loss/train': 1.5197982788085938} 11/07/2021 07:26:13 - INFO - __main__ - Step 72703: {'lr': 0.000267487350667978, 'samples': 13958976, 'steps': 72702, 'loss/train': 1.403113842010498} 11/07/2021 07:26:14 - INFO - __main__ - Step 72704: {'lr': 0.00026748205691834627, 'samples': 13959168, 'steps': 72703, 'loss/train': 1.4488133192062378} 11/07/2021 07:26:14 - INFO - __main__ - Step 72705: {'lr': 0.00026747676316083726, 'samples': 13959360, 'steps': 72704, 'loss/train': 1.5590615272521973} 11/07/2021 07:26:14 - INFO - __main__ - Step 72706: {'lr': 0.0002674714693954534, 'samples': 13959552, 'steps': 72705, 'loss/train': 1.216841697692871} 11/07/2021 07:26:15 - INFO - __main__ - Step 72707: {'lr': 0.0002674661756221973, 'samples': 13959744, 'steps': 72706, 'loss/train': 0.7178245782852173} 11/07/2021 07:26:16 - INFO - __main__ - Step 72708: {'lr': 0.00026746088184107116, 'samples': 13959936, 'steps': 72707, 'loss/train': 1.529034972190857} 11/07/2021 07:26:16 - INFO - __main__ - Step 72709: {'lr': 0.00026745558805207746, 'samples': 13960128, 'steps': 72708, 'loss/train': 1.7761566638946533} 11/07/2021 07:26:16 - INFO - __main__ - Step 72710: {'lr': 0.0002674502942552185, 'samples': 13960320, 'steps': 72709, 'loss/train': 1.372471570968628} 11/07/2021 07:26:17 - INFO - __main__ - Step 72711: {'lr': 0.0002674450004504967, 'samples': 13960512, 'steps': 72710, 'loss/train': 1.5064189434051514} 11/07/2021 07:26:18 - INFO - __main__ - Step 72712: {'lr': 0.00026743970663791443, 'samples': 13960704, 'steps': 72711, 'loss/train': 1.3116017580032349} 11/07/2021 07:26:18 - INFO - __main__ - Step 72713: {'lr': 0.00026743441281747415, 'samples': 13960896, 'steps': 72712, 'loss/train': 0.8791543841362} 11/07/2021 07:26:18 - INFO - __main__ - Step 72714: {'lr': 0.00026742911898917823, 'samples': 13961088, 'steps': 72713, 'loss/train': 0.7469682693481445} 11/07/2021 07:26:19 - INFO - __main__ - Step 72715: {'lr': 0.000267423825153029, 'samples': 13961280, 'steps': 72714, 'loss/train': 1.868187665939331} 11/07/2021 07:26:19 - INFO - __main__ - Step 72716: {'lr': 0.0002674185313090288, 'samples': 13961472, 'steps': 72715, 'loss/train': 1.2024091482162476} 11/07/2021 07:26:20 - INFO - __main__ - Step 72717: {'lr': 0.0002674132374571801, 'samples': 13961664, 'steps': 72716, 'loss/train': 1.823725700378418} 11/07/2021 07:26:21 - INFO - __main__ - Step 72718: {'lr': 0.0002674079435974852, 'samples': 13961856, 'steps': 72717, 'loss/train': 1.3806296586990356} 11/07/2021 07:26:21 - INFO - __main__ - Step 72719: {'lr': 0.0002674026497299467, 'samples': 13962048, 'steps': 72718, 'loss/train': 2.5938565731048584} 11/07/2021 07:26:22 - INFO - __main__ - Step 72720: {'lr': 0.00026739735585456674, 'samples': 13962240, 'steps': 72719, 'loss/train': 1.1187658309936523} 11/07/2021 07:26:22 - INFO - __main__ - Step 72721: {'lr': 0.0002673920619713478, 'samples': 13962432, 'steps': 72720, 'loss/train': 1.4470926523208618} 11/07/2021 07:26:22 - INFO - __main__ - Step 72722: {'lr': 0.0002673867680802923, 'samples': 13962624, 'steps': 72721, 'loss/train': 1.5292284488677979} 11/07/2021 07:26:23 - INFO - __main__ - Step 72723: {'lr': 0.00026738147418140255, 'samples': 13962816, 'steps': 72722, 'loss/train': 1.555342197418213} 11/07/2021 07:26:24 - INFO - __main__ - Step 72724: {'lr': 0.000267376180274681, 'samples': 13963008, 'steps': 72723, 'loss/train': 0.9571394920349121} 11/07/2021 07:26:24 - INFO - __main__ - Step 72725: {'lr': 0.00026737088636012994, 'samples': 13963200, 'steps': 72724, 'loss/train': 1.456715703010559} 11/07/2021 07:26:24 - INFO - __main__ - Step 72726: {'lr': 0.000267365592437752, 'samples': 13963392, 'steps': 72725, 'loss/train': 1.4794238805770874} 11/07/2021 07:26:25 - INFO - __main__ - Step 72727: {'lr': 0.00026736029850754926, 'samples': 13963584, 'steps': 72726, 'loss/train': 1.99520742893219} 11/07/2021 07:26:26 - INFO - __main__ - Step 72728: {'lr': 0.0002673550045695243, 'samples': 13963776, 'steps': 72727, 'loss/train': 1.0278048515319824} 11/07/2021 07:26:26 - INFO - __main__ - Step 72729: {'lr': 0.00026734971062367937, 'samples': 13963968, 'steps': 72728, 'loss/train': 1.5650207996368408} 11/07/2021 07:26:26 - INFO - __main__ - Step 72730: {'lr': 0.0002673444166700169, 'samples': 13964160, 'steps': 72729, 'loss/train': 1.4791680574417114} 11/07/2021 07:26:27 - INFO - __main__ - Step 72731: {'lr': 0.00026733912270853947, 'samples': 13964352, 'steps': 72730, 'loss/train': 0.8500827550888062} 11/07/2021 07:26:27 - INFO - __main__ - Step 72732: {'lr': 0.0002673338287392492, 'samples': 13964544, 'steps': 72731, 'loss/train': 1.2990503311157227} 11/07/2021 07:26:28 - INFO - __main__ - Step 72733: {'lr': 0.0002673285347621485, 'samples': 13964736, 'steps': 72732, 'loss/train': 1.4474400281906128} 11/07/2021 07:26:28 - INFO - __main__ - Step 72734: {'lr': 0.0002673232407772399, 'samples': 13964928, 'steps': 72733, 'loss/train': 1.2071399688720703} 11/07/2021 07:26:29 - INFO - __main__ - Step 72735: {'lr': 0.0002673179467845257, 'samples': 13965120, 'steps': 72734, 'loss/train': 0.9374812841415405} 11/07/2021 07:26:29 - INFO - __main__ - Step 72736: {'lr': 0.00026731265278400834, 'samples': 13965312, 'steps': 72735, 'loss/train': 1.852866768836975} 11/07/2021 07:26:29 - INFO - __main__ - Step 72737: {'lr': 0.00026730735877569014, 'samples': 13965504, 'steps': 72736, 'loss/train': 1.255841612815857} 11/07/2021 07:26:31 - INFO - __main__ - Step 72738: {'lr': 0.00026730206475957354, 'samples': 13965696, 'steps': 72737, 'loss/train': 1.4898868799209595} 11/07/2021 07:26:31 - INFO - __main__ - Step 72739: {'lr': 0.0002672967707356608, 'samples': 13965888, 'steps': 72738, 'loss/train': 1.2131037712097168} 11/07/2021 07:26:31 - INFO - __main__ - Step 72740: {'lr': 0.00026729147670395454, 'samples': 13966080, 'steps': 72739, 'loss/train': 0.11809580028057098} 11/07/2021 07:26:32 - INFO - __main__ - Step 72741: {'lr': 0.0002672861826644569, 'samples': 13966272, 'steps': 72740, 'loss/train': 1.6606783866882324} 11/07/2021 07:26:32 - INFO - __main__ - Step 72742: {'lr': 0.0002672808886171704, 'samples': 13966464, 'steps': 72741, 'loss/train': 1.3942312002182007} 11/07/2021 07:26:32 - INFO - __main__ - Step 72743: {'lr': 0.00026727559456209745, 'samples': 13966656, 'steps': 72742, 'loss/train': 1.5408155918121338} 11/07/2021 07:26:34 - INFO - __main__ - Step 72744: {'lr': 0.0002672703004992403, 'samples': 13966848, 'steps': 72743, 'loss/train': 2.0663232803344727} 11/07/2021 07:26:34 - INFO - __main__ - Step 72745: {'lr': 0.0002672650064286015, 'samples': 13967040, 'steps': 72744, 'loss/train': 1.2912648916244507} 11/07/2021 07:26:35 - INFO - __main__ - Step 72746: {'lr': 0.00026725971235018334, 'samples': 13967232, 'steps': 72745, 'loss/train': 1.1622644662857056} 11/07/2021 07:26:35 - INFO - __main__ - Step 72747: {'lr': 0.00026725441826398814, 'samples': 13967424, 'steps': 72746, 'loss/train': 1.3780267238616943} 11/07/2021 07:26:35 - INFO - __main__ - Step 72748: {'lr': 0.00026724912417001845, 'samples': 13967616, 'steps': 72747, 'loss/train': 1.0155454874038696} 11/07/2021 07:26:36 - INFO - __main__ - Step 72749: {'lr': 0.0002672438300682765, 'samples': 13967808, 'steps': 72748, 'loss/train': 1.4400891065597534} 11/07/2021 07:26:37 - INFO - __main__ - Step 72750: {'lr': 0.0002672385359587648, 'samples': 13968000, 'steps': 72749, 'loss/train': 1.5068769454956055} 11/07/2021 07:26:37 - INFO - __main__ - Step 72751: {'lr': 0.0002672332418414857, 'samples': 13968192, 'steps': 72750, 'loss/train': 0.9791757464408875} 11/07/2021 07:26:37 - INFO - __main__ - Step 72752: {'lr': 0.00026722794771644155, 'samples': 13968384, 'steps': 72751, 'loss/train': 1.4883177280426025} 11/07/2021 07:26:38 - INFO - __main__ - Step 72753: {'lr': 0.00026722265358363476, 'samples': 13968576, 'steps': 72752, 'loss/train': 1.6368886232376099} 11/07/2021 07:26:39 - INFO - __main__ - Step 72754: {'lr': 0.00026721735944306764, 'samples': 13968768, 'steps': 72753, 'loss/train': 1.4583431482315063} 11/07/2021 07:26:39 - INFO - __main__ - Step 72755: {'lr': 0.00026721206529474266, 'samples': 13968960, 'steps': 72754, 'loss/train': 1.4479984045028687} 11/07/2021 07:26:39 - INFO - __main__ - Step 72756: {'lr': 0.0002672067711386623, 'samples': 13969152, 'steps': 72755, 'loss/train': 1.2644007205963135} 11/07/2021 07:26:40 - INFO - __main__ - Step 72757: {'lr': 0.00026720147697482867, 'samples': 13969344, 'steps': 72756, 'loss/train': 1.50779128074646} 11/07/2021 07:26:40 - INFO - __main__ - Step 72758: {'lr': 0.0002671961828032445, 'samples': 13969536, 'steps': 72757, 'loss/train': 2.0665993690490723} 11/07/2021 07:26:41 - INFO - __main__ - Step 72759: {'lr': 0.00026719088862391186, 'samples': 13969728, 'steps': 72758, 'loss/train': 1.6720657348632812} 11/07/2021 07:26:41 - INFO - __main__ - Step 72760: {'lr': 0.00026718559443683333, 'samples': 13969920, 'steps': 72759, 'loss/train': 1.0273075103759766} 11/07/2021 07:26:42 - INFO - __main__ - Step 72761: {'lr': 0.00026718030024201116, 'samples': 13970112, 'steps': 72760, 'loss/train': 1.1391894817352295} 11/07/2021 07:26:42 - INFO - __main__ - Step 72762: {'lr': 0.0002671750060394479, 'samples': 13970304, 'steps': 72761, 'loss/train': 1.2661265134811401} 11/07/2021 07:26:43 - INFO - __main__ - Step 72763: {'lr': 0.0002671697118291458, 'samples': 13970496, 'steps': 72762, 'loss/train': 0.15297411382198334} 11/07/2021 07:26:43 - INFO - __main__ - Step 72764: {'lr': 0.00026716441761110734, 'samples': 13970688, 'steps': 72763, 'loss/train': 1.5645676851272583} 11/07/2021 07:26:44 - INFO - __main__ - Step 72765: {'lr': 0.0002671591233853348, 'samples': 13970880, 'steps': 72764, 'loss/train': 1.3377262353897095} 11/07/2021 07:26:44 - INFO - __main__ - Step 72766: {'lr': 0.0002671538291518307, 'samples': 13971072, 'steps': 72765, 'loss/train': 1.2922699451446533} 11/07/2021 07:26:45 - INFO - __main__ - Step 72767: {'lr': 0.00026714853491059725, 'samples': 13971264, 'steps': 72766, 'loss/train': 1.1044927835464478} 11/07/2021 07:26:45 - INFO - __main__ - Step 72768: {'lr': 0.00026714324066163695, 'samples': 13971456, 'steps': 72767, 'loss/train': 1.8340023756027222} 11/07/2021 07:26:45 - INFO - __main__ - Step 72769: {'lr': 0.00026713794640495226, 'samples': 13971648, 'steps': 72768, 'loss/train': 1.3116416931152344} 11/07/2021 07:26:47 - INFO - __main__ - Step 72770: {'lr': 0.0002671326521405454, 'samples': 13971840, 'steps': 72769, 'loss/train': 1.0315546989440918} 11/07/2021 07:26:47 - INFO - __main__ - Step 72771: {'lr': 0.0002671273578684189, 'samples': 13972032, 'steps': 72770, 'loss/train': 1.8137433528900146} 11/07/2021 07:26:47 - INFO - __main__ - Step 72772: {'lr': 0.000267122063588575, 'samples': 13972224, 'steps': 72771, 'loss/train': 1.4262194633483887} 11/07/2021 07:26:48 - INFO - __main__ - Step 72773: {'lr': 0.0002671167693010162, 'samples': 13972416, 'steps': 72772, 'loss/train': 1.0873794555664062} 11/07/2021 07:26:48 - INFO - __main__ - Step 72774: {'lr': 0.00026711147500574486, 'samples': 13972608, 'steps': 72773, 'loss/train': 1.4636635780334473} 11/07/2021 07:26:49 - INFO - __main__ - Step 72775: {'lr': 0.00026710618070276327, 'samples': 13972800, 'steps': 72774, 'loss/train': 1.3243474960327148} 11/07/2021 07:26:49 - INFO - __main__ - Step 72776: {'lr': 0.000267100886392074, 'samples': 13972992, 'steps': 72775, 'loss/train': 1.6796290874481201} 11/07/2021 07:26:50 - INFO - __main__ - Step 72777: {'lr': 0.00026709559207367927, 'samples': 13973184, 'steps': 72776, 'loss/train': 1.3970121145248413} 11/07/2021 07:26:50 - INFO - __main__ - Step 72778: {'lr': 0.0002670902977475816, 'samples': 13973376, 'steps': 72777, 'loss/train': 1.17753267288208} 11/07/2021 07:26:50 - INFO - __main__ - Step 72779: {'lr': 0.0002670850034137833, 'samples': 13973568, 'steps': 72778, 'loss/train': 1.4522733688354492} 11/07/2021 07:26:51 - INFO - __main__ - Step 72780: {'lr': 0.00026707970907228665, 'samples': 13973760, 'steps': 72779, 'loss/train': 1.2628660202026367} 11/07/2021 07:26:52 - INFO - __main__ - Step 72781: {'lr': 0.00026707441472309426, 'samples': 13973952, 'steps': 72780, 'loss/train': 1.422855257987976} 11/07/2021 07:26:52 - INFO - __main__ - Step 72782: {'lr': 0.00026706912036620836, 'samples': 13974144, 'steps': 72781, 'loss/train': 1.3108220100402832} 11/07/2021 07:26:52 - INFO - __main__ - Step 72783: {'lr': 0.0002670638260016313, 'samples': 13974336, 'steps': 72782, 'loss/train': 1.2263892889022827} 11/07/2021 07:26:53 - INFO - __main__ - Step 72784: {'lr': 0.00026705853162936567, 'samples': 13974528, 'steps': 72783, 'loss/train': 1.6538690328598022} 11/07/2021 07:26:54 - INFO - __main__ - Step 72785: {'lr': 0.0002670532372494137, 'samples': 13974720, 'steps': 72784, 'loss/train': 1.5943907499313354} 11/07/2021 07:26:55 - INFO - __main__ - Step 72786: {'lr': 0.0002670479428617778, 'samples': 13974912, 'steps': 72785, 'loss/train': 1.4015588760375977} 11/07/2021 07:26:55 - INFO - __main__ - Step 72787: {'lr': 0.0002670426484664603, 'samples': 13975104, 'steps': 72786, 'loss/train': 0.7691798210144043} 11/07/2021 07:26:55 - INFO - __main__ - Step 72788: {'lr': 0.00026703735406346374, 'samples': 13975296, 'steps': 72787, 'loss/train': 1.1911020278930664} 11/07/2021 07:26:56 - INFO - __main__ - Step 72789: {'lr': 0.0002670320596527903, 'samples': 13975488, 'steps': 72788, 'loss/train': 1.539013147354126} 11/07/2021 07:26:56 - INFO - __main__ - Step 72790: {'lr': 0.00026702676523444256, 'samples': 13975680, 'steps': 72789, 'loss/train': 1.5229483842849731} 11/07/2021 07:26:57 - INFO - __main__ - Step 72791: {'lr': 0.00026702147080842284, 'samples': 13975872, 'steps': 72790, 'loss/train': 1.466184139251709} 11/07/2021 07:26:57 - INFO - __main__ - Step 72792: {'lr': 0.00026701617637473347, 'samples': 13976064, 'steps': 72791, 'loss/train': 1.5235601663589478} 11/07/2021 07:26:58 - INFO - __main__ - Step 72793: {'lr': 0.00026701088193337684, 'samples': 13976256, 'steps': 72792, 'loss/train': 1.692855715751648} 11/07/2021 07:26:58 - INFO - __main__ - Step 72794: {'lr': 0.00026700558748435544, 'samples': 13976448, 'steps': 72793, 'loss/train': 1.1439056396484375} 11/07/2021 07:26:58 - INFO - __main__ - Step 72795: {'lr': 0.00026700029302767156, 'samples': 13976640, 'steps': 72794, 'loss/train': 0.9006438851356506} 11/07/2021 07:26:59 - INFO - __main__ - Step 72796: {'lr': 0.00026699499856332756, 'samples': 13976832, 'steps': 72795, 'loss/train': 1.6281927824020386} 11/07/2021 07:27:00 - INFO - __main__ - Step 72797: {'lr': 0.0002669897040913259, 'samples': 13977024, 'steps': 72796, 'loss/train': 1.5241526365280151} 11/07/2021 07:27:00 - INFO - __main__ - Step 72798: {'lr': 0.000266984409611669, 'samples': 13977216, 'steps': 72797, 'loss/train': 0.9634584784507751} 11/07/2021 07:27:01 - INFO - __main__ - Step 72799: {'lr': 0.00026697911512435914, 'samples': 13977408, 'steps': 72798, 'loss/train': 1.0903929471969604} 11/07/2021 07:27:01 - INFO - __main__ - Step 72800: {'lr': 0.00026697382062939874, 'samples': 13977600, 'steps': 72799, 'loss/train': 1.629724144935608} 11/07/2021 07:27:02 - INFO - __main__ - Step 72801: {'lr': 0.0002669685261267902, 'samples': 13977792, 'steps': 72800, 'loss/train': 1.4830453395843506} 11/07/2021 07:27:02 - INFO - __main__ - Step 72802: {'lr': 0.0002669632316165359, 'samples': 13977984, 'steps': 72801, 'loss/train': 1.2248756885528564} 11/07/2021 07:27:03 - INFO - __main__ - Step 72803: {'lr': 0.00026695793709863823, 'samples': 13978176, 'steps': 72802, 'loss/train': 1.5282821655273438} 11/07/2021 07:27:03 - INFO - __main__ - Step 72804: {'lr': 0.0002669526425730996, 'samples': 13978368, 'steps': 72803, 'loss/train': 1.6354515552520752} 11/07/2021 07:27:03 - INFO - __main__ - Step 72805: {'lr': 0.0002669473480399224, 'samples': 13978560, 'steps': 72804, 'loss/train': 1.4696828126907349} 11/07/2021 07:27:04 - INFO - __main__ - Step 72806: {'lr': 0.00026694205349910894, 'samples': 13978752, 'steps': 72805, 'loss/train': 1.431180477142334} 11/07/2021 07:27:05 - INFO - __main__ - Step 72807: {'lr': 0.00026693675895066166, 'samples': 13978944, 'steps': 72806, 'loss/train': 1.4967994689941406} 11/07/2021 07:27:05 - INFO - __main__ - Step 72808: {'lr': 0.00026693146439458294, 'samples': 13979136, 'steps': 72807, 'loss/train': 1.409214735031128} 11/07/2021 07:27:05 - INFO - __main__ - Step 72809: {'lr': 0.0002669261698308751, 'samples': 13979328, 'steps': 72808, 'loss/train': 1.3160253763198853} 11/07/2021 07:27:06 - INFO - __main__ - Step 72810: {'lr': 0.0002669208752595407, 'samples': 13979520, 'steps': 72809, 'loss/train': 1.092618465423584} 11/07/2021 07:27:08 - INFO - __main__ - Step 72811: {'lr': 0.00026691558068058196, 'samples': 13979712, 'steps': 72810, 'loss/train': 1.5045665502548218} 11/07/2021 07:27:09 - INFO - __main__ - Step 72812: {'lr': 0.0002669102860940014, 'samples': 13979904, 'steps': 72811, 'loss/train': 1.418875813484192} 11/07/2021 07:27:09 - INFO - __main__ - Step 72813: {'lr': 0.00026690499149980125, 'samples': 13980096, 'steps': 72812, 'loss/train': 1.5344008207321167} 11/07/2021 07:27:09 - INFO - __main__ - Step 72814: {'lr': 0.00026689969689798395, 'samples': 13980288, 'steps': 72813, 'loss/train': 1.883479118347168} 11/07/2021 07:27:10 - INFO - __main__ - Step 72815: {'lr': 0.00026689440228855197, 'samples': 13980480, 'steps': 72814, 'loss/train': 1.511064887046814} 11/07/2021 07:27:10 - INFO - __main__ - Step 72816: {'lr': 0.00026688910767150753, 'samples': 13980672, 'steps': 72815, 'loss/train': 1.5250297784805298} 11/07/2021 07:27:10 - INFO - __main__ - Step 72817: {'lr': 0.0002668838130468532, 'samples': 13980864, 'steps': 72816, 'loss/train': 1.443194031715393} 11/07/2021 07:27:11 - INFO - __main__ - Step 72818: {'lr': 0.0002668785184145913, 'samples': 13981056, 'steps': 72817, 'loss/train': 1.1562358140945435} 11/07/2021 07:27:12 - INFO - __main__ - Step 72819: {'lr': 0.00026687322377472416, 'samples': 13981248, 'steps': 72818, 'loss/train': 1.3337173461914062} 11/07/2021 07:27:12 - INFO - __main__ - Step 72820: {'lr': 0.0002668679291272542, 'samples': 13981440, 'steps': 72819, 'loss/train': 1.5917236804962158} 11/07/2021 07:27:13 - INFO - __main__ - Step 72821: {'lr': 0.00026686263447218386, 'samples': 13981632, 'steps': 72820, 'loss/train': 6.165134906768799} 11/07/2021 07:27:13 - INFO - __main__ - Step 72822: {'lr': 0.0002668573398095154, 'samples': 13981824, 'steps': 72821, 'loss/train': 1.8542944192886353} 11/07/2021 07:27:13 - INFO - __main__ - Step 72823: {'lr': 0.0002668520451392513, 'samples': 13982016, 'steps': 72822, 'loss/train': 1.1166030168533325} 11/07/2021 07:27:14 - INFO - __main__ - Step 72824: {'lr': 0.00026684675046139393, 'samples': 13982208, 'steps': 72823, 'loss/train': 1.3721237182617188} 11/07/2021 07:27:15 - INFO - __main__ - Step 72825: {'lr': 0.00026684145577594577, 'samples': 13982400, 'steps': 72824, 'loss/train': 1.514693021774292} 11/07/2021 07:27:15 - INFO - __main__ - Step 72826: {'lr': 0.00026683616108290906, 'samples': 13982592, 'steps': 72825, 'loss/train': 1.5365461111068726} 11/07/2021 07:27:15 - INFO - __main__ - Step 72827: {'lr': 0.00026683086638228614, 'samples': 13982784, 'steps': 72826, 'loss/train': 1.2643015384674072} 11/07/2021 07:27:16 - INFO - __main__ - Step 72828: {'lr': 0.0002668255716740796, 'samples': 13982976, 'steps': 72827, 'loss/train': 0.7827101349830627} 11/07/2021 07:27:16 - INFO - __main__ - Step 72829: {'lr': 0.00026682027695829167, 'samples': 13983168, 'steps': 72828, 'loss/train': 1.519393801689148} 11/07/2021 07:27:17 - INFO - __main__ - Step 72830: {'lr': 0.0002668149822349248, 'samples': 13983360, 'steps': 72829, 'loss/train': 1.3336896896362305} 11/07/2021 07:27:17 - INFO - __main__ - Step 72831: {'lr': 0.00026680968750398133, 'samples': 13983552, 'steps': 72830, 'loss/train': 1.3512060642242432} 11/07/2021 07:27:18 - INFO - __main__ - Step 72832: {'lr': 0.00026680439276546375, 'samples': 13983744, 'steps': 72831, 'loss/train': 1.1770318746566772} 11/07/2021 07:27:18 - INFO - __main__ - Step 72833: {'lr': 0.0002667990980193743, 'samples': 13983936, 'steps': 72832, 'loss/train': 1.204075813293457} 11/07/2021 07:27:18 - INFO - __main__ - Step 72834: {'lr': 0.0002667938032657155, 'samples': 13984128, 'steps': 72833, 'loss/train': 1.1525758504867554} 11/07/2021 07:27:20 - INFO - __main__ - Step 72835: {'lr': 0.00026678850850448955, 'samples': 13984320, 'steps': 72834, 'loss/train': 1.159059762954712} 11/07/2021 07:27:20 - INFO - __main__ - Step 72836: {'lr': 0.00026678321373569904, 'samples': 13984512, 'steps': 72835, 'loss/train': 1.2462222576141357} 11/07/2021 07:27:20 - INFO - __main__ - Step 72837: {'lr': 0.0002667779189593463, 'samples': 13984704, 'steps': 72836, 'loss/train': 1.285033941268921} 11/07/2021 07:27:21 - INFO - __main__ - Step 72838: {'lr': 0.00026677262417543364, 'samples': 13984896, 'steps': 72837, 'loss/train': 1.3465148210525513} 11/07/2021 07:27:21 - INFO - __main__ - Step 72839: {'lr': 0.0002667673293839635, 'samples': 13985088, 'steps': 72838, 'loss/train': 1.4330497980117798} 11/07/2021 07:27:22 - INFO - __main__ - Step 72840: {'lr': 0.00026676203458493824, 'samples': 13985280, 'steps': 72839, 'loss/train': 1.0221433639526367} 11/07/2021 07:27:22 - INFO - __main__ - Step 72841: {'lr': 0.00026675673977836036, 'samples': 13985472, 'steps': 72840, 'loss/train': 0.8223190307617188} 11/07/2021 07:27:23 - INFO - __main__ - Step 72842: {'lr': 0.00026675144496423204, 'samples': 13985664, 'steps': 72841, 'loss/train': 1.331637978553772} 11/07/2021 07:27:23 - INFO - __main__ - Step 72843: {'lr': 0.00026674615014255583, 'samples': 13985856, 'steps': 72842, 'loss/train': 1.6467255353927612} 11/07/2021 07:27:24 - INFO - __main__ - Step 72844: {'lr': 0.0002667408553133341, 'samples': 13986048, 'steps': 72843, 'loss/train': 1.4761741161346436} 11/07/2021 07:27:24 - INFO - __main__ - Step 72845: {'lr': 0.0002667355604765691, 'samples': 13986240, 'steps': 72844, 'loss/train': 1.44687819480896} 11/07/2021 07:27:25 - INFO - __main__ - Step 72846: {'lr': 0.0002667302656322634, 'samples': 13986432, 'steps': 72845, 'loss/train': 1.4872297048568726} 11/07/2021 07:27:25 - INFO - __main__ - Step 72847: {'lr': 0.00026672497078041924, 'samples': 13986624, 'steps': 72846, 'loss/train': 1.4453073740005493} 11/07/2021 07:27:26 - INFO - __main__ - Step 72848: {'lr': 0.0002667196759210391, 'samples': 13986816, 'steps': 72847, 'loss/train': 1.4829000234603882} 11/07/2021 07:27:26 - INFO - __main__ - Step 72849: {'lr': 0.00026671438105412535, 'samples': 13987008, 'steps': 72848, 'loss/train': 1.549554467201233} 11/07/2021 07:27:26 - INFO - __main__ - Step 72850: {'lr': 0.0002667090861796804, 'samples': 13987200, 'steps': 72849, 'loss/train': 0.8144473433494568} 11/07/2021 07:27:28 - INFO - __main__ - Step 72851: {'lr': 0.0002667037912977065, 'samples': 13987392, 'steps': 72850, 'loss/train': 1.6773470640182495} 11/07/2021 07:27:28 - INFO - __main__ - Step 72852: {'lr': 0.0002666984964082061, 'samples': 13987584, 'steps': 72851, 'loss/train': 1.458642840385437} 11/07/2021 07:27:29 - INFO - __main__ - Step 72853: {'lr': 0.0002666932015111817, 'samples': 13987776, 'steps': 72852, 'loss/train': 0.6936423182487488} 11/07/2021 07:27:29 - INFO - __main__ - Step 72854: {'lr': 0.00026668790660663557, 'samples': 13987968, 'steps': 72853, 'loss/train': 1.2398091554641724} 11/07/2021 07:27:29 - INFO - __main__ - Step 72855: {'lr': 0.0002666826116945701, 'samples': 13988160, 'steps': 72854, 'loss/train': 1.3292438983917236} 11/07/2021 07:27:30 - INFO - __main__ - Step 72856: {'lr': 0.0002666773167749878, 'samples': 13988352, 'steps': 72855, 'loss/train': 0.8000346422195435} 11/07/2021 07:27:31 - INFO - __main__ - Step 72857: {'lr': 0.00026667202184789087, 'samples': 13988544, 'steps': 72856, 'loss/train': 1.2618950605392456} 11/07/2021 07:27:31 - INFO - __main__ - Step 72858: {'lr': 0.00026666672691328183, 'samples': 13988736, 'steps': 72857, 'loss/train': 1.2181161642074585} 11/07/2021 07:27:32 - INFO - __main__ - Step 72859: {'lr': 0.00026666143197116296, 'samples': 13988928, 'steps': 72858, 'loss/train': 1.306178092956543} 11/07/2021 07:27:32 - INFO - __main__ - Step 72860: {'lr': 0.0002666561370215368, 'samples': 13989120, 'steps': 72859, 'loss/train': 1.1562331914901733} 11/07/2021 07:27:32 - INFO - __main__ - Step 72861: {'lr': 0.0002666508420644056, 'samples': 13989312, 'steps': 72860, 'loss/train': 1.2128386497497559} 11/07/2021 07:27:34 - INFO - __main__ - Step 72862: {'lr': 0.0002666455470997717, 'samples': 13989504, 'steps': 72861, 'loss/train': 1.310530185699463} 11/07/2021 07:27:34 - INFO - __main__ - Step 72863: {'lr': 0.0002666402521276376, 'samples': 13989696, 'steps': 72862, 'loss/train': 0.5505475997924805} 11/07/2021 07:27:34 - INFO - __main__ - Step 72864: {'lr': 0.0002666349571480058, 'samples': 13989888, 'steps': 72863, 'loss/train': 1.3317562341690063} 11/07/2021 07:27:35 - INFO - __main__ - Step 72865: {'lr': 0.0002666296621608784, 'samples': 13990080, 'steps': 72864, 'loss/train': 1.1878348588943481} 11/07/2021 07:27:35 - INFO - __main__ - Step 72866: {'lr': 0.00026662436716625804, 'samples': 13990272, 'steps': 72865, 'loss/train': 1.2770096063613892} 11/07/2021 07:27:36 - INFO - __main__ - Step 72867: {'lr': 0.00026661907216414695, 'samples': 13990464, 'steps': 72866, 'loss/train': 1.3498517274856567} 11/07/2021 07:27:36 - INFO - __main__ - Step 72868: {'lr': 0.0002666137771545475, 'samples': 13990656, 'steps': 72867, 'loss/train': 1.5861655473709106} 11/07/2021 07:27:37 - INFO - __main__ - Step 72869: {'lr': 0.0002666084821374622, 'samples': 13990848, 'steps': 72868, 'loss/train': 1.6780692338943481} 11/07/2021 07:27:37 - INFO - __main__ - Step 72870: {'lr': 0.00026660318711289334, 'samples': 13991040, 'steps': 72869, 'loss/train': 1.6725068092346191} 11/07/2021 07:27:37 - INFO - __main__ - Step 72871: {'lr': 0.0002665978920808433, 'samples': 13991232, 'steps': 72870, 'loss/train': 1.1656224727630615} 11/07/2021 07:27:39 - INFO - __main__ - Step 72872: {'lr': 0.0002665925970413147, 'samples': 13991424, 'steps': 72871, 'loss/train': 1.3603471517562866} 11/07/2021 07:27:39 - INFO - __main__ - Step 72873: {'lr': 0.0002665873019943096, 'samples': 13991616, 'steps': 72872, 'loss/train': 1.5481088161468506} 11/07/2021 07:27:39 - INFO - __main__ - Step 72874: {'lr': 0.00026658200693983045, 'samples': 13991808, 'steps': 72873, 'loss/train': 1.3072181940078735} 11/07/2021 07:27:40 - INFO - __main__ - Step 72875: {'lr': 0.0002665767118778798, 'samples': 13992000, 'steps': 72874, 'loss/train': 1.438332200050354} 11/07/2021 07:27:40 - INFO - __main__ - Step 72876: {'lr': 0.00026657141680845993, 'samples': 13992192, 'steps': 72875, 'loss/train': 0.9690179824829102} 11/07/2021 07:27:40 - INFO - __main__ - Step 72877: {'lr': 0.0002665661217315732, 'samples': 13992384, 'steps': 72876, 'loss/train': 0.53326416015625} 11/07/2021 07:27:41 - INFO - __main__ - Step 72878: {'lr': 0.000266560826647222, 'samples': 13992576, 'steps': 72877, 'loss/train': 1.5958470106124878} 11/07/2021 07:27:42 - INFO - __main__ - Step 72879: {'lr': 0.00026655553155540887, 'samples': 13992768, 'steps': 72878, 'loss/train': 1.1398341655731201} 11/07/2021 07:27:42 - INFO - __main__ - Step 72880: {'lr': 0.000266550236456136, 'samples': 13992960, 'steps': 72879, 'loss/train': 1.3675063848495483} 11/07/2021 07:27:42 - INFO - __main__ - Step 72881: {'lr': 0.00026654494134940583, 'samples': 13993152, 'steps': 72880, 'loss/train': 1.514992117881775} 11/07/2021 07:27:43 - INFO - __main__ - Step 72882: {'lr': 0.00026653964623522076, 'samples': 13993344, 'steps': 72881, 'loss/train': 1.6758196353912354} 11/07/2021 07:27:44 - INFO - __main__ - Step 72883: {'lr': 0.0002665343511135832, 'samples': 13993536, 'steps': 72882, 'loss/train': 1.209774374961853} 11/07/2021 07:27:44 - INFO - __main__ - Step 72884: {'lr': 0.0002665290559844955, 'samples': 13993728, 'steps': 72883, 'loss/train': 1.162291407585144} 11/07/2021 07:27:45 - INFO - __main__ - Step 72885: {'lr': 0.00026652376084796006, 'samples': 13993920, 'steps': 72884, 'loss/train': 0.848772406578064} 11/07/2021 07:27:45 - INFO - __main__ - Step 72886: {'lr': 0.0002665184657039794, 'samples': 13994112, 'steps': 72885, 'loss/train': 1.3977771997451782} 11/07/2021 07:27:45 - INFO - __main__ - Step 72887: {'lr': 0.0002665131705525556, 'samples': 13994304, 'steps': 72886, 'loss/train': 1.2313624620437622} 11/07/2021 07:27:46 - INFO - __main__ - Step 72888: {'lr': 0.00026650787539369127, 'samples': 13994496, 'steps': 72887, 'loss/train': 1.9203002452850342} 11/07/2021 07:27:47 - INFO - __main__ - Step 72889: {'lr': 0.00026650258022738876, 'samples': 13994688, 'steps': 72888, 'loss/train': 1.3820708990097046} 11/07/2021 07:27:47 - INFO - __main__ - Step 72890: {'lr': 0.0002664972850536505, 'samples': 13994880, 'steps': 72889, 'loss/train': 1.2722514867782593} 11/07/2021 07:27:47 - INFO - __main__ - Step 72891: {'lr': 0.0002664919898724787, 'samples': 13995072, 'steps': 72890, 'loss/train': 1.3983030319213867} 11/07/2021 07:27:48 - INFO - __main__ - Step 72892: {'lr': 0.00026648669468387593, 'samples': 13995264, 'steps': 72891, 'loss/train': 1.987378478050232} 11/07/2021 07:27:49 - INFO - __main__ - Step 72893: {'lr': 0.0002664813994878445, 'samples': 13995456, 'steps': 72892, 'loss/train': 1.4821940660476685} 11/07/2021 07:27:49 - INFO - __main__ - Step 72894: {'lr': 0.00026647610428438676, 'samples': 13995648, 'steps': 72893, 'loss/train': 1.4667367935180664} 11/07/2021 07:27:49 - INFO - __main__ - Step 72895: {'lr': 0.00026647080907350523, 'samples': 13995840, 'steps': 72894, 'loss/train': 0.8999030590057373} 11/07/2021 07:27:50 - INFO - __main__ - Step 72896: {'lr': 0.00026646551385520217, 'samples': 13996032, 'steps': 72895, 'loss/train': 0.9514999389648438} 11/07/2021 07:27:50 - INFO - __main__ - Step 72897: {'lr': 0.00026646021862948, 'samples': 13996224, 'steps': 72896, 'loss/train': 1.2365260124206543} 11/07/2021 07:27:51 - INFO - __main__ - Step 72898: {'lr': 0.00026645492339634106, 'samples': 13996416, 'steps': 72897, 'loss/train': 1.2557251453399658} 11/07/2021 07:27:51 - INFO - __main__ - Step 72899: {'lr': 0.00026644962815578795, 'samples': 13996608, 'steps': 72898, 'loss/train': 1.453535795211792} 11/07/2021 07:27:52 - INFO - __main__ - Step 72900: {'lr': 0.00026644433290782274, 'samples': 13996800, 'steps': 72899, 'loss/train': 1.435848355293274} 11/07/2021 07:27:52 - INFO - __main__ - Step 72901: {'lr': 0.000266439037652448, 'samples': 13996992, 'steps': 72900, 'loss/train': 1.133918285369873} 11/07/2021 07:27:52 - INFO - __main__ - Step 72902: {'lr': 0.0002664337423896661, 'samples': 13997184, 'steps': 72901, 'loss/train': 1.3967511653900146} 11/07/2021 07:27:53 - INFO - __main__ - Step 72903: {'lr': 0.00026642844711947933, 'samples': 13997376, 'steps': 72902, 'loss/train': 1.4561667442321777} 11/07/2021 07:27:54 - INFO - __main__ - Step 72904: {'lr': 0.00026642315184189025, 'samples': 13997568, 'steps': 72903, 'loss/train': 1.6766692399978638} 11/07/2021 07:27:54 - INFO - __main__ - Step 72905: {'lr': 0.0002664178565569011, 'samples': 13997760, 'steps': 72904, 'loss/train': 1.639355182647705} 11/07/2021 07:27:54 - INFO - __main__ - Step 72906: {'lr': 0.0002664125612645144, 'samples': 13997952, 'steps': 72905, 'loss/train': 1.3765538930892944} 11/07/2021 07:27:55 - INFO - __main__ - Step 72907: {'lr': 0.00026640726596473236, 'samples': 13998144, 'steps': 72906, 'loss/train': 1.1991841793060303} 11/07/2021 07:27:55 - INFO - __main__ - Step 72908: {'lr': 0.0002664019706575575, 'samples': 13998336, 'steps': 72907, 'loss/train': 1.5244472026824951} 11/07/2021 07:27:56 - INFO - __main__ - Step 72909: {'lr': 0.00026639667534299216, 'samples': 13998528, 'steps': 72908, 'loss/train': 1.2564074993133545} 11/07/2021 07:27:57 - INFO - __main__ - Step 72910: {'lr': 0.0002663913800210387, 'samples': 13998720, 'steps': 72909, 'loss/train': 1.1488145589828491} 11/07/2021 07:27:57 - INFO - __main__ - Step 72911: {'lr': 0.0002663860846916996, 'samples': 13998912, 'steps': 72910, 'loss/train': 1.727694034576416} 11/07/2021 07:27:57 - INFO - __main__ - Step 72912: {'lr': 0.00026638078935497714, 'samples': 13999104, 'steps': 72911, 'loss/train': 1.4885048866271973} 11/07/2021 07:27:58 - INFO - __main__ - Step 72913: {'lr': 0.0002663754940108738, 'samples': 13999296, 'steps': 72912, 'loss/train': 1.319175124168396} 11/07/2021 07:27:59 - INFO - __main__ - Step 72914: {'lr': 0.0002663701986593918, 'samples': 13999488, 'steps': 72913, 'loss/train': 1.6695107221603394} 11/07/2021 07:27:59 - INFO - __main__ - Step 72915: {'lr': 0.00026636490330053376, 'samples': 13999680, 'steps': 72914, 'loss/train': 1.6223137378692627} 11/07/2021 07:27:59 - INFO - __main__ - Step 72916: {'lr': 0.00026635960793430194, 'samples': 13999872, 'steps': 72915, 'loss/train': 1.4079495668411255} 11/07/2021 07:28:00 - INFO - __main__ - Step 72917: {'lr': 0.00026635431256069863, 'samples': 14000064, 'steps': 72916, 'loss/train': 1.6067932844161987} 11/07/2021 07:28:00 - INFO - __main__ - Step 72918: {'lr': 0.00026634901717972637, 'samples': 14000256, 'steps': 72917, 'loss/train': 1.5842949151992798} 11/07/2021 07:28:01 - INFO - __main__ - Step 72919: {'lr': 0.00026634372179138756, 'samples': 14000448, 'steps': 72918, 'loss/train': 2.1755545139312744} 11/07/2021 07:28:01 - INFO - __main__ - Step 72920: {'lr': 0.00026633842639568446, 'samples': 14000640, 'steps': 72919, 'loss/train': 1.169811725616455} 11/07/2021 07:28:02 - INFO - __main__ - Step 72921: {'lr': 0.00026633313099261953, 'samples': 14000832, 'steps': 72920, 'loss/train': 1.5910104513168335} 11/07/2021 07:28:02 - INFO - __main__ - Step 72922: {'lr': 0.00026632783558219514, 'samples': 14001024, 'steps': 72921, 'loss/train': 1.420208215713501} 11/07/2021 07:28:02 - INFO - __main__ - Step 72923: {'lr': 0.0002663225401644137, 'samples': 14001216, 'steps': 72922, 'loss/train': 0.2727329730987549} 11/07/2021 07:28:03 - INFO - __main__ - Step 72924: {'lr': 0.00026631724473927753, 'samples': 14001408, 'steps': 72923, 'loss/train': 1.478174090385437} 11/07/2021 07:28:04 - INFO - __main__ - Step 72925: {'lr': 0.0002663119493067891, 'samples': 14001600, 'steps': 72924, 'loss/train': 0.9086737632751465} 11/07/2021 07:28:04 - INFO - __main__ - Step 72926: {'lr': 0.0002663066538669507, 'samples': 14001792, 'steps': 72925, 'loss/train': 1.5837544202804565} 11/07/2021 07:28:04 - INFO - __main__ - Step 72927: {'lr': 0.0002663013584197649, 'samples': 14001984, 'steps': 72926, 'loss/train': 1.2144746780395508} 11/07/2021 07:28:05 - INFO - __main__ - Step 72928: {'lr': 0.00026629606296523384, 'samples': 14002176, 'steps': 72927, 'loss/train': 1.3151212930679321} 11/07/2021 07:28:06 - INFO - __main__ - Step 72929: {'lr': 0.00026629076750336005, 'samples': 14002368, 'steps': 72928, 'loss/train': 1.3898311853408813} 11/07/2021 07:28:06 - INFO - __main__ - Step 72930: {'lr': 0.0002662854720341459, 'samples': 14002560, 'steps': 72929, 'loss/train': 0.6865581274032593} 11/07/2021 07:28:07 - INFO - __main__ - Step 72931: {'lr': 0.0002662801765575937, 'samples': 14002752, 'steps': 72930, 'loss/train': 1.566680669784546} 11/07/2021 07:28:07 - INFO - __main__ - Step 72932: {'lr': 0.000266274881073706, 'samples': 14002944, 'steps': 72931, 'loss/train': 1.6072379350662231} 11/07/2021 07:28:07 - INFO - __main__ - Step 72933: {'lr': 0.00026626958558248514, 'samples': 14003136, 'steps': 72932, 'loss/train': 1.3341734409332275} 11/07/2021 07:28:09 - INFO - __main__ - Step 72934: {'lr': 0.0002662642900839334, 'samples': 14003328, 'steps': 72933, 'loss/train': 1.6749467849731445} 11/07/2021 07:28:09 - INFO - __main__ - Step 72935: {'lr': 0.0002662589945780531, 'samples': 14003520, 'steps': 72934, 'loss/train': 0.694390058517456} 11/07/2021 07:28:10 - INFO - __main__ - Step 72936: {'lr': 0.0002662536990648469, 'samples': 14003712, 'steps': 72935, 'loss/train': 1.2723716497421265} 11/07/2021 07:28:10 - INFO - __main__ - Step 72937: {'lr': 0.000266248403544317, 'samples': 14003904, 'steps': 72936, 'loss/train': 0.9914527535438538} 11/07/2021 07:28:10 - INFO - __main__ - Step 72938: {'lr': 0.00026624310801646577, 'samples': 14004096, 'steps': 72937, 'loss/train': 1.3188284635543823} 11/07/2021 07:28:11 - INFO - __main__ - Step 72939: {'lr': 0.00026623781248129574, 'samples': 14004288, 'steps': 72938, 'loss/train': 2.2075531482696533} 11/07/2021 07:28:12 - INFO - __main__ - Step 72940: {'lr': 0.0002662325169388091, 'samples': 14004480, 'steps': 72939, 'loss/train': 1.7176084518432617} 11/07/2021 07:28:12 - INFO - __main__ - Step 72941: {'lr': 0.0002662272213890084, 'samples': 14004672, 'steps': 72940, 'loss/train': 1.896933674812317} 11/07/2021 07:28:12 - INFO - __main__ - Step 72942: {'lr': 0.000266221925831896, 'samples': 14004864, 'steps': 72941, 'loss/train': 0.9730722904205322} 11/07/2021 07:28:13 - INFO - __main__ - Step 72943: {'lr': 0.0002662166302674741, 'samples': 14005056, 'steps': 72942, 'loss/train': 1.2019426822662354} 11/07/2021 07:28:13 - INFO - __main__ - Step 72944: {'lr': 0.0002662113346957454, 'samples': 14005248, 'steps': 72943, 'loss/train': 1.2654707431793213} 11/07/2021 07:28:14 - INFO - __main__ - Step 72945: {'lr': 0.000266206039116712, 'samples': 14005440, 'steps': 72944, 'loss/train': 1.773017168045044} 11/07/2021 07:28:14 - INFO - __main__ - Step 72946: {'lr': 0.00026620074353037656, 'samples': 14005632, 'steps': 72945, 'loss/train': 1.7588294744491577} 11/07/2021 07:28:15 - INFO - __main__ - Step 72947: {'lr': 0.0002661954479367412, 'samples': 14005824, 'steps': 72946, 'loss/train': 1.4080243110656738} 11/07/2021 07:28:15 - INFO - __main__ - Step 72948: {'lr': 0.0002661901523358084, 'samples': 14006016, 'steps': 72947, 'loss/train': 1.4687405824661255} 11/07/2021 07:28:15 - INFO - __main__ - Step 72949: {'lr': 0.00026618485672758057, 'samples': 14006208, 'steps': 72948, 'loss/train': 1.9796648025512695} 11/07/2021 07:28:16 - INFO - __main__ - Step 72950: {'lr': 0.00026617956111206015, 'samples': 14006400, 'steps': 72949, 'loss/train': 1.2129658460617065} 11/07/2021 07:28:17 - INFO - __main__ - Step 72951: {'lr': 0.00026617426548924944, 'samples': 14006592, 'steps': 72950, 'loss/train': 1.6596205234527588} 11/07/2021 07:28:17 - INFO - __main__ - Step 72952: {'lr': 0.00026616896985915084, 'samples': 14006784, 'steps': 72951, 'loss/train': 1.2240575551986694} 11/07/2021 07:28:18 - INFO - __main__ - Step 72953: {'lr': 0.00026616367422176683, 'samples': 14006976, 'steps': 72952, 'loss/train': 1.133549451828003} 11/07/2021 07:28:18 - INFO - __main__ - Step 72954: {'lr': 0.0002661583785770997, 'samples': 14007168, 'steps': 72953, 'loss/train': 1.4454625844955444} 11/07/2021 07:28:18 - INFO - __main__ - Step 72955: {'lr': 0.00026615308292515176, 'samples': 14007360, 'steps': 72954, 'loss/train': 1.6113765239715576} 11/07/2021 07:28:19 - INFO - __main__ - Step 72956: {'lr': 0.00026614778726592557, 'samples': 14007552, 'steps': 72955, 'loss/train': 1.7321571111679077} 11/07/2021 07:28:20 - INFO - __main__ - Step 72957: {'lr': 0.00026614249159942336, 'samples': 14007744, 'steps': 72956, 'loss/train': 1.5269185304641724} 11/07/2021 07:28:20 - INFO - __main__ - Step 72958: {'lr': 0.00026613719592564767, 'samples': 14007936, 'steps': 72957, 'loss/train': 1.211384892463684} 11/07/2021 07:28:20 - INFO - __main__ - Step 72959: {'lr': 0.00026613190024460083, 'samples': 14008128, 'steps': 72958, 'loss/train': 1.785028338432312} 11/07/2021 07:28:21 - INFO - __main__ - Step 72960: {'lr': 0.0002661266045562852, 'samples': 14008320, 'steps': 72959, 'loss/train': 1.1162476539611816} 11/07/2021 07:28:22 - INFO - __main__ - Step 72961: {'lr': 0.00026612130886070315, 'samples': 14008512, 'steps': 72960, 'loss/train': 1.786006212234497} 11/07/2021 07:28:22 - INFO - __main__ - Step 72962: {'lr': 0.000266116013157857, 'samples': 14008704, 'steps': 72961, 'loss/train': 1.563524603843689} 11/07/2021 07:28:22 - INFO - __main__ - Step 72963: {'lr': 0.0002661107174477493, 'samples': 14008896, 'steps': 72962, 'loss/train': 1.5605531930923462} 11/07/2021 07:28:23 - INFO - __main__ - Step 72964: {'lr': 0.0002661054217303823, 'samples': 14009088, 'steps': 72963, 'loss/train': 1.5667400360107422} 11/07/2021 07:28:23 - INFO - __main__ - Step 72965: {'lr': 0.0002661001260057586, 'samples': 14009280, 'steps': 72964, 'loss/train': 1.2000703811645508} 11/07/2021 07:28:24 - INFO - __main__ - Step 72966: {'lr': 0.00026609483027388033, 'samples': 14009472, 'steps': 72965, 'loss/train': 1.4331624507904053} 11/07/2021 07:28:24 - INFO - __main__ - Step 72967: {'lr': 0.0002660895345347499, 'samples': 14009664, 'steps': 72966, 'loss/train': 1.229699969291687} 11/07/2021 07:28:25 - INFO - __main__ - Step 72968: {'lr': 0.0002660842387883699, 'samples': 14009856, 'steps': 72967, 'loss/train': 1.5781819820404053} 11/07/2021 07:28:25 - INFO - __main__ - Step 72969: {'lr': 0.0002660789430347425, 'samples': 14010048, 'steps': 72968, 'loss/train': 1.1665468215942383} 11/07/2021 07:28:25 - INFO - __main__ - Step 72970: {'lr': 0.0002660736472738702, 'samples': 14010240, 'steps': 72969, 'loss/train': 0.573979914188385} 11/07/2021 07:28:26 - INFO - __main__ - Step 72971: {'lr': 0.00026606835150575544, 'samples': 14010432, 'steps': 72970, 'loss/train': 1.7927309274673462} 11/07/2021 07:28:27 - INFO - __main__ - Step 72972: {'lr': 0.0002660630557304004, 'samples': 14010624, 'steps': 72971, 'loss/train': 1.566294550895691} 11/07/2021 07:28:27 - INFO - __main__ - Step 72973: {'lr': 0.00026605775994780774, 'samples': 14010816, 'steps': 72972, 'loss/train': 1.2573753595352173} 11/07/2021 07:28:28 - INFO - __main__ - Step 72974: {'lr': 0.00026605246415797965, 'samples': 14011008, 'steps': 72973, 'loss/train': 1.4052484035491943} 11/07/2021 07:28:28 - INFO - __main__ - Step 72975: {'lr': 0.0002660471683609185, 'samples': 14011200, 'steps': 72974, 'loss/train': 1.7418931722640991} 11/07/2021 07:28:29 - INFO - __main__ - Step 72976: {'lr': 0.0002660418725566268, 'samples': 14011392, 'steps': 72975, 'loss/train': 0.47976770997047424} 11/07/2021 07:28:29 - INFO - __main__ - Step 72977: {'lr': 0.00026603657674510684, 'samples': 14011584, 'steps': 72976, 'loss/train': 1.605864405632019} 11/07/2021 07:28:30 - INFO - __main__ - Step 72978: {'lr': 0.0002660312809263611, 'samples': 14011776, 'steps': 72977, 'loss/train': 1.3128143548965454} 11/07/2021 07:28:30 - INFO - __main__ - Step 72979: {'lr': 0.0002660259851003919, 'samples': 14011968, 'steps': 72978, 'loss/train': 1.2972990274429321} 11/07/2021 07:28:30 - INFO - __main__ - Step 72980: {'lr': 0.0002660206892672016, 'samples': 14012160, 'steps': 72979, 'loss/train': 1.445817470550537} 11/07/2021 07:28:31 - INFO - __main__ - Step 72981: {'lr': 0.00026601539342679264, 'samples': 14012352, 'steps': 72980, 'loss/train': 1.368135929107666} 11/07/2021 07:28:32 - INFO - __main__ - Step 72982: {'lr': 0.0002660100975791674, 'samples': 14012544, 'steps': 72981, 'loss/train': 1.23689866065979} 11/07/2021 07:28:32 - INFO - __main__ - Step 72983: {'lr': 0.0002660048017243282, 'samples': 14012736, 'steps': 72982, 'loss/train': 1.5363876819610596} 11/07/2021 07:28:32 - INFO - __main__ - Step 72984: {'lr': 0.00026599950586227763, 'samples': 14012928, 'steps': 72983, 'loss/train': 1.3538293838500977} 11/07/2021 07:28:33 - INFO - __main__ - Step 72985: {'lr': 0.0002659942099930178, 'samples': 14013120, 'steps': 72984, 'loss/train': 1.4507176876068115} 11/07/2021 07:28:34 - INFO - __main__ - Step 72986: {'lr': 0.00026598891411655127, 'samples': 14013312, 'steps': 72985, 'loss/train': 1.7070355415344238} 11/07/2021 07:28:34 - INFO - __main__ - Step 72987: {'lr': 0.0002659836182328804, 'samples': 14013504, 'steps': 72986, 'loss/train': 1.3079071044921875} 11/07/2021 07:28:35 - INFO - __main__ - Step 72988: {'lr': 0.00026597832234200746, 'samples': 14013696, 'steps': 72987, 'loss/train': 1.4215214252471924} 11/07/2021 07:28:35 - INFO - __main__ - Step 72989: {'lr': 0.0002659730264439351, 'samples': 14013888, 'steps': 72988, 'loss/train': 2.160220146179199} 11/07/2021 07:28:35 - INFO - __main__ - Step 72990: {'lr': 0.0002659677305386654, 'samples': 14014080, 'steps': 72989, 'loss/train': 1.4182002544403076} 11/07/2021 07:28:36 - INFO - __main__ - Step 72991: {'lr': 0.000265962434626201, 'samples': 14014272, 'steps': 72990, 'loss/train': 1.3951705694198608} 11/07/2021 07:28:37 - INFO - __main__ - Step 72992: {'lr': 0.00026595713870654407, 'samples': 14014464, 'steps': 72991, 'loss/train': 1.538543701171875} 11/07/2021 07:28:37 - INFO - __main__ - Step 72993: {'lr': 0.00026595184277969716, 'samples': 14014656, 'steps': 72992, 'loss/train': 1.4402366876602173} 11/07/2021 07:28:37 - INFO - __main__ - Step 72994: {'lr': 0.0002659465468456626, 'samples': 14014848, 'steps': 72993, 'loss/train': 1.6505557298660278} 11/07/2021 07:28:38 - INFO - __main__ - Step 72995: {'lr': 0.00026594125090444274, 'samples': 14015040, 'steps': 72994, 'loss/train': 1.1958142518997192} 11/07/2021 07:28:38 - INFO - __main__ - Step 72996: {'lr': 0.00026593595495604, 'samples': 14015232, 'steps': 72995, 'loss/train': 1.2537226676940918} 11/07/2021 07:28:39 - INFO - __main__ - Step 72997: {'lr': 0.00026593065900045674, 'samples': 14015424, 'steps': 72996, 'loss/train': 1.8078463077545166} 11/07/2021 07:28:39 - INFO - __main__ - Step 72998: {'lr': 0.00026592536303769536, 'samples': 14015616, 'steps': 72997, 'loss/train': 1.472892165184021} 11/07/2021 07:28:40 - INFO - __main__ - Step 72999: {'lr': 0.00026592006706775836, 'samples': 14015808, 'steps': 72998, 'loss/train': 1.0550599098205566} 11/07/2021 07:28:40 - INFO - __main__ - Step 73000: {'lr': 0.000265914771090648, 'samples': 14016000, 'steps': 72999, 'loss/train': 1.311156988143921} 11/07/2021 07:28:41 - INFO - __main__ - Step 73001: {'lr': 0.0002659094751063666, 'samples': 14016192, 'steps': 73000, 'loss/train': 1.423760175704956} 11/07/2021 07:28:42 - INFO - __main__ - Step 73002: {'lr': 0.0002659041791149167, 'samples': 14016384, 'steps': 73001, 'loss/train': 0.9450411796569824} 11/07/2021 07:28:42 - INFO - __main__ - Step 73003: {'lr': 0.0002658988831163006, 'samples': 14016576, 'steps': 73002, 'loss/train': 0.663648247718811} 11/07/2021 07:28:42 - INFO - __main__ - Step 73004: {'lr': 0.0002658935871105207, 'samples': 14016768, 'steps': 73003, 'loss/train': 1.5997763872146606} 11/07/2021 07:28:43 - INFO - __main__ - Step 73005: {'lr': 0.0002658882910975794, 'samples': 14016960, 'steps': 73004, 'loss/train': 2.068537712097168} 11/07/2021 07:28:43 - INFO - __main__ - Step 73006: {'lr': 0.0002658829950774791, 'samples': 14017152, 'steps': 73005, 'loss/train': 1.681386113166809} 11/07/2021 07:28:44 - INFO - __main__ - Step 73007: {'lr': 0.0002658776990502222, 'samples': 14017344, 'steps': 73006, 'loss/train': 1.754514217376709} 11/07/2021 07:28:44 - INFO - __main__ - Step 73008: {'lr': 0.000265872403015811, 'samples': 14017536, 'steps': 73007, 'loss/train': 1.1365092992782593} 11/07/2021 07:28:45 - INFO - __main__ - Step 73009: {'lr': 0.00026586710697424796, 'samples': 14017728, 'steps': 73008, 'loss/train': 1.520639181137085} 11/07/2021 07:28:45 - INFO - __main__ - Step 73010: {'lr': 0.00026586181092553543, 'samples': 14017920, 'steps': 73009, 'loss/train': 1.275890827178955} 11/07/2021 07:28:45 - INFO - __main__ - Step 73011: {'lr': 0.00026585651486967584, 'samples': 14018112, 'steps': 73010, 'loss/train': 1.0587151050567627} 11/07/2021 07:28:46 - INFO - __main__ - Step 73012: {'lr': 0.0002658512188066715, 'samples': 14018304, 'steps': 73011, 'loss/train': 0.8956709504127502} 11/07/2021 07:28:47 - INFO - __main__ - Step 73013: {'lr': 0.0002658459227365249, 'samples': 14018496, 'steps': 73012, 'loss/train': 1.6238585710525513} 11/07/2021 07:28:47 - INFO - __main__ - Step 73014: {'lr': 0.0002658406266592384, 'samples': 14018688, 'steps': 73013, 'loss/train': 1.449949860572815} 11/07/2021 07:28:47 - INFO - __main__ - Step 73015: {'lr': 0.0002658353305748143, 'samples': 14018880, 'steps': 73014, 'loss/train': 1.3801509141921997} 11/07/2021 07:28:48 - INFO - __main__ - Step 73016: {'lr': 0.00026583003448325506, 'samples': 14019072, 'steps': 73015, 'loss/train': 1.0640230178833008} 11/07/2021 07:28:49 - INFO - __main__ - Step 73017: {'lr': 0.00026582473838456303, 'samples': 14019264, 'steps': 73016, 'loss/train': 1.3708451986312866} 11/07/2021 07:28:49 - INFO - __main__ - Step 73018: {'lr': 0.00026581944227874063, 'samples': 14019456, 'steps': 73017, 'loss/train': 1.4474126100540161} 11/07/2021 07:28:50 - INFO - __main__ - Step 73019: {'lr': 0.0002658141461657902, 'samples': 14019648, 'steps': 73018, 'loss/train': 1.2113916873931885} 11/07/2021 07:28:50 - INFO - __main__ - Step 73020: {'lr': 0.0002658088500457142, 'samples': 14019840, 'steps': 73019, 'loss/train': 1.3512011766433716} 11/07/2021 07:28:50 - INFO - __main__ - Step 73021: {'lr': 0.00026580355391851495, 'samples': 14020032, 'steps': 73020, 'loss/train': 0.8127725124359131} 11/07/2021 07:28:52 - INFO - __main__ - Step 73022: {'lr': 0.0002657982577841949, 'samples': 14020224, 'steps': 73021, 'loss/train': 1.439576268196106} 11/07/2021 07:28:52 - INFO - __main__ - Step 73023: {'lr': 0.0002657929616427564, 'samples': 14020416, 'steps': 73022, 'loss/train': 1.561935544013977} 11/07/2021 07:28:52 - INFO - __main__ - Step 73024: {'lr': 0.0002657876654942018, 'samples': 14020608, 'steps': 73023, 'loss/train': 1.2753077745437622} 11/07/2021 07:28:53 - INFO - __main__ - Step 73025: {'lr': 0.0002657823693385335, 'samples': 14020800, 'steps': 73024, 'loss/train': 1.0346366167068481} 11/07/2021 07:28:53 - INFO - __main__ - Step 73026: {'lr': 0.00026577707317575395, 'samples': 14020992, 'steps': 73025, 'loss/train': 1.5751335620880127} 11/07/2021 07:28:53 - INFO - __main__ - Step 73027: {'lr': 0.0002657717770058655, 'samples': 14021184, 'steps': 73026, 'loss/train': 1.4302908182144165} 11/07/2021 07:28:55 - INFO - __main__ - Step 73028: {'lr': 0.00026576648082887055, 'samples': 14021376, 'steps': 73027, 'loss/train': 1.1779491901397705} 11/07/2021 07:28:55 - INFO - __main__ - Step 73029: {'lr': 0.00026576118464477147, 'samples': 14021568, 'steps': 73028, 'loss/train': 1.5523583889007568} 11/07/2021 07:28:55 - INFO - __main__ - Step 73030: {'lr': 0.0002657558884535706, 'samples': 14021760, 'steps': 73029, 'loss/train': 0.9249657988548279} 11/07/2021 07:28:56 - INFO - __main__ - Step 73031: {'lr': 0.00026575059225527036, 'samples': 14021952, 'steps': 73030, 'loss/train': 1.2477401494979858} 11/07/2021 07:28:56 - INFO - __main__ - Step 73032: {'lr': 0.0002657452960498731, 'samples': 14022144, 'steps': 73031, 'loss/train': 1.5845158100128174} 11/07/2021 07:28:57 - INFO - __main__ - Step 73033: {'lr': 0.0002657399998373813, 'samples': 14022336, 'steps': 73032, 'loss/train': 1.9652020931243896} 11/07/2021 07:28:57 - INFO - __main__ - Step 73034: {'lr': 0.00026573470361779744, 'samples': 14022528, 'steps': 73033, 'loss/train': 1.5447072982788086} 11/07/2021 07:28:58 - INFO - __main__ - Step 73035: {'lr': 0.00026572940739112363, 'samples': 14022720, 'steps': 73034, 'loss/train': 0.9449756741523743} 11/07/2021 07:28:58 - INFO - __main__ - Step 73036: {'lr': 0.0002657241111573624, 'samples': 14022912, 'steps': 73035, 'loss/train': 1.0160529613494873} 11/07/2021 07:28:58 - INFO - __main__ - Step 73037: {'lr': 0.0002657188149165161, 'samples': 14023104, 'steps': 73036, 'loss/train': 1.5365455150604248} 11/07/2021 07:28:59 - INFO - __main__ - Step 73038: {'lr': 0.0002657135186685872, 'samples': 14023296, 'steps': 73037, 'loss/train': 1.338026762008667} 11/07/2021 07:29:00 - INFO - __main__ - Step 73039: {'lr': 0.00026570822241357803, 'samples': 14023488, 'steps': 73038, 'loss/train': 1.3506667613983154} 11/07/2021 07:29:00 - INFO - __main__ - Step 73040: {'lr': 0.00026570292615149093, 'samples': 14023680, 'steps': 73039, 'loss/train': 1.6842823028564453} 11/07/2021 07:29:01 - INFO - __main__ - Step 73041: {'lr': 0.0002656976298823284, 'samples': 14023872, 'steps': 73040, 'loss/train': 1.4819440841674805} 11/07/2021 07:29:01 - INFO - __main__ - Step 73042: {'lr': 0.00026569233360609266, 'samples': 14024064, 'steps': 73041, 'loss/train': 1.6497441530227661} 11/07/2021 07:29:01 - INFO - __main__ - Step 73043: {'lr': 0.0002656870373227863, 'samples': 14024256, 'steps': 73042, 'loss/train': 1.2973911762237549} 11/07/2021 07:29:02 - INFO - __main__ - Step 73044: {'lr': 0.0002656817410324116, 'samples': 14024448, 'steps': 73043, 'loss/train': 0.30129364132881165} 11/07/2021 07:29:03 - INFO - __main__ - Step 73045: {'lr': 0.0002656764447349708, 'samples': 14024640, 'steps': 73044, 'loss/train': 1.6499241590499878} 11/07/2021 07:29:03 - INFO - __main__ - Step 73046: {'lr': 0.0002656711484304666, 'samples': 14024832, 'steps': 73045, 'loss/train': 1.3945369720458984} 11/07/2021 07:29:04 - INFO - __main__ - Step 73047: {'lr': 0.0002656658521189012, 'samples': 14025024, 'steps': 73046, 'loss/train': 1.6287891864776611} 11/07/2021 07:29:04 - INFO - __main__ - Step 73048: {'lr': 0.000265660555800277, 'samples': 14025216, 'steps': 73047, 'loss/train': 1.0802373886108398} 11/07/2021 07:29:04 - INFO - __main__ - Step 73049: {'lr': 0.0002656552594745963, 'samples': 14025408, 'steps': 73048, 'loss/train': 1.4531779289245605} 11/07/2021 07:29:05 - INFO - __main__ - Step 73050: {'lr': 0.0002656499631418617, 'samples': 14025600, 'steps': 73049, 'loss/train': 0.4130958318710327} 11/07/2021 07:29:06 - INFO - __main__ - Step 73051: {'lr': 0.0002656446668020754, 'samples': 14025792, 'steps': 73050, 'loss/train': 1.360568881034851} 11/07/2021 07:29:06 - INFO - __main__ - Step 73052: {'lr': 0.00026563937045523986, 'samples': 14025984, 'steps': 73051, 'loss/train': 1.646904706954956} 11/07/2021 07:29:06 - INFO - __main__ - Step 73053: {'lr': 0.0002656340741013575, 'samples': 14026176, 'steps': 73052, 'loss/train': 1.7037334442138672} 11/07/2021 07:29:07 - INFO - __main__ - Step 73054: {'lr': 0.00026562877774043066, 'samples': 14026368, 'steps': 73053, 'loss/train': 1.6027559041976929} 11/07/2021 07:29:08 - INFO - __main__ - Step 73055: {'lr': 0.00026562348137246174, 'samples': 14026560, 'steps': 73054, 'loss/train': 1.6387618780136108} 11/07/2021 07:29:08 - INFO - __main__ - Step 73056: {'lr': 0.00026561818499745303, 'samples': 14026752, 'steps': 73055, 'loss/train': 0.6361719369888306} 11/07/2021 07:29:09 - INFO - __main__ - Step 73057: {'lr': 0.0002656128886154071, 'samples': 14026944, 'steps': 73056, 'loss/train': 1.3148664236068726} 11/07/2021 07:29:09 - INFO - __main__ - Step 73058: {'lr': 0.0002656075922263262, 'samples': 14027136, 'steps': 73057, 'loss/train': 1.407531499862671} 11/07/2021 07:29:09 - INFO - __main__ - Step 73059: {'lr': 0.0002656022958302128, 'samples': 14027328, 'steps': 73058, 'loss/train': 1.6813331842422485} 11/07/2021 07:29:10 - INFO - __main__ - Step 73060: {'lr': 0.00026559699942706926, 'samples': 14027520, 'steps': 73059, 'loss/train': 1.7398440837860107} 11/07/2021 07:29:11 - INFO - __main__ - Step 73061: {'lr': 0.00026559170301689787, 'samples': 14027712, 'steps': 73060, 'loss/train': 1.721442461013794} 11/07/2021 07:29:11 - INFO - __main__ - Step 73062: {'lr': 0.0002655864065997012, 'samples': 14027904, 'steps': 73061, 'loss/train': 1.6059670448303223} 11/07/2021 07:29:11 - INFO - __main__ - Step 73063: {'lr': 0.00026558111017548145, 'samples': 14028096, 'steps': 73062, 'loss/train': 1.5408258438110352} 11/07/2021 07:29:12 - INFO - __main__ - Step 73064: {'lr': 0.0002655758137442411, 'samples': 14028288, 'steps': 73063, 'loss/train': 1.9346352815628052} 11/07/2021 07:29:12 - INFO - __main__ - Step 73065: {'lr': 0.0002655705173059826, 'samples': 14028480, 'steps': 73064, 'loss/train': 1.1503026485443115} 11/07/2021 07:29:13 - INFO - __main__ - Step 73066: {'lr': 0.0002655652208607082, 'samples': 14028672, 'steps': 73065, 'loss/train': 1.2960200309753418} 11/07/2021 07:29:13 - INFO - __main__ - Step 73067: {'lr': 0.0002655599244084204, 'samples': 14028864, 'steps': 73066, 'loss/train': 1.2099710702896118} 11/07/2021 07:29:14 - INFO - __main__ - Step 73068: {'lr': 0.0002655546279491215, 'samples': 14029056, 'steps': 73067, 'loss/train': 1.7562971115112305} 11/07/2021 07:29:14 - INFO - __main__ - Step 73069: {'lr': 0.0002655493314828139, 'samples': 14029248, 'steps': 73068, 'loss/train': 1.5990136861801147} 11/07/2021 07:29:14 - INFO - __main__ - Step 73070: {'lr': 0.00026554403500950006, 'samples': 14029440, 'steps': 73069, 'loss/train': 1.2277911901474} 11/07/2021 07:29:15 - INFO - __main__ - Step 73071: {'lr': 0.0002655387385291823, 'samples': 14029632, 'steps': 73070, 'loss/train': 1.3970022201538086} 11/07/2021 07:29:16 - INFO - __main__ - Step 73072: {'lr': 0.000265533442041863, 'samples': 14029824, 'steps': 73071, 'loss/train': 1.5897657871246338} 11/07/2021 07:29:16 - INFO - __main__ - Step 73073: {'lr': 0.0002655281455475446, 'samples': 14030016, 'steps': 73072, 'loss/train': 1.4504876136779785} 11/07/2021 07:29:16 - INFO - __main__ - Step 73074: {'lr': 0.0002655228490462295, 'samples': 14030208, 'steps': 73073, 'loss/train': 1.4062799215316772} 11/07/2021 07:29:17 - INFO - __main__ - Step 73075: {'lr': 0.00026551755253792, 'samples': 14030400, 'steps': 73074, 'loss/train': 1.5136042833328247} 11/07/2021 07:29:18 - INFO - __main__ - Step 73076: {'lr': 0.0002655122560226185, 'samples': 14030592, 'steps': 73075, 'loss/train': 1.6451387405395508} 11/07/2021 07:29:18 - INFO - __main__ - Step 73077: {'lr': 0.00026550695950032743, 'samples': 14030784, 'steps': 73076, 'loss/train': 1.5550919771194458} 11/07/2021 07:29:18 - INFO - __main__ - Step 73078: {'lr': 0.0002655016629710492, 'samples': 14030976, 'steps': 73077, 'loss/train': 0.7822542190551758} 11/07/2021 07:29:19 - INFO - __main__ - Step 73079: {'lr': 0.00026549636643478615, 'samples': 14031168, 'steps': 73078, 'loss/train': 1.3916521072387695} 11/07/2021 07:29:19 - INFO - __main__ - Step 73080: {'lr': 0.00026549106989154066, 'samples': 14031360, 'steps': 73079, 'loss/train': 1.4726670980453491} 11/07/2021 07:29:20 - INFO - __main__ - Step 73081: {'lr': 0.0002654857733413152, 'samples': 14031552, 'steps': 73080, 'loss/train': 1.412758231163025} 11/07/2021 07:29:21 - INFO - __main__ - Step 73082: {'lr': 0.000265480476784112, 'samples': 14031744, 'steps': 73081, 'loss/train': 1.3774595260620117} 11/07/2021 07:29:21 - INFO - __main__ - Step 73083: {'lr': 0.00026547518021993353, 'samples': 14031936, 'steps': 73082, 'loss/train': 1.4081140756607056} 11/07/2021 07:29:21 - INFO - __main__ - Step 73084: {'lr': 0.00026546988364878224, 'samples': 14032128, 'steps': 73083, 'loss/train': 1.4229506254196167} 11/07/2021 07:29:22 - INFO - __main__ - Step 73085: {'lr': 0.0002654645870706604, 'samples': 14032320, 'steps': 73084, 'loss/train': 1.3699204921722412} 11/07/2021 07:29:23 - INFO - __main__ - Step 73086: {'lr': 0.0002654592904855705, 'samples': 14032512, 'steps': 73085, 'loss/train': 1.3876681327819824} 11/07/2021 07:29:23 - INFO - __main__ - Step 73087: {'lr': 0.00026545399389351493, 'samples': 14032704, 'steps': 73086, 'loss/train': 1.7291913032531738} 11/07/2021 07:29:23 - INFO - __main__ - Step 73088: {'lr': 0.00026544869729449596, 'samples': 14032896, 'steps': 73087, 'loss/train': 1.9715341329574585} 11/07/2021 07:29:24 - INFO - __main__ - Step 73089: {'lr': 0.00026544340068851603, 'samples': 14033088, 'steps': 73088, 'loss/train': 1.4785478115081787} 11/07/2021 07:29:24 - INFO - __main__ - Step 73090: {'lr': 0.00026543810407557753, 'samples': 14033280, 'steps': 73089, 'loss/train': 0.6350714564323425} 11/07/2021 07:29:25 - INFO - __main__ - Step 73091: {'lr': 0.00026543280745568295, 'samples': 14033472, 'steps': 73090, 'loss/train': 1.4049761295318604} 11/07/2021 07:29:25 - INFO - __main__ - Step 73092: {'lr': 0.00026542751082883446, 'samples': 14033664, 'steps': 73091, 'loss/train': 1.453104019165039} 11/07/2021 07:29:26 - INFO - __main__ - Step 73093: {'lr': 0.00026542221419503467, 'samples': 14033856, 'steps': 73092, 'loss/train': 0.8159093856811523} 11/07/2021 07:29:26 - INFO - __main__ - Step 73094: {'lr': 0.0002654169175542859, 'samples': 14034048, 'steps': 73093, 'loss/train': 1.1199350357055664} 11/07/2021 07:29:26 - INFO - __main__ - Step 73095: {'lr': 0.0002654116209065904, 'samples': 14034240, 'steps': 73094, 'loss/train': 1.3227519989013672} 11/07/2021 07:29:27 - INFO - __main__ - Step 73096: {'lr': 0.0002654063242519507, 'samples': 14034432, 'steps': 73095, 'loss/train': 2.3925058841705322} 11/07/2021 07:29:28 - INFO - __main__ - Step 73097: {'lr': 0.00026540102759036924, 'samples': 14034624, 'steps': 73096, 'loss/train': 1.7548182010650635} 11/07/2021 07:29:28 - INFO - __main__ - Step 73098: {'lr': 0.00026539573092184814, 'samples': 14034816, 'steps': 73097, 'loss/train': 1.1765769720077515} 11/07/2021 07:29:29 - INFO - __main__ - Step 73099: {'lr': 0.0002653904342463901, 'samples': 14035008, 'steps': 73098, 'loss/train': 1.488663911819458} 11/07/2021 07:29:29 - INFO - __main__ - Step 73100: {'lr': 0.0002653851375639973, 'samples': 14035200, 'steps': 73099, 'loss/train': 1.454494595527649} 11/07/2021 07:29:29 - INFO - __main__ - Step 73101: {'lr': 0.00026537984087467224, 'samples': 14035392, 'steps': 73100, 'loss/train': 1.0657116174697876} 11/07/2021 07:29:30 - INFO - __main__ - Step 73102: {'lr': 0.0002653745441784172, 'samples': 14035584, 'steps': 73101, 'loss/train': 1.6790345907211304} 11/07/2021 07:29:31 - INFO - __main__ - Step 73103: {'lr': 0.0002653692474752347, 'samples': 14035776, 'steps': 73102, 'loss/train': 2.009584903717041} 11/07/2021 07:29:31 - INFO - __main__ - Step 73104: {'lr': 0.00026536395076512696, 'samples': 14035968, 'steps': 73103, 'loss/train': 1.2411835193634033} 11/07/2021 07:29:31 - INFO - __main__ - Step 73105: {'lr': 0.00026535865404809654, 'samples': 14036160, 'steps': 73104, 'loss/train': 1.363701581954956} 11/07/2021 07:29:32 - INFO - __main__ - Step 73106: {'lr': 0.0002653533573241457, 'samples': 14036352, 'steps': 73105, 'loss/train': 1.8773647546768188} 11/07/2021 07:29:33 - INFO - __main__ - Step 73107: {'lr': 0.00026534806059327697, 'samples': 14036544, 'steps': 73106, 'loss/train': 1.3406614065170288} 11/07/2021 07:29:33 - INFO - __main__ - Step 73108: {'lr': 0.0002653427638554926, 'samples': 14036736, 'steps': 73107, 'loss/train': 1.1111726760864258} 11/07/2021 07:29:34 - INFO - __main__ - Step 73109: {'lr': 0.00026533746711079496, 'samples': 14036928, 'steps': 73108, 'loss/train': 1.316190242767334} 11/07/2021 07:29:34 - INFO - __main__ - Step 73110: {'lr': 0.0002653321703591865, 'samples': 14037120, 'steps': 73109, 'loss/train': 0.9615099430084229} 11/07/2021 07:29:34 - INFO - __main__ - Step 73111: {'lr': 0.00026532687360066964, 'samples': 14037312, 'steps': 73110, 'loss/train': 1.9771384000778198} 11/07/2021 07:29:35 - INFO - __main__ - Step 73112: {'lr': 0.0002653215768352468, 'samples': 14037504, 'steps': 73111, 'loss/train': 1.671366572380066} 11/07/2021 07:29:36 - INFO - __main__ - Step 73113: {'lr': 0.00026531628006292015, 'samples': 14037696, 'steps': 73112, 'loss/train': 1.4084208011627197} 11/07/2021 07:29:36 - INFO - __main__ - Step 73114: {'lr': 0.00026531098328369225, 'samples': 14037888, 'steps': 73113, 'loss/train': 1.2928564548492432} 11/07/2021 07:29:37 - INFO - __main__ - Step 73115: {'lr': 0.00026530568649756547, 'samples': 14038080, 'steps': 73114, 'loss/train': 1.4468200206756592} 11/07/2021 07:29:37 - INFO - __main__ - Step 73116: {'lr': 0.00026530038970454223, 'samples': 14038272, 'steps': 73115, 'loss/train': 1.1051238775253296} 11/07/2021 07:29:38 - INFO - __main__ - Step 73117: {'lr': 0.00026529509290462483, 'samples': 14038464, 'steps': 73116, 'loss/train': 1.9016867876052856} 11/07/2021 07:29:38 - INFO - __main__ - Step 73118: {'lr': 0.00026528979609781575, 'samples': 14038656, 'steps': 73117, 'loss/train': 1.3194754123687744} 11/07/2021 07:29:39 - INFO - __main__ - Step 73119: {'lr': 0.00026528449928411727, 'samples': 14038848, 'steps': 73118, 'loss/train': 1.3344435691833496} 11/07/2021 07:29:39 - INFO - __main__ - Step 73120: {'lr': 0.0002652792024635318, 'samples': 14039040, 'steps': 73119, 'loss/train': 1.4844073057174683} 11/07/2021 07:29:39 - INFO - __main__ - Step 73121: {'lr': 0.0002652739056360618, 'samples': 14039232, 'steps': 73120, 'loss/train': 1.265863299369812} 11/07/2021 07:29:40 - INFO - __main__ - Step 73122: {'lr': 0.0002652686088017096, 'samples': 14039424, 'steps': 73121, 'loss/train': 1.4896700382232666} 11/07/2021 07:29:41 - INFO - __main__ - Step 73123: {'lr': 0.00026526331196047764, 'samples': 14039616, 'steps': 73122, 'loss/train': 1.54007887840271} 11/07/2021 07:29:41 - INFO - __main__ - Step 73124: {'lr': 0.00026525801511236827, 'samples': 14039808, 'steps': 73123, 'loss/train': 1.4520031213760376} 11/07/2021 07:29:41 - INFO - __main__ - Step 73125: {'lr': 0.0002652527182573838, 'samples': 14040000, 'steps': 73124, 'loss/train': 1.812910795211792} 11/07/2021 07:29:42 - INFO - __main__ - Step 73126: {'lr': 0.0002652474213955267, 'samples': 14040192, 'steps': 73125, 'loss/train': 1.373225212097168} 11/07/2021 07:29:43 - INFO - __main__ - Step 73127: {'lr': 0.0002652421245267994, 'samples': 14040384, 'steps': 73126, 'loss/train': 1.1191024780273438} 11/07/2021 07:29:43 - INFO - __main__ - Step 73128: {'lr': 0.0002652368276512042, 'samples': 14040576, 'steps': 73127, 'loss/train': 1.347456932067871} 11/07/2021 07:29:43 - INFO - __main__ - Step 73129: {'lr': 0.00026523153076874357, 'samples': 14040768, 'steps': 73128, 'loss/train': 1.0543831586837769} 11/07/2021 07:29:44 - INFO - __main__ - Step 73130: {'lr': 0.0002652262338794198, 'samples': 14040960, 'steps': 73129, 'loss/train': 1.620888352394104} 11/07/2021 07:29:44 - INFO - __main__ - Step 73131: {'lr': 0.00026522093698323534, 'samples': 14041152, 'steps': 73130, 'loss/train': 2.0061872005462646} 11/07/2021 07:29:45 - INFO - __main__ - Step 73132: {'lr': 0.00026521564008019253, 'samples': 14041344, 'steps': 73131, 'loss/train': 1.3881540298461914} 11/07/2021 07:29:46 - INFO - __main__ - Step 73133: {'lr': 0.0002652103431702938, 'samples': 14041536, 'steps': 73132, 'loss/train': 1.4682480096817017} 11/07/2021 07:29:46 - INFO - __main__ - Step 73134: {'lr': 0.0002652050462535416, 'samples': 14041728, 'steps': 73133, 'loss/train': 1.4919573068618774} 11/07/2021 07:29:46 - INFO - __main__ - Step 73135: {'lr': 0.0002651997493299382, 'samples': 14041920, 'steps': 73134, 'loss/train': 1.3288867473602295} 11/07/2021 07:29:47 - INFO - __main__ - Step 73136: {'lr': 0.000265194452399486, 'samples': 14042112, 'steps': 73135, 'loss/train': 1.3832122087478638} 11/07/2021 07:29:47 - INFO - __main__ - Step 73137: {'lr': 0.0002651891554621874, 'samples': 14042304, 'steps': 73136, 'loss/train': 1.30440092086792} 11/07/2021 07:29:48 - INFO - __main__ - Step 73138: {'lr': 0.0002651838585180448, 'samples': 14042496, 'steps': 73137, 'loss/train': 1.3818693161010742} 11/07/2021 07:29:48 - INFO - __main__ - Step 73139: {'lr': 0.0002651785615670606, 'samples': 14042688, 'steps': 73138, 'loss/train': 1.2551720142364502} 11/07/2021 07:29:49 - INFO - __main__ - Step 73140: {'lr': 0.0002651732646092372, 'samples': 14042880, 'steps': 73139, 'loss/train': 1.3238672018051147} 11/07/2021 07:29:49 - INFO - __main__ - Step 73141: {'lr': 0.00026516796764457695, 'samples': 14043072, 'steps': 73140, 'loss/train': 1.3446102142333984} 11/07/2021 07:29:49 - INFO - __main__ - Step 73142: {'lr': 0.0002651626706730823, 'samples': 14043264, 'steps': 73141, 'loss/train': 1.5905901193618774} 11/07/2021 07:29:51 - INFO - __main__ - Step 73143: {'lr': 0.00026515737369475545, 'samples': 14043456, 'steps': 73142, 'loss/train': 1.2513090372085571} 11/07/2021 07:29:51 - INFO - __main__ - Step 73144: {'lr': 0.000265152076709599, 'samples': 14043648, 'steps': 73143, 'loss/train': 2.027472972869873} 11/07/2021 07:29:51 - INFO - __main__ - Step 73145: {'lr': 0.00026514677971761525, 'samples': 14043840, 'steps': 73144, 'loss/train': 1.034956693649292} 11/07/2021 07:29:52 - INFO - __main__ - Step 73146: {'lr': 0.0002651414827188066, 'samples': 14044032, 'steps': 73145, 'loss/train': 1.5163387060165405} 11/07/2021 07:29:52 - INFO - __main__ - Step 73147: {'lr': 0.00026513618571317543, 'samples': 14044224, 'steps': 73146, 'loss/train': 1.8055589199066162} 11/07/2021 07:29:52 - INFO - __main__ - Step 73148: {'lr': 0.00026513088870072415, 'samples': 14044416, 'steps': 73147, 'loss/train': 2.0009539127349854} 11/07/2021 07:29:54 - INFO - __main__ - Step 73149: {'lr': 0.00026512559168145514, 'samples': 14044608, 'steps': 73148, 'loss/train': 1.7886656522750854} 11/07/2021 07:29:54 - INFO - __main__ - Step 73150: {'lr': 0.00026512029465537067, 'samples': 14044800, 'steps': 73149, 'loss/train': 1.1222460269927979} 11/07/2021 07:29:54 - INFO - __main__ - Step 73151: {'lr': 0.00026511499762247334, 'samples': 14044992, 'steps': 73150, 'loss/train': 1.1208858489990234} 11/07/2021 07:29:55 - INFO - __main__ - Step 73152: {'lr': 0.00026510970058276533, 'samples': 14045184, 'steps': 73151, 'loss/train': 1.8946748971939087} 11/07/2021 07:29:55 - INFO - __main__ - Step 73153: {'lr': 0.0002651044035362492, 'samples': 14045376, 'steps': 73152, 'loss/train': 0.9673628211021423} 11/07/2021 07:29:56 - INFO - __main__ - Step 73154: {'lr': 0.00026509910648292716, 'samples': 14045568, 'steps': 73153, 'loss/train': 1.6901808977127075} 11/07/2021 07:29:56 - INFO - __main__ - Step 73155: {'lr': 0.0002650938094228019, 'samples': 14045760, 'steps': 73154, 'loss/train': 1.3827975988388062} 11/07/2021 07:29:57 - INFO - __main__ - Step 73156: {'lr': 0.0002650885123558754, 'samples': 14045952, 'steps': 73155, 'loss/train': 1.221262812614441} 11/07/2021 07:29:57 - INFO - __main__ - Step 73157: {'lr': 0.00026508321528215034, 'samples': 14046144, 'steps': 73156, 'loss/train': 1.5817073583602905} 11/07/2021 07:29:58 - INFO - __main__ - Step 73158: {'lr': 0.00026507791820162894, 'samples': 14046336, 'steps': 73157, 'loss/train': 1.3417863845825195} 11/07/2021 07:29:59 - INFO - __main__ - Step 73159: {'lr': 0.0002650726211143137, 'samples': 14046528, 'steps': 73158, 'loss/train': 1.278694748878479} 11/07/2021 07:29:59 - INFO - __main__ - Step 73160: {'lr': 0.000265067324020207, 'samples': 14046720, 'steps': 73159, 'loss/train': 1.334336280822754} 11/07/2021 07:29:59 - INFO - __main__ - Step 73161: {'lr': 0.0002650620269193112, 'samples': 14046912, 'steps': 73160, 'loss/train': 1.3396707773208618} 11/07/2021 07:30:00 - INFO - __main__ - Step 73162: {'lr': 0.0002650567298116287, 'samples': 14047104, 'steps': 73161, 'loss/train': 1.7373815774917603} 11/07/2021 07:30:00 - INFO - __main__ - Step 73163: {'lr': 0.0002650514326971618, 'samples': 14047296, 'steps': 73162, 'loss/train': 1.6049491167068481} 11/07/2021 07:30:01 - INFO - __main__ - Step 73164: {'lr': 0.00026504613557591303, 'samples': 14047488, 'steps': 73163, 'loss/train': 2.177133560180664} 11/07/2021 07:30:01 - INFO - __main__ - Step 73165: {'lr': 0.0002650408384478846, 'samples': 14047680, 'steps': 73164, 'loss/train': 1.4141457080841064} 11/07/2021 07:30:02 - INFO - __main__ - Step 73166: {'lr': 0.0002650355413130791, 'samples': 14047872, 'steps': 73165, 'loss/train': 1.2749212980270386} 11/07/2021 07:30:02 - INFO - __main__ - Step 73167: {'lr': 0.0002650302441714988, 'samples': 14048064, 'steps': 73166, 'loss/train': 1.0269606113433838} 11/07/2021 07:30:02 - INFO - __main__ - Step 73168: {'lr': 0.0002650249470231461, 'samples': 14048256, 'steps': 73167, 'loss/train': 1.1600035429000854} 11/07/2021 07:30:03 - INFO - __main__ - Step 73169: {'lr': 0.0002650196498680234, 'samples': 14048448, 'steps': 73168, 'loss/train': 1.4413583278656006} 11/07/2021 07:30:04 - INFO - __main__ - Step 73170: {'lr': 0.00026501435270613307, 'samples': 14048640, 'steps': 73169, 'loss/train': 1.4958806037902832} 11/07/2021 07:30:04 - INFO - __main__ - Step 73171: {'lr': 0.00026500905553747747, 'samples': 14048832, 'steps': 73170, 'loss/train': 1.3317484855651855} 11/07/2021 07:30:04 - INFO - __main__ - Step 73172: {'lr': 0.00026500375836205895, 'samples': 14049024, 'steps': 73171, 'loss/train': 1.8909112215042114} 11/07/2021 07:30:05 - INFO - __main__ - Step 73173: {'lr': 0.0002649984611798801, 'samples': 14049216, 'steps': 73172, 'loss/train': 1.587355613708496} 11/07/2021 07:30:06 - INFO - __main__ - Step 73174: {'lr': 0.00026499316399094316, 'samples': 14049408, 'steps': 73173, 'loss/train': 1.3915801048278809} 11/07/2021 07:30:06 - INFO - __main__ - Step 73175: {'lr': 0.0002649878667952505, 'samples': 14049600, 'steps': 73174, 'loss/train': 1.473784327507019} 11/07/2021 07:30:07 - INFO - __main__ - Step 73176: {'lr': 0.00026498256959280454, 'samples': 14049792, 'steps': 73175, 'loss/train': 1.3312948942184448} 11/07/2021 07:30:07 - INFO - __main__ - Step 73177: {'lr': 0.0002649772723836077, 'samples': 14049984, 'steps': 73176, 'loss/train': 0.8662620782852173} 11/07/2021 07:30:07 - INFO - __main__ - Step 73178: {'lr': 0.00026497197516766225, 'samples': 14050176, 'steps': 73177, 'loss/train': 1.4629820585250854} 11/07/2021 07:30:08 - INFO - __main__ - Step 73179: {'lr': 0.0002649666779449707, 'samples': 14050368, 'steps': 73178, 'loss/train': 1.5015774965286255} 11/07/2021 07:30:09 - INFO - __main__ - Step 73180: {'lr': 0.00026496138071553546, 'samples': 14050560, 'steps': 73179, 'loss/train': 1.3486051559448242} 11/07/2021 07:30:09 - INFO - __main__ - Step 73181: {'lr': 0.0002649560834793588, 'samples': 14050752, 'steps': 73180, 'loss/train': 2.7127082347869873} 11/07/2021 07:30:09 - INFO - __main__ - Step 73182: {'lr': 0.00026495078623644315, 'samples': 14050944, 'steps': 73181, 'loss/train': 1.4616963863372803} 11/07/2021 07:30:10 - INFO - __main__ - Step 73183: {'lr': 0.00026494548898679094, 'samples': 14051136, 'steps': 73182, 'loss/train': 1.3392764329910278} 11/07/2021 07:30:11 - INFO - __main__ - Step 73184: {'lr': 0.00026494019173040447, 'samples': 14051328, 'steps': 73183, 'loss/train': 1.7090328931808472} 11/07/2021 07:30:11 - INFO - __main__ - Step 73185: {'lr': 0.0002649348944672862, 'samples': 14051520, 'steps': 73184, 'loss/train': 1.326764464378357} 11/07/2021 07:30:11 - INFO - __main__ - Step 73186: {'lr': 0.0002649295971974385, 'samples': 14051712, 'steps': 73185, 'loss/train': 1.4067972898483276} 11/07/2021 07:30:12 - INFO - __main__ - Step 73187: {'lr': 0.00026492429992086374, 'samples': 14051904, 'steps': 73186, 'loss/train': 1.3165051937103271} 11/07/2021 07:30:12 - INFO - __main__ - Step 73188: {'lr': 0.0002649190026375644, 'samples': 14052096, 'steps': 73187, 'loss/train': 0.8914204835891724} 11/07/2021 07:30:12 - INFO - __main__ - Step 73189: {'lr': 0.0002649137053475427, 'samples': 14052288, 'steps': 73188, 'loss/train': 1.4923752546310425} 11/07/2021 07:30:13 - INFO - __main__ - Step 73190: {'lr': 0.0002649084080508011, 'samples': 14052480, 'steps': 73189, 'loss/train': 0.9512122869491577} 11/07/2021 07:30:14 - INFO - __main__ - Step 73191: {'lr': 0.0002649031107473421, 'samples': 14052672, 'steps': 73190, 'loss/train': 1.7689510583877563} 11/07/2021 07:30:14 - INFO - __main__ - Step 73192: {'lr': 0.0002648978134371679, 'samples': 14052864, 'steps': 73191, 'loss/train': 1.1373203992843628} 11/07/2021 07:30:14 - INFO - __main__ - Step 73193: {'lr': 0.000264892516120281, 'samples': 14053056, 'steps': 73192, 'loss/train': 1.0761643648147583} 11/07/2021 07:30:15 - INFO - __main__ - Step 73194: {'lr': 0.00026488721879668373, 'samples': 14053248, 'steps': 73193, 'loss/train': 1.5339394807815552} 11/07/2021 07:30:16 - INFO - __main__ - Step 73195: {'lr': 0.00026488192146637864, 'samples': 14053440, 'steps': 73194, 'loss/train': 1.0520070791244507} 11/07/2021 07:30:16 - INFO - __main__ - Step 73196: {'lr': 0.00026487662412936786, 'samples': 14053632, 'steps': 73195, 'loss/train': 1.3787482976913452} 11/07/2021 07:30:17 - INFO - __main__ - Step 73197: {'lr': 0.00026487132678565395, 'samples': 14053824, 'steps': 73196, 'loss/train': 1.805245280265808} 11/07/2021 07:30:17 - INFO - __main__ - Step 73198: {'lr': 0.00026486602943523923, 'samples': 14054016, 'steps': 73197, 'loss/train': 1.467148780822754} 11/07/2021 07:30:17 - INFO - __main__ - Step 73199: {'lr': 0.0002648607320781261, 'samples': 14054208, 'steps': 73198, 'loss/train': 1.6010602712631226} 11/07/2021 07:30:18 - INFO - __main__ - Step 73200: {'lr': 0.00026485543471431694, 'samples': 14054400, 'steps': 73199, 'loss/train': 1.3840086460113525} 11/07/2021 07:30:19 - INFO - __main__ - Step 73201: {'lr': 0.0002648501373438142, 'samples': 14054592, 'steps': 73200, 'loss/train': 1.4498353004455566} 11/07/2021 07:30:19 - INFO - __main__ - Step 73202: {'lr': 0.0002648448399666202, 'samples': 14054784, 'steps': 73201, 'loss/train': 1.5781067609786987} 11/07/2021 07:30:20 - INFO - __main__ - Step 73203: {'lr': 0.00026483954258273737, 'samples': 14054976, 'steps': 73202, 'loss/train': 1.249579668045044} 11/07/2021 07:30:20 - INFO - __main__ - Step 73204: {'lr': 0.00026483424519216803, 'samples': 14055168, 'steps': 73203, 'loss/train': 1.384129524230957} 11/07/2021 07:30:21 - INFO - __main__ - Step 73205: {'lr': 0.00026482894779491465, 'samples': 14055360, 'steps': 73204, 'loss/train': 1.5106583833694458} 11/07/2021 07:30:21 - INFO - __main__ - Step 73206: {'lr': 0.0002648236503909795, 'samples': 14055552, 'steps': 73205, 'loss/train': 1.362083077430725} 11/07/2021 07:30:22 - INFO - __main__ - Step 73207: {'lr': 0.00026481835298036504, 'samples': 14055744, 'steps': 73206, 'loss/train': 1.8932417631149292} 11/07/2021 07:30:22 - INFO - __main__ - Step 73208: {'lr': 0.00026481305556307376, 'samples': 14055936, 'steps': 73207, 'loss/train': 1.8060835599899292} 11/07/2021 07:30:22 - INFO - __main__ - Step 73209: {'lr': 0.0002648077581391079, 'samples': 14056128, 'steps': 73208, 'loss/train': 2.0162100791931152} 11/07/2021 07:30:23 - INFO - __main__ - Step 73210: {'lr': 0.00026480246070846987, 'samples': 14056320, 'steps': 73209, 'loss/train': 1.4693039655685425} 11/07/2021 07:30:24 - INFO - __main__ - Step 73211: {'lr': 0.00026479716327116204, 'samples': 14056512, 'steps': 73210, 'loss/train': 1.36268949508667} 11/07/2021 07:30:24 - INFO - __main__ - Step 73212: {'lr': 0.0002647918658271869, 'samples': 14056704, 'steps': 73211, 'loss/train': 1.5772420167922974} 11/07/2021 07:30:24 - INFO - __main__ - Step 73213: {'lr': 0.00026478656837654676, 'samples': 14056896, 'steps': 73212, 'loss/train': 1.2763335704803467} 11/07/2021 07:30:25 - INFO - __main__ - Step 73214: {'lr': 0.00026478127091924403, 'samples': 14057088, 'steps': 73213, 'loss/train': 1.2316159009933472} 11/07/2021 07:30:25 - INFO - __main__ - Step 73215: {'lr': 0.0002647759734552811, 'samples': 14057280, 'steps': 73214, 'loss/train': 1.3843523263931274} 11/07/2021 07:30:26 - INFO - __main__ - Step 73216: {'lr': 0.00026477067598466034, 'samples': 14057472, 'steps': 73215, 'loss/train': 0.8568363189697266} 11/07/2021 07:30:26 - INFO - __main__ - Step 73217: {'lr': 0.0002647653785073841, 'samples': 14057664, 'steps': 73216, 'loss/train': 1.6566593647003174} 11/07/2021 07:30:27 - INFO - __main__ - Step 73218: {'lr': 0.0002647600810234548, 'samples': 14057856, 'steps': 73217, 'loss/train': 1.3243262767791748} 11/07/2021 07:30:27 - INFO - __main__ - Step 73219: {'lr': 0.0002647547835328749, 'samples': 14058048, 'steps': 73218, 'loss/train': 1.045099139213562} 11/07/2021 07:30:28 - INFO - __main__ - Step 73220: {'lr': 0.0002647494860356467, 'samples': 14058240, 'steps': 73219, 'loss/train': 1.0445940494537354} 11/07/2021 07:30:29 - INFO - __main__ - Step 73221: {'lr': 0.0002647441885317726, 'samples': 14058432, 'steps': 73220, 'loss/train': 1.3133925199508667} 11/07/2021 07:30:29 - INFO - __main__ - Step 73222: {'lr': 0.000264738891021255, 'samples': 14058624, 'steps': 73221, 'loss/train': 1.2134671211242676} 11/07/2021 07:30:29 - INFO - __main__ - Step 73223: {'lr': 0.00026473359350409625, 'samples': 14058816, 'steps': 73222, 'loss/train': 1.7577052116394043} 11/07/2021 07:30:30 - INFO - __main__ - Step 73224: {'lr': 0.0002647282959802988, 'samples': 14059008, 'steps': 73223, 'loss/train': 1.4405970573425293} 11/07/2021 07:30:30 - INFO - __main__ - Step 73225: {'lr': 0.00026472299844986505, 'samples': 14059200, 'steps': 73224, 'loss/train': 1.1156309843063354} 11/07/2021 07:30:31 - INFO - __main__ - Step 73226: {'lr': 0.00026471770091279724, 'samples': 14059392, 'steps': 73225, 'loss/train': 1.5464210510253906} 11/07/2021 07:30:31 - INFO - __main__ - Step 73227: {'lr': 0.0002647124033690979, 'samples': 14059584, 'steps': 73226, 'loss/train': 1.4650181531906128} 11/07/2021 07:30:32 - INFO - __main__ - Step 73228: {'lr': 0.00026470710581876937, 'samples': 14059776, 'steps': 73227, 'loss/train': 1.508488416671753} 11/07/2021 07:30:32 - INFO - __main__ - Step 73229: {'lr': 0.0002647018082618142, 'samples': 14059968, 'steps': 73228, 'loss/train': 1.5490587949752808} 11/07/2021 07:30:32 - INFO - __main__ - Step 73230: {'lr': 0.0002646965106982345, 'samples': 14060160, 'steps': 73229, 'loss/train': 1.6272234916687012} 11/07/2021 07:30:33 - INFO - __main__ - Step 73231: {'lr': 0.00026469121312803275, 'samples': 14060352, 'steps': 73230, 'loss/train': 1.3650010824203491} 11/07/2021 07:30:34 - INFO - __main__ - Step 73232: {'lr': 0.00026468591555121136, 'samples': 14060544, 'steps': 73231, 'loss/train': 1.9106801748275757} 11/07/2021 07:30:34 - INFO - __main__ - Step 73233: {'lr': 0.00026468061796777276, 'samples': 14060736, 'steps': 73232, 'loss/train': 0.8108378052711487} 11/07/2021 07:30:34 - INFO - __main__ - Step 73234: {'lr': 0.0002646753203777192, 'samples': 14060928, 'steps': 73233, 'loss/train': 1.2150821685791016} 11/07/2021 07:30:35 - INFO - __main__ - Step 73235: {'lr': 0.0002646700227810534, 'samples': 14061120, 'steps': 73234, 'loss/train': 1.6123507022857666} 11/07/2021 07:30:36 - INFO - __main__ - Step 73236: {'lr': 0.0002646647251777773, 'samples': 14061312, 'steps': 73235, 'loss/train': 1.3934412002563477} 11/07/2021 07:30:36 - INFO - __main__ - Step 73237: {'lr': 0.0002646594275678936, 'samples': 14061504, 'steps': 73236, 'loss/train': 1.4463472366333008} 11/07/2021 07:30:37 - INFO - __main__ - Step 73238: {'lr': 0.0002646541299514046, 'samples': 14061696, 'steps': 73237, 'loss/train': 1.612061858177185} 11/07/2021 07:30:37 - INFO - __main__ - Step 73239: {'lr': 0.0002646488323283126, 'samples': 14061888, 'steps': 73238, 'loss/train': 1.389543056488037} 11/07/2021 07:30:37 - INFO - __main__ - Step 73240: {'lr': 0.0002646435346986201, 'samples': 14062080, 'steps': 73239, 'loss/train': 1.326606273651123} 11/07/2021 07:30:38 - INFO - __main__ - Step 73241: {'lr': 0.0002646382370623295, 'samples': 14062272, 'steps': 73240, 'loss/train': 1.097206950187683} 11/07/2021 07:30:39 - INFO - __main__ - Step 73242: {'lr': 0.00026463293941944306, 'samples': 14062464, 'steps': 73241, 'loss/train': 1.042150855064392} 11/07/2021 07:30:39 - INFO - __main__ - Step 73243: {'lr': 0.0002646276417699633, 'samples': 14062656, 'steps': 73242, 'loss/train': 0.5216575860977173} 11/07/2021 07:30:39 - INFO - __main__ - Step 73244: {'lr': 0.00026462234411389244, 'samples': 14062848, 'steps': 73243, 'loss/train': 1.5706937313079834} 11/07/2021 07:30:40 - INFO - __main__ - Step 73245: {'lr': 0.0002646170464512331, 'samples': 14063040, 'steps': 73244, 'loss/train': 1.2566920518875122} 11/07/2021 07:30:40 - INFO - __main__ - Step 73246: {'lr': 0.0002646117487819875, 'samples': 14063232, 'steps': 73245, 'loss/train': 1.6735076904296875} 11/07/2021 07:30:41 - INFO - __main__ - Step 73247: {'lr': 0.0002646064511061581, 'samples': 14063424, 'steps': 73246, 'loss/train': 1.2110353708267212} 11/07/2021 07:30:42 - INFO - __main__ - Step 73248: {'lr': 0.00026460115342374723, 'samples': 14063616, 'steps': 73247, 'loss/train': 1.2866219282150269} 11/07/2021 07:30:42 - INFO - __main__ - Step 73249: {'lr': 0.0002645958557347573, 'samples': 14063808, 'steps': 73248, 'loss/train': 0.7975257039070129} 11/07/2021 07:30:43 - INFO - __main__ - Step 73250: {'lr': 0.00026459055803919074, 'samples': 14064000, 'steps': 73249, 'loss/train': 1.5869426727294922} 11/07/2021 07:30:43 - INFO - __main__ - Step 73251: {'lr': 0.00026458526033704984, 'samples': 14064192, 'steps': 73250, 'loss/train': 1.5618150234222412} 11/07/2021 07:30:44 - INFO - __main__ - Step 73252: {'lr': 0.0002645799626283371, 'samples': 14064384, 'steps': 73251, 'loss/train': 0.29711487889289856} 11/07/2021 07:30:44 - INFO - __main__ - Step 73253: {'lr': 0.00026457466491305485, 'samples': 14064576, 'steps': 73252, 'loss/train': 1.535209059715271} 11/07/2021 07:30:45 - INFO - __main__ - Step 73254: {'lr': 0.00026456936719120543, 'samples': 14064768, 'steps': 73253, 'loss/train': 1.0766786336898804} 11/07/2021 07:30:45 - INFO - __main__ - Step 73255: {'lr': 0.0002645640694627913, 'samples': 14064960, 'steps': 73254, 'loss/train': 1.1168652772903442} 11/07/2021 07:30:45 - INFO - __main__ - Step 73256: {'lr': 0.0002645587717278148, 'samples': 14065152, 'steps': 73255, 'loss/train': 1.41000235080719} 11/07/2021 07:30:46 - INFO - __main__ - Step 73257: {'lr': 0.00026455347398627845, 'samples': 14065344, 'steps': 73256, 'loss/train': 1.413987398147583} 11/07/2021 07:30:47 - INFO - __main__ - Step 73258: {'lr': 0.0002645481762381845, 'samples': 14065536, 'steps': 73257, 'loss/train': 1.2592641115188599} 11/07/2021 07:30:47 - INFO - __main__ - Step 73259: {'lr': 0.0002645428784835353, 'samples': 14065728, 'steps': 73258, 'loss/train': 1.6087881326675415} 11/07/2021 07:30:47 - INFO - __main__ - Step 73260: {'lr': 0.0002645375807223333, 'samples': 14065920, 'steps': 73259, 'loss/train': 1.4973338842391968} 11/07/2021 07:30:48 - INFO - __main__ - Step 73261: {'lr': 0.00026453228295458093, 'samples': 14066112, 'steps': 73260, 'loss/train': 1.5012357234954834} 11/07/2021 07:30:49 - INFO - __main__ - Step 73262: {'lr': 0.0002645269851802805, 'samples': 14066304, 'steps': 73261, 'loss/train': 1.767212152481079} 11/07/2021 07:30:49 - INFO - __main__ - Step 73263: {'lr': 0.0002645216873994345, 'samples': 14066496, 'steps': 73262, 'loss/train': 0.9902823567390442} 11/07/2021 07:30:49 - INFO - __main__ - Step 73264: {'lr': 0.0002645163896120452, 'samples': 14066688, 'steps': 73263, 'loss/train': 1.587572455406189} 11/07/2021 07:30:50 - INFO - __main__ - Step 73265: {'lr': 0.00026451109181811506, 'samples': 14066880, 'steps': 73264, 'loss/train': 1.5949732065200806} 11/07/2021 07:30:50 - INFO - __main__ - Step 73266: {'lr': 0.0002645057940176464, 'samples': 14067072, 'steps': 73265, 'loss/train': 0.23885096609592438} 11/07/2021 07:30:51 - INFO - __main__ - Step 73267: {'lr': 0.00026450049621064173, 'samples': 14067264, 'steps': 73266, 'loss/train': 1.4799790382385254} 11/07/2021 07:30:52 - INFO - __main__ - Step 73268: {'lr': 0.0002644951983971033, 'samples': 14067456, 'steps': 73267, 'loss/train': 1.554643154144287} 11/07/2021 07:30:52 - INFO - __main__ - Step 73269: {'lr': 0.0002644899005770336, 'samples': 14067648, 'steps': 73268, 'loss/train': 1.4570050239562988} 11/07/2021 07:30:52 - INFO - __main__ - Step 73270: {'lr': 0.00026448460275043496, 'samples': 14067840, 'steps': 73269, 'loss/train': 1.4298330545425415} 11/07/2021 07:30:53 - INFO - __main__ - Step 73271: {'lr': 0.00026447930491730974, 'samples': 14068032, 'steps': 73270, 'loss/train': 1.4953545331954956} 11/07/2021 07:30:53 - INFO - __main__ - Step 73272: {'lr': 0.0002644740070776604, 'samples': 14068224, 'steps': 73271, 'loss/train': 6.484274864196777} 11/07/2021 07:30:54 - INFO - __main__ - Step 73273: {'lr': 0.0002644687092314893, 'samples': 14068416, 'steps': 73272, 'loss/train': 1.5054751634597778} 11/07/2021 07:30:54 - INFO - __main__ - Step 73274: {'lr': 0.0002644634113787988, 'samples': 14068608, 'steps': 73273, 'loss/train': 1.0734310150146484} 11/07/2021 07:30:55 - INFO - __main__ - Step 73275: {'lr': 0.0002644581135195913, 'samples': 14068800, 'steps': 73274, 'loss/train': 1.6468505859375} 11/07/2021 07:30:55 - INFO - __main__ - Step 73276: {'lr': 0.0002644528156538693, 'samples': 14068992, 'steps': 73275, 'loss/train': 1.689677119255066} 11/07/2021 07:30:55 - INFO - __main__ - Step 73277: {'lr': 0.000264447517781635, 'samples': 14069184, 'steps': 73276, 'loss/train': 1.5541988611221313} 11/07/2021 07:30:57 - INFO - __main__ - Step 73278: {'lr': 0.00026444221990289086, 'samples': 14069376, 'steps': 73277, 'loss/train': 1.704243779182434} 11/07/2021 07:30:57 - INFO - __main__ - Step 73279: {'lr': 0.0002644369220176393, 'samples': 14069568, 'steps': 73278, 'loss/train': 1.3018230199813843} 11/07/2021 07:30:57 - INFO - __main__ - Step 73280: {'lr': 0.0002644316241258827, 'samples': 14069760, 'steps': 73279, 'loss/train': 1.310102939605713} 11/07/2021 07:30:58 - INFO - __main__ - Step 73281: {'lr': 0.0002644263262276234, 'samples': 14069952, 'steps': 73280, 'loss/train': 1.1132911443710327} 11/07/2021 07:30:58 - INFO - __main__ - Step 73282: {'lr': 0.0002644210283228639, 'samples': 14070144, 'steps': 73281, 'loss/train': 1.9595458507537842} 11/07/2021 07:30:59 - INFO - __main__ - Step 73283: {'lr': 0.0002644157304116064, 'samples': 14070336, 'steps': 73282, 'loss/train': 1.6509206295013428} 11/07/2021 07:30:59 - INFO - __main__ - Step 73284: {'lr': 0.0002644104324938534, 'samples': 14070528, 'steps': 73283, 'loss/train': 1.4047960042953491} 11/07/2021 07:31:00 - INFO - __main__ - Step 73285: {'lr': 0.00026440513456960736, 'samples': 14070720, 'steps': 73284, 'loss/train': 1.3843730688095093} 11/07/2021 07:31:00 - INFO - __main__ - Step 73286: {'lr': 0.00026439983663887056, 'samples': 14070912, 'steps': 73285, 'loss/train': 1.605696439743042} 11/07/2021 07:31:00 - INFO - __main__ - Step 73287: {'lr': 0.0002643945387016454, 'samples': 14071104, 'steps': 73286, 'loss/train': 1.5149110555648804} 11/07/2021 07:31:01 - INFO - __main__ - Step 73288: {'lr': 0.0002643892407579343, 'samples': 14071296, 'steps': 73287, 'loss/train': 0.7814556956291199} 11/07/2021 07:31:02 - INFO - __main__ - Step 73289: {'lr': 0.00026438394280773963, 'samples': 14071488, 'steps': 73288, 'loss/train': 1.093995451927185} 11/07/2021 07:31:02 - INFO - __main__ - Step 73290: {'lr': 0.0002643786448510638, 'samples': 14071680, 'steps': 73289, 'loss/train': 1.2528247833251953} 11/07/2021 07:31:02 - INFO - __main__ - Step 73291: {'lr': 0.0002643733468879091, 'samples': 14071872, 'steps': 73290, 'loss/train': 1.6158552169799805} 11/07/2021 07:31:03 - INFO - __main__ - Step 73292: {'lr': 0.000264368048918278, 'samples': 14072064, 'steps': 73291, 'loss/train': 1.4070067405700684} 11/07/2021 07:31:03 - INFO - __main__ - Step 73293: {'lr': 0.00026436275094217295, 'samples': 14072256, 'steps': 73292, 'loss/train': 1.5379310846328735} 11/07/2021 07:31:04 - INFO - __main__ - Step 73294: {'lr': 0.0002643574529595962, 'samples': 14072448, 'steps': 73293, 'loss/train': 0.7694131135940552} 11/07/2021 07:31:04 - INFO - __main__ - Step 73295: {'lr': 0.0002643521549705502, 'samples': 14072640, 'steps': 73294, 'loss/train': 0.7382315397262573} 11/07/2021 07:31:05 - INFO - __main__ - Step 73296: {'lr': 0.0002643468569750375, 'samples': 14072832, 'steps': 73295, 'loss/train': 1.6286731958389282} 11/07/2021 07:31:05 - INFO - __main__ - Step 73297: {'lr': 0.0002643415589730602, 'samples': 14073024, 'steps': 73296, 'loss/train': 1.3326475620269775} 11/07/2021 07:31:06 - INFO - __main__ - Step 73298: {'lr': 0.0002643362609646208, 'samples': 14073216, 'steps': 73297, 'loss/train': 0.5655418038368225} 11/07/2021 07:31:06 - INFO - __main__ - Step 73299: {'lr': 0.00026433096294972166, 'samples': 14073408, 'steps': 73298, 'loss/train': 0.925666868686676} 11/07/2021 07:31:07 - INFO - __main__ - Step 73300: {'lr': 0.00026432566492836523, 'samples': 14073600, 'steps': 73299, 'loss/train': 1.2876529693603516} 11/07/2021 07:31:07 - INFO - __main__ - Step 73301: {'lr': 0.00026432036690055396, 'samples': 14073792, 'steps': 73300, 'loss/train': 1.4616127014160156} 11/07/2021 07:31:08 - INFO - __main__ - Step 73302: {'lr': 0.00026431506886629016, 'samples': 14073984, 'steps': 73301, 'loss/train': 0.8269844651222229} 11/07/2021 07:31:08 - INFO - __main__ - Step 73303: {'lr': 0.0002643097708255761, 'samples': 14074176, 'steps': 73302, 'loss/train': 1.4877357482910156} 11/07/2021 07:31:09 - INFO - __main__ - Step 73304: {'lr': 0.00026430447277841433, 'samples': 14074368, 'steps': 73303, 'loss/train': 1.7956236600875854} 11/07/2021 07:31:09 - INFO - __main__ - Step 73305: {'lr': 0.00026429917472480717, 'samples': 14074560, 'steps': 73304, 'loss/train': 1.3591513633728027} 11/07/2021 07:31:10 - INFO - __main__ - Step 73306: {'lr': 0.0002642938766647571, 'samples': 14074752, 'steps': 73305, 'loss/train': 1.5770020484924316} 11/07/2021 07:31:10 - INFO - __main__ - Step 73307: {'lr': 0.0002642885785982663, 'samples': 14074944, 'steps': 73306, 'loss/train': 1.2855327129364014} 11/07/2021 07:31:10 - INFO - __main__ - Step 73308: {'lr': 0.0002642832805253374, 'samples': 14075136, 'steps': 73307, 'loss/train': 1.7128260135650635} 11/07/2021 07:31:10 - INFO - __main__ - Dataset epoch: 1 11/07/2021 07:31:12 - INFO - __main__ - Step 73309: {'lr': 0.00026427798244597266, 'samples': 14075328, 'steps': 73308, 'loss/train': 1.4184777736663818} 11/07/2021 07:31:12 - INFO - __main__ - Step 73310: {'lr': 0.00026427268436017445, 'samples': 14075520, 'steps': 73309, 'loss/train': 1.7484649419784546} 11/07/2021 07:31:12 - INFO - __main__ - Step 73311: {'lr': 0.00026426738626794514, 'samples': 14075712, 'steps': 73310, 'loss/train': 1.5664538145065308} 11/07/2021 07:31:13 - INFO - __main__ - Step 73312: {'lr': 0.00026426208816928727, 'samples': 14075904, 'steps': 73311, 'loss/train': 1.1408133506774902} 11/07/2021 07:31:13 - INFO - __main__ - Step 73313: {'lr': 0.00026425679006420306, 'samples': 14076096, 'steps': 73312, 'loss/train': 1.4411534070968628} 11/07/2021 07:31:14 - INFO - __main__ - Step 73314: {'lr': 0.00026425149195269496, 'samples': 14076288, 'steps': 73313, 'loss/train': 1.0568625926971436} 11/07/2021 07:31:14 - INFO - __main__ - Step 73315: {'lr': 0.00026424619383476534, 'samples': 14076480, 'steps': 73314, 'loss/train': 1.0610008239746094} 11/07/2021 07:31:15 - INFO - __main__ - Step 73316: {'lr': 0.0002642408957104167, 'samples': 14076672, 'steps': 73315, 'loss/train': 1.6654385328292847} 11/07/2021 07:31:15 - INFO - __main__ - Step 73317: {'lr': 0.00026423559757965127, 'samples': 14076864, 'steps': 73316, 'loss/train': 1.3019654750823975} 11/07/2021 07:31:15 - INFO - __main__ - Step 73318: {'lr': 0.0002642302994424715, 'samples': 14077056, 'steps': 73317, 'loss/train': 1.0649635791778564} 11/07/2021 07:31:17 - INFO - __main__ - Step 73319: {'lr': 0.0002642250012988797, 'samples': 14077248, 'steps': 73318, 'loss/train': 1.1046881675720215} 11/07/2021 07:31:17 - INFO - __main__ - Step 73320: {'lr': 0.0002642197031488784, 'samples': 14077440, 'steps': 73319, 'loss/train': 1.5296472311019897} 11/07/2021 07:31:17 - INFO - __main__ - Step 73321: {'lr': 0.00026421440499247, 'samples': 14077632, 'steps': 73320, 'loss/train': 1.5324722528457642} 11/07/2021 07:31:18 - INFO - __main__ - Step 73322: {'lr': 0.0002642091068296567, 'samples': 14077824, 'steps': 73321, 'loss/train': 1.0602225065231323} 11/07/2021 07:31:18 - INFO - __main__ - Step 73323: {'lr': 0.0002642038086604411, 'samples': 14078016, 'steps': 73322, 'loss/train': 0.691872239112854} 11/07/2021 07:31:18 - INFO - __main__ - Step 73324: {'lr': 0.00026419851048482536, 'samples': 14078208, 'steps': 73323, 'loss/train': 1.595578670501709} 11/07/2021 07:31:19 - INFO - __main__ - Step 73325: {'lr': 0.00026419321230281207, 'samples': 14078400, 'steps': 73324, 'loss/train': 1.3862261772155762} 11/07/2021 07:31:20 - INFO - __main__ - Step 73326: {'lr': 0.0002641879141144035, 'samples': 14078592, 'steps': 73325, 'loss/train': 1.8586974143981934} 11/07/2021 07:31:20 - INFO - __main__ - Step 73327: {'lr': 0.00026418261591960206, 'samples': 14078784, 'steps': 73326, 'loss/train': 1.6526176929473877} 11/07/2021 07:31:20 - INFO - __main__ - Step 73328: {'lr': 0.0002641773177184102, 'samples': 14078976, 'steps': 73327, 'loss/train': 1.3967231512069702} 11/07/2021 07:31:21 - INFO - __main__ - Step 73329: {'lr': 0.00026417201951083025, 'samples': 14079168, 'steps': 73328, 'loss/train': 1.5444172620773315} 11/07/2021 07:31:22 - INFO - __main__ - Step 73330: {'lr': 0.0002641667212968646, 'samples': 14079360, 'steps': 73329, 'loss/train': 1.4669318199157715} 11/07/2021 07:31:22 - INFO - __main__ - Step 73331: {'lr': 0.0002641614230765156, 'samples': 14079552, 'steps': 73330, 'loss/train': 1.5660618543624878} 11/07/2021 07:31:22 - INFO - __main__ - Step 73332: {'lr': 0.00026415612484978577, 'samples': 14079744, 'steps': 73331, 'loss/train': 1.2396941184997559} 11/07/2021 07:31:23 - INFO - __main__ - Step 73333: {'lr': 0.00026415082661667734, 'samples': 14079936, 'steps': 73332, 'loss/train': 1.4296271800994873} 11/07/2021 07:31:23 - INFO - __main__ - Step 73334: {'lr': 0.0002641455283771928, 'samples': 14080128, 'steps': 73333, 'loss/train': 1.4232641458511353} 11/07/2021 07:31:24 - INFO - __main__ - Step 73335: {'lr': 0.00026414023013133446, 'samples': 14080320, 'steps': 73334, 'loss/train': 1.0325562953948975} 11/07/2021 07:31:25 - INFO - __main__ - Step 73336: {'lr': 0.0002641349318791048, 'samples': 14080512, 'steps': 73335, 'loss/train': 1.3605642318725586} 11/07/2021 07:31:25 - INFO - __main__ - Step 73337: {'lr': 0.0002641296336205062, 'samples': 14080704, 'steps': 73336, 'loss/train': 1.6466840505599976} 11/07/2021 07:31:25 - INFO - __main__ - Step 73338: {'lr': 0.00026412433535554094, 'samples': 14080896, 'steps': 73337, 'loss/train': 2.6825690269470215} 11/07/2021 07:31:26 - INFO - __main__ - Step 73339: {'lr': 0.0002641190370842114, 'samples': 14081088, 'steps': 73338, 'loss/train': 1.1557095050811768} 11/07/2021 07:31:27 - INFO - __main__ - Step 73340: {'lr': 0.0002641137388065201, 'samples': 14081280, 'steps': 73339, 'loss/train': 1.282266616821289} 11/07/2021 07:31:27 - INFO - __main__ - Step 73341: {'lr': 0.0002641084405224694, 'samples': 14081472, 'steps': 73340, 'loss/train': 1.422275424003601} 11/07/2021 07:31:27 - INFO - __main__ - Step 73342: {'lr': 0.0002641031422320616, 'samples': 14081664, 'steps': 73341, 'loss/train': 1.5566891431808472} 11/07/2021 07:31:28 - INFO - __main__ - Step 73343: {'lr': 0.0002640978439352993, 'samples': 14081856, 'steps': 73342, 'loss/train': 0.9899210333824158} 11/07/2021 07:31:28 - INFO - __main__ - Step 73344: {'lr': 0.00026409254563218457, 'samples': 14082048, 'steps': 73343, 'loss/train': 1.3463835716247559} 11/07/2021 07:31:29 - INFO - __main__ - Step 73345: {'lr': 0.00026408724732272, 'samples': 14082240, 'steps': 73344, 'loss/train': 0.9350692629814148} 11/07/2021 07:31:30 - INFO - __main__ - Step 73346: {'lr': 0.0002640819490069079, 'samples': 14082432, 'steps': 73345, 'loss/train': 1.388279676437378} 11/07/2021 07:31:30 - INFO - __main__ - Step 73347: {'lr': 0.00026407665068475073, 'samples': 14082624, 'steps': 73346, 'loss/train': 1.6500825881958008} 11/07/2021 07:31:31 - INFO - __main__ - Step 73348: {'lr': 0.0002640713523562508, 'samples': 14082816, 'steps': 73347, 'loss/train': 0.8155208230018616} 11/07/2021 07:31:31 - INFO - __main__ - Step 73349: {'lr': 0.00026406605402141053, 'samples': 14083008, 'steps': 73348, 'loss/train': 1.767763376235962} 11/07/2021 07:31:31 - INFO - __main__ - Step 73350: {'lr': 0.0002640607556802324, 'samples': 14083200, 'steps': 73349, 'loss/train': 0.15756888687610626} 11/07/2021 07:31:32 - INFO - __main__ - Step 73351: {'lr': 0.0002640554573327187, 'samples': 14083392, 'steps': 73350, 'loss/train': 1.6314390897750854} 11/07/2021 07:31:33 - INFO - __main__ - Step 73352: {'lr': 0.00026405015897887173, 'samples': 14083584, 'steps': 73351, 'loss/train': 1.3551455736160278} 11/07/2021 07:31:33 - INFO - __main__ - Step 73353: {'lr': 0.00026404486061869405, 'samples': 14083776, 'steps': 73352, 'loss/train': 1.3141658306121826} 11/07/2021 07:31:33 - INFO - __main__ - Step 73354: {'lr': 0.00026403956225218793, 'samples': 14083968, 'steps': 73353, 'loss/train': 1.38869047164917} 11/07/2021 07:31:34 - INFO - __main__ - Step 73355: {'lr': 0.0002640342638793558, 'samples': 14084160, 'steps': 73354, 'loss/train': 1.2493623495101929} 11/07/2021 07:31:35 - INFO - __main__ - Step 73356: {'lr': 0.0002640289655002001, 'samples': 14084352, 'steps': 73355, 'loss/train': 1.2432339191436768} 11/07/2021 07:31:35 - INFO - __main__ - Step 73357: {'lr': 0.00026402366711472317, 'samples': 14084544, 'steps': 73356, 'loss/train': 1.7879537343978882} 11/07/2021 07:31:35 - INFO - __main__ - Step 73358: {'lr': 0.00026401836872292733, 'samples': 14084736, 'steps': 73357, 'loss/train': 1.5465724468231201} 11/07/2021 07:31:36 - INFO - __main__ - Step 73359: {'lr': 0.00026401307032481504, 'samples': 14084928, 'steps': 73358, 'loss/train': 1.1337248086929321} 11/07/2021 07:31:36 - INFO - __main__ - Step 73360: {'lr': 0.00026400777192038874, 'samples': 14085120, 'steps': 73359, 'loss/train': 0.7039411664009094} 11/07/2021 07:31:37 - INFO - __main__ - Step 73361: {'lr': 0.00026400247350965065, 'samples': 14085312, 'steps': 73360, 'loss/train': 1.4627554416656494} 11/07/2021 07:31:38 - INFO - __main__ - Step 73362: {'lr': 0.0002639971750926033, 'samples': 14085504, 'steps': 73361, 'loss/train': 1.1057943105697632} 11/07/2021 07:31:38 - INFO - __main__ - Step 73363: {'lr': 0.0002639918766692491, 'samples': 14085696, 'steps': 73362, 'loss/train': 1.473806381225586} 11/07/2021 07:31:38 - INFO - __main__ - Step 73364: {'lr': 0.00026398657823959034, 'samples': 14085888, 'steps': 73363, 'loss/train': 1.6138883829116821} 11/07/2021 07:31:39 - INFO - __main__ - Step 73365: {'lr': 0.0002639812798036294, 'samples': 14086080, 'steps': 73364, 'loss/train': 1.7606889009475708} 11/07/2021 07:31:40 - INFO - __main__ - Step 73366: {'lr': 0.00026397598136136875, 'samples': 14086272, 'steps': 73365, 'loss/train': 1.3390576839447021} 11/07/2021 07:31:40 - INFO - __main__ - Step 73367: {'lr': 0.00026397068291281076, 'samples': 14086464, 'steps': 73366, 'loss/train': 0.5490933060646057} 11/07/2021 07:31:40 - INFO - __main__ - Step 73368: {'lr': 0.0002639653844579578, 'samples': 14086656, 'steps': 73367, 'loss/train': 1.3904494047164917} 11/07/2021 07:31:41 - INFO - __main__ - Step 73369: {'lr': 0.00026396008599681214, 'samples': 14086848, 'steps': 73368, 'loss/train': 1.3192704916000366} 11/07/2021 07:31:41 - INFO - __main__ - Step 73370: {'lr': 0.00026395478752937646, 'samples': 14087040, 'steps': 73369, 'loss/train': 1.3937463760375977} 11/07/2021 07:31:42 - INFO - __main__ - Step 73371: {'lr': 0.0002639494890556529, 'samples': 14087232, 'steps': 73370, 'loss/train': 1.5292079448699951} 11/07/2021 07:31:43 - INFO - __main__ - Step 73372: {'lr': 0.0002639441905756438, 'samples': 14087424, 'steps': 73371, 'loss/train': 1.5826969146728516} 11/07/2021 07:31:43 - INFO - __main__ - Step 73373: {'lr': 0.0002639388920893518, 'samples': 14087616, 'steps': 73372, 'loss/train': 1.3579381704330444} 11/07/2021 07:31:43 - INFO - __main__ - Step 73374: {'lr': 0.00026393359359677904, 'samples': 14087808, 'steps': 73373, 'loss/train': 1.2990039587020874} 11/07/2021 07:31:44 - INFO - __main__ - Step 73375: {'lr': 0.0002639282950979281, 'samples': 14088000, 'steps': 73374, 'loss/train': 5.569647312164307} 11/07/2021 07:31:44 - INFO - __main__ - Step 73376: {'lr': 0.0002639229965928013, 'samples': 14088192, 'steps': 73375, 'loss/train': 1.1307498216629028} 11/07/2021 07:31:45 - INFO - __main__ - Step 73377: {'lr': 0.00026391769808140097, 'samples': 14088384, 'steps': 73376, 'loss/train': 0.11651400476694107} 11/07/2021 07:31:45 - INFO - __main__ - Step 73378: {'lr': 0.00026391239956372953, 'samples': 14088576, 'steps': 73377, 'loss/train': 1.4104939699172974} 11/07/2021 07:31:46 - INFO - __main__ - Step 73379: {'lr': 0.00026390710103978946, 'samples': 14088768, 'steps': 73378, 'loss/train': 1.642152190208435} 11/07/2021 07:31:46 - INFO - __main__ - Step 73380: {'lr': 0.00026390180250958296, 'samples': 14088960, 'steps': 73379, 'loss/train': 0.9494820833206177} 11/07/2021 07:31:46 - INFO - __main__ - Step 73381: {'lr': 0.0002638965039731126, 'samples': 14089152, 'steps': 73380, 'loss/train': 1.634764313697815} 11/07/2021 07:31:48 - INFO - __main__ - Step 73382: {'lr': 0.00026389120543038064, 'samples': 14089344, 'steps': 73381, 'loss/train': 1.6432429552078247} 11/07/2021 07:31:48 - INFO - __main__ - Step 73383: {'lr': 0.00026388590688138954, 'samples': 14089536, 'steps': 73382, 'loss/train': 0.7487064003944397} 11/07/2021 07:31:48 - INFO - __main__ - Step 73384: {'lr': 0.00026388060832614166, 'samples': 14089728, 'steps': 73383, 'loss/train': 0.8557178378105164} 11/07/2021 07:31:49 - INFO - __main__ - Step 73385: {'lr': 0.00026387530976463934, 'samples': 14089920, 'steps': 73384, 'loss/train': 0.44739994406700134} 11/07/2021 07:31:49 - INFO - __main__ - Step 73386: {'lr': 0.0002638700111968851, 'samples': 14090112, 'steps': 73385, 'loss/train': 1.727648377418518} 11/07/2021 07:31:50 - INFO - __main__ - Step 73387: {'lr': 0.00026386471262288127, 'samples': 14090304, 'steps': 73386, 'loss/train': 1.3494420051574707} 11/07/2021 07:31:50 - INFO - __main__ - Step 73388: {'lr': 0.00026385941404263007, 'samples': 14090496, 'steps': 73387, 'loss/train': 1.5842058658599854} 11/07/2021 07:31:51 - INFO - __main__ - Step 73389: {'lr': 0.0002638541154561341, 'samples': 14090688, 'steps': 73388, 'loss/train': 1.4937677383422852} 11/07/2021 07:31:51 - INFO - __main__ - Step 73390: {'lr': 0.00026384881686339573, 'samples': 14090880, 'steps': 73389, 'loss/train': 1.1698782444000244} 11/07/2021 07:31:52 - INFO - __main__ - Step 73391: {'lr': 0.00026384351826441726, 'samples': 14091072, 'steps': 73390, 'loss/train': 1.5266245603561401} 11/07/2021 07:31:52 - INFO - __main__ - Step 73392: {'lr': 0.00026383821965920116, 'samples': 14091264, 'steps': 73391, 'loss/train': 1.5169172286987305} 11/07/2021 07:31:53 - INFO - __main__ - Step 73393: {'lr': 0.00026383292104774976, 'samples': 14091456, 'steps': 73392, 'loss/train': 1.5932095050811768} 11/07/2021 07:31:53 - INFO - __main__ - Step 73394: {'lr': 0.0002638276224300654, 'samples': 14091648, 'steps': 73393, 'loss/train': 1.2809420824050903} 11/07/2021 07:31:53 - INFO - __main__ - Step 73395: {'lr': 0.00026382232380615055, 'samples': 14091840, 'steps': 73394, 'loss/train': 1.115753412246704} 11/07/2021 07:31:54 - INFO - __main__ - Step 73396: {'lr': 0.0002638170251760076, 'samples': 14092032, 'steps': 73395, 'loss/train': 1.4338129758834839} 11/07/2021 07:31:55 - INFO - __main__ - Step 73397: {'lr': 0.00026381172653963886, 'samples': 14092224, 'steps': 73396, 'loss/train': 1.1834611892700195} 11/07/2021 07:31:55 - INFO - __main__ - Step 73398: {'lr': 0.00026380642789704684, 'samples': 14092416, 'steps': 73397, 'loss/train': 1.3882966041564941} 11/07/2021 07:31:55 - INFO - __main__ - Step 73399: {'lr': 0.0002638011292482338, 'samples': 14092608, 'steps': 73398, 'loss/train': 1.6943867206573486} 11/07/2021 07:31:56 - INFO - __main__ - Step 73400: {'lr': 0.0002637958305932022, 'samples': 14092800, 'steps': 73399, 'loss/train': 2.420372247695923} 11/07/2021 07:31:56 - INFO - __main__ - Step 73401: {'lr': 0.0002637905319319544, 'samples': 14092992, 'steps': 73400, 'loss/train': 1.5535094738006592} 11/07/2021 07:31:57 - INFO - __main__ - Step 73402: {'lr': 0.00026378523326449284, 'samples': 14093184, 'steps': 73401, 'loss/train': 1.684964895248413} 11/07/2021 07:31:58 - INFO - __main__ - Step 73403: {'lr': 0.0002637799345908199, 'samples': 14093376, 'steps': 73402, 'loss/train': 1.778743028640747} 11/07/2021 07:31:58 - INFO - __main__ - Step 73404: {'lr': 0.0002637746359109379, 'samples': 14093568, 'steps': 73403, 'loss/train': 1.6014965772628784} 11/07/2021 07:31:58 - INFO - __main__ - Step 73405: {'lr': 0.0002637693372248492, 'samples': 14093760, 'steps': 73404, 'loss/train': 1.3574732542037964} 11/07/2021 07:31:59 - INFO - __main__ - Step 73406: {'lr': 0.00026376403853255626, 'samples': 14093952, 'steps': 73405, 'loss/train': 1.2612240314483643} 11/07/2021 07:32:00 - INFO - __main__ - Step 73407: {'lr': 0.0002637587398340615, 'samples': 14094144, 'steps': 73406, 'loss/train': 1.4275256395339966} 11/07/2021 07:32:00 - INFO - __main__ - Step 73408: {'lr': 0.0002637534411293672, 'samples': 14094336, 'steps': 73407, 'loss/train': 1.142673134803772} 11/07/2021 07:32:00 - INFO - __main__ - Step 73409: {'lr': 0.00026374814241847584, 'samples': 14094528, 'steps': 73408, 'loss/train': 1.5304107666015625} 11/07/2021 07:32:01 - INFO - __main__ - Step 73410: {'lr': 0.00026374284370138986, 'samples': 14094720, 'steps': 73409, 'loss/train': 1.7202240228652954} 11/07/2021 07:32:01 - INFO - __main__ - Step 73411: {'lr': 0.00026373754497811147, 'samples': 14094912, 'steps': 73410, 'loss/train': 1.1492431163787842} 11/07/2021 07:32:01 - INFO - __main__ - Step 73412: {'lr': 0.00026373224624864325, 'samples': 14095104, 'steps': 73411, 'loss/train': 1.6635422706604004} 11/07/2021 07:32:02 - INFO - __main__ - Step 73413: {'lr': 0.0002637269475129874, 'samples': 14095296, 'steps': 73412, 'loss/train': 1.4766303300857544} 11/07/2021 07:32:03 - INFO - __main__ - Step 73414: {'lr': 0.0002637216487711464, 'samples': 14095488, 'steps': 73413, 'loss/train': 1.4677785634994507} 11/07/2021 07:32:03 - INFO - __main__ - Step 73415: {'lr': 0.0002637163500231227, 'samples': 14095680, 'steps': 73414, 'loss/train': 1.331092119216919} 11/07/2021 07:32:03 - INFO - __main__ - Step 73416: {'lr': 0.00026371105126891855, 'samples': 14095872, 'steps': 73415, 'loss/train': 1.0295296907424927} 11/07/2021 07:32:04 - INFO - __main__ - Step 73417: {'lr': 0.0002637057525085365, 'samples': 14096064, 'steps': 73416, 'loss/train': 1.280662178993225} 11/07/2021 07:32:05 - INFO - __main__ - Step 73418: {'lr': 0.0002637004537419788, 'samples': 14096256, 'steps': 73417, 'loss/train': 1.7180122137069702} 11/07/2021 07:32:05 - INFO - __main__ - Step 73419: {'lr': 0.0002636951549692478, 'samples': 14096448, 'steps': 73418, 'loss/train': 0.9246035814285278} 11/07/2021 07:32:06 - INFO - __main__ - Step 73420: {'lr': 0.0002636898561903461, 'samples': 14096640, 'steps': 73419, 'loss/train': 1.5742396116256714} 11/07/2021 07:32:06 - INFO - __main__ - Step 73421: {'lr': 0.00026368455740527594, 'samples': 14096832, 'steps': 73420, 'loss/train': 1.6525344848632812} 11/07/2021 07:32:06 - INFO - __main__ - Step 73422: {'lr': 0.0002636792586140397, 'samples': 14097024, 'steps': 73421, 'loss/train': 1.5796130895614624} 11/07/2021 07:32:07 - INFO - __main__ - Step 73423: {'lr': 0.0002636739598166398, 'samples': 14097216, 'steps': 73422, 'loss/train': 1.250181794166565} 11/07/2021 07:32:08 - INFO - __main__ - Step 73424: {'lr': 0.0002636686610130787, 'samples': 14097408, 'steps': 73423, 'loss/train': 2.6074676513671875} 11/07/2021 07:32:08 - INFO - __main__ - Step 73425: {'lr': 0.00026366336220335864, 'samples': 14097600, 'steps': 73424, 'loss/train': 1.6833827495574951} 11/07/2021 07:32:08 - INFO - __main__ - Step 73426: {'lr': 0.00026365806338748206, 'samples': 14097792, 'steps': 73425, 'loss/train': 0.97645503282547} 11/07/2021 07:32:09 - INFO - __main__ - Step 73427: {'lr': 0.0002636527645654514, 'samples': 14097984, 'steps': 73426, 'loss/train': 1.516000747680664} 11/07/2021 07:32:10 - INFO - __main__ - Step 73428: {'lr': 0.000263647465737269, 'samples': 14098176, 'steps': 73427, 'loss/train': 1.2044419050216675} 11/07/2021 07:32:10 - INFO - __main__ - Step 73429: {'lr': 0.00026364216690293724, 'samples': 14098368, 'steps': 73428, 'loss/train': 1.3205734491348267} 11/07/2021 07:32:11 - INFO - __main__ - Step 73430: {'lr': 0.00026363686806245865, 'samples': 14098560, 'steps': 73429, 'loss/train': 1.4832154512405396} 11/07/2021 07:32:11 - INFO - __main__ - Step 73431: {'lr': 0.00026363156921583534, 'samples': 14098752, 'steps': 73430, 'loss/train': 1.3717174530029297} 11/07/2021 07:32:11 - INFO - __main__ - Step 73432: {'lr': 0.00026362627036306997, 'samples': 14098944, 'steps': 73431, 'loss/train': 1.5484471321105957} 11/07/2021 07:32:13 - INFO - __main__ - Step 73433: {'lr': 0.00026362097150416477, 'samples': 14099136, 'steps': 73432, 'loss/train': 1.1530917882919312} 11/07/2021 07:32:13 - INFO - __main__ - Step 73434: {'lr': 0.0002636156726391221, 'samples': 14099328, 'steps': 73433, 'loss/train': 1.676065444946289} 11/07/2021 07:32:13 - INFO - __main__ - Step 73435: {'lr': 0.0002636103737679445, 'samples': 14099520, 'steps': 73434, 'loss/train': 0.9192968606948853} 11/07/2021 07:32:14 - INFO - __main__ - Step 73436: {'lr': 0.0002636050748906343, 'samples': 14099712, 'steps': 73435, 'loss/train': 1.632651925086975} 11/07/2021 07:32:14 - INFO - __main__ - Step 73437: {'lr': 0.0002635997760071939, 'samples': 14099904, 'steps': 73436, 'loss/train': 1.6834776401519775} 11/07/2021 07:32:14 - INFO - __main__ - Step 73438: {'lr': 0.00026359447711762554, 'samples': 14100096, 'steps': 73437, 'loss/train': 0.1142604798078537} 11/07/2021 07:32:15 - INFO - __main__ - Step 73439: {'lr': 0.00026358917822193173, 'samples': 14100288, 'steps': 73438, 'loss/train': 1.4892265796661377} 11/07/2021 07:32:16 - INFO - __main__ - Step 73440: {'lr': 0.00026358387932011484, 'samples': 14100480, 'steps': 73439, 'loss/train': 0.8656641840934753} 11/07/2021 07:32:16 - INFO - __main__ - Step 73441: {'lr': 0.0002635785804121773, 'samples': 14100672, 'steps': 73440, 'loss/train': 1.4229109287261963} 11/07/2021 07:32:17 - INFO - __main__ - Step 73442: {'lr': 0.00026357328149812144, 'samples': 14100864, 'steps': 73441, 'loss/train': 1.3376154899597168} 11/07/2021 07:32:17 - INFO - __main__ - Step 73443: {'lr': 0.00026356798257794965, 'samples': 14101056, 'steps': 73442, 'loss/train': 1.074589729309082} 11/07/2021 07:32:18 - INFO - __main__ - Step 73444: {'lr': 0.0002635626836516645, 'samples': 14101248, 'steps': 73443, 'loss/train': 1.471399188041687} 11/07/2021 07:32:18 - INFO - __main__ - Step 73445: {'lr': 0.00026355738471926804, 'samples': 14101440, 'steps': 73444, 'loss/train': 1.6229294538497925} 11/07/2021 07:32:19 - INFO - __main__ - Step 73446: {'lr': 0.0002635520857807629, 'samples': 14101632, 'steps': 73445, 'loss/train': 1.0469366312026978} 11/07/2021 07:32:19 - INFO - __main__ - Step 73447: {'lr': 0.00026354678683615133, 'samples': 14101824, 'steps': 73446, 'loss/train': 1.026949167251587} 11/07/2021 07:32:19 - INFO - __main__ - Step 73448: {'lr': 0.0002635414878854359, 'samples': 14102016, 'steps': 73447, 'loss/train': 1.5655450820922852} 11/07/2021 07:32:20 - INFO - __main__ - Step 73449: {'lr': 0.00026353618892861877, 'samples': 14102208, 'steps': 73448, 'loss/train': 1.3348051309585571} 11/07/2021 07:32:21 - INFO - __main__ - Step 73450: {'lr': 0.0002635308899657025, 'samples': 14102400, 'steps': 73449, 'loss/train': 1.5777565240859985} 11/07/2021 07:32:21 - INFO - __main__ - Step 73451: {'lr': 0.00026352559099668943, 'samples': 14102592, 'steps': 73450, 'loss/train': 1.1697922945022583} 11/07/2021 07:32:21 - INFO - __main__ - Step 73452: {'lr': 0.0002635202920215819, 'samples': 14102784, 'steps': 73451, 'loss/train': 1.4078989028930664} 11/07/2021 07:32:22 - INFO - __main__ - Step 73453: {'lr': 0.00026351499304038236, 'samples': 14102976, 'steps': 73452, 'loss/train': 1.345325231552124} 11/07/2021 07:32:23 - INFO - __main__ - Step 73454: {'lr': 0.00026350969405309314, 'samples': 14103168, 'steps': 73453, 'loss/train': 1.5120264291763306} 11/07/2021 07:32:23 - INFO - __main__ - Step 73455: {'lr': 0.0002635043950597167, 'samples': 14103360, 'steps': 73454, 'loss/train': 1.4767383337020874} 11/07/2021 07:32:24 - INFO - __main__ - Step 73456: {'lr': 0.00026349909606025534, 'samples': 14103552, 'steps': 73455, 'loss/train': 1.61503005027771} 11/07/2021 07:32:24 - INFO - __main__ - Step 73457: {'lr': 0.00026349379705471157, 'samples': 14103744, 'steps': 73456, 'loss/train': 1.0419584512710571} 11/07/2021 07:32:24 - INFO - __main__ - Step 73458: {'lr': 0.00026348849804308766, 'samples': 14103936, 'steps': 73457, 'loss/train': 0.6534438729286194} 11/07/2021 07:32:25 - INFO - __main__ - Step 73459: {'lr': 0.000263483199025386, 'samples': 14104128, 'steps': 73458, 'loss/train': 0.8691619634628296} 11/07/2021 07:32:26 - INFO - __main__ - Step 73460: {'lr': 0.00026347790000160907, 'samples': 14104320, 'steps': 73459, 'loss/train': 0.9545444250106812} 11/07/2021 07:32:26 - INFO - __main__ - Step 73461: {'lr': 0.00026347260097175923, 'samples': 14104512, 'steps': 73460, 'loss/train': 1.3606492280960083} 11/07/2021 07:32:26 - INFO - __main__ - Step 73462: {'lr': 0.0002634673019358388, 'samples': 14104704, 'steps': 73461, 'loss/train': 1.3364956378936768} 11/07/2021 07:32:27 - INFO - __main__ - Step 73463: {'lr': 0.0002634620028938502, 'samples': 14104896, 'steps': 73462, 'loss/train': 1.8914821147918701} 11/07/2021 07:32:28 - INFO - __main__ - Step 73464: {'lr': 0.0002634567038457959, 'samples': 14105088, 'steps': 73463, 'loss/train': 1.5569238662719727} 11/07/2021 07:32:28 - INFO - __main__ - Step 73465: {'lr': 0.0002634514047916782, 'samples': 14105280, 'steps': 73464, 'loss/train': 1.9267317056655884} 11/07/2021 07:32:28 - INFO - __main__ - Step 73466: {'lr': 0.00026344610573149943, 'samples': 14105472, 'steps': 73465, 'loss/train': 0.9405404329299927} 11/07/2021 07:32:29 - INFO - __main__ - Step 73467: {'lr': 0.0002634408066652621, 'samples': 14105664, 'steps': 73466, 'loss/train': 1.4130264520645142} 11/07/2021 07:32:29 - INFO - __main__ - Step 73468: {'lr': 0.00026343550759296854, 'samples': 14105856, 'steps': 73467, 'loss/train': 0.9245700240135193} 11/07/2021 07:32:30 - INFO - __main__ - Step 73469: {'lr': 0.00026343020851462114, 'samples': 14106048, 'steps': 73468, 'loss/train': 1.1799824237823486} 11/07/2021 07:32:31 - INFO - __main__ - Step 73470: {'lr': 0.00026342490943022227, 'samples': 14106240, 'steps': 73469, 'loss/train': 1.1505454778671265} 11/07/2021 07:32:31 - INFO - __main__ - Step 73471: {'lr': 0.00026341961033977447, 'samples': 14106432, 'steps': 73470, 'loss/train': 1.1919238567352295} 11/07/2021 07:32:31 - INFO - __main__ - Step 73472: {'lr': 0.00026341431124327986, 'samples': 14106624, 'steps': 73471, 'loss/train': 1.0279431343078613} 11/07/2021 07:32:32 - INFO - __main__ - Step 73473: {'lr': 0.00026340901214074103, 'samples': 14106816, 'steps': 73472, 'loss/train': 1.0941205024719238} 11/07/2021 07:32:33 - INFO - __main__ - Step 73474: {'lr': 0.00026340371303216033, 'samples': 14107008, 'steps': 73473, 'loss/train': 1.455562949180603} 11/07/2021 07:32:33 - INFO - __main__ - Step 73475: {'lr': 0.00026339841391754003, 'samples': 14107200, 'steps': 73474, 'loss/train': 1.4670759439468384} 11/07/2021 07:32:33 - INFO - __main__ - Step 73476: {'lr': 0.00026339311479688267, 'samples': 14107392, 'steps': 73475, 'loss/train': 1.287381887435913} 11/07/2021 07:32:34 - INFO - __main__ - Step 73477: {'lr': 0.00026338781567019064, 'samples': 14107584, 'steps': 73476, 'loss/train': 0.9314685463905334} 11/07/2021 07:32:34 - INFO - __main__ - Step 73478: {'lr': 0.0002633825165374662, 'samples': 14107776, 'steps': 73477, 'loss/train': 1.8386765718460083} 11/07/2021 07:32:35 - INFO - __main__ - Step 73479: {'lr': 0.0002633772173987118, 'samples': 14107968, 'steps': 73478, 'loss/train': 1.8100589513778687} 11/07/2021 07:32:35 - INFO - __main__ - Step 73480: {'lr': 0.00026337191825392985, 'samples': 14108160, 'steps': 73479, 'loss/train': 1.4622434377670288} 11/07/2021 07:32:36 - INFO - __main__ - Step 73481: {'lr': 0.00026336661910312273, 'samples': 14108352, 'steps': 73480, 'loss/train': 1.2313270568847656} 11/07/2021 07:32:36 - INFO - __main__ - Step 73482: {'lr': 0.00026336131994629275, 'samples': 14108544, 'steps': 73481, 'loss/train': 1.4556039571762085} 11/07/2021 07:32:36 - INFO - __main__ - Step 73483: {'lr': 0.0002633560207834425, 'samples': 14108736, 'steps': 73482, 'loss/train': 1.0425788164138794} 11/07/2021 07:32:37 - INFO - __main__ - Step 73484: {'lr': 0.0002633507216145741, 'samples': 14108928, 'steps': 73483, 'loss/train': 1.4366062879562378} 11/07/2021 07:32:38 - INFO - __main__ - Step 73485: {'lr': 0.0002633454224396901, 'samples': 14109120, 'steps': 73484, 'loss/train': 0.9920408725738525} 11/07/2021 07:32:38 - INFO - __main__ - Step 73486: {'lr': 0.0002633401232587929, 'samples': 14109312, 'steps': 73485, 'loss/train': 1.1125853061676025} 11/07/2021 07:32:38 - INFO - __main__ - Step 73487: {'lr': 0.0002633348240718848, 'samples': 14109504, 'steps': 73486, 'loss/train': 0.8118493556976318} 11/07/2021 07:32:39 - INFO - __main__ - Step 73488: {'lr': 0.0002633295248789683, 'samples': 14109696, 'steps': 73487, 'loss/train': 1.750980019569397} 11/07/2021 07:32:40 - INFO - __main__ - Step 73489: {'lr': 0.00026332422568004567, 'samples': 14109888, 'steps': 73488, 'loss/train': 2.1217169761657715} 11/07/2021 07:32:40 - INFO - __main__ - Step 73490: {'lr': 0.00026331892647511935, 'samples': 14110080, 'steps': 73489, 'loss/train': 1.55418062210083} 11/07/2021 07:32:41 - INFO - __main__ - Step 73491: {'lr': 0.0002633136272641918, 'samples': 14110272, 'steps': 73490, 'loss/train': 1.6086703538894653} 11/07/2021 07:32:41 - INFO - __main__ - Step 73492: {'lr': 0.0002633083280472652, 'samples': 14110464, 'steps': 73491, 'loss/train': 0.6289158463478088} 11/07/2021 07:32:41 - INFO - __main__ - Step 73493: {'lr': 0.0002633030288243422, 'samples': 14110656, 'steps': 73492, 'loss/train': 1.1205071210861206} 11/07/2021 07:32:43 - INFO - __main__ - Step 73494: {'lr': 0.000263297729595425, 'samples': 14110848, 'steps': 73493, 'loss/train': 1.1186702251434326} 11/07/2021 07:32:43 - INFO - __main__ - Step 73495: {'lr': 0.00026329243036051604, 'samples': 14111040, 'steps': 73494, 'loss/train': 1.6333640813827515} 11/07/2021 07:32:43 - INFO - __main__ - Step 73496: {'lr': 0.00026328713111961773, 'samples': 14111232, 'steps': 73495, 'loss/train': 5.795135498046875} 11/07/2021 07:32:44 - INFO - __main__ - Step 73497: {'lr': 0.00026328183187273246, 'samples': 14111424, 'steps': 73496, 'loss/train': 1.4591891765594482} 11/07/2021 07:32:44 - INFO - __main__ - Step 73498: {'lr': 0.0002632765326198626, 'samples': 14111616, 'steps': 73497, 'loss/train': 1.4948734045028687} 11/07/2021 07:32:44 - INFO - __main__ - Step 73499: {'lr': 0.0002632712333610105, 'samples': 14111808, 'steps': 73498, 'loss/train': 1.764873743057251} 11/07/2021 07:32:45 - INFO - __main__ - Step 73500: {'lr': 0.0002632659340961786, 'samples': 14112000, 'steps': 73499, 'loss/train': 1.7636234760284424} 11/07/2021 07:32:46 - INFO - __main__ - Step 73501: {'lr': 0.0002632606348253693, 'samples': 14112192, 'steps': 73500, 'loss/train': 1.7983137369155884} 11/07/2021 07:32:46 - INFO - __main__ - Step 73502: {'lr': 0.00026325533554858496, 'samples': 14112384, 'steps': 73501, 'loss/train': 1.5944185256958008} 11/07/2021 07:32:46 - INFO - __main__ - Step 73503: {'lr': 0.00026325003626582793, 'samples': 14112576, 'steps': 73502, 'loss/train': 0.5772606730461121} 11/07/2021 07:32:47 - INFO - __main__ - Step 73504: {'lr': 0.0002632447369771007, 'samples': 14112768, 'steps': 73503, 'loss/train': 1.6373181343078613} 11/07/2021 07:32:47 - INFO - __main__ - Step 73505: {'lr': 0.0002632394376824056, 'samples': 14112960, 'steps': 73504, 'loss/train': 1.292209267616272} 11/07/2021 07:32:48 - INFO - __main__ - Step 73506: {'lr': 0.00026323413838174497, 'samples': 14113152, 'steps': 73505, 'loss/train': 1.2625137567520142} 11/07/2021 07:32:48 - INFO - __main__ - Step 73507: {'lr': 0.00026322883907512124, 'samples': 14113344, 'steps': 73506, 'loss/train': 1.4801253080368042} 11/07/2021 07:32:49 - INFO - __main__ - Step 73508: {'lr': 0.0002632235397625368, 'samples': 14113536, 'steps': 73507, 'loss/train': 1.3351682424545288} 11/07/2021 07:32:49 - INFO - __main__ - Step 73509: {'lr': 0.000263218240443994, 'samples': 14113728, 'steps': 73508, 'loss/train': 1.083360195159912} 11/07/2021 07:32:49 - INFO - __main__ - Step 73510: {'lr': 0.0002632129411194954, 'samples': 14113920, 'steps': 73509, 'loss/train': 1.185500979423523} 11/07/2021 07:32:51 - INFO - __main__ - Step 73511: {'lr': 0.00026320764178904314, 'samples': 14114112, 'steps': 73510, 'loss/train': 2.594130039215088} 11/07/2021 07:32:51 - INFO - __main__ - Step 73512: {'lr': 0.00026320234245263974, 'samples': 14114304, 'steps': 73511, 'loss/train': 0.7185923457145691} 11/07/2021 07:32:51 - INFO - __main__ - Step 73513: {'lr': 0.0002631970431102876, 'samples': 14114496, 'steps': 73512, 'loss/train': 1.638533115386963} 11/07/2021 07:32:52 - INFO - __main__ - Step 73514: {'lr': 0.00026319174376198903, 'samples': 14114688, 'steps': 73513, 'loss/train': 1.9709815979003906} 11/07/2021 07:32:52 - INFO - __main__ - Step 73515: {'lr': 0.0002631864444077465, 'samples': 14114880, 'steps': 73514, 'loss/train': 1.6382509469985962} 11/07/2021 07:32:53 - INFO - __main__ - Step 73516: {'lr': 0.00026318114504756237, 'samples': 14115072, 'steps': 73515, 'loss/train': 1.274781584739685} 11/07/2021 07:32:53 - INFO - __main__ - Step 73517: {'lr': 0.000263175845681439, 'samples': 14115264, 'steps': 73516, 'loss/train': 1.5506175756454468} 11/07/2021 07:32:54 - INFO - __main__ - Step 73518: {'lr': 0.0002631705463093788, 'samples': 14115456, 'steps': 73517, 'loss/train': 1.6531357765197754} 11/07/2021 07:32:54 - INFO - __main__ - Step 73519: {'lr': 0.00026316524693138413, 'samples': 14115648, 'steps': 73518, 'loss/train': 1.0135602951049805} 11/07/2021 07:32:54 - INFO - __main__ - Step 73520: {'lr': 0.0002631599475474574, 'samples': 14115840, 'steps': 73519, 'loss/train': 1.5630708932876587} 11/07/2021 07:32:55 - INFO - __main__ - Step 73521: {'lr': 0.00026315464815760103, 'samples': 14116032, 'steps': 73520, 'loss/train': 1.6154102087020874} 11/07/2021 07:32:56 - INFO - __main__ - Step 73522: {'lr': 0.00026314934876181734, 'samples': 14116224, 'steps': 73521, 'loss/train': 1.2277296781539917} 11/07/2021 07:32:56 - INFO - __main__ - Step 73523: {'lr': 0.0002631440493601088, 'samples': 14116416, 'steps': 73522, 'loss/train': 0.9306148886680603} 11/07/2021 07:32:56 - INFO - __main__ - Step 73524: {'lr': 0.0002631387499524777, 'samples': 14116608, 'steps': 73523, 'loss/train': 1.5134989023208618} 11/07/2021 07:32:57 - INFO - __main__ - Step 73525: {'lr': 0.00026313345053892656, 'samples': 14116800, 'steps': 73524, 'loss/train': 2.7945797443389893} 11/07/2021 07:32:58 - INFO - __main__ - Step 73526: {'lr': 0.0002631281511194577, 'samples': 14116992, 'steps': 73525, 'loss/train': 1.1906851530075073} 11/07/2021 07:32:58 - INFO - __main__ - Step 73527: {'lr': 0.0002631228516940734, 'samples': 14117184, 'steps': 73526, 'loss/train': 1.067144751548767} 11/07/2021 07:32:58 - INFO - __main__ - Step 73528: {'lr': 0.00026311755226277625, 'samples': 14117376, 'steps': 73527, 'loss/train': 1.091658115386963} 11/07/2021 07:32:59 - INFO - __main__ - Step 73529: {'lr': 0.00026311225282556845, 'samples': 14117568, 'steps': 73528, 'loss/train': 1.8592686653137207} 11/07/2021 07:32:59 - INFO - __main__ - Step 73530: {'lr': 0.0002631069533824525, 'samples': 14117760, 'steps': 73529, 'loss/train': 1.8463764190673828} 11/07/2021 07:33:00 - INFO - __main__ - Step 73531: {'lr': 0.0002631016539334307, 'samples': 14117952, 'steps': 73530, 'loss/train': 1.0285314321517944} 11/07/2021 07:33:00 - INFO - __main__ - Step 73532: {'lr': 0.0002630963544785056, 'samples': 14118144, 'steps': 73531, 'loss/train': 1.546919822692871} 11/07/2021 07:33:01 - INFO - __main__ - Step 73533: {'lr': 0.00026309105501767945, 'samples': 14118336, 'steps': 73532, 'loss/train': 1.5371739864349365} 11/07/2021 07:33:01 - INFO - __main__ - Step 73534: {'lr': 0.0002630857555509547, 'samples': 14118528, 'steps': 73533, 'loss/train': 1.8549715280532837} 11/07/2021 07:33:01 - INFO - __main__ - Step 73535: {'lr': 0.00026308045607833364, 'samples': 14118720, 'steps': 73534, 'loss/train': 1.9854737520217896} 11/07/2021 07:33:02 - INFO - __main__ - Step 73536: {'lr': 0.0002630751565998187, 'samples': 14118912, 'steps': 73535, 'loss/train': 1.3013259172439575} 11/07/2021 07:33:03 - INFO - __main__ - Step 73537: {'lr': 0.0002630698571154124, 'samples': 14119104, 'steps': 73536, 'loss/train': 5.611090660095215} 11/07/2021 07:33:03 - INFO - __main__ - Step 73538: {'lr': 0.000263064557625117, 'samples': 14119296, 'steps': 73537, 'loss/train': 1.4265620708465576} 11/07/2021 07:33:03 - INFO - __main__ - Step 73539: {'lr': 0.0002630592581289349, 'samples': 14119488, 'steps': 73538, 'loss/train': 1.720953106880188} 11/07/2021 07:33:04 - INFO - __main__ - Step 73540: {'lr': 0.0002630539586268685, 'samples': 14119680, 'steps': 73539, 'loss/train': 1.6679341793060303} 11/07/2021 07:33:05 - INFO - __main__ - Step 73541: {'lr': 0.0002630486591189202, 'samples': 14119872, 'steps': 73540, 'loss/train': 1.28114652633667} 11/07/2021 07:33:05 - INFO - __main__ - Step 73542: {'lr': 0.0002630433596050923, 'samples': 14120064, 'steps': 73541, 'loss/train': 0.6228081583976746} 11/07/2021 07:33:06 - INFO - __main__ - Step 73543: {'lr': 0.00026303806008538735, 'samples': 14120256, 'steps': 73542, 'loss/train': 0.8615932464599609} 11/07/2021 07:33:06 - INFO - __main__ - Step 73544: {'lr': 0.00026303276055980764, 'samples': 14120448, 'steps': 73543, 'loss/train': 1.3157072067260742} 11/07/2021 07:33:06 - INFO - __main__ - Step 73545: {'lr': 0.0002630274610283555, 'samples': 14120640, 'steps': 73544, 'loss/train': 1.5275830030441284} 11/07/2021 07:33:07 - INFO - __main__ - Step 73546: {'lr': 0.00026302216149103345, 'samples': 14120832, 'steps': 73545, 'loss/train': 1.7149388790130615} 11/07/2021 07:33:08 - INFO - __main__ - Step 73547: {'lr': 0.0002630168619478438, 'samples': 14121024, 'steps': 73546, 'loss/train': 1.272601842880249} 11/07/2021 07:33:08 - INFO - __main__ - Step 73548: {'lr': 0.00026301156239878895, 'samples': 14121216, 'steps': 73547, 'loss/train': 1.5865328311920166} 11/07/2021 07:33:08 - INFO - __main__ - Step 73549: {'lr': 0.0002630062628438713, 'samples': 14121408, 'steps': 73548, 'loss/train': 1.307597279548645} 11/07/2021 07:33:09 - INFO - __main__ - Step 73550: {'lr': 0.0002630009632830932, 'samples': 14121600, 'steps': 73549, 'loss/train': 1.7125896215438843} 11/07/2021 07:33:09 - INFO - __main__ - Step 73551: {'lr': 0.00026299566371645715, 'samples': 14121792, 'steps': 73550, 'loss/train': 1.328169345855713} 11/07/2021 07:33:10 - INFO - __main__ - Step 73552: {'lr': 0.00026299036414396536, 'samples': 14121984, 'steps': 73551, 'loss/train': 1.5338331460952759} 11/07/2021 07:33:11 - INFO - __main__ - Step 73553: {'lr': 0.0002629850645656204, 'samples': 14122176, 'steps': 73552, 'loss/train': 1.7469751834869385} 11/07/2021 07:33:11 - INFO - __main__ - Step 73554: {'lr': 0.00026297976498142444, 'samples': 14122368, 'steps': 73553, 'loss/train': 1.8398361206054688} 11/07/2021 07:33:11 - INFO - __main__ - Step 73555: {'lr': 0.0002629744653913801, 'samples': 14122560, 'steps': 73554, 'loss/train': 1.3309578895568848} 11/07/2021 07:33:12 - INFO - __main__ - Step 73556: {'lr': 0.00026296916579548964, 'samples': 14122752, 'steps': 73555, 'loss/train': 1.5772515535354614} 11/07/2021 07:33:12 - INFO - __main__ - Step 73557: {'lr': 0.00026296386619375546, 'samples': 14122944, 'steps': 73556, 'loss/train': 2.0114693641662598} 11/07/2021 07:33:13 - INFO - __main__ - Step 73558: {'lr': 0.00026295856658618003, 'samples': 14123136, 'steps': 73557, 'loss/train': 1.3324248790740967} 11/07/2021 07:33:13 - INFO - __main__ - Step 73559: {'lr': 0.00026295326697276563, 'samples': 14123328, 'steps': 73558, 'loss/train': 1.4648255109786987} 11/07/2021 07:33:14 - INFO - __main__ - Step 73560: {'lr': 0.0002629479673535146, 'samples': 14123520, 'steps': 73559, 'loss/train': 1.4794964790344238} 11/07/2021 07:33:14 - INFO - __main__ - Step 73561: {'lr': 0.0002629426677284295, 'samples': 14123712, 'steps': 73560, 'loss/train': 1.4561183452606201} 11/07/2021 07:33:14 - INFO - __main__ - Step 73562: {'lr': 0.00026293736809751263, 'samples': 14123904, 'steps': 73561, 'loss/train': 1.0168625116348267} 11/07/2021 07:33:15 - INFO - __main__ - Step 73563: {'lr': 0.0002629320684607664, 'samples': 14124096, 'steps': 73562, 'loss/train': 1.733972430229187} 11/07/2021 07:33:16 - INFO - __main__ - Step 73564: {'lr': 0.0002629267688181931, 'samples': 14124288, 'steps': 73563, 'loss/train': 1.6192466020584106} 11/07/2021 07:33:16 - INFO - __main__ - Step 73565: {'lr': 0.0002629214691697953, 'samples': 14124480, 'steps': 73564, 'loss/train': 1.3140153884887695} 11/07/2021 07:33:17 - INFO - __main__ - Step 73566: {'lr': 0.00026291616951557527, 'samples': 14124672, 'steps': 73565, 'loss/train': 1.0091644525527954} 11/07/2021 07:33:17 - INFO - __main__ - Step 73567: {'lr': 0.00026291086985553535, 'samples': 14124864, 'steps': 73566, 'loss/train': 0.8630368709564209} 11/07/2021 07:33:18 - INFO - __main__ - Step 73568: {'lr': 0.00026290557018967804, 'samples': 14125056, 'steps': 73567, 'loss/train': 1.057663083076477} 11/07/2021 07:33:18 - INFO - __main__ - Step 73569: {'lr': 0.0002629002705180056, 'samples': 14125248, 'steps': 73568, 'loss/train': 1.7816413640975952} 11/07/2021 07:33:19 - INFO - __main__ - Step 73570: {'lr': 0.0002628949708405206, 'samples': 14125440, 'steps': 73569, 'loss/train': 1.1799432039260864} 11/07/2021 07:33:19 - INFO - __main__ - Step 73571: {'lr': 0.0002628896711572253, 'samples': 14125632, 'steps': 73570, 'loss/train': 1.5920301675796509} 11/07/2021 07:33:19 - INFO - __main__ - Step 73572: {'lr': 0.0002628843714681221, 'samples': 14125824, 'steps': 73571, 'loss/train': 1.0755422115325928} 11/07/2021 07:33:20 - INFO - __main__ - Step 73573: {'lr': 0.0002628790717732134, 'samples': 14126016, 'steps': 73572, 'loss/train': 1.098687767982483} 11/07/2021 07:33:21 - INFO - __main__ - Step 73574: {'lr': 0.0002628737720725016, 'samples': 14126208, 'steps': 73573, 'loss/train': 1.6414731740951538} 11/07/2021 07:33:21 - INFO - __main__ - Step 73575: {'lr': 0.000262868472365989, 'samples': 14126400, 'steps': 73574, 'loss/train': 1.693454623222351} 11/07/2021 07:33:21 - INFO - __main__ - Step 73576: {'lr': 0.00026286317265367815, 'samples': 14126592, 'steps': 73575, 'loss/train': 1.4261806011199951} 11/07/2021 07:33:22 - INFO - __main__ - Step 73577: {'lr': 0.00026285787293557134, 'samples': 14126784, 'steps': 73576, 'loss/train': 1.5998562574386597} 11/07/2021 07:33:22 - INFO - __main__ - Step 73578: {'lr': 0.000262852573211671, 'samples': 14126976, 'steps': 73577, 'loss/train': 1.585198163986206} 11/07/2021 07:33:23 - INFO - __main__ - Step 73579: {'lr': 0.00026284727348197944, 'samples': 14127168, 'steps': 73578, 'loss/train': 0.9310535192489624} 11/07/2021 07:33:23 - INFO - __main__ - Step 73580: {'lr': 0.0002628419737464991, 'samples': 14127360, 'steps': 73579, 'loss/train': 1.1110384464263916} 11/07/2021 07:33:24 - INFO - __main__ - Step 73581: {'lr': 0.0002628366740052324, 'samples': 14127552, 'steps': 73580, 'loss/train': 1.1002047061920166} 11/07/2021 07:33:24 - INFO - __main__ - Step 73582: {'lr': 0.0002628313742581817, 'samples': 14127744, 'steps': 73581, 'loss/train': 1.6270387172698975} 11/07/2021 07:33:24 - INFO - __main__ - Step 73583: {'lr': 0.0002628260745053493, 'samples': 14127936, 'steps': 73582, 'loss/train': 1.4103947877883911} 11/07/2021 07:33:25 - INFO - __main__ - Step 73584: {'lr': 0.0002628207747467377, 'samples': 14128128, 'steps': 73583, 'loss/train': 1.0038607120513916} 11/07/2021 07:33:26 - INFO - __main__ - Step 73585: {'lr': 0.0002628154749823493, 'samples': 14128320, 'steps': 73584, 'loss/train': 1.4659535884857178} 11/07/2021 07:33:26 - INFO - __main__ - Step 73586: {'lr': 0.00026281017521218643, 'samples': 14128512, 'steps': 73585, 'loss/train': 1.262128472328186} 11/07/2021 07:33:26 - INFO - __main__ - Step 73587: {'lr': 0.0002628048754362515, 'samples': 14128704, 'steps': 73586, 'loss/train': 1.3535209894180298} 11/07/2021 07:33:27 - INFO - __main__ - Step 73588: {'lr': 0.0002627995756545468, 'samples': 14128896, 'steps': 73587, 'loss/train': 1.1263231039047241} 11/07/2021 07:33:28 - INFO - __main__ - Step 73589: {'lr': 0.0002627942758670749, 'samples': 14129088, 'steps': 73588, 'loss/train': 1.4516197443008423} 11/07/2021 07:33:28 - INFO - __main__ - Step 73590: {'lr': 0.00026278897607383804, 'samples': 14129280, 'steps': 73589, 'loss/train': 1.2309329509735107} 11/07/2021 07:33:29 - INFO - __main__ - Step 73591: {'lr': 0.00026278367627483875, 'samples': 14129472, 'steps': 73590, 'loss/train': 1.1811481714248657} 11/07/2021 07:33:29 - INFO - __main__ - Step 73592: {'lr': 0.0002627783764700793, 'samples': 14129664, 'steps': 73591, 'loss/train': 1.256697654724121} 11/07/2021 07:33:29 - INFO - __main__ - Step 73593: {'lr': 0.00026277307665956205, 'samples': 14129856, 'steps': 73592, 'loss/train': 1.1436325311660767} 11/07/2021 07:33:30 - INFO - __main__ - Step 73594: {'lr': 0.0002627677768432896, 'samples': 14130048, 'steps': 73593, 'loss/train': 1.5051316022872925} 11/07/2021 07:33:31 - INFO - __main__ - Step 73595: {'lr': 0.000262762477021264, 'samples': 14130240, 'steps': 73594, 'loss/train': 1.3451640605926514} 11/07/2021 07:33:31 - INFO - __main__ - Step 73596: {'lr': 0.00026275717719348793, 'samples': 14130432, 'steps': 73595, 'loss/train': 1.304919719696045} 11/07/2021 07:33:31 - INFO - __main__ - Step 73597: {'lr': 0.00026275187735996363, 'samples': 14130624, 'steps': 73596, 'loss/train': 1.3150047063827515} 11/07/2021 07:33:32 - INFO - __main__ - Step 73598: {'lr': 0.0002627465775206936, 'samples': 14130816, 'steps': 73597, 'loss/train': 1.2585904598236084} 11/07/2021 07:33:33 - INFO - __main__ - Step 73599: {'lr': 0.00026274127767568007, 'samples': 14131008, 'steps': 73598, 'loss/train': 0.9263672232627869} 11/07/2021 07:33:33 - INFO - __main__ - Step 73600: {'lr': 0.0002627359778249255, 'samples': 14131200, 'steps': 73599, 'loss/train': 1.4720467329025269} 11/07/2021 07:33:33 - INFO - __main__ - Step 73601: {'lr': 0.0002627306779684324, 'samples': 14131392, 'steps': 73600, 'loss/train': 1.539931058883667} 11/07/2021 07:33:34 - INFO - __main__ - Step 73602: {'lr': 0.000262725378106203, 'samples': 14131584, 'steps': 73601, 'loss/train': 1.4592130184173584} 11/07/2021 07:33:34 - INFO - __main__ - Step 73603: {'lr': 0.00026272007823823976, 'samples': 14131776, 'steps': 73602, 'loss/train': 1.592402458190918} 11/07/2021 07:33:35 - INFO - __main__ - Step 73604: {'lr': 0.000262714778364545, 'samples': 14131968, 'steps': 73603, 'loss/train': 1.6249898672103882} 11/07/2021 07:33:36 - INFO - __main__ - Step 73605: {'lr': 0.00026270947848512123, 'samples': 14132160, 'steps': 73604, 'loss/train': 1.0117281675338745} 11/07/2021 07:33:36 - INFO - __main__ - Step 73606: {'lr': 0.0002627041785999707, 'samples': 14132352, 'steps': 73605, 'loss/train': 1.3981223106384277} 11/07/2021 07:33:36 - INFO - __main__ - Step 73607: {'lr': 0.00026269887870909595, 'samples': 14132544, 'steps': 73606, 'loss/train': 1.4311251640319824} 11/07/2021 07:33:37 - INFO - __main__ - Step 73608: {'lr': 0.00026269357881249916, 'samples': 14132736, 'steps': 73607, 'loss/train': 1.3374624252319336} 11/07/2021 07:33:37 - INFO - __main__ - Step 73609: {'lr': 0.0002626882789101829, 'samples': 14132928, 'steps': 73608, 'loss/train': 1.2387675046920776} 11/07/2021 07:33:38 - INFO - __main__ - Step 73610: {'lr': 0.0002626829790021495, 'samples': 14133120, 'steps': 73609, 'loss/train': 1.458367109298706} 11/07/2021 07:33:38 - INFO - __main__ - Step 73611: {'lr': 0.0002626776790884013, 'samples': 14133312, 'steps': 73610, 'loss/train': 1.451232671737671} 11/07/2021 07:33:39 - INFO - __main__ - Step 73612: {'lr': 0.0002626723791689408, 'samples': 14133504, 'steps': 73611, 'loss/train': 1.3305284976959229} 11/07/2021 07:33:39 - INFO - __main__ - Step 73613: {'lr': 0.0002626670792437703, 'samples': 14133696, 'steps': 73612, 'loss/train': 1.6977972984313965} 11/07/2021 07:33:39 - INFO - __main__ - Step 73614: {'lr': 0.0002626617793128922, 'samples': 14133888, 'steps': 73613, 'loss/train': 1.0062671899795532} 11/07/2021 07:33:41 - INFO - __main__ - Step 73615: {'lr': 0.00026265647937630894, 'samples': 14134080, 'steps': 73614, 'loss/train': 1.577463150024414} 11/07/2021 07:33:41 - INFO - __main__ - Step 73616: {'lr': 0.0002626511794340228, 'samples': 14134272, 'steps': 73615, 'loss/train': 1.5749123096466064} 11/07/2021 07:33:41 - INFO - __main__ - Step 73617: {'lr': 0.00026264587948603623, 'samples': 14134464, 'steps': 73616, 'loss/train': 1.5029549598693848} 11/07/2021 07:33:42 - INFO - __main__ - Step 73618: {'lr': 0.0002626405795323517, 'samples': 14134656, 'steps': 73617, 'loss/train': 1.3881982564926147} 11/07/2021 07:33:42 - INFO - __main__ - Step 73619: {'lr': 0.0002626352795729715, 'samples': 14134848, 'steps': 73618, 'loss/train': 1.5959465503692627} 11/07/2021 07:33:43 - INFO - __main__ - Step 73620: {'lr': 0.00026262997960789796, 'samples': 14135040, 'steps': 73619, 'loss/train': 1.3370717763900757} 11/07/2021 07:33:43 - INFO - __main__ - Step 73621: {'lr': 0.0002626246796371336, 'samples': 14135232, 'steps': 73620, 'loss/train': 1.2162421941757202} 11/07/2021 07:33:44 - INFO - __main__ - Step 73622: {'lr': 0.0002626193796606808, 'samples': 14135424, 'steps': 73621, 'loss/train': 1.4028996229171753} 11/07/2021 07:33:44 - INFO - __main__ - Step 73623: {'lr': 0.00026261407967854186, 'samples': 14135616, 'steps': 73622, 'loss/train': 1.309665560722351} 11/07/2021 07:33:44 - INFO - __main__ - Step 73624: {'lr': 0.00026260877969071916, 'samples': 14135808, 'steps': 73623, 'loss/train': 1.364871621131897} 11/07/2021 07:33:45 - INFO - __main__ - Step 73625: {'lr': 0.0002626034796972152, 'samples': 14136000, 'steps': 73624, 'loss/train': 1.5845026969909668} 11/07/2021 07:33:46 - INFO - __main__ - Step 73626: {'lr': 0.0002625981796980323, 'samples': 14136192, 'steps': 73625, 'loss/train': 1.6463907957077026} 11/07/2021 07:33:46 - INFO - __main__ - Step 73627: {'lr': 0.0002625928796931729, 'samples': 14136384, 'steps': 73626, 'loss/train': 1.2850722074508667} 11/07/2021 07:33:46 - INFO - __main__ - Step 73628: {'lr': 0.00026258757968263924, 'samples': 14136576, 'steps': 73627, 'loss/train': 1.132869005203247} 11/07/2021 07:33:47 - INFO - __main__ - Step 73629: {'lr': 0.0002625822796664338, 'samples': 14136768, 'steps': 73628, 'loss/train': 1.2379719018936157} 11/07/2021 07:33:48 - INFO - __main__ - Step 73630: {'lr': 0.0002625769796445591, 'samples': 14136960, 'steps': 73629, 'loss/train': 1.6053926944732666} 11/07/2021 07:33:48 - INFO - __main__ - Step 73631: {'lr': 0.0002625716796170173, 'samples': 14137152, 'steps': 73630, 'loss/train': 1.6961445808410645} 11/07/2021 07:33:49 - INFO - __main__ - Step 73632: {'lr': 0.000262566379583811, 'samples': 14137344, 'steps': 73631, 'loss/train': 1.3057830333709717} 11/07/2021 07:33:49 - INFO - __main__ - Step 73633: {'lr': 0.0002625610795449424, 'samples': 14137536, 'steps': 73632, 'loss/train': 1.0885180234909058} 11/07/2021 07:33:49 - INFO - __main__ - Step 73634: {'lr': 0.00026255577950041396, 'samples': 14137728, 'steps': 73633, 'loss/train': 1.3195528984069824} 11/07/2021 07:33:50 - INFO - __main__ - Step 73635: {'lr': 0.0002625504794502281, 'samples': 14137920, 'steps': 73634, 'loss/train': 1.3345017433166504} 11/07/2021 07:33:51 - INFO - __main__ - Step 73636: {'lr': 0.0002625451793943872, 'samples': 14138112, 'steps': 73635, 'loss/train': 1.3238353729248047} 11/07/2021 07:33:51 - INFO - __main__ - Step 73637: {'lr': 0.00026253987933289366, 'samples': 14138304, 'steps': 73636, 'loss/train': 0.775397777557373} 11/07/2021 07:33:51 - INFO - __main__ - Step 73638: {'lr': 0.0002625345792657498, 'samples': 14138496, 'steps': 73637, 'loss/train': 1.1115612983703613} 11/07/2021 07:33:52 - INFO - __main__ - Step 73639: {'lr': 0.00026252927919295815, 'samples': 14138688, 'steps': 73638, 'loss/train': 1.511742353439331} 11/07/2021 07:33:52 - INFO - __main__ - Step 73640: {'lr': 0.0002625239791145209, 'samples': 14138880, 'steps': 73639, 'loss/train': 0.6912667155265808} 11/07/2021 07:33:53 - INFO - __main__ - Step 73641: {'lr': 0.0002625186790304406, 'samples': 14139072, 'steps': 73640, 'loss/train': 1.3967219591140747} 11/07/2021 07:33:53 - INFO - __main__ - Step 73642: {'lr': 0.0002625133789407195, 'samples': 14139264, 'steps': 73641, 'loss/train': 1.2262506484985352} 11/07/2021 07:33:54 - INFO - __main__ - Step 73643: {'lr': 0.0002625080788453601, 'samples': 14139456, 'steps': 73642, 'loss/train': 1.5547293424606323} 11/07/2021 07:33:54 - INFO - __main__ - Step 73644: {'lr': 0.00026250277874436474, 'samples': 14139648, 'steps': 73643, 'loss/train': 1.3192291259765625} 11/07/2021 07:33:54 - INFO - __main__ - Step 73645: {'lr': 0.0002624974786377359, 'samples': 14139840, 'steps': 73644, 'loss/train': 0.848395049571991} 11/07/2021 07:33:55 - INFO - __main__ - Step 73646: {'lr': 0.0002624921785254758, 'samples': 14140032, 'steps': 73645, 'loss/train': 1.4386732578277588} 11/07/2021 07:33:56 - INFO - __main__ - Step 73647: {'lr': 0.0002624868784075869, 'samples': 14140224, 'steps': 73646, 'loss/train': 1.3019533157348633} 11/07/2021 07:33:56 - INFO - __main__ - Step 73648: {'lr': 0.0002624815782840717, 'samples': 14140416, 'steps': 73647, 'loss/train': 1.394015908241272} 11/07/2021 07:33:56 - INFO - __main__ - Step 73649: {'lr': 0.0002624762781549324, 'samples': 14140608, 'steps': 73648, 'loss/train': 1.1649892330169678} 11/07/2021 07:33:57 - INFO - __main__ - Step 73650: {'lr': 0.0002624709780201716, 'samples': 14140800, 'steps': 73649, 'loss/train': 1.3390545845031738} 11/07/2021 07:33:58 - INFO - __main__ - Step 73651: {'lr': 0.00026246567787979145, 'samples': 14140992, 'steps': 73650, 'loss/train': 1.0978772640228271} 11/07/2021 07:33:58 - INFO - __main__ - Step 73652: {'lr': 0.0002624603777337945, 'samples': 14141184, 'steps': 73651, 'loss/train': 1.2924984693527222} 11/07/2021 07:33:59 - INFO - __main__ - Step 73653: {'lr': 0.00026245507758218306, 'samples': 14141376, 'steps': 73652, 'loss/train': 1.5079180002212524} 11/07/2021 07:33:59 - INFO - __main__ - Step 73654: {'lr': 0.00026244977742495963, 'samples': 14141568, 'steps': 73653, 'loss/train': 1.5112214088439941} 11/07/2021 07:33:59 - INFO - __main__ - Step 73655: {'lr': 0.0002624444772621265, 'samples': 14141760, 'steps': 73654, 'loss/train': 1.6741827726364136} 11/07/2021 07:34:00 - INFO - __main__ - Step 73656: {'lr': 0.000262439177093686, 'samples': 14141952, 'steps': 73655, 'loss/train': 1.3923205137252808} 11/07/2021 07:34:01 - INFO - __main__ - Step 73657: {'lr': 0.0002624338769196407, 'samples': 14142144, 'steps': 73656, 'loss/train': 0.9574289321899414} 11/07/2021 07:34:01 - INFO - __main__ - Step 73658: {'lr': 0.0002624285767399929, 'samples': 14142336, 'steps': 73657, 'loss/train': 1.4731358289718628} 11/07/2021 07:34:01 - INFO - __main__ - Step 73659: {'lr': 0.00026242327655474483, 'samples': 14142528, 'steps': 73658, 'loss/train': 1.1931030750274658} 11/07/2021 07:34:02 - INFO - __main__ - Step 73660: {'lr': 0.00026241797636389916, 'samples': 14142720, 'steps': 73659, 'loss/train': 0.8817092776298523} 11/07/2021 07:34:03 - INFO - __main__ - Step 73661: {'lr': 0.00026241267616745813, 'samples': 14142912, 'steps': 73660, 'loss/train': 1.184167504310608} 11/07/2021 07:34:03 - INFO - __main__ - Step 73662: {'lr': 0.0002624073759654241, 'samples': 14143104, 'steps': 73661, 'loss/train': 1.3827475309371948} 11/07/2021 07:34:03 - INFO - __main__ - Step 73663: {'lr': 0.0002624020757577995, 'samples': 14143296, 'steps': 73662, 'loss/train': 1.5361649990081787} 11/07/2021 07:34:04 - INFO - __main__ - Step 73664: {'lr': 0.00026239677554458675, 'samples': 14143488, 'steps': 73663, 'loss/train': 1.1623822450637817} 11/07/2021 07:34:04 - INFO - __main__ - Step 73665: {'lr': 0.0002623914753257881, 'samples': 14143680, 'steps': 73664, 'loss/train': 1.4173872470855713} 11/07/2021 07:34:05 - INFO - __main__ - Step 73666: {'lr': 0.0002623861751014062, 'samples': 14143872, 'steps': 73665, 'loss/train': 1.393357753753662} 11/07/2021 07:34:05 - INFO - __main__ - Step 73667: {'lr': 0.0002623808748714432, 'samples': 14144064, 'steps': 73666, 'loss/train': 1.5995054244995117} 11/07/2021 07:34:06 - INFO - __main__ - Step 73668: {'lr': 0.00026237557463590155, 'samples': 14144256, 'steps': 73667, 'loss/train': 1.7126530408859253} 11/07/2021 07:34:06 - INFO - __main__ - Step 73669: {'lr': 0.0002623702743947837, 'samples': 14144448, 'steps': 73668, 'loss/train': 1.5810638666152954} 11/07/2021 07:34:06 - INFO - __main__ - Step 73670: {'lr': 0.0002623649741480919, 'samples': 14144640, 'steps': 73669, 'loss/train': 1.5720192193984985} 11/07/2021 07:34:07 - INFO - __main__ - Step 73671: {'lr': 0.0002623596738958287, 'samples': 14144832, 'steps': 73670, 'loss/train': 1.5267375707626343} 11/07/2021 07:34:08 - INFO - __main__ - Step 73672: {'lr': 0.00026235437363799654, 'samples': 14145024, 'steps': 73671, 'loss/train': 1.512125849723816} 11/07/2021 07:34:08 - INFO - __main__ - Step 73673: {'lr': 0.0002623490733745975, 'samples': 14145216, 'steps': 73672, 'loss/train': 0.8808572888374329} 11/07/2021 07:34:09 - INFO - __main__ - Step 73674: {'lr': 0.00026234377310563426, 'samples': 14145408, 'steps': 73673, 'loss/train': 1.3819468021392822} 11/07/2021 07:34:09 - INFO - __main__ - Step 73675: {'lr': 0.00026233847283110905, 'samples': 14145600, 'steps': 73674, 'loss/train': 1.2786221504211426} 11/07/2021 07:34:09 - INFO - __main__ - Step 73676: {'lr': 0.00026233317255102437, 'samples': 14145792, 'steps': 73675, 'loss/train': 1.1346569061279297} 11/07/2021 07:34:10 - INFO - __main__ - Step 73677: {'lr': 0.0002623278722653825, 'samples': 14145984, 'steps': 73676, 'loss/train': 1.5319968461990356} 11/07/2021 07:34:11 - INFO - __main__ - Step 73678: {'lr': 0.0002623225719741859, 'samples': 14146176, 'steps': 73677, 'loss/train': 1.2837698459625244} 11/07/2021 07:34:11 - INFO - __main__ - Step 73679: {'lr': 0.00026231727167743703, 'samples': 14146368, 'steps': 73678, 'loss/train': 1.0321415662765503} 11/07/2021 07:34:11 - INFO - __main__ - Step 73680: {'lr': 0.0002623119713751381, 'samples': 14146560, 'steps': 73679, 'loss/train': 1.3405877351760864} 11/07/2021 07:34:12 - INFO - __main__ - Step 73681: {'lr': 0.00026230667106729154, 'samples': 14146752, 'steps': 73680, 'loss/train': 1.4034554958343506} 11/07/2021 07:34:13 - INFO - __main__ - Step 73682: {'lr': 0.0002623013707538998, 'samples': 14146944, 'steps': 73681, 'loss/train': 1.55930495262146} 11/07/2021 07:34:13 - INFO - __main__ - Step 73683: {'lr': 0.00026229607043496534, 'samples': 14147136, 'steps': 73682, 'loss/train': 1.3421696424484253} 11/07/2021 07:34:13 - INFO - __main__ - Step 73684: {'lr': 0.00026229077011049034, 'samples': 14147328, 'steps': 73683, 'loss/train': 1.8084489107131958} 11/07/2021 07:34:14 - INFO - __main__ - Step 73685: {'lr': 0.0002622854697804774, 'samples': 14147520, 'steps': 73684, 'loss/train': 1.3382980823516846} 11/07/2021 07:34:14 - INFO - __main__ - Step 73686: {'lr': 0.00026228016944492883, 'samples': 14147712, 'steps': 73685, 'loss/train': 1.1213520765304565} 11/07/2021 07:34:15 - INFO - __main__ - Step 73687: {'lr': 0.00026227486910384694, 'samples': 14147904, 'steps': 73686, 'loss/train': 1.9020755290985107} 11/07/2021 07:34:16 - INFO - __main__ - Step 73688: {'lr': 0.0002622695687572342, 'samples': 14148096, 'steps': 73687, 'loss/train': 1.549675464630127} 11/07/2021 07:34:16 - INFO - __main__ - Step 73689: {'lr': 0.00026226426840509303, 'samples': 14148288, 'steps': 73688, 'loss/train': 1.2727683782577515} 11/07/2021 07:34:16 - INFO - __main__ - Step 73690: {'lr': 0.0002622589680474257, 'samples': 14148480, 'steps': 73689, 'loss/train': 1.452463984489441} 11/07/2021 07:34:17 - INFO - __main__ - Step 73691: {'lr': 0.0002622536676842347, 'samples': 14148672, 'steps': 73690, 'loss/train': 1.2596635818481445} 11/07/2021 07:34:18 - INFO - __main__ - Step 73692: {'lr': 0.0002622483673155224, 'samples': 14148864, 'steps': 73691, 'loss/train': 0.7243367433547974} 11/07/2021 07:34:18 - INFO - __main__ - Step 73693: {'lr': 0.00026224306694129116, 'samples': 14149056, 'steps': 73692, 'loss/train': 1.5584230422973633} 11/07/2021 07:34:18 - INFO - __main__ - Step 73694: {'lr': 0.0002622377665615434, 'samples': 14149248, 'steps': 73693, 'loss/train': 1.4154860973358154} 11/07/2021 07:34:19 - INFO - __main__ - Step 73695: {'lr': 0.0002622324661762815, 'samples': 14149440, 'steps': 73694, 'loss/train': 1.9395414590835571} 11/07/2021 07:34:19 - INFO - __main__ - Step 73696: {'lr': 0.0002622271657855078, 'samples': 14149632, 'steps': 73695, 'loss/train': 1.9087048768997192} 11/07/2021 07:34:19 - INFO - __main__ - Step 73697: {'lr': 0.0002622218653892247, 'samples': 14149824, 'steps': 73696, 'loss/train': 1.398221492767334} 11/07/2021 07:34:20 - INFO - __main__ - Step 73698: {'lr': 0.00026221656498743467, 'samples': 14150016, 'steps': 73697, 'loss/train': 1.3356423377990723} 11/07/2021 07:34:21 - INFO - __main__ - Step 73699: {'lr': 0.0002622112645801401, 'samples': 14150208, 'steps': 73698, 'loss/train': 0.9820262789726257} 11/07/2021 07:34:21 - INFO - __main__ - Step 73700: {'lr': 0.0002622059641673432, 'samples': 14150400, 'steps': 73699, 'loss/train': 0.8013916611671448} 11/07/2021 07:34:21 - INFO - __main__ - Step 73701: {'lr': 0.00026220066374904653, 'samples': 14150592, 'steps': 73700, 'loss/train': 1.5119199752807617} 11/07/2021 07:34:22 - INFO - __main__ - Step 73702: {'lr': 0.00026219536332525243, 'samples': 14150784, 'steps': 73701, 'loss/train': 1.5008699893951416} 11/07/2021 07:34:23 - INFO - __main__ - Step 73703: {'lr': 0.0002621900628959633, 'samples': 14150976, 'steps': 73702, 'loss/train': 1.5064697265625} 11/07/2021 07:34:23 - INFO - __main__ - Step 73704: {'lr': 0.0002621847624611815, 'samples': 14151168, 'steps': 73703, 'loss/train': 1.5458019971847534} 11/07/2021 07:34:23 - INFO - __main__ - Step 73705: {'lr': 0.00026217946202090946, 'samples': 14151360, 'steps': 73704, 'loss/train': 1.3321342468261719} 11/07/2021 07:34:24 - INFO - __main__ - Step 73706: {'lr': 0.0002621741615751496, 'samples': 14151552, 'steps': 73705, 'loss/train': 0.9121658802032471} 11/07/2021 07:34:24 - INFO - __main__ - Step 73707: {'lr': 0.00026216886112390413, 'samples': 14151744, 'steps': 73706, 'loss/train': 1.1415151357650757} 11/07/2021 07:34:25 - INFO - __main__ - Step 73708: {'lr': 0.0002621635606671756, 'samples': 14151936, 'steps': 73707, 'loss/train': 1.60361909866333} 11/07/2021 07:34:26 - INFO - __main__ - Step 73709: {'lr': 0.00026215826020496637, 'samples': 14152128, 'steps': 73708, 'loss/train': 1.169234275817871} 11/07/2021 07:34:26 - INFO - __main__ - Step 73710: {'lr': 0.00026215295973727883, 'samples': 14152320, 'steps': 73709, 'loss/train': 1.6786580085754395} 11/07/2021 07:34:26 - INFO - __main__ - Step 73711: {'lr': 0.00026214765926411526, 'samples': 14152512, 'steps': 73710, 'loss/train': 1.5472173690795898} 11/07/2021 07:34:27 - INFO - __main__ - Step 73712: {'lr': 0.00026214235878547825, 'samples': 14152704, 'steps': 73711, 'loss/train': 1.264760971069336} 11/07/2021 07:34:27 - INFO - __main__ - Step 73713: {'lr': 0.0002621370583013701, 'samples': 14152896, 'steps': 73712, 'loss/train': 1.4495534896850586} 11/07/2021 07:34:28 - INFO - __main__ - Step 73714: {'lr': 0.0002621317578117931, 'samples': 14153088, 'steps': 73713, 'loss/train': 0.900741457939148} 11/07/2021 07:34:28 - INFO - __main__ - Step 73715: {'lr': 0.00026212645731674974, 'samples': 14153280, 'steps': 73714, 'loss/train': 1.393385887145996} 11/07/2021 07:34:29 - INFO - __main__ - Step 73716: {'lr': 0.00026212115681624237, 'samples': 14153472, 'steps': 73715, 'loss/train': 1.1877429485321045} 11/07/2021 07:34:29 - INFO - __main__ - Step 73717: {'lr': 0.0002621158563102734, 'samples': 14153664, 'steps': 73716, 'loss/train': 1.484875202178955} 11/07/2021 07:34:30 - INFO - __main__ - Step 73718: {'lr': 0.00026211055579884523, 'samples': 14153856, 'steps': 73717, 'loss/train': 0.06729325652122498} 11/07/2021 07:34:31 - INFO - __main__ - Step 73719: {'lr': 0.0002621052552819603, 'samples': 14154048, 'steps': 73718, 'loss/train': 1.2274166345596313} 11/07/2021 07:34:31 - INFO - __main__ - Step 73720: {'lr': 0.00026209995475962077, 'samples': 14154240, 'steps': 73719, 'loss/train': 1.2965532541275024} 11/07/2021 07:34:31 - INFO - __main__ - Step 73721: {'lr': 0.00026209465423182934, 'samples': 14154432, 'steps': 73720, 'loss/train': 1.332154393196106} 11/07/2021 07:34:32 - INFO - __main__ - Step 73722: {'lr': 0.0002620893536985881, 'samples': 14154624, 'steps': 73721, 'loss/train': 1.6400153636932373} 11/07/2021 07:34:32 - INFO - __main__ - Step 73723: {'lr': 0.0002620840531598997, 'samples': 14154816, 'steps': 73722, 'loss/train': 1.4918123483657837} 11/07/2021 07:34:33 - INFO - __main__ - Step 73724: {'lr': 0.0002620787526157664, 'samples': 14155008, 'steps': 73723, 'loss/train': 1.5434646606445312} 11/07/2021 07:34:33 - INFO - __main__ - Step 73725: {'lr': 0.0002620734520661905, 'samples': 14155200, 'steps': 73724, 'loss/train': 1.3264973163604736} 11/07/2021 07:34:34 - INFO - __main__ - Step 73726: {'lr': 0.0002620681515111746, 'samples': 14155392, 'steps': 73725, 'loss/train': 1.5738415718078613} 11/07/2021 07:34:34 - INFO - __main__ - Step 73727: {'lr': 0.0002620628509507209, 'samples': 14155584, 'steps': 73726, 'loss/train': 0.9793825149536133} 11/07/2021 07:34:34 - INFO - __main__ - Step 73728: {'lr': 0.0002620575503848319, 'samples': 14155776, 'steps': 73727, 'loss/train': 1.4214247465133667} 11/07/2021 07:34:35 - INFO - __main__ - Step 73729: {'lr': 0.00026205224981350997, 'samples': 14155968, 'steps': 73728, 'loss/train': 1.5106761455535889} 11/07/2021 07:34:36 - INFO - __main__ - Step 73730: {'lr': 0.0002620469492367575, 'samples': 14156160, 'steps': 73729, 'loss/train': 1.3081717491149902} 11/07/2021 07:34:36 - INFO - __main__ - Step 73731: {'lr': 0.0002620416486545768, 'samples': 14156352, 'steps': 73730, 'loss/train': 1.6156162023544312} 11/07/2021 07:34:36 - INFO - __main__ - Step 73732: {'lr': 0.0002620363480669703, 'samples': 14156544, 'steps': 73731, 'loss/train': 1.3623043298721313} 11/07/2021 07:34:37 - INFO - __main__ - Step 73733: {'lr': 0.0002620310474739405, 'samples': 14156736, 'steps': 73732, 'loss/train': 0.9232141375541687} 11/07/2021 07:34:38 - INFO - __main__ - Step 73734: {'lr': 0.0002620257468754897, 'samples': 14156928, 'steps': 73733, 'loss/train': 1.099671721458435} 11/07/2021 07:34:38 - INFO - __main__ - Step 73735: {'lr': 0.0002620204462716202, 'samples': 14157120, 'steps': 73734, 'loss/train': 1.3243945837020874} 11/07/2021 07:34:38 - INFO - __main__ - Step 73736: {'lr': 0.00026201514566233445, 'samples': 14157312, 'steps': 73735, 'loss/train': 1.2921903133392334} 11/07/2021 07:34:39 - INFO - __main__ - Step 73737: {'lr': 0.00026200984504763495, 'samples': 14157504, 'steps': 73736, 'loss/train': 1.6894798278808594} 11/07/2021 07:34:39 - INFO - __main__ - Step 73738: {'lr': 0.000262004544427524, 'samples': 14157696, 'steps': 73737, 'loss/train': 1.3043813705444336} 11/07/2021 07:34:40 - INFO - __main__ - Step 73739: {'lr': 0.000261999243802004, 'samples': 14157888, 'steps': 73738, 'loss/train': 1.4763813018798828} 11/07/2021 07:34:41 - INFO - __main__ - Step 73740: {'lr': 0.00026199394317107723, 'samples': 14158080, 'steps': 73739, 'loss/train': 1.4028918743133545} 11/07/2021 07:34:41 - INFO - __main__ - Step 73741: {'lr': 0.0002619886425347462, 'samples': 14158272, 'steps': 73740, 'loss/train': 1.8728750944137573} 11/07/2021 07:34:41 - INFO - __main__ - Step 73742: {'lr': 0.00026198334189301333, 'samples': 14158464, 'steps': 73741, 'loss/train': 1.228582739830017} 11/07/2021 07:34:42 - INFO - __main__ - Step 73743: {'lr': 0.00026197804124588085, 'samples': 14158656, 'steps': 73742, 'loss/train': 1.5161250829696655} 11/07/2021 07:34:42 - INFO - __main__ - Step 73744: {'lr': 0.00026197274059335137, 'samples': 14158848, 'steps': 73743, 'loss/train': 1.3736807107925415} 11/07/2021 07:34:43 - INFO - __main__ - Step 73745: {'lr': 0.0002619674399354271, 'samples': 14159040, 'steps': 73744, 'loss/train': 0.7418132424354553} 11/07/2021 07:34:43 - INFO - __main__ - Step 73746: {'lr': 0.0002619621392721105, 'samples': 14159232, 'steps': 73745, 'loss/train': 0.5784057974815369} 11/07/2021 07:34:44 - INFO - __main__ - Step 73747: {'lr': 0.000261956838603404, 'samples': 14159424, 'steps': 73746, 'loss/train': 2.119431257247925} 11/07/2021 07:34:44 - INFO - __main__ - Step 73748: {'lr': 0.00026195153792930983, 'samples': 14159616, 'steps': 73747, 'loss/train': 1.63288414478302} 11/07/2021 07:34:45 - INFO - __main__ - Step 73749: {'lr': 0.0002619462372498305, 'samples': 14159808, 'steps': 73748, 'loss/train': 1.3972678184509277} 11/07/2021 07:34:45 - INFO - __main__ - Step 73750: {'lr': 0.0002619409365649684, 'samples': 14160000, 'steps': 73749, 'loss/train': 1.64277982711792} 11/07/2021 07:34:46 - INFO - __main__ - Step 73751: {'lr': 0.0002619356358747259, 'samples': 14160192, 'steps': 73750, 'loss/train': 1.2439706325531006} 11/07/2021 07:34:46 - INFO - __main__ - Step 73752: {'lr': 0.00026193033517910534, 'samples': 14160384, 'steps': 73751, 'loss/train': 1.417138695716858} 11/07/2021 07:34:46 - INFO - __main__ - Step 73753: {'lr': 0.00026192503447810926, 'samples': 14160576, 'steps': 73752, 'loss/train': 1.0069363117218018} 11/07/2021 07:34:47 - INFO - __main__ - Step 73754: {'lr': 0.00026191973377173987, 'samples': 14160768, 'steps': 73753, 'loss/train': 1.3320244550704956} 11/07/2021 07:34:48 - INFO - __main__ - Step 73755: {'lr': 0.0002619144330599996, 'samples': 14160960, 'steps': 73754, 'loss/train': 1.2210882902145386} 11/07/2021 07:34:48 - INFO - __main__ - Step 73756: {'lr': 0.00026190913234289093, 'samples': 14161152, 'steps': 73755, 'loss/train': 1.6110265254974365} 11/07/2021 07:34:49 - INFO - __main__ - Step 73757: {'lr': 0.0002619038316204162, 'samples': 14161344, 'steps': 73756, 'loss/train': 0.5824894905090332} 11/07/2021 07:34:49 - INFO - __main__ - Step 73758: {'lr': 0.0002618985308925778, 'samples': 14161536, 'steps': 73757, 'loss/train': 0.8449156284332275} 11/07/2021 07:34:50 - INFO - __main__ - Step 73759: {'lr': 0.000261893230159378, 'samples': 14161728, 'steps': 73758, 'loss/train': 0.630771815776825} 11/07/2021 07:34:50 - INFO - __main__ - Step 73760: {'lr': 0.0002618879294208194, 'samples': 14161920, 'steps': 73759, 'loss/train': 1.188537836074829} 11/07/2021 07:34:51 - INFO - __main__ - Step 73761: {'lr': 0.0002618826286769043, 'samples': 14162112, 'steps': 73760, 'loss/train': 1.2873560190200806} 11/07/2021 07:34:51 - INFO - __main__ - Step 73762: {'lr': 0.00026187732792763496, 'samples': 14162304, 'steps': 73761, 'loss/train': 1.4773072004318237} 11/07/2021 07:34:52 - INFO - __main__ - Step 73763: {'lr': 0.00026187202717301396, 'samples': 14162496, 'steps': 73762, 'loss/train': 1.8522205352783203} 11/07/2021 07:34:52 - INFO - __main__ - Step 73764: {'lr': 0.0002618667264130435, 'samples': 14162688, 'steps': 73763, 'loss/train': 0.31146085262298584} 11/07/2021 07:34:53 - INFO - __main__ - Step 73765: {'lr': 0.0002618614256477262, 'samples': 14162880, 'steps': 73764, 'loss/train': 1.6016685962677002} 11/07/2021 07:34:53 - INFO - __main__ - Step 73766: {'lr': 0.00026185612487706435, 'samples': 14163072, 'steps': 73765, 'loss/train': 1.3282114267349243} 11/07/2021 07:34:54 - INFO - __main__ - Step 73767: {'lr': 0.00026185082410106023, 'samples': 14163264, 'steps': 73766, 'loss/train': 1.547921895980835} 11/07/2021 07:34:54 - INFO - __main__ - Step 73768: {'lr': 0.0002618455233197163, 'samples': 14163456, 'steps': 73767, 'loss/train': 1.2786470651626587} 11/07/2021 07:34:54 - INFO - __main__ - Step 73769: {'lr': 0.00026184022253303497, 'samples': 14163648, 'steps': 73768, 'loss/train': 1.6941473484039307} 11/07/2021 07:34:55 - INFO - __main__ - Step 73770: {'lr': 0.00026183492174101865, 'samples': 14163840, 'steps': 73769, 'loss/train': 1.8479677438735962} 11/07/2021 07:34:56 - INFO - __main__ - Step 73771: {'lr': 0.0002618296209436697, 'samples': 14164032, 'steps': 73770, 'loss/train': 1.6985241174697876} 11/07/2021 07:34:56 - INFO - __main__ - Step 73772: {'lr': 0.00026182432014099045, 'samples': 14164224, 'steps': 73771, 'loss/train': 1.1780539751052856} 11/07/2021 07:34:56 - INFO - __main__ - Step 73773: {'lr': 0.0002618190193329834, 'samples': 14164416, 'steps': 73772, 'loss/train': 1.0450114011764526} 11/07/2021 07:34:57 - INFO - __main__ - Step 73774: {'lr': 0.0002618137185196509, 'samples': 14164608, 'steps': 73773, 'loss/train': 1.004625678062439} 11/07/2021 07:34:58 - INFO - __main__ - Step 73775: {'lr': 0.0002618084177009953, 'samples': 14164800, 'steps': 73774, 'loss/train': 0.6613650918006897} 11/07/2021 07:34:58 - INFO - __main__ - Step 73776: {'lr': 0.000261803116877019, 'samples': 14164992, 'steps': 73775, 'loss/train': 1.6508718729019165} 11/07/2021 07:34:58 - INFO - __main__ - Step 73777: {'lr': 0.0002617978160477243, 'samples': 14165184, 'steps': 73776, 'loss/train': 1.5685689449310303} 11/07/2021 07:34:59 - INFO - __main__ - Step 73778: {'lr': 0.0002617925152131138, 'samples': 14165376, 'steps': 73777, 'loss/train': 1.5058561563491821} 11/07/2021 07:34:59 - INFO - __main__ - Step 73779: {'lr': 0.0002617872143731898, 'samples': 14165568, 'steps': 73778, 'loss/train': 1.6595319509506226} 11/07/2021 07:35:00 - INFO - __main__ - Step 73780: {'lr': 0.0002617819135279546, 'samples': 14165760, 'steps': 73779, 'loss/train': 1.326180338859558} 11/07/2021 07:35:00 - INFO - __main__ - Step 73781: {'lr': 0.00026177661267741067, 'samples': 14165952, 'steps': 73780, 'loss/train': 0.9883223176002502} 11/07/2021 07:35:01 - INFO - __main__ - Step 73782: {'lr': 0.0002617713118215604, 'samples': 14166144, 'steps': 73781, 'loss/train': 1.9145374298095703} 11/07/2021 07:35:01 - INFO - __main__ - Step 73783: {'lr': 0.0002617660109604061, 'samples': 14166336, 'steps': 73782, 'loss/train': 0.8001314997673035} 11/07/2021 07:35:02 - INFO - __main__ - Step 73784: {'lr': 0.0002617607100939503, 'samples': 14166528, 'steps': 73783, 'loss/train': 1.5397807359695435} 11/07/2021 07:35:02 - INFO - __main__ - Step 73785: {'lr': 0.00026175540922219526, 'samples': 14166720, 'steps': 73784, 'loss/train': 1.2419830560684204} 11/07/2021 07:35:03 - INFO - __main__ - Step 73786: {'lr': 0.0002617501083451434, 'samples': 14166912, 'steps': 73785, 'loss/train': 1.3952889442443848} 11/07/2021 07:35:03 - INFO - __main__ - Step 73787: {'lr': 0.0002617448074627971, 'samples': 14167104, 'steps': 73786, 'loss/train': 1.684261441230774} 11/07/2021 07:35:04 - INFO - __main__ - Step 73788: {'lr': 0.0002617395065751588, 'samples': 14167296, 'steps': 73787, 'loss/train': 1.7052569389343262} 11/07/2021 07:35:04 - INFO - __main__ - Step 73789: {'lr': 0.00026173420568223086, 'samples': 14167488, 'steps': 73788, 'loss/train': 1.3427939414978027} 11/07/2021 07:35:05 - INFO - __main__ - Step 73790: {'lr': 0.00026172890478401575, 'samples': 14167680, 'steps': 73789, 'loss/train': 1.6053508520126343} 11/07/2021 07:35:05 - INFO - __main__ - Step 73791: {'lr': 0.0002617236038805157, 'samples': 14167872, 'steps': 73790, 'loss/train': 1.200613021850586} 11/07/2021 07:35:06 - INFO - __main__ - Step 73792: {'lr': 0.0002617183029717332, 'samples': 14168064, 'steps': 73791, 'loss/train': 1.7233703136444092} 11/07/2021 07:35:06 - INFO - __main__ - Step 73793: {'lr': 0.0002617130020576705, 'samples': 14168256, 'steps': 73792, 'loss/train': 1.1765732765197754} 11/07/2021 07:35:06 - INFO - __main__ - Step 73794: {'lr': 0.0002617077011383302, 'samples': 14168448, 'steps': 73793, 'loss/train': 1.3174383640289307} 11/07/2021 07:35:07 - INFO - __main__ - Step 73795: {'lr': 0.00026170240021371465, 'samples': 14168640, 'steps': 73794, 'loss/train': 1.2327593564987183} 11/07/2021 07:35:08 - INFO - __main__ - Step 73796: {'lr': 0.00026169709928382614, 'samples': 14168832, 'steps': 73795, 'loss/train': 1.657081961631775} 11/07/2021 07:35:08 - INFO - __main__ - Step 73797: {'lr': 0.000261691798348667, 'samples': 14169024, 'steps': 73796, 'loss/train': 2.0859858989715576} 11/07/2021 07:35:08 - INFO - __main__ - Step 73798: {'lr': 0.0002616864974082398, 'samples': 14169216, 'steps': 73797, 'loss/train': 1.1742597818374634} 11/07/2021 07:35:09 - INFO - __main__ - Step 73799: {'lr': 0.0002616811964625468, 'samples': 14169408, 'steps': 73798, 'loss/train': 0.9659584760665894} 11/07/2021 07:35:09 - INFO - __main__ - Step 73800: {'lr': 0.0002616758955115905, 'samples': 14169600, 'steps': 73799, 'loss/train': 1.4427169561386108} 11/07/2021 07:35:10 - INFO - __main__ - Step 73801: {'lr': 0.0002616705945553732, 'samples': 14169792, 'steps': 73800, 'loss/train': 1.5546900033950806} 11/07/2021 07:35:10 - INFO - __main__ - Step 73802: {'lr': 0.0002616652935938973, 'samples': 14169984, 'steps': 73801, 'loss/train': 1.5137083530426025} 11/07/2021 07:35:11 - INFO - __main__ - Step 73803: {'lr': 0.00026165999262716517, 'samples': 14170176, 'steps': 73802, 'loss/train': 1.3048580884933472} 11/07/2021 07:35:11 - INFO - __main__ - Step 73804: {'lr': 0.00026165469165517926, 'samples': 14170368, 'steps': 73803, 'loss/train': 1.5959454774856567} 11/07/2021 07:35:11 - INFO - __main__ - Step 73805: {'lr': 0.0002616493906779419, 'samples': 14170560, 'steps': 73804, 'loss/train': 1.2579923868179321} 11/07/2021 07:35:13 - INFO - __main__ - Step 73806: {'lr': 0.0002616440896954555, 'samples': 14170752, 'steps': 73805, 'loss/train': 1.7035069465637207} 11/07/2021 07:35:13 - INFO - __main__ - Step 73807: {'lr': 0.0002616387887077225, 'samples': 14170944, 'steps': 73806, 'loss/train': 1.5765997171401978} 11/07/2021 07:35:13 - INFO - __main__ - Step 73808: {'lr': 0.0002616334877147452, 'samples': 14171136, 'steps': 73807, 'loss/train': 0.8728405833244324} 11/07/2021 07:35:14 - INFO - __main__ - Step 73809: {'lr': 0.00026162818671652605, 'samples': 14171328, 'steps': 73808, 'loss/train': 0.971123218536377} 11/07/2021 07:35:14 - INFO - __main__ - Step 73810: {'lr': 0.00026162288571306743, 'samples': 14171520, 'steps': 73809, 'loss/train': 1.4021061658859253} 11/07/2021 07:35:15 - INFO - __main__ - Step 73811: {'lr': 0.0002616175847043717, 'samples': 14171712, 'steps': 73810, 'loss/train': 1.937752366065979} 11/07/2021 07:35:15 - INFO - __main__ - Step 73812: {'lr': 0.0002616122836904412, 'samples': 14171904, 'steps': 73811, 'loss/train': 1.168734073638916} 11/07/2021 07:35:16 - INFO - __main__ - Step 73813: {'lr': 0.00026160698267127855, 'samples': 14172096, 'steps': 73812, 'loss/train': 1.2931944131851196} 11/07/2021 07:35:16 - INFO - __main__ - Step 73814: {'lr': 0.00026160168164688583, 'samples': 14172288, 'steps': 73813, 'loss/train': 1.1605778932571411} 11/07/2021 07:35:16 - INFO - __main__ - Step 73815: {'lr': 0.0002615963806172656, 'samples': 14172480, 'steps': 73814, 'loss/train': 1.4079277515411377} 11/07/2021 07:35:17 - INFO - __main__ - Step 73816: {'lr': 0.0002615910795824202, 'samples': 14172672, 'steps': 73815, 'loss/train': 1.5101364850997925} 11/07/2021 07:35:18 - INFO - __main__ - Step 73817: {'lr': 0.0002615857785423521, 'samples': 14172864, 'steps': 73816, 'loss/train': 1.3097178936004639} 11/07/2021 07:35:18 - INFO - __main__ - Step 73818: {'lr': 0.0002615804774970636, 'samples': 14173056, 'steps': 73817, 'loss/train': 1.0923752784729004} 11/07/2021 07:35:18 - INFO - __main__ - Step 73819: {'lr': 0.0002615751764465571, 'samples': 14173248, 'steps': 73818, 'loss/train': 1.038710117340088} 11/07/2021 07:35:19 - INFO - __main__ - Step 73820: {'lr': 0.00026156987539083503, 'samples': 14173440, 'steps': 73819, 'loss/train': 1.5206000804901123} 11/07/2021 07:35:19 - INFO - __main__ - Step 73821: {'lr': 0.00026156457432989976, 'samples': 14173632, 'steps': 73820, 'loss/train': 1.5431010723114014} 11/07/2021 07:35:21 - INFO - __main__ - Step 73822: {'lr': 0.00026155927326375366, 'samples': 14173824, 'steps': 73821, 'loss/train': 1.129918098449707} 11/07/2021 07:35:21 - INFO - __main__ - Step 73823: {'lr': 0.0002615539721923991, 'samples': 14174016, 'steps': 73822, 'loss/train': 0.11207626760005951} 11/07/2021 07:35:21 - INFO - __main__ - Step 73824: {'lr': 0.00026154867111583853, 'samples': 14174208, 'steps': 73823, 'loss/train': 1.4611380100250244} 11/07/2021 07:35:22 - INFO - __main__ - Step 73825: {'lr': 0.0002615433700340743, 'samples': 14174400, 'steps': 73824, 'loss/train': 2.053687810897827} 11/07/2021 07:35:22 - INFO - __main__ - Step 73826: {'lr': 0.0002615380689471088, 'samples': 14174592, 'steps': 73825, 'loss/train': 0.18444734811782837} 11/07/2021 07:35:23 - INFO - __main__ - Step 73827: {'lr': 0.0002615327678549445, 'samples': 14174784, 'steps': 73826, 'loss/train': 0.36615657806396484} 11/07/2021 07:35:23 - INFO - __main__ - Step 73828: {'lr': 0.0002615274667575836, 'samples': 14174976, 'steps': 73827, 'loss/train': 1.344868779182434} 11/07/2021 07:35:24 - INFO - __main__ - Step 73829: {'lr': 0.00026152216565502863, 'samples': 14175168, 'steps': 73828, 'loss/train': 1.0471415519714355} 11/07/2021 07:35:24 - INFO - __main__ - Step 73830: {'lr': 0.00026151686454728196, 'samples': 14175360, 'steps': 73829, 'loss/train': 1.3680057525634766} 11/07/2021 07:35:24 - INFO - __main__ - Step 73831: {'lr': 0.00026151156343434597, 'samples': 14175552, 'steps': 73830, 'loss/train': 0.9119206070899963} 11/07/2021 07:35:25 - INFO - __main__ - Step 73832: {'lr': 0.000261506262316223, 'samples': 14175744, 'steps': 73831, 'loss/train': 1.3911038637161255} 11/07/2021 07:35:26 - INFO - __main__ - Step 73833: {'lr': 0.00026150096119291553, 'samples': 14175936, 'steps': 73832, 'loss/train': 1.3613322973251343} 11/07/2021 07:35:26 - INFO - __main__ - Step 73834: {'lr': 0.00026149566006442596, 'samples': 14176128, 'steps': 73833, 'loss/train': 1.4443755149841309} 11/07/2021 07:35:26 - INFO - __main__ - Step 73835: {'lr': 0.00026149035893075655, 'samples': 14176320, 'steps': 73834, 'loss/train': 1.6454734802246094} 11/07/2021 07:35:27 - INFO - __main__ - Step 73836: {'lr': 0.00026148505779190976, 'samples': 14176512, 'steps': 73835, 'loss/train': 1.343958854675293} 11/07/2021 07:35:28 - INFO - __main__ - Step 73837: {'lr': 0.000261479756647888, 'samples': 14176704, 'steps': 73836, 'loss/train': 0.8520376682281494} 11/07/2021 07:35:28 - INFO - __main__ - Step 73838: {'lr': 0.00026147445549869365, 'samples': 14176896, 'steps': 73837, 'loss/train': 1.1948286294937134} 11/07/2021 07:35:28 - INFO - __main__ - Step 73839: {'lr': 0.00026146915434432905, 'samples': 14177088, 'steps': 73838, 'loss/train': 1.3524543046951294} 11/07/2021 07:35:29 - INFO - __main__ - Step 73840: {'lr': 0.0002614638531847967, 'samples': 14177280, 'steps': 73839, 'loss/train': 1.0005398988723755} 11/07/2021 07:35:29 - INFO - __main__ - Step 73841: {'lr': 0.0002614585520200989, 'samples': 14177472, 'steps': 73840, 'loss/train': 1.5011956691741943} 11/07/2021 07:35:30 - INFO - __main__ - Step 73842: {'lr': 0.00026145325085023797, 'samples': 14177664, 'steps': 73841, 'loss/train': 1.2167248725891113} 11/07/2021 07:35:31 - INFO - __main__ - Step 73843: {'lr': 0.00026144794967521644, 'samples': 14177856, 'steps': 73842, 'loss/train': 0.5184603929519653} 11/07/2021 07:35:31 - INFO - __main__ - Step 73844: {'lr': 0.0002614426484950366, 'samples': 14178048, 'steps': 73843, 'loss/train': 1.0063061714172363} 11/07/2021 07:35:31 - INFO - __main__ - Step 73845: {'lr': 0.0002614373473097009, 'samples': 14178240, 'steps': 73844, 'loss/train': 1.6485103368759155} 11/07/2021 07:35:32 - INFO - __main__ - Step 73846: {'lr': 0.0002614320461192117, 'samples': 14178432, 'steps': 73845, 'loss/train': 1.04813814163208} 11/07/2021 07:35:32 - INFO - __main__ - Step 73847: {'lr': 0.0002614267449235715, 'samples': 14178624, 'steps': 73846, 'loss/train': 1.500369906425476} 11/07/2021 07:35:33 - INFO - __main__ - Step 73848: {'lr': 0.00026142144372278255, 'samples': 14178816, 'steps': 73847, 'loss/train': 1.4073277711868286} 11/07/2021 07:35:33 - INFO - __main__ - Step 73849: {'lr': 0.0002614161425168472, 'samples': 14179008, 'steps': 73848, 'loss/train': 1.6015865802764893} 11/07/2021 07:35:34 - INFO - __main__ - Step 73850: {'lr': 0.0002614108413057679, 'samples': 14179200, 'steps': 73849, 'loss/train': 2.41621470451355} 11/07/2021 07:35:34 - INFO - __main__ - Step 73851: {'lr': 0.00026140554008954707, 'samples': 14179392, 'steps': 73850, 'loss/train': 0.5372007489204407} 11/07/2021 07:35:34 - INFO - __main__ - Step 73852: {'lr': 0.00026140023886818707, 'samples': 14179584, 'steps': 73851, 'loss/train': 1.2210079431533813} 11/07/2021 07:35:35 - INFO - __main__ - Step 73853: {'lr': 0.0002613949376416904, 'samples': 14179776, 'steps': 73852, 'loss/train': 1.4738116264343262} 11/07/2021 07:35:36 - INFO - __main__ - Step 73854: {'lr': 0.0002613896364100593, 'samples': 14179968, 'steps': 73853, 'loss/train': 1.240185022354126} 11/07/2021 07:35:36 - INFO - __main__ - Step 73855: {'lr': 0.00026138433517329616, 'samples': 14180160, 'steps': 73854, 'loss/train': 1.9103463888168335} 11/07/2021 07:35:37 - INFO - __main__ - Step 73856: {'lr': 0.00026137903393140343, 'samples': 14180352, 'steps': 73855, 'loss/train': 1.228105902671814} 11/07/2021 07:35:37 - INFO - __main__ - Step 73857: {'lr': 0.00026137373268438345, 'samples': 14180544, 'steps': 73856, 'loss/train': 1.0119260549545288} 11/07/2021 07:35:38 - INFO - __main__ - Step 73858: {'lr': 0.0002613684314322387, 'samples': 14180736, 'steps': 73857, 'loss/train': 1.0713328123092651} 11/07/2021 07:35:38 - INFO - __main__ - Step 73859: {'lr': 0.00026136313017497147, 'samples': 14180928, 'steps': 73858, 'loss/train': 0.637360692024231} 11/07/2021 07:35:39 - INFO - __main__ - Step 73860: {'lr': 0.00026135782891258423, 'samples': 14181120, 'steps': 73859, 'loss/train': 1.5825657844543457} 11/07/2021 07:35:39 - INFO - __main__ - Step 73861: {'lr': 0.00026135252764507934, 'samples': 14181312, 'steps': 73860, 'loss/train': 1.5656408071517944} 11/07/2021 07:35:39 - INFO - __main__ - Step 73862: {'lr': 0.0002613472263724591, 'samples': 14181504, 'steps': 73861, 'loss/train': 1.0461448431015015} 11/07/2021 07:35:40 - INFO - __main__ - Step 73863: {'lr': 0.00026134192509472603, 'samples': 14181696, 'steps': 73862, 'loss/train': 1.4706366062164307} 11/07/2021 07:35:41 - INFO - __main__ - Step 73864: {'lr': 0.00026133662381188245, 'samples': 14181888, 'steps': 73863, 'loss/train': 1.7127434015274048} 11/07/2021 07:35:41 - INFO - __main__ - Step 73865: {'lr': 0.00026133132252393075, 'samples': 14182080, 'steps': 73864, 'loss/train': 1.2651506662368774} 11/07/2021 07:35:41 - INFO - __main__ - Step 73866: {'lr': 0.0002613260212308733, 'samples': 14182272, 'steps': 73865, 'loss/train': 0.45890113711357117} 11/07/2021 07:35:42 - INFO - __main__ - Step 73867: {'lr': 0.0002613207199327127, 'samples': 14182464, 'steps': 73866, 'loss/train': 1.5480154752731323} 11/07/2021 07:35:42 - INFO - __main__ - Step 73868: {'lr': 0.00026131541862945096, 'samples': 14182656, 'steps': 73867, 'loss/train': 1.1971349716186523} 11/07/2021 07:35:43 - INFO - __main__ - Step 73869: {'lr': 0.0002613101173210907, 'samples': 14182848, 'steps': 73868, 'loss/train': 1.5495645999908447} 11/07/2021 07:35:43 - INFO - __main__ - Step 73870: {'lr': 0.00026130481600763437, 'samples': 14183040, 'steps': 73869, 'loss/train': 1.5941271781921387} 11/07/2021 07:35:44 - INFO - __main__ - Step 73871: {'lr': 0.00026129951468908415, 'samples': 14183232, 'steps': 73870, 'loss/train': 1.3778854608535767} 11/07/2021 07:35:44 - INFO - __main__ - Step 73872: {'lr': 0.0002612942133654426, 'samples': 14183424, 'steps': 73871, 'loss/train': 0.85897296667099} 11/07/2021 07:35:44 - INFO - __main__ - Step 73873: {'lr': 0.00026128891203671203, 'samples': 14183616, 'steps': 73872, 'loss/train': 1.4955484867095947} 11/07/2021 07:35:46 - INFO - __main__ - Step 73874: {'lr': 0.00026128361070289484, 'samples': 14183808, 'steps': 73873, 'loss/train': 1.5792940855026245} 11/07/2021 07:35:46 - INFO - __main__ - Step 73875: {'lr': 0.0002612783093639935, 'samples': 14184000, 'steps': 73874, 'loss/train': 1.2175341844558716} 11/07/2021 07:35:46 - INFO - __main__ - Step 73876: {'lr': 0.00026127300802001024, 'samples': 14184192, 'steps': 73875, 'loss/train': 0.15997515618801117} 11/07/2021 07:35:47 - INFO - __main__ - Step 73877: {'lr': 0.0002612677066709476, 'samples': 14184384, 'steps': 73876, 'loss/train': 1.0928876399993896} 11/07/2021 07:35:47 - INFO - __main__ - Step 73878: {'lr': 0.00026126240531680785, 'samples': 14184576, 'steps': 73877, 'loss/train': 1.137357473373413} 11/07/2021 07:35:48 - INFO - __main__ - Step 73879: {'lr': 0.00026125710395759344, 'samples': 14184768, 'steps': 73878, 'loss/train': 1.6080210208892822} 11/07/2021 07:35:48 - INFO - __main__ - Step 73880: {'lr': 0.00026125180259330675, 'samples': 14184960, 'steps': 73879, 'loss/train': 1.383718729019165} 11/07/2021 07:35:49 - INFO - __main__ - Step 73881: {'lr': 0.0002612465012239503, 'samples': 14185152, 'steps': 73880, 'loss/train': 0.4687484800815582} 11/07/2021 07:35:49 - INFO - __main__ - Step 73882: {'lr': 0.0002612411998495262, 'samples': 14185344, 'steps': 73881, 'loss/train': 1.4966614246368408} 11/07/2021 07:35:49 - INFO - __main__ - Step 73883: {'lr': 0.0002612358984700371, 'samples': 14185536, 'steps': 73882, 'loss/train': 1.5263562202453613} 11/07/2021 07:35:50 - INFO - __main__ - Step 73884: {'lr': 0.0002612305970854852, 'samples': 14185728, 'steps': 73883, 'loss/train': 1.1151455640792847} 11/07/2021 07:35:51 - INFO - __main__ - Step 73885: {'lr': 0.0002612252956958729, 'samples': 14185920, 'steps': 73884, 'loss/train': 1.466444969177246} 11/07/2021 07:35:52 - INFO - __main__ - Step 73886: {'lr': 0.0002612199943012028, 'samples': 14186112, 'steps': 73885, 'loss/train': 0.12436756491661072} 11/07/2021 07:35:52 - INFO - __main__ - Step 73887: {'lr': 0.0002612146929014771, 'samples': 14186304, 'steps': 73886, 'loss/train': 1.4613484144210815} 11/07/2021 07:35:52 - INFO - __main__ - Step 73888: {'lr': 0.0002612093914966982, 'samples': 14186496, 'steps': 73887, 'loss/train': 1.6792044639587402} 11/07/2021 07:35:53 - INFO - __main__ - Step 73889: {'lr': 0.00026120409008686854, 'samples': 14186688, 'steps': 73888, 'loss/train': 1.4167795181274414} 11/07/2021 07:35:54 - INFO - __main__ - Step 73890: {'lr': 0.0002611987886719905, 'samples': 14186880, 'steps': 73889, 'loss/train': 1.3462460041046143} 11/07/2021 07:35:54 - INFO - __main__ - Step 73891: {'lr': 0.00026119348725206644, 'samples': 14187072, 'steps': 73890, 'loss/train': 1.6592715978622437} 11/07/2021 07:35:54 - INFO - __main__ - Step 73892: {'lr': 0.00026118818582709875, 'samples': 14187264, 'steps': 73891, 'loss/train': 1.438873052597046} 11/07/2021 07:35:55 - INFO - __main__ - Step 73893: {'lr': 0.0002611828843970898, 'samples': 14187456, 'steps': 73892, 'loss/train': 1.4543144702911377} 11/07/2021 07:35:55 - INFO - __main__ - Step 73894: {'lr': 0.00026117758296204216, 'samples': 14187648, 'steps': 73893, 'loss/train': 1.739402413368225} 11/07/2021 07:35:56 - INFO - __main__ - Step 73895: {'lr': 0.000261172281521958, 'samples': 14187840, 'steps': 73894, 'loss/train': 1.4370054006576538} 11/07/2021 07:35:57 - INFO - __main__ - Step 73896: {'lr': 0.00026116698007683974, 'samples': 14188032, 'steps': 73895, 'loss/train': 1.7505149841308594} 11/07/2021 07:35:57 - INFO - __main__ - Step 73897: {'lr': 0.0002611616786266898, 'samples': 14188224, 'steps': 73896, 'loss/train': 1.5009037256240845} 11/07/2021 07:35:57 - INFO - __main__ - Step 73898: {'lr': 0.0002611563771715106, 'samples': 14188416, 'steps': 73897, 'loss/train': 1.7976304292678833} 11/07/2021 07:35:58 - INFO - __main__ - Step 73899: {'lr': 0.0002611510757113045, 'samples': 14188608, 'steps': 73898, 'loss/train': 1.53610098361969} 11/07/2021 07:35:59 - INFO - __main__ - Step 73900: {'lr': 0.000261145774246074, 'samples': 14188800, 'steps': 73899, 'loss/train': 0.7171405553817749} 11/07/2021 07:35:59 - INFO - __main__ - Step 73901: {'lr': 0.0002611404727758213, 'samples': 14188992, 'steps': 73900, 'loss/train': 1.8524781465530396} 11/07/2021 07:36:00 - INFO - __main__ - Step 73902: {'lr': 0.0002611351713005489, 'samples': 14189184, 'steps': 73901, 'loss/train': 1.7918285131454468} 11/07/2021 07:36:00 - INFO - __main__ - Step 73903: {'lr': 0.00026112986982025914, 'samples': 14189376, 'steps': 73902, 'loss/train': 1.7687709331512451} 11/07/2021 07:36:00 - INFO - __main__ - Step 73904: {'lr': 0.00026112456833495446, 'samples': 14189568, 'steps': 73903, 'loss/train': 0.8558873534202576} 11/07/2021 07:36:01 - INFO - __main__ - Step 73905: {'lr': 0.0002611192668446372, 'samples': 14189760, 'steps': 73904, 'loss/train': 1.3983608484268188} 11/07/2021 07:36:02 - INFO - __main__ - Step 73906: {'lr': 0.00026111396534930976, 'samples': 14189952, 'steps': 73905, 'loss/train': 1.7135447263717651} 11/07/2021 07:36:02 - INFO - __main__ - Step 73907: {'lr': 0.00026110866384897457, 'samples': 14190144, 'steps': 73906, 'loss/train': 1.6224472522735596} 11/07/2021 07:36:02 - INFO - __main__ - Step 73908: {'lr': 0.000261103362343634, 'samples': 14190336, 'steps': 73907, 'loss/train': 1.687271237373352} 11/07/2021 07:36:03 - INFO - __main__ - Step 73909: {'lr': 0.00026109806083329036, 'samples': 14190528, 'steps': 73908, 'loss/train': 1.4493993520736694} 11/07/2021 07:36:04 - INFO - __main__ - Step 73910: {'lr': 0.0002610927593179461, 'samples': 14190720, 'steps': 73909, 'loss/train': 1.5052093267440796} 11/07/2021 07:36:04 - INFO - __main__ - Step 73911: {'lr': 0.00026108745779760366, 'samples': 14190912, 'steps': 73910, 'loss/train': 1.7142958641052246} 11/07/2021 07:36:04 - INFO - __main__ - Step 73912: {'lr': 0.0002610821562722654, 'samples': 14191104, 'steps': 73911, 'loss/train': 1.6577941179275513} 11/07/2021 07:36:05 - INFO - __main__ - Step 73913: {'lr': 0.0002610768547419337, 'samples': 14191296, 'steps': 73912, 'loss/train': 1.519148349761963} 11/07/2021 07:36:05 - INFO - __main__ - Step 73914: {'lr': 0.0002610715532066109, 'samples': 14191488, 'steps': 73913, 'loss/train': 1.8504401445388794} 11/07/2021 07:36:06 - INFO - __main__ - Step 73915: {'lr': 0.0002610662516662994, 'samples': 14191680, 'steps': 73914, 'loss/train': 1.1561640501022339} 11/07/2021 07:36:07 - INFO - __main__ - Step 73916: {'lr': 0.00026106095012100165, 'samples': 14191872, 'steps': 73915, 'loss/train': 1.7516156435012817} 11/07/2021 07:36:07 - INFO - __main__ - Step 73917: {'lr': 0.0002610556485707201, 'samples': 14192064, 'steps': 73916, 'loss/train': 1.557277798652649} 11/07/2021 07:36:07 - INFO - __main__ - Step 73918: {'lr': 0.00026105034701545687, 'samples': 14192256, 'steps': 73917, 'loss/train': 0.8029035925865173} 11/07/2021 07:36:08 - INFO - __main__ - Step 73919: {'lr': 0.0002610450454552147, 'samples': 14192448, 'steps': 73918, 'loss/train': 1.6875038146972656} 11/07/2021 07:36:08 - INFO - __main__ - Step 73920: {'lr': 0.0002610397438899957, 'samples': 14192640, 'steps': 73919, 'loss/train': 1.4842275381088257} 11/07/2021 07:36:09 - INFO - __main__ - Step 73921: {'lr': 0.0002610344423198023, 'samples': 14192832, 'steps': 73920, 'loss/train': 1.246299386024475} 11/07/2021 07:36:09 - INFO - __main__ - Step 73922: {'lr': 0.00026102914074463705, 'samples': 14193024, 'steps': 73921, 'loss/train': 1.808531403541565} 11/07/2021 07:36:10 - INFO - __main__ - Step 73923: {'lr': 0.00026102383916450225, 'samples': 14193216, 'steps': 73922, 'loss/train': 1.4077091217041016} 11/07/2021 07:36:10 - INFO - __main__ - Step 73924: {'lr': 0.0002610185375794002, 'samples': 14193408, 'steps': 73923, 'loss/train': 1.812597393989563} 11/07/2021 07:36:10 - INFO - __main__ - Step 73925: {'lr': 0.0002610132359893335, 'samples': 14193600, 'steps': 73924, 'loss/train': 1.490719199180603} 11/07/2021 07:36:11 - INFO - __main__ - Step 73926: {'lr': 0.0002610079343943043, 'samples': 14193792, 'steps': 73925, 'loss/train': 1.2882136106491089} 11/07/2021 07:36:12 - INFO - __main__ - Step 73927: {'lr': 0.0002610026327943151, 'samples': 14193984, 'steps': 73926, 'loss/train': 1.5444573163986206} 11/07/2021 07:36:12 - INFO - __main__ - Step 73928: {'lr': 0.00026099733118936826, 'samples': 14194176, 'steps': 73927, 'loss/train': 1.7308599948883057} 11/07/2021 07:36:13 - INFO - __main__ - Step 73929: {'lr': 0.0002609920295794662, 'samples': 14194368, 'steps': 73928, 'loss/train': 1.3535674810409546} 11/07/2021 07:36:13 - INFO - __main__ - Step 73930: {'lr': 0.00026098672796461144, 'samples': 14194560, 'steps': 73929, 'loss/train': 1.5287871360778809} 11/07/2021 07:36:14 - INFO - __main__ - Step 73931: {'lr': 0.0002609814263448061, 'samples': 14194752, 'steps': 73930, 'loss/train': 1.729267954826355} 11/07/2021 07:36:14 - INFO - __main__ - Step 73932: {'lr': 0.00026097612472005265, 'samples': 14194944, 'steps': 73931, 'loss/train': 1.3516991138458252} 11/07/2021 07:36:15 - INFO - __main__ - Step 73933: {'lr': 0.00026097082309035365, 'samples': 14195136, 'steps': 73932, 'loss/train': 1.2764004468917847} 11/07/2021 07:36:15 - INFO - __main__ - Step 73934: {'lr': 0.0002609655214557113, 'samples': 14195328, 'steps': 73933, 'loss/train': 1.7784318923950195} 11/07/2021 07:36:15 - INFO - __main__ - Step 73935: {'lr': 0.0002609602198161281, 'samples': 14195520, 'steps': 73934, 'loss/train': 0.9197655320167542} 11/07/2021 07:36:16 - INFO - __main__ - Step 73936: {'lr': 0.00026095491817160633, 'samples': 14195712, 'steps': 73935, 'loss/train': 1.3828628063201904} 11/07/2021 07:36:17 - INFO - __main__ - Step 73937: {'lr': 0.0002609496165221485, 'samples': 14195904, 'steps': 73936, 'loss/train': 1.3059377670288086} 11/07/2021 07:36:17 - INFO - __main__ - Step 73938: {'lr': 0.0002609443148677569, 'samples': 14196096, 'steps': 73937, 'loss/train': 1.728193759918213} 11/07/2021 07:36:17 - INFO - __main__ - Step 73939: {'lr': 0.00026093901320843393, 'samples': 14196288, 'steps': 73938, 'loss/train': 1.1920273303985596} 11/07/2021 07:36:18 - INFO - __main__ - Step 73940: {'lr': 0.00026093371154418206, 'samples': 14196480, 'steps': 73939, 'loss/train': 1.224786400794983} 11/07/2021 07:36:18 - INFO - __main__ - Step 73941: {'lr': 0.0002609284098750037, 'samples': 14196672, 'steps': 73940, 'loss/train': 0.903457760810852} 11/07/2021 07:36:19 - INFO - __main__ - Step 73942: {'lr': 0.000260923108200901, 'samples': 14196864, 'steps': 73941, 'loss/train': 1.4244612455368042} 11/07/2021 07:36:20 - INFO - __main__ - Step 73943: {'lr': 0.0002609178065218766, 'samples': 14197056, 'steps': 73942, 'loss/train': 1.2142831087112427} 11/07/2021 07:36:20 - INFO - __main__ - Step 73944: {'lr': 0.0002609125048379329, 'samples': 14197248, 'steps': 73943, 'loss/train': 1.424270749092102} 11/07/2021 07:36:20 - INFO - __main__ - Step 73945: {'lr': 0.00026090720314907206, 'samples': 14197440, 'steps': 73944, 'loss/train': 1.4370791912078857} 11/07/2021 07:36:21 - INFO - __main__ - Step 73946: {'lr': 0.00026090190145529665, 'samples': 14197632, 'steps': 73945, 'loss/train': 1.1708083152770996} 11/07/2021 07:36:22 - INFO - __main__ - Step 73947: {'lr': 0.000260896599756609, 'samples': 14197824, 'steps': 73946, 'loss/train': 1.5183460712432861} 11/07/2021 07:36:22 - INFO - __main__ - Step 73948: {'lr': 0.00026089129805301155, 'samples': 14198016, 'steps': 73947, 'loss/train': 0.8002166152000427} 11/07/2021 07:36:22 - INFO - __main__ - Step 73949: {'lr': 0.0002608859963445066, 'samples': 14198208, 'steps': 73948, 'loss/train': 1.1506842374801636} 11/07/2021 07:36:23 - INFO - __main__ - Step 73950: {'lr': 0.0002608806946310966, 'samples': 14198400, 'steps': 73949, 'loss/train': 1.5506618022918701} 11/07/2021 07:36:23 - INFO - __main__ - Step 73951: {'lr': 0.00026087539291278395, 'samples': 14198592, 'steps': 73950, 'loss/train': 1.2598166465759277} 11/07/2021 07:36:24 - INFO - __main__ - Step 73952: {'lr': 0.000260870091189571, 'samples': 14198784, 'steps': 73951, 'loss/train': 1.4625625610351562} 11/07/2021 07:36:24 - INFO - __main__ - Step 73953: {'lr': 0.00026086478946146015, 'samples': 14198976, 'steps': 73952, 'loss/train': 0.2549084424972534} 11/07/2021 07:36:25 - INFO - __main__ - Step 73954: {'lr': 0.00026085948772845377, 'samples': 14199168, 'steps': 73953, 'loss/train': 1.7248347997665405} 11/07/2021 07:36:25 - INFO - __main__ - Step 73955: {'lr': 0.0002608541859905543, 'samples': 14199360, 'steps': 73954, 'loss/train': 1.2283213138580322} 11/07/2021 07:36:25 - INFO - __main__ - Step 73956: {'lr': 0.00026084888424776414, 'samples': 14199552, 'steps': 73955, 'loss/train': 1.7223501205444336} 11/07/2021 07:36:26 - INFO - __main__ - Step 73957: {'lr': 0.0002608435825000856, 'samples': 14199744, 'steps': 73956, 'loss/train': 1.3859946727752686} 11/07/2021 07:36:27 - INFO - __main__ - Step 73958: {'lr': 0.0002608382807475211, 'samples': 14199936, 'steps': 73957, 'loss/train': 1.2398743629455566} 11/07/2021 07:36:27 - INFO - __main__ - Step 73959: {'lr': 0.00026083297899007305, 'samples': 14200128, 'steps': 73958, 'loss/train': 1.6133043766021729} 11/07/2021 07:36:27 - INFO - __main__ - Step 73960: {'lr': 0.0002608276772277438, 'samples': 14200320, 'steps': 73959, 'loss/train': 1.0746194124221802} 11/07/2021 07:36:28 - INFO - __main__ - Step 73961: {'lr': 0.00026082237546053584, 'samples': 14200512, 'steps': 73960, 'loss/train': 1.3485524654388428} 11/07/2021 07:36:29 - INFO - __main__ - Step 73962: {'lr': 0.00026081707368845144, 'samples': 14200704, 'steps': 73961, 'loss/train': 1.9352469444274902} 11/07/2021 07:36:29 - INFO - __main__ - Step 73963: {'lr': 0.000260811771911493, 'samples': 14200896, 'steps': 73962, 'loss/train': 1.2070081233978271} 11/07/2021 07:36:29 - INFO - __main__ - Step 73964: {'lr': 0.00026080647012966294, 'samples': 14201088, 'steps': 73963, 'loss/train': 1.6110843420028687} 11/07/2021 07:36:30 - INFO - __main__ - Step 73965: {'lr': 0.0002608011683429637, 'samples': 14201280, 'steps': 73964, 'loss/train': 1.4771405458450317} 11/07/2021 07:36:30 - INFO - __main__ - Step 73966: {'lr': 0.0002607958665513976, 'samples': 14201472, 'steps': 73965, 'loss/train': 1.6037946939468384} 11/07/2021 07:36:31 - INFO - __main__ - Step 73967: {'lr': 0.000260790564754967, 'samples': 14201664, 'steps': 73966, 'loss/train': 1.568526268005371} 11/07/2021 07:36:32 - INFO - __main__ - Step 73968: {'lr': 0.0002607852629536745, 'samples': 14201856, 'steps': 73967, 'loss/train': 0.9093716144561768} 11/07/2021 07:36:32 - INFO - __main__ - Step 73969: {'lr': 0.0002607799611475222, 'samples': 14202048, 'steps': 73968, 'loss/train': 1.358333706855774} 11/07/2021 07:36:32 - INFO - __main__ - Step 73970: {'lr': 0.0002607746593365126, 'samples': 14202240, 'steps': 73969, 'loss/train': 1.4241061210632324} 11/07/2021 07:36:33 - INFO - __main__ - Step 73971: {'lr': 0.0002607693575206481, 'samples': 14202432, 'steps': 73970, 'loss/train': 1.2015087604522705} 11/07/2021 07:36:33 - INFO - __main__ - Step 73972: {'lr': 0.0002607640556999312, 'samples': 14202624, 'steps': 73971, 'loss/train': 1.5715892314910889} 11/07/2021 07:36:34 - INFO - __main__ - Step 73973: {'lr': 0.00026075875387436407, 'samples': 14202816, 'steps': 73972, 'loss/train': 1.4070125818252563} 11/07/2021 07:36:34 - INFO - __main__ - Step 73974: {'lr': 0.0002607534520439492, 'samples': 14203008, 'steps': 73973, 'loss/train': 1.4610320329666138} 11/07/2021 07:36:35 - INFO - __main__ - Step 73975: {'lr': 0.0002607481502086891, 'samples': 14203200, 'steps': 73974, 'loss/train': 1.3459062576293945} 11/07/2021 07:36:35 - INFO - __main__ - Step 73976: {'lr': 0.00026074284836858605, 'samples': 14203392, 'steps': 73975, 'loss/train': 1.7890691757202148} 11/07/2021 07:36:35 - INFO - __main__ - Step 73977: {'lr': 0.00026073754652364235, 'samples': 14203584, 'steps': 73976, 'loss/train': 1.4965540170669556} 11/07/2021 07:36:36 - INFO - __main__ - Step 73978: {'lr': 0.00026073224467386056, 'samples': 14203776, 'steps': 73977, 'loss/train': 1.3359122276306152} 11/07/2021 07:36:37 - INFO - __main__ - Step 73979: {'lr': 0.00026072694281924284, 'samples': 14203968, 'steps': 73978, 'loss/train': 1.367215871810913} 11/07/2021 07:36:37 - INFO - __main__ - Step 73980: {'lr': 0.00026072164095979186, 'samples': 14204160, 'steps': 73979, 'loss/train': 1.2341424226760864} 11/07/2021 07:36:38 - INFO - __main__ - Step 73981: {'lr': 0.00026071633909550984, 'samples': 14204352, 'steps': 73980, 'loss/train': 0.7321109771728516} 11/07/2021 07:36:38 - INFO - __main__ - Step 73982: {'lr': 0.0002607110372263992, 'samples': 14204544, 'steps': 73981, 'loss/train': 1.4914946556091309} 11/07/2021 07:36:39 - INFO - __main__ - Step 73983: {'lr': 0.0002607057353524623, 'samples': 14204736, 'steps': 73982, 'loss/train': 0.7278684973716736} 11/07/2021 07:36:39 - INFO - __main__ - Step 73984: {'lr': 0.00026070043347370164, 'samples': 14204928, 'steps': 73983, 'loss/train': 0.06203259155154228} 11/07/2021 07:36:40 - INFO - __main__ - Step 73985: {'lr': 0.00026069513159011947, 'samples': 14205120, 'steps': 73984, 'loss/train': 1.8043150901794434} 11/07/2021 07:36:40 - INFO - __main__ - Step 73986: {'lr': 0.00026068982970171823, 'samples': 14205312, 'steps': 73985, 'loss/train': 1.4465044736862183} 11/07/2021 07:36:40 - INFO - __main__ - Step 73987: {'lr': 0.0002606845278085003, 'samples': 14205504, 'steps': 73986, 'loss/train': 1.0984848737716675} 11/07/2021 07:36:41 - INFO - __main__ - Step 73988: {'lr': 0.0002606792259104682, 'samples': 14205696, 'steps': 73987, 'loss/train': 1.4914169311523438} 11/07/2021 07:36:42 - INFO - __main__ - Step 73989: {'lr': 0.0002606739240076241, 'samples': 14205888, 'steps': 73988, 'loss/train': 1.633368730545044} 11/07/2021 07:36:42 - INFO - __main__ - Step 73990: {'lr': 0.00026066862209997053, 'samples': 14206080, 'steps': 73989, 'loss/train': 1.266632318496704} 11/07/2021 07:36:42 - INFO - __main__ - Step 73991: {'lr': 0.0002606633201875098, 'samples': 14206272, 'steps': 73990, 'loss/train': 1.1123273372650146} 11/07/2021 07:36:43 - INFO - __main__ - Step 73992: {'lr': 0.00026065801827024446, 'samples': 14206464, 'steps': 73991, 'loss/train': 1.3588014841079712} 11/07/2021 07:36:44 - INFO - __main__ - Step 73993: {'lr': 0.0002606527163481767, 'samples': 14206656, 'steps': 73992, 'loss/train': 1.3968135118484497} 11/07/2021 07:36:44 - INFO - __main__ - Step 73994: {'lr': 0.000260647414421309, 'samples': 14206848, 'steps': 73993, 'loss/train': 1.0585373640060425} 11/07/2021 07:36:44 - INFO - __main__ - Step 73995: {'lr': 0.0002606421124896437, 'samples': 14207040, 'steps': 73994, 'loss/train': 1.5680066347122192} 11/07/2021 07:36:45 - INFO - __main__ - Step 73996: {'lr': 0.0002606368105531833, 'samples': 14207232, 'steps': 73995, 'loss/train': 0.5984134078025818} 11/07/2021 07:36:45 - INFO - __main__ - Step 73997: {'lr': 0.00026063150861193, 'samples': 14207424, 'steps': 73996, 'loss/train': 1.5554147958755493} 11/07/2021 07:36:46 - INFO - __main__ - Step 73998: {'lr': 0.0002606262066658864, 'samples': 14207616, 'steps': 73997, 'loss/train': 1.7371010780334473} 11/07/2021 07:36:46 - INFO - __main__ - Step 73999: {'lr': 0.0002606209047150548, 'samples': 14207808, 'steps': 73998, 'loss/train': 1.3152037858963013} 11/07/2021 07:36:47 - INFO - __main__ - Step 74000: {'lr': 0.00026061560275943753, 'samples': 14208000, 'steps': 73999, 'loss/train': 1.3993027210235596} 11/07/2021 07:36:47 - INFO - __main__ - Step 74001: {'lr': 0.0002606103007990371, 'samples': 14208192, 'steps': 74000, 'loss/train': 1.4358649253845215} 11/07/2021 07:36:48 - INFO - __main__ - Step 74002: {'lr': 0.0002606049988338558, 'samples': 14208384, 'steps': 74001, 'loss/train': 1.617768406867981} 11/07/2021 07:36:49 - INFO - __main__ - Step 74003: {'lr': 0.00026059969686389605, 'samples': 14208576, 'steps': 74002, 'loss/train': 1.2955108880996704} 11/07/2021 07:36:49 - INFO - __main__ - Step 74004: {'lr': 0.0002605943948891603, 'samples': 14208768, 'steps': 74003, 'loss/train': 1.6065568923950195} 11/07/2021 07:36:49 - INFO - __main__ - Step 74005: {'lr': 0.00026058909290965077, 'samples': 14208960, 'steps': 74004, 'loss/train': 1.319629192352295} 11/07/2021 07:36:50 - INFO - __main__ - Step 74006: {'lr': 0.00026058379092537, 'samples': 14209152, 'steps': 74005, 'loss/train': 0.3488248884677887} 11/07/2021 07:36:50 - INFO - __main__ - Step 74007: {'lr': 0.0002605784889363203, 'samples': 14209344, 'steps': 74006, 'loss/train': 0.9389576315879822} 11/07/2021 07:36:51 - INFO - __main__ - Step 74008: {'lr': 0.00026057318694250423, 'samples': 14209536, 'steps': 74007, 'loss/train': 1.5373218059539795} 11/07/2021 07:36:52 - INFO - __main__ - Step 74009: {'lr': 0.0002605678849439239, 'samples': 14209728, 'steps': 74008, 'loss/train': 1.5059657096862793} 11/07/2021 07:36:52 - INFO - __main__ - Step 74010: {'lr': 0.00026056258294058186, 'samples': 14209920, 'steps': 74009, 'loss/train': 1.322625756263733} 11/07/2021 07:36:52 - INFO - __main__ - Step 74011: {'lr': 0.00026055728093248053, 'samples': 14210112, 'steps': 74010, 'loss/train': 1.1573173999786377} 11/07/2021 07:36:53 - INFO - __main__ - Step 74012: {'lr': 0.0002605519789196223, 'samples': 14210304, 'steps': 74011, 'loss/train': 1.5699769258499146} 11/07/2021 07:36:53 - INFO - __main__ - Step 74013: {'lr': 0.0002605466769020094, 'samples': 14210496, 'steps': 74012, 'loss/train': 1.5136665105819702} 11/07/2021 07:36:55 - INFO - __main__ - Step 74014: {'lr': 0.0002605413748796444, 'samples': 14210688, 'steps': 74013, 'loss/train': 0.8082688450813293} 11/07/2021 07:36:55 - INFO - __main__ - Step 74015: {'lr': 0.0002605360728525297, 'samples': 14210880, 'steps': 74014, 'loss/train': 1.4872387647628784} 11/07/2021 07:36:55 - INFO - __main__ - Step 74016: {'lr': 0.00026053077082066747, 'samples': 14211072, 'steps': 74015, 'loss/train': 1.1843292713165283} 11/07/2021 07:36:56 - INFO - __main__ - Step 74017: {'lr': 0.00026052546878406024, 'samples': 14211264, 'steps': 74016, 'loss/train': 1.7655560970306396} 11/07/2021 07:36:56 - INFO - __main__ - Step 74018: {'lr': 0.00026052016674271044, 'samples': 14211456, 'steps': 74017, 'loss/train': 1.7558780908584595} 11/07/2021 07:36:56 - INFO - __main__ - Step 74019: {'lr': 0.0002605148646966204, 'samples': 14211648, 'steps': 74018, 'loss/train': 1.377856969833374} 11/07/2021 07:36:57 - INFO - __main__ - Step 74020: {'lr': 0.00026050956264579256, 'samples': 14211840, 'steps': 74019, 'loss/train': 1.4060484170913696} 11/07/2021 07:36:58 - INFO - __main__ - Step 74021: {'lr': 0.00026050426059022924, 'samples': 14212032, 'steps': 74020, 'loss/train': 1.0360883474349976} 11/07/2021 07:36:58 - INFO - __main__ - Step 74022: {'lr': 0.0002604989585299329, 'samples': 14212224, 'steps': 74021, 'loss/train': 0.7780193090438843} 11/07/2021 07:36:59 - INFO - __main__ - Step 74023: {'lr': 0.00026049365646490586, 'samples': 14212416, 'steps': 74022, 'loss/train': 1.475223183631897} 11/07/2021 07:36:59 - INFO - __main__ - Step 74024: {'lr': 0.0002604883543951505, 'samples': 14212608, 'steps': 74023, 'loss/train': 1.1740490198135376} 11/07/2021 07:36:59 - INFO - __main__ - Step 74025: {'lr': 0.00026048305232066933, 'samples': 14212800, 'steps': 74024, 'loss/train': 2.2184982299804688} 11/07/2021 07:37:00 - INFO - __main__ - Step 74026: {'lr': 0.0002604777502414646, 'samples': 14212992, 'steps': 74025, 'loss/train': 1.3430914878845215} 11/07/2021 07:37:01 - INFO - __main__ - Step 74027: {'lr': 0.0002604724481575388, 'samples': 14213184, 'steps': 74026, 'loss/train': 0.7432152032852173} 11/07/2021 07:37:01 - INFO - __main__ - Step 74028: {'lr': 0.00026046714606889424, 'samples': 14213376, 'steps': 74027, 'loss/train': 1.4754674434661865} 11/07/2021 07:37:01 - INFO - __main__ - Step 74029: {'lr': 0.0002604618439755334, 'samples': 14213568, 'steps': 74028, 'loss/train': 0.9237378239631653} 11/07/2021 07:37:02 - INFO - __main__ - Step 74030: {'lr': 0.00026045654187745854, 'samples': 14213760, 'steps': 74029, 'loss/train': 0.9786043763160706} 11/07/2021 07:37:03 - INFO - __main__ - Step 74031: {'lr': 0.00026045123977467215, 'samples': 14213952, 'steps': 74030, 'loss/train': 0.8403305411338806} 11/07/2021 07:37:03 - INFO - __main__ - Step 74032: {'lr': 0.0002604459376671766, 'samples': 14214144, 'steps': 74031, 'loss/train': 1.110159158706665} 11/07/2021 07:37:04 - INFO - __main__ - Step 74033: {'lr': 0.0002604406355549743, 'samples': 14214336, 'steps': 74032, 'loss/train': 1.8246222734451294} 11/07/2021 07:37:04 - INFO - __main__ - Step 74034: {'lr': 0.0002604353334380675, 'samples': 14214528, 'steps': 74033, 'loss/train': 0.5677427053451538} 11/07/2021 07:37:04 - INFO - __main__ - Step 74035: {'lr': 0.0002604300313164589, 'samples': 14214720, 'steps': 74034, 'loss/train': 1.5183039903640747} 11/07/2021 07:37:05 - INFO - __main__ - Step 74036: {'lr': 0.0002604247291901505, 'samples': 14214912, 'steps': 74035, 'loss/train': 1.471750020980835} 11/07/2021 07:37:06 - INFO - __main__ - Step 74037: {'lr': 0.000260419427059145, 'samples': 14215104, 'steps': 74036, 'loss/train': 1.4545015096664429} 11/07/2021 07:37:06 - INFO - __main__ - Step 74038: {'lr': 0.00026041412492344457, 'samples': 14215296, 'steps': 74037, 'loss/train': 1.0596269369125366} 11/07/2021 07:37:06 - INFO - __main__ - Step 74039: {'lr': 0.00026040882278305176, 'samples': 14215488, 'steps': 74038, 'loss/train': 1.267478585243225} 11/07/2021 07:37:07 - INFO - __main__ - Step 74040: {'lr': 0.00026040352063796886, 'samples': 14215680, 'steps': 74039, 'loss/train': 1.315689206123352} 11/07/2021 07:37:08 - INFO - __main__ - Step 74041: {'lr': 0.00026039821848819835, 'samples': 14215872, 'steps': 74040, 'loss/train': 1.4508346319198608} 11/07/2021 07:37:08 - INFO - __main__ - Step 74042: {'lr': 0.0002603929163337425, 'samples': 14216064, 'steps': 74041, 'loss/train': 1.4967279434204102} 11/07/2021 07:37:08 - INFO - __main__ - Step 74043: {'lr': 0.0002603876141746038, 'samples': 14216256, 'steps': 74042, 'loss/train': 1.19334876537323} 11/07/2021 07:37:09 - INFO - __main__ - Step 74044: {'lr': 0.0002603823120107846, 'samples': 14216448, 'steps': 74043, 'loss/train': 1.4415279626846313} 11/07/2021 07:37:09 - INFO - __main__ - Step 74045: {'lr': 0.00026037700984228725, 'samples': 14216640, 'steps': 74044, 'loss/train': 1.8770835399627686} 11/07/2021 07:37:10 - INFO - __main__ - Step 74046: {'lr': 0.00026037170766911424, 'samples': 14216832, 'steps': 74045, 'loss/train': 1.1535464525222778} 11/07/2021 07:37:11 - INFO - __main__ - Step 74047: {'lr': 0.00026036640549126784, 'samples': 14217024, 'steps': 74046, 'loss/train': 1.4210782051086426} 11/07/2021 07:37:11 - INFO - __main__ - Step 74048: {'lr': 0.0002603611033087506, 'samples': 14217216, 'steps': 74047, 'loss/train': 1.39493727684021} 11/07/2021 07:37:11 - INFO - __main__ - Step 74049: {'lr': 0.0002603558011215647, 'samples': 14217408, 'steps': 74048, 'loss/train': 1.2759929895401} 11/07/2021 07:37:12 - INFO - __main__ - Step 74050: {'lr': 0.0002603504989297126, 'samples': 14217600, 'steps': 74049, 'loss/train': 1.6319648027420044} 11/07/2021 07:37:12 - INFO - __main__ - Step 74051: {'lr': 0.00026034519673319683, 'samples': 14217792, 'steps': 74050, 'loss/train': 1.558200716972351} 11/07/2021 07:37:13 - INFO - __main__ - Step 74052: {'lr': 0.00026033989453201964, 'samples': 14217984, 'steps': 74051, 'loss/train': 1.5180906057357788} 11/07/2021 07:37:13 - INFO - __main__ - Step 74053: {'lr': 0.0002603345923261835, 'samples': 14218176, 'steps': 74052, 'loss/train': 1.3811274766921997} 11/07/2021 07:37:14 - INFO - __main__ - Step 74054: {'lr': 0.0002603292901156907, 'samples': 14218368, 'steps': 74053, 'loss/train': 1.2863786220550537} 11/07/2021 07:37:14 - INFO - __main__ - Step 74055: {'lr': 0.00026032398790054367, 'samples': 14218560, 'steps': 74054, 'loss/train': 1.5050196647644043} 11/07/2021 07:37:14 - INFO - __main__ - Step 74056: {'lr': 0.0002603186856807448, 'samples': 14218752, 'steps': 74055, 'loss/train': 1.486984133720398} 11/07/2021 07:37:15 - INFO - __main__ - Step 74057: {'lr': 0.00026031338345629653, 'samples': 14218944, 'steps': 74056, 'loss/train': 1.2986152172088623} 11/07/2021 07:37:16 - INFO - __main__ - Step 74058: {'lr': 0.0002603080812272012, 'samples': 14219136, 'steps': 74057, 'loss/train': 1.1916109323501587} 11/07/2021 07:37:16 - INFO - __main__ - Step 74059: {'lr': 0.0002603027789934612, 'samples': 14219328, 'steps': 74058, 'loss/train': 1.5210802555084229} 11/07/2021 07:37:17 - INFO - __main__ - Step 74060: {'lr': 0.00026029747675507893, 'samples': 14219520, 'steps': 74059, 'loss/train': 1.3631500005722046} 11/07/2021 07:37:17 - INFO - __main__ - Step 74061: {'lr': 0.0002602921745120568, 'samples': 14219712, 'steps': 74060, 'loss/train': 1.2050325870513916} 11/07/2021 07:37:18 - INFO - __main__ - Step 74062: {'lr': 0.00026028687226439714, 'samples': 14219904, 'steps': 74061, 'loss/train': 1.3820157051086426} 11/07/2021 07:37:18 - INFO - __main__ - Step 74063: {'lr': 0.00026028157001210236, 'samples': 14220096, 'steps': 74062, 'loss/train': 1.4288398027420044} 11/07/2021 07:37:19 - INFO - __main__ - Step 74064: {'lr': 0.00026027626775517495, 'samples': 14220288, 'steps': 74063, 'loss/train': 2.0626144409179688} 11/07/2021 07:37:19 - INFO - __main__ - Step 74065: {'lr': 0.00026027096549361713, 'samples': 14220480, 'steps': 74064, 'loss/train': 1.4956849813461304} 11/07/2021 07:37:19 - INFO - __main__ - Step 74066: {'lr': 0.00026026566322743134, 'samples': 14220672, 'steps': 74065, 'loss/train': 0.7351279258728027} 11/07/2021 07:37:20 - INFO - __main__ - Step 74067: {'lr': 0.00026026036095662, 'samples': 14220864, 'steps': 74066, 'loss/train': 1.4536794424057007} 11/07/2021 07:37:21 - INFO - __main__ - Step 74068: {'lr': 0.0002602550586811856, 'samples': 14221056, 'steps': 74067, 'loss/train': 1.1144369840621948} 11/07/2021 07:37:21 - INFO - __main__ - Step 74069: {'lr': 0.0002602497564011304, 'samples': 14221248, 'steps': 74068, 'loss/train': 0.6947505474090576} 11/07/2021 07:37:21 - INFO - __main__ - Step 74070: {'lr': 0.0002602444541164568, 'samples': 14221440, 'steps': 74069, 'loss/train': 1.1133992671966553} 11/07/2021 07:37:22 - INFO - __main__ - Step 74071: {'lr': 0.00026023915182716716, 'samples': 14221632, 'steps': 74070, 'loss/train': 1.4353209733963013} 11/07/2021 07:37:23 - INFO - __main__ - Step 74072: {'lr': 0.00026023384953326395, 'samples': 14221824, 'steps': 74071, 'loss/train': 1.2775013446807861} 11/07/2021 07:37:23 - INFO - __main__ - Step 74073: {'lr': 0.00026022854723474953, 'samples': 14222016, 'steps': 74072, 'loss/train': 0.9570607542991638} 11/07/2021 07:37:24 - INFO - __main__ - Step 74074: {'lr': 0.0002602232449316263, 'samples': 14222208, 'steps': 74073, 'loss/train': 1.3460655212402344} 11/07/2021 07:37:24 - INFO - __main__ - Step 74075: {'lr': 0.00026021794262389667, 'samples': 14222400, 'steps': 74074, 'loss/train': 0.8769649863243103} 11/07/2021 07:37:24 - INFO - __main__ - Step 74076: {'lr': 0.00026021264031156295, 'samples': 14222592, 'steps': 74075, 'loss/train': 1.175830602645874} 11/07/2021 07:37:25 - INFO - __main__ - Step 74077: {'lr': 0.00026020733799462755, 'samples': 14222784, 'steps': 74076, 'loss/train': 1.8379894495010376} 11/07/2021 07:37:26 - INFO - __main__ - Step 74078: {'lr': 0.00026020203567309286, 'samples': 14222976, 'steps': 74077, 'loss/train': 1.1885850429534912} 11/07/2021 07:37:26 - INFO - __main__ - Step 74079: {'lr': 0.00026019673334696136, 'samples': 14223168, 'steps': 74078, 'loss/train': 0.9446211457252502} 11/07/2021 07:37:26 - INFO - __main__ - Step 74080: {'lr': 0.00026019143101623535, 'samples': 14223360, 'steps': 74079, 'loss/train': 1.932086706161499} 11/07/2021 07:37:27 - INFO - __main__ - Step 74081: {'lr': 0.0002601861286809172, 'samples': 14223552, 'steps': 74080, 'loss/train': 1.601418137550354} 11/07/2021 07:37:28 - INFO - __main__ - Step 74082: {'lr': 0.0002601808263410094, 'samples': 14223744, 'steps': 74081, 'loss/train': 1.5814746618270874} 11/07/2021 07:37:28 - INFO - __main__ - Step 74083: {'lr': 0.0002601755239965142, 'samples': 14223936, 'steps': 74082, 'loss/train': 1.1756126880645752} 11/07/2021 07:37:28 - INFO - __main__ - Step 74084: {'lr': 0.00026017022164743413, 'samples': 14224128, 'steps': 74083, 'loss/train': 1.420294165611267} 11/07/2021 07:37:29 - INFO - __main__ - Step 74085: {'lr': 0.00026016491929377143, 'samples': 14224320, 'steps': 74084, 'loss/train': 0.8807829022407532} 11/07/2021 07:37:29 - INFO - __main__ - Step 74086: {'lr': 0.0002601596169355287, 'samples': 14224512, 'steps': 74085, 'loss/train': 1.7643535137176514} 11/07/2021 07:37:30 - INFO - __main__ - Step 74087: {'lr': 0.0002601543145727081, 'samples': 14224704, 'steps': 74086, 'loss/train': 1.4183979034423828} 11/07/2021 07:37:31 - INFO - __main__ - Step 74088: {'lr': 0.00026014901220531217, 'samples': 14224896, 'steps': 74087, 'loss/train': 1.475930094718933} 11/07/2021 07:37:31 - INFO - __main__ - Step 74089: {'lr': 0.0002601437098333433, 'samples': 14225088, 'steps': 74088, 'loss/train': 1.3584308624267578} 11/07/2021 07:37:31 - INFO - __main__ - Step 74090: {'lr': 0.00026013840745680374, 'samples': 14225280, 'steps': 74089, 'loss/train': 1.6383793354034424} 11/07/2021 07:37:32 - INFO - __main__ - Step 74091: {'lr': 0.000260133105075696, 'samples': 14225472, 'steps': 74090, 'loss/train': 1.136425256729126} 11/07/2021 07:37:32 - INFO - __main__ - Step 74092: {'lr': 0.00026012780269002244, 'samples': 14225664, 'steps': 74091, 'loss/train': 1.263107419013977} 11/07/2021 07:37:33 - INFO - __main__ - Step 74093: {'lr': 0.00026012250029978543, 'samples': 14225856, 'steps': 74092, 'loss/train': 1.7317405939102173} 11/07/2021 07:37:33 - INFO - __main__ - Step 74094: {'lr': 0.0002601171979049874, 'samples': 14226048, 'steps': 74093, 'loss/train': 1.5971037149429321} 11/07/2021 07:37:34 - INFO - __main__ - Step 74095: {'lr': 0.0002601118955056307, 'samples': 14226240, 'steps': 74094, 'loss/train': 1.9216794967651367} 11/07/2021 07:37:34 - INFO - __main__ - Step 74096: {'lr': 0.0002601065931017178, 'samples': 14226432, 'steps': 74095, 'loss/train': 1.603088617324829} 11/07/2021 07:37:34 - INFO - __main__ - Step 74097: {'lr': 0.00026010129069325093, 'samples': 14226624, 'steps': 74096, 'loss/train': 1.7420728206634521} 11/07/2021 07:37:35 - INFO - __main__ - Step 74098: {'lr': 0.0002600959882802326, 'samples': 14226816, 'steps': 74097, 'loss/train': 1.6406248807907104} 11/07/2021 07:37:36 - INFO - __main__ - Step 74099: {'lr': 0.0002600906858626652, 'samples': 14227008, 'steps': 74098, 'loss/train': 1.3562737703323364} 11/07/2021 07:37:36 - INFO - __main__ - Step 74100: {'lr': 0.0002600853834405511, 'samples': 14227200, 'steps': 74099, 'loss/train': 1.626953125} 11/07/2021 07:37:36 - INFO - __main__ - Step 74101: {'lr': 0.0002600800810138927, 'samples': 14227392, 'steps': 74100, 'loss/train': 1.6024212837219238} 11/07/2021 07:37:37 - INFO - __main__ - Step 74102: {'lr': 0.00026007477858269235, 'samples': 14227584, 'steps': 74101, 'loss/train': 1.4915860891342163} 11/07/2021 07:37:38 - INFO - __main__ - Step 74103: {'lr': 0.0002600694761469524, 'samples': 14227776, 'steps': 74102, 'loss/train': 0.6107494235038757} 11/07/2021 07:37:38 - INFO - __main__ - Step 74104: {'lr': 0.0002600641737066754, 'samples': 14227968, 'steps': 74103, 'loss/train': 1.1486073732376099} 11/07/2021 07:37:38 - INFO - __main__ - Step 74105: {'lr': 0.00026005887126186357, 'samples': 14228160, 'steps': 74104, 'loss/train': 1.1360344886779785} 11/07/2021 07:37:39 - INFO - __main__ - Step 74106: {'lr': 0.0002600535688125194, 'samples': 14228352, 'steps': 74105, 'loss/train': 1.2323071956634521} 11/07/2021 07:37:39 - INFO - __main__ - Step 74107: {'lr': 0.0002600482663586452, 'samples': 14228544, 'steps': 74106, 'loss/train': 0.8072120547294617} 11/07/2021 07:37:40 - INFO - __main__ - Step 74108: {'lr': 0.00026004296390024346, 'samples': 14228736, 'steps': 74107, 'loss/train': 1.455039143562317} 11/07/2021 07:37:41 - INFO - __main__ - Step 74109: {'lr': 0.0002600376614373165, 'samples': 14228928, 'steps': 74108, 'loss/train': 1.114382028579712} 11/07/2021 07:37:41 - INFO - __main__ - Step 74110: {'lr': 0.00026003235896986674, 'samples': 14229120, 'steps': 74109, 'loss/train': 1.605353832244873} 11/07/2021 07:37:41 - INFO - __main__ - Step 74111: {'lr': 0.0002600270564978965, 'samples': 14229312, 'steps': 74110, 'loss/train': 1.6284304857254028} 11/07/2021 07:37:42 - INFO - __main__ - Step 74112: {'lr': 0.0002600217540214083, 'samples': 14229504, 'steps': 74111, 'loss/train': 1.3525408506393433} 11/07/2021 07:37:43 - INFO - __main__ - Step 74113: {'lr': 0.00026001645154040436, 'samples': 14229696, 'steps': 74112, 'loss/train': 1.4199422597885132} 11/07/2021 07:37:44 - INFO - __main__ - Step 74114: {'lr': 0.0002600111490548872, 'samples': 14229888, 'steps': 74113, 'loss/train': 1.5379486083984375} 11/07/2021 07:37:44 - INFO - __main__ - Step 74115: {'lr': 0.0002600058465648591, 'samples': 14230080, 'steps': 74114, 'loss/train': 1.8135780096054077} 11/07/2021 07:37:44 - INFO - __main__ - Step 74116: {'lr': 0.0002600005440703227, 'samples': 14230272, 'steps': 74115, 'loss/train': 1.8039053678512573} 11/07/2021 07:37:45 - INFO - __main__ - Step 74117: {'lr': 0.00025999524157128013, 'samples': 14230464, 'steps': 74116, 'loss/train': 1.4746071100234985} 11/07/2021 07:37:45 - INFO - __main__ - Step 74118: {'lr': 0.0002599899390677338, 'samples': 14230656, 'steps': 74117, 'loss/train': 1.5132744312286377} 11/07/2021 07:37:46 - INFO - __main__ - Step 74119: {'lr': 0.0002599846365596862, 'samples': 14230848, 'steps': 74118, 'loss/train': 1.3932750225067139} 11/07/2021 07:37:47 - INFO - __main__ - Step 74120: {'lr': 0.0002599793340471397, 'samples': 14231040, 'steps': 74119, 'loss/train': 1.4024126529693604} 11/07/2021 07:37:47 - INFO - __main__ - Step 74121: {'lr': 0.00025997403153009657, 'samples': 14231232, 'steps': 74120, 'loss/train': 1.6709834337234497} 11/07/2021 07:37:47 - INFO - __main__ - Step 74122: {'lr': 0.00025996872900855937, 'samples': 14231424, 'steps': 74121, 'loss/train': 1.3089085817337036} 11/07/2021 07:37:48 - INFO - __main__ - Step 74123: {'lr': 0.0002599634264825305, 'samples': 14231616, 'steps': 74122, 'loss/train': 0.8083512783050537} 11/07/2021 07:37:48 - INFO - __main__ - Step 74124: {'lr': 0.00025995812395201214, 'samples': 14231808, 'steps': 74123, 'loss/train': 1.461047887802124} 11/07/2021 07:37:49 - INFO - __main__ - Step 74125: {'lr': 0.0002599528214170068, 'samples': 14232000, 'steps': 74124, 'loss/train': 1.0361984968185425} 11/07/2021 07:37:49 - INFO - __main__ - Step 74126: {'lr': 0.0002599475188775169, 'samples': 14232192, 'steps': 74125, 'loss/train': 1.166124939918518} 11/07/2021 07:37:50 - INFO - __main__ - Step 74127: {'lr': 0.0002599422163335448, 'samples': 14232384, 'steps': 74126, 'loss/train': 1.3170247077941895} 11/07/2021 07:37:50 - INFO - __main__ - Step 74128: {'lr': 0.00025993691378509295, 'samples': 14232576, 'steps': 74127, 'loss/train': 1.6366820335388184} 11/07/2021 07:37:51 - INFO - __main__ - Step 74129: {'lr': 0.00025993161123216365, 'samples': 14232768, 'steps': 74128, 'loss/train': 1.5778322219848633} 11/07/2021 07:37:52 - INFO - __main__ - Step 74130: {'lr': 0.0002599263086747593, 'samples': 14232960, 'steps': 74129, 'loss/train': 1.7532154321670532} 11/07/2021 07:37:52 - INFO - __main__ - Step 74131: {'lr': 0.00025992100611288226, 'samples': 14233152, 'steps': 74130, 'loss/train': 1.508845567703247} 11/07/2021 07:37:52 - INFO - __main__ - Step 74132: {'lr': 0.00025991570354653504, 'samples': 14233344, 'steps': 74131, 'loss/train': 1.5051124095916748} 11/07/2021 07:37:53 - INFO - __main__ - Step 74133: {'lr': 0.0002599104009757199, 'samples': 14233536, 'steps': 74132, 'loss/train': 1.2660698890686035} 11/07/2021 07:37:53 - INFO - __main__ - Step 74134: {'lr': 0.0002599050984004393, 'samples': 14233728, 'steps': 74133, 'loss/train': 1.3402663469314575} 11/07/2021 07:37:53 - INFO - __main__ - Step 74135: {'lr': 0.00025989979582069565, 'samples': 14233920, 'steps': 74134, 'loss/train': 1.6223485469818115} 11/07/2021 07:37:54 - INFO - __main__ - Step 74136: {'lr': 0.00025989449323649135, 'samples': 14234112, 'steps': 74135, 'loss/train': 1.3725014925003052} 11/07/2021 07:37:55 - INFO - __main__ - Step 74137: {'lr': 0.00025988919064782865, 'samples': 14234304, 'steps': 74136, 'loss/train': 1.2206827402114868} 11/07/2021 07:37:55 - INFO - __main__ - Step 74138: {'lr': 0.0002598838880547101, 'samples': 14234496, 'steps': 74137, 'loss/train': 1.3898309469223022} 11/07/2021 07:37:55 - INFO - __main__ - Step 74139: {'lr': 0.00025987858545713796, 'samples': 14234688, 'steps': 74138, 'loss/train': 1.330733060836792} 11/07/2021 07:37:56 - INFO - __main__ - Step 74140: {'lr': 0.0002598732828551147, 'samples': 14234880, 'steps': 74139, 'loss/train': 1.6341909170150757} 11/07/2021 07:37:57 - INFO - __main__ - Step 74141: {'lr': 0.00025986798024864267, 'samples': 14235072, 'steps': 74140, 'loss/train': 1.2386772632598877} 11/07/2021 07:37:57 - INFO - __main__ - Step 74142: {'lr': 0.00025986267763772433, 'samples': 14235264, 'steps': 74141, 'loss/train': 1.3283995389938354} 11/07/2021 07:37:57 - INFO - __main__ - Step 74143: {'lr': 0.000259857375022362, 'samples': 14235456, 'steps': 74142, 'loss/train': 1.2296875715255737} 11/07/2021 07:37:58 - INFO - __main__ - Step 74144: {'lr': 0.0002598520724025581, 'samples': 14235648, 'steps': 74143, 'loss/train': 1.1299060583114624} 11/07/2021 07:37:58 - INFO - __main__ - Step 74145: {'lr': 0.00025984676977831503, 'samples': 14235840, 'steps': 74144, 'loss/train': 1.25080406665802} 11/07/2021 07:37:59 - INFO - __main__ - Step 74146: {'lr': 0.0002598414671496351, 'samples': 14236032, 'steps': 74145, 'loss/train': 0.43300530314445496} 11/07/2021 07:38:00 - INFO - __main__ - Step 74147: {'lr': 0.00025983616451652074, 'samples': 14236224, 'steps': 74146, 'loss/train': 0.6070417165756226} 11/07/2021 07:38:00 - INFO - __main__ - Step 74148: {'lr': 0.0002598308618789744, 'samples': 14236416, 'steps': 74147, 'loss/train': 1.906659722328186} 11/07/2021 07:38:00 - INFO - __main__ - Step 74149: {'lr': 0.00025982555923699844, 'samples': 14236608, 'steps': 74148, 'loss/train': 1.7929726839065552} 11/07/2021 07:38:01 - INFO - __main__ - Step 74150: {'lr': 0.00025982025659059525, 'samples': 14236800, 'steps': 74149, 'loss/train': 1.4385294914245605} 11/07/2021 07:38:02 - INFO - __main__ - Step 74151: {'lr': 0.00025981495393976716, 'samples': 14236992, 'steps': 74150, 'loss/train': 1.3388910293579102} 11/07/2021 07:38:02 - INFO - __main__ - Step 74152: {'lr': 0.0002598096512845166, 'samples': 14237184, 'steps': 74151, 'loss/train': 0.7648847103118896} 11/07/2021 07:38:03 - INFO - __main__ - Step 74153: {'lr': 0.000259804348624846, 'samples': 14237376, 'steps': 74152, 'loss/train': 0.6356877088546753} 11/07/2021 07:38:03 - INFO - __main__ - Step 74154: {'lr': 0.00025979904596075767, 'samples': 14237568, 'steps': 74153, 'loss/train': 1.3782659769058228} 11/07/2021 07:38:03 - INFO - __main__ - Step 74155: {'lr': 0.0002597937432922541, 'samples': 14237760, 'steps': 74154, 'loss/train': 1.205570936203003} 11/07/2021 07:38:05 - INFO - __main__ - Step 74156: {'lr': 0.0002597884406193376, 'samples': 14237952, 'steps': 74155, 'loss/train': 1.166016936302185} 11/07/2021 07:38:05 - INFO - __main__ - Step 74157: {'lr': 0.00025978313794201055, 'samples': 14238144, 'steps': 74156, 'loss/train': 1.14598548412323} 11/07/2021 07:38:05 - INFO - __main__ - Step 74158: {'lr': 0.0002597778352602754, 'samples': 14238336, 'steps': 74157, 'loss/train': 1.5117377042770386} 11/07/2021 07:38:06 - INFO - __main__ - Step 74159: {'lr': 0.00025977253257413444, 'samples': 14238528, 'steps': 74158, 'loss/train': 0.6463251113891602} 11/07/2021 07:38:06 - INFO - __main__ - Step 74160: {'lr': 0.00025976722988359013, 'samples': 14238720, 'steps': 74159, 'loss/train': 1.2857059240341187} 11/07/2021 07:38:06 - INFO - __main__ - Step 74161: {'lr': 0.00025976192718864493, 'samples': 14238912, 'steps': 74160, 'loss/train': 0.10731858760118484} 11/07/2021 07:38:08 - INFO - __main__ - Step 74162: {'lr': 0.00025975662448930113, 'samples': 14239104, 'steps': 74161, 'loss/train': 1.93372642993927} 11/07/2021 07:38:08 - INFO - __main__ - Step 74163: {'lr': 0.0002597513217855612, 'samples': 14239296, 'steps': 74162, 'loss/train': 1.5513801574707031} 11/07/2021 07:38:08 - INFO - __main__ - Step 74164: {'lr': 0.0002597460190774274, 'samples': 14239488, 'steps': 74163, 'loss/train': 0.8927552700042725} 11/07/2021 07:38:09 - INFO - __main__ - Step 74165: {'lr': 0.0002597407163649022, 'samples': 14239680, 'steps': 74164, 'loss/train': 1.2984040975570679} 11/07/2021 07:38:09 - INFO - __main__ - Step 74166: {'lr': 0.00025973541364798797, 'samples': 14239872, 'steps': 74165, 'loss/train': 0.4190027415752411} 11/07/2021 07:38:10 - INFO - __main__ - Step 74167: {'lr': 0.0002597301109266871, 'samples': 14240064, 'steps': 74166, 'loss/train': 1.4005733728408813} 11/07/2021 07:38:10 - INFO - __main__ - Step 74168: {'lr': 0.0002597248082010021, 'samples': 14240256, 'steps': 74167, 'loss/train': 1.113362193107605} 11/07/2021 07:38:11 - INFO - __main__ - Step 74169: {'lr': 0.0002597195054709351, 'samples': 14240448, 'steps': 74168, 'loss/train': 1.5790003538131714} 11/07/2021 07:38:11 - INFO - __main__ - Step 74170: {'lr': 0.0002597142027364888, 'samples': 14240640, 'steps': 74169, 'loss/train': 1.2077630758285522} 11/07/2021 07:38:11 - INFO - __main__ - Step 74171: {'lr': 0.0002597088999976654, 'samples': 14240832, 'steps': 74170, 'loss/train': 0.9463820457458496} 11/07/2021 07:38:12 - INFO - __main__ - Step 74172: {'lr': 0.00025970359725446725, 'samples': 14241024, 'steps': 74171, 'loss/train': 1.3162829875946045} 11/07/2021 07:38:13 - INFO - __main__ - Step 74173: {'lr': 0.0002596982945068968, 'samples': 14241216, 'steps': 74172, 'loss/train': 1.34275221824646} 11/07/2021 07:38:13 - INFO - __main__ - Step 74174: {'lr': 0.0002596929917549565, 'samples': 14241408, 'steps': 74173, 'loss/train': 1.3200763463974} 11/07/2021 07:38:13 - INFO - __main__ - Step 74175: {'lr': 0.00025968768899864864, 'samples': 14241600, 'steps': 74174, 'loss/train': 1.8201464414596558} 11/07/2021 07:38:14 - INFO - __main__ - Step 74176: {'lr': 0.00025968238623797575, 'samples': 14241792, 'steps': 74175, 'loss/train': 0.8290103077888489} 11/07/2021 07:38:15 - INFO - __main__ - Step 74177: {'lr': 0.0002596770834729401, 'samples': 14241984, 'steps': 74176, 'loss/train': 0.7194477915763855} 11/07/2021 07:38:15 - INFO - __main__ - Step 74178: {'lr': 0.000259671780703544, 'samples': 14242176, 'steps': 74177, 'loss/train': 1.337388038635254} 11/07/2021 07:38:15 - INFO - __main__ - Step 74179: {'lr': 0.00025966647792979, 'samples': 14242368, 'steps': 74178, 'loss/train': 1.3586846590042114} 11/07/2021 07:38:16 - INFO - __main__ - Step 74180: {'lr': 0.0002596611751516805, 'samples': 14242560, 'steps': 74179, 'loss/train': 1.3686732053756714} 11/07/2021 07:38:16 - INFO - __main__ - Step 74181: {'lr': 0.00025965587236921774, 'samples': 14242752, 'steps': 74180, 'loss/train': 1.533400297164917} 11/07/2021 07:38:17 - INFO - __main__ - Step 74182: {'lr': 0.00025965056958240424, 'samples': 14242944, 'steps': 74181, 'loss/train': 1.2689272165298462} 11/07/2021 07:38:18 - INFO - __main__ - Step 74183: {'lr': 0.00025964526679124234, 'samples': 14243136, 'steps': 74182, 'loss/train': 1.7158560752868652} 11/07/2021 07:38:18 - INFO - __main__ - Step 74184: {'lr': 0.00025963996399573435, 'samples': 14243328, 'steps': 74183, 'loss/train': 1.8947950601577759} 11/07/2021 07:38:18 - INFO - __main__ - Step 74185: {'lr': 0.00025963466119588284, 'samples': 14243520, 'steps': 74184, 'loss/train': 1.4260916709899902} 11/07/2021 07:38:19 - INFO - __main__ - Step 74186: {'lr': 0.00025962935839169007, 'samples': 14243712, 'steps': 74185, 'loss/train': 1.2608855962753296} 11/07/2021 07:38:19 - INFO - __main__ - Step 74187: {'lr': 0.0002596240555831585, 'samples': 14243904, 'steps': 74186, 'loss/train': 1.2070804834365845} 11/07/2021 07:38:20 - INFO - __main__ - Step 74188: {'lr': 0.0002596187527702904, 'samples': 14244096, 'steps': 74187, 'loss/train': 2.135457992553711} 11/07/2021 07:38:20 - INFO - __main__ - Step 74189: {'lr': 0.00025961344995308825, 'samples': 14244288, 'steps': 74188, 'loss/train': 1.3663051128387451} 11/07/2021 07:38:21 - INFO - __main__ - Step 74190: {'lr': 0.0002596081471315545, 'samples': 14244480, 'steps': 74189, 'loss/train': 1.5390411615371704} 11/07/2021 07:38:21 - INFO - __main__ - Step 74191: {'lr': 0.0002596028443056914, 'samples': 14244672, 'steps': 74190, 'loss/train': 1.9048758745193481} 11/07/2021 07:38:22 - INFO - __main__ - Step 74192: {'lr': 0.0002595975414755014, 'samples': 14244864, 'steps': 74191, 'loss/train': 0.8365814685821533} 11/07/2021 07:38:22 - INFO - __main__ - Step 74193: {'lr': 0.00025959223864098697, 'samples': 14245056, 'steps': 74192, 'loss/train': 1.2965538501739502} 11/07/2021 07:38:23 - INFO - __main__ - Step 74194: {'lr': 0.00025958693580215036, 'samples': 14245248, 'steps': 74193, 'loss/train': 0.052307020872831345} 11/07/2021 07:38:23 - INFO - __main__ - Step 74195: {'lr': 0.0002595816329589941, 'samples': 14245440, 'steps': 74194, 'loss/train': 1.2411909103393555} 11/07/2021 07:38:23 - INFO - __main__ - Step 74196: {'lr': 0.0002595763301115204, 'samples': 14245632, 'steps': 74195, 'loss/train': 0.9897867441177368} 11/07/2021 07:38:24 - INFO - __main__ - Step 74197: {'lr': 0.0002595710272597318, 'samples': 14245824, 'steps': 74196, 'loss/train': 1.1119487285614014} 11/07/2021 07:38:25 - INFO - __main__ - Step 74198: {'lr': 0.0002595657244036307, 'samples': 14246016, 'steps': 74197, 'loss/train': 1.6236943006515503} 11/07/2021 07:38:25 - INFO - __main__ - Step 74199: {'lr': 0.0002595604215432194, 'samples': 14246208, 'steps': 74198, 'loss/train': 1.3701636791229248} 11/07/2021 07:38:26 - INFO - __main__ - Step 74200: {'lr': 0.00025955511867850026, 'samples': 14246400, 'steps': 74199, 'loss/train': 0.8193984031677246} 11/07/2021 07:38:26 - INFO - __main__ - Step 74201: {'lr': 0.0002595498158094757, 'samples': 14246592, 'steps': 74200, 'loss/train': 1.498580813407898} 11/07/2021 07:38:27 - INFO - __main__ - Step 74202: {'lr': 0.0002595445129361482, 'samples': 14246784, 'steps': 74201, 'loss/train': 1.346059799194336} 11/07/2021 07:38:27 - INFO - __main__ - Step 74203: {'lr': 0.0002595392100585201, 'samples': 14246976, 'steps': 74202, 'loss/train': 0.8870660662651062} 11/07/2021 07:38:28 - INFO - __main__ - Step 74204: {'lr': 0.0002595339071765939, 'samples': 14247168, 'steps': 74203, 'loss/train': 1.2713900804519653} 11/07/2021 07:38:28 - INFO - __main__ - Step 74205: {'lr': 0.0002595286042903717, 'samples': 14247360, 'steps': 74204, 'loss/train': 2.0440714359283447} 11/07/2021 07:38:28 - INFO - __main__ - Step 74206: {'lr': 0.0002595233013998561, 'samples': 14247552, 'steps': 74205, 'loss/train': 1.5382384061813354} 11/07/2021 07:38:29 - INFO - __main__ - Step 74207: {'lr': 0.00025951799850504944, 'samples': 14247744, 'steps': 74206, 'loss/train': 1.5259653329849243} 11/07/2021 07:38:30 - INFO - __main__ - Step 74208: {'lr': 0.00025951269560595407, 'samples': 14247936, 'steps': 74207, 'loss/train': 1.5075438022613525} 11/07/2021 07:38:30 - INFO - __main__ - Step 74209: {'lr': 0.0002595073927025725, 'samples': 14248128, 'steps': 74208, 'loss/train': 1.5686675310134888} 11/07/2021 07:38:30 - INFO - __main__ - Step 74210: {'lr': 0.0002595020897949071, 'samples': 14248320, 'steps': 74209, 'loss/train': 2.000303268432617} 11/07/2021 07:38:31 - INFO - __main__ - Step 74211: {'lr': 0.0002594967868829601, 'samples': 14248512, 'steps': 74210, 'loss/train': 0.9608275890350342} 11/07/2021 07:38:31 - INFO - __main__ - Step 74212: {'lr': 0.0002594914839667341, 'samples': 14248704, 'steps': 74211, 'loss/train': 1.0453449487686157} 11/07/2021 07:38:32 - INFO - __main__ - Step 74213: {'lr': 0.00025948618104623125, 'samples': 14248896, 'steps': 74212, 'loss/train': 1.315184235572815} 11/07/2021 07:38:32 - INFO - __main__ - Step 74214: {'lr': 0.0002594808781214541, 'samples': 14249088, 'steps': 74213, 'loss/train': 1.5562193393707275} 11/07/2021 07:38:33 - INFO - __main__ - Step 74215: {'lr': 0.00025947557519240505, 'samples': 14249280, 'steps': 74214, 'loss/train': 1.5222798585891724} 11/07/2021 07:38:33 - INFO - __main__ - Step 74216: {'lr': 0.0002594702722590864, 'samples': 14249472, 'steps': 74215, 'loss/train': 1.8133201599121094} 11/07/2021 07:38:33 - INFO - __main__ - Step 74217: {'lr': 0.0002594649693215007, 'samples': 14249664, 'steps': 74216, 'loss/train': 0.9071162343025208} 11/07/2021 07:38:34 - INFO - __main__ - Step 74218: {'lr': 0.00025945966637965016, 'samples': 14249856, 'steps': 74217, 'loss/train': 0.11501283943653107} 11/07/2021 07:38:35 - INFO - __main__ - Step 74219: {'lr': 0.0002594543634335373, 'samples': 14250048, 'steps': 74218, 'loss/train': 1.339605689048767} 11/07/2021 07:38:35 - INFO - __main__ - Step 74220: {'lr': 0.00025944906048316435, 'samples': 14250240, 'steps': 74219, 'loss/train': 1.9004993438720703} 11/07/2021 07:38:36 - INFO - __main__ - Step 74221: {'lr': 0.00025944375752853387, 'samples': 14250432, 'steps': 74220, 'loss/train': 1.5242302417755127} 11/07/2021 07:38:36 - INFO - __main__ - Step 74222: {'lr': 0.00025943845456964816, 'samples': 14250624, 'steps': 74221, 'loss/train': 1.0706589221954346} 11/07/2021 07:38:37 - INFO - __main__ - Step 74223: {'lr': 0.0002594331516065097, 'samples': 14250816, 'steps': 74222, 'loss/train': 1.1589405536651611} 11/07/2021 07:38:37 - INFO - __main__ - Step 74224: {'lr': 0.00025942784863912074, 'samples': 14251008, 'steps': 74223, 'loss/train': 1.0941977500915527} 11/07/2021 07:38:38 - INFO - __main__ - Step 74225: {'lr': 0.0002594225456674837, 'samples': 14251200, 'steps': 74224, 'loss/train': 1.2613605260849} 11/07/2021 07:38:38 - INFO - __main__ - Step 74226: {'lr': 0.000259417242691601, 'samples': 14251392, 'steps': 74225, 'loss/train': 1.2695506811141968} 11/07/2021 07:38:38 - INFO - __main__ - Step 74227: {'lr': 0.0002594119397114751, 'samples': 14251584, 'steps': 74226, 'loss/train': 1.1565624475479126} 11/07/2021 07:38:39 - INFO - __main__ - Step 74228: {'lr': 0.00025940663672710827, 'samples': 14251776, 'steps': 74227, 'loss/train': 1.2848265171051025} 11/07/2021 07:38:40 - INFO - __main__ - Step 74229: {'lr': 0.000259401333738503, 'samples': 14251968, 'steps': 74228, 'loss/train': 1.3072781562805176} 11/07/2021 07:38:40 - INFO - __main__ - Step 74230: {'lr': 0.00025939603074566167, 'samples': 14252160, 'steps': 74229, 'loss/train': 1.0697988271713257} 11/07/2021 07:38:40 - INFO - __main__ - Step 74231: {'lr': 0.0002593907277485865, 'samples': 14252352, 'steps': 74230, 'loss/train': 1.4326177835464478} 11/07/2021 07:38:41 - INFO - __main__ - Step 74232: {'lr': 0.0002593854247472801, 'samples': 14252544, 'steps': 74231, 'loss/train': 1.481417179107666} 11/07/2021 07:38:42 - INFO - __main__ - Step 74233: {'lr': 0.0002593801217417448, 'samples': 14252736, 'steps': 74232, 'loss/train': 1.39808189868927} 11/07/2021 07:38:42 - INFO - __main__ - Step 74234: {'lr': 0.0002593748187319829, 'samples': 14252928, 'steps': 74233, 'loss/train': 1.412834882736206} 11/07/2021 07:38:43 - INFO - __main__ - Step 74235: {'lr': 0.00025936951571799686, 'samples': 14253120, 'steps': 74234, 'loss/train': 1.3063348531723022} 11/07/2021 07:38:43 - INFO - __main__ - Step 74236: {'lr': 0.0002593642126997891, 'samples': 14253312, 'steps': 74235, 'loss/train': 1.4275434017181396} 11/07/2021 07:38:43 - INFO - __main__ - Step 74237: {'lr': 0.000259358909677362, 'samples': 14253504, 'steps': 74236, 'loss/train': 1.1312474012374878} 11/07/2021 07:38:44 - INFO - __main__ - Step 74238: {'lr': 0.00025935360665071787, 'samples': 14253696, 'steps': 74237, 'loss/train': 1.6058800220489502} 11/07/2021 07:38:45 - INFO - __main__ - Step 74239: {'lr': 0.00025934830361985914, 'samples': 14253888, 'steps': 74238, 'loss/train': 2.115919351577759} 11/07/2021 07:38:45 - INFO - __main__ - Step 74240: {'lr': 0.0002593430005847882, 'samples': 14254080, 'steps': 74239, 'loss/train': 1.221165418624878} 11/07/2021 07:38:45 - INFO - __main__ - Step 74241: {'lr': 0.00025933769754550747, 'samples': 14254272, 'steps': 74240, 'loss/train': 1.8104760646820068} 11/07/2021 07:38:46 - INFO - __main__ - Step 74242: {'lr': 0.0002593323945020193, 'samples': 14254464, 'steps': 74241, 'loss/train': 1.4570326805114746} 11/07/2021 07:38:46 - INFO - __main__ - Step 74243: {'lr': 0.0002593270914543261, 'samples': 14254656, 'steps': 74242, 'loss/train': 1.477818489074707} 11/07/2021 07:38:47 - INFO - __main__ - Step 74244: {'lr': 0.00025932178840243033, 'samples': 14254848, 'steps': 74243, 'loss/train': 1.3918120861053467} 11/07/2021 07:38:47 - INFO - __main__ - Step 74245: {'lr': 0.00025931648534633424, 'samples': 14255040, 'steps': 74244, 'loss/train': 1.1776624917984009} 11/07/2021 07:38:48 - INFO - __main__ - Step 74246: {'lr': 0.0002593111822860403, 'samples': 14255232, 'steps': 74245, 'loss/train': 1.5860291719436646} 11/07/2021 07:38:48 - INFO - __main__ - Step 74247: {'lr': 0.00025930587922155086, 'samples': 14255424, 'steps': 74246, 'loss/train': 1.1968114376068115} 11/07/2021 07:38:48 - INFO - __main__ - Step 74248: {'lr': 0.0002593005761528683, 'samples': 14255616, 'steps': 74247, 'loss/train': 1.4957880973815918} 11/07/2021 07:38:50 - INFO - __main__ - Step 74249: {'lr': 0.00025929527307999513, 'samples': 14255808, 'steps': 74248, 'loss/train': 1.315032720565796} 11/07/2021 07:38:50 - INFO - __main__ - Step 74250: {'lr': 0.00025928997000293367, 'samples': 14256000, 'steps': 74249, 'loss/train': 1.251525640487671} 11/07/2021 07:38:50 - INFO - __main__ - Step 74251: {'lr': 0.00025928466692168615, 'samples': 14256192, 'steps': 74250, 'loss/train': 1.5238569974899292} 11/07/2021 07:38:51 - INFO - __main__ - Step 74252: {'lr': 0.00025927936383625524, 'samples': 14256384, 'steps': 74251, 'loss/train': 1.4057674407958984} 11/07/2021 07:38:51 - INFO - __main__ - Step 74253: {'lr': 0.0002592740607466431, 'samples': 14256576, 'steps': 74252, 'loss/train': 1.011864185333252} 11/07/2021 07:38:51 - INFO - __main__ - Step 74254: {'lr': 0.0002592687576528523, 'samples': 14256768, 'steps': 74253, 'loss/train': 1.1723570823669434} 11/07/2021 07:38:52 - INFO - __main__ - Step 74255: {'lr': 0.0002592634545548851, 'samples': 14256960, 'steps': 74254, 'loss/train': 1.5788244009017944} 11/07/2021 07:38:53 - INFO - __main__ - Step 74256: {'lr': 0.0002592581514527439, 'samples': 14257152, 'steps': 74255, 'loss/train': 1.3245863914489746} 11/07/2021 07:38:53 - INFO - __main__ - Step 74257: {'lr': 0.0002592528483464312, 'samples': 14257344, 'steps': 74256, 'loss/train': 1.0444680452346802} 11/07/2021 07:38:53 - INFO - __main__ - Step 74258: {'lr': 0.0002592475452359492, 'samples': 14257536, 'steps': 74257, 'loss/train': 1.2150628566741943} 11/07/2021 07:38:54 - INFO - __main__ - Step 74259: {'lr': 0.00025924224212130046, 'samples': 14257728, 'steps': 74258, 'loss/train': 1.089678406715393} 11/07/2021 07:38:55 - INFO - __main__ - Step 74260: {'lr': 0.0002592369390024873, 'samples': 14257920, 'steps': 74259, 'loss/train': 0.8783850073814392} 11/07/2021 07:38:55 - INFO - __main__ - Step 74261: {'lr': 0.0002592316358795121, 'samples': 14258112, 'steps': 74260, 'loss/train': 2.11264967918396} 11/07/2021 07:38:55 - INFO - __main__ - Step 74262: {'lr': 0.0002592263327523773, 'samples': 14258304, 'steps': 74261, 'loss/train': 0.98679518699646} 11/07/2021 07:38:56 - INFO - __main__ - Step 74263: {'lr': 0.0002592210296210852, 'samples': 14258496, 'steps': 74262, 'loss/train': 1.4478988647460938} 11/07/2021 07:38:56 - INFO - __main__ - Step 74264: {'lr': 0.00025921572648563833, 'samples': 14258688, 'steps': 74263, 'loss/train': 0.12421542406082153} 11/07/2021 07:38:57 - INFO - __main__ - Step 74265: {'lr': 0.000259210423346039, 'samples': 14258880, 'steps': 74264, 'loss/train': 1.1654797792434692} 11/07/2021 07:38:58 - INFO - __main__ - Step 74266: {'lr': 0.0002592051202022895, 'samples': 14259072, 'steps': 74265, 'loss/train': 1.6588045358657837} 11/07/2021 07:38:58 - INFO - __main__ - Step 74267: {'lr': 0.0002591998170543924, 'samples': 14259264, 'steps': 74266, 'loss/train': 0.5434480905532837} 11/07/2021 07:38:58 - INFO - __main__ - Step 74268: {'lr': 0.00025919451390234995, 'samples': 14259456, 'steps': 74267, 'loss/train': 1.5551049709320068} 11/07/2021 07:38:59 - INFO - __main__ - Step 74269: {'lr': 0.00025918921074616466, 'samples': 14259648, 'steps': 74268, 'loss/train': 1.4567458629608154} 11/07/2021 07:39:00 - INFO - __main__ - Step 74270: {'lr': 0.0002591839075858388, 'samples': 14259840, 'steps': 74269, 'loss/train': 1.275612473487854} 11/07/2021 07:39:00 - INFO - __main__ - Step 74271: {'lr': 0.0002591786044213748, 'samples': 14260032, 'steps': 74270, 'loss/train': 1.3094123601913452} 11/07/2021 07:39:00 - INFO - __main__ - Step 74272: {'lr': 0.00025917330125277513, 'samples': 14260224, 'steps': 74271, 'loss/train': 0.9232556223869324} 11/07/2021 07:39:01 - INFO - __main__ - Step 74273: {'lr': 0.00025916799808004204, 'samples': 14260416, 'steps': 74272, 'loss/train': 1.4283397197723389} 11/07/2021 07:39:01 - INFO - __main__ - Step 74274: {'lr': 0.00025916269490317803, 'samples': 14260608, 'steps': 74273, 'loss/train': 1.2672349214553833} 11/07/2021 07:39:02 - INFO - __main__ - Step 74275: {'lr': 0.0002591573917221854, 'samples': 14260800, 'steps': 74274, 'loss/train': 1.6838414669036865} 11/07/2021 07:39:03 - INFO - __main__ - Step 74276: {'lr': 0.00025915208853706664, 'samples': 14260992, 'steps': 74275, 'loss/train': 1.4256975650787354} 11/07/2021 07:39:03 - INFO - __main__ - Step 74277: {'lr': 0.0002591467853478241, 'samples': 14261184, 'steps': 74276, 'loss/train': 1.351181149482727} 11/07/2021 07:39:03 - INFO - __main__ - Step 74278: {'lr': 0.00025914148215446013, 'samples': 14261376, 'steps': 74277, 'loss/train': 1.735129475593567} 11/07/2021 07:39:04 - INFO - __main__ - Step 74279: {'lr': 0.00025913617895697715, 'samples': 14261568, 'steps': 74278, 'loss/train': 1.3158632516860962} 11/07/2021 07:39:04 - INFO - __main__ - Step 74280: {'lr': 0.00025913087575537755, 'samples': 14261760, 'steps': 74279, 'loss/train': 1.5731984376907349} 11/07/2021 07:39:05 - INFO - __main__ - Step 74281: {'lr': 0.00025912557254966374, 'samples': 14261952, 'steps': 74280, 'loss/train': 1.4065569639205933} 11/07/2021 07:39:05 - INFO - __main__ - Step 74282: {'lr': 0.0002591202693398381, 'samples': 14262144, 'steps': 74281, 'loss/train': 0.6793238520622253} 11/07/2021 07:39:06 - INFO - __main__ - Step 74283: {'lr': 0.0002591149661259029, 'samples': 14262336, 'steps': 74282, 'loss/train': 1.1024810075759888} 11/07/2021 07:39:06 - INFO - __main__ - Step 74284: {'lr': 0.0002591096629078608, 'samples': 14262528, 'steps': 74283, 'loss/train': 1.4745638370513916} 11/07/2021 07:39:06 - INFO - __main__ - Step 74285: {'lr': 0.00025910435968571396, 'samples': 14262720, 'steps': 74284, 'loss/train': 1.4851019382476807} 11/07/2021 07:39:07 - INFO - __main__ - Step 74286: {'lr': 0.0002590990564594648, 'samples': 14262912, 'steps': 74285, 'loss/train': 1.4173097610473633} 11/07/2021 07:39:08 - INFO - __main__ - Step 74287: {'lr': 0.0002590937532291157, 'samples': 14263104, 'steps': 74286, 'loss/train': 1.3466882705688477} 11/07/2021 07:39:08 - INFO - __main__ - Step 74288: {'lr': 0.00025908844999466917, 'samples': 14263296, 'steps': 74287, 'loss/train': 1.4704294204711914} 11/07/2021 07:39:09 - INFO - __main__ - Step 74289: {'lr': 0.00025908314675612755, 'samples': 14263488, 'steps': 74288, 'loss/train': 1.5893806219100952} 11/07/2021 07:39:09 - INFO - __main__ - Step 74290: {'lr': 0.00025907784351349313, 'samples': 14263680, 'steps': 74289, 'loss/train': 1.3345696926116943} 11/07/2021 07:39:10 - INFO - __main__ - Step 74291: {'lr': 0.00025907254026676845, 'samples': 14263872, 'steps': 74290, 'loss/train': 1.4758408069610596} 11/07/2021 07:39:10 - INFO - __main__ - Step 74292: {'lr': 0.0002590672370159558, 'samples': 14264064, 'steps': 74291, 'loss/train': 1.614288091659546} 11/07/2021 07:39:11 - INFO - __main__ - Step 74293: {'lr': 0.00025906193376105756, 'samples': 14264256, 'steps': 74292, 'loss/train': 1.3349756002426147} 11/07/2021 07:39:11 - INFO - __main__ - Step 74294: {'lr': 0.0002590566305020762, 'samples': 14264448, 'steps': 74293, 'loss/train': 1.4418445825576782} 11/07/2021 07:39:11 - INFO - __main__ - Step 74295: {'lr': 0.000259051327239014, 'samples': 14264640, 'steps': 74294, 'loss/train': 1.4722546339035034} 11/07/2021 07:39:12 - INFO - __main__ - Step 74296: {'lr': 0.00025904602397187345, 'samples': 14264832, 'steps': 74295, 'loss/train': 1.3508459329605103} 11/07/2021 07:39:13 - INFO - __main__ - Step 74297: {'lr': 0.0002590407207006569, 'samples': 14265024, 'steps': 74296, 'loss/train': 1.0974445343017578} 11/07/2021 07:39:13 - INFO - __main__ - Step 74298: {'lr': 0.00025903541742536675, 'samples': 14265216, 'steps': 74297, 'loss/train': 1.741417407989502} 11/07/2021 07:39:13 - INFO - __main__ - Step 74299: {'lr': 0.00025903011414600536, 'samples': 14265408, 'steps': 74298, 'loss/train': 1.2902774810791016} 11/07/2021 07:39:14 - INFO - __main__ - Step 74300: {'lr': 0.0002590248108625751, 'samples': 14265600, 'steps': 74299, 'loss/train': 0.9624218940734863} 11/07/2021 07:39:15 - INFO - __main__ - Step 74301: {'lr': 0.00025901950757507847, 'samples': 14265792, 'steps': 74300, 'loss/train': 1.3067641258239746} 11/07/2021 07:39:15 - INFO - __main__ - Step 74302: {'lr': 0.0002590142042835178, 'samples': 14265984, 'steps': 74301, 'loss/train': 2.7615857124328613} 11/07/2021 07:39:15 - INFO - __main__ - Step 74303: {'lr': 0.00025900890098789543, 'samples': 14266176, 'steps': 74302, 'loss/train': 1.9416279792785645} 11/07/2021 07:39:16 - INFO - __main__ - Step 74304: {'lr': 0.0002590035976882138, 'samples': 14266368, 'steps': 74303, 'loss/train': 1.3563358783721924} 11/07/2021 07:39:16 - INFO - __main__ - Step 74305: {'lr': 0.0002589982943844753, 'samples': 14266560, 'steps': 74304, 'loss/train': 1.359513282775879} 11/07/2021 07:39:17 - INFO - __main__ - Step 74306: {'lr': 0.0002589929910766823, 'samples': 14266752, 'steps': 74305, 'loss/train': 1.13882315158844} 11/07/2021 07:39:17 - INFO - __main__ - Step 74307: {'lr': 0.0002589876877648372, 'samples': 14266944, 'steps': 74306, 'loss/train': 0.9529486298561096} 11/07/2021 07:39:18 - INFO - __main__ - Step 74308: {'lr': 0.0002589823844489423, 'samples': 14267136, 'steps': 74307, 'loss/train': 1.7545114755630493} 11/07/2021 07:39:18 - INFO - __main__ - Step 74309: {'lr': 0.00025897708112900014, 'samples': 14267328, 'steps': 74308, 'loss/train': 1.3347012996673584} 11/07/2021 07:39:18 - INFO - __main__ - Step 74310: {'lr': 0.0002589717778050131, 'samples': 14267520, 'steps': 74309, 'loss/train': 1.2673076391220093} 11/07/2021 07:39:19 - INFO - __main__ - Step 74311: {'lr': 0.00025896647447698343, 'samples': 14267712, 'steps': 74310, 'loss/train': 1.551687479019165} 11/07/2021 07:39:20 - INFO - __main__ - Step 74312: {'lr': 0.0002589611711449137, 'samples': 14267904, 'steps': 74311, 'loss/train': 1.2641416788101196} 11/07/2021 07:39:20 - INFO - __main__ - Step 74313: {'lr': 0.0002589558678088061, 'samples': 14268096, 'steps': 74312, 'loss/train': 1.3390593528747559} 11/07/2021 07:39:20 - INFO - __main__ - Step 74314: {'lr': 0.00025895056446866314, 'samples': 14268288, 'steps': 74313, 'loss/train': 1.7267069816589355} 11/07/2021 07:39:21 - INFO - __main__ - Step 74315: {'lr': 0.0002589452611244872, 'samples': 14268480, 'steps': 74314, 'loss/train': 1.2781428098678589} 11/07/2021 07:39:22 - INFO - __main__ - Step 74316: {'lr': 0.00025893995777628083, 'samples': 14268672, 'steps': 74315, 'loss/train': 1.5139843225479126} 11/07/2021 07:39:22 - INFO - __main__ - Step 74317: {'lr': 0.0002589346544240461, 'samples': 14268864, 'steps': 74316, 'loss/train': 1.451177954673767} 11/07/2021 07:39:23 - INFO - __main__ - Step 74318: {'lr': 0.00025892935106778555, 'samples': 14269056, 'steps': 74317, 'loss/train': 1.8109080791473389} 11/07/2021 07:39:23 - INFO - __main__ - Step 74319: {'lr': 0.0002589240477075015, 'samples': 14269248, 'steps': 74318, 'loss/train': 0.6715771555900574} 11/07/2021 07:39:23 - INFO - __main__ - Step 74320: {'lr': 0.0002589187443431966, 'samples': 14269440, 'steps': 74319, 'loss/train': 1.6650458574295044} 11/07/2021 07:39:24 - INFO - __main__ - Step 74321: {'lr': 0.00025891344097487293, 'samples': 14269632, 'steps': 74320, 'loss/train': 1.4678661823272705} 11/07/2021 07:39:25 - INFO - __main__ - Step 74322: {'lr': 0.000258908137602533, 'samples': 14269824, 'steps': 74321, 'loss/train': 1.3598347902297974} 11/07/2021 07:39:25 - INFO - __main__ - Step 74323: {'lr': 0.0002589028342261793, 'samples': 14270016, 'steps': 74322, 'loss/train': 0.8696814179420471} 11/07/2021 07:39:25 - INFO - __main__ - Step 74324: {'lr': 0.000258897530845814, 'samples': 14270208, 'steps': 74323, 'loss/train': 1.5566436052322388} 11/07/2021 07:39:26 - INFO - __main__ - Step 74325: {'lr': 0.00025889222746143964, 'samples': 14270400, 'steps': 74324, 'loss/train': 1.4890215396881104} 11/07/2021 07:39:26 - INFO - __main__ - Step 74326: {'lr': 0.0002588869240730586, 'samples': 14270592, 'steps': 74325, 'loss/train': 1.3250806331634521} 11/07/2021 07:39:27 - INFO - __main__ - Step 74327: {'lr': 0.0002588816206806733, 'samples': 14270784, 'steps': 74326, 'loss/train': 1.4886579513549805} 11/07/2021 07:39:28 - INFO - __main__ - Step 74328: {'lr': 0.000258876317284286, 'samples': 14270976, 'steps': 74327, 'loss/train': 1.0517433881759644} 11/07/2021 07:39:28 - INFO - __main__ - Step 74329: {'lr': 0.00025887101388389917, 'samples': 14271168, 'steps': 74328, 'loss/train': 1.6481355428695679} 11/07/2021 07:39:28 - INFO - __main__ - Step 74330: {'lr': 0.00025886571047951517, 'samples': 14271360, 'steps': 74329, 'loss/train': 1.422075629234314} 11/07/2021 07:39:29 - INFO - __main__ - Step 74331: {'lr': 0.0002588604070711365, 'samples': 14271552, 'steps': 74330, 'loss/train': 0.9377438426017761} 11/07/2021 07:39:30 - INFO - __main__ - Step 74332: {'lr': 0.00025885510365876544, 'samples': 14271744, 'steps': 74331, 'loss/train': 1.117836356163025} 11/07/2021 07:39:30 - INFO - __main__ - Step 74333: {'lr': 0.0002588498002424044, 'samples': 14271936, 'steps': 74332, 'loss/train': 1.3403733968734741} 11/07/2021 07:39:30 - INFO - __main__ - Step 74334: {'lr': 0.0002588444968220558, 'samples': 14272128, 'steps': 74333, 'loss/train': 1.2824573516845703} 11/07/2021 07:39:31 - INFO - __main__ - Step 74335: {'lr': 0.00025883919339772196, 'samples': 14272320, 'steps': 74334, 'loss/train': 1.3094390630722046} 11/07/2021 07:39:31 - INFO - __main__ - Step 74336: {'lr': 0.00025883388996940533, 'samples': 14272512, 'steps': 74335, 'loss/train': 1.8424440622329712} 11/07/2021 07:39:31 - INFO - __main__ - Step 74337: {'lr': 0.0002588285865371083, 'samples': 14272704, 'steps': 74336, 'loss/train': 1.4216434955596924} 11/07/2021 07:39:32 - INFO - __main__ - Step 74338: {'lr': 0.00025882328310083323, 'samples': 14272896, 'steps': 74337, 'loss/train': 1.2544422149658203} 11/07/2021 07:39:33 - INFO - __main__ - Step 74339: {'lr': 0.0002588179796605826, 'samples': 14273088, 'steps': 74338, 'loss/train': 1.566950798034668} 11/07/2021 07:39:33 - INFO - __main__ - Step 74340: {'lr': 0.0002588126762163586, 'samples': 14273280, 'steps': 74339, 'loss/train': 2.169454574584961} 11/07/2021 07:39:33 - INFO - __main__ - Step 74341: {'lr': 0.0002588073727681638, 'samples': 14273472, 'steps': 74340, 'loss/train': 1.0948688983917236} 11/07/2021 07:39:34 - INFO - __main__ - Step 74342: {'lr': 0.0002588020693160005, 'samples': 14273664, 'steps': 74341, 'loss/train': 0.820020854473114} 11/07/2021 07:39:35 - INFO - __main__ - Step 74343: {'lr': 0.0002587967658598712, 'samples': 14273856, 'steps': 74342, 'loss/train': 1.024787425994873} 11/07/2021 07:39:35 - INFO - __main__ - Step 74344: {'lr': 0.0002587914623997782, 'samples': 14274048, 'steps': 74343, 'loss/train': 1.384168028831482} 11/07/2021 07:39:36 - INFO - __main__ - Step 74345: {'lr': 0.0002587861589357239, 'samples': 14274240, 'steps': 74344, 'loss/train': 1.9093822240829468} 11/07/2021 07:39:36 - INFO - __main__ - Step 74346: {'lr': 0.0002587808554677106, 'samples': 14274432, 'steps': 74345, 'loss/train': 0.9345235228538513} 11/07/2021 07:39:36 - INFO - __main__ - Step 74347: {'lr': 0.0002587755519957409, 'samples': 14274624, 'steps': 74346, 'loss/train': 1.564406394958496} 11/07/2021 07:39:37 - INFO - __main__ - Step 74348: {'lr': 0.00025877024851981694, 'samples': 14274816, 'steps': 74347, 'loss/train': 1.4301581382751465} 11/07/2021 07:39:38 - INFO - __main__ - Step 74349: {'lr': 0.00025876494503994135, 'samples': 14275008, 'steps': 74348, 'loss/train': 1.5004193782806396} 11/07/2021 07:39:38 - INFO - __main__ - Step 74350: {'lr': 0.00025875964155611634, 'samples': 14275200, 'steps': 74349, 'loss/train': 1.7115594148635864} 11/07/2021 07:39:38 - INFO - __main__ - Step 74351: {'lr': 0.00025875433806834446, 'samples': 14275392, 'steps': 74350, 'loss/train': 1.7385780811309814} 11/07/2021 07:39:39 - INFO - __main__ - Step 74352: {'lr': 0.00025874903457662803, 'samples': 14275584, 'steps': 74351, 'loss/train': 1.2499752044677734} 11/07/2021 07:39:40 - INFO - __main__ - Step 74353: {'lr': 0.00025874373108096934, 'samples': 14275776, 'steps': 74352, 'loss/train': 1.6803513765335083} 11/07/2021 07:39:40 - INFO - __main__ - Step 74354: {'lr': 0.00025873842758137087, 'samples': 14275968, 'steps': 74353, 'loss/train': 2.1240243911743164} 11/07/2021 07:39:40 - INFO - __main__ - Step 74355: {'lr': 0.00025873312407783495, 'samples': 14276160, 'steps': 74354, 'loss/train': 1.0731436014175415} 11/07/2021 07:39:41 - INFO - __main__ - Step 74356: {'lr': 0.0002587278205703641, 'samples': 14276352, 'steps': 74355, 'loss/train': 1.55015230178833} 11/07/2021 07:39:41 - INFO - __main__ - Step 74357: {'lr': 0.00025872251705896056, 'samples': 14276544, 'steps': 74356, 'loss/train': 1.1000670194625854} 11/07/2021 07:39:42 - INFO - __main__ - Step 74358: {'lr': 0.0002587172135436269, 'samples': 14276736, 'steps': 74357, 'loss/train': 1.4462724924087524} 11/07/2021 07:39:43 - INFO - __main__ - Step 74359: {'lr': 0.0002587119100243653, 'samples': 14276928, 'steps': 74358, 'loss/train': 1.1926589012145996} 11/07/2021 07:39:43 - INFO - __main__ - Step 74360: {'lr': 0.00025870660650117826, 'samples': 14277120, 'steps': 74359, 'loss/train': 1.4103094339370728} 11/07/2021 07:39:43 - INFO - __main__ - Step 74361: {'lr': 0.0002587013029740682, 'samples': 14277312, 'steps': 74360, 'loss/train': 1.2521964311599731} 11/07/2021 07:39:44 - INFO - __main__ - Step 74362: {'lr': 0.0002586959994430374, 'samples': 14277504, 'steps': 74361, 'loss/train': 1.2962908744812012} 11/07/2021 07:39:44 - INFO - __main__ - Step 74363: {'lr': 0.0002586906959080884, 'samples': 14277696, 'steps': 74362, 'loss/train': 1.4653390645980835} 11/07/2021 07:39:45 - INFO - __main__ - Step 74364: {'lr': 0.0002586853923692234, 'samples': 14277888, 'steps': 74363, 'loss/train': 1.5163931846618652} 11/07/2021 07:39:45 - INFO - __main__ - Step 74365: {'lr': 0.000258680088826445, 'samples': 14278080, 'steps': 74364, 'loss/train': 1.109358549118042} 11/07/2021 07:39:46 - INFO - __main__ - Step 74366: {'lr': 0.00025867478527975547, 'samples': 14278272, 'steps': 74365, 'loss/train': 1.7359944581985474} 11/07/2021 07:39:46 - INFO - __main__ - Step 74367: {'lr': 0.00025866948172915716, 'samples': 14278464, 'steps': 74366, 'loss/train': 1.5174002647399902} 11/07/2021 07:39:46 - INFO - __main__ - Step 74368: {'lr': 0.0002586641781746525, 'samples': 14278656, 'steps': 74367, 'loss/train': 1.9331029653549194} 11/07/2021 07:39:47 - INFO - __main__ - Step 74369: {'lr': 0.000258658874616244, 'samples': 14278848, 'steps': 74368, 'loss/train': 1.4188629388809204} 11/07/2021 07:39:48 - INFO - __main__ - Step 74370: {'lr': 0.0002586535710539338, 'samples': 14279040, 'steps': 74369, 'loss/train': 1.3646917343139648} 11/07/2021 07:39:48 - INFO - __main__ - Step 74371: {'lr': 0.0002586482674877246, 'samples': 14279232, 'steps': 74370, 'loss/train': 1.4447582960128784} 11/07/2021 07:39:49 - INFO - __main__ - Step 74372: {'lr': 0.00025864296391761853, 'samples': 14279424, 'steps': 74371, 'loss/train': 1.831477403640747} 11/07/2021 07:39:49 - INFO - __main__ - Step 74373: {'lr': 0.00025863766034361815, 'samples': 14279616, 'steps': 74372, 'loss/train': 1.3696845769882202} 11/07/2021 07:39:50 - INFO - __main__ - Step 74374: {'lr': 0.00025863235676572565, 'samples': 14279808, 'steps': 74373, 'loss/train': 1.3703426122665405} 11/07/2021 07:39:50 - INFO - __main__ - Step 74375: {'lr': 0.00025862705318394357, 'samples': 14280000, 'steps': 74374, 'loss/train': 1.104081392288208} 11/07/2021 07:39:51 - INFO - __main__ - Step 74376: {'lr': 0.00025862174959827435, 'samples': 14280192, 'steps': 74375, 'loss/train': 1.4245795011520386} 11/07/2021 07:39:51 - INFO - __main__ - Step 74377: {'lr': 0.0002586164460087203, 'samples': 14280384, 'steps': 74376, 'loss/train': 1.6824181079864502} 11/07/2021 07:39:51 - INFO - __main__ - Step 74378: {'lr': 0.0002586111424152838, 'samples': 14280576, 'steps': 74377, 'loss/train': 1.351428508758545} 11/07/2021 07:39:52 - INFO - __main__ - Step 74379: {'lr': 0.0002586058388179672, 'samples': 14280768, 'steps': 74378, 'loss/train': 0.864020824432373} 11/07/2021 07:39:53 - INFO - __main__ - Step 74380: {'lr': 0.00025860053521677297, 'samples': 14280960, 'steps': 74379, 'loss/train': 0.861754298210144} 11/07/2021 07:39:53 - INFO - __main__ - Step 74381: {'lr': 0.0002585952316117034, 'samples': 14281152, 'steps': 74380, 'loss/train': 1.3817157745361328} 11/07/2021 07:39:53 - INFO - __main__ - Step 74382: {'lr': 0.00025858992800276105, 'samples': 14281344, 'steps': 74381, 'loss/train': 3.349968671798706} 11/07/2021 07:39:54 - INFO - __main__ - Step 74383: {'lr': 0.0002585846243899482, 'samples': 14281536, 'steps': 74382, 'loss/train': 1.5711051225662231} 11/07/2021 07:39:55 - INFO - __main__ - Step 74384: {'lr': 0.00025857932077326715, 'samples': 14281728, 'steps': 74383, 'loss/train': 1.352344274520874} 11/07/2021 07:39:55 - INFO - __main__ - Step 74385: {'lr': 0.00025857401715272056, 'samples': 14281920, 'steps': 74384, 'loss/train': 0.9897006750106812} 11/07/2021 07:39:55 - INFO - __main__ - Step 74386: {'lr': 0.0002585687135283106, 'samples': 14282112, 'steps': 74385, 'loss/train': 1.5643647909164429} 11/07/2021 07:39:56 - INFO - __main__ - Step 74387: {'lr': 0.00025856340990003965, 'samples': 14282304, 'steps': 74386, 'loss/train': 1.8805630207061768} 11/07/2021 07:39:56 - INFO - __main__ - Step 74388: {'lr': 0.00025855810626791015, 'samples': 14282496, 'steps': 74387, 'loss/train': 1.015275239944458} 11/07/2021 07:39:57 - INFO - __main__ - Step 74389: {'lr': 0.00025855280263192447, 'samples': 14282688, 'steps': 74388, 'loss/train': 1.3336232900619507} 11/07/2021 07:39:58 - INFO - __main__ - Step 74390: {'lr': 0.00025854749899208515, 'samples': 14282880, 'steps': 74389, 'loss/train': 1.2031331062316895} 11/07/2021 07:39:58 - INFO - __main__ - Step 74391: {'lr': 0.0002585421953483944, 'samples': 14283072, 'steps': 74390, 'loss/train': 1.0750553607940674} 11/07/2021 07:39:58 - INFO - __main__ - Step 74392: {'lr': 0.00025853689170085467, 'samples': 14283264, 'steps': 74391, 'loss/train': 0.6488780379295349} 11/07/2021 07:39:59 - INFO - __main__ - Step 74393: {'lr': 0.0002585315880494684, 'samples': 14283456, 'steps': 74392, 'loss/train': 1.311055302619934} 11/07/2021 07:39:59 - INFO - __main__ - Step 74394: {'lr': 0.0002585262843942378, 'samples': 14283648, 'steps': 74393, 'loss/train': 1.0009983777999878} 11/07/2021 07:40:00 - INFO - __main__ - Step 74395: {'lr': 0.0002585209807351654, 'samples': 14283840, 'steps': 74394, 'loss/train': 1.9943547248840332} 11/07/2021 07:40:00 - INFO - __main__ - Step 74396: {'lr': 0.0002585156770722537, 'samples': 14284032, 'steps': 74395, 'loss/train': 1.8762719631195068} 11/07/2021 07:40:01 - INFO - __main__ - Step 74397: {'lr': 0.00025851037340550486, 'samples': 14284224, 'steps': 74396, 'loss/train': 1.1523233652114868} 11/07/2021 07:40:01 - INFO - __main__ - Step 74398: {'lr': 0.00025850506973492147, 'samples': 14284416, 'steps': 74397, 'loss/train': 1.3325246572494507} 11/07/2021 07:40:01 - INFO - __main__ - Step 74399: {'lr': 0.0002584997660605058, 'samples': 14284608, 'steps': 74398, 'loss/train': 1.175321340560913} 11/07/2021 07:40:02 - INFO - __main__ - Step 74400: {'lr': 0.00025849446238226026, 'samples': 14284800, 'steps': 74399, 'loss/train': 1.9408038854599} 11/07/2021 07:40:03 - INFO - __main__ - Step 74401: {'lr': 0.0002584891587001872, 'samples': 14284992, 'steps': 74400, 'loss/train': 1.521244764328003} 11/07/2021 07:40:03 - INFO - __main__ - Step 74402: {'lr': 0.00025848385501428913, 'samples': 14285184, 'steps': 74401, 'loss/train': 1.4356884956359863} 11/07/2021 07:40:04 - INFO - __main__ - Step 74403: {'lr': 0.0002584785513245683, 'samples': 14285376, 'steps': 74402, 'loss/train': 1.2553315162658691} 11/07/2021 07:40:04 - INFO - __main__ - Step 74404: {'lr': 0.0002584732476310271, 'samples': 14285568, 'steps': 74403, 'loss/train': 1.112333059310913} 11/07/2021 07:40:05 - INFO - __main__ - Step 74405: {'lr': 0.00025846794393366817, 'samples': 14285760, 'steps': 74404, 'loss/train': 1.374592661857605} 11/07/2021 07:40:05 - INFO - __main__ - Step 74406: {'lr': 0.0002584626402324936, 'samples': 14285952, 'steps': 74405, 'loss/train': 1.4813872575759888} 11/07/2021 07:40:06 - INFO - __main__ - Step 74407: {'lr': 0.0002584573365275059, 'samples': 14286144, 'steps': 74406, 'loss/train': 1.0959309339523315} 11/07/2021 07:40:06 - INFO - __main__ - Step 74408: {'lr': 0.0002584520328187075, 'samples': 14286336, 'steps': 74407, 'loss/train': 1.0456645488739014} 11/07/2021 07:40:06 - INFO - __main__ - Step 74409: {'lr': 0.00025844672910610076, 'samples': 14286528, 'steps': 74408, 'loss/train': 0.1745949685573578} 11/07/2021 07:40:08 - INFO - __main__ - Step 74410: {'lr': 0.000258441425389688, 'samples': 14286720, 'steps': 74409, 'loss/train': 1.6146148443222046} 11/07/2021 07:40:08 - INFO - __main__ - Step 74411: {'lr': 0.0002584361216694716, 'samples': 14286912, 'steps': 74410, 'loss/train': 0.7121769785881042} 11/07/2021 07:40:08 - INFO - __main__ - Step 74412: {'lr': 0.00025843081794545413, 'samples': 14287104, 'steps': 74411, 'loss/train': 1.6247999668121338} 11/07/2021 07:40:09 - INFO - __main__ - Step 74413: {'lr': 0.0002584255142176378, 'samples': 14287296, 'steps': 74412, 'loss/train': 1.5549695491790771} 11/07/2021 07:40:09 - INFO - __main__ - Step 74414: {'lr': 0.0002584202104860251, 'samples': 14287488, 'steps': 74413, 'loss/train': 1.7944563627243042} 11/07/2021 07:40:10 - INFO - __main__ - Step 74415: {'lr': 0.0002584149067506183, 'samples': 14287680, 'steps': 74414, 'loss/train': 2.0079474449157715} 11/07/2021 07:40:11 - INFO - __main__ - Step 74416: {'lr': 0.00025840960301142, 'samples': 14287872, 'steps': 74415, 'loss/train': 1.2334401607513428} 11/07/2021 07:40:11 - INFO - __main__ - Step 74417: {'lr': 0.0002584042992684324, 'samples': 14288064, 'steps': 74416, 'loss/train': 1.4273124933242798} 11/07/2021 07:40:11 - INFO - __main__ - Step 74418: {'lr': 0.000258398995521658, 'samples': 14288256, 'steps': 74417, 'loss/train': 0.6553047299385071} 11/07/2021 07:40:12 - INFO - __main__ - Step 74419: {'lr': 0.00025839369177109905, 'samples': 14288448, 'steps': 74418, 'loss/train': 1.270369052886963} 11/07/2021 07:40:13 - INFO - __main__ - Step 74420: {'lr': 0.0002583883880167581, 'samples': 14288640, 'steps': 74419, 'loss/train': 1.3347746133804321} 11/07/2021 07:40:13 - INFO - __main__ - Step 74421: {'lr': 0.00025838308425863744, 'samples': 14288832, 'steps': 74420, 'loss/train': 1.4499366283416748} 11/07/2021 07:40:13 - INFO - __main__ - Step 74422: {'lr': 0.0002583777804967395, 'samples': 14289024, 'steps': 74421, 'loss/train': 1.368275761604309} 11/07/2021 07:40:14 - INFO - __main__ - Step 74423: {'lr': 0.00025837247673106666, 'samples': 14289216, 'steps': 74422, 'loss/train': 1.797688603401184} 11/07/2021 07:40:14 - INFO - __main__ - Step 74424: {'lr': 0.00025836717296162133, 'samples': 14289408, 'steps': 74423, 'loss/train': 1.3924919366836548} 11/07/2021 07:40:14 - INFO - __main__ - Step 74425: {'lr': 0.00025836186918840585, 'samples': 14289600, 'steps': 74424, 'loss/train': 1.373907446861267} 11/07/2021 07:40:15 - INFO - __main__ - Step 74426: {'lr': 0.0002583565654114227, 'samples': 14289792, 'steps': 74425, 'loss/train': 1.3250741958618164} 11/07/2021 07:40:16 - INFO - __main__ - Step 74427: {'lr': 0.00025835126163067414, 'samples': 14289984, 'steps': 74426, 'loss/train': 1.4711334705352783} 11/07/2021 07:40:16 - INFO - __main__ - Step 74428: {'lr': 0.0002583459578461627, 'samples': 14290176, 'steps': 74427, 'loss/train': 1.19041907787323} 11/07/2021 07:40:16 - INFO - __main__ - Step 74429: {'lr': 0.0002583406540578906, 'samples': 14290368, 'steps': 74428, 'loss/train': 1.5522937774658203} 11/07/2021 07:40:17 - INFO - __main__ - Step 74430: {'lr': 0.0002583353502658604, 'samples': 14290560, 'steps': 74429, 'loss/train': 1.4862467050552368} 11/07/2021 07:40:18 - INFO - __main__ - Step 74431: {'lr': 0.0002583300464700744, 'samples': 14290752, 'steps': 74430, 'loss/train': 0.90705806016922} 11/07/2021 07:40:18 - INFO - __main__ - Step 74432: {'lr': 0.0002583247426705351, 'samples': 14290944, 'steps': 74431, 'loss/train': 1.7431365251541138} 11/07/2021 07:40:18 - INFO - __main__ - Step 74433: {'lr': 0.0002583194388672447, 'samples': 14291136, 'steps': 74432, 'loss/train': 1.7873979806900024} 11/07/2021 07:40:19 - INFO - __main__ - Step 74434: {'lr': 0.0002583141350602057, 'samples': 14291328, 'steps': 74433, 'loss/train': 1.6059752702713013} 11/07/2021 07:40:19 - INFO - __main__ - Step 74435: {'lr': 0.00025830883124942043, 'samples': 14291520, 'steps': 74434, 'loss/train': 1.617964267730713} 11/07/2021 07:40:21 - INFO - __main__ - Step 74436: {'lr': 0.00025830352743489137, 'samples': 14291712, 'steps': 74435, 'loss/train': 1.0209699869155884} 11/07/2021 07:40:21 - INFO - __main__ - Step 74437: {'lr': 0.0002582982236166209, 'samples': 14291904, 'steps': 74436, 'loss/train': 0.9868758916854858} 11/07/2021 07:40:21 - INFO - __main__ - Step 74438: {'lr': 0.0002582929197946113, 'samples': 14292096, 'steps': 74437, 'loss/train': 1.7808517217636108} 11/07/2021 07:40:22 - INFO - __main__ - Step 74439: {'lr': 0.00025828761596886516, 'samples': 14292288, 'steps': 74438, 'loss/train': 1.167428731918335} 11/07/2021 07:40:22 - INFO - __main__ - Step 74440: {'lr': 0.0002582823121393847, 'samples': 14292480, 'steps': 74439, 'loss/train': 0.11723041534423828} 11/07/2021 07:40:23 - INFO - __main__ - Step 74441: {'lr': 0.0002582770083061723, 'samples': 14292672, 'steps': 74440, 'loss/train': 1.1018959283828735} 11/07/2021 07:40:23 - INFO - __main__ - Step 74442: {'lr': 0.0002582717044692305, 'samples': 14292864, 'steps': 74441, 'loss/train': 1.1797122955322266} 11/07/2021 07:40:24 - INFO - __main__ - Step 74443: {'lr': 0.00025826640062856157, 'samples': 14293056, 'steps': 74442, 'loss/train': 1.5163490772247314} 11/07/2021 07:40:24 - INFO - __main__ - Step 74444: {'lr': 0.0002582610967841679, 'samples': 14293248, 'steps': 74443, 'loss/train': 1.3154608011245728} 11/07/2021 07:40:24 - INFO - __main__ - Step 74445: {'lr': 0.00025825579293605193, 'samples': 14293440, 'steps': 74444, 'loss/train': 1.3594944477081299} 11/07/2021 07:40:25 - INFO - __main__ - Step 74446: {'lr': 0.000258250489084216, 'samples': 14293632, 'steps': 74445, 'loss/train': 1.3515316247940063} 11/07/2021 07:40:26 - INFO - __main__ - Step 74447: {'lr': 0.00025824518522866253, 'samples': 14293824, 'steps': 74446, 'loss/train': 1.6126699447631836} 11/07/2021 07:40:26 - INFO - __main__ - Step 74448: {'lr': 0.0002582398813693939, 'samples': 14294016, 'steps': 74447, 'loss/train': 1.6008574962615967} 11/07/2021 07:40:26 - INFO - __main__ - Step 74449: {'lr': 0.00025823457750641257, 'samples': 14294208, 'steps': 74448, 'loss/train': 1.7819583415985107} 11/07/2021 07:40:27 - INFO - __main__ - Step 74450: {'lr': 0.00025822927363972076, 'samples': 14294400, 'steps': 74449, 'loss/train': 1.6836923360824585} 11/07/2021 07:40:28 - INFO - __main__ - Step 74451: {'lr': 0.00025822396976932113, 'samples': 14294592, 'steps': 74450, 'loss/train': 1.4867634773254395} 11/07/2021 07:40:28 - INFO - __main__ - Step 74452: {'lr': 0.00025821866589521576, 'samples': 14294784, 'steps': 74451, 'loss/train': 1.5400553941726685} 11/07/2021 07:40:28 - INFO - __main__ - Step 74453: {'lr': 0.0002582133620174072, 'samples': 14294976, 'steps': 74452, 'loss/train': 1.1299397945404053} 11/07/2021 07:40:29 - INFO - __main__ - Step 74454: {'lr': 0.00025820805813589785, 'samples': 14295168, 'steps': 74453, 'loss/train': 0.14320337772369385} 11/07/2021 07:40:29 - INFO - __main__ - Step 74455: {'lr': 0.0002582027542506901, 'samples': 14295360, 'steps': 74454, 'loss/train': 1.5587882995605469} 11/07/2021 07:40:30 - INFO - __main__ - Step 74456: {'lr': 0.0002581974503617863, 'samples': 14295552, 'steps': 74455, 'loss/train': 1.4819003343582153} 11/07/2021 07:40:31 - INFO - __main__ - Step 74457: {'lr': 0.00025819214646918885, 'samples': 14295744, 'steps': 74456, 'loss/train': 1.1479140520095825} 11/07/2021 07:40:31 - INFO - __main__ - Step 74458: {'lr': 0.00025818684257290016, 'samples': 14295936, 'steps': 74457, 'loss/train': 1.4143937826156616} 11/07/2021 07:40:31 - INFO - __main__ - Step 74459: {'lr': 0.0002581815386729226, 'samples': 14296128, 'steps': 74458, 'loss/train': 1.0749080181121826} 11/07/2021 07:40:32 - INFO - __main__ - Step 74460: {'lr': 0.0002581762347692585, 'samples': 14296320, 'steps': 74459, 'loss/train': 1.3188859224319458} 11/07/2021 07:40:32 - INFO - __main__ - Step 74461: {'lr': 0.0002581709308619104, 'samples': 14296512, 'steps': 74460, 'loss/train': 1.2853953838348389} 11/07/2021 07:40:33 - INFO - __main__ - Step 74462: {'lr': 0.00025816562695088057, 'samples': 14296704, 'steps': 74461, 'loss/train': 1.917846918106079} 11/07/2021 07:40:33 - INFO - __main__ - Step 74463: {'lr': 0.0002581603230361715, 'samples': 14296896, 'steps': 74462, 'loss/train': 1.319745659828186} 11/07/2021 07:40:34 - INFO - __main__ - Step 74464: {'lr': 0.00025815501911778546, 'samples': 14297088, 'steps': 74463, 'loss/train': 1.0347234010696411} 11/07/2021 07:40:34 - INFO - __main__ - Step 74465: {'lr': 0.00025814971519572485, 'samples': 14297280, 'steps': 74464, 'loss/train': 1.2105443477630615} 11/07/2021 07:40:34 - INFO - __main__ - Step 74466: {'lr': 0.0002581444112699921, 'samples': 14297472, 'steps': 74465, 'loss/train': 1.0544394254684448} 11/07/2021 07:40:35 - INFO - __main__ - Step 74467: {'lr': 0.0002581391073405897, 'samples': 14297664, 'steps': 74466, 'loss/train': 1.1519378423690796} 11/07/2021 07:40:36 - INFO - __main__ - Step 74468: {'lr': 0.0002581338034075199, 'samples': 14297856, 'steps': 74467, 'loss/train': 1.3207062482833862} 11/07/2021 07:40:36 - INFO - __main__ - Step 74469: {'lr': 0.0002581284994707851, 'samples': 14298048, 'steps': 74468, 'loss/train': 1.2933112382888794} 11/07/2021 07:40:36 - INFO - __main__ - Step 74470: {'lr': 0.00025812319553038775, 'samples': 14298240, 'steps': 74469, 'loss/train': 1.3749457597732544} 11/07/2021 07:40:37 - INFO - __main__ - Step 74471: {'lr': 0.0002581178915863302, 'samples': 14298432, 'steps': 74470, 'loss/train': 1.4375814199447632} 11/07/2021 07:40:38 - INFO - __main__ - Step 74472: {'lr': 0.00025811258763861486, 'samples': 14298624, 'steps': 74471, 'loss/train': 1.451672911643982} 11/07/2021 07:40:38 - INFO - __main__ - Step 74473: {'lr': 0.0002581072836872442, 'samples': 14298816, 'steps': 74472, 'loss/train': 0.9874915480613708} 11/07/2021 07:40:38 - INFO - __main__ - Step 74474: {'lr': 0.0002581019797322204, 'samples': 14299008, 'steps': 74473, 'loss/train': 0.9300159811973572} 11/07/2021 07:40:39 - INFO - __main__ - Step 74475: {'lr': 0.000258096675773546, 'samples': 14299200, 'steps': 74474, 'loss/train': 1.7522085905075073} 11/07/2021 07:40:39 - INFO - __main__ - Step 74476: {'lr': 0.00025809137181122336, 'samples': 14299392, 'steps': 74475, 'loss/train': 1.58255934715271} 11/07/2021 07:40:40 - INFO - __main__ - Step 74477: {'lr': 0.0002580860678452549, 'samples': 14299584, 'steps': 74476, 'loss/train': 1.290584683418274} 11/07/2021 07:40:41 - INFO - __main__ - Step 74478: {'lr': 0.00025808076387564297, 'samples': 14299776, 'steps': 74477, 'loss/train': 1.248029351234436} 11/07/2021 07:40:41 - INFO - __main__ - Step 74479: {'lr': 0.00025807545990239, 'samples': 14299968, 'steps': 74478, 'loss/train': 1.0863878726959229} 11/07/2021 07:40:41 - INFO - __main__ - Step 74480: {'lr': 0.0002580701559254983, 'samples': 14300160, 'steps': 74479, 'loss/train': 1.393072247505188} 11/07/2021 07:40:42 - INFO - __main__ - Step 74481: {'lr': 0.00025806485194497037, 'samples': 14300352, 'steps': 74480, 'loss/train': 1.2893452644348145} 11/07/2021 07:40:43 - INFO - __main__ - Step 74482: {'lr': 0.0002580595479608085, 'samples': 14300544, 'steps': 74481, 'loss/train': 1.423256754875183} 11/07/2021 07:40:43 - INFO - __main__ - Step 74483: {'lr': 0.00025805424397301515, 'samples': 14300736, 'steps': 74482, 'loss/train': 1.638431191444397} 11/07/2021 07:40:43 - INFO - __main__ - Step 74484: {'lr': 0.0002580489399815926, 'samples': 14300928, 'steps': 74483, 'loss/train': 1.3561584949493408} 11/07/2021 07:40:44 - INFO - __main__ - Step 74485: {'lr': 0.0002580436359865434, 'samples': 14301120, 'steps': 74484, 'loss/train': 1.1851006746292114} 11/07/2021 07:40:44 - INFO - __main__ - Step 74486: {'lr': 0.0002580383319878699, 'samples': 14301312, 'steps': 74485, 'loss/train': 1.1207520961761475} 11/07/2021 07:40:45 - INFO - __main__ - Step 74487: {'lr': 0.0002580330279855744, 'samples': 14301504, 'steps': 74486, 'loss/train': 1.2436383962631226} 11/07/2021 07:40:45 - INFO - __main__ - Step 74488: {'lr': 0.0002580277239796593, 'samples': 14301696, 'steps': 74487, 'loss/train': 1.4501054286956787} 11/07/2021 07:40:46 - INFO - __main__ - Step 74489: {'lr': 0.0002580224199701271, 'samples': 14301888, 'steps': 74488, 'loss/train': 1.3420891761779785} 11/07/2021 07:40:46 - INFO - __main__ - Step 74490: {'lr': 0.00025801711595698005, 'samples': 14302080, 'steps': 74489, 'loss/train': 1.11204993724823} 11/07/2021 07:40:46 - INFO - __main__ - Step 74491: {'lr': 0.00025801181194022067, 'samples': 14302272, 'steps': 74490, 'loss/train': 1.4074246883392334} 11/07/2021 07:40:47 - INFO - __main__ - Step 74492: {'lr': 0.00025800650791985133, 'samples': 14302464, 'steps': 74491, 'loss/train': 1.2577608823776245} 11/07/2021 07:40:48 - INFO - __main__ - Step 74493: {'lr': 0.0002580012038958743, 'samples': 14302656, 'steps': 74492, 'loss/train': 1.7827091217041016} 11/07/2021 07:40:48 - INFO - __main__ - Step 74494: {'lr': 0.0002579958998682921, 'samples': 14302848, 'steps': 74493, 'loss/train': 0.9008256196975708} 11/07/2021 07:40:49 - INFO - __main__ - Step 74495: {'lr': 0.000257990595837107, 'samples': 14303040, 'steps': 74494, 'loss/train': 1.4150004386901855} 11/07/2021 07:40:49 - INFO - __main__ - Step 74496: {'lr': 0.0002579852918023215, 'samples': 14303232, 'steps': 74495, 'loss/train': 1.6195881366729736} 11/07/2021 07:40:49 - INFO - __main__ - Step 74497: {'lr': 0.000257979987763938, 'samples': 14303424, 'steps': 74496, 'loss/train': 1.5579848289489746} 11/07/2021 07:40:50 - INFO - __main__ - Step 74498: {'lr': 0.0002579746837219588, 'samples': 14303616, 'steps': 74497, 'loss/train': 1.4286020994186401} 11/07/2021 07:40:51 - INFO - __main__ - Step 74499: {'lr': 0.00025796937967638634, 'samples': 14303808, 'steps': 74498, 'loss/train': 1.2357239723205566} 11/07/2021 07:40:51 - INFO - __main__ - Step 74500: {'lr': 0.00025796407562722303, 'samples': 14304000, 'steps': 74499, 'loss/train': 1.2148611545562744} 11/07/2021 07:40:51 - INFO - __main__ - Step 74501: {'lr': 0.00025795877157447117, 'samples': 14304192, 'steps': 74500, 'loss/train': 1.229591965675354} 11/07/2021 07:40:52 - INFO - __main__ - Step 74502: {'lr': 0.0002579534675181332, 'samples': 14304384, 'steps': 74501, 'loss/train': 1.7464946508407593} 11/07/2021 07:40:53 - INFO - __main__ - Step 74503: {'lr': 0.0002579481634582115, 'samples': 14304576, 'steps': 74502, 'loss/train': 1.649757981300354} 11/07/2021 07:40:53 - INFO - __main__ - Step 74504: {'lr': 0.0002579428593947086, 'samples': 14304768, 'steps': 74503, 'loss/train': 1.6499931812286377} 11/07/2021 07:40:53 - INFO - __main__ - Step 74505: {'lr': 0.0002579375553276267, 'samples': 14304960, 'steps': 74504, 'loss/train': 0.9287551045417786} 11/07/2021 07:40:54 - INFO - __main__ - Step 74506: {'lr': 0.0002579322512569683, 'samples': 14305152, 'steps': 74505, 'loss/train': 1.422741174697876} 11/07/2021 07:40:54 - INFO - __main__ - Step 74507: {'lr': 0.0002579269471827357, 'samples': 14305344, 'steps': 74506, 'loss/train': 1.2555986642837524} 11/07/2021 07:40:55 - INFO - __main__ - Step 74508: {'lr': 0.00025792164310493133, 'samples': 14305536, 'steps': 74507, 'loss/train': 1.6701951026916504} 11/07/2021 07:40:55 - INFO - __main__ - Step 74509: {'lr': 0.0002579163390235576, 'samples': 14305728, 'steps': 74508, 'loss/train': 1.23698890209198} 11/07/2021 07:40:56 - INFO - __main__ - Step 74510: {'lr': 0.0002579110349386169, 'samples': 14305920, 'steps': 74509, 'loss/train': 1.3083527088165283} 11/07/2021 07:40:56 - INFO - __main__ - Step 74511: {'lr': 0.0002579057308501116, 'samples': 14306112, 'steps': 74510, 'loss/train': 0.8638182878494263} 11/07/2021 07:40:56 - INFO - __main__ - Step 74512: {'lr': 0.00025790042675804414, 'samples': 14306304, 'steps': 74511, 'loss/train': 1.671295404434204} 11/07/2021 07:40:57 - INFO - __main__ - Step 74513: {'lr': 0.00025789512266241685, 'samples': 14306496, 'steps': 74512, 'loss/train': 1.6006171703338623} 11/07/2021 07:40:58 - INFO - __main__ - Step 74514: {'lr': 0.00025788981856323214, 'samples': 14306688, 'steps': 74513, 'loss/train': 1.3834500312805176} 11/07/2021 07:40:58 - INFO - __main__ - Step 74515: {'lr': 0.0002578845144604924, 'samples': 14306880, 'steps': 74514, 'loss/train': 1.4497520923614502} 11/07/2021 07:40:58 - INFO - __main__ - Step 74516: {'lr': 0.00025787921035419996, 'samples': 14307072, 'steps': 74515, 'loss/train': 1.9283190965652466} 11/07/2021 07:40:59 - INFO - __main__ - Step 74517: {'lr': 0.0002578739062443574, 'samples': 14307264, 'steps': 74516, 'loss/train': 1.168873906135559} 11/07/2021 07:41:00 - INFO - __main__ - Step 74518: {'lr': 0.00025786860213096685, 'samples': 14307456, 'steps': 74517, 'loss/train': 1.1773877143859863} 11/07/2021 07:41:00 - INFO - __main__ - Step 74519: {'lr': 0.00025786329801403093, 'samples': 14307648, 'steps': 74518, 'loss/train': 1.6256303787231445} 11/07/2021 07:41:01 - INFO - __main__ - Step 74520: {'lr': 0.00025785799389355183, 'samples': 14307840, 'steps': 74519, 'loss/train': 1.1663607358932495} 11/07/2021 07:41:01 - INFO - __main__ - Step 74521: {'lr': 0.00025785268976953206, 'samples': 14308032, 'steps': 74520, 'loss/train': 1.1596561670303345} 11/07/2021 07:41:01 - INFO - __main__ - Step 74522: {'lr': 0.0002578473856419741, 'samples': 14308224, 'steps': 74521, 'loss/train': 1.1591070890426636} 11/07/2021 07:41:02 - INFO - __main__ - Step 74523: {'lr': 0.00025784208151088007, 'samples': 14308416, 'steps': 74522, 'loss/train': 1.3617095947265625} 11/07/2021 07:41:03 - INFO - __main__ - Step 74524: {'lr': 0.0002578367773762526, 'samples': 14308608, 'steps': 74523, 'loss/train': 1.4291620254516602} 11/07/2021 07:41:03 - INFO - __main__ - Step 74525: {'lr': 0.000257831473238094, 'samples': 14308800, 'steps': 74524, 'loss/train': 1.8131674528121948} 11/07/2021 07:41:03 - INFO - __main__ - Step 74526: {'lr': 0.0002578261690964067, 'samples': 14308992, 'steps': 74525, 'loss/train': 1.2596547603607178} 11/07/2021 07:41:04 - INFO - __main__ - Step 74527: {'lr': 0.000257820864951193, 'samples': 14309184, 'steps': 74526, 'loss/train': 1.0988143682479858} 11/07/2021 07:41:04 - INFO - __main__ - Step 74528: {'lr': 0.0002578155608024553, 'samples': 14309376, 'steps': 74527, 'loss/train': 0.9382103681564331} 11/07/2021 07:41:05 - INFO - __main__ - Step 74529: {'lr': 0.0002578102566501961, 'samples': 14309568, 'steps': 74528, 'loss/train': 1.350445032119751} 11/07/2021 07:41:05 - INFO - __main__ - Step 74530: {'lr': 0.00025780495249441764, 'samples': 14309760, 'steps': 74529, 'loss/train': 1.4763590097427368} 11/07/2021 07:41:06 - INFO - __main__ - Step 74531: {'lr': 0.0002577996483351225, 'samples': 14309952, 'steps': 74530, 'loss/train': 1.4295326471328735} 11/07/2021 07:41:06 - INFO - __main__ - Step 74532: {'lr': 0.0002577943441723128, 'samples': 14310144, 'steps': 74531, 'loss/train': 1.2088348865509033} 11/07/2021 07:41:07 - INFO - __main__ - Step 74533: {'lr': 0.00025778904000599127, 'samples': 14310336, 'steps': 74532, 'loss/train': 1.569271445274353} 11/07/2021 07:41:07 - INFO - __main__ - Step 74534: {'lr': 0.00025778373583616005, 'samples': 14310528, 'steps': 74533, 'loss/train': 1.2892299890518188} 11/07/2021 07:41:08 - INFO - __main__ - Step 74535: {'lr': 0.00025777843166282155, 'samples': 14310720, 'steps': 74534, 'loss/train': 1.3397316932678223} 11/07/2021 07:41:08 - INFO - __main__ - Step 74536: {'lr': 0.00025777312748597825, 'samples': 14310912, 'steps': 74535, 'loss/train': 1.4855566024780273} 11/07/2021 07:41:09 - INFO - __main__ - Step 74537: {'lr': 0.0002577678233056325, 'samples': 14311104, 'steps': 74536, 'loss/train': 1.7218000888824463} 11/07/2021 07:41:09 - INFO - __main__ - Step 74538: {'lr': 0.00025776251912178666, 'samples': 14311296, 'steps': 74537, 'loss/train': 1.0902422666549683} 11/07/2021 07:41:10 - INFO - __main__ - Step 74539: {'lr': 0.0002577572149344432, 'samples': 14311488, 'steps': 74538, 'loss/train': 0.9529958963394165} 11/07/2021 07:41:10 - INFO - __main__ - Step 74540: {'lr': 0.0002577519107436044, 'samples': 14311680, 'steps': 74539, 'loss/train': 1.3749957084655762} 11/07/2021 07:41:11 - INFO - __main__ - Step 74541: {'lr': 0.0002577466065492727, 'samples': 14311872, 'steps': 74540, 'loss/train': 0.7172769904136658} 11/07/2021 07:41:11 - INFO - __main__ - Step 74542: {'lr': 0.00025774130235145054, 'samples': 14312064, 'steps': 74541, 'loss/train': 1.873766303062439} 11/07/2021 07:41:11 - INFO - __main__ - Step 74543: {'lr': 0.00025773599815014027, 'samples': 14312256, 'steps': 74542, 'loss/train': 1.6075291633605957} 11/07/2021 07:41:12 - INFO - __main__ - Step 74544: {'lr': 0.0002577306939453443, 'samples': 14312448, 'steps': 74543, 'loss/train': 1.4677939414978027} 11/07/2021 07:41:13 - INFO - __main__ - Step 74545: {'lr': 0.00025772538973706493, 'samples': 14312640, 'steps': 74544, 'loss/train': 1.0708194971084595} 11/07/2021 07:41:13 - INFO - __main__ - Step 74546: {'lr': 0.00025772008552530474, 'samples': 14312832, 'steps': 74545, 'loss/train': 0.46044886112213135} 11/07/2021 07:41:13 - INFO - __main__ - Step 74547: {'lr': 0.0002577147813100659, 'samples': 14313024, 'steps': 74546, 'loss/train': 1.5973005294799805} 11/07/2021 07:41:14 - INFO - __main__ - Step 74548: {'lr': 0.0002577094770913509, 'samples': 14313216, 'steps': 74547, 'loss/train': 1.1134880781173706} 11/07/2021 07:41:15 - INFO - __main__ - Step 74549: {'lr': 0.00025770417286916217, 'samples': 14313408, 'steps': 74548, 'loss/train': 1.5212047100067139} 11/07/2021 07:41:15 - INFO - __main__ - Step 74550: {'lr': 0.000257698868643502, 'samples': 14313600, 'steps': 74549, 'loss/train': 1.2074543237686157} 11/07/2021 07:41:16 - INFO - __main__ - Step 74551: {'lr': 0.00025769356441437285, 'samples': 14313792, 'steps': 74550, 'loss/train': 1.3471423387527466} 11/07/2021 07:41:16 - INFO - __main__ - Step 74552: {'lr': 0.0002576882601817771, 'samples': 14313984, 'steps': 74551, 'loss/train': 1.9963257312774658} 11/07/2021 07:41:16 - INFO - __main__ - Step 74553: {'lr': 0.00025768295594571724, 'samples': 14314176, 'steps': 74552, 'loss/train': 1.49478018283844} 11/07/2021 07:41:17 - INFO - __main__ - Step 74554: {'lr': 0.00025767765170619546, 'samples': 14314368, 'steps': 74553, 'loss/train': 0.7739112973213196} 11/07/2021 07:41:18 - INFO - __main__ - Step 74555: {'lr': 0.0002576723474632142, 'samples': 14314560, 'steps': 74554, 'loss/train': 1.3744009733200073} 11/07/2021 07:41:18 - INFO - __main__ - Step 74556: {'lr': 0.00025766704321677597, 'samples': 14314752, 'steps': 74555, 'loss/train': 1.370945930480957} 11/07/2021 07:41:18 - INFO - __main__ - Step 74557: {'lr': 0.0002576617389668831, 'samples': 14314944, 'steps': 74556, 'loss/train': 1.6127040386199951} 11/07/2021 07:41:19 - INFO - __main__ - Step 74558: {'lr': 0.00025765643471353794, 'samples': 14315136, 'steps': 74557, 'loss/train': 1.6008549928665161} 11/07/2021 07:41:19 - INFO - __main__ - Step 74559: {'lr': 0.0002576511304567429, 'samples': 14315328, 'steps': 74558, 'loss/train': 1.0307472944259644} 11/07/2021 07:41:20 - INFO - __main__ - Step 74560: {'lr': 0.00025764582619650046, 'samples': 14315520, 'steps': 74559, 'loss/train': 0.8030219674110413} 11/07/2021 07:41:20 - INFO - __main__ - Step 74561: {'lr': 0.00025764052193281284, 'samples': 14315712, 'steps': 74560, 'loss/train': 1.2325117588043213} 11/07/2021 07:41:21 - INFO - __main__ - Step 74562: {'lr': 0.00025763521766568255, 'samples': 14315904, 'steps': 74561, 'loss/train': 1.7354263067245483} 11/07/2021 07:41:21 - INFO - __main__ - Step 74563: {'lr': 0.00025762991339511193, 'samples': 14316096, 'steps': 74562, 'loss/train': 1.13173246383667} 11/07/2021 07:41:21 - INFO - __main__ - Step 74564: {'lr': 0.0002576246091211034, 'samples': 14316288, 'steps': 74563, 'loss/train': 1.64971125125885} 11/07/2021 07:41:22 - INFO - __main__ - Step 74565: {'lr': 0.0002576193048436594, 'samples': 14316480, 'steps': 74564, 'loss/train': 1.0235857963562012} 11/07/2021 07:41:23 - INFO - __main__ - Step 74566: {'lr': 0.00025761400056278217, 'samples': 14316672, 'steps': 74565, 'loss/train': 1.1955231428146362} 11/07/2021 07:41:23 - INFO - __main__ - Step 74567: {'lr': 0.0002576086962784742, 'samples': 14316864, 'steps': 74566, 'loss/train': 1.7397836446762085} 11/07/2021 07:41:23 - INFO - __main__ - Step 74568: {'lr': 0.0002576033919907379, 'samples': 14317056, 'steps': 74567, 'loss/train': 1.6541827917099} 11/07/2021 07:41:24 - INFO - __main__ - Step 74569: {'lr': 0.0002575980876995756, 'samples': 14317248, 'steps': 74568, 'loss/train': 0.7649467587471008} 11/07/2021 07:41:25 - INFO - __main__ - Step 74570: {'lr': 0.00025759278340498976, 'samples': 14317440, 'steps': 74569, 'loss/train': 1.9358104467391968} 11/07/2021 07:41:25 - INFO - __main__ - Step 74571: {'lr': 0.0002575874791069827, 'samples': 14317632, 'steps': 74570, 'loss/train': 1.4810479879379272} 11/07/2021 07:41:26 - INFO - __main__ - Step 74572: {'lr': 0.00025758217480555687, 'samples': 14317824, 'steps': 74571, 'loss/train': 1.6211732625961304} 11/07/2021 07:41:26 - INFO - __main__ - Step 74573: {'lr': 0.0002575768705007146, 'samples': 14318016, 'steps': 74572, 'loss/train': 1.7809889316558838} 11/07/2021 07:41:26 - INFO - __main__ - Step 74574: {'lr': 0.0002575715661924583, 'samples': 14318208, 'steps': 74573, 'loss/train': 1.2356553077697754} 11/07/2021 07:41:27 - INFO - __main__ - Step 74575: {'lr': 0.0002575662618807904, 'samples': 14318400, 'steps': 74574, 'loss/train': 0.9032836556434631} 11/07/2021 07:41:28 - INFO - __main__ - Step 74576: {'lr': 0.00025756095756571324, 'samples': 14318592, 'steps': 74575, 'loss/train': 1.525964617729187} 11/07/2021 07:41:28 - INFO - __main__ - Step 74577: {'lr': 0.0002575556532472292, 'samples': 14318784, 'steps': 74576, 'loss/train': 1.4593123197555542} 11/07/2021 07:41:28 - INFO - __main__ - Step 74578: {'lr': 0.0002575503489253407, 'samples': 14318976, 'steps': 74577, 'loss/train': 1.2952344417572021} 11/07/2021 07:41:29 - INFO - __main__ - Step 74579: {'lr': 0.0002575450446000502, 'samples': 14319168, 'steps': 74578, 'loss/train': 0.8929539322853088} 11/07/2021 07:41:29 - INFO - __main__ - Step 74580: {'lr': 0.00025753974027136, 'samples': 14319360, 'steps': 74579, 'loss/train': 1.164618968963623} 11/07/2021 07:41:31 - INFO - __main__ - Step 74581: {'lr': 0.0002575344359392725, 'samples': 14319552, 'steps': 74580, 'loss/train': 1.4454360008239746} 11/07/2021 07:41:31 - INFO - __main__ - Step 74582: {'lr': 0.00025752913160379003, 'samples': 14319744, 'steps': 74581, 'loss/train': 1.5725109577178955} 11/07/2021 07:41:31 - INFO - __main__ - Step 74583: {'lr': 0.0002575238272649151, 'samples': 14319936, 'steps': 74582, 'loss/train': 0.24063952267169952} 11/07/2021 07:41:32 - INFO - __main__ - Step 74584: {'lr': 0.0002575185229226501, 'samples': 14320128, 'steps': 74583, 'loss/train': 2.0240509510040283} 11/07/2021 07:41:32 - INFO - __main__ - Step 74585: {'lr': 0.00025751321857699733, 'samples': 14320320, 'steps': 74584, 'loss/train': 1.2435328960418701} 11/07/2021 07:41:33 - INFO - __main__ - Step 74586: {'lr': 0.0002575079142279592, 'samples': 14320512, 'steps': 74585, 'loss/train': 1.4951266050338745} 11/07/2021 07:41:33 - INFO - __main__ - Step 74587: {'lr': 0.00025750260987553815, 'samples': 14320704, 'steps': 74586, 'loss/train': 1.2258621454238892} 11/07/2021 07:41:34 - INFO - __main__ - Step 74588: {'lr': 0.00025749730551973655, 'samples': 14320896, 'steps': 74587, 'loss/train': 1.6399394273757935} 11/07/2021 07:41:34 - INFO - __main__ - Step 74589: {'lr': 0.0002574920011605567, 'samples': 14321088, 'steps': 74588, 'loss/train': 1.0327578783035278} 11/07/2021 07:41:34 - INFO - __main__ - Step 74590: {'lr': 0.00025748669679800116, 'samples': 14321280, 'steps': 74589, 'loss/train': 1.4026901721954346} 11/07/2021 07:41:35 - INFO - __main__ - Step 74591: {'lr': 0.0002574813924320722, 'samples': 14321472, 'steps': 74590, 'loss/train': 1.7586420774459839} 11/07/2021 07:41:36 - INFO - __main__ - Step 74592: {'lr': 0.0002574760880627722, 'samples': 14321664, 'steps': 74591, 'loss/train': 1.2353839874267578} 11/07/2021 07:41:36 - INFO - __main__ - Step 74593: {'lr': 0.0002574707836901037, 'samples': 14321856, 'steps': 74592, 'loss/train': 0.7707455158233643} 11/07/2021 07:41:37 - INFO - __main__ - Step 74594: {'lr': 0.0002574654793140688, 'samples': 14322048, 'steps': 74593, 'loss/train': 1.315688133239746} 11/07/2021 07:41:37 - INFO - __main__ - Step 74595: {'lr': 0.0002574601749346702, 'samples': 14322240, 'steps': 74594, 'loss/train': 1.1052722930908203} 11/07/2021 07:41:38 - INFO - __main__ - Step 74596: {'lr': 0.0002574548705519102, 'samples': 14322432, 'steps': 74595, 'loss/train': 1.4025135040283203} 11/07/2021 07:41:38 - INFO - __main__ - Step 74597: {'lr': 0.0002574495661657911, 'samples': 14322624, 'steps': 74596, 'loss/train': 1.4227774143218994} 11/07/2021 07:41:39 - INFO - __main__ - Step 74598: {'lr': 0.0002574442617763153, 'samples': 14322816, 'steps': 74597, 'loss/train': 1.3683608770370483} 11/07/2021 07:41:39 - INFO - __main__ - Step 74599: {'lr': 0.0002574389573834853, 'samples': 14323008, 'steps': 74598, 'loss/train': 1.635656714439392} 11/07/2021 07:41:39 - INFO - __main__ - Step 74600: {'lr': 0.00025743365298730333, 'samples': 14323200, 'steps': 74599, 'loss/train': 1.394126296043396} 11/07/2021 07:41:40 - INFO - __main__ - Step 74601: {'lr': 0.00025742834858777196, 'samples': 14323392, 'steps': 74600, 'loss/train': 1.2347973585128784} 11/07/2021 07:41:41 - INFO - __main__ - Step 74602: {'lr': 0.00025742304418489343, 'samples': 14323584, 'steps': 74601, 'loss/train': 1.3846920728683472} 11/07/2021 07:41:41 - INFO - __main__ - Step 74603: {'lr': 0.0002574177397786702, 'samples': 14323776, 'steps': 74602, 'loss/train': 1.2217469215393066} 11/07/2021 07:41:41 - INFO - __main__ - Step 74604: {'lr': 0.00025741243536910464, 'samples': 14323968, 'steps': 74603, 'loss/train': 1.4418702125549316} 11/07/2021 07:41:42 - INFO - __main__ - Step 74605: {'lr': 0.00025740713095619914, 'samples': 14324160, 'steps': 74604, 'loss/train': 0.8640434145927429} 11/07/2021 07:41:43 - INFO - __main__ - Step 74606: {'lr': 0.00025740182653995615, 'samples': 14324352, 'steps': 74605, 'loss/train': 1.5906058549880981} 11/07/2021 07:41:43 - INFO - __main__ - Step 74607: {'lr': 0.0002573965221203781, 'samples': 14324544, 'steps': 74606, 'loss/train': 1.36045503616333} 11/07/2021 07:41:43 - INFO - __main__ - Step 74608: {'lr': 0.00025739121769746714, 'samples': 14324736, 'steps': 74607, 'loss/train': 1.6197867393493652} 11/07/2021 07:41:44 - INFO - __main__ - Step 74609: {'lr': 0.00025738591327122585, 'samples': 14324928, 'steps': 74608, 'loss/train': 1.3967220783233643} 11/07/2021 07:41:44 - INFO - __main__ - Step 74610: {'lr': 0.0002573806088416566, 'samples': 14325120, 'steps': 74609, 'loss/train': 1.0136436223983765} 11/07/2021 07:41:45 - INFO - __main__ - Step 74611: {'lr': 0.0002573753044087617, 'samples': 14325312, 'steps': 74610, 'loss/train': 0.7427050471305847} 11/07/2021 07:41:46 - INFO - __main__ - Step 74612: {'lr': 0.0002573699999725437, 'samples': 14325504, 'steps': 74611, 'loss/train': 1.510387659072876} 11/07/2021 07:41:46 - INFO - __main__ - Step 74613: {'lr': 0.00025736469553300483, 'samples': 14325696, 'steps': 74612, 'loss/train': 1.562230110168457} 11/07/2021 07:41:46 - INFO - __main__ - Step 74614: {'lr': 0.00025735939109014754, 'samples': 14325888, 'steps': 74613, 'loss/train': 0.8745324015617371} 11/07/2021 07:41:47 - INFO - __main__ - Step 74615: {'lr': 0.0002573540866439742, 'samples': 14326080, 'steps': 74614, 'loss/train': 1.5946509838104248} 11/07/2021 07:41:48 - INFO - __main__ - Step 74616: {'lr': 0.0002573487821944873, 'samples': 14326272, 'steps': 74615, 'loss/train': 1.5732948780059814} 11/07/2021 07:41:48 - INFO - __main__ - Step 74617: {'lr': 0.0002573434777416891, 'samples': 14326464, 'steps': 74616, 'loss/train': 1.3004690408706665} 11/07/2021 07:41:48 - INFO - __main__ - Step 74618: {'lr': 0.0002573381732855821, 'samples': 14326656, 'steps': 74617, 'loss/train': 1.4302783012390137} 11/07/2021 07:41:49 - INFO - __main__ - Step 74619: {'lr': 0.00025733286882616854, 'samples': 14326848, 'steps': 74618, 'loss/train': 1.2668838500976562} 11/07/2021 07:41:49 - INFO - __main__ - Step 74620: {'lr': 0.00025732756436345095, 'samples': 14327040, 'steps': 74619, 'loss/train': 1.6277252435684204} 11/07/2021 07:41:50 - INFO - __main__ - Step 74621: {'lr': 0.0002573222598974317, 'samples': 14327232, 'steps': 74620, 'loss/train': 1.5695945024490356} 11/07/2021 07:41:50 - INFO - __main__ - Step 74622: {'lr': 0.00025731695542811315, 'samples': 14327424, 'steps': 74621, 'loss/train': 1.459820032119751} 11/07/2021 07:41:51 - INFO - __main__ - Step 74623: {'lr': 0.00025731165095549765, 'samples': 14327616, 'steps': 74622, 'loss/train': 1.1601301431655884} 11/07/2021 07:41:51 - INFO - __main__ - Step 74624: {'lr': 0.0002573063464795876, 'samples': 14327808, 'steps': 74623, 'loss/train': 0.470732718706131} 11/07/2021 07:41:51 - INFO - __main__ - Step 74625: {'lr': 0.00025730104200038546, 'samples': 14328000, 'steps': 74624, 'loss/train': 1.312210202217102} 11/07/2021 07:41:52 - INFO - __main__ - Step 74626: {'lr': 0.0002572957375178936, 'samples': 14328192, 'steps': 74625, 'loss/train': 1.2606523036956787} 11/07/2021 07:41:53 - INFO - __main__ - Step 74627: {'lr': 0.0002572904330321144, 'samples': 14328384, 'steps': 74626, 'loss/train': 1.0216057300567627} 11/07/2021 07:41:53 - INFO - __main__ - Step 74628: {'lr': 0.00025728512854305023, 'samples': 14328576, 'steps': 74627, 'loss/train': 0.9260098934173584} 11/07/2021 07:41:53 - INFO - __main__ - Step 74629: {'lr': 0.0002572798240507035, 'samples': 14328768, 'steps': 74628, 'loss/train': 1.001358151435852} 11/07/2021 07:41:54 - INFO - __main__ - Step 74630: {'lr': 0.0002572745195550766, 'samples': 14328960, 'steps': 74629, 'loss/train': 1.4209601879119873} 11/07/2021 07:41:54 - INFO - __main__ - Step 74631: {'lr': 0.0002572692150561719, 'samples': 14329152, 'steps': 74630, 'loss/train': 1.230394959449768} 11/07/2021 07:41:55 - INFO - __main__ - Step 74632: {'lr': 0.0002572639105539918, 'samples': 14329344, 'steps': 74631, 'loss/train': 1.1865184307098389} 11/07/2021 07:41:55 - INFO - __main__ - Step 74633: {'lr': 0.00025725860604853873, 'samples': 14329536, 'steps': 74632, 'loss/train': 1.0341624021530151} 11/07/2021 07:41:56 - INFO - __main__ - Step 74634: {'lr': 0.000257253301539815, 'samples': 14329728, 'steps': 74633, 'loss/train': 1.1882179975509644} 11/07/2021 07:41:56 - INFO - __main__ - Step 74635: {'lr': 0.00025724799702782304, 'samples': 14329920, 'steps': 74634, 'loss/train': 1.3310329914093018} 11/07/2021 07:41:56 - INFO - __main__ - Step 74636: {'lr': 0.0002572426925125653, 'samples': 14330112, 'steps': 74635, 'loss/train': 1.567496418952942} 11/07/2021 07:41:58 - INFO - __main__ - Step 74637: {'lr': 0.00025723738799404407, 'samples': 14330304, 'steps': 74636, 'loss/train': 1.3222063779830933} 11/07/2021 07:41:58 - INFO - __main__ - Step 74638: {'lr': 0.00025723208347226174, 'samples': 14330496, 'steps': 74637, 'loss/train': 0.961025059223175} 11/07/2021 07:41:58 - INFO - __main__ - Step 74639: {'lr': 0.0002572267789472208, 'samples': 14330688, 'steps': 74638, 'loss/train': 1.1237554550170898} 11/07/2021 07:41:59 - INFO - __main__ - Step 74640: {'lr': 0.0002572214744189236, 'samples': 14330880, 'steps': 74639, 'loss/train': 1.4818578958511353} 11/07/2021 07:41:59 - INFO - __main__ - Step 74641: {'lr': 0.0002572161698873725, 'samples': 14331072, 'steps': 74640, 'loss/train': 1.4707863330841064} 11/07/2021 07:42:00 - INFO - __main__ - Step 74642: {'lr': 0.00025721086535256994, 'samples': 14331264, 'steps': 74641, 'loss/train': 1.4201233386993408} 11/07/2021 07:42:00 - INFO - __main__ - Step 74643: {'lr': 0.0002572055608145182, 'samples': 14331456, 'steps': 74642, 'loss/train': 1.471823811531067} 11/07/2021 07:42:01 - INFO - __main__ - Step 74644: {'lr': 0.00025720025627321973, 'samples': 14331648, 'steps': 74643, 'loss/train': 1.5391995906829834} 11/07/2021 07:42:01 - INFO - __main__ - Step 74645: {'lr': 0.000257194951728677, 'samples': 14331840, 'steps': 74644, 'loss/train': 1.2282521724700928} 11/07/2021 07:42:01 - INFO - __main__ - Step 74646: {'lr': 0.0002571896471808923, 'samples': 14332032, 'steps': 74645, 'loss/train': 1.1050077676773071} 11/07/2021 07:42:03 - INFO - __main__ - Step 74647: {'lr': 0.0002571843426298682, 'samples': 14332224, 'steps': 74646, 'loss/train': 1.495161771774292} 11/07/2021 07:42:03 - INFO - __main__ - Step 74648: {'lr': 0.00025717903807560675, 'samples': 14332416, 'steps': 74647, 'loss/train': 1.7733937501907349} 11/07/2021 07:42:03 - INFO - __main__ - Step 74649: {'lr': 0.00025717373351811064, 'samples': 14332608, 'steps': 74648, 'loss/train': 1.3528335094451904} 11/07/2021 07:42:04 - INFO - __main__ - Step 74650: {'lr': 0.00025716842895738215, 'samples': 14332800, 'steps': 74649, 'loss/train': 0.6415321826934814} 11/07/2021 07:42:04 - INFO - __main__ - Step 74651: {'lr': 0.0002571631243934236, 'samples': 14332992, 'steps': 74650, 'loss/train': 1.7440075874328613} 11/07/2021 07:42:05 - INFO - __main__ - Step 74652: {'lr': 0.00025715781982623754, 'samples': 14333184, 'steps': 74651, 'loss/train': 0.19268275797367096} 11/07/2021 07:42:05 - INFO - __main__ - Step 74653: {'lr': 0.0002571525152558262, 'samples': 14333376, 'steps': 74652, 'loss/train': 1.576124668121338} 11/07/2021 07:42:06 - INFO - __main__ - Step 74654: {'lr': 0.0002571472106821922, 'samples': 14333568, 'steps': 74653, 'loss/train': 1.3009625673294067} 11/07/2021 07:42:06 - INFO - __main__ - Step 74655: {'lr': 0.0002571419061053376, 'samples': 14333760, 'steps': 74654, 'loss/train': 1.5784034729003906} 11/07/2021 07:42:06 - INFO - __main__ - Step 74656: {'lr': 0.0002571366015252651, 'samples': 14333952, 'steps': 74655, 'loss/train': 1.4547425508499146} 11/07/2021 07:42:07 - INFO - __main__ - Step 74657: {'lr': 0.00025713129694197683, 'samples': 14334144, 'steps': 74656, 'loss/train': 1.0273998975753784} 11/07/2021 07:42:08 - INFO - __main__ - Step 74658: {'lr': 0.0002571259923554754, 'samples': 14334336, 'steps': 74657, 'loss/train': 1.5801746845245361} 11/07/2021 07:42:08 - INFO - __main__ - Step 74659: {'lr': 0.0002571206877657631, 'samples': 14334528, 'steps': 74658, 'loss/train': 1.2082453966140747} 11/07/2021 07:42:08 - INFO - __main__ - Step 74660: {'lr': 0.00025711538317284234, 'samples': 14334720, 'steps': 74659, 'loss/train': 1.6234681606292725} 11/07/2021 07:42:09 - INFO - __main__ - Step 74661: {'lr': 0.0002571100785767154, 'samples': 14334912, 'steps': 74660, 'loss/train': 1.058789849281311} 11/07/2021 07:42:10 - INFO - __main__ - Step 74662: {'lr': 0.00025710477397738486, 'samples': 14335104, 'steps': 74661, 'loss/train': 1.5236732959747314} 11/07/2021 07:42:10 - INFO - __main__ - Step 74663: {'lr': 0.000257099469374853, 'samples': 14335296, 'steps': 74662, 'loss/train': 1.1963218450546265} 11/07/2021 07:42:11 - INFO - __main__ - Step 74664: {'lr': 0.0002570941647691222, 'samples': 14335488, 'steps': 74663, 'loss/train': 1.3460735082626343} 11/07/2021 07:42:11 - INFO - __main__ - Step 74665: {'lr': 0.0002570888601601949, 'samples': 14335680, 'steps': 74664, 'loss/train': 0.9215031266212463} 11/07/2021 07:42:11 - INFO - __main__ - Step 74666: {'lr': 0.0002570835555480735, 'samples': 14335872, 'steps': 74665, 'loss/train': 1.3501654863357544} 11/07/2021 07:42:12 - INFO - __main__ - Step 74667: {'lr': 0.00025707825093276035, 'samples': 14336064, 'steps': 74666, 'loss/train': 1.085124135017395} 11/07/2021 07:42:13 - INFO - __main__ - Step 74668: {'lr': 0.0002570729463142578, 'samples': 14336256, 'steps': 74667, 'loss/train': 1.2341796159744263} 11/07/2021 07:42:13 - INFO - __main__ - Step 74669: {'lr': 0.00025706764169256837, 'samples': 14336448, 'steps': 74668, 'loss/train': 1.3670616149902344} 11/07/2021 07:42:13 - INFO - __main__ - Step 74670: {'lr': 0.0002570623370676943, 'samples': 14336640, 'steps': 74669, 'loss/train': 1.572492241859436} 11/07/2021 07:42:14 - INFO - __main__ - Step 74671: {'lr': 0.00025705703243963804, 'samples': 14336832, 'steps': 74670, 'loss/train': 1.6872344017028809} 11/07/2021 07:42:14 - INFO - __main__ - Step 74672: {'lr': 0.00025705172780840204, 'samples': 14337024, 'steps': 74671, 'loss/train': 1.2287150621414185} 11/07/2021 07:42:15 - INFO - __main__ - Step 74673: {'lr': 0.00025704642317398856, 'samples': 14337216, 'steps': 74672, 'loss/train': 1.0802202224731445} 11/07/2021 07:42:15 - INFO - __main__ - Step 74674: {'lr': 0.0002570411185364002, 'samples': 14337408, 'steps': 74673, 'loss/train': 1.6297316551208496} 11/07/2021 07:42:16 - INFO - __main__ - Step 74675: {'lr': 0.0002570358138956391, 'samples': 14337600, 'steps': 74674, 'loss/train': 1.037557601928711} 11/07/2021 07:42:16 - INFO - __main__ - Step 74676: {'lr': 0.00025703050925170786, 'samples': 14337792, 'steps': 74675, 'loss/train': 1.5283852815628052} 11/07/2021 07:42:17 - INFO - __main__ - Step 74677: {'lr': 0.0002570252046046088, 'samples': 14337984, 'steps': 74676, 'loss/train': 1.2870293855667114} 11/07/2021 07:42:18 - INFO - __main__ - Step 74678: {'lr': 0.00025701989995434416, 'samples': 14338176, 'steps': 74677, 'loss/train': 1.2184354066848755} 11/07/2021 07:42:18 - INFO - __main__ - Step 74679: {'lr': 0.00025701459530091654, 'samples': 14338368, 'steps': 74678, 'loss/train': 0.07096521556377411} 11/07/2021 07:42:18 - INFO - __main__ - Step 74680: {'lr': 0.0002570092906443282, 'samples': 14338560, 'steps': 74679, 'loss/train': 1.1230372190475464} 11/07/2021 07:42:19 - INFO - __main__ - Step 74681: {'lr': 0.0002570039859845817, 'samples': 14338752, 'steps': 74680, 'loss/train': 1.2345619201660156} 11/07/2021 07:42:19 - INFO - __main__ - Step 74682: {'lr': 0.00025699868132167923, 'samples': 14338944, 'steps': 74681, 'loss/train': 1.5531346797943115} 11/07/2021 07:42:20 - INFO - __main__ - Step 74683: {'lr': 0.00025699337665562326, 'samples': 14339136, 'steps': 74682, 'loss/train': 1.332544207572937} 11/07/2021 07:42:20 - INFO - __main__ - Step 74684: {'lr': 0.0002569880719864162, 'samples': 14339328, 'steps': 74683, 'loss/train': 1.487433910369873} 11/07/2021 07:42:21 - INFO - __main__ - Step 74685: {'lr': 0.0002569827673140604, 'samples': 14339520, 'steps': 74684, 'loss/train': 1.5342936515808105} 11/07/2021 07:42:21 - INFO - __main__ - Step 74686: {'lr': 0.0002569774626385583, 'samples': 14339712, 'steps': 74685, 'loss/train': 1.2404531240463257} 11/07/2021 07:42:22 - INFO - __main__ - Step 74687: {'lr': 0.0002569721579599123, 'samples': 14339904, 'steps': 74686, 'loss/train': 1.390657663345337} 11/07/2021 07:42:22 - INFO - __main__ - Step 74688: {'lr': 0.00025696685327812466, 'samples': 14340096, 'steps': 74687, 'loss/train': 1.542125940322876} 11/07/2021 07:42:23 - INFO - __main__ - Step 74689: {'lr': 0.00025696154859319794, 'samples': 14340288, 'steps': 74688, 'loss/train': 0.9698209166526794} 11/07/2021 07:42:23 - INFO - __main__ - Step 74690: {'lr': 0.00025695624390513445, 'samples': 14340480, 'steps': 74689, 'loss/train': 1.6044825315475464} 11/07/2021 07:42:24 - INFO - __main__ - Step 74691: {'lr': 0.00025695093921393653, 'samples': 14340672, 'steps': 74690, 'loss/train': 1.2898313999176025} 11/07/2021 07:42:24 - INFO - __main__ - Step 74692: {'lr': 0.00025694563451960663, 'samples': 14340864, 'steps': 74691, 'loss/train': 2.0402731895446777} 11/07/2021 07:42:25 - INFO - __main__ - Step 74693: {'lr': 0.0002569403298221472, 'samples': 14341056, 'steps': 74692, 'loss/train': 2.0597832202911377} 11/07/2021 07:42:25 - INFO - __main__ - Step 74694: {'lr': 0.0002569350251215605, 'samples': 14341248, 'steps': 74693, 'loss/train': 1.3220133781433105} 11/07/2021 07:42:26 - INFO - __main__ - Step 74695: {'lr': 0.000256929720417849, 'samples': 14341440, 'steps': 74694, 'loss/train': 1.548861026763916} 11/07/2021 07:42:26 - INFO - __main__ - Step 74696: {'lr': 0.0002569244157110151, 'samples': 14341632, 'steps': 74695, 'loss/train': 1.2340368032455444} 11/07/2021 07:42:26 - INFO - __main__ - Step 74697: {'lr': 0.00025691911100106114, 'samples': 14341824, 'steps': 74696, 'loss/train': 1.2709871530532837} 11/07/2021 07:42:27 - INFO - __main__ - Step 74698: {'lr': 0.00025691380628798955, 'samples': 14342016, 'steps': 74697, 'loss/train': 1.1161503791809082} 11/07/2021 07:42:28 - INFO - __main__ - Step 74699: {'lr': 0.0002569085015718027, 'samples': 14342208, 'steps': 74698, 'loss/train': 1.22258722782135} 11/07/2021 07:42:28 - INFO - __main__ - Step 74700: {'lr': 0.00025690319685250294, 'samples': 14342400, 'steps': 74699, 'loss/train': 2.1833510398864746} 11/07/2021 07:42:28 - INFO - __main__ - Step 74701: {'lr': 0.0002568978921300928, 'samples': 14342592, 'steps': 74700, 'loss/train': 1.5719151496887207} 11/07/2021 07:42:29 - INFO - __main__ - Step 74702: {'lr': 0.0002568925874045745, 'samples': 14342784, 'steps': 74701, 'loss/train': 1.4817842245101929} 11/07/2021 07:42:30 - INFO - __main__ - Step 74703: {'lr': 0.00025688728267595054, 'samples': 14342976, 'steps': 74702, 'loss/train': 1.7014262676239014} 11/07/2021 07:42:30 - INFO - __main__ - Step 74704: {'lr': 0.00025688197794422325, 'samples': 14343168, 'steps': 74703, 'loss/train': 1.4043666124343872} 11/07/2021 07:42:31 - INFO - __main__ - Step 74705: {'lr': 0.0002568766732093951, 'samples': 14343360, 'steps': 74704, 'loss/train': 1.536466121673584} 11/07/2021 07:42:31 - INFO - __main__ - Step 74706: {'lr': 0.0002568713684714684, 'samples': 14343552, 'steps': 74705, 'loss/train': 1.4778594970703125} 11/07/2021 07:42:31 - INFO - __main__ - Step 74707: {'lr': 0.0002568660637304456, 'samples': 14343744, 'steps': 74706, 'loss/train': 1.159449815750122} 11/07/2021 07:42:32 - INFO - __main__ - Step 74708: {'lr': 0.00025686075898632895, 'samples': 14343936, 'steps': 74707, 'loss/train': 1.287912368774414} 11/07/2021 07:42:33 - INFO - __main__ - Step 74709: {'lr': 0.00025685545423912104, 'samples': 14344128, 'steps': 74708, 'loss/train': 1.0796597003936768} 11/07/2021 07:42:33 - INFO - __main__ - Step 74710: {'lr': 0.00025685014948882413, 'samples': 14344320, 'steps': 74709, 'loss/train': 1.6018414497375488} 11/07/2021 07:42:33 - INFO - __main__ - Step 74711: {'lr': 0.0002568448447354406, 'samples': 14344512, 'steps': 74710, 'loss/train': 1.5915751457214355} 11/07/2021 07:42:34 - INFO - __main__ - Step 74712: {'lr': 0.00025683953997897297, 'samples': 14344704, 'steps': 74711, 'loss/train': 1.6194968223571777} 11/07/2021 07:42:34 - INFO - __main__ - Step 74713: {'lr': 0.00025683423521942353, 'samples': 14344896, 'steps': 74712, 'loss/train': 1.6310328245162964} 11/07/2021 07:42:35 - INFO - __main__ - Step 74714: {'lr': 0.00025682893045679474, 'samples': 14345088, 'steps': 74713, 'loss/train': 1.6410804986953735} 11/07/2021 07:42:35 - INFO - __main__ - Step 74715: {'lr': 0.0002568236256910889, 'samples': 14345280, 'steps': 74714, 'loss/train': 1.7644520998001099} 11/07/2021 07:42:36 - INFO - __main__ - Step 74716: {'lr': 0.0002568183209223084, 'samples': 14345472, 'steps': 74715, 'loss/train': 1.7668037414550781} 11/07/2021 07:42:36 - INFO - __main__ - Step 74717: {'lr': 0.00025681301615045564, 'samples': 14345664, 'steps': 74716, 'loss/train': 1.4990813732147217} 11/07/2021 07:42:36 - INFO - __main__ - Step 74718: {'lr': 0.00025680771137553314, 'samples': 14345856, 'steps': 74717, 'loss/train': 1.066232681274414} 11/07/2021 07:42:38 - INFO - __main__ - Step 74719: {'lr': 0.00025680240659754316, 'samples': 14346048, 'steps': 74718, 'loss/train': 1.4157991409301758} 11/07/2021 07:42:38 - INFO - __main__ - Step 74720: {'lr': 0.00025679710181648814, 'samples': 14346240, 'steps': 74719, 'loss/train': 1.1826245784759521} 11/07/2021 07:42:38 - INFO - __main__ - Step 74721: {'lr': 0.00025679179703237036, 'samples': 14346432, 'steps': 74720, 'loss/train': 1.3838386535644531} 11/07/2021 07:42:39 - INFO - __main__ - Step 74722: {'lr': 0.0002567864922451924, 'samples': 14346624, 'steps': 74721, 'loss/train': 1.890321135520935} 11/07/2021 07:42:39 - INFO - __main__ - Step 74723: {'lr': 0.0002567811874549565, 'samples': 14346816, 'steps': 74722, 'loss/train': 1.8999841213226318} 11/07/2021 07:42:40 - INFO - __main__ - Step 74724: {'lr': 0.00025677588266166505, 'samples': 14347008, 'steps': 74723, 'loss/train': 0.6607469916343689} 11/07/2021 07:42:40 - INFO - __main__ - Step 74725: {'lr': 0.00025677057786532067, 'samples': 14347200, 'steps': 74724, 'loss/train': 1.2588120698928833} 11/07/2021 07:42:41 - INFO - __main__ - Step 74726: {'lr': 0.00025676527306592545, 'samples': 14347392, 'steps': 74725, 'loss/train': 1.88280189037323} 11/07/2021 07:42:41 - INFO - __main__ - Step 74727: {'lr': 0.0002567599682634819, 'samples': 14347584, 'steps': 74726, 'loss/train': 1.1429283618927002} 11/07/2021 07:42:41 - INFO - __main__ - Step 74728: {'lr': 0.00025675466345799236, 'samples': 14347776, 'steps': 74727, 'loss/train': 1.1769791841506958} 11/07/2021 07:42:42 - INFO - __main__ - Step 74729: {'lr': 0.0002567493586494594, 'samples': 14347968, 'steps': 74728, 'loss/train': 1.4815140962600708} 11/07/2021 07:42:43 - INFO - __main__ - Step 74730: {'lr': 0.00025674405383788526, 'samples': 14348160, 'steps': 74729, 'loss/train': 1.3089029788970947} 11/07/2021 07:42:43 - INFO - __main__ - Step 74731: {'lr': 0.0002567387490232723, 'samples': 14348352, 'steps': 74730, 'loss/train': 1.4264663457870483} 11/07/2021 07:42:44 - INFO - __main__ - Step 74732: {'lr': 0.00025673344420562295, 'samples': 14348544, 'steps': 74731, 'loss/train': 1.118865966796875} 11/07/2021 07:42:44 - INFO - __main__ - Step 74733: {'lr': 0.0002567281393849396, 'samples': 14348736, 'steps': 74732, 'loss/train': 1.7757594585418701} 11/07/2021 07:42:45 - INFO - __main__ - Step 74734: {'lr': 0.0002567228345612247, 'samples': 14348928, 'steps': 74733, 'loss/train': 1.5073280334472656} 11/07/2021 07:42:45 - INFO - __main__ - Step 74735: {'lr': 0.00025671752973448057, 'samples': 14349120, 'steps': 74734, 'loss/train': 1.2475311756134033} 11/07/2021 07:42:46 - INFO - __main__ - Step 74736: {'lr': 0.0002567122249047097, 'samples': 14349312, 'steps': 74735, 'loss/train': 1.176971673965454} 11/07/2021 07:42:46 - INFO - __main__ - Step 74737: {'lr': 0.0002567069200719143, 'samples': 14349504, 'steps': 74736, 'loss/train': 1.6068804264068604} 11/07/2021 07:42:46 - INFO - __main__ - Step 74738: {'lr': 0.0002567016152360969, 'samples': 14349696, 'steps': 74737, 'loss/train': 1.4081965684890747} 11/07/2021 07:42:47 - INFO - __main__ - Step 74739: {'lr': 0.00025669631039725987, 'samples': 14349888, 'steps': 74738, 'loss/train': 1.251906394958496} 11/07/2021 07:42:48 - INFO - __main__ - Step 74740: {'lr': 0.0002566910055554056, 'samples': 14350080, 'steps': 74739, 'loss/train': 1.4703494310379028} 11/07/2021 07:42:48 - INFO - __main__ - Step 74741: {'lr': 0.0002566857007105365, 'samples': 14350272, 'steps': 74740, 'loss/train': 1.5745338201522827} 11/07/2021 07:42:48 - INFO - __main__ - Step 74742: {'lr': 0.00025668039586265485, 'samples': 14350464, 'steps': 74741, 'loss/train': 1.4380362033843994} 11/07/2021 07:42:49 - INFO - __main__ - Step 74743: {'lr': 0.00025667509101176317, 'samples': 14350656, 'steps': 74742, 'loss/train': 1.0472491979599} 11/07/2021 07:42:49 - INFO - __main__ - Step 74744: {'lr': 0.00025666978615786375, 'samples': 14350848, 'steps': 74743, 'loss/train': 1.1282490491867065} 11/07/2021 07:42:50 - INFO - __main__ - Step 74745: {'lr': 0.00025666448130095903, 'samples': 14351040, 'steps': 74744, 'loss/train': 1.4397698640823364} 11/07/2021 07:42:51 - INFO - __main__ - Step 74746: {'lr': 0.0002566591764410514, 'samples': 14351232, 'steps': 74745, 'loss/train': 1.663600206375122} 11/07/2021 07:42:51 - INFO - __main__ - Step 74747: {'lr': 0.00025665387157814323, 'samples': 14351424, 'steps': 74746, 'loss/train': 1.6204328536987305} 11/07/2021 07:42:51 - INFO - __main__ - Step 74748: {'lr': 0.00025664856671223703, 'samples': 14351616, 'steps': 74747, 'loss/train': 1.5031384229660034} 11/07/2021 07:42:52 - INFO - __main__ - Step 74749: {'lr': 0.000256643261843335, 'samples': 14351808, 'steps': 74748, 'loss/train': 1.0660921335220337} 11/07/2021 07:42:53 - INFO - __main__ - Step 74750: {'lr': 0.00025663795697143964, 'samples': 14352000, 'steps': 74749, 'loss/train': 1.4176058769226074} 11/07/2021 07:42:53 - INFO - __main__ - Step 74751: {'lr': 0.00025663265209655337, 'samples': 14352192, 'steps': 74750, 'loss/train': 1.568129301071167} 11/07/2021 07:42:53 - INFO - __main__ - Step 74752: {'lr': 0.00025662734721867845, 'samples': 14352384, 'steps': 74751, 'loss/train': 1.415916085243225} 11/07/2021 07:42:54 - INFO - __main__ - Step 74753: {'lr': 0.0002566220423378173, 'samples': 14352576, 'steps': 74752, 'loss/train': 1.674440860748291} 11/07/2021 07:42:54 - INFO - __main__ - Step 74754: {'lr': 0.0002566167374539725, 'samples': 14352768, 'steps': 74753, 'loss/train': 1.5419219732284546} 11/07/2021 07:42:55 - INFO - __main__ - Step 74755: {'lr': 0.00025661143256714623, 'samples': 14352960, 'steps': 74754, 'loss/train': 1.1350572109222412} 11/07/2021 07:42:55 - INFO - __main__ - Step 74756: {'lr': 0.00025660612767734097, 'samples': 14353152, 'steps': 74755, 'loss/train': 0.8723674416542053} 11/07/2021 07:42:56 - INFO - __main__ - Step 74757: {'lr': 0.000256600822784559, 'samples': 14353344, 'steps': 74756, 'loss/train': 1.6337097883224487} 11/07/2021 07:42:56 - INFO - __main__ - Step 74758: {'lr': 0.00025659551788880295, 'samples': 14353536, 'steps': 74757, 'loss/train': 1.373630166053772} 11/07/2021 07:42:56 - INFO - __main__ - Step 74759: {'lr': 0.00025659021299007497, 'samples': 14353728, 'steps': 74758, 'loss/train': 1.4929072856903076} 11/07/2021 07:42:57 - INFO - __main__ - Step 74760: {'lr': 0.0002565849080883775, 'samples': 14353920, 'steps': 74759, 'loss/train': 1.3734697103500366} 11/07/2021 07:42:58 - INFO - __main__ - Step 74761: {'lr': 0.00025657960318371315, 'samples': 14354112, 'steps': 74760, 'loss/train': 1.4391684532165527} 11/07/2021 07:42:58 - INFO - __main__ - Step 74762: {'lr': 0.000256574298276084, 'samples': 14354304, 'steps': 74761, 'loss/train': 0.9952316880226135} 11/07/2021 07:42:58 - INFO - __main__ - Step 74763: {'lr': 0.00025656899336549255, 'samples': 14354496, 'steps': 74762, 'loss/train': 1.5046530961990356} 11/07/2021 07:42:59 - INFO - __main__ - Step 74764: {'lr': 0.0002565636884519413, 'samples': 14354688, 'steps': 74763, 'loss/train': 1.4808180332183838} 11/07/2021 07:43:00 - INFO - __main__ - Step 74765: {'lr': 0.00025655838353543246, 'samples': 14354880, 'steps': 74764, 'loss/train': 1.8485963344573975} 11/07/2021 07:43:00 - INFO - __main__ - Step 74766: {'lr': 0.0002565530786159686, 'samples': 14355072, 'steps': 74765, 'loss/train': 1.226456642150879} 11/07/2021 07:43:01 - INFO - __main__ - Step 74767: {'lr': 0.0002565477736935519, 'samples': 14355264, 'steps': 74766, 'loss/train': 1.2708745002746582} 11/07/2021 07:43:01 - INFO - __main__ - Step 74768: {'lr': 0.00025654246876818503, 'samples': 14355456, 'steps': 74767, 'loss/train': 1.112826943397522} 11/07/2021 07:43:01 - INFO - __main__ - Step 74769: {'lr': 0.00025653716383987015, 'samples': 14355648, 'steps': 74768, 'loss/train': 1.0915803909301758} 11/07/2021 07:43:02 - INFO - __main__ - Step 74770: {'lr': 0.0002565318589086097, 'samples': 14355840, 'steps': 74769, 'loss/train': 1.439963698387146} 11/07/2021 07:43:03 - INFO - __main__ - Step 74771: {'lr': 0.0002565265539744061, 'samples': 14356032, 'steps': 74770, 'loss/train': 1.1778656244277954} 11/07/2021 07:43:03 - INFO - __main__ - Step 74772: {'lr': 0.00025652124903726174, 'samples': 14356224, 'steps': 74771, 'loss/train': 1.0485948324203491} 11/07/2021 07:43:03 - INFO - __main__ - Step 74773: {'lr': 0.00025651594409717903, 'samples': 14356416, 'steps': 74772, 'loss/train': 0.8286210894584656} 11/07/2021 07:43:04 - INFO - __main__ - Step 74774: {'lr': 0.00025651063915416037, 'samples': 14356608, 'steps': 74773, 'loss/train': 1.8505760431289673} 11/07/2021 07:43:04 - INFO - __main__ - Step 74775: {'lr': 0.0002565053342082081, 'samples': 14356800, 'steps': 74774, 'loss/train': 1.505564570426941} 11/07/2021 07:43:05 - INFO - __main__ - Step 74776: {'lr': 0.00025650002925932456, 'samples': 14356992, 'steps': 74775, 'loss/train': 1.0035455226898193} 11/07/2021 07:43:05 - INFO - __main__ - Step 74777: {'lr': 0.00025649472430751226, 'samples': 14357184, 'steps': 74776, 'loss/train': 1.4388130903244019} 11/07/2021 07:43:06 - INFO - __main__ - Step 74778: {'lr': 0.0002564894193527735, 'samples': 14357376, 'steps': 74777, 'loss/train': 2.073012590408325} 11/07/2021 07:43:06 - INFO - __main__ - Step 74779: {'lr': 0.00025648411439511075, 'samples': 14357568, 'steps': 74778, 'loss/train': 1.4140303134918213} 11/07/2021 07:43:06 - INFO - __main__ - Step 74780: {'lr': 0.00025647880943452633, 'samples': 14357760, 'steps': 74779, 'loss/train': 1.5017492771148682} 11/07/2021 07:43:07 - INFO - __main__ - Step 74781: {'lr': 0.00025647350447102274, 'samples': 14357952, 'steps': 74780, 'loss/train': 1.3430224657058716} 11/07/2021 07:43:08 - INFO - __main__ - Step 74782: {'lr': 0.0002564681995046022, 'samples': 14358144, 'steps': 74781, 'loss/train': 1.2582926750183105} 11/07/2021 07:43:08 - INFO - __main__ - Step 74783: {'lr': 0.00025646289453526715, 'samples': 14358336, 'steps': 74782, 'loss/train': 0.8017022609710693} 11/07/2021 07:43:08 - INFO - __main__ - Step 74784: {'lr': 0.0002564575895630201, 'samples': 14358528, 'steps': 74783, 'loss/train': 1.2380846738815308} 11/07/2021 07:43:09 - INFO - __main__ - Step 74785: {'lr': 0.00025645228458786337, 'samples': 14358720, 'steps': 74784, 'loss/train': 0.550457775592804} 11/07/2021 07:43:10 - INFO - __main__ - Step 74786: {'lr': 0.0002564469796097993, 'samples': 14358912, 'steps': 74785, 'loss/train': 1.770780324935913} 11/07/2021 07:43:10 - INFO - __main__ - Step 74787: {'lr': 0.0002564416746288303, 'samples': 14359104, 'steps': 74786, 'loss/train': 1.5007905960083008} 11/07/2021 07:43:11 - INFO - __main__ - Step 74788: {'lr': 0.00025643636964495887, 'samples': 14359296, 'steps': 74787, 'loss/train': 1.544717788696289} 11/07/2021 07:43:11 - INFO - __main__ - Step 74789: {'lr': 0.0002564310646581872, 'samples': 14359488, 'steps': 74788, 'loss/train': 0.8952957987785339} 11/07/2021 07:43:11 - INFO - __main__ - Step 74790: {'lr': 0.00025642575966851783, 'samples': 14359680, 'steps': 74789, 'loss/train': 0.9645843505859375} 11/07/2021 07:43:13 - INFO - __main__ - Step 74791: {'lr': 0.0002564204546759531, 'samples': 14359872, 'steps': 74790, 'loss/train': 1.42986261844635} 11/07/2021 07:43:13 - INFO - __main__ - Step 74792: {'lr': 0.00025641514968049545, 'samples': 14360064, 'steps': 74791, 'loss/train': 1.6125439405441284} 11/07/2021 07:43:13 - INFO - __main__ - Step 74793: {'lr': 0.00025640984468214723, 'samples': 14360256, 'steps': 74792, 'loss/train': 1.408057689666748} 11/07/2021 07:43:14 - INFO - __main__ - Step 74794: {'lr': 0.0002564045396809108, 'samples': 14360448, 'steps': 74793, 'loss/train': 1.6539872884750366} 11/07/2021 07:43:14 - INFO - __main__ - Step 74795: {'lr': 0.00025639923467678867, 'samples': 14360640, 'steps': 74794, 'loss/train': 1.3198845386505127} 11/07/2021 07:43:16 - INFO - __main__ - Step 74796: {'lr': 0.00025639392966978305, 'samples': 14360832, 'steps': 74795, 'loss/train': 0.7614462971687317} 11/07/2021 07:43:16 - INFO - __main__ - Step 74797: {'lr': 0.0002563886246598964, 'samples': 14361024, 'steps': 74796, 'loss/train': 1.2568249702453613} 11/07/2021 07:43:16 - INFO - __main__ - Step 74798: {'lr': 0.00025638331964713125, 'samples': 14361216, 'steps': 74797, 'loss/train': 0.4276106655597687} 11/07/2021 07:43:17 - INFO - __main__ - Step 74799: {'lr': 0.0002563780146314898, 'samples': 14361408, 'steps': 74798, 'loss/train': 1.3329145908355713} 11/07/2021 07:43:17 - INFO - __main__ - Step 74800: {'lr': 0.0002563727096129745, 'samples': 14361600, 'steps': 74799, 'loss/train': 1.5136909484863281} 11/07/2021 07:43:18 - INFO - __main__ - Step 74801: {'lr': 0.00025636740459158774, 'samples': 14361792, 'steps': 74800, 'loss/train': 1.1098908185958862} 11/07/2021 07:43:18 - INFO - __main__ - Step 74802: {'lr': 0.000256362099567332, 'samples': 14361984, 'steps': 74801, 'loss/train': 1.3646321296691895} 11/07/2021 07:43:19 - INFO - __main__ - Step 74803: {'lr': 0.0002563567945402096, 'samples': 14362176, 'steps': 74802, 'loss/train': 1.5750564336776733} 11/07/2021 07:43:19 - INFO - __main__ - Step 74804: {'lr': 0.0002563514895102229, 'samples': 14362368, 'steps': 74803, 'loss/train': 1.149714708328247} 11/07/2021 07:43:19 - INFO - __main__ - Step 74805: {'lr': 0.0002563461844773743, 'samples': 14362560, 'steps': 74804, 'loss/train': 0.487000048160553} 11/07/2021 07:43:21 - INFO - __main__ - Step 74806: {'lr': 0.00025634087944166617, 'samples': 14362752, 'steps': 74805, 'loss/train': 1.6147058010101318} 11/07/2021 07:43:21 - INFO - __main__ - Step 74807: {'lr': 0.000256335574403101, 'samples': 14362944, 'steps': 74806, 'loss/train': 0.9106371998786926} 11/07/2021 07:43:22 - INFO - __main__ - Step 74808: {'lr': 0.00025633026936168116, 'samples': 14363136, 'steps': 74807, 'loss/train': 1.9908746480941772} 11/07/2021 07:43:22 - INFO - __main__ - Step 74809: {'lr': 0.0002563249643174089, 'samples': 14363328, 'steps': 74808, 'loss/train': 1.725812315940857} 11/07/2021 07:43:22 - INFO - __main__ - Step 74810: {'lr': 0.00025631965927028677, 'samples': 14363520, 'steps': 74809, 'loss/train': 0.7659514546394348} 11/07/2021 07:43:23 - INFO - __main__ - Step 74811: {'lr': 0.0002563143542203171, 'samples': 14363712, 'steps': 74810, 'loss/train': 1.977474331855774} 11/07/2021 07:43:23 - INFO - __main__ - Step 74812: {'lr': 0.0002563090491675022, 'samples': 14363904, 'steps': 74811, 'loss/train': 1.8736491203308105} 11/07/2021 07:43:24 - INFO - __main__ - Step 74813: {'lr': 0.0002563037441118446, 'samples': 14364096, 'steps': 74812, 'loss/train': 1.2005064487457275} 11/07/2021 07:43:25 - INFO - __main__ - Step 74814: {'lr': 0.0002562984390533466, 'samples': 14364288, 'steps': 74813, 'loss/train': 1.530672311782837} 11/07/2021 07:43:25 - INFO - __main__ - Step 74815: {'lr': 0.00025629313399201073, 'samples': 14364480, 'steps': 74814, 'loss/train': 1.5189435482025146} 11/07/2021 07:43:25 - INFO - __main__ - Step 74816: {'lr': 0.00025628782892783914, 'samples': 14364672, 'steps': 74815, 'loss/train': 1.1230249404907227} 11/07/2021 07:43:26 - INFO - __main__ - Step 74817: {'lr': 0.0002562825238608344, 'samples': 14364864, 'steps': 74816, 'loss/train': 1.361356496810913} 11/07/2021 07:43:27 - INFO - __main__ - Step 74818: {'lr': 0.00025627721879099884, 'samples': 14365056, 'steps': 74817, 'loss/train': 1.9221653938293457} 11/07/2021 07:43:27 - INFO - __main__ - Step 74819: {'lr': 0.00025627191371833485, 'samples': 14365248, 'steps': 74818, 'loss/train': 1.5846952199935913} 11/07/2021 07:43:27 - INFO - __main__ - Step 74820: {'lr': 0.00025626660864284484, 'samples': 14365440, 'steps': 74819, 'loss/train': 1.395248293876648} 11/07/2021 07:43:28 - INFO - __main__ - Step 74821: {'lr': 0.0002562613035645312, 'samples': 14365632, 'steps': 74820, 'loss/train': 1.1528257131576538} 11/07/2021 07:43:28 - INFO - __main__ - Step 74822: {'lr': 0.0002562559984833964, 'samples': 14365824, 'steps': 74821, 'loss/train': 1.515475869178772} 11/07/2021 07:43:29 - INFO - __main__ - Step 74823: {'lr': 0.00025625069339944265, 'samples': 14366016, 'steps': 74822, 'loss/train': 1.7438992261886597} 11/07/2021 07:43:30 - INFO - __main__ - Step 74824: {'lr': 0.00025624538831267243, 'samples': 14366208, 'steps': 74823, 'loss/train': 1.1850091218948364} 11/07/2021 07:43:30 - INFO - __main__ - Step 74825: {'lr': 0.0002562400832230881, 'samples': 14366400, 'steps': 74824, 'loss/train': 0.7457549571990967} 11/07/2021 07:43:30 - INFO - __main__ - Step 74826: {'lr': 0.0002562347781306922, 'samples': 14366592, 'steps': 74825, 'loss/train': 1.3944220542907715} 11/07/2021 07:43:31 - INFO - __main__ - Step 74827: {'lr': 0.0002562294730354869, 'samples': 14366784, 'steps': 74826, 'loss/train': 1.5576297044754028} 11/07/2021 07:43:31 - INFO - __main__ - Step 74828: {'lr': 0.0002562241679374748, 'samples': 14366976, 'steps': 74827, 'loss/train': 1.521988034248352} 11/07/2021 07:43:33 - INFO - __main__ - Step 74829: {'lr': 0.0002562188628366581, 'samples': 14367168, 'steps': 74828, 'loss/train': 1.636771559715271} 11/07/2021 07:43:33 - INFO - __main__ - Step 74830: {'lr': 0.00025621355773303926, 'samples': 14367360, 'steps': 74829, 'loss/train': 1.8142738342285156} 11/07/2021 07:43:34 - INFO - __main__ - Step 74831: {'lr': 0.00025620825262662075, 'samples': 14367552, 'steps': 74830, 'loss/train': 1.3742802143096924} 11/07/2021 07:43:34 - INFO - __main__ - Step 74832: {'lr': 0.00025620294751740484, 'samples': 14367744, 'steps': 74831, 'loss/train': 0.22599045932292938} 11/07/2021 07:43:35 - INFO - __main__ - Step 74833: {'lr': 0.000256197642405394, 'samples': 14367936, 'steps': 74832, 'loss/train': 0.3541136384010315} 11/07/2021 07:43:35 - INFO - __main__ - Step 74834: {'lr': 0.0002561923372905906, 'samples': 14368128, 'steps': 74833, 'loss/train': 0.2563682198524475} 11/07/2021 07:43:36 - INFO - __main__ - Step 74835: {'lr': 0.00025618703217299713, 'samples': 14368320, 'steps': 74834, 'loss/train': 1.3643786907196045} 11/07/2021 07:43:36 - INFO - __main__ - Step 74836: {'lr': 0.0002561817270526158, 'samples': 14368512, 'steps': 74835, 'loss/train': 1.4914418458938599} 11/07/2021 07:43:37 - INFO - __main__ - Step 74837: {'lr': 0.000256176421929449, 'samples': 14368704, 'steps': 74836, 'loss/train': 1.5181421041488647} 11/07/2021 07:43:37 - INFO - __main__ - Step 74838: {'lr': 0.00025617111680349924, 'samples': 14368896, 'steps': 74837, 'loss/train': 0.8326175212860107} 11/07/2021 07:43:37 - INFO - __main__ - Step 74839: {'lr': 0.00025616581167476894, 'samples': 14369088, 'steps': 74838, 'loss/train': 2.0874133110046387} 11/07/2021 07:43:38 - INFO - __main__ - Step 74840: {'lr': 0.00025616050654326037, 'samples': 14369280, 'steps': 74839, 'loss/train': 0.8065781593322754} 11/07/2021 07:43:39 - INFO - __main__ - Step 74841: {'lr': 0.00025615520140897597, 'samples': 14369472, 'steps': 74840, 'loss/train': 1.3080334663391113} 11/07/2021 07:43:39 - INFO - __main__ - Step 74842: {'lr': 0.0002561498962719181, 'samples': 14369664, 'steps': 74841, 'loss/train': 1.5170897245407104} 11/07/2021 07:43:40 - INFO - __main__ - Step 74843: {'lr': 0.0002561445911320893, 'samples': 14369856, 'steps': 74842, 'loss/train': 1.3659186363220215} 11/07/2021 07:43:40 - INFO - __main__ - Step 74844: {'lr': 0.0002561392859894917, 'samples': 14370048, 'steps': 74843, 'loss/train': 1.5118317604064941} 11/07/2021 07:43:40 - INFO - __main__ - Step 74845: {'lr': 0.00025613398084412795, 'samples': 14370240, 'steps': 74844, 'loss/train': 1.2863298654556274} 11/07/2021 07:43:41 - INFO - __main__ - Step 74846: {'lr': 0.00025612867569600023, 'samples': 14370432, 'steps': 74845, 'loss/train': 1.1244781017303467} 11/07/2021 07:43:42 - INFO - __main__ - Step 74847: {'lr': 0.000256123370545111, 'samples': 14370624, 'steps': 74846, 'loss/train': 1.5828090906143188} 11/07/2021 07:43:42 - INFO - __main__ - Step 74848: {'lr': 0.0002561180653914628, 'samples': 14370816, 'steps': 74847, 'loss/train': 1.0608482360839844} 11/07/2021 07:43:42 - INFO - __main__ - Step 74849: {'lr': 0.00025611276023505787, 'samples': 14371008, 'steps': 74848, 'loss/train': 1.6176093816757202} 11/07/2021 07:43:43 - INFO - __main__ - Step 74850: {'lr': 0.00025610745507589856, 'samples': 14371200, 'steps': 74849, 'loss/train': 1.4041086435317993} 11/07/2021 07:43:44 - INFO - __main__ - Step 74851: {'lr': 0.00025610214991398733, 'samples': 14371392, 'steps': 74850, 'loss/train': 1.5079138278961182} 11/07/2021 07:43:44 - INFO - __main__ - Step 74852: {'lr': 0.00025609684474932657, 'samples': 14371584, 'steps': 74851, 'loss/train': 1.4510973691940308} 11/07/2021 07:43:44 - INFO - __main__ - Step 74853: {'lr': 0.00025609153958191865, 'samples': 14371776, 'steps': 74852, 'loss/train': 2.0162928104400635} 11/07/2021 07:43:45 - INFO - __main__ - Step 74854: {'lr': 0.0002560862344117661, 'samples': 14371968, 'steps': 74853, 'loss/train': 1.1937873363494873} 11/07/2021 07:43:45 - INFO - __main__ - Step 74855: {'lr': 0.00025608092923887107, 'samples': 14372160, 'steps': 74854, 'loss/train': 1.7394788265228271} 11/07/2021 07:43:46 - INFO - __main__ - Step 74856: {'lr': 0.00025607562406323607, 'samples': 14372352, 'steps': 74855, 'loss/train': 1.1798646450042725} 11/07/2021 07:43:47 - INFO - __main__ - Step 74857: {'lr': 0.0002560703188848635, 'samples': 14372544, 'steps': 74856, 'loss/train': 1.6499180793762207} 11/07/2021 07:43:47 - INFO - __main__ - Step 74858: {'lr': 0.0002560650137037557, 'samples': 14372736, 'steps': 74857, 'loss/train': 1.4105228185653687} 11/07/2021 07:43:47 - INFO - __main__ - Step 74859: {'lr': 0.0002560597085199152, 'samples': 14372928, 'steps': 74858, 'loss/train': 1.6905792951583862} 11/07/2021 07:43:48 - INFO - __main__ - Step 74860: {'lr': 0.00025605440333334423, 'samples': 14373120, 'steps': 74859, 'loss/train': 1.6202589273452759} 11/07/2021 07:43:49 - INFO - __main__ - Step 74861: {'lr': 0.00025604909814404525, 'samples': 14373312, 'steps': 74860, 'loss/train': 1.588568925857544} 11/07/2021 07:43:49 - INFO - __main__ - Step 74862: {'lr': 0.00025604379295202063, 'samples': 14373504, 'steps': 74861, 'loss/train': 1.1337709426879883} 11/07/2021 07:43:49 - INFO - __main__ - Step 74863: {'lr': 0.0002560384877572727, 'samples': 14373696, 'steps': 74862, 'loss/train': 1.900575876235962} 11/07/2021 07:43:50 - INFO - __main__ - Step 74864: {'lr': 0.000256033182559804, 'samples': 14373888, 'steps': 74863, 'loss/train': 1.855250597000122} 11/07/2021 07:43:50 - INFO - __main__ - Step 74865: {'lr': 0.0002560278773596169, 'samples': 14374080, 'steps': 74864, 'loss/train': 1.514375925064087} 11/07/2021 07:43:51 - INFO - __main__ - Step 74866: {'lr': 0.00025602257215671367, 'samples': 14374272, 'steps': 74865, 'loss/train': 1.5239205360412598} 11/07/2021 07:43:52 - INFO - __main__ - Step 74867: {'lr': 0.00025601726695109674, 'samples': 14374464, 'steps': 74866, 'loss/train': 1.0592832565307617} 11/07/2021 07:43:52 - INFO - __main__ - Step 74868: {'lr': 0.00025601196174276854, 'samples': 14374656, 'steps': 74867, 'loss/train': 1.5970852375030518} 11/07/2021 07:43:52 - INFO - __main__ - Step 74869: {'lr': 0.00025600665653173146, 'samples': 14374848, 'steps': 74868, 'loss/train': 1.4313644170761108} 11/07/2021 07:43:53 - INFO - __main__ - Step 74870: {'lr': 0.00025600135131798783, 'samples': 14375040, 'steps': 74869, 'loss/train': 1.6549512147903442} 11/07/2021 07:43:53 - INFO - __main__ - Step 74871: {'lr': 0.00025599604610154015, 'samples': 14375232, 'steps': 74870, 'loss/train': 1.3937783241271973} 11/07/2021 07:43:54 - INFO - __main__ - Step 74872: {'lr': 0.00025599074088239064, 'samples': 14375424, 'steps': 74871, 'loss/train': 0.8701729774475098} 11/07/2021 07:43:54 - INFO - __main__ - Step 74873: {'lr': 0.0002559854356605419, 'samples': 14375616, 'steps': 74872, 'loss/train': 1.3027005195617676} 11/07/2021 07:43:55 - INFO - __main__ - Step 74874: {'lr': 0.00025598013043599615, 'samples': 14375808, 'steps': 74873, 'loss/train': 1.1076768636703491} 11/07/2021 07:43:55 - INFO - __main__ - Step 74875: {'lr': 0.0002559748252087559, 'samples': 14376000, 'steps': 74874, 'loss/train': 1.521704912185669} 11/07/2021 07:43:55 - INFO - __main__ - Step 74876: {'lr': 0.00025596951997882344, 'samples': 14376192, 'steps': 74875, 'loss/train': 0.9338189363479614} 11/07/2021 07:43:56 - INFO - __main__ - Step 74877: {'lr': 0.00025596421474620125, 'samples': 14376384, 'steps': 74876, 'loss/train': 1.5223544836044312} 11/07/2021 07:43:57 - INFO - __main__ - Step 74878: {'lr': 0.0002559589095108916, 'samples': 14376576, 'steps': 74877, 'loss/train': 2.054737091064453} 11/07/2021 07:43:57 - INFO - __main__ - Step 74879: {'lr': 0.000255953604272897, 'samples': 14376768, 'steps': 74878, 'loss/train': 1.1837843656539917} 11/07/2021 07:43:58 - INFO - __main__ - Step 74880: {'lr': 0.0002559482990322198, 'samples': 14376960, 'steps': 74879, 'loss/train': 1.2139451503753662} 11/07/2021 07:43:58 - INFO - __main__ - Step 74881: {'lr': 0.0002559429937888624, 'samples': 14377152, 'steps': 74880, 'loss/train': 1.7631266117095947} 11/07/2021 07:43:58 - INFO - __main__ - Step 74882: {'lr': 0.0002559376885428272, 'samples': 14377344, 'steps': 74881, 'loss/train': 1.7483736276626587} 11/07/2021 07:43:59 - INFO - __main__ - Step 74883: {'lr': 0.00025593238329411655, 'samples': 14377536, 'steps': 74882, 'loss/train': 0.11104434728622437} 11/07/2021 07:44:00 - INFO - __main__ - Step 74884: {'lr': 0.00025592707804273284, 'samples': 14377728, 'steps': 74883, 'loss/train': 1.9461543560028076} 11/07/2021 07:44:00 - INFO - __main__ - Step 74885: {'lr': 0.00025592177278867847, 'samples': 14377920, 'steps': 74884, 'loss/train': 1.4701064825057983} 11/07/2021 07:44:00 - INFO - __main__ - Step 74886: {'lr': 0.0002559164675319559, 'samples': 14378112, 'steps': 74885, 'loss/train': 1.6733100414276123} 11/07/2021 07:44:01 - INFO - __main__ - Step 74887: {'lr': 0.0002559111622725674, 'samples': 14378304, 'steps': 74886, 'loss/train': 1.4269174337387085} 11/07/2021 07:44:02 - INFO - __main__ - Step 74888: {'lr': 0.0002559058570105154, 'samples': 14378496, 'steps': 74887, 'loss/train': 1.7607879638671875} 11/07/2021 07:44:02 - INFO - __main__ - Step 74889: {'lr': 0.0002559005517458024, 'samples': 14378688, 'steps': 74888, 'loss/train': 1.4679356813430786} 11/07/2021 07:44:02 - INFO - __main__ - Step 74890: {'lr': 0.00025589524647843067, 'samples': 14378880, 'steps': 74889, 'loss/train': 1.5094043016433716} 11/07/2021 07:44:03 - INFO - __main__ - Step 74891: {'lr': 0.0002558899412084026, 'samples': 14379072, 'steps': 74890, 'loss/train': 1.1564861536026} 11/07/2021 07:44:03 - INFO - __main__ - Step 74892: {'lr': 0.0002558846359357206, 'samples': 14379264, 'steps': 74891, 'loss/train': 1.3720043897628784} 11/07/2021 07:44:04 - INFO - __main__ - Step 74893: {'lr': 0.00025587933066038707, 'samples': 14379456, 'steps': 74892, 'loss/train': 1.3935171365737915} 11/07/2021 07:44:05 - INFO - __main__ - Step 74894: {'lr': 0.00025587402538240447, 'samples': 14379648, 'steps': 74893, 'loss/train': 2.2383291721343994} 11/07/2021 07:44:05 - INFO - __main__ - Step 74895: {'lr': 0.0002558687201017751, 'samples': 14379840, 'steps': 74894, 'loss/train': 1.479805827140808} 11/07/2021 07:44:05 - INFO - __main__ - Step 74896: {'lr': 0.0002558634148185014, 'samples': 14380032, 'steps': 74895, 'loss/train': 1.4497950077056885} 11/07/2021 07:44:06 - INFO - __main__ - Step 74897: {'lr': 0.0002558581095325857, 'samples': 14380224, 'steps': 74896, 'loss/train': 1.4680635929107666} 11/07/2021 07:44:06 - INFO - __main__ - Step 74898: {'lr': 0.0002558528042440304, 'samples': 14380416, 'steps': 74897, 'loss/train': 1.7472633123397827} 11/07/2021 07:44:07 - INFO - __main__ - Step 74899: {'lr': 0.00025584749895283794, 'samples': 14380608, 'steps': 74898, 'loss/train': 1.6131373643875122} 11/07/2021 07:44:07 - INFO - __main__ - Step 74900: {'lr': 0.0002558421936590107, 'samples': 14380800, 'steps': 74899, 'loss/train': 1.6177500486373901} 11/07/2021 07:44:08 - INFO - __main__ - Step 74901: {'lr': 0.000255836888362551, 'samples': 14380992, 'steps': 74900, 'loss/train': 1.4893367290496826} 11/07/2021 07:44:08 - INFO - __main__ - Step 74902: {'lr': 0.00025583158306346143, 'samples': 14381184, 'steps': 74901, 'loss/train': 1.2250699996948242} 11/07/2021 07:44:08 - INFO - __main__ - Step 74903: {'lr': 0.0002558262777617442, 'samples': 14381376, 'steps': 74902, 'loss/train': 1.2135820388793945} 11/07/2021 07:44:09 - INFO - __main__ - Step 74904: {'lr': 0.0002558209724574016, 'samples': 14381568, 'steps': 74903, 'loss/train': 1.2064783573150635} 11/07/2021 07:44:10 - INFO - __main__ - Step 74905: {'lr': 0.00025581566715043624, 'samples': 14381760, 'steps': 74904, 'loss/train': 1.2792121171951294} 11/07/2021 07:44:10 - INFO - __main__ - Step 74906: {'lr': 0.00025581036184085045, 'samples': 14381952, 'steps': 74905, 'loss/train': 1.2813522815704346} 11/07/2021 07:44:10 - INFO - __main__ - Step 74907: {'lr': 0.0002558050565286466, 'samples': 14382144, 'steps': 74906, 'loss/train': 1.881303071975708} 11/07/2021 07:44:11 - INFO - __main__ - Step 74908: {'lr': 0.00025579975121382706, 'samples': 14382336, 'steps': 74907, 'loss/train': 1.1214250326156616} 11/07/2021 07:44:12 - INFO - __main__ - Step 74909: {'lr': 0.0002557944458963943, 'samples': 14382528, 'steps': 74908, 'loss/train': 1.105391502380371} 11/07/2021 07:44:12 - INFO - __main__ - Step 74910: {'lr': 0.0002557891405763506, 'samples': 14382720, 'steps': 74909, 'loss/train': 1.5048305988311768} 11/07/2021 07:44:12 - INFO - __main__ - Step 74911: {'lr': 0.0002557838352536984, 'samples': 14382912, 'steps': 74910, 'loss/train': 1.622340440750122} 11/07/2021 07:44:13 - INFO - __main__ - Step 74912: {'lr': 0.00025577852992844007, 'samples': 14383104, 'steps': 74911, 'loss/train': 1.5203931331634521} 11/07/2021 07:44:13 - INFO - __main__ - Step 74913: {'lr': 0.00025577322460057804, 'samples': 14383296, 'steps': 74912, 'loss/train': 1.6526609659194946} 11/07/2021 07:44:14 - INFO - __main__ - Step 74914: {'lr': 0.00025576791927011473, 'samples': 14383488, 'steps': 74913, 'loss/train': 1.600007176399231} 11/07/2021 07:44:14 - INFO - __main__ - Step 74915: {'lr': 0.00025576261393705244, 'samples': 14383680, 'steps': 74914, 'loss/train': 1.680120587348938} 11/07/2021 07:44:15 - INFO - __main__ - Step 74916: {'lr': 0.00025575730860139364, 'samples': 14383872, 'steps': 74915, 'loss/train': 1.3600491285324097} 11/07/2021 07:44:15 - INFO - __main__ - Step 74917: {'lr': 0.0002557520032631406, 'samples': 14384064, 'steps': 74916, 'loss/train': 1.4658739566802979} 11/07/2021 07:44:16 - INFO - __main__ - Step 74918: {'lr': 0.00025574669792229586, 'samples': 14384256, 'steps': 74917, 'loss/train': 1.3173986673355103} 11/07/2021 07:44:17 - INFO - __main__ - Step 74919: {'lr': 0.0002557413925788618, 'samples': 14384448, 'steps': 74918, 'loss/train': 0.9602280855178833} 11/07/2021 07:44:17 - INFO - __main__ - Step 74920: {'lr': 0.0002557360872328407, 'samples': 14384640, 'steps': 74919, 'loss/train': 1.6625378131866455} 11/07/2021 07:44:17 - INFO - __main__ - Step 74921: {'lr': 0.000255730781884235, 'samples': 14384832, 'steps': 74920, 'loss/train': 1.3238847255706787} 11/07/2021 07:44:18 - INFO - __main__ - Step 74922: {'lr': 0.00025572547653304707, 'samples': 14385024, 'steps': 74921, 'loss/train': 1.6771743297576904} 11/07/2021 07:44:18 - INFO - __main__ - Step 74923: {'lr': 0.00025572017117927944, 'samples': 14385216, 'steps': 74922, 'loss/train': 1.3349838256835938} 11/07/2021 07:44:19 - INFO - __main__ - Step 74924: {'lr': 0.0002557148658229343, 'samples': 14385408, 'steps': 74923, 'loss/train': 1.3507565259933472} 11/07/2021 07:44:19 - INFO - __main__ - Step 74925: {'lr': 0.00025570956046401413, 'samples': 14385600, 'steps': 74924, 'loss/train': 1.1305183172225952} 11/07/2021 07:44:20 - INFO - __main__ - Step 74926: {'lr': 0.00025570425510252135, 'samples': 14385792, 'steps': 74925, 'loss/train': 1.366011619567871} 11/07/2021 07:44:20 - INFO - __main__ - Step 74927: {'lr': 0.00025569894973845824, 'samples': 14385984, 'steps': 74926, 'loss/train': 1.4455945491790771} 11/07/2021 07:44:20 - INFO - __main__ - Step 74928: {'lr': 0.00025569364437182736, 'samples': 14386176, 'steps': 74927, 'loss/train': 1.7672086954116821} 11/07/2021 07:44:21 - INFO - __main__ - Step 74929: {'lr': 0.00025568833900263104, 'samples': 14386368, 'steps': 74928, 'loss/train': 1.4350829124450684} 11/07/2021 07:44:22 - INFO - __main__ - Step 74930: {'lr': 0.00025568303363087156, 'samples': 14386560, 'steps': 74929, 'loss/train': 1.7554491758346558} 11/07/2021 07:44:22 - INFO - __main__ - Step 74931: {'lr': 0.00025567772825655147, 'samples': 14386752, 'steps': 74930, 'loss/train': 1.4978513717651367} 11/07/2021 07:44:22 - INFO - __main__ - Step 74932: {'lr': 0.00025567242287967304, 'samples': 14386944, 'steps': 74931, 'loss/train': 1.3673851490020752} 11/07/2021 07:44:23 - INFO - __main__ - Step 74933: {'lr': 0.00025566711750023865, 'samples': 14387136, 'steps': 74932, 'loss/train': 1.682854413986206} 11/07/2021 07:44:23 - INFO - __main__ - Step 74934: {'lr': 0.0002556618121182508, 'samples': 14387328, 'steps': 74933, 'loss/train': 1.4353691339492798} 11/07/2021 07:44:24 - INFO - __main__ - Step 74935: {'lr': 0.00025565650673371184, 'samples': 14387520, 'steps': 74934, 'loss/train': 1.7748124599456787} 11/07/2021 07:44:25 - INFO - __main__ - Step 74936: {'lr': 0.00025565120134662413, 'samples': 14387712, 'steps': 74935, 'loss/train': 1.8926047086715698} 11/07/2021 07:44:25 - INFO - __main__ - Step 74937: {'lr': 0.00025564589595699006, 'samples': 14387904, 'steps': 74936, 'loss/train': 1.2919124364852905} 11/07/2021 07:44:25 - INFO - __main__ - Step 74938: {'lr': 0.000255640590564812, 'samples': 14388096, 'steps': 74937, 'loss/train': 0.8674811124801636} 11/07/2021 07:44:26 - INFO - __main__ - Step 74939: {'lr': 0.0002556352851700925, 'samples': 14388288, 'steps': 74938, 'loss/train': 1.3674557209014893} 11/07/2021 07:44:27 - INFO - __main__ - Step 74940: {'lr': 0.0002556299797728337, 'samples': 14388480, 'steps': 74939, 'loss/train': 1.2908436059951782} 11/07/2021 07:44:27 - INFO - __main__ - Step 74941: {'lr': 0.0002556246743730382, 'samples': 14388672, 'steps': 74940, 'loss/train': 1.1599918603897095} 11/07/2021 07:44:27 - INFO - __main__ - Step 74942: {'lr': 0.00025561936897070827, 'samples': 14388864, 'steps': 74941, 'loss/train': 1.2716045379638672} 11/07/2021 07:44:28 - INFO - __main__ - Step 74943: {'lr': 0.00025561406356584636, 'samples': 14389056, 'steps': 74942, 'loss/train': 1.6975677013397217} 11/07/2021 07:44:28 - INFO - __main__ - Step 74944: {'lr': 0.00025560875815845485, 'samples': 14389248, 'steps': 74943, 'loss/train': 1.3565281629562378} 11/07/2021 07:44:29 - INFO - __main__ - Step 74945: {'lr': 0.00025560345274853606, 'samples': 14389440, 'steps': 74944, 'loss/train': 1.527823567390442} 11/07/2021 07:44:29 - INFO - __main__ - Step 74946: {'lr': 0.0002555981473360925, 'samples': 14389632, 'steps': 74945, 'loss/train': 1.4602187871932983} 11/07/2021 07:44:30 - INFO - __main__ - Step 74947: {'lr': 0.00025559284192112647, 'samples': 14389824, 'steps': 74946, 'loss/train': 1.0925884246826172} 11/07/2021 07:44:30 - INFO - __main__ - Step 74948: {'lr': 0.0002555875365036404, 'samples': 14390016, 'steps': 74947, 'loss/train': 1.2407453060150146} 11/07/2021 07:44:30 - INFO - __main__ - Step 74949: {'lr': 0.00025558223108363673, 'samples': 14390208, 'steps': 74948, 'loss/train': 1.4147756099700928} 11/07/2021 07:44:32 - INFO - __main__ - Step 74950: {'lr': 0.00025557692566111767, 'samples': 14390400, 'steps': 74949, 'loss/train': 0.9655868411064148} 11/07/2021 07:44:32 - INFO - __main__ - Step 74951: {'lr': 0.0002555716202360858, 'samples': 14390592, 'steps': 74950, 'loss/train': 1.7911953926086426} 11/07/2021 07:44:32 - INFO - __main__ - Step 74952: {'lr': 0.0002555663148085435, 'samples': 14390784, 'steps': 74951, 'loss/train': 1.459280252456665} 11/07/2021 07:44:33 - INFO - __main__ - Step 74953: {'lr': 0.00025556100937849295, 'samples': 14390976, 'steps': 74952, 'loss/train': 1.1382604837417603} 11/07/2021 07:44:33 - INFO - __main__ - Step 74954: {'lr': 0.0002555557039459368, 'samples': 14391168, 'steps': 74953, 'loss/train': 1.4276893138885498} 11/07/2021 07:44:34 - INFO - __main__ - Step 74955: {'lr': 0.00025555039851087735, 'samples': 14391360, 'steps': 74954, 'loss/train': 1.1061612367630005} 11/07/2021 07:44:35 - INFO - __main__ - Step 74956: {'lr': 0.00025554509307331705, 'samples': 14391552, 'steps': 74955, 'loss/train': 1.3601436614990234} 11/07/2021 07:44:35 - INFO - __main__ - Step 74957: {'lr': 0.0002555397876332581, 'samples': 14391744, 'steps': 74956, 'loss/train': 1.7193306684494019} 11/07/2021 07:44:35 - INFO - __main__ - Step 74958: {'lr': 0.00025553448219070297, 'samples': 14391936, 'steps': 74957, 'loss/train': 1.7234625816345215} 11/07/2021 07:44:36 - INFO - __main__ - Step 74959: {'lr': 0.00025552917674565414, 'samples': 14392128, 'steps': 74958, 'loss/train': 0.29082927107810974} 11/07/2021 07:44:36 - INFO - __main__ - Step 74960: {'lr': 0.00025552387129811397, 'samples': 14392320, 'steps': 74959, 'loss/train': 1.2550804615020752} 11/07/2021 07:44:36 - INFO - __main__ - Step 74961: {'lr': 0.0002555185658480848, 'samples': 14392512, 'steps': 74960, 'loss/train': 1.4446065425872803} 11/07/2021 07:44:37 - INFO - __main__ - Step 74962: {'lr': 0.00025551326039556906, 'samples': 14392704, 'steps': 74961, 'loss/train': 1.264183521270752} 11/07/2021 07:44:38 - INFO - __main__ - Step 74963: {'lr': 0.00025550795494056914, 'samples': 14392896, 'steps': 74962, 'loss/train': 1.7045433521270752} 11/07/2021 07:44:38 - INFO - __main__ - Step 74964: {'lr': 0.00025550264948308744, 'samples': 14393088, 'steps': 74963, 'loss/train': 0.9161615371704102} 11/07/2021 07:44:39 - INFO - __main__ - Step 74965: {'lr': 0.0002554973440231263, 'samples': 14393280, 'steps': 74964, 'loss/train': 1.408797025680542} 11/07/2021 07:44:39 - INFO - __main__ - Step 74966: {'lr': 0.00025549203856068813, 'samples': 14393472, 'steps': 74965, 'loss/train': 1.207392692565918} 11/07/2021 07:44:40 - INFO - __main__ - Step 74967: {'lr': 0.00025548673309577536, 'samples': 14393664, 'steps': 74966, 'loss/train': 1.486254334449768} 11/07/2021 07:44:40 - INFO - __main__ - Step 74968: {'lr': 0.00025548142762839033, 'samples': 14393856, 'steps': 74967, 'loss/train': 1.4037221670150757} 11/07/2021 07:44:41 - INFO - __main__ - Step 74969: {'lr': 0.00025547612215853544, 'samples': 14394048, 'steps': 74968, 'loss/train': 1.7189786434173584} 11/07/2021 07:44:41 - INFO - __main__ - Step 74970: {'lr': 0.0002554708166862131, 'samples': 14394240, 'steps': 74969, 'loss/train': 1.1525588035583496} 11/07/2021 07:44:41 - INFO - __main__ - Step 74971: {'lr': 0.00025546551121142575, 'samples': 14394432, 'steps': 74970, 'loss/train': 1.0569759607315063} 11/07/2021 07:44:43 - INFO - __main__ - Step 74972: {'lr': 0.00025546020573417573, 'samples': 14394624, 'steps': 74971, 'loss/train': 1.460493564605713} 11/07/2021 07:44:43 - INFO - __main__ - Step 74973: {'lr': 0.00025545490025446533, 'samples': 14394816, 'steps': 74972, 'loss/train': 1.2841482162475586} 11/07/2021 07:44:43 - INFO - __main__ - Step 74974: {'lr': 0.00025544959477229705, 'samples': 14395008, 'steps': 74973, 'loss/train': 1.6878162622451782} 11/07/2021 07:44:44 - INFO - __main__ - Step 74975: {'lr': 0.0002554442892876733, 'samples': 14395200, 'steps': 74974, 'loss/train': 2.0781774520874023} 11/07/2021 07:44:44 - INFO - __main__ - Step 74976: {'lr': 0.0002554389838005966, 'samples': 14395392, 'steps': 74975, 'loss/train': 1.4365524053573608} 11/07/2021 07:44:45 - INFO - __main__ - Step 74977: {'lr': 0.00025543367831106894, 'samples': 14395584, 'steps': 74976, 'loss/train': 0.09541544318199158} 11/07/2021 07:44:46 - INFO - __main__ - Step 74978: {'lr': 0.000255428372819093, 'samples': 14395776, 'steps': 74977, 'loss/train': 1.5755704641342163} 11/07/2021 07:44:46 - INFO - __main__ - Step 74979: {'lr': 0.0002554230673246712, 'samples': 14395968, 'steps': 74978, 'loss/train': 1.2390360832214355} 11/07/2021 07:44:46 - INFO - __main__ - Step 74980: {'lr': 0.0002554177618278057, 'samples': 14396160, 'steps': 74979, 'loss/train': 0.12447037547826767} 11/07/2021 07:44:47 - INFO - __main__ - Step 74981: {'lr': 0.0002554124563284992, 'samples': 14396352, 'steps': 74980, 'loss/train': 1.6548631191253662} 11/07/2021 07:44:47 - INFO - __main__ - Step 74982: {'lr': 0.00025540715082675384, 'samples': 14396544, 'steps': 74981, 'loss/train': 1.7366724014282227} 11/07/2021 07:44:48 - INFO - __main__ - Step 74983: {'lr': 0.0002554018453225722, 'samples': 14396736, 'steps': 74982, 'loss/train': 1.2935041189193726} 11/07/2021 07:44:48 - INFO - __main__ - Step 74984: {'lr': 0.00025539653981595644, 'samples': 14396928, 'steps': 74983, 'loss/train': 1.5375725030899048} 11/07/2021 07:44:49 - INFO - __main__ - Step 74985: {'lr': 0.0002553912343069092, 'samples': 14397120, 'steps': 74984, 'loss/train': 1.2543518543243408} 11/07/2021 07:44:49 - INFO - __main__ - Step 74986: {'lr': 0.00025538592879543266, 'samples': 14397312, 'steps': 74985, 'loss/train': 1.0417346954345703} 11/07/2021 07:44:49 - INFO - __main__ - Step 74987: {'lr': 0.00025538062328152935, 'samples': 14397504, 'steps': 74986, 'loss/train': 0.7977452278137207} 11/07/2021 07:44:50 - INFO - __main__ - Step 74988: {'lr': 0.00025537531776520164, 'samples': 14397696, 'steps': 74987, 'loss/train': 1.1973623037338257} 11/07/2021 07:44:51 - INFO - __main__ - Step 74989: {'lr': 0.00025537001224645183, 'samples': 14397888, 'steps': 74988, 'loss/train': 0.8928842544555664} 11/07/2021 07:44:51 - INFO - __main__ - Step 74990: {'lr': 0.0002553647067252824, 'samples': 14398080, 'steps': 74989, 'loss/train': 1.3350070714950562} 11/07/2021 07:44:51 - INFO - __main__ - Step 74991: {'lr': 0.0002553594012016957, 'samples': 14398272, 'steps': 74990, 'loss/train': 1.6216644048690796} 11/07/2021 07:44:52 - INFO - __main__ - Step 74992: {'lr': 0.00025535409567569416, 'samples': 14398464, 'steps': 74991, 'loss/train': 1.4980790615081787} 11/07/2021 07:44:52 - INFO - __main__ - Step 74993: {'lr': 0.00025534879014728015, 'samples': 14398656, 'steps': 74992, 'loss/train': 1.0013575553894043} 11/07/2021 07:44:53 - INFO - __main__ - Step 74994: {'lr': 0.0002553434846164561, 'samples': 14398848, 'steps': 74993, 'loss/train': 0.5513453483581543} 11/07/2021 07:44:54 - INFO - __main__ - Step 74995: {'lr': 0.0002553381790832243, 'samples': 14399040, 'steps': 74994, 'loss/train': 1.3589390516281128} 11/07/2021 07:44:54 - INFO - __main__ - Step 74996: {'lr': 0.0002553328735475872, 'samples': 14399232, 'steps': 74995, 'loss/train': 0.8499316573143005} 11/07/2021 07:44:54 - INFO - __main__ - Step 74997: {'lr': 0.0002553275680095472, 'samples': 14399424, 'steps': 74996, 'loss/train': 1.2895264625549316} 11/07/2021 07:44:55 - INFO - __main__ - Step 74998: {'lr': 0.00025532226246910666, 'samples': 14399616, 'steps': 74997, 'loss/train': 1.3814592361450195} 11/07/2021 07:44:56 - INFO - __main__ - Step 74999: {'lr': 0.00025531695692626805, 'samples': 14399808, 'steps': 74998, 'loss/train': 1.6175603866577148} 11/07/2021 07:44:56 - INFO - __main__ - Step 75000: {'lr': 0.0002553116513810337, 'samples': 14400000, 'steps': 74999, 'loss/train': 1.4593740701675415} 11/07/2021 07:44:56 - INFO - __main__ - Evaluating and saving model checkpoint 11/07/2021 07:48:08 - INFO - __main__ - Step 75000: {'loss/eval': 1.3448442220687866, 'perplexity': 3.8375887870788574} 11/07/2021 07:48:21 - WARNING - huggingface_hub.repository - Adding files tracked by Git LFS: ['log/debug_0.log']. This may take a bit of time if the files are large. 11/07/2021 07:48:26 - WARNING - huggingface_hub.repository - Several commits (5) will be pushed upstream. 11/07/2021 07:48:26 - WARNING - huggingface_hub.repository - The progress bars may be unreliable. 11/07/2021 07:48:47 - WARNING - huggingface_hub.repository - To https://huggingface.co/lvwerra/codeparrot-small a409272..c8a25e7 proud-haze-135 -> proud-haze-135 11/07/2021 07:48:48 - INFO - __main__ - Step 75001: {'lr': 0.00025530634583340587, 'samples': 14400192, 'steps': 75000, 'loss/train': 1.5137684345245361} 11/07/2021 07:48:48 - INFO - __main__ - Step 75002: {'lr': 0.0002553010402833872, 'samples': 14400384, 'steps': 75001, 'loss/train': 1.1664304733276367} 11/07/2021 07:48:48 - INFO - __main__ - Step 75003: {'lr': 0.00025529573473097994, 'samples': 14400576, 'steps': 75002, 'loss/train': 1.3288052082061768} 11/07/2021 07:48:50 - INFO - __main__ - Step 75004: {'lr': 0.0002552904291761865, 'samples': 14400768, 'steps': 75003, 'loss/train': 1.5340694189071655} 11/07/2021 07:48:50 - INFO - __main__ - Step 75005: {'lr': 0.0002552851236190093, 'samples': 14400960, 'steps': 75004, 'loss/train': 1.6407618522644043} 11/07/2021 07:48:50 - INFO - __main__ - Step 75006: {'lr': 0.0002552798180594507, 'samples': 14401152, 'steps': 75005, 'loss/train': 1.3647115230560303} 11/07/2021 07:48:51 - INFO - __main__ - Step 75007: {'lr': 0.00025527451249751306, 'samples': 14401344, 'steps': 75006, 'loss/train': 1.3930567502975464} 11/07/2021 07:48:51 - INFO - __main__ - Step 75008: {'lr': 0.00025526920693319885, 'samples': 14401536, 'steps': 75007, 'loss/train': 1.695831060409546} 11/07/2021 07:48:52 - INFO - __main__ - Step 75009: {'lr': 0.00025526390136651033, 'samples': 14401728, 'steps': 75008, 'loss/train': 1.462052583694458} 11/07/2021 07:48:52 - INFO - __main__ - Step 75010: {'lr': 0.0002552585957974501, 'samples': 14401920, 'steps': 75009, 'loss/train': 1.534427285194397} 11/07/2021 07:48:53 - INFO - __main__ - Step 75011: {'lr': 0.00025525329022602034, 'samples': 14402112, 'steps': 75010, 'loss/train': 1.4223833084106445} 11/07/2021 07:48:53 - INFO - __main__ - Step 75012: {'lr': 0.00025524798465222353, 'samples': 14402304, 'steps': 75011, 'loss/train': 0.4538053870201111} 11/07/2021 07:48:54 - INFO - __main__ - Step 75013: {'lr': 0.00025524267907606207, 'samples': 14402496, 'steps': 75012, 'loss/train': 1.5491855144500732} 11/07/2021 07:48:55 - INFO - __main__ - Step 75014: {'lr': 0.0002552373734975384, 'samples': 14402688, 'steps': 75013, 'loss/train': 1.3471547365188599} 11/07/2021 07:48:55 - INFO - __main__ - Step 75015: {'lr': 0.00025523206791665476, 'samples': 14402880, 'steps': 75014, 'loss/train': 1.34130859375} 11/07/2021 07:48:55 - INFO - __main__ - Step 75016: {'lr': 0.0002552267623334137, 'samples': 14403072, 'steps': 75015, 'loss/train': 1.8002008199691772} 11/07/2021 07:48:56 - INFO - __main__ - Step 75017: {'lr': 0.00025522145674781755, 'samples': 14403264, 'steps': 75016, 'loss/train': 1.5267422199249268} 11/07/2021 07:48:56 - INFO - __main__ - Step 75018: {'lr': 0.00025521615115986864, 'samples': 14403456, 'steps': 75017, 'loss/train': 1.475608229637146} 11/07/2021 07:48:57 - INFO - __main__ - Step 75019: {'lr': 0.0002552108455695694, 'samples': 14403648, 'steps': 75018, 'loss/train': 1.3820617198944092} 11/07/2021 07:48:57 - INFO - __main__ - Step 75020: {'lr': 0.0002552055399769223, 'samples': 14403840, 'steps': 75019, 'loss/train': 1.2927247285842896} 11/07/2021 07:48:58 - INFO - __main__ - Step 75021: {'lr': 0.0002552002343819296, 'samples': 14404032, 'steps': 75020, 'loss/train': 1.4783025979995728} 11/07/2021 07:48:58 - INFO - __main__ - Step 75022: {'lr': 0.00025519492878459376, 'samples': 14404224, 'steps': 75021, 'loss/train': 1.3821964263916016} 11/07/2021 07:48:58 - INFO - __main__ - Step 75023: {'lr': 0.00025518962318491726, 'samples': 14404416, 'steps': 75022, 'loss/train': 1.438929557800293} 11/07/2021 07:48:59 - INFO - __main__ - Step 75024: {'lr': 0.0002551843175829023, 'samples': 14404608, 'steps': 75023, 'loss/train': 1.5146628618240356} 11/07/2021 07:49:00 - INFO - __main__ - Step 75025: {'lr': 0.00025517901197855136, 'samples': 14404800, 'steps': 75024, 'loss/train': 1.057277798652649} 11/07/2021 07:49:00 - INFO - __main__ - Step 75026: {'lr': 0.0002551737063718669, 'samples': 14404992, 'steps': 75025, 'loss/train': 1.1766287088394165} 11/07/2021 07:49:00 - INFO - __main__ - Step 75027: {'lr': 0.0002551684007628512, 'samples': 14405184, 'steps': 75026, 'loss/train': 1.3668113946914673} 11/07/2021 07:49:01 - INFO - __main__ - Step 75028: {'lr': 0.0002551630951515067, 'samples': 14405376, 'steps': 75027, 'loss/train': 1.0470987558364868} 11/07/2021 07:49:01 - INFO - __main__ - Step 75029: {'lr': 0.00025515778953783577, 'samples': 14405568, 'steps': 75028, 'loss/train': 0.8479883670806885} 11/07/2021 07:49:02 - INFO - __main__ - Step 75030: {'lr': 0.00025515248392184094, 'samples': 14405760, 'steps': 75029, 'loss/train': 1.4002456665039062} 11/07/2021 07:49:03 - INFO - __main__ - Step 75031: {'lr': 0.00025514717830352435, 'samples': 14405952, 'steps': 75030, 'loss/train': 1.3999742269515991} 11/07/2021 07:49:03 - INFO - __main__ - Step 75032: {'lr': 0.0002551418726828886, 'samples': 14406144, 'steps': 75031, 'loss/train': 1.4995075464248657} 11/07/2021 07:49:03 - INFO - __main__ - Step 75033: {'lr': 0.00025513656705993595, 'samples': 14406336, 'steps': 75032, 'loss/train': 1.306180715560913} 11/07/2021 07:49:04 - INFO - __main__ - Step 75034: {'lr': 0.0002551312614346688, 'samples': 14406528, 'steps': 75033, 'loss/train': 1.6603893041610718} 11/07/2021 07:49:05 - INFO - __main__ - Step 75035: {'lr': 0.00025512595580708965, 'samples': 14406720, 'steps': 75034, 'loss/train': 1.1000033617019653} 11/07/2021 07:49:05 - INFO - __main__ - Step 75036: {'lr': 0.00025512065017720077, 'samples': 14406912, 'steps': 75035, 'loss/train': 1.231903314590454} 11/07/2021 07:49:05 - INFO - __main__ - Step 75037: {'lr': 0.0002551153445450047, 'samples': 14407104, 'steps': 75036, 'loss/train': 1.5387201309204102} 11/07/2021 07:49:06 - INFO - __main__ - Step 75038: {'lr': 0.0002551100389105037, 'samples': 14407296, 'steps': 75037, 'loss/train': 1.7729767560958862} 11/07/2021 07:49:06 - INFO - __main__ - Step 75039: {'lr': 0.00025510473327370014, 'samples': 14407488, 'steps': 75038, 'loss/train': 1.5979923009872437} 11/07/2021 07:49:07 - INFO - __main__ - Step 75040: {'lr': 0.00025509942763459647, 'samples': 14407680, 'steps': 75039, 'loss/train': 1.2217621803283691} 11/07/2021 07:49:07 - INFO - __main__ - Step 75041: {'lr': 0.00025509412199319515, 'samples': 14407872, 'steps': 75040, 'loss/train': 1.3260895013809204} 11/07/2021 07:49:08 - INFO - __main__ - Step 75042: {'lr': 0.0002550888163494984, 'samples': 14408064, 'steps': 75041, 'loss/train': 1.0729926824569702} 11/07/2021 07:49:08 - INFO - __main__ - Step 75043: {'lr': 0.00025508351070350875, 'samples': 14408256, 'steps': 75042, 'loss/train': 1.5238538980484009} 11/07/2021 07:49:08 - INFO - __main__ - Step 75044: {'lr': 0.00025507820505522866, 'samples': 14408448, 'steps': 75043, 'loss/train': 1.4273226261138916} 11/07/2021 07:49:09 - INFO - __main__ - Step 75045: {'lr': 0.0002550728994046603, 'samples': 14408640, 'steps': 75044, 'loss/train': 1.7435234785079956} 11/07/2021 07:49:10 - INFO - __main__ - Step 75046: {'lr': 0.0002550675937518062, 'samples': 14408832, 'steps': 75045, 'loss/train': 1.212339997291565} 11/07/2021 07:49:10 - INFO - __main__ - Step 75047: {'lr': 0.00025506228809666866, 'samples': 14409024, 'steps': 75046, 'loss/train': 1.8545677661895752} 11/07/2021 07:49:11 - INFO - __main__ - Step 75048: {'lr': 0.0002550569824392502, 'samples': 14409216, 'steps': 75047, 'loss/train': 1.7802128791809082} 11/07/2021 07:49:11 - INFO - __main__ - Step 75049: {'lr': 0.00025505167677955303, 'samples': 14409408, 'steps': 75048, 'loss/train': 1.4147034883499146} 11/07/2021 07:49:12 - INFO - __main__ - Step 75050: {'lr': 0.00025504637111757985, 'samples': 14409600, 'steps': 75049, 'loss/train': 1.5165399312973022} 11/07/2021 07:49:12 - INFO - __main__ - Step 75051: {'lr': 0.0002550410654533327, 'samples': 14409792, 'steps': 75050, 'loss/train': 1.2680777311325073} 11/07/2021 07:49:13 - INFO - __main__ - Step 75052: {'lr': 0.00025503575978681417, 'samples': 14409984, 'steps': 75051, 'loss/train': 1.3734363317489624} 11/07/2021 07:49:13 - INFO - __main__ - Step 75053: {'lr': 0.00025503045411802655, 'samples': 14410176, 'steps': 75052, 'loss/train': 1.2210699319839478} 11/07/2021 07:49:14 - INFO - __main__ - Step 75054: {'lr': 0.00025502514844697236, 'samples': 14410368, 'steps': 75053, 'loss/train': 1.7722004652023315} 11/07/2021 07:49:14 - INFO - __main__ - Step 75055: {'lr': 0.00025501984277365386, 'samples': 14410560, 'steps': 75054, 'loss/train': 1.7080934047698975} 11/07/2021 07:49:15 - INFO - __main__ - Step 75056: {'lr': 0.00025501453709807356, 'samples': 14410752, 'steps': 75055, 'loss/train': 1.5802572965621948} 11/07/2021 07:49:15 - INFO - __main__ - Step 75057: {'lr': 0.0002550092314202337, 'samples': 14410944, 'steps': 75056, 'loss/train': 1.5367854833602905} 11/07/2021 07:49:16 - INFO - __main__ - Step 75058: {'lr': 0.00025500392574013685, 'samples': 14411136, 'steps': 75057, 'loss/train': 1.488821029663086} 11/07/2021 07:49:16 - INFO - __main__ - Step 75059: {'lr': 0.00025499862005778527, 'samples': 14411328, 'steps': 75058, 'loss/train': 1.6385307312011719} 11/07/2021 07:49:16 - INFO - __main__ - Step 75060: {'lr': 0.0002549933143731814, 'samples': 14411520, 'steps': 75059, 'loss/train': 2.0252578258514404} 11/07/2021 07:49:18 - INFO - __main__ - Step 75061: {'lr': 0.0002549880086863276, 'samples': 14411712, 'steps': 75060, 'loss/train': 1.4907774925231934} 11/07/2021 07:49:18 - INFO - __main__ - Step 75062: {'lr': 0.00025498270299722625, 'samples': 14411904, 'steps': 75061, 'loss/train': 1.2865424156188965} 11/07/2021 07:49:18 - INFO - __main__ - Step 75063: {'lr': 0.0002549773973058798, 'samples': 14412096, 'steps': 75062, 'loss/train': 1.2531402111053467} 11/07/2021 07:49:19 - INFO - __main__ - Step 75064: {'lr': 0.0002549720916122907, 'samples': 14412288, 'steps': 75063, 'loss/train': 1.3352128267288208} 11/07/2021 07:49:19 - INFO - __main__ - Step 75065: {'lr': 0.00025496678591646117, 'samples': 14412480, 'steps': 75064, 'loss/train': 1.5312432050704956} 11/07/2021 07:49:20 - INFO - __main__ - Step 75066: {'lr': 0.00025496148021839364, 'samples': 14412672, 'steps': 75065, 'loss/train': 1.4460479021072388} 11/07/2021 07:49:20 - INFO - __main__ - Step 75067: {'lr': 0.0002549561745180906, 'samples': 14412864, 'steps': 75066, 'loss/train': 1.4789406061172485} 11/07/2021 07:49:21 - INFO - __main__ - Step 75068: {'lr': 0.0002549508688155544, 'samples': 14413056, 'steps': 75067, 'loss/train': 1.156396746635437} 11/07/2021 07:49:21 - INFO - __main__ - Step 75069: {'lr': 0.0002549455631107873, 'samples': 14413248, 'steps': 75068, 'loss/train': 1.4540361166000366} 11/07/2021 07:49:21 - INFO - __main__ - Step 75070: {'lr': 0.00025494025740379196, 'samples': 14413440, 'steps': 75069, 'loss/train': 1.3556345701217651} 11/07/2021 07:49:22 - INFO - __main__ - Step 75071: {'lr': 0.00025493495169457054, 'samples': 14413632, 'steps': 75070, 'loss/train': 1.3952528238296509} 11/07/2021 07:49:23 - INFO - __main__ - Step 75072: {'lr': 0.00025492964598312554, 'samples': 14413824, 'steps': 75071, 'loss/train': 1.614591121673584} 11/07/2021 07:49:23 - INFO - __main__ - Step 75073: {'lr': 0.00025492434026945927, 'samples': 14414016, 'steps': 75072, 'loss/train': 1.1872966289520264} 11/07/2021 07:49:24 - INFO - __main__ - Step 75074: {'lr': 0.0002549190345535742, 'samples': 14414208, 'steps': 75073, 'loss/train': 0.7759877443313599} 11/07/2021 07:49:24 - INFO - __main__ - Step 75075: {'lr': 0.00025491372883547266, 'samples': 14414400, 'steps': 75074, 'loss/train': 1.4200193881988525} 11/07/2021 07:49:25 - INFO - __main__ - Step 75076: {'lr': 0.00025490842311515704, 'samples': 14414592, 'steps': 75075, 'loss/train': 1.5643527507781982} 11/07/2021 07:49:25 - INFO - __main__ - Step 75077: {'lr': 0.0002549031173926299, 'samples': 14414784, 'steps': 75076, 'loss/train': 1.4929510354995728} 11/07/2021 07:49:26 - INFO - __main__ - Step 75078: {'lr': 0.0002548978116678934, 'samples': 14414976, 'steps': 75077, 'loss/train': 1.4276909828186035} 11/07/2021 07:49:26 - INFO - __main__ - Step 75079: {'lr': 0.00025489250594095, 'samples': 14415168, 'steps': 75078, 'loss/train': 1.8915843963623047} 11/07/2021 07:49:26 - INFO - __main__ - Step 75080: {'lr': 0.00025488720021180213, 'samples': 14415360, 'steps': 75079, 'loss/train': 1.2629667520523071} 11/07/2021 07:49:27 - INFO - __main__ - Step 75081: {'lr': 0.00025488189448045215, 'samples': 14415552, 'steps': 75080, 'loss/train': 1.6678119897842407} 11/07/2021 07:49:28 - INFO - __main__ - Step 75082: {'lr': 0.00025487658874690243, 'samples': 14415744, 'steps': 75081, 'loss/train': 1.386113166809082} 11/07/2021 07:49:28 - INFO - __main__ - Step 75083: {'lr': 0.00025487128301115547, 'samples': 14415936, 'steps': 75082, 'loss/train': 2.54746150970459} 11/07/2021 07:49:28 - INFO - __main__ - Step 75084: {'lr': 0.00025486597727321364, 'samples': 14416128, 'steps': 75083, 'loss/train': 1.8313134908676147} 11/07/2021 07:49:29 - INFO - __main__ - Step 75085: {'lr': 0.0002548606715330792, 'samples': 14416320, 'steps': 75084, 'loss/train': 1.3472117185592651} 11/07/2021 07:49:30 - INFO - __main__ - Step 75086: {'lr': 0.0002548553657907546, 'samples': 14416512, 'steps': 75085, 'loss/train': 1.1311450004577637} 11/07/2021 07:49:30 - INFO - __main__ - Step 75087: {'lr': 0.0002548500600462422, 'samples': 14416704, 'steps': 75086, 'loss/train': 1.394846796989441} 11/07/2021 07:49:31 - INFO - __main__ - Step 75088: {'lr': 0.00025484475429954454, 'samples': 14416896, 'steps': 75087, 'loss/train': 1.8206827640533447} 11/07/2021 07:49:31 - INFO - __main__ - Step 75089: {'lr': 0.0002548394485506638, 'samples': 14417088, 'steps': 75088, 'loss/train': 1.3643178939819336} 11/07/2021 07:49:31 - INFO - __main__ - Step 75090: {'lr': 0.0002548341427996026, 'samples': 14417280, 'steps': 75089, 'loss/train': 1.3859105110168457} 11/07/2021 07:49:32 - INFO - __main__ - Step 75091: {'lr': 0.0002548288370463632, 'samples': 14417472, 'steps': 75090, 'loss/train': 0.970618724822998} 11/07/2021 07:49:33 - INFO - __main__ - Step 75092: {'lr': 0.0002548235312909479, 'samples': 14417664, 'steps': 75091, 'loss/train': 1.3917912244796753} 11/07/2021 07:49:33 - INFO - __main__ - Step 75093: {'lr': 0.00025481822553335927, 'samples': 14417856, 'steps': 75092, 'loss/train': 0.7936092615127563} 11/07/2021 07:49:34 - INFO - __main__ - Step 75094: {'lr': 0.0002548129197735996, 'samples': 14418048, 'steps': 75093, 'loss/train': 1.3952999114990234} 11/07/2021 07:49:34 - INFO - __main__ - Step 75095: {'lr': 0.0002548076140116713, 'samples': 14418240, 'steps': 75094, 'loss/train': 1.377379298210144} 11/07/2021 07:49:34 - INFO - __main__ - Step 75096: {'lr': 0.0002548023082475767, 'samples': 14418432, 'steps': 75095, 'loss/train': 1.6771985292434692} 11/07/2021 07:49:35 - INFO - __main__ - Step 75097: {'lr': 0.00025479700248131845, 'samples': 14418624, 'steps': 75096, 'loss/train': 1.786522626876831} 11/07/2021 07:49:36 - INFO - __main__ - Step 75098: {'lr': 0.0002547916967128985, 'samples': 14418816, 'steps': 75097, 'loss/train': 2.0996832847595215} 11/07/2021 07:49:36 - INFO - __main__ - Step 75099: {'lr': 0.00025478639094231965, 'samples': 14419008, 'steps': 75098, 'loss/train': 1.360795259475708} 11/07/2021 07:49:36 - INFO - __main__ - Step 75100: {'lr': 0.0002547810851695841, 'samples': 14419200, 'steps': 75099, 'loss/train': 0.7729213833808899} 11/07/2021 07:49:37 - INFO - __main__ - Step 75101: {'lr': 0.0002547757793946942, 'samples': 14419392, 'steps': 75100, 'loss/train': 1.4232792854309082} 11/07/2021 07:49:38 - INFO - __main__ - Step 75102: {'lr': 0.00025477047361765245, 'samples': 14419584, 'steps': 75101, 'loss/train': 1.2934640645980835} 11/07/2021 07:49:38 - INFO - __main__ - Step 75103: {'lr': 0.00025476516783846123, 'samples': 14419776, 'steps': 75102, 'loss/train': 1.3547033071517944} 11/07/2021 07:49:38 - INFO - __main__ - Step 75104: {'lr': 0.00025475986205712286, 'samples': 14419968, 'steps': 75103, 'loss/train': 1.5986814498901367} 11/07/2021 07:49:39 - INFO - __main__ - Step 75105: {'lr': 0.0002547545562736397, 'samples': 14420160, 'steps': 75104, 'loss/train': 1.6161458492279053} 11/07/2021 07:49:39 - INFO - __main__ - Step 75106: {'lr': 0.00025474925048801436, 'samples': 14420352, 'steps': 75105, 'loss/train': 1.1496621370315552} 11/07/2021 07:49:40 - INFO - __main__ - Step 75107: {'lr': 0.000254743944700249, 'samples': 14420544, 'steps': 75106, 'loss/train': 1.8519867658615112} 11/07/2021 07:49:40 - INFO - __main__ - Step 75108: {'lr': 0.0002547386389103461, 'samples': 14420736, 'steps': 75107, 'loss/train': 1.5356873273849487} 11/07/2021 07:49:41 - INFO - __main__ - Step 75109: {'lr': 0.00025473333311830805, 'samples': 14420928, 'steps': 75108, 'loss/train': 1.3522719144821167} 11/07/2021 07:49:41 - INFO - __main__ - Step 75110: {'lr': 0.00025472802732413717, 'samples': 14421120, 'steps': 75109, 'loss/train': 1.6090214252471924} 11/07/2021 07:49:41 - INFO - __main__ - Step 75111: {'lr': 0.00025472272152783605, 'samples': 14421312, 'steps': 75110, 'loss/train': 2.019955635070801} 11/07/2021 07:49:43 - INFO - __main__ - Step 75112: {'lr': 0.0002547174157294068, 'samples': 14421504, 'steps': 75111, 'loss/train': 1.159251093864441} 11/07/2021 07:49:43 - INFO - __main__ - Step 75113: {'lr': 0.0002547121099288521, 'samples': 14421696, 'steps': 75112, 'loss/train': 0.6481761932373047} 11/07/2021 07:49:43 - INFO - __main__ - Step 75114: {'lr': 0.000254706804126174, 'samples': 14421888, 'steps': 75113, 'loss/train': 1.3331024646759033} 11/07/2021 07:49:44 - INFO - __main__ - Step 75115: {'lr': 0.00025470149832137524, 'samples': 14422080, 'steps': 75114, 'loss/train': 1.3759794235229492} 11/07/2021 07:49:44 - INFO - __main__ - Step 75116: {'lr': 0.00025469619251445804, 'samples': 14422272, 'steps': 75115, 'loss/train': 1.327857494354248} 11/07/2021 07:49:44 - INFO - __main__ - Step 75117: {'lr': 0.0002546908867054248, 'samples': 14422464, 'steps': 75116, 'loss/train': 1.2071151733398438} 11/07/2021 07:49:45 - INFO - __main__ - Step 75118: {'lr': 0.0002546855808942779, 'samples': 14422656, 'steps': 75117, 'loss/train': 1.4911444187164307} 11/07/2021 07:49:46 - INFO - __main__ - Step 75119: {'lr': 0.0002546802750810198, 'samples': 14422848, 'steps': 75118, 'loss/train': 1.8870958089828491} 11/07/2021 07:49:46 - INFO - __main__ - Step 75120: {'lr': 0.00025467496926565275, 'samples': 14423040, 'steps': 75119, 'loss/train': 1.3429406881332397} 11/07/2021 07:49:46 - INFO - __main__ - Step 75121: {'lr': 0.00025466966344817927, 'samples': 14423232, 'steps': 75120, 'loss/train': 1.3380999565124512} 11/07/2021 07:49:47 - INFO - __main__ - Step 75122: {'lr': 0.0002546643576286017, 'samples': 14423424, 'steps': 75121, 'loss/train': 1.4875000715255737} 11/07/2021 07:49:48 - INFO - __main__ - Step 75123: {'lr': 0.0002546590518069225, 'samples': 14423616, 'steps': 75122, 'loss/train': 1.129173755645752} 11/07/2021 07:49:48 - INFO - __main__ - Step 75124: {'lr': 0.00025465374598314394, 'samples': 14423808, 'steps': 75123, 'loss/train': 1.4066352844238281} 11/07/2021 07:49:49 - INFO - __main__ - Step 75125: {'lr': 0.0002546484401572685, 'samples': 14424000, 'steps': 75124, 'loss/train': 1.3117668628692627} 11/07/2021 07:49:49 - INFO - __main__ - Step 75126: {'lr': 0.00025464313432929853, 'samples': 14424192, 'steps': 75125, 'loss/train': 1.2557932138442993} 11/07/2021 07:49:49 - INFO - __main__ - Step 75127: {'lr': 0.00025463782849923644, 'samples': 14424384, 'steps': 75126, 'loss/train': 0.2818050682544708} 11/07/2021 07:49:50 - INFO - __main__ - Step 75128: {'lr': 0.0002546325226670847, 'samples': 14424576, 'steps': 75127, 'loss/train': 1.260284185409546} 11/07/2021 07:49:51 - INFO - __main__ - Step 75129: {'lr': 0.0002546272168328455, 'samples': 14424768, 'steps': 75128, 'loss/train': 1.2928186655044556} 11/07/2021 07:49:51 - INFO - __main__ - Step 75130: {'lr': 0.00025462191099652145, 'samples': 14424960, 'steps': 75129, 'loss/train': 1.8051739931106567} 11/07/2021 07:49:51 - INFO - __main__ - Step 75131: {'lr': 0.00025461660515811474, 'samples': 14425152, 'steps': 75130, 'loss/train': 1.46943998336792} 11/07/2021 07:49:52 - INFO - __main__ - Step 75132: {'lr': 0.0002546112993176279, 'samples': 14425344, 'steps': 75131, 'loss/train': 1.2136257886886597} 11/07/2021 07:49:53 - INFO - __main__ - Step 75133: {'lr': 0.00025460599347506326, 'samples': 14425536, 'steps': 75132, 'loss/train': 1.02193284034729} 11/07/2021 07:49:53 - INFO - __main__ - Step 75134: {'lr': 0.00025460068763042326, 'samples': 14425728, 'steps': 75133, 'loss/train': 1.642673134803772} 11/07/2021 07:49:54 - INFO - __main__ - Step 75135: {'lr': 0.0002545953817837102, 'samples': 14425920, 'steps': 75134, 'loss/train': 1.3274219036102295} 11/07/2021 07:49:54 - INFO - __main__ - Step 75136: {'lr': 0.0002545900759349266, 'samples': 14426112, 'steps': 75135, 'loss/train': 2.1044585704803467} 11/07/2021 07:49:54 - INFO - __main__ - Step 75137: {'lr': 0.00025458477008407477, 'samples': 14426304, 'steps': 75136, 'loss/train': 0.955132007598877} 11/07/2021 07:49:55 - INFO - __main__ - Step 75138: {'lr': 0.0002545794642311571, 'samples': 14426496, 'steps': 75137, 'loss/train': 1.7561280727386475} 11/07/2021 07:49:56 - INFO - __main__ - Step 75139: {'lr': 0.00025457415837617603, 'samples': 14426688, 'steps': 75138, 'loss/train': 1.2235559225082397} 11/07/2021 07:49:56 - INFO - __main__ - Step 75140: {'lr': 0.00025456885251913384, 'samples': 14426880, 'steps': 75139, 'loss/train': 1.394185185432434} 11/07/2021 07:49:56 - INFO - __main__ - Step 75141: {'lr': 0.00025456354666003307, 'samples': 14427072, 'steps': 75140, 'loss/train': 0.7371060252189636} 11/07/2021 07:49:57 - INFO - __main__ - Step 75142: {'lr': 0.000254558240798876, 'samples': 14427264, 'steps': 75141, 'loss/train': 1.5562312602996826} 11/07/2021 07:49:57 - INFO - __main__ - Step 75143: {'lr': 0.0002545529349356651, 'samples': 14427456, 'steps': 75142, 'loss/train': 0.20005695521831512} 11/07/2021 07:49:59 - INFO - __main__ - Step 75144: {'lr': 0.0002545476290704027, 'samples': 14427648, 'steps': 75143, 'loss/train': 1.271121859550476} 11/07/2021 07:49:59 - INFO - __main__ - Step 75145: {'lr': 0.00025454232320309115, 'samples': 14427840, 'steps': 75144, 'loss/train': 1.1660263538360596} 11/07/2021 07:49:59 - INFO - __main__ - Step 75146: {'lr': 0.00025453701733373297, 'samples': 14428032, 'steps': 75145, 'loss/train': 1.367556095123291} 11/07/2021 07:50:00 - INFO - __main__ - Step 75147: {'lr': 0.0002545317114623304, 'samples': 14428224, 'steps': 75146, 'loss/train': 0.9396241307258606} 11/07/2021 07:50:00 - INFO - __main__ - Step 75148: {'lr': 0.00025452640558888597, 'samples': 14428416, 'steps': 75147, 'loss/train': 1.5581297874450684} 11/07/2021 07:50:00 - INFO - __main__ - Step 75149: {'lr': 0.000254521099713402, 'samples': 14428608, 'steps': 75148, 'loss/train': 1.2066595554351807} 11/07/2021 07:50:01 - INFO - __main__ - Step 75150: {'lr': 0.00025451579383588084, 'samples': 14428800, 'steps': 75149, 'loss/train': 2.0244102478027344} 11/07/2021 07:50:02 - INFO - __main__ - Step 75151: {'lr': 0.0002545104879563251, 'samples': 14428992, 'steps': 75150, 'loss/train': 1.8501635789871216} 11/07/2021 07:50:02 - INFO - __main__ - Step 75152: {'lr': 0.00025450518207473683, 'samples': 14429184, 'steps': 75151, 'loss/train': 1.3559404611587524} 11/07/2021 07:50:02 - INFO - __main__ - Step 75153: {'lr': 0.0002544998761911186, 'samples': 14429376, 'steps': 75152, 'loss/train': 1.415427803993225} 11/07/2021 07:50:03 - INFO - __main__ - Step 75154: {'lr': 0.0002544945703054729, 'samples': 14429568, 'steps': 75153, 'loss/train': 0.9125350713729858} 11/07/2021 07:50:03 - INFO - __main__ - Step 75155: {'lr': 0.00025448926441780194, 'samples': 14429760, 'steps': 75154, 'loss/train': 1.5392613410949707} 11/07/2021 07:50:04 - INFO - __main__ - Step 75156: {'lr': 0.00025448395852810824, 'samples': 14429952, 'steps': 75155, 'loss/train': 0.1951521635055542} 11/07/2021 07:50:04 - INFO - __main__ - Step 75157: {'lr': 0.0002544786526363941, 'samples': 14430144, 'steps': 75156, 'loss/train': 1.2957909107208252} 11/07/2021 07:50:05 - INFO - __main__ - Step 75158: {'lr': 0.000254473346742662, 'samples': 14430336, 'steps': 75157, 'loss/train': 0.8721071481704712} 11/07/2021 07:50:05 - INFO - __main__ - Step 75159: {'lr': 0.0002544680408469142, 'samples': 14430528, 'steps': 75158, 'loss/train': 1.4504339694976807} 11/07/2021 07:50:05 - INFO - __main__ - Step 75160: {'lr': 0.00025446273494915324, 'samples': 14430720, 'steps': 75159, 'loss/train': 1.8092877864837646} 11/07/2021 07:50:06 - INFO - __main__ - Step 75161: {'lr': 0.00025445742904938134, 'samples': 14430912, 'steps': 75160, 'loss/train': 1.3004454374313354} 11/07/2021 07:50:07 - INFO - __main__ - Step 75162: {'lr': 0.00025445212314760107, 'samples': 14431104, 'steps': 75161, 'loss/train': 1.5863252878189087} 11/07/2021 07:50:07 - INFO - __main__ - Step 75163: {'lr': 0.00025444681724381475, 'samples': 14431296, 'steps': 75162, 'loss/train': 1.84196937084198} 11/07/2021 07:50:07 - INFO - __main__ - Step 75164: {'lr': 0.0002544415113380247, 'samples': 14431488, 'steps': 75163, 'loss/train': 1.7269847393035889} 11/07/2021 07:50:08 - INFO - __main__ - Step 75165: {'lr': 0.0002544362054302335, 'samples': 14431680, 'steps': 75164, 'loss/train': 1.5517381429672241} 11/07/2021 07:50:09 - INFO - __main__ - Step 75166: {'lr': 0.0002544308995204433, 'samples': 14431872, 'steps': 75165, 'loss/train': 1.5890084505081177} 11/07/2021 07:50:09 - INFO - __main__ - Step 75167: {'lr': 0.00025442559360865666, 'samples': 14432064, 'steps': 75166, 'loss/train': 0.8741254806518555} 11/07/2021 07:50:10 - INFO - __main__ - Step 75168: {'lr': 0.00025442028769487584, 'samples': 14432256, 'steps': 75167, 'loss/train': 1.318655252456665} 11/07/2021 07:50:10 - INFO - __main__ - Step 75169: {'lr': 0.0002544149817791034, 'samples': 14432448, 'steps': 75168, 'loss/train': 1.638075351715088} 11/07/2021 07:50:10 - INFO - __main__ - Step 75170: {'lr': 0.00025440967586134154, 'samples': 14432640, 'steps': 75169, 'loss/train': 1.3312873840332031} 11/07/2021 07:50:11 - INFO - __main__ - Step 75171: {'lr': 0.00025440436994159283, 'samples': 14432832, 'steps': 75170, 'loss/train': 1.6674275398254395} 11/07/2021 07:50:12 - INFO - __main__ - Step 75172: {'lr': 0.00025439906401985955, 'samples': 14433024, 'steps': 75171, 'loss/train': 1.527146339416504} 11/07/2021 07:50:12 - INFO - __main__ - Step 75173: {'lr': 0.00025439375809614413, 'samples': 14433216, 'steps': 75172, 'loss/train': 0.9317908883094788} 11/07/2021 07:50:12 - INFO - __main__ - Step 75174: {'lr': 0.0002543884521704489, 'samples': 14433408, 'steps': 75173, 'loss/train': 1.3759413957595825} 11/07/2021 07:50:13 - INFO - __main__ - Step 75175: {'lr': 0.00025438314624277636, 'samples': 14433600, 'steps': 75174, 'loss/train': 1.429387092590332} 11/07/2021 07:50:14 - INFO - __main__ - Step 75176: {'lr': 0.00025437784031312883, 'samples': 14433792, 'steps': 75175, 'loss/train': 1.0877965688705444} 11/07/2021 07:50:14 - INFO - __main__ - Step 75177: {'lr': 0.0002543725343815087, 'samples': 14433984, 'steps': 75176, 'loss/train': 1.4481024742126465} 11/07/2021 07:50:14 - INFO - __main__ - Step 75178: {'lr': 0.00025436722844791843, 'samples': 14434176, 'steps': 75177, 'loss/train': 1.7255430221557617} 11/07/2021 07:50:15 - INFO - __main__ - Step 75179: {'lr': 0.00025436192251236027, 'samples': 14434368, 'steps': 75178, 'loss/train': 1.5657269954681396} 11/07/2021 07:50:15 - INFO - __main__ - Step 75180: {'lr': 0.0002543566165748367, 'samples': 14434560, 'steps': 75179, 'loss/train': 0.7702858448028564} 11/07/2021 07:50:16 - INFO - __main__ - Step 75181: {'lr': 0.00025435131063535017, 'samples': 14434752, 'steps': 75180, 'loss/train': 1.9455881118774414} 11/07/2021 07:50:17 - INFO - __main__ - Step 75182: {'lr': 0.00025434600469390295, 'samples': 14434944, 'steps': 75181, 'loss/train': 1.7303569316864014} 11/07/2021 07:50:17 - INFO - __main__ - Step 75183: {'lr': 0.00025434069875049755, 'samples': 14435136, 'steps': 75182, 'loss/train': 1.123862624168396} 11/07/2021 07:50:17 - INFO - __main__ - Step 75184: {'lr': 0.00025433539280513625, 'samples': 14435328, 'steps': 75183, 'loss/train': 1.3523306846618652} 11/07/2021 07:50:18 - INFO - __main__ - Step 75185: {'lr': 0.0002543300868578215, 'samples': 14435520, 'steps': 75184, 'loss/train': 1.3978761434555054} 11/07/2021 07:50:18 - INFO - __main__ - Step 75186: {'lr': 0.0002543247809085557, 'samples': 14435712, 'steps': 75185, 'loss/train': 1.583154559135437} 11/07/2021 07:50:19 - INFO - __main__ - Step 75187: {'lr': 0.00025431947495734117, 'samples': 14435904, 'steps': 75186, 'loss/train': 1.630918025970459} 11/07/2021 07:50:20 - INFO - __main__ - Step 75188: {'lr': 0.00025431416900418034, 'samples': 14436096, 'steps': 75187, 'loss/train': 1.385058879852295} 11/07/2021 07:50:20 - INFO - __main__ - Step 75189: {'lr': 0.0002543088630490757, 'samples': 14436288, 'steps': 75188, 'loss/train': 1.663903832435608} 11/07/2021 07:50:20 - INFO - __main__ - Step 75190: {'lr': 0.00025430355709202946, 'samples': 14436480, 'steps': 75189, 'loss/train': 1.7207449674606323} 11/07/2021 07:50:21 - INFO - __main__ - Step 75191: {'lr': 0.00025429825113304423, 'samples': 14436672, 'steps': 75190, 'loss/train': 0.9398420453071594} 11/07/2021 07:50:21 - INFO - __main__ - Step 75192: {'lr': 0.00025429294517212214, 'samples': 14436864, 'steps': 75191, 'loss/train': 1.4023085832595825} 11/07/2021 07:50:22 - INFO - __main__ - Step 75193: {'lr': 0.00025428763920926577, 'samples': 14437056, 'steps': 75192, 'loss/train': 1.257500410079956} 11/07/2021 07:50:22 - INFO - __main__ - Step 75194: {'lr': 0.00025428233324447747, 'samples': 14437248, 'steps': 75193, 'loss/train': 1.4186228513717651} 11/07/2021 07:50:23 - INFO - __main__ - Step 75195: {'lr': 0.0002542770272777596, 'samples': 14437440, 'steps': 75194, 'loss/train': 1.7579412460327148} 11/07/2021 07:50:23 - INFO - __main__ - Step 75196: {'lr': 0.0002542717213091145, 'samples': 14437632, 'steps': 75195, 'loss/train': 1.3093647956848145} 11/07/2021 07:50:23 - INFO - __main__ - Step 75197: {'lr': 0.0002542664153385447, 'samples': 14437824, 'steps': 75196, 'loss/train': 0.13691353797912598} 11/07/2021 07:50:25 - INFO - __main__ - Step 75198: {'lr': 0.00025426110936605255, 'samples': 14438016, 'steps': 75197, 'loss/train': 1.8047573566436768} 11/07/2021 07:50:25 - INFO - __main__ - Step 75199: {'lr': 0.0002542558033916404, 'samples': 14438208, 'steps': 75198, 'loss/train': 1.2950507402420044} 11/07/2021 07:50:25 - INFO - __main__ - Step 75200: {'lr': 0.0002542504974153106, 'samples': 14438400, 'steps': 75199, 'loss/train': 2.0213911533355713} 11/07/2021 07:50:26 - INFO - __main__ - Step 75201: {'lr': 0.0002542451914370656, 'samples': 14438592, 'steps': 75200, 'loss/train': 1.5069563388824463} 11/07/2021 07:50:26 - INFO - __main__ - Step 75202: {'lr': 0.0002542398854569078, 'samples': 14438784, 'steps': 75201, 'loss/train': 1.3524729013442993} 11/07/2021 07:50:27 - INFO - __main__ - Step 75203: {'lr': 0.0002542345794748396, 'samples': 14438976, 'steps': 75202, 'loss/train': 1.6968951225280762} 11/07/2021 07:50:27 - INFO - __main__ - Step 75204: {'lr': 0.0002542292734908633, 'samples': 14439168, 'steps': 75203, 'loss/train': 1.5059179067611694} 11/07/2021 07:50:28 - INFO - __main__ - Step 75205: {'lr': 0.00025422396750498144, 'samples': 14439360, 'steps': 75204, 'loss/train': 1.4529176950454712} 11/07/2021 07:50:28 - INFO - __main__ - Step 75206: {'lr': 0.00025421866151719623, 'samples': 14439552, 'steps': 75205, 'loss/train': 1.4231181144714355} 11/07/2021 07:50:28 - INFO - __main__ - Step 75207: {'lr': 0.00025421335552751025, 'samples': 14439744, 'steps': 75206, 'loss/train': 1.616113543510437} 11/07/2021 07:50:30 - INFO - __main__ - Step 75208: {'lr': 0.00025420804953592567, 'samples': 14439936, 'steps': 75207, 'loss/train': 1.0851155519485474} 11/07/2021 07:50:30 - INFO - __main__ - Step 75209: {'lr': 0.0002542027435424451, 'samples': 14440128, 'steps': 75208, 'loss/train': 1.3371527194976807} 11/07/2021 07:50:30 - INFO - __main__ - Step 75210: {'lr': 0.00025419743754707085, 'samples': 14440320, 'steps': 75209, 'loss/train': 1.4237802028656006} 11/07/2021 07:50:31 - INFO - __main__ - Step 75211: {'lr': 0.00025419213154980526, 'samples': 14440512, 'steps': 75210, 'loss/train': 1.2495518922805786} 11/07/2021 07:50:31 - INFO - __main__ - Step 75212: {'lr': 0.00025418682555065084, 'samples': 14440704, 'steps': 75211, 'loss/train': 1.8150877952575684} 11/07/2021 07:50:32 - INFO - __main__ - Step 75213: {'lr': 0.00025418151954960985, 'samples': 14440896, 'steps': 75212, 'loss/train': 1.3014488220214844} 11/07/2021 07:50:32 - INFO - __main__ - Step 75214: {'lr': 0.0002541762135466847, 'samples': 14441088, 'steps': 75213, 'loss/train': 1.3114720582962036} 11/07/2021 07:50:33 - INFO - __main__ - Step 75215: {'lr': 0.00025417090754187776, 'samples': 14441280, 'steps': 75214, 'loss/train': 0.9833541512489319} 11/07/2021 07:50:33 - INFO - __main__ - Step 75216: {'lr': 0.0002541656015351916, 'samples': 14441472, 'steps': 75215, 'loss/train': 1.5023857355117798} 11/07/2021 07:50:33 - INFO - __main__ - Step 75217: {'lr': 0.0002541602955266284, 'samples': 14441664, 'steps': 75216, 'loss/train': 1.742464542388916} 11/07/2021 07:50:34 - INFO - __main__ - Step 75218: {'lr': 0.0002541549895161907, 'samples': 14441856, 'steps': 75217, 'loss/train': 1.2699897289276123} 11/07/2021 07:50:35 - INFO - __main__ - Step 75219: {'lr': 0.0002541496835038808, 'samples': 14442048, 'steps': 75218, 'loss/train': 1.5839619636535645} 11/07/2021 07:50:35 - INFO - __main__ - Step 75220: {'lr': 0.00025414437748970105, 'samples': 14442240, 'steps': 75219, 'loss/train': 0.8674119710922241} 11/07/2021 07:50:35 - INFO - __main__ - Step 75221: {'lr': 0.00025413907147365394, 'samples': 14442432, 'steps': 75220, 'loss/train': 1.7704060077667236} 11/07/2021 07:50:36 - INFO - __main__ - Step 75222: {'lr': 0.00025413376545574184, 'samples': 14442624, 'steps': 75221, 'loss/train': 1.376041293144226} 11/07/2021 07:50:37 - INFO - __main__ - Step 75223: {'lr': 0.0002541284594359672, 'samples': 14442816, 'steps': 75222, 'loss/train': 1.5928291082382202} 11/07/2021 07:50:37 - INFO - __main__ - Step 75224: {'lr': 0.0002541231534143322, 'samples': 14443008, 'steps': 75223, 'loss/train': 2.085923910140991} 11/07/2021 07:50:37 - INFO - __main__ - Step 75225: {'lr': 0.00025411784739083957, 'samples': 14443200, 'steps': 75224, 'loss/train': 1.0336072444915771} 11/07/2021 07:50:38 - INFO - __main__ - Step 75226: {'lr': 0.00025411254136549136, 'samples': 14443392, 'steps': 75225, 'loss/train': 1.5868042707443237} 11/07/2021 07:50:38 - INFO - __main__ - Step 75227: {'lr': 0.0002541072353382901, 'samples': 14443584, 'steps': 75226, 'loss/train': 1.6916548013687134} 11/07/2021 07:50:39 - INFO - __main__ - Step 75228: {'lr': 0.0002541019293092382, 'samples': 14443776, 'steps': 75227, 'loss/train': 1.5564299821853638} 11/07/2021 07:50:40 - INFO - __main__ - Step 75229: {'lr': 0.00025409662327833805, 'samples': 14443968, 'steps': 75228, 'loss/train': 1.0134706497192383} 11/07/2021 07:50:40 - INFO - __main__ - Step 75230: {'lr': 0.00025409131724559196, 'samples': 14444160, 'steps': 75229, 'loss/train': 1.1206692457199097} 11/07/2021 07:50:40 - INFO - __main__ - Step 75231: {'lr': 0.00025408601121100244, 'samples': 14444352, 'steps': 75230, 'loss/train': 1.4854075908660889} 11/07/2021 07:50:41 - INFO - __main__ - Step 75232: {'lr': 0.0002540807051745719, 'samples': 14444544, 'steps': 75231, 'loss/train': 1.3208720684051514} 11/07/2021 07:50:41 - INFO - __main__ - Step 75233: {'lr': 0.00025407539913630255, 'samples': 14444736, 'steps': 75232, 'loss/train': 1.9849601984024048} 11/07/2021 07:50:42 - INFO - __main__ - Step 75234: {'lr': 0.00025407009309619694, 'samples': 14444928, 'steps': 75233, 'loss/train': 1.4564976692199707} 11/07/2021 07:50:42 - INFO - __main__ - Step 75235: {'lr': 0.0002540647870542574, 'samples': 14445120, 'steps': 75234, 'loss/train': 0.96142578125} 11/07/2021 07:50:43 - INFO - __main__ - Step 75236: {'lr': 0.0002540594810104863, 'samples': 14445312, 'steps': 75235, 'loss/train': 1.8914766311645508} 11/07/2021 07:50:43 - INFO - __main__ - Step 75237: {'lr': 0.0002540541749648861, 'samples': 14445504, 'steps': 75236, 'loss/train': 1.0706253051757812} 11/07/2021 07:50:43 - INFO - __main__ - Step 75238: {'lr': 0.0002540488689174591, 'samples': 14445696, 'steps': 75237, 'loss/train': 1.3477199077606201} 11/07/2021 07:50:44 - INFO - __main__ - Step 75239: {'lr': 0.0002540435628682078, 'samples': 14445888, 'steps': 75238, 'loss/train': 0.5572924017906189} 11/07/2021 07:50:45 - INFO - __main__ - Step 75240: {'lr': 0.0002540382568171345, 'samples': 14446080, 'steps': 75239, 'loss/train': 1.163404941558838} 11/07/2021 07:50:45 - INFO - __main__ - Step 75241: {'lr': 0.00025403295076424165, 'samples': 14446272, 'steps': 75240, 'loss/train': 1.7086423635482788} 11/07/2021 07:50:45 - INFO - __main__ - Step 75242: {'lr': 0.0002540276447095316, 'samples': 14446464, 'steps': 75241, 'loss/train': 1.6762926578521729} 11/07/2021 07:50:46 - INFO - __main__ - Step 75243: {'lr': 0.00025402233865300675, 'samples': 14446656, 'steps': 75242, 'loss/train': 0.9899435639381409} 11/07/2021 07:50:47 - INFO - __main__ - Step 75244: {'lr': 0.00025401703259466947, 'samples': 14446848, 'steps': 75243, 'loss/train': 0.6803355813026428} 11/07/2021 07:50:47 - INFO - __main__ - Step 75245: {'lr': 0.0002540117265345223, 'samples': 14447040, 'steps': 75244, 'loss/train': 1.4904367923736572} 11/07/2021 07:50:47 - INFO - __main__ - Step 75246: {'lr': 0.00025400642047256733, 'samples': 14447232, 'steps': 75245, 'loss/train': 1.0133156776428223} 11/07/2021 07:50:48 - INFO - __main__ - Step 75247: {'lr': 0.00025400111440880725, 'samples': 14447424, 'steps': 75246, 'loss/train': 1.4283314943313599} 11/07/2021 07:50:48 - INFO - __main__ - Step 75248: {'lr': 0.00025399580834324425, 'samples': 14447616, 'steps': 75247, 'loss/train': 1.1937611103057861} 11/07/2021 07:50:49 - INFO - __main__ - Step 75249: {'lr': 0.0002539905022758808, 'samples': 14447808, 'steps': 75248, 'loss/train': 1.5878654718399048} 11/07/2021 07:50:50 - INFO - __main__ - Step 75250: {'lr': 0.0002539851962067194, 'samples': 14448000, 'steps': 75249, 'loss/train': 1.6973320245742798} 11/07/2021 07:50:50 - INFO - __main__ - Step 75251: {'lr': 0.00025397989013576223, 'samples': 14448192, 'steps': 75250, 'loss/train': 1.267516851425171} 11/07/2021 07:50:50 - INFO - __main__ - Step 75252: {'lr': 0.0002539745840630119, 'samples': 14448384, 'steps': 75251, 'loss/train': 1.4600427150726318} 11/07/2021 07:50:51 - INFO - __main__ - Step 75253: {'lr': 0.00025396927798847056, 'samples': 14448576, 'steps': 75252, 'loss/train': 1.4877049922943115} 11/07/2021 07:50:52 - INFO - __main__ - Step 75254: {'lr': 0.0002539639719121408, 'samples': 14448768, 'steps': 75253, 'loss/train': 1.2507798671722412} 11/07/2021 07:50:52 - INFO - __main__ - Step 75255: {'lr': 0.00025395866583402483, 'samples': 14448960, 'steps': 75254, 'loss/train': 1.3463987112045288} 11/07/2021 07:50:52 - INFO - __main__ - Step 75256: {'lr': 0.00025395335975412527, 'samples': 14449152, 'steps': 75255, 'loss/train': 1.2155287265777588} 11/07/2021 07:50:53 - INFO - __main__ - Step 75257: {'lr': 0.00025394805367244435, 'samples': 14449344, 'steps': 75256, 'loss/train': 1.1235257387161255} 11/07/2021 07:50:53 - INFO - __main__ - Step 75258: {'lr': 0.0002539427475889845, 'samples': 14449536, 'steps': 75257, 'loss/train': 1.3952895402908325} 11/07/2021 07:50:54 - INFO - __main__ - Step 75259: {'lr': 0.00025393744150374804, 'samples': 14449728, 'steps': 75258, 'loss/train': 1.3139728307724} 11/07/2021 07:50:55 - INFO - __main__ - Step 75260: {'lr': 0.0002539321354167375, 'samples': 14449920, 'steps': 75259, 'loss/train': 1.1572433710098267} 11/07/2021 07:50:55 - INFO - __main__ - Step 75261: {'lr': 0.0002539268293279552, 'samples': 14450112, 'steps': 75260, 'loss/train': 1.5685847997665405} 11/07/2021 07:50:55 - INFO - __main__ - Step 75262: {'lr': 0.00025392152323740354, 'samples': 14450304, 'steps': 75261, 'loss/train': 1.33869469165802} 11/07/2021 07:50:56 - INFO - __main__ - Step 75263: {'lr': 0.0002539162171450848, 'samples': 14450496, 'steps': 75262, 'loss/train': 1.570290207862854} 11/07/2021 07:50:56 - INFO - __main__ - Step 75264: {'lr': 0.0002539109110510016, 'samples': 14450688, 'steps': 75263, 'loss/train': 1.0683108568191528} 11/07/2021 07:50:57 - INFO - __main__ - Step 75265: {'lr': 0.00025390560495515614, 'samples': 14450880, 'steps': 75264, 'loss/train': 1.2988804578781128} 11/07/2021 07:50:57 - INFO - __main__ - Step 75266: {'lr': 0.0002539002988575509, 'samples': 14451072, 'steps': 75265, 'loss/train': 1.6981645822525024} 11/07/2021 07:50:58 - INFO - __main__ - Step 75267: {'lr': 0.0002538949927581882, 'samples': 14451264, 'steps': 75266, 'loss/train': 1.396681785583496} 11/07/2021 07:50:58 - INFO - __main__ - Step 75268: {'lr': 0.0002538896866570706, 'samples': 14451456, 'steps': 75267, 'loss/train': 2.231861114501953} 11/07/2021 07:50:58 - INFO - __main__ - Step 75269: {'lr': 0.0002538843805542002, 'samples': 14451648, 'steps': 75268, 'loss/train': 1.069106101989746} 11/07/2021 07:50:59 - INFO - __main__ - Step 75270: {'lr': 0.0002538790744495796, 'samples': 14451840, 'steps': 75269, 'loss/train': 1.0780051946640015} 11/07/2021 07:51:00 - INFO - __main__ - Step 75271: {'lr': 0.00025387376834321127, 'samples': 14452032, 'steps': 75270, 'loss/train': 1.0574480295181274} 11/07/2021 07:51:00 - INFO - __main__ - Step 75272: {'lr': 0.00025386846223509734, 'samples': 14452224, 'steps': 75271, 'loss/train': 1.5367437601089478} 11/07/2021 07:51:00 - INFO - __main__ - Step 75273: {'lr': 0.00025386315612524045, 'samples': 14452416, 'steps': 75272, 'loss/train': 1.3843334913253784} 11/07/2021 07:51:01 - INFO - __main__ - Step 75274: {'lr': 0.0002538578500136428, 'samples': 14452608, 'steps': 75273, 'loss/train': 1.1913654804229736} 11/07/2021 07:51:02 - INFO - __main__ - Step 75275: {'lr': 0.0002538525439003069, 'samples': 14452800, 'steps': 75274, 'loss/train': 1.1598749160766602} 11/07/2021 07:51:02 - INFO - __main__ - Step 75276: {'lr': 0.0002538472377852351, 'samples': 14452992, 'steps': 75275, 'loss/train': 1.3387929201126099} 11/07/2021 07:51:03 - INFO - __main__ - Step 75277: {'lr': 0.0002538419316684298, 'samples': 14453184, 'steps': 75276, 'loss/train': 0.5017039179801941} 11/07/2021 07:51:03 - INFO - __main__ - Step 75278: {'lr': 0.00025383662554989337, 'samples': 14453376, 'steps': 75277, 'loss/train': 1.3267982006072998} 11/07/2021 07:51:03 - INFO - __main__ - Step 75279: {'lr': 0.00025383131942962825, 'samples': 14453568, 'steps': 75278, 'loss/train': 1.444012999534607} 11/07/2021 07:51:04 - INFO - __main__ - Step 75280: {'lr': 0.0002538260133076367, 'samples': 14453760, 'steps': 75279, 'loss/train': 1.8397247791290283} 11/07/2021 07:51:05 - INFO - __main__ - Step 75281: {'lr': 0.0002538207071839213, 'samples': 14453952, 'steps': 75280, 'loss/train': 1.3673542737960815} 11/07/2021 07:51:05 - INFO - __main__ - Step 75282: {'lr': 0.0002538154010584843, 'samples': 14454144, 'steps': 75281, 'loss/train': 1.249070644378662} 11/07/2021 07:51:05 - INFO - __main__ - Step 75283: {'lr': 0.00025381009493132814, 'samples': 14454336, 'steps': 75282, 'loss/train': 1.1884937286376953} 11/07/2021 07:51:06 - INFO - __main__ - Step 75284: {'lr': 0.0002538047888024552, 'samples': 14454528, 'steps': 75283, 'loss/train': 1.448066234588623} 11/07/2021 07:51:06 - INFO - __main__ - Step 75285: {'lr': 0.00025379948267186794, 'samples': 14454720, 'steps': 75284, 'loss/train': 1.2970176935195923} 11/07/2021 07:51:07 - INFO - __main__ - Step 75286: {'lr': 0.0002537941765395687, 'samples': 14454912, 'steps': 75285, 'loss/train': 1.4570682048797607} 11/07/2021 07:51:07 - INFO - __main__ - Step 75287: {'lr': 0.0002537888704055598, 'samples': 14455104, 'steps': 75286, 'loss/train': 2.0388216972351074} 11/07/2021 07:51:08 - INFO - __main__ - Step 75288: {'lr': 0.00025378356426984373, 'samples': 14455296, 'steps': 75287, 'loss/train': 1.1445907354354858} 11/07/2021 07:51:08 - INFO - __main__ - Step 75289: {'lr': 0.0002537782581324228, 'samples': 14455488, 'steps': 75288, 'loss/train': 1.301590919494629} 11/07/2021 07:51:08 - INFO - __main__ - Step 75290: {'lr': 0.0002537729519932995, 'samples': 14455680, 'steps': 75289, 'loss/train': 1.4951171875} 11/07/2021 07:51:10 - INFO - __main__ - Step 75291: {'lr': 0.00025376764585247606, 'samples': 14455872, 'steps': 75290, 'loss/train': 1.258278727531433} 11/07/2021 07:51:10 - INFO - __main__ - Step 75292: {'lr': 0.00025376233970995514, 'samples': 14456064, 'steps': 75291, 'loss/train': 1.7683035135269165} 11/07/2021 07:51:10 - INFO - __main__ - Step 75293: {'lr': 0.00025375703356573886, 'samples': 14456256, 'steps': 75292, 'loss/train': 1.2886078357696533} 11/07/2021 07:51:11 - INFO - __main__ - Step 75294: {'lr': 0.00025375172741982975, 'samples': 14456448, 'steps': 75293, 'loss/train': 1.655763030052185} 11/07/2021 07:51:11 - INFO - __main__ - Step 75295: {'lr': 0.0002537464212722302, 'samples': 14456640, 'steps': 75294, 'loss/train': 1.6612521409988403} 11/07/2021 07:51:12 - INFO - __main__ - Step 75296: {'lr': 0.0002537411151229425, 'samples': 14456832, 'steps': 75295, 'loss/train': 0.7257407307624817} 11/07/2021 07:51:12 - INFO - __main__ - Step 75297: {'lr': 0.0002537358089719691, 'samples': 14457024, 'steps': 75296, 'loss/train': 1.3316785097122192} 11/07/2021 07:51:13 - INFO - __main__ - Step 75298: {'lr': 0.00025373050281931247, 'samples': 14457216, 'steps': 75297, 'loss/train': 1.4083118438720703} 11/07/2021 07:51:13 - INFO - __main__ - Step 75299: {'lr': 0.00025372519666497494, 'samples': 14457408, 'steps': 75298, 'loss/train': 1.6546144485473633} 11/07/2021 07:51:13 - INFO - __main__ - Step 75300: {'lr': 0.0002537198905089589, 'samples': 14457600, 'steps': 75299, 'loss/train': 1.0418199300765991} 11/07/2021 07:51:14 - INFO - __main__ - Step 75301: {'lr': 0.00025371458435126664, 'samples': 14457792, 'steps': 75300, 'loss/train': 1.0069066286087036} 11/07/2021 07:51:15 - INFO - __main__ - Step 75302: {'lr': 0.0002537092781919007, 'samples': 14457984, 'steps': 75301, 'loss/train': 1.2811925411224365} 11/07/2021 07:51:15 - INFO - __main__ - Step 75303: {'lr': 0.00025370397203086344, 'samples': 14458176, 'steps': 75302, 'loss/train': 1.3440598249435425} 11/07/2021 07:51:15 - INFO - __main__ - Step 75304: {'lr': 0.0002536986658681572, 'samples': 14458368, 'steps': 75303, 'loss/train': 2.1501502990722656} 11/07/2021 07:51:16 - INFO - __main__ - Step 75305: {'lr': 0.0002536933597037844, 'samples': 14458560, 'steps': 75304, 'loss/train': 1.113883137702942} 11/07/2021 07:51:16 - INFO - __main__ - Step 75306: {'lr': 0.0002536880535377475, 'samples': 14458752, 'steps': 75305, 'loss/train': 1.6043601036071777} 11/07/2021 07:51:17 - INFO - __main__ - Step 75307: {'lr': 0.0002536827473700487, 'samples': 14458944, 'steps': 75306, 'loss/train': 1.5248668193817139} 11/07/2021 07:51:17 - INFO - __main__ - Step 75308: {'lr': 0.00025367744120069057, 'samples': 14459136, 'steps': 75307, 'loss/train': 1.342020034790039} 11/07/2021 07:51:18 - INFO - __main__ - Step 75309: {'lr': 0.00025367213502967546, 'samples': 14459328, 'steps': 75308, 'loss/train': 1.4774293899536133} 11/07/2021 07:51:18 - INFO - __main__ - Step 75310: {'lr': 0.0002536668288570057, 'samples': 14459520, 'steps': 75309, 'loss/train': 1.3770488500595093} 11/07/2021 07:51:18 - INFO - __main__ - Step 75311: {'lr': 0.0002536615226826837, 'samples': 14459712, 'steps': 75310, 'loss/train': 1.434457540512085} 11/07/2021 07:51:19 - INFO - __main__ - Step 75312: {'lr': 0.000253656216506712, 'samples': 14459904, 'steps': 75311, 'loss/train': 1.6141371726989746} 11/07/2021 07:51:20 - INFO - __main__ - Step 75313: {'lr': 0.00025365091032909277, 'samples': 14460096, 'steps': 75312, 'loss/train': 1.158160924911499} 11/07/2021 07:51:20 - INFO - __main__ - Step 75314: {'lr': 0.0002536456041498285, 'samples': 14460288, 'steps': 75313, 'loss/train': 1.7349166870117188} 11/07/2021 07:51:21 - INFO - __main__ - Step 75315: {'lr': 0.0002536402979689216, 'samples': 14460480, 'steps': 75314, 'loss/train': 1.1256468296051025} 11/07/2021 07:51:21 - INFO - __main__ - Step 75316: {'lr': 0.0002536349917863744, 'samples': 14460672, 'steps': 75315, 'loss/train': 1.146972894668579} 11/07/2021 07:51:22 - INFO - __main__ - Step 75317: {'lr': 0.00025362968560218934, 'samples': 14460864, 'steps': 75316, 'loss/train': 1.5113364458084106} 11/07/2021 07:51:22 - INFO - __main__ - Step 75318: {'lr': 0.00025362437941636886, 'samples': 14461056, 'steps': 75317, 'loss/train': 1.512551188468933} 11/07/2021 07:51:23 - INFO - __main__ - Step 75319: {'lr': 0.00025361907322891524, 'samples': 14461248, 'steps': 75318, 'loss/train': 1.7531845569610596} 11/07/2021 07:51:23 - INFO - __main__ - Step 75320: {'lr': 0.000253613767039831, 'samples': 14461440, 'steps': 75319, 'loss/train': 1.2932689189910889} 11/07/2021 07:51:23 - INFO - __main__ - Step 75321: {'lr': 0.0002536084608491183, 'samples': 14461632, 'steps': 75320, 'loss/train': 1.5140074491500854} 11/07/2021 07:51:24 - INFO - __main__ - Step 75322: {'lr': 0.00025360315465677976, 'samples': 14461824, 'steps': 75321, 'loss/train': 1.3509396314620972} 11/07/2021 07:51:25 - INFO - __main__ - Step 75323: {'lr': 0.0002535978484628177, 'samples': 14462016, 'steps': 75322, 'loss/train': 1.3715623617172241} 11/07/2021 07:51:25 - INFO - __main__ - Step 75324: {'lr': 0.0002535925422672345, 'samples': 14462208, 'steps': 75323, 'loss/train': 2.1865108013153076} 11/07/2021 07:51:25 - INFO - __main__ - Step 75325: {'lr': 0.00025358723607003255, 'samples': 14462400, 'steps': 75324, 'loss/train': 2.819215774536133} 11/07/2021 07:51:26 - INFO - __main__ - Step 75326: {'lr': 0.0002535819298712143, 'samples': 14462592, 'steps': 75325, 'loss/train': 1.3442507982254028} 11/07/2021 07:51:27 - INFO - __main__ - Step 75327: {'lr': 0.00025357662367078205, 'samples': 14462784, 'steps': 75326, 'loss/train': 0.9802465438842773} 11/07/2021 07:51:27 - INFO - __main__ - Step 75328: {'lr': 0.00025357131746873816, 'samples': 14462976, 'steps': 75327, 'loss/train': 1.8458515405654907} 11/07/2021 07:51:28 - INFO - __main__ - Step 75329: {'lr': 0.00025356601126508516, 'samples': 14463168, 'steps': 75328, 'loss/train': 1.2715280055999756} 11/07/2021 07:51:28 - INFO - __main__ - Step 75330: {'lr': 0.00025356070505982536, 'samples': 14463360, 'steps': 75329, 'loss/train': 1.223690152168274} 11/07/2021 07:51:28 - INFO - __main__ - Step 75331: {'lr': 0.00025355539885296116, 'samples': 14463552, 'steps': 75330, 'loss/train': 1.1072156429290771} 11/07/2021 07:51:29 - INFO - __main__ - Step 75332: {'lr': 0.0002535500926444949, 'samples': 14463744, 'steps': 75331, 'loss/train': 1.6381170749664307} 11/07/2021 07:51:30 - INFO - __main__ - Step 75333: {'lr': 0.0002535447864344291, 'samples': 14463936, 'steps': 75332, 'loss/train': 1.3503001928329468} 11/07/2021 07:51:30 - INFO - __main__ - Step 75334: {'lr': 0.00025353948022276607, 'samples': 14464128, 'steps': 75333, 'loss/train': 1.595320224761963} 11/07/2021 07:51:30 - INFO - __main__ - Step 75335: {'lr': 0.0002535341740095082, 'samples': 14464320, 'steps': 75334, 'loss/train': 1.3659533262252808} 11/07/2021 07:51:31 - INFO - __main__ - Step 75336: {'lr': 0.0002535288677946579, 'samples': 14464512, 'steps': 75335, 'loss/train': 1.1532881259918213} 11/07/2021 07:51:31 - INFO - __main__ - Step 75337: {'lr': 0.0002535235615782175, 'samples': 14464704, 'steps': 75336, 'loss/train': 1.3311339616775513} 11/07/2021 07:51:32 - INFO - __main__ - Step 75338: {'lr': 0.0002535182553601894, 'samples': 14464896, 'steps': 75337, 'loss/train': 1.4703575372695923} 11/07/2021 07:51:33 - INFO - __main__ - Step 75339: {'lr': 0.00025351294914057615, 'samples': 14465088, 'steps': 75338, 'loss/train': 1.5040816068649292} 11/07/2021 07:51:33 - INFO - __main__ - Step 75340: {'lr': 0.00025350764291937994, 'samples': 14465280, 'steps': 75339, 'loss/train': 0.9453341960906982} 11/07/2021 07:51:33 - INFO - __main__ - Step 75341: {'lr': 0.0002535023366966033, 'samples': 14465472, 'steps': 75340, 'loss/train': 1.0004407167434692} 11/07/2021 07:51:34 - INFO - __main__ - Step 75342: {'lr': 0.00025349703047224847, 'samples': 14465664, 'steps': 75341, 'loss/train': 1.1352657079696655} 11/07/2021 07:51:35 - INFO - __main__ - Step 75343: {'lr': 0.0002534917242463179, 'samples': 14465856, 'steps': 75342, 'loss/train': 1.7139407396316528} 11/07/2021 07:51:35 - INFO - __main__ - Step 75344: {'lr': 0.0002534864180188141, 'samples': 14466048, 'steps': 75343, 'loss/train': 0.9718571901321411} 11/07/2021 07:51:35 - INFO - __main__ - Step 75345: {'lr': 0.00025348111178973937, 'samples': 14466240, 'steps': 75344, 'loss/train': 1.5366946458816528} 11/07/2021 07:51:36 - INFO - __main__ - Step 75346: {'lr': 0.0002534758055590962, 'samples': 14466432, 'steps': 75345, 'loss/train': 1.5735554695129395} 11/07/2021 07:51:36 - INFO - __main__ - Step 75347: {'lr': 0.00025347049932688675, 'samples': 14466624, 'steps': 75346, 'loss/train': 1.6281208992004395} 11/07/2021 07:51:37 - INFO - __main__ - Step 75348: {'lr': 0.0002534651930931136, 'samples': 14466816, 'steps': 75347, 'loss/train': 1.3531928062438965} 11/07/2021 07:51:38 - INFO - __main__ - Step 75349: {'lr': 0.00025345988685777904, 'samples': 14467008, 'steps': 75348, 'loss/train': 1.4685802459716797} 11/07/2021 07:51:38 - INFO - __main__ - Step 75350: {'lr': 0.0002534545806208855, 'samples': 14467200, 'steps': 75349, 'loss/train': 1.3445026874542236} 11/07/2021 07:51:38 - INFO - __main__ - Step 75351: {'lr': 0.00025344927438243544, 'samples': 14467392, 'steps': 75350, 'loss/train': 1.4402142763137817} 11/07/2021 07:51:39 - INFO - __main__ - Step 75352: {'lr': 0.00025344396814243114, 'samples': 14467584, 'steps': 75351, 'loss/train': 1.1158939599990845} 11/07/2021 07:51:40 - INFO - __main__ - Step 75353: {'lr': 0.00025343866190087513, 'samples': 14467776, 'steps': 75352, 'loss/train': 1.4631329774856567} 11/07/2021 07:51:40 - INFO - __main__ - Step 75354: {'lr': 0.0002534333556577696, 'samples': 14467968, 'steps': 75353, 'loss/train': 1.382297158241272} 11/07/2021 07:51:40 - INFO - __main__ - Step 75355: {'lr': 0.0002534280494131171, 'samples': 14468160, 'steps': 75354, 'loss/train': 1.7659330368041992} 11/07/2021 07:51:41 - INFO - __main__ - Step 75356: {'lr': 0.00025342274316692, 'samples': 14468352, 'steps': 75355, 'loss/train': 1.3933122158050537} 11/07/2021 07:51:41 - INFO - __main__ - Step 75357: {'lr': 0.0002534174369191806, 'samples': 14468544, 'steps': 75356, 'loss/train': 1.307175874710083} 11/07/2021 07:51:42 - INFO - __main__ - Step 75358: {'lr': 0.0002534121306699014, 'samples': 14468736, 'steps': 75357, 'loss/train': 2.0710010528564453} 11/07/2021 07:51:43 - INFO - __main__ - Step 75359: {'lr': 0.0002534068244190847, 'samples': 14468928, 'steps': 75358, 'loss/train': 1.1019752025604248} 11/07/2021 07:51:43 - INFO - __main__ - Step 75360: {'lr': 0.0002534015181667331, 'samples': 14469120, 'steps': 75359, 'loss/train': 0.8357174396514893} 11/07/2021 07:51:43 - INFO - __main__ - Step 75361: {'lr': 0.0002533962119128487, 'samples': 14469312, 'steps': 75360, 'loss/train': 1.0759166479110718} 11/07/2021 07:51:44 - INFO - __main__ - Step 75362: {'lr': 0.000253390905657434, 'samples': 14469504, 'steps': 75361, 'loss/train': 1.4890903234481812} 11/07/2021 07:51:44 - INFO - __main__ - Step 75363: {'lr': 0.0002533855994004914, 'samples': 14469696, 'steps': 75362, 'loss/train': 1.4563063383102417} 11/07/2021 07:51:44 - INFO - __main__ - Step 75364: {'lr': 0.00025338029314202334, 'samples': 14469888, 'steps': 75363, 'loss/train': 1.6524369716644287} 11/07/2021 07:51:45 - INFO - __main__ - Step 75365: {'lr': 0.00025337498688203215, 'samples': 14470080, 'steps': 75364, 'loss/train': 1.7775744199752808} 11/07/2021 07:51:46 - INFO - __main__ - Step 75366: {'lr': 0.0002533696806205203, 'samples': 14470272, 'steps': 75365, 'loss/train': 1.1985300779342651} 11/07/2021 07:51:46 - INFO - __main__ - Step 75367: {'lr': 0.0002533643743574901, 'samples': 14470464, 'steps': 75366, 'loss/train': 1.5226013660430908} 11/07/2021 07:51:46 - INFO - __main__ - Step 75368: {'lr': 0.000253359068092944, 'samples': 14470656, 'steps': 75367, 'loss/train': 1.4805430173873901} 11/07/2021 07:51:47 - INFO - __main__ - Step 75369: {'lr': 0.00025335376182688424, 'samples': 14470848, 'steps': 75368, 'loss/train': 1.73745858669281} 11/07/2021 07:51:48 - INFO - __main__ - Step 75370: {'lr': 0.0002533484555593134, 'samples': 14471040, 'steps': 75369, 'loss/train': 1.6706236600875854} 11/07/2021 07:51:48 - INFO - __main__ - Step 75371: {'lr': 0.00025334314929023377, 'samples': 14471232, 'steps': 75370, 'loss/train': 1.3499064445495605} 11/07/2021 07:51:49 - INFO - __main__ - Step 75372: {'lr': 0.0002533378430196478, 'samples': 14471424, 'steps': 75371, 'loss/train': 1.0359594821929932} 11/07/2021 07:51:49 - INFO - __main__ - Step 75373: {'lr': 0.00025333253674755785, 'samples': 14471616, 'steps': 75372, 'loss/train': 1.400675892829895} 11/07/2021 07:51:49 - INFO - __main__ - Step 75374: {'lr': 0.0002533272304739663, 'samples': 14471808, 'steps': 75373, 'loss/train': 1.3308706283569336} 11/07/2021 07:51:50 - INFO - __main__ - Step 75375: {'lr': 0.00025332192419887556, 'samples': 14472000, 'steps': 75374, 'loss/train': 1.6262941360473633} 11/07/2021 07:51:51 - INFO - __main__ - Step 75376: {'lr': 0.000253316617922288, 'samples': 14472192, 'steps': 75375, 'loss/train': 1.43510103225708} 11/07/2021 07:51:51 - INFO - __main__ - Step 75377: {'lr': 0.00025331131164420603, 'samples': 14472384, 'steps': 75376, 'loss/train': 1.116639494895935} 11/07/2021 07:51:51 - INFO - __main__ - Step 75378: {'lr': 0.000253306005364632, 'samples': 14472576, 'steps': 75377, 'loss/train': 1.7187747955322266} 11/07/2021 07:51:52 - INFO - __main__ - Step 75379: {'lr': 0.00025330069908356835, 'samples': 14472768, 'steps': 75378, 'loss/train': 1.455057144165039} 11/07/2021 07:51:53 - INFO - __main__ - Step 75380: {'lr': 0.0002532953928010175, 'samples': 14472960, 'steps': 75379, 'loss/train': 1.557126522064209} 11/07/2021 07:51:53 - INFO - __main__ - Step 75381: {'lr': 0.0002532900865169818, 'samples': 14473152, 'steps': 75380, 'loss/train': 1.6563931703567505} 11/07/2021 07:51:54 - INFO - __main__ - Step 75382: {'lr': 0.00025328478023146363, 'samples': 14473344, 'steps': 75381, 'loss/train': 1.5127633810043335} 11/07/2021 07:51:54 - INFO - __main__ - Step 75383: {'lr': 0.0002532794739444653, 'samples': 14473536, 'steps': 75382, 'loss/train': 0.7465620636940002} 11/07/2021 07:51:54 - INFO - __main__ - Step 75384: {'lr': 0.00025327416765598935, 'samples': 14473728, 'steps': 75383, 'loss/train': 1.4714090824127197} 11/07/2021 07:51:55 - INFO - __main__ - Step 75385: {'lr': 0.0002532688613660381, 'samples': 14473920, 'steps': 75384, 'loss/train': 1.117944598197937} 11/07/2021 07:51:56 - INFO - __main__ - Step 75386: {'lr': 0.0002532635550746141, 'samples': 14474112, 'steps': 75385, 'loss/train': 1.5511711835861206} 11/07/2021 07:51:56 - INFO - __main__ - Step 75387: {'lr': 0.00025325824878171937, 'samples': 14474304, 'steps': 75386, 'loss/train': 1.6372959613800049} 11/07/2021 07:51:56 - INFO - __main__ - Step 75388: {'lr': 0.00025325294248735664, 'samples': 14474496, 'steps': 75387, 'loss/train': 1.2648327350616455} 11/07/2021 07:51:57 - INFO - __main__ - Step 75389: {'lr': 0.0002532476361915281, 'samples': 14474688, 'steps': 75388, 'loss/train': 1.3831405639648438} 11/07/2021 07:51:57 - INFO - __main__ - Step 75390: {'lr': 0.00025324232989423626, 'samples': 14474880, 'steps': 75389, 'loss/train': 0.6946053504943848} 11/07/2021 07:51:58 - INFO - __main__ - Step 75391: {'lr': 0.0002532370235954836, 'samples': 14475072, 'steps': 75390, 'loss/train': 1.6684911251068115} 11/07/2021 07:51:58 - INFO - __main__ - Step 75392: {'lr': 0.00025323171729527225, 'samples': 14475264, 'steps': 75391, 'loss/train': 1.6645394563674927} 11/07/2021 07:51:59 - INFO - __main__ - Step 75393: {'lr': 0.00025322641099360477, 'samples': 14475456, 'steps': 75392, 'loss/train': 1.1438496112823486} 11/07/2021 07:51:59 - INFO - __main__ - Step 75394: {'lr': 0.00025322110469048353, 'samples': 14475648, 'steps': 75393, 'loss/train': 1.2413641214370728} 11/07/2021 07:51:59 - INFO - __main__ - Step 75395: {'lr': 0.0002532157983859109, 'samples': 14475840, 'steps': 75394, 'loss/train': 1.056247591972351} 11/07/2021 07:52:01 - INFO - __main__ - Step 75396: {'lr': 0.0002532104920798893, 'samples': 14476032, 'steps': 75395, 'loss/train': 1.8224257230758667} 11/07/2021 07:52:01 - INFO - __main__ - Step 75397: {'lr': 0.00025320518577242115, 'samples': 14476224, 'steps': 75396, 'loss/train': 1.5952324867248535} 11/07/2021 07:52:01 - INFO - __main__ - Step 75398: {'lr': 0.0002531998794635087, 'samples': 14476416, 'steps': 75397, 'loss/train': 1.4374330043792725} 11/07/2021 07:52:02 - INFO - __main__ - Step 75399: {'lr': 0.00025319457315315443, 'samples': 14476608, 'steps': 75398, 'loss/train': 1.3189018964767456} 11/07/2021 07:52:02 - INFO - __main__ - Step 75400: {'lr': 0.00025318926684136077, 'samples': 14476800, 'steps': 75399, 'loss/train': 1.5596387386322021} 11/07/2021 07:52:03 - INFO - __main__ - Step 75401: {'lr': 0.0002531839605281301, 'samples': 14476992, 'steps': 75400, 'loss/train': 1.5822023153305054} 11/07/2021 07:52:03 - INFO - __main__ - Step 75402: {'lr': 0.00025317865421346477, 'samples': 14477184, 'steps': 75401, 'loss/train': 1.7023519277572632} 11/07/2021 07:52:04 - INFO - __main__ - Step 75403: {'lr': 0.0002531733478973672, 'samples': 14477376, 'steps': 75402, 'loss/train': 1.4840303659439087} 11/07/2021 07:52:04 - INFO - __main__ - Step 75404: {'lr': 0.0002531680415798397, 'samples': 14477568, 'steps': 75403, 'loss/train': 1.8631364107131958} 11/07/2021 07:52:04 - INFO - __main__ - Step 75405: {'lr': 0.0002531627352608848, 'samples': 14477760, 'steps': 75404, 'loss/train': 1.5357606410980225} 11/07/2021 07:52:05 - INFO - __main__ - Step 75406: {'lr': 0.00025315742894050475, 'samples': 14477952, 'steps': 75405, 'loss/train': 1.2967935800552368} 11/07/2021 07:52:06 - INFO - __main__ - Step 75407: {'lr': 0.00025315212261870206, 'samples': 14478144, 'steps': 75406, 'loss/train': 0.8611009120941162} 11/07/2021 07:52:06 - INFO - __main__ - Step 75408: {'lr': 0.00025314681629547907, 'samples': 14478336, 'steps': 75407, 'loss/train': 1.2463306188583374} 11/07/2021 07:52:07 - INFO - __main__ - Step 75409: {'lr': 0.0002531415099708382, 'samples': 14478528, 'steps': 75408, 'loss/train': 1.299153208732605} 11/07/2021 07:52:07 - INFO - __main__ - Step 75410: {'lr': 0.0002531362036447818, 'samples': 14478720, 'steps': 75409, 'loss/train': 1.3753751516342163} 11/07/2021 07:52:07 - INFO - __main__ - Step 75411: {'lr': 0.0002531308973173122, 'samples': 14478912, 'steps': 75410, 'loss/train': 1.3227649927139282} 11/07/2021 07:52:08 - INFO - __main__ - Step 75412: {'lr': 0.00025312559098843195, 'samples': 14479104, 'steps': 75411, 'loss/train': 1.2713130712509155} 11/07/2021 07:52:09 - INFO - __main__ - Step 75413: {'lr': 0.00025312028465814337, 'samples': 14479296, 'steps': 75412, 'loss/train': 1.5569119453430176} 11/07/2021 07:52:09 - INFO - __main__ - Step 75414: {'lr': 0.0002531149783264488, 'samples': 14479488, 'steps': 75413, 'loss/train': 1.144696831703186} 11/07/2021 07:52:09 - INFO - __main__ - Step 75415: {'lr': 0.0002531096719933507, 'samples': 14479680, 'steps': 75414, 'loss/train': 1.495134949684143} 11/07/2021 07:52:10 - INFO - __main__ - Step 75416: {'lr': 0.0002531043656588514, 'samples': 14479872, 'steps': 75415, 'loss/train': 1.7478837966918945} 11/07/2021 07:52:11 - INFO - __main__ - Step 75417: {'lr': 0.00025309905932295324, 'samples': 14480064, 'steps': 75416, 'loss/train': 1.4066935777664185} 11/07/2021 07:52:11 - INFO - __main__ - Step 75418: {'lr': 0.00025309375298565877, 'samples': 14480256, 'steps': 75417, 'loss/train': 1.119341254234314} 11/07/2021 07:52:11 - INFO - __main__ - Step 75419: {'lr': 0.00025308844664697033, 'samples': 14480448, 'steps': 75418, 'loss/train': 0.4082111716270447} 11/07/2021 07:52:12 - INFO - __main__ - Step 75420: {'lr': 0.00025308314030689027, 'samples': 14480640, 'steps': 75419, 'loss/train': 0.9850432872772217} 11/07/2021 07:52:12 - INFO - __main__ - Step 75421: {'lr': 0.000253077833965421, 'samples': 14480832, 'steps': 75420, 'loss/train': 1.5171492099761963} 11/07/2021 07:52:13 - INFO - __main__ - Step 75422: {'lr': 0.0002530725276225649, 'samples': 14481024, 'steps': 75421, 'loss/train': 1.411158561706543} 11/07/2021 07:52:14 - INFO - __main__ - Step 75423: {'lr': 0.0002530672212783243, 'samples': 14481216, 'steps': 75422, 'loss/train': 1.250654935836792} 11/07/2021 07:52:14 - INFO - __main__ - Step 75424: {'lr': 0.0002530619149327017, 'samples': 14481408, 'steps': 75423, 'loss/train': 1.5920629501342773} 11/07/2021 07:52:14 - INFO - __main__ - Step 75425: {'lr': 0.00025305660858569945, 'samples': 14481600, 'steps': 75424, 'loss/train': 1.43694007396698} 11/07/2021 07:52:15 - INFO - __main__ - Step 75426: {'lr': 0.00025305130223732, 'samples': 14481792, 'steps': 75425, 'loss/train': 1.4931929111480713} 11/07/2021 07:52:16 - INFO - __main__ - Step 75427: {'lr': 0.00025304599588756564, 'samples': 14481984, 'steps': 75426, 'loss/train': 0.9686033129692078} 11/07/2021 07:52:16 - INFO - __main__ - Step 75428: {'lr': 0.00025304068953643875, 'samples': 14482176, 'steps': 75427, 'loss/train': 1.3618084192276} 11/07/2021 07:52:16 - INFO - __main__ - Step 75429: {'lr': 0.00025303538318394186, 'samples': 14482368, 'steps': 75428, 'loss/train': 1.261627435684204} 11/07/2021 07:52:17 - INFO - __main__ - Step 75430: {'lr': 0.00025303007683007725, 'samples': 14482560, 'steps': 75429, 'loss/train': 1.6036971807479858} 11/07/2021 07:52:17 - INFO - __main__ - Step 75431: {'lr': 0.00025302477047484725, 'samples': 14482752, 'steps': 75430, 'loss/train': 1.7831908464431763} 11/07/2021 07:52:17 - INFO - __main__ - Step 75432: {'lr': 0.0002530194641182544, 'samples': 14482944, 'steps': 75431, 'loss/train': 1.4428341388702393} 11/07/2021 07:52:18 - INFO - __main__ - Step 75433: {'lr': 0.00025301415776030105, 'samples': 14483136, 'steps': 75432, 'loss/train': 1.2324117422103882} 11/07/2021 07:52:19 - INFO - __main__ - Step 75434: {'lr': 0.0002530088514009896, 'samples': 14483328, 'steps': 75433, 'loss/train': 1.2350614070892334} 11/07/2021 07:52:19 - INFO - __main__ - Step 75435: {'lr': 0.0002530035450403223, 'samples': 14483520, 'steps': 75434, 'loss/train': 1.3937768936157227} 11/07/2021 07:52:19 - INFO - __main__ - Step 75436: {'lr': 0.00025299823867830167, 'samples': 14483712, 'steps': 75435, 'loss/train': 1.6993942260742188} 11/07/2021 07:52:20 - INFO - __main__ - Step 75437: {'lr': 0.00025299293231493007, 'samples': 14483904, 'steps': 75436, 'loss/train': 1.2232825756072998} 11/07/2021 07:52:21 - INFO - __main__ - Step 75438: {'lr': 0.00025298762595020994, 'samples': 14484096, 'steps': 75437, 'loss/train': 0.7066746950149536} 11/07/2021 07:52:21 - INFO - __main__ - Step 75439: {'lr': 0.00025298231958414367, 'samples': 14484288, 'steps': 75438, 'loss/train': 1.5044188499450684} 11/07/2021 07:52:22 - INFO - __main__ - Step 75440: {'lr': 0.00025297701321673363, 'samples': 14484480, 'steps': 75439, 'loss/train': 1.372206449508667} 11/07/2021 07:52:22 - INFO - __main__ - Step 75441: {'lr': 0.0002529717068479821, 'samples': 14484672, 'steps': 75440, 'loss/train': 1.2842708826065063} 11/07/2021 07:52:22 - INFO - __main__ - Step 75442: {'lr': 0.0002529664004778916, 'samples': 14484864, 'steps': 75441, 'loss/train': 1.715751051902771} 11/07/2021 07:52:23 - INFO - __main__ - Step 75443: {'lr': 0.00025296109410646443, 'samples': 14485056, 'steps': 75442, 'loss/train': 2.308370351791382} 11/07/2021 07:52:24 - INFO - __main__ - Step 75444: {'lr': 0.0002529557877337031, 'samples': 14485248, 'steps': 75443, 'loss/train': 0.9396941661834717} 11/07/2021 07:52:24 - INFO - __main__ - Step 75445: {'lr': 0.0002529504813596099, 'samples': 14485440, 'steps': 75444, 'loss/train': 0.9364436268806458} 11/07/2021 07:52:24 - INFO - __main__ - Step 75446: {'lr': 0.00025294517498418727, 'samples': 14485632, 'steps': 75445, 'loss/train': 1.612847089767456} 11/07/2021 07:52:25 - INFO - __main__ - Step 75447: {'lr': 0.0002529398686074377, 'samples': 14485824, 'steps': 75446, 'loss/train': 1.620434284210205} 11/07/2021 07:52:26 - INFO - __main__ - Step 75448: {'lr': 0.00025293456222936334, 'samples': 14486016, 'steps': 75447, 'loss/train': 1.2262240648269653} 11/07/2021 07:52:26 - INFO - __main__ - Step 75449: {'lr': 0.0002529292558499668, 'samples': 14486208, 'steps': 75448, 'loss/train': 1.6129426956176758} 11/07/2021 07:52:27 - INFO - __main__ - Step 75450: {'lr': 0.0002529239494692503, 'samples': 14486400, 'steps': 75449, 'loss/train': 1.7252554893493652} 11/07/2021 07:52:27 - INFO - __main__ - Step 75451: {'lr': 0.0002529186430872163, 'samples': 14486592, 'steps': 75450, 'loss/train': 0.9683408141136169} 11/07/2021 07:52:27 - INFO - __main__ - Step 75452: {'lr': 0.00025291333670386727, 'samples': 14486784, 'steps': 75451, 'loss/train': 1.1005795001983643} 11/07/2021 07:52:28 - INFO - __main__ - Step 75453: {'lr': 0.0002529080303192055, 'samples': 14486976, 'steps': 75452, 'loss/train': 1.5532797574996948} 11/07/2021 07:52:29 - INFO - __main__ - Step 75454: {'lr': 0.0002529027239332335, 'samples': 14487168, 'steps': 75453, 'loss/train': 1.7942194938659668} 11/07/2021 07:52:29 - INFO - __main__ - Step 75455: {'lr': 0.0002528974175459535, 'samples': 14487360, 'steps': 75454, 'loss/train': 2.009967088699341} 11/07/2021 07:52:29 - INFO - __main__ - Step 75456: {'lr': 0.000252892111157368, 'samples': 14487552, 'steps': 75455, 'loss/train': 1.092315435409546} 11/07/2021 07:52:30 - INFO - __main__ - Step 75457: {'lr': 0.0002528868047674793, 'samples': 14487744, 'steps': 75456, 'loss/train': 1.411738634109497} 11/07/2021 07:52:30 - INFO - __main__ - Step 75458: {'lr': 0.0002528814983762899, 'samples': 14487936, 'steps': 75457, 'loss/train': 1.4527678489685059} 11/07/2021 07:52:31 - INFO - __main__ - Step 75459: {'lr': 0.0002528761919838021, 'samples': 14488128, 'steps': 75458, 'loss/train': 1.2935047149658203} 11/07/2021 07:52:32 - INFO - __main__ - Step 75460: {'lr': 0.0002528708855900184, 'samples': 14488320, 'steps': 75459, 'loss/train': 1.0292233228683472} 11/07/2021 07:52:32 - INFO - __main__ - Step 75461: {'lr': 0.0002528655791949411, 'samples': 14488512, 'steps': 75460, 'loss/train': 0.09214270859956741} 11/07/2021 07:52:32 - INFO - __main__ - Step 75462: {'lr': 0.0002528602727985726, 'samples': 14488704, 'steps': 75461, 'loss/train': 0.9213132262229919} 11/07/2021 07:52:33 - INFO - __main__ - Step 75463: {'lr': 0.00025285496640091524, 'samples': 14488896, 'steps': 75462, 'loss/train': 1.2956551313400269} 11/07/2021 07:52:33 - INFO - __main__ - Step 75464: {'lr': 0.00025284966000197156, 'samples': 14489088, 'steps': 75463, 'loss/train': 1.3304716348648071} 11/07/2021 07:52:34 - INFO - __main__ - Step 75465: {'lr': 0.00025284435360174387, 'samples': 14489280, 'steps': 75464, 'loss/train': 1.4104925394058228} 11/07/2021 07:52:34 - INFO - __main__ - Step 75466: {'lr': 0.0002528390472002345, 'samples': 14489472, 'steps': 75465, 'loss/train': 1.81874418258667} 11/07/2021 07:52:35 - INFO - __main__ - Step 75467: {'lr': 0.00025283374079744595, 'samples': 14489664, 'steps': 75466, 'loss/train': 1.5398235321044922} 11/07/2021 07:52:35 - INFO - __main__ - Step 75468: {'lr': 0.00025282843439338056, 'samples': 14489856, 'steps': 75467, 'loss/train': 2.7994227409362793} 11/07/2021 07:52:35 - INFO - __main__ - Step 75469: {'lr': 0.0002528231279880407, 'samples': 14490048, 'steps': 75468, 'loss/train': 1.6266322135925293} 11/07/2021 07:52:36 - INFO - __main__ - Step 75470: {'lr': 0.0002528178215814288, 'samples': 14490240, 'steps': 75469, 'loss/train': 1.6341184377670288} 11/07/2021 07:52:37 - INFO - __main__ - Step 75471: {'lr': 0.0002528125151735472, 'samples': 14490432, 'steps': 75470, 'loss/train': 1.246778964996338} 11/07/2021 07:52:37 - INFO - __main__ - Step 75472: {'lr': 0.0002528072087643983, 'samples': 14490624, 'steps': 75471, 'loss/train': 1.4757322072982788} 11/07/2021 07:52:37 - INFO - __main__ - Step 75473: {'lr': 0.0002528019023539846, 'samples': 14490816, 'steps': 75472, 'loss/train': 1.6444785594940186} 11/07/2021 07:52:38 - INFO - __main__ - Step 75474: {'lr': 0.0002527965959423084, 'samples': 14491008, 'steps': 75473, 'loss/train': 1.5515508651733398} 11/07/2021 07:52:39 - INFO - __main__ - Step 75475: {'lr': 0.0002527912895293721, 'samples': 14491200, 'steps': 75474, 'loss/train': 1.3220417499542236} 11/07/2021 07:52:39 - INFO - __main__ - Step 75476: {'lr': 0.000252785983115178, 'samples': 14491392, 'steps': 75475, 'loss/train': 1.3681797981262207} 11/07/2021 07:52:40 - INFO - __main__ - Step 75477: {'lr': 0.00025278067669972867, 'samples': 14491584, 'steps': 75476, 'loss/train': 1.7961232662200928} 11/07/2021 07:52:40 - INFO - __main__ - Step 75478: {'lr': 0.0002527753702830263, 'samples': 14491776, 'steps': 75477, 'loss/train': 1.447153925895691} 11/07/2021 07:52:40 - INFO - __main__ - Step 75479: {'lr': 0.0002527700638650735, 'samples': 14491968, 'steps': 75478, 'loss/train': 1.6221336126327515} 11/07/2021 07:52:41 - INFO - __main__ - Step 75480: {'lr': 0.00025276475744587246, 'samples': 14492160, 'steps': 75479, 'loss/train': 1.3602179288864136} 11/07/2021 07:52:42 - INFO - __main__ - Step 75481: {'lr': 0.0002527594510254258, 'samples': 14492352, 'steps': 75480, 'loss/train': 1.2456971406936646} 11/07/2021 07:52:42 - INFO - __main__ - Step 75482: {'lr': 0.00025275414460373567, 'samples': 14492544, 'steps': 75481, 'loss/train': 1.3782490491867065} 11/07/2021 07:52:42 - INFO - __main__ - Step 75483: {'lr': 0.00025274883818080456, 'samples': 14492736, 'steps': 75482, 'loss/train': 0.8985316157341003} 11/07/2021 07:52:43 - INFO - __main__ - Step 75484: {'lr': 0.0002527435317566349, 'samples': 14492928, 'steps': 75483, 'loss/train': 1.4850633144378662} 11/07/2021 07:52:43 - INFO - __main__ - Step 75485: {'lr': 0.000252738225331229, 'samples': 14493120, 'steps': 75484, 'loss/train': 1.3219400644302368} 11/07/2021 07:52:44 - INFO - __main__ - Step 75486: {'lr': 0.00025273291890458933, 'samples': 14493312, 'steps': 75485, 'loss/train': 1.2006717920303345} 11/07/2021 07:52:44 - INFO - __main__ - Step 75487: {'lr': 0.00025272761247671833, 'samples': 14493504, 'steps': 75486, 'loss/train': 1.3145794868469238} 11/07/2021 07:52:45 - INFO - __main__ - Step 75488: {'lr': 0.0002527223060476182, 'samples': 14493696, 'steps': 75487, 'loss/train': 1.8216626644134521} 11/07/2021 07:52:45 - INFO - __main__ - Step 75489: {'lr': 0.0002527169996172915, 'samples': 14493888, 'steps': 75488, 'loss/train': 1.914743423461914} 11/07/2021 07:52:45 - INFO - __main__ - Step 75490: {'lr': 0.0002527116931857405, 'samples': 14494080, 'steps': 75489, 'loss/train': 1.6434016227722168} 11/07/2021 07:52:47 - INFO - __main__ - Step 75491: {'lr': 0.0002527063867529677, 'samples': 14494272, 'steps': 75490, 'loss/train': 1.5227444171905518} 11/07/2021 07:52:47 - INFO - __main__ - Step 75492: {'lr': 0.0002527010803189754, 'samples': 14494464, 'steps': 75491, 'loss/train': 1.743856430053711} 11/07/2021 07:52:47 - INFO - __main__ - Step 75493: {'lr': 0.0002526957738837661, 'samples': 14494656, 'steps': 75492, 'loss/train': 1.387418270111084} 11/07/2021 07:52:48 - INFO - __main__ - Step 75494: {'lr': 0.00025269046744734214, 'samples': 14494848, 'steps': 75493, 'loss/train': 1.642258644104004} 11/07/2021 07:52:48 - INFO - __main__ - Step 75495: {'lr': 0.00025268516100970584, 'samples': 14495040, 'steps': 75494, 'loss/train': 1.663557529449463} 11/07/2021 07:52:49 - INFO - __main__ - Step 75496: {'lr': 0.0002526798545708596, 'samples': 14495232, 'steps': 75495, 'loss/train': 0.999075174331665} 11/07/2021 07:52:50 - INFO - __main__ - Step 75497: {'lr': 0.000252674548130806, 'samples': 14495424, 'steps': 75496, 'loss/train': 1.6037218570709229} 11/07/2021 07:52:50 - INFO - __main__ - Step 75498: {'lr': 0.00025266924168954714, 'samples': 14495616, 'steps': 75497, 'loss/train': 1.1312004327774048} 11/07/2021 07:52:50 - INFO - __main__ - Step 75499: {'lr': 0.00025266393524708564, 'samples': 14495808, 'steps': 75498, 'loss/train': 1.8397302627563477} 11/07/2021 07:52:51 - INFO - __main__ - Step 75500: {'lr': 0.0002526586288034238, 'samples': 14496000, 'steps': 75499, 'loss/train': 1.7909877300262451} 11/07/2021 07:52:51 - INFO - __main__ - Step 75501: {'lr': 0.0002526533223585641, 'samples': 14496192, 'steps': 75500, 'loss/train': 1.5831578969955444} 11/07/2021 07:52:52 - INFO - __main__ - Step 75502: {'lr': 0.00025264801591250873, 'samples': 14496384, 'steps': 75501, 'loss/train': 1.5292307138442993} 11/07/2021 07:52:52 - INFO - __main__ - Step 75503: {'lr': 0.0002526427094652602, 'samples': 14496576, 'steps': 75502, 'loss/train': 1.4603526592254639} 11/07/2021 07:52:53 - INFO - __main__ - Step 75504: {'lr': 0.00025263740301682103, 'samples': 14496768, 'steps': 75503, 'loss/train': 1.323386549949646} 11/07/2021 07:52:53 - INFO - __main__ - Step 75505: {'lr': 0.0002526320965671934, 'samples': 14496960, 'steps': 75504, 'loss/train': 1.6545321941375732} 11/07/2021 07:52:53 - INFO - __main__ - Step 75506: {'lr': 0.0002526267901163798, 'samples': 14497152, 'steps': 75505, 'loss/train': 1.7934285402297974} 11/07/2021 07:52:55 - INFO - __main__ - Step 75507: {'lr': 0.00025262148366438265, 'samples': 14497344, 'steps': 75506, 'loss/train': 1.3781983852386475} 11/07/2021 07:52:55 - INFO - __main__ - Step 75508: {'lr': 0.0002526161772112042, 'samples': 14497536, 'steps': 75507, 'loss/train': 1.3933711051940918} 11/07/2021 07:52:55 - INFO - __main__ - Step 75509: {'lr': 0.000252610870756847, 'samples': 14497728, 'steps': 75508, 'loss/train': 1.1456351280212402} 11/07/2021 07:52:56 - INFO - __main__ - Step 75510: {'lr': 0.00025260556430131345, 'samples': 14497920, 'steps': 75509, 'loss/train': 1.3757011890411377} 11/07/2021 07:52:56 - INFO - __main__ - Step 75511: {'lr': 0.00025260025784460576, 'samples': 14498112, 'steps': 75510, 'loss/train': 1.7329930067062378} 11/07/2021 07:52:57 - INFO - __main__ - Step 75512: {'lr': 0.0002525949513867265, 'samples': 14498304, 'steps': 75511, 'loss/train': 1.8970006704330444} 11/07/2021 07:52:57 - INFO - __main__ - Step 75513: {'lr': 0.00025258964492767794, 'samples': 14498496, 'steps': 75512, 'loss/train': 1.5916541814804077} 11/07/2021 07:52:58 - INFO - __main__ - Step 75514: {'lr': 0.00025258433846746264, 'samples': 14498688, 'steps': 75513, 'loss/train': 1.418007254600525} 11/07/2021 07:52:58 - INFO - __main__ - Step 75515: {'lr': 0.0002525790320060828, 'samples': 14498880, 'steps': 75514, 'loss/train': 1.324389100074768} 11/07/2021 07:52:58 - INFO - __main__ - Step 75516: {'lr': 0.00025257372554354085, 'samples': 14499072, 'steps': 75515, 'loss/train': 1.5560258626937866} 11/07/2021 07:52:59 - INFO - __main__ - Step 75517: {'lr': 0.00025256841907983924, 'samples': 14499264, 'steps': 75516, 'loss/train': 0.8213626742362976} 11/07/2021 07:53:00 - INFO - __main__ - Step 75518: {'lr': 0.00025256311261498036, 'samples': 14499456, 'steps': 75517, 'loss/train': 1.3587151765823364} 11/07/2021 07:53:00 - INFO - __main__ - Step 75519: {'lr': 0.0002525578061489666, 'samples': 14499648, 'steps': 75518, 'loss/train': 1.5855580568313599} 11/07/2021 07:53:00 - INFO - __main__ - Step 75520: {'lr': 0.00025255249968180035, 'samples': 14499840, 'steps': 75519, 'loss/train': 1.483980417251587} 11/07/2021 07:53:01 - INFO - __main__ - Step 75521: {'lr': 0.0002525471932134839, 'samples': 14500032, 'steps': 75520, 'loss/train': 1.40939462184906} 11/07/2021 07:53:01 - INFO - __main__ - Step 75522: {'lr': 0.0002525418867440198, 'samples': 14500224, 'steps': 75521, 'loss/train': 1.5124281644821167} 11/07/2021 07:53:02 - INFO - __main__ - Step 75523: {'lr': 0.0002525365802734103, 'samples': 14500416, 'steps': 75522, 'loss/train': 1.4270721673965454} 11/07/2021 07:53:02 - INFO - __main__ - Step 75524: {'lr': 0.00025253127380165784, 'samples': 14500608, 'steps': 75523, 'loss/train': 1.3051674365997314} 11/07/2021 07:53:03 - INFO - __main__ - Step 75525: {'lr': 0.0002525259673287649, 'samples': 14500800, 'steps': 75524, 'loss/train': 1.5422788858413696} 11/07/2021 07:53:03 - INFO - __main__ - Step 75526: {'lr': 0.00025252066085473384, 'samples': 14500992, 'steps': 75525, 'loss/train': 1.4087597131729126} 11/07/2021 07:53:04 - INFO - __main__ - Step 75527: {'lr': 0.0002525153543795669, 'samples': 14501184, 'steps': 75526, 'loss/train': 1.371196985244751} 11/07/2021 07:53:05 - INFO - __main__ - Step 75528: {'lr': 0.00025251004790326665, 'samples': 14501376, 'steps': 75527, 'loss/train': 1.0436625480651855} 11/07/2021 07:53:05 - INFO - __main__ - Step 75529: {'lr': 0.00025250474142583535, 'samples': 14501568, 'steps': 75528, 'loss/train': 1.679349422454834} 11/07/2021 07:53:05 - INFO - __main__ - Step 75530: {'lr': 0.0002524994349472755, 'samples': 14501760, 'steps': 75529, 'loss/train': 1.5646437406539917} 11/07/2021 07:53:06 - INFO - __main__ - Step 75531: {'lr': 0.00025249412846758946, 'samples': 14501952, 'steps': 75530, 'loss/train': 1.7525901794433594} 11/07/2021 07:53:06 - INFO - __main__ - Step 75532: {'lr': 0.00025248882198677957, 'samples': 14502144, 'steps': 75531, 'loss/train': 2.0125765800476074} 11/07/2021 07:53:07 - INFO - __main__ - Step 75533: {'lr': 0.0002524835155048483, 'samples': 14502336, 'steps': 75532, 'loss/train': 1.5300780534744263} 11/07/2021 07:53:07 - INFO - __main__ - Step 75534: {'lr': 0.0002524782090217979, 'samples': 14502528, 'steps': 75533, 'loss/train': 1.2869625091552734} 11/07/2021 07:53:08 - INFO - __main__ - Step 75535: {'lr': 0.0002524729025376309, 'samples': 14502720, 'steps': 75534, 'loss/train': 1.6376525163650513} 11/07/2021 07:53:08 - INFO - __main__ - Step 75536: {'lr': 0.00025246759605234966, 'samples': 14502912, 'steps': 75535, 'loss/train': 0.9173499941825867} 11/07/2021 07:53:08 - INFO - __main__ - Step 75537: {'lr': 0.0002524622895659566, 'samples': 14503104, 'steps': 75536, 'loss/train': 1.2595677375793457} 11/07/2021 07:53:09 - INFO - __main__ - Step 75538: {'lr': 0.00025245698307845406, 'samples': 14503296, 'steps': 75537, 'loss/train': 1.5234668254852295} 11/07/2021 07:53:10 - INFO - __main__ - Step 75539: {'lr': 0.00025245167658984437, 'samples': 14503488, 'steps': 75538, 'loss/train': 1.3415480852127075} 11/07/2021 07:53:10 - INFO - __main__ - Step 75540: {'lr': 0.00025244637010013004, 'samples': 14503680, 'steps': 75539, 'loss/train': 0.5989901423454285} 11/07/2021 07:53:10 - INFO - __main__ - Step 75541: {'lr': 0.0002524410636093134, 'samples': 14503872, 'steps': 75540, 'loss/train': 1.2438222169876099} 11/07/2021 07:53:11 - INFO - __main__ - Step 75542: {'lr': 0.0002524357571173969, 'samples': 14504064, 'steps': 75541, 'loss/train': 1.0076595544815063} 11/07/2021 07:53:11 - INFO - __main__ - Step 75543: {'lr': 0.00025243045062438285, 'samples': 14504256, 'steps': 75542, 'loss/train': 0.8266238570213318} 11/07/2021 07:53:12 - INFO - __main__ - Step 75544: {'lr': 0.00025242514413027366, 'samples': 14504448, 'steps': 75543, 'loss/train': 1.654298186302185} 11/07/2021 07:53:13 - INFO - __main__ - Step 75545: {'lr': 0.00025241983763507175, 'samples': 14504640, 'steps': 75544, 'loss/train': 1.422455072402954} 11/07/2021 07:53:13 - INFO - __main__ - Step 75546: {'lr': 0.0002524145311387795, 'samples': 14504832, 'steps': 75545, 'loss/train': 1.3144696950912476} 11/07/2021 07:53:13 - INFO - __main__ - Step 75547: {'lr': 0.0002524092246413993, 'samples': 14505024, 'steps': 75546, 'loss/train': 1.5562098026275635} 11/07/2021 07:53:14 - INFO - __main__ - Step 75548: {'lr': 0.00025240391814293354, 'samples': 14505216, 'steps': 75547, 'loss/train': 1.6427717208862305} 11/07/2021 07:53:15 - INFO - __main__ - Step 75549: {'lr': 0.0002523986116433846, 'samples': 14505408, 'steps': 75548, 'loss/train': 1.2732911109924316} 11/07/2021 07:53:15 - INFO - __main__ - Step 75550: {'lr': 0.0002523933051427549, 'samples': 14505600, 'steps': 75549, 'loss/train': 1.4382883310317993} 11/07/2021 07:53:15 - INFO - __main__ - Step 75551: {'lr': 0.0002523879986410468, 'samples': 14505792, 'steps': 75550, 'loss/train': 1.5960650444030762} 11/07/2021 07:53:16 - INFO - __main__ - Step 75552: {'lr': 0.0002523826921382627, 'samples': 14505984, 'steps': 75551, 'loss/train': 1.457029938697815} 11/07/2021 07:53:16 - INFO - __main__ - Step 75553: {'lr': 0.00025237738563440497, 'samples': 14506176, 'steps': 75552, 'loss/train': 1.132361650466919} 11/07/2021 07:53:17 - INFO - __main__ - Step 75554: {'lr': 0.00025237207912947614, 'samples': 14506368, 'steps': 75553, 'loss/train': 1.7738672494888306} 11/07/2021 07:53:17 - INFO - __main__ - Step 75555: {'lr': 0.0002523667726234784, 'samples': 14506560, 'steps': 75554, 'loss/train': 0.8743076920509338} 11/07/2021 07:53:18 - INFO - __main__ - Step 75556: {'lr': 0.00025236146611641423, 'samples': 14506752, 'steps': 75555, 'loss/train': 1.923452615737915} 11/07/2021 07:53:18 - INFO - __main__ - Step 75557: {'lr': 0.0002523561596082861, 'samples': 14506944, 'steps': 75556, 'loss/train': 1.2756961584091187} 11/07/2021 07:53:19 - INFO - __main__ - Step 75558: {'lr': 0.0002523508530990962, 'samples': 14507136, 'steps': 75557, 'loss/train': 0.19023942947387695} 11/07/2021 07:53:20 - INFO - __main__ - Step 75559: {'lr': 0.00025234554658884706, 'samples': 14507328, 'steps': 75558, 'loss/train': 1.1682294607162476} 11/07/2021 07:53:20 - INFO - __main__ - Step 75560: {'lr': 0.00025234024007754106, 'samples': 14507520, 'steps': 75559, 'loss/train': 1.4757914543151855} 11/07/2021 07:53:20 - INFO - __main__ - Step 75561: {'lr': 0.0002523349335651807, 'samples': 14507712, 'steps': 75560, 'loss/train': 1.3466964960098267} 11/07/2021 07:53:21 - INFO - __main__ - Step 75562: {'lr': 0.0002523296270517682, 'samples': 14507904, 'steps': 75561, 'loss/train': 1.9436055421829224} 11/07/2021 07:53:21 - INFO - __main__ - Step 75563: {'lr': 0.0002523243205373059, 'samples': 14508096, 'steps': 75562, 'loss/train': 0.2397017478942871} 11/07/2021 07:53:21 - INFO - __main__ - Step 75564: {'lr': 0.00025231901402179635, 'samples': 14508288, 'steps': 75563, 'loss/train': 0.3784647583961487} 11/07/2021 07:53:22 - INFO - __main__ - Step 75565: {'lr': 0.0002523137075052419, 'samples': 14508480, 'steps': 75564, 'loss/train': 1.4950296878814697} 11/07/2021 07:53:23 - INFO - __main__ - Step 75566: {'lr': 0.00025230840098764497, 'samples': 14508672, 'steps': 75565, 'loss/train': 1.32523512840271} 11/07/2021 07:53:23 - INFO - __main__ - Step 75567: {'lr': 0.00025230309446900787, 'samples': 14508864, 'steps': 75566, 'loss/train': 1.23416006565094} 11/07/2021 07:53:23 - INFO - __main__ - Step 75568: {'lr': 0.0002522977879493331, 'samples': 14509056, 'steps': 75567, 'loss/train': 1.558989405632019} 11/07/2021 07:53:24 - INFO - __main__ - Step 75569: {'lr': 0.00025229248142862287, 'samples': 14509248, 'steps': 75568, 'loss/train': 1.4533261060714722} 11/07/2021 07:53:25 - INFO - __main__ - Step 75570: {'lr': 0.00025228717490687974, 'samples': 14509440, 'steps': 75569, 'loss/train': 1.4529826641082764} 11/07/2021 07:53:25 - INFO - __main__ - Step 75571: {'lr': 0.000252281868384106, 'samples': 14509632, 'steps': 75570, 'loss/train': 1.4362505674362183} 11/07/2021 07:53:26 - INFO - __main__ - Step 75572: {'lr': 0.0002522765618603041, 'samples': 14509824, 'steps': 75571, 'loss/train': 1.3247181177139282} 11/07/2021 07:53:26 - INFO - __main__ - Step 75573: {'lr': 0.00025227125533547643, 'samples': 14510016, 'steps': 75572, 'loss/train': 1.3506766557693481} 11/07/2021 07:53:26 - INFO - __main__ - Step 75574: {'lr': 0.0002522659488096254, 'samples': 14510208, 'steps': 75573, 'loss/train': 1.4282550811767578} 11/07/2021 07:53:27 - INFO - __main__ - Step 75575: {'lr': 0.0002522606422827534, 'samples': 14510400, 'steps': 75574, 'loss/train': 1.29643714427948} 11/07/2021 07:53:28 - INFO - __main__ - Step 75576: {'lr': 0.0002522553357548627, 'samples': 14510592, 'steps': 75575, 'loss/train': 1.4119337797164917} 11/07/2021 07:53:28 - INFO - __main__ - Step 75577: {'lr': 0.0002522500292259558, 'samples': 14510784, 'steps': 75576, 'loss/train': 1.345603585243225} 11/07/2021 07:53:29 - INFO - __main__ - Step 75578: {'lr': 0.0002522447226960351, 'samples': 14510976, 'steps': 75577, 'loss/train': 1.8890777826309204} 11/07/2021 07:53:29 - INFO - __main__ - Step 75579: {'lr': 0.00025223941616510294, 'samples': 14511168, 'steps': 75578, 'loss/train': 1.5743776559829712} 11/07/2021 07:53:29 - INFO - __main__ - Step 75580: {'lr': 0.00025223410963316176, 'samples': 14511360, 'steps': 75579, 'loss/train': 1.456299901008606} 11/07/2021 07:53:30 - INFO - __main__ - Step 75581: {'lr': 0.0002522288031002139, 'samples': 14511552, 'steps': 75580, 'loss/train': 1.5698162317276} 11/07/2021 07:53:31 - INFO - __main__ - Step 75582: {'lr': 0.00025222349656626184, 'samples': 14511744, 'steps': 75581, 'loss/train': 1.123979091644287} 11/07/2021 07:53:31 - INFO - __main__ - Step 75583: {'lr': 0.0002522181900313078, 'samples': 14511936, 'steps': 75582, 'loss/train': 1.2883610725402832} 11/07/2021 07:53:31 - INFO - __main__ - Step 75584: {'lr': 0.0002522128834953543, 'samples': 14512128, 'steps': 75583, 'loss/train': 1.2955607175827026} 11/07/2021 07:53:32 - INFO - __main__ - Step 75585: {'lr': 0.00025220757695840375, 'samples': 14512320, 'steps': 75584, 'loss/train': 1.1162029504776} 11/07/2021 07:53:33 - INFO - __main__ - Step 75586: {'lr': 0.00025220227042045847, 'samples': 14512512, 'steps': 75585, 'loss/train': 1.6304047107696533} 11/07/2021 07:53:33 - INFO - __main__ - Step 75587: {'lr': 0.00025219696388152093, 'samples': 14512704, 'steps': 75586, 'loss/train': 1.7414430379867554} 11/07/2021 07:53:33 - INFO - __main__ - Step 75588: {'lr': 0.00025219165734159345, 'samples': 14512896, 'steps': 75587, 'loss/train': 1.38230299949646} 11/07/2021 07:53:34 - INFO - __main__ - Step 75589: {'lr': 0.00025218635080067844, 'samples': 14513088, 'steps': 75588, 'loss/train': 1.142987847328186} 11/07/2021 07:53:34 - INFO - __main__ - Step 75590: {'lr': 0.00025218104425877826, 'samples': 14513280, 'steps': 75589, 'loss/train': 1.6978983879089355} 11/07/2021 07:53:35 - INFO - __main__ - Step 75591: {'lr': 0.00025217573771589536, 'samples': 14513472, 'steps': 75590, 'loss/train': 1.5476864576339722} 11/07/2021 07:53:35 - INFO - __main__ - Step 75592: {'lr': 0.00025217043117203207, 'samples': 14513664, 'steps': 75591, 'loss/train': 1.4814691543579102} 11/07/2021 07:53:36 - INFO - __main__ - Step 75593: {'lr': 0.0002521651246271909, 'samples': 14513856, 'steps': 75592, 'loss/train': 1.391980528831482} 11/07/2021 07:53:36 - INFO - __main__ - Step 75594: {'lr': 0.0002521598180813741, 'samples': 14514048, 'steps': 75593, 'loss/train': 1.6648476123809814} 11/07/2021 07:53:37 - INFO - __main__ - Step 75595: {'lr': 0.00025215451153458415, 'samples': 14514240, 'steps': 75594, 'loss/train': 0.7931016087532043} 11/07/2021 07:53:37 - INFO - __main__ - Step 75596: {'lr': 0.0002521492049868234, 'samples': 14514432, 'steps': 75595, 'loss/train': 1.652696132659912} 11/07/2021 07:53:38 - INFO - __main__ - Step 75597: {'lr': 0.0002521438984380942, 'samples': 14514624, 'steps': 75596, 'loss/train': 1.2926969528198242} 11/07/2021 07:53:38 - INFO - __main__ - Step 75598: {'lr': 0.00025213859188839903, 'samples': 14514816, 'steps': 75597, 'loss/train': 0.9360073208808899} 11/07/2021 07:53:38 - INFO - __main__ - Step 75599: {'lr': 0.0002521332853377403, 'samples': 14515008, 'steps': 75598, 'loss/train': 1.824554443359375} 11/07/2021 07:53:39 - INFO - __main__ - Step 75600: {'lr': 0.00025212797878612024, 'samples': 14515200, 'steps': 75599, 'loss/train': 1.380595326423645} 11/07/2021 07:53:40 - INFO - __main__ - Step 75601: {'lr': 0.0002521226722335414, 'samples': 14515392, 'steps': 75600, 'loss/train': 1.5967692136764526} 11/07/2021 07:53:40 - INFO - __main__ - Step 75602: {'lr': 0.00025211736568000613, 'samples': 14515584, 'steps': 75601, 'loss/train': 1.335892915725708} 11/07/2021 07:53:41 - INFO - __main__ - Step 75603: {'lr': 0.00025211205912551685, 'samples': 14515776, 'steps': 75602, 'loss/train': 0.8130733966827393} 11/07/2021 07:53:41 - INFO - __main__ - Step 75604: {'lr': 0.0002521067525700758, 'samples': 14515968, 'steps': 75603, 'loss/train': 0.6120889782905579} 11/07/2021 07:53:42 - INFO - __main__ - Step 75605: {'lr': 0.00025210144601368553, 'samples': 14516160, 'steps': 75604, 'loss/train': 1.1562964916229248} 11/07/2021 07:53:42 - INFO - __main__ - Step 75606: {'lr': 0.00025209613945634837, 'samples': 14516352, 'steps': 75605, 'loss/train': 1.1393592357635498} 11/07/2021 07:53:43 - INFO - __main__ - Step 75607: {'lr': 0.0002520908328980667, 'samples': 14516544, 'steps': 75606, 'loss/train': 0.5815699100494385} 11/07/2021 07:53:43 - INFO - __main__ - Step 75608: {'lr': 0.0002520855263388431, 'samples': 14516736, 'steps': 75607, 'loss/train': 1.179397702217102} 11/07/2021 07:53:44 - INFO - __main__ - Step 75609: {'lr': 0.00025208021977867964, 'samples': 14516928, 'steps': 75608, 'loss/train': 1.8175472021102905} 11/07/2021 07:53:44 - INFO - __main__ - Step 75610: {'lr': 0.00025207491321757884, 'samples': 14517120, 'steps': 75609, 'loss/train': 1.4369949102401733} 11/07/2021 07:53:44 - INFO - __main__ - Step 75611: {'lr': 0.0002520696066555432, 'samples': 14517312, 'steps': 75610, 'loss/train': 1.3805783987045288} 11/07/2021 07:53:46 - INFO - __main__ - Step 75612: {'lr': 0.0002520643000925749, 'samples': 14517504, 'steps': 75611, 'loss/train': 1.4492499828338623} 11/07/2021 07:53:46 - INFO - __main__ - Step 75613: {'lr': 0.0002520589935286766, 'samples': 14517696, 'steps': 75612, 'loss/train': 0.662986159324646} 11/07/2021 07:53:46 - INFO - __main__ - Step 75614: {'lr': 0.00025205368696385046, 'samples': 14517888, 'steps': 75613, 'loss/train': 1.492806077003479} 11/07/2021 07:53:47 - INFO - __main__ - Step 75615: {'lr': 0.0002520483803980991, 'samples': 14518080, 'steps': 75614, 'loss/train': 1.3485599756240845} 11/07/2021 07:53:47 - INFO - __main__ - Step 75616: {'lr': 0.0002520430738314246, 'samples': 14518272, 'steps': 75615, 'loss/train': 1.147079586982727} 11/07/2021 07:53:48 - INFO - __main__ - Step 75617: {'lr': 0.0002520377672638296, 'samples': 14518464, 'steps': 75616, 'loss/train': 1.3806483745574951} 11/07/2021 07:53:48 - INFO - __main__ - Step 75618: {'lr': 0.0002520324606953164, 'samples': 14518656, 'steps': 75617, 'loss/train': 1.2562466859817505} 11/07/2021 07:53:49 - INFO - __main__ - Step 75619: {'lr': 0.0002520271541258874, 'samples': 14518848, 'steps': 75618, 'loss/train': 1.9849635362625122} 11/07/2021 07:53:49 - INFO - __main__ - Step 75620: {'lr': 0.000252021847555545, 'samples': 14519040, 'steps': 75619, 'loss/train': 1.2246744632720947} 11/07/2021 07:53:49 - INFO - __main__ - Step 75621: {'lr': 0.00025201654098429163, 'samples': 14519232, 'steps': 75620, 'loss/train': 1.213667392730713} 11/07/2021 07:53:50 - INFO - __main__ - Step 75622: {'lr': 0.0002520112344121296, 'samples': 14519424, 'steps': 75621, 'loss/train': 1.1384048461914062} 11/07/2021 07:53:51 - INFO - __main__ - Step 75623: {'lr': 0.00025200592783906136, 'samples': 14519616, 'steps': 75622, 'loss/train': 1.1657475233078003} 11/07/2021 07:53:51 - INFO - __main__ - Step 75624: {'lr': 0.00025200062126508923, 'samples': 14519808, 'steps': 75623, 'loss/train': 0.6929659843444824} 11/07/2021 07:53:51 - INFO - __main__ - Step 75625: {'lr': 0.0002519953146902157, 'samples': 14520000, 'steps': 75624, 'loss/train': 2.031132936477661} 11/07/2021 07:53:52 - INFO - __main__ - Step 75626: {'lr': 0.00025199000811444304, 'samples': 14520192, 'steps': 75625, 'loss/train': 0.9299135804176331} 11/07/2021 07:53:52 - INFO - __main__ - Step 75627: {'lr': 0.00025198470153777375, 'samples': 14520384, 'steps': 75626, 'loss/train': 1.4418513774871826} 11/07/2021 07:53:53 - INFO - __main__ - Step 75628: {'lr': 0.00025197939496021026, 'samples': 14520576, 'steps': 75627, 'loss/train': 1.2411917448043823} 11/07/2021 07:53:54 - INFO - __main__ - Step 75629: {'lr': 0.00025197408838175485, 'samples': 14520768, 'steps': 75628, 'loss/train': 1.510070562362671} 11/07/2021 07:53:54 - INFO - __main__ - Step 75630: {'lr': 0.0002519687818024099, 'samples': 14520960, 'steps': 75629, 'loss/train': 1.6546624898910522} 11/07/2021 07:53:54 - INFO - __main__ - Step 75631: {'lr': 0.0002519634752221778, 'samples': 14521152, 'steps': 75630, 'loss/train': 2.9269216060638428} 11/07/2021 07:53:55 - INFO - __main__ - Step 75632: {'lr': 0.00025195816864106107, 'samples': 14521344, 'steps': 75631, 'loss/train': 1.582208514213562} 11/07/2021 07:53:56 - INFO - __main__ - Step 75633: {'lr': 0.00025195286205906205, 'samples': 14521536, 'steps': 75632, 'loss/train': 1.2872178554534912} 11/07/2021 07:53:56 - INFO - __main__ - Step 75634: {'lr': 0.00025194755547618304, 'samples': 14521728, 'steps': 75633, 'loss/train': 1.7358516454696655} 11/07/2021 07:53:56 - INFO - __main__ - Step 75635: {'lr': 0.00025194224889242653, 'samples': 14521920, 'steps': 75634, 'loss/train': 1.1515306234359741} 11/07/2021 07:53:57 - INFO - __main__ - Step 75636: {'lr': 0.00025193694230779486, 'samples': 14522112, 'steps': 75635, 'loss/train': 1.6779463291168213} 11/07/2021 07:53:57 - INFO - __main__ - Step 75637: {'lr': 0.0002519316357222904, 'samples': 14522304, 'steps': 75636, 'loss/train': 1.0742621421813965} 11/07/2021 07:53:59 - INFO - __main__ - Step 75638: {'lr': 0.00025192632913591554, 'samples': 14522496, 'steps': 75637, 'loss/train': 1.005031943321228} 11/07/2021 07:53:59 - INFO - __main__ - Step 75639: {'lr': 0.00025192102254867284, 'samples': 14522688, 'steps': 75638, 'loss/train': 1.3441503047943115} 11/07/2021 07:53:59 - INFO - __main__ - Step 75640: {'lr': 0.00025191571596056445, 'samples': 14522880, 'steps': 75639, 'loss/train': 1.5909006595611572} 11/07/2021 07:54:00 - INFO - __main__ - Step 75641: {'lr': 0.0002519104093715929, 'samples': 14523072, 'steps': 75640, 'loss/train': 0.1046864315867424} 11/07/2021 07:54:00 - INFO - __main__ - Step 75642: {'lr': 0.0002519051027817606, 'samples': 14523264, 'steps': 75641, 'loss/train': 1.4673084020614624} 11/07/2021 07:54:01 - INFO - __main__ - Step 75643: {'lr': 0.00025189979619106976, 'samples': 14523456, 'steps': 75642, 'loss/train': 1.4692940711975098} 11/07/2021 07:54:01 - INFO - __main__ - Step 75644: {'lr': 0.00025189448959952304, 'samples': 14523648, 'steps': 75643, 'loss/train': 1.2158489227294922} 11/07/2021 07:54:02 - INFO - __main__ - Step 75645: {'lr': 0.0002518891830071226, 'samples': 14523840, 'steps': 75644, 'loss/train': 1.23605215549469} 11/07/2021 07:54:02 - INFO - __main__ - Step 75646: {'lr': 0.00025188387641387095, 'samples': 14524032, 'steps': 75645, 'loss/train': 1.715276837348938} 11/07/2021 07:54:02 - INFO - __main__ - Step 75647: {'lr': 0.0002518785698197705, 'samples': 14524224, 'steps': 75646, 'loss/train': 1.3380600214004517} 11/07/2021 07:54:03 - INFO - __main__ - Step 75648: {'lr': 0.0002518732632248235, 'samples': 14524416, 'steps': 75647, 'loss/train': 1.4197574853897095} 11/07/2021 07:54:04 - INFO - __main__ - Step 75649: {'lr': 0.0002518679566290326, 'samples': 14524608, 'steps': 75648, 'loss/train': 1.207599401473999} 11/07/2021 07:54:04 - INFO - __main__ - Step 75650: {'lr': 0.0002518626500323998, 'samples': 14524800, 'steps': 75649, 'loss/train': 1.6418232917785645} 11/07/2021 07:54:04 - INFO - __main__ - Step 75651: {'lr': 0.0002518573434349279, 'samples': 14524992, 'steps': 75650, 'loss/train': 1.4587256908416748} 11/07/2021 07:54:05 - INFO - __main__ - Step 75652: {'lr': 0.000251852036836619, 'samples': 14525184, 'steps': 75651, 'loss/train': 1.3622548580169678} 11/07/2021 07:54:06 - INFO - __main__ - Step 75653: {'lr': 0.0002518467302374757, 'samples': 14525376, 'steps': 75652, 'loss/train': 0.9353790283203125} 11/07/2021 07:54:06 - INFO - __main__ - Step 75654: {'lr': 0.0002518414236375002, 'samples': 14525568, 'steps': 75653, 'loss/train': 1.4856644868850708} 11/07/2021 07:54:07 - INFO - __main__ - Step 75655: {'lr': 0.0002518361170366951, 'samples': 14525760, 'steps': 75654, 'loss/train': 1.252266764640808} 11/07/2021 07:54:07 - INFO - __main__ - Step 75656: {'lr': 0.00025183081043506257, 'samples': 14525952, 'steps': 75655, 'loss/train': 1.0468562841415405} 11/07/2021 07:54:07 - INFO - __main__ - Step 75657: {'lr': 0.0002518255038326051, 'samples': 14526144, 'steps': 75656, 'loss/train': 1.2654701471328735} 11/07/2021 07:54:09 - INFO - __main__ - Step 75658: {'lr': 0.0002518201972293251, 'samples': 14526336, 'steps': 75657, 'loss/train': 1.1077407598495483} 11/07/2021 07:54:09 - INFO - __main__ - Step 75659: {'lr': 0.00025181489062522494, 'samples': 14526528, 'steps': 75658, 'loss/train': 1.4952503442764282} 11/07/2021 07:54:09 - INFO - __main__ - Step 75660: {'lr': 0.00025180958402030713, 'samples': 14526720, 'steps': 75659, 'loss/train': 0.9166703820228577} 11/07/2021 07:54:10 - INFO - __main__ - Step 75661: {'lr': 0.00025180427741457385, 'samples': 14526912, 'steps': 75660, 'loss/train': 0.7710058689117432} 11/07/2021 07:54:10 - INFO - __main__ - Step 75662: {'lr': 0.0002517989708080276, 'samples': 14527104, 'steps': 75661, 'loss/train': 1.2638438940048218} 11/07/2021 07:54:10 - INFO - __main__ - Step 75663: {'lr': 0.00025179366420067075, 'samples': 14527296, 'steps': 75662, 'loss/train': 1.2862082719802856} 11/07/2021 07:54:11 - INFO - __main__ - Step 75664: {'lr': 0.00025178835759250576, 'samples': 14527488, 'steps': 75663, 'loss/train': 0.7460633516311646} 11/07/2021 07:54:12 - INFO - __main__ - Step 75665: {'lr': 0.0002517830509835349, 'samples': 14527680, 'steps': 75664, 'loss/train': 1.0471469163894653} 11/07/2021 07:54:12 - INFO - __main__ - Step 75666: {'lr': 0.00025177774437376067, 'samples': 14527872, 'steps': 75665, 'loss/train': 1.3756502866744995} 11/07/2021 07:54:12 - INFO - __main__ - Step 75667: {'lr': 0.0002517724377631854, 'samples': 14528064, 'steps': 75666, 'loss/train': 1.3208271265029907} 11/07/2021 07:54:13 - INFO - __main__ - Step 75668: {'lr': 0.00025176713115181143, 'samples': 14528256, 'steps': 75667, 'loss/train': 1.274173617362976} 11/07/2021 07:54:14 - INFO - __main__ - Step 75669: {'lr': 0.0002517618245396413, 'samples': 14528448, 'steps': 75668, 'loss/train': 1.228684425354004} 11/07/2021 07:54:14 - INFO - __main__ - Step 75670: {'lr': 0.00025175651792667725, 'samples': 14528640, 'steps': 75669, 'loss/train': 1.0957001447677612} 11/07/2021 07:54:15 - INFO - __main__ - Step 75671: {'lr': 0.00025175121131292184, 'samples': 14528832, 'steps': 75670, 'loss/train': 1.4240753650665283} 11/07/2021 07:54:15 - INFO - __main__ - Step 75672: {'lr': 0.00025174590469837735, 'samples': 14529024, 'steps': 75671, 'loss/train': 1.5063936710357666} 11/07/2021 07:54:15 - INFO - __main__ - Step 75673: {'lr': 0.0002517405980830461, 'samples': 14529216, 'steps': 75672, 'loss/train': 1.3207509517669678} 11/07/2021 07:54:16 - INFO - __main__ - Step 75674: {'lr': 0.00025173529146693056, 'samples': 14529408, 'steps': 75673, 'loss/train': 1.2926827669143677} 11/07/2021 07:54:17 - INFO - __main__ - Step 75675: {'lr': 0.0002517299848500332, 'samples': 14529600, 'steps': 75674, 'loss/train': 1.6065360307693481} 11/07/2021 07:54:17 - INFO - __main__ - Step 75676: {'lr': 0.00025172467823235634, 'samples': 14529792, 'steps': 75675, 'loss/train': 1.3541004657745361} 11/07/2021 07:54:17 - INFO - __main__ - Step 75677: {'lr': 0.0002517193716139023, 'samples': 14529984, 'steps': 75676, 'loss/train': 0.9457648992538452} 11/07/2021 07:54:18 - INFO - __main__ - Step 75678: {'lr': 0.0002517140649946736, 'samples': 14530176, 'steps': 75677, 'loss/train': 1.6127121448516846} 11/07/2021 07:54:18 - INFO - __main__ - Step 75679: {'lr': 0.0002517087583746725, 'samples': 14530368, 'steps': 75678, 'loss/train': 1.618749976158142} 11/07/2021 07:54:19 - INFO - __main__ - Step 75680: {'lr': 0.00025170345175390147, 'samples': 14530560, 'steps': 75679, 'loss/train': 1.7360721826553345} 11/07/2021 07:54:20 - INFO - __main__ - Step 75681: {'lr': 0.00025169814513236296, 'samples': 14530752, 'steps': 75680, 'loss/train': 1.5553693771362305} 11/07/2021 07:54:20 - INFO - __main__ - Step 75682: {'lr': 0.00025169283851005927, 'samples': 14530944, 'steps': 75681, 'loss/train': 1.3378018140792847} 11/07/2021 07:54:20 - INFO - __main__ - Step 75683: {'lr': 0.0002516875318869928, 'samples': 14531136, 'steps': 75682, 'loss/train': 1.3036956787109375} 11/07/2021 07:54:21 - INFO - __main__ - Step 75684: {'lr': 0.0002516822252631659, 'samples': 14531328, 'steps': 75683, 'loss/train': 1.590129017829895} 11/07/2021 07:54:22 - INFO - __main__ - Step 75685: {'lr': 0.00025167691863858105, 'samples': 14531520, 'steps': 75684, 'loss/train': 1.5683066844940186} 11/07/2021 07:54:22 - INFO - __main__ - Step 75686: {'lr': 0.0002516716120132406, 'samples': 14531712, 'steps': 75685, 'loss/train': 1.39080011844635} 11/07/2021 07:54:22 - INFO - __main__ - Step 75687: {'lr': 0.00025166630538714694, 'samples': 14531904, 'steps': 75686, 'loss/train': 1.4660584926605225} 11/07/2021 07:54:23 - INFO - __main__ - Step 75688: {'lr': 0.0002516609987603025, 'samples': 14532096, 'steps': 75687, 'loss/train': 1.5348001718521118} 11/07/2021 07:54:23 - INFO - __main__ - Step 75689: {'lr': 0.00025165569213270975, 'samples': 14532288, 'steps': 75688, 'loss/train': 1.0445173978805542} 11/07/2021 07:54:24 - INFO - __main__ - Step 75690: {'lr': 0.0002516503855043708, 'samples': 14532480, 'steps': 75689, 'loss/train': 1.0880403518676758} 11/07/2021 07:54:24 - INFO - __main__ - Step 75691: {'lr': 0.00025164507887528824, 'samples': 14532672, 'steps': 75690, 'loss/train': 1.2964693307876587} 11/07/2021 07:54:25 - INFO - __main__ - Step 75692: {'lr': 0.00025163977224546447, 'samples': 14532864, 'steps': 75691, 'loss/train': 1.2836792469024658} 11/07/2021 07:54:25 - INFO - __main__ - Step 75693: {'lr': 0.0002516344656149018, 'samples': 14533056, 'steps': 75692, 'loss/train': 0.9858590960502625} 11/07/2021 07:54:25 - INFO - __main__ - Step 75694: {'lr': 0.0002516291589836027, 'samples': 14533248, 'steps': 75693, 'loss/train': 1.4505378007888794} 11/07/2021 07:54:27 - INFO - __main__ - Step 75695: {'lr': 0.00025162385235156956, 'samples': 14533440, 'steps': 75694, 'loss/train': 1.3827908039093018} 11/07/2021 07:54:27 - INFO - __main__ - Step 75696: {'lr': 0.00025161854571880473, 'samples': 14533632, 'steps': 75695, 'loss/train': 1.2921391725540161} 11/07/2021 07:54:27 - INFO - __main__ - Step 75697: {'lr': 0.0002516132390853106, 'samples': 14533824, 'steps': 75696, 'loss/train': 1.1429665088653564} 11/07/2021 07:54:28 - INFO - __main__ - Step 75698: {'lr': 0.0002516079324510895, 'samples': 14534016, 'steps': 75697, 'loss/train': 0.8571052551269531} 11/07/2021 07:54:28 - INFO - __main__ - Step 75699: {'lr': 0.00025160262581614394, 'samples': 14534208, 'steps': 75698, 'loss/train': 1.7319378852844238} 11/07/2021 07:54:29 - INFO - __main__ - Step 75700: {'lr': 0.00025159731918047626, 'samples': 14534400, 'steps': 75699, 'loss/train': 1.5737743377685547} 11/07/2021 07:54:29 - INFO - __main__ - Step 75701: {'lr': 0.0002515920125440888, 'samples': 14534592, 'steps': 75700, 'loss/train': 1.8644418716430664} 11/07/2021 07:54:30 - INFO - __main__ - Step 75702: {'lr': 0.0002515867059069841, 'samples': 14534784, 'steps': 75701, 'loss/train': 0.8239907622337341} 11/07/2021 07:54:30 - INFO - __main__ - Step 75703: {'lr': 0.00025158139926916446, 'samples': 14534976, 'steps': 75702, 'loss/train': 1.1542997360229492} 11/07/2021 07:54:30 - INFO - __main__ - Step 75704: {'lr': 0.0002515760926306322, 'samples': 14535168, 'steps': 75703, 'loss/train': 1.384289264678955} 11/07/2021 07:54:32 - INFO - __main__ - Step 75705: {'lr': 0.00025157078599138976, 'samples': 14535360, 'steps': 75704, 'loss/train': 1.2131681442260742} 11/07/2021 07:54:32 - INFO - __main__ - Step 75706: {'lr': 0.0002515654793514396, 'samples': 14535552, 'steps': 75705, 'loss/train': 1.0983096361160278} 11/07/2021 07:54:32 - INFO - __main__ - Step 75707: {'lr': 0.000251560172710784, 'samples': 14535744, 'steps': 75706, 'loss/train': 1.666165828704834} 11/07/2021 07:54:33 - INFO - __main__ - Step 75708: {'lr': 0.00025155486606942546, 'samples': 14535936, 'steps': 75707, 'loss/train': 3.1949448585510254} 11/07/2021 07:54:33 - INFO - __main__ - Step 75709: {'lr': 0.00025154955942736636, 'samples': 14536128, 'steps': 75708, 'loss/train': 1.421794056892395} 11/07/2021 07:54:34 - INFO - __main__ - Step 75710: {'lr': 0.00025154425278460903, 'samples': 14536320, 'steps': 75709, 'loss/train': 1.4755805730819702} 11/07/2021 07:54:34 - INFO - __main__ - Step 75711: {'lr': 0.0002515389461411558, 'samples': 14536512, 'steps': 75710, 'loss/train': 1.5852946043014526} 11/07/2021 07:54:35 - INFO - __main__ - Step 75712: {'lr': 0.0002515336394970092, 'samples': 14536704, 'steps': 75711, 'loss/train': 1.6140424013137817} 11/07/2021 07:54:35 - INFO - __main__ - Step 75713: {'lr': 0.0002515283328521716, 'samples': 14536896, 'steps': 75712, 'loss/train': 1.3694586753845215} 11/07/2021 07:54:35 - INFO - __main__ - Step 75714: {'lr': 0.0002515230262066453, 'samples': 14537088, 'steps': 75713, 'loss/train': 1.8814243078231812} 11/07/2021 07:54:36 - INFO - __main__ - Step 75715: {'lr': 0.00025151771956043276, 'samples': 14537280, 'steps': 75714, 'loss/train': 0.963657796382904} 11/07/2021 07:54:37 - INFO - __main__ - Step 75716: {'lr': 0.00025151241291353644, 'samples': 14537472, 'steps': 75715, 'loss/train': 1.6715525388717651} 11/07/2021 07:54:37 - INFO - __main__ - Step 75717: {'lr': 0.0002515071062659586, 'samples': 14537664, 'steps': 75716, 'loss/train': 1.3477096557617188} 11/07/2021 07:54:37 - INFO - __main__ - Step 75718: {'lr': 0.0002515017996177016, 'samples': 14537856, 'steps': 75717, 'loss/train': 1.1758840084075928} 11/07/2021 07:54:38 - INFO - __main__ - Step 75719: {'lr': 0.000251496492968768, 'samples': 14538048, 'steps': 75718, 'loss/train': 0.9307201504707336} 11/07/2021 07:54:38 - INFO - __main__ - Step 75720: {'lr': 0.0002514911863191601, 'samples': 14538240, 'steps': 75719, 'loss/train': 1.7209351062774658} 11/07/2021 07:54:39 - INFO - __main__ - Step 75721: {'lr': 0.0002514858796688802, 'samples': 14538432, 'steps': 75720, 'loss/train': 1.549994707107544} 11/07/2021 07:54:40 - INFO - __main__ - Step 75722: {'lr': 0.00025148057301793085, 'samples': 14538624, 'steps': 75721, 'loss/train': 1.551364779472351} 11/07/2021 07:54:40 - INFO - __main__ - Step 75723: {'lr': 0.00025147526636631445, 'samples': 14538816, 'steps': 75722, 'loss/train': 1.327945590019226} 11/07/2021 07:54:40 - INFO - __main__ - Step 75724: {'lr': 0.0002514699597140332, 'samples': 14539008, 'steps': 75723, 'loss/train': 1.6277081966400146} 11/07/2021 07:54:41 - INFO - __main__ - Step 75725: {'lr': 0.00025146465306108965, 'samples': 14539200, 'steps': 75724, 'loss/train': 1.067014455795288} 11/07/2021 07:54:42 - INFO - __main__ - Step 75726: {'lr': 0.0002514593464074862, 'samples': 14539392, 'steps': 75725, 'loss/train': 1.6112264394760132} 11/07/2021 07:54:42 - INFO - __main__ - Step 75727: {'lr': 0.00025145403975322515, 'samples': 14539584, 'steps': 75726, 'loss/train': 1.1798861026763916} 11/07/2021 07:54:42 - INFO - __main__ - Step 75728: {'lr': 0.000251448733098309, 'samples': 14539776, 'steps': 75727, 'loss/train': 1.2803003787994385} 11/07/2021 07:54:43 - INFO - __main__ - Step 75729: {'lr': 0.00025144342644273996, 'samples': 14539968, 'steps': 75728, 'loss/train': 1.738198161125183} 11/07/2021 07:54:43 - INFO - __main__ - Step 75730: {'lr': 0.0002514381197865206, 'samples': 14540160, 'steps': 75729, 'loss/train': 1.1517688035964966} 11/07/2021 07:54:44 - INFO - __main__ - Step 75731: {'lr': 0.00025143281312965324, 'samples': 14540352, 'steps': 75730, 'loss/train': 1.2858792543411255} 11/07/2021 07:54:45 - INFO - __main__ - Step 75732: {'lr': 0.00025142750647214025, 'samples': 14540544, 'steps': 75731, 'loss/train': 1.539987325668335} 11/07/2021 07:54:45 - INFO - __main__ - Step 75733: {'lr': 0.00025142219981398405, 'samples': 14540736, 'steps': 75732, 'loss/train': 1.7778434753417969} 11/07/2021 07:54:45 - INFO - __main__ - Step 75734: {'lr': 0.00025141689315518704, 'samples': 14540928, 'steps': 75733, 'loss/train': 1.5534603595733643} 11/07/2021 07:54:46 - INFO - __main__ - Step 75735: {'lr': 0.0002514115864957516, 'samples': 14541120, 'steps': 75734, 'loss/train': 1.489758849143982} 11/07/2021 07:54:46 - INFO - __main__ - Step 75736: {'lr': 0.00025140627983568015, 'samples': 14541312, 'steps': 75735, 'loss/train': 2.2512059211730957} 11/07/2021 07:54:47 - INFO - __main__ - Step 75737: {'lr': 0.000251400973174975, 'samples': 14541504, 'steps': 75736, 'loss/train': 1.4729973077774048} 11/07/2021 07:54:48 - INFO - __main__ - Step 75738: {'lr': 0.0002513956665136387, 'samples': 14541696, 'steps': 75737, 'loss/train': 1.4445890188217163} 11/07/2021 07:54:48 - INFO - __main__ - Step 75739: {'lr': 0.00025139035985167335, 'samples': 14541888, 'steps': 75738, 'loss/train': 1.393940806388855} 11/07/2021 07:54:48 - INFO - __main__ - Step 75740: {'lr': 0.00025138505318908163, 'samples': 14542080, 'steps': 75739, 'loss/train': 0.9992956519126892} 11/07/2021 07:54:49 - INFO - __main__ - Step 75741: {'lr': 0.0002513797465258658, 'samples': 14542272, 'steps': 75740, 'loss/train': 0.5662015676498413} 11/07/2021 07:54:50 - INFO - __main__ - Step 75742: {'lr': 0.00025137443986202827, 'samples': 14542464, 'steps': 75741, 'loss/train': 1.3857320547103882} 11/07/2021 07:54:50 - INFO - __main__ - Step 75743: {'lr': 0.00025136913319757156, 'samples': 14542656, 'steps': 75742, 'loss/train': 1.6481612920761108} 11/07/2021 07:54:50 - INFO - __main__ - Step 75744: {'lr': 0.0002513638265324978, 'samples': 14542848, 'steps': 75743, 'loss/train': 0.9387490153312683} 11/07/2021 07:54:51 - INFO - __main__ - Step 75745: {'lr': 0.0002513585198668096, 'samples': 14543040, 'steps': 75744, 'loss/train': 1.1938430070877075} 11/07/2021 07:54:51 - INFO - __main__ - Step 75746: {'lr': 0.0002513532132005092, 'samples': 14543232, 'steps': 75745, 'loss/train': 1.9839527606964111} 11/07/2021 07:54:51 - INFO - __main__ - Step 75747: {'lr': 0.00025134790653359913, 'samples': 14543424, 'steps': 75746, 'loss/train': 1.3250399827957153} 11/07/2021 07:54:52 - INFO - __main__ - Step 75748: {'lr': 0.0002513425998660817, 'samples': 14543616, 'steps': 75747, 'loss/train': 0.590793788433075} 11/07/2021 07:54:53 - INFO - __main__ - Step 75749: {'lr': 0.0002513372931979593, 'samples': 14543808, 'steps': 75748, 'loss/train': 1.4008212089538574} 11/07/2021 07:54:53 - INFO - __main__ - Step 75750: {'lr': 0.00025133198652923437, 'samples': 14544000, 'steps': 75749, 'loss/train': 1.9301464557647705} 11/07/2021 07:54:53 - INFO - __main__ - Step 75751: {'lr': 0.00025132667985990927, 'samples': 14544192, 'steps': 75750, 'loss/train': 1.9906821250915527} 11/07/2021 07:54:54 - INFO - __main__ - Step 75752: {'lr': 0.00025132137318998633, 'samples': 14544384, 'steps': 75751, 'loss/train': 1.5343847274780273} 11/07/2021 07:54:55 - INFO - __main__ - Step 75753: {'lr': 0.00025131606651946796, 'samples': 14544576, 'steps': 75752, 'loss/train': 1.905678629875183} 11/07/2021 07:54:55 - INFO - __main__ - Step 75754: {'lr': 0.00025131075984835674, 'samples': 14544768, 'steps': 75753, 'loss/train': 1.8222870826721191} 11/07/2021 07:54:56 - INFO - __main__ - Step 75755: {'lr': 0.00025130545317665474, 'samples': 14544960, 'steps': 75754, 'loss/train': 0.7465057969093323} 11/07/2021 07:54:56 - INFO - __main__ - Step 75756: {'lr': 0.00025130014650436467, 'samples': 14545152, 'steps': 75755, 'loss/train': 1.4906561374664307} 11/07/2021 07:54:56 - INFO - __main__ - Step 75757: {'lr': 0.0002512948398314887, 'samples': 14545344, 'steps': 75756, 'loss/train': 1.7952643632888794} 11/07/2021 07:54:57 - INFO - __main__ - Step 75758: {'lr': 0.0002512895331580293, 'samples': 14545536, 'steps': 75757, 'loss/train': 0.6069292426109314} 11/07/2021 07:54:58 - INFO - __main__ - Step 75759: {'lr': 0.0002512842264839889, 'samples': 14545728, 'steps': 75758, 'loss/train': 1.7441288232803345} 11/07/2021 07:54:58 - INFO - __main__ - Step 75760: {'lr': 0.0002512789198093698, 'samples': 14545920, 'steps': 75759, 'loss/train': 0.6146652102470398} 11/07/2021 07:54:58 - INFO - __main__ - Step 75761: {'lr': 0.00025127361313417445, 'samples': 14546112, 'steps': 75760, 'loss/train': 1.4223120212554932} 11/07/2021 07:54:59 - INFO - __main__ - Step 75762: {'lr': 0.0002512683064584052, 'samples': 14546304, 'steps': 75761, 'loss/train': 1.4900442361831665} 11/07/2021 07:55:00 - INFO - __main__ - Step 75763: {'lr': 0.00025126299978206457, 'samples': 14546496, 'steps': 75762, 'loss/train': 1.2148975133895874} 11/07/2021 07:55:00 - INFO - __main__ - Step 75764: {'lr': 0.00025125769310515477, 'samples': 14546688, 'steps': 75763, 'loss/train': 1.4039983749389648} 11/07/2021 07:55:00 - INFO - __main__ - Step 75765: {'lr': 0.0002512523864276783, 'samples': 14546880, 'steps': 75764, 'loss/train': 1.716384768486023} 11/07/2021 07:55:01 - INFO - __main__ - Step 75766: {'lr': 0.0002512470797496375, 'samples': 14547072, 'steps': 75765, 'loss/train': 1.2631378173828125} 11/07/2021 07:55:01 - INFO - __main__ - Step 75767: {'lr': 0.0002512417730710348, 'samples': 14547264, 'steps': 75766, 'loss/train': 1.3997573852539062} 11/07/2021 07:55:02 - INFO - __main__ - Step 75768: {'lr': 0.00025123646639187256, 'samples': 14547456, 'steps': 75767, 'loss/train': 1.206789493560791} 11/07/2021 07:55:03 - INFO - __main__ - Step 75769: {'lr': 0.00025123115971215315, 'samples': 14547648, 'steps': 75768, 'loss/train': 1.8523763418197632} 11/07/2021 07:55:03 - INFO - __main__ - Step 75770: {'lr': 0.0002512258530318791, 'samples': 14547840, 'steps': 75769, 'loss/train': 1.4473812580108643} 11/07/2021 07:55:03 - INFO - __main__ - Step 75771: {'lr': 0.0002512205463510527, 'samples': 14548032, 'steps': 75770, 'loss/train': 2.0976192951202393} 11/07/2021 07:55:04 - INFO - __main__ - Step 75772: {'lr': 0.00025121523966967625, 'samples': 14548224, 'steps': 75771, 'loss/train': 1.2263219356536865} 11/07/2021 07:55:04 - INFO - __main__ - Step 75773: {'lr': 0.00025120993298775223, 'samples': 14548416, 'steps': 75772, 'loss/train': 1.5887324810028076} 11/07/2021 07:55:05 - INFO - __main__ - Step 75774: {'lr': 0.00025120462630528307, 'samples': 14548608, 'steps': 75773, 'loss/train': 1.5171616077423096} 11/07/2021 07:55:06 - INFO - __main__ - Step 75775: {'lr': 0.00025119931962227116, 'samples': 14548800, 'steps': 75774, 'loss/train': 0.6858618259429932} 11/07/2021 07:55:06 - INFO - __main__ - Step 75776: {'lr': 0.00025119401293871883, 'samples': 14548992, 'steps': 75775, 'loss/train': 0.5375427007675171} 11/07/2021 07:55:07 - INFO - __main__ - Step 75777: {'lr': 0.0002511887062546285, 'samples': 14549184, 'steps': 75776, 'loss/train': 1.63423752784729} 11/07/2021 07:55:07 - INFO - __main__ - Step 75778: {'lr': 0.0002511833995700025, 'samples': 14549376, 'steps': 75777, 'loss/train': 1.3907642364501953} 11/07/2021 07:55:08 - INFO - __main__ - Step 75779: {'lr': 0.00025117809288484333, 'samples': 14549568, 'steps': 75778, 'loss/train': 1.1394309997558594} 11/07/2021 07:55:08 - INFO - __main__ - Step 75780: {'lr': 0.00025117278619915333, 'samples': 14549760, 'steps': 75779, 'loss/train': 1.1416680812835693} 11/07/2021 07:55:09 - INFO - __main__ - Step 75781: {'lr': 0.0002511674795129349, 'samples': 14549952, 'steps': 75780, 'loss/train': 1.7226893901824951} 11/07/2021 07:55:09 - INFO - __main__ - Step 75782: {'lr': 0.0002511621728261904, 'samples': 14550144, 'steps': 75781, 'loss/train': 1.511375069618225} 11/07/2021 07:55:09 - INFO - __main__ - Step 75783: {'lr': 0.0002511568661389223, 'samples': 14550336, 'steps': 75782, 'loss/train': 1.5504728555679321} 11/07/2021 07:55:10 - INFO - __main__ - Step 75784: {'lr': 0.0002511515594511329, 'samples': 14550528, 'steps': 75783, 'loss/train': 1.1704851388931274} 11/07/2021 07:55:11 - INFO - __main__ - Step 75785: {'lr': 0.00025114625276282456, 'samples': 14550720, 'steps': 75784, 'loss/train': 1.083986759185791} 11/07/2021 07:55:11 - INFO - __main__ - Step 75786: {'lr': 0.0002511409460739998, 'samples': 14550912, 'steps': 75785, 'loss/train': 1.1750513315200806} 11/07/2021 07:55:11 - INFO - __main__ - Step 75787: {'lr': 0.00025113563938466087, 'samples': 14551104, 'steps': 75786, 'loss/train': 1.0274572372436523} 11/07/2021 07:55:12 - INFO - __main__ - Step 75788: {'lr': 0.00025113033269481036, 'samples': 14551296, 'steps': 75787, 'loss/train': 1.6511693000793457} 11/07/2021 07:55:12 - INFO - __main__ - Step 75789: {'lr': 0.0002511250260044505, 'samples': 14551488, 'steps': 75788, 'loss/train': 1.5070381164550781} 11/07/2021 07:55:13 - INFO - __main__ - Step 75790: {'lr': 0.0002511197193135837, 'samples': 14551680, 'steps': 75789, 'loss/train': 0.7856423854827881} 11/07/2021 07:55:14 - INFO - __main__ - Step 75791: {'lr': 0.00025111441262221237, 'samples': 14551872, 'steps': 75790, 'loss/train': 1.1542682647705078} 11/07/2021 07:55:14 - INFO - __main__ - Step 75792: {'lr': 0.00025110910593033884, 'samples': 14552064, 'steps': 75791, 'loss/train': 0.6642158031463623} 11/07/2021 07:55:14 - INFO - __main__ - Step 75793: {'lr': 0.00025110379923796566, 'samples': 14552256, 'steps': 75792, 'loss/train': 1.4181135892868042} 11/07/2021 07:55:15 - INFO - __main__ - Step 75794: {'lr': 0.00025109849254509515, 'samples': 14552448, 'steps': 75793, 'loss/train': 1.6040147542953491} 11/07/2021 07:55:16 - INFO - __main__ - Step 75795: {'lr': 0.0002510931858517296, 'samples': 14552640, 'steps': 75794, 'loss/train': 1.4291069507598877} 11/07/2021 07:55:16 - INFO - __main__ - Step 75796: {'lr': 0.0002510878791578715, 'samples': 14552832, 'steps': 75795, 'loss/train': 1.5616148710250854} 11/07/2021 07:55:16 - INFO - __main__ - Step 75797: {'lr': 0.0002510825724635232, 'samples': 14553024, 'steps': 75796, 'loss/train': 1.261106252670288} 11/07/2021 07:55:17 - INFO - __main__ - Step 75798: {'lr': 0.0002510772657686871, 'samples': 14553216, 'steps': 75797, 'loss/train': 1.0835176706314087} 11/07/2021 07:55:17 - INFO - __main__ - Step 75799: {'lr': 0.00025107195907336566, 'samples': 14553408, 'steps': 75798, 'loss/train': 0.9044124484062195} 11/07/2021 07:55:18 - INFO - __main__ - Step 75800: {'lr': 0.0002510666523775612, 'samples': 14553600, 'steps': 75799, 'loss/train': 1.6345882415771484} 11/07/2021 07:55:18 - INFO - __main__ - Step 75801: {'lr': 0.0002510613456812761, 'samples': 14553792, 'steps': 75800, 'loss/train': 0.47598397731781006} 11/07/2021 07:55:19 - INFO - __main__ - Step 75802: {'lr': 0.00025105603898451276, 'samples': 14553984, 'steps': 75801, 'loss/train': 1.2344043254852295} 11/07/2021 07:55:19 - INFO - __main__ - Step 75803: {'lr': 0.0002510507322872736, 'samples': 14554176, 'steps': 75802, 'loss/train': 0.8119331002235413} 11/07/2021 07:55:19 - INFO - __main__ - Step 75804: {'lr': 0.000251045425589561, 'samples': 14554368, 'steps': 75803, 'loss/train': 1.6642518043518066} 11/07/2021 07:55:20 - INFO - __main__ - Step 75805: {'lr': 0.0002510401188913774, 'samples': 14554560, 'steps': 75804, 'loss/train': 1.1600234508514404} 11/07/2021 07:55:21 - INFO - __main__ - Step 75806: {'lr': 0.00025103481219272504, 'samples': 14554752, 'steps': 75805, 'loss/train': 0.6865310668945312} 11/07/2021 07:55:21 - INFO - __main__ - Step 75807: {'lr': 0.0002510295054936065, 'samples': 14554944, 'steps': 75806, 'loss/train': 1.3993583917617798} 11/07/2021 07:55:21 - INFO - __main__ - Step 75808: {'lr': 0.00025102419879402397, 'samples': 14555136, 'steps': 75807, 'loss/train': 1.2370327711105347} 11/07/2021 07:55:22 - INFO - __main__ - Step 75809: {'lr': 0.00025101889209398006, 'samples': 14555328, 'steps': 75808, 'loss/train': 1.0978151559829712} 11/07/2021 07:55:23 - INFO - __main__ - Step 75810: {'lr': 0.000251013585393477, 'samples': 14555520, 'steps': 75809, 'loss/train': 1.6947129964828491} 11/07/2021 07:55:23 - INFO - __main__ - Step 75811: {'lr': 0.00025100827869251724, 'samples': 14555712, 'steps': 75810, 'loss/train': 1.4553614854812622} 11/07/2021 07:55:24 - INFO - __main__ - Step 75812: {'lr': 0.00025100297199110317, 'samples': 14555904, 'steps': 75811, 'loss/train': 1.11416494846344} 11/07/2021 07:55:24 - INFO - __main__ - Step 75813: {'lr': 0.0002509976652892372, 'samples': 14556096, 'steps': 75812, 'loss/train': 1.4311691522598267} 11/07/2021 07:55:24 - INFO - __main__ - Step 75814: {'lr': 0.0002509923585869216, 'samples': 14556288, 'steps': 75813, 'loss/train': 1.7212860584259033} 11/07/2021 07:55:25 - INFO - __main__ - Step 75815: {'lr': 0.00025098705188415896, 'samples': 14556480, 'steps': 75814, 'loss/train': 1.7334593534469604} 11/07/2021 07:55:26 - INFO - __main__ - Step 75816: {'lr': 0.0002509817451809515, 'samples': 14556672, 'steps': 75815, 'loss/train': 1.2235658168792725} 11/07/2021 07:55:26 - INFO - __main__ - Step 75817: {'lr': 0.0002509764384773018, 'samples': 14556864, 'steps': 75816, 'loss/train': 1.5162497758865356} 11/07/2021 07:55:26 - INFO - __main__ - Step 75818: {'lr': 0.00025097113177321203, 'samples': 14557056, 'steps': 75817, 'loss/train': 1.3463454246520996} 11/07/2021 07:55:27 - INFO - __main__ - Step 75819: {'lr': 0.0002509658250686847, 'samples': 14557248, 'steps': 75818, 'loss/train': 1.4849716424942017} 11/07/2021 07:55:28 - INFO - __main__ - Step 75820: {'lr': 0.00025096051836372217, 'samples': 14557440, 'steps': 75819, 'loss/train': 0.6309524178504944} 11/07/2021 07:55:28 - INFO - __main__ - Step 75821: {'lr': 0.00025095521165832685, 'samples': 14557632, 'steps': 75820, 'loss/train': 2.0421700477600098} 11/07/2021 07:55:29 - INFO - __main__ - Step 75822: {'lr': 0.00025094990495250116, 'samples': 14557824, 'steps': 75821, 'loss/train': 0.7481685280799866} 11/07/2021 07:55:29 - INFO - __main__ - Step 75823: {'lr': 0.0002509445982462475, 'samples': 14558016, 'steps': 75822, 'loss/train': 1.1436259746551514} 11/07/2021 07:55:29 - INFO - __main__ - Step 75824: {'lr': 0.00025093929153956814, 'samples': 14558208, 'steps': 75823, 'loss/train': 0.5035507678985596} 11/07/2021 07:55:30 - INFO - __main__ - Step 75825: {'lr': 0.00025093398483246553, 'samples': 14558400, 'steps': 75824, 'loss/train': 0.6510461568832397} 11/07/2021 07:55:31 - INFO - __main__ - Step 75826: {'lr': 0.00025092867812494214, 'samples': 14558592, 'steps': 75825, 'loss/train': 1.294669508934021} 11/07/2021 07:55:31 - INFO - __main__ - Step 75827: {'lr': 0.00025092337141700025, 'samples': 14558784, 'steps': 75826, 'loss/train': 1.8018057346343994} 11/07/2021 07:55:31 - INFO - __main__ - Step 75828: {'lr': 0.0002509180647086423, 'samples': 14558976, 'steps': 75827, 'loss/train': 0.9486640691757202} 11/07/2021 07:55:32 - INFO - __main__ - Step 75829: {'lr': 0.0002509127579998707, 'samples': 14559168, 'steps': 75828, 'loss/train': 1.246127963066101} 11/07/2021 07:55:33 - INFO - __main__ - Step 75830: {'lr': 0.00025090745129068795, 'samples': 14559360, 'steps': 75829, 'loss/train': 1.5460243225097656} 11/07/2021 07:55:33 - INFO - __main__ - Step 75831: {'lr': 0.0002509021445810962, 'samples': 14559552, 'steps': 75830, 'loss/train': 1.3021342754364014} 11/07/2021 07:55:33 - INFO - __main__ - Step 75832: {'lr': 0.0002508968378710979, 'samples': 14559744, 'steps': 75831, 'loss/train': 1.1552554368972778} 11/07/2021 07:55:34 - INFO - __main__ - Step 75833: {'lr': 0.00025089153116069555, 'samples': 14559936, 'steps': 75832, 'loss/train': 1.1962554454803467} 11/07/2021 07:55:34 - INFO - __main__ - Step 75834: {'lr': 0.00025088622444989153, 'samples': 14560128, 'steps': 75833, 'loss/train': 1.747011661529541} 11/07/2021 07:55:35 - INFO - __main__ - Step 75835: {'lr': 0.00025088091773868814, 'samples': 14560320, 'steps': 75834, 'loss/train': 1.7209023237228394} 11/07/2021 07:55:36 - INFO - __main__ - Step 75836: {'lr': 0.0002508756110270878, 'samples': 14560512, 'steps': 75835, 'loss/train': 1.5676599740982056} 11/07/2021 07:55:36 - INFO - __main__ - Step 75837: {'lr': 0.0002508703043150931, 'samples': 14560704, 'steps': 75836, 'loss/train': 1.6117311716079712} 11/07/2021 07:55:36 - INFO - __main__ - Step 75838: {'lr': 0.00025086499760270607, 'samples': 14560896, 'steps': 75837, 'loss/train': 0.9887049198150635} 11/07/2021 07:55:37 - INFO - __main__ - Step 75839: {'lr': 0.00025085969088992934, 'samples': 14561088, 'steps': 75838, 'loss/train': 1.0363117456436157} 11/07/2021 07:55:37 - INFO - __main__ - Step 75840: {'lr': 0.0002508543841767653, 'samples': 14561280, 'steps': 75839, 'loss/train': 1.8424348831176758} 11/07/2021 07:55:38 - INFO - __main__ - Step 75841: {'lr': 0.00025084907746321616, 'samples': 14561472, 'steps': 75840, 'loss/train': 1.3220397233963013} 11/07/2021 07:55:38 - INFO - __main__ - Step 75842: {'lr': 0.0002508437707492845, 'samples': 14561664, 'steps': 75841, 'loss/train': 0.8469317555427551} 11/07/2021 07:55:39 - INFO - __main__ - Step 75843: {'lr': 0.0002508384640349727, 'samples': 14561856, 'steps': 75842, 'loss/train': 1.327317714691162} 11/07/2021 07:55:39 - INFO - __main__ - Step 75844: {'lr': 0.00025083315732028305, 'samples': 14562048, 'steps': 75843, 'loss/train': 1.4277927875518799} 11/07/2021 07:55:39 - INFO - __main__ - Step 75845: {'lr': 0.000250827850605218, 'samples': 14562240, 'steps': 75844, 'loss/train': 1.1526923179626465} 11/07/2021 07:55:41 - INFO - __main__ - Step 75846: {'lr': 0.00025082254388977994, 'samples': 14562432, 'steps': 75845, 'loss/train': 1.6198941469192505} 11/07/2021 07:55:41 - INFO - __main__ - Step 75847: {'lr': 0.00025081723717397124, 'samples': 14562624, 'steps': 75846, 'loss/train': 1.5997467041015625} 11/07/2021 07:55:41 - INFO - __main__ - Step 75848: {'lr': 0.00025081193045779434, 'samples': 14562816, 'steps': 75847, 'loss/train': 1.7412569522857666} 11/07/2021 07:55:42 - INFO - __main__ - Step 75849: {'lr': 0.0002508066237412516, 'samples': 14563008, 'steps': 75848, 'loss/train': 1.4970424175262451} 11/07/2021 07:55:42 - INFO - __main__ - Step 75850: {'lr': 0.0002508013170243454, 'samples': 14563200, 'steps': 75849, 'loss/train': 1.296249270439148} 11/07/2021 07:55:43 - INFO - __main__ - Step 75851: {'lr': 0.0002507960103070781, 'samples': 14563392, 'steps': 75850, 'loss/train': 1.6220232248306274} 11/07/2021 07:55:43 - INFO - __main__ - Step 75852: {'lr': 0.00025079070358945214, 'samples': 14563584, 'steps': 75851, 'loss/train': 1.1596626043319702} 11/07/2021 07:55:44 - INFO - __main__ - Step 75853: {'lr': 0.0002507853968714699, 'samples': 14563776, 'steps': 75852, 'loss/train': 1.1841484308242798} 11/07/2021 07:55:44 - INFO - __main__ - Step 75854: {'lr': 0.0002507800901531338, 'samples': 14563968, 'steps': 75853, 'loss/train': 1.5168920755386353} 11/07/2021 07:55:44 - INFO - __main__ - Step 75855: {'lr': 0.00025077478343444616, 'samples': 14564160, 'steps': 75854, 'loss/train': 1.3000717163085938} 11/07/2021 07:55:45 - INFO - __main__ - Step 75856: {'lr': 0.00025076947671540947, 'samples': 14564352, 'steps': 75855, 'loss/train': 0.9143718481063843} 11/07/2021 07:55:46 - INFO - __main__ - Step 75857: {'lr': 0.0002507641699960261, 'samples': 14564544, 'steps': 75856, 'loss/train': 1.1748628616333008} 11/07/2021 07:55:46 - INFO - __main__ - Step 75858: {'lr': 0.00025075886327629833, 'samples': 14564736, 'steps': 75857, 'loss/train': 2.1484780311584473} 11/07/2021 07:55:46 - INFO - __main__ - Step 75859: {'lr': 0.0002507535565562286, 'samples': 14564928, 'steps': 75858, 'loss/train': 1.1951005458831787} 11/07/2021 07:55:47 - INFO - __main__ - Step 75860: {'lr': 0.0002507482498358194, 'samples': 14565120, 'steps': 75859, 'loss/train': 1.4508905410766602} 11/07/2021 07:55:48 - INFO - __main__ - Step 75861: {'lr': 0.000250742943115073, 'samples': 14565312, 'steps': 75860, 'loss/train': 1.1010336875915527} 11/07/2021 07:55:48 - INFO - __main__ - Step 75862: {'lr': 0.0002507376363939918, 'samples': 14565504, 'steps': 75861, 'loss/train': 1.4081711769104004} 11/07/2021 07:55:49 - INFO - __main__ - Step 75863: {'lr': 0.00025073232967257834, 'samples': 14565696, 'steps': 75862, 'loss/train': 1.1816424131393433} 11/07/2021 07:55:49 - INFO - __main__ - Step 75864: {'lr': 0.00025072702295083493, 'samples': 14565888, 'steps': 75863, 'loss/train': 1.511121153831482} 11/07/2021 07:55:49 - INFO - __main__ - Step 75865: {'lr': 0.0002507217162287638, 'samples': 14566080, 'steps': 75864, 'loss/train': 1.0832107067108154} 11/07/2021 07:55:50 - INFO - __main__ - Step 75866: {'lr': 0.00025071640950636757, 'samples': 14566272, 'steps': 75865, 'loss/train': 1.5941731929779053} 11/07/2021 07:55:50 - INFO - __main__ - Step 75867: {'lr': 0.0002507111027836485, 'samples': 14566464, 'steps': 75866, 'loss/train': 1.504408359527588} 11/07/2021 07:55:51 - INFO - __main__ - Step 75868: {'lr': 0.000250705796060609, 'samples': 14566656, 'steps': 75867, 'loss/train': 1.478071928024292} 11/07/2021 07:55:51 - INFO - __main__ - Step 75869: {'lr': 0.0002507004893372515, 'samples': 14566848, 'steps': 75868, 'loss/train': 1.435184121131897} 11/07/2021 07:55:52 - INFO - __main__ - Step 75870: {'lr': 0.00025069518261357844, 'samples': 14567040, 'steps': 75869, 'loss/train': 0.9867746233940125} 11/07/2021 07:55:52 - INFO - __main__ - Step 75871: {'lr': 0.0002506898758895921, 'samples': 14567232, 'steps': 75870, 'loss/train': 1.5490058660507202} 11/07/2021 07:55:53 - INFO - __main__ - Step 75872: {'lr': 0.00025068456916529485, 'samples': 14567424, 'steps': 75871, 'loss/train': 1.461609125137329} 11/07/2021 07:55:53 - INFO - __main__ - Step 75873: {'lr': 0.00025067926244068915, 'samples': 14567616, 'steps': 75872, 'loss/train': 1.3181471824645996} 11/07/2021 07:55:54 - INFO - __main__ - Step 75874: {'lr': 0.00025067395571577744, 'samples': 14567808, 'steps': 75873, 'loss/train': 1.3092252016067505} 11/07/2021 07:55:54 - INFO - __main__ - Step 75875: {'lr': 0.00025066864899056204, 'samples': 14568000, 'steps': 75874, 'loss/train': 1.5703142881393433} 11/07/2021 07:55:54 - INFO - __main__ - Step 75876: {'lr': 0.00025066334226504533, 'samples': 14568192, 'steps': 75875, 'loss/train': 1.1497797966003418} 11/07/2021 07:55:55 - INFO - __main__ - Step 75877: {'lr': 0.0002506580355392298, 'samples': 14568384, 'steps': 75876, 'loss/train': 1.5915114879608154} 11/07/2021 07:55:56 - INFO - __main__ - Step 75878: {'lr': 0.0002506527288131177, 'samples': 14568576, 'steps': 75877, 'loss/train': 0.7852066159248352} 11/07/2021 07:55:56 - INFO - __main__ - Step 75879: {'lr': 0.0002506474220867115, 'samples': 14568768, 'steps': 75878, 'loss/train': 1.5623735189437866} 11/07/2021 07:55:56 - INFO - __main__ - Step 75880: {'lr': 0.00025064211536001356, 'samples': 14568960, 'steps': 75879, 'loss/train': 1.384422779083252} 11/07/2021 07:55:57 - INFO - __main__ - Step 75881: {'lr': 0.00025063680863302636, 'samples': 14569152, 'steps': 75880, 'loss/train': 1.6964831352233887} 11/07/2021 07:55:58 - INFO - __main__ - Step 75882: {'lr': 0.00025063150190575217, 'samples': 14569344, 'steps': 75881, 'loss/train': 1.0762230157852173} 11/07/2021 07:55:58 - INFO - __main__ - Step 75883: {'lr': 0.0002506261951781935, 'samples': 14569536, 'steps': 75882, 'loss/train': 1.385465383529663} 11/07/2021 07:55:59 - INFO - __main__ - Step 75884: {'lr': 0.00025062088845035263, 'samples': 14569728, 'steps': 75883, 'loss/train': 0.8452988266944885} 11/07/2021 07:55:59 - INFO - __main__ - Step 75885: {'lr': 0.000250615581722232, 'samples': 14569920, 'steps': 75884, 'loss/train': 1.5801970958709717} 11/07/2021 07:55:59 - INFO - __main__ - Step 75886: {'lr': 0.000250610274993834, 'samples': 14570112, 'steps': 75885, 'loss/train': 1.4315427541732788} 11/07/2021 07:56:00 - INFO - __main__ - Step 75887: {'lr': 0.000250604968265161, 'samples': 14570304, 'steps': 75886, 'loss/train': 1.4028500318527222} 11/07/2021 07:56:01 - INFO - __main__ - Step 75888: {'lr': 0.0002505996615362154, 'samples': 14570496, 'steps': 75887, 'loss/train': 1.40639066696167} 11/07/2021 07:56:01 - INFO - __main__ - Step 75889: {'lr': 0.0002505943548069996, 'samples': 14570688, 'steps': 75888, 'loss/train': 1.2188544273376465} 11/07/2021 07:56:01 - INFO - __main__ - Step 75890: {'lr': 0.00025058904807751604, 'samples': 14570880, 'steps': 75889, 'loss/train': 1.3342626094818115} 11/07/2021 07:56:02 - INFO - __main__ - Step 75891: {'lr': 0.00025058374134776705, 'samples': 14571072, 'steps': 75890, 'loss/train': 1.463808298110962} 11/07/2021 07:56:02 - INFO - __main__ - Step 75892: {'lr': 0.00025057843461775503, 'samples': 14571264, 'steps': 75891, 'loss/train': 1.1501171588897705} 11/07/2021 07:56:03 - INFO - __main__ - Step 75893: {'lr': 0.00025057312788748237, 'samples': 14571456, 'steps': 75892, 'loss/train': 1.565967321395874} 11/07/2021 07:56:03 - INFO - __main__ - Step 75894: {'lr': 0.0002505678211569515, 'samples': 14571648, 'steps': 75893, 'loss/train': 1.2646021842956543} 11/07/2021 07:56:04 - INFO - __main__ - Step 75895: {'lr': 0.0002505625144261647, 'samples': 14571840, 'steps': 75894, 'loss/train': 1.240929365158081} 11/07/2021 07:56:04 - INFO - __main__ - Step 75896: {'lr': 0.0002505572076951245, 'samples': 14572032, 'steps': 75895, 'loss/train': 0.9666742086410522} 11/07/2021 07:56:04 - INFO - __main__ - Step 75897: {'lr': 0.0002505519009638332, 'samples': 14572224, 'steps': 75896, 'loss/train': 1.1939209699630737} 11/07/2021 07:56:05 - INFO - __main__ - Step 75898: {'lr': 0.0002505465942322933, 'samples': 14572416, 'steps': 75897, 'loss/train': 1.2114617824554443} 11/07/2021 07:56:06 - INFO - __main__ - Step 75899: {'lr': 0.00025054128750050703, 'samples': 14572608, 'steps': 75898, 'loss/train': 1.8133254051208496} 11/07/2021 07:56:06 - INFO - __main__ - Step 75900: {'lr': 0.0002505359807684769, 'samples': 14572800, 'steps': 75899, 'loss/train': 1.9103953838348389} 11/07/2021 07:56:06 - INFO - __main__ - Step 75901: {'lr': 0.0002505306740362052, 'samples': 14572992, 'steps': 75900, 'loss/train': 1.4682319164276123} 11/07/2021 07:56:07 - INFO - __main__ - Step 75902: {'lr': 0.00025052536730369444, 'samples': 14573184, 'steps': 75901, 'loss/train': 0.9135860204696655} 11/07/2021 07:56:08 - INFO - __main__ - Step 75903: {'lr': 0.00025052006057094703, 'samples': 14573376, 'steps': 75902, 'loss/train': 1.0640478134155273} 11/07/2021 07:56:08 - INFO - __main__ - Step 75904: {'lr': 0.0002505147538379652, 'samples': 14573568, 'steps': 75903, 'loss/train': 1.5892547369003296} 11/07/2021 07:56:09 - INFO - __main__ - Step 75905: {'lr': 0.0002505094471047515, 'samples': 14573760, 'steps': 75904, 'loss/train': 1.1155338287353516} 11/07/2021 07:56:09 - INFO - __main__ - Step 75906: {'lr': 0.00025050414037130814, 'samples': 14573952, 'steps': 75905, 'loss/train': 1.25679349899292} 11/07/2021 07:56:09 - INFO - __main__ - Step 75907: {'lr': 0.0002504988336376377, 'samples': 14574144, 'steps': 75906, 'loss/train': 1.4876989126205444} 11/07/2021 07:56:10 - INFO - __main__ - Step 75908: {'lr': 0.00025049352690374244, 'samples': 14574336, 'steps': 75907, 'loss/train': 1.1901755332946777} 11/07/2021 07:56:11 - INFO - __main__ - Step 75909: {'lr': 0.00025048822016962487, 'samples': 14574528, 'steps': 75908, 'loss/train': 1.3865257501602173} 11/07/2021 07:56:11 - INFO - __main__ - Step 75910: {'lr': 0.0002504829134352872, 'samples': 14574720, 'steps': 75909, 'loss/train': 1.9152882099151611} 11/07/2021 07:56:12 - INFO - __main__ - Step 75911: {'lr': 0.0002504776067007321, 'samples': 14574912, 'steps': 75910, 'loss/train': 0.848821759223938} 11/07/2021 07:56:12 - INFO - __main__ - Step 75912: {'lr': 0.0002504722999659617, 'samples': 14575104, 'steps': 75911, 'loss/train': 1.5025463104248047} 11/07/2021 07:56:12 - INFO - __main__ - Step 75913: {'lr': 0.00025046699323097855, 'samples': 14575296, 'steps': 75912, 'loss/train': 1.677574634552002} 11/07/2021 07:56:13 - INFO - __main__ - Step 75914: {'lr': 0.0002504616864957849, 'samples': 14575488, 'steps': 75913, 'loss/train': 1.175293207168579} 11/07/2021 07:56:14 - INFO - __main__ - Step 75915: {'lr': 0.00025045637976038327, 'samples': 14575680, 'steps': 75914, 'loss/train': 1.324916958808899} 11/07/2021 07:56:14 - INFO - __main__ - Step 75916: {'lr': 0.000250451073024776, 'samples': 14575872, 'steps': 75915, 'loss/train': 1.648932933807373} 11/07/2021 07:56:14 - INFO - __main__ - Step 75917: {'lr': 0.00025044576628896546, 'samples': 14576064, 'steps': 75916, 'loss/train': 0.9209709167480469} 11/07/2021 07:56:15 - INFO - __main__ - Step 75918: {'lr': 0.0002504404595529541, 'samples': 14576256, 'steps': 75917, 'loss/train': 1.3566973209381104} 11/07/2021 07:56:16 - INFO - __main__ - Step 75919: {'lr': 0.0002504351528167443, 'samples': 14576448, 'steps': 75918, 'loss/train': 1.8411492109298706} 11/07/2021 07:56:16 - INFO - __main__ - Step 75920: {'lr': 0.0002504298460803383, 'samples': 14576640, 'steps': 75919, 'loss/train': 1.5431208610534668} 11/07/2021 07:56:16 - INFO - __main__ - Step 75921: {'lr': 0.00025042453934373874, 'samples': 14576832, 'steps': 75920, 'loss/train': 1.6588026285171509} 11/07/2021 07:56:17 - INFO - __main__ - Step 75922: {'lr': 0.0002504192326069478, 'samples': 14577024, 'steps': 75921, 'loss/train': 1.4866636991500854} 11/07/2021 07:56:17 - INFO - __main__ - Step 75923: {'lr': 0.0002504139258699681, 'samples': 14577216, 'steps': 75922, 'loss/train': 0.6049333214759827} 11/07/2021 07:56:18 - INFO - __main__ - Step 75924: {'lr': 0.00025040861913280175, 'samples': 14577408, 'steps': 75923, 'loss/train': 1.0636184215545654} 11/07/2021 07:56:19 - INFO - __main__ - Step 75925: {'lr': 0.0002504033123954513, 'samples': 14577600, 'steps': 75924, 'loss/train': 1.4197412729263306} 11/07/2021 07:56:19 - INFO - __main__ - Step 75926: {'lr': 0.0002503980056579192, 'samples': 14577792, 'steps': 75925, 'loss/train': 1.4449059963226318} 11/07/2021 07:56:19 - INFO - __main__ - Step 75927: {'lr': 0.00025039269892020773, 'samples': 14577984, 'steps': 75926, 'loss/train': 1.2717106342315674} 11/07/2021 07:56:20 - INFO - __main__ - Step 75928: {'lr': 0.00025038739218231925, 'samples': 14578176, 'steps': 75927, 'loss/train': 1.1692113876342773} 11/07/2021 07:56:21 - INFO - __main__ - Step 75929: {'lr': 0.00025038208544425633, 'samples': 14578368, 'steps': 75928, 'loss/train': 1.6836061477661133} 11/07/2021 07:56:21 - INFO - __main__ - Step 75930: {'lr': 0.00025037677870602123, 'samples': 14578560, 'steps': 75929, 'loss/train': 1.224339485168457} 11/07/2021 07:56:22 - INFO - __main__ - Step 75931: {'lr': 0.0002503714719676163, 'samples': 14578752, 'steps': 75930, 'loss/train': 1.3133459091186523} 11/07/2021 07:56:22 - INFO - __main__ - Step 75932: {'lr': 0.000250366165229044, 'samples': 14578944, 'steps': 75931, 'loss/train': 1.6120251417160034} 11/07/2021 07:56:22 - INFO - __main__ - Step 75933: {'lr': 0.0002503608584903067, 'samples': 14579136, 'steps': 75932, 'loss/train': 1.7198761701583862} 11/07/2021 07:56:23 - INFO - __main__ - Step 75934: {'lr': 0.0002503555517514069, 'samples': 14579328, 'steps': 75933, 'loss/train': 1.7586578130722046} 11/07/2021 07:56:24 - INFO - __main__ - Step 75935: {'lr': 0.0002503502450123468, 'samples': 14579520, 'steps': 75934, 'loss/train': 1.6710827350616455} 11/07/2021 07:56:24 - INFO - __main__ - Step 75936: {'lr': 0.00025034493827312895, 'samples': 14579712, 'steps': 75935, 'loss/train': 1.3910273313522339} 11/07/2021 07:56:24 - INFO - __main__ - Step 75937: {'lr': 0.00025033963153375555, 'samples': 14579904, 'steps': 75936, 'loss/train': 2.0271267890930176} 11/07/2021 07:56:25 - INFO - __main__ - Step 75938: {'lr': 0.0002503343247942292, 'samples': 14580096, 'steps': 75937, 'loss/train': 1.5978237390518188} 11/07/2021 07:56:25 - INFO - __main__ - Step 75939: {'lr': 0.0002503290180545522, 'samples': 14580288, 'steps': 75938, 'loss/train': 1.3051180839538574} 11/07/2021 07:56:26 - INFO - __main__ - Step 75940: {'lr': 0.00025032371131472706, 'samples': 14580480, 'steps': 75939, 'loss/train': 1.420851707458496} 11/07/2021 07:56:26 - INFO - __main__ - Step 75941: {'lr': 0.0002503184045747559, 'samples': 14580672, 'steps': 75940, 'loss/train': 1.3912776708602905} 11/07/2021 07:56:27 - INFO - __main__ - Step 75942: {'lr': 0.0002503130978346413, 'samples': 14580864, 'steps': 75941, 'loss/train': 1.1473777294158936} 11/07/2021 07:56:27 - INFO - __main__ - Step 75943: {'lr': 0.00025030779109438565, 'samples': 14581056, 'steps': 75942, 'loss/train': 1.160209059715271} 11/07/2021 07:56:28 - INFO - __main__ - Step 75944: {'lr': 0.0002503024843539913, 'samples': 14581248, 'steps': 75943, 'loss/train': 1.5173412561416626} 11/07/2021 07:56:29 - INFO - __main__ - Step 75945: {'lr': 0.00025029717761346074, 'samples': 14581440, 'steps': 75944, 'loss/train': 1.4906294345855713} 11/07/2021 07:56:29 - INFO - __main__ - Step 75946: {'lr': 0.0002502918708727962, 'samples': 14581632, 'steps': 75945, 'loss/train': 1.465821623802185} 11/07/2021 07:56:29 - INFO - __main__ - Step 75947: {'lr': 0.0002502865641320002, 'samples': 14581824, 'steps': 75946, 'loss/train': 1.5985748767852783} 11/07/2021 07:56:30 - INFO - __main__ - Step 75948: {'lr': 0.00025028125739107496, 'samples': 14582016, 'steps': 75947, 'loss/train': 1.12180495262146} 11/07/2021 07:56:30 - INFO - __main__ - Step 75949: {'lr': 0.00025027595065002306, 'samples': 14582208, 'steps': 75948, 'loss/train': 2.0632643699645996} 11/07/2021 07:56:30 - INFO - __main__ - Step 75950: {'lr': 0.00025027064390884684, 'samples': 14582400, 'steps': 75949, 'loss/train': 1.5303655862808228} 11/07/2021 07:56:31 - INFO - __main__ - Step 75951: {'lr': 0.0002502653371675487, 'samples': 14582592, 'steps': 75950, 'loss/train': 1.4610973596572876} 11/07/2021 07:56:32 - INFO - __main__ - Step 75952: {'lr': 0.000250260030426131, 'samples': 14582784, 'steps': 75951, 'loss/train': 1.4492267370224} 11/07/2021 07:56:32 - INFO - __main__ - Step 75953: {'lr': 0.0002502547236845961, 'samples': 14582976, 'steps': 75952, 'loss/train': 1.5012365579605103} 11/07/2021 07:56:32 - INFO - __main__ - Step 75954: {'lr': 0.00025024941694294634, 'samples': 14583168, 'steps': 75953, 'loss/train': 1.5756441354751587} 11/07/2021 07:56:33 - INFO - __main__ - Step 75955: {'lr': 0.00025024411020118433, 'samples': 14583360, 'steps': 75954, 'loss/train': 1.4125357866287231} 11/07/2021 07:56:34 - INFO - __main__ - Step 75956: {'lr': 0.0002502388034593122, 'samples': 14583552, 'steps': 75955, 'loss/train': 1.4391905069351196} 11/07/2021 07:56:34 - INFO - __main__ - Step 75957: {'lr': 0.00025023349671733256, 'samples': 14583744, 'steps': 75956, 'loss/train': 1.6384336948394775} 11/07/2021 07:56:34 - INFO - __main__ - Step 75958: {'lr': 0.0002502281899752478, 'samples': 14583936, 'steps': 75957, 'loss/train': 1.4057897329330444} 11/07/2021 07:56:35 - INFO - __main__ - Step 75959: {'lr': 0.00025022288323306, 'samples': 14584128, 'steps': 75958, 'loss/train': 1.2502776384353638} 11/07/2021 07:56:35 - INFO - __main__ - Step 75960: {'lr': 0.0002502175764907719, 'samples': 14584320, 'steps': 75959, 'loss/train': 1.0189470052719116} 11/07/2021 07:56:36 - INFO - __main__ - Step 75961: {'lr': 0.0002502122697483858, 'samples': 14584512, 'steps': 75960, 'loss/train': 1.2674895524978638} 11/07/2021 07:56:37 - INFO - __main__ - Step 75962: {'lr': 0.00025020696300590397, 'samples': 14584704, 'steps': 75961, 'loss/train': 1.4862141609191895} 11/07/2021 07:56:37 - INFO - __main__ - Step 75963: {'lr': 0.0002502016562633289, 'samples': 14584896, 'steps': 75962, 'loss/train': 0.549892008304596} 11/07/2021 07:56:37 - INFO - __main__ - Step 75964: {'lr': 0.000250196349520663, 'samples': 14585088, 'steps': 75963, 'loss/train': 1.451192021369934} 11/07/2021 07:56:38 - INFO - __main__ - Step 75965: {'lr': 0.0002501910427779087, 'samples': 14585280, 'steps': 75964, 'loss/train': 1.5425066947937012} 11/07/2021 07:56:39 - INFO - __main__ - Step 75966: {'lr': 0.00025018573603506817, 'samples': 14585472, 'steps': 75965, 'loss/train': 1.3007954359054565} 11/07/2021 07:56:39 - INFO - __main__ - Step 75967: {'lr': 0.000250180429292144, 'samples': 14585664, 'steps': 75966, 'loss/train': 1.4663282632827759} 11/07/2021 07:56:39 - INFO - __main__ - Step 75968: {'lr': 0.00025017512254913853, 'samples': 14585856, 'steps': 75967, 'loss/train': 1.598353385925293} 11/07/2021 07:56:40 - INFO - __main__ - Step 75969: {'lr': 0.0002501698158060542, 'samples': 14586048, 'steps': 75968, 'loss/train': 1.2132710218429565} 11/07/2021 07:56:40 - INFO - __main__ - Step 75970: {'lr': 0.0002501645090628933, 'samples': 14586240, 'steps': 75969, 'loss/train': 1.6770910024642944} 11/07/2021 07:56:41 - INFO - __main__ - Step 75971: {'lr': 0.00025015920231965833, 'samples': 14586432, 'steps': 75970, 'loss/train': 1.4955878257751465} 11/07/2021 07:56:42 - INFO - __main__ - Step 75972: {'lr': 0.0002501538955763516, 'samples': 14586624, 'steps': 75971, 'loss/train': 1.587708592414856} 11/07/2021 07:56:42 - INFO - __main__ - Step 75973: {'lr': 0.00025014858883297555, 'samples': 14586816, 'steps': 75972, 'loss/train': 1.5014097690582275} 11/07/2021 07:56:42 - INFO - __main__ - Step 75974: {'lr': 0.0002501432820895325, 'samples': 14587008, 'steps': 75973, 'loss/train': 1.5521570444107056} 11/07/2021 07:56:43 - INFO - __main__ - Step 75975: {'lr': 0.0002501379753460249, 'samples': 14587200, 'steps': 75974, 'loss/train': 1.5219438076019287} 11/07/2021 07:56:43 - INFO - __main__ - Step 75976: {'lr': 0.0002501326686024551, 'samples': 14587392, 'steps': 75975, 'loss/train': 1.3734158277511597} 11/07/2021 07:56:44 - INFO - __main__ - Step 75977: {'lr': 0.00025012736185882556, 'samples': 14587584, 'steps': 75976, 'loss/train': 1.1997487545013428} 11/07/2021 07:56:44 - INFO - __main__ - Step 75978: {'lr': 0.00025012205511513866, 'samples': 14587776, 'steps': 75977, 'loss/train': 1.1447240114212036} 11/07/2021 07:56:45 - INFO - __main__ - Step 75979: {'lr': 0.00025011674837139674, 'samples': 14587968, 'steps': 75978, 'loss/train': 1.53408682346344} 11/07/2021 07:56:45 - INFO - __main__ - Step 75980: {'lr': 0.0002501114416276022, 'samples': 14588160, 'steps': 75979, 'loss/train': 0.8043447732925415} 11/07/2021 07:56:45 - INFO - __main__ - Step 75981: {'lr': 0.0002501061348837574, 'samples': 14588352, 'steps': 75980, 'loss/train': 1.4965687990188599} 11/07/2021 07:56:46 - INFO - __main__ - Step 75982: {'lr': 0.00025010082813986485, 'samples': 14588544, 'steps': 75981, 'loss/train': 1.146720051765442} 11/07/2021 07:56:47 - INFO - __main__ - Step 75983: {'lr': 0.00025009552139592685, 'samples': 14588736, 'steps': 75982, 'loss/train': 1.096457839012146} 11/07/2021 07:56:47 - INFO - __main__ - Step 75984: {'lr': 0.0002500902146519458, 'samples': 14588928, 'steps': 75983, 'loss/train': 1.3978060483932495} 11/07/2021 07:56:47 - INFO - __main__ - Step 75985: {'lr': 0.00025008490790792416, 'samples': 14589120, 'steps': 75984, 'loss/train': 1.4430311918258667} 11/07/2021 07:56:48 - INFO - __main__ - Step 75986: {'lr': 0.00025007960116386424, 'samples': 14589312, 'steps': 75985, 'loss/train': 1.6316848993301392} 11/07/2021 07:56:49 - INFO - __main__ - Step 75987: {'lr': 0.00025007429441976844, 'samples': 14589504, 'steps': 75986, 'loss/train': 1.3493390083312988} 11/07/2021 07:56:49 - INFO - __main__ - Step 75988: {'lr': 0.00025006898767563913, 'samples': 14589696, 'steps': 75987, 'loss/train': 1.4021546840667725} 11/07/2021 07:56:50 - INFO - __main__ - Step 75989: {'lr': 0.00025006368093147876, 'samples': 14589888, 'steps': 75988, 'loss/train': 1.524061679840088} 11/07/2021 07:56:50 - INFO - __main__ - Step 75990: {'lr': 0.00025005837418728966, 'samples': 14590080, 'steps': 75989, 'loss/train': 1.5440903902053833} 11/07/2021 07:56:50 - INFO - __main__ - Step 75991: {'lr': 0.0002500530674430743, 'samples': 14590272, 'steps': 75990, 'loss/train': 1.4802489280700684} 11/07/2021 07:56:52 - INFO - __main__ - Step 75992: {'lr': 0.00025004776069883507, 'samples': 14590464, 'steps': 75991, 'loss/train': 1.1837457418441772} 11/07/2021 07:56:52 - INFO - __main__ - Step 75993: {'lr': 0.00025004245395457425, 'samples': 14590656, 'steps': 75992, 'loss/train': 1.5116360187530518} 11/07/2021 07:56:52 - INFO - __main__ - Step 75994: {'lr': 0.0002500371472102943, 'samples': 14590848, 'steps': 75993, 'loss/train': 1.1807961463928223} 11/07/2021 07:56:53 - INFO - __main__ - Step 75995: {'lr': 0.00025003184046599764, 'samples': 14591040, 'steps': 75994, 'loss/train': 1.4021614789962769} 11/07/2021 07:56:53 - INFO - __main__ - Step 75996: {'lr': 0.0002500265337216866, 'samples': 14591232, 'steps': 75995, 'loss/train': 1.3902008533477783} 11/07/2021 07:56:53 - INFO - __main__ - Step 75997: {'lr': 0.0002500212269773636, 'samples': 14591424, 'steps': 75996, 'loss/train': 1.502200722694397} 11/07/2021 07:56:54 - INFO - __main__ - Step 75998: {'lr': 0.0002500159202330311, 'samples': 14591616, 'steps': 75997, 'loss/train': 0.6292885541915894} 11/07/2021 07:56:55 - INFO - __main__ - Step 75999: {'lr': 0.00025001061348869143, 'samples': 14591808, 'steps': 75998, 'loss/train': 1.0220848321914673} 11/07/2021 07:56:55 - INFO - __main__ - Step 76000: {'lr': 0.0002500053067443469, 'samples': 14592000, 'steps': 75999, 'loss/train': 1.1647452116012573} 11/07/2021 07:56:55 - INFO - __main__ - Step 76001: {'lr': 0.00025, 'samples': 14592192, 'steps': 76000, 'loss/train': 1.3442778587341309} 11/07/2021 07:56:56 - INFO - __main__ - Step 76002: {'lr': 0.00024999469325565315, 'samples': 14592384, 'steps': 76001, 'loss/train': 0.6235542297363281} 11/07/2021 07:56:57 - INFO - __main__ - Step 76003: {'lr': 0.00024998938651130863, 'samples': 14592576, 'steps': 76002, 'loss/train': 1.5866349935531616} 11/07/2021 07:56:57 - INFO - __main__ - Step 76004: {'lr': 0.00024998407976696893, 'samples': 14592768, 'steps': 76003, 'loss/train': 2.0218818187713623} 11/07/2021 07:56:58 - INFO - __main__ - Step 76005: {'lr': 0.00024997877302263634, 'samples': 14592960, 'steps': 76004, 'loss/train': 1.6040161848068237} 11/07/2021 07:56:58 - INFO - __main__ - Step 76006: {'lr': 0.0002499734662783134, 'samples': 14593152, 'steps': 76005, 'loss/train': 1.3627349138259888} 11/07/2021 07:56:58 - INFO - __main__ - Step 76007: {'lr': 0.00024996815953400237, 'samples': 14593344, 'steps': 76006, 'loss/train': 1.6809355020523071} 11/07/2021 07:56:59 - INFO - __main__ - Step 76008: {'lr': 0.00024996285278970577, 'samples': 14593536, 'steps': 76007, 'loss/train': 1.8275433778762817} 11/07/2021 07:57:00 - INFO - __main__ - Step 76009: {'lr': 0.00024995754604542587, 'samples': 14593728, 'steps': 76008, 'loss/train': 1.0879461765289307} 11/07/2021 07:57:00 - INFO - __main__ - Step 76010: {'lr': 0.00024995223930116505, 'samples': 14593920, 'steps': 76009, 'loss/train': 1.8186765909194946} 11/07/2021 07:57:00 - INFO - __main__ - Step 76011: {'lr': 0.00024994693255692575, 'samples': 14594112, 'steps': 76010, 'loss/train': 0.19194859266281128} 11/07/2021 07:57:01 - INFO - __main__ - Step 76012: {'lr': 0.00024994162581271035, 'samples': 14594304, 'steps': 76011, 'loss/train': 1.817427396774292} 11/07/2021 07:57:01 - INFO - __main__ - Step 76013: {'lr': 0.0002499363190685213, 'samples': 14594496, 'steps': 76012, 'loss/train': 0.6695510745048523} 11/07/2021 07:57:03 - INFO - __main__ - Step 76014: {'lr': 0.00024993101232436094, 'samples': 14594688, 'steps': 76013, 'loss/train': 1.529610276222229} 11/07/2021 07:57:03 - INFO - __main__ - Step 76015: {'lr': 0.00024992570558023163, 'samples': 14594880, 'steps': 76014, 'loss/train': 1.1115609407424927} 11/07/2021 07:57:03 - INFO - __main__ - Step 76016: {'lr': 0.0002499203988361358, 'samples': 14595072, 'steps': 76015, 'loss/train': 1.4883826971054077} 11/07/2021 07:57:04 - INFO - __main__ - Step 76017: {'lr': 0.00024991509209207585, 'samples': 14595264, 'steps': 76016, 'loss/train': 1.7612441778182983} 11/07/2021 07:57:04 - INFO - __main__ - Step 76018: {'lr': 0.00024990978534805415, 'samples': 14595456, 'steps': 76017, 'loss/train': 0.49820661544799805} 11/07/2021 07:57:04 - INFO - __main__ - Step 76019: {'lr': 0.0002499044786040731, 'samples': 14595648, 'steps': 76018, 'loss/train': 1.24971604347229} 11/07/2021 07:57:05 - INFO - __main__ - Step 76020: {'lr': 0.0002498991718601351, 'samples': 14595840, 'steps': 76019, 'loss/train': 1.2158485651016235} 11/07/2021 07:57:06 - INFO - __main__ - Step 76021: {'lr': 0.00024989386511624253, 'samples': 14596032, 'steps': 76020, 'loss/train': 1.4775220155715942} 11/07/2021 07:57:06 - INFO - __main__ - Step 76022: {'lr': 0.0002498885583723979, 'samples': 14596224, 'steps': 76021, 'loss/train': 1.6165755987167358} 11/07/2021 07:57:07 - INFO - __main__ - Step 76023: {'lr': 0.0002498832516286034, 'samples': 14596416, 'steps': 76022, 'loss/train': 1.5841991901397705} 11/07/2021 07:57:07 - INFO - __main__ - Step 76024: {'lr': 0.00024987794488486145, 'samples': 14596608, 'steps': 76023, 'loss/train': 1.2210148572921753} 11/07/2021 07:57:08 - INFO - __main__ - Step 76025: {'lr': 0.0002498726381411745, 'samples': 14596800, 'steps': 76024, 'loss/train': 1.4447492361068726} 11/07/2021 07:57:08 - INFO - __main__ - Step 76026: {'lr': 0.00024986733139754496, 'samples': 14596992, 'steps': 76025, 'loss/train': 1.3900177478790283} 11/07/2021 07:57:09 - INFO - __main__ - Step 76027: {'lr': 0.00024986202465397515, 'samples': 14597184, 'steps': 76026, 'loss/train': 1.4366132020950317} 11/07/2021 07:57:09 - INFO - __main__ - Step 76028: {'lr': 0.0002498567179104676, 'samples': 14597376, 'steps': 76027, 'loss/train': 1.4514410495758057} 11/07/2021 07:57:09 - INFO - __main__ - Step 76029: {'lr': 0.0002498514111670245, 'samples': 14597568, 'steps': 76028, 'loss/train': 1.714815378189087} 11/07/2021 07:57:10 - INFO - __main__ - Step 76030: {'lr': 0.00024984610442364846, 'samples': 14597760, 'steps': 76029, 'loss/train': 1.5074849128723145} 11/07/2021 07:57:11 - INFO - __main__ - Step 76031: {'lr': 0.0002498407976803417, 'samples': 14597952, 'steps': 76030, 'loss/train': 1.054606556892395} 11/07/2021 07:57:11 - INFO - __main__ - Step 76032: {'lr': 0.0002498354909371067, 'samples': 14598144, 'steps': 76031, 'loss/train': 1.3333684206008911} 11/07/2021 07:57:11 - INFO - __main__ - Step 76033: {'lr': 0.0002498301841939458, 'samples': 14598336, 'steps': 76032, 'loss/train': 1.0803929567337036} 11/07/2021 07:57:12 - INFO - __main__ - Step 76034: {'lr': 0.0002498248774508614, 'samples': 14598528, 'steps': 76033, 'loss/train': 1.731592059135437} 11/07/2021 07:57:13 - INFO - __main__ - Step 76035: {'lr': 0.00024981957070785606, 'samples': 14598720, 'steps': 76034, 'loss/train': 1.500646710395813} 11/07/2021 07:57:13 - INFO - __main__ - Step 76036: {'lr': 0.00024981426396493195, 'samples': 14598912, 'steps': 76035, 'loss/train': 1.434212327003479} 11/07/2021 07:57:13 - INFO - __main__ - Step 76037: {'lr': 0.00024980895722209143, 'samples': 14599104, 'steps': 76036, 'loss/train': 1.4967256784439087} 11/07/2021 07:57:14 - INFO - __main__ - Step 76038: {'lr': 0.00024980365047933705, 'samples': 14599296, 'steps': 76037, 'loss/train': 1.3708903789520264} 11/07/2021 07:57:14 - INFO - __main__ - Step 76039: {'lr': 0.00024979834373667115, 'samples': 14599488, 'steps': 76038, 'loss/train': 1.2811964750289917} 11/07/2021 07:57:15 - INFO - __main__ - Step 76040: {'lr': 0.00024979303699409604, 'samples': 14599680, 'steps': 76039, 'loss/train': 1.2950509786605835} 11/07/2021 07:57:15 - INFO - __main__ - Step 76041: {'lr': 0.0002497877302516143, 'samples': 14599872, 'steps': 76040, 'loss/train': 1.6807739734649658} 11/07/2021 07:57:16 - INFO - __main__ - Step 76042: {'lr': 0.0002497824235092281, 'samples': 14600064, 'steps': 76041, 'loss/train': 1.0309079885482788} 11/07/2021 07:57:16 - INFO - __main__ - Step 76043: {'lr': 0.00024977711676694, 'samples': 14600256, 'steps': 76042, 'loss/train': 1.5983768701553345} 11/07/2021 07:57:17 - INFO - __main__ - Step 76044: {'lr': 0.0002497718100247523, 'samples': 14600448, 'steps': 76043, 'loss/train': 1.0588738918304443} 11/07/2021 07:57:17 - INFO - __main__ - Step 76045: {'lr': 0.00024976650328266745, 'samples': 14600640, 'steps': 76044, 'loss/train': 1.1014389991760254} 11/07/2021 07:57:18 - INFO - __main__ - Step 76046: {'lr': 0.00024976119654068773, 'samples': 14600832, 'steps': 76045, 'loss/train': 1.3700790405273438} 11/07/2021 07:57:18 - INFO - __main__ - Step 76047: {'lr': 0.00024975588979881573, 'samples': 14601024, 'steps': 76046, 'loss/train': 1.008915901184082} 11/07/2021 07:57:19 - INFO - __main__ - Step 76048: {'lr': 0.0002497505830570537, 'samples': 14601216, 'steps': 76047, 'loss/train': 1.3404244184494019} 11/07/2021 07:57:19 - INFO - __main__ - Step 76049: {'lr': 0.00024974527631540404, 'samples': 14601408, 'steps': 76048, 'loss/train': 1.6699880361557007} 11/07/2021 07:57:19 - INFO - __main__ - Step 76050: {'lr': 0.0002497399695738691, 'samples': 14601600, 'steps': 76049, 'loss/train': 1.4687833786010742} 11/07/2021 07:57:20 - INFO - __main__ - Step 76051: {'lr': 0.0002497346628324514, 'samples': 14601792, 'steps': 76050, 'loss/train': 1.7123953104019165} 11/07/2021 07:57:21 - INFO - __main__ - Step 76052: {'lr': 0.00024972935609115317, 'samples': 14601984, 'steps': 76051, 'loss/train': 1.3949007987976074} 11/07/2021 07:57:21 - INFO - __main__ - Step 76053: {'lr': 0.00024972404934997695, 'samples': 14602176, 'steps': 76052, 'loss/train': 1.2532819509506226} 11/07/2021 07:57:21 - INFO - __main__ - Step 76054: {'lr': 0.00024971874260892505, 'samples': 14602368, 'steps': 76053, 'loss/train': 0.13495463132858276} 11/07/2021 07:57:22 - INFO - __main__ - Step 76055: {'lr': 0.0002497134358679999, 'samples': 14602560, 'steps': 76054, 'loss/train': 1.3517156839370728} 11/07/2021 07:57:23 - INFO - __main__ - Step 76056: {'lr': 0.0002497081291272038, 'samples': 14602752, 'steps': 76055, 'loss/train': 1.5699530839920044} 11/07/2021 07:57:23 - INFO - __main__ - Step 76057: {'lr': 0.00024970282238653927, 'samples': 14602944, 'steps': 76056, 'loss/train': 1.207828164100647} 11/07/2021 07:57:23 - INFO - __main__ - Step 76058: {'lr': 0.0002496975156460087, 'samples': 14603136, 'steps': 76057, 'loss/train': 1.5915114879608154} 11/07/2021 07:57:24 - INFO - __main__ - Step 76059: {'lr': 0.00024969220890561436, 'samples': 14603328, 'steps': 76058, 'loss/train': 1.5116307735443115} 11/07/2021 07:57:24 - INFO - __main__ - Step 76060: {'lr': 0.0002496869021653587, 'samples': 14603520, 'steps': 76059, 'loss/train': 1.5512595176696777} 11/07/2021 07:57:25 - INFO - __main__ - Step 76061: {'lr': 0.00024968159542524413, 'samples': 14603712, 'steps': 76060, 'loss/train': 1.556296944618225} 11/07/2021 07:57:26 - INFO - __main__ - Step 76062: {'lr': 0.00024967628868527306, 'samples': 14603904, 'steps': 76061, 'loss/train': 1.5545810461044312} 11/07/2021 07:57:26 - INFO - __main__ - Step 76063: {'lr': 0.0002496709819454478, 'samples': 14604096, 'steps': 76062, 'loss/train': 1.5528708696365356} 11/07/2021 07:57:26 - INFO - __main__ - Step 76064: {'lr': 0.00024966567520577084, 'samples': 14604288, 'steps': 76063, 'loss/train': 1.584981918334961} 11/07/2021 07:57:27 - INFO - __main__ - Step 76065: {'lr': 0.00024966036846624446, 'samples': 14604480, 'steps': 76064, 'loss/train': 0.6869581937789917} 11/07/2021 07:57:28 - INFO - __main__ - Step 76066: {'lr': 0.0002496550617268711, 'samples': 14604672, 'steps': 76065, 'loss/train': 0.4148690104484558} 11/07/2021 07:57:29 - INFO - __main__ - Step 76067: {'lr': 0.00024964975498765324, 'samples': 14604864, 'steps': 76066, 'loss/train': 1.897500991821289} 11/07/2021 07:57:29 - INFO - __main__ - Step 76068: {'lr': 0.0002496444482485931, 'samples': 14605056, 'steps': 76067, 'loss/train': 1.506227731704712} 11/07/2021 07:57:30 - INFO - __main__ - Step 76069: {'lr': 0.00024963914150969335, 'samples': 14605248, 'steps': 76068, 'loss/train': 1.1318538188934326} 11/07/2021 07:57:30 - INFO - __main__ - Step 76070: {'lr': 0.000249633834770956, 'samples': 14605440, 'steps': 76069, 'loss/train': 1.948724627494812} 11/07/2021 07:57:30 - INFO - __main__ - Step 76071: {'lr': 0.00024962852803238377, 'samples': 14605632, 'steps': 76070, 'loss/train': 1.2522996664047241} 11/07/2021 07:57:31 - INFO - __main__ - Step 76072: {'lr': 0.00024962322129397883, 'samples': 14605824, 'steps': 76071, 'loss/train': 0.972348690032959} 11/07/2021 07:57:32 - INFO - __main__ - Step 76073: {'lr': 0.0002496179145557437, 'samples': 14606016, 'steps': 76072, 'loss/train': 1.759624719619751} 11/07/2021 07:57:32 - INFO - __main__ - Step 76074: {'lr': 0.0002496126078176807, 'samples': 14606208, 'steps': 76073, 'loss/train': 1.6676180362701416} 11/07/2021 07:57:32 - INFO - __main__ - Step 76075: {'lr': 0.00024960730107979233, 'samples': 14606400, 'steps': 76074, 'loss/train': 1.9657405614852905} 11/07/2021 07:57:33 - INFO - __main__ - Step 76076: {'lr': 0.00024960199434208085, 'samples': 14606592, 'steps': 76075, 'loss/train': 1.553739309310913} 11/07/2021 07:57:33 - INFO - __main__ - Step 76077: {'lr': 0.0002495966876045487, 'samples': 14606784, 'steps': 76076, 'loss/train': 1.6068137884140015} 11/07/2021 07:57:34 - INFO - __main__ - Step 76078: {'lr': 0.00024959138086719826, 'samples': 14606976, 'steps': 76077, 'loss/train': 1.6484330892562866} 11/07/2021 07:57:34 - INFO - __main__ - Step 76079: {'lr': 0.00024958607413003197, 'samples': 14607168, 'steps': 76078, 'loss/train': 0.21074071526527405} 11/07/2021 07:57:35 - INFO - __main__ - Step 76080: {'lr': 0.0002495807673930522, 'samples': 14607360, 'steps': 76079, 'loss/train': 1.210264801979065} 11/07/2021 07:57:35 - INFO - __main__ - Step 76081: {'lr': 0.00024957546065626133, 'samples': 14607552, 'steps': 76080, 'loss/train': 1.2132234573364258} 11/07/2021 07:57:35 - INFO - __main__ - Step 76082: {'lr': 0.0002495701539196617, 'samples': 14607744, 'steps': 76081, 'loss/train': 2.165330648422241} 11/07/2021 07:57:36 - INFO - __main__ - Step 76083: {'lr': 0.0002495648471832558, 'samples': 14607936, 'steps': 76082, 'loss/train': 1.5264180898666382} 11/07/2021 07:57:37 - INFO - __main__ - Step 76084: {'lr': 0.00024955954044704595, 'samples': 14608128, 'steps': 76083, 'loss/train': 1.5279476642608643} 11/07/2021 07:57:37 - INFO - __main__ - Step 76085: {'lr': 0.00024955423371103455, 'samples': 14608320, 'steps': 76084, 'loss/train': 1.6028717756271362} 11/07/2021 07:57:37 - INFO - __main__ - Step 76086: {'lr': 0.000249548926975224, 'samples': 14608512, 'steps': 76085, 'loss/train': 1.637375831604004} 11/07/2021 07:57:38 - INFO - __main__ - Step 76087: {'lr': 0.00024954362023961674, 'samples': 14608704, 'steps': 76086, 'loss/train': 0.9501951336860657} 11/07/2021 07:57:39 - INFO - __main__ - Step 76088: {'lr': 0.0002495383135042151, 'samples': 14608896, 'steps': 76087, 'loss/train': 1.473802924156189} 11/07/2021 07:57:39 - INFO - __main__ - Step 76089: {'lr': 0.0002495330067690215, 'samples': 14609088, 'steps': 76088, 'loss/train': 1.728408694267273} 11/07/2021 07:57:39 - INFO - __main__ - Step 76090: {'lr': 0.00024952770003403837, 'samples': 14609280, 'steps': 76089, 'loss/train': 1.4852184057235718} 11/07/2021 07:57:40 - INFO - __main__ - Step 76091: {'lr': 0.000249522393299268, 'samples': 14609472, 'steps': 76090, 'loss/train': 1.5921896696090698} 11/07/2021 07:57:40 - INFO - __main__ - Step 76092: {'lr': 0.0002495170865647128, 'samples': 14609664, 'steps': 76091, 'loss/train': 1.484971046447754} 11/07/2021 07:57:41 - INFO - __main__ - Step 76093: {'lr': 0.0002495117798303752, 'samples': 14609856, 'steps': 76092, 'loss/train': 1.5992398262023926} 11/07/2021 07:57:42 - INFO - __main__ - Step 76094: {'lr': 0.0002495064730962576, 'samples': 14610048, 'steps': 76093, 'loss/train': 1.3608057498931885} 11/07/2021 07:57:42 - INFO - __main__ - Step 76095: {'lr': 0.00024950116636236237, 'samples': 14610240, 'steps': 76094, 'loss/train': 1.491039752960205} 11/07/2021 07:57:42 - INFO - __main__ - Step 76096: {'lr': 0.0002494958596286919, 'samples': 14610432, 'steps': 76095, 'loss/train': 1.7627124786376953} 11/07/2021 07:57:43 - INFO - __main__ - Step 76097: {'lr': 0.00024949055289524857, 'samples': 14610624, 'steps': 76096, 'loss/train': 1.3127518892288208} 11/07/2021 07:57:44 - INFO - __main__ - Step 76098: {'lr': 0.00024948524616203485, 'samples': 14610816, 'steps': 76097, 'loss/train': 1.2463401556015015} 11/07/2021 07:57:44 - INFO - __main__ - Step 76099: {'lr': 0.000249479939429053, 'samples': 14611008, 'steps': 76098, 'loss/train': 1.5079193115234375} 11/07/2021 07:57:44 - INFO - __main__ - Step 76100: {'lr': 0.0002494746326963055, 'samples': 14611200, 'steps': 76099, 'loss/train': 1.2058415412902832} 11/07/2021 07:57:45 - INFO - __main__ - Step 76101: {'lr': 0.00024946932596379474, 'samples': 14611392, 'steps': 76100, 'loss/train': 0.08260802179574966} 11/07/2021 07:57:45 - INFO - __main__ - Step 76102: {'lr': 0.0002494640192315232, 'samples': 14611584, 'steps': 76101, 'loss/train': 1.501927137374878} 11/07/2021 07:57:46 - INFO - __main__ - Step 76103: {'lr': 0.00024945871249949304, 'samples': 14611776, 'steps': 76102, 'loss/train': 1.2044010162353516} 11/07/2021 07:57:47 - INFO - __main__ - Step 76104: {'lr': 0.00024945340576770683, 'samples': 14611968, 'steps': 76103, 'loss/train': 1.3317184448242188} 11/07/2021 07:57:47 - INFO - __main__ - Step 76105: {'lr': 0.00024944809903616684, 'samples': 14612160, 'steps': 76104, 'loss/train': 1.492837905883789} 11/07/2021 07:57:48 - INFO - __main__ - Step 76106: {'lr': 0.0002494427923048755, 'samples': 14612352, 'steps': 76105, 'loss/train': 1.2194786071777344} 11/07/2021 07:57:48 - INFO - __main__ - Step 76107: {'lr': 0.00024943748557383535, 'samples': 14612544, 'steps': 76106, 'loss/train': 1.148874282836914} 11/07/2021 07:57:48 - INFO - __main__ - Step 76108: {'lr': 0.00024943217884304856, 'samples': 14612736, 'steps': 76107, 'loss/train': 1.541368842124939} 11/07/2021 07:57:49 - INFO - __main__ - Step 76109: {'lr': 0.00024942687211251764, 'samples': 14612928, 'steps': 76108, 'loss/train': 0.48051342368125916} 11/07/2021 07:57:50 - INFO - __main__ - Step 76110: {'lr': 0.000249421565382245, 'samples': 14613120, 'steps': 76109, 'loss/train': 1.5056025981903076} 11/07/2021 07:57:50 - INFO - __main__ - Step 76111: {'lr': 0.00024941625865223296, 'samples': 14613312, 'steps': 76110, 'loss/train': 1.427408218383789} 11/07/2021 07:57:50 - INFO - __main__ - Step 76112: {'lr': 0.00024941095192248397, 'samples': 14613504, 'steps': 76111, 'loss/train': 1.6209131479263306} 11/07/2021 07:57:51 - INFO - __main__ - Step 76113: {'lr': 0.0002494056451930004, 'samples': 14613696, 'steps': 76112, 'loss/train': 1.3596649169921875} 11/07/2021 07:57:52 - INFO - __main__ - Step 76114: {'lr': 0.0002494003384637846, 'samples': 14613888, 'steps': 76113, 'loss/train': 2.0451622009277344} 11/07/2021 07:57:52 - INFO - __main__ - Step 76115: {'lr': 0.000249395031734839, 'samples': 14614080, 'steps': 76114, 'loss/train': 1.648255467414856} 11/07/2021 07:57:52 - INFO - __main__ - Step 76116: {'lr': 0.00024938972500616614, 'samples': 14614272, 'steps': 76115, 'loss/train': 1.2789889574050903} 11/07/2021 07:57:53 - INFO - __main__ - Step 76117: {'lr': 0.0002493844182777681, 'samples': 14614464, 'steps': 76116, 'loss/train': 1.3347331285476685} 11/07/2021 07:57:53 - INFO - __main__ - Step 76118: {'lr': 0.0002493791115496475, 'samples': 14614656, 'steps': 76117, 'loss/train': 1.251667857170105} 11/07/2021 07:57:54 - INFO - __main__ - Step 76119: {'lr': 0.0002493738048218066, 'samples': 14614848, 'steps': 76118, 'loss/train': 1.6747586727142334} 11/07/2021 07:57:55 - INFO - __main__ - Step 76120: {'lr': 0.0002493684980942479, 'samples': 14615040, 'steps': 76119, 'loss/train': 1.4375171661376953} 11/07/2021 07:57:55 - INFO - __main__ - Step 76121: {'lr': 0.0002493631913669737, 'samples': 14615232, 'steps': 76120, 'loss/train': 1.297980546951294} 11/07/2021 07:57:55 - INFO - __main__ - Step 76122: {'lr': 0.00024935788463998645, 'samples': 14615424, 'steps': 76121, 'loss/train': 1.4775468111038208} 11/07/2021 07:57:56 - INFO - __main__ - Step 76123: {'lr': 0.0002493525779132885, 'samples': 14615616, 'steps': 76122, 'loss/train': 0.09369588643312454} 11/07/2021 07:57:57 - INFO - __main__ - Step 76124: {'lr': 0.0002493472711868823, 'samples': 14615808, 'steps': 76123, 'loss/train': 1.3258434534072876} 11/07/2021 07:57:57 - INFO - __main__ - Step 76125: {'lr': 0.00024934196446077024, 'samples': 14616000, 'steps': 76124, 'loss/train': 1.055733561515808} 11/07/2021 07:57:57 - INFO - __main__ - Step 76126: {'lr': 0.0002493366577349546, 'samples': 14616192, 'steps': 76125, 'loss/train': 0.07454677671194077} 11/07/2021 07:57:58 - INFO - __main__ - Step 76127: {'lr': 0.000249331351009438, 'samples': 14616384, 'steps': 76126, 'loss/train': 1.3524525165557861} 11/07/2021 07:57:58 - INFO - __main__ - Step 76128: {'lr': 0.0002493260442842225, 'samples': 14616576, 'steps': 76127, 'loss/train': 1.5633758306503296} 11/07/2021 07:57:59 - INFO - __main__ - Step 76129: {'lr': 0.0002493207375593109, 'samples': 14616768, 'steps': 76128, 'loss/train': 1.172372817993164} 11/07/2021 07:58:00 - INFO - __main__ - Step 76130: {'lr': 0.0002493154308347052, 'samples': 14616960, 'steps': 76129, 'loss/train': 0.9517183899879456} 11/07/2021 07:58:00 - INFO - __main__ - Step 76131: {'lr': 0.000249310124110408, 'samples': 14617152, 'steps': 76130, 'loss/train': 1.168681263923645} 11/07/2021 07:58:00 - INFO - __main__ - Step 76132: {'lr': 0.0002493048173864217, 'samples': 14617344, 'steps': 76131, 'loss/train': 1.1775059700012207} 11/07/2021 07:58:01 - INFO - __main__ - Step 76133: {'lr': 0.00024929951066274855, 'samples': 14617536, 'steps': 76132, 'loss/train': 1.2792195081710815} 11/07/2021 07:58:01 - INFO - __main__ - Step 76134: {'lr': 0.000249294203939391, 'samples': 14617728, 'steps': 76133, 'loss/train': 1.180377721786499} 11/07/2021 07:58:02 - INFO - __main__ - Step 76135: {'lr': 0.0002492888972163515, 'samples': 14617920, 'steps': 76134, 'loss/train': 1.1301960945129395} 11/07/2021 07:58:02 - INFO - __main__ - Step 76136: {'lr': 0.0002492835904936325, 'samples': 14618112, 'steps': 76135, 'loss/train': 1.1775020360946655} 11/07/2021 07:58:03 - INFO - __main__ - Step 76137: {'lr': 0.0002492782837712362, 'samples': 14618304, 'steps': 76136, 'loss/train': 1.5011383295059204} 11/07/2021 07:58:03 - INFO - __main__ - Step 76138: {'lr': 0.00024927297704916513, 'samples': 14618496, 'steps': 76137, 'loss/train': 2.0560286045074463} 11/07/2021 07:58:03 - INFO - __main__ - Step 76139: {'lr': 0.0002492676703274217, 'samples': 14618688, 'steps': 76138, 'loss/train': 1.3583039045333862} 11/07/2021 07:58:04 - INFO - __main__ - Step 76140: {'lr': 0.00024926236360600814, 'samples': 14618880, 'steps': 76139, 'loss/train': 1.244752287864685} 11/07/2021 07:58:05 - INFO - __main__ - Step 76141: {'lr': 0.000249257056884927, 'samples': 14619072, 'steps': 76140, 'loss/train': 1.7779114246368408} 11/07/2021 07:58:05 - INFO - __main__ - Step 76142: {'lr': 0.0002492517501641806, 'samples': 14619264, 'steps': 76141, 'loss/train': 1.4009606838226318} 11/07/2021 07:58:05 - INFO - __main__ - Step 76143: {'lr': 0.00024924644344377145, 'samples': 14619456, 'steps': 76142, 'loss/train': 1.4015616178512573} 11/07/2021 07:58:06 - INFO - __main__ - Step 76144: {'lr': 0.0002492411367237018, 'samples': 14619648, 'steps': 76143, 'loss/train': 1.6415237188339233} 11/07/2021 07:58:07 - INFO - __main__ - Step 76145: {'lr': 0.000249235830003974, 'samples': 14619840, 'steps': 76144, 'loss/train': 1.6176866292953491} 11/07/2021 07:58:07 - INFO - __main__ - Step 76146: {'lr': 0.0002492305232845906, 'samples': 14620032, 'steps': 76145, 'loss/train': 1.1348201036453247} 11/07/2021 07:58:07 - INFO - __main__ - Step 76147: {'lr': 0.00024922521656555385, 'samples': 14620224, 'steps': 76146, 'loss/train': 1.1714402437210083} 11/07/2021 07:58:08 - INFO - __main__ - Step 76148: {'lr': 0.00024921990984686626, 'samples': 14620416, 'steps': 76147, 'loss/train': 0.7236326336860657} 11/07/2021 07:58:08 - INFO - __main__ - Step 76149: {'lr': 0.0002492146031285301, 'samples': 14620608, 'steps': 76148, 'loss/train': 1.334855079650879} 11/07/2021 07:58:09 - INFO - __main__ - Step 76150: {'lr': 0.00024920929641054787, 'samples': 14620800, 'steps': 76149, 'loss/train': 1.377122402191162} 11/07/2021 07:58:09 - INFO - __main__ - Step 76151: {'lr': 0.00024920398969292194, 'samples': 14620992, 'steps': 76150, 'loss/train': 1.576151728630066} 11/07/2021 07:58:10 - INFO - __main__ - Step 76152: {'lr': 0.00024919868297565466, 'samples': 14621184, 'steps': 76151, 'loss/train': 1.5308481454849243} 11/07/2021 07:58:10 - INFO - __main__ - Step 76153: {'lr': 0.0002491933762587484, 'samples': 14621376, 'steps': 76152, 'loss/train': 1.1985853910446167} 11/07/2021 07:58:11 - INFO - __main__ - Step 76154: {'lr': 0.00024918806954220567, 'samples': 14621568, 'steps': 76153, 'loss/train': 1.3753303289413452} 11/07/2021 07:58:12 - INFO - __main__ - Step 76155: {'lr': 0.0002491827628260287, 'samples': 14621760, 'steps': 76154, 'loss/train': 1.6378319263458252} 11/07/2021 07:58:12 - INFO - __main__ - Step 76156: {'lr': 0.0002491774561102201, 'samples': 14621952, 'steps': 76155, 'loss/train': 0.5102927088737488} 11/07/2021 07:58:12 - INFO - __main__ - Step 76157: {'lr': 0.00024917214939478206, 'samples': 14622144, 'steps': 76156, 'loss/train': 1.6565346717834473} 11/07/2021 07:58:13 - INFO - __main__ - Step 76158: {'lr': 0.000249166842679717, 'samples': 14622336, 'steps': 76157, 'loss/train': 1.5469129085540771} 11/07/2021 07:58:13 - INFO - __main__ - Step 76159: {'lr': 0.0002491615359650274, 'samples': 14622528, 'steps': 76158, 'loss/train': 1.7013723850250244} 11/07/2021 07:58:14 - INFO - __main__ - Step 76160: {'lr': 0.0002491562292507155, 'samples': 14622720, 'steps': 76159, 'loss/train': 1.3401707410812378} 11/07/2021 07:58:14 - INFO - __main__ - Step 76161: {'lr': 0.00024915092253678385, 'samples': 14622912, 'steps': 76160, 'loss/train': 1.6660205125808716} 11/07/2021 07:58:15 - INFO - __main__ - Step 76162: {'lr': 0.0002491456158232348, 'samples': 14623104, 'steps': 76161, 'loss/train': 1.2889145612716675} 11/07/2021 07:58:15 - INFO - __main__ - Step 76163: {'lr': 0.00024914030911007073, 'samples': 14623296, 'steps': 76162, 'loss/train': 1.421042799949646} 11/07/2021 07:58:16 - INFO - __main__ - Step 76164: {'lr': 0.00024913500239729394, 'samples': 14623488, 'steps': 76163, 'loss/train': 1.1639128923416138} 11/07/2021 07:58:17 - INFO - __main__ - Step 76165: {'lr': 0.000249129695684907, 'samples': 14623680, 'steps': 76164, 'loss/train': 1.6027281284332275} 11/07/2021 07:58:17 - INFO - __main__ - Step 76166: {'lr': 0.00024912438897291213, 'samples': 14623872, 'steps': 76165, 'loss/train': 1.6341381072998047} 11/07/2021 07:58:17 - INFO - __main__ - Step 76167: {'lr': 0.0002491190822613118, 'samples': 14624064, 'steps': 76166, 'loss/train': 1.471014380455017} 11/07/2021 07:58:18 - INFO - __main__ - Step 76168: {'lr': 0.0002491137755501085, 'samples': 14624256, 'steps': 76167, 'loss/train': 1.1312371492385864} 11/07/2021 07:58:18 - INFO - __main__ - Step 76169: {'lr': 0.0002491084688393044, 'samples': 14624448, 'steps': 76168, 'loss/train': 1.3793246746063232} 11/07/2021 07:58:19 - INFO - __main__ - Step 76170: {'lr': 0.0002491031621289022, 'samples': 14624640, 'steps': 76169, 'loss/train': 1.6201661825180054} 11/07/2021 07:58:20 - INFO - __main__ - Step 76171: {'lr': 0.00024909785541890394, 'samples': 14624832, 'steps': 76170, 'loss/train': 1.689718246459961} 11/07/2021 07:58:20 - INFO - __main__ - Step 76172: {'lr': 0.0002490925487093122, 'samples': 14625024, 'steps': 76171, 'loss/train': 1.4406282901763916} 11/07/2021 07:58:20 - INFO - __main__ - Step 76173: {'lr': 0.0002490872420001293, 'samples': 14625216, 'steps': 76172, 'loss/train': 2.065009117126465} 11/07/2021 07:58:21 - INFO - __main__ - Step 76174: {'lr': 0.0002490819352913577, 'samples': 14625408, 'steps': 76173, 'loss/train': 1.4048024415969849} 11/07/2021 07:58:21 - INFO - __main__ - Step 76175: {'lr': 0.00024907662858299976, 'samples': 14625600, 'steps': 76174, 'loss/train': 1.2271267175674438} 11/07/2021 07:58:22 - INFO - __main__ - Step 76176: {'lr': 0.0002490713218750579, 'samples': 14625792, 'steps': 76175, 'loss/train': 2.231947183609009} 11/07/2021 07:58:22 - INFO - __main__ - Step 76177: {'lr': 0.00024906601516753454, 'samples': 14625984, 'steps': 76176, 'loss/train': 1.4816296100616455} 11/07/2021 07:58:23 - INFO - __main__ - Step 76178: {'lr': 0.0002490607084604319, 'samples': 14626176, 'steps': 76177, 'loss/train': 1.3932490348815918} 11/07/2021 07:58:23 - INFO - __main__ - Step 76179: {'lr': 0.0002490554017537526, 'samples': 14626368, 'steps': 76178, 'loss/train': 1.3691521883010864} 11/07/2021 07:58:23 - INFO - __main__ - Step 76180: {'lr': 0.00024905009504749885, 'samples': 14626560, 'steps': 76179, 'loss/train': 1.1980735063552856} 11/07/2021 07:58:24 - INFO - __main__ - Step 76181: {'lr': 0.00024904478834167316, 'samples': 14626752, 'steps': 76180, 'loss/train': 1.4226038455963135} 11/07/2021 07:58:25 - INFO - __main__ - Step 76182: {'lr': 0.0002490394816362779, 'samples': 14626944, 'steps': 76181, 'loss/train': 2.0149929523468018} 11/07/2021 07:58:25 - INFO - __main__ - Step 76183: {'lr': 0.0002490341749313154, 'samples': 14627136, 'steps': 76182, 'loss/train': 2.066580295562744} 11/07/2021 07:58:25 - INFO - __main__ - Step 76184: {'lr': 0.0002490288682267881, 'samples': 14627328, 'steps': 76183, 'loss/train': 1.4672654867172241} 11/07/2021 07:58:26 - INFO - __main__ - Step 76185: {'lr': 0.0002490235615226983, 'samples': 14627520, 'steps': 76184, 'loss/train': 1.095548391342163} 11/07/2021 07:58:27 - INFO - __main__ - Step 76186: {'lr': 0.0002490182548190485, 'samples': 14627712, 'steps': 76185, 'loss/train': 1.308817744255066} 11/07/2021 07:58:27 - INFO - __main__ - Step 76187: {'lr': 0.0002490129481158411, 'samples': 14627904, 'steps': 76186, 'loss/train': 0.5969200730323792} 11/07/2021 07:58:28 - INFO - __main__ - Step 76188: {'lr': 0.0002490076414130784, 'samples': 14628096, 'steps': 76187, 'loss/train': 1.1471903324127197} 11/07/2021 07:58:28 - INFO - __main__ - Step 76189: {'lr': 0.0002490023347107629, 'samples': 14628288, 'steps': 76188, 'loss/train': 1.5689789056777954} 11/07/2021 07:58:28 - INFO - __main__ - Step 76190: {'lr': 0.0002489970280088969, 'samples': 14628480, 'steps': 76189, 'loss/train': 1.3649770021438599} 11/07/2021 07:58:29 - INFO - __main__ - Step 76191: {'lr': 0.0002489917213074828, 'samples': 14628672, 'steps': 76190, 'loss/train': 1.4588379859924316} 11/07/2021 07:58:30 - INFO - __main__ - Step 76192: {'lr': 0.0002489864146065231, 'samples': 14628864, 'steps': 76191, 'loss/train': 1.2918556928634644} 11/07/2021 07:58:30 - INFO - __main__ - Step 76193: {'lr': 0.00024898110790602, 'samples': 14629056, 'steps': 76192, 'loss/train': 0.4964410960674286} 11/07/2021 07:58:30 - INFO - __main__ - Step 76194: {'lr': 0.0002489758012059761, 'samples': 14629248, 'steps': 76193, 'loss/train': 1.1895486116409302} 11/07/2021 07:58:31 - INFO - __main__ - Step 76195: {'lr': 0.00024897049450639357, 'samples': 14629440, 'steps': 76194, 'loss/train': 1.3312439918518066} 11/07/2021 07:58:32 - INFO - __main__ - Step 76196: {'lr': 0.000248965187807275, 'samples': 14629632, 'steps': 76195, 'loss/train': 1.2490971088409424} 11/07/2021 07:58:32 - INFO - __main__ - Step 76197: {'lr': 0.00024895988110862274, 'samples': 14629824, 'steps': 76196, 'loss/train': 1.3451340198516846} 11/07/2021 07:58:32 - INFO - __main__ - Step 76198: {'lr': 0.00024895457441043904, 'samples': 14630016, 'steps': 76197, 'loss/train': 1.570677638053894} 11/07/2021 07:58:33 - INFO - __main__ - Step 76199: {'lr': 0.00024894926771272644, 'samples': 14630208, 'steps': 76198, 'loss/train': 1.2298223972320557} 11/07/2021 07:58:33 - INFO - __main__ - Step 76200: {'lr': 0.0002489439610154873, 'samples': 14630400, 'steps': 76199, 'loss/train': 1.2966794967651367} 11/07/2021 07:58:34 - INFO - __main__ - Step 76201: {'lr': 0.00024893865431872397, 'samples': 14630592, 'steps': 76200, 'loss/train': 1.4168918132781982} 11/07/2021 07:58:34 - INFO - __main__ - Step 76202: {'lr': 0.0002489333476224388, 'samples': 14630784, 'steps': 76201, 'loss/train': 0.9236571192741394} 11/07/2021 07:58:35 - INFO - __main__ - Step 76203: {'lr': 0.0002489280409266344, 'samples': 14630976, 'steps': 76202, 'loss/train': 1.7898688316345215} 11/07/2021 07:58:35 - INFO - __main__ - Step 76204: {'lr': 0.0002489227342313129, 'samples': 14631168, 'steps': 76203, 'loss/train': 1.4034485816955566} 11/07/2021 07:58:35 - INFO - __main__ - Step 76205: {'lr': 0.00024891742753647685, 'samples': 14631360, 'steps': 76204, 'loss/train': 1.000670313835144} 11/07/2021 07:58:36 - INFO - __main__ - Step 76206: {'lr': 0.00024891212084212857, 'samples': 14631552, 'steps': 76205, 'loss/train': 1.196594476699829} 11/07/2021 07:58:37 - INFO - __main__ - Step 76207: {'lr': 0.0002489068141482704, 'samples': 14631744, 'steps': 76206, 'loss/train': 1.4546853303909302} 11/07/2021 07:58:37 - INFO - __main__ - Step 76208: {'lr': 0.0002489015074549049, 'samples': 14631936, 'steps': 76207, 'loss/train': 1.518090009689331} 11/07/2021 07:58:38 - INFO - __main__ - Step 76209: {'lr': 0.0002488962007620343, 'samples': 14632128, 'steps': 76208, 'loss/train': 2.2520503997802734} 11/07/2021 07:58:38 - INFO - __main__ - Step 76210: {'lr': 0.00024889089406966117, 'samples': 14632320, 'steps': 76209, 'loss/train': 0.9822400212287903} 11/07/2021 07:58:38 - INFO - __main__ - Step 76211: {'lr': 0.0002488855873777877, 'samples': 14632512, 'steps': 76210, 'loss/train': 1.2907357215881348} 11/07/2021 07:58:39 - INFO - __main__ - Step 76212: {'lr': 0.00024888028068641637, 'samples': 14632704, 'steps': 76211, 'loss/train': 1.3304460048675537} 11/07/2021 07:58:40 - INFO - __main__ - Step 76213: {'lr': 0.0002488749739955495, 'samples': 14632896, 'steps': 76212, 'loss/train': 1.160390019416809} 11/07/2021 07:58:40 - INFO - __main__ - Step 76214: {'lr': 0.0002488696673051897, 'samples': 14633088, 'steps': 76213, 'loss/train': 1.9437386989593506} 11/07/2021 07:58:40 - INFO - __main__ - Step 76215: {'lr': 0.00024886436061533914, 'samples': 14633280, 'steps': 76214, 'loss/train': 1.0962886810302734} 11/07/2021 07:58:41 - INFO - __main__ - Step 76216: {'lr': 0.00024885905392600026, 'samples': 14633472, 'steps': 76215, 'loss/train': 1.4024581909179688} 11/07/2021 07:58:42 - INFO - __main__ - Step 76217: {'lr': 0.00024885374723717545, 'samples': 14633664, 'steps': 76216, 'loss/train': 1.3515055179595947} 11/07/2021 07:58:42 - INFO - __main__ - Step 76218: {'lr': 0.00024884844054886716, 'samples': 14633856, 'steps': 76217, 'loss/train': 1.1170822381973267} 11/07/2021 07:58:42 - INFO - __main__ - Step 76219: {'lr': 0.00024884313386107777, 'samples': 14634048, 'steps': 76218, 'loss/train': 1.2766796350479126} 11/07/2021 07:58:43 - INFO - __main__ - Step 76220: {'lr': 0.0002488378271738096, 'samples': 14634240, 'steps': 76219, 'loss/train': 1.647944450378418} 11/07/2021 07:58:43 - INFO - __main__ - Step 76221: {'lr': 0.0002488325204870651, 'samples': 14634432, 'steps': 76220, 'loss/train': 1.5331645011901855} 11/07/2021 07:58:45 - INFO - __main__ - Step 76222: {'lr': 0.0002488272138008466, 'samples': 14634624, 'steps': 76221, 'loss/train': 1.7774180173873901} 11/07/2021 07:58:45 - INFO - __main__ - Step 76223: {'lr': 0.0002488219071151567, 'samples': 14634816, 'steps': 76222, 'loss/train': 1.0503464937210083} 11/07/2021 07:58:46 - INFO - __main__ - Step 76224: {'lr': 0.0002488166004299975, 'samples': 14635008, 'steps': 76223, 'loss/train': 1.1545758247375488} 11/07/2021 07:58:46 - INFO - __main__ - Step 76225: {'lr': 0.0002488112937453716, 'samples': 14635200, 'steps': 76224, 'loss/train': 1.1912389993667603} 11/07/2021 07:58:46 - INFO - __main__ - Step 76226: {'lr': 0.00024880598706128124, 'samples': 14635392, 'steps': 76225, 'loss/train': 1.9311835765838623} 11/07/2021 07:58:47 - INFO - __main__ - Step 76227: {'lr': 0.0002488006803777289, 'samples': 14635584, 'steps': 76226, 'loss/train': 0.5436522364616394} 11/07/2021 07:58:48 - INFO - __main__ - Step 76228: {'lr': 0.00024879537369471694, 'samples': 14635776, 'steps': 76227, 'loss/train': 0.8401231169700623} 11/07/2021 07:58:48 - INFO - __main__ - Step 76229: {'lr': 0.0002487900670122478, 'samples': 14635968, 'steps': 76228, 'loss/train': 1.677695870399475} 11/07/2021 07:58:48 - INFO - __main__ - Step 76230: {'lr': 0.0002487847603303238, 'samples': 14636160, 'steps': 76229, 'loss/train': 1.3713219165802002} 11/07/2021 07:58:49 - INFO - __main__ - Step 76231: {'lr': 0.00024877945364894737, 'samples': 14636352, 'steps': 76230, 'loss/train': 1.2925119400024414} 11/07/2021 07:58:49 - INFO - __main__ - Step 76232: {'lr': 0.00024877414696812094, 'samples': 14636544, 'steps': 76231, 'loss/train': 1.2410993576049805} 11/07/2021 07:58:50 - INFO - __main__ - Step 76233: {'lr': 0.0002487688402878468, 'samples': 14636736, 'steps': 76232, 'loss/train': 0.9990208745002747} 11/07/2021 07:58:50 - INFO - __main__ - Step 76234: {'lr': 0.00024876353360812745, 'samples': 14636928, 'steps': 76233, 'loss/train': 1.407866358757019} 11/07/2021 07:58:51 - INFO - __main__ - Step 76235: {'lr': 0.0002487582269289652, 'samples': 14637120, 'steps': 76234, 'loss/train': 0.8666215538978577} 11/07/2021 07:58:51 - INFO - __main__ - Step 76236: {'lr': 0.0002487529202503625, 'samples': 14637312, 'steps': 76235, 'loss/train': 1.4352829456329346} 11/07/2021 07:58:51 - INFO - __main__ - Step 76237: {'lr': 0.0002487476135723218, 'samples': 14637504, 'steps': 76236, 'loss/train': 1.1367716789245605} 11/07/2021 07:58:52 - INFO - __main__ - Step 76238: {'lr': 0.0002487423068948453, 'samples': 14637696, 'steps': 76237, 'loss/train': 5.0644426345825195} 11/07/2021 07:58:53 - INFO - __main__ - Step 76239: {'lr': 0.00024873700021793555, 'samples': 14637888, 'steps': 76238, 'loss/train': 1.309165120124817} 11/07/2021 07:58:53 - INFO - __main__ - Step 76240: {'lr': 0.00024873169354159484, 'samples': 14638080, 'steps': 76239, 'loss/train': 1.1172558069229126} 11/07/2021 07:58:53 - INFO - __main__ - Step 76241: {'lr': 0.0002487263868658256, 'samples': 14638272, 'steps': 76240, 'loss/train': 1.6025476455688477} 11/07/2021 07:58:54 - INFO - __main__ - Step 76242: {'lr': 0.00024872108019063027, 'samples': 14638464, 'steps': 76241, 'loss/train': 1.115061640739441} 11/07/2021 07:58:54 - INFO - __main__ - Step 76243: {'lr': 0.00024871577351601116, 'samples': 14638656, 'steps': 76242, 'loss/train': 0.837797999382019} 11/07/2021 07:58:55 - INFO - __main__ - Step 76244: {'lr': 0.0002487104668419707, 'samples': 14638848, 'steps': 76243, 'loss/train': 1.618603229522705} 11/07/2021 07:58:56 - INFO - __main__ - Step 76245: {'lr': 0.0002487051601685113, 'samples': 14639040, 'steps': 76244, 'loss/train': 0.8842902183532715} 11/07/2021 07:58:56 - INFO - __main__ - Step 76246: {'lr': 0.00024869985349563534, 'samples': 14639232, 'steps': 76245, 'loss/train': 1.291719913482666} 11/07/2021 07:58:56 - INFO - __main__ - Step 76247: {'lr': 0.0002486945468233452, 'samples': 14639424, 'steps': 76246, 'loss/train': 1.4228571653366089} 11/07/2021 07:58:57 - INFO - __main__ - Step 76248: {'lr': 0.00024868924015164327, 'samples': 14639616, 'steps': 76247, 'loss/train': 1.4155406951904297} 11/07/2021 07:58:58 - INFO - __main__ - Step 76249: {'lr': 0.000248683933480532, 'samples': 14639808, 'steps': 76248, 'loss/train': 1.370173454284668} 11/07/2021 07:58:58 - INFO - __main__ - Step 76250: {'lr': 0.0002486786268100138, 'samples': 14640000, 'steps': 76249, 'loss/train': 1.300243616104126} 11/07/2021 07:58:58 - INFO - __main__ - Step 76251: {'lr': 0.00024867332014009085, 'samples': 14640192, 'steps': 76250, 'loss/train': 1.2269190549850464} 11/07/2021 07:58:59 - INFO - __main__ - Step 76252: {'lr': 0.00024866801347076575, 'samples': 14640384, 'steps': 76251, 'loss/train': 1.4167512655258179} 11/07/2021 07:58:59 - INFO - __main__ - Step 76253: {'lr': 0.00024866270680204075, 'samples': 14640576, 'steps': 76252, 'loss/train': 0.5143240094184875} 11/07/2021 07:59:00 - INFO - __main__ - Step 76254: {'lr': 0.00024865740013391835, 'samples': 14640768, 'steps': 76253, 'loss/train': 1.356732964515686} 11/07/2021 07:59:01 - INFO - __main__ - Step 76255: {'lr': 0.00024865209346640094, 'samples': 14640960, 'steps': 76254, 'loss/train': 1.1405973434448242} 11/07/2021 07:59:01 - INFO - __main__ - Step 76256: {'lr': 0.0002486467867994908, 'samples': 14641152, 'steps': 76255, 'loss/train': 1.2723129987716675} 11/07/2021 07:59:01 - INFO - __main__ - Step 76257: {'lr': 0.00024864148013319044, 'samples': 14641344, 'steps': 76256, 'loss/train': 1.332780361175537} 11/07/2021 07:59:02 - INFO - __main__ - Step 76258: {'lr': 0.0002486361734675022, 'samples': 14641536, 'steps': 76257, 'loss/train': 0.10650795698165894} 11/07/2021 07:59:03 - INFO - __main__ - Step 76259: {'lr': 0.00024863086680242846, 'samples': 14641728, 'steps': 76258, 'loss/train': 1.0839641094207764} 11/07/2021 07:59:03 - INFO - __main__ - Step 76260: {'lr': 0.00024862556013797164, 'samples': 14641920, 'steps': 76259, 'loss/train': 1.210099697113037} 11/07/2021 07:59:03 - INFO - __main__ - Step 76261: {'lr': 0.00024862025347413417, 'samples': 14642112, 'steps': 76260, 'loss/train': 1.3567615747451782} 11/07/2021 07:59:04 - INFO - __main__ - Step 76262: {'lr': 0.0002486149468109183, 'samples': 14642304, 'steps': 76261, 'loss/train': 1.5943490266799927} 11/07/2021 07:59:04 - INFO - __main__ - Step 76263: {'lr': 0.0002486096401483266, 'samples': 14642496, 'steps': 76262, 'loss/train': 1.9876846075057983} 11/07/2021 07:59:04 - INFO - __main__ - Step 76264: {'lr': 0.00024860433348636144, 'samples': 14642688, 'steps': 76263, 'loss/train': 1.5635632276535034} 11/07/2021 07:59:05 - INFO - __main__ - Step 76265: {'lr': 0.00024859902682502507, 'samples': 14642880, 'steps': 76264, 'loss/train': 1.1976110935211182} 11/07/2021 07:59:06 - INFO - __main__ - Step 76266: {'lr': 0.00024859372016431997, 'samples': 14643072, 'steps': 76265, 'loss/train': 1.4164905548095703} 11/07/2021 07:59:06 - INFO - __main__ - Step 76267: {'lr': 0.0002485884135042485, 'samples': 14643264, 'steps': 76266, 'loss/train': 1.4977298974990845} 11/07/2021 07:59:06 - INFO - __main__ - Step 76268: {'lr': 0.000248583106844813, 'samples': 14643456, 'steps': 76267, 'loss/train': 1.625083327293396} 11/07/2021 07:59:07 - INFO - __main__ - Step 76269: {'lr': 0.000248577800186016, 'samples': 14643648, 'steps': 76268, 'loss/train': 1.5285018682479858} 11/07/2021 07:59:08 - INFO - __main__ - Step 76270: {'lr': 0.0002485724935278598, 'samples': 14643840, 'steps': 76269, 'loss/train': 1.1682522296905518} 11/07/2021 07:59:08 - INFO - __main__ - Step 76271: {'lr': 0.0002485671868703468, 'samples': 14644032, 'steps': 76270, 'loss/train': 1.438327431678772} 11/07/2021 07:59:08 - INFO - __main__ - Step 76272: {'lr': 0.0002485618802134794, 'samples': 14644224, 'steps': 76271, 'loss/train': 1.5102035999298096} 11/07/2021 07:59:09 - INFO - __main__ - Step 76273: {'lr': 0.00024855657355726005, 'samples': 14644416, 'steps': 76272, 'loss/train': 0.9706743955612183} 11/07/2021 07:59:09 - INFO - __main__ - Step 76274: {'lr': 0.00024855126690169103, 'samples': 14644608, 'steps': 76273, 'loss/train': 1.3331869840621948} 11/07/2021 07:59:10 - INFO - __main__ - Step 76275: {'lr': 0.00024854596024677486, 'samples': 14644800, 'steps': 76274, 'loss/train': 0.9603033065795898} 11/07/2021 07:59:10 - INFO - __main__ - Step 76276: {'lr': 0.00024854065359251377, 'samples': 14644992, 'steps': 76275, 'loss/train': 1.3761730194091797} 11/07/2021 07:59:11 - INFO - __main__ - Step 76277: {'lr': 0.0002485353469389104, 'samples': 14645184, 'steps': 76276, 'loss/train': 1.139607548713684} 11/07/2021 07:59:11 - INFO - __main__ - Step 76278: {'lr': 0.00024853004028596684, 'samples': 14645376, 'steps': 76277, 'loss/train': 1.59321129322052} 11/07/2021 07:59:11 - INFO - __main__ - Step 76279: {'lr': 0.0002485247336336856, 'samples': 14645568, 'steps': 76278, 'loss/train': 0.901130735874176} 11/07/2021 07:59:13 - INFO - __main__ - Step 76280: {'lr': 0.00024851942698206916, 'samples': 14645760, 'steps': 76279, 'loss/train': 1.443162202835083} 11/07/2021 07:59:13 - INFO - __main__ - Step 76281: {'lr': 0.0002485141203311198, 'samples': 14645952, 'steps': 76280, 'loss/train': 1.5169481039047241} 11/07/2021 07:59:13 - INFO - __main__ - Step 76282: {'lr': 0.00024850881368084, 'samples': 14646144, 'steps': 76281, 'loss/train': 1.2158576250076294} 11/07/2021 07:59:14 - INFO - __main__ - Step 76283: {'lr': 0.00024850350703123207, 'samples': 14646336, 'steps': 76282, 'loss/train': 1.510652780532837} 11/07/2021 07:59:14 - INFO - __main__ - Step 76284: {'lr': 0.0002484982003822984, 'samples': 14646528, 'steps': 76283, 'loss/train': 1.1032111644744873} 11/07/2021 07:59:15 - INFO - __main__ - Step 76285: {'lr': 0.0002484928937340415, 'samples': 14646720, 'steps': 76284, 'loss/train': 0.9914588928222656} 11/07/2021 07:59:15 - INFO - __main__ - Step 76286: {'lr': 0.0002484875870864636, 'samples': 14646912, 'steps': 76285, 'loss/train': 1.4682154655456543} 11/07/2021 07:59:16 - INFO - __main__ - Step 76287: {'lr': 0.0002484822804395672, 'samples': 14647104, 'steps': 76286, 'loss/train': 2.2338435649871826} 11/07/2021 07:59:16 - INFO - __main__ - Step 76288: {'lr': 0.0002484769737933547, 'samples': 14647296, 'steps': 76287, 'loss/train': 1.4601658582687378} 11/07/2021 07:59:16 - INFO - __main__ - Step 76289: {'lr': 0.0002484716671478284, 'samples': 14647488, 'steps': 76288, 'loss/train': 1.5890862941741943} 11/07/2021 07:59:17 - INFO - __main__ - Step 76290: {'lr': 0.00024846636050299077, 'samples': 14647680, 'steps': 76289, 'loss/train': 1.792888879776001} 11/07/2021 07:59:18 - INFO - __main__ - Step 76291: {'lr': 0.00024846105385884426, 'samples': 14647872, 'steps': 76290, 'loss/train': 1.5221519470214844} 11/07/2021 07:59:18 - INFO - __main__ - Step 76292: {'lr': 0.0002484557472153911, 'samples': 14648064, 'steps': 76291, 'loss/train': 1.22990882396698} 11/07/2021 07:59:19 - INFO - __main__ - Step 76293: {'lr': 0.00024845044057263376, 'samples': 14648256, 'steps': 76292, 'loss/train': 1.0949760675430298} 11/07/2021 07:59:19 - INFO - __main__ - Step 76294: {'lr': 0.00024844513393057455, 'samples': 14648448, 'steps': 76293, 'loss/train': 1.419012188911438} 11/07/2021 07:59:19 - INFO - __main__ - Step 76295: {'lr': 0.000248439827289216, 'samples': 14648640, 'steps': 76294, 'loss/train': 1.5554801225662231} 11/07/2021 07:59:20 - INFO - __main__ - Step 76296: {'lr': 0.00024843452064856047, 'samples': 14648832, 'steps': 76295, 'loss/train': 1.6784054040908813} 11/07/2021 07:59:21 - INFO - __main__ - Step 76297: {'lr': 0.00024842921400861025, 'samples': 14649024, 'steps': 76296, 'loss/train': 1.4448970556259155} 11/07/2021 07:59:21 - INFO - __main__ - Step 76298: {'lr': 0.00024842390736936785, 'samples': 14649216, 'steps': 76297, 'loss/train': 1.421246886253357} 11/07/2021 07:59:21 - INFO - __main__ - Step 76299: {'lr': 0.0002484186007308356, 'samples': 14649408, 'steps': 76298, 'loss/train': 1.671202301979065} 11/07/2021 07:59:22 - INFO - __main__ - Step 76300: {'lr': 0.0002484132940930159, 'samples': 14649600, 'steps': 76299, 'loss/train': 1.1606310606002808} 11/07/2021 07:59:23 - INFO - __main__ - Step 76301: {'lr': 0.00024840798745591117, 'samples': 14649792, 'steps': 76300, 'loss/train': 1.2821084260940552} 11/07/2021 07:59:23 - INFO - __main__ - Step 76302: {'lr': 0.00024840268081952375, 'samples': 14649984, 'steps': 76301, 'loss/train': 1.8107998371124268} 11/07/2021 07:59:23 - INFO - __main__ - Step 76303: {'lr': 0.0002483973741838561, 'samples': 14650176, 'steps': 76302, 'loss/train': 1.4946749210357666} 11/07/2021 07:59:24 - INFO - __main__ - Step 76304: {'lr': 0.0002483920675489106, 'samples': 14650368, 'steps': 76303, 'loss/train': 1.4578615427017212} 11/07/2021 07:59:24 - INFO - __main__ - Step 76305: {'lr': 0.0002483867609146895, 'samples': 14650560, 'steps': 76304, 'loss/train': 1.8013464212417603} 11/07/2021 07:59:25 - INFO - __main__ - Step 76306: {'lr': 0.0002483814542811954, 'samples': 14650752, 'steps': 76305, 'loss/train': 1.2567837238311768} 11/07/2021 07:59:25 - INFO - __main__ - Step 76307: {'lr': 0.0002483761476484305, 'samples': 14650944, 'steps': 76306, 'loss/train': 0.9212275147438049} 11/07/2021 07:59:26 - INFO - __main__ - Step 76308: {'lr': 0.0002483708410163973, 'samples': 14651136, 'steps': 76307, 'loss/train': 1.4320794343948364} 11/07/2021 07:59:26 - INFO - __main__ - Step 76309: {'lr': 0.0002483655343850982, 'samples': 14651328, 'steps': 76308, 'loss/train': 1.0826417207717896} 11/07/2021 07:59:26 - INFO - __main__ - Step 76310: {'lr': 0.00024836022775453554, 'samples': 14651520, 'steps': 76309, 'loss/train': 1.2783805131912231} 11/07/2021 07:59:27 - INFO - __main__ - Step 76311: {'lr': 0.00024835492112471177, 'samples': 14651712, 'steps': 76310, 'loss/train': 1.5874912738800049} 11/07/2021 07:59:28 - INFO - __main__ - Step 76312: {'lr': 0.0002483496144956292, 'samples': 14651904, 'steps': 76311, 'loss/train': 0.9286174774169922} 11/07/2021 07:59:28 - INFO - __main__ - Step 76313: {'lr': 0.0002483443078672903, 'samples': 14652096, 'steps': 76312, 'loss/train': 1.3494019508361816} 11/07/2021 07:59:29 - INFO - __main__ - Step 76314: {'lr': 0.00024833900123969745, 'samples': 14652288, 'steps': 76313, 'loss/train': 1.2973034381866455} 11/07/2021 07:59:29 - INFO - __main__ - Step 76315: {'lr': 0.00024833369461285296, 'samples': 14652480, 'steps': 76314, 'loss/train': 1.2763888835906982} 11/07/2021 07:59:30 - INFO - __main__ - Step 76316: {'lr': 0.0002483283879867594, 'samples': 14652672, 'steps': 76315, 'loss/train': 1.5674257278442383} 11/07/2021 07:59:30 - INFO - __main__ - Step 76317: {'lr': 0.000248323081361419, 'samples': 14652864, 'steps': 76316, 'loss/train': 0.37565064430236816} 11/07/2021 07:59:31 - INFO - __main__ - Step 76318: {'lr': 0.00024831777473683416, 'samples': 14653056, 'steps': 76317, 'loss/train': 1.6298624277114868} 11/07/2021 07:59:31 - INFO - __main__ - Step 76319: {'lr': 0.00024831246811300733, 'samples': 14653248, 'steps': 76318, 'loss/train': 1.5390558242797852} 11/07/2021 07:59:31 - INFO - __main__ - Step 76320: {'lr': 0.0002483071614899408, 'samples': 14653440, 'steps': 76319, 'loss/train': 1.1732182502746582} 11/07/2021 07:59:32 - INFO - __main__ - Step 76321: {'lr': 0.0002483018548676371, 'samples': 14653632, 'steps': 76320, 'loss/train': 1.5197813510894775} 11/07/2021 07:59:33 - INFO - __main__ - Step 76322: {'lr': 0.00024829654824609854, 'samples': 14653824, 'steps': 76321, 'loss/train': 1.2362163066864014} 11/07/2021 07:59:33 - INFO - __main__ - Step 76323: {'lr': 0.00024829124162532753, 'samples': 14654016, 'steps': 76322, 'loss/train': 1.4561065435409546} 11/07/2021 07:59:33 - INFO - __main__ - Step 76324: {'lr': 0.00024828593500532647, 'samples': 14654208, 'steps': 76323, 'loss/train': 1.533267617225647} 11/07/2021 07:59:34 - INFO - __main__ - Step 76325: {'lr': 0.00024828062838609774, 'samples': 14654400, 'steps': 76324, 'loss/train': 1.490100622177124} 11/07/2021 07:59:35 - INFO - __main__ - Step 76326: {'lr': 0.0002482753217676437, 'samples': 14654592, 'steps': 76325, 'loss/train': 0.9819637537002563} 11/07/2021 07:59:35 - INFO - __main__ - Step 76327: {'lr': 0.00024827001514996687, 'samples': 14654784, 'steps': 76326, 'loss/train': 1.4077465534210205} 11/07/2021 07:59:36 - INFO - __main__ - Step 76328: {'lr': 0.00024826470853306945, 'samples': 14654976, 'steps': 76327, 'loss/train': 1.7476680278778076} 11/07/2021 07:59:36 - INFO - __main__ - Step 76329: {'lr': 0.00024825940191695395, 'samples': 14655168, 'steps': 76328, 'loss/train': 1.4160255193710327} 11/07/2021 07:59:36 - INFO - __main__ - Step 76330: {'lr': 0.0002482540953016227, 'samples': 14655360, 'steps': 76329, 'loss/train': 1.3433990478515625} 11/07/2021 07:59:37 - INFO - __main__ - Step 76331: {'lr': 0.0002482487886870782, 'samples': 14655552, 'steps': 76330, 'loss/train': 1.5589566230773926} 11/07/2021 07:59:38 - INFO - __main__ - Step 76332: {'lr': 0.00024824348207332276, 'samples': 14655744, 'steps': 76331, 'loss/train': 1.414574384689331} 11/07/2021 07:59:38 - INFO - __main__ - Step 76333: {'lr': 0.00024823817546035877, 'samples': 14655936, 'steps': 76332, 'loss/train': 0.05834440886974335} 11/07/2021 07:59:38 - INFO - __main__ - Step 76334: {'lr': 0.0002482328688481886, 'samples': 14656128, 'steps': 76333, 'loss/train': 1.3033193349838257} 11/07/2021 07:59:39 - INFO - __main__ - Step 76335: {'lr': 0.0002482275622368147, 'samples': 14656320, 'steps': 76334, 'loss/train': 1.133558988571167} 11/07/2021 07:59:40 - INFO - __main__ - Step 76336: {'lr': 0.0002482222556262394, 'samples': 14656512, 'steps': 76335, 'loss/train': 1.6040339469909668} 11/07/2021 07:59:41 - INFO - __main__ - Step 76337: {'lr': 0.0002482169490164652, 'samples': 14656704, 'steps': 76336, 'loss/train': 1.7895164489746094} 11/07/2021 07:59:41 - INFO - __main__ - Step 76338: {'lr': 0.0002482116424074943, 'samples': 14656896, 'steps': 76337, 'loss/train': 1.3131802082061768} 11/07/2021 07:59:41 - INFO - __main__ - Step 76339: {'lr': 0.0002482063357993293, 'samples': 14657088, 'steps': 76338, 'loss/train': 1.7814159393310547} 11/07/2021 07:59:42 - INFO - __main__ - Step 76340: {'lr': 0.00024820102919197244, 'samples': 14657280, 'steps': 76339, 'loss/train': 0.1830153912305832} 11/07/2021 07:59:42 - INFO - __main__ - Step 76341: {'lr': 0.0002481957225854262, 'samples': 14657472, 'steps': 76340, 'loss/train': 0.21734528243541718} 11/07/2021 07:59:43 - INFO - __main__ - Step 76342: {'lr': 0.00024819041597969293, 'samples': 14657664, 'steps': 76341, 'loss/train': 1.3999935388565063} 11/07/2021 07:59:44 - INFO - __main__ - Step 76343: {'lr': 0.000248185109374775, 'samples': 14657856, 'steps': 76342, 'loss/train': 1.4772474765777588} 11/07/2021 07:59:44 - INFO - __main__ - Step 76344: {'lr': 0.0002481798027706749, 'samples': 14658048, 'steps': 76343, 'loss/train': 1.5476009845733643} 11/07/2021 07:59:44 - INFO - __main__ - Step 76345: {'lr': 0.0002481744961673949, 'samples': 14658240, 'steps': 76344, 'loss/train': 1.1479146480560303} 11/07/2021 07:59:45 - INFO - __main__ - Step 76346: {'lr': 0.0002481691895649375, 'samples': 14658432, 'steps': 76345, 'loss/train': 1.2010236978530884} 11/07/2021 07:59:46 - INFO - __main__ - Step 76347: {'lr': 0.000248163882963305, 'samples': 14658624, 'steps': 76346, 'loss/train': 0.8273810148239136} 11/07/2021 07:59:46 - INFO - __main__ - Step 76348: {'lr': 0.0002481585763624999, 'samples': 14658816, 'steps': 76347, 'loss/train': 1.4446707963943481} 11/07/2021 07:59:46 - INFO - __main__ - Step 76349: {'lr': 0.00024815326976252436, 'samples': 14659008, 'steps': 76348, 'loss/train': 1.6315653324127197} 11/07/2021 07:59:47 - INFO - __main__ - Step 76350: {'lr': 0.000248147963163381, 'samples': 14659200, 'steps': 76349, 'loss/train': 1.1169434785842896} 11/07/2021 07:59:47 - INFO - __main__ - Step 76351: {'lr': 0.00024814265656507214, 'samples': 14659392, 'steps': 76350, 'loss/train': 1.1316214799880981} 11/07/2021 07:59:48 - INFO - __main__ - Step 76352: {'lr': 0.0002481373499676002, 'samples': 14659584, 'steps': 76351, 'loss/train': 1.1426366567611694} 11/07/2021 07:59:49 - INFO - __main__ - Step 76353: {'lr': 0.0002481320433709675, 'samples': 14659776, 'steps': 76352, 'loss/train': 1.7141135931015015} 11/07/2021 07:59:49 - INFO - __main__ - Step 76354: {'lr': 0.0002481267367751765, 'samples': 14659968, 'steps': 76353, 'loss/train': 1.7945656776428223} 11/07/2021 07:59:49 - INFO - __main__ - Step 76355: {'lr': 0.0002481214301802295, 'samples': 14660160, 'steps': 76354, 'loss/train': 1.3089460134506226} 11/07/2021 07:59:50 - INFO - __main__ - Step 76356: {'lr': 0.000248116123586129, 'samples': 14660352, 'steps': 76355, 'loss/train': 4.291075229644775} 11/07/2021 07:59:50 - INFO - __main__ - Step 76357: {'lr': 0.0002481108169928774, 'samples': 14660544, 'steps': 76356, 'loss/train': 3.908785104751587} 11/07/2021 07:59:51 - INFO - __main__ - Step 76358: {'lr': 0.000248105510400477, 'samples': 14660736, 'steps': 76357, 'loss/train': 1.7429088354110718} 11/07/2021 07:59:51 - INFO - __main__ - Step 76359: {'lr': 0.0002481002038089303, 'samples': 14660928, 'steps': 76358, 'loss/train': 1.4764283895492554} 11/07/2021 07:59:52 - INFO - __main__ - Step 76360: {'lr': 0.0002480948972182395, 'samples': 14661120, 'steps': 76359, 'loss/train': 1.6600401401519775} 11/07/2021 07:59:52 - INFO - __main__ - Step 76361: {'lr': 0.0002480895906284072, 'samples': 14661312, 'steps': 76360, 'loss/train': 1.4087741374969482} 11/07/2021 07:59:52 - INFO - __main__ - Step 76362: {'lr': 0.0002480842840394356, 'samples': 14661504, 'steps': 76361, 'loss/train': 1.8324368000030518} 11/07/2021 07:59:53 - INFO - __main__ - Step 76363: {'lr': 0.0002480789774513272, 'samples': 14661696, 'steps': 76362, 'loss/train': 1.2178622484207153} 11/07/2021 07:59:54 - INFO - __main__ - Step 76364: {'lr': 0.00024807367086408447, 'samples': 14661888, 'steps': 76363, 'loss/train': 1.3268598318099976} 11/07/2021 07:59:54 - INFO - __main__ - Step 76365: {'lr': 0.00024806836427770967, 'samples': 14662080, 'steps': 76364, 'loss/train': 0.9592685103416443} 11/07/2021 07:59:54 - INFO - __main__ - Step 76366: {'lr': 0.0002480630576922052, 'samples': 14662272, 'steps': 76365, 'loss/train': 0.6530741453170776} 11/07/2021 07:59:55 - INFO - __main__ - Step 76367: {'lr': 0.0002480577511075735, 'samples': 14662464, 'steps': 76366, 'loss/train': 1.3975293636322021} 11/07/2021 07:59:56 - INFO - __main__ - Step 76368: {'lr': 0.00024805244452381697, 'samples': 14662656, 'steps': 76367, 'loss/train': 1.5363714694976807} 11/07/2021 07:59:56 - INFO - __main__ - Step 76369: {'lr': 0.00024804713794093796, 'samples': 14662848, 'steps': 76368, 'loss/train': 1.6320029497146606} 11/07/2021 07:59:57 - INFO - __main__ - Step 76370: {'lr': 0.0002480418313589389, 'samples': 14663040, 'steps': 76369, 'loss/train': 1.7676888704299927} 11/07/2021 07:59:57 - INFO - __main__ - Step 76371: {'lr': 0.00024803652477782225, 'samples': 14663232, 'steps': 76370, 'loss/train': 1.1656920909881592} 11/07/2021 07:59:58 - INFO - __main__ - Step 76372: {'lr': 0.0002480312181975902, 'samples': 14663424, 'steps': 76371, 'loss/train': 1.3241409063339233} 11/07/2021 07:59:58 - INFO - __main__ - Step 76373: {'lr': 0.00024802591161824527, 'samples': 14663616, 'steps': 76372, 'loss/train': 1.5082569122314453} 11/07/2021 07:59:59 - INFO - __main__ - Step 76374: {'lr': 0.0002480206050397898, 'samples': 14663808, 'steps': 76373, 'loss/train': 1.5051571130752563} 11/07/2021 07:59:59 - INFO - __main__ - Step 76375: {'lr': 0.00024801529846222626, 'samples': 14664000, 'steps': 76374, 'loss/train': 1.4197486639022827} 11/07/2021 08:00:00 - INFO - __main__ - Step 76376: {'lr': 0.00024800999188555697, 'samples': 14664192, 'steps': 76375, 'loss/train': 1.5471467971801758} 11/07/2021 08:00:00 - INFO - __main__ - Step 76377: {'lr': 0.00024800468530978436, 'samples': 14664384, 'steps': 76376, 'loss/train': 1.7511957883834839} 11/07/2021 08:00:00 - INFO - __main__ - Step 76378: {'lr': 0.00024799937873491083, 'samples': 14664576, 'steps': 76377, 'loss/train': 0.9491642713546753} 11/07/2021 08:00:02 - INFO - __main__ - Step 76379: {'lr': 0.0002479940721609387, 'samples': 14664768, 'steps': 76378, 'loss/train': 1.2701504230499268} 11/07/2021 08:00:02 - INFO - __main__ - Step 76380: {'lr': 0.00024798876558787043, 'samples': 14664960, 'steps': 76379, 'loss/train': 1.6766835451126099} 11/07/2021 08:00:02 - INFO - __main__ - Step 76381: {'lr': 0.0002479834590157084, 'samples': 14665152, 'steps': 76380, 'loss/train': 1.7989826202392578} 11/07/2021 08:00:03 - INFO - __main__ - Step 76382: {'lr': 0.000247978152444455, 'samples': 14665344, 'steps': 76381, 'loss/train': 1.4477298259735107} 11/07/2021 08:00:03 - INFO - __main__ - Step 76383: {'lr': 0.00024797284587411257, 'samples': 14665536, 'steps': 76382, 'loss/train': 1.2258532047271729} 11/07/2021 08:00:04 - INFO - __main__ - Step 76384: {'lr': 0.00024796753930468357, 'samples': 14665728, 'steps': 76383, 'loss/train': 0.13239026069641113} 11/07/2021 08:00:04 - INFO - __main__ - Step 76385: {'lr': 0.0002479622327361705, 'samples': 14665920, 'steps': 76384, 'loss/train': 1.3292256593704224} 11/07/2021 08:00:05 - INFO - __main__ - Step 76386: {'lr': 0.0002479569261685755, 'samples': 14666112, 'steps': 76385, 'loss/train': 1.3238399028778076} 11/07/2021 08:00:05 - INFO - __main__ - Step 76387: {'lr': 0.00024795161960190103, 'samples': 14666304, 'steps': 76386, 'loss/train': 1.5906357765197754} 11/07/2021 08:00:06 - INFO - __main__ - Step 76388: {'lr': 0.00024794631303614955, 'samples': 14666496, 'steps': 76387, 'loss/train': 1.332434058189392} 11/07/2021 08:00:06 - INFO - __main__ - Step 76389: {'lr': 0.0002479410064713234, 'samples': 14666688, 'steps': 76388, 'loss/train': 1.096675157546997} 11/07/2021 08:00:07 - INFO - __main__ - Step 76390: {'lr': 0.0002479356999074251, 'samples': 14666880, 'steps': 76389, 'loss/train': 1.6088452339172363} 11/07/2021 08:00:08 - INFO - __main__ - Step 76391: {'lr': 0.0002479303933444569, 'samples': 14667072, 'steps': 76390, 'loss/train': 2.675046920776367} 11/07/2021 08:00:08 - INFO - __main__ - Step 76392: {'lr': 0.0002479250867824212, 'samples': 14667264, 'steps': 76391, 'loss/train': 2.552844285964966} 11/07/2021 08:00:08 - INFO - __main__ - Step 76393: {'lr': 0.0002479197802213204, 'samples': 14667456, 'steps': 76392, 'loss/train': 1.3399900197982788} 11/07/2021 08:00:09 - INFO - __main__ - Step 76394: {'lr': 0.00024791447366115697, 'samples': 14667648, 'steps': 76393, 'loss/train': 1.6222590208053589} 11/07/2021 08:00:09 - INFO - __main__ - Step 76395: {'lr': 0.00024790916710193324, 'samples': 14667840, 'steps': 76394, 'loss/train': 1.062508463859558} 11/07/2021 08:00:10 - INFO - __main__ - Step 76396: {'lr': 0.0002479038605436516, 'samples': 14668032, 'steps': 76395, 'loss/train': 1.2724058628082275} 11/07/2021 08:00:11 - INFO - __main__ - Step 76397: {'lr': 0.0002478985539863144, 'samples': 14668224, 'steps': 76396, 'loss/train': 1.0563656091690063} 11/07/2021 08:00:11 - INFO - __main__ - Step 76398: {'lr': 0.00024789324742992427, 'samples': 14668416, 'steps': 76397, 'loss/train': 1.738440990447998} 11/07/2021 08:00:11 - INFO - __main__ - Step 76399: {'lr': 0.00024788794087448327, 'samples': 14668608, 'steps': 76398, 'loss/train': 1.3715945482254028} 11/07/2021 08:00:12 - INFO - __main__ - Step 76400: {'lr': 0.00024788263431999393, 'samples': 14668800, 'steps': 76399, 'loss/train': 1.7890095710754395} 11/07/2021 08:00:12 - INFO - __main__ - Step 76401: {'lr': 0.00024787732776645864, 'samples': 14668992, 'steps': 76400, 'loss/train': 1.501358985900879} 11/07/2021 08:00:13 - INFO - __main__ - Step 76402: {'lr': 0.00024787202121387983, 'samples': 14669184, 'steps': 76401, 'loss/train': 1.7363619804382324} 11/07/2021 08:00:13 - INFO - __main__ - Step 76403: {'lr': 0.0002478667146622598, 'samples': 14669376, 'steps': 76402, 'loss/train': 1.6018526554107666} 11/07/2021 08:00:14 - INFO - __main__ - Step 76404: {'lr': 0.000247861408111601, 'samples': 14669568, 'steps': 76403, 'loss/train': 1.2765802145004272} 11/07/2021 08:00:14 - INFO - __main__ - Step 76405: {'lr': 0.0002478561015619058, 'samples': 14669760, 'steps': 76404, 'loss/train': 0.9152487516403198} 11/07/2021 08:00:14 - INFO - __main__ - Step 76406: {'lr': 0.0002478507950131767, 'samples': 14669952, 'steps': 76405, 'loss/train': 1.9501985311508179} 11/07/2021 08:00:15 - INFO - __main__ - Step 76407: {'lr': 0.00024784548846541586, 'samples': 14670144, 'steps': 76406, 'loss/train': 1.2367459535598755} 11/07/2021 08:00:16 - INFO - __main__ - Step 76408: {'lr': 0.00024784018191862593, 'samples': 14670336, 'steps': 76407, 'loss/train': 1.5471326112747192} 11/07/2021 08:00:16 - INFO - __main__ - Step 76409: {'lr': 0.0002478348753728091, 'samples': 14670528, 'steps': 76408, 'loss/train': 1.2570348978042603} 11/07/2021 08:00:16 - INFO - __main__ - Step 76410: {'lr': 0.0002478295688279679, 'samples': 14670720, 'steps': 76409, 'loss/train': 1.1716830730438232} 11/07/2021 08:00:17 - INFO - __main__ - Step 76411: {'lr': 0.00024782426228410465, 'samples': 14670912, 'steps': 76410, 'loss/train': 1.7024093866348267} 11/07/2021 08:00:18 - INFO - __main__ - Step 76412: {'lr': 0.00024781895574122186, 'samples': 14671104, 'steps': 76411, 'loss/train': 1.8364087343215942} 11/07/2021 08:00:18 - INFO - __main__ - Step 76413: {'lr': 0.0002478136491993217, 'samples': 14671296, 'steps': 76412, 'loss/train': 2.2997357845306396} 11/07/2021 08:00:19 - INFO - __main__ - Step 76414: {'lr': 0.00024780834265840666, 'samples': 14671488, 'steps': 76413, 'loss/train': 1.1029179096221924} 11/07/2021 08:00:19 - INFO - __main__ - Step 76415: {'lr': 0.00024780303611847914, 'samples': 14671680, 'steps': 76414, 'loss/train': 2.511110305786133} 11/07/2021 08:00:19 - INFO - __main__ - Step 76416: {'lr': 0.0002477977295795416, 'samples': 14671872, 'steps': 76415, 'loss/train': 1.5545880794525146} 11/07/2021 08:00:20 - INFO - __main__ - Step 76417: {'lr': 0.0002477924230415963, 'samples': 14672064, 'steps': 76416, 'loss/train': 1.9429829120635986} 11/07/2021 08:00:21 - INFO - __main__ - Step 76418: {'lr': 0.00024778711650464574, 'samples': 14672256, 'steps': 76417, 'loss/train': 1.254936695098877} 11/07/2021 08:00:21 - INFO - __main__ - Step 76419: {'lr': 0.00024778180996869225, 'samples': 14672448, 'steps': 76418, 'loss/train': 1.3025134801864624} 11/07/2021 08:00:21 - INFO - __main__ - Step 76420: {'lr': 0.0002477765034337383, 'samples': 14672640, 'steps': 76419, 'loss/train': 1.5756897926330566} 11/07/2021 08:00:22 - INFO - __main__ - Step 76421: {'lr': 0.0002477711968997861, 'samples': 14672832, 'steps': 76420, 'loss/train': 1.5596116781234741} 11/07/2021 08:00:22 - INFO - __main__ - Step 76422: {'lr': 0.00024776589036683825, 'samples': 14673024, 'steps': 76421, 'loss/train': 0.8915702104568481} 11/07/2021 08:00:23 - INFO - __main__ - Step 76423: {'lr': 0.00024776058383489707, 'samples': 14673216, 'steps': 76422, 'loss/train': 1.6767845153808594} 11/07/2021 08:00:24 - INFO - __main__ - Step 76424: {'lr': 0.0002477552773039649, 'samples': 14673408, 'steps': 76423, 'loss/train': 1.3067383766174316} 11/07/2021 08:00:24 - INFO - __main__ - Step 76425: {'lr': 0.0002477499707740443, 'samples': 14673600, 'steps': 76424, 'loss/train': 0.09288504719734192} 11/07/2021 08:00:24 - INFO - __main__ - Step 76426: {'lr': 0.0002477446642451374, 'samples': 14673792, 'steps': 76425, 'loss/train': 1.7415355443954468} 11/07/2021 08:00:25 - INFO - __main__ - Step 76427: {'lr': 0.0002477393577172467, 'samples': 14673984, 'steps': 76426, 'loss/train': 1.3293097019195557} 11/07/2021 08:00:26 - INFO - __main__ - Step 76428: {'lr': 0.00024773405119037464, 'samples': 14674176, 'steps': 76427, 'loss/train': 1.3692983388900757} 11/07/2021 08:00:26 - INFO - __main__ - Step 76429: {'lr': 0.0002477287446645236, 'samples': 14674368, 'steps': 76428, 'loss/train': 1.5127613544464111} 11/07/2021 08:00:26 - INFO - __main__ - Step 76430: {'lr': 0.0002477234381396959, 'samples': 14674560, 'steps': 76429, 'loss/train': 1.7909499406814575} 11/07/2021 08:00:27 - INFO - __main__ - Step 76431: {'lr': 0.00024771813161589403, 'samples': 14674752, 'steps': 76430, 'loss/train': 1.242266058921814} 11/07/2021 08:00:27 - INFO - __main__ - Step 76432: {'lr': 0.0002477128250931203, 'samples': 14674944, 'steps': 76431, 'loss/train': 1.4826338291168213} 11/07/2021 08:00:28 - INFO - __main__ - Step 76433: {'lr': 0.00024770751857137714, 'samples': 14675136, 'steps': 76432, 'loss/train': 0.9649997353553772} 11/07/2021 08:00:28 - INFO - __main__ - Step 76434: {'lr': 0.000247702212050667, 'samples': 14675328, 'steps': 76433, 'loss/train': 0.7279593348503113} 11/07/2021 08:00:29 - INFO - __main__ - Step 76435: {'lr': 0.00024769690553099214, 'samples': 14675520, 'steps': 76434, 'loss/train': 1.3344528675079346} 11/07/2021 08:00:29 - INFO - __main__ - Step 76436: {'lr': 0.00024769159901235504, 'samples': 14675712, 'steps': 76435, 'loss/train': 1.2055915594100952} 11/07/2021 08:00:29 - INFO - __main__ - Step 76437: {'lr': 0.00024768629249475807, 'samples': 14675904, 'steps': 76436, 'loss/train': 1.4198837280273438} 11/07/2021 08:00:31 - INFO - __main__ - Step 76438: {'lr': 0.0002476809859782037, 'samples': 14676096, 'steps': 76437, 'loss/train': 1.2398426532745361} 11/07/2021 08:00:31 - INFO - __main__ - Step 76439: {'lr': 0.00024767567946269416, 'samples': 14676288, 'steps': 76438, 'loss/train': 1.9029954671859741} 11/07/2021 08:00:31 - INFO - __main__ - Step 76440: {'lr': 0.00024767037294823194, 'samples': 14676480, 'steps': 76439, 'loss/train': 1.5263675451278687} 11/07/2021 08:00:32 - INFO - __main__ - Step 76441: {'lr': 0.0002476650664348194, 'samples': 14676672, 'steps': 76440, 'loss/train': 1.1287744045257568} 11/07/2021 08:00:32 - INFO - __main__ - Step 76442: {'lr': 0.00024765975992245895, 'samples': 14676864, 'steps': 76441, 'loss/train': 1.1207387447357178} 11/07/2021 08:00:33 - INFO - __main__ - Step 76443: {'lr': 0.00024765445341115295, 'samples': 14677056, 'steps': 76442, 'loss/train': 0.9569492936134338} 11/07/2021 08:00:34 - INFO - __main__ - Step 76444: {'lr': 0.00024764914690090384, 'samples': 14677248, 'steps': 76443, 'loss/train': 1.0062737464904785} 11/07/2021 08:00:34 - INFO - __main__ - Step 76445: {'lr': 0.000247643840391714, 'samples': 14677440, 'steps': 76444, 'loss/train': 1.4390857219696045} 11/07/2021 08:00:34 - INFO - __main__ - Step 76446: {'lr': 0.0002476385338835858, 'samples': 14677632, 'steps': 76445, 'loss/train': 2.015591621398926} 11/07/2021 08:00:35 - INFO - __main__ - Step 76447: {'lr': 0.0002476332273765216, 'samples': 14677824, 'steps': 76446, 'loss/train': 1.9910944700241089} 11/07/2021 08:00:35 - INFO - __main__ - Step 76448: {'lr': 0.00024762792087052387, 'samples': 14678016, 'steps': 76447, 'loss/train': 1.5972868204116821} 11/07/2021 08:00:36 - INFO - __main__ - Step 76449: {'lr': 0.000247622614365595, 'samples': 14678208, 'steps': 76448, 'loss/train': 1.0795228481292725} 11/07/2021 08:00:36 - INFO - __main__ - Step 76450: {'lr': 0.00024761730786173735, 'samples': 14678400, 'steps': 76449, 'loss/train': 1.5008212327957153} 11/07/2021 08:00:37 - INFO - __main__ - Step 76451: {'lr': 0.00024761200135895323, 'samples': 14678592, 'steps': 76450, 'loss/train': 2.448173999786377} 11/07/2021 08:00:37 - INFO - __main__ - Step 76452: {'lr': 0.0002476066948572452, 'samples': 14678784, 'steps': 76451, 'loss/train': 1.3580819368362427} 11/07/2021 08:00:37 - INFO - __main__ - Step 76453: {'lr': 0.0002476013883566155, 'samples': 14678976, 'steps': 76452, 'loss/train': 1.2634109258651733} 11/07/2021 08:00:38 - INFO - __main__ - Step 76454: {'lr': 0.00024759608185706653, 'samples': 14679168, 'steps': 76453, 'loss/train': 1.10969877243042} 11/07/2021 08:00:39 - INFO - __main__ - Step 76455: {'lr': 0.0002475907753586008, 'samples': 14679360, 'steps': 76454, 'loss/train': 1.2738584280014038} 11/07/2021 08:00:39 - INFO - __main__ - Step 76456: {'lr': 0.00024758546886122055, 'samples': 14679552, 'steps': 76455, 'loss/train': 1.3105924129486084} 11/07/2021 08:00:39 - INFO - __main__ - Step 76457: {'lr': 0.0002475801623649283, 'samples': 14679744, 'steps': 76456, 'loss/train': 1.6688863039016724} 11/07/2021 08:00:40 - INFO - __main__ - Step 76458: {'lr': 0.0002475748558697264, 'samples': 14679936, 'steps': 76457, 'loss/train': 1.2082278728485107} 11/07/2021 08:00:41 - INFO - __main__ - Step 76459: {'lr': 0.0002475695493756172, 'samples': 14680128, 'steps': 76458, 'loss/train': 1.3925544023513794} 11/07/2021 08:00:41 - INFO - __main__ - Step 76460: {'lr': 0.00024756424288260317, 'samples': 14680320, 'steps': 76459, 'loss/train': 1.394300103187561} 11/07/2021 08:00:42 - INFO - __main__ - Step 76461: {'lr': 0.00024755893639068666, 'samples': 14680512, 'steps': 76460, 'loss/train': 1.3018441200256348} 11/07/2021 08:00:42 - INFO - __main__ - Step 76462: {'lr': 0.00024755362989987, 'samples': 14680704, 'steps': 76461, 'loss/train': 1.2514402866363525} 11/07/2021 08:00:42 - INFO - __main__ - Step 76463: {'lr': 0.0002475483234101557, 'samples': 14680896, 'steps': 76462, 'loss/train': 1.2183928489685059} 11/07/2021 08:00:43 - INFO - __main__ - Step 76464: {'lr': 0.000247543016921546, 'samples': 14681088, 'steps': 76463, 'loss/train': 1.6193593740463257} 11/07/2021 08:00:44 - INFO - __main__ - Step 76465: {'lr': 0.0002475377104340435, 'samples': 14681280, 'steps': 76464, 'loss/train': 1.1794843673706055} 11/07/2021 08:00:44 - INFO - __main__ - Step 76466: {'lr': 0.0002475324039476504, 'samples': 14681472, 'steps': 76465, 'loss/train': 1.5782068967819214} 11/07/2021 08:00:44 - INFO - __main__ - Step 76467: {'lr': 0.0002475270974623691, 'samples': 14681664, 'steps': 76466, 'loss/train': 1.3467661142349243} 11/07/2021 08:00:45 - INFO - __main__ - Step 76468: {'lr': 0.0002475217909782021, 'samples': 14681856, 'steps': 76467, 'loss/train': 1.0799028873443604} 11/07/2021 08:00:45 - INFO - __main__ - Step 76469: {'lr': 0.00024751648449515177, 'samples': 14682048, 'steps': 76468, 'loss/train': 1.5998741388320923} 11/07/2021 08:00:46 - INFO - __main__ - Step 76470: {'lr': 0.00024751117801322044, 'samples': 14682240, 'steps': 76469, 'loss/train': 1.2522507905960083} 11/07/2021 08:00:46 - INFO - __main__ - Step 76471: {'lr': 0.0002475058715324106, 'samples': 14682432, 'steps': 76470, 'loss/train': 1.7010105848312378} 11/07/2021 08:00:47 - INFO - __main__ - Step 76472: {'lr': 0.00024750056505272455, 'samples': 14682624, 'steps': 76471, 'loss/train': 1.6531257629394531} 11/07/2021 08:00:47 - INFO - __main__ - Step 76473: {'lr': 0.00024749525857416466, 'samples': 14682816, 'steps': 76472, 'loss/train': 1.6174747943878174} 11/07/2021 08:00:47 - INFO - __main__ - Step 76474: {'lr': 0.00024748995209673336, 'samples': 14683008, 'steps': 76473, 'loss/train': 1.3149439096450806} 11/07/2021 08:00:48 - INFO - __main__ - Step 76475: {'lr': 0.0002474846456204331, 'samples': 14683200, 'steps': 76474, 'loss/train': 1.1820114850997925} 11/07/2021 08:00:49 - INFO - __main__ - Step 76476: {'lr': 0.0002474793391452662, 'samples': 14683392, 'steps': 76475, 'loss/train': 1.4067726135253906} 11/07/2021 08:00:49 - INFO - __main__ - Step 76477: {'lr': 0.0002474740326712351, 'samples': 14683584, 'steps': 76476, 'loss/train': 1.1133842468261719} 11/07/2021 08:00:49 - INFO - __main__ - Step 76478: {'lr': 0.0002474687261983421, 'samples': 14683776, 'steps': 76477, 'loss/train': 1.4842219352722168} 11/07/2021 08:00:50 - INFO - __main__ - Step 76479: {'lr': 0.0002474634197265897, 'samples': 14683968, 'steps': 76478, 'loss/train': 0.9015663862228394} 11/07/2021 08:00:51 - INFO - __main__ - Step 76480: {'lr': 0.00024745811325598027, 'samples': 14684160, 'steps': 76479, 'loss/train': 1.1607993841171265} 11/07/2021 08:00:51 - INFO - __main__ - Step 76481: {'lr': 0.0002474528067865161, 'samples': 14684352, 'steps': 76480, 'loss/train': 1.901441216468811} 11/07/2021 08:00:52 - INFO - __main__ - Step 76482: {'lr': 0.0002474475003181997, 'samples': 14684544, 'steps': 76481, 'loss/train': 1.0616123676300049} 11/07/2021 08:00:52 - INFO - __main__ - Step 76483: {'lr': 0.0002474421938510335, 'samples': 14684736, 'steps': 76482, 'loss/train': 1.134796142578125} 11/07/2021 08:00:52 - INFO - __main__ - Step 76484: {'lr': 0.0002474368873850197, 'samples': 14684928, 'steps': 76483, 'loss/train': 1.062170147895813} 11/07/2021 08:00:53 - INFO - __main__ - Step 76485: {'lr': 0.0002474315809201608, 'samples': 14685120, 'steps': 76484, 'loss/train': 1.6231729984283447} 11/07/2021 08:00:54 - INFO - __main__ - Step 76486: {'lr': 0.00024742627445645916, 'samples': 14685312, 'steps': 76485, 'loss/train': 1.4239836931228638} 11/07/2021 08:00:54 - INFO - __main__ - Step 76487: {'lr': 0.00024742096799391727, 'samples': 14685504, 'steps': 76486, 'loss/train': 1.32004714012146} 11/07/2021 08:00:54 - INFO - __main__ - Step 76488: {'lr': 0.0002474156615325374, 'samples': 14685696, 'steps': 76487, 'loss/train': 1.4191263914108276} 11/07/2021 08:00:55 - INFO - __main__ - Step 76489: {'lr': 0.00024741035507232207, 'samples': 14685888, 'steps': 76488, 'loss/train': 1.3464304208755493} 11/07/2021 08:00:55 - INFO - __main__ - Step 76490: {'lr': 0.00024740504861327353, 'samples': 14686080, 'steps': 76489, 'loss/train': 0.8290773034095764} 11/07/2021 08:00:56 - INFO - __main__ - Step 76491: {'lr': 0.0002473997421553942, 'samples': 14686272, 'steps': 76490, 'loss/train': 1.2817902565002441} 11/07/2021 08:00:56 - INFO - __main__ - Step 76492: {'lr': 0.0002473944356986866, 'samples': 14686464, 'steps': 76491, 'loss/train': 0.919394314289093} 11/07/2021 08:00:57 - INFO - __main__ - Step 76493: {'lr': 0.000247389129243153, 'samples': 14686656, 'steps': 76492, 'loss/train': 1.1656180620193481} 11/07/2021 08:00:57 - INFO - __main__ - Step 76494: {'lr': 0.00024738382278879586, 'samples': 14686848, 'steps': 76493, 'loss/train': 1.4588127136230469} 11/07/2021 08:00:57 - INFO - __main__ - Step 76495: {'lr': 0.00024737851633561747, 'samples': 14687040, 'steps': 76494, 'loss/train': 1.559408187866211} 11/07/2021 08:00:59 - INFO - __main__ - Step 76496: {'lr': 0.00024737320988362025, 'samples': 14687232, 'steps': 76495, 'loss/train': 1.5464478731155396} 11/07/2021 08:00:59 - INFO - __main__ - Step 76497: {'lr': 0.00024736790343280667, 'samples': 14687424, 'steps': 76496, 'loss/train': 1.247726559638977} 11/07/2021 08:00:59 - INFO - __main__ - Step 76498: {'lr': 0.00024736259698317903, 'samples': 14687616, 'steps': 76497, 'loss/train': 1.3336431980133057} 11/07/2021 08:01:00 - INFO - __main__ - Step 76499: {'lr': 0.0002473572905347398, 'samples': 14687808, 'steps': 76498, 'loss/train': 0.5694930553436279} 11/07/2021 08:01:00 - INFO - __main__ - Step 76500: {'lr': 0.0002473519840874913, 'samples': 14688000, 'steps': 76499, 'loss/train': 1.5795866250991821} 11/07/2021 08:01:01 - INFO - __main__ - Step 76501: {'lr': 0.000247346677641436, 'samples': 14688192, 'steps': 76500, 'loss/train': 1.4317065477371216} 11/07/2021 08:01:01 - INFO - __main__ - Step 76502: {'lr': 0.0002473413711965762, 'samples': 14688384, 'steps': 76501, 'loss/train': 1.4046834707260132} 11/07/2021 08:01:02 - INFO - __main__ - Step 76503: {'lr': 0.00024733606475291437, 'samples': 14688576, 'steps': 76502, 'loss/train': 1.4411689043045044} 11/07/2021 08:01:02 - INFO - __main__ - Step 76504: {'lr': 0.0002473307583104528, 'samples': 14688768, 'steps': 76503, 'loss/train': 1.9234129190444946} 11/07/2021 08:01:02 - INFO - __main__ - Step 76505: {'lr': 0.00024732545186919403, 'samples': 14688960, 'steps': 76504, 'loss/train': 1.481528878211975} 11/07/2021 08:01:04 - INFO - __main__ - Step 76506: {'lr': 0.00024732014542914045, 'samples': 14689152, 'steps': 76505, 'loss/train': 1.705116629600525} 11/07/2021 08:01:04 - INFO - __main__ - Step 76507: {'lr': 0.00024731483899029423, 'samples': 14689344, 'steps': 76506, 'loss/train': 1.4580529928207397} 11/07/2021 08:01:04 - INFO - __main__ - Step 76508: {'lr': 0.0002473095325526579, 'samples': 14689536, 'steps': 76507, 'loss/train': 1.047247052192688} 11/07/2021 08:01:05 - INFO - __main__ - Step 76509: {'lr': 0.000247304226116234, 'samples': 14689728, 'steps': 76508, 'loss/train': 1.0716395378112793} 11/07/2021 08:01:05 - INFO - __main__ - Step 76510: {'lr': 0.0002472989196810246, 'samples': 14689920, 'steps': 76509, 'loss/train': 0.8011226058006287} 11/07/2021 08:01:06 - INFO - __main__ - Step 76511: {'lr': 0.00024729361324703236, 'samples': 14690112, 'steps': 76510, 'loss/train': 1.287645697593689} 11/07/2021 08:01:06 - INFO - __main__ - Step 76512: {'lr': 0.0002472883068142595, 'samples': 14690304, 'steps': 76511, 'loss/train': 1.181587815284729} 11/07/2021 08:01:07 - INFO - __main__ - Step 76513: {'lr': 0.00024728300038270856, 'samples': 14690496, 'steps': 76512, 'loss/train': 1.2998803853988647} 11/07/2021 08:01:07 - INFO - __main__ - Step 76514: {'lr': 0.0002472776939523818, 'samples': 14690688, 'steps': 76513, 'loss/train': 1.8938759565353394} 11/07/2021 08:01:07 - INFO - __main__ - Step 76515: {'lr': 0.00024727238752328173, 'samples': 14690880, 'steps': 76514, 'loss/train': 1.2651901245117188} 11/07/2021 08:01:08 - INFO - __main__ - Step 76516: {'lr': 0.0002472670810954106, 'samples': 14691072, 'steps': 76515, 'loss/train': 1.5846993923187256} 11/07/2021 08:01:09 - INFO - __main__ - Step 76517: {'lr': 0.00024726177466877095, 'samples': 14691264, 'steps': 76516, 'loss/train': 1.2036821842193604} 11/07/2021 08:01:09 - INFO - __main__ - Step 76518: {'lr': 0.0002472564682433651, 'samples': 14691456, 'steps': 76517, 'loss/train': 1.1470553874969482} 11/07/2021 08:01:10 - INFO - __main__ - Step 76519: {'lr': 0.0002472511618191955, 'samples': 14691648, 'steps': 76518, 'loss/train': 4.62135124206543} 11/07/2021 08:01:10 - INFO - __main__ - Step 76520: {'lr': 0.00024724585539626445, 'samples': 14691840, 'steps': 76519, 'loss/train': 1.0083783864974976} 11/07/2021 08:01:10 - INFO - __main__ - Step 76521: {'lr': 0.0002472405489745743, 'samples': 14692032, 'steps': 76520, 'loss/train': 1.068682074546814} 11/07/2021 08:01:12 - INFO - __main__ - Step 76522: {'lr': 0.00024723524255412755, 'samples': 14692224, 'steps': 76521, 'loss/train': 1.707319974899292} 11/07/2021 08:01:12 - INFO - __main__ - Step 76523: {'lr': 0.00024722993613492654, 'samples': 14692416, 'steps': 76522, 'loss/train': 1.3720544576644897} 11/07/2021 08:01:12 - INFO - __main__ - Step 76524: {'lr': 0.0002472246297169737, 'samples': 14692608, 'steps': 76523, 'loss/train': 0.8118338584899902} 11/07/2021 08:01:13 - INFO - __main__ - Step 76525: {'lr': 0.0002472193233002714, 'samples': 14692800, 'steps': 76524, 'loss/train': 1.426585078239441} 11/07/2021 08:01:13 - INFO - __main__ - Step 76526: {'lr': 0.00024721401688482204, 'samples': 14692992, 'steps': 76525, 'loss/train': 1.0391615629196167} 11/07/2021 08:01:14 - INFO - __main__ - Step 76527: {'lr': 0.000247208710470628, 'samples': 14693184, 'steps': 76526, 'loss/train': 1.3284995555877686} 11/07/2021 08:01:14 - INFO - __main__ - Step 76528: {'lr': 0.00024720340405769164, 'samples': 14693376, 'steps': 76527, 'loss/train': 1.541872262954712} 11/07/2021 08:01:15 - INFO - __main__ - Step 76529: {'lr': 0.00024719809764601537, 'samples': 14693568, 'steps': 76528, 'loss/train': 1.6208542585372925} 11/07/2021 08:01:15 - INFO - __main__ - Step 76530: {'lr': 0.00024719279123560164, 'samples': 14693760, 'steps': 76529, 'loss/train': 1.34487783908844} 11/07/2021 08:01:15 - INFO - __main__ - Step 76531: {'lr': 0.0002471874848264528, 'samples': 14693952, 'steps': 76530, 'loss/train': 1.3352975845336914} 11/07/2021 08:01:16 - INFO - __main__ - Step 76532: {'lr': 0.0002471821784185712, 'samples': 14694144, 'steps': 76531, 'loss/train': 0.8995624780654907} 11/07/2021 08:01:17 - INFO - __main__ - Step 76533: {'lr': 0.0002471768720119594, 'samples': 14694336, 'steps': 76532, 'loss/train': 1.2066659927368164} 11/07/2021 08:01:17 - INFO - __main__ - Step 76534: {'lr': 0.0002471715656066195, 'samples': 14694528, 'steps': 76533, 'loss/train': 0.5254634618759155} 11/07/2021 08:01:18 - INFO - __main__ - Step 76535: {'lr': 0.0002471662592025541, 'samples': 14694720, 'steps': 76534, 'loss/train': 1.2104488611221313} 11/07/2021 08:01:18 - INFO - __main__ - Step 76536: {'lr': 0.00024716095279976553, 'samples': 14694912, 'steps': 76535, 'loss/train': 1.3267451524734497} 11/07/2021 08:01:19 - INFO - __main__ - Step 76537: {'lr': 0.0002471556463982562, 'samples': 14695104, 'steps': 76536, 'loss/train': 1.4361915588378906} 11/07/2021 08:01:19 - INFO - __main__ - Step 76538: {'lr': 0.00024715033999802845, 'samples': 14695296, 'steps': 76537, 'loss/train': 1.4701838493347168} 11/07/2021 08:01:20 - INFO - __main__ - Step 76539: {'lr': 0.00024714503359908477, 'samples': 14695488, 'steps': 76538, 'loss/train': 1.4801044464111328} 11/07/2021 08:01:20 - INFO - __main__ - Step 76540: {'lr': 0.00024713972720142745, 'samples': 14695680, 'steps': 76539, 'loss/train': 1.1059015989303589} 11/07/2021 08:01:20 - INFO - __main__ - Step 76541: {'lr': 0.00024713442080505897, 'samples': 14695872, 'steps': 76540, 'loss/train': 2.1534981727600098} 11/07/2021 08:01:21 - INFO - __main__ - Step 76542: {'lr': 0.00024712911440998166, 'samples': 14696064, 'steps': 76541, 'loss/train': 1.2947605848312378} 11/07/2021 08:01:22 - INFO - __main__ - Step 76543: {'lr': 0.00024712380801619787, 'samples': 14696256, 'steps': 76542, 'loss/train': 1.275869369506836} 11/07/2021 08:01:22 - INFO - __main__ - Step 76544: {'lr': 0.00024711850162371013, 'samples': 14696448, 'steps': 76543, 'loss/train': 1.3427972793579102} 11/07/2021 08:01:22 - INFO - __main__ - Step 76545: {'lr': 0.00024711319523252066, 'samples': 14696640, 'steps': 76544, 'loss/train': 1.1615556478500366} 11/07/2021 08:01:23 - INFO - __main__ - Step 76546: {'lr': 0.00024710788884263214, 'samples': 14696832, 'steps': 76545, 'loss/train': 1.3651036024093628} 11/07/2021 08:01:23 - INFO - __main__ - Step 76547: {'lr': 0.0002471025824540466, 'samples': 14697024, 'steps': 76546, 'loss/train': 1.6322492361068726} 11/07/2021 08:01:24 - INFO - __main__ - Step 76548: {'lr': 0.0002470972760667666, 'samples': 14697216, 'steps': 76547, 'loss/train': 1.4258074760437012} 11/07/2021 08:01:24 - INFO - __main__ - Step 76549: {'lr': 0.00024709196968079455, 'samples': 14697408, 'steps': 76548, 'loss/train': 1.1681201457977295} 11/07/2021 08:01:25 - INFO - __main__ - Step 76550: {'lr': 0.0002470866632961328, 'samples': 14697600, 'steps': 76549, 'loss/train': 1.2840583324432373} 11/07/2021 08:01:25 - INFO - __main__ - Step 76551: {'lr': 0.00024708135691278374, 'samples': 14697792, 'steps': 76550, 'loss/train': 1.0630611181259155} 11/07/2021 08:01:25 - INFO - __main__ - Step 76552: {'lr': 0.00024707605053074977, 'samples': 14697984, 'steps': 76551, 'loss/train': 1.673276662826538} 11/07/2021 08:01:26 - INFO - __main__ - Step 76553: {'lr': 0.00024707074415003327, 'samples': 14698176, 'steps': 76552, 'loss/train': 1.5242913961410522} 11/07/2021 08:01:27 - INFO - __main__ - Step 76554: {'lr': 0.00024706543777063667, 'samples': 14698368, 'steps': 76553, 'loss/train': 0.9089794754981995} 11/07/2021 08:01:27 - INFO - __main__ - Step 76555: {'lr': 0.0002470601313925623, 'samples': 14698560, 'steps': 76554, 'loss/train': 1.045477032661438} 11/07/2021 08:01:27 - INFO - __main__ - Step 76556: {'lr': 0.0002470548250158127, 'samples': 14698752, 'steps': 76555, 'loss/train': 2.310344696044922} 11/07/2021 08:01:28 - INFO - __main__ - Step 76557: {'lr': 0.0002470495186403901, 'samples': 14698944, 'steps': 76556, 'loss/train': 0.6723943948745728} 11/07/2021 08:01:29 - INFO - __main__ - Step 76558: {'lr': 0.00024704421226629685, 'samples': 14699136, 'steps': 76557, 'loss/train': 1.610715627670288} 11/07/2021 08:01:29 - INFO - __main__ - Step 76559: {'lr': 0.00024703890589353563, 'samples': 14699328, 'steps': 76558, 'loss/train': 1.5074776411056519} 11/07/2021 08:01:30 - INFO - __main__ - Step 76560: {'lr': 0.0002470335995221085, 'samples': 14699520, 'steps': 76559, 'loss/train': 1.2225558757781982} 11/07/2021 08:01:30 - INFO - __main__ - Step 76561: {'lr': 0.000247028293152018, 'samples': 14699712, 'steps': 76560, 'loss/train': 0.4580589234828949} 11/07/2021 08:01:30 - INFO - __main__ - Step 76562: {'lr': 0.0002470229867832665, 'samples': 14699904, 'steps': 76561, 'loss/train': 1.8701986074447632} 11/07/2021 08:01:31 - INFO - __main__ - Step 76563: {'lr': 0.0002470176804158564, 'samples': 14700096, 'steps': 76562, 'loss/train': 1.619586706161499} 11/07/2021 08:01:32 - INFO - __main__ - Step 76564: {'lr': 0.00024701237404979007, 'samples': 14700288, 'steps': 76563, 'loss/train': 0.6901351809501648} 11/07/2021 08:01:32 - INFO - __main__ - Step 76565: {'lr': 0.00024700706768506994, 'samples': 14700480, 'steps': 76564, 'loss/train': 1.6862456798553467} 11/07/2021 08:01:32 - INFO - __main__ - Step 76566: {'lr': 0.0002470017613216984, 'samples': 14700672, 'steps': 76565, 'loss/train': 1.3801110982894897} 11/07/2021 08:01:33 - INFO - __main__ - Step 76567: {'lr': 0.00024699645495967777, 'samples': 14700864, 'steps': 76566, 'loss/train': 1.3866775035858154} 11/07/2021 08:01:34 - INFO - __main__ - Step 76568: {'lr': 0.0002469911485990105, 'samples': 14701056, 'steps': 76567, 'loss/train': 1.138789176940918} 11/07/2021 08:01:34 - INFO - __main__ - Step 76569: {'lr': 0.00024698584223969896, 'samples': 14701248, 'steps': 76568, 'loss/train': 1.230918526649475} 11/07/2021 08:01:35 - INFO - __main__ - Step 76570: {'lr': 0.0002469805358817456, 'samples': 14701440, 'steps': 76569, 'loss/train': 1.4274860620498657} 11/07/2021 08:01:35 - INFO - __main__ - Step 76571: {'lr': 0.0002469752295251527, 'samples': 14701632, 'steps': 76570, 'loss/train': 0.1303202509880066} 11/07/2021 08:01:35 - INFO - __main__ - Step 76572: {'lr': 0.00024696992316992276, 'samples': 14701824, 'steps': 76571, 'loss/train': 0.9270639419555664} 11/07/2021 08:01:36 - INFO - __main__ - Step 76573: {'lr': 0.00024696461681605826, 'samples': 14702016, 'steps': 76572, 'loss/train': 1.12636137008667} 11/07/2021 08:01:37 - INFO - __main__ - Step 76574: {'lr': 0.0002469593104635613, 'samples': 14702208, 'steps': 76573, 'loss/train': 1.505749225616455} 11/07/2021 08:01:37 - INFO - __main__ - Step 76575: {'lr': 0.0002469540041124345, 'samples': 14702400, 'steps': 76574, 'loss/train': 1.7264330387115479} 11/07/2021 08:01:37 - INFO - __main__ - Step 76576: {'lr': 0.0002469486977626801, 'samples': 14702592, 'steps': 76575, 'loss/train': 1.4142321348190308} 11/07/2021 08:01:38 - INFO - __main__ - Step 76577: {'lr': 0.00024694339141430056, 'samples': 14702784, 'steps': 76576, 'loss/train': 1.302528977394104} 11/07/2021 08:01:39 - INFO - __main__ - Step 76578: {'lr': 0.0002469380850672983, 'samples': 14702976, 'steps': 76577, 'loss/train': 0.6734764575958252} 11/07/2021 08:01:40 - INFO - __main__ - Step 76579: {'lr': 0.00024693277872167574, 'samples': 14703168, 'steps': 76578, 'loss/train': 1.4832706451416016} 11/07/2021 08:01:40 - INFO - __main__ - Step 76580: {'lr': 0.00024692747237743516, 'samples': 14703360, 'steps': 76579, 'loss/train': 1.2438503503799438} 11/07/2021 08:01:40 - INFO - __main__ - Step 76581: {'lr': 0.00024692216603457905, 'samples': 14703552, 'steps': 76580, 'loss/train': 1.6934715509414673} 11/07/2021 08:01:41 - INFO - __main__ - Step 76582: {'lr': 0.00024691685969310974, 'samples': 14703744, 'steps': 76581, 'loss/train': 1.4793946743011475} 11/07/2021 08:01:41 - INFO - __main__ - Step 76583: {'lr': 0.0002469115533530297, 'samples': 14703936, 'steps': 76582, 'loss/train': 1.5837756395339966} 11/07/2021 08:01:41 - INFO - __main__ - Step 76584: {'lr': 0.00024690624701434124, 'samples': 14704128, 'steps': 76583, 'loss/train': 0.1526641547679901} 11/07/2021 08:01:42 - INFO - __main__ - Step 76585: {'lr': 0.00024690094067704677, 'samples': 14704320, 'steps': 76584, 'loss/train': 1.1480419635772705} 11/07/2021 08:01:43 - INFO - __main__ - Step 76586: {'lr': 0.00024689563434114874, 'samples': 14704512, 'steps': 76585, 'loss/train': 0.7274542450904846} 11/07/2021 08:01:43 - INFO - __main__ - Step 76587: {'lr': 0.00024689032800664945, 'samples': 14704704, 'steps': 76586, 'loss/train': 1.7875186204910278} 11/07/2021 08:01:43 - INFO - __main__ - Step 76588: {'lr': 0.00024688502167355126, 'samples': 14704896, 'steps': 76587, 'loss/train': 2.2177488803863525} 11/07/2021 08:01:44 - INFO - __main__ - Step 76589: {'lr': 0.0002468797153418567, 'samples': 14705088, 'steps': 76588, 'loss/train': 1.1151325702667236} 11/07/2021 08:01:45 - INFO - __main__ - Step 76590: {'lr': 0.0002468744090115681, 'samples': 14705280, 'steps': 76589, 'loss/train': 1.5156402587890625} 11/07/2021 08:01:45 - INFO - __main__ - Step 76591: {'lr': 0.0002468691026826878, 'samples': 14705472, 'steps': 76590, 'loss/train': 1.291366457939148} 11/07/2021 08:01:45 - INFO - __main__ - Step 76592: {'lr': 0.00024686379635521826, 'samples': 14705664, 'steps': 76591, 'loss/train': 1.0264614820480347} 11/07/2021 08:01:46 - INFO - __main__ - Step 76593: {'lr': 0.0002468584900291618, 'samples': 14705856, 'steps': 76592, 'loss/train': 1.284133791923523} 11/07/2021 08:01:46 - INFO - __main__ - Step 76594: {'lr': 0.00024685318370452094, 'samples': 14706048, 'steps': 76593, 'loss/train': 1.2998480796813965} 11/07/2021 08:01:47 - INFO - __main__ - Step 76595: {'lr': 0.000246847877381298, 'samples': 14706240, 'steps': 76594, 'loss/train': 1.3607838153839111} 11/07/2021 08:01:48 - INFO - __main__ - Step 76596: {'lr': 0.0002468425710594953, 'samples': 14706432, 'steps': 76595, 'loss/train': 1.1160211563110352} 11/07/2021 08:01:48 - INFO - __main__ - Step 76597: {'lr': 0.00024683726473911525, 'samples': 14706624, 'steps': 76596, 'loss/train': 1.445143699645996} 11/07/2021 08:01:48 - INFO - __main__ - Step 76598: {'lr': 0.00024683195842016033, 'samples': 14706816, 'steps': 76597, 'loss/train': 1.0936453342437744} 11/07/2021 08:01:49 - INFO - __main__ - Step 76599: {'lr': 0.00024682665210263286, 'samples': 14707008, 'steps': 76598, 'loss/train': 2.2913193702697754} 11/07/2021 08:01:49 - INFO - __main__ - Step 76600: {'lr': 0.00024682134578653535, 'samples': 14707200, 'steps': 76599, 'loss/train': 1.3899445533752441} 11/07/2021 08:01:50 - INFO - __main__ - Step 76601: {'lr': 0.00024681603947186996, 'samples': 14707392, 'steps': 76600, 'loss/train': 0.8496217727661133} 11/07/2021 08:01:50 - INFO - __main__ - Step 76602: {'lr': 0.0002468107331586393, 'samples': 14707584, 'steps': 76601, 'loss/train': 1.1833692789077759} 11/07/2021 08:01:51 - INFO - __main__ - Step 76603: {'lr': 0.0002468054268468456, 'samples': 14707776, 'steps': 76602, 'loss/train': 1.354637622833252} 11/07/2021 08:01:51 - INFO - __main__ - Step 76604: {'lr': 0.00024680012053649136, 'samples': 14707968, 'steps': 76603, 'loss/train': 1.5890040397644043} 11/07/2021 08:01:52 - INFO - __main__ - Step 76605: {'lr': 0.0002467948142275789, 'samples': 14708160, 'steps': 76604, 'loss/train': 1.3070472478866577} 11/07/2021 08:01:52 - INFO - __main__ - Step 76606: {'lr': 0.0002467895079201108, 'samples': 14708352, 'steps': 76605, 'loss/train': 1.183074951171875} 11/07/2021 08:01:53 - INFO - __main__ - Step 76607: {'lr': 0.00024678420161408914, 'samples': 14708544, 'steps': 76606, 'loss/train': 1.1909582614898682} 11/07/2021 08:01:53 - INFO - __main__ - Step 76608: {'lr': 0.0002467788953095165, 'samples': 14708736, 'steps': 76607, 'loss/train': 0.9930100440979004} 11/07/2021 08:01:53 - INFO - __main__ - Step 76609: {'lr': 0.00024677358900639524, 'samples': 14708928, 'steps': 76608, 'loss/train': 1.0762027502059937} 11/07/2021 08:01:54 - INFO - __main__ - Step 76610: {'lr': 0.00024676828270472776, 'samples': 14709120, 'steps': 76609, 'loss/train': 1.494868516921997} 11/07/2021 08:01:55 - INFO - __main__ - Step 76611: {'lr': 0.00024676297640451646, 'samples': 14709312, 'steps': 76610, 'loss/train': 1.089908242225647} 11/07/2021 08:01:55 - INFO - __main__ - Step 76612: {'lr': 0.00024675767010576364, 'samples': 14709504, 'steps': 76611, 'loss/train': 1.3267914056777954} 11/07/2021 08:01:55 - INFO - __main__ - Step 76613: {'lr': 0.0002467523638084719, 'samples': 14709696, 'steps': 76612, 'loss/train': 1.4841245412826538} 11/07/2021 08:01:56 - INFO - __main__ - Step 76614: {'lr': 0.00024674705751264337, 'samples': 14709888, 'steps': 76613, 'loss/train': 1.2493855953216553} 11/07/2021 08:01:56 - INFO - __main__ - Step 76615: {'lr': 0.00024674175121828064, 'samples': 14710080, 'steps': 76614, 'loss/train': 1.4910814762115479} 11/07/2021 08:01:57 - INFO - __main__ - Step 76616: {'lr': 0.000246736444925386, 'samples': 14710272, 'steps': 76615, 'loss/train': 1.0136698484420776} 11/07/2021 08:01:58 - INFO - __main__ - Step 76617: {'lr': 0.00024673113863396193, 'samples': 14710464, 'steps': 76616, 'loss/train': 1.2077316045761108} 11/07/2021 08:01:58 - INFO - __main__ - Step 76618: {'lr': 0.00024672583234401066, 'samples': 14710656, 'steps': 76617, 'loss/train': 1.5574195384979248} 11/07/2021 08:01:58 - INFO - __main__ - Step 76619: {'lr': 0.0002467205260555347, 'samples': 14710848, 'steps': 76618, 'loss/train': 1.2378000020980835} 11/07/2021 08:01:59 - INFO - __main__ - Step 76620: {'lr': 0.00024671521976853643, 'samples': 14711040, 'steps': 76619, 'loss/train': 0.498569518327713} 11/07/2021 08:02:00 - INFO - __main__ - Step 76621: {'lr': 0.00024670991348301824, 'samples': 14711232, 'steps': 76620, 'loss/train': 1.4461103677749634} 11/07/2021 08:02:00 - INFO - __main__ - Step 76622: {'lr': 0.0002467046071989825, 'samples': 14711424, 'steps': 76621, 'loss/train': 0.9186320304870605} 11/07/2021 08:02:00 - INFO - __main__ - Step 76623: {'lr': 0.0002466993009164316, 'samples': 14711616, 'steps': 76622, 'loss/train': 2.0588691234588623} 11/07/2021 08:02:01 - INFO - __main__ - Step 76624: {'lr': 0.00024669399463536797, 'samples': 14711808, 'steps': 76623, 'loss/train': 1.2559785842895508} 11/07/2021 08:02:01 - INFO - __main__ - Step 76625: {'lr': 0.000246688688355794, 'samples': 14712000, 'steps': 76624, 'loss/train': 1.021915316581726} 11/07/2021 08:02:02 - INFO - __main__ - Step 76626: {'lr': 0.000246683382077712, 'samples': 14712192, 'steps': 76625, 'loss/train': 1.6454546451568604} 11/07/2021 08:02:02 - INFO - __main__ - Step 76627: {'lr': 0.00024667807580112445, 'samples': 14712384, 'steps': 76626, 'loss/train': 1.6585079431533813} 11/07/2021 08:02:03 - INFO - __main__ - Step 76628: {'lr': 0.0002466727695260338, 'samples': 14712576, 'steps': 76627, 'loss/train': 1.4170700311660767} 11/07/2021 08:02:03 - INFO - __main__ - Step 76629: {'lr': 0.0002466674632524422, 'samples': 14712768, 'steps': 76628, 'loss/train': 0.9875236749649048} 11/07/2021 08:02:03 - INFO - __main__ - Step 76630: {'lr': 0.00024666215698035225, 'samples': 14712960, 'steps': 76629, 'loss/train': 1.0590367317199707} 11/07/2021 08:02:04 - INFO - __main__ - Step 76631: {'lr': 0.00024665685070976624, 'samples': 14713152, 'steps': 76630, 'loss/train': 0.8285797834396362} 11/07/2021 08:02:05 - INFO - __main__ - Step 76632: {'lr': 0.0002466515444406867, 'samples': 14713344, 'steps': 76631, 'loss/train': 1.7022645473480225} 11/07/2021 08:02:05 - INFO - __main__ - Step 76633: {'lr': 0.0002466462381731158, 'samples': 14713536, 'steps': 76632, 'loss/train': 1.26755690574646} 11/07/2021 08:02:05 - INFO - __main__ - Step 76634: {'lr': 0.0002466409319070561, 'samples': 14713728, 'steps': 76633, 'loss/train': 1.5335373878479004} 11/07/2021 08:02:06 - INFO - __main__ - Step 76635: {'lr': 0.00024663562564250995, 'samples': 14713920, 'steps': 76634, 'loss/train': 0.9397090673446655} 11/07/2021 08:02:06 - INFO - __main__ - Step 76636: {'lr': 0.0002466303193794797, 'samples': 14714112, 'steps': 76635, 'loss/train': 1.295532464981079} 11/07/2021 08:02:07 - INFO - __main__ - Step 76637: {'lr': 0.00024662501311796786, 'samples': 14714304, 'steps': 76636, 'loss/train': 1.1557908058166504} 11/07/2021 08:02:07 - INFO - __main__ - Step 76638: {'lr': 0.00024661970685797667, 'samples': 14714496, 'steps': 76637, 'loss/train': 1.1361124515533447} 11/07/2021 08:02:08 - INFO - __main__ - Step 76639: {'lr': 0.00024661440059950857, 'samples': 14714688, 'steps': 76638, 'loss/train': 1.7921228408813477} 11/07/2021 08:02:08 - INFO - __main__ - Step 76640: {'lr': 0.0002466090943425661, 'samples': 14714880, 'steps': 76639, 'loss/train': 1.4242888689041138} 11/07/2021 08:02:08 - INFO - __main__ - Step 76641: {'lr': 0.00024660378808715144, 'samples': 14715072, 'steps': 76640, 'loss/train': 1.7543394565582275} 11/07/2021 08:02:10 - INFO - __main__ - Step 76642: {'lr': 0.0002465984818332671, 'samples': 14715264, 'steps': 76641, 'loss/train': 0.9056379199028015} 11/07/2021 08:02:10 - INFO - __main__ - Step 76643: {'lr': 0.00024659317558091534, 'samples': 14715456, 'steps': 76642, 'loss/train': 1.4632047414779663} 11/07/2021 08:02:10 - INFO - __main__ - Step 76644: {'lr': 0.0002465878693300986, 'samples': 14715648, 'steps': 76643, 'loss/train': 1.571347951889038} 11/07/2021 08:02:11 - INFO - __main__ - Step 76645: {'lr': 0.00024658256308081946, 'samples': 14715840, 'steps': 76644, 'loss/train': 1.3503916263580322} 11/07/2021 08:02:11 - INFO - __main__ - Step 76646: {'lr': 0.00024657725683308, 'samples': 14716032, 'steps': 76645, 'loss/train': 1.2439981698989868} 11/07/2021 08:02:12 - INFO - __main__ - Step 76647: {'lr': 0.0002465719505868829, 'samples': 14716224, 'steps': 76646, 'loss/train': 1.4597787857055664} 11/07/2021 08:02:12 - INFO - __main__ - Step 76648: {'lr': 0.00024656664434223043, 'samples': 14716416, 'steps': 76647, 'loss/train': 1.0420657396316528} 11/07/2021 08:02:13 - INFO - __main__ - Step 76649: {'lr': 0.00024656133809912494, 'samples': 14716608, 'steps': 76648, 'loss/train': 1.8174464702606201} 11/07/2021 08:02:13 - INFO - __main__ - Step 76650: {'lr': 0.00024655603185756887, 'samples': 14716800, 'steps': 76649, 'loss/train': 1.3307658433914185} 11/07/2021 08:02:13 - INFO - __main__ - Step 76651: {'lr': 0.00024655072561756457, 'samples': 14716992, 'steps': 76650, 'loss/train': 0.7172119617462158} 11/07/2021 08:02:15 - INFO - __main__ - Step 76652: {'lr': 0.0002465454193791145, 'samples': 14717184, 'steps': 76651, 'loss/train': 1.1434423923492432} 11/07/2021 08:02:15 - INFO - __main__ - Step 76653: {'lr': 0.00024654011314222097, 'samples': 14717376, 'steps': 76652, 'loss/train': 1.5075443983078003} 11/07/2021 08:02:15 - INFO - __main__ - Step 76654: {'lr': 0.00024653480690688654, 'samples': 14717568, 'steps': 76653, 'loss/train': 0.6642592549324036} 11/07/2021 08:02:16 - INFO - __main__ - Step 76655: {'lr': 0.00024652950067311337, 'samples': 14717760, 'steps': 76654, 'loss/train': 1.3383814096450806} 11/07/2021 08:02:16 - INFO - __main__ - Step 76656: {'lr': 0.00024652419444090394, 'samples': 14717952, 'steps': 76655, 'loss/train': 1.7738767862319946} 11/07/2021 08:02:17 - INFO - __main__ - Step 76657: {'lr': 0.00024651888821026064, 'samples': 14718144, 'steps': 76656, 'loss/train': 1.3207433223724365} 11/07/2021 08:02:17 - INFO - __main__ - Step 76658: {'lr': 0.0002465135819811859, 'samples': 14718336, 'steps': 76657, 'loss/train': 1.035367727279663} 11/07/2021 08:02:18 - INFO - __main__ - Step 76659: {'lr': 0.0002465082757536821, 'samples': 14718528, 'steps': 76658, 'loss/train': 0.6026999950408936} 11/07/2021 08:02:18 - INFO - __main__ - Step 76660: {'lr': 0.0002465029695277516, 'samples': 14718720, 'steps': 76659, 'loss/train': 1.3964773416519165} 11/07/2021 08:02:18 - INFO - __main__ - Step 76661: {'lr': 0.0002464976633033968, 'samples': 14718912, 'steps': 76660, 'loss/train': 1.3076417446136475} 11/07/2021 08:02:20 - INFO - __main__ - Step 76662: {'lr': 0.0002464923570806201, 'samples': 14719104, 'steps': 76661, 'loss/train': 0.8207348585128784} 11/07/2021 08:02:20 - INFO - __main__ - Step 76663: {'lr': 0.00024648705085942386, 'samples': 14719296, 'steps': 76662, 'loss/train': 1.363863229751587} 11/07/2021 08:02:20 - INFO - __main__ - Step 76664: {'lr': 0.0002464817446398106, 'samples': 14719488, 'steps': 76663, 'loss/train': 1.6540833711624146} 11/07/2021 08:02:21 - INFO - __main__ - Step 76665: {'lr': 0.00024647643842178247, 'samples': 14719680, 'steps': 76664, 'loss/train': 1.4034157991409302} 11/07/2021 08:02:21 - INFO - __main__ - Step 76666: {'lr': 0.0002464711322053421, 'samples': 14719872, 'steps': 76665, 'loss/train': 1.6112196445465088} 11/07/2021 08:02:21 - INFO - __main__ - Step 76667: {'lr': 0.0002464658259904919, 'samples': 14720064, 'steps': 76666, 'loss/train': 1.4636626243591309} 11/07/2021 08:02:23 - INFO - __main__ - Step 76668: {'lr': 0.000246460519777234, 'samples': 14720256, 'steps': 76667, 'loss/train': 1.3585150241851807} 11/07/2021 08:02:23 - INFO - __main__ - Step 76669: {'lr': 0.00024645521356557096, 'samples': 14720448, 'steps': 76668, 'loss/train': 1.1563570499420166} 11/07/2021 08:02:23 - INFO - __main__ - Step 76670: {'lr': 0.0002464499073555051, 'samples': 14720640, 'steps': 76669, 'loss/train': 0.08913744240999222} 11/07/2021 08:02:24 - INFO - __main__ - Step 76671: {'lr': 0.0002464446011470389, 'samples': 14720832, 'steps': 76670, 'loss/train': 1.5149065256118774} 11/07/2021 08:02:24 - INFO - __main__ - Step 76672: {'lr': 0.0002464392949401747, 'samples': 14721024, 'steps': 76671, 'loss/train': 1.059507131576538} 11/07/2021 08:02:25 - INFO - __main__ - Step 76673: {'lr': 0.00024643398873491485, 'samples': 14721216, 'steps': 76672, 'loss/train': 1.5948097705841064} 11/07/2021 08:02:25 - INFO - __main__ - Step 76674: {'lr': 0.00024642868253126185, 'samples': 14721408, 'steps': 76673, 'loss/train': 1.046478271484375} 11/07/2021 08:02:26 - INFO - __main__ - Step 76675: {'lr': 0.000246423376329218, 'samples': 14721600, 'steps': 76674, 'loss/train': 1.1145156621932983} 11/07/2021 08:02:26 - INFO - __main__ - Step 76676: {'lr': 0.00024641807012878574, 'samples': 14721792, 'steps': 76675, 'loss/train': 1.2590599060058594} 11/07/2021 08:02:26 - INFO - __main__ - Step 76677: {'lr': 0.00024641276392996746, 'samples': 14721984, 'steps': 76676, 'loss/train': 1.4751871824264526} 11/07/2021 08:02:27 - INFO - __main__ - Step 76678: {'lr': 0.0002464074577327655, 'samples': 14722176, 'steps': 76677, 'loss/train': 0.8810686469078064} 11/07/2021 08:02:28 - INFO - __main__ - Step 76679: {'lr': 0.0002464021515371823, 'samples': 14722368, 'steps': 76678, 'loss/train': 1.8042490482330322} 11/07/2021 08:02:28 - INFO - __main__ - Step 76680: {'lr': 0.0002463968453432203, 'samples': 14722560, 'steps': 76679, 'loss/train': 1.5836824178695679} 11/07/2021 08:02:28 - INFO - __main__ - Step 76681: {'lr': 0.00024639153915088176, 'samples': 14722752, 'steps': 76680, 'loss/train': 1.3831942081451416} 11/07/2021 08:02:29 - INFO - __main__ - Step 76682: {'lr': 0.00024638623296016914, 'samples': 14722944, 'steps': 76681, 'loss/train': 1.364651083946228} 11/07/2021 08:02:30 - INFO - __main__ - Step 76683: {'lr': 0.0002463809267710848, 'samples': 14723136, 'steps': 76682, 'loss/train': 1.740533471107483} 11/07/2021 08:02:30 - INFO - __main__ - Step 76684: {'lr': 0.0002463756205836312, 'samples': 14723328, 'steps': 76683, 'loss/train': 1.865334391593933} 11/07/2021 08:02:31 - INFO - __main__ - Step 76685: {'lr': 0.00024637031439781067, 'samples': 14723520, 'steps': 76684, 'loss/train': 1.2578744888305664} 11/07/2021 08:02:31 - INFO - __main__ - Step 76686: {'lr': 0.0002463650082136256, 'samples': 14723712, 'steps': 76685, 'loss/train': 1.2790454626083374} 11/07/2021 08:02:31 - INFO - __main__ - Step 76687: {'lr': 0.00024635970203107843, 'samples': 14723904, 'steps': 76686, 'loss/train': 1.196732521057129} 11/07/2021 08:02:32 - INFO - __main__ - Step 76688: {'lr': 0.00024635439585017155, 'samples': 14724096, 'steps': 76687, 'loss/train': 1.001429557800293} 11/07/2021 08:02:33 - INFO - __main__ - Step 76689: {'lr': 0.00024634908967090724, 'samples': 14724288, 'steps': 76688, 'loss/train': 1.48528254032135} 11/07/2021 08:02:33 - INFO - __main__ - Step 76690: {'lr': 0.00024634378349328804, 'samples': 14724480, 'steps': 76689, 'loss/train': 0.7626192569732666} 11/07/2021 08:02:33 - INFO - __main__ - Step 76691: {'lr': 0.00024633847731731623, 'samples': 14724672, 'steps': 76690, 'loss/train': 1.3206279277801514} 11/07/2021 08:02:34 - INFO - __main__ - Step 76692: {'lr': 0.00024633317114299425, 'samples': 14724864, 'steps': 76691, 'loss/train': 1.3468283414840698} 11/07/2021 08:02:34 - INFO - __main__ - Step 76693: {'lr': 0.00024632786497032455, 'samples': 14725056, 'steps': 76692, 'loss/train': 1.4563124179840088} 11/07/2021 08:02:35 - INFO - __main__ - Step 76694: {'lr': 0.0002463225587993095, 'samples': 14725248, 'steps': 76693, 'loss/train': 1.1120356321334839} 11/07/2021 08:02:36 - INFO - __main__ - Step 76695: {'lr': 0.0002463172526299514, 'samples': 14725440, 'steps': 76694, 'loss/train': 1.5484782457351685} 11/07/2021 08:02:36 - INFO - __main__ - Step 76696: {'lr': 0.0002463119464622526, 'samples': 14725632, 'steps': 76695, 'loss/train': 1.0942527055740356} 11/07/2021 08:02:36 - INFO - __main__ - Step 76697: {'lr': 0.00024630664029621563, 'samples': 14725824, 'steps': 76696, 'loss/train': 1.2448046207427979} 11/07/2021 08:02:37 - INFO - __main__ - Step 76698: {'lr': 0.00024630133413184284, 'samples': 14726016, 'steps': 76697, 'loss/train': 1.5364726781845093} 11/07/2021 08:02:38 - INFO - __main__ - Step 76699: {'lr': 0.0002462960279691366, 'samples': 14726208, 'steps': 76698, 'loss/train': 1.4430357217788696} 11/07/2021 08:02:38 - INFO - __main__ - Step 76700: {'lr': 0.0002462907218080993, 'samples': 14726400, 'steps': 76699, 'loss/train': 1.3795617818832397} 11/07/2021 08:02:38 - INFO - __main__ - Step 76701: {'lr': 0.00024628541564873337, 'samples': 14726592, 'steps': 76700, 'loss/train': 1.6026405096054077} 11/07/2021 08:02:39 - INFO - __main__ - Step 76702: {'lr': 0.00024628010949104116, 'samples': 14726784, 'steps': 76701, 'loss/train': 1.9679542779922485} 11/07/2021 08:02:39 - INFO - __main__ - Step 76703: {'lr': 0.00024627480333502507, 'samples': 14726976, 'steps': 76702, 'loss/train': 1.6213657855987549} 11/07/2021 08:02:39 - INFO - __main__ - Step 76704: {'lr': 0.0002462694971806875, 'samples': 14727168, 'steps': 76703, 'loss/train': 1.4787118434906006} 11/07/2021 08:02:40 - INFO - __main__ - Step 76705: {'lr': 0.00024626419102803085, 'samples': 14727360, 'steps': 76704, 'loss/train': 0.6005276441574097} 11/07/2021 08:02:41 - INFO - __main__ - Step 76706: {'lr': 0.0002462588848770575, 'samples': 14727552, 'steps': 76705, 'loss/train': 1.449279546737671} 11/07/2021 08:02:41 - INFO - __main__ - Step 76707: {'lr': 0.00024625357872776996, 'samples': 14727744, 'steps': 76706, 'loss/train': 0.13440662622451782} 11/07/2021 08:02:41 - INFO - __main__ - Step 76708: {'lr': 0.0002462482725801703, 'samples': 14727936, 'steps': 76707, 'loss/train': 1.5100070238113403} 11/07/2021 08:02:42 - INFO - __main__ - Step 76709: {'lr': 0.0002462429664342612, 'samples': 14728128, 'steps': 76708, 'loss/train': 1.1980469226837158} 11/07/2021 08:02:43 - INFO - __main__ - Step 76710: {'lr': 0.000246237660290045, 'samples': 14728320, 'steps': 76709, 'loss/train': 1.3456650972366333} 11/07/2021 08:02:43 - INFO - __main__ - Step 76711: {'lr': 0.00024623235414752395, 'samples': 14728512, 'steps': 76710, 'loss/train': 1.5172561407089233} 11/07/2021 08:02:44 - INFO - __main__ - Step 76712: {'lr': 0.00024622704800670057, 'samples': 14728704, 'steps': 76711, 'loss/train': 1.2026206254959106} 11/07/2021 08:02:44 - INFO - __main__ - Step 76713: {'lr': 0.0002462217418675772, 'samples': 14728896, 'steps': 76712, 'loss/train': 1.5312399864196777} 11/07/2021 08:02:44 - INFO - __main__ - Step 76714: {'lr': 0.00024621643573015633, 'samples': 14729088, 'steps': 76713, 'loss/train': 0.939614474773407} 11/07/2021 08:02:45 - INFO - __main__ - Step 76715: {'lr': 0.00024621112959444025, 'samples': 14729280, 'steps': 76714, 'loss/train': 1.3980942964553833} 11/07/2021 08:02:46 - INFO - __main__ - Step 76716: {'lr': 0.00024620582346043134, 'samples': 14729472, 'steps': 76715, 'loss/train': 1.698594093322754} 11/07/2021 08:02:46 - INFO - __main__ - Step 76717: {'lr': 0.00024620051732813207, 'samples': 14729664, 'steps': 76716, 'loss/train': 1.2461769580841064} 11/07/2021 08:02:46 - INFO - __main__ - Step 76718: {'lr': 0.00024619521119754475, 'samples': 14729856, 'steps': 76717, 'loss/train': 1.486879587173462} 11/07/2021 08:02:47 - INFO - __main__ - Step 76719: {'lr': 0.0002461899050686719, 'samples': 14730048, 'steps': 76718, 'loss/train': 1.7836129665374756} 11/07/2021 08:02:48 - INFO - __main__ - Step 76720: {'lr': 0.00024618459894151573, 'samples': 14730240, 'steps': 76719, 'loss/train': 0.7466821074485779} 11/07/2021 08:02:48 - INFO - __main__ - Step 76721: {'lr': 0.0002461792928160788, 'samples': 14730432, 'steps': 76720, 'loss/train': 1.2198079824447632} 11/07/2021 08:02:48 - INFO - __main__ - Step 76722: {'lr': 0.00024617398669236337, 'samples': 14730624, 'steps': 76721, 'loss/train': 1.104842185974121} 11/07/2021 08:02:49 - INFO - __main__ - Step 76723: {'lr': 0.0002461686805703719, 'samples': 14730816, 'steps': 76722, 'loss/train': 1.3768633604049683} 11/07/2021 08:02:49 - INFO - __main__ - Step 76724: {'lr': 0.0002461633744501067, 'samples': 14731008, 'steps': 76723, 'loss/train': 1.1273056268692017} 11/07/2021 08:02:50 - INFO - __main__ - Step 76725: {'lr': 0.0002461580683315703, 'samples': 14731200, 'steps': 76724, 'loss/train': 1.4466913938522339} 11/07/2021 08:02:50 - INFO - __main__ - Step 76726: {'lr': 0.00024615276221476496, 'samples': 14731392, 'steps': 76725, 'loss/train': 1.390758752822876} 11/07/2021 08:02:51 - INFO - __main__ - Step 76727: {'lr': 0.0002461474560996931, 'samples': 14731584, 'steps': 76726, 'loss/train': 1.956682801246643} 11/07/2021 08:02:51 - INFO - __main__ - Step 76728: {'lr': 0.00024614214998635724, 'samples': 14731776, 'steps': 76727, 'loss/train': 1.2855634689331055} 11/07/2021 08:02:51 - INFO - __main__ - Step 76729: {'lr': 0.0002461368438747596, 'samples': 14731968, 'steps': 76728, 'loss/train': 1.3724498748779297} 11/07/2021 08:02:53 - INFO - __main__ - Step 76730: {'lr': 0.00024613153776490267, 'samples': 14732160, 'steps': 76729, 'loss/train': 1.4399871826171875} 11/07/2021 08:02:53 - INFO - __main__ - Step 76731: {'lr': 0.0002461262316567888, 'samples': 14732352, 'steps': 76730, 'loss/train': 1.2031302452087402} 11/07/2021 08:02:54 - INFO - __main__ - Step 76732: {'lr': 0.0002461209255504204, 'samples': 14732544, 'steps': 76731, 'loss/train': 1.284171462059021} 11/07/2021 08:02:54 - INFO - __main__ - Step 76733: {'lr': 0.0002461156194457998, 'samples': 14732736, 'steps': 76732, 'loss/train': 0.305054634809494} 11/07/2021 08:02:54 - INFO - __main__ - Step 76734: {'lr': 0.0002461103133429295, 'samples': 14732928, 'steps': 76733, 'loss/train': 1.9754533767700195} 11/07/2021 08:02:55 - INFO - __main__ - Step 76735: {'lr': 0.00024610500724181185, 'samples': 14733120, 'steps': 76734, 'loss/train': 1.200461506843567} 11/07/2021 08:02:56 - INFO - __main__ - Step 76736: {'lr': 0.00024609970114244917, 'samples': 14733312, 'steps': 76735, 'loss/train': 0.10057651251554489} 11/07/2021 08:02:56 - INFO - __main__ - Step 76737: {'lr': 0.00024609439504484393, 'samples': 14733504, 'steps': 76736, 'loss/train': 0.8001716732978821} 11/07/2021 08:02:57 - INFO - __main__ - Step 76738: {'lr': 0.00024608908894899846, 'samples': 14733696, 'steps': 76737, 'loss/train': 1.2778750658035278} 11/07/2021 08:02:57 - INFO - __main__ - Step 76739: {'lr': 0.0002460837828549152, 'samples': 14733888, 'steps': 76738, 'loss/train': 1.291102647781372} 11/07/2021 08:02:57 - INFO - __main__ - Step 76740: {'lr': 0.0002460784767625965, 'samples': 14734080, 'steps': 76739, 'loss/train': 1.2345775365829468} 11/07/2021 08:02:58 - INFO - __main__ - Step 76741: {'lr': 0.0002460731706720449, 'samples': 14734272, 'steps': 76740, 'loss/train': 1.2564960718154907} 11/07/2021 08:02:59 - INFO - __main__ - Step 76742: {'lr': 0.00024606786458326255, 'samples': 14734464, 'steps': 76741, 'loss/train': 1.7426457405090332} 11/07/2021 08:02:59 - INFO - __main__ - Step 76743: {'lr': 0.000246062558496252, 'samples': 14734656, 'steps': 76742, 'loss/train': 1.7728071212768555} 11/07/2021 08:02:59 - INFO - __main__ - Step 76744: {'lr': 0.0002460572524110156, 'samples': 14734848, 'steps': 76743, 'loss/train': 1.505932092666626} 11/07/2021 08:03:00 - INFO - __main__ - Step 76745: {'lr': 0.0002460519463275557, 'samples': 14735040, 'steps': 76744, 'loss/train': 1.2167574167251587} 11/07/2021 08:03:01 - INFO - __main__ - Step 76746: {'lr': 0.00024604664024587474, 'samples': 14735232, 'steps': 76745, 'loss/train': 1.187989354133606} 11/07/2021 08:03:01 - INFO - __main__ - Step 76747: {'lr': 0.0002460413341659751, 'samples': 14735424, 'steps': 76746, 'loss/train': 1.1628621816635132} 11/07/2021 08:03:02 - INFO - __main__ - Step 76748: {'lr': 0.0002460360280878593, 'samples': 14735616, 'steps': 76747, 'loss/train': 1.7984827756881714} 11/07/2021 08:03:02 - INFO - __main__ - Step 76749: {'lr': 0.0002460307220115295, 'samples': 14735808, 'steps': 76748, 'loss/train': 0.19099602103233337} 11/07/2021 08:03:02 - INFO - __main__ - Step 76750: {'lr': 0.00024602541593698817, 'samples': 14736000, 'steps': 76749, 'loss/train': 1.5792913436889648} 11/07/2021 08:03:03 - INFO - __main__ - Step 76751: {'lr': 0.00024602010986423783, 'samples': 14736192, 'steps': 76750, 'loss/train': 1.1679766178131104} 11/07/2021 08:03:04 - INFO - __main__ - Step 76752: {'lr': 0.00024601480379328065, 'samples': 14736384, 'steps': 76751, 'loss/train': 1.3778399229049683} 11/07/2021 08:03:04 - INFO - __main__ - Step 76753: {'lr': 0.0002460094977241192, 'samples': 14736576, 'steps': 76752, 'loss/train': 1.4168452024459839} 11/07/2021 08:03:04 - INFO - __main__ - Step 76754: {'lr': 0.00024600419165675576, 'samples': 14736768, 'steps': 76753, 'loss/train': 1.0924179553985596} 11/07/2021 08:03:05 - INFO - __main__ - Step 76755: {'lr': 0.0002459988855911928, 'samples': 14736960, 'steps': 76754, 'loss/train': 0.4065673351287842} 11/07/2021 08:03:05 - INFO - __main__ - Step 76756: {'lr': 0.0002459935795274327, 'samples': 14737152, 'steps': 76755, 'loss/train': 1.1468924283981323} 11/07/2021 08:03:06 - INFO - __main__ - Step 76757: {'lr': 0.0002459882734654778, 'samples': 14737344, 'steps': 76756, 'loss/train': 1.2569146156311035} 11/07/2021 08:03:06 - INFO - __main__ - Step 76758: {'lr': 0.00024598296740533054, 'samples': 14737536, 'steps': 76757, 'loss/train': 0.9179450273513794} 11/07/2021 08:03:07 - INFO - __main__ - Step 76759: {'lr': 0.00024597766134699326, 'samples': 14737728, 'steps': 76758, 'loss/train': 1.0337741374969482} 11/07/2021 08:03:07 - INFO - __main__ - Step 76760: {'lr': 0.0002459723552904684, 'samples': 14737920, 'steps': 76759, 'loss/train': 1.387908935546875} 11/07/2021 08:03:08 - INFO - __main__ - Step 76761: {'lr': 0.0002459670492357584, 'samples': 14738112, 'steps': 76760, 'loss/train': 1.7826623916625977} 11/07/2021 08:03:09 - INFO - __main__ - Step 76762: {'lr': 0.00024596174318286556, 'samples': 14738304, 'steps': 76761, 'loss/train': 0.835540771484375} 11/07/2021 08:03:09 - INFO - __main__ - Step 76763: {'lr': 0.00024595643713179227, 'samples': 14738496, 'steps': 76762, 'loss/train': 1.7446123361587524} 11/07/2021 08:03:09 - INFO - __main__ - Step 76764: {'lr': 0.00024595113108254093, 'samples': 14738688, 'steps': 76763, 'loss/train': 1.1932026147842407} 11/07/2021 08:03:10 - INFO - __main__ - Step 76765: {'lr': 0.000245945825035114, 'samples': 14738880, 'steps': 76764, 'loss/train': 1.7118688821792603} 11/07/2021 08:03:10 - INFO - __main__ - Step 76766: {'lr': 0.00024594051898951374, 'samples': 14739072, 'steps': 76765, 'loss/train': 1.1819913387298584} 11/07/2021 08:03:11 - INFO - __main__ - Step 76767: {'lr': 0.00024593521294574266, 'samples': 14739264, 'steps': 76766, 'loss/train': 1.1990677118301392} 11/07/2021 08:03:11 - INFO - __main__ - Step 76768: {'lr': 0.0002459299069038031, 'samples': 14739456, 'steps': 76767, 'loss/train': 1.479621410369873} 11/07/2021 08:03:12 - INFO - __main__ - Step 76769: {'lr': 0.00024592460086369746, 'samples': 14739648, 'steps': 76768, 'loss/train': 1.6498048305511475} 11/07/2021 08:03:12 - INFO - __main__ - Step 76770: {'lr': 0.00024591929482542816, 'samples': 14739840, 'steps': 76769, 'loss/train': 1.0332034826278687} 11/07/2021 08:03:12 - INFO - __main__ - Step 76771: {'lr': 0.0002459139887889975, 'samples': 14740032, 'steps': 76770, 'loss/train': 1.4481689929962158} 11/07/2021 08:03:13 - INFO - __main__ - Step 76772: {'lr': 0.000245908682754408, 'samples': 14740224, 'steps': 76771, 'loss/train': 1.0589261054992676} 11/07/2021 08:03:14 - INFO - __main__ - Step 76773: {'lr': 0.00024590337672166196, 'samples': 14740416, 'steps': 76772, 'loss/train': 1.0714242458343506} 11/07/2021 08:03:14 - INFO - __main__ - Step 76774: {'lr': 0.00024589807069076177, 'samples': 14740608, 'steps': 76773, 'loss/train': 1.4221255779266357} 11/07/2021 08:03:14 - INFO - __main__ - Step 76775: {'lr': 0.00024589276466171003, 'samples': 14740800, 'steps': 76774, 'loss/train': 0.6900624632835388} 11/07/2021 08:03:15 - INFO - __main__ - Step 76776: {'lr': 0.00024588745863450876, 'samples': 14740992, 'steps': 76775, 'loss/train': 1.2323378324508667} 11/07/2021 08:03:15 - INFO - __main__ - Step 76777: {'lr': 0.00024588215260916055, 'samples': 14741184, 'steps': 76776, 'loss/train': 1.0489407777786255} 11/07/2021 08:03:16 - INFO - __main__ - Step 76778: {'lr': 0.0002458768465856678, 'samples': 14741376, 'steps': 76777, 'loss/train': 1.5813219547271729} 11/07/2021 08:03:17 - INFO - __main__ - Step 76779: {'lr': 0.00024587154056403287, 'samples': 14741568, 'steps': 76778, 'loss/train': 1.637369155883789} 11/07/2021 08:03:17 - INFO - __main__ - Step 76780: {'lr': 0.00024586623454425817, 'samples': 14741760, 'steps': 76779, 'loss/train': 1.5485496520996094} 11/07/2021 08:03:17 - INFO - __main__ - Step 76781: {'lr': 0.00024586092852634607, 'samples': 14741952, 'steps': 76780, 'loss/train': 1.0762066841125488} 11/07/2021 08:03:18 - INFO - __main__ - Step 76782: {'lr': 0.00024585562251029896, 'samples': 14742144, 'steps': 76781, 'loss/train': 1.1076279878616333} 11/07/2021 08:03:19 - INFO - __main__ - Step 76783: {'lr': 0.0002458503164961193, 'samples': 14742336, 'steps': 76782, 'loss/train': 1.2935209274291992} 11/07/2021 08:03:19 - INFO - __main__ - Step 76784: {'lr': 0.00024584501048380937, 'samples': 14742528, 'steps': 76783, 'loss/train': 1.3249938488006592} 11/07/2021 08:03:19 - INFO - __main__ - Step 76785: {'lr': 0.0002458397044733716, 'samples': 14742720, 'steps': 76784, 'loss/train': 1.0764834880828857} 11/07/2021 08:03:20 - INFO - __main__ - Step 76786: {'lr': 0.0002458343984648084, 'samples': 14742912, 'steps': 76785, 'loss/train': 1.5493054389953613} 11/07/2021 08:03:20 - INFO - __main__ - Step 76787: {'lr': 0.0002458290924581222, 'samples': 14743104, 'steps': 76786, 'loss/train': 1.3206331729888916} 11/07/2021 08:03:21 - INFO - __main__ - Step 76788: {'lr': 0.00024582378645331544, 'samples': 14743296, 'steps': 76787, 'loss/train': 1.7453452348709106} 11/07/2021 08:03:21 - INFO - __main__ - Step 76789: {'lr': 0.00024581848045039027, 'samples': 14743488, 'steps': 76788, 'loss/train': 1.2320716381072998} 11/07/2021 08:03:22 - INFO - __main__ - Step 76790: {'lr': 0.0002458131744493493, 'samples': 14743680, 'steps': 76789, 'loss/train': 1.223210334777832} 11/07/2021 08:03:22 - INFO - __main__ - Step 76791: {'lr': 0.0002458078684501948, 'samples': 14743872, 'steps': 76790, 'loss/train': 1.2636829614639282} 11/07/2021 08:03:22 - INFO - __main__ - Step 76792: {'lr': 0.0002458025624529292, 'samples': 14744064, 'steps': 76791, 'loss/train': 1.198747158050537} 11/07/2021 08:03:23 - INFO - __main__ - Step 76793: {'lr': 0.0002457972564575549, 'samples': 14744256, 'steps': 76792, 'loss/train': 2.124105215072632} 11/07/2021 08:03:24 - INFO - __main__ - Step 76794: {'lr': 0.00024579195046407434, 'samples': 14744448, 'steps': 76793, 'loss/train': 1.269729733467102} 11/07/2021 08:03:24 - INFO - __main__ - Step 76795: {'lr': 0.0002457866444724898, 'samples': 14744640, 'steps': 76794, 'loss/train': 1.8261176347732544} 11/07/2021 08:03:25 - INFO - __main__ - Step 76796: {'lr': 0.0002457813384828038, 'samples': 14744832, 'steps': 76795, 'loss/train': 1.2882393598556519} 11/07/2021 08:03:25 - INFO - __main__ - Step 76797: {'lr': 0.0002457760324950186, 'samples': 14745024, 'steps': 76796, 'loss/train': 1.6120997667312622} 11/07/2021 08:03:25 - INFO - __main__ - Step 76798: {'lr': 0.0002457707265091367, 'samples': 14745216, 'steps': 76797, 'loss/train': 1.4743472337722778} 11/07/2021 08:03:26 - INFO - __main__ - Step 76799: {'lr': 0.0002457654205251604, 'samples': 14745408, 'steps': 76798, 'loss/train': 0.6890661716461182} 11/07/2021 08:03:27 - INFO - __main__ - Step 76800: {'lr': 0.00024576011454309217, 'samples': 14745600, 'steps': 76799, 'loss/train': 1.2517191171646118} 11/07/2021 08:03:27 - INFO - __main__ - Step 76801: {'lr': 0.0002457548085629345, 'samples': 14745792, 'steps': 76800, 'loss/train': 1.2163347005844116} 11/07/2021 08:03:27 - INFO - __main__ - Step 76802: {'lr': 0.0002457495025846895, 'samples': 14745984, 'steps': 76801, 'loss/train': 1.1135468482971191} 11/07/2021 08:03:28 - INFO - __main__ - Step 76803: {'lr': 0.0002457441966083597, 'samples': 14746176, 'steps': 76802, 'loss/train': 1.4091882705688477} 11/07/2021 08:03:29 - INFO - __main__ - Step 76804: {'lr': 0.0002457388906339475, 'samples': 14746368, 'steps': 76803, 'loss/train': 0.7652434706687927} 11/07/2021 08:03:29 - INFO - __main__ - Step 76805: {'lr': 0.0002457335846614553, 'samples': 14746560, 'steps': 76804, 'loss/train': 0.8113361597061157} 11/07/2021 08:03:30 - INFO - __main__ - Step 76806: {'lr': 0.0002457282786908855, 'samples': 14746752, 'steps': 76805, 'loss/train': 1.6003224849700928} 11/07/2021 08:03:30 - INFO - __main__ - Step 76807: {'lr': 0.00024572297272224046, 'samples': 14746944, 'steps': 76806, 'loss/train': 1.1553409099578857} 11/07/2021 08:03:30 - INFO - __main__ - Step 76808: {'lr': 0.0002457176667555226, 'samples': 14747136, 'steps': 76807, 'loss/train': 1.2673949003219604} 11/07/2021 08:03:31 - INFO - __main__ - Step 76809: {'lr': 0.0002457123607907343, 'samples': 14747328, 'steps': 76808, 'loss/train': 5.4859538078308105} 11/07/2021 08:03:32 - INFO - __main__ - Step 76810: {'lr': 0.00024570705482787787, 'samples': 14747520, 'steps': 76809, 'loss/train': 5.443273544311523} 11/07/2021 08:03:32 - INFO - __main__ - Step 76811: {'lr': 0.00024570174886695583, 'samples': 14747712, 'steps': 76810, 'loss/train': 1.473250389099121} 11/07/2021 08:03:33 - INFO - __main__ - Step 76812: {'lr': 0.0002456964429079705, 'samples': 14747904, 'steps': 76811, 'loss/train': 1.3195512294769287} 11/07/2021 08:03:33 - INFO - __main__ - Step 76813: {'lr': 0.0002456911369509243, 'samples': 14748096, 'steps': 76812, 'loss/train': 1.2708039283752441} 11/07/2021 08:03:33 - INFO - __main__ - Step 76814: {'lr': 0.0002456858309958196, 'samples': 14748288, 'steps': 76813, 'loss/train': 1.2218632698059082} 11/07/2021 08:03:34 - INFO - __main__ - Step 76815: {'lr': 0.00024568052504265895, 'samples': 14748480, 'steps': 76814, 'loss/train': 1.7262938022613525} 11/07/2021 08:03:35 - INFO - __main__ - Step 76816: {'lr': 0.0002456752190914444, 'samples': 14748672, 'steps': 76815, 'loss/train': 0.980808675289154} 11/07/2021 08:03:35 - INFO - __main__ - Step 76817: {'lr': 0.0002456699131421786, 'samples': 14748864, 'steps': 76816, 'loss/train': 1.0498096942901611} 11/07/2021 08:03:35 - INFO - __main__ - Step 76818: {'lr': 0.0002456646071948638, 'samples': 14749056, 'steps': 76817, 'loss/train': 1.2671862840652466} 11/07/2021 08:03:36 - INFO - __main__ - Step 76819: {'lr': 0.0002456593012495025, 'samples': 14749248, 'steps': 76818, 'loss/train': 1.4676893949508667} 11/07/2021 08:03:37 - INFO - __main__ - Step 76820: {'lr': 0.00024565399530609706, 'samples': 14749440, 'steps': 76819, 'loss/train': 1.834214448928833} 11/07/2021 08:03:37 - INFO - __main__ - Step 76821: {'lr': 0.0002456486893646499, 'samples': 14749632, 'steps': 76820, 'loss/train': 1.2810649871826172} 11/07/2021 08:03:38 - INFO - __main__ - Step 76822: {'lr': 0.0002456433834251633, 'samples': 14749824, 'steps': 76821, 'loss/train': 1.7881081104278564} 11/07/2021 08:03:38 - INFO - __main__ - Step 76823: {'lr': 0.00024563807748763974, 'samples': 14750016, 'steps': 76822, 'loss/train': 1.8701037168502808} 11/07/2021 08:03:38 - INFO - __main__ - Step 76824: {'lr': 0.00024563277155208163, 'samples': 14750208, 'steps': 76823, 'loss/train': 1.368783712387085} 11/07/2021 08:03:39 - INFO - __main__ - Step 76825: {'lr': 0.0002456274656184913, 'samples': 14750400, 'steps': 76824, 'loss/train': 0.7417070269584656} 11/07/2021 08:03:39 - INFO - __main__ - Step 76826: {'lr': 0.0002456221596868712, 'samples': 14750592, 'steps': 76825, 'loss/train': 1.7941696643829346} 11/07/2021 08:03:40 - INFO - __main__ - Step 76827: {'lr': 0.0002456168537572236, 'samples': 14750784, 'steps': 76826, 'loss/train': 1.486674189567566} 11/07/2021 08:03:40 - INFO - __main__ - Step 76828: {'lr': 0.00024561154782955116, 'samples': 14750976, 'steps': 76827, 'loss/train': 1.4759241342544556} 11/07/2021 08:03:41 - INFO - __main__ - Step 76829: {'lr': 0.000245606241903856, 'samples': 14751168, 'steps': 76828, 'loss/train': 0.803244948387146} 11/07/2021 08:03:41 - INFO - __main__ - Step 76830: {'lr': 0.0002456009359801405, 'samples': 14751360, 'steps': 76829, 'loss/train': 1.7115498781204224} 11/07/2021 08:03:42 - INFO - __main__ - Step 76831: {'lr': 0.00024559563005840723, 'samples': 14751552, 'steps': 76830, 'loss/train': 1.3924096822738647} 11/07/2021 08:03:42 - INFO - __main__ - Step 76832: {'lr': 0.0002455903241386585, 'samples': 14751744, 'steps': 76831, 'loss/train': 0.9643773436546326} 11/07/2021 08:03:43 - INFO - __main__ - Step 76833: {'lr': 0.0002455850182208967, 'samples': 14751936, 'steps': 76832, 'loss/train': 1.1329880952835083} 11/07/2021 08:03:43 - INFO - __main__ - Step 76834: {'lr': 0.0002455797123051242, 'samples': 14752128, 'steps': 76833, 'loss/train': 1.339852213859558} 11/07/2021 08:03:43 - INFO - __main__ - Step 76835: {'lr': 0.0002455744063913434, 'samples': 14752320, 'steps': 76834, 'loss/train': 1.3654959201812744} 11/07/2021 08:03:44 - INFO - __main__ - Step 76836: {'lr': 0.00024556910047955676, 'samples': 14752512, 'steps': 76835, 'loss/train': 1.0473854541778564} 11/07/2021 08:03:45 - INFO - __main__ - Step 76837: {'lr': 0.00024556379456976656, 'samples': 14752704, 'steps': 76836, 'loss/train': 1.573424220085144} 11/07/2021 08:03:45 - INFO - __main__ - Step 76838: {'lr': 0.0002455584886619753, 'samples': 14752896, 'steps': 76837, 'loss/train': 1.4086358547210693} 11/07/2021 08:03:45 - INFO - __main__ - Step 76839: {'lr': 0.00024555318275618527, 'samples': 14753088, 'steps': 76838, 'loss/train': 1.2138149738311768} 11/07/2021 08:03:46 - INFO - __main__ - Step 76840: {'lr': 0.0002455478768523989, 'samples': 14753280, 'steps': 76839, 'loss/train': 1.198025107383728} 11/07/2021 08:03:47 - INFO - __main__ - Step 76841: {'lr': 0.0002455425709506186, 'samples': 14753472, 'steps': 76840, 'loss/train': 1.5535506010055542} 11/07/2021 08:03:47 - INFO - __main__ - Step 76842: {'lr': 0.0002455372650508469, 'samples': 14753664, 'steps': 76841, 'loss/train': 1.1736602783203125} 11/07/2021 08:03:47 - INFO - __main__ - Step 76843: {'lr': 0.0002455319591530859, 'samples': 14753856, 'steps': 76842, 'loss/train': 1.4806410074234009} 11/07/2021 08:03:48 - INFO - __main__ - Step 76844: {'lr': 0.0002455266532573381, 'samples': 14754048, 'steps': 76843, 'loss/train': 1.3252155780792236} 11/07/2021 08:03:48 - INFO - __main__ - Step 76845: {'lr': 0.00024552134736360593, 'samples': 14754240, 'steps': 76844, 'loss/train': 1.5195667743682861} 11/07/2021 08:03:49 - INFO - __main__ - Step 76846: {'lr': 0.0002455160414718918, 'samples': 14754432, 'steps': 76845, 'loss/train': 1.8446989059448242} 11/07/2021 08:03:50 - INFO - __main__ - Step 76847: {'lr': 0.00024551073558219807, 'samples': 14754624, 'steps': 76846, 'loss/train': 1.1733005046844482} 11/07/2021 08:03:50 - INFO - __main__ - Step 76848: {'lr': 0.0002455054296945271, 'samples': 14754816, 'steps': 76847, 'loss/train': 1.5031883716583252} 11/07/2021 08:03:50 - INFO - __main__ - Step 76849: {'lr': 0.0002455001238088814, 'samples': 14755008, 'steps': 76848, 'loss/train': 1.6900622844696045} 11/07/2021 08:03:51 - INFO - __main__ - Step 76850: {'lr': 0.0002454948179252632, 'samples': 14755200, 'steps': 76849, 'loss/train': 1.4668524265289307} 11/07/2021 08:03:52 - INFO - __main__ - Step 76851: {'lr': 0.000245489512043675, 'samples': 14755392, 'steps': 76850, 'loss/train': 1.0857958793640137} 11/07/2021 08:03:52 - INFO - __main__ - Step 76852: {'lr': 0.0002454842061641191, 'samples': 14755584, 'steps': 76851, 'loss/train': 1.1876888275146484} 11/07/2021 08:03:53 - INFO - __main__ - Step 76853: {'lr': 0.0002454789002865981, 'samples': 14755776, 'steps': 76852, 'loss/train': 1.4066447019577026} 11/07/2021 08:03:53 - INFO - __main__ - Step 76854: {'lr': 0.0002454735944111141, 'samples': 14755968, 'steps': 76853, 'loss/train': 1.4414361715316772} 11/07/2021 08:03:53 - INFO - __main__ - Step 76855: {'lr': 0.00024546828853766966, 'samples': 14756160, 'steps': 76854, 'loss/train': 1.7240570783615112} 11/07/2021 08:03:54 - INFO - __main__ - Step 76856: {'lr': 0.00024546298266626715, 'samples': 14756352, 'steps': 76855, 'loss/train': 1.5673657655715942} 11/07/2021 08:03:55 - INFO - __main__ - Step 76857: {'lr': 0.0002454576767969089, 'samples': 14756544, 'steps': 76856, 'loss/train': 1.1979707479476929} 11/07/2021 08:03:55 - INFO - __main__ - Step 76858: {'lr': 0.00024545237092959743, 'samples': 14756736, 'steps': 76857, 'loss/train': 1.2746305465698242} 11/07/2021 08:03:55 - INFO - __main__ - Step 76859: {'lr': 0.000245447065064335, 'samples': 14756928, 'steps': 76858, 'loss/train': 1.5282477140426636} 11/07/2021 08:03:56 - INFO - __main__ - Step 76860: {'lr': 0.000245441759201124, 'samples': 14757120, 'steps': 76859, 'loss/train': 1.6429702043533325} 11/07/2021 08:03:57 - INFO - __main__ - Step 76861: {'lr': 0.00024543645333996694, 'samples': 14757312, 'steps': 76860, 'loss/train': 1.5968984365463257} 11/07/2021 08:03:57 - INFO - __main__ - Step 76862: {'lr': 0.00024543114748086617, 'samples': 14757504, 'steps': 76861, 'loss/train': 1.5302538871765137} 11/07/2021 08:03:58 - INFO - __main__ - Step 76863: {'lr': 0.000245425841623824, 'samples': 14757696, 'steps': 76862, 'loss/train': 1.2176884412765503} 11/07/2021 08:03:58 - INFO - __main__ - Step 76864: {'lr': 0.0002454205357688429, 'samples': 14757888, 'steps': 76863, 'loss/train': 1.43990957736969} 11/07/2021 08:03:58 - INFO - __main__ - Step 76865: {'lr': 0.0002454152299159253, 'samples': 14758080, 'steps': 76864, 'loss/train': 0.07654985785484314} 11/07/2021 08:03:59 - INFO - __main__ - Step 76866: {'lr': 0.0002454099240650734, 'samples': 14758272, 'steps': 76865, 'loss/train': 1.4801673889160156} 11/07/2021 08:04:00 - INFO - __main__ - Step 76867: {'lr': 0.0002454046182162898, 'samples': 14758464, 'steps': 76866, 'loss/train': 1.4682655334472656} 11/07/2021 08:04:00 - INFO - __main__ - Step 76868: {'lr': 0.00024539931236957675, 'samples': 14758656, 'steps': 76867, 'loss/train': 1.6199994087219238} 11/07/2021 08:04:00 - INFO - __main__ - Step 76869: {'lr': 0.0002453940065249368, 'samples': 14758848, 'steps': 76868, 'loss/train': 1.1483479738235474} 11/07/2021 08:04:01 - INFO - __main__ - Step 76870: {'lr': 0.0002453887006823722, 'samples': 14759040, 'steps': 76869, 'loss/train': 1.4184190034866333} 11/07/2021 08:04:01 - INFO - __main__ - Step 76871: {'lr': 0.0002453833948418853, 'samples': 14759232, 'steps': 76870, 'loss/train': 0.17134273052215576} 11/07/2021 08:04:02 - INFO - __main__ - Step 76872: {'lr': 0.0002453780890034786, 'samples': 14759424, 'steps': 76871, 'loss/train': 1.2249608039855957} 11/07/2021 08:04:02 - INFO - __main__ - Step 76873: {'lr': 0.0002453727831671545, 'samples': 14759616, 'steps': 76872, 'loss/train': 0.9359713792800903} 11/07/2021 08:04:03 - INFO - __main__ - Step 76874: {'lr': 0.00024536747733291535, 'samples': 14759808, 'steps': 76873, 'loss/train': 1.4032608270645142} 11/07/2021 08:04:03 - INFO - __main__ - Step 76875: {'lr': 0.00024536217150076357, 'samples': 14760000, 'steps': 76874, 'loss/train': 1.5366166830062866} 11/07/2021 08:04:03 - INFO - __main__ - Step 76876: {'lr': 0.0002453568656707015, 'samples': 14760192, 'steps': 76875, 'loss/train': 1.3825615644454956} 11/07/2021 08:04:05 - INFO - __main__ - Step 76877: {'lr': 0.0002453515598427315, 'samples': 14760384, 'steps': 76876, 'loss/train': 1.4631497859954834} 11/07/2021 08:04:05 - INFO - __main__ - Step 76878: {'lr': 0.00024534625401685607, 'samples': 14760576, 'steps': 76877, 'loss/train': 1.196781873703003} 11/07/2021 08:04:05 - INFO - __main__ - Step 76879: {'lr': 0.00024534094819307753, 'samples': 14760768, 'steps': 76878, 'loss/train': 1.5745725631713867} 11/07/2021 08:04:06 - INFO - __main__ - Step 76880: {'lr': 0.00024533564237139827, 'samples': 14760960, 'steps': 76879, 'loss/train': 0.48412269353866577} 11/07/2021 08:04:06 - INFO - __main__ - Step 76881: {'lr': 0.00024533033655182074, 'samples': 14761152, 'steps': 76880, 'loss/train': 1.05147385597229} 11/07/2021 08:04:07 - INFO - __main__ - Step 76882: {'lr': 0.00024532503073434726, 'samples': 14761344, 'steps': 76881, 'loss/train': 1.3089295625686646} 11/07/2021 08:04:07 - INFO - __main__ - Step 76883: {'lr': 0.0002453197249189803, 'samples': 14761536, 'steps': 76882, 'loss/train': 1.7115262746810913} 11/07/2021 08:04:08 - INFO - __main__ - Step 76884: {'lr': 0.00024531441910572214, 'samples': 14761728, 'steps': 76883, 'loss/train': 1.0743463039398193} 11/07/2021 08:04:08 - INFO - __main__ - Step 76885: {'lr': 0.00024530911329457525, 'samples': 14761920, 'steps': 76884, 'loss/train': 1.4408290386199951} 11/07/2021 08:04:08 - INFO - __main__ - Step 76886: {'lr': 0.000245303807485542, 'samples': 14762112, 'steps': 76885, 'loss/train': 1.4725364446640015} 11/07/2021 08:04:09 - INFO - __main__ - Step 76887: {'lr': 0.0002452985016786248, 'samples': 14762304, 'steps': 76886, 'loss/train': 1.77055823802948} 11/07/2021 08:04:10 - INFO - __main__ - Step 76888: {'lr': 0.000245293195873826, 'samples': 14762496, 'steps': 76887, 'loss/train': 0.6718248724937439} 11/07/2021 08:04:10 - INFO - __main__ - Step 76889: {'lr': 0.000245287890071148, 'samples': 14762688, 'steps': 76888, 'loss/train': 1.6433682441711426} 11/07/2021 08:04:10 - INFO - __main__ - Step 76890: {'lr': 0.0002452825842705932, 'samples': 14762880, 'steps': 76889, 'loss/train': 1.2490406036376953} 11/07/2021 08:04:11 - INFO - __main__ - Step 76891: {'lr': 0.000245277278472164, 'samples': 14763072, 'steps': 76890, 'loss/train': 1.2754764556884766} 11/07/2021 08:04:12 - INFO - __main__ - Step 76892: {'lr': 0.0002452719726758628, 'samples': 14763264, 'steps': 76891, 'loss/train': 1.449884295463562} 11/07/2021 08:04:12 - INFO - __main__ - Step 76893: {'lr': 0.00024526666688169196, 'samples': 14763456, 'steps': 76892, 'loss/train': 1.32686448097229} 11/07/2021 08:04:12 - INFO - __main__ - Step 76894: {'lr': 0.00024526136108965393, 'samples': 14763648, 'steps': 76893, 'loss/train': 1.6991963386535645} 11/07/2021 08:04:13 - INFO - __main__ - Step 76895: {'lr': 0.00024525605529975096, 'samples': 14763840, 'steps': 76894, 'loss/train': 1.2201361656188965} 11/07/2021 08:04:13 - INFO - __main__ - Step 76896: {'lr': 0.0002452507495119857, 'samples': 14764032, 'steps': 76895, 'loss/train': 1.1969770193099976} 11/07/2021 08:04:14 - INFO - __main__ - Step 76897: {'lr': 0.00024524544372636034, 'samples': 14764224, 'steps': 76896, 'loss/train': 1.6322991847991943} 11/07/2021 08:04:15 - INFO - __main__ - Step 76898: {'lr': 0.0002452401379428772, 'samples': 14764416, 'steps': 76897, 'loss/train': 1.725379228591919} 11/07/2021 08:04:15 - INFO - __main__ - Step 76899: {'lr': 0.00024523483216153883, 'samples': 14764608, 'steps': 76898, 'loss/train': 1.429263710975647} 11/07/2021 08:04:15 - INFO - __main__ - Step 76900: {'lr': 0.0002452295263823476, 'samples': 14764800, 'steps': 76899, 'loss/train': 0.06362394243478775} 11/07/2021 08:04:16 - INFO - __main__ - Step 76901: {'lr': 0.00024522422060530583, 'samples': 14764992, 'steps': 76900, 'loss/train': 1.358339548110962} 11/07/2021 08:04:17 - INFO - __main__ - Step 76902: {'lr': 0.00024521891483041597, 'samples': 14765184, 'steps': 76901, 'loss/train': 1.35888671875} 11/07/2021 08:04:17 - INFO - __main__ - Step 76903: {'lr': 0.0002452136090576804, 'samples': 14765376, 'steps': 76902, 'loss/train': 1.6333355903625488} 11/07/2021 08:04:17 - INFO - __main__ - Step 76904: {'lr': 0.0002452083032871015, 'samples': 14765568, 'steps': 76903, 'loss/train': 1.5744194984436035} 11/07/2021 08:04:18 - INFO - __main__ - Step 76905: {'lr': 0.0002452029975186816, 'samples': 14765760, 'steps': 76904, 'loss/train': 0.8303229212760925} 11/07/2021 08:04:18 - INFO - __main__ - Step 76906: {'lr': 0.00024519769175242325, 'samples': 14765952, 'steps': 76905, 'loss/train': 1.3251503705978394} 11/07/2021 08:04:19 - INFO - __main__ - Step 76907: {'lr': 0.00024519238598832874, 'samples': 14766144, 'steps': 76906, 'loss/train': 1.545627474784851} 11/07/2021 08:04:19 - INFO - __main__ - Step 76908: {'lr': 0.0002451870802264004, 'samples': 14766336, 'steps': 76907, 'loss/train': 1.329997181892395} 11/07/2021 08:04:20 - INFO - __main__ - Step 76909: {'lr': 0.00024518177446664085, 'samples': 14766528, 'steps': 76908, 'loss/train': 1.4699362516403198} 11/07/2021 08:04:20 - INFO - __main__ - Step 76910: {'lr': 0.00024517646870905215, 'samples': 14766720, 'steps': 76909, 'loss/train': 1.4864815473556519} 11/07/2021 08:04:20 - INFO - __main__ - Step 76911: {'lr': 0.00024517116295363694, 'samples': 14766912, 'steps': 76910, 'loss/train': 1.0885987281799316} 11/07/2021 08:04:21 - INFO - __main__ - Step 76912: {'lr': 0.00024516585720039746, 'samples': 14767104, 'steps': 76911, 'loss/train': 1.3147423267364502} 11/07/2021 08:04:22 - INFO - __main__ - Step 76913: {'lr': 0.0002451605514493362, 'samples': 14767296, 'steps': 76912, 'loss/train': 1.8628180027008057} 11/07/2021 08:04:22 - INFO - __main__ - Step 76914: {'lr': 0.0002451552457004555, 'samples': 14767488, 'steps': 76913, 'loss/train': 1.4182536602020264} 11/07/2021 08:04:23 - INFO - __main__ - Step 76915: {'lr': 0.0002451499399537578, 'samples': 14767680, 'steps': 76914, 'loss/train': 1.1994245052337646} 11/07/2021 08:04:23 - INFO - __main__ - Step 76916: {'lr': 0.00024514463420924543, 'samples': 14767872, 'steps': 76915, 'loss/train': 0.9824380278587341} 11/07/2021 08:04:23 - INFO - __main__ - Step 76917: {'lr': 0.0002451393284669209, 'samples': 14768064, 'steps': 76916, 'loss/train': 1.3481825590133667} 11/07/2021 08:04:24 - INFO - __main__ - Step 76918: {'lr': 0.0002451340227267864, 'samples': 14768256, 'steps': 76917, 'loss/train': 1.7353925704956055} 11/07/2021 08:04:25 - INFO - __main__ - Step 76919: {'lr': 0.0002451287169888445, 'samples': 14768448, 'steps': 76918, 'loss/train': 1.5241162776947021} 11/07/2021 08:04:25 - INFO - __main__ - Step 76920: {'lr': 0.0002451234112530975, 'samples': 14768640, 'steps': 76919, 'loss/train': 1.2167030572891235} 11/07/2021 08:04:25 - INFO - __main__ - Step 76921: {'lr': 0.0002451181055195478, 'samples': 14768832, 'steps': 76920, 'loss/train': 1.0925891399383545} 11/07/2021 08:04:26 - INFO - __main__ - Step 76922: {'lr': 0.000245112799788198, 'samples': 14769024, 'steps': 76921, 'loss/train': 0.05850300192832947} 11/07/2021 08:04:27 - INFO - __main__ - Step 76923: {'lr': 0.0002451074940590501, 'samples': 14769216, 'steps': 76922, 'loss/train': 2.058838367462158} 11/07/2021 08:04:27 - INFO - __main__ - Step 76924: {'lr': 0.0002451021883321067, 'samples': 14769408, 'steps': 76923, 'loss/train': 1.0186220407485962} 11/07/2021 08:04:27 - INFO - __main__ - Step 76925: {'lr': 0.0002450968826073702, 'samples': 14769600, 'steps': 76924, 'loss/train': 1.3305134773254395} 11/07/2021 08:04:28 - INFO - __main__ - Step 76926: {'lr': 0.00024509157688484297, 'samples': 14769792, 'steps': 76925, 'loss/train': 1.1473878622055054} 11/07/2021 08:04:28 - INFO - __main__ - Step 76927: {'lr': 0.0002450862711645274, 'samples': 14769984, 'steps': 76926, 'loss/train': 1.1302530765533447} 11/07/2021 08:04:29 - INFO - __main__ - Step 76928: {'lr': 0.0002450809654464259, 'samples': 14770176, 'steps': 76927, 'loss/train': 1.2005916833877563} 11/07/2021 08:04:30 - INFO - __main__ - Step 76929: {'lr': 0.0002450756597305408, 'samples': 14770368, 'steps': 76928, 'loss/train': 1.5854460000991821} 11/07/2021 08:04:30 - INFO - __main__ - Step 76930: {'lr': 0.00024507035401687453, 'samples': 14770560, 'steps': 76929, 'loss/train': 1.258776307106018} 11/07/2021 08:04:30 - INFO - __main__ - Step 76931: {'lr': 0.0002450650483054295, 'samples': 14770752, 'steps': 76930, 'loss/train': 1.5109692811965942} 11/07/2021 08:04:31 - INFO - __main__ - Step 76932: {'lr': 0.0002450597425962081, 'samples': 14770944, 'steps': 76931, 'loss/train': 1.1945048570632935} 11/07/2021 08:04:32 - INFO - __main__ - Step 76933: {'lr': 0.00024505443688921266, 'samples': 14771136, 'steps': 76932, 'loss/train': 1.443285584449768} 11/07/2021 08:04:32 - INFO - __main__ - Step 76934: {'lr': 0.00024504913118444564, 'samples': 14771328, 'steps': 76933, 'loss/train': 2.087686538696289} 11/07/2021 08:04:32 - INFO - __main__ - Step 76935: {'lr': 0.0002450438254819094, 'samples': 14771520, 'steps': 76934, 'loss/train': 1.5928337574005127} 11/07/2021 08:04:33 - INFO - __main__ - Step 76936: {'lr': 0.0002450385197816065, 'samples': 14771712, 'steps': 76935, 'loss/train': 1.345129370689392} 11/07/2021 08:04:33 - INFO - __main__ - Step 76937: {'lr': 0.00024503321408353895, 'samples': 14771904, 'steps': 76936, 'loss/train': 1.276594638824463} 11/07/2021 08:04:35 - INFO - __main__ - Step 76938: {'lr': 0.00024502790838770944, 'samples': 14772096, 'steps': 76937, 'loss/train': 1.0992586612701416} 11/07/2021 08:04:35 - INFO - __main__ - Step 76939: {'lr': 0.0002450226026941202, 'samples': 14772288, 'steps': 76938, 'loss/train': 1.2140003442764282} 11/07/2021 08:04:36 - INFO - __main__ - Step 76940: {'lr': 0.00024501729700277376, 'samples': 14772480, 'steps': 76939, 'loss/train': 1.5024678707122803} 11/07/2021 08:04:36 - INFO - __main__ - Step 76941: {'lr': 0.0002450119913136725, 'samples': 14772672, 'steps': 76940, 'loss/train': 1.7962745428085327} 11/07/2021 08:04:36 - INFO - __main__ - Step 76942: {'lr': 0.00024500668562681864, 'samples': 14772864, 'steps': 76941, 'loss/train': 1.807651400566101} 11/07/2021 08:04:37 - INFO - __main__ - Step 76943: {'lr': 0.0002450013799422148, 'samples': 14773056, 'steps': 76942, 'loss/train': 1.4031862020492554} 11/07/2021 08:04:37 - INFO - __main__ - Step 76944: {'lr': 0.00024499607425986316, 'samples': 14773248, 'steps': 76943, 'loss/train': 1.438443660736084} 11/07/2021 08:04:38 - INFO - __main__ - Step 76945: {'lr': 0.0002449907685797663, 'samples': 14773440, 'steps': 76944, 'loss/train': 1.291234016418457} 11/07/2021 08:04:38 - INFO - __main__ - Step 76946: {'lr': 0.00024498546290192645, 'samples': 14773632, 'steps': 76945, 'loss/train': 1.2523618936538696} 11/07/2021 08:04:39 - INFO - __main__ - Step 76947: {'lr': 0.0002449801572263461, 'samples': 14773824, 'steps': 76946, 'loss/train': 1.3303213119506836} 11/07/2021 08:04:39 - INFO - __main__ - Step 76948: {'lr': 0.0002449748515530276, 'samples': 14774016, 'steps': 76947, 'loss/train': 1.2593556642532349} 11/07/2021 08:04:39 - INFO - __main__ - Step 76949: {'lr': 0.0002449695458819735, 'samples': 14774208, 'steps': 76948, 'loss/train': 1.2611387968063354} 11/07/2021 08:04:40 - INFO - __main__ - Step 76950: {'lr': 0.00024496424021318595, 'samples': 14774400, 'steps': 76949, 'loss/train': 1.102180004119873} 11/07/2021 08:04:41 - INFO - __main__ - Step 76951: {'lr': 0.0002449589345466674, 'samples': 14774592, 'steps': 76950, 'loss/train': 1.8911654949188232} 11/07/2021 08:04:41 - INFO - __main__ - Step 76952: {'lr': 0.00024495362888242027, 'samples': 14774784, 'steps': 76951, 'loss/train': 0.9576366543769836} 11/07/2021 08:04:41 - INFO - __main__ - Step 76953: {'lr': 0.000244948323220447, 'samples': 14774976, 'steps': 76952, 'loss/train': 1.6792160272598267} 11/07/2021 08:04:42 - INFO - __main__ - Step 76954: {'lr': 0.0002449430175607499, 'samples': 14775168, 'steps': 76953, 'loss/train': 1.3542288541793823} 11/07/2021 08:04:43 - INFO - __main__ - Step 76955: {'lr': 0.0002449377119033314, 'samples': 14775360, 'steps': 76954, 'loss/train': 1.1939189434051514} 11/07/2021 08:04:43 - INFO - __main__ - Step 76956: {'lr': 0.0002449324062481939, 'samples': 14775552, 'steps': 76955, 'loss/train': 1.127403736114502} 11/07/2021 08:04:44 - INFO - __main__ - Step 76957: {'lr': 0.00024492710059533976, 'samples': 14775744, 'steps': 76956, 'loss/train': 1.475690484046936} 11/07/2021 08:04:44 - INFO - __main__ - Step 76958: {'lr': 0.0002449217949447714, 'samples': 14775936, 'steps': 76957, 'loss/train': 1.4330335855484009} 11/07/2021 08:04:44 - INFO - __main__ - Step 76959: {'lr': 0.0002449164892964912, 'samples': 14776128, 'steps': 76958, 'loss/train': 0.618186891078949} 11/07/2021 08:04:45 - INFO - __main__ - Step 76960: {'lr': 0.00024491118365050154, 'samples': 14776320, 'steps': 76959, 'loss/train': 1.2793655395507812} 11/07/2021 08:04:46 - INFO - __main__ - Step 76961: {'lr': 0.00024490587800680486, 'samples': 14776512, 'steps': 76960, 'loss/train': 0.28845223784446716} 11/07/2021 08:04:46 - INFO - __main__ - Step 76962: {'lr': 0.0002449005723654035, 'samples': 14776704, 'steps': 76961, 'loss/train': 1.3476678133010864} 11/07/2021 08:04:46 - INFO - __main__ - Step 76963: {'lr': 0.0002448952667262999, 'samples': 14776896, 'steps': 76962, 'loss/train': 0.6114858984947205} 11/07/2021 08:04:47 - INFO - __main__ - Step 76964: {'lr': 0.0002448899610894964, 'samples': 14777088, 'steps': 76963, 'loss/train': 1.328792691230774} 11/07/2021 08:04:48 - INFO - __main__ - Step 76965: {'lr': 0.0002448846554549954, 'samples': 14777280, 'steps': 76964, 'loss/train': 0.8833823204040527} 11/07/2021 08:04:48 - INFO - __main__ - Step 76966: {'lr': 0.00024487934982279924, 'samples': 14777472, 'steps': 76965, 'loss/train': 1.3880270719528198} 11/07/2021 08:04:49 - INFO - __main__ - Step 76967: {'lr': 0.0002448740441929104, 'samples': 14777664, 'steps': 76966, 'loss/train': 0.12067818641662598} 11/07/2021 08:04:49 - INFO - __main__ - Step 76968: {'lr': 0.0002448687385653312, 'samples': 14777856, 'steps': 76967, 'loss/train': 2.6235461235046387} 11/07/2021 08:04:49 - INFO - __main__ - Step 76969: {'lr': 0.0002448634329400641, 'samples': 14778048, 'steps': 76968, 'loss/train': 1.1092267036437988} 11/07/2021 08:04:50 - INFO - __main__ - Step 76970: {'lr': 0.0002448581273171115, 'samples': 14778240, 'steps': 76969, 'loss/train': 1.1909406185150146} 11/07/2021 08:04:51 - INFO - __main__ - Step 76971: {'lr': 0.00024485282169647567, 'samples': 14778432, 'steps': 76970, 'loss/train': 1.539265513420105} 11/07/2021 08:04:51 - INFO - __main__ - Step 76972: {'lr': 0.0002448475160781591, 'samples': 14778624, 'steps': 76971, 'loss/train': 1.4587475061416626} 11/07/2021 08:04:51 - INFO - __main__ - Step 76973: {'lr': 0.0002448422104621642, 'samples': 14778816, 'steps': 76972, 'loss/train': 1.0606558322906494} 11/07/2021 08:04:52 - INFO - __main__ - Step 76974: {'lr': 0.0002448369048484933, 'samples': 14779008, 'steps': 76973, 'loss/train': 1.553078055381775} 11/07/2021 08:04:52 - INFO - __main__ - Step 76975: {'lr': 0.0002448315992371488, 'samples': 14779200, 'steps': 76974, 'loss/train': 1.385348916053772} 11/07/2021 08:04:53 - INFO - __main__ - Step 76976: {'lr': 0.0002448262936281332, 'samples': 14779392, 'steps': 76975, 'loss/train': 1.558933138847351} 11/07/2021 08:04:53 - INFO - __main__ - Step 76977: {'lr': 0.0002448209880214487, 'samples': 14779584, 'steps': 76976, 'loss/train': 1.4963696002960205} 11/07/2021 08:04:54 - INFO - __main__ - Step 76978: {'lr': 0.0002448156824170978, 'samples': 14779776, 'steps': 76977, 'loss/train': 1.0530407428741455} 11/07/2021 08:04:54 - INFO - __main__ - Step 76979: {'lr': 0.00024481037681508286, 'samples': 14779968, 'steps': 76978, 'loss/train': 1.6334797143936157} 11/07/2021 08:04:54 - INFO - __main__ - Step 76980: {'lr': 0.00024480507121540625, 'samples': 14780160, 'steps': 76979, 'loss/train': 1.3903037309646606} 11/07/2021 08:04:56 - INFO - __main__ - Step 76981: {'lr': 0.00024479976561807043, 'samples': 14780352, 'steps': 76980, 'loss/train': 0.7785665392875671} 11/07/2021 08:04:56 - INFO - __main__ - Step 76982: {'lr': 0.00024479446002307774, 'samples': 14780544, 'steps': 76981, 'loss/train': 1.593837022781372} 11/07/2021 08:04:56 - INFO - __main__ - Step 76983: {'lr': 0.0002447891544304306, 'samples': 14780736, 'steps': 76982, 'loss/train': 1.315466284751892} 11/07/2021 08:04:57 - INFO - __main__ - Step 76984: {'lr': 0.0002447838488401314, 'samples': 14780928, 'steps': 76983, 'loss/train': 1.2033195495605469} 11/07/2021 08:04:57 - INFO - __main__ - Step 76985: {'lr': 0.00024477854325218246, 'samples': 14781120, 'steps': 76984, 'loss/train': 1.0105469226837158} 11/07/2021 08:04:58 - INFO - __main__ - Step 76986: {'lr': 0.0002447732376665863, 'samples': 14781312, 'steps': 76985, 'loss/train': 0.05331409350037575} 11/07/2021 08:04:58 - INFO - __main__ - Step 76987: {'lr': 0.0002447679320833452, 'samples': 14781504, 'steps': 76986, 'loss/train': 1.1390658617019653} 11/07/2021 08:04:59 - INFO - __main__ - Step 76988: {'lr': 0.00024476262650246166, 'samples': 14781696, 'steps': 76987, 'loss/train': 1.276497483253479} 11/07/2021 08:04:59 - INFO - __main__ - Step 76989: {'lr': 0.00024475732092393794, 'samples': 14781888, 'steps': 76988, 'loss/train': 1.01667058467865} 11/07/2021 08:04:59 - INFO - __main__ - Step 76990: {'lr': 0.00024475201534777653, 'samples': 14782080, 'steps': 76989, 'loss/train': 1.5409717559814453} 11/07/2021 08:05:01 - INFO - __main__ - Step 76991: {'lr': 0.0002447467097739797, 'samples': 14782272, 'steps': 76990, 'loss/train': 1.2484931945800781} 11/07/2021 08:05:01 - INFO - __main__ - Step 76992: {'lr': 0.00024474140420255, 'samples': 14782464, 'steps': 76991, 'loss/train': 1.3283965587615967} 11/07/2021 08:05:01 - INFO - __main__ - Step 76993: {'lr': 0.0002447360986334897, 'samples': 14782656, 'steps': 76992, 'loss/train': 1.261622667312622} 11/07/2021 08:05:02 - INFO - __main__ - Step 76994: {'lr': 0.0002447307930668012, 'samples': 14782848, 'steps': 76993, 'loss/train': 1.808642864227295} 11/07/2021 08:05:02 - INFO - __main__ - Step 76995: {'lr': 0.00024472548750248695, 'samples': 14783040, 'steps': 76994, 'loss/train': 1.2320916652679443} 11/07/2021 08:05:03 - INFO - __main__ - Step 76996: {'lr': 0.00024472018194054935, 'samples': 14783232, 'steps': 76995, 'loss/train': 0.8354291915893555} 11/07/2021 08:05:03 - INFO - __main__ - Step 76997: {'lr': 0.0002447148763809907, 'samples': 14783424, 'steps': 76996, 'loss/train': 1.638268232345581} 11/07/2021 08:05:04 - INFO - __main__ - Step 76998: {'lr': 0.00024470957082381353, 'samples': 14783616, 'steps': 76997, 'loss/train': 1.359287977218628} 11/07/2021 08:05:04 - INFO - __main__ - Step 76999: {'lr': 0.00024470426526902007, 'samples': 14783808, 'steps': 76998, 'loss/train': 1.1498854160308838} 11/07/2021 08:05:04 - INFO - __main__ - Step 77000: {'lr': 0.00024469895971661283, 'samples': 14784000, 'steps': 76999, 'loss/train': 1.328945517539978} 11/07/2021 08:05:05 - INFO - __main__ - Step 77001: {'lr': 0.00024469365416659414, 'samples': 14784192, 'steps': 77000, 'loss/train': 0.9817360043525696} 11/07/2021 08:05:06 - INFO - __main__ - Step 77002: {'lr': 0.0002446883486189664, 'samples': 14784384, 'steps': 77001, 'loss/train': 1.208369255065918} 11/07/2021 08:05:06 - INFO - __main__ - Step 77003: {'lr': 0.00024468304307373207, 'samples': 14784576, 'steps': 77002, 'loss/train': 1.6005783081054688} 11/07/2021 08:05:06 - INFO - __main__ - Step 77004: {'lr': 0.00024467773753089335, 'samples': 14784768, 'steps': 77003, 'loss/train': 1.05034601688385} 11/07/2021 08:05:07 - INFO - __main__ - Step 77005: {'lr': 0.0002446724319904529, 'samples': 14784960, 'steps': 77004, 'loss/train': 1.3828831911087036} 11/07/2021 08:05:07 - INFO - __main__ - Step 77006: {'lr': 0.00024466712645241284, 'samples': 14785152, 'steps': 77005, 'loss/train': 1.460713267326355} 11/07/2021 08:05:08 - INFO - __main__ - Step 77007: {'lr': 0.00024466182091677577, 'samples': 14785344, 'steps': 77006, 'loss/train': 0.0640299916267395} 11/07/2021 08:05:09 - INFO - __main__ - Step 77008: {'lr': 0.00024465651538354394, 'samples': 14785536, 'steps': 77007, 'loss/train': 1.1755805015563965} 11/07/2021 08:05:09 - INFO - __main__ - Step 77009: {'lr': 0.00024465120985271986, 'samples': 14785728, 'steps': 77008, 'loss/train': 1.3961265087127686} 11/07/2021 08:05:09 - INFO - __main__ - Step 77010: {'lr': 0.00024464590432430586, 'samples': 14785920, 'steps': 77009, 'loss/train': 1.035172700881958} 11/07/2021 08:05:10 - INFO - __main__ - Step 77011: {'lr': 0.0002446405987983043, 'samples': 14786112, 'steps': 77010, 'loss/train': 1.6632879972457886} 11/07/2021 08:05:11 - INFO - __main__ - Step 77012: {'lr': 0.0002446352932747176, 'samples': 14786304, 'steps': 77011, 'loss/train': 1.4461389780044556} 11/07/2021 08:05:11 - INFO - __main__ - Step 77013: {'lr': 0.0002446299877535482, 'samples': 14786496, 'steps': 77012, 'loss/train': 0.6142511367797852} 11/07/2021 08:05:11 - INFO - __main__ - Step 77014: {'lr': 0.0002446246822347984, 'samples': 14786688, 'steps': 77013, 'loss/train': 1.86135995388031} 11/07/2021 08:05:12 - INFO - __main__ - Step 77015: {'lr': 0.00024461937671847066, 'samples': 14786880, 'steps': 77014, 'loss/train': 1.4189670085906982} 11/07/2021 08:05:12 - INFO - __main__ - Step 77016: {'lr': 0.00024461407120456735, 'samples': 14787072, 'steps': 77015, 'loss/train': 1.3620116710662842} 11/07/2021 08:05:13 - INFO - __main__ - Step 77017: {'lr': 0.00024460876569309085, 'samples': 14787264, 'steps': 77016, 'loss/train': 1.4381818771362305} 11/07/2021 08:05:13 - INFO - __main__ - Step 77018: {'lr': 0.00024460346018404357, 'samples': 14787456, 'steps': 77017, 'loss/train': 1.221471905708313} 11/07/2021 08:05:14 - INFO - __main__ - Step 77019: {'lr': 0.0002445981546774278, 'samples': 14787648, 'steps': 77018, 'loss/train': 1.4376636743545532} 11/07/2021 08:05:14 - INFO - __main__ - Step 77020: {'lr': 0.0002445928491732462, 'samples': 14787840, 'steps': 77019, 'loss/train': 0.8712760806083679} 11/07/2021 08:05:14 - INFO - __main__ - Step 77021: {'lr': 0.0002445875436715008, 'samples': 14788032, 'steps': 77020, 'loss/train': 1.0977952480316162} 11/07/2021 08:05:16 - INFO - __main__ - Step 77022: {'lr': 0.0002445822381721943, 'samples': 14788224, 'steps': 77021, 'loss/train': 1.7493480443954468} 11/07/2021 08:05:16 - INFO - __main__ - Step 77023: {'lr': 0.0002445769326753289, 'samples': 14788416, 'steps': 77022, 'loss/train': 1.3983063697814941} 11/07/2021 08:05:16 - INFO - __main__ - Step 77024: {'lr': 0.00024457162718090705, 'samples': 14788608, 'steps': 77023, 'loss/train': 1.2981511354446411} 11/07/2021 08:05:17 - INFO - __main__ - Step 77025: {'lr': 0.0002445663216889311, 'samples': 14788800, 'steps': 77024, 'loss/train': 1.3641622066497803} 11/07/2021 08:05:17 - INFO - __main__ - Step 77026: {'lr': 0.00024456101619940355, 'samples': 14788992, 'steps': 77025, 'loss/train': 1.7861838340759277} 11/07/2021 08:05:18 - INFO - __main__ - Step 77027: {'lr': 0.00024455571071232664, 'samples': 14789184, 'steps': 77026, 'loss/train': 1.0376733541488647} 11/07/2021 08:05:18 - INFO - __main__ - Step 77028: {'lr': 0.0002445504052277029, 'samples': 14789376, 'steps': 77027, 'loss/train': 1.5089868307113647} 11/07/2021 08:05:19 - INFO - __main__ - Step 77029: {'lr': 0.0002445450997455347, 'samples': 14789568, 'steps': 77028, 'loss/train': 1.545082926750183} 11/07/2021 08:05:19 - INFO - __main__ - Step 77030: {'lr': 0.00024453979426582433, 'samples': 14789760, 'steps': 77029, 'loss/train': 1.165966272354126} 11/07/2021 08:05:19 - INFO - __main__ - Step 77031: {'lr': 0.00024453448878857437, 'samples': 14789952, 'steps': 77030, 'loss/train': 1.0760854482650757} 11/07/2021 08:05:20 - INFO - __main__ - Step 77032: {'lr': 0.00024452918331378695, 'samples': 14790144, 'steps': 77031, 'loss/train': 1.606485366821289} 11/07/2021 08:05:21 - INFO - __main__ - Step 77033: {'lr': 0.0002445238778414646, 'samples': 14790336, 'steps': 77032, 'loss/train': 1.662955641746521} 11/07/2021 08:05:21 - INFO - __main__ - Step 77034: {'lr': 0.00024451857237160974, 'samples': 14790528, 'steps': 77033, 'loss/train': 1.2188911437988281} 11/07/2021 08:05:21 - INFO - __main__ - Step 77035: {'lr': 0.0002445132669042247, 'samples': 14790720, 'steps': 77034, 'loss/train': 1.3713712692260742} 11/07/2021 08:05:22 - INFO - __main__ - Step 77036: {'lr': 0.00024450796143931193, 'samples': 14790912, 'steps': 77035, 'loss/train': 1.6719160079956055} 11/07/2021 08:05:22 - INFO - __main__ - Step 77037: {'lr': 0.00024450265597687374, 'samples': 14791104, 'steps': 77036, 'loss/train': 2.277939558029175} 11/07/2021 08:05:23 - INFO - __main__ - Step 77038: {'lr': 0.00024449735051691263, 'samples': 14791296, 'steps': 77037, 'loss/train': 1.5240164995193481} 11/07/2021 08:05:23 - INFO - __main__ - Step 77039: {'lr': 0.00024449204505943087, 'samples': 14791488, 'steps': 77038, 'loss/train': 1.3053297996520996} 11/07/2021 08:05:24 - INFO - __main__ - Step 77040: {'lr': 0.00024448673960443095, 'samples': 14791680, 'steps': 77039, 'loss/train': 0.9574661254882812} 11/07/2021 08:05:24 - INFO - __main__ - Step 77041: {'lr': 0.0002444814341519152, 'samples': 14791872, 'steps': 77040, 'loss/train': 0.8414875268936157} 11/07/2021 08:05:25 - INFO - __main__ - Step 77042: {'lr': 0.000244476128701886, 'samples': 14792064, 'steps': 77041, 'loss/train': 1.1279499530792236} 11/07/2021 08:05:25 - INFO - __main__ - Step 77043: {'lr': 0.00024447082325434593, 'samples': 14792256, 'steps': 77042, 'loss/train': 0.7041897773742676} 11/07/2021 08:05:26 - INFO - __main__ - Step 77044: {'lr': 0.0002444655178092971, 'samples': 14792448, 'steps': 77043, 'loss/train': 1.2537028789520264} 11/07/2021 08:05:26 - INFO - __main__ - Step 77045: {'lr': 0.00024446021236674203, 'samples': 14792640, 'steps': 77044, 'loss/train': 1.4929614067077637} 11/07/2021 08:05:26 - INFO - __main__ - Step 77046: {'lr': 0.0002444549069266831, 'samples': 14792832, 'steps': 77045, 'loss/train': 1.5071887969970703} 11/07/2021 08:05:27 - INFO - __main__ - Step 77047: {'lr': 0.00024444960148912266, 'samples': 14793024, 'steps': 77046, 'loss/train': 1.0201588869094849} 11/07/2021 08:05:28 - INFO - __main__ - Step 77048: {'lr': 0.00024444429605406323, 'samples': 14793216, 'steps': 77047, 'loss/train': 1.3538590669631958} 11/07/2021 08:05:28 - INFO - __main__ - Step 77049: {'lr': 0.00024443899062150706, 'samples': 14793408, 'steps': 77048, 'loss/train': 1.3225842714309692} 11/07/2021 08:05:29 - INFO - __main__ - Step 77050: {'lr': 0.0002444336851914566, 'samples': 14793600, 'steps': 77049, 'loss/train': 1.83426833152771} 11/07/2021 08:05:29 - INFO - __main__ - Step 77051: {'lr': 0.0002444283797639142, 'samples': 14793792, 'steps': 77050, 'loss/train': 1.3330873250961304} 11/07/2021 08:05:29 - INFO - __main__ - Step 77052: {'lr': 0.00024442307433888234, 'samples': 14793984, 'steps': 77051, 'loss/train': 0.040904924273490906} 11/07/2021 08:05:30 - INFO - __main__ - Step 77053: {'lr': 0.00024441776891636333, 'samples': 14794176, 'steps': 77052, 'loss/train': 1.5652263164520264} 11/07/2021 08:05:31 - INFO - __main__ - Step 77054: {'lr': 0.0002444124634963596, 'samples': 14794368, 'steps': 77053, 'loss/train': 1.644631266593933} 11/07/2021 08:05:31 - INFO - __main__ - Step 77055: {'lr': 0.00024440715807887354, 'samples': 14794560, 'steps': 77054, 'loss/train': 1.431357741355896} 11/07/2021 08:05:31 - INFO - __main__ - Step 77056: {'lr': 0.0002444018526639075, 'samples': 14794752, 'steps': 77055, 'loss/train': 0.9996850490570068} 11/07/2021 08:05:32 - INFO - __main__ - Step 77057: {'lr': 0.000244396547251464, 'samples': 14794944, 'steps': 77056, 'loss/train': 1.599865198135376} 11/07/2021 08:05:33 - INFO - __main__ - Step 77058: {'lr': 0.00024439124184154527, 'samples': 14795136, 'steps': 77057, 'loss/train': 1.7284107208251953} 11/07/2021 08:05:33 - INFO - __main__ - Step 77059: {'lr': 0.0002443859364341537, 'samples': 14795328, 'steps': 77058, 'loss/train': 1.5562244653701782} 11/07/2021 08:05:34 - INFO - __main__ - Step 77060: {'lr': 0.0002443806310292918, 'samples': 14795520, 'steps': 77059, 'loss/train': 1.267248511314392} 11/07/2021 08:05:34 - INFO - __main__ - Step 77061: {'lr': 0.0002443753256269618, 'samples': 14795712, 'steps': 77060, 'loss/train': 1.1319782733917236} 11/07/2021 08:05:34 - INFO - __main__ - Step 77062: {'lr': 0.00024437002022716634, 'samples': 14795904, 'steps': 77061, 'loss/train': 2.0104761123657227} 11/07/2021 08:05:35 - INFO - __main__ - Step 77063: {'lr': 0.00024436471482990757, 'samples': 14796096, 'steps': 77062, 'loss/train': 0.9117782711982727} 11/07/2021 08:05:36 - INFO - __main__ - Step 77064: {'lr': 0.000244359409435188, 'samples': 14796288, 'steps': 77063, 'loss/train': 1.2544325590133667} 11/07/2021 08:05:36 - INFO - __main__ - Step 77065: {'lr': 0.00024435410404301, 'samples': 14796480, 'steps': 77064, 'loss/train': 1.537714958190918} 11/07/2021 08:05:36 - INFO - __main__ - Step 77066: {'lr': 0.0002443487986533759, 'samples': 14796672, 'steps': 77065, 'loss/train': 1.3376342058181763} 11/07/2021 08:05:37 - INFO - __main__ - Step 77067: {'lr': 0.00024434349326628817, 'samples': 14796864, 'steps': 77066, 'loss/train': 1.5218833684921265} 11/07/2021 08:05:37 - INFO - __main__ - Step 77068: {'lr': 0.0002443381878817492, 'samples': 14797056, 'steps': 77067, 'loss/train': 0.5281939506530762} 11/07/2021 08:05:38 - INFO - __main__ - Step 77069: {'lr': 0.0002443328824997613, 'samples': 14797248, 'steps': 77068, 'loss/train': 1.5787198543548584} 11/07/2021 08:05:38 - INFO - __main__ - Step 77070: {'lr': 0.0002443275771203271, 'samples': 14797440, 'steps': 77069, 'loss/train': 1.4363547563552856} 11/07/2021 08:05:39 - INFO - __main__ - Step 77071: {'lr': 0.00024432227174344865, 'samples': 14797632, 'steps': 77070, 'loss/train': 1.4578860998153687} 11/07/2021 08:05:39 - INFO - __main__ - Step 77072: {'lr': 0.0002443169663691285, 'samples': 14797824, 'steps': 77071, 'loss/train': 1.8261866569519043} 11/07/2021 08:05:39 - INFO - __main__ - Step 77073: {'lr': 0.0002443116609973691, 'samples': 14798016, 'steps': 77072, 'loss/train': 1.0055640935897827} 11/07/2021 08:05:40 - INFO - __main__ - Step 77074: {'lr': 0.0002443063556281727, 'samples': 14798208, 'steps': 77073, 'loss/train': 1.1506941318511963} 11/07/2021 08:05:41 - INFO - __main__ - Step 77075: {'lr': 0.00024430105026154177, 'samples': 14798400, 'steps': 77074, 'loss/train': 1.5054328441619873} 11/07/2021 08:05:41 - INFO - __main__ - Step 77076: {'lr': 0.0002442957448974787, 'samples': 14798592, 'steps': 77075, 'loss/train': 1.5390222072601318} 11/07/2021 08:05:41 - INFO - __main__ - Step 77077: {'lr': 0.0002442904395359859, 'samples': 14798784, 'steps': 77076, 'loss/train': 1.575799584388733} 11/07/2021 08:05:42 - INFO - __main__ - Step 77078: {'lr': 0.00024428513417706574, 'samples': 14798976, 'steps': 77077, 'loss/train': 1.1862832307815552} 11/07/2021 08:05:43 - INFO - __main__ - Step 77079: {'lr': 0.00024427982882072063, 'samples': 14799168, 'steps': 77078, 'loss/train': 0.908709704875946} 11/07/2021 08:05:43 - INFO - __main__ - Step 77080: {'lr': 0.0002442745234669529, 'samples': 14799360, 'steps': 77079, 'loss/train': 1.6314150094985962} 11/07/2021 08:05:44 - INFO - __main__ - Step 77081: {'lr': 0.000244269218115765, 'samples': 14799552, 'steps': 77080, 'loss/train': 1.9132434129714966} 11/07/2021 08:05:44 - INFO - __main__ - Step 77082: {'lr': 0.0002442639127671593, 'samples': 14799744, 'steps': 77081, 'loss/train': 1.25007164478302} 11/07/2021 08:05:44 - INFO - __main__ - Step 77083: {'lr': 0.0002442586074211382, 'samples': 14799936, 'steps': 77082, 'loss/train': 0.786638617515564} 11/07/2021 08:05:45 - INFO - __main__ - Step 77084: {'lr': 0.0002442533020777042, 'samples': 14800128, 'steps': 77083, 'loss/train': 1.0962682962417603} 11/07/2021 08:05:46 - INFO - __main__ - Step 77085: {'lr': 0.00024424799673685945, 'samples': 14800320, 'steps': 77084, 'loss/train': 0.7747199535369873} 11/07/2021 08:05:46 - INFO - __main__ - Step 77086: {'lr': 0.00024424269139860643, 'samples': 14800512, 'steps': 77085, 'loss/train': 1.1276017427444458} 11/07/2021 08:05:46 - INFO - __main__ - Step 77087: {'lr': 0.00024423738606294763, 'samples': 14800704, 'steps': 77086, 'loss/train': 1.3824372291564941} 11/07/2021 08:05:47 - INFO - __main__ - Step 77088: {'lr': 0.00024423208072988533, 'samples': 14800896, 'steps': 77087, 'loss/train': 2.0242486000061035} 11/07/2021 08:05:47 - INFO - __main__ - Step 77089: {'lr': 0.000244226775399422, 'samples': 14801088, 'steps': 77088, 'loss/train': 1.6554758548736572} 11/07/2021 08:05:48 - INFO - __main__ - Step 77090: {'lr': 0.00024422147007155994, 'samples': 14801280, 'steps': 77089, 'loss/train': 1.7193135023117065} 11/07/2021 08:05:48 - INFO - __main__ - Step 77091: {'lr': 0.0002442161647463016, 'samples': 14801472, 'steps': 77090, 'loss/train': 0.4779888391494751} 11/07/2021 08:05:49 - INFO - __main__ - Step 77092: {'lr': 0.00024421085942364946, 'samples': 14801664, 'steps': 77091, 'loss/train': 1.3818544149398804} 11/07/2021 08:05:49 - INFO - __main__ - Step 77093: {'lr': 0.00024420555410360577, 'samples': 14801856, 'steps': 77092, 'loss/train': 1.3606436252593994} 11/07/2021 08:05:49 - INFO - __main__ - Step 77094: {'lr': 0.00024420024878617295, 'samples': 14802048, 'steps': 77093, 'loss/train': 1.4150662422180176} 11/07/2021 08:05:51 - INFO - __main__ - Step 77095: {'lr': 0.0002441949434713534, 'samples': 14802240, 'steps': 77094, 'loss/train': 0.8305088877677917} 11/07/2021 08:05:51 - INFO - __main__ - Step 77096: {'lr': 0.0002441896381591495, 'samples': 14802432, 'steps': 77095, 'loss/train': 1.3637127876281738} 11/07/2021 08:05:51 - INFO - __main__ - Step 77097: {'lr': 0.0002441843328495638, 'samples': 14802624, 'steps': 77096, 'loss/train': 1.0074883699417114} 11/07/2021 08:05:52 - INFO - __main__ - Step 77098: {'lr': 0.0002441790275425985, 'samples': 14802816, 'steps': 77097, 'loss/train': 1.061774492263794} 11/07/2021 08:05:52 - INFO - __main__ - Step 77099: {'lr': 0.00024417372223825594, 'samples': 14803008, 'steps': 77098, 'loss/train': 1.508961796760559} 11/07/2021 08:05:53 - INFO - __main__ - Step 77100: {'lr': 0.00024416841693653864, 'samples': 14803200, 'steps': 77099, 'loss/train': 1.6749138832092285} 11/07/2021 08:05:53 - INFO - __main__ - Step 77101: {'lr': 0.000244163111637449, 'samples': 14803392, 'steps': 77100, 'loss/train': 1.161179542541504} 11/07/2021 08:05:54 - INFO - __main__ - Step 77102: {'lr': 0.0002441578063409893, 'samples': 14803584, 'steps': 77101, 'loss/train': 0.9801385998725891} 11/07/2021 08:05:54 - INFO - __main__ - Step 77103: {'lr': 0.00024415250104716207, 'samples': 14803776, 'steps': 77102, 'loss/train': 1.8790621757507324} 11/07/2021 08:05:54 - INFO - __main__ - Step 77104: {'lr': 0.00024414719575596965, 'samples': 14803968, 'steps': 77103, 'loss/train': 1.3747440576553345} 11/07/2021 08:05:55 - INFO - __main__ - Step 77105: {'lr': 0.00024414189046741434, 'samples': 14804160, 'steps': 77104, 'loss/train': 1.2515541315078735} 11/07/2021 08:05:56 - INFO - __main__ - Step 77106: {'lr': 0.00024413658518149863, 'samples': 14804352, 'steps': 77105, 'loss/train': 1.4079062938690186} 11/07/2021 08:05:56 - INFO - __main__ - Step 77107: {'lr': 0.0002441312798982249, 'samples': 14804544, 'steps': 77106, 'loss/train': 1.3919878005981445} 11/07/2021 08:05:57 - INFO - __main__ - Step 77108: {'lr': 0.00024412597461759554, 'samples': 14804736, 'steps': 77107, 'loss/train': 0.5660611391067505} 11/07/2021 08:05:57 - INFO - __main__ - Step 77109: {'lr': 0.00024412066933961288, 'samples': 14804928, 'steps': 77108, 'loss/train': 1.235586404800415} 11/07/2021 08:05:58 - INFO - __main__ - Step 77110: {'lr': 0.0002441153640642794, 'samples': 14805120, 'steps': 77109, 'loss/train': 1.7937854528427124} 11/07/2021 08:05:58 - INFO - __main__ - Step 77111: {'lr': 0.0002441100587915975, 'samples': 14805312, 'steps': 77110, 'loss/train': 5.348827362060547} 11/07/2021 08:05:59 - INFO - __main__ - Step 77112: {'lr': 0.00024410475352156947, 'samples': 14805504, 'steps': 77111, 'loss/train': 1.6101804971694946} 11/07/2021 08:05:59 - INFO - __main__ - Step 77113: {'lr': 0.00024409944825419768, 'samples': 14805696, 'steps': 77112, 'loss/train': 1.5814675092697144} 11/07/2021 08:05:59 - INFO - __main__ - Step 77114: {'lr': 0.00024409414298948466, 'samples': 14805888, 'steps': 77113, 'loss/train': 1.1785398721694946} 11/07/2021 08:06:00 - INFO - __main__ - Step 77115: {'lr': 0.00024408883772743267, 'samples': 14806080, 'steps': 77114, 'loss/train': 1.6784781217575073} 11/07/2021 08:06:01 - INFO - __main__ - Step 77116: {'lr': 0.00024408353246804419, 'samples': 14806272, 'steps': 77115, 'loss/train': 1.2405812740325928} 11/07/2021 08:06:01 - INFO - __main__ - Step 77117: {'lr': 0.00024407822721132157, 'samples': 14806464, 'steps': 77116, 'loss/train': 1.5646965503692627} 11/07/2021 08:06:01 - INFO - __main__ - Step 77118: {'lr': 0.00024407292195726722, 'samples': 14806656, 'steps': 77117, 'loss/train': 1.2653931379318237} 11/07/2021 08:06:02 - INFO - __main__ - Step 77119: {'lr': 0.0002440676167058835, 'samples': 14806848, 'steps': 77118, 'loss/train': 1.3889886140823364} 11/07/2021 08:06:02 - INFO - __main__ - Step 77120: {'lr': 0.00024406231145717285, 'samples': 14807040, 'steps': 77119, 'loss/train': 1.7366807460784912} 11/07/2021 08:06:04 - INFO - __main__ - Step 77121: {'lr': 0.00024405700621113759, 'samples': 14807232, 'steps': 77120, 'loss/train': 1.5950422286987305} 11/07/2021 08:06:04 - INFO - __main__ - Step 77122: {'lr': 0.00024405170096778022, 'samples': 14807424, 'steps': 77121, 'loss/train': 1.2564481496810913} 11/07/2021 08:06:04 - INFO - __main__ - Step 77123: {'lr': 0.000244046395727103, 'samples': 14807616, 'steps': 77122, 'loss/train': 0.8293803930282593} 11/07/2021 08:06:05 - INFO - __main__ - Step 77124: {'lr': 0.00024404109048910847, 'samples': 14807808, 'steps': 77123, 'loss/train': 1.2122448682785034} 11/07/2021 08:06:05 - INFO - __main__ - Step 77125: {'lr': 0.00024403578525379887, 'samples': 14808000, 'steps': 77124, 'loss/train': 0.7626751661300659} 11/07/2021 08:06:06 - INFO - __main__ - Step 77126: {'lr': 0.00024403048002117662, 'samples': 14808192, 'steps': 77125, 'loss/train': 1.1480365991592407} 11/07/2021 08:06:06 - INFO - __main__ - Step 77127: {'lr': 0.00024402517479124418, 'samples': 14808384, 'steps': 77126, 'loss/train': 2.1109585762023926} 11/07/2021 08:06:07 - INFO - __main__ - Step 77128: {'lr': 0.0002440198695640039, 'samples': 14808576, 'steps': 77127, 'loss/train': 1.902608036994934} 11/07/2021 08:06:07 - INFO - __main__ - Step 77129: {'lr': 0.00024401456433945814, 'samples': 14808768, 'steps': 77128, 'loss/train': 1.648815393447876} 11/07/2021 08:06:07 - INFO - __main__ - Step 77130: {'lr': 0.00024400925911760934, 'samples': 14808960, 'steps': 77129, 'loss/train': 1.771060585975647} 11/07/2021 08:06:08 - INFO - __main__ - Step 77131: {'lr': 0.00024400395389845988, 'samples': 14809152, 'steps': 77130, 'loss/train': 1.549401044845581} 11/07/2021 08:06:09 - INFO - __main__ - Step 77132: {'lr': 0.00024399864868201215, 'samples': 14809344, 'steps': 77131, 'loss/train': 1.075242280960083} 11/07/2021 08:06:09 - INFO - __main__ - Step 77133: {'lr': 0.0002439933434682686, 'samples': 14809536, 'steps': 77132, 'loss/train': 0.9593841433525085} 11/07/2021 08:06:09 - INFO - __main__ - Step 77134: {'lr': 0.0002439880382572315, 'samples': 14809728, 'steps': 77133, 'loss/train': 1.6022580862045288} 11/07/2021 08:06:10 - INFO - __main__ - Step 77135: {'lr': 0.00024398273304890327, 'samples': 14809920, 'steps': 77134, 'loss/train': 0.9359227418899536} 11/07/2021 08:06:11 - INFO - __main__ - Step 77136: {'lr': 0.00024397742784328636, 'samples': 14810112, 'steps': 77135, 'loss/train': 1.2858350276947021} 11/07/2021 08:06:11 - INFO - __main__ - Step 77137: {'lr': 0.00024397212264038313, 'samples': 14810304, 'steps': 77136, 'loss/train': 0.8328785300254822} 11/07/2021 08:06:12 - INFO - __main__ - Step 77138: {'lr': 0.000243966817440196, 'samples': 14810496, 'steps': 77137, 'loss/train': 1.0843018293380737} 11/07/2021 08:06:12 - INFO - __main__ - Step 77139: {'lr': 0.00024396151224272727, 'samples': 14810688, 'steps': 77138, 'loss/train': 1.8484201431274414} 11/07/2021 08:06:12 - INFO - __main__ - Step 77140: {'lr': 0.0002439562070479794, 'samples': 14810880, 'steps': 77139, 'loss/train': 1.3496774435043335} 11/07/2021 08:06:13 - INFO - __main__ - Step 77141: {'lr': 0.0002439509018559548, 'samples': 14811072, 'steps': 77140, 'loss/train': 1.0977964401245117} 11/07/2021 08:06:14 - INFO - __main__ - Step 77142: {'lr': 0.0002439455966666558, 'samples': 14811264, 'steps': 77141, 'loss/train': 1.4775019884109497} 11/07/2021 08:06:14 - INFO - __main__ - Step 77143: {'lr': 0.0002439402914800848, 'samples': 14811456, 'steps': 77142, 'loss/train': 1.1439287662506104} 11/07/2021 08:06:14 - INFO - __main__ - Step 77144: {'lr': 0.00024393498629624431, 'samples': 14811648, 'steps': 77143, 'loss/train': 1.3034321069717407} 11/07/2021 08:06:15 - INFO - __main__ - Step 77145: {'lr': 0.00024392968111513656, 'samples': 14811840, 'steps': 77144, 'loss/train': 1.0136826038360596} 11/07/2021 08:06:15 - INFO - __main__ - Step 77146: {'lr': 0.00024392437593676397, 'samples': 14812032, 'steps': 77145, 'loss/train': 1.6606204509735107} 11/07/2021 08:06:16 - INFO - __main__ - Step 77147: {'lr': 0.000243919070761129, 'samples': 14812224, 'steps': 77146, 'loss/train': 1.095513105392456} 11/07/2021 08:06:16 - INFO - __main__ - Step 77148: {'lr': 0.00024391376558823398, 'samples': 14812416, 'steps': 77147, 'loss/train': 1.1010468006134033} 11/07/2021 08:06:17 - INFO - __main__ - Step 77149: {'lr': 0.00024390846041808133, 'samples': 14812608, 'steps': 77148, 'loss/train': 1.229461431503296} 11/07/2021 08:06:17 - INFO - __main__ - Step 77150: {'lr': 0.00024390315525067341, 'samples': 14812800, 'steps': 77149, 'loss/train': 0.9809114336967468} 11/07/2021 08:06:17 - INFO - __main__ - Step 77151: {'lr': 0.00024389785008601273, 'samples': 14812992, 'steps': 77150, 'loss/train': 1.1629105806350708} 11/07/2021 08:06:19 - INFO - __main__ - Step 77152: {'lr': 0.00024389254492410148, 'samples': 14813184, 'steps': 77151, 'loss/train': 1.7189916372299194} 11/07/2021 08:06:19 - INFO - __main__ - Step 77153: {'lr': 0.0002438872397649422, 'samples': 14813376, 'steps': 77152, 'loss/train': 1.4377434253692627} 11/07/2021 08:06:19 - INFO - __main__ - Step 77154: {'lr': 0.00024388193460853723, 'samples': 14813568, 'steps': 77153, 'loss/train': 1.2334100008010864} 11/07/2021 08:06:20 - INFO - __main__ - Step 77155: {'lr': 0.000243876629454889, 'samples': 14813760, 'steps': 77154, 'loss/train': 1.2389452457427979} 11/07/2021 08:06:20 - INFO - __main__ - Step 77156: {'lr': 0.00024387132430399983, 'samples': 14813952, 'steps': 77155, 'loss/train': 1.174056887626648} 11/07/2021 08:06:20 - INFO - __main__ - Step 77157: {'lr': 0.00024386601915587215, 'samples': 14814144, 'steps': 77156, 'loss/train': 1.1643872261047363} 11/07/2021 08:06:21 - INFO - __main__ - Step 77158: {'lr': 0.00024386071401050834, 'samples': 14814336, 'steps': 77157, 'loss/train': 1.2969132661819458} 11/07/2021 08:06:22 - INFO - __main__ - Step 77159: {'lr': 0.00024385540886791076, 'samples': 14814528, 'steps': 77158, 'loss/train': 1.6540391445159912} 11/07/2021 08:06:22 - INFO - __main__ - Step 77160: {'lr': 0.0002438501037280819, 'samples': 14814720, 'steps': 77159, 'loss/train': 1.7635687589645386} 11/07/2021 08:06:22 - INFO - __main__ - Step 77161: {'lr': 0.00024384479859102404, 'samples': 14814912, 'steps': 77160, 'loss/train': 1.2834597826004028} 11/07/2021 08:06:23 - INFO - __main__ - Step 77162: {'lr': 0.00024383949345673964, 'samples': 14815104, 'steps': 77161, 'loss/train': 1.1849015951156616} 11/07/2021 08:06:24 - INFO - __main__ - Step 77163: {'lr': 0.00024383418832523107, 'samples': 14815296, 'steps': 77162, 'loss/train': 1.4364253282546997} 11/07/2021 08:06:24 - INFO - __main__ - Step 77164: {'lr': 0.00024382888319650077, 'samples': 14815488, 'steps': 77163, 'loss/train': 1.6538550853729248} 11/07/2021 08:06:24 - INFO - __main__ - Step 77165: {'lr': 0.0002438235780705511, 'samples': 14815680, 'steps': 77164, 'loss/train': 1.0839741230010986} 11/07/2021 08:06:25 - INFO - __main__ - Step 77166: {'lr': 0.00024381827294738434, 'samples': 14815872, 'steps': 77165, 'loss/train': 1.2831363677978516} 11/07/2021 08:06:25 - INFO - __main__ - Step 77167: {'lr': 0.000243812967827003, 'samples': 14816064, 'steps': 77166, 'loss/train': 1.3345087766647339} 11/07/2021 08:06:26 - INFO - __main__ - Step 77168: {'lr': 0.0002438076627094094, 'samples': 14816256, 'steps': 77167, 'loss/train': 0.8525732755661011} 11/07/2021 08:06:27 - INFO - __main__ - Step 77169: {'lr': 0.000243802357594606, 'samples': 14816448, 'steps': 77168, 'loss/train': 0.8579629063606262} 11/07/2021 08:06:27 - INFO - __main__ - Step 77170: {'lr': 0.00024379705248259517, 'samples': 14816640, 'steps': 77169, 'loss/train': 0.6402370929718018} 11/07/2021 08:06:27 - INFO - __main__ - Step 77171: {'lr': 0.00024379174737337931, 'samples': 14816832, 'steps': 77170, 'loss/train': 1.353801965713501} 11/07/2021 08:06:28 - INFO - __main__ - Step 77172: {'lr': 0.00024378644226696075, 'samples': 14817024, 'steps': 77171, 'loss/train': 1.2130389213562012} 11/07/2021 08:06:29 - INFO - __main__ - Step 77173: {'lr': 0.00024378113716334193, 'samples': 14817216, 'steps': 77172, 'loss/train': 1.4855520725250244} 11/07/2021 08:06:29 - INFO - __main__ - Step 77174: {'lr': 0.00024377583206252527, 'samples': 14817408, 'steps': 77173, 'loss/train': 1.2695380449295044} 11/07/2021 08:06:29 - INFO - __main__ - Step 77175: {'lr': 0.0002437705269645131, 'samples': 14817600, 'steps': 77174, 'loss/train': 1.0538581609725952} 11/07/2021 08:06:30 - INFO - __main__ - Step 77176: {'lr': 0.00024376522186930782, 'samples': 14817792, 'steps': 77175, 'loss/train': 1.6803646087646484} 11/07/2021 08:06:30 - INFO - __main__ - Step 77177: {'lr': 0.00024375991677691187, 'samples': 14817984, 'steps': 77176, 'loss/train': 1.4129892587661743} 11/07/2021 08:06:31 - INFO - __main__ - Step 77178: {'lr': 0.00024375461168732769, 'samples': 14818176, 'steps': 77177, 'loss/train': 0.9951822757720947} 11/07/2021 08:06:31 - INFO - __main__ - Step 77179: {'lr': 0.00024374930660055747, 'samples': 14818368, 'steps': 77178, 'loss/train': 1.907820224761963} 11/07/2021 08:06:32 - INFO - __main__ - Step 77180: {'lr': 0.0002437440015166037, 'samples': 14818560, 'steps': 77179, 'loss/train': 0.9074069857597351} 11/07/2021 08:06:32 - INFO - __main__ - Step 77181: {'lr': 0.00024373869643546883, 'samples': 14818752, 'steps': 77180, 'loss/train': 1.6906421184539795} 11/07/2021 08:06:32 - INFO - __main__ - Step 77182: {'lr': 0.0002437333913571552, 'samples': 14818944, 'steps': 77181, 'loss/train': 1.2036380767822266} 11/07/2021 08:06:34 - INFO - __main__ - Step 77183: {'lr': 0.00024372808628166518, 'samples': 14819136, 'steps': 77182, 'loss/train': 1.5232938528060913} 11/07/2021 08:06:34 - INFO - __main__ - Step 77184: {'lr': 0.0002437227812090012, 'samples': 14819328, 'steps': 77183, 'loss/train': 1.534103274345398} 11/07/2021 08:06:34 - INFO - __main__ - Step 77185: {'lr': 0.00024371747613916565, 'samples': 14819520, 'steps': 77184, 'loss/train': 1.549660325050354} 11/07/2021 08:06:35 - INFO - __main__ - Step 77186: {'lr': 0.0002437121710721609, 'samples': 14819712, 'steps': 77185, 'loss/train': 1.1148306131362915} 11/07/2021 08:06:35 - INFO - __main__ - Step 77187: {'lr': 0.00024370686600798936, 'samples': 14819904, 'steps': 77186, 'loss/train': 1.2572064399719238} 11/07/2021 08:06:36 - INFO - __main__ - Step 77188: {'lr': 0.0002437015609466534, 'samples': 14820096, 'steps': 77187, 'loss/train': 1.4118355512619019} 11/07/2021 08:06:37 - INFO - __main__ - Step 77189: {'lr': 0.00024369625588815542, 'samples': 14820288, 'steps': 77188, 'loss/train': 1.431102991104126} 11/07/2021 08:06:37 - INFO - __main__ - Step 77190: {'lr': 0.0002436909508324978, 'samples': 14820480, 'steps': 77189, 'loss/train': 5.64987850189209} 11/07/2021 08:06:37 - INFO - __main__ - Step 77191: {'lr': 0.00024368564577968306, 'samples': 14820672, 'steps': 77190, 'loss/train': 5.351370811462402} 11/07/2021 08:06:38 - INFO - __main__ - Step 77192: {'lr': 0.00024368034072971335, 'samples': 14820864, 'steps': 77191, 'loss/train': 1.1335526704788208} 11/07/2021 08:06:38 - INFO - __main__ - Step 77193: {'lr': 0.0002436750356825912, 'samples': 14821056, 'steps': 77192, 'loss/train': 1.1559418439865112} 11/07/2021 08:06:40 - INFO - __main__ - Step 77194: {'lr': 0.00024366973063831896, 'samples': 14821248, 'steps': 77193, 'loss/train': 1.5985333919525146} 11/07/2021 08:06:40 - INFO - __main__ - Step 77195: {'lr': 0.00024366442559689905, 'samples': 14821440, 'steps': 77194, 'loss/train': 2.061516761779785} 11/07/2021 08:06:40 - INFO - __main__ - Step 77196: {'lr': 0.00024365912055833384, 'samples': 14821632, 'steps': 77195, 'loss/train': 1.3594844341278076} 11/07/2021 08:06:41 - INFO - __main__ - Step 77197: {'lr': 0.00024365381552262575, 'samples': 14821824, 'steps': 77196, 'loss/train': 1.5351567268371582} 11/07/2021 08:06:41 - INFO - __main__ - Step 77198: {'lr': 0.00024364851048977714, 'samples': 14822016, 'steps': 77197, 'loss/train': 1.5407627820968628} 11/07/2021 08:06:41 - INFO - __main__ - Step 77199: {'lr': 0.00024364320545979044, 'samples': 14822208, 'steps': 77198, 'loss/train': 1.7870759963989258} 11/07/2021 08:06:42 - INFO - __main__ - Step 77200: {'lr': 0.000243637900432668, 'samples': 14822400, 'steps': 77199, 'loss/train': 1.5852988958358765} 11/07/2021 08:06:43 - INFO - __main__ - Step 77201: {'lr': 0.00024363259540841222, 'samples': 14822592, 'steps': 77200, 'loss/train': 1.7323989868164062} 11/07/2021 08:06:43 - INFO - __main__ - Step 77202: {'lr': 0.00024362729038702546, 'samples': 14822784, 'steps': 77201, 'loss/train': 1.536412000656128} 11/07/2021 08:06:43 - INFO - __main__ - Step 77203: {'lr': 0.00024362198536851022, 'samples': 14822976, 'steps': 77202, 'loss/train': 0.9148334860801697} 11/07/2021 08:06:44 - INFO - __main__ - Step 77204: {'lr': 0.00024361668035286874, 'samples': 14823168, 'steps': 77203, 'loss/train': 1.0970460176467896} 11/07/2021 08:06:44 - INFO - __main__ - Step 77205: {'lr': 0.00024361137534010364, 'samples': 14823360, 'steps': 77204, 'loss/train': 1.5645166635513306} 11/07/2021 08:06:45 - INFO - __main__ - Step 77206: {'lr': 0.00024360607033021704, 'samples': 14823552, 'steps': 77205, 'loss/train': 1.6056488752365112} 11/07/2021 08:06:45 - INFO - __main__ - Step 77207: {'lr': 0.00024360076532321142, 'samples': 14823744, 'steps': 77206, 'loss/train': 1.9627788066864014} 11/07/2021 08:06:46 - INFO - __main__ - Step 77208: {'lr': 0.00024359546031908926, 'samples': 14823936, 'steps': 77207, 'loss/train': 0.9500669240951538} 11/07/2021 08:06:46 - INFO - __main__ - Step 77209: {'lr': 0.0002435901553178528, 'samples': 14824128, 'steps': 77208, 'loss/train': 1.3286521434783936} 11/07/2021 08:06:46 - INFO - __main__ - Step 77210: {'lr': 0.0002435848503195046, 'samples': 14824320, 'steps': 77209, 'loss/train': 1.168330192565918} 11/07/2021 08:06:48 - INFO - __main__ - Step 77211: {'lr': 0.0002435795453240469, 'samples': 14824512, 'steps': 77210, 'loss/train': 1.332122802734375} 11/07/2021 08:06:48 - INFO - __main__ - Step 77212: {'lr': 0.00024357424033148218, 'samples': 14824704, 'steps': 77211, 'loss/train': 1.6652809381484985} 11/07/2021 08:06:48 - INFO - __main__ - Step 77213: {'lr': 0.00024356893534181281, 'samples': 14824896, 'steps': 77212, 'loss/train': 1.3720766305923462} 11/07/2021 08:06:49 - INFO - __main__ - Step 77214: {'lr': 0.0002435636303550412, 'samples': 14825088, 'steps': 77213, 'loss/train': 1.5212087631225586} 11/07/2021 08:06:49 - INFO - __main__ - Step 77215: {'lr': 0.0002435583253711697, 'samples': 14825280, 'steps': 77214, 'loss/train': 1.4354784488677979} 11/07/2021 08:06:50 - INFO - __main__ - Step 77216: {'lr': 0.0002435530203902007, 'samples': 14825472, 'steps': 77215, 'loss/train': 1.3907815217971802} 11/07/2021 08:06:50 - INFO - __main__ - Step 77217: {'lr': 0.00024354771541213664, 'samples': 14825664, 'steps': 77216, 'loss/train': 1.5158028602600098} 11/07/2021 08:06:51 - INFO - __main__ - Step 77218: {'lr': 0.00024354241043698, 'samples': 14825856, 'steps': 77217, 'loss/train': 1.4001771211624146} 11/07/2021 08:06:51 - INFO - __main__ - Step 77219: {'lr': 0.0002435371054647329, 'samples': 14826048, 'steps': 77218, 'loss/train': 1.4908639192581177} 11/07/2021 08:06:51 - INFO - __main__ - Step 77220: {'lr': 0.00024353180049539792, 'samples': 14826240, 'steps': 77219, 'loss/train': 1.9044488668441772} 11/07/2021 08:06:53 - INFO - __main__ - Step 77221: {'lr': 0.00024352649552897737, 'samples': 14826432, 'steps': 77220, 'loss/train': 1.6589397192001343} 11/07/2021 08:06:53 - INFO - __main__ - Step 77222: {'lr': 0.0002435211905654737, 'samples': 14826624, 'steps': 77221, 'loss/train': 1.4423848390579224} 11/07/2021 08:06:53 - INFO - __main__ - Step 77223: {'lr': 0.0002435158856048893, 'samples': 14826816, 'steps': 77222, 'loss/train': 1.2965167760849} 11/07/2021 08:06:54 - INFO - __main__ - Step 77224: {'lr': 0.00024351058064722654, 'samples': 14827008, 'steps': 77223, 'loss/train': 1.454052448272705} 11/07/2021 08:06:54 - INFO - __main__ - Step 77225: {'lr': 0.00024350527569248778, 'samples': 14827200, 'steps': 77224, 'loss/train': 1.341172456741333} 11/07/2021 08:06:54 - INFO - __main__ - Step 77226: {'lr': 0.00024349997074067545, 'samples': 14827392, 'steps': 77225, 'loss/train': 1.0758427381515503} 11/07/2021 08:06:55 - INFO - __main__ - Step 77227: {'lr': 0.00024349466579179195, 'samples': 14827584, 'steps': 77226, 'loss/train': 1.0713145732879639} 11/07/2021 08:06:56 - INFO - __main__ - Step 77228: {'lr': 0.00024348936084583964, 'samples': 14827776, 'steps': 77227, 'loss/train': 1.7411354780197144} 11/07/2021 08:06:56 - INFO - __main__ - Step 77229: {'lr': 0.00024348405590282095, 'samples': 14827968, 'steps': 77228, 'loss/train': 0.7990846037864685} 11/07/2021 08:06:56 - INFO - __main__ - Step 77230: {'lr': 0.00024347875096273822, 'samples': 14828160, 'steps': 77229, 'loss/train': 1.1112957000732422} 11/07/2021 08:06:57 - INFO - __main__ - Step 77231: {'lr': 0.00024347344602559386, 'samples': 14828352, 'steps': 77230, 'loss/train': 1.4207795858383179} 11/07/2021 08:06:58 - INFO - __main__ - Step 77232: {'lr': 0.0002434681410913904, 'samples': 14828544, 'steps': 77231, 'loss/train': 1.3744854927062988} 11/07/2021 08:06:58 - INFO - __main__ - Step 77233: {'lr': 0.00024346283616012997, 'samples': 14828736, 'steps': 77232, 'loss/train': 0.9108613729476929} 11/07/2021 08:06:59 - INFO - __main__ - Step 77234: {'lr': 0.00024345753123181509, 'samples': 14828928, 'steps': 77233, 'loss/train': 1.4381871223449707} 11/07/2021 08:06:59 - INFO - __main__ - Step 77235: {'lr': 0.00024345222630644812, 'samples': 14829120, 'steps': 77234, 'loss/train': 5.9948601722717285} 11/07/2021 08:06:59 - INFO - __main__ - Step 77236: {'lr': 0.0002434469213840315, 'samples': 14829312, 'steps': 77235, 'loss/train': 1.5535041093826294} 11/07/2021 08:07:00 - INFO - __main__ - Step 77237: {'lr': 0.00024344161646456757, 'samples': 14829504, 'steps': 77236, 'loss/train': 1.321803331375122} 11/07/2021 08:07:01 - INFO - __main__ - Step 77238: {'lr': 0.00024343631154805879, 'samples': 14829696, 'steps': 77237, 'loss/train': 1.253240942955017} 11/07/2021 08:07:01 - INFO - __main__ - Step 77239: {'lr': 0.00024343100663450746, 'samples': 14829888, 'steps': 77238, 'loss/train': 1.5471917390823364} 11/07/2021 08:07:01 - INFO - __main__ - Step 77240: {'lr': 0.00024342570172391603, 'samples': 14830080, 'steps': 77239, 'loss/train': 1.4038749933242798} 11/07/2021 08:07:02 - INFO - __main__ - Step 77241: {'lr': 0.0002434203968162869, 'samples': 14830272, 'steps': 77240, 'loss/train': 1.0581086874008179} 11/07/2021 08:07:02 - INFO - __main__ - Step 77242: {'lr': 0.0002434150919116224, 'samples': 14830464, 'steps': 77241, 'loss/train': 0.7046974897384644} 11/07/2021 08:07:03 - INFO - __main__ - Step 77243: {'lr': 0.00024340978700992502, 'samples': 14830656, 'steps': 77242, 'loss/train': 1.706594705581665} 11/07/2021 08:07:03 - INFO - __main__ - Step 77244: {'lr': 0.00024340448211119706, 'samples': 14830848, 'steps': 77243, 'loss/train': 1.5473564863204956} 11/07/2021 08:07:04 - INFO - __main__ - Step 77245: {'lr': 0.00024339917721544102, 'samples': 14831040, 'steps': 77244, 'loss/train': 1.0399194955825806} 11/07/2021 08:07:04 - INFO - __main__ - Step 77246: {'lr': 0.00024339387232265913, 'samples': 14831232, 'steps': 77245, 'loss/train': 1.2233623266220093} 11/07/2021 08:07:04 - INFO - __main__ - Step 77247: {'lr': 0.00024338856743285383, 'samples': 14831424, 'steps': 77246, 'loss/train': 1.2614167928695679} 11/07/2021 08:07:05 - INFO - __main__ - Step 77248: {'lr': 0.00024338326254602757, 'samples': 14831616, 'steps': 77247, 'loss/train': 1.7292766571044922} 11/07/2021 08:07:06 - INFO - __main__ - Step 77249: {'lr': 0.0002433779576621827, 'samples': 14831808, 'steps': 77248, 'loss/train': 1.7178709506988525} 11/07/2021 08:07:06 - INFO - __main__ - Step 77250: {'lr': 0.00024337265278132162, 'samples': 14832000, 'steps': 77249, 'loss/train': 1.1658836603164673} 11/07/2021 08:07:06 - INFO - __main__ - Step 77251: {'lr': 0.00024336734790344672, 'samples': 14832192, 'steps': 77250, 'loss/train': 1.5247507095336914} 11/07/2021 08:07:07 - INFO - __main__ - Step 77252: {'lr': 0.00024336204302856038, 'samples': 14832384, 'steps': 77251, 'loss/train': 1.75771164894104} 11/07/2021 08:07:08 - INFO - __main__ - Step 77253: {'lr': 0.00024335673815666502, 'samples': 14832576, 'steps': 77252, 'loss/train': 1.1768929958343506} 11/07/2021 08:07:08 - INFO - __main__ - Step 77254: {'lr': 0.000243351433287763, 'samples': 14832768, 'steps': 77253, 'loss/train': 1.6161079406738281} 11/07/2021 08:07:09 - INFO - __main__ - Step 77255: {'lr': 0.00024334612842185672, 'samples': 14832960, 'steps': 77254, 'loss/train': 1.4208766222000122} 11/07/2021 08:07:09 - INFO - __main__ - Step 77256: {'lr': 0.00024334082355894861, 'samples': 14833152, 'steps': 77255, 'loss/train': 1.2621004581451416} 11/07/2021 08:07:09 - INFO - __main__ - Step 77257: {'lr': 0.000243335518699041, 'samples': 14833344, 'steps': 77256, 'loss/train': 1.2221410274505615} 11/07/2021 08:07:10 - INFO - __main__ - Step 77258: {'lr': 0.0002433302138421363, 'samples': 14833536, 'steps': 77257, 'loss/train': 1.603335976600647} 11/07/2021 08:07:11 - INFO - __main__ - Step 77259: {'lr': 0.00024332490898823695, 'samples': 14833728, 'steps': 77258, 'loss/train': 0.9262852668762207} 11/07/2021 08:07:11 - INFO - __main__ - Step 77260: {'lr': 0.00024331960413734522, 'samples': 14833920, 'steps': 77259, 'loss/train': 0.07079558074474335} 11/07/2021 08:07:11 - INFO - __main__ - Step 77261: {'lr': 0.0002433142992894636, 'samples': 14834112, 'steps': 77260, 'loss/train': 1.48430335521698} 11/07/2021 08:07:12 - INFO - __main__ - Step 77262: {'lr': 0.00024330899444459446, 'samples': 14834304, 'steps': 77261, 'loss/train': 1.2664763927459717} 11/07/2021 08:07:13 - INFO - __main__ - Step 77263: {'lr': 0.00024330368960274017, 'samples': 14834496, 'steps': 77262, 'loss/train': 1.7040868997573853} 11/07/2021 08:07:13 - INFO - __main__ - Step 77264: {'lr': 0.0002432983847639031, 'samples': 14834688, 'steps': 77263, 'loss/train': 1.3179529905319214} 11/07/2021 08:07:14 - INFO - __main__ - Step 77265: {'lr': 0.00024329307992808572, 'samples': 14834880, 'steps': 77264, 'loss/train': 1.4151057004928589} 11/07/2021 08:07:14 - INFO - __main__ - Step 77266: {'lr': 0.00024328777509529035, 'samples': 14835072, 'steps': 77265, 'loss/train': 1.421807050704956} 11/07/2021 08:07:14 - INFO - __main__ - Step 77267: {'lr': 0.00024328247026551947, 'samples': 14835264, 'steps': 77266, 'loss/train': 1.1813071966171265} 11/07/2021 08:07:15 - INFO - __main__ - Step 77268: {'lr': 0.00024327716543877533, 'samples': 14835456, 'steps': 77267, 'loss/train': 1.5929232835769653} 11/07/2021 08:07:16 - INFO - __main__ - Step 77269: {'lr': 0.00024327186061506043, 'samples': 14835648, 'steps': 77268, 'loss/train': 1.40629243850708} 11/07/2021 08:07:16 - INFO - __main__ - Step 77270: {'lr': 0.0002432665557943771, 'samples': 14835840, 'steps': 77269, 'loss/train': 1.5326287746429443} 11/07/2021 08:07:16 - INFO - __main__ - Step 77271: {'lr': 0.00024326125097672778, 'samples': 14836032, 'steps': 77270, 'loss/train': 1.4814289808273315} 11/07/2021 08:07:17 - INFO - __main__ - Step 77272: {'lr': 0.00024325594616211488, 'samples': 14836224, 'steps': 77271, 'loss/train': 1.169271469116211} 11/07/2021 08:07:17 - INFO - __main__ - Step 77273: {'lr': 0.00024325064135054069, 'samples': 14836416, 'steps': 77272, 'loss/train': 1.5453182458877563} 11/07/2021 08:07:18 - INFO - __main__ - Step 77274: {'lr': 0.00024324533654200765, 'samples': 14836608, 'steps': 77273, 'loss/train': 1.0139737129211426} 11/07/2021 08:07:18 - INFO - __main__ - Step 77275: {'lr': 0.00024324003173651815, 'samples': 14836800, 'steps': 77274, 'loss/train': 1.4664005041122437} 11/07/2021 08:07:19 - INFO - __main__ - Step 77276: {'lr': 0.00024323472693407462, 'samples': 14836992, 'steps': 77275, 'loss/train': 1.3438518047332764} 11/07/2021 08:07:19 - INFO - __main__ - Step 77277: {'lr': 0.0002432294221346794, 'samples': 14837184, 'steps': 77276, 'loss/train': 1.475545048713684} 11/07/2021 08:07:19 - INFO - __main__ - Step 77278: {'lr': 0.00024322411733833493, 'samples': 14837376, 'steps': 77277, 'loss/train': 1.6083718538284302} 11/07/2021 08:07:21 - INFO - __main__ - Step 77279: {'lr': 0.00024321881254504355, 'samples': 14837568, 'steps': 77278, 'loss/train': 1.320188045501709} 11/07/2021 08:07:21 - INFO - __main__ - Step 77280: {'lr': 0.00024321350775480767, 'samples': 14837760, 'steps': 77279, 'loss/train': 1.3699054718017578} 11/07/2021 08:07:21 - INFO - __main__ - Step 77281: {'lr': 0.00024320820296762962, 'samples': 14837952, 'steps': 77280, 'loss/train': 1.7172269821166992} 11/07/2021 08:07:22 - INFO - __main__ - Step 77282: {'lr': 0.0002432028981835119, 'samples': 14838144, 'steps': 77281, 'loss/train': 1.7602776288986206} 11/07/2021 08:07:22 - INFO - __main__ - Step 77283: {'lr': 0.00024319759340245685, 'samples': 14838336, 'steps': 77282, 'loss/train': 1.3605321645736694} 11/07/2021 08:07:23 - INFO - __main__ - Step 77284: {'lr': 0.00024319228862446684, 'samples': 14838528, 'steps': 77283, 'loss/train': 1.3787841796875} 11/07/2021 08:07:23 - INFO - __main__ - Step 77285: {'lr': 0.00024318698384954434, 'samples': 14838720, 'steps': 77284, 'loss/train': 1.6208113431930542} 11/07/2021 08:07:24 - INFO - __main__ - Step 77286: {'lr': 0.00024318167907769165, 'samples': 14838912, 'steps': 77285, 'loss/train': 1.0420054197311401} 11/07/2021 08:07:24 - INFO - __main__ - Step 77287: {'lr': 0.00024317637430891115, 'samples': 14839104, 'steps': 77286, 'loss/train': 1.360581636428833} 11/07/2021 08:07:24 - INFO - __main__ - Step 77288: {'lr': 0.00024317106954320532, 'samples': 14839296, 'steps': 77287, 'loss/train': 0.5321884751319885} 11/07/2021 08:07:25 - INFO - __main__ - Step 77289: {'lr': 0.0002431657647805765, 'samples': 14839488, 'steps': 77288, 'loss/train': 1.128486156463623} 11/07/2021 08:07:26 - INFO - __main__ - Step 77290: {'lr': 0.00024316046002102707, 'samples': 14839680, 'steps': 77289, 'loss/train': 1.2245303392410278} 11/07/2021 08:07:26 - INFO - __main__ - Step 77291: {'lr': 0.0002431551552645594, 'samples': 14839872, 'steps': 77290, 'loss/train': 1.2947348356246948} 11/07/2021 08:07:26 - INFO - __main__ - Step 77292: {'lr': 0.00024314985051117593, 'samples': 14840064, 'steps': 77291, 'loss/train': 1.6155587434768677} 11/07/2021 08:07:27 - INFO - __main__ - Step 77293: {'lr': 0.00024314454576087902, 'samples': 14840256, 'steps': 77292, 'loss/train': 1.5896639823913574} 11/07/2021 08:07:28 - INFO - __main__ - Step 77294: {'lr': 0.0002431392410136711, 'samples': 14840448, 'steps': 77293, 'loss/train': 1.6358823776245117} 11/07/2021 08:07:28 - INFO - __main__ - Step 77295: {'lr': 0.00024313393626955448, 'samples': 14840640, 'steps': 77294, 'loss/train': 0.8925075531005859} 11/07/2021 08:07:29 - INFO - __main__ - Step 77296: {'lr': 0.00024312863152853165, 'samples': 14840832, 'steps': 77295, 'loss/train': 1.766013741493225} 11/07/2021 08:07:29 - INFO - __main__ - Step 77297: {'lr': 0.00024312332679060492, 'samples': 14841024, 'steps': 77296, 'loss/train': 1.0331512689590454} 11/07/2021 08:07:29 - INFO - __main__ - Step 77298: {'lr': 0.00024311802205577673, 'samples': 14841216, 'steps': 77297, 'loss/train': 1.6258437633514404} 11/07/2021 08:07:30 - INFO - __main__ - Step 77299: {'lr': 0.0002431127173240495, 'samples': 14841408, 'steps': 77298, 'loss/train': 1.277793288230896} 11/07/2021 08:07:31 - INFO - __main__ - Step 77300: {'lr': 0.0002431074125954256, 'samples': 14841600, 'steps': 77299, 'loss/train': 0.7824445366859436} 11/07/2021 08:07:31 - INFO - __main__ - Step 77301: {'lr': 0.0002431021078699073, 'samples': 14841792, 'steps': 77300, 'loss/train': 1.3337219953536987} 11/07/2021 08:07:31 - INFO - __main__ - Step 77302: {'lr': 0.0002430968031474971, 'samples': 14841984, 'steps': 77301, 'loss/train': 2.190783739089966} 11/07/2021 08:07:32 - INFO - __main__ - Step 77303: {'lr': 0.0002430914984281974, 'samples': 14842176, 'steps': 77302, 'loss/train': 0.739013135433197} 11/07/2021 08:07:32 - INFO - __main__ - Step 77304: {'lr': 0.0002430861937120105, 'samples': 14842368, 'steps': 77303, 'loss/train': 1.1957889795303345} 11/07/2021 08:07:34 - INFO - __main__ - Step 77305: {'lr': 0.0002430808889989389, 'samples': 14842560, 'steps': 77304, 'loss/train': 1.5518698692321777} 11/07/2021 08:07:34 - INFO - __main__ - Step 77306: {'lr': 0.00024307558428898494, 'samples': 14842752, 'steps': 77305, 'loss/train': 1.456223487854004} 11/07/2021 08:07:34 - INFO - __main__ - Step 77307: {'lr': 0.00024307027958215104, 'samples': 14842944, 'steps': 77306, 'loss/train': 1.5856930017471313} 11/07/2021 08:07:35 - INFO - __main__ - Step 77308: {'lr': 0.0002430649748784395, 'samples': 14843136, 'steps': 77307, 'loss/train': 0.5811446309089661} 11/07/2021 08:07:35 - INFO - __main__ - Step 77309: {'lr': 0.00024305967017785283, 'samples': 14843328, 'steps': 77308, 'loss/train': 1.2227129936218262} 11/07/2021 08:07:36 - INFO - __main__ - Step 77310: {'lr': 0.00024305436548039335, 'samples': 14843520, 'steps': 77309, 'loss/train': 1.4248727560043335} 11/07/2021 08:07:36 - INFO - __main__ - Step 77311: {'lr': 0.00024304906078606345, 'samples': 14843712, 'steps': 77310, 'loss/train': 1.3721096515655518} 11/07/2021 08:07:37 - INFO - __main__ - Step 77312: {'lr': 0.00024304375609486567, 'samples': 14843904, 'steps': 77311, 'loss/train': 1.2200723886489868} 11/07/2021 08:07:37 - INFO - __main__ - Step 77313: {'lr': 0.00024303845140680213, 'samples': 14844096, 'steps': 77312, 'loss/train': 1.5251052379608154} 11/07/2021 08:07:37 - INFO - __main__ - Step 77314: {'lr': 0.0002430331467218754, 'samples': 14844288, 'steps': 77313, 'loss/train': 1.2430717945098877} 11/07/2021 08:07:39 - INFO - __main__ - Step 77315: {'lr': 0.0002430278420400878, 'samples': 14844480, 'steps': 77314, 'loss/train': 1.8377927541732788} 11/07/2021 08:07:39 - INFO - __main__ - Step 77316: {'lr': 0.00024302253736144177, 'samples': 14844672, 'steps': 77315, 'loss/train': 1.6643786430358887} 11/07/2021 08:07:39 - INFO - __main__ - Step 77317: {'lr': 0.0002430172326859396, 'samples': 14844864, 'steps': 77316, 'loss/train': 0.595706045627594} 11/07/2021 08:07:40 - INFO - __main__ - Step 77318: {'lr': 0.00024301192801358385, 'samples': 14845056, 'steps': 77317, 'loss/train': 0.6503601670265198} 11/07/2021 08:07:40 - INFO - __main__ - Step 77319: {'lr': 0.00024300662334437675, 'samples': 14845248, 'steps': 77318, 'loss/train': 1.2582428455352783} 11/07/2021 08:07:41 - INFO - __main__ - Step 77320: {'lr': 0.00024300131867832078, 'samples': 14845440, 'steps': 77319, 'loss/train': 0.9650900959968567} 11/07/2021 08:07:41 - INFO - __main__ - Step 77321: {'lr': 0.00024299601401541832, 'samples': 14845632, 'steps': 77320, 'loss/train': 1.3192274570465088} 11/07/2021 08:07:42 - INFO - __main__ - Step 77322: {'lr': 0.00024299070935567175, 'samples': 14845824, 'steps': 77321, 'loss/train': 1.3281534910202026} 11/07/2021 08:07:42 - INFO - __main__ - Step 77323: {'lr': 0.00024298540469908344, 'samples': 14846016, 'steps': 77322, 'loss/train': 1.4194080829620361} 11/07/2021 08:07:42 - INFO - __main__ - Step 77324: {'lr': 0.00024298010004565582, 'samples': 14846208, 'steps': 77323, 'loss/train': 1.2442657947540283} 11/07/2021 08:07:43 - INFO - __main__ - Step 77325: {'lr': 0.00024297479539539126, 'samples': 14846400, 'steps': 77324, 'loss/train': 1.222756266593933} 11/07/2021 08:07:44 - INFO - __main__ - Step 77326: {'lr': 0.00024296949074829223, 'samples': 14846592, 'steps': 77325, 'loss/train': 1.5890272855758667} 11/07/2021 08:07:44 - INFO - __main__ - Step 77327: {'lr': 0.00024296418610436095, 'samples': 14846784, 'steps': 77326, 'loss/train': 0.9064580202102661} 11/07/2021 08:07:44 - INFO - __main__ - Step 77328: {'lr': 0.0002429588814635999, 'samples': 14846976, 'steps': 77327, 'loss/train': 1.403618574142456} 11/07/2021 08:07:45 - INFO - __main__ - Step 77329: {'lr': 0.00024295357682601145, 'samples': 14847168, 'steps': 77328, 'loss/train': 1.5271867513656616} 11/07/2021 08:07:45 - INFO - __main__ - Step 77330: {'lr': 0.00024294827219159803, 'samples': 14847360, 'steps': 77329, 'loss/train': 1.3388699293136597} 11/07/2021 08:07:46 - INFO - __main__ - Step 77331: {'lr': 0.000242942967560362, 'samples': 14847552, 'steps': 77330, 'loss/train': 0.6532538533210754} 11/07/2021 08:07:46 - INFO - __main__ - Step 77332: {'lr': 0.00024293766293230577, 'samples': 14847744, 'steps': 77331, 'loss/train': 1.4468451738357544} 11/07/2021 08:07:47 - INFO - __main__ - Step 77333: {'lr': 0.00024293235830743172, 'samples': 14847936, 'steps': 77332, 'loss/train': 1.5194889307022095} 11/07/2021 08:07:47 - INFO - __main__ - Step 77334: {'lr': 0.00024292705368574223, 'samples': 14848128, 'steps': 77333, 'loss/train': 1.3753728866577148} 11/07/2021 08:07:48 - INFO - __main__ - Step 77335: {'lr': 0.0002429217490672397, 'samples': 14848320, 'steps': 77334, 'loss/train': 1.043643832206726} 11/07/2021 08:07:49 - INFO - __main__ - Step 77336: {'lr': 0.00024291644445192652, 'samples': 14848512, 'steps': 77335, 'loss/train': 1.4041626453399658} 11/07/2021 08:07:49 - INFO - __main__ - Step 77337: {'lr': 0.00024291113983980505, 'samples': 14848704, 'steps': 77336, 'loss/train': 1.399369239807129} 11/07/2021 08:07:49 - INFO - __main__ - Step 77338: {'lr': 0.00024290583523087778, 'samples': 14848896, 'steps': 77337, 'loss/train': 1.4518929719924927} 11/07/2021 08:07:50 - INFO - __main__ - Step 77339: {'lr': 0.00024290053062514712, 'samples': 14849088, 'steps': 77338, 'loss/train': 1.4797405004501343} 11/07/2021 08:07:50 - INFO - __main__ - Step 77340: {'lr': 0.00024289522602261523, 'samples': 14849280, 'steps': 77339, 'loss/train': 1.2330266237258911} 11/07/2021 08:07:51 - INFO - __main__ - Step 77341: {'lr': 0.00024288992142328463, 'samples': 14849472, 'steps': 77340, 'loss/train': 1.3195902109146118} 11/07/2021 08:07:51 - INFO - __main__ - Step 77342: {'lr': 0.00024288461682715778, 'samples': 14849664, 'steps': 77341, 'loss/train': 1.2648746967315674} 11/07/2021 08:07:52 - INFO - __main__ - Step 77343: {'lr': 0.000242879312234237, 'samples': 14849856, 'steps': 77342, 'loss/train': 2.117140531539917} 11/07/2021 08:07:52 - INFO - __main__ - Step 77344: {'lr': 0.00024287400764452465, 'samples': 14850048, 'steps': 77343, 'loss/train': 1.5465422868728638} 11/07/2021 08:07:52 - INFO - __main__ - Step 77345: {'lr': 0.00024286870305802318, 'samples': 14850240, 'steps': 77344, 'loss/train': 1.2472810745239258} 11/07/2021 08:07:53 - INFO - __main__ - Step 77346: {'lr': 0.000242863398474735, 'samples': 14850432, 'steps': 77345, 'loss/train': 0.6695953011512756} 11/07/2021 08:07:54 - INFO - __main__ - Step 77347: {'lr': 0.0002428580938946624, 'samples': 14850624, 'steps': 77346, 'loss/train': 1.3412233591079712} 11/07/2021 08:07:54 - INFO - __main__ - Step 77348: {'lr': 0.0002428527893178079, 'samples': 14850816, 'steps': 77347, 'loss/train': 1.150341272354126} 11/07/2021 08:07:54 - INFO - __main__ - Step 77349: {'lr': 0.00024284748474417376, 'samples': 14851008, 'steps': 77348, 'loss/train': 1.4893953800201416} 11/07/2021 08:07:55 - INFO - __main__ - Step 77350: {'lr': 0.00024284218017376247, 'samples': 14851200, 'steps': 77349, 'loss/train': 1.2568562030792236} 11/07/2021 08:07:56 - INFO - __main__ - Step 77351: {'lr': 0.00024283687560657636, 'samples': 14851392, 'steps': 77350, 'loss/train': 1.237494945526123} 11/07/2021 08:07:56 - INFO - __main__ - Step 77352: {'lr': 0.00024283157104261786, 'samples': 14851584, 'steps': 77351, 'loss/train': 1.4639768600463867} 11/07/2021 08:07:56 - INFO - __main__ - Step 77353: {'lr': 0.00024282626648188947, 'samples': 14851776, 'steps': 77352, 'loss/train': 1.872445821762085} 11/07/2021 08:07:57 - INFO - __main__ - Step 77354: {'lr': 0.0002428209619243933, 'samples': 14851968, 'steps': 77353, 'loss/train': 1.3759032487869263} 11/07/2021 08:07:57 - INFO - __main__ - Step 77355: {'lr': 0.00024281565737013192, 'samples': 14852160, 'steps': 77354, 'loss/train': 1.0399302244186401} 11/07/2021 08:07:58 - INFO - __main__ - Step 77356: {'lr': 0.0002428103528191077, 'samples': 14852352, 'steps': 77355, 'loss/train': 1.1898012161254883} 11/07/2021 08:07:59 - INFO - __main__ - Step 77357: {'lr': 0.00024280504827132302, 'samples': 14852544, 'steps': 77356, 'loss/train': 1.3916815519332886} 11/07/2021 08:07:59 - INFO - __main__ - Step 77358: {'lr': 0.00024279974372678025, 'samples': 14852736, 'steps': 77357, 'loss/train': 2.014653205871582} 11/07/2021 08:07:59 - INFO - __main__ - Step 77359: {'lr': 0.00024279443918548183, 'samples': 14852928, 'steps': 77358, 'loss/train': 1.3783212900161743} 11/07/2021 08:08:00 - INFO - __main__ - Step 77360: {'lr': 0.00024278913464743012, 'samples': 14853120, 'steps': 77359, 'loss/train': 1.390712022781372} 11/07/2021 08:08:01 - INFO - __main__ - Step 77361: {'lr': 0.00024278383011262753, 'samples': 14853312, 'steps': 77360, 'loss/train': 1.2299758195877075} 11/07/2021 08:08:01 - INFO - __main__ - Step 77362: {'lr': 0.0002427785255810764, 'samples': 14853504, 'steps': 77361, 'loss/train': 1.2896214723587036} 11/07/2021 08:08:01 - INFO - __main__ - Step 77363: {'lr': 0.0002427732210527792, 'samples': 14853696, 'steps': 77362, 'loss/train': 1.1762917041778564} 11/07/2021 08:08:02 - INFO - __main__ - Step 77364: {'lr': 0.00024276791652773824, 'samples': 14853888, 'steps': 77363, 'loss/train': 1.112641453742981} 11/07/2021 08:08:02 - INFO - __main__ - Step 77365: {'lr': 0.00024276261200595594, 'samples': 14854080, 'steps': 77364, 'loss/train': 1.3651773929595947} 11/07/2021 08:08:02 - INFO - __main__ - Step 77366: {'lr': 0.0002427573074874348, 'samples': 14854272, 'steps': 77365, 'loss/train': 1.2727223634719849} 11/07/2021 08:08:04 - INFO - __main__ - Step 77367: {'lr': 0.00024275200297217703, 'samples': 14854464, 'steps': 77366, 'loss/train': 1.5395344495773315} 11/07/2021 08:08:04 - INFO - __main__ - Step 77368: {'lr': 0.0002427466984601851, 'samples': 14854656, 'steps': 77367, 'loss/train': 0.7735517621040344} 11/07/2021 08:08:04 - INFO - __main__ - Step 77369: {'lr': 0.0002427413939514614, 'samples': 14854848, 'steps': 77368, 'loss/train': 1.0303922891616821} 11/07/2021 08:08:05 - INFO - __main__ - Step 77370: {'lr': 0.00024273608944600826, 'samples': 14855040, 'steps': 77369, 'loss/train': 1.7977803945541382} 11/07/2021 08:08:05 - INFO - __main__ - Step 77371: {'lr': 0.00024273078494382817, 'samples': 14855232, 'steps': 77370, 'loss/train': 0.9842549562454224} 11/07/2021 08:08:06 - INFO - __main__ - Step 77372: {'lr': 0.00024272548044492346, 'samples': 14855424, 'steps': 77371, 'loss/train': 1.4834753274917603} 11/07/2021 08:08:06 - INFO - __main__ - Step 77373: {'lr': 0.00024272017594929654, 'samples': 14855616, 'steps': 77372, 'loss/train': 1.2977368831634521} 11/07/2021 08:08:07 - INFO - __main__ - Step 77374: {'lr': 0.00024271487145694978, 'samples': 14855808, 'steps': 77373, 'loss/train': 0.9374265074729919} 11/07/2021 08:08:07 - INFO - __main__ - Step 77375: {'lr': 0.00024270956696788561, 'samples': 14856000, 'steps': 77374, 'loss/train': 0.6624443531036377} 11/07/2021 08:08:08 - INFO - __main__ - Step 77376: {'lr': 0.0002427042624821064, 'samples': 14856192, 'steps': 77375, 'loss/train': 1.3912248611450195} 11/07/2021 08:08:09 - INFO - __main__ - Step 77377: {'lr': 0.00024269895799961452, 'samples': 14856384, 'steps': 77376, 'loss/train': 1.4425969123840332} 11/07/2021 08:08:09 - INFO - __main__ - Step 77378: {'lr': 0.0002426936535204124, 'samples': 14856576, 'steps': 77377, 'loss/train': 0.6668657660484314} 11/07/2021 08:08:09 - INFO - __main__ - Step 77379: {'lr': 0.00024268834904450239, 'samples': 14856768, 'steps': 77378, 'loss/train': 1.0640660524368286} 11/07/2021 08:08:10 - INFO - __main__ - Step 77380: {'lr': 0.000242683044571887, 'samples': 14856960, 'steps': 77379, 'loss/train': 1.5248159170150757} 11/07/2021 08:08:10 - INFO - __main__ - Step 77381: {'lr': 0.0002426777401025684, 'samples': 14857152, 'steps': 77380, 'loss/train': 0.8127138614654541} 11/07/2021 08:08:11 - INFO - __main__ - Step 77382: {'lr': 0.00024267243563654912, 'samples': 14857344, 'steps': 77381, 'loss/train': 0.9805072546005249} 11/07/2021 08:08:11 - INFO - __main__ - Step 77383: {'lr': 0.00024266713117383152, 'samples': 14857536, 'steps': 77382, 'loss/train': 1.631851077079773} 11/07/2021 08:08:12 - INFO - __main__ - Step 77384: {'lr': 0.000242661826714418, 'samples': 14857728, 'steps': 77383, 'loss/train': 1.5533703565597534} 11/07/2021 08:08:12 - INFO - __main__ - Step 77385: {'lr': 0.00024265652225831095, 'samples': 14857920, 'steps': 77384, 'loss/train': 1.260006308555603} 11/07/2021 08:08:12 - INFO - __main__ - Step 77386: {'lr': 0.00024265121780551275, 'samples': 14858112, 'steps': 77385, 'loss/train': 1.644644021987915} 11/07/2021 08:08:13 - INFO - __main__ - Step 77387: {'lr': 0.00024264591335602579, 'samples': 14858304, 'steps': 77386, 'loss/train': 1.7311694622039795} 11/07/2021 08:08:14 - INFO - __main__ - Step 77388: {'lr': 0.0002426406089098525, 'samples': 14858496, 'steps': 77387, 'loss/train': 0.09306634217500687} 11/07/2021 08:08:14 - INFO - __main__ - Step 77389: {'lr': 0.0002426353044669952, 'samples': 14858688, 'steps': 77388, 'loss/train': 0.6030405759811401} 11/07/2021 08:08:15 - INFO - __main__ - Step 77390: {'lr': 0.00024263000002745634, 'samples': 14858880, 'steps': 77389, 'loss/train': 1.130959391593933} 11/07/2021 08:08:15 - INFO - __main__ - Step 77391: {'lr': 0.00024262469559123835, 'samples': 14859072, 'steps': 77390, 'loss/train': 1.5165767669677734} 11/07/2021 08:08:15 - INFO - __main__ - Step 77392: {'lr': 0.00024261939115834347, 'samples': 14859264, 'steps': 77391, 'loss/train': 1.4083068370819092} 11/07/2021 08:08:16 - INFO - __main__ - Step 77393: {'lr': 0.00024261408672877425, 'samples': 14859456, 'steps': 77392, 'loss/train': 1.2765570878982544} 11/07/2021 08:08:17 - INFO - __main__ - Step 77394: {'lr': 0.00024260878230253298, 'samples': 14859648, 'steps': 77393, 'loss/train': 1.2262364625930786} 11/07/2021 08:08:17 - INFO - __main__ - Step 77395: {'lr': 0.00024260347787962203, 'samples': 14859840, 'steps': 77394, 'loss/train': 0.04071944206953049} 11/07/2021 08:08:17 - INFO - __main__ - Step 77396: {'lr': 0.00024259817346004387, 'samples': 14860032, 'steps': 77395, 'loss/train': 1.1950758695602417} 11/07/2021 08:08:18 - INFO - __main__ - Step 77397: {'lr': 0.00024259286904380087, 'samples': 14860224, 'steps': 77396, 'loss/train': 1.4006537199020386} 11/07/2021 08:08:19 - INFO - __main__ - Step 77398: {'lr': 0.00024258756463089537, 'samples': 14860416, 'steps': 77397, 'loss/train': 1.4543312788009644} 11/07/2021 08:08:19 - INFO - __main__ - Step 77399: {'lr': 0.00024258226022132984, 'samples': 14860608, 'steps': 77398, 'loss/train': 0.8618334531784058} 11/07/2021 08:08:20 - INFO - __main__ - Step 77400: {'lr': 0.0002425769558151066, 'samples': 14860800, 'steps': 77399, 'loss/train': 1.2702281475067139} 11/07/2021 08:08:20 - INFO - __main__ - Step 77401: {'lr': 0.00024257165141222808, 'samples': 14860992, 'steps': 77400, 'loss/train': 1.5591015815734863} 11/07/2021 08:08:20 - INFO - __main__ - Step 77402: {'lr': 0.00024256634701269673, 'samples': 14861184, 'steps': 77401, 'loss/train': 1.4564236402511597} 11/07/2021 08:08:21 - INFO - __main__ - Step 77403: {'lr': 0.0002425610426165148, 'samples': 14861376, 'steps': 77402, 'loss/train': 1.2984081506729126} 11/07/2021 08:08:22 - INFO - __main__ - Step 77404: {'lr': 0.00024255573822368475, 'samples': 14861568, 'steps': 77403, 'loss/train': 1.5113611221313477} 11/07/2021 08:08:22 - INFO - __main__ - Step 77405: {'lr': 0.00024255043383420897, 'samples': 14861760, 'steps': 77404, 'loss/train': 1.1883043050765991} 11/07/2021 08:08:22 - INFO - __main__ - Step 77406: {'lr': 0.0002425451294480899, 'samples': 14861952, 'steps': 77405, 'loss/train': 1.1631968021392822} 11/07/2021 08:08:23 - INFO - __main__ - Step 77407: {'lr': 0.0002425398250653298, 'samples': 14862144, 'steps': 77406, 'loss/train': 1.1778439283370972} 11/07/2021 08:08:24 - INFO - __main__ - Step 77408: {'lr': 0.00024253452068593117, 'samples': 14862336, 'steps': 77407, 'loss/train': 0.9006516337394714} 11/07/2021 08:08:24 - INFO - __main__ - Step 77409: {'lr': 0.00024252921630989638, 'samples': 14862528, 'steps': 77408, 'loss/train': 1.3970249891281128} 11/07/2021 08:08:24 - INFO - __main__ - Step 77410: {'lr': 0.00024252391193722782, 'samples': 14862720, 'steps': 77409, 'loss/train': 1.6251318454742432} 11/07/2021 08:08:25 - INFO - __main__ - Step 77411: {'lr': 0.00024251860756792782, 'samples': 14862912, 'steps': 77410, 'loss/train': 1.4484665393829346} 11/07/2021 08:08:25 - INFO - __main__ - Step 77412: {'lr': 0.0002425133032019989, 'samples': 14863104, 'steps': 77411, 'loss/train': 1.516265869140625} 11/07/2021 08:08:26 - INFO - __main__ - Step 77413: {'lr': 0.00024250799883944333, 'samples': 14863296, 'steps': 77412, 'loss/train': 1.5801037549972534} 11/07/2021 08:08:27 - INFO - __main__ - Step 77414: {'lr': 0.00024250269448026352, 'samples': 14863488, 'steps': 77413, 'loss/train': 1.532049536705017} 11/07/2021 08:08:27 - INFO - __main__ - Step 77415: {'lr': 0.0002424973901244619, 'samples': 14863680, 'steps': 77414, 'loss/train': 0.31908419728279114} 11/07/2021 08:08:27 - INFO - __main__ - Step 77416: {'lr': 0.00024249208577204083, 'samples': 14863872, 'steps': 77415, 'loss/train': 1.1119234561920166} 11/07/2021 08:08:28 - INFO - __main__ - Step 77417: {'lr': 0.00024248678142300268, 'samples': 14864064, 'steps': 77416, 'loss/train': 1.1886780261993408} 11/07/2021 08:08:29 - INFO - __main__ - Step 77418: {'lr': 0.0002424814770773499, 'samples': 14864256, 'steps': 77417, 'loss/train': 1.1690480709075928} 11/07/2021 08:08:29 - INFO - __main__ - Step 77419: {'lr': 0.00024247617273508485, 'samples': 14864448, 'steps': 77418, 'loss/train': 1.5271391868591309} 11/07/2021 08:08:29 - INFO - __main__ - Step 77420: {'lr': 0.00024247086839620998, 'samples': 14864640, 'steps': 77419, 'loss/train': 1.5641120672225952} 11/07/2021 08:08:30 - INFO - __main__ - Step 77421: {'lr': 0.00024246556406072757, 'samples': 14864832, 'steps': 77420, 'loss/train': 1.291731834411621} 11/07/2021 08:08:30 - INFO - __main__ - Step 77422: {'lr': 0.00024246025972864002, 'samples': 14865024, 'steps': 77421, 'loss/train': 1.2584542036056519} 11/07/2021 08:08:30 - INFO - __main__ - Step 77423: {'lr': 0.00024245495539994985, 'samples': 14865216, 'steps': 77422, 'loss/train': 1.2125024795532227} 11/07/2021 08:08:32 - INFO - __main__ - Step 77424: {'lr': 0.00024244965107465932, 'samples': 14865408, 'steps': 77423, 'loss/train': 1.526087760925293} 11/07/2021 08:08:32 - INFO - __main__ - Step 77425: {'lr': 0.00024244434675277084, 'samples': 14865600, 'steps': 77424, 'loss/train': 1.152956247329712} 11/07/2021 08:08:32 - INFO - __main__ - Step 77426: {'lr': 0.00024243904243428683, 'samples': 14865792, 'steps': 77425, 'loss/train': 1.526873230934143} 11/07/2021 08:08:33 - INFO - __main__ - Step 77427: {'lr': 0.00024243373811920965, 'samples': 14865984, 'steps': 77426, 'loss/train': 1.709824562072754} 11/07/2021 08:08:33 - INFO - __main__ - Step 77428: {'lr': 0.00024242843380754172, 'samples': 14866176, 'steps': 77427, 'loss/train': 1.0717161893844604} 11/07/2021 08:08:34 - INFO - __main__ - Step 77429: {'lr': 0.00024242312949928545, 'samples': 14866368, 'steps': 77428, 'loss/train': 1.2370343208312988} 11/07/2021 08:08:34 - INFO - __main__ - Step 77430: {'lr': 0.00024241782519444317, 'samples': 14866560, 'steps': 77429, 'loss/train': 1.7945356369018555} 11/07/2021 08:08:35 - INFO - __main__ - Step 77431: {'lr': 0.0002424125208930173, 'samples': 14866752, 'steps': 77430, 'loss/train': 1.0924053192138672} 11/07/2021 08:08:35 - INFO - __main__ - Step 77432: {'lr': 0.00024240721659501022, 'samples': 14866944, 'steps': 77431, 'loss/train': 1.3508018255233765} 11/07/2021 08:08:35 - INFO - __main__ - Step 77433: {'lr': 0.0002424019123004244, 'samples': 14867136, 'steps': 77432, 'loss/train': 1.2935858964920044} 11/07/2021 08:08:36 - INFO - __main__ - Step 77434: {'lr': 0.00024239660800926216, 'samples': 14867328, 'steps': 77433, 'loss/train': 1.2697700262069702} 11/07/2021 08:08:37 - INFO - __main__ - Step 77435: {'lr': 0.00024239130372152585, 'samples': 14867520, 'steps': 77434, 'loss/train': 1.337643027305603} 11/07/2021 08:08:37 - INFO - __main__ - Step 77436: {'lr': 0.0002423859994372179, 'samples': 14867712, 'steps': 77435, 'loss/train': 1.483481526374817} 11/07/2021 08:08:37 - INFO - __main__ - Step 77437: {'lr': 0.00024238069515634071, 'samples': 14867904, 'steps': 77436, 'loss/train': 1.6863250732421875} 11/07/2021 08:08:38 - INFO - __main__ - Step 77438: {'lr': 0.00024237539087889663, 'samples': 14868096, 'steps': 77437, 'loss/train': 1.541810393333435} 11/07/2021 08:08:39 - INFO - __main__ - Step 77439: {'lr': 0.0002423700866048881, 'samples': 14868288, 'steps': 77438, 'loss/train': 1.063332438468933} 11/07/2021 08:08:39 - INFO - __main__ - Step 77440: {'lr': 0.00024236478233431746, 'samples': 14868480, 'steps': 77439, 'loss/train': 1.3180826902389526} 11/07/2021 08:08:39 - INFO - __main__ - Step 77441: {'lr': 0.00024235947806718717, 'samples': 14868672, 'steps': 77440, 'loss/train': 1.427065372467041} 11/07/2021 08:08:40 - INFO - __main__ - Step 77442: {'lr': 0.00024235417380349958, 'samples': 14868864, 'steps': 77441, 'loss/train': 0.966543436050415} 11/07/2021 08:08:40 - INFO - __main__ - Step 77443: {'lr': 0.00024234886954325706, 'samples': 14869056, 'steps': 77442, 'loss/train': 1.3691569566726685} 11/07/2021 08:08:41 - INFO - __main__ - Step 77444: {'lr': 0.00024234356528646204, 'samples': 14869248, 'steps': 77443, 'loss/train': 1.429647445678711} 11/07/2021 08:08:42 - INFO - __main__ - Step 77445: {'lr': 0.00024233826103311687, 'samples': 14869440, 'steps': 77444, 'loss/train': 1.6567381620407104} 11/07/2021 08:08:42 - INFO - __main__ - Step 77446: {'lr': 0.000242332956783224, 'samples': 14869632, 'steps': 77445, 'loss/train': 1.4661368131637573} 11/07/2021 08:08:42 - INFO - __main__ - Step 77447: {'lr': 0.00024232765253678584, 'samples': 14869824, 'steps': 77446, 'loss/train': 1.2316001653671265} 11/07/2021 08:08:43 - INFO - __main__ - Step 77448: {'lr': 0.00024232234829380463, 'samples': 14870016, 'steps': 77447, 'loss/train': 1.2894598245620728} 11/07/2021 08:08:44 - INFO - __main__ - Step 77449: {'lr': 0.00024231704405428288, 'samples': 14870208, 'steps': 77448, 'loss/train': 1.2733372449874878} 11/07/2021 08:08:44 - INFO - __main__ - Step 77450: {'lr': 0.00024231173981822292, 'samples': 14870400, 'steps': 77449, 'loss/train': 1.1037789583206177} 11/07/2021 08:08:44 - INFO - __main__ - Step 77451: {'lr': 0.0002423064355856272, 'samples': 14870592, 'steps': 77450, 'loss/train': 1.075829029083252} 11/07/2021 08:08:45 - INFO - __main__ - Step 77452: {'lr': 0.00024230113135649805, 'samples': 14870784, 'steps': 77451, 'loss/train': 1.2175674438476562} 11/07/2021 08:08:45 - INFO - __main__ - Step 77453: {'lr': 0.00024229582713083793, 'samples': 14870976, 'steps': 77452, 'loss/train': 1.1540544033050537} 11/07/2021 08:08:46 - INFO - __main__ - Step 77454: {'lr': 0.00024229052290864915, 'samples': 14871168, 'steps': 77453, 'loss/train': 1.5043772459030151} 11/07/2021 08:08:46 - INFO - __main__ - Step 77455: {'lr': 0.00024228521868993418, 'samples': 14871360, 'steps': 77454, 'loss/train': 1.6147018671035767} 11/07/2021 08:08:47 - INFO - __main__ - Step 77456: {'lr': 0.00024227991447469533, 'samples': 14871552, 'steps': 77455, 'loss/train': 1.2803176641464233} 11/07/2021 08:08:47 - INFO - __main__ - Step 77457: {'lr': 0.00024227461026293505, 'samples': 14871744, 'steps': 77456, 'loss/train': 1.7564617395401} 11/07/2021 08:08:47 - INFO - __main__ - Step 77458: {'lr': 0.0002422693060546557, 'samples': 14871936, 'steps': 77457, 'loss/train': 1.4779802560806274} 11/07/2021 08:08:49 - INFO - __main__ - Step 77459: {'lr': 0.00024226400184985969, 'samples': 14872128, 'steps': 77458, 'loss/train': 1.184459924697876} 11/07/2021 08:08:49 - INFO - __main__ - Step 77460: {'lr': 0.00024225869764854952, 'samples': 14872320, 'steps': 77459, 'loss/train': 1.5996710062026978} 11/07/2021 08:08:49 - INFO - __main__ - Step 77461: {'lr': 0.00024225339345072735, 'samples': 14872512, 'steps': 77460, 'loss/train': 1.2855033874511719} 11/07/2021 08:08:50 - INFO - __main__ - Step 77462: {'lr': 0.00024224808925639568, 'samples': 14872704, 'steps': 77461, 'loss/train': 1.2121318578720093} 11/07/2021 08:08:50 - INFO - __main__ - Step 77463: {'lr': 0.00024224278506555688, 'samples': 14872896, 'steps': 77462, 'loss/train': 1.6253774166107178} 11/07/2021 08:08:51 - INFO - __main__ - Step 77464: {'lr': 0.0002422374808782134, 'samples': 14873088, 'steps': 77463, 'loss/train': 1.1082031726837158} 11/07/2021 08:08:51 - INFO - __main__ - Step 77465: {'lr': 0.00024223217669436757, 'samples': 14873280, 'steps': 77464, 'loss/train': 1.415853500366211} 11/07/2021 08:08:52 - INFO - __main__ - Step 77466: {'lr': 0.0002422268725140218, 'samples': 14873472, 'steps': 77465, 'loss/train': 2.1416683197021484} 11/07/2021 08:08:52 - INFO - __main__ - Step 77467: {'lr': 0.0002422215683371785, 'samples': 14873664, 'steps': 77466, 'loss/train': 1.6791924238204956} 11/07/2021 08:08:52 - INFO - __main__ - Step 77468: {'lr': 0.00024221626416384, 'samples': 14873856, 'steps': 77467, 'loss/train': 1.9172587394714355} 11/07/2021 08:08:53 - INFO - __main__ - Step 77469: {'lr': 0.00024221095999400877, 'samples': 14874048, 'steps': 77468, 'loss/train': 0.6369022130966187} 11/07/2021 08:08:54 - INFO - __main__ - Step 77470: {'lr': 0.00024220565582768714, 'samples': 14874240, 'steps': 77469, 'loss/train': 1.3494449853897095} 11/07/2021 08:08:54 - INFO - __main__ - Step 77471: {'lr': 0.00024220035166487753, 'samples': 14874432, 'steps': 77470, 'loss/train': 1.0098835229873657} 11/07/2021 08:08:54 - INFO - __main__ - Step 77472: {'lr': 0.00024219504750558232, 'samples': 14874624, 'steps': 77471, 'loss/train': 1.4686152935028076} 11/07/2021 08:08:55 - INFO - __main__ - Step 77473: {'lr': 0.0002421897433498039, 'samples': 14874816, 'steps': 77472, 'loss/train': 1.0887479782104492} 11/07/2021 08:08:55 - INFO - __main__ - Step 77474: {'lr': 0.00024218443919754476, 'samples': 14875008, 'steps': 77473, 'loss/train': 1.2022284269332886} 11/07/2021 08:08:56 - INFO - __main__ - Step 77475: {'lr': 0.00024217913504880713, 'samples': 14875200, 'steps': 77474, 'loss/train': 1.1873749494552612} 11/07/2021 08:08:56 - INFO - __main__ - Step 77476: {'lr': 0.0002421738309035934, 'samples': 14875392, 'steps': 77475, 'loss/train': 0.8074208498001099} 11/07/2021 08:08:57 - INFO - __main__ - Step 77477: {'lr': 0.00024216852676190603, 'samples': 14875584, 'steps': 77476, 'loss/train': 1.3483952283859253} 11/07/2021 08:08:57 - INFO - __main__ - Step 77478: {'lr': 0.00024216322262374742, 'samples': 14875776, 'steps': 77477, 'loss/train': 1.5057380199432373} 11/07/2021 08:08:57 - INFO - __main__ - Step 77479: {'lr': 0.00024215791848911994, 'samples': 14875968, 'steps': 77478, 'loss/train': 1.6243833303451538} 11/07/2021 08:08:58 - INFO - __main__ - Step 77480: {'lr': 0.000242152614358026, 'samples': 14876160, 'steps': 77479, 'loss/train': 1.3363271951675415} 11/07/2021 08:08:59 - INFO - __main__ - Step 77481: {'lr': 0.00024214731023046793, 'samples': 14876352, 'steps': 77480, 'loss/train': 1.0012089014053345} 11/07/2021 08:08:59 - INFO - __main__ - Step 77482: {'lr': 0.00024214200610644818, 'samples': 14876544, 'steps': 77481, 'loss/train': 1.5028456449508667} 11/07/2021 08:08:59 - INFO - __main__ - Step 77483: {'lr': 0.00024213670198596914, 'samples': 14876736, 'steps': 77482, 'loss/train': 1.3179248571395874} 11/07/2021 08:09:00 - INFO - __main__ - Step 77484: {'lr': 0.00024213139786903316, 'samples': 14876928, 'steps': 77483, 'loss/train': 1.0938512086868286} 11/07/2021 08:09:01 - INFO - __main__ - Step 77485: {'lr': 0.00024212609375564266, 'samples': 14877120, 'steps': 77484, 'loss/train': 1.8275235891342163} 11/07/2021 08:09:01 - INFO - __main__ - Step 77486: {'lr': 0.0002421207896458, 'samples': 14877312, 'steps': 77485, 'loss/train': 1.089330792427063} 11/07/2021 08:09:02 - INFO - __main__ - Step 77487: {'lr': 0.0002421154855395077, 'samples': 14877504, 'steps': 77486, 'loss/train': 1.3229570388793945} 11/07/2021 08:09:02 - INFO - __main__ - Step 77488: {'lr': 0.00024211018143676795, 'samples': 14877696, 'steps': 77487, 'loss/train': 1.5199757814407349} 11/07/2021 08:09:02 - INFO - __main__ - Step 77489: {'lr': 0.00024210487733758324, 'samples': 14877888, 'steps': 77488, 'loss/train': 1.6385473012924194} 11/07/2021 08:09:03 - INFO - __main__ - Step 77490: {'lr': 0.00024209957324195593, 'samples': 14878080, 'steps': 77489, 'loss/train': 1.0250452756881714} 11/07/2021 08:09:04 - INFO - __main__ - Step 77491: {'lr': 0.0002420942691498884, 'samples': 14878272, 'steps': 77490, 'loss/train': 1.5670082569122314} 11/07/2021 08:09:04 - INFO - __main__ - Step 77492: {'lr': 0.00024208896506138313, 'samples': 14878464, 'steps': 77491, 'loss/train': 1.4692994356155396} 11/07/2021 08:09:04 - INFO - __main__ - Step 77493: {'lr': 0.00024208366097644245, 'samples': 14878656, 'steps': 77492, 'loss/train': 1.61858069896698} 11/07/2021 08:09:05 - INFO - __main__ - Step 77494: {'lr': 0.0002420783568950687, 'samples': 14878848, 'steps': 77493, 'loss/train': 1.2887697219848633} 11/07/2021 08:09:06 - INFO - __main__ - Step 77495: {'lr': 0.00024207305281726435, 'samples': 14879040, 'steps': 77494, 'loss/train': 1.4220049381256104} 11/07/2021 08:09:06 - INFO - __main__ - Step 77496: {'lr': 0.00024206774874303174, 'samples': 14879232, 'steps': 77495, 'loss/train': 1.399061679840088} 11/07/2021 08:09:07 - INFO - __main__ - Step 77497: {'lr': 0.0002420624446723733, 'samples': 14879424, 'steps': 77496, 'loss/train': 1.5045894384384155} 11/07/2021 08:09:07 - INFO - __main__ - Step 77498: {'lr': 0.0002420571406052914, 'samples': 14879616, 'steps': 77497, 'loss/train': 0.8656699657440186} 11/07/2021 08:09:07 - INFO - __main__ - Step 77499: {'lr': 0.00024205183654178844, 'samples': 14879808, 'steps': 77498, 'loss/train': 1.585283637046814} 11/07/2021 08:09:08 - INFO - __main__ - Step 77500: {'lr': 0.00024204653248186678, 'samples': 14880000, 'steps': 77499, 'loss/train': 1.249230980873108} 11/07/2021 08:09:09 - INFO - __main__ - Step 77501: {'lr': 0.00024204122842552895, 'samples': 14880192, 'steps': 77500, 'loss/train': 1.5575621128082275} 11/07/2021 08:09:09 - INFO - __main__ - Step 77502: {'lr': 0.00024203592437277712, 'samples': 14880384, 'steps': 77501, 'loss/train': 1.2867835760116577} 11/07/2021 08:09:09 - INFO - __main__ - Step 77503: {'lr': 0.00024203062032361375, 'samples': 14880576, 'steps': 77502, 'loss/train': 1.6020716428756714} 11/07/2021 08:09:10 - INFO - __main__ - Step 77504: {'lr': 0.0002420253162780413, 'samples': 14880768, 'steps': 77503, 'loss/train': 1.522786021232605} 11/07/2021 08:09:11 - INFO - __main__ - Step 77505: {'lr': 0.00024202001223606206, 'samples': 14880960, 'steps': 77504, 'loss/train': 0.9777777791023254} 11/07/2021 08:09:12 - INFO - __main__ - Step 77506: {'lr': 0.0002420147081976785, 'samples': 14881152, 'steps': 77505, 'loss/train': 1.4096237421035767} 11/07/2021 08:09:12 - INFO - __main__ - Step 77507: {'lr': 0.00024200940416289302, 'samples': 14881344, 'steps': 77506, 'loss/train': 1.087598443031311} 11/07/2021 08:09:12 - INFO - __main__ - Step 77508: {'lr': 0.00024200410013170795, 'samples': 14881536, 'steps': 77507, 'loss/train': 2.9902570247650146} 11/07/2021 08:09:13 - INFO - __main__ - Step 77509: {'lr': 0.00024199879610412573, 'samples': 14881728, 'steps': 77508, 'loss/train': 0.8592475652694702} 11/07/2021 08:09:13 - INFO - __main__ - Step 77510: {'lr': 0.00024199349208014874, 'samples': 14881920, 'steps': 77509, 'loss/train': 0.07055601477622986} 11/07/2021 08:09:14 - INFO - __main__ - Step 77511: {'lr': 0.0002419881880597793, 'samples': 14882112, 'steps': 77510, 'loss/train': 1.3507509231567383} 11/07/2021 08:09:14 - INFO - __main__ - Step 77512: {'lr': 0.0002419828840430199, 'samples': 14882304, 'steps': 77511, 'loss/train': 1.4588669538497925} 11/07/2021 08:09:15 - INFO - __main__ - Step 77513: {'lr': 0.00024197758002987292, 'samples': 14882496, 'steps': 77512, 'loss/train': 1.6152458190917969} 11/07/2021 08:09:15 - INFO - __main__ - Step 77514: {'lr': 0.00024197227602034077, 'samples': 14882688, 'steps': 77513, 'loss/train': 1.1294245719909668} 11/07/2021 08:09:15 - INFO - __main__ - Step 77515: {'lr': 0.00024196697201442572, 'samples': 14882880, 'steps': 77514, 'loss/train': 1.391996145248413} 11/07/2021 08:09:16 - INFO - __main__ - Step 77516: {'lr': 0.0002419616680121302, 'samples': 14883072, 'steps': 77515, 'loss/train': 1.3826345205307007} 11/07/2021 08:09:17 - INFO - __main__ - Step 77517: {'lr': 0.00024195636401345662, 'samples': 14883264, 'steps': 77516, 'loss/train': 1.5241754055023193} 11/07/2021 08:09:17 - INFO - __main__ - Step 77518: {'lr': 0.00024195106001840741, 'samples': 14883456, 'steps': 77517, 'loss/train': 1.324196457862854} 11/07/2021 08:09:17 - INFO - __main__ - Step 77519: {'lr': 0.00024194575602698494, 'samples': 14883648, 'steps': 77518, 'loss/train': 1.5698907375335693} 11/07/2021 08:09:18 - INFO - __main__ - Step 77520: {'lr': 0.00024194045203919156, 'samples': 14883840, 'steps': 77519, 'loss/train': 1.2081401348114014} 11/07/2021 08:09:19 - INFO - __main__ - Step 77521: {'lr': 0.0002419351480550297, 'samples': 14884032, 'steps': 77520, 'loss/train': 1.6827497482299805} 11/07/2021 08:09:19 - INFO - __main__ - Step 77522: {'lr': 0.00024192984407450172, 'samples': 14884224, 'steps': 77521, 'loss/train': 1.2518483400344849} 11/07/2021 08:09:19 - INFO - __main__ - Step 77523: {'lr': 0.00024192454009761002, 'samples': 14884416, 'steps': 77522, 'loss/train': 0.6747350096702576} 11/07/2021 08:09:20 - INFO - __main__ - Step 77524: {'lr': 0.00024191923612435705, 'samples': 14884608, 'steps': 77523, 'loss/train': 1.2610511779785156} 11/07/2021 08:09:20 - INFO - __main__ - Step 77525: {'lr': 0.00024191393215474517, 'samples': 14884800, 'steps': 77524, 'loss/train': 1.6739617586135864} 11/07/2021 08:09:21 - INFO - __main__ - Step 77526: {'lr': 0.00024190862818877667, 'samples': 14884992, 'steps': 77525, 'loss/train': 1.051301121711731} 11/07/2021 08:09:21 - INFO - __main__ - Step 77527: {'lr': 0.00024190332422645408, 'samples': 14885184, 'steps': 77526, 'loss/train': 1.2545245885849} 11/07/2021 08:09:22 - INFO - __main__ - Step 77528: {'lr': 0.00024189802026777972, 'samples': 14885376, 'steps': 77527, 'loss/train': 1.5009537935256958} 11/07/2021 08:09:22 - INFO - __main__ - Step 77529: {'lr': 0.00024189271631275594, 'samples': 14885568, 'steps': 77528, 'loss/train': 1.422996997833252} 11/07/2021 08:09:23 - INFO - __main__ - Step 77530: {'lr': 0.00024188741236138517, 'samples': 14885760, 'steps': 77529, 'loss/train': 1.3015167713165283} 11/07/2021 08:09:24 - INFO - __main__ - Step 77531: {'lr': 0.00024188210841366985, 'samples': 14885952, 'steps': 77530, 'loss/train': 1.2663004398345947} 11/07/2021 08:09:24 - INFO - __main__ - Step 77532: {'lr': 0.0002418768044696123, 'samples': 14886144, 'steps': 77531, 'loss/train': 0.961565375328064} 11/07/2021 08:09:24 - INFO - __main__ - Step 77533: {'lr': 0.00024187150052921495, 'samples': 14886336, 'steps': 77532, 'loss/train': 1.6711211204528809} 11/07/2021 08:09:25 - INFO - __main__ - Step 77534: {'lr': 0.00024186619659248015, 'samples': 14886528, 'steps': 77533, 'loss/train': 1.7845927476882935} 11/07/2021 08:09:25 - INFO - __main__ - Step 77535: {'lr': 0.00024186089265941033, 'samples': 14886720, 'steps': 77534, 'loss/train': 1.232000708580017} 11/07/2021 08:09:25 - INFO - __main__ - Step 77536: {'lr': 0.00024185558873000794, 'samples': 14886912, 'steps': 77535, 'loss/train': 1.3859745264053345} 11/07/2021 08:09:26 - INFO - __main__ - Step 77537: {'lr': 0.0002418502848042752, 'samples': 14887104, 'steps': 77536, 'loss/train': 1.3946055173873901} 11/07/2021 08:09:27 - INFO - __main__ - Step 77538: {'lr': 0.00024184498088221463, 'samples': 14887296, 'steps': 77537, 'loss/train': 1.0912150144577026} 11/07/2021 08:09:27 - INFO - __main__ - Step 77539: {'lr': 0.00024183967696382857, 'samples': 14887488, 'steps': 77538, 'loss/train': 1.7170493602752686} 11/07/2021 08:09:28 - INFO - __main__ - Step 77540: {'lr': 0.00024183437304911942, 'samples': 14887680, 'steps': 77539, 'loss/train': 1.6326180696487427} 11/07/2021 08:09:28 - INFO - __main__ - Step 77541: {'lr': 0.00024182906913808967, 'samples': 14887872, 'steps': 77540, 'loss/train': 1.6075845956802368} 11/07/2021 08:09:29 - INFO - __main__ - Step 77542: {'lr': 0.00024182376523074152, 'samples': 14888064, 'steps': 77541, 'loss/train': 1.2952436208724976} 11/07/2021 08:09:29 - INFO - __main__ - Step 77543: {'lr': 0.00024181846132707746, 'samples': 14888256, 'steps': 77542, 'loss/train': 0.986761748790741} 11/07/2021 08:09:30 - INFO - __main__ - Step 77544: {'lr': 0.00024181315742709988, 'samples': 14888448, 'steps': 77543, 'loss/train': 1.5235687494277954} 11/07/2021 08:09:30 - INFO - __main__ - Step 77545: {'lr': 0.00024180785353081116, 'samples': 14888640, 'steps': 77544, 'loss/train': 1.597390055656433} 11/07/2021 08:09:31 - INFO - __main__ - Step 77546: {'lr': 0.0002418025496382137, 'samples': 14888832, 'steps': 77545, 'loss/train': 1.7199922800064087} 11/07/2021 08:09:31 - INFO - __main__ - Step 77547: {'lr': 0.00024179724574930998, 'samples': 14889024, 'steps': 77546, 'loss/train': 1.025982141494751} 11/07/2021 08:09:32 - INFO - __main__ - Step 77548: {'lr': 0.0002417919418641022, 'samples': 14889216, 'steps': 77547, 'loss/train': 0.8809354305267334} 11/07/2021 08:09:32 - INFO - __main__ - Step 77549: {'lr': 0.00024178663798259283, 'samples': 14889408, 'steps': 77548, 'loss/train': 1.4300669431686401} 11/07/2021 08:09:33 - INFO - __main__ - Step 77550: {'lr': 0.00024178133410478428, 'samples': 14889600, 'steps': 77549, 'loss/train': 0.35378164052963257} 11/07/2021 08:09:33 - INFO - __main__ - Step 77551: {'lr': 0.00024177603023067896, 'samples': 14889792, 'steps': 77550, 'loss/train': 1.6987085342407227} 11/07/2021 08:09:34 - INFO - __main__ - Step 77552: {'lr': 0.00024177072636027923, 'samples': 14889984, 'steps': 77551, 'loss/train': 1.5674664974212646} 11/07/2021 08:09:35 - INFO - __main__ - Step 77553: {'lr': 0.00024176542249358747, 'samples': 14890176, 'steps': 77552, 'loss/train': 1.6314823627471924} 11/07/2021 08:09:35 - INFO - __main__ - Step 77554: {'lr': 0.00024176011863060611, 'samples': 14890368, 'steps': 77553, 'loss/train': 1.3035184144973755} 11/07/2021 08:09:35 - INFO - __main__ - Step 77555: {'lr': 0.0002417548147713375, 'samples': 14890560, 'steps': 77554, 'loss/train': 1.4352474212646484} 11/07/2021 08:09:36 - INFO - __main__ - Step 77556: {'lr': 0.00024174951091578405, 'samples': 14890752, 'steps': 77555, 'loss/train': 1.113909125328064} 11/07/2021 08:09:36 - INFO - __main__ - Step 77557: {'lr': 0.0002417442070639481, 'samples': 14890944, 'steps': 77556, 'loss/train': 1.2028870582580566} 11/07/2021 08:09:37 - INFO - __main__ - Step 77558: {'lr': 0.00024173890321583217, 'samples': 14891136, 'steps': 77557, 'loss/train': 0.6919180750846863} 11/07/2021 08:09:37 - INFO - __main__ - Step 77559: {'lr': 0.00024173359937143852, 'samples': 14891328, 'steps': 77558, 'loss/train': 1.5722498893737793} 11/07/2021 08:09:38 - INFO - __main__ - Step 77560: {'lr': 0.00024172829553076956, 'samples': 14891520, 'steps': 77559, 'loss/train': 1.3963934183120728} 11/07/2021 08:09:38 - INFO - __main__ - Step 77561: {'lr': 0.00024172299169382768, 'samples': 14891712, 'steps': 77560, 'loss/train': 1.5849003791809082} 11/07/2021 08:09:38 - INFO - __main__ - Step 77562: {'lr': 0.00024171768786061533, 'samples': 14891904, 'steps': 77561, 'loss/train': 1.2256088256835938} 11/07/2021 08:09:39 - INFO - __main__ - Step 77563: {'lr': 0.00024171238403113485, 'samples': 14892096, 'steps': 77562, 'loss/train': 1.2547581195831299} 11/07/2021 08:09:40 - INFO - __main__ - Step 77564: {'lr': 0.00024170708020538866, 'samples': 14892288, 'steps': 77563, 'loss/train': 1.5910018682479858} 11/07/2021 08:09:40 - INFO - __main__ - Step 77565: {'lr': 0.0002417017763833791, 'samples': 14892480, 'steps': 77564, 'loss/train': 1.2703773975372314} 11/07/2021 08:09:40 - INFO - __main__ - Step 77566: {'lr': 0.0002416964725651086, 'samples': 14892672, 'steps': 77565, 'loss/train': 0.6872929334640503} 11/07/2021 08:09:41 - INFO - __main__ - Step 77567: {'lr': 0.00024169116875057952, 'samples': 14892864, 'steps': 77566, 'loss/train': 1.5949677228927612} 11/07/2021 08:09:41 - INFO - __main__ - Step 77568: {'lr': 0.00024168586493979438, 'samples': 14893056, 'steps': 77567, 'loss/train': 1.225726842880249} 11/07/2021 08:09:42 - INFO - __main__ - Step 77569: {'lr': 0.00024168056113275544, 'samples': 14893248, 'steps': 77568, 'loss/train': 1.237242341041565} 11/07/2021 08:09:43 - INFO - __main__ - Step 77570: {'lr': 0.00024167525732946506, 'samples': 14893440, 'steps': 77569, 'loss/train': 1.6555200815200806} 11/07/2021 08:09:43 - INFO - __main__ - Step 77571: {'lr': 0.00024166995352992567, 'samples': 14893632, 'steps': 77570, 'loss/train': 1.3607689142227173} 11/07/2021 08:09:43 - INFO - __main__ - Step 77572: {'lr': 0.00024166464973413964, 'samples': 14893824, 'steps': 77571, 'loss/train': 1.4480373859405518} 11/07/2021 08:09:44 - INFO - __main__ - Step 77573: {'lr': 0.00024165934594210943, 'samples': 14894016, 'steps': 77572, 'loss/train': 1.6478627920150757} 11/07/2021 08:09:45 - INFO - __main__ - Step 77574: {'lr': 0.0002416540421538374, 'samples': 14894208, 'steps': 77573, 'loss/train': 1.2738831043243408} 11/07/2021 08:09:46 - INFO - __main__ - Step 77575: {'lr': 0.00024164873836932587, 'samples': 14894400, 'steps': 77574, 'loss/train': 0.8490543365478516} 11/07/2021 08:09:46 - INFO - __main__ - Step 77576: {'lr': 0.00024164343458857735, 'samples': 14894592, 'steps': 77575, 'loss/train': 0.513917863368988} 11/07/2021 08:09:46 - INFO - __main__ - Step 77577: {'lr': 0.00024163813081159413, 'samples': 14894784, 'steps': 77576, 'loss/train': 1.5292097330093384} 11/07/2021 08:09:47 - INFO - __main__ - Step 77578: {'lr': 0.00024163282703837868, 'samples': 14894976, 'steps': 77577, 'loss/train': 1.0532124042510986} 11/07/2021 08:09:47 - INFO - __main__ - Step 77579: {'lr': 0.00024162752326893335, 'samples': 14895168, 'steps': 77578, 'loss/train': 1.57536780834198} 11/07/2021 08:09:48 - INFO - __main__ - Step 77580: {'lr': 0.0002416222195032605, 'samples': 14895360, 'steps': 77579, 'loss/train': 1.5633000135421753} 11/07/2021 08:09:48 - INFO - __main__ - Step 77581: {'lr': 0.00024161691574136265, 'samples': 14895552, 'steps': 77580, 'loss/train': 1.3300788402557373} 11/07/2021 08:09:49 - INFO - __main__ - Step 77582: {'lr': 0.000241611611983242, 'samples': 14895744, 'steps': 77581, 'loss/train': 1.364041805267334} 11/07/2021 08:09:49 - INFO - __main__ - Step 77583: {'lr': 0.000241606308228901, 'samples': 14895936, 'steps': 77582, 'loss/train': 1.5614112615585327} 11/07/2021 08:09:50 - INFO - __main__ - Step 77584: {'lr': 0.0002416010044783421, 'samples': 14896128, 'steps': 77583, 'loss/train': 1.5323536396026611} 11/07/2021 08:09:50 - INFO - __main__ - Step 77585: {'lr': 0.00024159570073156765, 'samples': 14896320, 'steps': 77584, 'loss/train': 1.2818206548690796} 11/07/2021 08:09:51 - INFO - __main__ - Step 77586: {'lr': 0.00024159039698858005, 'samples': 14896512, 'steps': 77585, 'loss/train': 1.9564303159713745} 11/07/2021 08:09:51 - INFO - __main__ - Step 77587: {'lr': 0.00024158509324938168, 'samples': 14896704, 'steps': 77586, 'loss/train': 1.1357886791229248} 11/07/2021 08:09:51 - INFO - __main__ - Step 77588: {'lr': 0.00024157978951397493, 'samples': 14896896, 'steps': 77587, 'loss/train': 1.2902246713638306} 11/07/2021 08:09:52 - INFO - __main__ - Step 77589: {'lr': 0.00024157448578236221, 'samples': 14897088, 'steps': 77588, 'loss/train': 2.0813381671905518} 11/07/2021 08:09:53 - INFO - __main__ - Step 77590: {'lr': 0.00024156918205454588, 'samples': 14897280, 'steps': 77589, 'loss/train': 1.7141979932785034} 11/07/2021 08:09:53 - INFO - __main__ - Step 77591: {'lr': 0.00024156387833052838, 'samples': 14897472, 'steps': 77590, 'loss/train': 1.8478772640228271} 11/07/2021 08:09:53 - INFO - __main__ - Step 77592: {'lr': 0.00024155857461031203, 'samples': 14897664, 'steps': 77591, 'loss/train': 1.2606786489486694} 11/07/2021 08:09:54 - INFO - __main__ - Step 77593: {'lr': 0.00024155327089389928, 'samples': 14897856, 'steps': 77592, 'loss/train': 1.152688980102539} 11/07/2021 08:09:54 - INFO - __main__ - Step 77594: {'lr': 0.0002415479671812925, 'samples': 14898048, 'steps': 77593, 'loss/train': 0.9224274754524231} 11/07/2021 08:09:55 - INFO - __main__ - Step 77595: {'lr': 0.00024154266347249415, 'samples': 14898240, 'steps': 77594, 'loss/train': 1.5903297662734985} 11/07/2021 08:09:56 - INFO - __main__ - Step 77596: {'lr': 0.00024153735976750645, 'samples': 14898432, 'steps': 77595, 'loss/train': 1.4361460208892822} 11/07/2021 08:09:56 - INFO - __main__ - Step 77597: {'lr': 0.00024153205606633192, 'samples': 14898624, 'steps': 77596, 'loss/train': 2.8967151641845703} 11/07/2021 08:09:56 - INFO - __main__ - Step 77598: {'lr': 0.00024152675236897286, 'samples': 14898816, 'steps': 77597, 'loss/train': 1.542585849761963} 11/07/2021 08:09:57 - INFO - __main__ - Step 77599: {'lr': 0.00024152144867543176, 'samples': 14899008, 'steps': 77598, 'loss/train': 1.2802128791809082} 11/07/2021 08:09:58 - INFO - __main__ - Step 77600: {'lr': 0.00024151614498571096, 'samples': 14899200, 'steps': 77599, 'loss/train': 1.390236258506775} 11/07/2021 08:09:58 - INFO - __main__ - Step 77601: {'lr': 0.0002415108412998128, 'samples': 14899392, 'steps': 77600, 'loss/train': 1.083944320678711} 11/07/2021 08:09:58 - INFO - __main__ - Step 77602: {'lr': 0.0002415055376177398, 'samples': 14899584, 'steps': 77601, 'loss/train': 1.5750551223754883} 11/07/2021 08:09:59 - INFO - __main__ - Step 77603: {'lr': 0.00024150023393949426, 'samples': 14899776, 'steps': 77602, 'loss/train': 1.7947691679000854} 11/07/2021 08:09:59 - INFO - __main__ - Step 77604: {'lr': 0.00024149493026507854, 'samples': 14899968, 'steps': 77603, 'loss/train': 0.9161434769630432} 11/07/2021 08:10:00 - INFO - __main__ - Step 77605: {'lr': 0.00024148962659449507, 'samples': 14900160, 'steps': 77604, 'loss/train': 1.1722602844238281} 11/07/2021 08:10:00 - INFO - __main__ - Step 77606: {'lr': 0.0002414843229277463, 'samples': 14900352, 'steps': 77605, 'loss/train': 1.1815069913864136} 11/07/2021 08:10:01 - INFO - __main__ - Step 77607: {'lr': 0.00024147901926483453, 'samples': 14900544, 'steps': 77606, 'loss/train': 1.2523226737976074} 11/07/2021 08:10:01 - INFO - __main__ - Step 77608: {'lr': 0.00024147371560576228, 'samples': 14900736, 'steps': 77607, 'loss/train': 0.9681571125984192} 11/07/2021 08:10:01 - INFO - __main__ - Step 77609: {'lr': 0.00024146841195053176, 'samples': 14900928, 'steps': 77608, 'loss/train': 2.0443220138549805} 11/07/2021 08:10:03 - INFO - __main__ - Step 77610: {'lr': 0.00024146310829914542, 'samples': 14901120, 'steps': 77609, 'loss/train': 1.8939778804779053} 11/07/2021 08:10:03 - INFO - __main__ - Step 77611: {'lr': 0.0002414578046516057, 'samples': 14901312, 'steps': 77610, 'loss/train': 1.482336163520813} 11/07/2021 08:10:03 - INFO - __main__ - Step 77612: {'lr': 0.00024145250100791494, 'samples': 14901504, 'steps': 77611, 'loss/train': 1.8243052959442139} 11/07/2021 08:10:04 - INFO - __main__ - Step 77613: {'lr': 0.00024144719736807552, 'samples': 14901696, 'steps': 77612, 'loss/train': 0.43176794052124023} 11/07/2021 08:10:04 - INFO - __main__ - Step 77614: {'lr': 0.00024144189373208992, 'samples': 14901888, 'steps': 77613, 'loss/train': 1.4135652780532837} 11/07/2021 08:10:04 - INFO - __main__ - Step 77615: {'lr': 0.00024143659009996044, 'samples': 14902080, 'steps': 77614, 'loss/train': 1.104856252670288} 11/07/2021 08:10:05 - INFO - __main__ - Step 77616: {'lr': 0.00024143128647168948, 'samples': 14902272, 'steps': 77615, 'loss/train': 1.135025143623352} 11/07/2021 08:10:06 - INFO - __main__ - Step 77617: {'lr': 0.00024142598284727947, 'samples': 14902464, 'steps': 77616, 'loss/train': 1.445526123046875} 11/07/2021 08:10:06 - INFO - __main__ - Step 77618: {'lr': 0.0002414206792267328, 'samples': 14902656, 'steps': 77617, 'loss/train': 1.5975595712661743} 11/07/2021 08:10:06 - INFO - __main__ - Step 77619: {'lr': 0.00024141537561005183, 'samples': 14902848, 'steps': 77618, 'loss/train': 1.730878472328186} 11/07/2021 08:10:07 - INFO - __main__ - Step 77620: {'lr': 0.00024141007199723893, 'samples': 14903040, 'steps': 77619, 'loss/train': 1.3323757648468018} 11/07/2021 08:10:08 - INFO - __main__ - Step 77621: {'lr': 0.00024140476838829656, 'samples': 14903232, 'steps': 77620, 'loss/train': 1.3629229068756104} 11/07/2021 08:10:08 - INFO - __main__ - Step 77622: {'lr': 0.00024139946478322717, 'samples': 14903424, 'steps': 77621, 'loss/train': 0.566703736782074} 11/07/2021 08:10:08 - INFO - __main__ - Step 77623: {'lr': 0.00024139416118203292, 'samples': 14903616, 'steps': 77622, 'loss/train': 1.7206910848617554} 11/07/2021 08:10:09 - INFO - __main__ - Step 77624: {'lr': 0.00024138885758471633, 'samples': 14903808, 'steps': 77623, 'loss/train': 1.35834801197052} 11/07/2021 08:10:09 - INFO - __main__ - Step 77625: {'lr': 0.00024138355399127981, 'samples': 14904000, 'steps': 77624, 'loss/train': 0.5947864055633545} 11/07/2021 08:10:10 - INFO - __main__ - Step 77626: {'lr': 0.0002413782504017257, 'samples': 14904192, 'steps': 77625, 'loss/train': 2.2394886016845703} 11/07/2021 08:10:11 - INFO - __main__ - Step 77627: {'lr': 0.00024137294681605642, 'samples': 14904384, 'steps': 77626, 'loss/train': 1.1752034425735474} 11/07/2021 08:10:11 - INFO - __main__ - Step 77628: {'lr': 0.00024136764323427437, 'samples': 14904576, 'steps': 77627, 'loss/train': 1.7286384105682373} 11/07/2021 08:10:11 - INFO - __main__ - Step 77629: {'lr': 0.00024136233965638194, 'samples': 14904768, 'steps': 77628, 'loss/train': 1.3744635581970215} 11/07/2021 08:10:12 - INFO - __main__ - Step 77630: {'lr': 0.00024135703608238148, 'samples': 14904960, 'steps': 77629, 'loss/train': 1.3076014518737793} 11/07/2021 08:10:13 - INFO - __main__ - Step 77631: {'lr': 0.00024135173251227545, 'samples': 14905152, 'steps': 77630, 'loss/train': 0.8511968851089478} 11/07/2021 08:10:13 - INFO - __main__ - Step 77632: {'lr': 0.00024134642894606612, 'samples': 14905344, 'steps': 77631, 'loss/train': 1.345628023147583} 11/07/2021 08:10:13 - INFO - __main__ - Step 77633: {'lr': 0.000241341125383756, 'samples': 14905536, 'steps': 77632, 'loss/train': 1.0771452188491821} 11/07/2021 08:10:14 - INFO - __main__ - Step 77634: {'lr': 0.00024133582182534743, 'samples': 14905728, 'steps': 77633, 'loss/train': 1.174552321434021} 11/07/2021 08:10:14 - INFO - __main__ - Step 77635: {'lr': 0.00024133051827084293, 'samples': 14905920, 'steps': 77634, 'loss/train': 1.5464860200881958} 11/07/2021 08:10:15 - INFO - __main__ - Step 77636: {'lr': 0.00024132521472024465, 'samples': 14906112, 'steps': 77635, 'loss/train': 1.9600989818572998} 11/07/2021 08:10:15 - INFO - __main__ - Step 77637: {'lr': 0.0002413199111735551, 'samples': 14906304, 'steps': 77636, 'loss/train': 1.0708067417144775} 11/07/2021 08:10:16 - INFO - __main__ - Step 77638: {'lr': 0.00024131460763077665, 'samples': 14906496, 'steps': 77637, 'loss/train': 1.9043291807174683} 11/07/2021 08:10:16 - INFO - __main__ - Step 77639: {'lr': 0.0002413093040919117, 'samples': 14906688, 'steps': 77638, 'loss/train': 1.26895272731781} 11/07/2021 08:10:17 - INFO - __main__ - Step 77640: {'lr': 0.00024130400055696264, 'samples': 14906880, 'steps': 77639, 'loss/train': 1.382437825202942} 11/07/2021 08:10:17 - INFO - __main__ - Step 77641: {'lr': 0.00024129869702593188, 'samples': 14907072, 'steps': 77640, 'loss/train': 2.1098713874816895} 11/07/2021 08:10:18 - INFO - __main__ - Step 77642: {'lr': 0.00024129339349882175, 'samples': 14907264, 'steps': 77641, 'loss/train': 1.1318111419677734} 11/07/2021 08:10:18 - INFO - __main__ - Step 77643: {'lr': 0.00024128808997563473, 'samples': 14907456, 'steps': 77642, 'loss/train': 1.4998935461044312} 11/07/2021 08:10:19 - INFO - __main__ - Step 77644: {'lr': 0.00024128278645637317, 'samples': 14907648, 'steps': 77643, 'loss/train': 1.090847134590149} 11/07/2021 08:10:19 - INFO - __main__ - Step 77645: {'lr': 0.00024127748294103943, 'samples': 14907840, 'steps': 77644, 'loss/train': 1.8911464214324951} 11/07/2021 08:10:19 - INFO - __main__ - Step 77646: {'lr': 0.00024127217942963592, 'samples': 14908032, 'steps': 77645, 'loss/train': 1.266910195350647} 11/07/2021 08:10:20 - INFO - __main__ - Step 77647: {'lr': 0.00024126687592216503, 'samples': 14908224, 'steps': 77646, 'loss/train': 1.512333869934082} 11/07/2021 08:10:21 - INFO - __main__ - Step 77648: {'lr': 0.00024126157241862925, 'samples': 14908416, 'steps': 77647, 'loss/train': 1.140256404876709} 11/07/2021 08:10:21 - INFO - __main__ - Step 77649: {'lr': 0.00024125626891903078, 'samples': 14908608, 'steps': 77648, 'loss/train': 1.6800087690353394} 11/07/2021 08:10:21 - INFO - __main__ - Step 77650: {'lr': 0.00024125096542337211, 'samples': 14908800, 'steps': 77649, 'loss/train': 1.0826760530471802} 11/07/2021 08:10:22 - INFO - __main__ - Step 77651: {'lr': 0.0002412456619316556, 'samples': 14908992, 'steps': 77650, 'loss/train': 1.502442717552185} 11/07/2021 08:10:23 - INFO - __main__ - Step 77652: {'lr': 0.00024124035844388367, 'samples': 14909184, 'steps': 77651, 'loss/train': 1.109333872795105} 11/07/2021 08:10:23 - INFO - __main__ - Step 77653: {'lr': 0.00024123505496005868, 'samples': 14909376, 'steps': 77652, 'loss/train': 1.6737326383590698} 11/07/2021 08:10:23 - INFO - __main__ - Step 77654: {'lr': 0.00024122975148018304, 'samples': 14909568, 'steps': 77653, 'loss/train': 1.2667243480682373} 11/07/2021 08:10:24 - INFO - __main__ - Step 77655: {'lr': 0.00024122444800425919, 'samples': 14909760, 'steps': 77654, 'loss/train': 1.8813623189926147} 11/07/2021 08:10:24 - INFO - __main__ - Step 77656: {'lr': 0.0002412191445322894, 'samples': 14909952, 'steps': 77655, 'loss/train': 1.0920720100402832} 11/07/2021 08:10:25 - INFO - __main__ - Step 77657: {'lr': 0.0002412138410642762, 'samples': 14910144, 'steps': 77656, 'loss/train': 1.3318971395492554} 11/07/2021 08:10:25 - INFO - __main__ - Step 77658: {'lr': 0.00024120853760022185, 'samples': 14910336, 'steps': 77657, 'loss/train': 1.1441189050674438} 11/07/2021 08:10:26 - INFO - __main__ - Step 77659: {'lr': 0.00024120323414012886, 'samples': 14910528, 'steps': 77658, 'loss/train': 1.0201956033706665} 11/07/2021 08:10:26 - INFO - __main__ - Step 77660: {'lr': 0.0002411979306839995, 'samples': 14910720, 'steps': 77659, 'loss/train': 1.3437641859054565} 11/07/2021 08:10:26 - INFO - __main__ - Step 77661: {'lr': 0.00024119262723183623, 'samples': 14910912, 'steps': 77660, 'loss/train': 1.2149600982666016} 11/07/2021 08:10:28 - INFO - __main__ - Step 77662: {'lr': 0.0002411873237836415, 'samples': 14911104, 'steps': 77661, 'loss/train': 1.3661493062973022} 11/07/2021 08:10:28 - INFO - __main__ - Step 77663: {'lr': 0.00024118202033941756, 'samples': 14911296, 'steps': 77662, 'loss/train': 1.4869979619979858} 11/07/2021 08:10:28 - INFO - __main__ - Step 77664: {'lr': 0.00024117671689916683, 'samples': 14911488, 'steps': 77663, 'loss/train': 1.4226006269454956} 11/07/2021 08:10:29 - INFO - __main__ - Step 77665: {'lr': 0.00024117141346289176, 'samples': 14911680, 'steps': 77664, 'loss/train': 1.630541205406189} 11/07/2021 08:10:29 - INFO - __main__ - Step 77666: {'lr': 0.0002411661100305947, 'samples': 14911872, 'steps': 77665, 'loss/train': 1.2832515239715576} 11/07/2021 08:10:29 - INFO - __main__ - Step 77667: {'lr': 0.0002411608066022781, 'samples': 14912064, 'steps': 77666, 'loss/train': 1.242453932762146} 11/07/2021 08:10:31 - INFO - __main__ - Step 77668: {'lr': 0.00024115550317794428, 'samples': 14912256, 'steps': 77667, 'loss/train': 1.5849342346191406} 11/07/2021 08:10:31 - INFO - __main__ - Step 77669: {'lr': 0.00024115019975759564, 'samples': 14912448, 'steps': 77668, 'loss/train': 0.9399653673171997} 11/07/2021 08:10:31 - INFO - __main__ - Step 77670: {'lr': 0.00024114489634123463, 'samples': 14912640, 'steps': 77669, 'loss/train': 1.3855512142181396} 11/07/2021 08:10:32 - INFO - __main__ - Step 77671: {'lr': 0.00024113959292886356, 'samples': 14912832, 'steps': 77670, 'loss/train': 1.0337145328521729} 11/07/2021 08:10:32 - INFO - __main__ - Step 77672: {'lr': 0.00024113428952048487, 'samples': 14913024, 'steps': 77671, 'loss/train': 1.7450519800186157} 11/07/2021 08:10:32 - INFO - __main__ - Step 77673: {'lr': 0.00024112898611610087, 'samples': 14913216, 'steps': 77672, 'loss/train': 1.290311574935913} 11/07/2021 08:10:33 - INFO - __main__ - Step 77674: {'lr': 0.00024112368271571406, 'samples': 14913408, 'steps': 77673, 'loss/train': 1.6603467464447021} 11/07/2021 08:10:34 - INFO - __main__ - Step 77675: {'lr': 0.00024111837931932683, 'samples': 14913600, 'steps': 77674, 'loss/train': 1.466395616531372} 11/07/2021 08:10:34 - INFO - __main__ - Step 77676: {'lr': 0.00024111307592694146, 'samples': 14913792, 'steps': 77675, 'loss/train': 1.316085696220398} 11/07/2021 08:10:34 - INFO - __main__ - Step 77677: {'lr': 0.00024110777253856042, 'samples': 14913984, 'steps': 77676, 'loss/train': 1.456813097000122} 11/07/2021 08:10:35 - INFO - __main__ - Step 77678: {'lr': 0.00024110246915418605, 'samples': 14914176, 'steps': 77677, 'loss/train': 1.400478720664978} 11/07/2021 08:10:36 - INFO - __main__ - Step 77679: {'lr': 0.0002410971657738208, 'samples': 14914368, 'steps': 77678, 'loss/train': 1.2462760210037231} 11/07/2021 08:10:36 - INFO - __main__ - Step 77680: {'lr': 0.000241091862397467, 'samples': 14914560, 'steps': 77679, 'loss/train': 1.2672793865203857} 11/07/2021 08:10:37 - INFO - __main__ - Step 77681: {'lr': 0.00024108655902512714, 'samples': 14914752, 'steps': 77680, 'loss/train': 1.7853217124938965} 11/07/2021 08:10:37 - INFO - __main__ - Step 77682: {'lr': 0.0002410812556568035, 'samples': 14914944, 'steps': 77681, 'loss/train': 1.657317876815796} 11/07/2021 08:10:37 - INFO - __main__ - Step 77683: {'lr': 0.00024107595229249848, 'samples': 14915136, 'steps': 77682, 'loss/train': 1.8959122896194458} 11/07/2021 08:10:38 - INFO - __main__ - Step 77684: {'lr': 0.0002410706489322145, 'samples': 14915328, 'steps': 77683, 'loss/train': 2.2136619091033936} 11/07/2021 08:10:39 - INFO - __main__ - Step 77685: {'lr': 0.00024106534557595397, 'samples': 14915520, 'steps': 77684, 'loss/train': 1.26851224899292} 11/07/2021 08:10:39 - INFO - __main__ - Step 77686: {'lr': 0.00024106004222371926, 'samples': 14915712, 'steps': 77685, 'loss/train': 2.387057065963745} 11/07/2021 08:10:39 - INFO - __main__ - Step 77687: {'lr': 0.0002410547388755127, 'samples': 14915904, 'steps': 77686, 'loss/train': 1.1857188940048218} 11/07/2021 08:10:40 - INFO - __main__ - Step 77688: {'lr': 0.0002410494355313368, 'samples': 14916096, 'steps': 77687, 'loss/train': 1.491365909576416} 11/07/2021 08:10:41 - INFO - __main__ - Step 77689: {'lr': 0.0002410441321911939, 'samples': 14916288, 'steps': 77688, 'loss/train': 1.706040859222412} 11/07/2021 08:10:41 - INFO - __main__ - Step 77690: {'lr': 0.00024103882885508638, 'samples': 14916480, 'steps': 77689, 'loss/train': 1.3399722576141357} 11/07/2021 08:10:41 - INFO - __main__ - Step 77691: {'lr': 0.00024103352552301658, 'samples': 14916672, 'steps': 77690, 'loss/train': 1.571684718132019} 11/07/2021 08:10:42 - INFO - __main__ - Step 77692: {'lr': 0.000241028222194987, 'samples': 14916864, 'steps': 77691, 'loss/train': 1.179751992225647} 11/07/2021 08:10:42 - INFO - __main__ - Step 77693: {'lr': 0.0002410229188709999, 'samples': 14917056, 'steps': 77692, 'loss/train': 1.4590928554534912} 11/07/2021 08:10:43 - INFO - __main__ - Step 77694: {'lr': 0.00024101761555105772, 'samples': 14917248, 'steps': 77693, 'loss/train': 1.2167140245437622} 11/07/2021 08:10:44 - INFO - __main__ - Step 77695: {'lr': 0.0002410123122351629, 'samples': 14917440, 'steps': 77694, 'loss/train': 1.5716336965560913} 11/07/2021 08:10:44 - INFO - __main__ - Step 77696: {'lr': 0.0002410070089233178, 'samples': 14917632, 'steps': 77695, 'loss/train': 1.6846543550491333} 11/07/2021 08:10:44 - INFO - __main__ - Step 77697: {'lr': 0.00024100170561552477, 'samples': 14917824, 'steps': 77696, 'loss/train': 1.1072500944137573} 11/07/2021 08:10:45 - INFO - __main__ - Step 77698: {'lr': 0.00024099640231178623, 'samples': 14918016, 'steps': 77697, 'loss/train': 1.4452025890350342} 11/07/2021 08:10:45 - INFO - __main__ - Step 77699: {'lr': 0.00024099109901210458, 'samples': 14918208, 'steps': 77698, 'loss/train': 0.5614736080169678} 11/07/2021 08:10:46 - INFO - __main__ - Step 77700: {'lr': 0.00024098579571648222, 'samples': 14918400, 'steps': 77699, 'loss/train': 1.3316500186920166} 11/07/2021 08:10:46 - INFO - __main__ - Step 77701: {'lr': 0.00024098049242492152, 'samples': 14918592, 'steps': 77700, 'loss/train': 1.3469505310058594} 11/07/2021 08:10:47 - INFO - __main__ - Step 77702: {'lr': 0.0002409751891374249, 'samples': 14918784, 'steps': 77701, 'loss/train': 1.7920747995376587} 11/07/2021 08:10:47 - INFO - __main__ - Step 77703: {'lr': 0.00024096988585399474, 'samples': 14918976, 'steps': 77702, 'loss/train': 1.6142724752426147} 11/07/2021 08:10:47 - INFO - __main__ - Step 77704: {'lr': 0.00024096458257463332, 'samples': 14919168, 'steps': 77703, 'loss/train': 1.1531968116760254} 11/07/2021 08:10:49 - INFO - __main__ - Step 77705: {'lr': 0.00024095927929934316, 'samples': 14919360, 'steps': 77704, 'loss/train': 1.5529978275299072} 11/07/2021 08:10:49 - INFO - __main__ - Step 77706: {'lr': 0.00024095397602812662, 'samples': 14919552, 'steps': 77705, 'loss/train': 0.9849553108215332} 11/07/2021 08:10:49 - INFO - __main__ - Step 77707: {'lr': 0.00024094867276098605, 'samples': 14919744, 'steps': 77706, 'loss/train': 1.5030598640441895} 11/07/2021 08:10:50 - INFO - __main__ - Step 77708: {'lr': 0.00024094336949792388, 'samples': 14919936, 'steps': 77707, 'loss/train': 1.7648119926452637} 11/07/2021 08:10:50 - INFO - __main__ - Step 77709: {'lr': 0.00024093806623894248, 'samples': 14920128, 'steps': 77708, 'loss/train': 1.1394892930984497} 11/07/2021 08:10:51 - INFO - __main__ - Step 77710: {'lr': 0.00024093276298404426, 'samples': 14920320, 'steps': 77709, 'loss/train': 1.2398154735565186} 11/07/2021 08:10:51 - INFO - __main__ - Step 77711: {'lr': 0.00024092745973323156, 'samples': 14920512, 'steps': 77710, 'loss/train': 0.8878004550933838} 11/07/2021 08:10:52 - INFO - __main__ - Step 77712: {'lr': 0.00024092215648650685, 'samples': 14920704, 'steps': 77711, 'loss/train': 1.438417673110962} 11/07/2021 08:10:52 - INFO - __main__ - Step 77713: {'lr': 0.00024091685324387246, 'samples': 14920896, 'steps': 77712, 'loss/train': 0.905992865562439} 11/07/2021 08:10:52 - INFO - __main__ - Step 77714: {'lr': 0.0002409115500053308, 'samples': 14921088, 'steps': 77713, 'loss/train': 1.9524047374725342} 11/07/2021 08:10:53 - INFO - __main__ - Step 77715: {'lr': 0.00024090624677088426, 'samples': 14921280, 'steps': 77714, 'loss/train': 1.4520729780197144} 11/07/2021 08:10:54 - INFO - __main__ - Step 77716: {'lr': 0.0002409009435405353, 'samples': 14921472, 'steps': 77715, 'loss/train': 0.5438885688781738} 11/07/2021 08:10:54 - INFO - __main__ - Step 77717: {'lr': 0.0002408956403142862, 'samples': 14921664, 'steps': 77716, 'loss/train': 1.517018437385559} 11/07/2021 08:10:54 - INFO - __main__ - Step 77718: {'lr': 0.0002408903370921393, 'samples': 14921856, 'steps': 77717, 'loss/train': 1.4085990190505981} 11/07/2021 08:10:55 - INFO - __main__ - Step 77719: {'lr': 0.00024088503387409714, 'samples': 14922048, 'steps': 77718, 'loss/train': 0.8452578783035278} 11/07/2021 08:10:56 - INFO - __main__ - Step 77720: {'lr': 0.000240879730660162, 'samples': 14922240, 'steps': 77719, 'loss/train': 1.3852866888046265} 11/07/2021 08:10:56 - INFO - __main__ - Step 77721: {'lr': 0.00024087442745033633, 'samples': 14922432, 'steps': 77720, 'loss/train': 1.5808273553848267} 11/07/2021 08:10:57 - INFO - __main__ - Step 77722: {'lr': 0.00024086912424462248, 'samples': 14922624, 'steps': 77721, 'loss/train': 1.63779878616333} 11/07/2021 08:10:57 - INFO - __main__ - Step 77723: {'lr': 0.00024086382104302286, 'samples': 14922816, 'steps': 77722, 'loss/train': 1.7133854627609253} 11/07/2021 08:10:57 - INFO - __main__ - Step 77724: {'lr': 0.0002408585178455399, 'samples': 14923008, 'steps': 77723, 'loss/train': 1.0789141654968262} 11/07/2021 08:10:58 - INFO - __main__ - Step 77725: {'lr': 0.00024085321465217594, 'samples': 14923200, 'steps': 77724, 'loss/train': 1.2964184284210205} 11/07/2021 08:10:59 - INFO - __main__ - Step 77726: {'lr': 0.00024084791146293337, 'samples': 14923392, 'steps': 77725, 'loss/train': 1.7123281955718994} 11/07/2021 08:10:59 - INFO - __main__ - Step 77727: {'lr': 0.0002408426082778146, 'samples': 14923584, 'steps': 77726, 'loss/train': 0.062153611332178116} 11/07/2021 08:11:00 - INFO - __main__ - Step 77728: {'lr': 0.000240837305096822, 'samples': 14923776, 'steps': 77727, 'loss/train': 1.2971042394638062} 11/07/2021 08:11:00 - INFO - __main__ - Step 77729: {'lr': 0.00024083200191995808, 'samples': 14923968, 'steps': 77728, 'loss/train': 1.7498525381088257} 11/07/2021 08:11:00 - INFO - __main__ - Step 77730: {'lr': 0.00024082669874722499, 'samples': 14924160, 'steps': 77729, 'loss/train': 0.9504896402359009} 11/07/2021 08:11:01 - INFO - __main__ - Step 77731: {'lr': 0.00024082139557862528, 'samples': 14924352, 'steps': 77730, 'loss/train': 0.6925825476646423} 11/07/2021 08:11:02 - INFO - __main__ - Step 77732: {'lr': 0.00024081609241416126, 'samples': 14924544, 'steps': 77731, 'loss/train': 1.4540188312530518} 11/07/2021 08:11:02 - INFO - __main__ - Step 77733: {'lr': 0.00024081078925383543, 'samples': 14924736, 'steps': 77732, 'loss/train': 1.3802274465560913} 11/07/2021 08:11:02 - INFO - __main__ - Step 77734: {'lr': 0.00024080548609765008, 'samples': 14924928, 'steps': 77733, 'loss/train': 1.6898791790008545} 11/07/2021 08:11:03 - INFO - __main__ - Step 77735: {'lr': 0.00024080018294560766, 'samples': 14925120, 'steps': 77734, 'loss/train': 1.3557395935058594} 11/07/2021 08:11:04 - INFO - __main__ - Step 77736: {'lr': 0.0002407948797977105, 'samples': 14925312, 'steps': 77735, 'loss/train': 1.709507942199707} 11/07/2021 08:11:04 - INFO - __main__ - Step 77737: {'lr': 0.00024078957665396106, 'samples': 14925504, 'steps': 77736, 'loss/train': 0.8489279747009277} 11/07/2021 08:11:04 - INFO - __main__ - Step 77738: {'lr': 0.00024078427351436165, 'samples': 14925696, 'steps': 77737, 'loss/train': 1.0913091897964478} 11/07/2021 08:11:05 - INFO - __main__ - Step 77739: {'lr': 0.00024077897037891476, 'samples': 14925888, 'steps': 77738, 'loss/train': 1.5118685960769653} 11/07/2021 08:11:05 - INFO - __main__ - Step 77740: {'lr': 0.0002407736672476227, 'samples': 14926080, 'steps': 77739, 'loss/train': 1.3434242010116577} 11/07/2021 08:11:06 - INFO - __main__ - Step 77741: {'lr': 0.00024076836412048787, 'samples': 14926272, 'steps': 77740, 'loss/train': 0.9836006760597229} 11/07/2021 08:11:07 - INFO - __main__ - Step 77742: {'lr': 0.0002407630609975127, 'samples': 14926464, 'steps': 77741, 'loss/train': 1.479466438293457} 11/07/2021 08:11:07 - INFO - __main__ - Step 77743: {'lr': 0.00024075775787869963, 'samples': 14926656, 'steps': 77742, 'loss/train': 1.5528488159179688} 11/07/2021 08:11:07 - INFO - __main__ - Step 77744: {'lr': 0.00024075245476405088, 'samples': 14926848, 'steps': 77743, 'loss/train': 1.80381178855896} 11/07/2021 08:11:08 - INFO - __main__ - Step 77745: {'lr': 0.0002407471516535689, 'samples': 14927040, 'steps': 77744, 'loss/train': 1.5264397859573364} 11/07/2021 08:11:09 - INFO - __main__ - Step 77746: {'lr': 0.00024074184854725616, 'samples': 14927232, 'steps': 77745, 'loss/train': 1.7315911054611206} 11/07/2021 08:11:09 - INFO - __main__ - Step 77747: {'lr': 0.00024073654544511498, 'samples': 14927424, 'steps': 77746, 'loss/train': 1.6290496587753296} 11/07/2021 08:11:09 - INFO - __main__ - Step 77748: {'lr': 0.00024073124234714777, 'samples': 14927616, 'steps': 77747, 'loss/train': 1.573716163635254} 11/07/2021 08:11:10 - INFO - __main__ - Step 77749: {'lr': 0.00024072593925335693, 'samples': 14927808, 'steps': 77748, 'loss/train': 1.3007596731185913} 11/07/2021 08:11:10 - INFO - __main__ - Step 77750: {'lr': 0.00024072063616374482, 'samples': 14928000, 'steps': 77749, 'loss/train': 1.9381190538406372} 11/07/2021 08:11:12 - INFO - __main__ - Step 77751: {'lr': 0.00024071533307831383, 'samples': 14928192, 'steps': 77750, 'loss/train': 1.7180202007293701} 11/07/2021 08:11:12 - INFO - __main__ - Step 77752: {'lr': 0.0002407100299970664, 'samples': 14928384, 'steps': 77751, 'loss/train': 1.2816087007522583} 11/07/2021 08:11:12 - INFO - __main__ - Step 77753: {'lr': 0.00024070472692000488, 'samples': 14928576, 'steps': 77752, 'loss/train': 1.4602566957473755} 11/07/2021 08:11:13 - INFO - __main__ - Step 77754: {'lr': 0.00024069942384713166, 'samples': 14928768, 'steps': 77753, 'loss/train': 1.4668219089508057} 11/07/2021 08:11:13 - INFO - __main__ - Step 77755: {'lr': 0.00024069412077844916, 'samples': 14928960, 'steps': 77754, 'loss/train': 1.775299310684204} 11/07/2021 08:11:14 - INFO - __main__ - Step 77756: {'lr': 0.00024068881771395983, 'samples': 14929152, 'steps': 77755, 'loss/train': 1.419834017753601} 11/07/2021 08:11:14 - INFO - __main__ - Step 77757: {'lr': 0.00024068351465366587, 'samples': 14929344, 'steps': 77756, 'loss/train': 1.7235313653945923} 11/07/2021 08:11:15 - INFO - __main__ - Step 77758: {'lr': 0.0002406782115975698, 'samples': 14929536, 'steps': 77757, 'loss/train': 1.8461928367614746} 11/07/2021 08:11:15 - INFO - __main__ - Step 77759: {'lr': 0.00024067290854567396, 'samples': 14929728, 'steps': 77758, 'loss/train': 1.1260579824447632} 11/07/2021 08:11:16 - INFO - __main__ - Step 77760: {'lr': 0.00024066760549798074, 'samples': 14929920, 'steps': 77759, 'loss/train': 1.4394782781600952} 11/07/2021 08:11:16 - INFO - __main__ - Step 77761: {'lr': 0.00024066230245449257, 'samples': 14930112, 'steps': 77760, 'loss/train': 1.4412258863449097} 11/07/2021 08:11:16 - INFO - __main__ - Step 77762: {'lr': 0.00024065699941521184, 'samples': 14930304, 'steps': 77761, 'loss/train': 1.1707969903945923} 11/07/2021 08:11:17 - INFO - __main__ - Step 77763: {'lr': 0.0002406516963801409, 'samples': 14930496, 'steps': 77762, 'loss/train': 1.1256816387176514} 11/07/2021 08:11:18 - INFO - __main__ - Step 77764: {'lr': 0.00024064639334928217, 'samples': 14930688, 'steps': 77763, 'loss/train': 0.7217990159988403} 11/07/2021 08:11:18 - INFO - __main__ - Step 77765: {'lr': 0.00024064109032263803, 'samples': 14930880, 'steps': 77764, 'loss/train': 1.442512035369873} 11/07/2021 08:11:18 - INFO - __main__ - Step 77766: {'lr': 0.00024063578730021087, 'samples': 14931072, 'steps': 77765, 'loss/train': 0.8899643421173096} 11/07/2021 08:11:19 - INFO - __main__ - Step 77767: {'lr': 0.0002406304842820031, 'samples': 14931264, 'steps': 77766, 'loss/train': 1.8368879556655884} 11/07/2021 08:11:19 - INFO - __main__ - Step 77768: {'lr': 0.00024062518126801707, 'samples': 14931456, 'steps': 77767, 'loss/train': 1.5176351070404053} 11/07/2021 08:11:20 - INFO - __main__ - Step 77769: {'lr': 0.0002406198782582553, 'samples': 14931648, 'steps': 77768, 'loss/train': 1.2589620351791382} 11/07/2021 08:11:20 - INFO - __main__ - Step 77770: {'lr': 0.00024061457525271997, 'samples': 14931840, 'steps': 77769, 'loss/train': 1.3175467252731323} 11/07/2021 08:11:21 - INFO - __main__ - Step 77771: {'lr': 0.00024060927225141355, 'samples': 14932032, 'steps': 77770, 'loss/train': 1.8162599802017212} 11/07/2021 08:11:21 - INFO - __main__ - Step 77772: {'lr': 0.00024060396925433845, 'samples': 14932224, 'steps': 77771, 'loss/train': 1.3306254148483276} 11/07/2021 08:11:22 - INFO - __main__ - Step 77773: {'lr': 0.00024059866626149708, 'samples': 14932416, 'steps': 77772, 'loss/train': 1.5780062675476074} 11/07/2021 08:11:23 - INFO - __main__ - Step 77774: {'lr': 0.00024059336327289177, 'samples': 14932608, 'steps': 77773, 'loss/train': 1.3578064441680908} 11/07/2021 08:11:23 - INFO - __main__ - Step 77775: {'lr': 0.00024058806028852495, 'samples': 14932800, 'steps': 77774, 'loss/train': 1.6188416481018066} 11/07/2021 08:11:23 - INFO - __main__ - Step 77776: {'lr': 0.00024058275730839905, 'samples': 14932992, 'steps': 77775, 'loss/train': 1.458539366722107} 11/07/2021 08:11:24 - INFO - __main__ - Step 77777: {'lr': 0.00024057745433251636, 'samples': 14933184, 'steps': 77776, 'loss/train': 1.1978888511657715} 11/07/2021 08:11:24 - INFO - __main__ - Step 77778: {'lr': 0.00024057215136087936, 'samples': 14933376, 'steps': 77777, 'loss/train': 1.9433304071426392} 11/07/2021 08:11:25 - INFO - __main__ - Step 77779: {'lr': 0.0002405668483934904, 'samples': 14933568, 'steps': 77778, 'loss/train': 1.9592233896255493} 11/07/2021 08:11:25 - INFO - __main__ - Step 77780: {'lr': 0.00024056154543035182, 'samples': 14933760, 'steps': 77779, 'loss/train': 1.5875864028930664} 11/07/2021 08:11:26 - INFO - __main__ - Step 77781: {'lr': 0.00024055624247146612, 'samples': 14933952, 'steps': 77780, 'loss/train': 1.9477838277816772} 11/07/2021 08:11:26 - INFO - __main__ - Step 77782: {'lr': 0.0002405509395168356, 'samples': 14934144, 'steps': 77781, 'loss/train': 1.167641520500183} 11/07/2021 08:11:27 - INFO - __main__ - Step 77783: {'lr': 0.0002405456365664628, 'samples': 14934336, 'steps': 77782, 'loss/train': 1.3976948261260986} 11/07/2021 08:11:27 - INFO - __main__ - Step 77784: {'lr': 0.0002405403336203499, 'samples': 14934528, 'steps': 77783, 'loss/train': 1.0208288431167603} 11/07/2021 08:11:28 - INFO - __main__ - Step 77785: {'lr': 0.00024053503067849935, 'samples': 14934720, 'steps': 77784, 'loss/train': 1.6367591619491577} 11/07/2021 08:11:28 - INFO - __main__ - Step 77786: {'lr': 0.00024052972774091358, 'samples': 14934912, 'steps': 77785, 'loss/train': 1.6039555072784424} 11/07/2021 08:11:29 - INFO - __main__ - Step 77787: {'lr': 0.000240524424807595, 'samples': 14935104, 'steps': 77786, 'loss/train': 1.077457308769226} 11/07/2021 08:11:29 - INFO - __main__ - Step 77788: {'lr': 0.00024051912187854593, 'samples': 14935296, 'steps': 77787, 'loss/train': 1.5053424835205078} 11/07/2021 08:11:29 - INFO - __main__ - Step 77789: {'lr': 0.00024051381895376882, 'samples': 14935488, 'steps': 77788, 'loss/train': 0.8548309803009033} 11/07/2021 08:11:30 - INFO - __main__ - Step 77790: {'lr': 0.000240508516033266, 'samples': 14935680, 'steps': 77789, 'loss/train': 1.221774935722351} 11/07/2021 08:11:31 - INFO - __main__ - Step 77791: {'lr': 0.00024050321311703992, 'samples': 14935872, 'steps': 77790, 'loss/train': 1.3900091648101807} 11/07/2021 08:11:31 - INFO - __main__ - Step 77792: {'lr': 0.00024049791020509296, 'samples': 14936064, 'steps': 77791, 'loss/train': 0.9879840612411499} 11/07/2021 08:11:31 - INFO - __main__ - Step 77793: {'lr': 0.00024049260729742746, 'samples': 14936256, 'steps': 77792, 'loss/train': 1.6446752548217773} 11/07/2021 08:11:32 - INFO - __main__ - Step 77794: {'lr': 0.00024048730439404594, 'samples': 14936448, 'steps': 77793, 'loss/train': 1.2520250082015991} 11/07/2021 08:11:33 - INFO - __main__ - Step 77795: {'lr': 0.00024048200149495063, 'samples': 14936640, 'steps': 77794, 'loss/train': 1.3834477663040161} 11/07/2021 08:11:33 - INFO - __main__ - Step 77796: {'lr': 0.000240476698600144, 'samples': 14936832, 'steps': 77795, 'loss/train': 1.6756792068481445} 11/07/2021 08:11:33 - INFO - __main__ - Step 77797: {'lr': 0.00024047139570962842, 'samples': 14937024, 'steps': 77796, 'loss/train': 1.1981652975082397} 11/07/2021 08:11:34 - INFO - __main__ - Step 77798: {'lr': 0.00024046609282340627, 'samples': 14937216, 'steps': 77797, 'loss/train': 1.6105031967163086} 11/07/2021 08:11:34 - INFO - __main__ - Step 77799: {'lr': 0.00024046078994147992, 'samples': 14937408, 'steps': 77798, 'loss/train': 1.3788340091705322} 11/07/2021 08:11:35 - INFO - __main__ - Step 77800: {'lr': 0.0002404554870638518, 'samples': 14937600, 'steps': 77799, 'loss/train': 1.1532248258590698} 11/07/2021 08:11:36 - INFO - __main__ - Step 77801: {'lr': 0.0002404501841905243, 'samples': 14937792, 'steps': 77800, 'loss/train': 1.1600710153579712} 11/07/2021 08:11:36 - INFO - __main__ - Step 77802: {'lr': 0.0002404448813214998, 'samples': 14937984, 'steps': 77801, 'loss/train': 0.9547685980796814} 11/07/2021 08:11:36 - INFO - __main__ - Step 77803: {'lr': 0.0002404395784567807, 'samples': 14938176, 'steps': 77802, 'loss/train': 1.4424628019332886} 11/07/2021 08:11:37 - INFO - __main__ - Step 77804: {'lr': 0.00024043427559636936, 'samples': 14938368, 'steps': 77803, 'loss/train': 0.770857036113739} 11/07/2021 08:11:38 - INFO - __main__ - Step 77805: {'lr': 0.00024042897274026827, 'samples': 14938560, 'steps': 77804, 'loss/train': 1.5722451210021973} 11/07/2021 08:11:38 - INFO - __main__ - Step 77806: {'lr': 0.00024042366988847965, 'samples': 14938752, 'steps': 77805, 'loss/train': 1.3348190784454346} 11/07/2021 08:11:38 - INFO - __main__ - Step 77807: {'lr': 0.000240418367041006, 'samples': 14938944, 'steps': 77806, 'loss/train': 1.411423921585083} 11/07/2021 08:11:39 - INFO - __main__ - Step 77808: {'lr': 0.00024041306419784968, 'samples': 14939136, 'steps': 77807, 'loss/train': 0.9060186743736267} 11/07/2021 08:11:39 - INFO - __main__ - Step 77809: {'lr': 0.00024040776135901304, 'samples': 14939328, 'steps': 77808, 'loss/train': 1.6340594291687012} 11/07/2021 08:11:39 - INFO - __main__ - Step 77810: {'lr': 0.00024040245852449864, 'samples': 14939520, 'steps': 77809, 'loss/train': 1.8312350511550903} 11/07/2021 08:11:40 - INFO - __main__ - Step 77811: {'lr': 0.00024039715569430865, 'samples': 14939712, 'steps': 77810, 'loss/train': 1.0254576206207275} 11/07/2021 08:11:41 - INFO - __main__ - Step 77812: {'lr': 0.00024039185286844555, 'samples': 14939904, 'steps': 77811, 'loss/train': 1.1758249998092651} 11/07/2021 08:11:41 - INFO - __main__ - Step 77813: {'lr': 0.00024038655004691176, 'samples': 14940096, 'steps': 77812, 'loss/train': 0.4921795427799225} 11/07/2021 08:11:41 - INFO - __main__ - Step 77814: {'lr': 0.00024038124722970962, 'samples': 14940288, 'steps': 77813, 'loss/train': 1.6434391736984253} 11/07/2021 08:11:42 - INFO - __main__ - Step 77815: {'lr': 0.00024037594441684155, 'samples': 14940480, 'steps': 77814, 'loss/train': 1.2928398847579956} 11/07/2021 08:11:43 - INFO - __main__ - Step 77816: {'lr': 0.00024037064160831, 'samples': 14940672, 'steps': 77815, 'loss/train': 0.949518084526062} 11/07/2021 08:11:43 - INFO - __main__ - Step 77817: {'lr': 0.0002403653388041172, 'samples': 14940864, 'steps': 77816, 'loss/train': 1.4100078344345093} 11/07/2021 08:11:43 - INFO - __main__ - Step 77818: {'lr': 0.00024036003600426566, 'samples': 14941056, 'steps': 77817, 'loss/train': 1.3880600929260254} 11/07/2021 08:11:44 - INFO - __main__ - Step 77819: {'lr': 0.00024035473320875773, 'samples': 14941248, 'steps': 77818, 'loss/train': 1.0559751987457275} 11/07/2021 08:11:44 - INFO - __main__ - Step 77820: {'lr': 0.0002403494304175958, 'samples': 14941440, 'steps': 77819, 'loss/train': 1.8806668519973755} 11/07/2021 08:11:45 - INFO - __main__ - Step 77821: {'lr': 0.00024034412763078227, 'samples': 14941632, 'steps': 77820, 'loss/train': 1.293351650238037} 11/07/2021 08:11:45 - INFO - __main__ - Step 77822: {'lr': 0.00024033882484831955, 'samples': 14941824, 'steps': 77821, 'loss/train': 1.558099627494812} 11/07/2021 08:11:46 - INFO - __main__ - Step 77823: {'lr': 0.00024033352207021002, 'samples': 14942016, 'steps': 77822, 'loss/train': 1.5036122798919678} 11/07/2021 08:11:46 - INFO - __main__ - Step 77824: {'lr': 0.00024032821929645604, 'samples': 14942208, 'steps': 77823, 'loss/train': 1.308311939239502} 11/07/2021 08:11:47 - INFO - __main__ - Step 77825: {'lr': 0.00024032291652705998, 'samples': 14942400, 'steps': 77824, 'loss/train': 1.2972612380981445} 11/07/2021 08:11:48 - INFO - __main__ - Step 77826: {'lr': 0.00024031761376202434, 'samples': 14942592, 'steps': 77825, 'loss/train': 1.438514232635498} 11/07/2021 08:11:48 - INFO - __main__ - Step 77827: {'lr': 0.0002403123110013514, 'samples': 14942784, 'steps': 77826, 'loss/train': 1.535505771636963} 11/07/2021 08:11:48 - INFO - __main__ - Step 77828: {'lr': 0.00024030700824504355, 'samples': 14942976, 'steps': 77827, 'loss/train': 1.3065706491470337} 11/07/2021 08:11:49 - INFO - __main__ - Step 77829: {'lr': 0.00024030170549310323, 'samples': 14943168, 'steps': 77828, 'loss/train': 0.3707010746002197} 11/07/2021 08:11:49 - INFO - __main__ - Step 77830: {'lr': 0.0002402964027455328, 'samples': 14943360, 'steps': 77829, 'loss/train': 1.6961816549301147} 11/07/2021 08:11:50 - INFO - __main__ - Step 77831: {'lr': 0.00024029110000233468, 'samples': 14943552, 'steps': 77830, 'loss/train': 1.8510551452636719} 11/07/2021 08:11:50 - INFO - __main__ - Step 77832: {'lr': 0.00024028579726351123, 'samples': 14943744, 'steps': 77831, 'loss/train': 1.123478889465332} 11/07/2021 08:11:51 - INFO - __main__ - Step 77833: {'lr': 0.00024028049452906483, 'samples': 14943936, 'steps': 77832, 'loss/train': 1.526466727256775} 11/07/2021 08:11:51 - INFO - __main__ - Step 77834: {'lr': 0.0002402751917989979, 'samples': 14944128, 'steps': 77833, 'loss/train': 1.2069807052612305} 11/07/2021 08:11:51 - INFO - __main__ - Step 77835: {'lr': 0.00024026988907331281, 'samples': 14944320, 'steps': 77834, 'loss/train': 0.959693431854248} 11/07/2021 08:11:52 - INFO - __main__ - Step 77836: {'lr': 0.000240264586352012, 'samples': 14944512, 'steps': 77835, 'loss/train': 1.2415456771850586} 11/07/2021 08:11:53 - INFO - __main__ - Step 77837: {'lr': 0.00024025928363509788, 'samples': 14944704, 'steps': 77836, 'loss/train': 1.8618324995040894} 11/07/2021 08:11:53 - INFO - __main__ - Step 77838: {'lr': 0.0002402539809225727, 'samples': 14944896, 'steps': 77837, 'loss/train': 1.3082020282745361} 11/07/2021 08:11:53 - INFO - __main__ - Step 77839: {'lr': 0.0002402486782144389, 'samples': 14945088, 'steps': 77838, 'loss/train': 0.7745999097824097} 11/07/2021 08:11:54 - INFO - __main__ - Step 77840: {'lr': 0.0002402433755106989, 'samples': 14945280, 'steps': 77839, 'loss/train': 1.4281647205352783} 11/07/2021 08:11:55 - INFO - __main__ - Step 77841: {'lr': 0.0002402380728113551, 'samples': 14945472, 'steps': 77840, 'loss/train': 1.7777011394500732} 11/07/2021 08:11:55 - INFO - __main__ - Step 77842: {'lr': 0.00024023277011640988, 'samples': 14945664, 'steps': 77841, 'loss/train': 1.8391262292861938} 11/07/2021 08:11:56 - INFO - __main__ - Step 77843: {'lr': 0.0002402274674258656, 'samples': 14945856, 'steps': 77842, 'loss/train': 1.7700812816619873} 11/07/2021 08:11:56 - INFO - __main__ - Step 77844: {'lr': 0.0002402221647397247, 'samples': 14946048, 'steps': 77843, 'loss/train': 1.154042363166809} 11/07/2021 08:11:56 - INFO - __main__ - Step 77845: {'lr': 0.00024021686205798952, 'samples': 14946240, 'steps': 77844, 'loss/train': 1.859104871749878} 11/07/2021 08:11:57 - INFO - __main__ - Step 77846: {'lr': 0.00024021155938066247, 'samples': 14946432, 'steps': 77845, 'loss/train': 1.4594192504882812} 11/07/2021 08:11:58 - INFO - __main__ - Step 77847: {'lr': 0.00024020625670774593, 'samples': 14946624, 'steps': 77846, 'loss/train': 1.775148630142212} 11/07/2021 08:11:58 - INFO - __main__ - Step 77848: {'lr': 0.0002402009540392423, 'samples': 14946816, 'steps': 77847, 'loss/train': 1.839240312576294} 11/07/2021 08:11:58 - INFO - __main__ - Step 77849: {'lr': 0.000240195651375154, 'samples': 14947008, 'steps': 77848, 'loss/train': 1.145022988319397} 11/07/2021 08:11:59 - INFO - __main__ - Step 77850: {'lr': 0.00024019034871548348, 'samples': 14947200, 'steps': 77849, 'loss/train': 1.1296638250350952} 11/07/2021 08:11:59 - INFO - __main__ - Step 77851: {'lr': 0.00024018504606023293, 'samples': 14947392, 'steps': 77850, 'loss/train': 0.8606487512588501} 11/07/2021 08:12:00 - INFO - __main__ - Step 77852: {'lr': 0.00024017974340940484, 'samples': 14947584, 'steps': 77851, 'loss/train': 1.2698441743850708} 11/07/2021 08:12:00 - INFO - __main__ - Step 77853: {'lr': 0.0002401744407630016, 'samples': 14947776, 'steps': 77852, 'loss/train': 1.111094355583191} 11/07/2021 08:12:01 - INFO - __main__ - Step 77854: {'lr': 0.00024016913812102561, 'samples': 14947968, 'steps': 77853, 'loss/train': 1.504739761352539} 11/07/2021 08:12:01 - INFO - __main__ - Step 77855: {'lr': 0.00024016383548347927, 'samples': 14948160, 'steps': 77854, 'loss/train': 1.2613294124603271} 11/07/2021 08:12:02 - INFO - __main__ - Step 77856: {'lr': 0.00024015853285036496, 'samples': 14948352, 'steps': 77855, 'loss/train': 1.2508184909820557} 11/07/2021 08:12:03 - INFO - __main__ - Step 77857: {'lr': 0.00024015323022168503, 'samples': 14948544, 'steps': 77856, 'loss/train': 1.3004895448684692} 11/07/2021 08:12:03 - INFO - __main__ - Step 77858: {'lr': 0.0002401479275974419, 'samples': 14948736, 'steps': 77857, 'loss/train': 1.7275447845458984} 11/07/2021 08:12:03 - INFO - __main__ - Step 77859: {'lr': 0.000240142624977638, 'samples': 14948928, 'steps': 77858, 'loss/train': 1.341060996055603} 11/07/2021 08:12:04 - INFO - __main__ - Step 77860: {'lr': 0.00024013732236227568, 'samples': 14949120, 'steps': 77859, 'loss/train': 1.149538278579712} 11/07/2021 08:12:04 - INFO - __main__ - Step 77861: {'lr': 0.0002401320197513573, 'samples': 14949312, 'steps': 77860, 'loss/train': 0.7336187958717346} 11/07/2021 08:12:04 - INFO - __main__ - Step 77862: {'lr': 0.0002401267171448853, 'samples': 14949504, 'steps': 77861, 'loss/train': 1.9134246110916138} 11/07/2021 08:12:05 - INFO - __main__ - Step 77863: {'lr': 0.00024012141454286205, 'samples': 14949696, 'steps': 77862, 'loss/train': 0.8516228199005127} 11/07/2021 08:12:06 - INFO - __main__ - Step 77864: {'lr': 0.00024011611194529005, 'samples': 14949888, 'steps': 77863, 'loss/train': 0.8406289219856262} 11/07/2021 08:12:06 - INFO - __main__ - Step 77865: {'lr': 0.00024011080935217144, 'samples': 14950080, 'steps': 77864, 'loss/train': 1.1992095708847046} 11/07/2021 08:12:06 - INFO - __main__ - Step 77866: {'lr': 0.00024010550676350877, 'samples': 14950272, 'steps': 77865, 'loss/train': 1.26303231716156} 11/07/2021 08:12:07 - INFO - __main__ - Step 77867: {'lr': 0.00024010020417930439, 'samples': 14950464, 'steps': 77866, 'loss/train': 0.5883327126502991} 11/07/2021 08:12:08 - INFO - __main__ - Step 77868: {'lr': 0.0002400949015995607, 'samples': 14950656, 'steps': 77867, 'loss/train': 1.3596906661987305} 11/07/2021 08:12:08 - INFO - __main__ - Step 77869: {'lr': 0.00024008959902428013, 'samples': 14950848, 'steps': 77868, 'loss/train': 1.4876338243484497} 11/07/2021 08:12:09 - INFO - __main__ - Step 77870: {'lr': 0.00024008429645346502, 'samples': 14951040, 'steps': 77869, 'loss/train': 1.7067862749099731} 11/07/2021 08:12:09 - INFO - __main__ - Step 77871: {'lr': 0.00024007899388711778, 'samples': 14951232, 'steps': 77870, 'loss/train': 1.1221191883087158} 11/07/2021 08:12:09 - INFO - __main__ - Step 77872: {'lr': 0.00024007369132524075, 'samples': 14951424, 'steps': 77871, 'loss/train': 1.0310590267181396} 11/07/2021 08:12:10 - INFO - __main__ - Step 77873: {'lr': 0.0002400683887678364, 'samples': 14951616, 'steps': 77872, 'loss/train': 1.4109236001968384} 11/07/2021 08:12:11 - INFO - __main__ - Step 77874: {'lr': 0.0002400630862149071, 'samples': 14951808, 'steps': 77873, 'loss/train': 1.2652363777160645} 11/07/2021 08:12:11 - INFO - __main__ - Step 77875: {'lr': 0.00024005778366645517, 'samples': 14952000, 'steps': 77874, 'loss/train': 2.225954055786133} 11/07/2021 08:12:11 - INFO - __main__ - Step 77876: {'lr': 0.00024005248112248307, 'samples': 14952192, 'steps': 77875, 'loss/train': 1.349465250968933} 11/07/2021 08:12:12 - INFO - __main__ - Step 77877: {'lr': 0.00024004717858299327, 'samples': 14952384, 'steps': 77876, 'loss/train': 1.4414795637130737} 11/07/2021 08:12:13 - INFO - __main__ - Step 77878: {'lr': 0.00024004187604798798, 'samples': 14952576, 'steps': 77877, 'loss/train': 1.5698652267456055} 11/07/2021 08:12:13 - INFO - __main__ - Step 77879: {'lr': 0.00024003657351746963, 'samples': 14952768, 'steps': 77878, 'loss/train': 1.0372892618179321} 11/07/2021 08:12:13 - INFO - __main__ - Step 77880: {'lr': 0.00024003127099144064, 'samples': 14952960, 'steps': 77879, 'loss/train': 1.2537838220596313} 11/07/2021 08:12:14 - INFO - __main__ - Step 77881: {'lr': 0.00024002596846990344, 'samples': 14953152, 'steps': 77880, 'loss/train': 1.504740595817566} 11/07/2021 08:12:14 - INFO - __main__ - Step 77882: {'lr': 0.00024002066595286037, 'samples': 14953344, 'steps': 77881, 'loss/train': 1.4044195413589478} 11/07/2021 08:12:15 - INFO - __main__ - Step 77883: {'lr': 0.00024001536344031384, 'samples': 14953536, 'steps': 77882, 'loss/train': 0.9994714856147766} 11/07/2021 08:12:15 - INFO - __main__ - Step 77884: {'lr': 0.0002400100609322662, 'samples': 14953728, 'steps': 77883, 'loss/train': 1.6605747938156128} 11/07/2021 08:12:16 - INFO - __main__ - Step 77885: {'lr': 0.0002400047584287199, 'samples': 14953920, 'steps': 77884, 'loss/train': 1.278984785079956} 11/07/2021 08:12:16 - INFO - __main__ - Step 77886: {'lr': 0.0002399994559296773, 'samples': 14954112, 'steps': 77885, 'loss/train': 1.04181706905365} 11/07/2021 08:12:16 - INFO - __main__ - Step 77887: {'lr': 0.0002399941534351408, 'samples': 14954304, 'steps': 77886, 'loss/train': 1.2080788612365723} 11/07/2021 08:12:17 - INFO - __main__ - Step 77888: {'lr': 0.00023998885094511277, 'samples': 14954496, 'steps': 77887, 'loss/train': 1.3483186960220337} 11/07/2021 08:12:18 - INFO - __main__ - Step 77889: {'lr': 0.00023998354845959565, 'samples': 14954688, 'steps': 77888, 'loss/train': 1.7265336513519287} 11/07/2021 08:12:18 - INFO - __main__ - Step 77890: {'lr': 0.00023997824597859184, 'samples': 14954880, 'steps': 77889, 'loss/train': 1.6177208423614502} 11/07/2021 08:12:18 - INFO - __main__ - Step 77891: {'lr': 0.0002399729435021036, 'samples': 14955072, 'steps': 77890, 'loss/train': 1.7436586618423462} 11/07/2021 08:12:19 - INFO - __main__ - Step 77892: {'lr': 0.00023996764103013338, 'samples': 14955264, 'steps': 77891, 'loss/train': 1.4988399744033813} 11/07/2021 08:12:19 - INFO - __main__ - Step 77893: {'lr': 0.00023996233856268356, 'samples': 14955456, 'steps': 77892, 'loss/train': 1.6517431735992432} 11/07/2021 08:12:20 - INFO - __main__ - Step 77894: {'lr': 0.0002399570360997566, 'samples': 14955648, 'steps': 77893, 'loss/train': 1.3559690713882446} 11/07/2021 08:12:21 - INFO - __main__ - Step 77895: {'lr': 0.00023995173364135483, 'samples': 14955840, 'steps': 77894, 'loss/train': 0.990135669708252} 11/07/2021 08:12:21 - INFO - __main__ - Step 77896: {'lr': 0.00023994643118748065, 'samples': 14956032, 'steps': 77895, 'loss/train': 0.9539682865142822} 11/07/2021 08:12:21 - INFO - __main__ - Step 77897: {'lr': 0.00023994112873813647, 'samples': 14956224, 'steps': 77896, 'loss/train': 1.3821481466293335} 11/07/2021 08:12:22 - INFO - __main__ - Step 77898: {'lr': 0.00023993582629332463, 'samples': 14956416, 'steps': 77897, 'loss/train': 1.567352533340454} 11/07/2021 08:12:23 - INFO - __main__ - Step 77899: {'lr': 0.0002399305238530476, 'samples': 14956608, 'steps': 77898, 'loss/train': 1.502428412437439} 11/07/2021 08:12:23 - INFO - __main__ - Step 77900: {'lr': 0.00023992522141730768, 'samples': 14956800, 'steps': 77899, 'loss/train': 1.6957865953445435} 11/07/2021 08:12:23 - INFO - __main__ - Step 77901: {'lr': 0.00023991991898610732, 'samples': 14956992, 'steps': 77900, 'loss/train': 1.4621461629867554} 11/07/2021 08:12:24 - INFO - __main__ - Step 77902: {'lr': 0.00023991461655944888, 'samples': 14957184, 'steps': 77901, 'loss/train': 1.2674745321273804} 11/07/2021 08:12:24 - INFO - __main__ - Step 77903: {'lr': 0.00023990931413733475, 'samples': 14957376, 'steps': 77902, 'loss/train': 1.2775654792785645} 11/07/2021 08:12:25 - INFO - __main__ - Step 77904: {'lr': 0.00023990401171976745, 'samples': 14957568, 'steps': 77903, 'loss/train': 1.4685544967651367} 11/07/2021 08:12:25 - INFO - __main__ - Step 77905: {'lr': 0.00023989870930674913, 'samples': 14957760, 'steps': 77904, 'loss/train': 0.8097760081291199} 11/07/2021 08:12:26 - INFO - __main__ - Step 77906: {'lr': 0.0002398934068982823, 'samples': 14957952, 'steps': 77905, 'loss/train': 1.0335663557052612} 11/07/2021 08:12:26 - INFO - __main__ - Step 77907: {'lr': 0.00023988810449436935, 'samples': 14958144, 'steps': 77906, 'loss/train': 2.2798473834991455} 11/07/2021 08:12:26 - INFO - __main__ - Step 77908: {'lr': 0.00023988280209501266, 'samples': 14958336, 'steps': 77907, 'loss/train': 1.5945552587509155} 11/07/2021 08:12:28 - INFO - __main__ - Step 77909: {'lr': 0.0002398774997002146, 'samples': 14958528, 'steps': 77908, 'loss/train': 1.3107926845550537} 11/07/2021 08:12:28 - INFO - __main__ - Step 77910: {'lr': 0.00023987219730997762, 'samples': 14958720, 'steps': 77909, 'loss/train': 1.72169029712677} 11/07/2021 08:12:28 - INFO - __main__ - Step 77911: {'lr': 0.00023986689492430405, 'samples': 14958912, 'steps': 77910, 'loss/train': 1.7004035711288452} 11/07/2021 08:12:29 - INFO - __main__ - Step 77912: {'lr': 0.0002398615925431963, 'samples': 14959104, 'steps': 77911, 'loss/train': 1.8249080181121826} 11/07/2021 08:12:29 - INFO - __main__ - Step 77913: {'lr': 0.00023985629016665678, 'samples': 14959296, 'steps': 77912, 'loss/train': 1.2065646648406982} 11/07/2021 08:12:30 - INFO - __main__ - Step 77914: {'lr': 0.00023985098779468784, 'samples': 14959488, 'steps': 77913, 'loss/train': 1.1458313465118408} 11/07/2021 08:12:30 - INFO - __main__ - Step 77915: {'lr': 0.0002398456854272919, 'samples': 14959680, 'steps': 77914, 'loss/train': 1.2910380363464355} 11/07/2021 08:12:31 - INFO - __main__ - Step 77916: {'lr': 0.00023984038306447132, 'samples': 14959872, 'steps': 77915, 'loss/train': 1.3125139474868774} 11/07/2021 08:12:31 - INFO - __main__ - Step 77917: {'lr': 0.0002398350807062286, 'samples': 14960064, 'steps': 77916, 'loss/train': 1.3618106842041016} 11/07/2021 08:12:32 - INFO - __main__ - Step 77918: {'lr': 0.00023982977835256596, 'samples': 14960256, 'steps': 77917, 'loss/train': 1.1997334957122803} 11/07/2021 08:12:33 - INFO - __main__ - Step 77919: {'lr': 0.00023982447600348587, 'samples': 14960448, 'steps': 77918, 'loss/train': 1.3221087455749512} 11/07/2021 08:12:33 - INFO - __main__ - Step 77920: {'lr': 0.0002398191736589907, 'samples': 14960640, 'steps': 77919, 'loss/train': 1.3875223398208618} 11/07/2021 08:12:33 - INFO - __main__ - Step 77921: {'lr': 0.00023981387131908287, 'samples': 14960832, 'steps': 77920, 'loss/train': 1.2744557857513428} 11/07/2021 08:12:34 - INFO - __main__ - Step 77922: {'lr': 0.00023980856898376472, 'samples': 14961024, 'steps': 77921, 'loss/train': 1.3896815776824951} 11/07/2021 08:12:34 - INFO - __main__ - Step 77923: {'lr': 0.0002398032666530387, 'samples': 14961216, 'steps': 77922, 'loss/train': 0.8239076137542725} 11/07/2021 08:12:34 - INFO - __main__ - Step 77924: {'lr': 0.00023979796432690715, 'samples': 14961408, 'steps': 77923, 'loss/train': 1.7624874114990234} 11/07/2021 08:12:35 - INFO - __main__ - Step 77925: {'lr': 0.00023979266200537251, 'samples': 14961600, 'steps': 77924, 'loss/train': 1.5399729013442993} 11/07/2021 08:12:36 - INFO - __main__ - Step 77926: {'lr': 0.0002397873596884371, 'samples': 14961792, 'steps': 77925, 'loss/train': 1.5106821060180664} 11/07/2021 08:12:36 - INFO - __main__ - Step 77927: {'lr': 0.00023978205737610337, 'samples': 14961984, 'steps': 77926, 'loss/train': 2.292548656463623} 11/07/2021 08:12:36 - INFO - __main__ - Step 77928: {'lr': 0.00023977675506837374, 'samples': 14962176, 'steps': 77927, 'loss/train': 0.8225629925727844} 11/07/2021 08:12:37 - INFO - __main__ - Step 77929: {'lr': 0.00023977145276525048, 'samples': 14962368, 'steps': 77928, 'loss/train': 1.5448728799819946} 11/07/2021 08:12:38 - INFO - __main__ - Step 77930: {'lr': 0.00023976615046673606, 'samples': 14962560, 'steps': 77929, 'loss/train': 1.5356934070587158} 11/07/2021 08:12:38 - INFO - __main__ - Step 77931: {'lr': 0.00023976084817283288, 'samples': 14962752, 'steps': 77930, 'loss/train': 1.736335277557373} 11/07/2021 08:12:38 - INFO - __main__ - Step 77932: {'lr': 0.00023975554588354328, 'samples': 14962944, 'steps': 77931, 'loss/train': 1.1285618543624878} 11/07/2021 08:12:39 - INFO - __main__ - Step 77933: {'lr': 0.00023975024359886967, 'samples': 14963136, 'steps': 77932, 'loss/train': 0.8659287095069885} 11/07/2021 08:12:39 - INFO - __main__ - Step 77934: {'lr': 0.00023974494131881447, 'samples': 14963328, 'steps': 77933, 'loss/train': 1.8252848386764526} 11/07/2021 08:12:40 - INFO - __main__ - Step 77935: {'lr': 0.00023973963904338, 'samples': 14963520, 'steps': 77934, 'loss/train': 1.1703217029571533} 11/07/2021 08:12:41 - INFO - __main__ - Step 77936: {'lr': 0.0002397343367725687, 'samples': 14963712, 'steps': 77935, 'loss/train': 1.083784818649292} 11/07/2021 08:12:41 - INFO - __main__ - Step 77937: {'lr': 0.00023972903450638296, 'samples': 14963904, 'steps': 77936, 'loss/train': 1.4291589260101318} 11/07/2021 08:12:41 - INFO - __main__ - Step 77938: {'lr': 0.00023972373224482514, 'samples': 14964096, 'steps': 77937, 'loss/train': 1.626272439956665} 11/07/2021 08:12:42 - INFO - __main__ - Step 77939: {'lr': 0.0002397184299878977, 'samples': 14964288, 'steps': 77938, 'loss/train': 1.241249442100525} 11/07/2021 08:12:43 - INFO - __main__ - Step 77940: {'lr': 0.00023971312773560295, 'samples': 14964480, 'steps': 77939, 'loss/train': 1.3634552955627441} 11/07/2021 08:12:43 - INFO - __main__ - Step 77941: {'lr': 0.00023970782548794327, 'samples': 14964672, 'steps': 77940, 'loss/train': 1.5058910846710205} 11/07/2021 08:12:43 - INFO - __main__ - Step 77942: {'lr': 0.0002397025232449211, 'samples': 14964864, 'steps': 77941, 'loss/train': 1.3092432022094727} 11/07/2021 08:12:44 - INFO - __main__ - Step 77943: {'lr': 0.0002396972210065388, 'samples': 14965056, 'steps': 77942, 'loss/train': 1.4826523065567017} 11/07/2021 08:12:44 - INFO - __main__ - Step 77944: {'lr': 0.00023969191877279888, 'samples': 14965248, 'steps': 77943, 'loss/train': 1.42387855052948} 11/07/2021 08:12:45 - INFO - __main__ - Step 77945: {'lr': 0.0002396866165437035, 'samples': 14965440, 'steps': 77944, 'loss/train': 0.889860212802887} 11/07/2021 08:12:45 - INFO - __main__ - Step 77946: {'lr': 0.00023968131431925525, 'samples': 14965632, 'steps': 77945, 'loss/train': 1.5161428451538086} 11/07/2021 08:12:46 - INFO - __main__ - Step 77947: {'lr': 0.0002396760120994564, 'samples': 14965824, 'steps': 77946, 'loss/train': 1.1741281747817993} 11/07/2021 08:12:46 - INFO - __main__ - Step 77948: {'lr': 0.00023967070988430936, 'samples': 14966016, 'steps': 77947, 'loss/train': 1.4080688953399658} 11/07/2021 08:12:46 - INFO - __main__ - Step 77949: {'lr': 0.00023966540767381657, 'samples': 14966208, 'steps': 77948, 'loss/train': 0.6405399441719055} 11/07/2021 08:12:48 - INFO - __main__ - Step 77950: {'lr': 0.0002396601054679804, 'samples': 14966400, 'steps': 77949, 'loss/train': 1.7217782735824585} 11/07/2021 08:12:48 - INFO - __main__ - Step 77951: {'lr': 0.0002396548032668032, 'samples': 14966592, 'steps': 77950, 'loss/train': 1.7868560552597046} 11/07/2021 08:12:48 - INFO - __main__ - Step 77952: {'lr': 0.00023964950107028738, 'samples': 14966784, 'steps': 77951, 'loss/train': 0.9535073041915894} 11/07/2021 08:12:49 - INFO - __main__ - Step 77953: {'lr': 0.00023964419887843535, 'samples': 14966976, 'steps': 77952, 'loss/train': 1.0653002262115479} 11/07/2021 08:12:49 - INFO - __main__ - Step 77954: {'lr': 0.00023963889669124946, 'samples': 14967168, 'steps': 77953, 'loss/train': 0.06731796264648438} 11/07/2021 08:12:50 - INFO - __main__ - Step 77955: {'lr': 0.00023963359450873215, 'samples': 14967360, 'steps': 77954, 'loss/train': 1.3748929500579834} 11/07/2021 08:12:50 - INFO - __main__ - Step 77956: {'lr': 0.00023962829233088577, 'samples': 14967552, 'steps': 77955, 'loss/train': 0.9482280015945435} 11/07/2021 08:12:51 - INFO - __main__ - Step 77957: {'lr': 0.00023962299015771273, 'samples': 14967744, 'steps': 77956, 'loss/train': 1.225379228591919} 11/07/2021 08:12:51 - INFO - __main__ - Step 77958: {'lr': 0.00023961768798921545, 'samples': 14967936, 'steps': 77957, 'loss/train': 0.6319544315338135} 11/07/2021 08:12:51 - INFO - __main__ - Step 77959: {'lr': 0.00023961238582539623, 'samples': 14968128, 'steps': 77958, 'loss/train': 1.3388479948043823} 11/07/2021 08:12:52 - INFO - __main__ - Step 77960: {'lr': 0.0002396070836662575, 'samples': 14968320, 'steps': 77959, 'loss/train': 1.4157596826553345} 11/07/2021 08:12:53 - INFO - __main__ - Step 77961: {'lr': 0.00023960178151180174, 'samples': 14968512, 'steps': 77960, 'loss/train': 1.4485349655151367} 11/07/2021 08:12:53 - INFO - __main__ - Step 77962: {'lr': 0.00023959647936203118, 'samples': 14968704, 'steps': 77961, 'loss/train': 1.4186756610870361} 11/07/2021 08:12:54 - INFO - __main__ - Step 77963: {'lr': 0.00023959117721694827, 'samples': 14968896, 'steps': 77962, 'loss/train': 1.4329206943511963} 11/07/2021 08:12:54 - INFO - __main__ - Step 77964: {'lr': 0.00023958587507655544, 'samples': 14969088, 'steps': 77963, 'loss/train': 1.3736602067947388} 11/07/2021 08:12:54 - INFO - __main__ - Step 77965: {'lr': 0.00023958057294085506, 'samples': 14969280, 'steps': 77964, 'loss/train': 1.4046721458435059} 11/07/2021 08:12:55 - INFO - __main__ - Step 77966: {'lr': 0.00023957527080984952, 'samples': 14969472, 'steps': 77965, 'loss/train': 1.5178924798965454} 11/07/2021 08:12:56 - INFO - __main__ - Step 77967: {'lr': 0.00023956996868354117, 'samples': 14969664, 'steps': 77966, 'loss/train': 1.4707967042922974} 11/07/2021 08:12:56 - INFO - __main__ - Step 77968: {'lr': 0.00023956466656193244, 'samples': 14969856, 'steps': 77967, 'loss/train': 0.9666402339935303} 11/07/2021 08:12:56 - INFO - __main__ - Step 77969: {'lr': 0.0002395593644450257, 'samples': 14970048, 'steps': 77968, 'loss/train': 1.2636483907699585} 11/07/2021 08:12:57 - INFO - __main__ - Step 77970: {'lr': 0.0002395540623328234, 'samples': 14970240, 'steps': 77969, 'loss/train': 1.3566055297851562} 11/07/2021 08:12:58 - INFO - __main__ - Step 77971: {'lr': 0.00023954876022532788, 'samples': 14970432, 'steps': 77970, 'loss/train': 0.8166517615318298} 11/07/2021 08:12:58 - INFO - __main__ - Step 77972: {'lr': 0.00023954345812254155, 'samples': 14970624, 'steps': 77971, 'loss/train': 1.3668203353881836} 11/07/2021 08:12:59 - INFO - __main__ - Step 77973: {'lr': 0.00023953815602446673, 'samples': 14970816, 'steps': 77972, 'loss/train': 1.255781650543213} 11/07/2021 08:12:59 - INFO - __main__ - Step 77974: {'lr': 0.00023953285393110582, 'samples': 14971008, 'steps': 77973, 'loss/train': 1.4998356103897095} 11/07/2021 08:12:59 - INFO - __main__ - Step 77975: {'lr': 0.00023952755184246128, 'samples': 14971200, 'steps': 77974, 'loss/train': 1.0410019159317017} 11/07/2021 08:13:02 - INFO - __main__ - Step 77976: {'lr': 0.00023952224975853546, 'samples': 14971392, 'steps': 77975, 'loss/train': 0.9589124321937561} 11/07/2021 08:13:02 - INFO - __main__ - Step 77977: {'lr': 0.0002395169476793307, 'samples': 14971584, 'steps': 77976, 'loss/train': 1.165202021598816} 11/07/2021 08:13:02 - INFO - __main__ - Step 77978: {'lr': 0.0002395116456048495, 'samples': 14971776, 'steps': 77977, 'loss/train': 1.7770085334777832} 11/07/2021 08:13:03 - INFO - __main__ - Step 77979: {'lr': 0.00023950634353509418, 'samples': 14971968, 'steps': 77978, 'loss/train': 1.6130321025848389} 11/07/2021 08:13:03 - INFO - __main__ - Step 77980: {'lr': 0.00023950104147006716, 'samples': 14972160, 'steps': 77979, 'loss/train': 1.844005823135376} 11/07/2021 08:13:03 - INFO - __main__ - Step 77981: {'lr': 0.00023949573940977077, 'samples': 14972352, 'steps': 77980, 'loss/train': 1.7680816650390625} 11/07/2021 08:13:04 - INFO - __main__ - Step 77982: {'lr': 0.00023949043735420746, 'samples': 14972544, 'steps': 77981, 'loss/train': 1.768716812133789} 11/07/2021 08:13:04 - INFO - __main__ - Step 77983: {'lr': 0.00023948513530337956, 'samples': 14972736, 'steps': 77982, 'loss/train': 1.755103588104248} 11/07/2021 08:13:05 - INFO - __main__ - Step 77984: {'lr': 0.00023947983325728952, 'samples': 14972928, 'steps': 77983, 'loss/train': 1.7576842308044434} 11/07/2021 08:13:05 - INFO - __main__ - Step 77985: {'lr': 0.00023947453121593985, 'samples': 14973120, 'steps': 77984, 'loss/train': 1.372873306274414} 11/07/2021 08:13:06 - INFO - __main__ - Step 77986: {'lr': 0.00023946922917933265, 'samples': 14973312, 'steps': 77985, 'loss/train': 1.0541188716888428} 11/07/2021 08:13:06 - INFO - __main__ - Step 77987: {'lr': 0.00023946392714747046, 'samples': 14973504, 'steps': 77986, 'loss/train': 1.5228298902511597} 11/07/2021 08:13:07 - INFO - __main__ - Step 77988: {'lr': 0.00023945862512035566, 'samples': 14973696, 'steps': 77987, 'loss/train': 1.1608003377914429} 11/07/2021 08:13:07 - INFO - __main__ - Step 77989: {'lr': 0.00023945332309799062, 'samples': 14973888, 'steps': 77988, 'loss/train': 1.6137566566467285} 11/07/2021 08:13:08 - INFO - __main__ - Step 77990: {'lr': 0.00023944802108037777, 'samples': 14974080, 'steps': 77989, 'loss/train': 0.9046348333358765} 11/07/2021 08:13:08 - INFO - __main__ - Step 77991: {'lr': 0.00023944271906751948, 'samples': 14974272, 'steps': 77990, 'loss/train': 1.0335562229156494} 11/07/2021 08:13:09 - INFO - __main__ - Step 77992: {'lr': 0.00023943741705941812, 'samples': 14974464, 'steps': 77991, 'loss/train': 1.4845428466796875} 11/07/2021 08:13:09 - INFO - __main__ - Step 77993: {'lr': 0.0002394321150560761, 'samples': 14974656, 'steps': 77992, 'loss/train': 0.9945977926254272} 11/07/2021 08:13:09 - INFO - __main__ - Step 77994: {'lr': 0.00023942681305749584, 'samples': 14974848, 'steps': 77993, 'loss/train': 1.717674970626831} 11/07/2021 08:13:10 - INFO - __main__ - Step 77995: {'lr': 0.00023942151106367968, 'samples': 14975040, 'steps': 77994, 'loss/train': 1.4194217920303345} 11/07/2021 08:13:11 - INFO - __main__ - Step 77996: {'lr': 0.00023941620907463003, 'samples': 14975232, 'steps': 77995, 'loss/train': 1.328128457069397} 11/07/2021 08:13:11 - INFO - __main__ - Step 77997: {'lr': 0.00023941090709034924, 'samples': 14975424, 'steps': 77996, 'loss/train': 1.6301342248916626} 11/07/2021 08:13:11 - INFO - __main__ - Step 77998: {'lr': 0.00023940560511083987, 'samples': 14975616, 'steps': 77997, 'loss/train': 1.3980534076690674} 11/07/2021 08:13:12 - INFO - __main__ - Step 77999: {'lr': 0.00023940030313610402, 'samples': 14975808, 'steps': 77998, 'loss/train': 1.5504627227783203} 11/07/2021 08:13:13 - INFO - __main__ - Step 78000: {'lr': 0.0002393950011661443, 'samples': 14976000, 'steps': 77999, 'loss/train': 0.9291006326675415} 11/07/2021 08:13:13 - INFO - __main__ - Step 78001: {'lr': 0.00023938969920096298, 'samples': 14976192, 'steps': 78000, 'loss/train': 1.7082850933074951} 11/07/2021 08:13:14 - INFO - __main__ - Step 78002: {'lr': 0.0002393843972405625, 'samples': 14976384, 'steps': 78001, 'loss/train': 1.1167800426483154} 11/07/2021 08:13:14 - INFO - __main__ - Step 78003: {'lr': 0.00023937909528494526, 'samples': 14976576, 'steps': 78002, 'loss/train': 1.2255433797836304} 11/07/2021 08:13:14 - INFO - __main__ - Step 78004: {'lr': 0.00023937379333411363, 'samples': 14976768, 'steps': 78003, 'loss/train': 1.6689029932022095} 11/07/2021 08:13:15 - INFO - __main__ - Step 78005: {'lr': 0.00023936849138807002, 'samples': 14976960, 'steps': 78004, 'loss/train': 1.4094953536987305} 11/07/2021 08:13:16 - INFO - __main__ - Step 78006: {'lr': 0.0002393631894468168, 'samples': 14977152, 'steps': 78005, 'loss/train': 1.2968460321426392} 11/07/2021 08:13:16 - INFO - __main__ - Step 78007: {'lr': 0.00023935788751035635, 'samples': 14977344, 'steps': 78006, 'loss/train': 1.4097015857696533} 11/07/2021 08:13:16 - INFO - __main__ - Step 78008: {'lr': 0.00023935258557869105, 'samples': 14977536, 'steps': 78007, 'loss/train': 1.4578518867492676} 11/07/2021 08:13:17 - INFO - __main__ - Step 78009: {'lr': 0.00023934728365182335, 'samples': 14977728, 'steps': 78008, 'loss/train': 1.354276180267334} 11/07/2021 08:13:18 - INFO - __main__ - Step 78010: {'lr': 0.00023934198172975558, 'samples': 14977920, 'steps': 78009, 'loss/train': 0.6393221020698547} 11/07/2021 08:13:18 - INFO - __main__ - Step 78011: {'lr': 0.00023933667981249025, 'samples': 14978112, 'steps': 78010, 'loss/train': 1.1933711767196655} 11/07/2021 08:13:18 - INFO - __main__ - Step 78012: {'lr': 0.00023933137790002956, 'samples': 14978304, 'steps': 78011, 'loss/train': 1.244978427886963} 11/07/2021 08:13:19 - INFO - __main__ - Step 78013: {'lr': 0.00023932607599237596, 'samples': 14978496, 'steps': 78012, 'loss/train': 1.6985658407211304} 11/07/2021 08:13:19 - INFO - __main__ - Step 78014: {'lr': 0.0002393207740895319, 'samples': 14978688, 'steps': 78013, 'loss/train': 1.1570593118667603} 11/07/2021 08:13:20 - INFO - __main__ - Step 78015: {'lr': 0.00023931547219149972, 'samples': 14978880, 'steps': 78014, 'loss/train': 1.3710498809814453} 11/07/2021 08:13:21 - INFO - __main__ - Step 78016: {'lr': 0.0002393101702982818, 'samples': 14979072, 'steps': 78015, 'loss/train': 1.2116552591323853} 11/07/2021 08:13:21 - INFO - __main__ - Step 78017: {'lr': 0.00023930486840988057, 'samples': 14979264, 'steps': 78016, 'loss/train': 1.8387013673782349} 11/07/2021 08:13:21 - INFO - __main__ - Step 78018: {'lr': 0.00023929956652629842, 'samples': 14979456, 'steps': 78017, 'loss/train': 0.7936192154884338} 11/07/2021 08:13:22 - INFO - __main__ - Step 78019: {'lr': 0.0002392942646475377, 'samples': 14979648, 'steps': 78018, 'loss/train': 1.3565257787704468} 11/07/2021 08:13:22 - INFO - __main__ - Step 78020: {'lr': 0.00023928896277360082, 'samples': 14979840, 'steps': 78019, 'loss/train': 1.958703875541687} 11/07/2021 08:13:23 - INFO - __main__ - Step 78021: {'lr': 0.00023928366090449017, 'samples': 14980032, 'steps': 78020, 'loss/train': 1.6551637649536133} 11/07/2021 08:13:23 - INFO - __main__ - Step 78022: {'lr': 0.00023927835904020815, 'samples': 14980224, 'steps': 78021, 'loss/train': 0.9644224643707275} 11/07/2021 08:13:24 - INFO - __main__ - Step 78023: {'lr': 0.00023927305718075712, 'samples': 14980416, 'steps': 78022, 'loss/train': 1.4657728672027588} 11/07/2021 08:13:24 - INFO - __main__ - Step 78024: {'lr': 0.00023926775532613948, 'samples': 14980608, 'steps': 78023, 'loss/train': 1.7870502471923828} 11/07/2021 08:13:24 - INFO - __main__ - Step 78025: {'lr': 0.00023926245347635774, 'samples': 14980800, 'steps': 78024, 'loss/train': 1.6764940023422241} 11/07/2021 08:13:25 - INFO - __main__ - Step 78026: {'lr': 0.00023925715163141407, 'samples': 14980992, 'steps': 78025, 'loss/train': 1.349534273147583} 11/07/2021 08:13:26 - INFO - __main__ - Step 78027: {'lr': 0.00023925184979131095, 'samples': 14981184, 'steps': 78026, 'loss/train': 1.3730826377868652} 11/07/2021 08:13:26 - INFO - __main__ - Step 78028: {'lr': 0.0002392465479560508, 'samples': 14981376, 'steps': 78027, 'loss/train': 1.244956135749817} 11/07/2021 08:13:26 - INFO - __main__ - Step 78029: {'lr': 0.00023924124612563597, 'samples': 14981568, 'steps': 78028, 'loss/train': 1.2242895364761353} 11/07/2021 08:13:27 - INFO - __main__ - Step 78030: {'lr': 0.00023923594430006888, 'samples': 14981760, 'steps': 78029, 'loss/train': 2.008558988571167} 11/07/2021 08:13:27 - INFO - __main__ - Step 78031: {'lr': 0.0002392306424793519, 'samples': 14981952, 'steps': 78030, 'loss/train': 1.6127583980560303} 11/07/2021 08:13:28 - INFO - __main__ - Step 78032: {'lr': 0.00023922534066348744, 'samples': 14982144, 'steps': 78031, 'loss/train': 1.1032663583755493} 11/07/2021 08:13:29 - INFO - __main__ - Step 78033: {'lr': 0.00023922003885247788, 'samples': 14982336, 'steps': 78032, 'loss/train': 1.3462462425231934} 11/07/2021 08:13:29 - INFO - __main__ - Step 78034: {'lr': 0.00023921473704632557, 'samples': 14982528, 'steps': 78033, 'loss/train': 1.2361431121826172} 11/07/2021 08:13:29 - INFO - __main__ - Step 78035: {'lr': 0.00023920943524503293, 'samples': 14982720, 'steps': 78034, 'loss/train': 0.9588436484336853} 11/07/2021 08:13:30 - INFO - __main__ - Step 78036: {'lr': 0.0002392041334486024, 'samples': 14982912, 'steps': 78035, 'loss/train': 0.9037348628044128} 11/07/2021 08:13:31 - INFO - __main__ - Step 78037: {'lr': 0.0002391988316570363, 'samples': 14983104, 'steps': 78036, 'loss/train': 1.5177912712097168} 11/07/2021 08:13:31 - INFO - __main__ - Step 78038: {'lr': 0.00023919352987033713, 'samples': 14983296, 'steps': 78037, 'loss/train': 1.163002610206604} 11/07/2021 08:13:31 - INFO - __main__ - Step 78039: {'lr': 0.0002391882280885071, 'samples': 14983488, 'steps': 78038, 'loss/train': 1.5717113018035889} 11/07/2021 08:13:32 - INFO - __main__ - Step 78040: {'lr': 0.00023918292631154868, 'samples': 14983680, 'steps': 78039, 'loss/train': 1.3509796857833862} 11/07/2021 08:13:32 - INFO - __main__ - Step 78041: {'lr': 0.00023917762453946426, 'samples': 14983872, 'steps': 78040, 'loss/train': 1.0830093622207642} 11/07/2021 08:13:33 - INFO - __main__ - Step 78042: {'lr': 0.00023917232277225625, 'samples': 14984064, 'steps': 78041, 'loss/train': 1.128140926361084} 11/07/2021 08:13:34 - INFO - __main__ - Step 78043: {'lr': 0.00023916702100992702, 'samples': 14984256, 'steps': 78042, 'loss/train': 0.7475703954696655} 11/07/2021 08:13:34 - INFO - __main__ - Step 78044: {'lr': 0.00023916171925247894, 'samples': 14984448, 'steps': 78043, 'loss/train': 1.6173160076141357} 11/07/2021 08:13:34 - INFO - __main__ - Step 78045: {'lr': 0.00023915641749991447, 'samples': 14984640, 'steps': 78044, 'loss/train': 1.690755844116211} 11/07/2021 08:13:35 - INFO - __main__ - Step 78046: {'lr': 0.00023915111575223592, 'samples': 14984832, 'steps': 78045, 'loss/train': 1.5247211456298828} 11/07/2021 08:13:36 - INFO - __main__ - Step 78047: {'lr': 0.00023914581400944572, 'samples': 14985024, 'steps': 78046, 'loss/train': 1.526320457458496} 11/07/2021 08:13:36 - INFO - __main__ - Step 78048: {'lr': 0.00023914051227154622, 'samples': 14985216, 'steps': 78047, 'loss/train': 0.962614119052887} 11/07/2021 08:13:36 - INFO - __main__ - Step 78049: {'lr': 0.00023913521053853988, 'samples': 14985408, 'steps': 78048, 'loss/train': 1.2538427114486694} 11/07/2021 08:13:37 - INFO - __main__ - Step 78050: {'lr': 0.00023912990881042902, 'samples': 14985600, 'steps': 78049, 'loss/train': 1.5927335023880005} 11/07/2021 08:13:37 - INFO - __main__ - Step 78051: {'lr': 0.00023912460708721607, 'samples': 14985792, 'steps': 78050, 'loss/train': 1.1865047216415405} 11/07/2021 08:13:38 - INFO - __main__ - Step 78052: {'lr': 0.00023911930536890346, 'samples': 14985984, 'steps': 78051, 'loss/train': 1.5707935094833374} 11/07/2021 08:13:38 - INFO - __main__ - Step 78053: {'lr': 0.00023911400365549348, 'samples': 14986176, 'steps': 78052, 'loss/train': 1.4973784685134888} 11/07/2021 08:13:39 - INFO - __main__ - Step 78054: {'lr': 0.00023910870194698855, 'samples': 14986368, 'steps': 78053, 'loss/train': 0.9313587546348572} 11/07/2021 08:13:39 - INFO - __main__ - Step 78055: {'lr': 0.00023910340024339106, 'samples': 14986560, 'steps': 78054, 'loss/train': 1.358694076538086} 11/07/2021 08:13:39 - INFO - __main__ - Step 78056: {'lr': 0.0002390980985447034, 'samples': 14986752, 'steps': 78055, 'loss/train': 1.1973398923873901} 11/07/2021 08:13:40 - INFO - __main__ - Step 78057: {'lr': 0.000239092796850928, 'samples': 14986944, 'steps': 78056, 'loss/train': 1.4888583421707153} 11/07/2021 08:13:41 - INFO - __main__ - Step 78058: {'lr': 0.0002390874951620672, 'samples': 14987136, 'steps': 78057, 'loss/train': 1.0732277631759644} 11/07/2021 08:13:41 - INFO - __main__ - Step 78059: {'lr': 0.0002390821934781234, 'samples': 14987328, 'steps': 78058, 'loss/train': 1.498191237449646} 11/07/2021 08:13:42 - INFO - __main__ - Step 78060: {'lr': 0.00023907689179909896, 'samples': 14987520, 'steps': 78059, 'loss/train': 1.2570701837539673} 11/07/2021 08:13:42 - INFO - __main__ - Step 78061: {'lr': 0.00023907159012499636, 'samples': 14987712, 'steps': 78060, 'loss/train': 1.6908997297286987} 11/07/2021 08:13:43 - INFO - __main__ - Step 78062: {'lr': 0.00023906628845581798, 'samples': 14987904, 'steps': 78061, 'loss/train': 1.4306678771972656} 11/07/2021 08:13:44 - INFO - __main__ - Step 78063: {'lr': 0.0002390609867915661, 'samples': 14988096, 'steps': 78062, 'loss/train': 1.537276268005371} 11/07/2021 08:13:44 - INFO - __main__ - Step 78064: {'lr': 0.00023905568513224316, 'samples': 14988288, 'steps': 78063, 'loss/train': 1.445479393005371} 11/07/2021 08:13:44 - INFO - __main__ - Step 78065: {'lr': 0.00023905038347785164, 'samples': 14988480, 'steps': 78064, 'loss/train': 1.8595983982086182} 11/07/2021 08:13:45 - INFO - __main__ - Step 78066: {'lr': 0.00023904508182839376, 'samples': 14988672, 'steps': 78065, 'loss/train': 1.57357656955719} 11/07/2021 08:13:45 - INFO - __main__ - Step 78067: {'lr': 0.00023903978018387201, 'samples': 14988864, 'steps': 78066, 'loss/train': 3.3155486583709717} 11/07/2021 08:13:46 - INFO - __main__ - Step 78068: {'lr': 0.00023903447854428878, 'samples': 14989056, 'steps': 78067, 'loss/train': 3.196226119995117} 11/07/2021 08:13:46 - INFO - __main__ - Step 78069: {'lr': 0.00023902917690964644, 'samples': 14989248, 'steps': 78068, 'loss/train': 1.1449272632598877} 11/07/2021 08:13:47 - INFO - __main__ - Step 78070: {'lr': 0.00023902387527994734, 'samples': 14989440, 'steps': 78069, 'loss/train': 1.226767659187317} 11/07/2021 08:13:47 - INFO - __main__ - Step 78071: {'lr': 0.00023901857365519398, 'samples': 14989632, 'steps': 78070, 'loss/train': 1.8433715105056763} 11/07/2021 08:13:47 - INFO - __main__ - Step 78072: {'lr': 0.00023901327203538865, 'samples': 14989824, 'steps': 78071, 'loss/train': 0.9840089082717896} 11/07/2021 08:13:48 - INFO - __main__ - Step 78073: {'lr': 0.00023900797042053382, 'samples': 14990016, 'steps': 78072, 'loss/train': 1.3959020376205444} 11/07/2021 08:13:49 - INFO - __main__ - Step 78074: {'lr': 0.00023900266881063175, 'samples': 14990208, 'steps': 78073, 'loss/train': 1.0110763311386108} 11/07/2021 08:13:49 - INFO - __main__ - Step 78075: {'lr': 0.00023899736720568496, 'samples': 14990400, 'steps': 78074, 'loss/train': 1.469193696975708} 11/07/2021 08:13:50 - INFO - __main__ - Step 78076: {'lr': 0.00023899206560569575, 'samples': 14990592, 'steps': 78075, 'loss/train': 1.4406479597091675} 11/07/2021 08:13:50 - INFO - __main__ - Step 78077: {'lr': 0.00023898676401066659, 'samples': 14990784, 'steps': 78076, 'loss/train': 1.4482618570327759} 11/07/2021 08:13:50 - INFO - __main__ - Step 78078: {'lr': 0.00023898146242059976, 'samples': 14990976, 'steps': 78077, 'loss/train': 1.6897931098937988} 11/07/2021 08:13:51 - INFO - __main__ - Step 78079: {'lr': 0.00023897616083549782, 'samples': 14991168, 'steps': 78078, 'loss/train': 1.6180381774902344} 11/07/2021 08:13:52 - INFO - __main__ - Step 78080: {'lr': 0.00023897085925536296, 'samples': 14991360, 'steps': 78079, 'loss/train': 1.7888966798782349} 11/07/2021 08:13:52 - INFO - __main__ - Step 78081: {'lr': 0.0002389655576801977, 'samples': 14991552, 'steps': 78080, 'loss/train': 1.228509545326233} 11/07/2021 08:13:52 - INFO - __main__ - Step 78082: {'lr': 0.00023896025611000435, 'samples': 14991744, 'steps': 78081, 'loss/train': 1.4067648649215698} 11/07/2021 08:13:53 - INFO - __main__ - Step 78083: {'lr': 0.00023895495454478535, 'samples': 14991936, 'steps': 78082, 'loss/train': 1.2394611835479736} 11/07/2021 08:13:54 - INFO - __main__ - Step 78084: {'lr': 0.00023894965298454316, 'samples': 14992128, 'steps': 78083, 'loss/train': 0.635328471660614} 11/07/2021 08:13:54 - INFO - __main__ - Step 78085: {'lr': 0.00023894435142928, 'samples': 14992320, 'steps': 78084, 'loss/train': 1.2093278169631958} 11/07/2021 08:13:54 - INFO - __main__ - Step 78086: {'lr': 0.00023893904987899836, 'samples': 14992512, 'steps': 78085, 'loss/train': 1.222593069076538} 11/07/2021 08:13:55 - INFO - __main__ - Step 78087: {'lr': 0.0002389337483337006, 'samples': 14992704, 'steps': 78086, 'loss/train': 1.3543957471847534} 11/07/2021 08:13:55 - INFO - __main__ - Step 78088: {'lr': 0.00023892844679338914, 'samples': 14992896, 'steps': 78087, 'loss/train': 1.468048334121704} 11/07/2021 08:13:56 - INFO - __main__ - Step 78089: {'lr': 0.00023892314525806633, 'samples': 14993088, 'steps': 78088, 'loss/train': 1.4523447751998901} 11/07/2021 08:13:57 - INFO - __main__ - Step 78090: {'lr': 0.0002389178437277346, 'samples': 14993280, 'steps': 78089, 'loss/train': 1.3588682413101196} 11/07/2021 08:13:57 - INFO - __main__ - Step 78091: {'lr': 0.0002389125422023963, 'samples': 14993472, 'steps': 78090, 'loss/train': 1.6464015245437622} 11/07/2021 08:13:57 - INFO - __main__ - Step 78092: {'lr': 0.0002389072406820539, 'samples': 14993664, 'steps': 78091, 'loss/train': 1.3125405311584473} 11/07/2021 08:13:58 - INFO - __main__ - Step 78093: {'lr': 0.00023890193916670967, 'samples': 14993856, 'steps': 78092, 'loss/train': 0.9914565682411194} 11/07/2021 08:13:59 - INFO - __main__ - Step 78094: {'lr': 0.00023889663765636607, 'samples': 14994048, 'steps': 78093, 'loss/train': 0.594708263874054} 11/07/2021 08:13:59 - INFO - __main__ - Step 78095: {'lr': 0.0002388913361510255, 'samples': 14994240, 'steps': 78094, 'loss/train': 1.1296038627624512} 11/07/2021 08:13:59 - INFO - __main__ - Step 78096: {'lr': 0.0002388860346506903, 'samples': 14994432, 'steps': 78095, 'loss/train': 1.4597665071487427} 11/07/2021 08:14:00 - INFO - __main__ - Step 78097: {'lr': 0.00023888073315536285, 'samples': 14994624, 'steps': 78096, 'loss/train': 1.591378927230835} 11/07/2021 08:14:00 - INFO - __main__ - Step 78098: {'lr': 0.0002388754316650456, 'samples': 14994816, 'steps': 78097, 'loss/train': 1.7551658153533936} 11/07/2021 08:14:01 - INFO - __main__ - Step 78099: {'lr': 0.00023887013017974087, 'samples': 14995008, 'steps': 78098, 'loss/train': 1.3114577531814575} 11/07/2021 08:14:02 - INFO - __main__ - Step 78100: {'lr': 0.00023886482869945114, 'samples': 14995200, 'steps': 78099, 'loss/train': 1.425139307975769} 11/07/2021 08:14:02 - INFO - __main__ - Step 78101: {'lr': 0.0002388595272241787, 'samples': 14995392, 'steps': 78100, 'loss/train': 1.7731469869613647} 11/07/2021 08:14:02 - INFO - __main__ - Step 78102: {'lr': 0.000238854225753926, 'samples': 14995584, 'steps': 78101, 'loss/train': 1.5100336074829102} 11/07/2021 08:14:03 - INFO - __main__ - Step 78103: {'lr': 0.0002388489242886954, 'samples': 14995776, 'steps': 78102, 'loss/train': 1.7184019088745117} 11/07/2021 08:14:03 - INFO - __main__ - Step 78104: {'lr': 0.00023884362282848933, 'samples': 14995968, 'steps': 78103, 'loss/train': 1.2291226387023926} 11/07/2021 08:14:04 - INFO - __main__ - Step 78105: {'lr': 0.00023883832137331016, 'samples': 14996160, 'steps': 78104, 'loss/train': 0.9150051474571228} 11/07/2021 08:14:04 - INFO - __main__ - Step 78106: {'lr': 0.00023883301992316038, 'samples': 14996352, 'steps': 78105, 'loss/train': 0.8867225050926208} 11/07/2021 08:14:05 - INFO - __main__ - Step 78107: {'lr': 0.00023882771847804214, 'samples': 14996544, 'steps': 78106, 'loss/train': 1.1396229267120361} 11/07/2021 08:14:05 - INFO - __main__ - Step 78108: {'lr': 0.00023882241703795793, 'samples': 14996736, 'steps': 78107, 'loss/train': 1.0661993026733398} 11/07/2021 08:14:05 - INFO - __main__ - Step 78109: {'lr': 0.0002388171156029102, 'samples': 14996928, 'steps': 78108, 'loss/train': 1.0602679252624512} 11/07/2021 08:14:06 - INFO - __main__ - Step 78110: {'lr': 0.00023881181417290129, 'samples': 14997120, 'steps': 78109, 'loss/train': 1.401971459388733} 11/07/2021 08:14:07 - INFO - __main__ - Step 78111: {'lr': 0.00023880651274793365, 'samples': 14997312, 'steps': 78110, 'loss/train': 1.23435640335083} 11/07/2021 08:14:07 - INFO - __main__ - Step 78112: {'lr': 0.00023880121132800955, 'samples': 14997504, 'steps': 78111, 'loss/train': 1.2486180067062378} 11/07/2021 08:14:08 - INFO - __main__ - Step 78113: {'lr': 0.0002387959099131315, 'samples': 14997696, 'steps': 78112, 'loss/train': 1.5697765350341797} 11/07/2021 08:14:08 - INFO - __main__ - Step 78114: {'lr': 0.00023879060850330182, 'samples': 14997888, 'steps': 78113, 'loss/train': 1.2316522598266602} 11/07/2021 08:14:09 - INFO - __main__ - Step 78115: {'lr': 0.00023878530709852292, 'samples': 14998080, 'steps': 78114, 'loss/train': 0.9981136918067932} 11/07/2021 08:14:09 - INFO - __main__ - Step 78116: {'lr': 0.00023878000569879722, 'samples': 14998272, 'steps': 78115, 'loss/train': 0.7359042167663574} 11/07/2021 08:14:10 - INFO - __main__ - Step 78117: {'lr': 0.00023877470430412704, 'samples': 14998464, 'steps': 78116, 'loss/train': 1.3312729597091675} 11/07/2021 08:14:10 - INFO - __main__ - Step 78118: {'lr': 0.00023876940291451483, 'samples': 14998656, 'steps': 78117, 'loss/train': 1.0470448732376099} 11/07/2021 08:14:10 - INFO - __main__ - Step 78119: {'lr': 0.00023876410152996302, 'samples': 14998848, 'steps': 78118, 'loss/train': 1.636806607246399} 11/07/2021 08:14:11 - INFO - __main__ - Step 78120: {'lr': 0.00023875880015047387, 'samples': 14999040, 'steps': 78119, 'loss/train': 1.4413422346115112} 11/07/2021 08:14:12 - INFO - __main__ - Step 78121: {'lr': 0.00023875349877604978, 'samples': 14999232, 'steps': 78120, 'loss/train': 1.9252433776855469} 11/07/2021 08:14:12 - INFO - __main__ - Step 78122: {'lr': 0.00023874819740669323, 'samples': 14999424, 'steps': 78121, 'loss/train': 1.5409380197525024} 11/07/2021 08:14:12 - INFO - __main__ - Step 78123: {'lr': 0.00023874289604240657, 'samples': 14999616, 'steps': 78122, 'loss/train': 2.090256690979004} 11/07/2021 08:14:13 - INFO - __main__ - Step 78124: {'lr': 0.00023873759468319216, 'samples': 14999808, 'steps': 78123, 'loss/train': 1.5459468364715576} 11/07/2021 08:14:13 - INFO - __main__ - Step 78125: {'lr': 0.00023873229332905244, 'samples': 15000000, 'steps': 78124, 'loss/train': 1.4222909212112427} 11/07/2021 08:14:14 - INFO - __main__ - Step 78126: {'lr': 0.0002387269919799898, 'samples': 15000192, 'steps': 78125, 'loss/train': 1.596811294555664} 11/07/2021 08:14:15 - INFO - __main__ - Step 78127: {'lr': 0.00023872169063600653, 'samples': 15000384, 'steps': 78126, 'loss/train': 0.48156511783599854} 11/07/2021 08:14:15 - INFO - __main__ - Step 78128: {'lr': 0.00023871638929710514, 'samples': 15000576, 'steps': 78127, 'loss/train': 1.3937110900878906} 11/07/2021 08:14:15 - INFO - __main__ - Step 78129: {'lr': 0.000238711087963288, 'samples': 15000768, 'steps': 78128, 'loss/train': 1.2218196392059326} 11/07/2021 08:14:16 - INFO - __main__ - Step 78130: {'lr': 0.0002387057866345574, 'samples': 15000960, 'steps': 78129, 'loss/train': 2.0534346103668213} 11/07/2021 08:14:17 - INFO - __main__ - Step 78131: {'lr': 0.00023870048531091583, 'samples': 15001152, 'steps': 78130, 'loss/train': 1.4377130270004272} 11/07/2021 08:14:17 - INFO - __main__ - Step 78132: {'lr': 0.00023869518399236578, 'samples': 15001344, 'steps': 78131, 'loss/train': 1.4135092496871948} 11/07/2021 08:14:17 - INFO - __main__ - Step 78133: {'lr': 0.00023868988267890937, 'samples': 15001536, 'steps': 78132, 'loss/train': 1.326572060585022} 11/07/2021 08:14:18 - INFO - __main__ - Step 78134: {'lr': 0.00023868458137054913, 'samples': 15001728, 'steps': 78133, 'loss/train': 1.4945846796035767} 11/07/2021 08:14:18 - INFO - __main__ - Step 78135: {'lr': 0.00023867928006728745, 'samples': 15001920, 'steps': 78134, 'loss/train': 1.4010143280029297} 11/07/2021 08:14:19 - INFO - __main__ - Step 78136: {'lr': 0.0002386739787691267, 'samples': 15002112, 'steps': 78135, 'loss/train': 1.136203408241272} 11/07/2021 08:14:19 - INFO - __main__ - Step 78137: {'lr': 0.0002386686774760693, 'samples': 15002304, 'steps': 78136, 'loss/train': 1.1112319231033325} 11/07/2021 08:14:20 - INFO - __main__ - Step 78138: {'lr': 0.0002386633761881176, 'samples': 15002496, 'steps': 78137, 'loss/train': 1.482151746749878} 11/07/2021 08:14:20 - INFO - __main__ - Step 78139: {'lr': 0.00023865807490527403, 'samples': 15002688, 'steps': 78138, 'loss/train': 1.6139439344406128} 11/07/2021 08:14:20 - INFO - __main__ - Step 78140: {'lr': 0.0002386527736275409, 'samples': 15002880, 'steps': 78139, 'loss/train': 0.9661740660667419} 11/07/2021 08:14:21 - INFO - __main__ - Step 78141: {'lr': 0.0002386474723549207, 'samples': 15003072, 'steps': 78140, 'loss/train': 1.2991164922714233} 11/07/2021 08:14:22 - INFO - __main__ - Step 78142: {'lr': 0.00023864217108741578, 'samples': 15003264, 'steps': 78141, 'loss/train': 1.5202686786651611} 11/07/2021 08:14:22 - INFO - __main__ - Step 78143: {'lr': 0.00023863686982502852, 'samples': 15003456, 'steps': 78142, 'loss/train': 1.362499713897705} 11/07/2021 08:14:22 - INFO - __main__ - Step 78144: {'lr': 0.0002386315685677613, 'samples': 15003648, 'steps': 78143, 'loss/train': 1.3921178579330444} 11/07/2021 08:14:23 - INFO - __main__ - Step 78145: {'lr': 0.0002386262673156165, 'samples': 15003840, 'steps': 78144, 'loss/train': 1.32278311252594} 11/07/2021 08:14:24 - INFO - __main__ - Step 78146: {'lr': 0.0002386209660685967, 'samples': 15004032, 'steps': 78145, 'loss/train': 1.7171539068222046} 11/07/2021 08:14:24 - INFO - __main__ - Step 78147: {'lr': 0.00023861566482670393, 'samples': 15004224, 'steps': 78146, 'loss/train': 0.9696818590164185} 11/07/2021 08:14:25 - INFO - __main__ - Step 78148: {'lr': 0.0002386103635899408, 'samples': 15004416, 'steps': 78147, 'loss/train': 1.9404878616333008} 11/07/2021 08:14:25 - INFO - __main__ - Step 78149: {'lr': 0.00023860506235830967, 'samples': 15004608, 'steps': 78148, 'loss/train': 1.1505050659179688} 11/07/2021 08:14:25 - INFO - __main__ - Step 78150: {'lr': 0.00023859976113181291, 'samples': 15004800, 'steps': 78149, 'loss/train': 0.17432953417301178} 11/07/2021 08:14:26 - INFO - __main__ - Step 78151: {'lr': 0.00023859445991045294, 'samples': 15004992, 'steps': 78150, 'loss/train': 1.5567532777786255} 11/07/2021 08:14:27 - INFO - __main__ - Step 78152: {'lr': 0.00023858915869423214, 'samples': 15005184, 'steps': 78151, 'loss/train': 1.337003469467163} 11/07/2021 08:14:27 - INFO - __main__ - Step 78153: {'lr': 0.00023858385748315287, 'samples': 15005376, 'steps': 78152, 'loss/train': 1.3242172002792358} 11/07/2021 08:14:28 - INFO - __main__ - Step 78154: {'lr': 0.00023857855627721752, 'samples': 15005568, 'steps': 78153, 'loss/train': 1.4481998682022095} 11/07/2021 08:14:28 - INFO - __main__ - Step 78155: {'lr': 0.00023857325507642852, 'samples': 15005760, 'steps': 78154, 'loss/train': 1.3660293817520142} 11/07/2021 08:14:28 - INFO - __main__ - Step 78156: {'lr': 0.00023856795388078824, 'samples': 15005952, 'steps': 78155, 'loss/train': 0.9424132108688354} 11/07/2021 08:14:29 - INFO - __main__ - Step 78157: {'lr': 0.00023856265269029902, 'samples': 15006144, 'steps': 78156, 'loss/train': 1.4837368726730347} 11/07/2021 08:14:30 - INFO - __main__ - Step 78158: {'lr': 0.00023855735150496335, 'samples': 15006336, 'steps': 78157, 'loss/train': 1.220105767250061} 11/07/2021 08:14:30 - INFO - __main__ - Step 78159: {'lr': 0.00023855205032478365, 'samples': 15006528, 'steps': 78158, 'loss/train': 2.1212382316589355} 11/07/2021 08:14:30 - INFO - __main__ - Step 78160: {'lr': 0.0002385467491497621, 'samples': 15006720, 'steps': 78159, 'loss/train': 1.2228741645812988} 11/07/2021 08:14:31 - INFO - __main__ - Step 78161: {'lr': 0.0002385414479799012, 'samples': 15006912, 'steps': 78160, 'loss/train': 1.2131872177124023} 11/07/2021 08:14:32 - INFO - __main__ - Step 78162: {'lr': 0.00023853614681520338, 'samples': 15007104, 'steps': 78161, 'loss/train': 1.7264634370803833} 11/07/2021 08:14:32 - INFO - __main__ - Step 78163: {'lr': 0.00023853084565567099, 'samples': 15007296, 'steps': 78162, 'loss/train': 0.9268559217453003} 11/07/2021 08:14:32 - INFO - __main__ - Step 78164: {'lr': 0.00023852554450130639, 'samples': 15007488, 'steps': 78163, 'loss/train': 1.645409345626831} 11/07/2021 08:14:33 - INFO - __main__ - Step 78165: {'lr': 0.00023852024335211202, 'samples': 15007680, 'steps': 78164, 'loss/train': 1.4518036842346191} 11/07/2021 08:14:33 - INFO - __main__ - Step 78166: {'lr': 0.00023851494220809025, 'samples': 15007872, 'steps': 78165, 'loss/train': 1.093623161315918} 11/07/2021 08:14:34 - INFO - __main__ - Step 78167: {'lr': 0.00023850964106924348, 'samples': 15008064, 'steps': 78166, 'loss/train': 1.7588783502578735} 11/07/2021 08:14:34 - INFO - __main__ - Step 78168: {'lr': 0.00023850433993557408, 'samples': 15008256, 'steps': 78167, 'loss/train': 1.4154406785964966} 11/07/2021 08:14:35 - INFO - __main__ - Step 78169: {'lr': 0.00023849903880708445, 'samples': 15008448, 'steps': 78168, 'loss/train': 1.7850645780563354} 11/07/2021 08:14:35 - INFO - __main__ - Step 78170: {'lr': 0.00023849373768377696, 'samples': 15008640, 'steps': 78169, 'loss/train': 1.763724684715271} 11/07/2021 08:14:36 - INFO - __main__ - Step 78171: {'lr': 0.00023848843656565407, 'samples': 15008832, 'steps': 78170, 'loss/train': 1.4197041988372803} 11/07/2021 08:14:37 - INFO - __main__ - Step 78172: {'lr': 0.00023848313545271805, 'samples': 15009024, 'steps': 78171, 'loss/train': 1.5919677019119263} 11/07/2021 08:14:37 - INFO - __main__ - Step 78173: {'lr': 0.00023847783434497146, 'samples': 15009216, 'steps': 78172, 'loss/train': 1.6209498643875122} 11/07/2021 08:14:37 - INFO - __main__ - Step 78174: {'lr': 0.00023847253324241652, 'samples': 15009408, 'steps': 78173, 'loss/train': 1.164219856262207} 11/07/2021 08:14:38 - INFO - __main__ - Step 78175: {'lr': 0.00023846723214505564, 'samples': 15009600, 'steps': 78174, 'loss/train': 1.5058425664901733} 11/07/2021 08:14:38 - INFO - __main__ - Step 78176: {'lr': 0.00023846193105289126, 'samples': 15009792, 'steps': 78175, 'loss/train': 1.748576045036316} 11/07/2021 08:14:38 - INFO - __main__ - Step 78177: {'lr': 0.00023845662996592576, 'samples': 15009984, 'steps': 78176, 'loss/train': 1.4155415296554565} 11/07/2021 08:14:39 - INFO - __main__ - Step 78178: {'lr': 0.0002384513288841615, 'samples': 15010176, 'steps': 78177, 'loss/train': 1.0324021577835083} 11/07/2021 08:14:40 - INFO - __main__ - Step 78179: {'lr': 0.00023844602780760094, 'samples': 15010368, 'steps': 78178, 'loss/train': 1.5024482011795044} 11/07/2021 08:14:40 - INFO - __main__ - Step 78180: {'lr': 0.0002384407267362464, 'samples': 15010560, 'steps': 78179, 'loss/train': 1.439399003982544} 11/07/2021 08:14:40 - INFO - __main__ - Step 78181: {'lr': 0.00023843542567010027, 'samples': 15010752, 'steps': 78180, 'loss/train': 1.1001219749450684} 11/07/2021 08:14:41 - INFO - __main__ - Step 78182: {'lr': 0.00023843012460916498, 'samples': 15010944, 'steps': 78181, 'loss/train': 1.4900442361831665} 11/07/2021 08:14:42 - INFO - __main__ - Step 78183: {'lr': 0.00023842482355344288, 'samples': 15011136, 'steps': 78182, 'loss/train': 1.5510663986206055} 11/07/2021 08:14:42 - INFO - __main__ - Step 78184: {'lr': 0.0002384195225029364, 'samples': 15011328, 'steps': 78183, 'loss/train': 1.4024056196212769} 11/07/2021 08:14:42 - INFO - __main__ - Step 78185: {'lr': 0.00023841422145764787, 'samples': 15011520, 'steps': 78184, 'loss/train': 1.4217909574508667} 11/07/2021 08:14:43 - INFO - __main__ - Step 78186: {'lr': 0.00023840892041757987, 'samples': 15011712, 'steps': 78185, 'loss/train': 1.6043528318405151} 11/07/2021 08:14:43 - INFO - __main__ - Step 78187: {'lr': 0.00023840361938273446, 'samples': 15011904, 'steps': 78186, 'loss/train': 1.54881751537323} 11/07/2021 08:14:44 - INFO - __main__ - Step 78188: {'lr': 0.00023839831835311426, 'samples': 15012096, 'steps': 78187, 'loss/train': 1.3425018787384033} 11/07/2021 08:14:45 - INFO - __main__ - Step 78189: {'lr': 0.00023839301732872157, 'samples': 15012288, 'steps': 78188, 'loss/train': 1.7896711826324463} 11/07/2021 08:14:45 - INFO - __main__ - Step 78190: {'lr': 0.0002383877163095588, 'samples': 15012480, 'steps': 78189, 'loss/train': 0.6571738719940186} 11/07/2021 08:14:45 - INFO - __main__ - Step 78191: {'lr': 0.00023838241529562838, 'samples': 15012672, 'steps': 78190, 'loss/train': 1.2280545234680176} 11/07/2021 08:14:46 - INFO - __main__ - Step 78192: {'lr': 0.00023837711428693263, 'samples': 15012864, 'steps': 78191, 'loss/train': 1.1818337440490723} 11/07/2021 08:14:47 - INFO - __main__ - Step 78193: {'lr': 0.00023837181328347398, 'samples': 15013056, 'steps': 78192, 'loss/train': 1.1200580596923828} 11/07/2021 08:14:47 - INFO - __main__ - Step 78194: {'lr': 0.00023836651228525483, 'samples': 15013248, 'steps': 78193, 'loss/train': 1.6257773637771606} 11/07/2021 08:14:47 - INFO - __main__ - Step 78195: {'lr': 0.00023836121129227754, 'samples': 15013440, 'steps': 78194, 'loss/train': 1.6217399835586548} 11/07/2021 08:14:48 - INFO - __main__ - Step 78196: {'lr': 0.0002383559103045445, 'samples': 15013632, 'steps': 78195, 'loss/train': 1.1956870555877686} 11/07/2021 08:14:48 - INFO - __main__ - Step 78197: {'lr': 0.00023835060932205816, 'samples': 15013824, 'steps': 78196, 'loss/train': 1.3433539867401123} 11/07/2021 08:14:49 - INFO - __main__ - Step 78198: {'lr': 0.00023834530834482078, 'samples': 15014016, 'steps': 78197, 'loss/train': 1.3544726371765137} 11/07/2021 08:14:49 - INFO - __main__ - Step 78199: {'lr': 0.00023834000737283487, 'samples': 15014208, 'steps': 78198, 'loss/train': 1.0987510681152344} 11/07/2021 08:14:50 - INFO - __main__ - Step 78200: {'lr': 0.00023833470640610281, 'samples': 15014400, 'steps': 78199, 'loss/train': 1.606430172920227} 11/07/2021 08:14:50 - INFO - __main__ - Step 78201: {'lr': 0.0002383294054446269, 'samples': 15014592, 'steps': 78200, 'loss/train': 1.3926993608474731} 11/07/2021 08:14:50 - INFO - __main__ - Step 78202: {'lr': 0.0002383241044884096, 'samples': 15014784, 'steps': 78201, 'loss/train': 1.5726244449615479} 11/07/2021 08:14:52 - INFO - __main__ - Step 78203: {'lr': 0.00023831880353745321, 'samples': 15014976, 'steps': 78202, 'loss/train': 1.2681773900985718} 11/07/2021 08:14:52 - INFO - __main__ - Step 78204: {'lr': 0.00023831350259176024, 'samples': 15015168, 'steps': 78203, 'loss/train': 1.5147393941879272} 11/07/2021 08:14:52 - INFO - __main__ - Step 78205: {'lr': 0.000238308201651333, 'samples': 15015360, 'steps': 78204, 'loss/train': 1.638044834136963} 11/07/2021 08:14:53 - INFO - __main__ - Step 78206: {'lr': 0.00023830290071617395, 'samples': 15015552, 'steps': 78205, 'loss/train': 1.49178946018219} 11/07/2021 08:14:53 - INFO - __main__ - Step 78207: {'lr': 0.0002382975997862854, 'samples': 15015744, 'steps': 78206, 'loss/train': 1.205287218093872} 11/07/2021 08:14:54 - INFO - __main__ - Step 78208: {'lr': 0.00023829229886166984, 'samples': 15015936, 'steps': 78207, 'loss/train': 1.069957971572876} 11/07/2021 08:14:54 - INFO - __main__ - Step 78209: {'lr': 0.0002382869979423295, 'samples': 15016128, 'steps': 78208, 'loss/train': 1.3917100429534912} 11/07/2021 08:14:55 - INFO - __main__ - Step 78210: {'lr': 0.00023828169702826688, 'samples': 15016320, 'steps': 78209, 'loss/train': 1.3400708436965942} 11/07/2021 08:14:55 - INFO - __main__ - Step 78211: {'lr': 0.00023827639611948435, 'samples': 15016512, 'steps': 78210, 'loss/train': 1.3840092420578003} 11/07/2021 08:14:55 - INFO - __main__ - Step 78212: {'lr': 0.00023827109521598432, 'samples': 15016704, 'steps': 78211, 'loss/train': 1.3896925449371338} 11/07/2021 08:14:56 - INFO - __main__ - Step 78213: {'lr': 0.00023826579431776915, 'samples': 15016896, 'steps': 78212, 'loss/train': 1.5260467529296875} 11/07/2021 08:14:57 - INFO - __main__ - Step 78214: {'lr': 0.0002382604934248412, 'samples': 15017088, 'steps': 78213, 'loss/train': 1.3431482315063477} 11/07/2021 08:14:57 - INFO - __main__ - Step 78215: {'lr': 0.0002382551925372029, 'samples': 15017280, 'steps': 78214, 'loss/train': 1.584030032157898} 11/07/2021 08:14:57 - INFO - __main__ - Step 78216: {'lr': 0.00023824989165485664, 'samples': 15017472, 'steps': 78215, 'loss/train': 1.3302334547042847} 11/07/2021 08:14:58 - INFO - __main__ - Step 78217: {'lr': 0.00023824459077780477, 'samples': 15017664, 'steps': 78216, 'loss/train': 1.4841762781143188} 11/07/2021 08:14:59 - INFO - __main__ - Step 78218: {'lr': 0.00023823928990604972, 'samples': 15017856, 'steps': 78217, 'loss/train': 1.9532908201217651} 11/07/2021 08:14:59 - INFO - __main__ - Step 78219: {'lr': 0.00023823398903959395, 'samples': 15018048, 'steps': 78218, 'loss/train': 1.7440917491912842} 11/07/2021 08:14:59 - INFO - __main__ - Step 78220: {'lr': 0.00023822868817843969, 'samples': 15018240, 'steps': 78219, 'loss/train': 1.1505388021469116} 11/07/2021 08:15:00 - INFO - __main__ - Step 78221: {'lr': 0.00023822338732258937, 'samples': 15018432, 'steps': 78220, 'loss/train': 1.3128224611282349} 11/07/2021 08:15:00 - INFO - __main__ - Step 78222: {'lr': 0.00023821808647204543, 'samples': 15018624, 'steps': 78221, 'loss/train': 1.3288787603378296} 11/07/2021 08:15:01 - INFO - __main__ - Step 78223: {'lr': 0.00023821278562681023, 'samples': 15018816, 'steps': 78222, 'loss/train': 1.093958854675293} 11/07/2021 08:15:02 - INFO - __main__ - Step 78224: {'lr': 0.00023820748478688616, 'samples': 15019008, 'steps': 78223, 'loss/train': 1.399006724357605} 11/07/2021 08:15:02 - INFO - __main__ - Step 78225: {'lr': 0.00023820218395227566, 'samples': 15019200, 'steps': 78224, 'loss/train': 5.700150966644287} 11/07/2021 08:15:02 - INFO - __main__ - Step 78226: {'lr': 0.00023819688312298106, 'samples': 15019392, 'steps': 78225, 'loss/train': 1.107609510421753} 11/07/2021 08:15:03 - INFO - __main__ - Step 78227: {'lr': 0.0002381915822990048, 'samples': 15019584, 'steps': 78226, 'loss/train': 1.3318525552749634} 11/07/2021 08:15:03 - INFO - __main__ - Step 78228: {'lr': 0.00023818628148034916, 'samples': 15019776, 'steps': 78227, 'loss/train': 1.5045417547225952} 11/07/2021 08:15:04 - INFO - __main__ - Step 78229: {'lr': 0.0002381809806670166, 'samples': 15019968, 'steps': 78228, 'loss/train': 1.4232417345046997} 11/07/2021 08:15:04 - INFO - __main__ - Step 78230: {'lr': 0.00023817567985900959, 'samples': 15020160, 'steps': 78229, 'loss/train': 0.9274079203605652} 11/07/2021 08:15:05 - INFO - __main__ - Step 78231: {'lr': 0.00023817037905633038, 'samples': 15020352, 'steps': 78230, 'loss/train': 1.42348051071167} 11/07/2021 08:15:05 - INFO - __main__ - Step 78232: {'lr': 0.0002381650782589814, 'samples': 15020544, 'steps': 78231, 'loss/train': 0.836295485496521} 11/07/2021 08:15:05 - INFO - __main__ - Step 78233: {'lr': 0.00023815977746696504, 'samples': 15020736, 'steps': 78232, 'loss/train': 1.7257134914398193} 11/07/2021 08:15:06 - INFO - __main__ - Step 78234: {'lr': 0.00023815447668028373, 'samples': 15020928, 'steps': 78233, 'loss/train': 1.1568012237548828} 11/07/2021 08:15:07 - INFO - __main__ - Step 78235: {'lr': 0.00023814917589893984, 'samples': 15021120, 'steps': 78234, 'loss/train': 1.28466796875} 11/07/2021 08:15:08 - INFO - __main__ - Step 78236: {'lr': 0.00023814387512293572, 'samples': 15021312, 'steps': 78235, 'loss/train': 1.1804789304733276} 11/07/2021 08:15:08 - INFO - __main__ - Step 78237: {'lr': 0.0002381385743522738, 'samples': 15021504, 'steps': 78236, 'loss/train': 1.309135913848877} 11/07/2021 08:15:08 - INFO - __main__ - Step 78238: {'lr': 0.00023813327358695644, 'samples': 15021696, 'steps': 78237, 'loss/train': 1.5479167699813843} 11/07/2021 08:15:09 - INFO - __main__ - Step 78239: {'lr': 0.00023812797282698607, 'samples': 15021888, 'steps': 78238, 'loss/train': 1.554565668106079} 11/07/2021 08:15:10 - INFO - __main__ - Step 78240: {'lr': 0.00023812267207236513, 'samples': 15022080, 'steps': 78239, 'loss/train': 1.5285117626190186} 11/07/2021 08:15:10 - INFO - __main__ - Step 78241: {'lr': 0.00023811737132309582, 'samples': 15022272, 'steps': 78240, 'loss/train': 1.78187096118927} 11/07/2021 08:15:11 - INFO - __main__ - Step 78242: {'lr': 0.00023811207057918067, 'samples': 15022464, 'steps': 78241, 'loss/train': 1.1644411087036133} 11/07/2021 08:15:11 - INFO - __main__ - Step 78243: {'lr': 0.00023810676984062202, 'samples': 15022656, 'steps': 78242, 'loss/train': 1.0053400993347168} 11/07/2021 08:15:11 - INFO - __main__ - Step 78244: {'lr': 0.0002381014691074223, 'samples': 15022848, 'steps': 78243, 'loss/train': 1.3019005060195923} 11/07/2021 08:15:12 - INFO - __main__ - Step 78245: {'lr': 0.00023809616837958383, 'samples': 15023040, 'steps': 78244, 'loss/train': 1.577720284461975} 11/07/2021 08:15:13 - INFO - __main__ - Step 78246: {'lr': 0.00023809086765710908, 'samples': 15023232, 'steps': 78245, 'loss/train': 1.1614983081817627} 11/07/2021 08:15:13 - INFO - __main__ - Step 78247: {'lr': 0.0002380855669400004, 'samples': 15023424, 'steps': 78246, 'loss/train': 2.202878952026367} 11/07/2021 08:15:13 - INFO - __main__ - Step 78248: {'lr': 0.00023808026622826014, 'samples': 15023616, 'steps': 78247, 'loss/train': 1.9314451217651367} 11/07/2021 08:15:14 - INFO - __main__ - Step 78249: {'lr': 0.00023807496552189078, 'samples': 15023808, 'steps': 78248, 'loss/train': 1.489593505859375} 11/07/2021 08:15:14 - INFO - __main__ - Step 78250: {'lr': 0.0002380696648208946, 'samples': 15024000, 'steps': 78249, 'loss/train': 1.3026467561721802} 11/07/2021 08:15:15 - INFO - __main__ - Step 78251: {'lr': 0.0002380643641252741, 'samples': 15024192, 'steps': 78250, 'loss/train': 1.5684330463409424} 11/07/2021 08:15:15 - INFO - __main__ - Step 78252: {'lr': 0.00023805906343503158, 'samples': 15024384, 'steps': 78251, 'loss/train': 1.4670028686523438} 11/07/2021 08:15:16 - INFO - __main__ - Step 78253: {'lr': 0.0002380537627501696, 'samples': 15024576, 'steps': 78252, 'loss/train': 0.5058871507644653} 11/07/2021 08:15:16 - INFO - __main__ - Step 78254: {'lr': 0.00023804846207069029, 'samples': 15024768, 'steps': 78253, 'loss/train': 1.208536982536316} 11/07/2021 08:15:17 - INFO - __main__ - Step 78255: {'lr': 0.00023804316139659616, 'samples': 15024960, 'steps': 78254, 'loss/train': 0.6622090339660645} 11/07/2021 08:15:18 - INFO - __main__ - Step 78256: {'lr': 0.00023803786072788957, 'samples': 15025152, 'steps': 78255, 'loss/train': 1.3569724559783936} 11/07/2021 08:15:18 - INFO - __main__ - Step 78257: {'lr': 0.00023803256006457298, 'samples': 15025344, 'steps': 78256, 'loss/train': 1.6250207424163818} 11/07/2021 08:15:18 - INFO - __main__ - Step 78258: {'lr': 0.00023802725940664867, 'samples': 15025536, 'steps': 78257, 'loss/train': 1.4256014823913574} 11/07/2021 08:15:19 - INFO - __main__ - Step 78259: {'lr': 0.00023802195875411914, 'samples': 15025728, 'steps': 78258, 'loss/train': 1.1056547164916992} 11/07/2021 08:15:19 - INFO - __main__ - Step 78260: {'lr': 0.00023801665810698673, 'samples': 15025920, 'steps': 78259, 'loss/train': 1.5385874509811401} 11/07/2021 08:15:20 - INFO - __main__ - Step 78261: {'lr': 0.00023801135746525382, 'samples': 15026112, 'steps': 78260, 'loss/train': 1.409900426864624} 11/07/2021 08:15:20 - INFO - __main__ - Step 78262: {'lr': 0.00023800605682892278, 'samples': 15026304, 'steps': 78261, 'loss/train': 0.41778936982154846} 11/07/2021 08:15:21 - INFO - __main__ - Step 78263: {'lr': 0.00023800075619799608, 'samples': 15026496, 'steps': 78262, 'loss/train': 1.1803562641143799} 11/07/2021 08:15:21 - INFO - __main__ - Step 78264: {'lr': 0.000237995455572476, 'samples': 15026688, 'steps': 78263, 'loss/train': 1.5084893703460693} 11/07/2021 08:15:22 - INFO - __main__ - Step 78265: {'lr': 0.00023799015495236503, 'samples': 15026880, 'steps': 78264, 'loss/train': 0.730487585067749} 11/07/2021 08:15:22 - INFO - __main__ - Step 78266: {'lr': 0.00023798485433766548, 'samples': 15027072, 'steps': 78265, 'loss/train': 1.7337735891342163} 11/07/2021 08:15:23 - INFO - __main__ - Step 78267: {'lr': 0.0002379795537283799, 'samples': 15027264, 'steps': 78266, 'loss/train': 1.6343131065368652} 11/07/2021 08:15:23 - INFO - __main__ - Step 78268: {'lr': 0.00023797425312451043, 'samples': 15027456, 'steps': 78267, 'loss/train': 1.5074056386947632} 11/07/2021 08:15:23 - INFO - __main__ - Step 78269: {'lr': 0.00023796895252605957, 'samples': 15027648, 'steps': 78268, 'loss/train': 0.8857414722442627} 11/07/2021 08:15:24 - INFO - __main__ - Step 78270: {'lr': 0.00023796365193302972, 'samples': 15027840, 'steps': 78269, 'loss/train': 0.7365593910217285} 11/07/2021 08:15:25 - INFO - __main__ - Step 78271: {'lr': 0.00023795835134542327, 'samples': 15028032, 'steps': 78270, 'loss/train': 1.0171983242034912} 11/07/2021 08:15:25 - INFO - __main__ - Step 78272: {'lr': 0.00023795305076324257, 'samples': 15028224, 'steps': 78271, 'loss/train': 1.5652141571044922} 11/07/2021 08:15:26 - INFO - __main__ - Step 78273: {'lr': 0.00023794775018649007, 'samples': 15028416, 'steps': 78272, 'loss/train': 1.9230235815048218} 11/07/2021 08:15:26 - INFO - __main__ - Step 78274: {'lr': 0.00023794244961516811, 'samples': 15028608, 'steps': 78273, 'loss/train': 1.999682068824768} 11/07/2021 08:15:26 - INFO - __main__ - Step 78275: {'lr': 0.0002379371490492791, 'samples': 15028800, 'steps': 78274, 'loss/train': 1.3189377784729004} 11/07/2021 08:15:27 - INFO - __main__ - Step 78276: {'lr': 0.00023793184848882543, 'samples': 15028992, 'steps': 78275, 'loss/train': 1.8647279739379883} 11/07/2021 08:15:28 - INFO - __main__ - Step 78277: {'lr': 0.0002379265479338095, 'samples': 15029184, 'steps': 78276, 'loss/train': 0.7970114946365356} 11/07/2021 08:15:28 - INFO - __main__ - Step 78278: {'lr': 0.00023792124738423366, 'samples': 15029376, 'steps': 78277, 'loss/train': 1.4703465700149536} 11/07/2021 08:15:28 - INFO - __main__ - Step 78279: {'lr': 0.0002379159468401003, 'samples': 15029568, 'steps': 78278, 'loss/train': 1.3478336334228516} 11/07/2021 08:15:29 - INFO - __main__ - Step 78280: {'lr': 0.000237910646301412, 'samples': 15029760, 'steps': 78279, 'loss/train': 1.2487001419067383} 11/07/2021 08:15:29 - INFO - __main__ - Step 78281: {'lr': 0.0002379053457681708, 'samples': 15029952, 'steps': 78280, 'loss/train': 1.0884490013122559} 11/07/2021 08:15:30 - INFO - __main__ - Step 78282: {'lr': 0.00023790004524037927, 'samples': 15030144, 'steps': 78281, 'loss/train': 1.1965007781982422} 11/07/2021 08:15:30 - INFO - __main__ - Step 78283: {'lr': 0.00023789474471803984, 'samples': 15030336, 'steps': 78282, 'loss/train': 1.5971015691757202} 11/07/2021 08:15:31 - INFO - __main__ - Step 78284: {'lr': 0.00023788944420115483, 'samples': 15030528, 'steps': 78283, 'loss/train': 2.0717618465423584} 11/07/2021 08:15:31 - INFO - __main__ - Step 78285: {'lr': 0.00023788414368972662, 'samples': 15030720, 'steps': 78284, 'loss/train': 0.9848440885543823} 11/07/2021 08:15:31 - INFO - __main__ - Step 78286: {'lr': 0.00023787884318375767, 'samples': 15030912, 'steps': 78285, 'loss/train': 0.9350055456161499} 11/07/2021 08:15:33 - INFO - __main__ - Step 78287: {'lr': 0.00023787354268325032, 'samples': 15031104, 'steps': 78286, 'loss/train': 1.0312621593475342} 11/07/2021 08:15:33 - INFO - __main__ - Step 78288: {'lr': 0.00023786824218820693, 'samples': 15031296, 'steps': 78287, 'loss/train': 1.33237624168396} 11/07/2021 08:15:33 - INFO - __main__ - Step 78289: {'lr': 0.00023786294169862997, 'samples': 15031488, 'steps': 78288, 'loss/train': 1.5401535034179688} 11/07/2021 08:15:34 - INFO - __main__ - Step 78290: {'lr': 0.00023785764121452176, 'samples': 15031680, 'steps': 78289, 'loss/train': 0.6905503869056702} 11/07/2021 08:15:34 - INFO - __main__ - Step 78291: {'lr': 0.0002378523407358847, 'samples': 15031872, 'steps': 78290, 'loss/train': 0.8779095411300659} 11/07/2021 08:15:35 - INFO - __main__ - Step 78292: {'lr': 0.0002378470402627212, 'samples': 15032064, 'steps': 78291, 'loss/train': 1.273160696029663} 11/07/2021 08:15:35 - INFO - __main__ - Step 78293: {'lr': 0.0002378417397950336, 'samples': 15032256, 'steps': 78292, 'loss/train': 1.207613468170166} 11/07/2021 08:15:36 - INFO - __main__ - Step 78294: {'lr': 0.00023783643933282446, 'samples': 15032448, 'steps': 78293, 'loss/train': 1.3619685173034668} 11/07/2021 08:15:36 - INFO - __main__ - Step 78295: {'lr': 0.00023783113887609596, 'samples': 15032640, 'steps': 78294, 'loss/train': 1.4954946041107178} 11/07/2021 08:15:36 - INFO - __main__ - Step 78296: {'lr': 0.00023782583842485054, 'samples': 15032832, 'steps': 78295, 'loss/train': 1.2761632204055786} 11/07/2021 08:15:37 - INFO - __main__ - Step 78297: {'lr': 0.00023782053797909058, 'samples': 15033024, 'steps': 78296, 'loss/train': 1.4791802167892456} 11/07/2021 08:15:38 - INFO - __main__ - Step 78298: {'lr': 0.0002378152375388185, 'samples': 15033216, 'steps': 78297, 'loss/train': 2.0011775493621826} 11/07/2021 08:15:38 - INFO - __main__ - Step 78299: {'lr': 0.00023780993710403672, 'samples': 15033408, 'steps': 78298, 'loss/train': 1.2265820503234863} 11/07/2021 08:15:38 - INFO - __main__ - Step 78300: {'lr': 0.00023780463667474758, 'samples': 15033600, 'steps': 78299, 'loss/train': 1.4169127941131592} 11/07/2021 08:15:39 - INFO - __main__ - Step 78301: {'lr': 0.00023779933625095348, 'samples': 15033792, 'steps': 78300, 'loss/train': 1.628448486328125} 11/07/2021 08:15:40 - INFO - __main__ - Step 78302: {'lr': 0.0002377940358326568, 'samples': 15033984, 'steps': 78301, 'loss/train': 1.1474318504333496} 11/07/2021 08:15:40 - INFO - __main__ - Step 78303: {'lr': 0.00023778873541985995, 'samples': 15034176, 'steps': 78302, 'loss/train': 0.9283702969551086} 11/07/2021 08:15:41 - INFO - __main__ - Step 78304: {'lr': 0.00023778343501256531, 'samples': 15034368, 'steps': 78303, 'loss/train': 1.902745008468628} 11/07/2021 08:15:41 - INFO - __main__ - Step 78305: {'lr': 0.00023777813461077526, 'samples': 15034560, 'steps': 78304, 'loss/train': 1.2706679105758667} 11/07/2021 08:15:41 - INFO - __main__ - Step 78306: {'lr': 0.0002377728342144922, 'samples': 15034752, 'steps': 78305, 'loss/train': 0.9756082892417908} 11/07/2021 08:15:42 - INFO - __main__ - Step 78307: {'lr': 0.0002377675338237186, 'samples': 15034944, 'steps': 78306, 'loss/train': 1.020423412322998} 11/07/2021 08:15:43 - INFO - __main__ - Step 78308: {'lr': 0.0002377622334384567, 'samples': 15035136, 'steps': 78307, 'loss/train': 1.5718268156051636} 11/07/2021 08:15:43 - INFO - __main__ - Step 78309: {'lr': 0.0002377569330587089, 'samples': 15035328, 'steps': 78308, 'loss/train': 1.4150774478912354} 11/07/2021 08:15:43 - INFO - __main__ - Step 78310: {'lr': 0.00023775163268447766, 'samples': 15035520, 'steps': 78309, 'loss/train': 1.054679274559021} 11/07/2021 08:15:44 - INFO - __main__ - Step 78311: {'lr': 0.00023774633231576534, 'samples': 15035712, 'steps': 78310, 'loss/train': 1.4969860315322876} 11/07/2021 08:15:44 - INFO - __main__ - Step 78312: {'lr': 0.00023774103195257432, 'samples': 15035904, 'steps': 78311, 'loss/train': 1.35812509059906} 11/07/2021 08:15:45 - INFO - __main__ - Step 78313: {'lr': 0.000237735731594907, 'samples': 15036096, 'steps': 78312, 'loss/train': 0.9612992405891418} 11/07/2021 08:15:46 - INFO - __main__ - Step 78314: {'lr': 0.0002377304312427658, 'samples': 15036288, 'steps': 78313, 'loss/train': 0.8438857793807983} 11/07/2021 08:15:46 - INFO - __main__ - Step 78315: {'lr': 0.0002377251308961531, 'samples': 15036480, 'steps': 78314, 'loss/train': 0.8091064691543579} 11/07/2021 08:15:47 - INFO - __main__ - Step 78316: {'lr': 0.0002377198305550712, 'samples': 15036672, 'steps': 78315, 'loss/train': 1.1944525241851807} 11/07/2021 08:15:47 - INFO - __main__ - Step 78317: {'lr': 0.0002377145302195226, 'samples': 15036864, 'steps': 78316, 'loss/train': 0.15594756603240967} 11/07/2021 08:15:48 - INFO - __main__ - Step 78318: {'lr': 0.0002377092298895096, 'samples': 15037056, 'steps': 78317, 'loss/train': 1.889938473701477} 11/07/2021 08:15:48 - INFO - __main__ - Step 78319: {'lr': 0.00023770392956503467, 'samples': 15037248, 'steps': 78318, 'loss/train': 1.0858454704284668} 11/07/2021 08:15:49 - INFO - __main__ - Step 78320: {'lr': 0.00023769862924610019, 'samples': 15037440, 'steps': 78319, 'loss/train': 1.6795244216918945} 11/07/2021 08:15:49 - INFO - __main__ - Step 78321: {'lr': 0.00023769332893270855, 'samples': 15037632, 'steps': 78320, 'loss/train': 1.2025288343429565} 11/07/2021 08:15:49 - INFO - __main__ - Step 78322: {'lr': 0.00023768802862486203, 'samples': 15037824, 'steps': 78321, 'loss/train': 1.2136869430541992} 11/07/2021 08:15:50 - INFO - __main__ - Step 78323: {'lr': 0.0002376827283225631, 'samples': 15038016, 'steps': 78322, 'loss/train': 1.7921142578125} 11/07/2021 08:15:51 - INFO - __main__ - Step 78324: {'lr': 0.00023767742802581414, 'samples': 15038208, 'steps': 78323, 'loss/train': 1.2602007389068604} 11/07/2021 08:15:51 - INFO - __main__ - Step 78325: {'lr': 0.00023767212773461756, 'samples': 15038400, 'steps': 78324, 'loss/train': 1.096219539642334} 11/07/2021 08:15:52 - INFO - __main__ - Step 78326: {'lr': 0.0002376668274489757, 'samples': 15038592, 'steps': 78325, 'loss/train': 1.498761534690857} 11/07/2021 08:15:52 - INFO - __main__ - Step 78327: {'lr': 0.00023766152716889097, 'samples': 15038784, 'steps': 78326, 'loss/train': 1.3997470140457153} 11/07/2021 08:15:53 - INFO - __main__ - Step 78328: {'lr': 0.00023765622689436578, 'samples': 15038976, 'steps': 78327, 'loss/train': 1.2895276546478271} 11/07/2021 08:15:53 - INFO - __main__ - Step 78329: {'lr': 0.00023765092662540252, 'samples': 15039168, 'steps': 78328, 'loss/train': 3.1209633350372314} 11/07/2021 08:15:54 - INFO - __main__ - Step 78330: {'lr': 0.00023764562636200353, 'samples': 15039360, 'steps': 78329, 'loss/train': 1.2769293785095215} 11/07/2021 08:15:54 - INFO - __main__ - Step 78331: {'lr': 0.0002376403261041713, 'samples': 15039552, 'steps': 78330, 'loss/train': 0.6477429866790771} 11/07/2021 08:15:54 - INFO - __main__ - Step 78332: {'lr': 0.0002376350258519081, 'samples': 15039744, 'steps': 78331, 'loss/train': 1.4230844974517822} 11/07/2021 08:15:55 - INFO - __main__ - Step 78333: {'lr': 0.00023762972560521637, 'samples': 15039936, 'steps': 78332, 'loss/train': 0.760384738445282} 11/07/2021 08:15:56 - INFO - __main__ - Step 78334: {'lr': 0.00023762442536409855, 'samples': 15040128, 'steps': 78333, 'loss/train': 1.5360395908355713} 11/07/2021 08:15:56 - INFO - __main__ - Step 78335: {'lr': 0.0002376191251285569, 'samples': 15040320, 'steps': 78334, 'loss/train': 0.8055431246757507} 11/07/2021 08:15:56 - INFO - __main__ - Step 78336: {'lr': 0.0002376138248985939, 'samples': 15040512, 'steps': 78335, 'loss/train': 1.467577576637268} 11/07/2021 08:15:57 - INFO - __main__ - Step 78337: {'lr': 0.0002376085246742119, 'samples': 15040704, 'steps': 78336, 'loss/train': 0.7251951694488525} 11/07/2021 08:15:58 - INFO - __main__ - Step 78338: {'lr': 0.00023760322445541332, 'samples': 15040896, 'steps': 78337, 'loss/train': 1.070620059967041} 11/07/2021 08:15:58 - INFO - __main__ - Step 78339: {'lr': 0.00023759792424220052, 'samples': 15041088, 'steps': 78338, 'loss/train': 1.5222327709197998} 11/07/2021 08:15:58 - INFO - __main__ - Step 78340: {'lr': 0.00023759262403457592, 'samples': 15041280, 'steps': 78339, 'loss/train': 0.8716329336166382} 11/07/2021 08:15:59 - INFO - __main__ - Step 78341: {'lr': 0.0002375873238325419, 'samples': 15041472, 'steps': 78340, 'loss/train': 1.032734990119934} 11/07/2021 08:15:59 - INFO - __main__ - Step 78342: {'lr': 0.0002375820236361009, 'samples': 15041664, 'steps': 78341, 'loss/train': 0.8845007419586182} 11/07/2021 08:15:59 - INFO - __main__ - Step 78343: {'lr': 0.00023757672344525518, 'samples': 15041856, 'steps': 78342, 'loss/train': 1.7778666019439697} 11/07/2021 08:16:00 - INFO - __main__ - Step 78344: {'lr': 0.00023757142326000718, 'samples': 15042048, 'steps': 78343, 'loss/train': 0.8257474899291992} 11/07/2021 08:16:01 - INFO - __main__ - Step 78345: {'lr': 0.00023756612308035934, 'samples': 15042240, 'steps': 78344, 'loss/train': 0.990485429763794} 11/07/2021 08:16:01 - INFO - __main__ - Step 78346: {'lr': 0.00023756082290631397, 'samples': 15042432, 'steps': 78345, 'loss/train': 0.8448871970176697} 11/07/2021 08:16:01 - INFO - __main__ - Step 78347: {'lr': 0.00023755552273787355, 'samples': 15042624, 'steps': 78346, 'loss/train': 1.7800319194793701} 11/07/2021 08:16:02 - INFO - __main__ - Step 78348: {'lr': 0.00023755022257504043, 'samples': 15042816, 'steps': 78347, 'loss/train': 1.1419426202774048} 11/07/2021 08:16:03 - INFO - __main__ - Step 78349: {'lr': 0.00023754492241781698, 'samples': 15043008, 'steps': 78348, 'loss/train': 1.368589997291565} 11/07/2021 08:16:03 - INFO - __main__ - Step 78350: {'lr': 0.00023753962226620557, 'samples': 15043200, 'steps': 78349, 'loss/train': 1.5720188617706299} 11/07/2021 08:16:04 - INFO - __main__ - Step 78351: {'lr': 0.0002375343221202086, 'samples': 15043392, 'steps': 78350, 'loss/train': 1.6728876829147339} 11/07/2021 08:16:04 - INFO - __main__ - Step 78352: {'lr': 0.0002375290219798285, 'samples': 15043584, 'steps': 78351, 'loss/train': 1.6176739931106567} 11/07/2021 08:16:04 - INFO - __main__ - Step 78353: {'lr': 0.00023752372184506764, 'samples': 15043776, 'steps': 78352, 'loss/train': 0.8000569343566895} 11/07/2021 08:16:05 - INFO - __main__ - Step 78354: {'lr': 0.00023751842171592838, 'samples': 15043968, 'steps': 78353, 'loss/train': 1.453902006149292} 11/07/2021 08:16:06 - INFO - __main__ - Step 78355: {'lr': 0.00023751312159241313, 'samples': 15044160, 'steps': 78354, 'loss/train': 1.6708987951278687} 11/07/2021 08:16:06 - INFO - __main__ - Step 78356: {'lr': 0.00023750782147452426, 'samples': 15044352, 'steps': 78355, 'loss/train': 1.307442545890808} 11/07/2021 08:16:06 - INFO - __main__ - Step 78357: {'lr': 0.00023750252136226416, 'samples': 15044544, 'steps': 78356, 'loss/train': 0.9143853783607483} 11/07/2021 08:16:07 - INFO - __main__ - Step 78358: {'lr': 0.00023749722125563524, 'samples': 15044736, 'steps': 78357, 'loss/train': 1.751927137374878} 11/07/2021 08:16:08 - INFO - __main__ - Step 78359: {'lr': 0.00023749192115463992, 'samples': 15044928, 'steps': 78358, 'loss/train': 1.5156149864196777} 11/07/2021 08:16:08 - INFO - __main__ - Step 78360: {'lr': 0.00023748662105928052, 'samples': 15045120, 'steps': 78359, 'loss/train': 1.5293611288070679} 11/07/2021 08:16:08 - INFO - __main__ - Step 78361: {'lr': 0.0002374813209695595, 'samples': 15045312, 'steps': 78360, 'loss/train': 1.526658535003662} 11/07/2021 08:16:09 - INFO - __main__ - Step 78362: {'lr': 0.00023747602088547914, 'samples': 15045504, 'steps': 78361, 'loss/train': 0.9323634505271912} 11/07/2021 08:16:09 - INFO - __main__ - Step 78363: {'lr': 0.00023747072080704192, 'samples': 15045696, 'steps': 78362, 'loss/train': 1.071200966835022} 11/07/2021 08:16:10 - INFO - __main__ - Step 78364: {'lr': 0.00023746542073425022, 'samples': 15045888, 'steps': 78363, 'loss/train': 1.7084037065505981} 11/07/2021 08:16:11 - INFO - __main__ - Step 78365: {'lr': 0.00023746012066710637, 'samples': 15046080, 'steps': 78364, 'loss/train': 1.2703919410705566} 11/07/2021 08:16:11 - INFO - __main__ - Step 78366: {'lr': 0.0002374548206056128, 'samples': 15046272, 'steps': 78365, 'loss/train': 1.3456052541732788} 11/07/2021 08:16:11 - INFO - __main__ - Step 78367: {'lr': 0.0002374495205497719, 'samples': 15046464, 'steps': 78366, 'loss/train': 1.5909122228622437} 11/07/2021 08:16:12 - INFO - __main__ - Step 78368: {'lr': 0.00023744422049958605, 'samples': 15046656, 'steps': 78367, 'loss/train': 1.2278974056243896} 11/07/2021 08:16:13 - INFO - __main__ - Step 78369: {'lr': 0.00023743892045505763, 'samples': 15046848, 'steps': 78368, 'loss/train': 1.400081992149353} 11/07/2021 08:16:13 - INFO - __main__ - Step 78370: {'lr': 0.00023743362041618905, 'samples': 15047040, 'steps': 78369, 'loss/train': 1.8510643243789673} 11/07/2021 08:16:13 - INFO - __main__ - Step 78371: {'lr': 0.00023742832038298268, 'samples': 15047232, 'steps': 78370, 'loss/train': 1.3184434175491333} 11/07/2021 08:16:14 - INFO - __main__ - Step 78372: {'lr': 0.00023742302035544092, 'samples': 15047424, 'steps': 78371, 'loss/train': 1.2709327936172485} 11/07/2021 08:16:14 - INFO - __main__ - Step 78373: {'lr': 0.00023741772033356615, 'samples': 15047616, 'steps': 78372, 'loss/train': 1.073760986328125} 11/07/2021 08:16:15 - INFO - __main__ - Step 78374: {'lr': 0.00023741242031736077, 'samples': 15047808, 'steps': 78373, 'loss/train': 1.204638123512268} 11/07/2021 08:16:15 - INFO - __main__ - Step 78375: {'lr': 0.00023740712030682727, 'samples': 15048000, 'steps': 78374, 'loss/train': 1.483496904373169} 11/07/2021 08:16:16 - INFO - __main__ - Step 78376: {'lr': 0.00023740182030196778, 'samples': 15048192, 'steps': 78375, 'loss/train': 0.7277933955192566} 11/07/2021 08:16:16 - INFO - __main__ - Step 78377: {'lr': 0.00023739652030278487, 'samples': 15048384, 'steps': 78376, 'loss/train': 1.1585100889205933} 11/07/2021 08:16:17 - INFO - __main__ - Step 78378: {'lr': 0.0002373912203092809, 'samples': 15048576, 'steps': 78377, 'loss/train': 5.706219673156738} 11/07/2021 08:16:17 - INFO - __main__ - Step 78379: {'lr': 0.00023738592032145823, 'samples': 15048768, 'steps': 78378, 'loss/train': 2.2612268924713135} 11/07/2021 08:16:18 - INFO - __main__ - Step 78380: {'lr': 0.00023738062033931925, 'samples': 15048960, 'steps': 78379, 'loss/train': 1.2473574876785278} 11/07/2021 08:16:19 - INFO - __main__ - Step 78381: {'lr': 0.0002373753203628664, 'samples': 15049152, 'steps': 78380, 'loss/train': 0.9070594310760498} 11/07/2021 08:16:19 - INFO - __main__ - Step 78382: {'lr': 0.00023737002039210203, 'samples': 15049344, 'steps': 78381, 'loss/train': 1.338396668434143} 11/07/2021 08:16:19 - INFO - __main__ - Step 78383: {'lr': 0.00023736472042702855, 'samples': 15049536, 'steps': 78382, 'loss/train': 1.4891611337661743} 11/07/2021 08:16:20 - INFO - __main__ - Step 78384: {'lr': 0.0002373594204676483, 'samples': 15049728, 'steps': 78383, 'loss/train': 0.07396310567855835} 11/07/2021 08:16:21 - INFO - __main__ - Step 78385: {'lr': 0.00023735412051396375, 'samples': 15049920, 'steps': 78384, 'loss/train': 1.8089544773101807} 11/07/2021 08:16:21 - INFO - __main__ - Step 78386: {'lr': 0.00023734882056597716, 'samples': 15050112, 'steps': 78385, 'loss/train': 1.3239314556121826} 11/07/2021 08:16:21 - INFO - __main__ - Step 78387: {'lr': 0.00023734352062369107, 'samples': 15050304, 'steps': 78386, 'loss/train': 1.2542396783828735} 11/07/2021 08:16:22 - INFO - __main__ - Step 78388: {'lr': 0.00023733822068710785, 'samples': 15050496, 'steps': 78387, 'loss/train': 1.2120615243911743} 11/07/2021 08:16:22 - INFO - __main__ - Step 78389: {'lr': 0.0002373329207562298, 'samples': 15050688, 'steps': 78388, 'loss/train': 1.5192275047302246} 11/07/2021 08:16:22 - INFO - __main__ - Step 78390: {'lr': 0.00023732762083105926, 'samples': 15050880, 'steps': 78389, 'loss/train': 0.7917437553405762} 11/07/2021 08:16:24 - INFO - __main__ - Step 78391: {'lr': 0.00023732232091159872, 'samples': 15051072, 'steps': 78390, 'loss/train': 1.6538676023483276} 11/07/2021 08:16:24 - INFO - __main__ - Step 78392: {'lr': 0.00023731702099785058, 'samples': 15051264, 'steps': 78391, 'loss/train': 2.1931283473968506} 11/07/2021 08:16:24 - INFO - __main__ - Step 78393: {'lr': 0.00023731172108981712, 'samples': 15051456, 'steps': 78392, 'loss/train': 1.6626347303390503} 11/07/2021 08:16:25 - INFO - __main__ - Step 78394: {'lr': 0.00023730642118750087, 'samples': 15051648, 'steps': 78393, 'loss/train': 1.0388731956481934} 11/07/2021 08:16:25 - INFO - __main__ - Step 78395: {'lr': 0.00023730112129090414, 'samples': 15051840, 'steps': 78394, 'loss/train': 1.4360074996948242} 11/07/2021 08:16:26 - INFO - __main__ - Step 78396: {'lr': 0.00023729582140002932, 'samples': 15052032, 'steps': 78395, 'loss/train': 1.5349535942077637} 11/07/2021 08:16:26 - INFO - __main__ - Step 78397: {'lr': 0.0002372905215148788, 'samples': 15052224, 'steps': 78396, 'loss/train': 1.6133387088775635} 11/07/2021 08:16:27 - INFO - __main__ - Step 78398: {'lr': 0.000237285221635455, 'samples': 15052416, 'steps': 78397, 'loss/train': 1.2419694662094116} 11/07/2021 08:16:27 - INFO - __main__ - Step 78399: {'lr': 0.00023727992176176025, 'samples': 15052608, 'steps': 78398, 'loss/train': 1.3119349479675293} 11/07/2021 08:16:28 - INFO - __main__ - Step 78400: {'lr': 0.00023727462189379698, 'samples': 15052800, 'steps': 78399, 'loss/train': 1.6276768445968628} 11/07/2021 08:16:28 - INFO - __main__ - Step 78401: {'lr': 0.0002372693220315677, 'samples': 15052992, 'steps': 78400, 'loss/train': 1.1451342105865479} 11/07/2021 08:16:29 - INFO - __main__ - Step 78402: {'lr': 0.00023726402217507454, 'samples': 15053184, 'steps': 78401, 'loss/train': 1.3167154788970947} 11/07/2021 08:16:29 - INFO - __main__ - Step 78403: {'lr': 0.00023725872232432002, 'samples': 15053376, 'steps': 78402, 'loss/train': 1.3877038955688477} 11/07/2021 08:16:30 - INFO - __main__ - Step 78404: {'lr': 0.00023725342247930652, 'samples': 15053568, 'steps': 78403, 'loss/train': 1.0738027095794678} 11/07/2021 08:16:30 - INFO - __main__ - Step 78405: {'lr': 0.00023724812264003643, 'samples': 15053760, 'steps': 78404, 'loss/train': 1.4574552774429321} 11/07/2021 08:16:31 - INFO - __main__ - Step 78406: {'lr': 0.00023724282280651214, 'samples': 15053952, 'steps': 78405, 'loss/train': 1.4619626998901367} 11/07/2021 08:16:31 - INFO - __main__ - Step 78407: {'lr': 0.00023723752297873603, 'samples': 15054144, 'steps': 78406, 'loss/train': 1.282263159751892} 11/07/2021 08:16:32 - INFO - __main__ - Step 78408: {'lr': 0.0002372322231567105, 'samples': 15054336, 'steps': 78407, 'loss/train': 1.4185372591018677} 11/07/2021 08:16:32 - INFO - __main__ - Step 78409: {'lr': 0.00023722692334043793, 'samples': 15054528, 'steps': 78408, 'loss/train': 1.8125057220458984} 11/07/2021 08:16:32 - INFO - __main__ - Step 78410: {'lr': 0.00023722162352992073, 'samples': 15054720, 'steps': 78409, 'loss/train': 1.4555070400238037} 11/07/2021 08:16:33 - INFO - __main__ - Step 78411: {'lr': 0.00023721632372516126, 'samples': 15054912, 'steps': 78410, 'loss/train': 1.7111295461654663} 11/07/2021 08:16:34 - INFO - __main__ - Step 78412: {'lr': 0.0002372110239261619, 'samples': 15055104, 'steps': 78411, 'loss/train': 2.059758186340332} 11/07/2021 08:16:34 - INFO - __main__ - Step 78413: {'lr': 0.00023720572413292508, 'samples': 15055296, 'steps': 78412, 'loss/train': 1.560768961906433} 11/07/2021 08:16:34 - INFO - __main__ - Step 78414: {'lr': 0.00023720042434545314, 'samples': 15055488, 'steps': 78413, 'loss/train': 1.1806663274765015} 11/07/2021 08:16:35 - INFO - __main__ - Step 78415: {'lr': 0.0002371951245637486, 'samples': 15055680, 'steps': 78414, 'loss/train': 1.3600941896438599} 11/07/2021 08:16:35 - INFO - __main__ - Step 78416: {'lr': 0.00023718982478781369, 'samples': 15055872, 'steps': 78415, 'loss/train': 1.4243086576461792} 11/07/2021 08:16:36 - INFO - __main__ - Step 78417: {'lr': 0.00023718452501765078, 'samples': 15056064, 'steps': 78416, 'loss/train': 1.3038227558135986} 11/07/2021 08:16:37 - INFO - __main__ - Step 78418: {'lr': 0.00023717922525326235, 'samples': 15056256, 'steps': 78417, 'loss/train': 0.747045636177063} 11/07/2021 08:16:37 - INFO - __main__ - Step 78419: {'lr': 0.00023717392549465075, 'samples': 15056448, 'steps': 78418, 'loss/train': 1.4522700309753418} 11/07/2021 08:16:37 - INFO - __main__ - Step 78420: {'lr': 0.0002371686257418184, 'samples': 15056640, 'steps': 78419, 'loss/train': 1.396594524383545} 11/07/2021 08:16:38 - INFO - __main__ - Step 78421: {'lr': 0.00023716332599476764, 'samples': 15056832, 'steps': 78420, 'loss/train': 2.3152377605438232} 11/07/2021 08:16:39 - INFO - __main__ - Step 78422: {'lr': 0.0002371580262535009, 'samples': 15057024, 'steps': 78421, 'loss/train': 1.4942203760147095} 11/07/2021 08:16:39 - INFO - __main__ - Step 78423: {'lr': 0.00023715272651802057, 'samples': 15057216, 'steps': 78422, 'loss/train': 1.5445606708526611} 11/07/2021 08:16:40 - INFO - __main__ - Step 78424: {'lr': 0.00023714742678832901, 'samples': 15057408, 'steps': 78423, 'loss/train': 1.6515966653823853} 11/07/2021 08:16:40 - INFO - __main__ - Step 78425: {'lr': 0.00023714212706442864, 'samples': 15057600, 'steps': 78424, 'loss/train': 1.5830756425857544} 11/07/2021 08:16:40 - INFO - __main__ - Step 78426: {'lr': 0.0002371368273463218, 'samples': 15057792, 'steps': 78425, 'loss/train': 1.7015711069107056} 11/07/2021 08:16:41 - INFO - __main__ - Step 78427: {'lr': 0.00023713152763401094, 'samples': 15057984, 'steps': 78426, 'loss/train': 1.191994309425354} 11/07/2021 08:16:42 - INFO - __main__ - Step 78428: {'lr': 0.00023712622792749848, 'samples': 15058176, 'steps': 78427, 'loss/train': 1.5549618005752563} 11/07/2021 08:16:42 - INFO - __main__ - Step 78429: {'lr': 0.00023712092822678667, 'samples': 15058368, 'steps': 78428, 'loss/train': 0.35685867071151733} 11/07/2021 08:16:42 - INFO - __main__ - Step 78430: {'lr': 0.00023711562853187797, 'samples': 15058560, 'steps': 78429, 'loss/train': 1.2878955602645874} 11/07/2021 08:16:43 - INFO - __main__ - Step 78431: {'lr': 0.00023711032884277475, 'samples': 15058752, 'steps': 78430, 'loss/train': 1.5788501501083374} 11/07/2021 08:16:43 - INFO - __main__ - Step 78432: {'lr': 0.00023710502915947942, 'samples': 15058944, 'steps': 78431, 'loss/train': 1.635392189025879} 11/07/2021 08:16:44 - INFO - __main__ - Step 78433: {'lr': 0.0002370997294819944, 'samples': 15059136, 'steps': 78432, 'loss/train': 1.6776543855667114} 11/07/2021 08:16:45 - INFO - __main__ - Step 78434: {'lr': 0.000237094429810322, 'samples': 15059328, 'steps': 78433, 'loss/train': 1.5520778894424438} 11/07/2021 08:16:45 - INFO - __main__ - Step 78435: {'lr': 0.00023708913014446468, 'samples': 15059520, 'steps': 78434, 'loss/train': 1.6050688028335571} 11/07/2021 08:16:45 - INFO - __main__ - Step 78436: {'lr': 0.00023708383048442477, 'samples': 15059712, 'steps': 78435, 'loss/train': 1.4467064142227173} 11/07/2021 08:16:46 - INFO - __main__ - Step 78437: {'lr': 0.00023707853083020469, 'samples': 15059904, 'steps': 78436, 'loss/train': 1.2797132730484009} 11/07/2021 08:16:47 - INFO - __main__ - Step 78438: {'lr': 0.00023707323118180685, 'samples': 15060096, 'steps': 78437, 'loss/train': 1.49309504032135} 11/07/2021 08:16:47 - INFO - __main__ - Step 78439: {'lr': 0.00023706793153923362, 'samples': 15060288, 'steps': 78438, 'loss/train': 1.8735697269439697} 11/07/2021 08:16:47 - INFO - __main__ - Step 78440: {'lr': 0.00023706263190248733, 'samples': 15060480, 'steps': 78439, 'loss/train': 1.3583664894104004} 11/07/2021 08:16:48 - INFO - __main__ - Step 78441: {'lr': 0.00023705733227157044, 'samples': 15060672, 'steps': 78440, 'loss/train': 0.611353874206543} 11/07/2021 08:16:48 - INFO - __main__ - Step 78442: {'lr': 0.00023705203264648544, 'samples': 15060864, 'steps': 78441, 'loss/train': 1.5827494859695435} 11/07/2021 08:16:49 - INFO - __main__ - Step 78443: {'lr': 0.00023704673302723449, 'samples': 15061056, 'steps': 78442, 'loss/train': 1.832383155822754} 11/07/2021 08:16:49 - INFO - __main__ - Step 78444: {'lr': 0.00023704143341382006, 'samples': 15061248, 'steps': 78443, 'loss/train': 1.1651207208633423} 11/07/2021 08:16:50 - INFO - __main__ - Step 78445: {'lr': 0.00023703613380624458, 'samples': 15061440, 'steps': 78444, 'loss/train': 0.8255406618118286} 11/07/2021 08:16:50 - INFO - __main__ - Step 78446: {'lr': 0.0002370308342045104, 'samples': 15061632, 'steps': 78445, 'loss/train': 1.202544093132019} 11/07/2021 08:16:50 - INFO - __main__ - Step 78447: {'lr': 0.00023702553460861993, 'samples': 15061824, 'steps': 78446, 'loss/train': 1.3099054098129272} 11/07/2021 08:16:51 - INFO - __main__ - Step 78448: {'lr': 0.00023702023501857557, 'samples': 15062016, 'steps': 78447, 'loss/train': 1.1549376249313354} 11/07/2021 08:16:52 - INFO - __main__ - Step 78449: {'lr': 0.0002370149354343797, 'samples': 15062208, 'steps': 78448, 'loss/train': 1.5071245431900024} 11/07/2021 08:16:52 - INFO - __main__ - Step 78450: {'lr': 0.00023700963585603465, 'samples': 15062400, 'steps': 78449, 'loss/train': 1.5536998510360718} 11/07/2021 08:16:53 - INFO - __main__ - Step 78451: {'lr': 0.0002370043362835429, 'samples': 15062592, 'steps': 78450, 'loss/train': 1.2229397296905518} 11/07/2021 08:16:53 - INFO - __main__ - Step 78452: {'lr': 0.00023699903671690678, 'samples': 15062784, 'steps': 78451, 'loss/train': 1.602128267288208} 11/07/2021 08:16:54 - INFO - __main__ - Step 78453: {'lr': 0.0002369937371561287, 'samples': 15062976, 'steps': 78452, 'loss/train': 1.5418908596038818} 11/07/2021 08:16:54 - INFO - __main__ - Step 78454: {'lr': 0.00023698843760121103, 'samples': 15063168, 'steps': 78453, 'loss/train': 1.5749990940093994} 11/07/2021 08:16:55 - INFO - __main__ - Step 78455: {'lr': 0.00023698313805215629, 'samples': 15063360, 'steps': 78454, 'loss/train': 1.5102553367614746} 11/07/2021 08:16:55 - INFO - __main__ - Step 78456: {'lr': 0.00023697783850896664, 'samples': 15063552, 'steps': 78455, 'loss/train': 1.865027904510498} 11/07/2021 08:16:56 - INFO - __main__ - Step 78457: {'lr': 0.00023697253897164456, 'samples': 15063744, 'steps': 78456, 'loss/train': 1.7991961240768433} 11/07/2021 08:16:56 - INFO - __main__ - Step 78458: {'lr': 0.00023696723944019246, 'samples': 15063936, 'steps': 78457, 'loss/train': 1.3071097135543823} 11/07/2021 08:16:58 - INFO - __main__ - Step 78459: {'lr': 0.00023696193991461274, 'samples': 15064128, 'steps': 78458, 'loss/train': 1.0334477424621582} 11/07/2021 08:16:58 - INFO - __main__ - Step 78460: {'lr': 0.00023695664039490776, 'samples': 15064320, 'steps': 78459, 'loss/train': 1.3320327997207642} 11/07/2021 08:16:59 - INFO - __main__ - Step 78461: {'lr': 0.0002369513408810799, 'samples': 15064512, 'steps': 78460, 'loss/train': 1.5335798263549805} 11/07/2021 08:16:59 - INFO - __main__ - Step 78462: {'lr': 0.00023694604137313154, 'samples': 15064704, 'steps': 78461, 'loss/train': 1.497559905052185} 11/07/2021 08:16:59 - INFO - __main__ - Step 78463: {'lr': 0.00023694074187106514, 'samples': 15064896, 'steps': 78462, 'loss/train': 1.3746939897537231} 11/07/2021 08:17:00 - INFO - __main__ - Step 78464: {'lr': 0.00023693544237488303, 'samples': 15065088, 'steps': 78463, 'loss/train': 0.7138352990150452} 11/07/2021 08:17:00 - INFO - __main__ - Step 78465: {'lr': 0.00023693014288458762, 'samples': 15065280, 'steps': 78464, 'loss/train': 1.1956987380981445} 11/07/2021 08:17:01 - INFO - __main__ - Step 78466: {'lr': 0.0002369248434001813, 'samples': 15065472, 'steps': 78465, 'loss/train': 1.0146081447601318} 11/07/2021 08:17:01 - INFO - __main__ - Step 78467: {'lr': 0.00023691954392166643, 'samples': 15065664, 'steps': 78466, 'loss/train': 1.4947649240493774} 11/07/2021 08:17:02 - INFO - __main__ - Step 78468: {'lr': 0.00023691424444904539, 'samples': 15065856, 'steps': 78467, 'loss/train': 1.4225661754608154} 11/07/2021 08:17:02 - INFO - __main__ - Step 78469: {'lr': 0.00023690894498232067, 'samples': 15066048, 'steps': 78468, 'loss/train': 1.548831582069397} 11/07/2021 08:17:02 - INFO - __main__ - Step 78470: {'lr': 0.0002369036455214945, 'samples': 15066240, 'steps': 78469, 'loss/train': 1.3649097681045532} 11/07/2021 08:17:03 - INFO - __main__ - Step 78471: {'lr': 0.00023689834606656932, 'samples': 15066432, 'steps': 78470, 'loss/train': 1.5032105445861816} 11/07/2021 08:17:04 - INFO - __main__ - Step 78472: {'lr': 0.00023689304661754756, 'samples': 15066624, 'steps': 78471, 'loss/train': 1.8311700820922852} 11/07/2021 08:17:04 - INFO - __main__ - Step 78473: {'lr': 0.00023688774717443162, 'samples': 15066816, 'steps': 78472, 'loss/train': 1.2733502388000488} 11/07/2021 08:17:04 - INFO - __main__ - Step 78474: {'lr': 0.00023688244773722384, 'samples': 15067008, 'steps': 78473, 'loss/train': 1.6041905879974365} 11/07/2021 08:17:05 - INFO - __main__ - Step 78475: {'lr': 0.0002368771483059266, 'samples': 15067200, 'steps': 78474, 'loss/train': 1.6166503429412842} 11/07/2021 08:17:06 - INFO - __main__ - Step 78476: {'lr': 0.00023687184888054237, 'samples': 15067392, 'steps': 78475, 'loss/train': 1.046291708946228} 11/07/2021 08:17:06 - INFO - __main__ - Step 78477: {'lr': 0.0002368665494610735, 'samples': 15067584, 'steps': 78476, 'loss/train': 1.6152358055114746} 11/07/2021 08:17:06 - INFO - __main__ - Step 78478: {'lr': 0.00023686125004752231, 'samples': 15067776, 'steps': 78477, 'loss/train': 1.3067001104354858} 11/07/2021 08:17:07 - INFO - __main__ - Step 78479: {'lr': 0.00023685595063989125, 'samples': 15067968, 'steps': 78478, 'loss/train': 1.5228525400161743} 11/07/2021 08:17:07 - INFO - __main__ - Step 78480: {'lr': 0.00023685065123818267, 'samples': 15068160, 'steps': 78479, 'loss/train': 1.8476910591125488} 11/07/2021 08:17:08 - INFO - __main__ - Step 78481: {'lr': 0.000236845351842399, 'samples': 15068352, 'steps': 78480, 'loss/train': 1.7188435792922974} 11/07/2021 08:17:09 - INFO - __main__ - Step 78482: {'lr': 0.00023684005245254268, 'samples': 15068544, 'steps': 78481, 'loss/train': 1.2567678689956665} 11/07/2021 08:17:09 - INFO - __main__ - Step 78483: {'lr': 0.00023683475306861596, 'samples': 15068736, 'steps': 78482, 'loss/train': 1.460429072380066} 11/07/2021 08:17:09 - INFO - __main__ - Step 78484: {'lr': 0.0002368294536906213, 'samples': 15068928, 'steps': 78483, 'loss/train': 1.266924262046814} 11/07/2021 08:17:10 - INFO - __main__ - Step 78485: {'lr': 0.00023682415431856108, 'samples': 15069120, 'steps': 78484, 'loss/train': 1.711340308189392} 11/07/2021 08:17:11 - INFO - __main__ - Step 78486: {'lr': 0.0002368188549524377, 'samples': 15069312, 'steps': 78485, 'loss/train': 1.8394052982330322} 11/07/2021 08:17:11 - INFO - __main__ - Step 78487: {'lr': 0.0002368135555922536, 'samples': 15069504, 'steps': 78486, 'loss/train': 1.1783363819122314} 11/07/2021 08:17:12 - INFO - __main__ - Step 78488: {'lr': 0.00023680825623801103, 'samples': 15069696, 'steps': 78487, 'loss/train': 0.6918118596076965} 11/07/2021 08:17:12 - INFO - __main__ - Step 78489: {'lr': 0.00023680295688971247, 'samples': 15069888, 'steps': 78488, 'loss/train': 1.4950602054595947} 11/07/2021 08:17:12 - INFO - __main__ - Step 78490: {'lr': 0.0002367976575473603, 'samples': 15070080, 'steps': 78489, 'loss/train': 0.7174280881881714} 11/07/2021 08:17:13 - INFO - __main__ - Step 78491: {'lr': 0.0002367923582109569, 'samples': 15070272, 'steps': 78490, 'loss/train': 1.579588532447815} 11/07/2021 08:17:14 - INFO - __main__ - Step 78492: {'lr': 0.00023678705888050463, 'samples': 15070464, 'steps': 78491, 'loss/train': 1.5151890516281128} 11/07/2021 08:17:14 - INFO - __main__ - Step 78493: {'lr': 0.00023678175955600594, 'samples': 15070656, 'steps': 78492, 'loss/train': 1.4461368322372437} 11/07/2021 08:17:14 - INFO - __main__ - Step 78494: {'lr': 0.0002367764602374632, 'samples': 15070848, 'steps': 78493, 'loss/train': 1.3677223920822144} 11/07/2021 08:17:15 - INFO - __main__ - Step 78495: {'lr': 0.00023677116092487874, 'samples': 15071040, 'steps': 78494, 'loss/train': 1.6156103610992432} 11/07/2021 08:17:16 - INFO - __main__ - Step 78496: {'lr': 0.0002367658616182551, 'samples': 15071232, 'steps': 78495, 'loss/train': 1.4454927444458008} 11/07/2021 08:17:16 - INFO - __main__ - Step 78497: {'lr': 0.00023676056231759446, 'samples': 15071424, 'steps': 78496, 'loss/train': 1.8212751150131226} 11/07/2021 08:17:16 - INFO - __main__ - Step 78498: {'lr': 0.00023675526302289936, 'samples': 15071616, 'steps': 78497, 'loss/train': 1.459794044494629} 11/07/2021 08:17:17 - INFO - __main__ - Step 78499: {'lr': 0.0002367499637341721, 'samples': 15071808, 'steps': 78498, 'loss/train': 1.8047553300857544} 11/07/2021 08:17:17 - INFO - __main__ - Step 78500: {'lr': 0.0002367446644514151, 'samples': 15072000, 'steps': 78499, 'loss/train': 1.2665859460830688} 11/07/2021 08:17:18 - INFO - __main__ - Step 78501: {'lr': 0.00023673936517463074, 'samples': 15072192, 'steps': 78500, 'loss/train': 1.533730387687683} 11/07/2021 08:17:19 - INFO - __main__ - Step 78502: {'lr': 0.0002367340659038214, 'samples': 15072384, 'steps': 78501, 'loss/train': 1.3837090730667114} 11/07/2021 08:17:19 - INFO - __main__ - Step 78503: {'lr': 0.0002367287666389895, 'samples': 15072576, 'steps': 78502, 'loss/train': 1.0685851573944092} 11/07/2021 08:17:19 - INFO - __main__ - Step 78504: {'lr': 0.00023672346738013746, 'samples': 15072768, 'steps': 78503, 'loss/train': 1.4019584655761719} 11/07/2021 08:17:20 - INFO - __main__ - Step 78505: {'lr': 0.00023671816812726758, 'samples': 15072960, 'steps': 78504, 'loss/train': 0.10874465852975845} 11/07/2021 08:17:20 - INFO - __main__ - Step 78506: {'lr': 0.00023671286888038225, 'samples': 15073152, 'steps': 78505, 'loss/train': 1.386419653892517} 11/07/2021 08:17:21 - INFO - __main__ - Step 78507: {'lr': 0.00023670756963948395, 'samples': 15073344, 'steps': 78506, 'loss/train': 1.429351806640625} 11/07/2021 08:17:22 - INFO - __main__ - Step 78508: {'lr': 0.00023670227040457502, 'samples': 15073536, 'steps': 78507, 'loss/train': 0.5274412631988525} 11/07/2021 08:17:22 - INFO - __main__ - Step 78509: {'lr': 0.0002366969711756579, 'samples': 15073728, 'steps': 78508, 'loss/train': 1.3421379327774048} 11/07/2021 08:17:22 - INFO - __main__ - Step 78510: {'lr': 0.00023669167195273486, 'samples': 15073920, 'steps': 78509, 'loss/train': 1.0236046314239502} 11/07/2021 08:17:23 - INFO - __main__ - Step 78511: {'lr': 0.0002366863727358083, 'samples': 15074112, 'steps': 78510, 'loss/train': 1.8053920269012451} 11/07/2021 08:17:24 - INFO - __main__ - Step 78512: {'lr': 0.0002366810735248807, 'samples': 15074304, 'steps': 78511, 'loss/train': 1.3634251356124878} 11/07/2021 08:17:24 - INFO - __main__ - Step 78513: {'lr': 0.00023667577431995437, 'samples': 15074496, 'steps': 78512, 'loss/train': 1.5383292436599731} 11/07/2021 08:17:24 - INFO - __main__ - Step 78514: {'lr': 0.00023667047512103176, 'samples': 15074688, 'steps': 78513, 'loss/train': 1.0657726526260376} 11/07/2021 08:17:25 - INFO - __main__ - Step 78515: {'lr': 0.0002366651759281152, 'samples': 15074880, 'steps': 78514, 'loss/train': 1.3179658651351929} 11/07/2021 08:17:25 - INFO - __main__ - Step 78516: {'lr': 0.00023665987674120713, 'samples': 15075072, 'steps': 78515, 'loss/train': 1.5152899026870728} 11/07/2021 08:17:26 - INFO - __main__ - Step 78517: {'lr': 0.0002366545775603099, 'samples': 15075264, 'steps': 78516, 'loss/train': 0.13468097150325775} 11/07/2021 08:17:27 - INFO - __main__ - Step 78518: {'lr': 0.00023664927838542592, 'samples': 15075456, 'steps': 78517, 'loss/train': 1.501265287399292} 11/07/2021 08:17:27 - INFO - __main__ - Step 78519: {'lr': 0.00023664397921655756, 'samples': 15075648, 'steps': 78518, 'loss/train': 1.5760780572891235} 11/07/2021 08:17:27 - INFO - __main__ - Step 78520: {'lr': 0.00023663868005370723, 'samples': 15075840, 'steps': 78519, 'loss/train': 1.6334118843078613} 11/07/2021 08:17:28 - INFO - __main__ - Step 78521: {'lr': 0.00023663338089687728, 'samples': 15076032, 'steps': 78520, 'loss/train': 1.4389015436172485} 11/07/2021 08:17:29 - INFO - __main__ - Step 78522: {'lr': 0.00023662808174607027, 'samples': 15076224, 'steps': 78521, 'loss/train': 1.7287014722824097} 11/07/2021 08:17:29 - INFO - __main__ - Step 78523: {'lr': 0.00023662278260128827, 'samples': 15076416, 'steps': 78522, 'loss/train': 1.3116964101791382} 11/07/2021 08:17:29 - INFO - __main__ - Step 78524: {'lr': 0.0002366174834625339, 'samples': 15076608, 'steps': 78523, 'loss/train': 1.6118143796920776} 11/07/2021 08:17:30 - INFO - __main__ - Step 78525: {'lr': 0.00023661218432980948, 'samples': 15076800, 'steps': 78524, 'loss/train': 0.9036106467247009} 11/07/2021 08:17:30 - INFO - __main__ - Step 78526: {'lr': 0.00023660688520311734, 'samples': 15076992, 'steps': 78525, 'loss/train': 1.1729029417037964} 11/07/2021 08:17:31 - INFO - __main__ - Step 78527: {'lr': 0.00023660158608245998, 'samples': 15077184, 'steps': 78526, 'loss/train': 1.4635796546936035} 11/07/2021 08:17:31 - INFO - __main__ - Step 78528: {'lr': 0.00023659628696783976, 'samples': 15077376, 'steps': 78527, 'loss/train': 1.855009913444519} 11/07/2021 08:17:32 - INFO - __main__ - Step 78529: {'lr': 0.000236590987859259, 'samples': 15077568, 'steps': 78528, 'loss/train': 1.3662022352218628} 11/07/2021 08:17:32 - INFO - __main__ - Step 78530: {'lr': 0.00023658568875672015, 'samples': 15077760, 'steps': 78529, 'loss/train': 1.5825384855270386} 11/07/2021 08:17:32 - INFO - __main__ - Step 78531: {'lr': 0.0002365803896602256, 'samples': 15077952, 'steps': 78530, 'loss/train': 1.8303492069244385} 11/07/2021 08:17:33 - INFO - __main__ - Step 78532: {'lr': 0.0002365750905697777, 'samples': 15078144, 'steps': 78531, 'loss/train': 1.4574815034866333} 11/07/2021 08:17:34 - INFO - __main__ - Step 78533: {'lr': 0.00023656979148537884, 'samples': 15078336, 'steps': 78532, 'loss/train': 1.5289089679718018} 11/07/2021 08:17:35 - INFO - __main__ - Step 78534: {'lr': 0.00023656449240703144, 'samples': 15078528, 'steps': 78533, 'loss/train': 1.0574699640274048} 11/07/2021 08:17:35 - INFO - __main__ - Step 78535: {'lr': 0.0002365591933347379, 'samples': 15078720, 'steps': 78534, 'loss/train': 1.2659494876861572} 11/07/2021 08:17:35 - INFO - __main__ - Step 78536: {'lr': 0.00023655389426850066, 'samples': 15078912, 'steps': 78535, 'loss/train': 0.07689263671636581} 11/07/2021 08:17:36 - INFO - __main__ - Step 78537: {'lr': 0.00023654859520832195, 'samples': 15079104, 'steps': 78536, 'loss/train': 1.3753983974456787} 11/07/2021 08:17:37 - INFO - __main__ - Step 78538: {'lr': 0.0002365432961542042, 'samples': 15079296, 'steps': 78537, 'loss/train': 1.1823710203170776} 11/07/2021 08:17:37 - INFO - __main__ - Step 78539: {'lr': 0.00023653799710614983, 'samples': 15079488, 'steps': 78538, 'loss/train': 1.7574691772460938} 11/07/2021 08:17:38 - INFO - __main__ - Step 78540: {'lr': 0.00023653269806416126, 'samples': 15079680, 'steps': 78539, 'loss/train': 1.5694210529327393} 11/07/2021 08:17:38 - INFO - __main__ - Step 78541: {'lr': 0.00023652739902824084, 'samples': 15079872, 'steps': 78540, 'loss/train': 2.047109603881836} 11/07/2021 08:17:38 - INFO - __main__ - Step 78542: {'lr': 0.00023652209999839094, 'samples': 15080064, 'steps': 78541, 'loss/train': 1.5992379188537598} 11/07/2021 08:17:39 - INFO - __main__ - Step 78543: {'lr': 0.000236516800974614, 'samples': 15080256, 'steps': 78542, 'loss/train': 1.5177440643310547} 11/07/2021 08:17:40 - INFO - __main__ - Step 78544: {'lr': 0.00023651150195691238, 'samples': 15080448, 'steps': 78543, 'loss/train': 1.3464949131011963} 11/07/2021 08:17:40 - INFO - __main__ - Step 78545: {'lr': 0.00023650620294528847, 'samples': 15080640, 'steps': 78544, 'loss/train': 1.4840773344039917} 11/07/2021 08:17:40 - INFO - __main__ - Step 78546: {'lr': 0.00023650090393974467, 'samples': 15080832, 'steps': 78545, 'loss/train': 1.7269287109375} 11/07/2021 08:17:41 - INFO - __main__ - Step 78547: {'lr': 0.0002364956049402833, 'samples': 15081024, 'steps': 78546, 'loss/train': 1.5008747577667236} 11/07/2021 08:17:42 - INFO - __main__ - Step 78548: {'lr': 0.00023649030594690684, 'samples': 15081216, 'steps': 78547, 'loss/train': 1.2569128274917603} 11/07/2021 08:17:42 - INFO - __main__ - Step 78549: {'lr': 0.00023648500695961776, 'samples': 15081408, 'steps': 78548, 'loss/train': 1.1408205032348633} 11/07/2021 08:17:42 - INFO - __main__ - Step 78550: {'lr': 0.00023647970797841818, 'samples': 15081600, 'steps': 78549, 'loss/train': 1.2301063537597656} 11/07/2021 08:17:43 - INFO - __main__ - Step 78551: {'lr': 0.00023647440900331068, 'samples': 15081792, 'steps': 78550, 'loss/train': 1.670587182044983} 11/07/2021 08:17:43 - INFO - __main__ - Step 78552: {'lr': 0.00023646911003429757, 'samples': 15081984, 'steps': 78551, 'loss/train': 1.0424855947494507} 11/07/2021 08:17:44 - INFO - __main__ - Step 78553: {'lr': 0.0002364638110713813, 'samples': 15082176, 'steps': 78552, 'loss/train': 1.0019537210464478} 11/07/2021 08:17:45 - INFO - __main__ - Step 78554: {'lr': 0.0002364585121145642, 'samples': 15082368, 'steps': 78553, 'loss/train': 1.3655552864074707} 11/07/2021 08:17:45 - INFO - __main__ - Step 78555: {'lr': 0.0002364532131638487, 'samples': 15082560, 'steps': 78554, 'loss/train': 1.475820779800415} 11/07/2021 08:17:45 - INFO - __main__ - Step 78556: {'lr': 0.00023644791421923716, 'samples': 15082752, 'steps': 78555, 'loss/train': 0.8495631814002991} 11/07/2021 08:17:46 - INFO - __main__ - Step 78557: {'lr': 0.000236442615280732, 'samples': 15082944, 'steps': 78556, 'loss/train': 1.352036476135254} 11/07/2021 08:17:47 - INFO - __main__ - Step 78558: {'lr': 0.00023643731634833556, 'samples': 15083136, 'steps': 78557, 'loss/train': 1.408713698387146} 11/07/2021 08:17:47 - INFO - __main__ - Step 78559: {'lr': 0.00023643201742205028, 'samples': 15083328, 'steps': 78558, 'loss/train': 1.747580885887146} 11/07/2021 08:17:47 - INFO - __main__ - Step 78560: {'lr': 0.00023642671850187852, 'samples': 15083520, 'steps': 78559, 'loss/train': 1.4235519170761108} 11/07/2021 08:17:48 - INFO - __main__ - Step 78561: {'lr': 0.0002364214195878227, 'samples': 15083712, 'steps': 78560, 'loss/train': 1.4405553340911865} 11/07/2021 08:17:48 - INFO - __main__ - Step 78562: {'lr': 0.00023641612067988512, 'samples': 15083904, 'steps': 78561, 'loss/train': 1.4414929151535034} 11/07/2021 08:17:49 - INFO - __main__ - Step 78563: {'lr': 0.00023641082177806836, 'samples': 15084096, 'steps': 78562, 'loss/train': 1.7849204540252686} 11/07/2021 08:17:49 - INFO - __main__ - Step 78564: {'lr': 0.00023640552288237458, 'samples': 15084288, 'steps': 78563, 'loss/train': 1.2922585010528564} 11/07/2021 08:17:50 - INFO - __main__ - Step 78565: {'lr': 0.00023640022399280626, 'samples': 15084480, 'steps': 78564, 'loss/train': 1.5846595764160156} 11/07/2021 08:17:50 - INFO - __main__ - Step 78566: {'lr': 0.00023639492510936575, 'samples': 15084672, 'steps': 78565, 'loss/train': 1.4358001947402954} 11/07/2021 08:17:50 - INFO - __main__ - Step 78567: {'lr': 0.0002363896262320555, 'samples': 15084864, 'steps': 78566, 'loss/train': 1.7945977449417114} 11/07/2021 08:17:52 - INFO - __main__ - Step 78568: {'lr': 0.0002363843273608779, 'samples': 15085056, 'steps': 78567, 'loss/train': 1.5258232355117798} 11/07/2021 08:17:52 - INFO - __main__ - Step 78569: {'lr': 0.0002363790284958353, 'samples': 15085248, 'steps': 78568, 'loss/train': 1.230426549911499} 11/07/2021 08:17:52 - INFO - __main__ - Step 78570: {'lr': 0.00023637372963693007, 'samples': 15085440, 'steps': 78569, 'loss/train': 1.5386549234390259} 11/07/2021 08:17:53 - INFO - __main__ - Step 78571: {'lr': 0.00023636843078416464, 'samples': 15085632, 'steps': 78570, 'loss/train': 1.773424506187439} 11/07/2021 08:17:53 - INFO - __main__ - Step 78572: {'lr': 0.00023636313193754142, 'samples': 15085824, 'steps': 78571, 'loss/train': 0.42366740107536316} 11/07/2021 08:17:53 - INFO - __main__ - Step 78573: {'lr': 0.00023635783309706272, 'samples': 15086016, 'steps': 78572, 'loss/train': 1.9022780656814575} 11/07/2021 08:17:54 - INFO - __main__ - Step 78574: {'lr': 0.00023635253426273098, 'samples': 15086208, 'steps': 78573, 'loss/train': 5.761111736297607} 11/07/2021 08:17:55 - INFO - __main__ - Step 78575: {'lr': 0.0002363472354345486, 'samples': 15086400, 'steps': 78574, 'loss/train': 1.4671516418457031} 11/07/2021 08:17:55 - INFO - __main__ - Step 78576: {'lr': 0.00023634193661251803, 'samples': 15086592, 'steps': 78575, 'loss/train': 1.657737135887146} 11/07/2021 08:17:55 - INFO - __main__ - Step 78577: {'lr': 0.00023633663779664148, 'samples': 15086784, 'steps': 78576, 'loss/train': 1.3186098337173462} 11/07/2021 08:17:56 - INFO - __main__ - Step 78578: {'lr': 0.0002363313389869214, 'samples': 15086976, 'steps': 78577, 'loss/train': 1.2802293300628662} 11/07/2021 08:17:57 - INFO - __main__ - Step 78579: {'lr': 0.00023632604018336025, 'samples': 15087168, 'steps': 78578, 'loss/train': 0.9495767951011658} 11/07/2021 08:17:58 - INFO - __main__ - Step 78580: {'lr': 0.00023632074138596034, 'samples': 15087360, 'steps': 78579, 'loss/train': 1.5070927143096924} 11/07/2021 08:17:58 - INFO - __main__ - Step 78581: {'lr': 0.00023631544259472413, 'samples': 15087552, 'steps': 78580, 'loss/train': 0.8467374444007874} 11/07/2021 08:17:58 - INFO - __main__ - Step 78582: {'lr': 0.00023631014380965393, 'samples': 15087744, 'steps': 78581, 'loss/train': 0.7975701093673706} 11/07/2021 08:17:59 - INFO - __main__ - Step 78583: {'lr': 0.0002363048450307522, 'samples': 15087936, 'steps': 78582, 'loss/train': 1.5213536024093628} 11/07/2021 08:18:00 - INFO - __main__ - Step 78584: {'lr': 0.00023629954625802127, 'samples': 15088128, 'steps': 78583, 'loss/train': 1.593153953552246} 11/07/2021 08:18:00 - INFO - __main__ - Step 78585: {'lr': 0.00023629424749146356, 'samples': 15088320, 'steps': 78584, 'loss/train': 1.6717585325241089} 11/07/2021 08:18:00 - INFO - __main__ - Step 78586: {'lr': 0.00023628894873108146, 'samples': 15088512, 'steps': 78585, 'loss/train': 1.4363751411437988} 11/07/2021 08:18:01 - INFO - __main__ - Step 78587: {'lr': 0.00023628364997687733, 'samples': 15088704, 'steps': 78586, 'loss/train': 1.1326910257339478} 11/07/2021 08:18:01 - INFO - __main__ - Step 78588: {'lr': 0.0002362783512288536, 'samples': 15088896, 'steps': 78587, 'loss/train': 1.1237150430679321} 11/07/2021 08:18:02 - INFO - __main__ - Step 78589: {'lr': 0.00023627305248701268, 'samples': 15089088, 'steps': 78588, 'loss/train': 1.3672873973846436} 11/07/2021 08:18:02 - INFO - __main__ - Step 78590: {'lr': 0.0002362677537513569, 'samples': 15089280, 'steps': 78589, 'loss/train': 1.1306521892547607} 11/07/2021 08:18:03 - INFO - __main__ - Step 78591: {'lr': 0.00023626245502188863, 'samples': 15089472, 'steps': 78590, 'loss/train': 1.4413628578186035} 11/07/2021 08:18:03 - INFO - __main__ - Step 78592: {'lr': 0.00023625715629861026, 'samples': 15089664, 'steps': 78591, 'loss/train': 1.044220209121704} 11/07/2021 08:18:03 - INFO - __main__ - Step 78593: {'lr': 0.00023625185758152417, 'samples': 15089856, 'steps': 78592, 'loss/train': 1.5939316749572754} 11/07/2021 08:18:04 - INFO - __main__ - Step 78594: {'lr': 0.00023624655887063284, 'samples': 15090048, 'steps': 78593, 'loss/train': 1.6231434345245361} 11/07/2021 08:18:05 - INFO - __main__ - Step 78595: {'lr': 0.0002362412601659386, 'samples': 15090240, 'steps': 78594, 'loss/train': 1.5680797100067139} 11/07/2021 08:18:05 - INFO - __main__ - Step 78596: {'lr': 0.0002362359614674438, 'samples': 15090432, 'steps': 78595, 'loss/train': 0.9235043525695801} 11/07/2021 08:18:06 - INFO - __main__ - Step 78597: {'lr': 0.00023623066277515088, 'samples': 15090624, 'steps': 78596, 'loss/train': 0.6823720932006836} 11/07/2021 08:18:06 - INFO - __main__ - Step 78598: {'lr': 0.0002362253640890622, 'samples': 15090816, 'steps': 78597, 'loss/train': 0.6055607795715332} 11/07/2021 08:18:06 - INFO - __main__ - Step 78599: {'lr': 0.00023622006540918017, 'samples': 15091008, 'steps': 78598, 'loss/train': 1.4532023668289185} 11/07/2021 08:18:07 - INFO - __main__ - Step 78600: {'lr': 0.0002362147667355072, 'samples': 15091200, 'steps': 78599, 'loss/train': 1.3274508714675903} 11/07/2021 08:18:08 - INFO - __main__ - Step 78601: {'lr': 0.00023620946806804561, 'samples': 15091392, 'steps': 78600, 'loss/train': 1.1673362255096436} 11/07/2021 08:18:08 - INFO - __main__ - Step 78602: {'lr': 0.0002362041694067978, 'samples': 15091584, 'steps': 78601, 'loss/train': 1.2404645681381226} 11/07/2021 08:18:08 - INFO - __main__ - Step 78603: {'lr': 0.00023619887075176628, 'samples': 15091776, 'steps': 78602, 'loss/train': 1.4238471984863281} 11/07/2021 08:18:09 - INFO - __main__ - Step 78604: {'lr': 0.00023619357210295325, 'samples': 15091968, 'steps': 78603, 'loss/train': 0.9610432982444763} 11/07/2021 08:18:10 - INFO - __main__ - Step 78605: {'lr': 0.00023618827346036118, 'samples': 15092160, 'steps': 78604, 'loss/train': 1.006853699684143} 11/07/2021 08:18:10 - INFO - __main__ - Step 78606: {'lr': 0.00023618297482399248, 'samples': 15092352, 'steps': 78605, 'loss/train': 1.2682172060012817} 11/07/2021 08:18:11 - INFO - __main__ - Step 78607: {'lr': 0.0002361776761938495, 'samples': 15092544, 'steps': 78606, 'loss/train': 1.5368462800979614} 11/07/2021 08:18:11 - INFO - __main__ - Step 78608: {'lr': 0.00023617237756993464, 'samples': 15092736, 'steps': 78607, 'loss/train': 1.2126870155334473} 11/07/2021 08:18:11 - INFO - __main__ - Step 78609: {'lr': 0.00023616707895225033, 'samples': 15092928, 'steps': 78608, 'loss/train': 1.1530799865722656} 11/07/2021 08:18:12 - INFO - __main__ - Step 78610: {'lr': 0.00023616178034079887, 'samples': 15093120, 'steps': 78609, 'loss/train': 1.603016972541809} 11/07/2021 08:18:13 - INFO - __main__ - Step 78611: {'lr': 0.00023615648173558277, 'samples': 15093312, 'steps': 78610, 'loss/train': 1.322507619857788} 11/07/2021 08:18:13 - INFO - __main__ - Step 78612: {'lr': 0.0002361511831366043, 'samples': 15093504, 'steps': 78611, 'loss/train': 1.2198961973190308} 11/07/2021 08:18:13 - INFO - __main__ - Step 78613: {'lr': 0.0002361458845438659, 'samples': 15093696, 'steps': 78612, 'loss/train': 1.4455581903457642} 11/07/2021 08:18:14 - INFO - __main__ - Step 78614: {'lr': 0.00023614058595736992, 'samples': 15093888, 'steps': 78613, 'loss/train': 2.090867757797241} 11/07/2021 08:18:14 - INFO - __main__ - Step 78615: {'lr': 0.00023613528737711882, 'samples': 15094080, 'steps': 78614, 'loss/train': 1.7427576780319214} 11/07/2021 08:18:15 - INFO - __main__ - Step 78616: {'lr': 0.00023612998880311492, 'samples': 15094272, 'steps': 78615, 'loss/train': 1.1763334274291992} 11/07/2021 08:18:15 - INFO - __main__ - Step 78617: {'lr': 0.0002361246902353607, 'samples': 15094464, 'steps': 78616, 'loss/train': 1.382069706916809} 11/07/2021 08:18:16 - INFO - __main__ - Step 78618: {'lr': 0.0002361193916738584, 'samples': 15094656, 'steps': 78617, 'loss/train': 1.2120968103408813} 11/07/2021 08:18:16 - INFO - __main__ - Step 78619: {'lr': 0.0002361140931186105, 'samples': 15094848, 'steps': 78618, 'loss/train': 1.4280979633331299} 11/07/2021 08:18:16 - INFO - __main__ - Step 78620: {'lr': 0.0002361087945696194, 'samples': 15095040, 'steps': 78619, 'loss/train': 1.2060693502426147} 11/07/2021 08:18:17 - INFO - __main__ - Step 78621: {'lr': 0.00023610349602688744, 'samples': 15095232, 'steps': 78620, 'loss/train': 1.4779119491577148} 11/07/2021 08:18:18 - INFO - __main__ - Step 78622: {'lr': 0.00023609819749041707, 'samples': 15095424, 'steps': 78621, 'loss/train': 1.3189915418624878} 11/07/2021 08:18:18 - INFO - __main__ - Step 78623: {'lr': 0.00023609289896021064, 'samples': 15095616, 'steps': 78622, 'loss/train': 0.9666324257850647} 11/07/2021 08:18:18 - INFO - __main__ - Step 78624: {'lr': 0.00023608760043627048, 'samples': 15095808, 'steps': 78623, 'loss/train': 1.4486818313598633} 11/07/2021 08:18:19 - INFO - __main__ - Step 78625: {'lr': 0.00023608230191859907, 'samples': 15096000, 'steps': 78624, 'loss/train': 1.4938937425613403} 11/07/2021 08:18:19 - INFO - __main__ - Step 78626: {'lr': 0.00023607700340719874, 'samples': 15096192, 'steps': 78625, 'loss/train': 1.5127308368682861} 11/07/2021 08:18:20 - INFO - __main__ - Step 78627: {'lr': 0.00023607170490207188, 'samples': 15096384, 'steps': 78626, 'loss/train': 0.8914988040924072} 11/07/2021 08:18:21 - INFO - __main__ - Step 78628: {'lr': 0.00023606640640322092, 'samples': 15096576, 'steps': 78627, 'loss/train': 1.4134178161621094} 11/07/2021 08:18:21 - INFO - __main__ - Step 78629: {'lr': 0.00023606110791064822, 'samples': 15096768, 'steps': 78628, 'loss/train': 1.2997390031814575} 11/07/2021 08:18:21 - INFO - __main__ - Step 78630: {'lr': 0.0002360558094243562, 'samples': 15096960, 'steps': 78629, 'loss/train': 1.314825177192688} 11/07/2021 08:18:22 - INFO - __main__ - Step 78631: {'lr': 0.00023605051094434718, 'samples': 15097152, 'steps': 78630, 'loss/train': 0.9484968185424805} 11/07/2021 08:18:23 - INFO - __main__ - Step 78632: {'lr': 0.0002360452124706236, 'samples': 15097344, 'steps': 78631, 'loss/train': 1.709856390953064} 11/07/2021 08:18:23 - INFO - __main__ - Step 78633: {'lr': 0.00023603991400318787, 'samples': 15097536, 'steps': 78632, 'loss/train': 1.4213343858718872} 11/07/2021 08:18:23 - INFO - __main__ - Step 78634: {'lr': 0.0002360346155420423, 'samples': 15097728, 'steps': 78633, 'loss/train': 1.4623042345046997} 11/07/2021 08:18:24 - INFO - __main__ - Step 78635: {'lr': 0.0002360293170871893, 'samples': 15097920, 'steps': 78634, 'loss/train': 1.5759402513504028} 11/07/2021 08:18:24 - INFO - __main__ - Step 78636: {'lr': 0.00023602401863863126, 'samples': 15098112, 'steps': 78635, 'loss/train': 1.5623772144317627} 11/07/2021 08:18:25 - INFO - __main__ - Step 78637: {'lr': 0.00023601872019637061, 'samples': 15098304, 'steps': 78636, 'loss/train': 1.18293297290802} 11/07/2021 08:18:26 - INFO - __main__ - Step 78638: {'lr': 0.0002360134217604097, 'samples': 15098496, 'steps': 78637, 'loss/train': 1.517102837562561} 11/07/2021 08:18:26 - INFO - __main__ - Step 78639: {'lr': 0.00023600812333075092, 'samples': 15098688, 'steps': 78638, 'loss/train': 1.1537842750549316} 11/07/2021 08:18:26 - INFO - __main__ - Step 78640: {'lr': 0.00023600282490739667, 'samples': 15098880, 'steps': 78639, 'loss/train': 0.9167101979255676} 11/07/2021 08:18:27 - INFO - __main__ - Step 78641: {'lr': 0.00023599752649034933, 'samples': 15099072, 'steps': 78640, 'loss/train': 1.184833288192749} 11/07/2021 08:18:28 - INFO - __main__ - Step 78642: {'lr': 0.0002359922280796113, 'samples': 15099264, 'steps': 78641, 'loss/train': 0.45397311449050903} 11/07/2021 08:18:28 - INFO - __main__ - Step 78643: {'lr': 0.000235986929675185, 'samples': 15099456, 'steps': 78642, 'loss/train': 1.5867775678634644} 11/07/2021 08:18:28 - INFO - __main__ - Step 78644: {'lr': 0.00023598163127707276, 'samples': 15099648, 'steps': 78643, 'loss/train': 1.467056393623352} 11/07/2021 08:18:29 - INFO - __main__ - Step 78645: {'lr': 0.00023597633288527695, 'samples': 15099840, 'steps': 78644, 'loss/train': 1.8922157287597656} 11/07/2021 08:18:29 - INFO - __main__ - Step 78646: {'lr': 0.00023597103449979996, 'samples': 15100032, 'steps': 78645, 'loss/train': 1.238088846206665} 11/07/2021 08:18:30 - INFO - __main__ - Step 78647: {'lr': 0.00023596573612064424, 'samples': 15100224, 'steps': 78646, 'loss/train': 1.1676255464553833} 11/07/2021 08:18:30 - INFO - __main__ - Step 78648: {'lr': 0.00023596043774781213, 'samples': 15100416, 'steps': 78647, 'loss/train': 1.7148020267486572} 11/07/2021 08:18:31 - INFO - __main__ - Step 78649: {'lr': 0.000235955139381306, 'samples': 15100608, 'steps': 78648, 'loss/train': 1.389120101928711} 11/07/2021 08:18:31 - INFO - __main__ - Step 78650: {'lr': 0.00023594984102112828, 'samples': 15100800, 'steps': 78649, 'loss/train': 1.304937720298767} 11/07/2021 08:18:31 - INFO - __main__ - Step 78651: {'lr': 0.0002359445426672814, 'samples': 15100992, 'steps': 78650, 'loss/train': 1.8417341709136963} 11/07/2021 08:18:32 - INFO - __main__ - Step 78652: {'lr': 0.00023593924431976763, 'samples': 15101184, 'steps': 78651, 'loss/train': 1.5211910009384155} 11/07/2021 08:18:33 - INFO - __main__ - Step 78653: {'lr': 0.00023593394597858945, 'samples': 15101376, 'steps': 78652, 'loss/train': 1.4161216020584106} 11/07/2021 08:18:33 - INFO - __main__ - Step 78654: {'lr': 0.0002359286476437492, 'samples': 15101568, 'steps': 78653, 'loss/train': 1.1811041831970215} 11/07/2021 08:18:34 - INFO - __main__ - Step 78655: {'lr': 0.00023592334931524928, 'samples': 15101760, 'steps': 78654, 'loss/train': 4.12826681137085} 11/07/2021 08:18:34 - INFO - __main__ - Step 78656: {'lr': 0.00023591805099309207, 'samples': 15101952, 'steps': 78655, 'loss/train': 1.339523434638977} 11/07/2021 08:18:35 - INFO - __main__ - Step 78657: {'lr': 0.00023591275267728014, 'samples': 15102144, 'steps': 78656, 'loss/train': 1.4139028787612915} 11/07/2021 08:18:35 - INFO - __main__ - Step 78658: {'lr': 0.00023590745436781552, 'samples': 15102336, 'steps': 78657, 'loss/train': 1.670822024345398} 11/07/2021 08:18:36 - INFO - __main__ - Step 78659: {'lr': 0.00023590215606470084, 'samples': 15102528, 'steps': 78658, 'loss/train': 1.6527948379516602} 11/07/2021 08:18:36 - INFO - __main__ - Step 78660: {'lr': 0.0002358968577679384, 'samples': 15102720, 'steps': 78659, 'loss/train': 1.870732069015503} 11/07/2021 08:18:36 - INFO - __main__ - Step 78661: {'lr': 0.00023589155947753064, 'samples': 15102912, 'steps': 78660, 'loss/train': 1.6388700008392334} 11/07/2021 08:18:37 - INFO - __main__ - Step 78662: {'lr': 0.00023588626119347991, 'samples': 15103104, 'steps': 78661, 'loss/train': 1.1891746520996094} 11/07/2021 08:18:38 - INFO - __main__ - Step 78663: {'lr': 0.0002358809629157886, 'samples': 15103296, 'steps': 78662, 'loss/train': 1.2981629371643066} 11/07/2021 08:18:38 - INFO - __main__ - Step 78664: {'lr': 0.00023587566464445915, 'samples': 15103488, 'steps': 78663, 'loss/train': 1.4199750423431396} 11/07/2021 08:18:38 - INFO - __main__ - Step 78665: {'lr': 0.00023587036637949389, 'samples': 15103680, 'steps': 78664, 'loss/train': 1.080276608467102} 11/07/2021 08:18:39 - INFO - __main__ - Step 78666: {'lr': 0.0002358650681208952, 'samples': 15103872, 'steps': 78665, 'loss/train': 1.338783860206604} 11/07/2021 08:18:39 - INFO - __main__ - Step 78667: {'lr': 0.00023585976986866553, 'samples': 15104064, 'steps': 78666, 'loss/train': 0.830841600894928} 11/07/2021 08:18:40 - INFO - __main__ - Step 78668: {'lr': 0.0002358544716228072, 'samples': 15104256, 'steps': 78667, 'loss/train': 1.6115301847457886} 11/07/2021 08:18:40 - INFO - __main__ - Step 78669: {'lr': 0.00023584917338332264, 'samples': 15104448, 'steps': 78668, 'loss/train': 1.2263078689575195} 11/07/2021 08:18:41 - INFO - __main__ - Step 78670: {'lr': 0.00023584387515021433, 'samples': 15104640, 'steps': 78669, 'loss/train': 1.4935050010681152} 11/07/2021 08:18:41 - INFO - __main__ - Step 78671: {'lr': 0.00023583857692348445, 'samples': 15104832, 'steps': 78670, 'loss/train': 1.3774821758270264} 11/07/2021 08:18:41 - INFO - __main__ - Step 78672: {'lr': 0.00023583327870313548, 'samples': 15105024, 'steps': 78671, 'loss/train': 1.5957450866699219} 11/07/2021 08:18:42 - INFO - __main__ - Step 78673: {'lr': 0.0002358279804891698, 'samples': 15105216, 'steps': 78672, 'loss/train': 1.2576810121536255} 11/07/2021 08:18:43 - INFO - __main__ - Step 78674: {'lr': 0.00023582268228158985, 'samples': 15105408, 'steps': 78673, 'loss/train': 0.34300750494003296} 11/07/2021 08:18:43 - INFO - __main__ - Step 78675: {'lr': 0.00023581738408039797, 'samples': 15105600, 'steps': 78674, 'loss/train': 1.3223811388015747} 11/07/2021 08:18:43 - INFO - __main__ - Step 78676: {'lr': 0.00023581208588559655, 'samples': 15105792, 'steps': 78675, 'loss/train': 2.2766122817993164} 11/07/2021 08:18:44 - INFO - __main__ - Step 78677: {'lr': 0.000235806787697188, 'samples': 15105984, 'steps': 78676, 'loss/train': 1.2125650644302368} 11/07/2021 08:18:45 - INFO - __main__ - Step 78678: {'lr': 0.00023580148951517465, 'samples': 15106176, 'steps': 78677, 'loss/train': 1.4355180263519287} 11/07/2021 08:18:45 - INFO - __main__ - Step 78679: {'lr': 0.00023579619133955897, 'samples': 15106368, 'steps': 78678, 'loss/train': 0.8845800161361694} 11/07/2021 08:18:46 - INFO - __main__ - Step 78680: {'lr': 0.0002357908931703433, 'samples': 15106560, 'steps': 78679, 'loss/train': 1.695426106452942} 11/07/2021 08:18:46 - INFO - __main__ - Step 78681: {'lr': 0.00023578559500753003, 'samples': 15106752, 'steps': 78680, 'loss/train': 1.3537603616714478} 11/07/2021 08:18:46 - INFO - __main__ - Step 78682: {'lr': 0.00023578029685112153, 'samples': 15106944, 'steps': 78681, 'loss/train': 1.5438731908798218} 11/07/2021 08:18:47 - INFO - __main__ - Step 78683: {'lr': 0.00023577499870112024, 'samples': 15107136, 'steps': 78682, 'loss/train': 1.4258521795272827} 11/07/2021 08:18:48 - INFO - __main__ - Step 78684: {'lr': 0.0002357697005575286, 'samples': 15107328, 'steps': 78683, 'loss/train': 1.997011661529541} 11/07/2021 08:18:48 - INFO - __main__ - Step 78685: {'lr': 0.00023576440242034885, 'samples': 15107520, 'steps': 78684, 'loss/train': 1.4013164043426514} 11/07/2021 08:18:48 - INFO - __main__ - Step 78686: {'lr': 0.00023575910428958342, 'samples': 15107712, 'steps': 78685, 'loss/train': 1.272080421447754} 11/07/2021 08:18:49 - INFO - __main__ - Step 78687: {'lr': 0.0002357538061652347, 'samples': 15107904, 'steps': 78686, 'loss/train': 1.049033761024475} 11/07/2021 08:18:50 - INFO - __main__ - Step 78688: {'lr': 0.0002357485080473051, 'samples': 15108096, 'steps': 78687, 'loss/train': 1.1504279375076294} 11/07/2021 08:18:50 - INFO - __main__ - Step 78689: {'lr': 0.000235743209935797, 'samples': 15108288, 'steps': 78688, 'loss/train': 1.8256717920303345} 11/07/2021 08:18:51 - INFO - __main__ - Step 78690: {'lr': 0.0002357379118307128, 'samples': 15108480, 'steps': 78689, 'loss/train': 1.4799262285232544} 11/07/2021 08:18:51 - INFO - __main__ - Step 78691: {'lr': 0.00023573261373205487, 'samples': 15108672, 'steps': 78690, 'loss/train': 1.677964448928833} 11/07/2021 08:18:51 - INFO - __main__ - Step 78692: {'lr': 0.0002357273156398256, 'samples': 15108864, 'steps': 78691, 'loss/train': 1.5257205963134766} 11/07/2021 08:18:52 - INFO - __main__ - Step 78693: {'lr': 0.00023572201755402738, 'samples': 15109056, 'steps': 78692, 'loss/train': 1.3485177755355835} 11/07/2021 08:18:53 - INFO - __main__ - Step 78694: {'lr': 0.00023571671947466262, 'samples': 15109248, 'steps': 78693, 'loss/train': 0.9756007194519043} 11/07/2021 08:18:53 - INFO - __main__ - Step 78695: {'lr': 0.00023571142140173365, 'samples': 15109440, 'steps': 78694, 'loss/train': 0.9527813196182251} 11/07/2021 08:18:53 - INFO - __main__ - Step 78696: {'lr': 0.00023570612333524295, 'samples': 15109632, 'steps': 78695, 'loss/train': 1.4486165046691895} 11/07/2021 08:18:54 - INFO - __main__ - Step 78697: {'lr': 0.0002357008252751929, 'samples': 15109824, 'steps': 78696, 'loss/train': 1.327458143234253} 11/07/2021 08:18:54 - INFO - __main__ - Step 78698: {'lr': 0.00023569552722158574, 'samples': 15110016, 'steps': 78697, 'loss/train': 1.5408861637115479} 11/07/2021 08:18:55 - INFO - __main__ - Step 78699: {'lr': 0.00023569022917442397, 'samples': 15110208, 'steps': 78698, 'loss/train': 1.3786529302597046} 11/07/2021 08:18:55 - INFO - __main__ - Step 78700: {'lr': 0.00023568493113371, 'samples': 15110400, 'steps': 78699, 'loss/train': 1.750770092010498} 11/07/2021 08:18:56 - INFO - __main__ - Step 78701: {'lr': 0.0002356796330994461, 'samples': 15110592, 'steps': 78700, 'loss/train': 1.5502500534057617} 11/07/2021 08:18:56 - INFO - __main__ - Step 78702: {'lr': 0.00023567433507163478, 'samples': 15110784, 'steps': 78701, 'loss/train': 1.069288730621338} 11/07/2021 08:18:56 - INFO - __main__ - Step 78703: {'lr': 0.00023566903705027836, 'samples': 15110976, 'steps': 78702, 'loss/train': 0.9206255078315735} 11/07/2021 08:18:58 - INFO - __main__ - Step 78704: {'lr': 0.0002356637390353793, 'samples': 15111168, 'steps': 78703, 'loss/train': 1.541756510734558} 11/07/2021 08:18:58 - INFO - __main__ - Step 78705: {'lr': 0.0002356584410269399, 'samples': 15111360, 'steps': 78704, 'loss/train': 1.5584995746612549} 11/07/2021 08:18:58 - INFO - __main__ - Step 78706: {'lr': 0.0002356531430249626, 'samples': 15111552, 'steps': 78705, 'loss/train': 1.3601480722427368} 11/07/2021 08:18:59 - INFO - __main__ - Step 78707: {'lr': 0.00023564784502944975, 'samples': 15111744, 'steps': 78706, 'loss/train': 1.2725803852081299} 11/07/2021 08:18:59 - INFO - __main__ - Step 78708: {'lr': 0.00023564254704040377, 'samples': 15111936, 'steps': 78707, 'loss/train': 1.395926594734192} 11/07/2021 08:19:00 - INFO - __main__ - Step 78709: {'lr': 0.00023563724905782704, 'samples': 15112128, 'steps': 78708, 'loss/train': 1.0809710025787354} 11/07/2021 08:19:01 - INFO - __main__ - Step 78710: {'lr': 0.00023563195108172196, 'samples': 15112320, 'steps': 78709, 'loss/train': 1.6934159994125366} 11/07/2021 08:19:01 - INFO - __main__ - Step 78711: {'lr': 0.00023562665311209098, 'samples': 15112512, 'steps': 78710, 'loss/train': 1.5526338815689087} 11/07/2021 08:19:01 - INFO - __main__ - Step 78712: {'lr': 0.00023562135514893631, 'samples': 15112704, 'steps': 78711, 'loss/train': 1.2976139783859253} 11/07/2021 08:19:02 - INFO - __main__ - Step 78713: {'lr': 0.00023561605719226046, 'samples': 15112896, 'steps': 78712, 'loss/train': 1.1693201065063477} 11/07/2021 08:19:03 - INFO - __main__ - Step 78714: {'lr': 0.00023561075924206576, 'samples': 15113088, 'steps': 78713, 'loss/train': 1.6710336208343506} 11/07/2021 08:19:03 - INFO - __main__ - Step 78715: {'lr': 0.00023560546129835464, 'samples': 15113280, 'steps': 78714, 'loss/train': 1.5202016830444336} 11/07/2021 08:19:03 - INFO - __main__ - Step 78716: {'lr': 0.00023560016336112948, 'samples': 15113472, 'steps': 78715, 'loss/train': 0.5278545618057251} 11/07/2021 08:19:04 - INFO - __main__ - Step 78717: {'lr': 0.00023559486543039265, 'samples': 15113664, 'steps': 78716, 'loss/train': 1.3541830778121948} 11/07/2021 08:19:04 - INFO - __main__ - Step 78718: {'lr': 0.00023558956750614657, 'samples': 15113856, 'steps': 78717, 'loss/train': 1.3889025449752808} 11/07/2021 08:19:05 - INFO - __main__ - Step 78719: {'lr': 0.0002355842695883936, 'samples': 15114048, 'steps': 78718, 'loss/train': 1.5788244009017944} 11/07/2021 08:19:05 - INFO - __main__ - Step 78720: {'lr': 0.00023557897167713615, 'samples': 15114240, 'steps': 78719, 'loss/train': 2.153195381164551} 11/07/2021 08:19:06 - INFO - __main__ - Step 78721: {'lr': 0.00023557367377237658, 'samples': 15114432, 'steps': 78720, 'loss/train': 1.314360499382019} 11/07/2021 08:19:06 - INFO - __main__ - Step 78722: {'lr': 0.00023556837587411728, 'samples': 15114624, 'steps': 78721, 'loss/train': 0.9148507714271545} 11/07/2021 08:19:07 - INFO - __main__ - Step 78723: {'lr': 0.00023556307798236074, 'samples': 15114816, 'steps': 78722, 'loss/train': 1.4894471168518066} 11/07/2021 08:19:07 - INFO - __main__ - Step 78724: {'lr': 0.0002355577800971092, 'samples': 15115008, 'steps': 78723, 'loss/train': 1.889638900756836} 11/07/2021 08:19:08 - INFO - __main__ - Step 78725: {'lr': 0.00023555248221836508, 'samples': 15115200, 'steps': 78724, 'loss/train': 1.6020649671554565} 11/07/2021 08:19:08 - INFO - __main__ - Step 78726: {'lr': 0.0002355471843461308, 'samples': 15115392, 'steps': 78725, 'loss/train': 1.2224469184875488} 11/07/2021 08:19:09 - INFO - __main__ - Step 78727: {'lr': 0.0002355418864804087, 'samples': 15115584, 'steps': 78726, 'loss/train': 1.3974682092666626} 11/07/2021 08:19:09 - INFO - __main__ - Step 78728: {'lr': 0.00023553658862120124, 'samples': 15115776, 'steps': 78727, 'loss/train': 1.284716010093689} 11/07/2021 08:19:09 - INFO - __main__ - Step 78729: {'lr': 0.00023553129076851073, 'samples': 15115968, 'steps': 78728, 'loss/train': 1.2349016666412354} 11/07/2021 08:19:10 - INFO - __main__ - Step 78730: {'lr': 0.00023552599292233964, 'samples': 15116160, 'steps': 78729, 'loss/train': 1.4709209203720093} 11/07/2021 08:19:11 - INFO - __main__ - Step 78731: {'lr': 0.00023552069508269027, 'samples': 15116352, 'steps': 78730, 'loss/train': 1.617026448249817} 11/07/2021 08:19:11 - INFO - __main__ - Step 78732: {'lr': 0.00023551539724956508, 'samples': 15116544, 'steps': 78731, 'loss/train': 1.2769546508789062} 11/07/2021 08:19:11 - INFO - __main__ - Step 78733: {'lr': 0.00023551009942296641, 'samples': 15116736, 'steps': 78732, 'loss/train': 1.4100379943847656} 11/07/2021 08:19:12 - INFO - __main__ - Step 78734: {'lr': 0.00023550480160289676, 'samples': 15116928, 'steps': 78733, 'loss/train': 1.5314468145370483} 11/07/2021 08:19:13 - INFO - __main__ - Step 78735: {'lr': 0.00023549950378935834, 'samples': 15117120, 'steps': 78734, 'loss/train': 1.406087040901184} 11/07/2021 08:19:14 - INFO - __main__ - Step 78736: {'lr': 0.00023549420598235362, 'samples': 15117312, 'steps': 78735, 'loss/train': 1.2606911659240723} 11/07/2021 08:19:14 - INFO - __main__ - Step 78737: {'lr': 0.00023548890818188498, 'samples': 15117504, 'steps': 78736, 'loss/train': 0.8743999600410461} 11/07/2021 08:19:14 - INFO - __main__ - Step 78738: {'lr': 0.00023548361038795487, 'samples': 15117696, 'steps': 78737, 'loss/train': 1.4819422960281372} 11/07/2021 08:19:15 - INFO - __main__ - Step 78739: {'lr': 0.00023547831260056556, 'samples': 15117888, 'steps': 78738, 'loss/train': 1.418462872505188} 11/07/2021 08:19:15 - INFO - __main__ - Step 78740: {'lr': 0.00023547301481971952, 'samples': 15118080, 'steps': 78739, 'loss/train': 0.33582496643066406} 11/07/2021 08:19:16 - INFO - __main__ - Step 78741: {'lr': 0.0002354677170454191, 'samples': 15118272, 'steps': 78740, 'loss/train': 1.4294935464859009} 11/07/2021 08:19:17 - INFO - __main__ - Step 78742: {'lr': 0.00023546241927766673, 'samples': 15118464, 'steps': 78741, 'loss/train': 1.809476613998413} 11/07/2021 08:19:17 - INFO - __main__ - Step 78743: {'lr': 0.00023545712151646476, 'samples': 15118656, 'steps': 78742, 'loss/train': 1.7656502723693848} 11/07/2021 08:19:17 - INFO - __main__ - Step 78744: {'lr': 0.00023545182376181556, 'samples': 15118848, 'steps': 78743, 'loss/train': 1.3036110401153564} 11/07/2021 08:19:18 - INFO - __main__ - Step 78745: {'lr': 0.00023544652601372162, 'samples': 15119040, 'steps': 78744, 'loss/train': 1.5911122560501099} 11/07/2021 08:19:18 - INFO - __main__ - Step 78746: {'lr': 0.0002354412282721852, 'samples': 15119232, 'steps': 78745, 'loss/train': 1.5575050115585327} 11/07/2021 08:19:20 - INFO - __main__ - Step 78747: {'lr': 0.00023543593053720871, 'samples': 15119424, 'steps': 78746, 'loss/train': 1.3875850439071655} 11/07/2021 08:19:20 - INFO - __main__ - Step 78748: {'lr': 0.00023543063280879458, 'samples': 15119616, 'steps': 78747, 'loss/train': 1.6066099405288696} 11/07/2021 08:19:20 - INFO - __main__ - Step 78749: {'lr': 0.0002354253350869452, 'samples': 15119808, 'steps': 78748, 'loss/train': 1.470960021018982} 11/07/2021 08:19:21 - INFO - __main__ - Step 78750: {'lr': 0.00023542003737166294, 'samples': 15120000, 'steps': 78749, 'loss/train': 1.1223474740982056} 11/07/2021 08:19:21 - INFO - __main__ - Step 78751: {'lr': 0.0002354147396629502, 'samples': 15120192, 'steps': 78750, 'loss/train': 1.7508946657180786} 11/07/2021 08:19:21 - INFO - __main__ - Step 78752: {'lr': 0.00023540944196080932, 'samples': 15120384, 'steps': 78751, 'loss/train': 1.578732967376709} 11/07/2021 08:19:22 - INFO - __main__ - Step 78753: {'lr': 0.00023540414426524272, 'samples': 15120576, 'steps': 78752, 'loss/train': 0.14157436788082123} 11/07/2021 08:19:23 - INFO - __main__ - Step 78754: {'lr': 0.00023539884657625278, 'samples': 15120768, 'steps': 78753, 'loss/train': 0.30096736550331116} 11/07/2021 08:19:23 - INFO - __main__ - Step 78755: {'lr': 0.00023539354889384192, 'samples': 15120960, 'steps': 78754, 'loss/train': 1.5358916521072388} 11/07/2021 08:19:23 - INFO - __main__ - Step 78756: {'lr': 0.00023538825121801254, 'samples': 15121152, 'steps': 78755, 'loss/train': 1.1386042833328247} 11/07/2021 08:19:24 - INFO - __main__ - Step 78757: {'lr': 0.00023538295354876693, 'samples': 15121344, 'steps': 78756, 'loss/train': 1.5196471214294434} 11/07/2021 08:19:25 - INFO - __main__ - Step 78758: {'lr': 0.00023537765588610754, 'samples': 15121536, 'steps': 78757, 'loss/train': 1.5139065980911255} 11/07/2021 08:19:25 - INFO - __main__ - Step 78759: {'lr': 0.00023537235823003678, 'samples': 15121728, 'steps': 78758, 'loss/train': 0.9450996518135071} 11/07/2021 08:19:25 - INFO - __main__ - Step 78760: {'lr': 0.00023536706058055695, 'samples': 15121920, 'steps': 78759, 'loss/train': 1.7974703311920166} 11/07/2021 08:19:26 - INFO - __main__ - Step 78761: {'lr': 0.00023536176293767057, 'samples': 15122112, 'steps': 78760, 'loss/train': 1.413333773612976} 11/07/2021 08:19:26 - INFO - __main__ - Step 78762: {'lr': 0.00023535646530137988, 'samples': 15122304, 'steps': 78761, 'loss/train': 1.3178094625473022} 11/07/2021 08:19:27 - INFO - __main__ - Step 78763: {'lr': 0.00023535116767168737, 'samples': 15122496, 'steps': 78762, 'loss/train': 1.2109543085098267} 11/07/2021 08:19:28 - INFO - __main__ - Step 78764: {'lr': 0.00023534587004859548, 'samples': 15122688, 'steps': 78763, 'loss/train': 0.28527525067329407} 11/07/2021 08:19:28 - INFO - __main__ - Step 78765: {'lr': 0.00023534057243210644, 'samples': 15122880, 'steps': 78764, 'loss/train': 1.3567603826522827} 11/07/2021 08:19:28 - INFO - __main__ - Step 78766: {'lr': 0.0002353352748222227, 'samples': 15123072, 'steps': 78765, 'loss/train': 1.2556949853897095} 11/07/2021 08:19:29 - INFO - __main__ - Step 78767: {'lr': 0.0002353299772189467, 'samples': 15123264, 'steps': 78766, 'loss/train': 1.1117528676986694} 11/07/2021 08:19:29 - INFO - __main__ - Step 78768: {'lr': 0.00023532467962228076, 'samples': 15123456, 'steps': 78767, 'loss/train': 1.4717881679534912} 11/07/2021 08:19:30 - INFO - __main__ - Step 78769: {'lr': 0.0002353193820322273, 'samples': 15123648, 'steps': 78768, 'loss/train': 1.7151371240615845} 11/07/2021 08:19:30 - INFO - __main__ - Step 78770: {'lr': 0.00023531408444878868, 'samples': 15123840, 'steps': 78769, 'loss/train': 1.193092703819275} 11/07/2021 08:19:31 - INFO - __main__ - Step 78771: {'lr': 0.00023530878687196734, 'samples': 15124032, 'steps': 78770, 'loss/train': 1.4550886154174805} 11/07/2021 08:19:31 - INFO - __main__ - Step 78772: {'lr': 0.0002353034893017656, 'samples': 15124224, 'steps': 78771, 'loss/train': 1.4918019771575928} 11/07/2021 08:19:31 - INFO - __main__ - Step 78773: {'lr': 0.00023529819173818587, 'samples': 15124416, 'steps': 78772, 'loss/train': 1.3634732961654663} 11/07/2021 08:19:32 - INFO - __main__ - Step 78774: {'lr': 0.00023529289418123056, 'samples': 15124608, 'steps': 78773, 'loss/train': 1.5004349946975708} 11/07/2021 08:19:33 - INFO - __main__ - Step 78775: {'lr': 0.00023528759663090209, 'samples': 15124800, 'steps': 78774, 'loss/train': 1.253008484840393} 11/07/2021 08:19:33 - INFO - __main__ - Step 78776: {'lr': 0.00023528229908720272, 'samples': 15124992, 'steps': 78775, 'loss/train': 1.5047191381454468} 11/07/2021 08:19:33 - INFO - __main__ - Step 78777: {'lr': 0.00023527700155013498, 'samples': 15125184, 'steps': 78776, 'loss/train': 1.428037405014038} 11/07/2021 08:19:34 - INFO - __main__ - Step 78778: {'lr': 0.00023527170401970126, 'samples': 15125376, 'steps': 78777, 'loss/train': 1.3197312355041504} 11/07/2021 08:19:35 - INFO - __main__ - Step 78779: {'lr': 0.00023526640649590384, 'samples': 15125568, 'steps': 78778, 'loss/train': 1.0601166486740112} 11/07/2021 08:19:35 - INFO - __main__ - Step 78780: {'lr': 0.0002352611089787451, 'samples': 15125760, 'steps': 78779, 'loss/train': 1.5575069189071655} 11/07/2021 08:19:36 - INFO - __main__ - Step 78781: {'lr': 0.00023525581146822746, 'samples': 15125952, 'steps': 78780, 'loss/train': 1.4182459115982056} 11/07/2021 08:19:36 - INFO - __main__ - Step 78782: {'lr': 0.00023525051396435336, 'samples': 15126144, 'steps': 78781, 'loss/train': 1.3626939058303833} 11/07/2021 08:19:36 - INFO - __main__ - Step 78783: {'lr': 0.00023524521646712515, 'samples': 15126336, 'steps': 78782, 'loss/train': 1.3035415410995483} 11/07/2021 08:19:37 - INFO - __main__ - Step 78784: {'lr': 0.00023523991897654524, 'samples': 15126528, 'steps': 78783, 'loss/train': 1.8247365951538086} 11/07/2021 08:19:38 - INFO - __main__ - Step 78785: {'lr': 0.00023523462149261593, 'samples': 15126720, 'steps': 78784, 'loss/train': 1.1004818677902222} 11/07/2021 08:19:38 - INFO - __main__ - Step 78786: {'lr': 0.00023522932401533973, 'samples': 15126912, 'steps': 78785, 'loss/train': 1.330359935760498} 11/07/2021 08:19:38 - INFO - __main__ - Step 78787: {'lr': 0.00023522402654471895, 'samples': 15127104, 'steps': 78786, 'loss/train': 1.739025354385376} 11/07/2021 08:19:39 - INFO - __main__ - Step 78788: {'lr': 0.00023521872908075598, 'samples': 15127296, 'steps': 78787, 'loss/train': 1.5404126644134521} 11/07/2021 08:19:40 - INFO - __main__ - Step 78789: {'lr': 0.00023521343162345322, 'samples': 15127488, 'steps': 78788, 'loss/train': 1.0229383707046509} 11/07/2021 08:19:40 - INFO - __main__ - Step 78790: {'lr': 0.0002352081341728131, 'samples': 15127680, 'steps': 78789, 'loss/train': 1.1396963596343994} 11/07/2021 08:19:40 - INFO - __main__ - Step 78791: {'lr': 0.00023520283672883802, 'samples': 15127872, 'steps': 78790, 'loss/train': 1.52204167842865} 11/07/2021 08:19:41 - INFO - __main__ - Step 78792: {'lr': 0.00023519753929153022, 'samples': 15128064, 'steps': 78791, 'loss/train': 1.5679070949554443} 11/07/2021 08:19:41 - INFO - __main__ - Step 78793: {'lr': 0.00023519224186089222, 'samples': 15128256, 'steps': 78792, 'loss/train': 1.7231303453445435} 11/07/2021 08:19:42 - INFO - __main__ - Step 78794: {'lr': 0.00023518694443692634, 'samples': 15128448, 'steps': 78793, 'loss/train': 1.4763528108596802} 11/07/2021 08:19:42 - INFO - __main__ - Step 78795: {'lr': 0.000235181647019635, 'samples': 15128640, 'steps': 78794, 'loss/train': 1.2271145582199097} 11/07/2021 08:19:43 - INFO - __main__ - Step 78796: {'lr': 0.00023517634960902058, 'samples': 15128832, 'steps': 78795, 'loss/train': 1.6459277868270874} 11/07/2021 08:19:43 - INFO - __main__ - Step 78797: {'lr': 0.00023517105220508544, 'samples': 15129024, 'steps': 78796, 'loss/train': 1.101730227470398} 11/07/2021 08:19:43 - INFO - __main__ - Step 78798: {'lr': 0.00023516575480783203, 'samples': 15129216, 'steps': 78797, 'loss/train': 1.4442484378814697} 11/07/2021 08:19:44 - INFO - __main__ - Step 78799: {'lr': 0.0002351604574172627, 'samples': 15129408, 'steps': 78798, 'loss/train': 1.649656891822815} 11/07/2021 08:19:45 - INFO - __main__ - Step 78800: {'lr': 0.0002351551600333798, 'samples': 15129600, 'steps': 78799, 'loss/train': 1.59848153591156} 11/07/2021 08:19:45 - INFO - __main__ - Step 78801: {'lr': 0.0002351498626561858, 'samples': 15129792, 'steps': 78800, 'loss/train': 1.4430488348007202} 11/07/2021 08:19:46 - INFO - __main__ - Step 78802: {'lr': 0.00023514456528568305, 'samples': 15129984, 'steps': 78801, 'loss/train': 0.44081300497055054} 11/07/2021 08:19:46 - INFO - __main__ - Step 78803: {'lr': 0.0002351392679218739, 'samples': 15130176, 'steps': 78802, 'loss/train': 0.814163088798523} 11/07/2021 08:19:46 - INFO - __main__ - Step 78804: {'lr': 0.00023513397056476078, 'samples': 15130368, 'steps': 78803, 'loss/train': 1.3176974058151245} 11/07/2021 08:19:47 - INFO - __main__ - Step 78805: {'lr': 0.00023512867321434617, 'samples': 15130560, 'steps': 78804, 'loss/train': 1.7293133735656738} 11/07/2021 08:19:48 - INFO - __main__ - Step 78806: {'lr': 0.00023512337587063223, 'samples': 15130752, 'steps': 78805, 'loss/train': 1.3063490390777588} 11/07/2021 08:19:48 - INFO - __main__ - Step 78807: {'lr': 0.00023511807853362145, 'samples': 15130944, 'steps': 78806, 'loss/train': 1.1344884634017944} 11/07/2021 08:19:48 - INFO - __main__ - Step 78808: {'lr': 0.00023511278120331628, 'samples': 15131136, 'steps': 78807, 'loss/train': 1.2556183338165283} 11/07/2021 08:19:49 - INFO - __main__ - Step 78809: {'lr': 0.00023510748387971903, 'samples': 15131328, 'steps': 78808, 'loss/train': 1.445407748222351} 11/07/2021 08:19:50 - INFO - __main__ - Step 78810: {'lr': 0.00023510218656283213, 'samples': 15131520, 'steps': 78809, 'loss/train': 1.4211081266403198} 11/07/2021 08:19:50 - INFO - __main__ - Step 78811: {'lr': 0.00023509688925265796, 'samples': 15131712, 'steps': 78810, 'loss/train': 1.388648271560669} 11/07/2021 08:19:50 - INFO - __main__ - Step 78812: {'lr': 0.0002350915919491989, 'samples': 15131904, 'steps': 78811, 'loss/train': 1.3923343420028687} 11/07/2021 08:19:51 - INFO - __main__ - Step 78813: {'lr': 0.00023508629465245735, 'samples': 15132096, 'steps': 78812, 'loss/train': 1.419500708580017} 11/07/2021 08:19:51 - INFO - __main__ - Step 78814: {'lr': 0.00023508099736243565, 'samples': 15132288, 'steps': 78813, 'loss/train': 1.66846764087677} 11/07/2021 08:19:52 - INFO - __main__ - Step 78815: {'lr': 0.00023507570007913624, 'samples': 15132480, 'steps': 78814, 'loss/train': 1.061916708946228} 11/07/2021 08:19:52 - INFO - __main__ - Step 78816: {'lr': 0.0002350704028025615, 'samples': 15132672, 'steps': 78815, 'loss/train': 0.9144483804702759} 11/07/2021 08:19:53 - INFO - __main__ - Step 78817: {'lr': 0.0002350651055327138, 'samples': 15132864, 'steps': 78816, 'loss/train': 1.2443746328353882} 11/07/2021 08:19:53 - INFO - __main__ - Step 78818: {'lr': 0.00023505980826959565, 'samples': 15133056, 'steps': 78817, 'loss/train': 1.5328270196914673} 11/07/2021 08:19:54 - INFO - __main__ - Step 78819: {'lr': 0.00023505451101320918, 'samples': 15133248, 'steps': 78818, 'loss/train': 1.3769311904907227} 11/07/2021 08:19:54 - INFO - __main__ - Step 78820: {'lr': 0.00023504921376355696, 'samples': 15133440, 'steps': 78819, 'loss/train': 1.4920454025268555} 11/07/2021 08:19:55 - INFO - __main__ - Step 78821: {'lr': 0.00023504391652064127, 'samples': 15133632, 'steps': 78820, 'loss/train': 1.2609570026397705} 11/07/2021 08:19:55 - INFO - __main__ - Step 78822: {'lr': 0.00023503861928446463, 'samples': 15133824, 'steps': 78821, 'loss/train': 1.152811884880066} 11/07/2021 08:19:56 - INFO - __main__ - Step 78823: {'lr': 0.00023503332205502932, 'samples': 15134016, 'steps': 78822, 'loss/train': 1.7226767539978027} 11/07/2021 08:19:56 - INFO - __main__ - Step 78824: {'lr': 0.00023502802483233776, 'samples': 15134208, 'steps': 78823, 'loss/train': 1.3926432132720947} 11/07/2021 08:19:56 - INFO - __main__ - Step 78825: {'lr': 0.00023502272761639236, 'samples': 15134400, 'steps': 78824, 'loss/train': 0.9899486303329468} 11/07/2021 08:19:57 - INFO - __main__ - Step 78826: {'lr': 0.00023501743040719547, 'samples': 15134592, 'steps': 78825, 'loss/train': 1.269952416419983} 11/07/2021 08:19:58 - INFO - __main__ - Step 78827: {'lr': 0.00023501213320474952, 'samples': 15134784, 'steps': 78826, 'loss/train': 1.0947366952896118} 11/07/2021 08:19:58 - INFO - __main__ - Step 78828: {'lr': 0.00023500683600905686, 'samples': 15134976, 'steps': 78827, 'loss/train': 1.191603660583496} 11/07/2021 08:19:58 - INFO - __main__ - Step 78829: {'lr': 0.00023500153882011988, 'samples': 15135168, 'steps': 78828, 'loss/train': 1.5440046787261963} 11/07/2021 08:19:59 - INFO - __main__ - Step 78830: {'lr': 0.00023499624163794098, 'samples': 15135360, 'steps': 78829, 'loss/train': 2.2161731719970703} 11/07/2021 08:20:00 - INFO - __main__ - Step 78831: {'lr': 0.00023499094446252256, 'samples': 15135552, 'steps': 78830, 'loss/train': 4.954830646514893} 11/07/2021 08:20:00 - INFO - __main__ - Step 78832: {'lr': 0.00023498564729386705, 'samples': 15135744, 'steps': 78831, 'loss/train': 1.4623548984527588} 11/07/2021 08:20:00 - INFO - __main__ - Step 78833: {'lr': 0.00023498035013197672, 'samples': 15135936, 'steps': 78832, 'loss/train': 1.2822858095169067} 11/07/2021 08:20:01 - INFO - __main__ - Step 78834: {'lr': 0.00023497505297685398, 'samples': 15136128, 'steps': 78833, 'loss/train': 1.4647974967956543} 11/07/2021 08:20:01 - INFO - __main__ - Step 78835: {'lr': 0.0002349697558285013, 'samples': 15136320, 'steps': 78834, 'loss/train': 0.786292552947998} 11/07/2021 08:20:02 - INFO - __main__ - Step 78836: {'lr': 0.00023496445868692093, 'samples': 15136512, 'steps': 78835, 'loss/train': 1.4079318046569824} 11/07/2021 08:20:02 - INFO - __main__ - Step 78837: {'lr': 0.00023495916155211538, 'samples': 15136704, 'steps': 78836, 'loss/train': 1.4982831478118896} 11/07/2021 08:20:03 - INFO - __main__ - Step 78838: {'lr': 0.00023495386442408704, 'samples': 15136896, 'steps': 78837, 'loss/train': 1.369764804840088} 11/07/2021 08:20:03 - INFO - __main__ - Step 78839: {'lr': 0.0002349485673028382, 'samples': 15137088, 'steps': 78838, 'loss/train': 1.7835743427276611} 11/07/2021 08:20:03 - INFO - __main__ - Step 78840: {'lr': 0.00023494327018837134, 'samples': 15137280, 'steps': 78839, 'loss/train': 1.584977626800537} 11/07/2021 08:20:05 - INFO - __main__ - Step 78841: {'lr': 0.00023493797308068878, 'samples': 15137472, 'steps': 78840, 'loss/train': 1.5643457174301147} 11/07/2021 08:20:05 - INFO - __main__ - Step 78842: {'lr': 0.00023493267597979298, 'samples': 15137664, 'steps': 78841, 'loss/train': 1.4897798299789429} 11/07/2021 08:20:05 - INFO - __main__ - Step 78843: {'lr': 0.00023492737888568623, 'samples': 15137856, 'steps': 78842, 'loss/train': 1.4253910779953003} 11/07/2021 08:20:06 - INFO - __main__ - Step 78844: {'lr': 0.000234922081798371, 'samples': 15138048, 'steps': 78843, 'loss/train': 1.3820469379425049} 11/07/2021 08:20:06 - INFO - __main__ - Step 78845: {'lr': 0.00023491678471784978, 'samples': 15138240, 'steps': 78844, 'loss/train': 1.448639154434204} 11/07/2021 08:20:07 - INFO - __main__ - Step 78846: {'lr': 0.00023491148764412468, 'samples': 15138432, 'steps': 78845, 'loss/train': 0.5768639445304871} 11/07/2021 08:20:08 - INFO - __main__ - Step 78847: {'lr': 0.00023490619057719826, 'samples': 15138624, 'steps': 78846, 'loss/train': 1.1465294361114502} 11/07/2021 08:20:08 - INFO - __main__ - Step 78848: {'lr': 0.00023490089351707282, 'samples': 15138816, 'steps': 78847, 'loss/train': 1.435262680053711} 11/07/2021 08:20:08 - INFO - __main__ - Step 78849: {'lr': 0.00023489559646375086, 'samples': 15139008, 'steps': 78848, 'loss/train': 1.3186798095703125} 11/07/2021 08:20:09 - INFO - __main__ - Step 78850: {'lr': 0.00023489029941723468, 'samples': 15139200, 'steps': 78849, 'loss/train': 0.9155345559120178} 11/07/2021 08:20:10 - INFO - __main__ - Step 78851: {'lr': 0.00023488500237752675, 'samples': 15139392, 'steps': 78850, 'loss/train': 1.5123435258865356} 11/07/2021 08:20:10 - INFO - __main__ - Step 78852: {'lr': 0.00023487970534462934, 'samples': 15139584, 'steps': 78851, 'loss/train': 1.2385296821594238} 11/07/2021 08:20:10 - INFO - __main__ - Step 78853: {'lr': 0.00023487440831854492, 'samples': 15139776, 'steps': 78852, 'loss/train': 1.000961422920227} 11/07/2021 08:20:11 - INFO - __main__ - Step 78854: {'lr': 0.00023486911129927588, 'samples': 15139968, 'steps': 78853, 'loss/train': 0.2035507708787918} 11/07/2021 08:20:11 - INFO - __main__ - Step 78855: {'lr': 0.00023486381428682458, 'samples': 15140160, 'steps': 78854, 'loss/train': 1.5139714479446411} 11/07/2021 08:20:12 - INFO - __main__ - Step 78856: {'lr': 0.0002348585172811934, 'samples': 15140352, 'steps': 78855, 'loss/train': 1.651902198791504} 11/07/2021 08:20:12 - INFO - __main__ - Step 78857: {'lr': 0.00023485322028238474, 'samples': 15140544, 'steps': 78856, 'loss/train': 1.3368909358978271} 11/07/2021 08:20:13 - INFO - __main__ - Step 78858: {'lr': 0.00023484792329040105, 'samples': 15140736, 'steps': 78857, 'loss/train': 1.1306445598602295} 11/07/2021 08:20:13 - INFO - __main__ - Step 78859: {'lr': 0.00023484262630524464, 'samples': 15140928, 'steps': 78858, 'loss/train': 1.8436537981033325} 11/07/2021 08:20:13 - INFO - __main__ - Step 78860: {'lr': 0.00023483732932691784, 'samples': 15141120, 'steps': 78859, 'loss/train': 1.5956246852874756} 11/07/2021 08:20:14 - INFO - __main__ - Step 78861: {'lr': 0.00023483203235542314, 'samples': 15141312, 'steps': 78860, 'loss/train': 1.497064471244812} 11/07/2021 08:20:15 - INFO - __main__ - Step 78862: {'lr': 0.00023482673539076287, 'samples': 15141504, 'steps': 78861, 'loss/train': 1.0911563634872437} 11/07/2021 08:20:15 - INFO - __main__ - Step 78863: {'lr': 0.00023482143843293944, 'samples': 15141696, 'steps': 78862, 'loss/train': 1.4758939743041992} 11/07/2021 08:20:15 - INFO - __main__ - Step 78864: {'lr': 0.00023481614148195524, 'samples': 15141888, 'steps': 78863, 'loss/train': 1.0740888118743896} 11/07/2021 08:20:16 - INFO - __main__ - Step 78865: {'lr': 0.00023481084453781266, 'samples': 15142080, 'steps': 78864, 'loss/train': 1.3037925958633423} 11/07/2021 08:20:17 - INFO - __main__ - Step 78866: {'lr': 0.00023480554760051407, 'samples': 15142272, 'steps': 78865, 'loss/train': 1.4260854721069336} 11/07/2021 08:20:17 - INFO - __main__ - Step 78867: {'lr': 0.00023480025067006187, 'samples': 15142464, 'steps': 78866, 'loss/train': 1.5105661153793335} 11/07/2021 08:20:18 - INFO - __main__ - Step 78868: {'lr': 0.00023479495374645844, 'samples': 15142656, 'steps': 78867, 'loss/train': 1.0462582111358643} 11/07/2021 08:20:18 - INFO - __main__ - Step 78869: {'lr': 0.00023478965682970622, 'samples': 15142848, 'steps': 78868, 'loss/train': 1.7078030109405518} 11/07/2021 08:20:18 - INFO - __main__ - Step 78870: {'lr': 0.00023478435991980748, 'samples': 15143040, 'steps': 78869, 'loss/train': 0.9526431560516357} 11/07/2021 08:20:19 - INFO - __main__ - Step 78871: {'lr': 0.0002347790630167647, 'samples': 15143232, 'steps': 78870, 'loss/train': 1.481642484664917} 11/07/2021 08:20:20 - INFO - __main__ - Step 78872: {'lr': 0.00023477376612058028, 'samples': 15143424, 'steps': 78871, 'loss/train': 1.1791157722473145} 11/07/2021 08:20:20 - INFO - __main__ - Step 78873: {'lr': 0.0002347684692312565, 'samples': 15143616, 'steps': 78872, 'loss/train': 1.408525824546814} 11/07/2021 08:20:20 - INFO - __main__ - Step 78874: {'lr': 0.00023476317234879583, 'samples': 15143808, 'steps': 78873, 'loss/train': 1.6569455862045288} 11/07/2021 08:20:21 - INFO - __main__ - Step 78875: {'lr': 0.00023475787547320062, 'samples': 15144000, 'steps': 78874, 'loss/train': 0.5933690667152405} 11/07/2021 08:20:21 - INFO - __main__ - Step 78876: {'lr': 0.0002347525786044733, 'samples': 15144192, 'steps': 78875, 'loss/train': 1.210984706878662} 11/07/2021 08:20:22 - INFO - __main__ - Step 78877: {'lr': 0.00023474728174261624, 'samples': 15144384, 'steps': 78876, 'loss/train': 1.0771452188491821} 11/07/2021 08:20:23 - INFO - __main__ - Step 78878: {'lr': 0.0002347419848876318, 'samples': 15144576, 'steps': 78877, 'loss/train': 0.9432421922683716} 11/07/2021 08:20:23 - INFO - __main__ - Step 78879: {'lr': 0.0002347366880395224, 'samples': 15144768, 'steps': 78878, 'loss/train': 1.4264867305755615} 11/07/2021 08:20:23 - INFO - __main__ - Step 78880: {'lr': 0.00023473139119829046, 'samples': 15144960, 'steps': 78879, 'loss/train': 1.1953630447387695} 11/07/2021 08:20:24 - INFO - __main__ - Step 78881: {'lr': 0.00023472609436393823, 'samples': 15145152, 'steps': 78880, 'loss/train': 1.4842424392700195} 11/07/2021 08:20:25 - INFO - __main__ - Step 78882: {'lr': 0.00023472079753646824, 'samples': 15145344, 'steps': 78881, 'loss/train': 1.5268317461013794} 11/07/2021 08:20:25 - INFO - __main__ - Step 78883: {'lr': 0.0002347155007158828, 'samples': 15145536, 'steps': 78882, 'loss/train': 1.6539921760559082} 11/07/2021 08:20:25 - INFO - __main__ - Step 78884: {'lr': 0.00023471020390218432, 'samples': 15145728, 'steps': 78883, 'loss/train': 1.4346773624420166} 11/07/2021 08:20:26 - INFO - __main__ - Step 78885: {'lr': 0.00023470490709537523, 'samples': 15145920, 'steps': 78884, 'loss/train': 1.4095085859298706} 11/07/2021 08:20:26 - INFO - __main__ - Step 78886: {'lr': 0.00023469961029545783, 'samples': 15146112, 'steps': 78885, 'loss/train': 0.46947038173675537} 11/07/2021 08:20:27 - INFO - __main__ - Step 78887: {'lr': 0.00023469431350243457, 'samples': 15146304, 'steps': 78886, 'loss/train': 1.3046505451202393} 11/07/2021 08:20:28 - INFO - __main__ - Step 78888: {'lr': 0.00023468901671630776, 'samples': 15146496, 'steps': 78887, 'loss/train': 1.091797947883606} 11/07/2021 08:20:28 - INFO - __main__ - Step 78889: {'lr': 0.0002346837199370799, 'samples': 15146688, 'steps': 78888, 'loss/train': 1.0805253982543945} 11/07/2021 08:20:28 - INFO - __main__ - Step 78890: {'lr': 0.00023467842316475328, 'samples': 15146880, 'steps': 78889, 'loss/train': 1.4559996128082275} 11/07/2021 08:20:29 - INFO - __main__ - Step 78891: {'lr': 0.00023467312639933042, 'samples': 15147072, 'steps': 78890, 'loss/train': 1.27452552318573} 11/07/2021 08:20:30 - INFO - __main__ - Step 78892: {'lr': 0.00023466782964081352, 'samples': 15147264, 'steps': 78891, 'loss/train': 1.5633394718170166} 11/07/2021 08:20:30 - INFO - __main__ - Step 78893: {'lr': 0.00023466253288920508, 'samples': 15147456, 'steps': 78892, 'loss/train': 1.3725790977478027} 11/07/2021 08:20:30 - INFO - __main__ - Step 78894: {'lr': 0.00023465723614450744, 'samples': 15147648, 'steps': 78893, 'loss/train': 1.9665026664733887} 11/07/2021 08:20:31 - INFO - __main__ - Step 78895: {'lr': 0.00023465193940672307, 'samples': 15147840, 'steps': 78894, 'loss/train': 0.9372266530990601} 11/07/2021 08:20:31 - INFO - __main__ - Step 78896: {'lr': 0.00023464664267585424, 'samples': 15148032, 'steps': 78895, 'loss/train': 0.11049840599298477} 11/07/2021 08:20:32 - INFO - __main__ - Step 78897: {'lr': 0.00023464134595190341, 'samples': 15148224, 'steps': 78896, 'loss/train': 1.713075876235962} 11/07/2021 08:20:32 - INFO - __main__ - Step 78898: {'lr': 0.00023463604923487297, 'samples': 15148416, 'steps': 78897, 'loss/train': 1.1307874917984009} 11/07/2021 08:20:33 - INFO - __main__ - Step 78899: {'lr': 0.00023463075252476534, 'samples': 15148608, 'steps': 78898, 'loss/train': 1.3998067378997803} 11/07/2021 08:20:33 - INFO - __main__ - Step 78900: {'lr': 0.0002346254558215828, 'samples': 15148800, 'steps': 78899, 'loss/train': 1.5017876625061035} 11/07/2021 08:20:34 - INFO - __main__ - Step 78901: {'lr': 0.00023462015912532782, 'samples': 15148992, 'steps': 78900, 'loss/train': 1.387466311454773} 11/07/2021 08:20:35 - INFO - __main__ - Step 78902: {'lr': 0.00023461486243600275, 'samples': 15149184, 'steps': 78901, 'loss/train': 0.9821335077285767} 11/07/2021 08:20:35 - INFO - __main__ - Step 78903: {'lr': 0.00023460956575360997, 'samples': 15149376, 'steps': 78902, 'loss/train': 1.2463085651397705} 11/07/2021 08:20:35 - INFO - __main__ - Step 78904: {'lr': 0.00023460426907815184, 'samples': 15149568, 'steps': 78903, 'loss/train': 1.2995775938034058} 11/07/2021 08:20:36 - INFO - __main__ - Step 78905: {'lr': 0.00023459897240963085, 'samples': 15149760, 'steps': 78904, 'loss/train': 2.3992536067962646} 11/07/2021 08:20:36 - INFO - __main__ - Step 78906: {'lr': 0.0002345936757480493, 'samples': 15149952, 'steps': 78905, 'loss/train': 1.7229299545288086} 11/07/2021 08:20:36 - INFO - __main__ - Step 78907: {'lr': 0.00023458837909340962, 'samples': 15150144, 'steps': 78906, 'loss/train': 1.5482691526412964} 11/07/2021 08:20:37 - INFO - __main__ - Step 78908: {'lr': 0.00023458308244571414, 'samples': 15150336, 'steps': 78907, 'loss/train': 1.2042595148086548} 11/07/2021 08:20:38 - INFO - __main__ - Step 78909: {'lr': 0.00023457778580496531, 'samples': 15150528, 'steps': 78908, 'loss/train': 1.7615898847579956} 11/07/2021 08:20:38 - INFO - __main__ - Step 78910: {'lr': 0.0002345724891711655, 'samples': 15150720, 'steps': 78909, 'loss/train': 1.6363855600357056} 11/07/2021 08:20:39 - INFO - __main__ - Step 78911: {'lr': 0.00023456719254431708, 'samples': 15150912, 'steps': 78910, 'loss/train': 1.2085970640182495} 11/07/2021 08:20:39 - INFO - __main__ - Step 78912: {'lr': 0.00023456189592442253, 'samples': 15151104, 'steps': 78911, 'loss/train': 1.7294037342071533} 11/07/2021 08:20:40 - INFO - __main__ - Step 78913: {'lr': 0.00023455659931148406, 'samples': 15151296, 'steps': 78912, 'loss/train': 0.7925570607185364} 11/07/2021 08:20:40 - INFO - __main__ - Step 78914: {'lr': 0.00023455130270550416, 'samples': 15151488, 'steps': 78913, 'loss/train': 1.392969012260437} 11/07/2021 08:20:41 - INFO - __main__ - Step 78915: {'lr': 0.0002345460061064852, 'samples': 15151680, 'steps': 78914, 'loss/train': 1.0358035564422607} 11/07/2021 08:20:41 - INFO - __main__ - Step 78916: {'lr': 0.00023454070951442954, 'samples': 15151872, 'steps': 78915, 'loss/train': 1.5007683038711548} 11/07/2021 08:20:41 - INFO - __main__ - Step 78917: {'lr': 0.00023453541292933964, 'samples': 15152064, 'steps': 78916, 'loss/train': 1.1427826881408691} 11/07/2021 08:20:42 - INFO - __main__ - Step 78918: {'lr': 0.00023453011635121782, 'samples': 15152256, 'steps': 78917, 'loss/train': 1.2714699506759644} 11/07/2021 08:20:43 - INFO - __main__ - Step 78919: {'lr': 0.0002345248197800665, 'samples': 15152448, 'steps': 78918, 'loss/train': 1.02646005153656} 11/07/2021 08:20:43 - INFO - __main__ - Step 78920: {'lr': 0.00023451952321588808, 'samples': 15152640, 'steps': 78919, 'loss/train': 0.4648517966270447} 11/07/2021 08:20:43 - INFO - __main__ - Step 78921: {'lr': 0.0002345142266586849, 'samples': 15152832, 'steps': 78920, 'loss/train': 1.0738226175308228} 11/07/2021 08:20:44 - INFO - __main__ - Step 78922: {'lr': 0.00023450893010845935, 'samples': 15153024, 'steps': 78921, 'loss/train': 1.4947274923324585} 11/07/2021 08:20:44 - INFO - __main__ - Step 78923: {'lr': 0.00023450363356521386, 'samples': 15153216, 'steps': 78922, 'loss/train': 1.4988887310028076} 11/07/2021 08:20:45 - INFO - __main__ - Step 78924: {'lr': 0.00023449833702895079, 'samples': 15153408, 'steps': 78923, 'loss/train': 1.1363389492034912} 11/07/2021 08:20:46 - INFO - __main__ - Step 78925: {'lr': 0.00023449304049967252, 'samples': 15153600, 'steps': 78924, 'loss/train': 1.6763062477111816} 11/07/2021 08:20:46 - INFO - __main__ - Step 78926: {'lr': 0.00023448774397738157, 'samples': 15153792, 'steps': 78925, 'loss/train': 1.4428927898406982} 11/07/2021 08:20:46 - INFO - __main__ - Step 78927: {'lr': 0.00023448244746208008, 'samples': 15153984, 'steps': 78926, 'loss/train': 1.149531364440918} 11/07/2021 08:20:47 - INFO - __main__ - Step 78928: {'lr': 0.00023447715095377059, 'samples': 15154176, 'steps': 78927, 'loss/train': 1.6099770069122314} 11/07/2021 08:20:48 - INFO - __main__ - Step 78929: {'lr': 0.00023447185445245544, 'samples': 15154368, 'steps': 78928, 'loss/train': 1.3625353574752808} 11/07/2021 08:20:48 - INFO - __main__ - Step 78930: {'lr': 0.00023446655795813704, 'samples': 15154560, 'steps': 78929, 'loss/train': 1.5125863552093506} 11/07/2021 08:20:49 - INFO - __main__ - Step 78931: {'lr': 0.00023446126147081775, 'samples': 15154752, 'steps': 78930, 'loss/train': 0.07313112169504166} 11/07/2021 08:20:49 - INFO - __main__ - Step 78932: {'lr': 0.00023445596499049997, 'samples': 15154944, 'steps': 78931, 'loss/train': 1.208801031112671} 11/07/2021 08:20:49 - INFO - __main__ - Step 78933: {'lr': 0.00023445066851718611, 'samples': 15155136, 'steps': 78932, 'loss/train': 1.1257461309432983} 11/07/2021 08:20:50 - INFO - __main__ - Step 78934: {'lr': 0.00023444537205087853, 'samples': 15155328, 'steps': 78933, 'loss/train': 1.3791240453720093} 11/07/2021 08:20:51 - INFO - __main__ - Step 78935: {'lr': 0.00023444007559157964, 'samples': 15155520, 'steps': 78934, 'loss/train': 1.5946508646011353} 11/07/2021 08:20:51 - INFO - __main__ - Step 78936: {'lr': 0.00023443477913929182, 'samples': 15155712, 'steps': 78935, 'loss/train': 1.7040854692459106} 11/07/2021 08:20:51 - INFO - __main__ - Step 78937: {'lr': 0.00023442948269401743, 'samples': 15155904, 'steps': 78936, 'loss/train': 1.3189224004745483} 11/07/2021 08:20:52 - INFO - __main__ - Step 78938: {'lr': 0.00023442418625575887, 'samples': 15156096, 'steps': 78937, 'loss/train': 1.4564920663833618} 11/07/2021 08:20:52 - INFO - __main__ - Step 78939: {'lr': 0.00023441888982451864, 'samples': 15156288, 'steps': 78938, 'loss/train': 1.3625301122665405} 11/07/2021 08:20:53 - INFO - __main__ - Step 78940: {'lr': 0.00023441359340029892, 'samples': 15156480, 'steps': 78939, 'loss/train': 1.2245279550552368} 11/07/2021 08:20:53 - INFO - __main__ - Step 78941: {'lr': 0.00023440829698310217, 'samples': 15156672, 'steps': 78940, 'loss/train': 0.9183041453361511} 11/07/2021 08:20:54 - INFO - __main__ - Step 78942: {'lr': 0.00023440300057293083, 'samples': 15156864, 'steps': 78941, 'loss/train': 1.4539763927459717} 11/07/2021 08:20:54 - INFO - __main__ - Step 78943: {'lr': 0.00023439770416978724, 'samples': 15157056, 'steps': 78942, 'loss/train': 1.8258615732192993} 11/07/2021 08:20:55 - INFO - __main__ - Step 78944: {'lr': 0.0002343924077736738, 'samples': 15157248, 'steps': 78943, 'loss/train': 1.3736017942428589} 11/07/2021 08:20:55 - INFO - __main__ - Step 78945: {'lr': 0.00023438711138459292, 'samples': 15157440, 'steps': 78944, 'loss/train': 1.6336915493011475} 11/07/2021 08:20:56 - INFO - __main__ - Step 78946: {'lr': 0.00023438181500254695, 'samples': 15157632, 'steps': 78945, 'loss/train': 0.6913437843322754} 11/07/2021 08:20:56 - INFO - __main__ - Step 78947: {'lr': 0.00023437651862753833, 'samples': 15157824, 'steps': 78946, 'loss/train': 1.217934012413025} 11/07/2021 08:20:57 - INFO - __main__ - Step 78948: {'lr': 0.0002343712222595694, 'samples': 15158016, 'steps': 78947, 'loss/train': 1.3597526550292969} 11/07/2021 08:20:57 - INFO - __main__ - Step 78949: {'lr': 0.00023436592589864253, 'samples': 15158208, 'steps': 78948, 'loss/train': 1.5254743099212646} 11/07/2021 08:20:57 - INFO - __main__ - Step 78950: {'lr': 0.00023436062954476013, 'samples': 15158400, 'steps': 78949, 'loss/train': 1.0545237064361572} 11/07/2021 08:20:58 - INFO - __main__ - Step 78951: {'lr': 0.0002343553331979246, 'samples': 15158592, 'steps': 78950, 'loss/train': 1.5773332118988037} 11/07/2021 08:20:59 - INFO - __main__ - Step 78952: {'lr': 0.0002343500368581383, 'samples': 15158784, 'steps': 78951, 'loss/train': 1.2757328748703003} 11/07/2021 08:20:59 - INFO - __main__ - Step 78953: {'lr': 0.00023434474052540377, 'samples': 15158976, 'steps': 78952, 'loss/train': 1.5580739974975586} 11/07/2021 08:20:59 - INFO - __main__ - Step 78954: {'lr': 0.00023433944419972314, 'samples': 15159168, 'steps': 78953, 'loss/train': 1.2468165159225464} 11/07/2021 08:21:00 - INFO - __main__ - Step 78955: {'lr': 0.0002343341478810989, 'samples': 15159360, 'steps': 78954, 'loss/train': 0.977459728717804} 11/07/2021 08:21:01 - INFO - __main__ - Step 78956: {'lr': 0.00023432885156953346, 'samples': 15159552, 'steps': 78955, 'loss/train': 0.6595324277877808} 11/07/2021 08:21:01 - INFO - __main__ - Step 78957: {'lr': 0.0002343235552650292, 'samples': 15159744, 'steps': 78956, 'loss/train': 1.2378205060958862} 11/07/2021 08:21:01 - INFO - __main__ - Step 78958: {'lr': 0.0002343182589675885, 'samples': 15159936, 'steps': 78957, 'loss/train': 1.7019208669662476} 11/07/2021 08:21:02 - INFO - __main__ - Step 78959: {'lr': 0.00023431296267721374, 'samples': 15160128, 'steps': 78958, 'loss/train': 1.605518102645874} 11/07/2021 08:21:02 - INFO - __main__ - Step 78960: {'lr': 0.00023430766639390732, 'samples': 15160320, 'steps': 78959, 'loss/train': 1.5418118238449097} 11/07/2021 08:21:03 - INFO - __main__ - Step 78961: {'lr': 0.00023430237011767165, 'samples': 15160512, 'steps': 78960, 'loss/train': 1.134387731552124} 11/07/2021 08:21:04 - INFO - __main__ - Step 78962: {'lr': 0.00023429707384850908, 'samples': 15160704, 'steps': 78961, 'loss/train': 1.4847259521484375} 11/07/2021 08:21:04 - INFO - __main__ - Step 78963: {'lr': 0.000234291777586422, 'samples': 15160896, 'steps': 78962, 'loss/train': 1.968131422996521} 11/07/2021 08:21:04 - INFO - __main__ - Step 78964: {'lr': 0.0002342864813314128, 'samples': 15161088, 'steps': 78963, 'loss/train': 1.2314051389694214} 11/07/2021 08:21:05 - INFO - __main__ - Step 78965: {'lr': 0.00023428118508348386, 'samples': 15161280, 'steps': 78964, 'loss/train': 1.147827386856079} 11/07/2021 08:21:06 - INFO - __main__ - Step 78966: {'lr': 0.0002342758888426377, 'samples': 15161472, 'steps': 78965, 'loss/train': 1.3301243782043457} 11/07/2021 08:21:06 - INFO - __main__ - Step 78967: {'lr': 0.00023427059260887649, 'samples': 15161664, 'steps': 78966, 'loss/train': 1.577296257019043} 11/07/2021 08:21:06 - INFO - __main__ - Step 78968: {'lr': 0.00023426529638220268, 'samples': 15161856, 'steps': 78967, 'loss/train': 1.6240477561950684} 11/07/2021 08:21:07 - INFO - __main__ - Step 78969: {'lr': 0.00023426000016261867, 'samples': 15162048, 'steps': 78968, 'loss/train': 1.7767510414123535} 11/07/2021 08:21:07 - INFO - __main__ - Step 78970: {'lr': 0.00023425470395012688, 'samples': 15162240, 'steps': 78969, 'loss/train': 1.4588532447814941} 11/07/2021 08:21:08 - INFO - __main__ - Step 78971: {'lr': 0.0002342494077447297, 'samples': 15162432, 'steps': 78970, 'loss/train': 1.0888723134994507} 11/07/2021 08:21:08 - INFO - __main__ - Step 78972: {'lr': 0.00023424411154642947, 'samples': 15162624, 'steps': 78971, 'loss/train': 1.5490411520004272} 11/07/2021 08:21:09 - INFO - __main__ - Step 78973: {'lr': 0.0002342388153552286, 'samples': 15162816, 'steps': 78972, 'loss/train': 1.3275376558303833} 11/07/2021 08:21:09 - INFO - __main__ - Step 78974: {'lr': 0.0002342335191711295, 'samples': 15163008, 'steps': 78973, 'loss/train': 1.297610878944397} 11/07/2021 08:21:09 - INFO - __main__ - Step 78975: {'lr': 0.00023422822299413448, 'samples': 15163200, 'steps': 78974, 'loss/train': 1.567434549331665} 11/07/2021 08:21:11 - INFO - __main__ - Step 78976: {'lr': 0.00023422292682424603, 'samples': 15163392, 'steps': 78975, 'loss/train': 1.4241106510162354} 11/07/2021 08:21:11 - INFO - __main__ - Step 78977: {'lr': 0.00023421763066146646, 'samples': 15163584, 'steps': 78976, 'loss/train': 1.3530325889587402} 11/07/2021 08:21:11 - INFO - __main__ - Step 78978: {'lr': 0.0002342123345057982, 'samples': 15163776, 'steps': 78977, 'loss/train': 1.7925827503204346} 11/07/2021 08:21:12 - INFO - __main__ - Step 78979: {'lr': 0.0002342070383572436, 'samples': 15163968, 'steps': 78978, 'loss/train': 1.411354899406433} 11/07/2021 08:21:12 - INFO - __main__ - Step 78980: {'lr': 0.00023420174221580516, 'samples': 15164160, 'steps': 78979, 'loss/train': 1.330289602279663} 11/07/2021 08:21:13 - INFO - __main__ - Step 78981: {'lr': 0.0002341964460814851, 'samples': 15164352, 'steps': 78980, 'loss/train': 1.0763057470321655} 11/07/2021 08:21:13 - INFO - __main__ - Step 78982: {'lr': 0.00023419114995428585, 'samples': 15164544, 'steps': 78981, 'loss/train': 1.3688822984695435} 11/07/2021 08:21:14 - INFO - __main__ - Step 78983: {'lr': 0.00023418585383420986, 'samples': 15164736, 'steps': 78982, 'loss/train': 0.8268703818321228} 11/07/2021 08:21:14 - INFO - __main__ - Step 78984: {'lr': 0.00023418055772125946, 'samples': 15164928, 'steps': 78983, 'loss/train': 1.0336565971374512} 11/07/2021 08:21:14 - INFO - __main__ - Step 78985: {'lr': 0.00023417526161543704, 'samples': 15165120, 'steps': 78984, 'loss/train': 1.2229394912719727} 11/07/2021 08:21:15 - INFO - __main__ - Step 78986: {'lr': 0.00023416996551674503, 'samples': 15165312, 'steps': 78985, 'loss/train': 1.0825941562652588} 11/07/2021 08:21:16 - INFO - __main__ - Step 78987: {'lr': 0.00023416466942518578, 'samples': 15165504, 'steps': 78986, 'loss/train': 1.5312813520431519} 11/07/2021 08:21:16 - INFO - __main__ - Step 78988: {'lr': 0.00023415937334076169, 'samples': 15165696, 'steps': 78987, 'loss/train': 1.3108470439910889} 11/07/2021 08:21:16 - INFO - __main__ - Step 78989: {'lr': 0.00023415407726347509, 'samples': 15165888, 'steps': 78988, 'loss/train': 1.2697688341140747} 11/07/2021 08:21:17 - INFO - __main__ - Step 78990: {'lr': 0.0002341487811933285, 'samples': 15166080, 'steps': 78989, 'loss/train': 1.4919759035110474} 11/07/2021 08:21:17 - INFO - __main__ - Step 78991: {'lr': 0.00023414348513032415, 'samples': 15166272, 'steps': 78990, 'loss/train': 1.5379470586776733} 11/07/2021 08:21:18 - INFO - __main__ - Step 78992: {'lr': 0.0002341381890744646, 'samples': 15166464, 'steps': 78991, 'loss/train': 1.0763914585113525} 11/07/2021 08:21:18 - INFO - __main__ - Step 78993: {'lr': 0.00023413289302575213, 'samples': 15166656, 'steps': 78992, 'loss/train': 1.3975589275360107} 11/07/2021 08:21:19 - INFO - __main__ - Step 78994: {'lr': 0.0002341275969841891, 'samples': 15166848, 'steps': 78993, 'loss/train': 1.4676454067230225} 11/07/2021 08:21:19 - INFO - __main__ - Step 78995: {'lr': 0.00023412230094977787, 'samples': 15167040, 'steps': 78994, 'loss/train': 1.4952449798583984} 11/07/2021 08:21:20 - INFO - __main__ - Step 78996: {'lr': 0.00023411700492252094, 'samples': 15167232, 'steps': 78995, 'loss/train': 1.5041698217391968} 11/07/2021 08:21:20 - INFO - __main__ - Step 78997: {'lr': 0.0002341117089024206, 'samples': 15167424, 'steps': 78996, 'loss/train': 0.845630407333374} 11/07/2021 08:21:21 - INFO - __main__ - Step 78998: {'lr': 0.00023410641288947935, 'samples': 15167616, 'steps': 78997, 'loss/train': 1.5714654922485352} 11/07/2021 08:21:21 - INFO - __main__ - Step 78999: {'lr': 0.00023410111688369946, 'samples': 15167808, 'steps': 78998, 'loss/train': 2.0994911193847656} 11/07/2021 08:21:22 - INFO - __main__ - Step 79000: {'lr': 0.00023409582088508335, 'samples': 15168000, 'steps': 78999, 'loss/train': 1.4320629835128784} 11/07/2021 08:21:22 - INFO - __main__ - Step 79001: {'lr': 0.00023409052489363342, 'samples': 15168192, 'steps': 79000, 'loss/train': 1.4972175359725952} 11/07/2021 08:21:23 - INFO - __main__ - Step 79002: {'lr': 0.00023408522890935206, 'samples': 15168384, 'steps': 79001, 'loss/train': 1.4329948425292969} 11/07/2021 08:21:23 - INFO - __main__ - Step 79003: {'lr': 0.00023407993293224173, 'samples': 15168576, 'steps': 79002, 'loss/train': 1.0187675952911377} 11/07/2021 08:21:24 - INFO - __main__ - Step 79004: {'lr': 0.00023407463696230462, 'samples': 15168768, 'steps': 79003, 'loss/train': 1.2416104078292847} 11/07/2021 08:21:24 - INFO - __main__ - Step 79005: {'lr': 0.0002340693409995433, 'samples': 15168960, 'steps': 79004, 'loss/train': 1.928124189376831} 11/07/2021 08:21:24 - INFO - __main__ - Step 79006: {'lr': 0.00023406404504396013, 'samples': 15169152, 'steps': 79005, 'loss/train': 1.112452507019043} 11/07/2021 08:21:25 - INFO - __main__ - Step 79007: {'lr': 0.00023405874909555738, 'samples': 15169344, 'steps': 79006, 'loss/train': 1.3063738346099854} 11/07/2021 08:21:26 - INFO - __main__ - Step 79008: {'lr': 0.0002340534531543375, 'samples': 15169536, 'steps': 79007, 'loss/train': 1.2307993173599243} 11/07/2021 08:21:26 - INFO - __main__ - Step 79009: {'lr': 0.00023404815722030293, 'samples': 15169728, 'steps': 79008, 'loss/train': 1.8207272291183472} 11/07/2021 08:21:27 - INFO - __main__ - Step 79010: {'lr': 0.00023404286129345597, 'samples': 15169920, 'steps': 79009, 'loss/train': 1.3764091730117798} 11/07/2021 08:21:27 - INFO - __main__ - Step 79011: {'lr': 0.00023403756537379908, 'samples': 15170112, 'steps': 79010, 'loss/train': 1.5755562782287598} 11/07/2021 08:21:28 - INFO - __main__ - Step 79012: {'lr': 0.0002340322694613346, 'samples': 15170304, 'steps': 79011, 'loss/train': 1.6754965782165527} 11/07/2021 08:21:28 - INFO - __main__ - Step 79013: {'lr': 0.00023402697355606495, 'samples': 15170496, 'steps': 79012, 'loss/train': 1.0487126111984253} 11/07/2021 08:21:29 - INFO - __main__ - Step 79014: {'lr': 0.00023402167765799255, 'samples': 15170688, 'steps': 79013, 'loss/train': 1.851610541343689} 11/07/2021 08:21:29 - INFO - __main__ - Step 79015: {'lr': 0.00023401638176711968, 'samples': 15170880, 'steps': 79014, 'loss/train': 1.5900989770889282} 11/07/2021 08:21:29 - INFO - __main__ - Step 79016: {'lr': 0.00023401108588344877, 'samples': 15171072, 'steps': 79015, 'loss/train': 0.859584391117096} 11/07/2021 08:21:30 - INFO - __main__ - Step 79017: {'lr': 0.00023400579000698222, 'samples': 15171264, 'steps': 79016, 'loss/train': 1.606259822845459} 11/07/2021 08:21:31 - INFO - __main__ - Step 79018: {'lr': 0.00023400049413772243, 'samples': 15171456, 'steps': 79017, 'loss/train': 1.2995742559432983} 11/07/2021 08:21:31 - INFO - __main__ - Step 79019: {'lr': 0.00023399519827567176, 'samples': 15171648, 'steps': 79018, 'loss/train': 1.0465563535690308} 11/07/2021 08:21:31 - INFO - __main__ - Step 79020: {'lr': 0.00023398990242083265, 'samples': 15171840, 'steps': 79019, 'loss/train': 1.0839457511901855} 11/07/2021 08:21:32 - INFO - __main__ - Step 79021: {'lr': 0.00023398460657320742, 'samples': 15172032, 'steps': 79020, 'loss/train': 0.7729608416557312} 11/07/2021 08:21:32 - INFO - __main__ - Step 79022: {'lr': 0.00023397931073279842, 'samples': 15172224, 'steps': 79021, 'loss/train': 1.2846758365631104} 11/07/2021 08:21:33 - INFO - __main__ - Step 79023: {'lr': 0.00023397401489960815, 'samples': 15172416, 'steps': 79022, 'loss/train': 1.0448323488235474} 11/07/2021 08:21:34 - INFO - __main__ - Step 79024: {'lr': 0.00023396871907363894, 'samples': 15172608, 'steps': 79023, 'loss/train': 1.6574007272720337} 11/07/2021 08:21:34 - INFO - __main__ - Step 79025: {'lr': 0.0002339634232548932, 'samples': 15172800, 'steps': 79024, 'loss/train': 1.289030909538269} 11/07/2021 08:21:34 - INFO - __main__ - Step 79026: {'lr': 0.00023395812744337328, 'samples': 15172992, 'steps': 79025, 'loss/train': 1.6367663145065308} 11/07/2021 08:21:35 - INFO - __main__ - Step 79027: {'lr': 0.00023395283163908155, 'samples': 15173184, 'steps': 79026, 'loss/train': 1.6455258131027222} 11/07/2021 08:21:36 - INFO - __main__ - Step 79028: {'lr': 0.00023394753584202044, 'samples': 15173376, 'steps': 79027, 'loss/train': 1.9182629585266113} 11/07/2021 08:21:36 - INFO - __main__ - Step 79029: {'lr': 0.0002339422400521923, 'samples': 15173568, 'steps': 79028, 'loss/train': 1.5408953428268433} 11/07/2021 08:21:37 - INFO - __main__ - Step 79030: {'lr': 0.00023393694426959954, 'samples': 15173760, 'steps': 79029, 'loss/train': 1.368380069732666} 11/07/2021 08:21:37 - INFO - __main__ - Step 79031: {'lr': 0.0002339316484942446, 'samples': 15173952, 'steps': 79030, 'loss/train': 1.488456130027771} 11/07/2021 08:21:37 - INFO - __main__ - Step 79032: {'lr': 0.00023392635272612974, 'samples': 15174144, 'steps': 79031, 'loss/train': 1.543100357055664} 11/07/2021 08:21:38 - INFO - __main__ - Step 79033: {'lr': 0.00023392105696525752, 'samples': 15174336, 'steps': 79032, 'loss/train': 1.3685312271118164} 11/07/2021 08:21:39 - INFO - __main__ - Step 79034: {'lr': 0.00023391576121163017, 'samples': 15174528, 'steps': 79033, 'loss/train': 1.5274136066436768} 11/07/2021 08:21:39 - INFO - __main__ - Step 79035: {'lr': 0.0002339104654652501, 'samples': 15174720, 'steps': 79034, 'loss/train': 1.395469307899475} 11/07/2021 08:21:39 - INFO - __main__ - Step 79036: {'lr': 0.0002339051697261198, 'samples': 15174912, 'steps': 79035, 'loss/train': 1.1119321584701538} 11/07/2021 08:21:40 - INFO - __main__ - Step 79037: {'lr': 0.0002338998739942415, 'samples': 15175104, 'steps': 79036, 'loss/train': 1.4974420070648193} 11/07/2021 08:21:40 - INFO - __main__ - Step 79038: {'lr': 0.0002338945782696177, 'samples': 15175296, 'steps': 79037, 'loss/train': 1.632419466972351} 11/07/2021 08:21:42 - INFO - __main__ - Step 79039: {'lr': 0.00023388928255225073, 'samples': 15175488, 'steps': 79038, 'loss/train': 1.6048450469970703} 11/07/2021 08:21:42 - INFO - __main__ - Step 79040: {'lr': 0.00023388398684214302, 'samples': 15175680, 'steps': 79039, 'loss/train': 1.8700329065322876} 11/07/2021 08:21:42 - INFO - __main__ - Step 79041: {'lr': 0.00023387869113929694, 'samples': 15175872, 'steps': 79040, 'loss/train': 0.5286266803741455} 11/07/2021 08:21:43 - INFO - __main__ - Step 79042: {'lr': 0.00023387339544371486, 'samples': 15176064, 'steps': 79041, 'loss/train': 0.40756067633628845} 11/07/2021 08:21:43 - INFO - __main__ - Step 79043: {'lr': 0.00023386809975539918, 'samples': 15176256, 'steps': 79042, 'loss/train': 1.6055116653442383} 11/07/2021 08:21:44 - INFO - __main__ - Step 79044: {'lr': 0.00023386280407435229, 'samples': 15176448, 'steps': 79043, 'loss/train': 1.3424040079116821} 11/07/2021 08:21:44 - INFO - __main__ - Step 79045: {'lr': 0.00023385750840057657, 'samples': 15176640, 'steps': 79044, 'loss/train': 1.3733030557632446} 11/07/2021 08:21:45 - INFO - __main__ - Step 79046: {'lr': 0.0002338522127340744, 'samples': 15176832, 'steps': 79045, 'loss/train': 1.3322370052337646} 11/07/2021 08:21:45 - INFO - __main__ - Step 79047: {'lr': 0.0002338469170748483, 'samples': 15177024, 'steps': 79046, 'loss/train': 1.5456372499465942} 11/07/2021 08:21:45 - INFO - __main__ - Step 79048: {'lr': 0.0002338416214229004, 'samples': 15177216, 'steps': 79047, 'loss/train': 1.3519034385681152} 11/07/2021 08:21:46 - INFO - __main__ - Step 79049: {'lr': 0.00023383632577823324, 'samples': 15177408, 'steps': 79048, 'loss/train': 1.3346357345581055} 11/07/2021 08:21:47 - INFO - __main__ - Step 79050: {'lr': 0.00023383103014084917, 'samples': 15177600, 'steps': 79049, 'loss/train': 1.574970006942749} 11/07/2021 08:21:47 - INFO - __main__ - Step 79051: {'lr': 0.0002338257345107506, 'samples': 15177792, 'steps': 79050, 'loss/train': 1.4355778694152832} 11/07/2021 08:21:48 - INFO - __main__ - Step 79052: {'lr': 0.0002338204388879399, 'samples': 15177984, 'steps': 79051, 'loss/train': 1.369844913482666} 11/07/2021 08:21:48 - INFO - __main__ - Step 79053: {'lr': 0.00023381514327241944, 'samples': 15178176, 'steps': 79052, 'loss/train': 0.4191116392612457} 11/07/2021 08:21:48 - INFO - __main__ - Step 79054: {'lr': 0.00023380984766419163, 'samples': 15178368, 'steps': 79053, 'loss/train': 0.07338829338550568} 11/07/2021 08:21:50 - INFO - __main__ - Step 79055: {'lr': 0.00023380455206325888, 'samples': 15178560, 'steps': 79054, 'loss/train': 1.7084369659423828} 11/07/2021 08:21:50 - INFO - __main__ - Step 79056: {'lr': 0.00023379925646962354, 'samples': 15178752, 'steps': 79055, 'loss/train': 1.5869529247283936} 11/07/2021 08:21:50 - INFO - __main__ - Step 79057: {'lr': 0.00023379396088328797, 'samples': 15178944, 'steps': 79056, 'loss/train': 1.247064232826233} 11/07/2021 08:21:51 - INFO - __main__ - Step 79058: {'lr': 0.00023378866530425463, 'samples': 15179136, 'steps': 79057, 'loss/train': 1.1656653881072998} 11/07/2021 08:21:51 - INFO - __main__ - Step 79059: {'lr': 0.00023378336973252584, 'samples': 15179328, 'steps': 79058, 'loss/train': 1.046830177307129} 11/07/2021 08:21:52 - INFO - __main__ - Step 79060: {'lr': 0.00023377807416810414, 'samples': 15179520, 'steps': 79059, 'loss/train': 1.4847331047058105} 11/07/2021 08:21:52 - INFO - __main__ - Step 79061: {'lr': 0.0002337727786109917, 'samples': 15179712, 'steps': 79060, 'loss/train': 1.0933465957641602} 11/07/2021 08:21:53 - INFO - __main__ - Step 79062: {'lr': 0.00023376748306119097, 'samples': 15179904, 'steps': 79061, 'loss/train': 1.5931296348571777} 11/07/2021 08:21:53 - INFO - __main__ - Step 79063: {'lr': 0.00023376218751870436, 'samples': 15180096, 'steps': 79062, 'loss/train': 1.31902015209198} 11/07/2021 08:21:53 - INFO - __main__ - Step 79064: {'lr': 0.00023375689198353427, 'samples': 15180288, 'steps': 79063, 'loss/train': 1.621446132659912} 11/07/2021 08:21:54 - INFO - __main__ - Step 79065: {'lr': 0.00023375159645568305, 'samples': 15180480, 'steps': 79064, 'loss/train': 1.2549458742141724} 11/07/2021 08:21:55 - INFO - __main__ - Step 79066: {'lr': 0.00023374630093515313, 'samples': 15180672, 'steps': 79065, 'loss/train': 1.3531298637390137} 11/07/2021 08:21:55 - INFO - __main__ - Step 79067: {'lr': 0.00023374100542194686, 'samples': 15180864, 'steps': 79066, 'loss/train': 0.8330700397491455} 11/07/2021 08:21:55 - INFO - __main__ - Step 79068: {'lr': 0.00023373570991606666, 'samples': 15181056, 'steps': 79067, 'loss/train': 1.316853642463684} 11/07/2021 08:21:56 - INFO - __main__ - Step 79069: {'lr': 0.00023373041441751493, 'samples': 15181248, 'steps': 79068, 'loss/train': 1.4833215475082397} 11/07/2021 08:21:57 - INFO - __main__ - Step 79070: {'lr': 0.00023372511892629395, 'samples': 15181440, 'steps': 79069, 'loss/train': 1.337597370147705} 11/07/2021 08:21:57 - INFO - __main__ - Step 79071: {'lr': 0.0002337198234424062, 'samples': 15181632, 'steps': 79070, 'loss/train': 0.8602887988090515} 11/07/2021 08:21:57 - INFO - __main__ - Step 79072: {'lr': 0.00023371452796585408, 'samples': 15181824, 'steps': 79071, 'loss/train': 0.9384371638298035} 11/07/2021 08:21:58 - INFO - __main__ - Step 79073: {'lr': 0.00023370923249663994, 'samples': 15182016, 'steps': 79072, 'loss/train': 1.3661487102508545} 11/07/2021 08:21:58 - INFO - __main__ - Step 79074: {'lr': 0.00023370393703476625, 'samples': 15182208, 'steps': 79073, 'loss/train': 1.3214094638824463} 11/07/2021 08:21:59 - INFO - __main__ - Step 79075: {'lr': 0.00023369864158023524, 'samples': 15182400, 'steps': 79074, 'loss/train': 1.4641287326812744} 11/07/2021 08:21:59 - INFO - __main__ - Step 79076: {'lr': 0.00023369334613304935, 'samples': 15182592, 'steps': 79075, 'loss/train': 1.3736318349838257} 11/07/2021 08:22:00 - INFO - __main__ - Step 79077: {'lr': 0.00023368805069321098, 'samples': 15182784, 'steps': 79076, 'loss/train': 1.1579067707061768} 11/07/2021 08:22:00 - INFO - __main__ - Step 79078: {'lr': 0.00023368275526072254, 'samples': 15182976, 'steps': 79077, 'loss/train': 1.3275063037872314} 11/07/2021 08:22:00 - INFO - __main__ - Step 79079: {'lr': 0.00023367745983558636, 'samples': 15183168, 'steps': 79078, 'loss/train': 1.389373540878296} 11/07/2021 08:22:02 - INFO - __main__ - Step 79080: {'lr': 0.0002336721644178049, 'samples': 15183360, 'steps': 79079, 'loss/train': 1.3717715740203857} 11/07/2021 08:22:02 - INFO - __main__ - Step 79081: {'lr': 0.0002336668690073805, 'samples': 15183552, 'steps': 79080, 'loss/train': 1.6145254373550415} 11/07/2021 08:22:02 - INFO - __main__ - Step 79082: {'lr': 0.00023366157360431555, 'samples': 15183744, 'steps': 79081, 'loss/train': 1.4561445713043213} 11/07/2021 08:22:03 - INFO - __main__ - Step 79083: {'lr': 0.00023365627820861245, 'samples': 15183936, 'steps': 79082, 'loss/train': 1.3851670026779175} 11/07/2021 08:22:03 - INFO - __main__ - Step 79084: {'lr': 0.0002336509828202736, 'samples': 15184128, 'steps': 79083, 'loss/train': 1.3775843381881714} 11/07/2021 08:22:03 - INFO - __main__ - Step 79085: {'lr': 0.00023364568743930133, 'samples': 15184320, 'steps': 79084, 'loss/train': 0.9791346192359924} 11/07/2021 08:22:04 - INFO - __main__ - Step 79086: {'lr': 0.0002336403920656981, 'samples': 15184512, 'steps': 79085, 'loss/train': 1.8014562129974365} 11/07/2021 08:22:05 - INFO - __main__ - Step 79087: {'lr': 0.00023363509669946633, 'samples': 15184704, 'steps': 79086, 'loss/train': 1.2894909381866455} 11/07/2021 08:22:05 - INFO - __main__ - Step 79088: {'lr': 0.00023362980134060824, 'samples': 15184896, 'steps': 79087, 'loss/train': 1.5213499069213867} 11/07/2021 08:22:05 - INFO - __main__ - Step 79089: {'lr': 0.00023362450598912632, 'samples': 15185088, 'steps': 79088, 'loss/train': 2.0677289962768555} 11/07/2021 08:22:06 - INFO - __main__ - Step 79090: {'lr': 0.00023361921064502292, 'samples': 15185280, 'steps': 79089, 'loss/train': 0.7823951244354248} 11/07/2021 08:22:07 - INFO - __main__ - Step 79091: {'lr': 0.00023361391530830045, 'samples': 15185472, 'steps': 79090, 'loss/train': 1.23990797996521} 11/07/2021 08:22:07 - INFO - __main__ - Step 79092: {'lr': 0.00023360861997896132, 'samples': 15185664, 'steps': 79091, 'loss/train': 1.342383623123169} 11/07/2021 08:22:07 - INFO - __main__ - Step 79093: {'lr': 0.00023360332465700788, 'samples': 15185856, 'steps': 79092, 'loss/train': 1.0632033348083496} 11/07/2021 08:22:08 - INFO - __main__ - Step 79094: {'lr': 0.00023359802934244255, 'samples': 15186048, 'steps': 79093, 'loss/train': 1.5688767433166504} 11/07/2021 08:22:08 - INFO - __main__ - Step 79095: {'lr': 0.00023359273403526765, 'samples': 15186240, 'steps': 79094, 'loss/train': 1.1350631713867188} 11/07/2021 08:22:09 - INFO - __main__ - Step 79096: {'lr': 0.00023358743873548566, 'samples': 15186432, 'steps': 79095, 'loss/train': 1.6472290754318237} 11/07/2021 08:22:10 - INFO - __main__ - Step 79097: {'lr': 0.0002335821434430989, 'samples': 15186624, 'steps': 79096, 'loss/train': 1.8527562618255615} 11/07/2021 08:22:10 - INFO - __main__ - Step 79098: {'lr': 0.00023357684815810976, 'samples': 15186816, 'steps': 79097, 'loss/train': 1.528745412826538} 11/07/2021 08:22:10 - INFO - __main__ - Step 79099: {'lr': 0.00023357155288052063, 'samples': 15187008, 'steps': 79098, 'loss/train': 1.2070586681365967} 11/07/2021 08:22:11 - INFO - __main__ - Step 79100: {'lr': 0.00023356625761033394, 'samples': 15187200, 'steps': 79099, 'loss/train': 1.255898118019104} 11/07/2021 08:22:12 - INFO - __main__ - Step 79101: {'lr': 0.0002335609623475521, 'samples': 15187392, 'steps': 79100, 'loss/train': 1.6962031126022339} 11/07/2021 08:22:12 - INFO - __main__ - Step 79102: {'lr': 0.00023355566709217735, 'samples': 15187584, 'steps': 79101, 'loss/train': 0.6123949289321899} 11/07/2021 08:22:12 - INFO - __main__ - Step 79103: {'lr': 0.00023355037184421217, 'samples': 15187776, 'steps': 79102, 'loss/train': 0.8979575634002686} 11/07/2021 08:22:13 - INFO - __main__ - Step 79104: {'lr': 0.00023354507660365895, 'samples': 15187968, 'steps': 79103, 'loss/train': 1.4642943143844604} 11/07/2021 08:22:13 - INFO - __main__ - Step 79105: {'lr': 0.00023353978137052007, 'samples': 15188160, 'steps': 79104, 'loss/train': 1.0590362548828125} 11/07/2021 08:22:14 - INFO - __main__ - Step 79106: {'lr': 0.0002335344861447979, 'samples': 15188352, 'steps': 79105, 'loss/train': 1.4213030338287354} 11/07/2021 08:22:14 - INFO - __main__ - Step 79107: {'lr': 0.0002335291909264948, 'samples': 15188544, 'steps': 79106, 'loss/train': 1.1651562452316284} 11/07/2021 08:22:15 - INFO - __main__ - Step 79108: {'lr': 0.00023352389571561322, 'samples': 15188736, 'steps': 79107, 'loss/train': 1.7335718870162964} 11/07/2021 08:22:15 - INFO - __main__ - Step 79109: {'lr': 0.00023351860051215554, 'samples': 15188928, 'steps': 79108, 'loss/train': 1.2124626636505127} 11/07/2021 08:22:15 - INFO - __main__ - Step 79110: {'lr': 0.00023351330531612408, 'samples': 15189120, 'steps': 79109, 'loss/train': 1.0762420892715454} 11/07/2021 08:22:16 - INFO - __main__ - Step 79111: {'lr': 0.00023350801012752133, 'samples': 15189312, 'steps': 79110, 'loss/train': 1.4289743900299072} 11/07/2021 08:22:17 - INFO - __main__ - Step 79112: {'lr': 0.00023350271494634956, 'samples': 15189504, 'steps': 79111, 'loss/train': 1.5338619947433472} 11/07/2021 08:22:17 - INFO - __main__ - Step 79113: {'lr': 0.00023349741977261125, 'samples': 15189696, 'steps': 79112, 'loss/train': 1.4151761531829834} 11/07/2021 08:22:17 - INFO - __main__ - Step 79114: {'lr': 0.0002334921246063088, 'samples': 15189888, 'steps': 79113, 'loss/train': 1.5225964784622192} 11/07/2021 08:22:18 - INFO - __main__ - Step 79115: {'lr': 0.0002334868294474445, 'samples': 15190080, 'steps': 79114, 'loss/train': 1.1262978315353394} 11/07/2021 08:22:18 - INFO - __main__ - Step 79116: {'lr': 0.00023348153429602077, 'samples': 15190272, 'steps': 79115, 'loss/train': 2.318514108657837} 11/07/2021 08:22:19 - INFO - __main__ - Step 79117: {'lr': 0.00023347623915203998, 'samples': 15190464, 'steps': 79116, 'loss/train': 1.4600870609283447} 11/07/2021 08:22:20 - INFO - __main__ - Step 79118: {'lr': 0.00023347094401550457, 'samples': 15190656, 'steps': 79117, 'loss/train': 1.2745673656463623} 11/07/2021 08:22:20 - INFO - __main__ - Step 79119: {'lr': 0.00023346564888641685, 'samples': 15190848, 'steps': 79118, 'loss/train': 1.7436014413833618} 11/07/2021 08:22:20 - INFO - __main__ - Step 79120: {'lr': 0.00023346035376477928, 'samples': 15191040, 'steps': 79119, 'loss/train': 1.1190803050994873} 11/07/2021 08:22:21 - INFO - __main__ - Step 79121: {'lr': 0.00023345505865059424, 'samples': 15191232, 'steps': 79120, 'loss/train': 1.329880714416504} 11/07/2021 08:22:22 - INFO - __main__ - Step 79122: {'lr': 0.00023344976354386406, 'samples': 15191424, 'steps': 79121, 'loss/train': 1.5450630187988281} 11/07/2021 08:22:22 - INFO - __main__ - Step 79123: {'lr': 0.0002334444684445912, 'samples': 15191616, 'steps': 79122, 'loss/train': 0.7996379733085632} 11/07/2021 08:22:23 - INFO - __main__ - Step 79124: {'lr': 0.00023343917335277799, 'samples': 15191808, 'steps': 79123, 'loss/train': 1.254042625427246} 11/07/2021 08:22:23 - INFO - __main__ - Step 79125: {'lr': 0.00023343387826842683, 'samples': 15192000, 'steps': 79124, 'loss/train': 1.3839759826660156} 11/07/2021 08:22:23 - INFO - __main__ - Step 79126: {'lr': 0.00023342858319154008, 'samples': 15192192, 'steps': 79125, 'loss/train': 1.6531891822814941} 11/07/2021 08:22:24 - INFO - __main__ - Step 79127: {'lr': 0.0002334232881221203, 'samples': 15192384, 'steps': 79126, 'loss/train': 0.3584250509738922} 11/07/2021 08:22:25 - INFO - __main__ - Step 79128: {'lr': 0.0002334179930601696, 'samples': 15192576, 'steps': 79127, 'loss/train': 1.394895076751709} 11/07/2021 08:22:25 - INFO - __main__ - Step 79129: {'lr': 0.00023341269800569053, 'samples': 15192768, 'steps': 79128, 'loss/train': 1.3016774654388428} 11/07/2021 08:22:25 - INFO - __main__ - Step 79130: {'lr': 0.00023340740295868542, 'samples': 15192960, 'steps': 79129, 'loss/train': 1.507116436958313} 11/07/2021 08:22:26 - INFO - __main__ - Step 79131: {'lr': 0.00023340210791915667, 'samples': 15193152, 'steps': 79130, 'loss/train': 1.7738046646118164} 11/07/2021 08:22:26 - INFO - __main__ - Step 79132: {'lr': 0.00023339681288710667, 'samples': 15193344, 'steps': 79131, 'loss/train': 1.5383957624435425} 11/07/2021 08:22:27 - INFO - __main__ - Step 79133: {'lr': 0.00023339151786253785, 'samples': 15193536, 'steps': 79132, 'loss/train': 0.9222170114517212} 11/07/2021 08:22:27 - INFO - __main__ - Step 79134: {'lr': 0.00023338622284545252, 'samples': 15193728, 'steps': 79133, 'loss/train': 1.5068905353546143} 11/07/2021 08:22:28 - INFO - __main__ - Step 79135: {'lr': 0.00023338092783585312, 'samples': 15193920, 'steps': 79134, 'loss/train': 1.2909624576568604} 11/07/2021 08:22:28 - INFO - __main__ - Step 79136: {'lr': 0.000233375632833742, 'samples': 15194112, 'steps': 79135, 'loss/train': 1.0364900827407837} 11/07/2021 08:22:28 - INFO - __main__ - Step 79137: {'lr': 0.00023337033783912164, 'samples': 15194304, 'steps': 79136, 'loss/train': 1.4328901767730713} 11/07/2021 08:22:29 - INFO - __main__ - Step 79138: {'lr': 0.00023336504285199428, 'samples': 15194496, 'steps': 79137, 'loss/train': 0.981006920337677} 11/07/2021 08:22:30 - INFO - __main__ - Step 79139: {'lr': 0.00023335974787236236, 'samples': 15194688, 'steps': 79138, 'loss/train': 1.5939972400665283} 11/07/2021 08:22:30 - INFO - __main__ - Step 79140: {'lr': 0.0002333544529002283, 'samples': 15194880, 'steps': 79139, 'loss/train': 1.3801411390304565} 11/07/2021 08:22:30 - INFO - __main__ - Step 79141: {'lr': 0.00023334915793559453, 'samples': 15195072, 'steps': 79140, 'loss/train': 1.7362990379333496} 11/07/2021 08:22:31 - INFO - __main__ - Step 79142: {'lr': 0.0002333438629784633, 'samples': 15195264, 'steps': 79141, 'loss/train': 1.1231622695922852} 11/07/2021 08:22:32 - INFO - __main__ - Step 79143: {'lr': 0.00023333856802883708, 'samples': 15195456, 'steps': 79142, 'loss/train': 3.707226514816284} 11/07/2021 08:22:32 - INFO - __main__ - Step 79144: {'lr': 0.00023333327308671823, 'samples': 15195648, 'steps': 79143, 'loss/train': 1.1926708221435547} 11/07/2021 08:22:32 - INFO - __main__ - Step 79145: {'lr': 0.00023332797815210917, 'samples': 15195840, 'steps': 79144, 'loss/train': 1.4193570613861084} 11/07/2021 08:22:33 - INFO - __main__ - Step 79146: {'lr': 0.00023332268322501226, 'samples': 15196032, 'steps': 79145, 'loss/train': 1.6068527698516846} 11/07/2021 08:22:33 - INFO - __main__ - Step 79147: {'lr': 0.00023331738830542986, 'samples': 15196224, 'steps': 79146, 'loss/train': 1.480541467666626} 11/07/2021 08:22:35 - INFO - __main__ - Step 79148: {'lr': 0.00023331209339336447, 'samples': 15196416, 'steps': 79147, 'loss/train': 1.492283582687378} 11/07/2021 08:22:35 - INFO - __main__ - Step 79149: {'lr': 0.00023330679848881835, 'samples': 15196608, 'steps': 79148, 'loss/train': 1.5777631998062134} 11/07/2021 08:22:35 - INFO - __main__ - Step 79150: {'lr': 0.0002333015035917939, 'samples': 15196800, 'steps': 79149, 'loss/train': 1.7412437200546265} 11/07/2021 08:22:36 - INFO - __main__ - Step 79151: {'lr': 0.00023329620870229356, 'samples': 15196992, 'steps': 79150, 'loss/train': 1.4386646747589111} 11/07/2021 08:22:36 - INFO - __main__ - Step 79152: {'lr': 0.0002332909138203197, 'samples': 15197184, 'steps': 79151, 'loss/train': 1.0775591135025024} 11/07/2021 08:22:36 - INFO - __main__ - Step 79153: {'lr': 0.00023328561894587466, 'samples': 15197376, 'steps': 79152, 'loss/train': 5.530993461608887} 11/07/2021 08:22:37 - INFO - __main__ - Step 79154: {'lr': 0.00023328032407896095, 'samples': 15197568, 'steps': 79153, 'loss/train': 4.83119010925293} 11/07/2021 08:22:38 - INFO - __main__ - Step 79155: {'lr': 0.0002332750292195808, 'samples': 15197760, 'steps': 79154, 'loss/train': 1.2583056688308716} 11/07/2021 08:22:38 - INFO - __main__ - Step 79156: {'lr': 0.00023326973436773666, 'samples': 15197952, 'steps': 79155, 'loss/train': 1.3428046703338623} 11/07/2021 08:22:38 - INFO - __main__ - Step 79157: {'lr': 0.0002332644395234309, 'samples': 15198144, 'steps': 79156, 'loss/train': 1.2476365566253662} 11/07/2021 08:22:39 - INFO - __main__ - Step 79158: {'lr': 0.00023325914468666595, 'samples': 15198336, 'steps': 79157, 'loss/train': 1.3683562278747559} 11/07/2021 08:22:39 - INFO - __main__ - Step 79159: {'lr': 0.00023325384985744424, 'samples': 15198528, 'steps': 79158, 'loss/train': 1.5467634201049805} 11/07/2021 08:22:40 - INFO - __main__ - Step 79160: {'lr': 0.000233248555035768, 'samples': 15198720, 'steps': 79159, 'loss/train': 0.8324890732765198} 11/07/2021 08:22:41 - INFO - __main__ - Step 79161: {'lr': 0.00023324326022163973, 'samples': 15198912, 'steps': 79160, 'loss/train': 0.7135121822357178} 11/07/2021 08:22:41 - INFO - __main__ - Step 79162: {'lr': 0.00023323796541506177, 'samples': 15199104, 'steps': 79161, 'loss/train': 1.4318604469299316} 11/07/2021 08:22:41 - INFO - __main__ - Step 79163: {'lr': 0.00023323267061603654, 'samples': 15199296, 'steps': 79162, 'loss/train': 1.3587616682052612} 11/07/2021 08:22:42 - INFO - __main__ - Step 79164: {'lr': 0.00023322737582456637, 'samples': 15199488, 'steps': 79163, 'loss/train': 1.4586519002914429} 11/07/2021 08:22:43 - INFO - __main__ - Step 79165: {'lr': 0.00023322208104065373, 'samples': 15199680, 'steps': 79164, 'loss/train': 1.518717646598816} 11/07/2021 08:22:43 - INFO - __main__ - Step 79166: {'lr': 0.00023321678626430097, 'samples': 15199872, 'steps': 79165, 'loss/train': 1.3144996166229248} 11/07/2021 08:22:44 - INFO - __main__ - Step 79167: {'lr': 0.0002332114914955104, 'samples': 15200064, 'steps': 79166, 'loss/train': 1.4349217414855957} 11/07/2021 08:22:44 - INFO - __main__ - Step 79168: {'lr': 0.0002332061967342846, 'samples': 15200256, 'steps': 79167, 'loss/train': 1.7371025085449219} 11/07/2021 08:22:44 - INFO - __main__ - Step 79169: {'lr': 0.00023320090198062575, 'samples': 15200448, 'steps': 79168, 'loss/train': 1.0201741456985474} 11/07/2021 08:22:46 - INFO - __main__ - Step 79170: {'lr': 0.00023319560723453637, 'samples': 15200640, 'steps': 79169, 'loss/train': 1.1883729696273804} 11/07/2021 08:22:46 - INFO - __main__ - Step 79171: {'lr': 0.0002331903124960187, 'samples': 15200832, 'steps': 79170, 'loss/train': 1.2551798820495605} 11/07/2021 08:22:46 - INFO - __main__ - Step 79172: {'lr': 0.00023318501776507526, 'samples': 15201024, 'steps': 79171, 'loss/train': 0.25039586424827576} 11/07/2021 08:22:47 - INFO - __main__ - Step 79173: {'lr': 0.00023317972304170837, 'samples': 15201216, 'steps': 79172, 'loss/train': 0.15279355645179749} 11/07/2021 08:22:47 - INFO - __main__ - Step 79174: {'lr': 0.00023317442832592044, 'samples': 15201408, 'steps': 79173, 'loss/train': 1.4934816360473633} 11/07/2021 08:22:47 - INFO - __main__ - Step 79175: {'lr': 0.00023316913361771385, 'samples': 15201600, 'steps': 79174, 'loss/train': 1.7526586055755615} 11/07/2021 08:22:48 - INFO - __main__ - Step 79176: {'lr': 0.000233163838917091, 'samples': 15201792, 'steps': 79175, 'loss/train': 1.7313175201416016} 11/07/2021 08:22:49 - INFO - __main__ - Step 79177: {'lr': 0.00023315854422405427, 'samples': 15201984, 'steps': 79176, 'loss/train': 1.0570261478424072} 11/07/2021 08:22:49 - INFO - __main__ - Step 79178: {'lr': 0.00023315324953860603, 'samples': 15202176, 'steps': 79177, 'loss/train': 1.6213085651397705} 11/07/2021 08:22:50 - INFO - __main__ - Step 79179: {'lr': 0.0002331479548607487, 'samples': 15202368, 'steps': 79178, 'loss/train': 1.0811794996261597} 11/07/2021 08:22:50 - INFO - __main__ - Step 79180: {'lr': 0.00023314266019048457, 'samples': 15202560, 'steps': 79179, 'loss/train': 1.077506422996521} 11/07/2021 08:22:50 - INFO - __main__ - Step 79181: {'lr': 0.00023313736552781628, 'samples': 15202752, 'steps': 79180, 'loss/train': 1.277818202972412} 11/07/2021 08:22:51 - INFO - __main__ - Step 79182: {'lr': 0.0002331320708727459, 'samples': 15202944, 'steps': 79181, 'loss/train': 1.3789921998977661} 11/07/2021 08:22:52 - INFO - __main__ - Step 79183: {'lr': 0.00023312677622527595, 'samples': 15203136, 'steps': 79182, 'loss/train': 1.1544930934906006} 11/07/2021 08:22:52 - INFO - __main__ - Step 79184: {'lr': 0.0002331214815854088, 'samples': 15203328, 'steps': 79183, 'loss/train': 1.6623756885528564} 11/07/2021 08:22:52 - INFO - __main__ - Step 79185: {'lr': 0.00023311618695314685, 'samples': 15203520, 'steps': 79184, 'loss/train': 1.148361325263977} 11/07/2021 08:22:53 - INFO - __main__ - Step 79186: {'lr': 0.0002331108923284925, 'samples': 15203712, 'steps': 79185, 'loss/train': 1.326241135597229} 11/07/2021 08:22:54 - INFO - __main__ - Step 79187: {'lr': 0.00023310559771144812, 'samples': 15203904, 'steps': 79186, 'loss/train': 1.4526739120483398} 11/07/2021 08:22:54 - INFO - __main__ - Step 79188: {'lr': 0.00023310030310201612, 'samples': 15204096, 'steps': 79187, 'loss/train': 1.1805384159088135} 11/07/2021 08:22:55 - INFO - __main__ - Step 79189: {'lr': 0.0002330950085001988, 'samples': 15204288, 'steps': 79188, 'loss/train': 1.4231035709381104} 11/07/2021 08:22:55 - INFO - __main__ - Step 79190: {'lr': 0.00023308971390599865, 'samples': 15204480, 'steps': 79189, 'loss/train': 1.5927916765213013} 11/07/2021 08:22:55 - INFO - __main__ - Step 79191: {'lr': 0.00023308441931941802, 'samples': 15204672, 'steps': 79190, 'loss/train': 1.3535107374191284} 11/07/2021 08:22:56 - INFO - __main__ - Step 79192: {'lr': 0.00023307912474045928, 'samples': 15204864, 'steps': 79191, 'loss/train': 1.1296337842941284} 11/07/2021 08:22:57 - INFO - __main__ - Step 79193: {'lr': 0.0002330738301691248, 'samples': 15205056, 'steps': 79192, 'loss/train': 0.08424742519855499} 11/07/2021 08:22:57 - INFO - __main__ - Step 79194: {'lr': 0.00023306853560541705, 'samples': 15205248, 'steps': 79193, 'loss/train': 0.9297148585319519} 11/07/2021 08:22:57 - INFO - __main__ - Step 79195: {'lr': 0.0002330632410493384, 'samples': 15205440, 'steps': 79194, 'loss/train': 1.1406724452972412} 11/07/2021 08:22:58 - INFO - __main__ - Step 79196: {'lr': 0.00023305794650089112, 'samples': 15205632, 'steps': 79195, 'loss/train': 0.8804008364677429} 11/07/2021 08:22:58 - INFO - __main__ - Step 79197: {'lr': 0.0002330526519600777, 'samples': 15205824, 'steps': 79196, 'loss/train': 1.513545274734497} 11/07/2021 08:22:59 - INFO - __main__ - Step 79198: {'lr': 0.00023304735742690042, 'samples': 15206016, 'steps': 79197, 'loss/train': 1.4575684070587158} 11/07/2021 08:22:59 - INFO - __main__ - Step 79199: {'lr': 0.00023304206290136178, 'samples': 15206208, 'steps': 79198, 'loss/train': 1.1212830543518066} 11/07/2021 08:23:00 - INFO - __main__ - Step 79200: {'lr': 0.00023303676838346414, 'samples': 15206400, 'steps': 79199, 'loss/train': 1.727722406387329} 11/07/2021 08:23:00 - INFO - __main__ - Step 79201: {'lr': 0.00023303147387320982, 'samples': 15206592, 'steps': 79200, 'loss/train': 1.4507288932800293} 11/07/2021 08:23:00 - INFO - __main__ - Step 79202: {'lr': 0.0002330261793706013, 'samples': 15206784, 'steps': 79201, 'loss/train': 1.5810400247573853} 11/07/2021 08:23:01 - INFO - __main__ - Step 79203: {'lr': 0.0002330208848756409, 'samples': 15206976, 'steps': 79202, 'loss/train': 1.392287254333496} 11/07/2021 08:23:02 - INFO - __main__ - Step 79204: {'lr': 0.00023301559038833104, 'samples': 15207168, 'steps': 79203, 'loss/train': 1.1359015703201294} 11/07/2021 08:23:02 - INFO - __main__ - Step 79205: {'lr': 0.0002330102959086741, 'samples': 15207360, 'steps': 79204, 'loss/train': 1.3386640548706055} 11/07/2021 08:23:03 - INFO - __main__ - Step 79206: {'lr': 0.00023300500143667245, 'samples': 15207552, 'steps': 79205, 'loss/train': 1.8008710145950317} 11/07/2021 08:23:03 - INFO - __main__ - Step 79207: {'lr': 0.00023299970697232848, 'samples': 15207744, 'steps': 79206, 'loss/train': 2.0807344913482666} 11/07/2021 08:23:04 - INFO - __main__ - Step 79208: {'lr': 0.00023299441251564468, 'samples': 15207936, 'steps': 79207, 'loss/train': 1.4861589670181274} 11/07/2021 08:23:04 - INFO - __main__ - Step 79209: {'lr': 0.00023298911806662322, 'samples': 15208128, 'steps': 79208, 'loss/train': 1.4361035823822021} 11/07/2021 08:23:05 - INFO - __main__ - Step 79210: {'lr': 0.00023298382362526662, 'samples': 15208320, 'steps': 79209, 'loss/train': 1.2435904741287231} 11/07/2021 08:23:05 - INFO - __main__ - Step 79211: {'lr': 0.00023297852919157725, 'samples': 15208512, 'steps': 79210, 'loss/train': 1.242535948753357} 11/07/2021 08:23:05 - INFO - __main__ - Step 79212: {'lr': 0.00023297323476555748, 'samples': 15208704, 'steps': 79211, 'loss/train': 1.143212080001831} 11/07/2021 08:23:06 - INFO - __main__ - Step 79213: {'lr': 0.0002329679403472097, 'samples': 15208896, 'steps': 79212, 'loss/train': 1.512702465057373} 11/07/2021 08:23:07 - INFO - __main__ - Step 79214: {'lr': 0.00023296264593653632, 'samples': 15209088, 'steps': 79213, 'loss/train': 1.365240454673767} 11/07/2021 08:23:07 - INFO - __main__ - Step 79215: {'lr': 0.0002329573515335397, 'samples': 15209280, 'steps': 79214, 'loss/train': 1.432896614074707} 11/07/2021 08:23:07 - INFO - __main__ - Step 79216: {'lr': 0.00023295205713822227, 'samples': 15209472, 'steps': 79215, 'loss/train': 0.7084894180297852} 11/07/2021 08:23:08 - INFO - __main__ - Step 79217: {'lr': 0.0002329467627505863, 'samples': 15209664, 'steps': 79216, 'loss/train': 0.3141186237335205} 11/07/2021 08:23:09 - INFO - __main__ - Step 79218: {'lr': 0.00023294146837063431, 'samples': 15209856, 'steps': 79217, 'loss/train': 0.9346727728843689} 11/07/2021 08:23:09 - INFO - __main__ - Step 79219: {'lr': 0.00023293617399836863, 'samples': 15210048, 'steps': 79218, 'loss/train': 1.6084920167922974} 11/07/2021 08:23:09 - INFO - __main__ - Step 79220: {'lr': 0.00023293087963379168, 'samples': 15210240, 'steps': 79219, 'loss/train': 1.129433274269104} 11/07/2021 08:23:10 - INFO - __main__ - Step 79221: {'lr': 0.00023292558527690575, 'samples': 15210432, 'steps': 79220, 'loss/train': 1.3122498989105225} 11/07/2021 08:23:10 - INFO - __main__ - Step 79222: {'lr': 0.0002329202909277134, 'samples': 15210624, 'steps': 79221, 'loss/train': 1.0628948211669922} 11/07/2021 08:23:11 - INFO - __main__ - Step 79223: {'lr': 0.00023291499658621684, 'samples': 15210816, 'steps': 79222, 'loss/train': 1.311030387878418} 11/07/2021 08:23:11 - INFO - __main__ - Step 79224: {'lr': 0.0002329097022524185, 'samples': 15211008, 'steps': 79223, 'loss/train': 1.9823675155639648} 11/07/2021 08:23:12 - INFO - __main__ - Step 79225: {'lr': 0.0002329044079263208, 'samples': 15211200, 'steps': 79224, 'loss/train': 1.2212371826171875} 11/07/2021 08:23:12 - INFO - __main__ - Step 79226: {'lr': 0.00023289911360792608, 'samples': 15211392, 'steps': 79225, 'loss/train': 1.7283151149749756} 11/07/2021 08:23:13 - INFO - __main__ - Step 79227: {'lr': 0.00023289381929723674, 'samples': 15211584, 'steps': 79226, 'loss/train': 1.2741262912750244} 11/07/2021 08:23:13 - INFO - __main__ - Step 79228: {'lr': 0.00023288852499425523, 'samples': 15211776, 'steps': 79227, 'loss/train': 1.1083340644836426} 11/07/2021 08:23:14 - INFO - __main__ - Step 79229: {'lr': 0.00023288323069898384, 'samples': 15211968, 'steps': 79228, 'loss/train': 1.3584377765655518} 11/07/2021 08:23:14 - INFO - __main__ - Step 79230: {'lr': 0.000232877936411425, 'samples': 15212160, 'steps': 79229, 'loss/train': 1.316588282585144} 11/07/2021 08:23:15 - INFO - __main__ - Step 79231: {'lr': 0.00023287264213158116, 'samples': 15212352, 'steps': 79230, 'loss/train': 1.3951135873794556} 11/07/2021 08:23:15 - INFO - __main__ - Step 79232: {'lr': 0.00023286734785945458, 'samples': 15212544, 'steps': 79231, 'loss/train': 1.418782353401184} 11/07/2021 08:23:15 - INFO - __main__ - Step 79233: {'lr': 0.00023286205359504775, 'samples': 15212736, 'steps': 79232, 'loss/train': 1.243523120880127} 11/07/2021 08:23:16 - INFO - __main__ - Step 79234: {'lr': 0.00023285675933836297, 'samples': 15212928, 'steps': 79233, 'loss/train': 1.4337025880813599} 11/07/2021 08:23:17 - INFO - __main__ - Step 79235: {'lr': 0.00023285146508940282, 'samples': 15213120, 'steps': 79234, 'loss/train': 0.6020420789718628} 11/07/2021 08:23:17 - INFO - __main__ - Step 79236: {'lr': 0.00023284617084816942, 'samples': 15213312, 'steps': 79235, 'loss/train': 1.3317800760269165} 11/07/2021 08:23:17 - INFO - __main__ - Step 79237: {'lr': 0.00023284087661466527, 'samples': 15213504, 'steps': 79236, 'loss/train': 1.5524448156356812} 11/07/2021 08:23:18 - INFO - __main__ - Step 79238: {'lr': 0.00023283558238889273, 'samples': 15213696, 'steps': 79237, 'loss/train': 1.2264188528060913} 11/07/2021 08:23:19 - INFO - __main__ - Step 79239: {'lr': 0.00023283028817085424, 'samples': 15213888, 'steps': 79238, 'loss/train': 1.769691824913025} 11/07/2021 08:23:19 - INFO - __main__ - Step 79240: {'lr': 0.00023282499396055215, 'samples': 15214080, 'steps': 79239, 'loss/train': 1.4624508619308472} 11/07/2021 08:23:20 - INFO - __main__ - Step 79241: {'lr': 0.00023281969975798885, 'samples': 15214272, 'steps': 79240, 'loss/train': 1.4395947456359863} 11/07/2021 08:23:20 - INFO - __main__ - Step 79242: {'lr': 0.00023281440556316673, 'samples': 15214464, 'steps': 79241, 'loss/train': 1.2980259656906128} 11/07/2021 08:23:20 - INFO - __main__ - Step 79243: {'lr': 0.00023280911137608818, 'samples': 15214656, 'steps': 79242, 'loss/train': 1.1164358854293823} 11/07/2021 08:23:21 - INFO - __main__ - Step 79244: {'lr': 0.0002328038171967556, 'samples': 15214848, 'steps': 79243, 'loss/train': 2.019165277481079} 11/07/2021 08:23:22 - INFO - __main__ - Step 79245: {'lr': 0.00023279852302517129, 'samples': 15215040, 'steps': 79244, 'loss/train': 1.6850519180297852} 11/07/2021 08:23:22 - INFO - __main__ - Step 79246: {'lr': 0.00023279322886133775, 'samples': 15215232, 'steps': 79245, 'loss/train': 1.96170973777771} 11/07/2021 08:23:22 - INFO - __main__ - Step 79247: {'lr': 0.0002327879347052573, 'samples': 15215424, 'steps': 79246, 'loss/train': 1.1388607025146484} 11/07/2021 08:23:23 - INFO - __main__ - Step 79248: {'lr': 0.00023278264055693243, 'samples': 15215616, 'steps': 79247, 'loss/train': 1.646540641784668} 11/07/2021 08:23:23 - INFO - __main__ - Step 79249: {'lr': 0.00023277734641636536, 'samples': 15215808, 'steps': 79248, 'loss/train': 1.7629815340042114} 11/07/2021 08:23:24 - INFO - __main__ - Step 79250: {'lr': 0.00023277205228355855, 'samples': 15216000, 'steps': 79249, 'loss/train': 1.2493926286697388} 11/07/2021 08:23:25 - INFO - __main__ - Step 79251: {'lr': 0.0002327667581585144, 'samples': 15216192, 'steps': 79250, 'loss/train': 1.4506274461746216} 11/07/2021 08:23:25 - INFO - __main__ - Step 79252: {'lr': 0.00023276146404123524, 'samples': 15216384, 'steps': 79251, 'loss/train': 1.6379759311676025} 11/07/2021 08:23:25 - INFO - __main__ - Step 79253: {'lr': 0.0002327561699317235, 'samples': 15216576, 'steps': 79252, 'loss/train': 0.7411928772926331} 11/07/2021 08:23:26 - INFO - __main__ - Step 79254: {'lr': 0.0002327508758299816, 'samples': 15216768, 'steps': 79253, 'loss/train': 1.409316062927246} 11/07/2021 08:23:26 - INFO - __main__ - Step 79255: {'lr': 0.00023274558173601187, 'samples': 15216960, 'steps': 79254, 'loss/train': 0.9556497931480408} 11/07/2021 08:23:27 - INFO - __main__ - Step 79256: {'lr': 0.00023274028764981672, 'samples': 15217152, 'steps': 79255, 'loss/train': 1.3879393339157104} 11/07/2021 08:23:27 - INFO - __main__ - Step 79257: {'lr': 0.00023273499357139853, 'samples': 15217344, 'steps': 79256, 'loss/train': 1.344911813735962} 11/07/2021 08:23:28 - INFO - __main__ - Step 79258: {'lr': 0.0002327296995007597, 'samples': 15217536, 'steps': 79257, 'loss/train': 1.2210578918457031} 11/07/2021 08:23:28 - INFO - __main__ - Step 79259: {'lr': 0.0002327244054379026, 'samples': 15217728, 'steps': 79258, 'loss/train': 1.5794956684112549} 11/07/2021 08:23:28 - INFO - __main__ - Step 79260: {'lr': 0.00023271911138282957, 'samples': 15217920, 'steps': 79259, 'loss/train': 1.1738842725753784} 11/07/2021 08:23:30 - INFO - __main__ - Step 79261: {'lr': 0.00023271381733554314, 'samples': 15218112, 'steps': 79260, 'loss/train': 1.3883312940597534} 11/07/2021 08:23:31 - INFO - __main__ - Step 79262: {'lr': 0.00023270852329604558, 'samples': 15218304, 'steps': 79261, 'loss/train': 1.2810932397842407} 11/07/2021 08:23:31 - INFO - __main__ - Step 79263: {'lr': 0.00023270322926433924, 'samples': 15218496, 'steps': 79262, 'loss/train': 1.2059062719345093} 11/07/2021 08:23:31 - INFO - __main__ - Step 79264: {'lr': 0.00023269793524042658, 'samples': 15218688, 'steps': 79263, 'loss/train': 0.8284554481506348} 11/07/2021 08:23:32 - INFO - __main__ - Step 79265: {'lr': 0.00023269264122430992, 'samples': 15218880, 'steps': 79264, 'loss/train': 1.6664894819259644} 11/07/2021 08:23:32 - INFO - __main__ - Step 79266: {'lr': 0.00023268734721599172, 'samples': 15219072, 'steps': 79265, 'loss/train': 1.7637454271316528} 11/07/2021 08:23:32 - INFO - __main__ - Step 79267: {'lr': 0.0002326820532154743, 'samples': 15219264, 'steps': 79266, 'loss/train': 1.5214507579803467} 11/07/2021 08:23:33 - INFO - __main__ - Step 79268: {'lr': 0.00023267675922276012, 'samples': 15219456, 'steps': 79267, 'loss/train': 1.7072652578353882} 11/07/2021 08:23:34 - INFO - __main__ - Step 79269: {'lr': 0.00023267146523785152, 'samples': 15219648, 'steps': 79268, 'loss/train': 1.023737907409668} 11/07/2021 08:23:34 - INFO - __main__ - Step 79270: {'lr': 0.00023266617126075089, 'samples': 15219840, 'steps': 79269, 'loss/train': 1.2627450227737427} 11/07/2021 08:23:34 - INFO - __main__ - Step 79271: {'lr': 0.0002326608772914606, 'samples': 15220032, 'steps': 79270, 'loss/train': 1.3596179485321045} 11/07/2021 08:23:35 - INFO - __main__ - Step 79272: {'lr': 0.0002326555833299831, 'samples': 15220224, 'steps': 79271, 'loss/train': 1.503133773803711} 11/07/2021 08:23:36 - INFO - __main__ - Step 79273: {'lr': 0.0002326502893763207, 'samples': 15220416, 'steps': 79272, 'loss/train': 1.1494591236114502} 11/07/2021 08:23:36 - INFO - __main__ - Step 79274: {'lr': 0.0002326449954304758, 'samples': 15220608, 'steps': 79273, 'loss/train': 1.2784368991851807} 11/07/2021 08:23:36 - INFO - __main__ - Step 79275: {'lr': 0.00023263970149245083, 'samples': 15220800, 'steps': 79274, 'loss/train': 1.7190133333206177} 11/07/2021 08:23:37 - INFO - __main__ - Step 79276: {'lr': 0.00023263440756224812, 'samples': 15220992, 'steps': 79275, 'loss/train': 5.7585906982421875} 11/07/2021 08:23:37 - INFO - __main__ - Step 79277: {'lr': 0.00023262911363987004, 'samples': 15221184, 'steps': 79276, 'loss/train': 1.1436561346054077} 11/07/2021 08:23:38 - INFO - __main__ - Step 79278: {'lr': 0.00023262381972531906, 'samples': 15221376, 'steps': 79277, 'loss/train': 1.3434067964553833} 11/07/2021 08:23:39 - INFO - __main__ - Step 79279: {'lr': 0.00023261852581859749, 'samples': 15221568, 'steps': 79278, 'loss/train': 1.5774697065353394} 11/07/2021 08:23:39 - INFO - __main__ - Step 79280: {'lr': 0.00023261323191970775, 'samples': 15221760, 'steps': 79279, 'loss/train': 1.357744574546814} 11/07/2021 08:23:39 - INFO - __main__ - Step 79281: {'lr': 0.00023260793802865225, 'samples': 15221952, 'steps': 79280, 'loss/train': 1.5860165357589722} 11/07/2021 08:23:40 - INFO - __main__ - Step 79282: {'lr': 0.0002326026441454333, 'samples': 15222144, 'steps': 79281, 'loss/train': 1.4235786199569702} 11/07/2021 08:23:41 - INFO - __main__ - Step 79283: {'lr': 0.00023259735027005338, 'samples': 15222336, 'steps': 79282, 'loss/train': 1.5066219568252563} 11/07/2021 08:23:41 - INFO - __main__ - Step 79284: {'lr': 0.0002325920564025148, 'samples': 15222528, 'steps': 79283, 'loss/train': 1.4037792682647705} 11/07/2021 08:23:41 - INFO - __main__ - Step 79285: {'lr': 0.00023258676254281997, 'samples': 15222720, 'steps': 79284, 'loss/train': 1.4326006174087524} 11/07/2021 08:23:42 - INFO - __main__ - Step 79286: {'lr': 0.00023258146869097128, 'samples': 15222912, 'steps': 79285, 'loss/train': 1.6085623502731323} 11/07/2021 08:23:42 - INFO - __main__ - Step 79287: {'lr': 0.00023257617484697107, 'samples': 15223104, 'steps': 79286, 'loss/train': 1.0833748579025269} 11/07/2021 08:23:43 - INFO - __main__ - Step 79288: {'lr': 0.0002325708810108218, 'samples': 15223296, 'steps': 79287, 'loss/train': 1.5850975513458252} 11/07/2021 08:23:43 - INFO - __main__ - Step 79289: {'lr': 0.0002325655871825259, 'samples': 15223488, 'steps': 79288, 'loss/train': 1.3168023824691772} 11/07/2021 08:23:44 - INFO - __main__ - Step 79290: {'lr': 0.00023256029336208556, 'samples': 15223680, 'steps': 79289, 'loss/train': 1.8390777111053467} 11/07/2021 08:23:44 - INFO - __main__ - Step 79291: {'lr': 0.00023255499954950333, 'samples': 15223872, 'steps': 79290, 'loss/train': 1.0111920833587646} 11/07/2021 08:23:44 - INFO - __main__ - Step 79292: {'lr': 0.00023254970574478154, 'samples': 15224064, 'steps': 79291, 'loss/train': 1.0953881740570068} 11/07/2021 08:23:45 - INFO - __main__ - Step 79293: {'lr': 0.00023254441194792258, 'samples': 15224256, 'steps': 79292, 'loss/train': 1.3632129430770874} 11/07/2021 08:23:46 - INFO - __main__ - Step 79294: {'lr': 0.00023253911815892888, 'samples': 15224448, 'steps': 79293, 'loss/train': 0.9986279010772705} 11/07/2021 08:23:46 - INFO - __main__ - Step 79295: {'lr': 0.00023253382437780275, 'samples': 15224640, 'steps': 79294, 'loss/train': 1.090044379234314} 11/07/2021 08:23:47 - INFO - __main__ - Step 79296: {'lr': 0.0002325285306045466, 'samples': 15224832, 'steps': 79295, 'loss/train': 1.3060760498046875} 11/07/2021 08:23:47 - INFO - __main__ - Step 79297: {'lr': 0.00023252323683916283, 'samples': 15225024, 'steps': 79296, 'loss/train': 0.8223416805267334} 11/07/2021 08:23:47 - INFO - __main__ - Step 79298: {'lr': 0.0002325179430816538, 'samples': 15225216, 'steps': 79297, 'loss/train': 1.2629300355911255} 11/07/2021 08:23:48 - INFO - __main__ - Step 79299: {'lr': 0.00023251264933202192, 'samples': 15225408, 'steps': 79298, 'loss/train': 1.4792897701263428} 11/07/2021 08:23:49 - INFO - __main__ - Step 79300: {'lr': 0.0002325073555902696, 'samples': 15225600, 'steps': 79299, 'loss/train': 1.2232736349105835} 11/07/2021 08:23:49 - INFO - __main__ - Step 79301: {'lr': 0.00023250206185639917, 'samples': 15225792, 'steps': 79300, 'loss/train': 0.1465628445148468} 11/07/2021 08:23:49 - INFO - __main__ - Step 79302: {'lr': 0.0002324967681304131, 'samples': 15225984, 'steps': 79301, 'loss/train': 1.2194322347640991} 11/07/2021 08:23:50 - INFO - __main__ - Step 79303: {'lr': 0.00023249147441231367, 'samples': 15226176, 'steps': 79302, 'loss/train': 1.3201889991760254} 11/07/2021 08:23:51 - INFO - __main__ - Step 79304: {'lr': 0.0002324861807021033, 'samples': 15226368, 'steps': 79303, 'loss/train': 1.74861741065979} 11/07/2021 08:23:51 - INFO - __main__ - Step 79305: {'lr': 0.00023248088699978446, 'samples': 15226560, 'steps': 79304, 'loss/train': 1.5743396282196045} 11/07/2021 08:23:52 - INFO - __main__ - Step 79306: {'lr': 0.00023247559330535938, 'samples': 15226752, 'steps': 79305, 'loss/train': 1.7616487741470337} 11/07/2021 08:23:52 - INFO - __main__ - Step 79307: {'lr': 0.00023247029961883053, 'samples': 15226944, 'steps': 79306, 'loss/train': 1.4580777883529663} 11/07/2021 08:23:52 - INFO - __main__ - Step 79308: {'lr': 0.00023246500594020032, 'samples': 15227136, 'steps': 79307, 'loss/train': 1.620802402496338} 11/07/2021 08:23:53 - INFO - __main__ - Step 79309: {'lr': 0.0002324597122694711, 'samples': 15227328, 'steps': 79308, 'loss/train': 1.3757174015045166} 11/07/2021 08:23:54 - INFO - __main__ - Step 79310: {'lr': 0.00023245441860664525, 'samples': 15227520, 'steps': 79309, 'loss/train': 0.871997594833374} 11/07/2021 08:23:54 - INFO - __main__ - Step 79311: {'lr': 0.00023244912495172515, 'samples': 15227712, 'steps': 79310, 'loss/train': 1.6329517364501953} 11/07/2021 08:23:54 - INFO - __main__ - Step 79312: {'lr': 0.00023244383130471326, 'samples': 15227904, 'steps': 79311, 'loss/train': 2.1226634979248047} 11/07/2021 08:23:55 - INFO - __main__ - Step 79313: {'lr': 0.00023243853766561186, 'samples': 15228096, 'steps': 79312, 'loss/train': 1.3318076133728027} 11/07/2021 08:23:56 - INFO - __main__ - Step 79314: {'lr': 0.0002324332440344234, 'samples': 15228288, 'steps': 79313, 'loss/train': 0.9030703902244568} 11/07/2021 08:23:56 - INFO - __main__ - Step 79315: {'lr': 0.00023242795041115023, 'samples': 15228480, 'steps': 79314, 'loss/train': 1.2358404397964478} 11/07/2021 08:23:56 - INFO - __main__ - Step 79316: {'lr': 0.0002324226567957949, 'samples': 15228672, 'steps': 79315, 'loss/train': 1.4345242977142334} 11/07/2021 08:23:57 - INFO - __main__ - Step 79317: {'lr': 0.00023241736318835952, 'samples': 15228864, 'steps': 79316, 'loss/train': 0.6873733401298523} 11/07/2021 08:23:57 - INFO - __main__ - Step 79318: {'lr': 0.00023241206958884658, 'samples': 15229056, 'steps': 79317, 'loss/train': 1.5186179876327515} 11/07/2021 08:23:58 - INFO - __main__ - Step 79319: {'lr': 0.00023240677599725853, 'samples': 15229248, 'steps': 79318, 'loss/train': 1.55604887008667} 11/07/2021 08:23:58 - INFO - __main__ - Step 79320: {'lr': 0.0002324014824135977, 'samples': 15229440, 'steps': 79319, 'loss/train': 1.4040621519088745} 11/07/2021 08:23:59 - INFO - __main__ - Step 79321: {'lr': 0.00023239618883786652, 'samples': 15229632, 'steps': 79320, 'loss/train': 1.501682996749878} 11/07/2021 08:23:59 - INFO - __main__ - Step 79322: {'lr': 0.0002323908952700673, 'samples': 15229824, 'steps': 79321, 'loss/train': 1.4015249013900757} 11/07/2021 08:23:59 - INFO - __main__ - Step 79323: {'lr': 0.0002323856017102025, 'samples': 15230016, 'steps': 79322, 'loss/train': 1.7136762142181396} 11/07/2021 08:24:00 - INFO - __main__ - Step 79324: {'lr': 0.00023238030815827445, 'samples': 15230208, 'steps': 79323, 'loss/train': 1.454314947128296} 11/07/2021 08:24:01 - INFO - __main__ - Step 79325: {'lr': 0.00023237501461428555, 'samples': 15230400, 'steps': 79324, 'loss/train': 1.3848026990890503} 11/07/2021 08:24:01 - INFO - __main__ - Step 79326: {'lr': 0.00023236972107823825, 'samples': 15230592, 'steps': 79325, 'loss/train': 1.6366961002349854} 11/07/2021 08:24:01 - INFO - __main__ - Step 79327: {'lr': 0.00023236442755013485, 'samples': 15230784, 'steps': 79326, 'loss/train': 1.487972378730774} 11/07/2021 08:24:02 - INFO - __main__ - Step 79328: {'lr': 0.00023235913402997778, 'samples': 15230976, 'steps': 79327, 'loss/train': 1.3670638799667358} 11/07/2021 08:24:02 - INFO - __main__ - Step 79329: {'lr': 0.0002323538405177695, 'samples': 15231168, 'steps': 79328, 'loss/train': 1.3504809141159058} 11/07/2021 08:24:03 - INFO - __main__ - Step 79330: {'lr': 0.0002323485470135122, 'samples': 15231360, 'steps': 79329, 'loss/train': 1.577420711517334} 11/07/2021 08:24:04 - INFO - __main__ - Step 79331: {'lr': 0.0002323432535172084, 'samples': 15231552, 'steps': 79330, 'loss/train': 1.0527249574661255} 11/07/2021 08:24:04 - INFO - __main__ - Step 79332: {'lr': 0.00023233796002886044, 'samples': 15231744, 'steps': 79331, 'loss/train': 1.2644673585891724} 11/07/2021 08:24:04 - INFO - __main__ - Step 79333: {'lr': 0.0002323326665484707, 'samples': 15231936, 'steps': 79332, 'loss/train': 0.46686995029449463} 11/07/2021 08:24:05 - INFO - __main__ - Step 79334: {'lr': 0.00023232737307604163, 'samples': 15232128, 'steps': 79333, 'loss/train': 1.7230169773101807} 11/07/2021 08:24:06 - INFO - __main__ - Step 79335: {'lr': 0.00023232207961157555, 'samples': 15232320, 'steps': 79334, 'loss/train': 1.6353979110717773} 11/07/2021 08:24:06 - INFO - __main__ - Step 79336: {'lr': 0.00023231678615507487, 'samples': 15232512, 'steps': 79335, 'loss/train': 1.834824800491333} 11/07/2021 08:24:07 - INFO - __main__ - Step 79337: {'lr': 0.00023231149270654198, 'samples': 15232704, 'steps': 79336, 'loss/train': 1.2709972858428955} 11/07/2021 08:24:07 - INFO - __main__ - Step 79338: {'lr': 0.00023230619926597923, 'samples': 15232896, 'steps': 79337, 'loss/train': 1.2398263216018677} 11/07/2021 08:24:07 - INFO - __main__ - Step 79339: {'lr': 0.00023230090583338907, 'samples': 15233088, 'steps': 79338, 'loss/train': 1.7251631021499634} 11/07/2021 08:24:08 - INFO - __main__ - Step 79340: {'lr': 0.00023229561240877385, 'samples': 15233280, 'steps': 79339, 'loss/train': 1.4816089868545532} 11/07/2021 08:24:09 - INFO - __main__ - Step 79341: {'lr': 0.00023229031899213594, 'samples': 15233472, 'steps': 79340, 'loss/train': 0.9207149744033813} 11/07/2021 08:24:09 - INFO - __main__ - Step 79342: {'lr': 0.0002322850255834777, 'samples': 15233664, 'steps': 79341, 'loss/train': 1.3581275939941406} 11/07/2021 08:24:09 - INFO - __main__ - Step 79343: {'lr': 0.00023227973218280175, 'samples': 15233856, 'steps': 79342, 'loss/train': 1.39609956741333} 11/07/2021 08:24:10 - INFO - __main__ - Step 79344: {'lr': 0.0002322744387901101, 'samples': 15234048, 'steps': 79343, 'loss/train': 1.3648910522460938} 11/07/2021 08:24:11 - INFO - __main__ - Step 79345: {'lr': 0.00023226914540540534, 'samples': 15234240, 'steps': 79344, 'loss/train': 1.1627647876739502} 11/07/2021 08:24:11 - INFO - __main__ - Step 79346: {'lr': 0.00023226385202868984, 'samples': 15234432, 'steps': 79345, 'loss/train': 1.2489286661148071} 11/07/2021 08:24:12 - INFO - __main__ - Step 79347: {'lr': 0.00023225855865996594, 'samples': 15234624, 'steps': 79346, 'loss/train': 1.2679617404937744} 11/07/2021 08:24:12 - INFO - __main__ - Step 79348: {'lr': 0.00023225326529923608, 'samples': 15234816, 'steps': 79347, 'loss/train': 1.4454456567764282} 11/07/2021 08:24:12 - INFO - __main__ - Step 79349: {'lr': 0.00023224797194650263, 'samples': 15235008, 'steps': 79348, 'loss/train': 1.6476624011993408} 11/07/2021 08:24:13 - INFO - __main__ - Step 79350: {'lr': 0.00023224267860176795, 'samples': 15235200, 'steps': 79349, 'loss/train': 1.191177248954773} 11/07/2021 08:24:14 - INFO - __main__ - Step 79351: {'lr': 0.00023223738526503447, 'samples': 15235392, 'steps': 79350, 'loss/train': 1.5252631902694702} 11/07/2021 08:24:14 - INFO - __main__ - Step 79352: {'lr': 0.00023223209193630454, 'samples': 15235584, 'steps': 79351, 'loss/train': 1.2713568210601807} 11/07/2021 08:24:14 - INFO - __main__ - Step 79353: {'lr': 0.00023222679861558055, 'samples': 15235776, 'steps': 79352, 'loss/train': 1.6771903038024902} 11/07/2021 08:24:15 - INFO - __main__ - Step 79354: {'lr': 0.0002322215053028649, 'samples': 15235968, 'steps': 79353, 'loss/train': 1.4523099660873413} 11/07/2021 08:24:15 - INFO - __main__ - Step 79355: {'lr': 0.00023221621199815995, 'samples': 15236160, 'steps': 79354, 'loss/train': 1.5404949188232422} 11/07/2021 08:24:16 - INFO - __main__ - Step 79356: {'lr': 0.00023221091870146823, 'samples': 15236352, 'steps': 79355, 'loss/train': 1.0778933763504028} 11/07/2021 08:24:17 - INFO - __main__ - Step 79357: {'lr': 0.0002322056254127919, 'samples': 15236544, 'steps': 79356, 'loss/train': 1.52402663230896} 11/07/2021 08:24:17 - INFO - __main__ - Step 79358: {'lr': 0.0002322003321321334, 'samples': 15236736, 'steps': 79357, 'loss/train': 1.4027888774871826} 11/07/2021 08:24:17 - INFO - __main__ - Step 79359: {'lr': 0.00023219503885949517, 'samples': 15236928, 'steps': 79358, 'loss/train': 1.1096720695495605} 11/07/2021 08:24:18 - INFO - __main__ - Step 79360: {'lr': 0.00023218974559487956, 'samples': 15237120, 'steps': 79359, 'loss/train': 1.2412766218185425} 11/07/2021 08:24:19 - INFO - __main__ - Step 79361: {'lr': 0.00023218445233828903, 'samples': 15237312, 'steps': 79360, 'loss/train': 1.506622076034546} 11/07/2021 08:24:19 - INFO - __main__ - Step 79362: {'lr': 0.00023217915908972588, 'samples': 15237504, 'steps': 79361, 'loss/train': 0.6386646032333374} 11/07/2021 08:24:19 - INFO - __main__ - Step 79363: {'lr': 0.00023217386584919252, 'samples': 15237696, 'steps': 79362, 'loss/train': 1.2894153594970703} 11/07/2021 08:24:20 - INFO - __main__ - Step 79364: {'lr': 0.00023216857261669133, 'samples': 15237888, 'steps': 79363, 'loss/train': 1.221582055091858} 11/07/2021 08:24:20 - INFO - __main__ - Step 79365: {'lr': 0.00023216327939222473, 'samples': 15238080, 'steps': 79364, 'loss/train': 1.4498575925827026} 11/07/2021 08:24:21 - INFO - __main__ - Step 79366: {'lr': 0.00023215798617579509, 'samples': 15238272, 'steps': 79365, 'loss/train': 0.8788038492202759} 11/07/2021 08:24:21 - INFO - __main__ - Step 79367: {'lr': 0.00023215269296740477, 'samples': 15238464, 'steps': 79366, 'loss/train': 1.3904895782470703} 11/07/2021 08:24:22 - INFO - __main__ - Step 79368: {'lr': 0.00023214739976705614, 'samples': 15238656, 'steps': 79367, 'loss/train': 1.606799602508545} 11/07/2021 08:24:22 - INFO - __main__ - Step 79369: {'lr': 0.00023214210657475178, 'samples': 15238848, 'steps': 79368, 'loss/train': 1.1918920278549194} 11/07/2021 08:24:22 - INFO - __main__ - Step 79370: {'lr': 0.00023213681339049377, 'samples': 15239040, 'steps': 79369, 'loss/train': 1.589494228363037} 11/07/2021 08:24:23 - INFO - __main__ - Step 79371: {'lr': 0.00023213152021428466, 'samples': 15239232, 'steps': 79370, 'loss/train': 1.5864378213882446} 11/07/2021 08:24:24 - INFO - __main__ - Step 79372: {'lr': 0.00023212622704612678, 'samples': 15239424, 'steps': 79371, 'loss/train': 1.447008728981018} 11/07/2021 08:24:24 - INFO - __main__ - Step 79373: {'lr': 0.00023212093388602257, 'samples': 15239616, 'steps': 79372, 'loss/train': 1.6674143075942993} 11/07/2021 08:24:24 - INFO - __main__ - Step 79374: {'lr': 0.00023211564073397436, 'samples': 15239808, 'steps': 79373, 'loss/train': 1.0001037120819092} 11/07/2021 08:24:25 - INFO - __main__ - Step 79375: {'lr': 0.00023211034758998463, 'samples': 15240000, 'steps': 79374, 'loss/train': 0.8398929834365845} 11/07/2021 08:24:26 - INFO - __main__ - Step 79376: {'lr': 0.00023210505445405563, 'samples': 15240192, 'steps': 79375, 'loss/train': 1.1636691093444824} 11/07/2021 08:24:26 - INFO - __main__ - Step 79377: {'lr': 0.00023209976132618988, 'samples': 15240384, 'steps': 79376, 'loss/train': 1.428697943687439} 11/07/2021 08:24:27 - INFO - __main__ - Step 79378: {'lr': 0.00023209446820638967, 'samples': 15240576, 'steps': 79377, 'loss/train': 1.3042436838150024} 11/07/2021 08:24:27 - INFO - __main__ - Step 79379: {'lr': 0.00023208917509465738, 'samples': 15240768, 'steps': 79378, 'loss/train': 1.1410346031188965} 11/07/2021 08:24:27 - INFO - __main__ - Step 79380: {'lr': 0.00023208388199099546, 'samples': 15240960, 'steps': 79379, 'loss/train': 1.2497917413711548} 11/07/2021 08:24:28 - INFO - __main__ - Step 79381: {'lr': 0.00023207858889540627, 'samples': 15241152, 'steps': 79380, 'loss/train': 0.9170209169387817} 11/07/2021 08:24:29 - INFO - __main__ - Step 79382: {'lr': 0.00023207329580789222, 'samples': 15241344, 'steps': 79381, 'loss/train': 1.5171867609024048} 11/07/2021 08:24:29 - INFO - __main__ - Step 79383: {'lr': 0.00023206800272845574, 'samples': 15241536, 'steps': 79382, 'loss/train': 1.357049584388733} 11/07/2021 08:24:29 - INFO - __main__ - Step 79384: {'lr': 0.00023206270965709906, 'samples': 15241728, 'steps': 79383, 'loss/train': 1.5833055973052979} 11/07/2021 08:24:30 - INFO - __main__ - Step 79385: {'lr': 0.00023205741659382463, 'samples': 15241920, 'steps': 79384, 'loss/train': 1.3941882848739624} 11/07/2021 08:24:30 - INFO - __main__ - Step 79386: {'lr': 0.00023205212353863484, 'samples': 15242112, 'steps': 79385, 'loss/train': 1.1481996774673462} 11/07/2021 08:24:31 - INFO - __main__ - Step 79387: {'lr': 0.0002320468304915321, 'samples': 15242304, 'steps': 79386, 'loss/train': 1.0214793682098389} 11/07/2021 08:24:31 - INFO - __main__ - Step 79388: {'lr': 0.00023204153745251877, 'samples': 15242496, 'steps': 79387, 'loss/train': 1.2809851169586182} 11/07/2021 08:24:32 - INFO - __main__ - Step 79389: {'lr': 0.00023203624442159727, 'samples': 15242688, 'steps': 79388, 'loss/train': 1.690600037574768} 11/07/2021 08:24:32 - INFO - __main__ - Step 79390: {'lr': 0.00023203095139876992, 'samples': 15242880, 'steps': 79389, 'loss/train': 1.6263920068740845} 11/07/2021 08:24:32 - INFO - __main__ - Step 79391: {'lr': 0.00023202565838403917, 'samples': 15243072, 'steps': 79390, 'loss/train': 0.7409564256668091} 11/07/2021 08:24:33 - INFO - __main__ - Step 79392: {'lr': 0.00023202036537740738, 'samples': 15243264, 'steps': 79391, 'loss/train': 1.6689879894256592} 11/07/2021 08:24:34 - INFO - __main__ - Step 79393: {'lr': 0.00023201507237887695, 'samples': 15243456, 'steps': 79392, 'loss/train': 1.300165057182312} 11/07/2021 08:24:34 - INFO - __main__ - Step 79394: {'lr': 0.00023200977938845022, 'samples': 15243648, 'steps': 79393, 'loss/train': 1.1755506992340088} 11/07/2021 08:24:34 - INFO - __main__ - Step 79395: {'lr': 0.0002320044864061297, 'samples': 15243840, 'steps': 79394, 'loss/train': 1.190096139907837} 11/07/2021 08:24:35 - INFO - __main__ - Step 79396: {'lr': 0.00023199919343191763, 'samples': 15244032, 'steps': 79395, 'loss/train': 1.608437418937683} 11/07/2021 08:24:36 - INFO - __main__ - Step 79397: {'lr': 0.00023199390046581644, 'samples': 15244224, 'steps': 79396, 'loss/train': 1.560620903968811} 11/07/2021 08:24:36 - INFO - __main__ - Step 79398: {'lr': 0.0002319886075078285, 'samples': 15244416, 'steps': 79397, 'loss/train': 1.2538100481033325} 11/07/2021 08:24:37 - INFO - __main__ - Step 79399: {'lr': 0.0002319833145579562, 'samples': 15244608, 'steps': 79398, 'loss/train': 0.9340146780014038} 11/07/2021 08:24:37 - INFO - __main__ - Step 79400: {'lr': 0.00023197802161620197, 'samples': 15244800, 'steps': 79399, 'loss/train': 1.1764893531799316} 11/07/2021 08:24:37 - INFO - __main__ - Step 79401: {'lr': 0.00023197272868256816, 'samples': 15244992, 'steps': 79400, 'loss/train': 1.3651752471923828} 11/07/2021 08:24:38 - INFO - __main__ - Step 79402: {'lr': 0.00023196743575705714, 'samples': 15245184, 'steps': 79401, 'loss/train': 1.8242478370666504} 11/07/2021 08:24:39 - INFO - __main__ - Step 79403: {'lr': 0.00023196214283967132, 'samples': 15245376, 'steps': 79402, 'loss/train': 1.171291708946228} 11/07/2021 08:24:39 - INFO - __main__ - Step 79404: {'lr': 0.00023195684993041312, 'samples': 15245568, 'steps': 79403, 'loss/train': 1.3272444009780884} 11/07/2021 08:24:39 - INFO - __main__ - Step 79405: {'lr': 0.00023195155702928483, 'samples': 15245760, 'steps': 79404, 'loss/train': 2.2344765663146973} 11/07/2021 08:24:40 - INFO - __main__ - Step 79406: {'lr': 0.00023194626413628898, 'samples': 15245952, 'steps': 79405, 'loss/train': 1.063006043434143} 11/07/2021 08:24:41 - INFO - __main__ - Step 79407: {'lr': 0.00023194097125142776, 'samples': 15246144, 'steps': 79406, 'loss/train': 0.9586953520774841} 11/07/2021 08:24:41 - INFO - __main__ - Step 79408: {'lr': 0.00023193567837470372, 'samples': 15246336, 'steps': 79407, 'loss/train': 1.4355149269104004} 11/07/2021 08:24:41 - INFO - __main__ - Step 79409: {'lr': 0.00023193038550611917, 'samples': 15246528, 'steps': 79408, 'loss/train': 1.3049103021621704} 11/07/2021 08:24:42 - INFO - __main__ - Step 79410: {'lr': 0.00023192509264567654, 'samples': 15246720, 'steps': 79409, 'loss/train': 1.1346160173416138} 11/07/2021 08:24:42 - INFO - __main__ - Step 79411: {'lr': 0.00023191979979337815, 'samples': 15246912, 'steps': 79410, 'loss/train': 1.2888894081115723} 11/07/2021 08:24:43 - INFO - __main__ - Step 79412: {'lr': 0.00023191450694922642, 'samples': 15247104, 'steps': 79411, 'loss/train': 1.237973928451538} 11/07/2021 08:24:43 - INFO - __main__ - Step 79413: {'lr': 0.0002319092141132237, 'samples': 15247296, 'steps': 79412, 'loss/train': 1.2970833778381348} 11/07/2021 08:24:44 - INFO - __main__ - Step 79414: {'lr': 0.00023190392128537247, 'samples': 15247488, 'steps': 79413, 'loss/train': 1.5298140048980713} 11/07/2021 08:24:44 - INFO - __main__ - Step 79415: {'lr': 0.000231898628465675, 'samples': 15247680, 'steps': 79414, 'loss/train': 1.4100698232650757} 11/07/2021 08:24:44 - INFO - __main__ - Step 79416: {'lr': 0.00023189333565413377, 'samples': 15247872, 'steps': 79415, 'loss/train': 1.369760274887085} 11/07/2021 08:24:45 - INFO - __main__ - Step 79417: {'lr': 0.00023188804285075116, 'samples': 15248064, 'steps': 79416, 'loss/train': 1.3430336713790894} 11/07/2021 08:24:46 - INFO - __main__ - Step 79418: {'lr': 0.00023188275005552945, 'samples': 15248256, 'steps': 79417, 'loss/train': 1.5535914897918701} 11/07/2021 08:24:46 - INFO - __main__ - Step 79419: {'lr': 0.0002318774572684711, 'samples': 15248448, 'steps': 79418, 'loss/train': 1.3531484603881836} 11/07/2021 08:24:46 - INFO - __main__ - Step 79420: {'lr': 0.0002318721644895785, 'samples': 15248640, 'steps': 79419, 'loss/train': 0.8405978083610535} 11/07/2021 08:24:47 - INFO - __main__ - Step 79421: {'lr': 0.000231866871718854, 'samples': 15248832, 'steps': 79420, 'loss/train': 1.2949519157409668} 11/07/2021 08:24:47 - INFO - __main__ - Step 79422: {'lr': 0.00023186157895630004, 'samples': 15249024, 'steps': 79421, 'loss/train': 1.2869014739990234} 11/07/2021 08:24:48 - INFO - __main__ - Step 79423: {'lr': 0.000231856286201919, 'samples': 15249216, 'steps': 79422, 'loss/train': 1.2090624570846558} 11/07/2021 08:24:49 - INFO - __main__ - Step 79424: {'lr': 0.0002318509934557132, 'samples': 15249408, 'steps': 79423, 'loss/train': 1.3191158771514893} 11/07/2021 08:24:49 - INFO - __main__ - Step 79425: {'lr': 0.00023184570071768508, 'samples': 15249600, 'steps': 79424, 'loss/train': 1.3753005266189575} 11/07/2021 08:24:49 - INFO - __main__ - Step 79426: {'lr': 0.00023184040798783696, 'samples': 15249792, 'steps': 79425, 'loss/train': 1.257692575454712} 11/07/2021 08:24:50 - INFO - __main__ - Step 79427: {'lr': 0.0002318351152661713, 'samples': 15249984, 'steps': 79426, 'loss/train': 1.332297921180725} 11/07/2021 08:24:51 - INFO - __main__ - Step 79428: {'lr': 0.0002318298225526905, 'samples': 15250176, 'steps': 79427, 'loss/train': 1.8571020364761353} 11/07/2021 08:24:51 - INFO - __main__ - Step 79429: {'lr': 0.00023182452984739686, 'samples': 15250368, 'steps': 79428, 'loss/train': 1.0184612274169922} 11/07/2021 08:24:51 - INFO - __main__ - Step 79430: {'lr': 0.00023181923715029278, 'samples': 15250560, 'steps': 79429, 'loss/train': 1.3615764379501343} 11/07/2021 08:24:52 - INFO - __main__ - Step 79431: {'lr': 0.00023181394446138072, 'samples': 15250752, 'steps': 79430, 'loss/train': 1.9239321947097778} 11/07/2021 08:24:52 - INFO - __main__ - Step 79432: {'lr': 0.00023180865178066298, 'samples': 15250944, 'steps': 79431, 'loss/train': 1.2540556192398071} 11/07/2021 08:24:53 - INFO - __main__ - Step 79433: {'lr': 0.00023180335910814198, 'samples': 15251136, 'steps': 79432, 'loss/train': 1.2285747528076172} 11/07/2021 08:24:54 - INFO - __main__ - Step 79434: {'lr': 0.0002317980664438201, 'samples': 15251328, 'steps': 79433, 'loss/train': 1.3837521076202393} 11/07/2021 08:24:54 - INFO - __main__ - Step 79435: {'lr': 0.00023179277378769975, 'samples': 15251520, 'steps': 79434, 'loss/train': 1.1557294130325317} 11/07/2021 08:24:54 - INFO - __main__ - Step 79436: {'lr': 0.00023178748113978332, 'samples': 15251712, 'steps': 79435, 'loss/train': 1.6231330633163452} 11/07/2021 08:24:55 - INFO - __main__ - Step 79437: {'lr': 0.00023178218850007317, 'samples': 15251904, 'steps': 79436, 'loss/train': 1.3658252954483032} 11/07/2021 08:24:56 - INFO - __main__ - Step 79438: {'lr': 0.00023177689586857165, 'samples': 15252096, 'steps': 79437, 'loss/train': 1.263288974761963} 11/07/2021 08:24:56 - INFO - __main__ - Step 79439: {'lr': 0.00023177160324528123, 'samples': 15252288, 'steps': 79438, 'loss/train': 1.6159965991973877} 11/07/2021 08:24:57 - INFO - __main__ - Step 79440: {'lr': 0.0002317663106302042, 'samples': 15252480, 'steps': 79439, 'loss/train': 1.3942173719406128} 11/07/2021 08:24:57 - INFO - __main__ - Step 79441: {'lr': 0.00023176101802334302, 'samples': 15252672, 'steps': 79440, 'loss/train': 1.4755074977874756} 11/07/2021 08:24:57 - INFO - __main__ - Step 79442: {'lr': 0.00023175572542469998, 'samples': 15252864, 'steps': 79441, 'loss/train': 1.2727272510528564} 11/07/2021 08:24:58 - INFO - __main__ - Step 79443: {'lr': 0.00023175043283427758, 'samples': 15253056, 'steps': 79442, 'loss/train': 1.5820308923721313} 11/07/2021 08:24:59 - INFO - __main__ - Step 79444: {'lr': 0.00023174514025207812, 'samples': 15253248, 'steps': 79443, 'loss/train': 1.654071569442749} 11/07/2021 08:24:59 - INFO - __main__ - Step 79445: {'lr': 0.00023173984767810402, 'samples': 15253440, 'steps': 79444, 'loss/train': 1.2499686479568481} 11/07/2021 08:24:59 - INFO - __main__ - Step 79446: {'lr': 0.00023173455511235768, 'samples': 15253632, 'steps': 79445, 'loss/train': 1.78867506980896} 11/07/2021 08:25:00 - INFO - __main__ - Step 79447: {'lr': 0.00023172926255484146, 'samples': 15253824, 'steps': 79446, 'loss/train': 1.33341646194458} 11/07/2021 08:25:01 - INFO - __main__ - Step 79448: {'lr': 0.00023172397000555776, 'samples': 15254016, 'steps': 79447, 'loss/train': 1.1664210557937622} 11/07/2021 08:25:01 - INFO - __main__ - Step 79449: {'lr': 0.00023171867746450895, 'samples': 15254208, 'steps': 79448, 'loss/train': 1.2761154174804688} 11/07/2021 08:25:01 - INFO - __main__ - Step 79450: {'lr': 0.00023171338493169753, 'samples': 15254400, 'steps': 79449, 'loss/train': 1.3931586742401123} 11/07/2021 08:25:02 - INFO - __main__ - Step 79451: {'lr': 0.00023170809240712566, 'samples': 15254592, 'steps': 79450, 'loss/train': 1.6822530031204224} 11/07/2021 08:25:02 - INFO - __main__ - Step 79452: {'lr': 0.00023170279989079588, 'samples': 15254784, 'steps': 79451, 'loss/train': 1.4689079523086548} 11/07/2021 08:25:03 - INFO - __main__ - Step 79453: {'lr': 0.0002316975073827105, 'samples': 15254976, 'steps': 79452, 'loss/train': 0.9494279623031616} 11/07/2021 08:25:04 - INFO - __main__ - Step 79454: {'lr': 0.00023169221488287194, 'samples': 15255168, 'steps': 79453, 'loss/train': 0.971210241317749} 11/07/2021 08:25:04 - INFO - __main__ - Step 79455: {'lr': 0.0002316869223912826, 'samples': 15255360, 'steps': 79454, 'loss/train': 1.815500020980835} 11/07/2021 08:25:04 - INFO - __main__ - Step 79456: {'lr': 0.00023168162990794484, 'samples': 15255552, 'steps': 79455, 'loss/train': 1.7862575054168701} 11/07/2021 08:25:05 - INFO - __main__ - Step 79457: {'lr': 0.00023167633743286103, 'samples': 15255744, 'steps': 79456, 'loss/train': 1.0951443910598755} 11/07/2021 08:25:05 - INFO - __main__ - Step 79458: {'lr': 0.00023167104496603363, 'samples': 15255936, 'steps': 79457, 'loss/train': 1.4028515815734863} 11/07/2021 08:25:06 - INFO - __main__ - Step 79459: {'lr': 0.00023166575250746496, 'samples': 15256128, 'steps': 79458, 'loss/train': 0.33724886178970337} 11/07/2021 08:25:06 - INFO - __main__ - Step 79460: {'lr': 0.0002316604600571574, 'samples': 15256320, 'steps': 79459, 'loss/train': 1.0531797409057617} 11/07/2021 08:25:07 - INFO - __main__ - Step 79461: {'lr': 0.00023165516761511338, 'samples': 15256512, 'steps': 79460, 'loss/train': 1.4812167882919312} 11/07/2021 08:25:07 - INFO - __main__ - Step 79462: {'lr': 0.00023164987518133523, 'samples': 15256704, 'steps': 79461, 'loss/train': 1.4441653490066528} 11/07/2021 08:25:08 - INFO - __main__ - Step 79463: {'lr': 0.00023164458275582537, 'samples': 15256896, 'steps': 79462, 'loss/train': 1.2292476892471313} 11/07/2021 08:25:09 - INFO - __main__ - Step 79464: {'lr': 0.0002316392903385863, 'samples': 15257088, 'steps': 79463, 'loss/train': 1.4278147220611572} 11/07/2021 08:25:09 - INFO - __main__ - Step 79465: {'lr': 0.00023163399792962017, 'samples': 15257280, 'steps': 79464, 'loss/train': 1.8533909320831299} 11/07/2021 08:25:09 - INFO - __main__ - Step 79466: {'lr': 0.00023162870552892947, 'samples': 15257472, 'steps': 79465, 'loss/train': 1.5825752019882202} 11/07/2021 08:25:10 - INFO - __main__ - Step 79467: {'lr': 0.0002316234131365166, 'samples': 15257664, 'steps': 79466, 'loss/train': 1.887851595878601} 11/07/2021 08:25:10 - INFO - __main__ - Step 79468: {'lr': 0.00023161812075238393, 'samples': 15257856, 'steps': 79467, 'loss/train': 1.5995460748672485} 11/07/2021 08:25:11 - INFO - __main__ - Step 79469: {'lr': 0.00023161282837653386, 'samples': 15258048, 'steps': 79468, 'loss/train': 1.3449106216430664} 11/07/2021 08:25:11 - INFO - __main__ - Step 79470: {'lr': 0.00023160753600896876, 'samples': 15258240, 'steps': 79469, 'loss/train': 1.385043740272522} 11/07/2021 08:25:12 - INFO - __main__ - Step 79471: {'lr': 0.00023160224364969102, 'samples': 15258432, 'steps': 79470, 'loss/train': 1.4092565774917603} 11/07/2021 08:25:12 - INFO - __main__ - Step 79472: {'lr': 0.00023159695129870302, 'samples': 15258624, 'steps': 79471, 'loss/train': 0.9745123982429504} 11/07/2021 08:25:12 - INFO - __main__ - Step 79473: {'lr': 0.00023159165895600715, 'samples': 15258816, 'steps': 79472, 'loss/train': 1.0517314672470093} 11/07/2021 08:25:13 - INFO - __main__ - Step 79474: {'lr': 0.00023158636662160578, 'samples': 15259008, 'steps': 79473, 'loss/train': 1.2887792587280273} 11/07/2021 08:25:14 - INFO - __main__ - Step 79475: {'lr': 0.00023158107429550136, 'samples': 15259200, 'steps': 79474, 'loss/train': 0.9172324538230896} 11/07/2021 08:25:14 - INFO - __main__ - Step 79476: {'lr': 0.00023157578197769617, 'samples': 15259392, 'steps': 79475, 'loss/train': 1.3183379173278809} 11/07/2021 08:25:15 - INFO - __main__ - Step 79477: {'lr': 0.00023157048966819277, 'samples': 15259584, 'steps': 79476, 'loss/train': 0.9042297601699829} 11/07/2021 08:25:15 - INFO - __main__ - Step 79478: {'lr': 0.00023156519736699334, 'samples': 15259776, 'steps': 79477, 'loss/train': 1.3593603372573853} 11/07/2021 08:25:16 - INFO - __main__ - Step 79479: {'lr': 0.00023155990507410032, 'samples': 15259968, 'steps': 79478, 'loss/train': 1.3094836473464966} 11/07/2021 08:25:16 - INFO - __main__ - Step 79480: {'lr': 0.00023155461278951612, 'samples': 15260160, 'steps': 79479, 'loss/train': 1.35271155834198} 11/07/2021 08:25:17 - INFO - __main__ - Step 79481: {'lr': 0.00023154932051324315, 'samples': 15260352, 'steps': 79480, 'loss/train': 1.4235063791275024} 11/07/2021 08:25:17 - INFO - __main__ - Step 79482: {'lr': 0.00023154402824528375, 'samples': 15260544, 'steps': 79481, 'loss/train': 1.3510419130325317} 11/07/2021 08:25:17 - INFO - __main__ - Step 79483: {'lr': 0.00023153873598564034, 'samples': 15260736, 'steps': 79482, 'loss/train': 1.2912826538085938} 11/07/2021 08:25:18 - INFO - __main__ - Step 79484: {'lr': 0.0002315334437343153, 'samples': 15260928, 'steps': 79483, 'loss/train': 1.3820990324020386} 11/07/2021 08:25:19 - INFO - __main__ - Step 79485: {'lr': 0.00023152815149131097, 'samples': 15261120, 'steps': 79484, 'loss/train': 1.3903934955596924} 11/07/2021 08:25:19 - INFO - __main__ - Step 79486: {'lr': 0.0002315228592566298, 'samples': 15261312, 'steps': 79485, 'loss/train': 1.3636828660964966} 11/07/2021 08:25:19 - INFO - __main__ - Step 79487: {'lr': 0.00023151756703027412, 'samples': 15261504, 'steps': 79486, 'loss/train': 1.9251270294189453} 11/07/2021 08:25:20 - INFO - __main__ - Step 79488: {'lr': 0.00023151227481224638, 'samples': 15261696, 'steps': 79487, 'loss/train': 1.3751360177993774} 11/07/2021 08:25:20 - INFO - __main__ - Step 79489: {'lr': 0.00023150698260254892, 'samples': 15261888, 'steps': 79488, 'loss/train': 5.683528900146484} 11/07/2021 08:25:21 - INFO - __main__ - Step 79490: {'lr': 0.00023150169040118417, 'samples': 15262080, 'steps': 79489, 'loss/train': 1.0362896919250488} 11/07/2021 08:25:22 - INFO - __main__ - Step 79491: {'lr': 0.00023149639820815445, 'samples': 15262272, 'steps': 79490, 'loss/train': 1.1897485256195068} 11/07/2021 08:25:22 - INFO - __main__ - Step 79492: {'lr': 0.00023149110602346213, 'samples': 15262464, 'steps': 79491, 'loss/train': 1.3167893886566162} 11/07/2021 08:25:22 - INFO - __main__ - Step 79493: {'lr': 0.00023148581384710963, 'samples': 15262656, 'steps': 79492, 'loss/train': 1.3816924095153809} 11/07/2021 08:25:23 - INFO - __main__ - Step 79494: {'lr': 0.00023148052167909933, 'samples': 15262848, 'steps': 79493, 'loss/train': 1.1400266885757446} 11/07/2021 08:25:24 - INFO - __main__ - Step 79495: {'lr': 0.00023147522951943363, 'samples': 15263040, 'steps': 79494, 'loss/train': 1.186140775680542} 11/07/2021 08:25:24 - INFO - __main__ - Step 79496: {'lr': 0.0002314699373681149, 'samples': 15263232, 'steps': 79495, 'loss/train': 1.0951745510101318} 11/07/2021 08:25:24 - INFO - __main__ - Step 79497: {'lr': 0.00023146464522514552, 'samples': 15263424, 'steps': 79496, 'loss/train': 1.5716007947921753} 11/07/2021 08:25:25 - INFO - __main__ - Step 79498: {'lr': 0.0002314593530905279, 'samples': 15263616, 'steps': 79497, 'loss/train': 1.388745903968811} 11/07/2021 08:25:25 - INFO - __main__ - Step 79499: {'lr': 0.00023145406096426442, 'samples': 15263808, 'steps': 79498, 'loss/train': 1.7311710119247437} 11/07/2021 08:25:26 - INFO - __main__ - Step 79500: {'lr': 0.00023144876884635744, 'samples': 15264000, 'steps': 79499, 'loss/train': 1.1970555782318115} 11/07/2021 08:25:26 - INFO - __main__ - Step 79501: {'lr': 0.00023144347673680936, 'samples': 15264192, 'steps': 79500, 'loss/train': 1.5556788444519043} 11/07/2021 08:25:27 - INFO - __main__ - Step 79502: {'lr': 0.00023143818463562256, 'samples': 15264384, 'steps': 79501, 'loss/train': 1.351776123046875} 11/07/2021 08:25:27 - INFO - __main__ - Step 79503: {'lr': 0.0002314328925427994, 'samples': 15264576, 'steps': 79502, 'loss/train': 1.3857003450393677} 11/07/2021 08:25:27 - INFO - __main__ - Step 79504: {'lr': 0.00023142760045834245, 'samples': 15264768, 'steps': 79503, 'loss/train': 0.9212246537208557} 11/07/2021 08:25:28 - INFO - __main__ - Step 79505: {'lr': 0.00023142230838225382, 'samples': 15264960, 'steps': 79504, 'loss/train': 1.3707581758499146} 11/07/2021 08:25:29 - INFO - __main__ - Step 79506: {'lr': 0.000231417016314536, 'samples': 15265152, 'steps': 79505, 'loss/train': 1.427509069442749} 11/07/2021 08:25:29 - INFO - __main__ - Step 79507: {'lr': 0.00023141172425519138, 'samples': 15265344, 'steps': 79506, 'loss/train': 1.3169046640396118} 11/07/2021 08:25:30 - INFO - __main__ - Step 79508: {'lr': 0.00023140643220422236, 'samples': 15265536, 'steps': 79507, 'loss/train': 0.6762175559997559} 11/07/2021 08:25:30 - INFO - __main__ - Step 79509: {'lr': 0.00023140114016163133, 'samples': 15265728, 'steps': 79508, 'loss/train': 1.6029117107391357} 11/07/2021 08:25:30 - INFO - __main__ - Step 79510: {'lr': 0.00023139584812742063, 'samples': 15265920, 'steps': 79509, 'loss/train': 1.7370357513427734} 11/07/2021 08:25:31 - INFO - __main__ - Step 79511: {'lr': 0.0002313905561015927, 'samples': 15266112, 'steps': 79510, 'loss/train': 1.2598730325698853} 11/07/2021 08:25:32 - INFO - __main__ - Step 79512: {'lr': 0.00023138526408414986, 'samples': 15266304, 'steps': 79511, 'loss/train': 1.068659782409668} 11/07/2021 08:25:32 - INFO - __main__ - Step 79513: {'lr': 0.00023137997207509455, 'samples': 15266496, 'steps': 79512, 'loss/train': 1.1774154901504517} 11/07/2021 08:25:32 - INFO - __main__ - Step 79514: {'lr': 0.00023137468007442916, 'samples': 15266688, 'steps': 79513, 'loss/train': 1.22505521774292} 11/07/2021 08:25:33 - INFO - __main__ - Step 79515: {'lr': 0.00023136938808215602, 'samples': 15266880, 'steps': 79514, 'loss/train': 1.0965279340744019} 11/07/2021 08:25:34 - INFO - __main__ - Step 79516: {'lr': 0.00023136409609827757, 'samples': 15267072, 'steps': 79515, 'loss/train': 1.5490096807479858} 11/07/2021 08:25:34 - INFO - __main__ - Step 79517: {'lr': 0.00023135880412279627, 'samples': 15267264, 'steps': 79516, 'loss/train': 1.4051097631454468} 11/07/2021 08:25:35 - INFO - __main__ - Step 79518: {'lr': 0.0002313535121557143, 'samples': 15267456, 'steps': 79517, 'loss/train': 1.751974105834961} 11/07/2021 08:25:35 - INFO - __main__ - Step 79519: {'lr': 0.00023134822019703414, 'samples': 15267648, 'steps': 79518, 'loss/train': 1.4774143695831299} 11/07/2021 08:25:35 - INFO - __main__ - Step 79520: {'lr': 0.0002313429282467582, 'samples': 15267840, 'steps': 79519, 'loss/train': 0.6725125312805176} 11/07/2021 08:25:36 - INFO - __main__ - Step 79521: {'lr': 0.00023133763630488882, 'samples': 15268032, 'steps': 79520, 'loss/train': 1.065293550491333} 11/07/2021 08:25:37 - INFO - __main__ - Step 79522: {'lr': 0.00023133234437142845, 'samples': 15268224, 'steps': 79521, 'loss/train': 1.2380372285842896} 11/07/2021 08:25:37 - INFO - __main__ - Step 79523: {'lr': 0.0002313270524463794, 'samples': 15268416, 'steps': 79522, 'loss/train': 1.6829650402069092} 11/07/2021 08:25:38 - INFO - __main__ - Step 79524: {'lr': 0.00023132176052974412, 'samples': 15268608, 'steps': 79523, 'loss/train': 0.1394408494234085} 11/07/2021 08:25:38 - INFO - __main__ - Step 79525: {'lr': 0.00023131646862152496, 'samples': 15268800, 'steps': 79524, 'loss/train': 2.501371145248413} 11/07/2021 08:25:38 - INFO - __main__ - Step 79526: {'lr': 0.0002313111767217243, 'samples': 15268992, 'steps': 79525, 'loss/train': 0.9028856754302979} 11/07/2021 08:25:39 - INFO - __main__ - Step 79527: {'lr': 0.00023130588483034456, 'samples': 15269184, 'steps': 79526, 'loss/train': 1.233431339263916} 11/07/2021 08:25:40 - INFO - __main__ - Step 79528: {'lr': 0.0002313005929473881, 'samples': 15269376, 'steps': 79527, 'loss/train': 1.1437064409255981} 11/07/2021 08:25:40 - INFO - __main__ - Step 79529: {'lr': 0.00023129530107285728, 'samples': 15269568, 'steps': 79528, 'loss/train': 1.804162621498108} 11/07/2021 08:25:40 - INFO - __main__ - Step 79530: {'lr': 0.00023129000920675457, 'samples': 15269760, 'steps': 79529, 'loss/train': 1.4647082090377808} 11/07/2021 08:25:41 - INFO - __main__ - Step 79531: {'lr': 0.0002312847173490823, 'samples': 15269952, 'steps': 79530, 'loss/train': 1.728250503540039} 11/07/2021 08:25:42 - INFO - __main__ - Step 79532: {'lr': 0.0002312794254998428, 'samples': 15270144, 'steps': 79531, 'loss/train': 1.2324224710464478} 11/07/2021 08:25:43 - INFO - __main__ - Step 79533: {'lr': 0.0002312741336590385, 'samples': 15270336, 'steps': 79532, 'loss/train': 0.47591400146484375} 11/07/2021 08:25:43 - INFO - __main__ - Step 79534: {'lr': 0.00023126884182667173, 'samples': 15270528, 'steps': 79533, 'loss/train': 2.618673801422119} 11/07/2021 08:25:43 - INFO - __main__ - Step 79535: {'lr': 0.00023126355000274498, 'samples': 15270720, 'steps': 79534, 'loss/train': 1.1604617834091187} 11/07/2021 08:25:44 - INFO - __main__ - Step 79536: {'lr': 0.0002312582581872606, 'samples': 15270912, 'steps': 79535, 'loss/train': 0.08517173677682877} 11/07/2021 08:25:45 - INFO - __main__ - Step 79537: {'lr': 0.00023125296638022095, 'samples': 15271104, 'steps': 79536, 'loss/train': 1.434615135192871} 11/07/2021 08:25:45 - INFO - __main__ - Step 79538: {'lr': 0.0002312476745816284, 'samples': 15271296, 'steps': 79537, 'loss/train': 1.513961911201477} 11/07/2021 08:25:45 - INFO - __main__ - Step 79539: {'lr': 0.00023124238279148538, 'samples': 15271488, 'steps': 79538, 'loss/train': 1.1934690475463867} 11/07/2021 08:25:46 - INFO - __main__ - Step 79540: {'lr': 0.00023123709100979426, 'samples': 15271680, 'steps': 79539, 'loss/train': 1.2048866748809814} 11/07/2021 08:25:46 - INFO - __main__ - Step 79541: {'lr': 0.00023123179923655745, 'samples': 15271872, 'steps': 79540, 'loss/train': 1.4448766708374023} 11/07/2021 08:25:47 - INFO - __main__ - Step 79542: {'lr': 0.00023122650747177726, 'samples': 15272064, 'steps': 79541, 'loss/train': 1.5102274417877197} 11/07/2021 08:25:47 - INFO - __main__ - Step 79543: {'lr': 0.00023122121571545612, 'samples': 15272256, 'steps': 79542, 'loss/train': 1.7145488262176514} 11/07/2021 08:25:48 - INFO - __main__ - Step 79544: {'lr': 0.00023121592396759645, 'samples': 15272448, 'steps': 79543, 'loss/train': 1.2505393028259277} 11/07/2021 08:25:48 - INFO - __main__ - Step 79545: {'lr': 0.00023121063222820054, 'samples': 15272640, 'steps': 79544, 'loss/train': 1.585133671760559} 11/07/2021 08:25:49 - INFO - __main__ - Step 79546: {'lr': 0.00023120534049727085, 'samples': 15272832, 'steps': 79545, 'loss/train': 1.4120118618011475} 11/07/2021 08:25:49 - INFO - __main__ - Step 79547: {'lr': 0.00023120004877480972, 'samples': 15273024, 'steps': 79546, 'loss/train': 1.173295497894287} 11/07/2021 08:25:50 - INFO - __main__ - Step 79548: {'lr': 0.00023119475706081957, 'samples': 15273216, 'steps': 79547, 'loss/train': 1.3053492307662964} 11/07/2021 08:25:50 - INFO - __main__ - Step 79549: {'lr': 0.00023118946535530277, 'samples': 15273408, 'steps': 79548, 'loss/train': 1.5560202598571777} 11/07/2021 08:25:51 - INFO - __main__ - Step 79550: {'lr': 0.0002311841736582617, 'samples': 15273600, 'steps': 79549, 'loss/train': 1.3028062582015991} 11/07/2021 08:25:51 - INFO - __main__ - Step 79551: {'lr': 0.00023117888196969879, 'samples': 15273792, 'steps': 79550, 'loss/train': 0.7900233864784241} 11/07/2021 08:25:52 - INFO - __main__ - Step 79552: {'lr': 0.0002311735902896164, 'samples': 15273984, 'steps': 79551, 'loss/train': 1.3285503387451172} 11/07/2021 08:25:52 - INFO - __main__ - Step 79553: {'lr': 0.00023116829861801686, 'samples': 15274176, 'steps': 79552, 'loss/train': 0.9676381349563599} 11/07/2021 08:25:52 - INFO - __main__ - Step 79554: {'lr': 0.00023116300695490258, 'samples': 15274368, 'steps': 79553, 'loss/train': 0.624239981174469} 11/07/2021 08:25:53 - INFO - __main__ - Step 79555: {'lr': 0.00023115771530027597, 'samples': 15274560, 'steps': 79554, 'loss/train': 1.4552406072616577} 11/07/2021 08:25:53 - INFO - __main__ - Step 79556: {'lr': 0.00023115242365413937, 'samples': 15274752, 'steps': 79555, 'loss/train': 1.3472644090652466} 11/07/2021 08:25:55 - INFO - __main__ - Step 79557: {'lr': 0.00023114713201649524, 'samples': 15274944, 'steps': 79556, 'loss/train': 1.9347584247589111} 11/07/2021 08:25:55 - INFO - __main__ - Step 79558: {'lr': 0.00023114184038734598, 'samples': 15275136, 'steps': 79557, 'loss/train': 0.34295839071273804} 11/07/2021 08:25:55 - INFO - __main__ - Step 79559: {'lr': 0.00023113654876669382, 'samples': 15275328, 'steps': 79558, 'loss/train': 1.3907761573791504} 11/07/2021 08:25:56 - INFO - __main__ - Step 79560: {'lr': 0.0002311312571545413, 'samples': 15275520, 'steps': 79559, 'loss/train': 0.6296599507331848} 11/07/2021 08:25:56 - INFO - __main__ - Step 79561: {'lr': 0.0002311259655508907, 'samples': 15275712, 'steps': 79560, 'loss/train': 1.6540786027908325} 11/07/2021 08:25:57 - INFO - __main__ - Step 79562: {'lr': 0.00023112067395574448, 'samples': 15275904, 'steps': 79561, 'loss/train': 1.2538455724716187} 11/07/2021 08:25:57 - INFO - __main__ - Step 79563: {'lr': 0.000231115382369105, 'samples': 15276096, 'steps': 79562, 'loss/train': 1.691498875617981} 11/07/2021 08:25:58 - INFO - __main__ - Step 79564: {'lr': 0.0002311100907909746, 'samples': 15276288, 'steps': 79563, 'loss/train': 1.2960878610610962} 11/07/2021 08:25:58 - INFO - __main__ - Step 79565: {'lr': 0.0002311047992213557, 'samples': 15276480, 'steps': 79564, 'loss/train': 1.2619714736938477} 11/07/2021 08:25:58 - INFO - __main__ - Step 79566: {'lr': 0.00023109950766025071, 'samples': 15276672, 'steps': 79565, 'loss/train': 1.2802847623825073} 11/07/2021 08:26:00 - INFO - __main__ - Step 79567: {'lr': 0.00023109421610766195, 'samples': 15276864, 'steps': 79566, 'loss/train': 0.893398106098175} 11/07/2021 08:26:00 - INFO - __main__ - Step 79568: {'lr': 0.00023108892456359187, 'samples': 15277056, 'steps': 79567, 'loss/train': 1.375313401222229} 11/07/2021 08:26:00 - INFO - __main__ - Step 79569: {'lr': 0.00023108363302804284, 'samples': 15277248, 'steps': 79568, 'loss/train': 1.3171261548995972} 11/07/2021 08:26:01 - INFO - __main__ - Step 79570: {'lr': 0.0002310783415010172, 'samples': 15277440, 'steps': 79569, 'loss/train': 1.673750638961792} 11/07/2021 08:26:01 - INFO - __main__ - Step 79571: {'lr': 0.00023107304998251746, 'samples': 15277632, 'steps': 79570, 'loss/train': 1.3928048610687256} 11/07/2021 08:26:02 - INFO - __main__ - Step 79572: {'lr': 0.0002310677584725458, 'samples': 15277824, 'steps': 79571, 'loss/train': 1.545754313468933} 11/07/2021 08:26:02 - INFO - __main__ - Step 79573: {'lr': 0.00023106246697110483, 'samples': 15278016, 'steps': 79572, 'loss/train': 1.0257874727249146} 11/07/2021 08:26:03 - INFO - __main__ - Step 79574: {'lr': 0.00023105717547819676, 'samples': 15278208, 'steps': 79573, 'loss/train': 1.4722168445587158} 11/07/2021 08:26:03 - INFO - __main__ - Step 79575: {'lr': 0.00023105188399382402, 'samples': 15278400, 'steps': 79574, 'loss/train': 1.5238254070281982} 11/07/2021 08:26:03 - INFO - __main__ - Step 79576: {'lr': 0.00023104659251798902, 'samples': 15278592, 'steps': 79575, 'loss/train': 1.492822527885437} 11/07/2021 08:26:04 - INFO - __main__ - Step 79577: {'lr': 0.00023104130105069408, 'samples': 15278784, 'steps': 79576, 'loss/train': 1.0514148473739624} 11/07/2021 08:26:05 - INFO - __main__ - Step 79578: {'lr': 0.00023103600959194172, 'samples': 15278976, 'steps': 79577, 'loss/train': 1.4399038553237915} 11/07/2021 08:26:05 - INFO - __main__ - Step 79579: {'lr': 0.0002310307181417342, 'samples': 15279168, 'steps': 79578, 'loss/train': 1.9156620502471924} 11/07/2021 08:26:05 - INFO - __main__ - Step 79580: {'lr': 0.00023102542670007392, 'samples': 15279360, 'steps': 79579, 'loss/train': 1.054512858390808} 11/07/2021 08:26:06 - INFO - __main__ - Step 79581: {'lr': 0.00023102013526696334, 'samples': 15279552, 'steps': 79580, 'loss/train': 1.616941213607788} 11/07/2021 08:26:06 - INFO - __main__ - Step 79582: {'lr': 0.00023101484384240476, 'samples': 15279744, 'steps': 79581, 'loss/train': 1.4785219430923462} 11/07/2021 08:26:07 - INFO - __main__ - Step 79583: {'lr': 0.00023100955242640061, 'samples': 15279936, 'steps': 79582, 'loss/train': 1.1722992658615112} 11/07/2021 08:26:07 - INFO - __main__ - Step 79584: {'lr': 0.00023100426101895324, 'samples': 15280128, 'steps': 79583, 'loss/train': 1.5256754159927368} 11/07/2021 08:26:08 - INFO - __main__ - Step 79585: {'lr': 0.0002309989696200652, 'samples': 15280320, 'steps': 79584, 'loss/train': 0.7706775665283203} 11/07/2021 08:26:08 - INFO - __main__ - Step 79586: {'lr': 0.00023099367822973862, 'samples': 15280512, 'steps': 79585, 'loss/train': 0.6529847979545593} 11/07/2021 08:26:08 - INFO - __main__ - Step 79587: {'lr': 0.000230988386847976, 'samples': 15280704, 'steps': 79586, 'loss/train': 1.3891690969467163} 11/07/2021 08:26:09 - INFO - __main__ - Step 79588: {'lr': 0.0002309830954747797, 'samples': 15280896, 'steps': 79587, 'loss/train': 1.377179741859436} 11/07/2021 08:26:10 - INFO - __main__ - Step 79589: {'lr': 0.00023097780411015213, 'samples': 15281088, 'steps': 79588, 'loss/train': 1.0871362686157227} 11/07/2021 08:26:10 - INFO - __main__ - Step 79590: {'lr': 0.00023097251275409564, 'samples': 15281280, 'steps': 79589, 'loss/train': 1.3017847537994385} 11/07/2021 08:26:10 - INFO - __main__ - Step 79591: {'lr': 0.00023096722140661266, 'samples': 15281472, 'steps': 79590, 'loss/train': 1.2310842275619507} 11/07/2021 08:26:11 - INFO - __main__ - Step 79592: {'lr': 0.0002309619300677056, 'samples': 15281664, 'steps': 79591, 'loss/train': 1.5376555919647217} 11/07/2021 08:26:12 - INFO - __main__ - Step 79593: {'lr': 0.00023095663873737673, 'samples': 15281856, 'steps': 79592, 'loss/train': 1.559167504310608} 11/07/2021 08:26:12 - INFO - __main__ - Step 79594: {'lr': 0.00023095134741562856, 'samples': 15282048, 'steps': 79593, 'loss/train': 1.3238060474395752} 11/07/2021 08:26:12 - INFO - __main__ - Step 79595: {'lr': 0.00023094605610246338, 'samples': 15282240, 'steps': 79594, 'loss/train': 1.2808837890625} 11/07/2021 08:26:13 - INFO - __main__ - Step 79596: {'lr': 0.00023094076479788364, 'samples': 15282432, 'steps': 79595, 'loss/train': 1.7395559549331665} 11/07/2021 08:26:13 - INFO - __main__ - Step 79597: {'lr': 0.0002309354735018917, 'samples': 15282624, 'steps': 79596, 'loss/train': 1.3781299591064453} 11/07/2021 08:26:14 - INFO - __main__ - Step 79598: {'lr': 0.00023093018221449004, 'samples': 15282816, 'steps': 79597, 'loss/train': 1.636840581893921} 11/07/2021 08:26:15 - INFO - __main__ - Step 79599: {'lr': 0.00023092489093568084, 'samples': 15283008, 'steps': 79598, 'loss/train': 1.4495773315429688} 11/07/2021 08:26:15 - INFO - __main__ - Step 79600: {'lr': 0.0002309195996654666, 'samples': 15283200, 'steps': 79599, 'loss/train': 1.655630111694336} 11/07/2021 08:26:15 - INFO - __main__ - Step 79601: {'lr': 0.00023091430840384964, 'samples': 15283392, 'steps': 79600, 'loss/train': 2.446629047393799} 11/07/2021 08:26:16 - INFO - __main__ - Step 79602: {'lr': 0.00023090901715083247, 'samples': 15283584, 'steps': 79601, 'loss/train': 1.0961195230484009} 11/07/2021 08:26:17 - INFO - __main__ - Step 79603: {'lr': 0.00023090372590641733, 'samples': 15283776, 'steps': 79602, 'loss/train': 1.2670172452926636} 11/07/2021 08:26:17 - INFO - __main__ - Step 79604: {'lr': 0.00023089843467060672, 'samples': 15283968, 'steps': 79603, 'loss/train': 1.0516464710235596} 11/07/2021 08:26:17 - INFO - __main__ - Step 79605: {'lr': 0.000230893143443403, 'samples': 15284160, 'steps': 79604, 'loss/train': 1.5794326066970825} 11/07/2021 08:26:18 - INFO - __main__ - Step 79606: {'lr': 0.0002308878522248085, 'samples': 15284352, 'steps': 79605, 'loss/train': 1.5813130140304565} 11/07/2021 08:26:18 - INFO - __main__ - Step 79607: {'lr': 0.00023088256101482565, 'samples': 15284544, 'steps': 79606, 'loss/train': 1.1064841747283936} 11/07/2021 08:26:19 - INFO - __main__ - Step 79608: {'lr': 0.00023087726981345683, 'samples': 15284736, 'steps': 79607, 'loss/train': 1.3168221712112427} 11/07/2021 08:26:20 - INFO - __main__ - Step 79609: {'lr': 0.00023087197862070442, 'samples': 15284928, 'steps': 79608, 'loss/train': 1.4090098142623901} 11/07/2021 08:26:20 - INFO - __main__ - Step 79610: {'lr': 0.00023086668743657078, 'samples': 15285120, 'steps': 79609, 'loss/train': 0.7868743538856506} 11/07/2021 08:26:20 - INFO - __main__ - Step 79611: {'lr': 0.00023086139626105843, 'samples': 15285312, 'steps': 79610, 'loss/train': 0.7388406991958618} 11/07/2021 08:26:21 - INFO - __main__ - Step 79612: {'lr': 0.00023085610509416955, 'samples': 15285504, 'steps': 79611, 'loss/train': 1.2122981548309326} 11/07/2021 08:26:21 - INFO - __main__ - Step 79613: {'lr': 0.0002308508139359066, 'samples': 15285696, 'steps': 79612, 'loss/train': 1.3642616271972656} 11/07/2021 08:26:22 - INFO - __main__ - Step 79614: {'lr': 0.00023084552278627196, 'samples': 15285888, 'steps': 79613, 'loss/train': 1.0689077377319336} 11/07/2021 08:26:22 - INFO - __main__ - Step 79615: {'lr': 0.00023084023164526808, 'samples': 15286080, 'steps': 79614, 'loss/train': 1.523692011833191} 11/07/2021 08:26:23 - INFO - __main__ - Step 79616: {'lr': 0.00023083494051289724, 'samples': 15286272, 'steps': 79615, 'loss/train': 1.5207016468048096} 11/07/2021 08:26:23 - INFO - __main__ - Step 79617: {'lr': 0.0002308296493891619, 'samples': 15286464, 'steps': 79616, 'loss/train': 1.2325903177261353} 11/07/2021 08:26:23 - INFO - __main__ - Step 79618: {'lr': 0.00023082435827406444, 'samples': 15286656, 'steps': 79617, 'loss/train': 1.5064235925674438} 11/07/2021 08:26:24 - INFO - __main__ - Step 79619: {'lr': 0.00023081906716760722, 'samples': 15286848, 'steps': 79618, 'loss/train': 1.7271994352340698} 11/07/2021 08:26:25 - INFO - __main__ - Step 79620: {'lr': 0.00023081377606979265, 'samples': 15287040, 'steps': 79619, 'loss/train': 1.0840002298355103} 11/07/2021 08:26:25 - INFO - __main__ - Step 79621: {'lr': 0.00023080848498062306, 'samples': 15287232, 'steps': 79620, 'loss/train': 0.8367189168930054} 11/07/2021 08:26:26 - INFO - __main__ - Step 79622: {'lr': 0.00023080319390010088, 'samples': 15287424, 'steps': 79621, 'loss/train': 1.2227813005447388} 11/07/2021 08:26:26 - INFO - __main__ - Step 79623: {'lr': 0.0002307979028282285, 'samples': 15287616, 'steps': 79622, 'loss/train': 1.7469456195831299} 11/07/2021 08:26:27 - INFO - __main__ - Step 79624: {'lr': 0.0002307926117650083, 'samples': 15287808, 'steps': 79623, 'loss/train': 1.6743509769439697} 11/07/2021 08:26:27 - INFO - __main__ - Step 79625: {'lr': 0.00023078732071044272, 'samples': 15288000, 'steps': 79624, 'loss/train': 1.7048580646514893} 11/07/2021 08:26:28 - INFO - __main__ - Step 79626: {'lr': 0.000230782029664534, 'samples': 15288192, 'steps': 79625, 'loss/train': 1.1353721618652344} 11/07/2021 08:26:28 - INFO - __main__ - Step 79627: {'lr': 0.0002307767386272846, 'samples': 15288384, 'steps': 79626, 'loss/train': 1.5574742555618286} 11/07/2021 08:26:28 - INFO - __main__ - Step 79628: {'lr': 0.00023077144759869688, 'samples': 15288576, 'steps': 79627, 'loss/train': 1.081004023551941} 11/07/2021 08:26:29 - INFO - __main__ - Step 79629: {'lr': 0.00023076615657877326, 'samples': 15288768, 'steps': 79628, 'loss/train': 0.7757850885391235} 11/07/2021 08:26:30 - INFO - __main__ - Step 79630: {'lr': 0.00023076086556751612, 'samples': 15288960, 'steps': 79629, 'loss/train': 1.0944302082061768} 11/07/2021 08:26:30 - INFO - __main__ - Step 79631: {'lr': 0.00023075557456492786, 'samples': 15289152, 'steps': 79630, 'loss/train': 0.8847152590751648} 11/07/2021 08:26:31 - INFO - __main__ - Step 79632: {'lr': 0.0002307502835710108, 'samples': 15289344, 'steps': 79631, 'loss/train': 1.3222955465316772} 11/07/2021 08:26:31 - INFO - __main__ - Step 79633: {'lr': 0.0002307449925857674, 'samples': 15289536, 'steps': 79632, 'loss/train': 1.2614034414291382} 11/07/2021 08:26:31 - INFO - __main__ - Step 79634: {'lr': 0.00023073970160919995, 'samples': 15289728, 'steps': 79633, 'loss/train': 1.2364290952682495} 11/07/2021 08:26:32 - INFO - __main__ - Step 79635: {'lr': 0.00023073441064131096, 'samples': 15289920, 'steps': 79634, 'loss/train': 1.5574520826339722} 11/07/2021 08:26:33 - INFO - __main__ - Step 79636: {'lr': 0.00023072911968210274, 'samples': 15290112, 'steps': 79635, 'loss/train': 1.571665644645691} 11/07/2021 08:26:33 - INFO - __main__ - Step 79637: {'lr': 0.00023072382873157765, 'samples': 15290304, 'steps': 79636, 'loss/train': 1.1550140380859375} 11/07/2021 08:26:33 - INFO - __main__ - Step 79638: {'lr': 0.00023071853778973823, 'samples': 15290496, 'steps': 79637, 'loss/train': 0.8889645934104919} 11/07/2021 08:26:34 - INFO - __main__ - Step 79639: {'lr': 0.00023071324685658662, 'samples': 15290688, 'steps': 79638, 'loss/train': 1.3686878681182861} 11/07/2021 08:26:35 - INFO - __main__ - Step 79640: {'lr': 0.0002307079559321253, 'samples': 15290880, 'steps': 79639, 'loss/train': 1.4260860681533813} 11/07/2021 08:26:35 - INFO - __main__ - Step 79641: {'lr': 0.00023070266501635674, 'samples': 15291072, 'steps': 79640, 'loss/train': 1.9794788360595703} 11/07/2021 08:26:35 - INFO - __main__ - Step 79642: {'lr': 0.00023069737410928324, 'samples': 15291264, 'steps': 79641, 'loss/train': 1.8885358572006226} 11/07/2021 08:26:36 - INFO - __main__ - Step 79643: {'lr': 0.00023069208321090717, 'samples': 15291456, 'steps': 79642, 'loss/train': 1.2742620706558228} 11/07/2021 08:26:36 - INFO - __main__ - Step 79644: {'lr': 0.000230686792321231, 'samples': 15291648, 'steps': 79643, 'loss/train': 1.304626703262329} 11/07/2021 08:26:37 - INFO - __main__ - Step 79645: {'lr': 0.00023068150144025702, 'samples': 15291840, 'steps': 79644, 'loss/train': 1.6012871265411377} 11/07/2021 08:26:38 - INFO - __main__ - Step 79646: {'lr': 0.0002306762105679877, 'samples': 15292032, 'steps': 79645, 'loss/train': 1.6206495761871338} 11/07/2021 08:26:38 - INFO - __main__ - Step 79647: {'lr': 0.00023067091970442534, 'samples': 15292224, 'steps': 79646, 'loss/train': 1.3646444082260132} 11/07/2021 08:26:38 - INFO - __main__ - Step 79648: {'lr': 0.00023066562884957236, 'samples': 15292416, 'steps': 79647, 'loss/train': 0.8606871962547302} 11/07/2021 08:26:39 - INFO - __main__ - Step 79649: {'lr': 0.0002306603380034312, 'samples': 15292608, 'steps': 79648, 'loss/train': 0.8234711289405823} 11/07/2021 08:26:39 - INFO - __main__ - Step 79650: {'lr': 0.00023065504716600417, 'samples': 15292800, 'steps': 79649, 'loss/train': 1.3319276571273804} 11/07/2021 08:26:40 - INFO - __main__ - Step 79651: {'lr': 0.00023064975633729366, 'samples': 15292992, 'steps': 79650, 'loss/train': 0.09113195538520813} 11/07/2021 08:26:41 - INFO - __main__ - Step 79652: {'lr': 0.0002306444655173022, 'samples': 15293184, 'steps': 79651, 'loss/train': 1.0410878658294678} 11/07/2021 08:26:41 - INFO - __main__ - Step 79653: {'lr': 0.0002306391747060319, 'samples': 15293376, 'steps': 79652, 'loss/train': 1.450329303741455} 11/07/2021 08:26:41 - INFO - __main__ - Step 79654: {'lr': 0.00023063388390348534, 'samples': 15293568, 'steps': 79653, 'loss/train': 1.482818841934204} 11/07/2021 08:26:42 - INFO - __main__ - Step 79655: {'lr': 0.00023062859310966482, 'samples': 15293760, 'steps': 79654, 'loss/train': 1.2680517435073853} 11/07/2021 08:26:43 - INFO - __main__ - Step 79656: {'lr': 0.00023062330232457277, 'samples': 15293952, 'steps': 79655, 'loss/train': 1.5318527221679688} 11/07/2021 08:26:43 - INFO - __main__ - Step 79657: {'lr': 0.00023061801154821156, 'samples': 15294144, 'steps': 79656, 'loss/train': 1.5787434577941895} 11/07/2021 08:26:43 - INFO - __main__ - Step 79658: {'lr': 0.00023061272078058357, 'samples': 15294336, 'steps': 79657, 'loss/train': 1.2956947088241577} 11/07/2021 08:26:44 - INFO - __main__ - Step 79659: {'lr': 0.00023060743002169118, 'samples': 15294528, 'steps': 79658, 'loss/train': 1.1253013610839844} 11/07/2021 08:26:44 - INFO - __main__ - Step 79660: {'lr': 0.00023060213927153682, 'samples': 15294720, 'steps': 79659, 'loss/train': 1.6919769048690796} 11/07/2021 08:26:45 - INFO - __main__ - Step 79661: {'lr': 0.0002305968485301228, 'samples': 15294912, 'steps': 79660, 'loss/train': 1.0864368677139282} 11/07/2021 08:26:45 - INFO - __main__ - Step 79662: {'lr': 0.00023059155779745155, 'samples': 15295104, 'steps': 79661, 'loss/train': 1.4375165700912476} 11/07/2021 08:26:46 - INFO - __main__ - Step 79663: {'lr': 0.00023058626707352545, 'samples': 15295296, 'steps': 79662, 'loss/train': 1.2763662338256836} 11/07/2021 08:26:46 - INFO - __main__ - Step 79664: {'lr': 0.00023058097635834693, 'samples': 15295488, 'steps': 79663, 'loss/train': 1.1003414392471313} 11/07/2021 08:26:46 - INFO - __main__ - Step 79665: {'lr': 0.00023057568565191833, 'samples': 15295680, 'steps': 79664, 'loss/train': 1.2445683479309082} 11/07/2021 08:26:47 - INFO - __main__ - Step 79666: {'lr': 0.00023057039495424196, 'samples': 15295872, 'steps': 79665, 'loss/train': 1.1075637340545654} 11/07/2021 08:26:48 - INFO - __main__ - Step 79667: {'lr': 0.00023056510426532027, 'samples': 15296064, 'steps': 79666, 'loss/train': 1.6230145692825317} 11/07/2021 08:26:48 - INFO - __main__ - Step 79668: {'lr': 0.00023055981358515565, 'samples': 15296256, 'steps': 79667, 'loss/train': 1.6623480319976807} 11/07/2021 08:26:48 - INFO - __main__ - Step 79669: {'lr': 0.00023055452291375047, 'samples': 15296448, 'steps': 79668, 'loss/train': 1.3561513423919678} 11/07/2021 08:26:49 - INFO - __main__ - Step 79670: {'lr': 0.00023054923225110713, 'samples': 15296640, 'steps': 79669, 'loss/train': 1.539768934249878} 11/07/2021 08:26:50 - INFO - __main__ - Step 79671: {'lr': 0.000230543941597228, 'samples': 15296832, 'steps': 79670, 'loss/train': 1.4771229028701782} 11/07/2021 08:26:50 - INFO - __main__ - Step 79672: {'lr': 0.00023053865095211547, 'samples': 15297024, 'steps': 79671, 'loss/train': 1.6834200620651245} 11/07/2021 08:26:50 - INFO - __main__ - Step 79673: {'lr': 0.00023053336031577193, 'samples': 15297216, 'steps': 79672, 'loss/train': 0.46944525837898254} 11/07/2021 08:26:51 - INFO - __main__ - Step 79674: {'lr': 0.00023052806968819976, 'samples': 15297408, 'steps': 79673, 'loss/train': 0.3993986248970032} 11/07/2021 08:26:51 - INFO - __main__ - Step 79675: {'lr': 0.0002305227790694014, 'samples': 15297600, 'steps': 79674, 'loss/train': 1.8011653423309326} 11/07/2021 08:26:52 - INFO - __main__ - Step 79676: {'lr': 0.0002305174884593791, 'samples': 15297792, 'steps': 79675, 'loss/train': 1.3469222784042358} 11/07/2021 08:26:53 - INFO - __main__ - Step 79677: {'lr': 0.00023051219785813533, 'samples': 15297984, 'steps': 79676, 'loss/train': 0.9914976954460144} 11/07/2021 08:26:53 - INFO - __main__ - Step 79678: {'lr': 0.00023050690726567248, 'samples': 15298176, 'steps': 79677, 'loss/train': 1.0420585870742798} 11/07/2021 08:26:53 - INFO - __main__ - Step 79679: {'lr': 0.00023050161668199294, 'samples': 15298368, 'steps': 79678, 'loss/train': 1.1643062829971313} 11/07/2021 08:26:54 - INFO - __main__ - Step 79680: {'lr': 0.00023049632610709902, 'samples': 15298560, 'steps': 79679, 'loss/train': 1.3935422897338867} 11/07/2021 08:26:55 - INFO - __main__ - Step 79681: {'lr': 0.00023049103554099318, 'samples': 15298752, 'steps': 79680, 'loss/train': 1.5953550338745117} 11/07/2021 08:26:55 - INFO - __main__ - Step 79682: {'lr': 0.00023048574498367775, 'samples': 15298944, 'steps': 79681, 'loss/train': 2.20381236076355} 11/07/2021 08:26:55 - INFO - __main__ - Step 79683: {'lr': 0.00023048045443515517, 'samples': 15299136, 'steps': 79682, 'loss/train': 1.4348722696304321} 11/07/2021 08:26:56 - INFO - __main__ - Step 79684: {'lr': 0.00023047516389542778, 'samples': 15299328, 'steps': 79683, 'loss/train': 1.5202128887176514} 11/07/2021 08:26:56 - INFO - __main__ - Step 79685: {'lr': 0.00023046987336449798, 'samples': 15299520, 'steps': 79684, 'loss/train': 1.240891456604004} 11/07/2021 08:26:56 - INFO - __main__ - Step 79686: {'lr': 0.00023046458284236822, 'samples': 15299712, 'steps': 79685, 'loss/train': 1.417527198791504} 11/07/2021 08:26:59 - INFO - __main__ - Step 79687: {'lr': 0.00023045929232904075, 'samples': 15299904, 'steps': 79686, 'loss/train': 1.0292255878448486} 11/07/2021 08:26:59 - INFO - __main__ - Step 79688: {'lr': 0.000230454001824518, 'samples': 15300096, 'steps': 79687, 'loss/train': 1.2860561609268188} 11/07/2021 08:27:00 - INFO - __main__ - Step 79689: {'lr': 0.0002304487113288024, 'samples': 15300288, 'steps': 79688, 'loss/train': 1.3514142036437988} 11/07/2021 08:27:00 - INFO - __main__ - Step 79690: {'lr': 0.00023044342084189631, 'samples': 15300480, 'steps': 79689, 'loss/train': 1.7475224733352661} 11/07/2021 08:27:00 - INFO - __main__ - Step 79691: {'lr': 0.00023043813036380213, 'samples': 15300672, 'steps': 79690, 'loss/train': 1.7405807971954346} 11/07/2021 08:27:01 - INFO - __main__ - Step 79692: {'lr': 0.00023043283989452226, 'samples': 15300864, 'steps': 79691, 'loss/train': 1.754797101020813} 11/07/2021 08:27:01 - INFO - __main__ - Step 79693: {'lr': 0.000230427549434059, 'samples': 15301056, 'steps': 79692, 'loss/train': 1.5346554517745972} 11/07/2021 08:27:01 - INFO - __main__ - Step 79694: {'lr': 0.0002304222589824148, 'samples': 15301248, 'steps': 79693, 'loss/train': 1.7253472805023193} 11/07/2021 08:27:02 - INFO - __main__ - Step 79695: {'lr': 0.00023041696853959198, 'samples': 15301440, 'steps': 79694, 'loss/train': 1.506368637084961} 11/07/2021 08:27:03 - INFO - __main__ - Step 79696: {'lr': 0.00023041167810559303, 'samples': 15301632, 'steps': 79695, 'loss/train': 2.311166524887085} 11/07/2021 08:27:03 - INFO - __main__ - Step 79697: {'lr': 0.00023040638768042027, 'samples': 15301824, 'steps': 79696, 'loss/train': 1.0359715223312378} 11/07/2021 08:27:03 - INFO - __main__ - Step 79698: {'lr': 0.00023040109726407606, 'samples': 15302016, 'steps': 79697, 'loss/train': 1.2955647706985474} 11/07/2021 08:27:04 - INFO - __main__ - Step 79699: {'lr': 0.00023039580685656284, 'samples': 15302208, 'steps': 79698, 'loss/train': 0.8539668917655945} 11/07/2021 08:27:05 - INFO - __main__ - Step 79700: {'lr': 0.00023039051645788294, 'samples': 15302400, 'steps': 79699, 'loss/train': 1.3596549034118652} 11/07/2021 08:27:05 - INFO - __main__ - Step 79701: {'lr': 0.0002303852260680388, 'samples': 15302592, 'steps': 79700, 'loss/train': 1.704916000366211} 11/07/2021 08:27:05 - INFO - __main__ - Step 79702: {'lr': 0.00023037993568703275, 'samples': 15302784, 'steps': 79701, 'loss/train': 0.06708662211894989} 11/07/2021 08:27:06 - INFO - __main__ - Step 79703: {'lr': 0.00023037464531486718, 'samples': 15302976, 'steps': 79702, 'loss/train': 1.4108229875564575} 11/07/2021 08:27:06 - INFO - __main__ - Step 79704: {'lr': 0.00023036935495154452, 'samples': 15303168, 'steps': 79703, 'loss/train': 1.2272166013717651} 11/07/2021 08:27:07 - INFO - __main__ - Step 79705: {'lr': 0.0002303640645970671, 'samples': 15303360, 'steps': 79704, 'loss/train': 1.59709632396698} 11/07/2021 08:27:07 - INFO - __main__ - Step 79706: {'lr': 0.0002303587742514374, 'samples': 15303552, 'steps': 79705, 'loss/train': 0.7705163359642029} 11/07/2021 08:27:08 - INFO - __main__ - Step 79707: {'lr': 0.0002303534839146577, 'samples': 15303744, 'steps': 79706, 'loss/train': 0.948482096195221} 11/07/2021 08:27:08 - INFO - __main__ - Step 79708: {'lr': 0.00023034819358673045, 'samples': 15303936, 'steps': 79707, 'loss/train': 0.9655011296272278} 11/07/2021 08:27:08 - INFO - __main__ - Step 79709: {'lr': 0.00023034290326765793, 'samples': 15304128, 'steps': 79708, 'loss/train': 1.2765501737594604} 11/07/2021 08:27:09 - INFO - __main__ - Step 79710: {'lr': 0.00023033761295744262, 'samples': 15304320, 'steps': 79709, 'loss/train': 1.4382667541503906} 11/07/2021 08:27:10 - INFO - __main__ - Step 79711: {'lr': 0.00023033232265608688, 'samples': 15304512, 'steps': 79710, 'loss/train': 1.4744540452957153} 11/07/2021 08:27:10 - INFO - __main__ - Step 79712: {'lr': 0.0002303270323635931, 'samples': 15304704, 'steps': 79711, 'loss/train': 1.3198357820510864} 11/07/2021 08:27:10 - INFO - __main__ - Step 79713: {'lr': 0.00023032174207996363, 'samples': 15304896, 'steps': 79712, 'loss/train': 1.8038930892944336} 11/07/2021 08:27:11 - INFO - __main__ - Step 79714: {'lr': 0.00023031645180520089, 'samples': 15305088, 'steps': 79713, 'loss/train': 2.442979335784912} 11/07/2021 08:27:12 - INFO - __main__ - Step 79715: {'lr': 0.00023031116153930726, 'samples': 15305280, 'steps': 79714, 'loss/train': 1.4216595888137817} 11/07/2021 08:27:12 - INFO - __main__ - Step 79716: {'lr': 0.00023030587128228507, 'samples': 15305472, 'steps': 79715, 'loss/train': 1.5261123180389404} 11/07/2021 08:27:13 - INFO - __main__ - Step 79717: {'lr': 0.0002303005810341368, 'samples': 15305664, 'steps': 79716, 'loss/train': 1.216512680053711} 11/07/2021 08:27:13 - INFO - __main__ - Step 79718: {'lr': 0.00023029529079486477, 'samples': 15305856, 'steps': 79717, 'loss/train': 5.295358657836914} 11/07/2021 08:27:14 - INFO - __main__ - Step 79719: {'lr': 0.0002302900005644715, 'samples': 15306048, 'steps': 79718, 'loss/train': 2.1481614112854004} 11/07/2021 08:27:14 - INFO - __main__ - Step 79720: {'lr': 0.00023028471034295913, 'samples': 15306240, 'steps': 79719, 'loss/train': 1.0376840829849243} 11/07/2021 08:27:15 - INFO - __main__ - Step 79721: {'lr': 0.0002302794201303302, 'samples': 15306432, 'steps': 79720, 'loss/train': 1.1257002353668213} 11/07/2021 08:27:15 - INFO - __main__ - Step 79722: {'lr': 0.000230274129926587, 'samples': 15306624, 'steps': 79721, 'loss/train': 1.0669578313827515} 11/07/2021 08:27:16 - INFO - __main__ - Step 79723: {'lr': 0.000230268839731732, 'samples': 15306816, 'steps': 79722, 'loss/train': 1.487399697303772} 11/07/2021 08:27:16 - INFO - __main__ - Step 79724: {'lr': 0.00023026354954576756, 'samples': 15307008, 'steps': 79723, 'loss/train': 0.9005876779556274} 11/07/2021 08:27:16 - INFO - __main__ - Step 79725: {'lr': 0.00023025825936869604, 'samples': 15307200, 'steps': 79724, 'loss/train': 1.2283976078033447} 11/07/2021 08:27:17 - INFO - __main__ - Step 79726: {'lr': 0.00023025296920051988, 'samples': 15307392, 'steps': 79725, 'loss/train': 2.0580148696899414} 11/07/2021 08:27:18 - INFO - __main__ - Step 79727: {'lr': 0.0002302476790412414, 'samples': 15307584, 'steps': 79726, 'loss/train': 1.5426007509231567} 11/07/2021 08:27:18 - INFO - __main__ - Step 79728: {'lr': 0.00023024238889086303, 'samples': 15307776, 'steps': 79727, 'loss/train': 1.5487509965896606} 11/07/2021 08:27:18 - INFO - __main__ - Step 79729: {'lr': 0.0002302370987493871, 'samples': 15307968, 'steps': 79728, 'loss/train': 1.2465546131134033} 11/07/2021 08:27:19 - INFO - __main__ - Step 79730: {'lr': 0.00023023180861681607, 'samples': 15308160, 'steps': 79729, 'loss/train': 0.9958863854408264} 11/07/2021 08:27:20 - INFO - __main__ - Step 79731: {'lr': 0.00023022651849315224, 'samples': 15308352, 'steps': 79730, 'loss/train': 1.1168737411499023} 11/07/2021 08:27:20 - INFO - __main__ - Step 79732: {'lr': 0.0002302212283783982, 'samples': 15308544, 'steps': 79731, 'loss/train': 1.7624419927597046} 11/07/2021 08:27:20 - INFO - __main__ - Step 79733: {'lr': 0.000230215938272556, 'samples': 15308736, 'steps': 79732, 'loss/train': 1.611841082572937} 11/07/2021 08:27:21 - INFO - __main__ - Step 79734: {'lr': 0.00023021064817562822, 'samples': 15308928, 'steps': 79733, 'loss/train': 1.8784775733947754} 11/07/2021 08:27:21 - INFO - __main__ - Step 79735: {'lr': 0.0002302053580876172, 'samples': 15309120, 'steps': 79734, 'loss/train': 1.0591129064559937} 11/07/2021 08:27:22 - INFO - __main__ - Step 79736: {'lr': 0.0002302000680085254, 'samples': 15309312, 'steps': 79735, 'loss/train': 1.4480555057525635} 11/07/2021 08:27:22 - INFO - __main__ - Step 79737: {'lr': 0.0002301947779383551, 'samples': 15309504, 'steps': 79736, 'loss/train': 1.6549832820892334} 11/07/2021 08:27:23 - INFO - __main__ - Step 79738: {'lr': 0.00023018948787710872, 'samples': 15309696, 'steps': 79737, 'loss/train': 1.2943987846374512} 11/07/2021 08:27:23 - INFO - __main__ - Step 79739: {'lr': 0.00023018419782478867, 'samples': 15309888, 'steps': 79738, 'loss/train': 1.3928625583648682} 11/07/2021 08:27:24 - INFO - __main__ - Step 79740: {'lr': 0.00023017890778139727, 'samples': 15310080, 'steps': 79739, 'loss/train': 0.8978331685066223} 11/07/2021 08:27:24 - INFO - __main__ - Step 79741: {'lr': 0.000230173617746937, 'samples': 15310272, 'steps': 79740, 'loss/train': 1.3237978219985962} 11/07/2021 08:27:26 - INFO - __main__ - Step 79742: {'lr': 0.00023016832772141017, 'samples': 15310464, 'steps': 79741, 'loss/train': 0.8137255311012268} 11/07/2021 08:27:26 - INFO - __main__ - Step 79743: {'lr': 0.0002301630377048192, 'samples': 15310656, 'steps': 79742, 'loss/train': 1.1453802585601807} 11/07/2021 08:27:27 - INFO - __main__ - Step 79744: {'lr': 0.00023015774769716643, 'samples': 15310848, 'steps': 79743, 'loss/train': 1.588730812072754} 11/07/2021 08:27:27 - INFO - __main__ - Step 79745: {'lr': 0.0002301524576984543, 'samples': 15311040, 'steps': 79744, 'loss/train': 1.177702784538269} 11/07/2021 08:27:27 - INFO - __main__ - Step 79746: {'lr': 0.00023014716770868525, 'samples': 15311232, 'steps': 79745, 'loss/train': 1.2888368368148804} 11/07/2021 08:27:28 - INFO - __main__ - Step 79747: {'lr': 0.00023014187772786153, 'samples': 15311424, 'steps': 79746, 'loss/train': 1.5698318481445312} 11/07/2021 08:27:28 - INFO - __main__ - Step 79748: {'lr': 0.00023013658775598552, 'samples': 15311616, 'steps': 79747, 'loss/train': 1.3400473594665527} 11/07/2021 08:27:29 - INFO - __main__ - Step 79749: {'lr': 0.00023013129779305967, 'samples': 15311808, 'steps': 79748, 'loss/train': 0.6859661936759949} 11/07/2021 08:27:30 - INFO - __main__ - Step 79750: {'lr': 0.00023012600783908633, 'samples': 15312000, 'steps': 79749, 'loss/train': 0.9632338285446167} 11/07/2021 08:27:30 - INFO - __main__ - Step 79751: {'lr': 0.00023012071789406795, 'samples': 15312192, 'steps': 79750, 'loss/train': 1.0117583274841309} 11/07/2021 08:27:30 - INFO - __main__ - Step 79752: {'lr': 0.00023011542795800682, 'samples': 15312384, 'steps': 79751, 'loss/train': 0.9297181963920593} 11/07/2021 08:27:31 - INFO - __main__ - Step 79753: {'lr': 0.0002301101380309054, 'samples': 15312576, 'steps': 79752, 'loss/train': 1.785306453704834} 11/07/2021 08:27:31 - INFO - __main__ - Step 79754: {'lr': 0.00023010484811276602, 'samples': 15312768, 'steps': 79753, 'loss/train': 1.715018391609192} 11/07/2021 08:27:32 - INFO - __main__ - Step 79755: {'lr': 0.00023009955820359112, 'samples': 15312960, 'steps': 79754, 'loss/train': 1.2929112911224365} 11/07/2021 08:27:32 - INFO - __main__ - Step 79756: {'lr': 0.00023009426830338303, 'samples': 15313152, 'steps': 79755, 'loss/train': 1.6950390338897705} 11/07/2021 08:27:33 - INFO - __main__ - Step 79757: {'lr': 0.00023008897841214415, 'samples': 15313344, 'steps': 79756, 'loss/train': 0.7735011577606201} 11/07/2021 08:27:33 - INFO - __main__ - Step 79758: {'lr': 0.0002300836885298769, 'samples': 15313536, 'steps': 79757, 'loss/train': 1.7133259773254395} 11/07/2021 08:27:33 - INFO - __main__ - Step 79759: {'lr': 0.00023007839865658373, 'samples': 15313728, 'steps': 79758, 'loss/train': 2.143049955368042} 11/07/2021 08:27:34 - INFO - __main__ - Step 79760: {'lr': 0.0002300731087922668, 'samples': 15313920, 'steps': 79759, 'loss/train': 1.5173732042312622} 11/07/2021 08:27:35 - INFO - __main__ - Step 79761: {'lr': 0.00023006781893692864, 'samples': 15314112, 'steps': 79760, 'loss/train': 1.3644198179244995} 11/07/2021 08:27:35 - INFO - __main__ - Step 79762: {'lr': 0.0002300625290905716, 'samples': 15314304, 'steps': 79761, 'loss/train': 1.3269479274749756} 11/07/2021 08:27:35 - INFO - __main__ - Step 79763: {'lr': 0.00023005723925319807, 'samples': 15314496, 'steps': 79762, 'loss/train': 0.6566521525382996} 11/07/2021 08:27:36 - INFO - __main__ - Step 79764: {'lr': 0.00023005194942481047, 'samples': 15314688, 'steps': 79763, 'loss/train': 1.3848979473114014} 11/07/2021 08:27:37 - INFO - __main__ - Step 79765: {'lr': 0.00023004665960541112, 'samples': 15314880, 'steps': 79764, 'loss/train': 0.5247137546539307} 11/07/2021 08:27:37 - INFO - __main__ - Step 79766: {'lr': 0.00023004136979500246, 'samples': 15315072, 'steps': 79765, 'loss/train': 0.6782026290893555} 11/07/2021 08:27:38 - INFO - __main__ - Step 79767: {'lr': 0.00023003607999358685, 'samples': 15315264, 'steps': 79766, 'loss/train': 1.5586668252944946} 11/07/2021 08:27:38 - INFO - __main__ - Step 79768: {'lr': 0.00023003079020116664, 'samples': 15315456, 'steps': 79767, 'loss/train': 4.53874397277832} 11/07/2021 08:27:38 - INFO - __main__ - Step 79769: {'lr': 0.0002300255004177443, 'samples': 15315648, 'steps': 79768, 'loss/train': 1.8848094940185547} 11/07/2021 08:27:39 - INFO - __main__ - Step 79770: {'lr': 0.00023002021064332212, 'samples': 15315840, 'steps': 79769, 'loss/train': 1.6596232652664185} 11/07/2021 08:27:40 - INFO - __main__ - Step 79771: {'lr': 0.00023001492087790253, 'samples': 15316032, 'steps': 79770, 'loss/train': 1.5995010137557983} 11/07/2021 08:27:40 - INFO - __main__ - Step 79772: {'lr': 0.00023000963112148793, 'samples': 15316224, 'steps': 79771, 'loss/train': 0.9136497378349304} 11/07/2021 08:27:41 - INFO - __main__ - Step 79773: {'lr': 0.0002300043413740808, 'samples': 15316416, 'steps': 79772, 'loss/train': 1.541831135749817} 11/07/2021 08:27:41 - INFO - __main__ - Step 79774: {'lr': 0.00022999905163568327, 'samples': 15316608, 'steps': 79773, 'loss/train': 1.1454018354415894} 11/07/2021 08:27:41 - INFO - __main__ - Step 79775: {'lr': 0.00022999376190629788, 'samples': 15316800, 'steps': 79774, 'loss/train': 1.5577061176300049} 11/07/2021 08:27:42 - INFO - __main__ - Step 79776: {'lr': 0.00022998847218592698, 'samples': 15316992, 'steps': 79775, 'loss/train': 1.5379384756088257} 11/07/2021 08:27:43 - INFO - __main__ - Step 79777: {'lr': 0.00022998318247457295, 'samples': 15317184, 'steps': 79776, 'loss/train': 1.238409161567688} 11/07/2021 08:27:43 - INFO - __main__ - Step 79778: {'lr': 0.0002299778927722382, 'samples': 15317376, 'steps': 79777, 'loss/train': 1.1540559530258179} 11/07/2021 08:27:43 - INFO - __main__ - Step 79779: {'lr': 0.0002299726030789251, 'samples': 15317568, 'steps': 79778, 'loss/train': 1.2522932291030884} 11/07/2021 08:27:44 - INFO - __main__ - Step 79780: {'lr': 0.00022996731339463604, 'samples': 15317760, 'steps': 79779, 'loss/train': 1.7270190715789795} 11/07/2021 08:27:45 - INFO - __main__ - Step 79781: {'lr': 0.00022996202371937342, 'samples': 15317952, 'steps': 79780, 'loss/train': 0.9803669452667236} 11/07/2021 08:27:45 - INFO - __main__ - Step 79782: {'lr': 0.00022995673405313955, 'samples': 15318144, 'steps': 79781, 'loss/train': 1.0342141389846802} 11/07/2021 08:27:45 - INFO - __main__ - Step 79783: {'lr': 0.0002299514443959369, 'samples': 15318336, 'steps': 79782, 'loss/train': 1.6731730699539185} 11/07/2021 08:27:46 - INFO - __main__ - Step 79784: {'lr': 0.00022994615474776785, 'samples': 15318528, 'steps': 79783, 'loss/train': 1.367175579071045} 11/07/2021 08:27:46 - INFO - __main__ - Step 79785: {'lr': 0.00022994086510863472, 'samples': 15318720, 'steps': 79784, 'loss/train': 1.081807017326355} 11/07/2021 08:27:47 - INFO - __main__ - Step 79786: {'lr': 0.00022993557547854002, 'samples': 15318912, 'steps': 79785, 'loss/train': 1.0319243669509888} 11/07/2021 08:27:47 - INFO - __main__ - Step 79787: {'lr': 0.00022993028585748597, 'samples': 15319104, 'steps': 79786, 'loss/train': 1.4999525547027588} 11/07/2021 08:27:48 - INFO - __main__ - Step 79788: {'lr': 0.00022992499624547498, 'samples': 15319296, 'steps': 79787, 'loss/train': 1.622420310974121} 11/07/2021 08:27:48 - INFO - __main__ - Step 79789: {'lr': 0.0002299197066425095, 'samples': 15319488, 'steps': 79788, 'loss/train': 1.2486767768859863} 11/07/2021 08:27:49 - INFO - __main__ - Step 79790: {'lr': 0.0002299144170485919, 'samples': 15319680, 'steps': 79789, 'loss/train': 1.333480954170227} 11/07/2021 08:27:49 - INFO - __main__ - Step 79791: {'lr': 0.00022990912746372454, 'samples': 15319872, 'steps': 79790, 'loss/train': 1.3754236698150635} 11/07/2021 08:27:50 - INFO - __main__ - Step 79792: {'lr': 0.0002299038378879098, 'samples': 15320064, 'steps': 79791, 'loss/train': 0.5838215947151184} 11/07/2021 08:27:50 - INFO - __main__ - Step 79793: {'lr': 0.00022989854832115012, 'samples': 15320256, 'steps': 79792, 'loss/train': 1.6623133420944214} 11/07/2021 08:27:51 - INFO - __main__ - Step 79794: {'lr': 0.00022989325876344782, 'samples': 15320448, 'steps': 79793, 'loss/train': 1.7074930667877197} 11/07/2021 08:27:51 - INFO - __main__ - Step 79795: {'lr': 0.00022988796921480533, 'samples': 15320640, 'steps': 79794, 'loss/train': 1.3099621534347534} 11/07/2021 08:27:51 - INFO - __main__ - Step 79796: {'lr': 0.00022988267967522498, 'samples': 15320832, 'steps': 79795, 'loss/train': 1.494253158569336} 11/07/2021 08:27:52 - INFO - __main__ - Step 79797: {'lr': 0.0002298773901447092, 'samples': 15321024, 'steps': 79796, 'loss/train': 1.246923565864563} 11/07/2021 08:27:53 - INFO - __main__ - Step 79798: {'lr': 0.00022987210062326043, 'samples': 15321216, 'steps': 79797, 'loss/train': 1.3267674446105957} 11/07/2021 08:27:53 - INFO - __main__ - Step 79799: {'lr': 0.00022986681111088086, 'samples': 15321408, 'steps': 79798, 'loss/train': 1.6260688304901123} 11/07/2021 08:27:53 - INFO - __main__ - Step 79800: {'lr': 0.00022986152160757312, 'samples': 15321600, 'steps': 79799, 'loss/train': 1.267752766609192} 11/07/2021 08:27:54 - INFO - __main__ - Step 79801: {'lr': 0.0002298562321133394, 'samples': 15321792, 'steps': 79800, 'loss/train': 1.4129209518432617} 11/07/2021 08:27:55 - INFO - __main__ - Step 79802: {'lr': 0.00022985094262818214, 'samples': 15321984, 'steps': 79801, 'loss/train': 1.3967362642288208} 11/07/2021 08:27:55 - INFO - __main__ - Step 79803: {'lr': 0.0002298456531521037, 'samples': 15322176, 'steps': 79802, 'loss/train': 0.07013366371393204} 11/07/2021 08:27:55 - INFO - __main__ - Step 79804: {'lr': 0.00022984036368510656, 'samples': 15322368, 'steps': 79803, 'loss/train': 1.3587563037872314} 11/07/2021 08:27:56 - INFO - __main__ - Step 79805: {'lr': 0.00022983507422719298, 'samples': 15322560, 'steps': 79804, 'loss/train': 1.2905155420303345} 11/07/2021 08:27:56 - INFO - __main__ - Step 79806: {'lr': 0.00022982978477836545, 'samples': 15322752, 'steps': 79805, 'loss/train': 1.7019988298416138} 11/07/2021 08:27:57 - INFO - __main__ - Step 79807: {'lr': 0.0002298244953386263, 'samples': 15322944, 'steps': 79806, 'loss/train': 1.3765721321105957} 11/07/2021 08:27:58 - INFO - __main__ - Step 79808: {'lr': 0.00022981920590797793, 'samples': 15323136, 'steps': 79807, 'loss/train': 0.4085588753223419} 11/07/2021 08:27:58 - INFO - __main__ - Step 79809: {'lr': 0.00022981391648642274, 'samples': 15323328, 'steps': 79808, 'loss/train': 1.3212484121322632} 11/07/2021 08:27:58 - INFO - __main__ - Step 79810: {'lr': 0.00022980862707396306, 'samples': 15323520, 'steps': 79809, 'loss/train': 1.5756726264953613} 11/07/2021 08:27:59 - INFO - __main__ - Step 79811: {'lr': 0.00022980333767060127, 'samples': 15323712, 'steps': 79810, 'loss/train': 1.2367199659347534} 11/07/2021 08:27:59 - INFO - __main__ - Step 79812: {'lr': 0.0002297980482763398, 'samples': 15323904, 'steps': 79811, 'loss/train': 1.496454119682312} 11/07/2021 08:28:00 - INFO - __main__ - Step 79813: {'lr': 0.00022979275889118105, 'samples': 15324096, 'steps': 79812, 'loss/train': 1.5559802055358887} 11/07/2021 08:28:00 - INFO - __main__ - Step 79814: {'lr': 0.00022978746951512737, 'samples': 15324288, 'steps': 79813, 'loss/train': 1.5251805782318115} 11/07/2021 08:28:01 - INFO - __main__ - Step 79815: {'lr': 0.0002297821801481811, 'samples': 15324480, 'steps': 79814, 'loss/train': 1.29502272605896} 11/07/2021 08:28:01 - INFO - __main__ - Step 79816: {'lr': 0.0002297768907903447, 'samples': 15324672, 'steps': 79815, 'loss/train': 2.5854573249816895} 11/07/2021 08:28:01 - INFO - __main__ - Step 79817: {'lr': 0.0002297716014416205, 'samples': 15324864, 'steps': 79816, 'loss/train': 1.2176529169082642} 11/07/2021 08:28:02 - INFO - __main__ - Step 79818: {'lr': 0.0002297663121020109, 'samples': 15325056, 'steps': 79817, 'loss/train': 1.3698030710220337} 11/07/2021 08:28:03 - INFO - __main__ - Step 79819: {'lr': 0.0002297610227715183, 'samples': 15325248, 'steps': 79818, 'loss/train': 1.3839244842529297} 11/07/2021 08:28:03 - INFO - __main__ - Step 79820: {'lr': 0.0002297557334501451, 'samples': 15325440, 'steps': 79819, 'loss/train': 0.10593345761299133} 11/07/2021 08:28:03 - INFO - __main__ - Step 79821: {'lr': 0.00022975044413789365, 'samples': 15325632, 'steps': 79820, 'loss/train': 1.373984932899475} 11/07/2021 08:28:04 - INFO - __main__ - Step 79822: {'lr': 0.0002297451548347663, 'samples': 15325824, 'steps': 79821, 'loss/train': 1.2060734033584595} 11/07/2021 08:28:05 - INFO - __main__ - Step 79823: {'lr': 0.0002297398655407655, 'samples': 15326016, 'steps': 79822, 'loss/train': 1.7920578718185425} 11/07/2021 08:28:05 - INFO - __main__ - Step 79824: {'lr': 0.00022973457625589355, 'samples': 15326208, 'steps': 79823, 'loss/train': 1.2837471961975098} 11/07/2021 08:28:05 - INFO - __main__ - Step 79825: {'lr': 0.00022972928698015293, 'samples': 15326400, 'steps': 79824, 'loss/train': 1.0413275957107544} 11/07/2021 08:28:06 - INFO - __main__ - Step 79826: {'lr': 0.00022972399771354596, 'samples': 15326592, 'steps': 79825, 'loss/train': 1.3726071119308472} 11/07/2021 08:28:06 - INFO - __main__ - Step 79827: {'lr': 0.00022971870845607512, 'samples': 15326784, 'steps': 79826, 'loss/train': 1.6392492055892944} 11/07/2021 08:28:07 - INFO - __main__ - Step 79828: {'lr': 0.00022971341920774267, 'samples': 15326976, 'steps': 79827, 'loss/train': 1.5346013307571411} 11/07/2021 08:28:08 - INFO - __main__ - Step 79829: {'lr': 0.000229708129968551, 'samples': 15327168, 'steps': 79828, 'loss/train': 1.3376235961914062} 11/07/2021 08:28:08 - INFO - __main__ - Step 79830: {'lr': 0.00022970284073850256, 'samples': 15327360, 'steps': 79829, 'loss/train': 1.0237656831741333} 11/07/2021 08:28:08 - INFO - __main__ - Step 79831: {'lr': 0.00022969755151759974, 'samples': 15327552, 'steps': 79830, 'loss/train': 1.448690414428711} 11/07/2021 08:28:09 - INFO - __main__ - Step 79832: {'lr': 0.00022969226230584486, 'samples': 15327744, 'steps': 79831, 'loss/train': 1.2395650148391724} 11/07/2021 08:28:10 - INFO - __main__ - Step 79833: {'lr': 0.00022968697310324032, 'samples': 15327936, 'steps': 79832, 'loss/train': 1.4972881078720093} 11/07/2021 08:28:10 - INFO - __main__ - Step 79834: {'lr': 0.00022968168390978853, 'samples': 15328128, 'steps': 79833, 'loss/train': 1.4186813831329346} 11/07/2021 08:28:10 - INFO - __main__ - Step 79835: {'lr': 0.00022967639472549185, 'samples': 15328320, 'steps': 79834, 'loss/train': 1.432180404663086} 11/07/2021 08:28:11 - INFO - __main__ - Step 79836: {'lr': 0.00022967110555035267, 'samples': 15328512, 'steps': 79835, 'loss/train': 1.5943642854690552} 11/07/2021 08:28:11 - INFO - __main__ - Step 79837: {'lr': 0.0002296658163843734, 'samples': 15328704, 'steps': 79836, 'loss/train': 1.1281059980392456} 11/07/2021 08:28:12 - INFO - __main__ - Step 79838: {'lr': 0.00022966052722755637, 'samples': 15328896, 'steps': 79837, 'loss/train': 1.2914085388183594} 11/07/2021 08:28:12 - INFO - __main__ - Step 79839: {'lr': 0.00022965523807990399, 'samples': 15329088, 'steps': 79838, 'loss/train': 1.532502293586731} 11/07/2021 08:28:13 - INFO - __main__ - Step 79840: {'lr': 0.00022964994894141873, 'samples': 15329280, 'steps': 79839, 'loss/train': 1.2881628274917603} 11/07/2021 08:28:13 - INFO - __main__ - Step 79841: {'lr': 0.0002296446598121028, 'samples': 15329472, 'steps': 79840, 'loss/train': 1.3750369548797607} 11/07/2021 08:28:13 - INFO - __main__ - Step 79842: {'lr': 0.00022963937069195875, 'samples': 15329664, 'steps': 79841, 'loss/train': 1.4511752128601074} 11/07/2021 08:28:15 - INFO - __main__ - Step 79843: {'lr': 0.00022963408158098884, 'samples': 15329856, 'steps': 79842, 'loss/train': 1.363360047340393} 11/07/2021 08:28:15 - INFO - __main__ - Step 79844: {'lr': 0.00022962879247919547, 'samples': 15330048, 'steps': 79843, 'loss/train': 1.046235203742981} 11/07/2021 08:28:15 - INFO - __main__ - Step 79845: {'lr': 0.00022962350338658107, 'samples': 15330240, 'steps': 79844, 'loss/train': 1.403529405593872} 11/07/2021 08:28:16 - INFO - __main__ - Step 79846: {'lr': 0.000229618214303148, 'samples': 15330432, 'steps': 79845, 'loss/train': 1.2813661098480225} 11/07/2021 08:28:16 - INFO - __main__ - Step 79847: {'lr': 0.00022961292522889865, 'samples': 15330624, 'steps': 79846, 'loss/train': 1.647111415863037} 11/07/2021 08:28:17 - INFO - __main__ - Step 79848: {'lr': 0.0002296076361638354, 'samples': 15330816, 'steps': 79847, 'loss/train': 0.9745298027992249} 11/07/2021 08:28:17 - INFO - __main__ - Step 79849: {'lr': 0.00022960234710796062, 'samples': 15331008, 'steps': 79848, 'loss/train': 1.4340178966522217} 11/07/2021 08:28:18 - INFO - __main__ - Step 79850: {'lr': 0.00022959705806127674, 'samples': 15331200, 'steps': 79849, 'loss/train': 1.2653424739837646} 11/07/2021 08:28:18 - INFO - __main__ - Step 79851: {'lr': 0.0002295917690237861, 'samples': 15331392, 'steps': 79850, 'loss/train': 1.2027767896652222} 11/07/2021 08:28:18 - INFO - __main__ - Step 79852: {'lr': 0.00022958647999549107, 'samples': 15331584, 'steps': 79851, 'loss/train': 1.5028241872787476} 11/07/2021 08:28:19 - INFO - __main__ - Step 79853: {'lr': 0.00022958119097639417, 'samples': 15331776, 'steps': 79852, 'loss/train': 1.5222716331481934} 11/07/2021 08:28:20 - INFO - __main__ - Step 79854: {'lr': 0.00022957590196649757, 'samples': 15331968, 'steps': 79853, 'loss/train': 1.1195296049118042} 11/07/2021 08:28:20 - INFO - __main__ - Step 79855: {'lr': 0.00022957061296580378, 'samples': 15332160, 'steps': 79854, 'loss/train': 2.0562336444854736} 11/07/2021 08:28:20 - INFO - __main__ - Step 79856: {'lr': 0.0002295653239743151, 'samples': 15332352, 'steps': 79855, 'loss/train': 1.1434096097946167} 11/07/2021 08:28:21 - INFO - __main__ - Step 79857: {'lr': 0.00022956003499203403, 'samples': 15332544, 'steps': 79856, 'loss/train': 1.1382882595062256} 11/07/2021 08:28:21 - INFO - __main__ - Step 79858: {'lr': 0.00022955474601896286, 'samples': 15332736, 'steps': 79857, 'loss/train': 1.0835182666778564} 11/07/2021 08:28:22 - INFO - __main__ - Step 79859: {'lr': 0.00022954945705510403, 'samples': 15332928, 'steps': 79858, 'loss/train': 1.6638842821121216} 11/07/2021 08:28:23 - INFO - __main__ - Step 79860: {'lr': 0.00022954416810045986, 'samples': 15333120, 'steps': 79859, 'loss/train': 0.7995797991752625} 11/07/2021 08:28:23 - INFO - __main__ - Step 79861: {'lr': 0.0002295388791550328, 'samples': 15333312, 'steps': 79860, 'loss/train': 1.4185725450515747} 11/07/2021 08:28:23 - INFO - __main__ - Step 79862: {'lr': 0.0002295335902188252, 'samples': 15333504, 'steps': 79861, 'loss/train': 1.4709798097610474} 11/07/2021 08:28:24 - INFO - __main__ - Step 79863: {'lr': 0.00022952830129183943, 'samples': 15333696, 'steps': 79862, 'loss/train': 0.8694478273391724} 11/07/2021 08:28:25 - INFO - __main__ - Step 79864: {'lr': 0.00022952301237407792, 'samples': 15333888, 'steps': 79863, 'loss/train': 1.3359230756759644} 11/07/2021 08:28:25 - INFO - __main__ - Step 79865: {'lr': 0.000229517723465543, 'samples': 15334080, 'steps': 79864, 'loss/train': 1.734090805053711} 11/07/2021 08:28:25 - INFO - __main__ - Step 79866: {'lr': 0.0002295124345662371, 'samples': 15334272, 'steps': 79865, 'loss/train': 1.3956975936889648} 11/07/2021 08:28:26 - INFO - __main__ - Step 79867: {'lr': 0.00022950714567616267, 'samples': 15334464, 'steps': 79866, 'loss/train': 1.430510401725769} 11/07/2021 08:28:26 - INFO - __main__ - Step 79868: {'lr': 0.00022950185679532193, 'samples': 15334656, 'steps': 79867, 'loss/train': 1.5364747047424316} 11/07/2021 08:28:27 - INFO - __main__ - Step 79869: {'lr': 0.00022949656792371732, 'samples': 15334848, 'steps': 79868, 'loss/train': 1.008333444595337} 11/07/2021 08:28:27 - INFO - __main__ - Step 79870: {'lr': 0.00022949127906135122, 'samples': 15335040, 'steps': 79869, 'loss/train': 1.0128475427627563} 11/07/2021 08:28:28 - INFO - __main__ - Step 79871: {'lr': 0.00022948599020822605, 'samples': 15335232, 'steps': 79870, 'loss/train': 1.723996639251709} 11/07/2021 08:28:28 - INFO - __main__ - Step 79872: {'lr': 0.00022948070136434416, 'samples': 15335424, 'steps': 79871, 'loss/train': 1.0591048002243042} 11/07/2021 08:28:28 - INFO - __main__ - Step 79873: {'lr': 0.00022947541252970797, 'samples': 15335616, 'steps': 79872, 'loss/train': 1.0666372776031494} 11/07/2021 08:28:30 - INFO - __main__ - Step 79874: {'lr': 0.00022947012370431983, 'samples': 15335808, 'steps': 79873, 'loss/train': 1.7142202854156494} 11/07/2021 08:28:30 - INFO - __main__ - Step 79875: {'lr': 0.00022946483488818216, 'samples': 15336000, 'steps': 79874, 'loss/train': 1.4508953094482422} 11/07/2021 08:28:30 - INFO - __main__ - Step 79876: {'lr': 0.00022945954608129726, 'samples': 15336192, 'steps': 79875, 'loss/train': 1.3177998065948486} 11/07/2021 08:28:31 - INFO - __main__ - Step 79877: {'lr': 0.00022945425728366763, 'samples': 15336384, 'steps': 79876, 'loss/train': 1.061009168624878} 11/07/2021 08:28:31 - INFO - __main__ - Step 79878: {'lr': 0.00022944896849529556, 'samples': 15336576, 'steps': 79877, 'loss/train': 1.0129402875900269} 11/07/2021 08:28:31 - INFO - __main__ - Step 79879: {'lr': 0.00022944367971618348, 'samples': 15336768, 'steps': 79878, 'loss/train': 1.5442044734954834} 11/07/2021 08:28:32 - INFO - __main__ - Step 79880: {'lr': 0.0002294383909463339, 'samples': 15336960, 'steps': 79879, 'loss/train': 5.738125801086426} 11/07/2021 08:28:33 - INFO - __main__ - Step 79881: {'lr': 0.00022943310218574893, 'samples': 15337152, 'steps': 79880, 'loss/train': 1.6824594736099243} 11/07/2021 08:28:33 - INFO - __main__ - Step 79882: {'lr': 0.00022942781343443107, 'samples': 15337344, 'steps': 79881, 'loss/train': 0.525524914264679} 11/07/2021 08:28:33 - INFO - __main__ - Step 79883: {'lr': 0.00022942252469238274, 'samples': 15337536, 'steps': 79882, 'loss/train': 1.3511989116668701} 11/07/2021 08:28:34 - INFO - __main__ - Step 79884: {'lr': 0.00022941723595960628, 'samples': 15337728, 'steps': 79883, 'loss/train': 1.4892412424087524} 11/07/2021 08:28:35 - INFO - __main__ - Step 79885: {'lr': 0.00022941194723610412, 'samples': 15337920, 'steps': 79884, 'loss/train': 1.3595565557479858} 11/07/2021 08:28:35 - INFO - __main__ - Step 79886: {'lr': 0.0002294066585218786, 'samples': 15338112, 'steps': 79885, 'loss/train': 1.1978139877319336} 11/07/2021 08:28:35 - INFO - __main__ - Step 79887: {'lr': 0.00022940136981693213, 'samples': 15338304, 'steps': 79886, 'loss/train': 1.27634596824646} 11/07/2021 08:28:36 - INFO - __main__ - Step 79888: {'lr': 0.00022939608112126708, 'samples': 15338496, 'steps': 79887, 'loss/train': 1.4139235019683838} 11/07/2021 08:28:36 - INFO - __main__ - Step 79889: {'lr': 0.00022939079243488586, 'samples': 15338688, 'steps': 79888, 'loss/train': 1.652719259262085} 11/07/2021 08:28:37 - INFO - __main__ - Step 79890: {'lr': 0.00022938550375779083, 'samples': 15338880, 'steps': 79889, 'loss/train': 1.2229371070861816} 11/07/2021 08:28:37 - INFO - __main__ - Step 79891: {'lr': 0.00022938021508998435, 'samples': 15339072, 'steps': 79890, 'loss/train': 0.8301841020584106} 11/07/2021 08:28:38 - INFO - __main__ - Step 79892: {'lr': 0.00022937492643146886, 'samples': 15339264, 'steps': 79891, 'loss/train': 1.8084802627563477} 11/07/2021 08:28:38 - INFO - __main__ - Step 79893: {'lr': 0.00022936963778224666, 'samples': 15339456, 'steps': 79892, 'loss/train': 1.4383283853530884} 11/07/2021 08:28:39 - INFO - __main__ - Step 79894: {'lr': 0.00022936434914232033, 'samples': 15339648, 'steps': 79893, 'loss/train': 1.5026119947433472} 11/07/2021 08:28:40 - INFO - __main__ - Step 79895: {'lr': 0.00022935906051169198, 'samples': 15339840, 'steps': 79894, 'loss/train': 0.6573525071144104} 11/07/2021 08:28:40 - INFO - __main__ - Step 79896: {'lr': 0.0002293537718903641, 'samples': 15340032, 'steps': 79895, 'loss/train': 1.3547418117523193} 11/07/2021 08:28:40 - INFO - __main__ - Step 79897: {'lr': 0.00022934848327833913, 'samples': 15340224, 'steps': 79896, 'loss/train': 0.9993791580200195} 11/07/2021 08:28:41 - INFO - __main__ - Step 79898: {'lr': 0.0002293431946756194, 'samples': 15340416, 'steps': 79897, 'loss/train': 1.3407343626022339} 11/07/2021 08:28:41 - INFO - __main__ - Step 79899: {'lr': 0.00022933790608220731, 'samples': 15340608, 'steps': 79898, 'loss/train': 0.5556599497795105} 11/07/2021 08:28:41 - INFO - __main__ - Step 79900: {'lr': 0.00022933261749810525, 'samples': 15340800, 'steps': 79899, 'loss/train': 2.4314961433410645} 11/07/2021 08:28:42 - INFO - __main__ - Step 79901: {'lr': 0.00022932732892331557, 'samples': 15340992, 'steps': 79900, 'loss/train': 1.3316954374313354} 11/07/2021 08:28:43 - INFO - __main__ - Step 79902: {'lr': 0.00022932204035784067, 'samples': 15341184, 'steps': 79901, 'loss/train': 1.525676965713501} 11/07/2021 08:28:43 - INFO - __main__ - Step 79903: {'lr': 0.000229316751801683, 'samples': 15341376, 'steps': 79902, 'loss/train': 2.010554075241089} 11/07/2021 08:28:43 - INFO - __main__ - Step 79904: {'lr': 0.0002293114632548448, 'samples': 15341568, 'steps': 79903, 'loss/train': 1.5300846099853516} 11/07/2021 08:28:44 - INFO - __main__ - Step 79905: {'lr': 0.00022930617471732858, 'samples': 15341760, 'steps': 79904, 'loss/train': 1.629190444946289} 11/07/2021 08:28:45 - INFO - __main__ - Step 79906: {'lr': 0.00022930088618913668, 'samples': 15341952, 'steps': 79905, 'loss/train': 1.3382898569107056} 11/07/2021 08:28:45 - INFO - __main__ - Step 79907: {'lr': 0.0002292955976702716, 'samples': 15342144, 'steps': 79906, 'loss/train': 1.3419005870819092} 11/07/2021 08:28:45 - INFO - __main__ - Step 79908: {'lr': 0.00022929030916073547, 'samples': 15342336, 'steps': 79907, 'loss/train': 1.3361066579818726} 11/07/2021 08:28:46 - INFO - __main__ - Step 79909: {'lr': 0.00022928502066053085, 'samples': 15342528, 'steps': 79908, 'loss/train': 1.166159987449646} 11/07/2021 08:28:46 - INFO - __main__ - Step 79910: {'lr': 0.00022927973216966004, 'samples': 15342720, 'steps': 79909, 'loss/train': 1.5385888814926147} 11/07/2021 08:28:47 - INFO - __main__ - Step 79911: {'lr': 0.00022927444368812545, 'samples': 15342912, 'steps': 79910, 'loss/train': 1.162705898284912} 11/07/2021 08:28:48 - INFO - __main__ - Step 79912: {'lr': 0.0002292691552159295, 'samples': 15343104, 'steps': 79911, 'loss/train': 1.0277003049850464} 11/07/2021 08:28:48 - INFO - __main__ - Step 79913: {'lr': 0.00022926386675307454, 'samples': 15343296, 'steps': 79912, 'loss/train': 1.1219159364700317} 11/07/2021 08:28:48 - INFO - __main__ - Step 79914: {'lr': 0.00022925857829956297, 'samples': 15343488, 'steps': 79913, 'loss/train': 1.2234524488449097} 11/07/2021 08:28:49 - INFO - __main__ - Step 79915: {'lr': 0.00022925328985539718, 'samples': 15343680, 'steps': 79914, 'loss/train': 1.2108120918273926} 11/07/2021 08:28:50 - INFO - __main__ - Step 79916: {'lr': 0.0002292480014205795, 'samples': 15343872, 'steps': 79915, 'loss/train': 1.4757109880447388} 11/07/2021 08:28:50 - INFO - __main__ - Step 79917: {'lr': 0.00022924271299511238, 'samples': 15344064, 'steps': 79916, 'loss/train': 1.1580193042755127} 11/07/2021 08:28:50 - INFO - __main__ - Step 79918: {'lr': 0.00022923742457899815, 'samples': 15344256, 'steps': 79917, 'loss/train': 1.0969698429107666} 11/07/2021 08:28:51 - INFO - __main__ - Step 79919: {'lr': 0.00022923213617223923, 'samples': 15344448, 'steps': 79918, 'loss/train': 1.5978435277938843} 11/07/2021 08:28:51 - INFO - __main__ - Step 79920: {'lr': 0.00022922684777483798, 'samples': 15344640, 'steps': 79919, 'loss/train': 1.3867871761322021} 11/07/2021 08:28:52 - INFO - __main__ - Step 79921: {'lr': 0.0002292215593867969, 'samples': 15344832, 'steps': 79920, 'loss/train': 1.3885996341705322} 11/07/2021 08:28:52 - INFO - __main__ - Step 79922: {'lr': 0.0002292162710081182, 'samples': 15345024, 'steps': 79921, 'loss/train': 1.2381337881088257} 11/07/2021 08:28:53 - INFO - __main__ - Step 79923: {'lr': 0.00022921098263880427, 'samples': 15345216, 'steps': 79922, 'loss/train': 1.162507176399231} 11/07/2021 08:28:53 - INFO - __main__ - Step 79924: {'lr': 0.0002292056942788576, 'samples': 15345408, 'steps': 79923, 'loss/train': 0.9673729538917542} 11/07/2021 08:28:53 - INFO - __main__ - Step 79925: {'lr': 0.00022920040592828048, 'samples': 15345600, 'steps': 79924, 'loss/train': 0.9327985644340515} 11/07/2021 08:28:55 - INFO - __main__ - Step 79926: {'lr': 0.00022919511758707535, 'samples': 15345792, 'steps': 79925, 'loss/train': 0.6029524207115173} 11/07/2021 08:28:55 - INFO - __main__ - Step 79927: {'lr': 0.00022918982925524458, 'samples': 15345984, 'steps': 79926, 'loss/train': 1.2569876909255981} 11/07/2021 08:28:55 - INFO - __main__ - Step 79928: {'lr': 0.00022918454093279056, 'samples': 15346176, 'steps': 79927, 'loss/train': 0.5344040393829346} 11/07/2021 08:28:56 - INFO - __main__ - Step 79929: {'lr': 0.00022917925261971566, 'samples': 15346368, 'steps': 79928, 'loss/train': 1.0783864259719849} 11/07/2021 08:28:56 - INFO - __main__ - Step 79930: {'lr': 0.00022917396431602224, 'samples': 15346560, 'steps': 79929, 'loss/train': 1.4798184633255005} 11/07/2021 08:28:57 - INFO - __main__ - Step 79931: {'lr': 0.00022916867602171276, 'samples': 15346752, 'steps': 79930, 'loss/train': 1.5611289739608765} 11/07/2021 08:28:58 - INFO - __main__ - Step 79932: {'lr': 0.0002291633877367895, 'samples': 15346944, 'steps': 79931, 'loss/train': 1.4855656623840332} 11/07/2021 08:28:58 - INFO - __main__ - Step 79933: {'lr': 0.000229158099461255, 'samples': 15347136, 'steps': 79932, 'loss/train': 0.8068594932556152} 11/07/2021 08:28:58 - INFO - __main__ - Step 79934: {'lr': 0.00022915281119511153, 'samples': 15347328, 'steps': 79933, 'loss/train': 1.0032789707183838} 11/07/2021 08:28:59 - INFO - __main__ - Step 79935: {'lr': 0.0002291475229383614, 'samples': 15347520, 'steps': 79934, 'loss/train': 1.4455139636993408} 11/07/2021 08:28:59 - INFO - __main__ - Step 79936: {'lr': 0.0002291422346910071, 'samples': 15347712, 'steps': 79935, 'loss/train': 1.3381215333938599} 11/07/2021 08:29:00 - INFO - __main__ - Step 79937: {'lr': 0.00022913694645305098, 'samples': 15347904, 'steps': 79936, 'loss/train': 1.4224604368209839} 11/07/2021 08:29:00 - INFO - __main__ - Step 79938: {'lr': 0.0002291316582244954, 'samples': 15348096, 'steps': 79937, 'loss/train': 1.3390038013458252} 11/07/2021 08:29:01 - INFO - __main__ - Step 79939: {'lr': 0.0002291263700053428, 'samples': 15348288, 'steps': 79938, 'loss/train': 1.3953622579574585} 11/07/2021 08:29:01 - INFO - __main__ - Step 79940: {'lr': 0.00022912108179559554, 'samples': 15348480, 'steps': 79939, 'loss/train': 1.1654584407806396} 11/07/2021 08:29:01 - INFO - __main__ - Step 79941: {'lr': 0.000229115793595256, 'samples': 15348672, 'steps': 79940, 'loss/train': 1.0009214878082275} 11/07/2021 08:29:02 - INFO - __main__ - Step 79942: {'lr': 0.00022911050540432655, 'samples': 15348864, 'steps': 79941, 'loss/train': 1.4627151489257812} 11/07/2021 08:29:03 - INFO - __main__ - Step 79943: {'lr': 0.00022910521722280957, 'samples': 15349056, 'steps': 79942, 'loss/train': 0.4691723883152008} 11/07/2021 08:29:03 - INFO - __main__ - Step 79944: {'lr': 0.00022909992905070754, 'samples': 15349248, 'steps': 79943, 'loss/train': 1.2405091524124146} 11/07/2021 08:29:03 - INFO - __main__ - Step 79945: {'lr': 0.0002290946408880227, 'samples': 15349440, 'steps': 79944, 'loss/train': 1.2040876150131226} 11/07/2021 08:29:04 - INFO - __main__ - Step 79946: {'lr': 0.00022908935273475747, 'samples': 15349632, 'steps': 79945, 'loss/train': 1.5572001934051514} 11/07/2021 08:29:05 - INFO - __main__ - Step 79947: {'lr': 0.0002290840645909143, 'samples': 15349824, 'steps': 79946, 'loss/train': 1.2879934310913086} 11/07/2021 08:29:05 - INFO - __main__ - Step 79948: {'lr': 0.00022907877645649555, 'samples': 15350016, 'steps': 79947, 'loss/train': 1.0374622344970703} 11/07/2021 08:29:06 - INFO - __main__ - Step 79949: {'lr': 0.0002290734883315035, 'samples': 15350208, 'steps': 79948, 'loss/train': 1.2621259689331055} 11/07/2021 08:29:06 - INFO - __main__ - Step 79950: {'lr': 0.00022906820021594067, 'samples': 15350400, 'steps': 79949, 'loss/train': 1.3140959739685059} 11/07/2021 08:29:06 - INFO - __main__ - Step 79951: {'lr': 0.00022906291210980935, 'samples': 15350592, 'steps': 79950, 'loss/train': 1.0481468439102173} 11/07/2021 08:29:07 - INFO - __main__ - Step 79952: {'lr': 0.00022905762401311197, 'samples': 15350784, 'steps': 79951, 'loss/train': 2.0662553310394287} 11/07/2021 08:29:08 - INFO - __main__ - Step 79953: {'lr': 0.0002290523359258509, 'samples': 15350976, 'steps': 79952, 'loss/train': 1.287536382675171} 11/07/2021 08:29:08 - INFO - __main__ - Step 79954: {'lr': 0.0002290470478480285, 'samples': 15351168, 'steps': 79953, 'loss/train': 1.5657542943954468} 11/07/2021 08:29:08 - INFO - __main__ - Step 79955: {'lr': 0.00022904175977964727, 'samples': 15351360, 'steps': 79954, 'loss/train': 1.6092790365219116} 11/07/2021 08:29:09 - INFO - __main__ - Step 79956: {'lr': 0.00022903647172070943, 'samples': 15351552, 'steps': 79955, 'loss/train': 0.9256002902984619} 11/07/2021 08:29:10 - INFO - __main__ - Step 79957: {'lr': 0.00022903118367121746, 'samples': 15351744, 'steps': 79956, 'loss/train': 0.8180684447288513} 11/07/2021 08:29:10 - INFO - __main__ - Step 79958: {'lr': 0.00022902589563117366, 'samples': 15351936, 'steps': 79957, 'loss/train': 1.5922398567199707} 11/07/2021 08:29:10 - INFO - __main__ - Step 79959: {'lr': 0.0002290206076005805, 'samples': 15352128, 'steps': 79958, 'loss/train': 1.2842808961868286} 11/07/2021 08:29:11 - INFO - __main__ - Step 79960: {'lr': 0.00022901531957944033, 'samples': 15352320, 'steps': 79959, 'loss/train': 1.4345359802246094} 11/07/2021 08:29:11 - INFO - __main__ - Step 79961: {'lr': 0.00022901003156775558, 'samples': 15352512, 'steps': 79960, 'loss/train': 0.6105697154998779} 11/07/2021 08:29:12 - INFO - __main__ - Step 79962: {'lr': 0.00022900474356552853, 'samples': 15352704, 'steps': 79961, 'loss/train': 1.5103693008422852} 11/07/2021 08:29:13 - INFO - __main__ - Step 79963: {'lr': 0.00022899945557276164, 'samples': 15352896, 'steps': 79962, 'loss/train': 1.1638203859329224} 11/07/2021 08:29:13 - INFO - __main__ - Step 79964: {'lr': 0.00022899416758945723, 'samples': 15353088, 'steps': 79963, 'loss/train': 0.7561293840408325} 11/07/2021 08:29:13 - INFO - __main__ - Step 79965: {'lr': 0.00022898887961561777, 'samples': 15353280, 'steps': 79964, 'loss/train': 1.3727288246154785} 11/07/2021 08:29:14 - INFO - __main__ - Step 79966: {'lr': 0.00022898359165124561, 'samples': 15353472, 'steps': 79965, 'loss/train': 1.1162902116775513} 11/07/2021 08:29:15 - INFO - __main__ - Step 79967: {'lr': 0.0002289783036963431, 'samples': 15353664, 'steps': 79966, 'loss/train': 1.380855917930603} 11/07/2021 08:29:15 - INFO - __main__ - Step 79968: {'lr': 0.0002289730157509126, 'samples': 15353856, 'steps': 79967, 'loss/train': 0.7604290843009949} 11/07/2021 08:29:15 - INFO - __main__ - Step 79969: {'lr': 0.00022896772781495658, 'samples': 15354048, 'steps': 79968, 'loss/train': 1.2017508745193481} 11/07/2021 08:29:16 - INFO - __main__ - Step 79970: {'lr': 0.00022896243988847738, 'samples': 15354240, 'steps': 79969, 'loss/train': 0.8415154814720154} 11/07/2021 08:29:16 - INFO - __main__ - Step 79971: {'lr': 0.00022895715197147732, 'samples': 15354432, 'steps': 79970, 'loss/train': 1.1215813159942627} 11/07/2021 08:29:16 - INFO - __main__ - Step 79972: {'lr': 0.00022895186406395892, 'samples': 15354624, 'steps': 79971, 'loss/train': 1.4050142765045166} 11/07/2021 08:29:17 - INFO - __main__ - Step 79973: {'lr': 0.00022894657616592443, 'samples': 15354816, 'steps': 79972, 'loss/train': 1.2337349653244019} 11/07/2021 08:29:18 - INFO - __main__ - Step 79974: {'lr': 0.00022894128827737634, 'samples': 15355008, 'steps': 79973, 'loss/train': 1.772722840309143} 11/07/2021 08:29:18 - INFO - __main__ - Step 79975: {'lr': 0.00022893600039831694, 'samples': 15355200, 'steps': 79974, 'loss/train': 1.83729887008667} 11/07/2021 08:29:18 - INFO - __main__ - Step 79976: {'lr': 0.00022893071252874872, 'samples': 15355392, 'steps': 79975, 'loss/train': 0.7570523023605347} 11/07/2021 08:29:19 - INFO - __main__ - Step 79977: {'lr': 0.00022892542466867395, 'samples': 15355584, 'steps': 79976, 'loss/train': 1.4147313833236694} 11/07/2021 08:29:20 - INFO - __main__ - Step 79978: {'lr': 0.00022892013681809504, 'samples': 15355776, 'steps': 79977, 'loss/train': 1.1485235691070557} 11/07/2021 08:29:20 - INFO - __main__ - Step 79979: {'lr': 0.00022891484897701438, 'samples': 15355968, 'steps': 79978, 'loss/train': 1.1514527797698975} 11/07/2021 08:29:21 - INFO - __main__ - Step 79980: {'lr': 0.00022890956114543439, 'samples': 15356160, 'steps': 79979, 'loss/train': 1.5377910137176514} 11/07/2021 08:29:21 - INFO - __main__ - Step 79981: {'lr': 0.0002289042733233574, 'samples': 15356352, 'steps': 79980, 'loss/train': 1.5074360370635986} 11/07/2021 08:29:21 - INFO - __main__ - Step 79982: {'lr': 0.00022889898551078583, 'samples': 15356544, 'steps': 79981, 'loss/train': 1.036891222000122} 11/07/2021 08:29:22 - INFO - __main__ - Step 79983: {'lr': 0.00022889369770772206, 'samples': 15356736, 'steps': 79982, 'loss/train': 1.3799346685409546} 11/07/2021 08:29:23 - INFO - __main__ - Step 79984: {'lr': 0.00022888840991416845, 'samples': 15356928, 'steps': 79983, 'loss/train': 1.4294331073760986} 11/07/2021 08:29:23 - INFO - __main__ - Step 79985: {'lr': 0.00022888312213012742, 'samples': 15357120, 'steps': 79984, 'loss/train': 1.2691847085952759} 11/07/2021 08:29:23 - INFO - __main__ - Step 79986: {'lr': 0.00022887783435560132, 'samples': 15357312, 'steps': 79985, 'loss/train': 1.4859777688980103} 11/07/2021 08:29:24 - INFO - __main__ - Step 79987: {'lr': 0.0002288725465905925, 'samples': 15357504, 'steps': 79986, 'loss/train': 1.5059484243392944} 11/07/2021 08:29:25 - INFO - __main__ - Step 79988: {'lr': 0.00022886725883510353, 'samples': 15357696, 'steps': 79987, 'loss/train': 1.1438242197036743} 11/07/2021 08:29:25 - INFO - __main__ - Step 79989: {'lr': 0.00022886197108913656, 'samples': 15357888, 'steps': 79988, 'loss/train': 1.5640641450881958} 11/07/2021 08:29:25 - INFO - __main__ - Step 79990: {'lr': 0.00022885668335269403, 'samples': 15358080, 'steps': 79989, 'loss/train': 0.6770896315574646} 11/07/2021 08:29:26 - INFO - __main__ - Step 79991: {'lr': 0.00022885139562577836, 'samples': 15358272, 'steps': 79990, 'loss/train': 1.781388521194458} 11/07/2021 08:29:26 - INFO - __main__ - Step 79992: {'lr': 0.0002288461079083919, 'samples': 15358464, 'steps': 79991, 'loss/train': 1.2112284898757935} 11/07/2021 08:29:27 - INFO - __main__ - Step 79993: {'lr': 0.00022884082020053708, 'samples': 15358656, 'steps': 79992, 'loss/train': 1.3542810678482056} 11/07/2021 08:29:27 - INFO - __main__ - Step 79994: {'lr': 0.00022883553250221627, 'samples': 15358848, 'steps': 79993, 'loss/train': 1.4127070903778076} 11/07/2021 08:29:28 - INFO - __main__ - Step 79995: {'lr': 0.00022883024481343183, 'samples': 15359040, 'steps': 79994, 'loss/train': 0.9270737171173096} 11/07/2021 08:29:28 - INFO - __main__ - Step 79996: {'lr': 0.00022882495713418617, 'samples': 15359232, 'steps': 79995, 'loss/train': 1.4470360279083252} 11/07/2021 08:29:28 - INFO - __main__ - Step 79997: {'lr': 0.00022881966946448166, 'samples': 15359424, 'steps': 79996, 'loss/train': 1.339922308921814} 11/07/2021 08:29:29 - INFO - __main__ - Step 79998: {'lr': 0.00022881438180432064, 'samples': 15359616, 'steps': 79997, 'loss/train': 1.0792616605758667} 11/07/2021 08:29:30 - INFO - __main__ - Step 79999: {'lr': 0.00022880909415370557, 'samples': 15359808, 'steps': 79998, 'loss/train': 1.4091567993164062} 11/07/2021 08:29:30 - INFO - __main__ - Step 80000: {'lr': 0.0002288038065126388, 'samples': 15360000, 'steps': 79999, 'loss/train': 1.5689784288406372} 11/07/2021 08:29:30 - INFO - __main__ - Step 80001: {'lr': 0.0002287985188811228, 'samples': 15360192, 'steps': 80000, 'loss/train': 1.2454047203063965} 11/07/2021 08:29:31 - INFO - __main__ - Step 80002: {'lr': 0.00022879323125915975, 'samples': 15360384, 'steps': 80001, 'loss/train': 1.0398808717727661} 11/07/2021 08:29:32 - INFO - __main__ - Step 80003: {'lr': 0.00022878794364675212, 'samples': 15360576, 'steps': 80002, 'loss/train': 1.3181356191635132} 11/07/2021 08:29:32 - INFO - __main__ - Step 80004: {'lr': 0.00022878265604390236, 'samples': 15360768, 'steps': 80003, 'loss/train': 1.3304119110107422} 11/07/2021 08:29:33 - INFO - __main__ - Step 80005: {'lr': 0.0002287773684506128, 'samples': 15360960, 'steps': 80004, 'loss/train': 1.478332281112671} 11/07/2021 08:29:33 - INFO - __main__ - Step 80006: {'lr': 0.0002287720808668858, 'samples': 15361152, 'steps': 80005, 'loss/train': 1.4737575054168701} 11/07/2021 08:29:33 - INFO - __main__ - Step 80007: {'lr': 0.00022876679329272379, 'samples': 15361344, 'steps': 80006, 'loss/train': 0.21817269921302795} 11/07/2021 08:29:34 - INFO - __main__ - Step 80008: {'lr': 0.00022876150572812912, 'samples': 15361536, 'steps': 80007, 'loss/train': 1.0873039960861206} 11/07/2021 08:29:35 - INFO - __main__ - Step 80009: {'lr': 0.00022875621817310422, 'samples': 15361728, 'steps': 80008, 'loss/train': 2.1439576148986816} 11/07/2021 08:29:35 - INFO - __main__ - Step 80010: {'lr': 0.00022875093062765141, 'samples': 15361920, 'steps': 80009, 'loss/train': 1.1672947406768799} 11/07/2021 08:29:35 - INFO - __main__ - Step 80011: {'lr': 0.00022874564309177312, 'samples': 15362112, 'steps': 80010, 'loss/train': 1.3307663202285767} 11/07/2021 08:29:36 - INFO - __main__ - Step 80012: {'lr': 0.00022874035556547171, 'samples': 15362304, 'steps': 80011, 'loss/train': 0.818961501121521} 11/07/2021 08:29:37 - INFO - __main__ - Step 80013: {'lr': 0.0002287350680487496, 'samples': 15362496, 'steps': 80012, 'loss/train': 0.783221423625946} 11/07/2021 08:29:37 - INFO - __main__ - Step 80014: {'lr': 0.0002287297805416091, 'samples': 15362688, 'steps': 80013, 'loss/train': 1.5347182750701904} 11/07/2021 08:29:37 - INFO - __main__ - Step 80015: {'lr': 0.00022872449304405274, 'samples': 15362880, 'steps': 80014, 'loss/train': 1.291562557220459} 11/07/2021 08:29:38 - INFO - __main__ - Step 80016: {'lr': 0.00022871920555608268, 'samples': 15363072, 'steps': 80015, 'loss/train': 1.455331563949585} 11/07/2021 08:29:38 - INFO - __main__ - Step 80017: {'lr': 0.00022871391807770144, 'samples': 15363264, 'steps': 80016, 'loss/train': 1.3613983392715454} 11/07/2021 08:29:39 - INFO - __main__ - Step 80018: {'lr': 0.00022870863060891138, 'samples': 15363456, 'steps': 80017, 'loss/train': 1.4645850658416748} 11/07/2021 08:29:40 - INFO - __main__ - Step 80019: {'lr': 0.00022870334314971488, 'samples': 15363648, 'steps': 80018, 'loss/train': 1.619707703590393} 11/07/2021 08:29:40 - INFO - __main__ - Step 80020: {'lr': 0.0002286980557001143, 'samples': 15363840, 'steps': 80019, 'loss/train': 1.7483359575271606} 11/07/2021 08:29:40 - INFO - __main__ - Step 80021: {'lr': 0.0002286927682601121, 'samples': 15364032, 'steps': 80020, 'loss/train': 0.9461550712585449} 11/07/2021 08:29:41 - INFO - __main__ - Step 80022: {'lr': 0.0002286874808297106, 'samples': 15364224, 'steps': 80021, 'loss/train': 1.2097058296203613} 11/07/2021 08:29:42 - INFO - __main__ - Step 80023: {'lr': 0.00022868219340891214, 'samples': 15364416, 'steps': 80022, 'loss/train': 1.3960076570510864} 11/07/2021 08:29:42 - INFO - __main__ - Step 80024: {'lr': 0.0002286769059977192, 'samples': 15364608, 'steps': 80023, 'loss/train': 1.4506983757019043} 11/07/2021 08:29:42 - INFO - __main__ - Step 80025: {'lr': 0.00022867161859613412, 'samples': 15364800, 'steps': 80024, 'loss/train': 0.6167816519737244} 11/07/2021 08:29:43 - INFO - __main__ - Step 80026: {'lr': 0.00022866633120415924, 'samples': 15364992, 'steps': 80025, 'loss/train': 1.596759557723999} 11/07/2021 08:29:43 - INFO - __main__ - Step 80027: {'lr': 0.000228661043821797, 'samples': 15365184, 'steps': 80026, 'loss/train': 0.8697806596755981} 11/07/2021 08:29:44 - INFO - __main__ - Step 80028: {'lr': 0.0002286557564490499, 'samples': 15365376, 'steps': 80027, 'loss/train': 1.54068124294281} 11/07/2021 08:29:44 - INFO - __main__ - Step 80029: {'lr': 0.00022865046908592004, 'samples': 15365568, 'steps': 80028, 'loss/train': 1.0478217601776123} 11/07/2021 08:29:45 - INFO - __main__ - Step 80030: {'lr': 0.00022864518173240997, 'samples': 15365760, 'steps': 80029, 'loss/train': 1.6893256902694702} 11/07/2021 08:29:45 - INFO - __main__ - Step 80031: {'lr': 0.00022863989438852206, 'samples': 15365952, 'steps': 80030, 'loss/train': 1.4359244108200073} 11/07/2021 08:29:46 - INFO - __main__ - Step 80032: {'lr': 0.00022863460705425866, 'samples': 15366144, 'steps': 80031, 'loss/train': 1.347676396369934} 11/07/2021 08:29:47 - INFO - __main__ - Step 80033: {'lr': 0.0002286293197296222, 'samples': 15366336, 'steps': 80032, 'loss/train': 1.0326159000396729} 11/07/2021 08:29:47 - INFO - __main__ - Step 80034: {'lr': 0.00022862403241461502, 'samples': 15366528, 'steps': 80033, 'loss/train': 1.3683505058288574} 11/07/2021 08:29:47 - INFO - __main__ - Step 80035: {'lr': 0.0002286187451092395, 'samples': 15366720, 'steps': 80034, 'loss/train': 1.3600291013717651} 11/07/2021 08:29:48 - INFO - __main__ - Step 80036: {'lr': 0.0002286134578134981, 'samples': 15366912, 'steps': 80035, 'loss/train': 1.3249889612197876} 11/07/2021 08:29:48 - INFO - __main__ - Step 80037: {'lr': 0.00022860817052739311, 'samples': 15367104, 'steps': 80036, 'loss/train': 1.1485474109649658} 11/07/2021 08:29:48 - INFO - __main__ - Step 80038: {'lr': 0.00022860288325092696, 'samples': 15367296, 'steps': 80037, 'loss/train': 1.015221357345581} 11/07/2021 08:29:49 - INFO - __main__ - Step 80039: {'lr': 0.000228597595984102, 'samples': 15367488, 'steps': 80038, 'loss/train': 1.311610460281372} 11/07/2021 08:29:50 - INFO - __main__ - Step 80040: {'lr': 0.00022859230872692067, 'samples': 15367680, 'steps': 80039, 'loss/train': 1.8427269458770752} 11/07/2021 08:29:50 - INFO - __main__ - Step 80041: {'lr': 0.00022858702147938529, 'samples': 15367872, 'steps': 80040, 'loss/train': 0.9021161794662476} 11/07/2021 08:29:50 - INFO - __main__ - Step 80042: {'lr': 0.00022858173424149836, 'samples': 15368064, 'steps': 80041, 'loss/train': 1.6237515211105347} 11/07/2021 08:29:51 - INFO - __main__ - Step 80043: {'lr': 0.0002285764470132621, 'samples': 15368256, 'steps': 80042, 'loss/train': 1.3109725713729858} 11/07/2021 08:29:52 - INFO - __main__ - Step 80044: {'lr': 0.00022857115979467893, 'samples': 15368448, 'steps': 80043, 'loss/train': 1.0129390954971313} 11/07/2021 08:29:52 - INFO - __main__ - Step 80045: {'lr': 0.00022856587258575129, 'samples': 15368640, 'steps': 80044, 'loss/train': 1.547563076019287} 11/07/2021 08:29:52 - INFO - __main__ - Step 80046: {'lr': 0.00022856058538648152, 'samples': 15368832, 'steps': 80045, 'loss/train': 1.4495830535888672} 11/07/2021 08:29:53 - INFO - __main__ - Step 80047: {'lr': 0.00022855529819687203, 'samples': 15369024, 'steps': 80046, 'loss/train': 1.2709938287734985} 11/07/2021 08:29:53 - INFO - __main__ - Step 80048: {'lr': 0.0002285500110169252, 'samples': 15369216, 'steps': 80047, 'loss/train': 0.7860226631164551} 11/07/2021 08:29:54 - INFO - __main__ - Step 80049: {'lr': 0.00022854472384664336, 'samples': 15369408, 'steps': 80048, 'loss/train': 1.058828353881836} 11/07/2021 08:29:55 - INFO - __main__ - Step 80050: {'lr': 0.00022853943668602901, 'samples': 15369600, 'steps': 80049, 'loss/train': 1.1636710166931152} 11/07/2021 08:29:55 - INFO - __main__ - Step 80051: {'lr': 0.0002285341495350844, 'samples': 15369792, 'steps': 80050, 'loss/train': 1.1332917213439941} 11/07/2021 08:29:55 - INFO - __main__ - Step 80052: {'lr': 0.000228528862393812, 'samples': 15369984, 'steps': 80051, 'loss/train': 1.4825166463851929} 11/07/2021 08:29:56 - INFO - __main__ - Step 80053: {'lr': 0.00022852357526221412, 'samples': 15370176, 'steps': 80052, 'loss/train': 3.26055645942688} 11/07/2021 08:29:57 - INFO - __main__ - Step 80054: {'lr': 0.00022851828814029324, 'samples': 15370368, 'steps': 80053, 'loss/train': 1.0874274969100952} 11/07/2021 08:29:57 - INFO - __main__ - Step 80055: {'lr': 0.00022851300102805176, 'samples': 15370560, 'steps': 80054, 'loss/train': 1.3812429904937744} 11/07/2021 08:29:57 - INFO - __main__ - Step 80056: {'lr': 0.0002285077139254919, 'samples': 15370752, 'steps': 80055, 'loss/train': 1.191349983215332} 11/07/2021 08:29:58 - INFO - __main__ - Step 80057: {'lr': 0.00022850242683261613, 'samples': 15370944, 'steps': 80056, 'loss/train': 1.5440489053726196} 11/07/2021 08:29:58 - INFO - __main__ - Step 80058: {'lr': 0.00022849713974942682, 'samples': 15371136, 'steps': 80057, 'loss/train': 1.4353903532028198} 11/07/2021 08:29:59 - INFO - __main__ - Step 80059: {'lr': 0.00022849185267592636, 'samples': 15371328, 'steps': 80058, 'loss/train': 1.3853461742401123} 11/07/2021 08:29:59 - INFO - __main__ - Step 80060: {'lr': 0.00022848656561211717, 'samples': 15371520, 'steps': 80059, 'loss/train': 1.6882236003875732} 11/07/2021 08:30:00 - INFO - __main__ - Step 80061: {'lr': 0.0002284812785580016, 'samples': 15371712, 'steps': 80060, 'loss/train': 1.3652480840682983} 11/07/2021 08:30:00 - INFO - __main__ - Step 80062: {'lr': 0.00022847599151358202, 'samples': 15371904, 'steps': 80061, 'loss/train': 1.7136074304580688} 11/07/2021 08:30:01 - INFO - __main__ - Step 80063: {'lr': 0.00022847070447886084, 'samples': 15372096, 'steps': 80062, 'loss/train': 2.057471990585327} 11/07/2021 08:30:01 - INFO - __main__ - Step 80064: {'lr': 0.00022846541745384042, 'samples': 15372288, 'steps': 80063, 'loss/train': 1.7229481935501099} 11/07/2021 08:30:02 - INFO - __main__ - Step 80065: {'lr': 0.00022846013043852315, 'samples': 15372480, 'steps': 80064, 'loss/train': 1.0737769603729248} 11/07/2021 08:30:02 - INFO - __main__ - Step 80066: {'lr': 0.0002284548434329114, 'samples': 15372672, 'steps': 80065, 'loss/train': 1.097105622291565} 11/07/2021 08:30:03 - INFO - __main__ - Step 80067: {'lr': 0.00022844955643700762, 'samples': 15372864, 'steps': 80066, 'loss/train': 1.5038481950759888} 11/07/2021 08:30:03 - INFO - __main__ - Step 80068: {'lr': 0.0002284442694508141, 'samples': 15373056, 'steps': 80067, 'loss/train': 1.6257350444793701} 11/07/2021 08:30:03 - INFO - __main__ - Step 80069: {'lr': 0.0002284389824743333, 'samples': 15373248, 'steps': 80068, 'loss/train': 1.5572208166122437} 11/07/2021 08:30:04 - INFO - __main__ - Step 80070: {'lr': 0.00022843369550756755, 'samples': 15373440, 'steps': 80069, 'loss/train': 1.089964747428894} 11/07/2021 08:30:05 - INFO - __main__ - Step 80071: {'lr': 0.00022842840855051918, 'samples': 15373632, 'steps': 80070, 'loss/train': 1.3257135152816772} 11/07/2021 08:30:05 - INFO - __main__ - Step 80072: {'lr': 0.00022842312160319068, 'samples': 15373824, 'steps': 80071, 'loss/train': 1.3352080583572388} 11/07/2021 08:30:06 - INFO - __main__ - Step 80073: {'lr': 0.00022841783466558436, 'samples': 15374016, 'steps': 80072, 'loss/train': 1.0291552543640137} 11/07/2021 08:30:06 - INFO - __main__ - Step 80074: {'lr': 0.00022841254773770265, 'samples': 15374208, 'steps': 80073, 'loss/train': 1.2918634414672852} 11/07/2021 08:30:07 - INFO - __main__ - Step 80075: {'lr': 0.0002284072608195479, 'samples': 15374400, 'steps': 80074, 'loss/train': 1.5564907789230347} 11/07/2021 08:30:07 - INFO - __main__ - Step 80076: {'lr': 0.00022840197391112252, 'samples': 15374592, 'steps': 80075, 'loss/train': 1.6004633903503418} 11/07/2021 08:30:08 - INFO - __main__ - Step 80077: {'lr': 0.0002283966870124289, 'samples': 15374784, 'steps': 80076, 'loss/train': 1.262022614479065} 11/07/2021 08:30:08 - INFO - __main__ - Step 80078: {'lr': 0.0002283914001234694, 'samples': 15374976, 'steps': 80077, 'loss/train': 0.3791220188140869} 11/07/2021 08:30:08 - INFO - __main__ - Step 80079: {'lr': 0.00022838611324424636, 'samples': 15375168, 'steps': 80078, 'loss/train': 1.6516852378845215} 11/07/2021 08:30:10 - INFO - __main__ - Step 80080: {'lr': 0.00022838082637476222, 'samples': 15375360, 'steps': 80079, 'loss/train': 1.292216420173645} 11/07/2021 08:30:10 - INFO - __main__ - Step 80081: {'lr': 0.00022837553951501934, 'samples': 15375552, 'steps': 80080, 'loss/train': 1.390507459640503} 11/07/2021 08:30:10 - INFO - __main__ - Step 80082: {'lr': 0.00022837025266502016, 'samples': 15375744, 'steps': 80081, 'loss/train': 1.5410178899765015} 11/07/2021 08:30:11 - INFO - __main__ - Step 80083: {'lr': 0.00022836496582476695, 'samples': 15375936, 'steps': 80082, 'loss/train': 1.7032277584075928} 11/07/2021 08:30:11 - INFO - __main__ - Step 80084: {'lr': 0.00022835967899426218, 'samples': 15376128, 'steps': 80083, 'loss/train': 1.5728617906570435} 11/07/2021 08:30:11 - INFO - __main__ - Step 80085: {'lr': 0.00022835439217350816, 'samples': 15376320, 'steps': 80084, 'loss/train': 1.4836058616638184} 11/07/2021 08:30:12 - INFO - __main__ - Step 80086: {'lr': 0.00022834910536250735, 'samples': 15376512, 'steps': 80085, 'loss/train': 0.14370061457157135} 11/07/2021 08:30:13 - INFO - __main__ - Step 80087: {'lr': 0.0002283438185612621, 'samples': 15376704, 'steps': 80086, 'loss/train': 1.4311305284500122} 11/07/2021 08:30:13 - INFO - __main__ - Step 80088: {'lr': 0.00022833853176977477, 'samples': 15376896, 'steps': 80087, 'loss/train': 1.1290397644042969} 11/07/2021 08:30:13 - INFO - __main__ - Step 80089: {'lr': 0.00022833324498804786, 'samples': 15377088, 'steps': 80088, 'loss/train': 0.5271612405776978} 11/07/2021 08:30:14 - INFO - __main__ - Step 80090: {'lr': 0.00022832795821608356, 'samples': 15377280, 'steps': 80089, 'loss/train': 0.9586971998214722} 11/07/2021 08:30:15 - INFO - __main__ - Step 80091: {'lr': 0.00022832267145388437, 'samples': 15377472, 'steps': 80090, 'loss/train': 1.3321597576141357} 11/07/2021 08:30:15 - INFO - __main__ - Step 80092: {'lr': 0.00022831738470145262, 'samples': 15377664, 'steps': 80091, 'loss/train': 0.5946000218391418} 11/07/2021 08:30:15 - INFO - __main__ - Step 80093: {'lr': 0.00022831209795879076, 'samples': 15377856, 'steps': 80092, 'loss/train': 1.03903329372406} 11/07/2021 08:30:16 - INFO - __main__ - Step 80094: {'lr': 0.0002283068112259011, 'samples': 15378048, 'steps': 80093, 'loss/train': 1.4773588180541992} 11/07/2021 08:30:16 - INFO - __main__ - Step 80095: {'lr': 0.00022830152450278613, 'samples': 15378240, 'steps': 80094, 'loss/train': 1.0425366163253784} 11/07/2021 08:30:17 - INFO - __main__ - Step 80096: {'lr': 0.0002282962377894481, 'samples': 15378432, 'steps': 80095, 'loss/train': 1.3801902532577515} 11/07/2021 08:30:18 - INFO - __main__ - Step 80097: {'lr': 0.00022829095108588946, 'samples': 15378624, 'steps': 80096, 'loss/train': 1.2717844247817993} 11/07/2021 08:30:18 - INFO - __main__ - Step 80098: {'lr': 0.00022828566439211256, 'samples': 15378816, 'steps': 80097, 'loss/train': 1.432274341583252} 11/07/2021 08:30:18 - INFO - __main__ - Step 80099: {'lr': 0.00022828037770811983, 'samples': 15379008, 'steps': 80098, 'loss/train': 1.1968940496444702} 11/07/2021 08:30:19 - INFO - __main__ - Step 80100: {'lr': 0.00022827509103391368, 'samples': 15379200, 'steps': 80099, 'loss/train': 1.4244201183319092} 11/07/2021 08:30:20 - INFO - __main__ - Step 80101: {'lr': 0.00022826980436949635, 'samples': 15379392, 'steps': 80100, 'loss/train': 1.3610038757324219} 11/07/2021 08:30:20 - INFO - __main__ - Step 80102: {'lr': 0.00022826451771487035, 'samples': 15379584, 'steps': 80101, 'loss/train': 0.9785888195037842} 11/07/2021 08:30:20 - INFO - __main__ - Step 80103: {'lr': 0.000228259231070038, 'samples': 15379776, 'steps': 80102, 'loss/train': 0.06795503944158554} 11/07/2021 08:30:21 - INFO - __main__ - Step 80104: {'lr': 0.0002282539444350017, 'samples': 15379968, 'steps': 80103, 'loss/train': 1.7834789752960205} 11/07/2021 08:30:21 - INFO - __main__ - Step 80105: {'lr': 0.00022824865780976387, 'samples': 15380160, 'steps': 80104, 'loss/train': 1.495713710784912} 11/07/2021 08:30:22 - INFO - __main__ - Step 80106: {'lr': 0.0002282433711943268, 'samples': 15380352, 'steps': 80105, 'loss/train': 1.702386736869812} 11/07/2021 08:30:22 - INFO - __main__ - Step 80107: {'lr': 0.000228238084588693, 'samples': 15380544, 'steps': 80106, 'loss/train': 1.2804756164550781} 11/07/2021 08:30:23 - INFO - __main__ - Step 80108: {'lr': 0.00022823279799286472, 'samples': 15380736, 'steps': 80107, 'loss/train': 1.2881290912628174} 11/07/2021 08:30:23 - INFO - __main__ - Step 80109: {'lr': 0.0002282275114068445, 'samples': 15380928, 'steps': 80108, 'loss/train': 1.4930719137191772} 11/07/2021 08:30:23 - INFO - __main__ - Step 80110: {'lr': 0.00022822222483063456, 'samples': 15381120, 'steps': 80109, 'loss/train': 1.1664756536483765} 11/07/2021 08:30:24 - INFO - __main__ - Step 80111: {'lr': 0.00022821693826423743, 'samples': 15381312, 'steps': 80110, 'loss/train': 1.6123549938201904} 11/07/2021 08:30:25 - INFO - __main__ - Step 80112: {'lr': 0.00022821165170765534, 'samples': 15381504, 'steps': 80111, 'loss/train': 1.499474048614502} 11/07/2021 08:30:25 - INFO - __main__ - Step 80113: {'lr': 0.00022820636516089073, 'samples': 15381696, 'steps': 80112, 'loss/train': 1.7402663230895996} 11/07/2021 08:30:26 - INFO - __main__ - Step 80114: {'lr': 0.000228201078623946, 'samples': 15381888, 'steps': 80113, 'loss/train': 1.3581849336624146} 11/07/2021 08:30:26 - INFO - __main__ - Step 80115: {'lr': 0.00022819579209682354, 'samples': 15382080, 'steps': 80114, 'loss/train': 1.364000678062439} 11/07/2021 08:30:26 - INFO - __main__ - Step 80116: {'lr': 0.0002281905055795257, 'samples': 15382272, 'steps': 80115, 'loss/train': 1.2023382186889648} 11/07/2021 08:30:27 - INFO - __main__ - Step 80117: {'lr': 0.00022818521907205493, 'samples': 15382464, 'steps': 80116, 'loss/train': 1.3071715831756592} 11/07/2021 08:30:28 - INFO - __main__ - Step 80118: {'lr': 0.00022817993257441348, 'samples': 15382656, 'steps': 80117, 'loss/train': 0.9517554044723511} 11/07/2021 08:30:28 - INFO - __main__ - Step 80119: {'lr': 0.00022817464608660388, 'samples': 15382848, 'steps': 80118, 'loss/train': 1.4177541732788086} 11/07/2021 08:30:28 - INFO - __main__ - Step 80120: {'lr': 0.00022816935960862846, 'samples': 15383040, 'steps': 80119, 'loss/train': 1.2642087936401367} 11/07/2021 08:30:29 - INFO - __main__ - Step 80121: {'lr': 0.00022816407314048953, 'samples': 15383232, 'steps': 80120, 'loss/train': 1.7936975955963135} 11/07/2021 08:30:30 - INFO - __main__ - Step 80122: {'lr': 0.00022815878668218967, 'samples': 15383424, 'steps': 80121, 'loss/train': 0.9575067758560181} 11/07/2021 08:30:30 - INFO - __main__ - Step 80123: {'lr': 0.00022815350023373102, 'samples': 15383616, 'steps': 80122, 'loss/train': 1.3452482223510742} 11/07/2021 08:30:31 - INFO - __main__ - Step 80124: {'lr': 0.0002281482137951161, 'samples': 15383808, 'steps': 80123, 'loss/train': 0.06426374614238739} 11/07/2021 08:30:31 - INFO - __main__ - Step 80125: {'lr': 0.00022814292736634718, 'samples': 15384000, 'steps': 80124, 'loss/train': 1.4666191339492798} 11/07/2021 08:30:31 - INFO - __main__ - Step 80126: {'lr': 0.00022813764094742675, 'samples': 15384192, 'steps': 80125, 'loss/train': 1.3326711654663086} 11/07/2021 08:30:33 - INFO - __main__ - Step 80127: {'lr': 0.00022813235453835717, 'samples': 15384384, 'steps': 80126, 'loss/train': 1.4909451007843018} 11/07/2021 08:30:33 - INFO - __main__ - Step 80128: {'lr': 0.00022812706813914082, 'samples': 15384576, 'steps': 80127, 'loss/train': 1.2108441591262817} 11/07/2021 08:30:33 - INFO - __main__ - Step 80129: {'lr': 0.00022812178174978008, 'samples': 15384768, 'steps': 80128, 'loss/train': 1.3809728622436523} 11/07/2021 08:30:34 - INFO - __main__ - Step 80130: {'lr': 0.00022811649537027732, 'samples': 15384960, 'steps': 80129, 'loss/train': 1.3688794374465942} 11/07/2021 08:30:34 - INFO - __main__ - Step 80131: {'lr': 0.0002281112090006349, 'samples': 15385152, 'steps': 80130, 'loss/train': 1.1959515810012817} 11/07/2021 08:30:35 - INFO - __main__ - Step 80132: {'lr': 0.00022810592264085528, 'samples': 15385344, 'steps': 80131, 'loss/train': 1.3045631647109985} 11/07/2021 08:30:35 - INFO - __main__ - Step 80133: {'lr': 0.00022810063629094077, 'samples': 15385536, 'steps': 80132, 'loss/train': 1.2589772939682007} 11/07/2021 08:30:36 - INFO - __main__ - Step 80134: {'lr': 0.00022809534995089377, 'samples': 15385728, 'steps': 80133, 'loss/train': 0.4865916669368744} 11/07/2021 08:30:36 - INFO - __main__ - Step 80135: {'lr': 0.00022809006362071668, 'samples': 15385920, 'steps': 80134, 'loss/train': 1.8398596048355103} 11/07/2021 08:30:36 - INFO - __main__ - Step 80136: {'lr': 0.00022808477730041198, 'samples': 15386112, 'steps': 80135, 'loss/train': 1.4897964000701904} 11/07/2021 08:30:38 - INFO - __main__ - Step 80137: {'lr': 0.0002280794909899818, 'samples': 15386304, 'steps': 80136, 'loss/train': 1.34486722946167} 11/07/2021 08:30:38 - INFO - __main__ - Step 80138: {'lr': 0.00022807420468942872, 'samples': 15386496, 'steps': 80137, 'loss/train': 1.368167519569397} 11/07/2021 08:30:38 - INFO - __main__ - Step 80139: {'lr': 0.00022806891839875502, 'samples': 15386688, 'steps': 80138, 'loss/train': 1.6024298667907715} 11/07/2021 08:30:39 - INFO - __main__ - Step 80140: {'lr': 0.00022806363211796314, 'samples': 15386880, 'steps': 80139, 'loss/train': 1.1618658304214478} 11/07/2021 08:30:39 - INFO - __main__ - Step 80141: {'lr': 0.00022805834584705545, 'samples': 15387072, 'steps': 80140, 'loss/train': 1.1570954322814941} 11/07/2021 08:30:40 - INFO - __main__ - Step 80142: {'lr': 0.00022805305958603433, 'samples': 15387264, 'steps': 80141, 'loss/train': 1.7222278118133545} 11/07/2021 08:30:40 - INFO - __main__ - Step 80143: {'lr': 0.00022804777333490216, 'samples': 15387456, 'steps': 80142, 'loss/train': 1.3088655471801758} 11/07/2021 08:30:41 - INFO - __main__ - Step 80144: {'lr': 0.00022804248709366133, 'samples': 15387648, 'steps': 80143, 'loss/train': 1.7016111612319946} 11/07/2021 08:30:41 - INFO - __main__ - Step 80145: {'lr': 0.00022803720086231422, 'samples': 15387840, 'steps': 80144, 'loss/train': 1.5018794536590576} 11/07/2021 08:30:41 - INFO - __main__ - Step 80146: {'lr': 0.0002280319146408632, 'samples': 15388032, 'steps': 80145, 'loss/train': 0.9659920334815979} 11/07/2021 08:30:42 - INFO - __main__ - Step 80147: {'lr': 0.00022802662842931067, 'samples': 15388224, 'steps': 80146, 'loss/train': 1.3442792892456055} 11/07/2021 08:30:43 - INFO - __main__ - Step 80148: {'lr': 0.00022802134222765896, 'samples': 15388416, 'steps': 80147, 'loss/train': 1.5405443906784058} 11/07/2021 08:30:44 - INFO - __main__ - Step 80149: {'lr': 0.00022801605603591066, 'samples': 15388608, 'steps': 80148, 'loss/train': 1.3050669431686401} 11/07/2021 08:30:44 - INFO - __main__ - Step 80150: {'lr': 0.00022801076985406784, 'samples': 15388800, 'steps': 80149, 'loss/train': 0.9965713620185852} 11/07/2021 08:30:44 - INFO - __main__ - Step 80151: {'lr': 0.00022800548368213307, 'samples': 15388992, 'steps': 80150, 'loss/train': 1.1945414543151855} 11/07/2021 08:30:45 - INFO - __main__ - Step 80152: {'lr': 0.00022800019752010865, 'samples': 15389184, 'steps': 80151, 'loss/train': 1.5382393598556519} 11/07/2021 08:30:46 - INFO - __main__ - Step 80153: {'lr': 0.000227994911367997, 'samples': 15389376, 'steps': 80152, 'loss/train': 1.2043726444244385} 11/07/2021 08:30:46 - INFO - __main__ - Step 80154: {'lr': 0.00022798962522580052, 'samples': 15389568, 'steps': 80153, 'loss/train': 1.44231379032135} 11/07/2021 08:30:46 - INFO - __main__ - Step 80155: {'lr': 0.00022798433909352158, 'samples': 15389760, 'steps': 80154, 'loss/train': 1.4728736877441406} 11/07/2021 08:30:47 - INFO - __main__ - Step 80156: {'lr': 0.00022797905297116254, 'samples': 15389952, 'steps': 80155, 'loss/train': 1.853633999824524} 11/07/2021 08:30:47 - INFO - __main__ - Step 80157: {'lr': 0.00022797376685872582, 'samples': 15390144, 'steps': 80156, 'loss/train': 1.1551145315170288} 11/07/2021 08:30:48 - INFO - __main__ - Step 80158: {'lr': 0.00022796848075621375, 'samples': 15390336, 'steps': 80157, 'loss/train': 1.7742446660995483} 11/07/2021 08:30:49 - INFO - __main__ - Step 80159: {'lr': 0.00022796319466362875, 'samples': 15390528, 'steps': 80158, 'loss/train': 1.0657671689987183} 11/07/2021 08:30:49 - INFO - __main__ - Step 80160: {'lr': 0.0002279579085809732, 'samples': 15390720, 'steps': 80159, 'loss/train': 1.5573605298995972} 11/07/2021 08:30:49 - INFO - __main__ - Step 80161: {'lr': 0.0002279526225082495, 'samples': 15390912, 'steps': 80160, 'loss/train': 1.0962175130844116} 11/07/2021 08:30:50 - INFO - __main__ - Step 80162: {'lr': 0.00022794733644545997, 'samples': 15391104, 'steps': 80161, 'loss/train': 1.8878331184387207} 11/07/2021 08:30:50 - INFO - __main__ - Step 80163: {'lr': 0.00022794205039260718, 'samples': 15391296, 'steps': 80162, 'loss/train': 1.3166725635528564} 11/07/2021 08:30:51 - INFO - __main__ - Step 80164: {'lr': 0.00022793676434969325, 'samples': 15391488, 'steps': 80163, 'loss/train': 1.0350663661956787} 11/07/2021 08:30:51 - INFO - __main__ - Step 80165: {'lr': 0.00022793147831672063, 'samples': 15391680, 'steps': 80164, 'loss/train': 1.2554962635040283} 11/07/2021 08:30:52 - INFO - __main__ - Step 80166: {'lr': 0.00022792619229369178, 'samples': 15391872, 'steps': 80165, 'loss/train': 1.1626676321029663} 11/07/2021 08:30:52 - INFO - __main__ - Step 80167: {'lr': 0.00022792090628060902, 'samples': 15392064, 'steps': 80166, 'loss/train': 1.0391981601715088} 11/07/2021 08:30:52 - INFO - __main__ - Step 80168: {'lr': 0.00022791562027747478, 'samples': 15392256, 'steps': 80167, 'loss/train': 1.172463059425354} 11/07/2021 08:30:54 - INFO - __main__ - Step 80169: {'lr': 0.00022791033428429141, 'samples': 15392448, 'steps': 80168, 'loss/train': 1.5019227266311646} 11/07/2021 08:30:54 - INFO - __main__ - Step 80170: {'lr': 0.00022790504830106132, 'samples': 15392640, 'steps': 80169, 'loss/train': 1.5685135126113892} 11/07/2021 08:30:54 - INFO - __main__ - Step 80171: {'lr': 0.00022789976232778686, 'samples': 15392832, 'steps': 80170, 'loss/train': 1.2002328634262085} 11/07/2021 08:30:55 - INFO - __main__ - Step 80172: {'lr': 0.00022789447636447044, 'samples': 15393024, 'steps': 80171, 'loss/train': 0.8864812850952148} 11/07/2021 08:30:55 - INFO - __main__ - Step 80173: {'lr': 0.00022788919041111442, 'samples': 15393216, 'steps': 80172, 'loss/train': 0.8004513382911682} 11/07/2021 08:30:56 - INFO - __main__ - Step 80174: {'lr': 0.00022788390446772116, 'samples': 15393408, 'steps': 80173, 'loss/train': 1.6562883853912354} 11/07/2021 08:30:56 - INFO - __main__ - Step 80175: {'lr': 0.0002278786185342931, 'samples': 15393600, 'steps': 80174, 'loss/train': 5.715243339538574} 11/07/2021 08:30:57 - INFO - __main__ - Step 80176: {'lr': 0.0002278733326108327, 'samples': 15393792, 'steps': 80175, 'loss/train': 1.242234230041504} 11/07/2021 08:30:57 - INFO - __main__ - Step 80177: {'lr': 0.00022786804669734214, 'samples': 15393984, 'steps': 80176, 'loss/train': 1.3738588094711304} 11/07/2021 08:30:57 - INFO - __main__ - Step 80178: {'lr': 0.0002278627607938239, 'samples': 15394176, 'steps': 80177, 'loss/train': 1.4264473915100098} 11/07/2021 08:30:58 - INFO - __main__ - Step 80179: {'lr': 0.00022785747490028033, 'samples': 15394368, 'steps': 80178, 'loss/train': 1.3250051736831665} 11/07/2021 08:30:59 - INFO - __main__ - Step 80180: {'lr': 0.00022785218901671383, 'samples': 15394560, 'steps': 80179, 'loss/train': 1.1326239109039307} 11/07/2021 08:30:59 - INFO - __main__ - Step 80181: {'lr': 0.00022784690314312683, 'samples': 15394752, 'steps': 80180, 'loss/train': 0.914219856262207} 11/07/2021 08:30:59 - INFO - __main__ - Step 80182: {'lr': 0.00022784161727952166, 'samples': 15394944, 'steps': 80181, 'loss/train': 0.7372615933418274} 11/07/2021 08:31:00 - INFO - __main__ - Step 80183: {'lr': 0.0002278363314259007, 'samples': 15395136, 'steps': 80182, 'loss/train': 1.4505114555358887} 11/07/2021 08:31:01 - INFO - __main__ - Step 80184: {'lr': 0.00022783104558226638, 'samples': 15395328, 'steps': 80183, 'loss/train': 1.1339608430862427} 11/07/2021 08:31:01 - INFO - __main__ - Step 80185: {'lr': 0.00022782575974862103, 'samples': 15395520, 'steps': 80184, 'loss/train': 1.284942865371704} 11/07/2021 08:31:02 - INFO - __main__ - Step 80186: {'lr': 0.00022782047392496706, 'samples': 15395712, 'steps': 80185, 'loss/train': 1.050528883934021} 11/07/2021 08:31:02 - INFO - __main__ - Step 80187: {'lr': 0.0002278151881113068, 'samples': 15395904, 'steps': 80186, 'loss/train': 1.6552443504333496} 11/07/2021 08:31:02 - INFO - __main__ - Step 80188: {'lr': 0.00022780990230764273, 'samples': 15396096, 'steps': 80187, 'loss/train': 1.5865546464920044} 11/07/2021 08:31:03 - INFO - __main__ - Step 80189: {'lr': 0.00022780461651397712, 'samples': 15396288, 'steps': 80188, 'loss/train': 1.3515771627426147} 11/07/2021 08:31:03 - INFO - __main__ - Step 80190: {'lr': 0.00022779933073031257, 'samples': 15396480, 'steps': 80189, 'loss/train': 1.2773871421813965} 11/07/2021 08:31:04 - INFO - __main__ - Step 80191: {'lr': 0.00022779404495665116, 'samples': 15396672, 'steps': 80190, 'loss/train': 1.1456263065338135} 11/07/2021 08:31:04 - INFO - __main__ - Step 80192: {'lr': 0.0002277887591929954, 'samples': 15396864, 'steps': 80191, 'loss/train': 1.553185224533081} 11/07/2021 08:31:05 - INFO - __main__ - Step 80193: {'lr': 0.00022778347343934771, 'samples': 15397056, 'steps': 80192, 'loss/train': 1.4126638174057007} 11/07/2021 08:31:05 - INFO - __main__ - Step 80194: {'lr': 0.00022777818769571047, 'samples': 15397248, 'steps': 80193, 'loss/train': 1.8108800649642944} 11/07/2021 08:31:06 - INFO - __main__ - Step 80195: {'lr': 0.00022777290196208597, 'samples': 15397440, 'steps': 80194, 'loss/train': 1.1390376091003418} 11/07/2021 08:31:06 - INFO - __main__ - Step 80196: {'lr': 0.0002277676162384767, 'samples': 15397632, 'steps': 80195, 'loss/train': 1.2128419876098633} 11/07/2021 08:31:07 - INFO - __main__ - Step 80197: {'lr': 0.00022776233052488497, 'samples': 15397824, 'steps': 80196, 'loss/train': 1.376460075378418} 11/07/2021 08:31:07 - INFO - __main__ - Step 80198: {'lr': 0.0002277570448213132, 'samples': 15398016, 'steps': 80197, 'loss/train': 1.5175747871398926} 11/07/2021 08:31:07 - INFO - __main__ - Step 80199: {'lr': 0.00022775175912776376, 'samples': 15398208, 'steps': 80198, 'loss/train': 1.5369930267333984} 11/07/2021 08:31:08 - INFO - __main__ - Step 80200: {'lr': 0.00022774647344423908, 'samples': 15398400, 'steps': 80199, 'loss/train': 1.4539483785629272} 11/07/2021 08:31:09 - INFO - __main__ - Step 80201: {'lr': 0.00022774118777074142, 'samples': 15398592, 'steps': 80200, 'loss/train': 1.4297382831573486} 11/07/2021 08:31:09 - INFO - __main__ - Step 80202: {'lr': 0.00022773590210727336, 'samples': 15398784, 'steps': 80201, 'loss/train': 0.9214758276939392} 11/07/2021 08:31:09 - INFO - __main__ - Step 80203: {'lr': 0.00022773061645383713, 'samples': 15398976, 'steps': 80202, 'loss/train': 1.4258410930633545} 11/07/2021 08:31:10 - INFO - __main__ - Step 80204: {'lr': 0.00022772533081043508, 'samples': 15399168, 'steps': 80203, 'loss/train': 0.9514046311378479} 11/07/2021 08:31:11 - INFO - __main__ - Step 80205: {'lr': 0.00022772004517706965, 'samples': 15399360, 'steps': 80204, 'loss/train': 1.0748714208602905} 11/07/2021 08:31:11 - INFO - __main__ - Step 80206: {'lr': 0.00022771475955374323, 'samples': 15399552, 'steps': 80205, 'loss/train': 2.1067419052124023} 11/07/2021 08:31:12 - INFO - __main__ - Step 80207: {'lr': 0.0002277094739404582, 'samples': 15399744, 'steps': 80206, 'loss/train': 1.2038735151290894} 11/07/2021 08:31:12 - INFO - __main__ - Step 80208: {'lr': 0.00022770418833721696, 'samples': 15399936, 'steps': 80207, 'loss/train': 1.1924725770950317} 11/07/2021 08:31:12 - INFO - __main__ - Step 80209: {'lr': 0.00022769890274402182, 'samples': 15400128, 'steps': 80208, 'loss/train': 0.8587379455566406} 11/07/2021 08:31:13 - INFO - __main__ - Step 80210: {'lr': 0.00022769361716087525, 'samples': 15400320, 'steps': 80209, 'loss/train': 1.2890608310699463} 11/07/2021 08:31:14 - INFO - __main__ - Step 80211: {'lr': 0.00022768833158777957, 'samples': 15400512, 'steps': 80210, 'loss/train': 1.4593414068222046} 11/07/2021 08:31:14 - INFO - __main__ - Step 80212: {'lr': 0.00022768304602473725, 'samples': 15400704, 'steps': 80211, 'loss/train': 1.2544668912887573} 11/07/2021 08:31:14 - INFO - __main__ - Step 80213: {'lr': 0.00022767776047175054, 'samples': 15400896, 'steps': 80212, 'loss/train': 1.2293373346328735} 11/07/2021 08:31:15 - INFO - __main__ - Step 80214: {'lr': 0.00022767247492882188, 'samples': 15401088, 'steps': 80213, 'loss/train': 1.513638973236084} 11/07/2021 08:31:16 - INFO - __main__ - Step 80215: {'lr': 0.00022766718939595367, 'samples': 15401280, 'steps': 80214, 'loss/train': 1.4241890907287598} 11/07/2021 08:31:16 - INFO - __main__ - Step 80216: {'lr': 0.00022766190387314833, 'samples': 15401472, 'steps': 80215, 'loss/train': 1.7972595691680908} 11/07/2021 08:31:16 - INFO - __main__ - Step 80217: {'lr': 0.00022765661836040817, 'samples': 15401664, 'steps': 80216, 'loss/train': 1.425374150276184} 11/07/2021 08:31:17 - INFO - __main__ - Step 80218: {'lr': 0.00022765133285773553, 'samples': 15401856, 'steps': 80217, 'loss/train': 1.072849154472351} 11/07/2021 08:31:17 - INFO - __main__ - Step 80219: {'lr': 0.0002276460473651329, 'samples': 15402048, 'steps': 80218, 'loss/train': 1.6545765399932861} 11/07/2021 08:31:18 - INFO - __main__ - Step 80220: {'lr': 0.00022764076188260262, 'samples': 15402240, 'steps': 80219, 'loss/train': 1.614337682723999} 11/07/2021 08:31:18 - INFO - __main__ - Step 80221: {'lr': 0.00022763547641014705, 'samples': 15402432, 'steps': 80220, 'loss/train': 1.3374284505844116} 11/07/2021 08:31:19 - INFO - __main__ - Step 80222: {'lr': 0.00022763019094776857, 'samples': 15402624, 'steps': 80221, 'loss/train': 1.3118219375610352} 11/07/2021 08:31:19 - INFO - __main__ - Step 80223: {'lr': 0.00022762490549546965, 'samples': 15402816, 'steps': 80222, 'loss/train': 1.2022185325622559} 11/07/2021 08:31:19 - INFO - __main__ - Step 80224: {'lr': 0.00022761962005325256, 'samples': 15403008, 'steps': 80223, 'loss/train': 0.9862080216407776} 11/07/2021 08:31:20 - INFO - __main__ - Step 80225: {'lr': 0.00022761433462111972, 'samples': 15403200, 'steps': 80224, 'loss/train': 1.513165831565857} 11/07/2021 08:31:21 - INFO - __main__ - Step 80226: {'lr': 0.0002276090491990735, 'samples': 15403392, 'steps': 80225, 'loss/train': 1.451872706413269} 11/07/2021 08:31:21 - INFO - __main__ - Step 80227: {'lr': 0.00022760376378711634, 'samples': 15403584, 'steps': 80226, 'loss/train': 0.8358976244926453} 11/07/2021 08:31:21 - INFO - __main__ - Step 80228: {'lr': 0.00022759847838525052, 'samples': 15403776, 'steps': 80227, 'loss/train': 1.4918917417526245} 11/07/2021 08:31:22 - INFO - __main__ - Step 80229: {'lr': 0.0002275931929934785, 'samples': 15403968, 'steps': 80228, 'loss/train': 1.197944164276123} 11/07/2021 08:31:23 - INFO - __main__ - Step 80230: {'lr': 0.00022758790761180273, 'samples': 15404160, 'steps': 80229, 'loss/train': 1.3790589570999146} 11/07/2021 08:31:23 - INFO - __main__ - Step 80231: {'lr': 0.0002275826222402254, 'samples': 15404352, 'steps': 80230, 'loss/train': 1.535316824913025} 11/07/2021 08:31:24 - INFO - __main__ - Step 80232: {'lr': 0.00022757733687874904, 'samples': 15404544, 'steps': 80231, 'loss/train': 1.5550071001052856} 11/07/2021 08:31:24 - INFO - __main__ - Step 80233: {'lr': 0.00022757205152737595, 'samples': 15404736, 'steps': 80232, 'loss/train': 1.530596137046814} 11/07/2021 08:31:24 - INFO - __main__ - Step 80234: {'lr': 0.0002275667661861086, 'samples': 15404928, 'steps': 80233, 'loss/train': 1.333622694015503} 11/07/2021 08:31:25 - INFO - __main__ - Step 80235: {'lr': 0.0002275614808549493, 'samples': 15405120, 'steps': 80234, 'loss/train': 0.9863141775131226} 11/07/2021 08:31:26 - INFO - __main__ - Step 80236: {'lr': 0.00022755619553390045, 'samples': 15405312, 'steps': 80235, 'loss/train': 1.519700527191162} 11/07/2021 08:31:26 - INFO - __main__ - Step 80237: {'lr': 0.00022755091022296439, 'samples': 15405504, 'steps': 80236, 'loss/train': 1.0303802490234375} 11/07/2021 08:31:26 - INFO - __main__ - Step 80238: {'lr': 0.00022754562492214355, 'samples': 15405696, 'steps': 80237, 'loss/train': 1.17923903465271} 11/07/2021 08:31:27 - INFO - __main__ - Step 80239: {'lr': 0.00022754033963144033, 'samples': 15405888, 'steps': 80238, 'loss/train': 1.5564266443252563} 11/07/2021 08:31:28 - INFO - __main__ - Step 80240: {'lr': 0.00022753505435085708, 'samples': 15406080, 'steps': 80239, 'loss/train': 1.1764854192733765} 11/07/2021 08:31:28 - INFO - __main__ - Step 80241: {'lr': 0.00022752976908039618, 'samples': 15406272, 'steps': 80240, 'loss/train': 1.3549847602844238} 11/07/2021 08:31:28 - INFO - __main__ - Step 80242: {'lr': 0.00022752448382006002, 'samples': 15406464, 'steps': 80241, 'loss/train': 1.4361562728881836} 11/07/2021 08:31:29 - INFO - __main__ - Step 80243: {'lr': 0.00022751919856985107, 'samples': 15406656, 'steps': 80242, 'loss/train': 1.136683464050293} 11/07/2021 08:31:29 - INFO - __main__ - Step 80244: {'lr': 0.00022751391332977153, 'samples': 15406848, 'steps': 80243, 'loss/train': 1.5307854413986206} 11/07/2021 08:31:30 - INFO - __main__ - Step 80245: {'lr': 0.00022750862809982393, 'samples': 15407040, 'steps': 80244, 'loss/train': 1.5629611015319824} 11/07/2021 08:31:31 - INFO - __main__ - Step 80246: {'lr': 0.00022750334288001054, 'samples': 15407232, 'steps': 80245, 'loss/train': 1.5208383798599243} 11/07/2021 08:31:31 - INFO - __main__ - Step 80247: {'lr': 0.0002274980576703338, 'samples': 15407424, 'steps': 80246, 'loss/train': 1.2852647304534912} 11/07/2021 08:31:31 - INFO - __main__ - Step 80248: {'lr': 0.00022749277247079608, 'samples': 15407616, 'steps': 80247, 'loss/train': 1.458848476409912} 11/07/2021 08:31:32 - INFO - __main__ - Step 80249: {'lr': 0.00022748748728139979, 'samples': 15407808, 'steps': 80248, 'loss/train': 1.5074228048324585} 11/07/2021 08:31:33 - INFO - __main__ - Step 80250: {'lr': 0.0002274822021021473, 'samples': 15408000, 'steps': 80249, 'loss/train': 1.3460896015167236} 11/07/2021 08:31:33 - INFO - __main__ - Step 80251: {'lr': 0.00022747691693304094, 'samples': 15408192, 'steps': 80250, 'loss/train': 1.4797114133834839} 11/07/2021 08:31:33 - INFO - __main__ - Step 80252: {'lr': 0.00022747163177408317, 'samples': 15408384, 'steps': 80251, 'loss/train': 1.235272765159607} 11/07/2021 08:31:34 - INFO - __main__ - Step 80253: {'lr': 0.0002274663466252763, 'samples': 15408576, 'steps': 80252, 'loss/train': 1.2706284523010254} 11/07/2021 08:31:34 - INFO - __main__ - Step 80254: {'lr': 0.0002274610614866228, 'samples': 15408768, 'steps': 80253, 'loss/train': 1.6577320098876953} 11/07/2021 08:31:35 - INFO - __main__ - Step 80255: {'lr': 0.00022745577635812495, 'samples': 15408960, 'steps': 80254, 'loss/train': 1.5747712850570679} 11/07/2021 08:31:36 - INFO - __main__ - Step 80256: {'lr': 0.0002274504912397852, 'samples': 15409152, 'steps': 80255, 'loss/train': 1.2674041986465454} 11/07/2021 08:31:36 - INFO - __main__ - Step 80257: {'lr': 0.000227445206131606, 'samples': 15409344, 'steps': 80256, 'loss/train': 1.4153741598129272} 11/07/2021 08:31:36 - INFO - __main__ - Step 80258: {'lr': 0.00022743992103358958, 'samples': 15409536, 'steps': 80257, 'loss/train': 1.2375733852386475} 11/07/2021 08:31:37 - INFO - __main__ - Step 80259: {'lr': 0.00022743463594573834, 'samples': 15409728, 'steps': 80258, 'loss/train': 1.4268437623977661} 11/07/2021 08:31:37 - INFO - __main__ - Step 80260: {'lr': 0.0002274293508680547, 'samples': 15409920, 'steps': 80259, 'loss/train': 1.173180103302002} 11/07/2021 08:31:38 - INFO - __main__ - Step 80261: {'lr': 0.00022742406580054106, 'samples': 15410112, 'steps': 80260, 'loss/train': 0.8498280048370361} 11/07/2021 08:31:38 - INFO - __main__ - Step 80262: {'lr': 0.0002274187807431998, 'samples': 15410304, 'steps': 80261, 'loss/train': 1.6250836849212646} 11/07/2021 08:31:39 - INFO - __main__ - Step 80263: {'lr': 0.00022741349569603328, 'samples': 15410496, 'steps': 80262, 'loss/train': 1.5892409086227417} 11/07/2021 08:31:39 - INFO - __main__ - Step 80264: {'lr': 0.00022740821065904388, 'samples': 15410688, 'steps': 80263, 'loss/train': 1.7252424955368042} 11/07/2021 08:31:39 - INFO - __main__ - Step 80265: {'lr': 0.000227402925632234, 'samples': 15410880, 'steps': 80264, 'loss/train': 1.8570009469985962} 11/07/2021 08:31:40 - INFO - __main__ - Step 80266: {'lr': 0.00022739764061560603, 'samples': 15411072, 'steps': 80265, 'loss/train': 1.6397666931152344} 11/07/2021 08:31:41 - INFO - __main__ - Step 80267: {'lr': 0.00022739235560916232, 'samples': 15411264, 'steps': 80266, 'loss/train': 1.7413972616195679} 11/07/2021 08:31:41 - INFO - __main__ - Step 80268: {'lr': 0.00022738707061290526, 'samples': 15411456, 'steps': 80267, 'loss/train': 1.2757515907287598} 11/07/2021 08:31:42 - INFO - __main__ - Step 80269: {'lr': 0.00022738178562683724, 'samples': 15411648, 'steps': 80268, 'loss/train': 1.1452776193618774} 11/07/2021 08:31:42 - INFO - __main__ - Step 80270: {'lr': 0.00022737650065096074, 'samples': 15411840, 'steps': 80269, 'loss/train': 1.1538201570510864} 11/07/2021 08:31:43 - INFO - __main__ - Step 80271: {'lr': 0.00022737121568527794, 'samples': 15412032, 'steps': 80270, 'loss/train': 0.6795769929885864} 11/07/2021 08:31:43 - INFO - __main__ - Step 80272: {'lr': 0.00022736593072979135, 'samples': 15412224, 'steps': 80271, 'loss/train': 1.534676194190979} 11/07/2021 08:31:44 - INFO - __main__ - Step 80273: {'lr': 0.00022736064578450328, 'samples': 15412416, 'steps': 80272, 'loss/train': 1.3233915567398071} 11/07/2021 08:31:44 - INFO - __main__ - Step 80274: {'lr': 0.00022735536084941615, 'samples': 15412608, 'steps': 80273, 'loss/train': 1.2204018831253052} 11/07/2021 08:31:44 - INFO - __main__ - Step 80275: {'lr': 0.00022735007592453236, 'samples': 15412800, 'steps': 80274, 'loss/train': 3.2526938915252686} 11/07/2021 08:31:46 - INFO - __main__ - Step 80276: {'lr': 0.00022734479100985428, 'samples': 15412992, 'steps': 80275, 'loss/train': 1.2375479936599731} 11/07/2021 08:31:46 - INFO - __main__ - Step 80277: {'lr': 0.00022733950610538429, 'samples': 15413184, 'steps': 80276, 'loss/train': 1.0655003786087036} 11/07/2021 08:31:46 - INFO - __main__ - Step 80278: {'lr': 0.00022733422121112476, 'samples': 15413376, 'steps': 80277, 'loss/train': 1.3760015964508057} 11/07/2021 08:31:47 - INFO - __main__ - Step 80279: {'lr': 0.00022732893632707808, 'samples': 15413568, 'steps': 80278, 'loss/train': 1.8169236183166504} 11/07/2021 08:31:47 - INFO - __main__ - Step 80280: {'lr': 0.00022732365145324666, 'samples': 15413760, 'steps': 80279, 'loss/train': 1.3300740718841553} 11/07/2021 08:31:47 - INFO - __main__ - Step 80281: {'lr': 0.00022731836658963282, 'samples': 15413952, 'steps': 80280, 'loss/train': 1.2664580345153809} 11/07/2021 08:31:48 - INFO - __main__ - Step 80282: {'lr': 0.00022731308173623896, 'samples': 15414144, 'steps': 80281, 'loss/train': 0.7569831013679504} 11/07/2021 08:31:49 - INFO - __main__ - Step 80283: {'lr': 0.0002273077968930675, 'samples': 15414336, 'steps': 80282, 'loss/train': 1.3099533319473267} 11/07/2021 08:31:49 - INFO - __main__ - Step 80284: {'lr': 0.00022730251206012092, 'samples': 15414528, 'steps': 80283, 'loss/train': 0.9874062538146973} 11/07/2021 08:31:50 - INFO - __main__ - Step 80285: {'lr': 0.00022729722723740134, 'samples': 15414720, 'steps': 80284, 'loss/train': 1.227113127708435} 11/07/2021 08:31:50 - INFO - __main__ - Step 80286: {'lr': 0.0002272919424249113, 'samples': 15414912, 'steps': 80285, 'loss/train': 1.3904186487197876} 11/07/2021 08:31:51 - INFO - __main__ - Step 80287: {'lr': 0.00022728665762265316, 'samples': 15415104, 'steps': 80286, 'loss/train': 0.7525635361671448} 11/07/2021 08:31:51 - INFO - __main__ - Step 80288: {'lr': 0.00022728137283062927, 'samples': 15415296, 'steps': 80287, 'loss/train': 1.3139816522598267} 11/07/2021 08:31:52 - INFO - __main__ - Step 80289: {'lr': 0.00022727608804884209, 'samples': 15415488, 'steps': 80288, 'loss/train': 0.9464805126190186} 11/07/2021 08:31:52 - INFO - __main__ - Step 80290: {'lr': 0.0002272708032772939, 'samples': 15415680, 'steps': 80289, 'loss/train': 1.2836220264434814} 11/07/2021 08:31:52 - INFO - __main__ - Step 80291: {'lr': 0.00022726551851598719, 'samples': 15415872, 'steps': 80290, 'loss/train': 1.5398194789886475} 11/07/2021 08:31:54 - INFO - __main__ - Step 80292: {'lr': 0.00022726023376492424, 'samples': 15416064, 'steps': 80291, 'loss/train': 1.4523439407348633} 11/07/2021 08:31:55 - INFO - __main__ - Step 80293: {'lr': 0.0002272549490241075, 'samples': 15416256, 'steps': 80292, 'loss/train': 0.8713854551315308} 11/07/2021 08:31:55 - INFO - __main__ - Step 80294: {'lr': 0.00022724966429353934, 'samples': 15416448, 'steps': 80293, 'loss/train': 1.6889492273330688} 11/07/2021 08:31:55 - INFO - __main__ - Step 80295: {'lr': 0.00022724437957322215, 'samples': 15416640, 'steps': 80294, 'loss/train': 1.7129504680633545} 11/07/2021 08:31:56 - INFO - __main__ - Step 80296: {'lr': 0.00022723909486315825, 'samples': 15416832, 'steps': 80295, 'loss/train': 0.8479904532432556} 11/07/2021 08:31:56 - INFO - __main__ - Step 80297: {'lr': 0.00022723381016335014, 'samples': 15417024, 'steps': 80296, 'loss/train': 1.2875561714172363} 11/07/2021 08:31:56 - INFO - __main__ - Step 80298: {'lr': 0.00022722852547380008, 'samples': 15417216, 'steps': 80297, 'loss/train': 1.541023850440979} 11/07/2021 08:31:57 - INFO - __main__ - Step 80299: {'lr': 0.00022722324079451047, 'samples': 15417408, 'steps': 80298, 'loss/train': 2.069721221923828} 11/07/2021 08:31:58 - INFO - __main__ - Step 80300: {'lr': 0.00022721795612548373, 'samples': 15417600, 'steps': 80299, 'loss/train': 1.5776804685592651} 11/07/2021 08:31:58 - INFO - __main__ - Step 80301: {'lr': 0.0002272126714667222, 'samples': 15417792, 'steps': 80300, 'loss/train': 0.15357792377471924} 11/07/2021 08:31:59 - INFO - __main__ - Step 80302: {'lr': 0.0002272073868182283, 'samples': 15417984, 'steps': 80301, 'loss/train': 1.6787950992584229} 11/07/2021 08:31:59 - INFO - __main__ - Step 80303: {'lr': 0.00022720210218000442, 'samples': 15418176, 'steps': 80302, 'loss/train': 0.99489825963974} 11/07/2021 08:32:00 - INFO - __main__ - Step 80304: {'lr': 0.0002271968175520529, 'samples': 15418368, 'steps': 80303, 'loss/train': 1.5233747959136963} 11/07/2021 08:32:00 - INFO - __main__ - Step 80305: {'lr': 0.00022719153293437614, 'samples': 15418560, 'steps': 80304, 'loss/train': 1.4045875072479248} 11/07/2021 08:32:01 - INFO - __main__ - Step 80306: {'lr': 0.00022718624832697654, 'samples': 15418752, 'steps': 80305, 'loss/train': 1.5152599811553955} 11/07/2021 08:32:01 - INFO - __main__ - Step 80307: {'lr': 0.00022718096372985645, 'samples': 15418944, 'steps': 80306, 'loss/train': 1.2986161708831787} 11/07/2021 08:32:01 - INFO - __main__ - Step 80308: {'lr': 0.00022717567914301828, 'samples': 15419136, 'steps': 80307, 'loss/train': 1.1427717208862305} 11/07/2021 08:32:02 - INFO - __main__ - Step 80309: {'lr': 0.0002271703945664644, 'samples': 15419328, 'steps': 80308, 'loss/train': 1.1220579147338867} 11/07/2021 08:32:03 - INFO - __main__ - Step 80310: {'lr': 0.00022716511000019717, 'samples': 15419520, 'steps': 80309, 'loss/train': 1.3188680410385132} 11/07/2021 08:32:03 - INFO - __main__ - Step 80311: {'lr': 0.0002271598254442191, 'samples': 15419712, 'steps': 80310, 'loss/train': 1.4665130376815796} 11/07/2021 08:32:03 - INFO - __main__ - Step 80312: {'lr': 0.00022715454089853234, 'samples': 15419904, 'steps': 80311, 'loss/train': 1.7992103099822998} 11/07/2021 08:32:04 - INFO - __main__ - Step 80313: {'lr': 0.0002271492563631394, 'samples': 15420096, 'steps': 80312, 'loss/train': 0.12943339347839355} 11/07/2021 08:32:04 - INFO - __main__ - Step 80314: {'lr': 0.00022714397183804267, 'samples': 15420288, 'steps': 80313, 'loss/train': 1.4647018909454346} 11/07/2021 08:32:05 - INFO - __main__ - Step 80315: {'lr': 0.0002271386873232445, 'samples': 15420480, 'steps': 80314, 'loss/train': 0.5564707517623901} 11/07/2021 08:32:05 - INFO - __main__ - Step 80316: {'lr': 0.0002271334028187473, 'samples': 15420672, 'steps': 80315, 'loss/train': 2.1669363975524902} 11/07/2021 08:32:06 - INFO - __main__ - Step 80317: {'lr': 0.00022712811832455341, 'samples': 15420864, 'steps': 80316, 'loss/train': 1.7576414346694946} 11/07/2021 08:32:06 - INFO - __main__ - Step 80318: {'lr': 0.00022712283384066523, 'samples': 15421056, 'steps': 80317, 'loss/train': 1.4598629474639893} 11/07/2021 08:32:06 - INFO - __main__ - Step 80319: {'lr': 0.00022711754936708518, 'samples': 15421248, 'steps': 80318, 'loss/train': 1.3516608476638794} 11/07/2021 08:32:08 - INFO - __main__ - Step 80320: {'lr': 0.0002271122649038156, 'samples': 15421440, 'steps': 80319, 'loss/train': 1.5991711616516113} 11/07/2021 08:32:08 - INFO - __main__ - Step 80321: {'lr': 0.00022710698045085888, 'samples': 15421632, 'steps': 80320, 'loss/train': 1.7302504777908325} 11/07/2021 08:32:08 - INFO - __main__ - Step 80322: {'lr': 0.0002271016960082174, 'samples': 15421824, 'steps': 80321, 'loss/train': 1.9067524671554565} 11/07/2021 08:32:09 - INFO - __main__ - Step 80323: {'lr': 0.00022709641157589352, 'samples': 15422016, 'steps': 80322, 'loss/train': 1.2394781112670898} 11/07/2021 08:32:09 - INFO - __main__ - Step 80324: {'lr': 0.0002270911271538898, 'samples': 15422208, 'steps': 80323, 'loss/train': 1.214584231376648} 11/07/2021 08:32:10 - INFO - __main__ - Step 80325: {'lr': 0.00022708584274220832, 'samples': 15422400, 'steps': 80324, 'loss/train': 1.5900359153747559} 11/07/2021 08:32:10 - INFO - __main__ - Step 80326: {'lr': 0.0002270805583408516, 'samples': 15422592, 'steps': 80325, 'loss/train': 1.723962664604187} 11/07/2021 08:32:11 - INFO - __main__ - Step 80327: {'lr': 0.00022707527394982206, 'samples': 15422784, 'steps': 80326, 'loss/train': 1.2769920825958252} 11/07/2021 08:32:11 - INFO - __main__ - Step 80328: {'lr': 0.00022706998956912203, 'samples': 15422976, 'steps': 80327, 'loss/train': 1.5395835638046265} 11/07/2021 08:32:11 - INFO - __main__ - Step 80329: {'lr': 0.00022706470519875388, 'samples': 15423168, 'steps': 80328, 'loss/train': 1.4430166482925415} 11/07/2021 08:32:12 - INFO - __main__ - Step 80330: {'lr': 0.00022705942083872004, 'samples': 15423360, 'steps': 80329, 'loss/train': 1.9142855405807495} 11/07/2021 08:32:13 - INFO - __main__ - Step 80331: {'lr': 0.0002270541364890229, 'samples': 15423552, 'steps': 80330, 'loss/train': 1.2028204202651978} 11/07/2021 08:32:13 - INFO - __main__ - Step 80332: {'lr': 0.0002270488521496648, 'samples': 15423744, 'steps': 80331, 'loss/train': 1.6860262155532837} 11/07/2021 08:32:13 - INFO - __main__ - Step 80333: {'lr': 0.0002270435678206481, 'samples': 15423936, 'steps': 80332, 'loss/train': 1.6285881996154785} 11/07/2021 08:32:14 - INFO - __main__ - Step 80334: {'lr': 0.00022703828350197525, 'samples': 15424128, 'steps': 80333, 'loss/train': 1.330811858177185} 11/07/2021 08:32:14 - INFO - __main__ - Step 80335: {'lr': 0.0002270329991936486, 'samples': 15424320, 'steps': 80334, 'loss/train': 1.3903027772903442} 11/07/2021 08:32:15 - INFO - __main__ - Step 80336: {'lr': 0.00022702771489567055, 'samples': 15424512, 'steps': 80335, 'loss/train': 1.303515076637268} 11/07/2021 08:32:16 - INFO - __main__ - Step 80337: {'lr': 0.00022702243060804348, 'samples': 15424704, 'steps': 80336, 'loss/train': 1.3322941064834595} 11/07/2021 08:32:16 - INFO - __main__ - Step 80338: {'lr': 0.00022701714633076967, 'samples': 15424896, 'steps': 80337, 'loss/train': 0.950071394443512} 11/07/2021 08:32:16 - INFO - __main__ - Step 80339: {'lr': 0.00022701186206385162, 'samples': 15425088, 'steps': 80338, 'loss/train': 1.4282892942428589} 11/07/2021 08:32:17 - INFO - __main__ - Step 80340: {'lr': 0.00022700657780729162, 'samples': 15425280, 'steps': 80339, 'loss/train': 1.6224639415740967} 11/07/2021 08:32:18 - INFO - __main__ - Step 80341: {'lr': 0.00022700129356109213, 'samples': 15425472, 'steps': 80340, 'loss/train': 1.434701919555664} 11/07/2021 08:32:18 - INFO - __main__ - Step 80342: {'lr': 0.0002269960093252555, 'samples': 15425664, 'steps': 80341, 'loss/train': 1.015322208404541} 11/07/2021 08:32:18 - INFO - __main__ - Step 80343: {'lr': 0.0002269907250997841, 'samples': 15425856, 'steps': 80342, 'loss/train': 1.4212368726730347} 11/07/2021 08:32:19 - INFO - __main__ - Step 80344: {'lr': 0.00022698544088468035, 'samples': 15426048, 'steps': 80343, 'loss/train': 1.4361785650253296} 11/07/2021 08:32:19 - INFO - __main__ - Step 80345: {'lr': 0.0002269801566799466, 'samples': 15426240, 'steps': 80344, 'loss/train': 1.5195316076278687} 11/07/2021 08:32:20 - INFO - __main__ - Step 80346: {'lr': 0.0002269748724855852, 'samples': 15426432, 'steps': 80345, 'loss/train': 1.3836925029754639} 11/07/2021 08:32:20 - INFO - __main__ - Step 80347: {'lr': 0.00022696958830159867, 'samples': 15426624, 'steps': 80346, 'loss/train': 1.6172064542770386} 11/07/2021 08:32:21 - INFO - __main__ - Step 80348: {'lr': 0.0002269643041279892, 'samples': 15426816, 'steps': 80347, 'loss/train': 2.858489513397217} 11/07/2021 08:32:21 - INFO - __main__ - Step 80349: {'lr': 0.00022695901996475925, 'samples': 15427008, 'steps': 80348, 'loss/train': 1.1880229711532593} 11/07/2021 08:32:21 - INFO - __main__ - Step 80350: {'lr': 0.00022695373581191125, 'samples': 15427200, 'steps': 80349, 'loss/train': 1.5420030355453491} 11/07/2021 08:32:23 - INFO - __main__ - Step 80351: {'lr': 0.0002269484516694476, 'samples': 15427392, 'steps': 80350, 'loss/train': 1.5037108659744263} 11/07/2021 08:32:23 - INFO - __main__ - Step 80352: {'lr': 0.00022694316753737052, 'samples': 15427584, 'steps': 80351, 'loss/train': 1.348111629486084} 11/07/2021 08:32:23 - INFO - __main__ - Step 80353: {'lr': 0.00022693788341568254, 'samples': 15427776, 'steps': 80352, 'loss/train': 1.401909351348877} 11/07/2021 08:32:24 - INFO - __main__ - Step 80354: {'lr': 0.00022693259930438596, 'samples': 15427968, 'steps': 80353, 'loss/train': 0.8998346328735352} 11/07/2021 08:32:24 - INFO - __main__ - Step 80355: {'lr': 0.0002269273152034832, 'samples': 15428160, 'steps': 80354, 'loss/train': 1.065050721168518} 11/07/2021 08:32:24 - INFO - __main__ - Step 80356: {'lr': 0.00022692203111297662, 'samples': 15428352, 'steps': 80355, 'loss/train': 1.5072253942489624} 11/07/2021 08:32:25 - INFO - __main__ - Step 80357: {'lr': 0.00022691674703286866, 'samples': 15428544, 'steps': 80356, 'loss/train': 0.9458984732627869} 11/07/2021 08:32:26 - INFO - __main__ - Step 80358: {'lr': 0.00022691146296316167, 'samples': 15428736, 'steps': 80357, 'loss/train': 1.184208869934082} 11/07/2021 08:32:26 - INFO - __main__ - Step 80359: {'lr': 0.000226906178903858, 'samples': 15428928, 'steps': 80358, 'loss/train': 1.178529143333435} 11/07/2021 08:32:26 - INFO - __main__ - Step 80360: {'lr': 0.00022690089485496003, 'samples': 15429120, 'steps': 80359, 'loss/train': 0.5451751947402954} 11/07/2021 08:32:27 - INFO - __main__ - Step 80361: {'lr': 0.00022689561081647017, 'samples': 15429312, 'steps': 80360, 'loss/train': 1.7062832117080688} 11/07/2021 08:32:28 - INFO - __main__ - Step 80362: {'lr': 0.00022689032678839077, 'samples': 15429504, 'steps': 80361, 'loss/train': 1.2073699235916138} 11/07/2021 08:32:28 - INFO - __main__ - Step 80363: {'lr': 0.00022688504277072424, 'samples': 15429696, 'steps': 80362, 'loss/train': 1.6661701202392578} 11/07/2021 08:32:28 - INFO - __main__ - Step 80364: {'lr': 0.00022687975876347304, 'samples': 15429888, 'steps': 80363, 'loss/train': 0.9583553075790405} 11/07/2021 08:32:29 - INFO - __main__ - Step 80365: {'lr': 0.00022687447476663937, 'samples': 15430080, 'steps': 80364, 'loss/train': 1.43746817111969} 11/07/2021 08:32:29 - INFO - __main__ - Step 80366: {'lr': 0.00022686919078022572, 'samples': 15430272, 'steps': 80365, 'loss/train': 0.55706387758255} 11/07/2021 08:32:30 - INFO - __main__ - Step 80367: {'lr': 0.00022686390680423446, 'samples': 15430464, 'steps': 80366, 'loss/train': 0.9459983110427856} 11/07/2021 08:32:31 - INFO - __main__ - Step 80368: {'lr': 0.00022685862283866796, 'samples': 15430656, 'steps': 80367, 'loss/train': 1.5005486011505127} 11/07/2021 08:32:31 - INFO - __main__ - Step 80369: {'lr': 0.00022685333888352867, 'samples': 15430848, 'steps': 80368, 'loss/train': 1.493485927581787} 11/07/2021 08:32:31 - INFO - __main__ - Step 80370: {'lr': 0.00022684805493881883, 'samples': 15431040, 'steps': 80369, 'loss/train': 1.2613518238067627} 11/07/2021 08:32:32 - INFO - __main__ - Step 80371: {'lr': 0.0002268427710045409, 'samples': 15431232, 'steps': 80370, 'loss/train': 1.3479903936386108} 11/07/2021 08:32:33 - INFO - __main__ - Step 80372: {'lr': 0.00022683748708069728, 'samples': 15431424, 'steps': 80371, 'loss/train': 1.4525763988494873} 11/07/2021 08:32:33 - INFO - __main__ - Step 80373: {'lr': 0.00022683220316729034, 'samples': 15431616, 'steps': 80372, 'loss/train': 1.1053258180618286} 11/07/2021 08:32:33 - INFO - __main__ - Step 80374: {'lr': 0.00022682691926432245, 'samples': 15431808, 'steps': 80373, 'loss/train': 1.596994400024414} 11/07/2021 08:32:34 - INFO - __main__ - Step 80375: {'lr': 0.000226821635371796, 'samples': 15432000, 'steps': 80374, 'loss/train': 1.456938624382019} 11/07/2021 08:32:34 - INFO - __main__ - Step 80376: {'lr': 0.00022681635148971333, 'samples': 15432192, 'steps': 80375, 'loss/train': 1.0378397703170776} 11/07/2021 08:32:35 - INFO - __main__ - Step 80377: {'lr': 0.00022681106761807685, 'samples': 15432384, 'steps': 80376, 'loss/train': 1.2032421827316284} 11/07/2021 08:32:35 - INFO - __main__ - Step 80378: {'lr': 0.00022680578375688904, 'samples': 15432576, 'steps': 80377, 'loss/train': 1.4977059364318848} 11/07/2021 08:32:36 - INFO - __main__ - Step 80379: {'lr': 0.0002268004999061521, 'samples': 15432768, 'steps': 80378, 'loss/train': 1.6890228986740112} 11/07/2021 08:32:36 - INFO - __main__ - Step 80380: {'lr': 0.00022679521606586855, 'samples': 15432960, 'steps': 80379, 'loss/train': 1.363366723060608} 11/07/2021 08:32:37 - INFO - __main__ - Step 80381: {'lr': 0.0002267899322360407, 'samples': 15433152, 'steps': 80380, 'loss/train': 1.3705031871795654} 11/07/2021 08:32:37 - INFO - __main__ - Step 80382: {'lr': 0.0002267846484166709, 'samples': 15433344, 'steps': 80381, 'loss/train': 0.7400863170623779} 11/07/2021 08:32:38 - INFO - __main__ - Step 80383: {'lr': 0.0002267793646077616, 'samples': 15433536, 'steps': 80382, 'loss/train': 1.5416817665100098} 11/07/2021 08:32:38 - INFO - __main__ - Step 80384: {'lr': 0.00022677408080931517, 'samples': 15433728, 'steps': 80383, 'loss/train': 0.9293023943901062} 11/07/2021 08:32:39 - INFO - __main__ - Step 80385: {'lr': 0.00022676879702133396, 'samples': 15433920, 'steps': 80384, 'loss/train': 1.1092456579208374} 11/07/2021 08:32:39 - INFO - __main__ - Step 80386: {'lr': 0.00022676351324382038, 'samples': 15434112, 'steps': 80385, 'loss/train': 1.6330891847610474} 11/07/2021 08:32:39 - INFO - __main__ - Step 80387: {'lr': 0.00022675822947677683, 'samples': 15434304, 'steps': 80386, 'loss/train': 1.3594927787780762} 11/07/2021 08:32:41 - INFO - __main__ - Step 80388: {'lr': 0.00022675294572020564, 'samples': 15434496, 'steps': 80387, 'loss/train': 1.1162834167480469} 11/07/2021 08:32:41 - INFO - __main__ - Step 80389: {'lr': 0.0002267476619741092, 'samples': 15434688, 'steps': 80388, 'loss/train': 1.285407304763794} 11/07/2021 08:32:41 - INFO - __main__ - Step 80390: {'lr': 0.00022674237823848992, 'samples': 15434880, 'steps': 80389, 'loss/train': 1.6997095346450806} 11/07/2021 08:32:42 - INFO - __main__ - Step 80391: {'lr': 0.0002267370945133503, 'samples': 15435072, 'steps': 80390, 'loss/train': 1.1748409271240234} 11/07/2021 08:32:42 - INFO - __main__ - Step 80392: {'lr': 0.00022673181079869244, 'samples': 15435264, 'steps': 80391, 'loss/train': 1.3545259237289429} 11/07/2021 08:32:43 - INFO - __main__ - Step 80393: {'lr': 0.00022672652709451884, 'samples': 15435456, 'steps': 80392, 'loss/train': 1.4459797143936157} 11/07/2021 08:32:44 - INFO - __main__ - Step 80394: {'lr': 0.00022672124340083197, 'samples': 15435648, 'steps': 80393, 'loss/train': 1.2458226680755615} 11/07/2021 08:32:44 - INFO - __main__ - Step 80395: {'lr': 0.0002267159597176341, 'samples': 15435840, 'steps': 80394, 'loss/train': 1.4926286935806274} 11/07/2021 08:32:44 - INFO - __main__ - Step 80396: {'lr': 0.0002267106760449277, 'samples': 15436032, 'steps': 80395, 'loss/train': 1.3483481407165527} 11/07/2021 08:32:45 - INFO - __main__ - Step 80397: {'lr': 0.00022670539238271508, 'samples': 15436224, 'steps': 80396, 'loss/train': 1.352221131324768} 11/07/2021 08:32:45 - INFO - __main__ - Step 80398: {'lr': 0.00022670010873099866, 'samples': 15436416, 'steps': 80397, 'loss/train': 1.3933802843093872} 11/07/2021 08:32:46 - INFO - __main__ - Step 80399: {'lr': 0.00022669482508978084, 'samples': 15436608, 'steps': 80398, 'loss/train': 1.1343199014663696} 11/07/2021 08:32:46 - INFO - __main__ - Step 80400: {'lr': 0.00022668954145906394, 'samples': 15436800, 'steps': 80399, 'loss/train': 1.1533664464950562} 11/07/2021 08:32:47 - INFO - __main__ - Step 80401: {'lr': 0.0002266842578388504, 'samples': 15436992, 'steps': 80400, 'loss/train': 1.6140471696853638} 11/07/2021 08:32:47 - INFO - __main__ - Step 80402: {'lr': 0.00022667897422914252, 'samples': 15437184, 'steps': 80401, 'loss/train': 1.6481002569198608} 11/07/2021 08:32:47 - INFO - __main__ - Step 80403: {'lr': 0.0002266736906299428, 'samples': 15437376, 'steps': 80402, 'loss/train': 1.422335147857666} 11/07/2021 08:32:49 - INFO - __main__ - Step 80404: {'lr': 0.00022666840704125353, 'samples': 15437568, 'steps': 80403, 'loss/train': 1.7618523836135864} 11/07/2021 08:32:49 - INFO - __main__ - Step 80405: {'lr': 0.00022666312346307719, 'samples': 15437760, 'steps': 80404, 'loss/train': 1.3823027610778809} 11/07/2021 08:32:49 - INFO - __main__ - Step 80406: {'lr': 0.000226657839895416, 'samples': 15437952, 'steps': 80405, 'loss/train': 1.6943824291229248} 11/07/2021 08:32:50 - INFO - __main__ - Step 80407: {'lr': 0.00022665255633827245, 'samples': 15438144, 'steps': 80406, 'loss/train': 1.5283958911895752} 11/07/2021 08:32:50 - INFO - __main__ - Step 80408: {'lr': 0.00022664727279164888, 'samples': 15438336, 'steps': 80407, 'loss/train': 1.539694905281067} 11/07/2021 08:32:52 - INFO - __main__ - Step 80409: {'lr': 0.00022664198925554768, 'samples': 15438528, 'steps': 80408, 'loss/train': 1.182418942451477} 11/07/2021 08:32:52 - INFO - __main__ - Step 80410: {'lr': 0.00022663670572997124, 'samples': 15438720, 'steps': 80409, 'loss/train': 1.8067249059677124} 11/07/2021 08:32:52 - INFO - __main__ - Step 80411: {'lr': 0.00022663142221492194, 'samples': 15438912, 'steps': 80410, 'loss/train': 1.548316240310669} 11/07/2021 08:32:53 - INFO - __main__ - Step 80412: {'lr': 0.0002266261387104022, 'samples': 15439104, 'steps': 80411, 'loss/train': 1.7307393550872803} 11/07/2021 08:32:53 - INFO - __main__ - Step 80413: {'lr': 0.0002266208552164143, 'samples': 15439296, 'steps': 80412, 'loss/train': 1.8881574869155884} 11/07/2021 08:32:53 - INFO - __main__ - Step 80414: {'lr': 0.00022661557173296072, 'samples': 15439488, 'steps': 80413, 'loss/train': 1.5918453931808472} 11/07/2021 08:32:54 - INFO - __main__ - Step 80415: {'lr': 0.0002266102882600438, 'samples': 15439680, 'steps': 80414, 'loss/train': 0.09243754297494888} 11/07/2021 08:32:55 - INFO - __main__ - Step 80416: {'lr': 0.0002266050047976659, 'samples': 15439872, 'steps': 80415, 'loss/train': 1.1321505308151245} 11/07/2021 08:32:55 - INFO - __main__ - Step 80417: {'lr': 0.00022659972134582947, 'samples': 15440064, 'steps': 80416, 'loss/train': 1.7138075828552246} 11/07/2021 08:32:55 - INFO - __main__ - Step 80418: {'lr': 0.0002265944379045369, 'samples': 15440256, 'steps': 80417, 'loss/train': 0.7976477742195129} 11/07/2021 08:32:56 - INFO - __main__ - Step 80419: {'lr': 0.00022658915447379044, 'samples': 15440448, 'steps': 80418, 'loss/train': 1.3998206853866577} 11/07/2021 08:32:57 - INFO - __main__ - Step 80420: {'lr': 0.00022658387105359255, 'samples': 15440640, 'steps': 80419, 'loss/train': 1.1777361631393433} 11/07/2021 08:32:57 - INFO - __main__ - Step 80421: {'lr': 0.0002265785876439456, 'samples': 15440832, 'steps': 80420, 'loss/train': 2.368927001953125} 11/07/2021 08:32:58 - INFO - __main__ - Step 80422: {'lr': 0.00022657330424485196, 'samples': 15441024, 'steps': 80421, 'loss/train': 1.380586862564087} 11/07/2021 08:32:58 - INFO - __main__ - Step 80423: {'lr': 0.00022656802085631403, 'samples': 15441216, 'steps': 80422, 'loss/train': 1.5333266258239746} 11/07/2021 08:32:58 - INFO - __main__ - Step 80424: {'lr': 0.0002265627374783342, 'samples': 15441408, 'steps': 80423, 'loss/train': 1.1237794160842896} 11/07/2021 08:32:59 - INFO - __main__ - Step 80425: {'lr': 0.00022655745411091484, 'samples': 15441600, 'steps': 80424, 'loss/train': 1.3578449487686157} 11/07/2021 08:33:00 - INFO - __main__ - Step 80426: {'lr': 0.0002265521707540583, 'samples': 15441792, 'steps': 80425, 'loss/train': 1.0633922815322876} 11/07/2021 08:33:00 - INFO - __main__ - Step 80427: {'lr': 0.00022654688740776703, 'samples': 15441984, 'steps': 80426, 'loss/train': 1.387481927871704} 11/07/2021 08:33:00 - INFO - __main__ - Step 80428: {'lr': 0.00022654160407204336, 'samples': 15442176, 'steps': 80427, 'loss/train': 1.118837833404541} 11/07/2021 08:33:01 - INFO - __main__ - Step 80429: {'lr': 0.0002265363207468897, 'samples': 15442368, 'steps': 80428, 'loss/train': 1.3397881984710693} 11/07/2021 08:33:01 - INFO - __main__ - Step 80430: {'lr': 0.00022653103743230834, 'samples': 15442560, 'steps': 80429, 'loss/train': 0.7896357178688049} 11/07/2021 08:33:02 - INFO - __main__ - Step 80431: {'lr': 0.0002265257541283018, 'samples': 15442752, 'steps': 80430, 'loss/train': 1.511634111404419} 11/07/2021 08:33:03 - INFO - __main__ - Step 80432: {'lr': 0.0002265204708348725, 'samples': 15442944, 'steps': 80431, 'loss/train': 1.533661127090454} 11/07/2021 08:33:03 - INFO - __main__ - Step 80433: {'lr': 0.00022651518755202255, 'samples': 15443136, 'steps': 80432, 'loss/train': 0.9661164283752441} 11/07/2021 08:33:03 - INFO - __main__ - Step 80434: {'lr': 0.00022650990427975455, 'samples': 15443328, 'steps': 80433, 'loss/train': 1.476048231124878} 11/07/2021 08:33:04 - INFO - __main__ - Step 80435: {'lr': 0.0002265046210180708, 'samples': 15443520, 'steps': 80434, 'loss/train': 1.4840716123580933} 11/07/2021 08:33:05 - INFO - __main__ - Step 80436: {'lr': 0.0002264993377669737, 'samples': 15443712, 'steps': 80435, 'loss/train': 2.115273952484131} 11/07/2021 08:33:05 - INFO - __main__ - Step 80437: {'lr': 0.00022649405452646566, 'samples': 15443904, 'steps': 80436, 'loss/train': 1.4025521278381348} 11/07/2021 08:33:05 - INFO - __main__ - Step 80438: {'lr': 0.000226488771296549, 'samples': 15444096, 'steps': 80437, 'loss/train': 1.9101814031600952} 11/07/2021 08:33:06 - INFO - __main__ - Step 80439: {'lr': 0.00022648348807722618, 'samples': 15444288, 'steps': 80438, 'loss/train': 1.695709466934204} 11/07/2021 08:33:06 - INFO - __main__ - Step 80440: {'lr': 0.0002264782048684995, 'samples': 15444480, 'steps': 80439, 'loss/train': 1.544562816619873} 11/07/2021 08:33:07 - INFO - __main__ - Step 80441: {'lr': 0.00022647292167037142, 'samples': 15444672, 'steps': 80440, 'loss/train': 1.199527382850647} 11/07/2021 08:33:07 - INFO - __main__ - Step 80442: {'lr': 0.00022646763848284423, 'samples': 15444864, 'steps': 80441, 'loss/train': 1.32038152217865} 11/07/2021 08:33:08 - INFO - __main__ - Step 80443: {'lr': 0.00022646235530592037, 'samples': 15445056, 'steps': 80442, 'loss/train': 1.9227005243301392} 11/07/2021 08:33:08 - INFO - __main__ - Step 80444: {'lr': 0.00022645707213960224, 'samples': 15445248, 'steps': 80443, 'loss/train': 1.6205847263336182} 11/07/2021 08:33:09 - INFO - __main__ - Step 80445: {'lr': 0.0002264517889838923, 'samples': 15445440, 'steps': 80444, 'loss/train': 1.544402837753296} 11/07/2021 08:33:09 - INFO - __main__ - Step 80446: {'lr': 0.00022644650583879267, 'samples': 15445632, 'steps': 80445, 'loss/train': 1.1498667001724243} 11/07/2021 08:33:10 - INFO - __main__ - Step 80447: {'lr': 0.00022644122270430592, 'samples': 15445824, 'steps': 80446, 'loss/train': 1.4284141063690186} 11/07/2021 08:33:10 - INFO - __main__ - Step 80448: {'lr': 0.00022643593958043438, 'samples': 15446016, 'steps': 80447, 'loss/train': 1.1631766557693481} 11/07/2021 08:33:11 - INFO - __main__ - Step 80449: {'lr': 0.0002264306564671804, 'samples': 15446208, 'steps': 80448, 'loss/train': 1.328924298286438} 11/07/2021 08:33:11 - INFO - __main__ - Step 80450: {'lr': 0.00022642537336454646, 'samples': 15446400, 'steps': 80449, 'loss/train': 1.5398777723312378} 11/07/2021 08:33:11 - INFO - __main__ - Step 80451: {'lr': 0.00022642009027253485, 'samples': 15446592, 'steps': 80450, 'loss/train': 1.8749066591262817} 11/07/2021 08:33:12 - INFO - __main__ - Step 80452: {'lr': 0.00022641480719114802, 'samples': 15446784, 'steps': 80451, 'loss/train': 1.5275803804397583} 11/07/2021 08:33:13 - INFO - __main__ - Step 80453: {'lr': 0.0002264095241203883, 'samples': 15446976, 'steps': 80452, 'loss/train': 1.4894533157348633} 11/07/2021 08:33:13 - INFO - __main__ - Step 80454: {'lr': 0.00022640424106025805, 'samples': 15447168, 'steps': 80453, 'loss/train': 1.5721566677093506} 11/07/2021 08:33:13 - INFO - __main__ - Step 80455: {'lr': 0.00022639895801075972, 'samples': 15447360, 'steps': 80454, 'loss/train': 1.421612024307251} 11/07/2021 08:33:14 - INFO - __main__ - Step 80456: {'lr': 0.00022639367497189565, 'samples': 15447552, 'steps': 80455, 'loss/train': 0.9693846702575684} 11/07/2021 08:33:15 - INFO - __main__ - Step 80457: {'lr': 0.0002263883919436682, 'samples': 15447744, 'steps': 80456, 'loss/train': 1.7742680311203003} 11/07/2021 08:33:15 - INFO - __main__ - Step 80458: {'lr': 0.0002263831089260799, 'samples': 15447936, 'steps': 80457, 'loss/train': 0.17746445536613464} 11/07/2021 08:33:16 - INFO - __main__ - Step 80459: {'lr': 0.0002263778259191329, 'samples': 15448128, 'steps': 80458, 'loss/train': 1.4547885656356812} 11/07/2021 08:33:16 - INFO - __main__ - Step 80460: {'lr': 0.0002263725429228297, 'samples': 15448320, 'steps': 80459, 'loss/train': 1.5587782859802246} 11/07/2021 08:33:16 - INFO - __main__ - Step 80461: {'lr': 0.00022636725993717267, 'samples': 15448512, 'steps': 80460, 'loss/train': 1.5326534509658813} 11/07/2021 08:33:17 - INFO - __main__ - Step 80462: {'lr': 0.00022636197696216415, 'samples': 15448704, 'steps': 80461, 'loss/train': 1.4252240657806396} 11/07/2021 08:33:18 - INFO - __main__ - Step 80463: {'lr': 0.00022635669399780658, 'samples': 15448896, 'steps': 80462, 'loss/train': 1.303495168685913} 11/07/2021 08:33:18 - INFO - __main__ - Step 80464: {'lr': 0.00022635141104410234, 'samples': 15449088, 'steps': 80463, 'loss/train': 2.1545631885528564} 11/07/2021 08:33:18 - INFO - __main__ - Step 80465: {'lr': 0.00022634612810105376, 'samples': 15449280, 'steps': 80464, 'loss/train': 1.5582047700881958} 11/07/2021 08:33:19 - INFO - __main__ - Step 80466: {'lr': 0.00022634084516866328, 'samples': 15449472, 'steps': 80465, 'loss/train': 1.512609839439392} 11/07/2021 08:33:20 - INFO - __main__ - Step 80467: {'lr': 0.0002263355622469332, 'samples': 15449664, 'steps': 80466, 'loss/train': 1.6828184127807617} 11/07/2021 08:33:20 - INFO - __main__ - Step 80468: {'lr': 0.000226330279335866, 'samples': 15449856, 'steps': 80467, 'loss/train': 1.2491827011108398} 11/07/2021 08:33:20 - INFO - __main__ - Step 80469: {'lr': 0.000226324996435464, 'samples': 15450048, 'steps': 80468, 'loss/train': 1.8985967636108398} 11/07/2021 08:33:21 - INFO - __main__ - Step 80470: {'lr': 0.00022631971354572964, 'samples': 15450240, 'steps': 80469, 'loss/train': 1.5150834321975708} 11/07/2021 08:33:21 - INFO - __main__ - Step 80471: {'lr': 0.00022631443066666517, 'samples': 15450432, 'steps': 80470, 'loss/train': 1.1702529191970825} 11/07/2021 08:33:22 - INFO - __main__ - Step 80472: {'lr': 0.00022630914779827316, 'samples': 15450624, 'steps': 80471, 'loss/train': 0.8752043843269348} 11/07/2021 08:33:22 - INFO - __main__ - Step 80473: {'lr': 0.0002263038649405558, 'samples': 15450816, 'steps': 80472, 'loss/train': 1.1142491102218628} 11/07/2021 08:33:23 - INFO - __main__ - Step 80474: {'lr': 0.00022629858209351555, 'samples': 15451008, 'steps': 80473, 'loss/train': 1.1472994089126587} 11/07/2021 08:33:23 - INFO - __main__ - Step 80475: {'lr': 0.0002262932992571548, 'samples': 15451200, 'steps': 80474, 'loss/train': 1.2585649490356445} 11/07/2021 08:33:23 - INFO - __main__ - Step 80476: {'lr': 0.00022628801643147592, 'samples': 15451392, 'steps': 80475, 'loss/train': 1.548427700996399} 11/07/2021 08:33:24 - INFO - __main__ - Step 80477: {'lr': 0.0002262827336164813, 'samples': 15451584, 'steps': 80476, 'loss/train': 2.0128931999206543} 11/07/2021 08:33:25 - INFO - __main__ - Step 80478: {'lr': 0.0002262774508121733, 'samples': 15451776, 'steps': 80477, 'loss/train': 2.0816409587860107} 11/07/2021 08:33:25 - INFO - __main__ - Step 80479: {'lr': 0.00022627216801855433, 'samples': 15451968, 'steps': 80478, 'loss/train': 1.289298415184021} 11/07/2021 08:33:26 - INFO - __main__ - Step 80480: {'lr': 0.00022626688523562675, 'samples': 15452160, 'steps': 80479, 'loss/train': 0.9806556105613708} 11/07/2021 08:33:26 - INFO - __main__ - Step 80481: {'lr': 0.000226261602463393, 'samples': 15452352, 'steps': 80480, 'loss/train': 1.8725296258926392} 11/07/2021 08:33:26 - INFO - __main__ - Step 80482: {'lr': 0.00022625631970185533, 'samples': 15452544, 'steps': 80481, 'loss/train': 1.6132179498672485} 11/07/2021 08:33:27 - INFO - __main__ - Step 80483: {'lr': 0.00022625103695101623, 'samples': 15452736, 'steps': 80482, 'loss/train': 1.146705150604248} 11/07/2021 08:33:28 - INFO - __main__ - Step 80484: {'lr': 0.00022624575421087802, 'samples': 15452928, 'steps': 80483, 'loss/train': 1.6059215068817139} 11/07/2021 08:33:28 - INFO - __main__ - Step 80485: {'lr': 0.00022624047148144316, 'samples': 15453120, 'steps': 80484, 'loss/train': 1.415229320526123} 11/07/2021 08:33:28 - INFO - __main__ - Step 80486: {'lr': 0.00022623518876271396, 'samples': 15453312, 'steps': 80485, 'loss/train': 1.4211862087249756} 11/07/2021 08:33:29 - INFO - __main__ - Step 80487: {'lr': 0.0002262299060546928, 'samples': 15453504, 'steps': 80486, 'loss/train': 1.5090579986572266} 11/07/2021 08:33:30 - INFO - __main__ - Step 80488: {'lr': 0.00022622462335738206, 'samples': 15453696, 'steps': 80487, 'loss/train': 1.4838758707046509} 11/07/2021 08:33:31 - INFO - __main__ - Step 80489: {'lr': 0.00022621934067078414, 'samples': 15453888, 'steps': 80488, 'loss/train': 2.163022994995117} 11/07/2021 08:33:31 - INFO - __main__ - Step 80490: {'lr': 0.00022621405799490142, 'samples': 15454080, 'steps': 80489, 'loss/train': 1.7805863618850708} 11/07/2021 08:33:31 - INFO - __main__ - Step 80491: {'lr': 0.0002262087753297363, 'samples': 15454272, 'steps': 80490, 'loss/train': 1.6066306829452515} 11/07/2021 08:33:32 - INFO - __main__ - Step 80492: {'lr': 0.00022620349267529118, 'samples': 15454464, 'steps': 80491, 'loss/train': 1.3867416381835938} 11/07/2021 08:33:32 - INFO - __main__ - Step 80493: {'lr': 0.00022619821003156833, 'samples': 15454656, 'steps': 80492, 'loss/train': 1.7011184692382812} 11/07/2021 08:33:33 - INFO - __main__ - Step 80494: {'lr': 0.0002261929273985702, 'samples': 15454848, 'steps': 80493, 'loss/train': 1.4833426475524902} 11/07/2021 08:33:33 - INFO - __main__ - Step 80495: {'lr': 0.0002261876447762992, 'samples': 15455040, 'steps': 80494, 'loss/train': 1.6304357051849365} 11/07/2021 08:33:34 - INFO - __main__ - Step 80496: {'lr': 0.00022618236216475767, 'samples': 15455232, 'steps': 80495, 'loss/train': 1.1123439073562622} 11/07/2021 08:33:34 - INFO - __main__ - Step 80497: {'lr': 0.00022617707956394797, 'samples': 15455424, 'steps': 80496, 'loss/train': 1.1661372184753418} 11/07/2021 08:33:34 - INFO - __main__ - Step 80498: {'lr': 0.00022617179697387253, 'samples': 15455616, 'steps': 80497, 'loss/train': 1.5999706983566284} 11/07/2021 08:33:35 - INFO - __main__ - Step 80499: {'lr': 0.00022616651439453375, 'samples': 15455808, 'steps': 80498, 'loss/train': 1.3515535593032837} 11/07/2021 08:33:36 - INFO - __main__ - Step 80500: {'lr': 0.00022616123182593394, 'samples': 15456000, 'steps': 80499, 'loss/train': 1.3484830856323242} 11/07/2021 08:33:36 - INFO - __main__ - Step 80501: {'lr': 0.00022615594926807551, 'samples': 15456192, 'steps': 80500, 'loss/train': 1.2665990591049194} 11/07/2021 08:33:36 - INFO - __main__ - Step 80502: {'lr': 0.00022615066672096082, 'samples': 15456384, 'steps': 80501, 'loss/train': 1.845191478729248} 11/07/2021 08:33:37 - INFO - __main__ - Step 80503: {'lr': 0.00022614538418459234, 'samples': 15456576, 'steps': 80502, 'loss/train': 1.2231080532073975} 11/07/2021 08:33:37 - INFO - __main__ - Step 80504: {'lr': 0.00022614010165897234, 'samples': 15456768, 'steps': 80503, 'loss/train': 1.5240583419799805} 11/07/2021 08:33:38 - INFO - __main__ - Step 80505: {'lr': 0.00022613481914410323, 'samples': 15456960, 'steps': 80504, 'loss/train': 1.0357080698013306} 11/07/2021 08:33:39 - INFO - __main__ - Step 80506: {'lr': 0.0002261295366399874, 'samples': 15457152, 'steps': 80505, 'loss/train': 1.5251353979110718} 11/07/2021 08:33:39 - INFO - __main__ - Step 80507: {'lr': 0.00022612425414662724, 'samples': 15457344, 'steps': 80506, 'loss/train': 1.5559176206588745} 11/07/2021 08:33:39 - INFO - __main__ - Step 80508: {'lr': 0.00022611897166402512, 'samples': 15457536, 'steps': 80507, 'loss/train': 1.3056142330169678} 11/07/2021 08:33:40 - INFO - __main__ - Step 80509: {'lr': 0.00022611368919218342, 'samples': 15457728, 'steps': 80508, 'loss/train': 1.4360926151275635} 11/07/2021 08:33:41 - INFO - __main__ - Step 80510: {'lr': 0.00022610840673110454, 'samples': 15457920, 'steps': 80509, 'loss/train': 1.540337085723877} 11/07/2021 08:33:41 - INFO - __main__ - Step 80511: {'lr': 0.0002261031242807908, 'samples': 15458112, 'steps': 80510, 'loss/train': 0.42389872670173645} 11/07/2021 08:33:41 - INFO - __main__ - Step 80512: {'lr': 0.00022609784184124472, 'samples': 15458304, 'steps': 80511, 'loss/train': 1.341906189918518} 11/07/2021 08:33:42 - INFO - __main__ - Step 80513: {'lr': 0.0002260925594124685, 'samples': 15458496, 'steps': 80512, 'loss/train': 1.1356186866760254} 11/07/2021 08:33:42 - INFO - __main__ - Step 80514: {'lr': 0.0002260872769944647, 'samples': 15458688, 'steps': 80513, 'loss/train': 1.6595057249069214} 11/07/2021 08:33:43 - INFO - __main__ - Step 80515: {'lr': 0.0002260819945872355, 'samples': 15458880, 'steps': 80514, 'loss/train': 1.7675800323486328} 11/07/2021 08:33:44 - INFO - __main__ - Step 80516: {'lr': 0.00022607671219078342, 'samples': 15459072, 'steps': 80515, 'loss/train': 0.6574535965919495} 11/07/2021 08:33:44 - INFO - __main__ - Step 80517: {'lr': 0.00022607142980511077, 'samples': 15459264, 'steps': 80516, 'loss/train': 1.766332983970642} 11/07/2021 08:33:44 - INFO - __main__ - Step 80518: {'lr': 0.00022606614743021997, 'samples': 15459456, 'steps': 80517, 'loss/train': 1.2056119441986084} 11/07/2021 08:33:45 - INFO - __main__ - Step 80519: {'lr': 0.00022606086506611343, 'samples': 15459648, 'steps': 80518, 'loss/train': 1.9394713640213013} 11/07/2021 08:33:46 - INFO - __main__ - Step 80520: {'lr': 0.00022605558271279348, 'samples': 15459840, 'steps': 80519, 'loss/train': 1.4893428087234497} 11/07/2021 08:33:46 - INFO - __main__ - Step 80521: {'lr': 0.0002260503003702625, 'samples': 15460032, 'steps': 80520, 'loss/train': 1.5025920867919922} 11/07/2021 08:33:46 - INFO - __main__ - Step 80522: {'lr': 0.0002260450180385229, 'samples': 15460224, 'steps': 80521, 'loss/train': 1.319111704826355} 11/07/2021 08:33:47 - INFO - __main__ - Step 80523: {'lr': 0.000226039735717577, 'samples': 15460416, 'steps': 80522, 'loss/train': 1.0720123052597046} 11/07/2021 08:33:47 - INFO - __main__ - Step 80524: {'lr': 0.00022603445340742728, 'samples': 15460608, 'steps': 80523, 'loss/train': 1.2223602533340454} 11/07/2021 08:33:48 - INFO - __main__ - Step 80525: {'lr': 0.00022602917110807605, 'samples': 15460800, 'steps': 80524, 'loss/train': 1.0172762870788574} 11/07/2021 08:33:48 - INFO - __main__ - Step 80526: {'lr': 0.00022602388881952582, 'samples': 15460992, 'steps': 80525, 'loss/train': 1.0567365884780884} 11/07/2021 08:33:49 - INFO - __main__ - Step 80527: {'lr': 0.00022601860654177875, 'samples': 15461184, 'steps': 80526, 'loss/train': 1.2062358856201172} 11/07/2021 08:33:49 - INFO - __main__ - Step 80528: {'lr': 0.00022601332427483732, 'samples': 15461376, 'steps': 80527, 'loss/train': 1.2433240413665771} 11/07/2021 08:33:49 - INFO - __main__ - Step 80529: {'lr': 0.0002260080420187039, 'samples': 15461568, 'steps': 80528, 'loss/train': 1.2329833507537842} 11/07/2021 08:33:50 - INFO - __main__ - Step 80530: {'lr': 0.0002260027597733809, 'samples': 15461760, 'steps': 80529, 'loss/train': 0.9756776690483093} 11/07/2021 08:33:51 - INFO - __main__ - Step 80531: {'lr': 0.00022599747753887067, 'samples': 15461952, 'steps': 80530, 'loss/train': 1.2882192134857178} 11/07/2021 08:33:51 - INFO - __main__ - Step 80532: {'lr': 0.00022599219531517565, 'samples': 15462144, 'steps': 80531, 'loss/train': 1.4110585451126099} 11/07/2021 08:33:52 - INFO - __main__ - Step 80533: {'lr': 0.00022598691310229813, 'samples': 15462336, 'steps': 80532, 'loss/train': 1.5204352140426636} 11/07/2021 08:33:52 - INFO - __main__ - Step 80534: {'lr': 0.00022598163090024054, 'samples': 15462528, 'steps': 80533, 'loss/train': 1.6188116073608398} 11/07/2021 08:33:52 - INFO - __main__ - Step 80535: {'lr': 0.0002259763487090053, 'samples': 15462720, 'steps': 80534, 'loss/train': 1.888447642326355} 11/07/2021 08:33:53 - INFO - __main__ - Step 80536: {'lr': 0.0002259710665285947, 'samples': 15462912, 'steps': 80535, 'loss/train': 1.6160876750946045} 11/07/2021 08:33:54 - INFO - __main__ - Step 80537: {'lr': 0.00022596578435901118, 'samples': 15463104, 'steps': 80536, 'loss/train': 1.2052834033966064} 11/07/2021 08:33:54 - INFO - __main__ - Step 80538: {'lr': 0.00022596050220025714, 'samples': 15463296, 'steps': 80537, 'loss/train': 1.5233542919158936} 11/07/2021 08:33:54 - INFO - __main__ - Step 80539: {'lr': 0.00022595522005233498, 'samples': 15463488, 'steps': 80538, 'loss/train': 1.5565589666366577} 11/07/2021 08:33:55 - INFO - __main__ - Step 80540: {'lr': 0.00022594993791524696, 'samples': 15463680, 'steps': 80539, 'loss/train': 1.3424928188323975} 11/07/2021 08:33:56 - INFO - __main__ - Step 80541: {'lr': 0.0002259446557889955, 'samples': 15463872, 'steps': 80540, 'loss/train': 1.4751676321029663} 11/07/2021 08:33:56 - INFO - __main__ - Step 80542: {'lr': 0.00022593937367358302, 'samples': 15464064, 'steps': 80541, 'loss/train': 1.313802719116211} 11/07/2021 08:33:57 - INFO - __main__ - Step 80543: {'lr': 0.00022593409156901188, 'samples': 15464256, 'steps': 80542, 'loss/train': 1.0363740921020508} 11/07/2021 08:33:57 - INFO - __main__ - Step 80544: {'lr': 0.00022592880947528446, 'samples': 15464448, 'steps': 80543, 'loss/train': 1.3610750436782837} 11/07/2021 08:33:57 - INFO - __main__ - Step 80545: {'lr': 0.00022592352739240318, 'samples': 15464640, 'steps': 80544, 'loss/train': 1.4732662439346313} 11/07/2021 08:33:58 - INFO - __main__ - Step 80546: {'lr': 0.00022591824532037036, 'samples': 15464832, 'steps': 80545, 'loss/train': 1.4774832725524902} 11/07/2021 08:33:59 - INFO - __main__ - Step 80547: {'lr': 0.0002259129632591884, 'samples': 15465024, 'steps': 80546, 'loss/train': 1.1501625776290894} 11/07/2021 08:33:59 - INFO - __main__ - Step 80548: {'lr': 0.0002259076812088597, 'samples': 15465216, 'steps': 80547, 'loss/train': 1.0433588027954102} 11/07/2021 08:33:59 - INFO - __main__ - Step 80549: {'lr': 0.0002259023991693866, 'samples': 15465408, 'steps': 80548, 'loss/train': 1.361999750137329} 11/07/2021 08:34:00 - INFO - __main__ - Step 80550: {'lr': 0.00022589711714077158, 'samples': 15465600, 'steps': 80549, 'loss/train': 0.9724106192588806} 11/07/2021 08:34:00 - INFO - __main__ - Step 80551: {'lr': 0.0002258918351230169, 'samples': 15465792, 'steps': 80550, 'loss/train': 1.431313395500183} 11/07/2021 08:34:01 - INFO - __main__ - Step 80552: {'lr': 0.00022588655311612496, 'samples': 15465984, 'steps': 80551, 'loss/train': 0.3090928792953491} 11/07/2021 08:34:01 - INFO - __main__ - Step 80553: {'lr': 0.0002258812711200983, 'samples': 15466176, 'steps': 80552, 'loss/train': 1.4647575616836548} 11/07/2021 08:34:02 - INFO - __main__ - Step 80554: {'lr': 0.0002258759891349391, 'samples': 15466368, 'steps': 80553, 'loss/train': 1.3999080657958984} 11/07/2021 08:34:02 - INFO - __main__ - Step 80555: {'lr': 0.00022587070716064976, 'samples': 15466560, 'steps': 80554, 'loss/train': 1.9135547876358032} 11/07/2021 08:34:02 - INFO - __main__ - Step 80556: {'lr': 0.0002258654251972327, 'samples': 15466752, 'steps': 80555, 'loss/train': 0.9675297141075134} 11/07/2021 08:34:04 - INFO - __main__ - Step 80557: {'lr': 0.00022586014324469034, 'samples': 15466944, 'steps': 80556, 'loss/train': 1.5580227375030518} 11/07/2021 08:34:04 - INFO - __main__ - Step 80558: {'lr': 0.00022585486130302502, 'samples': 15467136, 'steps': 80557, 'loss/train': 1.6223194599151611} 11/07/2021 08:34:04 - INFO - __main__ - Step 80559: {'lr': 0.0002258495793722391, 'samples': 15467328, 'steps': 80558, 'loss/train': 1.811387538909912} 11/07/2021 08:34:05 - INFO - __main__ - Step 80560: {'lr': 0.000225844297452335, 'samples': 15467520, 'steps': 80559, 'loss/train': 0.9714920520782471} 11/07/2021 08:34:05 - INFO - __main__ - Step 80561: {'lr': 0.0002258390155433151, 'samples': 15467712, 'steps': 80560, 'loss/train': 1.245568037033081} 11/07/2021 08:34:06 - INFO - __main__ - Step 80562: {'lr': 0.00022583373364518176, 'samples': 15467904, 'steps': 80561, 'loss/train': 1.4076169729232788} 11/07/2021 08:34:06 - INFO - __main__ - Step 80563: {'lr': 0.00022582845175793734, 'samples': 15468096, 'steps': 80562, 'loss/train': 1.8017926216125488} 11/07/2021 08:34:07 - INFO - __main__ - Step 80564: {'lr': 0.00022582316988158427, 'samples': 15468288, 'steps': 80563, 'loss/train': 1.6317559480667114} 11/07/2021 08:34:07 - INFO - __main__ - Step 80565: {'lr': 0.00022581788801612492, 'samples': 15468480, 'steps': 80564, 'loss/train': 1.4500597715377808} 11/07/2021 08:34:07 - INFO - __main__ - Step 80566: {'lr': 0.00022581260616156177, 'samples': 15468672, 'steps': 80565, 'loss/train': 1.240522027015686} 11/07/2021 08:34:08 - INFO - __main__ - Step 80567: {'lr': 0.00022580732431789693, 'samples': 15468864, 'steps': 80566, 'loss/train': 0.8213973045349121} 11/07/2021 08:34:09 - INFO - __main__ - Step 80568: {'lr': 0.00022580204248513297, 'samples': 15469056, 'steps': 80567, 'loss/train': 1.4999028444290161} 11/07/2021 08:34:09 - INFO - __main__ - Step 80569: {'lr': 0.00022579676066327226, 'samples': 15469248, 'steps': 80568, 'loss/train': 1.3608683347702026} 11/07/2021 08:34:09 - INFO - __main__ - Step 80570: {'lr': 0.0002257914788523171, 'samples': 15469440, 'steps': 80569, 'loss/train': 1.305245041847229} 11/07/2021 08:34:10 - INFO - __main__ - Step 80571: {'lr': 0.00022578619705226996, 'samples': 15469632, 'steps': 80570, 'loss/train': 1.1667766571044922} 11/07/2021 08:34:11 - INFO - __main__ - Step 80572: {'lr': 0.00022578091526313318, 'samples': 15469824, 'steps': 80571, 'loss/train': 1.4439765214920044} 11/07/2021 08:34:11 - INFO - __main__ - Step 80573: {'lr': 0.00022577563348490914, 'samples': 15470016, 'steps': 80572, 'loss/train': 0.8957734107971191} 11/07/2021 08:34:11 - INFO - __main__ - Step 80574: {'lr': 0.00022577035171760025, 'samples': 15470208, 'steps': 80573, 'loss/train': 0.8497878909111023} 11/07/2021 08:34:12 - INFO - __main__ - Step 80575: {'lr': 0.00022576506996120884, 'samples': 15470400, 'steps': 80574, 'loss/train': 1.4345989227294922} 11/07/2021 08:34:12 - INFO - __main__ - Step 80576: {'lr': 0.00022575978821573733, 'samples': 15470592, 'steps': 80575, 'loss/train': 1.5704526901245117} 11/07/2021 08:34:13 - INFO - __main__ - Step 80577: {'lr': 0.0002257545064811881, 'samples': 15470784, 'steps': 80576, 'loss/train': 1.5417858362197876} 11/07/2021 08:34:14 - INFO - __main__ - Step 80578: {'lr': 0.0002257492247575635, 'samples': 15470976, 'steps': 80577, 'loss/train': 1.08879816532135} 11/07/2021 08:34:14 - INFO - __main__ - Step 80579: {'lr': 0.000225743943044866, 'samples': 15471168, 'steps': 80578, 'loss/train': 1.5425992012023926} 11/07/2021 08:34:14 - INFO - __main__ - Step 80580: {'lr': 0.00022573866134309784, 'samples': 15471360, 'steps': 80579, 'loss/train': 1.2751532793045044} 11/07/2021 08:34:15 - INFO - __main__ - Step 80581: {'lr': 0.00022573337965226144, 'samples': 15471552, 'steps': 80580, 'loss/train': 1.9915066957473755} 11/07/2021 08:34:16 - INFO - __main__ - Step 80582: {'lr': 0.00022572809797235922, 'samples': 15471744, 'steps': 80581, 'loss/train': 1.3670494556427002} 11/07/2021 08:34:16 - INFO - __main__ - Step 80583: {'lr': 0.00022572281630339354, 'samples': 15471936, 'steps': 80582, 'loss/train': 1.0503040552139282} 11/07/2021 08:34:17 - INFO - __main__ - Step 80584: {'lr': 0.00022571753464536675, 'samples': 15472128, 'steps': 80583, 'loss/train': 1.2765288352966309} 11/07/2021 08:34:17 - INFO - __main__ - Step 80585: {'lr': 0.00022571225299828132, 'samples': 15472320, 'steps': 80584, 'loss/train': 1.0832786560058594} 11/07/2021 08:34:17 - INFO - __main__ - Step 80586: {'lr': 0.00022570697136213956, 'samples': 15472512, 'steps': 80585, 'loss/train': 1.6188730001449585} 11/07/2021 08:34:18 - INFO - __main__ - Step 80587: {'lr': 0.00022570168973694386, 'samples': 15472704, 'steps': 80586, 'loss/train': 1.5356863737106323} 11/07/2021 08:34:19 - INFO - __main__ - Step 80588: {'lr': 0.00022569640812269658, 'samples': 15472896, 'steps': 80587, 'loss/train': 0.4931543469429016} 11/07/2021 08:34:19 - INFO - __main__ - Step 80589: {'lr': 0.00022569112651940016, 'samples': 15473088, 'steps': 80588, 'loss/train': 1.3764066696166992} 11/07/2021 08:34:19 - INFO - __main__ - Step 80590: {'lr': 0.00022568584492705691, 'samples': 15473280, 'steps': 80589, 'loss/train': 1.1642069816589355} 11/07/2021 08:34:20 - INFO - __main__ - Step 80591: {'lr': 0.0002256805633456693, 'samples': 15473472, 'steps': 80590, 'loss/train': 1.516658067703247} 11/07/2021 08:34:20 - INFO - __main__ - Step 80592: {'lr': 0.00022567528177523959, 'samples': 15473664, 'steps': 80591, 'loss/train': 1.568845272064209} 11/07/2021 08:34:21 - INFO - __main__ - Step 80593: {'lr': 0.00022567000021577033, 'samples': 15473856, 'steps': 80592, 'loss/train': 1.4256888628005981} 11/07/2021 08:34:22 - INFO - __main__ - Step 80594: {'lr': 0.0002256647186672637, 'samples': 15474048, 'steps': 80593, 'loss/train': 1.36960768699646} 11/07/2021 08:34:22 - INFO - __main__ - Step 80595: {'lr': 0.0002256594371297222, 'samples': 15474240, 'steps': 80594, 'loss/train': 1.1643837690353394} 11/07/2021 08:34:22 - INFO - __main__ - Step 80596: {'lr': 0.00022565415560314814, 'samples': 15474432, 'steps': 80595, 'loss/train': 1.1731911897659302} 11/07/2021 08:34:23 - INFO - __main__ - Step 80597: {'lr': 0.00022564887408754397, 'samples': 15474624, 'steps': 80596, 'loss/train': 1.6271408796310425} 11/07/2021 08:34:23 - INFO - __main__ - Step 80598: {'lr': 0.00022564359258291203, 'samples': 15474816, 'steps': 80597, 'loss/train': 4.287817001342773} 11/07/2021 08:34:24 - INFO - __main__ - Step 80599: {'lr': 0.00022563831108925474, 'samples': 15475008, 'steps': 80598, 'loss/train': 1.8216471672058105} 11/07/2021 08:34:24 - INFO - __main__ - Step 80600: {'lr': 0.00022563302960657442, 'samples': 15475200, 'steps': 80599, 'loss/train': 1.5126662254333496} 11/07/2021 08:34:25 - INFO - __main__ - Step 80601: {'lr': 0.00022562774813487347, 'samples': 15475392, 'steps': 80600, 'loss/train': 1.3541117906570435} 11/07/2021 08:34:25 - INFO - __main__ - Step 80602: {'lr': 0.00022562246667415432, 'samples': 15475584, 'steps': 80601, 'loss/train': 1.5962584018707275} 11/07/2021 08:34:25 - INFO - __main__ - Step 80603: {'lr': 0.00022561718522441928, 'samples': 15475776, 'steps': 80602, 'loss/train': 1.4248570203781128} 11/07/2021 08:34:26 - INFO - __main__ - Step 80604: {'lr': 0.00022561190378567075, 'samples': 15475968, 'steps': 80603, 'loss/train': 0.2176593393087387} 11/07/2021 08:34:27 - INFO - __main__ - Step 80605: {'lr': 0.0002256066223579112, 'samples': 15476160, 'steps': 80604, 'loss/train': 1.5267314910888672} 11/07/2021 08:34:27 - INFO - __main__ - Step 80606: {'lr': 0.00022560134094114294, 'samples': 15476352, 'steps': 80605, 'loss/train': 1.695860743522644} 11/07/2021 08:34:27 - INFO - __main__ - Step 80607: {'lr': 0.00022559605953536828, 'samples': 15476544, 'steps': 80606, 'loss/train': 1.6167504787445068} 11/07/2021 08:34:28 - INFO - __main__ - Step 80608: {'lr': 0.00022559077814058963, 'samples': 15476736, 'steps': 80607, 'loss/train': 1.0465153455734253} 11/07/2021 08:34:29 - INFO - __main__ - Step 80609: {'lr': 0.0002255854967568094, 'samples': 15476928, 'steps': 80608, 'loss/train': 1.3078842163085938} 11/07/2021 08:34:29 - INFO - __main__ - Step 80610: {'lr': 0.00022558021538403, 'samples': 15477120, 'steps': 80609, 'loss/train': 1.4558740854263306} 11/07/2021 08:34:30 - INFO - __main__ - Step 80611: {'lr': 0.00022557493402225375, 'samples': 15477312, 'steps': 80610, 'loss/train': 0.6679564714431763} 11/07/2021 08:34:30 - INFO - __main__ - Step 80612: {'lr': 0.00022556965267148308, 'samples': 15477504, 'steps': 80611, 'loss/train': 1.514877200126648} 11/07/2021 08:34:30 - INFO - __main__ - Step 80613: {'lr': 0.00022556437133172035, 'samples': 15477696, 'steps': 80612, 'loss/train': 1.8079675436019897} 11/07/2021 08:34:31 - INFO - __main__ - Step 80614: {'lr': 0.0002255590900029679, 'samples': 15477888, 'steps': 80613, 'loss/train': 1.0377817153930664} 11/07/2021 08:34:32 - INFO - __main__ - Step 80615: {'lr': 0.00022555380868522818, 'samples': 15478080, 'steps': 80614, 'loss/train': 1.444411039352417} 11/07/2021 08:34:32 - INFO - __main__ - Step 80616: {'lr': 0.00022554852737850355, 'samples': 15478272, 'steps': 80615, 'loss/train': 1.0468945503234863} 11/07/2021 08:34:32 - INFO - __main__ - Step 80617: {'lr': 0.0002255432460827964, 'samples': 15478464, 'steps': 80616, 'loss/train': 1.3267664909362793} 11/07/2021 08:34:33 - INFO - __main__ - Step 80618: {'lr': 0.00022553796479810902, 'samples': 15478656, 'steps': 80617, 'loss/train': 1.7767034769058228} 11/07/2021 08:34:33 - INFO - __main__ - Step 80619: {'lr': 0.00022553268352444385, 'samples': 15478848, 'steps': 80618, 'loss/train': 1.4471195936203003} 11/07/2021 08:34:34 - INFO - __main__ - Step 80620: {'lr': 0.00022552740226180337, 'samples': 15479040, 'steps': 80619, 'loss/train': 1.7566757202148438} 11/07/2021 08:34:34 - INFO - __main__ - Step 80621: {'lr': 0.00022552212101018982, 'samples': 15479232, 'steps': 80620, 'loss/train': 0.5465812087059021} 11/07/2021 08:34:35 - INFO - __main__ - Step 80622: {'lr': 0.00022551683976960557, 'samples': 15479424, 'steps': 80621, 'loss/train': 1.4596538543701172} 11/07/2021 08:34:35 - INFO - __main__ - Step 80623: {'lr': 0.0002255115585400531, 'samples': 15479616, 'steps': 80622, 'loss/train': 1.2943699359893799} 11/07/2021 08:34:36 - INFO - __main__ - Step 80624: {'lr': 0.00022550627732153473, 'samples': 15479808, 'steps': 80623, 'loss/train': 1.3106456995010376} 11/07/2021 08:34:37 - INFO - __main__ - Step 80625: {'lr': 0.00022550099611405285, 'samples': 15480000, 'steps': 80624, 'loss/train': 1.364869236946106} 11/07/2021 08:34:37 - INFO - __main__ - Step 80626: {'lr': 0.00022549571491760985, 'samples': 15480192, 'steps': 80625, 'loss/train': 1.6101000308990479} 11/07/2021 08:34:37 - INFO - __main__ - Step 80627: {'lr': 0.00022549043373220815, 'samples': 15480384, 'steps': 80626, 'loss/train': 1.3071379661560059} 11/07/2021 08:34:38 - INFO - __main__ - Step 80628: {'lr': 0.00022548515255785002, 'samples': 15480576, 'steps': 80627, 'loss/train': 1.105211615562439} 11/07/2021 08:34:38 - INFO - __main__ - Step 80629: {'lr': 0.0002254798713945379, 'samples': 15480768, 'steps': 80628, 'loss/train': 1.7412773370742798} 11/07/2021 08:34:39 - INFO - __main__ - Step 80630: {'lr': 0.0002254745902422742, 'samples': 15480960, 'steps': 80629, 'loss/train': 1.4480619430541992} 11/07/2021 08:34:39 - INFO - __main__ - Step 80631: {'lr': 0.00022546930910106127, 'samples': 15481152, 'steps': 80630, 'loss/train': 1.6834145784378052} 11/07/2021 08:34:40 - INFO - __main__ - Step 80632: {'lr': 0.00022546402797090146, 'samples': 15481344, 'steps': 80631, 'loss/train': 0.5942888855934143} 11/07/2021 08:34:40 - INFO - __main__ - Step 80633: {'lr': 0.00022545874685179723, 'samples': 15481536, 'steps': 80632, 'loss/train': 3.1098971366882324} 11/07/2021 08:34:40 - INFO - __main__ - Step 80634: {'lr': 0.00022545346574375088, 'samples': 15481728, 'steps': 80633, 'loss/train': 1.521401286125183} 11/07/2021 08:34:42 - INFO - __main__ - Step 80635: {'lr': 0.00022544818464676484, 'samples': 15481920, 'steps': 80634, 'loss/train': 0.6122680306434631} 11/07/2021 08:34:42 - INFO - __main__ - Step 80636: {'lr': 0.00022544290356084142, 'samples': 15482112, 'steps': 80635, 'loss/train': 1.138116478919983} 11/07/2021 08:34:42 - INFO - __main__ - Step 80637: {'lr': 0.00022543762248598316, 'samples': 15482304, 'steps': 80636, 'loss/train': 1.5049232244491577} 11/07/2021 08:34:43 - INFO - __main__ - Step 80638: {'lr': 0.00022543234142219221, 'samples': 15482496, 'steps': 80637, 'loss/train': 1.8194828033447266} 11/07/2021 08:34:43 - INFO - __main__ - Step 80639: {'lr': 0.0002254270603694711, 'samples': 15482688, 'steps': 80638, 'loss/train': 1.3661718368530273} 11/07/2021 08:34:43 - INFO - __main__ - Step 80640: {'lr': 0.00022542177932782217, 'samples': 15482880, 'steps': 80639, 'loss/train': 1.0430372953414917} 11/07/2021 08:34:44 - INFO - __main__ - Step 80641: {'lr': 0.00022541649829724782, 'samples': 15483072, 'steps': 80640, 'loss/train': 1.3946927785873413} 11/07/2021 08:34:45 - INFO - __main__ - Step 80642: {'lr': 0.00022541121727775044, 'samples': 15483264, 'steps': 80641, 'loss/train': 1.245740294456482} 11/07/2021 08:34:45 - INFO - __main__ - Step 80643: {'lr': 0.00022540593626933233, 'samples': 15483456, 'steps': 80642, 'loss/train': 1.6919032335281372} 11/07/2021 08:34:45 - INFO - __main__ - Step 80644: {'lr': 0.00022540065527199596, 'samples': 15483648, 'steps': 80643, 'loss/train': 1.677855134010315} 11/07/2021 08:34:46 - INFO - __main__ - Step 80645: {'lr': 0.00022539537428574365, 'samples': 15483840, 'steps': 80644, 'loss/train': 1.3844760656356812} 11/07/2021 08:34:47 - INFO - __main__ - Step 80646: {'lr': 0.00022539009331057783, 'samples': 15484032, 'steps': 80645, 'loss/train': 1.2552680969238281} 11/07/2021 08:34:47 - INFO - __main__ - Step 80647: {'lr': 0.0002253848123465009, 'samples': 15484224, 'steps': 80646, 'loss/train': 1.5461146831512451} 11/07/2021 08:34:47 - INFO - __main__ - Step 80648: {'lr': 0.00022537953139351518, 'samples': 15484416, 'steps': 80647, 'loss/train': 1.4503343105316162} 11/07/2021 08:34:48 - INFO - __main__ - Step 80649: {'lr': 0.00022537425045162304, 'samples': 15484608, 'steps': 80648, 'loss/train': 1.6235275268554688} 11/07/2021 08:34:48 - INFO - __main__ - Step 80650: {'lr': 0.00022536896952082686, 'samples': 15484800, 'steps': 80649, 'loss/train': 1.4109872579574585} 11/07/2021 08:34:49 - INFO - __main__ - Step 80651: {'lr': 0.00022536368860112904, 'samples': 15484992, 'steps': 80650, 'loss/train': 1.4273738861083984} 11/07/2021 08:34:49 - INFO - __main__ - Step 80652: {'lr': 0.000225358407692532, 'samples': 15485184, 'steps': 80651, 'loss/train': 1.529388189315796} 11/07/2021 08:34:50 - INFO - __main__ - Step 80653: {'lr': 0.00022535312679503803, 'samples': 15485376, 'steps': 80652, 'loss/train': 0.474989116191864} 11/07/2021 08:34:50 - INFO - __main__ - Step 80654: {'lr': 0.0002253478459086496, 'samples': 15485568, 'steps': 80653, 'loss/train': 1.1909754276275635} 11/07/2021 08:34:50 - INFO - __main__ - Step 80655: {'lr': 0.00022534256503336904, 'samples': 15485760, 'steps': 80654, 'loss/train': 1.2758711576461792} 11/07/2021 08:34:51 - INFO - __main__ - Step 80656: {'lr': 0.0002253372841691987, 'samples': 15485952, 'steps': 80655, 'loss/train': 1.243231177330017} 11/07/2021 08:34:52 - INFO - __main__ - Step 80657: {'lr': 0.00022533200331614103, 'samples': 15486144, 'steps': 80656, 'loss/train': 0.4772784411907196} 11/07/2021 08:34:52 - INFO - __main__ - Step 80658: {'lr': 0.0002253267224741984, 'samples': 15486336, 'steps': 80657, 'loss/train': 1.6453700065612793} 11/07/2021 08:34:52 - INFO - __main__ - Step 80659: {'lr': 0.00022532144164337314, 'samples': 15486528, 'steps': 80658, 'loss/train': 1.0846344232559204} 11/07/2021 08:34:53 - INFO - __main__ - Step 80660: {'lr': 0.00022531616082366776, 'samples': 15486720, 'steps': 80659, 'loss/train': 0.5688796639442444} 11/07/2021 08:34:54 - INFO - __main__ - Step 80661: {'lr': 0.00022531088001508445, 'samples': 15486912, 'steps': 80660, 'loss/train': 1.3255258798599243} 11/07/2021 08:34:54 - INFO - __main__ - Step 80662: {'lr': 0.00022530559921762566, 'samples': 15487104, 'steps': 80661, 'loss/train': 1.8000710010528564} 11/07/2021 08:34:55 - INFO - __main__ - Step 80663: {'lr': 0.0002253003184312938, 'samples': 15487296, 'steps': 80662, 'loss/train': 0.9938427209854126} 11/07/2021 08:34:55 - INFO - __main__ - Step 80664: {'lr': 0.00022529503765609125, 'samples': 15487488, 'steps': 80663, 'loss/train': 1.4292786121368408} 11/07/2021 08:34:55 - INFO - __main__ - Step 80665: {'lr': 0.00022528975689202032, 'samples': 15487680, 'steps': 80664, 'loss/train': 1.4757312536239624} 11/07/2021 08:34:56 - INFO - __main__ - Step 80666: {'lr': 0.0002252844761390835, 'samples': 15487872, 'steps': 80665, 'loss/train': 1.2221074104309082} 11/07/2021 08:34:57 - INFO - __main__ - Step 80667: {'lr': 0.0002252791953972831, 'samples': 15488064, 'steps': 80666, 'loss/train': 1.23210871219635} 11/07/2021 08:34:57 - INFO - __main__ - Step 80668: {'lr': 0.0002252739146666215, 'samples': 15488256, 'steps': 80667, 'loss/train': 1.4095131158828735} 11/07/2021 08:34:57 - INFO - __main__ - Step 80669: {'lr': 0.0002252686339471011, 'samples': 15488448, 'steps': 80668, 'loss/train': 1.257148265838623} 11/07/2021 08:34:58 - INFO - __main__ - Step 80670: {'lr': 0.00022526335323872426, 'samples': 15488640, 'steps': 80669, 'loss/train': 1.273970603942871} 11/07/2021 08:34:59 - INFO - __main__ - Step 80671: {'lr': 0.0002252580725414934, 'samples': 15488832, 'steps': 80670, 'loss/train': 1.4994465112686157} 11/07/2021 08:34:59 - INFO - __main__ - Step 80672: {'lr': 0.00022525279185541084, 'samples': 15489024, 'steps': 80671, 'loss/train': 1.4226590394973755} 11/07/2021 08:34:59 - INFO - __main__ - Step 80673: {'lr': 0.000225247511180479, 'samples': 15489216, 'steps': 80672, 'loss/train': 1.2913951873779297} 11/07/2021 08:35:00 - INFO - __main__ - Step 80674: {'lr': 0.00022524223051670038, 'samples': 15489408, 'steps': 80673, 'loss/train': 3.7020773887634277} 11/07/2021 08:35:00 - INFO - __main__ - Step 80675: {'lr': 0.0002252369498640771, 'samples': 15489600, 'steps': 80674, 'loss/train': 1.514850378036499} 11/07/2021 08:35:01 - INFO - __main__ - Step 80676: {'lr': 0.00022523166922261165, 'samples': 15489792, 'steps': 80675, 'loss/train': 1.4839847087860107} 11/07/2021 08:35:02 - INFO - __main__ - Step 80677: {'lr': 0.00022522638859230645, 'samples': 15489984, 'steps': 80676, 'loss/train': 1.6459780931472778} 11/07/2021 08:35:02 - INFO - __main__ - Step 80678: {'lr': 0.00022522110797316386, 'samples': 15490176, 'steps': 80677, 'loss/train': 1.6953870058059692} 11/07/2021 08:35:02 - INFO - __main__ - Step 80679: {'lr': 0.00022521582736518625, 'samples': 15490368, 'steps': 80678, 'loss/train': 1.157078742980957} 11/07/2021 08:35:03 - INFO - __main__ - Step 80680: {'lr': 0.00022521054676837598, 'samples': 15490560, 'steps': 80679, 'loss/train': 1.4726091623306274} 11/07/2021 08:35:03 - INFO - __main__ - Step 80681: {'lr': 0.00022520526618273552, 'samples': 15490752, 'steps': 80680, 'loss/train': 0.7515679597854614} 11/07/2021 08:35:04 - INFO - __main__ - Step 80682: {'lr': 0.00022519998560826713, 'samples': 15490944, 'steps': 80681, 'loss/train': 1.5267738103866577} 11/07/2021 08:35:04 - INFO - __main__ - Step 80683: {'lr': 0.00022519470504497324, 'samples': 15491136, 'steps': 80682, 'loss/train': 1.1751161813735962} 11/07/2021 08:35:05 - INFO - __main__ - Step 80684: {'lr': 0.00022518942449285627, 'samples': 15491328, 'steps': 80683, 'loss/train': 1.063218593597412} 11/07/2021 08:35:05 - INFO - __main__ - Step 80685: {'lr': 0.00022518414395191855, 'samples': 15491520, 'steps': 80684, 'loss/train': 1.4702832698822021} 11/07/2021 08:35:05 - INFO - __main__ - Step 80686: {'lr': 0.00022517886342216247, 'samples': 15491712, 'steps': 80685, 'loss/train': 1.9066805839538574} 11/07/2021 08:35:06 - INFO - __main__ - Step 80687: {'lr': 0.0002251735829035905, 'samples': 15491904, 'steps': 80686, 'loss/train': 1.3418264389038086} 11/07/2021 08:35:07 - INFO - __main__ - Step 80688: {'lr': 0.00022516830239620485, 'samples': 15492096, 'steps': 80687, 'loss/train': 0.9247857332229614} 11/07/2021 08:35:07 - INFO - __main__ - Step 80689: {'lr': 0.00022516302190000794, 'samples': 15492288, 'steps': 80688, 'loss/train': 1.3183207511901855} 11/07/2021 08:35:08 - INFO - __main__ - Step 80690: {'lr': 0.00022515774141500223, 'samples': 15492480, 'steps': 80689, 'loss/train': 0.9570081830024719} 11/07/2021 08:35:08 - INFO - __main__ - Step 80691: {'lr': 0.00022515246094119006, 'samples': 15492672, 'steps': 80690, 'loss/train': 1.2567578554153442} 11/07/2021 08:35:09 - INFO - __main__ - Step 80692: {'lr': 0.0002251471804785738, 'samples': 15492864, 'steps': 80691, 'loss/train': 1.5071887969970703} 11/07/2021 08:35:09 - INFO - __main__ - Step 80693: {'lr': 0.00022514190002715582, 'samples': 15493056, 'steps': 80692, 'loss/train': 1.786765217781067} 11/07/2021 08:35:10 - INFO - __main__ - Step 80694: {'lr': 0.00022513661958693853, 'samples': 15493248, 'steps': 80693, 'loss/train': 1.2894681692123413} 11/07/2021 08:35:10 - INFO - __main__ - Step 80695: {'lr': 0.00022513133915792426, 'samples': 15493440, 'steps': 80694, 'loss/train': 1.2800477743148804} 11/07/2021 08:35:10 - INFO - __main__ - Step 80696: {'lr': 0.0002251260587401155, 'samples': 15493632, 'steps': 80695, 'loss/train': 1.0910143852233887} 11/07/2021 08:35:11 - INFO - __main__ - Step 80697: {'lr': 0.0002251207783335145, 'samples': 15493824, 'steps': 80696, 'loss/train': 1.2823392152786255} 11/07/2021 08:35:12 - INFO - __main__ - Step 80698: {'lr': 0.0002251154979381237, 'samples': 15494016, 'steps': 80697, 'loss/train': 1.4597963094711304} 11/07/2021 08:35:12 - INFO - __main__ - Step 80699: {'lr': 0.00022511021755394547, 'samples': 15494208, 'steps': 80698, 'loss/train': 2.6117894649505615} 11/07/2021 08:35:12 - INFO - __main__ - Step 80700: {'lr': 0.0002251049371809823, 'samples': 15494400, 'steps': 80699, 'loss/train': 1.16843581199646} 11/07/2021 08:35:13 - INFO - __main__ - Step 80701: {'lr': 0.00022509965681923635, 'samples': 15494592, 'steps': 80700, 'loss/train': 1.2788619995117188} 11/07/2021 08:35:14 - INFO - __main__ - Step 80702: {'lr': 0.00022509437646871014, 'samples': 15494784, 'steps': 80701, 'loss/train': 1.4121482372283936} 11/07/2021 08:35:14 - INFO - __main__ - Step 80703: {'lr': 0.00022508909612940602, 'samples': 15494976, 'steps': 80702, 'loss/train': 1.078048825263977} 11/07/2021 08:35:14 - INFO - __main__ - Step 80704: {'lr': 0.00022508381580132634, 'samples': 15495168, 'steps': 80703, 'loss/train': 0.9678399562835693} 11/07/2021 08:35:15 - INFO - __main__ - Step 80705: {'lr': 0.0002250785354844735, 'samples': 15495360, 'steps': 80704, 'loss/train': 0.9539954662322998} 11/07/2021 08:35:15 - INFO - __main__ - Step 80706: {'lr': 0.00022507325517884992, 'samples': 15495552, 'steps': 80705, 'loss/train': 1.4801852703094482} 11/07/2021 08:35:15 - INFO - __main__ - Step 80707: {'lr': 0.0002250679748844579, 'samples': 15495744, 'steps': 80706, 'loss/train': 1.5358397960662842} 11/07/2021 08:35:16 - INFO - __main__ - Step 80708: {'lr': 0.00022506269460129992, 'samples': 15495936, 'steps': 80707, 'loss/train': 2.1817214488983154} 11/07/2021 08:35:17 - INFO - __main__ - Step 80709: {'lr': 0.00022505741432937826, 'samples': 15496128, 'steps': 80708, 'loss/train': 1.6048455238342285} 11/07/2021 08:35:17 - INFO - __main__ - Step 80710: {'lr': 0.00022505213406869538, 'samples': 15496320, 'steps': 80709, 'loss/train': 1.1593743562698364} 11/07/2021 08:35:17 - INFO - __main__ - Step 80711: {'lr': 0.0002250468538192536, 'samples': 15496512, 'steps': 80710, 'loss/train': 2.047420024871826} 11/07/2021 08:35:18 - INFO - __main__ - Step 80712: {'lr': 0.00022504157358105534, 'samples': 15496704, 'steps': 80711, 'loss/train': 1.3369947671890259} 11/07/2021 08:35:19 - INFO - __main__ - Step 80713: {'lr': 0.00022503629335410294, 'samples': 15496896, 'steps': 80712, 'loss/train': 1.1815054416656494} 11/07/2021 08:35:19 - INFO - __main__ - Step 80714: {'lr': 0.00022503101313839895, 'samples': 15497088, 'steps': 80713, 'loss/train': 0.9088521599769592} 11/07/2021 08:35:20 - INFO - __main__ - Step 80715: {'lr': 0.00022502573293394545, 'samples': 15497280, 'steps': 80714, 'loss/train': 1.023290753364563} 11/07/2021 08:35:20 - INFO - __main__ - Step 80716: {'lr': 0.00022502045274074497, 'samples': 15497472, 'steps': 80715, 'loss/train': 1.185556173324585} 11/07/2021 08:35:20 - INFO - __main__ - Step 80717: {'lr': 0.00022501517255879992, 'samples': 15497664, 'steps': 80716, 'loss/train': 1.5024391412734985} 11/07/2021 08:35:22 - INFO - __main__ - Step 80718: {'lr': 0.00022500989238811262, 'samples': 15497856, 'steps': 80717, 'loss/train': 1.5557371377944946} 11/07/2021 08:35:22 - INFO - __main__ - Step 80719: {'lr': 0.00022500461222868548, 'samples': 15498048, 'steps': 80718, 'loss/train': 0.5873566269874573} 11/07/2021 08:35:22 - INFO - __main__ - Step 80720: {'lr': 0.00022499933208052088, 'samples': 15498240, 'steps': 80719, 'loss/train': 1.3107901811599731} 11/07/2021 08:35:23 - INFO - __main__ - Step 80721: {'lr': 0.0002249940519436212, 'samples': 15498432, 'steps': 80720, 'loss/train': 0.27388232946395874} 11/07/2021 08:35:23 - INFO - __main__ - Step 80722: {'lr': 0.00022498877181798883, 'samples': 15498624, 'steps': 80721, 'loss/train': 1.3682502508163452} 11/07/2021 08:35:24 - INFO - __main__ - Step 80723: {'lr': 0.0002249834917036261, 'samples': 15498816, 'steps': 80722, 'loss/train': 1.4883555173873901} 11/07/2021 08:35:24 - INFO - __main__ - Step 80724: {'lr': 0.00022497821160053543, 'samples': 15499008, 'steps': 80723, 'loss/train': 1.4622938632965088} 11/07/2021 08:35:25 - INFO - __main__ - Step 80725: {'lr': 0.0002249729315087192, 'samples': 15499200, 'steps': 80724, 'loss/train': 1.145703911781311} 11/07/2021 08:35:25 - INFO - __main__ - Step 80726: {'lr': 0.0002249676514281798, 'samples': 15499392, 'steps': 80725, 'loss/train': 1.284766674041748} 11/07/2021 08:35:25 - INFO - __main__ - Step 80727: {'lr': 0.0002249623713589197, 'samples': 15499584, 'steps': 80726, 'loss/train': 1.0197991132736206} 11/07/2021 08:35:26 - INFO - __main__ - Step 80728: {'lr': 0.00022495709130094103, 'samples': 15499776, 'steps': 80727, 'loss/train': 1.3220434188842773} 11/07/2021 08:35:27 - INFO - __main__ - Step 80729: {'lr': 0.00022495181125424632, 'samples': 15499968, 'steps': 80728, 'loss/train': 1.7520462274551392} 11/07/2021 08:35:27 - INFO - __main__ - Step 80730: {'lr': 0.00022494653121883792, 'samples': 15500160, 'steps': 80729, 'loss/train': 1.3024741411209106} 11/07/2021 08:35:27 - INFO - __main__ - Step 80731: {'lr': 0.00022494125119471825, 'samples': 15500352, 'steps': 80730, 'loss/train': 1.6078293323516846} 11/07/2021 08:35:28 - INFO - __main__ - Step 80732: {'lr': 0.00022493597118188966, 'samples': 15500544, 'steps': 80731, 'loss/train': 1.4483540058135986} 11/07/2021 08:35:29 - INFO - __main__ - Step 80733: {'lr': 0.00022493069118035452, 'samples': 15500736, 'steps': 80732, 'loss/train': 1.5234005451202393} 11/07/2021 08:35:29 - INFO - __main__ - Step 80734: {'lr': 0.00022492541119011525, 'samples': 15500928, 'steps': 80733, 'loss/train': 1.311050295829773} 11/07/2021 08:35:29 - INFO - __main__ - Step 80735: {'lr': 0.00022492013121117418, 'samples': 15501120, 'steps': 80734, 'loss/train': 1.2862311601638794} 11/07/2021 08:35:30 - INFO - __main__ - Step 80736: {'lr': 0.00022491485124353372, 'samples': 15501312, 'steps': 80735, 'loss/train': 1.586753249168396} 11/07/2021 08:35:30 - INFO - __main__ - Step 80737: {'lr': 0.00022490957128719626, 'samples': 15501504, 'steps': 80736, 'loss/train': 1.71043860912323} 11/07/2021 08:35:30 - INFO - __main__ - Step 80738: {'lr': 0.00022490429134216415, 'samples': 15501696, 'steps': 80737, 'loss/train': 1.5472383499145508} 11/07/2021 08:35:32 - INFO - __main__ - Step 80739: {'lr': 0.00022489901140843982, 'samples': 15501888, 'steps': 80738, 'loss/train': 2.1787772178649902} 11/07/2021 08:35:32 - INFO - __main__ - Step 80740: {'lr': 0.00022489373148602555, 'samples': 15502080, 'steps': 80739, 'loss/train': 1.3872920274734497} 11/07/2021 08:35:32 - INFO - __main__ - Step 80741: {'lr': 0.00022488845157492385, 'samples': 15502272, 'steps': 80740, 'loss/train': 1.2981818914413452} 11/07/2021 08:35:33 - INFO - __main__ - Step 80742: {'lr': 0.00022488317167513694, 'samples': 15502464, 'steps': 80741, 'loss/train': 0.47459864616394043} 11/07/2021 08:35:33 - INFO - __main__ - Step 80743: {'lr': 0.0002248778917866673, 'samples': 15502656, 'steps': 80742, 'loss/train': 1.5476396083831787} 11/07/2021 08:35:34 - INFO - __main__ - Step 80744: {'lr': 0.00022487261190951732, 'samples': 15502848, 'steps': 80743, 'loss/train': 1.1796338558197021} 11/07/2021 08:35:34 - INFO - __main__ - Step 80745: {'lr': 0.00022486733204368932, 'samples': 15503040, 'steps': 80744, 'loss/train': 1.441382884979248} 11/07/2021 08:35:35 - INFO - __main__ - Step 80746: {'lr': 0.00022486205218918574, 'samples': 15503232, 'steps': 80745, 'loss/train': 1.3278940916061401} 11/07/2021 08:35:35 - INFO - __main__ - Step 80747: {'lr': 0.00022485677234600893, 'samples': 15503424, 'steps': 80746, 'loss/train': 0.9227728843688965} 11/07/2021 08:35:35 - INFO - __main__ - Step 80748: {'lr': 0.00022485149251416127, 'samples': 15503616, 'steps': 80747, 'loss/train': 1.5252063274383545} 11/07/2021 08:35:37 - INFO - __main__ - Step 80749: {'lr': 0.00022484621269364512, 'samples': 15503808, 'steps': 80748, 'loss/train': 1.7832531929016113} 11/07/2021 08:35:37 - INFO - __main__ - Step 80750: {'lr': 0.00022484093288446295, 'samples': 15504000, 'steps': 80749, 'loss/train': 1.6874979734420776} 11/07/2021 08:35:37 - INFO - __main__ - Step 80751: {'lr': 0.00022483565308661698, 'samples': 15504192, 'steps': 80750, 'loss/train': 1.3009285926818848} 11/07/2021 08:35:38 - INFO - __main__ - Step 80752: {'lr': 0.0002248303733001097, 'samples': 15504384, 'steps': 80751, 'loss/train': 1.5967967510223389} 11/07/2021 08:35:38 - INFO - __main__ - Step 80753: {'lr': 0.0002248250935249435, 'samples': 15504576, 'steps': 80752, 'loss/train': 1.4899035692214966} 11/07/2021 08:35:39 - INFO - __main__ - Step 80754: {'lr': 0.00022481981376112073, 'samples': 15504768, 'steps': 80753, 'loss/train': 0.4356486201286316} 11/07/2021 08:35:39 - INFO - __main__ - Step 80755: {'lr': 0.00022481453400864372, 'samples': 15504960, 'steps': 80754, 'loss/train': 1.2370890378952026} 11/07/2021 08:35:40 - INFO - __main__ - Step 80756: {'lr': 0.0002248092542675149, 'samples': 15505152, 'steps': 80755, 'loss/train': 1.577149748802185} 11/07/2021 08:35:40 - INFO - __main__ - Step 80757: {'lr': 0.00022480397453773662, 'samples': 15505344, 'steps': 80756, 'loss/train': 1.3694096803665161} 11/07/2021 08:35:41 - INFO - __main__ - Step 80758: {'lr': 0.0002247986948193113, 'samples': 15505536, 'steps': 80757, 'loss/train': 1.2822121381759644} 11/07/2021 08:35:42 - INFO - __main__ - Step 80759: {'lr': 0.0002247934151122413, 'samples': 15505728, 'steps': 80758, 'loss/train': 1.610863447189331} 11/07/2021 08:35:42 - INFO - __main__ - Step 80760: {'lr': 0.000224788135416529, 'samples': 15505920, 'steps': 80759, 'loss/train': 1.837355136871338} 11/07/2021 08:35:42 - INFO - __main__ - Step 80761: {'lr': 0.00022478285573217683, 'samples': 15506112, 'steps': 80760, 'loss/train': 1.6488137245178223} 11/07/2021 08:35:43 - INFO - __main__ - Step 80762: {'lr': 0.00022477757605918707, 'samples': 15506304, 'steps': 80761, 'loss/train': 1.151611089706421} 11/07/2021 08:35:43 - INFO - __main__ - Step 80763: {'lr': 0.00022477229639756213, 'samples': 15506496, 'steps': 80762, 'loss/train': 1.2302714586257935} 11/07/2021 08:35:43 - INFO - __main__ - Step 80764: {'lr': 0.0002247670167473044, 'samples': 15506688, 'steps': 80763, 'loss/train': 1.3344621658325195} 11/07/2021 08:35:44 - INFO - __main__ - Step 80765: {'lr': 0.00022476173710841627, 'samples': 15506880, 'steps': 80764, 'loss/train': 1.5007318258285522} 11/07/2021 08:35:45 - INFO - __main__ - Step 80766: {'lr': 0.00022475645748090011, 'samples': 15507072, 'steps': 80765, 'loss/train': 1.218327283859253} 11/07/2021 08:35:45 - INFO - __main__ - Step 80767: {'lr': 0.0002247511778647583, 'samples': 15507264, 'steps': 80766, 'loss/train': 1.2718863487243652} 11/07/2021 08:35:45 - INFO - __main__ - Step 80768: {'lr': 0.0002247458982599933, 'samples': 15507456, 'steps': 80767, 'loss/train': 1.2654016017913818} 11/07/2021 08:35:46 - INFO - __main__ - Step 80769: {'lr': 0.00022474061866660733, 'samples': 15507648, 'steps': 80768, 'loss/train': 1.4134010076522827} 11/07/2021 08:35:47 - INFO - __main__ - Step 80770: {'lr': 0.00022473533908460284, 'samples': 15507840, 'steps': 80769, 'loss/train': 1.0511232614517212} 11/07/2021 08:35:47 - INFO - __main__ - Step 80771: {'lr': 0.00022473005951398223, 'samples': 15508032, 'steps': 80770, 'loss/train': 1.4783809185028076} 11/07/2021 08:35:48 - INFO - __main__ - Step 80772: {'lr': 0.0002247247799547479, 'samples': 15508224, 'steps': 80771, 'loss/train': 1.031894326210022} 11/07/2021 08:35:48 - INFO - __main__ - Step 80773: {'lr': 0.00022471950040690218, 'samples': 15508416, 'steps': 80772, 'loss/train': 1.2182190418243408} 11/07/2021 08:35:48 - INFO - __main__ - Step 80774: {'lr': 0.0002247142208704474, 'samples': 15508608, 'steps': 80773, 'loss/train': 1.3846535682678223} 11/07/2021 08:35:49 - INFO - __main__ - Step 80775: {'lr': 0.00022470894134538606, 'samples': 15508800, 'steps': 80774, 'loss/train': 1.4729069471359253} 11/07/2021 08:35:50 - INFO - __main__ - Step 80776: {'lr': 0.00022470366183172048, 'samples': 15508992, 'steps': 80775, 'loss/train': 1.3715780973434448} 11/07/2021 08:35:50 - INFO - __main__ - Step 80777: {'lr': 0.000224698382329453, 'samples': 15509184, 'steps': 80776, 'loss/train': 1.432997703552246} 11/07/2021 08:35:50 - INFO - __main__ - Step 80778: {'lr': 0.00022469310283858607, 'samples': 15509376, 'steps': 80777, 'loss/train': 1.5801724195480347} 11/07/2021 08:35:51 - INFO - __main__ - Step 80779: {'lr': 0.000224687823359122, 'samples': 15509568, 'steps': 80778, 'loss/train': 1.3477342128753662} 11/07/2021 08:35:52 - INFO - __main__ - Step 80780: {'lr': 0.00022468254389106324, 'samples': 15509760, 'steps': 80779, 'loss/train': 1.262682318687439} 11/07/2021 08:35:52 - INFO - __main__ - Step 80781: {'lr': 0.0002246772644344122, 'samples': 15509952, 'steps': 80780, 'loss/train': 1.487728238105774} 11/07/2021 08:35:53 - INFO - __main__ - Step 80782: {'lr': 0.0002246719849891711, 'samples': 15510144, 'steps': 80781, 'loss/train': 1.4227968454360962} 11/07/2021 08:35:53 - INFO - __main__ - Step 80783: {'lr': 0.0002246667055553425, 'samples': 15510336, 'steps': 80782, 'loss/train': 1.8113983869552612} 11/07/2021 08:35:53 - INFO - __main__ - Step 80784: {'lr': 0.0002246614261329286, 'samples': 15510528, 'steps': 80783, 'loss/train': 1.2361042499542236} 11/07/2021 08:35:54 - INFO - __main__ - Step 80785: {'lr': 0.0002246561467219319, 'samples': 15510720, 'steps': 80784, 'loss/train': 1.5145856142044067} 11/07/2021 08:35:55 - INFO - __main__ - Step 80786: {'lr': 0.00022465086732235476, 'samples': 15510912, 'steps': 80785, 'loss/train': 1.1027177572250366} 11/07/2021 08:35:55 - INFO - __main__ - Step 80787: {'lr': 0.00022464558793419952, 'samples': 15511104, 'steps': 80786, 'loss/train': 1.36738920211792} 11/07/2021 08:35:55 - INFO - __main__ - Step 80788: {'lr': 0.0002246403085574686, 'samples': 15511296, 'steps': 80787, 'loss/train': 1.38271963596344} 11/07/2021 08:35:56 - INFO - __main__ - Step 80789: {'lr': 0.00022463502919216439, 'samples': 15511488, 'steps': 80788, 'loss/train': 1.2434253692626953} 11/07/2021 08:35:56 - INFO - __main__ - Step 80790: {'lr': 0.0002246297498382892, 'samples': 15511680, 'steps': 80789, 'loss/train': 1.5089260339736938} 11/07/2021 08:35:57 - INFO - __main__ - Step 80791: {'lr': 0.00022462447049584547, 'samples': 15511872, 'steps': 80790, 'loss/train': 1.189721941947937} 11/07/2021 08:35:57 - INFO - __main__ - Step 80792: {'lr': 0.0002246191911648356, 'samples': 15512064, 'steps': 80791, 'loss/train': 1.3435364961624146} 11/07/2021 08:35:58 - INFO - __main__ - Step 80793: {'lr': 0.00022461391184526187, 'samples': 15512256, 'steps': 80792, 'loss/train': 1.03127920627594} 11/07/2021 08:35:58 - INFO - __main__ - Step 80794: {'lr': 0.00022460863253712674, 'samples': 15512448, 'steps': 80793, 'loss/train': 1.1294794082641602} 11/07/2021 08:35:58 - INFO - __main__ - Step 80795: {'lr': 0.00022460335324043268, 'samples': 15512640, 'steps': 80794, 'loss/train': 1.5461589097976685} 11/07/2021 08:35:59 - INFO - __main__ - Step 80796: {'lr': 0.00022459807395518186, 'samples': 15512832, 'steps': 80795, 'loss/train': 1.5579999685287476} 11/07/2021 08:36:00 - INFO - __main__ - Step 80797: {'lr': 0.00022459279468137674, 'samples': 15513024, 'steps': 80796, 'loss/train': 1.823824167251587} 11/07/2021 08:36:00 - INFO - __main__ - Step 80798: {'lr': 0.00022458751541901972, 'samples': 15513216, 'steps': 80797, 'loss/train': 1.6980633735656738} 11/07/2021 08:36:01 - INFO - __main__ - Step 80799: {'lr': 0.0002245822361681132, 'samples': 15513408, 'steps': 80798, 'loss/train': 1.3783795833587646} 11/07/2021 08:36:01 - INFO - __main__ - Step 80800: {'lr': 0.00022457695692865948, 'samples': 15513600, 'steps': 80799, 'loss/train': 1.4891538619995117} 11/07/2021 08:36:02 - INFO - __main__ - Step 80801: {'lr': 0.00022457167770066104, 'samples': 15513792, 'steps': 80800, 'loss/train': 1.3105747699737549} 11/07/2021 08:36:02 - INFO - __main__ - Step 80802: {'lr': 0.0002245663984841202, 'samples': 15513984, 'steps': 80801, 'loss/train': 1.2669440507888794} 11/07/2021 08:36:03 - INFO - __main__ - Step 80803: {'lr': 0.00022456111927903933, 'samples': 15514176, 'steps': 80802, 'loss/train': 1.2374635934829712} 11/07/2021 08:36:03 - INFO - __main__ - Step 80804: {'lr': 0.00022455584008542083, 'samples': 15514368, 'steps': 80803, 'loss/train': 1.65982186794281} 11/07/2021 08:36:03 - INFO - __main__ - Step 80805: {'lr': 0.00022455056090326707, 'samples': 15514560, 'steps': 80804, 'loss/train': 1.521307110786438} 11/07/2021 08:36:04 - INFO - __main__ - Step 80806: {'lr': 0.00022454528173258042, 'samples': 15514752, 'steps': 80805, 'loss/train': 1.3846873044967651} 11/07/2021 08:36:05 - INFO - __main__ - Step 80807: {'lr': 0.00022454000257336333, 'samples': 15514944, 'steps': 80806, 'loss/train': 1.4394569396972656} 11/07/2021 08:36:05 - INFO - __main__ - Step 80808: {'lr': 0.0002245347234256182, 'samples': 15515136, 'steps': 80807, 'loss/train': 0.8547345995903015} 11/07/2021 08:36:05 - INFO - __main__ - Step 80809: {'lr': 0.0002245294442893472, 'samples': 15515328, 'steps': 80808, 'loss/train': 1.5742443799972534} 11/07/2021 08:36:06 - INFO - __main__ - Step 80810: {'lr': 0.00022452416516455289, 'samples': 15515520, 'steps': 80809, 'loss/train': 1.5366214513778687} 11/07/2021 08:36:07 - INFO - __main__ - Step 80811: {'lr': 0.00022451888605123756, 'samples': 15515712, 'steps': 80810, 'loss/train': 1.405924677848816} 11/07/2021 08:36:07 - INFO - __main__ - Step 80812: {'lr': 0.00022451360694940363, 'samples': 15515904, 'steps': 80811, 'loss/train': 1.287876844406128} 11/07/2021 08:36:08 - INFO - __main__ - Step 80813: {'lr': 0.00022450832785905346, 'samples': 15516096, 'steps': 80812, 'loss/train': 1.4052379131317139} 11/07/2021 08:36:08 - INFO - __main__ - Step 80814: {'lr': 0.00022450304878018944, 'samples': 15516288, 'steps': 80813, 'loss/train': 0.9777100086212158} 11/07/2021 08:36:08 - INFO - __main__ - Step 80815: {'lr': 0.00022449776971281398, 'samples': 15516480, 'steps': 80814, 'loss/train': 0.1802147626876831} 11/07/2021 08:36:09 - INFO - __main__ - Step 80816: {'lr': 0.00022449249065692944, 'samples': 15516672, 'steps': 80815, 'loss/train': 1.4950566291809082} 11/07/2021 08:36:10 - INFO - __main__ - Step 80817: {'lr': 0.0002244872116125382, 'samples': 15516864, 'steps': 80816, 'loss/train': 1.5944888591766357} 11/07/2021 08:36:10 - INFO - __main__ - Step 80818: {'lr': 0.0002244819325796426, 'samples': 15517056, 'steps': 80817, 'loss/train': 1.042711615562439} 11/07/2021 08:36:10 - INFO - __main__ - Step 80819: {'lr': 0.00022447665355824505, 'samples': 15517248, 'steps': 80818, 'loss/train': 1.3681672811508179} 11/07/2021 08:36:11 - INFO - __main__ - Step 80820: {'lr': 0.00022447137454834792, 'samples': 15517440, 'steps': 80819, 'loss/train': 1.959350347518921} 11/07/2021 08:36:12 - INFO - __main__ - Step 80821: {'lr': 0.00022446609554995373, 'samples': 15517632, 'steps': 80820, 'loss/train': 1.6076358556747437} 11/07/2021 08:36:12 - INFO - __main__ - Step 80822: {'lr': 0.00022446081656306462, 'samples': 15517824, 'steps': 80821, 'loss/train': 1.7454428672790527} 11/07/2021 08:36:13 - INFO - __main__ - Step 80823: {'lr': 0.00022445553758768303, 'samples': 15518016, 'steps': 80822, 'loss/train': 1.733018398284912} 11/07/2021 08:36:13 - INFO - __main__ - Step 80824: {'lr': 0.0002244502586238114, 'samples': 15518208, 'steps': 80823, 'loss/train': 2.299412965774536} 11/07/2021 08:36:13 - INFO - __main__ - Step 80825: {'lr': 0.00022444497967145208, 'samples': 15518400, 'steps': 80824, 'loss/train': 1.3762400150299072} 11/07/2021 08:36:14 - INFO - __main__ - Step 80826: {'lr': 0.00022443970073060746, 'samples': 15518592, 'steps': 80825, 'loss/train': 1.6147394180297852} 11/07/2021 08:36:15 - INFO - __main__ - Step 80827: {'lr': 0.00022443442180127994, 'samples': 15518784, 'steps': 80826, 'loss/train': 1.0255590677261353} 11/07/2021 08:36:15 - INFO - __main__ - Step 80828: {'lr': 0.00022442914288347185, 'samples': 15518976, 'steps': 80827, 'loss/train': 1.5834286212921143} 11/07/2021 08:36:15 - INFO - __main__ - Step 80829: {'lr': 0.00022442386397718563, 'samples': 15519168, 'steps': 80828, 'loss/train': 1.189112663269043} 11/07/2021 08:36:16 - INFO - __main__ - Step 80830: {'lr': 0.00022441858508242358, 'samples': 15519360, 'steps': 80829, 'loss/train': 1.4057350158691406} 11/07/2021 08:36:16 - INFO - __main__ - Step 80831: {'lr': 0.00022441330619918812, 'samples': 15519552, 'steps': 80830, 'loss/train': 1.384081482887268} 11/07/2021 08:36:17 - INFO - __main__ - Step 80832: {'lr': 0.00022440802732748164, 'samples': 15519744, 'steps': 80831, 'loss/train': 1.1327812671661377} 11/07/2021 08:36:17 - INFO - __main__ - Step 80833: {'lr': 0.0002244027484673065, 'samples': 15519936, 'steps': 80832, 'loss/train': 1.3243685960769653} 11/07/2021 08:36:18 - INFO - __main__ - Step 80834: {'lr': 0.00022439746961866512, 'samples': 15520128, 'steps': 80833, 'loss/train': 1.6132937669754028} 11/07/2021 08:36:18 - INFO - __main__ - Step 80835: {'lr': 0.00022439219078155992, 'samples': 15520320, 'steps': 80834, 'loss/train': 1.8627837896347046} 11/07/2021 08:36:18 - INFO - __main__ - Step 80836: {'lr': 0.00022438691195599312, 'samples': 15520512, 'steps': 80835, 'loss/train': 1.3777847290039062} 11/07/2021 08:36:19 - INFO - __main__ - Step 80837: {'lr': 0.00022438163314196716, 'samples': 15520704, 'steps': 80836, 'loss/train': 1.196541666984558} 11/07/2021 08:36:20 - INFO - __main__ - Step 80838: {'lr': 0.00022437635433948447, 'samples': 15520896, 'steps': 80837, 'loss/train': 1.261157751083374} 11/07/2021 08:36:20 - INFO - __main__ - Step 80839: {'lr': 0.00022437107554854738, 'samples': 15521088, 'steps': 80838, 'loss/train': 1.3536930084228516} 11/07/2021 08:36:20 - INFO - __main__ - Step 80840: {'lr': 0.00022436579676915827, 'samples': 15521280, 'steps': 80839, 'loss/train': 1.5714622735977173} 11/07/2021 08:36:21 - INFO - __main__ - Step 80841: {'lr': 0.00022436051800131957, 'samples': 15521472, 'steps': 80840, 'loss/train': 1.5047577619552612} 11/07/2021 08:36:22 - INFO - __main__ - Step 80842: {'lr': 0.0002243552392450336, 'samples': 15521664, 'steps': 80841, 'loss/train': 1.1950838565826416} 11/07/2021 08:36:22 - INFO - __main__ - Step 80843: {'lr': 0.0002243499605003028, 'samples': 15521856, 'steps': 80842, 'loss/train': 1.1200554370880127} 11/07/2021 08:36:23 - INFO - __main__ - Step 80844: {'lr': 0.00022434468176712948, 'samples': 15522048, 'steps': 80843, 'loss/train': 1.318935751914978} 11/07/2021 08:36:23 - INFO - __main__ - Step 80845: {'lr': 0.00022433940304551604, 'samples': 15522240, 'steps': 80844, 'loss/train': 1.9300111532211304} 11/07/2021 08:36:23 - INFO - __main__ - Step 80846: {'lr': 0.00022433412433546488, 'samples': 15522432, 'steps': 80845, 'loss/train': 0.7652961611747742} 11/07/2021 08:36:24 - INFO - __main__ - Step 80847: {'lr': 0.0002243288456369784, 'samples': 15522624, 'steps': 80846, 'loss/train': 1.0391693115234375} 11/07/2021 08:36:25 - INFO - __main__ - Step 80848: {'lr': 0.00022432356695005902, 'samples': 15522816, 'steps': 80847, 'loss/train': 0.6528677344322205} 11/07/2021 08:36:25 - INFO - __main__ - Step 80849: {'lr': 0.00022431828827470894, 'samples': 15523008, 'steps': 80848, 'loss/train': 1.6542878150939941} 11/07/2021 08:36:26 - INFO - __main__ - Step 80850: {'lr': 0.00022431300961093064, 'samples': 15523200, 'steps': 80849, 'loss/train': 1.5101834535598755} 11/07/2021 08:36:26 - INFO - __main__ - Step 80851: {'lr': 0.0002243077309587265, 'samples': 15523392, 'steps': 80850, 'loss/train': 1.5971906185150146} 11/07/2021 08:36:26 - INFO - __main__ - Step 80852: {'lr': 0.00022430245231809892, 'samples': 15523584, 'steps': 80851, 'loss/train': 1.326186180114746} 11/07/2021 08:36:27 - INFO - __main__ - Step 80853: {'lr': 0.00022429717368905021, 'samples': 15523776, 'steps': 80852, 'loss/train': 1.5089420080184937} 11/07/2021 08:36:28 - INFO - __main__ - Step 80854: {'lr': 0.00022429189507158288, 'samples': 15523968, 'steps': 80853, 'loss/train': 1.5872516632080078} 11/07/2021 08:36:28 - INFO - __main__ - Step 80855: {'lr': 0.00022428661646569915, 'samples': 15524160, 'steps': 80854, 'loss/train': 1.3968805074691772} 11/07/2021 08:36:28 - INFO - __main__ - Step 80856: {'lr': 0.00022428133787140151, 'samples': 15524352, 'steps': 80855, 'loss/train': 0.8319019675254822} 11/07/2021 08:36:29 - INFO - __main__ - Step 80857: {'lr': 0.0002242760592886923, 'samples': 15524544, 'steps': 80856, 'loss/train': 1.2803643941879272} 11/07/2021 08:36:30 - INFO - __main__ - Step 80858: {'lr': 0.0002242707807175739, 'samples': 15524736, 'steps': 80857, 'loss/train': 1.2547370195388794} 11/07/2021 08:36:30 - INFO - __main__ - Step 80859: {'lr': 0.00022426550215804867, 'samples': 15524928, 'steps': 80858, 'loss/train': 0.07144899666309357} 11/07/2021 08:36:30 - INFO - __main__ - Step 80860: {'lr': 0.00022426022361011903, 'samples': 15525120, 'steps': 80859, 'loss/train': 1.7828084230422974} 11/07/2021 08:36:31 - INFO - __main__ - Step 80861: {'lr': 0.00022425494507378733, 'samples': 15525312, 'steps': 80860, 'loss/train': 1.086612343788147} 11/07/2021 08:36:31 - INFO - __main__ - Step 80862: {'lr': 0.00022424966654905604, 'samples': 15525504, 'steps': 80861, 'loss/train': 2.048454523086548} 11/07/2021 08:36:32 - INFO - __main__ - Step 80863: {'lr': 0.00022424438803592738, 'samples': 15525696, 'steps': 80862, 'loss/train': 1.7557367086410522} 11/07/2021 08:36:33 - INFO - __main__ - Step 80864: {'lr': 0.00022423910953440378, 'samples': 15525888, 'steps': 80863, 'loss/train': 0.7118558883666992} 11/07/2021 08:36:33 - INFO - __main__ - Step 80865: {'lr': 0.00022423383104448765, 'samples': 15526080, 'steps': 80864, 'loss/train': 1.5460364818572998} 11/07/2021 08:36:33 - INFO - __main__ - Step 80866: {'lr': 0.00022422855256618135, 'samples': 15526272, 'steps': 80865, 'loss/train': 1.5451401472091675} 11/07/2021 08:36:34 - INFO - __main__ - Step 80867: {'lr': 0.00022422327409948729, 'samples': 15526464, 'steps': 80866, 'loss/train': 1.3837157487869263} 11/07/2021 08:36:34 - INFO - __main__ - Step 80868: {'lr': 0.0002242179956444078, 'samples': 15526656, 'steps': 80867, 'loss/train': 1.6659358739852905} 11/07/2021 08:36:35 - INFO - __main__ - Step 80869: {'lr': 0.00022421271720094528, 'samples': 15526848, 'steps': 80868, 'loss/train': 1.4698587656021118} 11/07/2021 08:36:35 - INFO - __main__ - Step 80870: {'lr': 0.00022420743876910214, 'samples': 15527040, 'steps': 80869, 'loss/train': 1.5548267364501953} 11/07/2021 08:36:36 - INFO - __main__ - Step 80871: {'lr': 0.0002242021603488807, 'samples': 15527232, 'steps': 80870, 'loss/train': 1.3045732975006104} 11/07/2021 08:36:36 - INFO - __main__ - Step 80872: {'lr': 0.00022419688194028338, 'samples': 15527424, 'steps': 80871, 'loss/train': 1.0798381567001343} 11/07/2021 08:36:37 - INFO - __main__ - Step 80873: {'lr': 0.00022419160354331257, 'samples': 15527616, 'steps': 80872, 'loss/train': 0.3975071609020233} 11/07/2021 08:36:38 - INFO - __main__ - Step 80874: {'lr': 0.00022418632515797064, 'samples': 15527808, 'steps': 80873, 'loss/train': 1.16277277469635} 11/07/2021 08:36:38 - INFO - __main__ - Step 80875: {'lr': 0.00022418104678425995, 'samples': 15528000, 'steps': 80874, 'loss/train': 1.5723512172698975} 11/07/2021 08:36:38 - INFO - __main__ - Step 80876: {'lr': 0.00022417576842218286, 'samples': 15528192, 'steps': 80875, 'loss/train': 1.2146714925765991} 11/07/2021 08:36:39 - INFO - __main__ - Step 80877: {'lr': 0.00022417049007174175, 'samples': 15528384, 'steps': 80876, 'loss/train': 1.6834372282028198} 11/07/2021 08:36:39 - INFO - __main__ - Step 80878: {'lr': 0.00022416521173293904, 'samples': 15528576, 'steps': 80877, 'loss/train': 1.9211366176605225} 11/07/2021 08:36:40 - INFO - __main__ - Step 80879: {'lr': 0.00022415993340577707, 'samples': 15528768, 'steps': 80878, 'loss/train': 0.6731970310211182} 11/07/2021 08:36:40 - INFO - __main__ - Step 80880: {'lr': 0.00022415465509025823, 'samples': 15528960, 'steps': 80879, 'loss/train': 1.774720549583435} 11/07/2021 08:36:41 - INFO - __main__ - Step 80881: {'lr': 0.00022414937678638493, 'samples': 15529152, 'steps': 80880, 'loss/train': 1.189456582069397} 11/07/2021 08:36:41 - INFO - __main__ - Step 80882: {'lr': 0.00022414409849415948, 'samples': 15529344, 'steps': 80881, 'loss/train': 1.6671228408813477} 11/07/2021 08:36:42 - INFO - __main__ - Step 80883: {'lr': 0.00022413882021358434, 'samples': 15529536, 'steps': 80882, 'loss/train': 0.7952503561973572} 11/07/2021 08:36:42 - INFO - __main__ - Step 80884: {'lr': 0.00022413354194466187, 'samples': 15529728, 'steps': 80883, 'loss/train': 1.4252911806106567} 11/07/2021 08:36:43 - INFO - __main__ - Step 80885: {'lr': 0.00022412826368739438, 'samples': 15529920, 'steps': 80884, 'loss/train': 1.478074073791504} 11/07/2021 08:36:43 - INFO - __main__ - Step 80886: {'lr': 0.0002241229854417843, 'samples': 15530112, 'steps': 80885, 'loss/train': 1.5448095798492432} 11/07/2021 08:36:44 - INFO - __main__ - Step 80887: {'lr': 0.00022411770720783404, 'samples': 15530304, 'steps': 80886, 'loss/train': 1.610823631286621} 11/07/2021 08:36:44 - INFO - __main__ - Step 80888: {'lr': 0.0002241124289855459, 'samples': 15530496, 'steps': 80887, 'loss/train': 0.8464186191558838} 11/07/2021 08:36:45 - INFO - __main__ - Step 80889: {'lr': 0.00022410715077492236, 'samples': 15530688, 'steps': 80888, 'loss/train': 0.9055548906326294} 11/07/2021 08:36:45 - INFO - __main__ - Step 80890: {'lr': 0.00022410187257596568, 'samples': 15530880, 'steps': 80889, 'loss/train': 1.6287803649902344} 11/07/2021 08:36:46 - INFO - __main__ - Step 80891: {'lr': 0.0002240965943886783, 'samples': 15531072, 'steps': 80890, 'loss/train': 1.164123773574829} 11/07/2021 08:36:46 - INFO - __main__ - Step 80892: {'lr': 0.00022409131621306262, 'samples': 15531264, 'steps': 80891, 'loss/train': 1.4751553535461426} 11/07/2021 08:36:46 - INFO - __main__ - Step 80893: {'lr': 0.00022408603804912095, 'samples': 15531456, 'steps': 80892, 'loss/train': 2.069591999053955} 11/07/2021 08:36:47 - INFO - __main__ - Step 80894: {'lr': 0.00022408075989685576, 'samples': 15531648, 'steps': 80893, 'loss/train': 1.1557596921920776} 11/07/2021 08:36:48 - INFO - __main__ - Step 80895: {'lr': 0.0002240754817562694, 'samples': 15531840, 'steps': 80894, 'loss/train': 1.4488208293914795} 11/07/2021 08:36:48 - INFO - __main__ - Step 80896: {'lr': 0.0002240702036273642, 'samples': 15532032, 'steps': 80895, 'loss/train': 0.9614261984825134} 11/07/2021 08:36:48 - INFO - __main__ - Step 80897: {'lr': 0.0002240649255101425, 'samples': 15532224, 'steps': 80896, 'loss/train': 1.4024665355682373} 11/07/2021 08:36:49 - INFO - __main__ - Step 80898: {'lr': 0.00022405964740460682, 'samples': 15532416, 'steps': 80897, 'loss/train': 1.6652562618255615} 11/07/2021 08:36:49 - INFO - __main__ - Step 80899: {'lr': 0.00022405436931075942, 'samples': 15532608, 'steps': 80898, 'loss/train': 1.0714455842971802} 11/07/2021 08:36:50 - INFO - __main__ - Step 80900: {'lr': 0.00022404909122860272, 'samples': 15532800, 'steps': 80899, 'loss/train': 1.4996579885482788} 11/07/2021 08:36:51 - INFO - __main__ - Step 80901: {'lr': 0.00022404381315813913, 'samples': 15532992, 'steps': 80900, 'loss/train': 1.0926685333251953} 11/07/2021 08:36:51 - INFO - __main__ - Step 80902: {'lr': 0.00022403853509937106, 'samples': 15533184, 'steps': 80901, 'loss/train': 1.578445315361023} 11/07/2021 08:36:52 - INFO - __main__ - Step 80903: {'lr': 0.00022403325705230072, 'samples': 15533376, 'steps': 80902, 'loss/train': 1.055283784866333} 11/07/2021 08:36:52 - INFO - __main__ - Step 80904: {'lr': 0.0002240279790169306, 'samples': 15533568, 'steps': 80903, 'loss/train': 1.5158926248550415} 11/07/2021 08:36:52 - INFO - __main__ - Step 80905: {'lr': 0.00022402270099326313, 'samples': 15533760, 'steps': 80904, 'loss/train': 1.5264755487442017} 11/07/2021 08:36:53 - INFO - __main__ - Step 80906: {'lr': 0.00022401742298130064, 'samples': 15533952, 'steps': 80905, 'loss/train': 1.0306066274642944} 11/07/2021 08:36:54 - INFO - __main__ - Step 80907: {'lr': 0.00022401214498104545, 'samples': 15534144, 'steps': 80906, 'loss/train': 1.366942048072815} 11/07/2021 08:36:54 - INFO - __main__ - Step 80908: {'lr': 0.0002240068669925, 'samples': 15534336, 'steps': 80907, 'loss/train': 1.5778182744979858} 11/07/2021 08:36:54 - INFO - __main__ - Step 80909: {'lr': 0.00022400158901566663, 'samples': 15534528, 'steps': 80908, 'loss/train': 1.6383624076843262} 11/07/2021 08:36:55 - INFO - __main__ - Step 80910: {'lr': 0.00022399631105054775, 'samples': 15534720, 'steps': 80909, 'loss/train': 1.517841100692749} 11/07/2021 08:36:56 - INFO - __main__ - Step 80911: {'lr': 0.00022399103309714576, 'samples': 15534912, 'steps': 80910, 'loss/train': 1.3418582677841187} 11/07/2021 08:36:56 - INFO - __main__ - Step 80912: {'lr': 0.00022398575515546296, 'samples': 15535104, 'steps': 80911, 'loss/train': 1.3569591045379639} 11/07/2021 08:36:56 - INFO - __main__ - Step 80913: {'lr': 0.0002239804772255018, 'samples': 15535296, 'steps': 80912, 'loss/train': 1.7389191389083862} 11/07/2021 08:36:57 - INFO - __main__ - Step 80914: {'lr': 0.00022397519930726466, 'samples': 15535488, 'steps': 80913, 'loss/train': 1.2210655212402344} 11/07/2021 08:36:57 - INFO - __main__ - Step 80915: {'lr': 0.00022396992140075387, 'samples': 15535680, 'steps': 80914, 'loss/train': 1.3055769205093384} 11/07/2021 08:36:58 - INFO - __main__ - Step 80916: {'lr': 0.00022396464350597187, 'samples': 15535872, 'steps': 80915, 'loss/train': 1.4181137084960938} 11/07/2021 08:36:59 - INFO - __main__ - Step 80917: {'lr': 0.00022395936562292102, 'samples': 15536064, 'steps': 80916, 'loss/train': 0.8738125562667847} 11/07/2021 08:36:59 - INFO - __main__ - Step 80918: {'lr': 0.00022395408775160362, 'samples': 15536256, 'steps': 80917, 'loss/train': 1.5244858264923096} 11/07/2021 08:36:59 - INFO - __main__ - Step 80919: {'lr': 0.0002239488098920221, 'samples': 15536448, 'steps': 80918, 'loss/train': 0.6997354030609131} 11/07/2021 08:37:00 - INFO - __main__ - Step 80920: {'lr': 0.00022394353204417886, 'samples': 15536640, 'steps': 80919, 'loss/train': 1.4993243217468262} 11/07/2021 08:37:01 - INFO - __main__ - Step 80921: {'lr': 0.00022393825420807627, 'samples': 15536832, 'steps': 80920, 'loss/train': 1.3746718168258667} 11/07/2021 08:37:01 - INFO - __main__ - Step 80922: {'lr': 0.00022393297638371667, 'samples': 15537024, 'steps': 80921, 'loss/train': 1.4612367153167725} 11/07/2021 08:37:01 - INFO - __main__ - Step 80923: {'lr': 0.00022392769857110248, 'samples': 15537216, 'steps': 80922, 'loss/train': 1.4780032634735107} 11/07/2021 08:37:02 - INFO - __main__ - Step 80924: {'lr': 0.00022392242077023608, 'samples': 15537408, 'steps': 80923, 'loss/train': 1.5322582721710205} 11/07/2021 08:37:02 - INFO - __main__ - Step 80925: {'lr': 0.00022391714298111983, 'samples': 15537600, 'steps': 80924, 'loss/train': 1.4370492696762085} 11/07/2021 08:37:03 - INFO - __main__ - Step 80926: {'lr': 0.00022391186520375608, 'samples': 15537792, 'steps': 80925, 'loss/train': 1.1310056447982788} 11/07/2021 08:37:03 - INFO - __main__ - Step 80927: {'lr': 0.0002239065874381473, 'samples': 15537984, 'steps': 80926, 'loss/train': 1.528582215309143} 11/07/2021 08:37:04 - INFO - __main__ - Step 80928: {'lr': 0.00022390130968429577, 'samples': 15538176, 'steps': 80927, 'loss/train': 1.628496766090393} 11/07/2021 08:37:04 - INFO - __main__ - Step 80929: {'lr': 0.00022389603194220402, 'samples': 15538368, 'steps': 80928, 'loss/train': 1.3950637578964233} 11/07/2021 08:37:04 - INFO - __main__ - Step 80930: {'lr': 0.00022389075421187421, 'samples': 15538560, 'steps': 80929, 'loss/train': 1.1341971158981323} 11/07/2021 08:37:05 - INFO - __main__ - Step 80931: {'lr': 0.00022388547649330881, 'samples': 15538752, 'steps': 80930, 'loss/train': 0.33745911717414856} 11/07/2021 08:37:06 - INFO - __main__ - Step 80932: {'lr': 0.00022388019878651023, 'samples': 15538944, 'steps': 80931, 'loss/train': 1.6784181594848633} 11/07/2021 08:37:06 - INFO - __main__ - Step 80933: {'lr': 0.00022387492109148083, 'samples': 15539136, 'steps': 80932, 'loss/train': 0.536235511302948} 11/07/2021 08:37:06 - INFO - __main__ - Step 80934: {'lr': 0.00022386964340822296, 'samples': 15539328, 'steps': 80933, 'loss/train': 0.6595482230186462} 11/07/2021 08:37:07 - INFO - __main__ - Step 80935: {'lr': 0.00022386436573673907, 'samples': 15539520, 'steps': 80934, 'loss/train': 1.3058478832244873} 11/07/2021 08:37:07 - INFO - __main__ - Step 80936: {'lr': 0.00022385908807703145, 'samples': 15539712, 'steps': 80935, 'loss/train': 1.4483643770217896} 11/07/2021 08:37:08 - INFO - __main__ - Step 80937: {'lr': 0.00022385381042910256, 'samples': 15539904, 'steps': 80936, 'loss/train': 1.7842570543289185} 11/07/2021 08:37:09 - INFO - __main__ - Step 80938: {'lr': 0.00022384853279295474, 'samples': 15540096, 'steps': 80937, 'loss/train': 1.550698161125183} 11/07/2021 08:37:09 - INFO - __main__ - Step 80939: {'lr': 0.00022384325516859032, 'samples': 15540288, 'steps': 80938, 'loss/train': 1.405117392539978} 11/07/2021 08:37:09 - INFO - __main__ - Step 80940: {'lr': 0.00022383797755601176, 'samples': 15540480, 'steps': 80939, 'loss/train': 1.3832370042800903} 11/07/2021 08:37:10 - INFO - __main__ - Step 80941: {'lr': 0.0002238326999552214, 'samples': 15540672, 'steps': 80940, 'loss/train': 1.8369022607803345} 11/07/2021 08:37:11 - INFO - __main__ - Step 80942: {'lr': 0.00022382742236622173, 'samples': 15540864, 'steps': 80941, 'loss/train': 1.6140663623809814} 11/07/2021 08:37:11 - INFO - __main__ - Step 80943: {'lr': 0.0002238221447890149, 'samples': 15541056, 'steps': 80942, 'loss/train': 1.7681217193603516} 11/07/2021 08:37:11 - INFO - __main__ - Step 80944: {'lr': 0.00022381686722360342, 'samples': 15541248, 'steps': 80943, 'loss/train': 1.1612637042999268} 11/07/2021 08:37:12 - INFO - __main__ - Step 80945: {'lr': 0.00022381158966998965, 'samples': 15541440, 'steps': 80944, 'loss/train': 1.4282017946243286} 11/07/2021 08:37:12 - INFO - __main__ - Step 80946: {'lr': 0.00022380631212817599, 'samples': 15541632, 'steps': 80945, 'loss/train': 1.4848556518554688} 11/07/2021 08:37:13 - INFO - __main__ - Step 80947: {'lr': 0.00022380103459816478, 'samples': 15541824, 'steps': 80946, 'loss/train': 1.5736324787139893} 11/07/2021 08:37:13 - INFO - __main__ - Step 80948: {'lr': 0.00022379575707995842, 'samples': 15542016, 'steps': 80947, 'loss/train': 1.4223175048828125} 11/07/2021 08:37:14 - INFO - __main__ - Step 80949: {'lr': 0.0002237904795735593, 'samples': 15542208, 'steps': 80948, 'loss/train': 1.297677993774414} 11/07/2021 08:37:14 - INFO - __main__ - Step 80950: {'lr': 0.00022378520207896977, 'samples': 15542400, 'steps': 80949, 'loss/train': 0.9073521494865417} 11/07/2021 08:37:15 - INFO - __main__ - Step 80951: {'lr': 0.00022377992459619224, 'samples': 15542592, 'steps': 80950, 'loss/train': 1.2523530721664429} 11/07/2021 08:37:16 - INFO - __main__ - Step 80952: {'lr': 0.00022377464712522907, 'samples': 15542784, 'steps': 80951, 'loss/train': 1.4063057899475098} 11/07/2021 08:37:16 - INFO - __main__ - Step 80953: {'lr': 0.00022376936966608262, 'samples': 15542976, 'steps': 80952, 'loss/train': 1.1552128791809082} 11/07/2021 08:37:16 - INFO - __main__ - Step 80954: {'lr': 0.00022376409221875533, 'samples': 15543168, 'steps': 80953, 'loss/train': 1.280600905418396} 11/07/2021 08:37:17 - INFO - __main__ - Step 80955: {'lr': 0.0002237588147832495, 'samples': 15543360, 'steps': 80954, 'loss/train': 1.2275558710098267} 11/07/2021 08:37:17 - INFO - __main__ - Step 80956: {'lr': 0.00022375353735956766, 'samples': 15543552, 'steps': 80955, 'loss/train': 1.4362624883651733} 11/07/2021 08:37:17 - INFO - __main__ - Step 80957: {'lr': 0.00022374825994771194, 'samples': 15543744, 'steps': 80956, 'loss/train': 1.6634576320648193} 11/07/2021 08:37:18 - INFO - __main__ - Step 80958: {'lr': 0.00022374298254768487, 'samples': 15543936, 'steps': 80957, 'loss/train': 1.2239667177200317} 11/07/2021 08:37:19 - INFO - __main__ - Step 80959: {'lr': 0.00022373770515948883, 'samples': 15544128, 'steps': 80958, 'loss/train': 1.0190563201904297} 11/07/2021 08:37:19 - INFO - __main__ - Step 80960: {'lr': 0.00022373242778312615, 'samples': 15544320, 'steps': 80959, 'loss/train': 1.2229951620101929} 11/07/2021 08:37:19 - INFO - __main__ - Step 80961: {'lr': 0.00022372715041859924, 'samples': 15544512, 'steps': 80960, 'loss/train': 1.4121094942092896} 11/07/2021 08:37:20 - INFO - __main__ - Step 80962: {'lr': 0.00022372187306591044, 'samples': 15544704, 'steps': 80961, 'loss/train': 1.1357707977294922} 11/07/2021 08:37:21 - INFO - __main__ - Step 80963: {'lr': 0.0002237165957250622, 'samples': 15544896, 'steps': 80962, 'loss/train': 1.7460111379623413} 11/07/2021 08:37:21 - INFO - __main__ - Step 80964: {'lr': 0.00022371131839605683, 'samples': 15545088, 'steps': 80963, 'loss/train': 1.2060043811798096} 11/07/2021 08:37:21 - INFO - __main__ - Step 80965: {'lr': 0.00022370604107889674, 'samples': 15545280, 'steps': 80964, 'loss/train': 1.7962433099746704} 11/07/2021 08:37:22 - INFO - __main__ - Step 80966: {'lr': 0.0002237007637735843, 'samples': 15545472, 'steps': 80965, 'loss/train': 0.7760006189346313} 11/07/2021 08:37:22 - INFO - __main__ - Step 80967: {'lr': 0.00022369548648012188, 'samples': 15545664, 'steps': 80966, 'loss/train': 1.2879409790039062} 11/07/2021 08:37:23 - INFO - __main__ - Step 80968: {'lr': 0.00022369020919851192, 'samples': 15545856, 'steps': 80967, 'loss/train': 1.4455422163009644} 11/07/2021 08:37:24 - INFO - __main__ - Step 80969: {'lr': 0.0002236849319287568, 'samples': 15546048, 'steps': 80968, 'loss/train': 1.1197110414505005} 11/07/2021 08:37:24 - INFO - __main__ - Step 80970: {'lr': 0.00022367965467085877, 'samples': 15546240, 'steps': 80969, 'loss/train': 1.3679572343826294} 11/07/2021 08:37:24 - INFO - __main__ - Step 80971: {'lr': 0.00022367437742482025, 'samples': 15546432, 'steps': 80970, 'loss/train': 1.4557398557662964} 11/07/2021 08:37:25 - INFO - __main__ - Step 80972: {'lr': 0.00022366910019064367, 'samples': 15546624, 'steps': 80971, 'loss/train': 1.4284754991531372} 11/07/2021 08:37:26 - INFO - __main__ - Step 80973: {'lr': 0.00022366382296833137, 'samples': 15546816, 'steps': 80972, 'loss/train': 1.1641956567764282} 11/07/2021 08:37:26 - INFO - __main__ - Step 80974: {'lr': 0.00022365854575788578, 'samples': 15547008, 'steps': 80973, 'loss/train': 3.733318567276001} 11/07/2021 08:37:26 - INFO - __main__ - Step 80975: {'lr': 0.0002236532685593092, 'samples': 15547200, 'steps': 80974, 'loss/train': 1.3512310981750488} 11/07/2021 08:37:27 - INFO - __main__ - Step 80976: {'lr': 0.0002236479913726041, 'samples': 15547392, 'steps': 80975, 'loss/train': 1.4333949089050293} 11/07/2021 08:37:27 - INFO - __main__ - Step 80977: {'lr': 0.00022364271419777275, 'samples': 15547584, 'steps': 80976, 'loss/train': 0.2820529341697693} 11/07/2021 08:37:28 - INFO - __main__ - Step 80978: {'lr': 0.00022363743703481762, 'samples': 15547776, 'steps': 80977, 'loss/train': 1.3365898132324219} 11/07/2021 08:37:28 - INFO - __main__ - Step 80979: {'lr': 0.00022363215988374102, 'samples': 15547968, 'steps': 80978, 'loss/train': 1.3197747468948364} 11/07/2021 08:37:29 - INFO - __main__ - Step 80980: {'lr': 0.0002236268827445454, 'samples': 15548160, 'steps': 80979, 'loss/train': 1.8832721710205078} 11/07/2021 08:37:29 - INFO - __main__ - Step 80981: {'lr': 0.0002236216056172331, 'samples': 15548352, 'steps': 80980, 'loss/train': 1.2062305212020874} 11/07/2021 08:37:29 - INFO - __main__ - Step 80982: {'lr': 0.00022361632850180648, 'samples': 15548544, 'steps': 80981, 'loss/train': 1.5496517419815063} 11/07/2021 08:37:30 - INFO - __main__ - Step 80983: {'lr': 0.00022361105139826807, 'samples': 15548736, 'steps': 80982, 'loss/train': 1.2737003564834595} 11/07/2021 08:37:31 - INFO - __main__ - Step 80984: {'lr': 0.00022360577430662, 'samples': 15548928, 'steps': 80983, 'loss/train': 1.3357962369918823} 11/07/2021 08:37:31 - INFO - __main__ - Step 80985: {'lr': 0.00022360049722686473, 'samples': 15549120, 'steps': 80984, 'loss/train': 1.314703106880188} 11/07/2021 08:37:31 - INFO - __main__ - Step 80986: {'lr': 0.00022359522015900468, 'samples': 15549312, 'steps': 80985, 'loss/train': 1.3081929683685303} 11/07/2021 08:37:32 - INFO - __main__ - Step 80987: {'lr': 0.00022358994310304223, 'samples': 15549504, 'steps': 80986, 'loss/train': 1.3505083322525024} 11/07/2021 08:37:32 - INFO - __main__ - Step 80988: {'lr': 0.00022358466605897973, 'samples': 15549696, 'steps': 80987, 'loss/train': 1.292600393295288} 11/07/2021 08:37:33 - INFO - __main__ - Step 80989: {'lr': 0.00022357938902681956, 'samples': 15549888, 'steps': 80988, 'loss/train': 0.8674932718276978} 11/07/2021 08:37:34 - INFO - __main__ - Step 80990: {'lr': 0.00022357411200656414, 'samples': 15550080, 'steps': 80989, 'loss/train': 1.285007357597351} 11/07/2021 08:37:34 - INFO - __main__ - Step 80991: {'lr': 0.0002235688349982158, 'samples': 15550272, 'steps': 80990, 'loss/train': 1.576724648475647} 11/07/2021 08:37:34 - INFO - __main__ - Step 80992: {'lr': 0.00022356355800177695, 'samples': 15550464, 'steps': 80991, 'loss/train': 0.7192810773849487} 11/07/2021 08:37:35 - INFO - __main__ - Step 80993: {'lr': 0.00022355828101724992, 'samples': 15550656, 'steps': 80992, 'loss/train': 1.2351776361465454} 11/07/2021 08:37:36 - INFO - __main__ - Step 80994: {'lr': 0.00022355300404463716, 'samples': 15550848, 'steps': 80993, 'loss/train': 1.0153809785842896} 11/07/2021 08:37:36 - INFO - __main__ - Step 80995: {'lr': 0.000223547727083941, 'samples': 15551040, 'steps': 80994, 'loss/train': 1.7630422115325928} 11/07/2021 08:37:36 - INFO - __main__ - Step 80996: {'lr': 0.00022354245013516392, 'samples': 15551232, 'steps': 80995, 'loss/train': 0.7580217123031616} 11/07/2021 08:37:37 - INFO - __main__ - Step 80997: {'lr': 0.0002235371731983081, 'samples': 15551424, 'steps': 80996, 'loss/train': 1.290104627609253} 11/07/2021 08:37:37 - INFO - __main__ - Step 80998: {'lr': 0.00022353189627337603, 'samples': 15551616, 'steps': 80997, 'loss/train': 1.4756395816802979} 11/07/2021 08:37:38 - INFO - __main__ - Step 80999: {'lr': 0.00022352661936037005, 'samples': 15551808, 'steps': 80998, 'loss/train': 1.193306565284729} 11/07/2021 08:37:39 - INFO - __main__ - Step 81000: {'lr': 0.0002235213424592926, 'samples': 15552000, 'steps': 80999, 'loss/train': 1.7824620008468628} 11/07/2021 08:37:39 - INFO - __main__ - Step 81001: {'lr': 0.000223516065570146, 'samples': 15552192, 'steps': 81000, 'loss/train': 1.3649611473083496} 11/07/2021 08:37:39 - INFO - __main__ - Step 81002: {'lr': 0.00022351078869293267, 'samples': 15552384, 'steps': 81001, 'loss/train': 1.483574390411377} 11/07/2021 08:37:40 - INFO - __main__ - Step 81003: {'lr': 0.00022350551182765497, 'samples': 15552576, 'steps': 81002, 'loss/train': 1.6595410108566284} 11/07/2021 08:37:41 - INFO - __main__ - Step 81004: {'lr': 0.00022350023497431527, 'samples': 15552768, 'steps': 81003, 'loss/train': 1.312238335609436} 11/07/2021 08:37:41 - INFO - __main__ - Step 81005: {'lr': 0.00022349495813291594, 'samples': 15552960, 'steps': 81004, 'loss/train': 1.2294448614120483} 11/07/2021 08:37:41 - INFO - __main__ - Step 81006: {'lr': 0.00022348968130345942, 'samples': 15553152, 'steps': 81005, 'loss/train': 1.42177152633667} 11/07/2021 08:37:42 - INFO - __main__ - Step 81007: {'lr': 0.000223484404485948, 'samples': 15553344, 'steps': 81006, 'loss/train': 1.5835216045379639} 11/07/2021 08:37:42 - INFO - __main__ - Step 81008: {'lr': 0.00022347912768038418, 'samples': 15553536, 'steps': 81007, 'loss/train': 1.731229305267334} 11/07/2021 08:37:43 - INFO - __main__ - Step 81009: {'lr': 0.00022347385088677016, 'samples': 15553728, 'steps': 81008, 'loss/train': 1.2174232006072998} 11/07/2021 08:37:43 - INFO - __main__ - Step 81010: {'lr': 0.0002234685741051085, 'samples': 15553920, 'steps': 81009, 'loss/train': 1.4875938892364502} 11/07/2021 08:37:44 - INFO - __main__ - Step 81011: {'lr': 0.00022346329733540144, 'samples': 15554112, 'steps': 81010, 'loss/train': 0.9840750694274902} 11/07/2021 08:37:44 - INFO - __main__ - Step 81012: {'lr': 0.00022345802057765137, 'samples': 15554304, 'steps': 81011, 'loss/train': 1.5396360158920288} 11/07/2021 08:37:45 - INFO - __main__ - Step 81013: {'lr': 0.00022345274383186076, 'samples': 15554496, 'steps': 81012, 'loss/train': 1.6194052696228027} 11/07/2021 08:37:45 - INFO - __main__ - Step 81014: {'lr': 0.0002234474670980319, 'samples': 15554688, 'steps': 81013, 'loss/train': 1.726041316986084} 11/07/2021 08:37:46 - INFO - __main__ - Step 81015: {'lr': 0.0002234421903761672, 'samples': 15554880, 'steps': 81014, 'loss/train': 1.4061791896820068} 11/07/2021 08:37:46 - INFO - __main__ - Step 81016: {'lr': 0.00022343691366626906, 'samples': 15555072, 'steps': 81015, 'loss/train': 1.3761388063430786} 11/07/2021 08:37:47 - INFO - __main__ - Step 81017: {'lr': 0.00022343163696833982, 'samples': 15555264, 'steps': 81016, 'loss/train': 2.470365285873413} 11/07/2021 08:37:47 - INFO - __main__ - Step 81018: {'lr': 0.0002234263602823819, 'samples': 15555456, 'steps': 81017, 'loss/train': 1.5745511054992676} 11/07/2021 08:37:47 - INFO - __main__ - Step 81019: {'lr': 0.0002234210836083977, 'samples': 15555648, 'steps': 81018, 'loss/train': 1.2915709018707275} 11/07/2021 08:37:48 - INFO - __main__ - Step 81020: {'lr': 0.00022341580694638946, 'samples': 15555840, 'steps': 81019, 'loss/train': 1.11332368850708} 11/07/2021 08:37:49 - INFO - __main__ - Step 81021: {'lr': 0.00022341053029635967, 'samples': 15556032, 'steps': 81020, 'loss/train': 1.2464735507965088} 11/07/2021 08:37:49 - INFO - __main__ - Step 81022: {'lr': 0.0002234052536583107, 'samples': 15556224, 'steps': 81021, 'loss/train': 1.591964840888977} 11/07/2021 08:37:49 - INFO - __main__ - Step 81023: {'lr': 0.00022339997703224492, 'samples': 15556416, 'steps': 81022, 'loss/train': 1.9689282178878784} 11/07/2021 08:37:50 - INFO - __main__ - Step 81024: {'lr': 0.00022339470041816468, 'samples': 15556608, 'steps': 81023, 'loss/train': 1.2653857469558716} 11/07/2021 08:37:51 - INFO - __main__ - Step 81025: {'lr': 0.00022338942381607238, 'samples': 15556800, 'steps': 81024, 'loss/train': 1.1661536693572998} 11/07/2021 08:37:51 - INFO - __main__ - Step 81026: {'lr': 0.0002233841472259704, 'samples': 15556992, 'steps': 81025, 'loss/train': 1.551882028579712} 11/07/2021 08:37:51 - INFO - __main__ - Step 81027: {'lr': 0.00022337887064786109, 'samples': 15557184, 'steps': 81026, 'loss/train': 1.8376262187957764} 11/07/2021 08:37:52 - INFO - __main__ - Step 81028: {'lr': 0.00022337359408174687, 'samples': 15557376, 'steps': 81027, 'loss/train': 1.3368299007415771} 11/07/2021 08:37:52 - INFO - __main__ - Step 81029: {'lr': 0.0002233683175276301, 'samples': 15557568, 'steps': 81028, 'loss/train': 1.724554181098938} 11/07/2021 08:37:53 - INFO - __main__ - Step 81030: {'lr': 0.00022336304098551318, 'samples': 15557760, 'steps': 81029, 'loss/train': 1.4345371723175049} 11/07/2021 08:37:53 - INFO - __main__ - Step 81031: {'lr': 0.00022335776445539843, 'samples': 15557952, 'steps': 81030, 'loss/train': 1.8789499998092651} 11/07/2021 08:37:54 - INFO - __main__ - Step 81032: {'lr': 0.00022335248793728824, 'samples': 15558144, 'steps': 81031, 'loss/train': 1.683642864227295} 11/07/2021 08:37:54 - INFO - __main__ - Step 81033: {'lr': 0.00022334721143118502, 'samples': 15558336, 'steps': 81032, 'loss/train': 1.180135726928711} 11/07/2021 08:37:55 - INFO - __main__ - Step 81034: {'lr': 0.00022334193493709115, 'samples': 15558528, 'steps': 81033, 'loss/train': 2.5371975898742676} 11/07/2021 08:37:55 - INFO - __main__ - Step 81035: {'lr': 0.000223336658455009, 'samples': 15558720, 'steps': 81034, 'loss/train': 1.5354771614074707} 11/07/2021 08:37:56 - INFO - __main__ - Step 81036: {'lr': 0.00022333138198494092, 'samples': 15558912, 'steps': 81035, 'loss/train': 1.1937205791473389} 11/07/2021 08:37:56 - INFO - __main__ - Step 81037: {'lr': 0.00022332610552688937, 'samples': 15559104, 'steps': 81036, 'loss/train': 1.0651581287384033} 11/07/2021 08:37:57 - INFO - __main__ - Step 81038: {'lr': 0.0002233208290808566, 'samples': 15559296, 'steps': 81037, 'loss/train': 1.3848762512207031} 11/07/2021 08:37:57 - INFO - __main__ - Step 81039: {'lr': 0.00022331555264684506, 'samples': 15559488, 'steps': 81038, 'loss/train': 1.2844079732894897} 11/07/2021 08:37:57 - INFO - __main__ - Step 81040: {'lr': 0.00022331027622485712, 'samples': 15559680, 'steps': 81039, 'loss/train': 1.1881752014160156} 11/07/2021 08:37:59 - INFO - __main__ - Step 81041: {'lr': 0.00022330499981489522, 'samples': 15559872, 'steps': 81040, 'loss/train': 1.5324710607528687} 11/07/2021 08:37:59 - INFO - __main__ - Step 81042: {'lr': 0.00022329972341696158, 'samples': 15560064, 'steps': 81041, 'loss/train': 0.29029834270477295} 11/07/2021 08:38:00 - INFO - __main__ - Step 81043: {'lr': 0.00022329444703105873, 'samples': 15560256, 'steps': 81042, 'loss/train': 0.8124886155128479} 11/07/2021 08:38:00 - INFO - __main__ - Step 81044: {'lr': 0.00022328917065718895, 'samples': 15560448, 'steps': 81043, 'loss/train': 0.9648832678794861} 11/07/2021 08:38:00 - INFO - __main__ - Step 81045: {'lr': 0.00022328389429535469, 'samples': 15560640, 'steps': 81044, 'loss/train': 0.9975717067718506} 11/07/2021 08:38:01 - INFO - __main__ - Step 81046: {'lr': 0.00022327861794555826, 'samples': 15560832, 'steps': 81045, 'loss/train': 1.52727472782135} 11/07/2021 08:38:02 - INFO - __main__ - Step 81047: {'lr': 0.0002232733416078021, 'samples': 15561024, 'steps': 81046, 'loss/train': 1.5272172689437866} 11/07/2021 08:38:02 - INFO - __main__ - Step 81048: {'lr': 0.00022326806528208854, 'samples': 15561216, 'steps': 81047, 'loss/train': 1.2915468215942383} 11/07/2021 08:38:02 - INFO - __main__ - Step 81049: {'lr': 0.00022326278896841998, 'samples': 15561408, 'steps': 81048, 'loss/train': 1.1834607124328613} 11/07/2021 08:38:03 - INFO - __main__ - Step 81050: {'lr': 0.00022325751266679886, 'samples': 15561600, 'steps': 81049, 'loss/train': 1.2369561195373535} 11/07/2021 08:38:04 - INFO - __main__ - Step 81051: {'lr': 0.0002232522363772275, 'samples': 15561792, 'steps': 81050, 'loss/train': 1.2281270027160645} 11/07/2021 08:38:04 - INFO - __main__ - Step 81052: {'lr': 0.00022324696009970817, 'samples': 15561984, 'steps': 81051, 'loss/train': 1.4040820598602295} 11/07/2021 08:38:04 - INFO - __main__ - Step 81053: {'lr': 0.0002232416838342434, 'samples': 15562176, 'steps': 81052, 'loss/train': 1.6937462091445923} 11/07/2021 08:38:05 - INFO - __main__ - Step 81054: {'lr': 0.00022323640758083548, 'samples': 15562368, 'steps': 81053, 'loss/train': 1.5735746622085571} 11/07/2021 08:38:05 - INFO - __main__ - Step 81055: {'lr': 0.00022323113133948687, 'samples': 15562560, 'steps': 81054, 'loss/train': 1.6249396800994873} 11/07/2021 08:38:05 - INFO - __main__ - Step 81056: {'lr': 0.00022322585511019984, 'samples': 15562752, 'steps': 81055, 'loss/train': 0.8763903975486755} 11/07/2021 08:38:06 - INFO - __main__ - Step 81057: {'lr': 0.00022322057889297687, 'samples': 15562944, 'steps': 81056, 'loss/train': 1.2998309135437012} 11/07/2021 08:38:07 - INFO - __main__ - Step 81058: {'lr': 0.00022321530268782025, 'samples': 15563136, 'steps': 81057, 'loss/train': 1.373473882675171} 11/07/2021 08:38:07 - INFO - __main__ - Step 81059: {'lr': 0.00022321002649473243, 'samples': 15563328, 'steps': 81058, 'loss/train': 1.6086353063583374} 11/07/2021 08:38:08 - INFO - __main__ - Step 81060: {'lr': 0.00022320475031371577, 'samples': 15563520, 'steps': 81059, 'loss/train': 1.6095505952835083} 11/07/2021 08:38:08 - INFO - __main__ - Step 81061: {'lr': 0.0002231994741447726, 'samples': 15563712, 'steps': 81060, 'loss/train': 1.2541828155517578} 11/07/2021 08:38:09 - INFO - __main__ - Step 81062: {'lr': 0.00022319419798790539, 'samples': 15563904, 'steps': 81061, 'loss/train': 1.0087366104125977} 11/07/2021 08:38:09 - INFO - __main__ - Step 81063: {'lr': 0.00022318892184311652, 'samples': 15564096, 'steps': 81062, 'loss/train': 1.4029723405838013} 11/07/2021 08:38:10 - INFO - __main__ - Step 81064: {'lr': 0.00022318364571040822, 'samples': 15564288, 'steps': 81063, 'loss/train': 1.217900037765503} 11/07/2021 08:38:10 - INFO - __main__ - Step 81065: {'lr': 0.00022317836958978294, 'samples': 15564480, 'steps': 81064, 'loss/train': 1.2823548316955566} 11/07/2021 08:38:10 - INFO - __main__ - Step 81066: {'lr': 0.0002231730934812431, 'samples': 15564672, 'steps': 81065, 'loss/train': 0.8859658241271973} 11/07/2021 08:38:11 - INFO - __main__ - Step 81067: {'lr': 0.00022316781738479104, 'samples': 15564864, 'steps': 81066, 'loss/train': 5.672998905181885} 11/07/2021 08:38:12 - INFO - __main__ - Step 81068: {'lr': 0.00022316254130042912, 'samples': 15565056, 'steps': 81067, 'loss/train': 1.289939284324646} 11/07/2021 08:38:12 - INFO - __main__ - Step 81069: {'lr': 0.00022315726522815978, 'samples': 15565248, 'steps': 81068, 'loss/train': 1.4720898866653442} 11/07/2021 08:38:12 - INFO - __main__ - Step 81070: {'lr': 0.00022315198916798533, 'samples': 15565440, 'steps': 81069, 'loss/train': 1.2393419742584229} 11/07/2021 08:38:13 - INFO - __main__ - Step 81071: {'lr': 0.0002231467131199082, 'samples': 15565632, 'steps': 81070, 'loss/train': 1.4116898775100708} 11/07/2021 08:38:14 - INFO - __main__ - Step 81072: {'lr': 0.00022314143708393073, 'samples': 15565824, 'steps': 81071, 'loss/train': 1.566779375076294} 11/07/2021 08:38:14 - INFO - __main__ - Step 81073: {'lr': 0.00022313616106005532, 'samples': 15566016, 'steps': 81072, 'loss/train': 1.5462146997451782} 11/07/2021 08:38:15 - INFO - __main__ - Step 81074: {'lr': 0.00022313088504828435, 'samples': 15566208, 'steps': 81073, 'loss/train': 1.1273494958877563} 11/07/2021 08:38:15 - INFO - __main__ - Step 81075: {'lr': 0.0002231256090486202, 'samples': 15566400, 'steps': 81074, 'loss/train': 1.6030092239379883} 11/07/2021 08:38:15 - INFO - __main__ - Step 81076: {'lr': 0.0002231203330610652, 'samples': 15566592, 'steps': 81075, 'loss/train': 1.3499562740325928} 11/07/2021 08:38:16 - INFO - __main__ - Step 81077: {'lr': 0.0002231150570856219, 'samples': 15566784, 'steps': 81076, 'loss/train': 1.4361449480056763} 11/07/2021 08:38:17 - INFO - __main__ - Step 81078: {'lr': 0.00022310978112229243, 'samples': 15566976, 'steps': 81077, 'loss/train': 1.3512580394744873} 11/07/2021 08:38:17 - INFO - __main__ - Step 81079: {'lr': 0.00022310450517107927, 'samples': 15567168, 'steps': 81078, 'loss/train': 1.342712640762329} 11/07/2021 08:38:17 - INFO - __main__ - Step 81080: {'lr': 0.00022309922923198483, 'samples': 15567360, 'steps': 81079, 'loss/train': 1.1499004364013672} 11/07/2021 08:38:18 - INFO - __main__ - Step 81081: {'lr': 0.00022309395330501143, 'samples': 15567552, 'steps': 81080, 'loss/train': 1.3336507081985474} 11/07/2021 08:38:19 - INFO - __main__ - Step 81082: {'lr': 0.00022308867739016152, 'samples': 15567744, 'steps': 81081, 'loss/train': 1.7565720081329346} 11/07/2021 08:38:19 - INFO - __main__ - Step 81083: {'lr': 0.00022308340148743738, 'samples': 15567936, 'steps': 81082, 'loss/train': 1.245194435119629} 11/07/2021 08:38:20 - INFO - __main__ - Step 81084: {'lr': 0.00022307812559684147, 'samples': 15568128, 'steps': 81083, 'loss/train': 1.0421957969665527} 11/07/2021 08:38:20 - INFO - __main__ - Step 81085: {'lr': 0.00022307284971837617, 'samples': 15568320, 'steps': 81084, 'loss/train': 1.3061131238937378} 11/07/2021 08:38:20 - INFO - __main__ - Step 81086: {'lr': 0.00022306757385204376, 'samples': 15568512, 'steps': 81085, 'loss/train': 1.2193384170532227} 11/07/2021 08:38:21 - INFO - __main__ - Step 81087: {'lr': 0.00022306229799784675, 'samples': 15568704, 'steps': 81086, 'loss/train': 1.306126594543457} 11/07/2021 08:38:22 - INFO - __main__ - Step 81088: {'lr': 0.0002230570221557874, 'samples': 15568896, 'steps': 81087, 'loss/train': 1.8141027688980103} 11/07/2021 08:38:22 - INFO - __main__ - Step 81089: {'lr': 0.0002230517463258682, 'samples': 15569088, 'steps': 81088, 'loss/train': 1.3640278577804565} 11/07/2021 08:38:22 - INFO - __main__ - Step 81090: {'lr': 0.00022304647050809155, 'samples': 15569280, 'steps': 81089, 'loss/train': 1.1844193935394287} 11/07/2021 08:38:23 - INFO - __main__ - Step 81091: {'lr': 0.00022304119470245963, 'samples': 15569472, 'steps': 81090, 'loss/train': 1.3139398097991943} 11/07/2021 08:38:23 - INFO - __main__ - Step 81092: {'lr': 0.00022303591890897493, 'samples': 15569664, 'steps': 81091, 'loss/train': 1.58975350856781} 11/07/2021 08:38:24 - INFO - __main__ - Step 81093: {'lr': 0.00022303064312763983, 'samples': 15569856, 'steps': 81092, 'loss/train': 1.1675945520401} 11/07/2021 08:38:25 - INFO - __main__ - Step 81094: {'lr': 0.00022302536735845668, 'samples': 15570048, 'steps': 81093, 'loss/train': 1.0813888311386108} 11/07/2021 08:38:25 - INFO - __main__ - Step 81095: {'lr': 0.0002230200916014279, 'samples': 15570240, 'steps': 81094, 'loss/train': 0.21382401883602142} 11/07/2021 08:38:25 - INFO - __main__ - Step 81096: {'lr': 0.0002230148158565559, 'samples': 15570432, 'steps': 81095, 'loss/train': 0.9025365710258484} 11/07/2021 08:38:26 - INFO - __main__ - Step 81097: {'lr': 0.00022300954012384296, 'samples': 15570624, 'steps': 81096, 'loss/train': 0.9892308712005615} 11/07/2021 08:38:27 - INFO - __main__ - Step 81098: {'lr': 0.0002230042644032915, 'samples': 15570816, 'steps': 81097, 'loss/train': 1.0751776695251465} 11/07/2021 08:38:27 - INFO - __main__ - Step 81099: {'lr': 0.00022299898869490394, 'samples': 15571008, 'steps': 81098, 'loss/train': 1.552776575088501} 11/07/2021 08:38:27 - INFO - __main__ - Step 81100: {'lr': 0.00022299371299868258, 'samples': 15571200, 'steps': 81099, 'loss/train': 1.6624702215194702} 11/07/2021 08:38:28 - INFO - __main__ - Step 81101: {'lr': 0.00022298843731462985, 'samples': 15571392, 'steps': 81100, 'loss/train': 1.6242549419403076} 11/07/2021 08:38:28 - INFO - __main__ - Step 81102: {'lr': 0.00022298316164274813, 'samples': 15571584, 'steps': 81101, 'loss/train': 1.9202165603637695} 11/07/2021 08:38:29 - INFO - __main__ - Step 81103: {'lr': 0.00022297788598303974, 'samples': 15571776, 'steps': 81102, 'loss/train': 1.6432095766067505} 11/07/2021 08:38:29 - INFO - __main__ - Step 81104: {'lr': 0.00022297261033550722, 'samples': 15571968, 'steps': 81103, 'loss/train': 1.405078649520874} 11/07/2021 08:38:30 - INFO - __main__ - Step 81105: {'lr': 0.00022296733470015273, 'samples': 15572160, 'steps': 81104, 'loss/train': 1.5146398544311523} 11/07/2021 08:38:30 - INFO - __main__ - Step 81106: {'lr': 0.00022296205907697874, 'samples': 15572352, 'steps': 81105, 'loss/train': 1.6052268743515015} 11/07/2021 08:38:30 - INFO - __main__ - Step 81107: {'lr': 0.00022295678346598763, 'samples': 15572544, 'steps': 81106, 'loss/train': 1.5264652967453003} 11/07/2021 08:38:31 - INFO - __main__ - Step 81108: {'lr': 0.00022295150786718178, 'samples': 15572736, 'steps': 81107, 'loss/train': 1.0461469888687134} 11/07/2021 08:38:32 - INFO - __main__ - Step 81109: {'lr': 0.00022294623228056353, 'samples': 15572928, 'steps': 81108, 'loss/train': 0.8923303484916687} 11/07/2021 08:38:32 - INFO - __main__ - Step 81110: {'lr': 0.00022294095670613535, 'samples': 15573120, 'steps': 81109, 'loss/train': 1.1364799737930298} 11/07/2021 08:38:32 - INFO - __main__ - Step 81111: {'lr': 0.0002229356811438995, 'samples': 15573312, 'steps': 81110, 'loss/train': 1.7671177387237549} 11/07/2021 08:38:33 - INFO - __main__ - Step 81112: {'lr': 0.00022293040559385848, 'samples': 15573504, 'steps': 81111, 'loss/train': 1.1989061832427979} 11/07/2021 08:38:33 - INFO - __main__ - Step 81113: {'lr': 0.00022292513005601453, 'samples': 15573696, 'steps': 81112, 'loss/train': 1.6629011631011963} 11/07/2021 08:38:34 - INFO - __main__ - Step 81114: {'lr': 0.00022291985453037016, 'samples': 15573888, 'steps': 81113, 'loss/train': 1.7021855115890503} 11/07/2021 08:38:35 - INFO - __main__ - Step 81115: {'lr': 0.00022291457901692764, 'samples': 15574080, 'steps': 81114, 'loss/train': 1.7304954528808594} 11/07/2021 08:38:35 - INFO - __main__ - Step 81116: {'lr': 0.0002229093035156894, 'samples': 15574272, 'steps': 81115, 'loss/train': 0.45037347078323364} 11/07/2021 08:38:35 - INFO - __main__ - Step 81117: {'lr': 0.00022290402802665793, 'samples': 15574464, 'steps': 81116, 'loss/train': 1.3707894086837769} 11/07/2021 08:38:36 - INFO - __main__ - Step 81118: {'lr': 0.00022289875254983537, 'samples': 15574656, 'steps': 81117, 'loss/train': 1.5604908466339111} 11/07/2021 08:38:37 - INFO - __main__ - Step 81119: {'lr': 0.00022289347708522424, 'samples': 15574848, 'steps': 81118, 'loss/train': 1.6272398233413696} 11/07/2021 08:38:37 - INFO - __main__ - Step 81120: {'lr': 0.00022288820163282683, 'samples': 15575040, 'steps': 81119, 'loss/train': 1.38294517993927} 11/07/2021 08:38:38 - INFO - __main__ - Step 81121: {'lr': 0.00022288292619264565, 'samples': 15575232, 'steps': 81120, 'loss/train': 1.2993077039718628} 11/07/2021 08:38:38 - INFO - __main__ - Step 81122: {'lr': 0.00022287765076468293, 'samples': 15575424, 'steps': 81121, 'loss/train': 1.52181875705719} 11/07/2021 08:38:38 - INFO - __main__ - Step 81123: {'lr': 0.00022287237534894118, 'samples': 15575616, 'steps': 81122, 'loss/train': 1.6517326831817627} 11/07/2021 08:38:39 - INFO - __main__ - Step 81124: {'lr': 0.0002228670999454227, 'samples': 15575808, 'steps': 81123, 'loss/train': 1.338042140007019} 11/07/2021 08:38:40 - INFO - __main__ - Step 81125: {'lr': 0.0002228618245541299, 'samples': 15576000, 'steps': 81124, 'loss/train': 1.3839524984359741} 11/07/2021 08:38:40 - INFO - __main__ - Step 81126: {'lr': 0.00022285654917506513, 'samples': 15576192, 'steps': 81125, 'loss/train': 1.545672059059143} 11/07/2021 08:38:40 - INFO - __main__ - Step 81127: {'lr': 0.0002228512738082308, 'samples': 15576384, 'steps': 81126, 'loss/train': 1.5291666984558105} 11/07/2021 08:38:41 - INFO - __main__ - Step 81128: {'lr': 0.00022284599845362924, 'samples': 15576576, 'steps': 81127, 'loss/train': 1.7574716806411743} 11/07/2021 08:38:41 - INFO - __main__ - Step 81129: {'lr': 0.00022284072311126284, 'samples': 15576768, 'steps': 81128, 'loss/train': 1.3525563478469849} 11/07/2021 08:38:43 - INFO - __main__ - Step 81130: {'lr': 0.000222835447781134, 'samples': 15576960, 'steps': 81129, 'loss/train': 0.922478973865509} 11/07/2021 08:38:43 - INFO - __main__ - Step 81131: {'lr': 0.00022283017246324524, 'samples': 15577152, 'steps': 81130, 'loss/train': 1.102968454360962} 11/07/2021 08:38:43 - INFO - __main__ - Step 81132: {'lr': 0.00022282489715759867, 'samples': 15577344, 'steps': 81131, 'loss/train': 1.4605306386947632} 11/07/2021 08:38:44 - INFO - __main__ - Step 81133: {'lr': 0.00022281962186419674, 'samples': 15577536, 'steps': 81132, 'loss/train': 1.6647733449935913} 11/07/2021 08:38:44 - INFO - __main__ - Step 81134: {'lr': 0.00022281434658304189, 'samples': 15577728, 'steps': 81133, 'loss/train': 1.0200510025024414} 11/07/2021 08:38:45 - INFO - __main__ - Step 81135: {'lr': 0.00022280907131413648, 'samples': 15577920, 'steps': 81134, 'loss/train': 1.5646631717681885} 11/07/2021 08:38:45 - INFO - __main__ - Step 81136: {'lr': 0.00022280379605748286, 'samples': 15578112, 'steps': 81135, 'loss/train': 0.27513086795806885} 11/07/2021 08:38:46 - INFO - __main__ - Step 81137: {'lr': 0.00022279852081308343, 'samples': 15578304, 'steps': 81136, 'loss/train': 1.2063571214675903} 11/07/2021 08:38:46 - INFO - __main__ - Step 81138: {'lr': 0.0002227932455809406, 'samples': 15578496, 'steps': 81137, 'loss/train': 1.5104670524597168} 11/07/2021 08:38:46 - INFO - __main__ - Step 81139: {'lr': 0.00022278797036105668, 'samples': 15578688, 'steps': 81138, 'loss/train': 1.821107029914856} 11/07/2021 08:38:47 - INFO - __main__ - Step 81140: {'lr': 0.0002227826951534341, 'samples': 15578880, 'steps': 81139, 'loss/train': 1.0026990175247192} 11/07/2021 08:38:48 - INFO - __main__ - Step 81141: {'lr': 0.00022277741995807524, 'samples': 15579072, 'steps': 81140, 'loss/train': 1.3603482246398926} 11/07/2021 08:38:48 - INFO - __main__ - Step 81142: {'lr': 0.0002227721447749825, 'samples': 15579264, 'steps': 81141, 'loss/train': 1.2583632469177246} 11/07/2021 08:38:48 - INFO - __main__ - Step 81143: {'lr': 0.00022276686960415813, 'samples': 15579456, 'steps': 81142, 'loss/train': 1.526748538017273} 11/07/2021 08:38:49 - INFO - __main__ - Step 81144: {'lr': 0.00022276159444560464, 'samples': 15579648, 'steps': 81143, 'loss/train': 1.7782952785491943} 11/07/2021 08:38:50 - INFO - __main__ - Step 81145: {'lr': 0.00022275631929932432, 'samples': 15579840, 'steps': 81144, 'loss/train': 1.5744783878326416} 11/07/2021 08:38:50 - INFO - __main__ - Step 81146: {'lr': 0.00022275104416531958, 'samples': 15580032, 'steps': 81145, 'loss/train': 1.838991403579712} 11/07/2021 08:38:51 - INFO - __main__ - Step 81147: {'lr': 0.00022274576904359277, 'samples': 15580224, 'steps': 81146, 'loss/train': 1.041379451751709} 11/07/2021 08:38:51 - INFO - __main__ - Step 81148: {'lr': 0.00022274049393414635, 'samples': 15580416, 'steps': 81147, 'loss/train': 2.2373290061950684} 11/07/2021 08:38:51 - INFO - __main__ - Step 81149: {'lr': 0.0002227352188369826, 'samples': 15580608, 'steps': 81148, 'loss/train': 1.188989281654358} 11/07/2021 08:38:53 - INFO - __main__ - Step 81150: {'lr': 0.00022272994375210396, 'samples': 15580800, 'steps': 81149, 'loss/train': 1.084976077079773} 11/07/2021 08:38:53 - INFO - __main__ - Step 81151: {'lr': 0.0002227246686795128, 'samples': 15580992, 'steps': 81150, 'loss/train': 1.6546152830123901} 11/07/2021 08:38:53 - INFO - __main__ - Step 81152: {'lr': 0.00022271939361921146, 'samples': 15581184, 'steps': 81151, 'loss/train': 0.11291224509477615} 11/07/2021 08:38:54 - INFO - __main__ - Step 81153: {'lr': 0.00022271411857120239, 'samples': 15581376, 'steps': 81152, 'loss/train': 1.818670630455017} 11/07/2021 08:38:54 - INFO - __main__ - Step 81154: {'lr': 0.00022270884353548786, 'samples': 15581568, 'steps': 81153, 'loss/train': 1.5362132787704468} 11/07/2021 08:38:55 - INFO - __main__ - Step 81155: {'lr': 0.00022270356851207033, 'samples': 15581760, 'steps': 81154, 'loss/train': 1.3309926986694336} 11/07/2021 08:38:56 - INFO - __main__ - Step 81156: {'lr': 0.00022269829350095213, 'samples': 15581952, 'steps': 81155, 'loss/train': 1.7306725978851318} 11/07/2021 08:38:56 - INFO - __main__ - Step 81157: {'lr': 0.00022269301850213566, 'samples': 15582144, 'steps': 81156, 'loss/train': 1.3178678750991821} 11/07/2021 08:38:56 - INFO - __main__ - Step 81158: {'lr': 0.00022268774351562337, 'samples': 15582336, 'steps': 81157, 'loss/train': 1.7398192882537842} 11/07/2021 08:38:57 - INFO - __main__ - Step 81159: {'lr': 0.0002226824685414175, 'samples': 15582528, 'steps': 81158, 'loss/train': 1.184030294418335} 11/07/2021 08:38:57 - INFO - __main__ - Step 81160: {'lr': 0.00022267719357952045, 'samples': 15582720, 'steps': 81159, 'loss/train': 1.7429919242858887} 11/07/2021 08:38:58 - INFO - __main__ - Step 81161: {'lr': 0.00022267191862993468, 'samples': 15582912, 'steps': 81160, 'loss/train': 1.5734676122665405} 11/07/2021 08:38:59 - INFO - __main__ - Step 81162: {'lr': 0.00022266664369266248, 'samples': 15583104, 'steps': 81161, 'loss/train': 1.9514714479446411} 11/07/2021 08:38:59 - INFO - __main__ - Step 81163: {'lr': 0.00022266136876770631, 'samples': 15583296, 'steps': 81162, 'loss/train': 0.6063646674156189} 11/07/2021 08:38:59 - INFO - __main__ - Step 81164: {'lr': 0.00022265609385506855, 'samples': 15583488, 'steps': 81163, 'loss/train': 2.115286111831665} 11/07/2021 08:39:00 - INFO - __main__ - Step 81165: {'lr': 0.00022265081895475147, 'samples': 15583680, 'steps': 81164, 'loss/train': 1.3512589931488037} 11/07/2021 08:39:01 - INFO - __main__ - Step 81166: {'lr': 0.00022264554406675751, 'samples': 15583872, 'steps': 81165, 'loss/train': 1.0923943519592285} 11/07/2021 08:39:01 - INFO - __main__ - Step 81167: {'lr': 0.00022264026919108904, 'samples': 15584064, 'steps': 81166, 'loss/train': 1.552407145500183} 11/07/2021 08:39:01 - INFO - __main__ - Step 81168: {'lr': 0.00022263499432774846, 'samples': 15584256, 'steps': 81167, 'loss/train': 0.8838455677032471} 11/07/2021 08:39:02 - INFO - __main__ - Step 81169: {'lr': 0.0002226297194767381, 'samples': 15584448, 'steps': 81168, 'loss/train': 1.2928820848464966} 11/07/2021 08:39:02 - INFO - __main__ - Step 81170: {'lr': 0.00022262444463806038, 'samples': 15584640, 'steps': 81169, 'loss/train': 0.9394320249557495} 11/07/2021 08:39:03 - INFO - __main__ - Step 81171: {'lr': 0.00022261916981171774, 'samples': 15584832, 'steps': 81170, 'loss/train': 4.1464104652404785} 11/07/2021 08:39:04 - INFO - __main__ - Step 81172: {'lr': 0.0002226138949977124, 'samples': 15585024, 'steps': 81171, 'loss/train': 1.522558331489563} 11/07/2021 08:39:04 - INFO - __main__ - Step 81173: {'lr': 0.00022260862019604684, 'samples': 15585216, 'steps': 81172, 'loss/train': 0.9694672226905823} 11/07/2021 08:39:04 - INFO - __main__ - Step 81174: {'lr': 0.0002226033454067234, 'samples': 15585408, 'steps': 81173, 'loss/train': 0.8782249093055725} 11/07/2021 08:39:05 - INFO - __main__ - Step 81175: {'lr': 0.0002225980706297445, 'samples': 15585600, 'steps': 81174, 'loss/train': 1.3946982622146606} 11/07/2021 08:39:06 - INFO - __main__ - Step 81176: {'lr': 0.00022259279586511245, 'samples': 15585792, 'steps': 81175, 'loss/train': 1.4048892259597778} 11/07/2021 08:39:06 - INFO - __main__ - Step 81177: {'lr': 0.00022258752111282967, 'samples': 15585984, 'steps': 81176, 'loss/train': 1.0944188833236694} 11/07/2021 08:39:06 - INFO - __main__ - Step 81178: {'lr': 0.00022258224637289854, 'samples': 15586176, 'steps': 81177, 'loss/train': 1.6480129957199097} 11/07/2021 08:39:07 - INFO - __main__ - Step 81179: {'lr': 0.0002225769716453214, 'samples': 15586368, 'steps': 81178, 'loss/train': 1.4536868333816528} 11/07/2021 08:39:07 - INFO - __main__ - Step 81180: {'lr': 0.00022257169693010065, 'samples': 15586560, 'steps': 81179, 'loss/train': 1.5950005054473877} 11/07/2021 08:39:09 - INFO - __main__ - Step 81181: {'lr': 0.00022256642222723868, 'samples': 15586752, 'steps': 81180, 'loss/train': 1.360231637954712} 11/07/2021 08:39:09 - INFO - __main__ - Step 81182: {'lr': 0.00022256114753673787, 'samples': 15586944, 'steps': 81181, 'loss/train': 1.544901967048645} 11/07/2021 08:39:09 - INFO - __main__ - Step 81183: {'lr': 0.00022255587285860057, 'samples': 15587136, 'steps': 81182, 'loss/train': 1.6275460720062256} 11/07/2021 08:39:10 - INFO - __main__ - Step 81184: {'lr': 0.00022255059819282924, 'samples': 15587328, 'steps': 81183, 'loss/train': 1.9033209085464478} 11/07/2021 08:39:10 - INFO - __main__ - Step 81185: {'lr': 0.00022254532353942614, 'samples': 15587520, 'steps': 81184, 'loss/train': 1.0315401554107666} 11/07/2021 08:39:10 - INFO - __main__ - Step 81186: {'lr': 0.0002225400488983937, 'samples': 15587712, 'steps': 81185, 'loss/train': 1.769028663635254} 11/07/2021 08:39:11 - INFO - __main__ - Step 81187: {'lr': 0.00022253477426973427, 'samples': 15587904, 'steps': 81186, 'loss/train': 1.9362456798553467} 11/07/2021 08:39:12 - INFO - __main__ - Step 81188: {'lr': 0.00022252949965345027, 'samples': 15588096, 'steps': 81187, 'loss/train': 1.1343423128128052} 11/07/2021 08:39:12 - INFO - __main__ - Step 81189: {'lr': 0.000222524225049544, 'samples': 15588288, 'steps': 81188, 'loss/train': 1.4242193698883057} 11/07/2021 08:39:13 - INFO - __main__ - Step 81190: {'lr': 0.00022251895045801793, 'samples': 15588480, 'steps': 81189, 'loss/train': 1.2498195171356201} 11/07/2021 08:39:13 - INFO - __main__ - Step 81191: {'lr': 0.00022251367587887438, 'samples': 15588672, 'steps': 81190, 'loss/train': 1.6328538656234741} 11/07/2021 08:39:13 - INFO - __main__ - Step 81192: {'lr': 0.00022250840131211576, 'samples': 15588864, 'steps': 81191, 'loss/train': 1.0273276567459106} 11/07/2021 08:39:14 - INFO - __main__ - Step 81193: {'lr': 0.00022250312675774442, 'samples': 15589056, 'steps': 81192, 'loss/train': 1.599449634552002} 11/07/2021 08:39:15 - INFO - __main__ - Step 81194: {'lr': 0.00022249785221576274, 'samples': 15589248, 'steps': 81193, 'loss/train': 1.6343443393707275} 11/07/2021 08:39:15 - INFO - __main__ - Step 81195: {'lr': 0.0002224925776861731, 'samples': 15589440, 'steps': 81194, 'loss/train': 1.3860065937042236} 11/07/2021 08:39:15 - INFO - __main__ - Step 81196: {'lr': 0.00022248730316897788, 'samples': 15589632, 'steps': 81195, 'loss/train': 1.1382756233215332} 11/07/2021 08:39:16 - INFO - __main__ - Step 81197: {'lr': 0.00022248202866417946, 'samples': 15589824, 'steps': 81196, 'loss/train': 0.892274796962738} 11/07/2021 08:39:17 - INFO - __main__ - Step 81198: {'lr': 0.00022247675417178035, 'samples': 15590016, 'steps': 81197, 'loss/train': 1.0510642528533936} 11/07/2021 08:39:17 - INFO - __main__ - Step 81199: {'lr': 0.00022247147969178265, 'samples': 15590208, 'steps': 81198, 'loss/train': 1.4961920976638794} 11/07/2021 08:39:17 - INFO - __main__ - Step 81200: {'lr': 0.0002224662052241889, 'samples': 15590400, 'steps': 81199, 'loss/train': 1.327295184135437} 11/07/2021 08:39:18 - INFO - __main__ - Step 81201: {'lr': 0.00022246093076900143, 'samples': 15590592, 'steps': 81200, 'loss/train': 0.8693020343780518} 11/07/2021 08:39:18 - INFO - __main__ - Step 81202: {'lr': 0.00022245565632622263, 'samples': 15590784, 'steps': 81201, 'loss/train': 1.28905189037323} 11/07/2021 08:39:20 - INFO - __main__ - Step 81203: {'lr': 0.00022245038189585493, 'samples': 15590976, 'steps': 81202, 'loss/train': 1.6483715772628784} 11/07/2021 08:39:20 - INFO - __main__ - Step 81204: {'lr': 0.00022244510747790062, 'samples': 15591168, 'steps': 81203, 'loss/train': 1.7164620161056519} 11/07/2021 08:39:20 - INFO - __main__ - Step 81205: {'lr': 0.00022243983307236213, 'samples': 15591360, 'steps': 81204, 'loss/train': 4.350152015686035} 11/07/2021 08:39:21 - INFO - __main__ - Step 81206: {'lr': 0.00022243455867924184, 'samples': 15591552, 'steps': 81205, 'loss/train': 3.695627450942993} 11/07/2021 08:39:21 - INFO - __main__ - Step 81207: {'lr': 0.0002224292842985421, 'samples': 15591744, 'steps': 81206, 'loss/train': 1.405455231666565} 11/07/2021 08:39:21 - INFO - __main__ - Step 81208: {'lr': 0.00022242400993026528, 'samples': 15591936, 'steps': 81207, 'loss/train': 1.2394561767578125} 11/07/2021 08:39:22 - INFO - __main__ - Step 81209: {'lr': 0.0002224187355744138, 'samples': 15592128, 'steps': 81208, 'loss/train': 1.360274076461792} 11/07/2021 08:39:23 - INFO - __main__ - Step 81210: {'lr': 0.00022241346123099, 'samples': 15592320, 'steps': 81209, 'loss/train': 1.7031408548355103} 11/07/2021 08:39:23 - INFO - __main__ - Step 81211: {'lr': 0.00022240818689999643, 'samples': 15592512, 'steps': 81210, 'loss/train': 0.8700584769248962} 11/07/2021 08:39:24 - INFO - __main__ - Step 81212: {'lr': 0.00022240291258143513, 'samples': 15592704, 'steps': 81211, 'loss/train': 1.6831244230270386} 11/07/2021 08:39:24 - INFO - __main__ - Step 81213: {'lr': 0.00022239763827530868, 'samples': 15592896, 'steps': 81212, 'loss/train': 1.0905026197433472} 11/07/2021 08:39:25 - INFO - __main__ - Step 81214: {'lr': 0.00022239236398161944, 'samples': 15593088, 'steps': 81213, 'loss/train': 1.2847509384155273} 11/07/2021 08:39:25 - INFO - __main__ - Step 81215: {'lr': 0.00022238708970036974, 'samples': 15593280, 'steps': 81214, 'loss/train': 1.6134698390960693} 11/07/2021 08:39:26 - INFO - __main__ - Step 81216: {'lr': 0.00022238181543156202, 'samples': 15593472, 'steps': 81215, 'loss/train': 1.0489720106124878} 11/07/2021 08:39:26 - INFO - __main__ - Step 81217: {'lr': 0.00022237654117519863, 'samples': 15593664, 'steps': 81216, 'loss/train': 1.5093296766281128} 11/07/2021 08:39:26 - INFO - __main__ - Step 81218: {'lr': 0.00022237126693128192, 'samples': 15593856, 'steps': 81217, 'loss/train': 2.023012161254883} 11/07/2021 08:39:27 - INFO - __main__ - Step 81219: {'lr': 0.0002223659926998143, 'samples': 15594048, 'steps': 81218, 'loss/train': 1.6456555128097534} 11/07/2021 08:39:28 - INFO - __main__ - Step 81220: {'lr': 0.00022236071848079814, 'samples': 15594240, 'steps': 81219, 'loss/train': 1.3098630905151367} 11/07/2021 08:39:28 - INFO - __main__ - Step 81221: {'lr': 0.00022235544427423582, 'samples': 15594432, 'steps': 81220, 'loss/train': 0.9166532754898071} 11/07/2021 08:39:28 - INFO - __main__ - Step 81222: {'lr': 0.0002223501700801297, 'samples': 15594624, 'steps': 81221, 'loss/train': 1.2516273260116577} 11/07/2021 08:39:29 - INFO - __main__ - Step 81223: {'lr': 0.00022234489589848216, 'samples': 15594816, 'steps': 81222, 'loss/train': 1.6229921579360962} 11/07/2021 08:39:29 - INFO - __main__ - Step 81224: {'lr': 0.00022233962172929562, 'samples': 15595008, 'steps': 81223, 'loss/train': 1.81756591796875} 11/07/2021 08:39:30 - INFO - __main__ - Step 81225: {'lr': 0.00022233434757257248, 'samples': 15595200, 'steps': 81224, 'loss/train': 1.1337592601776123} 11/07/2021 08:39:31 - INFO - __main__ - Step 81226: {'lr': 0.00022232907342831497, 'samples': 15595392, 'steps': 81225, 'loss/train': 1.2113802433013916} 11/07/2021 08:39:31 - INFO - __main__ - Step 81227: {'lr': 0.00022232379929652558, 'samples': 15595584, 'steps': 81226, 'loss/train': 1.5781282186508179} 11/07/2021 08:39:31 - INFO - __main__ - Step 81228: {'lr': 0.00022231852517720662, 'samples': 15595776, 'steps': 81227, 'loss/train': 1.6022754907608032} 11/07/2021 08:39:32 - INFO - __main__ - Step 81229: {'lr': 0.00022231325107036052, 'samples': 15595968, 'steps': 81228, 'loss/train': 1.6887590885162354} 11/07/2021 08:39:33 - INFO - __main__ - Step 81230: {'lr': 0.00022230797697598965, 'samples': 15596160, 'steps': 81229, 'loss/train': 0.34591126441955566} 11/07/2021 08:39:33 - INFO - __main__ - Step 81231: {'lr': 0.00022230270289409635, 'samples': 15596352, 'steps': 81230, 'loss/train': 1.2273656129837036} 11/07/2021 08:39:34 - INFO - __main__ - Step 81232: {'lr': 0.00022229742882468303, 'samples': 15596544, 'steps': 81231, 'loss/train': 0.17018882930278778} 11/07/2021 08:39:34 - INFO - __main__ - Step 81233: {'lr': 0.00022229215476775207, 'samples': 15596736, 'steps': 81232, 'loss/train': 1.465503215789795} 11/07/2021 08:39:34 - INFO - __main__ - Step 81234: {'lr': 0.00022228688072330587, 'samples': 15596928, 'steps': 81233, 'loss/train': 1.7946460247039795} 11/07/2021 08:39:36 - INFO - __main__ - Step 81235: {'lr': 0.00022228160669134672, 'samples': 15597120, 'steps': 81234, 'loss/train': 1.2033923864364624} 11/07/2021 08:39:36 - INFO - __main__ - Step 81236: {'lr': 0.0002222763326718771, 'samples': 15597312, 'steps': 81235, 'loss/train': 1.198492407798767} 11/07/2021 08:39:36 - INFO - __main__ - Step 81237: {'lr': 0.00022227105866489931, 'samples': 15597504, 'steps': 81236, 'loss/train': 1.2784783840179443} 11/07/2021 08:39:37 - INFO - __main__ - Step 81238: {'lr': 0.00022226578467041586, 'samples': 15597696, 'steps': 81237, 'loss/train': 1.2125014066696167} 11/07/2021 08:39:37 - INFO - __main__ - Step 81239: {'lr': 0.00022226051068842889, 'samples': 15597888, 'steps': 81238, 'loss/train': 0.4166297912597656} 11/07/2021 08:39:38 - INFO - __main__ - Step 81240: {'lr': 0.00022225523671894092, 'samples': 15598080, 'steps': 81239, 'loss/train': 1.7882308959960938} 11/07/2021 08:39:39 - INFO - __main__ - Step 81241: {'lr': 0.00022224996276195435, 'samples': 15598272, 'steps': 81240, 'loss/train': 1.3554812669754028} 11/07/2021 08:39:39 - INFO - __main__ - Step 81242: {'lr': 0.00022224468881747148, 'samples': 15598464, 'steps': 81241, 'loss/train': 1.5929476022720337} 11/07/2021 08:39:39 - INFO - __main__ - Step 81243: {'lr': 0.00022223941488549475, 'samples': 15598656, 'steps': 81242, 'loss/train': 1.457880973815918} 11/07/2021 08:39:40 - INFO - __main__ - Step 81244: {'lr': 0.00022223414096602648, 'samples': 15598848, 'steps': 81243, 'loss/train': 1.5378473997116089} 11/07/2021 08:39:40 - INFO - __main__ - Step 81245: {'lr': 0.00022222886705906912, 'samples': 15599040, 'steps': 81244, 'loss/train': 1.04153573513031} 11/07/2021 08:39:41 - INFO - __main__ - Step 81246: {'lr': 0.00022222359316462497, 'samples': 15599232, 'steps': 81245, 'loss/train': 1.7506418228149414} 11/07/2021 08:39:42 - INFO - __main__ - Step 81247: {'lr': 0.0002222183192826964, 'samples': 15599424, 'steps': 81246, 'loss/train': 1.313470721244812} 11/07/2021 08:39:42 - INFO - __main__ - Step 81248: {'lr': 0.00022221304541328592, 'samples': 15599616, 'steps': 81247, 'loss/train': 1.517435908317566} 11/07/2021 08:39:42 - INFO - __main__ - Step 81249: {'lr': 0.00022220777155639577, 'samples': 15599808, 'steps': 81248, 'loss/train': 1.5860196352005005} 11/07/2021 08:39:43 - INFO - __main__ - Step 81250: {'lr': 0.00022220249771202837, 'samples': 15600000, 'steps': 81249, 'loss/train': 1.2242426872253418} 11/07/2021 08:39:44 - INFO - __main__ - Step 81251: {'lr': 0.0002221972238801861, 'samples': 15600192, 'steps': 81250, 'loss/train': 1.7500357627868652} 11/07/2021 08:39:44 - INFO - __main__ - Step 81252: {'lr': 0.00022219195006087142, 'samples': 15600384, 'steps': 81251, 'loss/train': 1.6053122282028198} 11/07/2021 08:39:44 - INFO - __main__ - Step 81253: {'lr': 0.0002221866762540865, 'samples': 15600576, 'steps': 81252, 'loss/train': 1.2603938579559326} 11/07/2021 08:39:45 - INFO - __main__ - Step 81254: {'lr': 0.00022218140245983386, 'samples': 15600768, 'steps': 81253, 'loss/train': 1.1246281862258911} 11/07/2021 08:39:45 - INFO - __main__ - Step 81255: {'lr': 0.0002221761286781159, 'samples': 15600960, 'steps': 81254, 'loss/train': 1.5521705150604248} 11/07/2021 08:39:46 - INFO - __main__ - Step 81256: {'lr': 0.00022217085490893485, 'samples': 15601152, 'steps': 81255, 'loss/train': 1.0992668867111206} 11/07/2021 08:39:46 - INFO - __main__ - Step 81257: {'lr': 0.00022216558115229325, 'samples': 15601344, 'steps': 81256, 'loss/train': 0.46657106280326843} 11/07/2021 08:39:47 - INFO - __main__ - Step 81258: {'lr': 0.00022216030740819338, 'samples': 15601536, 'steps': 81257, 'loss/train': 1.3033703565597534} 11/07/2021 08:39:47 - INFO - __main__ - Step 81259: {'lr': 0.00022215503367663765, 'samples': 15601728, 'steps': 81258, 'loss/train': 1.7593556642532349} 11/07/2021 08:39:47 - INFO - __main__ - Step 81260: {'lr': 0.00022214975995762842, 'samples': 15601920, 'steps': 81259, 'loss/train': 1.679241418838501} 11/07/2021 08:39:48 - INFO - __main__ - Step 81261: {'lr': 0.00022214448625116812, 'samples': 15602112, 'steps': 81260, 'loss/train': 1.3315980434417725} 11/07/2021 08:39:49 - INFO - __main__ - Step 81262: {'lr': 0.00022213921255725906, 'samples': 15602304, 'steps': 81261, 'loss/train': 1.6601587533950806} 11/07/2021 08:39:49 - INFO - __main__ - Step 81263: {'lr': 0.00022213393887590363, 'samples': 15602496, 'steps': 81262, 'loss/train': 1.4072685241699219} 11/07/2021 08:39:49 - INFO - __main__ - Step 81264: {'lr': 0.00022212866520710423, 'samples': 15602688, 'steps': 81263, 'loss/train': 1.1666525602340698} 11/07/2021 08:39:50 - INFO - __main__ - Step 81265: {'lr': 0.00022212339155086333, 'samples': 15602880, 'steps': 81264, 'loss/train': 1.488768219947815} 11/07/2021 08:39:50 - INFO - __main__ - Step 81266: {'lr': 0.00022211811790718308, 'samples': 15603072, 'steps': 81265, 'loss/train': 0.7584787607192993} 11/07/2021 08:39:51 - INFO - __main__ - Step 81267: {'lr': 0.000222112844276066, 'samples': 15603264, 'steps': 81266, 'loss/train': 2.1093504428863525} 11/07/2021 08:39:52 - INFO - __main__ - Step 81268: {'lr': 0.0002221075706575144, 'samples': 15603456, 'steps': 81267, 'loss/train': 1.1269625425338745} 11/07/2021 08:39:52 - INFO - __main__ - Step 81269: {'lr': 0.00022210229705153076, 'samples': 15603648, 'steps': 81268, 'loss/train': 1.388452172279358} 11/07/2021 08:39:52 - INFO - __main__ - Step 81270: {'lr': 0.00022209702345811735, 'samples': 15603840, 'steps': 81269, 'loss/train': 1.080098271369934} 11/07/2021 08:39:53 - INFO - __main__ - Step 81271: {'lr': 0.0002220917498772766, 'samples': 15604032, 'steps': 81270, 'loss/train': 1.566182255744934} 11/07/2021 08:39:54 - INFO - __main__ - Step 81272: {'lr': 0.00022208647630901087, 'samples': 15604224, 'steps': 81271, 'loss/train': 1.5190675258636475} 11/07/2021 08:39:54 - INFO - __main__ - Step 81273: {'lr': 0.00022208120275332254, 'samples': 15604416, 'steps': 81272, 'loss/train': 1.9059034585952759} 11/07/2021 08:39:54 - INFO - __main__ - Step 81274: {'lr': 0.00022207592921021403, 'samples': 15604608, 'steps': 81273, 'loss/train': 1.5215303897857666} 11/07/2021 08:39:55 - INFO - __main__ - Step 81275: {'lr': 0.00022207065567968765, 'samples': 15604800, 'steps': 81274, 'loss/train': 1.67576003074646} 11/07/2021 08:39:55 - INFO - __main__ - Step 81276: {'lr': 0.0002220653821617458, 'samples': 15604992, 'steps': 81275, 'loss/train': 1.0405590534210205} 11/07/2021 08:39:56 - INFO - __main__ - Step 81277: {'lr': 0.0002220601086563909, 'samples': 15605184, 'steps': 81276, 'loss/train': 1.9017575979232788} 11/07/2021 08:39:57 - INFO - __main__ - Step 81278: {'lr': 0.00022205483516362523, 'samples': 15605376, 'steps': 81277, 'loss/train': 1.6615840196609497} 11/07/2021 08:39:57 - INFO - __main__ - Step 81279: {'lr': 0.0002220495616834513, 'samples': 15605568, 'steps': 81278, 'loss/train': 1.3405165672302246} 11/07/2021 08:39:57 - INFO - __main__ - Step 81280: {'lr': 0.00022204428821587133, 'samples': 15605760, 'steps': 81279, 'loss/train': 2.4312169551849365} 11/07/2021 08:39:58 - INFO - __main__ - Step 81281: {'lr': 0.0002220390147608878, 'samples': 15605952, 'steps': 81280, 'loss/train': 1.5932925939559937} 11/07/2021 08:39:58 - INFO - __main__ - Step 81282: {'lr': 0.000222033741318503, 'samples': 15606144, 'steps': 81281, 'loss/train': 0.369733065366745} 11/07/2021 08:40:00 - INFO - __main__ - Step 81283: {'lr': 0.00022202846788871942, 'samples': 15606336, 'steps': 81282, 'loss/train': 1.2883937358856201} 11/07/2021 08:40:00 - INFO - __main__ - Step 81284: {'lr': 0.00022202319447153933, 'samples': 15606528, 'steps': 81283, 'loss/train': 0.6243148446083069} 11/07/2021 08:40:00 - INFO - __main__ - Step 81285: {'lr': 0.0002220179210669652, 'samples': 15606720, 'steps': 81284, 'loss/train': 1.091718077659607} 11/07/2021 08:40:01 - INFO - __main__ - Step 81286: {'lr': 0.00022201264767499938, 'samples': 15606912, 'steps': 81285, 'loss/train': 0.8044429421424866} 11/07/2021 08:40:01 - INFO - __main__ - Step 81287: {'lr': 0.00022200737429564423, 'samples': 15607104, 'steps': 81286, 'loss/train': 1.1362919807434082} 11/07/2021 08:40:02 - INFO - __main__ - Step 81288: {'lr': 0.0002220021009289021, 'samples': 15607296, 'steps': 81287, 'loss/train': 1.578102707862854} 11/07/2021 08:40:02 - INFO - __main__ - Step 81289: {'lr': 0.0002219968275747754, 'samples': 15607488, 'steps': 81288, 'loss/train': 0.872670590877533} 11/07/2021 08:40:03 - INFO - __main__ - Step 81290: {'lr': 0.0002219915542332665, 'samples': 15607680, 'steps': 81289, 'loss/train': 1.8935414552688599} 11/07/2021 08:40:03 - INFO - __main__ - Step 81291: {'lr': 0.00022198628090437775, 'samples': 15607872, 'steps': 81290, 'loss/train': 1.1317814588546753} 11/07/2021 08:40:03 - INFO - __main__ - Step 81292: {'lr': 0.0002219810075881116, 'samples': 15608064, 'steps': 81291, 'loss/train': 1.5534695386886597} 11/07/2021 08:40:04 - INFO - __main__ - Step 81293: {'lr': 0.00022197573428447035, 'samples': 15608256, 'steps': 81292, 'loss/train': 1.5997240543365479} 11/07/2021 08:40:05 - INFO - __main__ - Step 81294: {'lr': 0.0002219704609934564, 'samples': 15608448, 'steps': 81293, 'loss/train': 0.9214150905609131} 11/07/2021 08:40:05 - INFO - __main__ - Step 81295: {'lr': 0.0002219651877150721, 'samples': 15608640, 'steps': 81294, 'loss/train': 1.1303367614746094} 11/07/2021 08:40:06 - INFO - __main__ - Step 81296: {'lr': 0.00022195991444931985, 'samples': 15608832, 'steps': 81295, 'loss/train': 1.311289668083191} 11/07/2021 08:40:06 - INFO - __main__ - Step 81297: {'lr': 0.00022195464119620207, 'samples': 15609024, 'steps': 81296, 'loss/train': 1.0694701671600342} 11/07/2021 08:40:06 - INFO - __main__ - Step 81298: {'lr': 0.0002219493679557211, 'samples': 15609216, 'steps': 81297, 'loss/train': 1.5334547758102417} 11/07/2021 08:40:08 - INFO - __main__ - Step 81299: {'lr': 0.0002219440947278793, 'samples': 15609408, 'steps': 81298, 'loss/train': 1.1984330415725708} 11/07/2021 08:40:08 - INFO - __main__ - Step 81300: {'lr': 0.000221938821512679, 'samples': 15609600, 'steps': 81299, 'loss/train': 1.0682132244110107} 11/07/2021 08:40:09 - INFO - __main__ - Step 81301: {'lr': 0.0002219335483101227, 'samples': 15609792, 'steps': 81300, 'loss/train': 1.5665656328201294} 11/07/2021 08:40:09 - INFO - __main__ - Step 81302: {'lr': 0.0002219282751202127, 'samples': 15609984, 'steps': 81301, 'loss/train': 1.521166443824768} 11/07/2021 08:40:09 - INFO - __main__ - Step 81303: {'lr': 0.00022192300194295137, 'samples': 15610176, 'steps': 81302, 'loss/train': 1.4442930221557617} 11/07/2021 08:40:10 - INFO - __main__ - Step 81304: {'lr': 0.0002219177287783411, 'samples': 15610368, 'steps': 81303, 'loss/train': 1.141269326210022} 11/07/2021 08:40:10 - INFO - __main__ - Step 81305: {'lr': 0.00022191245562638436, 'samples': 15610560, 'steps': 81304, 'loss/train': 0.39735329151153564} 11/07/2021 08:40:11 - INFO - __main__ - Step 81306: {'lr': 0.00022190718248708333, 'samples': 15610752, 'steps': 81305, 'loss/train': 0.18735170364379883} 11/07/2021 08:40:11 - INFO - __main__ - Step 81307: {'lr': 0.0002219019093604405, 'samples': 15610944, 'steps': 81306, 'loss/train': 1.1718677282333374} 11/07/2021 08:40:12 - INFO - __main__ - Step 81308: {'lr': 0.00022189663624645823, 'samples': 15611136, 'steps': 81307, 'loss/train': 1.4064913988113403} 11/07/2021 08:40:12 - INFO - __main__ - Step 81309: {'lr': 0.00022189136314513898, 'samples': 15611328, 'steps': 81308, 'loss/train': 1.415719985961914} 11/07/2021 08:40:12 - INFO - __main__ - Step 81310: {'lr': 0.00022188609005648497, 'samples': 15611520, 'steps': 81309, 'loss/train': 1.0926330089569092} 11/07/2021 08:40:14 - INFO - __main__ - Step 81311: {'lr': 0.00022188081698049867, 'samples': 15611712, 'steps': 81310, 'loss/train': 1.6214427947998047} 11/07/2021 08:40:14 - INFO - __main__ - Step 81312: {'lr': 0.00022187554391718247, 'samples': 15611904, 'steps': 81311, 'loss/train': 1.6269463300704956} 11/07/2021 08:40:14 - INFO - __main__ - Step 81313: {'lr': 0.00022187027086653866, 'samples': 15612096, 'steps': 81312, 'loss/train': 1.0383402109146118} 11/07/2021 08:40:15 - INFO - __main__ - Step 81314: {'lr': 0.0002218649978285697, 'samples': 15612288, 'steps': 81313, 'loss/train': 1.383771300315857} 11/07/2021 08:40:15 - INFO - __main__ - Step 81315: {'lr': 0.00022185972480327792, 'samples': 15612480, 'steps': 81314, 'loss/train': 1.3340604305267334} 11/07/2021 08:40:17 - INFO - __main__ - Step 81316: {'lr': 0.00022185445179066573, 'samples': 15612672, 'steps': 81315, 'loss/train': 1.6331653594970703} 11/07/2021 08:40:17 - INFO - __main__ - Step 81317: {'lr': 0.00022184917879073548, 'samples': 15612864, 'steps': 81316, 'loss/train': 1.5606329441070557} 11/07/2021 08:40:18 - INFO - __main__ - Step 81318: {'lr': 0.00022184390580348956, 'samples': 15613056, 'steps': 81317, 'loss/train': 1.444534182548523} 11/07/2021 08:40:18 - INFO - __main__ - Step 81319: {'lr': 0.0002218386328289304, 'samples': 15613248, 'steps': 81318, 'loss/train': 1.3804857730865479} 11/07/2021 08:40:18 - INFO - __main__ - Step 81320: {'lr': 0.00022183335986706033, 'samples': 15613440, 'steps': 81319, 'loss/train': 1.4933738708496094} 11/07/2021 08:40:19 - INFO - __main__ - Step 81321: {'lr': 0.00022182808691788164, 'samples': 15613632, 'steps': 81320, 'loss/train': 1.406497597694397} 11/07/2021 08:40:19 - INFO - __main__ - Step 81322: {'lr': 0.0002218228139813968, 'samples': 15613824, 'steps': 81321, 'loss/train': 1.7705904245376587} 11/07/2021 08:40:20 - INFO - __main__ - Step 81323: {'lr': 0.00022181754105760813, 'samples': 15614016, 'steps': 81322, 'loss/train': 1.7539489269256592} 11/07/2021 08:40:20 - INFO - __main__ - Step 81324: {'lr': 0.00022181226814651806, 'samples': 15614208, 'steps': 81323, 'loss/train': 1.1534367799758911} 11/07/2021 08:40:21 - INFO - __main__ - Step 81325: {'lr': 0.00022180699524812896, 'samples': 15614400, 'steps': 81324, 'loss/train': 1.9090449810028076} 11/07/2021 08:40:21 - INFO - __main__ - Step 81326: {'lr': 0.00022180172236244318, 'samples': 15614592, 'steps': 81325, 'loss/train': 1.3401083946228027} 11/07/2021 08:40:21 - INFO - __main__ - Step 81327: {'lr': 0.00022179644948946308, 'samples': 15614784, 'steps': 81326, 'loss/train': 2.2692508697509766} 11/07/2021 08:40:22 - INFO - __main__ - Step 81328: {'lr': 0.0002217911766291911, 'samples': 15614976, 'steps': 81327, 'loss/train': 1.390570878982544} 11/07/2021 08:40:23 - INFO - __main__ - Step 81329: {'lr': 0.00022178590378162956, 'samples': 15615168, 'steps': 81328, 'loss/train': 0.6597272157669067} 11/07/2021 08:40:23 - INFO - __main__ - Step 81330: {'lr': 0.00022178063094678089, 'samples': 15615360, 'steps': 81329, 'loss/train': 1.4529078006744385} 11/07/2021 08:40:24 - INFO - __main__ - Step 81331: {'lr': 0.0002217753581246474, 'samples': 15615552, 'steps': 81330, 'loss/train': 1.3694990873336792} 11/07/2021 08:40:24 - INFO - __main__ - Step 81332: {'lr': 0.00022177008531523162, 'samples': 15615744, 'steps': 81331, 'loss/train': 1.2544881105422974} 11/07/2021 08:40:24 - INFO - __main__ - Step 81333: {'lr': 0.0002217648125185357, 'samples': 15615936, 'steps': 81332, 'loss/train': 1.6329851150512695} 11/07/2021 08:40:25 - INFO - __main__ - Step 81334: {'lr': 0.0002217595397345621, 'samples': 15616128, 'steps': 81333, 'loss/train': 1.2386950254440308} 11/07/2021 08:40:26 - INFO - __main__ - Step 81335: {'lr': 0.00022175426696331325, 'samples': 15616320, 'steps': 81334, 'loss/train': 1.4761407375335693} 11/07/2021 08:40:26 - INFO - __main__ - Step 81336: {'lr': 0.00022174899420479148, 'samples': 15616512, 'steps': 81335, 'loss/train': 2.0759665966033936} 11/07/2021 08:40:26 - INFO - __main__ - Step 81337: {'lr': 0.00022174372145899914, 'samples': 15616704, 'steps': 81336, 'loss/train': 1.6816694736480713} 11/07/2021 08:40:27 - INFO - __main__ - Step 81338: {'lr': 0.00022173844872593867, 'samples': 15616896, 'steps': 81337, 'loss/train': 1.5590559244155884} 11/07/2021 08:40:28 - INFO - __main__ - Step 81339: {'lr': 0.00022173317600561243, 'samples': 15617088, 'steps': 81338, 'loss/train': 1.882919430732727} 11/07/2021 08:40:28 - INFO - __main__ - Step 81340: {'lr': 0.0002217279032980228, 'samples': 15617280, 'steps': 81339, 'loss/train': 1.1763722896575928} 11/07/2021 08:40:28 - INFO - __main__ - Step 81341: {'lr': 0.0002217226306031721, 'samples': 15617472, 'steps': 81340, 'loss/train': 1.3184220790863037} 11/07/2021 08:40:29 - INFO - __main__ - Step 81342: {'lr': 0.00022171735792106276, 'samples': 15617664, 'steps': 81341, 'loss/train': 1.5536761283874512} 11/07/2021 08:40:29 - INFO - __main__ - Step 81343: {'lr': 0.00022171208525169713, 'samples': 15617856, 'steps': 81342, 'loss/train': 1.4037847518920898} 11/07/2021 08:40:30 - INFO - __main__ - Step 81344: {'lr': 0.00022170681259507763, 'samples': 15618048, 'steps': 81343, 'loss/train': 1.8228284120559692} 11/07/2021 08:40:30 - INFO - __main__ - Step 81345: {'lr': 0.0002217015399512066, 'samples': 15618240, 'steps': 81344, 'loss/train': 1.5156787633895874} 11/07/2021 08:40:31 - INFO - __main__ - Step 81346: {'lr': 0.0002216962673200865, 'samples': 15618432, 'steps': 81345, 'loss/train': 1.7459542751312256} 11/07/2021 08:40:31 - INFO - __main__ - Step 81347: {'lr': 0.0002216909947017195, 'samples': 15618624, 'steps': 81346, 'loss/train': 1.4884614944458008} 11/07/2021 08:40:31 - INFO - __main__ - Step 81348: {'lr': 0.00022168572209610814, 'samples': 15618816, 'steps': 81347, 'loss/train': 1.316468358039856} 11/07/2021 08:40:33 - INFO - __main__ - Step 81349: {'lr': 0.00022168044950325477, 'samples': 15619008, 'steps': 81348, 'loss/train': 1.3556042909622192} 11/07/2021 08:40:33 - INFO - __main__ - Step 81350: {'lr': 0.00022167517692316173, 'samples': 15619200, 'steps': 81349, 'loss/train': 1.2561143636703491} 11/07/2021 08:40:33 - INFO - __main__ - Step 81351: {'lr': 0.00022166990435583143, 'samples': 15619392, 'steps': 81350, 'loss/train': 1.3982186317443848} 11/07/2021 08:40:34 - INFO - __main__ - Step 81352: {'lr': 0.00022166463180126622, 'samples': 15619584, 'steps': 81351, 'loss/train': 1.6554920673370361} 11/07/2021 08:40:34 - INFO - __main__ - Step 81353: {'lr': 0.00022165935925946847, 'samples': 15619776, 'steps': 81352, 'loss/train': 1.0933725833892822} 11/07/2021 08:40:35 - INFO - __main__ - Step 81354: {'lr': 0.0002216540867304406, 'samples': 15619968, 'steps': 81353, 'loss/train': 1.1761667728424072} 11/07/2021 08:40:35 - INFO - __main__ - Step 81355: {'lr': 0.00022164881421418497, 'samples': 15620160, 'steps': 81354, 'loss/train': 1.4931756258010864} 11/07/2021 08:40:36 - INFO - __main__ - Step 81356: {'lr': 0.00022164354171070396, 'samples': 15620352, 'steps': 81355, 'loss/train': 0.8754560351371765} 11/07/2021 08:40:36 - INFO - __main__ - Step 81357: {'lr': 0.00022163826921999988, 'samples': 15620544, 'steps': 81356, 'loss/train': 2.0758872032165527} 11/07/2021 08:40:36 - INFO - __main__ - Step 81358: {'lr': 0.00022163299674207517, 'samples': 15620736, 'steps': 81357, 'loss/train': 1.4577057361602783} 11/07/2021 08:40:37 - INFO - __main__ - Step 81359: {'lr': 0.00022162772427693233, 'samples': 15620928, 'steps': 81358, 'loss/train': 1.5409222841262817} 11/07/2021 08:40:38 - INFO - __main__ - Step 81360: {'lr': 0.00022162245182457348, 'samples': 15621120, 'steps': 81359, 'loss/train': 1.137533187866211} 11/07/2021 08:40:38 - INFO - __main__ - Step 81361: {'lr': 0.00022161717938500112, 'samples': 15621312, 'steps': 81360, 'loss/train': 1.7608715295791626} 11/07/2021 08:40:38 - INFO - __main__ - Step 81362: {'lr': 0.00022161190695821762, 'samples': 15621504, 'steps': 81361, 'loss/train': 1.1331850290298462} 11/07/2021 08:40:39 - INFO - __main__ - Step 81363: {'lr': 0.00022160663454422536, 'samples': 15621696, 'steps': 81362, 'loss/train': 1.502111554145813} 11/07/2021 08:40:39 - INFO - __main__ - Step 81364: {'lr': 0.0002216013621430267, 'samples': 15621888, 'steps': 81363, 'loss/train': 1.7858400344848633} 11/07/2021 08:40:40 - INFO - __main__ - Step 81365: {'lr': 0.00022159608975462402, 'samples': 15622080, 'steps': 81364, 'loss/train': 0.8490015864372253} 11/07/2021 08:40:41 - INFO - __main__ - Step 81366: {'lr': 0.00022159081737901975, 'samples': 15622272, 'steps': 81365, 'loss/train': 1.5152276754379272} 11/07/2021 08:40:41 - INFO - __main__ - Step 81367: {'lr': 0.00022158554501621616, 'samples': 15622464, 'steps': 81366, 'loss/train': 1.8796212673187256} 11/07/2021 08:40:41 - INFO - __main__ - Step 81368: {'lr': 0.00022158027266621573, 'samples': 15622656, 'steps': 81367, 'loss/train': 1.3835654258728027} 11/07/2021 08:40:42 - INFO - __main__ - Step 81369: {'lr': 0.00022157500032902075, 'samples': 15622848, 'steps': 81368, 'loss/train': 1.2048386335372925} 11/07/2021 08:40:43 - INFO - __main__ - Step 81370: {'lr': 0.00022156972800463365, 'samples': 15623040, 'steps': 81369, 'loss/train': 1.2085812091827393} 11/07/2021 08:40:43 - INFO - __main__ - Step 81371: {'lr': 0.0002215644556930568, 'samples': 15623232, 'steps': 81370, 'loss/train': 1.0621416568756104} 11/07/2021 08:40:43 - INFO - __main__ - Step 81372: {'lr': 0.0002215591833942926, 'samples': 15623424, 'steps': 81371, 'loss/train': 1.4916216135025024} 11/07/2021 08:40:44 - INFO - __main__ - Step 81373: {'lr': 0.00022155391110834343, 'samples': 15623616, 'steps': 81372, 'loss/train': 1.5184599161148071} 11/07/2021 08:40:44 - INFO - __main__ - Step 81374: {'lr': 0.00022154863883521158, 'samples': 15623808, 'steps': 81373, 'loss/train': 1.5970689058303833} 11/07/2021 08:40:44 - INFO - __main__ - Step 81375: {'lr': 0.00022154336657489947, 'samples': 15624000, 'steps': 81374, 'loss/train': 1.4144377708435059} 11/07/2021 08:40:46 - INFO - __main__ - Step 81376: {'lr': 0.00022153809432740946, 'samples': 15624192, 'steps': 81375, 'loss/train': 5.756356239318848} 11/07/2021 08:40:46 - INFO - __main__ - Step 81377: {'lr': 0.00022153282209274396, 'samples': 15624384, 'steps': 81376, 'loss/train': 1.3974777460098267} 11/07/2021 08:40:46 - INFO - __main__ - Step 81378: {'lr': 0.0002215275498709053, 'samples': 15624576, 'steps': 81377, 'loss/train': 1.4668529033660889} 11/07/2021 08:40:47 - INFO - __main__ - Step 81379: {'lr': 0.0002215222776618959, 'samples': 15624768, 'steps': 81378, 'loss/train': 0.9790958166122437} 11/07/2021 08:40:47 - INFO - __main__ - Step 81380: {'lr': 0.00022151700546571812, 'samples': 15624960, 'steps': 81379, 'loss/train': 1.4081364870071411} 11/07/2021 08:40:47 - INFO - __main__ - Step 81381: {'lr': 0.00022151173328237436, 'samples': 15625152, 'steps': 81380, 'loss/train': 1.987032175064087} 11/07/2021 08:40:48 - INFO - __main__ - Step 81382: {'lr': 0.00022150646111186695, 'samples': 15625344, 'steps': 81381, 'loss/train': 1.1109267473220825} 11/07/2021 08:40:49 - INFO - __main__ - Step 81383: {'lr': 0.0002215011889541983, 'samples': 15625536, 'steps': 81382, 'loss/train': 1.1986761093139648} 11/07/2021 08:40:49 - INFO - __main__ - Step 81384: {'lr': 0.0002214959168093708, 'samples': 15625728, 'steps': 81383, 'loss/train': 2.106708526611328} 11/07/2021 08:40:49 - INFO - __main__ - Step 81385: {'lr': 0.00022149064467738675, 'samples': 15625920, 'steps': 81384, 'loss/train': 1.5148444175720215} 11/07/2021 08:40:50 - INFO - __main__ - Step 81386: {'lr': 0.0002214853725582487, 'samples': 15626112, 'steps': 81385, 'loss/train': 1.588966727256775} 11/07/2021 08:40:52 - INFO - __main__ - Step 81387: {'lr': 0.00022148010045195882, 'samples': 15626304, 'steps': 81386, 'loss/train': 1.451953649520874} 11/07/2021 08:40:52 - INFO - __main__ - Step 81388: {'lr': 0.00022147482835851954, 'samples': 15626496, 'steps': 81387, 'loss/train': 1.4189082384109497} 11/07/2021 08:40:52 - INFO - __main__ - Step 81389: {'lr': 0.00022146955627793327, 'samples': 15626688, 'steps': 81388, 'loss/train': 1.6022979021072388} 11/07/2021 08:40:53 - INFO - __main__ - Step 81390: {'lr': 0.00022146428421020238, 'samples': 15626880, 'steps': 81389, 'loss/train': 1.2521147727966309} 11/07/2021 08:40:53 - INFO - __main__ - Step 81391: {'lr': 0.00022145901215532923, 'samples': 15627072, 'steps': 81390, 'loss/train': 1.7278599739074707} 11/07/2021 08:40:53 - INFO - __main__ - Step 81392: {'lr': 0.00022145374011331624, 'samples': 15627264, 'steps': 81391, 'loss/train': 1.7222901582717896} 11/07/2021 08:40:54 - INFO - __main__ - Step 81393: {'lr': 0.0002214484680841657, 'samples': 15627456, 'steps': 81392, 'loss/train': 1.3699156045913696} 11/07/2021 08:40:55 - INFO - __main__ - Step 81394: {'lr': 0.00022144319606788007, 'samples': 15627648, 'steps': 81393, 'loss/train': 1.5833204984664917} 11/07/2021 08:40:55 - INFO - __main__ - Step 81395: {'lr': 0.00022143792406446172, 'samples': 15627840, 'steps': 81394, 'loss/train': 1.6244580745697021} 11/07/2021 08:40:56 - INFO - __main__ - Step 81396: {'lr': 0.00022143265207391296, 'samples': 15628032, 'steps': 81395, 'loss/train': 1.4800916910171509} 11/07/2021 08:40:56 - INFO - __main__ - Step 81397: {'lr': 0.00022142738009623626, 'samples': 15628224, 'steps': 81396, 'loss/train': 1.9322938919067383} 11/07/2021 08:40:56 - INFO - __main__ - Step 81398: {'lr': 0.00022142210813143388, 'samples': 15628416, 'steps': 81397, 'loss/train': 0.9978832602500916} 11/07/2021 08:40:57 - INFO - __main__ - Step 81399: {'lr': 0.00022141683617950828, 'samples': 15628608, 'steps': 81398, 'loss/train': 1.706028938293457} 11/07/2021 08:40:58 - INFO - __main__ - Step 81400: {'lr': 0.00022141156424046194, 'samples': 15628800, 'steps': 81399, 'loss/train': 1.5272903442382812} 11/07/2021 08:40:58 - INFO - __main__ - Step 81401: {'lr': 0.00022140629231429698, 'samples': 15628992, 'steps': 81400, 'loss/train': 1.5237863063812256} 11/07/2021 08:40:58 - INFO - __main__ - Step 81402: {'lr': 0.0002214010204010159, 'samples': 15629184, 'steps': 81401, 'loss/train': 1.2897047996520996} 11/07/2021 08:40:59 - INFO - __main__ - Step 81403: {'lr': 0.0002213957485006211, 'samples': 15629376, 'steps': 81402, 'loss/train': 0.8416489958763123} 11/07/2021 08:41:00 - INFO - __main__ - Step 81404: {'lr': 0.0002213904766131149, 'samples': 15629568, 'steps': 81403, 'loss/train': 1.7013877630233765} 11/07/2021 08:41:00 - INFO - __main__ - Step 81405: {'lr': 0.00022138520473849975, 'samples': 15629760, 'steps': 81404, 'loss/train': 1.9818179607391357} 11/07/2021 08:41:01 - INFO - __main__ - Step 81406: {'lr': 0.00022137993287677795, 'samples': 15629952, 'steps': 81405, 'loss/train': 1.8104580640792847} 11/07/2021 08:41:01 - INFO - __main__ - Step 81407: {'lr': 0.00022137466102795192, 'samples': 15630144, 'steps': 81406, 'loss/train': 1.0727227926254272} 11/07/2021 08:41:01 - INFO - __main__ - Step 81408: {'lr': 0.00022136938919202403, 'samples': 15630336, 'steps': 81407, 'loss/train': 1.134738802909851} 11/07/2021 08:41:02 - INFO - __main__ - Step 81409: {'lr': 0.00022136411736899667, 'samples': 15630528, 'steps': 81408, 'loss/train': 1.3601337671279907} 11/07/2021 08:41:03 - INFO - __main__ - Step 81410: {'lr': 0.00022135884555887216, 'samples': 15630720, 'steps': 81409, 'loss/train': 1.3294240236282349} 11/07/2021 08:41:03 - INFO - __main__ - Step 81411: {'lr': 0.000221353573761653, 'samples': 15630912, 'steps': 81410, 'loss/train': 1.345991849899292} 11/07/2021 08:41:03 - INFO - __main__ - Step 81412: {'lr': 0.00022134830197734142, 'samples': 15631104, 'steps': 81411, 'loss/train': 1.6588410139083862} 11/07/2021 08:41:04 - INFO - __main__ - Step 81413: {'lr': 0.0002213430302059399, 'samples': 15631296, 'steps': 81412, 'loss/train': 1.4674087762832642} 11/07/2021 08:41:04 - INFO - __main__ - Step 81414: {'lr': 0.0002213377584474507, 'samples': 15631488, 'steps': 81413, 'loss/train': 1.6827104091644287} 11/07/2021 08:41:05 - INFO - __main__ - Step 81415: {'lr': 0.00022133248670187628, 'samples': 15631680, 'steps': 81414, 'loss/train': 1.905460238456726} 11/07/2021 08:41:05 - INFO - __main__ - Step 81416: {'lr': 0.00022132721496921897, 'samples': 15631872, 'steps': 81415, 'loss/train': 1.2710932493209839} 11/07/2021 08:41:06 - INFO - __main__ - Step 81417: {'lr': 0.00022132194324948123, 'samples': 15632064, 'steps': 81416, 'loss/train': 2.235311269760132} 11/07/2021 08:41:06 - INFO - __main__ - Step 81418: {'lr': 0.00022131667154266535, 'samples': 15632256, 'steps': 81417, 'loss/train': 1.2018892765045166} 11/07/2021 08:41:06 - INFO - __main__ - Step 81419: {'lr': 0.00022131139984877372, 'samples': 15632448, 'steps': 81418, 'loss/train': 1.330674409866333} 11/07/2021 08:41:07 - INFO - __main__ - Step 81420: {'lr': 0.00022130612816780878, 'samples': 15632640, 'steps': 81419, 'loss/train': 1.255860447883606} 11/07/2021 08:41:08 - INFO - __main__ - Step 81421: {'lr': 0.0002213008564997728, 'samples': 15632832, 'steps': 81420, 'loss/train': 1.513433575630188} 11/07/2021 08:41:08 - INFO - __main__ - Step 81422: {'lr': 0.00022129558484466826, 'samples': 15633024, 'steps': 81421, 'loss/train': 1.6675376892089844} 11/07/2021 08:41:09 - INFO - __main__ - Step 81423: {'lr': 0.00022129031320249748, 'samples': 15633216, 'steps': 81422, 'loss/train': 0.9841530323028564} 11/07/2021 08:41:09 - INFO - __main__ - Step 81424: {'lr': 0.0002212850415732628, 'samples': 15633408, 'steps': 81423, 'loss/train': 1.3924810886383057} 11/07/2021 08:41:10 - INFO - __main__ - Step 81425: {'lr': 0.00022127976995696665, 'samples': 15633600, 'steps': 81424, 'loss/train': 1.3688157796859741} 11/07/2021 08:41:11 - INFO - __main__ - Step 81426: {'lr': 0.00022127449835361145, 'samples': 15633792, 'steps': 81425, 'loss/train': 1.7876124382019043} 11/07/2021 08:41:11 - INFO - __main__ - Step 81427: {'lr': 0.00022126922676319948, 'samples': 15633984, 'steps': 81426, 'loss/train': 1.473946452140808} 11/07/2021 08:41:11 - INFO - __main__ - Step 81428: {'lr': 0.00022126395518573316, 'samples': 15634176, 'steps': 81427, 'loss/train': 1.516481876373291} 11/07/2021 08:41:12 - INFO - __main__ - Step 81429: {'lr': 0.00022125868362121481, 'samples': 15634368, 'steps': 81428, 'loss/train': 1.9723396301269531} 11/07/2021 08:41:12 - INFO - __main__ - Step 81430: {'lr': 0.0002212534120696469, 'samples': 15634560, 'steps': 81429, 'loss/train': 1.8790661096572876} 11/07/2021 08:41:13 - INFO - __main__ - Step 81431: {'lr': 0.00022124814053103175, 'samples': 15634752, 'steps': 81430, 'loss/train': 1.6156835556030273} 11/07/2021 08:41:14 - INFO - __main__ - Step 81432: {'lr': 0.00022124286900537175, 'samples': 15634944, 'steps': 81431, 'loss/train': 1.447946548461914} 11/07/2021 08:41:14 - INFO - __main__ - Step 81433: {'lr': 0.0002212375974926693, 'samples': 15635136, 'steps': 81432, 'loss/train': 0.7549266219139099} 11/07/2021 08:41:14 - INFO - __main__ - Step 81434: {'lr': 0.0002212323259929267, 'samples': 15635328, 'steps': 81433, 'loss/train': 0.9334145784378052} 11/07/2021 08:41:15 - INFO - __main__ - Step 81435: {'lr': 0.00022122705450614637, 'samples': 15635520, 'steps': 81434, 'loss/train': 1.2942836284637451} 11/07/2021 08:41:15 - INFO - __main__ - Step 81436: {'lr': 0.00022122178303233067, 'samples': 15635712, 'steps': 81435, 'loss/train': 1.3267384767532349} 11/07/2021 08:41:16 - INFO - __main__ - Step 81437: {'lr': 0.000221216511571482, 'samples': 15635904, 'steps': 81436, 'loss/train': 1.3867055177688599} 11/07/2021 08:41:16 - INFO - __main__ - Step 81438: {'lr': 0.00022121124012360274, 'samples': 15636096, 'steps': 81437, 'loss/train': 1.5528647899627686} 11/07/2021 08:41:17 - INFO - __main__ - Step 81439: {'lr': 0.00022120596868869524, 'samples': 15636288, 'steps': 81438, 'loss/train': 0.8174430131912231} 11/07/2021 08:41:17 - INFO - __main__ - Step 81440: {'lr': 0.00022120069726676194, 'samples': 15636480, 'steps': 81439, 'loss/train': 1.2853165864944458} 11/07/2021 08:41:17 - INFO - __main__ - Step 81441: {'lr': 0.00022119542585780511, 'samples': 15636672, 'steps': 81440, 'loss/train': 1.3428754806518555} 11/07/2021 08:41:19 - INFO - __main__ - Step 81442: {'lr': 0.0002211901544618272, 'samples': 15636864, 'steps': 81441, 'loss/train': 1.5777281522750854} 11/07/2021 08:41:19 - INFO - __main__ - Step 81443: {'lr': 0.00022118488307883052, 'samples': 15637056, 'steps': 81442, 'loss/train': 1.163357138633728} 11/07/2021 08:41:20 - INFO - __main__ - Step 81444: {'lr': 0.00022117961170881756, 'samples': 15637248, 'steps': 81443, 'loss/train': 1.8648403882980347} 11/07/2021 08:41:20 - INFO - __main__ - Step 81445: {'lr': 0.00022117434035179057, 'samples': 15637440, 'steps': 81444, 'loss/train': 1.8291057348251343} 11/07/2021 08:41:20 - INFO - __main__ - Step 81446: {'lr': 0.00022116906900775197, 'samples': 15637632, 'steps': 81445, 'loss/train': 1.5300315618515015} 11/07/2021 08:41:21 - INFO - __main__ - Step 81447: {'lr': 0.00022116379767670417, 'samples': 15637824, 'steps': 81446, 'loss/train': 1.5991153717041016} 11/07/2021 08:41:21 - INFO - __main__ - Step 81448: {'lr': 0.00022115852635864948, 'samples': 15638016, 'steps': 81447, 'loss/train': 1.2081236839294434} 11/07/2021 08:41:22 - INFO - __main__ - Step 81449: {'lr': 0.00022115325505359034, 'samples': 15638208, 'steps': 81448, 'loss/train': 0.11903572827577591} 11/07/2021 08:41:22 - INFO - __main__ - Step 81450: {'lr': 0.0002211479837615291, 'samples': 15638400, 'steps': 81449, 'loss/train': 1.7189496755599976} 11/07/2021 08:41:23 - INFO - __main__ - Step 81451: {'lr': 0.0002211427124824681, 'samples': 15638592, 'steps': 81450, 'loss/train': 1.148453712463379} 11/07/2021 08:41:23 - INFO - __main__ - Step 81452: {'lr': 0.00022113744121640978, 'samples': 15638784, 'steps': 81451, 'loss/train': 1.7301207780838013} 11/07/2021 08:41:23 - INFO - __main__ - Step 81453: {'lr': 0.0002211321699633565, 'samples': 15638976, 'steps': 81452, 'loss/train': 1.5914726257324219} 11/07/2021 08:41:24 - INFO - __main__ - Step 81454: {'lr': 0.0002211268987233106, 'samples': 15639168, 'steps': 81453, 'loss/train': 1.2931146621704102} 11/07/2021 08:41:25 - INFO - __main__ - Step 81455: {'lr': 0.00022112162749627452, 'samples': 15639360, 'steps': 81454, 'loss/train': 1.3790569305419922} 11/07/2021 08:41:25 - INFO - __main__ - Step 81456: {'lr': 0.00022111635628225052, 'samples': 15639552, 'steps': 81455, 'loss/train': 1.5916931629180908} 11/07/2021 08:41:26 - INFO - __main__ - Step 81457: {'lr': 0.00022111108508124106, 'samples': 15639744, 'steps': 81456, 'loss/train': 1.7809518575668335} 11/07/2021 08:41:26 - INFO - __main__ - Step 81458: {'lr': 0.0002211058138932485, 'samples': 15639936, 'steps': 81457, 'loss/train': 1.3359379768371582} 11/07/2021 08:41:27 - INFO - __main__ - Step 81459: {'lr': 0.00022110054271827522, 'samples': 15640128, 'steps': 81458, 'loss/train': 1.8137708902359009} 11/07/2021 08:41:27 - INFO - __main__ - Step 81460: {'lr': 0.00022109527155632358, 'samples': 15640320, 'steps': 81459, 'loss/train': 1.3984426259994507} 11/07/2021 08:41:28 - INFO - __main__ - Step 81461: {'lr': 0.00022109000040739597, 'samples': 15640512, 'steps': 81460, 'loss/train': 1.4421693086624146} 11/07/2021 08:41:28 - INFO - __main__ - Step 81462: {'lr': 0.00022108472927149475, 'samples': 15640704, 'steps': 81461, 'loss/train': 1.6936124563217163} 11/07/2021 08:41:28 - INFO - __main__ - Step 81463: {'lr': 0.0002210794581486223, 'samples': 15640896, 'steps': 81462, 'loss/train': 1.3355088233947754} 11/07/2021 08:41:29 - INFO - __main__ - Step 81464: {'lr': 0.000221074187038781, 'samples': 15641088, 'steps': 81463, 'loss/train': 1.8706961870193481} 11/07/2021 08:41:30 - INFO - __main__ - Step 81465: {'lr': 0.00022106891594197325, 'samples': 15641280, 'steps': 81464, 'loss/train': 1.290623426437378} 11/07/2021 08:41:30 - INFO - __main__ - Step 81466: {'lr': 0.0002210636448582014, 'samples': 15641472, 'steps': 81465, 'loss/train': 1.712324857711792} 11/07/2021 08:41:30 - INFO - __main__ - Step 81467: {'lr': 0.0002210583737874679, 'samples': 15641664, 'steps': 81466, 'loss/train': 0.9874708652496338} 11/07/2021 08:41:31 - INFO - __main__ - Step 81468: {'lr': 0.00022105310272977496, 'samples': 15641856, 'steps': 81467, 'loss/train': 1.3966691493988037} 11/07/2021 08:41:31 - INFO - __main__ - Step 81469: {'lr': 0.00022104783168512505, 'samples': 15642048, 'steps': 81468, 'loss/train': 1.5475894212722778} 11/07/2021 08:41:32 - INFO - __main__ - Step 81470: {'lr': 0.00022104256065352056, 'samples': 15642240, 'steps': 81469, 'loss/train': 0.8355773687362671} 11/07/2021 08:41:33 - INFO - __main__ - Step 81471: {'lr': 0.00022103728963496382, 'samples': 15642432, 'steps': 81470, 'loss/train': 0.5221130847930908} 11/07/2021 08:41:33 - INFO - __main__ - Step 81472: {'lr': 0.0002210320186294572, 'samples': 15642624, 'steps': 81471, 'loss/train': 1.4967979192733765} 11/07/2021 08:41:33 - INFO - __main__ - Step 81473: {'lr': 0.00022102674763700315, 'samples': 15642816, 'steps': 81472, 'loss/train': 1.4939239025115967} 11/07/2021 08:41:34 - INFO - __main__ - Step 81474: {'lr': 0.000221021476657604, 'samples': 15643008, 'steps': 81473, 'loss/train': 1.3826847076416016} 11/07/2021 08:41:35 - INFO - __main__ - Step 81475: {'lr': 0.0002210162056912621, 'samples': 15643200, 'steps': 81474, 'loss/train': 0.9719699025154114} 11/07/2021 08:41:35 - INFO - __main__ - Step 81476: {'lr': 0.00022101093473797986, 'samples': 15643392, 'steps': 81475, 'loss/train': 1.375934362411499} 11/07/2021 08:41:35 - INFO - __main__ - Step 81477: {'lr': 0.00022100566379775965, 'samples': 15643584, 'steps': 81476, 'loss/train': 1.3991092443466187} 11/07/2021 08:41:36 - INFO - __main__ - Step 81478: {'lr': 0.00022100039287060384, 'samples': 15643776, 'steps': 81477, 'loss/train': 2.9924914836883545} 11/07/2021 08:41:36 - INFO - __main__ - Step 81479: {'lr': 0.0002209951219565148, 'samples': 15643968, 'steps': 81478, 'loss/train': 1.4612351655960083} 11/07/2021 08:41:37 - INFO - __main__ - Step 81480: {'lr': 0.00022098985105549503, 'samples': 15644160, 'steps': 81479, 'loss/train': 1.4185240268707275} 11/07/2021 08:41:37 - INFO - __main__ - Step 81481: {'lr': 0.00022098458016754665, 'samples': 15644352, 'steps': 81480, 'loss/train': 1.2346453666687012} 11/07/2021 08:41:38 - INFO - __main__ - Step 81482: {'lr': 0.0002209793092926722, 'samples': 15644544, 'steps': 81481, 'loss/train': 1.981862187385559} 11/07/2021 08:41:38 - INFO - __main__ - Step 81483: {'lr': 0.00022097403843087402, 'samples': 15644736, 'steps': 81482, 'loss/train': 1.6779892444610596} 11/07/2021 08:41:39 - INFO - __main__ - Step 81484: {'lr': 0.0002209687675821545, 'samples': 15644928, 'steps': 81483, 'loss/train': 1.799046277999878} 11/07/2021 08:41:39 - INFO - __main__ - Step 81485: {'lr': 0.000220963496746516, 'samples': 15645120, 'steps': 81484, 'loss/train': 1.523764967918396} 11/07/2021 08:41:40 - INFO - __main__ - Step 81486: {'lr': 0.0002209582259239609, 'samples': 15645312, 'steps': 81485, 'loss/train': 1.2257719039916992} 11/07/2021 08:41:40 - INFO - __main__ - Step 81487: {'lr': 0.00022095295511449155, 'samples': 15645504, 'steps': 81486, 'loss/train': 1.70524263381958} 11/07/2021 08:41:40 - INFO - __main__ - Step 81488: {'lr': 0.00022094768431811035, 'samples': 15645696, 'steps': 81487, 'loss/train': 1.2351430654525757} 11/07/2021 08:41:41 - INFO - __main__ - Step 81489: {'lr': 0.0002209424135348197, 'samples': 15645888, 'steps': 81488, 'loss/train': 1.2936632633209229} 11/07/2021 08:41:42 - INFO - __main__ - Step 81490: {'lr': 0.00022093714276462194, 'samples': 15646080, 'steps': 81489, 'loss/train': 1.3530148267745972} 11/07/2021 08:41:42 - INFO - __main__ - Step 81491: {'lr': 0.00022093187200751947, 'samples': 15646272, 'steps': 81490, 'loss/train': 1.1857666969299316} 11/07/2021 08:41:43 - INFO - __main__ - Step 81492: {'lr': 0.00022092660126351462, 'samples': 15646464, 'steps': 81491, 'loss/train': 1.5717123746871948} 11/07/2021 08:41:43 - INFO - __main__ - Step 81493: {'lr': 0.0002209213305326098, 'samples': 15646656, 'steps': 81492, 'loss/train': 1.4293546676635742} 11/07/2021 08:41:43 - INFO - __main__ - Step 81494: {'lr': 0.00022091605981480752, 'samples': 15646848, 'steps': 81493, 'loss/train': 1.418731927871704} 11/07/2021 08:41:44 - INFO - __main__ - Step 81495: {'lr': 0.00022091078911010988, 'samples': 15647040, 'steps': 81494, 'loss/train': 1.2859251499176025} 11/07/2021 08:41:45 - INFO - __main__ - Step 81496: {'lr': 0.0002209055184185194, 'samples': 15647232, 'steps': 81495, 'loss/train': 1.593951940536499} 11/07/2021 08:41:45 - INFO - __main__ - Step 81497: {'lr': 0.00022090024774003847, 'samples': 15647424, 'steps': 81496, 'loss/train': 1.2384610176086426} 11/07/2021 08:41:45 - INFO - __main__ - Step 81498: {'lr': 0.0002208949770746694, 'samples': 15647616, 'steps': 81497, 'loss/train': 0.6696984767913818} 11/07/2021 08:41:46 - INFO - __main__ - Step 81499: {'lr': 0.00022088970642241462, 'samples': 15647808, 'steps': 81498, 'loss/train': 1.735770344734192} 11/07/2021 08:41:46 - INFO - __main__ - Step 81500: {'lr': 0.00022088443578327648, 'samples': 15648000, 'steps': 81499, 'loss/train': 0.5378074049949646} 11/07/2021 08:41:47 - INFO - __main__ - Step 81501: {'lr': 0.00022087916515725736, 'samples': 15648192, 'steps': 81500, 'loss/train': 1.524087905883789} 11/07/2021 08:41:48 - INFO - __main__ - Step 81502: {'lr': 0.00022087389454435966, 'samples': 15648384, 'steps': 81501, 'loss/train': 1.1371899843215942} 11/07/2021 08:41:48 - INFO - __main__ - Step 81503: {'lr': 0.0002208686239445857, 'samples': 15648576, 'steps': 81502, 'loss/train': 1.3870971202850342} 11/07/2021 08:41:48 - INFO - __main__ - Step 81504: {'lr': 0.00022086335335793792, 'samples': 15648768, 'steps': 81503, 'loss/train': 1.2175148725509644} 11/07/2021 08:41:49 - INFO - __main__ - Step 81505: {'lr': 0.00022085808278441866, 'samples': 15648960, 'steps': 81504, 'loss/train': 4.608717441558838} 11/07/2021 08:41:50 - INFO - __main__ - Step 81506: {'lr': 0.00022085281222403028, 'samples': 15649152, 'steps': 81505, 'loss/train': 1.4042989015579224} 11/07/2021 08:41:50 - INFO - __main__ - Step 81507: {'lr': 0.00022084754167677527, 'samples': 15649344, 'steps': 81506, 'loss/train': 1.36408269405365} 11/07/2021 08:41:50 - INFO - __main__ - Step 81508: {'lr': 0.00022084227114265584, 'samples': 15649536, 'steps': 81507, 'loss/train': 1.060217022895813} 11/07/2021 08:41:51 - INFO - __main__ - Step 81509: {'lr': 0.0002208370006216744, 'samples': 15649728, 'steps': 81508, 'loss/train': 2.3465824127197266} 11/07/2021 08:41:51 - INFO - __main__ - Step 81510: {'lr': 0.0002208317301138334, 'samples': 15649920, 'steps': 81509, 'loss/train': 0.9992831945419312} 11/07/2021 08:41:52 - INFO - __main__ - Step 81511: {'lr': 0.00022082645961913513, 'samples': 15650112, 'steps': 81510, 'loss/train': 1.8549271821975708} 11/07/2021 08:41:52 - INFO - __main__ - Step 81512: {'lr': 0.00022082118913758204, 'samples': 15650304, 'steps': 81511, 'loss/train': 1.5118159055709839} 11/07/2021 08:41:53 - INFO - __main__ - Step 81513: {'lr': 0.00022081591866917645, 'samples': 15650496, 'steps': 81512, 'loss/train': 1.0114243030548096} 11/07/2021 08:41:53 - INFO - __main__ - Step 81514: {'lr': 0.00022081064821392074, 'samples': 15650688, 'steps': 81513, 'loss/train': 2.567021369934082} 11/07/2021 08:41:53 - INFO - __main__ - Step 81515: {'lr': 0.00022080537777181733, 'samples': 15650880, 'steps': 81514, 'loss/train': 1.7146042585372925} 11/07/2021 08:41:54 - INFO - __main__ - Step 81516: {'lr': 0.00022080010734286856, 'samples': 15651072, 'steps': 81515, 'loss/train': 1.353407621383667} 11/07/2021 08:41:55 - INFO - __main__ - Step 81517: {'lr': 0.0002207948369270768, 'samples': 15651264, 'steps': 81516, 'loss/train': 1.1327277421951294} 11/07/2021 08:41:55 - INFO - __main__ - Step 81518: {'lr': 0.00022078956652444445, 'samples': 15651456, 'steps': 81517, 'loss/train': 1.4920704364776611} 11/07/2021 08:41:56 - INFO - __main__ - Step 81519: {'lr': 0.00022078429613497385, 'samples': 15651648, 'steps': 81518, 'loss/train': 1.92818284034729} 11/07/2021 08:41:56 - INFO - __main__ - Step 81520: {'lr': 0.00022077902575866744, 'samples': 15651840, 'steps': 81519, 'loss/train': 1.4802296161651611} 11/07/2021 08:41:56 - INFO - __main__ - Step 81521: {'lr': 0.00022077375539552763, 'samples': 15652032, 'steps': 81520, 'loss/train': 1.4204685688018799} 11/07/2021 08:41:57 - INFO - __main__ - Step 81522: {'lr': 0.0002207684850455566, 'samples': 15652224, 'steps': 81521, 'loss/train': 0.39572471380233765} 11/07/2021 08:41:58 - INFO - __main__ - Step 81523: {'lr': 0.00022076321470875684, 'samples': 15652416, 'steps': 81522, 'loss/train': 1.5373841524124146} 11/07/2021 08:41:58 - INFO - __main__ - Step 81524: {'lr': 0.00022075794438513073, 'samples': 15652608, 'steps': 81523, 'loss/train': 1.4693243503570557} 11/07/2021 08:41:58 - INFO - __main__ - Step 81525: {'lr': 0.00022075267407468063, 'samples': 15652800, 'steps': 81524, 'loss/train': 1.5586457252502441} 11/07/2021 08:41:59 - INFO - __main__ - Step 81526: {'lr': 0.00022074740377740892, 'samples': 15652992, 'steps': 81525, 'loss/train': 1.4636069536209106} 11/07/2021 08:42:00 - INFO - __main__ - Step 81527: {'lr': 0.000220742133493318, 'samples': 15653184, 'steps': 81526, 'loss/train': 1.2551987171173096} 11/07/2021 08:42:00 - INFO - __main__ - Step 81528: {'lr': 0.00022073686322241021, 'samples': 15653376, 'steps': 81527, 'loss/train': 1.1624832153320312} 11/07/2021 08:42:00 - INFO - __main__ - Step 81529: {'lr': 0.00022073159296468796, 'samples': 15653568, 'steps': 81528, 'loss/train': 1.6096713542938232} 11/07/2021 08:42:01 - INFO - __main__ - Step 81530: {'lr': 0.00022072632272015358, 'samples': 15653760, 'steps': 81529, 'loss/train': 1.3173918724060059} 11/07/2021 08:42:01 - INFO - __main__ - Step 81531: {'lr': 0.00022072105248880947, 'samples': 15653952, 'steps': 81530, 'loss/train': 1.3073091506958008} 11/07/2021 08:42:02 - INFO - __main__ - Step 81532: {'lr': 0.000220715782270658, 'samples': 15654144, 'steps': 81531, 'loss/train': 1.294578194618225} 11/07/2021 08:42:03 - INFO - __main__ - Step 81533: {'lr': 0.00022071051206570155, 'samples': 15654336, 'steps': 81532, 'loss/train': 1.1717476844787598} 11/07/2021 08:42:03 - INFO - __main__ - Step 81534: {'lr': 0.0002207052418739426, 'samples': 15654528, 'steps': 81533, 'loss/train': 1.729412317276001} 11/07/2021 08:42:03 - INFO - __main__ - Step 81535: {'lr': 0.00022069997169538332, 'samples': 15654720, 'steps': 81534, 'loss/train': 1.526706337928772} 11/07/2021 08:42:04 - INFO - __main__ - Step 81536: {'lr': 0.00022069470153002617, 'samples': 15654912, 'steps': 81535, 'loss/train': 1.6874281167984009} 11/07/2021 08:42:04 - INFO - __main__ - Step 81537: {'lr': 0.00022068943137787354, 'samples': 15655104, 'steps': 81536, 'loss/train': 1.621427059173584} 11/07/2021 08:42:05 - INFO - __main__ - Step 81538: {'lr': 0.00022068416123892777, 'samples': 15655296, 'steps': 81537, 'loss/train': 1.772334098815918} 11/07/2021 08:42:05 - INFO - __main__ - Step 81539: {'lr': 0.00022067889111319127, 'samples': 15655488, 'steps': 81538, 'loss/train': 1.4772838354110718} 11/07/2021 08:42:06 - INFO - __main__ - Step 81540: {'lr': 0.00022067362100066645, 'samples': 15655680, 'steps': 81539, 'loss/train': 1.732142448425293} 11/07/2021 08:42:06 - INFO - __main__ - Step 81541: {'lr': 0.00022066835090135562, 'samples': 15655872, 'steps': 81540, 'loss/train': 1.716836929321289} 11/07/2021 08:42:06 - INFO - __main__ - Step 81542: {'lr': 0.00022066308081526118, 'samples': 15656064, 'steps': 81541, 'loss/train': 1.3544036149978638} 11/07/2021 08:42:07 - INFO - __main__ - Step 81543: {'lr': 0.0002206578107423855, 'samples': 15656256, 'steps': 81542, 'loss/train': 1.2555657625198364} 11/07/2021 08:42:08 - INFO - __main__ - Step 81544: {'lr': 0.00022065254068273096, 'samples': 15656448, 'steps': 81543, 'loss/train': 1.1573244333267212} 11/07/2021 08:42:08 - INFO - __main__ - Step 81545: {'lr': 0.0002206472706363, 'samples': 15656640, 'steps': 81544, 'loss/train': 2.099613666534424} 11/07/2021 08:42:09 - INFO - __main__ - Step 81546: {'lr': 0.00022064200060309486, 'samples': 15656832, 'steps': 81545, 'loss/train': 1.1610832214355469} 11/07/2021 08:42:09 - INFO - __main__ - Step 81547: {'lr': 0.000220636730583118, 'samples': 15657024, 'steps': 81546, 'loss/train': 1.3477455377578735} 11/07/2021 08:42:10 - INFO - __main__ - Step 81548: {'lr': 0.0002206314605763718, 'samples': 15657216, 'steps': 81547, 'loss/train': 1.2643202543258667} 11/07/2021 08:42:10 - INFO - __main__ - Step 81549: {'lr': 0.00022062619058285855, 'samples': 15657408, 'steps': 81548, 'loss/train': 1.585000991821289} 11/07/2021 08:42:11 - INFO - __main__ - Step 81550: {'lr': 0.0002206209206025807, 'samples': 15657600, 'steps': 81549, 'loss/train': 1.1619623899459839} 11/07/2021 08:42:11 - INFO - __main__ - Step 81551: {'lr': 0.00022061565063554063, 'samples': 15657792, 'steps': 81550, 'loss/train': 1.3193070888519287} 11/07/2021 08:42:11 - INFO - __main__ - Step 81552: {'lr': 0.00022061038068174065, 'samples': 15657984, 'steps': 81551, 'loss/train': 1.8980082273483276} 11/07/2021 08:42:12 - INFO - __main__ - Step 81553: {'lr': 0.00022060511074118322, 'samples': 15658176, 'steps': 81552, 'loss/train': 1.664791226387024} 11/07/2021 08:42:13 - INFO - __main__ - Step 81554: {'lr': 0.00022059984081387066, 'samples': 15658368, 'steps': 81553, 'loss/train': 1.2106046676635742} 11/07/2021 08:42:13 - INFO - __main__ - Step 81555: {'lr': 0.00022059457089980533, 'samples': 15658560, 'steps': 81554, 'loss/train': 1.1428526639938354} 11/07/2021 08:42:13 - INFO - __main__ - Step 81556: {'lr': 0.00022058930099898974, 'samples': 15658752, 'steps': 81555, 'loss/train': 1.329706072807312} 11/07/2021 08:42:14 - INFO - __main__ - Step 81557: {'lr': 0.00022058403111142609, 'samples': 15658944, 'steps': 81556, 'loss/train': 1.4028483629226685} 11/07/2021 08:42:14 - INFO - __main__ - Step 81558: {'lr': 0.0002205787612371168, 'samples': 15659136, 'steps': 81557, 'loss/train': 1.446208119392395} 11/07/2021 08:42:15 - INFO - __main__ - Step 81559: {'lr': 0.00022057349137606424, 'samples': 15659328, 'steps': 81558, 'loss/train': 1.521811604499817} 11/07/2021 08:42:15 - INFO - __main__ - Step 81560: {'lr': 0.00022056822152827086, 'samples': 15659520, 'steps': 81559, 'loss/train': 1.0440915822982788} 11/07/2021 08:42:16 - INFO - __main__ - Step 81561: {'lr': 0.00022056295169373903, 'samples': 15659712, 'steps': 81560, 'loss/train': 1.1651185750961304} 11/07/2021 08:42:16 - INFO - __main__ - Step 81562: {'lr': 0.00022055768187247103, 'samples': 15659904, 'steps': 81561, 'loss/train': 1.3978832960128784} 11/07/2021 08:42:17 - INFO - __main__ - Step 81563: {'lr': 0.00022055241206446927, 'samples': 15660096, 'steps': 81562, 'loss/train': 0.7358897924423218} 11/07/2021 08:42:17 - INFO - __main__ - Step 81564: {'lr': 0.00022054714226973617, 'samples': 15660288, 'steps': 81563, 'loss/train': 1.649998426437378} 11/07/2021 08:42:18 - INFO - __main__ - Step 81565: {'lr': 0.000220541872488274, 'samples': 15660480, 'steps': 81564, 'loss/train': 1.5707076787948608} 11/07/2021 08:42:18 - INFO - __main__ - Step 81566: {'lr': 0.00022053660272008528, 'samples': 15660672, 'steps': 81565, 'loss/train': 1.3680903911590576} 11/07/2021 08:42:19 - INFO - __main__ - Step 81567: {'lr': 0.00022053133296517233, 'samples': 15660864, 'steps': 81566, 'loss/train': 1.8508960008621216} 11/07/2021 08:42:19 - INFO - __main__ - Step 81568: {'lr': 0.00022052606322353746, 'samples': 15661056, 'steps': 81567, 'loss/train': 1.2082380056381226} 11/07/2021 08:42:20 - INFO - __main__ - Step 81569: {'lr': 0.00022052079349518312, 'samples': 15661248, 'steps': 81568, 'loss/train': 1.3578557968139648} 11/07/2021 08:42:20 - INFO - __main__ - Step 81570: {'lr': 0.0002205155237801116, 'samples': 15661440, 'steps': 81569, 'loss/train': 1.0590089559555054} 11/07/2021 08:42:21 - INFO - __main__ - Step 81571: {'lr': 0.00022051025407832537, 'samples': 15661632, 'steps': 81570, 'loss/train': 1.355408787727356} 11/07/2021 08:42:21 - INFO - __main__ - Step 81572: {'lr': 0.00022050498438982673, 'samples': 15661824, 'steps': 81571, 'loss/train': 0.9452390074729919} 11/07/2021 08:42:21 - INFO - __main__ - Step 81573: {'lr': 0.00022049971471461814, 'samples': 15662016, 'steps': 81572, 'loss/train': 1.414781928062439} 11/07/2021 08:42:22 - INFO - __main__ - Step 81574: {'lr': 0.00022049444505270195, 'samples': 15662208, 'steps': 81573, 'loss/train': 1.6954917907714844} 11/07/2021 08:42:23 - INFO - __main__ - Step 81575: {'lr': 0.00022048917540408046, 'samples': 15662400, 'steps': 81574, 'loss/train': 0.44563767313957214} 11/07/2021 08:42:23 - INFO - __main__ - Step 81576: {'lr': 0.00022048390576875608, 'samples': 15662592, 'steps': 81575, 'loss/train': 1.2124638557434082} 11/07/2021 08:42:23 - INFO - __main__ - Step 81577: {'lr': 0.00022047863614673118, 'samples': 15662784, 'steps': 81576, 'loss/train': 1.3773339986801147} 11/07/2021 08:42:24 - INFO - __main__ - Step 81578: {'lr': 0.00022047336653800825, 'samples': 15662976, 'steps': 81577, 'loss/train': 1.28872549533844} 11/07/2021 08:42:25 - INFO - __main__ - Step 81579: {'lr': 0.00022046809694258949, 'samples': 15663168, 'steps': 81578, 'loss/train': 1.3140770196914673} 11/07/2021 08:42:25 - INFO - __main__ - Step 81580: {'lr': 0.00022046282736047735, 'samples': 15663360, 'steps': 81579, 'loss/train': 1.8905231952667236} 11/07/2021 08:42:25 - INFO - __main__ - Step 81581: {'lr': 0.0002204575577916742, 'samples': 15663552, 'steps': 81580, 'loss/train': 1.338619351387024} 11/07/2021 08:42:26 - INFO - __main__ - Step 81582: {'lr': 0.00022045228823618242, 'samples': 15663744, 'steps': 81581, 'loss/train': 0.1179317831993103} 11/07/2021 08:42:26 - INFO - __main__ - Step 81583: {'lr': 0.0002204470186940044, 'samples': 15663936, 'steps': 81582, 'loss/train': 1.485188364982605} 11/07/2021 08:42:27 - INFO - __main__ - Step 81584: {'lr': 0.00022044174916514248, 'samples': 15664128, 'steps': 81583, 'loss/train': 1.4322404861450195} 11/07/2021 08:42:28 - INFO - __main__ - Step 81585: {'lr': 0.00022043647964959905, 'samples': 15664320, 'steps': 81584, 'loss/train': 1.1540896892547607} 11/07/2021 08:42:28 - INFO - __main__ - Step 81586: {'lr': 0.0002204312101473765, 'samples': 15664512, 'steps': 81585, 'loss/train': 0.6694316864013672} 11/07/2021 08:42:28 - INFO - __main__ - Step 81587: {'lr': 0.00022042594065847717, 'samples': 15664704, 'steps': 81586, 'loss/train': 1.3438936471939087} 11/07/2021 08:42:29 - INFO - __main__ - Step 81588: {'lr': 0.0002204206711829035, 'samples': 15664896, 'steps': 81587, 'loss/train': 1.2515846490859985} 11/07/2021 08:42:29 - INFO - __main__ - Step 81589: {'lr': 0.00022041540172065786, 'samples': 15665088, 'steps': 81588, 'loss/train': 1.4491418600082397} 11/07/2021 08:42:30 - INFO - __main__ - Step 81590: {'lr': 0.0002204101322717425, 'samples': 15665280, 'steps': 81589, 'loss/train': 1.7183257341384888} 11/07/2021 08:42:30 - INFO - __main__ - Step 81591: {'lr': 0.00022040486283615991, 'samples': 15665472, 'steps': 81590, 'loss/train': 1.1449838876724243} 11/07/2021 08:42:31 - INFO - __main__ - Step 81592: {'lr': 0.00022039959341391238, 'samples': 15665664, 'steps': 81591, 'loss/train': 1.4804725646972656} 11/07/2021 08:42:31 - INFO - __main__ - Step 81593: {'lr': 0.00022039432400500236, 'samples': 15665856, 'steps': 81592, 'loss/train': 0.8201496005058289} 11/07/2021 08:42:32 - INFO - __main__ - Step 81594: {'lr': 0.00022038905460943224, 'samples': 15666048, 'steps': 81593, 'loss/train': 1.4501105546951294} 11/07/2021 08:42:33 - INFO - __main__ - Step 81595: {'lr': 0.00022038378522720432, 'samples': 15666240, 'steps': 81594, 'loss/train': 1.887821912765503} 11/07/2021 08:42:33 - INFO - __main__ - Step 81596: {'lr': 0.000220378515858321, 'samples': 15666432, 'steps': 81595, 'loss/train': 1.3482261896133423} 11/07/2021 08:42:33 - INFO - __main__ - Step 81597: {'lr': 0.00022037324650278468, 'samples': 15666624, 'steps': 81596, 'loss/train': 1.7391127347946167} 11/07/2021 08:42:34 - INFO - __main__ - Step 81598: {'lr': 0.0002203679771605977, 'samples': 15666816, 'steps': 81597, 'loss/train': 1.584681749343872} 11/07/2021 08:42:34 - INFO - __main__ - Step 81599: {'lr': 0.00022036270783176246, 'samples': 15667008, 'steps': 81598, 'loss/train': 1.4165384769439697} 11/07/2021 08:42:35 - INFO - __main__ - Step 81600: {'lr': 0.00022035743851628133, 'samples': 15667200, 'steps': 81599, 'loss/train': 1.8134219646453857} 11/07/2021 08:42:35 - INFO - __main__ - Step 81601: {'lr': 0.00022035216921415679, 'samples': 15667392, 'steps': 81600, 'loss/train': 1.4275217056274414} 11/07/2021 08:42:36 - INFO - __main__ - Step 81602: {'lr': 0.000220346899925391, 'samples': 15667584, 'steps': 81601, 'loss/train': 1.5954638719558716} 11/07/2021 08:42:36 - INFO - __main__ - Step 81603: {'lr': 0.00022034163064998645, 'samples': 15667776, 'steps': 81602, 'loss/train': 1.5892845392227173} 11/07/2021 08:42:36 - INFO - __main__ - Step 81604: {'lr': 0.00022033636138794546, 'samples': 15667968, 'steps': 81603, 'loss/train': 1.165216088294983} 11/07/2021 08:42:37 - INFO - __main__ - Step 81605: {'lr': 0.00022033109213927049, 'samples': 15668160, 'steps': 81604, 'loss/train': 1.4798848628997803} 11/07/2021 08:42:38 - INFO - __main__ - Step 81606: {'lr': 0.00022032582290396386, 'samples': 15668352, 'steps': 81605, 'loss/train': 1.3410285711288452} 11/07/2021 08:42:38 - INFO - __main__ - Step 81607: {'lr': 0.00022032055368202794, 'samples': 15668544, 'steps': 81606, 'loss/train': 1.4691270589828491} 11/07/2021 08:42:39 - INFO - __main__ - Step 81608: {'lr': 0.00022031528447346514, 'samples': 15668736, 'steps': 81607, 'loss/train': 1.5525095462799072} 11/07/2021 08:42:39 - INFO - __main__ - Step 81609: {'lr': 0.0002203100152782778, 'samples': 15668928, 'steps': 81608, 'loss/train': 1.2540836334228516} 11/07/2021 08:42:39 - INFO - __main__ - Step 81610: {'lr': 0.00022030474609646832, 'samples': 15669120, 'steps': 81609, 'loss/train': 1.1484986543655396} 11/07/2021 08:42:40 - INFO - __main__ - Step 81611: {'lr': 0.00022029947692803908, 'samples': 15669312, 'steps': 81610, 'loss/train': 1.0473926067352295} 11/07/2021 08:42:41 - INFO - __main__ - Step 81612: {'lr': 0.00022029420777299242, 'samples': 15669504, 'steps': 81611, 'loss/train': 1.5808777809143066} 11/07/2021 08:42:41 - INFO - __main__ - Step 81613: {'lr': 0.00022028893863133074, 'samples': 15669696, 'steps': 81612, 'loss/train': 1.4139825105667114} 11/07/2021 08:42:41 - INFO - __main__ - Step 81614: {'lr': 0.0002202836695030564, 'samples': 15669888, 'steps': 81613, 'loss/train': 1.5367612838745117} 11/07/2021 08:42:42 - INFO - __main__ - Step 81615: {'lr': 0.00022027840038817188, 'samples': 15670080, 'steps': 81614, 'loss/train': 1.3861445188522339} 11/07/2021 08:42:43 - INFO - __main__ - Step 81616: {'lr': 0.00022027313128667933, 'samples': 15670272, 'steps': 81615, 'loss/train': 1.7022160291671753} 11/07/2021 08:42:43 - INFO - __main__ - Step 81617: {'lr': 0.00022026786219858129, 'samples': 15670464, 'steps': 81616, 'loss/train': 1.7355926036834717} 11/07/2021 08:42:44 - INFO - __main__ - Step 81618: {'lr': 0.00022026259312388005, 'samples': 15670656, 'steps': 81617, 'loss/train': 1.3028373718261719} 11/07/2021 08:42:44 - INFO - __main__ - Step 81619: {'lr': 0.00022025732406257806, 'samples': 15670848, 'steps': 81618, 'loss/train': 1.422918677330017} 11/07/2021 08:42:44 - INFO - __main__ - Step 81620: {'lr': 0.00022025205501467765, 'samples': 15671040, 'steps': 81619, 'loss/train': 1.585235357284546} 11/07/2021 08:42:45 - INFO - __main__ - Step 81621: {'lr': 0.00022024678598018123, 'samples': 15671232, 'steps': 81620, 'loss/train': 1.2805286645889282} 11/07/2021 08:42:46 - INFO - __main__ - Step 81622: {'lr': 0.00022024151695909108, 'samples': 15671424, 'steps': 81621, 'loss/train': 1.1511701345443726} 11/07/2021 08:42:46 - INFO - __main__ - Step 81623: {'lr': 0.0002202362479514097, 'samples': 15671616, 'steps': 81622, 'loss/train': 1.2306632995605469} 11/07/2021 08:42:46 - INFO - __main__ - Step 81624: {'lr': 0.0002202309789571394, 'samples': 15671808, 'steps': 81623, 'loss/train': 0.575927197933197} 11/07/2021 08:42:47 - INFO - __main__ - Step 81625: {'lr': 0.00022022570997628254, 'samples': 15672000, 'steps': 81624, 'loss/train': 0.923926055431366} 11/07/2021 08:42:47 - INFO - __main__ - Step 81626: {'lr': 0.00022022044100884154, 'samples': 15672192, 'steps': 81625, 'loss/train': 0.7808361053466797} 11/07/2021 08:42:48 - INFO - __main__ - Step 81627: {'lr': 0.00022021517205481875, 'samples': 15672384, 'steps': 81626, 'loss/train': 1.2154935598373413} 11/07/2021 08:42:48 - INFO - __main__ - Step 81628: {'lr': 0.00022020990311421665, 'samples': 15672576, 'steps': 81627, 'loss/train': 1.4538606405258179} 11/07/2021 08:42:49 - INFO - __main__ - Step 81629: {'lr': 0.0002202046341870374, 'samples': 15672768, 'steps': 81628, 'loss/train': 1.3381571769714355} 11/07/2021 08:42:49 - INFO - __main__ - Step 81630: {'lr': 0.00022019936527328346, 'samples': 15672960, 'steps': 81629, 'loss/train': 1.4374077320098877} 11/07/2021 08:42:49 - INFO - __main__ - Step 81631: {'lr': 0.00022019409637295722, 'samples': 15673152, 'steps': 81630, 'loss/train': 1.6593637466430664} 11/07/2021 08:42:50 - INFO - __main__ - Step 81632: {'lr': 0.0002201888274860611, 'samples': 15673344, 'steps': 81631, 'loss/train': 1.5551111698150635} 11/07/2021 08:42:51 - INFO - __main__ - Step 81633: {'lr': 0.0002201835586125974, 'samples': 15673536, 'steps': 81632, 'loss/train': 2.3972113132476807} 11/07/2021 08:42:51 - INFO - __main__ - Step 81634: {'lr': 0.00022017828975256856, 'samples': 15673728, 'steps': 81633, 'loss/train': 1.5453431606292725} 11/07/2021 08:42:52 - INFO - __main__ - Step 81635: {'lr': 0.0002201730209059769, 'samples': 15673920, 'steps': 81634, 'loss/train': 1.4439364671707153} 11/07/2021 08:42:52 - INFO - __main__ - Step 81636: {'lr': 0.00022016775207282484, 'samples': 15674112, 'steps': 81635, 'loss/train': 1.4720247983932495} 11/07/2021 08:42:53 - INFO - __main__ - Step 81637: {'lr': 0.00022016248325311473, 'samples': 15674304, 'steps': 81636, 'loss/train': 1.205870509147644} 11/07/2021 08:42:53 - INFO - __main__ - Step 81638: {'lr': 0.0002201572144468489, 'samples': 15674496, 'steps': 81637, 'loss/train': 0.9499673843383789} 11/07/2021 08:42:54 - INFO - __main__ - Step 81639: {'lr': 0.0002201519456540298, 'samples': 15674688, 'steps': 81638, 'loss/train': 1.3201888799667358} 11/07/2021 08:42:54 - INFO - __main__ - Step 81640: {'lr': 0.00022014667687465979, 'samples': 15674880, 'steps': 81639, 'loss/train': 1.3721216917037964} 11/07/2021 08:42:54 - INFO - __main__ - Step 81641: {'lr': 0.0002201414081087412, 'samples': 15675072, 'steps': 81640, 'loss/train': 0.4138330817222595} 11/07/2021 08:42:55 - INFO - __main__ - Step 81642: {'lr': 0.00022013613935627653, 'samples': 15675264, 'steps': 81641, 'loss/train': 1.2527780532836914} 11/07/2021 08:42:56 - INFO - __main__ - Step 81643: {'lr': 0.00022013087061726797, 'samples': 15675456, 'steps': 81642, 'loss/train': 1.7050459384918213} 11/07/2021 08:42:56 - INFO - __main__ - Step 81644: {'lr': 0.00022012560189171797, 'samples': 15675648, 'steps': 81643, 'loss/train': 1.5164048671722412} 11/07/2021 08:42:56 - INFO - __main__ - Step 81645: {'lr': 0.0002201203331796289, 'samples': 15675840, 'steps': 81644, 'loss/train': 1.1446678638458252} 11/07/2021 08:42:57 - INFO - __main__ - Step 81646: {'lr': 0.00022011506448100317, 'samples': 15676032, 'steps': 81645, 'loss/train': 1.2869737148284912} 11/07/2021 08:42:58 - INFO - __main__ - Step 81647: {'lr': 0.0002201097957958431, 'samples': 15676224, 'steps': 81646, 'loss/train': 1.435243010520935} 11/07/2021 08:42:58 - INFO - __main__ - Step 81648: {'lr': 0.00022010452712415112, 'samples': 15676416, 'steps': 81647, 'loss/train': 1.4130334854125977} 11/07/2021 08:42:59 - INFO - __main__ - Step 81649: {'lr': 0.0002200992584659296, 'samples': 15676608, 'steps': 81648, 'loss/train': 1.4145926237106323} 11/07/2021 08:42:59 - INFO - __main__ - Step 81650: {'lr': 0.00022009398982118087, 'samples': 15676800, 'steps': 81649, 'loss/train': 1.1974486112594604} 11/07/2021 08:42:59 - INFO - __main__ - Step 81651: {'lr': 0.0002200887211899073, 'samples': 15676992, 'steps': 81650, 'loss/train': 1.358222484588623} 11/07/2021 08:43:00 - INFO - __main__ - Step 81652: {'lr': 0.0002200834525721113, 'samples': 15677184, 'steps': 81651, 'loss/train': 0.37498846650123596} 11/07/2021 08:43:01 - INFO - __main__ - Step 81653: {'lr': 0.00022007818396779528, 'samples': 15677376, 'steps': 81652, 'loss/train': 0.6411874294281006} 11/07/2021 08:43:01 - INFO - __main__ - Step 81654: {'lr': 0.00022007291537696154, 'samples': 15677568, 'steps': 81653, 'loss/train': 1.5512737035751343} 11/07/2021 08:43:01 - INFO - __main__ - Step 81655: {'lr': 0.00022006764679961257, 'samples': 15677760, 'steps': 81654, 'loss/train': 1.159802794456482} 11/07/2021 08:43:02 - INFO - __main__ - Step 81656: {'lr': 0.0002200623782357506, 'samples': 15677952, 'steps': 81655, 'loss/train': 1.1908243894577026} 11/07/2021 08:43:03 - INFO - __main__ - Step 81657: {'lr': 0.000220057109685378, 'samples': 15678144, 'steps': 81656, 'loss/train': 1.8299839496612549} 11/07/2021 08:43:03 - INFO - __main__ - Step 81658: {'lr': 0.00022005184114849723, 'samples': 15678336, 'steps': 81657, 'loss/train': 1.6591215133666992} 11/07/2021 08:43:03 - INFO - __main__ - Step 81659: {'lr': 0.00022004657262511066, 'samples': 15678528, 'steps': 81658, 'loss/train': 0.9272492527961731} 11/07/2021 08:43:04 - INFO - __main__ - Step 81660: {'lr': 0.0002200413041152206, 'samples': 15678720, 'steps': 81659, 'loss/train': 1.4340544939041138} 11/07/2021 08:43:04 - INFO - __main__ - Step 81661: {'lr': 0.0002200360356188295, 'samples': 15678912, 'steps': 81660, 'loss/train': 1.2879737615585327} 11/07/2021 08:43:05 - INFO - __main__ - Step 81662: {'lr': 0.0002200307671359397, 'samples': 15679104, 'steps': 81661, 'loss/train': 1.3511829376220703} 11/07/2021 08:43:06 - INFO - __main__ - Step 81663: {'lr': 0.00022002549866655355, 'samples': 15679296, 'steps': 81662, 'loss/train': 1.6625138521194458} 11/07/2021 08:43:06 - INFO - __main__ - Step 81664: {'lr': 0.00022002023021067347, 'samples': 15679488, 'steps': 81663, 'loss/train': 1.2090250253677368} 11/07/2021 08:43:06 - INFO - __main__ - Step 81665: {'lr': 0.00022001496176830178, 'samples': 15679680, 'steps': 81664, 'loss/train': 1.3197599649429321} 11/07/2021 08:43:07 - INFO - __main__ - Step 81666: {'lr': 0.00022000969333944094, 'samples': 15679872, 'steps': 81665, 'loss/train': 1.0524674654006958} 11/07/2021 08:43:08 - INFO - __main__ - Step 81667: {'lr': 0.00022000442492409322, 'samples': 15680064, 'steps': 81666, 'loss/train': 0.2519298493862152} 11/07/2021 08:43:08 - INFO - __main__ - Step 81668: {'lr': 0.00021999915652226115, 'samples': 15680256, 'steps': 81667, 'loss/train': 1.5178312063217163} 11/07/2021 08:43:08 - INFO - __main__ - Step 81669: {'lr': 0.00021999388813394695, 'samples': 15680448, 'steps': 81668, 'loss/train': 1.5668760538101196} 11/07/2021 08:43:09 - INFO - __main__ - Step 81670: {'lr': 0.00021998861975915297, 'samples': 15680640, 'steps': 81669, 'loss/train': 1.3736240863800049} 11/07/2021 08:43:09 - INFO - __main__ - Step 81671: {'lr': 0.0002199833513978817, 'samples': 15680832, 'steps': 81670, 'loss/train': 1.9104045629501343} 11/07/2021 08:43:09 - INFO - __main__ - Step 81672: {'lr': 0.00021997808305013544, 'samples': 15681024, 'steps': 81671, 'loss/train': 1.227770447731018} 11/07/2021 08:43:10 - INFO - __main__ - Step 81673: {'lr': 0.00021997281471591658, 'samples': 15681216, 'steps': 81672, 'loss/train': 1.626141905784607} 11/07/2021 08:43:11 - INFO - __main__ - Step 81674: {'lr': 0.00021996754639522757, 'samples': 15681408, 'steps': 81673, 'loss/train': 1.475173830986023} 11/07/2021 08:43:11 - INFO - __main__ - Step 81675: {'lr': 0.00021996227808807067, 'samples': 15681600, 'steps': 81674, 'loss/train': 1.430219054222107} 11/07/2021 08:43:11 - INFO - __main__ - Step 81676: {'lr': 0.0002199570097944483, 'samples': 15681792, 'steps': 81675, 'loss/train': 1.7412233352661133} 11/07/2021 08:43:12 - INFO - __main__ - Step 81677: {'lr': 0.00021995174151436287, 'samples': 15681984, 'steps': 81676, 'loss/train': 1.6773056983947754} 11/07/2021 08:43:13 - INFO - __main__ - Step 81678: {'lr': 0.0002199464732478167, 'samples': 15682176, 'steps': 81677, 'loss/train': 1.6808876991271973} 11/07/2021 08:43:13 - INFO - __main__ - Step 81679: {'lr': 0.0002199412049948122, 'samples': 15682368, 'steps': 81678, 'loss/train': 1.2350404262542725} 11/07/2021 08:43:14 - INFO - __main__ - Step 81680: {'lr': 0.00021993593675535177, 'samples': 15682560, 'steps': 81679, 'loss/train': 1.5070523023605347} 11/07/2021 08:43:14 - INFO - __main__ - Step 81681: {'lr': 0.00021993066852943766, 'samples': 15682752, 'steps': 81680, 'loss/train': 1.1225248575210571} 11/07/2021 08:43:14 - INFO - __main__ - Step 81682: {'lr': 0.00021992540031707243, 'samples': 15682944, 'steps': 81681, 'loss/train': 1.570336103439331} 11/07/2021 08:43:15 - INFO - __main__ - Step 81683: {'lr': 0.00021992013211825828, 'samples': 15683136, 'steps': 81682, 'loss/train': 1.4859445095062256} 11/07/2021 08:43:16 - INFO - __main__ - Step 81684: {'lr': 0.00021991486393299763, 'samples': 15683328, 'steps': 81683, 'loss/train': 1.341497778892517} 11/07/2021 08:43:16 - INFO - __main__ - Step 81685: {'lr': 0.00021990959576129293, 'samples': 15683520, 'steps': 81684, 'loss/train': 1.2851293087005615} 11/07/2021 08:43:16 - INFO - __main__ - Step 81686: {'lr': 0.00021990432760314647, 'samples': 15683712, 'steps': 81685, 'loss/train': 1.1809523105621338} 11/07/2021 08:43:17 - INFO - __main__ - Step 81687: {'lr': 0.00021989905945856065, 'samples': 15683904, 'steps': 81686, 'loss/train': 1.736850619316101} 11/07/2021 08:43:18 - INFO - __main__ - Step 81688: {'lr': 0.00021989379132753787, 'samples': 15684096, 'steps': 81687, 'loss/train': 1.5289820432662964} 11/07/2021 08:43:18 - INFO - __main__ - Step 81689: {'lr': 0.00021988852321008046, 'samples': 15684288, 'steps': 81688, 'loss/train': 1.680861473083496} 11/07/2021 08:43:18 - INFO - __main__ - Step 81690: {'lr': 0.00021988325510619085, 'samples': 15684480, 'steps': 81689, 'loss/train': 1.383697271347046} 11/07/2021 08:43:19 - INFO - __main__ - Step 81691: {'lr': 0.0002198779870158714, 'samples': 15684672, 'steps': 81690, 'loss/train': 1.8878403902053833} 11/07/2021 08:43:19 - INFO - __main__ - Step 81692: {'lr': 0.0002198727189391244, 'samples': 15684864, 'steps': 81691, 'loss/train': 1.8985921144485474} 11/07/2021 08:43:20 - INFO - __main__ - Step 81693: {'lr': 0.00021986745087595232, 'samples': 15685056, 'steps': 81692, 'loss/train': 1.2088817358016968} 11/07/2021 08:43:21 - INFO - __main__ - Step 81694: {'lr': 0.0002198621828263575, 'samples': 15685248, 'steps': 81693, 'loss/train': 1.5639595985412598} 11/07/2021 08:43:21 - INFO - __main__ - Step 81695: {'lr': 0.00021985691479034237, 'samples': 15685440, 'steps': 81694, 'loss/train': 1.4690934419631958} 11/07/2021 08:43:21 - INFO - __main__ - Step 81696: {'lr': 0.00021985164676790916, 'samples': 15685632, 'steps': 81695, 'loss/train': 1.4531172513961792} 11/07/2021 08:43:22 - INFO - __main__ - Step 81697: {'lr': 0.00021984637875906038, 'samples': 15685824, 'steps': 81696, 'loss/train': 1.2555263042449951} 11/07/2021 08:43:22 - INFO - __main__ - Step 81698: {'lr': 0.00021984111076379833, 'samples': 15686016, 'steps': 81697, 'loss/train': 1.0831526517868042} 11/07/2021 08:43:23 - INFO - __main__ - Step 81699: {'lr': 0.00021983584278212543, 'samples': 15686208, 'steps': 81698, 'loss/train': 1.4305229187011719} 11/07/2021 08:43:23 - INFO - __main__ - Step 81700: {'lr': 0.000219830574814044, 'samples': 15686400, 'steps': 81699, 'loss/train': 1.6276277303695679} 11/07/2021 08:43:24 - INFO - __main__ - Step 81701: {'lr': 0.00021982530685955653, 'samples': 15686592, 'steps': 81700, 'loss/train': 1.4878467321395874} 11/07/2021 08:43:24 - INFO - __main__ - Step 81702: {'lr': 0.00021982003891866527, 'samples': 15686784, 'steps': 81701, 'loss/train': 1.2821080684661865} 11/07/2021 08:43:24 - INFO - __main__ - Step 81703: {'lr': 0.00021981477099137259, 'samples': 15686976, 'steps': 81702, 'loss/train': 1.3852630853652954} 11/07/2021 08:43:25 - INFO - __main__ - Step 81704: {'lr': 0.00021980950307768095, 'samples': 15687168, 'steps': 81703, 'loss/train': 1.592743992805481} 11/07/2021 08:43:26 - INFO - __main__ - Step 81705: {'lr': 0.00021980423517759264, 'samples': 15687360, 'steps': 81704, 'loss/train': 1.7417699098587036} 11/07/2021 08:43:26 - INFO - __main__ - Step 81706: {'lr': 0.0002197989672911101, 'samples': 15687552, 'steps': 81705, 'loss/train': 0.09705986082553864} 11/07/2021 08:43:27 - INFO - __main__ - Step 81707: {'lr': 0.00021979369941823568, 'samples': 15687744, 'steps': 81706, 'loss/train': 1.5477957725524902} 11/07/2021 08:43:27 - INFO - __main__ - Step 81708: {'lr': 0.00021978843155897175, 'samples': 15687936, 'steps': 81707, 'loss/train': 1.5518919229507446} 11/07/2021 08:43:28 - INFO - __main__ - Step 81709: {'lr': 0.00021978316371332074, 'samples': 15688128, 'steps': 81708, 'loss/train': 1.7065976858139038} 11/07/2021 08:43:28 - INFO - __main__ - Step 81710: {'lr': 0.00021977789588128492, 'samples': 15688320, 'steps': 81709, 'loss/train': 0.6119842529296875} 11/07/2021 08:43:29 - INFO - __main__ - Step 81711: {'lr': 0.0002197726280628667, 'samples': 15688512, 'steps': 81710, 'loss/train': 1.3903589248657227} 11/07/2021 08:43:29 - INFO - __main__ - Step 81712: {'lr': 0.00021976736025806855, 'samples': 15688704, 'steps': 81711, 'loss/train': 1.6514801979064941} 11/07/2021 08:43:29 - INFO - __main__ - Step 81713: {'lr': 0.00021976209246689268, 'samples': 15688896, 'steps': 81712, 'loss/train': 1.5005594491958618} 11/07/2021 08:43:30 - INFO - __main__ - Step 81714: {'lr': 0.00021975682468934154, 'samples': 15689088, 'steps': 81713, 'loss/train': 1.4196094274520874} 11/07/2021 08:43:31 - INFO - __main__ - Step 81715: {'lr': 0.00021975155692541753, 'samples': 15689280, 'steps': 81714, 'loss/train': 1.1423624753952026} 11/07/2021 08:43:31 - INFO - __main__ - Step 81716: {'lr': 0.00021974628917512302, 'samples': 15689472, 'steps': 81715, 'loss/train': 1.6110972166061401} 11/07/2021 08:43:31 - INFO - __main__ - Step 81717: {'lr': 0.00021974102143846032, 'samples': 15689664, 'steps': 81716, 'loss/train': 1.5989995002746582} 11/07/2021 08:43:32 - INFO - __main__ - Step 81718: {'lr': 0.00021973575371543187, 'samples': 15689856, 'steps': 81717, 'loss/train': 0.11134147644042969} 11/07/2021 08:43:33 - INFO - __main__ - Step 81719: {'lr': 0.00021973048600604, 'samples': 15690048, 'steps': 81718, 'loss/train': 1.681930661201477} 11/07/2021 08:43:33 - INFO - __main__ - Step 81720: {'lr': 0.00021972521831028715, 'samples': 15690240, 'steps': 81719, 'loss/train': 1.2365776300430298} 11/07/2021 08:43:33 - INFO - __main__ - Step 81721: {'lr': 0.00021971995062817563, 'samples': 15690432, 'steps': 81720, 'loss/train': 1.4358240365982056} 11/07/2021 08:43:34 - INFO - __main__ - Step 81722: {'lr': 0.00021971468295970786, 'samples': 15690624, 'steps': 81721, 'loss/train': 0.965334951877594} 11/07/2021 08:43:34 - INFO - __main__ - Step 81723: {'lr': 0.00021970941530488622, 'samples': 15690816, 'steps': 81722, 'loss/train': 1.3515374660491943} 11/07/2021 08:43:35 - INFO - __main__ - Step 81724: {'lr': 0.000219704147663713, 'samples': 15691008, 'steps': 81723, 'loss/train': 1.3732637166976929} 11/07/2021 08:43:36 - INFO - __main__ - Step 81725: {'lr': 0.0002196988800361906, 'samples': 15691200, 'steps': 81724, 'loss/train': 1.1984286308288574} 11/07/2021 08:43:36 - INFO - __main__ - Step 81726: {'lr': 0.00021969361242232143, 'samples': 15691392, 'steps': 81725, 'loss/train': 0.6077271103858948} 11/07/2021 08:43:36 - INFO - __main__ - Step 81727: {'lr': 0.00021968834482210783, 'samples': 15691584, 'steps': 81726, 'loss/train': 1.0926482677459717} 11/07/2021 08:43:37 - INFO - __main__ - Step 81728: {'lr': 0.0002196830772355522, 'samples': 15691776, 'steps': 81727, 'loss/train': 1.323989987373352} 11/07/2021 08:43:37 - INFO - __main__ - Step 81729: {'lr': 0.00021967780966265692, 'samples': 15691968, 'steps': 81728, 'loss/train': 1.3810734748840332} 11/07/2021 08:43:38 - INFO - __main__ - Step 81730: {'lr': 0.00021967254210342437, 'samples': 15692160, 'steps': 81729, 'loss/train': 1.6592577695846558} 11/07/2021 08:43:38 - INFO - __main__ - Step 81731: {'lr': 0.0002196672745578569, 'samples': 15692352, 'steps': 81730, 'loss/train': 1.2747776508331299} 11/07/2021 08:43:39 - INFO - __main__ - Step 81732: {'lr': 0.00021966200702595688, 'samples': 15692544, 'steps': 81731, 'loss/train': 1.3567525148391724} 11/07/2021 08:43:39 - INFO - __main__ - Step 81733: {'lr': 0.00021965673950772666, 'samples': 15692736, 'steps': 81732, 'loss/train': 1.8659203052520752} 11/07/2021 08:43:39 - INFO - __main__ - Step 81734: {'lr': 0.0002196514720031687, 'samples': 15692928, 'steps': 81733, 'loss/train': 1.4155669212341309} 11/07/2021 08:43:41 - INFO - __main__ - Step 81735: {'lr': 0.00021964620451228527, 'samples': 15693120, 'steps': 81734, 'loss/train': 1.6196352243423462} 11/07/2021 08:43:41 - INFO - __main__ - Step 81736: {'lr': 0.00021964093703507893, 'samples': 15693312, 'steps': 81735, 'loss/train': 1.6716891527175903} 11/07/2021 08:43:42 - INFO - __main__ - Step 81737: {'lr': 0.00021963566957155178, 'samples': 15693504, 'steps': 81736, 'loss/train': 1.23566472530365} 11/07/2021 08:43:42 - INFO - __main__ - Step 81738: {'lr': 0.00021963040212170636, 'samples': 15693696, 'steps': 81737, 'loss/train': 1.4538891315460205} 11/07/2021 08:43:42 - INFO - __main__ - Step 81739: {'lr': 0.00021962513468554498, 'samples': 15693888, 'steps': 81738, 'loss/train': 1.7894468307495117} 11/07/2021 08:43:43 - INFO - __main__ - Step 81740: {'lr': 0.00021961986726307006, 'samples': 15694080, 'steps': 81739, 'loss/train': 0.1260071098804474} 11/07/2021 08:43:44 - INFO - __main__ - Step 81741: {'lr': 0.00021961459985428395, 'samples': 15694272, 'steps': 81740, 'loss/train': 1.3848665952682495} 11/07/2021 08:43:44 - INFO - __main__ - Step 81742: {'lr': 0.00021960933245918903, 'samples': 15694464, 'steps': 81741, 'loss/train': 1.1008399724960327} 11/07/2021 08:43:44 - INFO - __main__ - Step 81743: {'lr': 0.0002196040650777877, 'samples': 15694656, 'steps': 81742, 'loss/train': 1.6353232860565186} 11/07/2021 08:43:45 - INFO - __main__ - Step 81744: {'lr': 0.00021959879771008228, 'samples': 15694848, 'steps': 81743, 'loss/train': 1.1745723485946655} 11/07/2021 08:43:45 - INFO - __main__ - Step 81745: {'lr': 0.00021959353035607522, 'samples': 15695040, 'steps': 81744, 'loss/train': 1.0483794212341309} 11/07/2021 08:43:46 - INFO - __main__ - Step 81746: {'lr': 0.0002195882630157688, 'samples': 15695232, 'steps': 81745, 'loss/train': 1.2218928337097168} 11/07/2021 08:43:46 - INFO - __main__ - Step 81747: {'lr': 0.00021958299568916546, 'samples': 15695424, 'steps': 81746, 'loss/train': 1.5027987957000732} 11/07/2021 08:43:47 - INFO - __main__ - Step 81748: {'lr': 0.00021957772837626755, 'samples': 15695616, 'steps': 81747, 'loss/train': 1.6050641536712646} 11/07/2021 08:43:47 - INFO - __main__ - Step 81749: {'lr': 0.00021957246107707758, 'samples': 15695808, 'steps': 81748, 'loss/train': 1.5497397184371948} 11/07/2021 08:43:47 - INFO - __main__ - Step 81750: {'lr': 0.00021956719379159762, 'samples': 15696000, 'steps': 81749, 'loss/train': 1.4605358839035034} 11/07/2021 08:43:48 - INFO - __main__ - Step 81751: {'lr': 0.00021956192651983027, 'samples': 15696192, 'steps': 81750, 'loss/train': 1.4995907545089722} 11/07/2021 08:43:49 - INFO - __main__ - Step 81752: {'lr': 0.0002195566592617778, 'samples': 15696384, 'steps': 81751, 'loss/train': 1.7870819568634033} 11/07/2021 08:43:49 - INFO - __main__ - Step 81753: {'lr': 0.00021955139201744266, 'samples': 15696576, 'steps': 81752, 'loss/train': 1.209183931350708} 11/07/2021 08:43:49 - INFO - __main__ - Step 81754: {'lr': 0.00021954612478682718, 'samples': 15696768, 'steps': 81753, 'loss/train': 2.140399932861328} 11/07/2021 08:43:50 - INFO - __main__ - Step 81755: {'lr': 0.00021954085756993374, 'samples': 15696960, 'steps': 81754, 'loss/train': 1.3503656387329102} 11/07/2021 08:43:51 - INFO - __main__ - Step 81756: {'lr': 0.00021953559036676473, 'samples': 15697152, 'steps': 81755, 'loss/train': 1.3702806234359741} 11/07/2021 08:43:51 - INFO - __main__ - Step 81757: {'lr': 0.00021953032317732253, 'samples': 15697344, 'steps': 81756, 'loss/train': 1.3751754760742188} 11/07/2021 08:43:52 - INFO - __main__ - Step 81758: {'lr': 0.0002195250560016095, 'samples': 15697536, 'steps': 81757, 'loss/train': 1.3358396291732788} 11/07/2021 08:43:52 - INFO - __main__ - Step 81759: {'lr': 0.00021951978883962797, 'samples': 15697728, 'steps': 81758, 'loss/train': 1.3910143375396729} 11/07/2021 08:43:52 - INFO - __main__ - Step 81760: {'lr': 0.00021951452169138036, 'samples': 15697920, 'steps': 81759, 'loss/train': 1.8249781131744385} 11/07/2021 08:43:53 - INFO - __main__ - Step 81761: {'lr': 0.00021950925455686908, 'samples': 15698112, 'steps': 81760, 'loss/train': 1.4410347938537598} 11/07/2021 08:43:54 - INFO - __main__ - Step 81762: {'lr': 0.0002195039874360964, 'samples': 15698304, 'steps': 81761, 'loss/train': 1.2035881280899048} 11/07/2021 08:43:54 - INFO - __main__ - Step 81763: {'lr': 0.0002194987203290649, 'samples': 15698496, 'steps': 81762, 'loss/train': 0.6848349571228027} 11/07/2021 08:43:54 - INFO - __main__ - Step 81764: {'lr': 0.00021949345323577668, 'samples': 15698688, 'steps': 81763, 'loss/train': 1.2248495817184448} 11/07/2021 08:43:55 - INFO - __main__ - Step 81765: {'lr': 0.00021948818615623425, 'samples': 15698880, 'steps': 81764, 'loss/train': 2.319912910461426} 11/07/2021 08:43:56 - INFO - __main__ - Step 81766: {'lr': 0.00021948291909043997, 'samples': 15699072, 'steps': 81765, 'loss/train': 0.7757716774940491} 11/07/2021 08:43:56 - INFO - __main__ - Step 81767: {'lr': 0.0002194776520383962, 'samples': 15699264, 'steps': 81766, 'loss/train': 1.2002075910568237} 11/07/2021 08:43:57 - INFO - __main__ - Step 81768: {'lr': 0.00021947238500010535, 'samples': 15699456, 'steps': 81767, 'loss/train': 2.004162073135376} 11/07/2021 08:43:57 - INFO - __main__ - Step 81769: {'lr': 0.0002194671179755698, 'samples': 15699648, 'steps': 81768, 'loss/train': 1.8498233556747437} 11/07/2021 08:43:57 - INFO - __main__ - Step 81770: {'lr': 0.00021946185096479186, 'samples': 15699840, 'steps': 81769, 'loss/train': 1.8227311372756958} 11/07/2021 08:43:58 - INFO - __main__ - Step 81771: {'lr': 0.0002194565839677739, 'samples': 15700032, 'steps': 81770, 'loss/train': 1.562067985534668} 11/07/2021 08:43:59 - INFO - __main__ - Step 81772: {'lr': 0.0002194513169845184, 'samples': 15700224, 'steps': 81771, 'loss/train': 1.770891547203064} 11/07/2021 08:43:59 - INFO - __main__ - Step 81773: {'lr': 0.00021944605001502761, 'samples': 15700416, 'steps': 81772, 'loss/train': 1.4528216123580933} 11/07/2021 08:43:59 - INFO - __main__ - Step 81774: {'lr': 0.000219440783059304, 'samples': 15700608, 'steps': 81773, 'loss/train': 1.2211928367614746} 11/07/2021 08:44:00 - INFO - __main__ - Step 81775: {'lr': 0.00021943551611734987, 'samples': 15700800, 'steps': 81774, 'loss/train': 0.9278920292854309} 11/07/2021 08:44:00 - INFO - __main__ - Step 81776: {'lr': 0.00021943024918916776, 'samples': 15700992, 'steps': 81775, 'loss/train': 1.4322787523269653} 11/07/2021 08:44:01 - INFO - __main__ - Step 81777: {'lr': 0.0002194249822747598, 'samples': 15701184, 'steps': 81776, 'loss/train': 1.710916519165039} 11/07/2021 08:44:01 - INFO - __main__ - Step 81778: {'lr': 0.00021941971537412847, 'samples': 15701376, 'steps': 81777, 'loss/train': 1.5158095359802246} 11/07/2021 08:44:02 - INFO - __main__ - Step 81779: {'lr': 0.00021941444848727612, 'samples': 15701568, 'steps': 81778, 'loss/train': 1.0082858800888062} 11/07/2021 08:44:02 - INFO - __main__ - Step 81780: {'lr': 0.00021940918161420517, 'samples': 15701760, 'steps': 81779, 'loss/train': 1.0920706987380981} 11/07/2021 08:44:02 - INFO - __main__ - Step 81781: {'lr': 0.00021940391475491793, 'samples': 15701952, 'steps': 81780, 'loss/train': 1.762493371963501} 11/07/2021 08:44:04 - INFO - __main__ - Step 81782: {'lr': 0.0002193986479094169, 'samples': 15702144, 'steps': 81781, 'loss/train': 1.4752006530761719} 11/07/2021 08:44:04 - INFO - __main__ - Step 81783: {'lr': 0.0002193933810777043, 'samples': 15702336, 'steps': 81782, 'loss/train': 0.8506064414978027} 11/07/2021 08:44:04 - INFO - __main__ - Step 81784: {'lr': 0.0002193881142597826, 'samples': 15702528, 'steps': 81783, 'loss/train': 1.3037147521972656} 11/07/2021 08:44:05 - INFO - __main__ - Step 81785: {'lr': 0.00021938284745565408, 'samples': 15702720, 'steps': 81784, 'loss/train': 1.1150015592575073} 11/07/2021 08:44:05 - INFO - __main__ - Step 81786: {'lr': 0.00021937758066532123, 'samples': 15702912, 'steps': 81785, 'loss/train': 0.9177032709121704} 11/07/2021 08:44:06 - INFO - __main__ - Step 81787: {'lr': 0.00021937231388878637, 'samples': 15703104, 'steps': 81786, 'loss/train': 1.2755452394485474} 11/07/2021 08:44:06 - INFO - __main__ - Step 81788: {'lr': 0.00021936704712605188, 'samples': 15703296, 'steps': 81787, 'loss/train': 1.5016417503356934} 11/07/2021 08:44:07 - INFO - __main__ - Step 81789: {'lr': 0.0002193617803771202, 'samples': 15703488, 'steps': 81788, 'loss/train': 1.3572160005569458} 11/07/2021 08:44:07 - INFO - __main__ - Step 81790: {'lr': 0.00021935651364199355, 'samples': 15703680, 'steps': 81789, 'loss/train': 1.643572211265564} 11/07/2021 08:44:07 - INFO - __main__ - Step 81791: {'lr': 0.00021935124692067437, 'samples': 15703872, 'steps': 81790, 'loss/train': 1.5238391160964966} 11/07/2021 08:44:08 - INFO - __main__ - Step 81792: {'lr': 0.00021934598021316508, 'samples': 15704064, 'steps': 81791, 'loss/train': 1.634735107421875} 11/07/2021 08:44:09 - INFO - __main__ - Step 81793: {'lr': 0.00021934071351946795, 'samples': 15704256, 'steps': 81792, 'loss/train': 0.9222462773323059} 11/07/2021 08:44:09 - INFO - __main__ - Step 81794: {'lr': 0.0002193354468395855, 'samples': 15704448, 'steps': 81793, 'loss/train': 1.5245214700698853} 11/07/2021 08:44:10 - INFO - __main__ - Step 81795: {'lr': 0.00021933018017351996, 'samples': 15704640, 'steps': 81794, 'loss/train': 1.655958890914917} 11/07/2021 08:44:10 - INFO - __main__ - Step 81796: {'lr': 0.00021932491352127378, 'samples': 15704832, 'steps': 81795, 'loss/train': 1.6035531759262085} 11/07/2021 08:44:10 - INFO - __main__ - Step 81797: {'lr': 0.00021931964688284933, 'samples': 15705024, 'steps': 81796, 'loss/train': 1.0351927280426025} 11/07/2021 08:44:11 - INFO - __main__ - Step 81798: {'lr': 0.00021931438025824898, 'samples': 15705216, 'steps': 81797, 'loss/train': 1.3137394189834595} 11/07/2021 08:44:12 - INFO - __main__ - Step 81799: {'lr': 0.0002193091136474751, 'samples': 15705408, 'steps': 81798, 'loss/train': 1.3677879571914673} 11/07/2021 08:44:12 - INFO - __main__ - Step 81800: {'lr': 0.00021930384705053004, 'samples': 15705600, 'steps': 81799, 'loss/train': 1.5106375217437744} 11/07/2021 08:44:12 - INFO - __main__ - Step 81801: {'lr': 0.00021929858046741623, 'samples': 15705792, 'steps': 81800, 'loss/train': 1.5269354581832886} 11/07/2021 08:44:13 - INFO - __main__ - Step 81802: {'lr': 0.00021929331389813596, 'samples': 15705984, 'steps': 81801, 'loss/train': 0.914970874786377} 11/07/2021 08:44:14 - INFO - __main__ - Step 81803: {'lr': 0.00021928804734269177, 'samples': 15706176, 'steps': 81802, 'loss/train': 1.448287844657898} 11/07/2021 08:44:14 - INFO - __main__ - Step 81804: {'lr': 0.00021928278080108582, 'samples': 15706368, 'steps': 81803, 'loss/train': 1.5939825773239136} 11/07/2021 08:44:14 - INFO - __main__ - Step 81805: {'lr': 0.00021927751427332058, 'samples': 15706560, 'steps': 81804, 'loss/train': 1.5776233673095703} 11/07/2021 08:44:15 - INFO - __main__ - Step 81806: {'lr': 0.00021927224775939838, 'samples': 15706752, 'steps': 81805, 'loss/train': 1.2894569635391235} 11/07/2021 08:44:15 - INFO - __main__ - Step 81807: {'lr': 0.00021926698125932168, 'samples': 15706944, 'steps': 81806, 'loss/train': 1.5209496021270752} 11/07/2021 08:44:16 - INFO - __main__ - Step 81808: {'lr': 0.00021926171477309276, 'samples': 15707136, 'steps': 81807, 'loss/train': 0.938679039478302} 11/07/2021 08:44:17 - INFO - __main__ - Step 81809: {'lr': 0.00021925644830071407, 'samples': 15707328, 'steps': 81808, 'loss/train': 1.7045031785964966} 11/07/2021 08:44:17 - INFO - __main__ - Step 81810: {'lr': 0.00021925118184218793, 'samples': 15707520, 'steps': 81809, 'loss/train': 1.3983052968978882} 11/07/2021 08:44:17 - INFO - __main__ - Step 81811: {'lr': 0.00021924591539751673, 'samples': 15707712, 'steps': 81810, 'loss/train': 0.9287119507789612} 11/07/2021 08:44:18 - INFO - __main__ - Step 81812: {'lr': 0.00021924064896670288, 'samples': 15707904, 'steps': 81811, 'loss/train': 1.5622773170471191} 11/07/2021 08:44:19 - INFO - __main__ - Step 81813: {'lr': 0.00021923538254974868, 'samples': 15708096, 'steps': 81812, 'loss/train': 1.4727410078048706} 11/07/2021 08:44:19 - INFO - __main__ - Step 81814: {'lr': 0.0002192301161466566, 'samples': 15708288, 'steps': 81813, 'loss/train': 1.1603304147720337} 11/07/2021 08:44:19 - INFO - __main__ - Step 81815: {'lr': 0.00021922484975742893, 'samples': 15708480, 'steps': 81814, 'loss/train': 0.943850040435791} 11/07/2021 08:44:20 - INFO - __main__ - Step 81816: {'lr': 0.0002192195833820681, 'samples': 15708672, 'steps': 81815, 'loss/train': 1.8960974216461182} 11/07/2021 08:44:20 - INFO - __main__ - Step 81817: {'lr': 0.0002192143170205764, 'samples': 15708864, 'steps': 81816, 'loss/train': 1.4154162406921387} 11/07/2021 08:44:21 - INFO - __main__ - Step 81818: {'lr': 0.00021920905067295625, 'samples': 15709056, 'steps': 81817, 'loss/train': 1.5239670276641846} 11/07/2021 08:44:21 - INFO - __main__ - Step 81819: {'lr': 0.00021920378433921002, 'samples': 15709248, 'steps': 81818, 'loss/train': 1.5357328653335571} 11/07/2021 08:44:22 - INFO - __main__ - Step 81820: {'lr': 0.0002191985180193401, 'samples': 15709440, 'steps': 81819, 'loss/train': 0.8983633518218994} 11/07/2021 08:44:22 - INFO - __main__ - Step 81821: {'lr': 0.00021919325171334885, 'samples': 15709632, 'steps': 81820, 'loss/train': 1.4406765699386597} 11/07/2021 08:44:23 - INFO - __main__ - Step 81822: {'lr': 0.00021918798542123864, 'samples': 15709824, 'steps': 81821, 'loss/train': 1.1993900537490845} 11/07/2021 08:44:23 - INFO - __main__ - Step 81823: {'lr': 0.00021918271914301185, 'samples': 15710016, 'steps': 81822, 'loss/train': 0.37001076340675354} 11/07/2021 08:44:24 - INFO - __main__ - Step 81824: {'lr': 0.00021917745287867085, 'samples': 15710208, 'steps': 81823, 'loss/train': 1.3202654123306274} 11/07/2021 08:44:24 - INFO - __main__ - Step 81825: {'lr': 0.0002191721866282181, 'samples': 15710400, 'steps': 81824, 'loss/train': 1.1840965747833252} 11/07/2021 08:44:25 - INFO - __main__ - Step 81826: {'lr': 0.00021916692039165582, 'samples': 15710592, 'steps': 81825, 'loss/train': 0.9618353843688965} 11/07/2021 08:44:25 - INFO - __main__ - Step 81827: {'lr': 0.00021916165416898642, 'samples': 15710784, 'steps': 81826, 'loss/train': 1.2716294527053833} 11/07/2021 08:44:25 - INFO - __main__ - Step 81828: {'lr': 0.00021915638796021232, 'samples': 15710976, 'steps': 81827, 'loss/train': 1.3472450971603394} 11/07/2021 08:44:26 - INFO - __main__ - Step 81829: {'lr': 0.00021915112176533588, 'samples': 15711168, 'steps': 81828, 'loss/train': 0.5666459202766418} 11/07/2021 08:44:27 - INFO - __main__ - Step 81830: {'lr': 0.0002191458555843595, 'samples': 15711360, 'steps': 81829, 'loss/train': 1.418014407157898} 11/07/2021 08:44:27 - INFO - __main__ - Step 81831: {'lr': 0.0002191405894172855, 'samples': 15711552, 'steps': 81830, 'loss/train': 1.4239994287490845} 11/07/2021 08:44:27 - INFO - __main__ - Step 81832: {'lr': 0.00021913532326411626, 'samples': 15711744, 'steps': 81831, 'loss/train': 1.4199299812316895} 11/07/2021 08:44:28 - INFO - __main__ - Step 81833: {'lr': 0.0002191300571248542, 'samples': 15711936, 'steps': 81832, 'loss/train': 1.0008704662322998} 11/07/2021 08:44:29 - INFO - __main__ - Step 81834: {'lr': 0.00021912479099950161, 'samples': 15712128, 'steps': 81833, 'loss/train': 1.765632152557373} 11/07/2021 08:44:29 - INFO - __main__ - Step 81835: {'lr': 0.00021911952488806093, 'samples': 15712320, 'steps': 81834, 'loss/train': 1.327449083328247} 11/07/2021 08:44:30 - INFO - __main__ - Step 81836: {'lr': 0.00021911425879053456, 'samples': 15712512, 'steps': 81835, 'loss/train': 1.55491042137146} 11/07/2021 08:44:30 - INFO - __main__ - Step 81837: {'lr': 0.00021910899270692478, 'samples': 15712704, 'steps': 81836, 'loss/train': 1.4408612251281738} 11/07/2021 08:44:30 - INFO - __main__ - Step 81838: {'lr': 0.00021910372663723403, 'samples': 15712896, 'steps': 81837, 'loss/train': 1.555741548538208} 11/07/2021 08:44:31 - INFO - __main__ - Step 81839: {'lr': 0.00021909846058146463, 'samples': 15713088, 'steps': 81838, 'loss/train': 2.1258535385131836} 11/07/2021 08:44:32 - INFO - __main__ - Step 81840: {'lr': 0.00021909319453961902, 'samples': 15713280, 'steps': 81839, 'loss/train': 3.8428032398223877} 11/07/2021 08:44:32 - INFO - __main__ - Step 81841: {'lr': 0.00021908792851169952, 'samples': 15713472, 'steps': 81840, 'loss/train': 1.164108157157898} 11/07/2021 08:44:32 - INFO - __main__ - Step 81842: {'lr': 0.00021908266249770852, 'samples': 15713664, 'steps': 81841, 'loss/train': 1.5313407182693481} 11/07/2021 08:44:33 - INFO - __main__ - Step 81843: {'lr': 0.00021907739649764846, 'samples': 15713856, 'steps': 81842, 'loss/train': 1.3132789134979248} 11/07/2021 08:44:33 - INFO - __main__ - Step 81844: {'lr': 0.00021907213051152157, 'samples': 15714048, 'steps': 81843, 'loss/train': 1.482437014579773} 11/07/2021 08:44:34 - INFO - __main__ - Step 81845: {'lr': 0.00021906686453933034, 'samples': 15714240, 'steps': 81844, 'loss/train': 1.0574501752853394} 11/07/2021 08:44:34 - INFO - __main__ - Step 81846: {'lr': 0.0002190615985810771, 'samples': 15714432, 'steps': 81845, 'loss/train': 1.5509016513824463} 11/07/2021 08:44:35 - INFO - __main__ - Step 81847: {'lr': 0.00021905633263676424, 'samples': 15714624, 'steps': 81846, 'loss/train': 1.1591476202011108} 11/07/2021 08:44:35 - INFO - __main__ - Step 81848: {'lr': 0.0002190510667063941, 'samples': 15714816, 'steps': 81847, 'loss/train': 1.5975843667984009} 11/07/2021 08:44:35 - INFO - __main__ - Step 81849: {'lr': 0.00021904580078996904, 'samples': 15715008, 'steps': 81848, 'loss/train': 1.328046202659607} 11/07/2021 08:44:37 - INFO - __main__ - Step 81850: {'lr': 0.00021904053488749152, 'samples': 15715200, 'steps': 81849, 'loss/train': 1.0789021253585815} 11/07/2021 08:44:37 - INFO - __main__ - Step 81851: {'lr': 0.0002190352689989638, 'samples': 15715392, 'steps': 81850, 'loss/train': 1.2321277856826782} 11/07/2021 08:44:37 - INFO - __main__ - Step 81852: {'lr': 0.00021903000312438833, 'samples': 15715584, 'steps': 81851, 'loss/train': 1.4328861236572266} 11/07/2021 08:44:38 - INFO - __main__ - Step 81853: {'lr': 0.0002190247372637675, 'samples': 15715776, 'steps': 81852, 'loss/train': 1.6071841716766357} 11/07/2021 08:44:38 - INFO - __main__ - Step 81854: {'lr': 0.0002190194714171036, 'samples': 15715968, 'steps': 81853, 'loss/train': 1.365761637687683} 11/07/2021 08:44:39 - INFO - __main__ - Step 81855: {'lr': 0.00021901420558439905, 'samples': 15716160, 'steps': 81854, 'loss/train': 1.3484658002853394} 11/07/2021 08:44:39 - INFO - __main__ - Step 81856: {'lr': 0.00021900893976565622, 'samples': 15716352, 'steps': 81855, 'loss/train': 1.4486804008483887} 11/07/2021 08:44:40 - INFO - __main__ - Step 81857: {'lr': 0.00021900367396087756, 'samples': 15716544, 'steps': 81856, 'loss/train': 1.6089874505996704} 11/07/2021 08:44:40 - INFO - __main__ - Step 81858: {'lr': 0.00021899840817006535, 'samples': 15716736, 'steps': 81857, 'loss/train': 1.659891128540039} 11/07/2021 08:44:40 - INFO - __main__ - Step 81859: {'lr': 0.00021899314239322192, 'samples': 15716928, 'steps': 81858, 'loss/train': 1.5814769268035889} 11/07/2021 08:44:42 - INFO - __main__ - Step 81860: {'lr': 0.00021898787663034974, 'samples': 15717120, 'steps': 81859, 'loss/train': 1.1334338188171387} 11/07/2021 08:44:42 - INFO - __main__ - Step 81861: {'lr': 0.0002189826108814511, 'samples': 15717312, 'steps': 81860, 'loss/train': 1.1215496063232422} 11/07/2021 08:44:42 - INFO - __main__ - Step 81862: {'lr': 0.00021897734514652844, 'samples': 15717504, 'steps': 81861, 'loss/train': 1.1770102977752686} 11/07/2021 08:44:43 - INFO - __main__ - Step 81863: {'lr': 0.00021897207942558411, 'samples': 15717696, 'steps': 81862, 'loss/train': 1.4443387985229492} 11/07/2021 08:44:43 - INFO - __main__ - Step 81864: {'lr': 0.00021896681371862047, 'samples': 15717888, 'steps': 81863, 'loss/train': 1.4161992073059082} 11/07/2021 08:44:44 - INFO - __main__ - Step 81865: {'lr': 0.00021896154802563992, 'samples': 15718080, 'steps': 81864, 'loss/train': 1.275030493736267} 11/07/2021 08:44:45 - INFO - __main__ - Step 81866: {'lr': 0.0002189562823466448, 'samples': 15718272, 'steps': 81865, 'loss/train': 1.0452250242233276} 11/07/2021 08:44:45 - INFO - __main__ - Step 81867: {'lr': 0.0002189510166816375, 'samples': 15718464, 'steps': 81866, 'loss/train': 1.355798602104187} 11/07/2021 08:44:46 - INFO - __main__ - Step 81868: {'lr': 0.00021894575103062038, 'samples': 15718656, 'steps': 81867, 'loss/train': 0.6811831593513489} 11/07/2021 08:44:46 - INFO - __main__ - Step 81869: {'lr': 0.00021894048539359588, 'samples': 15718848, 'steps': 81868, 'loss/train': 1.5100421905517578} 11/07/2021 08:44:46 - INFO - __main__ - Step 81870: {'lr': 0.00021893521977056637, 'samples': 15719040, 'steps': 81869, 'loss/train': 1.6450531482696533} 11/07/2021 08:44:47 - INFO - __main__ - Step 81871: {'lr': 0.00021892995416153408, 'samples': 15719232, 'steps': 81870, 'loss/train': 0.6634085774421692} 11/07/2021 08:44:48 - INFO - __main__ - Step 81872: {'lr': 0.00021892468856650148, 'samples': 15719424, 'steps': 81871, 'loss/train': 1.126741886138916} 11/07/2021 08:44:48 - INFO - __main__ - Step 81873: {'lr': 0.00021891942298547093, 'samples': 15719616, 'steps': 81872, 'loss/train': 1.784936547279358} 11/07/2021 08:44:48 - INFO - __main__ - Step 81874: {'lr': 0.0002189141574184448, 'samples': 15719808, 'steps': 81873, 'loss/train': 1.6559330224990845} 11/07/2021 08:44:49 - INFO - __main__ - Step 81875: {'lr': 0.0002189088918654255, 'samples': 15720000, 'steps': 81874, 'loss/train': 1.5979567766189575} 11/07/2021 08:44:49 - INFO - __main__ - Step 81876: {'lr': 0.00021890362632641537, 'samples': 15720192, 'steps': 81875, 'loss/train': 1.455506443977356} 11/07/2021 08:44:50 - INFO - __main__ - Step 81877: {'lr': 0.00021889836080141677, 'samples': 15720384, 'steps': 81876, 'loss/train': 1.5928109884262085} 11/07/2021 08:44:51 - INFO - __main__ - Step 81878: {'lr': 0.00021889309529043207, 'samples': 15720576, 'steps': 81877, 'loss/train': 1.5861215591430664} 11/07/2021 08:44:51 - INFO - __main__ - Step 81879: {'lr': 0.0002188878297934637, 'samples': 15720768, 'steps': 81878, 'loss/train': 1.4372271299362183} 11/07/2021 08:44:51 - INFO - __main__ - Step 81880: {'lr': 0.00021888256431051395, 'samples': 15720960, 'steps': 81879, 'loss/train': 1.7066025733947754} 11/07/2021 08:44:52 - INFO - __main__ - Step 81881: {'lr': 0.00021887729884158527, 'samples': 15721152, 'steps': 81880, 'loss/train': 1.7062877416610718} 11/07/2021 08:44:53 - INFO - __main__ - Step 81882: {'lr': 0.00021887203338668, 'samples': 15721344, 'steps': 81881, 'loss/train': 1.6168873310089111} 11/07/2021 08:44:53 - INFO - __main__ - Step 81883: {'lr': 0.0002188667679458005, 'samples': 15721536, 'steps': 81882, 'loss/train': 2.0987067222595215} 11/07/2021 08:44:53 - INFO - __main__ - Step 81884: {'lr': 0.00021886150251894927, 'samples': 15721728, 'steps': 81883, 'loss/train': 1.7823930978775024} 11/07/2021 08:44:54 - INFO - __main__ - Step 81885: {'lr': 0.00021885623710612845, 'samples': 15721920, 'steps': 81884, 'loss/train': 1.3161180019378662} 11/07/2021 08:44:54 - INFO - __main__ - Step 81886: {'lr': 0.00021885097170734052, 'samples': 15722112, 'steps': 81885, 'loss/train': 1.5367300510406494} 11/07/2021 08:44:55 - INFO - __main__ - Step 81887: {'lr': 0.00021884570632258788, 'samples': 15722304, 'steps': 81886, 'loss/train': 1.1786556243896484} 11/07/2021 08:44:55 - INFO - __main__ - Step 81888: {'lr': 0.0002188404409518729, 'samples': 15722496, 'steps': 81887, 'loss/train': 1.0150196552276611} 11/07/2021 08:44:56 - INFO - __main__ - Step 81889: {'lr': 0.00021883517559519787, 'samples': 15722688, 'steps': 81888, 'loss/train': 1.1865622997283936} 11/07/2021 08:44:56 - INFO - __main__ - Step 81890: {'lr': 0.0002188299102525653, 'samples': 15722880, 'steps': 81889, 'loss/train': 1.2968388795852661} 11/07/2021 08:44:56 - INFO - __main__ - Step 81891: {'lr': 0.00021882464492397748, 'samples': 15723072, 'steps': 81890, 'loss/train': 0.8124095797538757} 11/07/2021 08:44:57 - INFO - __main__ - Step 81892: {'lr': 0.00021881937960943677, 'samples': 15723264, 'steps': 81891, 'loss/train': 1.4005656242370605} 11/07/2021 08:44:59 - INFO - __main__ - Step 81893: {'lr': 0.00021881411430894554, 'samples': 15723456, 'steps': 81892, 'loss/train': 0.5554535984992981} 11/07/2021 08:44:59 - INFO - __main__ - Step 81894: {'lr': 0.00021880884902250624, 'samples': 15723648, 'steps': 81893, 'loss/train': 1.590369701385498} 11/07/2021 08:44:59 - INFO - __main__ - Step 81895: {'lr': 0.00021880358375012116, 'samples': 15723840, 'steps': 81894, 'loss/train': 1.3703712224960327} 11/07/2021 08:45:00 - INFO - __main__ - Step 81896: {'lr': 0.00021879831849179275, 'samples': 15724032, 'steps': 81895, 'loss/train': 1.7759495973587036} 11/07/2021 08:45:00 - INFO - __main__ - Step 81897: {'lr': 0.00021879305324752342, 'samples': 15724224, 'steps': 81896, 'loss/train': 1.7670613527297974} 11/07/2021 08:45:00 - INFO - __main__ - Step 81898: {'lr': 0.00021878778801731532, 'samples': 15724416, 'steps': 81897, 'loss/train': 1.7577279806137085} 11/07/2021 08:45:01 - INFO - __main__ - Step 81899: {'lr': 0.00021878252280117097, 'samples': 15724608, 'steps': 81898, 'loss/train': 1.5230650901794434} 11/07/2021 08:45:02 - INFO - __main__ - Step 81900: {'lr': 0.00021877725759909274, 'samples': 15724800, 'steps': 81899, 'loss/train': 1.2282017469406128} 11/07/2021 08:45:02 - INFO - __main__ - Step 81901: {'lr': 0.00021877199241108304, 'samples': 15724992, 'steps': 81900, 'loss/train': 1.452487587928772} 11/07/2021 08:45:02 - INFO - __main__ - Step 81902: {'lr': 0.00021876672723714413, 'samples': 15725184, 'steps': 81901, 'loss/train': 1.2938870191574097} 11/07/2021 08:45:03 - INFO - __main__ - Step 81903: {'lr': 0.00021876146207727847, 'samples': 15725376, 'steps': 81902, 'loss/train': 1.787327527999878} 11/07/2021 08:45:03 - INFO - __main__ - Step 81904: {'lr': 0.00021875619693148847, 'samples': 15725568, 'steps': 81903, 'loss/train': 1.6098909378051758} 11/07/2021 08:45:04 - INFO - __main__ - Step 81905: {'lr': 0.0002187509317997764, 'samples': 15725760, 'steps': 81904, 'loss/train': 1.7381401062011719} 11/07/2021 08:45:04 - INFO - __main__ - Step 81906: {'lr': 0.00021874566668214466, 'samples': 15725952, 'steps': 81905, 'loss/train': 1.241761326789856} 11/07/2021 08:45:05 - INFO - __main__ - Step 81907: {'lr': 0.00021874040157859564, 'samples': 15726144, 'steps': 81906, 'loss/train': 0.550673246383667} 11/07/2021 08:45:05 - INFO - __main__ - Step 81908: {'lr': 0.00021873513648913178, 'samples': 15726336, 'steps': 81907, 'loss/train': 1.8351263999938965} 11/07/2021 08:45:06 - INFO - __main__ - Step 81909: {'lr': 0.0002187298714137553, 'samples': 15726528, 'steps': 81908, 'loss/train': 0.9531960487365723} 11/07/2021 08:45:07 - INFO - __main__ - Step 81910: {'lr': 0.00021872460635246883, 'samples': 15726720, 'steps': 81909, 'loss/train': 1.373153805732727} 11/07/2021 08:45:07 - INFO - __main__ - Step 81911: {'lr': 0.00021871934130527444, 'samples': 15726912, 'steps': 81910, 'loss/train': 1.3881722688674927} 11/07/2021 08:45:08 - INFO - __main__ - Step 81912: {'lr': 0.0002187140762721746, 'samples': 15727104, 'steps': 81911, 'loss/train': 1.2986241579055786} 11/07/2021 08:45:08 - INFO - __main__ - Step 81913: {'lr': 0.00021870881125317173, 'samples': 15727296, 'steps': 81912, 'loss/train': 1.5847065448760986} 11/07/2021 08:45:08 - INFO - __main__ - Step 81914: {'lr': 0.0002187035462482682, 'samples': 15727488, 'steps': 81913, 'loss/train': 1.6330777406692505} 11/07/2021 08:45:09 - INFO - __main__ - Step 81915: {'lr': 0.00021869828125746637, 'samples': 15727680, 'steps': 81914, 'loss/train': 0.9026507139205933} 11/07/2021 08:45:10 - INFO - __main__ - Step 81916: {'lr': 0.00021869301628076862, 'samples': 15727872, 'steps': 81915, 'loss/train': 1.453425407409668} 11/07/2021 08:45:10 - INFO - __main__ - Step 81917: {'lr': 0.0002186877513181773, 'samples': 15728064, 'steps': 81916, 'loss/train': 1.5080713033676147} 11/07/2021 08:45:10 - INFO - __main__ - Step 81918: {'lr': 0.00021868248636969478, 'samples': 15728256, 'steps': 81917, 'loss/train': 1.3975954055786133} 11/07/2021 08:45:11 - INFO - __main__ - Step 81919: {'lr': 0.0002186772214353235, 'samples': 15728448, 'steps': 81918, 'loss/train': 0.6736943125724792} 11/07/2021 08:45:11 - INFO - __main__ - Step 81920: {'lr': 0.00021867195651506576, 'samples': 15728640, 'steps': 81919, 'loss/train': 1.5684986114501953} 11/07/2021 08:45:12 - INFO - __main__ - Step 81921: {'lr': 0.00021866669160892392, 'samples': 15728832, 'steps': 81920, 'loss/train': 1.346764326095581} 11/07/2021 08:45:13 - INFO - __main__ - Step 81922: {'lr': 0.0002186614267169004, 'samples': 15729024, 'steps': 81921, 'loss/train': 1.0549620389938354} 11/07/2021 08:45:13 - INFO - __main__ - Step 81923: {'lr': 0.00021865616183899757, 'samples': 15729216, 'steps': 81922, 'loss/train': 1.6571847200393677} 11/07/2021 08:45:13 - INFO - __main__ - Step 81924: {'lr': 0.0002186508969752179, 'samples': 15729408, 'steps': 81923, 'loss/train': 1.451051950454712} 11/07/2021 08:45:14 - INFO - __main__ - Step 81925: {'lr': 0.00021864563212556351, 'samples': 15729600, 'steps': 81924, 'loss/train': 1.5553735494613647} 11/07/2021 08:45:15 - INFO - __main__ - Step 81926: {'lr': 0.00021864036729003693, 'samples': 15729792, 'steps': 81925, 'loss/train': 1.5820026397705078} 11/07/2021 08:45:15 - INFO - __main__ - Step 81927: {'lr': 0.00021863510246864054, 'samples': 15729984, 'steps': 81926, 'loss/train': 1.5591566562652588} 11/07/2021 08:45:15 - INFO - __main__ - Step 81928: {'lr': 0.00021862983766137667, 'samples': 15730176, 'steps': 81927, 'loss/train': 1.6230748891830444} 11/07/2021 08:45:16 - INFO - __main__ - Step 81929: {'lr': 0.0002186245728682477, 'samples': 15730368, 'steps': 81928, 'loss/train': 1.8752391338348389} 11/07/2021 08:45:16 - INFO - __main__ - Step 81930: {'lr': 0.00021861930808925607, 'samples': 15730560, 'steps': 81929, 'loss/train': 1.393430233001709} 11/07/2021 08:45:17 - INFO - __main__ - Step 81931: {'lr': 0.00021861404332440405, 'samples': 15730752, 'steps': 81930, 'loss/train': 1.6335017681121826} 11/07/2021 08:45:18 - INFO - __main__ - Step 81932: {'lr': 0.00021860877857369403, 'samples': 15730944, 'steps': 81931, 'loss/train': 1.359986424446106} 11/07/2021 08:45:18 - INFO - __main__ - Step 81933: {'lr': 0.00021860351383712847, 'samples': 15731136, 'steps': 81932, 'loss/train': 1.343477725982666} 11/07/2021 08:45:18 - INFO - __main__ - Step 81934: {'lr': 0.00021859824911470965, 'samples': 15731328, 'steps': 81933, 'loss/train': 1.6282157897949219} 11/07/2021 08:45:19 - INFO - __main__ - Step 81935: {'lr': 0.00021859298440644, 'samples': 15731520, 'steps': 81934, 'loss/train': 1.2157790660858154} 11/07/2021 08:45:19 - INFO - __main__ - Step 81936: {'lr': 0.00021858771971232184, 'samples': 15731712, 'steps': 81935, 'loss/train': 0.7408631443977356} 11/07/2021 08:45:20 - INFO - __main__ - Step 81937: {'lr': 0.00021858245503235765, 'samples': 15731904, 'steps': 81936, 'loss/train': 1.607974886894226} 11/07/2021 08:45:21 - INFO - __main__ - Step 81938: {'lr': 0.00021857719036654966, 'samples': 15732096, 'steps': 81937, 'loss/train': 1.3546884059906006} 11/07/2021 08:45:21 - INFO - __main__ - Step 81939: {'lr': 0.00021857192571490028, 'samples': 15732288, 'steps': 81938, 'loss/train': 1.3780652284622192} 11/07/2021 08:45:21 - INFO - __main__ - Step 81940: {'lr': 0.00021856666107741192, 'samples': 15732480, 'steps': 81939, 'loss/train': 1.4554966688156128} 11/07/2021 08:45:22 - INFO - __main__ - Step 81941: {'lr': 0.00021856139645408694, 'samples': 15732672, 'steps': 81940, 'loss/train': 1.152280569076538} 11/07/2021 08:45:23 - INFO - __main__ - Step 81942: {'lr': 0.0002185561318449277, 'samples': 15732864, 'steps': 81941, 'loss/train': 0.5793785452842712} 11/07/2021 08:45:23 - INFO - __main__ - Step 81943: {'lr': 0.00021855086724993658, 'samples': 15733056, 'steps': 81942, 'loss/train': 1.1815279722213745} 11/07/2021 08:45:23 - INFO - __main__ - Step 81944: {'lr': 0.00021854560266911595, 'samples': 15733248, 'steps': 81943, 'loss/train': 1.2100149393081665} 11/07/2021 08:45:24 - INFO - __main__ - Step 81945: {'lr': 0.0002185403381024682, 'samples': 15733440, 'steps': 81944, 'loss/train': 1.0882725715637207} 11/07/2021 08:45:24 - INFO - __main__ - Step 81946: {'lr': 0.0002185350735499957, 'samples': 15733632, 'steps': 81945, 'loss/train': 1.3889715671539307} 11/07/2021 08:45:25 - INFO - __main__ - Step 81947: {'lr': 0.00021852980901170078, 'samples': 15733824, 'steps': 81946, 'loss/train': 1.683424472808838} 11/07/2021 08:45:25 - INFO - __main__ - Step 81948: {'lr': 0.0002185245444875859, 'samples': 15734016, 'steps': 81947, 'loss/train': 1.4219671487808228} 11/07/2021 08:45:26 - INFO - __main__ - Step 81949: {'lr': 0.00021851927997765334, 'samples': 15734208, 'steps': 81948, 'loss/train': 1.2475048303604126} 11/07/2021 08:45:26 - INFO - __main__ - Step 81950: {'lr': 0.00021851401548190547, 'samples': 15734400, 'steps': 81949, 'loss/train': 3.8088505268096924} 11/07/2021 08:45:26 - INFO - __main__ - Step 81951: {'lr': 0.00021850875100034477, 'samples': 15734592, 'steps': 81950, 'loss/train': 1.6100938320159912} 11/07/2021 08:45:28 - INFO - __main__ - Step 81952: {'lr': 0.00021850348653297351, 'samples': 15734784, 'steps': 81951, 'loss/train': 0.9282059669494629} 11/07/2021 08:45:29 - INFO - __main__ - Step 81953: {'lr': 0.00021849822207979408, 'samples': 15734976, 'steps': 81952, 'loss/train': 1.7571454048156738} 11/07/2021 08:45:29 - INFO - __main__ - Step 81954: {'lr': 0.00021849295764080886, 'samples': 15735168, 'steps': 81953, 'loss/train': 1.102122187614441} 11/07/2021 08:45:29 - INFO - __main__ - Step 81955: {'lr': 0.00021848769321602024, 'samples': 15735360, 'steps': 81954, 'loss/train': 0.4921179711818695} 11/07/2021 08:45:30 - INFO - __main__ - Step 81956: {'lr': 0.00021848242880543058, 'samples': 15735552, 'steps': 81955, 'loss/train': 0.8086460828781128} 11/07/2021 08:45:30 - INFO - __main__ - Step 81957: {'lr': 0.00021847716440904222, 'samples': 15735744, 'steps': 81956, 'loss/train': 1.058969259262085} 11/07/2021 08:45:30 - INFO - __main__ - Step 81958: {'lr': 0.00021847190002685757, 'samples': 15735936, 'steps': 81957, 'loss/train': 1.1957626342773438} 11/07/2021 08:45:31 - INFO - __main__ - Step 81959: {'lr': 0.00021846663565887908, 'samples': 15736128, 'steps': 81958, 'loss/train': 1.165013313293457} 11/07/2021 08:45:32 - INFO - __main__ - Step 81960: {'lr': 0.00021846137130510895, 'samples': 15736320, 'steps': 81959, 'loss/train': 1.107596755027771} 11/07/2021 08:45:32 - INFO - __main__ - Step 81961: {'lr': 0.00021845610696554968, 'samples': 15736512, 'steps': 81960, 'loss/train': 1.4544600248336792} 11/07/2021 08:45:32 - INFO - __main__ - Step 81962: {'lr': 0.00021845084264020357, 'samples': 15736704, 'steps': 81961, 'loss/train': 1.0679354667663574} 11/07/2021 08:45:33 - INFO - __main__ - Step 81963: {'lr': 0.00021844557832907303, 'samples': 15736896, 'steps': 81962, 'loss/train': 1.2373062372207642} 11/07/2021 08:45:34 - INFO - __main__ - Step 81964: {'lr': 0.00021844031403216047, 'samples': 15737088, 'steps': 81963, 'loss/train': 0.9305351376533508} 11/07/2021 08:45:34 - INFO - __main__ - Step 81965: {'lr': 0.00021843504974946817, 'samples': 15737280, 'steps': 81964, 'loss/train': 1.8959134817123413} 11/07/2021 08:45:34 - INFO - __main__ - Step 81966: {'lr': 0.00021842978548099857, 'samples': 15737472, 'steps': 81965, 'loss/train': 1.0857735872268677} 11/07/2021 08:45:35 - INFO - __main__ - Step 81967: {'lr': 0.000218424521226754, 'samples': 15737664, 'steps': 81966, 'loss/train': 1.6676563024520874} 11/07/2021 08:45:35 - INFO - __main__ - Step 81968: {'lr': 0.00021841925698673687, 'samples': 15737856, 'steps': 81967, 'loss/train': 1.5017001628875732} 11/07/2021 08:45:36 - INFO - __main__ - Step 81969: {'lr': 0.0002184139927609495, 'samples': 15738048, 'steps': 81968, 'loss/train': 1.5002306699752808} 11/07/2021 08:45:36 - INFO - __main__ - Step 81970: {'lr': 0.00021840872854939436, 'samples': 15738240, 'steps': 81969, 'loss/train': 1.2939642667770386} 11/07/2021 08:45:37 - INFO - __main__ - Step 81971: {'lr': 0.00021840346435207376, 'samples': 15738432, 'steps': 81970, 'loss/train': 1.4957914352416992} 11/07/2021 08:45:37 - INFO - __main__ - Step 81972: {'lr': 0.00021839820016899002, 'samples': 15738624, 'steps': 81971, 'loss/train': 1.641781210899353} 11/07/2021 08:45:38 - INFO - __main__ - Step 81973: {'lr': 0.00021839293600014556, 'samples': 15738816, 'steps': 81972, 'loss/train': 1.3858494758605957} 11/07/2021 08:45:39 - INFO - __main__ - Step 81974: {'lr': 0.00021838767184554278, 'samples': 15739008, 'steps': 81973, 'loss/train': 1.5774500370025635} 11/07/2021 08:45:39 - INFO - __main__ - Step 81975: {'lr': 0.00021838240770518402, 'samples': 15739200, 'steps': 81974, 'loss/train': 1.249455451965332} 11/07/2021 08:45:39 - INFO - __main__ - Step 81976: {'lr': 0.00021837714357907166, 'samples': 15739392, 'steps': 81975, 'loss/train': 1.819058895111084} 11/07/2021 08:45:40 - INFO - __main__ - Step 81977: {'lr': 0.00021837187946720804, 'samples': 15739584, 'steps': 81976, 'loss/train': 1.1091678142547607} 11/07/2021 08:45:40 - INFO - __main__ - Step 81978: {'lr': 0.00021836661536959567, 'samples': 15739776, 'steps': 81977, 'loss/train': 1.368850588798523} 11/07/2021 08:45:41 - INFO - __main__ - Step 81979: {'lr': 0.00021836135128623673, 'samples': 15739968, 'steps': 81978, 'loss/train': 1.3429418802261353} 11/07/2021 08:45:41 - INFO - __main__ - Step 81980: {'lr': 0.00021835608721713367, 'samples': 15740160, 'steps': 81979, 'loss/train': 1.4896680116653442} 11/07/2021 08:45:42 - INFO - __main__ - Step 81981: {'lr': 0.00021835082316228892, 'samples': 15740352, 'steps': 81980, 'loss/train': 0.9008151292800903} 11/07/2021 08:45:42 - INFO - __main__ - Step 81982: {'lr': 0.0002183455591217048, 'samples': 15740544, 'steps': 81981, 'loss/train': 1.6770747900009155} 11/07/2021 08:45:42 - INFO - __main__ - Step 81983: {'lr': 0.00021834029509538365, 'samples': 15740736, 'steps': 81982, 'loss/train': 1.6874535083770752} 11/07/2021 08:45:44 - INFO - __main__ - Step 81984: {'lr': 0.00021833503108332786, 'samples': 15740928, 'steps': 81983, 'loss/train': 1.239082932472229} 11/07/2021 08:45:44 - INFO - __main__ - Step 81985: {'lr': 0.0002183297670855398, 'samples': 15741120, 'steps': 81984, 'loss/train': 1.4883664846420288} 11/07/2021 08:45:44 - INFO - __main__ - Step 81986: {'lr': 0.0002183245031020219, 'samples': 15741312, 'steps': 81985, 'loss/train': 1.2150758504867554} 11/07/2021 08:45:45 - INFO - __main__ - Step 81987: {'lr': 0.00021831923913277648, 'samples': 15741504, 'steps': 81986, 'loss/train': 1.2229024171829224} 11/07/2021 08:45:45 - INFO - __main__ - Step 81988: {'lr': 0.00021831397517780592, 'samples': 15741696, 'steps': 81987, 'loss/train': 1.1767107248306274} 11/07/2021 08:45:46 - INFO - __main__ - Step 81989: {'lr': 0.0002183087112371126, 'samples': 15741888, 'steps': 81988, 'loss/train': 1.7111279964447021} 11/07/2021 08:45:46 - INFO - __main__ - Step 81990: {'lr': 0.00021830344731069886, 'samples': 15742080, 'steps': 81989, 'loss/train': 1.7641246318817139} 11/07/2021 08:45:47 - INFO - __main__ - Step 81991: {'lr': 0.00021829818339856716, 'samples': 15742272, 'steps': 81990, 'loss/train': 1.2710163593292236} 11/07/2021 08:45:47 - INFO - __main__ - Step 81992: {'lr': 0.00021829291950071984, 'samples': 15742464, 'steps': 81991, 'loss/train': 0.7822362780570984} 11/07/2021 08:45:48 - INFO - __main__ - Step 81993: {'lr': 0.00021828765561715915, 'samples': 15742656, 'steps': 81992, 'loss/train': 1.9347063302993774} 11/07/2021 08:45:48 - INFO - __main__ - Step 81994: {'lr': 0.00021828239174788756, 'samples': 15742848, 'steps': 81993, 'loss/train': 1.3748453855514526} 11/07/2021 08:45:49 - INFO - __main__ - Step 81995: {'lr': 0.00021827712789290743, 'samples': 15743040, 'steps': 81994, 'loss/train': 1.4550189971923828} 11/07/2021 08:45:49 - INFO - __main__ - Step 81996: {'lr': 0.00021827186405222115, 'samples': 15743232, 'steps': 81995, 'loss/train': 1.4831171035766602} 11/07/2021 08:45:50 - INFO - __main__ - Step 81997: {'lr': 0.0002182666002258311, 'samples': 15743424, 'steps': 81996, 'loss/train': 1.3291908502578735} 11/07/2021 08:45:50 - INFO - __main__ - Step 81998: {'lr': 0.00021826133641373961, 'samples': 15743616, 'steps': 81997, 'loss/train': 1.1738017797470093} 11/07/2021 08:45:50 - INFO - __main__ - Step 81999: {'lr': 0.00021825607261594904, 'samples': 15743808, 'steps': 81998, 'loss/train': 1.6931029558181763} 11/07/2021 08:45:51 - INFO - __main__ - Step 82000: {'lr': 0.00021825080883246186, 'samples': 15744000, 'steps': 81999, 'loss/train': 1.3202465772628784} 11/07/2021 08:45:52 - INFO - __main__ - Step 82001: {'lr': 0.0002182455450632803, 'samples': 15744192, 'steps': 82000, 'loss/train': 1.6192408800125122} 11/07/2021 08:45:52 - INFO - __main__ - Step 82002: {'lr': 0.00021824028130840688, 'samples': 15744384, 'steps': 82001, 'loss/train': 1.2046760320663452} 11/07/2021 08:45:52 - INFO - __main__ - Step 82003: {'lr': 0.00021823501756784385, 'samples': 15744576, 'steps': 82002, 'loss/train': 0.4714474678039551} 11/07/2021 08:45:53 - INFO - __main__ - Step 82004: {'lr': 0.00021822975384159365, 'samples': 15744768, 'steps': 82003, 'loss/train': 1.3434090614318848} 11/07/2021 08:45:54 - INFO - __main__ - Step 82005: {'lr': 0.00021822449012965872, 'samples': 15744960, 'steps': 82004, 'loss/train': 1.7563966512680054} 11/07/2021 08:45:54 - INFO - __main__ - Step 82006: {'lr': 0.00021821922643204127, 'samples': 15745152, 'steps': 82005, 'loss/train': 1.6675560474395752} 11/07/2021 08:45:54 - INFO - __main__ - Step 82007: {'lr': 0.00021821396274874372, 'samples': 15745344, 'steps': 82006, 'loss/train': 1.0713787078857422} 11/07/2021 08:45:55 - INFO - __main__ - Step 82008: {'lr': 0.00021820869907976847, 'samples': 15745536, 'steps': 82007, 'loss/train': 1.5076754093170166} 11/07/2021 08:45:55 - INFO - __main__ - Step 82009: {'lr': 0.0002182034354251179, 'samples': 15745728, 'steps': 82008, 'loss/train': 1.337306261062622} 11/07/2021 08:45:56 - INFO - __main__ - Step 82010: {'lr': 0.00021819817178479436, 'samples': 15745920, 'steps': 82009, 'loss/train': 1.6475399732589722} 11/07/2021 08:45:57 - INFO - __main__ - Step 82011: {'lr': 0.00021819290815880028, 'samples': 15746112, 'steps': 82010, 'loss/train': 1.5261069536209106} 11/07/2021 08:45:57 - INFO - __main__ - Step 82012: {'lr': 0.00021818764454713792, 'samples': 15746304, 'steps': 82011, 'loss/train': 0.7869037985801697} 11/07/2021 08:45:58 - INFO - __main__ - Step 82013: {'lr': 0.00021818238094980973, 'samples': 15746496, 'steps': 82012, 'loss/train': 1.2298970222473145} 11/07/2021 08:45:58 - INFO - __main__ - Step 82014: {'lr': 0.00021817711736681812, 'samples': 15746688, 'steps': 82013, 'loss/train': 0.8995493054389954} 11/07/2021 08:45:58 - INFO - __main__ - Step 82015: {'lr': 0.00021817185379816536, 'samples': 15746880, 'steps': 82014, 'loss/train': 1.6532591581344604} 11/07/2021 08:45:59 - INFO - __main__ - Step 82016: {'lr': 0.00021816659024385387, 'samples': 15747072, 'steps': 82015, 'loss/train': 1.670758605003357} 11/07/2021 08:46:00 - INFO - __main__ - Step 82017: {'lr': 0.00021816132670388603, 'samples': 15747264, 'steps': 82016, 'loss/train': 1.5032260417938232} 11/07/2021 08:46:00 - INFO - __main__ - Step 82018: {'lr': 0.0002181560631782643, 'samples': 15747456, 'steps': 82017, 'loss/train': 0.8556668162345886} 11/07/2021 08:46:00 - INFO - __main__ - Step 82019: {'lr': 0.0002181507996669909, 'samples': 15747648, 'steps': 82018, 'loss/train': 1.5723460912704468} 11/07/2021 08:46:01 - INFO - __main__ - Step 82020: {'lr': 0.00021814553617006822, 'samples': 15747840, 'steps': 82019, 'loss/train': 1.4315682649612427} 11/07/2021 08:46:02 - INFO - __main__ - Step 82021: {'lr': 0.00021814027268749866, 'samples': 15748032, 'steps': 82020, 'loss/train': 1.1757631301879883} 11/07/2021 08:46:02 - INFO - __main__ - Step 82022: {'lr': 0.00021813500921928465, 'samples': 15748224, 'steps': 82021, 'loss/train': 0.8575325012207031} 11/07/2021 08:46:02 - INFO - __main__ - Step 82023: {'lr': 0.00021812974576542845, 'samples': 15748416, 'steps': 82022, 'loss/train': 1.2892738580703735} 11/07/2021 08:46:03 - INFO - __main__ - Step 82024: {'lr': 0.00021812448232593252, 'samples': 15748608, 'steps': 82023, 'loss/train': 1.3165781497955322} 11/07/2021 08:46:03 - INFO - __main__ - Step 82025: {'lr': 0.00021811921890079922, 'samples': 15748800, 'steps': 82024, 'loss/train': 2.1055314540863037} 11/07/2021 08:46:04 - INFO - __main__ - Step 82026: {'lr': 0.00021811395549003088, 'samples': 15748992, 'steps': 82025, 'loss/train': 1.5667247772216797} 11/07/2021 08:46:04 - INFO - __main__ - Step 82027: {'lr': 0.00021810869209362994, 'samples': 15749184, 'steps': 82026, 'loss/train': 0.6982947587966919} 11/07/2021 08:46:05 - INFO - __main__ - Step 82028: {'lr': 0.00021810342871159873, 'samples': 15749376, 'steps': 82027, 'loss/train': 1.498687744140625} 11/07/2021 08:46:05 - INFO - __main__ - Step 82029: {'lr': 0.00021809816534393956, 'samples': 15749568, 'steps': 82028, 'loss/train': 1.2569727897644043} 11/07/2021 08:46:05 - INFO - __main__ - Step 82030: {'lr': 0.00021809290199065494, 'samples': 15749760, 'steps': 82029, 'loss/train': 1.6555235385894775} 11/07/2021 08:46:06 - INFO - __main__ - Step 82031: {'lr': 0.0002180876386517472, 'samples': 15749952, 'steps': 82030, 'loss/train': 1.5145519971847534} 11/07/2021 08:46:07 - INFO - __main__ - Step 82032: {'lr': 0.00021808237532721864, 'samples': 15750144, 'steps': 82031, 'loss/train': 1.3495635986328125} 11/07/2021 08:46:07 - INFO - __main__ - Step 82033: {'lr': 0.00021807711201707165, 'samples': 15750336, 'steps': 82032, 'loss/train': 1.3300036191940308} 11/07/2021 08:46:07 - INFO - __main__ - Step 82034: {'lr': 0.00021807184872130858, 'samples': 15750528, 'steps': 82033, 'loss/train': 1.4596832990646362} 11/07/2021 08:46:08 - INFO - __main__ - Step 82035: {'lr': 0.00021806658543993188, 'samples': 15750720, 'steps': 82034, 'loss/train': 1.544372320175171} 11/07/2021 08:46:09 - INFO - __main__ - Step 82036: {'lr': 0.0002180613221729439, 'samples': 15750912, 'steps': 82035, 'loss/train': 1.544357180595398} 11/07/2021 08:46:09 - INFO - __main__ - Step 82037: {'lr': 0.00021805605892034695, 'samples': 15751104, 'steps': 82036, 'loss/train': 1.0296381711959839} 11/07/2021 08:46:10 - INFO - __main__ - Step 82038: {'lr': 0.00021805079568214348, 'samples': 15751296, 'steps': 82037, 'loss/train': 1.3956890106201172} 11/07/2021 08:46:10 - INFO - __main__ - Step 82039: {'lr': 0.0002180455324583358, 'samples': 15751488, 'steps': 82038, 'loss/train': 1.561665654182434} 11/07/2021 08:46:10 - INFO - __main__ - Step 82040: {'lr': 0.00021804026924892634, 'samples': 15751680, 'steps': 82039, 'loss/train': 1.5974175930023193} 11/07/2021 08:46:11 - INFO - __main__ - Step 82041: {'lr': 0.0002180350060539174, 'samples': 15751872, 'steps': 82040, 'loss/train': 1.5888649225234985} 11/07/2021 08:46:12 - INFO - __main__ - Step 82042: {'lr': 0.00021802974287331146, 'samples': 15752064, 'steps': 82041, 'loss/train': 1.433175802230835} 11/07/2021 08:46:12 - INFO - __main__ - Step 82043: {'lr': 0.00021802447970711074, 'samples': 15752256, 'steps': 82042, 'loss/train': 1.4260708093643188} 11/07/2021 08:46:12 - INFO - __main__ - Step 82044: {'lr': 0.00021801921655531775, 'samples': 15752448, 'steps': 82043, 'loss/train': 0.9659870266914368} 11/07/2021 08:46:13 - INFO - __main__ - Step 82045: {'lr': 0.00021801395341793493, 'samples': 15752640, 'steps': 82044, 'loss/train': 0.7174556255340576} 11/07/2021 08:46:14 - INFO - __main__ - Step 82046: {'lr': 0.0002180086902949644, 'samples': 15752832, 'steps': 82045, 'loss/train': 1.2462643384933472} 11/07/2021 08:46:14 - INFO - __main__ - Step 82047: {'lr': 0.00021800342718640865, 'samples': 15753024, 'steps': 82046, 'loss/train': 1.7058537006378174} 11/07/2021 08:46:15 - INFO - __main__ - Step 82048: {'lr': 0.00021799816409227008, 'samples': 15753216, 'steps': 82047, 'loss/train': 1.247818112373352} 11/07/2021 08:46:15 - INFO - __main__ - Step 82049: {'lr': 0.00021799290101255105, 'samples': 15753408, 'steps': 82048, 'loss/train': 1.1653344631195068} 11/07/2021 08:46:16 - INFO - __main__ - Step 82050: {'lr': 0.00021798763794725391, 'samples': 15753600, 'steps': 82049, 'loss/train': 1.195488691329956} 11/07/2021 08:46:16 - INFO - __main__ - Step 82051: {'lr': 0.00021798237489638103, 'samples': 15753792, 'steps': 82050, 'loss/train': 0.8652616739273071} 11/07/2021 08:46:17 - INFO - __main__ - Step 82052: {'lr': 0.00021797711185993478, 'samples': 15753984, 'steps': 82051, 'loss/train': 0.5875667929649353} 11/07/2021 08:46:17 - INFO - __main__ - Step 82053: {'lr': 0.00021797184883791762, 'samples': 15754176, 'steps': 82052, 'loss/train': 1.1149930953979492} 11/07/2021 08:46:18 - INFO - __main__ - Step 82054: {'lr': 0.0002179665858303318, 'samples': 15754368, 'steps': 82053, 'loss/train': 1.3044710159301758} 11/07/2021 08:46:18 - INFO - __main__ - Step 82055: {'lr': 0.00021796132283717976, 'samples': 15754560, 'steps': 82054, 'loss/train': 1.426466703414917} 11/07/2021 08:46:18 - INFO - __main__ - Step 82056: {'lr': 0.00021795605985846383, 'samples': 15754752, 'steps': 82055, 'loss/train': 1.459130883216858} 11/07/2021 08:46:19 - INFO - __main__ - Step 82057: {'lr': 0.00021795079689418645, 'samples': 15754944, 'steps': 82056, 'loss/train': 1.196845531463623} 11/07/2021 08:46:20 - INFO - __main__ - Step 82058: {'lr': 0.00021794553394435, 'samples': 15755136, 'steps': 82057, 'loss/train': 1.0092157125473022} 11/07/2021 08:46:20 - INFO - __main__ - Step 82059: {'lr': 0.00021794027100895675, 'samples': 15755328, 'steps': 82058, 'loss/train': 1.3031846284866333} 11/07/2021 08:46:20 - INFO - __main__ - Step 82060: {'lr': 0.0002179350080880091, 'samples': 15755520, 'steps': 82059, 'loss/train': 1.5268797874450684} 11/07/2021 08:46:21 - INFO - __main__ - Step 82061: {'lr': 0.00021792974518150941, 'samples': 15755712, 'steps': 82060, 'loss/train': 1.371804118156433} 11/07/2021 08:46:22 - INFO - __main__ - Step 82062: {'lr': 0.00021792448228946011, 'samples': 15755904, 'steps': 82061, 'loss/train': 1.5474791526794434} 11/07/2021 08:46:22 - INFO - __main__ - Step 82063: {'lr': 0.00021791921941186353, 'samples': 15756096, 'steps': 82062, 'loss/train': 1.492661952972412} 11/07/2021 08:46:22 - INFO - __main__ - Step 82064: {'lr': 0.00021791395654872204, 'samples': 15756288, 'steps': 82063, 'loss/train': 1.4947316646575928} 11/07/2021 08:46:23 - INFO - __main__ - Step 82065: {'lr': 0.00021790869370003803, 'samples': 15756480, 'steps': 82064, 'loss/train': 1.2878139019012451} 11/07/2021 08:46:23 - INFO - __main__ - Step 82066: {'lr': 0.0002179034308658139, 'samples': 15756672, 'steps': 82065, 'loss/train': 1.4263224601745605} 11/07/2021 08:46:24 - INFO - __main__ - Step 82067: {'lr': 0.000217898168046052, 'samples': 15756864, 'steps': 82066, 'loss/train': 1.476071834564209} 11/07/2021 08:46:25 - INFO - __main__ - Step 82068: {'lr': 0.00021789290524075464, 'samples': 15757056, 'steps': 82067, 'loss/train': 1.2845193147659302} 11/07/2021 08:46:25 - INFO - __main__ - Step 82069: {'lr': 0.00021788764244992426, 'samples': 15757248, 'steps': 82068, 'loss/train': 1.5151344537734985} 11/07/2021 08:46:25 - INFO - __main__ - Step 82070: {'lr': 0.00021788237967356323, 'samples': 15757440, 'steps': 82069, 'loss/train': 1.9583321809768677} 11/07/2021 08:46:26 - INFO - __main__ - Step 82071: {'lr': 0.00021787711691167387, 'samples': 15757632, 'steps': 82070, 'loss/train': 1.695358157157898} 11/07/2021 08:46:27 - INFO - __main__ - Step 82072: {'lr': 0.00021787185416425873, 'samples': 15757824, 'steps': 82071, 'loss/train': 1.614760398864746} 11/07/2021 08:46:27 - INFO - __main__ - Step 82073: {'lr': 0.0002178665914313199, 'samples': 15758016, 'steps': 82072, 'loss/train': 1.0201679468154907} 11/07/2021 08:46:27 - INFO - __main__ - Step 82074: {'lr': 0.0002178613287128599, 'samples': 15758208, 'steps': 82073, 'loss/train': 1.3502111434936523} 11/07/2021 08:46:28 - INFO - __main__ - Step 82075: {'lr': 0.00021785606600888108, 'samples': 15758400, 'steps': 82074, 'loss/train': 0.9810562133789062} 11/07/2021 08:46:28 - INFO - __main__ - Step 82076: {'lr': 0.00021785080331938585, 'samples': 15758592, 'steps': 82075, 'loss/train': 1.9164631366729736} 11/07/2021 08:46:28 - INFO - __main__ - Step 82077: {'lr': 0.0002178455406443765, 'samples': 15758784, 'steps': 82076, 'loss/train': 1.179831862449646} 11/07/2021 08:46:29 - INFO - __main__ - Step 82078: {'lr': 0.0002178402779838555, 'samples': 15758976, 'steps': 82077, 'loss/train': 1.1268047094345093} 11/07/2021 08:46:30 - INFO - __main__ - Step 82079: {'lr': 0.00021783501533782516, 'samples': 15759168, 'steps': 82078, 'loss/train': 1.2355530261993408} 11/07/2021 08:46:30 - INFO - __main__ - Step 82080: {'lr': 0.00021782975270628785, 'samples': 15759360, 'steps': 82079, 'loss/train': 1.5639796257019043} 11/07/2021 08:46:31 - INFO - __main__ - Step 82081: {'lr': 0.000217824490089246, 'samples': 15759552, 'steps': 82080, 'loss/train': 0.4731065034866333} 11/07/2021 08:46:31 - INFO - __main__ - Step 82082: {'lr': 0.00021781922748670188, 'samples': 15759744, 'steps': 82081, 'loss/train': 1.6820828914642334} 11/07/2021 08:46:32 - INFO - __main__ - Step 82083: {'lr': 0.000217813964898658, 'samples': 15759936, 'steps': 82082, 'loss/train': 1.3736705780029297} 11/07/2021 08:46:32 - INFO - __main__ - Step 82084: {'lr': 0.00021780870232511663, 'samples': 15760128, 'steps': 82083, 'loss/train': 1.382871389389038} 11/07/2021 08:46:33 - INFO - __main__ - Step 82085: {'lr': 0.00021780343976608016, 'samples': 15760320, 'steps': 82084, 'loss/train': 1.1126434803009033} 11/07/2021 08:46:33 - INFO - __main__ - Step 82086: {'lr': 0.00021779817722155094, 'samples': 15760512, 'steps': 82085, 'loss/train': 1.3529363870620728} 11/07/2021 08:46:33 - INFO - __main__ - Step 82087: {'lr': 0.00021779291469153136, 'samples': 15760704, 'steps': 82086, 'loss/train': 1.4401578903198242} 11/07/2021 08:46:34 - INFO - __main__ - Step 82088: {'lr': 0.00021778765217602382, 'samples': 15760896, 'steps': 82087, 'loss/train': 0.8974550366401672} 11/07/2021 08:46:35 - INFO - __main__ - Step 82089: {'lr': 0.0002177823896750306, 'samples': 15761088, 'steps': 82088, 'loss/train': 1.4128690958023071} 11/07/2021 08:46:35 - INFO - __main__ - Step 82090: {'lr': 0.0002177771271885542, 'samples': 15761280, 'steps': 82089, 'loss/train': 1.3190064430236816} 11/07/2021 08:46:35 - INFO - __main__ - Step 82091: {'lr': 0.0002177718647165969, 'samples': 15761472, 'steps': 82090, 'loss/train': 0.9847179651260376} 11/07/2021 08:46:36 - INFO - __main__ - Step 82092: {'lr': 0.00021776660225916112, 'samples': 15761664, 'steps': 82091, 'loss/train': 1.3228940963745117} 11/07/2021 08:46:37 - INFO - __main__ - Step 82093: {'lr': 0.00021776133981624921, 'samples': 15761856, 'steps': 82092, 'loss/train': 1.4376360177993774} 11/07/2021 08:46:37 - INFO - __main__ - Step 82094: {'lr': 0.00021775607738786358, 'samples': 15762048, 'steps': 82093, 'loss/train': 1.5539735555648804} 11/07/2021 08:46:37 - INFO - __main__ - Step 82095: {'lr': 0.0002177508149740065, 'samples': 15762240, 'steps': 82094, 'loss/train': 1.5022786855697632} 11/07/2021 08:46:38 - INFO - __main__ - Step 82096: {'lr': 0.00021774555257468044, 'samples': 15762432, 'steps': 82095, 'loss/train': 1.3096548318862915} 11/07/2021 08:46:38 - INFO - __main__ - Step 82097: {'lr': 0.00021774029018988773, 'samples': 15762624, 'steps': 82096, 'loss/train': 1.0525095462799072} 11/07/2021 08:46:39 - INFO - __main__ - Step 82098: {'lr': 0.00021773502781963073, 'samples': 15762816, 'steps': 82097, 'loss/train': 1.349319338798523} 11/07/2021 08:46:40 - INFO - __main__ - Step 82099: {'lr': 0.00021772976546391188, 'samples': 15763008, 'steps': 82098, 'loss/train': 1.6015585660934448} 11/07/2021 08:46:40 - INFO - __main__ - Step 82100: {'lr': 0.00021772450312273345, 'samples': 15763200, 'steps': 82099, 'loss/train': 1.5636249780654907} 11/07/2021 08:46:40 - INFO - __main__ - Step 82101: {'lr': 0.00021771924079609788, 'samples': 15763392, 'steps': 82100, 'loss/train': 1.6320403814315796} 11/07/2021 08:46:41 - INFO - __main__ - Step 82102: {'lr': 0.00021771397848400752, 'samples': 15763584, 'steps': 82101, 'loss/train': 1.0503520965576172} 11/07/2021 08:46:42 - INFO - __main__ - Step 82103: {'lr': 0.0002177087161864647, 'samples': 15763776, 'steps': 82102, 'loss/train': 0.862181544303894} 11/07/2021 08:46:42 - INFO - __main__ - Step 82104: {'lr': 0.00021770345390347188, 'samples': 15763968, 'steps': 82103, 'loss/train': 1.1757900714874268} 11/07/2021 08:46:42 - INFO - __main__ - Step 82105: {'lr': 0.00021769819163503144, 'samples': 15764160, 'steps': 82104, 'loss/train': 1.6990904808044434} 11/07/2021 08:46:43 - INFO - __main__ - Step 82106: {'lr': 0.00021769292938114563, 'samples': 15764352, 'steps': 82105, 'loss/train': 1.2109259366989136} 11/07/2021 08:46:43 - INFO - __main__ - Step 82107: {'lr': 0.00021768766714181688, 'samples': 15764544, 'steps': 82106, 'loss/train': 1.4924436807632446} 11/07/2021 08:46:44 - INFO - __main__ - Step 82108: {'lr': 0.00021768240491704756, 'samples': 15764736, 'steps': 82107, 'loss/train': 1.4491297006607056} 11/07/2021 08:46:44 - INFO - __main__ - Step 82109: {'lr': 0.00021767714270684008, 'samples': 15764928, 'steps': 82108, 'loss/train': 1.7933909893035889} 11/07/2021 08:46:45 - INFO - __main__ - Step 82110: {'lr': 0.00021767188051119673, 'samples': 15765120, 'steps': 82109, 'loss/train': 1.8250222206115723} 11/07/2021 08:46:45 - INFO - __main__ - Step 82111: {'lr': 0.00021766661833012, 'samples': 15765312, 'steps': 82110, 'loss/train': 1.2448533773422241} 11/07/2021 08:46:45 - INFO - __main__ - Step 82112: {'lr': 0.0002176613561636122, 'samples': 15765504, 'steps': 82111, 'loss/train': 1.447120189666748} 11/07/2021 08:46:46 - INFO - __main__ - Step 82113: {'lr': 0.00021765609401167566, 'samples': 15765696, 'steps': 82112, 'loss/train': 1.3028802871704102} 11/07/2021 08:46:47 - INFO - __main__ - Step 82114: {'lr': 0.0002176508318743128, 'samples': 15765888, 'steps': 82113, 'loss/train': 1.029557704925537} 11/07/2021 08:46:47 - INFO - __main__ - Step 82115: {'lr': 0.00021764556975152591, 'samples': 15766080, 'steps': 82114, 'loss/train': 0.842783510684967} 11/07/2021 08:46:47 - INFO - __main__ - Step 82116: {'lr': 0.00021764030764331754, 'samples': 15766272, 'steps': 82115, 'loss/train': 1.3299297094345093} 11/07/2021 08:46:48 - INFO - __main__ - Step 82117: {'lr': 0.00021763504554968987, 'samples': 15766464, 'steps': 82116, 'loss/train': 1.6470216512680054} 11/07/2021 08:46:49 - INFO - __main__ - Step 82118: {'lr': 0.00021762978347064535, 'samples': 15766656, 'steps': 82117, 'loss/train': 1.3106601238250732} 11/07/2021 08:46:49 - INFO - __main__ - Step 82119: {'lr': 0.00021762452140618638, 'samples': 15766848, 'steps': 82118, 'loss/train': 1.5940463542938232} 11/07/2021 08:46:50 - INFO - __main__ - Step 82120: {'lr': 0.00021761925935631526, 'samples': 15767040, 'steps': 82119, 'loss/train': 1.684791922569275} 11/07/2021 08:46:50 - INFO - __main__ - Step 82121: {'lr': 0.00021761399732103442, 'samples': 15767232, 'steps': 82120, 'loss/train': 1.0932059288024902} 11/07/2021 08:46:50 - INFO - __main__ - Step 82122: {'lr': 0.0002176087353003462, 'samples': 15767424, 'steps': 82121, 'loss/train': 1.5647751092910767} 11/07/2021 08:46:51 - INFO - __main__ - Step 82123: {'lr': 0.00021760347329425302, 'samples': 15767616, 'steps': 82122, 'loss/train': 0.20576435327529907} 11/07/2021 08:46:52 - INFO - __main__ - Step 82124: {'lr': 0.0002175982113027572, 'samples': 15767808, 'steps': 82123, 'loss/train': 1.6196383237838745} 11/07/2021 08:46:52 - INFO - __main__ - Step 82125: {'lr': 0.0002175929493258611, 'samples': 15768000, 'steps': 82124, 'loss/train': 1.0810267925262451} 11/07/2021 08:46:52 - INFO - __main__ - Step 82126: {'lr': 0.00021758768736356726, 'samples': 15768192, 'steps': 82125, 'loss/train': 1.424064040184021} 11/07/2021 08:46:53 - INFO - __main__ - Step 82127: {'lr': 0.00021758242541587778, 'samples': 15768384, 'steps': 82126, 'loss/train': 1.1415009498596191} 11/07/2021 08:46:53 - INFO - __main__ - Step 82128: {'lr': 0.00021757716348279517, 'samples': 15768576, 'steps': 82127, 'loss/train': 0.8589686751365662} 11/07/2021 08:46:54 - INFO - __main__ - Step 82129: {'lr': 0.00021757190156432177, 'samples': 15768768, 'steps': 82128, 'loss/train': 1.410621166229248} 11/07/2021 08:46:55 - INFO - __main__ - Step 82130: {'lr': 0.00021756663966045997, 'samples': 15768960, 'steps': 82129, 'loss/train': 1.500764012336731} 11/07/2021 08:46:55 - INFO - __main__ - Step 82131: {'lr': 0.00021756137777121217, 'samples': 15769152, 'steps': 82130, 'loss/train': 1.556894063949585} 11/07/2021 08:46:55 - INFO - __main__ - Step 82132: {'lr': 0.0002175561158965807, 'samples': 15769344, 'steps': 82131, 'loss/train': 1.8855935335159302} 11/07/2021 08:46:56 - INFO - __main__ - Step 82133: {'lr': 0.00021755085403656795, 'samples': 15769536, 'steps': 82132, 'loss/train': 0.9357612729072571} 11/07/2021 08:46:57 - INFO - __main__ - Step 82134: {'lr': 0.00021754559219117626, 'samples': 15769728, 'steps': 82133, 'loss/train': 1.3933197259902954} 11/07/2021 08:46:57 - INFO - __main__ - Step 82135: {'lr': 0.00021754033036040804, 'samples': 15769920, 'steps': 82134, 'loss/train': 0.5369923710823059} 11/07/2021 08:46:57 - INFO - __main__ - Step 82136: {'lr': 0.00021753506854426562, 'samples': 15770112, 'steps': 82135, 'loss/train': 1.3237394094467163} 11/07/2021 08:46:58 - INFO - __main__ - Step 82137: {'lr': 0.00021752980674275146, 'samples': 15770304, 'steps': 82136, 'loss/train': 1.5046104192733765} 11/07/2021 08:46:58 - INFO - __main__ - Step 82138: {'lr': 0.0002175245449558678, 'samples': 15770496, 'steps': 82137, 'loss/train': 0.5570573806762695} 11/07/2021 08:46:59 - INFO - __main__ - Step 82139: {'lr': 0.00021751928318361724, 'samples': 15770688, 'steps': 82138, 'loss/train': 1.04371178150177} 11/07/2021 08:46:59 - INFO - __main__ - Step 82140: {'lr': 0.00021751402142600185, 'samples': 15770880, 'steps': 82139, 'loss/train': 1.4097011089324951} 11/07/2021 08:47:00 - INFO - __main__ - Step 82141: {'lr': 0.00021750875968302416, 'samples': 15771072, 'steps': 82140, 'loss/train': 0.9564287066459656} 11/07/2021 08:47:00 - INFO - __main__ - Step 82142: {'lr': 0.0002175034979546865, 'samples': 15771264, 'steps': 82141, 'loss/train': 1.4841325283050537} 11/07/2021 08:47:00 - INFO - __main__ - Step 82143: {'lr': 0.0002174982362409913, 'samples': 15771456, 'steps': 82142, 'loss/train': 1.464124083518982} 11/07/2021 08:47:01 - INFO - __main__ - Step 82144: {'lr': 0.00021749297454194086, 'samples': 15771648, 'steps': 82143, 'loss/train': 1.180593729019165} 11/07/2021 08:47:02 - INFO - __main__ - Step 82145: {'lr': 0.0002174877128575376, 'samples': 15771840, 'steps': 82144, 'loss/train': 1.8037662506103516} 11/07/2021 08:47:02 - INFO - __main__ - Step 82146: {'lr': 0.00021748245118778387, 'samples': 15772032, 'steps': 82145, 'loss/train': 1.5108537673950195} 11/07/2021 08:47:03 - INFO - __main__ - Step 82147: {'lr': 0.00021747718953268202, 'samples': 15772224, 'steps': 82146, 'loss/train': 1.481870412826538} 11/07/2021 08:47:03 - INFO - __main__ - Step 82148: {'lr': 0.0002174719278922345, 'samples': 15772416, 'steps': 82147, 'loss/train': 1.65669584274292} 11/07/2021 08:47:03 - INFO - __main__ - Step 82149: {'lr': 0.00021746666626644358, 'samples': 15772608, 'steps': 82148, 'loss/train': 1.6149030923843384} 11/07/2021 08:47:04 - INFO - __main__ - Step 82150: {'lr': 0.00021746140465531168, 'samples': 15772800, 'steps': 82149, 'loss/train': 0.8530946373939514} 11/07/2021 08:47:05 - INFO - __main__ - Step 82151: {'lr': 0.0002174561430588412, 'samples': 15772992, 'steps': 82150, 'loss/train': 1.5059431791305542} 11/07/2021 08:47:05 - INFO - __main__ - Step 82152: {'lr': 0.00021745088147703457, 'samples': 15773184, 'steps': 82151, 'loss/train': 1.4046889543533325} 11/07/2021 08:47:05 - INFO - __main__ - Step 82153: {'lr': 0.00021744561990989398, 'samples': 15773376, 'steps': 82152, 'loss/train': 1.412557601928711} 11/07/2021 08:47:06 - INFO - __main__ - Step 82154: {'lr': 0.00021744035835742187, 'samples': 15773568, 'steps': 82153, 'loss/train': 1.4406532049179077} 11/07/2021 08:47:07 - INFO - __main__ - Step 82155: {'lr': 0.00021743509681962066, 'samples': 15773760, 'steps': 82154, 'loss/train': 1.2832975387573242} 11/07/2021 08:47:07 - INFO - __main__ - Step 82156: {'lr': 0.00021742983529649264, 'samples': 15773952, 'steps': 82155, 'loss/train': 1.5281085968017578} 11/07/2021 08:47:08 - INFO - __main__ - Step 82157: {'lr': 0.00021742457378804027, 'samples': 15774144, 'steps': 82156, 'loss/train': 0.5109511017799377} 11/07/2021 08:47:08 - INFO - __main__ - Step 82158: {'lr': 0.00021741931229426586, 'samples': 15774336, 'steps': 82157, 'loss/train': 1.1202375888824463} 11/07/2021 08:47:08 - INFO - __main__ - Step 82159: {'lr': 0.00021741405081517184, 'samples': 15774528, 'steps': 82158, 'loss/train': 1.8004937171936035} 11/07/2021 08:47:10 - INFO - __main__ - Step 82160: {'lr': 0.00021740878935076053, 'samples': 15774720, 'steps': 82159, 'loss/train': 1.3087412118911743} 11/07/2021 08:47:10 - INFO - __main__ - Step 82161: {'lr': 0.00021740352790103432, 'samples': 15774912, 'steps': 82160, 'loss/train': 1.2149611711502075} 11/07/2021 08:47:10 - INFO - __main__ - Step 82162: {'lr': 0.00021739826646599558, 'samples': 15775104, 'steps': 82161, 'loss/train': 1.3002281188964844} 11/07/2021 08:47:11 - INFO - __main__ - Step 82163: {'lr': 0.00021739300504564665, 'samples': 15775296, 'steps': 82162, 'loss/train': 1.2519593238830566} 11/07/2021 08:47:11 - INFO - __main__ - Step 82164: {'lr': 0.00021738774363998998, 'samples': 15775488, 'steps': 82163, 'loss/train': 1.1920803785324097} 11/07/2021 08:47:12 - INFO - __main__ - Step 82165: {'lr': 0.00021738248224902783, 'samples': 15775680, 'steps': 82164, 'loss/train': 1.4887646436691284} 11/07/2021 08:47:12 - INFO - __main__ - Step 82166: {'lr': 0.0002173772208727628, 'samples': 15775872, 'steps': 82165, 'loss/train': 0.5746583342552185} 11/07/2021 08:47:13 - INFO - __main__ - Step 82167: {'lr': 0.00021737195951119693, 'samples': 15776064, 'steps': 82166, 'loss/train': 1.3147293329238892} 11/07/2021 08:47:13 - INFO - __main__ - Step 82168: {'lr': 0.00021736669816433278, 'samples': 15776256, 'steps': 82167, 'loss/train': 1.1735742092132568} 11/07/2021 08:47:13 - INFO - __main__ - Step 82169: {'lr': 0.00021736143683217268, 'samples': 15776448, 'steps': 82168, 'loss/train': 1.7227606773376465} 11/07/2021 08:47:14 - INFO - __main__ - Step 82170: {'lr': 0.00021735617551471903, 'samples': 15776640, 'steps': 82169, 'loss/train': 1.498842716217041} 11/07/2021 08:47:15 - INFO - __main__ - Step 82171: {'lr': 0.00021735091421197416, 'samples': 15776832, 'steps': 82170, 'loss/train': 1.402569055557251} 11/07/2021 08:47:15 - INFO - __main__ - Step 82172: {'lr': 0.00021734565292394047, 'samples': 15777024, 'steps': 82171, 'loss/train': 1.4097849130630493} 11/07/2021 08:47:15 - INFO - __main__ - Step 82173: {'lr': 0.00021734039165062033, 'samples': 15777216, 'steps': 82172, 'loss/train': 1.185702919960022} 11/07/2021 08:47:16 - INFO - __main__ - Step 82174: {'lr': 0.00021733513039201612, 'samples': 15777408, 'steps': 82173, 'loss/train': 1.186396837234497} 11/07/2021 08:47:17 - INFO - __main__ - Step 82175: {'lr': 0.0002173298691481302, 'samples': 15777600, 'steps': 82174, 'loss/train': 0.7828895449638367} 11/07/2021 08:47:17 - INFO - __main__ - Step 82176: {'lr': 0.0002173246079189649, 'samples': 15777792, 'steps': 82175, 'loss/train': 1.495869755744934} 11/07/2021 08:47:18 - INFO - __main__ - Step 82177: {'lr': 0.00021731934670452265, 'samples': 15777984, 'steps': 82176, 'loss/train': 1.4489529132843018} 11/07/2021 08:47:18 - INFO - __main__ - Step 82178: {'lr': 0.00021731408550480576, 'samples': 15778176, 'steps': 82177, 'loss/train': 1.3920923471450806} 11/07/2021 08:47:18 - INFO - __main__ - Step 82179: {'lr': 0.0002173088243198168, 'samples': 15778368, 'steps': 82178, 'loss/train': 1.336738109588623} 11/07/2021 08:47:19 - INFO - __main__ - Step 82180: {'lr': 0.00021730356314955785, 'samples': 15778560, 'steps': 82179, 'loss/train': 1.5435596704483032} 11/07/2021 08:47:20 - INFO - __main__ - Step 82181: {'lr': 0.00021729830199403142, 'samples': 15778752, 'steps': 82180, 'loss/train': 0.8988296985626221} 11/07/2021 08:47:20 - INFO - __main__ - Step 82182: {'lr': 0.00021729304085323987, 'samples': 15778944, 'steps': 82181, 'loss/train': 1.501800537109375} 11/07/2021 08:47:20 - INFO - __main__ - Step 82183: {'lr': 0.00021728777972718555, 'samples': 15779136, 'steps': 82182, 'loss/train': 1.3226938247680664} 11/07/2021 08:47:21 - INFO - __main__ - Step 82184: {'lr': 0.00021728251861587085, 'samples': 15779328, 'steps': 82183, 'loss/train': 1.9839762449264526} 11/07/2021 08:47:21 - INFO - __main__ - Step 82185: {'lr': 0.00021727725751929816, 'samples': 15779520, 'steps': 82184, 'loss/train': 1.5928778648376465} 11/07/2021 08:47:22 - INFO - __main__ - Step 82186: {'lr': 0.00021727199643746986, 'samples': 15779712, 'steps': 82185, 'loss/train': 1.5906364917755127} 11/07/2021 08:47:23 - INFO - __main__ - Step 82187: {'lr': 0.00021726673537038826, 'samples': 15779904, 'steps': 82186, 'loss/train': 1.1316500902175903} 11/07/2021 08:47:23 - INFO - __main__ - Step 82188: {'lr': 0.00021726147431805576, 'samples': 15780096, 'steps': 82187, 'loss/train': 1.4459519386291504} 11/07/2021 08:47:23 - INFO - __main__ - Step 82189: {'lr': 0.00021725621328047472, 'samples': 15780288, 'steps': 82188, 'loss/train': 1.523677945137024} 11/07/2021 08:47:24 - INFO - __main__ - Step 82190: {'lr': 0.00021725095225764757, 'samples': 15780480, 'steps': 82189, 'loss/train': 1.6762028932571411} 11/07/2021 08:47:25 - INFO - __main__ - Step 82191: {'lr': 0.0002172456912495766, 'samples': 15780672, 'steps': 82190, 'loss/train': 1.2775845527648926} 11/07/2021 08:47:25 - INFO - __main__ - Step 82192: {'lr': 0.00021724043025626424, 'samples': 15780864, 'steps': 82191, 'loss/train': 1.2838033437728882} 11/07/2021 08:47:26 - INFO - __main__ - Step 82193: {'lr': 0.00021723516927771294, 'samples': 15781056, 'steps': 82192, 'loss/train': 1.4139622449874878} 11/07/2021 08:47:26 - INFO - __main__ - Step 82194: {'lr': 0.00021722990831392485, 'samples': 15781248, 'steps': 82193, 'loss/train': 1.2633918523788452} 11/07/2021 08:47:26 - INFO - __main__ - Step 82195: {'lr': 0.00021722464736490245, 'samples': 15781440, 'steps': 82194, 'loss/train': 1.323365569114685} 11/07/2021 08:47:27 - INFO - __main__ - Step 82196: {'lr': 0.00021721938643064814, 'samples': 15781632, 'steps': 82195, 'loss/train': 1.1404484510421753} 11/07/2021 08:47:28 - INFO - __main__ - Step 82197: {'lr': 0.00021721412551116426, 'samples': 15781824, 'steps': 82196, 'loss/train': 1.5269397497177124} 11/07/2021 08:47:28 - INFO - __main__ - Step 82198: {'lr': 0.00021720886460645318, 'samples': 15782016, 'steps': 82197, 'loss/train': 0.11088129878044128} 11/07/2021 08:47:28 - INFO - __main__ - Step 82199: {'lr': 0.0002172036037165173, 'samples': 15782208, 'steps': 82198, 'loss/train': 0.6195211410522461} 11/07/2021 08:47:29 - INFO - __main__ - Step 82200: {'lr': 0.00021719834284135894, 'samples': 15782400, 'steps': 82199, 'loss/train': 1.6129366159439087} 11/07/2021 08:47:29 - INFO - __main__ - Step 82201: {'lr': 0.00021719308198098054, 'samples': 15782592, 'steps': 82200, 'loss/train': 1.5029542446136475} 11/07/2021 08:47:30 - INFO - __main__ - Step 82202: {'lr': 0.00021718782113538438, 'samples': 15782784, 'steps': 82201, 'loss/train': 1.4563955068588257} 11/07/2021 08:47:31 - INFO - __main__ - Step 82203: {'lr': 0.00021718256030457293, 'samples': 15782976, 'steps': 82202, 'loss/train': 0.06734652817249298} 11/07/2021 08:47:31 - INFO - __main__ - Step 82204: {'lr': 0.00021717729948854847, 'samples': 15783168, 'steps': 82203, 'loss/train': 1.7509551048278809} 11/07/2021 08:47:31 - INFO - __main__ - Step 82205: {'lr': 0.00021717203868731346, 'samples': 15783360, 'steps': 82204, 'loss/train': 1.4647048711776733} 11/07/2021 08:47:32 - INFO - __main__ - Step 82206: {'lr': 0.0002171667779008703, 'samples': 15783552, 'steps': 82205, 'loss/train': 1.4087166786193848} 11/07/2021 08:47:33 - INFO - __main__ - Step 82207: {'lr': 0.00021716151712922118, 'samples': 15783744, 'steps': 82206, 'loss/train': 1.9956691265106201} 11/07/2021 08:47:33 - INFO - __main__ - Step 82208: {'lr': 0.00021715625637236857, 'samples': 15783936, 'steps': 82207, 'loss/train': 1.2597554922103882} 11/07/2021 08:47:34 - INFO - __main__ - Step 82209: {'lr': 0.00021715099563031484, 'samples': 15784128, 'steps': 82208, 'loss/train': 0.9845597743988037} 11/07/2021 08:47:34 - INFO - __main__ - Step 82210: {'lr': 0.0002171457349030624, 'samples': 15784320, 'steps': 82209, 'loss/train': 1.7420628070831299} 11/07/2021 08:47:34 - INFO - __main__ - Step 82211: {'lr': 0.00021714047419061353, 'samples': 15784512, 'steps': 82210, 'loss/train': 1.4935683012008667} 11/07/2021 08:47:35 - INFO - __main__ - Step 82212: {'lr': 0.0002171352134929707, 'samples': 15784704, 'steps': 82211, 'loss/train': 1.5345619916915894} 11/07/2021 08:47:36 - INFO - __main__ - Step 82213: {'lr': 0.0002171299528101362, 'samples': 15784896, 'steps': 82212, 'loss/train': 1.261459469795227} 11/07/2021 08:47:36 - INFO - __main__ - Step 82214: {'lr': 0.00021712469214211244, 'samples': 15785088, 'steps': 82213, 'loss/train': 1.1508798599243164} 11/07/2021 08:47:36 - INFO - __main__ - Step 82215: {'lr': 0.0002171194314889018, 'samples': 15785280, 'steps': 82214, 'loss/train': 1.260228157043457} 11/07/2021 08:47:37 - INFO - __main__ - Step 82216: {'lr': 0.00021711417085050667, 'samples': 15785472, 'steps': 82215, 'loss/train': 1.3424739837646484} 11/07/2021 08:47:38 - INFO - __main__ - Step 82217: {'lr': 0.00021710891022692937, 'samples': 15785664, 'steps': 82216, 'loss/train': 1.3222343921661377} 11/07/2021 08:47:38 - INFO - __main__ - Step 82218: {'lr': 0.0002171036496181723, 'samples': 15785856, 'steps': 82217, 'loss/train': 1.480460286140442} 11/07/2021 08:47:38 - INFO - __main__ - Step 82219: {'lr': 0.00021709838902423778, 'samples': 15786048, 'steps': 82218, 'loss/train': 1.4220476150512695} 11/07/2021 08:47:39 - INFO - __main__ - Step 82220: {'lr': 0.0002170931284451283, 'samples': 15786240, 'steps': 82219, 'loss/train': 1.457039475440979} 11/07/2021 08:47:39 - INFO - __main__ - Step 82221: {'lr': 0.00021708786788084605, 'samples': 15786432, 'steps': 82220, 'loss/train': 1.9708147048950195} 11/07/2021 08:47:40 - INFO - __main__ - Step 82222: {'lr': 0.00021708260733139354, 'samples': 15786624, 'steps': 82221, 'loss/train': 1.3837456703186035} 11/07/2021 08:47:40 - INFO - __main__ - Step 82223: {'lr': 0.00021707734679677308, 'samples': 15786816, 'steps': 82222, 'loss/train': 1.5942310094833374} 11/07/2021 08:47:41 - INFO - __main__ - Step 82224: {'lr': 0.00021707208627698709, 'samples': 15787008, 'steps': 82223, 'loss/train': 0.975177526473999} 11/07/2021 08:47:41 - INFO - __main__ - Step 82225: {'lr': 0.00021706682577203785, 'samples': 15787200, 'steps': 82224, 'loss/train': 0.7181575894355774} 11/07/2021 08:47:41 - INFO - __main__ - Step 82226: {'lr': 0.00021706156528192782, 'samples': 15787392, 'steps': 82225, 'loss/train': 1.5143333673477173} 11/07/2021 08:47:43 - INFO - __main__ - Step 82227: {'lr': 0.00021705630480665935, 'samples': 15787584, 'steps': 82226, 'loss/train': 1.6079506874084473} 11/07/2021 08:47:43 - INFO - __main__ - Step 82228: {'lr': 0.00021705104434623486, 'samples': 15787776, 'steps': 82227, 'loss/train': 1.4526047706604004} 11/07/2021 08:47:43 - INFO - __main__ - Step 82229: {'lr': 0.0002170457839006566, 'samples': 15787968, 'steps': 82228, 'loss/train': 1.5086019039154053} 11/07/2021 08:47:44 - INFO - __main__ - Step 82230: {'lr': 0.000217040523469927, 'samples': 15788160, 'steps': 82229, 'loss/train': 1.904676914215088} 11/07/2021 08:47:44 - INFO - __main__ - Step 82231: {'lr': 0.0002170352630540484, 'samples': 15788352, 'steps': 82230, 'loss/train': 0.9084446430206299} 11/07/2021 08:47:45 - INFO - __main__ - Step 82232: {'lr': 0.00021703000265302326, 'samples': 15788544, 'steps': 82231, 'loss/train': 1.1601076126098633} 11/07/2021 08:47:45 - INFO - __main__ - Step 82233: {'lr': 0.0002170247422668539, 'samples': 15788736, 'steps': 82232, 'loss/train': 1.5283873081207275} 11/07/2021 08:47:46 - INFO - __main__ - Step 82234: {'lr': 0.00021701948189554267, 'samples': 15788928, 'steps': 82233, 'loss/train': 1.2299184799194336} 11/07/2021 08:47:46 - INFO - __main__ - Step 82235: {'lr': 0.0002170142215390919, 'samples': 15789120, 'steps': 82234, 'loss/train': 1.1585843563079834} 11/07/2021 08:47:46 - INFO - __main__ - Step 82236: {'lr': 0.00021700896119750406, 'samples': 15789312, 'steps': 82235, 'loss/train': 1.5350855588912964} 11/07/2021 08:47:47 - INFO - __main__ - Step 82237: {'lr': 0.00021700370087078145, 'samples': 15789504, 'steps': 82236, 'loss/train': 1.4827117919921875} 11/07/2021 08:47:48 - INFO - __main__ - Step 82238: {'lr': 0.00021699844055892646, 'samples': 15789696, 'steps': 82237, 'loss/train': 1.4730280637741089} 11/07/2021 08:47:48 - INFO - __main__ - Step 82239: {'lr': 0.00021699318026194154, 'samples': 15789888, 'steps': 82238, 'loss/train': 1.3059090375900269} 11/07/2021 08:47:49 - INFO - __main__ - Step 82240: {'lr': 0.0002169879199798289, 'samples': 15790080, 'steps': 82239, 'loss/train': 3.2104499340057373} 11/07/2021 08:47:49 - INFO - __main__ - Step 82241: {'lr': 0.000216982659712591, 'samples': 15790272, 'steps': 82240, 'loss/train': 1.0276882648468018} 11/07/2021 08:47:49 - INFO - __main__ - Step 82242: {'lr': 0.0002169773994602302, 'samples': 15790464, 'steps': 82241, 'loss/train': 1.1359614133834839} 11/07/2021 08:47:50 - INFO - __main__ - Step 82243: {'lr': 0.0002169721392227489, 'samples': 15790656, 'steps': 82242, 'loss/train': 1.649759292602539} 11/07/2021 08:47:51 - INFO - __main__ - Step 82244: {'lr': 0.00021696687900014944, 'samples': 15790848, 'steps': 82243, 'loss/train': 1.0976670980453491} 11/07/2021 08:47:51 - INFO - __main__ - Step 82245: {'lr': 0.00021696161879243417, 'samples': 15791040, 'steps': 82244, 'loss/train': 1.3390578031539917} 11/07/2021 08:47:51 - INFO - __main__ - Step 82246: {'lr': 0.0002169563585996055, 'samples': 15791232, 'steps': 82245, 'loss/train': 1.14545476436615} 11/07/2021 08:47:52 - INFO - __main__ - Step 82247: {'lr': 0.0002169510984216658, 'samples': 15791424, 'steps': 82246, 'loss/train': 1.8138208389282227} 11/07/2021 08:47:53 - INFO - __main__ - Step 82248: {'lr': 0.00021694583825861743, 'samples': 15791616, 'steps': 82247, 'loss/train': 1.6466832160949707} 11/07/2021 08:47:53 - INFO - __main__ - Step 82249: {'lr': 0.00021694057811046276, 'samples': 15791808, 'steps': 82248, 'loss/train': 0.8448277711868286} 11/07/2021 08:47:54 - INFO - __main__ - Step 82250: {'lr': 0.00021693531797720416, 'samples': 15792000, 'steps': 82249, 'loss/train': 0.5495770573616028} 11/07/2021 08:47:54 - INFO - __main__ - Step 82251: {'lr': 0.000216930057858844, 'samples': 15792192, 'steps': 82250, 'loss/train': 1.454027533531189} 11/07/2021 08:47:54 - INFO - __main__ - Step 82252: {'lr': 0.0002169247977553846, 'samples': 15792384, 'steps': 82251, 'loss/train': 1.4217313528060913} 11/07/2021 08:47:55 - INFO - __main__ - Step 82253: {'lr': 0.00021691953766682837, 'samples': 15792576, 'steps': 82252, 'loss/train': 1.4297786951065063} 11/07/2021 08:47:56 - INFO - __main__ - Step 82254: {'lr': 0.0002169142775931777, 'samples': 15792768, 'steps': 82253, 'loss/train': 1.293133020401001} 11/07/2021 08:47:56 - INFO - __main__ - Step 82255: {'lr': 0.00021690901753443494, 'samples': 15792960, 'steps': 82254, 'loss/train': 1.2317036390304565} 11/07/2021 08:47:57 - INFO - __main__ - Step 82256: {'lr': 0.00021690375749060248, 'samples': 15793152, 'steps': 82255, 'loss/train': 1.3794862031936646} 11/07/2021 08:47:57 - INFO - __main__ - Step 82257: {'lr': 0.0002168984974616827, 'samples': 15793344, 'steps': 82256, 'loss/train': 1.4925988912582397} 11/07/2021 08:47:57 - INFO - __main__ - Step 82258: {'lr': 0.0002168932374476779, 'samples': 15793536, 'steps': 82257, 'loss/train': 1.7526681423187256} 11/07/2021 08:47:58 - INFO - __main__ - Step 82259: {'lr': 0.00021688797744859052, 'samples': 15793728, 'steps': 82258, 'loss/train': 0.4174860715866089} 11/07/2021 08:47:59 - INFO - __main__ - Step 82260: {'lr': 0.00021688271746442294, 'samples': 15793920, 'steps': 82259, 'loss/train': 1.370161771774292} 11/07/2021 08:47:59 - INFO - __main__ - Step 82261: {'lr': 0.00021687745749517751, 'samples': 15794112, 'steps': 82260, 'loss/train': 1.2876334190368652} 11/07/2021 08:47:59 - INFO - __main__ - Step 82262: {'lr': 0.00021687219754085654, 'samples': 15794304, 'steps': 82261, 'loss/train': 1.4785056114196777} 11/07/2021 08:48:00 - INFO - __main__ - Step 82263: {'lr': 0.00021686693760146245, 'samples': 15794496, 'steps': 82262, 'loss/train': 1.865691900253296} 11/07/2021 08:48:01 - INFO - __main__ - Step 82264: {'lr': 0.0002168616776769976, 'samples': 15794688, 'steps': 82263, 'loss/train': 0.8839965462684631} 11/07/2021 08:48:01 - INFO - __main__ - Step 82265: {'lr': 0.00021685641776746434, 'samples': 15794880, 'steps': 82264, 'loss/train': 1.5362359285354614} 11/07/2021 08:48:01 - INFO - __main__ - Step 82266: {'lr': 0.00021685115787286512, 'samples': 15795072, 'steps': 82265, 'loss/train': 1.7085498571395874} 11/07/2021 08:48:02 - INFO - __main__ - Step 82267: {'lr': 0.0002168458979932022, 'samples': 15795264, 'steps': 82266, 'loss/train': 1.2904337644577026} 11/07/2021 08:48:02 - INFO - __main__ - Step 82268: {'lr': 0.00021684063812847803, 'samples': 15795456, 'steps': 82267, 'loss/train': 1.2648056745529175} 11/07/2021 08:48:03 - INFO - __main__ - Step 82269: {'lr': 0.00021683537827869498, 'samples': 15795648, 'steps': 82268, 'loss/train': 1.5951553583145142} 11/07/2021 08:48:03 - INFO - __main__ - Step 82270: {'lr': 0.00021683011844385536, 'samples': 15795840, 'steps': 82269, 'loss/train': 1.2676150798797607} 11/07/2021 08:48:04 - INFO - __main__ - Step 82271: {'lr': 0.00021682485862396163, 'samples': 15796032, 'steps': 82270, 'loss/train': 1.0480279922485352} 11/07/2021 08:48:04 - INFO - __main__ - Step 82272: {'lr': 0.00021681959881901603, 'samples': 15796224, 'steps': 82271, 'loss/train': 1.3386061191558838} 11/07/2021 08:48:04 - INFO - __main__ - Step 82273: {'lr': 0.00021681433902902118, 'samples': 15796416, 'steps': 82272, 'loss/train': 1.1850166320800781} 11/07/2021 08:48:05 - INFO - __main__ - Step 82274: {'lr': 0.00021680907925397913, 'samples': 15796608, 'steps': 82273, 'loss/train': 0.6932000517845154} 11/07/2021 08:48:06 - INFO - __main__ - Step 82275: {'lr': 0.0002168038194938924, 'samples': 15796800, 'steps': 82274, 'loss/train': 1.4362573623657227} 11/07/2021 08:48:06 - INFO - __main__ - Step 82276: {'lr': 0.00021679855974876338, 'samples': 15796992, 'steps': 82275, 'loss/train': 2.0263617038726807} 11/07/2021 08:48:07 - INFO - __main__ - Step 82277: {'lr': 0.0002167933000185944, 'samples': 15797184, 'steps': 82276, 'loss/train': 0.9115645885467529} 11/07/2021 08:48:07 - INFO - __main__ - Step 82278: {'lr': 0.00021678804030338786, 'samples': 15797376, 'steps': 82277, 'loss/train': 1.1355844736099243} 11/07/2021 08:48:08 - INFO - __main__ - Step 82279: {'lr': 0.0002167827806031461, 'samples': 15797568, 'steps': 82278, 'loss/train': 1.0504260063171387} 11/07/2021 08:48:08 - INFO - __main__ - Step 82280: {'lr': 0.0002167775209178715, 'samples': 15797760, 'steps': 82279, 'loss/train': 1.3916829824447632} 11/07/2021 08:48:09 - INFO - __main__ - Step 82281: {'lr': 0.00021677226124756647, 'samples': 15797952, 'steps': 82280, 'loss/train': 1.2472126483917236} 11/07/2021 08:48:09 - INFO - __main__ - Step 82282: {'lr': 0.0002167670015922333, 'samples': 15798144, 'steps': 82281, 'loss/train': 1.3428550958633423} 11/07/2021 08:48:09 - INFO - __main__ - Step 82283: {'lr': 0.00021676174195187444, 'samples': 15798336, 'steps': 82282, 'loss/train': 1.635020136833191} 11/07/2021 08:48:10 - INFO - __main__ - Step 82284: {'lr': 0.00021675648232649222, 'samples': 15798528, 'steps': 82283, 'loss/train': 1.2527453899383545} 11/07/2021 08:48:11 - INFO - __main__ - Step 82285: {'lr': 0.00021675122271608903, 'samples': 15798720, 'steps': 82284, 'loss/train': 1.714677333831787} 11/07/2021 08:48:11 - INFO - __main__ - Step 82286: {'lr': 0.0002167459631206672, 'samples': 15798912, 'steps': 82285, 'loss/train': 1.2560973167419434} 11/07/2021 08:48:11 - INFO - __main__ - Step 82287: {'lr': 0.00021674070354022926, 'samples': 15799104, 'steps': 82286, 'loss/train': 1.631447434425354} 11/07/2021 08:48:12 - INFO - __main__ - Step 82288: {'lr': 0.00021673544397477732, 'samples': 15799296, 'steps': 82287, 'loss/train': 0.6208655834197998} 11/07/2021 08:48:12 - INFO - __main__ - Step 82289: {'lr': 0.00021673018442431387, 'samples': 15799488, 'steps': 82288, 'loss/train': 0.37389492988586426} 11/07/2021 08:48:13 - INFO - __main__ - Step 82290: {'lr': 0.0002167249248888413, 'samples': 15799680, 'steps': 82289, 'loss/train': 2.0703229904174805} 11/07/2021 08:48:14 - INFO - __main__ - Step 82291: {'lr': 0.00021671966536836195, 'samples': 15799872, 'steps': 82290, 'loss/train': 1.560700535774231} 11/07/2021 08:48:14 - INFO - __main__ - Step 82292: {'lr': 0.00021671440586287823, 'samples': 15800064, 'steps': 82291, 'loss/train': 1.8934600353240967} 11/07/2021 08:48:14 - INFO - __main__ - Step 82293: {'lr': 0.00021670914637239244, 'samples': 15800256, 'steps': 82292, 'loss/train': 1.2434757947921753} 11/07/2021 08:48:15 - INFO - __main__ - Step 82294: {'lr': 0.00021670388689690705, 'samples': 15800448, 'steps': 82293, 'loss/train': 1.611977219581604} 11/07/2021 08:48:16 - INFO - __main__ - Step 82295: {'lr': 0.00021669862743642433, 'samples': 15800640, 'steps': 82294, 'loss/train': 1.0629143714904785} 11/07/2021 08:48:16 - INFO - __main__ - Step 82296: {'lr': 0.00021669336799094672, 'samples': 15800832, 'steps': 82295, 'loss/train': 0.926811933517456} 11/07/2021 08:48:16 - INFO - __main__ - Step 82297: {'lr': 0.00021668810856047654, 'samples': 15801024, 'steps': 82296, 'loss/train': 0.2410055547952652} 11/07/2021 08:48:17 - INFO - __main__ - Step 82298: {'lr': 0.00021668284914501623, 'samples': 15801216, 'steps': 82297, 'loss/train': 1.5740180015563965} 11/07/2021 08:48:17 - INFO - __main__ - Step 82299: {'lr': 0.0002166775897445681, 'samples': 15801408, 'steps': 82298, 'loss/train': 1.952474594116211} 11/07/2021 08:48:18 - INFO - __main__ - Step 82300: {'lr': 0.0002166723303591346, 'samples': 15801600, 'steps': 82299, 'loss/train': 1.298730731010437} 11/07/2021 08:48:19 - INFO - __main__ - Step 82301: {'lr': 0.00021666707098871797, 'samples': 15801792, 'steps': 82300, 'loss/train': 1.7413954734802246} 11/07/2021 08:48:19 - INFO - __main__ - Step 82302: {'lr': 0.0002166618116333206, 'samples': 15801984, 'steps': 82301, 'loss/train': 0.8319569230079651} 11/07/2021 08:48:19 - INFO - __main__ - Step 82303: {'lr': 0.00021665655229294496, 'samples': 15802176, 'steps': 82302, 'loss/train': 1.2617549896240234} 11/07/2021 08:48:20 - INFO - __main__ - Step 82304: {'lr': 0.00021665129296759335, 'samples': 15802368, 'steps': 82303, 'loss/train': 1.061643362045288} 11/07/2021 08:48:20 - INFO - __main__ - Step 82305: {'lr': 0.0002166460336572681, 'samples': 15802560, 'steps': 82304, 'loss/train': 1.5633600950241089} 11/07/2021 08:48:21 - INFO - __main__ - Step 82306: {'lr': 0.0002166407743619717, 'samples': 15802752, 'steps': 82305, 'loss/train': 1.6482905149459839} 11/07/2021 08:48:21 - INFO - __main__ - Step 82307: {'lr': 0.0002166355150817064, 'samples': 15802944, 'steps': 82306, 'loss/train': 1.2967463731765747} 11/07/2021 08:48:22 - INFO - __main__ - Step 82308: {'lr': 0.00021663025581647463, 'samples': 15803136, 'steps': 82307, 'loss/train': 1.8735558986663818} 11/07/2021 08:48:22 - INFO - __main__ - Step 82309: {'lr': 0.00021662499656627878, 'samples': 15803328, 'steps': 82308, 'loss/train': 1.6957284212112427} 11/07/2021 08:48:22 - INFO - __main__ - Step 82310: {'lr': 0.00021661973733112116, 'samples': 15803520, 'steps': 82309, 'loss/train': 0.6996250748634338} 11/07/2021 08:48:23 - INFO - __main__ - Step 82311: {'lr': 0.00021661447811100422, 'samples': 15803712, 'steps': 82310, 'loss/train': 1.415596842765808} 11/07/2021 08:48:24 - INFO - __main__ - Step 82312: {'lr': 0.0002166092189059302, 'samples': 15803904, 'steps': 82311, 'loss/train': 1.6804182529449463} 11/07/2021 08:48:24 - INFO - __main__ - Step 82313: {'lr': 0.00021660395971590164, 'samples': 15804096, 'steps': 82312, 'loss/train': 1.4952932596206665} 11/07/2021 08:48:24 - INFO - __main__ - Step 82314: {'lr': 0.00021659870054092087, 'samples': 15804288, 'steps': 82313, 'loss/train': 1.2700345516204834} 11/07/2021 08:48:25 - INFO - __main__ - Step 82315: {'lr': 0.00021659344138099014, 'samples': 15804480, 'steps': 82314, 'loss/train': 1.3716893196105957} 11/07/2021 08:48:26 - INFO - __main__ - Step 82316: {'lr': 0.00021658818223611184, 'samples': 15804672, 'steps': 82315, 'loss/train': 1.3816914558410645} 11/07/2021 08:48:26 - INFO - __main__ - Step 82317: {'lr': 0.00021658292310628842, 'samples': 15804864, 'steps': 82316, 'loss/train': 1.7439775466918945} 11/07/2021 08:48:26 - INFO - __main__ - Step 82318: {'lr': 0.00021657766399152224, 'samples': 15805056, 'steps': 82317, 'loss/train': 1.154612421989441} 11/07/2021 08:48:27 - INFO - __main__ - Step 82319: {'lr': 0.00021657240489181563, 'samples': 15805248, 'steps': 82318, 'loss/train': 1.2341985702514648} 11/07/2021 08:48:27 - INFO - __main__ - Step 82320: {'lr': 0.00021656714580717097, 'samples': 15805440, 'steps': 82319, 'loss/train': 1.3369379043579102} 11/07/2021 08:48:28 - INFO - __main__ - Step 82321: {'lr': 0.00021656188673759065, 'samples': 15805632, 'steps': 82320, 'loss/train': 1.2800626754760742} 11/07/2021 08:48:29 - INFO - __main__ - Step 82322: {'lr': 0.00021655662768307703, 'samples': 15805824, 'steps': 82321, 'loss/train': 1.3746644258499146} 11/07/2021 08:48:29 - INFO - __main__ - Step 82323: {'lr': 0.00021655136864363246, 'samples': 15806016, 'steps': 82322, 'loss/train': 1.523117184638977} 11/07/2021 08:48:29 - INFO - __main__ - Step 82324: {'lr': 0.00021654610961925933, 'samples': 15806208, 'steps': 82323, 'loss/train': 0.25915244221687317} 11/07/2021 08:48:30 - INFO - __main__ - Step 82325: {'lr': 0.00021654085060996, 'samples': 15806400, 'steps': 82324, 'loss/train': 4.664159774780273} 11/07/2021 08:48:31 - INFO - __main__ - Step 82326: {'lr': 0.00021653559161573688, 'samples': 15806592, 'steps': 82325, 'loss/train': 1.8274937868118286} 11/07/2021 08:48:31 - INFO - __main__ - Step 82327: {'lr': 0.00021653033263659239, 'samples': 15806784, 'steps': 82326, 'loss/train': 1.6945345401763916} 11/07/2021 08:48:31 - INFO - __main__ - Step 82328: {'lr': 0.0002165250736725287, 'samples': 15806976, 'steps': 82327, 'loss/train': 1.3112517595291138} 11/07/2021 08:48:32 - INFO - __main__ - Step 82329: {'lr': 0.00021651981472354832, 'samples': 15807168, 'steps': 82328, 'loss/train': 0.8511509895324707} 11/07/2021 08:48:32 - INFO - __main__ - Step 82330: {'lr': 0.00021651455578965357, 'samples': 15807360, 'steps': 82329, 'loss/train': 1.879684567451477} 11/07/2021 08:48:33 - INFO - __main__ - Step 82331: {'lr': 0.00021650929687084687, 'samples': 15807552, 'steps': 82330, 'loss/train': 1.4752750396728516} 11/07/2021 08:48:33 - INFO - __main__ - Step 82332: {'lr': 0.00021650403796713054, 'samples': 15807744, 'steps': 82331, 'loss/train': 1.5846912860870361} 11/07/2021 08:48:34 - INFO - __main__ - Step 82333: {'lr': 0.00021649877907850697, 'samples': 15807936, 'steps': 82332, 'loss/train': 2.196727752685547} 11/07/2021 08:48:34 - INFO - __main__ - Step 82334: {'lr': 0.00021649352020497857, 'samples': 15808128, 'steps': 82333, 'loss/train': 1.4083918333053589} 11/07/2021 08:48:35 - INFO - __main__ - Step 82335: {'lr': 0.00021648826134654765, 'samples': 15808320, 'steps': 82334, 'loss/train': 1.5011447668075562} 11/07/2021 08:48:35 - INFO - __main__ - Step 82336: {'lr': 0.00021648300250321658, 'samples': 15808512, 'steps': 82335, 'loss/train': 1.6811764240264893} 11/07/2021 08:48:36 - INFO - __main__ - Step 82337: {'lr': 0.0002164777436749878, 'samples': 15808704, 'steps': 82336, 'loss/train': 1.2723349332809448} 11/07/2021 08:48:36 - INFO - __main__ - Step 82338: {'lr': 0.0002164724848618636, 'samples': 15808896, 'steps': 82337, 'loss/train': 1.223347783088684} 11/07/2021 08:48:37 - INFO - __main__ - Step 82339: {'lr': 0.00021646722606384638, 'samples': 15809088, 'steps': 82338, 'loss/train': 1.4610787630081177} 11/07/2021 08:48:37 - INFO - __main__ - Step 82340: {'lr': 0.00021646196728093852, 'samples': 15809280, 'steps': 82339, 'loss/train': 0.8330068588256836} 11/07/2021 08:48:37 - INFO - __main__ - Step 82341: {'lr': 0.00021645670851314249, 'samples': 15809472, 'steps': 82340, 'loss/train': 1.4683737754821777} 11/07/2021 08:48:38 - INFO - __main__ - Step 82342: {'lr': 0.00021645144976046045, 'samples': 15809664, 'steps': 82341, 'loss/train': 1.470578908920288} 11/07/2021 08:48:39 - INFO - __main__ - Step 82343: {'lr': 0.00021644619102289484, 'samples': 15809856, 'steps': 82342, 'loss/train': 1.5089973211288452} 11/07/2021 08:48:39 - INFO - __main__ - Step 82344: {'lr': 0.00021644093230044806, 'samples': 15810048, 'steps': 82343, 'loss/train': 1.6551679372787476} 11/07/2021 08:48:39 - INFO - __main__ - Step 82345: {'lr': 0.0002164356735931225, 'samples': 15810240, 'steps': 82344, 'loss/train': 0.7161324620246887} 11/07/2021 08:48:40 - INFO - __main__ - Step 82346: {'lr': 0.0002164304149009205, 'samples': 15810432, 'steps': 82345, 'loss/train': 1.5563243627548218} 11/07/2021 08:48:41 - INFO - __main__ - Step 82347: {'lr': 0.00021642515622384442, 'samples': 15810624, 'steps': 82346, 'loss/train': 1.2438005208969116} 11/07/2021 08:48:41 - INFO - __main__ - Step 82348: {'lr': 0.00021641989756189666, 'samples': 15810816, 'steps': 82347, 'loss/train': 1.9541091918945312} 11/07/2021 08:48:41 - INFO - __main__ - Step 82349: {'lr': 0.0002164146389150796, 'samples': 15811008, 'steps': 82348, 'loss/train': 1.2063138484954834} 11/07/2021 08:48:42 - INFO - __main__ - Step 82350: {'lr': 0.00021640938028339557, 'samples': 15811200, 'steps': 82349, 'loss/train': 1.6165790557861328} 11/07/2021 08:48:42 - INFO - __main__ - Step 82351: {'lr': 0.00021640412166684694, 'samples': 15811392, 'steps': 82350, 'loss/train': 1.4361768960952759} 11/07/2021 08:48:43 - INFO - __main__ - Step 82352: {'lr': 0.00021639886306543615, 'samples': 15811584, 'steps': 82351, 'loss/train': 1.0592172145843506} 11/07/2021 08:48:43 - INFO - __main__ - Step 82353: {'lr': 0.00021639360447916548, 'samples': 15811776, 'steps': 82352, 'loss/train': 1.8283718824386597} 11/07/2021 08:48:44 - INFO - __main__ - Step 82354: {'lr': 0.00021638834590803738, 'samples': 15811968, 'steps': 82353, 'loss/train': 1.3534836769104004} 11/07/2021 08:48:44 - INFO - __main__ - Step 82355: {'lr': 0.00021638308735205412, 'samples': 15812160, 'steps': 82354, 'loss/train': 1.2997920513153076} 11/07/2021 08:48:45 - INFO - __main__ - Step 82356: {'lr': 0.00021637782881121808, 'samples': 15812352, 'steps': 82355, 'loss/train': 1.7433862686157227} 11/07/2021 08:48:46 - INFO - __main__ - Step 82357: {'lr': 0.00021637257028553174, 'samples': 15812544, 'steps': 82356, 'loss/train': 1.5682525634765625} 11/07/2021 08:48:46 - INFO - __main__ - Step 82358: {'lr': 0.00021636731177499736, 'samples': 15812736, 'steps': 82357, 'loss/train': 1.8070957660675049} 11/07/2021 08:48:47 - INFO - __main__ - Step 82359: {'lr': 0.00021636205327961737, 'samples': 15812928, 'steps': 82358, 'loss/train': 1.4294458627700806} 11/07/2021 08:48:47 - INFO - __main__ - Step 82360: {'lr': 0.0002163567947993941, 'samples': 15813120, 'steps': 82359, 'loss/train': 1.3873543739318848} 11/07/2021 08:48:47 - INFO - __main__ - Step 82361: {'lr': 0.00021635153633432994, 'samples': 15813312, 'steps': 82360, 'loss/train': 1.5948606729507446} 11/07/2021 08:48:48 - INFO - __main__ - Step 82362: {'lr': 0.00021634627788442732, 'samples': 15813504, 'steps': 82361, 'loss/train': 1.0727235078811646} 11/07/2021 08:48:49 - INFO - __main__ - Step 82363: {'lr': 0.0002163410194496885, 'samples': 15813696, 'steps': 82362, 'loss/train': 0.12548650801181793} 11/07/2021 08:48:49 - INFO - __main__ - Step 82364: {'lr': 0.0002163357610301159, 'samples': 15813888, 'steps': 82363, 'loss/train': 1.3155778646469116} 11/07/2021 08:48:49 - INFO - __main__ - Step 82365: {'lr': 0.00021633050262571187, 'samples': 15814080, 'steps': 82364, 'loss/train': 2.716172456741333} 11/07/2021 08:48:50 - INFO - __main__ - Step 82366: {'lr': 0.0002163252442364788, 'samples': 15814272, 'steps': 82365, 'loss/train': 1.1650230884552002} 11/07/2021 08:48:50 - INFO - __main__ - Step 82367: {'lr': 0.00021631998586241904, 'samples': 15814464, 'steps': 82366, 'loss/train': 1.1708108186721802} 11/07/2021 08:48:51 - INFO - __main__ - Step 82368: {'lr': 0.00021631472750353506, 'samples': 15814656, 'steps': 82367, 'loss/train': 0.9122490882873535} 11/07/2021 08:48:52 - INFO - __main__ - Step 82369: {'lr': 0.00021630946915982907, 'samples': 15814848, 'steps': 82368, 'loss/train': 1.9276838302612305} 11/07/2021 08:48:52 - INFO - __main__ - Step 82370: {'lr': 0.00021630421083130351, 'samples': 15815040, 'steps': 82369, 'loss/train': 1.5654551982879639} 11/07/2021 08:48:52 - INFO - __main__ - Step 82371: {'lr': 0.00021629895251796077, 'samples': 15815232, 'steps': 82370, 'loss/train': 1.6708747148513794} 11/07/2021 08:48:53 - INFO - __main__ - Step 82372: {'lr': 0.00021629369421980322, 'samples': 15815424, 'steps': 82371, 'loss/train': 1.203360915184021} 11/07/2021 08:48:54 - INFO - __main__ - Step 82373: {'lr': 0.00021628843593683324, 'samples': 15815616, 'steps': 82372, 'loss/train': 1.349286675453186} 11/07/2021 08:48:54 - INFO - __main__ - Step 82374: {'lr': 0.0002162831776690531, 'samples': 15815808, 'steps': 82373, 'loss/train': 1.3758751153945923} 11/07/2021 08:48:54 - INFO - __main__ - Step 82375: {'lr': 0.00021627791941646526, 'samples': 15816000, 'steps': 82374, 'loss/train': 1.8710232973098755} 11/07/2021 08:48:55 - INFO - __main__ - Step 82376: {'lr': 0.00021627266117907207, 'samples': 15816192, 'steps': 82375, 'loss/train': 1.0988789796829224} 11/07/2021 08:48:55 - INFO - __main__ - Step 82377: {'lr': 0.0002162674029568759, 'samples': 15816384, 'steps': 82376, 'loss/train': 0.8648630380630493} 11/07/2021 08:48:55 - INFO - __main__ - Step 82378: {'lr': 0.0002162621447498791, 'samples': 15816576, 'steps': 82377, 'loss/train': 1.571662425994873} 11/07/2021 08:48:57 - INFO - __main__ - Step 82379: {'lr': 0.00021625688655808406, 'samples': 15816768, 'steps': 82378, 'loss/train': 1.392462968826294} 11/07/2021 08:48:57 - INFO - __main__ - Step 82380: {'lr': 0.00021625162838149317, 'samples': 15816960, 'steps': 82379, 'loss/train': 1.2882369756698608} 11/07/2021 08:48:57 - INFO - __main__ - Step 82381: {'lr': 0.00021624637022010882, 'samples': 15817152, 'steps': 82380, 'loss/train': 1.2051262855529785} 11/07/2021 08:48:58 - INFO - __main__ - Step 82382: {'lr': 0.00021624111207393327, 'samples': 15817344, 'steps': 82381, 'loss/train': 1.3293401002883911} 11/07/2021 08:48:58 - INFO - __main__ - Step 82383: {'lr': 0.00021623585394296897, 'samples': 15817536, 'steps': 82382, 'loss/train': 0.8947815299034119} 11/07/2021 08:48:59 - INFO - __main__ - Step 82384: {'lr': 0.00021623059582721833, 'samples': 15817728, 'steps': 82383, 'loss/train': 2.022454261779785} 11/07/2021 08:48:59 - INFO - __main__ - Step 82385: {'lr': 0.00021622533772668357, 'samples': 15817920, 'steps': 82384, 'loss/train': 1.5650677680969238} 11/07/2021 08:49:00 - INFO - __main__ - Step 82386: {'lr': 0.0002162200796413672, 'samples': 15818112, 'steps': 82385, 'loss/train': 0.8192574381828308} 11/07/2021 08:49:00 - INFO - __main__ - Step 82387: {'lr': 0.00021621482157127152, 'samples': 15818304, 'steps': 82386, 'loss/train': 1.7111178636550903} 11/07/2021 08:49:00 - INFO - __main__ - Step 82388: {'lr': 0.00021620956351639888, 'samples': 15818496, 'steps': 82387, 'loss/train': 1.0632648468017578} 11/07/2021 08:49:02 - INFO - __main__ - Step 82389: {'lr': 0.00021620430547675173, 'samples': 15818688, 'steps': 82388, 'loss/train': 1.7630128860473633} 11/07/2021 08:49:02 - INFO - __main__ - Step 82390: {'lr': 0.0002161990474523324, 'samples': 15818880, 'steps': 82389, 'loss/train': 1.8392200469970703} 11/07/2021 08:49:02 - INFO - __main__ - Step 82391: {'lr': 0.00021619378944314328, 'samples': 15819072, 'steps': 82390, 'loss/train': 0.6502275466918945} 11/07/2021 08:49:03 - INFO - __main__ - Step 82392: {'lr': 0.00021618853144918668, 'samples': 15819264, 'steps': 82391, 'loss/train': 0.6742346882820129} 11/07/2021 08:49:03 - INFO - __main__ - Step 82393: {'lr': 0.00021618327347046502, 'samples': 15819456, 'steps': 82392, 'loss/train': 0.9320583343505859} 11/07/2021 08:49:04 - INFO - __main__ - Step 82394: {'lr': 0.00021617801550698068, 'samples': 15819648, 'steps': 82393, 'loss/train': 1.1314960718154907} 11/07/2021 08:49:04 - INFO - __main__ - Step 82395: {'lr': 0.00021617275755873605, 'samples': 15819840, 'steps': 82394, 'loss/train': 1.388449788093567} 11/07/2021 08:49:05 - INFO - __main__ - Step 82396: {'lr': 0.00021616749962573338, 'samples': 15820032, 'steps': 82395, 'loss/train': 1.9981833696365356} 11/07/2021 08:49:05 - INFO - __main__ - Step 82397: {'lr': 0.0002161622417079751, 'samples': 15820224, 'steps': 82396, 'loss/train': 1.2091734409332275} 11/07/2021 08:49:05 - INFO - __main__ - Step 82398: {'lr': 0.00021615698380546362, 'samples': 15820416, 'steps': 82397, 'loss/train': 1.339726448059082} 11/07/2021 08:49:06 - INFO - __main__ - Step 82399: {'lr': 0.00021615172591820127, 'samples': 15820608, 'steps': 82398, 'loss/train': 1.437271237373352} 11/07/2021 08:49:07 - INFO - __main__ - Step 82400: {'lr': 0.0002161464680461904, 'samples': 15820800, 'steps': 82399, 'loss/train': 0.983259379863739} 11/07/2021 08:49:07 - INFO - __main__ - Step 82401: {'lr': 0.00021614121018943345, 'samples': 15820992, 'steps': 82400, 'loss/train': 1.5186721086502075} 11/07/2021 08:49:08 - INFO - __main__ - Step 82402: {'lr': 0.0002161359523479327, 'samples': 15821184, 'steps': 82401, 'loss/train': 1.4043078422546387} 11/07/2021 08:49:08 - INFO - __main__ - Step 82403: {'lr': 0.00021613069452169063, 'samples': 15821376, 'steps': 82402, 'loss/train': 2.0134763717651367} 11/07/2021 08:49:09 - INFO - __main__ - Step 82404: {'lr': 0.0002161254367107095, 'samples': 15821568, 'steps': 82403, 'loss/train': 0.7177676558494568} 11/07/2021 08:49:10 - INFO - __main__ - Step 82405: {'lr': 0.00021612017891499175, 'samples': 15821760, 'steps': 82404, 'loss/train': 1.3408764600753784} 11/07/2021 08:49:10 - INFO - __main__ - Step 82406: {'lr': 0.0002161149211345397, 'samples': 15821952, 'steps': 82405, 'loss/train': 0.6415361762046814} 11/07/2021 08:49:10 - INFO - __main__ - Step 82407: {'lr': 0.00021610966336935579, 'samples': 15822144, 'steps': 82406, 'loss/train': 1.406456470489502} 11/07/2021 08:49:11 - INFO - __main__ - Step 82408: {'lr': 0.0002161044056194424, 'samples': 15822336, 'steps': 82407, 'loss/train': 1.1649730205535889} 11/07/2021 08:49:11 - INFO - __main__ - Step 82409: {'lr': 0.00021609914788480177, 'samples': 15822528, 'steps': 82408, 'loss/train': 2.1085479259490967} 11/07/2021 08:49:11 - INFO - __main__ - Step 82410: {'lr': 0.00021609389016543628, 'samples': 15822720, 'steps': 82409, 'loss/train': 1.2011067867279053} 11/07/2021 08:49:12 - INFO - __main__ - Step 82411: {'lr': 0.00021608863246134845, 'samples': 15822912, 'steps': 82410, 'loss/train': 5.769225120544434} 11/07/2021 08:49:13 - INFO - __main__ - Step 82412: {'lr': 0.00021608337477254047, 'samples': 15823104, 'steps': 82411, 'loss/train': 1.502665638923645} 11/07/2021 08:49:13 - INFO - __main__ - Step 82413: {'lr': 0.00021607811709901487, 'samples': 15823296, 'steps': 82412, 'loss/train': 1.2005491256713867} 11/07/2021 08:49:13 - INFO - __main__ - Step 82414: {'lr': 0.00021607285944077393, 'samples': 15823488, 'steps': 82413, 'loss/train': 1.2938534021377563} 11/07/2021 08:49:14 - INFO - __main__ - Step 82415: {'lr': 0.00021606760179782, 'samples': 15823680, 'steps': 82414, 'loss/train': 1.2972111701965332} 11/07/2021 08:49:15 - INFO - __main__ - Step 82416: {'lr': 0.00021606234417015553, 'samples': 15823872, 'steps': 82415, 'loss/train': 1.1712924242019653} 11/07/2021 08:49:15 - INFO - __main__ - Step 82417: {'lr': 0.0002160570865577828, 'samples': 15824064, 'steps': 82416, 'loss/train': 1.52073073387146} 11/07/2021 08:49:16 - INFO - __main__ - Step 82418: {'lr': 0.00021605182896070423, 'samples': 15824256, 'steps': 82417, 'loss/train': 1.749821662902832} 11/07/2021 08:49:16 - INFO - __main__ - Step 82419: {'lr': 0.00021604657137892221, 'samples': 15824448, 'steps': 82418, 'loss/train': 0.6590684652328491} 11/07/2021 08:49:16 - INFO - __main__ - Step 82420: {'lr': 0.00021604131381243907, 'samples': 15824640, 'steps': 82419, 'loss/train': 1.7954163551330566} 11/07/2021 08:49:17 - INFO - __main__ - Step 82421: {'lr': 0.0002160360562612573, 'samples': 15824832, 'steps': 82420, 'loss/train': 1.2560361623764038} 11/07/2021 08:49:18 - INFO - __main__ - Step 82422: {'lr': 0.00021603079872537905, 'samples': 15825024, 'steps': 82421, 'loss/train': 1.4011738300323486} 11/07/2021 08:49:18 - INFO - __main__ - Step 82423: {'lr': 0.0002160255412048068, 'samples': 15825216, 'steps': 82422, 'loss/train': 1.1810200214385986} 11/07/2021 08:49:18 - INFO - __main__ - Step 82424: {'lr': 0.0002160202836995429, 'samples': 15825408, 'steps': 82423, 'loss/train': 0.6438449025154114} 11/07/2021 08:49:19 - INFO - __main__ - Step 82425: {'lr': 0.00021601502620958977, 'samples': 15825600, 'steps': 82424, 'loss/train': 1.3219455480575562} 11/07/2021 08:49:19 - INFO - __main__ - Step 82426: {'lr': 0.00021600976873494972, 'samples': 15825792, 'steps': 82425, 'loss/train': 0.8861755132675171} 11/07/2021 08:49:20 - INFO - __main__ - Step 82427: {'lr': 0.00021600451127562514, 'samples': 15825984, 'steps': 82426, 'loss/train': 1.0410735607147217} 11/07/2021 08:49:21 - INFO - __main__ - Step 82428: {'lr': 0.0002159992538316184, 'samples': 15826176, 'steps': 82427, 'loss/train': 1.3977434635162354} 11/07/2021 08:49:21 - INFO - __main__ - Step 82429: {'lr': 0.0002159939964029319, 'samples': 15826368, 'steps': 82428, 'loss/train': 1.4722278118133545} 11/07/2021 08:49:21 - INFO - __main__ - Step 82430: {'lr': 0.00021598873898956794, 'samples': 15826560, 'steps': 82429, 'loss/train': 1.339455485343933} 11/07/2021 08:49:22 - INFO - __main__ - Step 82431: {'lr': 0.00021598348159152897, 'samples': 15826752, 'steps': 82430, 'loss/train': 0.1790490448474884} 11/07/2021 08:49:23 - INFO - __main__ - Step 82432: {'lr': 0.0002159782242088173, 'samples': 15826944, 'steps': 82431, 'loss/train': 1.1209291219711304} 11/07/2021 08:49:23 - INFO - __main__ - Step 82433: {'lr': 0.0002159729668414353, 'samples': 15827136, 'steps': 82432, 'loss/train': 1.3025895357131958} 11/07/2021 08:49:23 - INFO - __main__ - Step 82434: {'lr': 0.0002159677094893854, 'samples': 15827328, 'steps': 82433, 'loss/train': 1.2974982261657715} 11/07/2021 08:49:24 - INFO - __main__ - Step 82435: {'lr': 0.00021596245215267, 'samples': 15827520, 'steps': 82434, 'loss/train': 1.4562209844589233} 11/07/2021 08:49:24 - INFO - __main__ - Step 82436: {'lr': 0.00021595719483129128, 'samples': 15827712, 'steps': 82435, 'loss/train': 1.3657569885253906} 11/07/2021 08:49:24 - INFO - __main__ - Step 82437: {'lr': 0.00021595193752525175, 'samples': 15827904, 'steps': 82436, 'loss/train': 1.6930005550384521} 11/07/2021 08:49:26 - INFO - __main__ - Step 82438: {'lr': 0.00021594668023455373, 'samples': 15828096, 'steps': 82437, 'loss/train': 1.1010037660598755} 11/07/2021 08:49:26 - INFO - __main__ - Step 82439: {'lr': 0.0002159414229591996, 'samples': 15828288, 'steps': 82438, 'loss/train': 1.505906343460083} 11/07/2021 08:49:26 - INFO - __main__ - Step 82440: {'lr': 0.00021593616569919177, 'samples': 15828480, 'steps': 82439, 'loss/train': 1.3766446113586426} 11/07/2021 08:49:27 - INFO - __main__ - Step 82441: {'lr': 0.00021593090845453255, 'samples': 15828672, 'steps': 82440, 'loss/train': 1.2599527835845947} 11/07/2021 08:49:27 - INFO - __main__ - Step 82442: {'lr': 0.00021592565122522436, 'samples': 15828864, 'steps': 82441, 'loss/train': 1.009476900100708} 11/07/2021 08:49:28 - INFO - __main__ - Step 82443: {'lr': 0.00021592039401126953, 'samples': 15829056, 'steps': 82442, 'loss/train': 1.1314730644226074} 11/07/2021 08:49:28 - INFO - __main__ - Step 82444: {'lr': 0.00021591513681267044, 'samples': 15829248, 'steps': 82443, 'loss/train': 1.627152681350708} 11/07/2021 08:49:29 - INFO - __main__ - Step 82445: {'lr': 0.00021590987962942949, 'samples': 15829440, 'steps': 82444, 'loss/train': 1.7044917345046997} 11/07/2021 08:49:29 - INFO - __main__ - Step 82446: {'lr': 0.00021590462246154902, 'samples': 15829632, 'steps': 82445, 'loss/train': 1.4969298839569092} 11/07/2021 08:49:30 - INFO - __main__ - Step 82447: {'lr': 0.00021589936530903137, 'samples': 15829824, 'steps': 82446, 'loss/train': 1.5658384561538696} 11/07/2021 08:49:31 - INFO - __main__ - Step 82448: {'lr': 0.00021589410817187906, 'samples': 15830016, 'steps': 82447, 'loss/train': 0.9874282479286194} 11/07/2021 08:49:31 - INFO - __main__ - Step 82449: {'lr': 0.00021588885105009427, 'samples': 15830208, 'steps': 82448, 'loss/train': 1.373499870300293} 11/07/2021 08:49:31 - INFO - __main__ - Step 82450: {'lr': 0.00021588359394367936, 'samples': 15830400, 'steps': 82449, 'loss/train': 1.5265902280807495} 11/07/2021 08:49:32 - INFO - __main__ - Step 82451: {'lr': 0.00021587833685263684, 'samples': 15830592, 'steps': 82450, 'loss/train': 1.5708879232406616} 11/07/2021 08:49:32 - INFO - __main__ - Step 82452: {'lr': 0.000215873079776969, 'samples': 15830784, 'steps': 82451, 'loss/train': 1.3391411304473877} 11/07/2021 08:49:33 - INFO - __main__ - Step 82453: {'lr': 0.00021586782271667822, 'samples': 15830976, 'steps': 82452, 'loss/train': 1.1047168970108032} 11/07/2021 08:49:33 - INFO - __main__ - Step 82454: {'lr': 0.00021586256567176688, 'samples': 15831168, 'steps': 82453, 'loss/train': 1.7720516920089722} 11/07/2021 08:49:34 - INFO - __main__ - Step 82455: {'lr': 0.00021585730864223733, 'samples': 15831360, 'steps': 82454, 'loss/train': 1.5368720293045044} 11/07/2021 08:49:34 - INFO - __main__ - Step 82456: {'lr': 0.00021585205162809193, 'samples': 15831552, 'steps': 82455, 'loss/train': 1.0519119501113892} 11/07/2021 08:49:34 - INFO - __main__ - Step 82457: {'lr': 0.0002158467946293331, 'samples': 15831744, 'steps': 82456, 'loss/train': 1.6409900188446045} 11/07/2021 08:49:35 - INFO - __main__ - Step 82458: {'lr': 0.00021584153764596316, 'samples': 15831936, 'steps': 82457, 'loss/train': 1.6133962869644165} 11/07/2021 08:49:36 - INFO - __main__ - Step 82459: {'lr': 0.0002158362806779845, 'samples': 15832128, 'steps': 82458, 'loss/train': 1.4809589385986328} 11/07/2021 08:49:36 - INFO - __main__ - Step 82460: {'lr': 0.00021583102372539948, 'samples': 15832320, 'steps': 82459, 'loss/train': 1.7810782194137573} 11/07/2021 08:49:36 - INFO - __main__ - Step 82461: {'lr': 0.00021582576678821048, 'samples': 15832512, 'steps': 82460, 'loss/train': 1.3781596422195435} 11/07/2021 08:49:37 - INFO - __main__ - Step 82462: {'lr': 0.00021582050986642, 'samples': 15832704, 'steps': 82461, 'loss/train': 1.362908959388733} 11/07/2021 08:49:37 - INFO - __main__ - Step 82463: {'lr': 0.00021581525296003013, 'samples': 15832896, 'steps': 82462, 'loss/train': 1.5613840818405151} 11/07/2021 08:49:38 - INFO - __main__ - Step 82464: {'lr': 0.00021580999606904337, 'samples': 15833088, 'steps': 82463, 'loss/train': 1.3650645017623901} 11/07/2021 08:49:39 - INFO - __main__ - Step 82465: {'lr': 0.0002158047391934621, 'samples': 15833280, 'steps': 82464, 'loss/train': 0.7001506686210632} 11/07/2021 08:49:39 - INFO - __main__ - Step 82466: {'lr': 0.00021579948233328873, 'samples': 15833472, 'steps': 82465, 'loss/train': 1.302310824394226} 11/07/2021 08:49:39 - INFO - __main__ - Step 82467: {'lr': 0.00021579422548852553, 'samples': 15833664, 'steps': 82466, 'loss/train': 1.58114755153656} 11/07/2021 08:49:40 - INFO - __main__ - Step 82468: {'lr': 0.00021578896865917497, 'samples': 15833856, 'steps': 82467, 'loss/train': 1.5659801959991455} 11/07/2021 08:49:41 - INFO - __main__ - Step 82469: {'lr': 0.00021578371184523935, 'samples': 15834048, 'steps': 82468, 'loss/train': 1.8371319770812988} 11/07/2021 08:49:41 - INFO - __main__ - Step 82470: {'lr': 0.00021577845504672105, 'samples': 15834240, 'steps': 82469, 'loss/train': 1.5195611715316772} 11/07/2021 08:49:41 - INFO - __main__ - Step 82471: {'lr': 0.00021577319826362245, 'samples': 15834432, 'steps': 82470, 'loss/train': 1.2625882625579834} 11/07/2021 08:49:42 - INFO - __main__ - Step 82472: {'lr': 0.00021576794149594594, 'samples': 15834624, 'steps': 82471, 'loss/train': 1.675347089767456} 11/07/2021 08:49:42 - INFO - __main__ - Step 82473: {'lr': 0.00021576268474369386, 'samples': 15834816, 'steps': 82472, 'loss/train': 1.7494972944259644} 11/07/2021 08:49:43 - INFO - __main__ - Step 82474: {'lr': 0.0002157574280068686, 'samples': 15835008, 'steps': 82473, 'loss/train': 2.2723207473754883} 11/07/2021 08:49:43 - INFO - __main__ - Step 82475: {'lr': 0.00021575217128547258, 'samples': 15835200, 'steps': 82474, 'loss/train': 1.1240862607955933} 11/07/2021 08:49:44 - INFO - __main__ - Step 82476: {'lr': 0.00021574691457950805, 'samples': 15835392, 'steps': 82475, 'loss/train': 1.2762317657470703} 11/07/2021 08:49:44 - INFO - __main__ - Step 82477: {'lr': 0.0002157416578889774, 'samples': 15835584, 'steps': 82476, 'loss/train': 1.6190712451934814} 11/07/2021 08:49:44 - INFO - __main__ - Step 82478: {'lr': 0.000215736401213883, 'samples': 15835776, 'steps': 82477, 'loss/train': 1.4904332160949707} 11/07/2021 08:49:45 - INFO - __main__ - Step 82479: {'lr': 0.00021573114455422732, 'samples': 15835968, 'steps': 82478, 'loss/train': 1.4099061489105225} 11/07/2021 08:49:46 - INFO - __main__ - Step 82480: {'lr': 0.0002157258879100126, 'samples': 15836160, 'steps': 82479, 'loss/train': 1.0733166933059692} 11/07/2021 08:49:46 - INFO - __main__ - Step 82481: {'lr': 0.0002157206312812413, 'samples': 15836352, 'steps': 82480, 'loss/train': 1.5414074659347534} 11/07/2021 08:49:46 - INFO - __main__ - Step 82482: {'lr': 0.00021571537466791576, 'samples': 15836544, 'steps': 82481, 'loss/train': 1.406557559967041} 11/07/2021 08:49:47 - INFO - __main__ - Step 82483: {'lr': 0.00021571011807003832, 'samples': 15836736, 'steps': 82482, 'loss/train': 1.4902015924453735} 11/07/2021 08:49:48 - INFO - __main__ - Step 82484: {'lr': 0.00021570486148761136, 'samples': 15836928, 'steps': 82483, 'loss/train': 1.7375357151031494} 11/07/2021 08:49:48 - INFO - __main__ - Step 82485: {'lr': 0.00021569960492063729, 'samples': 15837120, 'steps': 82484, 'loss/train': 1.1894687414169312} 11/07/2021 08:49:49 - INFO - __main__ - Step 82486: {'lr': 0.00021569434836911846, 'samples': 15837312, 'steps': 82485, 'loss/train': 1.67225980758667} 11/07/2021 08:49:49 - INFO - __main__ - Step 82487: {'lr': 0.00021568909183305722, 'samples': 15837504, 'steps': 82486, 'loss/train': 1.0744554996490479} 11/07/2021 08:49:49 - INFO - __main__ - Step 82488: {'lr': 0.00021568383531245594, 'samples': 15837696, 'steps': 82487, 'loss/train': 1.1411857604980469} 11/07/2021 08:49:50 - INFO - __main__ - Step 82489: {'lr': 0.00021567857880731703, 'samples': 15837888, 'steps': 82488, 'loss/train': 1.0261824131011963} 11/07/2021 08:49:51 - INFO - __main__ - Step 82490: {'lr': 0.00021567332231764278, 'samples': 15838080, 'steps': 82489, 'loss/train': 1.5002728700637817} 11/07/2021 08:49:51 - INFO - __main__ - Step 82491: {'lr': 0.0002156680658434356, 'samples': 15838272, 'steps': 82490, 'loss/train': 1.0142550468444824} 11/07/2021 08:49:51 - INFO - __main__ - Step 82492: {'lr': 0.00021566280938469784, 'samples': 15838464, 'steps': 82491, 'loss/train': 1.4601317644119263} 11/07/2021 08:49:52 - INFO - __main__ - Step 82493: {'lr': 0.0002156575529414319, 'samples': 15838656, 'steps': 82492, 'loss/train': 1.6895371675491333} 11/07/2021 08:49:52 - INFO - __main__ - Step 82494: {'lr': 0.00021565229651364015, 'samples': 15838848, 'steps': 82493, 'loss/train': 1.7975903749465942} 11/07/2021 08:49:53 - INFO - __main__ - Step 82495: {'lr': 0.00021564704010132495, 'samples': 15839040, 'steps': 82494, 'loss/train': 1.3440723419189453} 11/07/2021 08:49:53 - INFO - __main__ - Step 82496: {'lr': 0.00021564178370448865, 'samples': 15839232, 'steps': 82495, 'loss/train': 1.3517591953277588} 11/07/2021 08:49:54 - INFO - __main__ - Step 82497: {'lr': 0.00021563652732313365, 'samples': 15839424, 'steps': 82496, 'loss/train': 1.2946252822875977} 11/07/2021 08:49:54 - INFO - __main__ - Step 82498: {'lr': 0.0002156312709572623, 'samples': 15839616, 'steps': 82497, 'loss/train': 1.151808500289917} 11/07/2021 08:49:55 - INFO - __main__ - Step 82499: {'lr': 0.00021562601460687697, 'samples': 15839808, 'steps': 82498, 'loss/train': 0.919017493724823} 11/07/2021 08:49:56 - INFO - __main__ - Step 82500: {'lr': 0.00021562075827197998, 'samples': 15840000, 'steps': 82499, 'loss/train': 0.8140735626220703} 11/07/2021 08:49:56 - INFO - __main__ - Step 82501: {'lr': 0.0002156155019525738, 'samples': 15840192, 'steps': 82500, 'loss/train': 1.6583458185195923} 11/07/2021 08:49:57 - INFO - __main__ - Step 82502: {'lr': 0.00021561024564866079, 'samples': 15840384, 'steps': 82501, 'loss/train': 1.3316515684127808} 11/07/2021 08:49:57 - INFO - __main__ - Step 82503: {'lr': 0.00021560498936024316, 'samples': 15840576, 'steps': 82502, 'loss/train': 1.5916963815689087} 11/07/2021 08:49:58 - INFO - __main__ - Step 82504: {'lr': 0.00021559973308732345, 'samples': 15840768, 'steps': 82503, 'loss/train': 1.7684450149536133} 11/07/2021 08:49:58 - INFO - __main__ - Step 82505: {'lr': 0.00021559447682990395, 'samples': 15840960, 'steps': 82504, 'loss/train': 1.7611323595046997} 11/07/2021 08:49:58 - INFO - __main__ - Step 82506: {'lr': 0.00021558922058798706, 'samples': 15841152, 'steps': 82505, 'loss/train': 1.8884209394454956} 11/07/2021 08:49:59 - INFO - __main__ - Step 82507: {'lr': 0.00021558396436157512, 'samples': 15841344, 'steps': 82506, 'loss/train': 1.0111958980560303} 11/07/2021 08:50:00 - INFO - __main__ - Step 82508: {'lr': 0.00021557870815067058, 'samples': 15841536, 'steps': 82507, 'loss/train': 1.7528259754180908} 11/07/2021 08:50:00 - INFO - __main__ - Step 82509: {'lr': 0.00021557345195527566, 'samples': 15841728, 'steps': 82508, 'loss/train': 2.1162667274475098} 11/07/2021 08:50:00 - INFO - __main__ - Step 82510: {'lr': 0.00021556819577539285, 'samples': 15841920, 'steps': 82509, 'loss/train': 0.6832578182220459} 11/07/2021 08:50:01 - INFO - __main__ - Step 82511: {'lr': 0.00021556293961102446, 'samples': 15842112, 'steps': 82510, 'loss/train': 1.610316514968872} 11/07/2021 08:50:02 - INFO - __main__ - Step 82512: {'lr': 0.00021555768346217288, 'samples': 15842304, 'steps': 82511, 'loss/train': 1.4024232625961304} 11/07/2021 08:50:02 - INFO - __main__ - Step 82513: {'lr': 0.0002155524273288405, 'samples': 15842496, 'steps': 82512, 'loss/train': 0.9365935325622559} 11/07/2021 08:50:02 - INFO - __main__ - Step 82514: {'lr': 0.00021554717121102964, 'samples': 15842688, 'steps': 82513, 'loss/train': 1.3550587892532349} 11/07/2021 08:50:03 - INFO - __main__ - Step 82515: {'lr': 0.00021554191510874275, 'samples': 15842880, 'steps': 82514, 'loss/train': 1.3022289276123047} 11/07/2021 08:50:03 - INFO - __main__ - Step 82516: {'lr': 0.0002155366590219821, 'samples': 15843072, 'steps': 82515, 'loss/train': 1.3388551473617554} 11/07/2021 08:50:04 - INFO - __main__ - Step 82517: {'lr': 0.00021553140295075009, 'samples': 15843264, 'steps': 82516, 'loss/train': 1.305236577987671} 11/07/2021 08:50:05 - INFO - __main__ - Step 82518: {'lr': 0.00021552614689504906, 'samples': 15843456, 'steps': 82517, 'loss/train': 1.4457136392593384} 11/07/2021 08:50:05 - INFO - __main__ - Step 82519: {'lr': 0.00021552089085488153, 'samples': 15843648, 'steps': 82518, 'loss/train': 2.1465113162994385} 11/07/2021 08:50:05 - INFO - __main__ - Step 82520: {'lr': 0.00021551563483024967, 'samples': 15843840, 'steps': 82519, 'loss/train': 1.4150965213775635} 11/07/2021 08:50:06 - INFO - __main__ - Step 82521: {'lr': 0.00021551037882115592, 'samples': 15844032, 'steps': 82520, 'loss/train': 1.1606382131576538} 11/07/2021 08:50:07 - INFO - __main__ - Step 82522: {'lr': 0.0002155051228276027, 'samples': 15844224, 'steps': 82521, 'loss/train': 1.798244595527649} 11/07/2021 08:50:07 - INFO - __main__ - Step 82523: {'lr': 0.00021549986684959234, 'samples': 15844416, 'steps': 82522, 'loss/train': 1.5806000232696533} 11/07/2021 08:50:08 - INFO - __main__ - Step 82524: {'lr': 0.00021549461088712717, 'samples': 15844608, 'steps': 82523, 'loss/train': 0.8816275000572205} 11/07/2021 08:50:08 - INFO - __main__ - Step 82525: {'lr': 0.0002154893549402096, 'samples': 15844800, 'steps': 82524, 'loss/train': 1.4721014499664307} 11/07/2021 08:50:08 - INFO - __main__ - Step 82526: {'lr': 0.00021548409900884203, 'samples': 15844992, 'steps': 82525, 'loss/train': 1.6915454864501953} 11/07/2021 08:50:09 - INFO - __main__ - Step 82527: {'lr': 0.00021547884309302675, 'samples': 15845184, 'steps': 82526, 'loss/train': 1.6098629236221313} 11/07/2021 08:50:10 - INFO - __main__ - Step 82528: {'lr': 0.0002154735871927662, 'samples': 15845376, 'steps': 82527, 'loss/train': 1.1455192565917969} 11/07/2021 08:50:10 - INFO - __main__ - Step 82529: {'lr': 0.00021546833130806276, 'samples': 15845568, 'steps': 82528, 'loss/train': 1.506150484085083} 11/07/2021 08:50:10 - INFO - __main__ - Step 82530: {'lr': 0.00021546307543891878, 'samples': 15845760, 'steps': 82529, 'loss/train': 1.6445621252059937} 11/07/2021 08:50:11 - INFO - __main__ - Step 82531: {'lr': 0.0002154578195853365, 'samples': 15845952, 'steps': 82530, 'loss/train': 1.5118776559829712} 11/07/2021 08:50:11 - INFO - __main__ - Step 82532: {'lr': 0.00021545256374731845, 'samples': 15846144, 'steps': 82531, 'loss/train': 1.8129405975341797} 11/07/2021 08:50:12 - INFO - __main__ - Step 82533: {'lr': 0.0002154473079248669, 'samples': 15846336, 'steps': 82532, 'loss/train': 1.739901065826416} 11/07/2021 08:50:12 - INFO - __main__ - Step 82534: {'lr': 0.0002154420521179843, 'samples': 15846528, 'steps': 82533, 'loss/train': 1.449499249458313} 11/07/2021 08:50:13 - INFO - __main__ - Step 82535: {'lr': 0.00021543679632667293, 'samples': 15846720, 'steps': 82534, 'loss/train': 1.3827987909317017} 11/07/2021 08:50:13 - INFO - __main__ - Step 82536: {'lr': 0.00021543154055093524, 'samples': 15846912, 'steps': 82535, 'loss/train': 1.4406712055206299} 11/07/2021 08:50:14 - INFO - __main__ - Step 82537: {'lr': 0.00021542628479077354, 'samples': 15847104, 'steps': 82536, 'loss/train': 1.2621335983276367} 11/07/2021 08:50:14 - INFO - __main__ - Step 82538: {'lr': 0.00021542102904619027, 'samples': 15847296, 'steps': 82537, 'loss/train': 0.8809535503387451} 11/07/2021 08:50:15 - INFO - __main__ - Step 82539: {'lr': 0.0002154157733171877, 'samples': 15847488, 'steps': 82538, 'loss/train': 1.5921837091445923} 11/07/2021 08:50:15 - INFO - __main__ - Step 82540: {'lr': 0.00021541051760376828, 'samples': 15847680, 'steps': 82539, 'loss/train': 1.452816128730774} 11/07/2021 08:50:16 - INFO - __main__ - Step 82541: {'lr': 0.0002154052619059343, 'samples': 15847872, 'steps': 82540, 'loss/train': 1.35382878780365} 11/07/2021 08:50:16 - INFO - __main__ - Step 82542: {'lr': 0.00021540000622368832, 'samples': 15848064, 'steps': 82541, 'loss/train': 1.5400842428207397} 11/07/2021 08:50:16 - INFO - __main__ - Step 82543: {'lr': 0.00021539475055703248, 'samples': 15848256, 'steps': 82542, 'loss/train': 1.4573625326156616} 11/07/2021 08:50:17 - INFO - __main__ - Step 82544: {'lr': 0.0002153894949059692, 'samples': 15848448, 'steps': 82543, 'loss/train': 1.5536811351776123} 11/07/2021 08:50:18 - INFO - __main__ - Step 82545: {'lr': 0.00021538423927050087, 'samples': 15848640, 'steps': 82544, 'loss/train': 0.4224875867366791} 11/07/2021 08:50:18 - INFO - __main__ - Step 82546: {'lr': 0.0002153789836506299, 'samples': 15848832, 'steps': 82545, 'loss/train': 1.3181488513946533} 11/07/2021 08:50:18 - INFO - __main__ - Step 82547: {'lr': 0.0002153737280463586, 'samples': 15849024, 'steps': 82546, 'loss/train': 1.0387552976608276} 11/07/2021 08:50:19 - INFO - __main__ - Step 82548: {'lr': 0.00021536847245768936, 'samples': 15849216, 'steps': 82547, 'loss/train': 2.109060049057007} 11/07/2021 08:50:20 - INFO - __main__ - Step 82549: {'lr': 0.00021536321688462456, 'samples': 15849408, 'steps': 82548, 'loss/train': 2.350878953933716} 11/07/2021 08:50:20 - INFO - __main__ - Step 82550: {'lr': 0.00021535796132716658, 'samples': 15849600, 'steps': 82549, 'loss/train': 1.7062913179397583} 11/07/2021 08:50:21 - INFO - __main__ - Step 82551: {'lr': 0.00021535270578531773, 'samples': 15849792, 'steps': 82550, 'loss/train': 1.5820292234420776} 11/07/2021 08:50:21 - INFO - __main__ - Step 82552: {'lr': 0.00021534745025908046, 'samples': 15849984, 'steps': 82551, 'loss/train': 1.4237111806869507} 11/07/2021 08:50:21 - INFO - __main__ - Step 82553: {'lr': 0.00021534219474845707, 'samples': 15850176, 'steps': 82552, 'loss/train': 1.2820985317230225} 11/07/2021 08:50:22 - INFO - __main__ - Step 82554: {'lr': 0.00021533693925344995, 'samples': 15850368, 'steps': 82553, 'loss/train': 1.5866881608963013} 11/07/2021 08:50:23 - INFO - __main__ - Step 82555: {'lr': 0.0002153316837740615, 'samples': 15850560, 'steps': 82554, 'loss/train': 1.4372062683105469} 11/07/2021 08:50:23 - INFO - __main__ - Step 82556: {'lr': 0.0002153264283102941, 'samples': 15850752, 'steps': 82555, 'loss/train': 1.1200287342071533} 11/07/2021 08:50:23 - INFO - __main__ - Step 82557: {'lr': 0.00021532117286215003, 'samples': 15850944, 'steps': 82556, 'loss/train': 1.3195509910583496} 11/07/2021 08:50:24 - INFO - __main__ - Step 82558: {'lr': 0.0002153159174296317, 'samples': 15851136, 'steps': 82557, 'loss/train': 1.5278773307800293} 11/07/2021 08:50:25 - INFO - __main__ - Step 82559: {'lr': 0.00021531066201274144, 'samples': 15851328, 'steps': 82558, 'loss/train': 1.4323607683181763} 11/07/2021 08:50:25 - INFO - __main__ - Step 82560: {'lr': 0.00021530540661148168, 'samples': 15851520, 'steps': 82559, 'loss/train': 1.138346552848816} 11/07/2021 08:50:25 - INFO - __main__ - Step 82561: {'lr': 0.00021530015122585478, 'samples': 15851712, 'steps': 82560, 'loss/train': 1.4971433877944946} 11/07/2021 08:50:26 - INFO - __main__ - Step 82562: {'lr': 0.0002152948958558631, 'samples': 15851904, 'steps': 82561, 'loss/train': 1.5105804204940796} 11/07/2021 08:50:26 - INFO - __main__ - Step 82563: {'lr': 0.00021528964050150897, 'samples': 15852096, 'steps': 82562, 'loss/train': 1.1816637516021729} 11/07/2021 08:50:27 - INFO - __main__ - Step 82564: {'lr': 0.00021528438516279483, 'samples': 15852288, 'steps': 82563, 'loss/train': 0.7409285306930542} 11/07/2021 08:50:28 - INFO - __main__ - Step 82565: {'lr': 0.000215279129839723, 'samples': 15852480, 'steps': 82564, 'loss/train': 1.0075961351394653} 11/07/2021 08:50:28 - INFO - __main__ - Step 82566: {'lr': 0.00021527387453229585, 'samples': 15852672, 'steps': 82565, 'loss/train': 1.4603925943374634} 11/07/2021 08:50:28 - INFO - __main__ - Step 82567: {'lr': 0.00021526861924051578, 'samples': 15852864, 'steps': 82566, 'loss/train': 1.5156173706054688} 11/07/2021 08:50:29 - INFO - __main__ - Step 82568: {'lr': 0.00021526336396438512, 'samples': 15853056, 'steps': 82567, 'loss/train': 1.0303725004196167} 11/07/2021 08:50:29 - INFO - __main__ - Step 82569: {'lr': 0.00021525810870390635, 'samples': 15853248, 'steps': 82568, 'loss/train': 1.4258087873458862} 11/07/2021 08:50:30 - INFO - __main__ - Step 82570: {'lr': 0.00021525285345908162, 'samples': 15853440, 'steps': 82569, 'loss/train': 0.6993128061294556} 11/07/2021 08:50:31 - INFO - __main__ - Step 82571: {'lr': 0.00021524759822991348, 'samples': 15853632, 'steps': 82570, 'loss/train': 0.8701749444007874} 11/07/2021 08:50:31 - INFO - __main__ - Step 82572: {'lr': 0.00021524234301640416, 'samples': 15853824, 'steps': 82571, 'loss/train': 1.084681510925293} 11/07/2021 08:50:31 - INFO - __main__ - Step 82573: {'lr': 0.00021523708781855615, 'samples': 15854016, 'steps': 82572, 'loss/train': 1.3900279998779297} 11/07/2021 08:50:32 - INFO - __main__ - Step 82574: {'lr': 0.00021523183263637174, 'samples': 15854208, 'steps': 82573, 'loss/train': 1.4518247842788696} 11/07/2021 08:50:33 - INFO - __main__ - Step 82575: {'lr': 0.00021522657746985335, 'samples': 15854400, 'steps': 82574, 'loss/train': 1.2199461460113525} 11/07/2021 08:50:33 - INFO - __main__ - Step 82576: {'lr': 0.00021522132231900336, 'samples': 15854592, 'steps': 82575, 'loss/train': 0.8734093308448792} 11/07/2021 08:50:33 - INFO - __main__ - Step 82577: {'lr': 0.00021521606718382405, 'samples': 15854784, 'steps': 82576, 'loss/train': 1.2558624744415283} 11/07/2021 08:50:34 - INFO - __main__ - Step 82578: {'lr': 0.00021521081206431786, 'samples': 15854976, 'steps': 82577, 'loss/train': 1.1634727716445923} 11/07/2021 08:50:34 - INFO - __main__ - Step 82579: {'lr': 0.00021520555696048717, 'samples': 15855168, 'steps': 82578, 'loss/train': 1.519073247909546} 11/07/2021 08:50:35 - INFO - __main__ - Step 82580: {'lr': 0.00021520030187233429, 'samples': 15855360, 'steps': 82579, 'loss/train': 1.188177466392517} 11/07/2021 08:50:35 - INFO - __main__ - Step 82581: {'lr': 0.0002151950467998616, 'samples': 15855552, 'steps': 82580, 'loss/train': 1.2983920574188232} 11/07/2021 08:50:36 - INFO - __main__ - Step 82582: {'lr': 0.0002151897917430715, 'samples': 15855744, 'steps': 82581, 'loss/train': 1.2638027667999268} 11/07/2021 08:50:36 - INFO - __main__ - Step 82583: {'lr': 0.00021518453670196647, 'samples': 15855936, 'steps': 82582, 'loss/train': 1.477757453918457} 11/07/2021 08:50:36 - INFO - __main__ - Step 82584: {'lr': 0.00021517928167654862, 'samples': 15856128, 'steps': 82583, 'loss/train': 2.1945300102233887} 11/07/2021 08:50:37 - INFO - __main__ - Step 82585: {'lr': 0.0002151740266668205, 'samples': 15856320, 'steps': 82584, 'loss/train': 1.2346068620681763} 11/07/2021 08:50:38 - INFO - __main__ - Step 82586: {'lr': 0.00021516877167278436, 'samples': 15856512, 'steps': 82585, 'loss/train': 1.8505228757858276} 11/07/2021 08:50:38 - INFO - __main__ - Step 82587: {'lr': 0.00021516351669444267, 'samples': 15856704, 'steps': 82586, 'loss/train': 1.6565080881118774} 11/07/2021 08:50:38 - INFO - __main__ - Step 82588: {'lr': 0.00021515826173179774, 'samples': 15856896, 'steps': 82587, 'loss/train': 1.3509105443954468} 11/07/2021 08:50:39 - INFO - __main__ - Step 82589: {'lr': 0.00021515300678485197, 'samples': 15857088, 'steps': 82588, 'loss/train': 1.358515977859497} 11/07/2021 08:50:40 - INFO - __main__ - Step 82590: {'lr': 0.00021514775185360773, 'samples': 15857280, 'steps': 82589, 'loss/train': 1.3482956886291504} 11/07/2021 08:50:40 - INFO - __main__ - Step 82591: {'lr': 0.00021514249693806734, 'samples': 15857472, 'steps': 82590, 'loss/train': 1.0121067762374878} 11/07/2021 08:50:41 - INFO - __main__ - Step 82592: {'lr': 0.00021513724203823324, 'samples': 15857664, 'steps': 82591, 'loss/train': 1.1953262090682983} 11/07/2021 08:50:41 - INFO - __main__ - Step 82593: {'lr': 0.00021513198715410775, 'samples': 15857856, 'steps': 82592, 'loss/train': 1.4955424070358276} 11/07/2021 08:50:41 - INFO - __main__ - Step 82594: {'lr': 0.00021512673228569324, 'samples': 15858048, 'steps': 82593, 'loss/train': 1.0253771543502808} 11/07/2021 08:50:42 - INFO - __main__ - Step 82595: {'lr': 0.0002151214774329921, 'samples': 15858240, 'steps': 82594, 'loss/train': 1.2395638227462769} 11/07/2021 08:50:43 - INFO - __main__ - Step 82596: {'lr': 0.00021511622259600676, 'samples': 15858432, 'steps': 82595, 'loss/train': 1.3913979530334473} 11/07/2021 08:50:43 - INFO - __main__ - Step 82597: {'lr': 0.00021511096777473943, 'samples': 15858624, 'steps': 82596, 'loss/train': 0.9776957035064697} 11/07/2021 08:50:43 - INFO - __main__ - Step 82598: {'lr': 0.00021510571296919258, 'samples': 15858816, 'steps': 82597, 'loss/train': 0.8311463594436646} 11/07/2021 08:50:44 - INFO - __main__ - Step 82599: {'lr': 0.00021510045817936852, 'samples': 15859008, 'steps': 82598, 'loss/train': 1.0182503461837769} 11/07/2021 08:50:44 - INFO - __main__ - Step 82600: {'lr': 0.00021509520340526968, 'samples': 15859200, 'steps': 82599, 'loss/train': 1.1785240173339844} 11/07/2021 08:50:45 - INFO - __main__ - Step 82601: {'lr': 0.00021508994864689838, 'samples': 15859392, 'steps': 82600, 'loss/train': 1.2533423900604248} 11/07/2021 08:50:46 - INFO - __main__ - Step 82602: {'lr': 0.00021508469390425704, 'samples': 15859584, 'steps': 82601, 'loss/train': 0.060346819460392} 11/07/2021 08:50:46 - INFO - __main__ - Step 82603: {'lr': 0.00021507943917734796, 'samples': 15859776, 'steps': 82602, 'loss/train': 1.1079959869384766} 11/07/2021 08:50:46 - INFO - __main__ - Step 82604: {'lr': 0.00021507418446617359, 'samples': 15859968, 'steps': 82603, 'loss/train': 1.1859251260757446} 11/07/2021 08:50:47 - INFO - __main__ - Step 82605: {'lr': 0.0002150689297707362, 'samples': 15860160, 'steps': 82604, 'loss/train': 6.191585540771484} 11/07/2021 08:50:48 - INFO - __main__ - Step 82606: {'lr': 0.00021506367509103826, 'samples': 15860352, 'steps': 82605, 'loss/train': 1.3791728019714355} 11/07/2021 08:50:48 - INFO - __main__ - Step 82607: {'lr': 0.00021505842042708208, 'samples': 15860544, 'steps': 82606, 'loss/train': 1.7871335744857788} 11/07/2021 08:50:49 - INFO - __main__ - Step 82608: {'lr': 0.00021505316577887003, 'samples': 15860736, 'steps': 82607, 'loss/train': 1.0094845294952393} 11/07/2021 08:50:49 - INFO - __main__ - Step 82609: {'lr': 0.0002150479111464045, 'samples': 15860928, 'steps': 82608, 'loss/train': 1.3189888000488281} 11/07/2021 08:50:49 - INFO - __main__ - Step 82610: {'lr': 0.0002150426565296879, 'samples': 15861120, 'steps': 82609, 'loss/train': 1.5714364051818848} 11/07/2021 08:50:50 - INFO - __main__ - Step 82611: {'lr': 0.00021503740192872246, 'samples': 15861312, 'steps': 82610, 'loss/train': 1.6966698169708252} 11/07/2021 08:50:51 - INFO - __main__ - Step 82612: {'lr': 0.0002150321473435106, 'samples': 15861504, 'steps': 82611, 'loss/train': 1.698811411857605} 11/07/2021 08:50:51 - INFO - __main__ - Step 82613: {'lr': 0.00021502689277405477, 'samples': 15861696, 'steps': 82612, 'loss/train': 1.4872727394104004} 11/07/2021 08:50:51 - INFO - __main__ - Step 82614: {'lr': 0.00021502163822035726, 'samples': 15861888, 'steps': 82613, 'loss/train': 1.3208465576171875} 11/07/2021 08:50:52 - INFO - __main__ - Step 82615: {'lr': 0.00021501638368242045, 'samples': 15862080, 'steps': 82614, 'loss/train': 1.166513204574585} 11/07/2021 08:50:53 - INFO - __main__ - Step 82616: {'lr': 0.00021501112916024674, 'samples': 15862272, 'steps': 82615, 'loss/train': 1.0460972785949707} 11/07/2021 08:50:53 - INFO - __main__ - Step 82617: {'lr': 0.00021500587465383844, 'samples': 15862464, 'steps': 82616, 'loss/train': 1.2064082622528076} 11/07/2021 08:50:53 - INFO - __main__ - Step 82618: {'lr': 0.000215000620163198, 'samples': 15862656, 'steps': 82617, 'loss/train': 1.5203351974487305} 11/07/2021 08:50:54 - INFO - __main__ - Step 82619: {'lr': 0.0002149953656883277, 'samples': 15862848, 'steps': 82618, 'loss/train': 1.4682459831237793} 11/07/2021 08:50:54 - INFO - __main__ - Step 82620: {'lr': 0.00021499011122923, 'samples': 15863040, 'steps': 82619, 'loss/train': 1.2063813209533691} 11/07/2021 08:50:55 - INFO - __main__ - Step 82621: {'lr': 0.00021498485678590718, 'samples': 15863232, 'steps': 82620, 'loss/train': 0.8858861923217773} 11/07/2021 08:50:55 - INFO - __main__ - Step 82622: {'lr': 0.00021497960235836164, 'samples': 15863424, 'steps': 82621, 'loss/train': 1.1098650693893433} 11/07/2021 08:50:56 - INFO - __main__ - Step 82623: {'lr': 0.00021497434794659582, 'samples': 15863616, 'steps': 82622, 'loss/train': 1.6696362495422363} 11/07/2021 08:50:56 - INFO - __main__ - Step 82624: {'lr': 0.00021496909355061194, 'samples': 15863808, 'steps': 82623, 'loss/train': 1.5455541610717773} 11/07/2021 08:50:57 - INFO - __main__ - Step 82625: {'lr': 0.00021496383917041245, 'samples': 15864000, 'steps': 82624, 'loss/train': 1.0934135913848877} 11/07/2021 08:50:57 - INFO - __main__ - Step 82626: {'lr': 0.00021495858480599973, 'samples': 15864192, 'steps': 82625, 'loss/train': 1.1730319261550903} 11/07/2021 08:50:58 - INFO - __main__ - Step 82627: {'lr': 0.0002149533304573761, 'samples': 15864384, 'steps': 82626, 'loss/train': 1.1868492364883423} 11/07/2021 08:50:58 - INFO - __main__ - Step 82628: {'lr': 0.00021494807612454397, 'samples': 15864576, 'steps': 82627, 'loss/train': 1.2742805480957031} 11/07/2021 08:50:59 - INFO - __main__ - Step 82629: {'lr': 0.00021494282180750573, 'samples': 15864768, 'steps': 82628, 'loss/train': 1.4770649671554565} 11/07/2021 08:50:59 - INFO - __main__ - Step 82630: {'lr': 0.0002149375675062637, 'samples': 15864960, 'steps': 82629, 'loss/train': 1.5500187873840332} 11/07/2021 08:50:59 - INFO - __main__ - Step 82631: {'lr': 0.0002149323132208203, 'samples': 15865152, 'steps': 82630, 'loss/train': 1.2989474534988403} 11/07/2021 08:51:01 - INFO - __main__ - Step 82632: {'lr': 0.00021492705895117777, 'samples': 15865344, 'steps': 82631, 'loss/train': 1.2776308059692383} 11/07/2021 08:51:01 - INFO - __main__ - Step 82633: {'lr': 0.00021492180469733863, 'samples': 15865536, 'steps': 82632, 'loss/train': 1.1949962377548218} 11/07/2021 08:51:01 - INFO - __main__ - Step 82634: {'lr': 0.00021491655045930515, 'samples': 15865728, 'steps': 82633, 'loss/train': 1.6698806285858154} 11/07/2021 08:51:02 - INFO - __main__ - Step 82635: {'lr': 0.0002149112962370797, 'samples': 15865920, 'steps': 82634, 'loss/train': 1.3935543298721313} 11/07/2021 08:51:02 - INFO - __main__ - Step 82636: {'lr': 0.0002149060420306648, 'samples': 15866112, 'steps': 82635, 'loss/train': 1.4004994630813599} 11/07/2021 08:51:02 - INFO - __main__ - Step 82637: {'lr': 0.00021490078784006263, 'samples': 15866304, 'steps': 82636, 'loss/train': 1.9699249267578125} 11/07/2021 08:51:03 - INFO - __main__ - Step 82638: {'lr': 0.0002148955336652756, 'samples': 15866496, 'steps': 82637, 'loss/train': 1.5084896087646484} 11/07/2021 08:51:04 - INFO - __main__ - Step 82639: {'lr': 0.0002148902795063061, 'samples': 15866688, 'steps': 82638, 'loss/train': 1.2675844430923462} 11/07/2021 08:51:04 - INFO - __main__ - Step 82640: {'lr': 0.0002148850253631565, 'samples': 15866880, 'steps': 82639, 'loss/train': 1.569905400276184} 11/07/2021 08:51:04 - INFO - __main__ - Step 82641: {'lr': 0.0002148797712358292, 'samples': 15867072, 'steps': 82640, 'loss/train': 1.469749927520752} 11/07/2021 08:51:05 - INFO - __main__ - Step 82642: {'lr': 0.00021487451712432653, 'samples': 15867264, 'steps': 82641, 'loss/train': 1.5549705028533936} 11/07/2021 08:51:06 - INFO - __main__ - Step 82643: {'lr': 0.00021486926302865085, 'samples': 15867456, 'steps': 82642, 'loss/train': 1.0584627389907837} 11/07/2021 08:51:06 - INFO - __main__ - Step 82644: {'lr': 0.0002148640089488045, 'samples': 15867648, 'steps': 82643, 'loss/train': 1.4899319410324097} 11/07/2021 08:51:06 - INFO - __main__ - Step 82645: {'lr': 0.0002148587548847899, 'samples': 15867840, 'steps': 82644, 'loss/train': 1.2221523523330688} 11/07/2021 08:51:07 - INFO - __main__ - Step 82646: {'lr': 0.00021485350083660942, 'samples': 15868032, 'steps': 82645, 'loss/train': 1.290649175643921} 11/07/2021 08:51:07 - INFO - __main__ - Step 82647: {'lr': 0.0002148482468042654, 'samples': 15868224, 'steps': 82646, 'loss/train': 1.995383381843567} 11/07/2021 08:51:08 - INFO - __main__ - Step 82648: {'lr': 0.00021484299278776023, 'samples': 15868416, 'steps': 82647, 'loss/train': 1.3429487943649292} 11/07/2021 08:51:09 - INFO - __main__ - Step 82649: {'lr': 0.00021483773878709625, 'samples': 15868608, 'steps': 82648, 'loss/train': 1.61557137966156} 11/07/2021 08:51:09 - INFO - __main__ - Step 82650: {'lr': 0.0002148324848022759, 'samples': 15868800, 'steps': 82649, 'loss/train': 1.716193675994873} 11/07/2021 08:51:09 - INFO - __main__ - Step 82651: {'lr': 0.00021482723083330143, 'samples': 15868992, 'steps': 82650, 'loss/train': 0.9294813871383667} 11/07/2021 08:51:10 - INFO - __main__ - Step 82652: {'lr': 0.00021482197688017527, 'samples': 15869184, 'steps': 82651, 'loss/train': 1.5527698993682861} 11/07/2021 08:51:11 - INFO - __main__ - Step 82653: {'lr': 0.00021481672294289982, 'samples': 15869376, 'steps': 82652, 'loss/train': 1.5247972011566162} 11/07/2021 08:51:11 - INFO - __main__ - Step 82654: {'lr': 0.00021481146902147742, 'samples': 15869568, 'steps': 82653, 'loss/train': 1.414066195487976} 11/07/2021 08:51:11 - INFO - __main__ - Step 82655: {'lr': 0.00021480621511591036, 'samples': 15869760, 'steps': 82654, 'loss/train': 1.8463201522827148} 11/07/2021 08:51:12 - INFO - __main__ - Step 82656: {'lr': 0.00021480096122620114, 'samples': 15869952, 'steps': 82655, 'loss/train': 1.180302381515503} 11/07/2021 08:51:12 - INFO - __main__ - Step 82657: {'lr': 0.00021479570735235198, 'samples': 15870144, 'steps': 82656, 'loss/train': 1.7238938808441162} 11/07/2021 08:51:12 - INFO - __main__ - Step 82658: {'lr': 0.0002147904534943654, 'samples': 15870336, 'steps': 82657, 'loss/train': 1.5439066886901855} 11/07/2021 08:51:14 - INFO - __main__ - Step 82659: {'lr': 0.00021478519965224368, 'samples': 15870528, 'steps': 82658, 'loss/train': 1.2385506629943848} 11/07/2021 08:51:14 - INFO - __main__ - Step 82660: {'lr': 0.0002147799458259892, 'samples': 15870720, 'steps': 82659, 'loss/train': 1.4230011701583862} 11/07/2021 08:51:14 - INFO - __main__ - Step 82661: {'lr': 0.00021477469201560434, 'samples': 15870912, 'steps': 82660, 'loss/train': 1.6178014278411865} 11/07/2021 08:51:15 - INFO - __main__ - Step 82662: {'lr': 0.00021476943822109146, 'samples': 15871104, 'steps': 82661, 'loss/train': 2.5007681846618652} 11/07/2021 08:51:15 - INFO - __main__ - Step 82663: {'lr': 0.00021476418444245297, 'samples': 15871296, 'steps': 82662, 'loss/train': 1.5838128328323364} 11/07/2021 08:51:16 - INFO - __main__ - Step 82664: {'lr': 0.00021475893067969122, 'samples': 15871488, 'steps': 82663, 'loss/train': 1.627266526222229} 11/07/2021 08:51:16 - INFO - __main__ - Step 82665: {'lr': 0.00021475367693280845, 'samples': 15871680, 'steps': 82664, 'loss/train': 1.5458399057388306} 11/07/2021 08:51:17 - INFO - __main__ - Step 82666: {'lr': 0.00021474842320180716, 'samples': 15871872, 'steps': 82665, 'loss/train': 1.105714201927185} 11/07/2021 08:51:17 - INFO - __main__ - Step 82667: {'lr': 0.0002147431694866897, 'samples': 15872064, 'steps': 82666, 'loss/train': 1.5922152996063232} 11/07/2021 08:51:17 - INFO - __main__ - Step 82668: {'lr': 0.0002147379157874584, 'samples': 15872256, 'steps': 82667, 'loss/train': 1.4888540506362915} 11/07/2021 08:51:18 - INFO - __main__ - Step 82669: {'lr': 0.00021473266210411565, 'samples': 15872448, 'steps': 82668, 'loss/train': 0.9436287879943848} 11/07/2021 08:51:19 - INFO - __main__ - Step 82670: {'lr': 0.00021472740843666384, 'samples': 15872640, 'steps': 82669, 'loss/train': 1.5465508699417114} 11/07/2021 08:51:19 - INFO - __main__ - Step 82671: {'lr': 0.00021472215478510531, 'samples': 15872832, 'steps': 82670, 'loss/train': 1.4720330238342285} 11/07/2021 08:51:19 - INFO - __main__ - Step 82672: {'lr': 0.00021471690114944242, 'samples': 15873024, 'steps': 82671, 'loss/train': 1.3960769176483154} 11/07/2021 08:51:20 - INFO - __main__ - Step 82673: {'lr': 0.00021471164752967755, 'samples': 15873216, 'steps': 82672, 'loss/train': 1.1674267053604126} 11/07/2021 08:51:21 - INFO - __main__ - Step 82674: {'lr': 0.00021470639392581309, 'samples': 15873408, 'steps': 82673, 'loss/train': 1.1575077772140503} 11/07/2021 08:51:21 - INFO - __main__ - Step 82675: {'lr': 0.00021470114033785137, 'samples': 15873600, 'steps': 82674, 'loss/train': 1.321555495262146} 11/07/2021 08:51:22 - INFO - __main__ - Step 82676: {'lr': 0.00021469588676579476, 'samples': 15873792, 'steps': 82675, 'loss/train': 0.704667866230011} 11/07/2021 08:51:22 - INFO - __main__ - Step 82677: {'lr': 0.00021469063320964576, 'samples': 15873984, 'steps': 82676, 'loss/train': 1.7382822036743164} 11/07/2021 08:51:22 - INFO - __main__ - Step 82678: {'lr': 0.00021468537966940648, 'samples': 15874176, 'steps': 82677, 'loss/train': 1.4835494756698608} 11/07/2021 08:51:23 - INFO - __main__ - Step 82679: {'lr': 0.0002146801261450795, 'samples': 15874368, 'steps': 82678, 'loss/train': 1.6091833114624023} 11/07/2021 08:51:24 - INFO - __main__ - Step 82680: {'lr': 0.000214674872636667, 'samples': 15874560, 'steps': 82679, 'loss/train': 1.3749690055847168} 11/07/2021 08:51:24 - INFO - __main__ - Step 82681: {'lr': 0.00021466961914417155, 'samples': 15874752, 'steps': 82680, 'loss/train': 1.6376901865005493} 11/07/2021 08:51:24 - INFO - __main__ - Step 82682: {'lr': 0.00021466436566759537, 'samples': 15874944, 'steps': 82681, 'loss/train': 1.5415542125701904} 11/07/2021 08:51:25 - INFO - __main__ - Step 82683: {'lr': 0.0002146591122069409, 'samples': 15875136, 'steps': 82682, 'loss/train': 1.3438351154327393} 11/07/2021 08:51:25 - INFO - __main__ - Step 82684: {'lr': 0.0002146538587622105, 'samples': 15875328, 'steps': 82683, 'loss/train': 1.0718023777008057} 11/07/2021 08:51:26 - INFO - __main__ - Step 82685: {'lr': 0.0002146486053334065, 'samples': 15875520, 'steps': 82684, 'loss/train': 1.5834407806396484} 11/07/2021 08:51:27 - INFO - __main__ - Step 82686: {'lr': 0.0002146433519205313, 'samples': 15875712, 'steps': 82685, 'loss/train': 1.3342164754867554} 11/07/2021 08:51:27 - INFO - __main__ - Step 82687: {'lr': 0.00021463809852358728, 'samples': 15875904, 'steps': 82686, 'loss/train': 2.0655677318573} 11/07/2021 08:51:27 - INFO - __main__ - Step 82688: {'lr': 0.00021463284514257677, 'samples': 15876096, 'steps': 82687, 'loss/train': 1.0886660814285278} 11/07/2021 08:51:28 - INFO - __main__ - Step 82689: {'lr': 0.0002146275917775022, 'samples': 15876288, 'steps': 82688, 'loss/train': 1.8396873474121094} 11/07/2021 08:51:29 - INFO - __main__ - Step 82690: {'lr': 0.00021462233842836593, 'samples': 15876480, 'steps': 82689, 'loss/train': 1.3614848852157593} 11/07/2021 08:51:29 - INFO - __main__ - Step 82691: {'lr': 0.0002146170850951702, 'samples': 15876672, 'steps': 82690, 'loss/train': 1.3077834844589233} 11/07/2021 08:51:30 - INFO - __main__ - Step 82692: {'lr': 0.00021461183177791748, 'samples': 15876864, 'steps': 82691, 'loss/train': 1.4886780977249146} 11/07/2021 08:51:30 - INFO - __main__ - Step 82693: {'lr': 0.00021460657847661012, 'samples': 15877056, 'steps': 82692, 'loss/train': 1.4749325513839722} 11/07/2021 08:51:30 - INFO - __main__ - Step 82694: {'lr': 0.00021460132519125047, 'samples': 15877248, 'steps': 82693, 'loss/train': 1.532044768333435} 11/07/2021 08:51:31 - INFO - __main__ - Step 82695: {'lr': 0.00021459607192184094, 'samples': 15877440, 'steps': 82694, 'loss/train': 0.5074987411499023} 11/07/2021 08:51:32 - INFO - __main__ - Step 82696: {'lr': 0.00021459081866838386, 'samples': 15877632, 'steps': 82695, 'loss/train': 1.365346074104309} 11/07/2021 08:51:32 - INFO - __main__ - Step 82697: {'lr': 0.00021458556543088163, 'samples': 15877824, 'steps': 82696, 'loss/train': 1.4960296154022217} 11/07/2021 08:51:33 - INFO - __main__ - Step 82698: {'lr': 0.0002145803122093366, 'samples': 15878016, 'steps': 82697, 'loss/train': 1.6735286712646484} 11/07/2021 08:51:33 - INFO - __main__ - Step 82699: {'lr': 0.0002145750590037511, 'samples': 15878208, 'steps': 82698, 'loss/train': 1.3522460460662842} 11/07/2021 08:51:33 - INFO - __main__ - Step 82700: {'lr': 0.00021456980581412754, 'samples': 15878400, 'steps': 82699, 'loss/train': 1.3731358051300049} 11/07/2021 08:51:35 - INFO - __main__ - Step 82701: {'lr': 0.00021456455264046831, 'samples': 15878592, 'steps': 82700, 'loss/train': 1.280830979347229} 11/07/2021 08:51:35 - INFO - __main__ - Step 82702: {'lr': 0.00021455929948277573, 'samples': 15878784, 'steps': 82701, 'loss/train': 1.4599008560180664} 11/07/2021 08:51:35 - INFO - __main__ - Step 82703: {'lr': 0.0002145540463410522, 'samples': 15878976, 'steps': 82702, 'loss/train': 0.16095943748950958} 11/07/2021 08:51:36 - INFO - __main__ - Step 82704: {'lr': 0.00021454879321530012, 'samples': 15879168, 'steps': 82703, 'loss/train': 1.7700997591018677} 11/07/2021 08:51:36 - INFO - __main__ - Step 82705: {'lr': 0.00021454354010552173, 'samples': 15879360, 'steps': 82704, 'loss/train': 1.4002645015716553} 11/07/2021 08:51:36 - INFO - __main__ - Step 82706: {'lr': 0.00021453828701171952, 'samples': 15879552, 'steps': 82705, 'loss/train': 1.7017767429351807} 11/07/2021 08:51:37 - INFO - __main__ - Step 82707: {'lr': 0.00021453303393389575, 'samples': 15879744, 'steps': 82706, 'loss/train': 1.4379063844680786} 11/07/2021 08:51:38 - INFO - __main__ - Step 82708: {'lr': 0.0002145277808720529, 'samples': 15879936, 'steps': 82707, 'loss/train': 1.40292227268219} 11/07/2021 08:51:38 - INFO - __main__ - Step 82709: {'lr': 0.0002145225278261932, 'samples': 15880128, 'steps': 82708, 'loss/train': 1.570507287979126} 11/07/2021 08:51:38 - INFO - __main__ - Step 82710: {'lr': 0.00021451727479631917, 'samples': 15880320, 'steps': 82709, 'loss/train': 0.44780486822128296} 11/07/2021 08:51:39 - INFO - __main__ - Step 82711: {'lr': 0.0002145120217824331, 'samples': 15880512, 'steps': 82710, 'loss/train': 1.2710981369018555} 11/07/2021 08:51:40 - INFO - __main__ - Step 82712: {'lr': 0.00021450676878453736, 'samples': 15880704, 'steps': 82711, 'loss/train': 1.0734025239944458} 11/07/2021 08:51:40 - INFO - __main__ - Step 82713: {'lr': 0.0002145015158026343, 'samples': 15880896, 'steps': 82712, 'loss/train': 1.3205821514129639} 11/07/2021 08:51:40 - INFO - __main__ - Step 82714: {'lr': 0.00021449626283672634, 'samples': 15881088, 'steps': 82713, 'loss/train': 1.4587551355361938} 11/07/2021 08:51:41 - INFO - __main__ - Step 82715: {'lr': 0.0002144910098868158, 'samples': 15881280, 'steps': 82714, 'loss/train': 1.3450337648391724} 11/07/2021 08:51:41 - INFO - __main__ - Step 82716: {'lr': 0.00021448575695290508, 'samples': 15881472, 'steps': 82715, 'loss/train': 1.4304943084716797} 11/07/2021 08:51:42 - INFO - __main__ - Step 82717: {'lr': 0.00021448050403499662, 'samples': 15881664, 'steps': 82716, 'loss/train': 1.10431969165802} 11/07/2021 08:51:43 - INFO - __main__ - Step 82718: {'lr': 0.0002144752511330926, 'samples': 15881856, 'steps': 82717, 'loss/train': 1.2585493326187134} 11/07/2021 08:51:43 - INFO - __main__ - Step 82719: {'lr': 0.0002144699982471955, 'samples': 15882048, 'steps': 82718, 'loss/train': 1.6491687297821045} 11/07/2021 08:51:43 - INFO - __main__ - Step 82720: {'lr': 0.00021446474537730763, 'samples': 15882240, 'steps': 82719, 'loss/train': 1.0763362646102905} 11/07/2021 08:51:44 - INFO - __main__ - Step 82721: {'lr': 0.0002144594925234314, 'samples': 15882432, 'steps': 82720, 'loss/train': 1.6178399324417114} 11/07/2021 08:51:45 - INFO - __main__ - Step 82722: {'lr': 0.0002144542396855692, 'samples': 15882624, 'steps': 82721, 'loss/train': 2.4635512828826904} 11/07/2021 08:51:45 - INFO - __main__ - Step 82723: {'lr': 0.00021444898686372337, 'samples': 15882816, 'steps': 82722, 'loss/train': 1.6029655933380127} 11/07/2021 08:51:45 - INFO - __main__ - Step 82724: {'lr': 0.00021444373405789623, 'samples': 15883008, 'steps': 82723, 'loss/train': 1.8693342208862305} 11/07/2021 08:51:46 - INFO - __main__ - Step 82725: {'lr': 0.00021443848126809028, 'samples': 15883200, 'steps': 82724, 'loss/train': 1.2980314493179321} 11/07/2021 08:51:46 - INFO - __main__ - Step 82726: {'lr': 0.00021443322849430774, 'samples': 15883392, 'steps': 82725, 'loss/train': 1.1873598098754883} 11/07/2021 08:51:47 - INFO - __main__ - Step 82727: {'lr': 0.00021442797573655104, 'samples': 15883584, 'steps': 82726, 'loss/train': 1.0803484916687012} 11/07/2021 08:51:48 - INFO - __main__ - Step 82728: {'lr': 0.00021442272299482257, 'samples': 15883776, 'steps': 82727, 'loss/train': 1.3750605583190918} 11/07/2021 08:51:48 - INFO - __main__ - Step 82729: {'lr': 0.00021441747026912467, 'samples': 15883968, 'steps': 82728, 'loss/train': 1.3510823249816895} 11/07/2021 08:51:48 - INFO - __main__ - Step 82730: {'lr': 0.00021441221755945968, 'samples': 15884160, 'steps': 82729, 'loss/train': 1.3971812725067139} 11/07/2021 08:51:49 - INFO - __main__ - Step 82731: {'lr': 0.00021440696486583013, 'samples': 15884352, 'steps': 82730, 'loss/train': 1.317294955253601} 11/07/2021 08:51:49 - INFO - __main__ - Step 82732: {'lr': 0.00021440171218823815, 'samples': 15884544, 'steps': 82731, 'loss/train': 1.4203640222549438} 11/07/2021 08:51:50 - INFO - __main__ - Step 82733: {'lr': 0.00021439645952668618, 'samples': 15884736, 'steps': 82732, 'loss/train': 1.1984682083129883} 11/07/2021 08:51:50 - INFO - __main__ - Step 82734: {'lr': 0.00021439120688117663, 'samples': 15884928, 'steps': 82733, 'loss/train': 1.255486249923706} 11/07/2021 08:51:51 - INFO - __main__ - Step 82735: {'lr': 0.00021438595425171187, 'samples': 15885120, 'steps': 82734, 'loss/train': 1.9885833263397217} 11/07/2021 08:51:51 - INFO - __main__ - Step 82736: {'lr': 0.00021438070163829422, 'samples': 15885312, 'steps': 82735, 'loss/train': 1.5066838264465332} 11/07/2021 08:51:51 - INFO - __main__ - Step 82737: {'lr': 0.0002143754490409261, 'samples': 15885504, 'steps': 82736, 'loss/train': 0.674815833568573} 11/07/2021 08:51:52 - INFO - __main__ - Step 82738: {'lr': 0.00021437019645960986, 'samples': 15885696, 'steps': 82737, 'loss/train': 1.3737637996673584} 11/07/2021 08:51:53 - INFO - __main__ - Step 82739: {'lr': 0.00021436494389434786, 'samples': 15885888, 'steps': 82738, 'loss/train': 1.684408187866211} 11/07/2021 08:51:53 - INFO - __main__ - Step 82740: {'lr': 0.00021435969134514244, 'samples': 15886080, 'steps': 82739, 'loss/train': 1.128886342048645} 11/07/2021 08:51:53 - INFO - __main__ - Step 82741: {'lr': 0.00021435443881199598, 'samples': 15886272, 'steps': 82740, 'loss/train': 0.8912045359611511} 11/07/2021 08:51:54 - INFO - __main__ - Step 82742: {'lr': 0.0002143491862949109, 'samples': 15886464, 'steps': 82741, 'loss/train': 1.6274001598358154} 11/07/2021 08:51:55 - INFO - __main__ - Step 82743: {'lr': 0.0002143439337938895, 'samples': 15886656, 'steps': 82742, 'loss/train': 1.4492822885513306} 11/07/2021 08:51:55 - INFO - __main__ - Step 82744: {'lr': 0.0002143386813089343, 'samples': 15886848, 'steps': 82743, 'loss/train': 0.9154844880104065} 11/07/2021 08:51:56 - INFO - __main__ - Step 82745: {'lr': 0.0002143334288400474, 'samples': 15887040, 'steps': 82744, 'loss/train': 1.9101563692092896} 11/07/2021 08:51:56 - INFO - __main__ - Step 82746: {'lr': 0.00021432817638723136, 'samples': 15887232, 'steps': 82745, 'loss/train': 1.52516770362854} 11/07/2021 08:51:56 - INFO - __main__ - Step 82747: {'lr': 0.00021432292395048846, 'samples': 15887424, 'steps': 82746, 'loss/train': 1.4135704040527344} 11/07/2021 08:51:57 - INFO - __main__ - Step 82748: {'lr': 0.0002143176715298211, 'samples': 15887616, 'steps': 82747, 'loss/train': 1.391538381576538} 11/07/2021 08:51:58 - INFO - __main__ - Step 82749: {'lr': 0.00021431241912523165, 'samples': 15887808, 'steps': 82748, 'loss/train': 1.7606536149978638} 11/07/2021 08:51:58 - INFO - __main__ - Step 82750: {'lr': 0.00021430716673672247, 'samples': 15888000, 'steps': 82749, 'loss/train': 1.4243435859680176} 11/07/2021 08:51:58 - INFO - __main__ - Step 82751: {'lr': 0.00021430191436429594, 'samples': 15888192, 'steps': 82750, 'loss/train': 1.4869717359542847} 11/07/2021 08:51:59 - INFO - __main__ - Step 82752: {'lr': 0.0002142966620079544, 'samples': 15888384, 'steps': 82751, 'loss/train': 1.5391044616699219} 11/07/2021 08:52:00 - INFO - __main__ - Step 82753: {'lr': 0.00021429140966770026, 'samples': 15888576, 'steps': 82752, 'loss/train': 1.6143991947174072} 11/07/2021 08:52:00 - INFO - __main__ - Step 82754: {'lr': 0.00021428615734353585, 'samples': 15888768, 'steps': 82753, 'loss/train': 1.5389397144317627} 11/07/2021 08:52:00 - INFO - __main__ - Step 82755: {'lr': 0.00021428090503546358, 'samples': 15888960, 'steps': 82754, 'loss/train': 1.488959550857544} 11/07/2021 08:52:01 - INFO - __main__ - Step 82756: {'lr': 0.00021427565274348575, 'samples': 15889152, 'steps': 82755, 'loss/train': 1.901084303855896} 11/07/2021 08:52:01 - INFO - __main__ - Step 82757: {'lr': 0.0002142704004676048, 'samples': 15889344, 'steps': 82756, 'loss/train': 1.448004961013794} 11/07/2021 08:52:02 - INFO - __main__ - Step 82758: {'lr': 0.000214265148207823, 'samples': 15889536, 'steps': 82757, 'loss/train': 1.6510214805603027} 11/07/2021 08:52:02 - INFO - __main__ - Step 82759: {'lr': 0.00021425989596414279, 'samples': 15889728, 'steps': 82758, 'loss/train': 1.377843976020813} 11/07/2021 08:52:03 - INFO - __main__ - Step 82760: {'lr': 0.0002142546437365665, 'samples': 15889920, 'steps': 82759, 'loss/train': 0.16070395708084106} 11/07/2021 08:52:03 - INFO - __main__ - Step 82761: {'lr': 0.00021424939152509654, 'samples': 15890112, 'steps': 82760, 'loss/train': 1.3003110885620117} 11/07/2021 08:52:03 - INFO - __main__ - Step 82762: {'lr': 0.0002142441393297352, 'samples': 15890304, 'steps': 82761, 'loss/train': 1.5301556587219238} 11/07/2021 08:52:04 - INFO - __main__ - Step 82763: {'lr': 0.00021423888715048494, 'samples': 15890496, 'steps': 82762, 'loss/train': 1.4305428266525269} 11/07/2021 08:52:05 - INFO - __main__ - Step 82764: {'lr': 0.0002142336349873481, 'samples': 15890688, 'steps': 82763, 'loss/train': 1.3677923679351807} 11/07/2021 08:52:05 - INFO - __main__ - Step 82765: {'lr': 0.00021422838284032702, 'samples': 15890880, 'steps': 82764, 'loss/train': 1.5116249322891235} 11/07/2021 08:52:05 - INFO - __main__ - Step 82766: {'lr': 0.00021422313070942412, 'samples': 15891072, 'steps': 82765, 'loss/train': 1.6535831689834595} 11/07/2021 08:52:06 - INFO - __main__ - Step 82767: {'lr': 0.0002142178785946417, 'samples': 15891264, 'steps': 82766, 'loss/train': 1.6440249681472778} 11/07/2021 08:52:06 - INFO - __main__ - Step 82768: {'lr': 0.0002142126264959821, 'samples': 15891456, 'steps': 82767, 'loss/train': 1.477770447731018} 11/07/2021 08:52:07 - INFO - __main__ - Step 82769: {'lr': 0.0002142073744134478, 'samples': 15891648, 'steps': 82768, 'loss/train': 1.7465757131576538} 11/07/2021 08:52:08 - INFO - __main__ - Step 82770: {'lr': 0.00021420212234704106, 'samples': 15891840, 'steps': 82769, 'loss/train': 1.1969530582427979} 11/07/2021 08:52:08 - INFO - __main__ - Step 82771: {'lr': 0.0002141968702967644, 'samples': 15892032, 'steps': 82770, 'loss/train': 1.5164542198181152} 11/07/2021 08:52:08 - INFO - __main__ - Step 82772: {'lr': 0.00021419161826261997, 'samples': 15892224, 'steps': 82771, 'loss/train': 1.5984655618667603} 11/07/2021 08:52:09 - INFO - __main__ - Step 82773: {'lr': 0.00021418636624461024, 'samples': 15892416, 'steps': 82772, 'loss/train': 1.3066580295562744} 11/07/2021 08:52:10 - INFO - __main__ - Step 82774: {'lr': 0.00021418111424273759, 'samples': 15892608, 'steps': 82773, 'loss/train': 1.344897747039795} 11/07/2021 08:52:10 - INFO - __main__ - Step 82775: {'lr': 0.0002141758622570044, 'samples': 15892800, 'steps': 82774, 'loss/train': 1.1447681188583374} 11/07/2021 08:52:10 - INFO - __main__ - Step 82776: {'lr': 0.00021417061028741302, 'samples': 15892992, 'steps': 82775, 'loss/train': 1.518928050994873} 11/07/2021 08:52:11 - INFO - __main__ - Step 82777: {'lr': 0.0002141653583339658, 'samples': 15893184, 'steps': 82776, 'loss/train': 0.7934789061546326} 11/07/2021 08:52:11 - INFO - __main__ - Step 82778: {'lr': 0.0002141601063966651, 'samples': 15893376, 'steps': 82777, 'loss/train': 1.2440431118011475} 11/07/2021 08:52:12 - INFO - __main__ - Step 82779: {'lr': 0.0002141548544755133, 'samples': 15893568, 'steps': 82778, 'loss/train': 1.1640405654907227} 11/07/2021 08:52:12 - INFO - __main__ - Step 82780: {'lr': 0.0002141496025705128, 'samples': 15893760, 'steps': 82779, 'loss/train': 1.7889482975006104} 11/07/2021 08:52:13 - INFO - __main__ - Step 82781: {'lr': 0.0002141443506816659, 'samples': 15893952, 'steps': 82780, 'loss/train': 1.5021179914474487} 11/07/2021 08:52:13 - INFO - __main__ - Step 82782: {'lr': 0.00021413909880897502, 'samples': 15894144, 'steps': 82781, 'loss/train': 1.4583921432495117} 11/07/2021 08:52:14 - INFO - __main__ - Step 82783: {'lr': 0.0002141338469524425, 'samples': 15894336, 'steps': 82782, 'loss/train': 1.5198349952697754} 11/07/2021 08:52:15 - INFO - __main__ - Step 82784: {'lr': 0.00021412859511207077, 'samples': 15894528, 'steps': 82783, 'loss/train': 1.6683772802352905} 11/07/2021 08:52:15 - INFO - __main__ - Step 82785: {'lr': 0.0002141233432878621, 'samples': 15894720, 'steps': 82784, 'loss/train': 1.0988445281982422} 11/07/2021 08:52:15 - INFO - __main__ - Step 82786: {'lr': 0.0002141180914798189, 'samples': 15894912, 'steps': 82785, 'loss/train': 0.9070233106613159} 11/07/2021 08:52:16 - INFO - __main__ - Step 82787: {'lr': 0.0002141128396879436, 'samples': 15895104, 'steps': 82786, 'loss/train': 0.863956093788147} 11/07/2021 08:52:16 - INFO - __main__ - Step 82788: {'lr': 0.0002141075879122384, 'samples': 15895296, 'steps': 82787, 'loss/train': 1.6270421743392944} 11/07/2021 08:52:17 - INFO - __main__ - Step 82789: {'lr': 0.0002141023361527058, 'samples': 15895488, 'steps': 82788, 'loss/train': 1.3286457061767578} 11/07/2021 08:52:17 - INFO - __main__ - Step 82790: {'lr': 0.00021409708440934812, 'samples': 15895680, 'steps': 82789, 'loss/train': 1.77594792842865} 11/07/2021 08:52:18 - INFO - __main__ - Step 82791: {'lr': 0.00021409183268216776, 'samples': 15895872, 'steps': 82790, 'loss/train': 1.2315257787704468} 11/07/2021 08:52:18 - INFO - __main__ - Step 82792: {'lr': 0.00021408658097116703, 'samples': 15896064, 'steps': 82791, 'loss/train': 1.2129948139190674} 11/07/2021 08:52:18 - INFO - __main__ - Step 82793: {'lr': 0.00021408132927634835, 'samples': 15896256, 'steps': 82792, 'loss/train': 1.5709056854248047} 11/07/2021 08:52:19 - INFO - __main__ - Step 82794: {'lr': 0.0002140760775977141, 'samples': 15896448, 'steps': 82793, 'loss/train': 1.3952656984329224} 11/07/2021 08:52:20 - INFO - __main__ - Step 82795: {'lr': 0.00021407082593526657, 'samples': 15896640, 'steps': 82794, 'loss/train': 1.335263967514038} 11/07/2021 08:52:20 - INFO - __main__ - Step 82796: {'lr': 0.00021406557428900819, 'samples': 15896832, 'steps': 82795, 'loss/train': 1.4252978563308716} 11/07/2021 08:52:21 - INFO - __main__ - Step 82797: {'lr': 0.00021406032265894128, 'samples': 15897024, 'steps': 82796, 'loss/train': 1.3900517225265503} 11/07/2021 08:52:21 - INFO - __main__ - Step 82798: {'lr': 0.00021405507104506837, 'samples': 15897216, 'steps': 82797, 'loss/train': 1.3811674118041992} 11/07/2021 08:52:21 - INFO - __main__ - Step 82799: {'lr': 0.0002140498194473916, 'samples': 15897408, 'steps': 82798, 'loss/train': 1.5189629793167114} 11/07/2021 08:52:22 - INFO - __main__ - Step 82800: {'lr': 0.00021404456786591343, 'samples': 15897600, 'steps': 82799, 'loss/train': 1.3795015811920166} 11/07/2021 08:52:23 - INFO - __main__ - Step 82801: {'lr': 0.00021403931630063617, 'samples': 15897792, 'steps': 82800, 'loss/train': 1.3598803281784058} 11/07/2021 08:52:23 - INFO - __main__ - Step 82802: {'lr': 0.00021403406475156228, 'samples': 15897984, 'steps': 82801, 'loss/train': 1.6407673358917236} 11/07/2021 08:52:23 - INFO - __main__ - Step 82803: {'lr': 0.00021402881321869408, 'samples': 15898176, 'steps': 82802, 'loss/train': 1.058691143989563} 11/07/2021 08:52:24 - INFO - __main__ - Step 82804: {'lr': 0.00021402356170203393, 'samples': 15898368, 'steps': 82803, 'loss/train': 1.4732893705368042} 11/07/2021 08:52:25 - INFO - __main__ - Step 82805: {'lr': 0.0002140183102015842, 'samples': 15898560, 'steps': 82804, 'loss/train': 1.1975328922271729} 11/07/2021 08:52:25 - INFO - __main__ - Step 82806: {'lr': 0.00021401305871734727, 'samples': 15898752, 'steps': 82805, 'loss/train': 1.444944977760315} 11/07/2021 08:52:25 - INFO - __main__ - Step 82807: {'lr': 0.00021400780724932554, 'samples': 15898944, 'steps': 82806, 'loss/train': 0.9756119251251221} 11/07/2021 08:52:26 - INFO - __main__ - Step 82808: {'lr': 0.0002140025557975213, 'samples': 15899136, 'steps': 82807, 'loss/train': 1.376913070678711} 11/07/2021 08:52:26 - INFO - __main__ - Step 82809: {'lr': 0.00021399730436193694, 'samples': 15899328, 'steps': 82808, 'loss/train': 0.916516900062561} 11/07/2021 08:52:27 - INFO - __main__ - Step 82810: {'lr': 0.00021399205294257486, 'samples': 15899520, 'steps': 82809, 'loss/train': 0.9124927520751953} 11/07/2021 08:52:28 - INFO - __main__ - Step 82811: {'lr': 0.00021398680153943752, 'samples': 15899712, 'steps': 82810, 'loss/train': 1.1888575553894043} 11/07/2021 08:52:28 - INFO - __main__ - Step 82812: {'lr': 0.00021398155015252707, 'samples': 15899904, 'steps': 82811, 'loss/train': 1.8354958295822144} 11/07/2021 08:52:28 - INFO - __main__ - Step 82813: {'lr': 0.00021397629878184594, 'samples': 15900096, 'steps': 82812, 'loss/train': 1.9845139980316162} 11/07/2021 08:52:29 - INFO - __main__ - Step 82814: {'lr': 0.00021397104742739657, 'samples': 15900288, 'steps': 82813, 'loss/train': 1.2858141660690308} 11/07/2021 08:52:29 - INFO - __main__ - Step 82815: {'lr': 0.00021396579608918127, 'samples': 15900480, 'steps': 82814, 'loss/train': 0.8698006272315979} 11/07/2021 08:52:30 - INFO - __main__ - Step 82816: {'lr': 0.00021396054476720245, 'samples': 15900672, 'steps': 82815, 'loss/train': 1.6359655857086182} 11/07/2021 08:52:30 - INFO - __main__ - Step 82817: {'lr': 0.00021395529346146243, 'samples': 15900864, 'steps': 82816, 'loss/train': 1.4236960411071777} 11/07/2021 08:52:31 - INFO - __main__ - Step 82818: {'lr': 0.0002139500421719636, 'samples': 15901056, 'steps': 82817, 'loss/train': 1.3693302869796753} 11/07/2021 08:52:31 - INFO - __main__ - Step 82819: {'lr': 0.00021394479089870832, 'samples': 15901248, 'steps': 82818, 'loss/train': 1.179842233657837} 11/07/2021 08:52:31 - INFO - __main__ - Step 82820: {'lr': 0.00021393953964169896, 'samples': 15901440, 'steps': 82819, 'loss/train': 1.1809114217758179} 11/07/2021 08:52:32 - INFO - __main__ - Step 82821: {'lr': 0.0002139342884009379, 'samples': 15901632, 'steps': 82820, 'loss/train': 1.3309239149093628} 11/07/2021 08:52:33 - INFO - __main__ - Step 82822: {'lr': 0.00021392903717642748, 'samples': 15901824, 'steps': 82821, 'loss/train': 1.1843628883361816} 11/07/2021 08:52:33 - INFO - __main__ - Step 82823: {'lr': 0.00021392378596817008, 'samples': 15902016, 'steps': 82822, 'loss/train': 0.5288830399513245} 11/07/2021 08:52:33 - INFO - __main__ - Step 82824: {'lr': 0.0002139185347761681, 'samples': 15902208, 'steps': 82823, 'loss/train': 1.093958854675293} 11/07/2021 08:52:34 - INFO - __main__ - Step 82825: {'lr': 0.00021391328360042394, 'samples': 15902400, 'steps': 82824, 'loss/train': 1.4136189222335815} 11/07/2021 08:52:35 - INFO - __main__ - Step 82826: {'lr': 0.0002139080324409398, 'samples': 15902592, 'steps': 82825, 'loss/train': 0.8757756352424622} 11/07/2021 08:52:35 - INFO - __main__ - Step 82827: {'lr': 0.00021390278129771814, 'samples': 15902784, 'steps': 82826, 'loss/train': 1.567372441291809} 11/07/2021 08:52:35 - INFO - __main__ - Step 82828: {'lr': 0.00021389753017076135, 'samples': 15902976, 'steps': 82827, 'loss/train': 1.567658543586731} 11/07/2021 08:52:36 - INFO - __main__ - Step 82829: {'lr': 0.00021389227906007174, 'samples': 15903168, 'steps': 82828, 'loss/train': 1.815155267715454} 11/07/2021 08:52:36 - INFO - __main__ - Step 82830: {'lr': 0.00021388702796565177, 'samples': 15903360, 'steps': 82829, 'loss/train': 1.1394366025924683} 11/07/2021 08:52:37 - INFO - __main__ - Step 82831: {'lr': 0.0002138817768875037, 'samples': 15903552, 'steps': 82830, 'loss/train': 1.3126152753829956} 11/07/2021 08:52:38 - INFO - __main__ - Step 82832: {'lr': 0.00021387652582562994, 'samples': 15903744, 'steps': 82831, 'loss/train': 1.4023090600967407} 11/07/2021 08:52:38 - INFO - __main__ - Step 82833: {'lr': 0.00021387127478003287, 'samples': 15903936, 'steps': 82832, 'loss/train': 1.4556524753570557} 11/07/2021 08:52:38 - INFO - __main__ - Step 82834: {'lr': 0.00021386602375071488, 'samples': 15904128, 'steps': 82833, 'loss/train': 1.7287575006484985} 11/07/2021 08:52:39 - INFO - __main__ - Step 82835: {'lr': 0.00021386077273767825, 'samples': 15904320, 'steps': 82834, 'loss/train': 1.421716332435608} 11/07/2021 08:52:39 - INFO - __main__ - Step 82836: {'lr': 0.00021385552174092544, 'samples': 15904512, 'steps': 82835, 'loss/train': 1.2345284223556519} 11/07/2021 08:52:41 - INFO - __main__ - Step 82837: {'lr': 0.00021385027076045875, 'samples': 15904704, 'steps': 82836, 'loss/train': 1.0417516231536865} 11/07/2021 08:52:41 - INFO - __main__ - Step 82838: {'lr': 0.0002138450197962807, 'samples': 15904896, 'steps': 82837, 'loss/train': 0.8023229241371155} 11/07/2021 08:52:41 - INFO - __main__ - Step 82839: {'lr': 0.0002138397688483934, 'samples': 15905088, 'steps': 82838, 'loss/train': 0.7636064887046814} 11/07/2021 08:52:42 - INFO - __main__ - Step 82840: {'lr': 0.00021383451791679933, 'samples': 15905280, 'steps': 82839, 'loss/train': 1.1593281030654907} 11/07/2021 08:52:42 - INFO - __main__ - Step 82841: {'lr': 0.00021382926700150087, 'samples': 15905472, 'steps': 82840, 'loss/train': 1.3836307525634766} 11/07/2021 08:52:42 - INFO - __main__ - Step 82842: {'lr': 0.0002138240161025004, 'samples': 15905664, 'steps': 82841, 'loss/train': 1.4420781135559082} 11/07/2021 08:52:43 - INFO - __main__ - Step 82843: {'lr': 0.0002138187652198003, 'samples': 15905856, 'steps': 82842, 'loss/train': 1.1944366693496704} 11/07/2021 08:52:44 - INFO - __main__ - Step 82844: {'lr': 0.00021381351435340284, 'samples': 15906048, 'steps': 82843, 'loss/train': 1.641129493713379} 11/07/2021 08:52:44 - INFO - __main__ - Step 82845: {'lr': 0.00021380826350331052, 'samples': 15906240, 'steps': 82844, 'loss/train': 1.6863285303115845} 11/07/2021 08:52:44 - INFO - __main__ - Step 82846: {'lr': 0.00021380301266952557, 'samples': 15906432, 'steps': 82845, 'loss/train': 0.07471131533384323} 11/07/2021 08:52:45 - INFO - __main__ - Step 82847: {'lr': 0.00021379776185205047, 'samples': 15906624, 'steps': 82846, 'loss/train': 1.1742208003997803} 11/07/2021 08:52:46 - INFO - __main__ - Step 82848: {'lr': 0.00021379251105088754, 'samples': 15906816, 'steps': 82847, 'loss/train': 1.893693208694458} 11/07/2021 08:52:46 - INFO - __main__ - Step 82849: {'lr': 0.0002137872602660391, 'samples': 15907008, 'steps': 82848, 'loss/train': 1.7910711765289307} 11/07/2021 08:52:47 - INFO - __main__ - Step 82850: {'lr': 0.0002137820094975076, 'samples': 15907200, 'steps': 82849, 'loss/train': 1.5460599660873413} 11/07/2021 08:52:47 - INFO - __main__ - Step 82851: {'lr': 0.00021377675874529537, 'samples': 15907392, 'steps': 82850, 'loss/train': 1.3010714054107666} 11/07/2021 08:52:47 - INFO - __main__ - Step 82852: {'lr': 0.00021377150800940486, 'samples': 15907584, 'steps': 82851, 'loss/train': 1.1064209938049316} 11/07/2021 08:52:48 - INFO - __main__ - Step 82853: {'lr': 0.00021376625728983828, 'samples': 15907776, 'steps': 82852, 'loss/train': 1.613031268119812} 11/07/2021 08:52:49 - INFO - __main__ - Step 82854: {'lr': 0.00021376100658659802, 'samples': 15907968, 'steps': 82853, 'loss/train': 1.6454812288284302} 11/07/2021 08:52:49 - INFO - __main__ - Step 82855: {'lr': 0.00021375575589968653, 'samples': 15908160, 'steps': 82854, 'loss/train': 1.1581329107284546} 11/07/2021 08:52:49 - INFO - __main__ - Step 82856: {'lr': 0.0002137505052291061, 'samples': 15908352, 'steps': 82855, 'loss/train': 0.8898391127586365} 11/07/2021 08:52:50 - INFO - __main__ - Step 82857: {'lr': 0.00021374525457485915, 'samples': 15908544, 'steps': 82856, 'loss/train': 1.9959588050842285} 11/07/2021 08:52:51 - INFO - __main__ - Step 82858: {'lr': 0.00021374000393694804, 'samples': 15908736, 'steps': 82857, 'loss/train': 1.3195174932479858} 11/07/2021 08:52:51 - INFO - __main__ - Step 82859: {'lr': 0.00021373475331537512, 'samples': 15908928, 'steps': 82858, 'loss/train': 1.3460018634796143} 11/07/2021 08:52:51 - INFO - __main__ - Step 82860: {'lr': 0.00021372950271014273, 'samples': 15909120, 'steps': 82859, 'loss/train': 0.10491007566452026} 11/07/2021 08:52:52 - INFO - __main__ - Step 82861: {'lr': 0.00021372425212125333, 'samples': 15909312, 'steps': 82860, 'loss/train': 1.5283737182617188} 11/07/2021 08:52:52 - INFO - __main__ - Step 82862: {'lr': 0.00021371900154870915, 'samples': 15909504, 'steps': 82861, 'loss/train': 1.329035997390747} 11/07/2021 08:52:54 - INFO - __main__ - Step 82863: {'lr': 0.00021371375099251268, 'samples': 15909696, 'steps': 82862, 'loss/train': 1.219808578491211} 11/07/2021 08:52:54 - INFO - __main__ - Step 82864: {'lr': 0.0002137085004526662, 'samples': 15909888, 'steps': 82863, 'loss/train': 1.4243643283843994} 11/07/2021 08:52:54 - INFO - __main__ - Step 82865: {'lr': 0.00021370324992917226, 'samples': 15910080, 'steps': 82864, 'loss/train': 1.507402777671814} 11/07/2021 08:52:55 - INFO - __main__ - Step 82866: {'lr': 0.00021369799942203295, 'samples': 15910272, 'steps': 82865, 'loss/train': 1.285009741783142} 11/07/2021 08:52:55 - INFO - __main__ - Step 82867: {'lr': 0.00021369274893125073, 'samples': 15910464, 'steps': 82866, 'loss/train': 0.11308404058218002} 11/07/2021 08:52:55 - INFO - __main__ - Step 82868: {'lr': 0.00021368749845682803, 'samples': 15910656, 'steps': 82867, 'loss/train': 1.6087260246276855} 11/07/2021 08:52:56 - INFO - __main__ - Step 82869: {'lr': 0.00021368224799876717, 'samples': 15910848, 'steps': 82868, 'loss/train': 1.280238151550293} 11/07/2021 08:52:57 - INFO - __main__ - Step 82870: {'lr': 0.00021367699755707055, 'samples': 15911040, 'steps': 82869, 'loss/train': 1.10280179977417} 11/07/2021 08:52:57 - INFO - __main__ - Step 82871: {'lr': 0.00021367174713174048, 'samples': 15911232, 'steps': 82870, 'loss/train': 1.5144048929214478} 11/07/2021 08:52:57 - INFO - __main__ - Step 82872: {'lr': 0.0002136664967227794, 'samples': 15911424, 'steps': 82871, 'loss/train': 0.9083629250526428} 11/07/2021 08:52:58 - INFO - __main__ - Step 82873: {'lr': 0.00021366124633018956, 'samples': 15911616, 'steps': 82872, 'loss/train': 1.6873902082443237} 11/07/2021 08:52:59 - INFO - __main__ - Step 82874: {'lr': 0.00021365599595397347, 'samples': 15911808, 'steps': 82873, 'loss/train': 0.8444439172744751} 11/07/2021 08:52:59 - INFO - __main__ - Step 82875: {'lr': 0.0002136507455941334, 'samples': 15912000, 'steps': 82874, 'loss/train': 1.3579444885253906} 11/07/2021 08:52:59 - INFO - __main__ - Step 82876: {'lr': 0.0002136454952506718, 'samples': 15912192, 'steps': 82875, 'loss/train': 1.6317554712295532} 11/07/2021 08:53:00 - INFO - __main__ - Step 82877: {'lr': 0.0002136402449235909, 'samples': 15912384, 'steps': 82876, 'loss/train': 1.307649850845337} 11/07/2021 08:53:00 - INFO - __main__ - Step 82878: {'lr': 0.0002136349946128933, 'samples': 15912576, 'steps': 82877, 'loss/train': 1.0510640144348145} 11/07/2021 08:53:01 - INFO - __main__ - Step 82879: {'lr': 0.0002136297443185811, 'samples': 15912768, 'steps': 82878, 'loss/train': 2.04874324798584} 11/07/2021 08:53:01 - INFO - __main__ - Step 82880: {'lr': 0.00021362449404065676, 'samples': 15912960, 'steps': 82879, 'loss/train': 1.3116623163223267} 11/07/2021 08:53:02 - INFO - __main__ - Step 82881: {'lr': 0.00021361924377912264, 'samples': 15913152, 'steps': 82880, 'loss/train': 1.659755825996399} 11/07/2021 08:53:02 - INFO - __main__ - Step 82882: {'lr': 0.00021361399353398116, 'samples': 15913344, 'steps': 82881, 'loss/train': 1.587425708770752} 11/07/2021 08:53:03 - INFO - __main__ - Step 82883: {'lr': 0.00021360874330523467, 'samples': 15913536, 'steps': 82882, 'loss/train': 1.742073893547058} 11/07/2021 08:53:04 - INFO - __main__ - Step 82884: {'lr': 0.00021360349309288546, 'samples': 15913728, 'steps': 82883, 'loss/train': 1.257192850112915} 11/07/2021 08:53:04 - INFO - __main__ - Step 82885: {'lr': 0.000213598242896936, 'samples': 15913920, 'steps': 82884, 'loss/train': 0.24808700382709503} 11/07/2021 08:53:04 - INFO - __main__ - Step 82886: {'lr': 0.0002135929927173886, 'samples': 15914112, 'steps': 82885, 'loss/train': 5.729852199554443} 11/07/2021 08:53:05 - INFO - __main__ - Step 82887: {'lr': 0.00021358774255424563, 'samples': 15914304, 'steps': 82886, 'loss/train': 1.7447973489761353} 11/07/2021 08:53:05 - INFO - __main__ - Step 82888: {'lr': 0.0002135824924075095, 'samples': 15914496, 'steps': 82887, 'loss/train': 1.1901764869689941} 11/07/2021 08:53:05 - INFO - __main__ - Step 82889: {'lr': 0.00021357724227718253, 'samples': 15914688, 'steps': 82888, 'loss/train': 1.2411746978759766} 11/07/2021 08:53:06 - INFO - __main__ - Step 82890: {'lr': 0.00021357199216326706, 'samples': 15914880, 'steps': 82889, 'loss/train': 1.6751869916915894} 11/07/2021 08:53:07 - INFO - __main__ - Step 82891: {'lr': 0.0002135667420657655, 'samples': 15915072, 'steps': 82890, 'loss/train': 1.3550159931182861} 11/07/2021 08:53:07 - INFO - __main__ - Step 82892: {'lr': 0.00021356149198468026, 'samples': 15915264, 'steps': 82891, 'loss/train': 1.5675801038742065} 11/07/2021 08:53:08 - INFO - __main__ - Step 82893: {'lr': 0.0002135562419200136, 'samples': 15915456, 'steps': 82892, 'loss/train': 1.4824846982955933} 11/07/2021 08:53:08 - INFO - __main__ - Step 82894: {'lr': 0.00021355099187176792, 'samples': 15915648, 'steps': 82893, 'loss/train': 1.4741499423980713} 11/07/2021 08:53:09 - INFO - __main__ - Step 82895: {'lr': 0.00021354574183994558, 'samples': 15915840, 'steps': 82894, 'loss/train': 1.8388993740081787} 11/07/2021 08:53:09 - INFO - __main__ - Step 82896: {'lr': 0.000213540491824549, 'samples': 15916032, 'steps': 82895, 'loss/train': 1.6732596158981323} 11/07/2021 08:53:10 - INFO - __main__ - Step 82897: {'lr': 0.0002135352418255805, 'samples': 15916224, 'steps': 82896, 'loss/train': 1.7132014036178589} 11/07/2021 08:53:10 - INFO - __main__ - Step 82898: {'lr': 0.00021352999184304244, 'samples': 15916416, 'steps': 82897, 'loss/train': 1.5038566589355469} 11/07/2021 08:53:10 - INFO - __main__ - Step 82899: {'lr': 0.00021352474187693723, 'samples': 15916608, 'steps': 82898, 'loss/train': 1.5323952436447144} 11/07/2021 08:53:11 - INFO - __main__ - Step 82900: {'lr': 0.00021351949192726727, 'samples': 15916800, 'steps': 82899, 'loss/train': 1.5410219430923462} 11/07/2021 08:53:12 - INFO - __main__ - Step 82901: {'lr': 0.00021351424199403477, 'samples': 15916992, 'steps': 82900, 'loss/train': 1.3479599952697754} 11/07/2021 08:53:12 - INFO - __main__ - Step 82902: {'lr': 0.00021350899207724222, 'samples': 15917184, 'steps': 82901, 'loss/train': 1.6611595153808594} 11/07/2021 08:53:12 - INFO - __main__ - Step 82903: {'lr': 0.00021350374217689194, 'samples': 15917376, 'steps': 82902, 'loss/train': 1.5237295627593994} 11/07/2021 08:53:13 - INFO - __main__ - Step 82904: {'lr': 0.0002134984922929863, 'samples': 15917568, 'steps': 82903, 'loss/train': 1.4869142770767212} 11/07/2021 08:53:14 - INFO - __main__ - Step 82905: {'lr': 0.0002134932424255278, 'samples': 15917760, 'steps': 82904, 'loss/train': 1.178855061531067} 11/07/2021 08:53:14 - INFO - __main__ - Step 82906: {'lr': 0.00021348799257451856, 'samples': 15917952, 'steps': 82905, 'loss/train': 1.3459304571151733} 11/07/2021 08:53:14 - INFO - __main__ - Step 82907: {'lr': 0.00021348274273996106, 'samples': 15918144, 'steps': 82906, 'loss/train': 1.2290962934494019} 11/07/2021 08:53:15 - INFO - __main__ - Step 82908: {'lr': 0.00021347749292185768, 'samples': 15918336, 'steps': 82907, 'loss/train': 1.52650785446167} 11/07/2021 08:53:15 - INFO - __main__ - Step 82909: {'lr': 0.00021347224312021082, 'samples': 15918528, 'steps': 82908, 'loss/train': 1.3166356086730957} 11/07/2021 08:53:16 - INFO - __main__ - Step 82910: {'lr': 0.0002134669933350228, 'samples': 15918720, 'steps': 82909, 'loss/train': 1.4122581481933594} 11/07/2021 08:53:17 - INFO - __main__ - Step 82911: {'lr': 0.000213461743566296, 'samples': 15918912, 'steps': 82910, 'loss/train': 1.0439566373825073} 11/07/2021 08:53:17 - INFO - __main__ - Step 82912: {'lr': 0.00021345649381403277, 'samples': 15919104, 'steps': 82911, 'loss/train': 1.1400636434555054} 11/07/2021 08:53:18 - INFO - __main__ - Step 82913: {'lr': 0.00021345124407823543, 'samples': 15919296, 'steps': 82912, 'loss/train': 1.452233076095581} 11/07/2021 08:53:18 - INFO - __main__ - Step 82914: {'lr': 0.0002134459943589064, 'samples': 15919488, 'steps': 82913, 'loss/train': 1.3374580144882202} 11/07/2021 08:53:18 - INFO - __main__ - Step 82915: {'lr': 0.00021344074465604808, 'samples': 15919680, 'steps': 82914, 'loss/train': 1.8868011236190796} 11/07/2021 08:53:19 - INFO - __main__ - Step 82916: {'lr': 0.00021343549496966277, 'samples': 15919872, 'steps': 82915, 'loss/train': 1.7292139530181885} 11/07/2021 08:53:20 - INFO - __main__ - Step 82917: {'lr': 0.00021343024529975286, 'samples': 15920064, 'steps': 82916, 'loss/train': 1.7011361122131348} 11/07/2021 08:53:20 - INFO - __main__ - Step 82918: {'lr': 0.00021342499564632074, 'samples': 15920256, 'steps': 82917, 'loss/train': 1.1072242259979248} 11/07/2021 08:53:20 - INFO - __main__ - Step 82919: {'lr': 0.0002134197460093688, 'samples': 15920448, 'steps': 82918, 'loss/train': 1.3282198905944824} 11/07/2021 08:53:21 - INFO - __main__ - Step 82920: {'lr': 0.00021341449638889926, 'samples': 15920640, 'steps': 82919, 'loss/train': 1.3151381015777588} 11/07/2021 08:53:21 - INFO - __main__ - Step 82921: {'lr': 0.00021340924678491462, 'samples': 15920832, 'steps': 82920, 'loss/train': 0.8348560333251953} 11/07/2021 08:53:22 - INFO - __main__ - Step 82922: {'lr': 0.00021340399719741725, 'samples': 15921024, 'steps': 82921, 'loss/train': 1.3709580898284912} 11/07/2021 08:53:22 - INFO - __main__ - Step 82923: {'lr': 0.00021339874762640946, 'samples': 15921216, 'steps': 82922, 'loss/train': 0.2319975346326828} 11/07/2021 08:53:23 - INFO - __main__ - Step 82924: {'lr': 0.0002133934980718936, 'samples': 15921408, 'steps': 82923, 'loss/train': 0.9721094369888306} 11/07/2021 08:53:23 - INFO - __main__ - Step 82925: {'lr': 0.00021338824853387207, 'samples': 15921600, 'steps': 82924, 'loss/train': 1.35724937915802} 11/07/2021 08:53:23 - INFO - __main__ - Step 82926: {'lr': 0.0002133829990123472, 'samples': 15921792, 'steps': 82925, 'loss/train': 1.1125544309616089} 11/07/2021 08:53:25 - INFO - __main__ - Step 82927: {'lr': 0.00021337774950732141, 'samples': 15921984, 'steps': 82926, 'loss/train': 1.8224024772644043} 11/07/2021 08:53:25 - INFO - __main__ - Step 82928: {'lr': 0.00021337250001879704, 'samples': 15922176, 'steps': 82927, 'loss/train': 1.3644585609436035} 11/07/2021 08:53:25 - INFO - __main__ - Step 82929: {'lr': 0.00021336725054677647, 'samples': 15922368, 'steps': 82928, 'loss/train': 1.5516806840896606} 11/07/2021 08:53:26 - INFO - __main__ - Step 82930: {'lr': 0.00021336200109126202, 'samples': 15922560, 'steps': 82929, 'loss/train': 1.420628309249878} 11/07/2021 08:53:26 - INFO - __main__ - Step 82931: {'lr': 0.00021335675165225614, 'samples': 15922752, 'steps': 82930, 'loss/train': 1.2541275024414062} 11/07/2021 08:53:26 - INFO - __main__ - Step 82932: {'lr': 0.00021335150222976114, 'samples': 15922944, 'steps': 82931, 'loss/train': 1.3614033460617065} 11/07/2021 08:53:28 - INFO - __main__ - Step 82933: {'lr': 0.0002133462528237794, 'samples': 15923136, 'steps': 82932, 'loss/train': 0.8297854065895081} 11/07/2021 08:53:28 - INFO - __main__ - Step 82934: {'lr': 0.00021334100343431322, 'samples': 15923328, 'steps': 82933, 'loss/train': 1.2496775388717651} 11/07/2021 08:53:28 - INFO - __main__ - Step 82935: {'lr': 0.00021333575406136504, 'samples': 15923520, 'steps': 82934, 'loss/train': 0.6052599549293518} 11/07/2021 08:53:29 - INFO - __main__ - Step 82936: {'lr': 0.0002133305047049372, 'samples': 15923712, 'steps': 82935, 'loss/train': 1.7900173664093018} 11/07/2021 08:53:29 - INFO - __main__ - Step 82937: {'lr': 0.00021332525536503207, 'samples': 15923904, 'steps': 82936, 'loss/train': 0.779689371585846} 11/07/2021 08:53:30 - INFO - __main__ - Step 82938: {'lr': 0.00021332000604165198, 'samples': 15924096, 'steps': 82937, 'loss/train': 1.1074659824371338} 11/07/2021 08:53:30 - INFO - __main__ - Step 82939: {'lr': 0.00021331475673479935, 'samples': 15924288, 'steps': 82938, 'loss/train': 1.509322166442871} 11/07/2021 08:53:31 - INFO - __main__ - Step 82940: {'lr': 0.00021330950744447653, 'samples': 15924480, 'steps': 82939, 'loss/train': 1.5419836044311523} 11/07/2021 08:53:31 - INFO - __main__ - Step 82941: {'lr': 0.00021330425817068588, 'samples': 15924672, 'steps': 82940, 'loss/train': 1.6699000597000122} 11/07/2021 08:53:31 - INFO - __main__ - Step 82942: {'lr': 0.00021329900891342977, 'samples': 15924864, 'steps': 82941, 'loss/train': 1.1707799434661865} 11/07/2021 08:53:32 - INFO - __main__ - Step 82943: {'lr': 0.00021329375967271054, 'samples': 15925056, 'steps': 82942, 'loss/train': 1.0133914947509766} 11/07/2021 08:53:33 - INFO - __main__ - Step 82944: {'lr': 0.0002132885104485306, 'samples': 15925248, 'steps': 82943, 'loss/train': 1.498528003692627} 11/07/2021 08:53:33 - INFO - __main__ - Step 82945: {'lr': 0.00021328326124089227, 'samples': 15925440, 'steps': 82944, 'loss/train': 1.4339463710784912} 11/07/2021 08:53:34 - INFO - __main__ - Step 82946: {'lr': 0.00021327801204979805, 'samples': 15925632, 'steps': 82945, 'loss/train': 1.446331262588501} 11/07/2021 08:53:34 - INFO - __main__ - Step 82947: {'lr': 0.0002132727628752501, 'samples': 15925824, 'steps': 82946, 'loss/train': 1.0026330947875977} 11/07/2021 08:53:35 - INFO - __main__ - Step 82948: {'lr': 0.00021326751371725084, 'samples': 15926016, 'steps': 82947, 'loss/train': 1.3972630500793457} 11/07/2021 08:53:36 - INFO - __main__ - Step 82949: {'lr': 0.0002132622645758027, 'samples': 15926208, 'steps': 82948, 'loss/train': 1.6726566553115845} 11/07/2021 08:53:36 - INFO - __main__ - Step 82950: {'lr': 0.000213257015450908, 'samples': 15926400, 'steps': 82949, 'loss/train': 0.9319494366645813} 11/07/2021 08:53:36 - INFO - __main__ - Step 82951: {'lr': 0.00021325176634256915, 'samples': 15926592, 'steps': 82950, 'loss/train': 1.4915146827697754} 11/07/2021 08:53:37 - INFO - __main__ - Step 82952: {'lr': 0.00021324651725078848, 'samples': 15926784, 'steps': 82951, 'loss/train': 1.4312231540679932} 11/07/2021 08:53:37 - INFO - __main__ - Step 82953: {'lr': 0.00021324126817556831, 'samples': 15926976, 'steps': 82952, 'loss/train': 1.5324152708053589} 11/07/2021 08:53:38 - INFO - __main__ - Step 82954: {'lr': 0.00021323601911691113, 'samples': 15927168, 'steps': 82953, 'loss/train': 0.46486809849739075} 11/07/2021 08:53:38 - INFO - __main__ - Step 82955: {'lr': 0.0002132307700748192, 'samples': 15927360, 'steps': 82954, 'loss/train': 1.801579475402832} 11/07/2021 08:53:39 - INFO - __main__ - Step 82956: {'lr': 0.0002132255210492949, 'samples': 15927552, 'steps': 82955, 'loss/train': 1.1838387250900269} 11/07/2021 08:53:39 - INFO - __main__ - Step 82957: {'lr': 0.00021322027204034063, 'samples': 15927744, 'steps': 82956, 'loss/train': 1.2343189716339111} 11/07/2021 08:53:39 - INFO - __main__ - Step 82958: {'lr': 0.00021321502304795875, 'samples': 15927936, 'steps': 82957, 'loss/train': 0.7403579950332642} 11/07/2021 08:53:40 - INFO - __main__ - Step 82959: {'lr': 0.00021320977407215168, 'samples': 15928128, 'steps': 82958, 'loss/train': 1.6346306800842285} 11/07/2021 08:53:41 - INFO - __main__ - Step 82960: {'lr': 0.00021320452511292167, 'samples': 15928320, 'steps': 82959, 'loss/train': 1.3728737831115723} 11/07/2021 08:53:41 - INFO - __main__ - Step 82961: {'lr': 0.0002131992761702711, 'samples': 15928512, 'steps': 82960, 'loss/train': 1.5513471364974976} 11/07/2021 08:53:41 - INFO - __main__ - Step 82962: {'lr': 0.00021319402724420236, 'samples': 15928704, 'steps': 82961, 'loss/train': 0.9314621090888977} 11/07/2021 08:53:42 - INFO - __main__ - Step 82963: {'lr': 0.00021318877833471784, 'samples': 15928896, 'steps': 82962, 'loss/train': 1.2848483324050903} 11/07/2021 08:53:42 - INFO - __main__ - Step 82964: {'lr': 0.0002131835294418199, 'samples': 15929088, 'steps': 82963, 'loss/train': 1.5334826707839966} 11/07/2021 08:53:43 - INFO - __main__ - Step 82965: {'lr': 0.00021317828056551086, 'samples': 15929280, 'steps': 82964, 'loss/train': 1.5232306718826294} 11/07/2021 08:53:44 - INFO - __main__ - Step 82966: {'lr': 0.00021317303170579314, 'samples': 15929472, 'steps': 82965, 'loss/train': 0.733238697052002} 11/07/2021 08:53:44 - INFO - __main__ - Step 82967: {'lr': 0.0002131677828626691, 'samples': 15929664, 'steps': 82966, 'loss/train': 2.0562052726745605} 11/07/2021 08:53:44 - INFO - __main__ - Step 82968: {'lr': 0.00021316253403614105, 'samples': 15929856, 'steps': 82967, 'loss/train': 1.9052573442459106} 11/07/2021 08:53:45 - INFO - __main__ - Step 82969: {'lr': 0.00021315728522621142, 'samples': 15930048, 'steps': 82968, 'loss/train': 2.0653374195098877} 11/07/2021 08:53:46 - INFO - __main__ - Step 82970: {'lr': 0.00021315203643288252, 'samples': 15930240, 'steps': 82969, 'loss/train': 1.303946852684021} 11/07/2021 08:53:46 - INFO - __main__ - Step 82971: {'lr': 0.00021314678765615676, 'samples': 15930432, 'steps': 82970, 'loss/train': 1.089308738708496} 11/07/2021 08:53:46 - INFO - __main__ - Step 82972: {'lr': 0.0002131415388960365, 'samples': 15930624, 'steps': 82971, 'loss/train': 1.900699496269226} 11/07/2021 08:53:47 - INFO - __main__ - Step 82973: {'lr': 0.00021313629015252419, 'samples': 15930816, 'steps': 82972, 'loss/train': 1.3655720949172974} 11/07/2021 08:53:47 - INFO - __main__ - Step 82974: {'lr': 0.000213131041425622, 'samples': 15931008, 'steps': 82973, 'loss/train': 1.5868369340896606} 11/07/2021 08:53:48 - INFO - __main__ - Step 82975: {'lr': 0.00021312579271533239, 'samples': 15931200, 'steps': 82974, 'loss/train': 0.9050278067588806} 11/07/2021 08:53:48 - INFO - __main__ - Step 82976: {'lr': 0.00021312054402165774, 'samples': 15931392, 'steps': 82975, 'loss/train': 1.293277621269226} 11/07/2021 08:53:49 - INFO - __main__ - Step 82977: {'lr': 0.0002131152953446004, 'samples': 15931584, 'steps': 82976, 'loss/train': 1.4249566793441772} 11/07/2021 08:53:49 - INFO - __main__ - Step 82978: {'lr': 0.00021311004668416272, 'samples': 15931776, 'steps': 82977, 'loss/train': 1.3958017826080322} 11/07/2021 08:53:50 - INFO - __main__ - Step 82979: {'lr': 0.00021310479804034711, 'samples': 15931968, 'steps': 82978, 'loss/train': 1.516789197921753} 11/07/2021 08:53:51 - INFO - __main__ - Step 82980: {'lr': 0.00021309954941315588, 'samples': 15932160, 'steps': 82979, 'loss/train': 1.3746492862701416} 11/07/2021 08:53:51 - INFO - __main__ - Step 82981: {'lr': 0.00021309430080259143, 'samples': 15932352, 'steps': 82980, 'loss/train': 1.496142864227295} 11/07/2021 08:53:51 - INFO - __main__ - Step 82982: {'lr': 0.00021308905220865612, 'samples': 15932544, 'steps': 82981, 'loss/train': 1.0375401973724365} 11/07/2021 08:53:52 - INFO - __main__ - Step 82983: {'lr': 0.00021308380363135233, 'samples': 15932736, 'steps': 82982, 'loss/train': 1.3278131484985352} 11/07/2021 08:53:52 - INFO - __main__ - Step 82984: {'lr': 0.00021307855507068238, 'samples': 15932928, 'steps': 82983, 'loss/train': 1.3751792907714844} 11/07/2021 08:53:53 - INFO - __main__ - Step 82985: {'lr': 0.0002130733065266487, 'samples': 15933120, 'steps': 82984, 'loss/train': 0.8402908444404602} 11/07/2021 08:53:53 - INFO - __main__ - Step 82986: {'lr': 0.0002130680579992537, 'samples': 15933312, 'steps': 82985, 'loss/train': 1.2843250036239624} 11/07/2021 08:53:54 - INFO - __main__ - Step 82987: {'lr': 0.00021306280948849953, 'samples': 15933504, 'steps': 82986, 'loss/train': 1.3449280261993408} 11/07/2021 08:53:54 - INFO - __main__ - Step 82988: {'lr': 0.00021305756099438875, 'samples': 15933696, 'steps': 82987, 'loss/train': 0.8322696089744568} 11/07/2021 08:53:55 - INFO - __main__ - Step 82989: {'lr': 0.00021305231251692364, 'samples': 15933888, 'steps': 82988, 'loss/train': 1.566329836845398} 11/07/2021 08:53:55 - INFO - __main__ - Step 82990: {'lr': 0.00021304706405610656, 'samples': 15934080, 'steps': 82989, 'loss/train': 1.7834689617156982} 11/07/2021 08:53:56 - INFO - __main__ - Step 82991: {'lr': 0.00021304181561193993, 'samples': 15934272, 'steps': 82990, 'loss/train': 1.5941563844680786} 11/07/2021 08:53:56 - INFO - __main__ - Step 82992: {'lr': 0.0002130365671844261, 'samples': 15934464, 'steps': 82991, 'loss/train': 1.875651478767395} 11/07/2021 08:53:57 - INFO - __main__ - Step 82993: {'lr': 0.00021303131877356738, 'samples': 15934656, 'steps': 82992, 'loss/train': 1.3696211576461792} 11/07/2021 08:53:57 - INFO - __main__ - Step 82994: {'lr': 0.0002130260703793662, 'samples': 15934848, 'steps': 82993, 'loss/train': 1.4857447147369385} 11/07/2021 08:53:57 - INFO - __main__ - Step 82995: {'lr': 0.00021302082200182491, 'samples': 15935040, 'steps': 82994, 'loss/train': 0.9883451461791992} 11/07/2021 08:53:59 - INFO - __main__ - Step 82996: {'lr': 0.00021301557364094588, 'samples': 15935232, 'steps': 82995, 'loss/train': 1.3516336679458618} 11/07/2021 08:53:59 - INFO - __main__ - Step 82997: {'lr': 0.00021301032529673141, 'samples': 15935424, 'steps': 82996, 'loss/train': 1.5924900770187378} 11/07/2021 08:53:59 - INFO - __main__ - Step 82998: {'lr': 0.00021300507696918398, 'samples': 15935616, 'steps': 82997, 'loss/train': 1.6833003759384155} 11/07/2021 08:54:00 - INFO - __main__ - Step 82999: {'lr': 0.00021299982865830583, 'samples': 15935808, 'steps': 82998, 'loss/train': 1.6279245615005493} 11/07/2021 08:54:00 - INFO - __main__ - Step 83000: {'lr': 0.0002129945803640995, 'samples': 15936000, 'steps': 82999, 'loss/train': 1.5734821557998657} 11/07/2021 08:54:01 - INFO - __main__ - Step 83001: {'lr': 0.00021298933208656717, 'samples': 15936192, 'steps': 83000, 'loss/train': 1.2168976068496704} 11/07/2021 08:54:01 - INFO - __main__ - Step 83002: {'lr': 0.00021298408382571128, 'samples': 15936384, 'steps': 83001, 'loss/train': 0.6668559312820435} 11/07/2021 08:54:02 - INFO - __main__ - Step 83003: {'lr': 0.0002129788355815342, 'samples': 15936576, 'steps': 83002, 'loss/train': 0.7128962874412537} 11/07/2021 08:54:02 - INFO - __main__ - Step 83004: {'lr': 0.00021297358735403824, 'samples': 15936768, 'steps': 83003, 'loss/train': 1.4292856454849243} 11/07/2021 08:54:02 - INFO - __main__ - Step 83005: {'lr': 0.00021296833914322583, 'samples': 15936960, 'steps': 83004, 'loss/train': 1.4610044956207275} 11/07/2021 08:54:03 - INFO - __main__ - Step 83006: {'lr': 0.0002129630909490993, 'samples': 15937152, 'steps': 83005, 'loss/train': 1.178308367729187} 11/07/2021 08:54:04 - INFO - __main__ - Step 83007: {'lr': 0.00021295784277166105, 'samples': 15937344, 'steps': 83006, 'loss/train': 2.232886552810669} 11/07/2021 08:54:04 - INFO - __main__ - Step 83008: {'lr': 0.00021295259461091343, 'samples': 15937536, 'steps': 83007, 'loss/train': 1.7456659078598022} 11/07/2021 08:54:04 - INFO - __main__ - Step 83009: {'lr': 0.0002129473464668588, 'samples': 15937728, 'steps': 83008, 'loss/train': 1.8539913892745972} 11/07/2021 08:54:05 - INFO - __main__ - Step 83010: {'lr': 0.00021294209833949948, 'samples': 15937920, 'steps': 83009, 'loss/train': 1.1954654455184937} 11/07/2021 08:54:05 - INFO - __main__ - Step 83011: {'lr': 0.0002129368502288379, 'samples': 15938112, 'steps': 83010, 'loss/train': 1.3598002195358276} 11/07/2021 08:54:06 - INFO - __main__ - Step 83012: {'lr': 0.00021293160213487644, 'samples': 15938304, 'steps': 83011, 'loss/train': 1.7709189653396606} 11/07/2021 08:54:07 - INFO - __main__ - Step 83013: {'lr': 0.0002129263540576175, 'samples': 15938496, 'steps': 83012, 'loss/train': 1.7032772302627563} 11/07/2021 08:54:07 - INFO - __main__ - Step 83014: {'lr': 0.00021292110599706326, 'samples': 15938688, 'steps': 83013, 'loss/train': 1.1579567193984985} 11/07/2021 08:54:07 - INFO - __main__ - Step 83015: {'lr': 0.00021291585795321622, 'samples': 15938880, 'steps': 83014, 'loss/train': 1.3217731714248657} 11/07/2021 08:54:08 - INFO - __main__ - Step 83016: {'lr': 0.0002129106099260787, 'samples': 15939072, 'steps': 83015, 'loss/train': 1.4589412212371826} 11/07/2021 08:54:09 - INFO - __main__ - Step 83017: {'lr': 0.00021290536191565312, 'samples': 15939264, 'steps': 83016, 'loss/train': 1.4238793849945068} 11/07/2021 08:54:09 - INFO - __main__ - Step 83018: {'lr': 0.0002129001139219418, 'samples': 15939456, 'steps': 83017, 'loss/train': 1.396759271621704} 11/07/2021 08:54:09 - INFO - __main__ - Step 83019: {'lr': 0.0002128948659449471, 'samples': 15939648, 'steps': 83018, 'loss/train': 1.3901067972183228} 11/07/2021 08:54:10 - INFO - __main__ - Step 83020: {'lr': 0.0002128896179846714, 'samples': 15939840, 'steps': 83019, 'loss/train': 1.199028730392456} 11/07/2021 08:54:10 - INFO - __main__ - Step 83021: {'lr': 0.0002128843700411171, 'samples': 15940032, 'steps': 83020, 'loss/train': 1.5691940784454346} 11/07/2021 08:54:11 - INFO - __main__ - Step 83022: {'lr': 0.0002128791221142865, 'samples': 15940224, 'steps': 83021, 'loss/train': 1.3836830854415894} 11/07/2021 08:54:11 - INFO - __main__ - Step 83023: {'lr': 0.00021287387420418206, 'samples': 15940416, 'steps': 83022, 'loss/train': 1.4726742506027222} 11/07/2021 08:54:12 - INFO - __main__ - Step 83024: {'lr': 0.000212868626310806, 'samples': 15940608, 'steps': 83023, 'loss/train': 1.3992334604263306} 11/07/2021 08:54:12 - INFO - __main__ - Step 83025: {'lr': 0.00021286337843416078, 'samples': 15940800, 'steps': 83024, 'loss/train': 1.415687918663025} 11/07/2021 08:54:12 - INFO - __main__ - Step 83026: {'lr': 0.0002128581305742488, 'samples': 15940992, 'steps': 83025, 'loss/train': 1.2343642711639404} 11/07/2021 08:54:13 - INFO - __main__ - Step 83027: {'lr': 0.00021285288273107235, 'samples': 15941184, 'steps': 83026, 'loss/train': 1.356139063835144} 11/07/2021 08:54:14 - INFO - __main__ - Step 83028: {'lr': 0.00021284763490463378, 'samples': 15941376, 'steps': 83027, 'loss/train': 1.3804460763931274} 11/07/2021 08:54:14 - INFO - __main__ - Step 83029: {'lr': 0.0002128423870949355, 'samples': 15941568, 'steps': 83028, 'loss/train': 1.3702670335769653} 11/07/2021 08:54:14 - INFO - __main__ - Step 83030: {'lr': 0.00021283713930197987, 'samples': 15941760, 'steps': 83029, 'loss/train': 1.4473607540130615} 11/07/2021 08:54:15 - INFO - __main__ - Step 83031: {'lr': 0.00021283189152576927, 'samples': 15941952, 'steps': 83030, 'loss/train': 1.5771772861480713} 11/07/2021 08:54:15 - INFO - __main__ - Step 83032: {'lr': 0.000212826643766306, 'samples': 15942144, 'steps': 83031, 'loss/train': 1.51813805103302} 11/07/2021 08:54:16 - INFO - __main__ - Step 83033: {'lr': 0.00021282139602359253, 'samples': 15942336, 'steps': 83032, 'loss/train': 1.4845898151397705} 11/07/2021 08:54:17 - INFO - __main__ - Step 83034: {'lr': 0.00021281614829763118, 'samples': 15942528, 'steps': 83033, 'loss/train': 1.034505009651184} 11/07/2021 08:54:17 - INFO - __main__ - Step 83035: {'lr': 0.00021281090058842425, 'samples': 15942720, 'steps': 83034, 'loss/train': 1.2283055782318115} 11/07/2021 08:54:17 - INFO - __main__ - Step 83036: {'lr': 0.00021280565289597418, 'samples': 15942912, 'steps': 83035, 'loss/train': 1.1293939352035522} 11/07/2021 08:54:18 - INFO - __main__ - Step 83037: {'lr': 0.00021280040522028327, 'samples': 15943104, 'steps': 83036, 'loss/train': 0.6549732089042664} 11/07/2021 08:54:19 - INFO - __main__ - Step 83038: {'lr': 0.00021279515756135396, 'samples': 15943296, 'steps': 83037, 'loss/train': 1.4445931911468506} 11/07/2021 08:54:19 - INFO - __main__ - Step 83039: {'lr': 0.00021278990991918857, 'samples': 15943488, 'steps': 83038, 'loss/train': 1.408880591392517} 11/07/2021 08:54:19 - INFO - __main__ - Step 83040: {'lr': 0.00021278466229378951, 'samples': 15943680, 'steps': 83039, 'loss/train': 0.6358308792114258} 11/07/2021 08:54:20 - INFO - __main__ - Step 83041: {'lr': 0.00021277941468515906, 'samples': 15943872, 'steps': 83040, 'loss/train': 1.2144601345062256} 11/07/2021 08:54:20 - INFO - __main__ - Step 83042: {'lr': 0.0002127741670932996, 'samples': 15944064, 'steps': 83041, 'loss/train': 1.18519127368927} 11/07/2021 08:54:21 - INFO - __main__ - Step 83043: {'lr': 0.00021276891951821359, 'samples': 15944256, 'steps': 83042, 'loss/train': 1.0617836713790894} 11/07/2021 08:54:21 - INFO - __main__ - Step 83044: {'lr': 0.00021276367195990328, 'samples': 15944448, 'steps': 83043, 'loss/train': 1.1079087257385254} 11/07/2021 08:54:22 - INFO - __main__ - Step 83045: {'lr': 0.00021275842441837115, 'samples': 15944640, 'steps': 83044, 'loss/train': 1.3530020713806152} 11/07/2021 08:54:22 - INFO - __main__ - Step 83046: {'lr': 0.00021275317689361945, 'samples': 15944832, 'steps': 83045, 'loss/train': 2.5179388523101807} 11/07/2021 08:54:23 - INFO - __main__ - Step 83047: {'lr': 0.0002127479293856506, 'samples': 15945024, 'steps': 83046, 'loss/train': 1.4507946968078613} 11/07/2021 08:54:24 - INFO - __main__ - Step 83048: {'lr': 0.00021274268189446695, 'samples': 15945216, 'steps': 83047, 'loss/train': 1.4457151889801025} 11/07/2021 08:54:24 - INFO - __main__ - Step 83049: {'lr': 0.00021273743442007089, 'samples': 15945408, 'steps': 83048, 'loss/train': 1.1216251850128174} 11/07/2021 08:54:24 - INFO - __main__ - Step 83050: {'lr': 0.00021273218696246475, 'samples': 15945600, 'steps': 83049, 'loss/train': 1.318080186843872} 11/07/2021 08:54:25 - INFO - __main__ - Step 83051: {'lr': 0.0002127269395216509, 'samples': 15945792, 'steps': 83050, 'loss/train': 4.4670090675354} 11/07/2021 08:54:25 - INFO - __main__ - Step 83052: {'lr': 0.00021272169209763173, 'samples': 15945984, 'steps': 83051, 'loss/train': 1.532947301864624} 11/07/2021 08:54:25 - INFO - __main__ - Step 83053: {'lr': 0.00021271644469040966, 'samples': 15946176, 'steps': 83052, 'loss/train': 1.0138952732086182} 11/07/2021 08:54:27 - INFO - __main__ - Step 83054: {'lr': 0.0002127111972999869, 'samples': 15946368, 'steps': 83053, 'loss/train': 1.3368873596191406} 11/07/2021 08:54:27 - INFO - __main__ - Step 83055: {'lr': 0.0002127059499263659, 'samples': 15946560, 'steps': 83054, 'loss/train': 1.6013405323028564} 11/07/2021 08:54:27 - INFO - __main__ - Step 83056: {'lr': 0.0002127007025695491, 'samples': 15946752, 'steps': 83055, 'loss/train': 2.1428191661834717} 11/07/2021 08:54:28 - INFO - __main__ - Step 83057: {'lr': 0.00021269545522953874, 'samples': 15946944, 'steps': 83056, 'loss/train': 1.2680658102035522} 11/07/2021 08:54:28 - INFO - __main__ - Step 83058: {'lr': 0.0002126902079063372, 'samples': 15947136, 'steps': 83057, 'loss/train': 1.6048269271850586} 11/07/2021 08:54:29 - INFO - __main__ - Step 83059: {'lr': 0.0002126849605999469, 'samples': 15947328, 'steps': 83058, 'loss/train': 1.0738879442214966} 11/07/2021 08:54:29 - INFO - __main__ - Step 83060: {'lr': 0.00021267971331037018, 'samples': 15947520, 'steps': 83059, 'loss/train': 1.442766547203064} 11/07/2021 08:54:30 - INFO - __main__ - Step 83061: {'lr': 0.0002126744660376094, 'samples': 15947712, 'steps': 83060, 'loss/train': 1.437110185623169} 11/07/2021 08:54:30 - INFO - __main__ - Step 83062: {'lr': 0.00021266921878166693, 'samples': 15947904, 'steps': 83061, 'loss/train': 1.5651453733444214} 11/07/2021 08:54:30 - INFO - __main__ - Step 83063: {'lr': 0.00021266397154254512, 'samples': 15948096, 'steps': 83062, 'loss/train': 1.4265732765197754} 11/07/2021 08:54:31 - INFO - __main__ - Step 83064: {'lr': 0.0002126587243202464, 'samples': 15948288, 'steps': 83063, 'loss/train': 1.3125122785568237} 11/07/2021 08:54:32 - INFO - __main__ - Step 83065: {'lr': 0.00021265347711477302, 'samples': 15948480, 'steps': 83064, 'loss/train': 1.6505398750305176} 11/07/2021 08:54:32 - INFO - __main__ - Step 83066: {'lr': 0.00021264822992612741, 'samples': 15948672, 'steps': 83065, 'loss/train': 1.4645313024520874} 11/07/2021 08:54:32 - INFO - __main__ - Step 83067: {'lr': 0.0002126429827543121, 'samples': 15948864, 'steps': 83066, 'loss/train': 1.425595998764038} 11/07/2021 08:54:33 - INFO - __main__ - Step 83068: {'lr': 0.00021263773559932915, 'samples': 15949056, 'steps': 83067, 'loss/train': 1.3263300657272339} 11/07/2021 08:54:34 - INFO - __main__ - Step 83069: {'lr': 0.00021263248846118101, 'samples': 15949248, 'steps': 83068, 'loss/train': 1.38229238986969} 11/07/2021 08:54:34 - INFO - __main__ - Step 83070: {'lr': 0.00021262724133987016, 'samples': 15949440, 'steps': 83069, 'loss/train': 1.1479581594467163} 11/07/2021 08:54:34 - INFO - __main__ - Step 83071: {'lr': 0.00021262199423539884, 'samples': 15949632, 'steps': 83070, 'loss/train': 0.8108075857162476} 11/07/2021 08:54:35 - INFO - __main__ - Step 83072: {'lr': 0.00021261674714776951, 'samples': 15949824, 'steps': 83071, 'loss/train': 1.5783058404922485} 11/07/2021 08:54:35 - INFO - __main__ - Step 83073: {'lr': 0.0002126115000769845, 'samples': 15950016, 'steps': 83072, 'loss/train': 1.292910099029541} 11/07/2021 08:54:36 - INFO - __main__ - Step 83074: {'lr': 0.00021260625302304615, 'samples': 15950208, 'steps': 83073, 'loss/train': 1.4119499921798706} 11/07/2021 08:54:36 - INFO - __main__ - Step 83075: {'lr': 0.00021260100598595688, 'samples': 15950400, 'steps': 83074, 'loss/train': 1.4264771938323975} 11/07/2021 08:54:37 - INFO - __main__ - Step 83076: {'lr': 0.000212595758965719, 'samples': 15950592, 'steps': 83075, 'loss/train': 1.4361679553985596} 11/07/2021 08:54:37 - INFO - __main__ - Step 83077: {'lr': 0.00021259051196233485, 'samples': 15950784, 'steps': 83076, 'loss/train': 2.064507484436035} 11/07/2021 08:54:38 - INFO - __main__ - Step 83078: {'lr': 0.00021258526497580692, 'samples': 15950976, 'steps': 83077, 'loss/train': 1.5217922925949097} 11/07/2021 08:54:39 - INFO - __main__ - Step 83079: {'lr': 0.00021258001800613743, 'samples': 15951168, 'steps': 83078, 'loss/train': 1.349166750907898} 11/07/2021 08:54:39 - INFO - __main__ - Step 83080: {'lr': 0.00021257477105332895, 'samples': 15951360, 'steps': 83079, 'loss/train': 1.6997226476669312} 11/07/2021 08:54:39 - INFO - __main__ - Step 83081: {'lr': 0.00021256952411738357, 'samples': 15951552, 'steps': 83080, 'loss/train': 1.396453619003296} 11/07/2021 08:54:40 - INFO - __main__ - Step 83082: {'lr': 0.0002125642771983038, 'samples': 15951744, 'steps': 83081, 'loss/train': 1.7520577907562256} 11/07/2021 08:54:40 - INFO - __main__ - Step 83083: {'lr': 0.00021255903029609197, 'samples': 15951936, 'steps': 83082, 'loss/train': 1.365059733390808} 11/07/2021 08:54:41 - INFO - __main__ - Step 83084: {'lr': 0.00021255378341075048, 'samples': 15952128, 'steps': 83083, 'loss/train': 1.8239349126815796} 11/07/2021 08:54:41 - INFO - __main__ - Step 83085: {'lr': 0.00021254853654228167, 'samples': 15952320, 'steps': 83084, 'loss/train': 1.3452365398406982} 11/07/2021 08:54:42 - INFO - __main__ - Step 83086: {'lr': 0.00021254328969068793, 'samples': 15952512, 'steps': 83085, 'loss/train': 1.385663390159607} 11/07/2021 08:54:42 - INFO - __main__ - Step 83087: {'lr': 0.00021253804285597156, 'samples': 15952704, 'steps': 83086, 'loss/train': 1.5410774946212769} 11/07/2021 08:54:42 - INFO - __main__ - Step 83088: {'lr': 0.00021253279603813502, 'samples': 15952896, 'steps': 83087, 'loss/train': 1.7399224042892456} 11/07/2021 08:54:43 - INFO - __main__ - Step 83089: {'lr': 0.0002125275492371806, 'samples': 15953088, 'steps': 83088, 'loss/train': 1.597059726715088} 11/07/2021 08:54:44 - INFO - __main__ - Step 83090: {'lr': 0.0002125223024531107, 'samples': 15953280, 'steps': 83089, 'loss/train': 1.5465501546859741} 11/07/2021 08:54:44 - INFO - __main__ - Step 83091: {'lr': 0.00021251705568592767, 'samples': 15953472, 'steps': 83090, 'loss/train': 1.162074089050293} 11/07/2021 08:54:44 - INFO - __main__ - Step 83092: {'lr': 0.00021251180893563384, 'samples': 15953664, 'steps': 83091, 'loss/train': 1.0551055669784546} 11/07/2021 08:54:45 - INFO - __main__ - Step 83093: {'lr': 0.00021250656220223163, 'samples': 15953856, 'steps': 83092, 'loss/train': 1.5652679204940796} 11/07/2021 08:54:45 - INFO - __main__ - Step 83094: {'lr': 0.00021250131548572351, 'samples': 15954048, 'steps': 83093, 'loss/train': 1.4998122453689575} 11/07/2021 08:54:46 - INFO - __main__ - Step 83095: {'lr': 0.0002124960687861116, 'samples': 15954240, 'steps': 83094, 'loss/train': 1.377756953239441} 11/07/2021 08:54:47 - INFO - __main__ - Step 83096: {'lr': 0.0002124908221033984, 'samples': 15954432, 'steps': 83095, 'loss/train': 1.2456079721450806} 11/07/2021 08:54:47 - INFO - __main__ - Step 83097: {'lr': 0.00021248557543758622, 'samples': 15954624, 'steps': 83096, 'loss/train': 1.589415431022644} 11/07/2021 08:54:47 - INFO - __main__ - Step 83098: {'lr': 0.00021248032878867752, 'samples': 15954816, 'steps': 83097, 'loss/train': 1.827432632446289} 11/07/2021 08:54:48 - INFO - __main__ - Step 83099: {'lr': 0.00021247508215667456, 'samples': 15955008, 'steps': 83098, 'loss/train': 1.280060887336731} 11/07/2021 08:54:49 - INFO - __main__ - Step 83100: {'lr': 0.00021246983554157976, 'samples': 15955200, 'steps': 83099, 'loss/train': 1.495246410369873} 11/07/2021 08:54:49 - INFO - __main__ - Step 83101: {'lr': 0.00021246458894339545, 'samples': 15955392, 'steps': 83100, 'loss/train': 1.1954243183135986} 11/07/2021 08:54:49 - INFO - __main__ - Step 83102: {'lr': 0.00021245934236212405, 'samples': 15955584, 'steps': 83101, 'loss/train': 1.2976564168930054} 11/07/2021 08:54:50 - INFO - __main__ - Step 83103: {'lr': 0.00021245409579776785, 'samples': 15955776, 'steps': 83102, 'loss/train': 1.4586740732192993} 11/07/2021 08:54:50 - INFO - __main__ - Step 83104: {'lr': 0.0002124488492503293, 'samples': 15955968, 'steps': 83103, 'loss/train': 1.3212870359420776} 11/07/2021 08:54:51 - INFO - __main__ - Step 83105: {'lr': 0.00021244360271981073, 'samples': 15956160, 'steps': 83104, 'loss/train': 1.1631128787994385} 11/07/2021 08:54:52 - INFO - __main__ - Step 83106: {'lr': 0.00021243835620621444, 'samples': 15956352, 'steps': 83105, 'loss/train': 1.654597520828247} 11/07/2021 08:54:52 - INFO - __main__ - Step 83107: {'lr': 0.00021243310970954298, 'samples': 15956544, 'steps': 83106, 'loss/train': 1.5792824029922485} 11/07/2021 08:54:52 - INFO - __main__ - Step 83108: {'lr': 0.0002124278632297985, 'samples': 15956736, 'steps': 83107, 'loss/train': 1.5776135921478271} 11/07/2021 08:54:53 - INFO - __main__ - Step 83109: {'lr': 0.0002124226167669834, 'samples': 15956928, 'steps': 83108, 'loss/train': 1.7160780429840088} 11/07/2021 08:54:54 - INFO - __main__ - Step 83110: {'lr': 0.00021241737032110013, 'samples': 15957120, 'steps': 83109, 'loss/train': 1.3451975584030151} 11/07/2021 08:54:54 - INFO - __main__ - Step 83111: {'lr': 0.00021241212389215097, 'samples': 15957312, 'steps': 83110, 'loss/train': 1.6832993030548096} 11/07/2021 08:54:54 - INFO - __main__ - Step 83112: {'lr': 0.00021240687748013835, 'samples': 15957504, 'steps': 83111, 'loss/train': 1.4156371355056763} 11/07/2021 08:54:55 - INFO - __main__ - Step 83113: {'lr': 0.0002124016310850646, 'samples': 15957696, 'steps': 83112, 'loss/train': 1.5315613746643066} 11/07/2021 08:54:55 - INFO - __main__ - Step 83114: {'lr': 0.0002123963847069321, 'samples': 15957888, 'steps': 83113, 'loss/train': 1.5366908311843872} 11/07/2021 08:54:56 - INFO - __main__ - Step 83115: {'lr': 0.00021239113834574323, 'samples': 15958080, 'steps': 83114, 'loss/train': 1.5658212900161743} 11/07/2021 08:54:56 - INFO - __main__ - Step 83116: {'lr': 0.00021238589200150033, 'samples': 15958272, 'steps': 83115, 'loss/train': 1.2542399168014526} 11/07/2021 08:54:57 - INFO - __main__ - Step 83117: {'lr': 0.00021238064567420572, 'samples': 15958464, 'steps': 83116, 'loss/train': 1.3431178331375122} 11/07/2021 08:54:57 - INFO - __main__ - Step 83118: {'lr': 0.00021237539936386186, 'samples': 15958656, 'steps': 83117, 'loss/train': 1.6207233667373657} 11/07/2021 08:54:58 - INFO - __main__ - Step 83119: {'lr': 0.00021237015307047104, 'samples': 15958848, 'steps': 83118, 'loss/train': 1.5209319591522217} 11/07/2021 08:54:58 - INFO - __main__ - Step 83120: {'lr': 0.00021236490679403563, 'samples': 15959040, 'steps': 83119, 'loss/train': 0.3974725008010864} 11/07/2021 08:54:59 - INFO - __main__ - Step 83121: {'lr': 0.0002123596605345582, 'samples': 15959232, 'steps': 83120, 'loss/train': 0.5940276980400085} 11/07/2021 08:54:59 - INFO - __main__ - Step 83122: {'lr': 0.00021235441429204072, 'samples': 15959424, 'steps': 83121, 'loss/train': 1.5330896377563477} 11/07/2021 08:55:00 - INFO - __main__ - Step 83123: {'lr': 0.00021234916806648583, 'samples': 15959616, 'steps': 83122, 'loss/train': 1.4225155115127563} 11/07/2021 08:55:00 - INFO - __main__ - Step 83124: {'lr': 0.00021234392185789577, 'samples': 15959808, 'steps': 83123, 'loss/train': 1.1065521240234375} 11/07/2021 08:55:00 - INFO - __main__ - Step 83125: {'lr': 0.00021233867566627302, 'samples': 15960000, 'steps': 83124, 'loss/train': 1.5213972330093384} 11/07/2021 08:55:01 - INFO - __main__ - Step 83126: {'lr': 0.00021233342949161983, 'samples': 15960192, 'steps': 83125, 'loss/train': 1.5350936651229858} 11/07/2021 08:55:02 - INFO - __main__ - Step 83127: {'lr': 0.00021232818333393862, 'samples': 15960384, 'steps': 83126, 'loss/train': 1.220457911491394} 11/07/2021 08:55:02 - INFO - __main__ - Step 83128: {'lr': 0.00021232293719323177, 'samples': 15960576, 'steps': 83127, 'loss/train': 1.4497697353363037} 11/07/2021 08:55:02 - INFO - __main__ - Step 83129: {'lr': 0.0002123176910695016, 'samples': 15960768, 'steps': 83128, 'loss/train': 0.9919561743736267} 11/07/2021 08:55:03 - INFO - __main__ - Step 83130: {'lr': 0.00021231244496275055, 'samples': 15960960, 'steps': 83129, 'loss/train': 1.4500949382781982} 11/07/2021 08:55:04 - INFO - __main__ - Step 83131: {'lr': 0.00021230719887298087, 'samples': 15961152, 'steps': 83130, 'loss/train': 1.3594791889190674} 11/07/2021 08:55:04 - INFO - __main__ - Step 83132: {'lr': 0.00021230195280019502, 'samples': 15961344, 'steps': 83131, 'loss/train': 1.5264201164245605} 11/07/2021 08:55:05 - INFO - __main__ - Step 83133: {'lr': 0.0002122967067443953, 'samples': 15961536, 'steps': 83132, 'loss/train': 1.7700964212417603} 11/07/2021 08:55:05 - INFO - __main__ - Step 83134: {'lr': 0.00021229146070558423, 'samples': 15961728, 'steps': 83133, 'loss/train': 1.5362130403518677} 11/07/2021 08:55:05 - INFO - __main__ - Step 83135: {'lr': 0.00021228621468376394, 'samples': 15961920, 'steps': 83134, 'loss/train': 1.3393474817276} 11/07/2021 08:55:06 - INFO - __main__ - Step 83136: {'lr': 0.00021228096867893686, 'samples': 15962112, 'steps': 83135, 'loss/train': 1.483100175857544} 11/07/2021 08:55:07 - INFO - __main__ - Step 83137: {'lr': 0.00021227572269110544, 'samples': 15962304, 'steps': 83136, 'loss/train': 1.517601490020752} 11/07/2021 08:55:07 - INFO - __main__ - Step 83138: {'lr': 0.000212270476720272, 'samples': 15962496, 'steps': 83137, 'loss/train': 1.542493224143982} 11/07/2021 08:55:07 - INFO - __main__ - Step 83139: {'lr': 0.0002122652307664389, 'samples': 15962688, 'steps': 83138, 'loss/train': 1.5511261224746704} 11/07/2021 08:55:08 - INFO - __main__ - Step 83140: {'lr': 0.00021225998482960845, 'samples': 15962880, 'steps': 83139, 'loss/train': 1.5264017581939697} 11/07/2021 08:55:08 - INFO - __main__ - Step 83141: {'lr': 0.00021225473890978315, 'samples': 15963072, 'steps': 83140, 'loss/train': 1.5518029928207397} 11/07/2021 08:55:09 - INFO - __main__ - Step 83142: {'lr': 0.00021224949300696522, 'samples': 15963264, 'steps': 83141, 'loss/train': 1.473220705986023} 11/07/2021 08:55:09 - INFO - __main__ - Step 83143: {'lr': 0.00021224424712115708, 'samples': 15963456, 'steps': 83142, 'loss/train': 1.1009246110916138} 11/07/2021 08:55:10 - INFO - __main__ - Step 83144: {'lr': 0.00021223900125236114, 'samples': 15963648, 'steps': 83143, 'loss/train': 1.1191191673278809} 11/07/2021 08:55:10 - INFO - __main__ - Step 83145: {'lr': 0.00021223375540057972, 'samples': 15963840, 'steps': 83144, 'loss/train': 1.0139341354370117} 11/07/2021 08:55:10 - INFO - __main__ - Step 83146: {'lr': 0.00021222850956581518, 'samples': 15964032, 'steps': 83145, 'loss/train': 1.3774056434631348} 11/07/2021 08:55:12 - INFO - __main__ - Step 83147: {'lr': 0.00021222326374807, 'samples': 15964224, 'steps': 83146, 'loss/train': 1.4654414653778076} 11/07/2021 08:55:12 - INFO - __main__ - Step 83148: {'lr': 0.0002122180179473463, 'samples': 15964416, 'steps': 83147, 'loss/train': 1.3308273553848267} 11/07/2021 08:55:12 - INFO - __main__ - Step 83149: {'lr': 0.0002122127721636466, 'samples': 15964608, 'steps': 83148, 'loss/train': 1.4034326076507568} 11/07/2021 08:55:13 - INFO - __main__ - Step 83150: {'lr': 0.00021220752639697325, 'samples': 15964800, 'steps': 83149, 'loss/train': 5.7197184562683105} 11/07/2021 08:55:13 - INFO - __main__ - Step 83151: {'lr': 0.0002122022806473286, 'samples': 15964992, 'steps': 83150, 'loss/train': 1.2546863555908203} 11/07/2021 08:55:14 - INFO - __main__ - Step 83152: {'lr': 0.00021219703491471501, 'samples': 15965184, 'steps': 83151, 'loss/train': 1.6108133792877197} 11/07/2021 08:55:14 - INFO - __main__ - Step 83153: {'lr': 0.00021219178919913484, 'samples': 15965376, 'steps': 83152, 'loss/train': 1.5120749473571777} 11/07/2021 08:55:15 - INFO - __main__ - Step 83154: {'lr': 0.00021218654350059048, 'samples': 15965568, 'steps': 83153, 'loss/train': 1.7369511127471924} 11/07/2021 08:55:15 - INFO - __main__ - Step 83155: {'lr': 0.0002121812978190843, 'samples': 15965760, 'steps': 83154, 'loss/train': 1.4460220336914062} 11/07/2021 08:55:15 - INFO - __main__ - Step 83156: {'lr': 0.00021217605215461863, 'samples': 15965952, 'steps': 83155, 'loss/train': 1.5675697326660156} 11/07/2021 08:55:16 - INFO - __main__ - Step 83157: {'lr': 0.00021217080650719582, 'samples': 15966144, 'steps': 83156, 'loss/train': 1.623565673828125} 11/07/2021 08:55:17 - INFO - __main__ - Step 83158: {'lr': 0.00021216556087681838, 'samples': 15966336, 'steps': 83157, 'loss/train': 1.6422138214111328} 11/07/2021 08:55:18 - INFO - __main__ - Step 83159: {'lr': 0.00021216031526348844, 'samples': 15966528, 'steps': 83158, 'loss/train': 1.4573988914489746} 11/07/2021 08:55:18 - INFO - __main__ - Step 83160: {'lr': 0.00021215506966720853, 'samples': 15966720, 'steps': 83159, 'loss/train': 1.0165588855743408} 11/07/2021 08:55:18 - INFO - __main__ - Step 83161: {'lr': 0.00021214982408798098, 'samples': 15966912, 'steps': 83160, 'loss/train': 1.4756511449813843} 11/07/2021 08:55:19 - INFO - __main__ - Step 83162: {'lr': 0.00021214457852580806, 'samples': 15967104, 'steps': 83161, 'loss/train': 1.4411842823028564} 11/07/2021 08:55:20 - INFO - __main__ - Step 83163: {'lr': 0.00021213933298069225, 'samples': 15967296, 'steps': 83162, 'loss/train': 1.3705345392227173} 11/07/2021 08:55:20 - INFO - __main__ - Step 83164: {'lr': 0.00021213408745263584, 'samples': 15967488, 'steps': 83163, 'loss/train': 0.9349966049194336} 11/07/2021 08:55:20 - INFO - __main__ - Step 83165: {'lr': 0.00021212884194164126, 'samples': 15967680, 'steps': 83164, 'loss/train': 1.4022438526153564} 11/07/2021 08:55:21 - INFO - __main__ - Step 83166: {'lr': 0.00021212359644771082, 'samples': 15967872, 'steps': 83165, 'loss/train': 1.6382954120635986} 11/07/2021 08:55:21 - INFO - __main__ - Step 83167: {'lr': 0.0002121183509708469, 'samples': 15968064, 'steps': 83166, 'loss/train': 1.1403731107711792} 11/07/2021 08:55:21 - INFO - __main__ - Step 83168: {'lr': 0.00021211310551105187, 'samples': 15968256, 'steps': 83167, 'loss/train': 1.338265299797058} 11/07/2021 08:55:22 - INFO - __main__ - Step 83169: {'lr': 0.00021210786006832817, 'samples': 15968448, 'steps': 83168, 'loss/train': 0.12473930418491364} 11/07/2021 08:55:23 - INFO - __main__ - Step 83170: {'lr': 0.000212102614642678, 'samples': 15968640, 'steps': 83169, 'loss/train': 0.6183196902275085} 11/07/2021 08:55:23 - INFO - __main__ - Step 83171: {'lr': 0.0002120973692341038, 'samples': 15968832, 'steps': 83170, 'loss/train': 0.9985255599021912} 11/07/2021 08:55:23 - INFO - __main__ - Step 83172: {'lr': 0.00021209212384260795, 'samples': 15969024, 'steps': 83171, 'loss/train': 1.1746658086776733} 11/07/2021 08:55:24 - INFO - __main__ - Step 83173: {'lr': 0.0002120868784681928, 'samples': 15969216, 'steps': 83172, 'loss/train': 1.160849690437317} 11/07/2021 08:55:26 - INFO - __main__ - Step 83174: {'lr': 0.00021208163311086078, 'samples': 15969408, 'steps': 83173, 'loss/train': 0.7026991844177246} 11/07/2021 08:55:26 - INFO - __main__ - Step 83175: {'lr': 0.00021207638777061413, 'samples': 15969600, 'steps': 83174, 'loss/train': 1.633721947669983} 11/07/2021 08:55:26 - INFO - __main__ - Step 83176: {'lr': 0.0002120711424474553, 'samples': 15969792, 'steps': 83175, 'loss/train': 1.6189020872116089} 11/07/2021 08:55:27 - INFO - __main__ - Step 83177: {'lr': 0.0002120658971413866, 'samples': 15969984, 'steps': 83176, 'loss/train': 1.4441410303115845} 11/07/2021 08:55:27 - INFO - __main__ - Step 83178: {'lr': 0.0002120606518524104, 'samples': 15970176, 'steps': 83177, 'loss/train': 1.7531095743179321} 11/07/2021 08:55:27 - INFO - __main__ - Step 83179: {'lr': 0.00021205540658052912, 'samples': 15970368, 'steps': 83178, 'loss/train': 1.7724515199661255} 11/07/2021 08:55:28 - INFO - __main__ - Step 83180: {'lr': 0.00021205016132574517, 'samples': 15970560, 'steps': 83179, 'loss/train': 0.8873422145843506} 11/07/2021 08:55:29 - INFO - __main__ - Step 83181: {'lr': 0.00021204491608806073, 'samples': 15970752, 'steps': 83180, 'loss/train': 1.523728609085083} 11/07/2021 08:55:29 - INFO - __main__ - Step 83182: {'lr': 0.00021203967086747826, 'samples': 15970944, 'steps': 83181, 'loss/train': 1.0951120853424072} 11/07/2021 08:55:30 - INFO - __main__ - Step 83183: {'lr': 0.00021203442566400016, 'samples': 15971136, 'steps': 83182, 'loss/train': 1.1803972721099854} 11/07/2021 08:55:30 - INFO - __main__ - Step 83184: {'lr': 0.00021202918047762874, 'samples': 15971328, 'steps': 83183, 'loss/train': 1.4762126207351685} 11/07/2021 08:55:30 - INFO - __main__ - Step 83185: {'lr': 0.00021202393530836641, 'samples': 15971520, 'steps': 83184, 'loss/train': 0.5752042531967163} 11/07/2021 08:55:31 - INFO - __main__ - Step 83186: {'lr': 0.0002120186901562155, 'samples': 15971712, 'steps': 83185, 'loss/train': 1.5797386169433594} 11/07/2021 08:55:31 - INFO - __main__ - Step 83187: {'lr': 0.00021201344502117837, 'samples': 15971904, 'steps': 83186, 'loss/train': 1.4186441898345947} 11/07/2021 08:55:32 - INFO - __main__ - Step 83188: {'lr': 0.00021200819990325746, 'samples': 15972096, 'steps': 83187, 'loss/train': 0.568461537361145} 11/07/2021 08:55:32 - INFO - __main__ - Step 83189: {'lr': 0.00021200295480245502, 'samples': 15972288, 'steps': 83188, 'loss/train': 0.9486547112464905} 11/07/2021 08:55:33 - INFO - __main__ - Step 83190: {'lr': 0.00021199770971877345, 'samples': 15972480, 'steps': 83189, 'loss/train': 1.4257153272628784} 11/07/2021 08:55:34 - INFO - __main__ - Step 83191: {'lr': 0.00021199246465221515, 'samples': 15972672, 'steps': 83190, 'loss/train': 1.4346961975097656} 11/07/2021 08:55:34 - INFO - __main__ - Step 83192: {'lr': 0.00021198721960278245, 'samples': 15972864, 'steps': 83191, 'loss/train': 1.817556381225586} 11/07/2021 08:55:34 - INFO - __main__ - Step 83193: {'lr': 0.0002119819745704777, 'samples': 15973056, 'steps': 83192, 'loss/train': 1.966058611869812} 11/07/2021 08:55:35 - INFO - __main__ - Step 83194: {'lr': 0.0002119767295553033, 'samples': 15973248, 'steps': 83193, 'loss/train': 1.3428928852081299} 11/07/2021 08:55:35 - INFO - __main__ - Step 83195: {'lr': 0.00021197148455726162, 'samples': 15973440, 'steps': 83194, 'loss/train': 1.2307020425796509} 11/07/2021 08:55:36 - INFO - __main__ - Step 83196: {'lr': 0.00021196623957635497, 'samples': 15973632, 'steps': 83195, 'loss/train': 1.267531394958496} 11/07/2021 08:55:36 - INFO - __main__ - Step 83197: {'lr': 0.00021196099461258576, 'samples': 15973824, 'steps': 83196, 'loss/train': 1.556112289428711} 11/07/2021 08:55:37 - INFO - __main__ - Step 83198: {'lr': 0.00021195574966595632, 'samples': 15974016, 'steps': 83197, 'loss/train': 1.4427413940429688} 11/07/2021 08:55:37 - INFO - __main__ - Step 83199: {'lr': 0.00021195050473646904, 'samples': 15974208, 'steps': 83198, 'loss/train': 1.3152685165405273} 11/07/2021 08:55:37 - INFO - __main__ - Step 83200: {'lr': 0.00021194525982412628, 'samples': 15974400, 'steps': 83199, 'loss/train': 2.0935373306274414} 11/07/2021 08:55:38 - INFO - __main__ - Step 83201: {'lr': 0.0002119400149289305, 'samples': 15974592, 'steps': 83200, 'loss/train': 1.0782129764556885} 11/07/2021 08:55:39 - INFO - __main__ - Step 83202: {'lr': 0.00021193477005088386, 'samples': 15974784, 'steps': 83201, 'loss/train': 1.9926780462265015} 11/07/2021 08:55:39 - INFO - __main__ - Step 83203: {'lr': 0.00021192952518998883, 'samples': 15974976, 'steps': 83202, 'loss/train': 1.7861075401306152} 11/07/2021 08:55:40 - INFO - __main__ - Step 83204: {'lr': 0.00021192428034624776, 'samples': 15975168, 'steps': 83203, 'loss/train': 1.0071985721588135} 11/07/2021 08:55:40 - INFO - __main__ - Step 83205: {'lr': 0.000211919035519663, 'samples': 15975360, 'steps': 83204, 'loss/train': 1.2852228879928589} 11/07/2021 08:55:40 - INFO - __main__ - Step 83206: {'lr': 0.00021191379071023697, 'samples': 15975552, 'steps': 83205, 'loss/train': 1.1823320388793945} 11/07/2021 08:55:42 - INFO - __main__ - Step 83207: {'lr': 0.00021190854591797198, 'samples': 15975744, 'steps': 83206, 'loss/train': 1.611666202545166} 11/07/2021 08:55:42 - INFO - __main__ - Step 83208: {'lr': 0.00021190330114287043, 'samples': 15975936, 'steps': 83207, 'loss/train': 1.9884495735168457} 11/07/2021 08:55:43 - INFO - __main__ - Step 83209: {'lr': 0.00021189805638493464, 'samples': 15976128, 'steps': 83208, 'loss/train': 1.4454609155654907} 11/07/2021 08:55:43 - INFO - __main__ - Step 83210: {'lr': 0.000211892811644167, 'samples': 15976320, 'steps': 83209, 'loss/train': 1.46965491771698} 11/07/2021 08:55:43 - INFO - __main__ - Step 83211: {'lr': 0.0002118875669205699, 'samples': 15976512, 'steps': 83210, 'loss/train': 0.83240807056427} 11/07/2021 08:55:44 - INFO - __main__ - Step 83212: {'lr': 0.00021188232221414565, 'samples': 15976704, 'steps': 83211, 'loss/train': 0.9158499836921692} 11/07/2021 08:55:45 - INFO - __main__ - Step 83213: {'lr': 0.00021187707752489665, 'samples': 15976896, 'steps': 83212, 'loss/train': 0.4215027093887329} 11/07/2021 08:55:45 - INFO - __main__ - Step 83214: {'lr': 0.00021187183285282523, 'samples': 15977088, 'steps': 83213, 'loss/train': 2.953139543533325} 11/07/2021 08:55:45 - INFO - __main__ - Step 83215: {'lr': 0.00021186658819793392, 'samples': 15977280, 'steps': 83214, 'loss/train': 0.9622189998626709} 11/07/2021 08:55:46 - INFO - __main__ - Step 83216: {'lr': 0.0002118613435602248, 'samples': 15977472, 'steps': 83215, 'loss/train': 1.2137945890426636} 11/07/2021 08:55:46 - INFO - __main__ - Step 83217: {'lr': 0.00021185609893970036, 'samples': 15977664, 'steps': 83216, 'loss/train': 1.7741776704788208} 11/07/2021 08:55:46 - INFO - __main__ - Step 83218: {'lr': 0.000211850854336363, 'samples': 15977856, 'steps': 83217, 'loss/train': 1.8370214700698853} 11/07/2021 08:55:48 - INFO - __main__ - Step 83219: {'lr': 0.00021184560975021506, 'samples': 15978048, 'steps': 83218, 'loss/train': 1.4027482271194458} 11/07/2021 08:55:48 - INFO - __main__ - Step 83220: {'lr': 0.00021184036518125888, 'samples': 15978240, 'steps': 83219, 'loss/train': 1.7137888669967651} 11/07/2021 08:55:48 - INFO - __main__ - Step 83221: {'lr': 0.00021183512062949682, 'samples': 15978432, 'steps': 83220, 'loss/train': 0.9923030734062195} 11/07/2021 08:55:49 - INFO - __main__ - Step 83222: {'lr': 0.00021182987609493132, 'samples': 15978624, 'steps': 83221, 'loss/train': 0.7891316413879395} 11/07/2021 08:55:49 - INFO - __main__ - Step 83223: {'lr': 0.00021182463157756466, 'samples': 15978816, 'steps': 83222, 'loss/train': 1.3603626489639282} 11/07/2021 08:55:50 - INFO - __main__ - Step 83224: {'lr': 0.00021181938707739924, 'samples': 15979008, 'steps': 83223, 'loss/train': 0.8427649140357971} 11/07/2021 08:55:50 - INFO - __main__ - Step 83225: {'lr': 0.0002118141425944374, 'samples': 15979200, 'steps': 83224, 'loss/train': 1.3526803255081177} 11/07/2021 08:55:51 - INFO - __main__ - Step 83226: {'lr': 0.00021180889812868155, 'samples': 15979392, 'steps': 83225, 'loss/train': 1.2312235832214355} 11/07/2021 08:55:51 - INFO - __main__ - Step 83227: {'lr': 0.000211803653680134, 'samples': 15979584, 'steps': 83226, 'loss/train': 1.427273154258728} 11/07/2021 08:55:51 - INFO - __main__ - Step 83228: {'lr': 0.00021179840924879723, 'samples': 15979776, 'steps': 83227, 'loss/train': 1.282185673713684} 11/07/2021 08:55:52 - INFO - __main__ - Step 83229: {'lr': 0.0002117931648346734, 'samples': 15979968, 'steps': 83228, 'loss/train': 1.1111574172973633} 11/07/2021 08:55:53 - INFO - __main__ - Step 83230: {'lr': 0.000211787920437765, 'samples': 15980160, 'steps': 83229, 'loss/train': 1.392897129058838} 11/07/2021 08:55:53 - INFO - __main__ - Step 83231: {'lr': 0.00021178267605807436, 'samples': 15980352, 'steps': 83230, 'loss/train': 1.402983546257019} 11/07/2021 08:55:53 - INFO - __main__ - Step 83232: {'lr': 0.00021177743169560384, 'samples': 15980544, 'steps': 83231, 'loss/train': 1.20995032787323} 11/07/2021 08:55:54 - INFO - __main__ - Step 83233: {'lr': 0.00021177218735035587, 'samples': 15980736, 'steps': 83232, 'loss/train': 1.69191575050354} 11/07/2021 08:55:54 - INFO - __main__ - Step 83234: {'lr': 0.0002117669430223327, 'samples': 15980928, 'steps': 83233, 'loss/train': 1.061666488647461} 11/07/2021 08:55:55 - INFO - __main__ - Step 83235: {'lr': 0.0002117616987115368, 'samples': 15981120, 'steps': 83234, 'loss/train': 1.3020817041397095} 11/07/2021 08:55:56 - INFO - __main__ - Step 83236: {'lr': 0.00021175645441797047, 'samples': 15981312, 'steps': 83235, 'loss/train': 1.0322692394256592} 11/07/2021 08:55:56 - INFO - __main__ - Step 83237: {'lr': 0.0002117512101416361, 'samples': 15981504, 'steps': 83236, 'loss/train': 1.4856520891189575} 11/07/2021 08:55:56 - INFO - __main__ - Step 83238: {'lr': 0.00021174596588253603, 'samples': 15981696, 'steps': 83237, 'loss/train': 1.6768252849578857} 11/07/2021 08:55:57 - INFO - __main__ - Step 83239: {'lr': 0.00021174072164067265, 'samples': 15981888, 'steps': 83238, 'loss/train': 1.3104288578033447} 11/07/2021 08:55:58 - INFO - __main__ - Step 83240: {'lr': 0.00021173547741604832, 'samples': 15982080, 'steps': 83239, 'loss/train': 1.356255054473877} 11/07/2021 08:55:58 - INFO - __main__ - Step 83241: {'lr': 0.00021173023320866539, 'samples': 15982272, 'steps': 83240, 'loss/train': 1.416607141494751} 11/07/2021 08:55:58 - INFO - __main__ - Step 83242: {'lr': 0.00021172498901852633, 'samples': 15982464, 'steps': 83241, 'loss/train': 1.3878520727157593} 11/07/2021 08:55:59 - INFO - __main__ - Step 83243: {'lr': 0.00021171974484563327, 'samples': 15982656, 'steps': 83242, 'loss/train': 1.08922278881073} 11/07/2021 08:55:59 - INFO - __main__ - Step 83244: {'lr': 0.0002117145006899887, 'samples': 15982848, 'steps': 83243, 'loss/train': 1.163310170173645} 11/07/2021 08:56:00 - INFO - __main__ - Step 83245: {'lr': 0.00021170925655159505, 'samples': 15983040, 'steps': 83244, 'loss/train': 1.3027540445327759} 11/07/2021 08:56:00 - INFO - __main__ - Step 83246: {'lr': 0.00021170401243045457, 'samples': 15983232, 'steps': 83245, 'loss/train': 1.319145679473877} 11/07/2021 08:56:01 - INFO - __main__ - Step 83247: {'lr': 0.00021169876832656965, 'samples': 15983424, 'steps': 83246, 'loss/train': 1.531407356262207} 11/07/2021 08:56:01 - INFO - __main__ - Step 83248: {'lr': 0.0002116935242399427, 'samples': 15983616, 'steps': 83247, 'loss/train': 1.5469825267791748} 11/07/2021 08:56:02 - INFO - __main__ - Step 83249: {'lr': 0.00021168828017057605, 'samples': 15983808, 'steps': 83248, 'loss/train': 1.6223937273025513} 11/07/2021 08:56:02 - INFO - __main__ - Step 83250: {'lr': 0.0002116830361184721, 'samples': 15984000, 'steps': 83249, 'loss/train': 1.788224697113037} 11/07/2021 08:56:03 - INFO - __main__ - Step 83251: {'lr': 0.00021167779208363313, 'samples': 15984192, 'steps': 83250, 'loss/train': 1.6097074747085571} 11/07/2021 08:56:03 - INFO - __main__ - Step 83252: {'lr': 0.0002116725480660616, 'samples': 15984384, 'steps': 83251, 'loss/train': 1.4753044843673706} 11/07/2021 08:56:04 - INFO - __main__ - Step 83253: {'lr': 0.0002116673040657598, 'samples': 15984576, 'steps': 83252, 'loss/train': 1.2268121242523193} 11/07/2021 08:56:04 - INFO - __main__ - Step 83254: {'lr': 0.00021166206008273015, 'samples': 15984768, 'steps': 83253, 'loss/train': 1.7177486419677734} 11/07/2021 08:56:04 - INFO - __main__ - Step 83255: {'lr': 0.00021165681611697506, 'samples': 15984960, 'steps': 83254, 'loss/train': 1.5054033994674683} 11/07/2021 08:56:06 - INFO - __main__ - Step 83256: {'lr': 0.00021165157216849673, 'samples': 15985152, 'steps': 83255, 'loss/train': 2.0880682468414307} 11/07/2021 08:56:06 - INFO - __main__ - Step 83257: {'lr': 0.00021164632823729763, 'samples': 15985344, 'steps': 83256, 'loss/train': 1.5752499103546143} 11/07/2021 08:56:06 - INFO - __main__ - Step 83258: {'lr': 0.00021164108432338005, 'samples': 15985536, 'steps': 83257, 'loss/train': 1.1696264743804932} 11/07/2021 08:56:07 - INFO - __main__ - Step 83259: {'lr': 0.00021163584042674643, 'samples': 15985728, 'steps': 83258, 'loss/train': 1.606964349746704} 11/07/2021 08:56:07 - INFO - __main__ - Step 83260: {'lr': 0.00021163059654739913, 'samples': 15985920, 'steps': 83259, 'loss/train': 1.5329041481018066} 11/07/2021 08:56:08 - INFO - __main__ - Step 83261: {'lr': 0.00021162535268534047, 'samples': 15986112, 'steps': 83260, 'loss/train': 1.1448386907577515} 11/07/2021 08:56:08 - INFO - __main__ - Step 83262: {'lr': 0.00021162010884057285, 'samples': 15986304, 'steps': 83261, 'loss/train': 1.591734766960144} 11/07/2021 08:56:09 - INFO - __main__ - Step 83263: {'lr': 0.00021161486501309858, 'samples': 15986496, 'steps': 83262, 'loss/train': 1.1316134929656982} 11/07/2021 08:56:09 - INFO - __main__ - Step 83264: {'lr': 0.0002116096212029201, 'samples': 15986688, 'steps': 83263, 'loss/train': 1.5618021488189697} 11/07/2021 08:56:09 - INFO - __main__ - Step 83265: {'lr': 0.00021160437741003972, 'samples': 15986880, 'steps': 83264, 'loss/train': 0.9991094470024109} 11/07/2021 08:56:10 - INFO - __main__ - Step 83266: {'lr': 0.00021159913363445979, 'samples': 15987072, 'steps': 83265, 'loss/train': 1.0251260995864868} 11/07/2021 08:56:11 - INFO - __main__ - Step 83267: {'lr': 0.00021159388987618272, 'samples': 15987264, 'steps': 83266, 'loss/train': 1.2591004371643066} 11/07/2021 08:56:11 - INFO - __main__ - Step 83268: {'lr': 0.00021158864613521095, 'samples': 15987456, 'steps': 83267, 'loss/train': 1.1865975856781006} 11/07/2021 08:56:11 - INFO - __main__ - Step 83269: {'lr': 0.00021158340241154663, 'samples': 15987648, 'steps': 83268, 'loss/train': 1.319443702697754} 11/07/2021 08:56:12 - INFO - __main__ - Step 83270: {'lr': 0.00021157815870519227, 'samples': 15987840, 'steps': 83269, 'loss/train': 0.9793227314949036} 11/07/2021 08:56:12 - INFO - __main__ - Step 83271: {'lr': 0.00021157291501615016, 'samples': 15988032, 'steps': 83270, 'loss/train': 1.7460414171218872} 11/07/2021 08:56:13 - INFO - __main__ - Step 83272: {'lr': 0.00021156767134442272, 'samples': 15988224, 'steps': 83271, 'loss/train': 0.9785990715026855} 11/07/2021 08:56:14 - INFO - __main__ - Step 83273: {'lr': 0.0002115624276900123, 'samples': 15988416, 'steps': 83272, 'loss/train': 1.591993808746338} 11/07/2021 08:56:14 - INFO - __main__ - Step 83274: {'lr': 0.00021155718405292123, 'samples': 15988608, 'steps': 83273, 'loss/train': 1.603302001953125} 11/07/2021 08:56:14 - INFO - __main__ - Step 83275: {'lr': 0.00021155194043315193, 'samples': 15988800, 'steps': 83274, 'loss/train': 0.2953245937824249} 11/07/2021 08:56:15 - INFO - __main__ - Step 83276: {'lr': 0.00021154669683070673, 'samples': 15988992, 'steps': 83275, 'loss/train': 1.0644950866699219} 11/07/2021 08:56:16 - INFO - __main__ - Step 83277: {'lr': 0.000211541453245588, 'samples': 15989184, 'steps': 83276, 'loss/train': 1.225883960723877} 11/07/2021 08:56:16 - INFO - __main__ - Step 83278: {'lr': 0.00021153620967779806, 'samples': 15989376, 'steps': 83277, 'loss/train': 1.586089015007019} 11/07/2021 08:56:16 - INFO - __main__ - Step 83279: {'lr': 0.00021153096612733935, 'samples': 15989568, 'steps': 83278, 'loss/train': 1.6209444999694824} 11/07/2021 08:56:17 - INFO - __main__ - Step 83280: {'lr': 0.00021152572259421415, 'samples': 15989760, 'steps': 83279, 'loss/train': 0.8968253135681152} 11/07/2021 08:56:17 - INFO - __main__ - Step 83281: {'lr': 0.00021152047907842497, 'samples': 15989952, 'steps': 83280, 'loss/train': 1.016717553138733} 11/07/2021 08:56:18 - INFO - __main__ - Step 83282: {'lr': 0.00021151523557997405, 'samples': 15990144, 'steps': 83281, 'loss/train': 1.5953892469406128} 11/07/2021 08:56:19 - INFO - __main__ - Step 83283: {'lr': 0.0002115099920988637, 'samples': 15990336, 'steps': 83282, 'loss/train': 0.8330215215682983} 11/07/2021 08:56:19 - INFO - __main__ - Step 83284: {'lr': 0.00021150474863509635, 'samples': 15990528, 'steps': 83283, 'loss/train': 1.3105312585830688} 11/07/2021 08:56:19 - INFO - __main__ - Step 83285: {'lr': 0.0002114995051886744, 'samples': 15990720, 'steps': 83284, 'loss/train': 1.044884204864502} 11/07/2021 08:56:20 - INFO - __main__ - Step 83286: {'lr': 0.00021149426175960017, 'samples': 15990912, 'steps': 83285, 'loss/train': 1.6663355827331543} 11/07/2021 08:56:21 - INFO - __main__ - Step 83287: {'lr': 0.00021148901834787601, 'samples': 15991104, 'steps': 83286, 'loss/train': 1.510914921760559} 11/07/2021 08:56:21 - INFO - __main__ - Step 83288: {'lr': 0.00021148377495350433, 'samples': 15991296, 'steps': 83287, 'loss/train': 1.119842767715454} 11/07/2021 08:56:21 - INFO - __main__ - Step 83289: {'lr': 0.00021147853157648744, 'samples': 15991488, 'steps': 83288, 'loss/train': 1.7724864482879639} 11/07/2021 08:56:22 - INFO - __main__ - Step 83290: {'lr': 0.00021147328821682776, 'samples': 15991680, 'steps': 83289, 'loss/train': 1.4016841650009155} 11/07/2021 08:56:22 - INFO - __main__ - Step 83291: {'lr': 0.0002114680448745276, 'samples': 15991872, 'steps': 83290, 'loss/train': 1.1693896055221558} 11/07/2021 08:56:23 - INFO - __main__ - Step 83292: {'lr': 0.00021146280154958939, 'samples': 15992064, 'steps': 83291, 'loss/train': 1.3385727405548096} 11/07/2021 08:56:23 - INFO - __main__ - Step 83293: {'lr': 0.0002114575582420154, 'samples': 15992256, 'steps': 83292, 'loss/train': 1.3949005603790283} 11/07/2021 08:56:24 - INFO - __main__ - Step 83294: {'lr': 0.00021145231495180806, 'samples': 15992448, 'steps': 83293, 'loss/train': 1.2665486335754395} 11/07/2021 08:56:24 - INFO - __main__ - Step 83295: {'lr': 0.00021144707167896975, 'samples': 15992640, 'steps': 83294, 'loss/train': 0.8554737567901611} 11/07/2021 08:56:24 - INFO - __main__ - Step 83296: {'lr': 0.00021144182842350274, 'samples': 15992832, 'steps': 83295, 'loss/train': 1.626944661140442} 11/07/2021 08:56:25 - INFO - __main__ - Step 83297: {'lr': 0.00021143658518540945, 'samples': 15993024, 'steps': 83296, 'loss/train': 1.5875986814498901} 11/07/2021 08:56:26 - INFO - __main__ - Step 83298: {'lr': 0.00021143134196469223, 'samples': 15993216, 'steps': 83297, 'loss/train': 1.1747392416000366} 11/07/2021 08:56:26 - INFO - __main__ - Step 83299: {'lr': 0.00021142609876135347, 'samples': 15993408, 'steps': 83298, 'loss/train': 1.3142411708831787} 11/07/2021 08:56:27 - INFO - __main__ - Step 83300: {'lr': 0.0002114208555753955, 'samples': 15993600, 'steps': 83299, 'loss/train': 1.2683542966842651} 11/07/2021 08:56:27 - INFO - __main__ - Step 83301: {'lr': 0.00021141561240682068, 'samples': 15993792, 'steps': 83300, 'loss/train': 0.777286171913147} 11/07/2021 08:56:27 - INFO - __main__ - Step 83302: {'lr': 0.00021141036925563145, 'samples': 15993984, 'steps': 83301, 'loss/train': 1.3953053951263428} 11/07/2021 08:56:28 - INFO - __main__ - Step 83303: {'lr': 0.00021140512612183012, 'samples': 15994176, 'steps': 83302, 'loss/train': 1.223080039024353} 11/07/2021 08:56:29 - INFO - __main__ - Step 83304: {'lr': 0.00021139988300541897, 'samples': 15994368, 'steps': 83303, 'loss/train': 1.232917070388794} 11/07/2021 08:56:29 - INFO - __main__ - Step 83305: {'lr': 0.0002113946399064005, 'samples': 15994560, 'steps': 83304, 'loss/train': 1.157072901725769} 11/07/2021 08:56:29 - INFO - __main__ - Step 83306: {'lr': 0.00021138939682477698, 'samples': 15994752, 'steps': 83305, 'loss/train': 1.5020670890808105} 11/07/2021 08:56:30 - INFO - __main__ - Step 83307: {'lr': 0.0002113841537605508, 'samples': 15994944, 'steps': 83306, 'loss/train': 1.1637519598007202} 11/07/2021 08:56:31 - INFO - __main__ - Step 83308: {'lr': 0.00021137891071372432, 'samples': 15995136, 'steps': 83307, 'loss/train': 1.1072921752929688} 11/07/2021 08:56:31 - INFO - __main__ - Step 83309: {'lr': 0.00021137366768429994, 'samples': 15995328, 'steps': 83308, 'loss/train': 1.5891664028167725} 11/07/2021 08:56:31 - INFO - __main__ - Step 83310: {'lr': 0.00021136842467227995, 'samples': 15995520, 'steps': 83309, 'loss/train': 1.5219218730926514} 11/07/2021 08:56:32 - INFO - __main__ - Step 83311: {'lr': 0.00021136318167766678, 'samples': 15995712, 'steps': 83310, 'loss/train': 1.7637544870376587} 11/07/2021 08:56:32 - INFO - __main__ - Step 83312: {'lr': 0.00021135793870046275, 'samples': 15995904, 'steps': 83311, 'loss/train': 1.4476454257965088} 11/07/2021 08:56:33 - INFO - __main__ - Step 83313: {'lr': 0.00021135269574067023, 'samples': 15996096, 'steps': 83312, 'loss/train': 1.0372350215911865} 11/07/2021 08:56:33 - INFO - __main__ - Step 83314: {'lr': 0.00021134745279829164, 'samples': 15996288, 'steps': 83313, 'loss/train': 1.3685963153839111} 11/07/2021 08:56:34 - INFO - __main__ - Step 83315: {'lr': 0.00021134220987332924, 'samples': 15996480, 'steps': 83314, 'loss/train': 1.4459056854248047} 11/07/2021 08:56:34 - INFO - __main__ - Step 83316: {'lr': 0.00021133696696578545, 'samples': 15996672, 'steps': 83315, 'loss/train': 1.8209038972854614} 11/07/2021 08:56:35 - INFO - __main__ - Step 83317: {'lr': 0.00021133172407566261, 'samples': 15996864, 'steps': 83316, 'loss/train': 1.6111400127410889} 11/07/2021 08:56:35 - INFO - __main__ - Step 83318: {'lr': 0.0002113264812029631, 'samples': 15997056, 'steps': 83317, 'loss/train': 1.4026800394058228} 11/07/2021 08:56:36 - INFO - __main__ - Step 83319: {'lr': 0.0002113212383476893, 'samples': 15997248, 'steps': 83318, 'loss/train': 1.0307908058166504} 11/07/2021 08:56:36 - INFO - __main__ - Step 83320: {'lr': 0.00021131599550984354, 'samples': 15997440, 'steps': 83319, 'loss/train': 1.3521126508712769} 11/07/2021 08:56:36 - INFO - __main__ - Step 83321: {'lr': 0.0002113107526894282, 'samples': 15997632, 'steps': 83320, 'loss/train': 1.1317954063415527} 11/07/2021 08:56:37 - INFO - __main__ - Step 83322: {'lr': 0.0002113055098864457, 'samples': 15997824, 'steps': 83321, 'loss/train': 1.7021816968917847} 11/07/2021 08:56:38 - INFO - __main__ - Step 83323: {'lr': 0.00021130026710089827, 'samples': 15998016, 'steps': 83322, 'loss/train': 1.1306196451187134} 11/07/2021 08:56:38 - INFO - __main__ - Step 83324: {'lr': 0.00021129502433278835, 'samples': 15998208, 'steps': 83323, 'loss/train': 1.5081478357315063} 11/07/2021 08:56:39 - INFO - __main__ - Step 83325: {'lr': 0.00021128978158211834, 'samples': 15998400, 'steps': 83324, 'loss/train': 0.8866333961486816} 11/07/2021 08:56:39 - INFO - __main__ - Step 83326: {'lr': 0.0002112845388488905, 'samples': 15998592, 'steps': 83325, 'loss/train': 1.5150878429412842} 11/07/2021 08:56:39 - INFO - __main__ - Step 83327: {'lr': 0.00021127929613310725, 'samples': 15998784, 'steps': 83326, 'loss/train': 1.8637477159500122} 11/07/2021 08:56:40 - INFO - __main__ - Step 83328: {'lr': 0.000211274053434771, 'samples': 15998976, 'steps': 83327, 'loss/train': 1.2927945852279663} 11/07/2021 08:56:41 - INFO - __main__ - Step 83329: {'lr': 0.00021126881075388403, 'samples': 15999168, 'steps': 83328, 'loss/train': 0.5891136527061462} 11/07/2021 08:56:41 - INFO - __main__ - Step 83330: {'lr': 0.00021126356809044873, 'samples': 15999360, 'steps': 83329, 'loss/train': 2.0784943103790283} 11/07/2021 08:56:41 - INFO - __main__ - Step 83331: {'lr': 0.00021125832544446744, 'samples': 15999552, 'steps': 83330, 'loss/train': 1.1865962743759155} 11/07/2021 08:56:42 - INFO - __main__ - Step 83332: {'lr': 0.0002112530828159426, 'samples': 15999744, 'steps': 83331, 'loss/train': 1.2070759534835815} 11/07/2021 08:56:42 - INFO - __main__ - Step 83333: {'lr': 0.0002112478402048765, 'samples': 15999936, 'steps': 83332, 'loss/train': 1.4726951122283936} 11/07/2021 08:56:43 - INFO - __main__ - Step 83334: {'lr': 0.00021124259761127153, 'samples': 16000128, 'steps': 83333, 'loss/train': 1.4778567552566528} 11/07/2021 08:56:44 - INFO - __main__ - Step 83335: {'lr': 0.00021123735503513004, 'samples': 16000320, 'steps': 83334, 'loss/train': 1.5722079277038574} 11/07/2021 08:56:44 - INFO - __main__ - Step 83336: {'lr': 0.00021123211247645453, 'samples': 16000512, 'steps': 83335, 'loss/train': 1.0639222860336304} 11/07/2021 08:56:44 - INFO - __main__ - Step 83337: {'lr': 0.0002112268699352471, 'samples': 16000704, 'steps': 83336, 'loss/train': 1.0317113399505615} 11/07/2021 08:56:45 - INFO - __main__ - Step 83338: {'lr': 0.00021122162741151024, 'samples': 16000896, 'steps': 83337, 'loss/train': 1.69904625415802} 11/07/2021 08:56:46 - INFO - __main__ - Step 83339: {'lr': 0.00021121638490524634, 'samples': 16001088, 'steps': 83338, 'loss/train': 1.1519994735717773} 11/07/2021 08:56:46 - INFO - __main__ - Step 83340: {'lr': 0.00021121114241645772, 'samples': 16001280, 'steps': 83339, 'loss/train': 0.9768998622894287} 11/07/2021 08:56:46 - INFO - __main__ - Step 83341: {'lr': 0.0002112058999451468, 'samples': 16001472, 'steps': 83340, 'loss/train': 1.001176118850708} 11/07/2021 08:56:47 - INFO - __main__ - Step 83342: {'lr': 0.00021120065749131585, 'samples': 16001664, 'steps': 83341, 'loss/train': 1.5471521615982056} 11/07/2021 08:56:47 - INFO - __main__ - Step 83343: {'lr': 0.0002111954150549673, 'samples': 16001856, 'steps': 83342, 'loss/train': 1.6011393070220947} 11/07/2021 08:56:48 - INFO - __main__ - Step 83344: {'lr': 0.0002111901726361035, 'samples': 16002048, 'steps': 83343, 'loss/train': 1.5285390615463257} 11/07/2021 08:56:48 - INFO - __main__ - Step 83345: {'lr': 0.00021118493023472682, 'samples': 16002240, 'steps': 83344, 'loss/train': 1.7862969636917114} 11/07/2021 08:56:49 - INFO - __main__ - Step 83346: {'lr': 0.0002111796878508396, 'samples': 16002432, 'steps': 83345, 'loss/train': 1.61788010597229} 11/07/2021 08:56:49 - INFO - __main__ - Step 83347: {'lr': 0.00021117444548444424, 'samples': 16002624, 'steps': 83346, 'loss/train': 1.7721161842346191} 11/07/2021 08:56:50 - INFO - __main__ - Step 83348: {'lr': 0.00021116920313554304, 'samples': 16002816, 'steps': 83347, 'loss/train': 1.3009597063064575} 11/07/2021 08:56:51 - INFO - __main__ - Step 83349: {'lr': 0.00021116396080413853, 'samples': 16003008, 'steps': 83348, 'loss/train': 1.7173711061477661} 11/07/2021 08:56:51 - INFO - __main__ - Step 83350: {'lr': 0.0002111587184902328, 'samples': 16003200, 'steps': 83349, 'loss/train': 1.512762188911438} 11/07/2021 08:56:51 - INFO - __main__ - Step 83351: {'lr': 0.00021115347619382838, 'samples': 16003392, 'steps': 83350, 'loss/train': 1.133920431137085} 11/07/2021 08:56:52 - INFO - __main__ - Step 83352: {'lr': 0.0002111482339149276, 'samples': 16003584, 'steps': 83351, 'loss/train': 1.5825413465499878} 11/07/2021 08:56:52 - INFO - __main__ - Step 83353: {'lr': 0.00021114299165353283, 'samples': 16003776, 'steps': 83352, 'loss/train': 0.718298077583313} 11/07/2021 08:56:53 - INFO - __main__ - Step 83354: {'lr': 0.00021113774940964642, 'samples': 16003968, 'steps': 83353, 'loss/train': 0.6031382083892822} 11/07/2021 08:56:53 - INFO - __main__ - Step 83355: {'lr': 0.00021113250718327072, 'samples': 16004160, 'steps': 83354, 'loss/train': 1.459113359451294} 11/07/2021 08:56:54 - INFO - __main__ - Step 83356: {'lr': 0.00021112726497440814, 'samples': 16004352, 'steps': 83355, 'loss/train': 1.3543751239776611} 11/07/2021 08:56:54 - INFO - __main__ - Step 83357: {'lr': 0.00021112202278306103, 'samples': 16004544, 'steps': 83356, 'loss/train': 1.274914264678955} 11/07/2021 08:56:54 - INFO - __main__ - Step 83358: {'lr': 0.0002111167806092317, 'samples': 16004736, 'steps': 83357, 'loss/train': 1.5799342393875122} 11/07/2021 08:56:55 - INFO - __main__ - Step 83359: {'lr': 0.00021111153845292257, 'samples': 16004928, 'steps': 83358, 'loss/train': 1.6243009567260742} 11/07/2021 08:56:56 - INFO - __main__ - Step 83360: {'lr': 0.00021110629631413598, 'samples': 16005120, 'steps': 83359, 'loss/train': 1.375030517578125} 11/07/2021 08:56:56 - INFO - __main__ - Step 83361: {'lr': 0.00021110105419287428, 'samples': 16005312, 'steps': 83360, 'loss/train': 1.913496732711792} 11/07/2021 08:56:56 - INFO - __main__ - Step 83362: {'lr': 0.00021109581208913987, 'samples': 16005504, 'steps': 83361, 'loss/train': 1.219424843788147} 11/07/2021 08:56:57 - INFO - __main__ - Step 83363: {'lr': 0.00021109057000293516, 'samples': 16005696, 'steps': 83362, 'loss/train': 1.255723476409912} 11/07/2021 08:56:57 - INFO - __main__ - Step 83364: {'lr': 0.00021108532793426236, 'samples': 16005888, 'steps': 83363, 'loss/train': 1.3953269720077515} 11/07/2021 08:56:58 - INFO - __main__ - Step 83365: {'lr': 0.00021108008588312387, 'samples': 16006080, 'steps': 83364, 'loss/train': 1.3048955202102661} 11/07/2021 08:56:58 - INFO - __main__ - Step 83366: {'lr': 0.00021107484384952214, 'samples': 16006272, 'steps': 83365, 'loss/train': 0.5328114032745361} 11/07/2021 08:56:59 - INFO - __main__ - Step 83367: {'lr': 0.00021106960183345946, 'samples': 16006464, 'steps': 83366, 'loss/train': 1.6338480710983276} 11/07/2021 08:56:59 - INFO - __main__ - Step 83368: {'lr': 0.00021106435983493822, 'samples': 16006656, 'steps': 83367, 'loss/train': 1.0360337495803833} 11/07/2021 08:57:00 - INFO - __main__ - Step 83369: {'lr': 0.0002110591178539608, 'samples': 16006848, 'steps': 83368, 'loss/train': 0.7061944007873535} 11/07/2021 08:57:00 - INFO - __main__ - Step 83370: {'lr': 0.0002110538758905295, 'samples': 16007040, 'steps': 83369, 'loss/train': 0.9538165330886841} 11/07/2021 08:57:01 - INFO - __main__ - Step 83371: {'lr': 0.00021104863394464678, 'samples': 16007232, 'steps': 83370, 'loss/train': 1.4816464185714722} 11/07/2021 08:57:01 - INFO - __main__ - Step 83372: {'lr': 0.0002110433920163149, 'samples': 16007424, 'steps': 83371, 'loss/train': 1.4192955493927002} 11/07/2021 08:57:02 - INFO - __main__ - Step 83373: {'lr': 0.00021103815010553627, 'samples': 16007616, 'steps': 83372, 'loss/train': 1.2986207008361816} 11/07/2021 08:57:02 - INFO - __main__ - Step 83374: {'lr': 0.00021103290821231324, 'samples': 16007808, 'steps': 83373, 'loss/train': 0.4397847056388855} 11/07/2021 08:57:03 - INFO - __main__ - Step 83375: {'lr': 0.0002110276663366482, 'samples': 16008000, 'steps': 83374, 'loss/train': 1.249151349067688} 11/07/2021 08:57:03 - INFO - __main__ - Step 83376: {'lr': 0.0002110224244785436, 'samples': 16008192, 'steps': 83375, 'loss/train': 1.3706738948822021} 11/07/2021 08:57:04 - INFO - __main__ - Step 83377: {'lr': 0.00021101718263800157, 'samples': 16008384, 'steps': 83376, 'loss/train': 0.8260037302970886} 11/07/2021 08:57:04 - INFO - __main__ - Step 83378: {'lr': 0.00021101194081502462, 'samples': 16008576, 'steps': 83377, 'loss/train': 1.9235990047454834} 11/07/2021 08:57:04 - INFO - __main__ - Step 83379: {'lr': 0.00021100669900961505, 'samples': 16008768, 'steps': 83378, 'loss/train': 1.420021653175354} 11/07/2021 08:57:06 - INFO - __main__ - Step 83380: {'lr': 0.0002110014572217753, 'samples': 16008960, 'steps': 83379, 'loss/train': 1.7184096574783325} 11/07/2021 08:57:06 - INFO - __main__ - Step 83381: {'lr': 0.00021099621545150768, 'samples': 16009152, 'steps': 83380, 'loss/train': 1.421028733253479} 11/07/2021 08:57:06 - INFO - __main__ - Step 83382: {'lr': 0.00021099097369881457, 'samples': 16009344, 'steps': 83381, 'loss/train': 1.2555001974105835} 11/07/2021 08:57:07 - INFO - __main__ - Step 83383: {'lr': 0.0002109857319636983, 'samples': 16009536, 'steps': 83382, 'loss/train': 1.5199716091156006} 11/07/2021 08:57:07 - INFO - __main__ - Step 83384: {'lr': 0.00021098049024616128, 'samples': 16009728, 'steps': 83383, 'loss/train': 0.837031900882721} 11/07/2021 08:57:07 - INFO - __main__ - Step 83385: {'lr': 0.00021097524854620585, 'samples': 16009920, 'steps': 83384, 'loss/train': 1.3412896394729614} 11/07/2021 08:57:08 - INFO - __main__ - Step 83386: {'lr': 0.00021097000686383437, 'samples': 16010112, 'steps': 83385, 'loss/train': 1.2970209121704102} 11/07/2021 08:57:09 - INFO - __main__ - Step 83387: {'lr': 0.00021096476519904918, 'samples': 16010304, 'steps': 83386, 'loss/train': 0.8276333808898926} 11/07/2021 08:57:09 - INFO - __main__ - Step 83388: {'lr': 0.00021095952355185265, 'samples': 16010496, 'steps': 83387, 'loss/train': 1.2900227308273315} 11/07/2021 08:57:09 - INFO - __main__ - Step 83389: {'lr': 0.0002109542819222473, 'samples': 16010688, 'steps': 83388, 'loss/train': 1.1252162456512451} 11/07/2021 08:57:10 - INFO - __main__ - Step 83390: {'lr': 0.00021094904031023525, 'samples': 16010880, 'steps': 83389, 'loss/train': 1.1487760543823242} 11/07/2021 08:57:11 - INFO - __main__ - Step 83391: {'lr': 0.00021094379871581896, 'samples': 16011072, 'steps': 83390, 'loss/train': 1.6290485858917236} 11/07/2021 08:57:11 - INFO - __main__ - Step 83392: {'lr': 0.00021093855713900077, 'samples': 16011264, 'steps': 83391, 'loss/train': 1.021379828453064} 11/07/2021 08:57:11 - INFO - __main__ - Step 83393: {'lr': 0.00021093331557978307, 'samples': 16011456, 'steps': 83392, 'loss/train': 1.397596001625061} 11/07/2021 08:57:12 - INFO - __main__ - Step 83394: {'lr': 0.00021092807403816819, 'samples': 16011648, 'steps': 83393, 'loss/train': 0.8783442974090576} 11/07/2021 08:57:12 - INFO - __main__ - Step 83395: {'lr': 0.00021092283251415855, 'samples': 16011840, 'steps': 83394, 'loss/train': 1.253726601600647} 11/07/2021 08:57:13 - INFO - __main__ - Step 83396: {'lr': 0.0002109175910077565, 'samples': 16012032, 'steps': 83395, 'loss/train': 1.2632439136505127} 11/07/2021 08:57:13 - INFO - __main__ - Step 83397: {'lr': 0.0002109123495189643, 'samples': 16012224, 'steps': 83396, 'loss/train': 1.196160912513733} 11/07/2021 08:57:14 - INFO - __main__ - Step 83398: {'lr': 0.00021090710804778446, 'samples': 16012416, 'steps': 83397, 'loss/train': 1.3880608081817627} 11/07/2021 08:57:14 - INFO - __main__ - Step 83399: {'lr': 0.00021090186659421926, 'samples': 16012608, 'steps': 83398, 'loss/train': 1.2488640546798706} 11/07/2021 08:57:14 - INFO - __main__ - Step 83400: {'lr': 0.00021089662515827107, 'samples': 16012800, 'steps': 83399, 'loss/train': 1.4608893394470215} 11/07/2021 08:57:16 - INFO - __main__ - Step 83401: {'lr': 0.00021089138373994224, 'samples': 16012992, 'steps': 83400, 'loss/train': 1.584160327911377} 11/07/2021 08:57:16 - INFO - __main__ - Step 83402: {'lr': 0.00021088614233923518, 'samples': 16013184, 'steps': 83401, 'loss/train': 1.4685927629470825} 11/07/2021 08:57:16 - INFO - __main__ - Step 83403: {'lr': 0.0002108809009561523, 'samples': 16013376, 'steps': 83402, 'loss/train': 1.3068557977676392} 11/07/2021 08:57:17 - INFO - __main__ - Step 83404: {'lr': 0.0002108756595906958, 'samples': 16013568, 'steps': 83403, 'loss/train': 1.3447431325912476} 11/07/2021 08:57:17 - INFO - __main__ - Step 83405: {'lr': 0.00021087041824286812, 'samples': 16013760, 'steps': 83404, 'loss/train': 1.1512293815612793} 11/07/2021 08:57:17 - INFO - __main__ - Step 83406: {'lr': 0.00021086517691267163, 'samples': 16013952, 'steps': 83405, 'loss/train': 1.4870041608810425} 11/07/2021 08:57:18 - INFO - __main__ - Step 83407: {'lr': 0.00021085993560010865, 'samples': 16014144, 'steps': 83406, 'loss/train': 1.68416428565979} 11/07/2021 08:57:19 - INFO - __main__ - Step 83408: {'lr': 0.0002108546943051816, 'samples': 16014336, 'steps': 83407, 'loss/train': 1.3740625381469727} 11/07/2021 08:57:19 - INFO - __main__ - Step 83409: {'lr': 0.00021084945302789286, 'samples': 16014528, 'steps': 83408, 'loss/train': 1.2757221460342407} 11/07/2021 08:57:19 - INFO - __main__ - Step 83410: {'lr': 0.0002108442117682447, 'samples': 16014720, 'steps': 83409, 'loss/train': 1.254460096359253} 11/07/2021 08:57:20 - INFO - __main__ - Step 83411: {'lr': 0.00021083897052623956, 'samples': 16014912, 'steps': 83410, 'loss/train': 0.8761326670646667} 11/07/2021 08:57:21 - INFO - __main__ - Step 83412: {'lr': 0.00021083372930187977, 'samples': 16015104, 'steps': 83411, 'loss/train': 1.4420185089111328} 11/07/2021 08:57:21 - INFO - __main__ - Step 83413: {'lr': 0.0002108284880951677, 'samples': 16015296, 'steps': 83412, 'loss/train': 1.567954421043396} 11/07/2021 08:57:22 - INFO - __main__ - Step 83414: {'lr': 0.0002108232469061057, 'samples': 16015488, 'steps': 83413, 'loss/train': 1.1590265035629272} 11/07/2021 08:57:22 - INFO - __main__ - Step 83415: {'lr': 0.00021081800573469615, 'samples': 16015680, 'steps': 83414, 'loss/train': 1.656207799911499} 11/07/2021 08:57:22 - INFO - __main__ - Step 83416: {'lr': 0.0002108127645809415, 'samples': 16015872, 'steps': 83415, 'loss/train': 1.2733378410339355} 11/07/2021 08:57:24 - INFO - __main__ - Step 83417: {'lr': 0.00021080752344484392, 'samples': 16016064, 'steps': 83416, 'loss/train': 1.393990159034729} 11/07/2021 08:57:24 - INFO - __main__ - Step 83418: {'lr': 0.00021080228232640586, 'samples': 16016256, 'steps': 83417, 'loss/train': 1.7242448329925537} 11/07/2021 08:57:25 - INFO - __main__ - Step 83419: {'lr': 0.0002107970412256297, 'samples': 16016448, 'steps': 83418, 'loss/train': 1.7404683828353882} 11/07/2021 08:57:25 - INFO - __main__ - Step 83420: {'lr': 0.00021079180014251775, 'samples': 16016640, 'steps': 83419, 'loss/train': 1.3190842866897583} 11/07/2021 08:57:26 - INFO - __main__ - Step 83421: {'lr': 0.00021078655907707242, 'samples': 16016832, 'steps': 83420, 'loss/train': 1.9387242794036865} 11/07/2021 08:57:26 - INFO - __main__ - Step 83422: {'lr': 0.00021078131802929607, 'samples': 16017024, 'steps': 83421, 'loss/train': 0.9477857351303101} 11/07/2021 08:57:26 - INFO - __main__ - Step 83423: {'lr': 0.00021077607699919104, 'samples': 16017216, 'steps': 83422, 'loss/train': 0.5824348330497742} 11/07/2021 08:57:27 - INFO - __main__ - Step 83424: {'lr': 0.00021077083598675973, 'samples': 16017408, 'steps': 83423, 'loss/train': 0.9199294447898865} 11/07/2021 08:57:28 - INFO - __main__ - Step 83425: {'lr': 0.0002107655949920045, 'samples': 16017600, 'steps': 83424, 'loss/train': 1.3642189502716064} 11/07/2021 08:57:28 - INFO - __main__ - Step 83426: {'lr': 0.00021076035401492764, 'samples': 16017792, 'steps': 83425, 'loss/train': 1.4785370826721191} 11/07/2021 08:57:28 - INFO - __main__ - Step 83427: {'lr': 0.0002107551130555316, 'samples': 16017984, 'steps': 83426, 'loss/train': 1.247207760810852} 11/07/2021 08:57:29 - INFO - __main__ - Step 83428: {'lr': 0.00021074987211381867, 'samples': 16018176, 'steps': 83427, 'loss/train': 1.1188818216323853} 11/07/2021 08:57:30 - INFO - __main__ - Step 83429: {'lr': 0.00021074463118979126, 'samples': 16018368, 'steps': 83428, 'loss/train': 0.7137718796730042} 11/07/2021 08:57:30 - INFO - __main__ - Step 83430: {'lr': 0.00021073939028345173, 'samples': 16018560, 'steps': 83429, 'loss/train': 1.1367719173431396} 11/07/2021 08:57:30 - INFO - __main__ - Step 83431: {'lr': 0.00021073414939480243, 'samples': 16018752, 'steps': 83430, 'loss/train': 1.4425382614135742} 11/07/2021 08:57:31 - INFO - __main__ - Step 83432: {'lr': 0.00021072890852384565, 'samples': 16018944, 'steps': 83431, 'loss/train': 1.0240894556045532} 11/07/2021 08:57:31 - INFO - __main__ - Step 83433: {'lr': 0.00021072366767058387, 'samples': 16019136, 'steps': 83432, 'loss/train': 1.278111457824707} 11/07/2021 08:57:32 - INFO - __main__ - Step 83434: {'lr': 0.00021071842683501938, 'samples': 16019328, 'steps': 83433, 'loss/train': 1.8675843477249146} 11/07/2021 08:57:33 - INFO - __main__ - Step 83435: {'lr': 0.00021071318601715455, 'samples': 16019520, 'steps': 83434, 'loss/train': 1.3930026292800903} 11/07/2021 08:57:33 - INFO - __main__ - Step 83436: {'lr': 0.00021070794521699178, 'samples': 16019712, 'steps': 83435, 'loss/train': 1.622039556503296} 11/07/2021 08:57:33 - INFO - __main__ - Step 83437: {'lr': 0.0002107027044345334, 'samples': 16019904, 'steps': 83436, 'loss/train': 1.6213334798812866} 11/07/2021 08:57:34 - INFO - __main__ - Step 83438: {'lr': 0.00021069746366978177, 'samples': 16020096, 'steps': 83437, 'loss/train': 1.58895742893219} 11/07/2021 08:57:35 - INFO - __main__ - Step 83439: {'lr': 0.00021069222292273922, 'samples': 16020288, 'steps': 83438, 'loss/train': 1.5567148923873901} 11/07/2021 08:57:35 - INFO - __main__ - Step 83440: {'lr': 0.0002106869821934082, 'samples': 16020480, 'steps': 83439, 'loss/train': 1.334125280380249} 11/07/2021 08:57:35 - INFO - __main__ - Step 83441: {'lr': 0.00021068174148179098, 'samples': 16020672, 'steps': 83440, 'loss/train': 1.1879734992980957} 11/07/2021 08:57:36 - INFO - __main__ - Step 83442: {'lr': 0.00021067650078788997, 'samples': 16020864, 'steps': 83441, 'loss/train': 1.5484520196914673} 11/07/2021 08:57:36 - INFO - __main__ - Step 83443: {'lr': 0.0002106712601117076, 'samples': 16021056, 'steps': 83442, 'loss/train': 1.5112675428390503} 11/07/2021 08:57:37 - INFO - __main__ - Step 83444: {'lr': 0.00021066601945324607, 'samples': 16021248, 'steps': 83443, 'loss/train': 1.2217806577682495} 11/07/2021 08:57:37 - INFO - __main__ - Step 83445: {'lr': 0.00021066077881250783, 'samples': 16021440, 'steps': 83444, 'loss/train': 1.6001546382904053} 11/07/2021 08:57:38 - INFO - __main__ - Step 83446: {'lr': 0.00021065553818949524, 'samples': 16021632, 'steps': 83445, 'loss/train': 1.545863389968872} 11/07/2021 08:57:38 - INFO - __main__ - Step 83447: {'lr': 0.00021065029758421063, 'samples': 16021824, 'steps': 83446, 'loss/train': 1.1390502452850342} 11/07/2021 08:57:39 - INFO - __main__ - Step 83448: {'lr': 0.00021064505699665647, 'samples': 16022016, 'steps': 83447, 'loss/train': 1.0240962505340576} 11/07/2021 08:57:40 - INFO - __main__ - Step 83449: {'lr': 0.000210639816426835, 'samples': 16022208, 'steps': 83448, 'loss/train': 1.4738805294036865} 11/07/2021 08:57:40 - INFO - __main__ - Step 83450: {'lr': 0.0002106345758747486, 'samples': 16022400, 'steps': 83449, 'loss/train': 1.3253740072250366} 11/07/2021 08:57:40 - INFO - __main__ - Step 83451: {'lr': 0.00021062933534039965, 'samples': 16022592, 'steps': 83450, 'loss/train': 1.22856605052948} 11/07/2021 08:57:41 - INFO - __main__ - Step 83452: {'lr': 0.00021062409482379052, 'samples': 16022784, 'steps': 83451, 'loss/train': 1.4060171842575073} 11/07/2021 08:57:41 - INFO - __main__ - Step 83453: {'lr': 0.00021061885432492358, 'samples': 16022976, 'steps': 83452, 'loss/train': 1.3765658140182495} 11/07/2021 08:57:41 - INFO - __main__ - Step 83454: {'lr': 0.00021061361384380119, 'samples': 16023168, 'steps': 83453, 'loss/train': 1.5301783084869385} 11/07/2021 08:57:42 - INFO - __main__ - Step 83455: {'lr': 0.00021060837338042566, 'samples': 16023360, 'steps': 83454, 'loss/train': 0.9394820928573608} 11/07/2021 08:57:43 - INFO - __main__ - Step 83456: {'lr': 0.0002106031329347994, 'samples': 16023552, 'steps': 83455, 'loss/train': 1.3677053451538086} 11/07/2021 08:57:43 - INFO - __main__ - Step 83457: {'lr': 0.0002105978925069248, 'samples': 16023744, 'steps': 83456, 'loss/train': 1.3825469017028809} 11/07/2021 08:57:43 - INFO - __main__ - Step 83458: {'lr': 0.00021059265209680413, 'samples': 16023936, 'steps': 83457, 'loss/train': 1.3004236221313477} 11/07/2021 08:57:44 - INFO - __main__ - Step 83459: {'lr': 0.0002105874117044399, 'samples': 16024128, 'steps': 83458, 'loss/train': 1.3754031658172607} 11/07/2021 08:57:45 - INFO - __main__ - Step 83460: {'lr': 0.00021058217132983426, 'samples': 16024320, 'steps': 83459, 'loss/train': 1.490535020828247} 11/07/2021 08:57:45 - INFO - __main__ - Step 83461: {'lr': 0.00021057693097298975, 'samples': 16024512, 'steps': 83460, 'loss/train': 1.3722461462020874} 11/07/2021 08:57:46 - INFO - __main__ - Step 83462: {'lr': 0.0002105716906339086, 'samples': 16024704, 'steps': 83461, 'loss/train': 1.715531826019287} 11/07/2021 08:57:46 - INFO - __main__ - Step 83463: {'lr': 0.0002105664503125933, 'samples': 16024896, 'steps': 83462, 'loss/train': 1.7472988367080688} 11/07/2021 08:57:46 - INFO - __main__ - Step 83464: {'lr': 0.0002105612100090461, 'samples': 16025088, 'steps': 83463, 'loss/train': 1.3564813137054443} 11/07/2021 08:57:47 - INFO - __main__ - Step 83465: {'lr': 0.00021055596972326942, 'samples': 16025280, 'steps': 83464, 'loss/train': 1.3038005828857422} 11/07/2021 08:57:48 - INFO - __main__ - Step 83466: {'lr': 0.00021055072945526564, 'samples': 16025472, 'steps': 83465, 'loss/train': 1.4225668907165527} 11/07/2021 08:57:48 - INFO - __main__ - Step 83467: {'lr': 0.00021054548920503705, 'samples': 16025664, 'steps': 83466, 'loss/train': 1.6301249265670776} 11/07/2021 08:57:48 - INFO - __main__ - Step 83468: {'lr': 0.0002105402489725861, 'samples': 16025856, 'steps': 83467, 'loss/train': 1.298643946647644} 11/07/2021 08:57:49 - INFO - __main__ - Step 83469: {'lr': 0.00021053500875791508, 'samples': 16026048, 'steps': 83468, 'loss/train': 1.7299658060073853} 11/07/2021 08:57:50 - INFO - __main__ - Step 83470: {'lr': 0.00021052976856102647, 'samples': 16026240, 'steps': 83469, 'loss/train': 1.040542483329773} 11/07/2021 08:57:50 - INFO - __main__ - Step 83471: {'lr': 0.00021052452838192244, 'samples': 16026432, 'steps': 83470, 'loss/train': 1.403140664100647} 11/07/2021 08:57:50 - INFO - __main__ - Step 83472: {'lr': 0.00021051928822060544, 'samples': 16026624, 'steps': 83471, 'loss/train': 1.3443766832351685} 11/07/2021 08:57:51 - INFO - __main__ - Step 83473: {'lr': 0.00021051404807707785, 'samples': 16026816, 'steps': 83472, 'loss/train': 1.2713309526443481} 11/07/2021 08:57:51 - INFO - __main__ - Step 83474: {'lr': 0.00021050880795134202, 'samples': 16027008, 'steps': 83473, 'loss/train': 1.1096330881118774} 11/07/2021 08:57:52 - INFO - __main__ - Step 83475: {'lr': 0.00021050356784340033, 'samples': 16027200, 'steps': 83474, 'loss/train': 1.4221384525299072} 11/07/2021 08:57:52 - INFO - __main__ - Step 83476: {'lr': 0.0002104983277532551, 'samples': 16027392, 'steps': 83475, 'loss/train': 1.3695119619369507} 11/07/2021 08:57:53 - INFO - __main__ - Step 83477: {'lr': 0.00021049308768090875, 'samples': 16027584, 'steps': 83476, 'loss/train': 1.5701600313186646} 11/07/2021 08:57:53 - INFO - __main__ - Step 83478: {'lr': 0.00021048784762636355, 'samples': 16027776, 'steps': 83477, 'loss/train': 2.040220022201538} 11/07/2021 08:57:54 - INFO - __main__ - Step 83479: {'lr': 0.00021048260758962196, 'samples': 16027968, 'steps': 83478, 'loss/train': 1.1251978874206543} 11/07/2021 08:57:54 - INFO - __main__ - Step 83480: {'lr': 0.00021047736757068627, 'samples': 16028160, 'steps': 83479, 'loss/train': 1.525803804397583} 11/07/2021 08:57:55 - INFO - __main__ - Step 83481: {'lr': 0.00021047212756955888, 'samples': 16028352, 'steps': 83480, 'loss/train': 1.3793784379959106} 11/07/2021 08:57:55 - INFO - __main__ - Step 83482: {'lr': 0.00021046688758624213, 'samples': 16028544, 'steps': 83481, 'loss/train': 1.4483782052993774} 11/07/2021 08:57:56 - INFO - __main__ - Step 83483: {'lr': 0.0002104616476207384, 'samples': 16028736, 'steps': 83482, 'loss/train': 1.5615118741989136} 11/07/2021 08:57:56 - INFO - __main__ - Step 83484: {'lr': 0.00021045640767305016, 'samples': 16028928, 'steps': 83483, 'loss/train': 1.6452358961105347} 11/07/2021 08:57:56 - INFO - __main__ - Step 83485: {'lr': 0.00021045116774317952, 'samples': 16029120, 'steps': 83484, 'loss/train': 1.6108089685440063} 11/07/2021 08:57:58 - INFO - __main__ - Step 83486: {'lr': 0.00021044592783112898, 'samples': 16029312, 'steps': 83485, 'loss/train': 1.2119991779327393} 11/07/2021 08:57:58 - INFO - __main__ - Step 83487: {'lr': 0.0002104406879369009, 'samples': 16029504, 'steps': 83486, 'loss/train': 1.4699110984802246} 11/07/2021 08:57:58 - INFO - __main__ - Step 83488: {'lr': 0.00021043544806049764, 'samples': 16029696, 'steps': 83487, 'loss/train': 2.180607318878174} 11/07/2021 08:57:59 - INFO - __main__ - Step 83489: {'lr': 0.00021043020820192155, 'samples': 16029888, 'steps': 83488, 'loss/train': 1.1794710159301758} 11/07/2021 08:57:59 - INFO - __main__ - Step 83490: {'lr': 0.000210424968361175, 'samples': 16030080, 'steps': 83489, 'loss/train': 1.917650818824768} 11/07/2021 08:58:00 - INFO - __main__ - Step 83491: {'lr': 0.00021041972853826036, 'samples': 16030272, 'steps': 83490, 'loss/train': 1.1436198949813843} 11/07/2021 08:58:00 - INFO - __main__ - Step 83492: {'lr': 0.00021041448873317998, 'samples': 16030464, 'steps': 83491, 'loss/train': 1.4628353118896484} 11/07/2021 08:58:01 - INFO - __main__ - Step 83493: {'lr': 0.00021040924894593618, 'samples': 16030656, 'steps': 83492, 'loss/train': 1.1220301389694214} 11/07/2021 08:58:01 - INFO - __main__ - Step 83494: {'lr': 0.00021040400917653142, 'samples': 16030848, 'steps': 83493, 'loss/train': 1.3537291288375854} 11/07/2021 08:58:01 - INFO - __main__ - Step 83495: {'lr': 0.00021039876942496793, 'samples': 16031040, 'steps': 83494, 'loss/train': 0.8892397284507751} 11/07/2021 08:58:02 - INFO - __main__ - Step 83496: {'lr': 0.0002103935296912482, 'samples': 16031232, 'steps': 83495, 'loss/train': 1.6328057050704956} 11/07/2021 08:58:03 - INFO - __main__ - Step 83497: {'lr': 0.00021038828997537462, 'samples': 16031424, 'steps': 83496, 'loss/train': 1.4791982173919678} 11/07/2021 08:58:03 - INFO - __main__ - Step 83498: {'lr': 0.0002103830502773494, 'samples': 16031616, 'steps': 83497, 'loss/train': 1.6226483583450317} 11/07/2021 08:58:03 - INFO - __main__ - Step 83499: {'lr': 0.00021037781059717492, 'samples': 16031808, 'steps': 83498, 'loss/train': 0.4508468210697174} 11/07/2021 08:58:04 - INFO - __main__ - Step 83500: {'lr': 0.0002103725709348536, 'samples': 16032000, 'steps': 83499, 'loss/train': 1.4416112899780273} 11/07/2021 08:58:05 - INFO - __main__ - Step 83501: {'lr': 0.0002103673312903878, 'samples': 16032192, 'steps': 83500, 'loss/train': 1.1726138591766357} 11/07/2021 08:58:05 - INFO - __main__ - Step 83502: {'lr': 0.00021036209166377985, 'samples': 16032384, 'steps': 83501, 'loss/train': 1.5021908283233643} 11/07/2021 08:58:05 - INFO - __main__ - Step 83503: {'lr': 0.00021035685205503214, 'samples': 16032576, 'steps': 83502, 'loss/train': 1.6316149234771729} 11/07/2021 08:58:06 - INFO - __main__ - Step 83504: {'lr': 0.000210351612464147, 'samples': 16032768, 'steps': 83503, 'loss/train': 0.8357467651367188} 11/07/2021 08:58:06 - INFO - __main__ - Step 83505: {'lr': 0.0002103463728911268, 'samples': 16032960, 'steps': 83504, 'loss/train': 1.7667721509933472} 11/07/2021 08:58:07 - INFO - __main__ - Step 83506: {'lr': 0.00021034113333597397, 'samples': 16033152, 'steps': 83505, 'loss/train': 1.1174176931381226} 11/07/2021 08:58:07 - INFO - __main__ - Step 83507: {'lr': 0.0002103358937986908, 'samples': 16033344, 'steps': 83506, 'loss/train': 1.2233984470367432} 11/07/2021 08:58:08 - INFO - __main__ - Step 83508: {'lr': 0.00021033065427927963, 'samples': 16033536, 'steps': 83507, 'loss/train': 1.467056393623352} 11/07/2021 08:58:08 - INFO - __main__ - Step 83509: {'lr': 0.00021032541477774286, 'samples': 16033728, 'steps': 83508, 'loss/train': 1.1620055437088013} 11/07/2021 08:58:08 - INFO - __main__ - Step 83510: {'lr': 0.000210320175294083, 'samples': 16033920, 'steps': 83509, 'loss/train': 1.4385617971420288} 11/07/2021 08:58:09 - INFO - __main__ - Step 83511: {'lr': 0.0002103149358283021, 'samples': 16034112, 'steps': 83510, 'loss/train': 1.2373830080032349} 11/07/2021 08:58:11 - INFO - __main__ - Step 83512: {'lr': 0.0002103096963804027, 'samples': 16034304, 'steps': 83511, 'loss/train': 1.3466191291809082} 11/07/2021 08:58:11 - INFO - __main__ - Step 83513: {'lr': 0.00021030445695038714, 'samples': 16034496, 'steps': 83512, 'loss/train': 1.5916081666946411} 11/07/2021 08:58:11 - INFO - __main__ - Step 83514: {'lr': 0.00021029921753825775, 'samples': 16034688, 'steps': 83513, 'loss/train': 1.070117473602295} 11/07/2021 08:58:12 - INFO - __main__ - Step 83515: {'lr': 0.00021029397814401694, 'samples': 16034880, 'steps': 83514, 'loss/train': 1.3816585540771484} 11/07/2021 08:58:12 - INFO - __main__ - Step 83516: {'lr': 0.00021028873876766704, 'samples': 16035072, 'steps': 83515, 'loss/train': 1.2612124681472778} 11/07/2021 08:58:12 - INFO - __main__ - Step 83517: {'lr': 0.00021028349940921043, 'samples': 16035264, 'steps': 83516, 'loss/train': 1.7810221910476685} 11/07/2021 08:58:13 - INFO - __main__ - Step 83518: {'lr': 0.00021027826006864947, 'samples': 16035456, 'steps': 83517, 'loss/train': 1.7784501314163208} 11/07/2021 08:58:14 - INFO - __main__ - Step 83519: {'lr': 0.00021027302074598652, 'samples': 16035648, 'steps': 83518, 'loss/train': 1.718997597694397} 11/07/2021 08:58:14 - INFO - __main__ - Step 83520: {'lr': 0.00021026778144122394, 'samples': 16035840, 'steps': 83519, 'loss/train': 1.4667421579360962} 11/07/2021 08:58:15 - INFO - __main__ - Step 83521: {'lr': 0.00021026254215436406, 'samples': 16036032, 'steps': 83520, 'loss/train': 1.3315653800964355} 11/07/2021 08:58:15 - INFO - __main__ - Step 83522: {'lr': 0.00021025730288540926, 'samples': 16036224, 'steps': 83521, 'loss/train': 1.4334558248519897} 11/07/2021 08:58:15 - INFO - __main__ - Step 83523: {'lr': 0.0002102520636343619, 'samples': 16036416, 'steps': 83522, 'loss/train': 1.4028044939041138} 11/07/2021 08:58:16 - INFO - __main__ - Step 83524: {'lr': 0.0002102468244012245, 'samples': 16036608, 'steps': 83523, 'loss/train': 1.1008625030517578} 11/07/2021 08:58:17 - INFO - __main__ - Step 83525: {'lr': 0.0002102415851859991, 'samples': 16036800, 'steps': 83524, 'loss/train': 1.411881685256958} 11/07/2021 08:58:17 - INFO - __main__ - Step 83526: {'lr': 0.00021023634598868829, 'samples': 16036992, 'steps': 83525, 'loss/train': 1.3697092533111572} 11/07/2021 08:58:17 - INFO - __main__ - Step 83527: {'lr': 0.00021023110680929433, 'samples': 16037184, 'steps': 83526, 'loss/train': 1.2278779745101929} 11/07/2021 08:58:18 - INFO - __main__ - Step 83528: {'lr': 0.00021022586764781964, 'samples': 16037376, 'steps': 83527, 'loss/train': 1.5410817861557007} 11/07/2021 08:58:19 - INFO - __main__ - Step 83529: {'lr': 0.00021022062850426654, 'samples': 16037568, 'steps': 83528, 'loss/train': 1.5009253025054932} 11/07/2021 08:58:19 - INFO - __main__ - Step 83530: {'lr': 0.00021021538937863744, 'samples': 16037760, 'steps': 83529, 'loss/train': 1.8281649351119995} 11/07/2021 08:58:19 - INFO - __main__ - Step 83531: {'lr': 0.00021021015027093465, 'samples': 16037952, 'steps': 83530, 'loss/train': 1.5903724431991577} 11/07/2021 08:58:20 - INFO - __main__ - Step 83532: {'lr': 0.00021020491118116052, 'samples': 16038144, 'steps': 83531, 'loss/train': 1.2790356874465942} 11/07/2021 08:58:20 - INFO - __main__ - Step 83533: {'lr': 0.0002101996721093175, 'samples': 16038336, 'steps': 83532, 'loss/train': 1.485987663269043} 11/07/2021 08:58:21 - INFO - __main__ - Step 83534: {'lr': 0.00021019443305540786, 'samples': 16038528, 'steps': 83533, 'loss/train': 0.18069399893283844} 11/07/2021 08:58:21 - INFO - __main__ - Step 83535: {'lr': 0.000210189194019434, 'samples': 16038720, 'steps': 83534, 'loss/train': 1.5100440979003906} 11/07/2021 08:58:22 - INFO - __main__ - Step 83536: {'lr': 0.00021018395500139832, 'samples': 16038912, 'steps': 83535, 'loss/train': 0.9017482995986938} 11/07/2021 08:58:22 - INFO - __main__ - Step 83537: {'lr': 0.00021017871600130316, 'samples': 16039104, 'steps': 83536, 'loss/train': 1.688592553138733} 11/07/2021 08:58:22 - INFO - __main__ - Step 83538: {'lr': 0.0002101734770191508, 'samples': 16039296, 'steps': 83537, 'loss/train': 1.4960671663284302} 11/07/2021 08:58:23 - INFO - __main__ - Step 83539: {'lr': 0.00021016823805494368, 'samples': 16039488, 'steps': 83538, 'loss/train': 0.7482104301452637} 11/07/2021 08:58:24 - INFO - __main__ - Step 83540: {'lr': 0.0002101629991086841, 'samples': 16039680, 'steps': 83539, 'loss/train': 1.4292677640914917} 11/07/2021 08:58:24 - INFO - __main__ - Step 83541: {'lr': 0.00021015776018037445, 'samples': 16039872, 'steps': 83540, 'loss/train': 1.3963497877120972} 11/07/2021 08:58:25 - INFO - __main__ - Step 83542: {'lr': 0.0002101525212700171, 'samples': 16040064, 'steps': 83541, 'loss/train': 1.4993306398391724} 11/07/2021 08:58:25 - INFO - __main__ - Step 83543: {'lr': 0.00021014728237761445, 'samples': 16040256, 'steps': 83542, 'loss/train': 0.7061284184455872} 11/07/2021 08:58:25 - INFO - __main__ - Step 83544: {'lr': 0.00021014204350316875, 'samples': 16040448, 'steps': 83543, 'loss/train': 1.459466814994812} 11/07/2021 08:58:26 - INFO - __main__ - Step 83545: {'lr': 0.0002101368046466825, 'samples': 16040640, 'steps': 83544, 'loss/train': 0.057327236980199814} 11/07/2021 08:58:27 - INFO - __main__ - Step 83546: {'lr': 0.00021013156580815796, 'samples': 16040832, 'steps': 83545, 'loss/train': 1.0966298580169678} 11/07/2021 08:58:27 - INFO - __main__ - Step 83547: {'lr': 0.00021012632698759752, 'samples': 16041024, 'steps': 83546, 'loss/train': 1.3772133588790894} 11/07/2021 08:58:27 - INFO - __main__ - Step 83548: {'lr': 0.00021012108818500353, 'samples': 16041216, 'steps': 83547, 'loss/train': 1.54081392288208} 11/07/2021 08:58:28 - INFO - __main__ - Step 83549: {'lr': 0.00021011584940037838, 'samples': 16041408, 'steps': 83548, 'loss/train': 1.6308245658874512} 11/07/2021 08:58:29 - INFO - __main__ - Step 83550: {'lr': 0.00021011061063372447, 'samples': 16041600, 'steps': 83549, 'loss/train': 1.1561625003814697} 11/07/2021 08:58:29 - INFO - __main__ - Step 83551: {'lr': 0.0002101053718850441, 'samples': 16041792, 'steps': 83550, 'loss/train': 0.6809167265892029} 11/07/2021 08:58:30 - INFO - __main__ - Step 83552: {'lr': 0.00021010013315433956, 'samples': 16041984, 'steps': 83551, 'loss/train': 0.6865038871765137} 11/07/2021 08:58:30 - INFO - __main__ - Step 83553: {'lr': 0.0002100948944416133, 'samples': 16042176, 'steps': 83552, 'loss/train': 1.3915693759918213} 11/07/2021 08:58:30 - INFO - __main__ - Step 83554: {'lr': 0.00021008965574686767, 'samples': 16042368, 'steps': 83553, 'loss/train': 0.9567397832870483} 11/07/2021 08:58:31 - INFO - __main__ - Step 83555: {'lr': 0.00021008441707010504, 'samples': 16042560, 'steps': 83554, 'loss/train': 1.3773128986358643} 11/07/2021 08:58:32 - INFO - __main__ - Step 83556: {'lr': 0.00021007917841132774, 'samples': 16042752, 'steps': 83555, 'loss/train': 1.2909345626831055} 11/07/2021 08:58:32 - INFO - __main__ - Step 83557: {'lr': 0.00021007393977053813, 'samples': 16042944, 'steps': 83556, 'loss/train': 1.6249529123306274} 11/07/2021 08:58:32 - INFO - __main__ - Step 83558: {'lr': 0.0002100687011477386, 'samples': 16043136, 'steps': 83557, 'loss/train': 1.8430856466293335} 11/07/2021 08:58:33 - INFO - __main__ - Step 83559: {'lr': 0.0002100634625429315, 'samples': 16043328, 'steps': 83558, 'loss/train': 1.584303855895996} 11/07/2021 08:58:33 - INFO - __main__ - Step 83560: {'lr': 0.00021005822395611917, 'samples': 16043520, 'steps': 83559, 'loss/train': 1.2982008457183838} 11/07/2021 08:58:34 - INFO - __main__ - Step 83561: {'lr': 0.00021005298538730405, 'samples': 16043712, 'steps': 83560, 'loss/train': 1.9208136796951294} 11/07/2021 08:58:34 - INFO - __main__ - Step 83562: {'lr': 0.00021004774683648842, 'samples': 16043904, 'steps': 83561, 'loss/train': 1.2496850490570068} 11/07/2021 08:58:35 - INFO - __main__ - Step 83563: {'lr': 0.00021004250830367462, 'samples': 16044096, 'steps': 83562, 'loss/train': 1.1597808599472046} 11/07/2021 08:58:35 - INFO - __main__ - Step 83564: {'lr': 0.00021003726978886513, 'samples': 16044288, 'steps': 83563, 'loss/train': 1.0042037963867188} 11/07/2021 08:58:36 - INFO - __main__ - Step 83565: {'lr': 0.00021003203129206215, 'samples': 16044480, 'steps': 83564, 'loss/train': 1.1905936002731323} 11/07/2021 08:58:37 - INFO - __main__ - Step 83566: {'lr': 0.00021002679281326812, 'samples': 16044672, 'steps': 83565, 'loss/train': 1.4327963590621948} 11/07/2021 08:58:37 - INFO - __main__ - Step 83567: {'lr': 0.0002100215543524854, 'samples': 16044864, 'steps': 83566, 'loss/train': 1.386487603187561} 11/07/2021 08:58:37 - INFO - __main__ - Step 83568: {'lr': 0.00021001631590971637, 'samples': 16045056, 'steps': 83567, 'loss/train': 1.2820061445236206} 11/07/2021 08:58:38 - INFO - __main__ - Step 83569: {'lr': 0.00021001107748496334, 'samples': 16045248, 'steps': 83568, 'loss/train': 0.7661770582199097} 11/07/2021 08:58:38 - INFO - __main__ - Step 83570: {'lr': 0.00021000583907822873, 'samples': 16045440, 'steps': 83569, 'loss/train': 1.8138598203659058} 11/07/2021 08:58:39 - INFO - __main__ - Step 83571: {'lr': 0.00021000060068951488, 'samples': 16045632, 'steps': 83570, 'loss/train': 1.2082511186599731} 11/07/2021 08:58:39 - INFO - __main__ - Step 83572: {'lr': 0.00020999536231882415, 'samples': 16045824, 'steps': 83571, 'loss/train': 1.2763701677322388} 11/07/2021 08:58:40 - INFO - __main__ - Step 83573: {'lr': 0.00020999012396615889, 'samples': 16046016, 'steps': 83572, 'loss/train': 1.6137133836746216} 11/07/2021 08:58:40 - INFO - __main__ - Step 83574: {'lr': 0.00020998488563152143, 'samples': 16046208, 'steps': 83573, 'loss/train': 1.464600920677185} 11/07/2021 08:58:41 - INFO - __main__ - Step 83575: {'lr': 0.00020997964731491418, 'samples': 16046400, 'steps': 83574, 'loss/train': 1.1249076128005981} 11/07/2021 08:58:41 - INFO - __main__ - Step 83576: {'lr': 0.00020997440901633947, 'samples': 16046592, 'steps': 83575, 'loss/train': 1.4560160636901855} 11/07/2021 08:58:42 - INFO - __main__ - Step 83577: {'lr': 0.0002099691707357997, 'samples': 16046784, 'steps': 83576, 'loss/train': 1.7958000898361206} 11/07/2021 08:58:42 - INFO - __main__ - Step 83578: {'lr': 0.0002099639324732972, 'samples': 16046976, 'steps': 83577, 'loss/train': 1.6647893190383911} 11/07/2021 08:58:43 - INFO - __main__ - Step 83579: {'lr': 0.00020995869422883436, 'samples': 16047168, 'steps': 83578, 'loss/train': 1.5033341646194458} 11/07/2021 08:58:43 - INFO - __main__ - Step 83580: {'lr': 0.00020995345600241346, 'samples': 16047360, 'steps': 83579, 'loss/train': 1.4768191576004028} 11/07/2021 08:58:44 - INFO - __main__ - Step 83581: {'lr': 0.0002099482177940369, 'samples': 16047552, 'steps': 83580, 'loss/train': 1.2800939083099365} 11/07/2021 08:58:44 - INFO - __main__ - Step 83582: {'lr': 0.00020994297960370712, 'samples': 16047744, 'steps': 83581, 'loss/train': 0.6313899159431458} 11/07/2021 08:58:45 - INFO - __main__ - Step 83583: {'lr': 0.00020993774143142642, 'samples': 16047936, 'steps': 83582, 'loss/train': 0.7881156802177429} 11/07/2021 08:58:45 - INFO - __main__ - Step 83584: {'lr': 0.0002099325032771971, 'samples': 16048128, 'steps': 83583, 'loss/train': 1.464874505996704} 11/07/2021 08:58:45 - INFO - __main__ - Step 83585: {'lr': 0.00020992726514102158, 'samples': 16048320, 'steps': 83584, 'loss/train': 1.6794919967651367} 11/07/2021 08:58:46 - INFO - __main__ - Step 83586: {'lr': 0.00020992202702290225, 'samples': 16048512, 'steps': 83585, 'loss/train': 1.118046760559082} 11/07/2021 08:58:47 - INFO - __main__ - Step 83587: {'lr': 0.0002099167889228414, 'samples': 16048704, 'steps': 83586, 'loss/train': 1.1045465469360352} 11/07/2021 08:58:47 - INFO - __main__ - Step 83588: {'lr': 0.00020991155084084146, 'samples': 16048896, 'steps': 83587, 'loss/train': 1.8129031658172607} 11/07/2021 08:58:48 - INFO - __main__ - Step 83589: {'lr': 0.0002099063127769047, 'samples': 16049088, 'steps': 83588, 'loss/train': 1.1714909076690674} 11/07/2021 08:58:48 - INFO - __main__ - Step 83590: {'lr': 0.00020990107473103358, 'samples': 16049280, 'steps': 83589, 'loss/train': 1.456800103187561} 11/07/2021 08:58:48 - INFO - __main__ - Step 83591: {'lr': 0.00020989583670323047, 'samples': 16049472, 'steps': 83590, 'loss/train': 1.5991640090942383} 11/07/2021 08:58:49 - INFO - __main__ - Step 83592: {'lr': 0.00020989059869349762, 'samples': 16049664, 'steps': 83591, 'loss/train': 1.2668801546096802} 11/07/2021 08:58:50 - INFO - __main__ - Step 83593: {'lr': 0.00020988536070183744, 'samples': 16049856, 'steps': 83592, 'loss/train': 0.8344578742980957} 11/07/2021 08:58:50 - INFO - __main__ - Step 83594: {'lr': 0.00020988012272825236, 'samples': 16050048, 'steps': 83593, 'loss/train': 1.408453106880188} 11/07/2021 08:58:50 - INFO - __main__ - Step 83595: {'lr': 0.0002098748847727446, 'samples': 16050240, 'steps': 83594, 'loss/train': 1.32889986038208} 11/07/2021 08:58:51 - INFO - __main__ - Step 83596: {'lr': 0.00020986964683531662, 'samples': 16050432, 'steps': 83595, 'loss/train': 0.15289165079593658} 11/07/2021 08:58:52 - INFO - __main__ - Step 83597: {'lr': 0.00020986440891597075, 'samples': 16050624, 'steps': 83596, 'loss/train': 1.645027756690979} 11/07/2021 08:58:52 - INFO - __main__ - Step 83598: {'lr': 0.00020985917101470935, 'samples': 16050816, 'steps': 83597, 'loss/train': 1.5633678436279297} 11/07/2021 08:58:53 - INFO - __main__ - Step 83599: {'lr': 0.00020985393313153485, 'samples': 16051008, 'steps': 83598, 'loss/train': 1.1680338382720947} 11/07/2021 08:58:53 - INFO - __main__ - Step 83600: {'lr': 0.00020984869526644948, 'samples': 16051200, 'steps': 83599, 'loss/train': 1.439728856086731} 11/07/2021 08:58:53 - INFO - __main__ - Step 83601: {'lr': 0.00020984345741945567, 'samples': 16051392, 'steps': 83600, 'loss/train': 1.7462159395217896} 11/07/2021 08:58:54 - INFO - __main__ - Step 83602: {'lr': 0.0002098382195905558, 'samples': 16051584, 'steps': 83601, 'loss/train': 1.199462890625} 11/07/2021 08:58:55 - INFO - __main__ - Step 83603: {'lr': 0.00020983298177975222, 'samples': 16051776, 'steps': 83602, 'loss/train': 1.4414639472961426} 11/07/2021 08:58:55 - INFO - __main__ - Step 83604: {'lr': 0.00020982774398704723, 'samples': 16051968, 'steps': 83603, 'loss/train': 1.056777000427246} 11/07/2021 08:58:56 - INFO - __main__ - Step 83605: {'lr': 0.00020982250621244338, 'samples': 16052160, 'steps': 83604, 'loss/train': 0.5871017575263977} 11/07/2021 08:58:56 - INFO - __main__ - Step 83606: {'lr': 0.00020981726845594278, 'samples': 16052352, 'steps': 83605, 'loss/train': 1.6574134826660156} 11/07/2021 08:58:56 - INFO - __main__ - Step 83607: {'lr': 0.0002098120307175479, 'samples': 16052544, 'steps': 83606, 'loss/train': 1.442009687423706} 11/07/2021 08:58:57 - INFO - __main__ - Step 83608: {'lr': 0.0002098067929972611, 'samples': 16052736, 'steps': 83607, 'loss/train': 1.7659811973571777} 11/07/2021 08:58:58 - INFO - __main__ - Step 83609: {'lr': 0.00020980155529508473, 'samples': 16052928, 'steps': 83608, 'loss/train': 1.8550797700881958} 11/07/2021 08:58:58 - INFO - __main__ - Step 83610: {'lr': 0.0002097963176110212, 'samples': 16053120, 'steps': 83609, 'loss/train': 1.261565089225769} 11/07/2021 08:58:58 - INFO - __main__ - Step 83611: {'lr': 0.00020979107994507278, 'samples': 16053312, 'steps': 83610, 'loss/train': 1.367170810699463} 11/07/2021 08:58:59 - INFO - __main__ - Step 83612: {'lr': 0.00020978584229724187, 'samples': 16053504, 'steps': 83611, 'loss/train': 1.2573022842407227} 11/07/2021 08:59:00 - INFO - __main__ - Step 83613: {'lr': 0.00020978060466753088, 'samples': 16053696, 'steps': 83612, 'loss/train': 1.0930570363998413} 11/07/2021 08:59:00 - INFO - __main__ - Step 83614: {'lr': 0.0002097753670559421, 'samples': 16053888, 'steps': 83613, 'loss/train': 1.3741780519485474} 11/07/2021 08:59:01 - INFO - __main__ - Step 83615: {'lr': 0.00020977012946247792, 'samples': 16054080, 'steps': 83614, 'loss/train': 1.8805042505264282} 11/07/2021 08:59:01 - INFO - __main__ - Step 83616: {'lr': 0.0002097648918871407, 'samples': 16054272, 'steps': 83615, 'loss/train': 1.4136145114898682} 11/07/2021 08:59:01 - INFO - __main__ - Step 83617: {'lr': 0.00020975965432993283, 'samples': 16054464, 'steps': 83616, 'loss/train': 1.3945698738098145} 11/07/2021 08:59:02 - INFO - __main__ - Step 83618: {'lr': 0.00020975441679085672, 'samples': 16054656, 'steps': 83617, 'loss/train': 1.4398654699325562} 11/07/2021 08:59:03 - INFO - __main__ - Step 83619: {'lr': 0.00020974917926991455, 'samples': 16054848, 'steps': 83618, 'loss/train': 1.152955412864685} 11/07/2021 08:59:03 - INFO - __main__ - Step 83620: {'lr': 0.00020974394176710877, 'samples': 16055040, 'steps': 83619, 'loss/train': 1.5330642461776733} 11/07/2021 08:59:03 - INFO - __main__ - Step 83621: {'lr': 0.00020973870428244175, 'samples': 16055232, 'steps': 83620, 'loss/train': 0.8609625101089478} 11/07/2021 08:59:04 - INFO - __main__ - Step 83622: {'lr': 0.00020973346681591584, 'samples': 16055424, 'steps': 83621, 'loss/train': 2.0233802795410156} 11/07/2021 08:59:04 - INFO - __main__ - Step 83623: {'lr': 0.00020972822936753344, 'samples': 16055616, 'steps': 83622, 'loss/train': 0.17053619027137756} 11/07/2021 08:59:05 - INFO - __main__ - Step 83624: {'lr': 0.00020972299193729686, 'samples': 16055808, 'steps': 83623, 'loss/train': 1.5949528217315674} 11/07/2021 08:59:05 - INFO - __main__ - Step 83625: {'lr': 0.00020971775452520848, 'samples': 16056000, 'steps': 83624, 'loss/train': 1.0830289125442505} 11/07/2021 08:59:06 - INFO - __main__ - Step 83626: {'lr': 0.00020971251713127064, 'samples': 16056192, 'steps': 83625, 'loss/train': 1.2424099445343018} 11/07/2021 08:59:06 - INFO - __main__ - Step 83627: {'lr': 0.00020970727975548573, 'samples': 16056384, 'steps': 83626, 'loss/train': 1.5680882930755615} 11/07/2021 08:59:07 - INFO - __main__ - Step 83628: {'lr': 0.0002097020423978561, 'samples': 16056576, 'steps': 83627, 'loss/train': 1.7556369304656982} 11/07/2021 08:59:08 - INFO - __main__ - Step 83629: {'lr': 0.00020969680505838413, 'samples': 16056768, 'steps': 83628, 'loss/train': 1.7382874488830566} 11/07/2021 08:59:08 - INFO - __main__ - Step 83630: {'lr': 0.00020969156773707209, 'samples': 16056960, 'steps': 83629, 'loss/train': 1.6071797609329224} 11/07/2021 08:59:08 - INFO - __main__ - Step 83631: {'lr': 0.0002096863304339226, 'samples': 16057152, 'steps': 83630, 'loss/train': 1.5093952417373657} 11/07/2021 08:59:09 - INFO - __main__ - Step 83632: {'lr': 0.00020968109314893765, 'samples': 16057344, 'steps': 83631, 'loss/train': 1.4477235078811646} 11/07/2021 08:59:09 - INFO - __main__ - Step 83633: {'lr': 0.00020967585588211983, 'samples': 16057536, 'steps': 83632, 'loss/train': 0.8664186596870422} 11/07/2021 08:59:10 - INFO - __main__ - Step 83634: {'lr': 0.00020967061863347143, 'samples': 16057728, 'steps': 83633, 'loss/train': 1.377524495124817} 11/07/2021 08:59:10 - INFO - __main__ - Step 83635: {'lr': 0.0002096653814029948, 'samples': 16057920, 'steps': 83634, 'loss/train': 1.1678588390350342} 11/07/2021 08:59:11 - INFO - __main__ - Step 83636: {'lr': 0.00020966014419069234, 'samples': 16058112, 'steps': 83635, 'loss/train': 1.5834099054336548} 11/07/2021 08:59:11 - INFO - __main__ - Step 83637: {'lr': 0.00020965490699656643, 'samples': 16058304, 'steps': 83636, 'loss/train': 1.2841575145721436} 11/07/2021 08:59:11 - INFO - __main__ - Step 83638: {'lr': 0.00020964966982061936, 'samples': 16058496, 'steps': 83637, 'loss/train': 1.2915549278259277} 11/07/2021 08:59:12 - INFO - __main__ - Step 83639: {'lr': 0.00020964443266285356, 'samples': 16058688, 'steps': 83638, 'loss/train': 1.2166664600372314} 11/07/2021 08:59:13 - INFO - __main__ - Step 83640: {'lr': 0.0002096391955232713, 'samples': 16058880, 'steps': 83639, 'loss/train': 1.5332636833190918} 11/07/2021 08:59:13 - INFO - __main__ - Step 83641: {'lr': 0.00020963395840187504, 'samples': 16059072, 'steps': 83640, 'loss/train': 1.1485800743103027} 11/07/2021 08:59:13 - INFO - __main__ - Step 83642: {'lr': 0.0002096287212986671, 'samples': 16059264, 'steps': 83641, 'loss/train': 1.2221275568008423} 11/07/2021 08:59:14 - INFO - __main__ - Step 83643: {'lr': 0.0002096234842136498, 'samples': 16059456, 'steps': 83642, 'loss/train': 1.80941903591156} 11/07/2021 08:59:15 - INFO - __main__ - Step 83644: {'lr': 0.00020961824714682556, 'samples': 16059648, 'steps': 83643, 'loss/train': 1.8626381158828735} 11/07/2021 08:59:15 - INFO - __main__ - Step 83645: {'lr': 0.00020961301009819684, 'samples': 16059840, 'steps': 83644, 'loss/train': 1.6032720804214478} 11/07/2021 08:59:16 - INFO - __main__ - Step 83646: {'lr': 0.00020960777306776573, 'samples': 16060032, 'steps': 83645, 'loss/train': 1.0117336511611938} 11/07/2021 08:59:16 - INFO - __main__ - Step 83647: {'lr': 0.00020960253605553476, 'samples': 16060224, 'steps': 83646, 'loss/train': 1.2429488897323608} 11/07/2021 08:59:16 - INFO - __main__ - Step 83648: {'lr': 0.00020959729906150622, 'samples': 16060416, 'steps': 83647, 'loss/train': 2.0702168941497803} 11/07/2021 08:59:17 - INFO - __main__ - Step 83649: {'lr': 0.00020959206208568255, 'samples': 16060608, 'steps': 83648, 'loss/train': 1.4249948263168335} 11/07/2021 08:59:18 - INFO - __main__ - Step 83650: {'lr': 0.0002095868251280661, 'samples': 16060800, 'steps': 83649, 'loss/train': 1.4278565645217896} 11/07/2021 08:59:18 - INFO - __main__ - Step 83651: {'lr': 0.00020958158818865915, 'samples': 16060992, 'steps': 83650, 'loss/train': 1.5112117528915405} 11/07/2021 08:59:18 - INFO - __main__ - Step 83652: {'lr': 0.00020957635126746415, 'samples': 16061184, 'steps': 83651, 'loss/train': 2.031426191329956} 11/07/2021 08:59:19 - INFO - __main__ - Step 83653: {'lr': 0.0002095711143644834, 'samples': 16061376, 'steps': 83652, 'loss/train': 1.234047770500183} 11/07/2021 08:59:19 - INFO - __main__ - Step 83654: {'lr': 0.00020956587747971927, 'samples': 16061568, 'steps': 83653, 'loss/train': 0.7332591414451599} 11/07/2021 08:59:20 - INFO - __main__ - Step 83655: {'lr': 0.00020956064061317415, 'samples': 16061760, 'steps': 83654, 'loss/train': 1.296992540359497} 11/07/2021 08:59:20 - INFO - __main__ - Step 83656: {'lr': 0.00020955540376485038, 'samples': 16061952, 'steps': 83655, 'loss/train': 1.498653531074524} 11/07/2021 08:59:21 - INFO - __main__ - Step 83657: {'lr': 0.0002095501669347503, 'samples': 16062144, 'steps': 83656, 'loss/train': 1.9437345266342163} 11/07/2021 08:59:21 - INFO - __main__ - Step 83658: {'lr': 0.00020954493012287646, 'samples': 16062336, 'steps': 83657, 'loss/train': 1.0361660718917847} 11/07/2021 08:59:21 - INFO - __main__ - Step 83659: {'lr': 0.0002095396933292309, 'samples': 16062528, 'steps': 83658, 'loss/train': 1.2664586305618286} 11/07/2021 08:59:23 - INFO - __main__ - Step 83660: {'lr': 0.00020953445655381615, 'samples': 16062720, 'steps': 83659, 'loss/train': 0.8810487985610962} 11/07/2021 08:59:23 - INFO - __main__ - Step 83661: {'lr': 0.00020952921979663453, 'samples': 16062912, 'steps': 83660, 'loss/train': 1.7090495824813843} 11/07/2021 08:59:24 - INFO - __main__ - Step 83662: {'lr': 0.0002095239830576884, 'samples': 16063104, 'steps': 83661, 'loss/train': 0.27745020389556885} 11/07/2021 08:59:24 - INFO - __main__ - Step 83663: {'lr': 0.00020951874633698018, 'samples': 16063296, 'steps': 83662, 'loss/train': 1.5622085332870483} 11/07/2021 08:59:24 - INFO - __main__ - Step 83664: {'lr': 0.00020951350963451215, 'samples': 16063488, 'steps': 83663, 'loss/train': 1.4542484283447266} 11/07/2021 08:59:25 - INFO - __main__ - Step 83665: {'lr': 0.00020950827295028674, 'samples': 16063680, 'steps': 83664, 'loss/train': 1.6138378381729126} 11/07/2021 08:59:26 - INFO - __main__ - Step 83666: {'lr': 0.00020950303628430625, 'samples': 16063872, 'steps': 83665, 'loss/train': 1.325897455215454} 11/07/2021 08:59:26 - INFO - __main__ - Step 83667: {'lr': 0.00020949779963657308, 'samples': 16064064, 'steps': 83666, 'loss/train': 1.457987904548645} 11/07/2021 08:59:26 - INFO - __main__ - Step 83668: {'lr': 0.00020949256300708958, 'samples': 16064256, 'steps': 83667, 'loss/train': 1.066782832145691} 11/07/2021 08:59:27 - INFO - __main__ - Step 83669: {'lr': 0.0002094873263958581, 'samples': 16064448, 'steps': 83668, 'loss/train': 0.7105569839477539} 11/07/2021 08:59:28 - INFO - __main__ - Step 83670: {'lr': 0.00020948208980288102, 'samples': 16064640, 'steps': 83669, 'loss/train': 0.9110370874404907} 11/07/2021 08:59:28 - INFO - __main__ - Step 83671: {'lr': 0.00020947685322816068, 'samples': 16064832, 'steps': 83670, 'loss/train': 1.515222430229187} 11/07/2021 08:59:28 - INFO - __main__ - Step 83672: {'lr': 0.00020947161667169957, 'samples': 16065024, 'steps': 83671, 'loss/train': 1.451373815536499} 11/07/2021 08:59:29 - INFO - __main__ - Step 83673: {'lr': 0.00020946638013349977, 'samples': 16065216, 'steps': 83672, 'loss/train': 1.225081205368042} 11/07/2021 08:59:29 - INFO - __main__ - Step 83674: {'lr': 0.0002094611436135638, 'samples': 16065408, 'steps': 83673, 'loss/train': 1.437026023864746} 11/07/2021 08:59:30 - INFO - __main__ - Step 83675: {'lr': 0.00020945590711189406, 'samples': 16065600, 'steps': 83674, 'loss/train': 1.6522597074508667} 11/07/2021 08:59:31 - INFO - __main__ - Step 83676: {'lr': 0.0002094506706284928, 'samples': 16065792, 'steps': 83675, 'loss/train': 0.8937448859214783} 11/07/2021 08:59:31 - INFO - __main__ - Step 83677: {'lr': 0.00020944543416336249, 'samples': 16065984, 'steps': 83676, 'loss/train': 1.2289555072784424} 11/07/2021 08:59:31 - INFO - __main__ - Step 83678: {'lr': 0.0002094401977165054, 'samples': 16066176, 'steps': 83677, 'loss/train': 1.5690863132476807} 11/07/2021 08:59:32 - INFO - __main__ - Step 83679: {'lr': 0.000209434961287924, 'samples': 16066368, 'steps': 83678, 'loss/train': 1.56061851978302} 11/07/2021 08:59:32 - INFO - __main__ - Step 83680: {'lr': 0.0002094297248776205, 'samples': 16066560, 'steps': 83679, 'loss/train': 1.5661020278930664} 11/07/2021 08:59:33 - INFO - __main__ - Step 83681: {'lr': 0.0002094244884855974, 'samples': 16066752, 'steps': 83680, 'loss/train': 0.8864182829856873} 11/07/2021 08:59:34 - INFO - __main__ - Step 83682: {'lr': 0.00020941925211185697, 'samples': 16066944, 'steps': 83681, 'loss/train': 1.3343411684036255} 11/07/2021 08:59:34 - INFO - __main__ - Step 83683: {'lr': 0.00020941401575640163, 'samples': 16067136, 'steps': 83682, 'loss/train': 1.5219364166259766} 11/07/2021 08:59:34 - INFO - __main__ - Step 83684: {'lr': 0.00020940877941923373, 'samples': 16067328, 'steps': 83683, 'loss/train': 1.4095520973205566} 11/07/2021 08:59:35 - INFO - __main__ - Step 83685: {'lr': 0.0002094035431003556, 'samples': 16067520, 'steps': 83684, 'loss/train': 1.0382230281829834} 11/07/2021 08:59:37 - INFO - __main__ - Step 83686: {'lr': 0.00020939830679976958, 'samples': 16067712, 'steps': 83685, 'loss/train': 0.49059009552001953} 11/07/2021 08:59:37 - INFO - __main__ - Step 83687: {'lr': 0.00020939307051747803, 'samples': 16067904, 'steps': 83686, 'loss/train': 1.475785732269287} 11/07/2021 08:59:37 - INFO - __main__ - Step 83688: {'lr': 0.00020938783425348333, 'samples': 16068096, 'steps': 83687, 'loss/train': 1.0770094394683838} 11/07/2021 08:59:38 - INFO - __main__ - Step 83689: {'lr': 0.00020938259800778788, 'samples': 16068288, 'steps': 83688, 'loss/train': 0.7488996386528015} 11/07/2021 08:59:38 - INFO - __main__ - Step 83690: {'lr': 0.000209377361780394, 'samples': 16068480, 'steps': 83689, 'loss/train': 0.9735754728317261} 11/07/2021 08:59:38 - INFO - __main__ - Step 83691: {'lr': 0.00020937212557130405, 'samples': 16068672, 'steps': 83690, 'loss/train': 1.54496169090271} 11/07/2021 08:59:39 - INFO - __main__ - Step 83692: {'lr': 0.0002093668893805204, 'samples': 16068864, 'steps': 83691, 'loss/train': 0.7855187058448792} 11/07/2021 08:59:40 - INFO - __main__ - Step 83693: {'lr': 0.00020936165320804538, 'samples': 16069056, 'steps': 83692, 'loss/train': 1.6027950048446655} 11/07/2021 08:59:40 - INFO - __main__ - Step 83694: {'lr': 0.00020935641705388137, 'samples': 16069248, 'steps': 83693, 'loss/train': 1.5322123765945435} 11/07/2021 08:59:41 - INFO - __main__ - Step 83695: {'lr': 0.00020935118091803078, 'samples': 16069440, 'steps': 83694, 'loss/train': 1.3613433837890625} 11/07/2021 08:59:41 - INFO - __main__ - Step 83696: {'lr': 0.0002093459448004959, 'samples': 16069632, 'steps': 83695, 'loss/train': 1.6087080240249634} 11/07/2021 08:59:41 - INFO - __main__ - Step 83697: {'lr': 0.0002093407087012791, 'samples': 16069824, 'steps': 83696, 'loss/train': 1.2440696954727173} 11/07/2021 08:59:42 - INFO - __main__ - Step 83698: {'lr': 0.00020933547262038274, 'samples': 16070016, 'steps': 83697, 'loss/train': 1.6127159595489502} 11/07/2021 08:59:43 - INFO - __main__ - Step 83699: {'lr': 0.00020933023655780926, 'samples': 16070208, 'steps': 83698, 'loss/train': 1.5919214487075806} 11/07/2021 08:59:43 - INFO - __main__ - Step 83700: {'lr': 0.00020932500051356088, 'samples': 16070400, 'steps': 83699, 'loss/train': 1.3813296556472778} 11/07/2021 08:59:43 - INFO - __main__ - Step 83701: {'lr': 0.00020931976448764001, 'samples': 16070592, 'steps': 83700, 'loss/train': 1.3937182426452637} 11/07/2021 08:59:44 - INFO - __main__ - Step 83702: {'lr': 0.00020931452848004905, 'samples': 16070784, 'steps': 83701, 'loss/train': 1.5815602540969849} 11/07/2021 08:59:45 - INFO - __main__ - Step 83703: {'lr': 0.00020930929249079035, 'samples': 16070976, 'steps': 83702, 'loss/train': 1.321686029434204} 11/07/2021 08:59:45 - INFO - __main__ - Step 83704: {'lr': 0.00020930405651986623, 'samples': 16071168, 'steps': 83703, 'loss/train': 1.0986860990524292} 11/07/2021 08:59:46 - INFO - __main__ - Step 83705: {'lr': 0.00020929882056727907, 'samples': 16071360, 'steps': 83704, 'loss/train': 0.8947431445121765} 11/07/2021 08:59:46 - INFO - __main__ - Step 83706: {'lr': 0.0002092935846330313, 'samples': 16071552, 'steps': 83705, 'loss/train': 0.9047760367393494} 11/07/2021 08:59:46 - INFO - __main__ - Step 83707: {'lr': 0.00020928834871712516, 'samples': 16071744, 'steps': 83706, 'loss/train': 1.191382646560669} 11/07/2021 08:59:48 - INFO - __main__ - Step 83708: {'lr': 0.00020928311281956307, 'samples': 16071936, 'steps': 83707, 'loss/train': 0.7546603679656982} 11/07/2021 08:59:48 - INFO - __main__ - Step 83709: {'lr': 0.00020927787694034733, 'samples': 16072128, 'steps': 83708, 'loss/train': 1.35368812084198} 11/07/2021 08:59:48 - INFO - __main__ - Step 83710: {'lr': 0.00020927264107948042, 'samples': 16072320, 'steps': 83709, 'loss/train': 1.6441971063613892} 11/07/2021 08:59:49 - INFO - __main__ - Step 83711: {'lr': 0.00020926740523696458, 'samples': 16072512, 'steps': 83710, 'loss/train': 1.8063139915466309} 11/07/2021 08:59:49 - INFO - __main__ - Step 83712: {'lr': 0.0002092621694128023, 'samples': 16072704, 'steps': 83711, 'loss/train': 1.2478009462356567} 11/07/2021 08:59:50 - INFO - __main__ - Step 83713: {'lr': 0.00020925693360699578, 'samples': 16072896, 'steps': 83712, 'loss/train': 1.692299485206604} 11/07/2021 08:59:50 - INFO - __main__ - Step 83714: {'lr': 0.0002092516978195475, 'samples': 16073088, 'steps': 83713, 'loss/train': 1.7329274415969849} 11/07/2021 08:59:51 - INFO - __main__ - Step 83715: {'lr': 0.00020924646205045972, 'samples': 16073280, 'steps': 83714, 'loss/train': 1.2918683290481567} 11/07/2021 08:59:51 - INFO - __main__ - Step 83716: {'lr': 0.00020924122629973488, 'samples': 16073472, 'steps': 83715, 'loss/train': 1.657727837562561} 11/07/2021 08:59:51 - INFO - __main__ - Step 83717: {'lr': 0.00020923599056737536, 'samples': 16073664, 'steps': 83716, 'loss/train': 1.4094048738479614} 11/07/2021 08:59:52 - INFO - __main__ - Step 83718: {'lr': 0.00020923075485338344, 'samples': 16073856, 'steps': 83717, 'loss/train': 1.6512296199798584} 11/07/2021 08:59:53 - INFO - __main__ - Step 83719: {'lr': 0.0002092255191577615, 'samples': 16074048, 'steps': 83718, 'loss/train': 1.0438659191131592} 11/07/2021 08:59:53 - INFO - __main__ - Step 83720: {'lr': 0.0002092202834805119, 'samples': 16074240, 'steps': 83719, 'loss/train': 1.7097846269607544} 11/07/2021 08:59:53 - INFO - __main__ - Step 83721: {'lr': 0.00020921504782163704, 'samples': 16074432, 'steps': 83720, 'loss/train': 1.5128012895584106} 11/07/2021 08:59:54 - INFO - __main__ - Step 83722: {'lr': 0.00020920981218113923, 'samples': 16074624, 'steps': 83721, 'loss/train': 1.4803990125656128} 11/07/2021 08:59:54 - INFO - __main__ - Step 83723: {'lr': 0.00020920457655902087, 'samples': 16074816, 'steps': 83722, 'loss/train': 1.7054870128631592} 11/07/2021 08:59:55 - INFO - __main__ - Step 83724: {'lr': 0.0002091993409552843, 'samples': 16075008, 'steps': 83723, 'loss/train': 1.707615613937378} 11/07/2021 08:59:55 - INFO - __main__ - Step 83725: {'lr': 0.00020919410536993183, 'samples': 16075200, 'steps': 83724, 'loss/train': 1.2756493091583252} 11/07/2021 08:59:56 - INFO - __main__ - Step 83726: {'lr': 0.00020918886980296594, 'samples': 16075392, 'steps': 83725, 'loss/train': 1.5481704473495483} 11/07/2021 08:59:56 - INFO - __main__ - Step 83727: {'lr': 0.00020918363425438888, 'samples': 16075584, 'steps': 83726, 'loss/train': 1.7949153184890747} 11/07/2021 08:59:56 - INFO - __main__ - Step 83728: {'lr': 0.00020917839872420312, 'samples': 16075776, 'steps': 83727, 'loss/train': 1.6519144773483276} 11/07/2021 08:59:58 - INFO - __main__ - Step 83729: {'lr': 0.00020917316321241084, 'samples': 16075968, 'steps': 83728, 'loss/train': 1.4459229707717896} 11/07/2021 08:59:58 - INFO - __main__ - Step 83730: {'lr': 0.00020916792771901452, 'samples': 16076160, 'steps': 83729, 'loss/train': 1.452683448791504} 11/07/2021 08:59:58 - INFO - __main__ - Step 83731: {'lr': 0.00020916269224401652, 'samples': 16076352, 'steps': 83730, 'loss/train': 1.0103224515914917} 11/07/2021 08:59:59 - INFO - __main__ - Step 83732: {'lr': 0.00020915745678741916, 'samples': 16076544, 'steps': 83731, 'loss/train': 1.8635808229446411} 11/07/2021 08:59:59 - INFO - __main__ - Step 83733: {'lr': 0.00020915222134922483, 'samples': 16076736, 'steps': 83732, 'loss/train': 1.918007731437683} 11/07/2021 09:00:00 - INFO - __main__ - Step 83734: {'lr': 0.00020914698592943586, 'samples': 16076928, 'steps': 83733, 'loss/train': 1.7315771579742432} 11/07/2021 09:00:00 - INFO - __main__ - Step 83735: {'lr': 0.00020914175052805464, 'samples': 16077120, 'steps': 83734, 'loss/train': 1.1469104290008545} 11/07/2021 09:00:01 - INFO - __main__ - Step 83736: {'lr': 0.00020913651514508353, 'samples': 16077312, 'steps': 83735, 'loss/train': 1.7030526399612427} 11/07/2021 09:00:01 - INFO - __main__ - Step 83737: {'lr': 0.00020913127978052488, 'samples': 16077504, 'steps': 83736, 'loss/train': 1.456048607826233} 11/07/2021 09:00:02 - INFO - __main__ - Step 83738: {'lr': 0.00020912604443438102, 'samples': 16077696, 'steps': 83737, 'loss/train': 1.2968586683273315} 11/07/2021 09:00:02 - INFO - __main__ - Step 83739: {'lr': 0.00020912080910665443, 'samples': 16077888, 'steps': 83738, 'loss/train': 1.9577102661132812} 11/07/2021 09:00:03 - INFO - __main__ - Step 83740: {'lr': 0.00020911557379734732, 'samples': 16078080, 'steps': 83739, 'loss/train': 0.9283596277236938} 11/07/2021 09:00:03 - INFO - __main__ - Step 83741: {'lr': 0.00020911033850646205, 'samples': 16078272, 'steps': 83740, 'loss/train': 1.5144710540771484} 11/07/2021 09:00:04 - INFO - __main__ - Step 83742: {'lr': 0.00020910510323400103, 'samples': 16078464, 'steps': 83741, 'loss/train': 0.8770965337753296} 11/07/2021 09:00:04 - INFO - __main__ - Step 83743: {'lr': 0.00020909986797996665, 'samples': 16078656, 'steps': 83742, 'loss/train': 0.5744217038154602} 11/07/2021 09:00:04 - INFO - __main__ - Step 83744: {'lr': 0.00020909463274436122, 'samples': 16078848, 'steps': 83743, 'loss/train': 1.5120652914047241} 11/07/2021 09:00:05 - INFO - __main__ - Step 83745: {'lr': 0.00020908939752718714, 'samples': 16079040, 'steps': 83744, 'loss/train': 1.0456889867782593} 11/07/2021 09:00:06 - INFO - __main__ - Step 83746: {'lr': 0.0002090841623284467, 'samples': 16079232, 'steps': 83745, 'loss/train': 1.6186096668243408} 11/07/2021 09:00:06 - INFO - __main__ - Step 83747: {'lr': 0.00020907892714814235, 'samples': 16079424, 'steps': 83746, 'loss/train': 1.3002930879592896} 11/07/2021 09:00:06 - INFO - __main__ - Step 83748: {'lr': 0.00020907369198627638, 'samples': 16079616, 'steps': 83747, 'loss/train': 0.39319831132888794} 11/07/2021 09:00:07 - INFO - __main__ - Step 83749: {'lr': 0.0002090684568428512, 'samples': 16079808, 'steps': 83748, 'loss/train': 1.5381273031234741} 11/07/2021 09:00:08 - INFO - __main__ - Step 83750: {'lr': 0.00020906322171786914, 'samples': 16080000, 'steps': 83749, 'loss/train': 0.812274694442749} 11/07/2021 09:00:08 - INFO - __main__ - Step 83751: {'lr': 0.00020905798661133252, 'samples': 16080192, 'steps': 83750, 'loss/train': 1.2983275651931763} 11/07/2021 09:00:08 - INFO - __main__ - Step 83752: {'lr': 0.00020905275152324388, 'samples': 16080384, 'steps': 83751, 'loss/train': 1.1989604234695435} 11/07/2021 09:00:09 - INFO - __main__ - Step 83753: {'lr': 0.00020904751645360532, 'samples': 16080576, 'steps': 83752, 'loss/train': 0.7909756898880005} 11/07/2021 09:00:09 - INFO - __main__ - Step 83754: {'lr': 0.0002090422814024193, 'samples': 16080768, 'steps': 83753, 'loss/train': 0.912140965461731} 11/07/2021 09:00:10 - INFO - __main__ - Step 83755: {'lr': 0.00020903704636968822, 'samples': 16080960, 'steps': 83754, 'loss/train': 1.5383822917938232} 11/07/2021 09:00:11 - INFO - __main__ - Step 83756: {'lr': 0.0002090318113554144, 'samples': 16081152, 'steps': 83755, 'loss/train': 0.532385528087616} 11/07/2021 09:00:11 - INFO - __main__ - Step 83757: {'lr': 0.00020902657635960022, 'samples': 16081344, 'steps': 83756, 'loss/train': 1.565487265586853} 11/07/2021 09:00:11 - INFO - __main__ - Step 83758: {'lr': 0.00020902134138224804, 'samples': 16081536, 'steps': 83757, 'loss/train': 1.1650856733322144} 11/07/2021 09:00:12 - INFO - __main__ - Step 83759: {'lr': 0.00020901610642336022, 'samples': 16081728, 'steps': 83758, 'loss/train': 1.5337694883346558} 11/07/2021 09:00:12 - INFO - __main__ - Step 83760: {'lr': 0.00020901087148293907, 'samples': 16081920, 'steps': 83759, 'loss/train': 0.42072299122810364} 11/07/2021 09:00:13 - INFO - __main__ - Step 83761: {'lr': 0.00020900563656098704, 'samples': 16082112, 'steps': 83760, 'loss/train': 1.4498190879821777} 11/07/2021 09:00:13 - INFO - __main__ - Step 83762: {'lr': 0.0002090004016575064, 'samples': 16082304, 'steps': 83761, 'loss/train': 1.7259681224822998} 11/07/2021 09:00:14 - INFO - __main__ - Step 83763: {'lr': 0.00020899516677249955, 'samples': 16082496, 'steps': 83762, 'loss/train': 1.5421323776245117} 11/07/2021 09:00:14 - INFO - __main__ - Step 83764: {'lr': 0.00020898993190596887, 'samples': 16082688, 'steps': 83763, 'loss/train': 1.369718313217163} 11/07/2021 09:00:15 - INFO - __main__ - Step 83765: {'lr': 0.00020898469705791668, 'samples': 16082880, 'steps': 83764, 'loss/train': 1.5514711141586304} 11/07/2021 09:00:16 - INFO - __main__ - Step 83766: {'lr': 0.00020897946222834548, 'samples': 16083072, 'steps': 83765, 'loss/train': 1.631303310394287} 11/07/2021 09:00:16 - INFO - __main__ - Step 83767: {'lr': 0.00020897422741725734, 'samples': 16083264, 'steps': 83766, 'loss/train': 1.588024616241455} 11/07/2021 09:00:17 - INFO - __main__ - Step 83768: {'lr': 0.00020896899262465483, 'samples': 16083456, 'steps': 83767, 'loss/train': 1.204027533531189} 11/07/2021 09:00:17 - INFO - __main__ - Step 83769: {'lr': 0.00020896375785054021, 'samples': 16083648, 'steps': 83768, 'loss/train': 1.6855151653289795} 11/07/2021 09:00:17 - INFO - __main__ - Step 83770: {'lr': 0.0002089585230949159, 'samples': 16083840, 'steps': 83769, 'loss/train': 0.797265350818634} 11/07/2021 09:00:18 - INFO - __main__ - Step 83771: {'lr': 0.0002089532883577843, 'samples': 16084032, 'steps': 83770, 'loss/train': 1.6161431074142456} 11/07/2021 09:00:19 - INFO - __main__ - Step 83772: {'lr': 0.00020894805363914768, 'samples': 16084224, 'steps': 83771, 'loss/train': 1.1342394351959229} 11/07/2021 09:00:19 - INFO - __main__ - Step 83773: {'lr': 0.00020894281893900842, 'samples': 16084416, 'steps': 83772, 'loss/train': 1.2691731452941895} 11/07/2021 09:00:20 - INFO - __main__ - Step 83774: {'lr': 0.0002089375842573689, 'samples': 16084608, 'steps': 83773, 'loss/train': 1.5280275344848633} 11/07/2021 09:00:20 - INFO - __main__ - Step 83775: {'lr': 0.0002089323495942315, 'samples': 16084800, 'steps': 83774, 'loss/train': 2.329075574874878} 11/07/2021 09:00:20 - INFO - __main__ - Step 83776: {'lr': 0.0002089271149495985, 'samples': 16084992, 'steps': 83775, 'loss/train': 1.1015026569366455} 11/07/2021 09:00:21 - INFO - __main__ - Step 83777: {'lr': 0.00020892188032347234, 'samples': 16085184, 'steps': 83776, 'loss/train': 1.366994023323059} 11/07/2021 09:00:22 - INFO - __main__ - Step 83778: {'lr': 0.00020891664571585534, 'samples': 16085376, 'steps': 83777, 'loss/train': 1.7202162742614746} 11/07/2021 09:00:22 - INFO - __main__ - Step 83779: {'lr': 0.00020891141112675, 'samples': 16085568, 'steps': 83778, 'loss/train': 1.8701061010360718} 11/07/2021 09:00:22 - INFO - __main__ - Step 83780: {'lr': 0.0002089061765561584, 'samples': 16085760, 'steps': 83779, 'loss/train': 1.4151183366775513} 11/07/2021 09:00:23 - INFO - __main__ - Step 83781: {'lr': 0.00020890094200408304, 'samples': 16085952, 'steps': 83780, 'loss/train': 0.9866533875465393} 11/07/2021 09:00:24 - INFO - __main__ - Step 83782: {'lr': 0.0002088957074705263, 'samples': 16086144, 'steps': 83781, 'loss/train': 1.504136323928833} 11/07/2021 09:00:24 - INFO - __main__ - Step 83783: {'lr': 0.00020889047295549051, 'samples': 16086336, 'steps': 83782, 'loss/train': 1.5966031551361084} 11/07/2021 09:00:24 - INFO - __main__ - Step 83784: {'lr': 0.00020888523845897806, 'samples': 16086528, 'steps': 83783, 'loss/train': 1.4301835298538208} 11/07/2021 09:00:25 - INFO - __main__ - Step 83785: {'lr': 0.00020888000398099126, 'samples': 16086720, 'steps': 83784, 'loss/train': 1.0477784872055054} 11/07/2021 09:00:25 - INFO - __main__ - Step 83786: {'lr': 0.0002088747695215325, 'samples': 16086912, 'steps': 83785, 'loss/train': 1.2329835891723633} 11/07/2021 09:00:25 - INFO - __main__ - Step 83787: {'lr': 0.00020886953508060413, 'samples': 16087104, 'steps': 83786, 'loss/train': 1.4958109855651855} 11/07/2021 09:00:27 - INFO - __main__ - Step 83788: {'lr': 0.00020886430065820852, 'samples': 16087296, 'steps': 83787, 'loss/train': 1.2376539707183838} 11/07/2021 09:00:27 - INFO - __main__ - Step 83789: {'lr': 0.00020885906625434802, 'samples': 16087488, 'steps': 83788, 'loss/train': 1.3169374465942383} 11/07/2021 09:00:27 - INFO - __main__ - Step 83790: {'lr': 0.000208853831869025, 'samples': 16087680, 'steps': 83789, 'loss/train': 1.616309404373169} 11/07/2021 09:00:28 - INFO - __main__ - Step 83791: {'lr': 0.0002088485975022418, 'samples': 16087872, 'steps': 83790, 'loss/train': 1.372228741645813} 11/07/2021 09:00:28 - INFO - __main__ - Step 83792: {'lr': 0.0002088433631540008, 'samples': 16088064, 'steps': 83791, 'loss/train': 1.598220705986023} 11/07/2021 09:00:29 - INFO - __main__ - Step 83793: {'lr': 0.00020883812882430445, 'samples': 16088256, 'steps': 83792, 'loss/train': 1.5644418001174927} 11/07/2021 09:00:29 - INFO - __main__ - Step 83794: {'lr': 0.00020883289451315487, 'samples': 16088448, 'steps': 83793, 'loss/train': 1.6851880550384521} 11/07/2021 09:00:30 - INFO - __main__ - Step 83795: {'lr': 0.00020882766022055458, 'samples': 16088640, 'steps': 83794, 'loss/train': 1.0259793996810913} 11/07/2021 09:00:30 - INFO - __main__ - Step 83796: {'lr': 0.0002088224259465059, 'samples': 16088832, 'steps': 83795, 'loss/train': 1.103283405303955} 11/07/2021 09:00:30 - INFO - __main__ - Step 83797: {'lr': 0.0002088171916910112, 'samples': 16089024, 'steps': 83796, 'loss/train': 1.2786221504211426} 11/07/2021 09:00:31 - INFO - __main__ - Step 83798: {'lr': 0.00020881195745407283, 'samples': 16089216, 'steps': 83797, 'loss/train': 1.3730149269104004} 11/07/2021 09:00:32 - INFO - __main__ - Step 83799: {'lr': 0.0002088067232356932, 'samples': 16089408, 'steps': 83798, 'loss/train': 1.8143819570541382} 11/07/2021 09:00:32 - INFO - __main__ - Step 83800: {'lr': 0.00020880148903587456, 'samples': 16089600, 'steps': 83799, 'loss/train': 0.5166717171669006} 11/07/2021 09:00:32 - INFO - __main__ - Step 83801: {'lr': 0.00020879625485461937, 'samples': 16089792, 'steps': 83800, 'loss/train': 1.3955928087234497} 11/07/2021 09:00:33 - INFO - __main__ - Step 83802: {'lr': 0.00020879102069192997, 'samples': 16089984, 'steps': 83801, 'loss/train': 1.1160906553268433} 11/07/2021 09:00:34 - INFO - __main__ - Step 83803: {'lr': 0.00020878578654780867, 'samples': 16090176, 'steps': 83802, 'loss/train': 1.3237996101379395} 11/07/2021 09:00:34 - INFO - __main__ - Step 83804: {'lr': 0.00020878055242225786, 'samples': 16090368, 'steps': 83803, 'loss/train': 1.460894227027893} 11/07/2021 09:00:34 - INFO - __main__ - Step 83805: {'lr': 0.00020877531831527992, 'samples': 16090560, 'steps': 83804, 'loss/train': 0.6846575140953064} 11/07/2021 09:00:35 - INFO - __main__ - Step 83806: {'lr': 0.00020877008422687726, 'samples': 16090752, 'steps': 83805, 'loss/train': 1.1978201866149902} 11/07/2021 09:00:35 - INFO - __main__ - Step 83807: {'lr': 0.00020876485015705205, 'samples': 16090944, 'steps': 83806, 'loss/train': 1.5161850452423096} 11/07/2021 09:00:36 - INFO - __main__ - Step 83808: {'lr': 0.0002087596161058068, 'samples': 16091136, 'steps': 83807, 'loss/train': 1.2149397134780884} 11/07/2021 09:00:37 - INFO - __main__ - Step 83809: {'lr': 0.00020875438207314378, 'samples': 16091328, 'steps': 83808, 'loss/train': 1.2303067445755005} 11/07/2021 09:00:37 - INFO - __main__ - Step 83810: {'lr': 0.00020874914805906549, 'samples': 16091520, 'steps': 83809, 'loss/train': 1.7069731950759888} 11/07/2021 09:00:37 - INFO - __main__ - Step 83811: {'lr': 0.00020874391406357413, 'samples': 16091712, 'steps': 83810, 'loss/train': 1.4844176769256592} 11/07/2021 09:00:38 - INFO - __main__ - Step 83812: {'lr': 0.00020873868008667212, 'samples': 16091904, 'steps': 83811, 'loss/train': 1.157900333404541} 11/07/2021 09:00:39 - INFO - __main__ - Step 83813: {'lr': 0.00020873344612836186, 'samples': 16092096, 'steps': 83812, 'loss/train': 0.9369563460350037} 11/07/2021 09:00:39 - INFO - __main__ - Step 83814: {'lr': 0.00020872821218864563, 'samples': 16092288, 'steps': 83813, 'loss/train': 1.4853614568710327} 11/07/2021 09:00:39 - INFO - __main__ - Step 83815: {'lr': 0.00020872297826752585, 'samples': 16092480, 'steps': 83814, 'loss/train': 1.8011350631713867} 11/07/2021 09:00:40 - INFO - __main__ - Step 83816: {'lr': 0.00020871774436500486, 'samples': 16092672, 'steps': 83815, 'loss/train': 1.5960434675216675} 11/07/2021 09:00:40 - INFO - __main__ - Step 83817: {'lr': 0.00020871251048108503, 'samples': 16092864, 'steps': 83816, 'loss/train': 1.4905976057052612} 11/07/2021 09:00:40 - INFO - __main__ - Step 83818: {'lr': 0.00020870727661576868, 'samples': 16093056, 'steps': 83817, 'loss/train': 1.0630302429199219} 11/07/2021 09:00:41 - INFO - __main__ - Step 83819: {'lr': 0.00020870204276905827, 'samples': 16093248, 'steps': 83818, 'loss/train': 1.3033182621002197} 11/07/2021 09:00:42 - INFO - __main__ - Step 83820: {'lr': 0.00020869680894095607, 'samples': 16093440, 'steps': 83819, 'loss/train': 1.1226990222930908} 11/07/2021 09:00:42 - INFO - __main__ - Step 83821: {'lr': 0.00020869157513146442, 'samples': 16093632, 'steps': 83820, 'loss/train': 1.0713390111923218} 11/07/2021 09:00:42 - INFO - __main__ - Step 83822: {'lr': 0.00020868634134058568, 'samples': 16093824, 'steps': 83821, 'loss/train': 1.81453537940979} 11/07/2021 09:00:43 - INFO - __main__ - Step 83823: {'lr': 0.00020868110756832225, 'samples': 16094016, 'steps': 83822, 'loss/train': 1.432496428489685} 11/07/2021 09:00:44 - INFO - __main__ - Step 83824: {'lr': 0.00020867587381467645, 'samples': 16094208, 'steps': 83823, 'loss/train': 1.5984445810317993} 11/07/2021 09:00:44 - INFO - __main__ - Step 83825: {'lr': 0.0002086706400796507, 'samples': 16094400, 'steps': 83824, 'loss/train': 1.4301577806472778} 11/07/2021 09:00:44 - INFO - __main__ - Step 83826: {'lr': 0.00020866540636324733, 'samples': 16094592, 'steps': 83825, 'loss/train': 1.1527595520019531} 11/07/2021 09:00:45 - INFO - __main__ - Step 83827: {'lr': 0.00020866017266546867, 'samples': 16094784, 'steps': 83826, 'loss/train': 1.0448111295700073} 11/07/2021 09:00:45 - INFO - __main__ - Step 83828: {'lr': 0.00020865493898631707, 'samples': 16094976, 'steps': 83827, 'loss/train': 1.3536326885223389} 11/07/2021 09:00:46 - INFO - __main__ - Step 83829: {'lr': 0.00020864970532579494, 'samples': 16095168, 'steps': 83828, 'loss/train': 1.6505451202392578} 11/07/2021 09:00:47 - INFO - __main__ - Step 83830: {'lr': 0.00020864447168390468, 'samples': 16095360, 'steps': 83829, 'loss/train': 1.2739068269729614} 11/07/2021 09:00:47 - INFO - __main__ - Step 83831: {'lr': 0.00020863923806064852, 'samples': 16095552, 'steps': 83830, 'loss/train': 1.599412202835083} 11/07/2021 09:00:47 - INFO - __main__ - Step 83832: {'lr': 0.00020863400445602886, 'samples': 16095744, 'steps': 83831, 'loss/train': 1.6197586059570312} 11/07/2021 09:00:48 - INFO - __main__ - Step 83833: {'lr': 0.00020862877087004817, 'samples': 16095936, 'steps': 83832, 'loss/train': 2.2105305194854736} 11/07/2021 09:00:49 - INFO - __main__ - Step 83834: {'lr': 0.00020862353730270866, 'samples': 16096128, 'steps': 83833, 'loss/train': 1.6346933841705322} 11/07/2021 09:00:49 - INFO - __main__ - Step 83835: {'lr': 0.00020861830375401273, 'samples': 16096320, 'steps': 83834, 'loss/train': 1.088067650794983} 11/07/2021 09:00:49 - INFO - __main__ - Step 83836: {'lr': 0.00020861307022396276, 'samples': 16096512, 'steps': 83835, 'loss/train': 1.5044517517089844} 11/07/2021 09:00:50 - INFO - __main__ - Step 83837: {'lr': 0.00020860783671256108, 'samples': 16096704, 'steps': 83836, 'loss/train': 1.3177521228790283} 11/07/2021 09:00:50 - INFO - __main__ - Step 83838: {'lr': 0.00020860260321981013, 'samples': 16096896, 'steps': 83837, 'loss/train': 1.0852220058441162} 11/07/2021 09:00:51 - INFO - __main__ - Step 83839: {'lr': 0.00020859736974571214, 'samples': 16097088, 'steps': 83838, 'loss/train': 1.635308027267456} 11/07/2021 09:00:52 - INFO - __main__ - Step 83840: {'lr': 0.00020859213629026958, 'samples': 16097280, 'steps': 83839, 'loss/train': 1.2573750019073486} 11/07/2021 09:00:52 - INFO - __main__ - Step 83841: {'lr': 0.00020858690285348482, 'samples': 16097472, 'steps': 83840, 'loss/train': 1.6284408569335938} 11/07/2021 09:00:52 - INFO - __main__ - Step 83842: {'lr': 0.00020858166943536007, 'samples': 16097664, 'steps': 83841, 'loss/train': 1.6103852987289429} 11/07/2021 09:00:53 - INFO - __main__ - Step 83843: {'lr': 0.0002085764360358978, 'samples': 16097856, 'steps': 83842, 'loss/train': 1.749546766281128} 11/07/2021 09:00:53 - INFO - __main__ - Step 83844: {'lr': 0.00020857120265510036, 'samples': 16098048, 'steps': 83843, 'loss/train': 1.5271364450454712} 11/07/2021 09:00:54 - INFO - __main__ - Step 83845: {'lr': 0.00020856596929297007, 'samples': 16098240, 'steps': 83844, 'loss/train': 1.852829098701477} 11/07/2021 09:00:54 - INFO - __main__ - Step 83846: {'lr': 0.00020856073594950934, 'samples': 16098432, 'steps': 83845, 'loss/train': 1.3662689924240112} 11/07/2021 09:00:55 - INFO - __main__ - Step 83847: {'lr': 0.00020855550262472057, 'samples': 16098624, 'steps': 83846, 'loss/train': 1.5113133192062378} 11/07/2021 09:00:55 - INFO - __main__ - Step 83848: {'lr': 0.00020855026931860596, 'samples': 16098816, 'steps': 83847, 'loss/train': 1.429944634437561} 11/07/2021 09:00:55 - INFO - __main__ - Step 83849: {'lr': 0.000208545036031168, 'samples': 16099008, 'steps': 83848, 'loss/train': 1.211594820022583} 11/07/2021 09:00:57 - INFO - __main__ - Step 83850: {'lr': 0.00020853980276240895, 'samples': 16099200, 'steps': 83849, 'loss/train': 1.07640540599823} 11/07/2021 09:00:57 - INFO - __main__ - Step 83851: {'lr': 0.00020853456951233133, 'samples': 16099392, 'steps': 83850, 'loss/train': 1.6877421140670776} 11/07/2021 09:00:57 - INFO - __main__ - Step 83852: {'lr': 0.00020852933628093728, 'samples': 16099584, 'steps': 83851, 'loss/train': 1.8176063299179077} 11/07/2021 09:00:58 - INFO - __main__ - Step 83853: {'lr': 0.00020852410306822932, 'samples': 16099776, 'steps': 83852, 'loss/train': 1.433441162109375} 11/07/2021 09:00:58 - INFO - __main__ - Step 83854: {'lr': 0.00020851886987420976, 'samples': 16099968, 'steps': 83853, 'loss/train': 1.369248867034912} 11/07/2021 09:00:59 - INFO - __main__ - Step 83855: {'lr': 0.00020851363669888097, 'samples': 16100160, 'steps': 83854, 'loss/train': 0.6131535172462463} 11/07/2021 09:01:00 - INFO - __main__ - Step 83856: {'lr': 0.00020850840354224526, 'samples': 16100352, 'steps': 83855, 'loss/train': 0.8484835028648376} 11/07/2021 09:01:00 - INFO - __main__ - Step 83857: {'lr': 0.00020850317040430503, 'samples': 16100544, 'steps': 83856, 'loss/train': 1.7474876642227173} 11/07/2021 09:01:00 - INFO - __main__ - Step 83858: {'lr': 0.00020849793728506264, 'samples': 16100736, 'steps': 83857, 'loss/train': 1.4216201305389404} 11/07/2021 09:01:01 - INFO - __main__ - Step 83859: {'lr': 0.00020849270418452044, 'samples': 16100928, 'steps': 83858, 'loss/train': 1.3688045740127563} 11/07/2021 09:01:01 - INFO - __main__ - Step 83860: {'lr': 0.00020848747110268086, 'samples': 16101120, 'steps': 83859, 'loss/train': 1.3453682661056519} 11/07/2021 09:01:02 - INFO - __main__ - Step 83861: {'lr': 0.00020848223803954607, 'samples': 16101312, 'steps': 83860, 'loss/train': 1.3541256189346313} 11/07/2021 09:01:02 - INFO - __main__ - Step 83862: {'lr': 0.00020847700499511865, 'samples': 16101504, 'steps': 83861, 'loss/train': 0.5599523782730103} 11/07/2021 09:01:03 - INFO - __main__ - Step 83863: {'lr': 0.0002084717719694008, 'samples': 16101696, 'steps': 83862, 'loss/train': 1.3027843236923218} 11/07/2021 09:01:03 - INFO - __main__ - Step 83864: {'lr': 0.0002084665389623949, 'samples': 16101888, 'steps': 83863, 'loss/train': 1.5852291584014893} 11/07/2021 09:01:04 - INFO - __main__ - Step 83865: {'lr': 0.00020846130597410335, 'samples': 16102080, 'steps': 83864, 'loss/train': 1.3348530530929565} 11/07/2021 09:01:04 - INFO - __main__ - Step 83866: {'lr': 0.00020845607300452849, 'samples': 16102272, 'steps': 83865, 'loss/train': 1.5627857446670532} 11/07/2021 09:01:05 - INFO - __main__ - Step 83867: {'lr': 0.00020845084005367267, 'samples': 16102464, 'steps': 83866, 'loss/train': 1.7210725545883179} 11/07/2021 09:01:05 - INFO - __main__ - Step 83868: {'lr': 0.0002084456071215383, 'samples': 16102656, 'steps': 83867, 'loss/train': 2.0317418575286865} 11/07/2021 09:01:06 - INFO - __main__ - Step 83869: {'lr': 0.00020844037420812768, 'samples': 16102848, 'steps': 83868, 'loss/train': 1.252623200416565} 11/07/2021 09:01:06 - INFO - __main__ - Step 83870: {'lr': 0.00020843514131344316, 'samples': 16103040, 'steps': 83869, 'loss/train': 0.9782075881958008} 11/07/2021 09:01:07 - INFO - __main__ - Step 83871: {'lr': 0.00020842990843748712, 'samples': 16103232, 'steps': 83870, 'loss/train': 1.7209392786026} 11/07/2021 09:01:07 - INFO - __main__ - Step 83872: {'lr': 0.00020842467558026195, 'samples': 16103424, 'steps': 83871, 'loss/train': 0.6245827078819275} 11/07/2021 09:01:08 - INFO - __main__ - Step 83873: {'lr': 0.0002084194427417701, 'samples': 16103616, 'steps': 83872, 'loss/train': 1.2306087017059326} 11/07/2021 09:01:08 - INFO - __main__ - Step 83874: {'lr': 0.0002084142099220137, 'samples': 16103808, 'steps': 83873, 'loss/train': 1.9318463802337646} 11/07/2021 09:01:08 - INFO - __main__ - Step 83875: {'lr': 0.00020840897712099516, 'samples': 16104000, 'steps': 83874, 'loss/train': 1.3419229984283447} 11/07/2021 09:01:10 - INFO - __main__ - Step 83876: {'lr': 0.00020840374433871695, 'samples': 16104192, 'steps': 83875, 'loss/train': 1.5975602865219116} 11/07/2021 09:01:10 - INFO - __main__ - Step 83877: {'lr': 0.00020839851157518135, 'samples': 16104384, 'steps': 83876, 'loss/train': 1.25478994846344} 11/07/2021 09:01:10 - INFO - __main__ - Step 83878: {'lr': 0.0002083932788303907, 'samples': 16104576, 'steps': 83877, 'loss/train': 1.5634489059448242} 11/07/2021 09:01:11 - INFO - __main__ - Step 83879: {'lr': 0.00020838804610434747, 'samples': 16104768, 'steps': 83878, 'loss/train': 1.1274446249008179} 11/07/2021 09:01:11 - INFO - __main__ - Step 83880: {'lr': 0.00020838281339705393, 'samples': 16104960, 'steps': 83879, 'loss/train': 1.3557802438735962} 11/07/2021 09:01:12 - INFO - __main__ - Step 83881: {'lr': 0.00020837758070851243, 'samples': 16105152, 'steps': 83880, 'loss/train': 1.1514641046524048} 11/07/2021 09:01:12 - INFO - __main__ - Step 83882: {'lr': 0.00020837234803872535, 'samples': 16105344, 'steps': 83881, 'loss/train': 1.3495932817459106} 11/07/2021 09:01:13 - INFO - __main__ - Step 83883: {'lr': 0.00020836711538769505, 'samples': 16105536, 'steps': 83882, 'loss/train': 1.4523934125900269} 11/07/2021 09:01:13 - INFO - __main__ - Step 83884: {'lr': 0.00020836188275542386, 'samples': 16105728, 'steps': 83883, 'loss/train': 1.0607653856277466} 11/07/2021 09:01:13 - INFO - __main__ - Step 83885: {'lr': 0.00020835665014191422, 'samples': 16105920, 'steps': 83884, 'loss/train': 1.481237530708313} 11/07/2021 09:01:14 - INFO - __main__ - Step 83886: {'lr': 0.00020835141754716837, 'samples': 16106112, 'steps': 83885, 'loss/train': 1.8850094079971313} 11/07/2021 09:01:15 - INFO - __main__ - Step 83887: {'lr': 0.00020834618497118888, 'samples': 16106304, 'steps': 83886, 'loss/train': 1.676806926727295} 11/07/2021 09:01:15 - INFO - __main__ - Step 83888: {'lr': 0.00020834095241397782, 'samples': 16106496, 'steps': 83887, 'loss/train': 1.424288272857666} 11/07/2021 09:01:15 - INFO - __main__ - Step 83889: {'lr': 0.0002083357198755377, 'samples': 16106688, 'steps': 83888, 'loss/train': 1.3621623516082764} 11/07/2021 09:01:16 - INFO - __main__ - Step 83890: {'lr': 0.00020833048735587086, 'samples': 16106880, 'steps': 83889, 'loss/train': 1.49019455909729} 11/07/2021 09:01:16 - INFO - __main__ - Step 83891: {'lr': 0.00020832525485497966, 'samples': 16107072, 'steps': 83890, 'loss/train': 1.1436312198638916} 11/07/2021 09:01:17 - INFO - __main__ - Step 83892: {'lr': 0.00020832002237286646, 'samples': 16107264, 'steps': 83891, 'loss/train': 1.7062605619430542} 11/07/2021 09:01:18 - INFO - __main__ - Step 83893: {'lr': 0.00020831478990953361, 'samples': 16107456, 'steps': 83892, 'loss/train': 1.2262623310089111} 11/07/2021 09:01:18 - INFO - __main__ - Step 83894: {'lr': 0.0002083095574649835, 'samples': 16107648, 'steps': 83893, 'loss/train': 0.9489645957946777} 11/07/2021 09:01:18 - INFO - __main__ - Step 83895: {'lr': 0.00020830432503921842, 'samples': 16107840, 'steps': 83894, 'loss/train': 1.3952958583831787} 11/07/2021 09:01:19 - INFO - __main__ - Step 83896: {'lr': 0.00020829909263224078, 'samples': 16108032, 'steps': 83895, 'loss/train': 1.6478849649429321} 11/07/2021 09:01:20 - INFO - __main__ - Step 83897: {'lr': 0.00020829386024405293, 'samples': 16108224, 'steps': 83896, 'loss/train': 0.8292049765586853} 11/07/2021 09:01:20 - INFO - __main__ - Step 83898: {'lr': 0.0002082886278746572, 'samples': 16108416, 'steps': 83897, 'loss/train': 0.7032541632652283} 11/07/2021 09:01:20 - INFO - __main__ - Step 83899: {'lr': 0.000208283395524056, 'samples': 16108608, 'steps': 83898, 'loss/train': 1.3166640996932983} 11/07/2021 09:01:21 - INFO - __main__ - Step 83900: {'lr': 0.00020827816319225176, 'samples': 16108800, 'steps': 83899, 'loss/train': 1.6290268898010254} 11/07/2021 09:01:21 - INFO - __main__ - Step 83901: {'lr': 0.00020827293087924664, 'samples': 16108992, 'steps': 83900, 'loss/train': 0.8092569708824158} 11/07/2021 09:01:22 - INFO - __main__ - Step 83902: {'lr': 0.00020826769858504307, 'samples': 16109184, 'steps': 83901, 'loss/train': 1.573468804359436} 11/07/2021 09:01:22 - INFO - __main__ - Step 83903: {'lr': 0.00020826246630964342, 'samples': 16109376, 'steps': 83902, 'loss/train': 1.4428027868270874} 11/07/2021 09:01:23 - INFO - __main__ - Step 83904: {'lr': 0.00020825723405305008, 'samples': 16109568, 'steps': 83903, 'loss/train': 1.658541202545166} 11/07/2021 09:01:23 - INFO - __main__ - Step 83905: {'lr': 0.0002082520018152654, 'samples': 16109760, 'steps': 83904, 'loss/train': 1.3338252305984497} 11/07/2021 09:01:24 - INFO - __main__ - Step 83906: {'lr': 0.0002082467695962917, 'samples': 16109952, 'steps': 83905, 'loss/train': 1.5728518962860107} 11/07/2021 09:01:25 - INFO - __main__ - Step 83907: {'lr': 0.00020824153739613138, 'samples': 16110144, 'steps': 83906, 'loss/train': 0.9816521406173706} 11/07/2021 09:01:25 - INFO - __main__ - Step 83908: {'lr': 0.00020823630521478676, 'samples': 16110336, 'steps': 83907, 'loss/train': 1.6277347803115845} 11/07/2021 09:01:25 - INFO - __main__ - Step 83909: {'lr': 0.00020823107305226025, 'samples': 16110528, 'steps': 83908, 'loss/train': 1.6712236404418945} 11/07/2021 09:01:26 - INFO - __main__ - Step 83910: {'lr': 0.0002082258409085541, 'samples': 16110720, 'steps': 83909, 'loss/train': 1.4994627237319946} 11/07/2021 09:01:26 - INFO - __main__ - Step 83911: {'lr': 0.00020822060878367084, 'samples': 16110912, 'steps': 83910, 'loss/train': 1.3681895732879639} 11/07/2021 09:01:26 - INFO - __main__ - Step 83912: {'lr': 0.00020821537667761264, 'samples': 16111104, 'steps': 83911, 'loss/train': 1.52574622631073} 11/07/2021 09:01:27 - INFO - __main__ - Step 83913: {'lr': 0.000208210144590382, 'samples': 16111296, 'steps': 83912, 'loss/train': 1.1982486248016357} 11/07/2021 09:01:28 - INFO - __main__ - Step 83914: {'lr': 0.00020820491252198132, 'samples': 16111488, 'steps': 83913, 'loss/train': 1.079163670539856} 11/07/2021 09:01:28 - INFO - __main__ - Step 83915: {'lr': 0.00020819968047241274, 'samples': 16111680, 'steps': 83914, 'loss/train': 1.43185555934906} 11/07/2021 09:01:28 - INFO - __main__ - Step 83916: {'lr': 0.00020819444844167876, 'samples': 16111872, 'steps': 83915, 'loss/train': 1.1272528171539307} 11/07/2021 09:01:29 - INFO - __main__ - Step 83917: {'lr': 0.0002081892164297817, 'samples': 16112064, 'steps': 83916, 'loss/train': 1.5601099729537964} 11/07/2021 09:01:30 - INFO - __main__ - Step 83918: {'lr': 0.00020818398443672395, 'samples': 16112256, 'steps': 83917, 'loss/train': 1.6178767681121826} 11/07/2021 09:01:30 - INFO - __main__ - Step 83919: {'lr': 0.00020817875246250783, 'samples': 16112448, 'steps': 83918, 'loss/train': 1.8135102987289429} 11/07/2021 09:01:31 - INFO - __main__ - Step 83920: {'lr': 0.00020817352050713574, 'samples': 16112640, 'steps': 83919, 'loss/train': 1.4608805179595947} 11/07/2021 09:01:31 - INFO - __main__ - Step 83921: {'lr': 0.00020816828857061, 'samples': 16112832, 'steps': 83920, 'loss/train': 1.4679839611053467} 11/07/2021 09:01:31 - INFO - __main__ - Step 83922: {'lr': 0.000208163056652933, 'samples': 16113024, 'steps': 83921, 'loss/train': 1.2754074335098267} 11/07/2021 09:01:34 - INFO - __main__ - Step 83923: {'lr': 0.00020815782475410707, 'samples': 16113216, 'steps': 83922, 'loss/train': 1.4832444190979004} 11/07/2021 09:01:34 - INFO - __main__ - Step 83924: {'lr': 0.00020815259287413457, 'samples': 16113408, 'steps': 83923, 'loss/train': 1.4476499557495117} 11/07/2021 09:01:34 - INFO - __main__ - Step 83925: {'lr': 0.0002081473610130179, 'samples': 16113600, 'steps': 83924, 'loss/train': 1.0786272287368774} 11/07/2021 09:01:35 - INFO - __main__ - Step 83926: {'lr': 0.00020814212917075935, 'samples': 16113792, 'steps': 83925, 'loss/train': 1.6553009748458862} 11/07/2021 09:01:35 - INFO - __main__ - Step 83927: {'lr': 0.00020813689734736142, 'samples': 16113984, 'steps': 83926, 'loss/train': 1.0292274951934814} 11/07/2021 09:01:35 - INFO - __main__ - Step 83928: {'lr': 0.00020813166554282624, 'samples': 16114176, 'steps': 83927, 'loss/train': 2.0323119163513184} 11/07/2021 09:01:36 - INFO - __main__ - Step 83929: {'lr': 0.00020812643375715635, 'samples': 16114368, 'steps': 83928, 'loss/train': 0.7720268368721008} 11/07/2021 09:01:36 - INFO - __main__ - Step 83930: {'lr': 0.00020812120199035396, 'samples': 16114560, 'steps': 83929, 'loss/train': 0.6830273270606995} 11/07/2021 09:01:36 - INFO - __main__ - Step 83931: {'lr': 0.00020811597024242157, 'samples': 16114752, 'steps': 83930, 'loss/train': 0.7245432734489441} 11/07/2021 09:01:37 - INFO - __main__ - Step 83932: {'lr': 0.00020811073851336142, 'samples': 16114944, 'steps': 83931, 'loss/train': 1.025283694267273} 11/07/2021 09:01:38 - INFO - __main__ - Step 83933: {'lr': 0.00020810550680317596, 'samples': 16115136, 'steps': 83932, 'loss/train': 1.4968065023422241} 11/07/2021 09:01:38 - INFO - __main__ - Step 83934: {'lr': 0.00020810027511186752, 'samples': 16115328, 'steps': 83933, 'loss/train': 1.5567044019699097} 11/07/2021 09:01:39 - INFO - __main__ - Step 83935: {'lr': 0.00020809504343943848, 'samples': 16115520, 'steps': 83934, 'loss/train': 1.6025570631027222} 11/07/2021 09:01:39 - INFO - __main__ - Step 83936: {'lr': 0.0002080898117858911, 'samples': 16115712, 'steps': 83935, 'loss/train': 1.758922815322876} 11/07/2021 09:01:40 - INFO - __main__ - Step 83937: {'lr': 0.00020808458015122782, 'samples': 16115904, 'steps': 83936, 'loss/train': 1.3902573585510254} 11/07/2021 09:01:40 - INFO - __main__ - Step 83938: {'lr': 0.00020807934853545103, 'samples': 16116096, 'steps': 83937, 'loss/train': 1.398813247680664} 11/07/2021 09:01:41 - INFO - __main__ - Step 83939: {'lr': 0.00020807411693856299, 'samples': 16116288, 'steps': 83938, 'loss/train': 1.128726601600647} 11/07/2021 09:01:41 - INFO - __main__ - Step 83940: {'lr': 0.0002080688853605661, 'samples': 16116480, 'steps': 83939, 'loss/train': 1.5802743434906006} 11/07/2021 09:01:41 - INFO - __main__ - Step 83941: {'lr': 0.00020806365380146287, 'samples': 16116672, 'steps': 83940, 'loss/train': 1.5130760669708252} 11/07/2021 09:01:42 - INFO - __main__ - Step 83942: {'lr': 0.00020805842226125537, 'samples': 16116864, 'steps': 83941, 'loss/train': 2.1089460849761963} 11/07/2021 09:01:43 - INFO - __main__ - Step 83943: {'lr': 0.00020805319073994612, 'samples': 16117056, 'steps': 83942, 'loss/train': 1.4947805404663086} 11/07/2021 09:01:43 - INFO - __main__ - Step 83944: {'lr': 0.0002080479592375374, 'samples': 16117248, 'steps': 83943, 'loss/train': 1.1573902368545532} 11/07/2021 09:01:44 - INFO - __main__ - Step 83945: {'lr': 0.00020804272775403168, 'samples': 16117440, 'steps': 83944, 'loss/train': 1.1340796947479248} 11/07/2021 09:01:44 - INFO - __main__ - Step 83946: {'lr': 0.00020803749628943124, 'samples': 16117632, 'steps': 83945, 'loss/train': 1.0379844903945923} 11/07/2021 09:01:45 - INFO - __main__ - Step 83947: {'lr': 0.00020803226484373847, 'samples': 16117824, 'steps': 83946, 'loss/train': 1.290690302848816} 11/07/2021 09:01:45 - INFO - __main__ - Step 83948: {'lr': 0.00020802703341695573, 'samples': 16118016, 'steps': 83947, 'loss/train': 1.7251665592193604} 11/07/2021 09:01:46 - INFO - __main__ - Step 83949: {'lr': 0.00020802180200908533, 'samples': 16118208, 'steps': 83948, 'loss/train': 1.4326674938201904} 11/07/2021 09:01:46 - INFO - __main__ - Step 83950: {'lr': 0.00020801657062012965, 'samples': 16118400, 'steps': 83949, 'loss/train': 1.5216772556304932} 11/07/2021 09:01:46 - INFO - __main__ - Step 83951: {'lr': 0.00020801133925009107, 'samples': 16118592, 'steps': 83950, 'loss/train': 1.3588011264801025} 11/07/2021 09:01:47 - INFO - __main__ - Step 83952: {'lr': 0.00020800610789897196, 'samples': 16118784, 'steps': 83951, 'loss/train': 0.6293691992759705} 11/07/2021 09:01:48 - INFO - __main__ - Step 83953: {'lr': 0.00020800087656677467, 'samples': 16118976, 'steps': 83952, 'loss/train': 1.2236673831939697} 11/07/2021 09:01:48 - INFO - __main__ - Step 83954: {'lr': 0.00020799564525350153, 'samples': 16119168, 'steps': 83953, 'loss/train': 1.6369707584381104} 11/07/2021 09:01:49 - INFO - __main__ - Step 83955: {'lr': 0.00020799041395915484, 'samples': 16119360, 'steps': 83954, 'loss/train': 1.3965636491775513} 11/07/2021 09:01:49 - INFO - __main__ - Step 83956: {'lr': 0.00020798518268373706, 'samples': 16119552, 'steps': 83955, 'loss/train': 1.4301732778549194} 11/07/2021 09:01:49 - INFO - __main__ - Step 83957: {'lr': 0.00020797995142725052, 'samples': 16119744, 'steps': 83956, 'loss/train': 1.1903551816940308} 11/07/2021 09:01:50 - INFO - __main__ - Step 83958: {'lr': 0.00020797472018969752, 'samples': 16119936, 'steps': 83957, 'loss/train': 1.384149432182312} 11/07/2021 09:01:51 - INFO - __main__ - Step 83959: {'lr': 0.0002079694889710805, 'samples': 16120128, 'steps': 83958, 'loss/train': 1.7000943422317505} 11/07/2021 09:01:51 - INFO - __main__ - Step 83960: {'lr': 0.00020796425777140173, 'samples': 16120320, 'steps': 83959, 'loss/train': 1.7113362550735474} 11/07/2021 09:01:51 - INFO - __main__ - Step 83961: {'lr': 0.00020795902659066366, 'samples': 16120512, 'steps': 83960, 'loss/train': 2.2575185298919678} 11/07/2021 09:01:52 - INFO - __main__ - Step 83962: {'lr': 0.0002079537954288686, 'samples': 16120704, 'steps': 83961, 'loss/train': 1.2966182231903076} 11/07/2021 09:01:53 - INFO - __main__ - Step 83963: {'lr': 0.00020794856428601888, 'samples': 16120896, 'steps': 83962, 'loss/train': 0.699146032333374} 11/07/2021 09:01:53 - INFO - __main__ - Step 83964: {'lr': 0.000207943333162117, 'samples': 16121088, 'steps': 83963, 'loss/train': 1.746199369430542} 11/07/2021 09:01:54 - INFO - __main__ - Step 83965: {'lr': 0.0002079381020571651, 'samples': 16121280, 'steps': 83964, 'loss/train': 1.5744060277938843} 11/07/2021 09:01:54 - INFO - __main__ - Step 83966: {'lr': 0.00020793287097116563, 'samples': 16121472, 'steps': 83965, 'loss/train': 1.4332010746002197} 11/07/2021 09:01:54 - INFO - __main__ - Step 83967: {'lr': 0.00020792763990412101, 'samples': 16121664, 'steps': 83966, 'loss/train': 1.1951569318771362} 11/07/2021 09:01:55 - INFO - __main__ - Step 83968: {'lr': 0.0002079224088560336, 'samples': 16121856, 'steps': 83967, 'loss/train': 1.566225528717041} 11/07/2021 09:01:56 - INFO - __main__ - Step 83969: {'lr': 0.0002079171778269056, 'samples': 16122048, 'steps': 83968, 'loss/train': 1.0769503116607666} 11/07/2021 09:01:56 - INFO - __main__ - Step 83970: {'lr': 0.0002079119468167395, 'samples': 16122240, 'steps': 83969, 'loss/train': 1.4037474393844604} 11/07/2021 09:01:56 - INFO - __main__ - Step 83971: {'lr': 0.00020790671582553762, 'samples': 16122432, 'steps': 83970, 'loss/train': 1.6328234672546387} 11/07/2021 09:01:57 - INFO - __main__ - Step 83972: {'lr': 0.00020790148485330234, 'samples': 16122624, 'steps': 83971, 'loss/train': 1.5418490171432495} 11/07/2021 09:01:58 - INFO - __main__ - Step 83973: {'lr': 0.000207896253900036, 'samples': 16122816, 'steps': 83972, 'loss/train': 1.383674144744873} 11/07/2021 09:01:59 - INFO - __main__ - Step 83974: {'lr': 0.00020789102296574094, 'samples': 16123008, 'steps': 83973, 'loss/train': 1.6535053253173828} 11/07/2021 09:01:59 - INFO - __main__ - Step 83975: {'lr': 0.0002078857920504196, 'samples': 16123200, 'steps': 83974, 'loss/train': 1.3789656162261963} 11/07/2021 09:01:59 - INFO - __main__ - Step 83976: {'lr': 0.0002078805611540742, 'samples': 16123392, 'steps': 83975, 'loss/train': 2.170715808868408} 11/07/2021 09:02:00 - INFO - __main__ - Step 83977: {'lr': 0.00020787533027670719, 'samples': 16123584, 'steps': 83976, 'loss/train': 1.5323517322540283} 11/07/2021 09:02:00 - INFO - __main__ - Step 83978: {'lr': 0.0002078700994183209, 'samples': 16123776, 'steps': 83977, 'loss/train': 1.6684435606002808} 11/07/2021 09:02:01 - INFO - __main__ - Step 83979: {'lr': 0.0002078648685789177, 'samples': 16123968, 'steps': 83978, 'loss/train': 1.2355632781982422} 11/07/2021 09:02:01 - INFO - __main__ - Step 83980: {'lr': 0.00020785963775849992, 'samples': 16124160, 'steps': 83979, 'loss/train': 1.4111213684082031} 11/07/2021 09:02:02 - INFO - __main__ - Step 83981: {'lr': 0.00020785440695707003, 'samples': 16124352, 'steps': 83980, 'loss/train': 1.430979609489441} 11/07/2021 09:02:02 - INFO - __main__ - Step 83982: {'lr': 0.0002078491761746302, 'samples': 16124544, 'steps': 83981, 'loss/train': 1.3379125595092773} 11/07/2021 09:02:02 - INFO - __main__ - Step 83983: {'lr': 0.0002078439454111829, 'samples': 16124736, 'steps': 83982, 'loss/train': 1.5852185487747192} 11/07/2021 09:02:04 - INFO - __main__ - Step 83984: {'lr': 0.00020783871466673046, 'samples': 16124928, 'steps': 83983, 'loss/train': 1.0069197416305542} 11/07/2021 09:02:04 - INFO - __main__ - Step 83985: {'lr': 0.00020783348394127528, 'samples': 16125120, 'steps': 83984, 'loss/train': 1.2424628734588623} 11/07/2021 09:02:04 - INFO - __main__ - Step 83986: {'lr': 0.00020782825323481969, 'samples': 16125312, 'steps': 83985, 'loss/train': 1.4811630249023438} 11/07/2021 09:02:05 - INFO - __main__ - Step 83987: {'lr': 0.00020782302254736598, 'samples': 16125504, 'steps': 83986, 'loss/train': 1.614022135734558} 11/07/2021 09:02:05 - INFO - __main__ - Step 83988: {'lr': 0.00020781779187891659, 'samples': 16125696, 'steps': 83987, 'loss/train': 0.9952728152275085} 11/07/2021 09:02:05 - INFO - __main__ - Step 83989: {'lr': 0.00020781256122947385, 'samples': 16125888, 'steps': 83988, 'loss/train': 1.3835177421569824} 11/07/2021 09:02:07 - INFO - __main__ - Step 83990: {'lr': 0.00020780733059904013, 'samples': 16126080, 'steps': 83989, 'loss/train': 2.5435495376586914} 11/07/2021 09:02:07 - INFO - __main__ - Step 83991: {'lr': 0.00020780209998761773, 'samples': 16126272, 'steps': 83990, 'loss/train': 1.5798200368881226} 11/07/2021 09:02:07 - INFO - __main__ - Step 83992: {'lr': 0.0002077968693952091, 'samples': 16126464, 'steps': 83991, 'loss/train': 1.6318475008010864} 11/07/2021 09:02:08 - INFO - __main__ - Step 83993: {'lr': 0.00020779163882181655, 'samples': 16126656, 'steps': 83992, 'loss/train': 1.4164519309997559} 11/07/2021 09:02:08 - INFO - __main__ - Step 83994: {'lr': 0.00020778640826744243, 'samples': 16126848, 'steps': 83993, 'loss/train': 1.130423665046692} 11/07/2021 09:02:09 - INFO - __main__ - Step 83995: {'lr': 0.0002077811777320891, 'samples': 16127040, 'steps': 83994, 'loss/train': 1.6549595594406128} 11/07/2021 09:02:09 - INFO - __main__ - Step 83996: {'lr': 0.00020777594721575892, 'samples': 16127232, 'steps': 83995, 'loss/train': 1.3134655952453613} 11/07/2021 09:02:10 - INFO - __main__ - Step 83997: {'lr': 0.0002077707167184543, 'samples': 16127424, 'steps': 83996, 'loss/train': 1.6702651977539062} 11/07/2021 09:02:10 - INFO - __main__ - Step 83998: {'lr': 0.0002077654862401775, 'samples': 16127616, 'steps': 83997, 'loss/train': 1.0948961973190308} 11/07/2021 09:02:10 - INFO - __main__ - Step 83999: {'lr': 0.0002077602557809309, 'samples': 16127808, 'steps': 83998, 'loss/train': 1.1822878122329712} 11/07/2021 09:02:11 - INFO - __main__ - Step 84000: {'lr': 0.00020775502534071686, 'samples': 16128000, 'steps': 83999, 'loss/train': 1.1325435638427734} 11/07/2021 09:02:12 - INFO - __main__ - Step 84001: {'lr': 0.00020774979491953777, 'samples': 16128192, 'steps': 84000, 'loss/train': 0.8687522411346436} 11/07/2021 09:02:12 - INFO - __main__ - Step 84002: {'lr': 0.00020774456451739599, 'samples': 16128384, 'steps': 84001, 'loss/train': 0.9000738263130188} 11/07/2021 09:02:12 - INFO - __main__ - Step 84003: {'lr': 0.00020773933413429383, 'samples': 16128576, 'steps': 84002, 'loss/train': 0.6384520530700684} 11/07/2021 09:02:13 - INFO - __main__ - Step 84004: {'lr': 0.00020773410377023367, 'samples': 16128768, 'steps': 84003, 'loss/train': 1.3517948389053345} 11/07/2021 09:02:14 - INFO - __main__ - Step 84005: {'lr': 0.0002077288734252179, 'samples': 16128960, 'steps': 84004, 'loss/train': 1.4163153171539307} 11/07/2021 09:02:14 - INFO - __main__ - Step 84006: {'lr': 0.0002077236430992488, 'samples': 16129152, 'steps': 84005, 'loss/train': 1.5560685396194458} 11/07/2021 09:02:15 - INFO - __main__ - Step 84007: {'lr': 0.00020771841279232885, 'samples': 16129344, 'steps': 84006, 'loss/train': 1.1921495199203491} 11/07/2021 09:02:15 - INFO - __main__ - Step 84008: {'lr': 0.00020771318250446035, 'samples': 16129536, 'steps': 84007, 'loss/train': 1.4096578359603882} 11/07/2021 09:02:15 - INFO - __main__ - Step 84009: {'lr': 0.0002077079522356456, 'samples': 16129728, 'steps': 84008, 'loss/train': 1.0161938667297363} 11/07/2021 09:02:16 - INFO - __main__ - Step 84010: {'lr': 0.00020770272198588697, 'samples': 16129920, 'steps': 84009, 'loss/train': 1.3735771179199219} 11/07/2021 09:02:17 - INFO - __main__ - Step 84011: {'lr': 0.00020769749175518682, 'samples': 16130112, 'steps': 84010, 'loss/train': 1.196280598640442} 11/07/2021 09:02:17 - INFO - __main__ - Step 84012: {'lr': 0.00020769226154354755, 'samples': 16130304, 'steps': 84011, 'loss/train': 1.72838294506073} 11/07/2021 09:02:17 - INFO - __main__ - Step 84013: {'lr': 0.0002076870313509715, 'samples': 16130496, 'steps': 84012, 'loss/train': 1.3834309577941895} 11/07/2021 09:02:18 - INFO - __main__ - Step 84014: {'lr': 0.000207681801177461, 'samples': 16130688, 'steps': 84013, 'loss/train': 1.5580400228500366} 11/07/2021 09:02:19 - INFO - __main__ - Step 84015: {'lr': 0.00020767657102301846, 'samples': 16130880, 'steps': 84014, 'loss/train': 1.322243094444275} 11/07/2021 09:02:19 - INFO - __main__ - Step 84016: {'lr': 0.00020767134088764617, 'samples': 16131072, 'steps': 84015, 'loss/train': 1.7181251049041748} 11/07/2021 09:02:19 - INFO - __main__ - Step 84017: {'lr': 0.00020766611077134654, 'samples': 16131264, 'steps': 84016, 'loss/train': 1.3413203954696655} 11/07/2021 09:02:20 - INFO - __main__ - Step 84018: {'lr': 0.0002076608806741219, 'samples': 16131456, 'steps': 84017, 'loss/train': 1.4150772094726562} 11/07/2021 09:02:20 - INFO - __main__ - Step 84019: {'lr': 0.0002076556505959746, 'samples': 16131648, 'steps': 84018, 'loss/train': 1.3461328744888306} 11/07/2021 09:02:21 - INFO - __main__ - Step 84020: {'lr': 0.00020765042053690703, 'samples': 16131840, 'steps': 84019, 'loss/train': 1.2950961589813232} 11/07/2021 09:02:22 - INFO - __main__ - Step 84021: {'lr': 0.00020764519049692163, 'samples': 16132032, 'steps': 84020, 'loss/train': 1.4383794069290161} 11/07/2021 09:02:22 - INFO - __main__ - Step 84022: {'lr': 0.00020763996047602054, 'samples': 16132224, 'steps': 84021, 'loss/train': 1.6784424781799316} 11/07/2021 09:02:22 - INFO - __main__ - Step 84023: {'lr': 0.00020763473047420624, 'samples': 16132416, 'steps': 84022, 'loss/train': 1.73658287525177} 11/07/2021 09:02:23 - INFO - __main__ - Step 84024: {'lr': 0.00020762950049148108, 'samples': 16132608, 'steps': 84023, 'loss/train': 1.4269859790802002} 11/07/2021 09:02:23 - INFO - __main__ - Step 84025: {'lr': 0.00020762427052784741, 'samples': 16132800, 'steps': 84024, 'loss/train': 1.2396020889282227} 11/07/2021 09:02:24 - INFO - __main__ - Step 84026: {'lr': 0.00020761904058330758, 'samples': 16132992, 'steps': 84025, 'loss/train': 1.1237194538116455} 11/07/2021 09:02:24 - INFO - __main__ - Step 84027: {'lr': 0.00020761381065786394, 'samples': 16133184, 'steps': 84026, 'loss/train': 1.6750800609588623} 11/07/2021 09:02:25 - INFO - __main__ - Step 84028: {'lr': 0.0002076085807515189, 'samples': 16133376, 'steps': 84027, 'loss/train': 1.1646788120269775} 11/07/2021 09:02:25 - INFO - __main__ - Step 84029: {'lr': 0.00020760335086427475, 'samples': 16133568, 'steps': 84028, 'loss/train': 1.1679093837738037} 11/07/2021 09:02:25 - INFO - __main__ - Step 84030: {'lr': 0.0002075981209961339, 'samples': 16133760, 'steps': 84029, 'loss/train': 1.303818941116333} 11/07/2021 09:02:26 - INFO - __main__ - Step 84031: {'lr': 0.00020759289114709867, 'samples': 16133952, 'steps': 84030, 'loss/train': 1.032479166984558} 11/07/2021 09:02:27 - INFO - __main__ - Step 84032: {'lr': 0.00020758766131717145, 'samples': 16134144, 'steps': 84031, 'loss/train': 1.554354190826416} 11/07/2021 09:02:27 - INFO - __main__ - Step 84033: {'lr': 0.00020758243150635454, 'samples': 16134336, 'steps': 84032, 'loss/train': 1.1218966245651245} 11/07/2021 09:02:27 - INFO - __main__ - Step 84034: {'lr': 0.00020757720171465035, 'samples': 16134528, 'steps': 84033, 'loss/train': 1.6801116466522217} 11/07/2021 09:02:28 - INFO - __main__ - Step 84035: {'lr': 0.00020757197194206132, 'samples': 16134720, 'steps': 84034, 'loss/train': 1.3565831184387207} 11/07/2021 09:02:29 - INFO - __main__ - Step 84036: {'lr': 0.00020756674218858962, 'samples': 16134912, 'steps': 84035, 'loss/train': 1.105698823928833} 11/07/2021 09:02:29 - INFO - __main__ - Step 84037: {'lr': 0.00020756151245423767, 'samples': 16135104, 'steps': 84036, 'loss/train': 1.2864222526550293} 11/07/2021 09:02:29 - INFO - __main__ - Step 84038: {'lr': 0.00020755628273900784, 'samples': 16135296, 'steps': 84037, 'loss/train': 1.310589075088501} 11/07/2021 09:02:30 - INFO - __main__ - Step 84039: {'lr': 0.0002075510530429025, 'samples': 16135488, 'steps': 84038, 'loss/train': 2.146219253540039} 11/07/2021 09:02:30 - INFO - __main__ - Step 84040: {'lr': 0.000207545823365924, 'samples': 16135680, 'steps': 84039, 'loss/train': 1.5440622568130493} 11/07/2021 09:02:31 - INFO - __main__ - Step 84041: {'lr': 0.0002075405937080747, 'samples': 16135872, 'steps': 84040, 'loss/train': 1.0860517024993896} 11/07/2021 09:02:32 - INFO - __main__ - Step 84042: {'lr': 0.00020753536406935698, 'samples': 16136064, 'steps': 84041, 'loss/train': 1.54399573802948} 11/07/2021 09:02:32 - INFO - __main__ - Step 84043: {'lr': 0.0002075301344497731, 'samples': 16136256, 'steps': 84042, 'loss/train': 1.4584383964538574} 11/07/2021 09:02:32 - INFO - __main__ - Step 84044: {'lr': 0.00020752490484932557, 'samples': 16136448, 'steps': 84043, 'loss/train': 2.3370895385742188} 11/07/2021 09:02:33 - INFO - __main__ - Step 84045: {'lr': 0.0002075196752680166, 'samples': 16136640, 'steps': 84044, 'loss/train': 1.672153115272522} 11/07/2021 09:02:34 - INFO - __main__ - Step 84046: {'lr': 0.00020751444570584864, 'samples': 16136832, 'steps': 84045, 'loss/train': 0.6787330508232117} 11/07/2021 09:02:34 - INFO - __main__ - Step 84047: {'lr': 0.000207509216162824, 'samples': 16137024, 'steps': 84046, 'loss/train': 1.5473229885101318} 11/07/2021 09:02:34 - INFO - __main__ - Step 84048: {'lr': 0.00020750398663894518, 'samples': 16137216, 'steps': 84047, 'loss/train': 1.578579068183899} 11/07/2021 09:02:35 - INFO - __main__ - Step 84049: {'lr': 0.0002074987571342143, 'samples': 16137408, 'steps': 84048, 'loss/train': 1.526269555091858} 11/07/2021 09:02:35 - INFO - __main__ - Step 84050: {'lr': 0.0002074935276486338, 'samples': 16137600, 'steps': 84049, 'loss/train': 1.4556044340133667} 11/07/2021 09:02:35 - INFO - __main__ - Step 84051: {'lr': 0.00020748829818220603, 'samples': 16137792, 'steps': 84050, 'loss/train': 1.3434171676635742} 11/07/2021 09:02:36 - INFO - __main__ - Step 84052: {'lr': 0.00020748306873493344, 'samples': 16137984, 'steps': 84051, 'loss/train': 5.0742645263671875} 11/07/2021 09:02:37 - INFO - __main__ - Step 84053: {'lr': 0.0002074778393068183, 'samples': 16138176, 'steps': 84052, 'loss/train': 1.2925891876220703} 11/07/2021 09:02:37 - INFO - __main__ - Step 84054: {'lr': 0.00020747260989786298, 'samples': 16138368, 'steps': 84053, 'loss/train': 1.092694878578186} 11/07/2021 09:02:37 - INFO - __main__ - Step 84055: {'lr': 0.00020746738050806987, 'samples': 16138560, 'steps': 84054, 'loss/train': 0.7830671668052673} 11/07/2021 09:02:38 - INFO - __main__ - Step 84056: {'lr': 0.0002074621511374413, 'samples': 16138752, 'steps': 84055, 'loss/train': 1.4507032632827759} 11/07/2021 09:02:39 - INFO - __main__ - Step 84057: {'lr': 0.00020745692178597962, 'samples': 16138944, 'steps': 84056, 'loss/train': 1.0075962543487549} 11/07/2021 09:02:39 - INFO - __main__ - Step 84058: {'lr': 0.00020745169245368718, 'samples': 16139136, 'steps': 84057, 'loss/train': 1.095841884613037} 11/07/2021 09:02:40 - INFO - __main__ - Step 84059: {'lr': 0.00020744646314056636, 'samples': 16139328, 'steps': 84058, 'loss/train': 1.799384355545044} 11/07/2021 09:02:40 - INFO - __main__ - Step 84060: {'lr': 0.0002074412338466195, 'samples': 16139520, 'steps': 84059, 'loss/train': 1.401337742805481} 11/07/2021 09:02:40 - INFO - __main__ - Step 84061: {'lr': 0.00020743600457184897, 'samples': 16139712, 'steps': 84060, 'loss/train': 1.238930583000183} 11/07/2021 09:02:42 - INFO - __main__ - Step 84062: {'lr': 0.00020743077531625725, 'samples': 16139904, 'steps': 84061, 'loss/train': 1.9620225429534912} 11/07/2021 09:02:42 - INFO - __main__ - Step 84063: {'lr': 0.00020742554607984642, 'samples': 16140096, 'steps': 84062, 'loss/train': 1.2630794048309326} 11/07/2021 09:02:42 - INFO - __main__ - Step 84064: {'lr': 0.000207420316862619, 'samples': 16140288, 'steps': 84063, 'loss/train': 0.8590725660324097} 11/07/2021 09:02:43 - INFO - __main__ - Step 84065: {'lr': 0.00020741508766457733, 'samples': 16140480, 'steps': 84064, 'loss/train': 1.3556897640228271} 11/07/2021 09:02:43 - INFO - __main__ - Step 84066: {'lr': 0.00020740985848572379, 'samples': 16140672, 'steps': 84065, 'loss/train': 1.5709277391433716} 11/07/2021 09:02:44 - INFO - __main__ - Step 84067: {'lr': 0.00020740462932606067, 'samples': 16140864, 'steps': 84066, 'loss/train': 1.3985358476638794} 11/07/2021 09:02:44 - INFO - __main__ - Step 84068: {'lr': 0.00020739940018559035, 'samples': 16141056, 'steps': 84067, 'loss/train': 1.9161510467529297} 11/07/2021 09:02:45 - INFO - __main__ - Step 84069: {'lr': 0.00020739417106431523, 'samples': 16141248, 'steps': 84068, 'loss/train': 1.36184823513031} 11/07/2021 09:02:45 - INFO - __main__ - Step 84070: {'lr': 0.00020738894196223768, 'samples': 16141440, 'steps': 84069, 'loss/train': 1.5434985160827637} 11/07/2021 09:02:45 - INFO - __main__ - Step 84071: {'lr': 0.00020738371287935998, 'samples': 16141632, 'steps': 84070, 'loss/train': 1.4546475410461426} 11/07/2021 09:02:46 - INFO - __main__ - Step 84072: {'lr': 0.0002073784838156845, 'samples': 16141824, 'steps': 84071, 'loss/train': 1.4337072372436523} 11/07/2021 09:02:47 - INFO - __main__ - Step 84073: {'lr': 0.00020737325477121363, 'samples': 16142016, 'steps': 84072, 'loss/train': 1.1331019401550293} 11/07/2021 09:02:48 - INFO - __main__ - Step 84074: {'lr': 0.0002073680257459497, 'samples': 16142208, 'steps': 84073, 'loss/train': 0.8850939869880676} 11/07/2021 09:02:48 - INFO - __main__ - Step 84075: {'lr': 0.00020736279673989522, 'samples': 16142400, 'steps': 84074, 'loss/train': 1.3821320533752441} 11/07/2021 09:02:48 - INFO - __main__ - Step 84076: {'lr': 0.0002073575677530523, 'samples': 16142592, 'steps': 84075, 'loss/train': 1.407461404800415} 11/07/2021 09:02:49 - INFO - __main__ - Step 84077: {'lr': 0.00020735233878542336, 'samples': 16142784, 'steps': 84076, 'loss/train': 1.440761923789978} 11/07/2021 09:02:49 - INFO - __main__ - Step 84078: {'lr': 0.00020734710983701086, 'samples': 16142976, 'steps': 84077, 'loss/train': 1.7633394002914429} 11/07/2021 09:02:50 - INFO - __main__ - Step 84079: {'lr': 0.00020734188090781706, 'samples': 16143168, 'steps': 84078, 'loss/train': 1.7525116205215454} 11/07/2021 09:02:50 - INFO - __main__ - Step 84080: {'lr': 0.00020733665199784435, 'samples': 16143360, 'steps': 84079, 'loss/train': 0.9082611799240112} 11/07/2021 09:02:51 - INFO - __main__ - Step 84081: {'lr': 0.00020733142310709507, 'samples': 16143552, 'steps': 84080, 'loss/train': 1.1071277856826782} 11/07/2021 09:02:51 - INFO - __main__ - Step 84082: {'lr': 0.0002073261942355716, 'samples': 16143744, 'steps': 84081, 'loss/train': 1.278993844985962} 11/07/2021 09:02:51 - INFO - __main__ - Step 84083: {'lr': 0.0002073209653832763, 'samples': 16143936, 'steps': 84082, 'loss/train': 0.8502169847488403} 11/07/2021 09:02:52 - INFO - __main__ - Step 84084: {'lr': 0.00020731573655021152, 'samples': 16144128, 'steps': 84083, 'loss/train': 1.4849436283111572} 11/07/2021 09:02:53 - INFO - __main__ - Step 84085: {'lr': 0.0002073105077363796, 'samples': 16144320, 'steps': 84084, 'loss/train': 1.2800816297531128} 11/07/2021 09:02:53 - INFO - __main__ - Step 84086: {'lr': 0.00020730527894178292, 'samples': 16144512, 'steps': 84085, 'loss/train': 1.377155065536499} 11/07/2021 09:02:54 - INFO - __main__ - Step 84087: {'lr': 0.00020730005016642377, 'samples': 16144704, 'steps': 84086, 'loss/train': 1.598417043685913} 11/07/2021 09:02:54 - INFO - __main__ - Step 84088: {'lr': 0.00020729482141030467, 'samples': 16144896, 'steps': 84087, 'loss/train': 1.6729761362075806} 11/07/2021 09:02:55 - INFO - __main__ - Step 84089: {'lr': 0.00020728959267342785, 'samples': 16145088, 'steps': 84088, 'loss/train': 1.3572055101394653} 11/07/2021 09:02:55 - INFO - __main__ - Step 84090: {'lr': 0.0002072843639557956, 'samples': 16145280, 'steps': 84089, 'loss/train': 1.497260570526123} 11/07/2021 09:02:56 - INFO - __main__ - Step 84091: {'lr': 0.0002072791352574104, 'samples': 16145472, 'steps': 84090, 'loss/train': 1.3007763624191284} 11/07/2021 09:02:56 - INFO - __main__ - Step 84092: {'lr': 0.00020727390657827456, 'samples': 16145664, 'steps': 84091, 'loss/train': 1.8357245922088623} 11/07/2021 09:02:56 - INFO - __main__ - Step 84093: {'lr': 0.0002072686779183904, 'samples': 16145856, 'steps': 84092, 'loss/train': 1.4199190139770508} 11/07/2021 09:02:57 - INFO - __main__ - Step 84094: {'lr': 0.00020726344927776032, 'samples': 16146048, 'steps': 84093, 'loss/train': 1.2555296421051025} 11/07/2021 09:02:58 - INFO - __main__ - Step 84095: {'lr': 0.00020725822065638673, 'samples': 16146240, 'steps': 84094, 'loss/train': 1.349566102027893} 11/07/2021 09:02:58 - INFO - __main__ - Step 84096: {'lr': 0.00020725299205427185, 'samples': 16146432, 'steps': 84095, 'loss/train': 1.4031115770339966} 11/07/2021 09:02:58 - INFO - __main__ - Step 84097: {'lr': 0.00020724776347141817, 'samples': 16146624, 'steps': 84096, 'loss/train': 1.418949007987976} 11/07/2021 09:02:59 - INFO - __main__ - Step 84098: {'lr': 0.000207242534907828, 'samples': 16146816, 'steps': 84097, 'loss/train': 1.3707011938095093} 11/07/2021 09:03:00 - INFO - __main__ - Step 84099: {'lr': 0.00020723730636350364, 'samples': 16147008, 'steps': 84098, 'loss/train': 1.647952675819397} 11/07/2021 09:03:00 - INFO - __main__ - Step 84100: {'lr': 0.0002072320778384475, 'samples': 16147200, 'steps': 84099, 'loss/train': 1.4119559526443481} 11/07/2021 09:03:00 - INFO - __main__ - Step 84101: {'lr': 0.00020722684933266192, 'samples': 16147392, 'steps': 84100, 'loss/train': 1.3980011940002441} 11/07/2021 09:03:01 - INFO - __main__ - Step 84102: {'lr': 0.0002072216208461493, 'samples': 16147584, 'steps': 84101, 'loss/train': 1.4167249202728271} 11/07/2021 09:03:01 - INFO - __main__ - Step 84103: {'lr': 0.00020721639237891194, 'samples': 16147776, 'steps': 84102, 'loss/train': 1.3050413131713867} 11/07/2021 09:03:01 - INFO - __main__ - Step 84104: {'lr': 0.00020721116393095218, 'samples': 16147968, 'steps': 84103, 'loss/train': 1.3002928495407104} 11/07/2021 09:03:02 - INFO - __main__ - Step 84105: {'lr': 0.00020720593550227243, 'samples': 16148160, 'steps': 84104, 'loss/train': 1.898537278175354} 11/07/2021 09:03:03 - INFO - __main__ - Step 84106: {'lr': 0.00020720070709287502, 'samples': 16148352, 'steps': 84105, 'loss/train': 1.504387378692627} 11/07/2021 09:03:03 - INFO - __main__ - Step 84107: {'lr': 0.00020719547870276232, 'samples': 16148544, 'steps': 84106, 'loss/train': 0.7813427448272705} 11/07/2021 09:03:04 - INFO - __main__ - Step 84108: {'lr': 0.00020719025033193666, 'samples': 16148736, 'steps': 84107, 'loss/train': 1.3694382905960083} 11/07/2021 09:03:04 - INFO - __main__ - Step 84109: {'lr': 0.00020718502198040047, 'samples': 16148928, 'steps': 84108, 'loss/train': 1.9969353675842285} 11/07/2021 09:03:05 - INFO - __main__ - Step 84110: {'lr': 0.00020717979364815597, 'samples': 16149120, 'steps': 84109, 'loss/train': 1.4666900634765625} 11/07/2021 09:03:05 - INFO - __main__ - Step 84111: {'lr': 0.00020717456533520564, 'samples': 16149312, 'steps': 84110, 'loss/train': 1.1052926778793335} 11/07/2021 09:03:06 - INFO - __main__ - Step 84112: {'lr': 0.00020716933704155178, 'samples': 16149504, 'steps': 84111, 'loss/train': 1.0057512521743774} 11/07/2021 09:03:06 - INFO - __main__ - Step 84113: {'lr': 0.00020716410876719674, 'samples': 16149696, 'steps': 84112, 'loss/train': 1.423316240310669} 11/07/2021 09:03:06 - INFO - __main__ - Step 84114: {'lr': 0.00020715888051214292, 'samples': 16149888, 'steps': 84113, 'loss/train': 1.386677861213684} 11/07/2021 09:03:08 - INFO - __main__ - Step 84115: {'lr': 0.00020715365227639266, 'samples': 16150080, 'steps': 84114, 'loss/train': 1.5577625036239624} 11/07/2021 09:03:08 - INFO - __main__ - Step 84116: {'lr': 0.00020714842405994828, 'samples': 16150272, 'steps': 84115, 'loss/train': 1.1383099555969238} 11/07/2021 09:03:08 - INFO - __main__ - Step 84117: {'lr': 0.00020714319586281213, 'samples': 16150464, 'steps': 84116, 'loss/train': 0.6066579222679138} 11/07/2021 09:03:09 - INFO - __main__ - Step 84118: {'lr': 0.00020713796768498662, 'samples': 16150656, 'steps': 84117, 'loss/train': 1.9295562505722046} 11/07/2021 09:03:09 - INFO - __main__ - Step 84119: {'lr': 0.00020713273952647408, 'samples': 16150848, 'steps': 84118, 'loss/train': 3.927175998687744} 11/07/2021 09:03:10 - INFO - __main__ - Step 84120: {'lr': 0.00020712751138727694, 'samples': 16151040, 'steps': 84119, 'loss/train': 1.5921980142593384} 11/07/2021 09:03:10 - INFO - __main__ - Step 84121: {'lr': 0.00020712228326739737, 'samples': 16151232, 'steps': 84120, 'loss/train': 0.8377247452735901} 11/07/2021 09:03:11 - INFO - __main__ - Step 84122: {'lr': 0.00020711705516683788, 'samples': 16151424, 'steps': 84121, 'loss/train': 0.4064866900444031} 11/07/2021 09:03:11 - INFO - __main__ - Step 84123: {'lr': 0.00020711182708560075, 'samples': 16151616, 'steps': 84122, 'loss/train': 1.4990625381469727} 11/07/2021 09:03:11 - INFO - __main__ - Step 84124: {'lr': 0.00020710659902368838, 'samples': 16151808, 'steps': 84123, 'loss/train': 1.2086642980575562} 11/07/2021 09:03:12 - INFO - __main__ - Step 84125: {'lr': 0.00020710137098110316, 'samples': 16152000, 'steps': 84124, 'loss/train': 1.3890042304992676} 11/07/2021 09:03:13 - INFO - __main__ - Step 84126: {'lr': 0.00020709614295784734, 'samples': 16152192, 'steps': 84125, 'loss/train': 1.1785210371017456} 11/07/2021 09:03:13 - INFO - __main__ - Step 84127: {'lr': 0.0002070909149539234, 'samples': 16152384, 'steps': 84126, 'loss/train': 0.943396270275116} 11/07/2021 09:03:13 - INFO - __main__ - Step 84128: {'lr': 0.00020708568696933355, 'samples': 16152576, 'steps': 84127, 'loss/train': 1.707252860069275} 11/07/2021 09:03:14 - INFO - __main__ - Step 84129: {'lr': 0.00020708045900408035, 'samples': 16152768, 'steps': 84128, 'loss/train': 1.2316559553146362} 11/07/2021 09:03:14 - INFO - __main__ - Step 84130: {'lr': 0.00020707523105816596, 'samples': 16152960, 'steps': 84129, 'loss/train': 1.4802801609039307} 11/07/2021 09:03:16 - INFO - __main__ - Step 84131: {'lr': 0.00020707000313159284, 'samples': 16153152, 'steps': 84130, 'loss/train': 2.4859395027160645} 11/07/2021 09:03:16 - INFO - __main__ - Step 84132: {'lr': 0.0002070647752243633, 'samples': 16153344, 'steps': 84131, 'loss/train': 1.2494057416915894} 11/07/2021 09:03:16 - INFO - __main__ - Step 84133: {'lr': 0.00020705954733647966, 'samples': 16153536, 'steps': 84132, 'loss/train': 1.1563069820404053} 11/07/2021 09:03:17 - INFO - __main__ - Step 84134: {'lr': 0.00020705431946794434, 'samples': 16153728, 'steps': 84133, 'loss/train': 1.214955449104309} 11/07/2021 09:03:17 - INFO - __main__ - Step 84135: {'lr': 0.0002070490916187597, 'samples': 16153920, 'steps': 84134, 'loss/train': 0.10818719118833542} 11/07/2021 09:03:18 - INFO - __main__ - Step 84136: {'lr': 0.00020704386378892807, 'samples': 16154112, 'steps': 84135, 'loss/train': 1.2737400531768799} 11/07/2021 09:03:18 - INFO - __main__ - Step 84137: {'lr': 0.0002070386359784518, 'samples': 16154304, 'steps': 84136, 'loss/train': 1.0674996376037598} 11/07/2021 09:03:19 - INFO - __main__ - Step 84138: {'lr': 0.00020703340818733327, 'samples': 16154496, 'steps': 84137, 'loss/train': 1.5893940925598145} 11/07/2021 09:03:19 - INFO - __main__ - Step 84139: {'lr': 0.00020702818041557484, 'samples': 16154688, 'steps': 84138, 'loss/train': 1.6145721673965454} 11/07/2021 09:03:19 - INFO - __main__ - Step 84140: {'lr': 0.00020702295266317882, 'samples': 16154880, 'steps': 84139, 'loss/train': 0.9380039572715759} 11/07/2021 09:03:20 - INFO - __main__ - Step 84141: {'lr': 0.00020701772493014758, 'samples': 16155072, 'steps': 84140, 'loss/train': 1.454075813293457} 11/07/2021 09:03:21 - INFO - __main__ - Step 84142: {'lr': 0.00020701249721648363, 'samples': 16155264, 'steps': 84141, 'loss/train': 1.4706259965896606} 11/07/2021 09:03:21 - INFO - __main__ - Step 84143: {'lr': 0.00020700726952218906, 'samples': 16155456, 'steps': 84142, 'loss/train': 1.501004934310913} 11/07/2021 09:03:22 - INFO - __main__ - Step 84144: {'lr': 0.0002070020418472664, 'samples': 16155648, 'steps': 84143, 'loss/train': 1.384542465209961} 11/07/2021 09:03:22 - INFO - __main__ - Step 84145: {'lr': 0.0002069968141917179, 'samples': 16155840, 'steps': 84144, 'loss/train': 1.6663095951080322} 11/07/2021 09:03:23 - INFO - __main__ - Step 84146: {'lr': 0.000206991586555546, 'samples': 16156032, 'steps': 84145, 'loss/train': 1.3229254484176636} 11/07/2021 09:03:23 - INFO - __main__ - Step 84147: {'lr': 0.000206986358938753, 'samples': 16156224, 'steps': 84146, 'loss/train': 1.4756470918655396} 11/07/2021 09:03:24 - INFO - __main__ - Step 84148: {'lr': 0.0002069811313413413, 'samples': 16156416, 'steps': 84147, 'loss/train': 1.4436397552490234} 11/07/2021 09:03:24 - INFO - __main__ - Step 84149: {'lr': 0.00020697590376331324, 'samples': 16156608, 'steps': 84148, 'loss/train': 1.873944640159607} 11/07/2021 09:03:24 - INFO - __main__ - Step 84150: {'lr': 0.0002069706762046712, 'samples': 16156800, 'steps': 84149, 'loss/train': 1.3731276988983154} 11/07/2021 09:03:25 - INFO - __main__ - Step 84151: {'lr': 0.00020696544866541744, 'samples': 16156992, 'steps': 84150, 'loss/train': 1.7591129541397095} 11/07/2021 09:03:26 - INFO - __main__ - Step 84152: {'lr': 0.00020696022114555443, 'samples': 16157184, 'steps': 84151, 'loss/train': 4.5199503898620605} 11/07/2021 09:03:26 - INFO - __main__ - Step 84153: {'lr': 0.0002069549936450845, 'samples': 16157376, 'steps': 84152, 'loss/train': 1.7121798992156982} 11/07/2021 09:03:26 - INFO - __main__ - Step 84154: {'lr': 0.00020694976616400995, 'samples': 16157568, 'steps': 84153, 'loss/train': 1.5252647399902344} 11/07/2021 09:03:27 - INFO - __main__ - Step 84155: {'lr': 0.00020694453870233318, 'samples': 16157760, 'steps': 84154, 'loss/train': 1.5543408393859863} 11/07/2021 09:03:27 - INFO - __main__ - Step 84156: {'lr': 0.00020693931126005666, 'samples': 16157952, 'steps': 84155, 'loss/train': 1.406838297843933} 11/07/2021 09:03:28 - INFO - __main__ - Step 84157: {'lr': 0.00020693408383718248, 'samples': 16158144, 'steps': 84156, 'loss/train': 1.219146966934204} 11/07/2021 09:03:28 - INFO - __main__ - Step 84158: {'lr': 0.00020692885643371317, 'samples': 16158336, 'steps': 84157, 'loss/train': 1.6558862924575806} 11/07/2021 09:03:29 - INFO - __main__ - Step 84159: {'lr': 0.00020692362904965104, 'samples': 16158528, 'steps': 84158, 'loss/train': 1.7008463144302368} 11/07/2021 09:03:29 - INFO - __main__ - Step 84160: {'lr': 0.00020691840168499844, 'samples': 16158720, 'steps': 84159, 'loss/train': 1.787617802619934} 11/07/2021 09:03:29 - INFO - __main__ - Step 84161: {'lr': 0.00020691317433975777, 'samples': 16158912, 'steps': 84160, 'loss/train': 1.3204874992370605} 11/07/2021 09:03:30 - INFO - __main__ - Step 84162: {'lr': 0.00020690794701393137, 'samples': 16159104, 'steps': 84161, 'loss/train': 1.3281444311141968} 11/07/2021 09:03:31 - INFO - __main__ - Step 84163: {'lr': 0.00020690271970752157, 'samples': 16159296, 'steps': 84162, 'loss/train': 1.4331339597702026} 11/07/2021 09:03:31 - INFO - __main__ - Step 84164: {'lr': 0.00020689749242053075, 'samples': 16159488, 'steps': 84163, 'loss/train': 0.7350309491157532} 11/07/2021 09:03:32 - INFO - __main__ - Step 84165: {'lr': 0.00020689226515296122, 'samples': 16159680, 'steps': 84164, 'loss/train': 1.5950078964233398} 11/07/2021 09:03:32 - INFO - __main__ - Step 84166: {'lr': 0.00020688703790481538, 'samples': 16159872, 'steps': 84165, 'loss/train': 1.3950995206832886} 11/07/2021 09:03:32 - INFO - __main__ - Step 84167: {'lr': 0.00020688181067609558, 'samples': 16160064, 'steps': 84166, 'loss/train': 1.0442917346954346} 11/07/2021 09:03:33 - INFO - __main__ - Step 84168: {'lr': 0.00020687658346680418, 'samples': 16160256, 'steps': 84167, 'loss/train': 0.8612145781517029} 11/07/2021 09:03:34 - INFO - __main__ - Step 84169: {'lr': 0.00020687135627694365, 'samples': 16160448, 'steps': 84168, 'loss/train': 0.15837611258029938} 11/07/2021 09:03:34 - INFO - __main__ - Step 84170: {'lr': 0.00020686612910651608, 'samples': 16160640, 'steps': 84169, 'loss/train': 1.4969780445098877} 11/07/2021 09:03:34 - INFO - __main__ - Step 84171: {'lr': 0.00020686090195552398, 'samples': 16160832, 'steps': 84170, 'loss/train': 1.230628252029419} 11/07/2021 09:03:35 - INFO - __main__ - Step 84172: {'lr': 0.0002068556748239697, 'samples': 16161024, 'steps': 84171, 'loss/train': 1.4358903169631958} 11/07/2021 09:03:36 - INFO - __main__ - Step 84173: {'lr': 0.00020685044771185556, 'samples': 16161216, 'steps': 84172, 'loss/train': 1.0850034952163696} 11/07/2021 09:03:36 - INFO - __main__ - Step 84174: {'lr': 0.000206845220619184, 'samples': 16161408, 'steps': 84173, 'loss/train': 1.5691795349121094} 11/07/2021 09:03:36 - INFO - __main__ - Step 84175: {'lr': 0.00020683999354595726, 'samples': 16161600, 'steps': 84174, 'loss/train': 1.0512441396713257} 11/07/2021 09:03:37 - INFO - __main__ - Step 84176: {'lr': 0.00020683476649217776, 'samples': 16161792, 'steps': 84175, 'loss/train': 1.1672849655151367} 11/07/2021 09:03:37 - INFO - __main__ - Step 84177: {'lr': 0.00020682953945784787, 'samples': 16161984, 'steps': 84176, 'loss/train': 1.3745120763778687} 11/07/2021 09:03:38 - INFO - __main__ - Step 84178: {'lr': 0.0002068243124429699, 'samples': 16162176, 'steps': 84177, 'loss/train': 1.3441749811172485} 11/07/2021 09:03:39 - INFO - __main__ - Step 84179: {'lr': 0.00020681908544754624, 'samples': 16162368, 'steps': 84178, 'loss/train': 1.1874233484268188} 11/07/2021 09:03:39 - INFO - __main__ - Step 84180: {'lr': 0.00020681385847157925, 'samples': 16162560, 'steps': 84179, 'loss/train': 1.573333501815796} 11/07/2021 09:03:39 - INFO - __main__ - Step 84181: {'lr': 0.00020680863151507122, 'samples': 16162752, 'steps': 84180, 'loss/train': 1.501968502998352} 11/07/2021 09:03:40 - INFO - __main__ - Step 84182: {'lr': 0.0002068034045780246, 'samples': 16162944, 'steps': 84181, 'loss/train': 1.2452682256698608} 11/07/2021 09:03:40 - INFO - __main__ - Step 84183: {'lr': 0.0002067981776604418, 'samples': 16163136, 'steps': 84182, 'loss/train': 1.1452828645706177} 11/07/2021 09:03:41 - INFO - __main__ - Step 84184: {'lr': 0.00020679295076232498, 'samples': 16163328, 'steps': 84183, 'loss/train': 1.1777318716049194} 11/07/2021 09:03:41 - INFO - __main__ - Step 84185: {'lr': 0.00020678772388367655, 'samples': 16163520, 'steps': 84184, 'loss/train': 1.375914454460144} 11/07/2021 09:03:42 - INFO - __main__ - Step 84186: {'lr': 0.00020678249702449895, 'samples': 16163712, 'steps': 84185, 'loss/train': 1.5447245836257935} 11/07/2021 09:03:42 - INFO - __main__ - Step 84187: {'lr': 0.00020677727018479446, 'samples': 16163904, 'steps': 84186, 'loss/train': 1.3221206665039062} 11/07/2021 09:03:43 - INFO - __main__ - Step 84188: {'lr': 0.00020677204336456547, 'samples': 16164096, 'steps': 84187, 'loss/train': 1.685765027999878} 11/07/2021 09:03:44 - INFO - __main__ - Step 84189: {'lr': 0.00020676681656381436, 'samples': 16164288, 'steps': 84188, 'loss/train': 1.3992919921875} 11/07/2021 09:03:44 - INFO - __main__ - Step 84190: {'lr': 0.00020676158978254338, 'samples': 16164480, 'steps': 84189, 'loss/train': 1.829296588897705} 11/07/2021 09:03:45 - INFO - __main__ - Step 84191: {'lr': 0.00020675636302075503, 'samples': 16164672, 'steps': 84190, 'loss/train': 1.1903470754623413} 11/07/2021 09:03:45 - INFO - __main__ - Step 84192: {'lr': 0.00020675113627845158, 'samples': 16164864, 'steps': 84191, 'loss/train': 1.9141162633895874} 11/07/2021 09:03:45 - INFO - __main__ - Step 84193: {'lr': 0.00020674590955563539, 'samples': 16165056, 'steps': 84192, 'loss/train': 2.034795045852661} 11/07/2021 09:03:46 - INFO - __main__ - Step 84194: {'lr': 0.00020674068285230884, 'samples': 16165248, 'steps': 84193, 'loss/train': 1.4179593324661255} 11/07/2021 09:03:47 - INFO - __main__ - Step 84195: {'lr': 0.00020673545616847424, 'samples': 16165440, 'steps': 84194, 'loss/train': 1.224536418914795} 11/07/2021 09:03:47 - INFO - __main__ - Step 84196: {'lr': 0.00020673022950413412, 'samples': 16165632, 'steps': 84195, 'loss/train': 0.7744585275650024} 11/07/2021 09:03:47 - INFO - __main__ - Step 84197: {'lr': 0.00020672500285929057, 'samples': 16165824, 'steps': 84196, 'loss/train': 1.821515679359436} 11/07/2021 09:03:48 - INFO - __main__ - Step 84198: {'lr': 0.00020671977623394605, 'samples': 16166016, 'steps': 84197, 'loss/train': 1.409247636795044} 11/07/2021 09:03:48 - INFO - __main__ - Step 84199: {'lr': 0.00020671454962810297, 'samples': 16166208, 'steps': 84198, 'loss/train': 1.4365718364715576} 11/07/2021 09:03:49 - INFO - __main__ - Step 84200: {'lr': 0.0002067093230417636, 'samples': 16166400, 'steps': 84199, 'loss/train': 0.9225144386291504} 11/07/2021 09:03:50 - INFO - __main__ - Step 84201: {'lr': 0.00020670409647493039, 'samples': 16166592, 'steps': 84200, 'loss/train': 1.4496703147888184} 11/07/2021 09:03:50 - INFO - __main__ - Step 84202: {'lr': 0.0002066988699276056, 'samples': 16166784, 'steps': 84201, 'loss/train': 1.728940725326538} 11/07/2021 09:03:50 - INFO - __main__ - Step 84203: {'lr': 0.00020669364339979162, 'samples': 16166976, 'steps': 84202, 'loss/train': 1.3015692234039307} 11/07/2021 09:03:51 - INFO - __main__ - Step 84204: {'lr': 0.00020668841689149088, 'samples': 16167168, 'steps': 84203, 'loss/train': 1.5147496461868286} 11/07/2021 09:03:51 - INFO - __main__ - Step 84205: {'lr': 0.0002066831904027056, 'samples': 16167360, 'steps': 84204, 'loss/train': 1.580589771270752} 11/07/2021 09:03:52 - INFO - __main__ - Step 84206: {'lr': 0.00020667796393343828, 'samples': 16167552, 'steps': 84205, 'loss/train': 1.2682464122772217} 11/07/2021 09:03:52 - INFO - __main__ - Step 84207: {'lr': 0.00020667273748369114, 'samples': 16167744, 'steps': 84206, 'loss/train': 0.9077656865119934} 11/07/2021 09:03:53 - INFO - __main__ - Step 84208: {'lr': 0.0002066675110534666, 'samples': 16167936, 'steps': 84207, 'loss/train': 1.5644689798355103} 11/07/2021 09:03:53 - INFO - __main__ - Step 84209: {'lr': 0.00020666228464276707, 'samples': 16168128, 'steps': 84208, 'loss/train': 1.4380320310592651} 11/07/2021 09:03:53 - INFO - __main__ - Step 84210: {'lr': 0.00020665705825159488, 'samples': 16168320, 'steps': 84209, 'loss/train': 1.1907566785812378} 11/07/2021 09:03:54 - INFO - __main__ - Step 84211: {'lr': 0.0002066518318799523, 'samples': 16168512, 'steps': 84210, 'loss/train': 1.6548572778701782} 11/07/2021 09:03:55 - INFO - __main__ - Step 84212: {'lr': 0.0002066466055278417, 'samples': 16168704, 'steps': 84211, 'loss/train': 0.7173234224319458} 11/07/2021 09:03:55 - INFO - __main__ - Step 84213: {'lr': 0.0002066413791952655, 'samples': 16168896, 'steps': 84212, 'loss/train': 1.6074998378753662} 11/07/2021 09:03:55 - INFO - __main__ - Step 84214: {'lr': 0.000206636152882226, 'samples': 16169088, 'steps': 84213, 'loss/train': 1.5861310958862305} 11/07/2021 09:03:56 - INFO - __main__ - Step 84215: {'lr': 0.00020663092658872558, 'samples': 16169280, 'steps': 84214, 'loss/train': 1.484269380569458} 11/07/2021 09:03:57 - INFO - __main__ - Step 84216: {'lr': 0.0002066257003147666, 'samples': 16169472, 'steps': 84215, 'loss/train': 1.5245999097824097} 11/07/2021 09:03:57 - INFO - __main__ - Step 84217: {'lr': 0.0002066204740603514, 'samples': 16169664, 'steps': 84216, 'loss/train': 1.2986549139022827} 11/07/2021 09:03:57 - INFO - __main__ - Step 84218: {'lr': 0.00020661524782548238, 'samples': 16169856, 'steps': 84217, 'loss/train': 1.3610774278640747} 11/07/2021 09:03:58 - INFO - __main__ - Step 84219: {'lr': 0.00020661002161016185, 'samples': 16170048, 'steps': 84218, 'loss/train': 1.2822520732879639} 11/07/2021 09:03:58 - INFO - __main__ - Step 84220: {'lr': 0.00020660479541439214, 'samples': 16170240, 'steps': 84219, 'loss/train': 1.1822208166122437} 11/07/2021 09:03:59 - INFO - __main__ - Step 84221: {'lr': 0.00020659956923817568, 'samples': 16170432, 'steps': 84220, 'loss/train': 0.8901318907737732} 11/07/2021 09:03:59 - INFO - __main__ - Step 84222: {'lr': 0.00020659434308151482, 'samples': 16170624, 'steps': 84221, 'loss/train': 2.099421739578247} 11/07/2021 09:04:00 - INFO - __main__ - Step 84223: {'lr': 0.0002065891169444119, 'samples': 16170816, 'steps': 84222, 'loss/train': 1.6679720878601074} 11/07/2021 09:04:00 - INFO - __main__ - Step 84224: {'lr': 0.00020658389082686915, 'samples': 16171008, 'steps': 84223, 'loss/train': 1.3517184257507324} 11/07/2021 09:04:00 - INFO - __main__ - Step 84225: {'lr': 0.00020657866472888905, 'samples': 16171200, 'steps': 84224, 'loss/train': 1.5245457887649536} 11/07/2021 09:04:02 - INFO - __main__ - Step 84226: {'lr': 0.00020657343865047395, 'samples': 16171392, 'steps': 84225, 'loss/train': 1.6617850065231323} 11/07/2021 09:04:02 - INFO - __main__ - Step 84227: {'lr': 0.0002065682125916262, 'samples': 16171584, 'steps': 84226, 'loss/train': 1.5843786001205444} 11/07/2021 09:04:02 - INFO - __main__ - Step 84228: {'lr': 0.00020656298655234812, 'samples': 16171776, 'steps': 84227, 'loss/train': 0.9545333385467529} 11/07/2021 09:04:03 - INFO - __main__ - Step 84229: {'lr': 0.00020655776053264208, 'samples': 16171968, 'steps': 84228, 'loss/train': 1.7826658487319946} 11/07/2021 09:04:03 - INFO - __main__ - Step 84230: {'lr': 0.00020655253453251047, 'samples': 16172160, 'steps': 84229, 'loss/train': 1.1055322885513306} 11/07/2021 09:04:04 - INFO - __main__ - Step 84231: {'lr': 0.0002065473085519556, 'samples': 16172352, 'steps': 84230, 'loss/train': 1.1762423515319824} 11/07/2021 09:04:04 - INFO - __main__ - Step 84232: {'lr': 0.00020654208259097983, 'samples': 16172544, 'steps': 84231, 'loss/train': 1.5389280319213867} 11/07/2021 09:04:05 - INFO - __main__ - Step 84233: {'lr': 0.0002065368566495856, 'samples': 16172736, 'steps': 84232, 'loss/train': 1.2851150035858154} 11/07/2021 09:04:05 - INFO - __main__ - Step 84234: {'lr': 0.00020653163072777513, 'samples': 16172928, 'steps': 84233, 'loss/train': 1.3479790687561035} 11/07/2021 09:04:05 - INFO - __main__ - Step 84235: {'lr': 0.00020652640482555086, 'samples': 16173120, 'steps': 84234, 'loss/train': 1.3520891666412354} 11/07/2021 09:04:06 - INFO - __main__ - Step 84236: {'lr': 0.00020652117894291513, 'samples': 16173312, 'steps': 84235, 'loss/train': 0.8678374290466309} 11/07/2021 09:04:07 - INFO - __main__ - Step 84237: {'lr': 0.00020651595307987026, 'samples': 16173504, 'steps': 84236, 'loss/train': 1.3122247457504272} 11/07/2021 09:04:07 - INFO - __main__ - Step 84238: {'lr': 0.00020651072723641865, 'samples': 16173696, 'steps': 84237, 'loss/train': 1.3145235776901245} 11/07/2021 09:04:08 - INFO - __main__ - Step 84239: {'lr': 0.0002065055014125626, 'samples': 16173888, 'steps': 84238, 'loss/train': 1.3763341903686523} 11/07/2021 09:04:08 - INFO - __main__ - Step 84240: {'lr': 0.0002065002756083045, 'samples': 16174080, 'steps': 84239, 'loss/train': 1.2359668016433716} 11/07/2021 09:04:08 - INFO - __main__ - Step 84241: {'lr': 0.00020649504982364673, 'samples': 16174272, 'steps': 84240, 'loss/train': 1.337446689605713} 11/07/2021 09:04:09 - INFO - __main__ - Step 84242: {'lr': 0.00020648982405859162, 'samples': 16174464, 'steps': 84241, 'loss/train': 1.0845826864242554} 11/07/2021 09:04:10 - INFO - __main__ - Step 84243: {'lr': 0.00020648459831314151, 'samples': 16174656, 'steps': 84242, 'loss/train': 1.159540057182312} 11/07/2021 09:04:10 - INFO - __main__ - Step 84244: {'lr': 0.00020647937258729882, 'samples': 16174848, 'steps': 84243, 'loss/train': 1.2457342147827148} 11/07/2021 09:04:10 - INFO - __main__ - Step 84245: {'lr': 0.0002064741468810658, 'samples': 16175040, 'steps': 84244, 'loss/train': 1.3689912557601929} 11/07/2021 09:04:11 - INFO - __main__ - Step 84246: {'lr': 0.00020646892119444485, 'samples': 16175232, 'steps': 84245, 'loss/train': 1.2172884941101074} 11/07/2021 09:04:12 - INFO - __main__ - Step 84247: {'lr': 0.00020646369552743834, 'samples': 16175424, 'steps': 84246, 'loss/train': 1.2006124258041382} 11/07/2021 09:04:12 - INFO - __main__ - Step 84248: {'lr': 0.00020645846988004863, 'samples': 16175616, 'steps': 84247, 'loss/train': 1.384027361869812} 11/07/2021 09:04:12 - INFO - __main__ - Step 84249: {'lr': 0.00020645324425227804, 'samples': 16175808, 'steps': 84248, 'loss/train': 1.7500485181808472} 11/07/2021 09:04:13 - INFO - __main__ - Step 84250: {'lr': 0.00020644801864412902, 'samples': 16176000, 'steps': 84249, 'loss/train': 0.7362756133079529} 11/07/2021 09:04:13 - INFO - __main__ - Step 84251: {'lr': 0.00020644279305560379, 'samples': 16176192, 'steps': 84250, 'loss/train': 1.8603593111038208} 11/07/2021 09:04:14 - INFO - __main__ - Step 84252: {'lr': 0.00020643756748670475, 'samples': 16176384, 'steps': 84251, 'loss/train': 1.541584849357605} 11/07/2021 09:04:14 - INFO - __main__ - Step 84253: {'lr': 0.0002064323419374343, 'samples': 16176576, 'steps': 84252, 'loss/train': 1.2247051000595093} 11/07/2021 09:04:15 - INFO - __main__ - Step 84254: {'lr': 0.00020642711640779475, 'samples': 16176768, 'steps': 84253, 'loss/train': 1.09810471534729} 11/07/2021 09:04:15 - INFO - __main__ - Step 84255: {'lr': 0.00020642189089778852, 'samples': 16176960, 'steps': 84254, 'loss/train': 1.6780749559402466} 11/07/2021 09:04:15 - INFO - __main__ - Step 84256: {'lr': 0.00020641666540741784, 'samples': 16177152, 'steps': 84255, 'loss/train': 1.3433517217636108} 11/07/2021 09:04:16 - INFO - __main__ - Step 84257: {'lr': 0.00020641143993668516, 'samples': 16177344, 'steps': 84256, 'loss/train': 1.2149059772491455} 11/07/2021 09:04:17 - INFO - __main__ - Step 84258: {'lr': 0.00020640621448559282, 'samples': 16177536, 'steps': 84257, 'loss/train': 1.1691991090774536} 11/07/2021 09:04:17 - INFO - __main__ - Step 84259: {'lr': 0.00020640098905414314, 'samples': 16177728, 'steps': 84258, 'loss/train': 1.101273775100708} 11/07/2021 09:04:17 - INFO - __main__ - Step 84260: {'lr': 0.00020639576364233852, 'samples': 16177920, 'steps': 84259, 'loss/train': 1.1925427913665771} 11/07/2021 09:04:18 - INFO - __main__ - Step 84261: {'lr': 0.0002063905382501813, 'samples': 16178112, 'steps': 84260, 'loss/train': 1.5417654514312744} 11/07/2021 09:04:19 - INFO - __main__ - Step 84262: {'lr': 0.00020638531287767384, 'samples': 16178304, 'steps': 84261, 'loss/train': 0.27347531914711} 11/07/2021 09:04:19 - INFO - __main__ - Step 84263: {'lr': 0.00020638008752481851, 'samples': 16178496, 'steps': 84262, 'loss/train': 1.599084496498108} 11/07/2021 09:04:20 - INFO - __main__ - Step 84264: {'lr': 0.0002063748621916176, 'samples': 16178688, 'steps': 84263, 'loss/train': 1.4816017150878906} 11/07/2021 09:04:20 - INFO - __main__ - Step 84265: {'lr': 0.00020636963687807356, 'samples': 16178880, 'steps': 84264, 'loss/train': 1.6907932758331299} 11/07/2021 09:04:20 - INFO - __main__ - Step 84266: {'lr': 0.00020636441158418864, 'samples': 16179072, 'steps': 84265, 'loss/train': 1.328173279762268} 11/07/2021 09:04:21 - INFO - __main__ - Step 84267: {'lr': 0.0002063591863099652, 'samples': 16179264, 'steps': 84266, 'loss/train': 1.653181791305542} 11/07/2021 09:04:22 - INFO - __main__ - Step 84268: {'lr': 0.00020635396105540572, 'samples': 16179456, 'steps': 84267, 'loss/train': 1.036615252494812} 11/07/2021 09:04:22 - INFO - __main__ - Step 84269: {'lr': 0.00020634873582051243, 'samples': 16179648, 'steps': 84268, 'loss/train': 0.8752691149711609} 11/07/2021 09:04:22 - INFO - __main__ - Step 84270: {'lr': 0.0002063435106052877, 'samples': 16179840, 'steps': 84269, 'loss/train': 1.4140894412994385} 11/07/2021 09:04:23 - INFO - __main__ - Step 84271: {'lr': 0.00020633828540973393, 'samples': 16180032, 'steps': 84270, 'loss/train': 1.5698028802871704} 11/07/2021 09:04:23 - INFO - __main__ - Step 84272: {'lr': 0.00020633306023385345, 'samples': 16180224, 'steps': 84271, 'loss/train': 1.5179523229599} 11/07/2021 09:04:24 - INFO - __main__ - Step 84273: {'lr': 0.00020632783507764862, 'samples': 16180416, 'steps': 84272, 'loss/train': 0.7556256651878357} 11/07/2021 09:04:25 - INFO - __main__ - Step 84274: {'lr': 0.0002063226099411218, 'samples': 16180608, 'steps': 84273, 'loss/train': 1.2861852645874023} 11/07/2021 09:04:25 - INFO - __main__ - Step 84275: {'lr': 0.00020631738482427533, 'samples': 16180800, 'steps': 84274, 'loss/train': 1.5812166929244995} 11/07/2021 09:04:25 - INFO - __main__ - Step 84276: {'lr': 0.00020631215972711158, 'samples': 16180992, 'steps': 84275, 'loss/train': 0.24022255837917328} 11/07/2021 09:04:26 - INFO - __main__ - Step 84277: {'lr': 0.000206306934649633, 'samples': 16181184, 'steps': 84276, 'loss/train': 1.2997775077819824} 11/07/2021 09:04:27 - INFO - __main__ - Step 84278: {'lr': 0.00020630170959184174, 'samples': 16181376, 'steps': 84277, 'loss/train': 1.3194048404693604} 11/07/2021 09:04:27 - INFO - __main__ - Step 84279: {'lr': 0.00020629648455374025, 'samples': 16181568, 'steps': 84278, 'loss/train': 1.3434932231903076} 11/07/2021 09:04:27 - INFO - __main__ - Step 84280: {'lr': 0.0002062912595353309, 'samples': 16181760, 'steps': 84279, 'loss/train': 0.9480109214782715} 11/07/2021 09:04:28 - INFO - __main__ - Step 84281: {'lr': 0.00020628603453661605, 'samples': 16181952, 'steps': 84280, 'loss/train': 1.2353681325912476} 11/07/2021 09:04:28 - INFO - __main__ - Step 84282: {'lr': 0.00020628080955759797, 'samples': 16182144, 'steps': 84281, 'loss/train': 0.7597735524177551} 11/07/2021 09:04:29 - INFO - __main__ - Step 84283: {'lr': 0.00020627558459827917, 'samples': 16182336, 'steps': 84282, 'loss/train': 1.592591404914856} 11/07/2021 09:04:29 - INFO - __main__ - Step 84284: {'lr': 0.00020627035965866186, 'samples': 16182528, 'steps': 84283, 'loss/train': 1.5360654592514038} 11/07/2021 09:04:30 - INFO - __main__ - Step 84285: {'lr': 0.00020626513473874847, 'samples': 16182720, 'steps': 84284, 'loss/train': 1.3814654350280762} 11/07/2021 09:04:30 - INFO - __main__ - Step 84286: {'lr': 0.00020625990983854132, 'samples': 16182912, 'steps': 84285, 'loss/train': 1.269883632659912} 11/07/2021 09:04:30 - INFO - __main__ - Step 84287: {'lr': 0.0002062546849580428, 'samples': 16183104, 'steps': 84286, 'loss/train': 1.998978853225708} 11/07/2021 09:04:32 - INFO - __main__ - Step 84288: {'lr': 0.0002062494600972552, 'samples': 16183296, 'steps': 84287, 'loss/train': 1.8079102039337158} 11/07/2021 09:04:32 - INFO - __main__ - Step 84289: {'lr': 0.00020624423525618098, 'samples': 16183488, 'steps': 84288, 'loss/train': 1.4874919652938843} 11/07/2021 09:04:32 - INFO - __main__ - Step 84290: {'lr': 0.0002062390104348225, 'samples': 16183680, 'steps': 84289, 'loss/train': 1.053771734237671} 11/07/2021 09:04:33 - INFO - __main__ - Step 84291: {'lr': 0.00020623378563318197, 'samples': 16183872, 'steps': 84290, 'loss/train': 1.7266865968704224} 11/07/2021 09:04:33 - INFO - __main__ - Step 84292: {'lr': 0.00020622856085126179, 'samples': 16184064, 'steps': 84291, 'loss/train': 1.1438593864440918} 11/07/2021 09:04:34 - INFO - __main__ - Step 84293: {'lr': 0.00020622333608906436, 'samples': 16184256, 'steps': 84292, 'loss/train': 1.4534649848937988} 11/07/2021 09:04:34 - INFO - __main__ - Step 84294: {'lr': 0.00020621811134659203, 'samples': 16184448, 'steps': 84293, 'loss/train': 1.303269386291504} 11/07/2021 09:04:35 - INFO - __main__ - Step 84295: {'lr': 0.0002062128866238471, 'samples': 16184640, 'steps': 84294, 'loss/train': 1.0451678037643433} 11/07/2021 09:04:35 - INFO - __main__ - Step 84296: {'lr': 0.000206207661920832, 'samples': 16184832, 'steps': 84295, 'loss/train': 0.7146129608154297} 11/07/2021 09:04:35 - INFO - __main__ - Step 84297: {'lr': 0.00020620243723754907, 'samples': 16185024, 'steps': 84296, 'loss/train': 1.5244942903518677} 11/07/2021 09:04:36 - INFO - __main__ - Step 84298: {'lr': 0.0002061972125740006, 'samples': 16185216, 'steps': 84297, 'loss/train': 1.1945676803588867} 11/07/2021 09:04:37 - INFO - __main__ - Step 84299: {'lr': 0.000206191987930189, 'samples': 16185408, 'steps': 84298, 'loss/train': 1.5165272951126099} 11/07/2021 09:04:37 - INFO - __main__ - Step 84300: {'lr': 0.00020618676330611663, 'samples': 16185600, 'steps': 84299, 'loss/train': 1.6855483055114746} 11/07/2021 09:04:37 - INFO - __main__ - Step 84301: {'lr': 0.00020618153870178587, 'samples': 16185792, 'steps': 84300, 'loss/train': 1.0881736278533936} 11/07/2021 09:04:38 - INFO - __main__ - Step 84302: {'lr': 0.00020617631411719894, 'samples': 16185984, 'steps': 84301, 'loss/train': 1.2062140703201294} 11/07/2021 09:04:38 - INFO - __main__ - Step 84303: {'lr': 0.00020617108955235837, 'samples': 16186176, 'steps': 84302, 'loss/train': 1.6602360010147095} 11/07/2021 09:04:40 - INFO - __main__ - Step 84304: {'lr': 0.00020616586500726651, 'samples': 16186368, 'steps': 84303, 'loss/train': 1.4869885444641113} 11/07/2021 09:04:40 - INFO - __main__ - Step 84305: {'lr': 0.00020616064048192552, 'samples': 16186560, 'steps': 84304, 'loss/train': 1.2965129613876343} 11/07/2021 09:04:40 - INFO - __main__ - Step 84306: {'lr': 0.00020615541597633787, 'samples': 16186752, 'steps': 84305, 'loss/train': 1.4106370210647583} 11/07/2021 09:04:41 - INFO - __main__ - Step 84307: {'lr': 0.0002061501914905059, 'samples': 16186944, 'steps': 84306, 'loss/train': 1.7062523365020752} 11/07/2021 09:04:41 - INFO - __main__ - Step 84308: {'lr': 0.00020614496702443198, 'samples': 16187136, 'steps': 84307, 'loss/train': 2.366593360900879} 11/07/2021 09:04:42 - INFO - __main__ - Step 84309: {'lr': 0.0002061397425781185, 'samples': 16187328, 'steps': 84308, 'loss/train': 2.19138240814209} 11/07/2021 09:04:43 - INFO - __main__ - Step 84310: {'lr': 0.00020613451815156773, 'samples': 16187520, 'steps': 84309, 'loss/train': 1.1039879322052002} 11/07/2021 09:04:43 - INFO - __main__ - Step 84311: {'lr': 0.0002061292937447821, 'samples': 16187712, 'steps': 84310, 'loss/train': 0.9202563762664795} 11/07/2021 09:04:43 - INFO - __main__ - Step 84312: {'lr': 0.0002061240693577639, 'samples': 16187904, 'steps': 84311, 'loss/train': 1.656429648399353} 11/07/2021 09:04:44 - INFO - __main__ - Step 84313: {'lr': 0.00020611884499051553, 'samples': 16188096, 'steps': 84312, 'loss/train': 0.9054219722747803} 11/07/2021 09:04:44 - INFO - __main__ - Step 84314: {'lr': 0.00020611362064303936, 'samples': 16188288, 'steps': 84313, 'loss/train': 1.2711188793182373} 11/07/2021 09:04:45 - INFO - __main__ - Step 84315: {'lr': 0.00020610839631533768, 'samples': 16188480, 'steps': 84314, 'loss/train': 1.399101734161377} 11/07/2021 09:04:45 - INFO - __main__ - Step 84316: {'lr': 0.0002061031720074129, 'samples': 16188672, 'steps': 84315, 'loss/train': 1.5653401613235474} 11/07/2021 09:04:46 - INFO - __main__ - Step 84317: {'lr': 0.00020609794771926746, 'samples': 16188864, 'steps': 84316, 'loss/train': 1.2711296081542969} 11/07/2021 09:04:46 - INFO - __main__ - Step 84318: {'lr': 0.0002060927234509035, 'samples': 16189056, 'steps': 84317, 'loss/train': 1.1600654125213623} 11/07/2021 09:04:46 - INFO - __main__ - Step 84319: {'lr': 0.00020608749920232345, 'samples': 16189248, 'steps': 84318, 'loss/train': 1.476760745048523} 11/07/2021 09:04:48 - INFO - __main__ - Step 84320: {'lr': 0.00020608227497352971, 'samples': 16189440, 'steps': 84319, 'loss/train': 1.793562889099121} 11/07/2021 09:04:48 - INFO - __main__ - Step 84321: {'lr': 0.00020607705076452465, 'samples': 16189632, 'steps': 84320, 'loss/train': 1.43009352684021} 11/07/2021 09:04:49 - INFO - __main__ - Step 84322: {'lr': 0.00020607182657531056, 'samples': 16189824, 'steps': 84321, 'loss/train': 1.2942428588867188} 11/07/2021 09:04:49 - INFO - __main__ - Step 84323: {'lr': 0.00020606660240588985, 'samples': 16190016, 'steps': 84322, 'loss/train': 1.407281756401062} 11/07/2021 09:04:49 - INFO - __main__ - Step 84324: {'lr': 0.00020606137825626483, 'samples': 16190208, 'steps': 84323, 'loss/train': 0.7062873840332031} 11/07/2021 09:04:50 - INFO - __main__ - Step 84325: {'lr': 0.00020605615412643788, 'samples': 16190400, 'steps': 84324, 'loss/train': 0.3608951270580292} 11/07/2021 09:04:51 - INFO - __main__ - Step 84326: {'lr': 0.00020605093001641137, 'samples': 16190592, 'steps': 84325, 'loss/train': 1.9829468727111816} 11/07/2021 09:04:51 - INFO - __main__ - Step 84327: {'lr': 0.0002060457059261876, 'samples': 16190784, 'steps': 84326, 'loss/train': 1.5841000080108643} 11/07/2021 09:04:51 - INFO - __main__ - Step 84328: {'lr': 0.000206040481855769, 'samples': 16190976, 'steps': 84327, 'loss/train': 1.4343634843826294} 11/07/2021 09:04:52 - INFO - __main__ - Step 84329: {'lr': 0.00020603525780515784, 'samples': 16191168, 'steps': 84328, 'loss/train': 1.6180341243743896} 11/07/2021 09:04:52 - INFO - __main__ - Step 84330: {'lr': 0.00020603003377435653, 'samples': 16191360, 'steps': 84329, 'loss/train': 0.9334760904312134} 11/07/2021 09:04:53 - INFO - __main__ - Step 84331: {'lr': 0.00020602480976336752, 'samples': 16191552, 'steps': 84330, 'loss/train': 1.597582459449768} 11/07/2021 09:04:54 - INFO - __main__ - Step 84332: {'lr': 0.00020601958577219293, 'samples': 16191744, 'steps': 84331, 'loss/train': 0.996949315071106} 11/07/2021 09:04:54 - INFO - __main__ - Step 84333: {'lr': 0.00020601436180083525, 'samples': 16191936, 'steps': 84332, 'loss/train': 1.534002423286438} 11/07/2021 09:04:54 - INFO - __main__ - Step 84334: {'lr': 0.0002060091378492968, 'samples': 16192128, 'steps': 84333, 'loss/train': 0.07072228193283081} 11/07/2021 09:04:55 - INFO - __main__ - Step 84335: {'lr': 0.00020600391391758, 'samples': 16192320, 'steps': 84334, 'loss/train': 1.4494448900222778} 11/07/2021 09:04:55 - INFO - __main__ - Step 84336: {'lr': 0.0002059986900056871, 'samples': 16192512, 'steps': 84335, 'loss/train': 1.2845224142074585} 11/07/2021 09:04:56 - INFO - __main__ - Step 84337: {'lr': 0.00020599346611362054, 'samples': 16192704, 'steps': 84336, 'loss/train': 5.780696392059326} 11/07/2021 09:04:56 - INFO - __main__ - Step 84338: {'lr': 0.00020598824224138265, 'samples': 16192896, 'steps': 84337, 'loss/train': 1.1710150241851807} 11/07/2021 09:04:57 - INFO - __main__ - Step 84339: {'lr': 0.00020598301838897575, 'samples': 16193088, 'steps': 84338, 'loss/train': 1.3957545757293701} 11/07/2021 09:04:57 - INFO - __main__ - Step 84340: {'lr': 0.00020597779455640226, 'samples': 16193280, 'steps': 84339, 'loss/train': 1.6927725076675415} 11/07/2021 09:04:57 - INFO - __main__ - Step 84341: {'lr': 0.0002059725707436645, 'samples': 16193472, 'steps': 84340, 'loss/train': 1.5547658205032349} 11/07/2021 09:04:58 - INFO - __main__ - Step 84342: {'lr': 0.0002059673469507648, 'samples': 16193664, 'steps': 84341, 'loss/train': 1.0833820104599} 11/07/2021 09:04:59 - INFO - __main__ - Step 84343: {'lr': 0.00020596212317770552, 'samples': 16193856, 'steps': 84342, 'loss/train': 1.5041462182998657} 11/07/2021 09:04:59 - INFO - __main__ - Step 84344: {'lr': 0.00020595689942448913, 'samples': 16194048, 'steps': 84343, 'loss/train': 1.8019803762435913} 11/07/2021 09:04:59 - INFO - __main__ - Step 84345: {'lr': 0.0002059516756911178, 'samples': 16194240, 'steps': 84344, 'loss/train': 1.1891916990280151} 11/07/2021 09:05:00 - INFO - __main__ - Step 84346: {'lr': 0.00020594645197759398, 'samples': 16194432, 'steps': 84345, 'loss/train': 1.6789155006408691} 11/07/2021 09:05:01 - INFO - __main__ - Step 84347: {'lr': 0.00020594122828392, 'samples': 16194624, 'steps': 84346, 'loss/train': 1.2706385850906372} 11/07/2021 09:05:01 - INFO - __main__ - Step 84348: {'lr': 0.0002059360046100982, 'samples': 16194816, 'steps': 84347, 'loss/train': 1.652147889137268} 11/07/2021 09:05:01 - INFO - __main__ - Step 84349: {'lr': 0.00020593078095613096, 'samples': 16195008, 'steps': 84348, 'loss/train': 0.29079511761665344} 11/07/2021 09:05:02 - INFO - __main__ - Step 84350: {'lr': 0.00020592555732202062, 'samples': 16195200, 'steps': 84349, 'loss/train': 1.086082935333252} 11/07/2021 09:05:02 - INFO - __main__ - Step 84351: {'lr': 0.00020592033370776957, 'samples': 16195392, 'steps': 84350, 'loss/train': 1.5513148307800293} 11/07/2021 09:05:03 - INFO - __main__ - Step 84352: {'lr': 0.00020591511011338012, 'samples': 16195584, 'steps': 84351, 'loss/train': 1.1981221437454224} 11/07/2021 09:05:03 - INFO - __main__ - Step 84353: {'lr': 0.00020590988653885467, 'samples': 16195776, 'steps': 84352, 'loss/train': 1.0650126934051514} 11/07/2021 09:05:04 - INFO - __main__ - Step 84354: {'lr': 0.0002059046629841955, 'samples': 16195968, 'steps': 84353, 'loss/train': 1.2646561861038208} 11/07/2021 09:05:04 - INFO - __main__ - Step 84355: {'lr': 0.00020589943944940504, 'samples': 16196160, 'steps': 84354, 'loss/train': 1.3128958940505981} 11/07/2021 09:05:05 - INFO - __main__ - Step 84356: {'lr': 0.00020589421593448568, 'samples': 16196352, 'steps': 84355, 'loss/train': 1.292158603668213} 11/07/2021 09:05:06 - INFO - __main__ - Step 84357: {'lr': 0.00020588899243943967, 'samples': 16196544, 'steps': 84356, 'loss/train': 0.8947818875312805} 11/07/2021 09:05:06 - INFO - __main__ - Step 84358: {'lr': 0.00020588376896426937, 'samples': 16196736, 'steps': 84357, 'loss/train': 1.745620846748352} 11/07/2021 09:05:06 - INFO - __main__ - Step 84359: {'lr': 0.00020587854550897715, 'samples': 16196928, 'steps': 84358, 'loss/train': 1.4487377405166626} 11/07/2021 09:05:07 - INFO - __main__ - Step 84360: {'lr': 0.00020587332207356538, 'samples': 16197120, 'steps': 84359, 'loss/train': 1.4171881675720215} 11/07/2021 09:05:07 - INFO - __main__ - Step 84361: {'lr': 0.0002058680986580364, 'samples': 16197312, 'steps': 84360, 'loss/train': 1.4172440767288208} 11/07/2021 09:05:07 - INFO - __main__ - Step 84362: {'lr': 0.00020586287526239258, 'samples': 16197504, 'steps': 84361, 'loss/train': 1.269803762435913} 11/07/2021 09:05:08 - INFO - __main__ - Step 84363: {'lr': 0.00020585765188663627, 'samples': 16197696, 'steps': 84362, 'loss/train': 1.234804391860962} 11/07/2021 09:05:09 - INFO - __main__ - Step 84364: {'lr': 0.00020585242853076984, 'samples': 16197888, 'steps': 84363, 'loss/train': 1.029646396636963} 11/07/2021 09:05:09 - INFO - __main__ - Step 84365: {'lr': 0.0002058472051947956, 'samples': 16198080, 'steps': 84364, 'loss/train': 1.2398158311843872} 11/07/2021 09:05:09 - INFO - __main__ - Step 84366: {'lr': 0.00020584198187871596, 'samples': 16198272, 'steps': 84365, 'loss/train': 1.3838590383529663} 11/07/2021 09:05:10 - INFO - __main__ - Step 84367: {'lr': 0.00020583675858253325, 'samples': 16198464, 'steps': 84366, 'loss/train': 1.5388305187225342} 11/07/2021 09:05:11 - INFO - __main__ - Step 84368: {'lr': 0.00020583153530624975, 'samples': 16198656, 'steps': 84367, 'loss/train': 1.606859564781189} 11/07/2021 09:05:11 - INFO - __main__ - Step 84369: {'lr': 0.00020582631204986792, 'samples': 16198848, 'steps': 84368, 'loss/train': 1.3613994121551514} 11/07/2021 09:05:12 - INFO - __main__ - Step 84370: {'lr': 0.00020582108881339007, 'samples': 16199040, 'steps': 84369, 'loss/train': 1.310326337814331} 11/07/2021 09:05:12 - INFO - __main__ - Step 84371: {'lr': 0.0002058158655968186, 'samples': 16199232, 'steps': 84370, 'loss/train': 0.49529701471328735} 11/07/2021 09:05:12 - INFO - __main__ - Step 84372: {'lr': 0.00020581064240015576, 'samples': 16199424, 'steps': 84371, 'loss/train': 0.13930119574069977} 11/07/2021 09:05:13 - INFO - __main__ - Step 84373: {'lr': 0.000205805419223404, 'samples': 16199616, 'steps': 84372, 'loss/train': 1.2588410377502441} 11/07/2021 09:05:14 - INFO - __main__ - Step 84374: {'lr': 0.00020580019606656559, 'samples': 16199808, 'steps': 84373, 'loss/train': 0.7780636548995972} 11/07/2021 09:05:14 - INFO - __main__ - Step 84375: {'lr': 0.00020579497292964294, 'samples': 16200000, 'steps': 84374, 'loss/train': 1.316388487815857} 11/07/2021 09:05:14 - INFO - __main__ - Step 84376: {'lr': 0.0002057897498126384, 'samples': 16200192, 'steps': 84375, 'loss/train': 1.2603150606155396} 11/07/2021 09:05:15 - INFO - __main__ - Step 84377: {'lr': 0.00020578452671555432, 'samples': 16200384, 'steps': 84376, 'loss/train': 1.5279251337051392} 11/07/2021 09:05:16 - INFO - __main__ - Step 84378: {'lr': 0.00020577930363839308, 'samples': 16200576, 'steps': 84377, 'loss/train': 0.5417143106460571} 11/07/2021 09:05:16 - INFO - __main__ - Step 84379: {'lr': 0.000205774080581157, 'samples': 16200768, 'steps': 84378, 'loss/train': 1.650583028793335} 11/07/2021 09:05:16 - INFO - __main__ - Step 84380: {'lr': 0.00020576885754384838, 'samples': 16200960, 'steps': 84379, 'loss/train': 1.5223290920257568} 11/07/2021 09:05:17 - INFO - __main__ - Step 84381: {'lr': 0.00020576363452646964, 'samples': 16201152, 'steps': 84380, 'loss/train': 1.300111174583435} 11/07/2021 09:05:17 - INFO - __main__ - Step 84382: {'lr': 0.00020575841152902313, 'samples': 16201344, 'steps': 84381, 'loss/train': 1.305124044418335} 11/07/2021 09:05:18 - INFO - __main__ - Step 84383: {'lr': 0.00020575318855151124, 'samples': 16201536, 'steps': 84382, 'loss/train': 1.9734230041503906} 11/07/2021 09:05:18 - INFO - __main__ - Step 84384: {'lr': 0.0002057479655939363, 'samples': 16201728, 'steps': 84383, 'loss/train': 1.411020278930664} 11/07/2021 09:05:19 - INFO - __main__ - Step 84385: {'lr': 0.00020574274265630057, 'samples': 16201920, 'steps': 84384, 'loss/train': 1.356818675994873} 11/07/2021 09:05:19 - INFO - __main__ - Step 84386: {'lr': 0.00020573751973860648, 'samples': 16202112, 'steps': 84385, 'loss/train': 1.416837215423584} 11/07/2021 09:05:19 - INFO - __main__ - Step 84387: {'lr': 0.00020573229684085642, 'samples': 16202304, 'steps': 84386, 'loss/train': 1.301167368888855} 11/07/2021 09:05:20 - INFO - __main__ - Step 84388: {'lr': 0.00020572707396305267, 'samples': 16202496, 'steps': 84387, 'loss/train': 1.173892617225647} 11/07/2021 09:05:21 - INFO - __main__ - Step 84389: {'lr': 0.0002057218511051977, 'samples': 16202688, 'steps': 84388, 'loss/train': 1.453700304031372} 11/07/2021 09:05:21 - INFO - __main__ - Step 84390: {'lr': 0.0002057166282672937, 'samples': 16202880, 'steps': 84389, 'loss/train': 1.1726751327514648} 11/07/2021 09:05:21 - INFO - __main__ - Step 84391: {'lr': 0.00020571140544934315, 'samples': 16203072, 'steps': 84390, 'loss/train': 1.4512406587600708} 11/07/2021 09:05:22 - INFO - __main__ - Step 84392: {'lr': 0.0002057061826513483, 'samples': 16203264, 'steps': 84391, 'loss/train': 1.261427640914917} 11/07/2021 09:05:23 - INFO - __main__ - Step 84393: {'lr': 0.00020570095987331154, 'samples': 16203456, 'steps': 84392, 'loss/train': 1.4364503622055054} 11/07/2021 09:05:23 - INFO - __main__ - Step 84394: {'lr': 0.00020569573711523532, 'samples': 16203648, 'steps': 84393, 'loss/train': 1.684856653213501} 11/07/2021 09:05:24 - INFO - __main__ - Step 84395: {'lr': 0.0002056905143771219, 'samples': 16203840, 'steps': 84394, 'loss/train': 1.7211390733718872} 11/07/2021 09:05:24 - INFO - __main__ - Step 84396: {'lr': 0.0002056852916589736, 'samples': 16204032, 'steps': 84395, 'loss/train': 1.7363909482955933} 11/07/2021 09:05:24 - INFO - __main__ - Step 84397: {'lr': 0.00020568006896079286, 'samples': 16204224, 'steps': 84396, 'loss/train': 1.3812081813812256} 11/07/2021 09:05:25 - INFO - __main__ - Step 84398: {'lr': 0.00020567484628258203, 'samples': 16204416, 'steps': 84397, 'loss/train': 1.070642113685608} 11/07/2021 09:05:26 - INFO - __main__ - Step 84399: {'lr': 0.00020566962362434342, 'samples': 16204608, 'steps': 84398, 'loss/train': 1.255839467048645} 11/07/2021 09:05:26 - INFO - __main__ - Step 84400: {'lr': 0.00020566440098607943, 'samples': 16204800, 'steps': 84399, 'loss/train': 1.0617942810058594} 11/07/2021 09:05:26 - INFO - __main__ - Step 84401: {'lr': 0.0002056591783677923, 'samples': 16204992, 'steps': 84400, 'loss/train': 1.268270492553711} 11/07/2021 09:05:27 - INFO - __main__ - Step 84402: {'lr': 0.00020565395576948448, 'samples': 16205184, 'steps': 84401, 'loss/train': 1.6347589492797852} 11/07/2021 09:05:28 - INFO - __main__ - Step 84403: {'lr': 0.0002056487331911583, 'samples': 16205376, 'steps': 84402, 'loss/train': 1.2619003057479858} 11/07/2021 09:05:28 - INFO - __main__ - Step 84404: {'lr': 0.00020564351063281612, 'samples': 16205568, 'steps': 84403, 'loss/train': 1.5450537204742432} 11/07/2021 09:05:28 - INFO - __main__ - Step 84405: {'lr': 0.00020563828809446027, 'samples': 16205760, 'steps': 84404, 'loss/train': 1.3859812021255493} 11/07/2021 09:05:29 - INFO - __main__ - Step 84406: {'lr': 0.00020563306557609313, 'samples': 16205952, 'steps': 84405, 'loss/train': 0.9360536932945251} 11/07/2021 09:05:29 - INFO - __main__ - Step 84407: {'lr': 0.00020562784307771707, 'samples': 16206144, 'steps': 84406, 'loss/train': 1.4728535413742065} 11/07/2021 09:05:29 - INFO - __main__ - Step 84408: {'lr': 0.0002056226205993344, 'samples': 16206336, 'steps': 84407, 'loss/train': 1.1183110475540161} 11/07/2021 09:05:30 - INFO - __main__ - Step 84409: {'lr': 0.0002056173981409475, 'samples': 16206528, 'steps': 84408, 'loss/train': 1.2741128206253052} 11/07/2021 09:05:31 - INFO - __main__ - Step 84410: {'lr': 0.00020561217570255869, 'samples': 16206720, 'steps': 84409, 'loss/train': 1.0217615365982056} 11/07/2021 09:05:31 - INFO - __main__ - Step 84411: {'lr': 0.00020560695328417048, 'samples': 16206912, 'steps': 84410, 'loss/train': 1.5836631059646606} 11/07/2021 09:05:31 - INFO - __main__ - Step 84412: {'lr': 0.000205601730885785, 'samples': 16207104, 'steps': 84411, 'loss/train': 1.3501932621002197} 11/07/2021 09:05:32 - INFO - __main__ - Step 84413: {'lr': 0.00020559650850740467, 'samples': 16207296, 'steps': 84412, 'loss/train': 0.7468372583389282} 11/07/2021 09:05:33 - INFO - __main__ - Step 84414: {'lr': 0.00020559128614903186, 'samples': 16207488, 'steps': 84413, 'loss/train': 0.9560016989707947} 11/07/2021 09:05:33 - INFO - __main__ - Step 84415: {'lr': 0.00020558606381066897, 'samples': 16207680, 'steps': 84414, 'loss/train': 1.34201180934906} 11/07/2021 09:05:34 - INFO - __main__ - Step 84416: {'lr': 0.00020558084149231826, 'samples': 16207872, 'steps': 84415, 'loss/train': 1.5380573272705078} 11/07/2021 09:05:34 - INFO - __main__ - Step 84417: {'lr': 0.0002055756191939822, 'samples': 16208064, 'steps': 84416, 'loss/train': 1.3962185382843018} 11/07/2021 09:05:34 - INFO - __main__ - Step 84418: {'lr': 0.00020557039691566301, 'samples': 16208256, 'steps': 84417, 'loss/train': 1.3413220643997192} 11/07/2021 09:05:35 - INFO - __main__ - Step 84419: {'lr': 0.00020556517465736314, 'samples': 16208448, 'steps': 84418, 'loss/train': 1.6428202390670776} 11/07/2021 09:05:36 - INFO - __main__ - Step 84420: {'lr': 0.00020555995241908497, 'samples': 16208640, 'steps': 84419, 'loss/train': 1.2653759717941284} 11/07/2021 09:05:36 - INFO - __main__ - Step 84421: {'lr': 0.00020555473020083073, 'samples': 16208832, 'steps': 84420, 'loss/train': 1.1103742122650146} 11/07/2021 09:05:36 - INFO - __main__ - Step 84422: {'lr': 0.00020554950800260287, 'samples': 16209024, 'steps': 84421, 'loss/train': 1.3234283924102783} 11/07/2021 09:05:37 - INFO - __main__ - Step 84423: {'lr': 0.00020554428582440372, 'samples': 16209216, 'steps': 84422, 'loss/train': 1.4535189867019653} 11/07/2021 09:05:38 - INFO - __main__ - Step 84424: {'lr': 0.0002055390636662356, 'samples': 16209408, 'steps': 84423, 'loss/train': 1.3732576370239258} 11/07/2021 09:05:38 - INFO - __main__ - Step 84425: {'lr': 0.00020553384152810107, 'samples': 16209600, 'steps': 84424, 'loss/train': 1.5438075065612793} 11/07/2021 09:05:38 - INFO - __main__ - Step 84426: {'lr': 0.00020552861941000212, 'samples': 16209792, 'steps': 84425, 'loss/train': 1.6332169771194458} 11/07/2021 09:05:39 - INFO - __main__ - Step 84427: {'lr': 0.0002055233973119413, 'samples': 16209984, 'steps': 84426, 'loss/train': 1.259080171585083} 11/07/2021 09:05:39 - INFO - __main__ - Step 84428: {'lr': 0.000205518175233921, 'samples': 16210176, 'steps': 84427, 'loss/train': 1.5279171466827393} 11/07/2021 09:05:40 - INFO - __main__ - Step 84429: {'lr': 0.00020551295317594348, 'samples': 16210368, 'steps': 84428, 'loss/train': 1.444433331489563} 11/07/2021 09:05:41 - INFO - __main__ - Step 84430: {'lr': 0.00020550773113801117, 'samples': 16210560, 'steps': 84429, 'loss/train': 1.6077214479446411} 11/07/2021 09:05:41 - INFO - __main__ - Step 84431: {'lr': 0.00020550250912012636, 'samples': 16210752, 'steps': 84430, 'loss/train': 0.9037222266197205} 11/07/2021 09:05:41 - INFO - __main__ - Step 84432: {'lr': 0.00020549728712229142, 'samples': 16210944, 'steps': 84431, 'loss/train': 1.6169711351394653} 11/07/2021 09:05:42 - INFO - __main__ - Step 84433: {'lr': 0.00020549206514450876, 'samples': 16211136, 'steps': 84432, 'loss/train': 0.8670802712440491} 11/07/2021 09:05:43 - INFO - __main__ - Step 84434: {'lr': 0.00020548684318678065, 'samples': 16211328, 'steps': 84433, 'loss/train': 1.566977620124817} 11/07/2021 09:05:43 - INFO - __main__ - Step 84435: {'lr': 0.0002054816212491095, 'samples': 16211520, 'steps': 84434, 'loss/train': 1.4507079124450684} 11/07/2021 09:05:43 - INFO - __main__ - Step 84436: {'lr': 0.00020547639933149765, 'samples': 16211712, 'steps': 84435, 'loss/train': 1.1206697225570679} 11/07/2021 09:05:44 - INFO - __main__ - Step 84437: {'lr': 0.00020547117743394743, 'samples': 16211904, 'steps': 84436, 'loss/train': 1.367564082145691} 11/07/2021 09:05:44 - INFO - __main__ - Step 84438: {'lr': 0.00020546595555646135, 'samples': 16212096, 'steps': 84437, 'loss/train': 0.9893630743026733} 11/07/2021 09:05:44 - INFO - __main__ - Step 84439: {'lr': 0.0002054607336990415, 'samples': 16212288, 'steps': 84438, 'loss/train': 1.6980910301208496} 11/07/2021 09:05:45 - INFO - __main__ - Step 84440: {'lr': 0.00020545551186169035, 'samples': 16212480, 'steps': 84439, 'loss/train': 1.0928521156311035} 11/07/2021 09:05:46 - INFO - __main__ - Step 84441: {'lr': 0.00020545029004441024, 'samples': 16212672, 'steps': 84440, 'loss/train': 0.9358792901039124} 11/07/2021 09:05:46 - INFO - __main__ - Step 84442: {'lr': 0.00020544506824720355, 'samples': 16212864, 'steps': 84441, 'loss/train': 1.776924729347229} 11/07/2021 09:05:46 - INFO - __main__ - Step 84443: {'lr': 0.00020543984647007263, 'samples': 16213056, 'steps': 84442, 'loss/train': 1.248409628868103} 11/07/2021 09:05:47 - INFO - __main__ - Step 84444: {'lr': 0.00020543462471301987, 'samples': 16213248, 'steps': 84443, 'loss/train': 2.0558760166168213} 11/07/2021 09:05:48 - INFO - __main__ - Step 84445: {'lr': 0.00020542940297604752, 'samples': 16213440, 'steps': 84444, 'loss/train': 1.2449547052383423} 11/07/2021 09:05:48 - INFO - __main__ - Step 84446: {'lr': 0.00020542418125915802, 'samples': 16213632, 'steps': 84445, 'loss/train': 2.15468168258667} 11/07/2021 09:05:49 - INFO - __main__ - Step 84447: {'lr': 0.0002054189595623537, 'samples': 16213824, 'steps': 84446, 'loss/train': 1.064052939414978} 11/07/2021 09:05:49 - INFO - __main__ - Step 84448: {'lr': 0.0002054137378856369, 'samples': 16214016, 'steps': 84447, 'loss/train': 0.9319149255752563} 11/07/2021 09:05:49 - INFO - __main__ - Step 84449: {'lr': 0.00020540851622900997, 'samples': 16214208, 'steps': 84448, 'loss/train': 1.6605398654937744} 11/07/2021 09:05:50 - INFO - __main__ - Step 84450: {'lr': 0.0002054032945924753, 'samples': 16214400, 'steps': 84449, 'loss/train': 1.2807215452194214} 11/07/2021 09:05:51 - INFO - __main__ - Step 84451: {'lr': 0.00020539807297603518, 'samples': 16214592, 'steps': 84450, 'loss/train': 1.4664067029953003} 11/07/2021 09:05:51 - INFO - __main__ - Step 84452: {'lr': 0.00020539285137969216, 'samples': 16214784, 'steps': 84451, 'loss/train': 1.423790693283081} 11/07/2021 09:05:51 - INFO - __main__ - Step 84453: {'lr': 0.0002053876298034483, 'samples': 16214976, 'steps': 84452, 'loss/train': 1.5909624099731445} 11/07/2021 09:05:52 - INFO - __main__ - Step 84454: {'lr': 0.0002053824082473061, 'samples': 16215168, 'steps': 84453, 'loss/train': 1.0251312255859375} 11/07/2021 09:05:53 - INFO - __main__ - Step 84455: {'lr': 0.00020537718671126786, 'samples': 16215360, 'steps': 84454, 'loss/train': 1.4034340381622314} 11/07/2021 09:05:53 - INFO - __main__ - Step 84456: {'lr': 0.000205371965195336, 'samples': 16215552, 'steps': 84455, 'loss/train': 1.2324011325836182} 11/07/2021 09:05:53 - INFO - __main__ - Step 84457: {'lr': 0.00020536674369951282, 'samples': 16215744, 'steps': 84456, 'loss/train': 1.4207642078399658} 11/07/2021 09:05:54 - INFO - __main__ - Step 84458: {'lr': 0.00020536152222380073, 'samples': 16215936, 'steps': 84457, 'loss/train': 1.3865628242492676} 11/07/2021 09:05:54 - INFO - __main__ - Step 84459: {'lr': 0.00020535630076820203, 'samples': 16216128, 'steps': 84458, 'loss/train': 0.6009006500244141} 11/07/2021 09:05:55 - INFO - __main__ - Step 84460: {'lr': 0.0002053510793327191, 'samples': 16216320, 'steps': 84459, 'loss/train': 1.4510588645935059} 11/07/2021 09:05:56 - INFO - __main__ - Step 84461: {'lr': 0.00020534585791735427, 'samples': 16216512, 'steps': 84460, 'loss/train': 1.548398494720459} 11/07/2021 09:05:56 - INFO - __main__ - Step 84462: {'lr': 0.0002053406365221099, 'samples': 16216704, 'steps': 84461, 'loss/train': 1.8247085809707642} 11/07/2021 09:05:56 - INFO - __main__ - Step 84463: {'lr': 0.00020533541514698839, 'samples': 16216896, 'steps': 84462, 'loss/train': 1.3794692754745483} 11/07/2021 09:05:57 - INFO - __main__ - Step 84464: {'lr': 0.00020533019379199202, 'samples': 16217088, 'steps': 84463, 'loss/train': 1.6242148876190186} 11/07/2021 09:05:57 - INFO - __main__ - Step 84465: {'lr': 0.0002053249724571233, 'samples': 16217280, 'steps': 84464, 'loss/train': 1.237176775932312} 11/07/2021 09:05:58 - INFO - __main__ - Step 84466: {'lr': 0.00020531975114238433, 'samples': 16217472, 'steps': 84465, 'loss/train': 1.6528364419937134} 11/07/2021 09:05:58 - INFO - __main__ - Step 84467: {'lr': 0.0002053145298477776, 'samples': 16217664, 'steps': 84466, 'loss/train': 1.4003716707229614} 11/07/2021 09:05:59 - INFO - __main__ - Step 84468: {'lr': 0.00020530930857330548, 'samples': 16217856, 'steps': 84467, 'loss/train': 1.182408094406128} 11/07/2021 09:05:59 - INFO - __main__ - Step 84469: {'lr': 0.00020530408731897026, 'samples': 16218048, 'steps': 84468, 'loss/train': 1.118279218673706} 11/07/2021 09:05:59 - INFO - __main__ - Step 84470: {'lr': 0.00020529886608477434, 'samples': 16218240, 'steps': 84469, 'loss/train': 0.7990151643753052} 11/07/2021 09:06:00 - INFO - __main__ - Step 84471: {'lr': 0.00020529364487072006, 'samples': 16218432, 'steps': 84470, 'loss/train': 1.3580741882324219} 11/07/2021 09:06:01 - INFO - __main__ - Step 84472: {'lr': 0.00020528842367680978, 'samples': 16218624, 'steps': 84471, 'loss/train': 1.029937744140625} 11/07/2021 09:06:01 - INFO - __main__ - Step 84473: {'lr': 0.00020528320250304586, 'samples': 16218816, 'steps': 84472, 'loss/train': 1.4119409322738647} 11/07/2021 09:06:01 - INFO - __main__ - Step 84474: {'lr': 0.0002052779813494306, 'samples': 16219008, 'steps': 84473, 'loss/train': 1.5133881568908691} 11/07/2021 09:06:02 - INFO - __main__ - Step 84475: {'lr': 0.00020527276021596643, 'samples': 16219200, 'steps': 84474, 'loss/train': 1.17160964012146} 11/07/2021 09:06:03 - INFO - __main__ - Step 84476: {'lr': 0.00020526753910265564, 'samples': 16219392, 'steps': 84475, 'loss/train': 0.5593720078468323} 11/07/2021 09:06:03 - INFO - __main__ - Step 84477: {'lr': 0.00020526231800950062, 'samples': 16219584, 'steps': 84476, 'loss/train': 1.4041215181350708} 11/07/2021 09:06:04 - INFO - __main__ - Step 84478: {'lr': 0.0002052570969365038, 'samples': 16219776, 'steps': 84477, 'loss/train': 1.5142379999160767} 11/07/2021 09:06:04 - INFO - __main__ - Step 84479: {'lr': 0.00020525187588366734, 'samples': 16219968, 'steps': 84478, 'loss/train': 1.4213480949401855} 11/07/2021 09:06:04 - INFO - __main__ - Step 84480: {'lr': 0.0002052466548509937, 'samples': 16220160, 'steps': 84479, 'loss/train': 1.5051898956298828} 11/07/2021 09:06:05 - INFO - __main__ - Step 84481: {'lr': 0.00020524143383848523, 'samples': 16220352, 'steps': 84480, 'loss/train': 1.3073524236679077} 11/07/2021 09:06:06 - INFO - __main__ - Step 84482: {'lr': 0.00020523621284614427, 'samples': 16220544, 'steps': 84481, 'loss/train': 1.4370245933532715} 11/07/2021 09:06:06 - INFO - __main__ - Step 84483: {'lr': 0.0002052309918739732, 'samples': 16220736, 'steps': 84482, 'loss/train': 1.3541544675827026} 11/07/2021 09:06:06 - INFO - __main__ - Step 84484: {'lr': 0.00020522577092197433, 'samples': 16220928, 'steps': 84483, 'loss/train': 1.5271118879318237} 11/07/2021 09:06:07 - INFO - __main__ - Step 84485: {'lr': 0.00020522054999015004, 'samples': 16221120, 'steps': 84484, 'loss/train': 1.1878020763397217} 11/07/2021 09:06:07 - INFO - __main__ - Step 84486: {'lr': 0.00020521532907850272, 'samples': 16221312, 'steps': 84485, 'loss/train': 1.1074072122573853} 11/07/2021 09:06:08 - INFO - __main__ - Step 84487: {'lr': 0.00020521010818703463, 'samples': 16221504, 'steps': 84486, 'loss/train': 1.6024038791656494} 11/07/2021 09:06:09 - INFO - __main__ - Step 84488: {'lr': 0.00020520488731574818, 'samples': 16221696, 'steps': 84487, 'loss/train': 1.149214267730713} 11/07/2021 09:06:09 - INFO - __main__ - Step 84489: {'lr': 0.00020519966646464574, 'samples': 16221888, 'steps': 84488, 'loss/train': 1.4067715406417847} 11/07/2021 09:06:09 - INFO - __main__ - Step 84490: {'lr': 0.00020519444563372964, 'samples': 16222080, 'steps': 84489, 'loss/train': 2.0407135486602783} 11/07/2021 09:06:10 - INFO - __main__ - Step 84491: {'lr': 0.00020518922482300225, 'samples': 16222272, 'steps': 84490, 'loss/train': 1.7220433950424194} 11/07/2021 09:06:11 - INFO - __main__ - Step 84492: {'lr': 0.00020518400403246595, 'samples': 16222464, 'steps': 84491, 'loss/train': 1.4351319074630737} 11/07/2021 09:06:11 - INFO - __main__ - Step 84493: {'lr': 0.00020517878326212297, 'samples': 16222656, 'steps': 84492, 'loss/train': 1.6740446090698242} 11/07/2021 09:06:11 - INFO - __main__ - Step 84494: {'lr': 0.00020517356251197573, 'samples': 16222848, 'steps': 84493, 'loss/train': 1.6858018636703491} 11/07/2021 09:06:12 - INFO - __main__ - Step 84495: {'lr': 0.0002051683417820266, 'samples': 16223040, 'steps': 84494, 'loss/train': 1.4625985622406006} 11/07/2021 09:06:12 - INFO - __main__ - Step 84496: {'lr': 0.00020516312107227792, 'samples': 16223232, 'steps': 84495, 'loss/train': 1.4637012481689453} 11/07/2021 09:06:13 - INFO - __main__ - Step 84497: {'lr': 0.00020515790038273205, 'samples': 16223424, 'steps': 84496, 'loss/train': 1.5609747171401978} 11/07/2021 09:06:14 - INFO - __main__ - Step 84498: {'lr': 0.00020515267971339132, 'samples': 16223616, 'steps': 84497, 'loss/train': 1.2050973176956177} 11/07/2021 09:06:14 - INFO - __main__ - Step 84499: {'lr': 0.00020514745906425813, 'samples': 16223808, 'steps': 84498, 'loss/train': 1.015786051750183} 11/07/2021 09:06:14 - INFO - __main__ - Step 84500: {'lr': 0.0002051422384353348, 'samples': 16224000, 'steps': 84499, 'loss/train': 0.10791367292404175} 11/07/2021 09:06:15 - INFO - __main__ - Step 84501: {'lr': 0.00020513701782662368, 'samples': 16224192, 'steps': 84500, 'loss/train': 0.14546319842338562} 11/07/2021 09:06:15 - INFO - __main__ - Step 84502: {'lr': 0.00020513179723812716, 'samples': 16224384, 'steps': 84501, 'loss/train': 1.4005837440490723} 11/07/2021 09:06:16 - INFO - __main__ - Step 84503: {'lr': 0.0002051265766698475, 'samples': 16224576, 'steps': 84502, 'loss/train': 1.63362455368042} 11/07/2021 09:06:17 - INFO - __main__ - Step 84504: {'lr': 0.00020512135612178717, 'samples': 16224768, 'steps': 84503, 'loss/train': 0.9041234254837036} 11/07/2021 09:06:17 - INFO - __main__ - Step 84505: {'lr': 0.00020511613559394848, 'samples': 16224960, 'steps': 84504, 'loss/train': 1.026803731918335} 11/07/2021 09:06:17 - INFO - __main__ - Step 84506: {'lr': 0.0002051109150863337, 'samples': 16225152, 'steps': 84505, 'loss/train': 1.2611349821090698} 11/07/2021 09:06:18 - INFO - __main__ - Step 84507: {'lr': 0.00020510569459894525, 'samples': 16225344, 'steps': 84506, 'loss/train': 1.7884262800216675} 11/07/2021 09:06:19 - INFO - __main__ - Step 84508: {'lr': 0.0002051004741317855, 'samples': 16225536, 'steps': 84507, 'loss/train': 1.5355600118637085} 11/07/2021 09:06:19 - INFO - __main__ - Step 84509: {'lr': 0.00020509525368485679, 'samples': 16225728, 'steps': 84508, 'loss/train': 1.7056313753128052} 11/07/2021 09:06:19 - INFO - __main__ - Step 84510: {'lr': 0.00020509003325816145, 'samples': 16225920, 'steps': 84509, 'loss/train': 1.0972213745117188} 11/07/2021 09:06:20 - INFO - __main__ - Step 84511: {'lr': 0.00020508481285170185, 'samples': 16226112, 'steps': 84510, 'loss/train': 0.6974997520446777} 11/07/2021 09:06:20 - INFO - __main__ - Step 84512: {'lr': 0.00020507959246548042, 'samples': 16226304, 'steps': 84511, 'loss/train': 0.6448963284492493} 11/07/2021 09:06:21 - INFO - __main__ - Step 84513: {'lr': 0.00020507437209949937, 'samples': 16226496, 'steps': 84512, 'loss/train': 1.5513637065887451} 11/07/2021 09:06:21 - INFO - __main__ - Step 84514: {'lr': 0.00020506915175376106, 'samples': 16226688, 'steps': 84513, 'loss/train': 1.6534878015518188} 11/07/2021 09:06:22 - INFO - __main__ - Step 84515: {'lr': 0.00020506393142826797, 'samples': 16226880, 'steps': 84514, 'loss/train': 1.5258485078811646} 11/07/2021 09:06:22 - INFO - __main__ - Step 84516: {'lr': 0.00020505871112302233, 'samples': 16227072, 'steps': 84515, 'loss/train': 1.5211281776428223} 11/07/2021 09:06:22 - INFO - __main__ - Step 84517: {'lr': 0.00020505349083802654, 'samples': 16227264, 'steps': 84516, 'loss/train': 1.6812602281570435} 11/07/2021 09:06:23 - INFO - __main__ - Step 84518: {'lr': 0.00020504827057328298, 'samples': 16227456, 'steps': 84517, 'loss/train': 0.49665743112564087} 11/07/2021 09:06:24 - INFO - __main__ - Step 84519: {'lr': 0.00020504305032879402, 'samples': 16227648, 'steps': 84518, 'loss/train': 1.5497653484344482} 11/07/2021 09:06:24 - INFO - __main__ - Step 84520: {'lr': 0.00020503783010456192, 'samples': 16227840, 'steps': 84519, 'loss/train': 1.5240203142166138} 11/07/2021 09:06:25 - INFO - __main__ - Step 84521: {'lr': 0.00020503260990058908, 'samples': 16228032, 'steps': 84520, 'loss/train': 1.1711053848266602} 11/07/2021 09:06:25 - INFO - __main__ - Step 84522: {'lr': 0.00020502738971687784, 'samples': 16228224, 'steps': 84521, 'loss/train': 1.405091404914856} 11/07/2021 09:06:26 - INFO - __main__ - Step 84523: {'lr': 0.0002050221695534306, 'samples': 16228416, 'steps': 84522, 'loss/train': 1.0316482782363892} 11/07/2021 09:06:26 - INFO - __main__ - Step 84524: {'lr': 0.00020501694941024967, 'samples': 16228608, 'steps': 84523, 'loss/train': 1.2786797285079956} 11/07/2021 09:06:27 - INFO - __main__ - Step 84525: {'lr': 0.0002050117292873374, 'samples': 16228800, 'steps': 84524, 'loss/train': 1.223841905593872} 11/07/2021 09:06:27 - INFO - __main__ - Step 84526: {'lr': 0.00020500650918469612, 'samples': 16228992, 'steps': 84525, 'loss/train': 1.5673333406448364} 11/07/2021 09:06:27 - INFO - __main__ - Step 84527: {'lr': 0.00020500128910232824, 'samples': 16229184, 'steps': 84526, 'loss/train': 1.3511030673980713} 11/07/2021 09:06:28 - INFO - __main__ - Step 84528: {'lr': 0.00020499606904023608, 'samples': 16229376, 'steps': 84527, 'loss/train': 0.9071321487426758} 11/07/2021 09:06:29 - INFO - __main__ - Step 84529: {'lr': 0.000204990848998422, 'samples': 16229568, 'steps': 84528, 'loss/train': 1.5127277374267578} 11/07/2021 09:06:29 - INFO - __main__ - Step 84530: {'lr': 0.00020498562897688832, 'samples': 16229760, 'steps': 84529, 'loss/train': 1.1888238191604614} 11/07/2021 09:06:29 - INFO - __main__ - Step 84531: {'lr': 0.00020498040897563743, 'samples': 16229952, 'steps': 84530, 'loss/train': 1.3403551578521729} 11/07/2021 09:06:30 - INFO - __main__ - Step 84532: {'lr': 0.00020497518899467173, 'samples': 16230144, 'steps': 84531, 'loss/train': 1.7683273553848267} 11/07/2021 09:06:30 - INFO - __main__ - Step 84533: {'lr': 0.00020496996903399348, 'samples': 16230336, 'steps': 84532, 'loss/train': 1.5957438945770264} 11/07/2021 09:06:31 - INFO - __main__ - Step 84534: {'lr': 0.00020496474909360512, 'samples': 16230528, 'steps': 84533, 'loss/train': 1.4480431079864502} 11/07/2021 09:06:31 - INFO - __main__ - Step 84535: {'lr': 0.00020495952917350889, 'samples': 16230720, 'steps': 84534, 'loss/train': 1.236707329750061} 11/07/2021 09:06:32 - INFO - __main__ - Step 84536: {'lr': 0.00020495430927370718, 'samples': 16230912, 'steps': 84535, 'loss/train': 1.2569084167480469} 11/07/2021 09:06:32 - INFO - __main__ - Step 84537: {'lr': 0.00020494908939420236, 'samples': 16231104, 'steps': 84536, 'loss/train': 1.8076286315917969} 11/07/2021 09:06:33 - INFO - __main__ - Step 84538: {'lr': 0.00020494386953499684, 'samples': 16231296, 'steps': 84537, 'loss/train': 1.3040827512741089} 11/07/2021 09:06:34 - INFO - __main__ - Step 84539: {'lr': 0.00020493864969609287, 'samples': 16231488, 'steps': 84538, 'loss/train': 1.2076163291931152} 11/07/2021 09:06:34 - INFO - __main__ - Step 84540: {'lr': 0.00020493342987749287, 'samples': 16231680, 'steps': 84539, 'loss/train': 1.2756900787353516} 11/07/2021 09:06:34 - INFO - __main__ - Step 84541: {'lr': 0.00020492821007919915, 'samples': 16231872, 'steps': 84540, 'loss/train': 1.3896256685256958} 11/07/2021 09:06:35 - INFO - __main__ - Step 84542: {'lr': 0.0002049229903012141, 'samples': 16232064, 'steps': 84541, 'loss/train': 0.9723855257034302} 11/07/2021 09:06:35 - INFO - __main__ - Step 84543: {'lr': 0.00020491777054354004, 'samples': 16232256, 'steps': 84542, 'loss/train': 1.3722662925720215} 11/07/2021 09:06:35 - INFO - __main__ - Step 84544: {'lr': 0.00020491255080617936, 'samples': 16232448, 'steps': 84543, 'loss/train': 1.6967706680297852} 11/07/2021 09:06:37 - INFO - __main__ - Step 84545: {'lr': 0.00020490733108913438, 'samples': 16232640, 'steps': 84544, 'loss/train': 1.6914395093917847} 11/07/2021 09:06:37 - INFO - __main__ - Step 84546: {'lr': 0.00020490211139240756, 'samples': 16232832, 'steps': 84545, 'loss/train': 1.1566072702407837} 11/07/2021 09:06:37 - INFO - __main__ - Step 84547: {'lr': 0.00020489689171600105, 'samples': 16233024, 'steps': 84546, 'loss/train': 1.3886406421661377} 11/07/2021 09:06:38 - INFO - __main__ - Step 84548: {'lr': 0.0002048916720599173, 'samples': 16233216, 'steps': 84547, 'loss/train': 1.3791937828063965} 11/07/2021 09:06:39 - INFO - __main__ - Step 84549: {'lr': 0.00020488645242415865, 'samples': 16233408, 'steps': 84548, 'loss/train': 1.6002534627914429} 11/07/2021 09:06:39 - INFO - __main__ - Step 84550: {'lr': 0.0002048812328087275, 'samples': 16233600, 'steps': 84549, 'loss/train': 1.2560615539550781} 11/07/2021 09:06:39 - INFO - __main__ - Step 84551: {'lr': 0.00020487601321362615, 'samples': 16233792, 'steps': 84550, 'loss/train': 1.2413127422332764} 11/07/2021 09:06:40 - INFO - __main__ - Step 84552: {'lr': 0.000204870793638857, 'samples': 16233984, 'steps': 84551, 'loss/train': 1.3343855142593384} 11/07/2021 09:06:40 - INFO - __main__ - Step 84553: {'lr': 0.00020486557408442235, 'samples': 16234176, 'steps': 84552, 'loss/train': 1.315881371498108} 11/07/2021 09:06:41 - INFO - __main__ - Step 84554: {'lr': 0.00020486035455032458, 'samples': 16234368, 'steps': 84553, 'loss/train': 1.274509072303772} 11/07/2021 09:06:41 - INFO - __main__ - Step 84555: {'lr': 0.00020485513503656604, 'samples': 16234560, 'steps': 84554, 'loss/train': 1.5724247694015503} 11/07/2021 09:06:42 - INFO - __main__ - Step 84556: {'lr': 0.00020484991554314908, 'samples': 16234752, 'steps': 84555, 'loss/train': 1.5560871362686157} 11/07/2021 09:06:42 - INFO - __main__ - Step 84557: {'lr': 0.00020484469607007604, 'samples': 16234944, 'steps': 84556, 'loss/train': 1.7258869409561157} 11/07/2021 09:06:42 - INFO - __main__ - Step 84558: {'lr': 0.0002048394766173493, 'samples': 16235136, 'steps': 84557, 'loss/train': 1.716495394706726} 11/07/2021 09:06:43 - INFO - __main__ - Step 84559: {'lr': 0.00020483425718497127, 'samples': 16235328, 'steps': 84558, 'loss/train': 0.5459644198417664} 11/07/2021 09:06:44 - INFO - __main__ - Step 84560: {'lr': 0.00020482903777294416, 'samples': 16235520, 'steps': 84559, 'loss/train': 1.4168401956558228} 11/07/2021 09:06:44 - INFO - __main__ - Step 84561: {'lr': 0.00020482381838127038, 'samples': 16235712, 'steps': 84560, 'loss/train': 1.4493762254714966} 11/07/2021 09:06:45 - INFO - __main__ - Step 84562: {'lr': 0.0002048185990099523, 'samples': 16235904, 'steps': 84561, 'loss/train': 1.6995502710342407} 11/07/2021 09:06:45 - INFO - __main__ - Step 84563: {'lr': 0.0002048133796589922, 'samples': 16236096, 'steps': 84562, 'loss/train': 1.1596589088439941} 11/07/2021 09:06:45 - INFO - __main__ - Step 84564: {'lr': 0.00020480816032839255, 'samples': 16236288, 'steps': 84563, 'loss/train': 1.5591940879821777} 11/07/2021 09:06:46 - INFO - __main__ - Step 84565: {'lr': 0.00020480294101815565, 'samples': 16236480, 'steps': 84564, 'loss/train': 1.3742320537567139} 11/07/2021 09:06:47 - INFO - __main__ - Step 84566: {'lr': 0.00020479772172828382, 'samples': 16236672, 'steps': 84565, 'loss/train': 1.667954921722412} 11/07/2021 09:06:47 - INFO - __main__ - Step 84567: {'lr': 0.00020479250245877944, 'samples': 16236864, 'steps': 84566, 'loss/train': 0.921503484249115} 11/07/2021 09:06:47 - INFO - __main__ - Step 84568: {'lr': 0.0002047872832096449, 'samples': 16237056, 'steps': 84567, 'loss/train': 1.451155662536621} 11/07/2021 09:06:48 - INFO - __main__ - Step 84569: {'lr': 0.00020478206398088247, 'samples': 16237248, 'steps': 84568, 'loss/train': 1.3839348554611206} 11/07/2021 09:06:49 - INFO - __main__ - Step 84570: {'lr': 0.00020477684477249457, 'samples': 16237440, 'steps': 84569, 'loss/train': 1.1888041496276855} 11/07/2021 09:06:49 - INFO - __main__ - Step 84571: {'lr': 0.00020477162558448352, 'samples': 16237632, 'steps': 84570, 'loss/train': 1.0782030820846558} 11/07/2021 09:06:50 - INFO - __main__ - Step 84572: {'lr': 0.00020476640641685164, 'samples': 16237824, 'steps': 84571, 'loss/train': 1.3293739557266235} 11/07/2021 09:06:50 - INFO - __main__ - Step 84573: {'lr': 0.00020476118726960146, 'samples': 16238016, 'steps': 84572, 'loss/train': 1.3092409372329712} 11/07/2021 09:06:50 - INFO - __main__ - Step 84574: {'lr': 0.00020475596814273513, 'samples': 16238208, 'steps': 84573, 'loss/train': 1.2645753622055054} 11/07/2021 09:06:51 - INFO - __main__ - Step 84575: {'lr': 0.000204750749036255, 'samples': 16238400, 'steps': 84574, 'loss/train': 1.6865676641464233} 11/07/2021 09:06:52 - INFO - __main__ - Step 84576: {'lr': 0.0002047455299501635, 'samples': 16238592, 'steps': 84575, 'loss/train': 1.4901232719421387} 11/07/2021 09:06:52 - INFO - __main__ - Step 84577: {'lr': 0.00020474031088446294, 'samples': 16238784, 'steps': 84576, 'loss/train': 1.5809532403945923} 11/07/2021 09:06:52 - INFO - __main__ - Step 84578: {'lr': 0.00020473509183915572, 'samples': 16238976, 'steps': 84577, 'loss/train': 1.710353136062622} 11/07/2021 09:06:53 - INFO - __main__ - Step 84579: {'lr': 0.00020472987281424418, 'samples': 16239168, 'steps': 84578, 'loss/train': 1.2723020315170288} 11/07/2021 09:06:53 - INFO - __main__ - Step 84580: {'lr': 0.00020472465380973065, 'samples': 16239360, 'steps': 84579, 'loss/train': 1.4999611377716064} 11/07/2021 09:06:54 - INFO - __main__ - Step 84581: {'lr': 0.0002047194348256175, 'samples': 16239552, 'steps': 84580, 'loss/train': 1.453312873840332} 11/07/2021 09:06:54 - INFO - __main__ - Step 84582: {'lr': 0.00020471421586190706, 'samples': 16239744, 'steps': 84581, 'loss/train': 1.204297661781311} 11/07/2021 09:06:55 - INFO - __main__ - Step 84583: {'lr': 0.0002047089969186017, 'samples': 16239936, 'steps': 84582, 'loss/train': 1.4841258525848389} 11/07/2021 09:06:55 - INFO - __main__ - Step 84584: {'lr': 0.00020470377799570378, 'samples': 16240128, 'steps': 84583, 'loss/train': 0.6791462302207947} 11/07/2021 09:06:55 - INFO - __main__ - Step 84585: {'lr': 0.00020469855909321564, 'samples': 16240320, 'steps': 84584, 'loss/train': 1.4862334728240967} 11/07/2021 09:06:57 - INFO - __main__ - Step 84586: {'lr': 0.0002046933402111397, 'samples': 16240512, 'steps': 84585, 'loss/train': 1.5998090505599976} 11/07/2021 09:06:57 - INFO - __main__ - Step 84587: {'lr': 0.00020468812134947817, 'samples': 16240704, 'steps': 84586, 'loss/train': 1.3294440507888794} 11/07/2021 09:06:57 - INFO - __main__ - Step 84588: {'lr': 0.00020468290250823346, 'samples': 16240896, 'steps': 84587, 'loss/train': 1.2719627618789673} 11/07/2021 09:06:58 - INFO - __main__ - Step 84589: {'lr': 0.00020467768368740796, 'samples': 16241088, 'steps': 84588, 'loss/train': 1.4728877544403076} 11/07/2021 09:06:58 - INFO - __main__ - Step 84590: {'lr': 0.00020467246488700398, 'samples': 16241280, 'steps': 84589, 'loss/train': 1.0300781726837158} 11/07/2021 09:06:59 - INFO - __main__ - Step 84591: {'lr': 0.00020466724610702388, 'samples': 16241472, 'steps': 84590, 'loss/train': 1.9281189441680908} 11/07/2021 09:06:59 - INFO - __main__ - Step 84592: {'lr': 0.00020466202734747003, 'samples': 16241664, 'steps': 84591, 'loss/train': 1.4154212474822998} 11/07/2021 09:07:00 - INFO - __main__ - Step 84593: {'lr': 0.00020465680860834475, 'samples': 16241856, 'steps': 84592, 'loss/train': 1.3730981349945068} 11/07/2021 09:07:00 - INFO - __main__ - Step 84594: {'lr': 0.00020465158988965045, 'samples': 16242048, 'steps': 84593, 'loss/train': 1.208726406097412} 11/07/2021 09:07:00 - INFO - __main__ - Step 84595: {'lr': 0.0002046463711913894, 'samples': 16242240, 'steps': 84594, 'loss/train': 1.173787236213684} 11/07/2021 09:07:01 - INFO - __main__ - Step 84596: {'lr': 0.00020464115251356401, 'samples': 16242432, 'steps': 84595, 'loss/train': 1.8196641206741333} 11/07/2021 09:07:02 - INFO - __main__ - Step 84597: {'lr': 0.00020463593385617663, 'samples': 16242624, 'steps': 84596, 'loss/train': 1.6778243780136108} 11/07/2021 09:07:02 - INFO - __main__ - Step 84598: {'lr': 0.0002046307152192296, 'samples': 16242816, 'steps': 84597, 'loss/train': 1.4159280061721802} 11/07/2021 09:07:02 - INFO - __main__ - Step 84599: {'lr': 0.00020462549660272536, 'samples': 16243008, 'steps': 84598, 'loss/train': 1.2242116928100586} 11/07/2021 09:07:03 - INFO - __main__ - Step 84600: {'lr': 0.00020462027800666608, 'samples': 16243200, 'steps': 84599, 'loss/train': 1.035430908203125} 11/07/2021 09:07:04 - INFO - __main__ - Step 84601: {'lr': 0.00020461505943105419, 'samples': 16243392, 'steps': 84600, 'loss/train': 1.228895664215088} 11/07/2021 09:07:04 - INFO - __main__ - Step 84602: {'lr': 0.00020460984087589205, 'samples': 16243584, 'steps': 84601, 'loss/train': 1.340151309967041} 11/07/2021 09:07:05 - INFO - __main__ - Step 84603: {'lr': 0.00020460462234118203, 'samples': 16243776, 'steps': 84602, 'loss/train': 1.4861336946487427} 11/07/2021 09:07:05 - INFO - __main__ - Step 84604: {'lr': 0.00020459940382692646, 'samples': 16243968, 'steps': 84603, 'loss/train': 1.6012952327728271} 11/07/2021 09:07:05 - INFO - __main__ - Step 84605: {'lr': 0.00020459418533312767, 'samples': 16244160, 'steps': 84604, 'loss/train': 1.5499167442321777} 11/07/2021 09:07:06 - INFO - __main__ - Step 84606: {'lr': 0.0002045889668597881, 'samples': 16244352, 'steps': 84605, 'loss/train': 1.4334110021591187} 11/07/2021 09:07:07 - INFO - __main__ - Step 84607: {'lr': 0.00020458374840691, 'samples': 16244544, 'steps': 84606, 'loss/train': 1.5858066082000732} 11/07/2021 09:07:07 - INFO - __main__ - Step 84608: {'lr': 0.00020457852997449579, 'samples': 16244736, 'steps': 84607, 'loss/train': 1.1833418607711792} 11/07/2021 09:07:07 - INFO - __main__ - Step 84609: {'lr': 0.00020457331156254776, 'samples': 16244928, 'steps': 84608, 'loss/train': 1.44891357421875} 11/07/2021 09:07:08 - INFO - __main__ - Step 84610: {'lr': 0.0002045680931710683, 'samples': 16245120, 'steps': 84609, 'loss/train': 1.2299867868423462} 11/07/2021 09:07:08 - INFO - __main__ - Step 84611: {'lr': 0.00020456287480005974, 'samples': 16245312, 'steps': 84610, 'loss/train': 1.2410900592803955} 11/07/2021 09:07:09 - INFO - __main__ - Step 84612: {'lr': 0.0002045576564495245, 'samples': 16245504, 'steps': 84611, 'loss/train': 1.4761728048324585} 11/07/2021 09:07:09 - INFO - __main__ - Step 84613: {'lr': 0.00020455243811946496, 'samples': 16245696, 'steps': 84612, 'loss/train': 0.4418940246105194} 11/07/2021 09:07:10 - INFO - __main__ - Step 84614: {'lr': 0.00020454721980988329, 'samples': 16245888, 'steps': 84613, 'loss/train': 1.594132900238037} 11/07/2021 09:07:10 - INFO - __main__ - Step 84615: {'lr': 0.00020454200152078192, 'samples': 16246080, 'steps': 84614, 'loss/train': 1.1133122444152832} 11/07/2021 09:07:11 - INFO - __main__ - Step 84616: {'lr': 0.00020453678325216325, 'samples': 16246272, 'steps': 84615, 'loss/train': 1.367553472518921} 11/07/2021 09:07:11 - INFO - __main__ - Step 84617: {'lr': 0.00020453156500402958, 'samples': 16246464, 'steps': 84616, 'loss/train': 1.3582121133804321} 11/07/2021 09:07:12 - INFO - __main__ - Step 84618: {'lr': 0.00020452634677638328, 'samples': 16246656, 'steps': 84617, 'loss/train': 1.3757905960083008} 11/07/2021 09:07:12 - INFO - __main__ - Step 84619: {'lr': 0.00020452112856922673, 'samples': 16246848, 'steps': 84618, 'loss/train': 1.267234206199646} 11/07/2021 09:07:13 - INFO - __main__ - Step 84620: {'lr': 0.00020451591038256223, 'samples': 16247040, 'steps': 84619, 'loss/train': 1.4423067569732666} 11/07/2021 09:07:13 - INFO - __main__ - Step 84621: {'lr': 0.00020451069221639218, 'samples': 16247232, 'steps': 84620, 'loss/train': 1.341097354888916} 11/07/2021 09:07:14 - INFO - __main__ - Step 84622: {'lr': 0.00020450547407071894, 'samples': 16247424, 'steps': 84621, 'loss/train': 1.0640594959259033} 11/07/2021 09:07:14 - INFO - __main__ - Step 84623: {'lr': 0.00020450025594554477, 'samples': 16247616, 'steps': 84622, 'loss/train': 1.0073246955871582} 11/07/2021 09:07:15 - INFO - __main__ - Step 84624: {'lr': 0.0002044950378408721, 'samples': 16247808, 'steps': 84623, 'loss/train': 1.6086252927780151} 11/07/2021 09:07:15 - INFO - __main__ - Step 84625: {'lr': 0.00020448981975670336, 'samples': 16248000, 'steps': 84624, 'loss/train': 1.6265400648117065} 11/07/2021 09:07:15 - INFO - __main__ - Step 84626: {'lr': 0.00020448460169304074, 'samples': 16248192, 'steps': 84625, 'loss/train': 1.1049890518188477} 11/07/2021 09:07:16 - INFO - __main__ - Step 84627: {'lr': 0.00020447938364988666, 'samples': 16248384, 'steps': 84626, 'loss/train': 1.9510300159454346} 11/07/2021 09:07:17 - INFO - __main__ - Step 84628: {'lr': 0.00020447416562724345, 'samples': 16248576, 'steps': 84627, 'loss/train': 1.3662604093551636} 11/07/2021 09:07:17 - INFO - __main__ - Step 84629: {'lr': 0.00020446894762511346, 'samples': 16248768, 'steps': 84628, 'loss/train': 0.7154455780982971} 11/07/2021 09:07:17 - INFO - __main__ - Step 84630: {'lr': 0.00020446372964349907, 'samples': 16248960, 'steps': 84629, 'loss/train': 0.7557353377342224} 11/07/2021 09:07:18 - INFO - __main__ - Step 84631: {'lr': 0.00020445851168240264, 'samples': 16249152, 'steps': 84630, 'loss/train': 1.5942338705062866} 11/07/2021 09:07:19 - INFO - __main__ - Step 84632: {'lr': 0.00020445329374182646, 'samples': 16249344, 'steps': 84631, 'loss/train': 1.4458531141281128} 11/07/2021 09:07:19 - INFO - __main__ - Step 84633: {'lr': 0.00020444807582177296, 'samples': 16249536, 'steps': 84632, 'loss/train': 1.1619834899902344} 11/07/2021 09:07:19 - INFO - __main__ - Step 84634: {'lr': 0.00020444285792224444, 'samples': 16249728, 'steps': 84633, 'loss/train': 1.680894136428833} 11/07/2021 09:07:20 - INFO - __main__ - Step 84635: {'lr': 0.00020443764004324328, 'samples': 16249920, 'steps': 84634, 'loss/train': 1.643336296081543} 11/07/2021 09:07:20 - INFO - __main__ - Step 84636: {'lr': 0.00020443242218477184, 'samples': 16250112, 'steps': 84635, 'loss/train': 1.4283676147460938} 11/07/2021 09:07:21 - INFO - __main__ - Step 84637: {'lr': 0.00020442720434683242, 'samples': 16250304, 'steps': 84636, 'loss/train': 1.559242606163025} 11/07/2021 09:07:22 - INFO - __main__ - Step 84638: {'lr': 0.0002044219865294274, 'samples': 16250496, 'steps': 84637, 'loss/train': 1.256117343902588} 11/07/2021 09:07:22 - INFO - __main__ - Step 84639: {'lr': 0.0002044167687325591, 'samples': 16250688, 'steps': 84638, 'loss/train': 1.9996553659439087} 11/07/2021 09:07:22 - INFO - __main__ - Step 84640: {'lr': 0.00020441155095622998, 'samples': 16250880, 'steps': 84639, 'loss/train': 1.0889838933944702} 11/07/2021 09:07:23 - INFO - __main__ - Step 84641: {'lr': 0.00020440633320044225, 'samples': 16251072, 'steps': 84640, 'loss/train': 1.226456880569458} 11/07/2021 09:07:23 - INFO - __main__ - Step 84642: {'lr': 0.00020440111546519833, 'samples': 16251264, 'steps': 84641, 'loss/train': 1.6502223014831543} 11/07/2021 09:07:23 - INFO - __main__ - Step 84643: {'lr': 0.00020439589775050055, 'samples': 16251456, 'steps': 84642, 'loss/train': 1.4119904041290283} 11/07/2021 09:07:25 - INFO - __main__ - Step 84644: {'lr': 0.00020439068005635128, 'samples': 16251648, 'steps': 84643, 'loss/train': 0.6339604258537292} 11/07/2021 09:07:25 - INFO - __main__ - Step 84645: {'lr': 0.00020438546238275287, 'samples': 16251840, 'steps': 84644, 'loss/train': 1.6678794622421265} 11/07/2021 09:07:25 - INFO - __main__ - Step 84646: {'lr': 0.00020438024472970768, 'samples': 16252032, 'steps': 84645, 'loss/train': 1.0297315120697021} 11/07/2021 09:07:26 - INFO - __main__ - Step 84647: {'lr': 0.00020437502709721805, 'samples': 16252224, 'steps': 84646, 'loss/train': 0.8582286238670349} 11/07/2021 09:07:26 - INFO - __main__ - Step 84648: {'lr': 0.00020436980948528632, 'samples': 16252416, 'steps': 84647, 'loss/train': 1.5085622072219849} 11/07/2021 09:07:27 - INFO - __main__ - Step 84649: {'lr': 0.00020436459189391486, 'samples': 16252608, 'steps': 84648, 'loss/train': 1.4853891134262085} 11/07/2021 09:07:27 - INFO - __main__ - Step 84650: {'lr': 0.00020435937432310597, 'samples': 16252800, 'steps': 84649, 'loss/train': 0.9920943379402161} 11/07/2021 09:07:28 - INFO - __main__ - Step 84651: {'lr': 0.00020435415677286207, 'samples': 16252992, 'steps': 84650, 'loss/train': 1.504042148590088} 11/07/2021 09:07:28 - INFO - __main__ - Step 84652: {'lr': 0.00020434893924318548, 'samples': 16253184, 'steps': 84651, 'loss/train': 1.5970582962036133} 11/07/2021 09:07:28 - INFO - __main__ - Step 84653: {'lr': 0.00020434372173407862, 'samples': 16253376, 'steps': 84652, 'loss/train': 1.085263967514038} 11/07/2021 09:07:29 - INFO - __main__ - Step 84654: {'lr': 0.00020433850424554368, 'samples': 16253568, 'steps': 84653, 'loss/train': 1.6690202951431274} 11/07/2021 09:07:30 - INFO - __main__ - Step 84655: {'lr': 0.00020433328677758314, 'samples': 16253760, 'steps': 84654, 'loss/train': 1.6958613395690918} 11/07/2021 09:07:30 - INFO - __main__ - Step 84656: {'lr': 0.0002043280693301993, 'samples': 16253952, 'steps': 84655, 'loss/train': 1.0123772621154785} 11/07/2021 09:07:30 - INFO - __main__ - Step 84657: {'lr': 0.00020432285190339453, 'samples': 16254144, 'steps': 84656, 'loss/train': 1.4570038318634033} 11/07/2021 09:07:31 - INFO - __main__ - Step 84658: {'lr': 0.00020431763449717122, 'samples': 16254336, 'steps': 84657, 'loss/train': 1.2963780164718628} 11/07/2021 09:07:32 - INFO - __main__ - Step 84659: {'lr': 0.00020431241711153165, 'samples': 16254528, 'steps': 84658, 'loss/train': 1.27036714553833} 11/07/2021 09:07:32 - INFO - __main__ - Step 84660: {'lr': 0.0002043071997464782, 'samples': 16254720, 'steps': 84659, 'loss/train': 1.9753367900848389} 11/07/2021 09:07:32 - INFO - __main__ - Step 84661: {'lr': 0.0002043019824020132, 'samples': 16254912, 'steps': 84660, 'loss/train': 1.2565473318099976} 11/07/2021 09:07:33 - INFO - __main__ - Step 84662: {'lr': 0.00020429676507813905, 'samples': 16255104, 'steps': 84661, 'loss/train': 1.7904713153839111} 11/07/2021 09:07:33 - INFO - __main__ - Step 84663: {'lr': 0.00020429154777485802, 'samples': 16255296, 'steps': 84662, 'loss/train': 1.3559070825576782} 11/07/2021 09:07:34 - INFO - __main__ - Step 84664: {'lr': 0.00020428633049217258, 'samples': 16255488, 'steps': 84663, 'loss/train': 1.7995284795761108} 11/07/2021 09:07:35 - INFO - __main__ - Step 84665: {'lr': 0.00020428111323008498, 'samples': 16255680, 'steps': 84664, 'loss/train': 1.6143866777420044} 11/07/2021 09:07:35 - INFO - __main__ - Step 84666: {'lr': 0.0002042758959885976, 'samples': 16255872, 'steps': 84665, 'loss/train': 1.1708416938781738} 11/07/2021 09:07:35 - INFO - __main__ - Step 84667: {'lr': 0.00020427067876771285, 'samples': 16256064, 'steps': 84666, 'loss/train': 1.6756259202957153} 11/07/2021 09:07:36 - INFO - __main__ - Step 84668: {'lr': 0.00020426546156743298, 'samples': 16256256, 'steps': 84667, 'loss/train': 1.306104302406311} 11/07/2021 09:07:36 - INFO - __main__ - Step 84669: {'lr': 0.00020426024438776043, 'samples': 16256448, 'steps': 84668, 'loss/train': 1.6464347839355469} 11/07/2021 09:07:37 - INFO - __main__ - Step 84670: {'lr': 0.0002042550272286975, 'samples': 16256640, 'steps': 84669, 'loss/train': 1.1695822477340698} 11/07/2021 09:07:37 - INFO - __main__ - Step 84671: {'lr': 0.00020424981009024647, 'samples': 16256832, 'steps': 84670, 'loss/train': 1.2532548904418945} 11/07/2021 09:07:38 - INFO - __main__ - Step 84672: {'lr': 0.00020424459297240983, 'samples': 16257024, 'steps': 84671, 'loss/train': 1.6037249565124512} 11/07/2021 09:07:38 - INFO - __main__ - Step 84673: {'lr': 0.00020423937587518988, 'samples': 16257216, 'steps': 84672, 'loss/train': 0.9443399310112} 11/07/2021 09:07:38 - INFO - __main__ - Step 84674: {'lr': 0.0002042341587985889, 'samples': 16257408, 'steps': 84673, 'loss/train': 1.707146406173706} 11/07/2021 09:07:39 - INFO - __main__ - Step 84675: {'lr': 0.00020422894174260933, 'samples': 16257600, 'steps': 84674, 'loss/train': 1.5387051105499268} 11/07/2021 09:07:40 - INFO - __main__ - Step 84676: {'lr': 0.0002042237247072535, 'samples': 16257792, 'steps': 84675, 'loss/train': 1.1149235963821411} 11/07/2021 09:07:40 - INFO - __main__ - Step 84677: {'lr': 0.00020421850769252375, 'samples': 16257984, 'steps': 84676, 'loss/train': 0.8054776191711426} 11/07/2021 09:07:40 - INFO - __main__ - Step 84678: {'lr': 0.00020421329069842246, 'samples': 16258176, 'steps': 84677, 'loss/train': 1.436389684677124} 11/07/2021 09:07:41 - INFO - __main__ - Step 84679: {'lr': 0.00020420807372495192, 'samples': 16258368, 'steps': 84678, 'loss/train': 1.1575031280517578} 11/07/2021 09:07:42 - INFO - __main__ - Step 84680: {'lr': 0.00020420285677211463, 'samples': 16258560, 'steps': 84679, 'loss/train': 0.791327714920044} 11/07/2021 09:07:42 - INFO - __main__ - Step 84681: {'lr': 0.00020419763983991275, 'samples': 16258752, 'steps': 84680, 'loss/train': 1.3203988075256348} 11/07/2021 09:07:42 - INFO - __main__ - Step 84682: {'lr': 0.00020419242292834866, 'samples': 16258944, 'steps': 84681, 'loss/train': 2.6131372451782227} 11/07/2021 09:07:43 - INFO - __main__ - Step 84683: {'lr': 0.00020418720603742477, 'samples': 16259136, 'steps': 84682, 'loss/train': 0.672593355178833} 11/07/2021 09:07:43 - INFO - __main__ - Step 84684: {'lr': 0.00020418198916714343, 'samples': 16259328, 'steps': 84683, 'loss/train': 1.461197018623352} 11/07/2021 09:07:44 - INFO - __main__ - Step 84685: {'lr': 0.00020417677231750696, 'samples': 16259520, 'steps': 84684, 'loss/train': 1.2454251050949097} 11/07/2021 09:07:44 - INFO - __main__ - Step 84686: {'lr': 0.00020417155548851774, 'samples': 16259712, 'steps': 84685, 'loss/train': 1.5096968412399292} 11/07/2021 09:07:45 - INFO - __main__ - Step 84687: {'lr': 0.00020416633868017812, 'samples': 16259904, 'steps': 84686, 'loss/train': 1.2650223970413208} 11/07/2021 09:07:45 - INFO - __main__ - Step 84688: {'lr': 0.00020416112189249042, 'samples': 16260096, 'steps': 84687, 'loss/train': 1.3071868419647217} 11/07/2021 09:07:46 - INFO - __main__ - Step 84689: {'lr': 0.00020415590512545703, 'samples': 16260288, 'steps': 84688, 'loss/train': 1.3686522245407104} 11/07/2021 09:07:47 - INFO - __main__ - Step 84690: {'lr': 0.0002041506883790803, 'samples': 16260480, 'steps': 84689, 'loss/train': 1.3932477235794067} 11/07/2021 09:07:47 - INFO - __main__ - Step 84691: {'lr': 0.0002041454716533625, 'samples': 16260672, 'steps': 84690, 'loss/train': 1.5935925245285034} 11/07/2021 09:07:47 - INFO - __main__ - Step 84692: {'lr': 0.0002041402549483061, 'samples': 16260864, 'steps': 84691, 'loss/train': 1.5710593461990356} 11/07/2021 09:07:48 - INFO - __main__ - Step 84693: {'lr': 0.0002041350382639134, 'samples': 16261056, 'steps': 84692, 'loss/train': 1.5165703296661377} 11/07/2021 09:07:48 - INFO - __main__ - Step 84694: {'lr': 0.00020412982160018678, 'samples': 16261248, 'steps': 84693, 'loss/train': 0.6782932281494141} 11/07/2021 09:07:48 - INFO - __main__ - Step 84695: {'lr': 0.0002041246049571285, 'samples': 16261440, 'steps': 84694, 'loss/train': 1.168516755104065} 11/07/2021 09:07:49 - INFO - __main__ - Step 84696: {'lr': 0.00020411938833474097, 'samples': 16261632, 'steps': 84695, 'loss/train': 1.430998682975769} 11/07/2021 09:07:50 - INFO - __main__ - Step 84697: {'lr': 0.0002041141717330265, 'samples': 16261824, 'steps': 84696, 'loss/train': 1.6865696907043457} 11/07/2021 09:07:50 - INFO - __main__ - Step 84698: {'lr': 0.00020410895515198752, 'samples': 16262016, 'steps': 84697, 'loss/train': 1.4822839498519897} 11/07/2021 09:07:50 - INFO - __main__ - Step 84699: {'lr': 0.0002041037385916263, 'samples': 16262208, 'steps': 84698, 'loss/train': 1.4442052841186523} 11/07/2021 09:07:51 - INFO - __main__ - Step 84700: {'lr': 0.00020409852205194526, 'samples': 16262400, 'steps': 84699, 'loss/train': 1.8815431594848633} 11/07/2021 09:07:52 - INFO - __main__ - Step 84701: {'lr': 0.0002040933055329467, 'samples': 16262592, 'steps': 84700, 'loss/train': 2.2317450046539307} 11/07/2021 09:07:52 - INFO - __main__ - Step 84702: {'lr': 0.000204088089034633, 'samples': 16262784, 'steps': 84701, 'loss/train': 1.4403458833694458} 11/07/2021 09:07:52 - INFO - __main__ - Step 84703: {'lr': 0.00020408287255700648, 'samples': 16262976, 'steps': 84702, 'loss/train': 1.4530926942825317} 11/07/2021 09:07:53 - INFO - __main__ - Step 84704: {'lr': 0.0002040776561000695, 'samples': 16263168, 'steps': 84703, 'loss/train': 1.8077855110168457} 11/07/2021 09:07:53 - INFO - __main__ - Step 84705: {'lr': 0.00020407243966382444, 'samples': 16263360, 'steps': 84704, 'loss/train': 0.5170168280601501} 11/07/2021 09:07:54 - INFO - __main__ - Step 84706: {'lr': 0.00020406722324827365, 'samples': 16263552, 'steps': 84705, 'loss/train': 1.0341696739196777} 11/07/2021 09:07:55 - INFO - __main__ - Step 84707: {'lr': 0.00020406200685341952, 'samples': 16263744, 'steps': 84706, 'loss/train': 1.1833986043930054} 11/07/2021 09:07:55 - INFO - __main__ - Step 84708: {'lr': 0.00020405679047926425, 'samples': 16263936, 'steps': 84707, 'loss/train': 1.5763814449310303} 11/07/2021 09:07:55 - INFO - __main__ - Step 84709: {'lr': 0.0002040515741258103, 'samples': 16264128, 'steps': 84708, 'loss/train': 1.1120140552520752} 11/07/2021 09:07:56 - INFO - __main__ - Step 84710: {'lr': 0.00020404635779305998, 'samples': 16264320, 'steps': 84709, 'loss/train': 1.587100863456726} 11/07/2021 09:07:57 - INFO - __main__ - Step 84711: {'lr': 0.0002040411414810157, 'samples': 16264512, 'steps': 84710, 'loss/train': 1.2812408208847046} 11/07/2021 09:07:57 - INFO - __main__ - Step 84712: {'lr': 0.00020403592518967973, 'samples': 16264704, 'steps': 84711, 'loss/train': 1.3936455249786377} 11/07/2021 09:07:57 - INFO - __main__ - Step 84713: {'lr': 0.0002040307089190545, 'samples': 16264896, 'steps': 84712, 'loss/train': 1.0045639276504517} 11/07/2021 09:07:58 - INFO - __main__ - Step 84714: {'lr': 0.00020402549266914228, 'samples': 16265088, 'steps': 84713, 'loss/train': 1.2881100177764893} 11/07/2021 09:07:58 - INFO - __main__ - Step 84715: {'lr': 0.00020402027643994547, 'samples': 16265280, 'steps': 84714, 'loss/train': 0.6126003861427307} 11/07/2021 09:07:59 - INFO - __main__ - Step 84716: {'lr': 0.00020401506023146643, 'samples': 16265472, 'steps': 84715, 'loss/train': 1.5428484678268433} 11/07/2021 09:07:59 - INFO - __main__ - Step 84717: {'lr': 0.00020400984404370752, 'samples': 16265664, 'steps': 84716, 'loss/train': 1.2581043243408203} 11/07/2021 09:08:00 - INFO - __main__ - Step 84718: {'lr': 0.00020400462787667102, 'samples': 16265856, 'steps': 84717, 'loss/train': 1.277890920639038} 11/07/2021 09:08:00 - INFO - __main__ - Step 84719: {'lr': 0.00020399941173035934, 'samples': 16266048, 'steps': 84718, 'loss/train': 1.4523476362228394} 11/07/2021 09:08:00 - INFO - __main__ - Step 84720: {'lr': 0.00020399419560477493, 'samples': 16266240, 'steps': 84719, 'loss/train': 1.4359462261199951} 11/07/2021 09:08:01 - INFO - __main__ - Step 84721: {'lr': 0.0002039889794999199, 'samples': 16266432, 'steps': 84720, 'loss/train': 1.1932063102722168} 11/07/2021 09:08:02 - INFO - __main__ - Step 84722: {'lr': 0.00020398376341579675, 'samples': 16266624, 'steps': 84721, 'loss/train': 1.305720329284668} 11/07/2021 09:08:02 - INFO - __main__ - Step 84723: {'lr': 0.0002039785473524078, 'samples': 16266816, 'steps': 84722, 'loss/train': 1.3804092407226562} 11/07/2021 09:08:03 - INFO - __main__ - Step 84724: {'lr': 0.0002039733313097554, 'samples': 16267008, 'steps': 84723, 'loss/train': 1.4620338678359985} 11/07/2021 09:08:03 - INFO - __main__ - Step 84725: {'lr': 0.0002039681152878419, 'samples': 16267200, 'steps': 84724, 'loss/train': 1.6692488193511963} 11/07/2021 09:08:03 - INFO - __main__ - Step 84726: {'lr': 0.00020396289928666968, 'samples': 16267392, 'steps': 84725, 'loss/train': 1.4833528995513916} 11/07/2021 09:08:04 - INFO - __main__ - Step 84727: {'lr': 0.00020395768330624104, 'samples': 16267584, 'steps': 84726, 'loss/train': 1.510780692100525} 11/07/2021 09:08:04 - INFO - __main__ - Step 84728: {'lr': 0.00020395246734655837, 'samples': 16267776, 'steps': 84727, 'loss/train': 1.1990407705307007} 11/07/2021 09:08:05 - INFO - __main__ - Step 84729: {'lr': 0.000203947251407624, 'samples': 16267968, 'steps': 84728, 'loss/train': 1.9335665702819824} 11/07/2021 09:08:05 - INFO - __main__ - Step 84730: {'lr': 0.0002039420354894403, 'samples': 16268160, 'steps': 84729, 'loss/train': 1.6090489625930786} 11/07/2021 09:08:06 - INFO - __main__ - Step 84731: {'lr': 0.0002039368195920096, 'samples': 16268352, 'steps': 84730, 'loss/train': 1.7653719186782837} 11/07/2021 09:08:07 - INFO - __main__ - Step 84732: {'lr': 0.00020393160371533426, 'samples': 16268544, 'steps': 84731, 'loss/train': 1.2946304082870483} 11/07/2021 09:08:07 - INFO - __main__ - Step 84733: {'lr': 0.00020392638785941665, 'samples': 16268736, 'steps': 84732, 'loss/train': 1.648516058921814} 11/07/2021 09:08:07 - INFO - __main__ - Step 84734: {'lr': 0.00020392117202425918, 'samples': 16268928, 'steps': 84733, 'loss/train': 1.4427646398544312} 11/07/2021 09:08:08 - INFO - __main__ - Step 84735: {'lr': 0.000203915956209864, 'samples': 16269120, 'steps': 84734, 'loss/train': 1.0805782079696655} 11/07/2021 09:08:08 - INFO - __main__ - Step 84736: {'lr': 0.0002039107404162336, 'samples': 16269312, 'steps': 84735, 'loss/train': 1.9427993297576904} 11/07/2021 09:08:09 - INFO - __main__ - Step 84737: {'lr': 0.0002039055246433703, 'samples': 16269504, 'steps': 84736, 'loss/train': 1.156349539756775} 11/07/2021 09:08:10 - INFO - __main__ - Step 84738: {'lr': 0.00020390030889127649, 'samples': 16269696, 'steps': 84737, 'loss/train': 1.5992486476898193} 11/07/2021 09:08:10 - INFO - __main__ - Step 84739: {'lr': 0.00020389509315995444, 'samples': 16269888, 'steps': 84738, 'loss/train': 2.771066188812256} 11/07/2021 09:08:10 - INFO - __main__ - Step 84740: {'lr': 0.00020388987744940658, 'samples': 16270080, 'steps': 84739, 'loss/train': 2.740124225616455} 11/07/2021 09:08:11 - INFO - __main__ - Step 84741: {'lr': 0.00020388466175963522, 'samples': 16270272, 'steps': 84740, 'loss/train': 1.2677786350250244} 11/07/2021 09:08:11 - INFO - __main__ - Step 84742: {'lr': 0.00020387944609064274, 'samples': 16270464, 'steps': 84741, 'loss/train': 1.0499705076217651} 11/07/2021 09:08:12 - INFO - __main__ - Step 84743: {'lr': 0.00020387423044243143, 'samples': 16270656, 'steps': 84742, 'loss/train': 0.7033506035804749} 11/07/2021 09:08:12 - INFO - __main__ - Step 84744: {'lr': 0.0002038690148150037, 'samples': 16270848, 'steps': 84743, 'loss/train': 1.2747849225997925} 11/07/2021 09:08:13 - INFO - __main__ - Step 84745: {'lr': 0.0002038637992083619, 'samples': 16271040, 'steps': 84744, 'loss/train': 1.164971113204956} 11/07/2021 09:08:13 - INFO - __main__ - Step 84746: {'lr': 0.00020385858362250832, 'samples': 16271232, 'steps': 84745, 'loss/train': 1.4792232513427734} 11/07/2021 09:08:14 - INFO - __main__ - Step 84747: {'lr': 0.0002038533680574455, 'samples': 16271424, 'steps': 84746, 'loss/train': 1.6238281726837158} 11/07/2021 09:08:15 - INFO - __main__ - Step 84748: {'lr': 0.0002038481525131755, 'samples': 16271616, 'steps': 84747, 'loss/train': 1.4536504745483398} 11/07/2021 09:08:15 - INFO - __main__ - Step 84749: {'lr': 0.00020384293698970087, 'samples': 16271808, 'steps': 84748, 'loss/train': 1.3286758661270142} 11/07/2021 09:08:15 - INFO - __main__ - Step 84750: {'lr': 0.00020383772148702383, 'samples': 16272000, 'steps': 84749, 'loss/train': 0.9421212673187256} 11/07/2021 09:08:16 - INFO - __main__ - Step 84751: {'lr': 0.00020383250600514684, 'samples': 16272192, 'steps': 84750, 'loss/train': 1.6683181524276733} 11/07/2021 09:08:16 - INFO - __main__ - Step 84752: {'lr': 0.00020382729054407218, 'samples': 16272384, 'steps': 84751, 'loss/train': 1.6785221099853516} 11/07/2021 09:08:16 - INFO - __main__ - Step 84753: {'lr': 0.00020382207510380223, 'samples': 16272576, 'steps': 84752, 'loss/train': 1.4113918542861938} 11/07/2021 09:08:17 - INFO - __main__ - Step 84754: {'lr': 0.0002038168596843394, 'samples': 16272768, 'steps': 84753, 'loss/train': 1.1901564598083496} 11/07/2021 09:08:18 - INFO - __main__ - Step 84755: {'lr': 0.00020381164428568592, 'samples': 16272960, 'steps': 84754, 'loss/train': 1.0588582754135132} 11/07/2021 09:08:18 - INFO - __main__ - Step 84756: {'lr': 0.0002038064289078442, 'samples': 16273152, 'steps': 84755, 'loss/train': 1.6931915283203125} 11/07/2021 09:08:18 - INFO - __main__ - Step 84757: {'lr': 0.0002038012135508166, 'samples': 16273344, 'steps': 84756, 'loss/train': 1.5724453926086426} 11/07/2021 09:08:19 - INFO - __main__ - Step 84758: {'lr': 0.00020379599821460548, 'samples': 16273536, 'steps': 84757, 'loss/train': 1.4129855632781982} 11/07/2021 09:08:20 - INFO - __main__ - Step 84759: {'lr': 0.0002037907828992132, 'samples': 16273728, 'steps': 84758, 'loss/train': 1.3712419271469116} 11/07/2021 09:08:21 - INFO - __main__ - Step 84760: {'lr': 0.00020378556760464205, 'samples': 16273920, 'steps': 84759, 'loss/train': 1.1952331066131592} 11/07/2021 09:08:21 - INFO - __main__ - Step 84761: {'lr': 0.00020378035233089444, 'samples': 16274112, 'steps': 84760, 'loss/train': 1.1533946990966797} 11/07/2021 09:08:21 - INFO - __main__ - Step 84762: {'lr': 0.00020377513707797265, 'samples': 16274304, 'steps': 84761, 'loss/train': 0.9749341607093811} 11/07/2021 09:08:22 - INFO - __main__ - Step 84763: {'lr': 0.00020376992184587908, 'samples': 16274496, 'steps': 84762, 'loss/train': 1.608349084854126} 11/07/2021 09:08:22 - INFO - __main__ - Step 84764: {'lr': 0.00020376470663461605, 'samples': 16274688, 'steps': 84763, 'loss/train': 0.8886701464653015} 11/07/2021 09:08:23 - INFO - __main__ - Step 84765: {'lr': 0.00020375949144418594, 'samples': 16274880, 'steps': 84764, 'loss/train': 1.0311672687530518} 11/07/2021 09:08:23 - INFO - __main__ - Step 84766: {'lr': 0.0002037542762745911, 'samples': 16275072, 'steps': 84765, 'loss/train': 1.2092828750610352} 11/07/2021 09:08:24 - INFO - __main__ - Step 84767: {'lr': 0.00020374906112583386, 'samples': 16275264, 'steps': 84766, 'loss/train': 1.2217990159988403} 11/07/2021 09:08:24 - INFO - __main__ - Step 84768: {'lr': 0.00020374384599791657, 'samples': 16275456, 'steps': 84767, 'loss/train': 0.058880679309368134} 11/07/2021 09:08:24 - INFO - __main__ - Step 84769: {'lr': 0.0002037386308908416, 'samples': 16275648, 'steps': 84768, 'loss/train': 1.3255075216293335} 11/07/2021 09:08:25 - INFO - __main__ - Step 84770: {'lr': 0.00020373341580461133, 'samples': 16275840, 'steps': 84769, 'loss/train': 1.3495829105377197} 11/07/2021 09:08:26 - INFO - __main__ - Step 84771: {'lr': 0.00020372820073922803, 'samples': 16276032, 'steps': 84770, 'loss/train': 1.2243717908859253} 11/07/2021 09:08:26 - INFO - __main__ - Step 84772: {'lr': 0.0002037229856946941, 'samples': 16276224, 'steps': 84771, 'loss/train': 1.3887602090835571} 11/07/2021 09:08:26 - INFO - __main__ - Step 84773: {'lr': 0.00020371777067101183, 'samples': 16276416, 'steps': 84772, 'loss/train': 0.9809748530387878} 11/07/2021 09:08:27 - INFO - __main__ - Step 84774: {'lr': 0.00020371255566818368, 'samples': 16276608, 'steps': 84773, 'loss/train': 1.551694631576538} 11/07/2021 09:08:28 - INFO - __main__ - Step 84775: {'lr': 0.00020370734068621193, 'samples': 16276800, 'steps': 84774, 'loss/train': 1.5197474956512451} 11/07/2021 09:08:28 - INFO - __main__ - Step 84776: {'lr': 0.00020370212572509892, 'samples': 16276992, 'steps': 84775, 'loss/train': 1.513608694076538} 11/07/2021 09:08:29 - INFO - __main__ - Step 84777: {'lr': 0.00020369691078484702, 'samples': 16277184, 'steps': 84776, 'loss/train': 1.2917624711990356} 11/07/2021 09:08:29 - INFO - __main__ - Step 84778: {'lr': 0.00020369169586545856, 'samples': 16277376, 'steps': 84777, 'loss/train': 1.5303281545639038} 11/07/2021 09:08:29 - INFO - __main__ - Step 84779: {'lr': 0.00020368648096693592, 'samples': 16277568, 'steps': 84778, 'loss/train': 1.423628568649292} 11/07/2021 09:08:30 - INFO - __main__ - Step 84780: {'lr': 0.00020368126608928146, 'samples': 16277760, 'steps': 84779, 'loss/train': 1.6535595655441284} 11/07/2021 09:08:31 - INFO - __main__ - Step 84781: {'lr': 0.00020367605123249749, 'samples': 16277952, 'steps': 84780, 'loss/train': 1.3231265544891357} 11/07/2021 09:08:31 - INFO - __main__ - Step 84782: {'lr': 0.0002036708363965864, 'samples': 16278144, 'steps': 84781, 'loss/train': 1.164824366569519} 11/07/2021 09:08:31 - INFO - __main__ - Step 84783: {'lr': 0.00020366562158155048, 'samples': 16278336, 'steps': 84782, 'loss/train': 1.4330953359603882} 11/07/2021 09:08:32 - INFO - __main__ - Step 84784: {'lr': 0.0002036604067873921, 'samples': 16278528, 'steps': 84783, 'loss/train': 1.6105175018310547} 11/07/2021 09:08:33 - INFO - __main__ - Step 84785: {'lr': 0.00020365519201411364, 'samples': 16278720, 'steps': 84784, 'loss/train': 1.0671794414520264} 11/07/2021 09:08:33 - INFO - __main__ - Step 84786: {'lr': 0.00020364997726171746, 'samples': 16278912, 'steps': 84785, 'loss/train': 1.1897993087768555} 11/07/2021 09:08:33 - INFO - __main__ - Step 84787: {'lr': 0.00020364476253020587, 'samples': 16279104, 'steps': 84786, 'loss/train': 1.3872689008712769} 11/07/2021 09:08:34 - INFO - __main__ - Step 84788: {'lr': 0.00020363954781958126, 'samples': 16279296, 'steps': 84787, 'loss/train': 1.8563098907470703} 11/07/2021 09:08:34 - INFO - __main__ - Step 84789: {'lr': 0.00020363433312984596, 'samples': 16279488, 'steps': 84788, 'loss/train': 1.2561671733856201} 11/07/2021 09:08:35 - INFO - __main__ - Step 84790: {'lr': 0.00020362911846100228, 'samples': 16279680, 'steps': 84789, 'loss/train': 1.2398838996887207} 11/07/2021 09:08:35 - INFO - __main__ - Step 84791: {'lr': 0.00020362390381305256, 'samples': 16279872, 'steps': 84790, 'loss/train': 1.2470568418502808} 11/07/2021 09:08:36 - INFO - __main__ - Step 84792: {'lr': 0.0002036186891859993, 'samples': 16280064, 'steps': 84791, 'loss/train': 1.366943597793579} 11/07/2021 09:08:36 - INFO - __main__ - Step 84793: {'lr': 0.0002036134745798447, 'samples': 16280256, 'steps': 84792, 'loss/train': 1.3442431688308716} 11/07/2021 09:08:36 - INFO - __main__ - Step 84794: {'lr': 0.00020360825999459113, 'samples': 16280448, 'steps': 84793, 'loss/train': 1.065705418586731} 11/07/2021 09:08:37 - INFO - __main__ - Step 84795: {'lr': 0.00020360304543024096, 'samples': 16280640, 'steps': 84794, 'loss/train': 1.534156084060669} 11/07/2021 09:08:38 - INFO - __main__ - Step 84796: {'lr': 0.00020359783088679654, 'samples': 16280832, 'steps': 84795, 'loss/train': 0.7189258337020874} 11/07/2021 09:08:38 - INFO - __main__ - Step 84797: {'lr': 0.00020359261636426025, 'samples': 16281024, 'steps': 84796, 'loss/train': 1.7053591012954712} 11/07/2021 09:08:39 - INFO - __main__ - Step 84798: {'lr': 0.00020358740186263437, 'samples': 16281216, 'steps': 84797, 'loss/train': 1.9062641859054565} 11/07/2021 09:08:39 - INFO - __main__ - Step 84799: {'lr': 0.0002035821873819213, 'samples': 16281408, 'steps': 84798, 'loss/train': 1.4764999151229858} 11/07/2021 09:08:39 - INFO - __main__ - Step 84800: {'lr': 0.00020357697292212342, 'samples': 16281600, 'steps': 84799, 'loss/train': 1.6635867357254028} 11/07/2021 09:08:40 - INFO - __main__ - Step 84801: {'lr': 0.00020357175848324306, 'samples': 16281792, 'steps': 84800, 'loss/train': 1.060329556465149} 11/07/2021 09:08:41 - INFO - __main__ - Step 84802: {'lr': 0.00020356654406528246, 'samples': 16281984, 'steps': 84801, 'loss/train': 1.6405928134918213} 11/07/2021 09:08:41 - INFO - __main__ - Step 84803: {'lr': 0.00020356132966824417, 'samples': 16282176, 'steps': 84802, 'loss/train': 1.0684555768966675} 11/07/2021 09:08:41 - INFO - __main__ - Step 84804: {'lr': 0.00020355611529213036, 'samples': 16282368, 'steps': 84803, 'loss/train': 1.4137977361679077} 11/07/2021 09:08:42 - INFO - __main__ - Step 84805: {'lr': 0.00020355090093694342, 'samples': 16282560, 'steps': 84804, 'loss/train': 1.250025749206543} 11/07/2021 09:08:43 - INFO - __main__ - Step 84806: {'lr': 0.00020354568660268578, 'samples': 16282752, 'steps': 84805, 'loss/train': 1.5986745357513428} 11/07/2021 09:08:43 - INFO - __main__ - Step 84807: {'lr': 0.00020354047228935969, 'samples': 16282944, 'steps': 84806, 'loss/train': 1.5303072929382324} 11/07/2021 09:08:43 - INFO - __main__ - Step 84808: {'lr': 0.00020353525799696756, 'samples': 16283136, 'steps': 84807, 'loss/train': 1.7022050619125366} 11/07/2021 09:08:44 - INFO - __main__ - Step 84809: {'lr': 0.00020353004372551173, 'samples': 16283328, 'steps': 84808, 'loss/train': 1.6719096899032593} 11/07/2021 09:08:44 - INFO - __main__ - Step 84810: {'lr': 0.00020352482947499453, 'samples': 16283520, 'steps': 84809, 'loss/train': 1.0262938737869263} 11/07/2021 09:08:45 - INFO - __main__ - Step 84811: {'lr': 0.00020351961524541835, 'samples': 16283712, 'steps': 84810, 'loss/train': 1.7322380542755127} 11/07/2021 09:08:45 - INFO - __main__ - Step 84812: {'lr': 0.0002035144010367855, 'samples': 16283904, 'steps': 84811, 'loss/train': 1.1879498958587646} 11/07/2021 09:08:46 - INFO - __main__ - Step 84813: {'lr': 0.00020350918684909836, 'samples': 16284096, 'steps': 84812, 'loss/train': 1.6161328554153442} 11/07/2021 09:08:46 - INFO - __main__ - Step 84814: {'lr': 0.00020350397268235922, 'samples': 16284288, 'steps': 84813, 'loss/train': 1.4378583431243896} 11/07/2021 09:08:47 - INFO - __main__ - Step 84815: {'lr': 0.00020349875853657061, 'samples': 16284480, 'steps': 84814, 'loss/train': 1.202158808708191} 11/07/2021 09:08:47 - INFO - __main__ - Step 84816: {'lr': 0.00020349354441173464, 'samples': 16284672, 'steps': 84815, 'loss/train': 1.7267265319824219} 11/07/2021 09:08:48 - INFO - __main__ - Step 84817: {'lr': 0.00020348833030785378, 'samples': 16284864, 'steps': 84816, 'loss/train': 1.5801564455032349} 11/07/2021 09:08:48 - INFO - __main__ - Step 84818: {'lr': 0.00020348311622493033, 'samples': 16285056, 'steps': 84817, 'loss/train': 1.730839729309082} 11/07/2021 09:08:49 - INFO - __main__ - Step 84819: {'lr': 0.00020347790216296665, 'samples': 16285248, 'steps': 84818, 'loss/train': 1.535199522972107} 11/07/2021 09:08:49 - INFO - __main__ - Step 84820: {'lr': 0.00020347268812196515, 'samples': 16285440, 'steps': 84819, 'loss/train': 1.2657098770141602} 11/07/2021 09:08:49 - INFO - __main__ - Step 84821: {'lr': 0.0002034674741019281, 'samples': 16285632, 'steps': 84820, 'loss/train': 1.41207754611969} 11/07/2021 09:08:51 - INFO - __main__ - Step 84822: {'lr': 0.00020346226010285794, 'samples': 16285824, 'steps': 84821, 'loss/train': 1.338781714439392} 11/07/2021 09:08:51 - INFO - __main__ - Step 84823: {'lr': 0.00020345704612475694, 'samples': 16286016, 'steps': 84822, 'loss/train': 1.286590576171875} 11/07/2021 09:08:51 - INFO - __main__ - Step 84824: {'lr': 0.00020345183216762748, 'samples': 16286208, 'steps': 84823, 'loss/train': 1.4282617568969727} 11/07/2021 09:08:52 - INFO - __main__ - Step 84825: {'lr': 0.0002034466182314719, 'samples': 16286400, 'steps': 84824, 'loss/train': 1.4576377868652344} 11/07/2021 09:08:52 - INFO - __main__ - Step 84826: {'lr': 0.00020344140431629256, 'samples': 16286592, 'steps': 84825, 'loss/train': 1.6532951593399048} 11/07/2021 09:08:52 - INFO - __main__ - Step 84827: {'lr': 0.00020343619042209176, 'samples': 16286784, 'steps': 84826, 'loss/train': 1.6153017282485962} 11/07/2021 09:08:53 - INFO - __main__ - Step 84828: {'lr': 0.0002034309765488721, 'samples': 16286976, 'steps': 84827, 'loss/train': 1.9386768341064453} 11/07/2021 09:08:54 - INFO - __main__ - Step 84829: {'lr': 0.00020342576269663553, 'samples': 16287168, 'steps': 84828, 'loss/train': 1.4676299095153809} 11/07/2021 09:08:54 - INFO - __main__ - Step 84830: {'lr': 0.00020342054886538465, 'samples': 16287360, 'steps': 84829, 'loss/train': 1.3738651275634766} 11/07/2021 09:08:54 - INFO - __main__ - Step 84831: {'lr': 0.0002034153350551217, 'samples': 16287552, 'steps': 84830, 'loss/train': 1.2147921323776245} 11/07/2021 09:08:55 - INFO - __main__ - Step 84832: {'lr': 0.0002034101212658491, 'samples': 16287744, 'steps': 84831, 'loss/train': 1.3541024923324585} 11/07/2021 09:08:56 - INFO - __main__ - Step 84833: {'lr': 0.0002034049074975692, 'samples': 16287936, 'steps': 84832, 'loss/train': 1.3047187328338623} 11/07/2021 09:08:56 - INFO - __main__ - Step 84834: {'lr': 0.0002033996937502843, 'samples': 16288128, 'steps': 84833, 'loss/train': 1.490757703781128} 11/07/2021 09:08:56 - INFO - __main__ - Step 84835: {'lr': 0.00020339448002399679, 'samples': 16288320, 'steps': 84834, 'loss/train': 1.1195528507232666} 11/07/2021 09:08:57 - INFO - __main__ - Step 84836: {'lr': 0.00020338926631870903, 'samples': 16288512, 'steps': 84835, 'loss/train': 0.7873112559318542} 11/07/2021 09:08:57 - INFO - __main__ - Step 84837: {'lr': 0.00020338405263442333, 'samples': 16288704, 'steps': 84836, 'loss/train': 1.5338624715805054} 11/07/2021 09:08:58 - INFO - __main__ - Step 84838: {'lr': 0.00020337883897114203, 'samples': 16288896, 'steps': 84837, 'loss/train': 1.4205399751663208} 11/07/2021 09:08:59 - INFO - __main__ - Step 84839: {'lr': 0.00020337362532886756, 'samples': 16289088, 'steps': 84838, 'loss/train': 0.34720703959465027} 11/07/2021 09:08:59 - INFO - __main__ - Step 84840: {'lr': 0.00020336841170760217, 'samples': 16289280, 'steps': 84839, 'loss/train': 1.6851812601089478} 11/07/2021 09:08:59 - INFO - __main__ - Step 84841: {'lr': 0.00020336319810734837, 'samples': 16289472, 'steps': 84840, 'loss/train': 1.5633416175842285} 11/07/2021 09:09:00 - INFO - __main__ - Step 84842: {'lr': 0.0002033579845281083, 'samples': 16289664, 'steps': 84841, 'loss/train': 1.0413107872009277} 11/07/2021 09:09:01 - INFO - __main__ - Step 84843: {'lr': 0.0002033527709698844, 'samples': 16289856, 'steps': 84842, 'loss/train': 1.3838860988616943} 11/07/2021 09:09:01 - INFO - __main__ - Step 84844: {'lr': 0.00020334755743267903, 'samples': 16290048, 'steps': 84843, 'loss/train': 1.2604529857635498} 11/07/2021 09:09:01 - INFO - __main__ - Step 84845: {'lr': 0.0002033423439164945, 'samples': 16290240, 'steps': 84844, 'loss/train': 1.8008925914764404} 11/07/2021 09:09:02 - INFO - __main__ - Step 84846: {'lr': 0.00020333713042133323, 'samples': 16290432, 'steps': 84845, 'loss/train': 0.9528002142906189} 11/07/2021 09:09:02 - INFO - __main__ - Step 84847: {'lr': 0.0002033319169471975, 'samples': 16290624, 'steps': 84846, 'loss/train': 1.4962753057479858} 11/07/2021 09:09:02 - INFO - __main__ - Step 84848: {'lr': 0.00020332670349408968, 'samples': 16290816, 'steps': 84847, 'loss/train': 1.624157190322876} 11/07/2021 09:09:03 - INFO - __main__ - Step 84849: {'lr': 0.00020332149006201217, 'samples': 16291008, 'steps': 84848, 'loss/train': 1.6747825145721436} 11/07/2021 09:09:04 - INFO - __main__ - Step 84850: {'lr': 0.00020331627665096723, 'samples': 16291200, 'steps': 84849, 'loss/train': 1.013818621635437} 11/07/2021 09:09:04 - INFO - __main__ - Step 84851: {'lr': 0.00020331106326095728, 'samples': 16291392, 'steps': 84850, 'loss/train': 1.1401488780975342} 11/07/2021 09:09:04 - INFO - __main__ - Step 84852: {'lr': 0.00020330584989198465, 'samples': 16291584, 'steps': 84851, 'loss/train': 1.4536664485931396} 11/07/2021 09:09:05 - INFO - __main__ - Step 84853: {'lr': 0.0002033006365440517, 'samples': 16291776, 'steps': 84852, 'loss/train': 1.081992268562317} 11/07/2021 09:09:06 - INFO - __main__ - Step 84854: {'lr': 0.0002032954232171607, 'samples': 16291968, 'steps': 84853, 'loss/train': 1.0688083171844482} 11/07/2021 09:09:06 - INFO - __main__ - Step 84855: {'lr': 0.0002032902099113142, 'samples': 16292160, 'steps': 84854, 'loss/train': 1.731330394744873} 11/07/2021 09:09:07 - INFO - __main__ - Step 84856: {'lr': 0.0002032849966265143, 'samples': 16292352, 'steps': 84855, 'loss/train': 1.5547125339508057} 11/07/2021 09:09:07 - INFO - __main__ - Step 84857: {'lr': 0.0002032797833627635, 'samples': 16292544, 'steps': 84856, 'loss/train': 1.6479320526123047} 11/07/2021 09:09:07 - INFO - __main__ - Step 84858: {'lr': 0.00020327457012006407, 'samples': 16292736, 'steps': 84857, 'loss/train': 1.377415418624878} 11/07/2021 09:09:08 - INFO - __main__ - Step 84859: {'lr': 0.00020326935689841838, 'samples': 16292928, 'steps': 84858, 'loss/train': 1.5684731006622314} 11/07/2021 09:09:09 - INFO - __main__ - Step 84860: {'lr': 0.00020326414369782885, 'samples': 16293120, 'steps': 84859, 'loss/train': 1.5242202281951904} 11/07/2021 09:09:09 - INFO - __main__ - Step 84861: {'lr': 0.00020325893051829772, 'samples': 16293312, 'steps': 84860, 'loss/train': 1.5942866802215576} 11/07/2021 09:09:09 - INFO - __main__ - Step 84862: {'lr': 0.0002032537173598274, 'samples': 16293504, 'steps': 84861, 'loss/train': 1.2247488498687744} 11/07/2021 09:09:10 - INFO - __main__ - Step 84863: {'lr': 0.00020324850422242028, 'samples': 16293696, 'steps': 84862, 'loss/train': 1.602177381515503} 11/07/2021 09:09:11 - INFO - __main__ - Step 84864: {'lr': 0.00020324329110607865, 'samples': 16293888, 'steps': 84863, 'loss/train': 1.2870805263519287} 11/07/2021 09:09:11 - INFO - __main__ - Step 84865: {'lr': 0.00020323807801080486, 'samples': 16294080, 'steps': 84864, 'loss/train': 1.4621485471725464} 11/07/2021 09:09:11 - INFO - __main__ - Step 84866: {'lr': 0.00020323286493660126, 'samples': 16294272, 'steps': 84865, 'loss/train': 0.928322970867157} 11/07/2021 09:09:12 - INFO - __main__ - Step 84867: {'lr': 0.0002032276518834702, 'samples': 16294464, 'steps': 84866, 'loss/train': 1.180715799331665} 11/07/2021 09:09:12 - INFO - __main__ - Step 84868: {'lr': 0.00020322243885141417, 'samples': 16294656, 'steps': 84867, 'loss/train': 1.2845160961151123} 11/07/2021 09:09:13 - INFO - __main__ - Step 84869: {'lr': 0.00020321722584043528, 'samples': 16294848, 'steps': 84868, 'loss/train': 1.8774000406265259} 11/07/2021 09:09:14 - INFO - __main__ - Step 84870: {'lr': 0.000203212012850536, 'samples': 16295040, 'steps': 84869, 'loss/train': 1.6205332279205322} 11/07/2021 09:09:14 - INFO - __main__ - Step 84871: {'lr': 0.00020320679988171863, 'samples': 16295232, 'steps': 84870, 'loss/train': 0.976809024810791} 11/07/2021 09:09:14 - INFO - __main__ - Step 84872: {'lr': 0.00020320158693398554, 'samples': 16295424, 'steps': 84871, 'loss/train': 0.8808175325393677} 11/07/2021 09:09:15 - INFO - __main__ - Step 84873: {'lr': 0.00020319637400733915, 'samples': 16295616, 'steps': 84872, 'loss/train': 1.5258547067642212} 11/07/2021 09:09:15 - INFO - __main__ - Step 84874: {'lr': 0.0002031911611017817, 'samples': 16295808, 'steps': 84873, 'loss/train': 1.4578590393066406} 11/07/2021 09:09:16 - INFO - __main__ - Step 84875: {'lr': 0.0002031859482173156, 'samples': 16296000, 'steps': 84874, 'loss/train': 1.1946828365325928} 11/07/2021 09:09:16 - INFO - __main__ - Step 84876: {'lr': 0.00020318073535394325, 'samples': 16296192, 'steps': 84875, 'loss/train': 1.0452371835708618} 11/07/2021 09:09:17 - INFO - __main__ - Step 84877: {'lr': 0.00020317552251166687, 'samples': 16296384, 'steps': 84876, 'loss/train': 1.2498570680618286} 11/07/2021 09:09:17 - INFO - __main__ - Step 84878: {'lr': 0.00020317030969048888, 'samples': 16296576, 'steps': 84877, 'loss/train': 1.0675491094589233} 11/07/2021 09:09:17 - INFO - __main__ - Step 84879: {'lr': 0.00020316509689041168, 'samples': 16296768, 'steps': 84878, 'loss/train': 0.7886032462120056} 11/07/2021 09:09:19 - INFO - __main__ - Step 84880: {'lr': 0.00020315988411143753, 'samples': 16296960, 'steps': 84879, 'loss/train': 0.7349135279655457} 11/07/2021 09:09:19 - INFO - __main__ - Step 84881: {'lr': 0.0002031546713535688, 'samples': 16297152, 'steps': 84880, 'loss/train': 1.413413166999817} 11/07/2021 09:09:19 - INFO - __main__ - Step 84882: {'lr': 0.00020314945861680798, 'samples': 16297344, 'steps': 84881, 'loss/train': 1.8268293142318726} 11/07/2021 09:09:20 - INFO - __main__ - Step 84883: {'lr': 0.00020314424590115715, 'samples': 16297536, 'steps': 84882, 'loss/train': 0.8777966499328613} 11/07/2021 09:09:20 - INFO - __main__ - Step 84884: {'lr': 0.00020313903320661885, 'samples': 16297728, 'steps': 84883, 'loss/train': 1.060203194618225} 11/07/2021 09:09:21 - INFO - __main__ - Step 84885: {'lr': 0.00020313382053319535, 'samples': 16297920, 'steps': 84884, 'loss/train': 1.148055911064148} 11/07/2021 09:09:21 - INFO - __main__ - Step 84886: {'lr': 0.00020312860788088903, 'samples': 16298112, 'steps': 84885, 'loss/train': 1.1471353769302368} 11/07/2021 09:09:22 - INFO - __main__ - Step 84887: {'lr': 0.00020312339524970226, 'samples': 16298304, 'steps': 84886, 'loss/train': 1.323345422744751} 11/07/2021 09:09:22 - INFO - __main__ - Step 84888: {'lr': 0.00020311818263963732, 'samples': 16298496, 'steps': 84887, 'loss/train': 0.6196034550666809} 11/07/2021 09:09:22 - INFO - __main__ - Step 84889: {'lr': 0.00020311297005069662, 'samples': 16298688, 'steps': 84888, 'loss/train': 1.403769612312317} 11/07/2021 09:09:23 - INFO - __main__ - Step 84890: {'lr': 0.0002031077574828825, 'samples': 16298880, 'steps': 84889, 'loss/train': 1.22520112991333} 11/07/2021 09:09:24 - INFO - __main__ - Step 84891: {'lr': 0.00020310254493619728, 'samples': 16299072, 'steps': 84890, 'loss/train': 1.515282154083252} 11/07/2021 09:09:24 - INFO - __main__ - Step 84892: {'lr': 0.00020309733241064337, 'samples': 16299264, 'steps': 84891, 'loss/train': 1.4216947555541992} 11/07/2021 09:09:24 - INFO - __main__ - Step 84893: {'lr': 0.00020309211990622307, 'samples': 16299456, 'steps': 84892, 'loss/train': 0.7780171632766724} 11/07/2021 09:09:25 - INFO - __main__ - Step 84894: {'lr': 0.00020308690742293876, 'samples': 16299648, 'steps': 84893, 'loss/train': 0.7676221132278442} 11/07/2021 09:09:26 - INFO - __main__ - Step 84895: {'lr': 0.0002030816949607928, 'samples': 16299840, 'steps': 84894, 'loss/train': 0.8037959933280945} 11/07/2021 09:09:26 - INFO - __main__ - Step 84896: {'lr': 0.00020307648251978742, 'samples': 16300032, 'steps': 84895, 'loss/train': 1.4835824966430664} 11/07/2021 09:09:27 - INFO - __main__ - Step 84897: {'lr': 0.00020307127009992505, 'samples': 16300224, 'steps': 84896, 'loss/train': 1.5767024755477905} 11/07/2021 09:09:27 - INFO - __main__ - Step 84898: {'lr': 0.00020306605770120805, 'samples': 16300416, 'steps': 84897, 'loss/train': 1.5103532075881958} 11/07/2021 09:09:27 - INFO - __main__ - Step 84899: {'lr': 0.00020306084532363878, 'samples': 16300608, 'steps': 84898, 'loss/train': 1.2994376420974731} 11/07/2021 09:09:28 - INFO - __main__ - Step 84900: {'lr': 0.00020305563296721957, 'samples': 16300800, 'steps': 84899, 'loss/train': 1.296623706817627} 11/07/2021 09:09:29 - INFO - __main__ - Step 84901: {'lr': 0.00020305042063195275, 'samples': 16300992, 'steps': 84900, 'loss/train': 1.7140291929244995} 11/07/2021 09:09:29 - INFO - __main__ - Step 84902: {'lr': 0.00020304520831784068, 'samples': 16301184, 'steps': 84901, 'loss/train': 1.4662811756134033} 11/07/2021 09:09:29 - INFO - __main__ - Step 84903: {'lr': 0.00020303999602488574, 'samples': 16301376, 'steps': 84902, 'loss/train': 1.6441982984542847} 11/07/2021 09:09:30 - INFO - __main__ - Step 84904: {'lr': 0.00020303478375309023, 'samples': 16301568, 'steps': 84903, 'loss/train': 1.18998122215271} 11/07/2021 09:09:31 - INFO - __main__ - Step 84905: {'lr': 0.00020302957150245658, 'samples': 16301760, 'steps': 84904, 'loss/train': 1.6065773963928223} 11/07/2021 09:09:31 - INFO - __main__ - Step 84906: {'lr': 0.000203024359272987, 'samples': 16301952, 'steps': 84905, 'loss/train': 1.8771822452545166} 11/07/2021 09:09:31 - INFO - __main__ - Step 84907: {'lr': 0.00020301914706468397, 'samples': 16302144, 'steps': 84906, 'loss/train': 1.4722211360931396} 11/07/2021 09:09:32 - INFO - __main__ - Step 84908: {'lr': 0.00020301393487754977, 'samples': 16302336, 'steps': 84907, 'loss/train': 1.2470427751541138} 11/07/2021 09:09:32 - INFO - __main__ - Step 84909: {'lr': 0.00020300872271158683, 'samples': 16302528, 'steps': 84908, 'loss/train': 1.562495231628418} 11/07/2021 09:09:32 - INFO - __main__ - Step 84910: {'lr': 0.00020300351056679736, 'samples': 16302720, 'steps': 84909, 'loss/train': 1.5958503484725952} 11/07/2021 09:09:33 - INFO - __main__ - Step 84911: {'lr': 0.0002029982984431838, 'samples': 16302912, 'steps': 84910, 'loss/train': 1.1249332427978516} 11/07/2021 09:09:34 - INFO - __main__ - Step 84912: {'lr': 0.00020299308634074846, 'samples': 16303104, 'steps': 84911, 'loss/train': 1.3753554821014404} 11/07/2021 09:09:34 - INFO - __main__ - Step 84913: {'lr': 0.00020298787425949372, 'samples': 16303296, 'steps': 84912, 'loss/train': 1.6078221797943115} 11/07/2021 09:09:34 - INFO - __main__ - Step 84914: {'lr': 0.0002029826621994219, 'samples': 16303488, 'steps': 84913, 'loss/train': 1.3981242179870605} 11/07/2021 09:09:35 - INFO - __main__ - Step 84915: {'lr': 0.00020297745016053539, 'samples': 16303680, 'steps': 84914, 'loss/train': 1.4432477951049805} 11/07/2021 09:09:36 - INFO - __main__ - Step 84916: {'lr': 0.00020297223814283658, 'samples': 16303872, 'steps': 84915, 'loss/train': 1.4743127822875977} 11/07/2021 09:09:36 - INFO - __main__ - Step 84917: {'lr': 0.00020296702614632767, 'samples': 16304064, 'steps': 84916, 'loss/train': 0.8591583967208862} 11/07/2021 09:09:37 - INFO - __main__ - Step 84918: {'lr': 0.0002029618141710111, 'samples': 16304256, 'steps': 84917, 'loss/train': 1.3820152282714844} 11/07/2021 09:09:37 - INFO - __main__ - Step 84919: {'lr': 0.00020295660221688922, 'samples': 16304448, 'steps': 84918, 'loss/train': 1.560390591621399} 11/07/2021 09:09:37 - INFO - __main__ - Step 84920: {'lr': 0.00020295139028396437, 'samples': 16304640, 'steps': 84919, 'loss/train': 1.0968658924102783} 11/07/2021 09:09:38 - INFO - __main__ - Step 84921: {'lr': 0.0002029461783722389, 'samples': 16304832, 'steps': 84920, 'loss/train': 1.7644782066345215} 11/07/2021 09:09:39 - INFO - __main__ - Step 84922: {'lr': 0.0002029409664817152, 'samples': 16305024, 'steps': 84921, 'loss/train': 1.5931841135025024} 11/07/2021 09:09:39 - INFO - __main__ - Step 84923: {'lr': 0.00020293575461239553, 'samples': 16305216, 'steps': 84922, 'loss/train': 1.481278419494629} 11/07/2021 09:09:39 - INFO - __main__ - Step 84924: {'lr': 0.00020293054276428226, 'samples': 16305408, 'steps': 84923, 'loss/train': 1.3648643493652344} 11/07/2021 09:09:40 - INFO - __main__ - Step 84925: {'lr': 0.0002029253309373778, 'samples': 16305600, 'steps': 84924, 'loss/train': 0.9249489903450012} 11/07/2021 09:09:40 - INFO - __main__ - Step 84926: {'lr': 0.00020292011913168449, 'samples': 16305792, 'steps': 84925, 'loss/train': 5.7274274826049805} 11/07/2021 09:09:41 - INFO - __main__ - Step 84927: {'lr': 0.0002029149073472046, 'samples': 16305984, 'steps': 84926, 'loss/train': 1.3225106000900269} 11/07/2021 09:09:41 - INFO - __main__ - Step 84928: {'lr': 0.00020290969558394052, 'samples': 16306176, 'steps': 84927, 'loss/train': 1.2447515726089478} 11/07/2021 09:09:42 - INFO - __main__ - Step 84929: {'lr': 0.00020290448384189462, 'samples': 16306368, 'steps': 84928, 'loss/train': 1.0437366962432861} 11/07/2021 09:09:42 - INFO - __main__ - Step 84930: {'lr': 0.0002028992721210692, 'samples': 16306560, 'steps': 84929, 'loss/train': 1.6757090091705322} 11/07/2021 09:09:42 - INFO - __main__ - Step 84931: {'lr': 0.00020289406042146667, 'samples': 16306752, 'steps': 84930, 'loss/train': 1.5192018747329712} 11/07/2021 09:09:44 - INFO - __main__ - Step 84932: {'lr': 0.00020288884874308932, 'samples': 16306944, 'steps': 84931, 'loss/train': 1.4172683954238892} 11/07/2021 09:09:44 - INFO - __main__ - Step 84933: {'lr': 0.00020288363708593956, 'samples': 16307136, 'steps': 84932, 'loss/train': 1.4653334617614746} 11/07/2021 09:09:44 - INFO - __main__ - Step 84934: {'lr': 0.0002028784254500197, 'samples': 16307328, 'steps': 84933, 'loss/train': 1.2884262800216675} 11/07/2021 09:09:45 - INFO - __main__ - Step 84935: {'lr': 0.00020287321383533207, 'samples': 16307520, 'steps': 84934, 'loss/train': 0.5415070652961731} 11/07/2021 09:09:45 - INFO - __main__ - Step 84936: {'lr': 0.00020286800224187914, 'samples': 16307712, 'steps': 84935, 'loss/train': 1.3841822147369385} 11/07/2021 09:09:46 - INFO - __main__ - Step 84937: {'lr': 0.00020286279066966312, 'samples': 16307904, 'steps': 84936, 'loss/train': 0.8898035883903503} 11/07/2021 09:09:46 - INFO - __main__ - Step 84938: {'lr': 0.00020285757911868635, 'samples': 16308096, 'steps': 84937, 'loss/train': 1.552216649055481} 11/07/2021 09:09:47 - INFO - __main__ - Step 84939: {'lr': 0.00020285236758895125, 'samples': 16308288, 'steps': 84938, 'loss/train': 1.2858806848526} 11/07/2021 09:09:47 - INFO - __main__ - Step 84940: {'lr': 0.00020284715608046014, 'samples': 16308480, 'steps': 84939, 'loss/train': 1.2591179609298706} 11/07/2021 09:09:47 - INFO - __main__ - Step 84941: {'lr': 0.00020284194459321538, 'samples': 16308672, 'steps': 84940, 'loss/train': 1.3085936307907104} 11/07/2021 09:09:48 - INFO - __main__ - Step 84942: {'lr': 0.0002028367331272193, 'samples': 16308864, 'steps': 84941, 'loss/train': 1.3287389278411865} 11/07/2021 09:09:49 - INFO - __main__ - Step 84943: {'lr': 0.00020283152168247423, 'samples': 16309056, 'steps': 84942, 'loss/train': 1.1838507652282715} 11/07/2021 09:09:49 - INFO - __main__ - Step 84944: {'lr': 0.0002028263102589826, 'samples': 16309248, 'steps': 84943, 'loss/train': 1.7460927963256836} 11/07/2021 09:09:50 - INFO - __main__ - Step 84945: {'lr': 0.00020282109885674668, 'samples': 16309440, 'steps': 84944, 'loss/train': 1.5487143993377686} 11/07/2021 09:09:50 - INFO - __main__ - Step 84946: {'lr': 0.00020281588747576883, 'samples': 16309632, 'steps': 84945, 'loss/train': 1.8400810956954956} 11/07/2021 09:09:50 - INFO - __main__ - Step 84947: {'lr': 0.00020281067611605146, 'samples': 16309824, 'steps': 84946, 'loss/train': 1.339858889579773} 11/07/2021 09:09:51 - INFO - __main__ - Step 84948: {'lr': 0.00020280546477759686, 'samples': 16310016, 'steps': 84947, 'loss/train': 1.333543300628662} 11/07/2021 09:09:52 - INFO - __main__ - Step 84949: {'lr': 0.00020280025346040744, 'samples': 16310208, 'steps': 84948, 'loss/train': 1.5440118312835693} 11/07/2021 09:09:52 - INFO - __main__ - Step 84950: {'lr': 0.00020279504216448547, 'samples': 16310400, 'steps': 84949, 'loss/train': 1.2762986421585083} 11/07/2021 09:09:52 - INFO - __main__ - Step 84951: {'lr': 0.00020278983088983327, 'samples': 16310592, 'steps': 84950, 'loss/train': 1.6884151697158813} 11/07/2021 09:09:53 - INFO - __main__ - Step 84952: {'lr': 0.00020278461963645325, 'samples': 16310784, 'steps': 84951, 'loss/train': 1.6888259649276733} 11/07/2021 09:09:54 - INFO - __main__ - Step 84953: {'lr': 0.00020277940840434777, 'samples': 16310976, 'steps': 84952, 'loss/train': 1.179181694984436} 11/07/2021 09:09:54 - INFO - __main__ - Step 84954: {'lr': 0.00020277419719351913, 'samples': 16311168, 'steps': 84953, 'loss/train': 1.4824447631835938} 11/07/2021 09:09:54 - INFO - __main__ - Step 84955: {'lr': 0.00020276898600396975, 'samples': 16311360, 'steps': 84954, 'loss/train': 1.1028863191604614} 11/07/2021 09:09:55 - INFO - __main__ - Step 84956: {'lr': 0.0002027637748357019, 'samples': 16311552, 'steps': 84955, 'loss/train': 1.033945918083191} 11/07/2021 09:09:55 - INFO - __main__ - Step 84957: {'lr': 0.000202758563688718, 'samples': 16311744, 'steps': 84956, 'loss/train': 1.1216213703155518} 11/07/2021 09:09:56 - INFO - __main__ - Step 84958: {'lr': 0.00020275335256302035, 'samples': 16311936, 'steps': 84957, 'loss/train': 1.1501972675323486} 11/07/2021 09:09:56 - INFO - __main__ - Step 84959: {'lr': 0.00020274814145861128, 'samples': 16312128, 'steps': 84958, 'loss/train': 1.3731271028518677} 11/07/2021 09:09:57 - INFO - __main__ - Step 84960: {'lr': 0.0002027429303754932, 'samples': 16312320, 'steps': 84959, 'loss/train': 1.79300057888031} 11/07/2021 09:09:57 - INFO - __main__ - Step 84961: {'lr': 0.00020273771931366842, 'samples': 16312512, 'steps': 84960, 'loss/train': 1.6517122983932495} 11/07/2021 09:09:57 - INFO - __main__ - Step 84962: {'lr': 0.0002027325082731394, 'samples': 16312704, 'steps': 84961, 'loss/train': 1.2203890085220337} 11/07/2021 09:09:59 - INFO - __main__ - Step 84963: {'lr': 0.00020272729725390827, 'samples': 16312896, 'steps': 84962, 'loss/train': 0.8552365303039551} 11/07/2021 09:09:59 - INFO - __main__ - Step 84964: {'lr': 0.0002027220862559775, 'samples': 16313088, 'steps': 84963, 'loss/train': 1.7396966218948364} 11/07/2021 09:09:59 - INFO - __main__ - Step 84965: {'lr': 0.00020271687527934944, 'samples': 16313280, 'steps': 84964, 'loss/train': 1.4632701873779297} 11/07/2021 09:10:00 - INFO - __main__ - Step 84966: {'lr': 0.00020271166432402638, 'samples': 16313472, 'steps': 84965, 'loss/train': 1.5884132385253906} 11/07/2021 09:10:00 - INFO - __main__ - Step 84967: {'lr': 0.00020270645339001076, 'samples': 16313664, 'steps': 84966, 'loss/train': 1.434826135635376} 11/07/2021 09:10:01 - INFO - __main__ - Step 84968: {'lr': 0.00020270124247730487, 'samples': 16313856, 'steps': 84967, 'loss/train': 1.794901728630066} 11/07/2021 09:10:01 - INFO - __main__ - Step 84969: {'lr': 0.00020269603158591104, 'samples': 16314048, 'steps': 84968, 'loss/train': 1.3521729707717896} 11/07/2021 09:10:02 - INFO - __main__ - Step 84970: {'lr': 0.0002026908207158317, 'samples': 16314240, 'steps': 84969, 'loss/train': 1.789708137512207} 11/07/2021 09:10:02 - INFO - __main__ - Step 84971: {'lr': 0.0002026856098670691, 'samples': 16314432, 'steps': 84970, 'loss/train': 1.3954877853393555} 11/07/2021 09:10:02 - INFO - __main__ - Step 84972: {'lr': 0.00020268039903962565, 'samples': 16314624, 'steps': 84971, 'loss/train': 1.8102920055389404} 11/07/2021 09:10:03 - INFO - __main__ - Step 84973: {'lr': 0.0002026751882335037, 'samples': 16314816, 'steps': 84972, 'loss/train': 0.7868448495864868} 11/07/2021 09:10:04 - INFO - __main__ - Step 84974: {'lr': 0.00020266997744870557, 'samples': 16315008, 'steps': 84973, 'loss/train': 1.0196444988250732} 11/07/2021 09:10:04 - INFO - __main__ - Step 84975: {'lr': 0.00020266476668523363, 'samples': 16315200, 'steps': 84974, 'loss/train': 1.21646249294281} 11/07/2021 09:10:05 - INFO - __main__ - Step 84976: {'lr': 0.0002026595559430903, 'samples': 16315392, 'steps': 84975, 'loss/train': 1.592276930809021} 11/07/2021 09:10:05 - INFO - __main__ - Step 84977: {'lr': 0.00020265434522227774, 'samples': 16315584, 'steps': 84976, 'loss/train': 1.6081544160842896} 11/07/2021 09:10:05 - INFO - __main__ - Step 84978: {'lr': 0.0002026491345227984, 'samples': 16315776, 'steps': 84977, 'loss/train': 0.5603108406066895} 11/07/2021 09:10:06 - INFO - __main__ - Step 84979: {'lr': 0.00020264392384465463, 'samples': 16315968, 'steps': 84978, 'loss/train': 1.2702586650848389} 11/07/2021 09:10:07 - INFO - __main__ - Step 84980: {'lr': 0.0002026387131878488, 'samples': 16316160, 'steps': 84979, 'loss/train': 1.5101382732391357} 11/07/2021 09:10:07 - INFO - __main__ - Step 84981: {'lr': 0.00020263350255238322, 'samples': 16316352, 'steps': 84980, 'loss/train': 0.8900244832038879} 11/07/2021 09:10:07 - INFO - __main__ - Step 84982: {'lr': 0.00020262829193826024, 'samples': 16316544, 'steps': 84981, 'loss/train': 1.2771533727645874} 11/07/2021 09:10:08 - INFO - __main__ - Step 84983: {'lr': 0.00020262308134548224, 'samples': 16316736, 'steps': 84982, 'loss/train': 1.7331546545028687} 11/07/2021 09:10:08 - INFO - __main__ - Step 84984: {'lr': 0.00020261787077405154, 'samples': 16316928, 'steps': 84983, 'loss/train': 1.5534642934799194} 11/07/2021 09:10:09 - INFO - __main__ - Step 84985: {'lr': 0.00020261266022397048, 'samples': 16317120, 'steps': 84984, 'loss/train': 1.5664089918136597} 11/07/2021 09:10:09 - INFO - __main__ - Step 84986: {'lr': 0.00020260744969524146, 'samples': 16317312, 'steps': 84985, 'loss/train': 1.3050910234451294} 11/07/2021 09:10:10 - INFO - __main__ - Step 84987: {'lr': 0.00020260223918786675, 'samples': 16317504, 'steps': 84986, 'loss/train': 0.8666312098503113} 11/07/2021 09:10:10 - INFO - __main__ - Step 84988: {'lr': 0.00020259702870184876, 'samples': 16317696, 'steps': 84987, 'loss/train': 1.3752334117889404} 11/07/2021 09:10:10 - INFO - __main__ - Step 84989: {'lr': 0.00020259181823718993, 'samples': 16317888, 'steps': 84988, 'loss/train': 1.8053292036056519} 11/07/2021 09:10:12 - INFO - __main__ - Step 84990: {'lr': 0.00020258660779389238, 'samples': 16318080, 'steps': 84989, 'loss/train': 1.0476216077804565} 11/07/2021 09:10:12 - INFO - __main__ - Step 84991: {'lr': 0.00020258139737195857, 'samples': 16318272, 'steps': 84990, 'loss/train': 1.491678237915039} 11/07/2021 09:10:12 - INFO - __main__ - Step 84992: {'lr': 0.00020257618697139086, 'samples': 16318464, 'steps': 84991, 'loss/train': 0.8333829045295715} 11/07/2021 09:10:13 - INFO - __main__ - Step 84993: {'lr': 0.0002025709765921916, 'samples': 16318656, 'steps': 84992, 'loss/train': 1.2179028987884521} 11/07/2021 09:10:13 - INFO - __main__ - Step 84994: {'lr': 0.0002025657662343631, 'samples': 16318848, 'steps': 84993, 'loss/train': 0.8144583106040955} 11/07/2021 09:10:14 - INFO - __main__ - Step 84995: {'lr': 0.00020256055589790771, 'samples': 16319040, 'steps': 84994, 'loss/train': 1.2798035144805908} 11/07/2021 09:10:14 - INFO - __main__ - Step 84996: {'lr': 0.00020255534558282786, 'samples': 16319232, 'steps': 84995, 'loss/train': 1.491282343864441} 11/07/2021 09:10:15 - INFO - __main__ - Step 84997: {'lr': 0.00020255013528912584, 'samples': 16319424, 'steps': 84996, 'loss/train': 1.4642717838287354} 11/07/2021 09:10:15 - INFO - __main__ - Step 84998: {'lr': 0.00020254492501680396, 'samples': 16319616, 'steps': 84997, 'loss/train': 1.4634335041046143} 11/07/2021 09:10:15 - INFO - __main__ - Step 84999: {'lr': 0.0002025397147658646, 'samples': 16319808, 'steps': 84998, 'loss/train': 1.5412068367004395} 11/07/2021 09:10:17 - INFO - __main__ - Step 85000: {'lr': 0.00020253450453631015, 'samples': 16320000, 'steps': 84999, 'loss/train': 1.4150508642196655} 11/07/2021 09:10:17 - INFO - __main__ - Step 85001: {'lr': 0.00020252929432814287, 'samples': 16320192, 'steps': 85000, 'loss/train': 0.5966804027557373} 11/07/2021 09:10:17 - INFO - __main__ - Step 85002: {'lr': 0.0002025240841413652, 'samples': 16320384, 'steps': 85001, 'loss/train': 1.3591147661209106} 11/07/2021 09:10:18 - INFO - __main__ - Step 85003: {'lr': 0.00020251887397597956, 'samples': 16320576, 'steps': 85002, 'loss/train': 1.6358729600906372} 11/07/2021 09:10:18 - INFO - __main__ - Step 85004: {'lr': 0.00020251366383198805, 'samples': 16320768, 'steps': 85003, 'loss/train': 1.6467236280441284} 11/07/2021 09:10:19 - INFO - __main__ - Step 85005: {'lr': 0.00020250845370939314, 'samples': 16320960, 'steps': 85004, 'loss/train': 1.8346889019012451} 11/07/2021 09:10:20 - INFO - __main__ - Step 85006: {'lr': 0.0002025032436081972, 'samples': 16321152, 'steps': 85005, 'loss/train': 1.2800956964492798} 11/07/2021 09:10:20 - INFO - __main__ - Step 85007: {'lr': 0.00020249803352840256, 'samples': 16321344, 'steps': 85006, 'loss/train': 1.4922080039978027} 11/07/2021 09:10:20 - INFO - __main__ - Step 85008: {'lr': 0.0002024928234700116, 'samples': 16321536, 'steps': 85007, 'loss/train': 0.05864648148417473} 11/07/2021 09:10:21 - INFO - __main__ - Step 85009: {'lr': 0.00020248761343302663, 'samples': 16321728, 'steps': 85008, 'loss/train': 1.5032726526260376} 11/07/2021 09:10:21 - INFO - __main__ - Step 85010: {'lr': 0.00020248240341745, 'samples': 16321920, 'steps': 85009, 'loss/train': 1.5227543115615845} 11/07/2021 09:10:22 - INFO - __main__ - Step 85011: {'lr': 0.00020247719342328404, 'samples': 16322112, 'steps': 85010, 'loss/train': 1.304939866065979} 11/07/2021 09:10:22 - INFO - __main__ - Step 85012: {'lr': 0.00020247198345053116, 'samples': 16322304, 'steps': 85011, 'loss/train': 1.3100626468658447} 11/07/2021 09:10:23 - INFO - __main__ - Step 85013: {'lr': 0.00020246677349919367, 'samples': 16322496, 'steps': 85012, 'loss/train': 1.4588960409164429} 11/07/2021 09:10:23 - INFO - __main__ - Step 85014: {'lr': 0.00020246156356927393, 'samples': 16322688, 'steps': 85013, 'loss/train': 0.893616259098053} 11/07/2021 09:10:23 - INFO - __main__ - Step 85015: {'lr': 0.00020245635366077422, 'samples': 16322880, 'steps': 85014, 'loss/train': 1.6485549211502075} 11/07/2021 09:10:24 - INFO - __main__ - Step 85016: {'lr': 0.0002024511437736971, 'samples': 16323072, 'steps': 85015, 'loss/train': 1.743559718132019} 11/07/2021 09:10:25 - INFO - __main__ - Step 85017: {'lr': 0.00020244593390804464, 'samples': 16323264, 'steps': 85016, 'loss/train': 1.3693820238113403} 11/07/2021 09:10:25 - INFO - __main__ - Step 85018: {'lr': 0.0002024407240638193, 'samples': 16323456, 'steps': 85017, 'loss/train': 1.4150934219360352} 11/07/2021 09:10:25 - INFO - __main__ - Step 85019: {'lr': 0.00020243551424102343, 'samples': 16323648, 'steps': 85018, 'loss/train': 1.3197548389434814} 11/07/2021 09:10:26 - INFO - __main__ - Step 85020: {'lr': 0.0002024303044396594, 'samples': 16323840, 'steps': 85019, 'loss/train': 1.1132773160934448} 11/07/2021 09:10:27 - INFO - __main__ - Step 85021: {'lr': 0.00020242509465972955, 'samples': 16324032, 'steps': 85020, 'loss/train': 1.4572937488555908} 11/07/2021 09:10:27 - INFO - __main__ - Step 85022: {'lr': 0.0002024198849012362, 'samples': 16324224, 'steps': 85021, 'loss/train': 1.5492889881134033} 11/07/2021 09:10:28 - INFO - __main__ - Step 85023: {'lr': 0.00020241467516418171, 'samples': 16324416, 'steps': 85022, 'loss/train': 1.764586329460144} 11/07/2021 09:10:28 - INFO - __main__ - Step 85024: {'lr': 0.00020240946544856846, 'samples': 16324608, 'steps': 85023, 'loss/train': 1.4470365047454834} 11/07/2021 09:10:28 - INFO - __main__ - Step 85025: {'lr': 0.00020240425575439875, 'samples': 16324800, 'steps': 85024, 'loss/train': 1.431054711341858} 11/07/2021 09:10:29 - INFO - __main__ - Step 85026: {'lr': 0.00020239904608167496, 'samples': 16324992, 'steps': 85025, 'loss/train': 2.110053777694702} 11/07/2021 09:10:30 - INFO - __main__ - Step 85027: {'lr': 0.0002023938364303994, 'samples': 16325184, 'steps': 85026, 'loss/train': 1.0800195932388306} 11/07/2021 09:10:30 - INFO - __main__ - Step 85028: {'lr': 0.0002023886268005745, 'samples': 16325376, 'steps': 85027, 'loss/train': 1.3308196067810059} 11/07/2021 09:10:31 - INFO - __main__ - Step 85029: {'lr': 0.00020238341719220254, 'samples': 16325568, 'steps': 85028, 'loss/train': 2.5543339252471924} 11/07/2021 09:10:31 - INFO - __main__ - Step 85030: {'lr': 0.00020237820760528587, 'samples': 16325760, 'steps': 85029, 'loss/train': 1.3266407251358032} 11/07/2021 09:10:31 - INFO - __main__ - Step 85031: {'lr': 0.00020237299803982684, 'samples': 16325952, 'steps': 85030, 'loss/train': 1.2268701791763306} 11/07/2021 09:10:32 - INFO - __main__ - Step 85032: {'lr': 0.0002023677884958278, 'samples': 16326144, 'steps': 85031, 'loss/train': 1.3055921792984009} 11/07/2021 09:10:33 - INFO - __main__ - Step 85033: {'lr': 0.0002023625789732911, 'samples': 16326336, 'steps': 85032, 'loss/train': 1.0300415754318237} 11/07/2021 09:10:33 - INFO - __main__ - Step 85034: {'lr': 0.00020235736947221906, 'samples': 16326528, 'steps': 85033, 'loss/train': 0.9582001566886902} 11/07/2021 09:10:33 - INFO - __main__ - Step 85035: {'lr': 0.00020235215999261406, 'samples': 16326720, 'steps': 85034, 'loss/train': 1.2125110626220703} 11/07/2021 09:10:34 - INFO - __main__ - Step 85036: {'lr': 0.00020234695053447844, 'samples': 16326912, 'steps': 85035, 'loss/train': 1.8203319311141968} 11/07/2021 09:10:35 - INFO - __main__ - Step 85037: {'lr': 0.00020234174109781455, 'samples': 16327104, 'steps': 85036, 'loss/train': 0.939401388168335} 11/07/2021 09:10:35 - INFO - __main__ - Step 85038: {'lr': 0.00020233653168262475, 'samples': 16327296, 'steps': 85037, 'loss/train': 1.3735352754592896} 11/07/2021 09:10:35 - INFO - __main__ - Step 85039: {'lr': 0.00020233132228891142, 'samples': 16327488, 'steps': 85038, 'loss/train': 1.048368215560913} 11/07/2021 09:10:36 - INFO - __main__ - Step 85040: {'lr': 0.0002023261129166768, 'samples': 16327680, 'steps': 85039, 'loss/train': 1.4791494607925415} 11/07/2021 09:10:36 - INFO - __main__ - Step 85041: {'lr': 0.0002023209035659233, 'samples': 16327872, 'steps': 85040, 'loss/train': 1.5988376140594482} 11/07/2021 09:10:37 - INFO - __main__ - Step 85042: {'lr': 0.00020231569423665328, 'samples': 16328064, 'steps': 85041, 'loss/train': 1.432446002960205} 11/07/2021 09:10:38 - INFO - __main__ - Step 85043: {'lr': 0.00020231048492886912, 'samples': 16328256, 'steps': 85042, 'loss/train': 1.9524060487747192} 11/07/2021 09:10:38 - INFO - __main__ - Step 85044: {'lr': 0.00020230527564257307, 'samples': 16328448, 'steps': 85043, 'loss/train': 1.4497673511505127} 11/07/2021 09:10:38 - INFO - __main__ - Step 85045: {'lr': 0.0002023000663777675, 'samples': 16328640, 'steps': 85044, 'loss/train': 1.320120930671692} 11/07/2021 09:10:39 - INFO - __main__ - Step 85046: {'lr': 0.00020229485713445477, 'samples': 16328832, 'steps': 85045, 'loss/train': 1.6738567352294922} 11/07/2021 09:10:39 - INFO - __main__ - Step 85047: {'lr': 0.00020228964791263728, 'samples': 16329024, 'steps': 85046, 'loss/train': 1.3373472690582275} 11/07/2021 09:10:40 - INFO - __main__ - Step 85048: {'lr': 0.00020228443871231732, 'samples': 16329216, 'steps': 85047, 'loss/train': 1.659665584564209} 11/07/2021 09:10:41 - INFO - __main__ - Step 85049: {'lr': 0.00020227922953349728, 'samples': 16329408, 'steps': 85048, 'loss/train': 1.9162391424179077} 11/07/2021 09:10:41 - INFO - __main__ - Step 85050: {'lr': 0.00020227402037617954, 'samples': 16329600, 'steps': 85049, 'loss/train': 2.1872949600219727} 11/07/2021 09:10:41 - INFO - __main__ - Step 85051: {'lr': 0.0002022688112403663, 'samples': 16329792, 'steps': 85050, 'loss/train': 1.6270438432693481} 11/07/2021 09:10:42 - INFO - __main__ - Step 85052: {'lr': 0.00020226360212606003, 'samples': 16329984, 'steps': 85051, 'loss/train': 1.2691843509674072} 11/07/2021 09:10:43 - INFO - __main__ - Step 85053: {'lr': 0.00020225839303326305, 'samples': 16330176, 'steps': 85052, 'loss/train': 1.2308417558670044} 11/07/2021 09:10:43 - INFO - __main__ - Step 85054: {'lr': 0.00020225318396197768, 'samples': 16330368, 'steps': 85053, 'loss/train': 1.0230093002319336} 11/07/2021 09:10:43 - INFO - __main__ - Step 85055: {'lr': 0.00020224797491220627, 'samples': 16330560, 'steps': 85054, 'loss/train': 1.641890287399292} 11/07/2021 09:10:44 - INFO - __main__ - Step 85056: {'lr': 0.00020224276588395122, 'samples': 16330752, 'steps': 85055, 'loss/train': 1.4727472066879272} 11/07/2021 09:10:44 - INFO - __main__ - Step 85057: {'lr': 0.00020223755687721488, 'samples': 16330944, 'steps': 85056, 'loss/train': 1.4536988735198975} 11/07/2021 09:10:44 - INFO - __main__ - Step 85058: {'lr': 0.00020223234789199952, 'samples': 16331136, 'steps': 85057, 'loss/train': 1.4529800415039062} 11/07/2021 09:10:46 - INFO - __main__ - Step 85059: {'lr': 0.0002022271389283075, 'samples': 16331328, 'steps': 85058, 'loss/train': 1.8196202516555786} 11/07/2021 09:10:46 - INFO - __main__ - Step 85060: {'lr': 0.0002022219299861412, 'samples': 16331520, 'steps': 85059, 'loss/train': 1.6274510622024536} 11/07/2021 09:10:46 - INFO - __main__ - Step 85061: {'lr': 0.00020221672106550303, 'samples': 16331712, 'steps': 85060, 'loss/train': 1.7607921361923218} 11/07/2021 09:10:47 - INFO - __main__ - Step 85062: {'lr': 0.00020221151216639522, 'samples': 16331904, 'steps': 85061, 'loss/train': 1.4725853204727173} 11/07/2021 09:10:47 - INFO - __main__ - Step 85063: {'lr': 0.00020220630328882013, 'samples': 16332096, 'steps': 85062, 'loss/train': 0.08535615354776382} 11/07/2021 09:10:48 - INFO - __main__ - Step 85064: {'lr': 0.00020220109443278017, 'samples': 16332288, 'steps': 85063, 'loss/train': 1.7021769285202026} 11/07/2021 09:10:48 - INFO - __main__ - Step 85065: {'lr': 0.00020219588559827767, 'samples': 16332480, 'steps': 85064, 'loss/train': 0.6734210848808289} 11/07/2021 09:10:49 - INFO - __main__ - Step 85066: {'lr': 0.00020219067678531495, 'samples': 16332672, 'steps': 85065, 'loss/train': 1.692747950553894} 11/07/2021 09:10:49 - INFO - __main__ - Step 85067: {'lr': 0.00020218546799389436, 'samples': 16332864, 'steps': 85066, 'loss/train': 0.7670405507087708} 11/07/2021 09:10:49 - INFO - __main__ - Step 85068: {'lr': 0.00020218025922401827, 'samples': 16333056, 'steps': 85067, 'loss/train': 1.4434986114501953} 11/07/2021 09:10:51 - INFO - __main__ - Step 85069: {'lr': 0.00020217505047568905, 'samples': 16333248, 'steps': 85068, 'loss/train': 1.040368914604187} 11/07/2021 09:10:51 - INFO - __main__ - Step 85070: {'lr': 0.00020216984174890903, 'samples': 16333440, 'steps': 85069, 'loss/train': 1.7774696350097656} 11/07/2021 09:10:51 - INFO - __main__ - Step 85071: {'lr': 0.0002021646330436805, 'samples': 16333632, 'steps': 85070, 'loss/train': 1.5430150032043457} 11/07/2021 09:10:52 - INFO - __main__ - Step 85072: {'lr': 0.0002021594243600059, 'samples': 16333824, 'steps': 85071, 'loss/train': 1.3461594581604004} 11/07/2021 09:10:52 - INFO - __main__ - Step 85073: {'lr': 0.00020215421569788746, 'samples': 16334016, 'steps': 85072, 'loss/train': 0.9663582444190979} 11/07/2021 09:10:53 - INFO - __main__ - Step 85074: {'lr': 0.00020214900705732761, 'samples': 16334208, 'steps': 85073, 'loss/train': 1.2288402318954468} 11/07/2021 09:10:53 - INFO - __main__ - Step 85075: {'lr': 0.00020214379843832867, 'samples': 16334400, 'steps': 85074, 'loss/train': 1.41571044921875} 11/07/2021 09:10:54 - INFO - __main__ - Step 85076: {'lr': 0.00020213858984089301, 'samples': 16334592, 'steps': 85075, 'loss/train': 1.3214426040649414} 11/07/2021 09:10:54 - INFO - __main__ - Step 85077: {'lr': 0.00020213338126502295, 'samples': 16334784, 'steps': 85076, 'loss/train': 0.8835983872413635} 11/07/2021 09:10:54 - INFO - __main__ - Step 85078: {'lr': 0.00020212817271072085, 'samples': 16334976, 'steps': 85077, 'loss/train': 1.2722522020339966} 11/07/2021 09:10:55 - INFO - __main__ - Step 85079: {'lr': 0.00020212296417798905, 'samples': 16335168, 'steps': 85078, 'loss/train': 1.3683013916015625} 11/07/2021 09:10:56 - INFO - __main__ - Step 85080: {'lr': 0.00020211775566682992, 'samples': 16335360, 'steps': 85079, 'loss/train': 1.6158040761947632} 11/07/2021 09:10:56 - INFO - __main__ - Step 85081: {'lr': 0.0002021125471772458, 'samples': 16335552, 'steps': 85080, 'loss/train': 1.5528855323791504} 11/07/2021 09:10:56 - INFO - __main__ - Step 85082: {'lr': 0.00020210733870923897, 'samples': 16335744, 'steps': 85081, 'loss/train': 1.363981008529663} 11/07/2021 09:10:57 - INFO - __main__ - Step 85083: {'lr': 0.000202102130262812, 'samples': 16335936, 'steps': 85082, 'loss/train': 1.2299413681030273} 11/07/2021 09:10:57 - INFO - __main__ - Step 85084: {'lr': 0.00020209692183796696, 'samples': 16336128, 'steps': 85083, 'loss/train': 1.4182400703430176} 11/07/2021 09:10:58 - INFO - __main__ - Step 85085: {'lr': 0.00020209171343470628, 'samples': 16336320, 'steps': 85084, 'loss/train': 1.1184380054473877} 11/07/2021 09:10:59 - INFO - __main__ - Step 85086: {'lr': 0.00020208650505303233, 'samples': 16336512, 'steps': 85085, 'loss/train': 0.9759618639945984} 11/07/2021 09:10:59 - INFO - __main__ - Step 85087: {'lr': 0.0002020812966929475, 'samples': 16336704, 'steps': 85086, 'loss/train': 1.5374987125396729} 11/07/2021 09:10:59 - INFO - __main__ - Step 85088: {'lr': 0.00020207608835445408, 'samples': 16336896, 'steps': 85087, 'loss/train': 0.6250317096710205} 11/07/2021 09:11:00 - INFO - __main__ - Step 85089: {'lr': 0.0002020708800375544, 'samples': 16337088, 'steps': 85088, 'loss/train': 1.3516154289245605} 11/07/2021 09:11:00 - INFO - __main__ - Step 85090: {'lr': 0.0002020656717422509, 'samples': 16337280, 'steps': 85089, 'loss/train': 1.4701634645462036} 11/07/2021 09:11:01 - INFO - __main__ - Step 85091: {'lr': 0.00020206046346854585, 'samples': 16337472, 'steps': 85090, 'loss/train': 0.5792183876037598} 11/07/2021 09:11:01 - INFO - __main__ - Step 85092: {'lr': 0.00020205525521644157, 'samples': 16337664, 'steps': 85091, 'loss/train': 1.434394359588623} 11/07/2021 09:11:02 - INFO - __main__ - Step 85093: {'lr': 0.0002020500469859405, 'samples': 16337856, 'steps': 85092, 'loss/train': 0.7625614404678345} 11/07/2021 09:11:02 - INFO - __main__ - Step 85094: {'lr': 0.00020204483877704494, 'samples': 16338048, 'steps': 85093, 'loss/train': 1.0209978818893433} 11/07/2021 09:11:02 - INFO - __main__ - Step 85095: {'lr': 0.00020203963058975722, 'samples': 16338240, 'steps': 85094, 'loss/train': 0.6835483908653259} 11/07/2021 09:11:03 - INFO - __main__ - Step 85096: {'lr': 0.0002020344224240797, 'samples': 16338432, 'steps': 85095, 'loss/train': 1.3941502571105957} 11/07/2021 09:11:04 - INFO - __main__ - Step 85097: {'lr': 0.00020202921428001487, 'samples': 16338624, 'steps': 85096, 'loss/train': 1.9720244407653809} 11/07/2021 09:11:04 - INFO - __main__ - Step 85098: {'lr': 0.00020202400615756478, 'samples': 16338816, 'steps': 85097, 'loss/train': 1.4223798513412476} 11/07/2021 09:11:04 - INFO - __main__ - Step 85099: {'lr': 0.00020201879805673196, 'samples': 16339008, 'steps': 85098, 'loss/train': 1.2069096565246582} 11/07/2021 09:11:05 - INFO - __main__ - Step 85100: {'lr': 0.00020201358997751874, 'samples': 16339200, 'steps': 85099, 'loss/train': 1.2405534982681274} 11/07/2021 09:11:06 - INFO - __main__ - Step 85101: {'lr': 0.00020200838191992743, 'samples': 16339392, 'steps': 85100, 'loss/train': 0.6557533144950867} 11/07/2021 09:11:06 - INFO - __main__ - Step 85102: {'lr': 0.00020200317388396042, 'samples': 16339584, 'steps': 85101, 'loss/train': 1.1301881074905396} 11/07/2021 09:11:06 - INFO - __main__ - Step 85103: {'lr': 0.00020199796586962003, 'samples': 16339776, 'steps': 85102, 'loss/train': 1.6147898435592651} 11/07/2021 09:11:07 - INFO - __main__ - Step 85104: {'lr': 0.0002019927578769086, 'samples': 16339968, 'steps': 85103, 'loss/train': 1.2065672874450684} 11/07/2021 09:11:07 - INFO - __main__ - Step 85105: {'lr': 0.00020198754990582852, 'samples': 16340160, 'steps': 85104, 'loss/train': 1.4296470880508423} 11/07/2021 09:11:08 - INFO - __main__ - Step 85106: {'lr': 0.0002019823419563821, 'samples': 16340352, 'steps': 85105, 'loss/train': 1.6052100658416748} 11/07/2021 09:11:09 - INFO - __main__ - Step 85107: {'lr': 0.0002019771340285717, 'samples': 16340544, 'steps': 85106, 'loss/train': 1.1767351627349854} 11/07/2021 09:11:09 - INFO - __main__ - Step 85108: {'lr': 0.00020197192612239964, 'samples': 16340736, 'steps': 85107, 'loss/train': 0.9230204224586487} 11/07/2021 09:11:09 - INFO - __main__ - Step 85109: {'lr': 0.0002019667182378683, 'samples': 16340928, 'steps': 85108, 'loss/train': 1.475486159324646} 11/07/2021 09:11:10 - INFO - __main__ - Step 85110: {'lr': 0.00020196151037498014, 'samples': 16341120, 'steps': 85109, 'loss/train': 1.4508999586105347} 11/07/2021 09:11:11 - INFO - __main__ - Step 85111: {'lr': 0.00020195630253373725, 'samples': 16341312, 'steps': 85110, 'loss/train': 1.446764588356018} 11/07/2021 09:11:11 - INFO - __main__ - Step 85112: {'lr': 0.00020195109471414215, 'samples': 16341504, 'steps': 85111, 'loss/train': 1.3607474565505981} 11/07/2021 09:11:11 - INFO - __main__ - Step 85113: {'lr': 0.00020194588691619712, 'samples': 16341696, 'steps': 85112, 'loss/train': 1.0319136381149292} 11/07/2021 09:11:12 - INFO - __main__ - Step 85114: {'lr': 0.00020194067913990453, 'samples': 16341888, 'steps': 85113, 'loss/train': 1.5683821439743042} 11/07/2021 09:11:12 - INFO - __main__ - Step 85115: {'lr': 0.00020193547138526671, 'samples': 16342080, 'steps': 85114, 'loss/train': 1.3934285640716553} 11/07/2021 09:11:12 - INFO - __main__ - Step 85116: {'lr': 0.00020193026365228605, 'samples': 16342272, 'steps': 85115, 'loss/train': 1.962428092956543} 11/07/2021 09:11:13 - INFO - __main__ - Step 85117: {'lr': 0.00020192505594096485, 'samples': 16342464, 'steps': 85116, 'loss/train': 1.6236481666564941} 11/07/2021 09:11:14 - INFO - __main__ - Step 85118: {'lr': 0.0002019198482513055, 'samples': 16342656, 'steps': 85117, 'loss/train': 1.771886944770813} 11/07/2021 09:11:14 - INFO - __main__ - Step 85119: {'lr': 0.00020191464058331033, 'samples': 16342848, 'steps': 85118, 'loss/train': 1.004297137260437} 11/07/2021 09:11:14 - INFO - __main__ - Step 85120: {'lr': 0.00020190943293698166, 'samples': 16343040, 'steps': 85119, 'loss/train': 1.6829594373703003} 11/07/2021 09:11:15 - INFO - __main__ - Step 85121: {'lr': 0.00020190422531232187, 'samples': 16343232, 'steps': 85120, 'loss/train': 1.0215990543365479} 11/07/2021 09:11:16 - INFO - __main__ - Step 85122: {'lr': 0.00020189901770933328, 'samples': 16343424, 'steps': 85121, 'loss/train': 1.4500279426574707} 11/07/2021 09:11:16 - INFO - __main__ - Step 85123: {'lr': 0.00020189381012801824, 'samples': 16343616, 'steps': 85122, 'loss/train': 1.4436936378479004} 11/07/2021 09:11:17 - INFO - __main__ - Step 85124: {'lr': 0.00020188860256837926, 'samples': 16343808, 'steps': 85123, 'loss/train': 1.42967689037323} 11/07/2021 09:11:17 - INFO - __main__ - Step 85125: {'lr': 0.00020188339503041837, 'samples': 16344000, 'steps': 85124, 'loss/train': 1.5788506269454956} 11/07/2021 09:11:17 - INFO - __main__ - Step 85126: {'lr': 0.00020187818751413812, 'samples': 16344192, 'steps': 85125, 'loss/train': 1.523528814315796} 11/07/2021 09:11:19 - INFO - __main__ - Step 85127: {'lr': 0.0002018729800195408, 'samples': 16344384, 'steps': 85126, 'loss/train': 1.3386536836624146} 11/07/2021 09:11:19 - INFO - __main__ - Step 85128: {'lr': 0.0002018677725466288, 'samples': 16344576, 'steps': 85127, 'loss/train': 1.4139474630355835} 11/07/2021 09:11:20 - INFO - __main__ - Step 85129: {'lr': 0.00020186256509540442, 'samples': 16344768, 'steps': 85128, 'loss/train': 0.9116420745849609} 11/07/2021 09:11:20 - INFO - __main__ - Step 85130: {'lr': 0.00020185735766587, 'samples': 16344960, 'steps': 85129, 'loss/train': 1.2731866836547852} 11/07/2021 09:11:20 - INFO - __main__ - Step 85131: {'lr': 0.00020185215025802795, 'samples': 16345152, 'steps': 85130, 'loss/train': 1.771277666091919} 11/07/2021 09:11:21 - INFO - __main__ - Step 85132: {'lr': 0.00020184694287188055, 'samples': 16345344, 'steps': 85131, 'loss/train': 1.7423428297042847} 11/07/2021 09:11:21 - INFO - __main__ - Step 85133: {'lr': 0.00020184173550743018, 'samples': 16345536, 'steps': 85132, 'loss/train': 1.5217812061309814} 11/07/2021 09:11:22 - INFO - __main__ - Step 85134: {'lr': 0.0002018365281646792, 'samples': 16345728, 'steps': 85133, 'loss/train': 1.8506542444229126} 11/07/2021 09:11:23 - INFO - __main__ - Step 85135: {'lr': 0.00020183132084362993, 'samples': 16345920, 'steps': 85134, 'loss/train': 2.2054102420806885} 11/07/2021 09:11:23 - INFO - __main__ - Step 85136: {'lr': 0.0002018261135442847, 'samples': 16346112, 'steps': 85135, 'loss/train': 1.3942805528640747} 11/07/2021 09:11:23 - INFO - __main__ - Step 85137: {'lr': 0.000201820906266646, 'samples': 16346304, 'steps': 85136, 'loss/train': 1.3644607067108154} 11/07/2021 09:11:24 - INFO - __main__ - Step 85138: {'lr': 0.00020181569901071597, 'samples': 16346496, 'steps': 85137, 'loss/train': 1.2831964492797852} 11/07/2021 09:11:25 - INFO - __main__ - Step 85139: {'lr': 0.00020181049177649701, 'samples': 16346688, 'steps': 85138, 'loss/train': 1.5977602005004883} 11/07/2021 09:11:25 - INFO - __main__ - Step 85140: {'lr': 0.00020180528456399153, 'samples': 16346880, 'steps': 85139, 'loss/train': 1.3156096935272217} 11/07/2021 09:11:26 - INFO - __main__ - Step 85141: {'lr': 0.00020180007737320184, 'samples': 16347072, 'steps': 85140, 'loss/train': 1.311732292175293} 11/07/2021 09:11:26 - INFO - __main__ - Step 85142: {'lr': 0.00020179487020413028, 'samples': 16347264, 'steps': 85141, 'loss/train': 1.1863734722137451} 11/07/2021 09:11:26 - INFO - __main__ - Step 85143: {'lr': 0.0002017896630567792, 'samples': 16347456, 'steps': 85142, 'loss/train': 1.0695101022720337} 11/07/2021 09:11:27 - INFO - __main__ - Step 85144: {'lr': 0.00020178445593115098, 'samples': 16347648, 'steps': 85143, 'loss/train': 1.5181258916854858} 11/07/2021 09:11:28 - INFO - __main__ - Step 85145: {'lr': 0.00020177924882724792, 'samples': 16347840, 'steps': 85144, 'loss/train': 2.8022701740264893} 11/07/2021 09:11:28 - INFO - __main__ - Step 85146: {'lr': 0.00020177404174507237, 'samples': 16348032, 'steps': 85145, 'loss/train': 1.5660637617111206} 11/07/2021 09:11:28 - INFO - __main__ - Step 85147: {'lr': 0.00020176883468462674, 'samples': 16348224, 'steps': 85146, 'loss/train': 0.6501519083976746} 11/07/2021 09:11:29 - INFO - __main__ - Step 85148: {'lr': 0.00020176362764591328, 'samples': 16348416, 'steps': 85147, 'loss/train': 1.2320080995559692} 11/07/2021 09:11:29 - INFO - __main__ - Step 85149: {'lr': 0.0002017584206289344, 'samples': 16348608, 'steps': 85148, 'loss/train': 1.3367235660552979} 11/07/2021 09:11:30 - INFO - __main__ - Step 85150: {'lr': 0.00020175321363369246, 'samples': 16348800, 'steps': 85149, 'loss/train': 1.4648020267486572} 11/07/2021 09:11:30 - INFO - __main__ - Step 85151: {'lr': 0.00020174800666018986, 'samples': 16348992, 'steps': 85150, 'loss/train': 1.1775046586990356} 11/07/2021 09:11:31 - INFO - __main__ - Step 85152: {'lr': 0.00020174279970842874, 'samples': 16349184, 'steps': 85151, 'loss/train': 1.3558717966079712} 11/07/2021 09:11:31 - INFO - __main__ - Step 85153: {'lr': 0.00020173759277841157, 'samples': 16349376, 'steps': 85152, 'loss/train': 1.3559643030166626} 11/07/2021 09:11:31 - INFO - __main__ - Step 85154: {'lr': 0.00020173238587014076, 'samples': 16349568, 'steps': 85153, 'loss/train': 1.3641639947891235} 11/07/2021 09:11:33 - INFO - __main__ - Step 85155: {'lr': 0.00020172717898361852, 'samples': 16349760, 'steps': 85154, 'loss/train': 1.2179774045944214} 11/07/2021 09:11:33 - INFO - __main__ - Step 85156: {'lr': 0.0002017219721188473, 'samples': 16349952, 'steps': 85155, 'loss/train': 0.8224046230316162} 11/07/2021 09:11:33 - INFO - __main__ - Step 85157: {'lr': 0.00020171676527582942, 'samples': 16350144, 'steps': 85156, 'loss/train': 1.1213527917861938} 11/07/2021 09:11:34 - INFO - __main__ - Step 85158: {'lr': 0.0002017115584545672, 'samples': 16350336, 'steps': 85157, 'loss/train': 1.2146646976470947} 11/07/2021 09:11:34 - INFO - __main__ - Step 85159: {'lr': 0.00020170635165506302, 'samples': 16350528, 'steps': 85158, 'loss/train': 1.7260595560073853} 11/07/2021 09:11:35 - INFO - __main__ - Step 85160: {'lr': 0.00020170114487731922, 'samples': 16350720, 'steps': 85159, 'loss/train': 1.7756140232086182} 11/07/2021 09:11:35 - INFO - __main__ - Step 85161: {'lr': 0.0002016959381213381, 'samples': 16350912, 'steps': 85160, 'loss/train': 1.3403443098068237} 11/07/2021 09:11:36 - INFO - __main__ - Step 85162: {'lr': 0.00020169073138712206, 'samples': 16351104, 'steps': 85161, 'loss/train': 1.4067286252975464} 11/07/2021 09:11:36 - INFO - __main__ - Step 85163: {'lr': 0.00020168552467467353, 'samples': 16351296, 'steps': 85162, 'loss/train': 1.2297189235687256} 11/07/2021 09:11:36 - INFO - __main__ - Step 85164: {'lr': 0.00020168031798399472, 'samples': 16351488, 'steps': 85163, 'loss/train': 1.3214257955551147} 11/07/2021 09:11:38 - INFO - __main__ - Step 85165: {'lr': 0.000201675111315088, 'samples': 16351680, 'steps': 85164, 'loss/train': 1.5566257238388062} 11/07/2021 09:11:38 - INFO - __main__ - Step 85166: {'lr': 0.00020166990466795564, 'samples': 16351872, 'steps': 85165, 'loss/train': 1.5641385316848755} 11/07/2021 09:11:38 - INFO - __main__ - Step 85167: {'lr': 0.00020166469804260012, 'samples': 16352064, 'steps': 85166, 'loss/train': 1.4945708513259888} 11/07/2021 09:11:39 - INFO - __main__ - Step 85168: {'lr': 0.00020165949143902375, 'samples': 16352256, 'steps': 85167, 'loss/train': 1.4266421794891357} 11/07/2021 09:11:39 - INFO - __main__ - Step 85169: {'lr': 0.0002016542848572289, 'samples': 16352448, 'steps': 85168, 'loss/train': 0.8867405652999878} 11/07/2021 09:11:39 - INFO - __main__ - Step 85170: {'lr': 0.00020164907829721784, 'samples': 16352640, 'steps': 85169, 'loss/train': 1.4008132219314575} 11/07/2021 09:11:40 - INFO - __main__ - Step 85171: {'lr': 0.00020164387175899295, 'samples': 16352832, 'steps': 85170, 'loss/train': 1.5292372703552246} 11/07/2021 09:11:41 - INFO - __main__ - Step 85172: {'lr': 0.00020163866524255662, 'samples': 16353024, 'steps': 85171, 'loss/train': 1.3204355239868164} 11/07/2021 09:11:41 - INFO - __main__ - Step 85173: {'lr': 0.00020163345874791119, 'samples': 16353216, 'steps': 85172, 'loss/train': 1.3109244108200073} 11/07/2021 09:11:41 - INFO - __main__ - Step 85174: {'lr': 0.00020162825227505894, 'samples': 16353408, 'steps': 85173, 'loss/train': 1.733911395072937} 11/07/2021 09:11:42 - INFO - __main__ - Step 85175: {'lr': 0.00020162304582400226, 'samples': 16353600, 'steps': 85174, 'loss/train': 1.6386809349060059} 11/07/2021 09:11:43 - INFO - __main__ - Step 85176: {'lr': 0.00020161783939474346, 'samples': 16353792, 'steps': 85175, 'loss/train': 1.6906230449676514} 11/07/2021 09:11:43 - INFO - __main__ - Step 85177: {'lr': 0.00020161263298728495, 'samples': 16353984, 'steps': 85176, 'loss/train': 1.467250108718872} 11/07/2021 09:11:43 - INFO - __main__ - Step 85178: {'lr': 0.00020160742660162907, 'samples': 16354176, 'steps': 85177, 'loss/train': 1.5180296897888184} 11/07/2021 09:11:44 - INFO - __main__ - Step 85179: {'lr': 0.00020160222023777807, 'samples': 16354368, 'steps': 85178, 'loss/train': 1.3721303939819336} 11/07/2021 09:11:44 - INFO - __main__ - Step 85180: {'lr': 0.00020159701389573436, 'samples': 16354560, 'steps': 85179, 'loss/train': 1.3008712530136108} 11/07/2021 09:11:45 - INFO - __main__ - Step 85181: {'lr': 0.00020159180757550033, 'samples': 16354752, 'steps': 85180, 'loss/train': 1.5063518285751343} 11/07/2021 09:11:46 - INFO - __main__ - Step 85182: {'lr': 0.00020158660127707825, 'samples': 16354944, 'steps': 85181, 'loss/train': 1.0002537965774536} 11/07/2021 09:11:46 - INFO - __main__ - Step 85183: {'lr': 0.0002015813950004705, 'samples': 16355136, 'steps': 85182, 'loss/train': 3.040637254714966} 11/07/2021 09:11:46 - INFO - __main__ - Step 85184: {'lr': 0.0002015761887456795, 'samples': 16355328, 'steps': 85183, 'loss/train': 1.0715157985687256} 11/07/2021 09:11:47 - INFO - __main__ - Step 85185: {'lr': 0.00020157098251270751, 'samples': 16355520, 'steps': 85184, 'loss/train': 1.9509432315826416} 11/07/2021 09:11:48 - INFO - __main__ - Step 85186: {'lr': 0.00020156577630155682, 'samples': 16355712, 'steps': 85185, 'loss/train': 1.2758609056472778} 11/07/2021 09:11:48 - INFO - __main__ - Step 85187: {'lr': 0.00020156057011222987, 'samples': 16355904, 'steps': 85186, 'loss/train': 1.25115966796875} 11/07/2021 09:11:48 - INFO - __main__ - Step 85188: {'lr': 0.00020155536394472895, 'samples': 16356096, 'steps': 85187, 'loss/train': 1.335242509841919} 11/07/2021 09:11:49 - INFO - __main__ - Step 85189: {'lr': 0.00020155015779905648, 'samples': 16356288, 'steps': 85188, 'loss/train': 1.7357466220855713} 11/07/2021 09:11:49 - INFO - __main__ - Step 85190: {'lr': 0.00020154495167521471, 'samples': 16356480, 'steps': 85189, 'loss/train': 1.316489338874817} 11/07/2021 09:11:50 - INFO - __main__ - Step 85191: {'lr': 0.00020153974557320616, 'samples': 16356672, 'steps': 85190, 'loss/train': 1.3206428289413452} 11/07/2021 09:11:51 - INFO - __main__ - Step 85192: {'lr': 0.00020153453949303294, 'samples': 16356864, 'steps': 85191, 'loss/train': 1.5501822233200073} 11/07/2021 09:11:51 - INFO - __main__ - Step 85193: {'lr': 0.00020152933343469754, 'samples': 16357056, 'steps': 85192, 'loss/train': 1.1928918361663818} 11/07/2021 09:11:51 - INFO - __main__ - Step 85194: {'lr': 0.00020152412739820225, 'samples': 16357248, 'steps': 85193, 'loss/train': 1.3256254196166992} 11/07/2021 09:11:52 - INFO - __main__ - Step 85195: {'lr': 0.0002015189213835495, 'samples': 16357440, 'steps': 85194, 'loss/train': 2.20241117477417} 11/07/2021 09:11:53 - INFO - __main__ - Step 85196: {'lr': 0.00020151371539074153, 'samples': 16357632, 'steps': 85195, 'loss/train': 1.095133900642395} 11/07/2021 09:11:53 - INFO - __main__ - Step 85197: {'lr': 0.00020150850941978076, 'samples': 16357824, 'steps': 85196, 'loss/train': 1.2357888221740723} 11/07/2021 09:11:53 - INFO - __main__ - Step 85198: {'lr': 0.00020150330347066948, 'samples': 16358016, 'steps': 85197, 'loss/train': 1.4445770978927612} 11/07/2021 09:11:54 - INFO - __main__ - Step 85199: {'lr': 0.00020149809754341002, 'samples': 16358208, 'steps': 85198, 'loss/train': 1.4184927940368652} 11/07/2021 09:11:54 - INFO - __main__ - Step 85200: {'lr': 0.00020149289163800483, 'samples': 16358400, 'steps': 85199, 'loss/train': 1.0592238903045654} 11/07/2021 09:11:54 - INFO - __main__ - Step 85201: {'lr': 0.00020148768575445618, 'samples': 16358592, 'steps': 85200, 'loss/train': 1.7484018802642822} 11/07/2021 09:11:55 - INFO - __main__ - Step 85202: {'lr': 0.0002014824798927664, 'samples': 16358784, 'steps': 85201, 'loss/train': 2.151749610900879} 11/07/2021 09:11:56 - INFO - __main__ - Step 85203: {'lr': 0.00020147727405293793, 'samples': 16358976, 'steps': 85202, 'loss/train': 1.402241587638855} 11/07/2021 09:11:56 - INFO - __main__ - Step 85204: {'lr': 0.00020147206823497305, 'samples': 16359168, 'steps': 85203, 'loss/train': 1.362682819366455} 11/07/2021 09:11:56 - INFO - __main__ - Step 85205: {'lr': 0.00020146686243887409, 'samples': 16359360, 'steps': 85204, 'loss/train': 1.4299396276474} 11/07/2021 09:11:57 - INFO - __main__ - Step 85206: {'lr': 0.00020146165666464343, 'samples': 16359552, 'steps': 85205, 'loss/train': 1.3792256116867065} 11/07/2021 09:11:58 - INFO - __main__ - Step 85207: {'lr': 0.00020145645091228337, 'samples': 16359744, 'steps': 85206, 'loss/train': 0.8669281005859375} 11/07/2021 09:11:58 - INFO - __main__ - Step 85208: {'lr': 0.00020145124518179626, 'samples': 16359936, 'steps': 85207, 'loss/train': 1.6644623279571533} 11/07/2021 09:11:59 - INFO - __main__ - Step 85209: {'lr': 0.0002014460394731845, 'samples': 16360128, 'steps': 85208, 'loss/train': 1.6956064701080322} 11/07/2021 09:11:59 - INFO - __main__ - Step 85210: {'lr': 0.00020144083378645036, 'samples': 16360320, 'steps': 85209, 'loss/train': 0.67851722240448} 11/07/2021 09:11:59 - INFO - __main__ - Step 85211: {'lr': 0.00020143562812159626, 'samples': 16360512, 'steps': 85210, 'loss/train': 1.131873607635498} 11/07/2021 09:12:00 - INFO - __main__ - Step 85212: {'lr': 0.00020143042247862454, 'samples': 16360704, 'steps': 85211, 'loss/train': 1.3467848300933838} 11/07/2021 09:12:01 - INFO - __main__ - Step 85213: {'lr': 0.00020142521685753752, 'samples': 16360896, 'steps': 85212, 'loss/train': 1.4934016466140747} 11/07/2021 09:12:01 - INFO - __main__ - Step 85214: {'lr': 0.0002014200112583375, 'samples': 16361088, 'steps': 85213, 'loss/train': 1.0915775299072266} 11/07/2021 09:12:01 - INFO - __main__ - Step 85215: {'lr': 0.0002014148056810269, 'samples': 16361280, 'steps': 85214, 'loss/train': 1.6985198259353638} 11/07/2021 09:12:02 - INFO - __main__ - Step 85216: {'lr': 0.00020140960012560806, 'samples': 16361472, 'steps': 85215, 'loss/train': 1.4113093614578247} 11/07/2021 09:12:03 - INFO - __main__ - Step 85217: {'lr': 0.00020140439459208326, 'samples': 16361664, 'steps': 85216, 'loss/train': 1.3763326406478882} 11/07/2021 09:12:03 - INFO - __main__ - Step 85218: {'lr': 0.00020139918908045504, 'samples': 16361856, 'steps': 85217, 'loss/train': 1.533403754234314} 11/07/2021 09:12:04 - INFO - __main__ - Step 85219: {'lr': 0.00020139398359072548, 'samples': 16362048, 'steps': 85218, 'loss/train': 1.159613847732544} 11/07/2021 09:12:04 - INFO - __main__ - Step 85220: {'lr': 0.00020138877812289703, 'samples': 16362240, 'steps': 85219, 'loss/train': 1.53786301612854} 11/07/2021 09:12:04 - INFO - __main__ - Step 85221: {'lr': 0.00020138357267697203, 'samples': 16362432, 'steps': 85220, 'loss/train': 1.5969481468200684} 11/07/2021 09:12:05 - INFO - __main__ - Step 85222: {'lr': 0.00020137836725295287, 'samples': 16362624, 'steps': 85221, 'loss/train': 0.13583360612392426} 11/07/2021 09:12:06 - INFO - __main__ - Step 85223: {'lr': 0.00020137316185084184, 'samples': 16362816, 'steps': 85222, 'loss/train': 1.3353934288024902} 11/07/2021 09:12:06 - INFO - __main__ - Step 85224: {'lr': 0.00020136795647064133, 'samples': 16363008, 'steps': 85223, 'loss/train': 1.2350305318832397} 11/07/2021 09:12:06 - INFO - __main__ - Step 85225: {'lr': 0.00020136275111235367, 'samples': 16363200, 'steps': 85224, 'loss/train': 1.438586711883545} 11/07/2021 09:12:07 - INFO - __main__ - Step 85226: {'lr': 0.0002013575457759812, 'samples': 16363392, 'steps': 85225, 'loss/train': 1.8322651386260986} 11/07/2021 09:12:07 - INFO - __main__ - Step 85227: {'lr': 0.0002013523404615263, 'samples': 16363584, 'steps': 85226, 'loss/train': 1.6228243112564087} 11/07/2021 09:12:08 - INFO - __main__ - Step 85228: {'lr': 0.00020134713516899123, 'samples': 16363776, 'steps': 85227, 'loss/train': 1.101727843284607} 11/07/2021 09:12:08 - INFO - __main__ - Step 85229: {'lr': 0.00020134192989837841, 'samples': 16363968, 'steps': 85228, 'loss/train': 1.5789374113082886} 11/07/2021 09:12:09 - INFO - __main__ - Step 85230: {'lr': 0.00020133672464969017, 'samples': 16364160, 'steps': 85229, 'loss/train': 1.270982027053833} 11/07/2021 09:12:09 - INFO - __main__ - Step 85231: {'lr': 0.00020133151942292897, 'samples': 16364352, 'steps': 85230, 'loss/train': 1.7709426879882812} 11/07/2021 09:12:09 - INFO - __main__ - Step 85232: {'lr': 0.00020132631421809693, 'samples': 16364544, 'steps': 85231, 'loss/train': 1.4227288961410522} 11/07/2021 09:12:10 - INFO - __main__ - Step 85233: {'lr': 0.00020132110903519645, 'samples': 16364736, 'steps': 85232, 'loss/train': 1.2157962322235107} 11/07/2021 09:12:11 - INFO - __main__ - Step 85234: {'lr': 0.00020131590387423, 'samples': 16364928, 'steps': 85233, 'loss/train': 1.0833996534347534} 11/07/2021 09:12:11 - INFO - __main__ - Step 85235: {'lr': 0.0002013106987351998, 'samples': 16365120, 'steps': 85234, 'loss/train': 1.6059975624084473} 11/07/2021 09:12:12 - INFO - __main__ - Step 85236: {'lr': 0.0002013054936181083, 'samples': 16365312, 'steps': 85235, 'loss/train': 1.9684919118881226} 11/07/2021 09:12:12 - INFO - __main__ - Step 85237: {'lr': 0.00020130028852295774, 'samples': 16365504, 'steps': 85236, 'loss/train': 1.4516334533691406} 11/07/2021 09:12:12 - INFO - __main__ - Step 85238: {'lr': 0.00020129508344975054, 'samples': 16365696, 'steps': 85237, 'loss/train': 1.6487209796905518} 11/07/2021 09:12:13 - INFO - __main__ - Step 85239: {'lr': 0.00020128987839848904, 'samples': 16365888, 'steps': 85238, 'loss/train': 1.2254966497421265} 11/07/2021 09:12:14 - INFO - __main__ - Step 85240: {'lr': 0.00020128467336917556, 'samples': 16366080, 'steps': 85239, 'loss/train': 1.6114569902420044} 11/07/2021 09:12:14 - INFO - __main__ - Step 85241: {'lr': 0.00020127946836181242, 'samples': 16366272, 'steps': 85240, 'loss/train': 1.2318859100341797} 11/07/2021 09:12:14 - INFO - __main__ - Step 85242: {'lr': 0.00020127426337640202, 'samples': 16366464, 'steps': 85241, 'loss/train': 1.3656951189041138} 11/07/2021 09:12:15 - INFO - __main__ - Step 85243: {'lr': 0.00020126905841294672, 'samples': 16366656, 'steps': 85242, 'loss/train': 1.0403950214385986} 11/07/2021 09:12:16 - INFO - __main__ - Step 85244: {'lr': 0.00020126385347144876, 'samples': 16366848, 'steps': 85243, 'loss/train': 0.7885909080505371} 11/07/2021 09:12:16 - INFO - __main__ - Step 85245: {'lr': 0.00020125864855191072, 'samples': 16367040, 'steps': 85244, 'loss/train': 1.577330470085144} 11/07/2021 09:12:16 - INFO - __main__ - Step 85246: {'lr': 0.00020125344365433468, 'samples': 16367232, 'steps': 85245, 'loss/train': 1.3049910068511963} 11/07/2021 09:12:17 - INFO - __main__ - Step 85247: {'lr': 0.00020124823877872307, 'samples': 16367424, 'steps': 85246, 'loss/train': 1.3238911628723145} 11/07/2021 09:12:17 - INFO - __main__ - Step 85248: {'lr': 0.00020124303392507823, 'samples': 16367616, 'steps': 85247, 'loss/train': 1.4072891473770142} 11/07/2021 09:12:18 - INFO - __main__ - Step 85249: {'lr': 0.00020123782909340255, 'samples': 16367808, 'steps': 85248, 'loss/train': 1.8375211954116821} 11/07/2021 09:12:19 - INFO - __main__ - Step 85250: {'lr': 0.0002012326242836983, 'samples': 16368000, 'steps': 85249, 'loss/train': 1.105148434638977} 11/07/2021 09:12:19 - INFO - __main__ - Step 85251: {'lr': 0.00020122741949596797, 'samples': 16368192, 'steps': 85250, 'loss/train': 1.1898186206817627} 11/07/2021 09:12:19 - INFO - __main__ - Step 85252: {'lr': 0.00020122221473021373, 'samples': 16368384, 'steps': 85251, 'loss/train': 1.5115479230880737} 11/07/2021 09:12:20 - INFO - __main__ - Step 85253: {'lr': 0.00020121700998643804, 'samples': 16368576, 'steps': 85252, 'loss/train': 1.142883539199829} 11/07/2021 09:12:20 - INFO - __main__ - Step 85254: {'lr': 0.0002012118052646432, 'samples': 16368768, 'steps': 85253, 'loss/train': 1.6528544425964355} 11/07/2021 09:12:21 - INFO - __main__ - Step 85255: {'lr': 0.00020120660056483161, 'samples': 16368960, 'steps': 85254, 'loss/train': 0.06405775249004364} 11/07/2021 09:12:21 - INFO - __main__ - Step 85256: {'lr': 0.00020120139588700552, 'samples': 16369152, 'steps': 85255, 'loss/train': 1.5390958786010742} 11/07/2021 09:12:22 - INFO - __main__ - Step 85257: {'lr': 0.00020119619123116738, 'samples': 16369344, 'steps': 85256, 'loss/train': 0.6890901327133179} 11/07/2021 09:12:22 - INFO - __main__ - Step 85258: {'lr': 0.00020119098659731954, 'samples': 16369536, 'steps': 85257, 'loss/train': 1.2567241191864014} 11/07/2021 09:12:22 - INFO - __main__ - Step 85259: {'lr': 0.00020118578198546422, 'samples': 16369728, 'steps': 85258, 'loss/train': 1.608031988143921} 11/07/2021 09:12:23 - INFO - __main__ - Step 85260: {'lr': 0.0002011805773956038, 'samples': 16369920, 'steps': 85259, 'loss/train': 1.5282936096191406} 11/07/2021 09:12:24 - INFO - __main__ - Step 85261: {'lr': 0.0002011753728277407, 'samples': 16370112, 'steps': 85260, 'loss/train': 1.3793061971664429} 11/07/2021 09:12:24 - INFO - __main__ - Step 85262: {'lr': 0.0002011701682818772, 'samples': 16370304, 'steps': 85261, 'loss/train': 0.9596961736679077} 11/07/2021 09:12:24 - INFO - __main__ - Step 85263: {'lr': 0.00020116496375801565, 'samples': 16370496, 'steps': 85262, 'loss/train': 1.4459528923034668} 11/07/2021 09:12:25 - INFO - __main__ - Step 85264: {'lr': 0.00020115975925615842, 'samples': 16370688, 'steps': 85263, 'loss/train': 0.7622015476226807} 11/07/2021 09:12:26 - INFO - __main__ - Step 85265: {'lr': 0.0002011545547763079, 'samples': 16370880, 'steps': 85264, 'loss/train': 1.4672341346740723} 11/07/2021 09:12:26 - INFO - __main__ - Step 85266: {'lr': 0.00020114935031846631, 'samples': 16371072, 'steps': 85265, 'loss/train': 1.3775259256362915} 11/07/2021 09:12:27 - INFO - __main__ - Step 85267: {'lr': 0.00020114414588263613, 'samples': 16371264, 'steps': 85266, 'loss/train': 0.13808004558086395} 11/07/2021 09:12:27 - INFO - __main__ - Step 85268: {'lr': 0.0002011389414688196, 'samples': 16371456, 'steps': 85267, 'loss/train': 1.662279486656189} 11/07/2021 09:12:27 - INFO - __main__ - Step 85269: {'lr': 0.00020113373707701912, 'samples': 16371648, 'steps': 85268, 'loss/train': 1.0381438732147217} 11/07/2021 09:12:28 - INFO - __main__ - Step 85270: {'lr': 0.00020112853270723704, 'samples': 16371840, 'steps': 85269, 'loss/train': 1.579314112663269} 11/07/2021 09:12:29 - INFO - __main__ - Step 85271: {'lr': 0.00020112332835947567, 'samples': 16372032, 'steps': 85270, 'loss/train': 1.7570992708206177} 11/07/2021 09:12:29 - INFO - __main__ - Step 85272: {'lr': 0.00020111812403373749, 'samples': 16372224, 'steps': 85271, 'loss/train': 1.0815436840057373} 11/07/2021 09:12:29 - INFO - __main__ - Step 85273: {'lr': 0.00020111291973002462, 'samples': 16372416, 'steps': 85272, 'loss/train': 1.6582027673721313} 11/07/2021 09:12:30 - INFO - __main__ - Step 85274: {'lr': 0.00020110771544833953, 'samples': 16372608, 'steps': 85273, 'loss/train': 1.526292085647583} 11/07/2021 09:12:31 - INFO - __main__ - Step 85275: {'lr': 0.00020110251118868452, 'samples': 16372800, 'steps': 85274, 'loss/train': 1.2517712116241455} 11/07/2021 09:12:31 - INFO - __main__ - Step 85276: {'lr': 0.000201097306951062, 'samples': 16372992, 'steps': 85275, 'loss/train': 1.4236891269683838} 11/07/2021 09:12:31 - INFO - __main__ - Step 85277: {'lr': 0.00020109210273547423, 'samples': 16373184, 'steps': 85276, 'loss/train': 1.554836630821228} 11/07/2021 09:12:32 - INFO - __main__ - Step 85278: {'lr': 0.00020108689854192362, 'samples': 16373376, 'steps': 85277, 'loss/train': 1.8005553483963013} 11/07/2021 09:12:32 - INFO - __main__ - Step 85279: {'lr': 0.00020108169437041255, 'samples': 16373568, 'steps': 85278, 'loss/train': 1.4863834381103516} 11/07/2021 09:12:33 - INFO - __main__ - Step 85280: {'lr': 0.00020107649022094328, 'samples': 16373760, 'steps': 85279, 'loss/train': 0.8289968967437744} 11/07/2021 09:12:34 - INFO - __main__ - Step 85281: {'lr': 0.00020107128609351817, 'samples': 16373952, 'steps': 85280, 'loss/train': 1.8136839866638184} 11/07/2021 09:12:34 - INFO - __main__ - Step 85282: {'lr': 0.00020106608198813957, 'samples': 16374144, 'steps': 85281, 'loss/train': 1.0977472066879272} 11/07/2021 09:12:34 - INFO - __main__ - Step 85283: {'lr': 0.00020106087790480986, 'samples': 16374336, 'steps': 85282, 'loss/train': 1.2684670686721802} 11/07/2021 09:12:35 - INFO - __main__ - Step 85284: {'lr': 0.0002010556738435314, 'samples': 16374528, 'steps': 85283, 'loss/train': 1.7378838062286377} 11/07/2021 09:12:35 - INFO - __main__ - Step 85285: {'lr': 0.00020105046980430658, 'samples': 16374720, 'steps': 85284, 'loss/train': 1.1919143199920654} 11/07/2021 09:12:36 - INFO - __main__ - Step 85286: {'lr': 0.00020104526578713754, 'samples': 16374912, 'steps': 85285, 'loss/train': 0.854414701461792} 11/07/2021 09:12:37 - INFO - __main__ - Step 85287: {'lr': 0.00020104006179202675, 'samples': 16375104, 'steps': 85286, 'loss/train': 1.530516266822815} 11/07/2021 09:12:37 - INFO - __main__ - Step 85288: {'lr': 0.00020103485781897658, 'samples': 16375296, 'steps': 85287, 'loss/train': 0.10058294236660004} 11/07/2021 09:12:38 - INFO - __main__ - Step 85289: {'lr': 0.0002010296538679893, 'samples': 16375488, 'steps': 85288, 'loss/train': 1.7971934080123901} 11/07/2021 09:12:38 - INFO - __main__ - Step 85290: {'lr': 0.00020102444993906732, 'samples': 16375680, 'steps': 85289, 'loss/train': 1.612419843673706} 11/07/2021 09:12:39 - INFO - __main__ - Step 85291: {'lr': 0.000201019246032213, 'samples': 16375872, 'steps': 85290, 'loss/train': 1.2894028425216675} 11/07/2021 09:12:39 - INFO - __main__ - Step 85292: {'lr': 0.00020101404214742862, 'samples': 16376064, 'steps': 85291, 'loss/train': 1.0307344198226929} 11/07/2021 09:12:40 - INFO - __main__ - Step 85293: {'lr': 0.00020100883828471654, 'samples': 16376256, 'steps': 85292, 'loss/train': 1.4399226903915405} 11/07/2021 09:12:40 - INFO - __main__ - Step 85294: {'lr': 0.00020100363444407914, 'samples': 16376448, 'steps': 85293, 'loss/train': 1.4631413221359253} 11/07/2021 09:12:40 - INFO - __main__ - Step 85295: {'lr': 0.00020099843062551878, 'samples': 16376640, 'steps': 85294, 'loss/train': 1.3300256729125977} 11/07/2021 09:12:41 - INFO - __main__ - Step 85296: {'lr': 0.00020099322682903776, 'samples': 16376832, 'steps': 85295, 'loss/train': 1.6801224946975708} 11/07/2021 09:12:42 - INFO - __main__ - Step 85297: {'lr': 0.00020098802305463845, 'samples': 16377024, 'steps': 85296, 'loss/train': 1.706781029701233} 11/07/2021 09:12:42 - INFO - __main__ - Step 85298: {'lr': 0.00020098281930232314, 'samples': 16377216, 'steps': 85297, 'loss/train': 1.5198545455932617} 11/07/2021 09:12:42 - INFO - __main__ - Step 85299: {'lr': 0.0002009776155720943, 'samples': 16377408, 'steps': 85298, 'loss/train': 1.578860878944397} 11/07/2021 09:12:43 - INFO - __main__ - Step 85300: {'lr': 0.0002009724118639541, 'samples': 16377600, 'steps': 85299, 'loss/train': 1.7147647142410278} 11/07/2021 09:12:44 - INFO - __main__ - Step 85301: {'lr': 0.00020096720817790498, 'samples': 16377792, 'steps': 85300, 'loss/train': 1.6887056827545166} 11/07/2021 09:12:44 - INFO - __main__ - Step 85302: {'lr': 0.0002009620045139493, 'samples': 16377984, 'steps': 85301, 'loss/train': 1.3037502765655518} 11/07/2021 09:12:44 - INFO - __main__ - Step 85303: {'lr': 0.00020095680087208937, 'samples': 16378176, 'steps': 85302, 'loss/train': 1.2326487302780151} 11/07/2021 09:12:45 - INFO - __main__ - Step 85304: {'lr': 0.00020095159725232756, 'samples': 16378368, 'steps': 85303, 'loss/train': 0.8840793967247009} 11/07/2021 09:12:45 - INFO - __main__ - Step 85305: {'lr': 0.00020094639365466618, 'samples': 16378560, 'steps': 85304, 'loss/train': 0.979921817779541} 11/07/2021 09:12:46 - INFO - __main__ - Step 85306: {'lr': 0.00020094119007910765, 'samples': 16378752, 'steps': 85305, 'loss/train': 1.4065121412277222} 11/07/2021 09:12:46 - INFO - __main__ - Step 85307: {'lr': 0.0002009359865256542, 'samples': 16378944, 'steps': 85306, 'loss/train': 1.3682119846343994} 11/07/2021 09:12:47 - INFO - __main__ - Step 85308: {'lr': 0.00020093078299430835, 'samples': 16379136, 'steps': 85307, 'loss/train': 1.499861717224121} 11/07/2021 09:12:47 - INFO - __main__ - Step 85309: {'lr': 0.00020092557948507222, 'samples': 16379328, 'steps': 85308, 'loss/train': 1.4230324029922485} 11/07/2021 09:12:47 - INFO - __main__ - Step 85310: {'lr': 0.0002009203759979483, 'samples': 16379520, 'steps': 85309, 'loss/train': 1.4529950618743896} 11/07/2021 09:12:48 - INFO - __main__ - Step 85311: {'lr': 0.0002009151725329389, 'samples': 16379712, 'steps': 85310, 'loss/train': 1.5027971267700195} 11/07/2021 09:12:49 - INFO - __main__ - Step 85312: {'lr': 0.0002009099690900464, 'samples': 16379904, 'steps': 85311, 'loss/train': 1.2727563381195068} 11/07/2021 09:12:49 - INFO - __main__ - Step 85313: {'lr': 0.00020090476566927306, 'samples': 16380096, 'steps': 85312, 'loss/train': 2.0552456378936768} 11/07/2021 09:12:49 - INFO - __main__ - Step 85314: {'lr': 0.00020089956227062127, 'samples': 16380288, 'steps': 85313, 'loss/train': 1.7045730352401733} 11/07/2021 09:12:50 - INFO - __main__ - Step 85315: {'lr': 0.00020089435889409342, 'samples': 16380480, 'steps': 85314, 'loss/train': 1.1083537340164185} 11/07/2021 09:12:51 - INFO - __main__ - Step 85316: {'lr': 0.00020088915553969177, 'samples': 16380672, 'steps': 85315, 'loss/train': 0.6092178821563721} 11/07/2021 09:12:51 - INFO - __main__ - Step 85317: {'lr': 0.00020088395220741874, 'samples': 16380864, 'steps': 85316, 'loss/train': 1.5821342468261719} 11/07/2021 09:12:52 - INFO - __main__ - Step 85318: {'lr': 0.00020087874889727661, 'samples': 16381056, 'steps': 85317, 'loss/train': 1.556376576423645} 11/07/2021 09:12:52 - INFO - __main__ - Step 85319: {'lr': 0.00020087354560926785, 'samples': 16381248, 'steps': 85318, 'loss/train': 1.3769854307174683} 11/07/2021 09:12:52 - INFO - __main__ - Step 85320: {'lr': 0.00020086834234339461, 'samples': 16381440, 'steps': 85319, 'loss/train': 1.0245099067687988} 11/07/2021 09:12:53 - INFO - __main__ - Step 85321: {'lr': 0.00020086313909965938, 'samples': 16381632, 'steps': 85320, 'loss/train': 1.103308916091919} 11/07/2021 09:12:54 - INFO - __main__ - Step 85322: {'lr': 0.00020085793587806445, 'samples': 16381824, 'steps': 85321, 'loss/train': 1.875295639038086} 11/07/2021 09:12:54 - INFO - __main__ - Step 85323: {'lr': 0.00020085273267861218, 'samples': 16382016, 'steps': 85322, 'loss/train': 1.5365113019943237} 11/07/2021 09:12:54 - INFO - __main__ - Step 85324: {'lr': 0.00020084752950130493, 'samples': 16382208, 'steps': 85323, 'loss/train': 1.5112521648406982} 11/07/2021 09:12:55 - INFO - __main__ - Step 85325: {'lr': 0.00020084232634614503, 'samples': 16382400, 'steps': 85324, 'loss/train': 1.4058948755264282} 11/07/2021 09:12:55 - INFO - __main__ - Step 85326: {'lr': 0.0002008371232131348, 'samples': 16382592, 'steps': 85325, 'loss/train': 0.9837362766265869} 11/07/2021 09:12:57 - INFO - __main__ - Step 85327: {'lr': 0.00020083192010227657, 'samples': 16382784, 'steps': 85326, 'loss/train': 1.6791976690292358} 11/07/2021 09:12:57 - INFO - __main__ - Step 85328: {'lr': 0.00020082671701357273, 'samples': 16382976, 'steps': 85327, 'loss/train': 1.5366719961166382} 11/07/2021 09:12:57 - INFO - __main__ - Step 85329: {'lr': 0.00020082151394702562, 'samples': 16383168, 'steps': 85328, 'loss/train': 0.9323683381080627} 11/07/2021 09:12:58 - INFO - __main__ - Step 85330: {'lr': 0.00020081631090263766, 'samples': 16383360, 'steps': 85329, 'loss/train': 1.106172800064087} 11/07/2021 09:12:58 - INFO - __main__ - Step 85331: {'lr': 0.00020081110788041102, 'samples': 16383552, 'steps': 85330, 'loss/train': 1.2004059553146362} 11/07/2021 09:12:59 - INFO - __main__ - Step 85332: {'lr': 0.00020080590488034817, 'samples': 16383744, 'steps': 85331, 'loss/train': 1.9746848344802856} 11/07/2021 09:12:59 - INFO - __main__ - Step 85333: {'lr': 0.00020080070190245136, 'samples': 16383936, 'steps': 85332, 'loss/train': 1.1913650035858154} 11/07/2021 09:13:00 - INFO - __main__ - Step 85334: {'lr': 0.00020079549894672305, 'samples': 16384128, 'steps': 85333, 'loss/train': 1.4526628255844116} 11/07/2021 09:13:00 - INFO - __main__ - Step 85335: {'lr': 0.0002007902960131655, 'samples': 16384320, 'steps': 85334, 'loss/train': 1.86506986618042} 11/07/2021 09:13:01 - INFO - __main__ - Step 85336: {'lr': 0.00020078509310178112, 'samples': 16384512, 'steps': 85335, 'loss/train': 1.4485912322998047} 11/07/2021 09:13:02 - INFO - __main__ - Step 85337: {'lr': 0.00020077989021257217, 'samples': 16384704, 'steps': 85336, 'loss/train': 1.4295402765274048} 11/07/2021 09:13:02 - INFO - __main__ - Step 85338: {'lr': 0.00020077468734554105, 'samples': 16384896, 'steps': 85337, 'loss/train': 1.5417548418045044} 11/07/2021 09:13:02 - INFO - __main__ - Step 85339: {'lr': 0.0002007694845006902, 'samples': 16385088, 'steps': 85338, 'loss/train': 1.7326496839523315} 11/07/2021 09:13:03 - INFO - __main__ - Step 85340: {'lr': 0.00020076428167802179, 'samples': 16385280, 'steps': 85339, 'loss/train': 1.4871681928634644} 11/07/2021 09:13:03 - INFO - __main__ - Step 85341: {'lr': 0.00020075907887753822, 'samples': 16385472, 'steps': 85340, 'loss/train': 0.11720789223909378} 11/07/2021 09:13:04 - INFO - __main__ - Step 85342: {'lr': 0.00020075387609924184, 'samples': 16385664, 'steps': 85341, 'loss/train': 1.1223514080047607} 11/07/2021 09:13:04 - INFO - __main__ - Step 85343: {'lr': 0.00020074867334313502, 'samples': 16385856, 'steps': 85342, 'loss/train': 1.423475980758667} 11/07/2021 09:13:05 - INFO - __main__ - Step 85344: {'lr': 0.00020074347060922008, 'samples': 16386048, 'steps': 85343, 'loss/train': 1.3127903938293457} 11/07/2021 09:13:05 - INFO - __main__ - Step 85345: {'lr': 0.00020073826789749935, 'samples': 16386240, 'steps': 85344, 'loss/train': 0.3958052396774292} 11/07/2021 09:13:05 - INFO - __main__ - Step 85346: {'lr': 0.00020073306520797525, 'samples': 16386432, 'steps': 85345, 'loss/train': 1.4124265909194946} 11/07/2021 09:13:06 - INFO - __main__ - Step 85347: {'lr': 0.00020072786254065, 'samples': 16386624, 'steps': 85346, 'loss/train': 1.2442448139190674} 11/07/2021 09:13:07 - INFO - __main__ - Step 85348: {'lr': 0.00020072265989552607, 'samples': 16386816, 'steps': 85347, 'loss/train': 1.1599229574203491} 11/07/2021 09:13:07 - INFO - __main__ - Step 85349: {'lr': 0.0002007174572726057, 'samples': 16387008, 'steps': 85348, 'loss/train': 1.4656155109405518} 11/07/2021 09:13:07 - INFO - __main__ - Step 85350: {'lr': 0.00020071225467189132, 'samples': 16387200, 'steps': 85349, 'loss/train': 1.1850402355194092} 11/07/2021 09:13:08 - INFO - __main__ - Step 85351: {'lr': 0.00020070705209338524, 'samples': 16387392, 'steps': 85350, 'loss/train': 1.2455185651779175} 11/07/2021 09:13:09 - INFO - __main__ - Step 85352: {'lr': 0.0002007018495370899, 'samples': 16387584, 'steps': 85351, 'loss/train': 1.3229541778564453} 11/07/2021 09:13:09 - INFO - __main__ - Step 85353: {'lr': 0.00020069664700300745, 'samples': 16387776, 'steps': 85352, 'loss/train': 0.7297662496566772} 11/07/2021 09:13:09 - INFO - __main__ - Step 85354: {'lr': 0.00020069144449114029, 'samples': 16387968, 'steps': 85353, 'loss/train': 0.6305733323097229} 11/07/2021 09:13:10 - INFO - __main__ - Step 85355: {'lr': 0.00020068624200149084, 'samples': 16388160, 'steps': 85354, 'loss/train': 1.4224815368652344} 11/07/2021 09:13:10 - INFO - __main__ - Step 85356: {'lr': 0.00020068103953406138, 'samples': 16388352, 'steps': 85355, 'loss/train': 1.2017847299575806} 11/07/2021 09:13:10 - INFO - __main__ - Step 85357: {'lr': 0.0002006758370888543, 'samples': 16388544, 'steps': 85356, 'loss/train': 1.329725980758667} 11/07/2021 09:13:12 - INFO - __main__ - Step 85358: {'lr': 0.00020067063466587193, 'samples': 16388736, 'steps': 85357, 'loss/train': 1.1892955303192139} 11/07/2021 09:13:12 - INFO - __main__ - Step 85359: {'lr': 0.00020066543226511662, 'samples': 16388928, 'steps': 85358, 'loss/train': 1.422475814819336} 11/07/2021 09:13:12 - INFO - __main__ - Step 85360: {'lr': 0.0002006602298865907, 'samples': 16389120, 'steps': 85359, 'loss/train': 1.5031503438949585} 11/07/2021 09:13:13 - INFO - __main__ - Step 85361: {'lr': 0.0002006550275302965, 'samples': 16389312, 'steps': 85360, 'loss/train': 1.5397472381591797} 11/07/2021 09:13:13 - INFO - __main__ - Step 85362: {'lr': 0.0002006498251962364, 'samples': 16389504, 'steps': 85361, 'loss/train': 1.303407073020935} 11/07/2021 09:13:14 - INFO - __main__ - Step 85363: {'lr': 0.00020064462288441274, 'samples': 16389696, 'steps': 85362, 'loss/train': 1.289291501045227} 11/07/2021 09:13:14 - INFO - __main__ - Step 85364: {'lr': 0.0002006394205948278, 'samples': 16389888, 'steps': 85363, 'loss/train': 1.5093244314193726} 11/07/2021 09:13:15 - INFO - __main__ - Step 85365: {'lr': 0.000200634218327484, 'samples': 16390080, 'steps': 85364, 'loss/train': 1.310652494430542} 11/07/2021 09:13:15 - INFO - __main__ - Step 85366: {'lr': 0.00020062901608238382, 'samples': 16390272, 'steps': 85365, 'loss/train': 1.4000201225280762} 11/07/2021 09:13:15 - INFO - __main__ - Step 85367: {'lr': 0.00020062381385952928, 'samples': 16390464, 'steps': 85366, 'loss/train': 1.263419270515442} 11/07/2021 09:13:16 - INFO - __main__ - Step 85368: {'lr': 0.00020061861165892293, 'samples': 16390656, 'steps': 85367, 'loss/train': 1.4583972692489624} 11/07/2021 09:13:17 - INFO - __main__ - Step 85369: {'lr': 0.00020061340948056703, 'samples': 16390848, 'steps': 85368, 'loss/train': 1.5092811584472656} 11/07/2021 09:13:17 - INFO - __main__ - Step 85370: {'lr': 0.00020060820732446398, 'samples': 16391040, 'steps': 85369, 'loss/train': 1.301408052444458} 11/07/2021 09:13:18 - INFO - __main__ - Step 85371: {'lr': 0.00020060300519061607, 'samples': 16391232, 'steps': 85370, 'loss/train': 1.798159122467041} 11/07/2021 09:13:18 - INFO - __main__ - Step 85372: {'lr': 0.00020059780307902576, 'samples': 16391424, 'steps': 85371, 'loss/train': 1.4425668716430664} 11/07/2021 09:13:19 - INFO - __main__ - Step 85373: {'lr': 0.00020059260098969525, 'samples': 16391616, 'steps': 85372, 'loss/train': 1.0865155458450317} 11/07/2021 09:13:19 - INFO - __main__ - Step 85374: {'lr': 0.000200587398922627, 'samples': 16391808, 'steps': 85373, 'loss/train': 0.9417954087257385} 11/07/2021 09:13:20 - INFO - __main__ - Step 85375: {'lr': 0.00020058219687782327, 'samples': 16392000, 'steps': 85374, 'loss/train': 1.2032334804534912} 11/07/2021 09:13:20 - INFO - __main__ - Step 85376: {'lr': 0.00020057699485528647, 'samples': 16392192, 'steps': 85375, 'loss/train': 1.4913922548294067} 11/07/2021 09:13:20 - INFO - __main__ - Step 85377: {'lr': 0.0002005717928550189, 'samples': 16392384, 'steps': 85376, 'loss/train': 1.705809235572815} 11/07/2021 09:13:22 - INFO - __main__ - Step 85378: {'lr': 0.00020056659087702293, 'samples': 16392576, 'steps': 85377, 'loss/train': 1.2582820653915405} 11/07/2021 09:13:22 - INFO - __main__ - Step 85379: {'lr': 0.00020056138892130096, 'samples': 16392768, 'steps': 85378, 'loss/train': 1.503081202507019} 11/07/2021 09:13:23 - INFO - __main__ - Step 85380: {'lr': 0.0002005561869878552, 'samples': 16392960, 'steps': 85379, 'loss/train': 1.2268093824386597} 11/07/2021 09:13:23 - INFO - __main__ - Step 85381: {'lr': 0.00020055098507668805, 'samples': 16393152, 'steps': 85380, 'loss/train': 0.06605967879295349} 11/07/2021 09:13:23 - INFO - __main__ - Step 85382: {'lr': 0.00020054578318780183, 'samples': 16393344, 'steps': 85381, 'loss/train': 0.12670768797397614} 11/07/2021 09:13:24 - INFO - __main__ - Step 85383: {'lr': 0.00020054058132119894, 'samples': 16393536, 'steps': 85382, 'loss/train': 0.5236920714378357} 11/07/2021 09:13:24 - INFO - __main__ - Step 85384: {'lr': 0.00020053537947688172, 'samples': 16393728, 'steps': 85383, 'loss/train': 1.3879444599151611} 11/07/2021 09:13:25 - INFO - __main__ - Step 85385: {'lr': 0.00020053017765485248, 'samples': 16393920, 'steps': 85384, 'loss/train': 1.0492222309112549} 11/07/2021 09:13:25 - INFO - __main__ - Step 85386: {'lr': 0.00020052497585511356, 'samples': 16394112, 'steps': 85385, 'loss/train': 1.35684072971344} 11/07/2021 09:13:26 - INFO - __main__ - Step 85387: {'lr': 0.00020051977407766736, 'samples': 16394304, 'steps': 85386, 'loss/train': 1.3993034362792969} 11/07/2021 09:13:26 - INFO - __main__ - Step 85388: {'lr': 0.00020051457232251615, 'samples': 16394496, 'steps': 85387, 'loss/train': 1.7674179077148438} 11/07/2021 09:13:26 - INFO - __main__ - Step 85389: {'lr': 0.0002005093705896623, 'samples': 16394688, 'steps': 85388, 'loss/train': 1.5504066944122314} 11/07/2021 09:13:27 - INFO - __main__ - Step 85390: {'lr': 0.0002005041688791082, 'samples': 16394880, 'steps': 85389, 'loss/train': 1.6227587461471558} 11/07/2021 09:13:28 - INFO - __main__ - Step 85391: {'lr': 0.00020049896719085618, 'samples': 16395072, 'steps': 85390, 'loss/train': 1.1866692304611206} 11/07/2021 09:13:28 - INFO - __main__ - Step 85392: {'lr': 0.0002004937655249085, 'samples': 16395264, 'steps': 85391, 'loss/train': 1.1661593914031982} 11/07/2021 09:13:29 - INFO - __main__ - Step 85393: {'lr': 0.0002004885638812677, 'samples': 16395456, 'steps': 85392, 'loss/train': 1.0344791412353516} 11/07/2021 09:13:29 - INFO - __main__ - Step 85394: {'lr': 0.00020048336225993591, 'samples': 16395648, 'steps': 85393, 'loss/train': 0.802379846572876} 11/07/2021 09:13:31 - INFO - __main__ - Step 85395: {'lr': 0.0002004781606609155, 'samples': 16395840, 'steps': 85394, 'loss/train': 1.7074331045150757} 11/07/2021 09:13:31 - INFO - __main__ - Step 85396: {'lr': 0.0002004729590842089, 'samples': 16396032, 'steps': 85395, 'loss/train': 1.394778847694397} 11/07/2021 09:13:31 - INFO - __main__ - Step 85397: {'lr': 0.0002004677575298184, 'samples': 16396224, 'steps': 85396, 'loss/train': 1.4670602083206177} 11/07/2021 09:13:32 - INFO - __main__ - Step 85398: {'lr': 0.00020046255599774637, 'samples': 16396416, 'steps': 85397, 'loss/train': 1.589949369430542} 11/07/2021 09:13:32 - INFO - __main__ - Step 85399: {'lr': 0.0002004573544879952, 'samples': 16396608, 'steps': 85398, 'loss/train': 1.221474051475525} 11/07/2021 09:13:33 - INFO - __main__ - Step 85400: {'lr': 0.00020045215300056713, 'samples': 16396800, 'steps': 85399, 'loss/train': 1.545744776725769} 11/07/2021 09:13:33 - INFO - __main__ - Step 85401: {'lr': 0.00020044695153546456, 'samples': 16396992, 'steps': 85400, 'loss/train': 1.6028685569763184} 11/07/2021 09:13:34 - INFO - __main__ - Step 85402: {'lr': 0.0002004417500926898, 'samples': 16397184, 'steps': 85401, 'loss/train': 1.605759620666504} 11/07/2021 09:13:34 - INFO - __main__ - Step 85403: {'lr': 0.00020043654867224527, 'samples': 16397376, 'steps': 85402, 'loss/train': 1.435644268989563} 11/07/2021 09:13:35 - INFO - __main__ - Step 85404: {'lr': 0.00020043134727413327, 'samples': 16397568, 'steps': 85403, 'loss/train': 1.377200961112976} 11/07/2021 09:13:35 - INFO - __main__ - Step 85405: {'lr': 0.00020042614589835608, 'samples': 16397760, 'steps': 85404, 'loss/train': 1.4202440977096558} 11/07/2021 09:13:35 - INFO - __main__ - Step 85406: {'lr': 0.00020042094454491628, 'samples': 16397952, 'steps': 85405, 'loss/train': 1.4386301040649414} 11/07/2021 09:13:36 - INFO - __main__ - Step 85407: {'lr': 0.0002004157432138159, 'samples': 16398144, 'steps': 85406, 'loss/train': 1.7091948986053467} 11/07/2021 09:13:37 - INFO - __main__ - Step 85408: {'lr': 0.0002004105419050574, 'samples': 16398336, 'steps': 85407, 'loss/train': 1.4470136165618896} 11/07/2021 09:13:37 - INFO - __main__ - Step 85409: {'lr': 0.00020040534061864317, 'samples': 16398528, 'steps': 85408, 'loss/train': 1.4139931201934814} 11/07/2021 09:13:37 - INFO - __main__ - Step 85410: {'lr': 0.0002004001393545755, 'samples': 16398720, 'steps': 85409, 'loss/train': 1.592076063156128} 11/07/2021 09:13:38 - INFO - __main__ - Step 85411: {'lr': 0.0002003949381128568, 'samples': 16398912, 'steps': 85410, 'loss/train': 0.7963640689849854} 11/07/2021 09:13:39 - INFO - __main__ - Step 85412: {'lr': 0.00020038973689348938, 'samples': 16399104, 'steps': 85411, 'loss/train': 1.3033742904663086} 11/07/2021 09:13:39 - INFO - __main__ - Step 85413: {'lr': 0.00020038453569647555, 'samples': 16399296, 'steps': 85412, 'loss/train': 1.4338371753692627} 11/07/2021 09:13:39 - INFO - __main__ - Step 85414: {'lr': 0.0002003793345218177, 'samples': 16399488, 'steps': 85413, 'loss/train': 0.7670401334762573} 11/07/2021 09:13:40 - INFO - __main__ - Step 85415: {'lr': 0.00020037413336951816, 'samples': 16399680, 'steps': 85414, 'loss/train': 0.6147133708000183} 11/07/2021 09:13:40 - INFO - __main__ - Step 85416: {'lr': 0.00020036893223957924, 'samples': 16399872, 'steps': 85415, 'loss/train': 1.7199381589889526} 11/07/2021 09:13:41 - INFO - __main__ - Step 85417: {'lr': 0.00020036373113200333, 'samples': 16400064, 'steps': 85416, 'loss/train': 0.9926593899726868} 11/07/2021 09:13:42 - INFO - __main__ - Step 85418: {'lr': 0.0002003585300467928, 'samples': 16400256, 'steps': 85417, 'loss/train': 1.8824503421783447} 11/07/2021 09:13:42 - INFO - __main__ - Step 85419: {'lr': 0.00020035332898394988, 'samples': 16400448, 'steps': 85418, 'loss/train': 1.7525230646133423} 11/07/2021 09:13:42 - INFO - __main__ - Step 85420: {'lr': 0.00020034812794347712, 'samples': 16400640, 'steps': 85419, 'loss/train': 1.0585737228393555} 11/07/2021 09:13:43 - INFO - __main__ - Step 85421: {'lr': 0.00020034292692537662, 'samples': 16400832, 'steps': 85420, 'loss/train': 1.5124847888946533} 11/07/2021 09:13:43 - INFO - __main__ - Step 85422: {'lr': 0.00020033772592965084, 'samples': 16401024, 'steps': 85421, 'loss/train': 1.3761723041534424} 11/07/2021 09:13:44 - INFO - __main__ - Step 85423: {'lr': 0.00020033252495630212, 'samples': 16401216, 'steps': 85422, 'loss/train': 1.1237765550613403} 11/07/2021 09:13:44 - INFO - __main__ - Step 85424: {'lr': 0.00020032732400533277, 'samples': 16401408, 'steps': 85423, 'loss/train': 0.8235288262367249} 11/07/2021 09:13:45 - INFO - __main__ - Step 85425: {'lr': 0.00020032212307674515, 'samples': 16401600, 'steps': 85424, 'loss/train': 1.236611247062683} 11/07/2021 09:13:45 - INFO - __main__ - Step 85426: {'lr': 0.00020031692217054164, 'samples': 16401792, 'steps': 85425, 'loss/train': 1.2532490491867065} 11/07/2021 09:13:45 - INFO - __main__ - Step 85427: {'lr': 0.0002003117212867246, 'samples': 16401984, 'steps': 85426, 'loss/train': 1.4109753370285034} 11/07/2021 09:13:47 - INFO - __main__ - Step 85428: {'lr': 0.00020030652042529626, 'samples': 16402176, 'steps': 85427, 'loss/train': 1.2025463581085205} 11/07/2021 09:13:47 - INFO - __main__ - Step 85429: {'lr': 0.00020030131958625907, 'samples': 16402368, 'steps': 85428, 'loss/train': 1.620401382446289} 11/07/2021 09:13:48 - INFO - __main__ - Step 85430: {'lr': 0.00020029611876961535, 'samples': 16402560, 'steps': 85429, 'loss/train': 1.2815834283828735} 11/07/2021 09:13:48 - INFO - __main__ - Step 85431: {'lr': 0.00020029091797536748, 'samples': 16402752, 'steps': 85430, 'loss/train': 0.6334896683692932} 11/07/2021 09:13:48 - INFO - __main__ - Step 85432: {'lr': 0.00020028571720351768, 'samples': 16402944, 'steps': 85431, 'loss/train': 1.1941263675689697} 11/07/2021 09:13:49 - INFO - __main__ - Step 85433: {'lr': 0.00020028051645406842, 'samples': 16403136, 'steps': 85432, 'loss/train': 2.170022487640381} 11/07/2021 09:13:50 - INFO - __main__ - Step 85434: {'lr': 0.00020027531572702195, 'samples': 16403328, 'steps': 85433, 'loss/train': 2.6104536056518555} 11/07/2021 09:13:50 - INFO - __main__ - Step 85435: {'lr': 0.00020027011502238065, 'samples': 16403520, 'steps': 85434, 'loss/train': 0.7024753093719482} 11/07/2021 09:13:51 - INFO - __main__ - Step 85436: {'lr': 0.00020026491434014688, 'samples': 16403712, 'steps': 85435, 'loss/train': 0.9743160605430603} 11/07/2021 09:13:51 - INFO - __main__ - Step 85437: {'lr': 0.00020025971368032298, 'samples': 16403904, 'steps': 85436, 'loss/train': 1.3895124197006226} 11/07/2021 09:13:51 - INFO - __main__ - Step 85438: {'lr': 0.00020025451304291127, 'samples': 16404096, 'steps': 85437, 'loss/train': 1.547611951828003} 11/07/2021 09:13:52 - INFO - __main__ - Step 85439: {'lr': 0.0002002493124279141, 'samples': 16404288, 'steps': 85438, 'loss/train': 1.2857537269592285} 11/07/2021 09:13:53 - INFO - __main__ - Step 85440: {'lr': 0.00020024411183533383, 'samples': 16404480, 'steps': 85439, 'loss/train': 1.2055933475494385} 11/07/2021 09:13:53 - INFO - __main__ - Step 85441: {'lr': 0.0002002389112651728, 'samples': 16404672, 'steps': 85440, 'loss/train': 1.1166973114013672} 11/07/2021 09:13:54 - INFO - __main__ - Step 85442: {'lr': 0.0002002337107174334, 'samples': 16404864, 'steps': 85441, 'loss/train': 0.9610461592674255} 11/07/2021 09:13:54 - INFO - __main__ - Step 85443: {'lr': 0.00020022851019211788, 'samples': 16405056, 'steps': 85442, 'loss/train': 1.5714811086654663} 11/07/2021 09:13:55 - INFO - __main__ - Step 85444: {'lr': 0.0002002233096892286, 'samples': 16405248, 'steps': 85443, 'loss/train': 1.0349037647247314} 11/07/2021 09:13:55 - INFO - __main__ - Step 85445: {'lr': 0.00020021810920876795, 'samples': 16405440, 'steps': 85444, 'loss/train': 1.522605299949646} 11/07/2021 09:13:56 - INFO - __main__ - Step 85446: {'lr': 0.0002002129087507383, 'samples': 16405632, 'steps': 85445, 'loss/train': 0.4685090482234955} 11/07/2021 09:13:56 - INFO - __main__ - Step 85447: {'lr': 0.0002002077083151419, 'samples': 16405824, 'steps': 85446, 'loss/train': 1.262109637260437} 11/07/2021 09:13:56 - INFO - __main__ - Step 85448: {'lr': 0.00020020250790198113, 'samples': 16406016, 'steps': 85447, 'loss/train': 0.9761527180671692} 11/07/2021 09:13:57 - INFO - __main__ - Step 85449: {'lr': 0.00020019730751125834, 'samples': 16406208, 'steps': 85448, 'loss/train': 1.2932493686676025} 11/07/2021 09:13:58 - INFO - __main__ - Step 85450: {'lr': 0.00020019210714297586, 'samples': 16406400, 'steps': 85449, 'loss/train': 1.5868475437164307} 11/07/2021 09:13:58 - INFO - __main__ - Step 85451: {'lr': 0.0002001869067971361, 'samples': 16406592, 'steps': 85450, 'loss/train': 0.9220715761184692} 11/07/2021 09:13:59 - INFO - __main__ - Step 85452: {'lr': 0.00020018170647374128, 'samples': 16406784, 'steps': 85451, 'loss/train': 0.8943288922309875} 11/07/2021 09:13:59 - INFO - __main__ - Step 85453: {'lr': 0.00020017650617279394, 'samples': 16406976, 'steps': 85452, 'loss/train': 1.5625709295272827} 11/07/2021 09:13:59 - INFO - __main__ - Step 85454: {'lr': 0.00020017130589429619, 'samples': 16407168, 'steps': 85453, 'loss/train': 1.4180353879928589} 11/07/2021 09:14:00 - INFO - __main__ - Step 85455: {'lr': 0.0002001661056382505, 'samples': 16407360, 'steps': 85454, 'loss/train': 1.1975277662277222} 11/07/2021 09:14:01 - INFO - __main__ - Step 85456: {'lr': 0.00020016090540465919, 'samples': 16407552, 'steps': 85455, 'loss/train': 1.2629992961883545} 11/07/2021 09:14:01 - INFO - __main__ - Step 85457: {'lr': 0.0002001557051935246, 'samples': 16407744, 'steps': 85456, 'loss/train': 1.577849268913269} 11/07/2021 09:14:01 - INFO - __main__ - Step 85458: {'lr': 0.0002001505050048491, 'samples': 16407936, 'steps': 85457, 'loss/train': 1.5173733234405518} 11/07/2021 09:14:02 - INFO - __main__ - Step 85459: {'lr': 0.00020014530483863498, 'samples': 16408128, 'steps': 85458, 'loss/train': 1.441873550415039} 11/07/2021 09:14:03 - INFO - __main__ - Step 85460: {'lr': 0.0002001401046948847, 'samples': 16408320, 'steps': 85459, 'loss/train': 1.4327740669250488} 11/07/2021 09:14:03 - INFO - __main__ - Step 85461: {'lr': 0.00020013490457360046, 'samples': 16408512, 'steps': 85460, 'loss/train': 1.2926656007766724} 11/07/2021 09:14:03 - INFO - __main__ - Step 85462: {'lr': 0.00020012970447478464, 'samples': 16408704, 'steps': 85461, 'loss/train': 1.3865966796875} 11/07/2021 09:14:04 - INFO - __main__ - Step 85463: {'lr': 0.00020012450439843967, 'samples': 16408896, 'steps': 85462, 'loss/train': 1.3184205293655396} 11/07/2021 09:14:04 - INFO - __main__ - Step 85464: {'lr': 0.00020011930434456782, 'samples': 16409088, 'steps': 85463, 'loss/train': 1.372926950454712} 11/07/2021 09:14:05 - INFO - __main__ - Step 85465: {'lr': 0.0002001141043131714, 'samples': 16409280, 'steps': 85464, 'loss/train': 0.7932441830635071} 11/07/2021 09:14:05 - INFO - __main__ - Step 85466: {'lr': 0.0002001089043042528, 'samples': 16409472, 'steps': 85465, 'loss/train': 0.8562166690826416} 11/07/2021 09:14:06 - INFO - __main__ - Step 85467: {'lr': 0.00020010370431781436, 'samples': 16409664, 'steps': 85466, 'loss/train': 0.8952695727348328} 11/07/2021 09:14:06 - INFO - __main__ - Step 85468: {'lr': 0.0002000985043538584, 'samples': 16409856, 'steps': 85467, 'loss/train': 1.7409932613372803} 11/07/2021 09:14:07 - INFO - __main__ - Step 85469: {'lr': 0.00020009330441238732, 'samples': 16410048, 'steps': 85468, 'loss/train': 1.5622947216033936} 11/07/2021 09:14:07 - INFO - __main__ - Step 85470: {'lr': 0.00020008810449340342, 'samples': 16410240, 'steps': 85469, 'loss/train': 1.6520912647247314} 11/07/2021 09:14:08 - INFO - __main__ - Step 85471: {'lr': 0.00020008290459690904, 'samples': 16410432, 'steps': 85470, 'loss/train': 1.4662343263626099} 11/07/2021 09:14:08 - INFO - __main__ - Step 85472: {'lr': 0.00020007770472290652, 'samples': 16410624, 'steps': 85471, 'loss/train': 1.445117712020874} 11/07/2021 09:14:09 - INFO - __main__ - Step 85473: {'lr': 0.00020007250487139827, 'samples': 16410816, 'steps': 85472, 'loss/train': 1.2244964838027954} 11/07/2021 09:14:09 - INFO - __main__ - Step 85474: {'lr': 0.00020006730504238654, 'samples': 16411008, 'steps': 85473, 'loss/train': 1.4821735620498657} 11/07/2021 09:14:09 - INFO - __main__ - Step 85475: {'lr': 0.00020006210523587376, 'samples': 16411200, 'steps': 85474, 'loss/train': 1.6446021795272827} 11/07/2021 09:14:10 - INFO - __main__ - Step 85476: {'lr': 0.0002000569054518622, 'samples': 16411392, 'steps': 85475, 'loss/train': 1.5160263776779175} 11/07/2021 09:14:11 - INFO - __main__ - Step 85477: {'lr': 0.0002000517056903542, 'samples': 16411584, 'steps': 85476, 'loss/train': 1.5449339151382446} 11/07/2021 09:14:11 - INFO - __main__ - Step 85478: {'lr': 0.00020004650595135213, 'samples': 16411776, 'steps': 85477, 'loss/train': 1.4054713249206543} 11/07/2021 09:14:11 - INFO - __main__ - Step 85479: {'lr': 0.00020004130623485833, 'samples': 16411968, 'steps': 85478, 'loss/train': 0.7979374527931213} 11/07/2021 09:14:12 - INFO - __main__ - Step 85480: {'lr': 0.00020003610654087514, 'samples': 16412160, 'steps': 85479, 'loss/train': 1.2445895671844482} 11/07/2021 09:14:13 - INFO - __main__ - Step 85481: {'lr': 0.00020003090686940495, 'samples': 16412352, 'steps': 85480, 'loss/train': 1.0389646291732788} 11/07/2021 09:14:13 - INFO - __main__ - Step 85482: {'lr': 0.00020002570722045003, 'samples': 16412544, 'steps': 85481, 'loss/train': 0.9564878940582275} 11/07/2021 09:14:13 - INFO - __main__ - Step 85483: {'lr': 0.00020002050759401275, 'samples': 16412736, 'steps': 85482, 'loss/train': 1.5151432752609253} 11/07/2021 09:14:14 - INFO - __main__ - Step 85484: {'lr': 0.00020001530799009545, 'samples': 16412928, 'steps': 85483, 'loss/train': 1.5113493204116821} 11/07/2021 09:14:14 - INFO - __main__ - Step 85485: {'lr': 0.0002000101084087005, 'samples': 16413120, 'steps': 85484, 'loss/train': 1.4366978406906128} 11/07/2021 09:14:15 - INFO - __main__ - Step 85486: {'lr': 0.00020000490884983024, 'samples': 16413312, 'steps': 85485, 'loss/train': 1.3492456674575806} 11/07/2021 09:14:15 - INFO - __main__ - Step 85487: {'lr': 0.0001999997093134871, 'samples': 16413504, 'steps': 85486, 'loss/train': 1.3535197973251343} 11/07/2021 09:14:16 - INFO - __main__ - Step 85488: {'lr': 0.00019999450979967318, 'samples': 16413696, 'steps': 85487, 'loss/train': 1.5966888666152954} 11/07/2021 09:14:16 - INFO - __main__ - Step 85489: {'lr': 0.000199989310308391, 'samples': 16413888, 'steps': 85488, 'loss/train': 1.482977271080017} 11/07/2021 09:14:16 - INFO - __main__ - Step 85490: {'lr': 0.00019998411083964283, 'samples': 16414080, 'steps': 85489, 'loss/train': 1.406937837600708} 11/07/2021 09:14:18 - INFO - __main__ - Step 85491: {'lr': 0.00019997891139343106, 'samples': 16414272, 'steps': 85490, 'loss/train': 1.0869964361190796} 11/07/2021 09:14:18 - INFO - __main__ - Step 85492: {'lr': 0.00019997371196975802, 'samples': 16414464, 'steps': 85491, 'loss/train': 0.9389039874076843} 11/07/2021 09:14:18 - INFO - __main__ - Step 85493: {'lr': 0.00019996851256862605, 'samples': 16414656, 'steps': 85492, 'loss/train': 1.3201382160186768} 11/07/2021 09:14:19 - INFO - __main__ - Step 85494: {'lr': 0.0001999633131900375, 'samples': 16414848, 'steps': 85493, 'loss/train': 1.1321793794631958} 11/07/2021 09:14:19 - INFO - __main__ - Step 85495: {'lr': 0.00019995811383399472, 'samples': 16415040, 'steps': 85494, 'loss/train': 1.4591236114501953} 11/07/2021 09:14:20 - INFO - __main__ - Step 85496: {'lr': 0.00019995291450050005, 'samples': 16415232, 'steps': 85495, 'loss/train': 1.4662171602249146} 11/07/2021 09:14:20 - INFO - __main__ - Step 85497: {'lr': 0.0001999477151895558, 'samples': 16415424, 'steps': 85496, 'loss/train': 1.1982475519180298} 11/07/2021 09:14:21 - INFO - __main__ - Step 85498: {'lr': 0.00019994251590116436, 'samples': 16415616, 'steps': 85497, 'loss/train': 1.4207981824874878} 11/07/2021 09:14:21 - INFO - __main__ - Step 85499: {'lr': 0.00019993731663532803, 'samples': 16415808, 'steps': 85498, 'loss/train': 1.538076400756836} 11/07/2021 09:14:21 - INFO - __main__ - Step 85500: {'lr': 0.00019993211739204928, 'samples': 16416000, 'steps': 85499, 'loss/train': 1.1481393575668335} 11/07/2021 09:14:22 - INFO - __main__ - Step 85501: {'lr': 0.00019992691817133024, 'samples': 16416192, 'steps': 85500, 'loss/train': 1.3960872888565063} 11/07/2021 09:14:23 - INFO - __main__ - Step 85502: {'lr': 0.00019992171897317338, 'samples': 16416384, 'steps': 85501, 'loss/train': 1.4094548225402832} 11/07/2021 09:14:23 - INFO - __main__ - Step 85503: {'lr': 0.000199916519797581, 'samples': 16416576, 'steps': 85502, 'loss/train': 1.448350429534912} 11/07/2021 09:14:24 - INFO - __main__ - Step 85504: {'lr': 0.00019991132064455547, 'samples': 16416768, 'steps': 85503, 'loss/train': 1.3978817462921143} 11/07/2021 09:14:24 - INFO - __main__ - Step 85505: {'lr': 0.0001999061215140991, 'samples': 16416960, 'steps': 85504, 'loss/train': 1.1753442287445068} 11/07/2021 09:14:24 - INFO - __main__ - Step 85506: {'lr': 0.0001999009224062143, 'samples': 16417152, 'steps': 85505, 'loss/train': 1.0981796979904175} 11/07/2021 09:14:25 - INFO - __main__ - Step 85507: {'lr': 0.00019989572332090335, 'samples': 16417344, 'steps': 85506, 'loss/train': 1.0883393287658691} 11/07/2021 09:14:26 - INFO - __main__ - Step 85508: {'lr': 0.00019989052425816863, 'samples': 16417536, 'steps': 85507, 'loss/train': 1.3423309326171875} 11/07/2021 09:14:26 - INFO - __main__ - Step 85509: {'lr': 0.00019988532521801242, 'samples': 16417728, 'steps': 85508, 'loss/train': 1.2242170572280884} 11/07/2021 09:14:26 - INFO - __main__ - Step 85510: {'lr': 0.00019988012620043716, 'samples': 16417920, 'steps': 85509, 'loss/train': 0.9971972107887268} 11/07/2021 09:14:27 - INFO - __main__ - Step 85511: {'lr': 0.0001998749272054451, 'samples': 16418112, 'steps': 85510, 'loss/train': 1.3094245195388794} 11/07/2021 09:14:28 - INFO - __main__ - Step 85512: {'lr': 0.00019986972823303868, 'samples': 16418304, 'steps': 85511, 'loss/train': 1.618477463722229} 11/07/2021 09:14:28 - INFO - __main__ - Step 85513: {'lr': 0.00019986452928322013, 'samples': 16418496, 'steps': 85512, 'loss/train': 1.7448890209197998} 11/07/2021 09:14:29 - INFO - __main__ - Step 85514: {'lr': 0.000199859330355992, 'samples': 16418688, 'steps': 85513, 'loss/train': 1.3733536005020142} 11/07/2021 09:14:29 - INFO - __main__ - Step 85515: {'lr': 0.00019985413145135633, 'samples': 16418880, 'steps': 85514, 'loss/train': 1.1115961074829102} 11/07/2021 09:14:29 - INFO - __main__ - Step 85516: {'lr': 0.00019984893256931566, 'samples': 16419072, 'steps': 85515, 'loss/train': 1.2956534624099731} 11/07/2021 09:14:30 - INFO - __main__ - Step 85517: {'lr': 0.00019984373370987227, 'samples': 16419264, 'steps': 85516, 'loss/train': 1.5095378160476685} 11/07/2021 09:14:31 - INFO - __main__ - Step 85518: {'lr': 0.0001998385348730285, 'samples': 16419456, 'steps': 85517, 'loss/train': 1.301062822341919} 11/07/2021 09:14:31 - INFO - __main__ - Step 85519: {'lr': 0.00019983333605878674, 'samples': 16419648, 'steps': 85518, 'loss/train': 1.3690041303634644} 11/07/2021 09:14:31 - INFO - __main__ - Step 85520: {'lr': 0.0001998281372671493, 'samples': 16419840, 'steps': 85519, 'loss/train': 1.5414462089538574} 11/07/2021 09:14:32 - INFO - __main__ - Step 85521: {'lr': 0.0001998229384981185, 'samples': 16420032, 'steps': 85520, 'loss/train': 1.6917970180511475} 11/07/2021 09:14:32 - INFO - __main__ - Step 85522: {'lr': 0.00019981773975169675, 'samples': 16420224, 'steps': 85521, 'loss/train': 0.7123165130615234} 11/07/2021 09:14:33 - INFO - __main__ - Step 85523: {'lr': 0.00019981254102788631, 'samples': 16420416, 'steps': 85522, 'loss/train': 1.5572773218154907} 11/07/2021 09:14:34 - INFO - __main__ - Step 85524: {'lr': 0.00019980734232668963, 'samples': 16420608, 'steps': 85523, 'loss/train': 0.19940286874771118} 11/07/2021 09:14:34 - INFO - __main__ - Step 85525: {'lr': 0.0001998021436481089, 'samples': 16420800, 'steps': 85524, 'loss/train': 4.6328349113464355} 11/07/2021 09:14:34 - INFO - __main__ - Step 85526: {'lr': 0.00019979694499214662, 'samples': 16420992, 'steps': 85525, 'loss/train': 1.2855585813522339} 11/07/2021 09:14:35 - INFO - __main__ - Step 85527: {'lr': 0.00019979174635880516, 'samples': 16421184, 'steps': 85526, 'loss/train': 1.6452957391738892} 11/07/2021 09:14:35 - INFO - __main__ - Step 85528: {'lr': 0.00019978654774808664, 'samples': 16421376, 'steps': 85527, 'loss/train': 0.6931071877479553} 11/07/2021 09:14:36 - INFO - __main__ - Step 85529: {'lr': 0.0001997813491599935, 'samples': 16421568, 'steps': 85528, 'loss/train': 1.4416629076004028} 11/07/2021 09:14:36 - INFO - __main__ - Step 85530: {'lr': 0.00019977615059452815, 'samples': 16421760, 'steps': 85529, 'loss/train': 1.0740185976028442} 11/07/2021 09:14:37 - INFO - __main__ - Step 85531: {'lr': 0.00019977095205169287, 'samples': 16421952, 'steps': 85530, 'loss/train': 1.4795397520065308} 11/07/2021 09:14:37 - INFO - __main__ - Step 85532: {'lr': 0.00019976575353149005, 'samples': 16422144, 'steps': 85531, 'loss/train': 1.4501968622207642} 11/07/2021 09:14:37 - INFO - __main__ - Step 85533: {'lr': 0.00019976055503392195, 'samples': 16422336, 'steps': 85532, 'loss/train': 1.7270983457565308} 11/07/2021 09:14:39 - INFO - __main__ - Step 85534: {'lr': 0.00019975535655899102, 'samples': 16422528, 'steps': 85533, 'loss/train': 1.5829222202301025} 11/07/2021 09:14:39 - INFO - __main__ - Step 85535: {'lr': 0.00019975015810669956, 'samples': 16422720, 'steps': 85534, 'loss/train': 1.6541848182678223} 11/07/2021 09:14:39 - INFO - __main__ - Step 85536: {'lr': 0.00019974495967704987, 'samples': 16422912, 'steps': 85535, 'loss/train': 1.6659646034240723} 11/07/2021 09:14:40 - INFO - __main__ - Step 85537: {'lr': 0.00019973976127004434, 'samples': 16423104, 'steps': 85536, 'loss/train': 1.3212594985961914} 11/07/2021 09:14:40 - INFO - __main__ - Step 85538: {'lr': 0.0001997345628856853, 'samples': 16423296, 'steps': 85537, 'loss/train': 1.6574780941009521} 11/07/2021 09:14:41 - INFO - __main__ - Step 85539: {'lr': 0.00019972936452397505, 'samples': 16423488, 'steps': 85538, 'loss/train': 1.4666448831558228} 11/07/2021 09:14:41 - INFO - __main__ - Step 85540: {'lr': 0.000199724166184916, 'samples': 16423680, 'steps': 85539, 'loss/train': 1.5038970708847046} 11/07/2021 09:14:42 - INFO - __main__ - Step 85541: {'lr': 0.00019971896786851059, 'samples': 16423872, 'steps': 85540, 'loss/train': 7.486137390136719} 11/07/2021 09:14:42 - INFO - __main__ - Step 85542: {'lr': 0.00019971376957476095, 'samples': 16424064, 'steps': 85541, 'loss/train': 1.3904309272766113} 11/07/2021 09:14:42 - INFO - __main__ - Step 85543: {'lr': 0.00019970857130366949, 'samples': 16424256, 'steps': 85542, 'loss/train': 1.4256339073181152} 11/07/2021 09:14:43 - INFO - __main__ - Step 85544: {'lr': 0.00019970337305523852, 'samples': 16424448, 'steps': 85543, 'loss/train': 1.564329981803894} 11/07/2021 09:14:44 - INFO - __main__ - Step 85545: {'lr': 0.0001996981748294705, 'samples': 16424640, 'steps': 85544, 'loss/train': 1.660009503364563} 11/07/2021 09:14:44 - INFO - __main__ - Step 85546: {'lr': 0.00019969297662636768, 'samples': 16424832, 'steps': 85545, 'loss/train': 3.635897397994995} 11/07/2021 09:14:45 - INFO - __main__ - Step 85547: {'lr': 0.0001996877784459324, 'samples': 16425024, 'steps': 85546, 'loss/train': 1.3869644403457642} 11/07/2021 09:14:45 - INFO - __main__ - Step 85548: {'lr': 0.00019968258028816706, 'samples': 16425216, 'steps': 85547, 'loss/train': 1.8860355615615845} 11/07/2021 09:14:45 - INFO - __main__ - Step 85549: {'lr': 0.000199677382153074, 'samples': 16425408, 'steps': 85548, 'loss/train': 1.374732255935669} 11/07/2021 09:14:46 - INFO - __main__ - Step 85550: {'lr': 0.0001996721840406555, 'samples': 16425600, 'steps': 85549, 'loss/train': 0.8181941509246826} 11/07/2021 09:14:47 - INFO - __main__ - Step 85551: {'lr': 0.00019966698595091397, 'samples': 16425792, 'steps': 85550, 'loss/train': 1.4490994215011597} 11/07/2021 09:14:47 - INFO - __main__ - Step 85552: {'lr': 0.00019966178788385168, 'samples': 16425984, 'steps': 85551, 'loss/train': 1.3365882635116577} 11/07/2021 09:14:48 - INFO - __main__ - Step 85553: {'lr': 0.000199656589839471, 'samples': 16426176, 'steps': 85552, 'loss/train': 1.4523913860321045} 11/07/2021 09:14:48 - INFO - __main__ - Step 85554: {'lr': 0.00019965139181777445, 'samples': 16426368, 'steps': 85553, 'loss/train': 1.2418032884597778} 11/07/2021 09:14:48 - INFO - __main__ - Step 85555: {'lr': 0.00019964619381876406, 'samples': 16426560, 'steps': 85554, 'loss/train': 1.5771702527999878} 11/07/2021 09:14:49 - INFO - __main__ - Step 85556: {'lr': 0.00019964099584244234, 'samples': 16426752, 'steps': 85555, 'loss/train': 1.3446143865585327} 11/07/2021 09:14:50 - INFO - __main__ - Step 85557: {'lr': 0.0001996357978888116, 'samples': 16426944, 'steps': 85556, 'loss/train': 1.4804449081420898} 11/07/2021 09:14:50 - INFO - __main__ - Step 85558: {'lr': 0.00019963059995787416, 'samples': 16427136, 'steps': 85557, 'loss/train': 1.466629147529602} 11/07/2021 09:14:50 - INFO - __main__ - Step 85559: {'lr': 0.00019962540204963242, 'samples': 16427328, 'steps': 85558, 'loss/train': 0.6730407476425171} 11/07/2021 09:14:51 - INFO - __main__ - Step 85560: {'lr': 0.00019962020416408873, 'samples': 16427520, 'steps': 85559, 'loss/train': 1.7297649383544922} 11/07/2021 09:14:52 - INFO - __main__ - Step 85561: {'lr': 0.00019961500630124535, 'samples': 16427712, 'steps': 85560, 'loss/train': 1.0012389421463013} 11/07/2021 09:14:52 - INFO - __main__ - Step 85562: {'lr': 0.00019960980846110465, 'samples': 16427904, 'steps': 85561, 'loss/train': 1.5235573053359985} 11/07/2021 09:14:53 - INFO - __main__ - Step 85563: {'lr': 0.00019960461064366905, 'samples': 16428096, 'steps': 85562, 'loss/train': 0.9670077562332153} 11/07/2021 09:14:53 - INFO - __main__ - Step 85564: {'lr': 0.0001995994128489408, 'samples': 16428288, 'steps': 85563, 'loss/train': 1.5592613220214844} 11/07/2021 09:14:53 - INFO - __main__ - Step 85565: {'lr': 0.0001995942150769223, 'samples': 16428480, 'steps': 85564, 'loss/train': 1.4471088647842407} 11/07/2021 09:14:54 - INFO - __main__ - Step 85566: {'lr': 0.00019958901732761592, 'samples': 16428672, 'steps': 85565, 'loss/train': 1.4579846858978271} 11/07/2021 09:14:55 - INFO - __main__ - Step 85567: {'lr': 0.00019958381960102396, 'samples': 16428864, 'steps': 85566, 'loss/train': 1.1832529306411743} 11/07/2021 09:14:55 - INFO - __main__ - Step 85568: {'lr': 0.00019957862189714867, 'samples': 16429056, 'steps': 85567, 'loss/train': 1.4809279441833496} 11/07/2021 09:14:55 - INFO - __main__ - Step 85569: {'lr': 0.0001995734242159925, 'samples': 16429248, 'steps': 85568, 'loss/train': 1.4686135053634644} 11/07/2021 09:14:56 - INFO - __main__ - Step 85570: {'lr': 0.00019956822655755775, 'samples': 16429440, 'steps': 85569, 'loss/train': 1.3896414041519165} 11/07/2021 09:14:56 - INFO - __main__ - Step 85571: {'lr': 0.00019956302892184678, 'samples': 16429632, 'steps': 85570, 'loss/train': 0.9808968901634216} 11/07/2021 09:14:57 - INFO - __main__ - Step 85572: {'lr': 0.00019955783130886192, 'samples': 16429824, 'steps': 85571, 'loss/train': 1.3060663938522339} 11/07/2021 09:14:57 - INFO - __main__ - Step 85573: {'lr': 0.00019955263371860554, 'samples': 16430016, 'steps': 85572, 'loss/train': 1.3387529850006104} 11/07/2021 09:14:58 - INFO - __main__ - Step 85574: {'lr': 0.00019954743615108, 'samples': 16430208, 'steps': 85573, 'loss/train': 1.1143567562103271} 11/07/2021 09:14:58 - INFO - __main__ - Step 85575: {'lr': 0.00019954223860628757, 'samples': 16430400, 'steps': 85574, 'loss/train': 1.4670162200927734} 11/07/2021 09:14:59 - INFO - __main__ - Step 85576: {'lr': 0.0001995370410842306, 'samples': 16430592, 'steps': 85575, 'loss/train': 1.2941242456436157} 11/07/2021 09:15:00 - INFO - __main__ - Step 85577: {'lr': 0.00019953184358491156, 'samples': 16430784, 'steps': 85576, 'loss/train': 1.2858303785324097} 11/07/2021 09:15:00 - INFO - __main__ - Step 85578: {'lr': 0.0001995266461083326, 'samples': 16430976, 'steps': 85577, 'loss/train': 1.493685007095337} 11/07/2021 09:15:01 - INFO - __main__ - Step 85579: {'lr': 0.00019952144865449618, 'samples': 16431168, 'steps': 85578, 'loss/train': 1.457711935043335} 11/07/2021 09:15:01 - INFO - __main__ - Step 85580: {'lr': 0.0001995162512234046, 'samples': 16431360, 'steps': 85579, 'loss/train': 1.4756386280059814} 11/07/2021 09:15:01 - INFO - __main__ - Step 85581: {'lr': 0.0001995110538150603, 'samples': 16431552, 'steps': 85580, 'loss/train': 1.5527130365371704} 11/07/2021 09:15:02 - INFO - __main__ - Step 85582: {'lr': 0.00019950585642946548, 'samples': 16431744, 'steps': 85581, 'loss/train': 1.1475266218185425} 11/07/2021 09:15:03 - INFO - __main__ - Step 85583: {'lr': 0.0001995006590666225, 'samples': 16431936, 'steps': 85582, 'loss/train': 1.0807875394821167} 11/07/2021 09:15:03 - INFO - __main__ - Step 85584: {'lr': 0.0001994954617265338, 'samples': 16432128, 'steps': 85583, 'loss/train': 3.208350896835327} 11/07/2021 09:15:03 - INFO - __main__ - Step 85585: {'lr': 0.00019949026440920165, 'samples': 16432320, 'steps': 85584, 'loss/train': 1.2621058225631714} 11/07/2021 09:15:04 - INFO - __main__ - Step 85586: {'lr': 0.0001994850671146284, 'samples': 16432512, 'steps': 85585, 'loss/train': 1.4769262075424194} 11/07/2021 09:15:04 - INFO - __main__ - Step 85587: {'lr': 0.00019947986984281647, 'samples': 16432704, 'steps': 85586, 'loss/train': 1.6376953125} 11/07/2021 09:15:05 - INFO - __main__ - Step 85588: {'lr': 0.00019947467259376803, 'samples': 16432896, 'steps': 85587, 'loss/train': 1.4028644561767578} 11/07/2021 09:15:05 - INFO - __main__ - Step 85589: {'lr': 0.00019946947536748555, 'samples': 16433088, 'steps': 85588, 'loss/train': 1.441910743713379} 11/07/2021 09:15:06 - INFO - __main__ - Step 85590: {'lr': 0.00019946427816397138, 'samples': 16433280, 'steps': 85589, 'loss/train': 1.3535054922103882} 11/07/2021 09:15:06 - INFO - __main__ - Step 85591: {'lr': 0.00019945908098322778, 'samples': 16433472, 'steps': 85590, 'loss/train': 1.4268237352371216} 11/07/2021 09:15:07 - INFO - __main__ - Step 85592: {'lr': 0.00019945388382525714, 'samples': 16433664, 'steps': 85591, 'loss/train': 1.3719695806503296} 11/07/2021 09:15:08 - INFO - __main__ - Step 85593: {'lr': 0.00019944868669006182, 'samples': 16433856, 'steps': 85592, 'loss/train': 0.781061053276062} 11/07/2021 09:15:08 - INFO - __main__ - Step 85594: {'lr': 0.00019944348957764418, 'samples': 16434048, 'steps': 85593, 'loss/train': 0.7162147164344788} 11/07/2021 09:15:08 - INFO - __main__ - Step 85595: {'lr': 0.0001994382924880065, 'samples': 16434240, 'steps': 85594, 'loss/train': 1.7798192501068115} 11/07/2021 09:15:09 - INFO - __main__ - Step 85596: {'lr': 0.0001994330954211511, 'samples': 16434432, 'steps': 85595, 'loss/train': 0.7918144464492798} 11/07/2021 09:15:09 - INFO - __main__ - Step 85597: {'lr': 0.0001994278983770804, 'samples': 16434624, 'steps': 85596, 'loss/train': 1.1586483716964722} 11/07/2021 09:15:09 - INFO - __main__ - Step 85598: {'lr': 0.00019942270135579672, 'samples': 16434816, 'steps': 85597, 'loss/train': 1.4800584316253662} 11/07/2021 09:15:10 - INFO - __main__ - Step 85599: {'lr': 0.0001994175043573024, 'samples': 16435008, 'steps': 85598, 'loss/train': 1.225879430770874} 11/07/2021 09:15:11 - INFO - __main__ - Step 85600: {'lr': 0.00019941230738159974, 'samples': 16435200, 'steps': 85599, 'loss/train': 1.6962834596633911} 11/07/2021 09:15:11 - INFO - __main__ - Step 85601: {'lr': 0.00019940711042869112, 'samples': 16435392, 'steps': 85600, 'loss/train': 1.1520198583602905} 11/07/2021 09:15:12 - INFO - __main__ - Step 85602: {'lr': 0.00019940191349857887, 'samples': 16435584, 'steps': 85601, 'loss/train': 1.1256861686706543} 11/07/2021 09:15:12 - INFO - __main__ - Step 85603: {'lr': 0.00019939671659126532, 'samples': 16435776, 'steps': 85602, 'loss/train': 1.3521074056625366} 11/07/2021 09:15:13 - INFO - __main__ - Step 85604: {'lr': 0.00019939151970675285, 'samples': 16435968, 'steps': 85603, 'loss/train': 1.5000485181808472} 11/07/2021 09:15:13 - INFO - __main__ - Step 85605: {'lr': 0.00019938632284504377, 'samples': 16436160, 'steps': 85604, 'loss/train': 1.408871054649353} 11/07/2021 09:15:14 - INFO - __main__ - Step 85606: {'lr': 0.00019938112600614044, 'samples': 16436352, 'steps': 85605, 'loss/train': 1.275313377380371} 11/07/2021 09:15:14 - INFO - __main__ - Step 85607: {'lr': 0.00019937592919004517, 'samples': 16436544, 'steps': 85606, 'loss/train': 1.4452682733535767} 11/07/2021 09:15:14 - INFO - __main__ - Step 85608: {'lr': 0.00019937073239676044, 'samples': 16436736, 'steps': 85607, 'loss/train': 1.1501246690750122} 11/07/2021 09:15:15 - INFO - __main__ - Step 85609: {'lr': 0.00019936553562628843, 'samples': 16436928, 'steps': 85608, 'loss/train': 1.4145066738128662} 11/07/2021 09:15:16 - INFO - __main__ - Step 85610: {'lr': 0.00019936033887863147, 'samples': 16437120, 'steps': 85609, 'loss/train': 1.3687406778335571} 11/07/2021 09:15:16 - INFO - __main__ - Step 85611: {'lr': 0.00019935514215379196, 'samples': 16437312, 'steps': 85610, 'loss/train': 1.6192119121551514} 11/07/2021 09:15:16 - INFO - __main__ - Step 85612: {'lr': 0.00019934994545177227, 'samples': 16437504, 'steps': 85611, 'loss/train': 1.576099157333374} 11/07/2021 09:15:17 - INFO - __main__ - Step 85613: {'lr': 0.00019934474877257469, 'samples': 16437696, 'steps': 85612, 'loss/train': 1.2583515644073486} 11/07/2021 09:15:17 - INFO - __main__ - Step 85614: {'lr': 0.0001993395521162016, 'samples': 16437888, 'steps': 85613, 'loss/train': 1.572571873664856} 11/07/2021 09:15:18 - INFO - __main__ - Step 85615: {'lr': 0.0001993343554826553, 'samples': 16438080, 'steps': 85614, 'loss/train': 1.5011229515075684} 11/07/2021 09:15:18 - INFO - __main__ - Step 85616: {'lr': 0.00019932915887193816, 'samples': 16438272, 'steps': 85615, 'loss/train': 1.5722352266311646} 11/07/2021 09:15:19 - INFO - __main__ - Step 85617: {'lr': 0.00019932396228405252, 'samples': 16438464, 'steps': 85616, 'loss/train': 1.1295723915100098} 11/07/2021 09:15:19 - INFO - __main__ - Step 85618: {'lr': 0.00019931876571900077, 'samples': 16438656, 'steps': 85617, 'loss/train': 1.0509430170059204} 11/07/2021 09:15:19 - INFO - __main__ - Step 85619: {'lr': 0.00019931356917678517, 'samples': 16438848, 'steps': 85618, 'loss/train': 1.3484636545181274} 11/07/2021 09:15:20 - INFO - __main__ - Step 85620: {'lr': 0.0001993083726574081, 'samples': 16439040, 'steps': 85619, 'loss/train': 2.0159361362457275} 11/07/2021 09:15:21 - INFO - __main__ - Step 85621: {'lr': 0.00019930317616087195, 'samples': 16439232, 'steps': 85620, 'loss/train': 1.3260635137557983} 11/07/2021 09:15:21 - INFO - __main__ - Step 85622: {'lr': 0.00019929797968717896, 'samples': 16439424, 'steps': 85621, 'loss/train': 1.4589773416519165} 11/07/2021 09:15:21 - INFO - __main__ - Step 85623: {'lr': 0.00019929278323633148, 'samples': 16439616, 'steps': 85622, 'loss/train': 1.5543208122253418} 11/07/2021 09:15:22 - INFO - __main__ - Step 85624: {'lr': 0.0001992875868083319, 'samples': 16439808, 'steps': 85623, 'loss/train': 1.2892309427261353} 11/07/2021 09:15:23 - INFO - __main__ - Step 85625: {'lr': 0.00019928239040318258, 'samples': 16440000, 'steps': 85624, 'loss/train': 0.8059571385383606} 11/07/2021 09:15:23 - INFO - __main__ - Step 85626: {'lr': 0.00019927719402088582, 'samples': 16440192, 'steps': 85625, 'loss/train': 0.5214450359344482} 11/07/2021 09:15:24 - INFO - __main__ - Step 85627: {'lr': 0.00019927199766144396, 'samples': 16440384, 'steps': 85626, 'loss/train': 1.5021913051605225} 11/07/2021 09:15:24 - INFO - __main__ - Step 85628: {'lr': 0.00019926680132485936, 'samples': 16440576, 'steps': 85627, 'loss/train': 1.6028701066970825} 11/07/2021 09:15:24 - INFO - __main__ - Step 85629: {'lr': 0.00019926160501113435, 'samples': 16440768, 'steps': 85628, 'loss/train': 1.2425357103347778} 11/07/2021 09:15:26 - INFO - __main__ - Step 85630: {'lr': 0.00019925640872027128, 'samples': 16440960, 'steps': 85629, 'loss/train': 1.6484839916229248} 11/07/2021 09:15:26 - INFO - __main__ - Step 85631: {'lr': 0.00019925121245227252, 'samples': 16441152, 'steps': 85630, 'loss/train': 1.3020647764205933} 11/07/2021 09:15:26 - INFO - __main__ - Step 85632: {'lr': 0.00019924601620714032, 'samples': 16441344, 'steps': 85631, 'loss/train': 1.367253065109253} 11/07/2021 09:15:27 - INFO - __main__ - Step 85633: {'lr': 0.00019924081998487714, 'samples': 16441536, 'steps': 85632, 'loss/train': 1.0860505104064941} 11/07/2021 09:15:27 - INFO - __main__ - Step 85634: {'lr': 0.0001992356237854852, 'samples': 16441728, 'steps': 85633, 'loss/train': 0.09293660521507263} 11/07/2021 09:15:27 - INFO - __main__ - Step 85635: {'lr': 0.0001992304276089671, 'samples': 16441920, 'steps': 85634, 'loss/train': 0.11037874221801758} 11/07/2021 09:15:28 - INFO - __main__ - Step 85636: {'lr': 0.0001992252314553248, 'samples': 16442112, 'steps': 85635, 'loss/train': 1.684244155883789} 11/07/2021 09:15:29 - INFO - __main__ - Step 85637: {'lr': 0.00019922003532456088, 'samples': 16442304, 'steps': 85636, 'loss/train': 1.1951693296432495} 11/07/2021 09:15:29 - INFO - __main__ - Step 85638: {'lr': 0.0001992148392166776, 'samples': 16442496, 'steps': 85637, 'loss/train': 1.3041340112686157} 11/07/2021 09:15:29 - INFO - __main__ - Step 85639: {'lr': 0.00019920964313167733, 'samples': 16442688, 'steps': 85638, 'loss/train': 0.6808624863624573} 11/07/2021 09:15:30 - INFO - __main__ - Step 85640: {'lr': 0.0001992044470695624, 'samples': 16442880, 'steps': 85639, 'loss/train': 0.7997856140136719} 11/07/2021 09:15:31 - INFO - __main__ - Step 85641: {'lr': 0.00019919925103033517, 'samples': 16443072, 'steps': 85640, 'loss/train': 1.4677493572235107} 11/07/2021 09:15:31 - INFO - __main__ - Step 85642: {'lr': 0.000199194055013998, 'samples': 16443264, 'steps': 85641, 'loss/train': 1.3944156169891357} 11/07/2021 09:15:32 - INFO - __main__ - Step 85643: {'lr': 0.00019918885902055317, 'samples': 16443456, 'steps': 85642, 'loss/train': 1.28264319896698} 11/07/2021 09:15:32 - INFO - __main__ - Step 85644: {'lr': 0.00019918366305000308, 'samples': 16443648, 'steps': 85643, 'loss/train': 1.4412461519241333} 11/07/2021 09:15:33 - INFO - __main__ - Step 85645: {'lr': 0.00019917846710235004, 'samples': 16443840, 'steps': 85644, 'loss/train': 1.1439610719680786} 11/07/2021 09:15:33 - INFO - __main__ - Step 85646: {'lr': 0.0001991732711775964, 'samples': 16444032, 'steps': 85645, 'loss/train': 0.968147873878479} 11/07/2021 09:15:34 - INFO - __main__ - Step 85647: {'lr': 0.0001991680752757445, 'samples': 16444224, 'steps': 85646, 'loss/train': 1.6605095863342285} 11/07/2021 09:15:34 - INFO - __main__ - Step 85648: {'lr': 0.00019916287939679677, 'samples': 16444416, 'steps': 85647, 'loss/train': 1.3442020416259766} 11/07/2021 09:15:35 - INFO - __main__ - Step 85649: {'lr': 0.0001991576835407554, 'samples': 16444608, 'steps': 85648, 'loss/train': 0.5650180578231812} 11/07/2021 09:15:35 - INFO - __main__ - Step 85650: {'lr': 0.00019915248770762276, 'samples': 16444800, 'steps': 85649, 'loss/train': 0.569491446018219} 11/07/2021 09:15:35 - INFO - __main__ - Step 85651: {'lr': 0.0001991472918974012, 'samples': 16444992, 'steps': 85650, 'loss/train': 1.0301729440689087} 11/07/2021 09:15:36 - INFO - __main__ - Step 85652: {'lr': 0.00019914209611009316, 'samples': 16445184, 'steps': 85651, 'loss/train': 1.345494270324707} 11/07/2021 09:15:37 - INFO - __main__ - Step 85653: {'lr': 0.00019913690034570084, 'samples': 16445376, 'steps': 85652, 'loss/train': 0.8102061748504639} 11/07/2021 09:15:37 - INFO - __main__ - Step 85654: {'lr': 0.00019913170460422668, 'samples': 16445568, 'steps': 85653, 'loss/train': 1.3576778173446655} 11/07/2021 09:15:37 - INFO - __main__ - Step 85655: {'lr': 0.00019912650888567296, 'samples': 16445760, 'steps': 85654, 'loss/train': 1.2511255741119385} 11/07/2021 09:15:38 - INFO - __main__ - Step 85656: {'lr': 0.00019912131319004206, 'samples': 16445952, 'steps': 85655, 'loss/train': 1.4254459142684937} 11/07/2021 09:15:39 - INFO - __main__ - Step 85657: {'lr': 0.00019911611751733633, 'samples': 16446144, 'steps': 85656, 'loss/train': 1.792788028717041} 11/07/2021 09:15:39 - INFO - __main__ - Step 85658: {'lr': 0.00019911092186755808, 'samples': 16446336, 'steps': 85657, 'loss/train': 1.4293899536132812} 11/07/2021 09:15:40 - INFO - __main__ - Step 85659: {'lr': 0.00019910572624070967, 'samples': 16446528, 'steps': 85658, 'loss/train': 1.3653398752212524} 11/07/2021 09:15:40 - INFO - __main__ - Step 85660: {'lr': 0.00019910053063679342, 'samples': 16446720, 'steps': 85659, 'loss/train': 1.348686933517456} 11/07/2021 09:15:40 - INFO - __main__ - Step 85661: {'lr': 0.0001990953350558117, 'samples': 16446912, 'steps': 85660, 'loss/train': 1.1101915836334229} 11/07/2021 09:15:41 - INFO - __main__ - Step 85662: {'lr': 0.00019909013949776695, 'samples': 16447104, 'steps': 85661, 'loss/train': 1.4466131925582886} 11/07/2021 09:15:42 - INFO - __main__ - Step 85663: {'lr': 0.00019908494396266127, 'samples': 16447296, 'steps': 85662, 'loss/train': 1.3831627368927002} 11/07/2021 09:15:42 - INFO - __main__ - Step 85664: {'lr': 0.00019907974845049714, 'samples': 16447488, 'steps': 85663, 'loss/train': 1.990651249885559} 11/07/2021 09:15:42 - INFO - __main__ - Step 85665: {'lr': 0.00019907455296127688, 'samples': 16447680, 'steps': 85664, 'loss/train': 1.1460697650909424} 11/07/2021 09:15:43 - INFO - __main__ - Step 85666: {'lr': 0.00019906935749500285, 'samples': 16447872, 'steps': 85665, 'loss/train': 1.4163146018981934} 11/07/2021 09:15:43 - INFO - __main__ - Step 85667: {'lr': 0.00019906416205167738, 'samples': 16448064, 'steps': 85666, 'loss/train': 1.2541767358779907} 11/07/2021 09:15:44 - INFO - __main__ - Step 85668: {'lr': 0.0001990589666313028, 'samples': 16448256, 'steps': 85667, 'loss/train': 1.1976592540740967} 11/07/2021 09:15:44 - INFO - __main__ - Step 85669: {'lr': 0.00019905377123388148, 'samples': 16448448, 'steps': 85668, 'loss/train': 1.182569980621338} 11/07/2021 09:15:45 - INFO - __main__ - Step 85670: {'lr': 0.00019904857585941574, 'samples': 16448640, 'steps': 85669, 'loss/train': 1.3211805820465088} 11/07/2021 09:15:45 - INFO - __main__ - Step 85671: {'lr': 0.00019904338050790794, 'samples': 16448832, 'steps': 85670, 'loss/train': 1.2982282638549805} 11/07/2021 09:15:45 - INFO - __main__ - Step 85672: {'lr': 0.00019903818517936039, 'samples': 16449024, 'steps': 85671, 'loss/train': 1.052125334739685} 11/07/2021 09:15:46 - INFO - __main__ - Step 85673: {'lr': 0.00019903298987377545, 'samples': 16449216, 'steps': 85672, 'loss/train': 0.6938971281051636} 11/07/2021 09:15:47 - INFO - __main__ - Step 85674: {'lr': 0.00019902779459115544, 'samples': 16449408, 'steps': 85673, 'loss/train': 1.1033891439437866} 11/07/2021 09:15:47 - INFO - __main__ - Step 85675: {'lr': 0.00019902259933150286, 'samples': 16449600, 'steps': 85674, 'loss/train': 1.3446470499038696} 11/07/2021 09:15:47 - INFO - __main__ - Step 85676: {'lr': 0.0001990174040948198, 'samples': 16449792, 'steps': 85675, 'loss/train': 1.388789176940918} 11/07/2021 09:15:48 - INFO - __main__ - Step 85677: {'lr': 0.00019901220888110868, 'samples': 16449984, 'steps': 85676, 'loss/train': 1.8971818685531616} 11/07/2021 09:15:49 - INFO - __main__ - Step 85678: {'lr': 0.00019900701369037188, 'samples': 16450176, 'steps': 85677, 'loss/train': 1.6524198055267334} 11/07/2021 09:15:49 - INFO - __main__ - Step 85679: {'lr': 0.00019900181852261175, 'samples': 16450368, 'steps': 85678, 'loss/train': 1.2216095924377441} 11/07/2021 09:15:50 - INFO - __main__ - Step 85680: {'lr': 0.00019899662337783061, 'samples': 16450560, 'steps': 85679, 'loss/train': 1.3702332973480225} 11/07/2021 09:15:50 - INFO - __main__ - Step 85681: {'lr': 0.00019899142825603077, 'samples': 16450752, 'steps': 85680, 'loss/train': 0.8287651538848877} 11/07/2021 09:15:50 - INFO - __main__ - Step 85682: {'lr': 0.00019898623315721468, 'samples': 16450944, 'steps': 85681, 'loss/train': 1.604740858078003} 11/07/2021 09:15:51 - INFO - __main__ - Step 85683: {'lr': 0.00019898103808138455, 'samples': 16451136, 'steps': 85682, 'loss/train': 1.4609739780426025} 11/07/2021 09:15:52 - INFO - __main__ - Step 85684: {'lr': 0.00019897584302854278, 'samples': 16451328, 'steps': 85683, 'loss/train': 1.4495002031326294} 11/07/2021 09:15:52 - INFO - __main__ - Step 85685: {'lr': 0.0001989706479986917, 'samples': 16451520, 'steps': 85684, 'loss/train': 1.280332088470459} 11/07/2021 09:15:52 - INFO - __main__ - Step 85686: {'lr': 0.0001989654529918337, 'samples': 16451712, 'steps': 85685, 'loss/train': 1.1289634704589844} 11/07/2021 09:15:53 - INFO - __main__ - Step 85687: {'lr': 0.00019896025800797103, 'samples': 16451904, 'steps': 85686, 'loss/train': 1.4153046607971191} 11/07/2021 09:15:53 - INFO - __main__ - Step 85688: {'lr': 0.00019895506304710623, 'samples': 16452096, 'steps': 85687, 'loss/train': 0.8988490700721741} 11/07/2021 09:15:54 - INFO - __main__ - Step 85689: {'lr': 0.00019894986810924136, 'samples': 16452288, 'steps': 85688, 'loss/train': 1.2506333589553833} 11/07/2021 09:15:55 - INFO - __main__ - Step 85690: {'lr': 0.00019894467319437893, 'samples': 16452480, 'steps': 85689, 'loss/train': 1.4636915922164917} 11/07/2021 09:15:55 - INFO - __main__ - Step 85691: {'lr': 0.00019893947830252118, 'samples': 16452672, 'steps': 85690, 'loss/train': 1.598542332649231} 11/07/2021 09:15:55 - INFO - __main__ - Step 85692: {'lr': 0.00019893428343367053, 'samples': 16452864, 'steps': 85691, 'loss/train': 1.0924640893936157} 11/07/2021 09:15:57 - INFO - __main__ - Step 85693: {'lr': 0.00019892908858782933, 'samples': 16453056, 'steps': 85692, 'loss/train': 1.5567890405654907} 11/07/2021 09:15:57 - INFO - __main__ - Step 85694: {'lr': 0.00019892389376499988, 'samples': 16453248, 'steps': 85693, 'loss/train': 1.8054015636444092} 11/07/2021 09:15:58 - INFO - __main__ - Step 85695: {'lr': 0.00019891869896518455, 'samples': 16453440, 'steps': 85694, 'loss/train': 1.5237090587615967} 11/07/2021 09:15:58 - INFO - __main__ - Step 85696: {'lr': 0.00019891350418838567, 'samples': 16453632, 'steps': 85695, 'loss/train': 1.3270633220672607} 11/07/2021 09:15:58 - INFO - __main__ - Step 85697: {'lr': 0.00019890830943460552, 'samples': 16453824, 'steps': 85696, 'loss/train': 1.6588337421417236} 11/07/2021 09:15:59 - INFO - __main__ - Step 85698: {'lr': 0.00019890311470384655, 'samples': 16454016, 'steps': 85697, 'loss/train': 1.3590013980865479} 11/07/2021 09:15:59 - INFO - __main__ - Step 85699: {'lr': 0.00019889791999611105, 'samples': 16454208, 'steps': 85698, 'loss/train': 1.2763633728027344} 11/07/2021 09:15:59 - INFO - __main__ - Step 85700: {'lr': 0.0001988927253114014, 'samples': 16454400, 'steps': 85699, 'loss/train': 1.715011715888977} 11/07/2021 09:16:00 - INFO - __main__ - Step 85701: {'lr': 0.00019888753064971983, 'samples': 16454592, 'steps': 85700, 'loss/train': 0.5926951766014099} 11/07/2021 09:16:01 - INFO - __main__ - Step 85702: {'lr': 0.00019888233601106882, 'samples': 16454784, 'steps': 85701, 'loss/train': 2.0037922859191895} 11/07/2021 09:16:01 - INFO - __main__ - Step 85703: {'lr': 0.00019887714139545058, 'samples': 16454976, 'steps': 85702, 'loss/train': 1.2847788333892822} 11/07/2021 09:16:01 - INFO - __main__ - Step 85704: {'lr': 0.0001988719468028675, 'samples': 16455168, 'steps': 85703, 'loss/train': 1.807613492012024} 11/07/2021 09:16:02 - INFO - __main__ - Step 85705: {'lr': 0.00019886675223332195, 'samples': 16455360, 'steps': 85704, 'loss/train': 0.8583006858825684} 11/07/2021 09:16:03 - INFO - __main__ - Step 85706: {'lr': 0.00019886155768681626, 'samples': 16455552, 'steps': 85705, 'loss/train': 1.3417874574661255} 11/07/2021 09:16:03 - INFO - __main__ - Step 85707: {'lr': 0.00019885636316335276, 'samples': 16455744, 'steps': 85706, 'loss/train': 1.6360820531845093} 11/07/2021 09:16:03 - INFO - __main__ - Step 85708: {'lr': 0.0001988511686629338, 'samples': 16455936, 'steps': 85707, 'loss/train': 1.1530349254608154} 11/07/2021 09:16:04 - INFO - __main__ - Step 85709: {'lr': 0.00019884597418556166, 'samples': 16456128, 'steps': 85708, 'loss/train': 1.4495433568954468} 11/07/2021 09:16:04 - INFO - __main__ - Step 85710: {'lr': 0.0001988407797312388, 'samples': 16456320, 'steps': 85709, 'loss/train': 1.1090329885482788} 11/07/2021 09:16:05 - INFO - __main__ - Step 85711: {'lr': 0.0001988355852999675, 'samples': 16456512, 'steps': 85710, 'loss/train': 1.3507208824157715} 11/07/2021 09:16:06 - INFO - __main__ - Step 85712: {'lr': 0.00019883039089175009, 'samples': 16456704, 'steps': 85711, 'loss/train': 1.4227383136749268} 11/07/2021 09:16:06 - INFO - __main__ - Step 85713: {'lr': 0.00019882519650658885, 'samples': 16456896, 'steps': 85712, 'loss/train': 1.786718726158142} 11/07/2021 09:16:06 - INFO - __main__ - Step 85714: {'lr': 0.00019882000214448625, 'samples': 16457088, 'steps': 85713, 'loss/train': 1.444156527519226} 11/07/2021 09:16:07 - INFO - __main__ - Step 85715: {'lr': 0.00019881480780544462, 'samples': 16457280, 'steps': 85714, 'loss/train': 1.0461360216140747} 11/07/2021 09:16:08 - INFO - __main__ - Step 85716: {'lr': 0.00019880961348946616, 'samples': 16457472, 'steps': 85715, 'loss/train': 1.593654990196228} 11/07/2021 09:16:08 - INFO - __main__ - Step 85717: {'lr': 0.00019880441919655333, 'samples': 16457664, 'steps': 85716, 'loss/train': 1.4178866147994995} 11/07/2021 09:16:08 - INFO - __main__ - Step 85718: {'lr': 0.0001987992249267084, 'samples': 16457856, 'steps': 85717, 'loss/train': 1.3719466924667358} 11/07/2021 09:16:09 - INFO - __main__ - Step 85719: {'lr': 0.0001987940306799338, 'samples': 16458048, 'steps': 85718, 'loss/train': 1.299224853515625} 11/07/2021 09:16:09 - INFO - __main__ - Step 85720: {'lr': 0.00019878883645623176, 'samples': 16458240, 'steps': 85719, 'loss/train': 0.788733959197998} 11/07/2021 09:16:10 - INFO - __main__ - Step 85721: {'lr': 0.00019878364225560472, 'samples': 16458432, 'steps': 85720, 'loss/train': 0.7739098072052002} 11/07/2021 09:16:11 - INFO - __main__ - Step 85722: {'lr': 0.000198778448078055, 'samples': 16458624, 'steps': 85721, 'loss/train': 1.3190736770629883} 11/07/2021 09:16:11 - INFO - __main__ - Step 85723: {'lr': 0.00019877325392358492, 'samples': 16458816, 'steps': 85722, 'loss/train': 1.6165233850479126} 11/07/2021 09:16:11 - INFO - __main__ - Step 85724: {'lr': 0.0001987680597921968, 'samples': 16459008, 'steps': 85723, 'loss/train': 0.9623837471008301} 11/07/2021 09:16:12 - INFO - __main__ - Step 85725: {'lr': 0.00019876286568389296, 'samples': 16459200, 'steps': 85724, 'loss/train': 0.10776834934949875} 11/07/2021 09:16:13 - INFO - __main__ - Step 85726: {'lr': 0.00019875767159867582, 'samples': 16459392, 'steps': 85725, 'loss/train': 1.1603456735610962} 11/07/2021 09:16:13 - INFO - __main__ - Step 85727: {'lr': 0.00019875247753654767, 'samples': 16459584, 'steps': 85726, 'loss/train': 0.7668682932853699} 11/07/2021 09:16:13 - INFO - __main__ - Step 85728: {'lr': 0.0001987472834975109, 'samples': 16459776, 'steps': 85727, 'loss/train': 1.4644230604171753} 11/07/2021 09:16:14 - INFO - __main__ - Step 85729: {'lr': 0.00019874208948156781, 'samples': 16459968, 'steps': 85728, 'loss/train': 1.123551607131958} 11/07/2021 09:16:14 - INFO - __main__ - Step 85730: {'lr': 0.00019873689548872072, 'samples': 16460160, 'steps': 85729, 'loss/train': 1.397312045097351} 11/07/2021 09:16:15 - INFO - __main__ - Step 85731: {'lr': 0.00019873170151897201, 'samples': 16460352, 'steps': 85730, 'loss/train': 1.7629450559616089} 11/07/2021 09:16:16 - INFO - __main__ - Step 85732: {'lr': 0.00019872650757232397, 'samples': 16460544, 'steps': 85731, 'loss/train': 1.408481240272522} 11/07/2021 09:16:16 - INFO - __main__ - Step 85733: {'lr': 0.00019872131364877905, 'samples': 16460736, 'steps': 85732, 'loss/train': 1.4281785488128662} 11/07/2021 09:16:16 - INFO - __main__ - Step 85734: {'lr': 0.00019871611974833949, 'samples': 16460928, 'steps': 85733, 'loss/train': 1.3705381155014038} 11/07/2021 09:16:17 - INFO - __main__ - Step 85735: {'lr': 0.00019871092587100757, 'samples': 16461120, 'steps': 85734, 'loss/train': 1.4138842821121216} 11/07/2021 09:16:18 - INFO - __main__ - Step 85736: {'lr': 0.00019870573201678576, 'samples': 16461312, 'steps': 85735, 'loss/train': 0.9449716210365295} 11/07/2021 09:16:18 - INFO - __main__ - Step 85737: {'lr': 0.00019870053818567637, 'samples': 16461504, 'steps': 85736, 'loss/train': 1.1347428560256958} 11/07/2021 09:16:18 - INFO - __main__ - Step 85738: {'lr': 0.0001986953443776817, 'samples': 16461696, 'steps': 85737, 'loss/train': 0.6171151399612427} 11/07/2021 09:16:19 - INFO - __main__ - Step 85739: {'lr': 0.00019869015059280416, 'samples': 16461888, 'steps': 85738, 'loss/train': 1.6539888381958008} 11/07/2021 09:16:19 - INFO - __main__ - Step 85740: {'lr': 0.00019868495683104603, 'samples': 16462080, 'steps': 85739, 'loss/train': 1.1681163311004639} 11/07/2021 09:16:19 - INFO - __main__ - Step 85741: {'lr': 0.00019867976309240965, 'samples': 16462272, 'steps': 85740, 'loss/train': 1.4125804901123047} 11/07/2021 09:16:20 - INFO - __main__ - Step 85742: {'lr': 0.00019867456937689744, 'samples': 16462464, 'steps': 85741, 'loss/train': 1.0982378721237183} 11/07/2021 09:16:21 - INFO - __main__ - Step 85743: {'lr': 0.0001986693756845116, 'samples': 16462656, 'steps': 85742, 'loss/train': 0.9373008012771606} 11/07/2021 09:16:21 - INFO - __main__ - Step 85744: {'lr': 0.00019866418201525463, 'samples': 16462848, 'steps': 85743, 'loss/train': 1.122179388999939} 11/07/2021 09:16:21 - INFO - __main__ - Step 85745: {'lr': 0.00019865898836912875, 'samples': 16463040, 'steps': 85744, 'loss/train': 1.2946839332580566} 11/07/2021 09:16:22 - INFO - __main__ - Step 85746: {'lr': 0.0001986537947461363, 'samples': 16463232, 'steps': 85745, 'loss/train': 1.232128620147705} 11/07/2021 09:16:23 - INFO - __main__ - Step 85747: {'lr': 0.00019864860114627967, 'samples': 16463424, 'steps': 85746, 'loss/train': 1.2394905090332031} 11/07/2021 09:16:23 - INFO - __main__ - Step 85748: {'lr': 0.00019864340756956116, 'samples': 16463616, 'steps': 85747, 'loss/train': 1.2888435125350952} 11/07/2021 09:16:24 - INFO - __main__ - Step 85749: {'lr': 0.0001986382140159832, 'samples': 16463808, 'steps': 85748, 'loss/train': 1.31252121925354} 11/07/2021 09:16:24 - INFO - __main__ - Step 85750: {'lr': 0.00019863302048554803, 'samples': 16464000, 'steps': 85749, 'loss/train': 1.1242952346801758} 11/07/2021 09:16:24 - INFO - __main__ - Step 85751: {'lr': 0.00019862782697825803, 'samples': 16464192, 'steps': 85750, 'loss/train': 1.6011090278625488} 11/07/2021 09:16:25 - INFO - __main__ - Step 85752: {'lr': 0.00019862263349411553, 'samples': 16464384, 'steps': 85751, 'loss/train': 1.4151723384857178} 11/07/2021 09:16:26 - INFO - __main__ - Step 85753: {'lr': 0.0001986174400331229, 'samples': 16464576, 'steps': 85752, 'loss/train': 1.637719988822937} 11/07/2021 09:16:26 - INFO - __main__ - Step 85754: {'lr': 0.00019861224659528244, 'samples': 16464768, 'steps': 85753, 'loss/train': 1.326134204864502} 11/07/2021 09:16:26 - INFO - __main__ - Step 85755: {'lr': 0.00019860705318059651, 'samples': 16464960, 'steps': 85754, 'loss/train': 1.6506246328353882} 11/07/2021 09:16:27 - INFO - __main__ - Step 85756: {'lr': 0.0001986018597890676, 'samples': 16465152, 'steps': 85755, 'loss/train': 1.3359123468399048} 11/07/2021 09:16:28 - INFO - __main__ - Step 85757: {'lr': 0.00019859666642069773, 'samples': 16465344, 'steps': 85756, 'loss/train': 1.6710515022277832} 11/07/2021 09:16:28 - INFO - __main__ - Step 85758: {'lr': 0.00019859147307548942, 'samples': 16465536, 'steps': 85757, 'loss/train': 1.1259468793869019} 11/07/2021 09:16:28 - INFO - __main__ - Step 85759: {'lr': 0.00019858627975344502, 'samples': 16465728, 'steps': 85758, 'loss/train': 1.2853641510009766} 11/07/2021 09:16:29 - INFO - __main__ - Step 85760: {'lr': 0.00019858108645456684, 'samples': 16465920, 'steps': 85759, 'loss/train': 0.933064877986908} 11/07/2021 09:16:29 - INFO - __main__ - Step 85761: {'lr': 0.00019857589317885725, 'samples': 16466112, 'steps': 85760, 'loss/train': 2.2314181327819824} 11/07/2021 09:16:30 - INFO - __main__ - Step 85762: {'lr': 0.00019857069992631855, 'samples': 16466304, 'steps': 85761, 'loss/train': 1.523768424987793} 11/07/2021 09:16:30 - INFO - __main__ - Step 85763: {'lr': 0.00019856550669695308, 'samples': 16466496, 'steps': 85762, 'loss/train': 1.3395488262176514} 11/07/2021 09:16:31 - INFO - __main__ - Step 85764: {'lr': 0.00019856031349076324, 'samples': 16466688, 'steps': 85763, 'loss/train': 1.6320923566818237} 11/07/2021 09:16:31 - INFO - __main__ - Step 85765: {'lr': 0.0001985551203077513, 'samples': 16466880, 'steps': 85764, 'loss/train': 0.762226939201355} 11/07/2021 09:16:31 - INFO - __main__ - Step 85766: {'lr': 0.00019854992714791962, 'samples': 16467072, 'steps': 85765, 'loss/train': 1.2164995670318604} 11/07/2021 09:16:32 - INFO - __main__ - Step 85767: {'lr': 0.00019854473401127056, 'samples': 16467264, 'steps': 85766, 'loss/train': 0.9646930694580078} 11/07/2021 09:16:33 - INFO - __main__ - Step 85768: {'lr': 0.00019853954089780646, 'samples': 16467456, 'steps': 85767, 'loss/train': 1.392729640007019} 11/07/2021 09:16:34 - INFO - __main__ - Step 85769: {'lr': 0.00019853434780752973, 'samples': 16467648, 'steps': 85768, 'loss/train': 1.409970998764038} 11/07/2021 09:16:34 - INFO - __main__ - Step 85770: {'lr': 0.00019852915474044257, 'samples': 16467840, 'steps': 85769, 'loss/train': 0.23893219232559204} 11/07/2021 09:16:34 - INFO - __main__ - Step 85771: {'lr': 0.00019852396169654736, 'samples': 16468032, 'steps': 85770, 'loss/train': 1.8089628219604492} 11/07/2021 09:16:35 - INFO - __main__ - Step 85772: {'lr': 0.00019851876867584643, 'samples': 16468224, 'steps': 85771, 'loss/train': 1.8433369398117065} 11/07/2021 09:16:36 - INFO - __main__ - Step 85773: {'lr': 0.00019851357567834217, 'samples': 16468416, 'steps': 85772, 'loss/train': 1.5509918928146362} 11/07/2021 09:16:36 - INFO - __main__ - Step 85774: {'lr': 0.00019850838270403688, 'samples': 16468608, 'steps': 85773, 'loss/train': 1.080805778503418} 11/07/2021 09:16:37 - INFO - __main__ - Step 85775: {'lr': 0.00019850318975293295, 'samples': 16468800, 'steps': 85774, 'loss/train': 0.9584475755691528} 11/07/2021 09:16:37 - INFO - __main__ - Step 85776: {'lr': 0.00019849799682503265, 'samples': 16468992, 'steps': 85775, 'loss/train': 1.3082565069198608} 11/07/2021 09:16:37 - INFO - __main__ - Step 85777: {'lr': 0.00019849280392033838, 'samples': 16469184, 'steps': 85776, 'loss/train': 0.8270276188850403} 11/07/2021 09:16:38 - INFO - __main__ - Step 85778: {'lr': 0.00019848761103885245, 'samples': 16469376, 'steps': 85777, 'loss/train': 0.6040821671485901} 11/07/2021 09:16:39 - INFO - __main__ - Step 85779: {'lr': 0.00019848241818057723, 'samples': 16469568, 'steps': 85778, 'loss/train': 1.2155845165252686} 11/07/2021 09:16:39 - INFO - __main__ - Step 85780: {'lr': 0.00019847722534551502, 'samples': 16469760, 'steps': 85779, 'loss/train': 1.5120625495910645} 11/07/2021 09:16:39 - INFO - __main__ - Step 85781: {'lr': 0.0001984720325336682, 'samples': 16469952, 'steps': 85780, 'loss/train': 1.721375823020935} 11/07/2021 09:16:40 - INFO - __main__ - Step 85782: {'lr': 0.00019846683974503903, 'samples': 16470144, 'steps': 85781, 'loss/train': 0.4413132071495056} 11/07/2021 09:16:41 - INFO - __main__ - Step 85783: {'lr': 0.00019846164697963006, 'samples': 16470336, 'steps': 85782, 'loss/train': 1.9356714487075806} 11/07/2021 09:16:41 - INFO - __main__ - Step 85784: {'lr': 0.00019845645423744335, 'samples': 16470528, 'steps': 85783, 'loss/train': 1.3459103107452393} 11/07/2021 09:16:41 - INFO - __main__ - Step 85785: {'lr': 0.0001984512615184814, 'samples': 16470720, 'steps': 85784, 'loss/train': 1.726349949836731} 11/07/2021 09:16:42 - INFO - __main__ - Step 85786: {'lr': 0.00019844606882274648, 'samples': 16470912, 'steps': 85785, 'loss/train': 1.6485557556152344} 11/07/2021 09:16:42 - INFO - __main__ - Step 85787: {'lr': 0.00019844087615024099, 'samples': 16471104, 'steps': 85786, 'loss/train': 0.7941091060638428} 11/07/2021 09:16:43 - INFO - __main__ - Step 85788: {'lr': 0.00019843568350096723, 'samples': 16471296, 'steps': 85787, 'loss/train': 1.4122899770736694} 11/07/2021 09:16:43 - INFO - __main__ - Step 85789: {'lr': 0.00019843049087492755, 'samples': 16471488, 'steps': 85788, 'loss/train': 1.7543951272964478} 11/07/2021 09:16:44 - INFO - __main__ - Step 85790: {'lr': 0.0001984252982721243, 'samples': 16471680, 'steps': 85789, 'loss/train': 1.802300214767456} 11/07/2021 09:16:44 - INFO - __main__ - Step 85791: {'lr': 0.0001984201056925598, 'samples': 16471872, 'steps': 85790, 'loss/train': 1.068422794342041} 11/07/2021 09:16:44 - INFO - __main__ - Step 85792: {'lr': 0.00019841491313623644, 'samples': 16472064, 'steps': 85791, 'loss/train': 1.152634859085083} 11/07/2021 09:16:45 - INFO - __main__ - Step 85793: {'lr': 0.00019840972060315652, 'samples': 16472256, 'steps': 85792, 'loss/train': 1.141889214515686} 11/07/2021 09:16:46 - INFO - __main__ - Step 85794: {'lr': 0.00019840452809332236, 'samples': 16472448, 'steps': 85793, 'loss/train': 1.6344447135925293} 11/07/2021 09:16:46 - INFO - __main__ - Step 85795: {'lr': 0.00019839933560673634, 'samples': 16472640, 'steps': 85794, 'loss/train': 1.2653456926345825} 11/07/2021 09:16:46 - INFO - __main__ - Step 85796: {'lr': 0.00019839414314340087, 'samples': 16472832, 'steps': 85795, 'loss/train': 1.4052979946136475} 11/07/2021 09:16:47 - INFO - __main__ - Step 85797: {'lr': 0.0001983889507033181, 'samples': 16473024, 'steps': 85796, 'loss/train': 0.3991985619068146} 11/07/2021 09:16:47 - INFO - __main__ - Step 85798: {'lr': 0.00019838375828649048, 'samples': 16473216, 'steps': 85797, 'loss/train': 1.3967629671096802} 11/07/2021 09:16:48 - INFO - __main__ - Step 85799: {'lr': 0.00019837856589292036, 'samples': 16473408, 'steps': 85798, 'loss/train': 0.9765819311141968} 11/07/2021 09:16:49 - INFO - __main__ - Step 85800: {'lr': 0.00019837337352261004, 'samples': 16473600, 'steps': 85799, 'loss/train': 1.2657073736190796} 11/07/2021 09:16:49 - INFO - __main__ - Step 85801: {'lr': 0.00019836818117556187, 'samples': 16473792, 'steps': 85800, 'loss/train': 0.8816910982131958} 11/07/2021 09:16:49 - INFO - __main__ - Step 85802: {'lr': 0.00019836298885177826, 'samples': 16473984, 'steps': 85801, 'loss/train': 1.2862545251846313} 11/07/2021 09:16:50 - INFO - __main__ - Step 85803: {'lr': 0.00019835779655126145, 'samples': 16474176, 'steps': 85802, 'loss/train': 1.293613314628601} 11/07/2021 09:16:51 - INFO - __main__ - Step 85804: {'lr': 0.0001983526042740138, 'samples': 16474368, 'steps': 85803, 'loss/train': 1.3868389129638672} 11/07/2021 09:16:51 - INFO - __main__ - Step 85805: {'lr': 0.0001983474120200377, 'samples': 16474560, 'steps': 85804, 'loss/train': 1.345402717590332} 11/07/2021 09:16:52 - INFO - __main__ - Step 85806: {'lr': 0.00019834221978933542, 'samples': 16474752, 'steps': 85805, 'loss/train': 1.0493420362472534} 11/07/2021 09:16:52 - INFO - __main__ - Step 85807: {'lr': 0.0001983370275819094, 'samples': 16474944, 'steps': 85806, 'loss/train': 1.0905706882476807} 11/07/2021 09:16:52 - INFO - __main__ - Step 85808: {'lr': 0.00019833183539776187, 'samples': 16475136, 'steps': 85807, 'loss/train': 1.3010972738265991} 11/07/2021 09:16:54 - INFO - __main__ - Step 85809: {'lr': 0.00019832664323689533, 'samples': 16475328, 'steps': 85808, 'loss/train': 1.3327431678771973} 11/07/2021 09:16:54 - INFO - __main__ - Step 85810: {'lr': 0.0001983214510993119, 'samples': 16475520, 'steps': 85809, 'loss/train': 1.8970786333084106} 11/07/2021 09:16:54 - INFO - __main__ - Step 85811: {'lr': 0.00019831625898501405, 'samples': 16475712, 'steps': 85810, 'loss/train': 0.9803853034973145} 11/07/2021 09:16:55 - INFO - __main__ - Step 85812: {'lr': 0.00019831106689400407, 'samples': 16475904, 'steps': 85811, 'loss/train': 1.1961562633514404} 11/07/2021 09:16:55 - INFO - __main__ - Step 85813: {'lr': 0.00019830587482628435, 'samples': 16476096, 'steps': 85812, 'loss/train': 1.4954185485839844} 11/07/2021 09:16:56 - INFO - __main__ - Step 85814: {'lr': 0.0001983006827818572, 'samples': 16476288, 'steps': 85813, 'loss/train': 0.2157098352909088} 11/07/2021 09:16:56 - INFO - __main__ - Step 85815: {'lr': 0.00019829549076072494, 'samples': 16476480, 'steps': 85814, 'loss/train': 1.6184929609298706} 11/07/2021 09:16:57 - INFO - __main__ - Step 85816: {'lr': 0.00019829029876288994, 'samples': 16476672, 'steps': 85815, 'loss/train': 1.2939776182174683} 11/07/2021 09:16:57 - INFO - __main__ - Step 85817: {'lr': 0.00019828510678835456, 'samples': 16476864, 'steps': 85816, 'loss/train': 1.7323801517486572} 11/07/2021 09:16:57 - INFO - __main__ - Step 85818: {'lr': 0.00019827991483712111, 'samples': 16477056, 'steps': 85817, 'loss/train': 1.4908714294433594} 11/07/2021 09:16:58 - INFO - __main__ - Step 85819: {'lr': 0.00019827472290919192, 'samples': 16477248, 'steps': 85818, 'loss/train': 0.2666013538837433} 11/07/2021 09:16:59 - INFO - __main__ - Step 85820: {'lr': 0.00019826953100456933, 'samples': 16477440, 'steps': 85819, 'loss/train': 0.9039673209190369} 11/07/2021 09:16:59 - INFO - __main__ - Step 85821: {'lr': 0.0001982643391232557, 'samples': 16477632, 'steps': 85820, 'loss/train': 0.9935824275016785} 11/07/2021 09:16:59 - INFO - __main__ - Step 85822: {'lr': 0.00019825914726525335, 'samples': 16477824, 'steps': 85821, 'loss/train': 1.401077151298523} 11/07/2021 09:17:00 - INFO - __main__ - Step 85823: {'lr': 0.00019825395543056476, 'samples': 16478016, 'steps': 85822, 'loss/train': 1.4007689952850342} 11/07/2021 09:17:00 - INFO - __main__ - Step 85824: {'lr': 0.00019824876361919204, 'samples': 16478208, 'steps': 85823, 'loss/train': 1.5093145370483398} 11/07/2021 09:17:01 - INFO - __main__ - Step 85825: {'lr': 0.00019824357183113758, 'samples': 16478400, 'steps': 85824, 'loss/train': 1.0646876096725464} 11/07/2021 09:17:02 - INFO - __main__ - Step 85826: {'lr': 0.00019823838006640383, 'samples': 16478592, 'steps': 85825, 'loss/train': 0.875577986240387} 11/07/2021 09:17:02 - INFO - __main__ - Step 85827: {'lr': 0.00019823318832499302, 'samples': 16478784, 'steps': 85826, 'loss/train': 1.1591519117355347} 11/07/2021 09:17:02 - INFO - __main__ - Step 85828: {'lr': 0.00019822799660690755, 'samples': 16478976, 'steps': 85827, 'loss/train': 1.3470885753631592} 11/07/2021 09:17:03 - INFO - __main__ - Step 85829: {'lr': 0.00019822280491214975, 'samples': 16479168, 'steps': 85828, 'loss/train': 1.1418797969818115} 11/07/2021 09:17:04 - INFO - __main__ - Step 85830: {'lr': 0.00019821761324072197, 'samples': 16479360, 'steps': 85829, 'loss/train': 1.2080460786819458} 11/07/2021 09:17:04 - INFO - __main__ - Step 85831: {'lr': 0.0001982124215926265, 'samples': 16479552, 'steps': 85830, 'loss/train': 1.203875184059143} 11/07/2021 09:17:04 - INFO - __main__ - Step 85832: {'lr': 0.0001982072299678657, 'samples': 16479744, 'steps': 85831, 'loss/train': 1.7346560955047607} 11/07/2021 09:17:05 - INFO - __main__ - Step 85833: {'lr': 0.000198202038366442, 'samples': 16479936, 'steps': 85832, 'loss/train': 1.2879526615142822} 11/07/2021 09:17:05 - INFO - __main__ - Step 85834: {'lr': 0.00019819684678835765, 'samples': 16480128, 'steps': 85833, 'loss/train': 2.403252601623535} 11/07/2021 09:17:05 - INFO - __main__ - Step 85835: {'lr': 0.000198191655233615, 'samples': 16480320, 'steps': 85834, 'loss/train': 1.39680016040802} 11/07/2021 09:17:06 - INFO - __main__ - Step 85836: {'lr': 0.00019818646370221637, 'samples': 16480512, 'steps': 85835, 'loss/train': 1.4472863674163818} 11/07/2021 09:17:07 - INFO - __main__ - Step 85837: {'lr': 0.00019818127219416412, 'samples': 16480704, 'steps': 85836, 'loss/train': 1.395701289176941} 11/07/2021 09:17:07 - INFO - __main__ - Step 85838: {'lr': 0.0001981760807094606, 'samples': 16480896, 'steps': 85837, 'loss/train': 1.2127141952514648} 11/07/2021 09:17:07 - INFO - __main__ - Step 85839: {'lr': 0.0001981708892481081, 'samples': 16481088, 'steps': 85838, 'loss/train': 1.5398956537246704} 11/07/2021 09:17:08 - INFO - __main__ - Step 85840: {'lr': 0.00019816569781010902, 'samples': 16481280, 'steps': 85839, 'loss/train': 0.9669234156608582} 11/07/2021 09:17:09 - INFO - __main__ - Step 85841: {'lr': 0.00019816050639546564, 'samples': 16481472, 'steps': 85840, 'loss/train': 1.3255280256271362} 11/07/2021 09:17:10 - INFO - __main__ - Step 85842: {'lr': 0.0001981553150041804, 'samples': 16481664, 'steps': 85841, 'loss/train': 1.431123971939087} 11/07/2021 09:17:10 - INFO - __main__ - Step 85843: {'lr': 0.0001981501236362555, 'samples': 16481856, 'steps': 85842, 'loss/train': 0.9538840055465698} 11/07/2021 09:17:11 - INFO - __main__ - Step 85844: {'lr': 0.00019814493229169342, 'samples': 16482048, 'steps': 85843, 'loss/train': 0.7075463533401489} 11/07/2021 09:17:11 - INFO - __main__ - Step 85845: {'lr': 0.00019813974097049648, 'samples': 16482240, 'steps': 85844, 'loss/train': 0.7651471495628357} 11/07/2021 09:17:12 - INFO - __main__ - Step 85846: {'lr': 0.0001981345496726669, 'samples': 16482432, 'steps': 85845, 'loss/train': 1.3766447305679321} 11/07/2021 09:17:12 - INFO - __main__ - Step 85847: {'lr': 0.00019812935839820707, 'samples': 16482624, 'steps': 85846, 'loss/train': 1.6514309644699097} 11/07/2021 09:17:13 - INFO - __main__ - Step 85848: {'lr': 0.0001981241671471194, 'samples': 16482816, 'steps': 85847, 'loss/train': 1.6378718614578247} 11/07/2021 09:17:13 - INFO - __main__ - Step 85849: {'lr': 0.00019811897591940614, 'samples': 16483008, 'steps': 85848, 'loss/train': 1.638987421989441} 11/07/2021 09:17:13 - INFO - __main__ - Step 85850: {'lr': 0.00019811378471506976, 'samples': 16483200, 'steps': 85849, 'loss/train': 1.5704855918884277} 11/07/2021 09:17:14 - INFO - __main__ - Step 85851: {'lr': 0.0001981085935341124, 'samples': 16483392, 'steps': 85850, 'loss/train': 1.4861892461776733} 11/07/2021 09:17:15 - INFO - __main__ - Step 85852: {'lr': 0.00019810340237653653, 'samples': 16483584, 'steps': 85851, 'loss/train': 0.43836748600006104} 11/07/2021 09:17:15 - INFO - __main__ - Step 85853: {'lr': 0.00019809821124234448, 'samples': 16483776, 'steps': 85852, 'loss/train': 1.701228380203247} 11/07/2021 09:17:15 - INFO - __main__ - Step 85854: {'lr': 0.00019809302013153857, 'samples': 16483968, 'steps': 85853, 'loss/train': 1.3701276779174805} 11/07/2021 09:17:16 - INFO - __main__ - Step 85855: {'lr': 0.00019808782904412114, 'samples': 16484160, 'steps': 85854, 'loss/train': 1.4975148439407349} 11/07/2021 09:17:16 - INFO - __main__ - Step 85856: {'lr': 0.00019808263798009457, 'samples': 16484352, 'steps': 85855, 'loss/train': 1.546027660369873} 11/07/2021 09:17:17 - INFO - __main__ - Step 85857: {'lr': 0.00019807744693946114, 'samples': 16484544, 'steps': 85856, 'loss/train': 1.4345093965530396} 11/07/2021 09:17:17 - INFO - __main__ - Step 85858: {'lr': 0.0001980722559222232, 'samples': 16484736, 'steps': 85857, 'loss/train': 1.3649274110794067} 11/07/2021 09:17:18 - INFO - __main__ - Step 85859: {'lr': 0.0001980670649283831, 'samples': 16484928, 'steps': 85858, 'loss/train': 1.4150019884109497} 11/07/2021 09:17:18 - INFO - __main__ - Step 85860: {'lr': 0.00019806187395794318, 'samples': 16485120, 'steps': 85859, 'loss/train': 1.5698587894439697} 11/07/2021 09:17:18 - INFO - __main__ - Step 85861: {'lr': 0.00019805668301090578, 'samples': 16485312, 'steps': 85860, 'loss/train': 1.3513388633728027} 11/07/2021 09:17:19 - INFO - __main__ - Step 85862: {'lr': 0.00019805149208727325, 'samples': 16485504, 'steps': 85861, 'loss/train': 1.3942950963974} 11/07/2021 09:17:20 - INFO - __main__ - Step 85863: {'lr': 0.0001980463011870479, 'samples': 16485696, 'steps': 85862, 'loss/train': 1.7372009754180908} 11/07/2021 09:17:20 - INFO - __main__ - Step 85864: {'lr': 0.00019804111031023212, 'samples': 16485888, 'steps': 85863, 'loss/train': 1.2422504425048828} 11/07/2021 09:17:20 - INFO - __main__ - Step 85865: {'lr': 0.00019803591945682816, 'samples': 16486080, 'steps': 85864, 'loss/train': 1.197346806526184} 11/07/2021 09:17:21 - INFO - __main__ - Step 85866: {'lr': 0.00019803072862683847, 'samples': 16486272, 'steps': 85865, 'loss/train': 1.8083081245422363} 11/07/2021 09:17:22 - INFO - __main__ - Step 85867: {'lr': 0.00019802553782026532, 'samples': 16486464, 'steps': 85866, 'loss/train': 1.1459378004074097} 11/07/2021 09:17:22 - INFO - __main__ - Step 85868: {'lr': 0.00019802034703711102, 'samples': 16486656, 'steps': 85867, 'loss/train': 1.3457874059677124} 11/07/2021 09:17:22 - INFO - __main__ - Step 85869: {'lr': 0.00019801515627737798, 'samples': 16486848, 'steps': 85868, 'loss/train': 1.4721384048461914} 11/07/2021 09:17:23 - INFO - __main__ - Step 85870: {'lr': 0.00019800996554106848, 'samples': 16487040, 'steps': 85869, 'loss/train': 1.5496598482131958} 11/07/2021 09:17:23 - INFO - __main__ - Step 85871: {'lr': 0.0001980047748281849, 'samples': 16487232, 'steps': 85870, 'loss/train': 1.4277957677841187} 11/07/2021 09:17:24 - INFO - __main__ - Step 85872: {'lr': 0.00019799958413872957, 'samples': 16487424, 'steps': 85871, 'loss/train': 1.8854873180389404} 11/07/2021 09:17:25 - INFO - __main__ - Step 85873: {'lr': 0.0001979943934727048, 'samples': 16487616, 'steps': 85872, 'loss/train': 1.176744818687439} 11/07/2021 09:17:25 - INFO - __main__ - Step 85874: {'lr': 0.000197989202830113, 'samples': 16487808, 'steps': 85873, 'loss/train': 1.723524808883667} 11/07/2021 09:17:25 - INFO - __main__ - Step 85875: {'lr': 0.00019798401221095643, 'samples': 16488000, 'steps': 85874, 'loss/train': 5.74803352355957} 11/07/2021 09:17:26 - INFO - __main__ - Step 85876: {'lr': 0.00019797882161523748, 'samples': 16488192, 'steps': 85875, 'loss/train': 1.6798454523086548} 11/07/2021 09:17:26 - INFO - __main__ - Step 85877: {'lr': 0.00019797363104295853, 'samples': 16488384, 'steps': 85876, 'loss/train': 1.2568914890289307} 11/07/2021 09:17:27 - INFO - __main__ - Step 85878: {'lr': 0.00019796844049412184, 'samples': 16488576, 'steps': 85877, 'loss/train': 1.3127530813217163} 11/07/2021 09:17:28 - INFO - __main__ - Step 85879: {'lr': 0.0001979632499687297, 'samples': 16488768, 'steps': 85878, 'loss/train': 1.52670156955719} 11/07/2021 09:17:28 - INFO - __main__ - Step 85880: {'lr': 0.00019795805946678453, 'samples': 16488960, 'steps': 85879, 'loss/train': 1.6265854835510254} 11/07/2021 09:17:28 - INFO - __main__ - Step 85881: {'lr': 0.00019795286898828865, 'samples': 16489152, 'steps': 85880, 'loss/train': 0.9122320413589478} 11/07/2021 09:17:29 - INFO - __main__ - Step 85882: {'lr': 0.00019794767853324442, 'samples': 16489344, 'steps': 85881, 'loss/train': 1.0373753309249878} 11/07/2021 09:17:30 - INFO - __main__ - Step 85883: {'lr': 0.00019794248810165414, 'samples': 16489536, 'steps': 85882, 'loss/train': 1.211308240890503} 11/07/2021 09:17:30 - INFO - __main__ - Step 85884: {'lr': 0.0001979372976935202, 'samples': 16489728, 'steps': 85883, 'loss/train': 1.389923334121704} 11/07/2021 09:17:30 - INFO - __main__ - Step 85885: {'lr': 0.0001979321073088449, 'samples': 16489920, 'steps': 85884, 'loss/train': 1.1181576251983643} 11/07/2021 09:17:31 - INFO - __main__ - Step 85886: {'lr': 0.00019792691694763058, 'samples': 16490112, 'steps': 85885, 'loss/train': 1.6532649993896484} 11/07/2021 09:17:31 - INFO - __main__ - Step 85887: {'lr': 0.0001979217266098796, 'samples': 16490304, 'steps': 85886, 'loss/train': 2.0035831928253174} 11/07/2021 09:17:32 - INFO - __main__ - Step 85888: {'lr': 0.00019791653629559424, 'samples': 16490496, 'steps': 85887, 'loss/train': 1.0971121788024902} 11/07/2021 09:17:32 - INFO - __main__ - Step 85889: {'lr': 0.00019791134600477694, 'samples': 16490688, 'steps': 85888, 'loss/train': 1.2869460582733154} 11/07/2021 09:17:33 - INFO - __main__ - Step 85890: {'lr': 0.00019790615573743009, 'samples': 16490880, 'steps': 85889, 'loss/train': 0.8007276058197021} 11/07/2021 09:17:33 - INFO - __main__ - Step 85891: {'lr': 0.0001979009654935558, 'samples': 16491072, 'steps': 85890, 'loss/train': 1.6282036304473877} 11/07/2021 09:17:33 - INFO - __main__ - Step 85892: {'lr': 0.00019789577527315653, 'samples': 16491264, 'steps': 85891, 'loss/train': 1.300024390220642} 11/07/2021 09:17:35 - INFO - __main__ - Step 85893: {'lr': 0.0001978905850762346, 'samples': 16491456, 'steps': 85892, 'loss/train': 1.3820123672485352} 11/07/2021 09:17:35 - INFO - __main__ - Step 85894: {'lr': 0.0001978853949027924, 'samples': 16491648, 'steps': 85893, 'loss/train': 1.3487285375595093} 11/07/2021 09:17:35 - INFO - __main__ - Step 85895: {'lr': 0.00019788020475283223, 'samples': 16491840, 'steps': 85894, 'loss/train': 1.154742956161499} 11/07/2021 09:17:36 - INFO - __main__ - Step 85896: {'lr': 0.00019787501462635644, 'samples': 16492032, 'steps': 85895, 'loss/train': 1.6104241609573364} 11/07/2021 09:17:36 - INFO - __main__ - Step 85897: {'lr': 0.00019786982452336732, 'samples': 16492224, 'steps': 85896, 'loss/train': 1.795609951019287} 11/07/2021 09:17:36 - INFO - __main__ - Step 85898: {'lr': 0.00019786463444386733, 'samples': 16492416, 'steps': 85897, 'loss/train': 1.2899079322814941} 11/07/2021 09:17:37 - INFO - __main__ - Step 85899: {'lr': 0.00019785944438785867, 'samples': 16492608, 'steps': 85898, 'loss/train': 1.7357189655303955} 11/07/2021 09:17:38 - INFO - __main__ - Step 85900: {'lr': 0.00019785425435534377, 'samples': 16492800, 'steps': 85899, 'loss/train': 0.49218207597732544} 11/07/2021 09:17:38 - INFO - __main__ - Step 85901: {'lr': 0.0001978490643463249, 'samples': 16492992, 'steps': 85900, 'loss/train': 1.34369957447052} 11/07/2021 09:17:38 - INFO - __main__ - Step 85902: {'lr': 0.00019784387436080447, 'samples': 16493184, 'steps': 85901, 'loss/train': 1.2939497232437134} 11/07/2021 09:17:39 - INFO - __main__ - Step 85903: {'lr': 0.00019783868439878478, 'samples': 16493376, 'steps': 85902, 'loss/train': 0.5575409531593323} 11/07/2021 09:17:40 - INFO - __main__ - Step 85904: {'lr': 0.00019783349446026826, 'samples': 16493568, 'steps': 85903, 'loss/train': 1.3088544607162476} 11/07/2021 09:17:40 - INFO - __main__ - Step 85905: {'lr': 0.0001978283045452571, 'samples': 16493760, 'steps': 85904, 'loss/train': 1.0948988199234009} 11/07/2021 09:17:40 - INFO - __main__ - Step 85906: {'lr': 0.0001978231146537537, 'samples': 16493952, 'steps': 85905, 'loss/train': 0.15611028671264648} 11/07/2021 09:17:41 - INFO - __main__ - Step 85907: {'lr': 0.00019781792478576035, 'samples': 16494144, 'steps': 85906, 'loss/train': 1.124588131904602} 11/07/2021 09:17:41 - INFO - __main__ - Step 85908: {'lr': 0.00019781273494127947, 'samples': 16494336, 'steps': 85907, 'loss/train': 0.9565332531929016} 11/07/2021 09:17:42 - INFO - __main__ - Step 85909: {'lr': 0.00019780754512031335, 'samples': 16494528, 'steps': 85908, 'loss/train': 1.2852869033813477} 11/07/2021 09:17:43 - INFO - __main__ - Step 85910: {'lr': 0.00019780235532286435, 'samples': 16494720, 'steps': 85909, 'loss/train': 1.5222336053848267} 11/07/2021 09:17:43 - INFO - __main__ - Step 85911: {'lr': 0.00019779716554893482, 'samples': 16494912, 'steps': 85910, 'loss/train': 1.3967957496643066} 11/07/2021 09:17:43 - INFO - __main__ - Step 85912: {'lr': 0.00019779197579852705, 'samples': 16495104, 'steps': 85911, 'loss/train': 1.4944374561309814} 11/07/2021 09:17:44 - INFO - __main__ - Step 85913: {'lr': 0.00019778678607164347, 'samples': 16495296, 'steps': 85912, 'loss/train': 0.9537240862846375} 11/07/2021 09:17:45 - INFO - __main__ - Step 85914: {'lr': 0.0001977815963682863, 'samples': 16495488, 'steps': 85913, 'loss/train': 1.6147643327713013} 11/07/2021 09:17:45 - INFO - __main__ - Step 85915: {'lr': 0.00019777640668845796, 'samples': 16495680, 'steps': 85914, 'loss/train': 1.3577792644500732} 11/07/2021 09:17:45 - INFO - __main__ - Step 85916: {'lr': 0.00019777121703216076, 'samples': 16495872, 'steps': 85915, 'loss/train': 1.3045085668563843} 11/07/2021 09:17:46 - INFO - __main__ - Step 85917: {'lr': 0.00019776602739939714, 'samples': 16496064, 'steps': 85916, 'loss/train': 1.2246670722961426} 11/07/2021 09:17:46 - INFO - __main__ - Step 85918: {'lr': 0.00019776083779016927, 'samples': 16496256, 'steps': 85917, 'loss/train': 1.4822369813919067} 11/07/2021 09:17:46 - INFO - __main__ - Step 85919: {'lr': 0.00019775564820447952, 'samples': 16496448, 'steps': 85918, 'loss/train': 1.276395559310913} 11/07/2021 09:17:48 - INFO - __main__ - Step 85920: {'lr': 0.0001977504586423303, 'samples': 16496640, 'steps': 85919, 'loss/train': 1.2868967056274414} 11/07/2021 09:17:48 - INFO - __main__ - Step 85921: {'lr': 0.0001977452691037239, 'samples': 16496832, 'steps': 85920, 'loss/train': 1.602396011352539} 11/07/2021 09:17:48 - INFO - __main__ - Step 85922: {'lr': 0.00019774007958866266, 'samples': 16497024, 'steps': 85921, 'loss/train': 0.6017109155654907} 11/07/2021 09:17:49 - INFO - __main__ - Step 85923: {'lr': 0.00019773489009714896, 'samples': 16497216, 'steps': 85922, 'loss/train': 1.5040476322174072} 11/07/2021 09:17:49 - INFO - __main__ - Step 85924: {'lr': 0.0001977297006291851, 'samples': 16497408, 'steps': 85923, 'loss/train': 0.9978329539299011} 11/07/2021 09:17:50 - INFO - __main__ - Step 85925: {'lr': 0.00019772451118477344, 'samples': 16497600, 'steps': 85924, 'loss/train': 1.2646681070327759} 11/07/2021 09:17:50 - INFO - __main__ - Step 85926: {'lr': 0.0001977193217639163, 'samples': 16497792, 'steps': 85925, 'loss/train': 1.0533779859542847} 11/07/2021 09:17:51 - INFO - __main__ - Step 85927: {'lr': 0.00019771413236661602, 'samples': 16497984, 'steps': 85926, 'loss/train': 1.0970994234085083} 11/07/2021 09:17:51 - INFO - __main__ - Step 85928: {'lr': 0.00019770894299287495, 'samples': 16498176, 'steps': 85927, 'loss/train': 1.3874493837356567} 11/07/2021 09:17:51 - INFO - __main__ - Step 85929: {'lr': 0.00019770375364269545, 'samples': 16498368, 'steps': 85928, 'loss/train': 0.25032761693000793} 11/07/2021 09:17:52 - INFO - __main__ - Step 85930: {'lr': 0.0001976985643160799, 'samples': 16498560, 'steps': 85929, 'loss/train': 1.6899867057800293} 11/07/2021 09:17:53 - INFO - __main__ - Step 85931: {'lr': 0.00019769337501303048, 'samples': 16498752, 'steps': 85930, 'loss/train': 1.6083108186721802} 11/07/2021 09:17:53 - INFO - __main__ - Step 85932: {'lr': 0.00019768818573354964, 'samples': 16498944, 'steps': 85931, 'loss/train': 1.2048213481903076} 11/07/2021 09:17:53 - INFO - __main__ - Step 85933: {'lr': 0.00019768299647763966, 'samples': 16499136, 'steps': 85932, 'loss/train': 1.287190318107605} 11/07/2021 09:17:54 - INFO - __main__ - Step 85934: {'lr': 0.00019767780724530294, 'samples': 16499328, 'steps': 85933, 'loss/train': 1.6376177072525024} 11/07/2021 09:17:54 - INFO - __main__ - Step 85935: {'lr': 0.00019767261803654176, 'samples': 16499520, 'steps': 85934, 'loss/train': 1.247419834136963} 11/07/2021 09:17:55 - INFO - __main__ - Step 85936: {'lr': 0.00019766742885135854, 'samples': 16499712, 'steps': 85935, 'loss/train': 1.2491954565048218} 11/07/2021 09:17:56 - INFO - __main__ - Step 85937: {'lr': 0.00019766223968975552, 'samples': 16499904, 'steps': 85936, 'loss/train': 1.0707875490188599} 11/07/2021 09:17:56 - INFO - __main__ - Step 85938: {'lr': 0.00019765705055173512, 'samples': 16500096, 'steps': 85937, 'loss/train': 1.2203618288040161} 11/07/2021 09:17:56 - INFO - __main__ - Step 85939: {'lr': 0.00019765186143729963, 'samples': 16500288, 'steps': 85938, 'loss/train': 1.1948927640914917} 11/07/2021 09:17:57 - INFO - __main__ - Step 85940: {'lr': 0.0001976466723464514, 'samples': 16500480, 'steps': 85939, 'loss/train': 1.575951337814331} 11/07/2021 09:17:58 - INFO - __main__ - Step 85941: {'lr': 0.0001976414832791928, 'samples': 16500672, 'steps': 85940, 'loss/train': 1.1754159927368164} 11/07/2021 09:17:58 - INFO - __main__ - Step 85942: {'lr': 0.00019763629423552608, 'samples': 16500864, 'steps': 85941, 'loss/train': 1.383776068687439} 11/07/2021 09:17:59 - INFO - __main__ - Step 85943: {'lr': 0.00019763110521545368, 'samples': 16501056, 'steps': 85942, 'loss/train': 1.5848067998886108} 11/07/2021 09:17:59 - INFO - __main__ - Step 85944: {'lr': 0.000197625916218978, 'samples': 16501248, 'steps': 85943, 'loss/train': 0.3798387944698334} 11/07/2021 09:17:59 - INFO - __main__ - Step 85945: {'lr': 0.00019762072724610117, 'samples': 16501440, 'steps': 85944, 'loss/train': 1.4182863235473633} 11/07/2021 09:18:00 - INFO - __main__ - Step 85946: {'lr': 0.00019761553829682562, 'samples': 16501632, 'steps': 85945, 'loss/train': 1.3808555603027344} 11/07/2021 09:18:01 - INFO - __main__ - Step 85947: {'lr': 0.00019761034937115373, 'samples': 16501824, 'steps': 85946, 'loss/train': 1.3879421949386597} 11/07/2021 09:18:01 - INFO - __main__ - Step 85948: {'lr': 0.00019760516046908778, 'samples': 16502016, 'steps': 85947, 'loss/train': 1.2241202592849731} 11/07/2021 09:18:02 - INFO - __main__ - Step 85949: {'lr': 0.00019759997159063015, 'samples': 16502208, 'steps': 85948, 'loss/train': 1.1715763807296753} 11/07/2021 09:18:02 - INFO - __main__ - Step 85950: {'lr': 0.00019759478273578314, 'samples': 16502400, 'steps': 85949, 'loss/train': 2.0117881298065186} 11/07/2021 09:18:03 - INFO - __main__ - Step 85951: {'lr': 0.00019758959390454915, 'samples': 16502592, 'steps': 85950, 'loss/train': 1.4923654794692993} 11/07/2021 09:18:03 - INFO - __main__ - Step 85952: {'lr': 0.00019758440509693042, 'samples': 16502784, 'steps': 85951, 'loss/train': 1.4227896928787231} 11/07/2021 09:18:04 - INFO - __main__ - Step 85953: {'lr': 0.0001975792163129294, 'samples': 16502976, 'steps': 85952, 'loss/train': 1.267738699913025} 11/07/2021 09:18:04 - INFO - __main__ - Step 85954: {'lr': 0.00019757402755254838, 'samples': 16503168, 'steps': 85953, 'loss/train': 1.3677825927734375} 11/07/2021 09:18:04 - INFO - __main__ - Step 85955: {'lr': 0.00019756883881578969, 'samples': 16503360, 'steps': 85954, 'loss/train': 1.0389107465744019} 11/07/2021 09:18:05 - INFO - __main__ - Step 85956: {'lr': 0.00019756365010265565, 'samples': 16503552, 'steps': 85955, 'loss/train': 1.0945806503295898} 11/07/2021 09:18:06 - INFO - __main__ - Step 85957: {'lr': 0.00019755846141314873, 'samples': 16503744, 'steps': 85956, 'loss/train': 1.3936254978179932} 11/07/2021 09:18:06 - INFO - __main__ - Step 85958: {'lr': 0.00019755327274727105, 'samples': 16503936, 'steps': 85957, 'loss/train': 1.078700065612793} 11/07/2021 09:18:06 - INFO - __main__ - Step 85959: {'lr': 0.00019754808410502505, 'samples': 16504128, 'steps': 85958, 'loss/train': 1.7761270999908447} 11/07/2021 09:18:07 - INFO - __main__ - Step 85960: {'lr': 0.00019754289548641312, 'samples': 16504320, 'steps': 85959, 'loss/train': 1.0149650573730469} 11/07/2021 09:18:08 - INFO - __main__ - Step 85961: {'lr': 0.00019753770689143752, 'samples': 16504512, 'steps': 85960, 'loss/train': 0.9172937870025635} 11/07/2021 09:18:08 - INFO - __main__ - Step 85962: {'lr': 0.00019753251832010062, 'samples': 16504704, 'steps': 85961, 'loss/train': 1.3175359964370728} 11/07/2021 09:18:08 - INFO - __main__ - Step 85963: {'lr': 0.00019752732977240472, 'samples': 16504896, 'steps': 85962, 'loss/train': 0.5578610301017761} 11/07/2021 09:18:09 - INFO - __main__ - Step 85964: {'lr': 0.00019752214124835226, 'samples': 16505088, 'steps': 85963, 'loss/train': 1.2937695980072021} 11/07/2021 09:18:09 - INFO - __main__ - Step 85965: {'lr': 0.0001975169527479455, 'samples': 16505280, 'steps': 85964, 'loss/train': 1.2138112783432007} 11/07/2021 09:18:10 - INFO - __main__ - Step 85966: {'lr': 0.00019751176427118677, 'samples': 16505472, 'steps': 85965, 'loss/train': 1.0633714199066162} 11/07/2021 09:18:11 - INFO - __main__ - Step 85967: {'lr': 0.00019750657581807843, 'samples': 16505664, 'steps': 85966, 'loss/train': 1.4331245422363281} 11/07/2021 09:18:11 - INFO - __main__ - Step 85968: {'lr': 0.00019750138738862283, 'samples': 16505856, 'steps': 85967, 'loss/train': 1.1628077030181885} 11/07/2021 09:18:11 - INFO - __main__ - Step 85969: {'lr': 0.00019749619898282235, 'samples': 16506048, 'steps': 85968, 'loss/train': 1.0707721710205078} 11/07/2021 09:18:12 - INFO - __main__ - Step 85970: {'lr': 0.0001974910106006792, 'samples': 16506240, 'steps': 85969, 'loss/train': 1.4263898134231567} 11/07/2021 09:18:12 - INFO - __main__ - Step 85971: {'lr': 0.00019748582224219586, 'samples': 16506432, 'steps': 85970, 'loss/train': 1.5310907363891602} 11/07/2021 09:18:13 - INFO - __main__ - Step 85972: {'lr': 0.00019748063390737452, 'samples': 16506624, 'steps': 85971, 'loss/train': 2.245227575302124} 11/07/2021 09:18:14 - INFO - __main__ - Step 85973: {'lr': 0.0001974754455962176, 'samples': 16506816, 'steps': 85972, 'loss/train': 1.801334261894226} 11/07/2021 09:18:14 - INFO - __main__ - Step 85974: {'lr': 0.00019747025730872748, 'samples': 16507008, 'steps': 85973, 'loss/train': 1.5374672412872314} 11/07/2021 09:18:14 - INFO - __main__ - Step 85975: {'lr': 0.0001974650690449064, 'samples': 16507200, 'steps': 85974, 'loss/train': 1.6009535789489746} 11/07/2021 09:18:15 - INFO - __main__ - Step 85976: {'lr': 0.0001974598808047568, 'samples': 16507392, 'steps': 85975, 'loss/train': 1.6169779300689697} 11/07/2021 09:18:16 - INFO - __main__ - Step 85977: {'lr': 0.00019745469258828093, 'samples': 16507584, 'steps': 85976, 'loss/train': 1.1024829149246216} 11/07/2021 09:18:16 - INFO - __main__ - Step 85978: {'lr': 0.00019744950439548115, 'samples': 16507776, 'steps': 85977, 'loss/train': 0.834764301776886} 11/07/2021 09:18:16 - INFO - __main__ - Step 85979: {'lr': 0.00019744431622635984, 'samples': 16507968, 'steps': 85978, 'loss/train': 0.886043906211853} 11/07/2021 09:18:17 - INFO - __main__ - Step 85980: {'lr': 0.00019743912808091934, 'samples': 16508160, 'steps': 85979, 'loss/train': 1.3740613460540771} 11/07/2021 09:18:17 - INFO - __main__ - Step 85981: {'lr': 0.00019743393995916192, 'samples': 16508352, 'steps': 85980, 'loss/train': 1.4139701128005981} 11/07/2021 09:18:18 - INFO - __main__ - Step 85982: {'lr': 0.00019742875186108997, 'samples': 16508544, 'steps': 85981, 'loss/train': 1.2049570083618164} 11/07/2021 09:18:19 - INFO - __main__ - Step 85983: {'lr': 0.0001974235637867058, 'samples': 16508736, 'steps': 85982, 'loss/train': 1.0234178304672241} 11/07/2021 09:18:19 - INFO - __main__ - Step 85984: {'lr': 0.00019741837573601182, 'samples': 16508928, 'steps': 85983, 'loss/train': 1.442197322845459} 11/07/2021 09:18:19 - INFO - __main__ - Step 85985: {'lr': 0.00019741318770901027, 'samples': 16509120, 'steps': 85984, 'loss/train': 2.020249843597412} 11/07/2021 09:18:20 - INFO - __main__ - Step 85986: {'lr': 0.0001974079997057035, 'samples': 16509312, 'steps': 85985, 'loss/train': 0.24337314069271088} 11/07/2021 09:18:21 - INFO - __main__ - Step 85987: {'lr': 0.00019740281172609387, 'samples': 16509504, 'steps': 85986, 'loss/train': 1.2686742544174194} 11/07/2021 09:18:21 - INFO - __main__ - Step 85988: {'lr': 0.00019739762377018373, 'samples': 16509696, 'steps': 85987, 'loss/train': 1.1838113069534302} 11/07/2021 09:18:22 - INFO - __main__ - Step 85989: {'lr': 0.0001973924358379754, 'samples': 16509888, 'steps': 85988, 'loss/train': 1.570827841758728} 11/07/2021 09:18:22 - INFO - __main__ - Step 85990: {'lr': 0.00019738724792947124, 'samples': 16510080, 'steps': 85989, 'loss/train': 0.9812508225440979} 11/07/2021 09:18:22 - INFO - __main__ - Step 85991: {'lr': 0.00019738206004467362, 'samples': 16510272, 'steps': 85990, 'loss/train': 1.4282325506210327} 11/07/2021 09:18:23 - INFO - __main__ - Step 85992: {'lr': 0.0001973768721835848, 'samples': 16510464, 'steps': 85991, 'loss/train': 1.6593222618103027} 11/07/2021 09:18:24 - INFO - __main__ - Step 85993: {'lr': 0.00019737168434620712, 'samples': 16510656, 'steps': 85992, 'loss/train': 1.4751614332199097} 11/07/2021 09:18:24 - INFO - __main__ - Step 85994: {'lr': 0.00019736649653254295, 'samples': 16510848, 'steps': 85993, 'loss/train': 1.435961127281189} 11/07/2021 09:18:24 - INFO - __main__ - Step 85995: {'lr': 0.00019736130874259465, 'samples': 16511040, 'steps': 85994, 'loss/train': 1.5419683456420898} 11/07/2021 09:18:25 - INFO - __main__ - Step 85996: {'lr': 0.00019735612097636452, 'samples': 16511232, 'steps': 85995, 'loss/train': 1.374691128730774} 11/07/2021 09:18:26 - INFO - __main__ - Step 85997: {'lr': 0.00019735093323385489, 'samples': 16511424, 'steps': 85996, 'loss/train': 1.4381422996520996} 11/07/2021 09:18:26 - INFO - __main__ - Step 85998: {'lr': 0.00019734574551506817, 'samples': 16511616, 'steps': 85997, 'loss/train': 1.163485050201416} 11/07/2021 09:18:26 - INFO - __main__ - Step 85999: {'lr': 0.00019734055782000663, 'samples': 16511808, 'steps': 85998, 'loss/train': 1.7775483131408691} 11/07/2021 09:18:27 - INFO - __main__ - Step 86000: {'lr': 0.0001973353701486726, 'samples': 16512000, 'steps': 85999, 'loss/train': 1.1199957132339478} 11/07/2021 09:18:27 - INFO - __main__ - Step 86001: {'lr': 0.0001973301825010685, 'samples': 16512192, 'steps': 86000, 'loss/train': 1.262338638305664} 11/07/2021 09:18:27 - INFO - __main__ - Step 86002: {'lr': 0.00019732499487719652, 'samples': 16512384, 'steps': 86001, 'loss/train': 1.1305991411209106} 11/07/2021 09:18:28 - INFO - __main__ - Step 86003: {'lr': 0.0001973198072770591, 'samples': 16512576, 'steps': 86002, 'loss/train': 1.131640911102295} 11/07/2021 09:18:29 - INFO - __main__ - Step 86004: {'lr': 0.00019731461970065857, 'samples': 16512768, 'steps': 86003, 'loss/train': 1.209583044052124} 11/07/2021 09:18:29 - INFO - __main__ - Step 86005: {'lr': 0.0001973094321479973, 'samples': 16512960, 'steps': 86004, 'loss/train': 1.3211716413497925} 11/07/2021 09:18:30 - INFO - __main__ - Step 86006: {'lr': 0.00019730424461907752, 'samples': 16513152, 'steps': 86005, 'loss/train': 1.5672370195388794} 11/07/2021 09:18:30 - INFO - __main__ - Step 86007: {'lr': 0.00019729905711390167, 'samples': 16513344, 'steps': 86006, 'loss/train': 0.9619380235671997} 11/07/2021 09:18:31 - INFO - __main__ - Step 86008: {'lr': 0.00019729386963247204, 'samples': 16513536, 'steps': 86007, 'loss/train': 1.470035195350647} 11/07/2021 09:18:31 - INFO - __main__ - Step 86009: {'lr': 0.000197288682174791, 'samples': 16513728, 'steps': 86008, 'loss/train': 1.7067217826843262} 11/07/2021 09:18:32 - INFO - __main__ - Step 86010: {'lr': 0.00019728349474086083, 'samples': 16513920, 'steps': 86009, 'loss/train': 1.1483174562454224} 11/07/2021 09:18:32 - INFO - __main__ - Step 86011: {'lr': 0.00019727830733068396, 'samples': 16514112, 'steps': 86010, 'loss/train': 1.4474592208862305} 11/07/2021 09:18:32 - INFO - __main__ - Step 86012: {'lr': 0.00019727311994426273, 'samples': 16514304, 'steps': 86011, 'loss/train': 1.4927043914794922} 11/07/2021 09:18:33 - INFO - __main__ - Step 86013: {'lr': 0.0001972679325815993, 'samples': 16514496, 'steps': 86012, 'loss/train': 1.5840767621994019} 11/07/2021 09:18:34 - INFO - __main__ - Step 86014: {'lr': 0.00019726274524269616, 'samples': 16514688, 'steps': 86013, 'loss/train': 1.3750057220458984} 11/07/2021 09:18:34 - INFO - __main__ - Step 86015: {'lr': 0.00019725755792755558, 'samples': 16514880, 'steps': 86014, 'loss/train': 1.3638393878936768} 11/07/2021 09:18:34 - INFO - __main__ - Step 86016: {'lr': 0.00019725237063617995, 'samples': 16515072, 'steps': 86015, 'loss/train': 0.5357570648193359} 11/07/2021 09:18:35 - INFO - __main__ - Step 86017: {'lr': 0.0001972471833685716, 'samples': 16515264, 'steps': 86016, 'loss/train': 1.4050229787826538} 11/07/2021 09:18:36 - INFO - __main__ - Step 86018: {'lr': 0.00019724199612473285, 'samples': 16515456, 'steps': 86017, 'loss/train': 1.2781411409378052} 11/07/2021 09:18:36 - INFO - __main__ - Step 86019: {'lr': 0.00019723680890466606, 'samples': 16515648, 'steps': 86018, 'loss/train': 1.7908693552017212} 11/07/2021 09:18:36 - INFO - __main__ - Step 86020: {'lr': 0.0001972316217083735, 'samples': 16515840, 'steps': 86019, 'loss/train': 0.8050146698951721} 11/07/2021 09:18:37 - INFO - __main__ - Step 86021: {'lr': 0.0001972264345358576, 'samples': 16516032, 'steps': 86020, 'loss/train': 1.4696632623672485} 11/07/2021 09:18:37 - INFO - __main__ - Step 86022: {'lr': 0.00019722124738712064, 'samples': 16516224, 'steps': 86021, 'loss/train': 1.6934479475021362} 11/07/2021 09:18:38 - INFO - __main__ - Step 86023: {'lr': 0.00019721606026216497, 'samples': 16516416, 'steps': 86022, 'loss/train': 1.6285393238067627} 11/07/2021 09:18:38 - INFO - __main__ - Step 86024: {'lr': 0.00019721087316099294, 'samples': 16516608, 'steps': 86023, 'loss/train': 1.6387416124343872} 11/07/2021 09:18:39 - INFO - __main__ - Step 86025: {'lr': 0.00019720568608360694, 'samples': 16516800, 'steps': 86024, 'loss/train': 1.6549333333969116} 11/07/2021 09:18:39 - INFO - __main__ - Step 86026: {'lr': 0.0001972004990300092, 'samples': 16516992, 'steps': 86025, 'loss/train': 1.4449825286865234} 11/07/2021 09:18:39 - INFO - __main__ - Step 86027: {'lr': 0.00019719531200020204, 'samples': 16517184, 'steps': 86026, 'loss/train': 1.4833922386169434} 11/07/2021 09:18:40 - INFO - __main__ - Step 86028: {'lr': 0.0001971901249941879, 'samples': 16517376, 'steps': 86027, 'loss/train': 1.0832620859146118} 11/07/2021 09:18:41 - INFO - __main__ - Step 86029: {'lr': 0.00019718493801196906, 'samples': 16517568, 'steps': 86028, 'loss/train': 1.4857851266860962} 11/07/2021 09:18:41 - INFO - __main__ - Step 86030: {'lr': 0.00019717975105354785, 'samples': 16517760, 'steps': 86029, 'loss/train': 1.796721339225769} 11/07/2021 09:18:42 - INFO - __main__ - Step 86031: {'lr': 0.00019717456411892667, 'samples': 16517952, 'steps': 86030, 'loss/train': 1.4508389234542847} 11/07/2021 09:18:42 - INFO - __main__ - Step 86032: {'lr': 0.0001971693772081078, 'samples': 16518144, 'steps': 86031, 'loss/train': 1.2110488414764404} 11/07/2021 09:18:42 - INFO - __main__ - Step 86033: {'lr': 0.0001971641903210936, 'samples': 16518336, 'steps': 86032, 'loss/train': 1.3533719778060913} 11/07/2021 09:18:43 - INFO - __main__ - Step 86034: {'lr': 0.00019715900345788638, 'samples': 16518528, 'steps': 86033, 'loss/train': 1.6000179052352905} 11/07/2021 09:18:44 - INFO - __main__ - Step 86035: {'lr': 0.00019715381661848853, 'samples': 16518720, 'steps': 86034, 'loss/train': 0.4498925805091858} 11/07/2021 09:18:44 - INFO - __main__ - Step 86036: {'lr': 0.0001971486298029023, 'samples': 16518912, 'steps': 86035, 'loss/train': 1.2859572172164917} 11/07/2021 09:18:44 - INFO - __main__ - Step 86037: {'lr': 0.00019714344301113013, 'samples': 16519104, 'steps': 86036, 'loss/train': 0.9941686391830444} 11/07/2021 09:18:45 - INFO - __main__ - Step 86038: {'lr': 0.00019713825624317438, 'samples': 16519296, 'steps': 86037, 'loss/train': 1.5933953523635864} 11/07/2021 09:18:46 - INFO - __main__ - Step 86039: {'lr': 0.00019713306949903725, 'samples': 16519488, 'steps': 86038, 'loss/train': 1.3529452085494995} 11/07/2021 09:18:46 - INFO - __main__ - Step 86040: {'lr': 0.00019712788277872112, 'samples': 16519680, 'steps': 86039, 'loss/train': 1.533159852027893} 11/07/2021 09:18:47 - INFO - __main__ - Step 86041: {'lr': 0.00019712269608222836, 'samples': 16519872, 'steps': 86040, 'loss/train': 1.3748418092727661} 11/07/2021 09:18:47 - INFO - __main__ - Step 86042: {'lr': 0.0001971175094095613, 'samples': 16520064, 'steps': 86041, 'loss/train': 1.2169547080993652} 11/07/2021 09:18:47 - INFO - __main__ - Step 86043: {'lr': 0.00019711232276072228, 'samples': 16520256, 'steps': 86042, 'loss/train': 1.3622177839279175} 11/07/2021 09:18:48 - INFO - __main__ - Step 86044: {'lr': 0.0001971071361357136, 'samples': 16520448, 'steps': 86043, 'loss/train': 1.4438871145248413} 11/07/2021 09:18:49 - INFO - __main__ - Step 86045: {'lr': 0.00019710194953453765, 'samples': 16520640, 'steps': 86044, 'loss/train': 1.0372687578201294} 11/07/2021 09:18:49 - INFO - __main__ - Step 86046: {'lr': 0.00019709676295719673, 'samples': 16520832, 'steps': 86045, 'loss/train': 1.2314265966415405} 11/07/2021 09:18:49 - INFO - __main__ - Step 86047: {'lr': 0.0001970915764036932, 'samples': 16521024, 'steps': 86046, 'loss/train': 0.35408517718315125} 11/07/2021 09:18:50 - INFO - __main__ - Step 86048: {'lr': 0.00019708638987402937, 'samples': 16521216, 'steps': 86047, 'loss/train': 1.366864800453186} 11/07/2021 09:18:51 - INFO - __main__ - Step 86049: {'lr': 0.00019708120336820766, 'samples': 16521408, 'steps': 86048, 'loss/train': 1.447107195854187} 11/07/2021 09:18:51 - INFO - __main__ - Step 86050: {'lr': 0.00019707601688623028, 'samples': 16521600, 'steps': 86049, 'loss/train': 0.8836562037467957} 11/07/2021 09:18:51 - INFO - __main__ - Step 86051: {'lr': 0.00019707083042809975, 'samples': 16521792, 'steps': 86050, 'loss/train': 1.5450363159179688} 11/07/2021 09:18:52 - INFO - __main__ - Step 86052: {'lr': 0.00019706564399381822, 'samples': 16521984, 'steps': 86051, 'loss/train': 1.2937612533569336} 11/07/2021 09:18:52 - INFO - __main__ - Step 86053: {'lr': 0.00019706045758338802, 'samples': 16522176, 'steps': 86052, 'loss/train': 1.4816125631332397} 11/07/2021 09:18:53 - INFO - __main__ - Step 86054: {'lr': 0.00019705527119681163, 'samples': 16522368, 'steps': 86053, 'loss/train': 1.1962099075317383} 11/07/2021 09:18:53 - INFO - __main__ - Step 86055: {'lr': 0.0001970500848340913, 'samples': 16522560, 'steps': 86054, 'loss/train': 1.032800555229187} 11/07/2021 09:18:54 - INFO - __main__ - Step 86056: {'lr': 0.0001970448984952294, 'samples': 16522752, 'steps': 86055, 'loss/train': 1.3328684568405151} 11/07/2021 09:18:54 - INFO - __main__ - Step 86057: {'lr': 0.0001970397121802282, 'samples': 16522944, 'steps': 86056, 'loss/train': 2.1150691509246826} 11/07/2021 09:18:55 - INFO - __main__ - Step 86058: {'lr': 0.0001970345258890901, 'samples': 16523136, 'steps': 86057, 'loss/train': 1.4349421262741089} 11/07/2021 09:18:56 - INFO - __main__ - Step 86059: {'lr': 0.00019702933962181747, 'samples': 16523328, 'steps': 86058, 'loss/train': 1.5143133401870728} 11/07/2021 09:18:56 - INFO - __main__ - Step 86060: {'lr': 0.00019702415337841255, 'samples': 16523520, 'steps': 86059, 'loss/train': 0.6799181699752808} 11/07/2021 09:18:56 - INFO - __main__ - Step 86061: {'lr': 0.00019701896715887775, 'samples': 16523712, 'steps': 86060, 'loss/train': 1.7792901992797852} 11/07/2021 09:18:57 - INFO - __main__ - Step 86062: {'lr': 0.0001970137809632154, 'samples': 16523904, 'steps': 86061, 'loss/train': 2.042426109313965} 11/07/2021 09:18:57 - INFO - __main__ - Step 86063: {'lr': 0.0001970085947914278, 'samples': 16524096, 'steps': 86062, 'loss/train': 1.6283990144729614} 11/07/2021 09:18:57 - INFO - __main__ - Step 86064: {'lr': 0.00019700340864351734, 'samples': 16524288, 'steps': 86063, 'loss/train': 1.0710340738296509} 11/07/2021 09:18:58 - INFO - __main__ - Step 86065: {'lr': 0.0001969982225194864, 'samples': 16524480, 'steps': 86064, 'loss/train': 1.449758768081665} 11/07/2021 09:18:59 - INFO - __main__ - Step 86066: {'lr': 0.00019699303641933715, 'samples': 16524672, 'steps': 86065, 'loss/train': 1.2484203577041626} 11/07/2021 09:18:59 - INFO - __main__ - Step 86067: {'lr': 0.00019698785034307203, 'samples': 16524864, 'steps': 86066, 'loss/train': 1.35088312625885} 11/07/2021 09:18:59 - INFO - __main__ - Step 86068: {'lr': 0.00019698266429069334, 'samples': 16525056, 'steps': 86067, 'loss/train': 1.2255197763442993} 11/07/2021 09:19:00 - INFO - __main__ - Step 86069: {'lr': 0.00019697747826220348, 'samples': 16525248, 'steps': 86068, 'loss/train': 1.5834243297576904} 11/07/2021 09:19:01 - INFO - __main__ - Step 86070: {'lr': 0.0001969722922576047, 'samples': 16525440, 'steps': 86069, 'loss/train': 1.6013909578323364} 11/07/2021 09:19:01 - INFO - __main__ - Step 86071: {'lr': 0.00019696710627689946, 'samples': 16525632, 'steps': 86070, 'loss/train': 1.6329607963562012} 11/07/2021 09:19:01 - INFO - __main__ - Step 86072: {'lr': 0.00019696192032008997, 'samples': 16525824, 'steps': 86071, 'loss/train': 1.0562245845794678} 11/07/2021 09:19:02 - INFO - __main__ - Step 86073: {'lr': 0.00019695673438717862, 'samples': 16526016, 'steps': 86072, 'loss/train': 0.6571990251541138} 11/07/2021 09:19:02 - INFO - __main__ - Step 86074: {'lr': 0.00019695154847816776, 'samples': 16526208, 'steps': 86073, 'loss/train': 1.2688289880752563} 11/07/2021 09:19:03 - INFO - __main__ - Step 86075: {'lr': 0.0001969463625930597, 'samples': 16526400, 'steps': 86074, 'loss/train': 1.3477556705474854} 11/07/2021 09:19:03 - INFO - __main__ - Step 86076: {'lr': 0.0001969411767318568, 'samples': 16526592, 'steps': 86075, 'loss/train': 1.2298436164855957} 11/07/2021 09:19:04 - INFO - __main__ - Step 86077: {'lr': 0.00019693599089456141, 'samples': 16526784, 'steps': 86076, 'loss/train': 1.418974757194519} 11/07/2021 09:19:04 - INFO - __main__ - Step 86078: {'lr': 0.0001969308050811759, 'samples': 16526976, 'steps': 86077, 'loss/train': 0.30164971947669983} 11/07/2021 09:19:05 - INFO - __main__ - Step 86079: {'lr': 0.0001969256192917025, 'samples': 16527168, 'steps': 86078, 'loss/train': 1.677644968032837} 11/07/2021 09:19:06 - INFO - __main__ - Step 86080: {'lr': 0.00019692043352614356, 'samples': 16527360, 'steps': 86079, 'loss/train': 1.2159003019332886} 11/07/2021 09:19:06 - INFO - __main__ - Step 86081: {'lr': 0.00019691524778450145, 'samples': 16527552, 'steps': 86080, 'loss/train': 1.2278088331222534} 11/07/2021 09:19:06 - INFO - __main__ - Step 86082: {'lr': 0.00019691006206677854, 'samples': 16527744, 'steps': 86081, 'loss/train': 1.6461504697799683} 11/07/2021 09:19:07 - INFO - __main__ - Step 86083: {'lr': 0.00019690487637297711, 'samples': 16527936, 'steps': 86082, 'loss/train': 1.2489262819290161} 11/07/2021 09:19:07 - INFO - __main__ - Step 86084: {'lr': 0.00019689969070309953, 'samples': 16528128, 'steps': 86083, 'loss/train': 1.1223742961883545} 11/07/2021 09:19:08 - INFO - __main__ - Step 86085: {'lr': 0.00019689450505714815, 'samples': 16528320, 'steps': 86084, 'loss/train': 1.62998628616333} 11/07/2021 09:19:09 - INFO - __main__ - Step 86086: {'lr': 0.00019688931943512527, 'samples': 16528512, 'steps': 86085, 'loss/train': 1.2838072776794434} 11/07/2021 09:19:09 - INFO - __main__ - Step 86087: {'lr': 0.00019688413383703323, 'samples': 16528704, 'steps': 86086, 'loss/train': 0.48384788632392883} 11/07/2021 09:19:09 - INFO - __main__ - Step 86088: {'lr': 0.00019687894826287439, 'samples': 16528896, 'steps': 86087, 'loss/train': 1.4309824705123901} 11/07/2021 09:19:10 - INFO - __main__ - Step 86089: {'lr': 0.0001968737627126511, 'samples': 16529088, 'steps': 86088, 'loss/train': 1.2698462009429932} 11/07/2021 09:19:11 - INFO - __main__ - Step 86090: {'lr': 0.00019686857718636565, 'samples': 16529280, 'steps': 86089, 'loss/train': 1.3676854372024536} 11/07/2021 09:19:11 - INFO - __main__ - Step 86091: {'lr': 0.0001968633916840204, 'samples': 16529472, 'steps': 86090, 'loss/train': 1.8498433828353882} 11/07/2021 09:19:11 - INFO - __main__ - Step 86092: {'lr': 0.00019685820620561777, 'samples': 16529664, 'steps': 86091, 'loss/train': 1.1466548442840576} 11/07/2021 09:19:12 - INFO - __main__ - Step 86093: {'lr': 0.00019685302075115997, 'samples': 16529856, 'steps': 86092, 'loss/train': 1.3466730117797852} 11/07/2021 09:19:12 - INFO - __main__ - Step 86094: {'lr': 0.00019684783532064931, 'samples': 16530048, 'steps': 86093, 'loss/train': 1.4736789464950562} 11/07/2021 09:19:13 - INFO - __main__ - Step 86095: {'lr': 0.00019684264991408823, 'samples': 16530240, 'steps': 86094, 'loss/train': 1.0556319952011108} 11/07/2021 09:19:14 - INFO - __main__ - Step 86096: {'lr': 0.000196837464531479, 'samples': 16530432, 'steps': 86095, 'loss/train': 1.582107663154602} 11/07/2021 09:19:14 - INFO - __main__ - Step 86097: {'lr': 0.00019683227917282405, 'samples': 16530624, 'steps': 86096, 'loss/train': 1.4456887245178223} 11/07/2021 09:19:14 - INFO - __main__ - Step 86098: {'lr': 0.00019682709383812562, 'samples': 16530816, 'steps': 86097, 'loss/train': 1.334670901298523} 11/07/2021 09:19:15 - INFO - __main__ - Step 86099: {'lr': 0.00019682190852738607, 'samples': 16531008, 'steps': 86098, 'loss/train': 1.2751190662384033} 11/07/2021 09:19:15 - INFO - __main__ - Step 86100: {'lr': 0.00019681672324060775, 'samples': 16531200, 'steps': 86099, 'loss/train': 1.3649756908416748} 11/07/2021 09:19:16 - INFO - __main__ - Step 86101: {'lr': 0.000196811537977793, 'samples': 16531392, 'steps': 86100, 'loss/train': 1.3753899335861206} 11/07/2021 09:19:16 - INFO - __main__ - Step 86102: {'lr': 0.00019680635273894415, 'samples': 16531584, 'steps': 86101, 'loss/train': 1.2041254043579102} 11/07/2021 09:19:17 - INFO - __main__ - Step 86103: {'lr': 0.00019680116752406358, 'samples': 16531776, 'steps': 86102, 'loss/train': 0.9780117273330688} 11/07/2021 09:19:17 - INFO - __main__ - Step 86104: {'lr': 0.00019679598233315356, 'samples': 16531968, 'steps': 86103, 'loss/train': 1.4258800745010376} 11/07/2021 09:19:17 - INFO - __main__ - Step 86105: {'lr': 0.0001967907971662165, 'samples': 16532160, 'steps': 86104, 'loss/train': 1.5063167810440063} 11/07/2021 09:19:18 - INFO - __main__ - Step 86106: {'lr': 0.0001967856120232546, 'samples': 16532352, 'steps': 86105, 'loss/train': 0.8807182312011719} 11/07/2021 09:19:19 - INFO - __main__ - Step 86107: {'lr': 0.00019678042690427029, 'samples': 16532544, 'steps': 86106, 'loss/train': 1.934121012687683} 11/07/2021 09:19:19 - INFO - __main__ - Step 86108: {'lr': 0.0001967752418092659, 'samples': 16532736, 'steps': 86107, 'loss/train': 1.187266230583191} 11/07/2021 09:19:19 - INFO - __main__ - Step 86109: {'lr': 0.00019677005673824377, 'samples': 16532928, 'steps': 86108, 'loss/train': 1.685163140296936} 11/07/2021 09:19:20 - INFO - __main__ - Step 86110: {'lr': 0.0001967648716912062, 'samples': 16533120, 'steps': 86109, 'loss/train': 1.641913890838623} 11/07/2021 09:19:21 - INFO - __main__ - Step 86111: {'lr': 0.00019675968666815562, 'samples': 16533312, 'steps': 86110, 'loss/train': 1.2298742532730103} 11/07/2021 09:19:21 - INFO - __main__ - Step 86112: {'lr': 0.00019675450166909425, 'samples': 16533504, 'steps': 86111, 'loss/train': 1.3590730428695679} 11/07/2021 09:19:22 - INFO - __main__ - Step 86113: {'lr': 0.00019674931669402452, 'samples': 16533696, 'steps': 86112, 'loss/train': 1.4462591409683228} 11/07/2021 09:19:22 - INFO - __main__ - Step 86114: {'lr': 0.00019674413174294874, 'samples': 16533888, 'steps': 86113, 'loss/train': 1.1892982721328735} 11/07/2021 09:19:22 - INFO - __main__ - Step 86115: {'lr': 0.00019673894681586924, 'samples': 16534080, 'steps': 86114, 'loss/train': 1.1893984079360962} 11/07/2021 09:19:23 - INFO - __main__ - Step 86116: {'lr': 0.0001967337619127883, 'samples': 16534272, 'steps': 86115, 'loss/train': 0.7599348425865173} 11/07/2021 09:19:24 - INFO - __main__ - Step 86117: {'lr': 0.0001967285770337083, 'samples': 16534464, 'steps': 86116, 'loss/train': 1.359994888305664} 11/07/2021 09:19:24 - INFO - __main__ - Step 86118: {'lr': 0.0001967233921786316, 'samples': 16534656, 'steps': 86117, 'loss/train': 1.1296495199203491} 11/07/2021 09:19:24 - INFO - __main__ - Step 86119: {'lr': 0.00019671820734756059, 'samples': 16534848, 'steps': 86118, 'loss/train': 2.1554744243621826} 11/07/2021 09:19:25 - INFO - __main__ - Step 86120: {'lr': 0.00019671302254049743, 'samples': 16535040, 'steps': 86119, 'loss/train': 1.5416080951690674} 11/07/2021 09:19:26 - INFO - __main__ - Step 86121: {'lr': 0.00019670783775744462, 'samples': 16535232, 'steps': 86120, 'loss/train': 0.6455431580543518} 11/07/2021 09:19:26 - INFO - __main__ - Step 86122: {'lr': 0.00019670265299840436, 'samples': 16535424, 'steps': 86121, 'loss/train': 1.7478078603744507} 11/07/2021 09:19:27 - INFO - __main__ - Step 86123: {'lr': 0.00019669746826337913, 'samples': 16535616, 'steps': 86122, 'loss/train': 1.2864899635314941} 11/07/2021 09:19:27 - INFO - __main__ - Step 86124: {'lr': 0.00019669228355237116, 'samples': 16535808, 'steps': 86123, 'loss/train': 1.4400793313980103} 11/07/2021 09:19:27 - INFO - __main__ - Step 86125: {'lr': 0.0001966870988653829, 'samples': 16536000, 'steps': 86124, 'loss/train': 1.659364104270935} 11/07/2021 09:19:28 - INFO - __main__ - Step 86126: {'lr': 0.00019668191420241655, 'samples': 16536192, 'steps': 86125, 'loss/train': 1.371052861213684} 11/07/2021 09:19:29 - INFO - __main__ - Step 86127: {'lr': 0.00019667672956347448, 'samples': 16536384, 'steps': 86126, 'loss/train': 1.5472280979156494} 11/07/2021 09:19:29 - INFO - __main__ - Step 86128: {'lr': 0.0001966715449485591, 'samples': 16536576, 'steps': 86127, 'loss/train': 1.2722058296203613} 11/07/2021 09:19:29 - INFO - __main__ - Step 86129: {'lr': 0.00019666636035767265, 'samples': 16536768, 'steps': 86128, 'loss/train': 1.1569184064865112} 11/07/2021 09:19:30 - INFO - __main__ - Step 86130: {'lr': 0.00019666117579081755, 'samples': 16536960, 'steps': 86129, 'loss/train': 0.42055073380470276} 11/07/2021 09:19:31 - INFO - __main__ - Step 86131: {'lr': 0.0001966559912479961, 'samples': 16537152, 'steps': 86130, 'loss/train': 2.387908935546875} 11/07/2021 09:19:31 - INFO - __main__ - Step 86132: {'lr': 0.00019665080672921068, 'samples': 16537344, 'steps': 86131, 'loss/train': 1.5222445726394653} 11/07/2021 09:19:31 - INFO - __main__ - Step 86133: {'lr': 0.00019664562223446354, 'samples': 16537536, 'steps': 86132, 'loss/train': 1.6656919717788696} 11/07/2021 09:19:32 - INFO - __main__ - Step 86134: {'lr': 0.00019664043776375706, 'samples': 16537728, 'steps': 86133, 'loss/train': 1.4211781024932861} 11/07/2021 09:19:32 - INFO - __main__ - Step 86135: {'lr': 0.00019663525331709356, 'samples': 16537920, 'steps': 86134, 'loss/train': 1.156577706336975} 11/07/2021 09:19:32 - INFO - __main__ - Step 86136: {'lr': 0.00019663006889447543, 'samples': 16538112, 'steps': 86135, 'loss/train': 1.3983689546585083} 11/07/2021 09:19:33 - INFO - __main__ - Step 86137: {'lr': 0.00019662488449590496, 'samples': 16538304, 'steps': 86136, 'loss/train': 1.3628908395767212} 11/07/2021 09:19:34 - INFO - __main__ - Step 86138: {'lr': 0.00019661970012138446, 'samples': 16538496, 'steps': 86137, 'loss/train': 1.023603081703186} 11/07/2021 09:19:34 - INFO - __main__ - Step 86139: {'lr': 0.00019661451577091633, 'samples': 16538688, 'steps': 86138, 'loss/train': 1.6326115131378174} 11/07/2021 09:19:35 - INFO - __main__ - Step 86140: {'lr': 0.00019660933144450283, 'samples': 16538880, 'steps': 86139, 'loss/train': 1.1791411638259888} 11/07/2021 09:19:35 - INFO - __main__ - Step 86141: {'lr': 0.00019660414714214636, 'samples': 16539072, 'steps': 86140, 'loss/train': 0.5839411020278931} 11/07/2021 09:19:36 - INFO - __main__ - Step 86142: {'lr': 0.00019659896286384926, 'samples': 16539264, 'steps': 86141, 'loss/train': 1.2557929754257202} 11/07/2021 09:19:36 - INFO - __main__ - Step 86143: {'lr': 0.00019659377860961383, 'samples': 16539456, 'steps': 86142, 'loss/train': 1.554004192352295} 11/07/2021 09:19:36 - INFO - __main__ - Step 86144: {'lr': 0.0001965885943794424, 'samples': 16539648, 'steps': 86143, 'loss/train': 1.4033011198043823} 11/07/2021 09:19:37 - INFO - __main__ - Step 86145: {'lr': 0.00019658341017333736, 'samples': 16539840, 'steps': 86144, 'loss/train': 1.3519227504730225} 11/07/2021 09:19:37 - INFO - __main__ - Step 86146: {'lr': 0.00019657822599130105, 'samples': 16540032, 'steps': 86145, 'loss/train': 1.7873318195343018} 11/07/2021 09:19:39 - INFO - __main__ - Step 86147: {'lr': 0.00019657304183333575, 'samples': 16540224, 'steps': 86146, 'loss/train': 1.412078857421875} 11/07/2021 09:19:40 - INFO - __main__ - Step 86148: {'lr': 0.00019656785769944378, 'samples': 16540416, 'steps': 86147, 'loss/train': 1.3434723615646362} 11/07/2021 09:19:40 - INFO - __main__ - Step 86149: {'lr': 0.0001965626735896275, 'samples': 16540608, 'steps': 86148, 'loss/train': 1.3619375228881836} 11/07/2021 09:19:40 - INFO - __main__ - Step 86150: {'lr': 0.00019655748950388925, 'samples': 16540800, 'steps': 86149, 'loss/train': 1.6331725120544434} 11/07/2021 09:19:41 - INFO - __main__ - Step 86151: {'lr': 0.0001965523054422314, 'samples': 16540992, 'steps': 86150, 'loss/train': 5.474781036376953} 11/07/2021 09:19:41 - INFO - __main__ - Step 86152: {'lr': 0.0001965471214046562, 'samples': 16541184, 'steps': 86151, 'loss/train': 5.50349760055542} 11/07/2021 09:19:41 - INFO - __main__ - Step 86153: {'lr': 0.00019654193739116607, 'samples': 16541376, 'steps': 86152, 'loss/train': 5.47348165512085} 11/07/2021 09:19:42 - INFO - __main__ - Step 86154: {'lr': 0.00019653675340176334, 'samples': 16541568, 'steps': 86153, 'loss/train': 5.570001125335693} 11/07/2021 09:19:43 - INFO - __main__ - Step 86155: {'lr': 0.00019653156943645028, 'samples': 16541760, 'steps': 86154, 'loss/train': 1.569379210472107} 11/07/2021 09:19:43 - INFO - __main__ - Step 86156: {'lr': 0.0001965263854952293, 'samples': 16541952, 'steps': 86155, 'loss/train': 1.5507664680480957} 11/07/2021 09:19:43 - INFO - __main__ - Step 86157: {'lr': 0.00019652120157810272, 'samples': 16542144, 'steps': 86156, 'loss/train': 1.5900174379348755} 11/07/2021 09:19:44 - INFO - __main__ - Step 86158: {'lr': 0.00019651601768507282, 'samples': 16542336, 'steps': 86157, 'loss/train': 1.249745488166809} 11/07/2021 09:19:45 - INFO - __main__ - Step 86159: {'lr': 0.00019651083381614214, 'samples': 16542528, 'steps': 86158, 'loss/train': 1.087522029876709} 11/07/2021 09:19:45 - INFO - __main__ - Step 86160: {'lr': 0.0001965056499713127, 'samples': 16542720, 'steps': 86159, 'loss/train': 1.227394700050354} 11/07/2021 09:19:45 - INFO - __main__ - Step 86161: {'lr': 0.000196500466150587, 'samples': 16542912, 'steps': 86160, 'loss/train': 1.8074556589126587} 11/07/2021 09:19:46 - INFO - __main__ - Step 86162: {'lr': 0.00019649528235396736, 'samples': 16543104, 'steps': 86161, 'loss/train': 1.334180474281311} 11/07/2021 09:19:46 - INFO - __main__ - Step 86163: {'lr': 0.00019649009858145613, 'samples': 16543296, 'steps': 86162, 'loss/train': 1.525182843208313} 11/07/2021 09:19:47 - INFO - __main__ - Step 86164: {'lr': 0.00019648491483305563, 'samples': 16543488, 'steps': 86163, 'loss/train': 1.7147632837295532} 11/07/2021 09:19:48 - INFO - __main__ - Step 86165: {'lr': 0.0001964797311087682, 'samples': 16543680, 'steps': 86164, 'loss/train': 1.465596318244934} 11/07/2021 09:19:48 - INFO - __main__ - Step 86166: {'lr': 0.00019647454740859618, 'samples': 16543872, 'steps': 86165, 'loss/train': 1.3725000619888306} 11/07/2021 09:19:48 - INFO - __main__ - Step 86167: {'lr': 0.00019646936373254192, 'samples': 16544064, 'steps': 86166, 'loss/train': 0.6233959197998047} 11/07/2021 09:19:49 - INFO - __main__ - Step 86168: {'lr': 0.00019646418008060774, 'samples': 16544256, 'steps': 86167, 'loss/train': 1.6370197534561157} 11/07/2021 09:19:50 - INFO - __main__ - Step 86169: {'lr': 0.00019645899645279595, 'samples': 16544448, 'steps': 86168, 'loss/train': 1.1179403066635132} 11/07/2021 09:19:50 - INFO - __main__ - Step 86170: {'lr': 0.0001964538128491089, 'samples': 16544640, 'steps': 86169, 'loss/train': 1.2499066591262817} 11/07/2021 09:19:50 - INFO - __main__ - Step 86171: {'lr': 0.00019644862926954896, 'samples': 16544832, 'steps': 86170, 'loss/train': 1.3246829509735107} 11/07/2021 09:19:51 - INFO - __main__ - Step 86172: {'lr': 0.00019644344571411853, 'samples': 16545024, 'steps': 86171, 'loss/train': 1.1997047662734985} 11/07/2021 09:19:51 - INFO - __main__ - Step 86173: {'lr': 0.00019643826218281976, 'samples': 16545216, 'steps': 86172, 'loss/train': 1.0375233888626099} 11/07/2021 09:19:52 - INFO - __main__ - Step 86174: {'lr': 0.0001964330786756551, 'samples': 16545408, 'steps': 86173, 'loss/train': 1.1872628927230835} 11/07/2021 09:19:53 - INFO - __main__ - Step 86175: {'lr': 0.00019642789519262686, 'samples': 16545600, 'steps': 86174, 'loss/train': 1.39320707321167} 11/07/2021 09:19:53 - INFO - __main__ - Step 86176: {'lr': 0.00019642271173373735, 'samples': 16545792, 'steps': 86175, 'loss/train': 1.1646552085876465} 11/07/2021 09:19:53 - INFO - __main__ - Step 86177: {'lr': 0.00019641752829898897, 'samples': 16545984, 'steps': 86176, 'loss/train': 1.8120185136795044} 11/07/2021 09:19:54 - INFO - __main__ - Step 86178: {'lr': 0.00019641234488838402, 'samples': 16546176, 'steps': 86177, 'loss/train': 1.0750820636749268} 11/07/2021 09:19:54 - INFO - __main__ - Step 86179: {'lr': 0.00019640716150192485, 'samples': 16546368, 'steps': 86178, 'loss/train': 1.488797664642334} 11/07/2021 09:19:55 - INFO - __main__ - Step 86180: {'lr': 0.00019640197813961379, 'samples': 16546560, 'steps': 86179, 'loss/train': 1.4563804864883423} 11/07/2021 09:19:56 - INFO - __main__ - Step 86181: {'lr': 0.00019639679480145314, 'samples': 16546752, 'steps': 86180, 'loss/train': 0.5899531841278076} 11/07/2021 09:19:56 - INFO - __main__ - Step 86182: {'lr': 0.00019639161148744528, 'samples': 16546944, 'steps': 86181, 'loss/train': 1.3763984441757202} 11/07/2021 09:19:56 - INFO - __main__ - Step 86183: {'lr': 0.00019638642819759256, 'samples': 16547136, 'steps': 86182, 'loss/train': 1.5690377950668335} 11/07/2021 09:19:57 - INFO - __main__ - Step 86184: {'lr': 0.00019638124493189725, 'samples': 16547328, 'steps': 86183, 'loss/train': 0.9028561115264893} 11/07/2021 09:19:58 - INFO - __main__ - Step 86185: {'lr': 0.00019637606169036173, 'samples': 16547520, 'steps': 86184, 'loss/train': 1.1920627355575562} 11/07/2021 09:19:58 - INFO - __main__ - Step 86186: {'lr': 0.00019637087847298846, 'samples': 16547712, 'steps': 86185, 'loss/train': 1.3359299898147583} 11/07/2021 09:19:58 - INFO - __main__ - Step 86187: {'lr': 0.00019636569527977952, 'samples': 16547904, 'steps': 86186, 'loss/train': 1.355734944343567} 11/07/2021 09:19:59 - INFO - __main__ - Step 86188: {'lr': 0.00019636051211073736, 'samples': 16548096, 'steps': 86187, 'loss/train': 0.9610797166824341} 11/07/2021 09:19:59 - INFO - __main__ - Step 86189: {'lr': 0.00019635532896586437, 'samples': 16548288, 'steps': 86188, 'loss/train': 1.531053066253662} 11/07/2021 09:20:00 - INFO - __main__ - Step 86190: {'lr': 0.00019635014584516277, 'samples': 16548480, 'steps': 86189, 'loss/train': 1.3877214193344116} 11/07/2021 09:20:01 - INFO - __main__ - Step 86191: {'lr': 0.00019634496274863503, 'samples': 16548672, 'steps': 86190, 'loss/train': 1.3250253200531006} 11/07/2021 09:20:01 - INFO - __main__ - Step 86192: {'lr': 0.00019633977967628338, 'samples': 16548864, 'steps': 86191, 'loss/train': 1.3681601285934448} 11/07/2021 09:20:01 - INFO - __main__ - Step 86193: {'lr': 0.00019633459662811025, 'samples': 16549056, 'steps': 86192, 'loss/train': 1.4844177961349487} 11/07/2021 09:20:02 - INFO - __main__ - Step 86194: {'lr': 0.00019632941360411788, 'samples': 16549248, 'steps': 86193, 'loss/train': 1.4921300411224365} 11/07/2021 09:20:03 - INFO - __main__ - Step 86195: {'lr': 0.00019632423060430865, 'samples': 16549440, 'steps': 86194, 'loss/train': 1.2775870561599731} 11/07/2021 09:20:03 - INFO - __main__ - Step 86196: {'lr': 0.0001963190476286849, 'samples': 16549632, 'steps': 86195, 'loss/train': 1.090240240097046} 11/07/2021 09:20:03 - INFO - __main__ - Step 86197: {'lr': 0.00019631386467724895, 'samples': 16549824, 'steps': 86196, 'loss/train': 1.4222497940063477} 11/07/2021 09:20:04 - INFO - __main__ - Step 86198: {'lr': 0.00019630868175000315, 'samples': 16550016, 'steps': 86197, 'loss/train': 1.5005687475204468} 11/07/2021 09:20:04 - INFO - __main__ - Step 86199: {'lr': 0.00019630349884694996, 'samples': 16550208, 'steps': 86198, 'loss/train': 0.9965519905090332} 11/07/2021 09:20:04 - INFO - __main__ - Step 86200: {'lr': 0.00019629831596809145, 'samples': 16550400, 'steps': 86199, 'loss/train': 1.9529613256454468} 11/07/2021 09:20:05 - INFO - __main__ - Step 86201: {'lr': 0.00019629313311343008, 'samples': 16550592, 'steps': 86200, 'loss/train': 1.588322401046753} 11/07/2021 09:20:06 - INFO - __main__ - Step 86202: {'lr': 0.00019628795028296821, 'samples': 16550784, 'steps': 86201, 'loss/train': 1.4343559741973877} 11/07/2021 09:20:06 - INFO - __main__ - Step 86203: {'lr': 0.00019628276747670818, 'samples': 16550976, 'steps': 86202, 'loss/train': 1.3003253936767578} 11/07/2021 09:20:06 - INFO - __main__ - Step 86204: {'lr': 0.00019627758469465228, 'samples': 16551168, 'steps': 86203, 'loss/train': 1.7098244428634644} 11/07/2021 09:20:07 - INFO - __main__ - Step 86205: {'lr': 0.00019627240193680287, 'samples': 16551360, 'steps': 86204, 'loss/train': 1.2992019653320312} 11/07/2021 09:20:08 - INFO - __main__ - Step 86206: {'lr': 0.00019626721920316232, 'samples': 16551552, 'steps': 86205, 'loss/train': 1.6682302951812744} 11/07/2021 09:20:09 - INFO - __main__ - Step 86207: {'lr': 0.0001962620364937329, 'samples': 16551744, 'steps': 86206, 'loss/train': 0.861923098564148} 11/07/2021 09:20:09 - INFO - __main__ - Step 86208: {'lr': 0.00019625685380851698, 'samples': 16551936, 'steps': 86207, 'loss/train': 1.839774250984192} 11/07/2021 09:20:09 - INFO - __main__ - Step 86209: {'lr': 0.00019625167114751692, 'samples': 16552128, 'steps': 86208, 'loss/train': 1.226876139640808} 11/07/2021 09:20:10 - INFO - __main__ - Step 86210: {'lr': 0.00019624648851073497, 'samples': 16552320, 'steps': 86209, 'loss/train': 0.44545191526412964} 11/07/2021 09:20:11 - INFO - __main__ - Step 86211: {'lr': 0.00019624130589817357, 'samples': 16552512, 'steps': 86210, 'loss/train': 1.2839655876159668} 11/07/2021 09:20:11 - INFO - __main__ - Step 86212: {'lr': 0.000196236123309835, 'samples': 16552704, 'steps': 86211, 'loss/train': 1.3107168674468994} 11/07/2021 09:20:12 - INFO - __main__ - Step 86213: {'lr': 0.00019623094074572173, 'samples': 16552896, 'steps': 86212, 'loss/train': 0.10749651491641998} 11/07/2021 09:20:12 - INFO - __main__ - Step 86214: {'lr': 0.00019622575820583583, 'samples': 16553088, 'steps': 86213, 'loss/train': 0.2235732227563858} 11/07/2021 09:20:12 - INFO - __main__ - Step 86215: {'lr': 0.00019622057569017976, 'samples': 16553280, 'steps': 86214, 'loss/train': 1.5357571840286255} 11/07/2021 09:20:14 - INFO - __main__ - Step 86216: {'lr': 0.0001962153931987559, 'samples': 16553472, 'steps': 86215, 'loss/train': 1.0936635732650757} 11/07/2021 09:20:14 - INFO - __main__ - Step 86217: {'lr': 0.00019621021073156655, 'samples': 16553664, 'steps': 86216, 'loss/train': 1.206952691078186} 11/07/2021 09:20:14 - INFO - __main__ - Step 86218: {'lr': 0.00019620502828861404, 'samples': 16553856, 'steps': 86217, 'loss/train': 1.6607692241668701} 11/07/2021 09:20:15 - INFO - __main__ - Step 86219: {'lr': 0.00019619984586990072, 'samples': 16554048, 'steps': 86218, 'loss/train': 1.7530957460403442} 11/07/2021 09:20:15 - INFO - __main__ - Step 86220: {'lr': 0.0001961946634754289, 'samples': 16554240, 'steps': 86219, 'loss/train': 1.5252656936645508} 11/07/2021 09:20:16 - INFO - __main__ - Step 86221: {'lr': 0.00019618948110520097, 'samples': 16554432, 'steps': 86220, 'loss/train': 1.064193844795227} 11/07/2021 09:20:16 - INFO - __main__ - Step 86222: {'lr': 0.00019618429875921923, 'samples': 16554624, 'steps': 86221, 'loss/train': 1.2061150074005127} 11/07/2021 09:20:17 - INFO - __main__ - Step 86223: {'lr': 0.00019617911643748598, 'samples': 16554816, 'steps': 86222, 'loss/train': 1.022925615310669} 11/07/2021 09:20:17 - INFO - __main__ - Step 86224: {'lr': 0.0001961739341400036, 'samples': 16555008, 'steps': 86223, 'loss/train': 1.7838367223739624} 11/07/2021 09:20:17 - INFO - __main__ - Step 86225: {'lr': 0.00019616875186677442, 'samples': 16555200, 'steps': 86224, 'loss/train': 1.5539169311523438} 11/07/2021 09:20:19 - INFO - __main__ - Step 86226: {'lr': 0.00019616356961780088, 'samples': 16555392, 'steps': 86225, 'loss/train': 1.4269747734069824} 11/07/2021 09:20:19 - INFO - __main__ - Step 86227: {'lr': 0.00019615838739308507, 'samples': 16555584, 'steps': 86226, 'loss/train': 1.2308204174041748} 11/07/2021 09:20:19 - INFO - __main__ - Step 86228: {'lr': 0.00019615320519262953, 'samples': 16555776, 'steps': 86227, 'loss/train': 1.2952994108200073} 11/07/2021 09:20:20 - INFO - __main__ - Step 86229: {'lr': 0.00019614802301643646, 'samples': 16555968, 'steps': 86228, 'loss/train': 0.8874468207359314} 11/07/2021 09:20:20 - INFO - __main__ - Step 86230: {'lr': 0.0001961428408645083, 'samples': 16556160, 'steps': 86229, 'loss/train': 0.23311945796012878} 11/07/2021 09:20:20 - INFO - __main__ - Step 86231: {'lr': 0.0001961376587368473, 'samples': 16556352, 'steps': 86230, 'loss/train': 1.5044015645980835} 11/07/2021 09:20:22 - INFO - __main__ - Step 86232: {'lr': 0.00019613247663345586, 'samples': 16556544, 'steps': 86231, 'loss/train': 1.5998705625534058} 11/07/2021 09:20:22 - INFO - __main__ - Step 86233: {'lr': 0.0001961272945543363, 'samples': 16556736, 'steps': 86232, 'loss/train': 0.7043288946151733} 11/07/2021 09:20:22 - INFO - __main__ - Step 86234: {'lr': 0.00019612211249949097, 'samples': 16556928, 'steps': 86233, 'loss/train': 1.4374756813049316} 11/07/2021 09:20:23 - INFO - __main__ - Step 86235: {'lr': 0.00019611693046892216, 'samples': 16557120, 'steps': 86234, 'loss/train': 1.2768843173980713} 11/07/2021 09:20:23 - INFO - __main__ - Step 86236: {'lr': 0.0001961117484626322, 'samples': 16557312, 'steps': 86235, 'loss/train': 0.9086244106292725} 11/07/2021 09:20:24 - INFO - __main__ - Step 86237: {'lr': 0.0001961065664806235, 'samples': 16557504, 'steps': 86236, 'loss/train': 1.3478318452835083} 11/07/2021 09:20:24 - INFO - __main__ - Step 86238: {'lr': 0.0001961013845228984, 'samples': 16557696, 'steps': 86237, 'loss/train': 1.4120029211044312} 11/07/2021 09:20:25 - INFO - __main__ - Step 86239: {'lr': 0.0001960962025894591, 'samples': 16557888, 'steps': 86238, 'loss/train': 1.431290626525879} 11/07/2021 09:20:25 - INFO - __main__ - Step 86240: {'lr': 0.0001960910206803081, 'samples': 16558080, 'steps': 86239, 'loss/train': 1.572579026222229} 11/07/2021 09:20:25 - INFO - __main__ - Step 86241: {'lr': 0.0001960858387954476, 'samples': 16558272, 'steps': 86240, 'loss/train': 1.2317204475402832} 11/07/2021 09:20:27 - INFO - __main__ - Step 86242: {'lr': 0.00019608065693487998, 'samples': 16558464, 'steps': 86241, 'loss/train': 1.4857308864593506} 11/07/2021 09:20:27 - INFO - __main__ - Step 86243: {'lr': 0.0001960754750986076, 'samples': 16558656, 'steps': 86242, 'loss/train': 0.31094422936439514} 11/07/2021 09:20:27 - INFO - __main__ - Step 86244: {'lr': 0.00019607029328663276, 'samples': 16558848, 'steps': 86243, 'loss/train': 1.039039134979248} 11/07/2021 09:20:28 - INFO - __main__ - Step 86245: {'lr': 0.00019606511149895784, 'samples': 16559040, 'steps': 86244, 'loss/train': 0.6857962012290955} 11/07/2021 09:20:28 - INFO - __main__ - Step 86246: {'lr': 0.00019605992973558512, 'samples': 16559232, 'steps': 86245, 'loss/train': 0.1211409866809845} 11/07/2021 09:20:29 - INFO - __main__ - Step 86247: {'lr': 0.00019605474799651697, 'samples': 16559424, 'steps': 86246, 'loss/train': 1.4537672996520996} 11/07/2021 09:20:29 - INFO - __main__ - Step 86248: {'lr': 0.00019604956628175576, 'samples': 16559616, 'steps': 86247, 'loss/train': 1.6601499319076538} 11/07/2021 09:20:30 - INFO - __main__ - Step 86249: {'lr': 0.00019604438459130375, 'samples': 16559808, 'steps': 86248, 'loss/train': 0.7447784543037415} 11/07/2021 09:20:30 - INFO - __main__ - Step 86250: {'lr': 0.0001960392029251633, 'samples': 16560000, 'steps': 86249, 'loss/train': 1.5636003017425537} 11/07/2021 09:20:30 - INFO - __main__ - Step 86251: {'lr': 0.00019603402128333676, 'samples': 16560192, 'steps': 86250, 'loss/train': 0.9396142363548279} 11/07/2021 09:20:31 - INFO - __main__ - Step 86252: {'lr': 0.00019602883966582643, 'samples': 16560384, 'steps': 86251, 'loss/train': 1.581042766571045} 11/07/2021 09:20:32 - INFO - __main__ - Step 86253: {'lr': 0.00019602365807263475, 'samples': 16560576, 'steps': 86252, 'loss/train': 1.446616291999817} 11/07/2021 09:20:32 - INFO - __main__ - Step 86254: {'lr': 0.00019601847650376392, 'samples': 16560768, 'steps': 86253, 'loss/train': 1.29192316532135} 11/07/2021 09:20:32 - INFO - __main__ - Step 86255: {'lr': 0.00019601329495921632, 'samples': 16560960, 'steps': 86254, 'loss/train': 1.6167134046554565} 11/07/2021 09:20:33 - INFO - __main__ - Step 86256: {'lr': 0.00019600811343899432, 'samples': 16561152, 'steps': 86255, 'loss/train': 1.0068541765213013} 11/07/2021 09:20:34 - INFO - __main__ - Step 86257: {'lr': 0.00019600293194310024, 'samples': 16561344, 'steps': 86256, 'loss/train': 1.2438911199569702} 11/07/2021 09:20:34 - INFO - __main__ - Step 86258: {'lr': 0.00019599775047153637, 'samples': 16561536, 'steps': 86257, 'loss/train': 0.9736062288284302} 11/07/2021 09:20:35 - INFO - __main__ - Step 86259: {'lr': 0.00019599256902430516, 'samples': 16561728, 'steps': 86258, 'loss/train': 1.689091682434082} 11/07/2021 09:20:35 - INFO - __main__ - Step 86260: {'lr': 0.00019598738760140877, 'samples': 16561920, 'steps': 86259, 'loss/train': 1.482848048210144} 11/07/2021 09:20:35 - INFO - __main__ - Step 86261: {'lr': 0.00019598220620284967, 'samples': 16562112, 'steps': 86260, 'loss/train': 1.0719220638275146} 11/07/2021 09:20:36 - INFO - __main__ - Step 86262: {'lr': 0.00019597702482863013, 'samples': 16562304, 'steps': 86261, 'loss/train': 0.820709228515625} 11/07/2021 09:20:37 - INFO - __main__ - Step 86263: {'lr': 0.00019597184347875255, 'samples': 16562496, 'steps': 86262, 'loss/train': 1.3805866241455078} 11/07/2021 09:20:37 - INFO - __main__ - Step 86264: {'lr': 0.00019596666215321916, 'samples': 16562688, 'steps': 86263, 'loss/train': 1.286633849143982} 11/07/2021 09:20:37 - INFO - __main__ - Step 86265: {'lr': 0.0001959614808520324, 'samples': 16562880, 'steps': 86264, 'loss/train': 1.6129066944122314} 11/07/2021 09:20:38 - INFO - __main__ - Step 86266: {'lr': 0.00019595629957519457, 'samples': 16563072, 'steps': 86265, 'loss/train': 0.6433311700820923} 11/07/2021 09:20:38 - INFO - __main__ - Step 86267: {'lr': 0.00019595111832270803, 'samples': 16563264, 'steps': 86266, 'loss/train': 0.7056652903556824} 11/07/2021 09:20:39 - INFO - __main__ - Step 86268: {'lr': 0.00019594593709457503, 'samples': 16563456, 'steps': 86267, 'loss/train': 1.5093082189559937} 11/07/2021 09:20:39 - INFO - __main__ - Step 86269: {'lr': 0.00019594075589079798, 'samples': 16563648, 'steps': 86268, 'loss/train': 1.2400472164154053} 11/07/2021 09:20:40 - INFO - __main__ - Step 86270: {'lr': 0.00019593557471137924, 'samples': 16563840, 'steps': 86269, 'loss/train': 1.5003198385238647} 11/07/2021 09:20:40 - INFO - __main__ - Step 86271: {'lr': 0.00019593039355632103, 'samples': 16564032, 'steps': 86270, 'loss/train': 1.3094455003738403} 11/07/2021 09:20:41 - INFO - __main__ - Step 86272: {'lr': 0.00019592521242562576, 'samples': 16564224, 'steps': 86271, 'loss/train': 1.646623134613037} 11/07/2021 09:20:42 - INFO - __main__ - Step 86273: {'lr': 0.00019592003131929572, 'samples': 16564416, 'steps': 86272, 'loss/train': 1.0563271045684814} 11/07/2021 09:20:42 - INFO - __main__ - Step 86274: {'lr': 0.0001959148502373333, 'samples': 16564608, 'steps': 86273, 'loss/train': 1.148659348487854} 11/07/2021 09:20:42 - INFO - __main__ - Step 86275: {'lr': 0.0001959096691797408, 'samples': 16564800, 'steps': 86274, 'loss/train': 1.4346957206726074} 11/07/2021 09:20:43 - INFO - __main__ - Step 86276: {'lr': 0.00019590448814652063, 'samples': 16564992, 'steps': 86275, 'loss/train': 1.3628934621810913} 11/07/2021 09:20:43 - INFO - __main__ - Step 86277: {'lr': 0.000195899307137675, 'samples': 16565184, 'steps': 86276, 'loss/train': 1.1615349054336548} 11/07/2021 09:20:44 - INFO - __main__ - Step 86278: {'lr': 0.00019589412615320635, 'samples': 16565376, 'steps': 86277, 'loss/train': 1.2651734352111816} 11/07/2021 09:20:44 - INFO - __main__ - Step 86279: {'lr': 0.00019588894519311694, 'samples': 16565568, 'steps': 86278, 'loss/train': 1.3637642860412598} 11/07/2021 09:20:45 - INFO - __main__ - Step 86280: {'lr': 0.0001958837642574092, 'samples': 16565760, 'steps': 86279, 'loss/train': 1.4311429262161255} 11/07/2021 09:20:45 - INFO - __main__ - Step 86281: {'lr': 0.00019587858334608538, 'samples': 16565952, 'steps': 86280, 'loss/train': 0.6810244917869568} 11/07/2021 09:20:45 - INFO - __main__ - Step 86282: {'lr': 0.00019587340245914782, 'samples': 16566144, 'steps': 86281, 'loss/train': 1.6657336950302124} 11/07/2021 09:20:46 - INFO - __main__ - Step 86283: {'lr': 0.00019586822159659885, 'samples': 16566336, 'steps': 86282, 'loss/train': 1.4313410520553589} 11/07/2021 09:20:47 - INFO - __main__ - Step 86284: {'lr': 0.0001958630407584408, 'samples': 16566528, 'steps': 86283, 'loss/train': 1.169905185699463} 11/07/2021 09:20:48 - INFO - __main__ - Step 86285: {'lr': 0.00019585785994467606, 'samples': 16566720, 'steps': 86284, 'loss/train': 1.028458833694458} 11/07/2021 09:20:48 - INFO - __main__ - Step 86286: {'lr': 0.00019585267915530694, 'samples': 16566912, 'steps': 86285, 'loss/train': 1.653652548789978} 11/07/2021 09:20:48 - INFO - __main__ - Step 86287: {'lr': 0.00019584749839033575, 'samples': 16567104, 'steps': 86286, 'loss/train': 2.110153913497925} 11/07/2021 09:20:49 - INFO - __main__ - Step 86288: {'lr': 0.00019584231764976484, 'samples': 16567296, 'steps': 86287, 'loss/train': 1.303601861000061} 11/07/2021 09:20:49 - INFO - __main__ - Step 86289: {'lr': 0.00019583713693359657, 'samples': 16567488, 'steps': 86288, 'loss/train': 1.3638355731964111} 11/07/2021 09:20:50 - INFO - __main__ - Step 86290: {'lr': 0.0001958319562418332, 'samples': 16567680, 'steps': 86289, 'loss/train': 1.1493206024169922} 11/07/2021 09:20:50 - INFO - __main__ - Step 86291: {'lr': 0.00019582677557447714, 'samples': 16567872, 'steps': 86290, 'loss/train': 1.4314072132110596} 11/07/2021 09:20:51 - INFO - __main__ - Step 86292: {'lr': 0.00019582159493153074, 'samples': 16568064, 'steps': 86291, 'loss/train': 1.7839572429656982} 11/07/2021 09:20:51 - INFO - __main__ - Step 86293: {'lr': 0.00019581641431299634, 'samples': 16568256, 'steps': 86292, 'loss/train': 1.3257851600646973} 11/07/2021 09:20:52 - INFO - __main__ - Step 86294: {'lr': 0.00019581123371887615, 'samples': 16568448, 'steps': 86293, 'loss/train': 1.361061930656433} 11/07/2021 09:20:52 - INFO - __main__ - Step 86295: {'lr': 0.00019580605314917257, 'samples': 16568640, 'steps': 86294, 'loss/train': 1.0964232683181763} 11/07/2021 09:20:53 - INFO - __main__ - Step 86296: {'lr': 0.00019580087260388795, 'samples': 16568832, 'steps': 86295, 'loss/train': 0.22140540182590485} 11/07/2021 09:20:53 - INFO - __main__ - Step 86297: {'lr': 0.00019579569208302464, 'samples': 16569024, 'steps': 86296, 'loss/train': 1.070405125617981} 11/07/2021 09:20:53 - INFO - __main__ - Step 86298: {'lr': 0.00019579051158658496, 'samples': 16569216, 'steps': 86297, 'loss/train': 1.4535208940505981} 11/07/2021 09:20:54 - INFO - __main__ - Step 86299: {'lr': 0.0001957853311145712, 'samples': 16569408, 'steps': 86298, 'loss/train': 1.8025370836257935} 11/07/2021 09:20:55 - INFO - __main__ - Step 86300: {'lr': 0.00019578015066698572, 'samples': 16569600, 'steps': 86299, 'loss/train': 1.4263499975204468} 11/07/2021 09:20:55 - INFO - __main__ - Step 86301: {'lr': 0.00019577497024383093, 'samples': 16569792, 'steps': 86300, 'loss/train': 1.5426292419433594} 11/07/2021 09:20:56 - INFO - __main__ - Step 86302: {'lr': 0.00019576978984510906, 'samples': 16569984, 'steps': 86301, 'loss/train': 1.7768992185592651} 11/07/2021 09:20:56 - INFO - __main__ - Step 86303: {'lr': 0.00019576460947082252, 'samples': 16570176, 'steps': 86302, 'loss/train': 2.0010745525360107} 11/07/2021 09:20:56 - INFO - __main__ - Step 86304: {'lr': 0.00019575942912097359, 'samples': 16570368, 'steps': 86303, 'loss/train': 1.2470101118087769} 11/07/2021 09:20:57 - INFO - __main__ - Step 86305: {'lr': 0.0001957542487955646, 'samples': 16570560, 'steps': 86304, 'loss/train': 1.2810090780258179} 11/07/2021 09:20:58 - INFO - __main__ - Step 86306: {'lr': 0.00019574906849459793, 'samples': 16570752, 'steps': 86305, 'loss/train': 1.6238023042678833} 11/07/2021 09:20:58 - INFO - __main__ - Step 86307: {'lr': 0.000195743888218076, 'samples': 16570944, 'steps': 86306, 'loss/train': 1.013905644416809} 11/07/2021 09:20:59 - INFO - __main__ - Step 86308: {'lr': 0.00019573870796600094, 'samples': 16571136, 'steps': 86307, 'loss/train': 1.0729137659072876} 11/07/2021 09:20:59 - INFO - __main__ - Step 86309: {'lr': 0.00019573352773837515, 'samples': 16571328, 'steps': 86308, 'loss/train': 1.3778431415557861} 11/07/2021 09:21:01 - INFO - __main__ - Step 86310: {'lr': 0.00019572834753520102, 'samples': 16571520, 'steps': 86309, 'loss/train': 1.1253138780593872} 11/07/2021 09:21:01 - INFO - __main__ - Step 86311: {'lr': 0.00019572316735648086, 'samples': 16571712, 'steps': 86310, 'loss/train': 1.1567554473876953} 11/07/2021 09:21:01 - INFO - __main__ - Step 86312: {'lr': 0.000195717987202217, 'samples': 16571904, 'steps': 86311, 'loss/train': 0.27447226643562317} 11/07/2021 09:21:02 - INFO - __main__ - Step 86313: {'lr': 0.00019571280707241176, 'samples': 16572096, 'steps': 86312, 'loss/train': 0.27099478244781494} 11/07/2021 09:21:02 - INFO - __main__ - Step 86314: {'lr': 0.0001957076269670675, 'samples': 16572288, 'steps': 86313, 'loss/train': 1.3473320007324219} 11/07/2021 09:21:02 - INFO - __main__ - Step 86315: {'lr': 0.00019570244688618655, 'samples': 16572480, 'steps': 86314, 'loss/train': 0.7532285451889038} 11/07/2021 09:21:03 - INFO - __main__ - Step 86316: {'lr': 0.00019569726682977124, 'samples': 16572672, 'steps': 86315, 'loss/train': 1.4209972620010376} 11/07/2021 09:21:04 - INFO - __main__ - Step 86317: {'lr': 0.00019569208679782392, 'samples': 16572864, 'steps': 86316, 'loss/train': 1.1440802812576294} 11/07/2021 09:21:04 - INFO - __main__ - Step 86318: {'lr': 0.0001956869067903469, 'samples': 16573056, 'steps': 86317, 'loss/train': 1.2958807945251465} 11/07/2021 09:21:04 - INFO - __main__ - Step 86319: {'lr': 0.0001956817268073425, 'samples': 16573248, 'steps': 86318, 'loss/train': 1.2789864540100098} 11/07/2021 09:21:05 - INFO - __main__ - Step 86320: {'lr': 0.0001956765468488132, 'samples': 16573440, 'steps': 86319, 'loss/train': 1.217026948928833} 11/07/2021 09:21:06 - INFO - __main__ - Step 86321: {'lr': 0.0001956713669147611, 'samples': 16573632, 'steps': 86320, 'loss/train': 1.4830456972122192} 11/07/2021 09:21:06 - INFO - __main__ - Step 86322: {'lr': 0.00019566618700518862, 'samples': 16573824, 'steps': 86321, 'loss/train': 1.7934194803237915} 11/07/2021 09:21:07 - INFO - __main__ - Step 86323: {'lr': 0.00019566100712009815, 'samples': 16574016, 'steps': 86322, 'loss/train': 1.4653961658477783} 11/07/2021 09:21:07 - INFO - __main__ - Step 86324: {'lr': 0.00019565582725949198, 'samples': 16574208, 'steps': 86323, 'loss/train': 1.6867460012435913} 11/07/2021 09:21:07 - INFO - __main__ - Step 86325: {'lr': 0.00019565064742337247, 'samples': 16574400, 'steps': 86324, 'loss/train': 1.4530930519104004} 11/07/2021 09:21:08 - INFO - __main__ - Step 86326: {'lr': 0.00019564546761174193, 'samples': 16574592, 'steps': 86325, 'loss/train': 0.11968592554330826} 11/07/2021 09:21:09 - INFO - __main__ - Step 86327: {'lr': 0.00019564028782460268, 'samples': 16574784, 'steps': 86326, 'loss/train': 1.30978262424469} 11/07/2021 09:21:09 - INFO - __main__ - Step 86328: {'lr': 0.0001956351080619571, 'samples': 16574976, 'steps': 86327, 'loss/train': 1.3226085901260376} 11/07/2021 09:21:10 - INFO - __main__ - Step 86329: {'lr': 0.0001956299283238075, 'samples': 16575168, 'steps': 86328, 'loss/train': 1.326930284500122} 11/07/2021 09:21:10 - INFO - __main__ - Step 86330: {'lr': 0.00019562474861015621, 'samples': 16575360, 'steps': 86329, 'loss/train': 1.888556718826294} 11/07/2021 09:21:11 - INFO - __main__ - Step 86331: {'lr': 0.00019561956892100561, 'samples': 16575552, 'steps': 86330, 'loss/train': 1.6401554346084595} 11/07/2021 09:21:11 - INFO - __main__ - Step 86332: {'lr': 0.00019561438925635793, 'samples': 16575744, 'steps': 86331, 'loss/train': 1.2886271476745605} 11/07/2021 09:21:12 - INFO - __main__ - Step 86333: {'lr': 0.0001956092096162156, 'samples': 16575936, 'steps': 86332, 'loss/train': 1.3502858877182007} 11/07/2021 09:21:12 - INFO - __main__ - Step 86334: {'lr': 0.00019560403000058103, 'samples': 16576128, 'steps': 86333, 'loss/train': 1.4785078763961792} 11/07/2021 09:21:12 - INFO - __main__ - Step 86335: {'lr': 0.00019559885040945632, 'samples': 16576320, 'steps': 86334, 'loss/train': 1.1691617965698242} 11/07/2021 09:21:13 - INFO - __main__ - Step 86336: {'lr': 0.00019559367084284396, 'samples': 16576512, 'steps': 86335, 'loss/train': 1.5461081266403198} 11/07/2021 09:21:14 - INFO - __main__ - Step 86337: {'lr': 0.00019558849130074622, 'samples': 16576704, 'steps': 86336, 'loss/train': 1.4709922075271606} 11/07/2021 09:21:14 - INFO - __main__ - Step 86338: {'lr': 0.00019558331178316546, 'samples': 16576896, 'steps': 86337, 'loss/train': 1.8088470697402954} 11/07/2021 09:21:15 - INFO - __main__ - Step 86339: {'lr': 0.00019557813229010405, 'samples': 16577088, 'steps': 86338, 'loss/train': 1.305564045906067} 11/07/2021 09:21:15 - INFO - __main__ - Step 86340: {'lr': 0.00019557295282156427, 'samples': 16577280, 'steps': 86339, 'loss/train': 1.9903379678726196} 11/07/2021 09:21:15 - INFO - __main__ - Step 86341: {'lr': 0.0001955677733775485, 'samples': 16577472, 'steps': 86340, 'loss/train': 1.2538270950317383} 11/07/2021 09:21:16 - INFO - __main__ - Step 86342: {'lr': 0.00019556259395805904, 'samples': 16577664, 'steps': 86341, 'loss/train': 1.3755347728729248} 11/07/2021 09:21:17 - INFO - __main__ - Step 86343: {'lr': 0.00019555741456309822, 'samples': 16577856, 'steps': 86342, 'loss/train': 1.212101936340332} 11/07/2021 09:21:17 - INFO - __main__ - Step 86344: {'lr': 0.00019555223519266841, 'samples': 16578048, 'steps': 86343, 'loss/train': 1.3048535585403442} 11/07/2021 09:21:17 - INFO - __main__ - Step 86345: {'lr': 0.00019554705584677194, 'samples': 16578240, 'steps': 86344, 'loss/train': 1.2046812772750854} 11/07/2021 09:21:18 - INFO - __main__ - Step 86346: {'lr': 0.0001955418765254111, 'samples': 16578432, 'steps': 86345, 'loss/train': 1.383198618888855} 11/07/2021 09:21:19 - INFO - __main__ - Step 86347: {'lr': 0.00019553669722858835, 'samples': 16578624, 'steps': 86346, 'loss/train': 1.2875709533691406} 11/07/2021 09:21:19 - INFO - __main__ - Step 86348: {'lr': 0.00019553151795630584, 'samples': 16578816, 'steps': 86347, 'loss/train': 1.2012357711791992} 11/07/2021 09:21:19 - INFO - __main__ - Step 86349: {'lr': 0.00019552633870856595, 'samples': 16579008, 'steps': 86348, 'loss/train': 0.9975061416625977} 11/07/2021 09:21:20 - INFO - __main__ - Step 86350: {'lr': 0.00019552115948537108, 'samples': 16579200, 'steps': 86349, 'loss/train': 1.6027021408081055} 11/07/2021 09:21:20 - INFO - __main__ - Step 86351: {'lr': 0.00019551598028672354, 'samples': 16579392, 'steps': 86350, 'loss/train': 1.181372046470642} 11/07/2021 09:21:20 - INFO - __main__ - Step 86352: {'lr': 0.00019551080111262565, 'samples': 16579584, 'steps': 86351, 'loss/train': 1.4342947006225586} 11/07/2021 09:21:22 - INFO - __main__ - Step 86353: {'lr': 0.00019550562196307976, 'samples': 16579776, 'steps': 86352, 'loss/train': 1.5604162216186523} 11/07/2021 09:21:22 - INFO - __main__ - Step 86354: {'lr': 0.00019550044283808815, 'samples': 16579968, 'steps': 86353, 'loss/train': 2.054992198944092} 11/07/2021 09:21:22 - INFO - __main__ - Step 86355: {'lr': 0.00019549526373765326, 'samples': 16580160, 'steps': 86354, 'loss/train': 1.4504820108413696} 11/07/2021 09:21:23 - INFO - __main__ - Step 86356: {'lr': 0.00019549008466177733, 'samples': 16580352, 'steps': 86355, 'loss/train': 0.42185693979263306} 11/07/2021 09:21:23 - INFO - __main__ - Step 86357: {'lr': 0.00019548490561046273, 'samples': 16580544, 'steps': 86356, 'loss/train': 1.2577874660491943} 11/07/2021 09:21:24 - INFO - __main__ - Step 86358: {'lr': 0.00019547972658371182, 'samples': 16580736, 'steps': 86357, 'loss/train': 1.0371928215026855} 11/07/2021 09:21:24 - INFO - __main__ - Step 86359: {'lr': 0.0001954745475815269, 'samples': 16580928, 'steps': 86358, 'loss/train': 1.504831075668335} 11/07/2021 09:21:25 - INFO - __main__ - Step 86360: {'lr': 0.00019546936860391026, 'samples': 16581120, 'steps': 86359, 'loss/train': 1.5893187522888184} 11/07/2021 09:21:25 - INFO - __main__ - Step 86361: {'lr': 0.00019546418965086444, 'samples': 16581312, 'steps': 86360, 'loss/train': 1.6675314903259277} 11/07/2021 09:21:25 - INFO - __main__ - Step 86362: {'lr': 0.00019545901072239147, 'samples': 16581504, 'steps': 86361, 'loss/train': 1.5561342239379883} 11/07/2021 09:21:26 - INFO - __main__ - Step 86363: {'lr': 0.00019545383181849383, 'samples': 16581696, 'steps': 86362, 'loss/train': 1.173899531364441} 11/07/2021 09:21:27 - INFO - __main__ - Step 86364: {'lr': 0.00019544865293917384, 'samples': 16581888, 'steps': 86363, 'loss/train': 1.5178749561309814} 11/07/2021 09:21:27 - INFO - __main__ - Step 86365: {'lr': 0.00019544347408443388, 'samples': 16582080, 'steps': 86364, 'loss/train': 1.46511709690094} 11/07/2021 09:21:27 - INFO - __main__ - Step 86366: {'lr': 0.00019543829525427625, 'samples': 16582272, 'steps': 86365, 'loss/train': 1.3099085092544556} 11/07/2021 09:21:28 - INFO - __main__ - Step 86367: {'lr': 0.00019543311644870326, 'samples': 16582464, 'steps': 86366, 'loss/train': 2.072298049926758} 11/07/2021 09:21:29 - INFO - __main__ - Step 86368: {'lr': 0.00019542793766771726, 'samples': 16582656, 'steps': 86367, 'loss/train': 1.1082626581192017} 11/07/2021 09:21:30 - INFO - __main__ - Step 86369: {'lr': 0.00019542275891132064, 'samples': 16582848, 'steps': 86368, 'loss/train': 0.9836699366569519} 11/07/2021 09:21:30 - INFO - __main__ - Step 86370: {'lr': 0.00019541758017951563, 'samples': 16583040, 'steps': 86369, 'loss/train': 1.598557710647583} 11/07/2021 09:21:30 - INFO - __main__ - Step 86371: {'lr': 0.00019541240147230462, 'samples': 16583232, 'steps': 86370, 'loss/train': 1.7673776149749756} 11/07/2021 09:21:31 - INFO - __main__ - Step 86372: {'lr': 0.00019540722278969002, 'samples': 16583424, 'steps': 86371, 'loss/train': 0.07697760313749313} 11/07/2021 09:21:32 - INFO - __main__ - Step 86373: {'lr': 0.000195402044131674, 'samples': 16583616, 'steps': 86372, 'loss/train': 1.6187223196029663} 11/07/2021 09:21:32 - INFO - __main__ - Step 86374: {'lr': 0.00019539686549825908, 'samples': 16583808, 'steps': 86373, 'loss/train': 1.0934070348739624} 11/07/2021 09:21:32 - INFO - __main__ - Step 86375: {'lr': 0.0001953916868894474, 'samples': 16584000, 'steps': 86374, 'loss/train': 1.641589641571045} 11/07/2021 09:21:33 - INFO - __main__ - Step 86376: {'lr': 0.00019538650830524138, 'samples': 16584192, 'steps': 86375, 'loss/train': 1.4763695001602173} 11/07/2021 09:21:33 - INFO - __main__ - Step 86377: {'lr': 0.00019538132974564334, 'samples': 16584384, 'steps': 86376, 'loss/train': 2.6515047550201416} 11/07/2021 09:21:34 - INFO - __main__ - Step 86378: {'lr': 0.00019537615121065566, 'samples': 16584576, 'steps': 86377, 'loss/train': 0.32650226354599} 11/07/2021 09:21:34 - INFO - __main__ - Step 86379: {'lr': 0.00019537097270028064, 'samples': 16584768, 'steps': 86378, 'loss/train': 1.4637864828109741} 11/07/2021 09:21:35 - INFO - __main__ - Step 86380: {'lr': 0.00019536579421452062, 'samples': 16584960, 'steps': 86379, 'loss/train': 0.6300514340400696} 11/07/2021 09:21:35 - INFO - __main__ - Step 86381: {'lr': 0.00019536061575337792, 'samples': 16585152, 'steps': 86380, 'loss/train': 1.3333895206451416} 11/07/2021 09:21:35 - INFO - __main__ - Step 86382: {'lr': 0.00019535543731685488, 'samples': 16585344, 'steps': 86381, 'loss/train': 1.5753896236419678} 11/07/2021 09:21:36 - INFO - __main__ - Step 86383: {'lr': 0.0001953502589049539, 'samples': 16585536, 'steps': 86382, 'loss/train': 1.2663952112197876} 11/07/2021 09:21:37 - INFO - __main__ - Step 86384: {'lr': 0.0001953450805176772, 'samples': 16585728, 'steps': 86383, 'loss/train': 1.774291753768921} 11/07/2021 09:21:37 - INFO - __main__ - Step 86385: {'lr': 0.00019533990215502714, 'samples': 16585920, 'steps': 86384, 'loss/train': 1.3467124700546265} 11/07/2021 09:21:38 - INFO - __main__ - Step 86386: {'lr': 0.00019533472381700608, 'samples': 16586112, 'steps': 86385, 'loss/train': 1.4049628973007202} 11/07/2021 09:21:38 - INFO - __main__ - Step 86387: {'lr': 0.00019532954550361637, 'samples': 16586304, 'steps': 86386, 'loss/train': 1.4964516162872314} 11/07/2021 09:21:39 - INFO - __main__ - Step 86388: {'lr': 0.00019532436721486038, 'samples': 16586496, 'steps': 86387, 'loss/train': 1.47494637966156} 11/07/2021 09:21:39 - INFO - __main__ - Step 86389: {'lr': 0.00019531918895074034, 'samples': 16586688, 'steps': 86388, 'loss/train': 0.6813748478889465} 11/07/2021 09:21:40 - INFO - __main__ - Step 86390: {'lr': 0.0001953140107112586, 'samples': 16586880, 'steps': 86389, 'loss/train': 1.2759982347488403} 11/07/2021 09:21:40 - INFO - __main__ - Step 86391: {'lr': 0.0001953088324964175, 'samples': 16587072, 'steps': 86390, 'loss/train': 2.2347140312194824} 11/07/2021 09:21:40 - INFO - __main__ - Step 86392: {'lr': 0.00019530365430621947, 'samples': 16587264, 'steps': 86391, 'loss/train': 1.0238324403762817} 11/07/2021 09:21:41 - INFO - __main__ - Step 86393: {'lr': 0.00019529847614066672, 'samples': 16587456, 'steps': 86392, 'loss/train': 1.5742168426513672} 11/07/2021 09:21:42 - INFO - __main__ - Step 86394: {'lr': 0.0001952932979997617, 'samples': 16587648, 'steps': 86393, 'loss/train': 1.0946228504180908} 11/07/2021 09:21:42 - INFO - __main__ - Step 86395: {'lr': 0.0001952881198835066, 'samples': 16587840, 'steps': 86394, 'loss/train': 1.303506851196289} 11/07/2021 09:21:42 - INFO - __main__ - Step 86396: {'lr': 0.00019528294179190387, 'samples': 16588032, 'steps': 86395, 'loss/train': 1.2665913105010986} 11/07/2021 09:21:43 - INFO - __main__ - Step 86397: {'lr': 0.00019527776372495575, 'samples': 16588224, 'steps': 86396, 'loss/train': 0.7751516699790955} 11/07/2021 09:21:44 - INFO - __main__ - Step 86398: {'lr': 0.0001952725856826647, 'samples': 16588416, 'steps': 86397, 'loss/train': 1.1786677837371826} 11/07/2021 09:21:44 - INFO - __main__ - Step 86399: {'lr': 0.0001952674076650329, 'samples': 16588608, 'steps': 86398, 'loss/train': 1.1848983764648438} 11/07/2021 09:21:44 - INFO - __main__ - Step 86400: {'lr': 0.0001952622296720628, 'samples': 16588800, 'steps': 86399, 'loss/train': 1.3123024702072144} 11/07/2021 09:21:45 - INFO - __main__ - Step 86401: {'lr': 0.00019525705170375673, 'samples': 16588992, 'steps': 86400, 'loss/train': 2.5841786861419678} 11/07/2021 09:21:45 - INFO - __main__ - Step 86402: {'lr': 0.00019525187376011696, 'samples': 16589184, 'steps': 86401, 'loss/train': 1.5178776979446411} 11/07/2021 09:21:46 - INFO - __main__ - Step 86403: {'lr': 0.00019524669584114585, 'samples': 16589376, 'steps': 86402, 'loss/train': 1.1375209093093872} 11/07/2021 09:21:47 - INFO - __main__ - Step 86404: {'lr': 0.0001952415179468457, 'samples': 16589568, 'steps': 86403, 'loss/train': 1.9756405353546143} 11/07/2021 09:21:47 - INFO - __main__ - Step 86405: {'lr': 0.00019523634007721896, 'samples': 16589760, 'steps': 86404, 'loss/train': 1.6728991270065308} 11/07/2021 09:21:47 - INFO - __main__ - Step 86406: {'lr': 0.00019523116223226782, 'samples': 16589952, 'steps': 86405, 'loss/train': 1.3573815822601318} 11/07/2021 09:21:48 - INFO - __main__ - Step 86407: {'lr': 0.00019522598441199467, 'samples': 16590144, 'steps': 86406, 'loss/train': 1.0550199747085571} 11/07/2021 09:21:48 - INFO - __main__ - Step 86408: {'lr': 0.00019522080661640184, 'samples': 16590336, 'steps': 86407, 'loss/train': 1.0819272994995117} 11/07/2021 09:21:49 - INFO - __main__ - Step 86409: {'lr': 0.00019521562884549168, 'samples': 16590528, 'steps': 86408, 'loss/train': 1.351301908493042} 11/07/2021 09:21:49 - INFO - __main__ - Step 86410: {'lr': 0.00019521045109926653, 'samples': 16590720, 'steps': 86409, 'loss/train': 2.1501803398132324} 11/07/2021 09:21:50 - INFO - __main__ - Step 86411: {'lr': 0.00019520527337772868, 'samples': 16590912, 'steps': 86410, 'loss/train': 1.3821234703063965} 11/07/2021 09:21:50 - INFO - __main__ - Step 86412: {'lr': 0.00019520009568088048, 'samples': 16591104, 'steps': 86411, 'loss/train': 1.131779432296753} 11/07/2021 09:21:50 - INFO - __main__ - Step 86413: {'lr': 0.0001951949180087243, 'samples': 16591296, 'steps': 86412, 'loss/train': 0.5809757113456726} 11/07/2021 09:21:51 - INFO - __main__ - Step 86414: {'lr': 0.00019518974036126247, 'samples': 16591488, 'steps': 86413, 'loss/train': 0.6122252941131592} 11/07/2021 09:21:52 - INFO - __main__ - Step 86415: {'lr': 0.00019518456273849731, 'samples': 16591680, 'steps': 86414, 'loss/train': 0.9491802453994751} 11/07/2021 09:21:52 - INFO - __main__ - Step 86416: {'lr': 0.0001951793851404311, 'samples': 16591872, 'steps': 86415, 'loss/train': 1.3794618844985962} 11/07/2021 09:21:52 - INFO - __main__ - Step 86417: {'lr': 0.00019517420756706618, 'samples': 16592064, 'steps': 86416, 'loss/train': 1.1562126874923706} 11/07/2021 09:21:53 - INFO - __main__ - Step 86418: {'lr': 0.00019516903001840494, 'samples': 16592256, 'steps': 86417, 'loss/train': 1.148738145828247} 11/07/2021 09:21:54 - INFO - __main__ - Step 86419: {'lr': 0.00019516385249444967, 'samples': 16592448, 'steps': 86418, 'loss/train': 1.4417237043380737} 11/07/2021 09:21:54 - INFO - __main__ - Step 86420: {'lr': 0.00019515867499520273, 'samples': 16592640, 'steps': 86419, 'loss/train': 1.2956421375274658} 11/07/2021 09:21:54 - INFO - __main__ - Step 86421: {'lr': 0.00019515349752066648, 'samples': 16592832, 'steps': 86420, 'loss/train': 1.3125028610229492} 11/07/2021 09:21:55 - INFO - __main__ - Step 86422: {'lr': 0.00019514832007084317, 'samples': 16593024, 'steps': 86421, 'loss/train': 1.4481613636016846} 11/07/2021 09:21:55 - INFO - __main__ - Step 86423: {'lr': 0.0001951431426457352, 'samples': 16593216, 'steps': 86422, 'loss/train': 1.37786066532135} 11/07/2021 09:21:56 - INFO - __main__ - Step 86424: {'lr': 0.00019513796524534487, 'samples': 16593408, 'steps': 86423, 'loss/train': 1.440194845199585} 11/07/2021 09:21:56 - INFO - __main__ - Step 86425: {'lr': 0.00019513278786967457, 'samples': 16593600, 'steps': 86424, 'loss/train': 1.2990232706069946} 11/07/2021 09:21:57 - INFO - __main__ - Step 86426: {'lr': 0.00019512761051872655, 'samples': 16593792, 'steps': 86425, 'loss/train': 1.460911512374878} 11/07/2021 09:21:57 - INFO - __main__ - Step 86427: {'lr': 0.00019512243319250318, 'samples': 16593984, 'steps': 86426, 'loss/train': 1.4087165594100952} 11/07/2021 09:21:58 - INFO - __main__ - Step 86428: {'lr': 0.00019511725589100692, 'samples': 16594176, 'steps': 86427, 'loss/train': 1.651207447052002} 11/07/2021 09:21:59 - INFO - __main__ - Step 86429: {'lr': 0.00019511207861423984, 'samples': 16594368, 'steps': 86428, 'loss/train': 1.3219308853149414} 11/07/2021 09:21:59 - INFO - __main__ - Step 86430: {'lr': 0.00019510690136220445, 'samples': 16594560, 'steps': 86429, 'loss/train': 1.605780005455017} 11/07/2021 09:21:59 - INFO - __main__ - Step 86431: {'lr': 0.00019510172413490302, 'samples': 16594752, 'steps': 86430, 'loss/train': 1.6097478866577148} 11/07/2021 09:22:00 - INFO - __main__ - Step 86432: {'lr': 0.00019509654693233792, 'samples': 16594944, 'steps': 86431, 'loss/train': 1.41234290599823} 11/07/2021 09:22:00 - INFO - __main__ - Step 86433: {'lr': 0.00019509136975451148, 'samples': 16595136, 'steps': 86432, 'loss/train': 1.6569479703903198} 11/07/2021 09:22:02 - INFO - __main__ - Step 86434: {'lr': 0.000195086192601426, 'samples': 16595328, 'steps': 86433, 'loss/train': 1.635779857635498} 11/07/2021 09:22:02 - INFO - __main__ - Step 86435: {'lr': 0.00019508101547308383, 'samples': 16595520, 'steps': 86434, 'loss/train': 1.3579827547073364} 11/07/2021 09:22:03 - INFO - __main__ - Step 86436: {'lr': 0.00019507583836948732, 'samples': 16595712, 'steps': 86435, 'loss/train': 0.8863809108734131} 11/07/2021 09:22:03 - INFO - __main__ - Step 86437: {'lr': 0.00019507066129063877, 'samples': 16595904, 'steps': 86436, 'loss/train': 0.7402538061141968} 11/07/2021 09:22:03 - INFO - __main__ - Step 86438: {'lr': 0.00019506548423654056, 'samples': 16596096, 'steps': 86437, 'loss/train': 0.707229495048523} 11/07/2021 09:22:04 - INFO - __main__ - Step 86439: {'lr': 0.00019506030720719498, 'samples': 16596288, 'steps': 86438, 'loss/train': 1.435967206954956} 11/07/2021 09:22:04 - INFO - __main__ - Step 86440: {'lr': 0.00019505513020260434, 'samples': 16596480, 'steps': 86439, 'loss/train': 1.6335570812225342} 11/07/2021 09:22:05 - INFO - __main__ - Step 86441: {'lr': 0.0001950499532227712, 'samples': 16596672, 'steps': 86440, 'loss/train': 1.6813361644744873} 11/07/2021 09:22:05 - INFO - __main__ - Step 86442: {'lr': 0.00019504477626769754, 'samples': 16596864, 'steps': 86441, 'loss/train': 1.4287779331207275} 11/07/2021 09:22:06 - INFO - __main__ - Step 86443: {'lr': 0.00019503959933738586, 'samples': 16597056, 'steps': 86442, 'loss/train': 1.4664169549942017} 11/07/2021 09:22:06 - INFO - __main__ - Step 86444: {'lr': 0.0001950344224318385, 'samples': 16597248, 'steps': 86443, 'loss/train': 1.22736394405365} 11/07/2021 09:22:06 - INFO - __main__ - Step 86445: {'lr': 0.00019502924555105778, 'samples': 16597440, 'steps': 86444, 'loss/train': 1.806052327156067} 11/07/2021 09:22:07 - INFO - __main__ - Step 86446: {'lr': 0.000195024068695046, 'samples': 16597632, 'steps': 86445, 'loss/train': 1.3974491357803345} 11/07/2021 09:22:08 - INFO - __main__ - Step 86447: {'lr': 0.00019501889186380558, 'samples': 16597824, 'steps': 86446, 'loss/train': 1.4974957704544067} 11/07/2021 09:22:08 - INFO - __main__ - Step 86448: {'lr': 0.0001950137150573388, 'samples': 16598016, 'steps': 86447, 'loss/train': 1.339194416999817} 11/07/2021 09:22:09 - INFO - __main__ - Step 86449: {'lr': 0.00019500853827564795, 'samples': 16598208, 'steps': 86448, 'loss/train': 1.093662142753601} 11/07/2021 09:22:09 - INFO - __main__ - Step 86450: {'lr': 0.0001950033615187354, 'samples': 16598400, 'steps': 86449, 'loss/train': 0.9601923227310181} 11/07/2021 09:22:09 - INFO - __main__ - Step 86451: {'lr': 0.00019499818478660352, 'samples': 16598592, 'steps': 86450, 'loss/train': 1.5651376247406006} 11/07/2021 09:22:10 - INFO - __main__ - Step 86452: {'lr': 0.0001949930080792546, 'samples': 16598784, 'steps': 86451, 'loss/train': 0.8704138994216919} 11/07/2021 09:22:11 - INFO - __main__ - Step 86453: {'lr': 0.000194987831396691, 'samples': 16598976, 'steps': 86452, 'loss/train': 1.470238208770752} 11/07/2021 09:22:11 - INFO - __main__ - Step 86454: {'lr': 0.000194982654738915, 'samples': 16599168, 'steps': 86453, 'loss/train': 1.606284499168396} 11/07/2021 09:22:11 - INFO - __main__ - Step 86455: {'lr': 0.00019497747810592907, 'samples': 16599360, 'steps': 86454, 'loss/train': 0.6945438981056213} 11/07/2021 09:22:12 - INFO - __main__ - Step 86456: {'lr': 0.00019497230149773538, 'samples': 16599552, 'steps': 86455, 'loss/train': 0.6759269833564758} 11/07/2021 09:22:13 - INFO - __main__ - Step 86457: {'lr': 0.00019496712491433627, 'samples': 16599744, 'steps': 86456, 'loss/train': 0.8960050940513611} 11/07/2021 09:22:13 - INFO - __main__ - Step 86458: {'lr': 0.00019496194835573417, 'samples': 16599936, 'steps': 86457, 'loss/train': 1.0580146312713623} 11/07/2021 09:22:13 - INFO - __main__ - Step 86459: {'lr': 0.00019495677182193133, 'samples': 16600128, 'steps': 86458, 'loss/train': 1.6216130256652832} 11/07/2021 09:22:14 - INFO - __main__ - Step 86460: {'lr': 0.00019495159531293015, 'samples': 16600320, 'steps': 86459, 'loss/train': 1.4116120338439941} 11/07/2021 09:22:14 - INFO - __main__ - Step 86461: {'lr': 0.00019494641882873289, 'samples': 16600512, 'steps': 86460, 'loss/train': 1.3841922283172607} 11/07/2021 09:22:15 - INFO - __main__ - Step 86462: {'lr': 0.00019494124236934192, 'samples': 16600704, 'steps': 86461, 'loss/train': 1.3911387920379639} 11/07/2021 09:22:15 - INFO - __main__ - Step 86463: {'lr': 0.00019493606593475962, 'samples': 16600896, 'steps': 86462, 'loss/train': 1.2677310705184937} 11/07/2021 09:22:16 - INFO - __main__ - Step 86464: {'lr': 0.0001949308895249883, 'samples': 16601088, 'steps': 86463, 'loss/train': 0.8912258744239807} 11/07/2021 09:22:16 - INFO - __main__ - Step 86465: {'lr': 0.00019492571314003022, 'samples': 16601280, 'steps': 86464, 'loss/train': 1.4097403287887573} 11/07/2021 09:22:16 - INFO - __main__ - Step 86466: {'lr': 0.00019492053677988777, 'samples': 16601472, 'steps': 86465, 'loss/train': 1.2781908512115479} 11/07/2021 09:22:18 - INFO - __main__ - Step 86467: {'lr': 0.0001949153604445633, 'samples': 16601664, 'steps': 86466, 'loss/train': 1.5594394207000732} 11/07/2021 09:22:18 - INFO - __main__ - Step 86468: {'lr': 0.0001949101841340592, 'samples': 16601856, 'steps': 86467, 'loss/train': 1.293088674545288} 11/07/2021 09:22:18 - INFO - __main__ - Step 86469: {'lr': 0.00019490500784837762, 'samples': 16602048, 'steps': 86468, 'loss/train': 1.5463160276412964} 11/07/2021 09:22:19 - INFO - __main__ - Step 86470: {'lr': 0.000194899831587521, 'samples': 16602240, 'steps': 86469, 'loss/train': 1.5410295724868774} 11/07/2021 09:22:19 - INFO - __main__ - Step 86471: {'lr': 0.00019489465535149164, 'samples': 16602432, 'steps': 86470, 'loss/train': 1.323303461074829} 11/07/2021 09:22:20 - INFO - __main__ - Step 86472: {'lr': 0.00019488947914029193, 'samples': 16602624, 'steps': 86471, 'loss/train': 1.5955158472061157} 11/07/2021 09:22:20 - INFO - __main__ - Step 86473: {'lr': 0.00019488430295392417, 'samples': 16602816, 'steps': 86472, 'loss/train': 1.246339201927185} 11/07/2021 09:22:21 - INFO - __main__ - Step 86474: {'lr': 0.00019487912679239068, 'samples': 16603008, 'steps': 86473, 'loss/train': 1.3293105363845825} 11/07/2021 09:22:21 - INFO - __main__ - Step 86475: {'lr': 0.0001948739506556938, 'samples': 16603200, 'steps': 86474, 'loss/train': 1.289432168006897} 11/07/2021 09:22:21 - INFO - __main__ - Step 86476: {'lr': 0.0001948687745438359, 'samples': 16603392, 'steps': 86475, 'loss/train': 1.8214586973190308} 11/07/2021 09:22:22 - INFO - __main__ - Step 86477: {'lr': 0.00019486359845681926, 'samples': 16603584, 'steps': 86476, 'loss/train': 1.5021344423294067} 11/07/2021 09:22:23 - INFO - __main__ - Step 86478: {'lr': 0.0001948584223946462, 'samples': 16603776, 'steps': 86477, 'loss/train': 1.8539600372314453} 11/07/2021 09:22:23 - INFO - __main__ - Step 86479: {'lr': 0.00019485324635731913, 'samples': 16603968, 'steps': 86478, 'loss/train': 1.8641331195831299} 11/07/2021 09:22:23 - INFO - __main__ - Step 86480: {'lr': 0.00019484807034484032, 'samples': 16604160, 'steps': 86479, 'loss/train': 0.8575642108917236} 11/07/2021 09:22:24 - INFO - __main__ - Step 86481: {'lr': 0.00019484289435721212, 'samples': 16604352, 'steps': 86480, 'loss/train': 1.3312166929244995} 11/07/2021 09:22:24 - INFO - __main__ - Step 86482: {'lr': 0.00019483771839443696, 'samples': 16604544, 'steps': 86481, 'loss/train': 1.3380072116851807} 11/07/2021 09:22:25 - INFO - __main__ - Step 86483: {'lr': 0.00019483254245651697, 'samples': 16604736, 'steps': 86482, 'loss/train': 1.1826311349868774} 11/07/2021 09:22:25 - INFO - __main__ - Step 86484: {'lr': 0.00019482736654345456, 'samples': 16604928, 'steps': 86483, 'loss/train': 1.3505330085754395} 11/07/2021 09:22:26 - INFO - __main__ - Step 86485: {'lr': 0.00019482219065525215, 'samples': 16605120, 'steps': 86484, 'loss/train': 1.2182660102844238} 11/07/2021 09:22:26 - INFO - __main__ - Step 86486: {'lr': 0.00019481701479191193, 'samples': 16605312, 'steps': 86485, 'loss/train': 1.1223064661026} 11/07/2021 09:22:26 - INFO - __main__ - Step 86487: {'lr': 0.00019481183895343637, 'samples': 16605504, 'steps': 86486, 'loss/train': 1.5205038785934448} 11/07/2021 09:22:28 - INFO - __main__ - Step 86488: {'lr': 0.00019480666313982772, 'samples': 16605696, 'steps': 86487, 'loss/train': 1.742558240890503} 11/07/2021 09:22:28 - INFO - __main__ - Step 86489: {'lr': 0.00019480148735108834, 'samples': 16605888, 'steps': 86488, 'loss/train': 1.0981099605560303} 11/07/2021 09:22:28 - INFO - __main__ - Step 86490: {'lr': 0.00019479631158722058, 'samples': 16606080, 'steps': 86489, 'loss/train': 0.8961254358291626} 11/07/2021 09:22:29 - INFO - __main__ - Step 86491: {'lr': 0.00019479113584822672, 'samples': 16606272, 'steps': 86490, 'loss/train': 1.5324496030807495} 11/07/2021 09:22:29 - INFO - __main__ - Step 86492: {'lr': 0.00019478596013410915, 'samples': 16606464, 'steps': 86491, 'loss/train': 1.0253946781158447} 11/07/2021 09:22:30 - INFO - __main__ - Step 86493: {'lr': 0.00019478078444487015, 'samples': 16606656, 'steps': 86492, 'loss/train': 1.17601478099823} 11/07/2021 09:22:31 - INFO - __main__ - Step 86494: {'lr': 0.0001947756087805121, 'samples': 16606848, 'steps': 86493, 'loss/train': 1.1999948024749756} 11/07/2021 09:22:31 - INFO - __main__ - Step 86495: {'lr': 0.00019477043314103737, 'samples': 16607040, 'steps': 86494, 'loss/train': 1.376883625984192} 11/07/2021 09:22:31 - INFO - __main__ - Step 86496: {'lr': 0.00019476525752644817, 'samples': 16607232, 'steps': 86495, 'loss/train': 1.419260025024414} 11/07/2021 09:22:32 - INFO - __main__ - Step 86497: {'lr': 0.00019476008193674687, 'samples': 16607424, 'steps': 86496, 'loss/train': 0.20474693179130554} 11/07/2021 09:22:32 - INFO - __main__ - Step 86498: {'lr': 0.00019475490637193584, 'samples': 16607616, 'steps': 86497, 'loss/train': 1.4528858661651611} 11/07/2021 09:22:33 - INFO - __main__ - Step 86499: {'lr': 0.00019474973083201738, 'samples': 16607808, 'steps': 86498, 'loss/train': 0.2627605199813843} 11/07/2021 09:22:33 - INFO - __main__ - Step 86500: {'lr': 0.00019474455531699384, 'samples': 16608000, 'steps': 86499, 'loss/train': 0.8775913715362549} 11/07/2021 09:22:34 - INFO - __main__ - Step 86501: {'lr': 0.00019473937982686756, 'samples': 16608192, 'steps': 86500, 'loss/train': 1.224403977394104} 11/07/2021 09:22:34 - INFO - __main__ - Step 86502: {'lr': 0.00019473420436164085, 'samples': 16608384, 'steps': 86501, 'loss/train': 1.452445149421692} 11/07/2021 09:22:34 - INFO - __main__ - Step 86503: {'lr': 0.0001947290289213161, 'samples': 16608576, 'steps': 86502, 'loss/train': 1.1823289394378662} 11/07/2021 09:22:35 - INFO - __main__ - Step 86504: {'lr': 0.00019472385350589552, 'samples': 16608768, 'steps': 86503, 'loss/train': 1.8239877223968506} 11/07/2021 09:22:36 - INFO - __main__ - Step 86505: {'lr': 0.00019471867811538158, 'samples': 16608960, 'steps': 86504, 'loss/train': 0.9209504723548889} 11/07/2021 09:22:36 - INFO - __main__ - Step 86506: {'lr': 0.00019471350274977657, 'samples': 16609152, 'steps': 86505, 'loss/train': 1.4238553047180176} 11/07/2021 09:22:36 - INFO - __main__ - Step 86507: {'lr': 0.0001947083274090828, 'samples': 16609344, 'steps': 86506, 'loss/train': 1.5502177476882935} 11/07/2021 09:22:37 - INFO - __main__ - Step 86508: {'lr': 0.00019470315209330253, 'samples': 16609536, 'steps': 86507, 'loss/train': 1.6335465908050537} 11/07/2021 09:22:38 - INFO - __main__ - Step 86509: {'lr': 0.00019469797680243827, 'samples': 16609728, 'steps': 86508, 'loss/train': 0.9143983721733093} 11/07/2021 09:22:38 - INFO - __main__ - Step 86510: {'lr': 0.00019469280153649218, 'samples': 16609920, 'steps': 86509, 'loss/train': 1.5493369102478027} 11/07/2021 09:22:39 - INFO - __main__ - Step 86511: {'lr': 0.00019468762629546666, 'samples': 16610112, 'steps': 86510, 'loss/train': 1.7002323865890503} 11/07/2021 09:22:39 - INFO - __main__ - Step 86512: {'lr': 0.00019468245107936405, 'samples': 16610304, 'steps': 86511, 'loss/train': 1.2080482244491577} 11/07/2021 09:22:39 - INFO - __main__ - Step 86513: {'lr': 0.00019467727588818665, 'samples': 16610496, 'steps': 86512, 'loss/train': 1.3240858316421509} 11/07/2021 09:22:40 - INFO - __main__ - Step 86514: {'lr': 0.00019467210072193682, 'samples': 16610688, 'steps': 86513, 'loss/train': 1.506675362586975} 11/07/2021 09:22:41 - INFO - __main__ - Step 86515: {'lr': 0.00019466692558061695, 'samples': 16610880, 'steps': 86514, 'loss/train': 1.266071081161499} 11/07/2021 09:22:41 - INFO - __main__ - Step 86516: {'lr': 0.00019466175046422922, 'samples': 16611072, 'steps': 86515, 'loss/train': 0.510161817073822} 11/07/2021 09:22:41 - INFO - __main__ - Step 86517: {'lr': 0.00019465657537277614, 'samples': 16611264, 'steps': 86516, 'loss/train': 1.3423112630844116} 11/07/2021 09:22:42 - INFO - __main__ - Step 86518: {'lr': 0.00019465140030625993, 'samples': 16611456, 'steps': 86517, 'loss/train': 1.1888469457626343} 11/07/2021 09:22:42 - INFO - __main__ - Step 86519: {'lr': 0.00019464622526468292, 'samples': 16611648, 'steps': 86518, 'loss/train': 0.4065333902835846} 11/07/2021 09:22:43 - INFO - __main__ - Step 86520: {'lr': 0.00019464105024804746, 'samples': 16611840, 'steps': 86519, 'loss/train': 1.217254877090454} 11/07/2021 09:22:43 - INFO - __main__ - Step 86521: {'lr': 0.00019463587525635589, 'samples': 16612032, 'steps': 86520, 'loss/train': 1.4123084545135498} 11/07/2021 09:22:44 - INFO - __main__ - Step 86522: {'lr': 0.00019463070028961061, 'samples': 16612224, 'steps': 86521, 'loss/train': 0.940285861492157} 11/07/2021 09:22:44 - INFO - __main__ - Step 86523: {'lr': 0.0001946255253478138, 'samples': 16612416, 'steps': 86522, 'loss/train': 1.4358030557632446} 11/07/2021 09:22:44 - INFO - __main__ - Step 86524: {'lr': 0.0001946203504309679, 'samples': 16612608, 'steps': 86523, 'loss/train': 1.7147221565246582} 11/07/2021 09:22:45 - INFO - __main__ - Step 86525: {'lr': 0.0001946151755390752, 'samples': 16612800, 'steps': 86524, 'loss/train': 1.4851619005203247} 11/07/2021 09:22:46 - INFO - __main__ - Step 86526: {'lr': 0.00019461000067213808, 'samples': 16612992, 'steps': 86525, 'loss/train': 1.4014256000518799} 11/07/2021 09:22:46 - INFO - __main__ - Step 86527: {'lr': 0.0001946048258301588, 'samples': 16613184, 'steps': 86526, 'loss/train': 1.2918163537979126} 11/07/2021 09:22:47 - INFO - __main__ - Step 86528: {'lr': 0.00019459965101313982, 'samples': 16613376, 'steps': 86527, 'loss/train': 1.1602249145507812} 11/07/2021 09:22:47 - INFO - __main__ - Step 86529: {'lr': 0.0001945944762210833, 'samples': 16613568, 'steps': 86528, 'loss/train': 1.6405613422393799} 11/07/2021 09:22:48 - INFO - __main__ - Step 86530: {'lr': 0.00019458930145399166, 'samples': 16613760, 'steps': 86529, 'loss/train': 1.3704487085342407} 11/07/2021 09:22:48 - INFO - __main__ - Step 86531: {'lr': 0.00019458412671186721, 'samples': 16613952, 'steps': 86530, 'loss/train': 1.0223058462142944} 11/07/2021 09:22:49 - INFO - __main__ - Step 86532: {'lr': 0.00019457895199471233, 'samples': 16614144, 'steps': 86531, 'loss/train': 1.432289719581604} 11/07/2021 09:22:49 - INFO - __main__ - Step 86533: {'lr': 0.00019457377730252928, 'samples': 16614336, 'steps': 86532, 'loss/train': 1.1947332620620728} 11/07/2021 09:22:49 - INFO - __main__ - Step 86534: {'lr': 0.00019456860263532044, 'samples': 16614528, 'steps': 86533, 'loss/train': 1.4981880187988281} 11/07/2021 09:22:50 - INFO - __main__ - Step 86535: {'lr': 0.00019456342799308824, 'samples': 16614720, 'steps': 86534, 'loss/train': 1.2056151628494263} 11/07/2021 09:22:51 - INFO - __main__ - Step 86536: {'lr': 0.0001945582533758348, 'samples': 16614912, 'steps': 86535, 'loss/train': 1.0580180883407593} 11/07/2021 09:22:51 - INFO - __main__ - Step 86537: {'lr': 0.00019455307878356255, 'samples': 16615104, 'steps': 86536, 'loss/train': 1.5410488843917847} 11/07/2021 09:22:51 - INFO - __main__ - Step 86538: {'lr': 0.00019454790421627387, 'samples': 16615296, 'steps': 86537, 'loss/train': 1.5481730699539185} 11/07/2021 09:22:52 - INFO - __main__ - Step 86539: {'lr': 0.00019454272967397103, 'samples': 16615488, 'steps': 86538, 'loss/train': 1.305698275566101} 11/07/2021 09:22:52 - INFO - __main__ - Step 86540: {'lr': 0.00019453755515665637, 'samples': 16615680, 'steps': 86539, 'loss/train': 1.1612858772277832} 11/07/2021 09:22:53 - INFO - __main__ - Step 86541: {'lr': 0.00019453238066433228, 'samples': 16615872, 'steps': 86540, 'loss/train': 1.3267790079116821} 11/07/2021 09:22:53 - INFO - __main__ - Step 86542: {'lr': 0.000194527206197001, 'samples': 16616064, 'steps': 86541, 'loss/train': 1.1159310340881348} 11/07/2021 09:22:54 - INFO - __main__ - Step 86543: {'lr': 0.00019452203175466488, 'samples': 16616256, 'steps': 86542, 'loss/train': 1.7490230798721313} 11/07/2021 09:22:54 - INFO - __main__ - Step 86544: {'lr': 0.0001945168573373263, 'samples': 16616448, 'steps': 86543, 'loss/train': 0.8582000136375427} 11/07/2021 09:22:54 - INFO - __main__ - Step 86545: {'lr': 0.00019451168294498756, 'samples': 16616640, 'steps': 86544, 'loss/train': 0.6364337801933289} 11/07/2021 09:22:56 - INFO - __main__ - Step 86546: {'lr': 0.00019450650857765102, 'samples': 16616832, 'steps': 86545, 'loss/train': 1.5486375093460083} 11/07/2021 09:22:56 - INFO - __main__ - Step 86547: {'lr': 0.00019450133423531897, 'samples': 16617024, 'steps': 86546, 'loss/train': 1.9507420063018799} 11/07/2021 09:22:56 - INFO - __main__ - Step 86548: {'lr': 0.00019449615991799375, 'samples': 16617216, 'steps': 86547, 'loss/train': 1.3453556299209595} 11/07/2021 09:22:57 - INFO - __main__ - Step 86549: {'lr': 0.0001944909856256778, 'samples': 16617408, 'steps': 86548, 'loss/train': 1.3320447206497192} 11/07/2021 09:22:57 - INFO - __main__ - Step 86550: {'lr': 0.00019448581135837333, 'samples': 16617600, 'steps': 86549, 'loss/train': 1.1318808794021606} 11/07/2021 09:22:58 - INFO - __main__ - Step 86551: {'lr': 0.00019448063711608262, 'samples': 16617792, 'steps': 86550, 'loss/train': 0.5032885670661926} 11/07/2021 09:22:58 - INFO - __main__ - Step 86552: {'lr': 0.0001944754628988081, 'samples': 16617984, 'steps': 86551, 'loss/train': 1.4871230125427246} 11/07/2021 09:22:59 - INFO - __main__ - Step 86553: {'lr': 0.0001944702887065521, 'samples': 16618176, 'steps': 86552, 'loss/train': 1.2005807161331177} 11/07/2021 09:22:59 - INFO - __main__ - Step 86554: {'lr': 0.0001944651145393169, 'samples': 16618368, 'steps': 86553, 'loss/train': 1.2533551454544067} 11/07/2021 09:22:59 - INFO - __main__ - Step 86555: {'lr': 0.00019445994039710488, 'samples': 16618560, 'steps': 86554, 'loss/train': 1.3716768026351929} 11/07/2021 09:23:00 - INFO - __main__ - Step 86556: {'lr': 0.00019445476627991834, 'samples': 16618752, 'steps': 86555, 'loss/train': 1.4054617881774902} 11/07/2021 09:23:01 - INFO - __main__ - Step 86557: {'lr': 0.00019444959218775965, 'samples': 16618944, 'steps': 86556, 'loss/train': 1.2454313039779663} 11/07/2021 09:23:01 - INFO - __main__ - Step 86558: {'lr': 0.0001944444181206311, 'samples': 16619136, 'steps': 86557, 'loss/train': 1.017775297164917} 11/07/2021 09:23:02 - INFO - __main__ - Step 86559: {'lr': 0.00019443924407853503, 'samples': 16619328, 'steps': 86558, 'loss/train': 1.3569703102111816} 11/07/2021 09:23:02 - INFO - __main__ - Step 86560: {'lr': 0.0001944340700614738, 'samples': 16619520, 'steps': 86559, 'loss/train': 1.568757176399231} 11/07/2021 09:23:03 - INFO - __main__ - Step 86561: {'lr': 0.0001944288960694497, 'samples': 16619712, 'steps': 86560, 'loss/train': 1.4996225833892822} 11/07/2021 09:23:03 - INFO - __main__ - Step 86562: {'lr': 0.0001944237221024652, 'samples': 16619904, 'steps': 86561, 'loss/train': 1.5010613203048706} 11/07/2021 09:23:04 - INFO - __main__ - Step 86563: {'lr': 0.0001944185481605224, 'samples': 16620096, 'steps': 86562, 'loss/train': 1.5372024774551392} 11/07/2021 09:23:04 - INFO - __main__ - Step 86564: {'lr': 0.00019441337424362377, 'samples': 16620288, 'steps': 86563, 'loss/train': 0.9871104955673218} 11/07/2021 09:23:04 - INFO - __main__ - Step 86565: {'lr': 0.0001944082003517716, 'samples': 16620480, 'steps': 86564, 'loss/train': 1.273007869720459} 11/07/2021 09:23:05 - INFO - __main__ - Step 86566: {'lr': 0.00019440302648496822, 'samples': 16620672, 'steps': 86565, 'loss/train': 0.8308430910110474} 11/07/2021 09:23:06 - INFO - __main__ - Step 86567: {'lr': 0.00019439785264321598, 'samples': 16620864, 'steps': 86566, 'loss/train': 1.386969804763794} 11/07/2021 09:23:06 - INFO - __main__ - Step 86568: {'lr': 0.00019439267882651723, 'samples': 16621056, 'steps': 86567, 'loss/train': 1.4109283685684204} 11/07/2021 09:23:07 - INFO - __main__ - Step 86569: {'lr': 0.00019438750503487427, 'samples': 16621248, 'steps': 86568, 'loss/train': 1.161543846130371} 11/07/2021 09:23:07 - INFO - __main__ - Step 86570: {'lr': 0.0001943823312682894, 'samples': 16621440, 'steps': 86569, 'loss/train': 1.381686806678772} 11/07/2021 09:23:08 - INFO - __main__ - Step 86571: {'lr': 0.00019437715752676504, 'samples': 16621632, 'steps': 86570, 'loss/train': 0.8429765105247498} 11/07/2021 09:23:08 - INFO - __main__ - Step 86572: {'lr': 0.0001943719838103035, 'samples': 16621824, 'steps': 86571, 'loss/train': 1.1877880096435547} 11/07/2021 09:23:09 - INFO - __main__ - Step 86573: {'lr': 0.00019436681011890706, 'samples': 16622016, 'steps': 86572, 'loss/train': 0.7683623433113098} 11/07/2021 09:23:09 - INFO - __main__ - Step 86574: {'lr': 0.00019436163645257808, 'samples': 16622208, 'steps': 86573, 'loss/train': 1.3858771324157715} 11/07/2021 09:23:09 - INFO - __main__ - Step 86575: {'lr': 0.00019435646281131886, 'samples': 16622400, 'steps': 86574, 'loss/train': 1.058731198310852} 11/07/2021 09:23:10 - INFO - __main__ - Step 86576: {'lr': 0.0001943512891951319, 'samples': 16622592, 'steps': 86575, 'loss/train': 1.6578004360198975} 11/07/2021 09:23:11 - INFO - __main__ - Step 86577: {'lr': 0.00019434611560401928, 'samples': 16622784, 'steps': 86576, 'loss/train': 1.6205682754516602} 11/07/2021 09:23:11 - INFO - __main__ - Step 86578: {'lr': 0.0001943409420379834, 'samples': 16622976, 'steps': 86577, 'loss/train': 1.4634467363357544} 11/07/2021 09:23:11 - INFO - __main__ - Step 86579: {'lr': 0.00019433576849702666, 'samples': 16623168, 'steps': 86578, 'loss/train': 1.285887360572815} 11/07/2021 09:23:12 - INFO - __main__ - Step 86580: {'lr': 0.0001943305949811514, 'samples': 16623360, 'steps': 86579, 'loss/train': 1.4571024179458618} 11/07/2021 09:23:12 - INFO - __main__ - Step 86581: {'lr': 0.00019432542149035986, 'samples': 16623552, 'steps': 86580, 'loss/train': 1.4072437286376953} 11/07/2021 09:23:13 - INFO - __main__ - Step 86582: {'lr': 0.00019432024802465444, 'samples': 16623744, 'steps': 86581, 'loss/train': 1.4231600761413574} 11/07/2021 09:23:14 - INFO - __main__ - Step 86583: {'lr': 0.00019431507458403749, 'samples': 16623936, 'steps': 86582, 'loss/train': 1.4439247846603394} 11/07/2021 09:23:14 - INFO - __main__ - Step 86584: {'lr': 0.00019430990116851127, 'samples': 16624128, 'steps': 86583, 'loss/train': 1.2524909973144531} 11/07/2021 09:23:14 - INFO - __main__ - Step 86585: {'lr': 0.00019430472777807816, 'samples': 16624320, 'steps': 86584, 'loss/train': 1.332810878753662} 11/07/2021 09:23:15 - INFO - __main__ - Step 86586: {'lr': 0.0001942995544127405, 'samples': 16624512, 'steps': 86585, 'loss/train': 1.2135359048843384} 11/07/2021 09:23:16 - INFO - __main__ - Step 86587: {'lr': 0.0001942943810725006, 'samples': 16624704, 'steps': 86586, 'loss/train': 1.3818578720092773} 11/07/2021 09:23:16 - INFO - __main__ - Step 86588: {'lr': 0.00019428920775736076, 'samples': 16624896, 'steps': 86587, 'loss/train': 1.3953983783721924} 11/07/2021 09:23:17 - INFO - __main__ - Step 86589: {'lr': 0.00019428403446732344, 'samples': 16625088, 'steps': 86588, 'loss/train': 1.253563404083252} 11/07/2021 09:23:17 - INFO - __main__ - Step 86590: {'lr': 0.00019427886120239083, 'samples': 16625280, 'steps': 86589, 'loss/train': 1.3857346773147583} 11/07/2021 09:23:17 - INFO - __main__ - Step 86591: {'lr': 0.00019427368796256527, 'samples': 16625472, 'steps': 86590, 'loss/train': 1.7190275192260742} 11/07/2021 09:23:18 - INFO - __main__ - Step 86592: {'lr': 0.00019426851474784913, 'samples': 16625664, 'steps': 86591, 'loss/train': 1.087536096572876} 11/07/2021 09:23:19 - INFO - __main__ - Step 86593: {'lr': 0.0001942633415582447, 'samples': 16625856, 'steps': 86592, 'loss/train': 1.7056304216384888} 11/07/2021 09:23:19 - INFO - __main__ - Step 86594: {'lr': 0.0001942581683937544, 'samples': 16626048, 'steps': 86593, 'loss/train': 1.311112403869629} 11/07/2021 09:23:19 - INFO - __main__ - Step 86595: {'lr': 0.0001942529952543805, 'samples': 16626240, 'steps': 86594, 'loss/train': 1.2660404443740845} 11/07/2021 09:23:20 - INFO - __main__ - Step 86596: {'lr': 0.00019424782214012533, 'samples': 16626432, 'steps': 86595, 'loss/train': 1.254003882408142} 11/07/2021 09:23:21 - INFO - __main__ - Step 86597: {'lr': 0.00019424264905099124, 'samples': 16626624, 'steps': 86596, 'loss/train': 1.3207411766052246} 11/07/2021 09:23:21 - INFO - __main__ - Step 86598: {'lr': 0.00019423747598698053, 'samples': 16626816, 'steps': 86597, 'loss/train': 1.5886824131011963} 11/07/2021 09:23:21 - INFO - __main__ - Step 86599: {'lr': 0.00019423230294809558, 'samples': 16627008, 'steps': 86598, 'loss/train': 1.6417944431304932} 11/07/2021 09:23:22 - INFO - __main__ - Step 86600: {'lr': 0.00019422712993433867, 'samples': 16627200, 'steps': 86599, 'loss/train': 1.7525266408920288} 11/07/2021 09:23:22 - INFO - __main__ - Step 86601: {'lr': 0.00019422195694571216, 'samples': 16627392, 'steps': 86600, 'loss/train': 1.4742313623428345} 11/07/2021 09:23:23 - INFO - __main__ - Step 86602: {'lr': 0.00019421678398221838, 'samples': 16627584, 'steps': 86601, 'loss/train': 0.9695133566856384} 11/07/2021 09:23:24 - INFO - __main__ - Step 86603: {'lr': 0.00019421161104385976, 'samples': 16627776, 'steps': 86602, 'loss/train': 1.5043878555297852} 11/07/2021 09:23:24 - INFO - __main__ - Step 86604: {'lr': 0.00019420643813063842, 'samples': 16627968, 'steps': 86603, 'loss/train': 1.1172014474868774} 11/07/2021 09:23:24 - INFO - __main__ - Step 86605: {'lr': 0.00019420126524255683, 'samples': 16628160, 'steps': 86604, 'loss/train': 1.6562358140945435} 11/07/2021 09:23:25 - INFO - __main__ - Step 86606: {'lr': 0.00019419609237961724, 'samples': 16628352, 'steps': 86605, 'loss/train': 1.6300278902053833} 11/07/2021 09:23:26 - INFO - __main__ - Step 86607: {'lr': 0.00019419091954182206, 'samples': 16628544, 'steps': 86606, 'loss/train': 1.3373477458953857} 11/07/2021 09:23:26 - INFO - __main__ - Step 86608: {'lr': 0.00019418574672917357, 'samples': 16628736, 'steps': 86607, 'loss/train': 1.3342394828796387} 11/07/2021 09:23:26 - INFO - __main__ - Step 86609: {'lr': 0.00019418057394167415, 'samples': 16628928, 'steps': 86608, 'loss/train': 1.3217443227767944} 11/07/2021 09:23:27 - INFO - __main__ - Step 86610: {'lr': 0.00019417540117932608, 'samples': 16629120, 'steps': 86609, 'loss/train': 1.6273702383041382} 11/07/2021 09:23:27 - INFO - __main__ - Step 86611: {'lr': 0.0001941702284421317, 'samples': 16629312, 'steps': 86610, 'loss/train': 1.5053653717041016} 11/07/2021 09:23:28 - INFO - __main__ - Step 86612: {'lr': 0.0001941650557300934, 'samples': 16629504, 'steps': 86611, 'loss/train': 1.5145124197006226} 11/07/2021 09:23:28 - INFO - __main__ - Step 86613: {'lr': 0.0001941598830432134, 'samples': 16629696, 'steps': 86612, 'loss/train': 1.426162600517273} 11/07/2021 09:23:29 - INFO - __main__ - Step 86614: {'lr': 0.00019415471038149415, 'samples': 16629888, 'steps': 86613, 'loss/train': 1.5114257335662842} 11/07/2021 09:23:29 - INFO - __main__ - Step 86615: {'lr': 0.0001941495377449379, 'samples': 16630080, 'steps': 86614, 'loss/train': 1.3132007122039795} 11/07/2021 09:23:29 - INFO - __main__ - Step 86616: {'lr': 0.00019414436513354714, 'samples': 16630272, 'steps': 86615, 'loss/train': 1.2412264347076416} 11/07/2021 09:23:30 - INFO - __main__ - Step 86617: {'lr': 0.00019413919254732392, 'samples': 16630464, 'steps': 86616, 'loss/train': 1.2613579034805298} 11/07/2021 09:23:31 - INFO - __main__ - Step 86618: {'lr': 0.00019413401998627074, 'samples': 16630656, 'steps': 86617, 'loss/train': 1.412122130393982} 11/07/2021 09:23:31 - INFO - __main__ - Step 86619: {'lr': 0.00019412884745038993, 'samples': 16630848, 'steps': 86618, 'loss/train': 1.4056342840194702} 11/07/2021 09:23:31 - INFO - __main__ - Step 86620: {'lr': 0.00019412367493968374, 'samples': 16631040, 'steps': 86619, 'loss/train': 1.33607816696167} 11/07/2021 09:23:32 - INFO - __main__ - Step 86621: {'lr': 0.0001941185024541546, 'samples': 16631232, 'steps': 86620, 'loss/train': 0.6002122163772583} 11/07/2021 09:23:32 - INFO - __main__ - Step 86622: {'lr': 0.00019411332999380481, 'samples': 16631424, 'steps': 86621, 'loss/train': 1.4337562322616577} 11/07/2021 09:23:33 - INFO - __main__ - Step 86623: {'lr': 0.00019410815755863668, 'samples': 16631616, 'steps': 86622, 'loss/train': 1.3979920148849487} 11/07/2021 09:23:34 - INFO - __main__ - Step 86624: {'lr': 0.00019410298514865255, 'samples': 16631808, 'steps': 86623, 'loss/train': 1.1654813289642334} 11/07/2021 09:23:34 - INFO - __main__ - Step 86625: {'lr': 0.00019409781276385474, 'samples': 16632000, 'steps': 86624, 'loss/train': 1.1841274499893188} 11/07/2021 09:23:34 - INFO - __main__ - Step 86626: {'lr': 0.00019409264040424562, 'samples': 16632192, 'steps': 86625, 'loss/train': 0.19319209456443787} 11/07/2021 09:23:35 - INFO - __main__ - Step 86627: {'lr': 0.00019408746806982746, 'samples': 16632384, 'steps': 86626, 'loss/train': 1.119762659072876} 11/07/2021 09:23:36 - INFO - __main__ - Step 86628: {'lr': 0.00019408229576060266, 'samples': 16632576, 'steps': 86627, 'loss/train': 1.2156004905700684} 11/07/2021 09:23:36 - INFO - __main__ - Step 86629: {'lr': 0.00019407712347657347, 'samples': 16632768, 'steps': 86628, 'loss/train': 1.0687190294265747} 11/07/2021 09:23:36 - INFO - __main__ - Step 86630: {'lr': 0.0001940719512177424, 'samples': 16632960, 'steps': 86629, 'loss/train': 0.3604404628276825} 11/07/2021 09:23:37 - INFO - __main__ - Step 86631: {'lr': 0.00019406677898411154, 'samples': 16633152, 'steps': 86630, 'loss/train': 1.6661981344223022} 11/07/2021 09:23:37 - INFO - __main__ - Step 86632: {'lr': 0.0001940616067756833, 'samples': 16633344, 'steps': 86631, 'loss/train': 1.4216989278793335} 11/07/2021 09:23:38 - INFO - __main__ - Step 86633: {'lr': 0.0001940564345924601, 'samples': 16633536, 'steps': 86632, 'loss/train': 1.410128116607666} 11/07/2021 09:23:38 - INFO - __main__ - Step 86634: {'lr': 0.00019405126243444415, 'samples': 16633728, 'steps': 86633, 'loss/train': 1.0572483539581299} 11/07/2021 09:23:39 - INFO - __main__ - Step 86635: {'lr': 0.00019404609030163785, 'samples': 16633920, 'steps': 86634, 'loss/train': 1.2102549076080322} 11/07/2021 09:23:39 - INFO - __main__ - Step 86636: {'lr': 0.00019404091819404354, 'samples': 16634112, 'steps': 86635, 'loss/train': 1.3749161958694458} 11/07/2021 09:23:40 - INFO - __main__ - Step 86637: {'lr': 0.00019403574611166354, 'samples': 16634304, 'steps': 86636, 'loss/train': 1.702597737312317} 11/07/2021 09:23:41 - INFO - __main__ - Step 86638: {'lr': 0.00019403057405450013, 'samples': 16634496, 'steps': 86637, 'loss/train': 1.474136233329773} 11/07/2021 09:23:41 - INFO - __main__ - Step 86639: {'lr': 0.00019402540202255567, 'samples': 16634688, 'steps': 86638, 'loss/train': 1.438571572303772} 11/07/2021 09:23:41 - INFO - __main__ - Step 86640: {'lr': 0.00019402023001583251, 'samples': 16634880, 'steps': 86639, 'loss/train': 1.157318353652954} 11/07/2021 09:23:42 - INFO - __main__ - Step 86641: {'lr': 0.00019401505803433305, 'samples': 16635072, 'steps': 86640, 'loss/train': 1.2675586938858032} 11/07/2021 09:23:42 - INFO - __main__ - Step 86642: {'lr': 0.00019400988607805948, 'samples': 16635264, 'steps': 86641, 'loss/train': 0.6415411233901978} 11/07/2021 09:23:43 - INFO - __main__ - Step 86643: {'lr': 0.00019400471414701424, 'samples': 16635456, 'steps': 86642, 'loss/train': 1.7018247842788696} 11/07/2021 09:23:44 - INFO - __main__ - Step 86644: {'lr': 0.00019399954224119957, 'samples': 16635648, 'steps': 86643, 'loss/train': 1.6309388875961304} 11/07/2021 09:23:44 - INFO - __main__ - Step 86645: {'lr': 0.0001939943703606178, 'samples': 16635840, 'steps': 86644, 'loss/train': 0.3684845268726349} 11/07/2021 09:23:44 - INFO - __main__ - Step 86646: {'lr': 0.00019398919850527132, 'samples': 16636032, 'steps': 86645, 'loss/train': 1.352315068244934} 11/07/2021 09:23:45 - INFO - __main__ - Step 86647: {'lr': 0.00019398402667516245, 'samples': 16636224, 'steps': 86646, 'loss/train': 1.7988922595977783} 11/07/2021 09:23:46 - INFO - __main__ - Step 86648: {'lr': 0.00019397885487029354, 'samples': 16636416, 'steps': 86647, 'loss/train': 1.7910075187683105} 11/07/2021 09:23:46 - INFO - __main__ - Step 86649: {'lr': 0.00019397368309066688, 'samples': 16636608, 'steps': 86648, 'loss/train': 1.486834168434143} 11/07/2021 09:23:46 - INFO - __main__ - Step 86650: {'lr': 0.0001939685113362848, 'samples': 16636800, 'steps': 86649, 'loss/train': 1.4301323890686035} 11/07/2021 09:23:47 - INFO - __main__ - Step 86651: {'lr': 0.00019396333960714965, 'samples': 16636992, 'steps': 86650, 'loss/train': 1.3860228061676025} 11/07/2021 09:23:47 - INFO - __main__ - Step 86652: {'lr': 0.00019395816790326382, 'samples': 16637184, 'steps': 86651, 'loss/train': 1.2811474800109863} 11/07/2021 09:23:48 - INFO - __main__ - Step 86653: {'lr': 0.00019395299622462949, 'samples': 16637376, 'steps': 86652, 'loss/train': 1.0226318836212158} 11/07/2021 09:23:48 - INFO - __main__ - Step 86654: {'lr': 0.0001939478245712491, 'samples': 16637568, 'steps': 86653, 'loss/train': 1.1813676357269287} 11/07/2021 09:23:49 - INFO - __main__ - Step 86655: {'lr': 0.00019394265294312495, 'samples': 16637760, 'steps': 86654, 'loss/train': 1.3754888772964478} 11/07/2021 09:23:49 - INFO - __main__ - Step 86656: {'lr': 0.0001939374813402594, 'samples': 16637952, 'steps': 86655, 'loss/train': 1.1297826766967773} 11/07/2021 09:23:49 - INFO - __main__ - Step 86657: {'lr': 0.00019393230976265475, 'samples': 16638144, 'steps': 86656, 'loss/train': 0.5407644510269165} 11/07/2021 09:23:50 - INFO - __main__ - Step 86658: {'lr': 0.00019392713821031333, 'samples': 16638336, 'steps': 86657, 'loss/train': 1.3824387788772583} 11/07/2021 09:23:51 - INFO - __main__ - Step 86659: {'lr': 0.00019392196668323745, 'samples': 16638528, 'steps': 86658, 'loss/train': 0.8241647481918335} 11/07/2021 09:23:51 - INFO - __main__ - Step 86660: {'lr': 0.00019391679518142947, 'samples': 16638720, 'steps': 86659, 'loss/train': 1.4680777788162231} 11/07/2021 09:23:51 - INFO - __main__ - Step 86661: {'lr': 0.00019391162370489175, 'samples': 16638912, 'steps': 86660, 'loss/train': 1.3646364212036133} 11/07/2021 09:23:52 - INFO - __main__ - Step 86662: {'lr': 0.00019390645225362657, 'samples': 16639104, 'steps': 86661, 'loss/train': 1.1768137216567993} 11/07/2021 09:23:53 - INFO - __main__ - Step 86663: {'lr': 0.00019390128082763628, 'samples': 16639296, 'steps': 86662, 'loss/train': 1.4392386674880981} 11/07/2021 09:23:53 - INFO - __main__ - Step 86664: {'lr': 0.0001938961094269232, 'samples': 16639488, 'steps': 86663, 'loss/train': 1.5284167528152466} 11/07/2021 09:23:54 - INFO - __main__ - Step 86665: {'lr': 0.00019389093805148965, 'samples': 16639680, 'steps': 86664, 'loss/train': 0.9723259806632996} 11/07/2021 09:23:54 - INFO - __main__ - Step 86666: {'lr': 0.000193885766701338, 'samples': 16639872, 'steps': 86665, 'loss/train': 1.5481599569320679} 11/07/2021 09:23:54 - INFO - __main__ - Step 86667: {'lr': 0.00019388059537647057, 'samples': 16640064, 'steps': 86666, 'loss/train': 0.6284796595573425} 11/07/2021 09:23:55 - INFO - __main__ - Step 86668: {'lr': 0.00019387542407688964, 'samples': 16640256, 'steps': 86667, 'loss/train': 1.45033597946167} 11/07/2021 09:23:56 - INFO - __main__ - Step 86669: {'lr': 0.0001938702528025976, 'samples': 16640448, 'steps': 86668, 'loss/train': 1.2049320936203003} 11/07/2021 09:23:56 - INFO - __main__ - Step 86670: {'lr': 0.00019386508155359682, 'samples': 16640640, 'steps': 86669, 'loss/train': 1.0323325395584106} 11/07/2021 09:23:56 - INFO - __main__ - Step 86671: {'lr': 0.0001938599103298895, 'samples': 16640832, 'steps': 86670, 'loss/train': 1.385051965713501} 11/07/2021 09:23:57 - INFO - __main__ - Step 86672: {'lr': 0.00019385473913147803, 'samples': 16641024, 'steps': 86671, 'loss/train': 1.3272502422332764} 11/07/2021 09:23:57 - INFO - __main__ - Step 86673: {'lr': 0.0001938495679583648, 'samples': 16641216, 'steps': 86672, 'loss/train': 1.5531445741653442} 11/07/2021 09:23:58 - INFO - __main__ - Step 86674: {'lr': 0.00019384439681055204, 'samples': 16641408, 'steps': 86673, 'loss/train': 1.2633960247039795} 11/07/2021 09:23:58 - INFO - __main__ - Step 86675: {'lr': 0.00019383922568804213, 'samples': 16641600, 'steps': 86674, 'loss/train': 2.1578493118286133} 11/07/2021 09:23:59 - INFO - __main__ - Step 86676: {'lr': 0.00019383405459083743, 'samples': 16641792, 'steps': 86675, 'loss/train': 1.6642779111862183} 11/07/2021 09:23:59 - INFO - __main__ - Step 86677: {'lr': 0.00019382888351894017, 'samples': 16641984, 'steps': 86676, 'loss/train': 1.3267089128494263} 11/07/2021 09:24:00 - INFO - __main__ - Step 86678: {'lr': 0.00019382371247235282, 'samples': 16642176, 'steps': 86677, 'loss/train': 1.7015397548675537} 11/07/2021 09:24:00 - INFO - __main__ - Step 86679: {'lr': 0.00019381854145107758, 'samples': 16642368, 'steps': 86678, 'loss/train': 1.1892893314361572} 11/07/2021 09:24:01 - INFO - __main__ - Step 86680: {'lr': 0.00019381337045511687, 'samples': 16642560, 'steps': 86679, 'loss/train': 1.149567723274231} 11/07/2021 09:24:01 - INFO - __main__ - Step 86681: {'lr': 0.00019380819948447298, 'samples': 16642752, 'steps': 86680, 'loss/train': 1.3950167894363403} 11/07/2021 09:24:02 - INFO - __main__ - Step 86682: {'lr': 0.00019380302853914827, 'samples': 16642944, 'steps': 86681, 'loss/train': 1.3086127042770386} 11/07/2021 09:24:02 - INFO - __main__ - Step 86683: {'lr': 0.00019379785761914505, 'samples': 16643136, 'steps': 86682, 'loss/train': 1.320905089378357} 11/07/2021 09:24:03 - INFO - __main__ - Step 86684: {'lr': 0.0001937926867244657, 'samples': 16643328, 'steps': 86683, 'loss/train': 1.4359976053237915} 11/07/2021 09:24:03 - INFO - __main__ - Step 86685: {'lr': 0.00019378751585511243, 'samples': 16643520, 'steps': 86684, 'loss/train': 1.5835388898849487} 11/07/2021 09:24:04 - INFO - __main__ - Step 86686: {'lr': 0.00019378234501108763, 'samples': 16643712, 'steps': 86685, 'loss/train': 1.0297166109085083} 11/07/2021 09:24:04 - INFO - __main__ - Step 86687: {'lr': 0.00019377717419239365, 'samples': 16643904, 'steps': 86686, 'loss/train': 1.4192763566970825} 11/07/2021 09:24:04 - INFO - __main__ - Step 86688: {'lr': 0.00019377200339903278, 'samples': 16644096, 'steps': 86687, 'loss/train': 1.4456225633621216} 11/07/2021 09:24:05 - INFO - __main__ - Step 86689: {'lr': 0.0001937668326310074, 'samples': 16644288, 'steps': 86688, 'loss/train': 1.5560150146484375} 11/07/2021 09:24:06 - INFO - __main__ - Step 86690: {'lr': 0.00019376166188831982, 'samples': 16644480, 'steps': 86689, 'loss/train': 1.5273935794830322} 11/07/2021 09:24:06 - INFO - __main__ - Step 86691: {'lr': 0.00019375649117097236, 'samples': 16644672, 'steps': 86690, 'loss/train': 1.5600281953811646} 11/07/2021 09:24:06 - INFO - __main__ - Step 86692: {'lr': 0.00019375132047896735, 'samples': 16644864, 'steps': 86691, 'loss/train': 1.0400420427322388} 11/07/2021 09:24:07 - INFO - __main__ - Step 86693: {'lr': 0.00019374614981230716, 'samples': 16645056, 'steps': 86692, 'loss/train': 1.0808593034744263} 11/07/2021 09:24:08 - INFO - __main__ - Step 86694: {'lr': 0.00019374097917099404, 'samples': 16645248, 'steps': 86693, 'loss/train': 1.0106714963912964} 11/07/2021 09:24:08 - INFO - __main__ - Step 86695: {'lr': 0.00019373580855503038, 'samples': 16645440, 'steps': 86694, 'loss/train': 1.1250485181808472} 11/07/2021 09:24:08 - INFO - __main__ - Step 86696: {'lr': 0.00019373063796441852, 'samples': 16645632, 'steps': 86695, 'loss/train': 1.264784574508667} 11/07/2021 09:24:09 - INFO - __main__ - Step 86697: {'lr': 0.00019372546739916086, 'samples': 16645824, 'steps': 86696, 'loss/train': 1.5972228050231934} 11/07/2021 09:24:09 - INFO - __main__ - Step 86698: {'lr': 0.00019372029685925951, 'samples': 16646016, 'steps': 86697, 'loss/train': 1.8094513416290283} 11/07/2021 09:24:10 - INFO - __main__ - Step 86699: {'lr': 0.00019371512634471695, 'samples': 16646208, 'steps': 86698, 'loss/train': 1.6596790552139282} 11/07/2021 09:24:11 - INFO - __main__ - Step 86700: {'lr': 0.00019370995585553548, 'samples': 16646400, 'steps': 86699, 'loss/train': 1.7109688520431519} 11/07/2021 09:24:11 - INFO - __main__ - Step 86701: {'lr': 0.00019370478539171743, 'samples': 16646592, 'steps': 86700, 'loss/train': 1.3183212280273438} 11/07/2021 09:24:11 - INFO - __main__ - Step 86702: {'lr': 0.00019369961495326515, 'samples': 16646784, 'steps': 86701, 'loss/train': 1.7418465614318848} 11/07/2021 09:24:12 - INFO - __main__ - Step 86703: {'lr': 0.00019369444454018096, 'samples': 16646976, 'steps': 86702, 'loss/train': 1.974369764328003} 11/07/2021 09:24:13 - INFO - __main__ - Step 86704: {'lr': 0.00019368927415246715, 'samples': 16647168, 'steps': 86703, 'loss/train': 1.017823576927185} 11/07/2021 09:24:13 - INFO - __main__ - Step 86705: {'lr': 0.0001936841037901261, 'samples': 16647360, 'steps': 86704, 'loss/train': 1.537361979484558} 11/07/2021 09:24:13 - INFO - __main__ - Step 86706: {'lr': 0.00019367893345316012, 'samples': 16647552, 'steps': 86705, 'loss/train': 0.8457692861557007} 11/07/2021 09:24:14 - INFO - __main__ - Step 86707: {'lr': 0.00019367376314157156, 'samples': 16647744, 'steps': 86706, 'loss/train': 1.975041151046753} 11/07/2021 09:24:14 - INFO - __main__ - Step 86708: {'lr': 0.00019366859285536273, 'samples': 16647936, 'steps': 86707, 'loss/train': 1.234990119934082} 11/07/2021 09:24:15 - INFO - __main__ - Step 86709: {'lr': 0.00019366342259453595, 'samples': 16648128, 'steps': 86708, 'loss/train': 0.7931217551231384} 11/07/2021 09:24:16 - INFO - __main__ - Step 86710: {'lr': 0.00019365825235909367, 'samples': 16648320, 'steps': 86709, 'loss/train': 0.785803496837616} 11/07/2021 09:24:16 - INFO - __main__ - Step 86711: {'lr': 0.00019365308214903802, 'samples': 16648512, 'steps': 86710, 'loss/train': 1.4336289167404175} 11/07/2021 09:24:17 - INFO - __main__ - Step 86712: {'lr': 0.0001936479119643714, 'samples': 16648704, 'steps': 86711, 'loss/train': 1.5955544710159302} 11/07/2021 09:24:17 - INFO - __main__ - Step 86713: {'lr': 0.00019364274180509616, 'samples': 16648896, 'steps': 86712, 'loss/train': 0.23053494095802307} 11/07/2021 09:24:18 - INFO - __main__ - Step 86714: {'lr': 0.00019363757167121466, 'samples': 16649088, 'steps': 86713, 'loss/train': 1.6960337162017822} 11/07/2021 09:24:18 - INFO - __main__ - Step 86715: {'lr': 0.00019363240156272917, 'samples': 16649280, 'steps': 86714, 'loss/train': 1.5577431917190552} 11/07/2021 09:24:19 - INFO - __main__ - Step 86716: {'lr': 0.0001936272314796421, 'samples': 16649472, 'steps': 86715, 'loss/train': 0.7903529405593872} 11/07/2021 09:24:19 - INFO - __main__ - Step 86717: {'lr': 0.0001936220614219557, 'samples': 16649664, 'steps': 86716, 'loss/train': 1.2048554420471191} 11/07/2021 09:24:19 - INFO - __main__ - Step 86718: {'lr': 0.0001936168913896723, 'samples': 16649856, 'steps': 86717, 'loss/train': 0.8369081616401672} 11/07/2021 09:24:20 - INFO - __main__ - Step 86719: {'lr': 0.0001936117213827943, 'samples': 16650048, 'steps': 86718, 'loss/train': 1.6264333724975586} 11/07/2021 09:24:21 - INFO - __main__ - Step 86720: {'lr': 0.00019360655140132394, 'samples': 16650240, 'steps': 86719, 'loss/train': 1.1945581436157227} 11/07/2021 09:24:21 - INFO - __main__ - Step 86721: {'lr': 0.00019360138144526362, 'samples': 16650432, 'steps': 86720, 'loss/train': 0.9005486369132996} 11/07/2021 09:24:21 - INFO - __main__ - Step 86722: {'lr': 0.00019359621151461567, 'samples': 16650624, 'steps': 86721, 'loss/train': 1.1983410120010376} 11/07/2021 09:24:22 - INFO - __main__ - Step 86723: {'lr': 0.0001935910416093824, 'samples': 16650816, 'steps': 86722, 'loss/train': 1.4203577041625977} 11/07/2021 09:24:22 - INFO - __main__ - Step 86724: {'lr': 0.00019358587172956621, 'samples': 16651008, 'steps': 86723, 'loss/train': 1.5708891153335571} 11/07/2021 09:24:23 - INFO - __main__ - Step 86725: {'lr': 0.00019358070187516926, 'samples': 16651200, 'steps': 86724, 'loss/train': 1.4163177013397217} 11/07/2021 09:24:24 - INFO - __main__ - Step 86726: {'lr': 0.000193575532046194, 'samples': 16651392, 'steps': 86725, 'loss/train': 1.5132415294647217} 11/07/2021 09:24:24 - INFO - __main__ - Step 86727: {'lr': 0.00019357036224264268, 'samples': 16651584, 'steps': 86726, 'loss/train': 1.8522052764892578} 11/07/2021 09:24:24 - INFO - __main__ - Step 86728: {'lr': 0.00019356519246451772, 'samples': 16651776, 'steps': 86727, 'loss/train': 1.3607484102249146} 11/07/2021 09:24:25 - INFO - __main__ - Step 86729: {'lr': 0.00019356002271182145, 'samples': 16651968, 'steps': 86728, 'loss/train': 0.6855572462081909} 11/07/2021 09:24:26 - INFO - __main__ - Step 86730: {'lr': 0.0001935548529845561, 'samples': 16652160, 'steps': 86729, 'loss/train': 1.281531810760498} 11/07/2021 09:24:26 - INFO - __main__ - Step 86731: {'lr': 0.0001935496832827241, 'samples': 16652352, 'steps': 86730, 'loss/train': 0.9837262630462646} 11/07/2021 09:24:26 - INFO - __main__ - Step 86732: {'lr': 0.00019354451360632772, 'samples': 16652544, 'steps': 86731, 'loss/train': 1.574537754058838} 11/07/2021 09:24:27 - INFO - __main__ - Step 86733: {'lr': 0.00019353934395536932, 'samples': 16652736, 'steps': 86732, 'loss/train': 1.3378981351852417} 11/07/2021 09:24:27 - INFO - __main__ - Step 86734: {'lr': 0.0001935341743298512, 'samples': 16652928, 'steps': 86733, 'loss/train': 1.6973503828048706} 11/07/2021 09:24:28 - INFO - __main__ - Step 86735: {'lr': 0.00019352900472977574, 'samples': 16653120, 'steps': 86734, 'loss/train': 1.2228126525878906} 11/07/2021 09:24:28 - INFO - __main__ - Step 86736: {'lr': 0.00019352383515514523, 'samples': 16653312, 'steps': 86735, 'loss/train': 1.645487904548645} 11/07/2021 09:24:29 - INFO - __main__ - Step 86737: {'lr': 0.00019351866560596214, 'samples': 16653504, 'steps': 86736, 'loss/train': 1.4379788637161255} 11/07/2021 09:24:29 - INFO - __main__ - Step 86738: {'lr': 0.00019351349608222852, 'samples': 16653696, 'steps': 86737, 'loss/train': 1.687050700187683} 11/07/2021 09:24:29 - INFO - __main__ - Step 86739: {'lr': 0.00019350832658394685, 'samples': 16653888, 'steps': 86738, 'loss/train': 1.134028673171997} 11/07/2021 09:24:30 - INFO - __main__ - Step 86740: {'lr': 0.00019350315711111945, 'samples': 16654080, 'steps': 86739, 'loss/train': 1.0776406526565552} 11/07/2021 09:24:31 - INFO - __main__ - Step 86741: {'lr': 0.00019349798766374869, 'samples': 16654272, 'steps': 86740, 'loss/train': 1.3554493188858032} 11/07/2021 09:24:31 - INFO - __main__ - Step 86742: {'lr': 0.00019349281824183683, 'samples': 16654464, 'steps': 86741, 'loss/train': 1.6694930791854858} 11/07/2021 09:24:32 - INFO - __main__ - Step 86743: {'lr': 0.00019348764884538627, 'samples': 16654656, 'steps': 86742, 'loss/train': 1.2990938425064087} 11/07/2021 09:24:32 - INFO - __main__ - Step 86744: {'lr': 0.0001934824794743993, 'samples': 16654848, 'steps': 86743, 'loss/train': 1.4403703212738037} 11/07/2021 09:24:33 - INFO - __main__ - Step 86745: {'lr': 0.00019347731012887823, 'samples': 16655040, 'steps': 86744, 'loss/train': 1.1865530014038086} 11/07/2021 09:24:33 - INFO - __main__ - Step 86746: {'lr': 0.0001934721408088254, 'samples': 16655232, 'steps': 86745, 'loss/train': 1.4496396780014038} 11/07/2021 09:24:34 - INFO - __main__ - Step 86747: {'lr': 0.0001934669715142432, 'samples': 16655424, 'steps': 86746, 'loss/train': 1.7223918437957764} 11/07/2021 09:24:34 - INFO - __main__ - Step 86748: {'lr': 0.00019346180224513387, 'samples': 16655616, 'steps': 86747, 'loss/train': 1.3514326810836792} 11/07/2021 09:24:34 - INFO - __main__ - Step 86749: {'lr': 0.0001934566330014998, 'samples': 16655808, 'steps': 86748, 'loss/train': 1.3404244184494019} 11/07/2021 09:24:35 - INFO - __main__ - Step 86750: {'lr': 0.00019345146378334327, 'samples': 16656000, 'steps': 86749, 'loss/train': 1.7182573080062866} 11/07/2021 09:24:36 - INFO - __main__ - Step 86751: {'lr': 0.00019344629459066675, 'samples': 16656192, 'steps': 86750, 'loss/train': 1.570389986038208} 11/07/2021 09:24:36 - INFO - __main__ - Step 86752: {'lr': 0.0001934411254234724, 'samples': 16656384, 'steps': 86751, 'loss/train': 1.4215781688690186} 11/07/2021 09:24:36 - INFO - __main__ - Step 86753: {'lr': 0.00019343595628176256, 'samples': 16656576, 'steps': 86752, 'loss/train': 1.7310802936553955} 11/07/2021 09:24:37 - INFO - __main__ - Step 86754: {'lr': 0.00019343078716553962, 'samples': 16656768, 'steps': 86753, 'loss/train': 1.2413572072982788} 11/07/2021 09:24:37 - INFO - __main__ - Step 86755: {'lr': 0.00019342561807480588, 'samples': 16656960, 'steps': 86754, 'loss/train': 1.3623703718185425} 11/07/2021 09:24:38 - INFO - __main__ - Step 86756: {'lr': 0.0001934204490095637, 'samples': 16657152, 'steps': 86755, 'loss/train': 1.5733733177185059} 11/07/2021 09:24:39 - INFO - __main__ - Step 86757: {'lr': 0.0001934152799698154, 'samples': 16657344, 'steps': 86756, 'loss/train': 0.2478543370962143} 11/07/2021 09:24:39 - INFO - __main__ - Step 86758: {'lr': 0.00019341011095556327, 'samples': 16657536, 'steps': 86757, 'loss/train': 1.7501862049102783} 11/07/2021 09:24:39 - INFO - __main__ - Step 86759: {'lr': 0.0001934049419668097, 'samples': 16657728, 'steps': 86758, 'loss/train': 1.3420578241348267} 11/07/2021 09:24:40 - INFO - __main__ - Step 86760: {'lr': 0.00019339977300355697, 'samples': 16657920, 'steps': 86759, 'loss/train': 1.4356648921966553} 11/07/2021 09:24:41 - INFO - __main__ - Step 86761: {'lr': 0.00019339460406580744, 'samples': 16658112, 'steps': 86760, 'loss/train': 1.7997187376022339} 11/07/2021 09:24:41 - INFO - __main__ - Step 86762: {'lr': 0.0001933894351535634, 'samples': 16658304, 'steps': 86761, 'loss/train': 2.700181245803833} 11/07/2021 09:24:42 - INFO - __main__ - Step 86763: {'lr': 0.00019338426626682725, 'samples': 16658496, 'steps': 86762, 'loss/train': 1.0628198385238647} 11/07/2021 09:24:42 - INFO - __main__ - Step 86764: {'lr': 0.00019337909740560136, 'samples': 16658688, 'steps': 86763, 'loss/train': 1.3478885889053345} 11/07/2021 09:24:42 - INFO - __main__ - Step 86765: {'lr': 0.00019337392856988789, 'samples': 16658880, 'steps': 86764, 'loss/train': 1.4214035272598267} 11/07/2021 09:24:43 - INFO - __main__ - Step 86766: {'lr': 0.00019336875975968924, 'samples': 16659072, 'steps': 86765, 'loss/train': 1.3798243999481201} 11/07/2021 09:24:44 - INFO - __main__ - Step 86767: {'lr': 0.00019336359097500773, 'samples': 16659264, 'steps': 86766, 'loss/train': 1.4440126419067383} 11/07/2021 09:24:44 - INFO - __main__ - Step 86768: {'lr': 0.00019335842221584573, 'samples': 16659456, 'steps': 86767, 'loss/train': 1.270590901374817} 11/07/2021 09:24:44 - INFO - __main__ - Step 86769: {'lr': 0.00019335325348220555, 'samples': 16659648, 'steps': 86768, 'loss/train': 1.547362208366394} 11/07/2021 09:24:45 - INFO - __main__ - Step 86770: {'lr': 0.00019334808477408953, 'samples': 16659840, 'steps': 86769, 'loss/train': 1.302155613899231} 11/07/2021 09:24:45 - INFO - __main__ - Step 86771: {'lr': 0.0001933429160915, 'samples': 16660032, 'steps': 86770, 'loss/train': 1.6481471061706543} 11/07/2021 09:24:46 - INFO - __main__ - Step 86772: {'lr': 0.00019333774743443923, 'samples': 16660224, 'steps': 86771, 'loss/train': 1.1121524572372437} 11/07/2021 09:24:47 - INFO - __main__ - Step 86773: {'lr': 0.00019333257880290962, 'samples': 16660416, 'steps': 86772, 'loss/train': 1.239090919494629} 11/07/2021 09:24:47 - INFO - __main__ - Step 86774: {'lr': 0.0001933274101969135, 'samples': 16660608, 'steps': 86773, 'loss/train': 0.8555075526237488} 11/07/2021 09:24:47 - INFO - __main__ - Step 86775: {'lr': 0.0001933222416164532, 'samples': 16660800, 'steps': 86774, 'loss/train': 0.9654778242111206} 11/07/2021 09:24:48 - INFO - __main__ - Step 86776: {'lr': 0.00019331707306153098, 'samples': 16660992, 'steps': 86775, 'loss/train': 1.2616136074066162} 11/07/2021 09:24:49 - INFO - __main__ - Step 86777: {'lr': 0.0001933119045321493, 'samples': 16661184, 'steps': 86776, 'loss/train': 1.1506156921386719} 11/07/2021 09:24:49 - INFO - __main__ - Step 86778: {'lr': 0.0001933067360283103, 'samples': 16661376, 'steps': 86777, 'loss/train': 1.6754454374313354} 11/07/2021 09:24:50 - INFO - __main__ - Step 86779: {'lr': 0.0001933015675500164, 'samples': 16661568, 'steps': 86778, 'loss/train': 0.8589944839477539} 11/07/2021 09:24:50 - INFO - __main__ - Step 86780: {'lr': 0.00019329639909727, 'samples': 16661760, 'steps': 86779, 'loss/train': 3.5570826530456543} 11/07/2021 09:24:50 - INFO - __main__ - Step 86781: {'lr': 0.00019329123067007332, 'samples': 16661952, 'steps': 86780, 'loss/train': 2.3137261867523193} 11/07/2021 09:24:51 - INFO - __main__ - Step 86782: {'lr': 0.0001932860622684287, 'samples': 16662144, 'steps': 86781, 'loss/train': 1.4176511764526367} 11/07/2021 09:24:52 - INFO - __main__ - Step 86783: {'lr': 0.0001932808938923386, 'samples': 16662336, 'steps': 86782, 'loss/train': 1.5268018245697021} 11/07/2021 09:24:52 - INFO - __main__ - Step 86784: {'lr': 0.00019327572554180518, 'samples': 16662528, 'steps': 86783, 'loss/train': 1.5748733282089233} 11/07/2021 09:24:52 - INFO - __main__ - Step 86785: {'lr': 0.00019327055721683086, 'samples': 16662720, 'steps': 86784, 'loss/train': 1.6213603019714355} 11/07/2021 09:24:53 - INFO - __main__ - Step 86786: {'lr': 0.00019326538891741802, 'samples': 16662912, 'steps': 86785, 'loss/train': 1.6370515823364258} 11/07/2021 09:24:53 - INFO - __main__ - Step 86787: {'lr': 0.00019326022064356885, 'samples': 16663104, 'steps': 86786, 'loss/train': 1.4992707967758179} 11/07/2021 09:24:54 - INFO - __main__ - Step 86788: {'lr': 0.00019325505239528576, 'samples': 16663296, 'steps': 86787, 'loss/train': 1.3066189289093018} 11/07/2021 09:24:55 - INFO - __main__ - Step 86789: {'lr': 0.00019324988417257106, 'samples': 16663488, 'steps': 86788, 'loss/train': 1.7784550189971924} 11/07/2021 09:24:55 - INFO - __main__ - Step 86790: {'lr': 0.00019324471597542708, 'samples': 16663680, 'steps': 86789, 'loss/train': 0.8372526168823242} 11/07/2021 09:24:55 - INFO - __main__ - Step 86791: {'lr': 0.00019323954780385626, 'samples': 16663872, 'steps': 86790, 'loss/train': 1.4918545484542847} 11/07/2021 09:24:56 - INFO - __main__ - Step 86792: {'lr': 0.00019323437965786073, 'samples': 16664064, 'steps': 86791, 'loss/train': 1.6076147556304932} 11/07/2021 09:24:57 - INFO - __main__ - Step 86793: {'lr': 0.00019322921153744292, 'samples': 16664256, 'steps': 86792, 'loss/train': 1.8639949560165405} 11/07/2021 09:24:57 - INFO - __main__ - Step 86794: {'lr': 0.00019322404344260512, 'samples': 16664448, 'steps': 86793, 'loss/train': 2.09924054145813} 11/07/2021 09:24:57 - INFO - __main__ - Step 86795: {'lr': 0.00019321887537334973, 'samples': 16664640, 'steps': 86794, 'loss/train': 1.3960944414138794} 11/07/2021 09:24:58 - INFO - __main__ - Step 86796: {'lr': 0.00019321370732967904, 'samples': 16664832, 'steps': 86795, 'loss/train': 0.08674680441617966} 11/07/2021 09:24:58 - INFO - __main__ - Step 86797: {'lr': 0.00019320853931159538, 'samples': 16665024, 'steps': 86796, 'loss/train': 1.2296596765518188} 11/07/2021 09:25:00 - INFO - __main__ - Step 86798: {'lr': 0.00019320337131910106, 'samples': 16665216, 'steps': 86797, 'loss/train': 1.4512202739715576} 11/07/2021 09:25:00 - INFO - __main__ - Step 86799: {'lr': 0.00019319820335219842, 'samples': 16665408, 'steps': 86798, 'loss/train': 1.2564336061477661} 11/07/2021 09:25:00 - INFO - __main__ - Step 86800: {'lr': 0.0001931930354108898, 'samples': 16665600, 'steps': 86799, 'loss/train': 0.8704970479011536} 11/07/2021 09:25:01 - INFO - __main__ - Step 86801: {'lr': 0.00019318786749517752, 'samples': 16665792, 'steps': 86800, 'loss/train': 0.165896475315094} 11/07/2021 09:25:01 - INFO - __main__ - Step 86802: {'lr': 0.0001931826996050639, 'samples': 16665984, 'steps': 86801, 'loss/train': 1.1140964031219482} 11/07/2021 09:25:01 - INFO - __main__ - Step 86803: {'lr': 0.0001931775317405513, 'samples': 16666176, 'steps': 86802, 'loss/train': 0.8135704398155212} 11/07/2021 09:25:02 - INFO - __main__ - Step 86804: {'lr': 0.00019317236390164206, 'samples': 16666368, 'steps': 86803, 'loss/train': 1.6405454874038696} 11/07/2021 09:25:03 - INFO - __main__ - Step 86805: {'lr': 0.00019316719608833844, 'samples': 16666560, 'steps': 86804, 'loss/train': 1.232546329498291} 11/07/2021 09:25:03 - INFO - __main__ - Step 86806: {'lr': 0.00019316202830064279, 'samples': 16666752, 'steps': 86805, 'loss/train': 1.179831862449646} 11/07/2021 09:25:03 - INFO - __main__ - Step 86807: {'lr': 0.00019315686053855745, 'samples': 16666944, 'steps': 86806, 'loss/train': 1.2890185117721558} 11/07/2021 09:25:05 - INFO - __main__ - Step 86808: {'lr': 0.0001931516928020848, 'samples': 16667136, 'steps': 86807, 'loss/train': 1.5034173727035522} 11/07/2021 09:25:05 - INFO - __main__ - Step 86809: {'lr': 0.00019314652509122706, 'samples': 16667328, 'steps': 86808, 'loss/train': 1.3192640542984009} 11/07/2021 09:25:05 - INFO - __main__ - Step 86810: {'lr': 0.00019314135740598664, 'samples': 16667520, 'steps': 86809, 'loss/train': 1.272176742553711} 11/07/2021 09:25:06 - INFO - __main__ - Step 86811: {'lr': 0.00019313618974636587, 'samples': 16667712, 'steps': 86810, 'loss/train': 0.878127932548523} 11/07/2021 09:25:06 - INFO - __main__ - Step 86812: {'lr': 0.000193131022112367, 'samples': 16667904, 'steps': 86811, 'loss/train': 1.2391570806503296} 11/07/2021 09:25:06 - INFO - __main__ - Step 86813: {'lr': 0.00019312585450399246, 'samples': 16668096, 'steps': 86812, 'loss/train': 1.5095605850219727} 11/07/2021 09:25:07 - INFO - __main__ - Step 86814: {'lr': 0.00019312068692124452, 'samples': 16668288, 'steps': 86813, 'loss/train': 1.2013453245162964} 11/07/2021 09:25:08 - INFO - __main__ - Step 86815: {'lr': 0.00019311551936412551, 'samples': 16668480, 'steps': 86814, 'loss/train': 0.6971268057823181} 11/07/2021 09:25:08 - INFO - __main__ - Step 86816: {'lr': 0.00019311035183263778, 'samples': 16668672, 'steps': 86815, 'loss/train': 1.0484933853149414} 11/07/2021 09:25:08 - INFO - __main__ - Step 86817: {'lr': 0.00019310518432678366, 'samples': 16668864, 'steps': 86816, 'loss/train': 1.136379361152649} 11/07/2021 09:25:09 - INFO - __main__ - Step 86818: {'lr': 0.00019310001684656546, 'samples': 16669056, 'steps': 86817, 'loss/train': 1.131972074508667} 11/07/2021 09:25:10 - INFO - __main__ - Step 86819: {'lr': 0.00019309484939198558, 'samples': 16669248, 'steps': 86818, 'loss/train': 0.7251962423324585} 11/07/2021 09:25:10 - INFO - __main__ - Step 86820: {'lr': 0.00019308968196304622, 'samples': 16669440, 'steps': 86819, 'loss/train': 1.5501669645309448} 11/07/2021 09:25:10 - INFO - __main__ - Step 86821: {'lr': 0.00019308451455974973, 'samples': 16669632, 'steps': 86820, 'loss/train': 1.1004283428192139} 11/07/2021 09:25:11 - INFO - __main__ - Step 86822: {'lr': 0.00019307934718209853, 'samples': 16669824, 'steps': 86821, 'loss/train': 1.2191232442855835} 11/07/2021 09:25:11 - INFO - __main__ - Step 86823: {'lr': 0.00019307417983009485, 'samples': 16670016, 'steps': 86822, 'loss/train': 1.1402920484542847} 11/07/2021 09:25:12 - INFO - __main__ - Step 86824: {'lr': 0.0001930690125037411, 'samples': 16670208, 'steps': 86823, 'loss/train': 1.5900577306747437} 11/07/2021 09:25:13 - INFO - __main__ - Step 86825: {'lr': 0.00019306384520303955, 'samples': 16670400, 'steps': 86824, 'loss/train': 1.6343520879745483} 11/07/2021 09:25:13 - INFO - __main__ - Step 86826: {'lr': 0.0001930586779279926, 'samples': 16670592, 'steps': 86825, 'loss/train': 1.5457507371902466} 11/07/2021 09:25:13 - INFO - __main__ - Step 86827: {'lr': 0.00019305351067860246, 'samples': 16670784, 'steps': 86826, 'loss/train': 1.1441024541854858} 11/07/2021 09:25:14 - INFO - __main__ - Step 86828: {'lr': 0.00019304834345487158, 'samples': 16670976, 'steps': 86827, 'loss/train': 1.1352815628051758} 11/07/2021 09:25:15 - INFO - __main__ - Step 86829: {'lr': 0.00019304317625680223, 'samples': 16671168, 'steps': 86828, 'loss/train': 1.4754595756530762} 11/07/2021 09:25:15 - INFO - __main__ - Step 86830: {'lr': 0.00019303800908439673, 'samples': 16671360, 'steps': 86829, 'loss/train': 1.1201215982437134} 11/07/2021 09:25:15 - INFO - __main__ - Step 86831: {'lr': 0.00019303284193765755, 'samples': 16671552, 'steps': 86830, 'loss/train': 1.262805461883545} 11/07/2021 09:25:16 - INFO - __main__ - Step 86832: {'lr': 0.00019302767481658677, 'samples': 16671744, 'steps': 86831, 'loss/train': 1.580456018447876} 11/07/2021 09:25:16 - INFO - __main__ - Step 86833: {'lr': 0.00019302250772118684, 'samples': 16671936, 'steps': 86832, 'loss/train': 1.32729971408844} 11/07/2021 09:25:17 - INFO - __main__ - Step 86834: {'lr': 0.00019301734065146008, 'samples': 16672128, 'steps': 86833, 'loss/train': 1.4809064865112305} 11/07/2021 09:25:17 - INFO - __main__ - Step 86835: {'lr': 0.00019301217360740886, 'samples': 16672320, 'steps': 86834, 'loss/train': 1.4315123558044434} 11/07/2021 09:25:18 - INFO - __main__ - Step 86836: {'lr': 0.00019300700658903547, 'samples': 16672512, 'steps': 86835, 'loss/train': 1.3452730178833008} 11/07/2021 09:25:18 - INFO - __main__ - Step 86837: {'lr': 0.00019300183959634223, 'samples': 16672704, 'steps': 86836, 'loss/train': 1.2078455686569214} 11/07/2021 09:25:19 - INFO - __main__ - Step 86838: {'lr': 0.00019299667262933149, 'samples': 16672896, 'steps': 86837, 'loss/train': 1.2387605905532837} 11/07/2021 09:25:20 - INFO - __main__ - Step 86839: {'lr': 0.00019299150568800554, 'samples': 16673088, 'steps': 86838, 'loss/train': 1.3648202419281006} 11/07/2021 09:25:20 - INFO - __main__ - Step 86840: {'lr': 0.0001929863387723668, 'samples': 16673280, 'steps': 86839, 'loss/train': 1.609269618988037} 11/07/2021 09:25:21 - INFO - __main__ - Step 86841: {'lr': 0.00019298117188241748, 'samples': 16673472, 'steps': 86840, 'loss/train': 0.962725818157196} 11/07/2021 09:25:21 - INFO - __main__ - Step 86842: {'lr': 0.00019297600501816, 'samples': 16673664, 'steps': 86841, 'loss/train': 1.440036416053772} 11/07/2021 09:25:21 - INFO - __main__ - Step 86843: {'lr': 0.00019297083817959663, 'samples': 16673856, 'steps': 86842, 'loss/train': 0.5340643525123596} 11/07/2021 09:25:22 - INFO - __main__ - Step 86844: {'lr': 0.0001929656713667297, 'samples': 16674048, 'steps': 86843, 'loss/train': 0.41965630650520325} 11/07/2021 09:25:23 - INFO - __main__ - Step 86845: {'lr': 0.00019296050457956172, 'samples': 16674240, 'steps': 86844, 'loss/train': 1.613967776298523} 11/07/2021 09:25:23 - INFO - __main__ - Step 86846: {'lr': 0.0001929553378180947, 'samples': 16674432, 'steps': 86845, 'loss/train': 1.560721755027771} 11/07/2021 09:25:24 - INFO - __main__ - Step 86847: {'lr': 0.00019295017108233114, 'samples': 16674624, 'steps': 86846, 'loss/train': 0.9008715748786926} 11/07/2021 09:25:24 - INFO - __main__ - Step 86848: {'lr': 0.00019294500437227335, 'samples': 16674816, 'steps': 86847, 'loss/train': 0.9255654811859131} 11/07/2021 09:25:25 - INFO - __main__ - Step 86849: {'lr': 0.00019293983768792366, 'samples': 16675008, 'steps': 86848, 'loss/train': 1.2294219732284546} 11/07/2021 09:25:25 - INFO - __main__ - Step 86850: {'lr': 0.0001929346710292844, 'samples': 16675200, 'steps': 86849, 'loss/train': 1.3254507780075073} 11/07/2021 09:25:26 - INFO - __main__ - Step 86851: {'lr': 0.00019292950439635793, 'samples': 16675392, 'steps': 86850, 'loss/train': 0.7746249437332153} 11/07/2021 09:25:26 - INFO - __main__ - Step 86852: {'lr': 0.0001929243377891465, 'samples': 16675584, 'steps': 86851, 'loss/train': 1.1449955701828003} 11/07/2021 09:25:26 - INFO - __main__ - Step 86853: {'lr': 0.00019291917120765252, 'samples': 16675776, 'steps': 86852, 'loss/train': 1.2581100463867188} 11/07/2021 09:25:27 - INFO - __main__ - Step 86854: {'lr': 0.00019291400465187825, 'samples': 16675968, 'steps': 86853, 'loss/train': 1.1290777921676636} 11/07/2021 09:25:28 - INFO - __main__ - Step 86855: {'lr': 0.00019290883812182605, 'samples': 16676160, 'steps': 86854, 'loss/train': 1.436328411102295} 11/07/2021 09:25:28 - INFO - __main__ - Step 86856: {'lr': 0.0001929036716174983, 'samples': 16676352, 'steps': 86855, 'loss/train': 0.6054258942604065} 11/07/2021 09:25:28 - INFO - __main__ - Step 86857: {'lr': 0.00019289850513889718, 'samples': 16676544, 'steps': 86856, 'loss/train': 1.2569516897201538} 11/07/2021 09:25:29 - INFO - __main__ - Step 86858: {'lr': 0.00019289333868602527, 'samples': 16676736, 'steps': 86857, 'loss/train': 0.5284473299980164} 11/07/2021 09:25:29 - INFO - __main__ - Step 86859: {'lr': 0.00019288817225888465, 'samples': 16676928, 'steps': 86858, 'loss/train': 1.8853837251663208} 11/07/2021 09:25:30 - INFO - __main__ - Step 86860: {'lr': 0.0001928830058574777, 'samples': 16677120, 'steps': 86859, 'loss/train': 1.6687966585159302} 11/07/2021 09:25:31 - INFO - __main__ - Step 86861: {'lr': 0.0001928778394818068, 'samples': 16677312, 'steps': 86860, 'loss/train': 1.4477254152297974} 11/07/2021 09:25:31 - INFO - __main__ - Step 86862: {'lr': 0.0001928726731318743, 'samples': 16677504, 'steps': 86861, 'loss/train': 1.1968605518341064} 11/07/2021 09:25:31 - INFO - __main__ - Step 86863: {'lr': 0.00019286750680768246, 'samples': 16677696, 'steps': 86862, 'loss/train': 1.407029628753662} 11/07/2021 09:25:32 - INFO - __main__ - Step 86864: {'lr': 0.00019286234050923363, 'samples': 16677888, 'steps': 86863, 'loss/train': 1.5352321863174438} 11/07/2021 09:25:33 - INFO - __main__ - Step 86865: {'lr': 0.00019285717423653015, 'samples': 16678080, 'steps': 86864, 'loss/train': 0.7194256782531738} 11/07/2021 09:25:33 - INFO - __main__ - Step 86866: {'lr': 0.00019285200798957436, 'samples': 16678272, 'steps': 86865, 'loss/train': 1.4056527614593506} 11/07/2021 09:25:33 - INFO - __main__ - Step 86867: {'lr': 0.0001928468417683686, 'samples': 16678464, 'steps': 86866, 'loss/train': 1.2475577592849731} 11/07/2021 09:25:34 - INFO - __main__ - Step 86868: {'lr': 0.00019284167557291512, 'samples': 16678656, 'steps': 86867, 'loss/train': 1.592468500137329} 11/07/2021 09:25:34 - INFO - __main__ - Step 86869: {'lr': 0.00019283650940321633, 'samples': 16678848, 'steps': 86868, 'loss/train': 1.4799203872680664} 11/07/2021 09:25:35 - INFO - __main__ - Step 86870: {'lr': 0.00019283134325927452, 'samples': 16679040, 'steps': 86869, 'loss/train': 1.3451963663101196} 11/07/2021 09:25:35 - INFO - __main__ - Step 86871: {'lr': 0.00019282617714109202, 'samples': 16679232, 'steps': 86870, 'loss/train': 1.3103972673416138} 11/07/2021 09:25:36 - INFO - __main__ - Step 86872: {'lr': 0.00019282101104867126, 'samples': 16679424, 'steps': 86871, 'loss/train': 0.6212623715400696} 11/07/2021 09:25:36 - INFO - __main__ - Step 86873: {'lr': 0.0001928158449820144, 'samples': 16679616, 'steps': 86872, 'loss/train': 1.0911630392074585} 11/07/2021 09:25:36 - INFO - __main__ - Step 86874: {'lr': 0.00019281067894112377, 'samples': 16679808, 'steps': 86873, 'loss/train': 1.1485319137573242} 11/07/2021 09:25:38 - INFO - __main__ - Step 86875: {'lr': 0.00019280551292600182, 'samples': 16680000, 'steps': 86874, 'loss/train': 1.502768635749817} 11/07/2021 09:25:38 - INFO - __main__ - Step 86876: {'lr': 0.0001928003469366508, 'samples': 16680192, 'steps': 86875, 'loss/train': 1.5103623867034912} 11/07/2021 09:25:38 - INFO - __main__ - Step 86877: {'lr': 0.0001927951809730731, 'samples': 16680384, 'steps': 86876, 'loss/train': 1.356544852256775} 11/07/2021 09:25:39 - INFO - __main__ - Step 86878: {'lr': 0.00019279001503527096, 'samples': 16680576, 'steps': 86877, 'loss/train': 1.2610771656036377} 11/07/2021 09:25:39 - INFO - __main__ - Step 86879: {'lr': 0.00019278484912324678, 'samples': 16680768, 'steps': 86878, 'loss/train': 1.7576195001602173} 11/07/2021 09:25:39 - INFO - __main__ - Step 86880: {'lr': 0.00019277968323700285, 'samples': 16680960, 'steps': 86879, 'loss/train': 1.1482958793640137} 11/07/2021 09:25:40 - INFO - __main__ - Step 86881: {'lr': 0.00019277451737654153, 'samples': 16681152, 'steps': 86880, 'loss/train': 1.2743533849716187} 11/07/2021 09:25:41 - INFO - __main__ - Step 86882: {'lr': 0.00019276935154186511, 'samples': 16681344, 'steps': 86881, 'loss/train': 1.173423171043396} 11/07/2021 09:25:41 - INFO - __main__ - Step 86883: {'lr': 0.00019276418573297596, 'samples': 16681536, 'steps': 86882, 'loss/train': 1.8927160501480103} 11/07/2021 09:25:41 - INFO - __main__ - Step 86884: {'lr': 0.0001927590199498764, 'samples': 16681728, 'steps': 86883, 'loss/train': 1.4860479831695557} 11/07/2021 09:25:42 - INFO - __main__ - Step 86885: {'lr': 0.0001927538541925688, 'samples': 16681920, 'steps': 86884, 'loss/train': 1.205252766609192} 11/07/2021 09:25:43 - INFO - __main__ - Step 86886: {'lr': 0.00019274868846105533, 'samples': 16682112, 'steps': 86885, 'loss/train': 0.8218900561332703} 11/07/2021 09:25:43 - INFO - __main__ - Step 86887: {'lr': 0.00019274352275533844, 'samples': 16682304, 'steps': 86886, 'loss/train': 1.1537662744522095} 11/07/2021 09:25:43 - INFO - __main__ - Step 86888: {'lr': 0.00019273835707542042, 'samples': 16682496, 'steps': 86887, 'loss/train': 1.54152512550354} 11/07/2021 09:25:44 - INFO - __main__ - Step 86889: {'lr': 0.00019273319142130364, 'samples': 16682688, 'steps': 86888, 'loss/train': 1.385713815689087} 11/07/2021 09:25:44 - INFO - __main__ - Step 86890: {'lr': 0.00019272802579299036, 'samples': 16682880, 'steps': 86889, 'loss/train': 1.3677022457122803} 11/07/2021 09:25:45 - INFO - __main__ - Step 86891: {'lr': 0.000192722860190483, 'samples': 16683072, 'steps': 86890, 'loss/train': 1.194905161857605} 11/07/2021 09:25:46 - INFO - __main__ - Step 86892: {'lr': 0.0001927176946137838, 'samples': 16683264, 'steps': 86891, 'loss/train': 1.442295789718628} 11/07/2021 09:25:46 - INFO - __main__ - Step 86893: {'lr': 0.0001927125290628951, 'samples': 16683456, 'steps': 86892, 'loss/train': 1.5207254886627197} 11/07/2021 09:25:47 - INFO - __main__ - Step 86894: {'lr': 0.00019270736353781928, 'samples': 16683648, 'steps': 86893, 'loss/train': 1.4169161319732666} 11/07/2021 09:25:47 - INFO - __main__ - Step 86895: {'lr': 0.00019270219803855864, 'samples': 16683840, 'steps': 86894, 'loss/train': 0.49442362785339355} 11/07/2021 09:25:48 - INFO - __main__ - Step 86896: {'lr': 0.0001926970325651155, 'samples': 16684032, 'steps': 86895, 'loss/train': 0.9560973048210144} 11/07/2021 09:25:48 - INFO - __main__ - Step 86897: {'lr': 0.0001926918671174922, 'samples': 16684224, 'steps': 86896, 'loss/train': 1.6416704654693604} 11/07/2021 09:25:49 - INFO - __main__ - Step 86898: {'lr': 0.00019268670169569115, 'samples': 16684416, 'steps': 86897, 'loss/train': 1.345856785774231} 11/07/2021 09:25:49 - INFO - __main__ - Step 86899: {'lr': 0.0001926815362997145, 'samples': 16684608, 'steps': 86898, 'loss/train': 1.2454396486282349} 11/07/2021 09:25:49 - INFO - __main__ - Step 86900: {'lr': 0.00019267637092956468, 'samples': 16684800, 'steps': 86899, 'loss/train': 1.7034310102462769} 11/07/2021 09:25:50 - INFO - __main__ - Step 86901: {'lr': 0.00019267120558524395, 'samples': 16684992, 'steps': 86900, 'loss/train': 1.1366827487945557} 11/07/2021 09:25:51 - INFO - __main__ - Step 86902: {'lr': 0.00019266604026675472, 'samples': 16685184, 'steps': 86901, 'loss/train': 0.9261730909347534} 11/07/2021 09:25:51 - INFO - __main__ - Step 86903: {'lr': 0.0001926608749740993, 'samples': 16685376, 'steps': 86902, 'loss/train': 1.4400646686553955} 11/07/2021 09:25:51 - INFO - __main__ - Step 86904: {'lr': 0.00019265570970728, 'samples': 16685568, 'steps': 86903, 'loss/train': 1.0482465028762817} 11/07/2021 09:25:52 - INFO - __main__ - Step 86905: {'lr': 0.00019265054446629916, 'samples': 16685760, 'steps': 86904, 'loss/train': 1.124335765838623} 11/07/2021 09:25:52 - INFO - __main__ - Step 86906: {'lr': 0.00019264537925115904, 'samples': 16685952, 'steps': 86905, 'loss/train': 1.167860507965088} 11/07/2021 09:25:53 - INFO - __main__ - Step 86907: {'lr': 0.0001926402140618621, 'samples': 16686144, 'steps': 86906, 'loss/train': 1.1196889877319336} 11/07/2021 09:25:54 - INFO - __main__ - Step 86908: {'lr': 0.00019263504889841055, 'samples': 16686336, 'steps': 86907, 'loss/train': 1.335968255996704} 11/07/2021 09:25:54 - INFO - __main__ - Step 86909: {'lr': 0.00019262988376080685, 'samples': 16686528, 'steps': 86908, 'loss/train': 1.233593225479126} 11/07/2021 09:25:54 - INFO - __main__ - Step 86910: {'lr': 0.00019262471864905315, 'samples': 16686720, 'steps': 86909, 'loss/train': 1.3225587606430054} 11/07/2021 09:25:55 - INFO - __main__ - Step 86911: {'lr': 0.00019261955356315188, 'samples': 16686912, 'steps': 86910, 'loss/train': 1.2267515659332275} 11/07/2021 09:25:56 - INFO - __main__ - Step 86912: {'lr': 0.00019261438850310541, 'samples': 16687104, 'steps': 86911, 'loss/train': 1.3403937816619873} 11/07/2021 09:25:56 - INFO - __main__ - Step 86913: {'lr': 0.00019260922346891595, 'samples': 16687296, 'steps': 86912, 'loss/train': 1.4852869510650635} 11/07/2021 09:25:56 - INFO - __main__ - Step 86914: {'lr': 0.00019260405846058593, 'samples': 16687488, 'steps': 86913, 'loss/train': 1.2449370622634888} 11/07/2021 09:25:57 - INFO - __main__ - Step 86915: {'lr': 0.0001925988934781176, 'samples': 16687680, 'steps': 86914, 'loss/train': 1.1967440843582153} 11/07/2021 09:25:57 - INFO - __main__ - Step 86916: {'lr': 0.0001925937285215133, 'samples': 16687872, 'steps': 86915, 'loss/train': 1.0206884145736694} 11/07/2021 09:25:59 - INFO - __main__ - Step 86917: {'lr': 0.00019258856359077541, 'samples': 16688064, 'steps': 86916, 'loss/train': 1.5531593561172485} 11/07/2021 09:26:00 - INFO - __main__ - Step 86918: {'lr': 0.00019258339868590625, 'samples': 16688256, 'steps': 86917, 'loss/train': 1.150923728942871} 11/07/2021 09:26:00 - INFO - __main__ - Step 86919: {'lr': 0.00019257823380690808, 'samples': 16688448, 'steps': 86918, 'loss/train': 0.27757084369659424} 11/07/2021 09:26:00 - INFO - __main__ - Step 86920: {'lr': 0.00019257306895378336, 'samples': 16688640, 'steps': 86919, 'loss/train': 0.20042845606803894} 11/07/2021 09:26:01 - INFO - __main__ - Step 86921: {'lr': 0.00019256790412653423, 'samples': 16688832, 'steps': 86920, 'loss/train': 0.08575911819934845} 11/07/2021 09:26:01 - INFO - __main__ - Step 86922: {'lr': 0.00019256273932516317, 'samples': 16689024, 'steps': 86921, 'loss/train': 1.4934507608413696} 11/07/2021 09:26:01 - INFO - __main__ - Step 86923: {'lr': 0.0001925575745496724, 'samples': 16689216, 'steps': 86922, 'loss/train': 0.08903008699417114} 11/07/2021 09:26:02 - INFO - __main__ - Step 86924: {'lr': 0.00019255240980006432, 'samples': 16689408, 'steps': 86923, 'loss/train': 1.4819139242172241} 11/07/2021 09:26:03 - INFO - __main__ - Step 86925: {'lr': 0.0001925472450763413, 'samples': 16689600, 'steps': 86924, 'loss/train': 1.309037446975708} 11/07/2021 09:26:03 - INFO - __main__ - Step 86926: {'lr': 0.00019254208037850556, 'samples': 16689792, 'steps': 86925, 'loss/train': 1.0079008340835571} 11/07/2021 09:26:03 - INFO - __main__ - Step 86927: {'lr': 0.00019253691570655945, 'samples': 16689984, 'steps': 86926, 'loss/train': 1.36038339138031} 11/07/2021 09:26:04 - INFO - __main__ - Step 86928: {'lr': 0.00019253175106050536, 'samples': 16690176, 'steps': 86927, 'loss/train': 1.3257405757904053} 11/07/2021 09:26:05 - INFO - __main__ - Step 86929: {'lr': 0.00019252658644034555, 'samples': 16690368, 'steps': 86928, 'loss/train': 1.3135924339294434} 11/07/2021 09:26:05 - INFO - __main__ - Step 86930: {'lr': 0.00019252142184608234, 'samples': 16690560, 'steps': 86929, 'loss/train': 1.5622402429580688} 11/07/2021 09:26:06 - INFO - __main__ - Step 86931: {'lr': 0.00019251625727771817, 'samples': 16690752, 'steps': 86930, 'loss/train': 0.8238165974617004} 11/07/2021 09:26:06 - INFO - __main__ - Step 86932: {'lr': 0.00019251109273525524, 'samples': 16690944, 'steps': 86931, 'loss/train': 1.4487274885177612} 11/07/2021 09:26:06 - INFO - __main__ - Step 86933: {'lr': 0.00019250592821869596, 'samples': 16691136, 'steps': 86932, 'loss/train': 0.9166833758354187} 11/07/2021 09:26:07 - INFO - __main__ - Step 86934: {'lr': 0.00019250076372804258, 'samples': 16691328, 'steps': 86933, 'loss/train': 1.3888325691223145} 11/07/2021 09:26:07 - INFO - __main__ - Step 86935: {'lr': 0.00019249559926329745, 'samples': 16691520, 'steps': 86934, 'loss/train': 0.8118363618850708} 11/07/2021 09:26:08 - INFO - __main__ - Step 86936: {'lr': 0.00019249043482446294, 'samples': 16691712, 'steps': 86935, 'loss/train': 1.3684537410736084} 11/07/2021 09:26:08 - INFO - __main__ - Step 86937: {'lr': 0.00019248527041154136, 'samples': 16691904, 'steps': 86936, 'loss/train': 1.454566478729248} 11/07/2021 09:26:09 - INFO - __main__ - Step 86938: {'lr': 0.00019248010602453502, 'samples': 16692096, 'steps': 86937, 'loss/train': 1.4957143068313599} 11/07/2021 09:26:10 - INFO - __main__ - Step 86939: {'lr': 0.00019247494166344632, 'samples': 16692288, 'steps': 86938, 'loss/train': 1.3878217935562134} 11/07/2021 09:26:10 - INFO - __main__ - Step 86940: {'lr': 0.00019246977732827745, 'samples': 16692480, 'steps': 86939, 'loss/train': 1.1987749338150024} 11/07/2021 09:26:11 - INFO - __main__ - Step 86941: {'lr': 0.00019246461301903082, 'samples': 16692672, 'steps': 86940, 'loss/train': 1.229043960571289} 11/07/2021 09:26:11 - INFO - __main__ - Step 86942: {'lr': 0.0001924594487357088, 'samples': 16692864, 'steps': 86941, 'loss/train': 1.4818437099456787} 11/07/2021 09:26:11 - INFO - __main__ - Step 86943: {'lr': 0.00019245428447831362, 'samples': 16693056, 'steps': 86942, 'loss/train': 1.7619706392288208} 11/07/2021 09:26:12 - INFO - __main__ - Step 86944: {'lr': 0.00019244912024684764, 'samples': 16693248, 'steps': 86943, 'loss/train': 1.7485222816467285} 11/07/2021 09:26:13 - INFO - __main__ - Step 86945: {'lr': 0.00019244395604131321, 'samples': 16693440, 'steps': 86944, 'loss/train': 1.4561620950698853} 11/07/2021 09:26:13 - INFO - __main__ - Step 86946: {'lr': 0.00019243879186171264, 'samples': 16693632, 'steps': 86945, 'loss/train': 1.3367197513580322} 11/07/2021 09:26:14 - INFO - __main__ - Step 86947: {'lr': 0.00019243362770804827, 'samples': 16693824, 'steps': 86946, 'loss/train': 1.0225694179534912} 11/07/2021 09:26:14 - INFO - __main__ - Step 86948: {'lr': 0.0001924284635803224, 'samples': 16694016, 'steps': 86947, 'loss/train': 1.476378083229065} 11/07/2021 09:26:14 - INFO - __main__ - Step 86949: {'lr': 0.00019242329947853737, 'samples': 16694208, 'steps': 86948, 'loss/train': 1.3903958797454834} 11/07/2021 09:26:15 - INFO - __main__ - Step 86950: {'lr': 0.00019241813540269554, 'samples': 16694400, 'steps': 86949, 'loss/train': 1.436843752861023} 11/07/2021 09:26:16 - INFO - __main__ - Step 86951: {'lr': 0.0001924129713527992, 'samples': 16694592, 'steps': 86950, 'loss/train': 1.681504726409912} 11/07/2021 09:26:16 - INFO - __main__ - Step 86952: {'lr': 0.00019240780732885073, 'samples': 16694784, 'steps': 86951, 'loss/train': 1.8126020431518555} 11/07/2021 09:26:16 - INFO - __main__ - Step 86953: {'lr': 0.00019240264333085245, 'samples': 16694976, 'steps': 86952, 'loss/train': 1.4683892726898193} 11/07/2021 09:26:17 - INFO - __main__ - Step 86954: {'lr': 0.00019239747935880655, 'samples': 16695168, 'steps': 86953, 'loss/train': 1.518540859222412} 11/07/2021 09:26:17 - INFO - __main__ - Step 86955: {'lr': 0.00019239231541271547, 'samples': 16695360, 'steps': 86954, 'loss/train': 1.1061618328094482} 11/07/2021 09:26:18 - INFO - __main__ - Step 86956: {'lr': 0.00019238715149258155, 'samples': 16695552, 'steps': 86955, 'loss/train': 1.6327781677246094} 11/07/2021 09:26:18 - INFO - __main__ - Step 86957: {'lr': 0.00019238198759840707, 'samples': 16695744, 'steps': 86956, 'loss/train': 1.858065128326416} 11/07/2021 09:26:19 - INFO - __main__ - Step 86958: {'lr': 0.00019237682373019437, 'samples': 16695936, 'steps': 86957, 'loss/train': 0.8872216939926147} 11/07/2021 09:26:19 - INFO - __main__ - Step 86959: {'lr': 0.00019237165988794576, 'samples': 16696128, 'steps': 86958, 'loss/train': 1.254133701324463} 11/07/2021 09:26:20 - INFO - __main__ - Step 86960: {'lr': 0.00019236649607166364, 'samples': 16696320, 'steps': 86959, 'loss/train': 1.2755014896392822} 11/07/2021 09:26:21 - INFO - __main__ - Step 86961: {'lr': 0.00019236133228135027, 'samples': 16696512, 'steps': 86960, 'loss/train': 0.9173762798309326} 11/07/2021 09:26:21 - INFO - __main__ - Step 86962: {'lr': 0.000192356168517008, 'samples': 16696704, 'steps': 86961, 'loss/train': 1.4374154806137085} 11/07/2021 09:26:21 - INFO - __main__ - Step 86963: {'lr': 0.00019235100477863916, 'samples': 16696896, 'steps': 86962, 'loss/train': 1.6533197164535522} 11/07/2021 09:26:22 - INFO - __main__ - Step 86964: {'lr': 0.00019234584106624604, 'samples': 16697088, 'steps': 86963, 'loss/train': 1.285894513130188} 11/07/2021 09:26:22 - INFO - __main__ - Step 86965: {'lr': 0.000192340677379831, 'samples': 16697280, 'steps': 86964, 'loss/train': 1.2419264316558838} 11/07/2021 09:26:23 - INFO - __main__ - Step 86966: {'lr': 0.00019233551371939647, 'samples': 16697472, 'steps': 86965, 'loss/train': 1.7611688375473022} 11/07/2021 09:26:23 - INFO - __main__ - Step 86967: {'lr': 0.00019233035008494456, 'samples': 16697664, 'steps': 86966, 'loss/train': 1.716157078742981} 11/07/2021 09:26:24 - INFO - __main__ - Step 86968: {'lr': 0.00019232518647647773, 'samples': 16697856, 'steps': 86967, 'loss/train': 1.019762396812439} 11/07/2021 09:26:24 - INFO - __main__ - Step 86969: {'lr': 0.00019232002289399825, 'samples': 16698048, 'steps': 86968, 'loss/train': 1.7270550727844238} 11/07/2021 09:26:25 - INFO - __main__ - Step 86970: {'lr': 0.00019231485933750848, 'samples': 16698240, 'steps': 86969, 'loss/train': 0.7139232754707336} 11/07/2021 09:26:26 - INFO - __main__ - Step 86971: {'lr': 0.00019230969580701077, 'samples': 16698432, 'steps': 86970, 'loss/train': 1.2291773557662964} 11/07/2021 09:26:26 - INFO - __main__ - Step 86972: {'lr': 0.00019230453230250738, 'samples': 16698624, 'steps': 86971, 'loss/train': 0.6685494780540466} 11/07/2021 09:26:26 - INFO - __main__ - Step 86973: {'lr': 0.00019229936882400074, 'samples': 16698816, 'steps': 86972, 'loss/train': 1.6273372173309326} 11/07/2021 09:26:27 - INFO - __main__ - Step 86974: {'lr': 0.00019229420537149306, 'samples': 16699008, 'steps': 86973, 'loss/train': 0.680396318435669} 11/07/2021 09:26:27 - INFO - __main__ - Step 86975: {'lr': 0.00019228904194498674, 'samples': 16699200, 'steps': 86974, 'loss/train': 1.0979934930801392} 11/07/2021 09:26:28 - INFO - __main__ - Step 86976: {'lr': 0.00019228387854448406, 'samples': 16699392, 'steps': 86975, 'loss/train': 0.8795528411865234} 11/07/2021 09:26:29 - INFO - __main__ - Step 86977: {'lr': 0.0001922787151699874, 'samples': 16699584, 'steps': 86976, 'loss/train': 1.1305147409439087} 11/07/2021 09:26:29 - INFO - __main__ - Step 86978: {'lr': 0.00019227355182149905, 'samples': 16699776, 'steps': 86977, 'loss/train': 1.299600601196289} 11/07/2021 09:26:29 - INFO - __main__ - Step 86979: {'lr': 0.00019226838849902147, 'samples': 16699968, 'steps': 86978, 'loss/train': 1.2053802013397217} 11/07/2021 09:26:30 - INFO - __main__ - Step 86980: {'lr': 0.00019226322520255674, 'samples': 16700160, 'steps': 86979, 'loss/train': 1.6765714883804321} 11/07/2021 09:26:30 - INFO - __main__ - Step 86981: {'lr': 0.0001922580619321073, 'samples': 16700352, 'steps': 86980, 'loss/train': 0.6682043671607971} 11/07/2021 09:26:31 - INFO - __main__ - Step 86982: {'lr': 0.00019225289868767553, 'samples': 16700544, 'steps': 86981, 'loss/train': 0.6553238034248352} 11/07/2021 09:26:31 - INFO - __main__ - Step 86983: {'lr': 0.0001922477354692637, 'samples': 16700736, 'steps': 86982, 'loss/train': 1.2676708698272705} 11/07/2021 09:26:32 - INFO - __main__ - Step 86984: {'lr': 0.0001922425722768741, 'samples': 16700928, 'steps': 86983, 'loss/train': 1.6107479333877563} 11/07/2021 09:26:32 - INFO - __main__ - Step 86985: {'lr': 0.00019223740911050916, 'samples': 16701120, 'steps': 86984, 'loss/train': 1.1534072160720825} 11/07/2021 09:26:32 - INFO - __main__ - Step 86986: {'lr': 0.00019223224597017115, 'samples': 16701312, 'steps': 86985, 'loss/train': 1.4064216613769531} 11/07/2021 09:26:34 - INFO - __main__ - Step 86987: {'lr': 0.0001922270828558624, 'samples': 16701504, 'steps': 86986, 'loss/train': 1.4360861778259277} 11/07/2021 09:26:34 - INFO - __main__ - Step 86988: {'lr': 0.0001922219197675852, 'samples': 16701696, 'steps': 86987, 'loss/train': 0.8647075891494751} 11/07/2021 09:26:34 - INFO - __main__ - Step 86989: {'lr': 0.0001922167567053419, 'samples': 16701888, 'steps': 86988, 'loss/train': 1.3077301979064941} 11/07/2021 09:26:35 - INFO - __main__ - Step 86990: {'lr': 0.00019221159366913487, 'samples': 16702080, 'steps': 86989, 'loss/train': 0.8362762928009033} 11/07/2021 09:26:35 - INFO - __main__ - Step 86991: {'lr': 0.00019220643065896638, 'samples': 16702272, 'steps': 86990, 'loss/train': 1.5020016431808472} 11/07/2021 09:26:36 - INFO - __main__ - Step 86992: {'lr': 0.0001922012676748388, 'samples': 16702464, 'steps': 86991, 'loss/train': 1.080112338066101} 11/07/2021 09:26:36 - INFO - __main__ - Step 86993: {'lr': 0.00019219610471675458, 'samples': 16702656, 'steps': 86992, 'loss/train': 1.7566152811050415} 11/07/2021 09:26:37 - INFO - __main__ - Step 86994: {'lr': 0.00019219094178471574, 'samples': 16702848, 'steps': 86993, 'loss/train': 0.9361863136291504} 11/07/2021 09:26:37 - INFO - __main__ - Step 86995: {'lr': 0.0001921857788787248, 'samples': 16703040, 'steps': 86994, 'loss/train': 1.3452404737472534} 11/07/2021 09:26:37 - INFO - __main__ - Step 86996: {'lr': 0.00019218061599878407, 'samples': 16703232, 'steps': 86995, 'loss/train': 1.5540752410888672} 11/07/2021 09:26:38 - INFO - __main__ - Step 86997: {'lr': 0.00019217545314489582, 'samples': 16703424, 'steps': 86996, 'loss/train': 1.5050331354141235} 11/07/2021 09:26:39 - INFO - __main__ - Step 86998: {'lr': 0.00019217029031706243, 'samples': 16703616, 'steps': 86997, 'loss/train': 1.578718662261963} 11/07/2021 09:26:39 - INFO - __main__ - Step 86999: {'lr': 0.00019216512751528623, 'samples': 16703808, 'steps': 86998, 'loss/train': 1.4545907974243164} 11/07/2021 09:26:39 - INFO - __main__ - Step 87000: {'lr': 0.0001921599647395695, 'samples': 16704000, 'steps': 86999, 'loss/train': 1.0071686506271362} 11/07/2021 09:26:40 - INFO - __main__ - Step 87001: {'lr': 0.00019215480198991463, 'samples': 16704192, 'steps': 87000, 'loss/train': 1.2593156099319458} 11/07/2021 09:26:41 - INFO - __main__ - Step 87002: {'lr': 0.0001921496392663239, 'samples': 16704384, 'steps': 87001, 'loss/train': 1.2389944791793823} 11/07/2021 09:26:41 - INFO - __main__ - Step 87003: {'lr': 0.00019214447656879966, 'samples': 16704576, 'steps': 87002, 'loss/train': 0.7734476923942566} 11/07/2021 09:26:41 - INFO - __main__ - Step 87004: {'lr': 0.0001921393138973442, 'samples': 16704768, 'steps': 87003, 'loss/train': 1.2929198741912842} 11/07/2021 09:26:42 - INFO - __main__ - Step 87005: {'lr': 0.0001921341512519599, 'samples': 16704960, 'steps': 87004, 'loss/train': 1.6266299486160278} 11/07/2021 09:26:42 - INFO - __main__ - Step 87006: {'lr': 0.00019212898863264915, 'samples': 16705152, 'steps': 87005, 'loss/train': 1.3240814208984375} 11/07/2021 09:26:43 - INFO - __main__ - Step 87007: {'lr': 0.00019212382603941408, 'samples': 16705344, 'steps': 87006, 'loss/train': 1.5345523357391357} 11/07/2021 09:26:43 - INFO - __main__ - Step 87008: {'lr': 0.00019211866347225716, 'samples': 16705536, 'steps': 87007, 'loss/train': 1.2155284881591797} 11/07/2021 09:26:44 - INFO - __main__ - Step 87009: {'lr': 0.00019211350093118062, 'samples': 16705728, 'steps': 87008, 'loss/train': 1.6278666257858276} 11/07/2021 09:26:44 - INFO - __main__ - Step 87010: {'lr': 0.00019210833841618686, 'samples': 16705920, 'steps': 87009, 'loss/train': 1.321919560432434} 11/07/2021 09:26:44 - INFO - __main__ - Step 87011: {'lr': 0.00019210317592727823, 'samples': 16706112, 'steps': 87010, 'loss/train': 1.0192362070083618} 11/07/2021 09:26:46 - INFO - __main__ - Step 87012: {'lr': 0.00019209801346445696, 'samples': 16706304, 'steps': 87011, 'loss/train': 1.2620570659637451} 11/07/2021 09:26:46 - INFO - __main__ - Step 87013: {'lr': 0.00019209285102772544, 'samples': 16706496, 'steps': 87012, 'loss/train': 1.3428739309310913} 11/07/2021 09:26:46 - INFO - __main__ - Step 87014: {'lr': 0.000192087688617086, 'samples': 16706688, 'steps': 87013, 'loss/train': 1.6426074504852295} 11/07/2021 09:26:47 - INFO - __main__ - Step 87015: {'lr': 0.00019208252623254096, 'samples': 16706880, 'steps': 87014, 'loss/train': 1.1361621618270874} 11/07/2021 09:26:47 - INFO - __main__ - Step 87016: {'lr': 0.00019207736387409264, 'samples': 16707072, 'steps': 87015, 'loss/train': 1.2751338481903076} 11/07/2021 09:26:47 - INFO - __main__ - Step 87017: {'lr': 0.00019207220154174336, 'samples': 16707264, 'steps': 87016, 'loss/train': 1.4714175462722778} 11/07/2021 09:26:49 - INFO - __main__ - Step 87018: {'lr': 0.00019206703923549544, 'samples': 16707456, 'steps': 87017, 'loss/train': 1.5659323930740356} 11/07/2021 09:26:49 - INFO - __main__ - Step 87019: {'lr': 0.00019206187695535134, 'samples': 16707648, 'steps': 87018, 'loss/train': 1.6066843271255493} 11/07/2021 09:26:49 - INFO - __main__ - Step 87020: {'lr': 0.00019205671470131318, 'samples': 16707840, 'steps': 87019, 'loss/train': 1.274690866470337} 11/07/2021 09:26:50 - INFO - __main__ - Step 87021: {'lr': 0.00019205155247338333, 'samples': 16708032, 'steps': 87020, 'loss/train': 0.12666049599647522} 11/07/2021 09:26:50 - INFO - __main__ - Step 87022: {'lr': 0.00019204639027156417, 'samples': 16708224, 'steps': 87021, 'loss/train': 1.3978281021118164} 11/07/2021 09:26:51 - INFO - __main__ - Step 87023: {'lr': 0.000192041228095858, 'samples': 16708416, 'steps': 87022, 'loss/train': 1.1842347383499146} 11/07/2021 09:26:51 - INFO - __main__ - Step 87024: {'lr': 0.0001920360659462672, 'samples': 16708608, 'steps': 87023, 'loss/train': 1.3834792375564575} 11/07/2021 09:26:52 - INFO - __main__ - Step 87025: {'lr': 0.000192030903822794, 'samples': 16708800, 'steps': 87024, 'loss/train': 1.3393887281417847} 11/07/2021 09:26:52 - INFO - __main__ - Step 87026: {'lr': 0.00019202574172544082, 'samples': 16708992, 'steps': 87025, 'loss/train': 0.7710685133934021} 11/07/2021 09:26:52 - INFO - __main__ - Step 87027: {'lr': 0.00019202057965420993, 'samples': 16709184, 'steps': 87026, 'loss/train': 1.0608347654342651} 11/07/2021 09:26:53 - INFO - __main__ - Step 87028: {'lr': 0.00019201541760910368, 'samples': 16709376, 'steps': 87027, 'loss/train': 1.5471845865249634} 11/07/2021 09:26:54 - INFO - __main__ - Step 87029: {'lr': 0.00019201025559012437, 'samples': 16709568, 'steps': 87028, 'loss/train': 1.1326959133148193} 11/07/2021 09:26:54 - INFO - __main__ - Step 87030: {'lr': 0.00019200509359727436, 'samples': 16709760, 'steps': 87029, 'loss/train': 1.5608628988265991} 11/07/2021 09:26:55 - INFO - __main__ - Step 87031: {'lr': 0.00019199993163055595, 'samples': 16709952, 'steps': 87030, 'loss/train': 0.7697756886482239} 11/07/2021 09:26:55 - INFO - __main__ - Step 87032: {'lr': 0.00019199476968997147, 'samples': 16710144, 'steps': 87031, 'loss/train': 1.4292304515838623} 11/07/2021 09:26:56 - INFO - __main__ - Step 87033: {'lr': 0.00019198960777552337, 'samples': 16710336, 'steps': 87032, 'loss/train': 1.4478929042816162} 11/07/2021 09:26:56 - INFO - __main__ - Step 87034: {'lr': 0.00019198444588721379, 'samples': 16710528, 'steps': 87033, 'loss/train': 1.4115523099899292} 11/07/2021 09:26:57 - INFO - __main__ - Step 87035: {'lr': 0.00019197928402504505, 'samples': 16710720, 'steps': 87034, 'loss/train': 1.355694055557251} 11/07/2021 09:26:57 - INFO - __main__ - Step 87036: {'lr': 0.00019197412218901962, 'samples': 16710912, 'steps': 87035, 'loss/train': 1.7685556411743164} 11/07/2021 09:26:57 - INFO - __main__ - Step 87037: {'lr': 0.0001919689603791397, 'samples': 16711104, 'steps': 87036, 'loss/train': 1.1564455032348633} 11/07/2021 09:26:59 - INFO - __main__ - Step 87038: {'lr': 0.0001919637985954077, 'samples': 16711296, 'steps': 87037, 'loss/train': 1.4132636785507202} 11/07/2021 09:26:59 - INFO - __main__ - Step 87039: {'lr': 0.00019195863683782588, 'samples': 16711488, 'steps': 87038, 'loss/train': 1.5713934898376465} 11/07/2021 09:26:59 - INFO - __main__ - Step 87040: {'lr': 0.00019195347510639666, 'samples': 16711680, 'steps': 87039, 'loss/train': 1.0571208000183105} 11/07/2021 09:27:00 - INFO - __main__ - Step 87041: {'lr': 0.00019194831340112227, 'samples': 16711872, 'steps': 87040, 'loss/train': 1.3349262475967407} 11/07/2021 09:27:00 - INFO - __main__ - Step 87042: {'lr': 0.00019194315172200508, 'samples': 16712064, 'steps': 87041, 'loss/train': 1.6507014036178589} 11/07/2021 09:27:01 - INFO - __main__ - Step 87043: {'lr': 0.0001919379900690474, 'samples': 16712256, 'steps': 87042, 'loss/train': 1.325984001159668} 11/07/2021 09:27:02 - INFO - __main__ - Step 87044: {'lr': 0.00019193282844225164, 'samples': 16712448, 'steps': 87043, 'loss/train': 1.525829792022705} 11/07/2021 09:27:02 - INFO - __main__ - Step 87045: {'lr': 0.00019192766684162, 'samples': 16712640, 'steps': 87044, 'loss/train': 1.2039228677749634} 11/07/2021 09:27:02 - INFO - __main__ - Step 87046: {'lr': 0.00019192250526715488, 'samples': 16712832, 'steps': 87045, 'loss/train': 1.4827938079833984} 11/07/2021 09:27:03 - INFO - __main__ - Step 87047: {'lr': 0.00019191734371885855, 'samples': 16713024, 'steps': 87046, 'loss/train': 0.605148434638977} 11/07/2021 09:27:03 - INFO - __main__ - Step 87048: {'lr': 0.00019191218219673337, 'samples': 16713216, 'steps': 87047, 'loss/train': 1.361790418624878} 11/07/2021 09:27:04 - INFO - __main__ - Step 87049: {'lr': 0.00019190702070078167, 'samples': 16713408, 'steps': 87048, 'loss/train': 2.703433036804199} 11/07/2021 09:27:04 - INFO - __main__ - Step 87050: {'lr': 0.00019190185923100578, 'samples': 16713600, 'steps': 87049, 'loss/train': 0.9383610486984253} 11/07/2021 09:27:05 - INFO - __main__ - Step 87051: {'lr': 0.00019189669778740798, 'samples': 16713792, 'steps': 87050, 'loss/train': 1.5764687061309814} 11/07/2021 09:27:05 - INFO - __main__ - Step 87052: {'lr': 0.00019189153636999066, 'samples': 16713984, 'steps': 87051, 'loss/train': 1.2725425958633423} 11/07/2021 09:27:06 - INFO - __main__ - Step 87053: {'lr': 0.0001918863749787561, 'samples': 16714176, 'steps': 87052, 'loss/train': 1.6253200769424438} 11/07/2021 09:27:06 - INFO - __main__ - Step 87054: {'lr': 0.00019188121361370664, 'samples': 16714368, 'steps': 87053, 'loss/train': 1.347472906112671} 11/07/2021 09:27:07 - INFO - __main__ - Step 87055: {'lr': 0.00019187605227484467, 'samples': 16714560, 'steps': 87054, 'loss/train': 1.2414190769195557} 11/07/2021 09:27:07 - INFO - __main__ - Step 87056: {'lr': 0.0001918708909621724, 'samples': 16714752, 'steps': 87055, 'loss/train': 1.2163374423980713} 11/07/2021 09:27:08 - INFO - __main__ - Step 87057: {'lr': 0.0001918657296756922, 'samples': 16714944, 'steps': 87056, 'loss/train': 1.2563056945800781} 11/07/2021 09:27:08 - INFO - __main__ - Step 87058: {'lr': 0.00019186056841540645, 'samples': 16715136, 'steps': 87057, 'loss/train': 1.318207859992981} 11/07/2021 09:27:08 - INFO - __main__ - Step 87059: {'lr': 0.0001918554071813174, 'samples': 16715328, 'steps': 87058, 'loss/train': 1.4004794359207153} 11/07/2021 09:27:09 - INFO - __main__ - Step 87060: {'lr': 0.00019185024597342742, 'samples': 16715520, 'steps': 87059, 'loss/train': 0.8014184236526489} 11/07/2021 09:27:10 - INFO - __main__ - Step 87061: {'lr': 0.00019184508479173885, 'samples': 16715712, 'steps': 87060, 'loss/train': 1.6561367511749268} 11/07/2021 09:27:10 - INFO - __main__ - Step 87062: {'lr': 0.00019183992363625392, 'samples': 16715904, 'steps': 87061, 'loss/train': 1.6524460315704346} 11/07/2021 09:27:10 - INFO - __main__ - Step 87063: {'lr': 0.00019183476250697503, 'samples': 16716096, 'steps': 87062, 'loss/train': 1.2579944133758545} 11/07/2021 09:27:11 - INFO - __main__ - Step 87064: {'lr': 0.00019182960140390454, 'samples': 16716288, 'steps': 87063, 'loss/train': 1.204856038093567} 11/07/2021 09:27:11 - INFO - __main__ - Step 87065: {'lr': 0.0001918244403270447, 'samples': 16716480, 'steps': 87064, 'loss/train': 1.2718400955200195} 11/07/2021 09:27:12 - INFO - __main__ - Step 87066: {'lr': 0.0001918192792763979, 'samples': 16716672, 'steps': 87065, 'loss/train': 1.6030384302139282} 11/07/2021 09:27:13 - INFO - __main__ - Step 87067: {'lr': 0.00019181411825196644, 'samples': 16716864, 'steps': 87066, 'loss/train': 1.4196014404296875} 11/07/2021 09:27:13 - INFO - __main__ - Step 87068: {'lr': 0.0001918089572537526, 'samples': 16717056, 'steps': 87067, 'loss/train': 1.6479307413101196} 11/07/2021 09:27:13 - INFO - __main__ - Step 87069: {'lr': 0.00019180379628175879, 'samples': 16717248, 'steps': 87068, 'loss/train': 1.5852351188659668} 11/07/2021 09:27:14 - INFO - __main__ - Step 87070: {'lr': 0.00019179863533598724, 'samples': 16717440, 'steps': 87069, 'loss/train': 1.7292567491531372} 11/07/2021 09:27:15 - INFO - __main__ - Step 87071: {'lr': 0.00019179347441644035, 'samples': 16717632, 'steps': 87070, 'loss/train': 1.3553175926208496} 11/07/2021 09:27:15 - INFO - __main__ - Step 87072: {'lr': 0.00019178831352312042, 'samples': 16717824, 'steps': 87071, 'loss/train': 1.372596025466919} 11/07/2021 09:27:15 - INFO - __main__ - Step 87073: {'lr': 0.00019178315265602983, 'samples': 16718016, 'steps': 87072, 'loss/train': 1.4615072011947632} 11/07/2021 09:27:16 - INFO - __main__ - Step 87074: {'lr': 0.0001917779918151708, 'samples': 16718208, 'steps': 87073, 'loss/train': 0.6439422369003296} 11/07/2021 09:27:16 - INFO - __main__ - Step 87075: {'lr': 0.0001917728310005457, 'samples': 16718400, 'steps': 87074, 'loss/train': 1.1994373798370361} 11/07/2021 09:27:17 - INFO - __main__ - Step 87076: {'lr': 0.00019176767021215693, 'samples': 16718592, 'steps': 87075, 'loss/train': 1.3859148025512695} 11/07/2021 09:27:18 - INFO - __main__ - Step 87077: {'lr': 0.0001917625094500067, 'samples': 16718784, 'steps': 87076, 'loss/train': 1.5083860158920288} 11/07/2021 09:27:18 - INFO - __main__ - Step 87078: {'lr': 0.0001917573487140974, 'samples': 16718976, 'steps': 87077, 'loss/train': 1.6939905881881714} 11/07/2021 09:27:18 - INFO - __main__ - Step 87079: {'lr': 0.00019175218800443128, 'samples': 16719168, 'steps': 87078, 'loss/train': 1.6190165281295776} 11/07/2021 09:27:19 - INFO - __main__ - Step 87080: {'lr': 0.0001917470273210108, 'samples': 16719360, 'steps': 87079, 'loss/train': 1.5581696033477783} 11/07/2021 09:27:20 - INFO - __main__ - Step 87081: {'lr': 0.00019174186666383813, 'samples': 16719552, 'steps': 87080, 'loss/train': 0.9953639507293701} 11/07/2021 09:27:20 - INFO - __main__ - Step 87082: {'lr': 0.00019173670603291575, 'samples': 16719744, 'steps': 87081, 'loss/train': 1.6218667030334473} 11/07/2021 09:27:20 - INFO - __main__ - Step 87083: {'lr': 0.00019173154542824586, 'samples': 16719936, 'steps': 87082, 'loss/train': 1.320080041885376} 11/07/2021 09:27:21 - INFO - __main__ - Step 87084: {'lr': 0.00019172638484983085, 'samples': 16720128, 'steps': 87083, 'loss/train': 0.9816939234733582} 11/07/2021 09:27:21 - INFO - __main__ - Step 87085: {'lr': 0.00019172122429767305, 'samples': 16720320, 'steps': 87084, 'loss/train': 1.4846161603927612} 11/07/2021 09:27:22 - INFO - __main__ - Step 87086: {'lr': 0.00019171606377177476, 'samples': 16720512, 'steps': 87085, 'loss/train': 1.2480415105819702} 11/07/2021 09:27:23 - INFO - __main__ - Step 87087: {'lr': 0.00019171090327213842, 'samples': 16720704, 'steps': 87086, 'loss/train': 1.454073190689087} 11/07/2021 09:27:23 - INFO - __main__ - Step 87088: {'lr': 0.00019170574279876612, 'samples': 16720896, 'steps': 87087, 'loss/train': 1.4383310079574585} 11/07/2021 09:27:23 - INFO - __main__ - Step 87089: {'lr': 0.00019170058235166033, 'samples': 16721088, 'steps': 87088, 'loss/train': 1.2933603525161743} 11/07/2021 09:27:24 - INFO - __main__ - Step 87090: {'lr': 0.00019169542193082334, 'samples': 16721280, 'steps': 87089, 'loss/train': 1.1526246070861816} 11/07/2021 09:27:25 - INFO - __main__ - Step 87091: {'lr': 0.00019169026153625752, 'samples': 16721472, 'steps': 87090, 'loss/train': 0.9457367658615112} 11/07/2021 09:27:25 - INFO - __main__ - Step 87092: {'lr': 0.00019168510116796518, 'samples': 16721664, 'steps': 87091, 'loss/train': 1.2714407444000244} 11/07/2021 09:27:25 - INFO - __main__ - Step 87093: {'lr': 0.00019167994082594858, 'samples': 16721856, 'steps': 87092, 'loss/train': 1.1042166948318481} 11/07/2021 09:27:26 - INFO - __main__ - Step 87094: {'lr': 0.00019167478051021014, 'samples': 16722048, 'steps': 87093, 'loss/train': 1.4989778995513916} 11/07/2021 09:27:26 - INFO - __main__ - Step 87095: {'lr': 0.00019166962022075214, 'samples': 16722240, 'steps': 87094, 'loss/train': 0.9297798871994019} 11/07/2021 09:27:27 - INFO - __main__ - Step 87096: {'lr': 0.00019166445995757688, 'samples': 16722432, 'steps': 87095, 'loss/train': 1.3404743671417236} 11/07/2021 09:27:27 - INFO - __main__ - Step 87097: {'lr': 0.00019165929972068675, 'samples': 16722624, 'steps': 87096, 'loss/train': 1.3826403617858887} 11/07/2021 09:27:28 - INFO - __main__ - Step 87098: {'lr': 0.00019165413951008405, 'samples': 16722816, 'steps': 87097, 'loss/train': 1.20553719997406} 11/07/2021 09:27:28 - INFO - __main__ - Step 87099: {'lr': 0.00019164897932577105, 'samples': 16723008, 'steps': 87098, 'loss/train': 1.4302361011505127} 11/07/2021 09:27:28 - INFO - __main__ - Step 87100: {'lr': 0.00019164381916775026, 'samples': 16723200, 'steps': 87099, 'loss/train': 1.477695345878601} 11/07/2021 09:27:29 - INFO - __main__ - Step 87101: {'lr': 0.00019163865903602372, 'samples': 16723392, 'steps': 87100, 'loss/train': 1.2390048503875732} 11/07/2021 09:27:30 - INFO - __main__ - Step 87102: {'lr': 0.00019163349893059392, 'samples': 16723584, 'steps': 87101, 'loss/train': 0.9133557081222534} 11/07/2021 09:27:30 - INFO - __main__ - Step 87103: {'lr': 0.0001916283388514632, 'samples': 16723776, 'steps': 87102, 'loss/train': 1.3340733051300049} 11/07/2021 09:27:30 - INFO - __main__ - Step 87104: {'lr': 0.00019162317879863378, 'samples': 16723968, 'steps': 87103, 'loss/train': 1.5595424175262451} 11/07/2021 09:27:31 - INFO - __main__ - Step 87105: {'lr': 0.00019161801877210812, 'samples': 16724160, 'steps': 87104, 'loss/train': 0.7554799318313599} 11/07/2021 09:27:32 - INFO - __main__ - Step 87106: {'lr': 0.00019161285877188845, 'samples': 16724352, 'steps': 87105, 'loss/train': 0.5590908527374268} 11/07/2021 09:27:32 - INFO - __main__ - Step 87107: {'lr': 0.00019160769879797714, 'samples': 16724544, 'steps': 87106, 'loss/train': 1.1354024410247803} 11/07/2021 09:27:33 - INFO - __main__ - Step 87108: {'lr': 0.00019160253885037646, 'samples': 16724736, 'steps': 87107, 'loss/train': 1.0952914953231812} 11/07/2021 09:27:33 - INFO - __main__ - Step 87109: {'lr': 0.0001915973789290888, 'samples': 16724928, 'steps': 87108, 'loss/train': 1.2737678289413452} 11/07/2021 09:27:33 - INFO - __main__ - Step 87110: {'lr': 0.00019159221903411648, 'samples': 16725120, 'steps': 87109, 'loss/train': 1.6478627920150757} 11/07/2021 09:27:34 - INFO - __main__ - Step 87111: {'lr': 0.00019158705916546176, 'samples': 16725312, 'steps': 87110, 'loss/train': 1.649693489074707} 11/07/2021 09:27:35 - INFO - __main__ - Step 87112: {'lr': 0.00019158189932312706, 'samples': 16725504, 'steps': 87111, 'loss/train': 1.8255751132965088} 11/07/2021 09:27:35 - INFO - __main__ - Step 87113: {'lr': 0.00019157673950711464, 'samples': 16725696, 'steps': 87112, 'loss/train': 1.3244410753250122} 11/07/2021 09:27:35 - INFO - __main__ - Step 87114: {'lr': 0.00019157157971742692, 'samples': 16725888, 'steps': 87113, 'loss/train': 1.4055454730987549} 11/07/2021 09:27:36 - INFO - __main__ - Step 87115: {'lr': 0.00019156641995406604, 'samples': 16726080, 'steps': 87114, 'loss/train': 1.0414528846740723} 11/07/2021 09:27:36 - INFO - __main__ - Step 87116: {'lr': 0.00019156126021703445, 'samples': 16726272, 'steps': 87115, 'loss/train': 1.217769980430603} 11/07/2021 09:27:37 - INFO - __main__ - Step 87117: {'lr': 0.00019155610050633446, 'samples': 16726464, 'steps': 87116, 'loss/train': 1.6807036399841309} 11/07/2021 09:27:37 - INFO - __main__ - Step 87118: {'lr': 0.0001915509408219684, 'samples': 16726656, 'steps': 87117, 'loss/train': 0.8884117603302002} 11/07/2021 09:27:38 - INFO - __main__ - Step 87119: {'lr': 0.00019154578116393854, 'samples': 16726848, 'steps': 87118, 'loss/train': 1.3558143377304077} 11/07/2021 09:27:38 - INFO - __main__ - Step 87120: {'lr': 0.00019154062153224727, 'samples': 16727040, 'steps': 87119, 'loss/train': 1.3124386072158813} 11/07/2021 09:27:38 - INFO - __main__ - Step 87121: {'lr': 0.0001915354619268969, 'samples': 16727232, 'steps': 87120, 'loss/train': 0.7578462362289429} 11/07/2021 09:27:39 - INFO - __main__ - Step 87122: {'lr': 0.00019153030234788973, 'samples': 16727424, 'steps': 87121, 'loss/train': 1.5059170722961426} 11/07/2021 09:27:40 - INFO - __main__ - Step 87123: {'lr': 0.00019152514279522815, 'samples': 16727616, 'steps': 87122, 'loss/train': 1.0702413320541382} 11/07/2021 09:27:40 - INFO - __main__ - Step 87124: {'lr': 0.0001915199832689144, 'samples': 16727808, 'steps': 87123, 'loss/train': 1.43962562084198} 11/07/2021 09:27:40 - INFO - __main__ - Step 87125: {'lr': 0.00019151482376895086, 'samples': 16728000, 'steps': 87124, 'loss/train': 1.2438576221466064} 11/07/2021 09:27:41 - INFO - __main__ - Step 87126: {'lr': 0.00019150966429533982, 'samples': 16728192, 'steps': 87125, 'loss/train': 1.2428760528564453} 11/07/2021 09:27:42 - INFO - __main__ - Step 87127: {'lr': 0.00019150450484808375, 'samples': 16728384, 'steps': 87126, 'loss/train': 0.9029279947280884} 11/07/2021 09:27:42 - INFO - __main__ - Step 87128: {'lr': 0.0001914993454271847, 'samples': 16728576, 'steps': 87127, 'loss/train': 1.6265981197357178} 11/07/2021 09:27:43 - INFO - __main__ - Step 87129: {'lr': 0.0001914941860326452, 'samples': 16728768, 'steps': 87128, 'loss/train': 1.688023328781128} 11/07/2021 09:27:43 - INFO - __main__ - Step 87130: {'lr': 0.00019148902666446746, 'samples': 16728960, 'steps': 87129, 'loss/train': 1.3717347383499146} 11/07/2021 09:27:43 - INFO - __main__ - Step 87131: {'lr': 0.00019148386732265388, 'samples': 16729152, 'steps': 87130, 'loss/train': 0.946265697479248} 11/07/2021 09:27:44 - INFO - __main__ - Step 87132: {'lr': 0.0001914787080072068, 'samples': 16729344, 'steps': 87131, 'loss/train': 1.432370901107788} 11/07/2021 09:27:45 - INFO - __main__ - Step 87133: {'lr': 0.00019147354871812847, 'samples': 16729536, 'steps': 87132, 'loss/train': 1.5647523403167725} 11/07/2021 09:27:45 - INFO - __main__ - Step 87134: {'lr': 0.00019146838945542129, 'samples': 16729728, 'steps': 87133, 'loss/train': 1.2976242303848267} 11/07/2021 09:27:45 - INFO - __main__ - Step 87135: {'lr': 0.0001914632302190875, 'samples': 16729920, 'steps': 87134, 'loss/train': 1.4258174896240234} 11/07/2021 09:27:46 - INFO - __main__ - Step 87136: {'lr': 0.00019145807100912952, 'samples': 16730112, 'steps': 87135, 'loss/train': 1.4803760051727295} 11/07/2021 09:27:47 - INFO - __main__ - Step 87137: {'lr': 0.0001914529118255496, 'samples': 16730304, 'steps': 87136, 'loss/train': 1.389490008354187} 11/07/2021 09:27:47 - INFO - __main__ - Step 87138: {'lr': 0.00019144775266835012, 'samples': 16730496, 'steps': 87137, 'loss/train': 1.512701392173767} 11/07/2021 09:27:47 - INFO - __main__ - Step 87139: {'lr': 0.00019144259353753339, 'samples': 16730688, 'steps': 87138, 'loss/train': 1.5004817247390747} 11/07/2021 09:27:48 - INFO - __main__ - Step 87140: {'lr': 0.0001914374344331018, 'samples': 16730880, 'steps': 87139, 'loss/train': 0.8259867429733276} 11/07/2021 09:27:48 - INFO - __main__ - Step 87141: {'lr': 0.0001914322753550575, 'samples': 16731072, 'steps': 87140, 'loss/train': 1.2600101232528687} 11/07/2021 09:27:49 - INFO - __main__ - Step 87142: {'lr': 0.00019142711630340293, 'samples': 16731264, 'steps': 87141, 'loss/train': 1.6715736389160156} 11/07/2021 09:27:50 - INFO - __main__ - Step 87143: {'lr': 0.00019142195727814038, 'samples': 16731456, 'steps': 87142, 'loss/train': 1.6468206644058228} 11/07/2021 09:27:50 - INFO - __main__ - Step 87144: {'lr': 0.0001914167982792722, 'samples': 16731648, 'steps': 87143, 'loss/train': 1.4511913061141968} 11/07/2021 09:27:50 - INFO - __main__ - Step 87145: {'lr': 0.0001914116393068007, 'samples': 16731840, 'steps': 87144, 'loss/train': 0.7690889239311218} 11/07/2021 09:27:51 - INFO - __main__ - Step 87146: {'lr': 0.00019140648036072822, 'samples': 16732032, 'steps': 87145, 'loss/train': 1.320788860321045} 11/07/2021 09:27:52 - INFO - __main__ - Step 87147: {'lr': 0.00019140132144105705, 'samples': 16732224, 'steps': 87146, 'loss/train': 1.6252899169921875} 11/07/2021 09:27:52 - INFO - __main__ - Step 87148: {'lr': 0.00019139616254778958, 'samples': 16732416, 'steps': 87147, 'loss/train': 1.147672176361084} 11/07/2021 09:27:52 - INFO - __main__ - Step 87149: {'lr': 0.00019139100368092805, 'samples': 16732608, 'steps': 87148, 'loss/train': 1.2297712564468384} 11/07/2021 09:27:53 - INFO - __main__ - Step 87150: {'lr': 0.00019138584484047487, 'samples': 16732800, 'steps': 87149, 'loss/train': 1.667641520500183} 11/07/2021 09:27:53 - INFO - __main__ - Step 87151: {'lr': 0.0001913806860264323, 'samples': 16732992, 'steps': 87150, 'loss/train': 1.6088734865188599} 11/07/2021 09:27:53 - INFO - __main__ - Step 87152: {'lr': 0.0001913755272388027, 'samples': 16733184, 'steps': 87151, 'loss/train': 1.622915506362915} 11/07/2021 09:27:54 - INFO - __main__ - Step 87153: {'lr': 0.00019137036847758837, 'samples': 16733376, 'steps': 87152, 'loss/train': 1.4248569011688232} 11/07/2021 09:27:55 - INFO - __main__ - Step 87154: {'lr': 0.00019136520974279175, 'samples': 16733568, 'steps': 87153, 'loss/train': 1.4167895317077637} 11/07/2021 09:27:55 - INFO - __main__ - Step 87155: {'lr': 0.00019136005103441499, 'samples': 16733760, 'steps': 87154, 'loss/train': 1.4818806648254395} 11/07/2021 09:27:56 - INFO - __main__ - Step 87156: {'lr': 0.00019135489235246045, 'samples': 16733952, 'steps': 87155, 'loss/train': 1.618327260017395} 11/07/2021 09:27:56 - INFO - __main__ - Step 87157: {'lr': 0.00019134973369693052, 'samples': 16734144, 'steps': 87156, 'loss/train': 1.5969476699829102} 11/07/2021 09:27:57 - INFO - __main__ - Step 87158: {'lr': 0.00019134457506782748, 'samples': 16734336, 'steps': 87157, 'loss/train': 1.8703515529632568} 11/07/2021 09:27:57 - INFO - __main__ - Step 87159: {'lr': 0.00019133941646515368, 'samples': 16734528, 'steps': 87158, 'loss/train': 1.402065634727478} 11/07/2021 09:27:58 - INFO - __main__ - Step 87160: {'lr': 0.0001913342578889114, 'samples': 16734720, 'steps': 87159, 'loss/train': 1.3740499019622803} 11/07/2021 09:27:58 - INFO - __main__ - Step 87161: {'lr': 0.00019132909933910304, 'samples': 16734912, 'steps': 87160, 'loss/train': 1.518524169921875} 11/07/2021 09:27:58 - INFO - __main__ - Step 87162: {'lr': 0.00019132394081573085, 'samples': 16735104, 'steps': 87161, 'loss/train': 1.1981096267700195} 11/07/2021 09:27:59 - INFO - __main__ - Step 87163: {'lr': 0.0001913187823187972, 'samples': 16735296, 'steps': 87162, 'loss/train': 1.6356050968170166} 11/07/2021 09:28:00 - INFO - __main__ - Step 87164: {'lr': 0.00019131362384830443, 'samples': 16735488, 'steps': 87163, 'loss/train': 1.3351540565490723} 11/07/2021 09:28:00 - INFO - __main__ - Step 87165: {'lr': 0.00019130846540425477, 'samples': 16735680, 'steps': 87164, 'loss/train': 1.3597744703292847} 11/07/2021 09:28:00 - INFO - __main__ - Step 87166: {'lr': 0.00019130330698665065, 'samples': 16735872, 'steps': 87165, 'loss/train': 1.3454481363296509} 11/07/2021 09:28:01 - INFO - __main__ - Step 87167: {'lr': 0.00019129814859549445, 'samples': 16736064, 'steps': 87166, 'loss/train': 2.7178430557250977} 11/07/2021 09:28:01 - INFO - __main__ - Step 87168: {'lr': 0.00019129299023078831, 'samples': 16736256, 'steps': 87167, 'loss/train': 1.381242036819458} 11/07/2021 09:28:02 - INFO - __main__ - Step 87169: {'lr': 0.00019128783189253462, 'samples': 16736448, 'steps': 87168, 'loss/train': 1.6818617582321167} 11/07/2021 09:28:02 - INFO - __main__ - Step 87170: {'lr': 0.00019128267358073576, 'samples': 16736640, 'steps': 87169, 'loss/train': 1.6855263710021973} 11/07/2021 09:28:03 - INFO - __main__ - Step 87171: {'lr': 0.000191277515295394, 'samples': 16736832, 'steps': 87170, 'loss/train': 1.2800064086914062} 11/07/2021 09:28:03 - INFO - __main__ - Step 87172: {'lr': 0.0001912723570365117, 'samples': 16737024, 'steps': 87171, 'loss/train': 1.5812095403671265} 11/07/2021 09:28:04 - INFO - __main__ - Step 87173: {'lr': 0.00019126719880409112, 'samples': 16737216, 'steps': 87172, 'loss/train': 1.3014893531799316} 11/07/2021 09:28:04 - INFO - __main__ - Step 87174: {'lr': 0.00019126204059813468, 'samples': 16737408, 'steps': 87173, 'loss/train': 1.3270550966262817} 11/07/2021 09:28:05 - INFO - __main__ - Step 87175: {'lr': 0.00019125688241864464, 'samples': 16737600, 'steps': 87174, 'loss/train': 0.7438115477561951} 11/07/2021 09:28:05 - INFO - __main__ - Step 87176: {'lr': 0.00019125172426562336, 'samples': 16737792, 'steps': 87175, 'loss/train': 0.5577803254127502} 11/07/2021 09:28:06 - INFO - __main__ - Step 87177: {'lr': 0.00019124656613907315, 'samples': 16737984, 'steps': 87176, 'loss/train': 1.5962042808532715} 11/07/2021 09:28:06 - INFO - __main__ - Step 87178: {'lr': 0.00019124140803899637, 'samples': 16738176, 'steps': 87177, 'loss/train': 1.3514484167099} 11/07/2021 09:28:07 - INFO - __main__ - Step 87179: {'lr': 0.00019123624996539524, 'samples': 16738368, 'steps': 87178, 'loss/train': 1.0371953248977661} 11/07/2021 09:28:07 - INFO - __main__ - Step 87180: {'lr': 0.00019123109191827216, 'samples': 16738560, 'steps': 87179, 'loss/train': 1.6170214414596558} 11/07/2021 09:28:08 - INFO - __main__ - Step 87181: {'lr': 0.00019122593389762948, 'samples': 16738752, 'steps': 87180, 'loss/train': 1.054797887802124} 11/07/2021 09:28:08 - INFO - __main__ - Step 87182: {'lr': 0.0001912207759034695, 'samples': 16738944, 'steps': 87181, 'loss/train': 1.3264023065567017} 11/07/2021 09:28:08 - INFO - __main__ - Step 87183: {'lr': 0.00019121561793579444, 'samples': 16739136, 'steps': 87182, 'loss/train': 1.7961337566375732} 11/07/2021 09:28:10 - INFO - __main__ - Step 87184: {'lr': 0.00019121045999460676, 'samples': 16739328, 'steps': 87183, 'loss/train': 1.4264113903045654} 11/07/2021 09:28:10 - INFO - __main__ - Step 87185: {'lr': 0.00019120530207990873, 'samples': 16739520, 'steps': 87184, 'loss/train': 1.4041908979415894} 11/07/2021 09:28:10 - INFO - __main__ - Step 87186: {'lr': 0.0001912001441917027, 'samples': 16739712, 'steps': 87185, 'loss/train': 0.15362897515296936} 11/07/2021 09:28:11 - INFO - __main__ - Step 87187: {'lr': 0.000191194986329991, 'samples': 16739904, 'steps': 87186, 'loss/train': 1.0861492156982422} 11/07/2021 09:28:11 - INFO - __main__ - Step 87188: {'lr': 0.00019118982849477588, 'samples': 16740096, 'steps': 87187, 'loss/train': 1.3330961465835571} 11/07/2021 09:28:11 - INFO - __main__ - Step 87189: {'lr': 0.0001911846706860598, 'samples': 16740288, 'steps': 87188, 'loss/train': 1.4649611711502075} 11/07/2021 09:28:12 - INFO - __main__ - Step 87190: {'lr': 0.00019117951290384492, 'samples': 16740480, 'steps': 87189, 'loss/train': 1.3777600526809692} 11/07/2021 09:28:13 - INFO - __main__ - Step 87191: {'lr': 0.00019117435514813368, 'samples': 16740672, 'steps': 87190, 'loss/train': 1.4623172283172607} 11/07/2021 09:28:13 - INFO - __main__ - Step 87192: {'lr': 0.00019116919741892833, 'samples': 16740864, 'steps': 87191, 'loss/train': 1.5436102151870728} 11/07/2021 09:28:13 - INFO - __main__ - Step 87193: {'lr': 0.00019116403971623124, 'samples': 16741056, 'steps': 87192, 'loss/train': 1.1597682237625122} 11/07/2021 09:28:14 - INFO - __main__ - Step 87194: {'lr': 0.00019115888204004482, 'samples': 16741248, 'steps': 87193, 'loss/train': 1.712998628616333} 11/07/2021 09:28:15 - INFO - __main__ - Step 87195: {'lr': 0.0001911537243903712, 'samples': 16741440, 'steps': 87194, 'loss/train': 1.6299792528152466} 11/07/2021 09:28:15 - INFO - __main__ - Step 87196: {'lr': 0.0001911485667672128, 'samples': 16741632, 'steps': 87195, 'loss/train': 1.4160431623458862} 11/07/2021 09:28:16 - INFO - __main__ - Step 87197: {'lr': 0.000191143409170572, 'samples': 16741824, 'steps': 87196, 'loss/train': 1.4842575788497925} 11/07/2021 09:28:16 - INFO - __main__ - Step 87198: {'lr': 0.00019113825160045102, 'samples': 16742016, 'steps': 87197, 'loss/train': 0.9354493618011475} 11/07/2021 09:28:16 - INFO - __main__ - Step 87199: {'lr': 0.00019113309405685225, 'samples': 16742208, 'steps': 87198, 'loss/train': 1.116074562072754} 11/07/2021 09:28:17 - INFO - __main__ - Step 87200: {'lr': 0.00019112793653977805, 'samples': 16742400, 'steps': 87199, 'loss/train': 0.9478327631950378} 11/07/2021 09:28:18 - INFO - __main__ - Step 87201: {'lr': 0.00019112277904923065, 'samples': 16742592, 'steps': 87200, 'loss/train': 1.303207278251648} 11/07/2021 09:28:18 - INFO - __main__ - Step 87202: {'lr': 0.00019111762158521243, 'samples': 16742784, 'steps': 87201, 'loss/train': 1.236354112625122} 11/07/2021 09:28:19 - INFO - __main__ - Step 87203: {'lr': 0.0001911124641477257, 'samples': 16742976, 'steps': 87202, 'loss/train': 5.548733711242676} 11/07/2021 09:28:19 - INFO - __main__ - Step 87204: {'lr': 0.00019110730673677274, 'samples': 16743168, 'steps': 87203, 'loss/train': 1.758797526359558} 11/07/2021 09:28:19 - INFO - __main__ - Step 87205: {'lr': 0.00019110214935235596, 'samples': 16743360, 'steps': 87204, 'loss/train': 1.3433001041412354} 11/07/2021 09:28:20 - INFO - __main__ - Step 87206: {'lr': 0.0001910969919944776, 'samples': 16743552, 'steps': 87205, 'loss/train': 0.7747066617012024} 11/07/2021 09:28:21 - INFO - __main__ - Step 87207: {'lr': 0.0001910918346631401, 'samples': 16743744, 'steps': 87206, 'loss/train': 1.2576115131378174} 11/07/2021 09:28:21 - INFO - __main__ - Step 87208: {'lr': 0.0001910866773583457, 'samples': 16743936, 'steps': 87207, 'loss/train': 0.9886679649353027} 11/07/2021 09:28:21 - INFO - __main__ - Step 87209: {'lr': 0.00019108152008009673, 'samples': 16744128, 'steps': 87208, 'loss/train': 1.004407525062561} 11/07/2021 09:28:22 - INFO - __main__ - Step 87210: {'lr': 0.00019107636282839546, 'samples': 16744320, 'steps': 87209, 'loss/train': 2.310472249984741} 11/07/2021 09:28:22 - INFO - __main__ - Step 87211: {'lr': 0.00019107120560324438, 'samples': 16744512, 'steps': 87210, 'loss/train': 1.433729648590088} 11/07/2021 09:28:23 - INFO - __main__ - Step 87212: {'lr': 0.00019106604840464562, 'samples': 16744704, 'steps': 87211, 'loss/train': 1.2727824449539185} 11/07/2021 09:28:24 - INFO - __main__ - Step 87213: {'lr': 0.00019106089123260158, 'samples': 16744896, 'steps': 87212, 'loss/train': 1.259508728981018} 11/07/2021 09:28:24 - INFO - __main__ - Step 87214: {'lr': 0.00019105573408711464, 'samples': 16745088, 'steps': 87213, 'loss/train': 1.1924198865890503} 11/07/2021 09:28:24 - INFO - __main__ - Step 87215: {'lr': 0.000191050576968187, 'samples': 16745280, 'steps': 87214, 'loss/train': 1.4232138395309448} 11/07/2021 09:28:25 - INFO - __main__ - Step 87216: {'lr': 0.00019104541987582113, 'samples': 16745472, 'steps': 87215, 'loss/train': 1.7551448345184326} 11/07/2021 09:28:27 - INFO - __main__ - Step 87217: {'lr': 0.00019104026281001926, 'samples': 16745664, 'steps': 87216, 'loss/train': 1.9287394285202026} 11/07/2021 09:28:27 - INFO - __main__ - Step 87218: {'lr': 0.00019103510577078372, 'samples': 16745856, 'steps': 87217, 'loss/train': 1.4169094562530518} 11/07/2021 09:28:28 - INFO - __main__ - Step 87219: {'lr': 0.0001910299487581169, 'samples': 16746048, 'steps': 87218, 'loss/train': 1.5792275667190552} 11/07/2021 09:28:28 - INFO - __main__ - Step 87220: {'lr': 0.00019102479177202103, 'samples': 16746240, 'steps': 87219, 'loss/train': 1.289585828781128} 11/07/2021 09:28:28 - INFO - __main__ - Step 87221: {'lr': 0.00019101963481249853, 'samples': 16746432, 'steps': 87220, 'loss/train': 2.32489275932312} 11/07/2021 09:28:29 - INFO - __main__ - Step 87222: {'lr': 0.0001910144778795517, 'samples': 16746624, 'steps': 87221, 'loss/train': 1.7619389295578003} 11/07/2021 09:28:29 - INFO - __main__ - Step 87223: {'lr': 0.00019100932097318278, 'samples': 16746816, 'steps': 87222, 'loss/train': 0.9203473925590515} 11/07/2021 09:28:29 - INFO - __main__ - Step 87224: {'lr': 0.00019100416409339414, 'samples': 16747008, 'steps': 87223, 'loss/train': 0.989740788936615} 11/07/2021 09:28:30 - INFO - __main__ - Step 87225: {'lr': 0.00019099900724018812, 'samples': 16747200, 'steps': 87224, 'loss/train': 1.7560681104660034} 11/07/2021 09:28:31 - INFO - __main__ - Step 87226: {'lr': 0.00019099385041356705, 'samples': 16747392, 'steps': 87225, 'loss/train': 1.598658800125122} 11/07/2021 09:28:31 - INFO - __main__ - Step 87227: {'lr': 0.0001909886936135332, 'samples': 16747584, 'steps': 87226, 'loss/train': 1.2289316654205322} 11/07/2021 09:28:32 - INFO - __main__ - Step 87228: {'lr': 0.00019098353684008897, 'samples': 16747776, 'steps': 87227, 'loss/train': 1.2168245315551758} 11/07/2021 09:28:32 - INFO - __main__ - Step 87229: {'lr': 0.00019097838009323663, 'samples': 16747968, 'steps': 87228, 'loss/train': 1.336302638053894} 11/07/2021 09:28:33 - INFO - __main__ - Step 87230: {'lr': 0.00019097322337297852, 'samples': 16748160, 'steps': 87229, 'loss/train': 1.3247171640396118} 11/07/2021 09:28:33 - INFO - __main__ - Step 87231: {'lr': 0.00019096806667931695, 'samples': 16748352, 'steps': 87230, 'loss/train': 0.7514841556549072} 11/07/2021 09:28:33 - INFO - __main__ - Step 87232: {'lr': 0.0001909629100122543, 'samples': 16748544, 'steps': 87231, 'loss/train': 1.260901927947998} 11/07/2021 09:28:34 - INFO - __main__ - Step 87233: {'lr': 0.00019095775337179283, 'samples': 16748736, 'steps': 87232, 'loss/train': 1.551183819770813} 11/07/2021 09:28:34 - INFO - __main__ - Step 87234: {'lr': 0.00019095259675793488, 'samples': 16748928, 'steps': 87233, 'loss/train': 1.2133835554122925} 11/07/2021 09:28:35 - INFO - __main__ - Step 87235: {'lr': 0.00019094744017068288, 'samples': 16749120, 'steps': 87234, 'loss/train': 1.6907341480255127} 11/07/2021 09:28:36 - INFO - __main__ - Step 87236: {'lr': 0.00019094228361003895, 'samples': 16749312, 'steps': 87235, 'loss/train': 1.2307885885238647} 11/07/2021 09:28:36 - INFO - __main__ - Step 87237: {'lr': 0.00019093712707600553, 'samples': 16749504, 'steps': 87236, 'loss/train': 1.7022154331207275} 11/07/2021 09:28:36 - INFO - __main__ - Step 87238: {'lr': 0.00019093197056858495, 'samples': 16749696, 'steps': 87237, 'loss/train': 1.3266762495040894} 11/07/2021 09:28:37 - INFO - __main__ - Step 87239: {'lr': 0.00019092681408777946, 'samples': 16749888, 'steps': 87238, 'loss/train': 1.366782546043396} 11/07/2021 09:28:37 - INFO - __main__ - Step 87240: {'lr': 0.00019092165763359145, 'samples': 16750080, 'steps': 87239, 'loss/train': 1.3333826065063477} 11/07/2021 09:28:38 - INFO - __main__ - Step 87241: {'lr': 0.00019091650120602326, 'samples': 16750272, 'steps': 87240, 'loss/train': 1.0472612380981445} 11/07/2021 09:28:38 - INFO - __main__ - Step 87242: {'lr': 0.00019091134480507717, 'samples': 16750464, 'steps': 87241, 'loss/train': 1.4533838033676147} 11/07/2021 09:28:39 - INFO - __main__ - Step 87243: {'lr': 0.0001909061884307555, 'samples': 16750656, 'steps': 87242, 'loss/train': 1.3646434545516968} 11/07/2021 09:28:39 - INFO - __main__ - Step 87244: {'lr': 0.0001909010320830606, 'samples': 16750848, 'steps': 87243, 'loss/train': 1.581523060798645} 11/07/2021 09:28:39 - INFO - __main__ - Step 87245: {'lr': 0.00019089587576199478, 'samples': 16751040, 'steps': 87244, 'loss/train': 1.1017473936080933} 11/07/2021 09:28:40 - INFO - __main__ - Step 87246: {'lr': 0.00019089071946756038, 'samples': 16751232, 'steps': 87245, 'loss/train': 1.366173267364502} 11/07/2021 09:28:41 - INFO - __main__ - Step 87247: {'lr': 0.00019088556319975966, 'samples': 16751424, 'steps': 87246, 'loss/train': 1.4369903802871704} 11/07/2021 09:28:41 - INFO - __main__ - Step 87248: {'lr': 0.00019088040695859515, 'samples': 16751616, 'steps': 87247, 'loss/train': 1.030599594116211} 11/07/2021 09:28:42 - INFO - __main__ - Step 87249: {'lr': 0.0001908752507440689, 'samples': 16751808, 'steps': 87248, 'loss/train': 1.19240403175354} 11/07/2021 09:28:42 - INFO - __main__ - Step 87250: {'lr': 0.00019087009455618335, 'samples': 16752000, 'steps': 87249, 'loss/train': 1.7015304565429688} 11/07/2021 09:28:43 - INFO - __main__ - Step 87251: {'lr': 0.0001908649383949408, 'samples': 16752192, 'steps': 87250, 'loss/train': 1.0866364240646362} 11/07/2021 09:28:43 - INFO - __main__ - Step 87252: {'lr': 0.00019085978226034362, 'samples': 16752384, 'steps': 87251, 'loss/train': 1.7142176628112793} 11/07/2021 09:28:44 - INFO - __main__ - Step 87253: {'lr': 0.00019085462615239413, 'samples': 16752576, 'steps': 87252, 'loss/train': 1.5966933965682983} 11/07/2021 09:28:44 - INFO - __main__ - Step 87254: {'lr': 0.00019084947007109459, 'samples': 16752768, 'steps': 87253, 'loss/train': 1.7165701389312744} 11/07/2021 09:28:44 - INFO - __main__ - Step 87255: {'lr': 0.00019084431401644738, 'samples': 16752960, 'steps': 87254, 'loss/train': 1.860170841217041} 11/07/2021 09:28:46 - INFO - __main__ - Step 87256: {'lr': 0.0001908391579884548, 'samples': 16753152, 'steps': 87255, 'loss/train': 1.9920854568481445} 11/07/2021 09:28:46 - INFO - __main__ - Step 87257: {'lr': 0.0001908340019871192, 'samples': 16753344, 'steps': 87256, 'loss/train': 1.3258841037750244} 11/07/2021 09:28:46 - INFO - __main__ - Step 87258: {'lr': 0.0001908288460124429, 'samples': 16753536, 'steps': 87257, 'loss/train': 1.8310589790344238} 11/07/2021 09:28:47 - INFO - __main__ - Step 87259: {'lr': 0.0001908236900644282, 'samples': 16753728, 'steps': 87258, 'loss/train': 1.198044776916504} 11/07/2021 09:28:47 - INFO - __main__ - Step 87260: {'lr': 0.00019081853414307739, 'samples': 16753920, 'steps': 87259, 'loss/train': 0.6796894669532776} 11/07/2021 09:28:48 - INFO - __main__ - Step 87261: {'lr': 0.000190813378248393, 'samples': 16754112, 'steps': 87260, 'loss/train': 1.236943006515503} 11/07/2021 09:28:48 - INFO - __main__ - Step 87262: {'lr': 0.00019080822238037705, 'samples': 16754304, 'steps': 87261, 'loss/train': 1.2828965187072754} 11/07/2021 09:28:49 - INFO - __main__ - Step 87263: {'lr': 0.000190803066539032, 'samples': 16754496, 'steps': 87262, 'loss/train': 1.1477675437927246} 11/07/2021 09:28:49 - INFO - __main__ - Step 87264: {'lr': 0.00019079791072436017, 'samples': 16754688, 'steps': 87263, 'loss/train': 1.1772263050079346} 11/07/2021 09:28:49 - INFO - __main__ - Step 87265: {'lr': 0.00019079275493636392, 'samples': 16754880, 'steps': 87264, 'loss/train': 1.5517430305480957} 11/07/2021 09:28:50 - INFO - __main__ - Step 87266: {'lr': 0.0001907875991750455, 'samples': 16755072, 'steps': 87265, 'loss/train': 1.2436726093292236} 11/07/2021 09:28:51 - INFO - __main__ - Step 87267: {'lr': 0.0001907824434404073, 'samples': 16755264, 'steps': 87266, 'loss/train': 0.8618582487106323} 11/07/2021 09:28:51 - INFO - __main__ - Step 87268: {'lr': 0.00019077728773245163, 'samples': 16755456, 'steps': 87267, 'loss/train': 1.114466905593872} 11/07/2021 09:28:52 - INFO - __main__ - Step 87269: {'lr': 0.00019077213205118078, 'samples': 16755648, 'steps': 87268, 'loss/train': 1.286158561706543} 11/07/2021 09:28:52 - INFO - __main__ - Step 87270: {'lr': 0.0001907669763965971, 'samples': 16755840, 'steps': 87269, 'loss/train': 1.4953774213790894} 11/07/2021 09:28:52 - INFO - __main__ - Step 87271: {'lr': 0.00019076182076870288, 'samples': 16756032, 'steps': 87270, 'loss/train': 1.647733449935913} 11/07/2021 09:28:53 - INFO - __main__ - Step 87272: {'lr': 0.00019075666516750052, 'samples': 16756224, 'steps': 87271, 'loss/train': 1.4746602773666382} 11/07/2021 09:28:54 - INFO - __main__ - Step 87273: {'lr': 0.00019075150959299225, 'samples': 16756416, 'steps': 87272, 'loss/train': 1.3916443586349487} 11/07/2021 09:28:54 - INFO - __main__ - Step 87274: {'lr': 0.00019074635404518045, 'samples': 16756608, 'steps': 87273, 'loss/train': 1.7530144453048706} 11/07/2021 09:28:54 - INFO - __main__ - Step 87275: {'lr': 0.00019074119852406751, 'samples': 16756800, 'steps': 87274, 'loss/train': 1.6587337255477905} 11/07/2021 09:28:55 - INFO - __main__ - Step 87276: {'lr': 0.0001907360430296556, 'samples': 16756992, 'steps': 87275, 'loss/train': 1.7100945711135864} 11/07/2021 09:28:56 - INFO - __main__ - Step 87277: {'lr': 0.00019073088756194713, 'samples': 16757184, 'steps': 87276, 'loss/train': 1.831459403038025} 11/07/2021 09:28:56 - INFO - __main__ - Step 87278: {'lr': 0.00019072573212094434, 'samples': 16757376, 'steps': 87277, 'loss/train': 1.7297658920288086} 11/07/2021 09:28:56 - INFO - __main__ - Step 87279: {'lr': 0.00019072057670664968, 'samples': 16757568, 'steps': 87278, 'loss/train': 0.9577075839042664} 11/07/2021 09:28:57 - INFO - __main__ - Step 87280: {'lr': 0.0001907154213190654, 'samples': 16757760, 'steps': 87279, 'loss/train': 1.4339944124221802} 11/07/2021 09:28:57 - INFO - __main__ - Step 87281: {'lr': 0.00019071026595819386, 'samples': 16757952, 'steps': 87280, 'loss/train': 1.635752558708191} 11/07/2021 09:28:58 - INFO - __main__ - Step 87282: {'lr': 0.0001907051106240373, 'samples': 16758144, 'steps': 87281, 'loss/train': 1.0486809015274048} 11/07/2021 09:28:59 - INFO - __main__ - Step 87283: {'lr': 0.00019069995531659814, 'samples': 16758336, 'steps': 87282, 'loss/train': 1.4810035228729248} 11/07/2021 09:28:59 - INFO - __main__ - Step 87284: {'lr': 0.00019069480003587865, 'samples': 16758528, 'steps': 87283, 'loss/train': 1.4431875944137573} 11/07/2021 09:28:59 - INFO - __main__ - Step 87285: {'lr': 0.0001906896447818812, 'samples': 16758720, 'steps': 87284, 'loss/train': 1.5845215320587158} 11/07/2021 09:29:00 - INFO - __main__ - Step 87286: {'lr': 0.00019068448955460805, 'samples': 16758912, 'steps': 87285, 'loss/train': 0.6794354319572449} 11/07/2021 09:29:01 - INFO - __main__ - Step 87287: {'lr': 0.00019067933435406155, 'samples': 16759104, 'steps': 87286, 'loss/train': 1.6420483589172363} 11/07/2021 09:29:01 - INFO - __main__ - Step 87288: {'lr': 0.00019067417918024415, 'samples': 16759296, 'steps': 87287, 'loss/train': 1.1009821891784668} 11/07/2021 09:29:01 - INFO - __main__ - Step 87289: {'lr': 0.00019066902403315795, 'samples': 16759488, 'steps': 87288, 'loss/train': 1.571286678314209} 11/07/2021 09:29:02 - INFO - __main__ - Step 87290: {'lr': 0.00019066386891280536, 'samples': 16759680, 'steps': 87289, 'loss/train': 1.3205795288085938} 11/07/2021 09:29:02 - INFO - __main__ - Step 87291: {'lr': 0.0001906587138191887, 'samples': 16759872, 'steps': 87290, 'loss/train': 1.3446035385131836} 11/07/2021 09:29:03 - INFO - __main__ - Step 87292: {'lr': 0.00019065355875231034, 'samples': 16760064, 'steps': 87291, 'loss/train': 1.8580881357192993} 11/07/2021 09:29:04 - INFO - __main__ - Step 87293: {'lr': 0.00019064840371217255, 'samples': 16760256, 'steps': 87292, 'loss/train': 0.9787148237228394} 11/07/2021 09:29:04 - INFO - __main__ - Step 87294: {'lr': 0.00019064324869877766, 'samples': 16760448, 'steps': 87293, 'loss/train': 1.542047381401062} 11/07/2021 09:29:04 - INFO - __main__ - Step 87295: {'lr': 0.00019063809371212804, 'samples': 16760640, 'steps': 87294, 'loss/train': 1.4571938514709473} 11/07/2021 09:29:05 - INFO - __main__ - Step 87296: {'lr': 0.00019063293875222595, 'samples': 16760832, 'steps': 87295, 'loss/train': 0.9980185627937317} 11/07/2021 09:29:05 - INFO - __main__ - Step 87297: {'lr': 0.00019062778381907376, 'samples': 16761024, 'steps': 87296, 'loss/train': 1.4751033782958984} 11/07/2021 09:29:06 - INFO - __main__ - Step 87298: {'lr': 0.00019062262891267378, 'samples': 16761216, 'steps': 87297, 'loss/train': 1.6350239515304565} 11/07/2021 09:29:06 - INFO - __main__ - Step 87299: {'lr': 0.0001906174740330283, 'samples': 16761408, 'steps': 87298, 'loss/train': 1.619889497756958} 11/07/2021 09:29:07 - INFO - __main__ - Step 87300: {'lr': 0.00019061231918013967, 'samples': 16761600, 'steps': 87299, 'loss/train': 1.3444732427597046} 11/07/2021 09:29:07 - INFO - __main__ - Step 87301: {'lr': 0.00019060716435401025, 'samples': 16761792, 'steps': 87300, 'loss/train': 0.7469086647033691} 11/07/2021 09:29:07 - INFO - __main__ - Step 87302: {'lr': 0.0001906020095546424, 'samples': 16761984, 'steps': 87301, 'loss/train': 1.3363596200942993} 11/07/2021 09:29:09 - INFO - __main__ - Step 87303: {'lr': 0.00019059685478203824, 'samples': 16762176, 'steps': 87302, 'loss/train': 1.6132502555847168} 11/07/2021 09:29:09 - INFO - __main__ - Step 87304: {'lr': 0.00019059170003620028, 'samples': 16762368, 'steps': 87303, 'loss/train': 1.0790220499038696} 11/07/2021 09:29:09 - INFO - __main__ - Step 87305: {'lr': 0.00019058654531713075, 'samples': 16762560, 'steps': 87304, 'loss/train': 2.095912218093872} 11/07/2021 09:29:10 - INFO - __main__ - Step 87306: {'lr': 0.000190581390624832, 'samples': 16762752, 'steps': 87305, 'loss/train': 0.6578621864318848} 11/07/2021 09:29:10 - INFO - __main__ - Step 87307: {'lr': 0.00019057623595930637, 'samples': 16762944, 'steps': 87306, 'loss/train': 1.4572057723999023} 11/07/2021 09:29:10 - INFO - __main__ - Step 87308: {'lr': 0.00019057108132055617, 'samples': 16763136, 'steps': 87307, 'loss/train': 1.4607783555984497} 11/07/2021 09:29:11 - INFO - __main__ - Step 87309: {'lr': 0.00019056592670858372, 'samples': 16763328, 'steps': 87308, 'loss/train': 1.385434627532959} 11/07/2021 09:29:12 - INFO - __main__ - Step 87310: {'lr': 0.00019056077212339134, 'samples': 16763520, 'steps': 87309, 'loss/train': 1.538500189781189} 11/07/2021 09:29:12 - INFO - __main__ - Step 87311: {'lr': 0.00019055561756498138, 'samples': 16763712, 'steps': 87310, 'loss/train': 1.467637538909912} 11/07/2021 09:29:12 - INFO - __main__ - Step 87312: {'lr': 0.00019055046303335617, 'samples': 16763904, 'steps': 87311, 'loss/train': 1.5926549434661865} 11/07/2021 09:29:13 - INFO - __main__ - Step 87313: {'lr': 0.00019054530852851797, 'samples': 16764096, 'steps': 87312, 'loss/train': 1.1287446022033691} 11/07/2021 09:29:14 - INFO - __main__ - Step 87314: {'lr': 0.00019054015405046916, 'samples': 16764288, 'steps': 87313, 'loss/train': 1.759792685508728} 11/07/2021 09:29:14 - INFO - __main__ - Step 87315: {'lr': 0.00019053499959921207, 'samples': 16764480, 'steps': 87314, 'loss/train': 1.4940634965896606} 11/07/2021 09:29:15 - INFO - __main__ - Step 87316: {'lr': 0.00019052984517474892, 'samples': 16764672, 'steps': 87315, 'loss/train': 0.7063860893249512} 11/07/2021 09:29:15 - INFO - __main__ - Step 87317: {'lr': 0.00019052469077708212, 'samples': 16764864, 'steps': 87316, 'loss/train': 1.2396827936172485} 11/07/2021 09:29:15 - INFO - __main__ - Step 87318: {'lr': 0.00019051953640621393, 'samples': 16765056, 'steps': 87317, 'loss/train': 1.492343544960022} 11/07/2021 09:29:16 - INFO - __main__ - Step 87319: {'lr': 0.00019051438206214678, 'samples': 16765248, 'steps': 87318, 'loss/train': 1.5199991464614868} 11/07/2021 09:29:17 - INFO - __main__ - Step 87320: {'lr': 0.0001905092277448829, 'samples': 16765440, 'steps': 87319, 'loss/train': 1.232067584991455} 11/07/2021 09:29:17 - INFO - __main__ - Step 87321: {'lr': 0.00019050407345442468, 'samples': 16765632, 'steps': 87320, 'loss/train': 1.673579454421997} 11/07/2021 09:29:17 - INFO - __main__ - Step 87322: {'lr': 0.00019049891919077438, 'samples': 16765824, 'steps': 87321, 'loss/train': 1.7418988943099976} 11/07/2021 09:29:18 - INFO - __main__ - Step 87323: {'lr': 0.0001904937649539344, 'samples': 16766016, 'steps': 87322, 'loss/train': 1.7742551565170288} 11/07/2021 09:29:18 - INFO - __main__ - Step 87324: {'lr': 0.00019048861074390697, 'samples': 16766208, 'steps': 87323, 'loss/train': 1.2953349351882935} 11/07/2021 09:29:19 - INFO - __main__ - Step 87325: {'lr': 0.00019048345656069444, 'samples': 16766400, 'steps': 87324, 'loss/train': 1.9836719036102295} 11/07/2021 09:29:20 - INFO - __main__ - Step 87326: {'lr': 0.00019047830240429914, 'samples': 16766592, 'steps': 87325, 'loss/train': 1.3385530710220337} 11/07/2021 09:29:20 - INFO - __main__ - Step 87327: {'lr': 0.00019047314827472342, 'samples': 16766784, 'steps': 87326, 'loss/train': 1.598818302154541} 11/07/2021 09:29:20 - INFO - __main__ - Step 87328: {'lr': 0.0001904679941719696, 'samples': 16766976, 'steps': 87327, 'loss/train': 1.3278608322143555} 11/07/2021 09:29:21 - INFO - __main__ - Step 87329: {'lr': 0.00019046284009603998, 'samples': 16767168, 'steps': 87328, 'loss/train': 1.8045281171798706} 11/07/2021 09:29:22 - INFO - __main__ - Step 87330: {'lr': 0.00019045768604693687, 'samples': 16767360, 'steps': 87329, 'loss/train': 1.2613791227340698} 11/07/2021 09:29:22 - INFO - __main__ - Step 87331: {'lr': 0.00019045253202466258, 'samples': 16767552, 'steps': 87330, 'loss/train': 0.3498629033565521} 11/07/2021 09:29:22 - INFO - __main__ - Step 87332: {'lr': 0.0001904473780292195, 'samples': 16767744, 'steps': 87331, 'loss/train': 1.4221986532211304} 11/07/2021 09:29:23 - INFO - __main__ - Step 87333: {'lr': 0.0001904422240606099, 'samples': 16767936, 'steps': 87332, 'loss/train': 1.1243637800216675} 11/07/2021 09:29:23 - INFO - __main__ - Step 87334: {'lr': 0.00019043707011883615, 'samples': 16768128, 'steps': 87333, 'loss/train': 1.6492867469787598} 11/07/2021 09:29:24 - INFO - __main__ - Step 87335: {'lr': 0.0001904319162039005, 'samples': 16768320, 'steps': 87334, 'loss/train': 1.2689775228500366} 11/07/2021 09:29:24 - INFO - __main__ - Step 87336: {'lr': 0.0001904267623158053, 'samples': 16768512, 'steps': 87335, 'loss/train': 1.2558104991912842} 11/07/2021 09:29:25 - INFO - __main__ - Step 87337: {'lr': 0.00019042160845455285, 'samples': 16768704, 'steps': 87336, 'loss/train': 1.5528017282485962} 11/07/2021 09:29:25 - INFO - __main__ - Step 87338: {'lr': 0.00019041645462014557, 'samples': 16768896, 'steps': 87337, 'loss/train': 1.2722105979919434} 11/07/2021 09:29:25 - INFO - __main__ - Step 87339: {'lr': 0.00019041130081258567, 'samples': 16769088, 'steps': 87338, 'loss/train': 1.3147237300872803} 11/07/2021 09:29:26 - INFO - __main__ - Step 87340: {'lr': 0.00019040614703187553, 'samples': 16769280, 'steps': 87339, 'loss/train': 0.7365564107894897} 11/07/2021 09:29:27 - INFO - __main__ - Step 87341: {'lr': 0.00019040099327801747, 'samples': 16769472, 'steps': 87340, 'loss/train': 1.882457971572876} 11/07/2021 09:29:27 - INFO - __main__ - Step 87342: {'lr': 0.00019039583955101386, 'samples': 16769664, 'steps': 87341, 'loss/train': 1.4025253057479858} 11/07/2021 09:29:27 - INFO - __main__ - Step 87343: {'lr': 0.00019039068585086687, 'samples': 16769856, 'steps': 87342, 'loss/train': 1.0772312879562378} 11/07/2021 09:29:28 - INFO - __main__ - Step 87344: {'lr': 0.00019038553217757897, 'samples': 16770048, 'steps': 87343, 'loss/train': 1.3768845796585083} 11/07/2021 09:29:28 - INFO - __main__ - Step 87345: {'lr': 0.00019038037853115247, 'samples': 16770240, 'steps': 87344, 'loss/train': 1.238519549369812} 11/07/2021 09:29:29 - INFO - __main__ - Step 87346: {'lr': 0.0001903752249115896, 'samples': 16770432, 'steps': 87345, 'loss/train': 1.31955885887146} 11/07/2021 09:29:30 - INFO - __main__ - Step 87347: {'lr': 0.00019037007131889272, 'samples': 16770624, 'steps': 87346, 'loss/train': 1.3886290788650513} 11/07/2021 09:29:30 - INFO - __main__ - Step 87348: {'lr': 0.00019036491775306413, 'samples': 16770816, 'steps': 87347, 'loss/train': 0.9846014976501465} 11/07/2021 09:29:30 - INFO - __main__ - Step 87349: {'lr': 0.00019035976421410625, 'samples': 16771008, 'steps': 87348, 'loss/train': 1.6490098237991333} 11/07/2021 09:29:31 - INFO - __main__ - Step 87350: {'lr': 0.00019035461070202132, 'samples': 16771200, 'steps': 87349, 'loss/train': 0.5780075788497925} 11/07/2021 09:29:32 - INFO - __main__ - Step 87351: {'lr': 0.0001903494572168117, 'samples': 16771392, 'steps': 87350, 'loss/train': 1.1050572395324707} 11/07/2021 09:29:32 - INFO - __main__ - Step 87352: {'lr': 0.00019034430375847964, 'samples': 16771584, 'steps': 87351, 'loss/train': 1.4469841718673706} 11/07/2021 09:29:32 - INFO - __main__ - Step 87353: {'lr': 0.00019033915032702755, 'samples': 16771776, 'steps': 87352, 'loss/train': 1.007612943649292} 11/07/2021 09:29:33 - INFO - __main__ - Step 87354: {'lr': 0.00019033399692245772, 'samples': 16771968, 'steps': 87353, 'loss/train': 1.6717835664749146} 11/07/2021 09:29:33 - INFO - __main__ - Step 87355: {'lr': 0.00019032884354477247, 'samples': 16772160, 'steps': 87354, 'loss/train': 1.3456233739852905} 11/07/2021 09:29:34 - INFO - __main__ - Step 87356: {'lr': 0.0001903236901939742, 'samples': 16772352, 'steps': 87355, 'loss/train': 1.5744264125823975} 11/07/2021 09:29:34 - INFO - __main__ - Step 87357: {'lr': 0.0001903185368700651, 'samples': 16772544, 'steps': 87356, 'loss/train': 1.1552678346633911} 11/07/2021 09:29:35 - INFO - __main__ - Step 87358: {'lr': 0.00019031338357304752, 'samples': 16772736, 'steps': 87357, 'loss/train': 1.3116308450698853} 11/07/2021 09:29:35 - INFO - __main__ - Step 87359: {'lr': 0.0001903082303029238, 'samples': 16772928, 'steps': 87358, 'loss/train': 1.3294529914855957} 11/07/2021 09:29:35 - INFO - __main__ - Step 87360: {'lr': 0.00019030307705969628, 'samples': 16773120, 'steps': 87359, 'loss/train': 1.0740790367126465} 11/07/2021 09:29:37 - INFO - __main__ - Step 87361: {'lr': 0.00019029792384336728, 'samples': 16773312, 'steps': 87360, 'loss/train': 1.1060539484024048} 11/07/2021 09:29:37 - INFO - __main__ - Step 87362: {'lr': 0.0001902927706539391, 'samples': 16773504, 'steps': 87361, 'loss/train': 1.1348506212234497} 11/07/2021 09:29:37 - INFO - __main__ - Step 87363: {'lr': 0.00019028761749141407, 'samples': 16773696, 'steps': 87362, 'loss/train': 1.5013155937194824} 11/07/2021 09:29:38 - INFO - __main__ - Step 87364: {'lr': 0.00019028246435579454, 'samples': 16773888, 'steps': 87363, 'loss/train': 1.4171700477600098} 11/07/2021 09:29:38 - INFO - __main__ - Step 87365: {'lr': 0.0001902773112470828, 'samples': 16774080, 'steps': 87364, 'loss/train': 1.6578916311264038} 11/07/2021 09:29:39 - INFO - __main__ - Step 87366: {'lr': 0.00019027215816528118, 'samples': 16774272, 'steps': 87365, 'loss/train': 1.1283824443817139} 11/07/2021 09:29:39 - INFO - __main__ - Step 87367: {'lr': 0.000190267005110392, 'samples': 16774464, 'steps': 87366, 'loss/train': 1.4148147106170654} 11/07/2021 09:29:40 - INFO - __main__ - Step 87368: {'lr': 0.0001902618520824176, 'samples': 16774656, 'steps': 87367, 'loss/train': 0.7894191145896912} 11/07/2021 09:29:40 - INFO - __main__ - Step 87369: {'lr': 0.0001902566990813604, 'samples': 16774848, 'steps': 87368, 'loss/train': 1.3835476636886597} 11/07/2021 09:29:40 - INFO - __main__ - Step 87370: {'lr': 0.00019025154610722246, 'samples': 16775040, 'steps': 87369, 'loss/train': 0.8269889950752258} 11/07/2021 09:29:41 - INFO - __main__ - Step 87371: {'lr': 0.0001902463931600063, 'samples': 16775232, 'steps': 87370, 'loss/train': 1.445393681526184} 11/07/2021 09:29:42 - INFO - __main__ - Step 87372: {'lr': 0.00019024124023971417, 'samples': 16775424, 'steps': 87371, 'loss/train': 1.0634398460388184} 11/07/2021 09:29:42 - INFO - __main__ - Step 87373: {'lr': 0.0001902360873463484, 'samples': 16775616, 'steps': 87372, 'loss/train': 1.222823977470398} 11/07/2021 09:29:43 - INFO - __main__ - Step 87374: {'lr': 0.00019023093447991137, 'samples': 16775808, 'steps': 87373, 'loss/train': 1.3251628875732422} 11/07/2021 09:29:43 - INFO - __main__ - Step 87375: {'lr': 0.00019022578164040532, 'samples': 16776000, 'steps': 87374, 'loss/train': 1.1091681718826294} 11/07/2021 09:29:43 - INFO - __main__ - Step 87376: {'lr': 0.0001902206288278326, 'samples': 16776192, 'steps': 87375, 'loss/train': 1.3987407684326172} 11/07/2021 09:29:44 - INFO - __main__ - Step 87377: {'lr': 0.00019021547604219558, 'samples': 16776384, 'steps': 87376, 'loss/train': 0.9371957182884216} 11/07/2021 09:29:45 - INFO - __main__ - Step 87378: {'lr': 0.00019021032328349653, 'samples': 16776576, 'steps': 87377, 'loss/train': 1.6702256202697754} 11/07/2021 09:29:45 - INFO - __main__ - Step 87379: {'lr': 0.0001902051705517378, 'samples': 16776768, 'steps': 87378, 'loss/train': 5.7547526359558105} 11/07/2021 09:29:45 - INFO - __main__ - Step 87380: {'lr': 0.00019020001784692168, 'samples': 16776960, 'steps': 87379, 'loss/train': 1.2326263189315796} 11/07/2021 09:29:46 - INFO - __main__ - Step 87381: {'lr': 0.0001901948651690505, 'samples': 16777152, 'steps': 87380, 'loss/train': 1.4086267948150635} 11/07/2021 09:29:46 - INFO - __main__ - Step 87382: {'lr': 0.00019018971251812673, 'samples': 16777344, 'steps': 87381, 'loss/train': 1.3708748817443848} 11/07/2021 09:29:47 - INFO - __main__ - Step 87383: {'lr': 0.0001901845598941524, 'samples': 16777536, 'steps': 87382, 'loss/train': 0.9066331386566162} 11/07/2021 09:29:47 - INFO - __main__ - Step 87384: {'lr': 0.00019017940729713, 'samples': 16777728, 'steps': 87383, 'loss/train': 1.6247684955596924} 11/07/2021 09:29:48 - INFO - __main__ - Step 87385: {'lr': 0.00019017425472706188, 'samples': 16777920, 'steps': 87384, 'loss/train': 1.3240245580673218} 11/07/2021 09:29:48 - INFO - __main__ - Step 87386: {'lr': 0.00019016910218395028, 'samples': 16778112, 'steps': 87385, 'loss/train': 1.0119128227233887} 11/07/2021 09:29:48 - INFO - __main__ - Step 87387: {'lr': 0.00019016394966779755, 'samples': 16778304, 'steps': 87386, 'loss/train': 1.4502875804901123} 11/07/2021 09:29:50 - INFO - __main__ - Step 87388: {'lr': 0.00019015879717860604, 'samples': 16778496, 'steps': 87387, 'loss/train': 1.1993350982666016} 11/07/2021 09:29:50 - INFO - __main__ - Step 87389: {'lr': 0.00019015364471637803, 'samples': 16778688, 'steps': 87388, 'loss/train': 0.7508935332298279} 11/07/2021 09:29:50 - INFO - __main__ - Step 87390: {'lr': 0.0001901484922811159, 'samples': 16778880, 'steps': 87389, 'loss/train': 1.251092553138733} 11/07/2021 09:29:51 - INFO - __main__ - Step 87391: {'lr': 0.0001901433398728219, 'samples': 16779072, 'steps': 87390, 'loss/train': 1.3099489212036133} 11/07/2021 09:29:51 - INFO - __main__ - Step 87392: {'lr': 0.00019013818749149842, 'samples': 16779264, 'steps': 87391, 'loss/train': 1.4780750274658203} 11/07/2021 09:29:51 - INFO - __main__ - Step 87393: {'lr': 0.0001901330351371477, 'samples': 16779456, 'steps': 87392, 'loss/train': 1.3045254945755005} 11/07/2021 09:29:53 - INFO - __main__ - Step 87394: {'lr': 0.00019012788280977217, 'samples': 16779648, 'steps': 87393, 'loss/train': 1.7848039865493774} 11/07/2021 09:29:53 - INFO - __main__ - Step 87395: {'lr': 0.00019012273050937405, 'samples': 16779840, 'steps': 87394, 'loss/train': 1.4033957719802856} 11/07/2021 09:29:53 - INFO - __main__ - Step 87396: {'lr': 0.00019011757823595582, 'samples': 16780032, 'steps': 87395, 'loss/train': 1.1333991289138794} 11/07/2021 09:29:54 - INFO - __main__ - Step 87397: {'lr': 0.0001901124259895196, 'samples': 16780224, 'steps': 87396, 'loss/train': 1.3705031871795654} 11/07/2021 09:29:54 - INFO - __main__ - Step 87398: {'lr': 0.00019010727377006777, 'samples': 16780416, 'steps': 87397, 'loss/train': 1.3966807126998901} 11/07/2021 09:29:55 - INFO - __main__ - Step 87399: {'lr': 0.0001901021215776027, 'samples': 16780608, 'steps': 87398, 'loss/train': 1.815239667892456} 11/07/2021 09:29:55 - INFO - __main__ - Step 87400: {'lr': 0.00019009696941212667, 'samples': 16780800, 'steps': 87399, 'loss/train': 1.257692813873291} 11/07/2021 09:29:56 - INFO - __main__ - Step 87401: {'lr': 0.00019009181727364205, 'samples': 16780992, 'steps': 87400, 'loss/train': 0.7003956437110901} 11/07/2021 09:29:56 - INFO - __main__ - Step 87402: {'lr': 0.00019008666516215112, 'samples': 16781184, 'steps': 87401, 'loss/train': 1.42072594165802} 11/07/2021 09:29:57 - INFO - __main__ - Step 87403: {'lr': 0.0001900815130776562, 'samples': 16781376, 'steps': 87402, 'loss/train': 1.8560993671417236} 11/07/2021 09:29:57 - INFO - __main__ - Step 87404: {'lr': 0.00019007636102015964, 'samples': 16781568, 'steps': 87403, 'loss/train': 1.365442156791687} 11/07/2021 09:29:58 - INFO - __main__ - Step 87405: {'lr': 0.00019007120898966373, 'samples': 16781760, 'steps': 87404, 'loss/train': 1.5944303274154663} 11/07/2021 09:29:58 - INFO - __main__ - Step 87406: {'lr': 0.0001900660569861708, 'samples': 16781952, 'steps': 87405, 'loss/train': 1.653462529182434} 11/07/2021 09:29:59 - INFO - __main__ - Step 87407: {'lr': 0.0001900609050096832, 'samples': 16782144, 'steps': 87406, 'loss/train': 2.030320405960083} 11/07/2021 09:29:59 - INFO - __main__ - Step 87408: {'lr': 0.00019005575306020323, 'samples': 16782336, 'steps': 87407, 'loss/train': 1.9108046293258667} 11/07/2021 09:29:59 - INFO - __main__ - Step 87409: {'lr': 0.00019005060113773333, 'samples': 16782528, 'steps': 87408, 'loss/train': 1.275022268295288} 11/07/2021 09:30:00 - INFO - __main__ - Step 87410: {'lr': 0.00019004544924227558, 'samples': 16782720, 'steps': 87409, 'loss/train': 1.3873134851455688} 11/07/2021 09:30:01 - INFO - __main__ - Step 87411: {'lr': 0.00019004029737383244, 'samples': 16782912, 'steps': 87410, 'loss/train': 1.2008006572723389} 11/07/2021 09:30:01 - INFO - __main__ - Step 87412: {'lr': 0.0001900351455324062, 'samples': 16783104, 'steps': 87411, 'loss/train': 0.3515206575393677} 11/07/2021 09:30:01 - INFO - __main__ - Step 87413: {'lr': 0.0001900299937179992, 'samples': 16783296, 'steps': 87412, 'loss/train': 1.6608232259750366} 11/07/2021 09:30:02 - INFO - __main__ - Step 87414: {'lr': 0.00019002484193061378, 'samples': 16783488, 'steps': 87413, 'loss/train': 1.7914413213729858} 11/07/2021 09:30:02 - INFO - __main__ - Step 87415: {'lr': 0.00019001969017025223, 'samples': 16783680, 'steps': 87414, 'loss/train': 1.5851364135742188} 11/07/2021 09:30:03 - INFO - __main__ - Step 87416: {'lr': 0.00019001453843691687, 'samples': 16783872, 'steps': 87415, 'loss/train': 1.4573431015014648} 11/07/2021 09:30:04 - INFO - __main__ - Step 87417: {'lr': 0.00019000938673061006, 'samples': 16784064, 'steps': 87416, 'loss/train': 1.7492406368255615} 11/07/2021 09:30:04 - INFO - __main__ - Step 87418: {'lr': 0.00019000423505133407, 'samples': 16784256, 'steps': 87417, 'loss/train': 1.2986371517181396} 11/07/2021 09:30:04 - INFO - __main__ - Step 87419: {'lr': 0.00018999908339909126, 'samples': 16784448, 'steps': 87418, 'loss/train': 1.5222877264022827} 11/07/2021 09:30:05 - INFO - __main__ - Step 87420: {'lr': 0.00018999393177388392, 'samples': 16784640, 'steps': 87419, 'loss/train': 1.2408337593078613} 11/07/2021 09:30:06 - INFO - __main__ - Step 87421: {'lr': 0.00018998878017571438, 'samples': 16784832, 'steps': 87420, 'loss/train': 1.3944957256317139} 11/07/2021 09:30:06 - INFO - __main__ - Step 87422: {'lr': 0.000189983628604585, 'samples': 16785024, 'steps': 87421, 'loss/train': 0.7165651321411133} 11/07/2021 09:30:06 - INFO - __main__ - Step 87423: {'lr': 0.00018997847706049816, 'samples': 16785216, 'steps': 87422, 'loss/train': 1.057547926902771} 11/07/2021 09:30:07 - INFO - __main__ - Step 87424: {'lr': 0.00018997332554345598, 'samples': 16785408, 'steps': 87423, 'loss/train': 1.3692271709442139} 11/07/2021 09:30:07 - INFO - __main__ - Step 87425: {'lr': 0.00018996817405346093, 'samples': 16785600, 'steps': 87424, 'loss/train': 0.8610666990280151} 11/07/2021 09:30:08 - INFO - __main__ - Step 87426: {'lr': 0.00018996302259051526, 'samples': 16785792, 'steps': 87425, 'loss/train': 2.0412259101867676} 11/07/2021 09:30:08 - INFO - __main__ - Step 87427: {'lr': 0.00018995787115462132, 'samples': 16785984, 'steps': 87426, 'loss/train': 1.0908817052841187} 11/07/2021 09:30:09 - INFO - __main__ - Step 87428: {'lr': 0.00018995271974578146, 'samples': 16786176, 'steps': 87427, 'loss/train': 1.6091623306274414} 11/07/2021 09:30:09 - INFO - __main__ - Step 87429: {'lr': 0.00018994756836399794, 'samples': 16786368, 'steps': 87428, 'loss/train': 0.8133315443992615} 11/07/2021 09:30:09 - INFO - __main__ - Step 87430: {'lr': 0.00018994241700927316, 'samples': 16786560, 'steps': 87429, 'loss/train': 0.9494941830635071} 11/07/2021 09:30:10 - INFO - __main__ - Step 87431: {'lr': 0.0001899372656816094, 'samples': 16786752, 'steps': 87430, 'loss/train': 1.1835589408874512} 11/07/2021 09:30:11 - INFO - __main__ - Step 87432: {'lr': 0.00018993211438100897, 'samples': 16786944, 'steps': 87431, 'loss/train': 1.6709038019180298} 11/07/2021 09:30:11 - INFO - __main__ - Step 87433: {'lr': 0.0001899269631074742, 'samples': 16787136, 'steps': 87432, 'loss/train': 1.5370404720306396} 11/07/2021 09:30:11 - INFO - __main__ - Step 87434: {'lr': 0.00018992181186100744, 'samples': 16787328, 'steps': 87433, 'loss/train': 1.3803752660751343} 11/07/2021 09:30:12 - INFO - __main__ - Step 87435: {'lr': 0.00018991666064161096, 'samples': 16787520, 'steps': 87434, 'loss/train': 1.1183420419692993} 11/07/2021 09:30:12 - INFO - __main__ - Step 87436: {'lr': 0.0001899115094492872, 'samples': 16787712, 'steps': 87435, 'loss/train': 0.14335235953330994} 11/07/2021 09:30:13 - INFO - __main__ - Step 87437: {'lr': 0.00018990635828403828, 'samples': 16787904, 'steps': 87436, 'loss/train': 1.3123764991760254} 11/07/2021 09:30:14 - INFO - __main__ - Step 87438: {'lr': 0.00018990120714586665, 'samples': 16788096, 'steps': 87437, 'loss/train': 0.41078728437423706} 11/07/2021 09:30:14 - INFO - __main__ - Step 87439: {'lr': 0.00018989605603477458, 'samples': 16788288, 'steps': 87438, 'loss/train': 1.1215980052947998} 11/07/2021 09:30:14 - INFO - __main__ - Step 87440: {'lr': 0.00018989090495076443, 'samples': 16788480, 'steps': 87439, 'loss/train': 1.3124996423721313} 11/07/2021 09:30:15 - INFO - __main__ - Step 87441: {'lr': 0.0001898857538938385, 'samples': 16788672, 'steps': 87440, 'loss/train': 1.4412188529968262} 11/07/2021 09:30:16 - INFO - __main__ - Step 87442: {'lr': 0.00018988060286399916, 'samples': 16788864, 'steps': 87441, 'loss/train': 1.7231636047363281} 11/07/2021 09:30:16 - INFO - __main__ - Step 87443: {'lr': 0.0001898754518612487, 'samples': 16789056, 'steps': 87442, 'loss/train': 1.33155357837677} 11/07/2021 09:30:17 - INFO - __main__ - Step 87444: {'lr': 0.00018987030088558936, 'samples': 16789248, 'steps': 87443, 'loss/train': 1.8385815620422363} 11/07/2021 09:30:17 - INFO - __main__ - Step 87445: {'lr': 0.00018986514993702362, 'samples': 16789440, 'steps': 87444, 'loss/train': 1.2237313985824585} 11/07/2021 09:30:17 - INFO - __main__ - Step 87446: {'lr': 0.00018985999901555367, 'samples': 16789632, 'steps': 87445, 'loss/train': 0.7762059569358826} 11/07/2021 09:30:18 - INFO - __main__ - Step 87447: {'lr': 0.00018985484812118192, 'samples': 16789824, 'steps': 87446, 'loss/train': 1.7875819206237793} 11/07/2021 09:30:19 - INFO - __main__ - Step 87448: {'lr': 0.00018984969725391063, 'samples': 16790016, 'steps': 87447, 'loss/train': 1.6466100215911865} 11/07/2021 09:30:19 - INFO - __main__ - Step 87449: {'lr': 0.0001898445464137421, 'samples': 16790208, 'steps': 87448, 'loss/train': 1.4982123374938965} 11/07/2021 09:30:20 - INFO - __main__ - Step 87450: {'lr': 0.00018983939560067876, 'samples': 16790400, 'steps': 87449, 'loss/train': 1.3702282905578613} 11/07/2021 09:30:20 - INFO - __main__ - Step 87451: {'lr': 0.00018983424481472283, 'samples': 16790592, 'steps': 87450, 'loss/train': 1.4817523956298828} 11/07/2021 09:30:21 - INFO - __main__ - Step 87452: {'lr': 0.00018982909405587661, 'samples': 16790784, 'steps': 87451, 'loss/train': 1.796095371246338} 11/07/2021 09:30:22 - INFO - __main__ - Step 87453: {'lr': 0.00018982394332414253, 'samples': 16790976, 'steps': 87452, 'loss/train': 1.81024169921875} 11/07/2021 09:30:22 - INFO - __main__ - Step 87454: {'lr': 0.00018981879261952282, 'samples': 16791168, 'steps': 87453, 'loss/train': 1.2731504440307617} 11/07/2021 09:30:22 - INFO - __main__ - Step 87455: {'lr': 0.00018981364194201983, 'samples': 16791360, 'steps': 87454, 'loss/train': 1.321080207824707} 11/07/2021 09:30:23 - INFO - __main__ - Step 87456: {'lr': 0.00018980849129163587, 'samples': 16791552, 'steps': 87455, 'loss/train': 1.1711251735687256} 11/07/2021 09:30:24 - INFO - __main__ - Step 87457: {'lr': 0.0001898033406683733, 'samples': 16791744, 'steps': 87456, 'loss/train': 1.3734062910079956} 11/07/2021 09:30:24 - INFO - __main__ - Step 87458: {'lr': 0.00018979819007223448, 'samples': 16791936, 'steps': 87457, 'loss/train': 1.1263513565063477} 11/07/2021 09:30:25 - INFO - __main__ - Step 87459: {'lr': 0.00018979303950322158, 'samples': 16792128, 'steps': 87458, 'loss/train': 1.6292157173156738} 11/07/2021 09:30:25 - INFO - __main__ - Step 87460: {'lr': 0.00018978788896133704, 'samples': 16792320, 'steps': 87459, 'loss/train': 0.8842732310295105} 11/07/2021 09:30:25 - INFO - __main__ - Step 87461: {'lr': 0.00018978273844658312, 'samples': 16792512, 'steps': 87460, 'loss/train': 0.7039206027984619} 11/07/2021 09:30:26 - INFO - __main__ - Step 87462: {'lr': 0.0001897775879589622, 'samples': 16792704, 'steps': 87461, 'loss/train': 1.2078673839569092} 11/07/2021 09:30:27 - INFO - __main__ - Step 87463: {'lr': 0.00018977243749847663, 'samples': 16792896, 'steps': 87462, 'loss/train': 1.8437412977218628} 11/07/2021 09:30:27 - INFO - __main__ - Step 87464: {'lr': 0.00018976728706512856, 'samples': 16793088, 'steps': 87463, 'loss/train': 1.3643385171890259} 11/07/2021 09:30:27 - INFO - __main__ - Step 87465: {'lr': 0.00018976213665892046, 'samples': 16793280, 'steps': 87464, 'loss/train': 1.4537047147750854} 11/07/2021 09:30:28 - INFO - __main__ - Step 87466: {'lr': 0.0001897569862798546, 'samples': 16793472, 'steps': 87465, 'loss/train': 1.5081182718276978} 11/07/2021 09:30:28 - INFO - __main__ - Step 87467: {'lr': 0.0001897518359279333, 'samples': 16793664, 'steps': 87466, 'loss/train': 1.4709604978561401} 11/07/2021 09:30:29 - INFO - __main__ - Step 87468: {'lr': 0.0001897466856031589, 'samples': 16793856, 'steps': 87467, 'loss/train': 1.2194958925247192} 11/07/2021 09:30:30 - INFO - __main__ - Step 87469: {'lr': 0.00018974153530553378, 'samples': 16794048, 'steps': 87468, 'loss/train': 1.3616596460342407} 11/07/2021 09:30:30 - INFO - __main__ - Step 87470: {'lr': 0.00018973638503506015, 'samples': 16794240, 'steps': 87469, 'loss/train': 1.6150376796722412} 11/07/2021 09:30:30 - INFO - __main__ - Step 87471: {'lr': 0.00018973123479174036, 'samples': 16794432, 'steps': 87470, 'loss/train': 1.2583554983139038} 11/07/2021 09:30:31 - INFO - __main__ - Step 87472: {'lr': 0.00018972608457557675, 'samples': 16794624, 'steps': 87471, 'loss/train': 1.5774931907653809} 11/07/2021 09:30:32 - INFO - __main__ - Step 87473: {'lr': 0.00018972093438657164, 'samples': 16794816, 'steps': 87472, 'loss/train': 1.2195801734924316} 11/07/2021 09:30:32 - INFO - __main__ - Step 87474: {'lr': 0.00018971578422472736, 'samples': 16795008, 'steps': 87473, 'loss/train': 1.7382094860076904} 11/07/2021 09:30:33 - INFO - __main__ - Step 87475: {'lr': 0.00018971063409004617, 'samples': 16795200, 'steps': 87474, 'loss/train': 1.2457714080810547} 11/07/2021 09:30:33 - INFO - __main__ - Step 87476: {'lr': 0.00018970548398253049, 'samples': 16795392, 'steps': 87475, 'loss/train': 1.7697020769119263} 11/07/2021 09:30:33 - INFO - __main__ - Step 87477: {'lr': 0.0001897003339021826, 'samples': 16795584, 'steps': 87476, 'loss/train': 1.0480914115905762} 11/07/2021 09:30:34 - INFO - __main__ - Step 87478: {'lr': 0.00018969518384900477, 'samples': 16795776, 'steps': 87477, 'loss/train': 1.0137866735458374} 11/07/2021 09:30:35 - INFO - __main__ - Step 87479: {'lr': 0.00018969003382299937, 'samples': 16795968, 'steps': 87478, 'loss/train': 1.4650421142578125} 11/07/2021 09:30:35 - INFO - __main__ - Step 87480: {'lr': 0.00018968488382416877, 'samples': 16796160, 'steps': 87479, 'loss/train': 1.6437112092971802} 11/07/2021 09:30:35 - INFO - __main__ - Step 87481: {'lr': 0.00018967973385251516, 'samples': 16796352, 'steps': 87480, 'loss/train': 1.3364200592041016} 11/07/2021 09:30:36 - INFO - __main__ - Step 87482: {'lr': 0.00018967458390804092, 'samples': 16796544, 'steps': 87481, 'loss/train': 1.1738901138305664} 11/07/2021 09:30:36 - INFO - __main__ - Step 87483: {'lr': 0.0001896694339907484, 'samples': 16796736, 'steps': 87482, 'loss/train': 1.5622128248214722} 11/07/2021 09:30:37 - INFO - __main__ - Step 87484: {'lr': 0.0001896642841006399, 'samples': 16796928, 'steps': 87483, 'loss/train': 1.3990161418914795} 11/07/2021 09:30:37 - INFO - __main__ - Step 87485: {'lr': 0.00018965913423771774, 'samples': 16797120, 'steps': 87484, 'loss/train': 1.3477295637130737} 11/07/2021 09:30:38 - INFO - __main__ - Step 87486: {'lr': 0.00018965398440198427, 'samples': 16797312, 'steps': 87485, 'loss/train': 1.0845983028411865} 11/07/2021 09:30:38 - INFO - __main__ - Step 87487: {'lr': 0.00018964883459344174, 'samples': 16797504, 'steps': 87486, 'loss/train': 1.166558027267456} 11/07/2021 09:30:38 - INFO - __main__ - Step 87488: {'lr': 0.00018964368481209253, 'samples': 16797696, 'steps': 87487, 'loss/train': 1.3639734983444214} 11/07/2021 09:30:39 - INFO - __main__ - Step 87489: {'lr': 0.00018963853505793896, 'samples': 16797888, 'steps': 87488, 'loss/train': 0.9153897762298584} 11/07/2021 09:30:40 - INFO - __main__ - Step 87490: {'lr': 0.00018963338533098344, 'samples': 16798080, 'steps': 87489, 'loss/train': 1.5076218843460083} 11/07/2021 09:30:40 - INFO - __main__ - Step 87491: {'lr': 0.00018962823563122805, 'samples': 16798272, 'steps': 87490, 'loss/train': 1.6835883855819702} 11/07/2021 09:30:41 - INFO - __main__ - Step 87492: {'lr': 0.00018962308595867525, 'samples': 16798464, 'steps': 87491, 'loss/train': 1.334289312362671} 11/07/2021 09:30:41 - INFO - __main__ - Step 87493: {'lr': 0.00018961793631332738, 'samples': 16798656, 'steps': 87492, 'loss/train': 1.2954962253570557} 11/07/2021 09:30:42 - INFO - __main__ - Step 87494: {'lr': 0.00018961278669518672, 'samples': 16798848, 'steps': 87493, 'loss/train': 1.882318377494812} 11/07/2021 09:30:42 - INFO - __main__ - Step 87495: {'lr': 0.0001896076371042556, 'samples': 16799040, 'steps': 87494, 'loss/train': 1.2112624645233154} 11/07/2021 09:30:43 - INFO - __main__ - Step 87496: {'lr': 0.00018960248754053638, 'samples': 16799232, 'steps': 87495, 'loss/train': 1.283656358718872} 11/07/2021 09:30:43 - INFO - __main__ - Step 87497: {'lr': 0.00018959733800403132, 'samples': 16799424, 'steps': 87496, 'loss/train': 1.4472880363464355} 11/07/2021 09:30:43 - INFO - __main__ - Step 87498: {'lr': 0.00018959218849474277, 'samples': 16799616, 'steps': 87497, 'loss/train': 2.4246695041656494} 11/07/2021 09:30:44 - INFO - __main__ - Step 87499: {'lr': 0.00018958703901267304, 'samples': 16799808, 'steps': 87498, 'loss/train': 1.4808647632598877} 11/07/2021 09:30:45 - INFO - __main__ - Step 87500: {'lr': 0.00018958188955782446, 'samples': 16800000, 'steps': 87499, 'loss/train': 1.4935681819915771} 11/07/2021 09:30:45 - INFO - __main__ - Step 87501: {'lr': 0.00018957674013019937, 'samples': 16800192, 'steps': 87500, 'loss/train': 1.6227697134017944} 11/07/2021 09:30:45 - INFO - __main__ - Step 87502: {'lr': 0.00018957159072980004, 'samples': 16800384, 'steps': 87501, 'loss/train': 0.8364802002906799} 11/07/2021 09:30:46 - INFO - __main__ - Step 87503: {'lr': 0.00018956644135662896, 'samples': 16800576, 'steps': 87502, 'loss/train': 1.0528757572174072} 11/07/2021 09:30:47 - INFO - __main__ - Step 87504: {'lr': 0.00018956129201068818, 'samples': 16800768, 'steps': 87503, 'loss/train': 1.608440637588501} 11/07/2021 09:30:47 - INFO - __main__ - Step 87505: {'lr': 0.00018955614269198012, 'samples': 16800960, 'steps': 87504, 'loss/train': 1.5888888835906982} 11/07/2021 09:30:48 - INFO - __main__ - Step 87506: {'lr': 0.0001895509934005072, 'samples': 16801152, 'steps': 87505, 'loss/train': 1.3391987085342407} 11/07/2021 09:30:48 - INFO - __main__ - Step 87507: {'lr': 0.00018954584413627163, 'samples': 16801344, 'steps': 87506, 'loss/train': 1.4495549201965332} 11/07/2021 09:30:48 - INFO - __main__ - Step 87508: {'lr': 0.00018954069489927574, 'samples': 16801536, 'steps': 87507, 'loss/train': 0.6459611058235168} 11/07/2021 09:30:49 - INFO - __main__ - Step 87509: {'lr': 0.00018953554568952192, 'samples': 16801728, 'steps': 87508, 'loss/train': 1.1389391422271729} 11/07/2021 09:30:50 - INFO - __main__ - Step 87510: {'lr': 0.00018953039650701243, 'samples': 16801920, 'steps': 87509, 'loss/train': 1.3832409381866455} 11/07/2021 09:30:50 - INFO - __main__ - Step 87511: {'lr': 0.00018952524735174964, 'samples': 16802112, 'steps': 87510, 'loss/train': 1.5661039352416992} 11/07/2021 09:30:51 - INFO - __main__ - Step 87512: {'lr': 0.0001895200982237358, 'samples': 16802304, 'steps': 87511, 'loss/train': 1.300301432609558} 11/07/2021 09:30:51 - INFO - __main__ - Step 87513: {'lr': 0.00018951494912297328, 'samples': 16802496, 'steps': 87512, 'loss/train': 1.3352950811386108} 11/07/2021 09:30:51 - INFO - __main__ - Step 87514: {'lr': 0.00018950980004946443, 'samples': 16802688, 'steps': 87513, 'loss/train': 1.5349894762039185} 11/07/2021 09:30:52 - INFO - __main__ - Step 87515: {'lr': 0.0001895046510032115, 'samples': 16802880, 'steps': 87514, 'loss/train': 1.209999918937683} 11/07/2021 09:30:53 - INFO - __main__ - Step 87516: {'lr': 0.00018949950198421684, 'samples': 16803072, 'steps': 87515, 'loss/train': 1.2405470609664917} 11/07/2021 09:30:53 - INFO - __main__ - Step 87517: {'lr': 0.00018949435299248289, 'samples': 16803264, 'steps': 87516, 'loss/train': 1.671390414237976} 11/07/2021 09:30:53 - INFO - __main__ - Step 87518: {'lr': 0.00018948920402801173, 'samples': 16803456, 'steps': 87517, 'loss/train': 1.2570751905441284} 11/07/2021 09:30:54 - INFO - __main__ - Step 87519: {'lr': 0.0001894840550908058, 'samples': 16803648, 'steps': 87518, 'loss/train': 1.5065964460372925} 11/07/2021 09:30:55 - INFO - __main__ - Step 87520: {'lr': 0.00018947890618086744, 'samples': 16803840, 'steps': 87519, 'loss/train': 1.6976615190505981} 11/07/2021 09:30:55 - INFO - __main__ - Step 87521: {'lr': 0.00018947375729819893, 'samples': 16804032, 'steps': 87520, 'loss/train': 1.7986007928848267} 11/07/2021 09:30:55 - INFO - __main__ - Step 87522: {'lr': 0.0001894686084428026, 'samples': 16804224, 'steps': 87521, 'loss/train': 1.3231313228607178} 11/07/2021 09:30:56 - INFO - __main__ - Step 87523: {'lr': 0.0001894634596146808, 'samples': 16804416, 'steps': 87522, 'loss/train': 1.4210015535354614} 11/07/2021 09:30:56 - INFO - __main__ - Step 87524: {'lr': 0.00018945831081383587, 'samples': 16804608, 'steps': 87523, 'loss/train': 1.368979811668396} 11/07/2021 09:30:57 - INFO - __main__ - Step 87525: {'lr': 0.00018945316204027003, 'samples': 16804800, 'steps': 87524, 'loss/train': 1.453467845916748} 11/07/2021 09:30:58 - INFO - __main__ - Step 87526: {'lr': 0.00018944801329398567, 'samples': 16804992, 'steps': 87525, 'loss/train': 1.6187306642532349} 11/07/2021 09:30:58 - INFO - __main__ - Step 87527: {'lr': 0.0001894428645749851, 'samples': 16805184, 'steps': 87526, 'loss/train': 1.1844656467437744} 11/07/2021 09:30:58 - INFO - __main__ - Step 87528: {'lr': 0.00018943771588327067, 'samples': 16805376, 'steps': 87527, 'loss/train': 0.7370353937149048} 11/07/2021 09:30:59 - INFO - __main__ - Step 87529: {'lr': 0.00018943256721884466, 'samples': 16805568, 'steps': 87528, 'loss/train': 1.597265362739563} 11/07/2021 09:31:00 - INFO - __main__ - Step 87530: {'lr': 0.0001894274185817095, 'samples': 16805760, 'steps': 87529, 'loss/train': 0.6133521199226379} 11/07/2021 09:31:00 - INFO - __main__ - Step 87531: {'lr': 0.00018942226997186728, 'samples': 16805952, 'steps': 87530, 'loss/train': 0.47946053743362427} 11/07/2021 09:31:00 - INFO - __main__ - Step 87532: {'lr': 0.0001894171213893205, 'samples': 16806144, 'steps': 87531, 'loss/train': 1.5409263372421265} 11/07/2021 09:31:01 - INFO - __main__ - Step 87533: {'lr': 0.00018941197283407142, 'samples': 16806336, 'steps': 87532, 'loss/train': 0.9093905091285706} 11/07/2021 09:31:01 - INFO - __main__ - Step 87534: {'lr': 0.00018940682430612234, 'samples': 16806528, 'steps': 87533, 'loss/train': 1.3007220029830933} 11/07/2021 09:31:02 - INFO - __main__ - Step 87535: {'lr': 0.00018940167580547564, 'samples': 16806720, 'steps': 87534, 'loss/train': 1.3286961317062378} 11/07/2021 09:31:02 - INFO - __main__ - Step 87536: {'lr': 0.0001893965273321336, 'samples': 16806912, 'steps': 87535, 'loss/train': 1.6465165615081787} 11/07/2021 09:31:03 - INFO - __main__ - Step 87537: {'lr': 0.00018939137888609854, 'samples': 16807104, 'steps': 87536, 'loss/train': 1.6055076122283936} 11/07/2021 09:31:03 - INFO - __main__ - Step 87538: {'lr': 0.00018938623046737277, 'samples': 16807296, 'steps': 87537, 'loss/train': 1.4170821905136108} 11/07/2021 09:31:03 - INFO - __main__ - Step 87539: {'lr': 0.00018938108207595865, 'samples': 16807488, 'steps': 87538, 'loss/train': 0.5554360747337341} 11/07/2021 09:31:04 - INFO - __main__ - Step 87540: {'lr': 0.0001893759337118585, 'samples': 16807680, 'steps': 87539, 'loss/train': 1.7607008218765259} 11/07/2021 09:31:05 - INFO - __main__ - Step 87541: {'lr': 0.0001893707853750746, 'samples': 16807872, 'steps': 87540, 'loss/train': 1.4495689868927002} 11/07/2021 09:31:05 - INFO - __main__ - Step 87542: {'lr': 0.00018936563706560926, 'samples': 16808064, 'steps': 87541, 'loss/train': 1.4861233234405518} 11/07/2021 09:31:06 - INFO - __main__ - Step 87543: {'lr': 0.0001893604887834649, 'samples': 16808256, 'steps': 87542, 'loss/train': 1.4923449754714966} 11/07/2021 09:31:06 - INFO - __main__ - Step 87544: {'lr': 0.00018935534052864385, 'samples': 16808448, 'steps': 87543, 'loss/train': 1.2796064615249634} 11/07/2021 09:31:06 - INFO - __main__ - Step 87545: {'lr': 0.0001893501923011482, 'samples': 16808640, 'steps': 87544, 'loss/train': 1.5951350927352905} 11/07/2021 09:31:08 - INFO - __main__ - Step 87546: {'lr': 0.00018934504410098043, 'samples': 16808832, 'steps': 87545, 'loss/train': 1.267911434173584} 11/07/2021 09:31:08 - INFO - __main__ - Step 87547: {'lr': 0.00018933989592814288, 'samples': 16809024, 'steps': 87546, 'loss/train': 1.5279954671859741} 11/07/2021 09:31:09 - INFO - __main__ - Step 87548: {'lr': 0.00018933474778263783, 'samples': 16809216, 'steps': 87547, 'loss/train': 1.1015442609786987} 11/07/2021 09:31:09 - INFO - __main__ - Step 87549: {'lr': 0.00018932959966446757, 'samples': 16809408, 'steps': 87548, 'loss/train': 1.8190935850143433} 11/07/2021 09:31:09 - INFO - __main__ - Step 87550: {'lr': 0.0001893244515736345, 'samples': 16809600, 'steps': 87549, 'loss/train': 1.793216347694397} 11/07/2021 09:31:10 - INFO - __main__ - Step 87551: {'lr': 0.00018931930351014084, 'samples': 16809792, 'steps': 87550, 'loss/train': 1.2997641563415527} 11/07/2021 09:31:11 - INFO - __main__ - Step 87552: {'lr': 0.000189314155473989, 'samples': 16809984, 'steps': 87551, 'loss/train': 1.2748576402664185} 11/07/2021 09:31:11 - INFO - __main__ - Step 87553: {'lr': 0.00018930900746518125, 'samples': 16810176, 'steps': 87552, 'loss/train': 1.0737392902374268} 11/07/2021 09:31:11 - INFO - __main__ - Step 87554: {'lr': 0.00018930385948371997, 'samples': 16810368, 'steps': 87553, 'loss/train': 1.1827291250228882} 11/07/2021 09:31:12 - INFO - __main__ - Step 87555: {'lr': 0.00018929871152960739, 'samples': 16810560, 'steps': 87554, 'loss/train': 1.2315627336502075} 11/07/2021 09:31:12 - INFO - __main__ - Step 87556: {'lr': 0.0001892935636028459, 'samples': 16810752, 'steps': 87555, 'loss/train': 1.3229426145553589} 11/07/2021 09:31:13 - INFO - __main__ - Step 87557: {'lr': 0.0001892884157034379, 'samples': 16810944, 'steps': 87556, 'loss/train': 1.553268313407898} 11/07/2021 09:31:14 - INFO - __main__ - Step 87558: {'lr': 0.00018928326783138546, 'samples': 16811136, 'steps': 87557, 'loss/train': 1.5456312894821167} 11/07/2021 09:31:14 - INFO - __main__ - Step 87559: {'lr': 0.0001892781199866911, 'samples': 16811328, 'steps': 87558, 'loss/train': 1.7337417602539062} 11/07/2021 09:31:14 - INFO - __main__ - Step 87560: {'lr': 0.00018927297216935702, 'samples': 16811520, 'steps': 87559, 'loss/train': 1.730574607849121} 11/07/2021 09:31:15 - INFO - __main__ - Step 87561: {'lr': 0.00018926782437938563, 'samples': 16811712, 'steps': 87560, 'loss/train': 0.24542713165283203} 11/07/2021 09:31:15 - INFO - __main__ - Step 87562: {'lr': 0.00018926267661677923, 'samples': 16811904, 'steps': 87561, 'loss/train': 1.9359102249145508} 11/07/2021 09:31:16 - INFO - __main__ - Step 87563: {'lr': 0.00018925752888154012, 'samples': 16812096, 'steps': 87562, 'loss/train': 1.59926176071167} 11/07/2021 09:31:17 - INFO - __main__ - Step 87564: {'lr': 0.00018925238117367064, 'samples': 16812288, 'steps': 87563, 'loss/train': 1.7018932104110718} 11/07/2021 09:31:17 - INFO - __main__ - Step 87565: {'lr': 0.00018924723349317306, 'samples': 16812480, 'steps': 87564, 'loss/train': 0.2851230502128601} 11/07/2021 09:31:17 - INFO - __main__ - Step 87566: {'lr': 0.0001892420858400498, 'samples': 16812672, 'steps': 87565, 'loss/train': 1.3000402450561523} 11/07/2021 09:31:18 - INFO - __main__ - Step 87567: {'lr': 0.0001892369382143031, 'samples': 16812864, 'steps': 87566, 'loss/train': 1.0292199850082397} 11/07/2021 09:31:19 - INFO - __main__ - Step 87568: {'lr': 0.0001892317906159353, 'samples': 16813056, 'steps': 87567, 'loss/train': 1.5703580379486084} 11/07/2021 09:31:19 - INFO - __main__ - Step 87569: {'lr': 0.0001892266430449487, 'samples': 16813248, 'steps': 87568, 'loss/train': 1.62779700756073} 11/07/2021 09:31:20 - INFO - __main__ - Step 87570: {'lr': 0.00018922149550134568, 'samples': 16813440, 'steps': 87569, 'loss/train': 1.380131483078003} 11/07/2021 09:31:20 - INFO - __main__ - Step 87571: {'lr': 0.00018921634798512853, 'samples': 16813632, 'steps': 87570, 'loss/train': 1.2636533975601196} 11/07/2021 09:31:20 - INFO - __main__ - Step 87572: {'lr': 0.00018921120049629952, 'samples': 16813824, 'steps': 87571, 'loss/train': 1.181662678718567} 11/07/2021 09:31:21 - INFO - __main__ - Step 87573: {'lr': 0.00018920605303486099, 'samples': 16814016, 'steps': 87572, 'loss/train': 1.475786566734314} 11/07/2021 09:31:22 - INFO - __main__ - Step 87574: {'lr': 0.00018920090560081528, 'samples': 16814208, 'steps': 87573, 'loss/train': 1.8833534717559814} 11/07/2021 09:31:22 - INFO - __main__ - Step 87575: {'lr': 0.00018919575819416467, 'samples': 16814400, 'steps': 87574, 'loss/train': 1.380942940711975} 11/07/2021 09:31:22 - INFO - __main__ - Step 87576: {'lr': 0.00018919061081491156, 'samples': 16814592, 'steps': 87575, 'loss/train': 2.0873677730560303} 11/07/2021 09:31:23 - INFO - __main__ - Step 87577: {'lr': 0.0001891854634630582, 'samples': 16814784, 'steps': 87576, 'loss/train': 1.424631118774414} 11/07/2021 09:31:23 - INFO - __main__ - Step 87578: {'lr': 0.0001891803161386069, 'samples': 16814976, 'steps': 87577, 'loss/train': 1.4885270595550537} 11/07/2021 09:31:24 - INFO - __main__ - Step 87579: {'lr': 0.00018917516884156007, 'samples': 16815168, 'steps': 87578, 'loss/train': 1.4785261154174805} 11/07/2021 09:31:25 - INFO - __main__ - Step 87580: {'lr': 0.00018917002157191996, 'samples': 16815360, 'steps': 87579, 'loss/train': 1.7874751091003418} 11/07/2021 09:31:25 - INFO - __main__ - Step 87581: {'lr': 0.00018916487432968894, 'samples': 16815552, 'steps': 87580, 'loss/train': 1.0268372297286987} 11/07/2021 09:31:25 - INFO - __main__ - Step 87582: {'lr': 0.00018915972711486923, 'samples': 16815744, 'steps': 87581, 'loss/train': 1.4291926622390747} 11/07/2021 09:31:26 - INFO - __main__ - Step 87583: {'lr': 0.0001891545799274632, 'samples': 16815936, 'steps': 87582, 'loss/train': 1.567251443862915} 11/07/2021 09:31:27 - INFO - __main__ - Step 87584: {'lr': 0.00018914943276747325, 'samples': 16816128, 'steps': 87583, 'loss/train': 1.5468658208847046} 11/07/2021 09:31:27 - INFO - __main__ - Step 87585: {'lr': 0.00018914428563490159, 'samples': 16816320, 'steps': 87584, 'loss/train': 1.345381736755371} 11/07/2021 09:31:27 - INFO - __main__ - Step 87586: {'lr': 0.00018913913852975053, 'samples': 16816512, 'steps': 87585, 'loss/train': 1.5365444421768188} 11/07/2021 09:31:28 - INFO - __main__ - Step 87587: {'lr': 0.00018913399145202247, 'samples': 16816704, 'steps': 87586, 'loss/train': 1.3736629486083984} 11/07/2021 09:31:28 - INFO - __main__ - Step 87588: {'lr': 0.00018912884440171968, 'samples': 16816896, 'steps': 87587, 'loss/train': 1.6760751008987427} 11/07/2021 09:31:29 - INFO - __main__ - Step 87589: {'lr': 0.0001891236973788445, 'samples': 16817088, 'steps': 87588, 'loss/train': 1.1032978296279907} 11/07/2021 09:31:29 - INFO - __main__ - Step 87590: {'lr': 0.00018911855038339923, 'samples': 16817280, 'steps': 87589, 'loss/train': 1.5697247982025146} 11/07/2021 09:31:30 - INFO - __main__ - Step 87591: {'lr': 0.00018911340341538622, 'samples': 16817472, 'steps': 87590, 'loss/train': 1.1149591207504272} 11/07/2021 09:31:30 - INFO - __main__ - Step 87592: {'lr': 0.00018910825647480781, 'samples': 16817664, 'steps': 87591, 'loss/train': 1.7412071228027344} 11/07/2021 09:31:30 - INFO - __main__ - Step 87593: {'lr': 0.00018910310956166623, 'samples': 16817856, 'steps': 87592, 'loss/train': 1.7825294733047485} 11/07/2021 09:31:31 - INFO - __main__ - Step 87594: {'lr': 0.00018909796267596384, 'samples': 16818048, 'steps': 87593, 'loss/train': 1.0741299390792847} 11/07/2021 09:31:32 - INFO - __main__ - Step 87595: {'lr': 0.000189092815817703, 'samples': 16818240, 'steps': 87594, 'loss/train': 1.3797544240951538} 11/07/2021 09:31:32 - INFO - __main__ - Step 87596: {'lr': 0.00018908766898688596, 'samples': 16818432, 'steps': 87595, 'loss/train': 1.5162698030471802} 11/07/2021 09:31:32 - INFO - __main__ - Step 87597: {'lr': 0.0001890825221835151, 'samples': 16818624, 'steps': 87596, 'loss/train': 1.1507065296173096} 11/07/2021 09:31:33 - INFO - __main__ - Step 87598: {'lr': 0.00018907737540759277, 'samples': 16818816, 'steps': 87597, 'loss/train': 1.4716321229934692} 11/07/2021 09:31:33 - INFO - __main__ - Step 87599: {'lr': 0.00018907222865912116, 'samples': 16819008, 'steps': 87598, 'loss/train': 0.7957650423049927} 11/07/2021 09:31:34 - INFO - __main__ - Step 87600: {'lr': 0.0001890670819381027, 'samples': 16819200, 'steps': 87599, 'loss/train': 1.952478051185608} 11/07/2021 09:31:34 - INFO - __main__ - Step 87601: {'lr': 0.00018906193524453963, 'samples': 16819392, 'steps': 87600, 'loss/train': 1.3845441341400146} 11/07/2021 09:31:35 - INFO - __main__ - Step 87602: {'lr': 0.00018905678857843432, 'samples': 16819584, 'steps': 87601, 'loss/train': 1.1825910806655884} 11/07/2021 09:31:35 - INFO - __main__ - Step 87603: {'lr': 0.00018905164193978914, 'samples': 16819776, 'steps': 87602, 'loss/train': 1.4599493741989136} 11/07/2021 09:31:36 - INFO - __main__ - Step 87604: {'lr': 0.0001890464953286063, 'samples': 16819968, 'steps': 87603, 'loss/train': 1.5767253637313843} 11/07/2021 09:31:37 - INFO - __main__ - Step 87605: {'lr': 0.00018904134874488817, 'samples': 16820160, 'steps': 87604, 'loss/train': 1.4221739768981934} 11/07/2021 09:31:37 - INFO - __main__ - Step 87606: {'lr': 0.00018903620218863707, 'samples': 16820352, 'steps': 87605, 'loss/train': 1.680535078048706} 11/07/2021 09:31:37 - INFO - __main__ - Step 87607: {'lr': 0.00018903105565985532, 'samples': 16820544, 'steps': 87606, 'loss/train': 1.2219549417495728} 11/07/2021 09:31:38 - INFO - __main__ - Step 87608: {'lr': 0.00018902590915854521, 'samples': 16820736, 'steps': 87607, 'loss/train': 1.597761869430542} 11/07/2021 09:31:38 - INFO - __main__ - Step 87609: {'lr': 0.00018902076268470912, 'samples': 16820928, 'steps': 87608, 'loss/train': 1.3794496059417725} 11/07/2021 09:31:38 - INFO - __main__ - Step 87610: {'lr': 0.0001890156162383493, 'samples': 16821120, 'steps': 87609, 'loss/train': 1.6116551160812378} 11/07/2021 09:31:39 - INFO - __main__ - Step 87611: {'lr': 0.00018901046981946817, 'samples': 16821312, 'steps': 87610, 'loss/train': 2.750842809677124} 11/07/2021 09:31:40 - INFO - __main__ - Step 87612: {'lr': 0.00018900532342806795, 'samples': 16821504, 'steps': 87611, 'loss/train': 0.4909421503543854} 11/07/2021 09:31:40 - INFO - __main__ - Step 87613: {'lr': 0.00018900017706415095, 'samples': 16821696, 'steps': 87612, 'loss/train': 1.2453116178512573} 11/07/2021 09:31:40 - INFO - __main__ - Step 87614: {'lr': 0.00018899503072771962, 'samples': 16821888, 'steps': 87613, 'loss/train': 1.1198937892913818} 11/07/2021 09:31:41 - INFO - __main__ - Step 87615: {'lr': 0.00018898988441877612, 'samples': 16822080, 'steps': 87614, 'loss/train': 1.7990916967391968} 11/07/2021 09:31:42 - INFO - __main__ - Step 87616: {'lr': 0.0001889847381373228, 'samples': 16822272, 'steps': 87615, 'loss/train': 0.6257501840591431} 11/07/2021 09:31:42 - INFO - __main__ - Step 87617: {'lr': 0.00018897959188336205, 'samples': 16822464, 'steps': 87616, 'loss/train': 1.2839579582214355} 11/07/2021 09:31:42 - INFO - __main__ - Step 87618: {'lr': 0.00018897444565689616, 'samples': 16822656, 'steps': 87617, 'loss/train': 1.4985295534133911} 11/07/2021 09:31:43 - INFO - __main__ - Step 87619: {'lr': 0.00018896929945792746, 'samples': 16822848, 'steps': 87618, 'loss/train': 1.5756292343139648} 11/07/2021 09:31:43 - INFO - __main__ - Step 87620: {'lr': 0.00018896415328645822, 'samples': 16823040, 'steps': 87619, 'loss/train': 1.7068454027175903} 11/07/2021 09:31:44 - INFO - __main__ - Step 87621: {'lr': 0.0001889590071424908, 'samples': 16823232, 'steps': 87620, 'loss/train': 0.9329846501350403} 11/07/2021 09:31:44 - INFO - __main__ - Step 87622: {'lr': 0.00018895386102602753, 'samples': 16823424, 'steps': 87621, 'loss/train': 1.2801212072372437} 11/07/2021 09:31:45 - INFO - __main__ - Step 87623: {'lr': 0.00018894871493707065, 'samples': 16823616, 'steps': 87622, 'loss/train': 1.10739004611969} 11/07/2021 09:31:45 - INFO - __main__ - Step 87624: {'lr': 0.0001889435688756226, 'samples': 16823808, 'steps': 87623, 'loss/train': 1.2835768461227417} 11/07/2021 09:31:46 - INFO - __main__ - Step 87625: {'lr': 0.00018893842284168572, 'samples': 16824000, 'steps': 87624, 'loss/train': 0.8804445266723633} 11/07/2021 09:31:46 - INFO - __main__ - Step 87626: {'lr': 0.0001889332768352621, 'samples': 16824192, 'steps': 87625, 'loss/train': 1.5112004280090332} 11/07/2021 09:31:47 - INFO - __main__ - Step 87627: {'lr': 0.00018892813085635425, 'samples': 16824384, 'steps': 87626, 'loss/train': 1.502776026725769} 11/07/2021 09:31:47 - INFO - __main__ - Step 87628: {'lr': 0.00018892298490496442, 'samples': 16824576, 'steps': 87627, 'loss/train': 1.7938487529754639} 11/07/2021 09:31:48 - INFO - __main__ - Step 87629: {'lr': 0.00018891783898109497, 'samples': 16824768, 'steps': 87628, 'loss/train': 1.5909361839294434} 11/07/2021 09:31:48 - INFO - __main__ - Step 87630: {'lr': 0.00018891269308474819, 'samples': 16824960, 'steps': 87629, 'loss/train': 1.4239420890808105} 11/07/2021 09:31:49 - INFO - __main__ - Step 87631: {'lr': 0.0001889075472159264, 'samples': 16825152, 'steps': 87630, 'loss/train': 2.1834747791290283} 11/07/2021 09:31:49 - INFO - __main__ - Step 87632: {'lr': 0.00018890240137463195, 'samples': 16825344, 'steps': 87631, 'loss/train': 1.6022824048995972} 11/07/2021 09:31:50 - INFO - __main__ - Step 87633: {'lr': 0.0001888972555608671, 'samples': 16825536, 'steps': 87632, 'loss/train': 0.714360237121582} 11/07/2021 09:31:50 - INFO - __main__ - Step 87634: {'lr': 0.00018889210977463423, 'samples': 16825728, 'steps': 87633, 'loss/train': 1.227272868156433} 11/07/2021 09:31:50 - INFO - __main__ - Step 87635: {'lr': 0.00018888696401593563, 'samples': 16825920, 'steps': 87634, 'loss/train': 1.278175711631775} 11/07/2021 09:31:51 - INFO - __main__ - Step 87636: {'lr': 0.0001888818182847736, 'samples': 16826112, 'steps': 87635, 'loss/train': 1.2890071868896484} 11/07/2021 09:31:52 - INFO - __main__ - Step 87637: {'lr': 0.00018887667258115048, 'samples': 16826304, 'steps': 87636, 'loss/train': 0.9704949259757996} 11/07/2021 09:31:52 - INFO - __main__ - Step 87638: {'lr': 0.00018887152690506872, 'samples': 16826496, 'steps': 87637, 'loss/train': 0.8678407073020935} 11/07/2021 09:31:52 - INFO - __main__ - Step 87639: {'lr': 0.00018886638125653038, 'samples': 16826688, 'steps': 87638, 'loss/train': 2.3527021408081055} 11/07/2021 09:31:53 - INFO - __main__ - Step 87640: {'lr': 0.00018886123563553793, 'samples': 16826880, 'steps': 87639, 'loss/train': 1.042942762374878} 11/07/2021 09:31:53 - INFO - __main__ - Step 87641: {'lr': 0.00018885609004209365, 'samples': 16827072, 'steps': 87640, 'loss/train': 1.7874603271484375} 11/07/2021 09:31:54 - INFO - __main__ - Step 87642: {'lr': 0.0001888509444761999, 'samples': 16827264, 'steps': 87641, 'loss/train': 0.09232310205698013} 11/07/2021 09:31:55 - INFO - __main__ - Step 87643: {'lr': 0.00018884579893785892, 'samples': 16827456, 'steps': 87642, 'loss/train': 1.4284950494766235} 11/07/2021 09:31:55 - INFO - __main__ - Step 87644: {'lr': 0.0001888406534270731, 'samples': 16827648, 'steps': 87643, 'loss/train': 1.1972404718399048} 11/07/2021 09:31:55 - INFO - __main__ - Step 87645: {'lr': 0.00018883550794384474, 'samples': 16827840, 'steps': 87644, 'loss/train': 1.6125385761260986} 11/07/2021 09:31:56 - INFO - __main__ - Step 87646: {'lr': 0.00018883036248817613, 'samples': 16828032, 'steps': 87645, 'loss/train': 1.538163423538208} 11/07/2021 09:31:57 - INFO - __main__ - Step 87647: {'lr': 0.00018882521706006967, 'samples': 16828224, 'steps': 87646, 'loss/train': 1.3541375398635864} 11/07/2021 09:31:57 - INFO - __main__ - Step 87648: {'lr': 0.0001888200716595276, 'samples': 16828416, 'steps': 87647, 'loss/train': 1.525252342224121} 11/07/2021 09:31:58 - INFO - __main__ - Step 87649: {'lr': 0.00018881492628655222, 'samples': 16828608, 'steps': 87648, 'loss/train': 1.2365227937698364} 11/07/2021 09:31:58 - INFO - __main__ - Step 87650: {'lr': 0.0001888097809411459, 'samples': 16828800, 'steps': 87649, 'loss/train': 1.17494797706604} 11/07/2021 09:31:58 - INFO - __main__ - Step 87651: {'lr': 0.00018880463562331114, 'samples': 16828992, 'steps': 87650, 'loss/train': 1.4423420429229736} 11/07/2021 09:31:59 - INFO - __main__ - Step 87652: {'lr': 0.00018879949033304987, 'samples': 16829184, 'steps': 87651, 'loss/train': 1.204132080078125} 11/07/2021 09:32:00 - INFO - __main__ - Step 87653: {'lr': 0.00018879434507036464, 'samples': 16829376, 'steps': 87652, 'loss/train': 1.2859870195388794} 11/07/2021 09:32:00 - INFO - __main__ - Step 87654: {'lr': 0.00018878919983525771, 'samples': 16829568, 'steps': 87653, 'loss/train': 0.7896373271942139} 11/07/2021 09:32:00 - INFO - __main__ - Step 87655: {'lr': 0.00018878405462773146, 'samples': 16829760, 'steps': 87654, 'loss/train': 1.5702450275421143} 11/07/2021 09:32:01 - INFO - __main__ - Step 87656: {'lr': 0.00018877890944778814, 'samples': 16829952, 'steps': 87655, 'loss/train': 1.0358927249908447} 11/07/2021 09:32:01 - INFO - __main__ - Step 87657: {'lr': 0.00018877376429543013, 'samples': 16830144, 'steps': 87656, 'loss/train': 1.7820310592651367} 11/07/2021 09:32:02 - INFO - __main__ - Step 87658: {'lr': 0.0001887686191706597, 'samples': 16830336, 'steps': 87657, 'loss/train': 1.338318109512329} 11/07/2021 09:32:02 - INFO - __main__ - Step 87659: {'lr': 0.00018876347407347914, 'samples': 16830528, 'steps': 87658, 'loss/train': 1.3882381916046143} 11/07/2021 09:32:03 - INFO - __main__ - Step 87660: {'lr': 0.0001887583290038909, 'samples': 16830720, 'steps': 87659, 'loss/train': 1.684685230255127} 11/07/2021 09:32:03 - INFO - __main__ - Step 87661: {'lr': 0.00018875318396189718, 'samples': 16830912, 'steps': 87660, 'loss/train': 1.490196704864502} 11/07/2021 09:32:03 - INFO - __main__ - Step 87662: {'lr': 0.0001887480389475003, 'samples': 16831104, 'steps': 87661, 'loss/train': 1.5554206371307373} 11/07/2021 09:32:05 - INFO - __main__ - Step 87663: {'lr': 0.00018874289396070263, 'samples': 16831296, 'steps': 87662, 'loss/train': 1.078384518623352} 11/07/2021 09:32:05 - INFO - __main__ - Step 87664: {'lr': 0.00018873774900150645, 'samples': 16831488, 'steps': 87663, 'loss/train': 1.5894957780838013} 11/07/2021 09:32:05 - INFO - __main__ - Step 87665: {'lr': 0.00018873260406991423, 'samples': 16831680, 'steps': 87664, 'loss/train': 0.9337233901023865} 11/07/2021 09:32:06 - INFO - __main__ - Step 87666: {'lr': 0.00018872745916592804, 'samples': 16831872, 'steps': 87665, 'loss/train': 1.147174596786499} 11/07/2021 09:32:06 - INFO - __main__ - Step 87667: {'lr': 0.00018872231428955028, 'samples': 16832064, 'steps': 87666, 'loss/train': 1.4156078100204468} 11/07/2021 09:32:07 - INFO - __main__ - Step 87668: {'lr': 0.00018871716944078332, 'samples': 16832256, 'steps': 87667, 'loss/train': 0.6516605019569397} 11/07/2021 09:32:07 - INFO - __main__ - Step 87669: {'lr': 0.00018871202461962947, 'samples': 16832448, 'steps': 87668, 'loss/train': 2.1555333137512207} 11/07/2021 09:32:08 - INFO - __main__ - Step 87670: {'lr': 0.00018870687982609102, 'samples': 16832640, 'steps': 87669, 'loss/train': 0.6132200956344604} 11/07/2021 09:32:08 - INFO - __main__ - Step 87671: {'lr': 0.0001887017350601703, 'samples': 16832832, 'steps': 87670, 'loss/train': 1.5329598188400269} 11/07/2021 09:32:08 - INFO - __main__ - Step 87672: {'lr': 0.00018869659032186964, 'samples': 16833024, 'steps': 87671, 'loss/train': 0.760084331035614} 11/07/2021 09:32:09 - INFO - __main__ - Step 87673: {'lr': 0.00018869144561119137, 'samples': 16833216, 'steps': 87672, 'loss/train': 1.407099962234497} 11/07/2021 09:32:10 - INFO - __main__ - Step 87674: {'lr': 0.00018868630092813777, 'samples': 16833408, 'steps': 87673, 'loss/train': 1.4060369729995728} 11/07/2021 09:32:10 - INFO - __main__ - Step 87675: {'lr': 0.00018868115627271117, 'samples': 16833600, 'steps': 87674, 'loss/train': 1.4379862546920776} 11/07/2021 09:32:11 - INFO - __main__ - Step 87676: {'lr': 0.0001886760116449139, 'samples': 16833792, 'steps': 87675, 'loss/train': 1.1372984647750854} 11/07/2021 09:32:11 - INFO - __main__ - Step 87677: {'lr': 0.00018867086704474828, 'samples': 16833984, 'steps': 87676, 'loss/train': 0.9466671943664551} 11/07/2021 09:32:11 - INFO - __main__ - Step 87678: {'lr': 0.00018866572247221676, 'samples': 16834176, 'steps': 87677, 'loss/train': 0.8349578380584717} 11/07/2021 09:32:12 - INFO - __main__ - Step 87679: {'lr': 0.00018866057792732137, 'samples': 16834368, 'steps': 87678, 'loss/train': 1.4909617900848389} 11/07/2021 09:32:13 - INFO - __main__ - Step 87680: {'lr': 0.0001886554334100646, 'samples': 16834560, 'steps': 87679, 'loss/train': 1.8294833898544312} 11/07/2021 09:32:13 - INFO - __main__ - Step 87681: {'lr': 0.0001886502889204487, 'samples': 16834752, 'steps': 87680, 'loss/train': 1.8077119588851929} 11/07/2021 09:32:13 - INFO - __main__ - Step 87682: {'lr': 0.00018864514445847606, 'samples': 16834944, 'steps': 87681, 'loss/train': 1.1724169254302979} 11/07/2021 09:32:14 - INFO - __main__ - Step 87683: {'lr': 0.00018864000002414896, 'samples': 16835136, 'steps': 87682, 'loss/train': 1.3996162414550781} 11/07/2021 09:32:15 - INFO - __main__ - Step 87684: {'lr': 0.00018863485561746975, 'samples': 16835328, 'steps': 87683, 'loss/train': 1.8429689407348633} 11/07/2021 09:32:15 - INFO - __main__ - Step 87685: {'lr': 0.00018862971123844073, 'samples': 16835520, 'steps': 87684, 'loss/train': 1.3443268537521362} 11/07/2021 09:32:15 - INFO - __main__ - Step 87686: {'lr': 0.0001886245668870642, 'samples': 16835712, 'steps': 87685, 'loss/train': 1.5984947681427002} 11/07/2021 09:32:16 - INFO - __main__ - Step 87687: {'lr': 0.0001886194225633425, 'samples': 16835904, 'steps': 87686, 'loss/train': 1.514371395111084} 11/07/2021 09:32:16 - INFO - __main__ - Step 87688: {'lr': 0.00018861427826727793, 'samples': 16836096, 'steps': 87687, 'loss/train': 1.5080231428146362} 11/07/2021 09:32:18 - INFO - __main__ - Step 87689: {'lr': 0.0001886091339988728, 'samples': 16836288, 'steps': 87688, 'loss/train': 1.3922011852264404} 11/07/2021 09:32:18 - INFO - __main__ - Step 87690: {'lr': 0.00018860398975812948, 'samples': 16836480, 'steps': 87689, 'loss/train': 1.4946287870407104} 11/07/2021 09:32:19 - INFO - __main__ - Step 87691: {'lr': 0.00018859884554505026, 'samples': 16836672, 'steps': 87690, 'loss/train': 1.878322720527649} 11/07/2021 09:32:19 - INFO - __main__ - Step 87692: {'lr': 0.00018859370135963755, 'samples': 16836864, 'steps': 87691, 'loss/train': 1.7279317378997803} 11/07/2021 09:32:19 - INFO - __main__ - Step 87693: {'lr': 0.00018858855720189346, 'samples': 16837056, 'steps': 87692, 'loss/train': 1.7414442300796509} 11/07/2021 09:32:20 - INFO - __main__ - Step 87694: {'lr': 0.0001885834130718204, 'samples': 16837248, 'steps': 87693, 'loss/train': 1.7347705364227295} 11/07/2021 09:32:20 - INFO - __main__ - Step 87695: {'lr': 0.00018857826896942077, 'samples': 16837440, 'steps': 87694, 'loss/train': 1.1755839586257935} 11/07/2021 09:32:21 - INFO - __main__ - Step 87696: {'lr': 0.00018857312489469676, 'samples': 16837632, 'steps': 87695, 'loss/train': 1.4528663158416748} 11/07/2021 09:32:21 - INFO - __main__ - Step 87697: {'lr': 0.0001885679808476508, 'samples': 16837824, 'steps': 87696, 'loss/train': 1.4125334024429321} 11/07/2021 09:32:22 - INFO - __main__ - Step 87698: {'lr': 0.00018856283682828514, 'samples': 16838016, 'steps': 87697, 'loss/train': 1.0105583667755127} 11/07/2021 09:32:22 - INFO - __main__ - Step 87699: {'lr': 0.00018855769283660206, 'samples': 16838208, 'steps': 87698, 'loss/train': 1.4567580223083496} 11/07/2021 09:32:22 - INFO - __main__ - Step 87700: {'lr': 0.000188552548872604, 'samples': 16838400, 'steps': 87699, 'loss/train': 1.5732431411743164} 11/07/2021 09:32:24 - INFO - __main__ - Step 87701: {'lr': 0.0001885474049362932, 'samples': 16838592, 'steps': 87700, 'loss/train': 0.9517036080360413} 11/07/2021 09:32:24 - INFO - __main__ - Step 87702: {'lr': 0.000188542261027672, 'samples': 16838784, 'steps': 87701, 'loss/train': 1.4731405973434448} 11/07/2021 09:32:24 - INFO - __main__ - Step 87703: {'lr': 0.0001885371171467427, 'samples': 16838976, 'steps': 87702, 'loss/train': 1.3329191207885742} 11/07/2021 09:32:25 - INFO - __main__ - Step 87704: {'lr': 0.00018853197329350764, 'samples': 16839168, 'steps': 87703, 'loss/train': 1.272467851638794} 11/07/2021 09:32:25 - INFO - __main__ - Step 87705: {'lr': 0.0001885268294679692, 'samples': 16839360, 'steps': 87704, 'loss/train': 1.7050433158874512} 11/07/2021 09:32:25 - INFO - __main__ - Step 87706: {'lr': 0.00018852168567012954, 'samples': 16839552, 'steps': 87705, 'loss/train': 1.5682016611099243} 11/07/2021 09:32:26 - INFO - __main__ - Step 87707: {'lr': 0.00018851654189999103, 'samples': 16839744, 'steps': 87706, 'loss/train': 1.3000297546386719} 11/07/2021 09:32:27 - INFO - __main__ - Step 87708: {'lr': 0.00018851139815755606, 'samples': 16839936, 'steps': 87707, 'loss/train': 1.7255592346191406} 11/07/2021 09:32:27 - INFO - __main__ - Step 87709: {'lr': 0.00018850625444282688, 'samples': 16840128, 'steps': 87708, 'loss/train': 1.6983705759048462} 11/07/2021 09:32:27 - INFO - __main__ - Step 87710: {'lr': 0.00018850111075580583, 'samples': 16840320, 'steps': 87709, 'loss/train': 1.4888808727264404} 11/07/2021 09:32:28 - INFO - __main__ - Step 87711: {'lr': 0.00018849596709649526, 'samples': 16840512, 'steps': 87710, 'loss/train': 1.5838232040405273} 11/07/2021 09:32:29 - INFO - __main__ - Step 87712: {'lr': 0.00018849082346489743, 'samples': 16840704, 'steps': 87711, 'loss/train': 1.596621036529541} 11/07/2021 09:32:29 - INFO - __main__ - Step 87713: {'lr': 0.00018848567986101466, 'samples': 16840896, 'steps': 87712, 'loss/train': 1.6095805168151855} 11/07/2021 09:32:29 - INFO - __main__ - Step 87714: {'lr': 0.00018848053628484936, 'samples': 16841088, 'steps': 87713, 'loss/train': 1.0926035642623901} 11/07/2021 09:32:30 - INFO - __main__ - Step 87715: {'lr': 0.00018847539273640374, 'samples': 16841280, 'steps': 87714, 'loss/train': 1.0141496658325195} 11/07/2021 09:32:30 - INFO - __main__ - Step 87716: {'lr': 0.0001884702492156802, 'samples': 16841472, 'steps': 87715, 'loss/train': 1.9501678943634033} 11/07/2021 09:32:31 - INFO - __main__ - Step 87717: {'lr': 0.000188465105722681, 'samples': 16841664, 'steps': 87716, 'loss/train': 1.8626863956451416} 11/07/2021 09:32:32 - INFO - __main__ - Step 87718: {'lr': 0.00018845996225740844, 'samples': 16841856, 'steps': 87717, 'loss/train': 1.417251467704773} 11/07/2021 09:32:32 - INFO - __main__ - Step 87719: {'lr': 0.00018845481881986495, 'samples': 16842048, 'steps': 87718, 'loss/train': 1.1405240297317505} 11/07/2021 09:32:32 - INFO - __main__ - Step 87720: {'lr': 0.0001884496754100527, 'samples': 16842240, 'steps': 87719, 'loss/train': 1.5666371583938599} 11/07/2021 09:32:33 - INFO - __main__ - Step 87721: {'lr': 0.00018844453202797407, 'samples': 16842432, 'steps': 87720, 'loss/train': 0.8832560777664185} 11/07/2021 09:32:33 - INFO - __main__ - Step 87722: {'lr': 0.0001884393886736314, 'samples': 16842624, 'steps': 87721, 'loss/train': 1.2594363689422607} 11/07/2021 09:32:34 - INFO - __main__ - Step 87723: {'lr': 0.000188434245347027, 'samples': 16842816, 'steps': 87722, 'loss/train': 1.1458797454833984} 11/07/2021 09:32:34 - INFO - __main__ - Step 87724: {'lr': 0.00018842910204816315, 'samples': 16843008, 'steps': 87723, 'loss/train': 1.219563364982605} 11/07/2021 09:32:35 - INFO - __main__ - Step 87725: {'lr': 0.00018842395877704222, 'samples': 16843200, 'steps': 87724, 'loss/train': 1.3875977993011475} 11/07/2021 09:32:35 - INFO - __main__ - Step 87726: {'lr': 0.00018841881553366652, 'samples': 16843392, 'steps': 87725, 'loss/train': 1.3303449153900146} 11/07/2021 09:32:35 - INFO - __main__ - Step 87727: {'lr': 0.00018841367231803836, 'samples': 16843584, 'steps': 87726, 'loss/train': 1.61123788356781} 11/07/2021 09:32:36 - INFO - __main__ - Step 87728: {'lr': 0.00018840852913016, 'samples': 16843776, 'steps': 87727, 'loss/train': 1.2301486730575562} 11/07/2021 09:32:37 - INFO - __main__ - Step 87729: {'lr': 0.00018840338597003384, 'samples': 16843968, 'steps': 87728, 'loss/train': 1.299984335899353} 11/07/2021 09:32:37 - INFO - __main__ - Step 87730: {'lr': 0.00018839824283766216, 'samples': 16844160, 'steps': 87729, 'loss/train': 2.2985572814941406} 11/07/2021 09:32:38 - INFO - __main__ - Step 87731: {'lr': 0.00018839309973304728, 'samples': 16844352, 'steps': 87730, 'loss/train': 1.9144139289855957} 11/07/2021 09:32:38 - INFO - __main__ - Step 87732: {'lr': 0.00018838795665619156, 'samples': 16844544, 'steps': 87731, 'loss/train': 1.196718692779541} 11/07/2021 09:32:39 - INFO - __main__ - Step 87733: {'lr': 0.00018838281360709718, 'samples': 16844736, 'steps': 87732, 'loss/train': 1.8941233158111572} 11/07/2021 09:32:39 - INFO - __main__ - Step 87734: {'lr': 0.00018837767058576662, 'samples': 16844928, 'steps': 87733, 'loss/train': 1.3249666690826416} 11/07/2021 09:32:40 - INFO - __main__ - Step 87735: {'lr': 0.0001883725275922021, 'samples': 16845120, 'steps': 87734, 'loss/train': 1.2033970355987549} 11/07/2021 09:32:40 - INFO - __main__ - Step 87736: {'lr': 0.000188367384626406, 'samples': 16845312, 'steps': 87735, 'loss/train': 1.375185489654541} 11/07/2021 09:32:40 - INFO - __main__ - Step 87737: {'lr': 0.00018836224168838062, 'samples': 16845504, 'steps': 87736, 'loss/train': 1.3840060234069824} 11/07/2021 09:32:41 - INFO - __main__ - Step 87738: {'lr': 0.00018835709877812823, 'samples': 16845696, 'steps': 87737, 'loss/train': 1.2167786359786987} 11/07/2021 09:32:42 - INFO - __main__ - Step 87739: {'lr': 0.0001883519558956512, 'samples': 16845888, 'steps': 87738, 'loss/train': 1.2214643955230713} 11/07/2021 09:32:42 - INFO - __main__ - Step 87740: {'lr': 0.00018834681304095177, 'samples': 16846080, 'steps': 87739, 'loss/train': 1.0750007629394531} 11/07/2021 09:32:42 - INFO - __main__ - Step 87741: {'lr': 0.00018834167021403235, 'samples': 16846272, 'steps': 87740, 'loss/train': 1.5885335206985474} 11/07/2021 09:32:43 - INFO - __main__ - Step 87742: {'lr': 0.0001883365274148952, 'samples': 16846464, 'steps': 87741, 'loss/train': 1.166749358177185} 11/07/2021 09:32:43 - INFO - __main__ - Step 87743: {'lr': 0.00018833138464354268, 'samples': 16846656, 'steps': 87742, 'loss/train': 1.1736130714416504} 11/07/2021 09:32:44 - INFO - __main__ - Step 87744: {'lr': 0.0001883262418999771, 'samples': 16846848, 'steps': 87743, 'loss/train': 1.461266040802002} 11/07/2021 09:32:45 - INFO - __main__ - Step 87745: {'lr': 0.00018832109918420073, 'samples': 16847040, 'steps': 87744, 'loss/train': 1.2612738609313965} 11/07/2021 09:32:45 - INFO - __main__ - Step 87746: {'lr': 0.000188315956496216, 'samples': 16847232, 'steps': 87745, 'loss/train': 0.3858369290828705} 11/07/2021 09:32:46 - INFO - __main__ - Step 87747: {'lr': 0.00018831081383602512, 'samples': 16847424, 'steps': 87746, 'loss/train': 1.3679509162902832} 11/07/2021 09:32:46 - INFO - __main__ - Step 87748: {'lr': 0.00018830567120363043, 'samples': 16847616, 'steps': 87747, 'loss/train': 1.4439362287521362} 11/07/2021 09:32:47 - INFO - __main__ - Step 87749: {'lr': 0.00018830052859903425, 'samples': 16847808, 'steps': 87748, 'loss/train': 1.4923193454742432} 11/07/2021 09:32:47 - INFO - __main__ - Step 87750: {'lr': 0.00018829538602223883, 'samples': 16848000, 'steps': 87749, 'loss/train': 1.6442152261734009} 11/07/2021 09:32:48 - INFO - __main__ - Step 87751: {'lr': 0.0001882902434732466, 'samples': 16848192, 'steps': 87750, 'loss/train': 1.1463907957077026} 11/07/2021 09:32:48 - INFO - __main__ - Step 87752: {'lr': 0.00018828510095205987, 'samples': 16848384, 'steps': 87751, 'loss/train': 0.5553497076034546} 11/07/2021 09:32:48 - INFO - __main__ - Step 87753: {'lr': 0.00018827995845868088, 'samples': 16848576, 'steps': 87752, 'loss/train': 1.7392834424972534} 11/07/2021 09:32:49 - INFO - __main__ - Step 87754: {'lr': 0.00018827481599311197, 'samples': 16848768, 'steps': 87753, 'loss/train': 1.2589879035949707} 11/07/2021 09:32:50 - INFO - __main__ - Step 87755: {'lr': 0.0001882696735553555, 'samples': 16848960, 'steps': 87754, 'loss/train': 1.6138850450515747} 11/07/2021 09:32:50 - INFO - __main__ - Step 87756: {'lr': 0.00018826453114541378, 'samples': 16849152, 'steps': 87755, 'loss/train': 1.3130125999450684} 11/07/2021 09:32:50 - INFO - __main__ - Step 87757: {'lr': 0.00018825938876328909, 'samples': 16849344, 'steps': 87756, 'loss/train': 1.4095054864883423} 11/07/2021 09:32:51 - INFO - __main__ - Step 87758: {'lr': 0.00018825424640898374, 'samples': 16849536, 'steps': 87757, 'loss/train': 1.532701015472412} 11/07/2021 09:32:51 - INFO - __main__ - Step 87759: {'lr': 0.00018824910408250022, 'samples': 16849728, 'steps': 87758, 'loss/train': 1.571062684059143} 11/07/2021 09:32:52 - INFO - __main__ - Step 87760: {'lr': 0.0001882439617838406, 'samples': 16849920, 'steps': 87759, 'loss/train': 1.1460992097854614} 11/07/2021 09:32:53 - INFO - __main__ - Step 87761: {'lr': 0.00018823881951300728, 'samples': 16850112, 'steps': 87760, 'loss/train': 1.8059266805648804} 11/07/2021 09:32:53 - INFO - __main__ - Step 87762: {'lr': 0.00018823367727000258, 'samples': 16850304, 'steps': 87761, 'loss/train': 1.5288937091827393} 11/07/2021 09:32:53 - INFO - __main__ - Step 87763: {'lr': 0.00018822853505482887, 'samples': 16850496, 'steps': 87762, 'loss/train': 1.0512067079544067} 11/07/2021 09:32:54 - INFO - __main__ - Step 87764: {'lr': 0.0001882233928674884, 'samples': 16850688, 'steps': 87763, 'loss/train': 1.4501440525054932} 11/07/2021 09:32:55 - INFO - __main__ - Step 87765: {'lr': 0.00018821825070798354, 'samples': 16850880, 'steps': 87764, 'loss/train': 1.18625807762146} 11/07/2021 09:32:55 - INFO - __main__ - Step 87766: {'lr': 0.00018821310857631654, 'samples': 16851072, 'steps': 87765, 'loss/train': 1.4162302017211914} 11/07/2021 09:32:55 - INFO - __main__ - Step 87767: {'lr': 0.00018820796647248982, 'samples': 16851264, 'steps': 87766, 'loss/train': 0.8942852020263672} 11/07/2021 09:32:56 - INFO - __main__ - Step 87768: {'lr': 0.00018820282439650557, 'samples': 16851456, 'steps': 87767, 'loss/train': 1.532508134841919} 11/07/2021 09:32:56 - INFO - __main__ - Step 87769: {'lr': 0.0001881976823483662, 'samples': 16851648, 'steps': 87768, 'loss/train': 0.9851499795913696} 11/07/2021 09:32:57 - INFO - __main__ - Step 87770: {'lr': 0.00018819254032807403, 'samples': 16851840, 'steps': 87769, 'loss/train': 1.4106491804122925} 11/07/2021 09:32:57 - INFO - __main__ - Step 87771: {'lr': 0.0001881873983356313, 'samples': 16852032, 'steps': 87770, 'loss/train': 1.5751110315322876} 11/07/2021 09:32:58 - INFO - __main__ - Step 87772: {'lr': 0.00018818225637104053, 'samples': 16852224, 'steps': 87771, 'loss/train': 1.4917423725128174} 11/07/2021 09:32:58 - INFO - __main__ - Step 87773: {'lr': 0.00018817711443430373, 'samples': 16852416, 'steps': 87772, 'loss/train': 1.1657692193984985} 11/07/2021 09:32:59 - INFO - __main__ - Step 87774: {'lr': 0.0001881719725254234, 'samples': 16852608, 'steps': 87773, 'loss/train': 0.9868589639663696} 11/07/2021 09:32:59 - INFO - __main__ - Step 87775: {'lr': 0.0001881668306444018, 'samples': 16852800, 'steps': 87774, 'loss/train': 1.161162257194519} 11/07/2021 09:33:00 - INFO - __main__ - Step 87776: {'lr': 0.0001881616887912413, 'samples': 16852992, 'steps': 87775, 'loss/train': 1.3496383428573608} 11/07/2021 09:33:00 - INFO - __main__ - Step 87777: {'lr': 0.00018815654696594417, 'samples': 16853184, 'steps': 87776, 'loss/train': 0.46518373489379883} 11/07/2021 09:33:01 - INFO - __main__ - Step 87778: {'lr': 0.00018815140516851276, 'samples': 16853376, 'steps': 87777, 'loss/train': 1.4749423265457153} 11/07/2021 09:33:01 - INFO - __main__ - Step 87779: {'lr': 0.00018814626339894936, 'samples': 16853568, 'steps': 87778, 'loss/train': 1.2553659677505493} 11/07/2021 09:33:01 - INFO - __main__ - Step 87780: {'lr': 0.0001881411216572563, 'samples': 16853760, 'steps': 87779, 'loss/train': 1.5558085441589355} 11/07/2021 09:33:02 - INFO - __main__ - Step 87781: {'lr': 0.00018813597994343589, 'samples': 16853952, 'steps': 87780, 'loss/train': 1.213151454925537} 11/07/2021 09:33:03 - INFO - __main__ - Step 87782: {'lr': 0.00018813083825749047, 'samples': 16854144, 'steps': 87781, 'loss/train': 1.5184311866760254} 11/07/2021 09:33:03 - INFO - __main__ - Step 87783: {'lr': 0.00018812569659942233, 'samples': 16854336, 'steps': 87782, 'loss/train': 0.9243889451026917} 11/07/2021 09:33:03 - INFO - __main__ - Step 87784: {'lr': 0.0001881205549692338, 'samples': 16854528, 'steps': 87783, 'loss/train': 1.8641365766525269} 11/07/2021 09:33:04 - INFO - __main__ - Step 87785: {'lr': 0.00018811541336692718, 'samples': 16854720, 'steps': 87784, 'loss/train': 1.4703079462051392} 11/07/2021 09:33:05 - INFO - __main__ - Step 87786: {'lr': 0.0001881102717925049, 'samples': 16854912, 'steps': 87785, 'loss/train': 1.1973031759262085} 11/07/2021 09:33:05 - INFO - __main__ - Step 87787: {'lr': 0.00018810513024596908, 'samples': 16855104, 'steps': 87786, 'loss/train': 1.285437822341919} 11/07/2021 09:33:06 - INFO - __main__ - Step 87788: {'lr': 0.00018809998872732219, 'samples': 16855296, 'steps': 87787, 'loss/train': 1.4679710865020752} 11/07/2021 09:33:06 - INFO - __main__ - Step 87789: {'lr': 0.00018809484723656642, 'samples': 16855488, 'steps': 87788, 'loss/train': 1.4200886487960815} 11/07/2021 09:33:06 - INFO - __main__ - Step 87790: {'lr': 0.00018808970577370416, 'samples': 16855680, 'steps': 87789, 'loss/train': 1.7822043895721436} 11/07/2021 09:33:07 - INFO - __main__ - Step 87791: {'lr': 0.00018808456433873775, 'samples': 16855872, 'steps': 87790, 'loss/train': 1.348160743713379} 11/07/2021 09:33:08 - INFO - __main__ - Step 87792: {'lr': 0.00018807942293166946, 'samples': 16856064, 'steps': 87791, 'loss/train': 1.471801996231079} 11/07/2021 09:33:08 - INFO - __main__ - Step 87793: {'lr': 0.00018807428155250165, 'samples': 16856256, 'steps': 87792, 'loss/train': 1.1892566680908203} 11/07/2021 09:33:09 - INFO - __main__ - Step 87794: {'lr': 0.00018806914020123657, 'samples': 16856448, 'steps': 87793, 'loss/train': 1.747975468635559} 11/07/2021 09:33:09 - INFO - __main__ - Step 87795: {'lr': 0.00018806399887787663, 'samples': 16856640, 'steps': 87794, 'loss/train': 1.0676238536834717} 11/07/2021 09:33:11 - INFO - __main__ - Step 87796: {'lr': 0.00018805885758242408, 'samples': 16856832, 'steps': 87795, 'loss/train': 0.8343858122825623} 11/07/2021 09:33:11 - INFO - __main__ - Step 87797: {'lr': 0.00018805371631488125, 'samples': 16857024, 'steps': 87796, 'loss/train': 0.7484747767448425} 11/07/2021 09:33:11 - INFO - __main__ - Step 87798: {'lr': 0.00018804857507525045, 'samples': 16857216, 'steps': 87797, 'loss/train': 1.4640402793884277} 11/07/2021 09:33:12 - INFO - __main__ - Step 87799: {'lr': 0.00018804343386353412, 'samples': 16857408, 'steps': 87798, 'loss/train': 1.4336146116256714} 11/07/2021 09:33:12 - INFO - __main__ - Step 87800: {'lr': 0.00018803829267973436, 'samples': 16857600, 'steps': 87799, 'loss/train': 0.7933120727539062} 11/07/2021 09:33:12 - INFO - __main__ - Step 87801: {'lr': 0.0001880331515238536, 'samples': 16857792, 'steps': 87800, 'loss/train': 1.3127944469451904} 11/07/2021 09:33:13 - INFO - __main__ - Step 87802: {'lr': 0.00018802801039589413, 'samples': 16857984, 'steps': 87801, 'loss/train': 1.253635287284851} 11/07/2021 09:33:14 - INFO - __main__ - Step 87803: {'lr': 0.00018802286929585826, 'samples': 16858176, 'steps': 87802, 'loss/train': 0.7808305025100708} 11/07/2021 09:33:14 - INFO - __main__ - Step 87804: {'lr': 0.00018801772822374835, 'samples': 16858368, 'steps': 87803, 'loss/train': 0.6419970989227295} 11/07/2021 09:33:14 - INFO - __main__ - Step 87805: {'lr': 0.0001880125871795667, 'samples': 16858560, 'steps': 87804, 'loss/train': 1.202074646949768} 11/07/2021 09:33:15 - INFO - __main__ - Step 87806: {'lr': 0.00018800744616331562, 'samples': 16858752, 'steps': 87805, 'loss/train': 1.6172902584075928} 11/07/2021 09:33:15 - INFO - __main__ - Step 87807: {'lr': 0.00018800230517499743, 'samples': 16858944, 'steps': 87806, 'loss/train': 1.222325325012207} 11/07/2021 09:33:16 - INFO - __main__ - Step 87808: {'lr': 0.00018799716421461442, 'samples': 16859136, 'steps': 87807, 'loss/train': 0.9241860508918762} 11/07/2021 09:33:16 - INFO - __main__ - Step 87809: {'lr': 0.00018799202328216897, 'samples': 16859328, 'steps': 87808, 'loss/train': 1.4091711044311523} 11/07/2021 09:33:17 - INFO - __main__ - Step 87810: {'lr': 0.00018798688237766335, 'samples': 16859520, 'steps': 87809, 'loss/train': 1.3140536546707153} 11/07/2021 09:33:17 - INFO - __main__ - Step 87811: {'lr': 0.00018798174150109988, 'samples': 16859712, 'steps': 87810, 'loss/train': 1.4332561492919922} 11/07/2021 09:33:17 - INFO - __main__ - Step 87812: {'lr': 0.00018797660065248084, 'samples': 16859904, 'steps': 87811, 'loss/train': 1.0146604776382446} 11/07/2021 09:33:18 - INFO - __main__ - Step 87813: {'lr': 0.00018797145983180875, 'samples': 16860096, 'steps': 87812, 'loss/train': 1.6295623779296875} 11/07/2021 09:33:19 - INFO - __main__ - Step 87814: {'lr': 0.00018796631903908562, 'samples': 16860288, 'steps': 87813, 'loss/train': 1.5267689228057861} 11/07/2021 09:33:19 - INFO - __main__ - Step 87815: {'lr': 0.00018796117827431396, 'samples': 16860480, 'steps': 87814, 'loss/train': 1.0978285074234009} 11/07/2021 09:33:20 - INFO - __main__ - Step 87816: {'lr': 0.00018795603753749595, 'samples': 16860672, 'steps': 87815, 'loss/train': 0.8824273347854614} 11/07/2021 09:33:20 - INFO - __main__ - Step 87817: {'lr': 0.00018795089682863405, 'samples': 16860864, 'steps': 87816, 'loss/train': 1.904725193977356} 11/07/2021 09:33:20 - INFO - __main__ - Step 87818: {'lr': 0.00018794575614773052, 'samples': 16861056, 'steps': 87817, 'loss/train': 0.6754577159881592} 11/07/2021 09:33:21 - INFO - __main__ - Step 87819: {'lr': 0.00018794061549478767, 'samples': 16861248, 'steps': 87818, 'loss/train': 1.1750015020370483} 11/07/2021 09:33:22 - INFO - __main__ - Step 87820: {'lr': 0.0001879354748698078, 'samples': 16861440, 'steps': 87819, 'loss/train': 1.2917351722717285} 11/07/2021 09:33:22 - INFO - __main__ - Step 87821: {'lr': 0.00018793033427279328, 'samples': 16861632, 'steps': 87820, 'loss/train': 0.9073134064674377} 11/07/2021 09:33:22 - INFO - __main__ - Step 87822: {'lr': 0.00018792519370374638, 'samples': 16861824, 'steps': 87821, 'loss/train': 1.6612727642059326} 11/07/2021 09:33:23 - INFO - __main__ - Step 87823: {'lr': 0.00018792005316266942, 'samples': 16862016, 'steps': 87822, 'loss/train': 1.0070637464523315} 11/07/2021 09:33:24 - INFO - __main__ - Step 87824: {'lr': 0.00018791491264956472, 'samples': 16862208, 'steps': 87823, 'loss/train': 1.6459850072860718} 11/07/2021 09:33:24 - INFO - __main__ - Step 87825: {'lr': 0.0001879097721644346, 'samples': 16862400, 'steps': 87824, 'loss/train': 1.3299496173858643} 11/07/2021 09:33:25 - INFO - __main__ - Step 87826: {'lr': 0.00018790463170728153, 'samples': 16862592, 'steps': 87825, 'loss/train': 1.1695235967636108} 11/07/2021 09:33:25 - INFO - __main__ - Step 87827: {'lr': 0.00018789949127810755, 'samples': 16862784, 'steps': 87826, 'loss/train': 1.1259844303131104} 11/07/2021 09:33:25 - INFO - __main__ - Step 87828: {'lr': 0.0001878943508769151, 'samples': 16862976, 'steps': 87827, 'loss/train': 0.8912264108657837} 11/07/2021 09:33:26 - INFO - __main__ - Step 87829: {'lr': 0.00018788921050370646, 'samples': 16863168, 'steps': 87828, 'loss/train': 1.548799991607666} 11/07/2021 09:33:27 - INFO - __main__ - Step 87830: {'lr': 0.000187884070158484, 'samples': 16863360, 'steps': 87829, 'loss/train': 3.008601427078247} 11/07/2021 09:33:27 - INFO - __main__ - Step 87831: {'lr': 0.00018787892984125005, 'samples': 16863552, 'steps': 87830, 'loss/train': 1.385835886001587} 11/07/2021 09:33:27 - INFO - __main__ - Step 87832: {'lr': 0.00018787378955200686, 'samples': 16863744, 'steps': 87831, 'loss/train': 1.415402889251709} 11/07/2021 09:33:28 - INFO - __main__ - Step 87833: {'lr': 0.00018786864929075682, 'samples': 16863936, 'steps': 87832, 'loss/train': 1.4869847297668457} 11/07/2021 09:33:29 - INFO - __main__ - Step 87834: {'lr': 0.00018786350905750215, 'samples': 16864128, 'steps': 87833, 'loss/train': 1.4091548919677734} 11/07/2021 09:33:29 - INFO - __main__ - Step 87835: {'lr': 0.00018785836885224527, 'samples': 16864320, 'steps': 87834, 'loss/train': 1.4563347101211548} 11/07/2021 09:33:29 - INFO - __main__ - Step 87836: {'lr': 0.00018785322867498843, 'samples': 16864512, 'steps': 87835, 'loss/train': 0.7875078320503235} 11/07/2021 09:33:30 - INFO - __main__ - Step 87837: {'lr': 0.00018784808852573398, 'samples': 16864704, 'steps': 87836, 'loss/train': 0.8864500522613525} 11/07/2021 09:33:30 - INFO - __main__ - Step 87838: {'lr': 0.0001878429484044842, 'samples': 16864896, 'steps': 87837, 'loss/train': 0.8839327096939087} 11/07/2021 09:33:30 - INFO - __main__ - Step 87839: {'lr': 0.0001878378083112415, 'samples': 16865088, 'steps': 87838, 'loss/train': 0.8917562365531921} 11/07/2021 09:33:32 - INFO - __main__ - Step 87840: {'lr': 0.00018783266824600814, 'samples': 16865280, 'steps': 87839, 'loss/train': 1.2515151500701904} 11/07/2021 09:33:32 - INFO - __main__ - Step 87841: {'lr': 0.00018782752820878634, 'samples': 16865472, 'steps': 87840, 'loss/train': 0.5892181396484375} 11/07/2021 09:33:32 - INFO - __main__ - Step 87842: {'lr': 0.0001878223881995785, 'samples': 16865664, 'steps': 87841, 'loss/train': 1.7736657857894897} 11/07/2021 09:33:33 - INFO - __main__ - Step 87843: {'lr': 0.00018781724821838693, 'samples': 16865856, 'steps': 87842, 'loss/train': 1.1409159898757935} 11/07/2021 09:33:33 - INFO - __main__ - Step 87844: {'lr': 0.00018781210826521397, 'samples': 16866048, 'steps': 87843, 'loss/train': 1.0068129301071167} 11/07/2021 09:33:34 - INFO - __main__ - Step 87845: {'lr': 0.0001878069683400619, 'samples': 16866240, 'steps': 87844, 'loss/train': 1.3026548624038696} 11/07/2021 09:33:34 - INFO - __main__ - Step 87846: {'lr': 0.00018780182844293307, 'samples': 16866432, 'steps': 87845, 'loss/train': 1.4744459390640259} 11/07/2021 09:33:35 - INFO - __main__ - Step 87847: {'lr': 0.00018779668857382977, 'samples': 16866624, 'steps': 87846, 'loss/train': 1.4314076900482178} 11/07/2021 09:33:35 - INFO - __main__ - Step 87848: {'lr': 0.00018779154873275428, 'samples': 16866816, 'steps': 87847, 'loss/train': 1.7207928895950317} 11/07/2021 09:33:35 - INFO - __main__ - Step 87849: {'lr': 0.000187786408919709, 'samples': 16867008, 'steps': 87848, 'loss/train': 1.433586597442627} 11/07/2021 09:33:36 - INFO - __main__ - Step 87850: {'lr': 0.00018778126913469623, 'samples': 16867200, 'steps': 87849, 'loss/train': 1.607603907585144} 11/07/2021 09:33:37 - INFO - __main__ - Step 87851: {'lr': 0.00018777612937771824, 'samples': 16867392, 'steps': 87850, 'loss/train': 0.8190839290618896} 11/07/2021 09:33:37 - INFO - __main__ - Step 87852: {'lr': 0.00018777098964877734, 'samples': 16867584, 'steps': 87851, 'loss/train': 1.4362789392471313} 11/07/2021 09:33:37 - INFO - __main__ - Step 87853: {'lr': 0.00018776584994787594, 'samples': 16867776, 'steps': 87852, 'loss/train': 1.2265790700912476} 11/07/2021 09:33:38 - INFO - __main__ - Step 87854: {'lr': 0.00018776071027501624, 'samples': 16867968, 'steps': 87853, 'loss/train': 1.5666519403457642} 11/07/2021 09:33:39 - INFO - __main__ - Step 87855: {'lr': 0.00018775557063020057, 'samples': 16868160, 'steps': 87854, 'loss/train': 0.5553237795829773} 11/07/2021 09:33:39 - INFO - __main__ - Step 87856: {'lr': 0.0001877504310134313, 'samples': 16868352, 'steps': 87855, 'loss/train': 1.4630253314971924} 11/07/2021 09:33:40 - INFO - __main__ - Step 87857: {'lr': 0.0001877452914247107, 'samples': 16868544, 'steps': 87856, 'loss/train': 1.061275839805603} 11/07/2021 09:33:40 - INFO - __main__ - Step 87858: {'lr': 0.00018774015186404116, 'samples': 16868736, 'steps': 87857, 'loss/train': 0.492290198802948} 11/07/2021 09:33:40 - INFO - __main__ - Step 87859: {'lr': 0.00018773501233142493, 'samples': 16868928, 'steps': 87858, 'loss/train': 1.3913651704788208} 11/07/2021 09:33:41 - INFO - __main__ - Step 87860: {'lr': 0.0001877298728268643, 'samples': 16869120, 'steps': 87859, 'loss/train': 1.4096362590789795} 11/07/2021 09:33:42 - INFO - __main__ - Step 87861: {'lr': 0.00018772473335036175, 'samples': 16869312, 'steps': 87860, 'loss/train': 1.6416784524917603} 11/07/2021 09:33:42 - INFO - __main__ - Step 87862: {'lr': 0.0001877195939019194, 'samples': 16869504, 'steps': 87861, 'loss/train': 1.217336654663086} 11/07/2021 09:33:42 - INFO - __main__ - Step 87863: {'lr': 0.00018771445448153963, 'samples': 16869696, 'steps': 87862, 'loss/train': 1.5484970808029175} 11/07/2021 09:33:43 - INFO - __main__ - Step 87864: {'lr': 0.00018770931508922475, 'samples': 16869888, 'steps': 87863, 'loss/train': 1.0567058324813843} 11/07/2021 09:33:44 - INFO - __main__ - Step 87865: {'lr': 0.00018770417572497712, 'samples': 16870080, 'steps': 87864, 'loss/train': 1.2660424709320068} 11/07/2021 09:33:44 - INFO - __main__ - Step 87866: {'lr': 0.000187699036388799, 'samples': 16870272, 'steps': 87865, 'loss/train': 1.3159905672073364} 11/07/2021 09:33:45 - INFO - __main__ - Step 87867: {'lr': 0.0001876938970806928, 'samples': 16870464, 'steps': 87866, 'loss/train': 0.9641605019569397} 11/07/2021 09:33:45 - INFO - __main__ - Step 87868: {'lr': 0.00018768875780066072, 'samples': 16870656, 'steps': 87867, 'loss/train': 1.061650276184082} 11/07/2021 09:33:45 - INFO - __main__ - Step 87869: {'lr': 0.00018768361854870513, 'samples': 16870848, 'steps': 87868, 'loss/train': 1.4470977783203125} 11/07/2021 09:33:46 - INFO - __main__ - Step 87870: {'lr': 0.00018767847932482832, 'samples': 16871040, 'steps': 87869, 'loss/train': 1.504001259803772} 11/07/2021 09:33:47 - INFO - __main__ - Step 87871: {'lr': 0.00018767334012903265, 'samples': 16871232, 'steps': 87870, 'loss/train': 0.6998240351676941} 11/07/2021 09:33:47 - INFO - __main__ - Step 87872: {'lr': 0.0001876682009613204, 'samples': 16871424, 'steps': 87871, 'loss/train': 1.23470139503479} 11/07/2021 09:33:48 - INFO - __main__ - Step 87873: {'lr': 0.00018766306182169392, 'samples': 16871616, 'steps': 87872, 'loss/train': 1.5414438247680664} 11/07/2021 09:33:48 - INFO - __main__ - Step 87874: {'lr': 0.00018765792271015547, 'samples': 16871808, 'steps': 87873, 'loss/train': 1.4395700693130493} 11/07/2021 09:33:48 - INFO - __main__ - Step 87875: {'lr': 0.0001876527836267074, 'samples': 16872000, 'steps': 87874, 'loss/train': 1.7455379962921143} 11/07/2021 09:33:49 - INFO - __main__ - Step 87876: {'lr': 0.00018764764457135204, 'samples': 16872192, 'steps': 87875, 'loss/train': 1.431614637374878} 11/07/2021 09:33:50 - INFO - __main__ - Step 87877: {'lr': 0.00018764250554409169, 'samples': 16872384, 'steps': 87876, 'loss/train': 1.1837953329086304} 11/07/2021 09:33:50 - INFO - __main__ - Step 87878: {'lr': 0.00018763736654492863, 'samples': 16872576, 'steps': 87877, 'loss/train': 1.9918051958084106} 11/07/2021 09:33:50 - INFO - __main__ - Step 87879: {'lr': 0.0001876322275738652, 'samples': 16872768, 'steps': 87878, 'loss/train': 1.4331538677215576} 11/07/2021 09:33:51 - INFO - __main__ - Step 87880: {'lr': 0.00018762708863090383, 'samples': 16872960, 'steps': 87879, 'loss/train': 1.3028043508529663} 11/07/2021 09:33:51 - INFO - __main__ - Step 87881: {'lr': 0.00018762194971604668, 'samples': 16873152, 'steps': 87880, 'loss/train': 1.3784868717193604} 11/07/2021 09:33:52 - INFO - __main__ - Step 87882: {'lr': 0.0001876168108292961, 'samples': 16873344, 'steps': 87881, 'loss/train': 0.9622950553894043} 11/07/2021 09:33:52 - INFO - __main__ - Step 87883: {'lr': 0.00018761167197065446, 'samples': 16873536, 'steps': 87882, 'loss/train': 1.491939663887024} 11/07/2021 09:33:53 - INFO - __main__ - Step 87884: {'lr': 0.000187606533140124, 'samples': 16873728, 'steps': 87883, 'loss/train': 1.598411202430725} 11/07/2021 09:33:53 - INFO - __main__ - Step 87885: {'lr': 0.00018760139433770707, 'samples': 16873920, 'steps': 87884, 'loss/train': 1.3652981519699097} 11/07/2021 09:33:53 - INFO - __main__ - Step 87886: {'lr': 0.000187596255563406, 'samples': 16874112, 'steps': 87885, 'loss/train': 4.6559062004089355} 11/07/2021 09:33:54 - INFO - __main__ - Step 87887: {'lr': 0.00018759111681722308, 'samples': 16874304, 'steps': 87886, 'loss/train': 1.6468738317489624} 11/07/2021 09:33:55 - INFO - __main__ - Step 87888: {'lr': 0.00018758597809916063, 'samples': 16874496, 'steps': 87887, 'loss/train': 0.9150920510292053} 11/07/2021 09:33:55 - INFO - __main__ - Step 87889: {'lr': 0.000187580839409221, 'samples': 16874688, 'steps': 87888, 'loss/train': 1.4293291568756104} 11/07/2021 09:33:55 - INFO - __main__ - Step 87890: {'lr': 0.00018757570074740644, 'samples': 16874880, 'steps': 87889, 'loss/train': 1.4628359079360962} 11/07/2021 09:33:56 - INFO - __main__ - Step 87891: {'lr': 0.00018757056211371934, 'samples': 16875072, 'steps': 87890, 'loss/train': 1.3947410583496094} 11/07/2021 09:33:57 - INFO - __main__ - Step 87892: {'lr': 0.00018756542350816197, 'samples': 16875264, 'steps': 87891, 'loss/train': 1.1704480648040771} 11/07/2021 09:33:57 - INFO - __main__ - Step 87893: {'lr': 0.00018756028493073675, 'samples': 16875456, 'steps': 87892, 'loss/train': 1.7504558563232422} 11/07/2021 09:33:58 - INFO - __main__ - Step 87894: {'lr': 0.00018755514638144584, 'samples': 16875648, 'steps': 87893, 'loss/train': 1.562890648841858} 11/07/2021 09:33:58 - INFO - __main__ - Step 87895: {'lr': 0.00018755000786029158, 'samples': 16875840, 'steps': 87894, 'loss/train': 1.4754046201705933} 11/07/2021 09:33:58 - INFO - __main__ - Step 87896: {'lr': 0.00018754486936727632, 'samples': 16876032, 'steps': 87895, 'loss/train': 1.2827048301696777} 11/07/2021 09:33:59 - INFO - __main__ - Step 87897: {'lr': 0.00018753973090240243, 'samples': 16876224, 'steps': 87896, 'loss/train': 1.4785747528076172} 11/07/2021 09:34:00 - INFO - __main__ - Step 87898: {'lr': 0.00018753459246567211, 'samples': 16876416, 'steps': 87897, 'loss/train': 0.932418942451477} 11/07/2021 09:34:00 - INFO - __main__ - Step 87899: {'lr': 0.00018752945405708777, 'samples': 16876608, 'steps': 87898, 'loss/train': 1.3341325521469116} 11/07/2021 09:34:00 - INFO - __main__ - Step 87900: {'lr': 0.00018752431567665168, 'samples': 16876800, 'steps': 87899, 'loss/train': 1.6320680379867554} 11/07/2021 09:34:01 - INFO - __main__ - Step 87901: {'lr': 0.0001875191773243662, 'samples': 16876992, 'steps': 87900, 'loss/train': 1.736477017402649} 11/07/2021 09:34:01 - INFO - __main__ - Step 87902: {'lr': 0.00018751403900023355, 'samples': 16877184, 'steps': 87901, 'loss/train': 1.129645824432373} 11/07/2021 09:34:02 - INFO - __main__ - Step 87903: {'lr': 0.00018750890070425618, 'samples': 16877376, 'steps': 87902, 'loss/train': 1.2607990503311157} 11/07/2021 09:34:02 - INFO - __main__ - Step 87904: {'lr': 0.0001875037624364363, 'samples': 16877568, 'steps': 87903, 'loss/train': 1.1890794038772583} 11/07/2021 09:34:03 - INFO - __main__ - Step 87905: {'lr': 0.00018749862419677627, 'samples': 16877760, 'steps': 87904, 'loss/train': 0.4484020471572876} 11/07/2021 09:34:03 - INFO - __main__ - Step 87906: {'lr': 0.0001874934859852784, 'samples': 16877952, 'steps': 87905, 'loss/train': 1.3983943462371826} 11/07/2021 09:34:03 - INFO - __main__ - Step 87907: {'lr': 0.0001874883478019451, 'samples': 16878144, 'steps': 87906, 'loss/train': 1.1953895092010498} 11/07/2021 09:34:05 - INFO - __main__ - Step 87908: {'lr': 0.0001874832096467785, 'samples': 16878336, 'steps': 87907, 'loss/train': 1.458670973777771} 11/07/2021 09:34:05 - INFO - __main__ - Step 87909: {'lr': 0.00018747807151978097, 'samples': 16878528, 'steps': 87908, 'loss/train': 1.2396748065948486} 11/07/2021 09:34:05 - INFO - __main__ - Step 87910: {'lr': 0.00018747293342095484, 'samples': 16878720, 'steps': 87909, 'loss/train': 1.1716442108154297} 11/07/2021 09:34:06 - INFO - __main__ - Step 87911: {'lr': 0.0001874677953503025, 'samples': 16878912, 'steps': 87910, 'loss/train': 0.9899902939796448} 11/07/2021 09:34:06 - INFO - __main__ - Step 87912: {'lr': 0.00018746265730782614, 'samples': 16879104, 'steps': 87911, 'loss/train': 1.4687467813491821} 11/07/2021 09:34:07 - INFO - __main__ - Step 87913: {'lr': 0.00018745751929352816, 'samples': 16879296, 'steps': 87912, 'loss/train': 1.5282539129257202} 11/07/2021 09:34:07 - INFO - __main__ - Step 87914: {'lr': 0.0001874523813074109, 'samples': 16879488, 'steps': 87913, 'loss/train': 1.277895450592041} 11/07/2021 09:34:08 - INFO - __main__ - Step 87915: {'lr': 0.0001874472433494766, 'samples': 16879680, 'steps': 87914, 'loss/train': 1.487195372581482} 11/07/2021 09:34:08 - INFO - __main__ - Step 87916: {'lr': 0.00018744210541972762, 'samples': 16879872, 'steps': 87915, 'loss/train': 1.4489871263504028} 11/07/2021 09:34:08 - INFO - __main__ - Step 87917: {'lr': 0.00018743696751816625, 'samples': 16880064, 'steps': 87916, 'loss/train': 1.1703144311904907} 11/07/2021 09:34:09 - INFO - __main__ - Step 87918: {'lr': 0.00018743182964479481, 'samples': 16880256, 'steps': 87917, 'loss/train': 1.5892096757888794} 11/07/2021 09:34:10 - INFO - __main__ - Step 87919: {'lr': 0.00018742669179961564, 'samples': 16880448, 'steps': 87918, 'loss/train': 1.0178205966949463} 11/07/2021 09:34:10 - INFO - __main__ - Step 87920: {'lr': 0.00018742155398263115, 'samples': 16880640, 'steps': 87919, 'loss/train': 1.6361068487167358} 11/07/2021 09:34:10 - INFO - __main__ - Step 87921: {'lr': 0.00018741641619384342, 'samples': 16880832, 'steps': 87920, 'loss/train': 0.1877385526895523} 11/07/2021 09:34:11 - INFO - __main__ - Step 87922: {'lr': 0.0001874112784332549, 'samples': 16881024, 'steps': 87921, 'loss/train': 1.409961223602295} 11/07/2021 09:34:11 - INFO - __main__ - Step 87923: {'lr': 0.00018740614070086785, 'samples': 16881216, 'steps': 87922, 'loss/train': 0.8897773027420044} 11/07/2021 09:34:12 - INFO - __main__ - Step 87924: {'lr': 0.00018740100299668468, 'samples': 16881408, 'steps': 87923, 'loss/train': 1.5255099534988403} 11/07/2021 09:34:12 - INFO - __main__ - Step 87925: {'lr': 0.00018739586532070762, 'samples': 16881600, 'steps': 87924, 'loss/train': 1.5729408264160156} 11/07/2021 09:34:13 - INFO - __main__ - Step 87926: {'lr': 0.00018739072767293903, 'samples': 16881792, 'steps': 87925, 'loss/train': 2.011559247970581} 11/07/2021 09:34:13 - INFO - __main__ - Step 87927: {'lr': 0.0001873855900533812, 'samples': 16881984, 'steps': 87926, 'loss/train': 1.3267918825149536} 11/07/2021 09:34:14 - INFO - __main__ - Step 87928: {'lr': 0.00018738045246203644, 'samples': 16882176, 'steps': 87927, 'loss/train': 1.384947657585144} 11/07/2021 09:34:15 - INFO - __main__ - Step 87929: {'lr': 0.00018737531489890712, 'samples': 16882368, 'steps': 87928, 'loss/train': 1.6382250785827637} 11/07/2021 09:34:15 - INFO - __main__ - Step 87930: {'lr': 0.00018737017736399552, 'samples': 16882560, 'steps': 87929, 'loss/train': 1.3064460754394531} 11/07/2021 09:34:15 - INFO - __main__ - Step 87931: {'lr': 0.00018736503985730395, 'samples': 16882752, 'steps': 87930, 'loss/train': 1.3575384616851807} 11/07/2021 09:34:16 - INFO - __main__ - Step 87932: {'lr': 0.00018735990237883472, 'samples': 16882944, 'steps': 87931, 'loss/train': 1.468267560005188} 11/07/2021 09:34:16 - INFO - __main__ - Step 87933: {'lr': 0.0001873547649285901, 'samples': 16883136, 'steps': 87932, 'loss/train': 0.8479742407798767} 11/07/2021 09:34:17 - INFO - __main__ - Step 87934: {'lr': 0.00018734962750657265, 'samples': 16883328, 'steps': 87933, 'loss/train': 1.1261330842971802} 11/07/2021 09:34:18 - INFO - __main__ - Step 87935: {'lr': 0.00018734449011278433, 'samples': 16883520, 'steps': 87934, 'loss/train': 1.4322377443313599} 11/07/2021 09:34:18 - INFO - __main__ - Step 87936: {'lr': 0.00018733935274722763, 'samples': 16883712, 'steps': 87935, 'loss/train': 1.7349321842193604} 11/07/2021 09:34:18 - INFO - __main__ - Step 87937: {'lr': 0.00018733421540990483, 'samples': 16883904, 'steps': 87936, 'loss/train': 1.735681176185608} 11/07/2021 09:34:19 - INFO - __main__ - Step 87938: {'lr': 0.0001873290781008183, 'samples': 16884096, 'steps': 87937, 'loss/train': 1.495482325553894} 11/07/2021 09:34:19 - INFO - __main__ - Step 87939: {'lr': 0.00018732394081997028, 'samples': 16884288, 'steps': 87938, 'loss/train': 1.4238555431365967} 11/07/2021 09:34:20 - INFO - __main__ - Step 87940: {'lr': 0.00018731880356736313, 'samples': 16884480, 'steps': 87939, 'loss/train': 1.2369788885116577} 11/07/2021 09:34:21 - INFO - __main__ - Step 87941: {'lr': 0.0001873136663429992, 'samples': 16884672, 'steps': 87940, 'loss/train': 0.7520869374275208} 11/07/2021 09:34:21 - INFO - __main__ - Step 87942: {'lr': 0.00018730852914688073, 'samples': 16884864, 'steps': 87941, 'loss/train': 0.1579580456018448} 11/07/2021 09:34:21 - INFO - __main__ - Step 87943: {'lr': 0.0001873033919790101, 'samples': 16885056, 'steps': 87942, 'loss/train': 1.4128319025039673} 11/07/2021 09:34:22 - INFO - __main__ - Step 87944: {'lr': 0.00018729825483938955, 'samples': 16885248, 'steps': 87943, 'loss/train': 1.7647367715835571} 11/07/2021 09:34:22 - INFO - __main__ - Step 87945: {'lr': 0.0001872931177280215, 'samples': 16885440, 'steps': 87944, 'loss/train': 0.9012026786804199} 11/07/2021 09:34:23 - INFO - __main__ - Step 87946: {'lr': 0.00018728798064490814, 'samples': 16885632, 'steps': 87945, 'loss/train': 0.9893048405647278} 11/07/2021 09:34:23 - INFO - __main__ - Step 87947: {'lr': 0.00018728284359005202, 'samples': 16885824, 'steps': 87946, 'loss/train': 1.4195542335510254} 11/07/2021 09:34:24 - INFO - __main__ - Step 87948: {'lr': 0.00018727770656345514, 'samples': 16886016, 'steps': 87947, 'loss/train': 1.3932969570159912} 11/07/2021 09:34:24 - INFO - __main__ - Step 87949: {'lr': 0.00018727256956511997, 'samples': 16886208, 'steps': 87948, 'loss/train': 0.7875041961669922} 11/07/2021 09:34:24 - INFO - __main__ - Step 87950: {'lr': 0.00018726743259504878, 'samples': 16886400, 'steps': 87949, 'loss/train': 0.9770529270172119} 11/07/2021 09:34:25 - INFO - __main__ - Step 87951: {'lr': 0.00018726229565324394, 'samples': 16886592, 'steps': 87950, 'loss/train': 1.1890698671340942} 11/07/2021 09:34:26 - INFO - __main__ - Step 87952: {'lr': 0.0001872571587397077, 'samples': 16886784, 'steps': 87951, 'loss/train': 1.2874151468276978} 11/07/2021 09:34:26 - INFO - __main__ - Step 87953: {'lr': 0.0001872520218544425, 'samples': 16886976, 'steps': 87952, 'loss/train': 1.0525717735290527} 11/07/2021 09:34:27 - INFO - __main__ - Step 87954: {'lr': 0.00018724688499745053, 'samples': 16887168, 'steps': 87953, 'loss/train': 1.4873241186141968} 11/07/2021 09:34:27 - INFO - __main__ - Step 87955: {'lr': 0.00018724174816873412, 'samples': 16887360, 'steps': 87954, 'loss/train': 1.254885196685791} 11/07/2021 09:34:28 - INFO - __main__ - Step 87956: {'lr': 0.0001872366113682956, 'samples': 16887552, 'steps': 87955, 'loss/train': 1.5740936994552612} 11/07/2021 09:34:28 - INFO - __main__ - Step 87957: {'lr': 0.00018723147459613737, 'samples': 16887744, 'steps': 87956, 'loss/train': 1.2626274824142456} 11/07/2021 09:34:29 - INFO - __main__ - Step 87958: {'lr': 0.00018722633785226163, 'samples': 16887936, 'steps': 87957, 'loss/train': 1.520585298538208} 11/07/2021 09:34:29 - INFO - __main__ - Step 87959: {'lr': 0.00018722120113667072, 'samples': 16888128, 'steps': 87958, 'loss/train': 1.4827736616134644} 11/07/2021 09:34:29 - INFO - __main__ - Step 87960: {'lr': 0.00018721606444936696, 'samples': 16888320, 'steps': 87959, 'loss/train': 1.0799261331558228} 11/07/2021 09:34:30 - INFO - __main__ - Step 87961: {'lr': 0.0001872109277903528, 'samples': 16888512, 'steps': 87960, 'loss/train': 1.294146180152893} 11/07/2021 09:34:31 - INFO - __main__ - Step 87962: {'lr': 0.00018720579115963032, 'samples': 16888704, 'steps': 87961, 'loss/train': 1.0122416019439697} 11/07/2021 09:34:31 - INFO - __main__ - Step 87963: {'lr': 0.00018720065455720192, 'samples': 16888896, 'steps': 87962, 'loss/train': 1.2447988986968994} 11/07/2021 09:34:31 - INFO - __main__ - Step 87964: {'lr': 0.00018719551798306996, 'samples': 16889088, 'steps': 87963, 'loss/train': 1.4913957118988037} 11/07/2021 09:34:32 - INFO - __main__ - Step 87965: {'lr': 0.00018719038143723672, 'samples': 16889280, 'steps': 87964, 'loss/train': 1.6011223793029785} 11/07/2021 09:34:33 - INFO - __main__ - Step 87966: {'lr': 0.00018718524491970453, 'samples': 16889472, 'steps': 87965, 'loss/train': 1.3660558462142944} 11/07/2021 09:34:33 - INFO - __main__ - Step 87967: {'lr': 0.0001871801084304757, 'samples': 16889664, 'steps': 87966, 'loss/train': 1.4210538864135742} 11/07/2021 09:34:34 - INFO - __main__ - Step 87968: {'lr': 0.00018717497196955255, 'samples': 16889856, 'steps': 87967, 'loss/train': 1.286057949066162} 11/07/2021 09:34:34 - INFO - __main__ - Step 87969: {'lr': 0.00018716983553693738, 'samples': 16890048, 'steps': 87968, 'loss/train': 1.3205233812332153} 11/07/2021 09:34:34 - INFO - __main__ - Step 87970: {'lr': 0.0001871646991326325, 'samples': 16890240, 'steps': 87969, 'loss/train': 1.287121295928955} 11/07/2021 09:34:35 - INFO - __main__ - Step 87971: {'lr': 0.00018715956275664025, 'samples': 16890432, 'steps': 87970, 'loss/train': 1.2404171228408813} 11/07/2021 09:34:36 - INFO - __main__ - Step 87972: {'lr': 0.00018715442640896294, 'samples': 16890624, 'steps': 87971, 'loss/train': 0.6343061923980713} 11/07/2021 09:34:36 - INFO - __main__ - Step 87973: {'lr': 0.0001871492900896029, 'samples': 16890816, 'steps': 87972, 'loss/train': 1.607301115989685} 11/07/2021 09:34:36 - INFO - __main__ - Step 87974: {'lr': 0.00018714415379856244, 'samples': 16891008, 'steps': 87973, 'loss/train': 1.0632059574127197} 11/07/2021 09:34:37 - INFO - __main__ - Step 87975: {'lr': 0.0001871390175358438, 'samples': 16891200, 'steps': 87974, 'loss/train': 1.2399033308029175} 11/07/2021 09:34:37 - INFO - __main__ - Step 87976: {'lr': 0.00018713388130144938, 'samples': 16891392, 'steps': 87975, 'loss/train': 1.5712087154388428} 11/07/2021 09:34:38 - INFO - __main__ - Step 87977: {'lr': 0.00018712874509538142, 'samples': 16891584, 'steps': 87976, 'loss/train': 1.0725622177124023} 11/07/2021 09:34:38 - INFO - __main__ - Step 87978: {'lr': 0.0001871236089176423, 'samples': 16891776, 'steps': 87977, 'loss/train': 1.2281641960144043} 11/07/2021 09:34:39 - INFO - __main__ - Step 87979: {'lr': 0.0001871184727682343, 'samples': 16891968, 'steps': 87978, 'loss/train': 1.4468761682510376} 11/07/2021 09:34:39 - INFO - __main__ - Step 87980: {'lr': 0.00018711333664715973, 'samples': 16892160, 'steps': 87979, 'loss/train': 1.502597451210022} 11/07/2021 09:34:39 - INFO - __main__ - Step 87981: {'lr': 0.00018710820055442093, 'samples': 16892352, 'steps': 87980, 'loss/train': 1.6824121475219727} 11/07/2021 09:34:40 - INFO - __main__ - Step 87982: {'lr': 0.00018710306449002022, 'samples': 16892544, 'steps': 87981, 'loss/train': 1.8196336030960083} 11/07/2021 09:34:41 - INFO - __main__ - Step 87983: {'lr': 0.0001870979284539599, 'samples': 16892736, 'steps': 87982, 'loss/train': 1.4390636682510376} 11/07/2021 09:34:41 - INFO - __main__ - Step 87984: {'lr': 0.0001870927924462423, 'samples': 16892928, 'steps': 87983, 'loss/train': 1.3776823282241821} 11/07/2021 09:34:42 - INFO - __main__ - Step 87985: {'lr': 0.0001870876564668697, 'samples': 16893120, 'steps': 87984, 'loss/train': 1.2076776027679443} 11/07/2021 09:34:42 - INFO - __main__ - Step 87986: {'lr': 0.0001870825205158444, 'samples': 16893312, 'steps': 87985, 'loss/train': 1.1920807361602783} 11/07/2021 09:34:43 - INFO - __main__ - Step 87987: {'lr': 0.0001870773845931688, 'samples': 16893504, 'steps': 87986, 'loss/train': 0.47198936343193054} 11/07/2021 09:34:43 - INFO - __main__ - Step 87988: {'lr': 0.00018707224869884514, 'samples': 16893696, 'steps': 87987, 'loss/train': 1.5680875778198242} 11/07/2021 09:34:44 - INFO - __main__ - Step 87989: {'lr': 0.00018706711283287576, 'samples': 16893888, 'steps': 87988, 'loss/train': 1.458162784576416} 11/07/2021 09:34:44 - INFO - __main__ - Step 87990: {'lr': 0.00018706197699526296, 'samples': 16894080, 'steps': 87989, 'loss/train': 0.9915876388549805} 11/07/2021 09:34:44 - INFO - __main__ - Step 87991: {'lr': 0.000187056841186009, 'samples': 16894272, 'steps': 87990, 'loss/train': 1.1198450326919556} 11/07/2021 09:34:45 - INFO - __main__ - Step 87992: {'lr': 0.00018705170540511635, 'samples': 16894464, 'steps': 87991, 'loss/train': 1.168241024017334} 11/07/2021 09:34:46 - INFO - __main__ - Step 87993: {'lr': 0.00018704656965258718, 'samples': 16894656, 'steps': 87992, 'loss/train': 1.4070868492126465} 11/07/2021 09:34:46 - INFO - __main__ - Step 87994: {'lr': 0.0001870414339284239, 'samples': 16894848, 'steps': 87993, 'loss/train': 1.1451038122177124} 11/07/2021 09:34:47 - INFO - __main__ - Step 87995: {'lr': 0.00018703629823262874, 'samples': 16895040, 'steps': 87994, 'loss/train': 1.3993028402328491} 11/07/2021 09:34:47 - INFO - __main__ - Step 87996: {'lr': 0.00018703116256520405, 'samples': 16895232, 'steps': 87995, 'loss/train': 1.9751033782958984} 11/07/2021 09:34:47 - INFO - __main__ - Step 87997: {'lr': 0.00018702602692615217, 'samples': 16895424, 'steps': 87996, 'loss/train': 1.7654002904891968} 11/07/2021 09:34:48 - INFO - __main__ - Step 87998: {'lr': 0.00018702089131547535, 'samples': 16895616, 'steps': 87997, 'loss/train': 1.1773704290390015} 11/07/2021 09:34:49 - INFO - __main__ - Step 87999: {'lr': 0.00018701575573317597, 'samples': 16895808, 'steps': 87998, 'loss/train': 0.7330653667449951} 11/07/2021 09:34:49 - INFO - __main__ - Step 88000: {'lr': 0.0001870106201792563, 'samples': 16896000, 'steps': 87999, 'loss/train': 1.6253502368927002} 11/07/2021 09:34:49 - INFO - __main__ - Step 88001: {'lr': 0.00018700548465371876, 'samples': 16896192, 'steps': 88000, 'loss/train': 0.6688458919525146} 11/07/2021 09:34:50 - INFO - __main__ - Step 88002: {'lr': 0.0001870003491565655, 'samples': 16896384, 'steps': 88001, 'loss/train': 1.8307381868362427} 11/07/2021 09:34:51 - INFO - __main__ - Step 88003: {'lr': 0.0001869952136877989, 'samples': 16896576, 'steps': 88002, 'loss/train': 1.3291065692901611} 11/07/2021 09:34:51 - INFO - __main__ - Step 88004: {'lr': 0.0001869900782474213, 'samples': 16896768, 'steps': 88003, 'loss/train': 1.4841595888137817} 11/07/2021 09:34:51 - INFO - __main__ - Step 88005: {'lr': 0.00018698494283543498, 'samples': 16896960, 'steps': 88004, 'loss/train': 0.5634127855300903} 11/07/2021 09:34:52 - INFO - __main__ - Step 88006: {'lr': 0.00018697980745184235, 'samples': 16897152, 'steps': 88005, 'loss/train': 1.3011432886123657} 11/07/2021 09:34:52 - INFO - __main__ - Step 88007: {'lr': 0.0001869746720966456, 'samples': 16897344, 'steps': 88006, 'loss/train': 1.9111535549163818} 11/07/2021 09:34:52 - INFO - __main__ - Step 88008: {'lr': 0.00018696953676984704, 'samples': 16897536, 'steps': 88007, 'loss/train': 1.4621607065200806} 11/07/2021 09:34:53 - INFO - __main__ - Step 88009: {'lr': 0.00018696440147144904, 'samples': 16897728, 'steps': 88008, 'loss/train': 1.386916160583496} 11/07/2021 09:34:54 - INFO - __main__ - Step 88010: {'lr': 0.00018695926620145397, 'samples': 16897920, 'steps': 88009, 'loss/train': 1.6270906925201416} 11/07/2021 09:34:54 - INFO - __main__ - Step 88011: {'lr': 0.00018695413095986402, 'samples': 16898112, 'steps': 88010, 'loss/train': 1.5667903423309326} 11/07/2021 09:34:55 - INFO - __main__ - Step 88012: {'lr': 0.00018694899574668163, 'samples': 16898304, 'steps': 88011, 'loss/train': 1.3456902503967285} 11/07/2021 09:34:55 - INFO - __main__ - Step 88013: {'lr': 0.000186943860561909, 'samples': 16898496, 'steps': 88012, 'loss/train': 1.131056785583496} 11/07/2021 09:34:56 - INFO - __main__ - Step 88014: {'lr': 0.00018693872540554858, 'samples': 16898688, 'steps': 88013, 'loss/train': 1.5774327516555786} 11/07/2021 09:34:56 - INFO - __main__ - Step 88015: {'lr': 0.00018693359027760248, 'samples': 16898880, 'steps': 88014, 'loss/train': 1.342150092124939} 11/07/2021 09:34:57 - INFO - __main__ - Step 88016: {'lr': 0.00018692845517807318, 'samples': 16899072, 'steps': 88015, 'loss/train': 1.1593300104141235} 11/07/2021 09:34:57 - INFO - __main__ - Step 88017: {'lr': 0.000186923320106963, 'samples': 16899264, 'steps': 88016, 'loss/train': 1.0505058765411377} 11/07/2021 09:34:57 - INFO - __main__ - Step 88018: {'lr': 0.00018691818506427413, 'samples': 16899456, 'steps': 88017, 'loss/train': 1.2107406854629517} 11/07/2021 09:34:59 - INFO - __main__ - Step 88019: {'lr': 0.00018691305005000898, 'samples': 16899648, 'steps': 88018, 'loss/train': 1.7718323469161987} 11/07/2021 09:34:59 - INFO - __main__ - Step 88020: {'lr': 0.0001869079150641698, 'samples': 16899840, 'steps': 88019, 'loss/train': 1.7157092094421387} 11/07/2021 09:34:59 - INFO - __main__ - Step 88021: {'lr': 0.00018690278010675897, 'samples': 16900032, 'steps': 88020, 'loss/train': 1.778943657875061} 11/07/2021 09:35:00 - INFO - __main__ - Step 88022: {'lr': 0.00018689764517777874, 'samples': 16900224, 'steps': 88021, 'loss/train': 1.264243483543396} 11/07/2021 09:35:00 - INFO - __main__ - Step 88023: {'lr': 0.00018689251027723147, 'samples': 16900416, 'steps': 88022, 'loss/train': 1.3441784381866455} 11/07/2021 09:35:00 - INFO - __main__ - Step 88024: {'lr': 0.00018688737540511945, 'samples': 16900608, 'steps': 88023, 'loss/train': 1.5506011247634888} 11/07/2021 09:35:01 - INFO - __main__ - Step 88025: {'lr': 0.00018688224056144504, 'samples': 16900800, 'steps': 88024, 'loss/train': 1.622322678565979} 11/07/2021 09:35:02 - INFO - __main__ - Step 88026: {'lr': 0.00018687710574621051, 'samples': 16900992, 'steps': 88025, 'loss/train': 1.1509453058242798} 11/07/2021 09:35:02 - INFO - __main__ - Step 88027: {'lr': 0.00018687197095941817, 'samples': 16901184, 'steps': 88026, 'loss/train': 1.630362868309021} 11/07/2021 09:35:02 - INFO - __main__ - Step 88028: {'lr': 0.00018686683620107046, 'samples': 16901376, 'steps': 88027, 'loss/train': 1.5759178400039673} 11/07/2021 09:35:03 - INFO - __main__ - Step 88029: {'lr': 0.00018686170147116945, 'samples': 16901568, 'steps': 88028, 'loss/train': 0.9034333825111389} 11/07/2021 09:35:03 - INFO - __main__ - Step 88030: {'lr': 0.00018685656676971764, 'samples': 16901760, 'steps': 88029, 'loss/train': 1.4866070747375488} 11/07/2021 09:35:04 - INFO - __main__ - Step 88031: {'lr': 0.00018685143209671724, 'samples': 16901952, 'steps': 88030, 'loss/train': 1.6353617906570435} 11/07/2021 09:35:05 - INFO - __main__ - Step 88032: {'lr': 0.00018684629745217057, 'samples': 16902144, 'steps': 88031, 'loss/train': 1.738853096961975} 11/07/2021 09:35:05 - INFO - __main__ - Step 88033: {'lr': 0.00018684116283608005, 'samples': 16902336, 'steps': 88032, 'loss/train': 1.39177405834198} 11/07/2021 09:35:05 - INFO - __main__ - Step 88034: {'lr': 0.00018683602824844792, 'samples': 16902528, 'steps': 88033, 'loss/train': 1.1670440435409546} 11/07/2021 09:35:06 - INFO - __main__ - Step 88035: {'lr': 0.00018683089368927647, 'samples': 16902720, 'steps': 88034, 'loss/train': 0.8537707328796387} 11/07/2021 09:35:07 - INFO - __main__ - Step 88036: {'lr': 0.0001868257591585681, 'samples': 16902912, 'steps': 88035, 'loss/train': 1.6601678133010864} 11/07/2021 09:35:07 - INFO - __main__ - Step 88037: {'lr': 0.000186820624656325, 'samples': 16903104, 'steps': 88036, 'loss/train': 1.330894112586975} 11/07/2021 09:35:07 - INFO - __main__ - Step 88038: {'lr': 0.00018681549018254959, 'samples': 16903296, 'steps': 88037, 'loss/train': 1.6333434581756592} 11/07/2021 09:35:08 - INFO - __main__ - Step 88039: {'lr': 0.00018681035573724414, 'samples': 16903488, 'steps': 88038, 'loss/train': 2.154198408126831} 11/07/2021 09:35:08 - INFO - __main__ - Step 88040: {'lr': 0.00018680522132041098, 'samples': 16903680, 'steps': 88039, 'loss/train': 1.2361993789672852} 11/07/2021 09:35:09 - INFO - __main__ - Step 88041: {'lr': 0.0001868000869320525, 'samples': 16903872, 'steps': 88040, 'loss/train': 1.250984787940979} 11/07/2021 09:35:10 - INFO - __main__ - Step 88042: {'lr': 0.00018679495257217083, 'samples': 16904064, 'steps': 88041, 'loss/train': 1.7124193906784058} 11/07/2021 09:35:10 - INFO - __main__ - Step 88043: {'lr': 0.00018678981824076836, 'samples': 16904256, 'steps': 88042, 'loss/train': 1.5786322355270386} 11/07/2021 09:35:10 - INFO - __main__ - Step 88044: {'lr': 0.00018678468393784744, 'samples': 16904448, 'steps': 88043, 'loss/train': 1.1336004734039307} 11/07/2021 09:35:11 - INFO - __main__ - Step 88045: {'lr': 0.00018677954966341036, 'samples': 16904640, 'steps': 88044, 'loss/train': 1.63825523853302} 11/07/2021 09:35:12 - INFO - __main__ - Step 88046: {'lr': 0.00018677441541745942, 'samples': 16904832, 'steps': 88045, 'loss/train': 1.2067530155181885} 11/07/2021 09:35:12 - INFO - __main__ - Step 88047: {'lr': 0.000186769281199997, 'samples': 16905024, 'steps': 88046, 'loss/train': 1.106475591659546} 11/07/2021 09:35:12 - INFO - __main__ - Step 88048: {'lr': 0.00018676414701102533, 'samples': 16905216, 'steps': 88047, 'loss/train': 1.3136481046676636} 11/07/2021 09:35:13 - INFO - __main__ - Step 88049: {'lr': 0.0001867590128505468, 'samples': 16905408, 'steps': 88048, 'loss/train': 1.358460545539856} 11/07/2021 09:35:13 - INFO - __main__ - Step 88050: {'lr': 0.00018675387871856363, 'samples': 16905600, 'steps': 88049, 'loss/train': 1.6442639827728271} 11/07/2021 09:35:13 - INFO - __main__ - Step 88051: {'lr': 0.00018674874461507824, 'samples': 16905792, 'steps': 88050, 'loss/train': 1.6696079969406128} 11/07/2021 09:35:14 - INFO - __main__ - Step 88052: {'lr': 0.00018674361054009283, 'samples': 16905984, 'steps': 88051, 'loss/train': 0.961223304271698} 11/07/2021 09:35:15 - INFO - __main__ - Step 88053: {'lr': 0.0001867384764936098, 'samples': 16906176, 'steps': 88052, 'loss/train': 1.9392749071121216} 11/07/2021 09:35:15 - INFO - __main__ - Step 88054: {'lr': 0.00018673334247563145, 'samples': 16906368, 'steps': 88053, 'loss/train': 1.4014424085617065} 11/07/2021 09:35:16 - INFO - __main__ - Step 88055: {'lr': 0.00018672820848616019, 'samples': 16906560, 'steps': 88054, 'loss/train': 1.1625239849090576} 11/07/2021 09:35:16 - INFO - __main__ - Step 88056: {'lr': 0.0001867230745251981, 'samples': 16906752, 'steps': 88055, 'loss/train': 1.2675013542175293} 11/07/2021 09:35:17 - INFO - __main__ - Step 88057: {'lr': 0.0001867179405927476, 'samples': 16906944, 'steps': 88056, 'loss/train': 1.157111644744873} 11/07/2021 09:35:18 - INFO - __main__ - Step 88058: {'lr': 0.00018671280668881103, 'samples': 16907136, 'steps': 88057, 'loss/train': 0.9854026436805725} 11/07/2021 09:35:18 - INFO - __main__ - Step 88059: {'lr': 0.0001867076728133907, 'samples': 16907328, 'steps': 88058, 'loss/train': 1.3393185138702393} 11/07/2021 09:35:18 - INFO - __main__ - Step 88060: {'lr': 0.00018670253896648891, 'samples': 16907520, 'steps': 88059, 'loss/train': 0.9134646654129028} 11/07/2021 09:35:19 - INFO - __main__ - Step 88061: {'lr': 0.000186697405148108, 'samples': 16907712, 'steps': 88060, 'loss/train': 0.46542686223983765} 11/07/2021 09:35:20 - INFO - __main__ - Step 88062: {'lr': 0.00018669227135825024, 'samples': 16907904, 'steps': 88061, 'loss/train': 1.6994812488555908} 11/07/2021 09:35:20 - INFO - __main__ - Step 88063: {'lr': 0.00018668713759691796, 'samples': 16908096, 'steps': 88062, 'loss/train': 1.4503580331802368} 11/07/2021 09:35:20 - INFO - __main__ - Step 88064: {'lr': 0.0001866820038641135, 'samples': 16908288, 'steps': 88063, 'loss/train': 1.393004298210144} 11/07/2021 09:35:21 - INFO - __main__ - Step 88065: {'lr': 0.00018667687015983913, 'samples': 16908480, 'steps': 88064, 'loss/train': 1.5160646438598633} 11/07/2021 09:35:21 - INFO - __main__ - Step 88066: {'lr': 0.00018667173648409725, 'samples': 16908672, 'steps': 88065, 'loss/train': 1.3819209337234497} 11/07/2021 09:35:21 - INFO - __main__ - Step 88067: {'lr': 0.00018666660283689002, 'samples': 16908864, 'steps': 88066, 'loss/train': 1.575051188468933} 11/07/2021 09:35:22 - INFO - __main__ - Step 88068: {'lr': 0.00018666146921822, 'samples': 16909056, 'steps': 88067, 'loss/train': 1.2456066608428955} 11/07/2021 09:35:23 - INFO - __main__ - Step 88069: {'lr': 0.0001866563356280892, 'samples': 16909248, 'steps': 88068, 'loss/train': 1.4490442276000977} 11/07/2021 09:35:23 - INFO - __main__ - Step 88070: {'lr': 0.00018665120206650011, 'samples': 16909440, 'steps': 88069, 'loss/train': 1.3393996953964233} 11/07/2021 09:35:23 - INFO - __main__ - Step 88071: {'lr': 0.000186646068533455, 'samples': 16909632, 'steps': 88070, 'loss/train': 1.6485015153884888} 11/07/2021 09:35:24 - INFO - __main__ - Step 88072: {'lr': 0.00018664093502895621, 'samples': 16909824, 'steps': 88071, 'loss/train': 2.031014919281006} 11/07/2021 09:35:25 - INFO - __main__ - Step 88073: {'lr': 0.00018663580155300603, 'samples': 16910016, 'steps': 88072, 'loss/train': 1.3441766500473022} 11/07/2021 09:35:25 - INFO - __main__ - Step 88074: {'lr': 0.00018663066810560675, 'samples': 16910208, 'steps': 88073, 'loss/train': 1.6247608661651611} 11/07/2021 09:35:26 - INFO - __main__ - Step 88075: {'lr': 0.00018662553468676074, 'samples': 16910400, 'steps': 88074, 'loss/train': 3.943561315536499} 11/07/2021 09:35:26 - INFO - __main__ - Step 88076: {'lr': 0.00018662040129647028, 'samples': 16910592, 'steps': 88075, 'loss/train': 1.558699369430542} 11/07/2021 09:35:26 - INFO - __main__ - Step 88077: {'lr': 0.0001866152679347377, 'samples': 16910784, 'steps': 88076, 'loss/train': 1.9505057334899902} 11/07/2021 09:35:28 - INFO - __main__ - Step 88078: {'lr': 0.00018661013460156528, 'samples': 16910976, 'steps': 88077, 'loss/train': 1.4088339805603027} 11/07/2021 09:35:28 - INFO - __main__ - Step 88079: {'lr': 0.0001866050012969554, 'samples': 16911168, 'steps': 88078, 'loss/train': 1.117002010345459} 11/07/2021 09:35:28 - INFO - __main__ - Step 88080: {'lr': 0.00018659986802091027, 'samples': 16911360, 'steps': 88079, 'loss/train': 1.4035043716430664} 11/07/2021 09:35:29 - INFO - __main__ - Step 88081: {'lr': 0.0001865947347734323, 'samples': 16911552, 'steps': 88080, 'loss/train': 1.099597454071045} 11/07/2021 09:35:29 - INFO - __main__ - Step 88082: {'lr': 0.00018658960155452386, 'samples': 16911744, 'steps': 88081, 'loss/train': 5.7371954917907715} 11/07/2021 09:35:29 - INFO - __main__ - Step 88083: {'lr': 0.0001865844683641871, 'samples': 16911936, 'steps': 88082, 'loss/train': 1.161909818649292} 11/07/2021 09:35:30 - INFO - __main__ - Step 88084: {'lr': 0.0001865793352024243, 'samples': 16912128, 'steps': 88083, 'loss/train': 1.5680615901947021} 11/07/2021 09:35:31 - INFO - __main__ - Step 88085: {'lr': 0.00018657420206923795, 'samples': 16912320, 'steps': 88084, 'loss/train': 0.4239472448825836} 11/07/2021 09:35:31 - INFO - __main__ - Step 88086: {'lr': 0.00018656906896463027, 'samples': 16912512, 'steps': 88085, 'loss/train': 1.3134866952896118} 11/07/2021 09:35:31 - INFO - __main__ - Step 88087: {'lr': 0.0001865639358886036, 'samples': 16912704, 'steps': 88086, 'loss/train': 0.5841352939605713} 11/07/2021 09:35:32 - INFO - __main__ - Step 88088: {'lr': 0.00018655880284116022, 'samples': 16912896, 'steps': 88087, 'loss/train': 1.5364383459091187} 11/07/2021 09:35:33 - INFO - __main__ - Step 88089: {'lr': 0.0001865536698223025, 'samples': 16913088, 'steps': 88088, 'loss/train': 0.5700833201408386} 11/07/2021 09:35:33 - INFO - __main__ - Step 88090: {'lr': 0.00018654853683203266, 'samples': 16913280, 'steps': 88089, 'loss/train': 1.4209216833114624} 11/07/2021 09:35:34 - INFO - __main__ - Step 88091: {'lr': 0.0001865434038703531, 'samples': 16913472, 'steps': 88090, 'loss/train': 1.4906522035598755} 11/07/2021 09:35:34 - INFO - __main__ - Step 88092: {'lr': 0.00018653827093726612, 'samples': 16913664, 'steps': 88091, 'loss/train': 1.4708671569824219} 11/07/2021 09:35:35 - INFO - __main__ - Step 88093: {'lr': 0.000186533138032774, 'samples': 16913856, 'steps': 88092, 'loss/train': 0.5311657190322876} 11/07/2021 09:35:35 - INFO - __main__ - Step 88094: {'lr': 0.00018652800515687906, 'samples': 16914048, 'steps': 88093, 'loss/train': 0.8303433656692505} 11/07/2021 09:35:36 - INFO - __main__ - Step 88095: {'lr': 0.00018652287230958372, 'samples': 16914240, 'steps': 88094, 'loss/train': 1.6002453565597534} 11/07/2021 09:35:36 - INFO - __main__ - Step 88096: {'lr': 0.00018651773949089013, 'samples': 16914432, 'steps': 88095, 'loss/train': 1.4827672243118286} 11/07/2021 09:35:37 - INFO - __main__ - Step 88097: {'lr': 0.0001865126067008006, 'samples': 16914624, 'steps': 88096, 'loss/train': 1.4760290384292603} 11/07/2021 09:35:37 - INFO - __main__ - Step 88098: {'lr': 0.00018650747393931754, 'samples': 16914816, 'steps': 88097, 'loss/train': 1.4506181478500366} 11/07/2021 09:35:37 - INFO - __main__ - Step 88099: {'lr': 0.00018650234120644326, 'samples': 16915008, 'steps': 88098, 'loss/train': 1.252097725868225} 11/07/2021 09:35:38 - INFO - __main__ - Step 88100: {'lr': 0.00018649720850218005, 'samples': 16915200, 'steps': 88099, 'loss/train': 1.3661096096038818} 11/07/2021 09:35:39 - INFO - __main__ - Step 88101: {'lr': 0.00018649207582653018, 'samples': 16915392, 'steps': 88100, 'loss/train': 1.7645800113677979} 11/07/2021 09:35:39 - INFO - __main__ - Step 88102: {'lr': 0.00018648694317949601, 'samples': 16915584, 'steps': 88101, 'loss/train': 1.287412405014038} 11/07/2021 09:35:39 - INFO - __main__ - Step 88103: {'lr': 0.00018648181056107988, 'samples': 16915776, 'steps': 88102, 'loss/train': 1.6537675857543945} 11/07/2021 09:35:40 - INFO - __main__ - Step 88104: {'lr': 0.00018647667797128405, 'samples': 16915968, 'steps': 88103, 'loss/train': 1.1254267692565918} 11/07/2021 09:35:41 - INFO - __main__ - Step 88105: {'lr': 0.0001864715454101108, 'samples': 16916160, 'steps': 88104, 'loss/train': 1.357173204421997} 11/07/2021 09:35:41 - INFO - __main__ - Step 88106: {'lr': 0.00018646641287756253, 'samples': 16916352, 'steps': 88105, 'loss/train': 1.192034125328064} 11/07/2021 09:35:41 - INFO - __main__ - Step 88107: {'lr': 0.00018646128037364153, 'samples': 16916544, 'steps': 88106, 'loss/train': 1.223863959312439} 11/07/2021 09:35:42 - INFO - __main__ - Step 88108: {'lr': 0.0001864561478983501, 'samples': 16916736, 'steps': 88107, 'loss/train': 1.8913249969482422} 11/07/2021 09:35:42 - INFO - __main__ - Step 88109: {'lr': 0.00018645101545169057, 'samples': 16916928, 'steps': 88108, 'loss/train': 1.5134438276290894} 11/07/2021 09:35:43 - INFO - __main__ - Step 88110: {'lr': 0.0001864458830336652, 'samples': 16917120, 'steps': 88109, 'loss/train': 1.5373759269714355} 11/07/2021 09:35:43 - INFO - __main__ - Step 88111: {'lr': 0.00018644075064427632, 'samples': 16917312, 'steps': 88110, 'loss/train': 1.1858246326446533} 11/07/2021 09:35:44 - INFO - __main__ - Step 88112: {'lr': 0.00018643561828352625, 'samples': 16917504, 'steps': 88111, 'loss/train': 1.5298693180084229} 11/07/2021 09:35:44 - INFO - __main__ - Step 88113: {'lr': 0.0001864304859514173, 'samples': 16917696, 'steps': 88112, 'loss/train': 1.4796266555786133} 11/07/2021 09:35:44 - INFO - __main__ - Step 88114: {'lr': 0.00018642535364795182, 'samples': 16917888, 'steps': 88113, 'loss/train': 1.4872804880142212} 11/07/2021 09:35:46 - INFO - __main__ - Step 88115: {'lr': 0.0001864202213731321, 'samples': 16918080, 'steps': 88114, 'loss/train': 1.372959852218628} 11/07/2021 09:35:46 - INFO - __main__ - Step 88116: {'lr': 0.00018641508912696042, 'samples': 16918272, 'steps': 88115, 'loss/train': 1.2347853183746338} 11/07/2021 09:35:46 - INFO - __main__ - Step 88117: {'lr': 0.00018640995690943915, 'samples': 16918464, 'steps': 88116, 'loss/train': 1.612394094467163} 11/07/2021 09:35:47 - INFO - __main__ - Step 88118: {'lr': 0.00018640482472057058, 'samples': 16918656, 'steps': 88117, 'loss/train': 2.9703049659729004} 11/07/2021 09:35:47 - INFO - __main__ - Step 88119: {'lr': 0.00018639969256035703, 'samples': 16918848, 'steps': 88118, 'loss/train': 0.9866267442703247} 11/07/2021 09:35:47 - INFO - __main__ - Step 88120: {'lr': 0.00018639456042880077, 'samples': 16919040, 'steps': 88119, 'loss/train': 1.19212806224823} 11/07/2021 09:35:48 - INFO - __main__ - Step 88121: {'lr': 0.00018638942832590412, 'samples': 16919232, 'steps': 88120, 'loss/train': 1.7177748680114746} 11/07/2021 09:35:49 - INFO - __main__ - Step 88122: {'lr': 0.00018638429625166946, 'samples': 16919424, 'steps': 88121, 'loss/train': 0.7001370787620544} 11/07/2021 09:35:49 - INFO - __main__ - Step 88123: {'lr': 0.00018637916420609902, 'samples': 16919616, 'steps': 88122, 'loss/train': 1.1604875326156616} 11/07/2021 09:35:49 - INFO - __main__ - Step 88124: {'lr': 0.00018637403218919513, 'samples': 16919808, 'steps': 88123, 'loss/train': 1.332196831703186} 11/07/2021 09:35:50 - INFO - __main__ - Step 88125: {'lr': 0.00018636890020096012, 'samples': 16920000, 'steps': 88124, 'loss/train': 1.675052285194397} 11/07/2021 09:35:51 - INFO - __main__ - Step 88126: {'lr': 0.0001863637682413963, 'samples': 16920192, 'steps': 88125, 'loss/train': 1.7764427661895752} 11/07/2021 09:35:51 - INFO - __main__ - Step 88127: {'lr': 0.00018635863631050602, 'samples': 16920384, 'steps': 88126, 'loss/train': 0.8689357042312622} 11/07/2021 09:35:52 - INFO - __main__ - Step 88128: {'lr': 0.00018635350440829153, 'samples': 16920576, 'steps': 88127, 'loss/train': 1.6450769901275635} 11/07/2021 09:35:52 - INFO - __main__ - Step 88129: {'lr': 0.00018634837253475519, 'samples': 16920768, 'steps': 88128, 'loss/train': 1.1617369651794434} 11/07/2021 09:35:52 - INFO - __main__ - Step 88130: {'lr': 0.00018634324068989927, 'samples': 16920960, 'steps': 88129, 'loss/train': 1.7185449600219727} 11/07/2021 09:35:53 - INFO - __main__ - Step 88131: {'lr': 0.00018633810887372612, 'samples': 16921152, 'steps': 88130, 'loss/train': 1.1285505294799805} 11/07/2021 09:35:54 - INFO - __main__ - Step 88132: {'lr': 0.00018633297708623803, 'samples': 16921344, 'steps': 88131, 'loss/train': 1.2779901027679443} 11/07/2021 09:35:54 - INFO - __main__ - Step 88133: {'lr': 0.0001863278453274373, 'samples': 16921536, 'steps': 88132, 'loss/train': 1.4978185892105103} 11/07/2021 09:35:54 - INFO - __main__ - Step 88134: {'lr': 0.00018632271359732627, 'samples': 16921728, 'steps': 88133, 'loss/train': 1.7117557525634766} 11/07/2021 09:35:55 - INFO - __main__ - Step 88135: {'lr': 0.0001863175818959073, 'samples': 16921920, 'steps': 88134, 'loss/train': 1.4114608764648438} 11/07/2021 09:35:56 - INFO - __main__ - Step 88136: {'lr': 0.00018631245022318258, 'samples': 16922112, 'steps': 88135, 'loss/train': 1.2719950675964355} 11/07/2021 09:35:56 - INFO - __main__ - Step 88137: {'lr': 0.00018630731857915452, 'samples': 16922304, 'steps': 88136, 'loss/train': 1.1810188293457031} 11/07/2021 09:35:56 - INFO - __main__ - Step 88138: {'lr': 0.00018630218696382534, 'samples': 16922496, 'steps': 88137, 'loss/train': 1.4961462020874023} 11/07/2021 09:35:57 - INFO - __main__ - Step 88139: {'lr': 0.00018629705537719744, 'samples': 16922688, 'steps': 88138, 'loss/train': 1.4625930786132812} 11/07/2021 09:35:57 - INFO - __main__ - Step 88140: {'lr': 0.00018629192381927314, 'samples': 16922880, 'steps': 88139, 'loss/train': 1.767986536026001} 11/07/2021 09:35:58 - INFO - __main__ - Step 88141: {'lr': 0.00018628679229005471, 'samples': 16923072, 'steps': 88140, 'loss/train': 0.9846983551979065} 11/07/2021 09:35:59 - INFO - __main__ - Step 88142: {'lr': 0.00018628166078954445, 'samples': 16923264, 'steps': 88141, 'loss/train': 1.5963736772537231} 11/07/2021 09:35:59 - INFO - __main__ - Step 88143: {'lr': 0.00018627652931774467, 'samples': 16923456, 'steps': 88142, 'loss/train': 1.4917373657226562} 11/07/2021 09:35:59 - INFO - __main__ - Step 88144: {'lr': 0.0001862713978746577, 'samples': 16923648, 'steps': 88143, 'loss/train': 1.7547283172607422} 11/07/2021 09:36:00 - INFO - __main__ - Step 88145: {'lr': 0.0001862662664602859, 'samples': 16923840, 'steps': 88144, 'loss/train': 1.5895973443984985} 11/07/2021 09:36:00 - INFO - __main__ - Step 88146: {'lr': 0.0001862611350746315, 'samples': 16924032, 'steps': 88145, 'loss/train': 1.3898305892944336} 11/07/2021 09:36:01 - INFO - __main__ - Step 88147: {'lr': 0.00018625600371769685, 'samples': 16924224, 'steps': 88146, 'loss/train': 1.591571569442749} 11/07/2021 09:36:01 - INFO - __main__ - Step 88148: {'lr': 0.00018625087238948427, 'samples': 16924416, 'steps': 88147, 'loss/train': 0.7344946265220642} 11/07/2021 09:36:02 - INFO - __main__ - Step 88149: {'lr': 0.0001862457410899961, 'samples': 16924608, 'steps': 88148, 'loss/train': 1.2043718099594116} 11/07/2021 09:36:02 - INFO - __main__ - Step 88150: {'lr': 0.00018624060981923458, 'samples': 16924800, 'steps': 88149, 'loss/train': 0.13860218226909637} 11/07/2021 09:36:02 - INFO - __main__ - Step 88151: {'lr': 0.0001862354785772021, 'samples': 16924992, 'steps': 88150, 'loss/train': 1.529129981994629} 11/07/2021 09:36:04 - INFO - __main__ - Step 88152: {'lr': 0.00018623034736390087, 'samples': 16925184, 'steps': 88151, 'loss/train': 1.5920974016189575} 11/07/2021 09:36:04 - INFO - __main__ - Step 88153: {'lr': 0.0001862252161793333, 'samples': 16925376, 'steps': 88152, 'loss/train': 1.3452132940292358} 11/07/2021 09:36:04 - INFO - __main__ - Step 88154: {'lr': 0.00018622008502350163, 'samples': 16925568, 'steps': 88153, 'loss/train': 1.3690199851989746} 11/07/2021 09:36:05 - INFO - __main__ - Step 88155: {'lr': 0.00018621495389640818, 'samples': 16925760, 'steps': 88154, 'loss/train': 1.6644771099090576} 11/07/2021 09:36:05 - INFO - __main__ - Step 88156: {'lr': 0.00018620982279805533, 'samples': 16925952, 'steps': 88155, 'loss/train': 1.3629716634750366} 11/07/2021 09:36:06 - INFO - __main__ - Step 88157: {'lr': 0.00018620469172844534, 'samples': 16926144, 'steps': 88156, 'loss/train': 1.376997709274292} 11/07/2021 09:36:06 - INFO - __main__ - Step 88158: {'lr': 0.00018619956068758055, 'samples': 16926336, 'steps': 88157, 'loss/train': 1.5781891345977783} 11/07/2021 09:36:07 - INFO - __main__ - Step 88159: {'lr': 0.00018619442967546325, 'samples': 16926528, 'steps': 88158, 'loss/train': 1.5520273447036743} 11/07/2021 09:36:07 - INFO - __main__ - Step 88160: {'lr': 0.00018618929869209573, 'samples': 16926720, 'steps': 88159, 'loss/train': 1.9823249578475952} 11/07/2021 09:36:07 - INFO - __main__ - Step 88161: {'lr': 0.0001861841677374803, 'samples': 16926912, 'steps': 88160, 'loss/train': 1.0564885139465332} 11/07/2021 09:36:08 - INFO - __main__ - Step 88162: {'lr': 0.00018617903681161947, 'samples': 16927104, 'steps': 88161, 'loss/train': 1.8549549579620361} 11/07/2021 09:36:09 - INFO - __main__ - Step 88163: {'lr': 0.00018617390591451526, 'samples': 16927296, 'steps': 88162, 'loss/train': 1.7212756872177124} 11/07/2021 09:36:09 - INFO - __main__ - Step 88164: {'lr': 0.00018616877504617008, 'samples': 16927488, 'steps': 88163, 'loss/train': 0.9554679989814758} 11/07/2021 09:36:09 - INFO - __main__ - Step 88165: {'lr': 0.00018616364420658628, 'samples': 16927680, 'steps': 88164, 'loss/train': 1.655509352684021} 11/07/2021 09:36:10 - INFO - __main__ - Step 88166: {'lr': 0.00018615851339576616, 'samples': 16927872, 'steps': 88165, 'loss/train': 1.3996756076812744} 11/07/2021 09:36:11 - INFO - __main__ - Step 88167: {'lr': 0.000186153382613712, 'samples': 16928064, 'steps': 88166, 'loss/train': 1.7597272396087646} 11/07/2021 09:36:11 - INFO - __main__ - Step 88168: {'lr': 0.00018614825186042617, 'samples': 16928256, 'steps': 88167, 'loss/train': 1.717961072921753} 11/07/2021 09:36:12 - INFO - __main__ - Step 88169: {'lr': 0.00018614312113591095, 'samples': 16928448, 'steps': 88168, 'loss/train': 1.3934000730514526} 11/07/2021 09:36:12 - INFO - __main__ - Step 88170: {'lr': 0.00018613799044016867, 'samples': 16928640, 'steps': 88169, 'loss/train': 1.1974903345108032} 11/07/2021 09:36:12 - INFO - __main__ - Step 88171: {'lr': 0.00018613285977320157, 'samples': 16928832, 'steps': 88170, 'loss/train': 1.3467985391616821} 11/07/2021 09:36:13 - INFO - __main__ - Step 88172: {'lr': 0.00018612772913501207, 'samples': 16929024, 'steps': 88171, 'loss/train': 1.2272762060165405} 11/07/2021 09:36:14 - INFO - __main__ - Step 88173: {'lr': 0.0001861225985256024, 'samples': 16929216, 'steps': 88172, 'loss/train': 1.0407154560089111} 11/07/2021 09:36:14 - INFO - __main__ - Step 88174: {'lr': 0.00018611746794497492, 'samples': 16929408, 'steps': 88173, 'loss/train': 1.4738889932632446} 11/07/2021 09:36:14 - INFO - __main__ - Step 88175: {'lr': 0.0001861123373931319, 'samples': 16929600, 'steps': 88174, 'loss/train': 1.3420188426971436} 11/07/2021 09:36:15 - INFO - __main__ - Step 88176: {'lr': 0.0001861072068700758, 'samples': 16929792, 'steps': 88175, 'loss/train': 1.4451349973678589} 11/07/2021 09:36:15 - INFO - __main__ - Step 88177: {'lr': 0.00018610207637580873, 'samples': 16929984, 'steps': 88176, 'loss/train': 1.8501312732696533} 11/07/2021 09:36:16 - INFO - __main__ - Step 88178: {'lr': 0.00018609694591033301, 'samples': 16930176, 'steps': 88177, 'loss/train': 1.8859143257141113} 11/07/2021 09:36:17 - INFO - __main__ - Step 88179: {'lr': 0.00018609181547365105, 'samples': 16930368, 'steps': 88178, 'loss/train': 1.1714215278625488} 11/07/2021 09:36:17 - INFO - __main__ - Step 88180: {'lr': 0.00018608668506576515, 'samples': 16930560, 'steps': 88179, 'loss/train': 0.36384379863739014} 11/07/2021 09:36:17 - INFO - __main__ - Step 88181: {'lr': 0.00018608155468667758, 'samples': 16930752, 'steps': 88180, 'loss/train': 1.1334426403045654} 11/07/2021 09:36:18 - INFO - __main__ - Step 88182: {'lr': 0.0001860764243363907, 'samples': 16930944, 'steps': 88181, 'loss/train': 1.4392119646072388} 11/07/2021 09:36:19 - INFO - __main__ - Step 88183: {'lr': 0.0001860712940149068, 'samples': 16931136, 'steps': 88182, 'loss/train': 1.2650946378707886} 11/07/2021 09:36:19 - INFO - __main__ - Step 88184: {'lr': 0.0001860661637222282, 'samples': 16931328, 'steps': 88183, 'loss/train': 1.4706286191940308} 11/07/2021 09:36:20 - INFO - __main__ - Step 88185: {'lr': 0.00018606103345835713, 'samples': 16931520, 'steps': 88184, 'loss/train': 0.40343913435935974} 11/07/2021 09:36:20 - INFO - __main__ - Step 88186: {'lr': 0.000186055903223296, 'samples': 16931712, 'steps': 88185, 'loss/train': 1.3804495334625244} 11/07/2021 09:36:20 - INFO - __main__ - Step 88187: {'lr': 0.00018605077301704712, 'samples': 16931904, 'steps': 88186, 'loss/train': 1.477530598640442} 11/07/2021 09:36:21 - INFO - __main__ - Step 88188: {'lr': 0.00018604564283961278, 'samples': 16932096, 'steps': 88187, 'loss/train': 1.7030593156814575} 11/07/2021 09:36:22 - INFO - __main__ - Step 88189: {'lr': 0.00018604051269099537, 'samples': 16932288, 'steps': 88188, 'loss/train': 1.4248064756393433} 11/07/2021 09:36:22 - INFO - __main__ - Step 88190: {'lr': 0.000186035382571197, 'samples': 16932480, 'steps': 88189, 'loss/train': 0.8676984310150146} 11/07/2021 09:36:22 - INFO - __main__ - Step 88191: {'lr': 0.00018603025248022011, 'samples': 16932672, 'steps': 88190, 'loss/train': 1.175535798072815} 11/07/2021 09:36:23 - INFO - __main__ - Step 88192: {'lr': 0.000186025122418067, 'samples': 16932864, 'steps': 88191, 'loss/train': 1.724503755569458} 11/07/2021 09:36:24 - INFO - __main__ - Step 88193: {'lr': 0.00018601999238474003, 'samples': 16933056, 'steps': 88192, 'loss/train': 1.3745150566101074} 11/07/2021 09:36:24 - INFO - __main__ - Step 88194: {'lr': 0.0001860148623802414, 'samples': 16933248, 'steps': 88193, 'loss/train': 1.784989356994629} 11/07/2021 09:36:24 - INFO - __main__ - Step 88195: {'lr': 0.00018600973240457354, 'samples': 16933440, 'steps': 88194, 'loss/train': 1.5718029737472534} 11/07/2021 09:36:25 - INFO - __main__ - Step 88196: {'lr': 0.00018600460245773865, 'samples': 16933632, 'steps': 88195, 'loss/train': 1.0262398719787598} 11/07/2021 09:36:25 - INFO - __main__ - Step 88197: {'lr': 0.00018599947253973914, 'samples': 16933824, 'steps': 88196, 'loss/train': 0.5375341773033142} 11/07/2021 09:36:26 - INFO - __main__ - Step 88198: {'lr': 0.00018599434265057725, 'samples': 16934016, 'steps': 88197, 'loss/train': 1.4161850214004517} 11/07/2021 09:36:27 - INFO - __main__ - Step 88199: {'lr': 0.00018598921279025532, 'samples': 16934208, 'steps': 88198, 'loss/train': 1.231528878211975} 11/07/2021 09:36:27 - INFO - __main__ - Step 88200: {'lr': 0.00018598408295877569, 'samples': 16934400, 'steps': 88199, 'loss/train': 1.6333719491958618} 11/07/2021 09:36:27 - INFO - __main__ - Step 88201: {'lr': 0.00018597895315614066, 'samples': 16934592, 'steps': 88200, 'loss/train': 1.7475694417953491} 11/07/2021 09:36:28 - INFO - __main__ - Step 88202: {'lr': 0.00018597382338235248, 'samples': 16934784, 'steps': 88201, 'loss/train': 0.8347417712211609} 11/07/2021 09:36:29 - INFO - __main__ - Step 88203: {'lr': 0.00018596869363741365, 'samples': 16934976, 'steps': 88202, 'loss/train': 1.3281893730163574} 11/07/2021 09:36:29 - INFO - __main__ - Step 88204: {'lr': 0.0001859635639213262, 'samples': 16935168, 'steps': 88203, 'loss/train': 1.499733805656433} 11/07/2021 09:36:29 - INFO - __main__ - Step 88205: {'lr': 0.0001859584342340926, 'samples': 16935360, 'steps': 88204, 'loss/train': 1.490530014038086} 11/07/2021 09:36:30 - INFO - __main__ - Step 88206: {'lr': 0.00018595330457571514, 'samples': 16935552, 'steps': 88205, 'loss/train': 1.4192851781845093} 11/07/2021 09:36:30 - INFO - __main__ - Step 88207: {'lr': 0.00018594817494619614, 'samples': 16935744, 'steps': 88206, 'loss/train': 1.5150550603866577} 11/07/2021 09:36:30 - INFO - __main__ - Step 88208: {'lr': 0.0001859430453455379, 'samples': 16935936, 'steps': 88207, 'loss/train': 1.4700133800506592} 11/07/2021 09:36:31 - INFO - __main__ - Step 88209: {'lr': 0.0001859379157737427, 'samples': 16936128, 'steps': 88208, 'loss/train': 1.3924092054367065} 11/07/2021 09:36:32 - INFO - __main__ - Step 88210: {'lr': 0.00018593278623081294, 'samples': 16936320, 'steps': 88209, 'loss/train': 1.2465895414352417} 11/07/2021 09:36:32 - INFO - __main__ - Step 88211: {'lr': 0.00018592765671675081, 'samples': 16936512, 'steps': 88210, 'loss/train': 1.5220814943313599} 11/07/2021 09:36:32 - INFO - __main__ - Step 88212: {'lr': 0.00018592252723155877, 'samples': 16936704, 'steps': 88211, 'loss/train': 1.5792231559753418} 11/07/2021 09:36:33 - INFO - __main__ - Step 88213: {'lr': 0.00018591739777523903, 'samples': 16936896, 'steps': 88212, 'loss/train': 1.4015729427337646} 11/07/2021 09:36:34 - INFO - __main__ - Step 88214: {'lr': 0.0001859122683477939, 'samples': 16937088, 'steps': 88213, 'loss/train': 1.5372573137283325} 11/07/2021 09:36:34 - INFO - __main__ - Step 88215: {'lr': 0.0001859071389492257, 'samples': 16937280, 'steps': 88214, 'loss/train': 1.4421321153640747} 11/07/2021 09:36:34 - INFO - __main__ - Step 88216: {'lr': 0.00018590200957953687, 'samples': 16937472, 'steps': 88215, 'loss/train': 1.2391871213912964} 11/07/2021 09:36:35 - INFO - __main__ - Step 88217: {'lr': 0.00018589688023872952, 'samples': 16937664, 'steps': 88216, 'loss/train': 1.5648409128189087} 11/07/2021 09:36:35 - INFO - __main__ - Step 88218: {'lr': 0.00018589175092680605, 'samples': 16937856, 'steps': 88217, 'loss/train': 1.7618424892425537} 11/07/2021 09:36:36 - INFO - __main__ - Step 88219: {'lr': 0.00018588662164376873, 'samples': 16938048, 'steps': 88218, 'loss/train': 1.2554175853729248} 11/07/2021 09:36:37 - INFO - __main__ - Step 88220: {'lr': 0.00018588149238961993, 'samples': 16938240, 'steps': 88219, 'loss/train': 1.7171133756637573} 11/07/2021 09:36:37 - INFO - __main__ - Step 88221: {'lr': 0.00018587636316436197, 'samples': 16938432, 'steps': 88220, 'loss/train': 1.6496535539627075} 11/07/2021 09:36:37 - INFO - __main__ - Step 88222: {'lr': 0.00018587123396799707, 'samples': 16938624, 'steps': 88221, 'loss/train': 0.9548147320747375} 11/07/2021 09:36:38 - INFO - __main__ - Step 88223: {'lr': 0.00018586610480052763, 'samples': 16938816, 'steps': 88222, 'loss/train': 1.3667593002319336} 11/07/2021 09:36:39 - INFO - __main__ - Step 88224: {'lr': 0.00018586097566195594, 'samples': 16939008, 'steps': 88223, 'loss/train': 1.2824925184249878} 11/07/2021 09:36:39 - INFO - __main__ - Step 88225: {'lr': 0.00018585584655228432, 'samples': 16939200, 'steps': 88224, 'loss/train': 1.3513239622116089} 11/07/2021 09:36:39 - INFO - __main__ - Step 88226: {'lr': 0.00018585071747151505, 'samples': 16939392, 'steps': 88225, 'loss/train': 1.0397948026657104} 11/07/2021 09:36:40 - INFO - __main__ - Step 88227: {'lr': 0.00018584558841965043, 'samples': 16939584, 'steps': 88226, 'loss/train': 1.4296798706054688} 11/07/2021 09:36:40 - INFO - __main__ - Step 88228: {'lr': 0.00018584045939669283, 'samples': 16939776, 'steps': 88227, 'loss/train': 1.0193604230880737} 11/07/2021 09:36:41 - INFO - __main__ - Step 88229: {'lr': 0.00018583533040264456, 'samples': 16939968, 'steps': 88228, 'loss/train': 1.0893770456314087} 11/07/2021 09:36:42 - INFO - __main__ - Step 88230: {'lr': 0.00018583020143750795, 'samples': 16940160, 'steps': 88229, 'loss/train': 1.3607714176177979} 11/07/2021 09:36:42 - INFO - __main__ - Step 88231: {'lr': 0.00018582507250128517, 'samples': 16940352, 'steps': 88230, 'loss/train': 1.551334261894226} 11/07/2021 09:36:42 - INFO - __main__ - Step 88232: {'lr': 0.00018581994359397863, 'samples': 16940544, 'steps': 88231, 'loss/train': 0.7590294480323792} 11/07/2021 09:36:43 - INFO - __main__ - Step 88233: {'lr': 0.0001858148147155906, 'samples': 16940736, 'steps': 88232, 'loss/train': 1.4687408208847046} 11/07/2021 09:36:44 - INFO - __main__ - Step 88234: {'lr': 0.00018580968586612347, 'samples': 16940928, 'steps': 88233, 'loss/train': 1.6545984745025635} 11/07/2021 09:36:44 - INFO - __main__ - Step 88235: {'lr': 0.00018580455704557948, 'samples': 16941120, 'steps': 88234, 'loss/train': 1.8932795524597168} 11/07/2021 09:36:44 - INFO - __main__ - Step 88236: {'lr': 0.000185799428253961, 'samples': 16941312, 'steps': 88235, 'loss/train': 1.376892328262329} 11/07/2021 09:36:45 - INFO - __main__ - Step 88237: {'lr': 0.00018579429949127025, 'samples': 16941504, 'steps': 88236, 'loss/train': 1.5583223104476929} 11/07/2021 09:36:45 - INFO - __main__ - Step 88238: {'lr': 0.00018578917075750965, 'samples': 16941696, 'steps': 88237, 'loss/train': 1.4152733087539673} 11/07/2021 09:36:46 - INFO - __main__ - Step 88239: {'lr': 0.00018578404205268144, 'samples': 16941888, 'steps': 88238, 'loss/train': 1.4652553796768188} 11/07/2021 09:36:46 - INFO - __main__ - Step 88240: {'lr': 0.00018577891337678794, 'samples': 16942080, 'steps': 88239, 'loss/train': 1.6811754703521729} 11/07/2021 09:36:47 - INFO - __main__ - Step 88241: {'lr': 0.00018577378472983146, 'samples': 16942272, 'steps': 88240, 'loss/train': 1.3924500942230225} 11/07/2021 09:36:47 - INFO - __main__ - Step 88242: {'lr': 0.00018576865611181443, 'samples': 16942464, 'steps': 88241, 'loss/train': 1.1740809679031372} 11/07/2021 09:36:47 - INFO - __main__ - Step 88243: {'lr': 0.000185763527522739, 'samples': 16942656, 'steps': 88242, 'loss/train': 0.8522266745567322} 11/07/2021 09:36:49 - INFO - __main__ - Step 88244: {'lr': 0.00018575839896260748, 'samples': 16942848, 'steps': 88243, 'loss/train': 0.6548085808753967} 11/07/2021 09:36:49 - INFO - __main__ - Step 88245: {'lr': 0.00018575327043142227, 'samples': 16943040, 'steps': 88244, 'loss/train': 1.5549324750900269} 11/07/2021 09:36:49 - INFO - __main__ - Step 88246: {'lr': 0.0001857481419291856, 'samples': 16943232, 'steps': 88245, 'loss/train': 1.434945821762085} 11/07/2021 09:36:50 - INFO - __main__ - Step 88247: {'lr': 0.00018574301345589987, 'samples': 16943424, 'steps': 88246, 'loss/train': 0.7926427721977234} 11/07/2021 09:36:50 - INFO - __main__ - Step 88248: {'lr': 0.0001857378850115673, 'samples': 16943616, 'steps': 88247, 'loss/train': 1.4545252323150635} 11/07/2021 09:36:51 - INFO - __main__ - Step 88249: {'lr': 0.0001857327565961903, 'samples': 16943808, 'steps': 88248, 'loss/train': 1.5669890642166138} 11/07/2021 09:36:51 - INFO - __main__ - Step 88250: {'lr': 0.00018572762820977107, 'samples': 16944000, 'steps': 88249, 'loss/train': 1.4255130290985107} 11/07/2021 09:36:52 - INFO - __main__ - Step 88251: {'lr': 0.00018572249985231206, 'samples': 16944192, 'steps': 88250, 'loss/train': 1.96351957321167} 11/07/2021 09:36:52 - INFO - __main__ - Step 88252: {'lr': 0.0001857173715238154, 'samples': 16944384, 'steps': 88251, 'loss/train': 1.4610434770584106} 11/07/2021 09:36:52 - INFO - __main__ - Step 88253: {'lr': 0.0001857122432242836, 'samples': 16944576, 'steps': 88252, 'loss/train': 1.5329005718231201} 11/07/2021 09:36:53 - INFO - __main__ - Step 88254: {'lr': 0.00018570711495371884, 'samples': 16944768, 'steps': 88253, 'loss/train': 1.6136891841888428} 11/07/2021 09:36:54 - INFO - __main__ - Step 88255: {'lr': 0.00018570198671212347, 'samples': 16944960, 'steps': 88254, 'loss/train': 1.6596609354019165} 11/07/2021 09:36:54 - INFO - __main__ - Step 88256: {'lr': 0.0001856968584994998, 'samples': 16945152, 'steps': 88255, 'loss/train': 0.7454511523246765} 11/07/2021 09:36:54 - INFO - __main__ - Step 88257: {'lr': 0.0001856917303158501, 'samples': 16945344, 'steps': 88256, 'loss/train': 1.4822635650634766} 11/07/2021 09:36:55 - INFO - __main__ - Step 88258: {'lr': 0.00018568660216117673, 'samples': 16945536, 'steps': 88257, 'loss/train': 2.191793918609619} 11/07/2021 09:36:56 - INFO - __main__ - Step 88259: {'lr': 0.00018568147403548197, 'samples': 16945728, 'steps': 88258, 'loss/train': 0.8237287402153015} 11/07/2021 09:36:56 - INFO - __main__ - Step 88260: {'lr': 0.00018567634593876815, 'samples': 16945920, 'steps': 88259, 'loss/train': 1.5065083503723145} 11/07/2021 09:36:57 - INFO - __main__ - Step 88261: {'lr': 0.00018567121787103755, 'samples': 16946112, 'steps': 88260, 'loss/train': 1.482121229171753} 11/07/2021 09:36:57 - INFO - __main__ - Step 88262: {'lr': 0.00018566608983229253, 'samples': 16946304, 'steps': 88261, 'loss/train': 1.4093925952911377} 11/07/2021 09:36:57 - INFO - __main__ - Step 88263: {'lr': 0.00018566096182253536, 'samples': 16946496, 'steps': 88262, 'loss/train': 1.339577078819275} 11/07/2021 09:36:58 - INFO - __main__ - Step 88264: {'lr': 0.00018565583384176843, 'samples': 16946688, 'steps': 88263, 'loss/train': 0.9295227527618408} 11/07/2021 09:36:59 - INFO - __main__ - Step 88265: {'lr': 0.00018565070588999393, 'samples': 16946880, 'steps': 88264, 'loss/train': 1.6027756929397583} 11/07/2021 09:36:59 - INFO - __main__ - Step 88266: {'lr': 0.00018564557796721425, 'samples': 16947072, 'steps': 88265, 'loss/train': 0.7152631282806396} 11/07/2021 09:36:59 - INFO - __main__ - Step 88267: {'lr': 0.00018564045007343167, 'samples': 16947264, 'steps': 88266, 'loss/train': 1.495760440826416} 11/07/2021 09:37:00 - INFO - __main__ - Step 88268: {'lr': 0.0001856353222086485, 'samples': 16947456, 'steps': 88267, 'loss/train': 1.1084226369857788} 11/07/2021 09:37:00 - INFO - __main__ - Step 88269: {'lr': 0.0001856301943728671, 'samples': 16947648, 'steps': 88268, 'loss/train': 0.7779492139816284} 11/07/2021 09:37:01 - INFO - __main__ - Step 88270: {'lr': 0.00018562506656608974, 'samples': 16947840, 'steps': 88269, 'loss/train': 1.337316632270813} 11/07/2021 09:37:02 - INFO - __main__ - Step 88271: {'lr': 0.00018561993878831874, 'samples': 16948032, 'steps': 88270, 'loss/train': 0.9451234936714172} 11/07/2021 09:37:02 - INFO - __main__ - Step 88272: {'lr': 0.00018561481103955636, 'samples': 16948224, 'steps': 88271, 'loss/train': 1.4782445430755615} 11/07/2021 09:37:02 - INFO - __main__ - Step 88273: {'lr': 0.00018560968331980495, 'samples': 16948416, 'steps': 88272, 'loss/train': 1.4254032373428345} 11/07/2021 09:37:03 - INFO - __main__ - Step 88274: {'lr': 0.0001856045556290668, 'samples': 16948608, 'steps': 88273, 'loss/train': 1.5560991764068604} 11/07/2021 09:37:04 - INFO - __main__ - Step 88275: {'lr': 0.00018559942796734434, 'samples': 16948800, 'steps': 88274, 'loss/train': 1.0920578241348267} 11/07/2021 09:37:04 - INFO - __main__ - Step 88276: {'lr': 0.0001855943003346397, 'samples': 16948992, 'steps': 88275, 'loss/train': 1.3365237712860107} 11/07/2021 09:37:04 - INFO - __main__ - Step 88277: {'lr': 0.00018558917273095533, 'samples': 16949184, 'steps': 88276, 'loss/train': 1.5079917907714844} 11/07/2021 09:37:05 - INFO - __main__ - Step 88278: {'lr': 0.0001855840451562934, 'samples': 16949376, 'steps': 88277, 'loss/train': 1.2941921949386597} 11/07/2021 09:37:05 - INFO - __main__ - Step 88279: {'lr': 0.00018557891761065637, 'samples': 16949568, 'steps': 88278, 'loss/train': 1.3478682041168213} 11/07/2021 09:37:06 - INFO - __main__ - Step 88280: {'lr': 0.00018557379009404647, 'samples': 16949760, 'steps': 88279, 'loss/train': 1.3440001010894775} 11/07/2021 09:37:06 - INFO - __main__ - Step 88281: {'lr': 0.00018556866260646606, 'samples': 16949952, 'steps': 88280, 'loss/train': 0.738666296005249} 11/07/2021 09:37:07 - INFO - __main__ - Step 88282: {'lr': 0.00018556353514791735, 'samples': 16950144, 'steps': 88281, 'loss/train': 1.602458119392395} 11/07/2021 09:37:07 - INFO - __main__ - Step 88283: {'lr': 0.0001855584077184028, 'samples': 16950336, 'steps': 88282, 'loss/train': 1.5526663064956665} 11/07/2021 09:37:08 - INFO - __main__ - Step 88284: {'lr': 0.00018555328031792456, 'samples': 16950528, 'steps': 88283, 'loss/train': 1.6982477903366089} 11/07/2021 09:37:08 - INFO - __main__ - Step 88285: {'lr': 0.00018554815294648505, 'samples': 16950720, 'steps': 88284, 'loss/train': 1.564150333404541} 11/07/2021 09:37:09 - INFO - __main__ - Step 88286: {'lr': 0.0001855430256040866, 'samples': 16950912, 'steps': 88285, 'loss/train': 0.9290579557418823} 11/07/2021 09:37:09 - INFO - __main__ - Step 88287: {'lr': 0.00018553789829073143, 'samples': 16951104, 'steps': 88286, 'loss/train': 0.9532333612442017} 11/07/2021 09:37:10 - INFO - __main__ - Step 88288: {'lr': 0.00018553277100642185, 'samples': 16951296, 'steps': 88287, 'loss/train': 1.5610195398330688} 11/07/2021 09:37:10 - INFO - __main__ - Step 88289: {'lr': 0.0001855276437511602, 'samples': 16951488, 'steps': 88288, 'loss/train': 1.4336336851119995} 11/07/2021 09:37:10 - INFO - __main__ - Step 88290: {'lr': 0.00018552251652494885, 'samples': 16951680, 'steps': 88289, 'loss/train': 1.2701561450958252} 11/07/2021 09:37:11 - INFO - __main__ - Step 88291: {'lr': 0.00018551738932779, 'samples': 16951872, 'steps': 88290, 'loss/train': 1.4722403287887573} 11/07/2021 09:37:12 - INFO - __main__ - Step 88292: {'lr': 0.00018551226215968609, 'samples': 16952064, 'steps': 88291, 'loss/train': 1.257603406906128} 11/07/2021 09:37:12 - INFO - __main__ - Step 88293: {'lr': 0.00018550713502063932, 'samples': 16952256, 'steps': 88292, 'loss/train': 1.6153258085250854} 11/07/2021 09:37:13 - INFO - __main__ - Step 88294: {'lr': 0.00018550200791065202, 'samples': 16952448, 'steps': 88293, 'loss/train': 1.4596471786499023} 11/07/2021 09:37:13 - INFO - __main__ - Step 88295: {'lr': 0.00018549688082972654, 'samples': 16952640, 'steps': 88294, 'loss/train': 1.4065016508102417} 11/07/2021 09:37:13 - INFO - __main__ - Step 88296: {'lr': 0.00018549175377786516, 'samples': 16952832, 'steps': 88295, 'loss/train': 1.2540242671966553} 11/07/2021 09:37:14 - INFO - __main__ - Step 88297: {'lr': 0.00018548662675507032, 'samples': 16953024, 'steps': 88296, 'loss/train': 1.3233870267868042} 11/07/2021 09:37:15 - INFO - __main__ - Step 88298: {'lr': 0.0001854814997613441, 'samples': 16953216, 'steps': 88297, 'loss/train': 1.423478126525879} 11/07/2021 09:37:15 - INFO - __main__ - Step 88299: {'lr': 0.00018547637279668893, 'samples': 16953408, 'steps': 88298, 'loss/train': 1.8209624290466309} 11/07/2021 09:37:15 - INFO - __main__ - Step 88300: {'lr': 0.0001854712458611071, 'samples': 16953600, 'steps': 88299, 'loss/train': 1.8495690822601318} 11/07/2021 09:37:16 - INFO - __main__ - Step 88301: {'lr': 0.00018546611895460093, 'samples': 16953792, 'steps': 88300, 'loss/train': 1.5057791471481323} 11/07/2021 09:37:17 - INFO - __main__ - Step 88302: {'lr': 0.00018546099207717275, 'samples': 16953984, 'steps': 88301, 'loss/train': 1.3851046562194824} 11/07/2021 09:37:17 - INFO - __main__ - Step 88303: {'lr': 0.00018545586522882482, 'samples': 16954176, 'steps': 88302, 'loss/train': 1.2617915868759155} 11/07/2021 09:37:18 - INFO - __main__ - Step 88304: {'lr': 0.0001854507384095595, 'samples': 16954368, 'steps': 88303, 'loss/train': 1.1981806755065918} 11/07/2021 09:37:18 - INFO - __main__ - Step 88305: {'lr': 0.00018544561161937907, 'samples': 16954560, 'steps': 88304, 'loss/train': 1.463682770729065} 11/07/2021 09:37:18 - INFO - __main__ - Step 88306: {'lr': 0.00018544048485828586, 'samples': 16954752, 'steps': 88305, 'loss/train': 1.4895031452178955} 11/07/2021 09:37:20 - INFO - __main__ - Step 88307: {'lr': 0.00018543535812628217, 'samples': 16954944, 'steps': 88306, 'loss/train': 1.6210983991622925} 11/07/2021 09:37:20 - INFO - __main__ - Step 88308: {'lr': 0.0001854302314233703, 'samples': 16955136, 'steps': 88307, 'loss/train': 1.3760297298431396} 11/07/2021 09:37:20 - INFO - __main__ - Step 88309: {'lr': 0.00018542510474955259, 'samples': 16955328, 'steps': 88308, 'loss/train': 1.5939313173294067} 11/07/2021 09:37:21 - INFO - __main__ - Step 88310: {'lr': 0.00018541997810483146, 'samples': 16955520, 'steps': 88309, 'loss/train': 1.2894539833068848} 11/07/2021 09:37:21 - INFO - __main__ - Step 88311: {'lr': 0.00018541485148920896, 'samples': 16955712, 'steps': 88310, 'loss/train': 0.3812190592288971} 11/07/2021 09:37:21 - INFO - __main__ - Step 88312: {'lr': 0.0001854097249026875, 'samples': 16955904, 'steps': 88311, 'loss/train': 1.0643829107284546} 11/07/2021 09:37:22 - INFO - __main__ - Step 88313: {'lr': 0.00018540459834526945, 'samples': 16956096, 'steps': 88312, 'loss/train': 1.2606637477874756} 11/07/2021 09:37:23 - INFO - __main__ - Step 88314: {'lr': 0.0001853994718169571, 'samples': 16956288, 'steps': 88313, 'loss/train': 1.3324445486068726} 11/07/2021 09:37:23 - INFO - __main__ - Step 88315: {'lr': 0.00018539434531775274, 'samples': 16956480, 'steps': 88314, 'loss/train': 1.24648916721344} 11/07/2021 09:37:24 - INFO - __main__ - Step 88316: {'lr': 0.0001853892188476587, 'samples': 16956672, 'steps': 88315, 'loss/train': 1.5689198970794678} 11/07/2021 09:37:24 - INFO - __main__ - Step 88317: {'lr': 0.00018538409240667725, 'samples': 16956864, 'steps': 88316, 'loss/train': 0.32534095644950867} 11/07/2021 09:37:25 - INFO - __main__ - Step 88318: {'lr': 0.00018537896599481077, 'samples': 16957056, 'steps': 88317, 'loss/train': 1.4334971904754639} 11/07/2021 09:37:25 - INFO - __main__ - Step 88319: {'lr': 0.0001853738396120615, 'samples': 16957248, 'steps': 88318, 'loss/train': 1.1137090921401978} 11/07/2021 09:37:26 - INFO - __main__ - Step 88320: {'lr': 0.0001853687132584318, 'samples': 16957440, 'steps': 88319, 'loss/train': 0.920257031917572} 11/07/2021 09:37:26 - INFO - __main__ - Step 88321: {'lr': 0.00018536358693392396, 'samples': 16957632, 'steps': 88320, 'loss/train': 1.777097463607788} 11/07/2021 09:37:26 - INFO - __main__ - Step 88322: {'lr': 0.00018535846063854027, 'samples': 16957824, 'steps': 88321, 'loss/train': 1.122687816619873} 11/07/2021 09:37:27 - INFO - __main__ - Step 88323: {'lr': 0.0001853533343722831, 'samples': 16958016, 'steps': 88322, 'loss/train': 1.3639754056930542} 11/07/2021 09:37:28 - INFO - __main__ - Step 88324: {'lr': 0.00018534820813515478, 'samples': 16958208, 'steps': 88323, 'loss/train': 1.0652856826782227} 11/07/2021 09:37:28 - INFO - __main__ - Step 88325: {'lr': 0.0001853430819271575, 'samples': 16958400, 'steps': 88324, 'loss/train': 0.8799144625663757} 11/07/2021 09:37:28 - INFO - __main__ - Step 88326: {'lr': 0.0001853379557482936, 'samples': 16958592, 'steps': 88325, 'loss/train': 1.2324339151382446} 11/07/2021 09:37:29 - INFO - __main__ - Step 88327: {'lr': 0.00018533282959856543, 'samples': 16958784, 'steps': 88326, 'loss/train': 1.2846652269363403} 11/07/2021 09:37:30 - INFO - __main__ - Step 88328: {'lr': 0.00018532770347797528, 'samples': 16958976, 'steps': 88327, 'loss/train': 1.5967596769332886} 11/07/2021 09:37:30 - INFO - __main__ - Step 88329: {'lr': 0.00018532257738652547, 'samples': 16959168, 'steps': 88328, 'loss/train': 1.2727805376052856} 11/07/2021 09:37:30 - INFO - __main__ - Step 88330: {'lr': 0.0001853174513242183, 'samples': 16959360, 'steps': 88329, 'loss/train': 1.3074430227279663} 11/07/2021 09:37:31 - INFO - __main__ - Step 88331: {'lr': 0.00018531232529105614, 'samples': 16959552, 'steps': 88330, 'loss/train': 1.2905009984970093} 11/07/2021 09:37:31 - INFO - __main__ - Step 88332: {'lr': 0.00018530719928704117, 'samples': 16959744, 'steps': 88331, 'loss/train': 1.7357604503631592} 11/07/2021 09:37:32 - INFO - __main__ - Step 88333: {'lr': 0.0001853020733121758, 'samples': 16959936, 'steps': 88332, 'loss/train': 1.391742467880249} 11/07/2021 09:37:33 - INFO - __main__ - Step 88334: {'lr': 0.00018529694736646235, 'samples': 16960128, 'steps': 88333, 'loss/train': 1.3040558099746704} 11/07/2021 09:37:33 - INFO - __main__ - Step 88335: {'lr': 0.00018529182144990308, 'samples': 16960320, 'steps': 88334, 'loss/train': 1.525428295135498} 11/07/2021 09:37:33 - INFO - __main__ - Step 88336: {'lr': 0.00018528669556250034, 'samples': 16960512, 'steps': 88335, 'loss/train': 1.2807601690292358} 11/07/2021 09:37:34 - INFO - __main__ - Step 88337: {'lr': 0.00018528156970425646, 'samples': 16960704, 'steps': 88336, 'loss/train': 1.270269513130188} 11/07/2021 09:37:34 - INFO - __main__ - Step 88338: {'lr': 0.00018527644387517368, 'samples': 16960896, 'steps': 88337, 'loss/train': 1.3837889432907104} 11/07/2021 09:37:35 - INFO - __main__ - Step 88339: {'lr': 0.00018527131807525427, 'samples': 16961088, 'steps': 88338, 'loss/train': 1.1780116558074951} 11/07/2021 09:37:35 - INFO - __main__ - Step 88340: {'lr': 0.00018526619230450065, 'samples': 16961280, 'steps': 88339, 'loss/train': 1.5741592645645142} 11/07/2021 09:37:36 - INFO - __main__ - Step 88341: {'lr': 0.00018526106656291505, 'samples': 16961472, 'steps': 88340, 'loss/train': 1.7712616920471191} 11/07/2021 09:37:36 - INFO - __main__ - Step 88342: {'lr': 0.00018525594085049983, 'samples': 16961664, 'steps': 88341, 'loss/train': 1.3134578466415405} 11/07/2021 09:37:36 - INFO - __main__ - Step 88343: {'lr': 0.0001852508151672573, 'samples': 16961856, 'steps': 88342, 'loss/train': 1.2971640825271606} 11/07/2021 09:37:38 - INFO - __main__ - Step 88344: {'lr': 0.00018524568951318971, 'samples': 16962048, 'steps': 88343, 'loss/train': 1.3159749507904053} 11/07/2021 09:37:38 - INFO - __main__ - Step 88345: {'lr': 0.00018524056388829945, 'samples': 16962240, 'steps': 88344, 'loss/train': 0.06861434876918793} 11/07/2021 09:37:39 - INFO - __main__ - Step 88346: {'lr': 0.00018523543829258876, 'samples': 16962432, 'steps': 88345, 'loss/train': 1.1194946765899658} 11/07/2021 09:37:39 - INFO - __main__ - Step 88347: {'lr': 0.00018523031272606004, 'samples': 16962624, 'steps': 88346, 'loss/train': 1.5221492052078247} 11/07/2021 09:37:39 - INFO - __main__ - Step 88348: {'lr': 0.0001852251871887155, 'samples': 16962816, 'steps': 88347, 'loss/train': 0.8595049977302551} 11/07/2021 09:37:40 - INFO - __main__ - Step 88349: {'lr': 0.0001852200616805575, 'samples': 16963008, 'steps': 88348, 'loss/train': 1.2947587966918945} 11/07/2021 09:37:41 - INFO - __main__ - Step 88350: {'lr': 0.00018521493620158832, 'samples': 16963200, 'steps': 88349, 'loss/train': 1.292917251586914} 11/07/2021 09:37:41 - INFO - __main__ - Step 88351: {'lr': 0.00018520981075181042, 'samples': 16963392, 'steps': 88350, 'loss/train': 1.1856127977371216} 11/07/2021 09:37:41 - INFO - __main__ - Step 88352: {'lr': 0.00018520468533122586, 'samples': 16963584, 'steps': 88351, 'loss/train': 1.2541958093643188} 11/07/2021 09:37:42 - INFO - __main__ - Step 88353: {'lr': 0.00018519955993983708, 'samples': 16963776, 'steps': 88352, 'loss/train': 1.3545420169830322} 11/07/2021 09:37:42 - INFO - __main__ - Step 88354: {'lr': 0.0001851944345776464, 'samples': 16963968, 'steps': 88353, 'loss/train': 1.278406023979187} 11/07/2021 09:37:43 - INFO - __main__ - Step 88355: {'lr': 0.00018518930924465605, 'samples': 16964160, 'steps': 88354, 'loss/train': 1.2961905002593994} 11/07/2021 09:37:44 - INFO - __main__ - Step 88356: {'lr': 0.00018518418394086844, 'samples': 16964352, 'steps': 88355, 'loss/train': 0.5234193205833435} 11/07/2021 09:37:44 - INFO - __main__ - Step 88357: {'lr': 0.00018517905866628583, 'samples': 16964544, 'steps': 88356, 'loss/train': 0.5805550813674927} 11/07/2021 09:37:44 - INFO - __main__ - Step 88358: {'lr': 0.00018517393342091054, 'samples': 16964736, 'steps': 88357, 'loss/train': 1.698594331741333} 11/07/2021 09:37:45 - INFO - __main__ - Step 88359: {'lr': 0.00018516880820474484, 'samples': 16964928, 'steps': 88358, 'loss/train': 1.550956130027771} 11/07/2021 09:37:45 - INFO - __main__ - Step 88360: {'lr': 0.00018516368301779113, 'samples': 16965120, 'steps': 88359, 'loss/train': 1.6347318887710571} 11/07/2021 09:37:46 - INFO - __main__ - Step 88361: {'lr': 0.00018515855786005163, 'samples': 16965312, 'steps': 88360, 'loss/train': 0.8555581569671631} 11/07/2021 09:37:46 - INFO - __main__ - Step 88362: {'lr': 0.0001851534327315287, 'samples': 16965504, 'steps': 88361, 'loss/train': 1.3835840225219727} 11/07/2021 09:37:47 - INFO - __main__ - Step 88363: {'lr': 0.00018514830763222462, 'samples': 16965696, 'steps': 88362, 'loss/train': 1.1102097034454346} 11/07/2021 09:37:47 - INFO - __main__ - Step 88364: {'lr': 0.0001851431825621418, 'samples': 16965888, 'steps': 88363, 'loss/train': 0.9967725276947021} 11/07/2021 09:37:47 - INFO - __main__ - Step 88365: {'lr': 0.0001851380575212824, 'samples': 16966080, 'steps': 88364, 'loss/train': 1.4022470712661743} 11/07/2021 09:37:48 - INFO - __main__ - Step 88366: {'lr': 0.00018513293250964875, 'samples': 16966272, 'steps': 88365, 'loss/train': 1.3212907314300537} 11/07/2021 09:37:49 - INFO - __main__ - Step 88367: {'lr': 0.00018512780752724323, 'samples': 16966464, 'steps': 88366, 'loss/train': 1.195711374282837} 11/07/2021 09:37:49 - INFO - __main__ - Step 88368: {'lr': 0.0001851226825740681, 'samples': 16966656, 'steps': 88367, 'loss/train': 1.5942158699035645} 11/07/2021 09:37:49 - INFO - __main__ - Step 88369: {'lr': 0.00018511755765012567, 'samples': 16966848, 'steps': 88368, 'loss/train': 1.2274558544158936} 11/07/2021 09:37:50 - INFO - __main__ - Step 88370: {'lr': 0.00018511243275541828, 'samples': 16967040, 'steps': 88369, 'loss/train': 1.2784847021102905} 11/07/2021 09:37:51 - INFO - __main__ - Step 88371: {'lr': 0.00018510730788994827, 'samples': 16967232, 'steps': 88370, 'loss/train': 2.253816604614258} 11/07/2021 09:37:51 - INFO - __main__ - Step 88372: {'lr': 0.00018510218305371783, 'samples': 16967424, 'steps': 88371, 'loss/train': 1.8141919374465942} 11/07/2021 09:37:52 - INFO - __main__ - Step 88373: {'lr': 0.0001850970582467294, 'samples': 16967616, 'steps': 88372, 'loss/train': 0.7776252627372742} 11/07/2021 09:37:52 - INFO - __main__ - Step 88374: {'lr': 0.00018509193346898524, 'samples': 16967808, 'steps': 88373, 'loss/train': 1.2937599420547485} 11/07/2021 09:37:52 - INFO - __main__ - Step 88375: {'lr': 0.0001850868087204876, 'samples': 16968000, 'steps': 88374, 'loss/train': 1.499230146408081} 11/07/2021 09:37:53 - INFO - __main__ - Step 88376: {'lr': 0.0001850816840012389, 'samples': 16968192, 'steps': 88375, 'loss/train': 1.2267998456954956} 11/07/2021 09:37:54 - INFO - __main__ - Step 88377: {'lr': 0.00018507655931124145, 'samples': 16968384, 'steps': 88376, 'loss/train': 1.2123448848724365} 11/07/2021 09:37:54 - INFO - __main__ - Step 88378: {'lr': 0.00018507143465049746, 'samples': 16968576, 'steps': 88377, 'loss/train': 1.4257670640945435} 11/07/2021 09:37:54 - INFO - __main__ - Step 88379: {'lr': 0.0001850663100190092, 'samples': 16968768, 'steps': 88378, 'loss/train': 1.5366777181625366} 11/07/2021 09:37:55 - INFO - __main__ - Step 88380: {'lr': 0.00018506118541677913, 'samples': 16968960, 'steps': 88379, 'loss/train': 1.456566572189331} 11/07/2021 09:37:55 - INFO - __main__ - Step 88381: {'lr': 0.00018505606084380944, 'samples': 16969152, 'steps': 88380, 'loss/train': 1.1484131813049316} 11/07/2021 09:37:56 - INFO - __main__ - Step 88382: {'lr': 0.0001850509363001025, 'samples': 16969344, 'steps': 88381, 'loss/train': 1.535017728805542} 11/07/2021 09:37:56 - INFO - __main__ - Step 88383: {'lr': 0.0001850458117856606, 'samples': 16969536, 'steps': 88382, 'loss/train': 1.624950647354126} 11/07/2021 09:37:57 - INFO - __main__ - Step 88384: {'lr': 0.00018504068730048606, 'samples': 16969728, 'steps': 88383, 'loss/train': 1.5606372356414795} 11/07/2021 09:37:57 - INFO - __main__ - Step 88385: {'lr': 0.00018503556284458117, 'samples': 16969920, 'steps': 88384, 'loss/train': 1.0536882877349854} 11/07/2021 09:37:57 - INFO - __main__ - Step 88386: {'lr': 0.00018503043841794828, 'samples': 16970112, 'steps': 88385, 'loss/train': 1.6231894493103027} 11/07/2021 09:37:58 - INFO - __main__ - Step 88387: {'lr': 0.00018502531402058973, 'samples': 16970304, 'steps': 88386, 'loss/train': 1.0241146087646484} 11/07/2021 09:37:59 - INFO - __main__ - Step 88388: {'lr': 0.0001850201896525077, 'samples': 16970496, 'steps': 88387, 'loss/train': 1.5080543756484985} 11/07/2021 09:37:59 - INFO - __main__ - Step 88389: {'lr': 0.00018501506531370455, 'samples': 16970688, 'steps': 88388, 'loss/train': 1.2404406070709229} 11/07/2021 09:37:59 - INFO - __main__ - Step 88390: {'lr': 0.00018500994100418265, 'samples': 16970880, 'steps': 88389, 'loss/train': 1.3250033855438232} 11/07/2021 09:38:00 - INFO - __main__ - Step 88391: {'lr': 0.0001850048167239443, 'samples': 16971072, 'steps': 88390, 'loss/train': 1.2151169776916504} 11/07/2021 09:38:01 - INFO - __main__ - Step 88392: {'lr': 0.00018499969247299172, 'samples': 16971264, 'steps': 88391, 'loss/train': 1.4568558931350708} 11/07/2021 09:38:02 - INFO - __main__ - Step 88393: {'lr': 0.00018499456825132727, 'samples': 16971456, 'steps': 88392, 'loss/train': 1.5993582010269165} 11/07/2021 09:38:02 - INFO - __main__ - Step 88394: {'lr': 0.0001849894440589533, 'samples': 16971648, 'steps': 88393, 'loss/train': 1.1156933307647705} 11/07/2021 09:38:03 - INFO - __main__ - Step 88395: {'lr': 0.00018498431989587204, 'samples': 16971840, 'steps': 88394, 'loss/train': 1.7176991701126099} 11/07/2021 09:38:03 - INFO - __main__ - Step 88396: {'lr': 0.00018497919576208587, 'samples': 16972032, 'steps': 88395, 'loss/train': 1.5568665266036987} 11/07/2021 09:38:04 - INFO - __main__ - Step 88397: {'lr': 0.00018497407165759706, 'samples': 16972224, 'steps': 88396, 'loss/train': 0.4589582085609436} 11/07/2021 09:38:04 - INFO - __main__ - Step 88398: {'lr': 0.00018496894758240797, 'samples': 16972416, 'steps': 88397, 'loss/train': 1.6342806816101074} 11/07/2021 09:38:05 - INFO - __main__ - Step 88399: {'lr': 0.00018496382353652084, 'samples': 16972608, 'steps': 88398, 'loss/train': 1.4170260429382324} 11/07/2021 09:38:05 - INFO - __main__ - Step 88400: {'lr': 0.000184958699519938, 'samples': 16972800, 'steps': 88399, 'loss/train': 1.9730329513549805} 11/07/2021 09:38:05 - INFO - __main__ - Step 88401: {'lr': 0.00018495357553266177, 'samples': 16972992, 'steps': 88400, 'loss/train': 1.5722813606262207} 11/07/2021 09:38:06 - INFO - __main__ - Step 88402: {'lr': 0.00018494845157469443, 'samples': 16973184, 'steps': 88401, 'loss/train': 1.3696943521499634} 11/07/2021 09:38:07 - INFO - __main__ - Step 88403: {'lr': 0.00018494332764603833, 'samples': 16973376, 'steps': 88402, 'loss/train': 1.1427465677261353} 11/07/2021 09:38:07 - INFO - __main__ - Step 88404: {'lr': 0.00018493820374669584, 'samples': 16973568, 'steps': 88403, 'loss/train': 1.2019290924072266} 11/07/2021 09:38:07 - INFO - __main__ - Step 88405: {'lr': 0.0001849330798766691, 'samples': 16973760, 'steps': 88404, 'loss/train': 1.3732243776321411} 11/07/2021 09:38:08 - INFO - __main__ - Step 88406: {'lr': 0.0001849279560359605, 'samples': 16973952, 'steps': 88405, 'loss/train': 1.623282790184021} 11/07/2021 09:38:08 - INFO - __main__ - Step 88407: {'lr': 0.0001849228322245724, 'samples': 16974144, 'steps': 88406, 'loss/train': 1.442855954170227} 11/07/2021 09:38:09 - INFO - __main__ - Step 88408: {'lr': 0.00018491770844250704, 'samples': 16974336, 'steps': 88407, 'loss/train': 1.1058425903320312} 11/07/2021 09:38:09 - INFO - __main__ - Step 88409: {'lr': 0.00018491258468976684, 'samples': 16974528, 'steps': 88408, 'loss/train': 1.4457935094833374} 11/07/2021 09:38:10 - INFO - __main__ - Step 88410: {'lr': 0.00018490746096635398, 'samples': 16974720, 'steps': 88409, 'loss/train': 0.9549041390419006} 11/07/2021 09:38:10 - INFO - __main__ - Step 88411: {'lr': 0.00018490233727227077, 'samples': 16974912, 'steps': 88410, 'loss/train': 1.5007342100143433} 11/07/2021 09:38:10 - INFO - __main__ - Step 88412: {'lr': 0.0001848972136075196, 'samples': 16975104, 'steps': 88411, 'loss/train': 1.2320383787155151} 11/07/2021 09:38:11 - INFO - __main__ - Step 88413: {'lr': 0.00018489208997210272, 'samples': 16975296, 'steps': 88412, 'loss/train': 1.0980224609375} 11/07/2021 09:38:12 - INFO - __main__ - Step 88414: {'lr': 0.00018488696636602243, 'samples': 16975488, 'steps': 88413, 'loss/train': 1.4632648229599} 11/07/2021 09:38:12 - INFO - __main__ - Step 88415: {'lr': 0.00018488184278928112, 'samples': 16975680, 'steps': 88414, 'loss/train': 1.5712970495224} 11/07/2021 09:38:12 - INFO - __main__ - Step 88416: {'lr': 0.000184876719241881, 'samples': 16975872, 'steps': 88415, 'loss/train': 1.7765812873840332} 11/07/2021 09:38:13 - INFO - __main__ - Step 88417: {'lr': 0.00018487159572382446, 'samples': 16976064, 'steps': 88416, 'loss/train': 1.5112470388412476} 11/07/2021 09:38:14 - INFO - __main__ - Step 88418: {'lr': 0.00018486647223511383, 'samples': 16976256, 'steps': 88417, 'loss/train': 1.361863374710083} 11/07/2021 09:38:14 - INFO - __main__ - Step 88419: {'lr': 0.00018486134877575129, 'samples': 16976448, 'steps': 88418, 'loss/train': 1.5450800657272339} 11/07/2021 09:38:15 - INFO - __main__ - Step 88420: {'lr': 0.00018485622534573928, 'samples': 16976640, 'steps': 88419, 'loss/train': 1.2244316339492798} 11/07/2021 09:38:15 - INFO - __main__ - Step 88421: {'lr': 0.00018485110194508002, 'samples': 16976832, 'steps': 88420, 'loss/train': 1.3602761030197144} 11/07/2021 09:38:15 - INFO - __main__ - Step 88422: {'lr': 0.00018484597857377583, 'samples': 16977024, 'steps': 88421, 'loss/train': 1.6600549221038818} 11/07/2021 09:38:16 - INFO - __main__ - Step 88423: {'lr': 0.00018484085523182904, 'samples': 16977216, 'steps': 88422, 'loss/train': 1.4355653524398804} 11/07/2021 09:38:17 - INFO - __main__ - Step 88424: {'lr': 0.0001848357319192419, 'samples': 16977408, 'steps': 88423, 'loss/train': 1.1205469369888306} 11/07/2021 09:38:17 - INFO - __main__ - Step 88425: {'lr': 0.00018483060863601686, 'samples': 16977600, 'steps': 88424, 'loss/train': 1.3831627368927002} 11/07/2021 09:38:17 - INFO - __main__ - Step 88426: {'lr': 0.0001848254853821561, 'samples': 16977792, 'steps': 88425, 'loss/train': 1.1702386140823364} 11/07/2021 09:38:18 - INFO - __main__ - Step 88427: {'lr': 0.00018482036215766197, 'samples': 16977984, 'steps': 88426, 'loss/train': 1.6101503372192383} 11/07/2021 09:38:18 - INFO - __main__ - Step 88428: {'lr': 0.00018481523896253678, 'samples': 16978176, 'steps': 88427, 'loss/train': 1.5176748037338257} 11/07/2021 09:38:19 - INFO - __main__ - Step 88429: {'lr': 0.00018481011579678288, 'samples': 16978368, 'steps': 88428, 'loss/train': 1.3741663694381714} 11/07/2021 09:38:19 - INFO - __main__ - Step 88430: {'lr': 0.00018480499266040247, 'samples': 16978560, 'steps': 88429, 'loss/train': 1.1860766410827637} 11/07/2021 09:38:20 - INFO - __main__ - Step 88431: {'lr': 0.00018479986955339807, 'samples': 16978752, 'steps': 88430, 'loss/train': 0.8512476682662964} 11/07/2021 09:38:20 - INFO - __main__ - Step 88432: {'lr': 0.00018479474647577172, 'samples': 16978944, 'steps': 88431, 'loss/train': 1.299530029296875} 11/07/2021 09:38:20 - INFO - __main__ - Step 88433: {'lr': 0.00018478962342752584, 'samples': 16979136, 'steps': 88432, 'loss/train': 1.4771084785461426} 11/07/2021 09:38:22 - INFO - __main__ - Step 88434: {'lr': 0.00018478450040866276, 'samples': 16979328, 'steps': 88433, 'loss/train': 1.5450925827026367} 11/07/2021 09:38:22 - INFO - __main__ - Step 88435: {'lr': 0.00018477937741918476, 'samples': 16979520, 'steps': 88434, 'loss/train': 1.6363869905471802} 11/07/2021 09:38:22 - INFO - __main__ - Step 88436: {'lr': 0.00018477425445909422, 'samples': 16979712, 'steps': 88435, 'loss/train': 1.3579291105270386} 11/07/2021 09:38:23 - INFO - __main__ - Step 88437: {'lr': 0.00018476913152839337, 'samples': 16979904, 'steps': 88436, 'loss/train': 1.0584841966629028} 11/07/2021 09:38:23 - INFO - __main__ - Step 88438: {'lr': 0.00018476400862708453, 'samples': 16980096, 'steps': 88437, 'loss/train': 1.6769975423812866} 11/07/2021 09:38:24 - INFO - __main__ - Step 88439: {'lr': 0.00018475888575517004, 'samples': 16980288, 'steps': 88438, 'loss/train': 1.12332022190094} 11/07/2021 09:38:24 - INFO - __main__ - Step 88440: {'lr': 0.00018475376291265217, 'samples': 16980480, 'steps': 88439, 'loss/train': 1.4756168127059937} 11/07/2021 09:38:25 - INFO - __main__ - Step 88441: {'lr': 0.00018474864009953323, 'samples': 16980672, 'steps': 88440, 'loss/train': 1.8240034580230713} 11/07/2021 09:38:25 - INFO - __main__ - Step 88442: {'lr': 0.00018474351731581558, 'samples': 16980864, 'steps': 88441, 'loss/train': 1.5602693557739258} 11/07/2021 09:38:25 - INFO - __main__ - Step 88443: {'lr': 0.0001847383945615015, 'samples': 16981056, 'steps': 88442, 'loss/train': 1.4290788173675537} 11/07/2021 09:38:26 - INFO - __main__ - Step 88444: {'lr': 0.00018473327183659327, 'samples': 16981248, 'steps': 88443, 'loss/train': 1.4444482326507568} 11/07/2021 09:38:27 - INFO - __main__ - Step 88445: {'lr': 0.00018472814914109333, 'samples': 16981440, 'steps': 88444, 'loss/train': 1.2059228420257568} 11/07/2021 09:38:27 - INFO - __main__ - Step 88446: {'lr': 0.00018472302647500378, 'samples': 16981632, 'steps': 88445, 'loss/train': 1.8141427040100098} 11/07/2021 09:38:27 - INFO - __main__ - Step 88447: {'lr': 0.000184717903838327, 'samples': 16981824, 'steps': 88446, 'loss/train': 1.1172775030136108} 11/07/2021 09:38:28 - INFO - __main__ - Step 88448: {'lr': 0.00018471278123106537, 'samples': 16982016, 'steps': 88447, 'loss/train': 1.3127480745315552} 11/07/2021 09:38:29 - INFO - __main__ - Step 88449: {'lr': 0.00018470765865322112, 'samples': 16982208, 'steps': 88448, 'loss/train': 1.4970526695251465} 11/07/2021 09:38:29 - INFO - __main__ - Step 88450: {'lr': 0.0001847025361047966, 'samples': 16982400, 'steps': 88449, 'loss/train': 1.336340308189392} 11/07/2021 09:38:30 - INFO - __main__ - Step 88451: {'lr': 0.0001846974135857941, 'samples': 16982592, 'steps': 88450, 'loss/train': 1.729987621307373} 11/07/2021 09:38:30 - INFO - __main__ - Step 88452: {'lr': 0.00018469229109621595, 'samples': 16982784, 'steps': 88451, 'loss/train': 1.5703775882720947} 11/07/2021 09:38:30 - INFO - __main__ - Step 88453: {'lr': 0.00018468716863606445, 'samples': 16982976, 'steps': 88452, 'loss/train': 1.4836348295211792} 11/07/2021 09:38:31 - INFO - __main__ - Step 88454: {'lr': 0.0001846820462053419, 'samples': 16983168, 'steps': 88453, 'loss/train': 0.6466681957244873} 11/07/2021 09:38:32 - INFO - __main__ - Step 88455: {'lr': 0.0001846769238040506, 'samples': 16983360, 'steps': 88454, 'loss/train': 1.272329330444336} 11/07/2021 09:38:32 - INFO - __main__ - Step 88456: {'lr': 0.00018467180143219293, 'samples': 16983552, 'steps': 88455, 'loss/train': 1.430683970451355} 11/07/2021 09:38:32 - INFO - __main__ - Step 88457: {'lr': 0.00018466667908977107, 'samples': 16983744, 'steps': 88456, 'loss/train': 1.2632339000701904} 11/07/2021 09:38:33 - INFO - __main__ - Step 88458: {'lr': 0.00018466155677678754, 'samples': 16983936, 'steps': 88457, 'loss/train': 1.3055429458618164} 11/07/2021 09:38:33 - INFO - __main__ - Step 88459: {'lr': 0.00018465643449324436, 'samples': 16984128, 'steps': 88458, 'loss/train': 1.2105387449264526} 11/07/2021 09:38:34 - INFO - __main__ - Step 88460: {'lr': 0.000184651312239144, 'samples': 16984320, 'steps': 88459, 'loss/train': 2.1048808097839355} 11/07/2021 09:38:34 - INFO - __main__ - Step 88461: {'lr': 0.00018464619001448874, 'samples': 16984512, 'steps': 88460, 'loss/train': 1.7040201425552368} 11/07/2021 09:38:35 - INFO - __main__ - Step 88462: {'lr': 0.0001846410678192809, 'samples': 16984704, 'steps': 88461, 'loss/train': 1.234546422958374} 11/07/2021 09:38:35 - INFO - __main__ - Step 88463: {'lr': 0.00018463594565352282, 'samples': 16984896, 'steps': 88462, 'loss/train': 1.6601985692977905} 11/07/2021 09:38:36 - INFO - __main__ - Step 88464: {'lr': 0.00018463082351721677, 'samples': 16985088, 'steps': 88463, 'loss/train': 1.720964789390564} 11/07/2021 09:38:37 - INFO - __main__ - Step 88465: {'lr': 0.00018462570141036504, 'samples': 16985280, 'steps': 88464, 'loss/train': 1.2061463594436646} 11/07/2021 09:38:37 - INFO - __main__ - Step 88466: {'lr': 0.00018462057933296995, 'samples': 16985472, 'steps': 88465, 'loss/train': 1.8839586973190308} 11/07/2021 09:38:37 - INFO - __main__ - Step 88467: {'lr': 0.00018461545728503382, 'samples': 16985664, 'steps': 88466, 'loss/train': 1.2658604383468628} 11/07/2021 09:38:38 - INFO - __main__ - Step 88468: {'lr': 0.000184610335266559, 'samples': 16985856, 'steps': 88467, 'loss/train': 0.8606667518615723} 11/07/2021 09:38:38 - INFO - __main__ - Step 88469: {'lr': 0.0001846052132775477, 'samples': 16986048, 'steps': 88468, 'loss/train': 0.8297231197357178} 11/07/2021 09:38:39 - INFO - __main__ - Step 88470: {'lr': 0.00018460009131800233, 'samples': 16986240, 'steps': 88469, 'loss/train': 1.2634671926498413} 11/07/2021 09:38:39 - INFO - __main__ - Step 88471: {'lr': 0.0001845949693879251, 'samples': 16986432, 'steps': 88470, 'loss/train': 1.2159693241119385} 11/07/2021 09:38:40 - INFO - __main__ - Step 88472: {'lr': 0.0001845898474873185, 'samples': 16986624, 'steps': 88471, 'loss/train': 1.5690562725067139} 11/07/2021 09:38:40 - INFO - __main__ - Step 88473: {'lr': 0.0001845847256161846, 'samples': 16986816, 'steps': 88472, 'loss/train': 1.701293706893921} 11/07/2021 09:38:40 - INFO - __main__ - Step 88474: {'lr': 0.00018457960377452583, 'samples': 16987008, 'steps': 88473, 'loss/train': 1.4570322036743164} 11/07/2021 09:38:41 - INFO - __main__ - Step 88475: {'lr': 0.00018457448196234445, 'samples': 16987200, 'steps': 88474, 'loss/train': 1.2957111597061157} 11/07/2021 09:38:42 - INFO - __main__ - Step 88476: {'lr': 0.00018456936017964283, 'samples': 16987392, 'steps': 88475, 'loss/train': 1.4811177253723145} 11/07/2021 09:38:42 - INFO - __main__ - Step 88477: {'lr': 0.0001845642384264232, 'samples': 16987584, 'steps': 88476, 'loss/train': 1.568860411643982} 11/07/2021 09:38:42 - INFO - __main__ - Step 88478: {'lr': 0.00018455911670268792, 'samples': 16987776, 'steps': 88477, 'loss/train': 1.6460696458816528} 11/07/2021 09:38:43 - INFO - __main__ - Step 88479: {'lr': 0.00018455399500843934, 'samples': 16987968, 'steps': 88478, 'loss/train': 1.2165428400039673} 11/07/2021 09:38:44 - INFO - __main__ - Step 88480: {'lr': 0.0001845488733436797, 'samples': 16988160, 'steps': 88479, 'loss/train': 0.6098188757896423} 11/07/2021 09:38:44 - INFO - __main__ - Step 88481: {'lr': 0.00018454375170841132, 'samples': 16988352, 'steps': 88480, 'loss/train': 1.2862910032272339} 11/07/2021 09:38:44 - INFO - __main__ - Step 88482: {'lr': 0.0001845386301026365, 'samples': 16988544, 'steps': 88481, 'loss/train': 1.5257177352905273} 11/07/2021 09:38:45 - INFO - __main__ - Step 88483: {'lr': 0.0001845335085263576, 'samples': 16988736, 'steps': 88482, 'loss/train': 1.3585728406906128} 11/07/2021 09:38:45 - INFO - __main__ - Step 88484: {'lr': 0.00018452838697957685, 'samples': 16988928, 'steps': 88483, 'loss/train': 1.272176742553711} 11/07/2021 09:38:46 - INFO - __main__ - Step 88485: {'lr': 0.00018452326546229673, 'samples': 16989120, 'steps': 88484, 'loss/train': 1.6832618713378906} 11/07/2021 09:38:47 - INFO - __main__ - Step 88486: {'lr': 0.0001845181439745193, 'samples': 16989312, 'steps': 88485, 'loss/train': 1.4351284503936768} 11/07/2021 09:38:47 - INFO - __main__ - Step 88487: {'lr': 0.000184513022516247, 'samples': 16989504, 'steps': 88486, 'loss/train': 1.2184916734695435} 11/07/2021 09:38:47 - INFO - __main__ - Step 88488: {'lr': 0.00018450790108748212, 'samples': 16989696, 'steps': 88487, 'loss/train': 1.0015777349472046} 11/07/2021 09:38:48 - INFO - __main__ - Step 88489: {'lr': 0.00018450277968822692, 'samples': 16989888, 'steps': 88488, 'loss/train': 1.5648291110992432} 11/07/2021 09:38:48 - INFO - __main__ - Step 88490: {'lr': 0.0001844976583184838, 'samples': 16990080, 'steps': 88489, 'loss/train': 1.399457335472107} 11/07/2021 09:38:49 - INFO - __main__ - Step 88491: {'lr': 0.00018449253697825501, 'samples': 16990272, 'steps': 88490, 'loss/train': 1.3387300968170166} 11/07/2021 09:38:49 - INFO - __main__ - Step 88492: {'lr': 0.0001844874156675429, 'samples': 16990464, 'steps': 88491, 'loss/train': 0.6483622193336487} 11/07/2021 09:38:50 - INFO - __main__ - Step 88493: {'lr': 0.00018448229438634974, 'samples': 16990656, 'steps': 88492, 'loss/train': 1.0893537998199463} 11/07/2021 09:38:50 - INFO - __main__ - Step 88494: {'lr': 0.00018447717313467785, 'samples': 16990848, 'steps': 88493, 'loss/train': 1.3894472122192383} 11/07/2021 09:38:50 - INFO - __main__ - Step 88495: {'lr': 0.00018447205191252954, 'samples': 16991040, 'steps': 88494, 'loss/train': 1.2234275341033936} 11/07/2021 09:38:51 - INFO - __main__ - Step 88496: {'lr': 0.0001844669307199071, 'samples': 16991232, 'steps': 88495, 'loss/train': 0.19110316038131714} 11/07/2021 09:38:52 - INFO - __main__ - Step 88497: {'lr': 0.00018446180955681283, 'samples': 16991424, 'steps': 88496, 'loss/train': 1.4250075817108154} 11/07/2021 09:38:52 - INFO - __main__ - Step 88498: {'lr': 0.00018445668842324918, 'samples': 16991616, 'steps': 88497, 'loss/train': 1.6491639614105225} 11/07/2021 09:38:53 - INFO - __main__ - Step 88499: {'lr': 0.00018445156731921821, 'samples': 16991808, 'steps': 88498, 'loss/train': 1.082750678062439} 11/07/2021 09:38:53 - INFO - __main__ - Step 88500: {'lr': 0.0001844464462447224, 'samples': 16992000, 'steps': 88499, 'loss/train': 1.0032827854156494} 11/07/2021 09:38:54 - INFO - __main__ - Step 88501: {'lr': 0.000184441325199764, 'samples': 16992192, 'steps': 88500, 'loss/train': 1.7512271404266357} 11/07/2021 09:38:54 - INFO - __main__ - Step 88502: {'lr': 0.00018443620418434525, 'samples': 16992384, 'steps': 88501, 'loss/train': 1.4692357778549194} 11/07/2021 09:38:55 - INFO - __main__ - Step 88503: {'lr': 0.00018443108319846863, 'samples': 16992576, 'steps': 88502, 'loss/train': 1.2375547885894775} 11/07/2021 09:38:55 - INFO - __main__ - Step 88504: {'lr': 0.0001844259622421363, 'samples': 16992768, 'steps': 88503, 'loss/train': 1.5322128534317017} 11/07/2021 09:38:55 - INFO - __main__ - Step 88505: {'lr': 0.00018442084131535064, 'samples': 16992960, 'steps': 88504, 'loss/train': 1.405016303062439} 11/07/2021 09:38:57 - INFO - __main__ - Step 88506: {'lr': 0.00018441572041811395, 'samples': 16993152, 'steps': 88505, 'loss/train': 1.7145400047302246} 11/07/2021 09:38:57 - INFO - __main__ - Step 88507: {'lr': 0.0001844105995504285, 'samples': 16993344, 'steps': 88506, 'loss/train': 1.3238133192062378} 11/07/2021 09:38:58 - INFO - __main__ - Step 88508: {'lr': 0.00018440547871229662, 'samples': 16993536, 'steps': 88507, 'loss/train': 1.4538394212722778} 11/07/2021 09:38:58 - INFO - __main__ - Step 88509: {'lr': 0.0001844003579037206, 'samples': 16993728, 'steps': 88508, 'loss/train': 0.8604137897491455} 11/07/2021 09:38:58 - INFO - __main__ - Step 88510: {'lr': 0.0001843952371247028, 'samples': 16993920, 'steps': 88509, 'loss/train': 1.5977295637130737} 11/07/2021 09:38:59 - INFO - __main__ - Step 88511: {'lr': 0.00018439011637524556, 'samples': 16994112, 'steps': 88510, 'loss/train': 1.3808708190917969} 11/07/2021 09:39:00 - INFO - __main__ - Step 88512: {'lr': 0.0001843849956553511, 'samples': 16994304, 'steps': 88511, 'loss/train': 2.0168192386627197} 11/07/2021 09:39:00 - INFO - __main__ - Step 88513: {'lr': 0.00018437987496502166, 'samples': 16994496, 'steps': 88512, 'loss/train': 1.6538176536560059} 11/07/2021 09:39:01 - INFO - __main__ - Step 88514: {'lr': 0.0001843747543042597, 'samples': 16994688, 'steps': 88513, 'loss/train': 1.5164400339126587} 11/07/2021 09:39:01 - INFO - __main__ - Step 88515: {'lr': 0.00018436963367306742, 'samples': 16994880, 'steps': 88514, 'loss/train': 1.7160227298736572} 11/07/2021 09:39:01 - INFO - __main__ - Step 88516: {'lr': 0.00018436451307144718, 'samples': 16995072, 'steps': 88515, 'loss/train': 1.3470962047576904} 11/07/2021 09:39:02 - INFO - __main__ - Step 88517: {'lr': 0.00018435939249940132, 'samples': 16995264, 'steps': 88516, 'loss/train': 0.9554969668388367} 11/07/2021 09:39:02 - INFO - __main__ - Step 88518: {'lr': 0.00018435427195693206, 'samples': 16995456, 'steps': 88517, 'loss/train': 0.9523103833198547} 11/07/2021 09:39:03 - INFO - __main__ - Step 88519: {'lr': 0.00018434915144404173, 'samples': 16995648, 'steps': 88518, 'loss/train': 1.6815974712371826} 11/07/2021 09:39:04 - INFO - __main__ - Step 88520: {'lr': 0.0001843440309607327, 'samples': 16995840, 'steps': 88519, 'loss/train': 1.5851284265518188} 11/07/2021 09:39:04 - INFO - __main__ - Step 88521: {'lr': 0.00018433891050700723, 'samples': 16996032, 'steps': 88520, 'loss/train': 1.4956345558166504} 11/07/2021 09:39:04 - INFO - __main__ - Step 88522: {'lr': 0.00018433379008286769, 'samples': 16996224, 'steps': 88521, 'loss/train': 1.7970043420791626} 11/07/2021 09:39:05 - INFO - __main__ - Step 88523: {'lr': 0.00018432866968831624, 'samples': 16996416, 'steps': 88522, 'loss/train': 1.5869178771972656} 11/07/2021 09:39:06 - INFO - __main__ - Step 88524: {'lr': 0.00018432354932335532, 'samples': 16996608, 'steps': 88523, 'loss/train': 0.8826991319656372} 11/07/2021 09:39:06 - INFO - __main__ - Step 88525: {'lr': 0.00018431842898798724, 'samples': 16996800, 'steps': 88524, 'loss/train': 0.9346773624420166} 11/07/2021 09:39:06 - INFO - __main__ - Step 88526: {'lr': 0.00018431330868221422, 'samples': 16996992, 'steps': 88525, 'loss/train': 1.9264640808105469} 11/07/2021 09:39:07 - INFO - __main__ - Step 88527: {'lr': 0.00018430818840603857, 'samples': 16997184, 'steps': 88526, 'loss/train': 0.5022324919700623} 11/07/2021 09:39:07 - INFO - __main__ - Step 88528: {'lr': 0.0001843030681594627, 'samples': 16997376, 'steps': 88527, 'loss/train': 1.0151374340057373} 11/07/2021 09:39:08 - INFO - __main__ - Step 88529: {'lr': 0.0001842979479424888, 'samples': 16997568, 'steps': 88528, 'loss/train': 1.4308152198791504} 11/07/2021 09:39:08 - INFO - __main__ - Step 88530: {'lr': 0.00018429282775511924, 'samples': 16997760, 'steps': 88529, 'loss/train': 1.1609441041946411} 11/07/2021 09:39:09 - INFO - __main__ - Step 88531: {'lr': 0.00018428770759735633, 'samples': 16997952, 'steps': 88530, 'loss/train': 1.3974090814590454} 11/07/2021 09:39:09 - INFO - __main__ - Step 88532: {'lr': 0.00018428258746920235, 'samples': 16998144, 'steps': 88531, 'loss/train': 1.438910722732544} 11/07/2021 09:39:10 - INFO - __main__ - Step 88533: {'lr': 0.0001842774673706597, 'samples': 16998336, 'steps': 88532, 'loss/train': 1.2784901857376099} 11/07/2021 09:39:11 - INFO - __main__ - Step 88534: {'lr': 0.00018427234730173053, 'samples': 16998528, 'steps': 88533, 'loss/train': 1.68492591381073} 11/07/2021 09:39:11 - INFO - __main__ - Step 88535: {'lr': 0.00018426722726241725, 'samples': 16998720, 'steps': 88534, 'loss/train': 0.5242835283279419} 11/07/2021 09:39:11 - INFO - __main__ - Step 88536: {'lr': 0.00018426210725272214, 'samples': 16998912, 'steps': 88535, 'loss/train': 5.618417263031006} 11/07/2021 09:39:12 - INFO - __main__ - Step 88537: {'lr': 0.00018425698727264747, 'samples': 16999104, 'steps': 88536, 'loss/train': 1.1304223537445068} 11/07/2021 09:39:12 - INFO - __main__ - Step 88538: {'lr': 0.0001842518673221956, 'samples': 16999296, 'steps': 88537, 'loss/train': 1.4731597900390625} 11/07/2021 09:39:12 - INFO - __main__ - Step 88539: {'lr': 0.00018424674740136893, 'samples': 16999488, 'steps': 88538, 'loss/train': 1.5392756462097168} 11/07/2021 09:39:13 - INFO - __main__ - Step 88540: {'lr': 0.00018424162751016953, 'samples': 16999680, 'steps': 88539, 'loss/train': 0.9532517194747925} 11/07/2021 09:39:14 - INFO - __main__ - Step 88541: {'lr': 0.0001842365076485999, 'samples': 16999872, 'steps': 88540, 'loss/train': 1.2676869630813599} 11/07/2021 09:39:14 - INFO - __main__ - Step 88542: {'lr': 0.00018423138781666225, 'samples': 17000064, 'steps': 88541, 'loss/train': 1.2276661396026611} 11/07/2021 09:39:14 - INFO - __main__ - Step 88543: {'lr': 0.00018422626801435895, 'samples': 17000256, 'steps': 88542, 'loss/train': 1.5505521297454834} 11/07/2021 09:39:15 - INFO - __main__ - Step 88544: {'lr': 0.00018422114824169234, 'samples': 17000448, 'steps': 88543, 'loss/train': 1.4202955961227417} 11/07/2021 09:39:16 - INFO - __main__ - Step 88545: {'lr': 0.0001842160284986646, 'samples': 17000640, 'steps': 88544, 'loss/train': 1.3367465734481812} 11/07/2021 09:39:16 - INFO - __main__ - Step 88546: {'lr': 0.00018421090878527807, 'samples': 17000832, 'steps': 88545, 'loss/train': 2.009319305419922} 11/07/2021 09:39:16 - INFO - __main__ - Step 88547: {'lr': 0.00018420578910153512, 'samples': 17001024, 'steps': 88546, 'loss/train': 1.0803234577178955} 11/07/2021 09:39:17 - INFO - __main__ - Step 88548: {'lr': 0.00018420066944743803, 'samples': 17001216, 'steps': 88547, 'loss/train': 1.1129231452941895} 11/07/2021 09:39:17 - INFO - __main__ - Step 88549: {'lr': 0.0001841955498229891, 'samples': 17001408, 'steps': 88548, 'loss/train': 2.3872783184051514} 11/07/2021 09:39:18 - INFO - __main__ - Step 88550: {'lr': 0.0001841904302281906, 'samples': 17001600, 'steps': 88549, 'loss/train': 1.465157389640808} 11/07/2021 09:39:19 - INFO - __main__ - Step 88551: {'lr': 0.00018418531066304492, 'samples': 17001792, 'steps': 88550, 'loss/train': 1.6580020189285278} 11/07/2021 09:39:19 - INFO - __main__ - Step 88552: {'lr': 0.00018418019112755436, 'samples': 17001984, 'steps': 88551, 'loss/train': 1.2314674854278564} 11/07/2021 09:39:19 - INFO - __main__ - Step 88553: {'lr': 0.00018417507162172116, 'samples': 17002176, 'steps': 88552, 'loss/train': 0.9461981654167175} 11/07/2021 09:39:20 - INFO - __main__ - Step 88554: {'lr': 0.0001841699521455476, 'samples': 17002368, 'steps': 88553, 'loss/train': 1.2720564603805542} 11/07/2021 09:39:21 - INFO - __main__ - Step 88555: {'lr': 0.00018416483269903617, 'samples': 17002560, 'steps': 88554, 'loss/train': 1.0376369953155518} 11/07/2021 09:39:21 - INFO - __main__ - Step 88556: {'lr': 0.00018415971328218894, 'samples': 17002752, 'steps': 88555, 'loss/train': 0.3499837815761566} 11/07/2021 09:39:21 - INFO - __main__ - Step 88557: {'lr': 0.00018415459389500835, 'samples': 17002944, 'steps': 88556, 'loss/train': 1.607961893081665} 11/07/2021 09:39:22 - INFO - __main__ - Step 88558: {'lr': 0.0001841494745374967, 'samples': 17003136, 'steps': 88557, 'loss/train': 1.4706878662109375} 11/07/2021 09:39:22 - INFO - __main__ - Step 88559: {'lr': 0.00018414435520965625, 'samples': 17003328, 'steps': 88558, 'loss/train': 1.3619301319122314} 11/07/2021 09:39:22 - INFO - __main__ - Step 88560: {'lr': 0.00018413923591148934, 'samples': 17003520, 'steps': 88559, 'loss/train': 1.6857882738113403} 11/07/2021 09:39:23 - INFO - __main__ - Step 88561: {'lr': 0.0001841341166429983, 'samples': 17003712, 'steps': 88560, 'loss/train': 0.8611705303192139} 11/07/2021 09:39:24 - INFO - __main__ - Step 88562: {'lr': 0.0001841289974041854, 'samples': 17003904, 'steps': 88561, 'loss/train': 1.3153784275054932} 11/07/2021 09:39:24 - INFO - __main__ - Step 88563: {'lr': 0.00018412387819505293, 'samples': 17004096, 'steps': 88562, 'loss/train': 1.0365360975265503} 11/07/2021 09:39:24 - INFO - __main__ - Step 88564: {'lr': 0.00018411875901560326, 'samples': 17004288, 'steps': 88563, 'loss/train': 0.8896458148956299} 11/07/2021 09:39:25 - INFO - __main__ - Step 88565: {'lr': 0.0001841136398658387, 'samples': 17004480, 'steps': 88564, 'loss/train': 1.6325820684432983} 11/07/2021 09:39:26 - INFO - __main__ - Step 88566: {'lr': 0.00018410852074576153, 'samples': 17004672, 'steps': 88565, 'loss/train': 1.1864014863967896} 11/07/2021 09:39:26 - INFO - __main__ - Step 88567: {'lr': 0.00018410340165537397, 'samples': 17004864, 'steps': 88566, 'loss/train': 0.8537512421607971} 11/07/2021 09:39:27 - INFO - __main__ - Step 88568: {'lr': 0.00018409828259467842, 'samples': 17005056, 'steps': 88567, 'loss/train': 0.998667299747467} 11/07/2021 09:39:27 - INFO - __main__ - Step 88569: {'lr': 0.00018409316356367717, 'samples': 17005248, 'steps': 88568, 'loss/train': 1.3089576959609985} 11/07/2021 09:39:27 - INFO - __main__ - Step 88570: {'lr': 0.00018408804456237249, 'samples': 17005440, 'steps': 88569, 'loss/train': 1.649664044380188} 11/07/2021 09:39:28 - INFO - __main__ - Step 88571: {'lr': 0.00018408292559076676, 'samples': 17005632, 'steps': 88570, 'loss/train': 1.34999680519104} 11/07/2021 09:39:29 - INFO - __main__ - Step 88572: {'lr': 0.0001840778066488622, 'samples': 17005824, 'steps': 88571, 'loss/train': 1.314579725265503} 11/07/2021 09:39:29 - INFO - __main__ - Step 88573: {'lr': 0.00018407268773666118, 'samples': 17006016, 'steps': 88572, 'loss/train': 0.4749293923377991} 11/07/2021 09:39:30 - INFO - __main__ - Step 88574: {'lr': 0.00018406756885416603, 'samples': 17006208, 'steps': 88573, 'loss/train': 1.115103006362915} 11/07/2021 09:39:30 - INFO - __main__ - Step 88575: {'lr': 0.000184062450001379, 'samples': 17006400, 'steps': 88574, 'loss/train': 0.4632883667945862} 11/07/2021 09:39:31 - INFO - __main__ - Step 88576: {'lr': 0.00018405733117830237, 'samples': 17006592, 'steps': 88575, 'loss/train': 1.2918528318405151} 11/07/2021 09:39:31 - INFO - __main__ - Step 88577: {'lr': 0.00018405221238493853, 'samples': 17006784, 'steps': 88576, 'loss/train': 1.562997817993164} 11/07/2021 09:39:32 - INFO - __main__ - Step 88578: {'lr': 0.00018404709362128974, 'samples': 17006976, 'steps': 88577, 'loss/train': 1.3409457206726074} 11/07/2021 09:39:32 - INFO - __main__ - Step 88579: {'lr': 0.00018404197488735842, 'samples': 17007168, 'steps': 88578, 'loss/train': 1.4005522727966309} 11/07/2021 09:39:32 - INFO - __main__ - Step 88580: {'lr': 0.00018403685618314665, 'samples': 17007360, 'steps': 88579, 'loss/train': 1.4761453866958618} 11/07/2021 09:39:33 - INFO - __main__ - Step 88581: {'lr': 0.00018403173750865685, 'samples': 17007552, 'steps': 88580, 'loss/train': 1.1430952548980713} 11/07/2021 09:39:34 - INFO - __main__ - Step 88582: {'lr': 0.00018402661886389132, 'samples': 17007744, 'steps': 88581, 'loss/train': 1.3737424612045288} 11/07/2021 09:39:34 - INFO - __main__ - Step 88583: {'lr': 0.00018402150024885238, 'samples': 17007936, 'steps': 88582, 'loss/train': 0.9494138360023499} 11/07/2021 09:39:34 - INFO - __main__ - Step 88584: {'lr': 0.00018401638166354236, 'samples': 17008128, 'steps': 88583, 'loss/train': 0.7594572305679321} 11/07/2021 09:39:35 - INFO - __main__ - Step 88585: {'lr': 0.00018401126310796354, 'samples': 17008320, 'steps': 88584, 'loss/train': 1.5199286937713623} 11/07/2021 09:39:35 - INFO - __main__ - Step 88586: {'lr': 0.00018400614458211824, 'samples': 17008512, 'steps': 88585, 'loss/train': 1.3370518684387207} 11/07/2021 09:39:37 - INFO - __main__ - Step 88587: {'lr': 0.00018400102608600872, 'samples': 17008704, 'steps': 88586, 'loss/train': 1.8244723081588745} 11/07/2021 09:39:37 - INFO - __main__ - Step 88588: {'lr': 0.0001839959076196373, 'samples': 17008896, 'steps': 88587, 'loss/train': 0.7338974475860596} 11/07/2021 09:39:37 - INFO - __main__ - Step 88589: {'lr': 0.00018399078918300636, 'samples': 17009088, 'steps': 88588, 'loss/train': 1.3140606880187988} 11/07/2021 09:39:38 - INFO - __main__ - Step 88590: {'lr': 0.00018398567077611812, 'samples': 17009280, 'steps': 88589, 'loss/train': 1.3109285831451416} 11/07/2021 09:39:38 - INFO - __main__ - Step 88591: {'lr': 0.00018398055239897493, 'samples': 17009472, 'steps': 88590, 'loss/train': 1.7362300157546997} 11/07/2021 09:39:38 - INFO - __main__ - Step 88592: {'lr': 0.00018397543405157906, 'samples': 17009664, 'steps': 88591, 'loss/train': 1.667948842048645} 11/07/2021 09:39:39 - INFO - __main__ - Step 88593: {'lr': 0.00018397031573393296, 'samples': 17009856, 'steps': 88592, 'loss/train': 1.4904553890228271} 11/07/2021 09:39:40 - INFO - __main__ - Step 88594: {'lr': 0.00018396519744603873, 'samples': 17010048, 'steps': 88593, 'loss/train': 1.1276710033416748} 11/07/2021 09:39:40 - INFO - __main__ - Step 88595: {'lr': 0.00018396007918789875, 'samples': 17010240, 'steps': 88594, 'loss/train': 1.144500732421875} 11/07/2021 09:39:40 - INFO - __main__ - Step 88596: {'lr': 0.00018395496095951537, 'samples': 17010432, 'steps': 88595, 'loss/train': 1.4931226968765259} 11/07/2021 09:39:41 - INFO - __main__ - Step 88597: {'lr': 0.00018394984276089084, 'samples': 17010624, 'steps': 88596, 'loss/train': 1.3565641641616821} 11/07/2021 09:39:42 - INFO - __main__ - Step 88598: {'lr': 0.00018394472459202742, 'samples': 17010816, 'steps': 88597, 'loss/train': 1.1786625385284424} 11/07/2021 09:39:42 - INFO - __main__ - Step 88599: {'lr': 0.0001839396064529276, 'samples': 17011008, 'steps': 88598, 'loss/train': 1.1086889505386353} 11/07/2021 09:39:43 - INFO - __main__ - Step 88600: {'lr': 0.0001839344883435935, 'samples': 17011200, 'steps': 88599, 'loss/train': 1.6598842144012451} 11/07/2021 09:39:43 - INFO - __main__ - Step 88601: {'lr': 0.00018392937026402758, 'samples': 17011392, 'steps': 88600, 'loss/train': 1.6777918338775635} 11/07/2021 09:39:43 - INFO - __main__ - Step 88602: {'lr': 0.00018392425221423197, 'samples': 17011584, 'steps': 88601, 'loss/train': 1.3959534168243408} 11/07/2021 09:39:46 - INFO - __main__ - Step 88603: {'lr': 0.00018391913419420913, 'samples': 17011776, 'steps': 88602, 'loss/train': 1.4074763059616089} 11/07/2021 09:39:46 - INFO - __main__ - Step 88604: {'lr': 0.00018391401620396127, 'samples': 17011968, 'steps': 88603, 'loss/train': 1.5038386583328247} 11/07/2021 09:39:46 - INFO - __main__ - Step 88605: {'lr': 0.00018390889824349078, 'samples': 17012160, 'steps': 88604, 'loss/train': 1.4218683242797852} 11/07/2021 09:39:47 - INFO - __main__ - Step 88606: {'lr': 0.0001839037803128, 'samples': 17012352, 'steps': 88605, 'loss/train': 1.9481332302093506} 11/07/2021 09:39:47 - INFO - __main__ - Step 88607: {'lr': 0.00018389866241189107, 'samples': 17012544, 'steps': 88606, 'loss/train': 0.3122609555721283} 11/07/2021 09:39:47 - INFO - __main__ - Step 88608: {'lr': 0.00018389354454076634, 'samples': 17012736, 'steps': 88607, 'loss/train': 0.32549452781677246} 11/07/2021 09:39:48 - INFO - __main__ - Step 88609: {'lr': 0.0001838884266994282, 'samples': 17012928, 'steps': 88608, 'loss/train': 1.483406901359558} 11/07/2021 09:39:49 - INFO - __main__ - Step 88610: {'lr': 0.0001838833088878789, 'samples': 17013120, 'steps': 88609, 'loss/train': 1.46738862991333} 11/07/2021 09:39:49 - INFO - __main__ - Step 88611: {'lr': 0.00018387819110612076, 'samples': 17013312, 'steps': 88610, 'loss/train': 1.5120806694030762} 11/07/2021 09:39:49 - INFO - __main__ - Step 88612: {'lr': 0.0001838730733541561, 'samples': 17013504, 'steps': 88611, 'loss/train': 1.5082281827926636} 11/07/2021 09:39:50 - INFO - __main__ - Step 88613: {'lr': 0.00018386795563198722, 'samples': 17013696, 'steps': 88612, 'loss/train': 1.6344857215881348} 11/07/2021 09:39:51 - INFO - __main__ - Step 88614: {'lr': 0.0001838628379396164, 'samples': 17013888, 'steps': 88613, 'loss/train': 1.7204383611679077} 11/07/2021 09:39:51 - INFO - __main__ - Step 88615: {'lr': 0.00018385772027704596, 'samples': 17014080, 'steps': 88614, 'loss/train': 1.7624660730361938} 11/07/2021 09:39:51 - INFO - __main__ - Step 88616: {'lr': 0.00018385260264427823, 'samples': 17014272, 'steps': 88615, 'loss/train': 1.6938714981079102} 11/07/2021 09:39:52 - INFO - __main__ - Step 88617: {'lr': 0.00018384748504131547, 'samples': 17014464, 'steps': 88616, 'loss/train': 1.4663879871368408} 11/07/2021 09:39:52 - INFO - __main__ - Step 88618: {'lr': 0.00018384236746816002, 'samples': 17014656, 'steps': 88617, 'loss/train': 1.199231743812561} 11/07/2021 09:39:53 - INFO - __main__ - Step 88619: {'lr': 0.0001838372499248143, 'samples': 17014848, 'steps': 88618, 'loss/train': 1.2636938095092773} 11/07/2021 09:39:53 - INFO - __main__ - Step 88620: {'lr': 0.00018383213241128038, 'samples': 17015040, 'steps': 88619, 'loss/train': 1.3855681419372559} 11/07/2021 09:39:54 - INFO - __main__ - Step 88621: {'lr': 0.00018382701492756067, 'samples': 17015232, 'steps': 88620, 'loss/train': 1.5856475830078125} 11/07/2021 09:39:54 - INFO - __main__ - Step 88622: {'lr': 0.00018382189747365748, 'samples': 17015424, 'steps': 88621, 'loss/train': 1.0582709312438965} 11/07/2021 09:39:54 - INFO - __main__ - Step 88623: {'lr': 0.00018381678004957314, 'samples': 17015616, 'steps': 88622, 'loss/train': 1.4446624517440796} 11/07/2021 09:39:55 - INFO - __main__ - Step 88624: {'lr': 0.00018381166265530994, 'samples': 17015808, 'steps': 88623, 'loss/train': 1.5955348014831543} 11/07/2021 09:39:56 - INFO - __main__ - Step 88625: {'lr': 0.0001838065452908702, 'samples': 17016000, 'steps': 88624, 'loss/train': 1.22385573387146} 11/07/2021 09:39:56 - INFO - __main__ - Step 88626: {'lr': 0.00018380142795625616, 'samples': 17016192, 'steps': 88625, 'loss/train': 1.2104953527450562} 11/07/2021 09:39:56 - INFO - __main__ - Step 88627: {'lr': 0.00018379631065147022, 'samples': 17016384, 'steps': 88626, 'loss/train': 1.405657172203064} 11/07/2021 09:39:57 - INFO - __main__ - Step 88628: {'lr': 0.00018379119337651463, 'samples': 17016576, 'steps': 88627, 'loss/train': 1.3543994426727295} 11/07/2021 09:39:57 - INFO - __main__ - Step 88629: {'lr': 0.00018378607613139168, 'samples': 17016768, 'steps': 88628, 'loss/train': 1.3711979389190674} 11/07/2021 09:39:58 - INFO - __main__ - Step 88630: {'lr': 0.00018378095891610373, 'samples': 17016960, 'steps': 88629, 'loss/train': 1.4846917390823364} 11/07/2021 09:39:59 - INFO - __main__ - Step 88631: {'lr': 0.00018377584173065304, 'samples': 17017152, 'steps': 88630, 'loss/train': 1.4011098146438599} 11/07/2021 09:39:59 - INFO - __main__ - Step 88632: {'lr': 0.00018377072457504196, 'samples': 17017344, 'steps': 88631, 'loss/train': 1.2714293003082275} 11/07/2021 09:39:59 - INFO - __main__ - Step 88633: {'lr': 0.00018376560744927283, 'samples': 17017536, 'steps': 88632, 'loss/train': 1.3578335046768188} 11/07/2021 09:40:00 - INFO - __main__ - Step 88634: {'lr': 0.0001837604903533478, 'samples': 17017728, 'steps': 88633, 'loss/train': 2.028186082839966} 11/07/2021 09:40:01 - INFO - __main__ - Step 88635: {'lr': 0.00018375537328726933, 'samples': 17017920, 'steps': 88634, 'loss/train': 1.5446933507919312} 11/07/2021 09:40:02 - INFO - __main__ - Step 88636: {'lr': 0.00018375025625103961, 'samples': 17018112, 'steps': 88635, 'loss/train': 1.3245985507965088} 11/07/2021 09:40:02 - INFO - __main__ - Step 88637: {'lr': 0.00018374513924466102, 'samples': 17018304, 'steps': 88636, 'loss/train': 1.5957345962524414} 11/07/2021 09:40:02 - INFO - __main__ - Step 88638: {'lr': 0.00018374002226813585, 'samples': 17018496, 'steps': 88637, 'loss/train': 1.3051598072052002} 11/07/2021 09:40:03 - INFO - __main__ - Step 88639: {'lr': 0.00018373490532146638, 'samples': 17018688, 'steps': 88638, 'loss/train': 1.9265044927597046} 11/07/2021 09:40:03 - INFO - __main__ - Step 88640: {'lr': 0.00018372978840465497, 'samples': 17018880, 'steps': 88639, 'loss/train': 1.8299630880355835} 11/07/2021 09:40:03 - INFO - __main__ - Step 88641: {'lr': 0.0001837246715177039, 'samples': 17019072, 'steps': 88640, 'loss/train': 1.0072118043899536} 11/07/2021 09:40:04 - INFO - __main__ - Step 88642: {'lr': 0.00018371955466061545, 'samples': 17019264, 'steps': 88641, 'loss/train': 0.48503658175468445} 11/07/2021 09:40:05 - INFO - __main__ - Step 88643: {'lr': 0.00018371443783339193, 'samples': 17019456, 'steps': 88642, 'loss/train': 1.4942214488983154} 11/07/2021 09:40:05 - INFO - __main__ - Step 88644: {'lr': 0.0001837093210360357, 'samples': 17019648, 'steps': 88643, 'loss/train': 1.5926278829574585} 11/07/2021 09:40:05 - INFO - __main__ - Step 88645: {'lr': 0.00018370420426854904, 'samples': 17019840, 'steps': 88644, 'loss/train': 1.6829620599746704} 11/07/2021 09:40:06 - INFO - __main__ - Step 88646: {'lr': 0.00018369908753093427, 'samples': 17020032, 'steps': 88645, 'loss/train': 0.8969227075576782} 11/07/2021 09:40:07 - INFO - __main__ - Step 88647: {'lr': 0.0001836939708231936, 'samples': 17020224, 'steps': 88646, 'loss/train': 1.5054086446762085} 11/07/2021 09:40:07 - INFO - __main__ - Step 88648: {'lr': 0.00018368885414532944, 'samples': 17020416, 'steps': 88647, 'loss/train': 1.3920351266860962} 11/07/2021 09:40:08 - INFO - __main__ - Step 88649: {'lr': 0.000183683737497344, 'samples': 17020608, 'steps': 88648, 'loss/train': 1.0015106201171875} 11/07/2021 09:40:08 - INFO - __main__ - Step 88650: {'lr': 0.0001836786208792397, 'samples': 17020800, 'steps': 88649, 'loss/train': 1.1863325834274292} 11/07/2021 09:40:08 - INFO - __main__ - Step 88651: {'lr': 0.00018367350429101875, 'samples': 17020992, 'steps': 88650, 'loss/train': 1.1697204113006592} 11/07/2021 09:40:09 - INFO - __main__ - Step 88652: {'lr': 0.00018366838773268353, 'samples': 17021184, 'steps': 88651, 'loss/train': 1.4296513795852661} 11/07/2021 09:40:10 - INFO - __main__ - Step 88653: {'lr': 0.00018366327120423627, 'samples': 17021376, 'steps': 88652, 'loss/train': 1.9942373037338257} 11/07/2021 09:40:10 - INFO - __main__ - Step 88654: {'lr': 0.00018365815470567935, 'samples': 17021568, 'steps': 88653, 'loss/train': 1.3537414073944092} 11/07/2021 09:40:10 - INFO - __main__ - Step 88655: {'lr': 0.00018365303823701502, 'samples': 17021760, 'steps': 88654, 'loss/train': 1.1230101585388184} 11/07/2021 09:40:11 - INFO - __main__ - Step 88656: {'lr': 0.0001836479217982457, 'samples': 17021952, 'steps': 88655, 'loss/train': 2.090229034423828} 11/07/2021 09:40:11 - INFO - __main__ - Step 88657: {'lr': 0.0001836428053893735, 'samples': 17022144, 'steps': 88656, 'loss/train': 1.3974006175994873} 11/07/2021 09:40:12 - INFO - __main__ - Step 88658: {'lr': 0.00018363768901040085, 'samples': 17022336, 'steps': 88657, 'loss/train': 1.6127301454544067} 11/07/2021 09:40:13 - INFO - __main__ - Step 88659: {'lr': 0.00018363257266133004, 'samples': 17022528, 'steps': 88658, 'loss/train': 0.9325656294822693} 11/07/2021 09:40:13 - INFO - __main__ - Step 88660: {'lr': 0.00018362745634216337, 'samples': 17022720, 'steps': 88659, 'loss/train': 1.904715657234192} 11/07/2021 09:40:13 - INFO - __main__ - Step 88661: {'lr': 0.0001836223400529032, 'samples': 17022912, 'steps': 88660, 'loss/train': 1.8723822832107544} 11/07/2021 09:40:14 - INFO - __main__ - Step 88662: {'lr': 0.00018361722379355166, 'samples': 17023104, 'steps': 88661, 'loss/train': 1.6275242567062378} 11/07/2021 09:40:15 - INFO - __main__ - Step 88663: {'lr': 0.00018361210756411123, 'samples': 17023296, 'steps': 88662, 'loss/train': 1.099351167678833} 11/07/2021 09:40:15 - INFO - __main__ - Step 88664: {'lr': 0.00018360699136458418, 'samples': 17023488, 'steps': 88663, 'loss/train': 1.5281206369400024} 11/07/2021 09:40:16 - INFO - __main__ - Step 88665: {'lr': 0.00018360187519497276, 'samples': 17023680, 'steps': 88664, 'loss/train': 1.1323050260543823} 11/07/2021 09:40:16 - INFO - __main__ - Step 88666: {'lr': 0.00018359675905527933, 'samples': 17023872, 'steps': 88665, 'loss/train': 1.0862637758255005} 11/07/2021 09:40:16 - INFO - __main__ - Step 88667: {'lr': 0.00018359164294550623, 'samples': 17024064, 'steps': 88666, 'loss/train': 1.3923979997634888} 11/07/2021 09:40:17 - INFO - __main__ - Step 88668: {'lr': 0.00018358652686565564, 'samples': 17024256, 'steps': 88667, 'loss/train': 1.5476300716400146} 11/07/2021 09:40:18 - INFO - __main__ - Step 88669: {'lr': 0.00018358141081572992, 'samples': 17024448, 'steps': 88668, 'loss/train': 1.5043278932571411} 11/07/2021 09:40:18 - INFO - __main__ - Step 88670: {'lr': 0.00018357629479573146, 'samples': 17024640, 'steps': 88669, 'loss/train': 1.4598538875579834} 11/07/2021 09:40:18 - INFO - __main__ - Step 88671: {'lr': 0.00018357117880566244, 'samples': 17024832, 'steps': 88670, 'loss/train': 1.5007731914520264} 11/07/2021 09:40:19 - INFO - __main__ - Step 88672: {'lr': 0.00018356606284552525, 'samples': 17025024, 'steps': 88671, 'loss/train': 1.6427435874938965} 11/07/2021 09:40:20 - INFO - __main__ - Step 88673: {'lr': 0.00018356094691532218, 'samples': 17025216, 'steps': 88672, 'loss/train': 1.3112461566925049} 11/07/2021 09:40:20 - INFO - __main__ - Step 88674: {'lr': 0.00018355583101505553, 'samples': 17025408, 'steps': 88673, 'loss/train': 1.1358482837677002} 11/07/2021 09:40:20 - INFO - __main__ - Step 88675: {'lr': 0.00018355071514472755, 'samples': 17025600, 'steps': 88674, 'loss/train': 1.7165672779083252} 11/07/2021 09:40:21 - INFO - __main__ - Step 88676: {'lr': 0.00018354559930434057, 'samples': 17025792, 'steps': 88675, 'loss/train': 1.2074307203292847} 11/07/2021 09:40:21 - INFO - __main__ - Step 88677: {'lr': 0.00018354048349389696, 'samples': 17025984, 'steps': 88676, 'loss/train': 1.339699149131775} 11/07/2021 09:40:22 - INFO - __main__ - Step 88678: {'lr': 0.00018353536771339903, 'samples': 17026176, 'steps': 88677, 'loss/train': 1.6609535217285156} 11/07/2021 09:40:22 - INFO - __main__ - Step 88679: {'lr': 0.000183530251962849, 'samples': 17026368, 'steps': 88678, 'loss/train': 1.7018718719482422} 11/07/2021 09:40:23 - INFO - __main__ - Step 88680: {'lr': 0.0001835251362422492, 'samples': 17026560, 'steps': 88679, 'loss/train': 1.7389531135559082} 11/07/2021 09:40:23 - INFO - __main__ - Step 88681: {'lr': 0.00018352002055160193, 'samples': 17026752, 'steps': 88680, 'loss/train': 0.741227388381958} 11/07/2021 09:40:24 - INFO - __main__ - Step 88682: {'lr': 0.00018351490489090954, 'samples': 17026944, 'steps': 88681, 'loss/train': 1.5353991985321045} 11/07/2021 09:40:24 - INFO - __main__ - Step 88683: {'lr': 0.00018350978926017427, 'samples': 17027136, 'steps': 88682, 'loss/train': 1.3300423622131348} 11/07/2021 09:40:25 - INFO - __main__ - Step 88684: {'lr': 0.0001835046736593985, 'samples': 17027328, 'steps': 88683, 'loss/train': 1.2486246824264526} 11/07/2021 09:40:25 - INFO - __main__ - Step 88685: {'lr': 0.0001834995580885845, 'samples': 17027520, 'steps': 88684, 'loss/train': 1.8207252025604248} 11/07/2021 09:40:26 - INFO - __main__ - Step 88686: {'lr': 0.00018349444254773454, 'samples': 17027712, 'steps': 88685, 'loss/train': 2.0263168811798096} 11/07/2021 09:40:26 - INFO - __main__ - Step 88687: {'lr': 0.000183489327036851, 'samples': 17027904, 'steps': 88686, 'loss/train': 1.257797360420227} 11/07/2021 09:40:26 - INFO - __main__ - Step 88688: {'lr': 0.00018348421155593613, 'samples': 17028096, 'steps': 88687, 'loss/train': 0.8987569212913513} 11/07/2021 09:40:27 - INFO - __main__ - Step 88689: {'lr': 0.0001834790961049923, 'samples': 17028288, 'steps': 88688, 'loss/train': 1.3211768865585327} 11/07/2021 09:40:28 - INFO - __main__ - Step 88690: {'lr': 0.00018347398068402172, 'samples': 17028480, 'steps': 88689, 'loss/train': 1.695343017578125} 11/07/2021 09:40:28 - INFO - __main__ - Step 88691: {'lr': 0.0001834688652930267, 'samples': 17028672, 'steps': 88690, 'loss/train': 1.424452543258667} 11/07/2021 09:40:28 - INFO - __main__ - Step 88692: {'lr': 0.0001834637499320096, 'samples': 17028864, 'steps': 88691, 'loss/train': 1.5764827728271484} 11/07/2021 09:40:29 - INFO - __main__ - Step 88693: {'lr': 0.00018345863460097271, 'samples': 17029056, 'steps': 88692, 'loss/train': 0.6554007530212402} 11/07/2021 09:40:30 - INFO - __main__ - Step 88694: {'lr': 0.0001834535192999183, 'samples': 17029248, 'steps': 88693, 'loss/train': 1.4333573579788208} 11/07/2021 09:40:30 - INFO - __main__ - Step 88695: {'lr': 0.00018344840402884877, 'samples': 17029440, 'steps': 88694, 'loss/train': 1.5543947219848633} 11/07/2021 09:40:30 - INFO - __main__ - Step 88696: {'lr': 0.00018344328878776634, 'samples': 17029632, 'steps': 88695, 'loss/train': 0.9764851927757263} 11/07/2021 09:40:31 - INFO - __main__ - Step 88697: {'lr': 0.0001834381735766733, 'samples': 17029824, 'steps': 88696, 'loss/train': 1.124642252922058} 11/07/2021 09:40:31 - INFO - __main__ - Step 88698: {'lr': 0.000183433058395572, 'samples': 17030016, 'steps': 88697, 'loss/train': 1.8469140529632568} 11/07/2021 09:40:32 - INFO - __main__ - Step 88699: {'lr': 0.00018342794324446477, 'samples': 17030208, 'steps': 88698, 'loss/train': 0.8334084153175354} 11/07/2021 09:40:33 - INFO - __main__ - Step 88700: {'lr': 0.00018342282812335397, 'samples': 17030400, 'steps': 88699, 'loss/train': 1.757651448249817} 11/07/2021 09:40:33 - INFO - __main__ - Step 88701: {'lr': 0.0001834177130322417, 'samples': 17030592, 'steps': 88700, 'loss/train': 5.707924842834473} 11/07/2021 09:40:33 - INFO - __main__ - Step 88702: {'lr': 0.00018341259797113041, 'samples': 17030784, 'steps': 88701, 'loss/train': 1.598127841949463} 11/07/2021 09:40:34 - INFO - __main__ - Step 88703: {'lr': 0.00018340748294002235, 'samples': 17030976, 'steps': 88702, 'loss/train': 1.276092529296875} 11/07/2021 09:40:34 - INFO - __main__ - Step 88704: {'lr': 0.00018340236793891988, 'samples': 17031168, 'steps': 88703, 'loss/train': 1.0549566745758057} 11/07/2021 09:40:35 - INFO - __main__ - Step 88705: {'lr': 0.00018339725296782526, 'samples': 17031360, 'steps': 88704, 'loss/train': 1.4805219173431396} 11/07/2021 09:40:35 - INFO - __main__ - Step 88706: {'lr': 0.00018339213802674083, 'samples': 17031552, 'steps': 88705, 'loss/train': 1.3621273040771484} 11/07/2021 09:40:36 - INFO - __main__ - Step 88707: {'lr': 0.00018338702311566883, 'samples': 17031744, 'steps': 88706, 'loss/train': 1.4648001194000244} 11/07/2021 09:40:36 - INFO - __main__ - Step 88708: {'lr': 0.00018338190823461163, 'samples': 17031936, 'steps': 88707, 'loss/train': 5.514442443847656} 11/07/2021 09:40:36 - INFO - __main__ - Step 88709: {'lr': 0.0001833767933835715, 'samples': 17032128, 'steps': 88708, 'loss/train': 1.5563476085662842} 11/07/2021 09:40:37 - INFO - __main__ - Step 88710: {'lr': 0.00018337167856255076, 'samples': 17032320, 'steps': 88709, 'loss/train': 1.572489619255066} 11/07/2021 09:40:38 - INFO - __main__ - Step 88711: {'lr': 0.00018336656377155176, 'samples': 17032512, 'steps': 88710, 'loss/train': 1.2042189836502075} 11/07/2021 09:40:38 - INFO - __main__ - Step 88712: {'lr': 0.0001833614490105767, 'samples': 17032704, 'steps': 88711, 'loss/train': 1.3160371780395508} 11/07/2021 09:40:38 - INFO - __main__ - Step 88713: {'lr': 0.00018335633427962798, 'samples': 17032896, 'steps': 88712, 'loss/train': 0.7537205815315247} 11/07/2021 09:40:39 - INFO - __main__ - Step 88714: {'lr': 0.00018335121957870795, 'samples': 17033088, 'steps': 88713, 'loss/train': 1.724066972732544} 11/07/2021 09:40:39 - INFO - __main__ - Step 88715: {'lr': 0.00018334610490781874, 'samples': 17033280, 'steps': 88714, 'loss/train': 1.9190764427185059} 11/07/2021 09:40:40 - INFO - __main__ - Step 88716: {'lr': 0.00018334099026696274, 'samples': 17033472, 'steps': 88715, 'loss/train': 1.4686002731323242} 11/07/2021 09:40:41 - INFO - __main__ - Step 88717: {'lr': 0.00018333587565614226, 'samples': 17033664, 'steps': 88716, 'loss/train': 1.5062655210494995} 11/07/2021 09:40:41 - INFO - __main__ - Step 88718: {'lr': 0.00018333076107535963, 'samples': 17033856, 'steps': 88717, 'loss/train': 1.3084644079208374} 11/07/2021 09:40:41 - INFO - __main__ - Step 88719: {'lr': 0.0001833256465246171, 'samples': 17034048, 'steps': 88718, 'loss/train': 1.4852385520935059} 11/07/2021 09:40:42 - INFO - __main__ - Step 88720: {'lr': 0.00018332053200391702, 'samples': 17034240, 'steps': 88719, 'loss/train': 0.8295510411262512} 11/07/2021 09:40:43 - INFO - __main__ - Step 88721: {'lr': 0.00018331541751326168, 'samples': 17034432, 'steps': 88720, 'loss/train': 1.6553808450698853} 11/07/2021 09:40:43 - INFO - __main__ - Step 88722: {'lr': 0.00018331030305265337, 'samples': 17034624, 'steps': 88721, 'loss/train': 0.8539953231811523} 11/07/2021 09:40:43 - INFO - __main__ - Step 88723: {'lr': 0.0001833051886220944, 'samples': 17034816, 'steps': 88722, 'loss/train': 1.1295111179351807} 11/07/2021 09:40:44 - INFO - __main__ - Step 88724: {'lr': 0.0001833000742215871, 'samples': 17035008, 'steps': 88723, 'loss/train': 1.0647884607315063} 11/07/2021 09:40:44 - INFO - __main__ - Step 88725: {'lr': 0.00018329495985113377, 'samples': 17035200, 'steps': 88724, 'loss/train': 1.288020372390747} 11/07/2021 09:40:45 - INFO - __main__ - Step 88726: {'lr': 0.00018328984551073667, 'samples': 17035392, 'steps': 88725, 'loss/train': 1.5650252103805542} 11/07/2021 09:40:45 - INFO - __main__ - Step 88727: {'lr': 0.0001832847312003983, 'samples': 17035584, 'steps': 88726, 'loss/train': 1.2218046188354492} 11/07/2021 09:40:46 - INFO - __main__ - Step 88728: {'lr': 0.00018327961692012062, 'samples': 17035776, 'steps': 88727, 'loss/train': 1.6198453903198242} 11/07/2021 09:40:46 - INFO - __main__ - Step 88729: {'lr': 0.00018327450266990617, 'samples': 17035968, 'steps': 88728, 'loss/train': 1.5738955736160278} 11/07/2021 09:40:47 - INFO - __main__ - Step 88730: {'lr': 0.00018326938844975715, 'samples': 17036160, 'steps': 88729, 'loss/train': 1.47467839717865} 11/07/2021 09:40:47 - INFO - __main__ - Step 88731: {'lr': 0.00018326427425967596, 'samples': 17036352, 'steps': 88730, 'loss/train': 0.9452513456344604} 11/07/2021 09:40:48 - INFO - __main__ - Step 88732: {'lr': 0.00018325916009966488, 'samples': 17036544, 'steps': 88731, 'loss/train': 1.9887782335281372} 11/07/2021 09:40:48 - INFO - __main__ - Step 88733: {'lr': 0.00018325404596972612, 'samples': 17036736, 'steps': 88732, 'loss/train': 1.253568172454834} 11/07/2021 09:40:49 - INFO - __main__ - Step 88734: {'lr': 0.00018324893186986207, 'samples': 17036928, 'steps': 88733, 'loss/train': 1.2376370429992676} 11/07/2021 09:40:49 - INFO - __main__ - Step 88735: {'lr': 0.00018324381780007506, 'samples': 17037120, 'steps': 88734, 'loss/train': 1.0505558252334595} 11/07/2021 09:40:49 - INFO - __main__ - Step 88736: {'lr': 0.00018323870376036732, 'samples': 17037312, 'steps': 88735, 'loss/train': 1.449031114578247} 11/07/2021 09:40:50 - INFO - __main__ - Step 88737: {'lr': 0.00018323358975074123, 'samples': 17037504, 'steps': 88736, 'loss/train': 1.7548142671585083} 11/07/2021 09:40:51 - INFO - __main__ - Step 88738: {'lr': 0.00018322847577119906, 'samples': 17037696, 'steps': 88737, 'loss/train': 1.4596853256225586} 11/07/2021 09:40:51 - INFO - __main__ - Step 88739: {'lr': 0.00018322336182174308, 'samples': 17037888, 'steps': 88738, 'loss/train': 1.4753568172454834} 11/07/2021 09:40:51 - INFO - __main__ - Step 88740: {'lr': 0.0001832182479023757, 'samples': 17038080, 'steps': 88739, 'loss/train': 1.187155842781067} 11/07/2021 09:40:52 - INFO - __main__ - Step 88741: {'lr': 0.0001832131340130991, 'samples': 17038272, 'steps': 88740, 'loss/train': 1.4936070442199707} 11/07/2021 09:40:53 - INFO - __main__ - Step 88742: {'lr': 0.0001832080201539156, 'samples': 17038464, 'steps': 88741, 'loss/train': 1.086161732673645} 11/07/2021 09:40:53 - INFO - __main__ - Step 88743: {'lr': 0.00018320290632482754, 'samples': 17038656, 'steps': 88742, 'loss/train': 1.2821025848388672} 11/07/2021 09:40:54 - INFO - __main__ - Step 88744: {'lr': 0.00018319779252583718, 'samples': 17038848, 'steps': 88743, 'loss/train': 1.5297582149505615} 11/07/2021 09:40:54 - INFO - __main__ - Step 88745: {'lr': 0.00018319267875694693, 'samples': 17039040, 'steps': 88744, 'loss/train': 0.9024498462677002} 11/07/2021 09:40:54 - INFO - __main__ - Step 88746: {'lr': 0.00018318756501815896, 'samples': 17039232, 'steps': 88745, 'loss/train': 1.4658178091049194} 11/07/2021 09:40:55 - INFO - __main__ - Step 88747: {'lr': 0.0001831824513094757, 'samples': 17039424, 'steps': 88746, 'loss/train': 1.638080358505249} 11/07/2021 09:40:56 - INFO - __main__ - Step 88748: {'lr': 0.0001831773376308994, 'samples': 17039616, 'steps': 88747, 'loss/train': 1.4175060987472534} 11/07/2021 09:40:56 - INFO - __main__ - Step 88749: {'lr': 0.00018317222398243232, 'samples': 17039808, 'steps': 88748, 'loss/train': 1.048763632774353} 11/07/2021 09:40:56 - INFO - __main__ - Step 88750: {'lr': 0.00018316711036407685, 'samples': 17040000, 'steps': 88749, 'loss/train': 1.452291488647461} 11/07/2021 09:40:57 - INFO - __main__ - Step 88751: {'lr': 0.0001831619967758352, 'samples': 17040192, 'steps': 88750, 'loss/train': 1.2270535230636597} 11/07/2021 09:40:58 - INFO - __main__ - Step 88752: {'lr': 0.00018315688321770974, 'samples': 17040384, 'steps': 88751, 'loss/train': 0.9735932946205139} 11/07/2021 09:40:58 - INFO - __main__ - Step 88753: {'lr': 0.00018315176968970276, 'samples': 17040576, 'steps': 88752, 'loss/train': 1.7347629070281982} 11/07/2021 09:40:59 - INFO - __main__ - Step 88754: {'lr': 0.0001831466561918167, 'samples': 17040768, 'steps': 88753, 'loss/train': 1.4405596256256104} 11/07/2021 09:40:59 - INFO - __main__ - Step 88755: {'lr': 0.00018314154272405355, 'samples': 17040960, 'steps': 88754, 'loss/train': 1.275898814201355} 11/07/2021 09:40:59 - INFO - __main__ - Step 88756: {'lr': 0.00018313642928641583, 'samples': 17041152, 'steps': 88755, 'loss/train': 1.3692480325698853} 11/07/2021 09:41:00 - INFO - __main__ - Step 88757: {'lr': 0.0001831313158789058, 'samples': 17041344, 'steps': 88756, 'loss/train': 1.4605494737625122} 11/07/2021 09:41:01 - INFO - __main__ - Step 88758: {'lr': 0.00018312620250152578, 'samples': 17041536, 'steps': 88757, 'loss/train': 1.476635217666626} 11/07/2021 09:41:01 - INFO - __main__ - Step 88759: {'lr': 0.00018312108915427805, 'samples': 17041728, 'steps': 88758, 'loss/train': 0.3809349536895752} 11/07/2021 09:41:01 - INFO - __main__ - Step 88760: {'lr': 0.00018311597583716495, 'samples': 17041920, 'steps': 88759, 'loss/train': 1.4245915412902832} 11/07/2021 09:41:02 - INFO - __main__ - Step 88761: {'lr': 0.00018311086255018872, 'samples': 17042112, 'steps': 88760, 'loss/train': 1.4630917310714722} 11/07/2021 09:41:03 - INFO - __main__ - Step 88762: {'lr': 0.00018310574929335168, 'samples': 17042304, 'steps': 88761, 'loss/train': 1.3194429874420166} 11/07/2021 09:41:03 - INFO - __main__ - Step 88763: {'lr': 0.00018310063606665622, 'samples': 17042496, 'steps': 88762, 'loss/train': 1.3857126235961914} 11/07/2021 09:41:03 - INFO - __main__ - Step 88764: {'lr': 0.00018309552287010456, 'samples': 17042688, 'steps': 88763, 'loss/train': 1.0836310386657715} 11/07/2021 09:41:04 - INFO - __main__ - Step 88765: {'lr': 0.000183090409703699, 'samples': 17042880, 'steps': 88764, 'loss/train': 1.6936806440353394} 11/07/2021 09:41:04 - INFO - __main__ - Step 88766: {'lr': 0.0001830852965674419, 'samples': 17043072, 'steps': 88765, 'loss/train': 0.8751890063285828} 11/07/2021 09:41:04 - INFO - __main__ - Step 88767: {'lr': 0.00018308018346133563, 'samples': 17043264, 'steps': 88766, 'loss/train': 1.5105791091918945} 11/07/2021 09:41:06 - INFO - __main__ - Step 88768: {'lr': 0.0001830750703853823, 'samples': 17043456, 'steps': 88767, 'loss/train': 1.233298420906067} 11/07/2021 09:41:06 - INFO - __main__ - Step 88769: {'lr': 0.00018306995733958427, 'samples': 17043648, 'steps': 88768, 'loss/train': 1.0771381855010986} 11/07/2021 09:41:06 - INFO - __main__ - Step 88770: {'lr': 0.00018306484432394394, 'samples': 17043840, 'steps': 88769, 'loss/train': 1.3353813886642456} 11/07/2021 09:41:07 - INFO - __main__ - Step 88771: {'lr': 0.0001830597313384635, 'samples': 17044032, 'steps': 88770, 'loss/train': 1.3121347427368164} 11/07/2021 09:41:07 - INFO - __main__ - Step 88772: {'lr': 0.00018305461838314535, 'samples': 17044224, 'steps': 88771, 'loss/train': 1.5765732526779175} 11/07/2021 09:41:08 - INFO - __main__ - Step 88773: {'lr': 0.00018304950545799175, 'samples': 17044416, 'steps': 88772, 'loss/train': 1.2895839214324951} 11/07/2021 09:41:08 - INFO - __main__ - Step 88774: {'lr': 0.00018304439256300502, 'samples': 17044608, 'steps': 88773, 'loss/train': 1.6004743576049805} 11/07/2021 09:41:09 - INFO - __main__ - Step 88775: {'lr': 0.00018303927969818743, 'samples': 17044800, 'steps': 88774, 'loss/train': 1.0833264589309692} 11/07/2021 09:41:09 - INFO - __main__ - Step 88776: {'lr': 0.00018303416686354132, 'samples': 17044992, 'steps': 88775, 'loss/train': 1.23135507106781} 11/07/2021 09:41:09 - INFO - __main__ - Step 88777: {'lr': 0.000183029054059069, 'samples': 17045184, 'steps': 88776, 'loss/train': 1.418820858001709} 11/07/2021 09:41:10 - INFO - __main__ - Step 88778: {'lr': 0.00018302394128477274, 'samples': 17045376, 'steps': 88777, 'loss/train': 1.7560373544692993} 11/07/2021 09:41:11 - INFO - __main__ - Step 88779: {'lr': 0.00018301882854065483, 'samples': 17045568, 'steps': 88778, 'loss/train': 1.3179575204849243} 11/07/2021 09:41:11 - INFO - __main__ - Step 88780: {'lr': 0.00018301371582671766, 'samples': 17045760, 'steps': 88779, 'loss/train': 1.4400650262832642} 11/07/2021 09:41:11 - INFO - __main__ - Step 88781: {'lr': 0.00018300860314296352, 'samples': 17045952, 'steps': 88780, 'loss/train': 1.3621362447738647} 11/07/2021 09:41:12 - INFO - __main__ - Step 88782: {'lr': 0.00018300349048939457, 'samples': 17046144, 'steps': 88781, 'loss/train': 1.3558545112609863} 11/07/2021 09:41:13 - INFO - __main__ - Step 88783: {'lr': 0.00018299837786601324, 'samples': 17046336, 'steps': 88782, 'loss/train': 1.6104997396469116} 11/07/2021 09:41:13 - INFO - __main__ - Step 88784: {'lr': 0.0001829932652728218, 'samples': 17046528, 'steps': 88783, 'loss/train': 1.2669591903686523} 11/07/2021 09:41:14 - INFO - __main__ - Step 88785: {'lr': 0.00018298815270982257, 'samples': 17046720, 'steps': 88784, 'loss/train': 1.5074344873428345} 11/07/2021 09:41:14 - INFO - __main__ - Step 88786: {'lr': 0.00018298304017701783, 'samples': 17046912, 'steps': 88785, 'loss/train': 1.4507758617401123} 11/07/2021 09:41:14 - INFO - __main__ - Step 88787: {'lr': 0.0001829779276744099, 'samples': 17047104, 'steps': 88786, 'loss/train': 1.2089388370513916} 11/07/2021 09:41:15 - INFO - __main__ - Step 88788: {'lr': 0.0001829728152020011, 'samples': 17047296, 'steps': 88787, 'loss/train': 1.2487715482711792} 11/07/2021 09:41:16 - INFO - __main__ - Step 88789: {'lr': 0.00018296770275979372, 'samples': 17047488, 'steps': 88788, 'loss/train': 1.195920467376709} 11/07/2021 09:41:16 - INFO - __main__ - Step 88790: {'lr': 0.00018296259034779002, 'samples': 17047680, 'steps': 88789, 'loss/train': 1.0114307403564453} 11/07/2021 09:41:16 - INFO - __main__ - Step 88791: {'lr': 0.00018295747796599244, 'samples': 17047872, 'steps': 88790, 'loss/train': 1.6845842599868774} 11/07/2021 09:41:17 - INFO - __main__ - Step 88792: {'lr': 0.0001829523656144031, 'samples': 17048064, 'steps': 88791, 'loss/train': 1.5017985105514526} 11/07/2021 09:41:18 - INFO - __main__ - Step 88793: {'lr': 0.0001829472532930244, 'samples': 17048256, 'steps': 88792, 'loss/train': 1.4438101053237915} 11/07/2021 09:41:18 - INFO - __main__ - Step 88794: {'lr': 0.0001829421410018587, 'samples': 17048448, 'steps': 88793, 'loss/train': 1.6237637996673584} 11/07/2021 09:41:18 - INFO - __main__ - Step 88795: {'lr': 0.00018293702874090816, 'samples': 17048640, 'steps': 88794, 'loss/train': 1.44187331199646} 11/07/2021 09:41:19 - INFO - __main__ - Step 88796: {'lr': 0.00018293191651017515, 'samples': 17048832, 'steps': 88795, 'loss/train': 1.6540238857269287} 11/07/2021 09:41:19 - INFO - __main__ - Step 88797: {'lr': 0.000182926804309662, 'samples': 17049024, 'steps': 88796, 'loss/train': 1.574916124343872} 11/07/2021 09:41:19 - INFO - __main__ - Step 88798: {'lr': 0.000182921692139371, 'samples': 17049216, 'steps': 88797, 'loss/train': 1.5597435235977173} 11/07/2021 09:41:20 - INFO - __main__ - Step 88799: {'lr': 0.00018291657999930445, 'samples': 17049408, 'steps': 88798, 'loss/train': 1.8572136163711548} 11/07/2021 09:41:21 - INFO - __main__ - Step 88800: {'lr': 0.00018291146788946468, 'samples': 17049600, 'steps': 88799, 'loss/train': 1.0324844121932983} 11/07/2021 09:41:21 - INFO - __main__ - Step 88801: {'lr': 0.00018290635580985392, 'samples': 17049792, 'steps': 88800, 'loss/train': 1.439511775970459} 11/07/2021 09:41:21 - INFO - __main__ - Step 88802: {'lr': 0.0001829012437604746, 'samples': 17049984, 'steps': 88801, 'loss/train': 0.940906822681427} 11/07/2021 09:41:22 - INFO - __main__ - Step 88803: {'lr': 0.00018289613174132888, 'samples': 17050176, 'steps': 88802, 'loss/train': 1.3837530612945557} 11/07/2021 09:41:23 - INFO - __main__ - Step 88804: {'lr': 0.00018289101975241912, 'samples': 17050368, 'steps': 88803, 'loss/train': 0.8431553840637207} 11/07/2021 09:41:23 - INFO - __main__ - Step 88805: {'lr': 0.00018288590779374765, 'samples': 17050560, 'steps': 88804, 'loss/train': 0.8962242603302002} 11/07/2021 09:41:23 - INFO - __main__ - Step 88806: {'lr': 0.00018288079586531675, 'samples': 17050752, 'steps': 88805, 'loss/train': 1.606353521347046} 11/07/2021 09:41:24 - INFO - __main__ - Step 88807: {'lr': 0.00018287568396712872, 'samples': 17050944, 'steps': 88806, 'loss/train': 1.541637659072876} 11/07/2021 09:41:24 - INFO - __main__ - Step 88808: {'lr': 0.00018287057209918594, 'samples': 17051136, 'steps': 88807, 'loss/train': 1.413155198097229} 11/07/2021 09:41:25 - INFO - __main__ - Step 88809: {'lr': 0.0001828654602614906, 'samples': 17051328, 'steps': 88808, 'loss/train': 1.804150104522705} 11/07/2021 09:41:26 - INFO - __main__ - Step 88810: {'lr': 0.00018286034845404502, 'samples': 17051520, 'steps': 88809, 'loss/train': 1.4218194484710693} 11/07/2021 09:41:26 - INFO - __main__ - Step 88811: {'lr': 0.00018285523667685154, 'samples': 17051712, 'steps': 88810, 'loss/train': 1.1287633180618286} 11/07/2021 09:41:26 - INFO - __main__ - Step 88812: {'lr': 0.0001828501249299125, 'samples': 17051904, 'steps': 88811, 'loss/train': 1.055591344833374} 11/07/2021 09:41:27 - INFO - __main__ - Step 88813: {'lr': 0.0001828450132132301, 'samples': 17052096, 'steps': 88812, 'loss/train': 1.539716362953186} 11/07/2021 09:41:28 - INFO - __main__ - Step 88814: {'lr': 0.0001828399015268067, 'samples': 17052288, 'steps': 88813, 'loss/train': 0.16563387215137482} 11/07/2021 09:41:28 - INFO - __main__ - Step 88815: {'lr': 0.0001828347898706446, 'samples': 17052480, 'steps': 88814, 'loss/train': 1.504058837890625} 11/07/2021 09:41:28 - INFO - __main__ - Step 88816: {'lr': 0.00018282967824474617, 'samples': 17052672, 'steps': 88815, 'loss/train': 1.5180964469909668} 11/07/2021 09:41:29 - INFO - __main__ - Step 88817: {'lr': 0.0001828245666491136, 'samples': 17052864, 'steps': 88816, 'loss/train': 1.272017478942871} 11/07/2021 09:41:29 - INFO - __main__ - Step 88818: {'lr': 0.00018281945508374925, 'samples': 17053056, 'steps': 88817, 'loss/train': 1.4801130294799805} 11/07/2021 09:41:30 - INFO - __main__ - Step 88819: {'lr': 0.0001828143435486554, 'samples': 17053248, 'steps': 88818, 'loss/train': 1.596142292022705} 11/07/2021 09:41:31 - INFO - __main__ - Step 88820: {'lr': 0.00018280923204383437, 'samples': 17053440, 'steps': 88819, 'loss/train': 1.333622694015503} 11/07/2021 09:41:31 - INFO - __main__ - Step 88821: {'lr': 0.00018280412056928856, 'samples': 17053632, 'steps': 88820, 'loss/train': 0.9277184009552002} 11/07/2021 09:41:31 - INFO - __main__ - Step 88822: {'lr': 0.00018279900912502006, 'samples': 17053824, 'steps': 88821, 'loss/train': 1.6682096719741821} 11/07/2021 09:41:32 - INFO - __main__ - Step 88823: {'lr': 0.00018279389771103138, 'samples': 17054016, 'steps': 88822, 'loss/train': 1.331129789352417} 11/07/2021 09:41:32 - INFO - __main__ - Step 88824: {'lr': 0.0001827887863273247, 'samples': 17054208, 'steps': 88823, 'loss/train': 1.1452040672302246} 11/07/2021 09:41:33 - INFO - __main__ - Step 88825: {'lr': 0.0001827836749739023, 'samples': 17054400, 'steps': 88824, 'loss/train': 0.9865601062774658} 11/07/2021 09:41:33 - INFO - __main__ - Step 88826: {'lr': 0.00018277856365076658, 'samples': 17054592, 'steps': 88825, 'loss/train': 1.6685936450958252} 11/07/2021 09:41:34 - INFO - __main__ - Step 88827: {'lr': 0.00018277345235791982, 'samples': 17054784, 'steps': 88826, 'loss/train': 0.8817988634109497} 11/07/2021 09:41:34 - INFO - __main__ - Step 88828: {'lr': 0.00018276834109536428, 'samples': 17054976, 'steps': 88827, 'loss/train': 0.8344403505325317} 11/07/2021 09:41:35 - INFO - __main__ - Step 88829: {'lr': 0.00018276322986310228, 'samples': 17055168, 'steps': 88828, 'loss/train': 1.7317547798156738} 11/07/2021 09:41:35 - INFO - __main__ - Step 88830: {'lr': 0.0001827581186611361, 'samples': 17055360, 'steps': 88829, 'loss/train': 1.4550758600234985} 11/07/2021 09:41:36 - INFO - __main__ - Step 88831: {'lr': 0.00018275300748946813, 'samples': 17055552, 'steps': 88830, 'loss/train': 1.4530543088912964} 11/07/2021 09:41:36 - INFO - __main__ - Step 88832: {'lr': 0.0001827478963481006, 'samples': 17055744, 'steps': 88831, 'loss/train': 0.7877528071403503} 11/07/2021 09:41:37 - INFO - __main__ - Step 88833: {'lr': 0.00018274278523703583, 'samples': 17055936, 'steps': 88832, 'loss/train': 1.4240211248397827} 11/07/2021 09:41:37 - INFO - __main__ - Step 88834: {'lr': 0.00018273767415627611, 'samples': 17056128, 'steps': 88833, 'loss/train': 1.2321723699569702} 11/07/2021 09:41:37 - INFO - __main__ - Step 88835: {'lr': 0.0001827325631058239, 'samples': 17056320, 'steps': 88834, 'loss/train': 1.2940467596054077} 11/07/2021 09:41:38 - INFO - __main__ - Step 88836: {'lr': 0.0001827274520856812, 'samples': 17056512, 'steps': 88835, 'loss/train': 1.2725787162780762} 11/07/2021 09:41:39 - INFO - __main__ - Step 88837: {'lr': 0.0001827223410958505, 'samples': 17056704, 'steps': 88836, 'loss/train': 0.26165133714675903} 11/07/2021 09:41:39 - INFO - __main__ - Step 88838: {'lr': 0.0001827172301363341, 'samples': 17056896, 'steps': 88837, 'loss/train': 1.2740756273269653} 11/07/2021 09:41:39 - INFO - __main__ - Step 88839: {'lr': 0.00018271211920713423, 'samples': 17057088, 'steps': 88838, 'loss/train': 1.730088710784912} 11/07/2021 09:41:40 - INFO - __main__ - Step 88840: {'lr': 0.00018270700830825325, 'samples': 17057280, 'steps': 88839, 'loss/train': 0.9747503995895386} 11/07/2021 09:41:41 - INFO - __main__ - Step 88841: {'lr': 0.00018270189743969348, 'samples': 17057472, 'steps': 88840, 'loss/train': 1.3729997873306274} 11/07/2021 09:41:41 - INFO - __main__ - Step 88842: {'lr': 0.0001826967866014572, 'samples': 17057664, 'steps': 88841, 'loss/train': 1.4371088743209839} 11/07/2021 09:41:41 - INFO - __main__ - Step 88843: {'lr': 0.00018269167579354668, 'samples': 17057856, 'steps': 88842, 'loss/train': 1.2104593515396118} 11/07/2021 09:41:42 - INFO - __main__ - Step 88844: {'lr': 0.00018268656501596426, 'samples': 17058048, 'steps': 88843, 'loss/train': 1.0875753164291382} 11/07/2021 09:41:42 - INFO - __main__ - Step 88845: {'lr': 0.00018268145426871224, 'samples': 17058240, 'steps': 88844, 'loss/train': 1.1264004707336426} 11/07/2021 09:41:43 - INFO - __main__ - Step 88846: {'lr': 0.00018267634355179291, 'samples': 17058432, 'steps': 88845, 'loss/train': 1.7198740243911743} 11/07/2021 09:41:44 - INFO - __main__ - Step 88847: {'lr': 0.0001826712328652086, 'samples': 17058624, 'steps': 88846, 'loss/train': 1.5201184749603271} 11/07/2021 09:41:44 - INFO - __main__ - Step 88848: {'lr': 0.00018266612220896168, 'samples': 17058816, 'steps': 88847, 'loss/train': 1.374144196510315} 11/07/2021 09:41:44 - INFO - __main__ - Step 88849: {'lr': 0.00018266101158305426, 'samples': 17059008, 'steps': 88848, 'loss/train': 1.0250955820083618} 11/07/2021 09:41:45 - INFO - __main__ - Step 88850: {'lr': 0.00018265590098748876, 'samples': 17059200, 'steps': 88849, 'loss/train': 1.7108186483383179} 11/07/2021 09:41:46 - INFO - __main__ - Step 88851: {'lr': 0.00018265079042226748, 'samples': 17059392, 'steps': 88850, 'loss/train': 1.5230990648269653} 11/07/2021 09:41:46 - INFO - __main__ - Step 88852: {'lr': 0.0001826456798873927, 'samples': 17059584, 'steps': 88851, 'loss/train': 1.4251989126205444} 11/07/2021 09:41:46 - INFO - __main__ - Step 88853: {'lr': 0.00018264056938286676, 'samples': 17059776, 'steps': 88852, 'loss/train': 1.1940337419509888} 11/07/2021 09:41:47 - INFO - __main__ - Step 88854: {'lr': 0.0001826354589086919, 'samples': 17059968, 'steps': 88853, 'loss/train': 0.8226433396339417} 11/07/2021 09:41:47 - INFO - __main__ - Step 88855: {'lr': 0.0001826303484648705, 'samples': 17060160, 'steps': 88854, 'loss/train': 1.4238903522491455} 11/07/2021 09:41:48 - INFO - __main__ - Step 88856: {'lr': 0.0001826252380514048, 'samples': 17060352, 'steps': 88855, 'loss/train': 0.823657214641571} 11/07/2021 09:41:48 - INFO - __main__ - Step 88857: {'lr': 0.00018262012766829714, 'samples': 17060544, 'steps': 88856, 'loss/train': 1.5461784601211548} 11/07/2021 09:41:49 - INFO - __main__ - Step 88858: {'lr': 0.0001826150173155498, 'samples': 17060736, 'steps': 88857, 'loss/train': 1.3885945081710815} 11/07/2021 09:41:49 - INFO - __main__ - Step 88859: {'lr': 0.0001826099069931651, 'samples': 17060928, 'steps': 88858, 'loss/train': 0.7752049565315247} 11/07/2021 09:41:49 - INFO - __main__ - Step 88860: {'lr': 0.00018260479670114532, 'samples': 17061120, 'steps': 88859, 'loss/train': 1.1170828342437744} 11/07/2021 09:41:51 - INFO - __main__ - Step 88861: {'lr': 0.00018259968643949293, 'samples': 17061312, 'steps': 88860, 'loss/train': 1.0777697563171387} 11/07/2021 09:41:51 - INFO - __main__ - Step 88862: {'lr': 0.00018259457620820992, 'samples': 17061504, 'steps': 88861, 'loss/train': 0.811281144618988} 11/07/2021 09:41:52 - INFO - __main__ - Step 88863: {'lr': 0.0001825894660072988, 'samples': 17061696, 'steps': 88862, 'loss/train': 0.4528470039367676} 11/07/2021 09:41:52 - INFO - __main__ - Step 88864: {'lr': 0.00018258435583676182, 'samples': 17061888, 'steps': 88863, 'loss/train': 1.3880256414413452} 11/07/2021 09:41:52 - INFO - __main__ - Step 88865: {'lr': 0.00018257924569660126, 'samples': 17062080, 'steps': 88864, 'loss/train': 1.6732258796691895} 11/07/2021 09:41:53 - INFO - __main__ - Step 88866: {'lr': 0.00018257413558681946, 'samples': 17062272, 'steps': 88865, 'loss/train': 2.092818021774292} 11/07/2021 09:41:54 - INFO - __main__ - Step 88867: {'lr': 0.0001825690255074187, 'samples': 17062464, 'steps': 88866, 'loss/train': 1.4428755044937134} 11/07/2021 09:41:54 - INFO - __main__ - Step 88868: {'lr': 0.00018256391545840134, 'samples': 17062656, 'steps': 88867, 'loss/train': 1.2342997789382935} 11/07/2021 09:41:54 - INFO - __main__ - Step 88869: {'lr': 0.0001825588054397696, 'samples': 17062848, 'steps': 88868, 'loss/train': 1.705339789390564} 11/07/2021 09:41:55 - INFO - __main__ - Step 88870: {'lr': 0.00018255369545152586, 'samples': 17063040, 'steps': 88869, 'loss/train': 1.5163793563842773} 11/07/2021 09:41:55 - INFO - __main__ - Step 88871: {'lr': 0.00018254858549367236, 'samples': 17063232, 'steps': 88870, 'loss/train': 1.7208994626998901} 11/07/2021 09:41:55 - INFO - __main__ - Step 88872: {'lr': 0.00018254347556621143, 'samples': 17063424, 'steps': 88871, 'loss/train': 1.5654640197753906} 11/07/2021 09:41:56 - INFO - __main__ - Step 88873: {'lr': 0.0001825383656691454, 'samples': 17063616, 'steps': 88872, 'loss/train': 1.4371076822280884} 11/07/2021 09:41:57 - INFO - __main__ - Step 88874: {'lr': 0.00018253325580247647, 'samples': 17063808, 'steps': 88873, 'loss/train': 1.05513334274292} 11/07/2021 09:41:57 - INFO - __main__ - Step 88875: {'lr': 0.00018252814596620716, 'samples': 17064000, 'steps': 88874, 'loss/train': 1.4043272733688354} 11/07/2021 09:41:57 - INFO - __main__ - Step 88876: {'lr': 0.00018252303616033956, 'samples': 17064192, 'steps': 88875, 'loss/train': 0.7508611083030701} 11/07/2021 09:41:58 - INFO - __main__ - Step 88877: {'lr': 0.00018251792638487597, 'samples': 17064384, 'steps': 88876, 'loss/train': 1.4125629663467407} 11/07/2021 09:41:59 - INFO - __main__ - Step 88878: {'lr': 0.00018251281663981877, 'samples': 17064576, 'steps': 88877, 'loss/train': 1.4506841897964478} 11/07/2021 09:41:59 - INFO - __main__ - Step 88879: {'lr': 0.0001825077069251703, 'samples': 17064768, 'steps': 88878, 'loss/train': 1.2839584350585938} 11/07/2021 09:42:00 - INFO - __main__ - Step 88880: {'lr': 0.00018250259724093276, 'samples': 17064960, 'steps': 88879, 'loss/train': 1.9021672010421753} 11/07/2021 09:42:00 - INFO - __main__ - Step 88881: {'lr': 0.00018249748758710854, 'samples': 17065152, 'steps': 88880, 'loss/train': 0.6562809348106384} 11/07/2021 09:42:00 - INFO - __main__ - Step 88882: {'lr': 0.00018249237796369994, 'samples': 17065344, 'steps': 88881, 'loss/train': 0.933929443359375} 11/07/2021 09:42:01 - INFO - __main__ - Step 88883: {'lr': 0.00018248726837070918, 'samples': 17065536, 'steps': 88882, 'loss/train': 1.1549278497695923} 11/07/2021 09:42:02 - INFO - __main__ - Step 88884: {'lr': 0.00018248215880813863, 'samples': 17065728, 'steps': 88883, 'loss/train': 0.8706420660018921} 11/07/2021 09:42:02 - INFO - __main__ - Step 88885: {'lr': 0.0001824770492759906, 'samples': 17065920, 'steps': 88884, 'loss/train': 1.3475621938705444} 11/07/2021 09:42:02 - INFO - __main__ - Step 88886: {'lr': 0.00018247193977426735, 'samples': 17066112, 'steps': 88885, 'loss/train': 1.1922990083694458} 11/07/2021 09:42:03 - INFO - __main__ - Step 88887: {'lr': 0.0001824668303029712, 'samples': 17066304, 'steps': 88886, 'loss/train': 1.2763218879699707} 11/07/2021 09:42:03 - INFO - __main__ - Step 88888: {'lr': 0.00018246172086210455, 'samples': 17066496, 'steps': 88887, 'loss/train': 1.4783943891525269} 11/07/2021 09:42:04 - INFO - __main__ - Step 88889: {'lr': 0.00018245661145166952, 'samples': 17066688, 'steps': 88888, 'loss/train': 1.374481439590454} 11/07/2021 09:42:05 - INFO - __main__ - Step 88890: {'lr': 0.0001824515020716685, 'samples': 17066880, 'steps': 88889, 'loss/train': 1.3685243129730225} 11/07/2021 09:42:05 - INFO - __main__ - Step 88891: {'lr': 0.0001824463927221038, 'samples': 17067072, 'steps': 88890, 'loss/train': 1.4492334127426147} 11/07/2021 09:42:05 - INFO - __main__ - Step 88892: {'lr': 0.00018244128340297766, 'samples': 17067264, 'steps': 88891, 'loss/train': 1.304186463356018} 11/07/2021 09:42:06 - INFO - __main__ - Step 88893: {'lr': 0.00018243617411429247, 'samples': 17067456, 'steps': 88892, 'loss/train': 1.004669189453125} 11/07/2021 09:42:07 - INFO - __main__ - Step 88894: {'lr': 0.00018243106485605053, 'samples': 17067648, 'steps': 88893, 'loss/train': 1.4206814765930176} 11/07/2021 09:42:07 - INFO - __main__ - Step 88895: {'lr': 0.0001824259556282541, 'samples': 17067840, 'steps': 88894, 'loss/train': 1.564016342163086} 11/07/2021 09:42:07 - INFO - __main__ - Step 88896: {'lr': 0.00018242084643090546, 'samples': 17068032, 'steps': 88895, 'loss/train': 1.267768144607544} 11/07/2021 09:42:08 - INFO - __main__ - Step 88897: {'lr': 0.00018241573726400696, 'samples': 17068224, 'steps': 88896, 'loss/train': 0.9846521615982056} 11/07/2021 09:42:08 - INFO - __main__ - Step 88898: {'lr': 0.00018241062812756088, 'samples': 17068416, 'steps': 88897, 'loss/train': 1.5098921060562134} 11/07/2021 09:42:09 - INFO - __main__ - Step 88899: {'lr': 0.00018240551902156952, 'samples': 17068608, 'steps': 88898, 'loss/train': 1.3422943353652954} 11/07/2021 09:42:10 - INFO - __main__ - Step 88900: {'lr': 0.0001824004099460352, 'samples': 17068800, 'steps': 88899, 'loss/train': 0.7245060205459595} 11/07/2021 09:42:10 - INFO - __main__ - Step 88901: {'lr': 0.00018239530090096017, 'samples': 17068992, 'steps': 88900, 'loss/train': 0.39605069160461426} 11/07/2021 09:42:10 - INFO - __main__ - Step 88902: {'lr': 0.00018239019188634695, 'samples': 17069184, 'steps': 88901, 'loss/train': 1.3831919431686401} 11/07/2021 09:42:11 - INFO - __main__ - Step 88903: {'lr': 0.00018238508290219753, 'samples': 17069376, 'steps': 88902, 'loss/train': 0.6667335033416748} 11/07/2021 09:42:11 - INFO - __main__ - Step 88904: {'lr': 0.00018237997394851435, 'samples': 17069568, 'steps': 88903, 'loss/train': 1.5833609104156494} 11/07/2021 09:42:12 - INFO - __main__ - Step 88905: {'lr': 0.00018237486502529972, 'samples': 17069760, 'steps': 88904, 'loss/train': 1.3295259475708008} 11/07/2021 09:42:12 - INFO - __main__ - Step 88906: {'lr': 0.00018236975613255592, 'samples': 17069952, 'steps': 88905, 'loss/train': 1.5610243082046509} 11/07/2021 09:42:13 - INFO - __main__ - Step 88907: {'lr': 0.00018236464727028527, 'samples': 17070144, 'steps': 88906, 'loss/train': 1.4653282165527344} 11/07/2021 09:42:13 - INFO - __main__ - Step 88908: {'lr': 0.00018235953843849008, 'samples': 17070336, 'steps': 88907, 'loss/train': 1.3517398834228516} 11/07/2021 09:42:13 - INFO - __main__ - Step 88909: {'lr': 0.00018235442963717257, 'samples': 17070528, 'steps': 88908, 'loss/train': 0.6667860746383667} 11/07/2021 09:42:14 - INFO - __main__ - Step 88910: {'lr': 0.00018234932086633518, 'samples': 17070720, 'steps': 88909, 'loss/train': 1.5456684827804565} 11/07/2021 09:42:15 - INFO - __main__ - Step 88911: {'lr': 0.0001823442121259801, 'samples': 17070912, 'steps': 88910, 'loss/train': 1.2045990228652954} 11/07/2021 09:42:15 - INFO - __main__ - Step 88912: {'lr': 0.00018233910341610972, 'samples': 17071104, 'steps': 88911, 'loss/train': 1.4507105350494385} 11/07/2021 09:42:16 - INFO - __main__ - Step 88913: {'lr': 0.0001823339947367263, 'samples': 17071296, 'steps': 88912, 'loss/train': 1.264703392982483} 11/07/2021 09:42:16 - INFO - __main__ - Step 88914: {'lr': 0.00018232888608783217, 'samples': 17071488, 'steps': 88913, 'loss/train': 1.333620309829712} 11/07/2021 09:42:17 - INFO - __main__ - Step 88915: {'lr': 0.00018232377746942957, 'samples': 17071680, 'steps': 88914, 'loss/train': 1.5621306896209717} 11/07/2021 09:42:17 - INFO - __main__ - Step 88916: {'lr': 0.0001823186688815208, 'samples': 17071872, 'steps': 88915, 'loss/train': 1.2311230897903442} 11/07/2021 09:42:18 - INFO - __main__ - Step 88917: {'lr': 0.0001823135603241082, 'samples': 17072064, 'steps': 88916, 'loss/train': 0.798693835735321} 11/07/2021 09:42:18 - INFO - __main__ - Step 88918: {'lr': 0.0001823084517971941, 'samples': 17072256, 'steps': 88917, 'loss/train': 1.431484341621399} 11/07/2021 09:42:18 - INFO - __main__ - Step 88919: {'lr': 0.00018230334330078069, 'samples': 17072448, 'steps': 88918, 'loss/train': 1.3643901348114014} 11/07/2021 09:42:19 - INFO - __main__ - Step 88920: {'lr': 0.0001822982348348704, 'samples': 17072640, 'steps': 88919, 'loss/train': 1.7155749797821045} 11/07/2021 09:42:20 - INFO - __main__ - Step 88921: {'lr': 0.00018229312639946545, 'samples': 17072832, 'steps': 88920, 'loss/train': 1.7030454874038696} 11/07/2021 09:42:20 - INFO - __main__ - Step 88922: {'lr': 0.00018228801799456817, 'samples': 17073024, 'steps': 88921, 'loss/train': 1.3986246585845947} 11/07/2021 09:42:21 - INFO - __main__ - Step 88923: {'lr': 0.0001822829096201809, 'samples': 17073216, 'steps': 88922, 'loss/train': 0.921894907951355} 11/07/2021 09:42:21 - INFO - __main__ - Step 88924: {'lr': 0.0001822778012763059, 'samples': 17073408, 'steps': 88923, 'loss/train': 1.4336278438568115} 11/07/2021 09:42:22 - INFO - __main__ - Step 88925: {'lr': 0.00018227269296294552, 'samples': 17073600, 'steps': 88924, 'loss/train': 1.067434549331665} 11/07/2021 09:42:22 - INFO - __main__ - Step 88926: {'lr': 0.00018226758468010195, 'samples': 17073792, 'steps': 88925, 'loss/train': 1.1804537773132324} 11/07/2021 09:42:23 - INFO - __main__ - Step 88927: {'lr': 0.0001822624764277776, 'samples': 17073984, 'steps': 88926, 'loss/train': 1.4222925901412964} 11/07/2021 09:42:23 - INFO - __main__ - Step 88928: {'lr': 0.0001822573682059747, 'samples': 17074176, 'steps': 88927, 'loss/train': 1.3907841444015503} 11/07/2021 09:42:23 - INFO - __main__ - Step 88929: {'lr': 0.00018225226001469564, 'samples': 17074368, 'steps': 88928, 'loss/train': 1.0876489877700806} 11/07/2021 09:42:24 - INFO - __main__ - Step 88930: {'lr': 0.00018224715185394263, 'samples': 17074560, 'steps': 88929, 'loss/train': 1.057643175125122} 11/07/2021 09:42:25 - INFO - __main__ - Step 88931: {'lr': 0.000182242043723718, 'samples': 17074752, 'steps': 88930, 'loss/train': 1.6434026956558228} 11/07/2021 09:42:25 - INFO - __main__ - Step 88932: {'lr': 0.00018223693562402404, 'samples': 17074944, 'steps': 88931, 'loss/train': 1.294454574584961} 11/07/2021 09:42:25 - INFO - __main__ - Step 88933: {'lr': 0.0001822318275548631, 'samples': 17075136, 'steps': 88932, 'loss/train': 1.0698575973510742} 11/07/2021 09:42:26 - INFO - __main__ - Step 88934: {'lr': 0.00018222671951623746, 'samples': 17075328, 'steps': 88933, 'loss/train': 2.0450406074523926} 11/07/2021 09:42:26 - INFO - __main__ - Step 88935: {'lr': 0.0001822216115081494, 'samples': 17075520, 'steps': 88934, 'loss/train': 1.139235019683838} 11/07/2021 09:42:27 - INFO - __main__ - Step 88936: {'lr': 0.0001822165035306013, 'samples': 17075712, 'steps': 88935, 'loss/train': 0.22004170715808868} 11/07/2021 09:42:28 - INFO - __main__ - Step 88937: {'lr': 0.0001822113955835953, 'samples': 17075904, 'steps': 88936, 'loss/train': 1.3824890851974487} 11/07/2021 09:42:28 - INFO - __main__ - Step 88938: {'lr': 0.00018220628766713384, 'samples': 17076096, 'steps': 88937, 'loss/train': 0.774637758731842} 11/07/2021 09:42:28 - INFO - __main__ - Step 88939: {'lr': 0.0001822011797812192, 'samples': 17076288, 'steps': 88938, 'loss/train': 1.1260108947753906} 11/07/2021 09:42:29 - INFO - __main__ - Step 88940: {'lr': 0.0001821960719258536, 'samples': 17076480, 'steps': 88939, 'loss/train': 0.8007310628890991} 11/07/2021 09:42:30 - INFO - __main__ - Step 88941: {'lr': 0.00018219096410103947, 'samples': 17076672, 'steps': 88940, 'loss/train': 1.4948327541351318} 11/07/2021 09:42:30 - INFO - __main__ - Step 88942: {'lr': 0.00018218585630677903, 'samples': 17076864, 'steps': 88941, 'loss/train': 1.6578242778778076} 11/07/2021 09:42:30 - INFO - __main__ - Step 88943: {'lr': 0.0001821807485430746, 'samples': 17077056, 'steps': 88942, 'loss/train': 1.2339907884597778} 11/07/2021 09:42:31 - INFO - __main__ - Step 88944: {'lr': 0.00018217564080992845, 'samples': 17077248, 'steps': 88943, 'loss/train': 1.3748071193695068} 11/07/2021 09:42:31 - INFO - __main__ - Step 88945: {'lr': 0.00018217053310734294, 'samples': 17077440, 'steps': 88944, 'loss/train': 1.8892771005630493} 11/07/2021 09:42:32 - INFO - __main__ - Step 88946: {'lr': 0.0001821654254353203, 'samples': 17077632, 'steps': 88945, 'loss/train': 1.1459505558013916} 11/07/2021 09:42:32 - INFO - __main__ - Step 88947: {'lr': 0.00018216031779386295, 'samples': 17077824, 'steps': 88946, 'loss/train': 1.4893012046813965} 11/07/2021 09:42:33 - INFO - __main__ - Step 88948: {'lr': 0.00018215521018297303, 'samples': 17078016, 'steps': 88947, 'loss/train': 1.4344463348388672} 11/07/2021 09:42:33 - INFO - __main__ - Step 88949: {'lr': 0.00018215010260265297, 'samples': 17078208, 'steps': 88948, 'loss/train': 1.5095868110656738} 11/07/2021 09:42:33 - INFO - __main__ - Step 88950: {'lr': 0.000182144995052905, 'samples': 17078400, 'steps': 88949, 'loss/train': 1.1759729385375977} 11/07/2021 09:42:34 - INFO - __main__ - Step 88951: {'lr': 0.00018213988753373146, 'samples': 17078592, 'steps': 88950, 'loss/train': 0.8689207434654236} 11/07/2021 09:42:35 - INFO - __main__ - Step 88952: {'lr': 0.0001821347800451346, 'samples': 17078784, 'steps': 88951, 'loss/train': 1.9276107549667358} 11/07/2021 09:42:35 - INFO - __main__ - Step 88953: {'lr': 0.0001821296725871168, 'samples': 17078976, 'steps': 88952, 'loss/train': 1.6356160640716553} 11/07/2021 09:42:35 - INFO - __main__ - Step 88954: {'lr': 0.00018212456515968035, 'samples': 17079168, 'steps': 88953, 'loss/train': 1.3641736507415771} 11/07/2021 09:42:36 - INFO - __main__ - Step 88955: {'lr': 0.0001821194577628275, 'samples': 17079360, 'steps': 88954, 'loss/train': 1.3296420574188232} 11/07/2021 09:42:36 - INFO - __main__ - Step 88956: {'lr': 0.0001821143503965606, 'samples': 17079552, 'steps': 88955, 'loss/train': 0.9587552547454834} 11/07/2021 09:42:37 - INFO - __main__ - Step 88957: {'lr': 0.0001821092430608819, 'samples': 17079744, 'steps': 88956, 'loss/train': 0.8711540699005127} 11/07/2021 09:42:38 - INFO - __main__ - Step 88958: {'lr': 0.00018210413575579378, 'samples': 17079936, 'steps': 88957, 'loss/train': 1.106492042541504} 11/07/2021 09:42:38 - INFO - __main__ - Step 88959: {'lr': 0.00018209902848129842, 'samples': 17080128, 'steps': 88958, 'loss/train': 1.4078502655029297} 11/07/2021 09:42:38 - INFO - __main__ - Step 88960: {'lr': 0.00018209392123739822, 'samples': 17080320, 'steps': 88959, 'loss/train': 1.4580621719360352} 11/07/2021 09:42:39 - INFO - __main__ - Step 88961: {'lr': 0.0001820888140240954, 'samples': 17080512, 'steps': 88960, 'loss/train': 1.6252957582473755} 11/07/2021 09:42:40 - INFO - __main__ - Step 88962: {'lr': 0.00018208370684139237, 'samples': 17080704, 'steps': 88961, 'loss/train': 1.7822967767715454} 11/07/2021 09:42:40 - INFO - __main__ - Step 88963: {'lr': 0.00018207859968929132, 'samples': 17080896, 'steps': 88962, 'loss/train': 1.348771333694458} 11/07/2021 09:42:40 - INFO - __main__ - Step 88964: {'lr': 0.00018207349256779465, 'samples': 17081088, 'steps': 88963, 'loss/train': 1.0401579141616821} 11/07/2021 09:42:41 - INFO - __main__ - Step 88965: {'lr': 0.00018206838547690457, 'samples': 17081280, 'steps': 88964, 'loss/train': 1.6779555082321167} 11/07/2021 09:42:41 - INFO - __main__ - Step 88966: {'lr': 0.00018206327841662346, 'samples': 17081472, 'steps': 88965, 'loss/train': 1.4645670652389526} 11/07/2021 09:42:41 - INFO - __main__ - Step 88967: {'lr': 0.00018205817138695356, 'samples': 17081664, 'steps': 88966, 'loss/train': 1.1900489330291748} 11/07/2021 09:42:42 - INFO - __main__ - Step 88968: {'lr': 0.00018205306438789725, 'samples': 17081856, 'steps': 88967, 'loss/train': 1.117751955986023} 11/07/2021 09:42:43 - INFO - __main__ - Step 88969: {'lr': 0.00018204795741945685, 'samples': 17082048, 'steps': 88968, 'loss/train': 1.4235177040100098} 11/07/2021 09:42:43 - INFO - __main__ - Step 88970: {'lr': 0.0001820428504816345, 'samples': 17082240, 'steps': 88969, 'loss/train': 1.5987321138381958} 11/07/2021 09:42:44 - INFO - __main__ - Step 88971: {'lr': 0.0001820377435744326, 'samples': 17082432, 'steps': 88970, 'loss/train': 1.0758349895477295} 11/07/2021 09:42:44 - INFO - __main__ - Step 88972: {'lr': 0.00018203263669785342, 'samples': 17082624, 'steps': 88971, 'loss/train': 2.327357053756714} 11/07/2021 09:42:45 - INFO - __main__ - Step 88973: {'lr': 0.0001820275298518993, 'samples': 17082816, 'steps': 88972, 'loss/train': 1.3250911235809326} 11/07/2021 09:42:45 - INFO - __main__ - Step 88974: {'lr': 0.00018202242303657251, 'samples': 17083008, 'steps': 88973, 'loss/train': 0.7527326941490173} 11/07/2021 09:42:46 - INFO - __main__ - Step 88975: {'lr': 0.00018201731625187538, 'samples': 17083200, 'steps': 88974, 'loss/train': 1.5046530961990356} 11/07/2021 09:42:46 - INFO - __main__ - Step 88976: {'lr': 0.00018201220949781022, 'samples': 17083392, 'steps': 88975, 'loss/train': 1.3118555545806885} 11/07/2021 09:42:46 - INFO - __main__ - Step 88977: {'lr': 0.00018200710277437927, 'samples': 17083584, 'steps': 88976, 'loss/train': 1.011369228363037} 11/07/2021 09:42:47 - INFO - __main__ - Step 88978: {'lr': 0.0001820019960815849, 'samples': 17083776, 'steps': 88977, 'loss/train': 1.2176815271377563} 11/07/2021 09:42:48 - INFO - __main__ - Step 88979: {'lr': 0.00018199688941942938, 'samples': 17083968, 'steps': 88978, 'loss/train': 1.7003560066223145} 11/07/2021 09:42:48 - INFO - __main__ - Step 88980: {'lr': 0.000181991782787915, 'samples': 17084160, 'steps': 88979, 'loss/train': 1.549998164176941} 11/07/2021 09:42:48 - INFO - __main__ - Step 88981: {'lr': 0.00018198667618704408, 'samples': 17084352, 'steps': 88980, 'loss/train': 0.48196926712989807} 11/07/2021 09:42:49 - INFO - __main__ - Step 88982: {'lr': 0.000181981569616819, 'samples': 17084544, 'steps': 88981, 'loss/train': 1.212378740310669} 11/07/2021 09:42:50 - INFO - __main__ - Step 88983: {'lr': 0.0001819764630772419, 'samples': 17084736, 'steps': 88982, 'loss/train': 0.7840624451637268} 11/07/2021 09:42:50 - INFO - __main__ - Step 88984: {'lr': 0.0001819713565683151, 'samples': 17084928, 'steps': 88983, 'loss/train': 1.3118815422058105} 11/07/2021 09:42:51 - INFO - __main__ - Step 88985: {'lr': 0.00018196625009004103, 'samples': 17085120, 'steps': 88984, 'loss/train': 1.289143443107605} 11/07/2021 09:42:51 - INFO - __main__ - Step 88986: {'lr': 0.0001819611436424219, 'samples': 17085312, 'steps': 88985, 'loss/train': 1.491917610168457} 11/07/2021 09:42:51 - INFO - __main__ - Step 88987: {'lr': 0.00018195603722546, 'samples': 17085504, 'steps': 88986, 'loss/train': 5.69888162612915} 11/07/2021 09:42:52 - INFO - __main__ - Step 88988: {'lr': 0.00018195093083915766, 'samples': 17085696, 'steps': 88987, 'loss/train': 1.396615743637085} 11/07/2021 09:42:53 - INFO - __main__ - Step 88989: {'lr': 0.00018194582448351722, 'samples': 17085888, 'steps': 88988, 'loss/train': 1.079343318939209} 11/07/2021 09:42:53 - INFO - __main__ - Step 88990: {'lr': 0.00018194071815854092, 'samples': 17086080, 'steps': 88989, 'loss/train': 1.3792957067489624} 11/07/2021 09:42:53 - INFO - __main__ - Step 88991: {'lr': 0.00018193561186423106, 'samples': 17086272, 'steps': 88990, 'loss/train': 1.087691068649292} 11/07/2021 09:42:54 - INFO - __main__ - Step 88992: {'lr': 0.00018193050560058997, 'samples': 17086464, 'steps': 88991, 'loss/train': 1.1205886602401733} 11/07/2021 09:42:54 - INFO - __main__ - Step 88993: {'lr': 0.00018192539936761997, 'samples': 17086656, 'steps': 88992, 'loss/train': 1.0653496980667114} 11/07/2021 09:42:55 - INFO - __main__ - Step 88994: {'lr': 0.0001819202931653233, 'samples': 17086848, 'steps': 88993, 'loss/train': 1.197283387184143} 11/07/2021 09:42:55 - INFO - __main__ - Step 88995: {'lr': 0.0001819151869937023, 'samples': 17087040, 'steps': 88994, 'loss/train': 0.543476939201355} 11/07/2021 09:42:56 - INFO - __main__ - Step 88996: {'lr': 0.0001819100808527594, 'samples': 17087232, 'steps': 88995, 'loss/train': 1.6730798482894897} 11/07/2021 09:42:56 - INFO - __main__ - Step 88997: {'lr': 0.00018190497474249664, 'samples': 17087424, 'steps': 88996, 'loss/train': 1.6200696229934692} 11/07/2021 09:42:56 - INFO - __main__ - Step 88998: {'lr': 0.00018189986866291646, 'samples': 17087616, 'steps': 88997, 'loss/train': 1.0133460760116577} 11/07/2021 09:42:58 - INFO - __main__ - Step 88999: {'lr': 0.00018189476261402116, 'samples': 17087808, 'steps': 88998, 'loss/train': 1.543308138847351} 11/07/2021 09:42:58 - INFO - __main__ - Step 89000: {'lr': 0.000181889656595813, 'samples': 17088000, 'steps': 88999, 'loss/train': 0.6034559607505798} 11/07/2021 09:42:58 - INFO - __main__ - Step 89001: {'lr': 0.0001818845506082943, 'samples': 17088192, 'steps': 89000, 'loss/train': 1.247695803642273} 11/07/2021 09:42:59 - INFO - __main__ - Step 89002: {'lr': 0.00018187944465146742, 'samples': 17088384, 'steps': 89001, 'loss/train': 1.6086386442184448} 11/07/2021 09:42:59 - INFO - __main__ - Step 89003: {'lr': 0.00018187433872533457, 'samples': 17088576, 'steps': 89002, 'loss/train': 1.501760721206665} 11/07/2021 09:43:00 - INFO - __main__ - Step 89004: {'lr': 0.00018186923282989808, 'samples': 17088768, 'steps': 89003, 'loss/train': 0.8123756647109985} 11/07/2021 09:43:00 - INFO - __main__ - Step 89005: {'lr': 0.00018186412696516031, 'samples': 17088960, 'steps': 89004, 'loss/train': 1.3956588506698608} 11/07/2021 09:43:01 - INFO - __main__ - Step 89006: {'lr': 0.00018185902113112352, 'samples': 17089152, 'steps': 89005, 'loss/train': 1.5042136907577515} 11/07/2021 09:43:01 - INFO - __main__ - Step 89007: {'lr': 0.00018185391532778993, 'samples': 17089344, 'steps': 89006, 'loss/train': 1.3194527626037598} 11/07/2021 09:43:01 - INFO - __main__ - Step 89008: {'lr': 0.00018184880955516197, 'samples': 17089536, 'steps': 89007, 'loss/train': 1.573068618774414} 11/07/2021 09:43:02 - INFO - __main__ - Step 89009: {'lr': 0.000181843703813242, 'samples': 17089728, 'steps': 89008, 'loss/train': 1.8995918035507202} 11/07/2021 09:43:03 - INFO - __main__ - Step 89010: {'lr': 0.00018183859810203207, 'samples': 17089920, 'steps': 89009, 'loss/train': 2.153380870819092} 11/07/2021 09:43:03 - INFO - __main__ - Step 89011: {'lr': 0.00018183349242153462, 'samples': 17090112, 'steps': 89010, 'loss/train': 1.456430196762085} 11/07/2021 09:43:04 - INFO - __main__ - Step 89012: {'lr': 0.00018182838677175195, 'samples': 17090304, 'steps': 89011, 'loss/train': 0.9254629015922546} 11/07/2021 09:43:04 - INFO - __main__ - Step 89013: {'lr': 0.00018182328115268638, 'samples': 17090496, 'steps': 89012, 'loss/train': 1.3056248426437378} 11/07/2021 09:43:04 - INFO - __main__ - Step 89014: {'lr': 0.00018181817556434015, 'samples': 17090688, 'steps': 89013, 'loss/train': 2.0388684272766113} 11/07/2021 09:43:06 - INFO - __main__ - Step 89015: {'lr': 0.0001818130700067156, 'samples': 17090880, 'steps': 89014, 'loss/train': 1.4711494445800781} 11/07/2021 09:43:06 - INFO - __main__ - Step 89016: {'lr': 0.00018180796447981508, 'samples': 17091072, 'steps': 89015, 'loss/train': 1.3071033954620361} 11/07/2021 09:43:07 - INFO - __main__ - Step 89017: {'lr': 0.00018180285898364076, 'samples': 17091264, 'steps': 89016, 'loss/train': 0.7729685306549072} 11/07/2021 09:43:07 - INFO - __main__ - Step 89018: {'lr': 0.00018179775351819506, 'samples': 17091456, 'steps': 89017, 'loss/train': 1.4112269878387451} 11/07/2021 09:43:07 - INFO - __main__ - Step 89019: {'lr': 0.00018179264808348026, 'samples': 17091648, 'steps': 89018, 'loss/train': 1.253597617149353} 11/07/2021 09:43:08 - INFO - __main__ - Step 89020: {'lr': 0.0001817875426794986, 'samples': 17091840, 'steps': 89019, 'loss/train': 1.7669438123703003} 11/07/2021 09:43:09 - INFO - __main__ - Step 89021: {'lr': 0.00018178243730625242, 'samples': 17092032, 'steps': 89020, 'loss/train': 1.7768954038619995} 11/07/2021 09:43:09 - INFO - __main__ - Step 89022: {'lr': 0.00018177733196374408, 'samples': 17092224, 'steps': 89021, 'loss/train': 1.3015164136886597} 11/07/2021 09:43:09 - INFO - __main__ - Step 89023: {'lr': 0.0001817722266519759, 'samples': 17092416, 'steps': 89022, 'loss/train': 1.2658007144927979} 11/07/2021 09:43:10 - INFO - __main__ - Step 89024: {'lr': 0.00018176712137094996, 'samples': 17092608, 'steps': 89023, 'loss/train': 1.4479682445526123} 11/07/2021 09:43:10 - INFO - __main__ - Step 89025: {'lr': 0.00018176201612066874, 'samples': 17092800, 'steps': 89024, 'loss/train': 0.6621246337890625} 11/07/2021 09:43:11 - INFO - __main__ - Step 89026: {'lr': 0.0001817569109011345, 'samples': 17092992, 'steps': 89025, 'loss/train': 1.2696565389633179} 11/07/2021 09:43:12 - INFO - __main__ - Step 89027: {'lr': 0.0001817518057123495, 'samples': 17093184, 'steps': 89026, 'loss/train': 1.1287329196929932} 11/07/2021 09:43:12 - INFO - __main__ - Step 89028: {'lr': 0.00018174670055431613, 'samples': 17093376, 'steps': 89027, 'loss/train': 1.3897773027420044} 11/07/2021 09:43:12 - INFO - __main__ - Step 89029: {'lr': 0.00018174159542703664, 'samples': 17093568, 'steps': 89028, 'loss/train': 1.668404221534729} 11/07/2021 09:43:13 - INFO - __main__ - Step 89030: {'lr': 0.0001817364903305133, 'samples': 17093760, 'steps': 89029, 'loss/train': 0.6648353934288025} 11/07/2021 09:43:13 - INFO - __main__ - Step 89031: {'lr': 0.00018173138526474846, 'samples': 17093952, 'steps': 89030, 'loss/train': 1.0136128664016724} 11/07/2021 09:43:14 - INFO - __main__ - Step 89032: {'lr': 0.00018172628022974444, 'samples': 17094144, 'steps': 89031, 'loss/train': 2.0847744941711426} 11/07/2021 09:43:14 - INFO - __main__ - Step 89033: {'lr': 0.00018172117522550346, 'samples': 17094336, 'steps': 89032, 'loss/train': 1.6342905759811401} 11/07/2021 09:43:15 - INFO - __main__ - Step 89034: {'lr': 0.00018171607025202792, 'samples': 17094528, 'steps': 89033, 'loss/train': 1.0072808265686035} 11/07/2021 09:43:15 - INFO - __main__ - Step 89035: {'lr': 0.00018171096530932002, 'samples': 17094720, 'steps': 89034, 'loss/train': 1.6590402126312256} 11/07/2021 09:43:15 - INFO - __main__ - Step 89036: {'lr': 0.0001817058603973822, 'samples': 17094912, 'steps': 89035, 'loss/train': 1.4632761478424072} 11/07/2021 09:43:16 - INFO - __main__ - Step 89037: {'lr': 0.00018170075551621656, 'samples': 17095104, 'steps': 89036, 'loss/train': 1.3313255310058594} 11/07/2021 09:43:17 - INFO - __main__ - Step 89038: {'lr': 0.00018169565066582555, 'samples': 17095296, 'steps': 89037, 'loss/train': 1.4133660793304443} 11/07/2021 09:43:17 - INFO - __main__ - Step 89039: {'lr': 0.0001816905458462114, 'samples': 17095488, 'steps': 89038, 'loss/train': 0.4200151860713959} 11/07/2021 09:43:18 - INFO - __main__ - Step 89040: {'lr': 0.00018168544105737642, 'samples': 17095680, 'steps': 89039, 'loss/train': 1.2648301124572754} 11/07/2021 09:43:18 - INFO - __main__ - Step 89041: {'lr': 0.00018168033629932295, 'samples': 17095872, 'steps': 89040, 'loss/train': 1.2589185237884521} 11/07/2021 09:43:18 - INFO - __main__ - Step 89042: {'lr': 0.00018167523157205324, 'samples': 17096064, 'steps': 89041, 'loss/train': 1.3092782497406006} 11/07/2021 09:43:19 - INFO - __main__ - Step 89043: {'lr': 0.00018167012687556963, 'samples': 17096256, 'steps': 89042, 'loss/train': 1.4469884634017944} 11/07/2021 09:43:20 - INFO - __main__ - Step 89044: {'lr': 0.00018166502220987442, 'samples': 17096448, 'steps': 89043, 'loss/train': 1.1034015417099} 11/07/2021 09:43:20 - INFO - __main__ - Step 89045: {'lr': 0.0001816599175749699, 'samples': 17096640, 'steps': 89044, 'loss/train': 0.7041164636611938} 11/07/2021 09:43:20 - INFO - __main__ - Step 89046: {'lr': 0.00018165481297085834, 'samples': 17096832, 'steps': 89045, 'loss/train': 1.2194457054138184} 11/07/2021 09:43:21 - INFO - __main__ - Step 89047: {'lr': 0.00018164970839754208, 'samples': 17097024, 'steps': 89046, 'loss/train': 1.5603342056274414} 11/07/2021 09:43:22 - INFO - __main__ - Step 89048: {'lr': 0.00018164460385502345, 'samples': 17097216, 'steps': 89047, 'loss/train': 1.5598608255386353} 11/07/2021 09:43:22 - INFO - __main__ - Step 89049: {'lr': 0.00018163949934330467, 'samples': 17097408, 'steps': 89048, 'loss/train': 1.2056832313537598} 11/07/2021 09:43:22 - INFO - __main__ - Step 89050: {'lr': 0.00018163439486238814, 'samples': 17097600, 'steps': 89049, 'loss/train': 1.4614135026931763} 11/07/2021 09:43:23 - INFO - __main__ - Step 89051: {'lr': 0.000181629290412276, 'samples': 17097792, 'steps': 89050, 'loss/train': 1.672134280204773} 11/07/2021 09:43:23 - INFO - __main__ - Step 89052: {'lr': 0.0001816241859929707, 'samples': 17097984, 'steps': 89051, 'loss/train': 1.1269091367721558} 11/07/2021 09:43:25 - INFO - __main__ - Step 89053: {'lr': 0.00018161908160447442, 'samples': 17098176, 'steps': 89052, 'loss/train': 1.6161162853240967} 11/07/2021 09:43:25 - INFO - __main__ - Step 89054: {'lr': 0.00018161397724678958, 'samples': 17098368, 'steps': 89053, 'loss/train': 1.6359819173812866} 11/07/2021 09:43:25 - INFO - __main__ - Step 89055: {'lr': 0.00018160887291991844, 'samples': 17098560, 'steps': 89054, 'loss/train': 1.299730896949768} 11/07/2021 09:43:26 - INFO - __main__ - Step 89056: {'lr': 0.00018160376862386325, 'samples': 17098752, 'steps': 89055, 'loss/train': 1.462962031364441} 11/07/2021 09:43:26 - INFO - __main__ - Step 89057: {'lr': 0.00018159866435862634, 'samples': 17098944, 'steps': 89056, 'loss/train': 1.365218162536621} 11/07/2021 09:43:27 - INFO - __main__ - Step 89058: {'lr': 0.00018159356012421003, 'samples': 17099136, 'steps': 89057, 'loss/train': 1.661996603012085} 11/07/2021 09:43:27 - INFO - __main__ - Step 89059: {'lr': 0.00018158845592061669, 'samples': 17099328, 'steps': 89058, 'loss/train': 1.18598473072052} 11/07/2021 09:43:28 - INFO - __main__ - Step 89060: {'lr': 0.00018158335174784843, 'samples': 17099520, 'steps': 89059, 'loss/train': 1.2115412950515747} 11/07/2021 09:43:28 - INFO - __main__ - Step 89061: {'lr': 0.00018157824760590768, 'samples': 17099712, 'steps': 89060, 'loss/train': 0.9493670463562012} 11/07/2021 09:43:28 - INFO - __main__ - Step 89062: {'lr': 0.00018157314349479672, 'samples': 17099904, 'steps': 89061, 'loss/train': 1.240394115447998} 11/07/2021 09:43:29 - INFO - __main__ - Step 89063: {'lr': 0.00018156803941451788, 'samples': 17100096, 'steps': 89062, 'loss/train': 1.356257438659668} 11/07/2021 09:43:30 - INFO - __main__ - Step 89064: {'lr': 0.0001815629353650734, 'samples': 17100288, 'steps': 89063, 'loss/train': 1.2003419399261475} 11/07/2021 09:43:30 - INFO - __main__ - Step 89065: {'lr': 0.0001815578313464656, 'samples': 17100480, 'steps': 89064, 'loss/train': 1.59227454662323} 11/07/2021 09:43:30 - INFO - __main__ - Step 89066: {'lr': 0.00018155272735869676, 'samples': 17100672, 'steps': 89065, 'loss/train': 1.6005079746246338} 11/07/2021 09:43:31 - INFO - __main__ - Step 89067: {'lr': 0.00018154762340176923, 'samples': 17100864, 'steps': 89066, 'loss/train': 1.138648509979248} 11/07/2021 09:43:32 - INFO - __main__ - Step 89068: {'lr': 0.0001815425194756853, 'samples': 17101056, 'steps': 89067, 'loss/train': 1.253857970237732} 11/07/2021 09:43:32 - INFO - __main__ - Step 89069: {'lr': 0.00018153741558044723, 'samples': 17101248, 'steps': 89068, 'loss/train': 1.3693242073059082} 11/07/2021 09:43:33 - INFO - __main__ - Step 89070: {'lr': 0.00018153231171605738, 'samples': 17101440, 'steps': 89069, 'loss/train': 1.5288461446762085} 11/07/2021 09:43:33 - INFO - __main__ - Step 89071: {'lr': 0.000181527207882518, 'samples': 17101632, 'steps': 89070, 'loss/train': 1.1702195405960083} 11/07/2021 09:43:33 - INFO - __main__ - Step 89072: {'lr': 0.00018152210407983138, 'samples': 17101824, 'steps': 89071, 'loss/train': 1.0075534582138062} 11/07/2021 09:43:34 - INFO - __main__ - Step 89073: {'lr': 0.00018151700030799985, 'samples': 17102016, 'steps': 89072, 'loss/train': 1.3250911235809326} 11/07/2021 09:43:35 - INFO - __main__ - Step 89074: {'lr': 0.00018151189656702568, 'samples': 17102208, 'steps': 89073, 'loss/train': 0.16206897795200348} 11/07/2021 09:43:35 - INFO - __main__ - Step 89075: {'lr': 0.0001815067928569112, 'samples': 17102400, 'steps': 89074, 'loss/train': 1.4937139749526978} 11/07/2021 09:43:35 - INFO - __main__ - Step 89076: {'lr': 0.00018150168917765874, 'samples': 17102592, 'steps': 89075, 'loss/train': 1.5067023038864136} 11/07/2021 09:43:36 - INFO - __main__ - Step 89077: {'lr': 0.00018149658552927056, 'samples': 17102784, 'steps': 89076, 'loss/train': 1.137233853340149} 11/07/2021 09:43:36 - INFO - __main__ - Step 89078: {'lr': 0.00018149148191174896, 'samples': 17102976, 'steps': 89077, 'loss/train': 1.3702672719955444} 11/07/2021 09:43:37 - INFO - __main__ - Step 89079: {'lr': 0.0001814863783250962, 'samples': 17103168, 'steps': 89078, 'loss/train': 1.2543872594833374} 11/07/2021 09:43:38 - INFO - __main__ - Step 89080: {'lr': 0.00018148127476931463, 'samples': 17103360, 'steps': 89079, 'loss/train': 1.4660347700119019} 11/07/2021 09:43:38 - INFO - __main__ - Step 89081: {'lr': 0.00018147617124440662, 'samples': 17103552, 'steps': 89080, 'loss/train': 1.6170401573181152} 11/07/2021 09:43:38 - INFO - __main__ - Step 89082: {'lr': 0.00018147106775037432, 'samples': 17103744, 'steps': 89081, 'loss/train': 1.4742628335952759} 11/07/2021 09:43:39 - INFO - __main__ - Step 89083: {'lr': 0.00018146596428722013, 'samples': 17103936, 'steps': 89082, 'loss/train': 1.4033156633377075} 11/07/2021 09:43:40 - INFO - __main__ - Step 89084: {'lr': 0.00018146086085494626, 'samples': 17104128, 'steps': 89083, 'loss/train': 1.4667482376098633} 11/07/2021 09:43:40 - INFO - __main__ - Step 89085: {'lr': 0.00018145575745355508, 'samples': 17104320, 'steps': 89084, 'loss/train': 1.3863391876220703} 11/07/2021 09:43:40 - INFO - __main__ - Step 89086: {'lr': 0.0001814506540830489, 'samples': 17104512, 'steps': 89085, 'loss/train': 1.7497223615646362} 11/07/2021 09:43:41 - INFO - __main__ - Step 89087: {'lr': 0.00018144555074343, 'samples': 17104704, 'steps': 89086, 'loss/train': 1.4937827587127686} 11/07/2021 09:43:41 - INFO - __main__ - Step 89088: {'lr': 0.00018144044743470067, 'samples': 17104896, 'steps': 89087, 'loss/train': 1.0803495645523071} 11/07/2021 09:43:42 - INFO - __main__ - Step 89089: {'lr': 0.00018143534415686322, 'samples': 17105088, 'steps': 89088, 'loss/train': 1.5523663759231567} 11/07/2021 09:43:42 - INFO - __main__ - Step 89090: {'lr': 0.00018143024090992, 'samples': 17105280, 'steps': 89089, 'loss/train': 0.8379854559898376} 11/07/2021 09:43:43 - INFO - __main__ - Step 89091: {'lr': 0.0001814251376938732, 'samples': 17105472, 'steps': 89090, 'loss/train': 1.5879453420639038} 11/07/2021 09:43:43 - INFO - __main__ - Step 89092: {'lr': 0.0001814200345087252, 'samples': 17105664, 'steps': 89091, 'loss/train': 1.29557204246521} 11/07/2021 09:43:43 - INFO - __main__ - Step 89093: {'lr': 0.00018141493135447826, 'samples': 17105856, 'steps': 89092, 'loss/train': 0.6338412761688232} 11/07/2021 09:43:44 - INFO - __main__ - Step 89094: {'lr': 0.00018140982823113466, 'samples': 17106048, 'steps': 89093, 'loss/train': 1.2638404369354248} 11/07/2021 09:43:45 - INFO - __main__ - Step 89095: {'lr': 0.00018140472513869676, 'samples': 17106240, 'steps': 89094, 'loss/train': 1.7289336919784546} 11/07/2021 09:43:45 - INFO - __main__ - Step 89096: {'lr': 0.00018139962207716683, 'samples': 17106432, 'steps': 89095, 'loss/train': 1.5046390295028687} 11/07/2021 09:43:46 - INFO - __main__ - Step 89097: {'lr': 0.00018139451904654718, 'samples': 17106624, 'steps': 89096, 'loss/train': 1.638564944267273} 11/07/2021 09:43:46 - INFO - __main__ - Step 89098: {'lr': 0.00018138941604684005, 'samples': 17106816, 'steps': 89097, 'loss/train': 1.4380743503570557} 11/07/2021 09:43:46 - INFO - __main__ - Step 89099: {'lr': 0.00018138431307804784, 'samples': 17107008, 'steps': 89098, 'loss/train': 1.5047543048858643} 11/07/2021 09:43:47 - INFO - __main__ - Step 89100: {'lr': 0.00018137921014017277, 'samples': 17107200, 'steps': 89099, 'loss/train': 1.4435936212539673} 11/07/2021 09:43:48 - INFO - __main__ - Step 89101: {'lr': 0.0001813741072332172, 'samples': 17107392, 'steps': 89100, 'loss/train': 0.4973219335079193} 11/07/2021 09:43:48 - INFO - __main__ - Step 89102: {'lr': 0.0001813690043571834, 'samples': 17107584, 'steps': 89101, 'loss/train': 1.2109920978546143} 11/07/2021 09:43:48 - INFO - __main__ - Step 89103: {'lr': 0.00018136390151207376, 'samples': 17107776, 'steps': 89102, 'loss/train': 2.049574136734009} 11/07/2021 09:43:49 - INFO - __main__ - Step 89104: {'lr': 0.00018135879869789038, 'samples': 17107968, 'steps': 89103, 'loss/train': 1.363505482673645} 11/07/2021 09:43:49 - INFO - __main__ - Step 89105: {'lr': 0.00018135369591463566, 'samples': 17108160, 'steps': 89104, 'loss/train': 1.0512031316757202} 11/07/2021 09:43:50 - INFO - __main__ - Step 89106: {'lr': 0.0001813485931623119, 'samples': 17108352, 'steps': 89105, 'loss/train': 1.5043772459030151} 11/07/2021 09:43:50 - INFO - __main__ - Step 89107: {'lr': 0.0001813434904409214, 'samples': 17108544, 'steps': 89106, 'loss/train': 1.7411943674087524} 11/07/2021 09:43:51 - INFO - __main__ - Step 89108: {'lr': 0.00018133838775046652, 'samples': 17108736, 'steps': 89107, 'loss/train': 1.311635971069336} 11/07/2021 09:43:51 - INFO - __main__ - Step 89109: {'lr': 0.00018133328509094943, 'samples': 17108928, 'steps': 89108, 'loss/train': 1.7800850868225098} 11/07/2021 09:43:51 - INFO - __main__ - Step 89110: {'lr': 0.00018132818246237255, 'samples': 17109120, 'steps': 89109, 'loss/train': 1.1033456325531006} 11/07/2021 09:43:53 - INFO - __main__ - Step 89111: {'lr': 0.0001813230798647381, 'samples': 17109312, 'steps': 89110, 'loss/train': 1.5639082193374634} 11/07/2021 09:43:53 - INFO - __main__ - Step 89112: {'lr': 0.00018131797729804844, 'samples': 17109504, 'steps': 89111, 'loss/train': 1.4424690008163452} 11/07/2021 09:43:53 - INFO - __main__ - Step 89113: {'lr': 0.00018131287476230582, 'samples': 17109696, 'steps': 89112, 'loss/train': 1.483629584312439} 11/07/2021 09:43:54 - INFO - __main__ - Step 89114: {'lr': 0.00018130777225751254, 'samples': 17109888, 'steps': 89113, 'loss/train': 1.8136210441589355} 11/07/2021 09:43:54 - INFO - __main__ - Step 89115: {'lr': 0.00018130266978367096, 'samples': 17110080, 'steps': 89114, 'loss/train': 1.2643766403198242} 11/07/2021 09:43:55 - INFO - __main__ - Step 89116: {'lr': 0.00018129756734078334, 'samples': 17110272, 'steps': 89115, 'loss/train': 1.6442599296569824} 11/07/2021 09:43:55 - INFO - __main__ - Step 89117: {'lr': 0.00018129246492885203, 'samples': 17110464, 'steps': 89116, 'loss/train': 1.4207630157470703} 11/07/2021 09:43:56 - INFO - __main__ - Step 89118: {'lr': 0.0001812873625478792, 'samples': 17110656, 'steps': 89117, 'loss/train': 1.631670594215393} 11/07/2021 09:43:56 - INFO - __main__ - Step 89119: {'lr': 0.00018128226019786724, 'samples': 17110848, 'steps': 89118, 'loss/train': 1.7729612588882446} 11/07/2021 09:43:56 - INFO - __main__ - Step 89120: {'lr': 0.00018127715787881842, 'samples': 17111040, 'steps': 89119, 'loss/train': 1.158362627029419} 11/07/2021 09:43:57 - INFO - __main__ - Step 89121: {'lr': 0.00018127205559073507, 'samples': 17111232, 'steps': 89120, 'loss/train': 1.5059503316879272} 11/07/2021 09:43:58 - INFO - __main__ - Step 89122: {'lr': 0.00018126695333361943, 'samples': 17111424, 'steps': 89121, 'loss/train': 0.16342724859714508} 11/07/2021 09:43:58 - INFO - __main__ - Step 89123: {'lr': 0.00018126185110747383, 'samples': 17111616, 'steps': 89122, 'loss/train': 1.3198015689849854} 11/07/2021 09:43:58 - INFO - __main__ - Step 89124: {'lr': 0.00018125674891230064, 'samples': 17111808, 'steps': 89123, 'loss/train': 1.192307472229004} 11/07/2021 09:43:59 - INFO - __main__ - Step 89125: {'lr': 0.00018125164674810207, 'samples': 17112000, 'steps': 89124, 'loss/train': 1.6620899438858032} 11/07/2021 09:43:59 - INFO - __main__ - Step 89126: {'lr': 0.00018124654461488043, 'samples': 17112192, 'steps': 89125, 'loss/train': 2.149962902069092} 11/07/2021 09:44:00 - INFO - __main__ - Step 89127: {'lr': 0.00018124144251263809, 'samples': 17112384, 'steps': 89126, 'loss/train': 1.4606369733810425} 11/07/2021 09:44:01 - INFO - __main__ - Step 89128: {'lr': 0.00018123634044137722, 'samples': 17112576, 'steps': 89127, 'loss/train': 1.8652772903442383} 11/07/2021 09:44:01 - INFO - __main__ - Step 89129: {'lr': 0.00018123123840110023, 'samples': 17112768, 'steps': 89128, 'loss/train': 1.742761254310608} 11/07/2021 09:44:01 - INFO - __main__ - Step 89130: {'lr': 0.0001812261363918095, 'samples': 17112960, 'steps': 89129, 'loss/train': 1.2263250350952148} 11/07/2021 09:44:02 - INFO - __main__ - Step 89131: {'lr': 0.00018122103441350706, 'samples': 17113152, 'steps': 89130, 'loss/train': 1.5783226490020752} 11/07/2021 09:44:03 - INFO - __main__ - Step 89132: {'lr': 0.00018121593246619544, 'samples': 17113344, 'steps': 89131, 'loss/train': 1.2624372243881226} 11/07/2021 09:44:03 - INFO - __main__ - Step 89133: {'lr': 0.0001812108305498768, 'samples': 17113536, 'steps': 89132, 'loss/train': 1.6385302543640137} 11/07/2021 09:44:04 - INFO - __main__ - Step 89134: {'lr': 0.0001812057286645535, 'samples': 17113728, 'steps': 89133, 'loss/train': 1.686790943145752} 11/07/2021 09:44:04 - INFO - __main__ - Step 89135: {'lr': 0.00018120062681022787, 'samples': 17113920, 'steps': 89134, 'loss/train': 0.9676257371902466} 11/07/2021 09:44:04 - INFO - __main__ - Step 89136: {'lr': 0.00018119552498690214, 'samples': 17114112, 'steps': 89135, 'loss/train': 1.2181512117385864} 11/07/2021 09:44:05 - INFO - __main__ - Step 89137: {'lr': 0.00018119042319457868, 'samples': 17114304, 'steps': 89136, 'loss/train': 1.638714075088501} 11/07/2021 09:44:06 - INFO - __main__ - Step 89138: {'lr': 0.00018118532143325972, 'samples': 17114496, 'steps': 89137, 'loss/train': 0.7744029760360718} 11/07/2021 09:44:06 - INFO - __main__ - Step 89139: {'lr': 0.00018118021970294762, 'samples': 17114688, 'steps': 89138, 'loss/train': 1.5490663051605225} 11/07/2021 09:44:07 - INFO - __main__ - Step 89140: {'lr': 0.00018117511800364462, 'samples': 17114880, 'steps': 89139, 'loss/train': 1.3233121633529663} 11/07/2021 09:44:07 - INFO - __main__ - Step 89141: {'lr': 0.00018117001633535308, 'samples': 17115072, 'steps': 89140, 'loss/train': 1.4082201719284058} 11/07/2021 09:44:07 - INFO - __main__ - Step 89142: {'lr': 0.00018116491469807525, 'samples': 17115264, 'steps': 89141, 'loss/train': 1.4967561960220337} 11/07/2021 09:44:08 - INFO - __main__ - Step 89143: {'lr': 0.00018115981309181346, 'samples': 17115456, 'steps': 89142, 'loss/train': 1.8689111471176147} 11/07/2021 09:44:09 - INFO - __main__ - Step 89144: {'lr': 0.0001811547115165701, 'samples': 17115648, 'steps': 89143, 'loss/train': 1.0257701873779297} 11/07/2021 09:44:09 - INFO - __main__ - Step 89145: {'lr': 0.00018114960997234726, 'samples': 17115840, 'steps': 89144, 'loss/train': 1.2148767709732056} 11/07/2021 09:44:09 - INFO - __main__ - Step 89146: {'lr': 0.00018114450845914732, 'samples': 17116032, 'steps': 89145, 'loss/train': 1.0695812702178955} 11/07/2021 09:44:10 - INFO - __main__ - Step 89147: {'lr': 0.00018113940697697263, 'samples': 17116224, 'steps': 89146, 'loss/train': 0.9808873534202576} 11/07/2021 09:44:11 - INFO - __main__ - Step 89148: {'lr': 0.00018113430552582543, 'samples': 17116416, 'steps': 89147, 'loss/train': 1.4011621475219727} 11/07/2021 09:44:11 - INFO - __main__ - Step 89149: {'lr': 0.00018112920410570806, 'samples': 17116608, 'steps': 89148, 'loss/train': 1.1164002418518066} 11/07/2021 09:44:11 - INFO - __main__ - Step 89150: {'lr': 0.00018112410271662284, 'samples': 17116800, 'steps': 89149, 'loss/train': 1.0097033977508545} 11/07/2021 09:44:12 - INFO - __main__ - Step 89151: {'lr': 0.000181119001358572, 'samples': 17116992, 'steps': 89150, 'loss/train': 1.2314976453781128} 11/07/2021 09:44:12 - INFO - __main__ - Step 89152: {'lr': 0.00018111390003155788, 'samples': 17117184, 'steps': 89151, 'loss/train': 1.7862074375152588} 11/07/2021 09:44:13 - INFO - __main__ - Step 89153: {'lr': 0.00018110879873558278, 'samples': 17117376, 'steps': 89152, 'loss/train': 1.540093183517456} 11/07/2021 09:44:14 - INFO - __main__ - Step 89154: {'lr': 0.000181103697470649, 'samples': 17117568, 'steps': 89153, 'loss/train': 1.0497113466262817} 11/07/2021 09:44:14 - INFO - __main__ - Step 89155: {'lr': 0.00018109859623675884, 'samples': 17117760, 'steps': 89154, 'loss/train': 1.1979047060012817} 11/07/2021 09:44:14 - INFO - __main__ - Step 89156: {'lr': 0.00018109349503391456, 'samples': 17117952, 'steps': 89155, 'loss/train': 1.5275181531906128} 11/07/2021 09:44:15 - INFO - __main__ - Step 89157: {'lr': 0.0001810883938621186, 'samples': 17118144, 'steps': 89156, 'loss/train': 0.9036754369735718} 11/07/2021 09:44:16 - INFO - __main__ - Step 89158: {'lr': 0.0001810832927213731, 'samples': 17118336, 'steps': 89157, 'loss/train': 1.572872281074524} 11/07/2021 09:44:16 - INFO - __main__ - Step 89159: {'lr': 0.00018107819161168032, 'samples': 17118528, 'steps': 89158, 'loss/train': 0.9473255276679993} 11/07/2021 09:44:16 - INFO - __main__ - Step 89160: {'lr': 0.00018107309053304267, 'samples': 17118720, 'steps': 89159, 'loss/train': 1.2044837474822998} 11/07/2021 09:44:17 - INFO - __main__ - Step 89161: {'lr': 0.00018106798948546243, 'samples': 17118912, 'steps': 89160, 'loss/train': 0.8502820730209351} 11/07/2021 09:44:17 - INFO - __main__ - Step 89162: {'lr': 0.0001810628884689419, 'samples': 17119104, 'steps': 89161, 'loss/train': 1.484323263168335} 11/07/2021 09:44:18 - INFO - __main__ - Step 89163: {'lr': 0.00018105778748348333, 'samples': 17119296, 'steps': 89162, 'loss/train': 1.6791304349899292} 11/07/2021 09:44:18 - INFO - __main__ - Step 89164: {'lr': 0.0001810526865290891, 'samples': 17119488, 'steps': 89163, 'loss/train': 1.6102938652038574} 11/07/2021 09:44:19 - INFO - __main__ - Step 89165: {'lr': 0.00018104758560576146, 'samples': 17119680, 'steps': 89164, 'loss/train': 1.2775001525878906} 11/07/2021 09:44:19 - INFO - __main__ - Step 89166: {'lr': 0.0001810424847135027, 'samples': 17119872, 'steps': 89165, 'loss/train': 1.3773339986801147} 11/07/2021 09:44:19 - INFO - __main__ - Step 89167: {'lr': 0.00018103738385231514, 'samples': 17120064, 'steps': 89166, 'loss/train': 1.3506996631622314} 11/07/2021 09:44:20 - INFO - __main__ - Step 89168: {'lr': 0.00018103228302220108, 'samples': 17120256, 'steps': 89167, 'loss/train': 1.2341457605361938} 11/07/2021 09:44:21 - INFO - __main__ - Step 89169: {'lr': 0.00018102718222316277, 'samples': 17120448, 'steps': 89168, 'loss/train': 1.39107084274292} 11/07/2021 09:44:21 - INFO - __main__ - Step 89170: {'lr': 0.00018102208145520258, 'samples': 17120640, 'steps': 89169, 'loss/train': 1.086580514907837} 11/07/2021 09:44:21 - INFO - __main__ - Step 89171: {'lr': 0.00018101698071832287, 'samples': 17120832, 'steps': 89170, 'loss/train': 1.3796499967575073} 11/07/2021 09:44:22 - INFO - __main__ - Step 89172: {'lr': 0.00018101188001252576, 'samples': 17121024, 'steps': 89171, 'loss/train': 1.2957277297973633} 11/07/2021 09:44:23 - INFO - __main__ - Step 89173: {'lr': 0.00018100677933781362, 'samples': 17121216, 'steps': 89172, 'loss/train': 1.2450441122055054} 11/07/2021 09:44:23 - INFO - __main__ - Step 89174: {'lr': 0.00018100167869418874, 'samples': 17121408, 'steps': 89173, 'loss/train': 1.306411862373352} 11/07/2021 09:44:24 - INFO - __main__ - Step 89175: {'lr': 0.00018099657808165346, 'samples': 17121600, 'steps': 89174, 'loss/train': 1.4473415613174438} 11/07/2021 09:44:24 - INFO - __main__ - Step 89176: {'lr': 0.00018099147750021006, 'samples': 17121792, 'steps': 89175, 'loss/train': 1.5341755151748657} 11/07/2021 09:44:24 - INFO - __main__ - Step 89177: {'lr': 0.00018098637694986082, 'samples': 17121984, 'steps': 89176, 'loss/train': 1.4423961639404297} 11/07/2021 09:44:25 - INFO - __main__ - Step 89178: {'lr': 0.00018098127643060804, 'samples': 17122176, 'steps': 89177, 'loss/train': 1.113978624343872} 11/07/2021 09:44:26 - INFO - __main__ - Step 89179: {'lr': 0.00018097617594245408, 'samples': 17122368, 'steps': 89178, 'loss/train': 1.4490128755569458} 11/07/2021 09:44:26 - INFO - __main__ - Step 89180: {'lr': 0.00018097107548540115, 'samples': 17122560, 'steps': 89179, 'loss/train': 1.3282650709152222} 11/07/2021 09:44:26 - INFO - __main__ - Step 89181: {'lr': 0.0001809659750594516, 'samples': 17122752, 'steps': 89180, 'loss/train': 1.457247018814087} 11/07/2021 09:44:27 - INFO - __main__ - Step 89182: {'lr': 0.0001809608746646077, 'samples': 17122944, 'steps': 89181, 'loss/train': 1.2848156690597534} 11/07/2021 09:44:27 - INFO - __main__ - Step 89183: {'lr': 0.00018095577430087185, 'samples': 17123136, 'steps': 89182, 'loss/train': 1.3404817581176758} 11/07/2021 09:44:28 - INFO - __main__ - Step 89184: {'lr': 0.00018095067396824626, 'samples': 17123328, 'steps': 89183, 'loss/train': 1.226150631904602} 11/07/2021 09:44:28 - INFO - __main__ - Step 89185: {'lr': 0.00018094557366673313, 'samples': 17123520, 'steps': 89184, 'loss/train': 1.4283106327056885} 11/07/2021 09:44:29 - INFO - __main__ - Step 89186: {'lr': 0.0001809404733963349, 'samples': 17123712, 'steps': 89185, 'loss/train': 1.2337363958358765} 11/07/2021 09:44:29 - INFO - __main__ - Step 89187: {'lr': 0.00018093537315705383, 'samples': 17123904, 'steps': 89186, 'loss/train': 1.6017202138900757} 11/07/2021 09:44:29 - INFO - __main__ - Step 89188: {'lr': 0.0001809302729488922, 'samples': 17124096, 'steps': 89187, 'loss/train': 1.137083649635315} 11/07/2021 09:44:30 - INFO - __main__ - Step 89189: {'lr': 0.00018092517277185232, 'samples': 17124288, 'steps': 89188, 'loss/train': 1.0316358804702759} 11/07/2021 09:44:31 - INFO - __main__ - Step 89190: {'lr': 0.0001809200726259365, 'samples': 17124480, 'steps': 89189, 'loss/train': 1.0577417612075806} 11/07/2021 09:44:31 - INFO - __main__ - Step 89191: {'lr': 0.0001809149725111471, 'samples': 17124672, 'steps': 89190, 'loss/train': 1.539607286453247} 11/07/2021 09:44:32 - INFO - __main__ - Step 89192: {'lr': 0.00018090987242748625, 'samples': 17124864, 'steps': 89191, 'loss/train': 1.5449897050857544} 11/07/2021 09:44:32 - INFO - __main__ - Step 89193: {'lr': 0.00018090477237495638, 'samples': 17125056, 'steps': 89192, 'loss/train': 1.4880845546722412} 11/07/2021 09:44:32 - INFO - __main__ - Step 89194: {'lr': 0.00018089967235355978, 'samples': 17125248, 'steps': 89193, 'loss/train': 1.6276590824127197} 11/07/2021 09:44:33 - INFO - __main__ - Step 89195: {'lr': 0.0001808945723632987, 'samples': 17125440, 'steps': 89194, 'loss/train': 1.5745818614959717} 11/07/2021 09:44:34 - INFO - __main__ - Step 89196: {'lr': 0.00018088947240417545, 'samples': 17125632, 'steps': 89195, 'loss/train': 1.244607925415039} 11/07/2021 09:44:34 - INFO - __main__ - Step 89197: {'lr': 0.00018088437247619233, 'samples': 17125824, 'steps': 89196, 'loss/train': 1.2366950511932373} 11/07/2021 09:44:35 - INFO - __main__ - Step 89198: {'lr': 0.0001808792725793517, 'samples': 17126016, 'steps': 89197, 'loss/train': 0.17507781088352203} 11/07/2021 09:44:35 - INFO - __main__ - Step 89199: {'lr': 0.00018087417271365574, 'samples': 17126208, 'steps': 89198, 'loss/train': 0.6957067251205444} 11/07/2021 09:44:36 - INFO - __main__ - Step 89200: {'lr': 0.0001808690728791068, 'samples': 17126400, 'steps': 89199, 'loss/train': 1.758845329284668} 11/07/2021 09:44:36 - INFO - __main__ - Step 89201: {'lr': 0.00018086397307570724, 'samples': 17126592, 'steps': 89200, 'loss/train': 2.040661096572876} 11/07/2021 09:44:37 - INFO - __main__ - Step 89202: {'lr': 0.0001808588733034593, 'samples': 17126784, 'steps': 89201, 'loss/train': 1.756380558013916} 11/07/2021 09:44:37 - INFO - __main__ - Step 89203: {'lr': 0.00018085377356236526, 'samples': 17126976, 'steps': 89202, 'loss/train': 0.6599521040916443} 11/07/2021 09:44:37 - INFO - __main__ - Step 89204: {'lr': 0.00018084867385242742, 'samples': 17127168, 'steps': 89203, 'loss/train': 1.454344391822815} 11/07/2021 09:44:38 - INFO - __main__ - Step 89205: {'lr': 0.0001808435741736482, 'samples': 17127360, 'steps': 89204, 'loss/train': 1.5698322057724} 11/07/2021 09:44:39 - INFO - __main__ - Step 89206: {'lr': 0.00018083847452602972, 'samples': 17127552, 'steps': 89205, 'loss/train': 1.5212222337722778} 11/07/2021 09:44:39 - INFO - __main__ - Step 89207: {'lr': 0.00018083337490957437, 'samples': 17127744, 'steps': 89206, 'loss/train': 1.1187946796417236} 11/07/2021 09:44:39 - INFO - __main__ - Step 89208: {'lr': 0.00018082827532428443, 'samples': 17127936, 'steps': 89207, 'loss/train': 1.3085087537765503} 11/07/2021 09:44:40 - INFO - __main__ - Step 89209: {'lr': 0.0001808231757701622, 'samples': 17128128, 'steps': 89208, 'loss/train': 1.8027180433273315} 11/07/2021 09:44:40 - INFO - __main__ - Step 89210: {'lr': 0.00018081807624720998, 'samples': 17128320, 'steps': 89209, 'loss/train': 1.1211072206497192} 11/07/2021 09:44:41 - INFO - __main__ - Step 89211: {'lr': 0.0001808129767554301, 'samples': 17128512, 'steps': 89210, 'loss/train': 1.4491490125656128} 11/07/2021 09:44:41 - INFO - __main__ - Step 89212: {'lr': 0.0001808078772948248, 'samples': 17128704, 'steps': 89211, 'loss/train': 1.4485622644424438} 11/07/2021 09:44:42 - INFO - __main__ - Step 89213: {'lr': 0.0001808027778653964, 'samples': 17128896, 'steps': 89212, 'loss/train': 1.5984028577804565} 11/07/2021 09:44:42 - INFO - __main__ - Step 89214: {'lr': 0.00018079767846714717, 'samples': 17129088, 'steps': 89213, 'loss/train': 1.5107229948043823} 11/07/2021 09:44:42 - INFO - __main__ - Step 89215: {'lr': 0.00018079257910007945, 'samples': 17129280, 'steps': 89214, 'loss/train': 1.4256237745285034} 11/07/2021 09:44:44 - INFO - __main__ - Step 89216: {'lr': 0.00018078747976419562, 'samples': 17129472, 'steps': 89215, 'loss/train': 0.9481223225593567} 11/07/2021 09:44:44 - INFO - __main__ - Step 89217: {'lr': 0.0001807823804594978, 'samples': 17129664, 'steps': 89216, 'loss/train': 0.8825391530990601} 11/07/2021 09:44:44 - INFO - __main__ - Step 89218: {'lr': 0.00018077728118598836, 'samples': 17129856, 'steps': 89217, 'loss/train': 4.429328441619873} 11/07/2021 09:44:45 - INFO - __main__ - Step 89219: {'lr': 0.00018077218194366963, 'samples': 17130048, 'steps': 89218, 'loss/train': 1.5313670635223389} 11/07/2021 09:44:45 - INFO - __main__ - Step 89220: {'lr': 0.00018076708273254388, 'samples': 17130240, 'steps': 89219, 'loss/train': 1.0845390558242798} 11/07/2021 09:44:46 - INFO - __main__ - Step 89221: {'lr': 0.00018076198355261342, 'samples': 17130432, 'steps': 89220, 'loss/train': 1.2189663648605347} 11/07/2021 09:44:46 - INFO - __main__ - Step 89222: {'lr': 0.00018075688440388052, 'samples': 17130624, 'steps': 89221, 'loss/train': 1.4960908889770508} 11/07/2021 09:44:47 - INFO - __main__ - Step 89223: {'lr': 0.0001807517852863475, 'samples': 17130816, 'steps': 89222, 'loss/train': 3.089214563369751} 11/07/2021 09:44:47 - INFO - __main__ - Step 89224: {'lr': 0.00018074668620001672, 'samples': 17131008, 'steps': 89223, 'loss/train': 0.46342533826828003} 11/07/2021 09:44:47 - INFO - __main__ - Step 89225: {'lr': 0.00018074158714489037, 'samples': 17131200, 'steps': 89224, 'loss/train': 1.0465341806411743} 11/07/2021 09:44:49 - INFO - __main__ - Step 89226: {'lr': 0.00018073648812097086, 'samples': 17131392, 'steps': 89225, 'loss/train': 1.453278660774231} 11/07/2021 09:44:49 - INFO - __main__ - Step 89227: {'lr': 0.00018073138912826032, 'samples': 17131584, 'steps': 89226, 'loss/train': 0.817786455154419} 11/07/2021 09:44:50 - INFO - __main__ - Step 89228: {'lr': 0.00018072629016676117, 'samples': 17131776, 'steps': 89227, 'loss/train': 1.5488137006759644} 11/07/2021 09:44:50 - INFO - __main__ - Step 89229: {'lr': 0.00018072119123647572, 'samples': 17131968, 'steps': 89228, 'loss/train': 1.5630158185958862} 11/07/2021 09:44:50 - INFO - __main__ - Step 89230: {'lr': 0.00018071609233740618, 'samples': 17132160, 'steps': 89229, 'loss/train': 1.7503252029418945} 11/07/2021 09:44:51 - INFO - __main__ - Step 89231: {'lr': 0.00018071099346955494, 'samples': 17132352, 'steps': 89230, 'loss/train': 1.4933851957321167} 11/07/2021 09:44:52 - INFO - __main__ - Step 89232: {'lr': 0.00018070589463292422, 'samples': 17132544, 'steps': 89231, 'loss/train': 0.42800289392471313} 11/07/2021 09:44:52 - INFO - __main__ - Step 89233: {'lr': 0.00018070079582751636, 'samples': 17132736, 'steps': 89232, 'loss/train': 1.3902461528778076} 11/07/2021 09:44:52 - INFO - __main__ - Step 89234: {'lr': 0.00018069569705333365, 'samples': 17132928, 'steps': 89233, 'loss/train': 1.6854113340377808} 11/07/2021 09:44:53 - INFO - __main__ - Step 89235: {'lr': 0.00018069059831037843, 'samples': 17133120, 'steps': 89234, 'loss/train': 1.5179526805877686} 11/07/2021 09:44:53 - INFO - __main__ - Step 89236: {'lr': 0.00018068549959865293, 'samples': 17133312, 'steps': 89235, 'loss/train': 1.461196780204773} 11/07/2021 09:44:54 - INFO - __main__ - Step 89237: {'lr': 0.00018068040091815947, 'samples': 17133504, 'steps': 89236, 'loss/train': 1.7383719682693481} 11/07/2021 09:44:54 - INFO - __main__ - Step 89238: {'lr': 0.00018067530226890046, 'samples': 17133696, 'steps': 89237, 'loss/train': 1.3624745607376099} 11/07/2021 09:44:55 - INFO - __main__ - Step 89239: {'lr': 0.000180670203650878, 'samples': 17133888, 'steps': 89238, 'loss/train': 0.8965857028961182} 11/07/2021 09:44:55 - INFO - __main__ - Step 89240: {'lr': 0.00018066510506409446, 'samples': 17134080, 'steps': 89239, 'loss/train': 1.529890775680542} 11/07/2021 09:44:55 - INFO - __main__ - Step 89241: {'lr': 0.00018066000650855213, 'samples': 17134272, 'steps': 89240, 'loss/train': 1.3160308599472046} 11/07/2021 09:44:56 - INFO - __main__ - Step 89242: {'lr': 0.00018065490798425339, 'samples': 17134464, 'steps': 89241, 'loss/train': 0.9353503584861755} 11/07/2021 09:44:57 - INFO - __main__ - Step 89243: {'lr': 0.0001806498094912004, 'samples': 17134656, 'steps': 89242, 'loss/train': 1.2030998468399048} 11/07/2021 09:44:57 - INFO - __main__ - Step 89244: {'lr': 0.0001806447110293956, 'samples': 17134848, 'steps': 89243, 'loss/train': 1.102487325668335} 11/07/2021 09:44:58 - INFO - __main__ - Step 89245: {'lr': 0.00018063961259884122, 'samples': 17135040, 'steps': 89244, 'loss/train': 1.6855626106262207} 11/07/2021 09:44:58 - INFO - __main__ - Step 89246: {'lr': 0.00018063451419953952, 'samples': 17135232, 'steps': 89245, 'loss/train': 0.5273798108100891} 11/07/2021 09:44:59 - INFO - __main__ - Step 89247: {'lr': 0.0001806294158314929, 'samples': 17135424, 'steps': 89246, 'loss/train': 1.3977783918380737} 11/07/2021 09:44:59 - INFO - __main__ - Step 89248: {'lr': 0.00018062431749470354, 'samples': 17135616, 'steps': 89247, 'loss/train': 1.0845729112625122} 11/07/2021 09:45:00 - INFO - __main__ - Step 89249: {'lr': 0.00018061921918917378, 'samples': 17135808, 'steps': 89248, 'loss/train': 1.3324192762374878} 11/07/2021 09:45:00 - INFO - __main__ - Step 89250: {'lr': 0.00018061412091490597, 'samples': 17136000, 'steps': 89249, 'loss/train': 1.3923178911209106} 11/07/2021 09:45:00 - INFO - __main__ - Step 89251: {'lr': 0.0001806090226719025, 'samples': 17136192, 'steps': 89250, 'loss/train': 1.782203197479248} 11/07/2021 09:45:01 - INFO - __main__ - Step 89252: {'lr': 0.00018060392446016537, 'samples': 17136384, 'steps': 89251, 'loss/train': 1.3686705827713013} 11/07/2021 09:45:02 - INFO - __main__ - Step 89253: {'lr': 0.00018059882627969703, 'samples': 17136576, 'steps': 89252, 'loss/train': 1.5742852687835693} 11/07/2021 09:45:02 - INFO - __main__ - Step 89254: {'lr': 0.00018059372813049985, 'samples': 17136768, 'steps': 89253, 'loss/train': 0.9070223569869995} 11/07/2021 09:45:02 - INFO - __main__ - Step 89255: {'lr': 0.00018058863001257602, 'samples': 17136960, 'steps': 89254, 'loss/train': 1.9163297414779663} 11/07/2021 09:45:03 - INFO - __main__ - Step 89256: {'lr': 0.0001805835319259279, 'samples': 17137152, 'steps': 89255, 'loss/train': 1.2522273063659668} 11/07/2021 09:45:03 - INFO - __main__ - Step 89257: {'lr': 0.00018057843387055776, 'samples': 17137344, 'steps': 89256, 'loss/train': 1.1728712320327759} 11/07/2021 09:45:04 - INFO - __main__ - Step 89258: {'lr': 0.0001805733358464679, 'samples': 17137536, 'steps': 89257, 'loss/train': 1.4971373081207275} 11/07/2021 09:45:05 - INFO - __main__ - Step 89259: {'lr': 0.00018056823785366063, 'samples': 17137728, 'steps': 89258, 'loss/train': 1.148032546043396} 11/07/2021 09:45:05 - INFO - __main__ - Step 89260: {'lr': 0.00018056313989213825, 'samples': 17137920, 'steps': 89259, 'loss/train': 1.52675461769104} 11/07/2021 09:45:05 - INFO - __main__ - Step 89261: {'lr': 0.00018055804196190304, 'samples': 17138112, 'steps': 89260, 'loss/train': 1.5273576974868774} 11/07/2021 09:45:06 - INFO - __main__ - Step 89262: {'lr': 0.00018055294406295731, 'samples': 17138304, 'steps': 89261, 'loss/train': 1.796324372291565} 11/07/2021 09:45:06 - INFO - __main__ - Step 89263: {'lr': 0.00018054784619530334, 'samples': 17138496, 'steps': 89262, 'loss/train': 1.1993052959442139} 11/07/2021 09:45:07 - INFO - __main__ - Step 89264: {'lr': 0.00018054274835894345, 'samples': 17138688, 'steps': 89263, 'loss/train': 1.396931767463684} 11/07/2021 09:45:08 - INFO - __main__ - Step 89265: {'lr': 0.00018053765055388004, 'samples': 17138880, 'steps': 89264, 'loss/train': 1.0787544250488281} 11/07/2021 09:45:08 - INFO - __main__ - Step 89266: {'lr': 0.00018053255278011515, 'samples': 17139072, 'steps': 89265, 'loss/train': 1.36150324344635} 11/07/2021 09:45:08 - INFO - __main__ - Step 89267: {'lr': 0.00018052745503765124, 'samples': 17139264, 'steps': 89266, 'loss/train': 0.8862993717193604} 11/07/2021 09:45:09 - INFO - __main__ - Step 89268: {'lr': 0.0001805223573264906, 'samples': 17139456, 'steps': 89267, 'loss/train': 1.4163150787353516} 11/07/2021 09:45:09 - INFO - __main__ - Step 89269: {'lr': 0.0001805172596466355, 'samples': 17139648, 'steps': 89268, 'loss/train': 1.1493500471115112} 11/07/2021 09:45:10 - INFO - __main__ - Step 89270: {'lr': 0.00018051216199808828, 'samples': 17139840, 'steps': 89269, 'loss/train': 1.3732365369796753} 11/07/2021 09:45:10 - INFO - __main__ - Step 89271: {'lr': 0.00018050706438085118, 'samples': 17140032, 'steps': 89270, 'loss/train': 1.626245141029358} 11/07/2021 09:45:11 - INFO - __main__ - Step 89272: {'lr': 0.00018050196679492654, 'samples': 17140224, 'steps': 89271, 'loss/train': 1.2982239723205566} 11/07/2021 09:45:11 - INFO - __main__ - Step 89273: {'lr': 0.0001804968692403166, 'samples': 17140416, 'steps': 89272, 'loss/train': 1.1752524375915527} 11/07/2021 09:45:11 - INFO - __main__ - Step 89274: {'lr': 0.00018049177171702374, 'samples': 17140608, 'steps': 89273, 'loss/train': 1.7650972604751587} 11/07/2021 09:45:12 - INFO - __main__ - Step 89275: {'lr': 0.0001804866742250502, 'samples': 17140800, 'steps': 89274, 'loss/train': 1.1019408702850342} 11/07/2021 09:45:13 - INFO - __main__ - Step 89276: {'lr': 0.0001804815767643983, 'samples': 17140992, 'steps': 89275, 'loss/train': 1.210039734840393} 11/07/2021 09:45:13 - INFO - __main__ - Step 89277: {'lr': 0.00018047647933507033, 'samples': 17141184, 'steps': 89276, 'loss/train': 1.4938621520996094} 11/07/2021 09:45:13 - INFO - __main__ - Step 89278: {'lr': 0.0001804713819370687, 'samples': 17141376, 'steps': 89277, 'loss/train': 1.4686691761016846} 11/07/2021 09:45:14 - INFO - __main__ - Step 89279: {'lr': 0.00018046628457039544, 'samples': 17141568, 'steps': 89278, 'loss/train': 1.6706080436706543} 11/07/2021 09:45:15 - INFO - __main__ - Step 89280: {'lr': 0.00018046118723505304, 'samples': 17141760, 'steps': 89279, 'loss/train': 1.426232933998108} 11/07/2021 09:45:15 - INFO - __main__ - Step 89281: {'lr': 0.00018045608993104374, 'samples': 17141952, 'steps': 89280, 'loss/train': 1.3139455318450928} 11/07/2021 09:45:15 - INFO - __main__ - Step 89282: {'lr': 0.00018045099265836983, 'samples': 17142144, 'steps': 89281, 'loss/train': 0.8388586044311523} 11/07/2021 09:45:16 - INFO - __main__ - Step 89283: {'lr': 0.00018044589541703368, 'samples': 17142336, 'steps': 89282, 'loss/train': 0.2860471308231354} 11/07/2021 09:45:16 - INFO - __main__ - Step 89284: {'lr': 0.00018044079820703752, 'samples': 17142528, 'steps': 89283, 'loss/train': 0.8613105416297913} 11/07/2021 09:45:17 - INFO - __main__ - Step 89285: {'lr': 0.00018043570102838367, 'samples': 17142720, 'steps': 89284, 'loss/train': 1.4557441473007202} 11/07/2021 09:45:18 - INFO - __main__ - Step 89286: {'lr': 0.0001804306038810744, 'samples': 17142912, 'steps': 89285, 'loss/train': 1.3071478605270386} 11/07/2021 09:45:18 - INFO - __main__ - Step 89287: {'lr': 0.00018042550676511206, 'samples': 17143104, 'steps': 89286, 'loss/train': 1.3527045249938965} 11/07/2021 09:45:18 - INFO - __main__ - Step 89288: {'lr': 0.00018042040968049885, 'samples': 17143296, 'steps': 89287, 'loss/train': 1.4454541206359863} 11/07/2021 09:45:19 - INFO - __main__ - Step 89289: {'lr': 0.00018041531262723718, 'samples': 17143488, 'steps': 89288, 'loss/train': 1.5558325052261353} 11/07/2021 09:45:20 - INFO - __main__ - Step 89290: {'lr': 0.0001804102156053293, 'samples': 17143680, 'steps': 89289, 'loss/train': 1.8231114149093628} 11/07/2021 09:45:20 - INFO - __main__ - Step 89291: {'lr': 0.00018040511861477747, 'samples': 17143872, 'steps': 89290, 'loss/train': 1.114409327507019} 11/07/2021 09:45:20 - INFO - __main__ - Step 89292: {'lr': 0.00018040002165558414, 'samples': 17144064, 'steps': 89291, 'loss/train': 1.3526973724365234} 11/07/2021 09:45:21 - INFO - __main__ - Step 89293: {'lr': 0.00018039492472775138, 'samples': 17144256, 'steps': 89292, 'loss/train': 1.7139275074005127} 11/07/2021 09:45:21 - INFO - __main__ - Step 89294: {'lr': 0.00018038982783128162, 'samples': 17144448, 'steps': 89293, 'loss/train': 1.3044869899749756} 11/07/2021 09:45:22 - INFO - __main__ - Step 89295: {'lr': 0.00018038473096617709, 'samples': 17144640, 'steps': 89294, 'loss/train': 1.2276891469955444} 11/07/2021 09:45:22 - INFO - __main__ - Step 89296: {'lr': 0.00018037963413244012, 'samples': 17144832, 'steps': 89295, 'loss/train': 1.419340968132019} 11/07/2021 09:45:23 - INFO - __main__ - Step 89297: {'lr': 0.00018037453733007303, 'samples': 17145024, 'steps': 89296, 'loss/train': 0.7148557901382446} 11/07/2021 09:45:23 - INFO - __main__ - Step 89298: {'lr': 0.00018036944055907812, 'samples': 17145216, 'steps': 89297, 'loss/train': 1.7298213243484497} 11/07/2021 09:45:23 - INFO - __main__ - Step 89299: {'lr': 0.00018036434381945766, 'samples': 17145408, 'steps': 89298, 'loss/train': 0.7995774149894714} 11/07/2021 09:45:25 - INFO - __main__ - Step 89300: {'lr': 0.00018035924711121392, 'samples': 17145600, 'steps': 89299, 'loss/train': 1.2934387922286987} 11/07/2021 09:45:25 - INFO - __main__ - Step 89301: {'lr': 0.00018035415043434925, 'samples': 17145792, 'steps': 89300, 'loss/train': 1.4731998443603516} 11/07/2021 09:45:25 - INFO - __main__ - Step 89302: {'lr': 0.0001803490537888659, 'samples': 17145984, 'steps': 89301, 'loss/train': 1.5961427688598633} 11/07/2021 09:45:26 - INFO - __main__ - Step 89303: {'lr': 0.00018034395717476622, 'samples': 17146176, 'steps': 89302, 'loss/train': 1.5330572128295898} 11/07/2021 09:45:26 - INFO - __main__ - Step 89304: {'lr': 0.00018033886059205248, 'samples': 17146368, 'steps': 89303, 'loss/train': 1.3037734031677246} 11/07/2021 09:45:27 - INFO - __main__ - Step 89305: {'lr': 0.0001803337640407271, 'samples': 17146560, 'steps': 89304, 'loss/train': 1.4290704727172852} 11/07/2021 09:45:27 - INFO - __main__ - Step 89306: {'lr': 0.0001803286675207921, 'samples': 17146752, 'steps': 89305, 'loss/train': 1.3959742784500122} 11/07/2021 09:45:28 - INFO - __main__ - Step 89307: {'lr': 0.00018032357103224994, 'samples': 17146944, 'steps': 89306, 'loss/train': 1.5192151069641113} 11/07/2021 09:45:28 - INFO - __main__ - Step 89308: {'lr': 0.0001803184745751029, 'samples': 17147136, 'steps': 89307, 'loss/train': 1.5037508010864258} 11/07/2021 09:45:28 - INFO - __main__ - Step 89309: {'lr': 0.0001803133781493533, 'samples': 17147328, 'steps': 89308, 'loss/train': 0.905257523059845} 11/07/2021 09:45:29 - INFO - __main__ - Step 89310: {'lr': 0.00018030828175500342, 'samples': 17147520, 'steps': 89309, 'loss/train': 1.4057203531265259} 11/07/2021 09:45:30 - INFO - __main__ - Step 89311: {'lr': 0.00018030318539205553, 'samples': 17147712, 'steps': 89310, 'loss/train': 1.7389421463012695} 11/07/2021 09:45:30 - INFO - __main__ - Step 89312: {'lr': 0.00018029808906051196, 'samples': 17147904, 'steps': 89311, 'loss/train': 1.5167384147644043} 11/07/2021 09:45:31 - INFO - __main__ - Step 89313: {'lr': 0.00018029299276037497, 'samples': 17148096, 'steps': 89312, 'loss/train': 1.9453843832015991} 11/07/2021 09:45:31 - INFO - __main__ - Step 89314: {'lr': 0.00018028789649164693, 'samples': 17148288, 'steps': 89313, 'loss/train': 1.2773525714874268} 11/07/2021 09:45:31 - INFO - __main__ - Step 89315: {'lr': 0.00018028280025433007, 'samples': 17148480, 'steps': 89314, 'loss/train': 1.6427640914916992} 11/07/2021 09:45:32 - INFO - __main__ - Step 89316: {'lr': 0.0001802777040484267, 'samples': 17148672, 'steps': 89315, 'loss/train': 0.9274711608886719} 11/07/2021 09:45:33 - INFO - __main__ - Step 89317: {'lr': 0.00018027260787393918, 'samples': 17148864, 'steps': 89316, 'loss/train': 1.7577131986618042} 11/07/2021 09:45:33 - INFO - __main__ - Step 89318: {'lr': 0.00018026751173086966, 'samples': 17149056, 'steps': 89317, 'loss/train': 1.5334508419036865} 11/07/2021 09:45:33 - INFO - __main__ - Step 89319: {'lr': 0.00018026241561922062, 'samples': 17149248, 'steps': 89318, 'loss/train': 1.7722575664520264} 11/07/2021 09:45:34 - INFO - __main__ - Step 89320: {'lr': 0.00018025731953899416, 'samples': 17149440, 'steps': 89319, 'loss/train': 1.0991092920303345} 11/07/2021 09:45:35 - INFO - __main__ - Step 89321: {'lr': 0.0001802522234901927, 'samples': 17149632, 'steps': 89320, 'loss/train': 2.113126516342163} 11/07/2021 09:45:35 - INFO - __main__ - Step 89322: {'lr': 0.0001802471274728185, 'samples': 17149824, 'steps': 89321, 'loss/train': 1.3968133926391602} 11/07/2021 09:45:35 - INFO - __main__ - Step 89323: {'lr': 0.0001802420314868739, 'samples': 17150016, 'steps': 89322, 'loss/train': 1.3342041969299316} 11/07/2021 09:45:36 - INFO - __main__ - Step 89324: {'lr': 0.00018023693553236115, 'samples': 17150208, 'steps': 89323, 'loss/train': 1.2887217998504639} 11/07/2021 09:45:36 - INFO - __main__ - Step 89325: {'lr': 0.0001802318396092826, 'samples': 17150400, 'steps': 89324, 'loss/train': 1.2112733125686646} 11/07/2021 09:45:37 - INFO - __main__ - Step 89326: {'lr': 0.00018022674371764042, 'samples': 17150592, 'steps': 89325, 'loss/train': 1.341036319732666} 11/07/2021 09:45:37 - INFO - __main__ - Step 89327: {'lr': 0.00018022164785743704, 'samples': 17150784, 'steps': 89326, 'loss/train': 1.1336179971694946} 11/07/2021 09:45:38 - INFO - __main__ - Step 89328: {'lr': 0.00018021655202867478, 'samples': 17150976, 'steps': 89327, 'loss/train': 1.5199428796768188} 11/07/2021 09:45:38 - INFO - __main__ - Step 89329: {'lr': 0.00018021145623135575, 'samples': 17151168, 'steps': 89328, 'loss/train': 1.0824183225631714} 11/07/2021 09:45:38 - INFO - __main__ - Step 89330: {'lr': 0.00018020636046548244, 'samples': 17151360, 'steps': 89329, 'loss/train': 1.538689136505127} 11/07/2021 09:45:39 - INFO - __main__ - Step 89331: {'lr': 0.000180201264731057, 'samples': 17151552, 'steps': 89330, 'loss/train': 1.1004681587219238} 11/07/2021 09:45:40 - INFO - __main__ - Step 89332: {'lr': 0.0001801961690280819, 'samples': 17151744, 'steps': 89331, 'loss/train': 1.0878016948699951} 11/07/2021 09:45:40 - INFO - __main__ - Step 89333: {'lr': 0.00018019107335655925, 'samples': 17151936, 'steps': 89332, 'loss/train': 1.4164412021636963} 11/07/2021 09:45:40 - INFO - __main__ - Step 89334: {'lr': 0.00018018597771649142, 'samples': 17152128, 'steps': 89333, 'loss/train': 1.763588786125183} 11/07/2021 09:45:41 - INFO - __main__ - Step 89335: {'lr': 0.00018018088210788072, 'samples': 17152320, 'steps': 89334, 'loss/train': 1.3533116579055786} 11/07/2021 09:45:41 - INFO - __main__ - Step 89336: {'lr': 0.00018017578653072944, 'samples': 17152512, 'steps': 89335, 'loss/train': 0.988947331905365} 11/07/2021 09:45:42 - INFO - __main__ - Step 89337: {'lr': 0.00018017069098503986, 'samples': 17152704, 'steps': 89336, 'loss/train': 1.8571721315383911} 11/07/2021 09:45:43 - INFO - __main__ - Step 89338: {'lr': 0.0001801655954708143, 'samples': 17152896, 'steps': 89337, 'loss/train': 1.219069004058838} 11/07/2021 09:45:43 - INFO - __main__ - Step 89339: {'lr': 0.00018016049998805512, 'samples': 17153088, 'steps': 89338, 'loss/train': 1.1799843311309814} 11/07/2021 09:45:44 - INFO - __main__ - Step 89340: {'lr': 0.00018015540453676442, 'samples': 17153280, 'steps': 89339, 'loss/train': 1.41082763671875} 11/07/2021 09:45:44 - INFO - __main__ - Step 89341: {'lr': 0.00018015030911694468, 'samples': 17153472, 'steps': 89340, 'loss/train': 1.6261829137802124} 11/07/2021 09:45:45 - INFO - __main__ - Step 89342: {'lr': 0.0001801452137285981, 'samples': 17153664, 'steps': 89341, 'loss/train': 0.16413180530071259} 11/07/2021 09:45:45 - INFO - __main__ - Step 89343: {'lr': 0.00018014011837172702, 'samples': 17153856, 'steps': 89342, 'loss/train': 1.5152535438537598} 11/07/2021 09:45:46 - INFO - __main__ - Step 89344: {'lr': 0.00018013502304633372, 'samples': 17154048, 'steps': 89343, 'loss/train': 1.2737054824829102} 11/07/2021 09:45:46 - INFO - __main__ - Step 89345: {'lr': 0.00018012992775242058, 'samples': 17154240, 'steps': 89344, 'loss/train': 1.1384024620056152} 11/07/2021 09:45:46 - INFO - __main__ - Step 89346: {'lr': 0.00018012483248998974, 'samples': 17154432, 'steps': 89345, 'loss/train': 1.2355722188949585} 11/07/2021 09:45:47 - INFO - __main__ - Step 89347: {'lr': 0.00018011973725904357, 'samples': 17154624, 'steps': 89346, 'loss/train': 1.5756038427352905} 11/07/2021 09:45:48 - INFO - __main__ - Step 89348: {'lr': 0.0001801146420595844, 'samples': 17154816, 'steps': 89347, 'loss/train': 1.4596576690673828} 11/07/2021 09:45:48 - INFO - __main__ - Step 89349: {'lr': 0.00018010954689161445, 'samples': 17155008, 'steps': 89348, 'loss/train': 1.4761954545974731} 11/07/2021 09:45:48 - INFO - __main__ - Step 89350: {'lr': 0.00018010445175513612, 'samples': 17155200, 'steps': 89349, 'loss/train': 1.0752044916152954} 11/07/2021 09:45:49 - INFO - __main__ - Step 89351: {'lr': 0.0001800993566501516, 'samples': 17155392, 'steps': 89350, 'loss/train': 1.5746774673461914} 11/07/2021 09:45:50 - INFO - __main__ - Step 89352: {'lr': 0.00018009426157666324, 'samples': 17155584, 'steps': 89351, 'loss/train': 1.6922725439071655} 11/07/2021 09:45:50 - INFO - __main__ - Step 89353: {'lr': 0.00018008916653467334, 'samples': 17155776, 'steps': 89352, 'loss/train': 1.5499999523162842} 11/07/2021 09:45:50 - INFO - __main__ - Step 89354: {'lr': 0.00018008407152418415, 'samples': 17155968, 'steps': 89353, 'loss/train': 1.6213696002960205} 11/07/2021 09:45:51 - INFO - __main__ - Step 89355: {'lr': 0.000180078976545198, 'samples': 17156160, 'steps': 89354, 'loss/train': 1.5940637588500977} 11/07/2021 09:45:51 - INFO - __main__ - Step 89356: {'lr': 0.00018007388159771721, 'samples': 17156352, 'steps': 89355, 'loss/train': 1.6456267833709717} 11/07/2021 09:45:51 - INFO - __main__ - Step 89357: {'lr': 0.00018006878668174402, 'samples': 17156544, 'steps': 89356, 'loss/train': 1.3792634010314941} 11/07/2021 09:45:52 - INFO - __main__ - Step 89358: {'lr': 0.00018006369179728078, 'samples': 17156736, 'steps': 89357, 'loss/train': 1.1767816543579102} 11/07/2021 09:45:53 - INFO - __main__ - Step 89359: {'lr': 0.0001800585969443298, 'samples': 17156928, 'steps': 89358, 'loss/train': 1.243351697921753} 11/07/2021 09:45:53 - INFO - __main__ - Step 89360: {'lr': 0.0001800535021228933, 'samples': 17157120, 'steps': 89359, 'loss/train': 1.4352871179580688} 11/07/2021 09:45:53 - INFO - __main__ - Step 89361: {'lr': 0.00018004840733297365, 'samples': 17157312, 'steps': 89360, 'loss/train': 1.1198047399520874} 11/07/2021 09:45:54 - INFO - __main__ - Step 89362: {'lr': 0.00018004331257457306, 'samples': 17157504, 'steps': 89361, 'loss/train': 1.2072848081588745} 11/07/2021 09:45:55 - INFO - __main__ - Step 89363: {'lr': 0.00018003821784769386, 'samples': 17157696, 'steps': 89362, 'loss/train': 1.3424065113067627} 11/07/2021 09:45:55 - INFO - __main__ - Step 89364: {'lr': 0.0001800331231523384, 'samples': 17157888, 'steps': 89363, 'loss/train': 1.3568594455718994} 11/07/2021 09:45:56 - INFO - __main__ - Step 89365: {'lr': 0.0001800280284885089, 'samples': 17158080, 'steps': 89364, 'loss/train': 1.2680134773254395} 11/07/2021 09:45:56 - INFO - __main__ - Step 89366: {'lr': 0.0001800229338562077, 'samples': 17158272, 'steps': 89365, 'loss/train': 1.1399242877960205} 11/07/2021 09:45:56 - INFO - __main__ - Step 89367: {'lr': 0.00018001783925543707, 'samples': 17158464, 'steps': 89366, 'loss/train': 0.34224915504455566} 11/07/2021 09:45:58 - INFO - __main__ - Step 89368: {'lr': 0.00018001274468619933, 'samples': 17158656, 'steps': 89367, 'loss/train': 0.9364262223243713} 11/07/2021 09:45:58 - INFO - __main__ - Step 89369: {'lr': 0.0001800076501484968, 'samples': 17158848, 'steps': 89368, 'loss/train': 1.596136212348938} 11/07/2021 09:45:58 - INFO - __main__ - Step 89370: {'lr': 0.0001800025556423317, 'samples': 17159040, 'steps': 89369, 'loss/train': 1.1579614877700806} 11/07/2021 09:45:59 - INFO - __main__ - Step 89371: {'lr': 0.0001799974611677064, 'samples': 17159232, 'steps': 89370, 'loss/train': 1.4910850524902344} 11/07/2021 09:45:59 - INFO - __main__ - Step 89372: {'lr': 0.00017999236672462326, 'samples': 17159424, 'steps': 89371, 'loss/train': 1.332191824913025} 11/07/2021 09:45:59 - INFO - __main__ - Step 89373: {'lr': 0.00017998727231308438, 'samples': 17159616, 'steps': 89372, 'loss/train': 1.2917912006378174} 11/07/2021 09:46:00 - INFO - __main__ - Step 89374: {'lr': 0.00017998217793309214, 'samples': 17159808, 'steps': 89373, 'loss/train': 1.2371283769607544} 11/07/2021 09:46:01 - INFO - __main__ - Step 89375: {'lr': 0.0001799770835846488, 'samples': 17160000, 'steps': 89374, 'loss/train': 1.4520998001098633} 11/07/2021 09:46:01 - INFO - __main__ - Step 89376: {'lr': 0.00017997198926775679, 'samples': 17160192, 'steps': 89375, 'loss/train': 1.1183449029922485} 11/07/2021 09:46:01 - INFO - __main__ - Step 89377: {'lr': 0.0001799668949824183, 'samples': 17160384, 'steps': 89376, 'loss/train': 1.5601954460144043} 11/07/2021 09:46:02 - INFO - __main__ - Step 89378: {'lr': 0.00017996180072863563, 'samples': 17160576, 'steps': 89377, 'loss/train': 0.9946852326393127} 11/07/2021 09:46:03 - INFO - __main__ - Step 89379: {'lr': 0.0001799567065064111, 'samples': 17160768, 'steps': 89378, 'loss/train': 1.5560187101364136} 11/07/2021 09:46:03 - INFO - __main__ - Step 89380: {'lr': 0.000179951612315747, 'samples': 17160960, 'steps': 89379, 'loss/train': 1.3210946321487427} 11/07/2021 09:46:04 - INFO - __main__ - Step 89381: {'lr': 0.00017994651815664563, 'samples': 17161152, 'steps': 89380, 'loss/train': 1.6441333293914795} 11/07/2021 09:46:04 - INFO - __main__ - Step 89382: {'lr': 0.00017994142402910925, 'samples': 17161344, 'steps': 89381, 'loss/train': 1.3052047491073608} 11/07/2021 09:46:04 - INFO - __main__ - Step 89383: {'lr': 0.0001799363299331402, 'samples': 17161536, 'steps': 89382, 'loss/train': 1.6167502403259277} 11/07/2021 09:46:05 - INFO - __main__ - Step 89384: {'lr': 0.00017993123586874078, 'samples': 17161728, 'steps': 89383, 'loss/train': 1.7503901720046997} 11/07/2021 09:46:06 - INFO - __main__ - Step 89385: {'lr': 0.00017992614183591322, 'samples': 17161920, 'steps': 89384, 'loss/train': 1.2377684116363525} 11/07/2021 09:46:06 - INFO - __main__ - Step 89386: {'lr': 0.00017992104783466, 'samples': 17162112, 'steps': 89385, 'loss/train': 1.1946920156478882} 11/07/2021 09:46:06 - INFO - __main__ - Step 89387: {'lr': 0.00017991595386498315, 'samples': 17162304, 'steps': 89386, 'loss/train': 0.8261470794677734} 11/07/2021 09:46:07 - INFO - __main__ - Step 89388: {'lr': 0.0001799108599268851, 'samples': 17162496, 'steps': 89387, 'loss/train': 1.6863898038864136} 11/07/2021 09:46:08 - INFO - __main__ - Step 89389: {'lr': 0.00017990576602036813, 'samples': 17162688, 'steps': 89388, 'loss/train': 0.941268801689148} 11/07/2021 09:46:08 - INFO - __main__ - Step 89390: {'lr': 0.00017990067214543453, 'samples': 17162880, 'steps': 89389, 'loss/train': 0.9991438984870911} 11/07/2021 09:46:08 - INFO - __main__ - Step 89391: {'lr': 0.00017989557830208665, 'samples': 17163072, 'steps': 89390, 'loss/train': 1.033560872077942} 11/07/2021 09:46:09 - INFO - __main__ - Step 89392: {'lr': 0.0001798904844903267, 'samples': 17163264, 'steps': 89391, 'loss/train': 0.9444857835769653} 11/07/2021 09:46:09 - INFO - __main__ - Step 89393: {'lr': 0.000179885390710157, 'samples': 17163456, 'steps': 89392, 'loss/train': 1.2380728721618652} 11/07/2021 09:46:09 - INFO - __main__ - Step 89394: {'lr': 0.00017988029696157986, 'samples': 17163648, 'steps': 89393, 'loss/train': 1.7143983840942383} 11/07/2021 09:46:11 - INFO - __main__ - Step 89395: {'lr': 0.0001798752032445976, 'samples': 17163840, 'steps': 89394, 'loss/train': 1.438801646232605} 11/07/2021 09:46:11 - INFO - __main__ - Step 89396: {'lr': 0.0001798701095592125, 'samples': 17164032, 'steps': 89395, 'loss/train': 1.5760802030563354} 11/07/2021 09:46:11 - INFO - __main__ - Step 89397: {'lr': 0.00017986501590542688, 'samples': 17164224, 'steps': 89396, 'loss/train': 0.9444239139556885} 11/07/2021 09:46:12 - INFO - __main__ - Step 89398: {'lr': 0.00017985992228324293, 'samples': 17164416, 'steps': 89397, 'loss/train': 1.3320674896240234} 11/07/2021 09:46:12 - INFO - __main__ - Step 89399: {'lr': 0.00017985482869266315, 'samples': 17164608, 'steps': 89398, 'loss/train': 1.5839215517044067} 11/07/2021 09:46:13 - INFO - __main__ - Step 89400: {'lr': 0.0001798497351336896, 'samples': 17164800, 'steps': 89399, 'loss/train': 1.3267176151275635} 11/07/2021 09:46:13 - INFO - __main__ - Step 89401: {'lr': 0.00017984464160632468, 'samples': 17164992, 'steps': 89400, 'loss/train': 1.4389185905456543} 11/07/2021 09:46:14 - INFO - __main__ - Step 89402: {'lr': 0.00017983954811057068, 'samples': 17165184, 'steps': 89401, 'loss/train': 1.3924312591552734} 11/07/2021 09:46:14 - INFO - __main__ - Step 89403: {'lr': 0.00017983445464642988, 'samples': 17165376, 'steps': 89402, 'loss/train': 1.5931888818740845} 11/07/2021 09:46:14 - INFO - __main__ - Step 89404: {'lr': 0.0001798293612139046, 'samples': 17165568, 'steps': 89403, 'loss/train': 0.9312685132026672} 11/07/2021 09:46:15 - INFO - __main__ - Step 89405: {'lr': 0.00017982426781299715, 'samples': 17165760, 'steps': 89404, 'loss/train': 0.6159845590591431} 11/07/2021 09:46:16 - INFO - __main__ - Step 89406: {'lr': 0.00017981917444370976, 'samples': 17165952, 'steps': 89405, 'loss/train': 0.7507354617118835} 11/07/2021 09:46:16 - INFO - __main__ - Step 89407: {'lr': 0.0001798140811060448, 'samples': 17166144, 'steps': 89406, 'loss/train': 1.5459738969802856} 11/07/2021 09:46:16 - INFO - __main__ - Step 89408: {'lr': 0.00017980898780000455, 'samples': 17166336, 'steps': 89407, 'loss/train': 1.6603527069091797} 11/07/2021 09:46:17 - INFO - __main__ - Step 89409: {'lr': 0.00017980389452559124, 'samples': 17166528, 'steps': 89408, 'loss/train': 1.5408291816711426} 11/07/2021 09:46:18 - INFO - __main__ - Step 89410: {'lr': 0.00017979880128280722, 'samples': 17166720, 'steps': 89409, 'loss/train': 1.8006508350372314} 11/07/2021 09:46:18 - INFO - __main__ - Step 89411: {'lr': 0.00017979370807165478, 'samples': 17166912, 'steps': 89410, 'loss/train': 1.2914299964904785} 11/07/2021 09:46:19 - INFO - __main__ - Step 89412: {'lr': 0.00017978861489213624, 'samples': 17167104, 'steps': 89411, 'loss/train': 1.4082973003387451} 11/07/2021 09:46:19 - INFO - __main__ - Step 89413: {'lr': 0.00017978352174425393, 'samples': 17167296, 'steps': 89412, 'loss/train': 0.4266877770423889} 11/07/2021 09:46:19 - INFO - __main__ - Step 89414: {'lr': 0.00017977842862801002, 'samples': 17167488, 'steps': 89413, 'loss/train': 1.8388689756393433} 11/07/2021 09:46:20 - INFO - __main__ - Step 89415: {'lr': 0.00017977333554340685, 'samples': 17167680, 'steps': 89414, 'loss/train': 1.2583420276641846} 11/07/2021 09:46:21 - INFO - __main__ - Step 89416: {'lr': 0.0001797682424904467, 'samples': 17167872, 'steps': 89415, 'loss/train': 1.6512051820755005} 11/07/2021 09:46:21 - INFO - __main__ - Step 89417: {'lr': 0.00017976314946913197, 'samples': 17168064, 'steps': 89416, 'loss/train': 1.7424434423446655} 11/07/2021 09:46:21 - INFO - __main__ - Step 89418: {'lr': 0.0001797580564794648, 'samples': 17168256, 'steps': 89417, 'loss/train': 1.2305786609649658} 11/07/2021 09:46:22 - INFO - __main__ - Step 89419: {'lr': 0.0001797529635214476, 'samples': 17168448, 'steps': 89418, 'loss/train': 1.698091983795166} 11/07/2021 09:46:24 - INFO - __main__ - Step 89420: {'lr': 0.00017974787059508264, 'samples': 17168640, 'steps': 89419, 'loss/train': 0.9222173690795898} 11/07/2021 09:46:24 - INFO - __main__ - Step 89421: {'lr': 0.0001797427777003722, 'samples': 17168832, 'steps': 89420, 'loss/train': 1.4745134115219116} 11/07/2021 09:46:24 - INFO - __main__ - Step 89422: {'lr': 0.0001797376848373186, 'samples': 17169024, 'steps': 89421, 'loss/train': 1.013573169708252} 11/07/2021 09:46:25 - INFO - __main__ - Step 89423: {'lr': 0.00017973259200592407, 'samples': 17169216, 'steps': 89422, 'loss/train': 1.5614068508148193} 11/07/2021 09:46:25 - INFO - __main__ - Step 89424: {'lr': 0.00017972749920619097, 'samples': 17169408, 'steps': 89423, 'loss/train': 1.550581932067871} 11/07/2021 09:46:25 - INFO - __main__ - Step 89425: {'lr': 0.0001797224064381216, 'samples': 17169600, 'steps': 89424, 'loss/train': 1.7632750272750854} 11/07/2021 09:46:26 - INFO - __main__ - Step 89426: {'lr': 0.00017971731370171828, 'samples': 17169792, 'steps': 89425, 'loss/train': 1.7783054113388062} 11/07/2021 09:46:27 - INFO - __main__ - Step 89427: {'lr': 0.0001797122209969832, 'samples': 17169984, 'steps': 89426, 'loss/train': 1.6191215515136719} 11/07/2021 09:46:27 - INFO - __main__ - Step 89428: {'lr': 0.00017970712832391866, 'samples': 17170176, 'steps': 89427, 'loss/train': 1.5450060367584229} 11/07/2021 09:46:28 - INFO - __main__ - Step 89429: {'lr': 0.00017970203568252704, 'samples': 17170368, 'steps': 89428, 'loss/train': 1.4580453634262085} 11/07/2021 09:46:28 - INFO - __main__ - Step 89430: {'lr': 0.0001796969430728106, 'samples': 17170560, 'steps': 89429, 'loss/train': 1.3579134941101074} 11/07/2021 09:46:28 - INFO - __main__ - Step 89431: {'lr': 0.0001796918504947716, 'samples': 17170752, 'steps': 89430, 'loss/train': 1.6856603622436523} 11/07/2021 09:46:29 - INFO - __main__ - Step 89432: {'lr': 0.00017968675794841242, 'samples': 17170944, 'steps': 89431, 'loss/train': 0.7299540638923645} 11/07/2021 09:46:30 - INFO - __main__ - Step 89433: {'lr': 0.00017968166543373527, 'samples': 17171136, 'steps': 89432, 'loss/train': 1.4559482336044312} 11/07/2021 09:46:30 - INFO - __main__ - Step 89434: {'lr': 0.00017967657295074247, 'samples': 17171328, 'steps': 89433, 'loss/train': 1.1937588453292847} 11/07/2021 09:46:30 - INFO - __main__ - Step 89435: {'lr': 0.00017967148049943634, 'samples': 17171520, 'steps': 89434, 'loss/train': 1.650261402130127} 11/07/2021 09:46:31 - INFO - __main__ - Step 89436: {'lr': 0.0001796663880798191, 'samples': 17171712, 'steps': 89435, 'loss/train': 1.2078907489776611} 11/07/2021 09:46:32 - INFO - __main__ - Step 89437: {'lr': 0.00017966129569189316, 'samples': 17171904, 'steps': 89436, 'loss/train': 1.637515902519226} 11/07/2021 09:46:32 - INFO - __main__ - Step 89438: {'lr': 0.00017965620333566074, 'samples': 17172096, 'steps': 89437, 'loss/train': 1.6087448596954346} 11/07/2021 09:46:32 - INFO - __main__ - Step 89439: {'lr': 0.00017965111101112417, 'samples': 17172288, 'steps': 89438, 'loss/train': 1.0564850568771362} 11/07/2021 09:46:33 - INFO - __main__ - Step 89440: {'lr': 0.00017964601871828579, 'samples': 17172480, 'steps': 89439, 'loss/train': 1.0585922002792358} 11/07/2021 09:46:33 - INFO - __main__ - Step 89441: {'lr': 0.00017964092645714774, 'samples': 17172672, 'steps': 89440, 'loss/train': 1.5841948986053467} 11/07/2021 09:46:34 - INFO - __main__ - Step 89442: {'lr': 0.0001796358342277124, 'samples': 17172864, 'steps': 89441, 'loss/train': 1.387900710105896} 11/07/2021 09:46:35 - INFO - __main__ - Step 89443: {'lr': 0.0001796307420299821, 'samples': 17173056, 'steps': 89442, 'loss/train': 1.301877498626709} 11/07/2021 09:46:35 - INFO - __main__ - Step 89444: {'lr': 0.00017962564986395908, 'samples': 17173248, 'steps': 89443, 'loss/train': 1.4634194374084473} 11/07/2021 09:46:35 - INFO - __main__ - Step 89445: {'lr': 0.00017962055772964563, 'samples': 17173440, 'steps': 89444, 'loss/train': 1.523655891418457} 11/07/2021 09:46:36 - INFO - __main__ - Step 89446: {'lr': 0.00017961546562704405, 'samples': 17173632, 'steps': 89445, 'loss/train': 1.2552049160003662} 11/07/2021 09:46:36 - INFO - __main__ - Step 89447: {'lr': 0.00017961037355615673, 'samples': 17173824, 'steps': 89446, 'loss/train': 1.393973708152771} 11/07/2021 09:46:37 - INFO - __main__ - Step 89448: {'lr': 0.00017960528151698586, 'samples': 17174016, 'steps': 89447, 'loss/train': 1.2567046880722046} 11/07/2021 09:46:37 - INFO - __main__ - Step 89449: {'lr': 0.00017960018950953375, 'samples': 17174208, 'steps': 89448, 'loss/train': 1.3458960056304932} 11/07/2021 09:46:38 - INFO - __main__ - Step 89450: {'lr': 0.0001795950975338027, 'samples': 17174400, 'steps': 89449, 'loss/train': 1.611803650856018} 11/07/2021 09:46:38 - INFO - __main__ - Step 89451: {'lr': 0.00017959000558979505, 'samples': 17174592, 'steps': 89450, 'loss/train': 1.5637456178665161} 11/07/2021 09:46:38 - INFO - __main__ - Step 89452: {'lr': 0.00017958491367751306, 'samples': 17174784, 'steps': 89451, 'loss/train': 1.4051830768585205} 11/07/2021 09:46:39 - INFO - __main__ - Step 89453: {'lr': 0.0001795798217969591, 'samples': 17174976, 'steps': 89452, 'loss/train': 0.648826003074646} 11/07/2021 09:46:40 - INFO - __main__ - Step 89454: {'lr': 0.00017957472994813525, 'samples': 17175168, 'steps': 89453, 'loss/train': 1.4527746438980103} 11/07/2021 09:46:40 - INFO - __main__ - Step 89455: {'lr': 0.000179569638131044, 'samples': 17175360, 'steps': 89454, 'loss/train': 1.5702425241470337} 11/07/2021 09:46:40 - INFO - __main__ - Step 89456: {'lr': 0.00017956454634568753, 'samples': 17175552, 'steps': 89455, 'loss/train': 0.7198446989059448} 11/07/2021 09:46:41 - INFO - __main__ - Step 89457: {'lr': 0.0001795594545920682, 'samples': 17175744, 'steps': 89456, 'loss/train': 1.2328342199325562} 11/07/2021 09:46:42 - INFO - __main__ - Step 89458: {'lr': 0.00017955436287018833, 'samples': 17175936, 'steps': 89457, 'loss/train': 1.5245494842529297} 11/07/2021 09:46:42 - INFO - __main__ - Step 89459: {'lr': 0.00017954927118005016, 'samples': 17176128, 'steps': 89458, 'loss/train': 1.354609489440918} 11/07/2021 09:46:43 - INFO - __main__ - Step 89460: {'lr': 0.00017954417952165596, 'samples': 17176320, 'steps': 89459, 'loss/train': 1.4522467851638794} 11/07/2021 09:46:43 - INFO - __main__ - Step 89461: {'lr': 0.0001795390878950081, 'samples': 17176512, 'steps': 89460, 'loss/train': 1.5608952045440674} 11/07/2021 09:46:43 - INFO - __main__ - Step 89462: {'lr': 0.0001795339963001089, 'samples': 17176704, 'steps': 89461, 'loss/train': 1.6424187421798706} 11/07/2021 09:46:44 - INFO - __main__ - Step 89463: {'lr': 0.00017952890473696054, 'samples': 17176896, 'steps': 89462, 'loss/train': 0.4676416516304016} 11/07/2021 09:46:45 - INFO - __main__ - Step 89464: {'lr': 0.00017952381320556537, 'samples': 17177088, 'steps': 89463, 'loss/train': 1.553939700126648} 11/07/2021 09:46:45 - INFO - __main__ - Step 89465: {'lr': 0.0001795187217059257, 'samples': 17177280, 'steps': 89464, 'loss/train': 0.7049587965011597} 11/07/2021 09:46:46 - INFO - __main__ - Step 89466: {'lr': 0.00017951363023804381, 'samples': 17177472, 'steps': 89465, 'loss/train': 1.2948514223098755} 11/07/2021 09:46:46 - INFO - __main__ - Step 89467: {'lr': 0.00017950853880192196, 'samples': 17177664, 'steps': 89466, 'loss/train': 1.3989591598510742} 11/07/2021 09:46:46 - INFO - __main__ - Step 89468: {'lr': 0.00017950344739756248, 'samples': 17177856, 'steps': 89467, 'loss/train': 0.3455002009868622} 11/07/2021 09:46:47 - INFO - __main__ - Step 89469: {'lr': 0.00017949835602496767, 'samples': 17178048, 'steps': 89468, 'loss/train': 1.4811815023422241} 11/07/2021 09:46:48 - INFO - __main__ - Step 89470: {'lr': 0.00017949326468413978, 'samples': 17178240, 'steps': 89469, 'loss/train': 1.3897876739501953} 11/07/2021 09:46:48 - INFO - __main__ - Step 89471: {'lr': 0.00017948817337508116, 'samples': 17178432, 'steps': 89470, 'loss/train': 1.254757046699524} 11/07/2021 09:46:48 - INFO - __main__ - Step 89472: {'lr': 0.00017948308209779406, 'samples': 17178624, 'steps': 89471, 'loss/train': 1.7673804759979248} 11/07/2021 09:46:49 - INFO - __main__ - Step 89473: {'lr': 0.00017947799085228088, 'samples': 17178816, 'steps': 89472, 'loss/train': 1.397950530052185} 11/07/2021 09:46:50 - INFO - __main__ - Step 89474: {'lr': 0.0001794728996385438, 'samples': 17179008, 'steps': 89473, 'loss/train': 1.4893925189971924} 11/07/2021 09:46:50 - INFO - __main__ - Step 89475: {'lr': 0.0001794678084565851, 'samples': 17179200, 'steps': 89474, 'loss/train': 1.3961870670318604} 11/07/2021 09:46:51 - INFO - __main__ - Step 89476: {'lr': 0.0001794627173064071, 'samples': 17179392, 'steps': 89475, 'loss/train': 2.0573935508728027} 11/07/2021 09:46:51 - INFO - __main__ - Step 89477: {'lr': 0.00017945762618801214, 'samples': 17179584, 'steps': 89476, 'loss/train': 1.6664880514144897} 11/07/2021 09:46:51 - INFO - __main__ - Step 89478: {'lr': 0.00017945253510140248, 'samples': 17179776, 'steps': 89477, 'loss/train': 1.4844577312469482} 11/07/2021 09:46:52 - INFO - __main__ - Step 89479: {'lr': 0.0001794474440465804, 'samples': 17179968, 'steps': 89478, 'loss/train': 1.4785021543502808} 11/07/2021 09:46:53 - INFO - __main__ - Step 89480: {'lr': 0.00017944235302354828, 'samples': 17180160, 'steps': 89479, 'loss/train': 1.1246079206466675} 11/07/2021 09:46:53 - INFO - __main__ - Step 89481: {'lr': 0.00017943726203230832, 'samples': 17180352, 'steps': 89480, 'loss/train': 1.0686076879501343} 11/07/2021 09:46:53 - INFO - __main__ - Step 89482: {'lr': 0.0001794321710728628, 'samples': 17180544, 'steps': 89481, 'loss/train': 1.3887425661087036} 11/07/2021 09:46:54 - INFO - __main__ - Step 89483: {'lr': 0.00017942708014521408, 'samples': 17180736, 'steps': 89482, 'loss/train': 1.6790555715560913} 11/07/2021 09:46:55 - INFO - __main__ - Step 89484: {'lr': 0.00017942198924936447, 'samples': 17180928, 'steps': 89483, 'loss/train': 1.0911908149719238} 11/07/2021 09:46:55 - INFO - __main__ - Step 89485: {'lr': 0.00017941689838531615, 'samples': 17181120, 'steps': 89484, 'loss/train': 1.1908748149871826} 11/07/2021 09:46:55 - INFO - __main__ - Step 89486: {'lr': 0.00017941180755307154, 'samples': 17181312, 'steps': 89485, 'loss/train': 1.6334229707717896} 11/07/2021 09:46:56 - INFO - __main__ - Step 89487: {'lr': 0.00017940671675263284, 'samples': 17181504, 'steps': 89486, 'loss/train': 1.212219476699829} 11/07/2021 09:46:56 - INFO - __main__ - Step 89488: {'lr': 0.00017940162598400238, 'samples': 17181696, 'steps': 89487, 'loss/train': 1.691240668296814} 11/07/2021 09:46:57 - INFO - __main__ - Step 89489: {'lr': 0.0001793965352471825, 'samples': 17181888, 'steps': 89488, 'loss/train': 1.0846307277679443} 11/07/2021 09:46:57 - INFO - __main__ - Step 89490: {'lr': 0.00017939144454217544, 'samples': 17182080, 'steps': 89489, 'loss/train': 1.2926377058029175} 11/07/2021 09:46:58 - INFO - __main__ - Step 89491: {'lr': 0.00017938635386898348, 'samples': 17182272, 'steps': 89490, 'loss/train': 1.6368252038955688} 11/07/2021 09:46:58 - INFO - __main__ - Step 89492: {'lr': 0.00017938126322760895, 'samples': 17182464, 'steps': 89491, 'loss/train': 1.3600773811340332} 11/07/2021 09:46:59 - INFO - __main__ - Step 89493: {'lr': 0.00017937617261805418, 'samples': 17182656, 'steps': 89492, 'loss/train': 1.8326994180679321} 11/07/2021 09:46:59 - INFO - __main__ - Step 89494: {'lr': 0.00017937108204032137, 'samples': 17182848, 'steps': 89493, 'loss/train': 1.599526286125183} 11/07/2021 09:47:00 - INFO - __main__ - Step 89495: {'lr': 0.0001793659914944129, 'samples': 17183040, 'steps': 89494, 'loss/train': 1.2013510465621948} 11/07/2021 09:47:00 - INFO - __main__ - Step 89496: {'lr': 0.00017936090098033097, 'samples': 17183232, 'steps': 89495, 'loss/train': 5.62519645690918} 11/07/2021 09:47:01 - INFO - __main__ - Step 89497: {'lr': 0.000179355810498078, 'samples': 17183424, 'steps': 89496, 'loss/train': 1.4442017078399658} 11/07/2021 09:47:01 - INFO - __main__ - Step 89498: {'lr': 0.00017935072004765613, 'samples': 17183616, 'steps': 89497, 'loss/train': 0.8217788338661194} 11/07/2021 09:47:01 - INFO - __main__ - Step 89499: {'lr': 0.00017934562962906774, 'samples': 17183808, 'steps': 89498, 'loss/train': 1.0001943111419678} 11/07/2021 09:47:02 - INFO - __main__ - Step 89500: {'lr': 0.00017934053924231514, 'samples': 17184000, 'steps': 89499, 'loss/train': 1.0787559747695923} 11/07/2021 09:47:03 - INFO - __main__ - Step 89501: {'lr': 0.00017933544888740062, 'samples': 17184192, 'steps': 89500, 'loss/train': 1.4517396688461304} 11/07/2021 09:47:03 - INFO - __main__ - Step 89502: {'lr': 0.00017933035856432643, 'samples': 17184384, 'steps': 89501, 'loss/train': 1.242164134979248} 11/07/2021 09:47:04 - INFO - __main__ - Step 89503: {'lr': 0.00017932526827309486, 'samples': 17184576, 'steps': 89502, 'loss/train': 1.6417227983474731} 11/07/2021 09:47:04 - INFO - __main__ - Step 89504: {'lr': 0.0001793201780137083, 'samples': 17184768, 'steps': 89503, 'loss/train': 1.3514658212661743} 11/07/2021 09:47:04 - INFO - __main__ - Step 89505: {'lr': 0.00017931508778616895, 'samples': 17184960, 'steps': 89504, 'loss/train': 0.4188991189002991} 11/07/2021 09:47:05 - INFO - __main__ - Step 89506: {'lr': 0.0001793099975904791, 'samples': 17185152, 'steps': 89505, 'loss/train': 0.649027943611145} 11/07/2021 09:47:06 - INFO - __main__ - Step 89507: {'lr': 0.00017930490742664124, 'samples': 17185344, 'steps': 89506, 'loss/train': 1.4166936874389648} 11/07/2021 09:47:06 - INFO - __main__ - Step 89508: {'lr': 0.00017929981729465733, 'samples': 17185536, 'steps': 89507, 'loss/train': 1.5230624675750732} 11/07/2021 09:47:06 - INFO - __main__ - Step 89509: {'lr': 0.0001792947271945299, 'samples': 17185728, 'steps': 89508, 'loss/train': 1.7446727752685547} 11/07/2021 09:47:07 - INFO - __main__ - Step 89510: {'lr': 0.00017928963712626113, 'samples': 17185920, 'steps': 89509, 'loss/train': 1.1342326402664185} 11/07/2021 09:47:07 - INFO - __main__ - Step 89511: {'lr': 0.00017928454708985336, 'samples': 17186112, 'steps': 89510, 'loss/train': 1.3844754695892334} 11/07/2021 09:47:08 - INFO - __main__ - Step 89512: {'lr': 0.00017927945708530888, 'samples': 17186304, 'steps': 89511, 'loss/train': 1.8268814086914062} 11/07/2021 09:47:08 - INFO - __main__ - Step 89513: {'lr': 0.00017927436711262997, 'samples': 17186496, 'steps': 89512, 'loss/train': 1.2368450164794922} 11/07/2021 09:47:09 - INFO - __main__ - Step 89514: {'lr': 0.000179269277171819, 'samples': 17186688, 'steps': 89513, 'loss/train': 1.6127818822860718} 11/07/2021 09:47:09 - INFO - __main__ - Step 89515: {'lr': 0.00017926418726287813, 'samples': 17186880, 'steps': 89514, 'loss/train': 1.520124077796936} 11/07/2021 09:47:09 - INFO - __main__ - Step 89516: {'lr': 0.00017925909738580976, 'samples': 17187072, 'steps': 89515, 'loss/train': 1.2547374963760376} 11/07/2021 09:47:10 - INFO - __main__ - Step 89517: {'lr': 0.00017925400754061616, 'samples': 17187264, 'steps': 89516, 'loss/train': 1.4581629037857056} 11/07/2021 09:47:11 - INFO - __main__ - Step 89518: {'lr': 0.0001792489177272996, 'samples': 17187456, 'steps': 89517, 'loss/train': 1.292655110359192} 11/07/2021 09:47:11 - INFO - __main__ - Step 89519: {'lr': 0.0001792438279458624, 'samples': 17187648, 'steps': 89518, 'loss/train': 1.2247902154922485} 11/07/2021 09:47:11 - INFO - __main__ - Step 89520: {'lr': 0.00017923873819630692, 'samples': 17187840, 'steps': 89519, 'loss/train': 1.287838339805603} 11/07/2021 09:47:12 - INFO - __main__ - Step 89521: {'lr': 0.00017923364847863526, 'samples': 17188032, 'steps': 89520, 'loss/train': 1.2198472023010254} 11/07/2021 09:47:13 - INFO - __main__ - Step 89522: {'lr': 0.00017922855879284985, 'samples': 17188224, 'steps': 89521, 'loss/train': 1.4869346618652344} 11/07/2021 09:47:13 - INFO - __main__ - Step 89523: {'lr': 0.00017922346913895295, 'samples': 17188416, 'steps': 89522, 'loss/train': 1.4423670768737793} 11/07/2021 09:47:13 - INFO - __main__ - Step 89524: {'lr': 0.00017921837951694687, 'samples': 17188608, 'steps': 89523, 'loss/train': 1.3326166868209839} 11/07/2021 09:47:14 - INFO - __main__ - Step 89525: {'lr': 0.00017921328992683388, 'samples': 17188800, 'steps': 89524, 'loss/train': 1.791100263595581} 11/07/2021 09:47:14 - INFO - __main__ - Step 89526: {'lr': 0.00017920820036861632, 'samples': 17188992, 'steps': 89525, 'loss/train': 0.9674580097198486} 11/07/2021 09:47:15 - INFO - __main__ - Step 89527: {'lr': 0.00017920311084229645, 'samples': 17189184, 'steps': 89526, 'loss/train': 1.0102050304412842} 11/07/2021 09:47:16 - INFO - __main__ - Step 89528: {'lr': 0.00017919802134787655, 'samples': 17189376, 'steps': 89527, 'loss/train': 1.5103999376296997} 11/07/2021 09:47:16 - INFO - __main__ - Step 89529: {'lr': 0.0001791929318853589, 'samples': 17189568, 'steps': 89528, 'loss/train': 1.6926798820495605} 11/07/2021 09:47:16 - INFO - __main__ - Step 89530: {'lr': 0.00017918784245474586, 'samples': 17189760, 'steps': 89529, 'loss/train': 1.261482834815979} 11/07/2021 09:47:17 - INFO - __main__ - Step 89531: {'lr': 0.00017918275305603968, 'samples': 17189952, 'steps': 89530, 'loss/train': 1.3645035028457642} 11/07/2021 09:47:18 - INFO - __main__ - Step 89532: {'lr': 0.00017917766368924265, 'samples': 17190144, 'steps': 89531, 'loss/train': 1.2304707765579224} 11/07/2021 09:47:18 - INFO - __main__ - Step 89533: {'lr': 0.00017917257435435708, 'samples': 17190336, 'steps': 89532, 'loss/train': 1.534883737564087} 11/07/2021 09:47:18 - INFO - __main__ - Step 89534: {'lr': 0.00017916748505138536, 'samples': 17190528, 'steps': 89533, 'loss/train': 1.4856899976730347} 11/07/2021 09:47:19 - INFO - __main__ - Step 89535: {'lr': 0.00017916239578032956, 'samples': 17190720, 'steps': 89534, 'loss/train': 1.0553078651428223} 11/07/2021 09:47:19 - INFO - __main__ - Step 89536: {'lr': 0.0001791573065411921, 'samples': 17190912, 'steps': 89535, 'loss/train': 1.108437418937683} 11/07/2021 09:47:20 - INFO - __main__ - Step 89537: {'lr': 0.0001791522173339753, 'samples': 17191104, 'steps': 89536, 'loss/train': 1.8774884939193726} 11/07/2021 09:47:20 - INFO - __main__ - Step 89538: {'lr': 0.00017914712815868136, 'samples': 17191296, 'steps': 89537, 'loss/train': 1.5486338138580322} 11/07/2021 09:47:21 - INFO - __main__ - Step 89539: {'lr': 0.00017914203901531268, 'samples': 17191488, 'steps': 89538, 'loss/train': 1.601324200630188} 11/07/2021 09:47:21 - INFO - __main__ - Step 89540: {'lr': 0.00017913694990387148, 'samples': 17191680, 'steps': 89539, 'loss/train': 1.3342195749282837} 11/07/2021 09:47:21 - INFO - __main__ - Step 89541: {'lr': 0.00017913186082436005, 'samples': 17191872, 'steps': 89540, 'loss/train': 1.6379495859146118} 11/07/2021 09:47:22 - INFO - __main__ - Step 89542: {'lr': 0.00017912677177678074, 'samples': 17192064, 'steps': 89541, 'loss/train': 1.0566951036453247} 11/07/2021 09:47:23 - INFO - __main__ - Step 89543: {'lr': 0.00017912168276113582, 'samples': 17192256, 'steps': 89542, 'loss/train': 1.3101050853729248} 11/07/2021 09:47:23 - INFO - __main__ - Step 89544: {'lr': 0.00017911659377742756, 'samples': 17192448, 'steps': 89543, 'loss/train': 1.5497462749481201} 11/07/2021 09:47:24 - INFO - __main__ - Step 89545: {'lr': 0.00017911150482565827, 'samples': 17192640, 'steps': 89544, 'loss/train': 1.4409916400909424} 11/07/2021 09:47:24 - INFO - __main__ - Step 89546: {'lr': 0.00017910641590583023, 'samples': 17192832, 'steps': 89545, 'loss/train': 1.4664201736450195} 11/07/2021 09:47:24 - INFO - __main__ - Step 89547: {'lr': 0.00017910132701794588, 'samples': 17193024, 'steps': 89546, 'loss/train': 1.6302934885025024} 11/07/2021 09:47:25 - INFO - __main__ - Step 89548: {'lr': 0.00017909623816200727, 'samples': 17193216, 'steps': 89547, 'loss/train': 1.5856921672821045} 11/07/2021 09:47:26 - INFO - __main__ - Step 89549: {'lr': 0.0001790911493380168, 'samples': 17193408, 'steps': 89548, 'loss/train': 0.9029682874679565} 11/07/2021 09:47:26 - INFO - __main__ - Step 89550: {'lr': 0.00017908606054597672, 'samples': 17193600, 'steps': 89549, 'loss/train': 1.5927098989486694} 11/07/2021 09:47:26 - INFO - __main__ - Step 89551: {'lr': 0.00017908097178588942, 'samples': 17193792, 'steps': 89550, 'loss/train': 1.3913342952728271} 11/07/2021 09:47:27 - INFO - __main__ - Step 89552: {'lr': 0.00017907588305775713, 'samples': 17193984, 'steps': 89551, 'loss/train': 1.6854143142700195} 11/07/2021 09:47:28 - INFO - __main__ - Step 89553: {'lr': 0.00017907079436158213, 'samples': 17194176, 'steps': 89552, 'loss/train': 1.0722923278808594} 11/07/2021 09:47:28 - INFO - __main__ - Step 89554: {'lr': 0.00017906570569736673, 'samples': 17194368, 'steps': 89553, 'loss/train': 1.5969862937927246} 11/07/2021 09:47:28 - INFO - __main__ - Step 89555: {'lr': 0.00017906061706511326, 'samples': 17194560, 'steps': 89554, 'loss/train': 1.1475213766098022} 11/07/2021 09:47:29 - INFO - __main__ - Step 89556: {'lr': 0.00017905552846482397, 'samples': 17194752, 'steps': 89555, 'loss/train': 1.3668978214263916} 11/07/2021 09:47:29 - INFO - __main__ - Step 89557: {'lr': 0.00017905043989650116, 'samples': 17194944, 'steps': 89556, 'loss/train': 1.4071943759918213} 11/07/2021 09:47:30 - INFO - __main__ - Step 89558: {'lr': 0.00017904535136014713, 'samples': 17195136, 'steps': 89557, 'loss/train': 1.2578076124191284} 11/07/2021 09:47:30 - INFO - __main__ - Step 89559: {'lr': 0.00017904026285576417, 'samples': 17195328, 'steps': 89558, 'loss/train': 1.055044412612915} 11/07/2021 09:47:31 - INFO - __main__ - Step 89560: {'lr': 0.00017903517438335457, 'samples': 17195520, 'steps': 89559, 'loss/train': 1.3227458000183105} 11/07/2021 09:47:31 - INFO - __main__ - Step 89561: {'lr': 0.00017903008594292075, 'samples': 17195712, 'steps': 89560, 'loss/train': 1.330786943435669} 11/07/2021 09:47:32 - INFO - __main__ - Step 89562: {'lr': 0.0001790249975344647, 'samples': 17195904, 'steps': 89561, 'loss/train': 1.4507124423980713} 11/07/2021 09:47:32 - INFO - __main__ - Step 89563: {'lr': 0.00017901990915798898, 'samples': 17196096, 'steps': 89562, 'loss/train': 1.1804002523422241} 11/07/2021 09:47:33 - INFO - __main__ - Step 89564: {'lr': 0.00017901482081349578, 'samples': 17196288, 'steps': 89563, 'loss/train': 1.183331847190857} 11/07/2021 09:47:33 - INFO - __main__ - Step 89565: {'lr': 0.00017900973250098738, 'samples': 17196480, 'steps': 89564, 'loss/train': 1.3244551420211792} 11/07/2021 09:47:34 - INFO - __main__ - Step 89566: {'lr': 0.00017900464422046609, 'samples': 17196672, 'steps': 89565, 'loss/train': 1.171535611152649} 11/07/2021 09:47:34 - INFO - __main__ - Step 89567: {'lr': 0.00017899955597193423, 'samples': 17196864, 'steps': 89566, 'loss/train': 0.7769943475723267} 11/07/2021 09:47:34 - INFO - __main__ - Step 89568: {'lr': 0.00017899446775539407, 'samples': 17197056, 'steps': 89567, 'loss/train': 1.7027593851089478} 11/07/2021 09:47:35 - INFO - __main__ - Step 89569: {'lr': 0.0001789893795708479, 'samples': 17197248, 'steps': 89568, 'loss/train': 1.397322654724121} 11/07/2021 09:47:36 - INFO - __main__ - Step 89570: {'lr': 0.00017898429141829803, 'samples': 17197440, 'steps': 89569, 'loss/train': 1.4762005805969238} 11/07/2021 09:47:36 - INFO - __main__ - Step 89571: {'lr': 0.00017897920329774676, 'samples': 17197632, 'steps': 89570, 'loss/train': 1.2172330617904663} 11/07/2021 09:47:36 - INFO - __main__ - Step 89572: {'lr': 0.0001789741152091963, 'samples': 17197824, 'steps': 89571, 'loss/train': 1.1882165670394897} 11/07/2021 09:47:37 - INFO - __main__ - Step 89573: {'lr': 0.0001789690271526491, 'samples': 17198016, 'steps': 89572, 'loss/train': 1.3040223121643066} 11/07/2021 09:47:38 - INFO - __main__ - Step 89574: {'lr': 0.0001789639391281074, 'samples': 17198208, 'steps': 89573, 'loss/train': 1.4039474725723267} 11/07/2021 09:47:38 - INFO - __main__ - Step 89575: {'lr': 0.00017895885113557337, 'samples': 17198400, 'steps': 89574, 'loss/train': 1.275062918663025} 11/07/2021 09:47:39 - INFO - __main__ - Step 89576: {'lr': 0.0001789537631750494, 'samples': 17198592, 'steps': 89575, 'loss/train': 0.8688292503356934} 11/07/2021 09:47:39 - INFO - __main__ - Step 89577: {'lr': 0.00017894867524653773, 'samples': 17198784, 'steps': 89576, 'loss/train': 1.0180131196975708} 11/07/2021 09:47:39 - INFO - __main__ - Step 89578: {'lr': 0.00017894358735004074, 'samples': 17198976, 'steps': 89577, 'loss/train': 0.9906489849090576} 11/07/2021 09:47:40 - INFO - __main__ - Step 89579: {'lr': 0.00017893849948556062, 'samples': 17199168, 'steps': 89578, 'loss/train': 1.7226605415344238} 11/07/2021 09:47:41 - INFO - __main__ - Step 89580: {'lr': 0.00017893341165309973, 'samples': 17199360, 'steps': 89579, 'loss/train': 1.6850671768188477} 11/07/2021 09:47:41 - INFO - __main__ - Step 89581: {'lr': 0.00017892832385266037, 'samples': 17199552, 'steps': 89580, 'loss/train': 1.324580192565918} 11/07/2021 09:47:41 - INFO - __main__ - Step 89582: {'lr': 0.00017892323608424479, 'samples': 17199744, 'steps': 89581, 'loss/train': 1.2546018362045288} 11/07/2021 09:47:42 - INFO - __main__ - Step 89583: {'lr': 0.0001789181483478553, 'samples': 17199936, 'steps': 89582, 'loss/train': 1.5880560874938965} 11/07/2021 09:47:42 - INFO - __main__ - Step 89584: {'lr': 0.0001789130606434942, 'samples': 17200128, 'steps': 89583, 'loss/train': 1.2056725025177002} 11/07/2021 09:47:43 - INFO - __main__ - Step 89585: {'lr': 0.0001789079729711638, 'samples': 17200320, 'steps': 89584, 'loss/train': 1.4464699029922485} 11/07/2021 09:47:44 - INFO - __main__ - Step 89586: {'lr': 0.00017890288533086641, 'samples': 17200512, 'steps': 89585, 'loss/train': 1.7347885370254517} 11/07/2021 09:47:44 - INFO - __main__ - Step 89587: {'lr': 0.00017889779772260427, 'samples': 17200704, 'steps': 89586, 'loss/train': 1.3194605112075806} 11/07/2021 09:47:44 - INFO - __main__ - Step 89588: {'lr': 0.00017889271014637966, 'samples': 17200896, 'steps': 89587, 'loss/train': 1.3469926118850708} 11/07/2021 09:47:45 - INFO - __main__ - Step 89589: {'lr': 0.00017888762260219487, 'samples': 17201088, 'steps': 89588, 'loss/train': 0.9714574813842773} 11/07/2021 09:47:46 - INFO - __main__ - Step 89590: {'lr': 0.00017888253509005226, 'samples': 17201280, 'steps': 89589, 'loss/train': 1.5451768636703491} 11/07/2021 09:47:46 - INFO - __main__ - Step 89591: {'lr': 0.00017887744760995404, 'samples': 17201472, 'steps': 89590, 'loss/train': 1.7350187301635742} 11/07/2021 09:47:46 - INFO - __main__ - Step 89592: {'lr': 0.00017887236016190256, 'samples': 17201664, 'steps': 89591, 'loss/train': 1.0587695837020874} 11/07/2021 09:47:47 - INFO - __main__ - Step 89593: {'lr': 0.0001788672727459001, 'samples': 17201856, 'steps': 89592, 'loss/train': 1.3164969682693481} 11/07/2021 09:47:47 - INFO - __main__ - Step 89594: {'lr': 0.00017886218536194892, 'samples': 17202048, 'steps': 89593, 'loss/train': 1.2673311233520508} 11/07/2021 09:47:48 - INFO - __main__ - Step 89595: {'lr': 0.00017885709801005137, 'samples': 17202240, 'steps': 89594, 'loss/train': 1.1086114645004272} 11/07/2021 09:47:49 - INFO - __main__ - Step 89596: {'lr': 0.0001788520106902097, 'samples': 17202432, 'steps': 89595, 'loss/train': 1.4014384746551514} 11/07/2021 09:47:49 - INFO - __main__ - Step 89597: {'lr': 0.00017884692340242627, 'samples': 17202624, 'steps': 89596, 'loss/train': 1.3698970079421997} 11/07/2021 09:47:49 - INFO - __main__ - Step 89598: {'lr': 0.00017884183614670329, 'samples': 17202816, 'steps': 89597, 'loss/train': 1.3507176637649536} 11/07/2021 09:47:50 - INFO - __main__ - Step 89599: {'lr': 0.00017883674892304308, 'samples': 17203008, 'steps': 89598, 'loss/train': 1.624157190322876} 11/07/2021 09:47:50 - INFO - __main__ - Step 89600: {'lr': 0.00017883166173144789, 'samples': 17203200, 'steps': 89599, 'loss/train': 1.2761929035186768} 11/07/2021 09:47:51 - INFO - __main__ - Step 89601: {'lr': 0.00017882657457192014, 'samples': 17203392, 'steps': 89600, 'loss/train': 1.4924452304840088} 11/07/2021 09:47:51 - INFO - __main__ - Step 89602: {'lr': 0.00017882148744446198, 'samples': 17203584, 'steps': 89601, 'loss/train': 1.5211986303329468} 11/07/2021 09:47:52 - INFO - __main__ - Step 89603: {'lr': 0.00017881640034907577, 'samples': 17203776, 'steps': 89602, 'loss/train': 2.09218168258667} 11/07/2021 09:47:52 - INFO - __main__ - Step 89604: {'lr': 0.00017881131328576378, 'samples': 17203968, 'steps': 89603, 'loss/train': 5.761884689331055} 11/07/2021 09:47:52 - INFO - __main__ - Step 89605: {'lr': 0.0001788062262545283, 'samples': 17204160, 'steps': 89604, 'loss/train': 1.2294365167617798} 11/07/2021 09:47:53 - INFO - __main__ - Step 89606: {'lr': 0.00017880113925537166, 'samples': 17204352, 'steps': 89605, 'loss/train': 1.4933960437774658} 11/07/2021 09:47:54 - INFO - __main__ - Step 89607: {'lr': 0.0001787960522882961, 'samples': 17204544, 'steps': 89606, 'loss/train': 1.279327154159546} 11/07/2021 09:47:54 - INFO - __main__ - Step 89608: {'lr': 0.00017879096535330404, 'samples': 17204736, 'steps': 89607, 'loss/train': 1.2744414806365967} 11/07/2021 09:47:54 - INFO - __main__ - Step 89609: {'lr': 0.00017878587845039756, 'samples': 17204928, 'steps': 89608, 'loss/train': 1.2529977560043335} 11/07/2021 09:47:55 - INFO - __main__ - Step 89610: {'lr': 0.0001787807915795791, 'samples': 17205120, 'steps': 89609, 'loss/train': 1.2801107168197632} 11/07/2021 09:47:55 - INFO - __main__ - Step 89611: {'lr': 0.00017877570474085093, 'samples': 17205312, 'steps': 89610, 'loss/train': 1.6883236169815063} 11/07/2021 09:47:56 - INFO - __main__ - Step 89612: {'lr': 0.0001787706179342153, 'samples': 17205504, 'steps': 89611, 'loss/train': 0.902953028678894} 11/07/2021 09:47:57 - INFO - __main__ - Step 89613: {'lr': 0.00017876553115967454, 'samples': 17205696, 'steps': 89612, 'loss/train': 5.803478240966797} 11/07/2021 09:47:57 - INFO - __main__ - Step 89614: {'lr': 0.000178760444417231, 'samples': 17205888, 'steps': 89613, 'loss/train': 1.5127894878387451} 11/07/2021 09:47:57 - INFO - __main__ - Step 89615: {'lr': 0.0001787553577068868, 'samples': 17206080, 'steps': 89614, 'loss/train': 1.5705097913742065} 11/07/2021 09:47:58 - INFO - __main__ - Step 89616: {'lr': 0.0001787502710286444, 'samples': 17206272, 'steps': 89615, 'loss/train': 1.6877357959747314} 11/07/2021 09:47:59 - INFO - __main__ - Step 89617: {'lr': 0.00017874518438250596, 'samples': 17206464, 'steps': 89616, 'loss/train': 1.6225441694259644} 11/07/2021 09:47:59 - INFO - __main__ - Step 89618: {'lr': 0.0001787400977684739, 'samples': 17206656, 'steps': 89617, 'loss/train': 1.677808403968811} 11/07/2021 09:47:59 - INFO - __main__ - Step 89619: {'lr': 0.0001787350111865505, 'samples': 17206848, 'steps': 89618, 'loss/train': 1.3073606491088867} 11/07/2021 09:48:00 - INFO - __main__ - Step 89620: {'lr': 0.00017872992463673792, 'samples': 17207040, 'steps': 89619, 'loss/train': 1.7113211154937744} 11/07/2021 09:48:00 - INFO - __main__ - Step 89621: {'lr': 0.00017872483811903856, 'samples': 17207232, 'steps': 89620, 'loss/train': 1.4589238166809082} 11/07/2021 09:48:01 - INFO - __main__ - Step 89622: {'lr': 0.0001787197516334547, 'samples': 17207424, 'steps': 89621, 'loss/train': 1.3438717126846313} 11/07/2021 09:48:02 - INFO - __main__ - Step 89623: {'lr': 0.00017871466517998857, 'samples': 17207616, 'steps': 89622, 'loss/train': 1.5441919565200806} 11/07/2021 09:48:02 - INFO - __main__ - Step 89624: {'lr': 0.0001787095787586426, 'samples': 17207808, 'steps': 89623, 'loss/train': 1.5306565761566162} 11/07/2021 09:48:02 - INFO - __main__ - Step 89625: {'lr': 0.00017870449236941888, 'samples': 17208000, 'steps': 89624, 'loss/train': 1.0210812091827393} 11/07/2021 09:48:03 - INFO - __main__ - Step 89626: {'lr': 0.00017869940601231987, 'samples': 17208192, 'steps': 89625, 'loss/train': 1.4628640413284302} 11/07/2021 09:48:03 - INFO - __main__ - Step 89627: {'lr': 0.00017869431968734785, 'samples': 17208384, 'steps': 89626, 'loss/train': 1.257904291152954} 11/07/2021 09:48:04 - INFO - __main__ - Step 89628: {'lr': 0.00017868923339450508, 'samples': 17208576, 'steps': 89627, 'loss/train': 1.0977789163589478} 11/07/2021 09:48:04 - INFO - __main__ - Step 89629: {'lr': 0.00017868414713379378, 'samples': 17208768, 'steps': 89628, 'loss/train': 1.5754278898239136} 11/07/2021 09:48:05 - INFO - __main__ - Step 89630: {'lr': 0.00017867906090521634, 'samples': 17208960, 'steps': 89629, 'loss/train': 1.5604991912841797} 11/07/2021 09:48:05 - INFO - __main__ - Step 89631: {'lr': 0.000178673974708775, 'samples': 17209152, 'steps': 89630, 'loss/train': 1.4720544815063477} 11/07/2021 09:48:05 - INFO - __main__ - Step 89632: {'lr': 0.00017866888854447204, 'samples': 17209344, 'steps': 89631, 'loss/train': 0.9408596754074097} 11/07/2021 09:48:06 - INFO - __main__ - Step 89633: {'lr': 0.00017866380241230985, 'samples': 17209536, 'steps': 89632, 'loss/train': 0.6465396285057068} 11/07/2021 09:48:07 - INFO - __main__ - Step 89634: {'lr': 0.0001786587163122906, 'samples': 17209728, 'steps': 89633, 'loss/train': 1.4663004875183105} 11/07/2021 09:48:07 - INFO - __main__ - Step 89635: {'lr': 0.0001786536302444166, 'samples': 17209920, 'steps': 89634, 'loss/train': 1.2967755794525146} 11/07/2021 09:48:08 - INFO - __main__ - Step 89636: {'lr': 0.0001786485442086902, 'samples': 17210112, 'steps': 89635, 'loss/train': 1.1361699104309082} 11/07/2021 09:48:08 - INFO - __main__ - Step 89637: {'lr': 0.00017864345820511364, 'samples': 17210304, 'steps': 89636, 'loss/train': 0.6742681264877319} 11/07/2021 09:48:09 - INFO - __main__ - Step 89638: {'lr': 0.00017863837223368927, 'samples': 17210496, 'steps': 89637, 'loss/train': 1.2023035287857056} 11/07/2021 09:48:09 - INFO - __main__ - Step 89639: {'lr': 0.00017863328629441933, 'samples': 17210688, 'steps': 89638, 'loss/train': 1.219280481338501} 11/07/2021 09:48:10 - INFO - __main__ - Step 89640: {'lr': 0.00017862820038730615, 'samples': 17210880, 'steps': 89639, 'loss/train': 1.3255653381347656} 11/07/2021 09:48:10 - INFO - __main__ - Step 89641: {'lr': 0.0001786231145123521, 'samples': 17211072, 'steps': 89640, 'loss/train': 1.1190334558486938} 11/07/2021 09:48:10 - INFO - __main__ - Step 89642: {'lr': 0.00017861802866955926, 'samples': 17211264, 'steps': 89641, 'loss/train': 0.8509531617164612} 11/07/2021 09:48:11 - INFO - __main__ - Step 89643: {'lr': 0.00017861294285893004, 'samples': 17211456, 'steps': 89642, 'loss/train': 1.0464463233947754} 11/07/2021 09:48:12 - INFO - __main__ - Step 89644: {'lr': 0.00017860785708046672, 'samples': 17211648, 'steps': 89643, 'loss/train': 0.6652165651321411} 11/07/2021 09:48:12 - INFO - __main__ - Step 89645: {'lr': 0.00017860277133417156, 'samples': 17211840, 'steps': 89644, 'loss/train': 1.5629645586013794} 11/07/2021 09:48:12 - INFO - __main__ - Step 89646: {'lr': 0.00017859768562004697, 'samples': 17212032, 'steps': 89645, 'loss/train': 0.9797791838645935} 11/07/2021 09:48:13 - INFO - __main__ - Step 89647: {'lr': 0.00017859259993809512, 'samples': 17212224, 'steps': 89646, 'loss/train': 1.525293231010437} 11/07/2021 09:48:13 - INFO - __main__ - Step 89648: {'lr': 0.00017858751428831833, 'samples': 17212416, 'steps': 89647, 'loss/train': 1.3011651039123535} 11/07/2021 09:48:14 - INFO - __main__ - Step 89649: {'lr': 0.00017858242867071895, 'samples': 17212608, 'steps': 89648, 'loss/train': 1.6272002458572388} 11/07/2021 09:48:15 - INFO - __main__ - Step 89650: {'lr': 0.00017857734308529915, 'samples': 17212800, 'steps': 89649, 'loss/train': 1.4487122297286987} 11/07/2021 09:48:15 - INFO - __main__ - Step 89651: {'lr': 0.00017857225753206135, 'samples': 17212992, 'steps': 89650, 'loss/train': 1.7138605117797852} 11/07/2021 09:48:15 - INFO - __main__ - Step 89652: {'lr': 0.0001785671720110078, 'samples': 17213184, 'steps': 89651, 'loss/train': 1.7594938278198242} 11/07/2021 09:48:16 - INFO - __main__ - Step 89653: {'lr': 0.00017856208652214072, 'samples': 17213376, 'steps': 89652, 'loss/train': 1.491249918937683} 11/07/2021 09:48:17 - INFO - __main__ - Step 89654: {'lr': 0.00017855700106546253, 'samples': 17213568, 'steps': 89653, 'loss/train': 1.6148263216018677} 11/07/2021 09:48:17 - INFO - __main__ - Step 89655: {'lr': 0.00017855191564097552, 'samples': 17213760, 'steps': 89654, 'loss/train': 2.0080816745758057} 11/07/2021 09:48:17 - INFO - __main__ - Step 89656: {'lr': 0.00017854683024868184, 'samples': 17213952, 'steps': 89655, 'loss/train': 1.6698044538497925} 11/07/2021 09:48:18 - INFO - __main__ - Step 89657: {'lr': 0.00017854174488858384, 'samples': 17214144, 'steps': 89656, 'loss/train': 1.6030995845794678} 11/07/2021 09:48:18 - INFO - __main__ - Step 89658: {'lr': 0.00017853665956068382, 'samples': 17214336, 'steps': 89657, 'loss/train': 1.39662504196167} 11/07/2021 09:48:19 - INFO - __main__ - Step 89659: {'lr': 0.00017853157426498407, 'samples': 17214528, 'steps': 89658, 'loss/train': 1.0478864908218384} 11/07/2021 09:48:19 - INFO - __main__ - Step 89660: {'lr': 0.00017852648900148688, 'samples': 17214720, 'steps': 89659, 'loss/train': 1.3694905042648315} 11/07/2021 09:48:20 - INFO - __main__ - Step 89661: {'lr': 0.00017852140377019461, 'samples': 17214912, 'steps': 89660, 'loss/train': 1.2928497791290283} 11/07/2021 09:48:20 - INFO - __main__ - Step 89662: {'lr': 0.00017851631857110944, 'samples': 17215104, 'steps': 89661, 'loss/train': 1.4780683517456055} 11/07/2021 09:48:20 - INFO - __main__ - Step 89663: {'lr': 0.0001785112334042337, 'samples': 17215296, 'steps': 89662, 'loss/train': 1.4230451583862305} 11/07/2021 09:48:22 - INFO - __main__ - Step 89664: {'lr': 0.00017850614826956973, 'samples': 17215488, 'steps': 89663, 'loss/train': 0.7200479507446289} 11/07/2021 09:48:23 - INFO - __main__ - Step 89665: {'lr': 0.00017850106316711977, 'samples': 17215680, 'steps': 89664, 'loss/train': 1.2145801782608032} 11/07/2021 09:48:23 - INFO - __main__ - Step 89666: {'lr': 0.00017849597809688618, 'samples': 17215872, 'steps': 89665, 'loss/train': 1.4911473989486694} 11/07/2021 09:48:23 - INFO - __main__ - Step 89667: {'lr': 0.0001784908930588711, 'samples': 17216064, 'steps': 89666, 'loss/train': 1.6899528503417969} 11/07/2021 09:48:24 - INFO - __main__ - Step 89668: {'lr': 0.00017848580805307712, 'samples': 17216256, 'steps': 89667, 'loss/train': 1.5177439451217651} 11/07/2021 09:48:24 - INFO - __main__ - Step 89669: {'lr': 0.0001784807230795062, 'samples': 17216448, 'steps': 89668, 'loss/train': 1.7054780721664429} 11/07/2021 09:48:24 - INFO - __main__ - Step 89670: {'lr': 0.00017847563813816074, 'samples': 17216640, 'steps': 89669, 'loss/train': 1.4383518695831299} 11/07/2021 09:48:25 - INFO - __main__ - Step 89671: {'lr': 0.00017847055322904305, 'samples': 17216832, 'steps': 89670, 'loss/train': 1.4293272495269775} 11/07/2021 09:48:26 - INFO - __main__ - Step 89672: {'lr': 0.00017846546835215545, 'samples': 17217024, 'steps': 89671, 'loss/train': 1.4673683643341064} 11/07/2021 09:48:26 - INFO - __main__ - Step 89673: {'lr': 0.0001784603835075002, 'samples': 17217216, 'steps': 89672, 'loss/train': 1.354933500289917} 11/07/2021 09:48:26 - INFO - __main__ - Step 89674: {'lr': 0.00017845529869507957, 'samples': 17217408, 'steps': 89673, 'loss/train': 2.128615140914917} 11/07/2021 09:48:27 - INFO - __main__ - Step 89675: {'lr': 0.00017845021391489592, 'samples': 17217600, 'steps': 89674, 'loss/train': 1.5037370920181274} 11/07/2021 09:48:28 - INFO - __main__ - Step 89676: {'lr': 0.00017844512916695147, 'samples': 17217792, 'steps': 89675, 'loss/train': 1.5090067386627197} 11/07/2021 09:48:28 - INFO - __main__ - Step 89677: {'lr': 0.00017844004445124854, 'samples': 17217984, 'steps': 89676, 'loss/train': 1.515098214149475} 11/07/2021 09:48:28 - INFO - __main__ - Step 89678: {'lr': 0.00017843495976778943, 'samples': 17218176, 'steps': 89677, 'loss/train': 1.6215568780899048} 11/07/2021 09:48:29 - INFO - __main__ - Step 89679: {'lr': 0.00017842987511657642, 'samples': 17218368, 'steps': 89678, 'loss/train': 1.7060801982879639} 11/07/2021 09:48:29 - INFO - __main__ - Step 89680: {'lr': 0.0001784247904976118, 'samples': 17218560, 'steps': 89679, 'loss/train': 1.3932816982269287} 11/07/2021 09:48:30 - INFO - __main__ - Step 89681: {'lr': 0.0001784197059108979, 'samples': 17218752, 'steps': 89680, 'loss/train': 1.4854745864868164} 11/07/2021 09:48:31 - INFO - __main__ - Step 89682: {'lr': 0.00017841462135643704, 'samples': 17218944, 'steps': 89681, 'loss/train': 1.5423657894134521} 11/07/2021 09:48:31 - INFO - __main__ - Step 89683: {'lr': 0.00017840953683423137, 'samples': 17219136, 'steps': 89682, 'loss/train': 1.0500447750091553} 11/07/2021 09:48:31 - INFO - __main__ - Step 89684: {'lr': 0.00017840445234428324, 'samples': 17219328, 'steps': 89683, 'loss/train': 1.5205940008163452} 11/07/2021 09:48:32 - INFO - __main__ - Step 89685: {'lr': 0.00017839936788659495, 'samples': 17219520, 'steps': 89684, 'loss/train': 1.6314314603805542} 11/07/2021 09:48:32 - INFO - __main__ - Step 89686: {'lr': 0.0001783942834611688, 'samples': 17219712, 'steps': 89685, 'loss/train': 2.0889992713928223} 11/07/2021 09:48:33 - INFO - __main__ - Step 89687: {'lr': 0.0001783891990680071, 'samples': 17219904, 'steps': 89686, 'loss/train': 0.1461659073829651} 11/07/2021 09:48:33 - INFO - __main__ - Step 89688: {'lr': 0.00017838411470711213, 'samples': 17220096, 'steps': 89687, 'loss/train': 0.6354267597198486} 11/07/2021 09:48:34 - INFO - __main__ - Step 89689: {'lr': 0.00017837903037848615, 'samples': 17220288, 'steps': 89688, 'loss/train': 1.3299763202667236} 11/07/2021 09:48:34 - INFO - __main__ - Step 89690: {'lr': 0.00017837394608213148, 'samples': 17220480, 'steps': 89689, 'loss/train': 1.2647079229354858} 11/07/2021 09:48:34 - INFO - __main__ - Step 89691: {'lr': 0.0001783688618180504, 'samples': 17220672, 'steps': 89690, 'loss/train': 1.302755355834961} 11/07/2021 09:48:36 - INFO - __main__ - Step 89692: {'lr': 0.00017836377758624522, 'samples': 17220864, 'steps': 89691, 'loss/train': 0.8187870979309082} 11/07/2021 09:48:36 - INFO - __main__ - Step 89693: {'lr': 0.00017835869338671823, 'samples': 17221056, 'steps': 89692, 'loss/train': 1.0107744932174683} 11/07/2021 09:48:37 - INFO - __main__ - Step 89694: {'lr': 0.00017835360921947168, 'samples': 17221248, 'steps': 89693, 'loss/train': 0.6232209205627441} 11/07/2021 09:48:37 - INFO - __main__ - Step 89695: {'lr': 0.00017834852508450799, 'samples': 17221440, 'steps': 89694, 'loss/train': 1.3012763261795044} 11/07/2021 09:48:37 - INFO - __main__ - Step 89696: {'lr': 0.00017834344098182926, 'samples': 17221632, 'steps': 89695, 'loss/train': 2.382094621658325} 11/07/2021 09:48:38 - INFO - __main__ - Step 89697: {'lr': 0.00017833835691143785, 'samples': 17221824, 'steps': 89696, 'loss/train': 0.8630907535552979} 11/07/2021 09:48:39 - INFO - __main__ - Step 89698: {'lr': 0.0001783332728733361, 'samples': 17222016, 'steps': 89697, 'loss/train': 1.9971505403518677} 11/07/2021 09:48:39 - INFO - __main__ - Step 89699: {'lr': 0.00017832818886752625, 'samples': 17222208, 'steps': 89698, 'loss/train': 1.1625877618789673} 11/07/2021 09:48:39 - INFO - __main__ - Step 89700: {'lr': 0.0001783231048940106, 'samples': 17222400, 'steps': 89699, 'loss/train': 0.9649933576583862} 11/07/2021 09:48:40 - INFO - __main__ - Step 89701: {'lr': 0.00017831802095279149, 'samples': 17222592, 'steps': 89700, 'loss/train': 1.4113190174102783} 11/07/2021 09:48:40 - INFO - __main__ - Step 89702: {'lr': 0.00017831293704387115, 'samples': 17222784, 'steps': 89701, 'loss/train': 1.200137734413147} 11/07/2021 09:48:41 - INFO - __main__ - Step 89703: {'lr': 0.0001783078531672519, 'samples': 17222976, 'steps': 89702, 'loss/train': 1.2286381721496582} 11/07/2021 09:48:41 - INFO - __main__ - Step 89704: {'lr': 0.000178302769322936, 'samples': 17223168, 'steps': 89703, 'loss/train': 1.5924478769302368} 11/07/2021 09:48:42 - INFO - __main__ - Step 89705: {'lr': 0.00017829768551092578, 'samples': 17223360, 'steps': 89704, 'loss/train': 0.8165701031684875} 11/07/2021 09:48:42 - INFO - __main__ - Step 89706: {'lr': 0.00017829260173122356, 'samples': 17223552, 'steps': 89705, 'loss/train': 1.2048921585083008} 11/07/2021 09:48:42 - INFO - __main__ - Step 89707: {'lr': 0.00017828751798383154, 'samples': 17223744, 'steps': 89706, 'loss/train': 1.9719488620758057} 11/07/2021 09:48:43 - INFO - __main__ - Step 89708: {'lr': 0.00017828243426875218, 'samples': 17223936, 'steps': 89707, 'loss/train': 1.3100930452346802} 11/07/2021 09:48:44 - INFO - __main__ - Step 89709: {'lr': 0.00017827735058598753, 'samples': 17224128, 'steps': 89708, 'loss/train': 1.5912480354309082} 11/07/2021 09:48:44 - INFO - __main__ - Step 89710: {'lr': 0.00017827226693554, 'samples': 17224320, 'steps': 89709, 'loss/train': 1.6117013692855835} 11/07/2021 09:48:45 - INFO - __main__ - Step 89711: {'lr': 0.0001782671833174119, 'samples': 17224512, 'steps': 89710, 'loss/train': 1.1706124544143677} 11/07/2021 09:48:45 - INFO - __main__ - Step 89712: {'lr': 0.0001782620997316055, 'samples': 17224704, 'steps': 89711, 'loss/train': 1.2923493385314941} 11/07/2021 09:48:46 - INFO - __main__ - Step 89713: {'lr': 0.00017825701617812308, 'samples': 17224896, 'steps': 89712, 'loss/train': 0.6951799392700195} 11/07/2021 09:48:46 - INFO - __main__ - Step 89714: {'lr': 0.00017825193265696694, 'samples': 17225088, 'steps': 89713, 'loss/train': 1.0713175535202026} 11/07/2021 09:48:47 - INFO - __main__ - Step 89715: {'lr': 0.00017824684916813938, 'samples': 17225280, 'steps': 89714, 'loss/train': 1.3884087800979614} 11/07/2021 09:48:47 - INFO - __main__ - Step 89716: {'lr': 0.00017824176571164265, 'samples': 17225472, 'steps': 89715, 'loss/train': 1.256530523300171} 11/07/2021 09:48:47 - INFO - __main__ - Step 89717: {'lr': 0.0001782366822874791, 'samples': 17225664, 'steps': 89716, 'loss/train': 1.819375991821289} 11/07/2021 09:48:48 - INFO - __main__ - Step 89718: {'lr': 0.000178231598895651, 'samples': 17225856, 'steps': 89717, 'loss/train': 1.3524315357208252} 11/07/2021 09:48:49 - INFO - __main__ - Step 89719: {'lr': 0.00017822651553616063, 'samples': 17226048, 'steps': 89718, 'loss/train': 1.5454442501068115} 11/07/2021 09:48:49 - INFO - __main__ - Step 89720: {'lr': 0.00017822143220901034, 'samples': 17226240, 'steps': 89719, 'loss/train': 1.2337790727615356} 11/07/2021 09:48:49 - INFO - __main__ - Step 89721: {'lr': 0.0001782163489142023, 'samples': 17226432, 'steps': 89720, 'loss/train': 1.6018708944320679} 11/07/2021 09:48:50 - INFO - __main__ - Step 89722: {'lr': 0.0001782112656517389, 'samples': 17226624, 'steps': 89721, 'loss/train': 1.5762075185775757} 11/07/2021 09:48:50 - INFO - __main__ - Step 89723: {'lr': 0.00017820618242162238, 'samples': 17226816, 'steps': 89722, 'loss/train': 1.1480458974838257} 11/07/2021 09:48:51 - INFO - __main__ - Step 89724: {'lr': 0.00017820109922385503, 'samples': 17227008, 'steps': 89723, 'loss/train': 0.8585368394851685} 11/07/2021 09:48:51 - INFO - __main__ - Step 89725: {'lr': 0.00017819601605843915, 'samples': 17227200, 'steps': 89724, 'loss/train': 2.0061638355255127} 11/07/2021 09:48:52 - INFO - __main__ - Step 89726: {'lr': 0.00017819093292537706, 'samples': 17227392, 'steps': 89725, 'loss/train': 1.5241330862045288} 11/07/2021 09:48:52 - INFO - __main__ - Step 89727: {'lr': 0.000178185849824671, 'samples': 17227584, 'steps': 89726, 'loss/train': 1.0420849323272705} 11/07/2021 09:48:53 - INFO - __main__ - Step 89728: {'lr': 0.00017818076675632334, 'samples': 17227776, 'steps': 89727, 'loss/train': 0.6910235285758972} 11/07/2021 09:48:53 - INFO - __main__ - Step 89729: {'lr': 0.00017817568372033627, 'samples': 17227968, 'steps': 89728, 'loss/train': 1.3380464315414429} 11/07/2021 09:48:54 - INFO - __main__ - Step 89730: {'lr': 0.00017817060071671212, 'samples': 17228160, 'steps': 89729, 'loss/train': 1.7351224422454834} 11/07/2021 09:48:54 - INFO - __main__ - Step 89731: {'lr': 0.00017816551774545327, 'samples': 17228352, 'steps': 89730, 'loss/train': 1.4555199146270752} 11/07/2021 09:48:54 - INFO - __main__ - Step 89732: {'lr': 0.00017816043480656186, 'samples': 17228544, 'steps': 89731, 'loss/train': 1.2807353734970093} 11/07/2021 09:48:55 - INFO - __main__ - Step 89733: {'lr': 0.00017815535190004027, 'samples': 17228736, 'steps': 89732, 'loss/train': 0.9715684652328491} 11/07/2021 09:48:56 - INFO - __main__ - Step 89734: {'lr': 0.00017815026902589075, 'samples': 17228928, 'steps': 89733, 'loss/train': 1.4299688339233398} 11/07/2021 09:48:56 - INFO - __main__ - Step 89735: {'lr': 0.00017814518618411567, 'samples': 17229120, 'steps': 89734, 'loss/train': 1.6128370761871338} 11/07/2021 09:48:57 - INFO - __main__ - Step 89736: {'lr': 0.0001781401033747172, 'samples': 17229312, 'steps': 89735, 'loss/train': 1.754528522491455} 11/07/2021 09:48:57 - INFO - __main__ - Step 89737: {'lr': 0.0001781350205976977, 'samples': 17229504, 'steps': 89736, 'loss/train': 1.344884991645813} 11/07/2021 09:48:58 - INFO - __main__ - Step 89738: {'lr': 0.00017812993785305944, 'samples': 17229696, 'steps': 89737, 'loss/train': 1.0263642072677612} 11/07/2021 09:48:58 - INFO - __main__ - Step 89739: {'lr': 0.00017812485514080473, 'samples': 17229888, 'steps': 89738, 'loss/train': 1.5388085842132568} 11/07/2021 09:48:59 - INFO - __main__ - Step 89740: {'lr': 0.00017811977246093587, 'samples': 17230080, 'steps': 89739, 'loss/train': 1.4636379480361938} 11/07/2021 09:48:59 - INFO - __main__ - Step 89741: {'lr': 0.00017811468981345508, 'samples': 17230272, 'steps': 89740, 'loss/train': 1.2819452285766602} 11/07/2021 09:49:00 - INFO - __main__ - Step 89742: {'lr': 0.0001781096071983648, 'samples': 17230464, 'steps': 89741, 'loss/train': 1.476144552230835} 11/07/2021 09:49:00 - INFO - __main__ - Step 89743: {'lr': 0.00017810452461566718, 'samples': 17230656, 'steps': 89742, 'loss/train': 1.8330003023147583} 11/07/2021 09:49:00 - INFO - __main__ - Step 89744: {'lr': 0.0001780994420653645, 'samples': 17230848, 'steps': 89743, 'loss/train': 1.4666963815689087} 11/07/2021 09:49:01 - INFO - __main__ - Step 89745: {'lr': 0.0001780943595474591, 'samples': 17231040, 'steps': 89744, 'loss/train': 0.653899610042572} 11/07/2021 09:49:02 - INFO - __main__ - Step 89746: {'lr': 0.00017808927706195333, 'samples': 17231232, 'steps': 89745, 'loss/train': 1.8401695489883423} 11/07/2021 09:49:02 - INFO - __main__ - Step 89747: {'lr': 0.0001780841946088494, 'samples': 17231424, 'steps': 89746, 'loss/train': 1.6200566291809082} 11/07/2021 09:49:03 - INFO - __main__ - Step 89748: {'lr': 0.0001780791121881496, 'samples': 17231616, 'steps': 89747, 'loss/train': 1.4869425296783447} 11/07/2021 09:49:03 - INFO - __main__ - Step 89749: {'lr': 0.0001780740297998563, 'samples': 17231808, 'steps': 89748, 'loss/train': 1.435142159461975} 11/07/2021 09:49:04 - INFO - __main__ - Step 89750: {'lr': 0.00017806894744397172, 'samples': 17232000, 'steps': 89749, 'loss/train': 1.4578895568847656} 11/07/2021 09:49:04 - INFO - __main__ - Step 89751: {'lr': 0.0001780638651204981, 'samples': 17232192, 'steps': 89750, 'loss/train': 1.3922677040100098} 11/07/2021 09:49:05 - INFO - __main__ - Step 89752: {'lr': 0.00017805878282943784, 'samples': 17232384, 'steps': 89751, 'loss/train': 1.4252490997314453} 11/07/2021 09:49:05 - INFO - __main__ - Step 89753: {'lr': 0.00017805370057079323, 'samples': 17232576, 'steps': 89752, 'loss/train': 0.9214276075363159} 11/07/2021 09:49:05 - INFO - __main__ - Step 89754: {'lr': 0.00017804861834456643, 'samples': 17232768, 'steps': 89753, 'loss/train': 1.1506317853927612} 11/07/2021 09:49:06 - INFO - __main__ - Step 89755: {'lr': 0.00017804353615075985, 'samples': 17232960, 'steps': 89754, 'loss/train': 0.6237203478813171} 11/07/2021 09:49:07 - INFO - __main__ - Step 89756: {'lr': 0.00017803845398937573, 'samples': 17233152, 'steps': 89755, 'loss/train': 1.908238172531128} 11/07/2021 09:49:07 - INFO - __main__ - Step 89757: {'lr': 0.00017803337186041634, 'samples': 17233344, 'steps': 89756, 'loss/train': 1.4110808372497559} 11/07/2021 09:49:07 - INFO - __main__ - Step 89758: {'lr': 0.00017802828976388403, 'samples': 17233536, 'steps': 89757, 'loss/train': 1.2927924394607544} 11/07/2021 09:49:08 - INFO - __main__ - Step 89759: {'lr': 0.0001780232076997811, 'samples': 17233728, 'steps': 89758, 'loss/train': 1.4293676614761353} 11/07/2021 09:49:08 - INFO - __main__ - Step 89760: {'lr': 0.00017801812566810974, 'samples': 17233920, 'steps': 89759, 'loss/train': 1.4618797302246094} 11/07/2021 09:49:09 - INFO - __main__ - Step 89761: {'lr': 0.00017801304366887234, 'samples': 17234112, 'steps': 89760, 'loss/train': 1.4995694160461426} 11/07/2021 09:49:09 - INFO - __main__ - Step 89762: {'lr': 0.0001780079617020712, 'samples': 17234304, 'steps': 89761, 'loss/train': 1.6205663681030273} 11/07/2021 09:49:10 - INFO - __main__ - Step 89763: {'lr': 0.00017800287976770847, 'samples': 17234496, 'steps': 89762, 'loss/train': 1.486822247505188} 11/07/2021 09:49:10 - INFO - __main__ - Step 89764: {'lr': 0.0001779977978657866, 'samples': 17234688, 'steps': 89763, 'loss/train': 1.5636188983917236} 11/07/2021 09:49:10 - INFO - __main__ - Step 89765: {'lr': 0.0001779927159963078, 'samples': 17234880, 'steps': 89764, 'loss/train': 1.1595499515533447} 11/07/2021 09:49:12 - INFO - __main__ - Step 89766: {'lr': 0.00017798763415927433, 'samples': 17235072, 'steps': 89765, 'loss/train': 1.1939753293991089} 11/07/2021 09:49:13 - INFO - __main__ - Step 89767: {'lr': 0.00017798255235468852, 'samples': 17235264, 'steps': 89766, 'loss/train': 0.13199588656425476} 11/07/2021 09:49:13 - INFO - __main__ - Step 89768: {'lr': 0.00017797747058255264, 'samples': 17235456, 'steps': 89767, 'loss/train': 1.3189843893051147} 11/07/2021 09:49:13 - INFO - __main__ - Step 89769: {'lr': 0.00017797238884286905, 'samples': 17235648, 'steps': 89768, 'loss/train': 1.4449020624160767} 11/07/2021 09:49:14 - INFO - __main__ - Step 89770: {'lr': 0.00017796730713563996, 'samples': 17235840, 'steps': 89769, 'loss/train': 4.579737663269043} 11/07/2021 09:49:14 - INFO - __main__ - Step 89771: {'lr': 0.0001779622254608677, 'samples': 17236032, 'steps': 89770, 'loss/train': 4.476839542388916} 11/07/2021 09:49:14 - INFO - __main__ - Step 89772: {'lr': 0.00017795714381855458, 'samples': 17236224, 'steps': 89771, 'loss/train': 4.788177013397217} 11/07/2021 09:49:15 - INFO - __main__ - Step 89773: {'lr': 0.0001779520622087028, 'samples': 17236416, 'steps': 89772, 'loss/train': 1.57176673412323} 11/07/2021 09:49:16 - INFO - __main__ - Step 89774: {'lr': 0.00017794698063131476, 'samples': 17236608, 'steps': 89773, 'loss/train': 1.5875120162963867} 11/07/2021 09:49:16 - INFO - __main__ - Step 89775: {'lr': 0.0001779418990863927, 'samples': 17236800, 'steps': 89774, 'loss/train': 1.1498384475708008} 11/07/2021 09:49:16 - INFO - __main__ - Step 89776: {'lr': 0.00017793681757393898, 'samples': 17236992, 'steps': 89775, 'loss/train': 1.314529299736023} 11/07/2021 09:49:17 - INFO - __main__ - Step 89777: {'lr': 0.00017793173609395571, 'samples': 17237184, 'steps': 89776, 'loss/train': 5.829163074493408} 11/07/2021 09:49:17 - INFO - __main__ - Step 89778: {'lr': 0.0001779266546464453, 'samples': 17237376, 'steps': 89777, 'loss/train': 1.665595293045044} 11/07/2021 09:49:18 - INFO - __main__ - Step 89779: {'lr': 0.00017792157323141003, 'samples': 17237568, 'steps': 89778, 'loss/train': 1.626258373260498} 11/07/2021 09:49:19 - INFO - __main__ - Step 89780: {'lr': 0.0001779164918488522, 'samples': 17237760, 'steps': 89779, 'loss/train': 1.1320421695709229} 11/07/2021 09:49:19 - INFO - __main__ - Step 89781: {'lr': 0.00017791141049877408, 'samples': 17237952, 'steps': 89780, 'loss/train': 1.0377458333969116} 11/07/2021 09:49:19 - INFO - __main__ - Step 89782: {'lr': 0.00017790632918117795, 'samples': 17238144, 'steps': 89781, 'loss/train': 0.9421239495277405} 11/07/2021 09:49:20 - INFO - __main__ - Step 89783: {'lr': 0.00017790124789606612, 'samples': 17238336, 'steps': 89782, 'loss/train': 0.9052952527999878} 11/07/2021 09:49:21 - INFO - __main__ - Step 89784: {'lr': 0.0001778961666434409, 'samples': 17238528, 'steps': 89783, 'loss/train': 1.510336995124817} 11/07/2021 09:49:21 - INFO - __main__ - Step 89785: {'lr': 0.0001778910854233045, 'samples': 17238720, 'steps': 89784, 'loss/train': 1.4466753005981445} 11/07/2021 09:49:21 - INFO - __main__ - Step 89786: {'lr': 0.0001778860042356593, 'samples': 17238912, 'steps': 89785, 'loss/train': 1.2837269306182861} 11/07/2021 09:49:22 - INFO - __main__ - Step 89787: {'lr': 0.00017788092308050756, 'samples': 17239104, 'steps': 89786, 'loss/train': 0.8576130270957947} 11/07/2021 09:49:22 - INFO - __main__ - Step 89788: {'lr': 0.00017787584195785156, 'samples': 17239296, 'steps': 89787, 'loss/train': 5.856151103973389} 11/07/2021 09:49:22 - INFO - __main__ - Step 89789: {'lr': 0.00017787076086769372, 'samples': 17239488, 'steps': 89788, 'loss/train': 1.4702779054641724} 11/07/2021 09:49:24 - INFO - __main__ - Step 89790: {'lr': 0.00017786567981003604, 'samples': 17239680, 'steps': 89789, 'loss/train': 1.301589846611023} 11/07/2021 09:49:24 - INFO - __main__ - Step 89791: {'lr': 0.00017786059878488103, 'samples': 17239872, 'steps': 89790, 'loss/train': 1.5078662633895874} 11/07/2021 09:49:24 - INFO - __main__ - Step 89792: {'lr': 0.00017785551779223087, 'samples': 17240064, 'steps': 89791, 'loss/train': 1.5701414346694946} 11/07/2021 09:49:25 - INFO - __main__ - Step 89793: {'lr': 0.00017785043683208793, 'samples': 17240256, 'steps': 89792, 'loss/train': 1.319549322128296} 11/07/2021 09:49:25 - INFO - __main__ - Step 89794: {'lr': 0.00017784535590445447, 'samples': 17240448, 'steps': 89793, 'loss/train': 1.535132884979248} 11/07/2021 09:49:26 - INFO - __main__ - Step 89795: {'lr': 0.00017784027500933276, 'samples': 17240640, 'steps': 89794, 'loss/train': 1.435683250427246} 11/07/2021 09:49:27 - INFO - __main__ - Step 89796: {'lr': 0.0001778351941467251, 'samples': 17240832, 'steps': 89795, 'loss/train': 1.5555286407470703} 11/07/2021 09:49:27 - INFO - __main__ - Step 89797: {'lr': 0.00017783011331663385, 'samples': 17241024, 'steps': 89796, 'loss/train': 1.5774484872817993} 11/07/2021 09:49:27 - INFO - __main__ - Step 89798: {'lr': 0.00017782503251906117, 'samples': 17241216, 'steps': 89797, 'loss/train': 1.2296996116638184} 11/07/2021 09:49:28 - INFO - __main__ - Step 89799: {'lr': 0.00017781995175400944, 'samples': 17241408, 'steps': 89798, 'loss/train': 1.3144898414611816} 11/07/2021 09:49:28 - INFO - __main__ - Step 89800: {'lr': 0.00017781487102148095, 'samples': 17241600, 'steps': 89799, 'loss/train': 0.8428471088409424} 11/07/2021 09:49:29 - INFO - __main__ - Step 89801: {'lr': 0.00017780979032147793, 'samples': 17241792, 'steps': 89800, 'loss/train': 1.5718363523483276} 11/07/2021 09:49:29 - INFO - __main__ - Step 89802: {'lr': 0.0001778047096540027, 'samples': 17241984, 'steps': 89801, 'loss/train': 1.385088324546814} 11/07/2021 09:49:30 - INFO - __main__ - Step 89803: {'lr': 0.00017779962901905773, 'samples': 17242176, 'steps': 89802, 'loss/train': 1.2986116409301758} 11/07/2021 09:49:30 - INFO - __main__ - Step 89804: {'lr': 0.00017779454841664494, 'samples': 17242368, 'steps': 89803, 'loss/train': 2.152484178543091} 11/07/2021 09:49:30 - INFO - __main__ - Step 89805: {'lr': 0.00017778946784676686, 'samples': 17242560, 'steps': 89804, 'loss/train': 1.3399296998977661} 11/07/2021 09:49:31 - INFO - __main__ - Step 89806: {'lr': 0.0001777843873094257, 'samples': 17242752, 'steps': 89805, 'loss/train': 1.513764500617981} 11/07/2021 09:49:32 - INFO - __main__ - Step 89807: {'lr': 0.00017777930680462381, 'samples': 17242944, 'steps': 89806, 'loss/train': 1.7939358949661255} 11/07/2021 09:49:32 - INFO - __main__ - Step 89808: {'lr': 0.00017777422633236345, 'samples': 17243136, 'steps': 89807, 'loss/train': 1.014028549194336} 11/07/2021 09:49:32 - INFO - __main__ - Step 89809: {'lr': 0.0001777691458926469, 'samples': 17243328, 'steps': 89808, 'loss/train': 1.6199281215667725} 11/07/2021 09:49:33 - INFO - __main__ - Step 89810: {'lr': 0.00017776406548547646, 'samples': 17243520, 'steps': 89809, 'loss/train': 0.8441510796546936} 11/07/2021 09:49:34 - INFO - __main__ - Step 89811: {'lr': 0.0001777589851108544, 'samples': 17243712, 'steps': 89810, 'loss/train': 1.2857531309127808} 11/07/2021 09:49:34 - INFO - __main__ - Step 89812: {'lr': 0.00017775390476878306, 'samples': 17243904, 'steps': 89811, 'loss/train': 1.1193536520004272} 11/07/2021 09:49:34 - INFO - __main__ - Step 89813: {'lr': 0.00017774882445926465, 'samples': 17244096, 'steps': 89812, 'loss/train': 2.404735803604126} 11/07/2021 09:49:35 - INFO - __main__ - Step 89814: {'lr': 0.00017774374418230154, 'samples': 17244288, 'steps': 89813, 'loss/train': 1.5462465286254883} 11/07/2021 09:49:35 - INFO - __main__ - Step 89815: {'lr': 0.000177738663937896, 'samples': 17244480, 'steps': 89814, 'loss/train': 1.4787704944610596} 11/07/2021 09:49:36 - INFO - __main__ - Step 89816: {'lr': 0.00017773358372605037, 'samples': 17244672, 'steps': 89815, 'loss/train': 1.4735933542251587} 11/07/2021 09:49:37 - INFO - __main__ - Step 89817: {'lr': 0.00017772850354676677, 'samples': 17244864, 'steps': 89816, 'loss/train': 1.7169196605682373} 11/07/2021 09:49:37 - INFO - __main__ - Step 89818: {'lr': 0.0001777234234000476, 'samples': 17245056, 'steps': 89817, 'loss/train': 1.411225438117981} 11/07/2021 09:49:37 - INFO - __main__ - Step 89819: {'lr': 0.00017771834328589515, 'samples': 17245248, 'steps': 89818, 'loss/train': 1.9968301057815552} 11/07/2021 09:49:38 - INFO - __main__ - Step 89820: {'lr': 0.0001777132632043117, 'samples': 17245440, 'steps': 89819, 'loss/train': 1.5287790298461914} 11/07/2021 09:49:38 - INFO - __main__ - Step 89821: {'lr': 0.00017770818315529952, 'samples': 17245632, 'steps': 89820, 'loss/train': 1.7712239027023315} 11/07/2021 09:49:39 - INFO - __main__ - Step 89822: {'lr': 0.00017770310313886093, 'samples': 17245824, 'steps': 89821, 'loss/train': 1.853467583656311} 11/07/2021 09:49:39 - INFO - __main__ - Step 89823: {'lr': 0.00017769802315499821, 'samples': 17246016, 'steps': 89822, 'loss/train': 0.5023411512374878} 11/07/2021 09:49:40 - INFO - __main__ - Step 89824: {'lr': 0.00017769294320371367, 'samples': 17246208, 'steps': 89823, 'loss/train': 1.5197522640228271} 11/07/2021 09:49:40 - INFO - __main__ - Step 89825: {'lr': 0.00017768786328500953, 'samples': 17246400, 'steps': 89824, 'loss/train': 1.2389347553253174} 11/07/2021 09:49:40 - INFO - __main__ - Step 89826: {'lr': 0.0001776827833988881, 'samples': 17246592, 'steps': 89825, 'loss/train': 1.6892133951187134} 11/07/2021 09:49:41 - INFO - __main__ - Step 89827: {'lr': 0.00017767770354535172, 'samples': 17246784, 'steps': 89826, 'loss/train': 1.2788138389587402} 11/07/2021 09:49:42 - INFO - __main__ - Step 89828: {'lr': 0.0001776726237244027, 'samples': 17246976, 'steps': 89827, 'loss/train': 0.18104654550552368} 11/07/2021 09:49:42 - INFO - __main__ - Step 89829: {'lr': 0.00017766754393604332, 'samples': 17247168, 'steps': 89828, 'loss/train': 1.4988641738891602} 11/07/2021 09:49:42 - INFO - __main__ - Step 89830: {'lr': 0.00017766246418027574, 'samples': 17247360, 'steps': 89829, 'loss/train': 1.3376320600509644} 11/07/2021 09:49:43 - INFO - __main__ - Step 89831: {'lr': 0.00017765738445710234, 'samples': 17247552, 'steps': 89830, 'loss/train': 1.6625235080718994} 11/07/2021 09:49:44 - INFO - __main__ - Step 89832: {'lr': 0.00017765230476652542, 'samples': 17247744, 'steps': 89831, 'loss/train': 1.606542706489563} 11/07/2021 09:49:44 - INFO - __main__ - Step 89833: {'lr': 0.00017764722510854724, 'samples': 17247936, 'steps': 89832, 'loss/train': 0.8259205222129822} 11/07/2021 09:49:45 - INFO - __main__ - Step 89834: {'lr': 0.0001776421454831701, 'samples': 17248128, 'steps': 89833, 'loss/train': 1.1990429162979126} 11/07/2021 09:49:45 - INFO - __main__ - Step 89835: {'lr': 0.0001776370658903963, 'samples': 17248320, 'steps': 89834, 'loss/train': 0.8915497064590454} 11/07/2021 09:49:45 - INFO - __main__ - Step 89836: {'lr': 0.00017763198633022814, 'samples': 17248512, 'steps': 89835, 'loss/train': 1.282705307006836} 11/07/2021 09:49:46 - INFO - __main__ - Step 89837: {'lr': 0.00017762690680266785, 'samples': 17248704, 'steps': 89836, 'loss/train': 1.8551305532455444} 11/07/2021 09:49:47 - INFO - __main__ - Step 89838: {'lr': 0.00017762182730771781, 'samples': 17248896, 'steps': 89837, 'loss/train': 0.6759945154190063} 11/07/2021 09:49:47 - INFO - __main__ - Step 89839: {'lr': 0.0001776167478453802, 'samples': 17249088, 'steps': 89838, 'loss/train': 1.3936976194381714} 11/07/2021 09:49:47 - INFO - __main__ - Step 89840: {'lr': 0.0001776116684156574, 'samples': 17249280, 'steps': 89839, 'loss/train': 1.5292333364486694} 11/07/2021 09:49:48 - INFO - __main__ - Step 89841: {'lr': 0.00017760658901855167, 'samples': 17249472, 'steps': 89840, 'loss/train': 1.3863844871520996} 11/07/2021 09:49:49 - INFO - __main__ - Step 89842: {'lr': 0.00017760150965406528, 'samples': 17249664, 'steps': 89841, 'loss/train': 1.2191656827926636} 11/07/2021 09:49:49 - INFO - __main__ - Step 89843: {'lr': 0.00017759643032220064, 'samples': 17249856, 'steps': 89842, 'loss/train': 1.4402261972427368} 11/07/2021 09:49:49 - INFO - __main__ - Step 89844: {'lr': 0.00017759135102295983, 'samples': 17250048, 'steps': 89843, 'loss/train': 1.6846963167190552} 11/07/2021 09:49:50 - INFO - __main__ - Step 89845: {'lr': 0.00017758627175634524, 'samples': 17250240, 'steps': 89844, 'loss/train': 0.6543107032775879} 11/07/2021 09:49:50 - INFO - __main__ - Step 89846: {'lr': 0.00017758119252235914, 'samples': 17250432, 'steps': 89845, 'loss/train': 0.6523315906524658} 11/07/2021 09:49:50 - INFO - __main__ - Step 89847: {'lr': 0.00017757611332100388, 'samples': 17250624, 'steps': 89846, 'loss/train': 1.6046541929244995} 11/07/2021 09:49:51 - INFO - __main__ - Step 89848: {'lr': 0.00017757103415228168, 'samples': 17250816, 'steps': 89847, 'loss/train': 1.1688705682754517} 11/07/2021 09:49:52 - INFO - __main__ - Step 89849: {'lr': 0.00017756595501619484, 'samples': 17251008, 'steps': 89848, 'loss/train': 1.4357002973556519} 11/07/2021 09:49:52 - INFO - __main__ - Step 89850: {'lr': 0.00017756087591274566, 'samples': 17251200, 'steps': 89849, 'loss/train': 1.2643266916275024} 11/07/2021 09:49:52 - INFO - __main__ - Step 89851: {'lr': 0.00017755579684193646, 'samples': 17251392, 'steps': 89850, 'loss/train': 1.3657692670822144} 11/07/2021 09:49:53 - INFO - __main__ - Step 89852: {'lr': 0.00017755071780376953, 'samples': 17251584, 'steps': 89851, 'loss/train': 1.279445767402649} 11/07/2021 09:49:54 - INFO - __main__ - Step 89853: {'lr': 0.00017754563879824706, 'samples': 17251776, 'steps': 89852, 'loss/train': 1.4045307636260986} 11/07/2021 09:49:54 - INFO - __main__ - Step 89854: {'lr': 0.00017754055982537143, 'samples': 17251968, 'steps': 89853, 'loss/train': 1.0780327320098877} 11/07/2021 09:49:55 - INFO - __main__ - Step 89855: {'lr': 0.00017753548088514498, 'samples': 17252160, 'steps': 89854, 'loss/train': 1.5256935358047485} 11/07/2021 09:49:55 - INFO - __main__ - Step 89856: {'lr': 0.0001775304019775699, 'samples': 17252352, 'steps': 89855, 'loss/train': 1.6397008895874023} 11/07/2021 09:49:55 - INFO - __main__ - Step 89857: {'lr': 0.00017752532310264847, 'samples': 17252544, 'steps': 89856, 'loss/train': 1.0902831554412842} 11/07/2021 09:49:56 - INFO - __main__ - Step 89858: {'lr': 0.000177520244260383, 'samples': 17252736, 'steps': 89857, 'loss/train': 1.7558832168579102} 11/07/2021 09:49:57 - INFO - __main__ - Step 89859: {'lr': 0.00017751516545077577, 'samples': 17252928, 'steps': 89858, 'loss/train': 1.3646214008331299} 11/07/2021 09:49:57 - INFO - __main__ - Step 89860: {'lr': 0.0001775100866738291, 'samples': 17253120, 'steps': 89859, 'loss/train': 1.4814872741699219} 11/07/2021 09:49:57 - INFO - __main__ - Step 89861: {'lr': 0.00017750500792954526, 'samples': 17253312, 'steps': 89860, 'loss/train': 1.4032503366470337} 11/07/2021 09:49:58 - INFO - __main__ - Step 89862: {'lr': 0.00017749992921792658, 'samples': 17253504, 'steps': 89861, 'loss/train': 1.160263180732727} 11/07/2021 09:49:59 - INFO - __main__ - Step 89863: {'lr': 0.0001774948505389753, 'samples': 17253696, 'steps': 89862, 'loss/train': 1.3372572660446167} 11/07/2021 09:49:59 - INFO - __main__ - Step 89864: {'lr': 0.0001774897718926937, 'samples': 17253888, 'steps': 89863, 'loss/train': 1.1726338863372803} 11/07/2021 09:49:59 - INFO - __main__ - Step 89865: {'lr': 0.0001774846932790841, 'samples': 17254080, 'steps': 89864, 'loss/train': 1.3036773204803467} 11/07/2021 09:50:00 - INFO - __main__ - Step 89866: {'lr': 0.00017747961469814883, 'samples': 17254272, 'steps': 89865, 'loss/train': 1.4356290102005005} 11/07/2021 09:50:00 - INFO - __main__ - Step 89867: {'lr': 0.00017747453614989006, 'samples': 17254464, 'steps': 89866, 'loss/train': 1.4787065982818604} 11/07/2021 09:50:01 - INFO - __main__ - Step 89868: {'lr': 0.00017746945763431017, 'samples': 17254656, 'steps': 89867, 'loss/train': 2.35613751411438} 11/07/2021 09:50:02 - INFO - __main__ - Step 89869: {'lr': 0.00017746437915141142, 'samples': 17254848, 'steps': 89868, 'loss/train': 1.2149882316589355} 11/07/2021 09:50:02 - INFO - __main__ - Step 89870: {'lr': 0.00017745930070119616, 'samples': 17255040, 'steps': 89869, 'loss/train': 1.3189224004745483} 11/07/2021 09:50:02 - INFO - __main__ - Step 89871: {'lr': 0.00017745422228366653, 'samples': 17255232, 'steps': 89870, 'loss/train': 1.5655405521392822} 11/07/2021 09:50:03 - INFO - __main__ - Step 89872: {'lr': 0.00017744914389882495, 'samples': 17255424, 'steps': 89871, 'loss/train': 0.9821291565895081} 11/07/2021 09:50:03 - INFO - __main__ - Step 89873: {'lr': 0.00017744406554667363, 'samples': 17255616, 'steps': 89872, 'loss/train': 1.3761814832687378} 11/07/2021 09:50:05 - INFO - __main__ - Step 89874: {'lr': 0.0001774389872272149, 'samples': 17255808, 'steps': 89873, 'loss/train': 1.1963367462158203} 11/07/2021 09:50:05 - INFO - __main__ - Step 89875: {'lr': 0.00017743390894045107, 'samples': 17256000, 'steps': 89874, 'loss/train': 1.216734766960144} 11/07/2021 09:50:05 - INFO - __main__ - Step 89876: {'lr': 0.00017742883068638446, 'samples': 17256192, 'steps': 89875, 'loss/train': 1.4946261644363403} 11/07/2021 09:50:06 - INFO - __main__ - Step 89877: {'lr': 0.00017742375246501723, 'samples': 17256384, 'steps': 89876, 'loss/train': 1.5331735610961914} 11/07/2021 09:50:06 - INFO - __main__ - Step 89878: {'lr': 0.00017741867427635173, 'samples': 17256576, 'steps': 89877, 'loss/train': 1.3356537818908691} 11/07/2021 09:50:06 - INFO - __main__ - Step 89879: {'lr': 0.00017741359612039026, 'samples': 17256768, 'steps': 89878, 'loss/train': 0.8690239787101746} 11/07/2021 09:50:07 - INFO - __main__ - Step 89880: {'lr': 0.0001774085179971351, 'samples': 17256960, 'steps': 89879, 'loss/train': 1.0077075958251953} 11/07/2021 09:50:08 - INFO - __main__ - Step 89881: {'lr': 0.00017740343990658853, 'samples': 17257152, 'steps': 89880, 'loss/train': 0.7996180653572083} 11/07/2021 09:50:08 - INFO - __main__ - Step 89882: {'lr': 0.0001773983618487529, 'samples': 17257344, 'steps': 89881, 'loss/train': 1.4730104207992554} 11/07/2021 09:50:09 - INFO - __main__ - Step 89883: {'lr': 0.00017739328382363045, 'samples': 17257536, 'steps': 89882, 'loss/train': 1.0961439609527588} 11/07/2021 09:50:09 - INFO - __main__ - Step 89884: {'lr': 0.00017738820583122343, 'samples': 17257728, 'steps': 89883, 'loss/train': 1.8341783285140991} 11/07/2021 09:50:10 - INFO - __main__ - Step 89885: {'lr': 0.00017738312787153417, 'samples': 17257920, 'steps': 89884, 'loss/train': 1.3061949014663696} 11/07/2021 09:50:10 - INFO - __main__ - Step 89886: {'lr': 0.0001773780499445649, 'samples': 17258112, 'steps': 89885, 'loss/train': 1.6219329833984375} 11/07/2021 09:50:11 - INFO - __main__ - Step 89887: {'lr': 0.00017737297205031808, 'samples': 17258304, 'steps': 89886, 'loss/train': 0.8619177341461182} 11/07/2021 09:50:11 - INFO - __main__ - Step 89888: {'lr': 0.0001773678941887958, 'samples': 17258496, 'steps': 89887, 'loss/train': 1.236127495765686} 11/07/2021 09:50:11 - INFO - __main__ - Step 89889: {'lr': 0.00017736281636000043, 'samples': 17258688, 'steps': 89888, 'loss/train': 2.0071964263916016} 11/07/2021 09:50:13 - INFO - __main__ - Step 89890: {'lr': 0.00017735773856393424, 'samples': 17258880, 'steps': 89889, 'loss/train': 1.5155768394470215} 11/07/2021 09:50:13 - INFO - __main__ - Step 89891: {'lr': 0.00017735266080059955, 'samples': 17259072, 'steps': 89890, 'loss/train': 0.9538317918777466} 11/07/2021 09:50:14 - INFO - __main__ - Step 89892: {'lr': 0.00017734758306999862, 'samples': 17259264, 'steps': 89891, 'loss/train': 1.514686942100525} 11/07/2021 09:50:14 - INFO - __main__ - Step 89893: {'lr': 0.00017734250537213375, 'samples': 17259456, 'steps': 89892, 'loss/train': 1.840241551399231} 11/07/2021 09:50:14 - INFO - __main__ - Step 89894: {'lr': 0.00017733742770700722, 'samples': 17259648, 'steps': 89893, 'loss/train': 1.3677287101745605} 11/07/2021 09:50:15 - INFO - __main__ - Step 89895: {'lr': 0.00017733235007462135, 'samples': 17259840, 'steps': 89894, 'loss/train': 1.115918755531311} 11/07/2021 09:50:16 - INFO - __main__ - Step 89896: {'lr': 0.00017732727247497836, 'samples': 17260032, 'steps': 89895, 'loss/train': 0.48496100306510925} 11/07/2021 09:50:16 - INFO - __main__ - Step 89897: {'lr': 0.0001773221949080807, 'samples': 17260224, 'steps': 89896, 'loss/train': 1.700029730796814} 11/07/2021 09:50:16 - INFO - __main__ - Step 89898: {'lr': 0.00017731711737393048, 'samples': 17260416, 'steps': 89897, 'loss/train': 1.6326676607131958} 11/07/2021 09:50:17 - INFO - __main__ - Step 89899: {'lr': 0.00017731203987253, 'samples': 17260608, 'steps': 89898, 'loss/train': 1.0955549478530884} 11/07/2021 09:50:17 - INFO - __main__ - Step 89900: {'lr': 0.00017730696240388162, 'samples': 17260800, 'steps': 89899, 'loss/train': 1.492627739906311} 11/07/2021 09:50:18 - INFO - __main__ - Step 89901: {'lr': 0.00017730188496798755, 'samples': 17260992, 'steps': 89900, 'loss/train': 0.9964068531990051} 11/07/2021 09:50:19 - INFO - __main__ - Step 89902: {'lr': 0.00017729680756485016, 'samples': 17261184, 'steps': 89901, 'loss/train': 1.1184319257736206} 11/07/2021 09:50:19 - INFO - __main__ - Step 89903: {'lr': 0.0001772917301944717, 'samples': 17261376, 'steps': 89902, 'loss/train': 1.449904441833496} 11/07/2021 09:50:19 - INFO - __main__ - Step 89904: {'lr': 0.00017728665285685446, 'samples': 17261568, 'steps': 89903, 'loss/train': 1.4677015542984009} 11/07/2021 09:50:20 - INFO - __main__ - Step 89905: {'lr': 0.00017728157555200075, 'samples': 17261760, 'steps': 89904, 'loss/train': 1.2956745624542236} 11/07/2021 09:50:20 - INFO - __main__ - Step 89906: {'lr': 0.00017727649827991286, 'samples': 17261952, 'steps': 89905, 'loss/train': 1.0816222429275513} 11/07/2021 09:50:21 - INFO - __main__ - Step 89907: {'lr': 0.00017727142104059302, 'samples': 17262144, 'steps': 89906, 'loss/train': 1.041326642036438} 11/07/2021 09:50:22 - INFO - __main__ - Step 89908: {'lr': 0.00017726634383404355, 'samples': 17262336, 'steps': 89907, 'loss/train': 1.0245366096496582} 11/07/2021 09:50:22 - INFO - __main__ - Step 89909: {'lr': 0.00017726126666026677, 'samples': 17262528, 'steps': 89908, 'loss/train': 0.9831699728965759} 11/07/2021 09:50:22 - INFO - __main__ - Step 89910: {'lr': 0.00017725618951926504, 'samples': 17262720, 'steps': 89909, 'loss/train': 1.5157434940338135} 11/07/2021 09:50:23 - INFO - __main__ - Step 89911: {'lr': 0.00017725111241104045, 'samples': 17262912, 'steps': 89910, 'loss/train': 0.18663842976093292} 11/07/2021 09:50:24 - INFO - __main__ - Step 89912: {'lr': 0.00017724603533559536, 'samples': 17263104, 'steps': 89911, 'loss/train': 1.1760329008102417} 11/07/2021 09:50:24 - INFO - __main__ - Step 89913: {'lr': 0.0001772409582929321, 'samples': 17263296, 'steps': 89912, 'loss/train': 1.2431834936141968} 11/07/2021 09:50:25 - INFO - __main__ - Step 89914: {'lr': 0.00017723588128305297, 'samples': 17263488, 'steps': 89913, 'loss/train': 1.0598695278167725} 11/07/2021 09:50:25 - INFO - __main__ - Step 89915: {'lr': 0.00017723080430596017, 'samples': 17263680, 'steps': 89914, 'loss/train': 1.392132043838501} 11/07/2021 09:50:25 - INFO - __main__ - Step 89916: {'lr': 0.00017722572736165608, 'samples': 17263872, 'steps': 89915, 'loss/train': 1.3675527572631836} 11/07/2021 09:50:26 - INFO - __main__ - Step 89917: {'lr': 0.00017722065045014293, 'samples': 17264064, 'steps': 89916, 'loss/train': 1.3193786144256592} 11/07/2021 09:50:27 - INFO - __main__ - Step 89918: {'lr': 0.00017721557357142307, 'samples': 17264256, 'steps': 89917, 'loss/train': 0.8996821641921997} 11/07/2021 09:50:27 - INFO - __main__ - Step 89919: {'lr': 0.00017721049672549872, 'samples': 17264448, 'steps': 89918, 'loss/train': 1.358504295349121} 11/07/2021 09:50:27 - INFO - __main__ - Step 89920: {'lr': 0.0001772054199123722, 'samples': 17264640, 'steps': 89919, 'loss/train': 1.6403688192367554} 11/07/2021 09:50:28 - INFO - __main__ - Step 89921: {'lr': 0.0001772003431320458, 'samples': 17264832, 'steps': 89920, 'loss/train': 1.4679492712020874} 11/07/2021 09:50:28 - INFO - __main__ - Step 89922: {'lr': 0.00017719526638452184, 'samples': 17265024, 'steps': 89921, 'loss/train': 1.0828843116760254} 11/07/2021 09:50:29 - INFO - __main__ - Step 89923: {'lr': 0.0001771901896698025, 'samples': 17265216, 'steps': 89922, 'loss/train': 1.217231035232544} 11/07/2021 09:50:29 - INFO - __main__ - Step 89924: {'lr': 0.0001771851129878903, 'samples': 17265408, 'steps': 89923, 'loss/train': 1.5457277297973633} 11/07/2021 09:50:30 - INFO - __main__ - Step 89925: {'lr': 0.0001771800363387872, 'samples': 17265600, 'steps': 89924, 'loss/train': 1.2236417531967163} 11/07/2021 09:50:30 - INFO - __main__ - Step 89926: {'lr': 0.0001771749597224957, 'samples': 17265792, 'steps': 89925, 'loss/train': 1.8351911306381226} 11/07/2021 09:50:30 - INFO - __main__ - Step 89927: {'lr': 0.00017716988313901805, 'samples': 17265984, 'steps': 89926, 'loss/train': 1.2152936458587646} 11/07/2021 09:50:32 - INFO - __main__ - Step 89928: {'lr': 0.0001771648065883565, 'samples': 17266176, 'steps': 89927, 'loss/train': 1.6137595176696777} 11/07/2021 09:50:32 - INFO - __main__ - Step 89929: {'lr': 0.00017715973007051332, 'samples': 17266368, 'steps': 89928, 'loss/train': 1.2493375539779663} 11/07/2021 09:50:32 - INFO - __main__ - Step 89930: {'lr': 0.00017715465358549094, 'samples': 17266560, 'steps': 89929, 'loss/train': 1.4226113557815552} 11/07/2021 09:50:33 - INFO - __main__ - Step 89931: {'lr': 0.0001771495771332915, 'samples': 17266752, 'steps': 89930, 'loss/train': 0.8058826923370361} 11/07/2021 09:50:33 - INFO - __main__ - Step 89932: {'lr': 0.0001771445007139173, 'samples': 17266944, 'steps': 89931, 'loss/train': 5.768790245056152} 11/07/2021 09:50:33 - INFO - __main__ - Step 89933: {'lr': 0.0001771394243273707, 'samples': 17267136, 'steps': 89932, 'loss/train': 1.8183683156967163} 11/07/2021 09:50:34 - INFO - __main__ - Step 89934: {'lr': 0.00017713434797365398, 'samples': 17267328, 'steps': 89933, 'loss/train': 1.2077680826187134} 11/07/2021 09:50:35 - INFO - __main__ - Step 89935: {'lr': 0.00017712927165276933, 'samples': 17267520, 'steps': 89934, 'loss/train': 1.0330177545547485} 11/07/2021 09:50:35 - INFO - __main__ - Step 89936: {'lr': 0.00017712419536471916, 'samples': 17267712, 'steps': 89935, 'loss/train': 1.5937128067016602} 11/07/2021 09:50:35 - INFO - __main__ - Step 89937: {'lr': 0.00017711911910950578, 'samples': 17267904, 'steps': 89936, 'loss/train': 1.6439497470855713} 11/07/2021 09:50:36 - INFO - __main__ - Step 89938: {'lr': 0.00017711404288713134, 'samples': 17268096, 'steps': 89937, 'loss/train': 1.5018342733383179} 11/07/2021 09:50:37 - INFO - __main__ - Step 89939: {'lr': 0.00017710896669759812, 'samples': 17268288, 'steps': 89938, 'loss/train': 1.8737963438034058} 11/07/2021 09:50:37 - INFO - __main__ - Step 89940: {'lr': 0.00017710389054090853, 'samples': 17268480, 'steps': 89939, 'loss/train': 1.0923309326171875} 11/07/2021 09:50:37 - INFO - __main__ - Step 89941: {'lr': 0.00017709881441706476, 'samples': 17268672, 'steps': 89940, 'loss/train': 1.4815845489501953} 11/07/2021 09:50:38 - INFO - __main__ - Step 89942: {'lr': 0.00017709373832606917, 'samples': 17268864, 'steps': 89941, 'loss/train': 1.617969036102295} 11/07/2021 09:50:38 - INFO - __main__ - Step 89943: {'lr': 0.00017708866226792404, 'samples': 17269056, 'steps': 89942, 'loss/train': 1.4225181341171265} 11/07/2021 09:50:39 - INFO - __main__ - Step 89944: {'lr': 0.00017708358624263156, 'samples': 17269248, 'steps': 89943, 'loss/train': 1.014405369758606} 11/07/2021 09:50:40 - INFO - __main__ - Step 89945: {'lr': 0.00017707851025019415, 'samples': 17269440, 'steps': 89944, 'loss/train': 1.8102847337722778} 11/07/2021 09:50:40 - INFO - __main__ - Step 89946: {'lr': 0.000177073434290614, 'samples': 17269632, 'steps': 89945, 'loss/train': 0.8608522415161133} 11/07/2021 09:50:40 - INFO - __main__ - Step 89947: {'lr': 0.00017706835836389344, 'samples': 17269824, 'steps': 89946, 'loss/train': 1.46830415725708} 11/07/2021 09:50:41 - INFO - __main__ - Step 89948: {'lr': 0.00017706328247003478, 'samples': 17270016, 'steps': 89947, 'loss/train': 1.393592357635498} 11/07/2021 09:50:41 - INFO - __main__ - Step 89949: {'lr': 0.0001770582066090403, 'samples': 17270208, 'steps': 89948, 'loss/train': 1.431085467338562} 11/07/2021 09:50:42 - INFO - __main__ - Step 89950: {'lr': 0.00017705313078091235, 'samples': 17270400, 'steps': 89949, 'loss/train': 1.383586049079895} 11/07/2021 09:50:43 - INFO - __main__ - Step 89951: {'lr': 0.00017704805498565298, 'samples': 17270592, 'steps': 89950, 'loss/train': 1.8799470663070679} 11/07/2021 09:50:43 - INFO - __main__ - Step 89952: {'lr': 0.00017704297922326468, 'samples': 17270784, 'steps': 89951, 'loss/train': 1.1142017841339111} 11/07/2021 09:50:43 - INFO - __main__ - Step 89953: {'lr': 0.00017703790349374968, 'samples': 17270976, 'steps': 89952, 'loss/train': 1.5270329713821411} 11/07/2021 09:50:44 - INFO - __main__ - Step 89954: {'lr': 0.00017703282779711027, 'samples': 17271168, 'steps': 89953, 'loss/train': 0.5398431420326233} 11/07/2021 09:50:45 - INFO - __main__ - Step 89955: {'lr': 0.00017702775213334872, 'samples': 17271360, 'steps': 89954, 'loss/train': 1.2683788537979126} 11/07/2021 09:50:45 - INFO - __main__ - Step 89956: {'lr': 0.0001770226765024674, 'samples': 17271552, 'steps': 89955, 'loss/train': 1.3644452095031738} 11/07/2021 09:50:45 - INFO - __main__ - Step 89957: {'lr': 0.00017701760090446848, 'samples': 17271744, 'steps': 89956, 'loss/train': 1.4240325689315796} 11/07/2021 09:50:46 - INFO - __main__ - Step 89958: {'lr': 0.0001770125253393543, 'samples': 17271936, 'steps': 89957, 'loss/train': 1.3485400676727295} 11/07/2021 09:50:46 - INFO - __main__ - Step 89959: {'lr': 0.0001770074498071272, 'samples': 17272128, 'steps': 89958, 'loss/train': 1.1604827642440796} 11/07/2021 09:50:47 - INFO - __main__ - Step 89960: {'lr': 0.00017700237430778938, 'samples': 17272320, 'steps': 89959, 'loss/train': 1.388231635093689} 11/07/2021 09:50:47 - INFO - __main__ - Step 89961: {'lr': 0.00017699729884134316, 'samples': 17272512, 'steps': 89960, 'loss/train': 1.1480093002319336} 11/07/2021 09:50:48 - INFO - __main__ - Step 89962: {'lr': 0.00017699222340779083, 'samples': 17272704, 'steps': 89961, 'loss/train': 1.535638689994812} 11/07/2021 09:50:48 - INFO - __main__ - Step 89963: {'lr': 0.00017698714800713468, 'samples': 17272896, 'steps': 89962, 'loss/train': 1.5690020322799683} 11/07/2021 09:50:49 - INFO - __main__ - Step 89964: {'lr': 0.00017698207263937713, 'samples': 17273088, 'steps': 89963, 'loss/train': 1.810302972793579} 11/07/2021 09:50:49 - INFO - __main__ - Step 89965: {'lr': 0.0001769769973045202, 'samples': 17273280, 'steps': 89964, 'loss/train': 1.5763235092163086} 11/07/2021 09:50:50 - INFO - __main__ - Step 89966: {'lr': 0.0001769719220025663, 'samples': 17273472, 'steps': 89965, 'loss/train': 1.452532172203064} 11/07/2021 09:50:50 - INFO - __main__ - Step 89967: {'lr': 0.00017696684673351777, 'samples': 17273664, 'steps': 89966, 'loss/train': 1.6556389331817627} 11/07/2021 09:50:51 - INFO - __main__ - Step 89968: {'lr': 0.0001769617714973768, 'samples': 17273856, 'steps': 89967, 'loss/train': 1.4992914199829102} 11/07/2021 09:50:51 - INFO - __main__ - Step 89969: {'lr': 0.00017695669629414575, 'samples': 17274048, 'steps': 89968, 'loss/train': 1.5173259973526} 11/07/2021 09:50:51 - INFO - __main__ - Step 89970: {'lr': 0.00017695162112382689, 'samples': 17274240, 'steps': 89969, 'loss/train': 1.442929744720459} 11/07/2021 09:50:52 - INFO - __main__ - Step 89971: {'lr': 0.00017694654598642248, 'samples': 17274432, 'steps': 89970, 'loss/train': 1.760441541671753} 11/07/2021 09:50:53 - INFO - __main__ - Step 89972: {'lr': 0.00017694147088193486, 'samples': 17274624, 'steps': 89971, 'loss/train': 1.2898415327072144} 11/07/2021 09:50:53 - INFO - __main__ - Step 89973: {'lr': 0.00017693639581036624, 'samples': 17274816, 'steps': 89972, 'loss/train': 1.4865341186523438} 11/07/2021 09:50:53 - INFO - __main__ - Step 89974: {'lr': 0.000176931320771719, 'samples': 17275008, 'steps': 89973, 'loss/train': 1.390205979347229} 11/07/2021 09:50:54 - INFO - __main__ - Step 89975: {'lr': 0.00017692624576599536, 'samples': 17275200, 'steps': 89974, 'loss/train': 1.829337239265442} 11/07/2021 09:50:55 - INFO - __main__ - Step 89976: {'lr': 0.00017692117079319764, 'samples': 17275392, 'steps': 89975, 'loss/train': 1.194107174873352} 11/07/2021 09:50:55 - INFO - __main__ - Step 89977: {'lr': 0.00017691609585332818, 'samples': 17275584, 'steps': 89976, 'loss/train': 1.2266851663589478} 11/07/2021 09:50:55 - INFO - __main__ - Step 89978: {'lr': 0.00017691102094638913, 'samples': 17275776, 'steps': 89977, 'loss/train': 1.2138481140136719} 11/07/2021 09:50:56 - INFO - __main__ - Step 89979: {'lr': 0.00017690594607238286, 'samples': 17275968, 'steps': 89978, 'loss/train': 1.407800316810608} 11/07/2021 09:50:56 - INFO - __main__ - Step 89980: {'lr': 0.0001769008712313116, 'samples': 17276160, 'steps': 89979, 'loss/train': 1.438555121421814} 11/07/2021 09:50:57 - INFO - __main__ - Step 89981: {'lr': 0.00017689579642317773, 'samples': 17276352, 'steps': 89980, 'loss/train': 1.883653163909912} 11/07/2021 09:50:58 - INFO - __main__ - Step 89982: {'lr': 0.00017689072164798342, 'samples': 17276544, 'steps': 89981, 'loss/train': 1.6042377948760986} 11/07/2021 09:50:58 - INFO - __main__ - Step 89983: {'lr': 0.00017688564690573105, 'samples': 17276736, 'steps': 89982, 'loss/train': 0.4127095341682434} 11/07/2021 09:50:58 - INFO - __main__ - Step 89984: {'lr': 0.0001768805721964229, 'samples': 17276928, 'steps': 89983, 'loss/train': 1.374071478843689} 11/07/2021 09:50:59 - INFO - __main__ - Step 89985: {'lr': 0.0001768754975200612, 'samples': 17277120, 'steps': 89984, 'loss/train': 1.3469277620315552} 11/07/2021 09:51:00 - INFO - __main__ - Step 89986: {'lr': 0.00017687042287664834, 'samples': 17277312, 'steps': 89985, 'loss/train': 0.32809096574783325} 11/07/2021 09:51:00 - INFO - __main__ - Step 89987: {'lr': 0.00017686534826618646, 'samples': 17277504, 'steps': 89986, 'loss/train': 0.7066847085952759} 11/07/2021 09:51:01 - INFO - __main__ - Step 89988: {'lr': 0.00017686027368867796, 'samples': 17277696, 'steps': 89987, 'loss/train': 1.694815754890442} 11/07/2021 09:51:01 - INFO - __main__ - Step 89989: {'lr': 0.00017685519914412517, 'samples': 17277888, 'steps': 89988, 'loss/train': 1.3419655561447144} 11/07/2021 09:51:01 - INFO - __main__ - Step 89990: {'lr': 0.0001768501246325302, 'samples': 17278080, 'steps': 89989, 'loss/train': 1.6906057596206665} 11/07/2021 09:51:02 - INFO - __main__ - Step 89991: {'lr': 0.00017684505015389551, 'samples': 17278272, 'steps': 89990, 'loss/train': 1.4361088275909424} 11/07/2021 09:51:02 - INFO - __main__ - Step 89992: {'lr': 0.00017683997570822326, 'samples': 17278464, 'steps': 89991, 'loss/train': 0.4054669141769409} 11/07/2021 09:51:03 - INFO - __main__ - Step 89993: {'lr': 0.00017683490129551577, 'samples': 17278656, 'steps': 89992, 'loss/train': 1.3594752550125122} 11/07/2021 09:51:03 - INFO - __main__ - Step 89994: {'lr': 0.00017682982691577537, 'samples': 17278848, 'steps': 89993, 'loss/train': 1.4408957958221436} 11/07/2021 09:51:04 - INFO - __main__ - Step 89995: {'lr': 0.00017682475256900433, 'samples': 17279040, 'steps': 89994, 'loss/train': 1.2575799226760864} 11/07/2021 09:51:04 - INFO - __main__ - Step 89996: {'lr': 0.0001768196782552049, 'samples': 17279232, 'steps': 89995, 'loss/train': 1.090964913368225} 11/07/2021 09:51:05 - INFO - __main__ - Step 89997: {'lr': 0.0001768146039743794, 'samples': 17279424, 'steps': 89996, 'loss/train': 1.5968022346496582} 11/07/2021 09:51:05 - INFO - __main__ - Step 89998: {'lr': 0.0001768095297265301, 'samples': 17279616, 'steps': 89997, 'loss/train': 1.2839361429214478} 11/07/2021 09:51:06 - INFO - __main__ - Step 89999: {'lr': 0.0001768044555116593, 'samples': 17279808, 'steps': 89998, 'loss/train': 1.4773166179656982} 11/07/2021 09:51:06 - INFO - __main__ - Step 90000: {'lr': 0.00017679938132976936, 'samples': 17280000, 'steps': 89999, 'loss/train': 1.3141337633132935} 11/07/2021 09:51:06 - INFO - __main__ - Evaluating and saving model checkpoint 11/07/2021 09:54:18 - INFO - __main__ - Step 90000: {'loss/eval': 1.3116109371185303, 'perplexity': 3.712148904800415} 11/07/2021 09:54:34 - WARNING - huggingface_hub.repository - Several commits (6) will be pushed upstream. 11/07/2021 09:54:34 - WARNING - huggingface_hub.repository - The progress bars may be unreliable. 11/07/2021 09:55:00 - WARNING - huggingface_hub.repository - To https://huggingface.co/lvwerra/codeparrot-small c8a25e7..73e0bf4 proud-haze-135 -> proud-haze-135 11/07/2021 09:55:01 - INFO - __main__ - Step 90001: {'lr': 0.00017679430718086243, 'samples': 17280192, 'steps': 90000, 'loss/train': 1.2001985311508179} 11/07/2021 09:55:02 - INFO - __main__ - Step 90002: {'lr': 0.00017678923306494083, 'samples': 17280384, 'steps': 90001, 'loss/train': 1.4582816362380981} 11/07/2021 09:55:03 - INFO - __main__ - Step 90003: {'lr': 0.0001767841589820069, 'samples': 17280576, 'steps': 90002, 'loss/train': 1.2628333568572998} 11/07/2021 09:55:03 - INFO - __main__ - Step 90004: {'lr': 0.00017677908493206294, 'samples': 17280768, 'steps': 90003, 'loss/train': 1.9823577404022217} 11/07/2021 09:55:03 - INFO - __main__ - Step 90005: {'lr': 0.00017677401091511114, 'samples': 17280960, 'steps': 90004, 'loss/train': 0.14744818210601807} 11/07/2021 09:55:04 - INFO - __main__ - Step 90006: {'lr': 0.00017676893693115384, 'samples': 17281152, 'steps': 90005, 'loss/train': 1.055541753768921} 11/07/2021 09:55:05 - INFO - __main__ - Step 90007: {'lr': 0.0001767638629801933, 'samples': 17281344, 'steps': 90006, 'loss/train': 1.3933331966400146} 11/07/2021 09:55:05 - INFO - __main__ - Step 90008: {'lr': 0.0001767587890622319, 'samples': 17281536, 'steps': 90007, 'loss/train': 1.4732543230056763} 11/07/2021 09:55:06 - INFO - __main__ - Step 90009: {'lr': 0.0001767537151772718, 'samples': 17281728, 'steps': 90008, 'loss/train': 5.783237457275391} 11/07/2021 09:55:06 - INFO - __main__ - Step 90010: {'lr': 0.00017674864132531537, 'samples': 17281920, 'steps': 90009, 'loss/train': 1.5716447830200195} 11/07/2021 09:55:06 - INFO - __main__ - Step 90011: {'lr': 0.00017674356750636494, 'samples': 17282112, 'steps': 90010, 'loss/train': 1.4505771398544312} 11/07/2021 09:55:07 - INFO - __main__ - Step 90012: {'lr': 0.00017673849372042263, 'samples': 17282304, 'steps': 90011, 'loss/train': 0.7903847694396973} 11/07/2021 09:55:07 - INFO - __main__ - Step 90013: {'lr': 0.00017673341996749087, 'samples': 17282496, 'steps': 90012, 'loss/train': 1.962886929512024} 11/07/2021 09:55:08 - INFO - __main__ - Step 90014: {'lr': 0.0001767283462475719, 'samples': 17282688, 'steps': 90013, 'loss/train': 5.6889801025390625} 11/07/2021 09:55:09 - INFO - __main__ - Step 90015: {'lr': 0.00017672327256066796, 'samples': 17282880, 'steps': 90014, 'loss/train': 1.490450143814087} 11/07/2021 09:55:09 - INFO - __main__ - Step 90016: {'lr': 0.00017671819890678142, 'samples': 17283072, 'steps': 90015, 'loss/train': 1.8470923900604248} 11/07/2021 09:55:09 - INFO - __main__ - Step 90017: {'lr': 0.0001767131252859145, 'samples': 17283264, 'steps': 90016, 'loss/train': 1.6236997842788696} 11/07/2021 09:55:10 - INFO - __main__ - Step 90018: {'lr': 0.00017670805169806957, 'samples': 17283456, 'steps': 90017, 'loss/train': 1.5081210136413574} 11/07/2021 09:55:10 - INFO - __main__ - Step 90019: {'lr': 0.00017670297814324887, 'samples': 17283648, 'steps': 90018, 'loss/train': 1.6248501539230347} 11/07/2021 09:55:11 - INFO - __main__ - Step 90020: {'lr': 0.00017669790462145464, 'samples': 17283840, 'steps': 90019, 'loss/train': 1.892830729484558} 11/07/2021 09:55:12 - INFO - __main__ - Step 90021: {'lr': 0.00017669283113268917, 'samples': 17284032, 'steps': 90020, 'loss/train': 1.5775182247161865} 11/07/2021 09:55:12 - INFO - __main__ - Step 90022: {'lr': 0.00017668775767695487, 'samples': 17284224, 'steps': 90021, 'loss/train': 1.5647639036178589} 11/07/2021 09:55:12 - INFO - __main__ - Step 90023: {'lr': 0.00017668268425425384, 'samples': 17284416, 'steps': 90022, 'loss/train': 1.334678053855896} 11/07/2021 09:55:13 - INFO - __main__ - Step 90024: {'lr': 0.0001766776108645885, 'samples': 17284608, 'steps': 90023, 'loss/train': 0.2979755699634552} 11/07/2021 09:55:14 - INFO - __main__ - Step 90025: {'lr': 0.00017667253750796108, 'samples': 17284800, 'steps': 90024, 'loss/train': 1.3239326477050781} 11/07/2021 09:55:14 - INFO - __main__ - Step 90026: {'lr': 0.00017666746418437392, 'samples': 17284992, 'steps': 90025, 'loss/train': 1.3274627923965454} 11/07/2021 09:55:14 - INFO - __main__ - Step 90027: {'lr': 0.00017666239089382925, 'samples': 17285184, 'steps': 90026, 'loss/train': 1.4180124998092651} 11/07/2021 09:55:15 - INFO - __main__ - Step 90028: {'lr': 0.00017665731763632933, 'samples': 17285376, 'steps': 90027, 'loss/train': 1.6094468832015991} 11/07/2021 09:55:15 - INFO - __main__ - Step 90029: {'lr': 0.00017665224441187655, 'samples': 17285568, 'steps': 90028, 'loss/train': 1.113959789276123} 11/07/2021 09:55:16 - INFO - __main__ - Step 90030: {'lr': 0.00017664717122047307, 'samples': 17285760, 'steps': 90029, 'loss/train': 1.2036980390548706} 11/07/2021 09:55:16 - INFO - __main__ - Step 90031: {'lr': 0.00017664209806212138, 'samples': 17285952, 'steps': 90030, 'loss/train': 0.8211650252342224} 11/07/2021 09:55:17 - INFO - __main__ - Step 90032: {'lr': 0.00017663702493682352, 'samples': 17286144, 'steps': 90031, 'loss/train': 1.4152696132659912} 11/07/2021 09:55:17 - INFO - __main__ - Step 90033: {'lr': 0.00017663195184458195, 'samples': 17286336, 'steps': 90032, 'loss/train': 0.931626558303833} 11/07/2021 09:55:17 - INFO - __main__ - Step 90034: {'lr': 0.00017662687878539885, 'samples': 17286528, 'steps': 90033, 'loss/train': 1.3146265745162964} 11/07/2021 09:55:18 - INFO - __main__ - Step 90035: {'lr': 0.0001766218057592765, 'samples': 17286720, 'steps': 90034, 'loss/train': 1.579572319984436} 11/07/2021 09:55:19 - INFO - __main__ - Step 90036: {'lr': 0.0001766167327662173, 'samples': 17286912, 'steps': 90035, 'loss/train': 1.6519219875335693} 11/07/2021 09:55:19 - INFO - __main__ - Step 90037: {'lr': 0.0001766116598062234, 'samples': 17287104, 'steps': 90036, 'loss/train': 1.3313623666763306} 11/07/2021 09:55:19 - INFO - __main__ - Step 90038: {'lr': 0.00017660658687929722, 'samples': 17287296, 'steps': 90037, 'loss/train': 1.7477999925613403} 11/07/2021 09:55:20 - INFO - __main__ - Step 90039: {'lr': 0.00017660151398544093, 'samples': 17287488, 'steps': 90038, 'loss/train': 1.643019676208496} 11/07/2021 09:55:21 - INFO - __main__ - Step 90040: {'lr': 0.0001765964411246569, 'samples': 17287680, 'steps': 90039, 'loss/train': 1.4119353294372559} 11/07/2021 09:55:21 - INFO - __main__ - Step 90041: {'lr': 0.00017659136829694736, 'samples': 17287872, 'steps': 90040, 'loss/train': 0.9624133110046387} 11/07/2021 09:55:22 - INFO - __main__ - Step 90042: {'lr': 0.00017658629550231463, 'samples': 17288064, 'steps': 90041, 'loss/train': 1.6811933517456055} 11/07/2021 09:55:22 - INFO - __main__ - Step 90043: {'lr': 0.00017658122274076093, 'samples': 17288256, 'steps': 90042, 'loss/train': 1.1279206275939941} 11/07/2021 09:55:22 - INFO - __main__ - Step 90044: {'lr': 0.00017657615001228865, 'samples': 17288448, 'steps': 90043, 'loss/train': 0.8587880730628967} 11/07/2021 09:55:23 - INFO - __main__ - Step 90045: {'lr': 0.00017657107731690013, 'samples': 17288640, 'steps': 90044, 'loss/train': 1.0703539848327637} 11/07/2021 09:55:24 - INFO - __main__ - Step 90046: {'lr': 0.00017656600465459744, 'samples': 17288832, 'steps': 90045, 'loss/train': 0.8904002904891968} 11/07/2021 09:55:24 - INFO - __main__ - Step 90047: {'lr': 0.00017656093202538298, 'samples': 17289024, 'steps': 90046, 'loss/train': 0.6289442181587219} 11/07/2021 09:55:24 - INFO - __main__ - Step 90048: {'lr': 0.000176555859429259, 'samples': 17289216, 'steps': 90047, 'loss/train': 1.2465354204177856} 11/07/2021 09:55:25 - INFO - __main__ - Step 90049: {'lr': 0.00017655078686622784, 'samples': 17289408, 'steps': 90048, 'loss/train': 1.2909373044967651} 11/07/2021 09:55:25 - INFO - __main__ - Step 90050: {'lr': 0.00017654571433629176, 'samples': 17289600, 'steps': 90049, 'loss/train': 1.5268011093139648} 11/07/2021 09:55:26 - INFO - __main__ - Step 90051: {'lr': 0.00017654064183945307, 'samples': 17289792, 'steps': 90050, 'loss/train': 1.281406044960022} 11/07/2021 09:55:26 - INFO - __main__ - Step 90052: {'lr': 0.000176535569375714, 'samples': 17289984, 'steps': 90051, 'loss/train': 1.5489498376846313} 11/07/2021 09:55:27 - INFO - __main__ - Step 90053: {'lr': 0.00017653049694507688, 'samples': 17290176, 'steps': 90052, 'loss/train': 0.7295044660568237} 11/07/2021 09:55:27 - INFO - __main__ - Step 90054: {'lr': 0.00017652542454754398, 'samples': 17290368, 'steps': 90053, 'loss/train': 1.493564486503601} 11/07/2021 09:55:27 - INFO - __main__ - Step 90055: {'lr': 0.00017652035218311757, 'samples': 17290560, 'steps': 90054, 'loss/train': 1.0592617988586426} 11/07/2021 09:55:29 - INFO - __main__ - Step 90056: {'lr': 0.0001765152798518, 'samples': 17290752, 'steps': 90055, 'loss/train': 1.351973295211792} 11/07/2021 09:55:30 - INFO - __main__ - Step 90057: {'lr': 0.00017651020755359348, 'samples': 17290944, 'steps': 90056, 'loss/train': 0.37558385729789734} 11/07/2021 09:55:30 - INFO - __main__ - Step 90058: {'lr': 0.00017650513528850043, 'samples': 17291136, 'steps': 90057, 'loss/train': 0.29469892382621765} 11/07/2021 09:55:30 - INFO - __main__ - Step 90059: {'lr': 0.00017650006305652293, 'samples': 17291328, 'steps': 90058, 'loss/train': 0.359389990568161} 11/07/2021 09:55:31 - INFO - __main__ - Step 90060: {'lr': 0.0001764949908576634, 'samples': 17291520, 'steps': 90059, 'loss/train': 1.3961833715438843} 11/07/2021 09:55:31 - INFO - __main__ - Step 90061: {'lr': 0.00017648991869192405, 'samples': 17291712, 'steps': 90060, 'loss/train': 1.3485932350158691} 11/07/2021 09:55:31 - INFO - __main__ - Step 90062: {'lr': 0.00017648484655930725, 'samples': 17291904, 'steps': 90061, 'loss/train': 1.1105648279190063} 11/07/2021 09:55:32 - INFO - __main__ - Step 90063: {'lr': 0.00017647977445981524, 'samples': 17292096, 'steps': 90062, 'loss/train': 1.2300174236297607} 11/07/2021 09:55:33 - INFO - __main__ - Step 90064: {'lr': 0.00017647470239345026, 'samples': 17292288, 'steps': 90063, 'loss/train': 1.517451524734497} 11/07/2021 09:55:33 - INFO - __main__ - Step 90065: {'lr': 0.0001764696303602147, 'samples': 17292480, 'steps': 90064, 'loss/train': 1.4328720569610596} 11/07/2021 09:55:33 - INFO - __main__ - Step 90066: {'lr': 0.0001764645583601107, 'samples': 17292672, 'steps': 90065, 'loss/train': 1.5910234451293945} 11/07/2021 09:55:34 - INFO - __main__ - Step 90067: {'lr': 0.00017645948639314076, 'samples': 17292864, 'steps': 90066, 'loss/train': 1.219312310218811} 11/07/2021 09:55:35 - INFO - __main__ - Step 90068: {'lr': 0.00017645441445930692, 'samples': 17293056, 'steps': 90067, 'loss/train': 1.1768684387207031} 11/07/2021 09:55:35 - INFO - __main__ - Step 90069: {'lr': 0.00017644934255861168, 'samples': 17293248, 'steps': 90068, 'loss/train': 1.3230226039886475} 11/07/2021 09:55:36 - INFO - __main__ - Step 90070: {'lr': 0.00017644427069105718, 'samples': 17293440, 'steps': 90069, 'loss/train': 1.0643234252929688} 11/07/2021 09:55:36 - INFO - __main__ - Step 90071: {'lr': 0.00017643919885664588, 'samples': 17293632, 'steps': 90070, 'loss/train': 0.8817723989486694} 11/07/2021 09:55:36 - INFO - __main__ - Step 90072: {'lr': 0.00017643412705537986, 'samples': 17293824, 'steps': 90071, 'loss/train': 1.6993697881698608} 11/07/2021 09:55:37 - INFO - __main__ - Step 90073: {'lr': 0.00017642905528726145, 'samples': 17294016, 'steps': 90072, 'loss/train': 1.4457513093948364} 11/07/2021 09:55:38 - INFO - __main__ - Step 90074: {'lr': 0.000176423983552293, 'samples': 17294208, 'steps': 90073, 'loss/train': 1.4808530807495117} 11/07/2021 09:55:38 - INFO - __main__ - Step 90075: {'lr': 0.00017641891185047674, 'samples': 17294400, 'steps': 90074, 'loss/train': 0.5237008333206177} 11/07/2021 09:55:38 - INFO - __main__ - Step 90076: {'lr': 0.000176413840181815, 'samples': 17294592, 'steps': 90075, 'loss/train': 1.6907234191894531} 11/07/2021 09:55:39 - INFO - __main__ - Step 90077: {'lr': 0.00017640876854631006, 'samples': 17294784, 'steps': 90076, 'loss/train': 1.2666915655136108} 11/07/2021 09:55:39 - INFO - __main__ - Step 90078: {'lr': 0.00017640369694396413, 'samples': 17294976, 'steps': 90077, 'loss/train': 1.3708593845367432} 11/07/2021 09:55:40 - INFO - __main__ - Step 90079: {'lr': 0.00017639862537477963, 'samples': 17295168, 'steps': 90078, 'loss/train': 1.2718443870544434} 11/07/2021 09:55:41 - INFO - __main__ - Step 90080: {'lr': 0.00017639355383875874, 'samples': 17295360, 'steps': 90079, 'loss/train': 1.817335844039917} 11/07/2021 09:55:41 - INFO - __main__ - Step 90081: {'lr': 0.00017638848233590378, 'samples': 17295552, 'steps': 90080, 'loss/train': 1.592172384262085} 11/07/2021 09:55:41 - INFO - __main__ - Step 90082: {'lr': 0.00017638341086621706, 'samples': 17295744, 'steps': 90081, 'loss/train': 0.4502171576023102} 11/07/2021 09:55:42 - INFO - __main__ - Step 90083: {'lr': 0.00017637833942970083, 'samples': 17295936, 'steps': 90082, 'loss/train': 1.2231065034866333} 11/07/2021 09:55:43 - INFO - __main__ - Step 90084: {'lr': 0.00017637326802635736, 'samples': 17296128, 'steps': 90083, 'loss/train': 0.9117428064346313} 11/07/2021 09:55:43 - INFO - __main__ - Step 90085: {'lr': 0.00017636819665618907, 'samples': 17296320, 'steps': 90084, 'loss/train': 1.0981277227401733} 11/07/2021 09:55:44 - INFO - __main__ - Step 90086: {'lr': 0.00017636312531919804, 'samples': 17296512, 'steps': 90085, 'loss/train': 2.072716474533081} 11/07/2021 09:55:44 - INFO - __main__ - Step 90087: {'lr': 0.00017635805401538667, 'samples': 17296704, 'steps': 90086, 'loss/train': 1.596861481666565} 11/07/2021 09:55:44 - INFO - __main__ - Step 90088: {'lr': 0.0001763529827447572, 'samples': 17296896, 'steps': 90087, 'loss/train': 1.6324477195739746} 11/07/2021 09:55:45 - INFO - __main__ - Step 90089: {'lr': 0.00017634791150731194, 'samples': 17297088, 'steps': 90088, 'loss/train': 0.3979319930076599} 11/07/2021 09:55:46 - INFO - __main__ - Step 90090: {'lr': 0.00017634284030305317, 'samples': 17297280, 'steps': 90089, 'loss/train': 1.8565726280212402} 11/07/2021 09:55:46 - INFO - __main__ - Step 90091: {'lr': 0.0001763377691319832, 'samples': 17297472, 'steps': 90090, 'loss/train': 1.636279582977295} 11/07/2021 09:55:46 - INFO - __main__ - Step 90092: {'lr': 0.00017633269799410427, 'samples': 17297664, 'steps': 90091, 'loss/train': 0.8317313194274902} 11/07/2021 09:55:47 - INFO - __main__ - Step 90093: {'lr': 0.0001763276268894187, 'samples': 17297856, 'steps': 90092, 'loss/train': 1.456439733505249} 11/07/2021 09:55:47 - INFO - __main__ - Step 90094: {'lr': 0.0001763225558179288, 'samples': 17298048, 'steps': 90093, 'loss/train': 1.4709982872009277} 11/07/2021 09:55:48 - INFO - __main__ - Step 90095: {'lr': 0.00017631748477963673, 'samples': 17298240, 'steps': 90094, 'loss/train': 1.1399166584014893} 11/07/2021 09:55:48 - INFO - __main__ - Step 90096: {'lr': 0.00017631241377454493, 'samples': 17298432, 'steps': 90095, 'loss/train': 1.4541106224060059} 11/07/2021 09:55:49 - INFO - __main__ - Step 90097: {'lr': 0.0001763073428026556, 'samples': 17298624, 'steps': 90096, 'loss/train': 1.1838147640228271} 11/07/2021 09:55:49 - INFO - __main__ - Step 90098: {'lr': 0.00017630227186397118, 'samples': 17298816, 'steps': 90097, 'loss/train': 1.3723665475845337} 11/07/2021 09:55:49 - INFO - __main__ - Step 90099: {'lr': 0.00017629720095849367, 'samples': 17299008, 'steps': 90098, 'loss/train': 1.1545395851135254} 11/07/2021 09:55:50 - INFO - __main__ - Step 90100: {'lr': 0.00017629213008622552, 'samples': 17299200, 'steps': 90099, 'loss/train': 1.037042260169983} 11/07/2021 09:55:51 - INFO - __main__ - Step 90101: {'lr': 0.00017628705924716903, 'samples': 17299392, 'steps': 90100, 'loss/train': 1.5633611679077148} 11/07/2021 09:55:51 - INFO - __main__ - Step 90102: {'lr': 0.00017628198844132643, 'samples': 17299584, 'steps': 90101, 'loss/train': 0.9917369484901428} 11/07/2021 09:55:52 - INFO - __main__ - Step 90103: {'lr': 0.0001762769176687, 'samples': 17299776, 'steps': 90102, 'loss/train': 0.9943256378173828} 11/07/2021 09:55:52 - INFO - __main__ - Step 90104: {'lr': 0.0001762718469292921, 'samples': 17299968, 'steps': 90103, 'loss/train': 1.002510666847229} 11/07/2021 09:55:53 - INFO - __main__ - Step 90105: {'lr': 0.00017626677622310495, 'samples': 17300160, 'steps': 90104, 'loss/train': 1.220017433166504} 11/07/2021 09:55:53 - INFO - __main__ - Step 90106: {'lr': 0.0001762617055501408, 'samples': 17300352, 'steps': 90105, 'loss/train': 1.1733169555664062} 11/07/2021 09:55:54 - INFO - __main__ - Step 90107: {'lr': 0.00017625663491040205, 'samples': 17300544, 'steps': 90106, 'loss/train': 1.5214450359344482} 11/07/2021 09:55:54 - INFO - __main__ - Step 90108: {'lr': 0.00017625156430389093, 'samples': 17300736, 'steps': 90107, 'loss/train': 1.339103102684021} 11/07/2021 09:55:54 - INFO - __main__ - Step 90109: {'lr': 0.0001762464937306097, 'samples': 17300928, 'steps': 90108, 'loss/train': 1.4517872333526611} 11/07/2021 09:55:55 - INFO - __main__ - Step 90110: {'lr': 0.00017624142319056066, 'samples': 17301120, 'steps': 90109, 'loss/train': 1.476278305053711} 11/07/2021 09:55:56 - INFO - __main__ - Step 90111: {'lr': 0.0001762363526837461, 'samples': 17301312, 'steps': 90110, 'loss/train': 1.3221408128738403} 11/07/2021 09:55:56 - INFO - __main__ - Step 90112: {'lr': 0.0001762312822101684, 'samples': 17301504, 'steps': 90111, 'loss/train': 1.6517564058303833} 11/07/2021 09:55:56 - INFO - __main__ - Step 90113: {'lr': 0.00017622621176982965, 'samples': 17301696, 'steps': 90112, 'loss/train': 1.529111385345459} 11/07/2021 09:55:57 - INFO - __main__ - Step 90114: {'lr': 0.0001762211413627322, 'samples': 17301888, 'steps': 90113, 'loss/train': 1.4315451383590698} 11/07/2021 09:55:57 - INFO - __main__ - Step 90115: {'lr': 0.0001762160709888784, 'samples': 17302080, 'steps': 90114, 'loss/train': 0.9452076554298401} 11/07/2021 09:55:58 - INFO - __main__ - Step 90116: {'lr': 0.0001762110006482705, 'samples': 17302272, 'steps': 90115, 'loss/train': 2.0243170261383057} 11/07/2021 09:55:58 - INFO - __main__ - Step 90117: {'lr': 0.00017620593034091075, 'samples': 17302464, 'steps': 90116, 'loss/train': 1.3771089315414429} 11/07/2021 09:55:59 - INFO - __main__ - Step 90118: {'lr': 0.0001762008600668015, 'samples': 17302656, 'steps': 90117, 'loss/train': 1.0181347131729126} 11/07/2021 09:55:59 - INFO - __main__ - Step 90119: {'lr': 0.000176195789825945, 'samples': 17302848, 'steps': 90118, 'loss/train': 1.3537747859954834} 11/07/2021 09:55:59 - INFO - __main__ - Step 90120: {'lr': 0.00017619071961834354, 'samples': 17303040, 'steps': 90119, 'loss/train': 0.9212965965270996} 11/07/2021 09:56:01 - INFO - __main__ - Step 90121: {'lr': 0.0001761856494439994, 'samples': 17303232, 'steps': 90120, 'loss/train': 0.7982496619224548} 11/07/2021 09:56:01 - INFO - __main__ - Step 90122: {'lr': 0.00017618057930291487, 'samples': 17303424, 'steps': 90121, 'loss/train': 1.29148268699646} 11/07/2021 09:56:01 - INFO - __main__ - Step 90123: {'lr': 0.00017617550919509227, 'samples': 17303616, 'steps': 90122, 'loss/train': 1.0870076417922974} 11/07/2021 09:56:02 - INFO - __main__ - Step 90124: {'lr': 0.0001761704391205338, 'samples': 17303808, 'steps': 90123, 'loss/train': 1.5535056591033936} 11/07/2021 09:56:02 - INFO - __main__ - Step 90125: {'lr': 0.00017616536907924185, 'samples': 17304000, 'steps': 90124, 'loss/train': 1.460890531539917} 11/07/2021 09:56:02 - INFO - __main__ - Step 90126: {'lr': 0.00017616029907121858, 'samples': 17304192, 'steps': 90125, 'loss/train': 1.5014535188674927} 11/07/2021 09:56:04 - INFO - __main__ - Step 90127: {'lr': 0.00017615522909646638, 'samples': 17304384, 'steps': 90126, 'loss/train': 1.63656747341156} 11/07/2021 09:56:04 - INFO - __main__ - Step 90128: {'lr': 0.00017615015915498745, 'samples': 17304576, 'steps': 90127, 'loss/train': 1.5392345190048218} 11/07/2021 09:56:04 - INFO - __main__ - Step 90129: {'lr': 0.00017614508924678412, 'samples': 17304768, 'steps': 90128, 'loss/train': 1.2930480241775513} 11/07/2021 09:56:05 - INFO - __main__ - Step 90130: {'lr': 0.0001761400193718587, 'samples': 17304960, 'steps': 90129, 'loss/train': 1.5198249816894531} 11/07/2021 09:56:05 - INFO - __main__ - Step 90131: {'lr': 0.00017613494953021343, 'samples': 17305152, 'steps': 90130, 'loss/train': 1.7896907329559326} 11/07/2021 09:56:05 - INFO - __main__ - Step 90132: {'lr': 0.00017612987972185056, 'samples': 17305344, 'steps': 90131, 'loss/train': 0.9457494020462036} 11/07/2021 09:56:06 - INFO - __main__ - Step 90133: {'lr': 0.00017612480994677252, 'samples': 17305536, 'steps': 90132, 'loss/train': 1.7266111373901367} 11/07/2021 09:56:07 - INFO - __main__ - Step 90134: {'lr': 0.0001761197402049815, 'samples': 17305728, 'steps': 90133, 'loss/train': 1.3889011144638062} 11/07/2021 09:56:07 - INFO - __main__ - Step 90135: {'lr': 0.00017611467049647976, 'samples': 17305920, 'steps': 90134, 'loss/train': 1.798237681388855} 11/07/2021 09:56:07 - INFO - __main__ - Step 90136: {'lr': 0.00017610960082126958, 'samples': 17306112, 'steps': 90135, 'loss/train': 1.7871013879776} 11/07/2021 09:56:08 - INFO - __main__ - Step 90137: {'lr': 0.0001761045311793533, 'samples': 17306304, 'steps': 90136, 'loss/train': 1.16188645362854} 11/07/2021 09:56:09 - INFO - __main__ - Step 90138: {'lr': 0.00017609946157073314, 'samples': 17306496, 'steps': 90137, 'loss/train': 1.1124366521835327} 11/07/2021 09:56:09 - INFO - __main__ - Step 90139: {'lr': 0.0001760943919954115, 'samples': 17306688, 'steps': 90138, 'loss/train': 1.5807417631149292} 11/07/2021 09:56:10 - INFO - __main__ - Step 90140: {'lr': 0.00017608932245339055, 'samples': 17306880, 'steps': 90139, 'loss/train': 1.5287903547286987} 11/07/2021 09:56:10 - INFO - __main__ - Step 90141: {'lr': 0.00017608425294467263, 'samples': 17307072, 'steps': 90140, 'loss/train': 0.6502556204795837} 11/07/2021 09:56:10 - INFO - __main__ - Step 90142: {'lr': 0.00017607918346925993, 'samples': 17307264, 'steps': 90141, 'loss/train': 1.3463724851608276} 11/07/2021 09:56:11 - INFO - __main__ - Step 90143: {'lr': 0.00017607411402715487, 'samples': 17307456, 'steps': 90142, 'loss/train': 1.473525047302246} 11/07/2021 09:56:12 - INFO - __main__ - Step 90144: {'lr': 0.00017606904461835965, 'samples': 17307648, 'steps': 90143, 'loss/train': 1.4070707559585571} 11/07/2021 09:56:12 - INFO - __main__ - Step 90145: {'lr': 0.00017606397524287665, 'samples': 17307840, 'steps': 90144, 'loss/train': 1.6768218278884888} 11/07/2021 09:56:12 - INFO - __main__ - Step 90146: {'lr': 0.000176058905900708, 'samples': 17308032, 'steps': 90145, 'loss/train': 1.3653502464294434} 11/07/2021 09:56:13 - INFO - __main__ - Step 90147: {'lr': 0.00017605383659185608, 'samples': 17308224, 'steps': 90146, 'loss/train': 1.696558952331543} 11/07/2021 09:56:13 - INFO - __main__ - Step 90148: {'lr': 0.00017604876731632316, 'samples': 17308416, 'steps': 90147, 'loss/train': 1.1041185855865479} 11/07/2021 09:56:14 - INFO - __main__ - Step 90149: {'lr': 0.00017604369807411153, 'samples': 17308608, 'steps': 90148, 'loss/train': 1.4957091808319092} 11/07/2021 09:56:14 - INFO - __main__ - Step 90150: {'lr': 0.00017603862886522346, 'samples': 17308800, 'steps': 90149, 'loss/train': 1.5078458786010742} 11/07/2021 09:56:15 - INFO - __main__ - Step 90151: {'lr': 0.00017603355968966123, 'samples': 17308992, 'steps': 90150, 'loss/train': 1.870834231376648} 11/07/2021 09:56:15 - INFO - __main__ - Step 90152: {'lr': 0.0001760284905474272, 'samples': 17309184, 'steps': 90151, 'loss/train': 1.411266565322876} 11/07/2021 09:56:15 - INFO - __main__ - Step 90153: {'lr': 0.00017602342143852357, 'samples': 17309376, 'steps': 90152, 'loss/train': 1.3566293716430664} 11/07/2021 09:56:17 - INFO - __main__ - Step 90154: {'lr': 0.0001760183523629526, 'samples': 17309568, 'steps': 90153, 'loss/train': 1.6325805187225342} 11/07/2021 09:56:17 - INFO - __main__ - Step 90155: {'lr': 0.00017601328332071664, 'samples': 17309760, 'steps': 90154, 'loss/train': 1.6293152570724487} 11/07/2021 09:56:17 - INFO - __main__ - Step 90156: {'lr': 0.000176008214311818, 'samples': 17309952, 'steps': 90155, 'loss/train': 1.3730268478393555} 11/07/2021 09:56:18 - INFO - __main__ - Step 90157: {'lr': 0.00017600314533625889, 'samples': 17310144, 'steps': 90156, 'loss/train': 1.1602576971054077} 11/07/2021 09:56:18 - INFO - __main__ - Step 90158: {'lr': 0.00017599807639404158, 'samples': 17310336, 'steps': 90157, 'loss/train': 1.6435303688049316} 11/07/2021 09:56:19 - INFO - __main__ - Step 90159: {'lr': 0.0001759930074851684, 'samples': 17310528, 'steps': 90158, 'loss/train': 0.8723829388618469} 11/07/2021 09:56:19 - INFO - __main__ - Step 90160: {'lr': 0.00017598793860964165, 'samples': 17310720, 'steps': 90159, 'loss/train': 1.446857213973999} 11/07/2021 09:56:20 - INFO - __main__ - Step 90161: {'lr': 0.00017598286976746357, 'samples': 17310912, 'steps': 90160, 'loss/train': 2.1841936111450195} 11/07/2021 09:56:20 - INFO - __main__ - Step 90162: {'lr': 0.0001759778009586365, 'samples': 17311104, 'steps': 90161, 'loss/train': 1.3883213996887207} 11/07/2021 09:56:20 - INFO - __main__ - Step 90163: {'lr': 0.00017597273218316267, 'samples': 17311296, 'steps': 90162, 'loss/train': 1.5840232372283936} 11/07/2021 09:56:22 - INFO - __main__ - Step 90164: {'lr': 0.00017596766344104436, 'samples': 17311488, 'steps': 90163, 'loss/train': 1.5206023454666138} 11/07/2021 09:56:22 - INFO - __main__ - Step 90165: {'lr': 0.00017596259473228392, 'samples': 17311680, 'steps': 90164, 'loss/train': 1.7000513076782227} 11/07/2021 09:56:22 - INFO - __main__ - Step 90166: {'lr': 0.00017595752605688365, 'samples': 17311872, 'steps': 90165, 'loss/train': 1.2203654050827026} 11/07/2021 09:56:23 - INFO - __main__ - Step 90167: {'lr': 0.00017595245741484572, 'samples': 17312064, 'steps': 90166, 'loss/train': 1.5678819417953491} 11/07/2021 09:56:23 - INFO - __main__ - Step 90168: {'lr': 0.00017594738880617245, 'samples': 17312256, 'steps': 90167, 'loss/train': 1.4717625379562378} 11/07/2021 09:56:24 - INFO - __main__ - Step 90169: {'lr': 0.00017594232023086616, 'samples': 17312448, 'steps': 90168, 'loss/train': 1.5267140865325928} 11/07/2021 09:56:24 - INFO - __main__ - Step 90170: {'lr': 0.0001759372516889291, 'samples': 17312640, 'steps': 90169, 'loss/train': 1.1081174612045288} 11/07/2021 09:56:25 - INFO - __main__ - Step 90171: {'lr': 0.00017593218318036357, 'samples': 17312832, 'steps': 90170, 'loss/train': 0.7851414680480957} 11/07/2021 09:56:25 - INFO - __main__ - Step 90172: {'lr': 0.00017592711470517186, 'samples': 17313024, 'steps': 90171, 'loss/train': 1.550161361694336} 11/07/2021 09:56:25 - INFO - __main__ - Step 90173: {'lr': 0.00017592204626335628, 'samples': 17313216, 'steps': 90172, 'loss/train': 0.6878520846366882} 11/07/2021 09:56:26 - INFO - __main__ - Step 90174: {'lr': 0.00017591697785491905, 'samples': 17313408, 'steps': 90173, 'loss/train': 1.3696954250335693} 11/07/2021 09:56:27 - INFO - __main__ - Step 90175: {'lr': 0.00017591190947986246, 'samples': 17313600, 'steps': 90174, 'loss/train': 1.4490636587142944} 11/07/2021 09:56:27 - INFO - __main__ - Step 90176: {'lr': 0.00017590684113818886, 'samples': 17313792, 'steps': 90175, 'loss/train': 1.3729628324508667} 11/07/2021 09:56:27 - INFO - __main__ - Step 90177: {'lr': 0.0001759017728299005, 'samples': 17313984, 'steps': 90176, 'loss/train': 1.5498930215835571} 11/07/2021 09:56:28 - INFO - __main__ - Step 90178: {'lr': 0.0001758967045549996, 'samples': 17314176, 'steps': 90177, 'loss/train': 1.203079104423523} 11/07/2021 09:56:28 - INFO - __main__ - Step 90179: {'lr': 0.0001758916363134887, 'samples': 17314368, 'steps': 90178, 'loss/train': 1.338674545288086} 11/07/2021 09:56:29 - INFO - __main__ - Step 90180: {'lr': 0.0001758865681053697, 'samples': 17314560, 'steps': 90179, 'loss/train': 1.8970378637313843} 11/07/2021 09:56:30 - INFO - __main__ - Step 90181: {'lr': 0.0001758814999306451, 'samples': 17314752, 'steps': 90180, 'loss/train': 1.502379298210144} 11/07/2021 09:56:30 - INFO - __main__ - Step 90182: {'lr': 0.00017587643178931716, 'samples': 17314944, 'steps': 90181, 'loss/train': 1.2068265676498413} 11/07/2021 09:56:30 - INFO - __main__ - Step 90183: {'lr': 0.00017587136368138812, 'samples': 17315136, 'steps': 90182, 'loss/train': 1.7945243120193481} 11/07/2021 09:56:31 - INFO - __main__ - Step 90184: {'lr': 0.00017586629560686036, 'samples': 17315328, 'steps': 90183, 'loss/train': 1.3957535028457642} 11/07/2021 09:56:32 - INFO - __main__ - Step 90185: {'lr': 0.00017586122756573606, 'samples': 17315520, 'steps': 90184, 'loss/train': 1.5707578659057617} 11/07/2021 09:56:32 - INFO - __main__ - Step 90186: {'lr': 0.00017585615955801755, 'samples': 17315712, 'steps': 90185, 'loss/train': 1.6561596393585205} 11/07/2021 09:56:32 - INFO - __main__ - Step 90187: {'lr': 0.0001758510915837071, 'samples': 17315904, 'steps': 90186, 'loss/train': 1.5436608791351318} 11/07/2021 09:56:33 - INFO - __main__ - Step 90188: {'lr': 0.00017584602364280704, 'samples': 17316096, 'steps': 90187, 'loss/train': 1.166961431503296} 11/07/2021 09:56:33 - INFO - __main__ - Step 90189: {'lr': 0.0001758409557353196, 'samples': 17316288, 'steps': 90188, 'loss/train': 1.5042508840560913} 11/07/2021 09:56:34 - INFO - __main__ - Step 90190: {'lr': 0.00017583588786124703, 'samples': 17316480, 'steps': 90189, 'loss/train': 1.1794757843017578} 11/07/2021 09:56:35 - INFO - __main__ - Step 90191: {'lr': 0.00017583082002059174, 'samples': 17316672, 'steps': 90190, 'loss/train': 1.7184993028640747} 11/07/2021 09:56:35 - INFO - __main__ - Step 90192: {'lr': 0.000175825752213356, 'samples': 17316864, 'steps': 90191, 'loss/train': 1.9656081199645996} 11/07/2021 09:56:35 - INFO - __main__ - Step 90193: {'lr': 0.00017582068443954197, 'samples': 17317056, 'steps': 90192, 'loss/train': 1.4800680875778198} 11/07/2021 09:56:36 - INFO - __main__ - Step 90194: {'lr': 0.00017581561669915196, 'samples': 17317248, 'steps': 90193, 'loss/train': 1.5845050811767578} 11/07/2021 09:56:37 - INFO - __main__ - Step 90195: {'lr': 0.00017581054899218828, 'samples': 17317440, 'steps': 90194, 'loss/train': 1.3630166053771973} 11/07/2021 09:56:37 - INFO - __main__ - Step 90196: {'lr': 0.00017580548131865327, 'samples': 17317632, 'steps': 90195, 'loss/train': 1.584800124168396} 11/07/2021 09:56:37 - INFO - __main__ - Step 90197: {'lr': 0.0001758004136785491, 'samples': 17317824, 'steps': 90196, 'loss/train': 1.0707660913467407} 11/07/2021 09:56:38 - INFO - __main__ - Step 90198: {'lr': 0.00017579534607187815, 'samples': 17318016, 'steps': 90197, 'loss/train': 1.3110543489456177} 11/07/2021 09:56:38 - INFO - __main__ - Step 90199: {'lr': 0.0001757902784986427, 'samples': 17318208, 'steps': 90198, 'loss/train': 1.2762537002563477} 11/07/2021 09:56:38 - INFO - __main__ - Step 90200: {'lr': 0.00017578521095884498, 'samples': 17318400, 'steps': 90199, 'loss/train': 1.245099663734436} 11/07/2021 09:56:39 - INFO - __main__ - Step 90201: {'lr': 0.00017578014345248728, 'samples': 17318592, 'steps': 90200, 'loss/train': 1.0265424251556396} 11/07/2021 09:56:40 - INFO - __main__ - Step 90202: {'lr': 0.00017577507597957192, 'samples': 17318784, 'steps': 90201, 'loss/train': 1.6832267045974731} 11/07/2021 09:56:40 - INFO - __main__ - Step 90203: {'lr': 0.00017577000854010117, 'samples': 17318976, 'steps': 90202, 'loss/train': 1.3514479398727417} 11/07/2021 09:56:40 - INFO - __main__ - Step 90204: {'lr': 0.00017576494113407732, 'samples': 17319168, 'steps': 90203, 'loss/train': 1.6025738716125488} 11/07/2021 09:56:41 - INFO - __main__ - Step 90205: {'lr': 0.0001757598737615026, 'samples': 17319360, 'steps': 90204, 'loss/train': 1.414191722869873} 11/07/2021 09:56:42 - INFO - __main__ - Step 90206: {'lr': 0.00017575480642237945, 'samples': 17319552, 'steps': 90205, 'loss/train': 1.2230381965637207} 11/07/2021 09:56:42 - INFO - __main__ - Step 90207: {'lr': 0.00017574973911670998, 'samples': 17319744, 'steps': 90206, 'loss/train': 1.090510606765747} 11/07/2021 09:56:43 - INFO - __main__ - Step 90208: {'lr': 0.0001757446718444965, 'samples': 17319936, 'steps': 90207, 'loss/train': 1.323973298072815} 11/07/2021 09:56:43 - INFO - __main__ - Step 90209: {'lr': 0.00017573960460574132, 'samples': 17320128, 'steps': 90208, 'loss/train': 1.5413892269134521} 11/07/2021 09:56:43 - INFO - __main__ - Step 90210: {'lr': 0.00017573453740044674, 'samples': 17320320, 'steps': 90209, 'loss/train': 1.5981525182724} 11/07/2021 09:56:44 - INFO - __main__ - Step 90211: {'lr': 0.000175729470228615, 'samples': 17320512, 'steps': 90210, 'loss/train': 1.4129854440689087} 11/07/2021 09:56:45 - INFO - __main__ - Step 90212: {'lr': 0.00017572440309024845, 'samples': 17320704, 'steps': 90211, 'loss/train': 1.507667899131775} 11/07/2021 09:56:45 - INFO - __main__ - Step 90213: {'lr': 0.00017571933598534934, 'samples': 17320896, 'steps': 90212, 'loss/train': 1.431936264038086} 11/07/2021 09:56:45 - INFO - __main__ - Step 90214: {'lr': 0.00017571426891391996, 'samples': 17321088, 'steps': 90213, 'loss/train': 1.0812321901321411} 11/07/2021 09:56:46 - INFO - __main__ - Step 90215: {'lr': 0.00017570920187596253, 'samples': 17321280, 'steps': 90214, 'loss/train': 1.4059900045394897} 11/07/2021 09:56:47 - INFO - __main__ - Step 90216: {'lr': 0.00017570413487147943, 'samples': 17321472, 'steps': 90215, 'loss/train': 1.3335273265838623} 11/07/2021 09:56:47 - INFO - __main__ - Step 90217: {'lr': 0.0001756990679004729, 'samples': 17321664, 'steps': 90216, 'loss/train': 0.2710247039794922} 11/07/2021 09:56:48 - INFO - __main__ - Step 90218: {'lr': 0.0001756940009629452, 'samples': 17321856, 'steps': 90217, 'loss/train': 0.3968145549297333} 11/07/2021 09:56:48 - INFO - __main__ - Step 90219: {'lr': 0.00017568893405889874, 'samples': 17322048, 'steps': 90218, 'loss/train': 1.306175947189331} 11/07/2021 09:56:48 - INFO - __main__ - Step 90220: {'lr': 0.00017568386718833562, 'samples': 17322240, 'steps': 90219, 'loss/train': 1.2865263223648071} 11/07/2021 09:56:49 - INFO - __main__ - Step 90221: {'lr': 0.00017567880035125822, 'samples': 17322432, 'steps': 90220, 'loss/train': 1.4018197059631348} 11/07/2021 09:56:50 - INFO - __main__ - Step 90222: {'lr': 0.00017567373354766876, 'samples': 17322624, 'steps': 90221, 'loss/train': 1.6858972311019897} 11/07/2021 09:56:50 - INFO - __main__ - Step 90223: {'lr': 0.0001756686667775696, 'samples': 17322816, 'steps': 90222, 'loss/train': 1.8826936483383179} 11/07/2021 09:56:50 - INFO - __main__ - Step 90224: {'lr': 0.00017566360004096296, 'samples': 17323008, 'steps': 90223, 'loss/train': 1.695499300956726} 11/07/2021 09:56:51 - INFO - __main__ - Step 90225: {'lr': 0.0001756585333378512, 'samples': 17323200, 'steps': 90224, 'loss/train': 1.175251841545105} 11/07/2021 09:56:52 - INFO - __main__ - Step 90226: {'lr': 0.0001756534666682365, 'samples': 17323392, 'steps': 90225, 'loss/train': 1.5601798295974731} 11/07/2021 09:56:52 - INFO - __main__ - Step 90227: {'lr': 0.00017564840003212123, 'samples': 17323584, 'steps': 90226, 'loss/train': 1.3913753032684326} 11/07/2021 09:56:52 - INFO - __main__ - Step 90228: {'lr': 0.00017564333342950768, 'samples': 17323776, 'steps': 90227, 'loss/train': 0.5665398836135864} 11/07/2021 09:56:53 - INFO - __main__ - Step 90229: {'lr': 0.00017563826686039805, 'samples': 17323968, 'steps': 90228, 'loss/train': 3.7488865852355957} 11/07/2021 09:56:53 - INFO - __main__ - Step 90230: {'lr': 0.0001756332003247947, 'samples': 17324160, 'steps': 90229, 'loss/train': 1.679335594177246} 11/07/2021 09:56:53 - INFO - __main__ - Step 90231: {'lr': 0.00017562813382269985, 'samples': 17324352, 'steps': 90230, 'loss/train': 0.8448482751846313} 11/07/2021 09:56:54 - INFO - __main__ - Step 90232: {'lr': 0.00017562306735411582, 'samples': 17324544, 'steps': 90231, 'loss/train': 1.4014198780059814} 11/07/2021 09:56:55 - INFO - __main__ - Step 90233: {'lr': 0.000175618000919045, 'samples': 17324736, 'steps': 90232, 'loss/train': 1.20517897605896} 11/07/2021 09:56:55 - INFO - __main__ - Step 90234: {'lr': 0.00017561293451748947, 'samples': 17324928, 'steps': 90233, 'loss/train': 0.8100371956825256} 11/07/2021 09:56:56 - INFO - __main__ - Step 90235: {'lr': 0.00017560786814945157, 'samples': 17325120, 'steps': 90234, 'loss/train': 0.9594841003417969} 11/07/2021 09:56:56 - INFO - __main__ - Step 90236: {'lr': 0.00017560280181493367, 'samples': 17325312, 'steps': 90235, 'loss/train': 1.6157974004745483} 11/07/2021 09:56:57 - INFO - __main__ - Step 90237: {'lr': 0.00017559773551393797, 'samples': 17325504, 'steps': 90236, 'loss/train': 1.560044288635254} 11/07/2021 09:56:57 - INFO - __main__ - Step 90238: {'lr': 0.00017559266924646678, 'samples': 17325696, 'steps': 90237, 'loss/train': 1.2662698030471802} 11/07/2021 09:56:58 - INFO - __main__ - Step 90239: {'lr': 0.00017558760301252235, 'samples': 17325888, 'steps': 90238, 'loss/train': 1.0528126955032349} 11/07/2021 09:56:58 - INFO - __main__ - Step 90240: {'lr': 0.00017558253681210705, 'samples': 17326080, 'steps': 90239, 'loss/train': 1.9297430515289307} 11/07/2021 09:56:58 - INFO - __main__ - Step 90241: {'lr': 0.0001755774706452231, 'samples': 17326272, 'steps': 90240, 'loss/train': 1.816274881362915} 11/07/2021 09:56:59 - INFO - __main__ - Step 90242: {'lr': 0.0001755724045118728, 'samples': 17326464, 'steps': 90241, 'loss/train': 1.5018489360809326} 11/07/2021 09:57:00 - INFO - __main__ - Step 90243: {'lr': 0.00017556733841205842, 'samples': 17326656, 'steps': 90242, 'loss/train': 0.16398733854293823} 11/07/2021 09:57:00 - INFO - __main__ - Step 90244: {'lr': 0.00017556227234578222, 'samples': 17326848, 'steps': 90243, 'loss/train': 1.7023297548294067} 11/07/2021 09:57:00 - INFO - __main__ - Step 90245: {'lr': 0.00017555720631304655, 'samples': 17327040, 'steps': 90244, 'loss/train': 1.3047287464141846} 11/07/2021 09:57:01 - INFO - __main__ - Step 90246: {'lr': 0.00017555214031385376, 'samples': 17327232, 'steps': 90245, 'loss/train': 1.3354196548461914} 11/07/2021 09:57:02 - INFO - __main__ - Step 90247: {'lr': 0.0001755470743482059, 'samples': 17327424, 'steps': 90246, 'loss/train': 1.4706718921661377} 11/07/2021 09:57:02 - INFO - __main__ - Step 90248: {'lr': 0.00017554200841610534, 'samples': 17327616, 'steps': 90247, 'loss/train': 1.1781448125839233} 11/07/2021 09:57:02 - INFO - __main__ - Step 90249: {'lr': 0.0001755369425175545, 'samples': 17327808, 'steps': 90248, 'loss/train': 1.4721684455871582} 11/07/2021 09:57:03 - INFO - __main__ - Step 90250: {'lr': 0.0001755318766525555, 'samples': 17328000, 'steps': 90249, 'loss/train': 1.2105389833450317} 11/07/2021 09:57:03 - INFO - __main__ - Step 90251: {'lr': 0.00017552681082111065, 'samples': 17328192, 'steps': 90250, 'loss/train': 1.5762008428573608} 11/07/2021 09:57:04 - INFO - __main__ - Step 90252: {'lr': 0.00017552174502322236, 'samples': 17328384, 'steps': 90251, 'loss/train': 1.0720388889312744} 11/07/2021 09:57:04 - INFO - __main__ - Step 90253: {'lr': 0.00017551667925889275, 'samples': 17328576, 'steps': 90252, 'loss/train': 1.2871826887130737} 11/07/2021 09:57:05 - INFO - __main__ - Step 90254: {'lr': 0.0001755116135281242, 'samples': 17328768, 'steps': 90253, 'loss/train': 1.3800259828567505} 11/07/2021 09:57:05 - INFO - __main__ - Step 90255: {'lr': 0.00017550654783091903, 'samples': 17328960, 'steps': 90254, 'loss/train': 0.3608902394771576} 11/07/2021 09:57:05 - INFO - __main__ - Step 90256: {'lr': 0.00017550148216727938, 'samples': 17329152, 'steps': 90255, 'loss/train': 1.2850432395935059} 11/07/2021 09:57:06 - INFO - __main__ - Step 90257: {'lr': 0.00017549641653720764, 'samples': 17329344, 'steps': 90256, 'loss/train': 1.3070130348205566} 11/07/2021 09:57:07 - INFO - __main__ - Step 90258: {'lr': 0.0001754913509407061, 'samples': 17329536, 'steps': 90257, 'loss/train': 1.4695466756820679} 11/07/2021 09:57:07 - INFO - __main__ - Step 90259: {'lr': 0.00017548628537777697, 'samples': 17329728, 'steps': 90258, 'loss/train': 1.4683438539505005} 11/07/2021 09:57:08 - INFO - __main__ - Step 90260: {'lr': 0.00017548121984842263, 'samples': 17329920, 'steps': 90259, 'loss/train': 1.5313488245010376} 11/07/2021 09:57:08 - INFO - __main__ - Step 90261: {'lr': 0.00017547615435264523, 'samples': 17330112, 'steps': 90260, 'loss/train': 1.5161832571029663} 11/07/2021 09:57:08 - INFO - __main__ - Step 90262: {'lr': 0.00017547108889044713, 'samples': 17330304, 'steps': 90261, 'loss/train': 1.3095216751098633} 11/07/2021 09:57:09 - INFO - __main__ - Step 90263: {'lr': 0.0001754660234618306, 'samples': 17330496, 'steps': 90262, 'loss/train': 1.9951725006103516} 11/07/2021 09:57:10 - INFO - __main__ - Step 90264: {'lr': 0.00017546095806679796, 'samples': 17330688, 'steps': 90263, 'loss/train': 0.9446544051170349} 11/07/2021 09:57:10 - INFO - __main__ - Step 90265: {'lr': 0.00017545589270535146, 'samples': 17330880, 'steps': 90264, 'loss/train': 1.4420415163040161} 11/07/2021 09:57:10 - INFO - __main__ - Step 90266: {'lr': 0.00017545082737749335, 'samples': 17331072, 'steps': 90265, 'loss/train': 1.41855788230896} 11/07/2021 09:57:11 - INFO - __main__ - Step 90267: {'lr': 0.000175445762083226, 'samples': 17331264, 'steps': 90266, 'loss/train': 1.1736962795257568} 11/07/2021 09:57:12 - INFO - __main__ - Step 90268: {'lr': 0.0001754406968225516, 'samples': 17331456, 'steps': 90267, 'loss/train': 1.2536239624023438} 11/07/2021 09:57:12 - INFO - __main__ - Step 90269: {'lr': 0.0001754356315954725, 'samples': 17331648, 'steps': 90268, 'loss/train': 1.3983874320983887} 11/07/2021 09:57:12 - INFO - __main__ - Step 90270: {'lr': 0.00017543056640199095, 'samples': 17331840, 'steps': 90269, 'loss/train': 0.9538955092430115} 11/07/2021 09:57:13 - INFO - __main__ - Step 90271: {'lr': 0.0001754255012421092, 'samples': 17332032, 'steps': 90270, 'loss/train': 1.433227300643921} 11/07/2021 09:57:13 - INFO - __main__ - Step 90272: {'lr': 0.0001754204361158296, 'samples': 17332224, 'steps': 90271, 'loss/train': 1.2064796686172485} 11/07/2021 09:57:14 - INFO - __main__ - Step 90273: {'lr': 0.00017541537102315442, 'samples': 17332416, 'steps': 90272, 'loss/train': 1.4958869218826294} 11/07/2021 09:57:15 - INFO - __main__ - Step 90274: {'lr': 0.0001754103059640859, 'samples': 17332608, 'steps': 90273, 'loss/train': 1.2761540412902832} 11/07/2021 09:57:15 - INFO - __main__ - Step 90275: {'lr': 0.00017540524093862631, 'samples': 17332800, 'steps': 90274, 'loss/train': 1.534940242767334} 11/07/2021 09:57:15 - INFO - __main__ - Step 90276: {'lr': 0.00017540017594677802, 'samples': 17332992, 'steps': 90275, 'loss/train': 1.3328523635864258} 11/07/2021 09:57:16 - INFO - __main__ - Step 90277: {'lr': 0.0001753951109885432, 'samples': 17333184, 'steps': 90276, 'loss/train': 1.4577233791351318} 11/07/2021 09:57:16 - INFO - __main__ - Step 90278: {'lr': 0.00017539004606392423, 'samples': 17333376, 'steps': 90277, 'loss/train': 1.3675587177276611} 11/07/2021 09:57:17 - INFO - __main__ - Step 90279: {'lr': 0.00017538498117292335, 'samples': 17333568, 'steps': 90278, 'loss/train': 0.8947303891181946} 11/07/2021 09:57:17 - INFO - __main__ - Step 90280: {'lr': 0.0001753799163155429, 'samples': 17333760, 'steps': 90279, 'loss/train': 1.7897931337356567} 11/07/2021 09:57:18 - INFO - __main__ - Step 90281: {'lr': 0.00017537485149178507, 'samples': 17333952, 'steps': 90280, 'loss/train': 0.6552925705909729} 11/07/2021 09:57:18 - INFO - __main__ - Step 90282: {'lr': 0.00017536978670165215, 'samples': 17334144, 'steps': 90281, 'loss/train': 0.9068219661712646} 11/07/2021 09:57:18 - INFO - __main__ - Step 90283: {'lr': 0.00017536472194514647, 'samples': 17334336, 'steps': 90282, 'loss/train': 0.9734132289886475} 11/07/2021 09:57:19 - INFO - __main__ - Step 90284: {'lr': 0.00017535965722227027, 'samples': 17334528, 'steps': 90283, 'loss/train': 1.337359070777893} 11/07/2021 09:57:20 - INFO - __main__ - Step 90285: {'lr': 0.0001753545925330259, 'samples': 17334720, 'steps': 90284, 'loss/train': 1.5465953350067139} 11/07/2021 09:57:20 - INFO - __main__ - Step 90286: {'lr': 0.00017534952787741554, 'samples': 17334912, 'steps': 90285, 'loss/train': 1.4914331436157227} 11/07/2021 09:57:21 - INFO - __main__ - Step 90287: {'lr': 0.00017534446325544162, 'samples': 17335104, 'steps': 90286, 'loss/train': 1.1116780042648315} 11/07/2021 09:57:21 - INFO - __main__ - Step 90288: {'lr': 0.0001753393986671063, 'samples': 17335296, 'steps': 90287, 'loss/train': 1.4891822338104248} 11/07/2021 09:57:22 - INFO - __main__ - Step 90289: {'lr': 0.00017533433411241186, 'samples': 17335488, 'steps': 90288, 'loss/train': 1.5317249298095703} 11/07/2021 09:57:22 - INFO - __main__ - Step 90290: {'lr': 0.00017532926959136063, 'samples': 17335680, 'steps': 90289, 'loss/train': 1.5819047689437866} 11/07/2021 09:57:23 - INFO - __main__ - Step 90291: {'lr': 0.0001753242051039549, 'samples': 17335872, 'steps': 90290, 'loss/train': 1.3597497940063477} 11/07/2021 09:57:23 - INFO - __main__ - Step 90292: {'lr': 0.00017531914065019693, 'samples': 17336064, 'steps': 90291, 'loss/train': 1.607634425163269} 11/07/2021 09:57:23 - INFO - __main__ - Step 90293: {'lr': 0.00017531407623008898, 'samples': 17336256, 'steps': 90292, 'loss/train': 1.352907419204712} 11/07/2021 09:57:24 - INFO - __main__ - Step 90294: {'lr': 0.00017530901184363337, 'samples': 17336448, 'steps': 90293, 'loss/train': 1.131343126296997} 11/07/2021 09:57:25 - INFO - __main__ - Step 90295: {'lr': 0.00017530394749083235, 'samples': 17336640, 'steps': 90294, 'loss/train': 1.3256367444992065} 11/07/2021 09:57:25 - INFO - __main__ - Step 90296: {'lr': 0.00017529888317168824, 'samples': 17336832, 'steps': 90295, 'loss/train': 1.5085687637329102} 11/07/2021 09:57:25 - INFO - __main__ - Step 90297: {'lr': 0.00017529381888620326, 'samples': 17337024, 'steps': 90296, 'loss/train': 0.12790869176387787} 11/07/2021 09:57:26 - INFO - __main__ - Step 90298: {'lr': 0.00017528875463437976, 'samples': 17337216, 'steps': 90297, 'loss/train': 1.394155740737915} 11/07/2021 09:57:26 - INFO - __main__ - Step 90299: {'lr': 0.00017528369041622, 'samples': 17337408, 'steps': 90298, 'loss/train': 0.9884231090545654} 11/07/2021 09:57:27 - INFO - __main__ - Step 90300: {'lr': 0.0001752786262317263, 'samples': 17337600, 'steps': 90299, 'loss/train': 1.4233869314193726} 11/07/2021 09:57:28 - INFO - __main__ - Step 90301: {'lr': 0.0001752735620809009, 'samples': 17337792, 'steps': 90300, 'loss/train': 1.3328205347061157} 11/07/2021 09:57:28 - INFO - __main__ - Step 90302: {'lr': 0.000175268497963746, 'samples': 17337984, 'steps': 90301, 'loss/train': 1.1419306993484497} 11/07/2021 09:57:28 - INFO - __main__ - Step 90303: {'lr': 0.000175263433880264, 'samples': 17338176, 'steps': 90302, 'loss/train': 1.1883231401443481} 11/07/2021 09:57:29 - INFO - __main__ - Step 90304: {'lr': 0.00017525836983045713, 'samples': 17338368, 'steps': 90303, 'loss/train': 2.3809871673583984} 11/07/2021 09:57:30 - INFO - __main__ - Step 90305: {'lr': 0.0001752533058143277, 'samples': 17338560, 'steps': 90304, 'loss/train': 1.2994886636734009} 11/07/2021 09:57:30 - INFO - __main__ - Step 90306: {'lr': 0.00017524824183187793, 'samples': 17338752, 'steps': 90305, 'loss/train': 1.3093584775924683} 11/07/2021 09:57:30 - INFO - __main__ - Step 90307: {'lr': 0.00017524317788311018, 'samples': 17338944, 'steps': 90306, 'loss/train': 1.7303836345672607} 11/07/2021 09:57:31 - INFO - __main__ - Step 90308: {'lr': 0.0001752381139680267, 'samples': 17339136, 'steps': 90307, 'loss/train': 1.6136924028396606} 11/07/2021 09:57:31 - INFO - __main__ - Step 90309: {'lr': 0.00017523305008662976, 'samples': 17339328, 'steps': 90308, 'loss/train': 2.4904026985168457} 11/07/2021 09:57:32 - INFO - __main__ - Step 90310: {'lr': 0.00017522798623892166, 'samples': 17339520, 'steps': 90309, 'loss/train': 1.1366477012634277} 11/07/2021 09:57:33 - INFO - __main__ - Step 90311: {'lr': 0.0001752229224249047, 'samples': 17339712, 'steps': 90310, 'loss/train': 1.3354766368865967} 11/07/2021 09:57:33 - INFO - __main__ - Step 90312: {'lr': 0.0001752178586445811, 'samples': 17339904, 'steps': 90311, 'loss/train': 0.09487418085336685} 11/07/2021 09:57:33 - INFO - __main__ - Step 90313: {'lr': 0.00017521279489795334, 'samples': 17340096, 'steps': 90312, 'loss/train': 0.4007195234298706} 11/07/2021 09:57:34 - INFO - __main__ - Step 90314: {'lr': 0.00017520773118502337, 'samples': 17340288, 'steps': 90313, 'loss/train': 1.3072060346603394} 11/07/2021 09:57:35 - INFO - __main__ - Step 90315: {'lr': 0.00017520266750579367, 'samples': 17340480, 'steps': 90314, 'loss/train': 1.138965129852295} 11/07/2021 09:57:35 - INFO - __main__ - Step 90316: {'lr': 0.00017519760386026652, 'samples': 17340672, 'steps': 90315, 'loss/train': 1.2315078973770142} 11/07/2021 09:57:35 - INFO - __main__ - Step 90317: {'lr': 0.00017519254024844414, 'samples': 17340864, 'steps': 90316, 'loss/train': 1.6672109365463257} 11/07/2021 09:57:36 - INFO - __main__ - Step 90318: {'lr': 0.00017518747667032885, 'samples': 17341056, 'steps': 90317, 'loss/train': 1.3627568483352661} 11/07/2021 09:57:36 - INFO - __main__ - Step 90319: {'lr': 0.00017518241312592292, 'samples': 17341248, 'steps': 90318, 'loss/train': 1.6622800827026367} 11/07/2021 09:57:37 - INFO - __main__ - Step 90320: {'lr': 0.0001751773496152287, 'samples': 17341440, 'steps': 90319, 'loss/train': 1.5898222923278809} 11/07/2021 09:57:38 - INFO - __main__ - Step 90321: {'lr': 0.00017517228613824835, 'samples': 17341632, 'steps': 90320, 'loss/train': 1.7620724439620972} 11/07/2021 09:57:38 - INFO - __main__ - Step 90322: {'lr': 0.00017516722269498422, 'samples': 17341824, 'steps': 90321, 'loss/train': 1.3889474868774414} 11/07/2021 09:57:38 - INFO - __main__ - Step 90323: {'lr': 0.0001751621592854386, 'samples': 17342016, 'steps': 90322, 'loss/train': 0.3841775059700012} 11/07/2021 09:57:39 - INFO - __main__ - Step 90324: {'lr': 0.00017515709590961375, 'samples': 17342208, 'steps': 90323, 'loss/train': 0.9258747100830078} 11/07/2021 09:57:39 - INFO - __main__ - Step 90325: {'lr': 0.00017515203256751195, 'samples': 17342400, 'steps': 90324, 'loss/train': 1.5080746412277222} 11/07/2021 09:57:40 - INFO - __main__ - Step 90326: {'lr': 0.00017514696925913548, 'samples': 17342592, 'steps': 90325, 'loss/train': 1.609296202659607} 11/07/2021 09:57:41 - INFO - __main__ - Step 90327: {'lr': 0.00017514190598448675, 'samples': 17342784, 'steps': 90326, 'loss/train': 1.164007306098938} 11/07/2021 09:57:41 - INFO - __main__ - Step 90328: {'lr': 0.00017513684274356783, 'samples': 17342976, 'steps': 90327, 'loss/train': 1.603095293045044} 11/07/2021 09:57:41 - INFO - __main__ - Step 90329: {'lr': 0.00017513177953638108, 'samples': 17343168, 'steps': 90328, 'loss/train': 1.3548065423965454} 11/07/2021 09:57:42 - INFO - __main__ - Step 90330: {'lr': 0.0001751267163629288, 'samples': 17343360, 'steps': 90329, 'loss/train': 1.254543662071228} 11/07/2021 09:57:43 - INFO - __main__ - Step 90331: {'lr': 0.00017512165322321327, 'samples': 17343552, 'steps': 90330, 'loss/train': 1.9194461107254028} 11/07/2021 09:57:43 - INFO - __main__ - Step 90332: {'lr': 0.0001751165901172368, 'samples': 17343744, 'steps': 90331, 'loss/train': 1.6930421590805054} 11/07/2021 09:57:43 - INFO - __main__ - Step 90333: {'lr': 0.00017511152704500157, 'samples': 17343936, 'steps': 90332, 'loss/train': 1.2768864631652832} 11/07/2021 09:57:44 - INFO - __main__ - Step 90334: {'lr': 0.00017510646400650999, 'samples': 17344128, 'steps': 90333, 'loss/train': 1.4619125127792358} 11/07/2021 09:57:44 - INFO - __main__ - Step 90335: {'lr': 0.00017510140100176425, 'samples': 17344320, 'steps': 90334, 'loss/train': 1.5904561281204224} 11/07/2021 09:57:45 - INFO - __main__ - Step 90336: {'lr': 0.00017509633803076665, 'samples': 17344512, 'steps': 90335, 'loss/train': 1.0093735456466675} 11/07/2021 09:57:45 - INFO - __main__ - Step 90337: {'lr': 0.00017509127509351952, 'samples': 17344704, 'steps': 90336, 'loss/train': 1.4144419431686401} 11/07/2021 09:57:46 - INFO - __main__ - Step 90338: {'lr': 0.00017508621219002507, 'samples': 17344896, 'steps': 90337, 'loss/train': 1.5921543836593628} 11/07/2021 09:57:46 - INFO - __main__ - Step 90339: {'lr': 0.00017508114932028563, 'samples': 17345088, 'steps': 90338, 'loss/train': 1.3632270097732544} 11/07/2021 09:57:46 - INFO - __main__ - Step 90340: {'lr': 0.00017507608648430355, 'samples': 17345280, 'steps': 90339, 'loss/train': 1.2394676208496094} 11/07/2021 09:57:48 - INFO - __main__ - Step 90341: {'lr': 0.00017507102368208096, 'samples': 17345472, 'steps': 90340, 'loss/train': 1.4439173936843872} 11/07/2021 09:57:48 - INFO - __main__ - Step 90342: {'lr': 0.0001750659609136202, 'samples': 17345664, 'steps': 90341, 'loss/train': 1.3403633832931519} 11/07/2021 09:57:48 - INFO - __main__ - Step 90343: {'lr': 0.00017506089817892356, 'samples': 17345856, 'steps': 90342, 'loss/train': 0.7647069096565247} 11/07/2021 09:57:49 - INFO - __main__ - Step 90344: {'lr': 0.00017505583547799337, 'samples': 17346048, 'steps': 90343, 'loss/train': 1.233899474143982} 11/07/2021 09:57:49 - INFO - __main__ - Step 90345: {'lr': 0.00017505077281083182, 'samples': 17346240, 'steps': 90344, 'loss/train': 1.2805103063583374} 11/07/2021 09:57:49 - INFO - __main__ - Step 90346: {'lr': 0.0001750457101774412, 'samples': 17346432, 'steps': 90345, 'loss/train': 1.3591572046279907} 11/07/2021 09:57:50 - INFO - __main__ - Step 90347: {'lr': 0.00017504064757782386, 'samples': 17346624, 'steps': 90346, 'loss/train': 0.9867153167724609} 11/07/2021 09:57:51 - INFO - __main__ - Step 90348: {'lr': 0.0001750355850119821, 'samples': 17346816, 'steps': 90347, 'loss/train': 1.2156656980514526} 11/07/2021 09:57:51 - INFO - __main__ - Step 90349: {'lr': 0.00017503052247991806, 'samples': 17347008, 'steps': 90348, 'loss/train': 1.885066032409668} 11/07/2021 09:57:51 - INFO - __main__ - Step 90350: {'lr': 0.00017502545998163415, 'samples': 17347200, 'steps': 90349, 'loss/train': 0.9309923648834229} 11/07/2021 09:57:52 - INFO - __main__ - Step 90351: {'lr': 0.00017502039751713262, 'samples': 17347392, 'steps': 90350, 'loss/train': 1.204504370689392} 11/07/2021 09:57:53 - INFO - __main__ - Step 90352: {'lr': 0.00017501533508641572, 'samples': 17347584, 'steps': 90351, 'loss/train': 1.2658473253250122} 11/07/2021 09:57:53 - INFO - __main__ - Step 90353: {'lr': 0.00017501027268948579, 'samples': 17347776, 'steps': 90352, 'loss/train': 1.0725288391113281} 11/07/2021 09:57:54 - INFO - __main__ - Step 90354: {'lr': 0.00017500521032634512, 'samples': 17347968, 'steps': 90353, 'loss/train': 1.7098368406295776} 11/07/2021 09:57:54 - INFO - __main__ - Step 90355: {'lr': 0.00017500014799699587, 'samples': 17348160, 'steps': 90354, 'loss/train': 1.267942190170288} 11/07/2021 09:57:54 - INFO - __main__ - Step 90356: {'lr': 0.0001749950857014404, 'samples': 17348352, 'steps': 90355, 'loss/train': 1.3850407600402832} 11/07/2021 09:57:55 - INFO - __main__ - Step 90357: {'lr': 0.00017499002343968097, 'samples': 17348544, 'steps': 90356, 'loss/train': 1.4373669624328613} 11/07/2021 09:57:56 - INFO - __main__ - Step 90358: {'lr': 0.0001749849612117199, 'samples': 17348736, 'steps': 90357, 'loss/train': 1.4728909730911255} 11/07/2021 09:57:56 - INFO - __main__ - Step 90359: {'lr': 0.00017497989901755945, 'samples': 17348928, 'steps': 90358, 'loss/train': 1.7557432651519775} 11/07/2021 09:57:56 - INFO - __main__ - Step 90360: {'lr': 0.00017497483685720189, 'samples': 17349120, 'steps': 90359, 'loss/train': 5.716197490692139} 11/07/2021 09:57:57 - INFO - __main__ - Step 90361: {'lr': 0.0001749697747306495, 'samples': 17349312, 'steps': 90360, 'loss/train': 1.6410030126571655} 11/07/2021 09:57:57 - INFO - __main__ - Step 90362: {'lr': 0.00017496471263790458, 'samples': 17349504, 'steps': 90361, 'loss/train': 1.2912393808364868} 11/07/2021 09:57:58 - INFO - __main__ - Step 90363: {'lr': 0.0001749596505789694, 'samples': 17349696, 'steps': 90362, 'loss/train': 0.8581980466842651} 11/07/2021 09:57:59 - INFO - __main__ - Step 90364: {'lr': 0.00017495458855384626, 'samples': 17349888, 'steps': 90363, 'loss/train': 1.8648418188095093} 11/07/2021 09:57:59 - INFO - __main__ - Step 90365: {'lr': 0.00017494952656253742, 'samples': 17350080, 'steps': 90364, 'loss/train': 1.4824353456497192} 11/07/2021 09:57:59 - INFO - __main__ - Step 90366: {'lr': 0.00017494446460504515, 'samples': 17350272, 'steps': 90365, 'loss/train': 1.697394847869873} 11/07/2021 09:58:00 - INFO - __main__ - Step 90367: {'lr': 0.00017493940268137188, 'samples': 17350464, 'steps': 90366, 'loss/train': 1.5029209852218628} 11/07/2021 09:58:01 - INFO - __main__ - Step 90368: {'lr': 0.0001749343407915196, 'samples': 17350656, 'steps': 90367, 'loss/train': 1.3147600889205933} 11/07/2021 09:58:01 - INFO - __main__ - Step 90369: {'lr': 0.00017492927893549083, 'samples': 17350848, 'steps': 90368, 'loss/train': 1.3955961465835571} 11/07/2021 09:58:02 - INFO - __main__ - Step 90370: {'lr': 0.0001749242171132877, 'samples': 17351040, 'steps': 90369, 'loss/train': 1.2588704824447632} 11/07/2021 09:58:02 - INFO - __main__ - Step 90371: {'lr': 0.0001749191553249126, 'samples': 17351232, 'steps': 90370, 'loss/train': 0.12995073199272156} 11/07/2021 09:58:02 - INFO - __main__ - Step 90372: {'lr': 0.00017491409357036773, 'samples': 17351424, 'steps': 90371, 'loss/train': 1.4969981908798218} 11/07/2021 09:58:03 - INFO - __main__ - Step 90373: {'lr': 0.00017490903184965543, 'samples': 17351616, 'steps': 90372, 'loss/train': 1.1215680837631226} 11/07/2021 09:58:04 - INFO - __main__ - Step 90374: {'lr': 0.00017490397016277796, 'samples': 17351808, 'steps': 90373, 'loss/train': 1.089290738105774} 11/07/2021 09:58:04 - INFO - __main__ - Step 90375: {'lr': 0.00017489890850973762, 'samples': 17352000, 'steps': 90374, 'loss/train': 1.0847814083099365} 11/07/2021 09:58:04 - INFO - __main__ - Step 90376: {'lr': 0.00017489384689053662, 'samples': 17352192, 'steps': 90375, 'loss/train': 1.3855992555618286} 11/07/2021 09:58:05 - INFO - __main__ - Step 90377: {'lr': 0.00017488878530517733, 'samples': 17352384, 'steps': 90376, 'loss/train': 1.6425338983535767} 11/07/2021 09:58:06 - INFO - __main__ - Step 90378: {'lr': 0.000174883723753662, 'samples': 17352576, 'steps': 90377, 'loss/train': 1.6241899728775024} 11/07/2021 09:58:06 - INFO - __main__ - Step 90379: {'lr': 0.0001748786622359929, 'samples': 17352768, 'steps': 90378, 'loss/train': 1.5918934345245361} 11/07/2021 09:58:07 - INFO - __main__ - Step 90380: {'lr': 0.00017487360075217232, 'samples': 17352960, 'steps': 90379, 'loss/train': 1.154196858406067} 11/07/2021 09:58:07 - INFO - __main__ - Step 90381: {'lr': 0.00017486853930220265, 'samples': 17353152, 'steps': 90380, 'loss/train': 1.699405550956726} 11/07/2021 09:58:07 - INFO - __main__ - Step 90382: {'lr': 0.0001748634778860859, 'samples': 17353344, 'steps': 90381, 'loss/train': 1.5411916971206665} 11/07/2021 09:58:08 - INFO - __main__ - Step 90383: {'lr': 0.00017485841650382455, 'samples': 17353536, 'steps': 90382, 'loss/train': 1.9710230827331543} 11/07/2021 09:58:09 - INFO - __main__ - Step 90384: {'lr': 0.00017485335515542085, 'samples': 17353728, 'steps': 90383, 'loss/train': 1.1394675970077515} 11/07/2021 09:58:09 - INFO - __main__ - Step 90385: {'lr': 0.00017484829384087702, 'samples': 17353920, 'steps': 90384, 'loss/train': 1.4225386381149292} 11/07/2021 09:58:09 - INFO - __main__ - Step 90386: {'lr': 0.00017484323256019546, 'samples': 17354112, 'steps': 90385, 'loss/train': 1.6420471668243408} 11/07/2021 09:58:10 - INFO - __main__ - Step 90387: {'lr': 0.0001748381713133783, 'samples': 17354304, 'steps': 90386, 'loss/train': 1.4765557050704956} 11/07/2021 09:58:10 - INFO - __main__ - Step 90388: {'lr': 0.00017483311010042796, 'samples': 17354496, 'steps': 90387, 'loss/train': 1.335822343826294} 11/07/2021 09:58:11 - INFO - __main__ - Step 90389: {'lr': 0.00017482804892134666, 'samples': 17354688, 'steps': 90388, 'loss/train': 1.3502312898635864} 11/07/2021 09:58:12 - INFO - __main__ - Step 90390: {'lr': 0.00017482298777613664, 'samples': 17354880, 'steps': 90389, 'loss/train': 1.3127480745315552} 11/07/2021 09:58:12 - INFO - __main__ - Step 90391: {'lr': 0.00017481792666480025, 'samples': 17355072, 'steps': 90390, 'loss/train': 1.117229700088501} 11/07/2021 09:58:13 - INFO - __main__ - Step 90392: {'lr': 0.00017481286558733978, 'samples': 17355264, 'steps': 90391, 'loss/train': 0.6012700200080872} 11/07/2021 09:58:13 - INFO - __main__ - Step 90393: {'lr': 0.00017480780454375743, 'samples': 17355456, 'steps': 90392, 'loss/train': 1.2212613821029663} 11/07/2021 09:58:13 - INFO - __main__ - Step 90394: {'lr': 0.00017480274353405558, 'samples': 17355648, 'steps': 90393, 'loss/train': 1.5296986103057861} 11/07/2021 09:58:14 - INFO - __main__ - Step 90395: {'lr': 0.0001747976825582364, 'samples': 17355840, 'steps': 90394, 'loss/train': 0.7600122094154358} 11/07/2021 09:58:15 - INFO - __main__ - Step 90396: {'lr': 0.00017479262161630222, 'samples': 17356032, 'steps': 90395, 'loss/train': 1.137945294380188} 11/07/2021 09:58:15 - INFO - __main__ - Step 90397: {'lr': 0.00017478756070825533, 'samples': 17356224, 'steps': 90396, 'loss/train': 1.6748604774475098} 11/07/2021 09:58:15 - INFO - __main__ - Step 90398: {'lr': 0.000174782499834098, 'samples': 17356416, 'steps': 90397, 'loss/train': 2.4969146251678467} 11/07/2021 09:58:16 - INFO - __main__ - Step 90399: {'lr': 0.0001747774389938325, 'samples': 17356608, 'steps': 90398, 'loss/train': 1.1403170824050903} 11/07/2021 09:58:17 - INFO - __main__ - Step 90400: {'lr': 0.00017477237818746115, 'samples': 17356800, 'steps': 90399, 'loss/train': 1.6689339876174927} 11/07/2021 09:58:17 - INFO - __main__ - Step 90401: {'lr': 0.00017476731741498618, 'samples': 17356992, 'steps': 90400, 'loss/train': 1.4772794246673584} 11/07/2021 09:58:17 - INFO - __main__ - Step 90402: {'lr': 0.0001747622566764099, 'samples': 17357184, 'steps': 90401, 'loss/train': 1.059770107269287} 11/07/2021 09:58:18 - INFO - __main__ - Step 90403: {'lr': 0.00017475719597173468, 'samples': 17357376, 'steps': 90402, 'loss/train': 1.68361234664917} 11/07/2021 09:58:18 - INFO - __main__ - Step 90404: {'lr': 0.0001747521353009626, 'samples': 17357568, 'steps': 90403, 'loss/train': 1.4574368000030518} 11/07/2021 09:58:19 - INFO - __main__ - Step 90405: {'lr': 0.00017474707466409606, 'samples': 17357760, 'steps': 90404, 'loss/train': 1.6586159467697144} 11/07/2021 09:58:20 - INFO - __main__ - Step 90406: {'lr': 0.00017474201406113735, 'samples': 17357952, 'steps': 90405, 'loss/train': 1.6235533952713013} 11/07/2021 09:58:20 - INFO - __main__ - Step 90407: {'lr': 0.0001747369534920887, 'samples': 17358144, 'steps': 90406, 'loss/train': 1.3899846076965332} 11/07/2021 09:58:20 - INFO - __main__ - Step 90408: {'lr': 0.00017473189295695249, 'samples': 17358336, 'steps': 90407, 'loss/train': 0.9031802415847778} 11/07/2021 09:58:21 - INFO - __main__ - Step 90409: {'lr': 0.00017472683245573086, 'samples': 17358528, 'steps': 90408, 'loss/train': 1.3341659307479858} 11/07/2021 09:58:22 - INFO - __main__ - Step 90410: {'lr': 0.00017472177198842617, 'samples': 17358720, 'steps': 90409, 'loss/train': 1.3816670179367065} 11/07/2021 09:58:22 - INFO - __main__ - Step 90411: {'lr': 0.0001747167115550407, 'samples': 17358912, 'steps': 90410, 'loss/train': 1.4989882707595825} 11/07/2021 09:58:23 - INFO - __main__ - Step 90412: {'lr': 0.0001747116511555767, 'samples': 17359104, 'steps': 90411, 'loss/train': 1.8583157062530518} 11/07/2021 09:58:23 - INFO - __main__ - Step 90413: {'lr': 0.00017470659079003644, 'samples': 17359296, 'steps': 90412, 'loss/train': 0.9651975035667419} 11/07/2021 09:58:23 - INFO - __main__ - Step 90414: {'lr': 0.00017470153045842234, 'samples': 17359488, 'steps': 90413, 'loss/train': 1.2636151313781738} 11/07/2021 09:58:24 - INFO - __main__ - Step 90415: {'lr': 0.00017469647016073647, 'samples': 17359680, 'steps': 90414, 'loss/train': 1.5292078256607056} 11/07/2021 09:58:24 - INFO - __main__ - Step 90416: {'lr': 0.00017469140989698122, 'samples': 17359872, 'steps': 90415, 'loss/train': 0.980841338634491} 11/07/2021 09:58:25 - INFO - __main__ - Step 90417: {'lr': 0.00017468634966715885, 'samples': 17360064, 'steps': 90416, 'loss/train': 0.9144805669784546} 11/07/2021 09:58:25 - INFO - __main__ - Step 90418: {'lr': 0.00017468128947127168, 'samples': 17360256, 'steps': 90417, 'loss/train': 1.804276704788208} 11/07/2021 09:58:26 - INFO - __main__ - Step 90419: {'lr': 0.00017467622930932193, 'samples': 17360448, 'steps': 90418, 'loss/train': 1.2248549461364746} 11/07/2021 09:58:26 - INFO - __main__ - Step 90420: {'lr': 0.00017467116918131194, 'samples': 17360640, 'steps': 90419, 'loss/train': 2.155827283859253} 11/07/2021 09:58:26 - INFO - __main__ - Step 90421: {'lr': 0.00017466610908724398, 'samples': 17360832, 'steps': 90420, 'loss/train': 1.9042161703109741} 11/07/2021 09:58:28 - INFO - __main__ - Step 90422: {'lr': 0.00017466104902712025, 'samples': 17361024, 'steps': 90421, 'loss/train': 1.2567983865737915} 11/07/2021 09:58:28 - INFO - __main__ - Step 90423: {'lr': 0.0001746559890009431, 'samples': 17361216, 'steps': 90422, 'loss/train': 2.017662286758423} 11/07/2021 09:58:28 - INFO - __main__ - Step 90424: {'lr': 0.0001746509290087148, 'samples': 17361408, 'steps': 90423, 'loss/train': 1.6897786855697632} 11/07/2021 09:58:29 - INFO - __main__ - Step 90425: {'lr': 0.00017464586905043772, 'samples': 17361600, 'steps': 90424, 'loss/train': 1.046202301979065} 11/07/2021 09:58:29 - INFO - __main__ - Step 90426: {'lr': 0.00017464080912611395, 'samples': 17361792, 'steps': 90425, 'loss/train': 1.588213562965393} 11/07/2021 09:58:30 - INFO - __main__ - Step 90427: {'lr': 0.00017463574923574587, 'samples': 17361984, 'steps': 90426, 'loss/train': 1.180737853050232} 11/07/2021 09:58:31 - INFO - __main__ - Step 90428: {'lr': 0.0001746306893793358, 'samples': 17362176, 'steps': 90427, 'loss/train': 1.7135510444641113} 11/07/2021 09:58:31 - INFO - __main__ - Step 90429: {'lr': 0.00017462562955688593, 'samples': 17362368, 'steps': 90428, 'loss/train': 1.184545874595642} 11/07/2021 09:58:31 - INFO - __main__ - Step 90430: {'lr': 0.0001746205697683986, 'samples': 17362560, 'steps': 90429, 'loss/train': 1.3076786994934082} 11/07/2021 09:58:32 - INFO - __main__ - Step 90431: {'lr': 0.0001746155100138761, 'samples': 17362752, 'steps': 90430, 'loss/train': 0.536493718624115} 11/07/2021 09:58:33 - INFO - __main__ - Step 90432: {'lr': 0.00017461045029332068, 'samples': 17362944, 'steps': 90431, 'loss/train': 0.9406847953796387} 11/07/2021 09:58:33 - INFO - __main__ - Step 90433: {'lr': 0.00017460539060673458, 'samples': 17363136, 'steps': 90432, 'loss/train': 1.1620136499404907} 11/07/2021 09:58:33 - INFO - __main__ - Step 90434: {'lr': 0.00017460033095412024, 'samples': 17363328, 'steps': 90433, 'loss/train': 1.0073696374893188} 11/07/2021 09:58:34 - INFO - __main__ - Step 90435: {'lr': 0.00017459527133547976, 'samples': 17363520, 'steps': 90434, 'loss/train': 1.4395161867141724} 11/07/2021 09:58:34 - INFO - __main__ - Step 90436: {'lr': 0.00017459021175081552, 'samples': 17363712, 'steps': 90435, 'loss/train': 1.4361495971679688} 11/07/2021 09:58:35 - INFO - __main__ - Step 90437: {'lr': 0.00017458515220012972, 'samples': 17363904, 'steps': 90436, 'loss/train': 1.6329013109207153} 11/07/2021 09:58:35 - INFO - __main__ - Step 90438: {'lr': 0.00017458009268342474, 'samples': 17364096, 'steps': 90437, 'loss/train': 1.3052775859832764} 11/07/2021 09:58:36 - INFO - __main__ - Step 90439: {'lr': 0.00017457503320070271, 'samples': 17364288, 'steps': 90438, 'loss/train': 1.3530558347702026} 11/07/2021 09:58:36 - INFO - __main__ - Step 90440: {'lr': 0.0001745699737519661, 'samples': 17364480, 'steps': 90439, 'loss/train': 1.4896670579910278} 11/07/2021 09:58:37 - INFO - __main__ - Step 90441: {'lr': 0.00017456491433721704, 'samples': 17364672, 'steps': 90440, 'loss/train': 1.5952234268188477} 11/07/2021 09:58:37 - INFO - __main__ - Step 90442: {'lr': 0.00017455985495645786, 'samples': 17364864, 'steps': 90441, 'loss/train': 1.3213797807693481} 11/07/2021 09:58:38 - INFO - __main__ - Step 90443: {'lr': 0.00017455479560969086, 'samples': 17365056, 'steps': 90442, 'loss/train': 1.3909329175949097} 11/07/2021 09:58:38 - INFO - __main__ - Step 90444: {'lr': 0.00017454973629691835, 'samples': 17365248, 'steps': 90443, 'loss/train': 1.803442358970642} 11/07/2021 09:58:39 - INFO - __main__ - Step 90445: {'lr': 0.0001745446770181425, 'samples': 17365440, 'steps': 90444, 'loss/train': 1.461923599243164} 11/07/2021 09:58:39 - INFO - __main__ - Step 90446: {'lr': 0.0001745396177733657, 'samples': 17365632, 'steps': 90445, 'loss/train': 1.8005863428115845} 11/07/2021 09:58:39 - INFO - __main__ - Step 90447: {'lr': 0.00017453455856259015, 'samples': 17365824, 'steps': 90446, 'loss/train': 1.463132619857788} 11/07/2021 09:58:40 - INFO - __main__ - Step 90448: {'lr': 0.00017452949938581824, 'samples': 17366016, 'steps': 90447, 'loss/train': 1.292733073234558} 11/07/2021 09:58:41 - INFO - __main__ - Step 90449: {'lr': 0.00017452444024305215, 'samples': 17366208, 'steps': 90448, 'loss/train': 1.4341251850128174} 11/07/2021 09:58:41 - INFO - __main__ - Step 90450: {'lr': 0.00017451938113429412, 'samples': 17366400, 'steps': 90449, 'loss/train': 1.702822208404541} 11/07/2021 09:58:41 - INFO - __main__ - Step 90451: {'lr': 0.00017451432205954653, 'samples': 17366592, 'steps': 90450, 'loss/train': 1.55104660987854} 11/07/2021 09:58:42 - INFO - __main__ - Step 90452: {'lr': 0.00017450926301881158, 'samples': 17366784, 'steps': 90451, 'loss/train': 1.790251612663269} 11/07/2021 09:58:43 - INFO - __main__ - Step 90453: {'lr': 0.00017450420401209164, 'samples': 17366976, 'steps': 90452, 'loss/train': 1.221847653388977} 11/07/2021 09:58:44 - INFO - __main__ - Step 90454: {'lr': 0.00017449914503938892, 'samples': 17367168, 'steps': 90453, 'loss/train': 1.426224946975708} 11/07/2021 09:58:44 - INFO - __main__ - Step 90455: {'lr': 0.00017449408610070572, 'samples': 17367360, 'steps': 90454, 'loss/train': 0.2057521790266037} 11/07/2021 09:58:44 - INFO - __main__ - Step 90456: {'lr': 0.0001744890271960443, 'samples': 17367552, 'steps': 90455, 'loss/train': 1.2950642108917236} 11/07/2021 09:58:45 - INFO - __main__ - Step 90457: {'lr': 0.00017448396832540696, 'samples': 17367744, 'steps': 90456, 'loss/train': 1.45834219455719} 11/07/2021 09:58:46 - INFO - __main__ - Step 90458: {'lr': 0.00017447890948879603, 'samples': 17367936, 'steps': 90457, 'loss/train': 0.8329326510429382} 11/07/2021 09:58:47 - INFO - __main__ - Step 90459: {'lr': 0.00017447385068621369, 'samples': 17368128, 'steps': 90458, 'loss/train': 1.325355887413025} 11/07/2021 09:58:47 - INFO - __main__ - Step 90460: {'lr': 0.00017446879191766228, 'samples': 17368320, 'steps': 90459, 'loss/train': 1.7465966939926147} 11/07/2021 09:58:47 - INFO - __main__ - Step 90461: {'lr': 0.00017446373318314416, 'samples': 17368512, 'steps': 90460, 'loss/train': 0.6628066301345825} 11/07/2021 09:58:48 - INFO - __main__ - Step 90462: {'lr': 0.00017445867448266143, 'samples': 17368704, 'steps': 90461, 'loss/train': 0.1110304445028305} 11/07/2021 09:58:48 - INFO - __main__ - Step 90463: {'lr': 0.00017445361581621644, 'samples': 17368896, 'steps': 90462, 'loss/train': 1.7347183227539062} 11/07/2021 09:58:48 - INFO - __main__ - Step 90464: {'lr': 0.00017444855718381147, 'samples': 17369088, 'steps': 90463, 'loss/train': 1.7557628154754639} 11/07/2021 09:58:49 - INFO - __main__ - Step 90465: {'lr': 0.00017444349858544887, 'samples': 17369280, 'steps': 90464, 'loss/train': 1.7090104818344116} 11/07/2021 09:58:50 - INFO - __main__ - Step 90466: {'lr': 0.00017443844002113082, 'samples': 17369472, 'steps': 90465, 'loss/train': 1.279994010925293} 11/07/2021 09:58:50 - INFO - __main__ - Step 90467: {'lr': 0.00017443338149085964, 'samples': 17369664, 'steps': 90466, 'loss/train': 1.4372366666793823} 11/07/2021 09:58:51 - INFO - __main__ - Step 90468: {'lr': 0.00017442832299463762, 'samples': 17369856, 'steps': 90467, 'loss/train': 1.6411794424057007} 11/07/2021 09:58:51 - INFO - __main__ - Step 90469: {'lr': 0.00017442326453246705, 'samples': 17370048, 'steps': 90468, 'loss/train': 1.4379169940948486} 11/07/2021 09:58:51 - INFO - __main__ - Step 90470: {'lr': 0.0001744182061043502, 'samples': 17370240, 'steps': 90469, 'loss/train': 1.406606674194336} 11/07/2021 09:58:52 - INFO - __main__ - Step 90471: {'lr': 0.0001744131477102893, 'samples': 17370432, 'steps': 90470, 'loss/train': 1.5134761333465576} 11/07/2021 09:58:53 - INFO - __main__ - Step 90472: {'lr': 0.0001744080893502867, 'samples': 17370624, 'steps': 90471, 'loss/train': 1.4653606414794922} 11/07/2021 09:58:53 - INFO - __main__ - Step 90473: {'lr': 0.00017440303102434464, 'samples': 17370816, 'steps': 90472, 'loss/train': 0.28431951999664307} 11/07/2021 09:58:53 - INFO - __main__ - Step 90474: {'lr': 0.0001743979727324654, 'samples': 17371008, 'steps': 90473, 'loss/train': 1.68635892868042} 11/07/2021 09:58:54 - INFO - __main__ - Step 90475: {'lr': 0.00017439291447465138, 'samples': 17371200, 'steps': 90474, 'loss/train': 1.199599266052246} 11/07/2021 09:58:55 - INFO - __main__ - Step 90476: {'lr': 0.00017438785625090465, 'samples': 17371392, 'steps': 90475, 'loss/train': 1.5231451988220215} 11/07/2021 09:58:55 - INFO - __main__ - Step 90477: {'lr': 0.00017438279806122753, 'samples': 17371584, 'steps': 90476, 'loss/train': 1.3734465837478638} 11/07/2021 09:58:56 - INFO - __main__ - Step 90478: {'lr': 0.00017437773990562242, 'samples': 17371776, 'steps': 90477, 'loss/train': 1.8017407655715942} 11/07/2021 09:58:56 - INFO - __main__ - Step 90479: {'lr': 0.00017437268178409148, 'samples': 17371968, 'steps': 90478, 'loss/train': 1.5253348350524902} 11/07/2021 09:58:57 - INFO - __main__ - Step 90480: {'lr': 0.00017436762369663712, 'samples': 17372160, 'steps': 90479, 'loss/train': 1.5548806190490723} 11/07/2021 09:58:57 - INFO - __main__ - Step 90481: {'lr': 0.00017436256564326146, 'samples': 17372352, 'steps': 90480, 'loss/train': 1.1195993423461914} 11/07/2021 09:58:58 - INFO - __main__ - Step 90482: {'lr': 0.0001743575076239669, 'samples': 17372544, 'steps': 90481, 'loss/train': 0.13470323383808136} 11/07/2021 09:58:58 - INFO - __main__ - Step 90483: {'lr': 0.00017435244963875569, 'samples': 17372736, 'steps': 90482, 'loss/train': 1.4900726079940796} 11/07/2021 09:58:59 - INFO - __main__ - Step 90484: {'lr': 0.00017434739168763007, 'samples': 17372928, 'steps': 90483, 'loss/train': 1.6561311483383179} 11/07/2021 09:58:59 - INFO - __main__ - Step 90485: {'lr': 0.00017434233377059235, 'samples': 17373120, 'steps': 90484, 'loss/train': 1.3432739973068237} 11/07/2021 09:58:59 - INFO - __main__ - Step 90486: {'lr': 0.00017433727588764484, 'samples': 17373312, 'steps': 90485, 'loss/train': 1.2521172761917114} 11/07/2021 09:59:00 - INFO - __main__ - Step 90487: {'lr': 0.00017433221803878974, 'samples': 17373504, 'steps': 90486, 'loss/train': 0.7237115502357483} 11/07/2021 09:59:01 - INFO - __main__ - Step 90488: {'lr': 0.0001743271602240295, 'samples': 17373696, 'steps': 90487, 'loss/train': 1.3352100849151611} 11/07/2021 09:59:01 - INFO - __main__ - Step 90489: {'lr': 0.00017432210244336618, 'samples': 17373888, 'steps': 90488, 'loss/train': 1.5006314516067505} 11/07/2021 09:59:01 - INFO - __main__ - Step 90490: {'lr': 0.00017431704469680215, 'samples': 17374080, 'steps': 90489, 'loss/train': 1.454590916633606} 11/07/2021 09:59:02 - INFO - __main__ - Step 90491: {'lr': 0.0001743119869843397, 'samples': 17374272, 'steps': 90490, 'loss/train': 1.5553345680236816} 11/07/2021 09:59:03 - INFO - __main__ - Step 90492: {'lr': 0.00017430692930598107, 'samples': 17374464, 'steps': 90491, 'loss/train': 0.6350515484809875} 11/07/2021 09:59:03 - INFO - __main__ - Step 90493: {'lr': 0.0001743018716617286, 'samples': 17374656, 'steps': 90492, 'loss/train': 1.6485358476638794} 11/07/2021 09:59:03 - INFO - __main__ - Step 90494: {'lr': 0.00017429681405158455, 'samples': 17374848, 'steps': 90493, 'loss/train': 0.3855491876602173} 11/07/2021 09:59:04 - INFO - __main__ - Step 90495: {'lr': 0.00017429175647555115, 'samples': 17375040, 'steps': 90494, 'loss/train': 2.74178147315979} 11/07/2021 09:59:04 - INFO - __main__ - Step 90496: {'lr': 0.00017428669893363073, 'samples': 17375232, 'steps': 90495, 'loss/train': 1.3499659299850464} 11/07/2021 09:59:05 - INFO - __main__ - Step 90497: {'lr': 0.00017428164142582552, 'samples': 17375424, 'steps': 90496, 'loss/train': 0.8357868194580078} 11/07/2021 09:59:06 - INFO - __main__ - Step 90498: {'lr': 0.0001742765839521379, 'samples': 17375616, 'steps': 90497, 'loss/train': 1.1807801723480225} 11/07/2021 09:59:06 - INFO - __main__ - Step 90499: {'lr': 0.00017427152651257005, 'samples': 17375808, 'steps': 90498, 'loss/train': 1.157365083694458} 11/07/2021 09:59:06 - INFO - __main__ - Step 90500: {'lr': 0.00017426646910712428, 'samples': 17376000, 'steps': 90499, 'loss/train': 1.767688274383545} 11/07/2021 09:59:07 - INFO - __main__ - Step 90501: {'lr': 0.00017426141173580289, 'samples': 17376192, 'steps': 90500, 'loss/train': 1.262531042098999} 11/07/2021 09:59:08 - INFO - __main__ - Step 90502: {'lr': 0.00017425635439860822, 'samples': 17376384, 'steps': 90501, 'loss/train': 1.2903094291687012} 11/07/2021 09:59:08 - INFO - __main__ - Step 90503: {'lr': 0.0001742512970955424, 'samples': 17376576, 'steps': 90502, 'loss/train': 1.5416613817214966} 11/07/2021 09:59:08 - INFO - __main__ - Step 90504: {'lr': 0.0001742462398266077, 'samples': 17376768, 'steps': 90503, 'loss/train': 1.2014622688293457} 11/07/2021 09:59:09 - INFO - __main__ - Step 90505: {'lr': 0.00017424118259180656, 'samples': 17376960, 'steps': 90504, 'loss/train': 1.0922799110412598} 11/07/2021 09:59:09 - INFO - __main__ - Step 90506: {'lr': 0.0001742361253911411, 'samples': 17377152, 'steps': 90505, 'loss/train': 1.180373191833496} 11/07/2021 09:59:10 - INFO - __main__ - Step 90507: {'lr': 0.0001742310682246137, 'samples': 17377344, 'steps': 90506, 'loss/train': 1.0335592031478882} 11/07/2021 09:59:11 - INFO - __main__ - Step 90508: {'lr': 0.00017422601109222662, 'samples': 17377536, 'steps': 90507, 'loss/train': 0.9942478537559509} 11/07/2021 09:59:11 - INFO - __main__ - Step 90509: {'lr': 0.00017422095399398217, 'samples': 17377728, 'steps': 90508, 'loss/train': 1.5153158903121948} 11/07/2021 09:59:11 - INFO - __main__ - Step 90510: {'lr': 0.00017421589692988255, 'samples': 17377920, 'steps': 90509, 'loss/train': 1.2753537893295288} 11/07/2021 09:59:12 - INFO - __main__ - Step 90511: {'lr': 0.0001742108398999301, 'samples': 17378112, 'steps': 90510, 'loss/train': 0.7251616716384888} 11/07/2021 09:59:12 - INFO - __main__ - Step 90512: {'lr': 0.00017420578290412703, 'samples': 17378304, 'steps': 90511, 'loss/train': 1.7863572835922241} 11/07/2021 09:59:13 - INFO - __main__ - Step 90513: {'lr': 0.00017420072594247568, 'samples': 17378496, 'steps': 90512, 'loss/train': 1.2709662914276123} 11/07/2021 09:59:13 - INFO - __main__ - Step 90514: {'lr': 0.00017419566901497833, 'samples': 17378688, 'steps': 90513, 'loss/train': 1.6831331253051758} 11/07/2021 09:59:14 - INFO - __main__ - Step 90515: {'lr': 0.00017419061212163732, 'samples': 17378880, 'steps': 90514, 'loss/train': 1.1389778852462769} 11/07/2021 09:59:14 - INFO - __main__ - Step 90516: {'lr': 0.00017418555526245476, 'samples': 17379072, 'steps': 90515, 'loss/train': 1.3381112813949585} 11/07/2021 09:59:14 - INFO - __main__ - Step 90517: {'lr': 0.00017418049843743305, 'samples': 17379264, 'steps': 90516, 'loss/train': 0.14828196167945862} 11/07/2021 09:59:15 - INFO - __main__ - Step 90518: {'lr': 0.0001741754416465744, 'samples': 17379456, 'steps': 90517, 'loss/train': 1.214479684829712} 11/07/2021 09:59:16 - INFO - __main__ - Step 90519: {'lr': 0.00017417038488988114, 'samples': 17379648, 'steps': 90518, 'loss/train': 1.3818752765655518} 11/07/2021 09:59:16 - INFO - __main__ - Step 90520: {'lr': 0.00017416532816735554, 'samples': 17379840, 'steps': 90519, 'loss/train': 1.2902029752731323} 11/07/2021 09:59:16 - INFO - __main__ - Step 90521: {'lr': 0.00017416027147899984, 'samples': 17380032, 'steps': 90520, 'loss/train': 1.213720440864563} 11/07/2021 09:59:17 - INFO - __main__ - Step 90522: {'lr': 0.00017415521482481639, 'samples': 17380224, 'steps': 90521, 'loss/train': 1.449222445487976} 11/07/2021 09:59:18 - INFO - __main__ - Step 90523: {'lr': 0.00017415015820480739, 'samples': 17380416, 'steps': 90522, 'loss/train': 1.5468987226486206} 11/07/2021 09:59:18 - INFO - __main__ - Step 90524: {'lr': 0.0001741451016189752, 'samples': 17380608, 'steps': 90523, 'loss/train': 1.259226679801941} 11/07/2021 09:59:18 - INFO - __main__ - Step 90525: {'lr': 0.00017414004506732206, 'samples': 17380800, 'steps': 90524, 'loss/train': 1.0544368028640747} 11/07/2021 09:59:19 - INFO - __main__ - Step 90526: {'lr': 0.0001741349885498502, 'samples': 17380992, 'steps': 90525, 'loss/train': 1.191081166267395} 11/07/2021 09:59:19 - INFO - __main__ - Step 90527: {'lr': 0.00017412993206656203, 'samples': 17381184, 'steps': 90526, 'loss/train': 1.5315433740615845} 11/07/2021 09:59:20 - INFO - __main__ - Step 90528: {'lr': 0.00017412487561745967, 'samples': 17381376, 'steps': 90527, 'loss/train': 1.5743157863616943} 11/07/2021 09:59:21 - INFO - __main__ - Step 90529: {'lr': 0.00017411981920254554, 'samples': 17381568, 'steps': 90528, 'loss/train': 1.4417955875396729} 11/07/2021 09:59:21 - INFO - __main__ - Step 90530: {'lr': 0.0001741147628218218, 'samples': 17381760, 'steps': 90529, 'loss/train': 0.7332841157913208} 11/07/2021 09:59:21 - INFO - __main__ - Step 90531: {'lr': 0.00017410970647529077, 'samples': 17381952, 'steps': 90530, 'loss/train': 1.0592602491378784} 11/07/2021 09:59:22 - INFO - __main__ - Step 90532: {'lr': 0.00017410465016295474, 'samples': 17382144, 'steps': 90531, 'loss/train': 1.2167624235153198} 11/07/2021 09:59:22 - INFO - __main__ - Step 90533: {'lr': 0.00017409959388481593, 'samples': 17382336, 'steps': 90532, 'loss/train': 1.445949673652649} 11/07/2021 09:59:23 - INFO - __main__ - Step 90534: {'lr': 0.00017409453764087674, 'samples': 17382528, 'steps': 90533, 'loss/train': 0.6892134547233582} 11/07/2021 09:59:23 - INFO - __main__ - Step 90535: {'lr': 0.00017408948143113936, 'samples': 17382720, 'steps': 90534, 'loss/train': 1.8147692680358887} 11/07/2021 09:59:24 - INFO - __main__ - Step 90536: {'lr': 0.0001740844252556061, 'samples': 17382912, 'steps': 90535, 'loss/train': 1.3143577575683594} 11/07/2021 09:59:24 - INFO - __main__ - Step 90537: {'lr': 0.00017407936911427923, 'samples': 17383104, 'steps': 90536, 'loss/train': 1.3598071336746216} 11/07/2021 09:59:24 - INFO - __main__ - Step 90538: {'lr': 0.00017407431300716104, 'samples': 17383296, 'steps': 90537, 'loss/train': 1.4997777938842773} 11/07/2021 09:59:25 - INFO - __main__ - Step 90539: {'lr': 0.00017406925693425374, 'samples': 17383488, 'steps': 90538, 'loss/train': 1.520395278930664} 11/07/2021 09:59:26 - INFO - __main__ - Step 90540: {'lr': 0.0001740642008955597, 'samples': 17383680, 'steps': 90539, 'loss/train': 1.3259313106536865} 11/07/2021 09:59:26 - INFO - __main__ - Step 90541: {'lr': 0.00017405914489108113, 'samples': 17383872, 'steps': 90540, 'loss/train': 1.2326031923294067} 11/07/2021 09:59:27 - INFO - __main__ - Step 90542: {'lr': 0.0001740540889208204, 'samples': 17384064, 'steps': 90541, 'loss/train': 0.5651528835296631} 11/07/2021 09:59:27 - INFO - __main__ - Step 90543: {'lr': 0.00017404903298477966, 'samples': 17384256, 'steps': 90542, 'loss/train': 1.5888125896453857} 11/07/2021 09:59:28 - INFO - __main__ - Step 90544: {'lr': 0.00017404397708296128, 'samples': 17384448, 'steps': 90543, 'loss/train': 0.9398135542869568} 11/07/2021 09:59:28 - INFO - __main__ - Step 90545: {'lr': 0.0001740389212153675, 'samples': 17384640, 'steps': 90544, 'loss/train': 0.6268999576568604} 11/07/2021 09:59:29 - INFO - __main__ - Step 90546: {'lr': 0.0001740338653820006, 'samples': 17384832, 'steps': 90545, 'loss/train': 0.9662044644355774} 11/07/2021 09:59:29 - INFO - __main__ - Step 90547: {'lr': 0.0001740288095828629, 'samples': 17385024, 'steps': 90546, 'loss/train': 1.3467791080474854} 11/07/2021 09:59:30 - INFO - __main__ - Step 90548: {'lr': 0.00017402375381795666, 'samples': 17385216, 'steps': 90547, 'loss/train': 1.2626464366912842} 11/07/2021 09:59:30 - INFO - __main__ - Step 90549: {'lr': 0.0001740186980872841, 'samples': 17385408, 'steps': 90548, 'loss/train': 1.4858639240264893} 11/07/2021 09:59:31 - INFO - __main__ - Step 90550: {'lr': 0.00017401364239084754, 'samples': 17385600, 'steps': 90549, 'loss/train': 1.4271320104599} 11/07/2021 09:59:31 - INFO - __main__ - Step 90551: {'lr': 0.00017400858672864927, 'samples': 17385792, 'steps': 90550, 'loss/train': 5.086771488189697} 11/07/2021 09:59:32 - INFO - __main__ - Step 90552: {'lr': 0.00017400353110069155, 'samples': 17385984, 'steps': 90551, 'loss/train': 1.8003394603729248} 11/07/2021 09:59:32 - INFO - __main__ - Step 90553: {'lr': 0.00017399847550697667, 'samples': 17386176, 'steps': 90552, 'loss/train': 1.2686676979064941} 11/07/2021 09:59:32 - INFO - __main__ - Step 90554: {'lr': 0.00017399341994750692, 'samples': 17386368, 'steps': 90553, 'loss/train': 1.879335880279541} 11/07/2021 09:59:33 - INFO - __main__ - Step 90555: {'lr': 0.00017398836442228461, 'samples': 17386560, 'steps': 90554, 'loss/train': 1.5916584730148315} 11/07/2021 09:59:34 - INFO - __main__ - Step 90556: {'lr': 0.00017398330893131193, 'samples': 17386752, 'steps': 90555, 'loss/train': 1.6186187267303467} 11/07/2021 09:59:34 - INFO - __main__ - Step 90557: {'lr': 0.00017397825347459118, 'samples': 17386944, 'steps': 90556, 'loss/train': 1.2285451889038086} 11/07/2021 09:59:35 - INFO - __main__ - Step 90558: {'lr': 0.00017397319805212465, 'samples': 17387136, 'steps': 90557, 'loss/train': 1.2909905910491943} 11/07/2021 09:59:35 - INFO - __main__ - Step 90559: {'lr': 0.00017396814266391463, 'samples': 17387328, 'steps': 90558, 'loss/train': 1.0013152360916138} 11/07/2021 09:59:35 - INFO - __main__ - Step 90560: {'lr': 0.00017396308730996342, 'samples': 17387520, 'steps': 90559, 'loss/train': 1.3170640468597412} 11/07/2021 09:59:36 - INFO - __main__ - Step 90561: {'lr': 0.00017395803199027324, 'samples': 17387712, 'steps': 90560, 'loss/train': 0.5490167140960693} 11/07/2021 09:59:37 - INFO - __main__ - Step 90562: {'lr': 0.0001739529767048464, 'samples': 17387904, 'steps': 90561, 'loss/train': 1.5598641633987427} 11/07/2021 09:59:37 - INFO - __main__ - Step 90563: {'lr': 0.00017394792145368514, 'samples': 17388096, 'steps': 90562, 'loss/train': 1.467996597290039} 11/07/2021 09:59:37 - INFO - __main__ - Step 90564: {'lr': 0.00017394286623679183, 'samples': 17388288, 'steps': 90563, 'loss/train': 1.958892583847046} 11/07/2021 09:59:38 - INFO - __main__ - Step 90565: {'lr': 0.00017393781105416866, 'samples': 17388480, 'steps': 90564, 'loss/train': 1.9691778421401978} 11/07/2021 09:59:39 - INFO - __main__ - Step 90566: {'lr': 0.00017393275590581793, 'samples': 17388672, 'steps': 90565, 'loss/train': 1.375064730644226} 11/07/2021 09:59:39 - INFO - __main__ - Step 90567: {'lr': 0.00017392770079174198, 'samples': 17388864, 'steps': 90566, 'loss/train': 1.1557198762893677} 11/07/2021 09:59:39 - INFO - __main__ - Step 90568: {'lr': 0.00017392264571194297, 'samples': 17389056, 'steps': 90567, 'loss/train': 1.4001061916351318} 11/07/2021 09:59:40 - INFO - __main__ - Step 90569: {'lr': 0.00017391759066642332, 'samples': 17389248, 'steps': 90568, 'loss/train': 1.5010849237442017} 11/07/2021 09:59:40 - INFO - __main__ - Step 90570: {'lr': 0.00017391253565518522, 'samples': 17389440, 'steps': 90569, 'loss/train': 1.618923306465149} 11/07/2021 09:59:42 - INFO - __main__ - Step 90571: {'lr': 0.00017390748067823092, 'samples': 17389632, 'steps': 90570, 'loss/train': 0.7211723923683167} 11/07/2021 09:59:42 - INFO - __main__ - Step 90572: {'lr': 0.00017390242573556272, 'samples': 17389824, 'steps': 90571, 'loss/train': 1.3793551921844482} 11/07/2021 09:59:42 - INFO - __main__ - Step 90573: {'lr': 0.00017389737082718293, 'samples': 17390016, 'steps': 90572, 'loss/train': 1.4576236009597778} 11/07/2021 09:59:43 - INFO - __main__ - Step 90574: {'lr': 0.0001738923159530938, 'samples': 17390208, 'steps': 90573, 'loss/train': 1.499133586883545} 11/07/2021 09:59:43 - INFO - __main__ - Step 90575: {'lr': 0.0001738872611132976, 'samples': 17390400, 'steps': 90574, 'loss/train': 1.332383155822754} 11/07/2021 09:59:43 - INFO - __main__ - Step 90576: {'lr': 0.00017388220630779665, 'samples': 17390592, 'steps': 90575, 'loss/train': 0.4300929009914398} 11/07/2021 09:59:44 - INFO - __main__ - Step 90577: {'lr': 0.0001738771515365932, 'samples': 17390784, 'steps': 90576, 'loss/train': 0.5340887904167175} 11/07/2021 09:59:45 - INFO - __main__ - Step 90578: {'lr': 0.00017387209679968954, 'samples': 17390976, 'steps': 90577, 'loss/train': 0.8984358310699463} 11/07/2021 09:59:45 - INFO - __main__ - Step 90579: {'lr': 0.00017386704209708794, 'samples': 17391168, 'steps': 90578, 'loss/train': 1.4331235885620117} 11/07/2021 09:59:45 - INFO - __main__ - Step 90580: {'lr': 0.00017386198742879068, 'samples': 17391360, 'steps': 90579, 'loss/train': 1.425118327140808} 11/07/2021 09:59:46 - INFO - __main__ - Step 90581: {'lr': 0.0001738569327948, 'samples': 17391552, 'steps': 90580, 'loss/train': 1.219084620475769} 11/07/2021 09:59:47 - INFO - __main__ - Step 90582: {'lr': 0.00017385187819511834, 'samples': 17391744, 'steps': 90581, 'loss/train': 1.0437098741531372} 11/07/2021 09:59:47 - INFO - __main__ - Step 90583: {'lr': 0.00017384682362974775, 'samples': 17391936, 'steps': 90582, 'loss/train': 1.3110811710357666} 11/07/2021 09:59:48 - INFO - __main__ - Step 90584: {'lr': 0.00017384176909869057, 'samples': 17392128, 'steps': 90583, 'loss/train': 1.1585700511932373} 11/07/2021 09:59:48 - INFO - __main__ - Step 90585: {'lr': 0.00017383671460194914, 'samples': 17392320, 'steps': 90584, 'loss/train': 1.112931489944458} 11/07/2021 09:59:48 - INFO - __main__ - Step 90586: {'lr': 0.0001738316601395257, 'samples': 17392512, 'steps': 90585, 'loss/train': 1.446710228919983} 11/07/2021 09:59:49 - INFO - __main__ - Step 90587: {'lr': 0.00017382660571142256, 'samples': 17392704, 'steps': 90586, 'loss/train': 0.18200458586215973} 11/07/2021 09:59:50 - INFO - __main__ - Step 90588: {'lr': 0.00017382155131764193, 'samples': 17392896, 'steps': 90587, 'loss/train': 0.13871411979198456} 11/07/2021 09:59:50 - INFO - __main__ - Step 90589: {'lr': 0.0001738164969581862, 'samples': 17393088, 'steps': 90588, 'loss/train': 0.43809568881988525} 11/07/2021 09:59:51 - INFO - __main__ - Step 90590: {'lr': 0.00017381144263305755, 'samples': 17393280, 'steps': 90589, 'loss/train': 1.0771986246109009} 11/07/2021 09:59:51 - INFO - __main__ - Step 90591: {'lr': 0.00017380638834225826, 'samples': 17393472, 'steps': 90590, 'loss/train': 1.4521089792251587} 11/07/2021 09:59:51 - INFO - __main__ - Step 90592: {'lr': 0.00017380133408579067, 'samples': 17393664, 'steps': 90591, 'loss/train': 1.4139817953109741} 11/07/2021 09:59:52 - INFO - __main__ - Step 90593: {'lr': 0.000173796279863657, 'samples': 17393856, 'steps': 90592, 'loss/train': 1.5602843761444092} 11/07/2021 09:59:53 - INFO - __main__ - Step 90594: {'lr': 0.00017379122567585958, 'samples': 17394048, 'steps': 90593, 'loss/train': 1.3496384620666504} 11/07/2021 09:59:53 - INFO - __main__ - Step 90595: {'lr': 0.00017378617152240063, 'samples': 17394240, 'steps': 90594, 'loss/train': 1.0167865753173828} 11/07/2021 09:59:54 - INFO - __main__ - Step 90596: {'lr': 0.00017378111740328257, 'samples': 17394432, 'steps': 90595, 'loss/train': 0.9194519519805908} 11/07/2021 09:59:54 - INFO - __main__ - Step 90597: {'lr': 0.00017377606331850747, 'samples': 17394624, 'steps': 90596, 'loss/train': 1.378090500831604} 11/07/2021 09:59:55 - INFO - __main__ - Step 90598: {'lr': 0.0001737710092680777, 'samples': 17394816, 'steps': 90597, 'loss/train': 1.2577810287475586} 11/07/2021 09:59:55 - INFO - __main__ - Step 90599: {'lr': 0.00017376595525199552, 'samples': 17395008, 'steps': 90598, 'loss/train': 1.2533870935440063} 11/07/2021 09:59:56 - INFO - __main__ - Step 90600: {'lr': 0.00017376090127026322, 'samples': 17395200, 'steps': 90599, 'loss/train': 1.251420259475708} 11/07/2021 09:59:56 - INFO - __main__ - Step 90601: {'lr': 0.00017375584732288307, 'samples': 17395392, 'steps': 90600, 'loss/train': 1.2668795585632324} 11/07/2021 09:59:56 - INFO - __main__ - Step 90602: {'lr': 0.0001737507934098574, 'samples': 17395584, 'steps': 90601, 'loss/train': 1.000494122505188} 11/07/2021 09:59:57 - INFO - __main__ - Step 90603: {'lr': 0.00017374573953118843, 'samples': 17395776, 'steps': 90602, 'loss/train': 1.0749248266220093} 11/07/2021 09:59:58 - INFO - __main__ - Step 90604: {'lr': 0.00017374068568687845, 'samples': 17395968, 'steps': 90603, 'loss/train': 1.4759730100631714} 11/07/2021 09:59:58 - INFO - __main__ - Step 90605: {'lr': 0.00017373563187692974, 'samples': 17396160, 'steps': 90604, 'loss/train': 0.9860820770263672} 11/07/2021 09:59:58 - INFO - __main__ - Step 90606: {'lr': 0.00017373057810134458, 'samples': 17396352, 'steps': 90605, 'loss/train': 1.3364979028701782} 11/07/2021 09:59:59 - INFO - __main__ - Step 90607: {'lr': 0.00017372552436012523, 'samples': 17396544, 'steps': 90606, 'loss/train': 1.4004936218261719} 11/07/2021 09:59:59 - INFO - __main__ - Step 90608: {'lr': 0.00017372047065327401, 'samples': 17396736, 'steps': 90607, 'loss/train': 1.324008822441101} 11/07/2021 10:00:00 - INFO - __main__ - Step 90609: {'lr': 0.00017371541698079325, 'samples': 17396928, 'steps': 90608, 'loss/train': 1.6429717540740967} 11/07/2021 10:00:00 - INFO - __main__ - Step 90610: {'lr': 0.00017371036334268503, 'samples': 17397120, 'steps': 90609, 'loss/train': 1.4410849809646606} 11/07/2021 10:00:01 - INFO - __main__ - Step 90611: {'lr': 0.00017370530973895176, 'samples': 17397312, 'steps': 90610, 'loss/train': 1.4405300617218018} 11/07/2021 10:00:01 - INFO - __main__ - Step 90612: {'lr': 0.00017370025616959573, 'samples': 17397504, 'steps': 90611, 'loss/train': 1.2611310482025146} 11/07/2021 10:00:02 - INFO - __main__ - Step 90613: {'lr': 0.00017369520263461912, 'samples': 17397696, 'steps': 90612, 'loss/train': 1.3077304363250732} 11/07/2021 10:00:03 - INFO - __main__ - Step 90614: {'lr': 0.00017369014913402433, 'samples': 17397888, 'steps': 90613, 'loss/train': 0.7513828873634338} 11/07/2021 10:00:03 - INFO - __main__ - Step 90615: {'lr': 0.0001736850956678136, 'samples': 17398080, 'steps': 90614, 'loss/train': 1.0612590312957764} 11/07/2021 10:00:03 - INFO - __main__ - Step 90616: {'lr': 0.00017368004223598912, 'samples': 17398272, 'steps': 90615, 'loss/train': 1.3316253423690796} 11/07/2021 10:00:04 - INFO - __main__ - Step 90617: {'lr': 0.00017367498883855327, 'samples': 17398464, 'steps': 90616, 'loss/train': 0.6596760153770447} 11/07/2021 10:00:04 - INFO - __main__ - Step 90618: {'lr': 0.0001736699354755083, 'samples': 17398656, 'steps': 90617, 'loss/train': 1.2972320318222046} 11/07/2021 10:00:05 - INFO - __main__ - Step 90619: {'lr': 0.00017366488214685648, 'samples': 17398848, 'steps': 90618, 'loss/train': 1.6383585929870605} 11/07/2021 10:00:05 - INFO - __main__ - Step 90620: {'lr': 0.00017365982885260008, 'samples': 17399040, 'steps': 90619, 'loss/train': 1.380651593208313} 11/07/2021 10:00:06 - INFO - __main__ - Step 90621: {'lr': 0.00017365477559274135, 'samples': 17399232, 'steps': 90620, 'loss/train': 1.3091838359832764} 11/07/2021 10:00:06 - INFO - __main__ - Step 90622: {'lr': 0.00017364972236728267, 'samples': 17399424, 'steps': 90621, 'loss/train': 1.7426015138626099} 11/07/2021 10:00:06 - INFO - __main__ - Step 90623: {'lr': 0.0001736446691762263, 'samples': 17399616, 'steps': 90622, 'loss/train': 1.3611935377120972} 11/07/2021 10:00:07 - INFO - __main__ - Step 90624: {'lr': 0.00017363961601957434, 'samples': 17399808, 'steps': 90623, 'loss/train': 1.593830943107605} 11/07/2021 10:00:08 - INFO - __main__ - Step 90625: {'lr': 0.00017363456289732924, 'samples': 17400000, 'steps': 90624, 'loss/train': 1.351249098777771} 11/07/2021 10:00:08 - INFO - __main__ - Step 90626: {'lr': 0.00017362950980949322, 'samples': 17400192, 'steps': 90625, 'loss/train': 1.4949454069137573} 11/07/2021 10:00:09 - INFO - __main__ - Step 90627: {'lr': 0.00017362445675606853, 'samples': 17400384, 'steps': 90626, 'loss/train': 1.36006498336792} 11/07/2021 10:00:09 - INFO - __main__ - Step 90628: {'lr': 0.0001736194037370575, 'samples': 17400576, 'steps': 90627, 'loss/train': 1.4519051313400269} 11/07/2021 10:00:09 - INFO - __main__ - Step 90629: {'lr': 0.00017361435075246242, 'samples': 17400768, 'steps': 90628, 'loss/train': 1.5024052858352661} 11/07/2021 10:00:10 - INFO - __main__ - Step 90630: {'lr': 0.00017360929780228546, 'samples': 17400960, 'steps': 90629, 'loss/train': 1.7294961214065552} 11/07/2021 10:00:11 - INFO - __main__ - Step 90631: {'lr': 0.00017360424488652905, 'samples': 17401152, 'steps': 90630, 'loss/train': 0.9042702913284302} 11/07/2021 10:00:11 - INFO - __main__ - Step 90632: {'lr': 0.00017359919200519536, 'samples': 17401344, 'steps': 90631, 'loss/train': 1.190027117729187} 11/07/2021 10:00:11 - INFO - __main__ - Step 90633: {'lr': 0.00017359413915828668, 'samples': 17401536, 'steps': 90632, 'loss/train': 1.6062027215957642} 11/07/2021 10:00:12 - INFO - __main__ - Step 90634: {'lr': 0.0001735890863458053, 'samples': 17401728, 'steps': 90633, 'loss/train': 1.244301676750183} 11/07/2021 10:00:13 - INFO - __main__ - Step 90635: {'lr': 0.0001735840335677535, 'samples': 17401920, 'steps': 90634, 'loss/train': 1.3372950553894043} 11/07/2021 10:00:13 - INFO - __main__ - Step 90636: {'lr': 0.00017357898082413371, 'samples': 17402112, 'steps': 90635, 'loss/train': 1.1000646352767944} 11/07/2021 10:00:13 - INFO - __main__ - Step 90637: {'lr': 0.00017357392811494788, 'samples': 17402304, 'steps': 90636, 'loss/train': 1.7459847927093506} 11/07/2021 10:00:14 - INFO - __main__ - Step 90638: {'lr': 0.0001735688754401985, 'samples': 17402496, 'steps': 90637, 'loss/train': 1.9309501647949219} 11/07/2021 10:00:14 - INFO - __main__ - Step 90639: {'lr': 0.0001735638227998878, 'samples': 17402688, 'steps': 90638, 'loss/train': 1.7934436798095703} 11/07/2021 10:00:15 - INFO - __main__ - Step 90640: {'lr': 0.00017355877019401805, 'samples': 17402880, 'steps': 90639, 'loss/train': 1.8393688201904297} 11/07/2021 10:00:16 - INFO - __main__ - Step 90641: {'lr': 0.00017355371762259154, 'samples': 17403072, 'steps': 90640, 'loss/train': 1.333423376083374} 11/07/2021 10:00:16 - INFO - __main__ - Step 90642: {'lr': 0.00017354866508561054, 'samples': 17403264, 'steps': 90641, 'loss/train': 1.5185989141464233} 11/07/2021 10:00:16 - INFO - __main__ - Step 90643: {'lr': 0.00017354361258307735, 'samples': 17403456, 'steps': 90642, 'loss/train': 1.5474227666854858} 11/07/2021 10:00:17 - INFO - __main__ - Step 90644: {'lr': 0.00017353856011499423, 'samples': 17403648, 'steps': 90643, 'loss/train': 0.37086859345436096} 11/07/2021 10:00:18 - INFO - __main__ - Step 90645: {'lr': 0.00017353350768136344, 'samples': 17403840, 'steps': 90644, 'loss/train': 0.6902307868003845} 11/07/2021 10:00:18 - INFO - __main__ - Step 90646: {'lr': 0.00017352845528218724, 'samples': 17404032, 'steps': 90645, 'loss/train': 1.5917553901672363} 11/07/2021 10:00:19 - INFO - __main__ - Step 90647: {'lr': 0.000173523402917468, 'samples': 17404224, 'steps': 90646, 'loss/train': 1.6313021183013916} 11/07/2021 10:00:19 - INFO - __main__ - Step 90648: {'lr': 0.00017351835058720792, 'samples': 17404416, 'steps': 90647, 'loss/train': 0.8730949759483337} 11/07/2021 10:00:19 - INFO - __main__ - Step 90649: {'lr': 0.00017351329829140926, 'samples': 17404608, 'steps': 90648, 'loss/train': 0.21671165525913239} 11/07/2021 10:00:20 - INFO - __main__ - Step 90650: {'lr': 0.00017350824603007444, 'samples': 17404800, 'steps': 90649, 'loss/train': 0.9782439470291138} 11/07/2021 10:00:21 - INFO - __main__ - Step 90651: {'lr': 0.00017350319380320556, 'samples': 17404992, 'steps': 90650, 'loss/train': 1.216391682624817} 11/07/2021 10:00:21 - INFO - __main__ - Step 90652: {'lr': 0.0001734981416108049, 'samples': 17405184, 'steps': 90651, 'loss/train': 1.311216950416565} 11/07/2021 10:00:21 - INFO - __main__ - Step 90653: {'lr': 0.00017349308945287484, 'samples': 17405376, 'steps': 90652, 'loss/train': 1.870360016822815} 11/07/2021 10:00:22 - INFO - __main__ - Step 90654: {'lr': 0.0001734880373294176, 'samples': 17405568, 'steps': 90653, 'loss/train': 1.1409566402435303} 11/07/2021 10:00:22 - INFO - __main__ - Step 90655: {'lr': 0.0001734829852404355, 'samples': 17405760, 'steps': 90654, 'loss/train': 1.5216522216796875} 11/07/2021 10:00:23 - INFO - __main__ - Step 90656: {'lr': 0.00017347793318593074, 'samples': 17405952, 'steps': 90655, 'loss/train': 1.6046888828277588} 11/07/2021 10:00:24 - INFO - __main__ - Step 90657: {'lr': 0.00017347288116590566, 'samples': 17406144, 'steps': 90656, 'loss/train': 0.844950258731842} 11/07/2021 10:00:24 - INFO - __main__ - Step 90658: {'lr': 0.0001734678291803625, 'samples': 17406336, 'steps': 90657, 'loss/train': 1.2176858186721802} 11/07/2021 10:00:24 - INFO - __main__ - Step 90659: {'lr': 0.00017346277722930358, 'samples': 17406528, 'steps': 90658, 'loss/train': 1.192205548286438} 11/07/2021 10:00:25 - INFO - __main__ - Step 90660: {'lr': 0.00017345772531273117, 'samples': 17406720, 'steps': 90659, 'loss/train': 1.2895289659500122} 11/07/2021 10:00:25 - INFO - __main__ - Step 90661: {'lr': 0.00017345267343064753, 'samples': 17406912, 'steps': 90660, 'loss/train': 1.0413644313812256} 11/07/2021 10:00:26 - INFO - __main__ - Step 90662: {'lr': 0.0001734476215830549, 'samples': 17407104, 'steps': 90661, 'loss/train': 1.1408462524414062} 11/07/2021 10:00:26 - INFO - __main__ - Step 90663: {'lr': 0.00017344256976995566, 'samples': 17407296, 'steps': 90662, 'loss/train': 0.6877296566963196} 11/07/2021 10:00:27 - INFO - __main__ - Step 90664: {'lr': 0.00017343751799135196, 'samples': 17407488, 'steps': 90663, 'loss/train': 1.1988962888717651} 11/07/2021 10:00:27 - INFO - __main__ - Step 90665: {'lr': 0.00017343246624724614, 'samples': 17407680, 'steps': 90664, 'loss/train': 1.376447319984436} 11/07/2021 10:00:27 - INFO - __main__ - Step 90666: {'lr': 0.00017342741453764044, 'samples': 17407872, 'steps': 90665, 'loss/train': 1.4760528802871704} 11/07/2021 10:00:29 - INFO - __main__ - Step 90667: {'lr': 0.00017342236286253717, 'samples': 17408064, 'steps': 90666, 'loss/train': 0.7034920454025269} 11/07/2021 10:00:29 - INFO - __main__ - Step 90668: {'lr': 0.00017341731122193864, 'samples': 17408256, 'steps': 90667, 'loss/train': 1.3948924541473389} 11/07/2021 10:00:29 - INFO - __main__ - Step 90669: {'lr': 0.00017341225961584706, 'samples': 17408448, 'steps': 90668, 'loss/train': 1.8451193571090698} 11/07/2021 10:00:30 - INFO - __main__ - Step 90670: {'lr': 0.00017340720804426475, 'samples': 17408640, 'steps': 90669, 'loss/train': 1.3362243175506592} 11/07/2021 10:00:30 - INFO - __main__ - Step 90671: {'lr': 0.00017340215650719394, 'samples': 17408832, 'steps': 90670, 'loss/train': 1.1387962102890015} 11/07/2021 10:00:31 - INFO - __main__ - Step 90672: {'lr': 0.000173397105004637, 'samples': 17409024, 'steps': 90671, 'loss/train': 1.1052051782608032} 11/07/2021 10:00:31 - INFO - __main__ - Step 90673: {'lr': 0.0001733920535365961, 'samples': 17409216, 'steps': 90672, 'loss/train': 0.6782529354095459} 11/07/2021 10:00:32 - INFO - __main__ - Step 90674: {'lr': 0.00017338700210307355, 'samples': 17409408, 'steps': 90673, 'loss/train': 1.083193063735962} 11/07/2021 10:00:32 - INFO - __main__ - Step 90675: {'lr': 0.00017338195070407163, 'samples': 17409600, 'steps': 90674, 'loss/train': 1.0246601104736328} 11/07/2021 10:00:33 - INFO - __main__ - Step 90676: {'lr': 0.00017337689933959267, 'samples': 17409792, 'steps': 90675, 'loss/train': 1.5633021593093872} 11/07/2021 10:00:33 - INFO - __main__ - Step 90677: {'lr': 0.00017337184800963887, 'samples': 17409984, 'steps': 90676, 'loss/train': 0.12659768760204315} 11/07/2021 10:00:34 - INFO - __main__ - Step 90678: {'lr': 0.00017336679671421253, 'samples': 17410176, 'steps': 90677, 'loss/train': 1.2979085445404053} 11/07/2021 10:00:34 - INFO - __main__ - Step 90679: {'lr': 0.0001733617454533159, 'samples': 17410368, 'steps': 90678, 'loss/train': 0.9642648100852966} 11/07/2021 10:00:35 - INFO - __main__ - Step 90680: {'lr': 0.0001733566942269513, 'samples': 17410560, 'steps': 90679, 'loss/train': 1.1308503150939941} 11/07/2021 10:00:35 - INFO - __main__ - Step 90681: {'lr': 0.000173351643035121, 'samples': 17410752, 'steps': 90680, 'loss/train': 1.4554238319396973} 11/07/2021 10:00:35 - INFO - __main__ - Step 90682: {'lr': 0.00017334659187782724, 'samples': 17410944, 'steps': 90681, 'loss/train': 0.2631327509880066} 11/07/2021 10:00:36 - INFO - __main__ - Step 90683: {'lr': 0.00017334154075507243, 'samples': 17411136, 'steps': 90682, 'loss/train': 0.9055803418159485} 11/07/2021 10:00:37 - INFO - __main__ - Step 90684: {'lr': 0.0001733364896668586, 'samples': 17411328, 'steps': 90683, 'loss/train': 1.1888678073883057} 11/07/2021 10:00:37 - INFO - __main__ - Step 90685: {'lr': 0.00017333143861318823, 'samples': 17411520, 'steps': 90684, 'loss/train': 1.4884130954742432} 11/07/2021 10:00:37 - INFO - __main__ - Step 90686: {'lr': 0.00017332638759406355, 'samples': 17411712, 'steps': 90685, 'loss/train': 0.1796979159116745} 11/07/2021 10:00:38 - INFO - __main__ - Step 90687: {'lr': 0.00017332133660948677, 'samples': 17411904, 'steps': 90686, 'loss/train': 1.3094838857650757} 11/07/2021 10:00:39 - INFO - __main__ - Step 90688: {'lr': 0.00017331628565946022, 'samples': 17412096, 'steps': 90687, 'loss/train': 1.462243676185608} 11/07/2021 10:00:39 - INFO - __main__ - Step 90689: {'lr': 0.00017331123474398618, 'samples': 17412288, 'steps': 90688, 'loss/train': 1.197777271270752} 11/07/2021 10:00:39 - INFO - __main__ - Step 90690: {'lr': 0.00017330618386306697, 'samples': 17412480, 'steps': 90689, 'loss/train': 1.3545430898666382} 11/07/2021 10:00:40 - INFO - __main__ - Step 90691: {'lr': 0.00017330113301670475, 'samples': 17412672, 'steps': 90690, 'loss/train': 1.3061745166778564} 11/07/2021 10:00:40 - INFO - __main__ - Step 90692: {'lr': 0.00017329608220490185, 'samples': 17412864, 'steps': 90691, 'loss/train': 1.2722594738006592} 11/07/2021 10:00:41 - INFO - __main__ - Step 90693: {'lr': 0.00017329103142766055, 'samples': 17413056, 'steps': 90692, 'loss/train': 0.4183870255947113} 11/07/2021 10:00:42 - INFO - __main__ - Step 90694: {'lr': 0.0001732859806849832, 'samples': 17413248, 'steps': 90693, 'loss/train': 1.3011173009872437} 11/07/2021 10:00:42 - INFO - __main__ - Step 90695: {'lr': 0.00017328092997687193, 'samples': 17413440, 'steps': 90694, 'loss/train': 1.2410147190093994} 11/07/2021 10:00:42 - INFO - __main__ - Step 90696: {'lr': 0.0001732758793033291, 'samples': 17413632, 'steps': 90695, 'loss/train': 1.2417323589324951} 11/07/2021 10:00:43 - INFO - __main__ - Step 90697: {'lr': 0.00017327082866435694, 'samples': 17413824, 'steps': 90696, 'loss/train': 1.3680146932601929} 11/07/2021 10:00:43 - INFO - __main__ - Step 90698: {'lr': 0.0001732657780599578, 'samples': 17414016, 'steps': 90697, 'loss/train': 1.415935754776001} 11/07/2021 10:00:44 - INFO - __main__ - Step 90699: {'lr': 0.00017326072749013392, 'samples': 17414208, 'steps': 90698, 'loss/train': 5.7536211013793945} 11/07/2021 10:00:44 - INFO - __main__ - Step 90700: {'lr': 0.00017325567695488753, 'samples': 17414400, 'steps': 90699, 'loss/train': 0.6501580476760864} 11/07/2021 10:00:45 - INFO - __main__ - Step 90701: {'lr': 0.00017325062645422103, 'samples': 17414592, 'steps': 90700, 'loss/train': 1.4536774158477783} 11/07/2021 10:00:45 - INFO - __main__ - Step 90702: {'lr': 0.00017324557598813654, 'samples': 17414784, 'steps': 90701, 'loss/train': 1.0871772766113281} 11/07/2021 10:00:46 - INFO - __main__ - Step 90703: {'lr': 0.00017324052555663647, 'samples': 17414976, 'steps': 90702, 'loss/train': 1.8342255353927612} 11/07/2021 10:00:46 - INFO - __main__ - Step 90704: {'lr': 0.000173235475159723, 'samples': 17415168, 'steps': 90703, 'loss/train': 1.3575527667999268} 11/07/2021 10:00:47 - INFO - __main__ - Step 90705: {'lr': 0.00017323042479739848, 'samples': 17415360, 'steps': 90704, 'loss/train': 1.1982392072677612} 11/07/2021 10:00:47 - INFO - __main__ - Step 90706: {'lr': 0.0001732253744696651, 'samples': 17415552, 'steps': 90705, 'loss/train': 1.4643218517303467} 11/07/2021 10:00:48 - INFO - __main__ - Step 90707: {'lr': 0.00017322032417652517, 'samples': 17415744, 'steps': 90706, 'loss/train': 2.086256265640259} 11/07/2021 10:00:48 - INFO - __main__ - Step 90708: {'lr': 0.000173215273917981, 'samples': 17415936, 'steps': 90707, 'loss/train': 1.049272060394287} 11/07/2021 10:00:48 - INFO - __main__ - Step 90709: {'lr': 0.00017321022369403484, 'samples': 17416128, 'steps': 90708, 'loss/train': 2.0508079528808594} 11/07/2021 10:00:49 - INFO - __main__ - Step 90710: {'lr': 0.00017320517350468895, 'samples': 17416320, 'steps': 90709, 'loss/train': 1.2800980806350708} 11/07/2021 10:00:50 - INFO - __main__ - Step 90711: {'lr': 0.00017320012334994564, 'samples': 17416512, 'steps': 90710, 'loss/train': 1.4970018863677979} 11/07/2021 10:00:50 - INFO - __main__ - Step 90712: {'lr': 0.00017319507322980716, 'samples': 17416704, 'steps': 90711, 'loss/train': 0.8925251960754395} 11/07/2021 10:00:50 - INFO - __main__ - Step 90713: {'lr': 0.0001731900231442758, 'samples': 17416896, 'steps': 90712, 'loss/train': 1.4087014198303223} 11/07/2021 10:00:51 - INFO - __main__ - Step 90714: {'lr': 0.00017318497309335386, 'samples': 17417088, 'steps': 90713, 'loss/train': 0.9500381350517273} 11/07/2021 10:00:52 - INFO - __main__ - Step 90715: {'lr': 0.00017317992307704352, 'samples': 17417280, 'steps': 90714, 'loss/train': 1.188143253326416} 11/07/2021 10:00:52 - INFO - __main__ - Step 90716: {'lr': 0.0001731748730953472, 'samples': 17417472, 'steps': 90715, 'loss/train': 1.67879319190979} 11/07/2021 10:00:53 - INFO - __main__ - Step 90717: {'lr': 0.0001731698231482671, 'samples': 17417664, 'steps': 90716, 'loss/train': 1.9733821153640747} 11/07/2021 10:00:53 - INFO - __main__ - Step 90718: {'lr': 0.00017316477323580547, 'samples': 17417856, 'steps': 90717, 'loss/train': 1.6821919679641724} 11/07/2021 10:00:53 - INFO - __main__ - Step 90719: {'lr': 0.0001731597233579646, 'samples': 17418048, 'steps': 90718, 'loss/train': 1.7335147857666016} 11/07/2021 10:00:54 - INFO - __main__ - Step 90720: {'lr': 0.00017315467351474673, 'samples': 17418240, 'steps': 90719, 'loss/train': 1.3215668201446533} 11/07/2021 10:00:55 - INFO - __main__ - Step 90721: {'lr': 0.00017314962370615423, 'samples': 17418432, 'steps': 90720, 'loss/train': 1.3552703857421875} 11/07/2021 10:00:55 - INFO - __main__ - Step 90722: {'lr': 0.00017314457393218928, 'samples': 17418624, 'steps': 90721, 'loss/train': 1.5124504566192627} 11/07/2021 10:00:55 - INFO - __main__ - Step 90723: {'lr': 0.0001731395241928542, 'samples': 17418816, 'steps': 90722, 'loss/train': 1.3514724969863892} 11/07/2021 10:00:56 - INFO - __main__ - Step 90724: {'lr': 0.00017313447448815127, 'samples': 17419008, 'steps': 90723, 'loss/train': 1.1531180143356323} 11/07/2021 10:00:56 - INFO - __main__ - Step 90725: {'lr': 0.0001731294248180828, 'samples': 17419200, 'steps': 90724, 'loss/train': 1.3818174600601196} 11/07/2021 10:00:57 - INFO - __main__ - Step 90726: {'lr': 0.000173124375182651, 'samples': 17419392, 'steps': 90725, 'loss/train': 1.6572080850601196} 11/07/2021 10:00:58 - INFO - __main__ - Step 90727: {'lr': 0.00017311932558185817, 'samples': 17419584, 'steps': 90726, 'loss/train': 1.516334056854248} 11/07/2021 10:00:58 - INFO - __main__ - Step 90728: {'lr': 0.00017311427601570656, 'samples': 17419776, 'steps': 90727, 'loss/train': 1.5141046047210693} 11/07/2021 10:00:58 - INFO - __main__ - Step 90729: {'lr': 0.0001731092264841985, 'samples': 17419968, 'steps': 90728, 'loss/train': 1.1861469745635986} 11/07/2021 10:00:59 - INFO - __main__ - Step 90730: {'lr': 0.00017310417698733631, 'samples': 17420160, 'steps': 90729, 'loss/train': 1.5250980854034424} 11/07/2021 10:01:00 - INFO - __main__ - Step 90731: {'lr': 0.00017309912752512213, 'samples': 17420352, 'steps': 90730, 'loss/train': 0.30649566650390625} 11/07/2021 10:01:00 - INFO - __main__ - Step 90732: {'lr': 0.00017309407809755828, 'samples': 17420544, 'steps': 90731, 'loss/train': 2.9248862266540527} 11/07/2021 10:01:00 - INFO - __main__ - Step 90733: {'lr': 0.00017308902870464705, 'samples': 17420736, 'steps': 90732, 'loss/train': 1.2115607261657715} 11/07/2021 10:01:01 - INFO - __main__ - Step 90734: {'lr': 0.0001730839793463907, 'samples': 17420928, 'steps': 90733, 'loss/train': 0.9838910102844238} 11/07/2021 10:01:01 - INFO - __main__ - Step 90735: {'lr': 0.00017307893002279154, 'samples': 17421120, 'steps': 90734, 'loss/train': 1.232100009918213} 11/07/2021 10:01:01 - INFO - __main__ - Step 90736: {'lr': 0.00017307388073385183, 'samples': 17421312, 'steps': 90735, 'loss/train': 1.636163353919983} 11/07/2021 10:01:03 - INFO - __main__ - Step 90737: {'lr': 0.00017306883147957382, 'samples': 17421504, 'steps': 90736, 'loss/train': 1.2023625373840332} 11/07/2021 10:01:03 - INFO - __main__ - Step 90738: {'lr': 0.00017306378225995984, 'samples': 17421696, 'steps': 90737, 'loss/train': 1.411023497581482} 11/07/2021 10:01:03 - INFO - __main__ - Step 90739: {'lr': 0.00017305873307501212, 'samples': 17421888, 'steps': 90738, 'loss/train': 1.230529546737671} 11/07/2021 10:01:04 - INFO - __main__ - Step 90740: {'lr': 0.00017305368392473293, 'samples': 17422080, 'steps': 90739, 'loss/train': 1.2491559982299805} 11/07/2021 10:01:04 - INFO - __main__ - Step 90741: {'lr': 0.0001730486348091246, 'samples': 17422272, 'steps': 90740, 'loss/train': 0.6799587607383728} 11/07/2021 10:01:05 - INFO - __main__ - Step 90742: {'lr': 0.00017304358572818934, 'samples': 17422464, 'steps': 90741, 'loss/train': 1.456525444984436} 11/07/2021 10:01:06 - INFO - __main__ - Step 90743: {'lr': 0.00017303853668192943, 'samples': 17422656, 'steps': 90742, 'loss/train': 1.4678353071212769} 11/07/2021 10:01:06 - INFO - __main__ - Step 90744: {'lr': 0.0001730334876703473, 'samples': 17422848, 'steps': 90743, 'loss/train': 0.7952456474304199} 11/07/2021 10:01:06 - INFO - __main__ - Step 90745: {'lr': 0.000173028438693445, 'samples': 17423040, 'steps': 90744, 'loss/train': 1.9751654863357544} 11/07/2021 10:01:07 - INFO - __main__ - Step 90746: {'lr': 0.00017302338975122488, 'samples': 17423232, 'steps': 90745, 'loss/train': 1.9533190727233887} 11/07/2021 10:01:07 - INFO - __main__ - Step 90747: {'lr': 0.00017301834084368923, 'samples': 17423424, 'steps': 90746, 'loss/train': 1.4359418153762817} 11/07/2021 10:01:08 - INFO - __main__ - Step 90748: {'lr': 0.00017301329197084037, 'samples': 17423616, 'steps': 90747, 'loss/train': 1.1906906366348267} 11/07/2021 10:01:08 - INFO - __main__ - Step 90749: {'lr': 0.0001730082431326805, 'samples': 17423808, 'steps': 90748, 'loss/train': 1.4094593524932861} 11/07/2021 10:01:09 - INFO - __main__ - Step 90750: {'lr': 0.0001730031943292119, 'samples': 17424000, 'steps': 90749, 'loss/train': 1.2392898797988892} 11/07/2021 10:01:09 - INFO - __main__ - Step 90751: {'lr': 0.0001729981455604369, 'samples': 17424192, 'steps': 90750, 'loss/train': 0.9715095162391663} 11/07/2021 10:01:09 - INFO - __main__ - Step 90752: {'lr': 0.00017299309682635775, 'samples': 17424384, 'steps': 90751, 'loss/train': 1.4063808917999268} 11/07/2021 10:01:10 - INFO - __main__ - Step 90753: {'lr': 0.00017298804812697672, 'samples': 17424576, 'steps': 90752, 'loss/train': 1.3910069465637207} 11/07/2021 10:01:11 - INFO - __main__ - Step 90754: {'lr': 0.00017298299946229607, 'samples': 17424768, 'steps': 90753, 'loss/train': 1.778894066810608} 11/07/2021 10:01:11 - INFO - __main__ - Step 90755: {'lr': 0.0001729779508323181, 'samples': 17424960, 'steps': 90754, 'loss/train': 0.9509474039077759} 11/07/2021 10:01:11 - INFO - __main__ - Step 90756: {'lr': 0.00017297290223704508, 'samples': 17425152, 'steps': 90755, 'loss/train': 1.4785592555999756} 11/07/2021 10:01:12 - INFO - __main__ - Step 90757: {'lr': 0.0001729678536764794, 'samples': 17425344, 'steps': 90756, 'loss/train': 1.467465877532959} 11/07/2021 10:01:13 - INFO - __main__ - Step 90758: {'lr': 0.00017296280515062312, 'samples': 17425536, 'steps': 90757, 'loss/train': 1.7039003372192383} 11/07/2021 10:01:13 - INFO - __main__ - Step 90759: {'lr': 0.0001729577566594786, 'samples': 17425728, 'steps': 90758, 'loss/train': 1.374988079071045} 11/07/2021 10:01:14 - INFO - __main__ - Step 90760: {'lr': 0.0001729527082030481, 'samples': 17425920, 'steps': 90759, 'loss/train': 1.416108250617981} 11/07/2021 10:01:14 - INFO - __main__ - Step 90761: {'lr': 0.00017294765978133396, 'samples': 17426112, 'steps': 90760, 'loss/train': 0.7640959024429321} 11/07/2021 10:01:15 - INFO - __main__ - Step 90762: {'lr': 0.00017294261139433838, 'samples': 17426304, 'steps': 90761, 'loss/train': 0.7651260495185852} 11/07/2021 10:01:15 - INFO - __main__ - Step 90763: {'lr': 0.0001729375630420637, 'samples': 17426496, 'steps': 90762, 'loss/train': 1.9851211309432983} 11/07/2021 10:01:15 - INFO - __main__ - Step 90764: {'lr': 0.00017293251472451216, 'samples': 17426688, 'steps': 90763, 'loss/train': 1.3228583335876465} 11/07/2021 10:01:16 - INFO - __main__ - Step 90765: {'lr': 0.000172927466441686, 'samples': 17426880, 'steps': 90764, 'loss/train': 1.3003246784210205} 11/07/2021 10:01:17 - INFO - __main__ - Step 90766: {'lr': 0.00017292241819358756, 'samples': 17427072, 'steps': 90765, 'loss/train': 1.3127049207687378} 11/07/2021 10:01:17 - INFO - __main__ - Step 90767: {'lr': 0.00017291736998021912, 'samples': 17427264, 'steps': 90766, 'loss/train': 1.49894118309021} 11/07/2021 10:01:17 - INFO - __main__ - Step 90768: {'lr': 0.00017291232180158289, 'samples': 17427456, 'steps': 90767, 'loss/train': 1.1872127056121826} 11/07/2021 10:01:18 - INFO - __main__ - Step 90769: {'lr': 0.00017290727365768115, 'samples': 17427648, 'steps': 90768, 'loss/train': 1.4876419305801392} 11/07/2021 10:01:19 - INFO - __main__ - Step 90770: {'lr': 0.00017290222554851626, 'samples': 17427840, 'steps': 90769, 'loss/train': 1.399706482887268} 11/07/2021 10:01:19 - INFO - __main__ - Step 90771: {'lr': 0.00017289717747409053, 'samples': 17428032, 'steps': 90770, 'loss/train': 1.19437575340271} 11/07/2021 10:01:20 - INFO - __main__ - Step 90772: {'lr': 0.00017289212943440602, 'samples': 17428224, 'steps': 90771, 'loss/train': 0.8728453516960144} 11/07/2021 10:01:20 - INFO - __main__ - Step 90773: {'lr': 0.00017288708142946513, 'samples': 17428416, 'steps': 90772, 'loss/train': 1.0992704629898071} 11/07/2021 10:01:20 - INFO - __main__ - Step 90774: {'lr': 0.00017288203345927015, 'samples': 17428608, 'steps': 90773, 'loss/train': 1.6594394445419312} 11/07/2021 10:01:21 - INFO - __main__ - Step 90775: {'lr': 0.0001728769855238233, 'samples': 17428800, 'steps': 90774, 'loss/train': 1.202199101448059} 11/07/2021 10:01:22 - INFO - __main__ - Step 90776: {'lr': 0.0001728719376231269, 'samples': 17428992, 'steps': 90775, 'loss/train': 1.3298686742782593} 11/07/2021 10:01:22 - INFO - __main__ - Step 90777: {'lr': 0.00017286688975718325, 'samples': 17429184, 'steps': 90776, 'loss/train': 1.3713064193725586} 11/07/2021 10:01:22 - INFO - __main__ - Step 90778: {'lr': 0.0001728618419259946, 'samples': 17429376, 'steps': 90777, 'loss/train': 1.4447855949401855} 11/07/2021 10:01:23 - INFO - __main__ - Step 90779: {'lr': 0.00017285679412956315, 'samples': 17429568, 'steps': 90778, 'loss/train': 1.577003836631775} 11/07/2021 10:01:24 - INFO - __main__ - Step 90780: {'lr': 0.00017285174636789125, 'samples': 17429760, 'steps': 90779, 'loss/train': 0.9263598918914795} 11/07/2021 10:01:24 - INFO - __main__ - Step 90781: {'lr': 0.00017284669864098119, 'samples': 17429952, 'steps': 90780, 'loss/train': 1.8036984205245972} 11/07/2021 10:01:24 - INFO - __main__ - Step 90782: {'lr': 0.00017284165094883522, 'samples': 17430144, 'steps': 90781, 'loss/train': 1.3942006826400757} 11/07/2021 10:01:25 - INFO - __main__ - Step 90783: {'lr': 0.00017283660329145558, 'samples': 17430336, 'steps': 90782, 'loss/train': 1.3178248405456543} 11/07/2021 10:01:25 - INFO - __main__ - Step 90784: {'lr': 0.00017283155566884473, 'samples': 17430528, 'steps': 90783, 'loss/train': 1.433727502822876} 11/07/2021 10:01:26 - INFO - __main__ - Step 90785: {'lr': 0.00017282650808100465, 'samples': 17430720, 'steps': 90784, 'loss/train': 1.3968439102172852} 11/07/2021 10:01:27 - INFO - __main__ - Step 90786: {'lr': 0.00017282146052793773, 'samples': 17430912, 'steps': 90785, 'loss/train': 1.3924627304077148} 11/07/2021 10:01:27 - INFO - __main__ - Step 90787: {'lr': 0.00017281641300964632, 'samples': 17431104, 'steps': 90786, 'loss/train': 1.511252760887146} 11/07/2021 10:01:27 - INFO - __main__ - Step 90788: {'lr': 0.00017281136552613265, 'samples': 17431296, 'steps': 90787, 'loss/train': 1.6064475774765015} 11/07/2021 10:01:28 - INFO - __main__ - Step 90789: {'lr': 0.00017280631807739893, 'samples': 17431488, 'steps': 90788, 'loss/train': 1.2144341468811035} 11/07/2021 10:01:28 - INFO - __main__ - Step 90790: {'lr': 0.00017280127066344753, 'samples': 17431680, 'steps': 90789, 'loss/train': 1.7063056230545044} 11/07/2021 10:01:29 - INFO - __main__ - Step 90791: {'lr': 0.00017279622328428068, 'samples': 17431872, 'steps': 90790, 'loss/train': 1.872795820236206} 11/07/2021 10:01:29 - INFO - __main__ - Step 90792: {'lr': 0.00017279117593990063, 'samples': 17432064, 'steps': 90791, 'loss/train': 1.5055913925170898} 11/07/2021 10:01:30 - INFO - __main__ - Step 90793: {'lr': 0.00017278612863030974, 'samples': 17432256, 'steps': 90792, 'loss/train': 1.5514427423477173} 11/07/2021 10:01:30 - INFO - __main__ - Step 90794: {'lr': 0.0001727810813555102, 'samples': 17432448, 'steps': 90793, 'loss/train': 1.2472959756851196} 11/07/2021 10:01:30 - INFO - __main__ - Step 90795: {'lr': 0.00017277603411550437, 'samples': 17432640, 'steps': 90794, 'loss/train': 1.6126645803451538} 11/07/2021 10:01:31 - INFO - __main__ - Step 90796: {'lr': 0.00017277098691029441, 'samples': 17432832, 'steps': 90795, 'loss/train': 1.474041223526001} 11/07/2021 10:01:32 - INFO - __main__ - Step 90797: {'lr': 0.0001727659397398827, 'samples': 17433024, 'steps': 90796, 'loss/train': 1.6979128122329712} 11/07/2021 10:01:32 - INFO - __main__ - Step 90798: {'lr': 0.0001727608926042714, 'samples': 17433216, 'steps': 90797, 'loss/train': 1.7845007181167603} 11/07/2021 10:01:32 - INFO - __main__ - Step 90799: {'lr': 0.00017275584550346287, 'samples': 17433408, 'steps': 90798, 'loss/train': 1.208871841430664} 11/07/2021 10:01:33 - INFO - __main__ - Step 90800: {'lr': 0.0001727507984374594, 'samples': 17433600, 'steps': 90799, 'loss/train': 1.1695481538772583} 11/07/2021 10:01:34 - INFO - __main__ - Step 90801: {'lr': 0.00017274575140626317, 'samples': 17433792, 'steps': 90800, 'loss/train': 1.9767085313796997} 11/07/2021 10:01:34 - INFO - __main__ - Step 90802: {'lr': 0.00017274070440987654, 'samples': 17433984, 'steps': 90801, 'loss/train': 1.110766053199768} 11/07/2021 10:01:35 - INFO - __main__ - Step 90803: {'lr': 0.00017273565744830172, 'samples': 17434176, 'steps': 90802, 'loss/train': 1.172705054283142} 11/07/2021 10:01:35 - INFO - __main__ - Step 90804: {'lr': 0.00017273061052154107, 'samples': 17434368, 'steps': 90803, 'loss/train': 0.4196327030658722} 11/07/2021 10:01:35 - INFO - __main__ - Step 90805: {'lr': 0.00017272556362959678, 'samples': 17434560, 'steps': 90804, 'loss/train': 1.3150558471679688} 11/07/2021 10:01:36 - INFO - __main__ - Step 90806: {'lr': 0.00017272051677247124, 'samples': 17434752, 'steps': 90805, 'loss/train': 0.7285857200622559} 11/07/2021 10:01:37 - INFO - __main__ - Step 90807: {'lr': 0.00017271546995016658, 'samples': 17434944, 'steps': 90806, 'loss/train': 1.5854607820510864} 11/07/2021 10:01:37 - INFO - __main__ - Step 90808: {'lr': 0.00017271042316268514, 'samples': 17435136, 'steps': 90807, 'loss/train': 0.992493748664856} 11/07/2021 10:01:37 - INFO - __main__ - Step 90809: {'lr': 0.00017270537641002917, 'samples': 17435328, 'steps': 90808, 'loss/train': 1.7435128688812256} 11/07/2021 10:01:38 - INFO - __main__ - Step 90810: {'lr': 0.00017270032969220097, 'samples': 17435520, 'steps': 90809, 'loss/train': 1.3180991411209106} 11/07/2021 10:01:38 - INFO - __main__ - Step 90811: {'lr': 0.0001726952830092029, 'samples': 17435712, 'steps': 90810, 'loss/train': 2.194681406021118} 11/07/2021 10:01:39 - INFO - __main__ - Step 90812: {'lr': 0.00017269023636103703, 'samples': 17435904, 'steps': 90811, 'loss/train': 1.4603570699691772} 11/07/2021 10:01:39 - INFO - __main__ - Step 90813: {'lr': 0.0001726851897477058, 'samples': 17436096, 'steps': 90812, 'loss/train': 1.322041630744934} 11/07/2021 10:01:40 - INFO - __main__ - Step 90814: {'lr': 0.00017268014316921138, 'samples': 17436288, 'steps': 90813, 'loss/train': 0.8714263439178467} 11/07/2021 10:01:40 - INFO - __main__ - Step 90815: {'lr': 0.00017267509662555614, 'samples': 17436480, 'steps': 90814, 'loss/train': 1.9522182941436768} 11/07/2021 10:01:40 - INFO - __main__ - Step 90816: {'lr': 0.0001726700501167423, 'samples': 17436672, 'steps': 90815, 'loss/train': 1.2854210138320923} 11/07/2021 10:01:41 - INFO - __main__ - Step 90817: {'lr': 0.00017266500364277216, 'samples': 17436864, 'steps': 90816, 'loss/train': 0.960962176322937} 11/07/2021 10:01:42 - INFO - __main__ - Step 90818: {'lr': 0.00017265995720364797, 'samples': 17437056, 'steps': 90817, 'loss/train': 1.3641982078552246} 11/07/2021 10:01:42 - INFO - __main__ - Step 90819: {'lr': 0.00017265491079937196, 'samples': 17437248, 'steps': 90818, 'loss/train': 1.5403975248336792} 11/07/2021 10:01:42 - INFO - __main__ - Step 90820: {'lr': 0.00017264986442994652, 'samples': 17437440, 'steps': 90819, 'loss/train': 0.8958832025527954} 11/07/2021 10:01:43 - INFO - __main__ - Step 90821: {'lr': 0.0001726448180953738, 'samples': 17437632, 'steps': 90820, 'loss/train': 1.1982953548431396} 11/07/2021 10:01:44 - INFO - __main__ - Step 90822: {'lr': 0.00017263977179565615, 'samples': 17437824, 'steps': 90821, 'loss/train': 1.1860450506210327} 11/07/2021 10:01:44 - INFO - __main__ - Step 90823: {'lr': 0.00017263472553079583, 'samples': 17438016, 'steps': 90822, 'loss/train': 1.8761870861053467} 11/07/2021 10:01:45 - INFO - __main__ - Step 90824: {'lr': 0.00017262967930079516, 'samples': 17438208, 'steps': 90823, 'loss/train': 1.0317442417144775} 11/07/2021 10:01:45 - INFO - __main__ - Step 90825: {'lr': 0.0001726246331056563, 'samples': 17438400, 'steps': 90824, 'loss/train': 1.497603178024292} 11/07/2021 10:01:45 - INFO - __main__ - Step 90826: {'lr': 0.0001726195869453816, 'samples': 17438592, 'steps': 90825, 'loss/train': 1.2351367473602295} 11/07/2021 10:01:46 - INFO - __main__ - Step 90827: {'lr': 0.0001726145408199733, 'samples': 17438784, 'steps': 90826, 'loss/train': 1.6945151090621948} 11/07/2021 10:01:47 - INFO - __main__ - Step 90828: {'lr': 0.00017260949472943377, 'samples': 17438976, 'steps': 90827, 'loss/train': 1.1156526803970337} 11/07/2021 10:01:47 - INFO - __main__ - Step 90829: {'lr': 0.00017260444867376514, 'samples': 17439168, 'steps': 90828, 'loss/train': 1.1120020151138306} 11/07/2021 10:01:47 - INFO - __main__ - Step 90830: {'lr': 0.00017259940265296976, 'samples': 17439360, 'steps': 90829, 'loss/train': 1.3474242687225342} 11/07/2021 10:01:48 - INFO - __main__ - Step 90831: {'lr': 0.00017259435666704988, 'samples': 17439552, 'steps': 90830, 'loss/train': 1.1506386995315552} 11/07/2021 10:01:49 - INFO - __main__ - Step 90832: {'lr': 0.0001725893107160078, 'samples': 17439744, 'steps': 90831, 'loss/train': 1.332177996635437} 11/07/2021 10:01:49 - INFO - __main__ - Step 90833: {'lr': 0.0001725842647998458, 'samples': 17439936, 'steps': 90832, 'loss/train': 1.3609561920166016} 11/07/2021 10:01:50 - INFO - __main__ - Step 90834: {'lr': 0.0001725792189185661, 'samples': 17440128, 'steps': 90833, 'loss/train': 0.6154439449310303} 11/07/2021 10:01:50 - INFO - __main__ - Step 90835: {'lr': 0.00017257417307217103, 'samples': 17440320, 'steps': 90834, 'loss/train': 1.625065565109253} 11/07/2021 10:01:50 - INFO - __main__ - Step 90836: {'lr': 0.00017256912726066283, 'samples': 17440512, 'steps': 90835, 'loss/train': 1.2600696086883545} 11/07/2021 10:01:52 - INFO - __main__ - Step 90837: {'lr': 0.0001725640814840438, 'samples': 17440704, 'steps': 90836, 'loss/train': 1.8357529640197754} 11/07/2021 10:01:52 - INFO - __main__ - Step 90838: {'lr': 0.00017255903574231625, 'samples': 17440896, 'steps': 90837, 'loss/train': 1.286036729812622} 11/07/2021 10:01:52 - INFO - __main__ - Step 90839: {'lr': 0.0001725539900354824, 'samples': 17441088, 'steps': 90838, 'loss/train': 1.3119767904281616} 11/07/2021 10:01:53 - INFO - __main__ - Step 90840: {'lr': 0.00017254894436354447, 'samples': 17441280, 'steps': 90839, 'loss/train': 0.8562422394752502} 11/07/2021 10:01:53 - INFO - __main__ - Step 90841: {'lr': 0.00017254389872650477, 'samples': 17441472, 'steps': 90840, 'loss/train': 1.1481537818908691} 11/07/2021 10:01:54 - INFO - __main__ - Step 90842: {'lr': 0.00017253885312436563, 'samples': 17441664, 'steps': 90841, 'loss/train': 1.4900176525115967} 11/07/2021 10:01:55 - INFO - __main__ - Step 90843: {'lr': 0.00017253380755712926, 'samples': 17441856, 'steps': 90842, 'loss/train': 1.0830594301223755} 11/07/2021 10:01:55 - INFO - __main__ - Step 90844: {'lr': 0.000172528762024798, 'samples': 17442048, 'steps': 90843, 'loss/train': 1.2909783124923706} 11/07/2021 10:01:55 - INFO - __main__ - Step 90845: {'lr': 0.00017252371652737408, 'samples': 17442240, 'steps': 90844, 'loss/train': 1.098681926727295} 11/07/2021 10:01:56 - INFO - __main__ - Step 90846: {'lr': 0.00017251867106485974, 'samples': 17442432, 'steps': 90845, 'loss/train': 1.3717129230499268} 11/07/2021 10:01:56 - INFO - __main__ - Step 90847: {'lr': 0.0001725136256372573, 'samples': 17442624, 'steps': 90846, 'loss/train': 1.6369645595550537} 11/07/2021 10:01:57 - INFO - __main__ - Step 90848: {'lr': 0.00017250858024456906, 'samples': 17442816, 'steps': 90847, 'loss/train': 1.6657596826553345} 11/07/2021 10:01:57 - INFO - __main__ - Step 90849: {'lr': 0.00017250353488679725, 'samples': 17443008, 'steps': 90848, 'loss/train': 1.0769915580749512} 11/07/2021 10:01:58 - INFO - __main__ - Step 90850: {'lr': 0.0001724984895639441, 'samples': 17443200, 'steps': 90849, 'loss/train': 1.0904229879379272} 11/07/2021 10:01:58 - INFO - __main__ - Step 90851: {'lr': 0.0001724934442760121, 'samples': 17443392, 'steps': 90850, 'loss/train': 1.4356194734573364} 11/07/2021 10:01:58 - INFO - __main__ - Step 90852: {'lr': 0.00017248839902300322, 'samples': 17443584, 'steps': 90851, 'loss/train': 1.3066388368606567} 11/07/2021 10:02:00 - INFO - __main__ - Step 90853: {'lr': 0.00017248335380491987, 'samples': 17443776, 'steps': 90852, 'loss/train': 1.0872530937194824} 11/07/2021 10:02:00 - INFO - __main__ - Step 90854: {'lr': 0.00017247830862176435, 'samples': 17443968, 'steps': 90853, 'loss/train': 1.190559983253479} 11/07/2021 10:02:00 - INFO - __main__ - Step 90855: {'lr': 0.00017247326347353886, 'samples': 17444160, 'steps': 90854, 'loss/train': 1.2222284078598022} 11/07/2021 10:02:01 - INFO - __main__ - Step 90856: {'lr': 0.0001724682183602458, 'samples': 17444352, 'steps': 90855, 'loss/train': 1.6910934448242188} 11/07/2021 10:02:01 - INFO - __main__ - Step 90857: {'lr': 0.0001724631732818873, 'samples': 17444544, 'steps': 90856, 'loss/train': 0.41471222043037415} 11/07/2021 10:02:02 - INFO - __main__ - Step 90858: {'lr': 0.0001724581282384657, 'samples': 17444736, 'steps': 90857, 'loss/train': 1.2440967559814453} 11/07/2021 10:02:02 - INFO - __main__ - Step 90859: {'lr': 0.0001724530832299833, 'samples': 17444928, 'steps': 90858, 'loss/train': 1.758184790611267} 11/07/2021 10:02:03 - INFO - __main__ - Step 90860: {'lr': 0.00017244803825644235, 'samples': 17445120, 'steps': 90859, 'loss/train': 1.3012131452560425} 11/07/2021 10:02:03 - INFO - __main__ - Step 90861: {'lr': 0.00017244299331784508, 'samples': 17445312, 'steps': 90860, 'loss/train': 0.5044549107551575} 11/07/2021 10:02:03 - INFO - __main__ - Step 90862: {'lr': 0.0001724379484141938, 'samples': 17445504, 'steps': 90861, 'loss/train': 1.742526650428772} 11/07/2021 10:02:04 - INFO - __main__ - Step 90863: {'lr': 0.00017243290354549082, 'samples': 17445696, 'steps': 90862, 'loss/train': 3.0855906009674072} 11/07/2021 10:02:05 - INFO - __main__ - Step 90864: {'lr': 0.00017242785871173836, 'samples': 17445888, 'steps': 90863, 'loss/train': 1.3661144971847534} 11/07/2021 10:02:05 - INFO - __main__ - Step 90865: {'lr': 0.0001724228139129388, 'samples': 17446080, 'steps': 90864, 'loss/train': 1.6335252523422241} 11/07/2021 10:02:05 - INFO - __main__ - Step 90866: {'lr': 0.00017241776914909423, 'samples': 17446272, 'steps': 90865, 'loss/train': 1.391155481338501} 11/07/2021 10:02:06 - INFO - __main__ - Step 90867: {'lr': 0.00017241272442020702, 'samples': 17446464, 'steps': 90866, 'loss/train': 1.4247639179229736} 11/07/2021 10:02:06 - INFO - __main__ - Step 90868: {'lr': 0.00017240767972627943, 'samples': 17446656, 'steps': 90867, 'loss/train': 1.5979362726211548} 11/07/2021 10:02:07 - INFO - __main__ - Step 90869: {'lr': 0.00017240263506731375, 'samples': 17446848, 'steps': 90868, 'loss/train': 1.081495761871338} 11/07/2021 10:02:08 - INFO - __main__ - Step 90870: {'lr': 0.00017239759044331227, 'samples': 17447040, 'steps': 90869, 'loss/train': 1.4421123266220093} 11/07/2021 10:02:08 - INFO - __main__ - Step 90871: {'lr': 0.00017239254585427722, 'samples': 17447232, 'steps': 90870, 'loss/train': 1.6012749671936035} 11/07/2021 10:02:08 - INFO - __main__ - Step 90872: {'lr': 0.00017238750130021087, 'samples': 17447424, 'steps': 90871, 'loss/train': 1.183478593826294} 11/07/2021 10:02:09 - INFO - __main__ - Step 90873: {'lr': 0.0001723824567811155, 'samples': 17447616, 'steps': 90872, 'loss/train': 1.2341409921646118} 11/07/2021 10:02:10 - INFO - __main__ - Step 90874: {'lr': 0.00017237741229699343, 'samples': 17447808, 'steps': 90873, 'loss/train': 0.9778850078582764} 11/07/2021 10:02:10 - INFO - __main__ - Step 90875: {'lr': 0.00017237236784784692, 'samples': 17448000, 'steps': 90874, 'loss/train': 1.4554327726364136} 11/07/2021 10:02:10 - INFO - __main__ - Step 90876: {'lr': 0.00017236732343367818, 'samples': 17448192, 'steps': 90875, 'loss/train': 1.4182583093643188} 11/07/2021 10:02:11 - INFO - __main__ - Step 90877: {'lr': 0.00017236227905448955, 'samples': 17448384, 'steps': 90876, 'loss/train': 1.7735466957092285} 11/07/2021 10:02:11 - INFO - __main__ - Step 90878: {'lr': 0.00017235723471028337, 'samples': 17448576, 'steps': 90877, 'loss/train': 1.283045768737793} 11/07/2021 10:02:12 - INFO - __main__ - Step 90879: {'lr': 0.00017235219040106174, 'samples': 17448768, 'steps': 90878, 'loss/train': 1.2747507095336914} 11/07/2021 10:02:12 - INFO - __main__ - Step 90880: {'lr': 0.000172347146126827, 'samples': 17448960, 'steps': 90879, 'loss/train': 1.2139443159103394} 11/07/2021 10:02:13 - INFO - __main__ - Step 90881: {'lr': 0.00017234210188758143, 'samples': 17449152, 'steps': 90880, 'loss/train': 1.5969562530517578} 11/07/2021 10:02:13 - INFO - __main__ - Step 90882: {'lr': 0.0001723370576833273, 'samples': 17449344, 'steps': 90881, 'loss/train': 1.4307562112808228} 11/07/2021 10:02:13 - INFO - __main__ - Step 90883: {'lr': 0.00017233201351406693, 'samples': 17449536, 'steps': 90882, 'loss/train': 1.0100206136703491} 11/07/2021 10:02:14 - INFO - __main__ - Step 90884: {'lr': 0.00017232696937980252, 'samples': 17449728, 'steps': 90883, 'loss/train': 1.7131150960922241} 11/07/2021 10:02:15 - INFO - __main__ - Step 90885: {'lr': 0.00017232192528053643, 'samples': 17449920, 'steps': 90884, 'loss/train': 1.4834022521972656} 11/07/2021 10:02:15 - INFO - __main__ - Step 90886: {'lr': 0.00017231688121627082, 'samples': 17450112, 'steps': 90885, 'loss/train': 0.2915656566619873} 11/07/2021 10:02:16 - INFO - __main__ - Step 90887: {'lr': 0.00017231183718700808, 'samples': 17450304, 'steps': 90886, 'loss/train': 1.3987410068511963} 11/07/2021 10:02:16 - INFO - __main__ - Step 90888: {'lr': 0.00017230679319275039, 'samples': 17450496, 'steps': 90887, 'loss/train': 1.972109317779541} 11/07/2021 10:02:16 - INFO - __main__ - Step 90889: {'lr': 0.00017230174923350006, 'samples': 17450688, 'steps': 90888, 'loss/train': 0.8430572152137756} 11/07/2021 10:02:17 - INFO - __main__ - Step 90890: {'lr': 0.0001722967053092594, 'samples': 17450880, 'steps': 90889, 'loss/train': 1.5268373489379883} 11/07/2021 10:02:18 - INFO - __main__ - Step 90891: {'lr': 0.0001722916614200306, 'samples': 17451072, 'steps': 90890, 'loss/train': 1.02252995967865} 11/07/2021 10:02:18 - INFO - __main__ - Step 90892: {'lr': 0.0001722866175658161, 'samples': 17451264, 'steps': 90891, 'loss/train': 1.3806867599487305} 11/07/2021 10:02:18 - INFO - __main__ - Step 90893: {'lr': 0.00017228157374661796, 'samples': 17451456, 'steps': 90892, 'loss/train': 1.2943851947784424} 11/07/2021 10:02:19 - INFO - __main__ - Step 90894: {'lr': 0.00017227652996243853, 'samples': 17451648, 'steps': 90893, 'loss/train': 1.338278889656067} 11/07/2021 10:02:20 - INFO - __main__ - Step 90895: {'lr': 0.0001722714862132801, 'samples': 17451840, 'steps': 90894, 'loss/train': 1.8766802549362183} 11/07/2021 10:02:20 - INFO - __main__ - Step 90896: {'lr': 0.0001722664424991449, 'samples': 17452032, 'steps': 90895, 'loss/train': 1.6436522006988525} 11/07/2021 10:02:20 - INFO - __main__ - Step 90897: {'lr': 0.00017226139882003534, 'samples': 17452224, 'steps': 90896, 'loss/train': 1.2321720123291016} 11/07/2021 10:02:21 - INFO - __main__ - Step 90898: {'lr': 0.0001722563551759535, 'samples': 17452416, 'steps': 90897, 'loss/train': 1.3903559446334839} 11/07/2021 10:02:21 - INFO - __main__ - Step 90899: {'lr': 0.00017225131156690178, 'samples': 17452608, 'steps': 90898, 'loss/train': 0.48877912759780884} 11/07/2021 10:02:22 - INFO - __main__ - Step 90900: {'lr': 0.00017224626799288242, 'samples': 17452800, 'steps': 90899, 'loss/train': 1.5447098016738892} 11/07/2021 10:02:23 - INFO - __main__ - Step 90901: {'lr': 0.0001722412244538977, 'samples': 17452992, 'steps': 90900, 'loss/train': 1.6252814531326294} 11/07/2021 10:02:23 - INFO - __main__ - Step 90902: {'lr': 0.00017223618094994986, 'samples': 17453184, 'steps': 90901, 'loss/train': 1.5783582925796509} 11/07/2021 10:02:23 - INFO - __main__ - Step 90903: {'lr': 0.0001722311374810412, 'samples': 17453376, 'steps': 90902, 'loss/train': 1.159103512763977} 11/07/2021 10:02:24 - INFO - __main__ - Step 90904: {'lr': 0.00017222609404717403, 'samples': 17453568, 'steps': 90903, 'loss/train': 1.4469693899154663} 11/07/2021 10:02:25 - INFO - __main__ - Step 90905: {'lr': 0.00017222105064835063, 'samples': 17453760, 'steps': 90904, 'loss/train': 1.5063303709030151} 11/07/2021 10:02:25 - INFO - __main__ - Step 90906: {'lr': 0.00017221600728457314, 'samples': 17453952, 'steps': 90905, 'loss/train': 1.4199055433273315} 11/07/2021 10:02:25 - INFO - __main__ - Step 90907: {'lr': 0.00017221096395584395, 'samples': 17454144, 'steps': 90906, 'loss/train': 1.4720871448516846} 11/07/2021 10:02:26 - INFO - __main__ - Step 90908: {'lr': 0.00017220592066216527, 'samples': 17454336, 'steps': 90907, 'loss/train': 1.4246491193771362} 11/07/2021 10:02:26 - INFO - __main__ - Step 90909: {'lr': 0.0001722008774035394, 'samples': 17454528, 'steps': 90908, 'loss/train': 1.365587592124939} 11/07/2021 10:02:27 - INFO - __main__ - Step 90910: {'lr': 0.00017219583417996866, 'samples': 17454720, 'steps': 90909, 'loss/train': 0.11076612770557404} 11/07/2021 10:02:27 - INFO - __main__ - Step 90911: {'lr': 0.0001721907909914552, 'samples': 17454912, 'steps': 90910, 'loss/train': 1.5390576124191284} 11/07/2021 10:02:28 - INFO - __main__ - Step 90912: {'lr': 0.0001721857478380014, 'samples': 17455104, 'steps': 90911, 'loss/train': 2.137462854385376} 11/07/2021 10:02:28 - INFO - __main__ - Step 90913: {'lr': 0.0001721807047196095, 'samples': 17455296, 'steps': 90912, 'loss/train': 1.8041881322860718} 11/07/2021 10:02:28 - INFO - __main__ - Step 90914: {'lr': 0.00017217566163628178, 'samples': 17455488, 'steps': 90913, 'loss/train': 1.9052972793579102} 11/07/2021 10:02:29 - INFO - __main__ - Step 90915: {'lr': 0.00017217061858802051, 'samples': 17455680, 'steps': 90914, 'loss/train': 1.4482537508010864} 11/07/2021 10:02:30 - INFO - __main__ - Step 90916: {'lr': 0.00017216557557482798, 'samples': 17455872, 'steps': 90915, 'loss/train': 1.5552723407745361} 11/07/2021 10:02:30 - INFO - __main__ - Step 90917: {'lr': 0.00017216053259670638, 'samples': 17456064, 'steps': 90916, 'loss/train': 1.2293827533721924} 11/07/2021 10:02:30 - INFO - __main__ - Step 90918: {'lr': 0.0001721554896536582, 'samples': 17456256, 'steps': 90917, 'loss/train': 1.6548023223876953} 11/07/2021 10:02:31 - INFO - __main__ - Step 90919: {'lr': 0.00017215044674568543, 'samples': 17456448, 'steps': 90918, 'loss/train': 1.8483628034591675} 11/07/2021 10:02:31 - INFO - __main__ - Step 90920: {'lr': 0.00017214540387279048, 'samples': 17456640, 'steps': 90919, 'loss/train': 1.6964898109436035} 11/07/2021 10:02:32 - INFO - __main__ - Step 90921: {'lr': 0.0001721403610349756, 'samples': 17456832, 'steps': 90920, 'loss/train': 1.4016817808151245} 11/07/2021 10:02:33 - INFO - __main__ - Step 90922: {'lr': 0.00017213531823224307, 'samples': 17457024, 'steps': 90921, 'loss/train': 1.6751601696014404} 11/07/2021 10:02:33 - INFO - __main__ - Step 90923: {'lr': 0.00017213027546459517, 'samples': 17457216, 'steps': 90922, 'loss/train': 0.6986984014511108} 11/07/2021 10:02:33 - INFO - __main__ - Step 90924: {'lr': 0.0001721252327320342, 'samples': 17457408, 'steps': 90923, 'loss/train': 1.3283114433288574} 11/07/2021 10:02:34 - INFO - __main__ - Step 90925: {'lr': 0.0001721201900345623, 'samples': 17457600, 'steps': 90924, 'loss/train': 1.5021966695785522} 11/07/2021 10:02:35 - INFO - __main__ - Step 90926: {'lr': 0.00017211514737218192, 'samples': 17457792, 'steps': 90925, 'loss/train': 1.4236582517623901} 11/07/2021 10:02:35 - INFO - __main__ - Step 90927: {'lr': 0.00017211010474489524, 'samples': 17457984, 'steps': 90926, 'loss/train': 1.2559269666671753} 11/07/2021 10:02:35 - INFO - __main__ - Step 90928: {'lr': 0.00017210506215270454, 'samples': 17458176, 'steps': 90927, 'loss/train': 1.4285041093826294} 11/07/2021 10:02:36 - INFO - __main__ - Step 90929: {'lr': 0.0001721000195956121, 'samples': 17458368, 'steps': 90928, 'loss/train': 1.6959307193756104} 11/07/2021 10:02:36 - INFO - __main__ - Step 90930: {'lr': 0.0001720949770736202, 'samples': 17458560, 'steps': 90929, 'loss/train': 1.3555470705032349} 11/07/2021 10:02:37 - INFO - __main__ - Step 90931: {'lr': 0.0001720899345867311, 'samples': 17458752, 'steps': 90930, 'loss/train': 1.3654850721359253} 11/07/2021 10:02:37 - INFO - __main__ - Step 90932: {'lr': 0.00017208489213494714, 'samples': 17458944, 'steps': 90931, 'loss/train': 0.9478698968887329} 11/07/2021 10:02:38 - INFO - __main__ - Step 90933: {'lr': 0.0001720798497182704, 'samples': 17459136, 'steps': 90932, 'loss/train': 1.3136847019195557} 11/07/2021 10:02:38 - INFO - __main__ - Step 90934: {'lr': 0.00017207480733670333, 'samples': 17459328, 'steps': 90933, 'loss/train': 1.10098135471344} 11/07/2021 10:02:38 - INFO - __main__ - Step 90935: {'lr': 0.00017206976499024819, 'samples': 17459520, 'steps': 90934, 'loss/train': 1.57118558883667} 11/07/2021 10:02:39 - INFO - __main__ - Step 90936: {'lr': 0.00017206472267890713, 'samples': 17459712, 'steps': 90935, 'loss/train': 1.3465056419372559} 11/07/2021 10:02:40 - INFO - __main__ - Step 90937: {'lr': 0.00017205968040268256, 'samples': 17459904, 'steps': 90936, 'loss/train': 1.1395337581634521} 11/07/2021 10:02:40 - INFO - __main__ - Step 90938: {'lr': 0.00017205463816157666, 'samples': 17460096, 'steps': 90937, 'loss/train': 1.249803900718689} 11/07/2021 10:02:41 - INFO - __main__ - Step 90939: {'lr': 0.00017204959595559173, 'samples': 17460288, 'steps': 90938, 'loss/train': 1.0643740892410278} 11/07/2021 10:02:41 - INFO - __main__ - Step 90940: {'lr': 0.0001720445537847301, 'samples': 17460480, 'steps': 90939, 'loss/train': 1.6002814769744873} 11/07/2021 10:02:42 - INFO - __main__ - Step 90941: {'lr': 0.000172039511648994, 'samples': 17460672, 'steps': 90940, 'loss/train': 1.3517842292785645} 11/07/2021 10:02:42 - INFO - __main__ - Step 90942: {'lr': 0.00017203446954838563, 'samples': 17460864, 'steps': 90941, 'loss/train': 1.4266585111618042} 11/07/2021 10:02:43 - INFO - __main__ - Step 90943: {'lr': 0.00017202942748290734, 'samples': 17461056, 'steps': 90942, 'loss/train': 1.2232179641723633} 11/07/2021 10:02:43 - INFO - __main__ - Step 90944: {'lr': 0.00017202438545256142, 'samples': 17461248, 'steps': 90943, 'loss/train': 1.2687186002731323} 11/07/2021 10:02:43 - INFO - __main__ - Step 90945: {'lr': 0.00017201934345735013, 'samples': 17461440, 'steps': 90944, 'loss/train': 1.3558082580566406} 11/07/2021 10:02:44 - INFO - __main__ - Step 90946: {'lr': 0.00017201430149727567, 'samples': 17461632, 'steps': 90945, 'loss/train': 1.791256308555603} 11/07/2021 10:02:45 - INFO - __main__ - Step 90947: {'lr': 0.00017200925957234036, 'samples': 17461824, 'steps': 90946, 'loss/train': 1.3647009134292603} 11/07/2021 10:02:45 - INFO - __main__ - Step 90948: {'lr': 0.00017200421768254648, 'samples': 17462016, 'steps': 90947, 'loss/train': 1.467984676361084} 11/07/2021 10:02:45 - INFO - __main__ - Step 90949: {'lr': 0.00017199917582789631, 'samples': 17462208, 'steps': 90948, 'loss/train': 1.4269237518310547} 11/07/2021 10:02:46 - INFO - __main__ - Step 90950: {'lr': 0.00017199413400839208, 'samples': 17462400, 'steps': 90949, 'loss/train': 5.73760986328125} 11/07/2021 10:02:46 - INFO - __main__ - Step 90951: {'lr': 0.00017198909222403616, 'samples': 17462592, 'steps': 90950, 'loss/train': 1.2808773517608643} 11/07/2021 10:02:47 - INFO - __main__ - Step 90952: {'lr': 0.00017198405047483067, 'samples': 17462784, 'steps': 90951, 'loss/train': 1.5058916807174683} 11/07/2021 10:02:48 - INFO - __main__ - Step 90953: {'lr': 0.00017197900876077802, 'samples': 17462976, 'steps': 90952, 'loss/train': 1.5669430494308472} 11/07/2021 10:02:48 - INFO - __main__ - Step 90954: {'lr': 0.0001719739670818804, 'samples': 17463168, 'steps': 90953, 'loss/train': 0.8421223163604736} 11/07/2021 10:02:48 - INFO - __main__ - Step 90955: {'lr': 0.00017196892543814006, 'samples': 17463360, 'steps': 90954, 'loss/train': 1.4096195697784424} 11/07/2021 10:02:49 - INFO - __main__ - Step 90956: {'lr': 0.0001719638838295594, 'samples': 17463552, 'steps': 90955, 'loss/train': 1.1991313695907593} 11/07/2021 10:02:50 - INFO - __main__ - Step 90957: {'lr': 0.00017195884225614056, 'samples': 17463744, 'steps': 90956, 'loss/train': 1.3455371856689453} 11/07/2021 10:02:50 - INFO - __main__ - Step 90958: {'lr': 0.00017195380071788585, 'samples': 17463936, 'steps': 90957, 'loss/train': 1.4316151142120361} 11/07/2021 10:02:50 - INFO - __main__ - Step 90959: {'lr': 0.00017194875921479764, 'samples': 17464128, 'steps': 90958, 'loss/train': 0.9663114547729492} 11/07/2021 10:02:51 - INFO - __main__ - Step 90960: {'lr': 0.00017194371774687802, 'samples': 17464320, 'steps': 90959, 'loss/train': 1.50754714012146} 11/07/2021 10:02:51 - INFO - __main__ - Step 90961: {'lr': 0.0001719386763141294, 'samples': 17464512, 'steps': 90960, 'loss/train': 1.5605902671813965} 11/07/2021 10:02:52 - INFO - __main__ - Step 90962: {'lr': 0.00017193363491655402, 'samples': 17464704, 'steps': 90961, 'loss/train': 1.3703103065490723} 11/07/2021 10:02:53 - INFO - __main__ - Step 90963: {'lr': 0.00017192859355415413, 'samples': 17464896, 'steps': 90962, 'loss/train': 1.626806378364563} 11/07/2021 10:02:53 - INFO - __main__ - Step 90964: {'lr': 0.00017192355222693198, 'samples': 17465088, 'steps': 90963, 'loss/train': 1.3416138887405396} 11/07/2021 10:02:53 - INFO - __main__ - Step 90965: {'lr': 0.0001719185109348899, 'samples': 17465280, 'steps': 90964, 'loss/train': 1.5764638185501099} 11/07/2021 10:02:54 - INFO - __main__ - Step 90966: {'lr': 0.0001719134696780301, 'samples': 17465472, 'steps': 90965, 'loss/train': 1.5033605098724365} 11/07/2021 10:02:55 - INFO - __main__ - Step 90967: {'lr': 0.00017190842845635492, 'samples': 17465664, 'steps': 90966, 'loss/train': 1.3178800344467163} 11/07/2021 10:02:55 - INFO - __main__ - Step 90968: {'lr': 0.00017190338726986654, 'samples': 17465856, 'steps': 90967, 'loss/train': 1.2506157159805298} 11/07/2021 10:02:56 - INFO - __main__ - Step 90969: {'lr': 0.00017189834611856737, 'samples': 17466048, 'steps': 90968, 'loss/train': 1.3108224868774414} 11/07/2021 10:02:56 - INFO - __main__ - Step 90970: {'lr': 0.00017189330500245954, 'samples': 17466240, 'steps': 90969, 'loss/train': 1.472920536994934} 11/07/2021 10:02:56 - INFO - __main__ - Step 90971: {'lr': 0.00017188826392154538, 'samples': 17466432, 'steps': 90970, 'loss/train': 1.2641078233718872} 11/07/2021 10:02:57 - INFO - __main__ - Step 90972: {'lr': 0.00017188322287582726, 'samples': 17466624, 'steps': 90971, 'loss/train': 1.5919513702392578} 11/07/2021 10:02:58 - INFO - __main__ - Step 90973: {'lr': 0.00017187818186530733, 'samples': 17466816, 'steps': 90972, 'loss/train': 0.8158782720565796} 11/07/2021 10:02:58 - INFO - __main__ - Step 90974: {'lr': 0.0001718731408899878, 'samples': 17467008, 'steps': 90973, 'loss/train': 1.158613681793213} 11/07/2021 10:02:58 - INFO - __main__ - Step 90975: {'lr': 0.00017186809994987107, 'samples': 17467200, 'steps': 90974, 'loss/train': 0.26463308930397034} 11/07/2021 10:02:59 - INFO - __main__ - Step 90976: {'lr': 0.00017186305904495937, 'samples': 17467392, 'steps': 90975, 'loss/train': 1.334018349647522} 11/07/2021 10:02:59 - INFO - __main__ - Step 90977: {'lr': 0.00017185801817525494, 'samples': 17467584, 'steps': 90976, 'loss/train': 1.4570016860961914} 11/07/2021 10:03:00 - INFO - __main__ - Step 90978: {'lr': 0.00017185297734076011, 'samples': 17467776, 'steps': 90977, 'loss/train': 1.54364812374115} 11/07/2021 10:03:00 - INFO - __main__ - Step 90979: {'lr': 0.0001718479365414771, 'samples': 17467968, 'steps': 90978, 'loss/train': 1.8689799308776855} 11/07/2021 10:03:01 - INFO - __main__ - Step 90980: {'lr': 0.00017184289577740824, 'samples': 17468160, 'steps': 90979, 'loss/train': 1.3954706192016602} 11/07/2021 10:03:01 - INFO - __main__ - Step 90981: {'lr': 0.00017183785504855574, 'samples': 17468352, 'steps': 90980, 'loss/train': 1.1506446599960327} 11/07/2021 10:03:01 - INFO - __main__ - Step 90982: {'lr': 0.00017183281435492187, 'samples': 17468544, 'steps': 90981, 'loss/train': 1.315295696258545} 11/07/2021 10:03:02 - INFO - __main__ - Step 90983: {'lr': 0.00017182777369650898, 'samples': 17468736, 'steps': 90982, 'loss/train': 1.4653409719467163} 11/07/2021 10:03:03 - INFO - __main__ - Step 90984: {'lr': 0.00017182273307331925, 'samples': 17468928, 'steps': 90983, 'loss/train': 1.3527007102966309} 11/07/2021 10:03:03 - INFO - __main__ - Step 90985: {'lr': 0.000171817692485355, 'samples': 17469120, 'steps': 90984, 'loss/train': 1.4931399822235107} 11/07/2021 10:03:03 - INFO - __main__ - Step 90986: {'lr': 0.00017181265193261865, 'samples': 17469312, 'steps': 90985, 'loss/train': 0.7509333491325378} 11/07/2021 10:03:04 - INFO - __main__ - Step 90987: {'lr': 0.00017180761141511215, 'samples': 17469504, 'steps': 90986, 'loss/train': 1.6866122484207153} 11/07/2021 10:03:04 - INFO - __main__ - Step 90988: {'lr': 0.0001718025709328379, 'samples': 17469696, 'steps': 90987, 'loss/train': 1.6533828973770142} 11/07/2021 10:03:05 - INFO - __main__ - Step 90989: {'lr': 0.00017179753048579828, 'samples': 17469888, 'steps': 90988, 'loss/train': 0.13393795490264893} 11/07/2021 10:03:06 - INFO - __main__ - Step 90990: {'lr': 0.00017179249007399545, 'samples': 17470080, 'steps': 90989, 'loss/train': 1.6046254634857178} 11/07/2021 10:03:06 - INFO - __main__ - Step 90991: {'lr': 0.0001717874496974317, 'samples': 17470272, 'steps': 90990, 'loss/train': 1.9092167615890503} 11/07/2021 10:03:06 - INFO - __main__ - Step 90992: {'lr': 0.00017178240935610933, 'samples': 17470464, 'steps': 90991, 'loss/train': 1.8760197162628174} 11/07/2021 10:03:07 - INFO - __main__ - Step 90993: {'lr': 0.0001717773690500306, 'samples': 17470656, 'steps': 90992, 'loss/train': 1.1607366800308228} 11/07/2021 10:03:08 - INFO - __main__ - Step 90994: {'lr': 0.0001717723287791978, 'samples': 17470848, 'steps': 90993, 'loss/train': 1.268720030784607} 11/07/2021 10:03:09 - INFO - __main__ - Step 90995: {'lr': 0.00017176728854361318, 'samples': 17471040, 'steps': 90994, 'loss/train': 1.3736612796783447} 11/07/2021 10:03:09 - INFO - __main__ - Step 90996: {'lr': 0.000171762248343279, 'samples': 17471232, 'steps': 90995, 'loss/train': 1.1984763145446777} 11/07/2021 10:03:10 - INFO - __main__ - Step 90997: {'lr': 0.00017175720817819753, 'samples': 17471424, 'steps': 90996, 'loss/train': 0.49548232555389404} 11/07/2021 10:03:10 - INFO - __main__ - Step 90998: {'lr': 0.00017175216804837107, 'samples': 17471616, 'steps': 90997, 'loss/train': 1.0968831777572632} 11/07/2021 10:03:10 - INFO - __main__ - Step 90999: {'lr': 0.000171747127953802, 'samples': 17471808, 'steps': 90998, 'loss/train': 1.1632862091064453} 11/07/2021 10:03:11 - INFO - __main__ - Step 91000: {'lr': 0.00017174208789449234, 'samples': 17472000, 'steps': 90999, 'loss/train': 0.8750776648521423} 11/07/2021 10:03:11 - INFO - __main__ - Step 91001: {'lr': 0.00017173704787044446, 'samples': 17472192, 'steps': 91000, 'loss/train': 1.7180875539779663} 11/07/2021 10:03:12 - INFO - __main__ - Step 91002: {'lr': 0.00017173200788166073, 'samples': 17472384, 'steps': 91001, 'loss/train': 1.3573975563049316} 11/07/2021 10:03:12 - INFO - __main__ - Step 91003: {'lr': 0.0001717269679281433, 'samples': 17472576, 'steps': 91002, 'loss/train': 1.1743091344833374} 11/07/2021 10:03:13 - INFO - __main__ - Step 91004: {'lr': 0.0001717219280098945, 'samples': 17472768, 'steps': 91003, 'loss/train': 1.7736656665802002} 11/07/2021 10:03:13 - INFO - __main__ - Step 91005: {'lr': 0.00017171688812691658, 'samples': 17472960, 'steps': 91004, 'loss/train': 1.3670347929000854} 11/07/2021 10:03:13 - INFO - __main__ - Step 91006: {'lr': 0.00017171184827921183, 'samples': 17473152, 'steps': 91005, 'loss/train': 1.4193978309631348} 11/07/2021 10:03:14 - INFO - __main__ - Step 91007: {'lr': 0.0001717068084667825, 'samples': 17473344, 'steps': 91006, 'loss/train': 1.1020389795303345} 11/07/2021 10:03:15 - INFO - __main__ - Step 91008: {'lr': 0.0001717017686896309, 'samples': 17473536, 'steps': 91007, 'loss/train': 1.4348492622375488} 11/07/2021 10:03:15 - INFO - __main__ - Step 91009: {'lr': 0.0001716967289477593, 'samples': 17473728, 'steps': 91008, 'loss/train': 2.4644651412963867} 11/07/2021 10:03:15 - INFO - __main__ - Step 91010: {'lr': 0.00017169168924116988, 'samples': 17473920, 'steps': 91009, 'loss/train': 1.6570743322372437} 11/07/2021 10:03:16 - INFO - __main__ - Step 91011: {'lr': 0.00017168664956986501, 'samples': 17474112, 'steps': 91010, 'loss/train': 1.2773441076278687} 11/07/2021 10:03:17 - INFO - __main__ - Step 91012: {'lr': 0.00017168160993384692, 'samples': 17474304, 'steps': 91011, 'loss/train': 1.6016008853912354} 11/07/2021 10:03:17 - INFO - __main__ - Step 91013: {'lr': 0.000171676570333118, 'samples': 17474496, 'steps': 91012, 'loss/train': 1.4839062690734863} 11/07/2021 10:03:17 - INFO - __main__ - Step 91014: {'lr': 0.00017167153076768027, 'samples': 17474688, 'steps': 91013, 'loss/train': 1.1828360557556152} 11/07/2021 10:03:18 - INFO - __main__ - Step 91015: {'lr': 0.0001716664912375362, 'samples': 17474880, 'steps': 91014, 'loss/train': 0.8659502267837524} 11/07/2021 10:03:18 - INFO - __main__ - Step 91016: {'lr': 0.00017166145174268797, 'samples': 17475072, 'steps': 91015, 'loss/train': 1.3940868377685547} 11/07/2021 10:03:20 - INFO - __main__ - Step 91017: {'lr': 0.0001716564122831379, 'samples': 17475264, 'steps': 91016, 'loss/train': 1.4210658073425293} 11/07/2021 10:03:20 - INFO - __main__ - Step 91018: {'lr': 0.0001716513728588882, 'samples': 17475456, 'steps': 91017, 'loss/train': 1.485975742340088} 11/07/2021 10:03:20 - INFO - __main__ - Step 91019: {'lr': 0.00017164633346994118, 'samples': 17475648, 'steps': 91018, 'loss/train': 1.987857460975647} 11/07/2021 10:03:21 - INFO - __main__ - Step 91020: {'lr': 0.00017164129411629915, 'samples': 17475840, 'steps': 91019, 'loss/train': 1.289507269859314} 11/07/2021 10:03:21 - INFO - __main__ - Step 91021: {'lr': 0.00017163625479796435, 'samples': 17476032, 'steps': 91020, 'loss/train': 1.5957543849945068} 11/07/2021 10:03:21 - INFO - __main__ - Step 91022: {'lr': 0.000171631215514939, 'samples': 17476224, 'steps': 91021, 'loss/train': 1.6463176012039185} 11/07/2021 10:03:23 - INFO - __main__ - Step 91023: {'lr': 0.00017162617626722545, 'samples': 17476416, 'steps': 91022, 'loss/train': 2.196046829223633} 11/07/2021 10:03:23 - INFO - __main__ - Step 91024: {'lr': 0.00017162113705482593, 'samples': 17476608, 'steps': 91023, 'loss/train': 1.4126073122024536} 11/07/2021 10:03:23 - INFO - __main__ - Step 91025: {'lr': 0.0001716160978777427, 'samples': 17476800, 'steps': 91024, 'loss/train': 0.9842761754989624} 11/07/2021 10:03:24 - INFO - __main__ - Step 91026: {'lr': 0.0001716110587359782, 'samples': 17476992, 'steps': 91025, 'loss/train': 1.5634592771530151} 11/07/2021 10:03:24 - INFO - __main__ - Step 91027: {'lr': 0.00017160601962953436, 'samples': 17477184, 'steps': 91026, 'loss/train': 1.2927395105361938} 11/07/2021 10:03:24 - INFO - __main__ - Step 91028: {'lr': 0.00017160098055841373, 'samples': 17477376, 'steps': 91027, 'loss/train': 1.5737289190292358} 11/07/2021 10:03:25 - INFO - __main__ - Step 91029: {'lr': 0.00017159594152261841, 'samples': 17477568, 'steps': 91028, 'loss/train': 1.2756054401397705} 11/07/2021 10:03:26 - INFO - __main__ - Step 91030: {'lr': 0.00017159090252215082, 'samples': 17477760, 'steps': 91029, 'loss/train': 1.4358768463134766} 11/07/2021 10:03:26 - INFO - __main__ - Step 91031: {'lr': 0.00017158586355701312, 'samples': 17477952, 'steps': 91030, 'loss/train': 1.2639111280441284} 11/07/2021 10:03:26 - INFO - __main__ - Step 91032: {'lr': 0.0001715808246272076, 'samples': 17478144, 'steps': 91031, 'loss/train': 1.349709391593933} 11/07/2021 10:03:27 - INFO - __main__ - Step 91033: {'lr': 0.0001715757857327366, 'samples': 17478336, 'steps': 91032, 'loss/train': 1.123180627822876} 11/07/2021 10:03:28 - INFO - __main__ - Step 91034: {'lr': 0.0001715707468736023, 'samples': 17478528, 'steps': 91033, 'loss/train': 1.5362658500671387} 11/07/2021 10:03:28 - INFO - __main__ - Step 91035: {'lr': 0.000171565708049807, 'samples': 17478720, 'steps': 91034, 'loss/train': 1.5788812637329102} 11/07/2021 10:03:29 - INFO - __main__ - Step 91036: {'lr': 0.000171560669261353, 'samples': 17478912, 'steps': 91035, 'loss/train': 1.3246372938156128} 11/07/2021 10:03:29 - INFO - __main__ - Step 91037: {'lr': 0.0001715556305082426, 'samples': 17479104, 'steps': 91036, 'loss/train': 1.111193299293518} 11/07/2021 10:03:29 - INFO - __main__ - Step 91038: {'lr': 0.00017155059179047795, 'samples': 17479296, 'steps': 91037, 'loss/train': 1.4725492000579834} 11/07/2021 10:03:30 - INFO - __main__ - Step 91039: {'lr': 0.00017154555310806152, 'samples': 17479488, 'steps': 91038, 'loss/train': 0.881872296333313} 11/07/2021 10:03:31 - INFO - __main__ - Step 91040: {'lr': 0.00017154051446099537, 'samples': 17479680, 'steps': 91039, 'loss/train': 1.4123868942260742} 11/07/2021 10:03:31 - INFO - __main__ - Step 91041: {'lr': 0.00017153547584928183, 'samples': 17479872, 'steps': 91040, 'loss/train': 1.6000200510025024} 11/07/2021 10:03:31 - INFO - __main__ - Step 91042: {'lr': 0.00017153043727292323, 'samples': 17480064, 'steps': 91041, 'loss/train': 1.5798293352127075} 11/07/2021 10:03:32 - INFO - __main__ - Step 91043: {'lr': 0.00017152539873192176, 'samples': 17480256, 'steps': 91042, 'loss/train': 1.0551667213439941} 11/07/2021 10:03:33 - INFO - __main__ - Step 91044: {'lr': 0.00017152036022627975, 'samples': 17480448, 'steps': 91043, 'loss/train': 1.0355415344238281} 11/07/2021 10:03:33 - INFO - __main__ - Step 91045: {'lr': 0.00017151532175599943, 'samples': 17480640, 'steps': 91044, 'loss/train': 1.2658652067184448} 11/07/2021 10:03:33 - INFO - __main__ - Step 91046: {'lr': 0.00017151028332108314, 'samples': 17480832, 'steps': 91045, 'loss/train': 1.7654387950897217} 11/07/2021 10:03:34 - INFO - __main__ - Step 91047: {'lr': 0.00017150524492153308, 'samples': 17481024, 'steps': 91046, 'loss/train': 2.1782279014587402} 11/07/2021 10:03:34 - INFO - __main__ - Step 91048: {'lr': 0.00017150020655735154, 'samples': 17481216, 'steps': 91047, 'loss/train': 1.507782220840454} 11/07/2021 10:03:35 - INFO - __main__ - Step 91049: {'lr': 0.00017149516822854082, 'samples': 17481408, 'steps': 91048, 'loss/train': 1.3863697052001953} 11/07/2021 10:03:35 - INFO - __main__ - Step 91050: {'lr': 0.00017149012993510315, 'samples': 17481600, 'steps': 91049, 'loss/train': 1.3235615491867065} 11/07/2021 10:03:36 - INFO - __main__ - Step 91051: {'lr': 0.00017148509167704083, 'samples': 17481792, 'steps': 91050, 'loss/train': 1.4846550226211548} 11/07/2021 10:03:36 - INFO - __main__ - Step 91052: {'lr': 0.0001714800534543561, 'samples': 17481984, 'steps': 91051, 'loss/train': 1.2615793943405151} 11/07/2021 10:03:36 - INFO - __main__ - Step 91053: {'lr': 0.00017147501526705133, 'samples': 17482176, 'steps': 91052, 'loss/train': 1.6538869142532349} 11/07/2021 10:03:37 - INFO - __main__ - Step 91054: {'lr': 0.00017146997711512866, 'samples': 17482368, 'steps': 91053, 'loss/train': 1.7381173372268677} 11/07/2021 10:03:38 - INFO - __main__ - Step 91055: {'lr': 0.00017146493899859036, 'samples': 17482560, 'steps': 91054, 'loss/train': 1.3796541690826416} 11/07/2021 10:03:38 - INFO - __main__ - Step 91056: {'lr': 0.00017145990091743877, 'samples': 17482752, 'steps': 91055, 'loss/train': 1.558150291442871} 11/07/2021 10:03:38 - INFO - __main__ - Step 91057: {'lr': 0.0001714548628716761, 'samples': 17482944, 'steps': 91056, 'loss/train': 1.9136332273483276} 11/07/2021 10:03:39 - INFO - __main__ - Step 91058: {'lr': 0.00017144982486130473, 'samples': 17483136, 'steps': 91057, 'loss/train': 0.9760960340499878} 11/07/2021 10:03:39 - INFO - __main__ - Step 91059: {'lr': 0.0001714447868863268, 'samples': 17483328, 'steps': 91058, 'loss/train': 1.176012635231018} 11/07/2021 10:03:40 - INFO - __main__ - Step 91060: {'lr': 0.00017143974894674464, 'samples': 17483520, 'steps': 91059, 'loss/train': 1.567379117012024} 11/07/2021 10:03:41 - INFO - __main__ - Step 91061: {'lr': 0.00017143471104256054, 'samples': 17483712, 'steps': 91060, 'loss/train': 1.174367904663086} 11/07/2021 10:03:41 - INFO - __main__ - Step 91062: {'lr': 0.00017142967317377672, 'samples': 17483904, 'steps': 91061, 'loss/train': 1.4978054761886597} 11/07/2021 10:03:41 - INFO - __main__ - Step 91063: {'lr': 0.0001714246353403955, 'samples': 17484096, 'steps': 91062, 'loss/train': 1.1215622425079346} 11/07/2021 10:03:42 - INFO - __main__ - Step 91064: {'lr': 0.00017141959754241916, 'samples': 17484288, 'steps': 91063, 'loss/train': 1.466196894645691} 11/07/2021 10:03:43 - INFO - __main__ - Step 91065: {'lr': 0.00017141455977984988, 'samples': 17484480, 'steps': 91064, 'loss/train': 1.2013875246047974} 11/07/2021 10:03:43 - INFO - __main__ - Step 91066: {'lr': 0.00017140952205269006, 'samples': 17484672, 'steps': 91065, 'loss/train': 1.3441356420516968} 11/07/2021 10:03:43 - INFO - __main__ - Step 91067: {'lr': 0.00017140448436094182, 'samples': 17484864, 'steps': 91066, 'loss/train': 1.1250414848327637} 11/07/2021 10:03:44 - INFO - __main__ - Step 91068: {'lr': 0.00017139944670460755, 'samples': 17485056, 'steps': 91067, 'loss/train': 1.5781985521316528} 11/07/2021 10:03:44 - INFO - __main__ - Step 91069: {'lr': 0.00017139440908368943, 'samples': 17485248, 'steps': 91068, 'loss/train': 1.1588268280029297} 11/07/2021 10:03:45 - INFO - __main__ - Step 91070: {'lr': 0.00017138937149818978, 'samples': 17485440, 'steps': 91069, 'loss/train': 1.408466100692749} 11/07/2021 10:03:45 - INFO - __main__ - Step 91071: {'lr': 0.0001713843339481109, 'samples': 17485632, 'steps': 91070, 'loss/train': 1.3681505918502808} 11/07/2021 10:03:46 - INFO - __main__ - Step 91072: {'lr': 0.000171379296433455, 'samples': 17485824, 'steps': 91071, 'loss/train': 1.339435338973999} 11/07/2021 10:03:46 - INFO - __main__ - Step 91073: {'lr': 0.00017137425895422437, 'samples': 17486016, 'steps': 91072, 'loss/train': 1.1582558155059814} 11/07/2021 10:03:46 - INFO - __main__ - Step 91074: {'lr': 0.00017136922151042133, 'samples': 17486208, 'steps': 91073, 'loss/train': 1.3440704345703125} 11/07/2021 10:03:48 - INFO - __main__ - Step 91075: {'lr': 0.00017136418410204814, 'samples': 17486400, 'steps': 91074, 'loss/train': 1.0521924495697021} 11/07/2021 10:03:48 - INFO - __main__ - Step 91076: {'lr': 0.00017135914672910697, 'samples': 17486592, 'steps': 91075, 'loss/train': 1.6865919828414917} 11/07/2021 10:03:48 - INFO - __main__ - Step 91077: {'lr': 0.00017135410939160013, 'samples': 17486784, 'steps': 91076, 'loss/train': 1.4880939722061157} 11/07/2021 10:03:49 - INFO - __main__ - Step 91078: {'lr': 0.00017134907208952993, 'samples': 17486976, 'steps': 91077, 'loss/train': 0.1866437941789627} 11/07/2021 10:03:49 - INFO - __main__ - Step 91079: {'lr': 0.00017134403482289864, 'samples': 17487168, 'steps': 91078, 'loss/train': 1.6726735830307007} 11/07/2021 10:03:50 - INFO - __main__ - Step 91080: {'lr': 0.00017133899759170856, 'samples': 17487360, 'steps': 91079, 'loss/train': 1.4302408695220947} 11/07/2021 10:03:50 - INFO - __main__ - Step 91081: {'lr': 0.00017133396039596186, 'samples': 17487552, 'steps': 91080, 'loss/train': 1.1960514783859253} 11/07/2021 10:03:51 - INFO - __main__ - Step 91082: {'lr': 0.00017132892323566085, 'samples': 17487744, 'steps': 91081, 'loss/train': 0.4788397252559662} 11/07/2021 10:03:51 - INFO - __main__ - Step 91083: {'lr': 0.00017132388611080786, 'samples': 17487936, 'steps': 91082, 'loss/train': 1.0748366117477417} 11/07/2021 10:03:51 - INFO - __main__ - Step 91084: {'lr': 0.00017131884902140508, 'samples': 17488128, 'steps': 91083, 'loss/train': 1.179880142211914} 11/07/2021 10:03:52 - INFO - __main__ - Step 91085: {'lr': 0.00017131381196745478, 'samples': 17488320, 'steps': 91084, 'loss/train': 1.1678647994995117} 11/07/2021 10:03:53 - INFO - __main__ - Step 91086: {'lr': 0.00017130877494895937, 'samples': 17488512, 'steps': 91085, 'loss/train': 1.340047836303711} 11/07/2021 10:03:53 - INFO - __main__ - Step 91087: {'lr': 0.00017130373796592094, 'samples': 17488704, 'steps': 91086, 'loss/train': 1.2676821947097778} 11/07/2021 10:03:54 - INFO - __main__ - Step 91088: {'lr': 0.00017129870101834183, 'samples': 17488896, 'steps': 91087, 'loss/train': 1.5401389598846436} 11/07/2021 10:03:54 - INFO - __main__ - Step 91089: {'lr': 0.00017129366410622432, 'samples': 17489088, 'steps': 91088, 'loss/train': 1.5693994760513306} 11/07/2021 10:03:54 - INFO - __main__ - Step 91090: {'lr': 0.00017128862722957065, 'samples': 17489280, 'steps': 91089, 'loss/train': 1.5146839618682861} 11/07/2021 10:03:55 - INFO - __main__ - Step 91091: {'lr': 0.0001712835903883831, 'samples': 17489472, 'steps': 91090, 'loss/train': 1.6846659183502197} 11/07/2021 10:03:56 - INFO - __main__ - Step 91092: {'lr': 0.00017127855358266397, 'samples': 17489664, 'steps': 91091, 'loss/train': 1.4992595911026} 11/07/2021 10:03:56 - INFO - __main__ - Step 91093: {'lr': 0.00017127351681241556, 'samples': 17489856, 'steps': 91092, 'loss/train': 1.3142567873001099} 11/07/2021 10:03:56 - INFO - __main__ - Step 91094: {'lr': 0.00017126848007764008, 'samples': 17490048, 'steps': 91093, 'loss/train': 1.5615043640136719} 11/07/2021 10:03:57 - INFO - __main__ - Step 91095: {'lr': 0.00017126344337833974, 'samples': 17490240, 'steps': 91094, 'loss/train': 1.186570405960083} 11/07/2021 10:03:58 - INFO - __main__ - Step 91096: {'lr': 0.0001712584067145169, 'samples': 17490432, 'steps': 91095, 'loss/train': 1.262673258781433} 11/07/2021 10:03:58 - INFO - __main__ - Step 91097: {'lr': 0.00017125337008617387, 'samples': 17490624, 'steps': 91096, 'loss/train': 1.446333646774292} 11/07/2021 10:03:58 - INFO - __main__ - Step 91098: {'lr': 0.00017124833349331278, 'samples': 17490816, 'steps': 91097, 'loss/train': 1.6307352781295776} 11/07/2021 10:03:59 - INFO - __main__ - Step 91099: {'lr': 0.00017124329693593598, 'samples': 17491008, 'steps': 91098, 'loss/train': 1.0956145524978638} 11/07/2021 10:03:59 - INFO - __main__ - Step 91100: {'lr': 0.00017123826041404579, 'samples': 17491200, 'steps': 91099, 'loss/train': 1.6285933256149292} 11/07/2021 10:03:59 - INFO - __main__ - Step 91101: {'lr': 0.00017123322392764435, 'samples': 17491392, 'steps': 91100, 'loss/train': 1.3798280954360962} 11/07/2021 10:04:01 - INFO - __main__ - Step 91102: {'lr': 0.00017122818747673403, 'samples': 17491584, 'steps': 91101, 'loss/train': 1.3695838451385498} 11/07/2021 10:04:01 - INFO - __main__ - Step 91103: {'lr': 0.00017122315106131707, 'samples': 17491776, 'steps': 91102, 'loss/train': 1.7520142793655396} 11/07/2021 10:04:02 - INFO - __main__ - Step 91104: {'lr': 0.00017121811468139575, 'samples': 17491968, 'steps': 91103, 'loss/train': 1.5043398141860962} 11/07/2021 10:04:02 - INFO - __main__ - Step 91105: {'lr': 0.00017121307833697235, 'samples': 17492160, 'steps': 91104, 'loss/train': 1.435557246208191} 11/07/2021 10:04:02 - INFO - __main__ - Step 91106: {'lr': 0.0001712080420280491, 'samples': 17492352, 'steps': 91105, 'loss/train': 1.2947115898132324} 11/07/2021 10:04:03 - INFO - __main__ - Step 91107: {'lr': 0.00017120300575462836, 'samples': 17492544, 'steps': 91106, 'loss/train': 1.8548569679260254} 11/07/2021 10:04:04 - INFO - __main__ - Step 91108: {'lr': 0.0001711979695167123, 'samples': 17492736, 'steps': 91107, 'loss/train': 1.144775390625} 11/07/2021 10:04:04 - INFO - __main__ - Step 91109: {'lr': 0.0001711929333143032, 'samples': 17492928, 'steps': 91108, 'loss/train': 1.2250182628631592} 11/07/2021 10:04:04 - INFO - __main__ - Step 91110: {'lr': 0.00017118789714740332, 'samples': 17493120, 'steps': 91109, 'loss/train': 0.8633219003677368} 11/07/2021 10:04:05 - INFO - __main__ - Step 91111: {'lr': 0.000171182861016015, 'samples': 17493312, 'steps': 91110, 'loss/train': 1.6747004985809326} 11/07/2021 10:04:05 - INFO - __main__ - Step 91112: {'lr': 0.0001711778249201404, 'samples': 17493504, 'steps': 91111, 'loss/train': 1.515761375427246} 11/07/2021 10:04:07 - INFO - __main__ - Step 91113: {'lr': 0.0001711727888597819, 'samples': 17493696, 'steps': 91112, 'loss/train': 1.4684410095214844} 11/07/2021 10:04:07 - INFO - __main__ - Step 91114: {'lr': 0.00017116775283494172, 'samples': 17493888, 'steps': 91113, 'loss/train': 1.4996790885925293} 11/07/2021 10:04:07 - INFO - __main__ - Step 91115: {'lr': 0.00017116271684562213, 'samples': 17494080, 'steps': 91114, 'loss/train': 1.3126815557479858} 11/07/2021 10:04:08 - INFO - __main__ - Step 91116: {'lr': 0.00017115768089182539, 'samples': 17494272, 'steps': 91115, 'loss/train': 1.7424076795578003} 11/07/2021 10:04:08 - INFO - __main__ - Step 91117: {'lr': 0.00017115264497355383, 'samples': 17494464, 'steps': 91116, 'loss/train': 1.7465263605117798} 11/07/2021 10:04:08 - INFO - __main__ - Step 91118: {'lr': 0.00017114760909080963, 'samples': 17494656, 'steps': 91117, 'loss/train': 1.7628839015960693} 11/07/2021 10:04:09 - INFO - __main__ - Step 91119: {'lr': 0.00017114257324359508, 'samples': 17494848, 'steps': 91118, 'loss/train': 1.0106357336044312} 11/07/2021 10:04:10 - INFO - __main__ - Step 91120: {'lr': 0.0001711375374319126, 'samples': 17495040, 'steps': 91119, 'loss/train': 1.310302495956421} 11/07/2021 10:04:10 - INFO - __main__ - Step 91121: {'lr': 0.0001711325016557642, 'samples': 17495232, 'steps': 91120, 'loss/train': 1.644096851348877} 11/07/2021 10:04:10 - INFO - __main__ - Step 91122: {'lr': 0.00017112746591515233, 'samples': 17495424, 'steps': 91121, 'loss/train': 1.3189162015914917} 11/07/2021 10:04:11 - INFO - __main__ - Step 91123: {'lr': 0.00017112243021007918, 'samples': 17495616, 'steps': 91122, 'loss/train': 1.2553013563156128} 11/07/2021 10:04:12 - INFO - __main__ - Step 91124: {'lr': 0.00017111739454054702, 'samples': 17495808, 'steps': 91123, 'loss/train': 1.2739331722259521} 11/07/2021 10:04:12 - INFO - __main__ - Step 91125: {'lr': 0.00017111235890655818, 'samples': 17496000, 'steps': 91124, 'loss/train': 1.5151300430297852} 11/07/2021 10:04:13 - INFO - __main__ - Step 91126: {'lr': 0.0001711073233081149, 'samples': 17496192, 'steps': 91125, 'loss/train': 1.839024543762207} 11/07/2021 10:04:13 - INFO - __main__ - Step 91127: {'lr': 0.00017110228774521943, 'samples': 17496384, 'steps': 91126, 'loss/train': 1.1903715133666992} 11/07/2021 10:04:13 - INFO - __main__ - Step 91128: {'lr': 0.00017109725221787405, 'samples': 17496576, 'steps': 91127, 'loss/train': 2.0294435024261475} 11/07/2021 10:04:14 - INFO - __main__ - Step 91129: {'lr': 0.00017109221672608106, 'samples': 17496768, 'steps': 91128, 'loss/train': 0.3106578588485718} 11/07/2021 10:04:15 - INFO - __main__ - Step 91130: {'lr': 0.00017108718126984264, 'samples': 17496960, 'steps': 91129, 'loss/train': 1.7312217950820923} 11/07/2021 10:04:16 - INFO - __main__ - Step 91131: {'lr': 0.00017108214584916114, 'samples': 17497152, 'steps': 91130, 'loss/train': 1.4726529121398926} 11/07/2021 10:04:16 - INFO - __main__ - Step 91132: {'lr': 0.00017107711046403885, 'samples': 17497344, 'steps': 91131, 'loss/train': 0.8936165571212769} 11/07/2021 10:04:16 - INFO - __main__ - Step 91133: {'lr': 0.00017107207511447793, 'samples': 17497536, 'steps': 91132, 'loss/train': 1.4766162633895874} 11/07/2021 10:04:17 - INFO - __main__ - Step 91134: {'lr': 0.00017106703980048084, 'samples': 17497728, 'steps': 91133, 'loss/train': 1.238621473312378} 11/07/2021 10:04:18 - INFO - __main__ - Step 91135: {'lr': 0.00017106200452204966, 'samples': 17497920, 'steps': 91134, 'loss/train': 1.3491874933242798} 11/07/2021 10:04:18 - INFO - __main__ - Step 91136: {'lr': 0.00017105696927918667, 'samples': 17498112, 'steps': 91135, 'loss/train': 0.9630807638168335} 11/07/2021 10:04:18 - INFO - __main__ - Step 91137: {'lr': 0.00017105193407189424, 'samples': 17498304, 'steps': 91136, 'loss/train': 0.46576443314552307} 11/07/2021 10:04:19 - INFO - __main__ - Step 91138: {'lr': 0.00017104689890017454, 'samples': 17498496, 'steps': 91137, 'loss/train': 1.5001730918884277} 11/07/2021 10:04:19 - INFO - __main__ - Step 91139: {'lr': 0.00017104186376402992, 'samples': 17498688, 'steps': 91138, 'loss/train': 0.9075676202774048} 11/07/2021 10:04:20 - INFO - __main__ - Step 91140: {'lr': 0.0001710368286634626, 'samples': 17498880, 'steps': 91139, 'loss/train': 0.8430210947990417} 11/07/2021 10:04:20 - INFO - __main__ - Step 91141: {'lr': 0.00017103179359847487, 'samples': 17499072, 'steps': 91140, 'loss/train': 1.2024319171905518} 11/07/2021 10:04:21 - INFO - __main__ - Step 91142: {'lr': 0.000171026758569069, 'samples': 17499264, 'steps': 91141, 'loss/train': 1.4049038887023926} 11/07/2021 10:04:21 - INFO - __main__ - Step 91143: {'lr': 0.0001710217235752473, 'samples': 17499456, 'steps': 91142, 'loss/train': 1.5455443859100342} 11/07/2021 10:04:22 - INFO - __main__ - Step 91144: {'lr': 0.00017101668861701193, 'samples': 17499648, 'steps': 91143, 'loss/train': 1.2229286432266235} 11/07/2021 10:04:22 - INFO - __main__ - Step 91145: {'lr': 0.00017101165369436523, 'samples': 17499840, 'steps': 91144, 'loss/train': 1.5798636674880981} 11/07/2021 10:04:23 - INFO - __main__ - Step 91146: {'lr': 0.0001710066188073095, 'samples': 17500032, 'steps': 91145, 'loss/train': 1.4620858430862427} 11/07/2021 10:04:23 - INFO - __main__ - Step 91147: {'lr': 0.00017100158395584703, 'samples': 17500224, 'steps': 91146, 'loss/train': 1.3972076177597046} 11/07/2021 10:04:24 - INFO - __main__ - Step 91148: {'lr': 0.0001709965491399799, 'samples': 17500416, 'steps': 91147, 'loss/train': 1.444023609161377} 11/07/2021 10:04:24 - INFO - __main__ - Step 91149: {'lr': 0.00017099151435971056, 'samples': 17500608, 'steps': 91148, 'loss/train': 1.169904112815857} 11/07/2021 10:04:25 - INFO - __main__ - Step 91150: {'lr': 0.0001709864796150412, 'samples': 17500800, 'steps': 91149, 'loss/train': 1.4680781364440918} 11/07/2021 10:04:25 - INFO - __main__ - Step 91151: {'lr': 0.00017098144490597413, 'samples': 17500992, 'steps': 91150, 'loss/train': 1.439003348350525} 11/07/2021 10:04:26 - INFO - __main__ - Step 91152: {'lr': 0.0001709764102325116, 'samples': 17501184, 'steps': 91151, 'loss/train': 1.5993303060531616} 11/07/2021 10:04:26 - INFO - __main__ - Step 91153: {'lr': 0.00017097137559465587, 'samples': 17501376, 'steps': 91152, 'loss/train': 1.1558864116668701} 11/07/2021 10:04:26 - INFO - __main__ - Step 91154: {'lr': 0.0001709663409924092, 'samples': 17501568, 'steps': 91153, 'loss/train': 1.4218530654907227} 11/07/2021 10:04:27 - INFO - __main__ - Step 91155: {'lr': 0.00017096130642577393, 'samples': 17501760, 'steps': 91154, 'loss/train': 1.1932648420333862} 11/07/2021 10:04:28 - INFO - __main__ - Step 91156: {'lr': 0.00017095627189475223, 'samples': 17501952, 'steps': 91155, 'loss/train': 1.5855844020843506} 11/07/2021 10:04:28 - INFO - __main__ - Step 91157: {'lr': 0.00017095123739934643, 'samples': 17502144, 'steps': 91156, 'loss/train': 1.5536400079727173} 11/07/2021 10:04:28 - INFO - __main__ - Step 91158: {'lr': 0.0001709462029395588, 'samples': 17502336, 'steps': 91157, 'loss/train': 1.3643624782562256} 11/07/2021 10:04:29 - INFO - __main__ - Step 91159: {'lr': 0.00017094116851539153, 'samples': 17502528, 'steps': 91158, 'loss/train': 1.2835404872894287} 11/07/2021 10:04:29 - INFO - __main__ - Step 91160: {'lr': 0.0001709361341268471, 'samples': 17502720, 'steps': 91159, 'loss/train': 0.43126392364501953} 11/07/2021 10:04:31 - INFO - __main__ - Step 91161: {'lr': 0.00017093109977392754, 'samples': 17502912, 'steps': 91160, 'loss/train': 0.7465264797210693} 11/07/2021 10:04:31 - INFO - __main__ - Step 91162: {'lr': 0.00017092606545663518, 'samples': 17503104, 'steps': 91161, 'loss/train': 0.5297514796257019} 11/07/2021 10:04:32 - INFO - __main__ - Step 91163: {'lr': 0.0001709210311749723, 'samples': 17503296, 'steps': 91162, 'loss/train': 1.0865708589553833} 11/07/2021 10:04:32 - INFO - __main__ - Step 91164: {'lr': 0.00017091599692894123, 'samples': 17503488, 'steps': 91163, 'loss/train': 1.440803050994873} 11/07/2021 10:04:32 - INFO - __main__ - Step 91165: {'lr': 0.00017091096271854418, 'samples': 17503680, 'steps': 91164, 'loss/train': 1.4928635358810425} 11/07/2021 10:04:33 - INFO - __main__ - Step 91166: {'lr': 0.0001709059285437834, 'samples': 17503872, 'steps': 91165, 'loss/train': 1.7822178602218628} 11/07/2021 10:04:33 - INFO - __main__ - Step 91167: {'lr': 0.0001709008944046612, 'samples': 17504064, 'steps': 91166, 'loss/train': 1.7735202312469482} 11/07/2021 10:04:34 - INFO - __main__ - Step 91168: {'lr': 0.0001708958603011798, 'samples': 17504256, 'steps': 91167, 'loss/train': 1.6663297414779663} 11/07/2021 10:04:35 - INFO - __main__ - Step 91169: {'lr': 0.00017089082623334158, 'samples': 17504448, 'steps': 91168, 'loss/train': 1.5399951934814453} 11/07/2021 10:04:35 - INFO - __main__ - Step 91170: {'lr': 0.0001708857922011487, 'samples': 17504640, 'steps': 91169, 'loss/train': 0.8875201940536499} 11/07/2021 10:04:35 - INFO - __main__ - Step 91171: {'lr': 0.00017088075820460348, 'samples': 17504832, 'steps': 91170, 'loss/train': 0.6449953317642212} 11/07/2021 10:04:36 - INFO - __main__ - Step 91172: {'lr': 0.00017087572424370813, 'samples': 17505024, 'steps': 91171, 'loss/train': 1.7663652896881104} 11/07/2021 10:04:37 - INFO - __main__ - Step 91173: {'lr': 0.00017087069031846498, 'samples': 17505216, 'steps': 91172, 'loss/train': 1.1451759338378906} 11/07/2021 10:04:37 - INFO - __main__ - Step 91174: {'lr': 0.00017086565642887637, 'samples': 17505408, 'steps': 91173, 'loss/train': 1.598867654800415} 11/07/2021 10:04:38 - INFO - __main__ - Step 91175: {'lr': 0.00017086062257494437, 'samples': 17505600, 'steps': 91174, 'loss/train': 1.1379019021987915} 11/07/2021 10:04:38 - INFO - __main__ - Step 91176: {'lr': 0.00017085558875667135, 'samples': 17505792, 'steps': 91175, 'loss/train': 1.0265650749206543} 11/07/2021 10:04:38 - INFO - __main__ - Step 91177: {'lr': 0.0001708505549740596, 'samples': 17505984, 'steps': 91176, 'loss/train': 1.075728416442871} 11/07/2021 10:04:39 - INFO - __main__ - Step 91178: {'lr': 0.00017084552122711134, 'samples': 17506176, 'steps': 91177, 'loss/train': 1.753763198852539} 11/07/2021 10:04:40 - INFO - __main__ - Step 91179: {'lr': 0.00017084048751582888, 'samples': 17506368, 'steps': 91178, 'loss/train': 0.085906483232975} 11/07/2021 10:04:40 - INFO - __main__ - Step 91180: {'lr': 0.00017083545384021447, 'samples': 17506560, 'steps': 91179, 'loss/train': 1.2094426155090332} 11/07/2021 10:04:40 - INFO - __main__ - Step 91181: {'lr': 0.0001708304202002704, 'samples': 17506752, 'steps': 91180, 'loss/train': 1.0663374662399292} 11/07/2021 10:04:41 - INFO - __main__ - Step 91182: {'lr': 0.0001708253865959989, 'samples': 17506944, 'steps': 91181, 'loss/train': 0.7450734972953796} 11/07/2021 10:04:42 - INFO - __main__ - Step 91183: {'lr': 0.00017082035302740228, 'samples': 17507136, 'steps': 91182, 'loss/train': 0.8634593486785889} 11/07/2021 10:04:42 - INFO - __main__ - Step 91184: {'lr': 0.0001708153194944828, 'samples': 17507328, 'steps': 91183, 'loss/train': 1.3389263153076172} 11/07/2021 10:04:43 - INFO - __main__ - Step 91185: {'lr': 0.00017081028599724268, 'samples': 17507520, 'steps': 91184, 'loss/train': 0.7693899869918823} 11/07/2021 10:04:43 - INFO - __main__ - Step 91186: {'lr': 0.00017080525253568423, 'samples': 17507712, 'steps': 91185, 'loss/train': 1.3610011339187622} 11/07/2021 10:04:43 - INFO - __main__ - Step 91187: {'lr': 0.0001708002191098098, 'samples': 17507904, 'steps': 91186, 'loss/train': 1.3246533870697021} 11/07/2021 10:04:44 - INFO - __main__ - Step 91188: {'lr': 0.0001707951857196215, 'samples': 17508096, 'steps': 91187, 'loss/train': 0.5204952955245972} 11/07/2021 10:04:45 - INFO - __main__ - Step 91189: {'lr': 0.00017079015236512167, 'samples': 17508288, 'steps': 91188, 'loss/train': 1.3204582929611206} 11/07/2021 10:04:45 - INFO - __main__ - Step 91190: {'lr': 0.00017078511904631256, 'samples': 17508480, 'steps': 91189, 'loss/train': 1.3868119716644287} 11/07/2021 10:04:46 - INFO - __main__ - Step 91191: {'lr': 0.00017078008576319642, 'samples': 17508672, 'steps': 91190, 'loss/train': 1.4147249460220337} 11/07/2021 10:04:46 - INFO - __main__ - Step 91192: {'lr': 0.0001707750525157756, 'samples': 17508864, 'steps': 91191, 'loss/train': 1.4046920537948608} 11/07/2021 10:04:46 - INFO - __main__ - Step 91193: {'lr': 0.0001707700193040523, 'samples': 17509056, 'steps': 91192, 'loss/train': 1.1278618574142456} 11/07/2021 10:04:47 - INFO - __main__ - Step 91194: {'lr': 0.00017076498612802882, 'samples': 17509248, 'steps': 91193, 'loss/train': 1.9000506401062012} 11/07/2021 10:04:48 - INFO - __main__ - Step 91195: {'lr': 0.00017075995298770742, 'samples': 17509440, 'steps': 91194, 'loss/train': 1.253057599067688} 11/07/2021 10:04:48 - INFO - __main__ - Step 91196: {'lr': 0.00017075491988309034, 'samples': 17509632, 'steps': 91195, 'loss/train': 1.4474903345108032} 11/07/2021 10:04:48 - INFO - __main__ - Step 91197: {'lr': 0.00017074988681417986, 'samples': 17509824, 'steps': 91196, 'loss/train': 0.6712296605110168} 11/07/2021 10:04:49 - INFO - __main__ - Step 91198: {'lr': 0.0001707448537809783, 'samples': 17510016, 'steps': 91197, 'loss/train': 1.0457643270492554} 11/07/2021 10:04:50 - INFO - __main__ - Step 91199: {'lr': 0.00017073982078348788, 'samples': 17510208, 'steps': 91198, 'loss/train': 1.5514863729476929} 11/07/2021 10:04:50 - INFO - __main__ - Step 91200: {'lr': 0.00017073478782171086, 'samples': 17510400, 'steps': 91199, 'loss/train': 1.6296660900115967} 11/07/2021 10:04:50 - INFO - __main__ - Step 91201: {'lr': 0.00017072975489564957, 'samples': 17510592, 'steps': 91200, 'loss/train': 1.3524197340011597} 11/07/2021 10:04:51 - INFO - __main__ - Step 91202: {'lr': 0.00017072472200530616, 'samples': 17510784, 'steps': 91201, 'loss/train': 0.7205749750137329} 11/07/2021 10:04:51 - INFO - __main__ - Step 91203: {'lr': 0.00017071968915068297, 'samples': 17510976, 'steps': 91202, 'loss/train': 1.6920019388198853} 11/07/2021 10:04:52 - INFO - __main__ - Step 91204: {'lr': 0.0001707146563317823, 'samples': 17511168, 'steps': 91203, 'loss/train': 1.1365125179290771} 11/07/2021 10:04:52 - INFO - __main__ - Step 91205: {'lr': 0.00017070962354860637, 'samples': 17511360, 'steps': 91204, 'loss/train': 1.4704904556274414} 11/07/2021 10:04:53 - INFO - __main__ - Step 91206: {'lr': 0.0001707045908011574, 'samples': 17511552, 'steps': 91205, 'loss/train': 1.2962397336959839} 11/07/2021 10:04:53 - INFO - __main__ - Step 91207: {'lr': 0.0001706995580894378, 'samples': 17511744, 'steps': 91206, 'loss/train': 1.2407934665679932} 11/07/2021 10:04:54 - INFO - __main__ - Step 91208: {'lr': 0.00017069452541344972, 'samples': 17511936, 'steps': 91207, 'loss/train': 1.0776671171188354} 11/07/2021 10:04:55 - INFO - __main__ - Step 91209: {'lr': 0.0001706894927731955, 'samples': 17512128, 'steps': 91208, 'loss/train': 0.7523512840270996} 11/07/2021 10:04:55 - INFO - __main__ - Step 91210: {'lr': 0.00017068446016867733, 'samples': 17512320, 'steps': 91209, 'loss/train': 1.3921631574630737} 11/07/2021 10:04:55 - INFO - __main__ - Step 91211: {'lr': 0.00017067942759989752, 'samples': 17512512, 'steps': 91210, 'loss/train': 1.0850588083267212} 11/07/2021 10:04:56 - INFO - __main__ - Step 91212: {'lr': 0.00017067439506685832, 'samples': 17512704, 'steps': 91211, 'loss/train': 1.6318235397338867} 11/07/2021 10:04:56 - INFO - __main__ - Step 91213: {'lr': 0.00017066936256956205, 'samples': 17512896, 'steps': 91212, 'loss/train': 1.3646070957183838} 11/07/2021 10:04:56 - INFO - __main__ - Step 91214: {'lr': 0.000170664330108011, 'samples': 17513088, 'steps': 91213, 'loss/train': 1.2993520498275757} 11/07/2021 10:04:57 - INFO - __main__ - Step 91215: {'lr': 0.0001706592976822073, 'samples': 17513280, 'steps': 91214, 'loss/train': 1.7510337829589844} 11/07/2021 10:04:58 - INFO - __main__ - Step 91216: {'lr': 0.00017065426529215327, 'samples': 17513472, 'steps': 91215, 'loss/train': 1.1384649276733398} 11/07/2021 10:04:58 - INFO - __main__ - Step 91217: {'lr': 0.00017064923293785126, 'samples': 17513664, 'steps': 91216, 'loss/train': 1.4524604082107544} 11/07/2021 10:04:59 - INFO - __main__ - Step 91218: {'lr': 0.00017064420061930344, 'samples': 17513856, 'steps': 91217, 'loss/train': 1.1837769746780396} 11/07/2021 10:04:59 - INFO - __main__ - Step 91219: {'lr': 0.00017063916833651215, 'samples': 17514048, 'steps': 91218, 'loss/train': 1.2428135871887207} 11/07/2021 10:05:00 - INFO - __main__ - Step 91220: {'lr': 0.00017063413608947963, 'samples': 17514240, 'steps': 91219, 'loss/train': 1.9369651079177856} 11/07/2021 10:05:00 - INFO - __main__ - Step 91221: {'lr': 0.00017062910387820811, 'samples': 17514432, 'steps': 91220, 'loss/train': 1.4901460409164429} 11/07/2021 10:05:01 - INFO - __main__ - Step 91222: {'lr': 0.00017062407170269996, 'samples': 17514624, 'steps': 91221, 'loss/train': 1.65607488155365} 11/07/2021 10:05:01 - INFO - __main__ - Step 91223: {'lr': 0.0001706190395629573, 'samples': 17514816, 'steps': 91222, 'loss/train': 1.5062674283981323} 11/07/2021 10:05:01 - INFO - __main__ - Step 91224: {'lr': 0.0001706140074589825, 'samples': 17515008, 'steps': 91223, 'loss/train': 1.5258713960647583} 11/07/2021 10:05:03 - INFO - __main__ - Step 91225: {'lr': 0.0001706089753907778, 'samples': 17515200, 'steps': 91224, 'loss/train': 0.96661376953125} 11/07/2021 10:05:04 - INFO - __main__ - Step 91226: {'lr': 0.00017060394335834545, 'samples': 17515392, 'steps': 91225, 'loss/train': 1.763399600982666} 11/07/2021 10:05:04 - INFO - __main__ - Step 91227: {'lr': 0.00017059891136168777, 'samples': 17515584, 'steps': 91226, 'loss/train': 0.9166393280029297} 11/07/2021 10:05:04 - INFO - __main__ - Step 91228: {'lr': 0.00017059387940080703, 'samples': 17515776, 'steps': 91227, 'loss/train': 1.2842695713043213} 11/07/2021 10:05:05 - INFO - __main__ - Step 91229: {'lr': 0.00017058884747570542, 'samples': 17515968, 'steps': 91228, 'loss/train': 0.8733910918235779} 11/07/2021 10:05:05 - INFO - __main__ - Step 91230: {'lr': 0.00017058381558638524, 'samples': 17516160, 'steps': 91229, 'loss/train': 1.4084731340408325} 11/07/2021 10:05:05 - INFO - __main__ - Step 91231: {'lr': 0.00017057878373284886, 'samples': 17516352, 'steps': 91230, 'loss/train': 1.7582858800888062} 11/07/2021 10:05:06 - INFO - __main__ - Step 91232: {'lr': 0.00017057375191509834, 'samples': 17516544, 'steps': 91231, 'loss/train': 1.7042371034622192} 11/07/2021 10:05:07 - INFO - __main__ - Step 91233: {'lr': 0.0001705687201331361, 'samples': 17516736, 'steps': 91232, 'loss/train': 1.737615942955017} 11/07/2021 10:05:07 - INFO - __main__ - Step 91234: {'lr': 0.00017056368838696433, 'samples': 17516928, 'steps': 91233, 'loss/train': 2.077085018157959} 11/07/2021 10:05:07 - INFO - __main__ - Step 91235: {'lr': 0.00017055865667658539, 'samples': 17517120, 'steps': 91234, 'loss/train': 0.9895156621932983} 11/07/2021 10:05:08 - INFO - __main__ - Step 91236: {'lr': 0.00017055362500200148, 'samples': 17517312, 'steps': 91235, 'loss/train': 1.5776647329330444} 11/07/2021 10:05:08 - INFO - __main__ - Step 91237: {'lr': 0.00017054859336321487, 'samples': 17517504, 'steps': 91236, 'loss/train': 1.019932746887207} 11/07/2021 10:05:10 - INFO - __main__ - Step 91238: {'lr': 0.00017054356176022785, 'samples': 17517696, 'steps': 91237, 'loss/train': 1.2243003845214844} 11/07/2021 10:05:10 - INFO - __main__ - Step 91239: {'lr': 0.00017053853019304263, 'samples': 17517888, 'steps': 91238, 'loss/train': 1.182449221611023} 11/07/2021 10:05:11 - INFO - __main__ - Step 91240: {'lr': 0.00017053349866166158, 'samples': 17518080, 'steps': 91239, 'loss/train': 1.3004714250564575} 11/07/2021 10:05:11 - INFO - __main__ - Step 91241: {'lr': 0.0001705284671660869, 'samples': 17518272, 'steps': 91240, 'loss/train': 1.228847861289978} 11/07/2021 10:05:11 - INFO - __main__ - Step 91242: {'lr': 0.0001705234357063209, 'samples': 17518464, 'steps': 91241, 'loss/train': 1.5746684074401855} 11/07/2021 10:05:12 - INFO - __main__ - Step 91243: {'lr': 0.0001705184042823658, 'samples': 17518656, 'steps': 91242, 'loss/train': 1.6100540161132812} 11/07/2021 10:05:12 - INFO - __main__ - Step 91244: {'lr': 0.0001705133728942238, 'samples': 17518848, 'steps': 91243, 'loss/train': 1.7854468822479248} 11/07/2021 10:05:13 - INFO - __main__ - Step 91245: {'lr': 0.00017050834154189732, 'samples': 17519040, 'steps': 91244, 'loss/train': 1.5521221160888672} 11/07/2021 10:05:13 - INFO - __main__ - Step 91246: {'lr': 0.0001705033102253885, 'samples': 17519232, 'steps': 91245, 'loss/train': 1.4170786142349243} 11/07/2021 10:05:14 - INFO - __main__ - Step 91247: {'lr': 0.00017049827894469972, 'samples': 17519424, 'steps': 91246, 'loss/train': 1.6119306087493896} 11/07/2021 10:05:14 - INFO - __main__ - Step 91248: {'lr': 0.00017049324769983316, 'samples': 17519616, 'steps': 91247, 'loss/train': 0.8454214334487915} 11/07/2021 10:05:14 - INFO - __main__ - Step 91249: {'lr': 0.0001704882164907911, 'samples': 17519808, 'steps': 91248, 'loss/train': 1.6767888069152832} 11/07/2021 10:05:16 - INFO - __main__ - Step 91250: {'lr': 0.00017048318531757585, 'samples': 17520000, 'steps': 91249, 'loss/train': 1.023941993713379} 11/07/2021 10:05:16 - INFO - __main__ - Step 91251: {'lr': 0.00017047815418018964, 'samples': 17520192, 'steps': 91250, 'loss/train': 5.741248607635498} 11/07/2021 10:05:16 - INFO - __main__ - Step 91252: {'lr': 0.00017047312307863472, 'samples': 17520384, 'steps': 91251, 'loss/train': 5.209644317626953} 11/07/2021 10:05:17 - INFO - __main__ - Step 91253: {'lr': 0.0001704680920129134, 'samples': 17520576, 'steps': 91252, 'loss/train': 1.6376200914382935} 11/07/2021 10:05:17 - INFO - __main__ - Step 91254: {'lr': 0.00017046306098302794, 'samples': 17520768, 'steps': 91253, 'loss/train': 1.4085369110107422} 11/07/2021 10:05:17 - INFO - __main__ - Step 91255: {'lr': 0.00017045802998898069, 'samples': 17520960, 'steps': 91254, 'loss/train': 1.0972890853881836} 11/07/2021 10:05:18 - INFO - __main__ - Step 91256: {'lr': 0.00017045299903077374, 'samples': 17521152, 'steps': 91255, 'loss/train': 1.4143458604812622} 11/07/2021 10:05:19 - INFO - __main__ - Step 91257: {'lr': 0.00017044796810840944, 'samples': 17521344, 'steps': 91256, 'loss/train': 1.656106948852539} 11/07/2021 10:05:19 - INFO - __main__ - Step 91258: {'lr': 0.00017044293722189003, 'samples': 17521536, 'steps': 91257, 'loss/train': 1.4875560998916626} 11/07/2021 10:05:19 - INFO - __main__ - Step 91259: {'lr': 0.0001704379063712178, 'samples': 17521728, 'steps': 91258, 'loss/train': 1.259056568145752} 11/07/2021 10:05:20 - INFO - __main__ - Step 91260: {'lr': 0.00017043287555639508, 'samples': 17521920, 'steps': 91259, 'loss/train': 1.7685084342956543} 11/07/2021 10:05:21 - INFO - __main__ - Step 91261: {'lr': 0.00017042784477742403, 'samples': 17522112, 'steps': 91260, 'loss/train': 1.632550835609436} 11/07/2021 10:05:21 - INFO - __main__ - Step 91262: {'lr': 0.000170422814034307, 'samples': 17522304, 'steps': 91261, 'loss/train': 1.4594502449035645} 11/07/2021 10:05:22 - INFO - __main__ - Step 91263: {'lr': 0.00017041778332704615, 'samples': 17522496, 'steps': 91262, 'loss/train': 1.3940643072128296} 11/07/2021 10:05:22 - INFO - __main__ - Step 91264: {'lr': 0.00017041275265564389, 'samples': 17522688, 'steps': 91263, 'loss/train': 0.6794039607048035} 11/07/2021 10:05:22 - INFO - __main__ - Step 91265: {'lr': 0.0001704077220201024, 'samples': 17522880, 'steps': 91264, 'loss/train': 1.4198641777038574} 11/07/2021 10:05:23 - INFO - __main__ - Step 91266: {'lr': 0.00017040269142042395, 'samples': 17523072, 'steps': 91265, 'loss/train': 1.2591837644577026} 11/07/2021 10:05:24 - INFO - __main__ - Step 91267: {'lr': 0.0001703976608566108, 'samples': 17523264, 'steps': 91266, 'loss/train': 1.9931273460388184} 11/07/2021 10:05:24 - INFO - __main__ - Step 91268: {'lr': 0.0001703926303286654, 'samples': 17523456, 'steps': 91267, 'loss/train': 0.8566461205482483} 11/07/2021 10:05:24 - INFO - __main__ - Step 91269: {'lr': 0.0001703875998365897, 'samples': 17523648, 'steps': 91268, 'loss/train': 1.2942326068878174} 11/07/2021 10:05:25 - INFO - __main__ - Step 91270: {'lr': 0.00017038256938038614, 'samples': 17523840, 'steps': 91269, 'loss/train': 1.4632172584533691} 11/07/2021 10:05:26 - INFO - __main__ - Step 91271: {'lr': 0.00017037753896005696, 'samples': 17524032, 'steps': 91270, 'loss/train': 1.2853831052780151} 11/07/2021 10:05:26 - INFO - __main__ - Step 91272: {'lr': 0.00017037250857560444, 'samples': 17524224, 'steps': 91271, 'loss/train': 1.3930468559265137} 11/07/2021 10:05:27 - INFO - __main__ - Step 91273: {'lr': 0.0001703674782270308, 'samples': 17524416, 'steps': 91272, 'loss/train': 1.1100115776062012} 11/07/2021 10:05:27 - INFO - __main__ - Step 91274: {'lr': 0.0001703624479143384, 'samples': 17524608, 'steps': 91273, 'loss/train': 1.4274370670318604} 11/07/2021 10:05:27 - INFO - __main__ - Step 91275: {'lr': 0.0001703574176375294, 'samples': 17524800, 'steps': 91274, 'loss/train': 1.37725830078125} 11/07/2021 10:05:29 - INFO - __main__ - Step 91276: {'lr': 0.00017035238739660614, 'samples': 17524992, 'steps': 91275, 'loss/train': 1.5744858980178833} 11/07/2021 10:05:29 - INFO - __main__ - Step 91277: {'lr': 0.0001703473571915709, 'samples': 17525184, 'steps': 91276, 'loss/train': 1.1742534637451172} 11/07/2021 10:05:29 - INFO - __main__ - Step 91278: {'lr': 0.00017034232702242585, 'samples': 17525376, 'steps': 91277, 'loss/train': 1.0503185987472534} 11/07/2021 10:05:30 - INFO - __main__ - Step 91279: {'lr': 0.00017033729688917338, 'samples': 17525568, 'steps': 91278, 'loss/train': 1.1750143766403198} 11/07/2021 10:05:30 - INFO - __main__ - Step 91280: {'lr': 0.00017033226679181562, 'samples': 17525760, 'steps': 91279, 'loss/train': 1.3386129140853882} 11/07/2021 10:05:30 - INFO - __main__ - Step 91281: {'lr': 0.0001703272367303551, 'samples': 17525952, 'steps': 91280, 'loss/train': 1.2879928350448608} 11/07/2021 10:05:31 - INFO - __main__ - Step 91282: {'lr': 0.00017032220670479376, 'samples': 17526144, 'steps': 91281, 'loss/train': 1.44230055809021} 11/07/2021 10:05:32 - INFO - __main__ - Step 91283: {'lr': 0.00017031717671513397, 'samples': 17526336, 'steps': 91282, 'loss/train': 1.0570570230484009} 11/07/2021 10:05:32 - INFO - __main__ - Step 91284: {'lr': 0.00017031214676137808, 'samples': 17526528, 'steps': 91283, 'loss/train': 1.0395429134368896} 11/07/2021 10:05:32 - INFO - __main__ - Step 91285: {'lr': 0.00017030711684352828, 'samples': 17526720, 'steps': 91284, 'loss/train': 1.0825468301773071} 11/07/2021 10:05:33 - INFO - __main__ - Step 91286: {'lr': 0.00017030208696158685, 'samples': 17526912, 'steps': 91285, 'loss/train': 1.417251706123352} 11/07/2021 10:05:34 - INFO - __main__ - Step 91287: {'lr': 0.0001702970571155561, 'samples': 17527104, 'steps': 91286, 'loss/train': 0.7226258516311646} 11/07/2021 10:05:34 - INFO - __main__ - Step 91288: {'lr': 0.00017029202730543824, 'samples': 17527296, 'steps': 91287, 'loss/train': 1.1899722814559937} 11/07/2021 10:05:35 - INFO - __main__ - Step 91289: {'lr': 0.00017028699753123558, 'samples': 17527488, 'steps': 91288, 'loss/train': 0.8273909687995911} 11/07/2021 10:05:35 - INFO - __main__ - Step 91290: {'lr': 0.00017028196779295034, 'samples': 17527680, 'steps': 91289, 'loss/train': 1.4476888179779053} 11/07/2021 10:05:35 - INFO - __main__ - Step 91291: {'lr': 0.00017027693809058486, 'samples': 17527872, 'steps': 91290, 'loss/train': 1.4683811664581299} 11/07/2021 10:05:36 - INFO - __main__ - Step 91292: {'lr': 0.00017027190842414135, 'samples': 17528064, 'steps': 91291, 'loss/train': 1.3691519498825073} 11/07/2021 10:05:37 - INFO - __main__ - Step 91293: {'lr': 0.00017026687879362207, 'samples': 17528256, 'steps': 91292, 'loss/train': 1.822762370109558} 11/07/2021 10:05:37 - INFO - __main__ - Step 91294: {'lr': 0.00017026184919902932, 'samples': 17528448, 'steps': 91293, 'loss/train': 1.1649945974349976} 11/07/2021 10:05:37 - INFO - __main__ - Step 91295: {'lr': 0.00017025681964036546, 'samples': 17528640, 'steps': 91294, 'loss/train': 1.08896005153656} 11/07/2021 10:05:38 - INFO - __main__ - Step 91296: {'lr': 0.00017025179011763254, 'samples': 17528832, 'steps': 91295, 'loss/train': 0.8253346681594849} 11/07/2021 10:05:38 - INFO - __main__ - Step 91297: {'lr': 0.0001702467606308329, 'samples': 17529024, 'steps': 91296, 'loss/train': 1.5187629461288452} 11/07/2021 10:05:39 - INFO - __main__ - Step 91298: {'lr': 0.00017024173117996888, 'samples': 17529216, 'steps': 91297, 'loss/train': 1.7033365964889526} 11/07/2021 10:05:39 - INFO - __main__ - Step 91299: {'lr': 0.00017023670176504268, 'samples': 17529408, 'steps': 91298, 'loss/train': 1.0987898111343384} 11/07/2021 10:05:40 - INFO - __main__ - Step 91300: {'lr': 0.0001702316723860566, 'samples': 17529600, 'steps': 91299, 'loss/train': 1.249964714050293} 11/07/2021 10:05:40 - INFO - __main__ - Step 91301: {'lr': 0.00017022664304301287, 'samples': 17529792, 'steps': 91300, 'loss/train': 1.7646725177764893} 11/07/2021 10:05:41 - INFO - __main__ - Step 91302: {'lr': 0.00017022161373591384, 'samples': 17529984, 'steps': 91301, 'loss/train': 1.3919748067855835} 11/07/2021 10:05:42 - INFO - __main__ - Step 91303: {'lr': 0.0001702165844647617, 'samples': 17530176, 'steps': 91302, 'loss/train': 1.3627492189407349} 11/07/2021 10:05:42 - INFO - __main__ - Step 91304: {'lr': 0.00017021155522955873, 'samples': 17530368, 'steps': 91303, 'loss/train': 1.3158725500106812} 11/07/2021 10:05:42 - INFO - __main__ - Step 91305: {'lr': 0.00017020652603030718, 'samples': 17530560, 'steps': 91304, 'loss/train': 1.5817729234695435} 11/07/2021 10:05:43 - INFO - __main__ - Step 91306: {'lr': 0.00017020149686700937, 'samples': 17530752, 'steps': 91305, 'loss/train': 1.1036608219146729} 11/07/2021 10:05:43 - INFO - __main__ - Step 91307: {'lr': 0.0001701964677396675, 'samples': 17530944, 'steps': 91306, 'loss/train': 0.7248578071594238} 11/07/2021 10:05:44 - INFO - __main__ - Step 91308: {'lr': 0.00017019143864828402, 'samples': 17531136, 'steps': 91307, 'loss/train': 1.6306313276290894} 11/07/2021 10:05:44 - INFO - __main__ - Step 91309: {'lr': 0.00017018640959286092, 'samples': 17531328, 'steps': 91308, 'loss/train': 1.2462589740753174} 11/07/2021 10:05:45 - INFO - __main__ - Step 91310: {'lr': 0.0001701813805734006, 'samples': 17531520, 'steps': 91309, 'loss/train': 1.6403032541275024} 11/07/2021 10:05:45 - INFO - __main__ - Step 91311: {'lr': 0.0001701763515899053, 'samples': 17531712, 'steps': 91310, 'loss/train': 1.5035629272460938} 11/07/2021 10:05:45 - INFO - __main__ - Step 91312: {'lr': 0.00017017132264237727, 'samples': 17531904, 'steps': 91311, 'loss/train': 1.4140117168426514} 11/07/2021 10:05:46 - INFO - __main__ - Step 91313: {'lr': 0.00017016629373081888, 'samples': 17532096, 'steps': 91312, 'loss/train': 1.4324088096618652} 11/07/2021 10:05:47 - INFO - __main__ - Step 91314: {'lr': 0.0001701612648552323, 'samples': 17532288, 'steps': 91313, 'loss/train': 0.8310604095458984} 11/07/2021 10:05:47 - INFO - __main__ - Step 91315: {'lr': 0.0001701562360156198, 'samples': 17532480, 'steps': 91314, 'loss/train': 1.423612117767334} 11/07/2021 10:05:48 - INFO - __main__ - Step 91316: {'lr': 0.00017015120721198371, 'samples': 17532672, 'steps': 91315, 'loss/train': 1.5770390033721924} 11/07/2021 10:05:48 - INFO - __main__ - Step 91317: {'lr': 0.00017014617844432622, 'samples': 17532864, 'steps': 91316, 'loss/train': 1.3993326425552368} 11/07/2021 10:05:49 - INFO - __main__ - Step 91318: {'lr': 0.00017014114971264965, 'samples': 17533056, 'steps': 91317, 'loss/train': 1.5191997289657593} 11/07/2021 10:05:49 - INFO - __main__ - Step 91319: {'lr': 0.00017013612101695623, 'samples': 17533248, 'steps': 91318, 'loss/train': 1.9227179288864136} 11/07/2021 10:05:50 - INFO - __main__ - Step 91320: {'lr': 0.00017013109235724827, 'samples': 17533440, 'steps': 91319, 'loss/train': 1.4266142845153809} 11/07/2021 10:05:50 - INFO - __main__ - Step 91321: {'lr': 0.00017012606373352797, 'samples': 17533632, 'steps': 91320, 'loss/train': 1.1234689950942993} 11/07/2021 10:05:50 - INFO - __main__ - Step 91322: {'lr': 0.00017012103514579775, 'samples': 17533824, 'steps': 91321, 'loss/train': 1.1261495351791382} 11/07/2021 10:05:52 - INFO - __main__ - Step 91323: {'lr': 0.00017011600659405969, 'samples': 17534016, 'steps': 91322, 'loss/train': 1.666127324104309} 11/07/2021 10:05:52 - INFO - __main__ - Step 91324: {'lr': 0.00017011097807831607, 'samples': 17534208, 'steps': 91323, 'loss/train': 1.6636275053024292} 11/07/2021 10:05:53 - INFO - __main__ - Step 91325: {'lr': 0.00017010594959856922, 'samples': 17534400, 'steps': 91324, 'loss/train': 1.7673819065093994} 11/07/2021 10:05:53 - INFO - __main__ - Step 91326: {'lr': 0.00017010092115482143, 'samples': 17534592, 'steps': 91325, 'loss/train': 1.5833711624145508} 11/07/2021 10:05:53 - INFO - __main__ - Step 91327: {'lr': 0.0001700958927470749, 'samples': 17534784, 'steps': 91326, 'loss/train': 1.3325220346450806} 11/07/2021 10:05:54 - INFO - __main__ - Step 91328: {'lr': 0.00017009086437533194, 'samples': 17534976, 'steps': 91327, 'loss/train': 1.8277114629745483} 11/07/2021 10:05:54 - INFO - __main__ - Step 91329: {'lr': 0.0001700858360395948, 'samples': 17535168, 'steps': 91328, 'loss/train': 1.6297506093978882} 11/07/2021 10:05:55 - INFO - __main__ - Step 91330: {'lr': 0.00017008080773986577, 'samples': 17535360, 'steps': 91329, 'loss/train': 1.2802298069000244} 11/07/2021 10:05:55 - INFO - __main__ - Step 91331: {'lr': 0.00017007577947614704, 'samples': 17535552, 'steps': 91330, 'loss/train': 1.123862385749817} 11/07/2021 10:05:56 - INFO - __main__ - Step 91332: {'lr': 0.000170070751248441, 'samples': 17535744, 'steps': 91331, 'loss/train': 1.392352819442749} 11/07/2021 10:05:56 - INFO - __main__ - Step 91333: {'lr': 0.00017006572305674987, 'samples': 17535936, 'steps': 91332, 'loss/train': 1.1864266395568848} 11/07/2021 10:05:56 - INFO - __main__ - Step 91334: {'lr': 0.00017006069490107584, 'samples': 17536128, 'steps': 91333, 'loss/train': 1.6544946432113647} 11/07/2021 10:05:57 - INFO - __main__ - Step 91335: {'lr': 0.00017005566678142127, 'samples': 17536320, 'steps': 91334, 'loss/train': 1.375345230102539} 11/07/2021 10:05:58 - INFO - __main__ - Step 91336: {'lr': 0.00017005063869778833, 'samples': 17536512, 'steps': 91335, 'loss/train': 1.4996651411056519} 11/07/2021 10:05:58 - INFO - __main__ - Step 91337: {'lr': 0.00017004561065017934, 'samples': 17536704, 'steps': 91336, 'loss/train': 1.4637587070465088} 11/07/2021 10:05:58 - INFO - __main__ - Step 91338: {'lr': 0.00017004058263859657, 'samples': 17536896, 'steps': 91337, 'loss/train': 1.1433144807815552} 11/07/2021 10:05:59 - INFO - __main__ - Step 91339: {'lr': 0.00017003555466304227, 'samples': 17537088, 'steps': 91338, 'loss/train': 1.24209463596344} 11/07/2021 10:06:00 - INFO - __main__ - Step 91340: {'lr': 0.00017003052672351875, 'samples': 17537280, 'steps': 91339, 'loss/train': 1.2400413751602173} 11/07/2021 10:06:00 - INFO - __main__ - Step 91341: {'lr': 0.00017002549882002822, 'samples': 17537472, 'steps': 91340, 'loss/train': 1.3349922895431519} 11/07/2021 10:06:01 - INFO - __main__ - Step 91342: {'lr': 0.00017002047095257295, 'samples': 17537664, 'steps': 91341, 'loss/train': 0.7937454581260681} 11/07/2021 10:06:01 - INFO - __main__ - Step 91343: {'lr': 0.00017001544312115522, 'samples': 17537856, 'steps': 91342, 'loss/train': 1.2542108297348022} 11/07/2021 10:06:01 - INFO - __main__ - Step 91344: {'lr': 0.00017001041532577736, 'samples': 17538048, 'steps': 91343, 'loss/train': 1.116538405418396} 11/07/2021 10:06:02 - INFO - __main__ - Step 91345: {'lr': 0.00017000538756644151, 'samples': 17538240, 'steps': 91344, 'loss/train': 1.1661741733551025} 11/07/2021 10:06:03 - INFO - __main__ - Step 91346: {'lr': 0.00017000035984315003, 'samples': 17538432, 'steps': 91345, 'loss/train': 1.2888091802597046} 11/07/2021 10:06:03 - INFO - __main__ - Step 91347: {'lr': 0.00016999533215590512, 'samples': 17538624, 'steps': 91346, 'loss/train': 0.9945414066314697} 11/07/2021 10:06:03 - INFO - __main__ - Step 91348: {'lr': 0.0001699903045047091, 'samples': 17538816, 'steps': 91347, 'loss/train': 1.3359545469284058} 11/07/2021 10:06:04 - INFO - __main__ - Step 91349: {'lr': 0.00016998527688956425, 'samples': 17539008, 'steps': 91348, 'loss/train': 1.038851022720337} 11/07/2021 10:06:04 - INFO - __main__ - Step 91350: {'lr': 0.00016998024931047273, 'samples': 17539200, 'steps': 91349, 'loss/train': 1.7904645204544067} 11/07/2021 10:06:05 - INFO - __main__ - Step 91351: {'lr': 0.0001699752217674369, 'samples': 17539392, 'steps': 91350, 'loss/train': 1.4216305017471313} 11/07/2021 10:06:05 - INFO - __main__ - Step 91352: {'lr': 0.000169970194260459, 'samples': 17539584, 'steps': 91351, 'loss/train': 1.0596171617507935} 11/07/2021 10:06:06 - INFO - __main__ - Step 91353: {'lr': 0.00016996516678954133, 'samples': 17539776, 'steps': 91352, 'loss/train': 1.16170334815979} 11/07/2021 10:06:06 - INFO - __main__ - Step 91354: {'lr': 0.00016996013935468608, 'samples': 17539968, 'steps': 91353, 'loss/train': 1.3688372373580933} 11/07/2021 10:06:06 - INFO - __main__ - Step 91355: {'lr': 0.0001699551119558956, 'samples': 17540160, 'steps': 91354, 'loss/train': 1.3381586074829102} 11/07/2021 10:06:08 - INFO - __main__ - Step 91356: {'lr': 0.00016995008459317208, 'samples': 17540352, 'steps': 91355, 'loss/train': 1.4405008554458618} 11/07/2021 10:06:08 - INFO - __main__ - Step 91357: {'lr': 0.00016994505726651782, 'samples': 17540544, 'steps': 91356, 'loss/train': 1.3179986476898193} 11/07/2021 10:06:08 - INFO - __main__ - Step 91358: {'lr': 0.00016994002997593505, 'samples': 17540736, 'steps': 91357, 'loss/train': 1.272719144821167} 11/07/2021 10:06:09 - INFO - __main__ - Step 91359: {'lr': 0.0001699350027214261, 'samples': 17540928, 'steps': 91358, 'loss/train': 1.293900489807129} 11/07/2021 10:06:09 - INFO - __main__ - Step 91360: {'lr': 0.00016992997550299322, 'samples': 17541120, 'steps': 91359, 'loss/train': 1.39144766330719} 11/07/2021 10:06:10 - INFO - __main__ - Step 91361: {'lr': 0.0001699249483206386, 'samples': 17541312, 'steps': 91360, 'loss/train': 1.3576842546463013} 11/07/2021 10:06:11 - INFO - __main__ - Step 91362: {'lr': 0.00016991992117436466, 'samples': 17541504, 'steps': 91361, 'loss/train': 1.4091811180114746} 11/07/2021 10:06:11 - INFO - __main__ - Step 91363: {'lr': 0.00016991489406417348, 'samples': 17541696, 'steps': 91362, 'loss/train': 1.5709842443466187} 11/07/2021 10:06:11 - INFO - __main__ - Step 91364: {'lr': 0.00016990986699006743, 'samples': 17541888, 'steps': 91363, 'loss/train': 1.4314543008804321} 11/07/2021 10:06:12 - INFO - __main__ - Step 91365: {'lr': 0.00016990483995204877, 'samples': 17542080, 'steps': 91364, 'loss/train': 1.154292106628418} 11/07/2021 10:06:13 - INFO - __main__ - Step 91366: {'lr': 0.0001698998129501198, 'samples': 17542272, 'steps': 91365, 'loss/train': 1.7784796953201294} 11/07/2021 10:06:13 - INFO - __main__ - Step 91367: {'lr': 0.00016989478598428267, 'samples': 17542464, 'steps': 91366, 'loss/train': 1.2098374366760254} 11/07/2021 10:06:13 - INFO - __main__ - Step 91368: {'lr': 0.00016988975905453974, 'samples': 17542656, 'steps': 91367, 'loss/train': 1.4338330030441284} 11/07/2021 10:06:14 - INFO - __main__ - Step 91369: {'lr': 0.00016988473216089322, 'samples': 17542848, 'steps': 91368, 'loss/train': 1.054404616355896} 11/07/2021 10:06:14 - INFO - __main__ - Step 91370: {'lr': 0.00016987970530334544, 'samples': 17543040, 'steps': 91369, 'loss/train': 1.431832194328308} 11/07/2021 10:06:14 - INFO - __main__ - Step 91371: {'lr': 0.00016987467848189857, 'samples': 17543232, 'steps': 91370, 'loss/train': 1.4804768562316895} 11/07/2021 10:06:16 - INFO - __main__ - Step 91372: {'lr': 0.000169869651696555, 'samples': 17543424, 'steps': 91371, 'loss/train': 1.6084606647491455} 11/07/2021 10:06:16 - INFO - __main__ - Step 91373: {'lr': 0.0001698646249473169, 'samples': 17543616, 'steps': 91372, 'loss/train': 0.9786996245384216} 11/07/2021 10:06:16 - INFO - __main__ - Step 91374: {'lr': 0.00016985959823418657, 'samples': 17543808, 'steps': 91373, 'loss/train': 1.7994980812072754} 11/07/2021 10:06:17 - INFO - __main__ - Step 91375: {'lr': 0.00016985457155716625, 'samples': 17544000, 'steps': 91374, 'loss/train': 1.4940910339355469} 11/07/2021 10:06:17 - INFO - __main__ - Step 91376: {'lr': 0.00016984954491625832, 'samples': 17544192, 'steps': 91375, 'loss/train': 1.0245786905288696} 11/07/2021 10:06:18 - INFO - __main__ - Step 91377: {'lr': 0.00016984451831146487, 'samples': 17544384, 'steps': 91376, 'loss/train': 1.4738049507141113} 11/07/2021 10:06:19 - INFO - __main__ - Step 91378: {'lr': 0.00016983949174278822, 'samples': 17544576, 'steps': 91377, 'loss/train': 0.21175001561641693} 11/07/2021 10:06:19 - INFO - __main__ - Step 91379: {'lr': 0.0001698344652102307, 'samples': 17544768, 'steps': 91378, 'loss/train': 0.986032247543335} 11/07/2021 10:06:19 - INFO - __main__ - Step 91380: {'lr': 0.0001698294387137945, 'samples': 17544960, 'steps': 91379, 'loss/train': 1.027890920639038} 11/07/2021 10:06:20 - INFO - __main__ - Step 91381: {'lr': 0.0001698244122534819, 'samples': 17545152, 'steps': 91380, 'loss/train': 1.8220692873001099} 11/07/2021 10:06:21 - INFO - __main__ - Step 91382: {'lr': 0.00016981938582929522, 'samples': 17545344, 'steps': 91381, 'loss/train': 1.2277438640594482} 11/07/2021 10:06:21 - INFO - __main__ - Step 91383: {'lr': 0.0001698143594412367, 'samples': 17545536, 'steps': 91382, 'loss/train': 1.226301670074463} 11/07/2021 10:06:22 - INFO - __main__ - Step 91384: {'lr': 0.00016980933308930854, 'samples': 17545728, 'steps': 91383, 'loss/train': 1.2188366651535034} 11/07/2021 10:06:22 - INFO - __main__ - Step 91385: {'lr': 0.00016980430677351308, 'samples': 17545920, 'steps': 91384, 'loss/train': 0.9754796624183655} 11/07/2021 10:06:22 - INFO - __main__ - Step 91386: {'lr': 0.00016979928049385258, 'samples': 17546112, 'steps': 91385, 'loss/train': 0.9445826411247253} 11/07/2021 10:06:23 - INFO - __main__ - Step 91387: {'lr': 0.00016979425425032925, 'samples': 17546304, 'steps': 91386, 'loss/train': 1.7047171592712402} 11/07/2021 10:06:24 - INFO - __main__ - Step 91388: {'lr': 0.00016978922804294545, 'samples': 17546496, 'steps': 91387, 'loss/train': 1.0884947776794434} 11/07/2021 10:06:24 - INFO - __main__ - Step 91389: {'lr': 0.00016978420187170343, 'samples': 17546688, 'steps': 91388, 'loss/train': 1.6704461574554443} 11/07/2021 10:06:24 - INFO - __main__ - Step 91390: {'lr': 0.00016977917573660534, 'samples': 17546880, 'steps': 91389, 'loss/train': 1.3551080226898193} 11/07/2021 10:06:25 - INFO - __main__ - Step 91391: {'lr': 0.00016977414963765348, 'samples': 17547072, 'steps': 91390, 'loss/train': 0.42703455686569214} 11/07/2021 10:06:26 - INFO - __main__ - Step 91392: {'lr': 0.0001697691235748502, 'samples': 17547264, 'steps': 91391, 'loss/train': 1.539650797843933} 11/07/2021 10:06:26 - INFO - __main__ - Step 91393: {'lr': 0.00016976409754819767, 'samples': 17547456, 'steps': 91392, 'loss/train': 1.3692604303359985} 11/07/2021 10:06:26 - INFO - __main__ - Step 91394: {'lr': 0.0001697590715576982, 'samples': 17547648, 'steps': 91393, 'loss/train': 1.8121347427368164} 11/07/2021 10:06:27 - INFO - __main__ - Step 91395: {'lr': 0.00016975404560335412, 'samples': 17547840, 'steps': 91394, 'loss/train': 1.689666748046875} 11/07/2021 10:06:27 - INFO - __main__ - Step 91396: {'lr': 0.00016974901968516758, 'samples': 17548032, 'steps': 91395, 'loss/train': 0.5615774989128113} 11/07/2021 10:06:28 - INFO - __main__ - Step 91397: {'lr': 0.00016974399380314086, 'samples': 17548224, 'steps': 91396, 'loss/train': 1.2535333633422852} 11/07/2021 10:06:30 - INFO - __main__ - Step 91398: {'lr': 0.0001697389679572763, 'samples': 17548416, 'steps': 91397, 'loss/train': 1.4629418849945068} 11/07/2021 10:06:30 - INFO - __main__ - Step 91399: {'lr': 0.00016973394214757614, 'samples': 17548608, 'steps': 91398, 'loss/train': 1.2885141372680664} 11/07/2021 10:06:30 - INFO - __main__ - Step 91400: {'lr': 0.00016972891637404258, 'samples': 17548800, 'steps': 91399, 'loss/train': 0.8635687232017517} 11/07/2021 10:06:31 - INFO - __main__ - Step 91401: {'lr': 0.00016972389063667798, 'samples': 17548992, 'steps': 91400, 'loss/train': 1.7840617895126343} 11/07/2021 10:06:31 - INFO - __main__ - Step 91402: {'lr': 0.0001697188649354846, 'samples': 17549184, 'steps': 91401, 'loss/train': 1.7274928092956543} 11/07/2021 10:06:31 - INFO - __main__ - Step 91403: {'lr': 0.00016971383927046464, 'samples': 17549376, 'steps': 91402, 'loss/train': 1.7255117893218994} 11/07/2021 10:06:32 - INFO - __main__ - Step 91404: {'lr': 0.00016970881364162033, 'samples': 17549568, 'steps': 91403, 'loss/train': 1.648003339767456} 11/07/2021 10:06:33 - INFO - __main__ - Step 91405: {'lr': 0.00016970378804895397, 'samples': 17549760, 'steps': 91404, 'loss/train': 1.5031654834747314} 11/07/2021 10:06:33 - INFO - __main__ - Step 91406: {'lr': 0.00016969876249246787, 'samples': 17549952, 'steps': 91405, 'loss/train': 1.1561403274536133} 11/07/2021 10:06:33 - INFO - __main__ - Step 91407: {'lr': 0.0001696937369721643, 'samples': 17550144, 'steps': 91406, 'loss/train': 1.7646799087524414} 11/07/2021 10:06:34 - INFO - __main__ - Step 91408: {'lr': 0.00016968871148804543, 'samples': 17550336, 'steps': 91407, 'loss/train': 1.5570539236068726} 11/07/2021 10:06:34 - INFO - __main__ - Step 91409: {'lr': 0.00016968368604011364, 'samples': 17550528, 'steps': 91408, 'loss/train': 1.100040316581726} 11/07/2021 10:06:35 - INFO - __main__ - Step 91410: {'lr': 0.0001696786606283711, 'samples': 17550720, 'steps': 91409, 'loss/train': 1.593117594718933} 11/07/2021 10:06:36 - INFO - __main__ - Step 91411: {'lr': 0.00016967363525282014, 'samples': 17550912, 'steps': 91410, 'loss/train': 1.7371625900268555} 11/07/2021 10:06:36 - INFO - __main__ - Step 91412: {'lr': 0.000169668609913463, 'samples': 17551104, 'steps': 91411, 'loss/train': 1.8437108993530273} 11/07/2021 10:06:36 - INFO - __main__ - Step 91413: {'lr': 0.00016966358461030195, 'samples': 17551296, 'steps': 91412, 'loss/train': 1.2218645811080933} 11/07/2021 10:06:37 - INFO - __main__ - Step 91414: {'lr': 0.00016965855934333925, 'samples': 17551488, 'steps': 91413, 'loss/train': 2.0148086547851562} 11/07/2021 10:06:37 - INFO - __main__ - Step 91415: {'lr': 0.00016965353411257713, 'samples': 17551680, 'steps': 91414, 'loss/train': 5.749205589294434} 11/07/2021 10:06:38 - INFO - __main__ - Step 91416: {'lr': 0.00016964850891801802, 'samples': 17551872, 'steps': 91415, 'loss/train': 1.6716234683990479} 11/07/2021 10:06:38 - INFO - __main__ - Step 91417: {'lr': 0.00016964348375966395, 'samples': 17552064, 'steps': 91416, 'loss/train': 0.9376537799835205} 11/07/2021 10:06:39 - INFO - __main__ - Step 91418: {'lr': 0.0001696384586375173, 'samples': 17552256, 'steps': 91417, 'loss/train': 1.2599791288375854} 11/07/2021 10:06:39 - INFO - __main__ - Step 91419: {'lr': 0.00016963343355158028, 'samples': 17552448, 'steps': 91418, 'loss/train': 1.693266749382019} 11/07/2021 10:06:39 - INFO - __main__ - Step 91420: {'lr': 0.00016962840850185524, 'samples': 17552640, 'steps': 91419, 'loss/train': 1.2823350429534912} 11/07/2021 10:06:40 - INFO - __main__ - Step 91421: {'lr': 0.00016962338348834436, 'samples': 17552832, 'steps': 91420, 'loss/train': 1.771357536315918} 11/07/2021 10:06:41 - INFO - __main__ - Step 91422: {'lr': 0.00016961835851104996, 'samples': 17553024, 'steps': 91421, 'loss/train': 1.9169528484344482} 11/07/2021 10:06:41 - INFO - __main__ - Step 91423: {'lr': 0.00016961333356997426, 'samples': 17553216, 'steps': 91422, 'loss/train': 0.9466032981872559} 11/07/2021 10:06:41 - INFO - __main__ - Step 91424: {'lr': 0.0001696083086651196, 'samples': 17553408, 'steps': 91423, 'loss/train': 1.3336492776870728} 11/07/2021 10:06:42 - INFO - __main__ - Step 91425: {'lr': 0.00016960328379648818, 'samples': 17553600, 'steps': 91424, 'loss/train': 1.7499011754989624} 11/07/2021 10:06:43 - INFO - __main__ - Step 91426: {'lr': 0.00016959825896408227, 'samples': 17553792, 'steps': 91425, 'loss/train': 1.6493912935256958} 11/07/2021 10:06:43 - INFO - __main__ - Step 91427: {'lr': 0.00016959323416790414, 'samples': 17553984, 'steps': 91426, 'loss/train': 1.375730276107788} 11/07/2021 10:06:43 - INFO - __main__ - Step 91428: {'lr': 0.00016958820940795604, 'samples': 17554176, 'steps': 91427, 'loss/train': 1.5312517881393433} 11/07/2021 10:06:44 - INFO - __main__ - Step 91429: {'lr': 0.00016958318468424043, 'samples': 17554368, 'steps': 91428, 'loss/train': 1.620551586151123} 11/07/2021 10:06:44 - INFO - __main__ - Step 91430: {'lr': 0.00016957815999675923, 'samples': 17554560, 'steps': 91429, 'loss/train': 1.2158445119857788} 11/07/2021 10:06:45 - INFO - __main__ - Step 91431: {'lr': 0.0001695731353455149, 'samples': 17554752, 'steps': 91430, 'loss/train': 1.4678622484207153} 11/07/2021 10:06:45 - INFO - __main__ - Step 91432: {'lr': 0.00016956811073050963, 'samples': 17554944, 'steps': 91431, 'loss/train': 0.9927523136138916} 11/07/2021 10:06:46 - INFO - __main__ - Step 91433: {'lr': 0.00016956308615174575, 'samples': 17555136, 'steps': 91432, 'loss/train': 1.3575682640075684} 11/07/2021 10:06:46 - INFO - __main__ - Step 91434: {'lr': 0.00016955806160922553, 'samples': 17555328, 'steps': 91433, 'loss/train': 0.8072934150695801} 11/07/2021 10:06:47 - INFO - __main__ - Step 91435: {'lr': 0.00016955303710295116, 'samples': 17555520, 'steps': 91434, 'loss/train': 1.659541368484497} 11/07/2021 10:06:47 - INFO - __main__ - Step 91436: {'lr': 0.00016954801263292498, 'samples': 17555712, 'steps': 91435, 'loss/train': 1.209855318069458} 11/07/2021 10:06:48 - INFO - __main__ - Step 91437: {'lr': 0.0001695429881991492, 'samples': 17555904, 'steps': 91436, 'loss/train': 0.2428291141986847} 11/07/2021 10:06:48 - INFO - __main__ - Step 91438: {'lr': 0.00016953796380162614, 'samples': 17556096, 'steps': 91437, 'loss/train': 1.3967188596725464} 11/07/2021 10:06:49 - INFO - __main__ - Step 91439: {'lr': 0.00016953293944035801, 'samples': 17556288, 'steps': 91438, 'loss/train': 1.2772603034973145} 11/07/2021 10:06:49 - INFO - __main__ - Step 91440: {'lr': 0.0001695279151153471, 'samples': 17556480, 'steps': 91439, 'loss/train': 0.8452351093292236} 11/07/2021 10:06:49 - INFO - __main__ - Step 91441: {'lr': 0.00016952289082659567, 'samples': 17556672, 'steps': 91440, 'loss/train': 1.347325325012207} 11/07/2021 10:06:50 - INFO - __main__ - Step 91442: {'lr': 0.000169517866574106, 'samples': 17556864, 'steps': 91441, 'loss/train': 0.8533405065536499} 11/07/2021 10:06:51 - INFO - __main__ - Step 91443: {'lr': 0.00016951284235788041, 'samples': 17557056, 'steps': 91442, 'loss/train': 1.9587913751602173} 11/07/2021 10:06:51 - INFO - __main__ - Step 91444: {'lr': 0.00016950781817792103, 'samples': 17557248, 'steps': 91443, 'loss/train': 1.6041033267974854} 11/07/2021 10:06:51 - INFO - __main__ - Step 91445: {'lr': 0.00016950279403423014, 'samples': 17557440, 'steps': 91444, 'loss/train': 1.0083976984024048} 11/07/2021 10:06:52 - INFO - __main__ - Step 91446: {'lr': 0.00016949776992681009, 'samples': 17557632, 'steps': 91445, 'loss/train': 1.1514307260513306} 11/07/2021 10:06:53 - INFO - __main__ - Step 91447: {'lr': 0.00016949274585566308, 'samples': 17557824, 'steps': 91446, 'loss/train': 1.3463484048843384} 11/07/2021 10:06:53 - INFO - __main__ - Step 91448: {'lr': 0.00016948772182079138, 'samples': 17558016, 'steps': 91447, 'loss/train': 1.0498937368392944} 11/07/2021 10:06:53 - INFO - __main__ - Step 91449: {'lr': 0.0001694826978221973, 'samples': 17558208, 'steps': 91448, 'loss/train': 1.3894622325897217} 11/07/2021 10:06:54 - INFO - __main__ - Step 91450: {'lr': 0.00016947767385988306, 'samples': 17558400, 'steps': 91449, 'loss/train': 0.934883177280426} 11/07/2021 10:06:54 - INFO - __main__ - Step 91451: {'lr': 0.00016947264993385093, 'samples': 17558592, 'steps': 91450, 'loss/train': 1.277622938156128} 11/07/2021 10:06:55 - INFO - __main__ - Step 91452: {'lr': 0.00016946762604410322, 'samples': 17558784, 'steps': 91451, 'loss/train': 1.4934628009796143} 11/07/2021 10:06:56 - INFO - __main__ - Step 91453: {'lr': 0.0001694626021906421, 'samples': 17558976, 'steps': 91452, 'loss/train': 1.0196365118026733} 11/07/2021 10:06:56 - INFO - __main__ - Step 91454: {'lr': 0.0001694575783734699, 'samples': 17559168, 'steps': 91453, 'loss/train': 1.2948378324508667} 11/07/2021 10:06:56 - INFO - __main__ - Step 91455: {'lr': 0.0001694525545925889, 'samples': 17559360, 'steps': 91454, 'loss/train': 1.4367897510528564} 11/07/2021 10:06:57 - INFO - __main__ - Step 91456: {'lr': 0.00016944753084800144, 'samples': 17559552, 'steps': 91455, 'loss/train': 1.1497420072555542} 11/07/2021 10:06:58 - INFO - __main__ - Step 91457: {'lr': 0.00016944250713970955, 'samples': 17559744, 'steps': 91456, 'loss/train': 1.5003889799118042} 11/07/2021 10:06:58 - INFO - __main__ - Step 91458: {'lr': 0.00016943748346771563, 'samples': 17559936, 'steps': 91457, 'loss/train': 1.482740879058838} 11/07/2021 10:06:58 - INFO - __main__ - Step 91459: {'lr': 0.00016943245983202195, 'samples': 17560128, 'steps': 91458, 'loss/train': 1.2648441791534424} 11/07/2021 10:06:59 - INFO - __main__ - Step 91460: {'lr': 0.00016942743623263074, 'samples': 17560320, 'steps': 91459, 'loss/train': 1.4880216121673584} 11/07/2021 10:06:59 - INFO - __main__ - Step 91461: {'lr': 0.0001694224126695443, 'samples': 17560512, 'steps': 91460, 'loss/train': 1.3947089910507202} 11/07/2021 10:07:00 - INFO - __main__ - Step 91462: {'lr': 0.00016941738914276488, 'samples': 17560704, 'steps': 91461, 'loss/train': 1.368638515472412} 11/07/2021 10:07:01 - INFO - __main__ - Step 91463: {'lr': 0.00016941236565229474, 'samples': 17560896, 'steps': 91462, 'loss/train': 1.1400138139724731} 11/07/2021 10:07:01 - INFO - __main__ - Step 91464: {'lr': 0.00016940734219813615, 'samples': 17561088, 'steps': 91463, 'loss/train': 1.4385836124420166} 11/07/2021 10:07:01 - INFO - __main__ - Step 91465: {'lr': 0.00016940231878029134, 'samples': 17561280, 'steps': 91464, 'loss/train': 1.1827951669692993} 11/07/2021 10:07:02 - INFO - __main__ - Step 91466: {'lr': 0.00016939729539876264, 'samples': 17561472, 'steps': 91465, 'loss/train': 1.2701451778411865} 11/07/2021 10:07:03 - INFO - __main__ - Step 91467: {'lr': 0.0001693922720535523, 'samples': 17561664, 'steps': 91466, 'loss/train': 1.7637699842453003} 11/07/2021 10:07:03 - INFO - __main__ - Step 91468: {'lr': 0.0001693872487446625, 'samples': 17561856, 'steps': 91467, 'loss/train': 1.4051114320755005} 11/07/2021 10:07:03 - INFO - __main__ - Step 91469: {'lr': 0.0001693822254720956, 'samples': 17562048, 'steps': 91468, 'loss/train': 1.3949165344238281} 11/07/2021 10:07:04 - INFO - __main__ - Step 91470: {'lr': 0.00016937720223585384, 'samples': 17562240, 'steps': 91469, 'loss/train': 1.4033770561218262} 11/07/2021 10:07:04 - INFO - __main__ - Step 91471: {'lr': 0.00016937217903593944, 'samples': 17562432, 'steps': 91470, 'loss/train': 1.4285969734191895} 11/07/2021 10:07:05 - INFO - __main__ - Step 91472: {'lr': 0.00016936715587235465, 'samples': 17562624, 'steps': 91471, 'loss/train': 1.2289648056030273} 11/07/2021 10:07:05 - INFO - __main__ - Step 91473: {'lr': 0.00016936213274510183, 'samples': 17562816, 'steps': 91472, 'loss/train': 1.5354962348937988} 11/07/2021 10:07:06 - INFO - __main__ - Step 91474: {'lr': 0.00016935710965418317, 'samples': 17563008, 'steps': 91473, 'loss/train': 1.7398037910461426} 11/07/2021 10:07:06 - INFO - __main__ - Step 91475: {'lr': 0.00016935208659960094, 'samples': 17563200, 'steps': 91474, 'loss/train': 1.4234386682510376} 11/07/2021 10:07:06 - INFO - __main__ - Step 91476: {'lr': 0.0001693470635813574, 'samples': 17563392, 'steps': 91475, 'loss/train': 1.689540982246399} 11/07/2021 10:07:07 - INFO - __main__ - Step 91477: {'lr': 0.00016934204059945485, 'samples': 17563584, 'steps': 91476, 'loss/train': 1.5041906833648682} 11/07/2021 10:07:08 - INFO - __main__ - Step 91478: {'lr': 0.00016933701765389558, 'samples': 17563776, 'steps': 91477, 'loss/train': 0.23210546374320984} 11/07/2021 10:07:08 - INFO - __main__ - Step 91479: {'lr': 0.00016933199474468175, 'samples': 17563968, 'steps': 91478, 'loss/train': 1.0028499364852905} 11/07/2021 10:07:09 - INFO - __main__ - Step 91480: {'lr': 0.0001693269718718157, 'samples': 17564160, 'steps': 91479, 'loss/train': 0.9684052467346191} 11/07/2021 10:07:09 - INFO - __main__ - Step 91481: {'lr': 0.00016932194903529965, 'samples': 17564352, 'steps': 91480, 'loss/train': 0.6590220332145691} 11/07/2021 10:07:09 - INFO - __main__ - Step 91482: {'lr': 0.0001693169262351359, 'samples': 17564544, 'steps': 91481, 'loss/train': 1.1415656805038452} 11/07/2021 10:07:10 - INFO - __main__ - Step 91483: {'lr': 0.00016931190347132676, 'samples': 17564736, 'steps': 91482, 'loss/train': 1.6176782846450806} 11/07/2021 10:07:11 - INFO - __main__ - Step 91484: {'lr': 0.00016930688074387435, 'samples': 17564928, 'steps': 91483, 'loss/train': 1.48415207862854} 11/07/2021 10:07:11 - INFO - __main__ - Step 91485: {'lr': 0.00016930185805278102, 'samples': 17565120, 'steps': 91484, 'loss/train': 1.276667833328247} 11/07/2021 10:07:11 - INFO - __main__ - Step 91486: {'lr': 0.000169296835398049, 'samples': 17565312, 'steps': 91485, 'loss/train': 1.2868708372116089} 11/07/2021 10:07:12 - INFO - __main__ - Step 91487: {'lr': 0.00016929181277968065, 'samples': 17565504, 'steps': 91486, 'loss/train': 1.1377265453338623} 11/07/2021 10:07:13 - INFO - __main__ - Step 91488: {'lr': 0.00016928679019767812, 'samples': 17565696, 'steps': 91487, 'loss/train': 0.941189169883728} 11/07/2021 10:07:13 - INFO - __main__ - Step 91489: {'lr': 0.0001692817676520438, 'samples': 17565888, 'steps': 91488, 'loss/train': 1.3047637939453125} 11/07/2021 10:07:14 - INFO - __main__ - Step 91490: {'lr': 0.00016927674514277978, 'samples': 17566080, 'steps': 91489, 'loss/train': 1.5291379690170288} 11/07/2021 10:07:14 - INFO - __main__ - Step 91491: {'lr': 0.00016927172266988842, 'samples': 17566272, 'steps': 91490, 'loss/train': 1.6022567749023438} 11/07/2021 10:07:14 - INFO - __main__ - Step 91492: {'lr': 0.000169266700233372, 'samples': 17566464, 'steps': 91491, 'loss/train': 1.1032060384750366} 11/07/2021 10:07:15 - INFO - __main__ - Step 91493: {'lr': 0.00016926167783323272, 'samples': 17566656, 'steps': 91492, 'loss/train': 1.2962689399719238} 11/07/2021 10:07:16 - INFO - __main__ - Step 91494: {'lr': 0.0001692566554694729, 'samples': 17566848, 'steps': 91493, 'loss/train': 0.6063793897628784} 11/07/2021 10:07:16 - INFO - __main__ - Step 91495: {'lr': 0.0001692516331420948, 'samples': 17567040, 'steps': 91494, 'loss/train': 0.9011119604110718} 11/07/2021 10:07:16 - INFO - __main__ - Step 91496: {'lr': 0.00016924661085110064, 'samples': 17567232, 'steps': 91495, 'loss/train': 1.2419886589050293} 11/07/2021 10:07:17 - INFO - __main__ - Step 91497: {'lr': 0.0001692415885964928, 'samples': 17567424, 'steps': 91496, 'loss/train': 1.1872953176498413} 11/07/2021 10:07:18 - INFO - __main__ - Step 91498: {'lr': 0.00016923656637827337, 'samples': 17567616, 'steps': 91497, 'loss/train': 1.3265711069107056} 11/07/2021 10:07:18 - INFO - __main__ - Step 91499: {'lr': 0.0001692315441964447, 'samples': 17567808, 'steps': 91498, 'loss/train': 1.187569260597229} 11/07/2021 10:07:18 - INFO - __main__ - Step 91500: {'lr': 0.00016922652205100913, 'samples': 17568000, 'steps': 91499, 'loss/train': 1.086343765258789} 11/07/2021 10:07:19 - INFO - __main__ - Step 91501: {'lr': 0.0001692214999419688, 'samples': 17568192, 'steps': 91500, 'loss/train': 0.5300586223602295} 11/07/2021 10:07:19 - INFO - __main__ - Step 91502: {'lr': 0.00016921647786932595, 'samples': 17568384, 'steps': 91501, 'loss/train': 1.5823607444763184} 11/07/2021 10:07:20 - INFO - __main__ - Step 91503: {'lr': 0.00016921145583308295, 'samples': 17568576, 'steps': 91502, 'loss/train': 1.588826298713684} 11/07/2021 10:07:20 - INFO - __main__ - Step 91504: {'lr': 0.00016920643383324201, 'samples': 17568768, 'steps': 91503, 'loss/train': 1.1471319198608398} 11/07/2021 10:07:21 - INFO - __main__ - Step 91505: {'lr': 0.00016920141186980541, 'samples': 17568960, 'steps': 91504, 'loss/train': 1.3660355806350708} 11/07/2021 10:07:21 - INFO - __main__ - Step 91506: {'lr': 0.00016919638994277543, 'samples': 17569152, 'steps': 91505, 'loss/train': 0.9032590389251709} 11/07/2021 10:07:22 - INFO - __main__ - Step 91507: {'lr': 0.00016919136805215428, 'samples': 17569344, 'steps': 91506, 'loss/train': 0.8567991256713867} 11/07/2021 10:07:22 - INFO - __main__ - Step 91508: {'lr': 0.00016918634619794427, 'samples': 17569536, 'steps': 91507, 'loss/train': 1.1578190326690674} 11/07/2021 10:07:23 - INFO - __main__ - Step 91509: {'lr': 0.0001691813243801476, 'samples': 17569728, 'steps': 91508, 'loss/train': 1.4066095352172852} 11/07/2021 10:07:23 - INFO - __main__ - Step 91510: {'lr': 0.00016917630259876668, 'samples': 17569920, 'steps': 91509, 'loss/train': 1.2112195491790771} 11/07/2021 10:07:24 - INFO - __main__ - Step 91511: {'lr': 0.00016917128085380367, 'samples': 17570112, 'steps': 91510, 'loss/train': 1.2296018600463867} 11/07/2021 10:07:24 - INFO - __main__ - Step 91512: {'lr': 0.00016916625914526075, 'samples': 17570304, 'steps': 91511, 'loss/train': 0.8430492281913757} 11/07/2021 10:07:24 - INFO - __main__ - Step 91513: {'lr': 0.0001691612374731403, 'samples': 17570496, 'steps': 91512, 'loss/train': 1.2160478830337524} 11/07/2021 10:07:25 - INFO - __main__ - Step 91514: {'lr': 0.00016915621583744452, 'samples': 17570688, 'steps': 91513, 'loss/train': 1.327898383140564} 11/07/2021 10:07:26 - INFO - __main__ - Step 91515: {'lr': 0.0001691511942381757, 'samples': 17570880, 'steps': 91514, 'loss/train': 1.5748200416564941} 11/07/2021 10:07:26 - INFO - __main__ - Step 91516: {'lr': 0.00016914617267533617, 'samples': 17571072, 'steps': 91515, 'loss/train': 1.5171232223510742} 11/07/2021 10:07:26 - INFO - __main__ - Step 91517: {'lr': 0.00016914115114892805, 'samples': 17571264, 'steps': 91516, 'loss/train': 1.5914798974990845} 11/07/2021 10:07:27 - INFO - __main__ - Step 91518: {'lr': 0.0001691361296589537, 'samples': 17571456, 'steps': 91517, 'loss/train': 1.4422930479049683} 11/07/2021 10:07:28 - INFO - __main__ - Step 91519: {'lr': 0.00016913110820541538, 'samples': 17571648, 'steps': 91518, 'loss/train': 1.4757239818572998} 11/07/2021 10:07:28 - INFO - __main__ - Step 91520: {'lr': 0.00016912608678831532, 'samples': 17571840, 'steps': 91519, 'loss/train': 1.5128535032272339} 11/07/2021 10:07:29 - INFO - __main__ - Step 91521: {'lr': 0.00016912106540765582, 'samples': 17572032, 'steps': 91520, 'loss/train': 1.1435019969940186} 11/07/2021 10:07:29 - INFO - __main__ - Step 91522: {'lr': 0.0001691160440634391, 'samples': 17572224, 'steps': 91521, 'loss/train': 1.1952263116836548} 11/07/2021 10:07:29 - INFO - __main__ - Step 91523: {'lr': 0.00016911102275566752, 'samples': 17572416, 'steps': 91522, 'loss/train': 1.4151841402053833} 11/07/2021 10:07:30 - INFO - __main__ - Step 91524: {'lr': 0.0001691060014843432, 'samples': 17572608, 'steps': 91523, 'loss/train': 1.1658700704574585} 11/07/2021 10:07:31 - INFO - __main__ - Step 91525: {'lr': 0.00016910098024946847, 'samples': 17572800, 'steps': 91524, 'loss/train': 1.687246561050415} 11/07/2021 10:07:31 - INFO - __main__ - Step 91526: {'lr': 0.00016909595905104558, 'samples': 17572992, 'steps': 91525, 'loss/train': 1.4205774068832397} 11/07/2021 10:07:31 - INFO - __main__ - Step 91527: {'lr': 0.00016909093788907678, 'samples': 17573184, 'steps': 91526, 'loss/train': 1.2984946966171265} 11/07/2021 10:07:32 - INFO - __main__ - Step 91528: {'lr': 0.0001690859167635644, 'samples': 17573376, 'steps': 91527, 'loss/train': 3.2514474391937256} 11/07/2021 10:07:32 - INFO - __main__ - Step 91529: {'lr': 0.0001690808956745106, 'samples': 17573568, 'steps': 91528, 'loss/train': 1.5650513172149658} 11/07/2021 10:07:33 - INFO - __main__ - Step 91530: {'lr': 0.00016907587462191773, 'samples': 17573760, 'steps': 91529, 'loss/train': 1.3909586668014526} 11/07/2021 10:07:33 - INFO - __main__ - Step 91531: {'lr': 0.00016907085360578803, 'samples': 17573952, 'steps': 91530, 'loss/train': 1.2679804563522339} 11/07/2021 10:07:34 - INFO - __main__ - Step 91532: {'lr': 0.00016906583262612374, 'samples': 17574144, 'steps': 91531, 'loss/train': 1.8132851123809814} 11/07/2021 10:07:34 - INFO - __main__ - Step 91533: {'lr': 0.00016906081168292715, 'samples': 17574336, 'steps': 91532, 'loss/train': 0.9941095113754272} 11/07/2021 10:07:35 - INFO - __main__ - Step 91534: {'lr': 0.00016905579077620048, 'samples': 17574528, 'steps': 91533, 'loss/train': 1.3680247068405151} 11/07/2021 10:07:35 - INFO - __main__ - Step 91535: {'lr': 0.00016905076990594606, 'samples': 17574720, 'steps': 91534, 'loss/train': 1.54427969455719} 11/07/2021 10:07:36 - INFO - __main__ - Step 91536: {'lr': 0.0001690457490721661, 'samples': 17574912, 'steps': 91535, 'loss/train': 1.2356895208358765} 11/07/2021 10:07:36 - INFO - __main__ - Step 91537: {'lr': 0.000169040728274863, 'samples': 17575104, 'steps': 91536, 'loss/train': 1.2296829223632812} 11/07/2021 10:07:37 - INFO - __main__ - Step 91538: {'lr': 0.00016903570751403873, 'samples': 17575296, 'steps': 91537, 'loss/train': 1.6117993593215942} 11/07/2021 10:07:37 - INFO - __main__ - Step 91539: {'lr': 0.0001690306867896958, 'samples': 17575488, 'steps': 91538, 'loss/train': 1.4487824440002441} 11/07/2021 10:07:38 - INFO - __main__ - Step 91540: {'lr': 0.00016902566610183634, 'samples': 17575680, 'steps': 91539, 'loss/train': 1.557719349861145} 11/07/2021 10:07:38 - INFO - __main__ - Step 91541: {'lr': 0.0001690206454504627, 'samples': 17575872, 'steps': 91540, 'loss/train': 1.4886544942855835} 11/07/2021 10:07:39 - INFO - __main__ - Step 91542: {'lr': 0.00016901562483557708, 'samples': 17576064, 'steps': 91541, 'loss/train': 1.09829843044281} 11/07/2021 10:07:39 - INFO - __main__ - Step 91543: {'lr': 0.0001690106042571818, 'samples': 17576256, 'steps': 91542, 'loss/train': 1.1166439056396484} 11/07/2021 10:07:39 - INFO - __main__ - Step 91544: {'lr': 0.00016900558371527906, 'samples': 17576448, 'steps': 91543, 'loss/train': 0.983115553855896} 11/07/2021 10:07:40 - INFO - __main__ - Step 91545: {'lr': 0.00016900056320987117, 'samples': 17576640, 'steps': 91544, 'loss/train': 1.3189078569412231} 11/07/2021 10:07:41 - INFO - __main__ - Step 91546: {'lr': 0.00016899554274096035, 'samples': 17576832, 'steps': 91545, 'loss/train': 1.078229308128357} 11/07/2021 10:07:41 - INFO - __main__ - Step 91547: {'lr': 0.00016899052230854892, 'samples': 17577024, 'steps': 91546, 'loss/train': 1.0463998317718506} 11/07/2021 10:07:41 - INFO - __main__ - Step 91548: {'lr': 0.0001689855019126391, 'samples': 17577216, 'steps': 91547, 'loss/train': 1.2491445541381836} 11/07/2021 10:07:42 - INFO - __main__ - Step 91549: {'lr': 0.00016898048155323313, 'samples': 17577408, 'steps': 91548, 'loss/train': 1.2386345863342285} 11/07/2021 10:07:42 - INFO - __main__ - Step 91550: {'lr': 0.00016897546123033347, 'samples': 17577600, 'steps': 91549, 'loss/train': 1.437027096748352} 11/07/2021 10:07:43 - INFO - __main__ - Step 91551: {'lr': 0.0001689704409439421, 'samples': 17577792, 'steps': 91550, 'loss/train': 0.4917445182800293} 11/07/2021 10:07:43 - INFO - __main__ - Step 91552: {'lr': 0.0001689654206940614, 'samples': 17577984, 'steps': 91551, 'loss/train': 1.037237286567688} 11/07/2021 10:07:44 - INFO - __main__ - Step 91553: {'lr': 0.00016896040048069362, 'samples': 17578176, 'steps': 91552, 'loss/train': 1.6030852794647217} 11/07/2021 10:07:44 - INFO - __main__ - Step 91554: {'lr': 0.000168955380303841, 'samples': 17578368, 'steps': 91553, 'loss/train': 2.0534677505493164} 11/07/2021 10:07:45 - INFO - __main__ - Step 91555: {'lr': 0.00016895036016350589, 'samples': 17578560, 'steps': 91554, 'loss/train': 1.4410536289215088} 11/07/2021 10:07:46 - INFO - __main__ - Step 91556: {'lr': 0.00016894534005969044, 'samples': 17578752, 'steps': 91555, 'loss/train': 1.1090909242630005} 11/07/2021 10:07:46 - INFO - __main__ - Step 91557: {'lr': 0.00016894031999239702, 'samples': 17578944, 'steps': 91556, 'loss/train': 1.312842845916748} 11/07/2021 10:07:46 - INFO - __main__ - Step 91558: {'lr': 0.00016893529996162782, 'samples': 17579136, 'steps': 91557, 'loss/train': 0.982707679271698} 11/07/2021 10:07:47 - INFO - __main__ - Step 91559: {'lr': 0.0001689302799673851, 'samples': 17579328, 'steps': 91558, 'loss/train': 1.200127124786377} 11/07/2021 10:07:47 - INFO - __main__ - Step 91560: {'lr': 0.00016892526000967118, 'samples': 17579520, 'steps': 91559, 'loss/train': 1.4675954580307007} 11/07/2021 10:07:48 - INFO - __main__ - Step 91561: {'lr': 0.00016892024008848826, 'samples': 17579712, 'steps': 91560, 'loss/train': 1.028908133506775} 11/07/2021 10:07:48 - INFO - __main__ - Step 91562: {'lr': 0.00016891522020383865, 'samples': 17579904, 'steps': 91561, 'loss/train': 1.0576101541519165} 11/07/2021 10:07:49 - INFO - __main__ - Step 91563: {'lr': 0.0001689102003557246, 'samples': 17580096, 'steps': 91562, 'loss/train': 1.6193342208862305} 11/07/2021 10:07:49 - INFO - __main__ - Step 91564: {'lr': 0.00016890518054414843, 'samples': 17580288, 'steps': 91563, 'loss/train': 1.5530080795288086} 11/07/2021 10:07:49 - INFO - __main__ - Step 91565: {'lr': 0.00016890016076911228, 'samples': 17580480, 'steps': 91564, 'loss/train': 1.8890103101730347} 11/07/2021 10:07:50 - INFO - __main__ - Step 91566: {'lr': 0.00016889514103061843, 'samples': 17580672, 'steps': 91565, 'loss/train': 1.1840074062347412} 11/07/2021 10:07:51 - INFO - __main__ - Step 91567: {'lr': 0.0001688901213286692, 'samples': 17580864, 'steps': 91566, 'loss/train': 0.45666810870170593} 11/07/2021 10:07:51 - INFO - __main__ - Step 91568: {'lr': 0.00016888510166326683, 'samples': 17581056, 'steps': 91567, 'loss/train': 1.4094539880752563} 11/07/2021 10:07:51 - INFO - __main__ - Step 91569: {'lr': 0.00016888008203441352, 'samples': 17581248, 'steps': 91568, 'loss/train': 1.2190011739730835} 11/07/2021 10:07:52 - INFO - __main__ - Step 91570: {'lr': 0.00016887506244211165, 'samples': 17581440, 'steps': 91569, 'loss/train': 1.329309344291687} 11/07/2021 10:07:53 - INFO - __main__ - Step 91571: {'lr': 0.00016887004288636343, 'samples': 17581632, 'steps': 91570, 'loss/train': 1.396955966949463} 11/07/2021 10:07:53 - INFO - __main__ - Step 91572: {'lr': 0.00016886502336717108, 'samples': 17581824, 'steps': 91571, 'loss/train': 1.7056411504745483} 11/07/2021 10:07:54 - INFO - __main__ - Step 91573: {'lr': 0.00016886000388453693, 'samples': 17582016, 'steps': 91572, 'loss/train': 1.210250735282898} 11/07/2021 10:07:54 - INFO - __main__ - Step 91574: {'lr': 0.0001688549844384632, 'samples': 17582208, 'steps': 91573, 'loss/train': 0.8413466811180115} 11/07/2021 10:07:54 - INFO - __main__ - Step 91575: {'lr': 0.00016884996502895217, 'samples': 17582400, 'steps': 91574, 'loss/train': 2.1958043575286865} 11/07/2021 10:07:55 - INFO - __main__ - Step 91576: {'lr': 0.00016884494565600608, 'samples': 17582592, 'steps': 91575, 'loss/train': 1.2437679767608643} 11/07/2021 10:07:56 - INFO - __main__ - Step 91577: {'lr': 0.00016883992631962731, 'samples': 17582784, 'steps': 91576, 'loss/train': 1.205857753753662} 11/07/2021 10:07:56 - INFO - __main__ - Step 91578: {'lr': 0.0001688349070198179, 'samples': 17582976, 'steps': 91577, 'loss/train': 1.4948171377182007} 11/07/2021 10:07:56 - INFO - __main__ - Step 91579: {'lr': 0.00016882988775658025, 'samples': 17583168, 'steps': 91578, 'loss/train': 0.7495471835136414} 11/07/2021 10:07:57 - INFO - __main__ - Step 91580: {'lr': 0.00016882486852991664, 'samples': 17583360, 'steps': 91579, 'loss/train': 1.556339979171753} 11/07/2021 10:07:58 - INFO - __main__ - Step 91581: {'lr': 0.00016881984933982922, 'samples': 17583552, 'steps': 91580, 'loss/train': 1.2373048067092896} 11/07/2021 10:07:58 - INFO - __main__ - Step 91582: {'lr': 0.00016881483018632034, 'samples': 17583744, 'steps': 91581, 'loss/train': 1.5577349662780762} 11/07/2021 10:07:58 - INFO - __main__ - Step 91583: {'lr': 0.00016880981106939227, 'samples': 17583936, 'steps': 91582, 'loss/train': 0.8933048844337463} 11/07/2021 10:07:59 - INFO - __main__ - Step 91584: {'lr': 0.00016880479198904725, 'samples': 17584128, 'steps': 91583, 'loss/train': 1.558046579360962} 11/07/2021 10:07:59 - INFO - __main__ - Step 91585: {'lr': 0.0001687997729452875, 'samples': 17584320, 'steps': 91584, 'loss/train': 1.3273556232452393} 11/07/2021 10:08:00 - INFO - __main__ - Step 91586: {'lr': 0.00016879475393811533, 'samples': 17584512, 'steps': 91585, 'loss/train': 0.9901847243309021} 11/07/2021 10:08:01 - INFO - __main__ - Step 91587: {'lr': 0.00016878973496753301, 'samples': 17584704, 'steps': 91586, 'loss/train': 1.3986420631408691} 11/07/2021 10:08:01 - INFO - __main__ - Step 91588: {'lr': 0.0001687847160335428, 'samples': 17584896, 'steps': 91587, 'loss/train': 1.511900782585144} 11/07/2021 10:08:01 - INFO - __main__ - Step 91589: {'lr': 0.00016877969713614688, 'samples': 17585088, 'steps': 91588, 'loss/train': 1.1815497875213623} 11/07/2021 10:08:02 - INFO - __main__ - Step 91590: {'lr': 0.00016877467827534762, 'samples': 17585280, 'steps': 91589, 'loss/train': 1.5945978164672852} 11/07/2021 10:08:02 - INFO - __main__ - Step 91591: {'lr': 0.00016876965945114734, 'samples': 17585472, 'steps': 91590, 'loss/train': 1.2450752258300781} 11/07/2021 10:08:03 - INFO - __main__ - Step 91592: {'lr': 0.00016876464066354808, 'samples': 17585664, 'steps': 91591, 'loss/train': 1.3915201425552368} 11/07/2021 10:08:03 - INFO - __main__ - Step 91593: {'lr': 0.00016875962191255223, 'samples': 17585856, 'steps': 91592, 'loss/train': 1.1209250688552856} 11/07/2021 10:08:04 - INFO - __main__ - Step 91594: {'lr': 0.00016875460319816204, 'samples': 17586048, 'steps': 91593, 'loss/train': 5.749165058135986} 11/07/2021 10:08:04 - INFO - __main__ - Step 91595: {'lr': 0.00016874958452037976, 'samples': 17586240, 'steps': 91594, 'loss/train': 1.1858083009719849} 11/07/2021 10:08:04 - INFO - __main__ - Step 91596: {'lr': 0.00016874456587920766, 'samples': 17586432, 'steps': 91595, 'loss/train': 1.0971542596817017} 11/07/2021 10:08:06 - INFO - __main__ - Step 91597: {'lr': 0.00016873954727464802, 'samples': 17586624, 'steps': 91596, 'loss/train': 0.7523378133773804} 11/07/2021 10:08:06 - INFO - __main__ - Step 91598: {'lr': 0.0001687345287067031, 'samples': 17586816, 'steps': 91597, 'loss/train': 1.6823936700820923} 11/07/2021 10:08:06 - INFO - __main__ - Step 91599: {'lr': 0.00016872951017537512, 'samples': 17587008, 'steps': 91598, 'loss/train': 1.5026576519012451} 11/07/2021 10:08:07 - INFO - __main__ - Step 91600: {'lr': 0.0001687244916806664, 'samples': 17587200, 'steps': 91599, 'loss/train': 1.4576057195663452} 11/07/2021 10:08:07 - INFO - __main__ - Step 91601: {'lr': 0.00016871947322257913, 'samples': 17587392, 'steps': 91600, 'loss/train': 1.2782890796661377} 11/07/2021 10:08:08 - INFO - __main__ - Step 91602: {'lr': 0.0001687144548011157, 'samples': 17587584, 'steps': 91601, 'loss/train': 1.4910929203033447} 11/07/2021 10:08:08 - INFO - __main__ - Step 91603: {'lr': 0.00016870943641627818, 'samples': 17587776, 'steps': 91602, 'loss/train': 1.3212487697601318} 11/07/2021 10:08:09 - INFO - __main__ - Step 91604: {'lr': 0.00016870441806806903, 'samples': 17587968, 'steps': 91603, 'loss/train': 1.3272496461868286} 11/07/2021 10:08:09 - INFO - __main__ - Step 91605: {'lr': 0.00016869939975649035, 'samples': 17588160, 'steps': 91604, 'loss/train': 1.2778267860412598} 11/07/2021 10:08:09 - INFO - __main__ - Step 91606: {'lr': 0.00016869438148154448, 'samples': 17588352, 'steps': 91605, 'loss/train': 1.42152738571167} 11/07/2021 10:08:10 - INFO - __main__ - Step 91607: {'lr': 0.00016868936324323364, 'samples': 17588544, 'steps': 91606, 'loss/train': 1.5930330753326416} 11/07/2021 10:08:11 - INFO - __main__ - Step 91608: {'lr': 0.00016868434504156013, 'samples': 17588736, 'steps': 91607, 'loss/train': 1.6702182292938232} 11/07/2021 10:08:11 - INFO - __main__ - Step 91609: {'lr': 0.0001686793268765262, 'samples': 17588928, 'steps': 91608, 'loss/train': 1.5139538049697876} 11/07/2021 10:08:11 - INFO - __main__ - Step 91610: {'lr': 0.0001686743087481341, 'samples': 17589120, 'steps': 91609, 'loss/train': 1.3725072145462036} 11/07/2021 10:08:12 - INFO - __main__ - Step 91611: {'lr': 0.00016866929065638615, 'samples': 17589312, 'steps': 91610, 'loss/train': 1.7503530979156494} 11/07/2021 10:08:13 - INFO - __main__ - Step 91612: {'lr': 0.00016866427260128455, 'samples': 17589504, 'steps': 91611, 'loss/train': 1.4968630075454712} 11/07/2021 10:08:13 - INFO - __main__ - Step 91613: {'lr': 0.00016865925458283155, 'samples': 17589696, 'steps': 91612, 'loss/train': 1.351185917854309} 11/07/2021 10:08:13 - INFO - __main__ - Step 91614: {'lr': 0.00016865423660102945, 'samples': 17589888, 'steps': 91613, 'loss/train': 1.3652690649032593} 11/07/2021 10:08:14 - INFO - __main__ - Step 91615: {'lr': 0.00016864921865588045, 'samples': 17590080, 'steps': 91614, 'loss/train': 0.5681241154670715} 11/07/2021 10:08:14 - INFO - __main__ - Step 91616: {'lr': 0.0001686442007473869, 'samples': 17590272, 'steps': 91615, 'loss/train': 0.8279055953025818} 11/07/2021 10:08:14 - INFO - __main__ - Step 91617: {'lr': 0.00016863918287555102, 'samples': 17590464, 'steps': 91616, 'loss/train': 0.921493649482727} 11/07/2021 10:08:15 - INFO - __main__ - Step 91618: {'lr': 0.0001686341650403751, 'samples': 17590656, 'steps': 91617, 'loss/train': 1.4276150465011597} 11/07/2021 10:08:16 - INFO - __main__ - Step 91619: {'lr': 0.00016862914724186128, 'samples': 17590848, 'steps': 91618, 'loss/train': 1.154151201248169} 11/07/2021 10:08:16 - INFO - __main__ - Step 91620: {'lr': 0.000168624129480012, 'samples': 17591040, 'steps': 91619, 'loss/train': 1.292893409729004} 11/07/2021 10:08:17 - INFO - __main__ - Step 91621: {'lr': 0.00016861911175482936, 'samples': 17591232, 'steps': 91620, 'loss/train': 1.5264326333999634} 11/07/2021 10:08:17 - INFO - __main__ - Step 91622: {'lr': 0.00016861409406631573, 'samples': 17591424, 'steps': 91621, 'loss/train': 1.239927053451538} 11/07/2021 10:08:18 - INFO - __main__ - Step 91623: {'lr': 0.00016860907641447337, 'samples': 17591616, 'steps': 91622, 'loss/train': 1.5401889085769653} 11/07/2021 10:08:18 - INFO - __main__ - Step 91624: {'lr': 0.00016860405879930447, 'samples': 17591808, 'steps': 91623, 'loss/train': 1.2834111452102661} 11/07/2021 10:08:19 - INFO - __main__ - Step 91625: {'lr': 0.00016859904122081129, 'samples': 17592000, 'steps': 91624, 'loss/train': 1.3558694124221802} 11/07/2021 10:08:19 - INFO - __main__ - Step 91626: {'lr': 0.00016859402367899615, 'samples': 17592192, 'steps': 91625, 'loss/train': 1.6687766313552856} 11/07/2021 10:08:19 - INFO - __main__ - Step 91627: {'lr': 0.00016858900617386128, 'samples': 17592384, 'steps': 91626, 'loss/train': 1.4371399879455566} 11/07/2021 10:08:21 - INFO - __main__ - Step 91628: {'lr': 0.00016858398870540895, 'samples': 17592576, 'steps': 91627, 'loss/train': 1.4778282642364502} 11/07/2021 10:08:21 - INFO - __main__ - Step 91629: {'lr': 0.00016857897127364141, 'samples': 17592768, 'steps': 91628, 'loss/train': 0.8039286136627197} 11/07/2021 10:08:21 - INFO - __main__ - Step 91630: {'lr': 0.00016857395387856095, 'samples': 17592960, 'steps': 91629, 'loss/train': 1.40609872341156} 11/07/2021 10:08:22 - INFO - __main__ - Step 91631: {'lr': 0.00016856893652016986, 'samples': 17593152, 'steps': 91630, 'loss/train': 1.3402307033538818} 11/07/2021 10:08:22 - INFO - __main__ - Step 91632: {'lr': 0.0001685639191984703, 'samples': 17593344, 'steps': 91631, 'loss/train': 1.0648841857910156} 11/07/2021 10:08:23 - INFO - __main__ - Step 91633: {'lr': 0.00016855890191346453, 'samples': 17593536, 'steps': 91632, 'loss/train': 1.4347511529922485} 11/07/2021 10:08:24 - INFO - __main__ - Step 91634: {'lr': 0.00016855388466515499, 'samples': 17593728, 'steps': 91633, 'loss/train': 1.2321406602859497} 11/07/2021 10:08:24 - INFO - __main__ - Step 91635: {'lr': 0.0001685488674535437, 'samples': 17593920, 'steps': 91634, 'loss/train': 0.9046047329902649} 11/07/2021 10:08:24 - INFO - __main__ - Step 91636: {'lr': 0.00016854385027863307, 'samples': 17594112, 'steps': 91635, 'loss/train': 0.974776566028595} 11/07/2021 10:08:25 - INFO - __main__ - Step 91637: {'lr': 0.00016853883314042528, 'samples': 17594304, 'steps': 91636, 'loss/train': 1.3012815713882446} 11/07/2021 10:08:26 - INFO - __main__ - Step 91638: {'lr': 0.0001685338160389227, 'samples': 17594496, 'steps': 91637, 'loss/train': 0.37655022740364075} 11/07/2021 10:08:26 - INFO - __main__ - Step 91639: {'lr': 0.00016852879897412748, 'samples': 17594688, 'steps': 91638, 'loss/train': 0.11556801199913025} 11/07/2021 10:08:27 - INFO - __main__ - Step 91640: {'lr': 0.00016852378194604195, 'samples': 17594880, 'steps': 91639, 'loss/train': 1.4253076314926147} 11/07/2021 10:08:27 - INFO - __main__ - Step 91641: {'lr': 0.00016851876495466834, 'samples': 17595072, 'steps': 91640, 'loss/train': 1.3050854206085205} 11/07/2021 10:08:27 - INFO - __main__ - Step 91642: {'lr': 0.0001685137480000089, 'samples': 17595264, 'steps': 91641, 'loss/train': 1.437300205230713} 11/07/2021 10:08:28 - INFO - __main__ - Step 91643: {'lr': 0.00016850873108206592, 'samples': 17595456, 'steps': 91642, 'loss/train': 1.3448967933654785} 11/07/2021 10:08:29 - INFO - __main__ - Step 91644: {'lr': 0.00016850371420084172, 'samples': 17595648, 'steps': 91643, 'loss/train': 1.680133581161499} 11/07/2021 10:08:29 - INFO - __main__ - Step 91645: {'lr': 0.00016849869735633844, 'samples': 17595840, 'steps': 91644, 'loss/train': 1.8504866361618042} 11/07/2021 10:08:29 - INFO - __main__ - Step 91646: {'lr': 0.00016849368054855837, 'samples': 17596032, 'steps': 91645, 'loss/train': 1.4066319465637207} 11/07/2021 10:08:30 - INFO - __main__ - Step 91647: {'lr': 0.00016848866377750378, 'samples': 17596224, 'steps': 91646, 'loss/train': 1.6552166938781738} 11/07/2021 10:08:30 - INFO - __main__ - Step 91648: {'lr': 0.00016848364704317697, 'samples': 17596416, 'steps': 91647, 'loss/train': 1.4771195650100708} 11/07/2021 10:08:31 - INFO - __main__ - Step 91649: {'lr': 0.00016847863034558013, 'samples': 17596608, 'steps': 91648, 'loss/train': 1.5158536434173584} 11/07/2021 10:08:32 - INFO - __main__ - Step 91650: {'lr': 0.00016847361368471558, 'samples': 17596800, 'steps': 91649, 'loss/train': 1.436808705329895} 11/07/2021 10:08:32 - INFO - __main__ - Step 91651: {'lr': 0.00016846859706058553, 'samples': 17596992, 'steps': 91650, 'loss/train': 0.9155389666557312} 11/07/2021 10:08:32 - INFO - __main__ - Step 91652: {'lr': 0.00016846358047319232, 'samples': 17597184, 'steps': 91651, 'loss/train': 2.3899176120758057} 11/07/2021 10:08:33 - INFO - __main__ - Step 91653: {'lr': 0.00016845856392253816, 'samples': 17597376, 'steps': 91652, 'loss/train': 1.402917504310608} 11/07/2021 10:08:33 - INFO - __main__ - Step 91654: {'lr': 0.0001684535474086253, 'samples': 17597568, 'steps': 91653, 'loss/train': 1.3515913486480713} 11/07/2021 10:08:35 - INFO - __main__ - Step 91655: {'lr': 0.00016844853093145602, 'samples': 17597760, 'steps': 91654, 'loss/train': 1.434875249862671} 11/07/2021 10:08:35 - INFO - __main__ - Step 91656: {'lr': 0.00016844351449103254, 'samples': 17597952, 'steps': 91655, 'loss/train': 1.4147162437438965} 11/07/2021 10:08:36 - INFO - __main__ - Step 91657: {'lr': 0.00016843849808735717, 'samples': 17598144, 'steps': 91656, 'loss/train': 1.755582332611084} 11/07/2021 10:08:36 - INFO - __main__ - Step 91658: {'lr': 0.00016843348172043227, 'samples': 17598336, 'steps': 91657, 'loss/train': 1.7227132320404053} 11/07/2021 10:08:36 - INFO - __main__ - Step 91659: {'lr': 0.0001684284653902599, 'samples': 17598528, 'steps': 91658, 'loss/train': 1.712369680404663} 11/07/2021 10:08:37 - INFO - __main__ - Step 91660: {'lr': 0.00016842344909684238, 'samples': 17598720, 'steps': 91659, 'loss/train': 0.8605157732963562} 11/07/2021 10:08:37 - INFO - __main__ - Step 91661: {'lr': 0.00016841843284018198, 'samples': 17598912, 'steps': 91660, 'loss/train': 1.7563822269439697} 11/07/2021 10:08:38 - INFO - __main__ - Step 91662: {'lr': 0.000168413416620281, 'samples': 17599104, 'steps': 91661, 'loss/train': 0.9340380430221558} 11/07/2021 10:08:38 - INFO - __main__ - Step 91663: {'lr': 0.00016840840043714166, 'samples': 17599296, 'steps': 91662, 'loss/train': 1.624074935913086} 11/07/2021 10:08:39 - INFO - __main__ - Step 91664: {'lr': 0.00016840338429076625, 'samples': 17599488, 'steps': 91663, 'loss/train': 1.6406292915344238} 11/07/2021 10:08:39 - INFO - __main__ - Step 91665: {'lr': 0.000168398368181157, 'samples': 17599680, 'steps': 91664, 'loss/train': 1.3540664911270142} 11/07/2021 10:08:39 - INFO - __main__ - Step 91666: {'lr': 0.0001683933521083162, 'samples': 17599872, 'steps': 91665, 'loss/train': 1.4858033657073975} 11/07/2021 10:08:40 - INFO - __main__ - Step 91667: {'lr': 0.00016838833607224607, 'samples': 17600064, 'steps': 91666, 'loss/train': 0.8244354724884033} 11/07/2021 10:08:41 - INFO - __main__ - Step 91668: {'lr': 0.00016838332007294894, 'samples': 17600256, 'steps': 91667, 'loss/train': 1.6607086658477783} 11/07/2021 10:08:41 - INFO - __main__ - Step 91669: {'lr': 0.00016837830411042698, 'samples': 17600448, 'steps': 91668, 'loss/train': 1.045675277709961} 11/07/2021 10:08:41 - INFO - __main__ - Step 91670: {'lr': 0.00016837328818468253, 'samples': 17600640, 'steps': 91669, 'loss/train': 1.4604578018188477} 11/07/2021 10:08:42 - INFO - __main__ - Step 91671: {'lr': 0.00016836827229571794, 'samples': 17600832, 'steps': 91670, 'loss/train': 0.8519294857978821} 11/07/2021 10:08:42 - INFO - __main__ - Step 91672: {'lr': 0.00016836325644353518, 'samples': 17601024, 'steps': 91671, 'loss/train': 1.0158668756484985} 11/07/2021 10:08:43 - INFO - __main__ - Step 91673: {'lr': 0.0001683582406281367, 'samples': 17601216, 'steps': 91672, 'loss/train': 1.1974589824676514} 11/07/2021 10:08:43 - INFO - __main__ - Step 91674: {'lr': 0.00016835322484952476, 'samples': 17601408, 'steps': 91673, 'loss/train': 1.4969698190689087} 11/07/2021 10:08:44 - INFO - __main__ - Step 91675: {'lr': 0.0001683482091077016, 'samples': 17601600, 'steps': 91674, 'loss/train': 1.3606178760528564} 11/07/2021 10:08:44 - INFO - __main__ - Step 91676: {'lr': 0.00016834319340266945, 'samples': 17601792, 'steps': 91675, 'loss/train': 1.6856111288070679} 11/07/2021 10:08:44 - INFO - __main__ - Step 91677: {'lr': 0.0001683381777344306, 'samples': 17601984, 'steps': 91676, 'loss/train': 1.2760741710662842} 11/07/2021 10:08:46 - INFO - __main__ - Step 91678: {'lr': 0.0001683331621029873, 'samples': 17602176, 'steps': 91677, 'loss/train': 0.8488902449607849} 11/07/2021 10:08:46 - INFO - __main__ - Step 91679: {'lr': 0.0001683281465083419, 'samples': 17602368, 'steps': 91678, 'loss/train': 1.5090404748916626} 11/07/2021 10:08:46 - INFO - __main__ - Step 91680: {'lr': 0.00016832313095049647, 'samples': 17602560, 'steps': 91679, 'loss/train': 1.391300082206726} 11/07/2021 10:08:47 - INFO - __main__ - Step 91681: {'lr': 0.00016831811542945341, 'samples': 17602752, 'steps': 91680, 'loss/train': 1.3799798488616943} 11/07/2021 10:08:47 - INFO - __main__ - Step 91682: {'lr': 0.00016831309994521499, 'samples': 17602944, 'steps': 91681, 'loss/train': 1.263771891593933} 11/07/2021 10:08:48 - INFO - __main__ - Step 91683: {'lr': 0.00016830808449778338, 'samples': 17603136, 'steps': 91682, 'loss/train': 1.4172568321228027} 11/07/2021 10:08:48 - INFO - __main__ - Step 91684: {'lr': 0.00016830306908716087, 'samples': 17603328, 'steps': 91683, 'loss/train': 1.5624514818191528} 11/07/2021 10:08:49 - INFO - __main__ - Step 91685: {'lr': 0.0001682980537133499, 'samples': 17603520, 'steps': 91684, 'loss/train': 1.4368188381195068} 11/07/2021 10:08:49 - INFO - __main__ - Step 91686: {'lr': 0.0001682930383763524, 'samples': 17603712, 'steps': 91685, 'loss/train': 1.4557353258132935} 11/07/2021 10:08:49 - INFO - __main__ - Step 91687: {'lr': 0.00016828802307617083, 'samples': 17603904, 'steps': 91686, 'loss/train': 1.9564673900604248} 11/07/2021 10:08:50 - INFO - __main__ - Step 91688: {'lr': 0.00016828300781280742, 'samples': 17604096, 'steps': 91687, 'loss/train': 1.498939871788025} 11/07/2021 10:08:51 - INFO - __main__ - Step 91689: {'lr': 0.00016827799258626442, 'samples': 17604288, 'steps': 91688, 'loss/train': 0.8765378594398499} 11/07/2021 10:08:51 - INFO - __main__ - Step 91690: {'lr': 0.00016827297739654406, 'samples': 17604480, 'steps': 91689, 'loss/train': 0.7840800881385803} 11/07/2021 10:08:51 - INFO - __main__ - Step 91691: {'lr': 0.00016826796224364871, 'samples': 17604672, 'steps': 91690, 'loss/train': 1.7207820415496826} 11/07/2021 10:08:52 - INFO - __main__ - Step 91692: {'lr': 0.0001682629471275805, 'samples': 17604864, 'steps': 91691, 'loss/train': 1.3536696434020996} 11/07/2021 10:08:53 - INFO - __main__ - Step 91693: {'lr': 0.00016825793204834177, 'samples': 17605056, 'steps': 91692, 'loss/train': 1.8290824890136719} 11/07/2021 10:08:53 - INFO - __main__ - Step 91694: {'lr': 0.00016825291700593473, 'samples': 17605248, 'steps': 91693, 'loss/train': 1.448961853981018} 11/07/2021 10:08:54 - INFO - __main__ - Step 91695: {'lr': 0.00016824790200036167, 'samples': 17605440, 'steps': 91694, 'loss/train': 0.8876399993896484} 11/07/2021 10:08:54 - INFO - __main__ - Step 91696: {'lr': 0.00016824288703162486, 'samples': 17605632, 'steps': 91695, 'loss/train': 1.8345913887023926} 11/07/2021 10:08:54 - INFO - __main__ - Step 91697: {'lr': 0.0001682378720997265, 'samples': 17605824, 'steps': 91696, 'loss/train': 1.3528461456298828} 11/07/2021 10:08:55 - INFO - __main__ - Step 91698: {'lr': 0.00016823285720466907, 'samples': 17606016, 'steps': 91697, 'loss/train': 1.1301116943359375} 11/07/2021 10:08:56 - INFO - __main__ - Step 91699: {'lr': 0.00016822784234645448, 'samples': 17606208, 'steps': 91698, 'loss/train': 1.3618961572647095} 11/07/2021 10:08:56 - INFO - __main__ - Step 91700: {'lr': 0.00016822282752508523, 'samples': 17606400, 'steps': 91699, 'loss/train': 1.6071276664733887} 11/07/2021 10:08:56 - INFO - __main__ - Step 91701: {'lr': 0.00016821781274056348, 'samples': 17606592, 'steps': 91700, 'loss/train': 1.1888554096221924} 11/07/2021 10:08:57 - INFO - __main__ - Step 91702: {'lr': 0.0001682127979928915, 'samples': 17606784, 'steps': 91701, 'loss/train': 1.2765322923660278} 11/07/2021 10:08:57 - INFO - __main__ - Step 91703: {'lr': 0.00016820778328207158, 'samples': 17606976, 'steps': 91702, 'loss/train': 1.4770854711532593} 11/07/2021 10:08:58 - INFO - __main__ - Step 91704: {'lr': 0.00016820276860810595, 'samples': 17607168, 'steps': 91703, 'loss/train': 1.481670618057251} 11/07/2021 10:08:58 - INFO - __main__ - Step 91705: {'lr': 0.00016819775397099697, 'samples': 17607360, 'steps': 91704, 'loss/train': 1.1523396968841553} 11/07/2021 10:08:59 - INFO - __main__ - Step 91706: {'lr': 0.00016819273937074676, 'samples': 17607552, 'steps': 91705, 'loss/train': 1.2226054668426514} 11/07/2021 10:08:59 - INFO - __main__ - Step 91707: {'lr': 0.00016818772480735761, 'samples': 17607744, 'steps': 91706, 'loss/train': 0.7278129458427429} 11/07/2021 10:08:59 - INFO - __main__ - Step 91708: {'lr': 0.00016818271028083188, 'samples': 17607936, 'steps': 91707, 'loss/train': 1.3410624265670776} 11/07/2021 10:09:00 - INFO - __main__ - Step 91709: {'lr': 0.0001681776957911717, 'samples': 17608128, 'steps': 91708, 'loss/train': 0.7147866487503052} 11/07/2021 10:09:01 - INFO - __main__ - Step 91710: {'lr': 0.00016817268133837942, 'samples': 17608320, 'steps': 91709, 'loss/train': 1.509781837463379} 11/07/2021 10:09:01 - INFO - __main__ - Step 91711: {'lr': 0.00016816766692245727, 'samples': 17608512, 'steps': 91710, 'loss/train': 1.6244056224822998} 11/07/2021 10:09:02 - INFO - __main__ - Step 91712: {'lr': 0.0001681626525434076, 'samples': 17608704, 'steps': 91711, 'loss/train': 1.5069652795791626} 11/07/2021 10:09:02 - INFO - __main__ - Step 91713: {'lr': 0.00016815763820123247, 'samples': 17608896, 'steps': 91712, 'loss/train': 0.6340115070343018} 11/07/2021 10:09:03 - INFO - __main__ - Step 91714: {'lr': 0.0001681526238959342, 'samples': 17609088, 'steps': 91713, 'loss/train': 1.5809495449066162} 11/07/2021 10:09:03 - INFO - __main__ - Step 91715: {'lr': 0.0001681476096275152, 'samples': 17609280, 'steps': 91714, 'loss/train': 1.1525863409042358} 11/07/2021 10:09:04 - INFO - __main__ - Step 91716: {'lr': 0.00016814259539597753, 'samples': 17609472, 'steps': 91715, 'loss/train': 1.508595585823059} 11/07/2021 10:09:04 - INFO - __main__ - Step 91717: {'lr': 0.00016813758120132362, 'samples': 17609664, 'steps': 91716, 'loss/train': 1.2091727256774902} 11/07/2021 10:09:04 - INFO - __main__ - Step 91718: {'lr': 0.0001681325670435556, 'samples': 17609856, 'steps': 91717, 'loss/train': 0.5873464345932007} 11/07/2021 10:09:05 - INFO - __main__ - Step 91719: {'lr': 0.00016812755292267578, 'samples': 17610048, 'steps': 91718, 'loss/train': 1.4806389808654785} 11/07/2021 10:09:06 - INFO - __main__ - Step 91720: {'lr': 0.00016812253883868644, 'samples': 17610240, 'steps': 91719, 'loss/train': 1.0540202856063843} 11/07/2021 10:09:06 - INFO - __main__ - Step 91721: {'lr': 0.0001681175247915898, 'samples': 17610432, 'steps': 91720, 'loss/train': 2.1799206733703613} 11/07/2021 10:09:06 - INFO - __main__ - Step 91722: {'lr': 0.00016811251078138818, 'samples': 17610624, 'steps': 91721, 'loss/train': 1.30558443069458} 11/07/2021 10:09:07 - INFO - __main__ - Step 91723: {'lr': 0.00016810749680808373, 'samples': 17610816, 'steps': 91722, 'loss/train': 1.8748496770858765} 11/07/2021 10:09:07 - INFO - __main__ - Step 91724: {'lr': 0.00016810248287167884, 'samples': 17611008, 'steps': 91723, 'loss/train': 1.4370653629302979} 11/07/2021 10:09:08 - INFO - __main__ - Step 91725: {'lr': 0.00016809746897217582, 'samples': 17611200, 'steps': 91724, 'loss/train': 1.2819157838821411} 11/07/2021 10:09:09 - INFO - __main__ - Step 91726: {'lr': 0.00016809245510957666, 'samples': 17611392, 'steps': 91725, 'loss/train': 1.185221552848816} 11/07/2021 10:09:09 - INFO - __main__ - Step 91727: {'lr': 0.00016808744128388382, 'samples': 17611584, 'steps': 91726, 'loss/train': 1.5267094373703003} 11/07/2021 10:09:09 - INFO - __main__ - Step 91728: {'lr': 0.0001680824274950995, 'samples': 17611776, 'steps': 91727, 'loss/train': 1.0229536294937134} 11/07/2021 10:09:10 - INFO - __main__ - Step 91729: {'lr': 0.00016807741374322597, 'samples': 17611968, 'steps': 91728, 'loss/train': 1.167500615119934} 11/07/2021 10:09:11 - INFO - __main__ - Step 91730: {'lr': 0.0001680724000282655, 'samples': 17612160, 'steps': 91729, 'loss/train': 1.4839327335357666} 11/07/2021 10:09:11 - INFO - __main__ - Step 91731: {'lr': 0.0001680673863502203, 'samples': 17612352, 'steps': 91730, 'loss/train': 1.3975908756256104} 11/07/2021 10:09:11 - INFO - __main__ - Step 91732: {'lr': 0.00016806237270909275, 'samples': 17612544, 'steps': 91731, 'loss/train': 1.903235673904419} 11/07/2021 10:09:12 - INFO - __main__ - Step 91733: {'lr': 0.00016805735910488496, 'samples': 17612736, 'steps': 91732, 'loss/train': 1.60752272605896} 11/07/2021 10:09:12 - INFO - __main__ - Step 91734: {'lr': 0.0001680523455375993, 'samples': 17612928, 'steps': 91733, 'loss/train': 1.3984696865081787} 11/07/2021 10:09:13 - INFO - __main__ - Step 91735: {'lr': 0.000168047332007238, 'samples': 17613120, 'steps': 91734, 'loss/train': 1.1914920806884766} 11/07/2021 10:09:13 - INFO - __main__ - Step 91736: {'lr': 0.0001680423185138033, 'samples': 17613312, 'steps': 91735, 'loss/train': 1.2030911445617676} 11/07/2021 10:09:14 - INFO - __main__ - Step 91737: {'lr': 0.00016803730505729746, 'samples': 17613504, 'steps': 91736, 'loss/train': 1.3255114555358887} 11/07/2021 10:09:14 - INFO - __main__ - Step 91738: {'lr': 0.00016803229163772274, 'samples': 17613696, 'steps': 91737, 'loss/train': 1.4589872360229492} 11/07/2021 10:09:14 - INFO - __main__ - Step 91739: {'lr': 0.00016802727825508147, 'samples': 17613888, 'steps': 91738, 'loss/train': 1.6015938520431519} 11/07/2021 10:09:15 - INFO - __main__ - Step 91740: {'lr': 0.00016802226490937575, 'samples': 17614080, 'steps': 91739, 'loss/train': 1.4088549613952637} 11/07/2021 10:09:16 - INFO - __main__ - Step 91741: {'lr': 0.00016801725160060796, 'samples': 17614272, 'steps': 91740, 'loss/train': 1.2912390232086182} 11/07/2021 10:09:16 - INFO - __main__ - Step 91742: {'lr': 0.0001680122383287803, 'samples': 17614464, 'steps': 91741, 'loss/train': 1.2849057912826538} 11/07/2021 10:09:16 - INFO - __main__ - Step 91743: {'lr': 0.0001680072250938951, 'samples': 17614656, 'steps': 91742, 'loss/train': 1.4331440925598145} 11/07/2021 10:09:17 - INFO - __main__ - Step 91744: {'lr': 0.0001680022118959546, 'samples': 17614848, 'steps': 91743, 'loss/train': 1.4428614377975464} 11/07/2021 10:09:18 - INFO - __main__ - Step 91745: {'lr': 0.000167997198734961, 'samples': 17615040, 'steps': 91744, 'loss/train': 0.9902279376983643} 11/07/2021 10:09:18 - INFO - __main__ - Step 91746: {'lr': 0.0001679921856109166, 'samples': 17615232, 'steps': 91745, 'loss/train': 1.1683378219604492} 11/07/2021 10:09:19 - INFO - __main__ - Step 91747: {'lr': 0.0001679871725238237, 'samples': 17615424, 'steps': 91746, 'loss/train': 0.7591825723648071} 11/07/2021 10:09:19 - INFO - __main__ - Step 91748: {'lr': 0.00016798215947368448, 'samples': 17615616, 'steps': 91747, 'loss/train': 1.5746937990188599} 11/07/2021 10:09:19 - INFO - __main__ - Step 91749: {'lr': 0.0001679771464605012, 'samples': 17615808, 'steps': 91748, 'loss/train': 1.7051841020584106} 11/07/2021 10:09:21 - INFO - __main__ - Step 91750: {'lr': 0.00016797213348427621, 'samples': 17616000, 'steps': 91749, 'loss/train': 1.5612164735794067} 11/07/2021 10:09:21 - INFO - __main__ - Step 91751: {'lr': 0.00016796712054501168, 'samples': 17616192, 'steps': 91750, 'loss/train': 1.404444694519043} 11/07/2021 10:09:21 - INFO - __main__ - Step 91752: {'lr': 0.00016796210764270995, 'samples': 17616384, 'steps': 91751, 'loss/train': 1.305804967880249} 11/07/2021 10:09:22 - INFO - __main__ - Step 91753: {'lr': 0.00016795709477737317, 'samples': 17616576, 'steps': 91752, 'loss/train': 1.4931063652038574} 11/07/2021 10:09:22 - INFO - __main__ - Step 91754: {'lr': 0.00016795208194900365, 'samples': 17616768, 'steps': 91753, 'loss/train': 0.14519543945789337} 11/07/2021 10:09:23 - INFO - __main__ - Step 91755: {'lr': 0.00016794706915760369, 'samples': 17616960, 'steps': 91754, 'loss/train': 1.2981292009353638} 11/07/2021 10:09:23 - INFO - __main__ - Step 91756: {'lr': 0.0001679420564031755, 'samples': 17617152, 'steps': 91755, 'loss/train': 1.3028732538223267} 11/07/2021 10:09:24 - INFO - __main__ - Step 91757: {'lr': 0.00016793704368572133, 'samples': 17617344, 'steps': 91756, 'loss/train': 1.311335563659668} 11/07/2021 10:09:24 - INFO - __main__ - Step 91758: {'lr': 0.00016793203100524354, 'samples': 17617536, 'steps': 91757, 'loss/train': 1.5173922777175903} 11/07/2021 10:09:24 - INFO - __main__ - Step 91759: {'lr': 0.00016792701836174423, 'samples': 17617728, 'steps': 91758, 'loss/train': 1.4114837646484375} 11/07/2021 10:09:26 - INFO - __main__ - Step 91760: {'lr': 0.00016792200575522576, 'samples': 17617920, 'steps': 91759, 'loss/train': 1.342724084854126} 11/07/2021 10:09:26 - INFO - __main__ - Step 91761: {'lr': 0.00016791699318569037, 'samples': 17618112, 'steps': 91760, 'loss/train': 1.7026604413986206} 11/07/2021 10:09:26 - INFO - __main__ - Step 91762: {'lr': 0.00016791198065314034, 'samples': 17618304, 'steps': 91761, 'loss/train': 1.4437588453292847} 11/07/2021 10:09:27 - INFO - __main__ - Step 91763: {'lr': 0.00016790696815757787, 'samples': 17618496, 'steps': 91762, 'loss/train': 1.089995265007019} 11/07/2021 10:09:27 - INFO - __main__ - Step 91764: {'lr': 0.00016790195569900524, 'samples': 17618688, 'steps': 91763, 'loss/train': 1.5124998092651367} 11/07/2021 10:09:27 - INFO - __main__ - Step 91765: {'lr': 0.00016789694327742482, 'samples': 17618880, 'steps': 91764, 'loss/train': 1.4685442447662354} 11/07/2021 10:09:28 - INFO - __main__ - Step 91766: {'lr': 0.00016789193089283868, 'samples': 17619072, 'steps': 91765, 'loss/train': 1.6381975412368774} 11/07/2021 10:09:29 - INFO - __main__ - Step 91767: {'lr': 0.00016788691854524918, 'samples': 17619264, 'steps': 91766, 'loss/train': 1.7703176736831665} 11/07/2021 10:09:29 - INFO - __main__ - Step 91768: {'lr': 0.00016788190623465856, 'samples': 17619456, 'steps': 91767, 'loss/train': 1.575351357460022} 11/07/2021 10:09:29 - INFO - __main__ - Step 91769: {'lr': 0.00016787689396106917, 'samples': 17619648, 'steps': 91768, 'loss/train': 1.5759484767913818} 11/07/2021 10:09:30 - INFO - __main__ - Step 91770: {'lr': 0.00016787188172448308, 'samples': 17619840, 'steps': 91769, 'loss/train': 1.4695740938186646} 11/07/2021 10:09:31 - INFO - __main__ - Step 91771: {'lr': 0.0001678668695249027, 'samples': 17620032, 'steps': 91770, 'loss/train': 1.3472895622253418} 11/07/2021 10:09:31 - INFO - __main__ - Step 91772: {'lr': 0.00016786185736233022, 'samples': 17620224, 'steps': 91771, 'loss/train': 1.519077181816101} 11/07/2021 10:09:31 - INFO - __main__ - Step 91773: {'lr': 0.00016785684523676792, 'samples': 17620416, 'steps': 91772, 'loss/train': 1.2478604316711426} 11/07/2021 10:09:32 - INFO - __main__ - Step 91774: {'lr': 0.00016785183314821806, 'samples': 17620608, 'steps': 91773, 'loss/train': 1.2006475925445557} 11/07/2021 10:09:32 - INFO - __main__ - Step 91775: {'lr': 0.00016784682109668292, 'samples': 17620800, 'steps': 91774, 'loss/train': 1.508906364440918} 11/07/2021 10:09:33 - INFO - __main__ - Step 91776: {'lr': 0.0001678418090821647, 'samples': 17620992, 'steps': 91775, 'loss/train': 1.5674798488616943} 11/07/2021 10:09:34 - INFO - __main__ - Step 91777: {'lr': 0.0001678367971046657, 'samples': 17621184, 'steps': 91776, 'loss/train': 1.4379003047943115} 11/07/2021 10:09:34 - INFO - __main__ - Step 91778: {'lr': 0.00016783178516418818, 'samples': 17621376, 'steps': 91777, 'loss/train': 1.3746469020843506} 11/07/2021 10:09:34 - INFO - __main__ - Step 91779: {'lr': 0.00016782677326073446, 'samples': 17621568, 'steps': 91778, 'loss/train': 1.1009594202041626} 11/07/2021 10:09:35 - INFO - __main__ - Step 91780: {'lr': 0.00016782176139430673, 'samples': 17621760, 'steps': 91779, 'loss/train': 1.4361001253128052} 11/07/2021 10:09:36 - INFO - __main__ - Step 91781: {'lr': 0.00016781674956490715, 'samples': 17621952, 'steps': 91780, 'loss/train': 1.330990195274353} 11/07/2021 10:09:36 - INFO - __main__ - Step 91782: {'lr': 0.00016781173777253807, 'samples': 17622144, 'steps': 91781, 'loss/train': 0.17040075361728668} 11/07/2021 10:09:36 - INFO - __main__ - Step 91783: {'lr': 0.0001678067260172018, 'samples': 17622336, 'steps': 91782, 'loss/train': 1.5120173692703247} 11/07/2021 10:09:37 - INFO - __main__ - Step 91784: {'lr': 0.00016780171429890052, 'samples': 17622528, 'steps': 91783, 'loss/train': 1.7472891807556152} 11/07/2021 10:09:37 - INFO - __main__ - Step 91785: {'lr': 0.00016779670261763652, 'samples': 17622720, 'steps': 91784, 'loss/train': 1.0430256128311157} 11/07/2021 10:09:38 - INFO - __main__ - Step 91786: {'lr': 0.00016779169097341207, 'samples': 17622912, 'steps': 91785, 'loss/train': 1.3181840181350708} 11/07/2021 10:09:39 - INFO - __main__ - Step 91787: {'lr': 0.00016778667936622943, 'samples': 17623104, 'steps': 91786, 'loss/train': 1.4461135864257812} 11/07/2021 10:09:39 - INFO - __main__ - Step 91788: {'lr': 0.00016778166779609084, 'samples': 17623296, 'steps': 91787, 'loss/train': 1.9706623554229736} 11/07/2021 10:09:39 - INFO - __main__ - Step 91789: {'lr': 0.00016777665626299855, 'samples': 17623488, 'steps': 91788, 'loss/train': 1.5172191858291626} 11/07/2021 10:09:40 - INFO - __main__ - Step 91790: {'lr': 0.00016777164476695477, 'samples': 17623680, 'steps': 91789, 'loss/train': 1.060770034790039} 11/07/2021 10:09:41 - INFO - __main__ - Step 91791: {'lr': 0.0001677666333079619, 'samples': 17623872, 'steps': 91790, 'loss/train': 0.5671252608299255} 11/07/2021 10:09:41 - INFO - __main__ - Step 91792: {'lr': 0.00016776162188602217, 'samples': 17624064, 'steps': 91791, 'loss/train': 1.537505865097046} 11/07/2021 10:09:41 - INFO - __main__ - Step 91793: {'lr': 0.0001677566105011377, 'samples': 17624256, 'steps': 91792, 'loss/train': 1.3052971363067627} 11/07/2021 10:09:42 - INFO - __main__ - Step 91794: {'lr': 0.00016775159915331087, 'samples': 17624448, 'steps': 91793, 'loss/train': 0.9706909656524658} 11/07/2021 10:09:42 - INFO - __main__ - Step 91795: {'lr': 0.00016774658784254388, 'samples': 17624640, 'steps': 91794, 'loss/train': 1.0790338516235352} 11/07/2021 10:09:43 - INFO - __main__ - Step 91796: {'lr': 0.00016774157656883898, 'samples': 17624832, 'steps': 91795, 'loss/train': 1.7553493976593018} 11/07/2021 10:09:43 - INFO - __main__ - Step 91797: {'lr': 0.00016773656533219846, 'samples': 17625024, 'steps': 91796, 'loss/train': 1.5702881813049316} 11/07/2021 10:09:44 - INFO - __main__ - Step 91798: {'lr': 0.0001677315541326246, 'samples': 17625216, 'steps': 91797, 'loss/train': 1.0457606315612793} 11/07/2021 10:09:44 - INFO - __main__ - Step 91799: {'lr': 0.00016772654297011964, 'samples': 17625408, 'steps': 91798, 'loss/train': 1.1878726482391357} 11/07/2021 10:09:44 - INFO - __main__ - Step 91800: {'lr': 0.0001677215318446858, 'samples': 17625600, 'steps': 91799, 'loss/train': 1.469807744026184} 11/07/2021 10:09:45 - INFO - __main__ - Step 91801: {'lr': 0.00016771652075632537, 'samples': 17625792, 'steps': 91800, 'loss/train': 0.9198329448699951} 11/07/2021 10:09:46 - INFO - __main__ - Step 91802: {'lr': 0.00016771150970504062, 'samples': 17625984, 'steps': 91801, 'loss/train': 1.831748366355896} 11/07/2021 10:09:46 - INFO - __main__ - Step 91803: {'lr': 0.00016770649869083377, 'samples': 17626176, 'steps': 91802, 'loss/train': 1.746943712234497} 11/07/2021 10:09:47 - INFO - __main__ - Step 91804: {'lr': 0.00016770148771370715, 'samples': 17626368, 'steps': 91803, 'loss/train': 1.0706857442855835} 11/07/2021 10:09:47 - INFO - __main__ - Step 91805: {'lr': 0.0001676964767736629, 'samples': 17626560, 'steps': 91804, 'loss/train': 1.1043473482131958} 11/07/2021 10:09:47 - INFO - __main__ - Step 91806: {'lr': 0.0001676914658707035, 'samples': 17626752, 'steps': 91805, 'loss/train': 1.2335140705108643} 11/07/2021 10:09:48 - INFO - __main__ - Step 91807: {'lr': 0.00016768645500483094, 'samples': 17626944, 'steps': 91806, 'loss/train': 1.415031909942627} 11/07/2021 10:09:49 - INFO - __main__ - Step 91808: {'lr': 0.00016768144417604757, 'samples': 17627136, 'steps': 91807, 'loss/train': 0.07076407968997955} 11/07/2021 10:09:49 - INFO - __main__ - Step 91809: {'lr': 0.00016767643338435573, 'samples': 17627328, 'steps': 91808, 'loss/train': 1.7566646337509155} 11/07/2021 10:09:50 - INFO - __main__ - Step 91810: {'lr': 0.00016767142262975757, 'samples': 17627520, 'steps': 91809, 'loss/train': 1.1894193887710571} 11/07/2021 10:09:50 - INFO - __main__ - Step 91811: {'lr': 0.0001676664119122554, 'samples': 17627712, 'steps': 91810, 'loss/train': 0.6246582269668579} 11/07/2021 10:09:50 - INFO - __main__ - Step 91812: {'lr': 0.0001676614012318515, 'samples': 17627904, 'steps': 91811, 'loss/train': 1.5874758958816528} 11/07/2021 10:09:52 - INFO - __main__ - Step 91813: {'lr': 0.0001676563905885481, 'samples': 17628096, 'steps': 91812, 'loss/train': 1.5331116914749146} 11/07/2021 10:09:53 - INFO - __main__ - Step 91814: {'lr': 0.00016765137998234742, 'samples': 17628288, 'steps': 91813, 'loss/train': 1.2034823894500732} 11/07/2021 10:09:53 - INFO - __main__ - Step 91815: {'lr': 0.00016764636941325178, 'samples': 17628480, 'steps': 91814, 'loss/train': 0.5697569251060486} 11/07/2021 10:09:53 - INFO - __main__ - Step 91816: {'lr': 0.00016764135888126341, 'samples': 17628672, 'steps': 91815, 'loss/train': 1.0809221267700195} 11/07/2021 10:09:54 - INFO - __main__ - Step 91817: {'lr': 0.0001676363483863846, 'samples': 17628864, 'steps': 91816, 'loss/train': 0.39431601762771606} 11/07/2021 10:09:54 - INFO - __main__ - Step 91818: {'lr': 0.00016763133792861758, 'samples': 17629056, 'steps': 91817, 'loss/train': 1.4370901584625244} 11/07/2021 10:09:54 - INFO - __main__ - Step 91819: {'lr': 0.0001676263275079647, 'samples': 17629248, 'steps': 91818, 'loss/train': 0.31468191742897034} 11/07/2021 10:09:55 - INFO - __main__ - Step 91820: {'lr': 0.00016762131712442802, 'samples': 17629440, 'steps': 91819, 'loss/train': 1.3145970106124878} 11/07/2021 10:09:56 - INFO - __main__ - Step 91821: {'lr': 0.00016761630677800989, 'samples': 17629632, 'steps': 91820, 'loss/train': 1.1241768598556519} 11/07/2021 10:09:56 - INFO - __main__ - Step 91822: {'lr': 0.00016761129646871258, 'samples': 17629824, 'steps': 91821, 'loss/train': 1.6475907564163208} 11/07/2021 10:09:56 - INFO - __main__ - Step 91823: {'lr': 0.00016760628619653836, 'samples': 17630016, 'steps': 91822, 'loss/train': 0.794076144695282} 11/07/2021 10:09:57 - INFO - __main__ - Step 91824: {'lr': 0.00016760127596148947, 'samples': 17630208, 'steps': 91823, 'loss/train': 1.5634466409683228} 11/07/2021 10:09:58 - INFO - __main__ - Step 91825: {'lr': 0.0001675962657635682, 'samples': 17630400, 'steps': 91824, 'loss/train': 1.6207095384597778} 11/07/2021 10:09:58 - INFO - __main__ - Step 91826: {'lr': 0.00016759125560277674, 'samples': 17630592, 'steps': 91825, 'loss/train': 1.4690898656845093} 11/07/2021 10:09:58 - INFO - __main__ - Step 91827: {'lr': 0.0001675862454791174, 'samples': 17630784, 'steps': 91826, 'loss/train': 1.277138352394104} 11/07/2021 10:09:59 - INFO - __main__ - Step 91828: {'lr': 0.00016758123539259247, 'samples': 17630976, 'steps': 91827, 'loss/train': 1.4161934852600098} 11/07/2021 10:09:59 - INFO - __main__ - Step 91829: {'lr': 0.0001675762253432041, 'samples': 17631168, 'steps': 91828, 'loss/train': 0.36946165561676025} 11/07/2021 10:10:00 - INFO - __main__ - Step 91830: {'lr': 0.00016757121533095466, 'samples': 17631360, 'steps': 91829, 'loss/train': 0.8304632902145386} 11/07/2021 10:10:01 - INFO - __main__ - Step 91831: {'lr': 0.00016756620535584633, 'samples': 17631552, 'steps': 91830, 'loss/train': 1.2192955017089844} 11/07/2021 10:10:01 - INFO - __main__ - Step 91832: {'lr': 0.00016756119541788138, 'samples': 17631744, 'steps': 91831, 'loss/train': 1.3958888053894043} 11/07/2021 10:10:01 - INFO - __main__ - Step 91833: {'lr': 0.00016755618551706224, 'samples': 17631936, 'steps': 91832, 'loss/train': 1.182146668434143} 11/07/2021 10:10:02 - INFO - __main__ - Step 91834: {'lr': 0.00016755117565339084, 'samples': 17632128, 'steps': 91833, 'loss/train': 0.9454692006111145} 11/07/2021 10:10:02 - INFO - __main__ - Step 91835: {'lr': 0.00016754616582686965, 'samples': 17632320, 'steps': 91834, 'loss/train': 1.4541261196136475} 11/07/2021 10:10:03 - INFO - __main__ - Step 91836: {'lr': 0.0001675411560375009, 'samples': 17632512, 'steps': 91835, 'loss/train': 1.5213663578033447} 11/07/2021 10:10:03 - INFO - __main__ - Step 91837: {'lr': 0.00016753614628528678, 'samples': 17632704, 'steps': 91836, 'loss/train': 1.8083611726760864} 11/07/2021 10:10:04 - INFO - __main__ - Step 91838: {'lr': 0.00016753113657022966, 'samples': 17632896, 'steps': 91837, 'loss/train': 1.4375536441802979} 11/07/2021 10:10:04 - INFO - __main__ - Step 91839: {'lr': 0.00016752612689233172, 'samples': 17633088, 'steps': 91838, 'loss/train': 1.3095091581344604} 11/07/2021 10:10:04 - INFO - __main__ - Step 91840: {'lr': 0.00016752111725159522, 'samples': 17633280, 'steps': 91839, 'loss/train': 1.9299100637435913} 11/07/2021 10:10:05 - INFO - __main__ - Step 91841: {'lr': 0.00016751610764802245, 'samples': 17633472, 'steps': 91840, 'loss/train': 0.7477200627326965} 11/07/2021 10:10:06 - INFO - __main__ - Step 91842: {'lr': 0.00016751109808161563, 'samples': 17633664, 'steps': 91841, 'loss/train': 1.005792260169983} 11/07/2021 10:10:06 - INFO - __main__ - Step 91843: {'lr': 0.00016750608855237704, 'samples': 17633856, 'steps': 91842, 'loss/train': 1.060685157775879} 11/07/2021 10:10:06 - INFO - __main__ - Step 91844: {'lr': 0.0001675010790603089, 'samples': 17634048, 'steps': 91843, 'loss/train': 1.4054099321365356} 11/07/2021 10:10:07 - INFO - __main__ - Step 91845: {'lr': 0.00016749606960541358, 'samples': 17634240, 'steps': 91844, 'loss/train': 1.2755486965179443} 11/07/2021 10:10:08 - INFO - __main__ - Step 91846: {'lr': 0.00016749106018769332, 'samples': 17634432, 'steps': 91845, 'loss/train': 1.2931032180786133} 11/07/2021 10:10:08 - INFO - __main__ - Step 91847: {'lr': 0.00016748605080715018, 'samples': 17634624, 'steps': 91846, 'loss/train': 1.445236086845398} 11/07/2021 10:10:09 - INFO - __main__ - Step 91848: {'lr': 0.0001674810414637866, 'samples': 17634816, 'steps': 91847, 'loss/train': 1.571693778038025} 11/07/2021 10:10:09 - INFO - __main__ - Step 91849: {'lr': 0.00016747603215760477, 'samples': 17635008, 'steps': 91848, 'loss/train': 1.5622859001159668} 11/07/2021 10:10:09 - INFO - __main__ - Step 91850: {'lr': 0.00016747102288860695, 'samples': 17635200, 'steps': 91849, 'loss/train': 1.596989631652832} 11/07/2021 10:10:10 - INFO - __main__ - Step 91851: {'lr': 0.00016746601365679543, 'samples': 17635392, 'steps': 91850, 'loss/train': 1.207690954208374} 11/07/2021 10:10:11 - INFO - __main__ - Step 91852: {'lr': 0.00016746100446217245, 'samples': 17635584, 'steps': 91851, 'loss/train': 1.1746630668640137} 11/07/2021 10:10:11 - INFO - __main__ - Step 91853: {'lr': 0.0001674559953047403, 'samples': 17635776, 'steps': 91852, 'loss/train': 1.1982896327972412} 11/07/2021 10:10:11 - INFO - __main__ - Step 91854: {'lr': 0.00016745098618450117, 'samples': 17635968, 'steps': 91853, 'loss/train': 1.1730855703353882} 11/07/2021 10:10:12 - INFO - __main__ - Step 91855: {'lr': 0.00016744597710145734, 'samples': 17636160, 'steps': 91854, 'loss/train': 1.2750327587127686} 11/07/2021 10:10:13 - INFO - __main__ - Step 91856: {'lr': 0.0001674409680556111, 'samples': 17636352, 'steps': 91855, 'loss/train': 1.183664083480835} 11/07/2021 10:10:13 - INFO - __main__ - Step 91857: {'lr': 0.00016743595904696469, 'samples': 17636544, 'steps': 91856, 'loss/train': 1.3440372943878174} 11/07/2021 10:10:13 - INFO - __main__ - Step 91858: {'lr': 0.00016743095007552033, 'samples': 17636736, 'steps': 91857, 'loss/train': 1.8155484199523926} 11/07/2021 10:10:14 - INFO - __main__ - Step 91859: {'lr': 0.0001674259411412804, 'samples': 17636928, 'steps': 91858, 'loss/train': 1.106895923614502} 11/07/2021 10:10:14 - INFO - __main__ - Step 91860: {'lr': 0.00016742093224424704, 'samples': 17637120, 'steps': 91859, 'loss/train': 1.4396179914474487} 11/07/2021 10:10:15 - INFO - __main__ - Step 91861: {'lr': 0.00016741592338442252, 'samples': 17637312, 'steps': 91860, 'loss/train': 0.7146191596984863} 11/07/2021 10:10:15 - INFO - __main__ - Step 91862: {'lr': 0.00016741091456180907, 'samples': 17637504, 'steps': 91861, 'loss/train': 0.871457040309906} 11/07/2021 10:10:16 - INFO - __main__ - Step 91863: {'lr': 0.000167405905776409, 'samples': 17637696, 'steps': 91862, 'loss/train': 1.1745514869689941} 11/07/2021 10:10:16 - INFO - __main__ - Step 91864: {'lr': 0.00016740089702822457, 'samples': 17637888, 'steps': 91863, 'loss/train': 0.8603522181510925} 11/07/2021 10:10:17 - INFO - __main__ - Step 91865: {'lr': 0.000167395888317258, 'samples': 17638080, 'steps': 91864, 'loss/train': 1.1935484409332275} 11/07/2021 10:10:17 - INFO - __main__ - Step 91866: {'lr': 0.00016739087964351158, 'samples': 17638272, 'steps': 91865, 'loss/train': 1.4139254093170166} 11/07/2021 10:10:18 - INFO - __main__ - Step 91867: {'lr': 0.00016738587100698755, 'samples': 17638464, 'steps': 91866, 'loss/train': 1.3690831661224365} 11/07/2021 10:10:18 - INFO - __main__ - Step 91868: {'lr': 0.0001673808624076882, 'samples': 17638656, 'steps': 91867, 'loss/train': 0.11132645606994629} 11/07/2021 10:10:19 - INFO - __main__ - Step 91869: {'lr': 0.0001673758538456157, 'samples': 17638848, 'steps': 91868, 'loss/train': 1.6824114322662354} 11/07/2021 10:10:19 - INFO - __main__ - Step 91870: {'lr': 0.00016737084532077246, 'samples': 17639040, 'steps': 91869, 'loss/train': 1.3902621269226074} 11/07/2021 10:10:19 - INFO - __main__ - Step 91871: {'lr': 0.00016736583683316057, 'samples': 17639232, 'steps': 91870, 'loss/train': 1.3951977491378784} 11/07/2021 10:10:21 - INFO - __main__ - Step 91872: {'lr': 0.00016736082838278234, 'samples': 17639424, 'steps': 91871, 'loss/train': 1.3771923780441284} 11/07/2021 10:10:21 - INFO - __main__ - Step 91873: {'lr': 0.00016735581996964015, 'samples': 17639616, 'steps': 91872, 'loss/train': 1.1658549308776855} 11/07/2021 10:10:21 - INFO - __main__ - Step 91874: {'lr': 0.00016735081159373604, 'samples': 17639808, 'steps': 91873, 'loss/train': 1.4144906997680664} 11/07/2021 10:10:22 - INFO - __main__ - Step 91875: {'lr': 0.00016734580325507243, 'samples': 17640000, 'steps': 91874, 'loss/train': 1.3294072151184082} 11/07/2021 10:10:22 - INFO - __main__ - Step 91876: {'lr': 0.0001673407949536515, 'samples': 17640192, 'steps': 91875, 'loss/train': 1.6828725337982178} 11/07/2021 10:10:23 - INFO - __main__ - Step 91877: {'lr': 0.0001673357866894756, 'samples': 17640384, 'steps': 91876, 'loss/train': 1.2490061521530151} 11/07/2021 10:10:23 - INFO - __main__ - Step 91878: {'lr': 0.00016733077846254682, 'samples': 17640576, 'steps': 91877, 'loss/train': 1.4577020406723022} 11/07/2021 10:10:24 - INFO - __main__ - Step 91879: {'lr': 0.00016732577027286756, 'samples': 17640768, 'steps': 91878, 'loss/train': 1.2086719274520874} 11/07/2021 10:10:24 - INFO - __main__ - Step 91880: {'lr': 0.00016732076212044002, 'samples': 17640960, 'steps': 91879, 'loss/train': 1.3906055688858032} 11/07/2021 10:10:24 - INFO - __main__ - Step 91881: {'lr': 0.00016731575400526656, 'samples': 17641152, 'steps': 91880, 'loss/train': 1.7333104610443115} 11/07/2021 10:10:25 - INFO - __main__ - Step 91882: {'lr': 0.00016731074592734924, 'samples': 17641344, 'steps': 91881, 'loss/train': 0.3336120843887329} 11/07/2021 10:10:26 - INFO - __main__ - Step 91883: {'lr': 0.00016730573788669047, 'samples': 17641536, 'steps': 91882, 'loss/train': 1.838733196258545} 11/07/2021 10:10:26 - INFO - __main__ - Step 91884: {'lr': 0.0001673007298832924, 'samples': 17641728, 'steps': 91883, 'loss/train': 1.3047734498977661} 11/07/2021 10:10:26 - INFO - __main__ - Step 91885: {'lr': 0.00016729572191715735, 'samples': 17641920, 'steps': 91884, 'loss/train': 1.4847843647003174} 11/07/2021 10:10:27 - INFO - __main__ - Step 91886: {'lr': 0.0001672907139882877, 'samples': 17642112, 'steps': 91885, 'loss/train': 0.8658822178840637} 11/07/2021 10:10:28 - INFO - __main__ - Step 91887: {'lr': 0.00016728570609668547, 'samples': 17642304, 'steps': 91886, 'loss/train': 1.4973599910736084} 11/07/2021 10:10:28 - INFO - __main__ - Step 91888: {'lr': 0.00016728069824235303, 'samples': 17642496, 'steps': 91887, 'loss/train': 1.2857085466384888} 11/07/2021 10:10:28 - INFO - __main__ - Step 91889: {'lr': 0.0001672756904252926, 'samples': 17642688, 'steps': 91888, 'loss/train': 1.027477502822876} 11/07/2021 10:10:29 - INFO - __main__ - Step 91890: {'lr': 0.00016727068264550652, 'samples': 17642880, 'steps': 91889, 'loss/train': 1.2857693433761597} 11/07/2021 10:10:29 - INFO - __main__ - Step 91891: {'lr': 0.00016726567490299698, 'samples': 17643072, 'steps': 91890, 'loss/train': 1.5794405937194824} 11/07/2021 10:10:31 - INFO - __main__ - Step 91892: {'lr': 0.00016726066719776627, 'samples': 17643264, 'steps': 91891, 'loss/train': 0.8918642401695251} 11/07/2021 10:10:31 - INFO - __main__ - Step 91893: {'lr': 0.00016725565952981663, 'samples': 17643456, 'steps': 91892, 'loss/train': 1.0737327337265015} 11/07/2021 10:10:32 - INFO - __main__ - Step 91894: {'lr': 0.00016725065189915028, 'samples': 17643648, 'steps': 91893, 'loss/train': 1.8559356927871704} 11/07/2021 10:10:32 - INFO - __main__ - Step 91895: {'lr': 0.0001672456443057695, 'samples': 17643840, 'steps': 91894, 'loss/train': 3.9659128189086914} 11/07/2021 10:10:32 - INFO - __main__ - Step 91896: {'lr': 0.00016724063674967656, 'samples': 17644032, 'steps': 91895, 'loss/train': 1.4302148818969727} 11/07/2021 10:10:33 - INFO - __main__ - Step 91897: {'lr': 0.00016723562923087374, 'samples': 17644224, 'steps': 91896, 'loss/train': 0.9906774759292603} 11/07/2021 10:10:33 - INFO - __main__ - Step 91898: {'lr': 0.00016723062174936327, 'samples': 17644416, 'steps': 91897, 'loss/train': 1.5995279550552368} 11/07/2021 10:10:34 - INFO - __main__ - Step 91899: {'lr': 0.00016722561430514737, 'samples': 17644608, 'steps': 91898, 'loss/train': 1.2626729011535645} 11/07/2021 10:10:34 - INFO - __main__ - Step 91900: {'lr': 0.00016722060689822838, 'samples': 17644800, 'steps': 91899, 'loss/train': 1.402150273323059} 11/07/2021 10:10:35 - INFO - __main__ - Step 91901: {'lr': 0.0001672155995286085, 'samples': 17644992, 'steps': 91900, 'loss/train': 1.372778296470642} 11/07/2021 10:10:35 - INFO - __main__ - Step 91902: {'lr': 0.0001672105921962899, 'samples': 17645184, 'steps': 91901, 'loss/train': 1.478979229927063} 11/07/2021 10:10:35 - INFO - __main__ - Step 91903: {'lr': 0.0001672055849012751, 'samples': 17645376, 'steps': 91902, 'loss/train': 1.2903920412063599} 11/07/2021 10:10:36 - INFO - __main__ - Step 91904: {'lr': 0.00016720057764356606, 'samples': 17645568, 'steps': 91903, 'loss/train': 1.8208274841308594} 11/07/2021 10:10:37 - INFO - __main__ - Step 91905: {'lr': 0.0001671955704231652, 'samples': 17645760, 'steps': 91904, 'loss/train': 0.3445129692554474} 11/07/2021 10:10:37 - INFO - __main__ - Step 91906: {'lr': 0.0001671905632400747, 'samples': 17645952, 'steps': 91905, 'loss/train': 1.203680396080017} 11/07/2021 10:10:38 - INFO - __main__ - Step 91907: {'lr': 0.0001671855560942969, 'samples': 17646144, 'steps': 91906, 'loss/train': 1.2547732591629028} 11/07/2021 10:10:38 - INFO - __main__ - Step 91908: {'lr': 0.00016718054898583396, 'samples': 17646336, 'steps': 91907, 'loss/train': 1.3642683029174805} 11/07/2021 10:10:39 - INFO - __main__ - Step 91909: {'lr': 0.00016717554191468824, 'samples': 17646528, 'steps': 91908, 'loss/train': 1.3276851177215576} 11/07/2021 10:10:40 - INFO - __main__ - Step 91910: {'lr': 0.0001671705348808619, 'samples': 17646720, 'steps': 91909, 'loss/train': 0.9112508893013} 11/07/2021 10:10:40 - INFO - __main__ - Step 91911: {'lr': 0.00016716552788435723, 'samples': 17646912, 'steps': 91910, 'loss/train': 1.2668919563293457} 11/07/2021 10:10:40 - INFO - __main__ - Step 91912: {'lr': 0.00016716052092517652, 'samples': 17647104, 'steps': 91911, 'loss/train': 0.8250459432601929} 11/07/2021 10:10:41 - INFO - __main__ - Step 91913: {'lr': 0.00016715551400332208, 'samples': 17647296, 'steps': 91912, 'loss/train': 1.6908462047576904} 11/07/2021 10:10:42 - INFO - __main__ - Step 91914: {'lr': 0.00016715050711879604, 'samples': 17647488, 'steps': 91913, 'loss/train': 1.4055365324020386} 11/07/2021 10:10:42 - INFO - __main__ - Step 91915: {'lr': 0.0001671455002716007, 'samples': 17647680, 'steps': 91914, 'loss/train': 1.3979474306106567} 11/07/2021 10:10:42 - INFO - __main__ - Step 91916: {'lr': 0.00016714049346173827, 'samples': 17647872, 'steps': 91915, 'loss/train': 1.2436186075210571} 11/07/2021 10:10:43 - INFO - __main__ - Step 91917: {'lr': 0.00016713548668921107, 'samples': 17648064, 'steps': 91916, 'loss/train': 1.3501816987991333} 11/07/2021 10:10:43 - INFO - __main__ - Step 91918: {'lr': 0.00016713047995402136, 'samples': 17648256, 'steps': 91917, 'loss/train': 1.2151390314102173} 11/07/2021 10:10:44 - INFO - __main__ - Step 91919: {'lr': 0.00016712547325617132, 'samples': 17648448, 'steps': 91918, 'loss/train': 0.14297455549240112} 11/07/2021 10:10:44 - INFO - __main__ - Step 91920: {'lr': 0.00016712046659566332, 'samples': 17648640, 'steps': 91919, 'loss/train': 1.6472066640853882} 11/07/2021 10:10:45 - INFO - __main__ - Step 91921: {'lr': 0.00016711545997249956, 'samples': 17648832, 'steps': 91920, 'loss/train': 1.3014812469482422} 11/07/2021 10:10:45 - INFO - __main__ - Step 91922: {'lr': 0.0001671104533866823, 'samples': 17649024, 'steps': 91921, 'loss/train': 2.996685028076172} 11/07/2021 10:10:45 - INFO - __main__ - Step 91923: {'lr': 0.00016710544683821375, 'samples': 17649216, 'steps': 91922, 'loss/train': 1.170531988143921} 11/07/2021 10:10:46 - INFO - __main__ - Step 91924: {'lr': 0.0001671004403270962, 'samples': 17649408, 'steps': 91923, 'loss/train': 1.4462532997131348} 11/07/2021 10:10:47 - INFO - __main__ - Step 91925: {'lr': 0.00016709543385333198, 'samples': 17649600, 'steps': 91924, 'loss/train': 1.4047396183013916} 11/07/2021 10:10:47 - INFO - __main__ - Step 91926: {'lr': 0.0001670904274169232, 'samples': 17649792, 'steps': 91925, 'loss/train': 1.6053026914596558} 11/07/2021 10:10:47 - INFO - __main__ - Step 91927: {'lr': 0.00016708542101787237, 'samples': 17649984, 'steps': 91926, 'loss/train': 1.1186975240707397} 11/07/2021 10:10:48 - INFO - __main__ - Step 91928: {'lr': 0.0001670804146561814, 'samples': 17650176, 'steps': 91927, 'loss/train': 1.3467894792556763} 11/07/2021 10:10:48 - INFO - __main__ - Step 91929: {'lr': 0.00016707540833185274, 'samples': 17650368, 'steps': 91928, 'loss/train': 2.305441379547119} 11/07/2021 10:10:49 - INFO - __main__ - Step 91930: {'lr': 0.00016707040204488866, 'samples': 17650560, 'steps': 91929, 'loss/train': 1.839630365371704} 11/07/2021 10:10:50 - INFO - __main__ - Step 91931: {'lr': 0.00016706539579529133, 'samples': 17650752, 'steps': 91930, 'loss/train': 1.6427216529846191} 11/07/2021 10:10:50 - INFO - __main__ - Step 91932: {'lr': 0.00016706038958306306, 'samples': 17650944, 'steps': 91931, 'loss/train': 1.5411795377731323} 11/07/2021 10:10:50 - INFO - __main__ - Step 91933: {'lr': 0.0001670553834082061, 'samples': 17651136, 'steps': 91932, 'loss/train': 1.3530820608139038} 11/07/2021 10:10:51 - INFO - __main__ - Step 91934: {'lr': 0.00016705037727072271, 'samples': 17651328, 'steps': 91933, 'loss/train': 1.492550253868103} 11/07/2021 10:10:52 - INFO - __main__ - Step 91935: {'lr': 0.00016704537117061513, 'samples': 17651520, 'steps': 91934, 'loss/train': 1.2728625535964966} 11/07/2021 10:10:52 - INFO - __main__ - Step 91936: {'lr': 0.00016704036510788568, 'samples': 17651712, 'steps': 91935, 'loss/train': 1.6451783180236816} 11/07/2021 10:10:52 - INFO - __main__ - Step 91937: {'lr': 0.00016703535908253647, 'samples': 17651904, 'steps': 91936, 'loss/train': 1.3317437171936035} 11/07/2021 10:10:53 - INFO - __main__ - Step 91938: {'lr': 0.00016703035309456992, 'samples': 17652096, 'steps': 91937, 'loss/train': 1.3119559288024902} 11/07/2021 10:10:53 - INFO - __main__ - Step 91939: {'lr': 0.0001670253471439882, 'samples': 17652288, 'steps': 91938, 'loss/train': 1.2768608331680298} 11/07/2021 10:10:54 - INFO - __main__ - Step 91940: {'lr': 0.00016702034123079366, 'samples': 17652480, 'steps': 91939, 'loss/train': 1.1725164651870728} 11/07/2021 10:10:54 - INFO - __main__ - Step 91941: {'lr': 0.00016701533535498837, 'samples': 17652672, 'steps': 91940, 'loss/train': 0.8258083462715149} 11/07/2021 10:10:55 - INFO - __main__ - Step 91942: {'lr': 0.00016701032951657469, 'samples': 17652864, 'steps': 91941, 'loss/train': 0.9240174293518066} 11/07/2021 10:10:55 - INFO - __main__ - Step 91943: {'lr': 0.00016700532371555487, 'samples': 17653056, 'steps': 91942, 'loss/train': 1.226022481918335} 11/07/2021 10:10:56 - INFO - __main__ - Step 91944: {'lr': 0.00016700031795193122, 'samples': 17653248, 'steps': 91943, 'loss/train': 0.9614821076393127} 11/07/2021 10:10:56 - INFO - __main__ - Step 91945: {'lr': 0.0001669953122257059, 'samples': 17653440, 'steps': 91944, 'loss/train': 1.3164931535720825} 11/07/2021 10:10:57 - INFO - __main__ - Step 91946: {'lr': 0.00016699030653688122, 'samples': 17653632, 'steps': 91945, 'loss/train': 0.6806685328483582} 11/07/2021 10:10:57 - INFO - __main__ - Step 91947: {'lr': 0.00016698530088545943, 'samples': 17653824, 'steps': 91946, 'loss/train': 2.1166622638702393} 11/07/2021 10:10:58 - INFO - __main__ - Step 91948: {'lr': 0.00016698029527144277, 'samples': 17654016, 'steps': 91947, 'loss/train': 2.203840970993042} 11/07/2021 10:10:58 - INFO - __main__ - Step 91949: {'lr': 0.00016697528969483353, 'samples': 17654208, 'steps': 91948, 'loss/train': 0.9148712158203125} 11/07/2021 10:10:59 - INFO - __main__ - Step 91950: {'lr': 0.00016697028415563393, 'samples': 17654400, 'steps': 91949, 'loss/train': 1.3979592323303223} 11/07/2021 10:10:59 - INFO - __main__ - Step 91951: {'lr': 0.00016696527865384627, 'samples': 17654592, 'steps': 91950, 'loss/train': 1.5619536638259888} 11/07/2021 10:11:00 - INFO - __main__ - Step 91952: {'lr': 0.0001669602731894727, 'samples': 17654784, 'steps': 91951, 'loss/train': 1.5370814800262451} 11/07/2021 10:11:00 - INFO - __main__ - Step 91953: {'lr': 0.0001669552677625156, 'samples': 17654976, 'steps': 91952, 'loss/train': 1.092864751815796} 11/07/2021 10:11:00 - INFO - __main__ - Step 91954: {'lr': 0.00016695026237297729, 'samples': 17655168, 'steps': 91953, 'loss/train': 1.4376345872879028} 11/07/2021 10:11:01 - INFO - __main__ - Step 91955: {'lr': 0.00016694525702085978, 'samples': 17655360, 'steps': 91954, 'loss/train': 1.240407943725586} 11/07/2021 10:11:02 - INFO - __main__ - Step 91956: {'lr': 0.00016694025170616546, 'samples': 17655552, 'steps': 91955, 'loss/train': 1.3312492370605469} 11/07/2021 10:11:02 - INFO - __main__ - Step 91957: {'lr': 0.00016693524642889658, 'samples': 17655744, 'steps': 91956, 'loss/train': 1.276569128036499} 11/07/2021 10:11:02 - INFO - __main__ - Step 91958: {'lr': 0.0001669302411890554, 'samples': 17655936, 'steps': 91957, 'loss/train': 0.8648205399513245} 11/07/2021 10:11:03 - INFO - __main__ - Step 91959: {'lr': 0.00016692523598664416, 'samples': 17656128, 'steps': 91958, 'loss/train': 0.8832384347915649} 11/07/2021 10:11:04 - INFO - __main__ - Step 91960: {'lr': 0.00016692023082166515, 'samples': 17656320, 'steps': 91959, 'loss/train': 1.3172355890274048} 11/07/2021 10:11:05 - INFO - __main__ - Step 91961: {'lr': 0.0001669152256941206, 'samples': 17656512, 'steps': 91960, 'loss/train': 1.5251035690307617} 11/07/2021 10:11:05 - INFO - __main__ - Step 91962: {'lr': 0.00016691022060401274, 'samples': 17656704, 'steps': 91961, 'loss/train': 1.193008303642273} 11/07/2021 10:11:05 - INFO - __main__ - Step 91963: {'lr': 0.00016690521555134388, 'samples': 17656896, 'steps': 91962, 'loss/train': 1.391801357269287} 11/07/2021 10:11:06 - INFO - __main__ - Step 91964: {'lr': 0.00016690021053611626, 'samples': 17657088, 'steps': 91963, 'loss/train': 1.4479491710662842} 11/07/2021 10:11:06 - INFO - __main__ - Step 91965: {'lr': 0.0001668952055583321, 'samples': 17657280, 'steps': 91964, 'loss/train': 1.672492504119873} 11/07/2021 10:11:06 - INFO - __main__ - Step 91966: {'lr': 0.00016689020061799368, 'samples': 17657472, 'steps': 91965, 'loss/train': 1.2241929769515991} 11/07/2021 10:11:08 - INFO - __main__ - Step 91967: {'lr': 0.00016688519571510336, 'samples': 17657664, 'steps': 91966, 'loss/train': 0.16158387064933777} 11/07/2021 10:11:08 - INFO - __main__ - Step 91968: {'lr': 0.00016688019084966317, 'samples': 17657856, 'steps': 91967, 'loss/train': 1.3018863201141357} 11/07/2021 10:11:08 - INFO - __main__ - Step 91969: {'lr': 0.0001668751860216755, 'samples': 17658048, 'steps': 91968, 'loss/train': 1.4821237325668335} 11/07/2021 10:11:09 - INFO - __main__ - Step 91970: {'lr': 0.00016687018123114257, 'samples': 17658240, 'steps': 91969, 'loss/train': 1.2155210971832275} 11/07/2021 10:11:09 - INFO - __main__ - Step 91971: {'lr': 0.00016686517647806668, 'samples': 17658432, 'steps': 91970, 'loss/train': 1.401302695274353} 11/07/2021 10:11:10 - INFO - __main__ - Step 91972: {'lr': 0.00016686017176245006, 'samples': 17658624, 'steps': 91971, 'loss/train': 1.2395039796829224} 11/07/2021 10:11:10 - INFO - __main__ - Step 91973: {'lr': 0.00016685516708429493, 'samples': 17658816, 'steps': 91972, 'loss/train': 0.5890994071960449} 11/07/2021 10:11:11 - INFO - __main__ - Step 91974: {'lr': 0.0001668501624436036, 'samples': 17659008, 'steps': 91973, 'loss/train': 1.667759895324707} 11/07/2021 10:11:11 - INFO - __main__ - Step 91975: {'lr': 0.0001668451578403783, 'samples': 17659200, 'steps': 91974, 'loss/train': 1.3990060091018677} 11/07/2021 10:11:11 - INFO - __main__ - Step 91976: {'lr': 0.0001668401532746213, 'samples': 17659392, 'steps': 91975, 'loss/train': 0.8518202304840088} 11/07/2021 10:11:12 - INFO - __main__ - Step 91977: {'lr': 0.00016683514874633483, 'samples': 17659584, 'steps': 91976, 'loss/train': 1.222362756729126} 11/07/2021 10:11:13 - INFO - __main__ - Step 91978: {'lr': 0.00016683014425552116, 'samples': 17659776, 'steps': 91977, 'loss/train': 1.4802435636520386} 11/07/2021 10:11:13 - INFO - __main__ - Step 91979: {'lr': 0.00016682513980218256, 'samples': 17659968, 'steps': 91978, 'loss/train': 0.934403121471405} 11/07/2021 10:11:13 - INFO - __main__ - Step 91980: {'lr': 0.00016682013538632125, 'samples': 17660160, 'steps': 91979, 'loss/train': 1.4060888290405273} 11/07/2021 10:11:14 - INFO - __main__ - Step 91981: {'lr': 0.0001668151310079396, 'samples': 17660352, 'steps': 91980, 'loss/train': 1.3747645616531372} 11/07/2021 10:11:15 - INFO - __main__ - Step 91982: {'lr': 0.0001668101266670397, 'samples': 17660544, 'steps': 91981, 'loss/train': 1.0364562273025513} 11/07/2021 10:11:15 - INFO - __main__ - Step 91983: {'lr': 0.00016680512236362383, 'samples': 17660736, 'steps': 91982, 'loss/train': 1.8087139129638672} 11/07/2021 10:11:16 - INFO - __main__ - Step 91984: {'lr': 0.0001668001180976943, 'samples': 17660928, 'steps': 91983, 'loss/train': 0.768398106098175} 11/07/2021 10:11:16 - INFO - __main__ - Step 91985: {'lr': 0.00016679511386925337, 'samples': 17661120, 'steps': 91984, 'loss/train': 1.2891408205032349} 11/07/2021 10:11:16 - INFO - __main__ - Step 91986: {'lr': 0.00016679010967830327, 'samples': 17661312, 'steps': 91985, 'loss/train': 1.4079355001449585} 11/07/2021 10:11:17 - INFO - __main__ - Step 91987: {'lr': 0.00016678510552484626, 'samples': 17661504, 'steps': 91986, 'loss/train': 1.065571904182434} 11/07/2021 10:11:18 - INFO - __main__ - Step 91988: {'lr': 0.0001667801014088846, 'samples': 17661696, 'steps': 91987, 'loss/train': 1.2451632022857666} 11/07/2021 10:11:18 - INFO - __main__ - Step 91989: {'lr': 0.0001667750973304205, 'samples': 17661888, 'steps': 91988, 'loss/train': 1.6771624088287354} 11/07/2021 10:11:18 - INFO - __main__ - Step 91990: {'lr': 0.00016677009328945632, 'samples': 17662080, 'steps': 91989, 'loss/train': 1.5705349445343018} 11/07/2021 10:11:19 - INFO - __main__ - Step 91991: {'lr': 0.00016676508928599424, 'samples': 17662272, 'steps': 91990, 'loss/train': 0.8454370498657227} 11/07/2021 10:11:20 - INFO - __main__ - Step 91992: {'lr': 0.0001667600853200365, 'samples': 17662464, 'steps': 91991, 'loss/train': 0.7959811091423035} 11/07/2021 10:11:20 - INFO - __main__ - Step 91993: {'lr': 0.0001667550813915854, 'samples': 17662656, 'steps': 91992, 'loss/train': 1.2728054523468018} 11/07/2021 10:11:20 - INFO - __main__ - Step 91994: {'lr': 0.00016675007750064331, 'samples': 17662848, 'steps': 91993, 'loss/train': 1.3146311044692993} 11/07/2021 10:11:21 - INFO - __main__ - Step 91995: {'lr': 0.0001667450736472122, 'samples': 17663040, 'steps': 91994, 'loss/train': 1.826762318611145} 11/07/2021 10:11:21 - INFO - __main__ - Step 91996: {'lr': 0.00016674006983129447, 'samples': 17663232, 'steps': 91995, 'loss/train': 1.2910809516906738} 11/07/2021 10:11:22 - INFO - __main__ - Step 91997: {'lr': 0.0001667350660528924, 'samples': 17663424, 'steps': 91996, 'loss/train': 1.571846842765808} 11/07/2021 10:11:22 - INFO - __main__ - Step 91998: {'lr': 0.00016673006231200823, 'samples': 17663616, 'steps': 91997, 'loss/train': 0.9745010137557983} 11/07/2021 10:11:23 - INFO - __main__ - Step 91999: {'lr': 0.0001667250586086442, 'samples': 17663808, 'steps': 91998, 'loss/train': 1.2294108867645264} 11/07/2021 10:11:23 - INFO - __main__ - Step 92000: {'lr': 0.00016672005494280256, 'samples': 17664000, 'steps': 91999, 'loss/train': 1.6080527305603027} 11/07/2021 10:11:23 - INFO - __main__ - Step 92001: {'lr': 0.0001667150513144856, 'samples': 17664192, 'steps': 92000, 'loss/train': 1.5566900968551636} 11/07/2021 10:11:24 - INFO - __main__ - Step 92002: {'lr': 0.00016671004772369555, 'samples': 17664384, 'steps': 92001, 'loss/train': 1.162291407585144} 11/07/2021 10:11:25 - INFO - __main__ - Step 92003: {'lr': 0.00016670504417043465, 'samples': 17664576, 'steps': 92002, 'loss/train': 1.1639130115509033} 11/07/2021 10:11:25 - INFO - __main__ - Step 92004: {'lr': 0.0001667000406547052, 'samples': 17664768, 'steps': 92003, 'loss/train': 1.3032506704330444} 11/07/2021 10:11:26 - INFO - __main__ - Step 92005: {'lr': 0.00016669503717650947, 'samples': 17664960, 'steps': 92004, 'loss/train': 0.9881738424301147} 11/07/2021 10:11:26 - INFO - __main__ - Step 92006: {'lr': 0.0001666900337358496, 'samples': 17665152, 'steps': 92005, 'loss/train': 1.349738597869873} 11/07/2021 10:11:26 - INFO - __main__ - Step 92007: {'lr': 0.00016668503033272797, 'samples': 17665344, 'steps': 92006, 'loss/train': 1.1971434354782104} 11/07/2021 10:11:27 - INFO - __main__ - Step 92008: {'lr': 0.00016668002696714675, 'samples': 17665536, 'steps': 92007, 'loss/train': 1.6320090293884277} 11/07/2021 10:11:28 - INFO - __main__ - Step 92009: {'lr': 0.0001666750236391082, 'samples': 17665728, 'steps': 92008, 'loss/train': 1.1374142169952393} 11/07/2021 10:11:28 - INFO - __main__ - Step 92010: {'lr': 0.00016667002034861461, 'samples': 17665920, 'steps': 92009, 'loss/train': 1.4029624462127686} 11/07/2021 10:11:28 - INFO - __main__ - Step 92011: {'lr': 0.00016666501709566823, 'samples': 17666112, 'steps': 92010, 'loss/train': 1.698442816734314} 11/07/2021 10:11:29 - INFO - __main__ - Step 92012: {'lr': 0.0001666600138802713, 'samples': 17666304, 'steps': 92011, 'loss/train': 0.7461891174316406} 11/07/2021 10:11:30 - INFO - __main__ - Step 92013: {'lr': 0.0001666550107024261, 'samples': 17666496, 'steps': 92012, 'loss/train': 1.522713541984558} 11/07/2021 10:11:30 - INFO - __main__ - Step 92014: {'lr': 0.00016665000756213482, 'samples': 17666688, 'steps': 92013, 'loss/train': 1.8797643184661865} 11/07/2021 10:11:30 - INFO - __main__ - Step 92015: {'lr': 0.0001666450044593998, 'samples': 17666880, 'steps': 92014, 'loss/train': 1.1924840211868286} 11/07/2021 10:11:31 - INFO - __main__ - Step 92016: {'lr': 0.0001666400013942233, 'samples': 17667072, 'steps': 92015, 'loss/train': 0.7250218987464905} 11/07/2021 10:11:31 - INFO - __main__ - Step 92017: {'lr': 0.00016663499836660746, 'samples': 17667264, 'steps': 92016, 'loss/train': 1.3252747058868408} 11/07/2021 10:11:32 - INFO - __main__ - Step 92018: {'lr': 0.0001666299953765546, 'samples': 17667456, 'steps': 92017, 'loss/train': 1.7588675022125244} 11/07/2021 10:11:33 - INFO - __main__ - Step 92019: {'lr': 0.000166624992424067, 'samples': 17667648, 'steps': 92018, 'loss/train': 1.1490817070007324} 11/07/2021 10:11:33 - INFO - __main__ - Step 92020: {'lr': 0.0001666199895091469, 'samples': 17667840, 'steps': 92019, 'loss/train': 1.3565585613250732} 11/07/2021 10:11:33 - INFO - __main__ - Step 92021: {'lr': 0.0001666149866317966, 'samples': 17668032, 'steps': 92020, 'loss/train': 1.1641041040420532} 11/07/2021 10:11:34 - INFO - __main__ - Step 92022: {'lr': 0.0001666099837920182, 'samples': 17668224, 'steps': 92021, 'loss/train': 1.4902268648147583} 11/07/2021 10:11:35 - INFO - __main__ - Step 92023: {'lr': 0.00016660498098981409, 'samples': 17668416, 'steps': 92022, 'loss/train': 0.7538528442382812} 11/07/2021 10:11:35 - INFO - __main__ - Step 92024: {'lr': 0.0001665999782251865, 'samples': 17668608, 'steps': 92023, 'loss/train': 1.6207515001296997} 11/07/2021 10:11:35 - INFO - __main__ - Step 92025: {'lr': 0.00016659497549813761, 'samples': 17668800, 'steps': 92024, 'loss/train': 1.3065069913864136} 11/07/2021 10:11:36 - INFO - __main__ - Step 92026: {'lr': 0.00016658997280866988, 'samples': 17668992, 'steps': 92025, 'loss/train': 1.3709720373153687} 11/07/2021 10:11:36 - INFO - __main__ - Step 92027: {'lr': 0.00016658497015678531, 'samples': 17669184, 'steps': 92026, 'loss/train': 1.3402259349822998} 11/07/2021 10:11:36 - INFO - __main__ - Step 92028: {'lr': 0.00016657996754248627, 'samples': 17669376, 'steps': 92027, 'loss/train': 0.30998748540878296} 11/07/2021 10:11:37 - INFO - __main__ - Step 92029: {'lr': 0.00016657496496577505, 'samples': 17669568, 'steps': 92028, 'loss/train': 1.5023657083511353} 11/07/2021 10:11:38 - INFO - __main__ - Step 92030: {'lr': 0.00016656996242665382, 'samples': 17669760, 'steps': 92029, 'loss/train': 1.3937673568725586} 11/07/2021 10:11:38 - INFO - __main__ - Step 92031: {'lr': 0.0001665649599251249, 'samples': 17669952, 'steps': 92030, 'loss/train': 1.6237385272979736} 11/07/2021 10:11:38 - INFO - __main__ - Step 92032: {'lr': 0.0001665599574611905, 'samples': 17670144, 'steps': 92031, 'loss/train': 1.273848295211792} 11/07/2021 10:11:39 - INFO - __main__ - Step 92033: {'lr': 0.0001665549550348529, 'samples': 17670336, 'steps': 92032, 'loss/train': 1.319827675819397} 11/07/2021 10:11:40 - INFO - __main__ - Step 92034: {'lr': 0.0001665499526461144, 'samples': 17670528, 'steps': 92033, 'loss/train': 1.3673663139343262} 11/07/2021 10:11:40 - INFO - __main__ - Step 92035: {'lr': 0.00016654495029497717, 'samples': 17670720, 'steps': 92034, 'loss/train': 1.778522253036499} 11/07/2021 10:11:40 - INFO - __main__ - Step 92036: {'lr': 0.0001665399479814435, 'samples': 17670912, 'steps': 92035, 'loss/train': 1.5508043766021729} 11/07/2021 10:11:41 - INFO - __main__ - Step 92037: {'lr': 0.0001665349457055157, 'samples': 17671104, 'steps': 92036, 'loss/train': 3.254067897796631} 11/07/2021 10:11:41 - INFO - __main__ - Step 92038: {'lr': 0.0001665299434671959, 'samples': 17671296, 'steps': 92037, 'loss/train': 1.6548306941986084} 11/07/2021 10:11:42 - INFO - __main__ - Step 92039: {'lr': 0.00016652494126648636, 'samples': 17671488, 'steps': 92038, 'loss/train': 1.3639755249023438} 11/07/2021 10:11:42 - INFO - __main__ - Step 92040: {'lr': 0.00016651993910338946, 'samples': 17671680, 'steps': 92039, 'loss/train': 1.4355967044830322} 11/07/2021 10:11:43 - INFO - __main__ - Step 92041: {'lr': 0.0001665149369779074, 'samples': 17671872, 'steps': 92040, 'loss/train': 1.6300972700119019} 11/07/2021 10:11:43 - INFO - __main__ - Step 92042: {'lr': 0.0001665099348900424, 'samples': 17672064, 'steps': 92041, 'loss/train': 1.5457468032836914} 11/07/2021 10:11:44 - INFO - __main__ - Step 92043: {'lr': 0.00016650493283979672, 'samples': 17672256, 'steps': 92042, 'loss/train': 1.2049754858016968} 11/07/2021 10:11:45 - INFO - __main__ - Step 92044: {'lr': 0.00016649993082717263, 'samples': 17672448, 'steps': 92043, 'loss/train': 1.9368315935134888} 11/07/2021 10:11:45 - INFO - __main__ - Step 92045: {'lr': 0.00016649492885217242, 'samples': 17672640, 'steps': 92044, 'loss/train': 1.3642021417617798} 11/07/2021 10:11:45 - INFO - __main__ - Step 92046: {'lr': 0.00016648992691479828, 'samples': 17672832, 'steps': 92045, 'loss/train': 1.4723700284957886} 11/07/2021 10:11:46 - INFO - __main__ - Step 92047: {'lr': 0.00016648492501505246, 'samples': 17673024, 'steps': 92046, 'loss/train': 0.9938133358955383} 11/07/2021 10:11:46 - INFO - __main__ - Step 92048: {'lr': 0.00016647992315293742, 'samples': 17673216, 'steps': 92047, 'loss/train': 1.8928035497665405} 11/07/2021 10:11:47 - INFO - __main__ - Step 92049: {'lr': 0.00016647492132845508, 'samples': 17673408, 'steps': 92048, 'loss/train': 1.4026352167129517} 11/07/2021 10:11:48 - INFO - __main__ - Step 92050: {'lr': 0.00016646991954160785, 'samples': 17673600, 'steps': 92049, 'loss/train': 1.2859355211257935} 11/07/2021 10:11:48 - INFO - __main__ - Step 92051: {'lr': 0.000166464917792398, 'samples': 17673792, 'steps': 92050, 'loss/train': 0.978943943977356} 11/07/2021 10:11:48 - INFO - __main__ - Step 92052: {'lr': 0.00016645991608082777, 'samples': 17673984, 'steps': 92051, 'loss/train': 0.8232744336128235} 11/07/2021 10:11:49 - INFO - __main__ - Step 92053: {'lr': 0.00016645491440689942, 'samples': 17674176, 'steps': 92052, 'loss/train': 1.0420681238174438} 11/07/2021 10:11:49 - INFO - __main__ - Step 92054: {'lr': 0.00016644991277061516, 'samples': 17674368, 'steps': 92053, 'loss/train': 1.2996106147766113} 11/07/2021 10:11:50 - INFO - __main__ - Step 92055: {'lr': 0.00016644491117197733, 'samples': 17674560, 'steps': 92054, 'loss/train': 0.8777112364768982} 11/07/2021 10:11:51 - INFO - __main__ - Step 92056: {'lr': 0.0001664399096109881, 'samples': 17674752, 'steps': 92055, 'loss/train': 1.3846968412399292} 11/07/2021 10:11:51 - INFO - __main__ - Step 92057: {'lr': 0.00016643490808764978, 'samples': 17674944, 'steps': 92056, 'loss/train': 1.3913638591766357} 11/07/2021 10:11:51 - INFO - __main__ - Step 92058: {'lr': 0.00016642990660196462, 'samples': 17675136, 'steps': 92057, 'loss/train': 1.2355598211288452} 11/07/2021 10:11:52 - INFO - __main__ - Step 92059: {'lr': 0.0001664249051539348, 'samples': 17675328, 'steps': 92058, 'loss/train': 1.219313383102417} 11/07/2021 10:11:53 - INFO - __main__ - Step 92060: {'lr': 0.00016641990374356263, 'samples': 17675520, 'steps': 92059, 'loss/train': 1.451598882675171} 11/07/2021 10:11:53 - INFO - __main__ - Step 92061: {'lr': 0.0001664149023708505, 'samples': 17675712, 'steps': 92060, 'loss/train': 1.3029855489730835} 11/07/2021 10:11:53 - INFO - __main__ - Step 92062: {'lr': 0.0001664099010358004, 'samples': 17675904, 'steps': 92061, 'loss/train': 1.2158759832382202} 11/07/2021 10:11:54 - INFO - __main__ - Step 92063: {'lr': 0.00016640489973841473, 'samples': 17676096, 'steps': 92062, 'loss/train': 1.5169618129730225} 11/07/2021 10:11:54 - INFO - __main__ - Step 92064: {'lr': 0.0001663998984786957, 'samples': 17676288, 'steps': 92063, 'loss/train': 1.4037553071975708} 11/07/2021 10:11:55 - INFO - __main__ - Step 92065: {'lr': 0.0001663948972566456, 'samples': 17676480, 'steps': 92064, 'loss/train': 1.6188607215881348} 11/07/2021 10:11:55 - INFO - __main__ - Step 92066: {'lr': 0.00016638989607226668, 'samples': 17676672, 'steps': 92065, 'loss/train': 1.4558687210083008} 11/07/2021 10:11:56 - INFO - __main__ - Step 92067: {'lr': 0.00016638489492556115, 'samples': 17676864, 'steps': 92066, 'loss/train': 1.3779982328414917} 11/07/2021 10:11:56 - INFO - __main__ - Step 92068: {'lr': 0.00016637989381653131, 'samples': 17677056, 'steps': 92067, 'loss/train': 1.0359196662902832} 11/07/2021 10:11:56 - INFO - __main__ - Step 92069: {'lr': 0.0001663748927451794, 'samples': 17677248, 'steps': 92068, 'loss/train': 1.2197723388671875} 11/07/2021 10:11:57 - INFO - __main__ - Step 92070: {'lr': 0.00016636989171150767, 'samples': 17677440, 'steps': 92069, 'loss/train': 1.4773694276809692} 11/07/2021 10:11:58 - INFO - __main__ - Step 92071: {'lr': 0.0001663648907155184, 'samples': 17677632, 'steps': 92070, 'loss/train': 1.4767259359359741} 11/07/2021 10:11:58 - INFO - __main__ - Step 92072: {'lr': 0.0001663598897572138, 'samples': 17677824, 'steps': 92071, 'loss/train': 1.6190112829208374} 11/07/2021 10:11:59 - INFO - __main__ - Step 92073: {'lr': 0.00016635488883659616, 'samples': 17678016, 'steps': 92072, 'loss/train': 1.4253791570663452} 11/07/2021 10:11:59 - INFO - __main__ - Step 92074: {'lr': 0.00016634988795366767, 'samples': 17678208, 'steps': 92073, 'loss/train': 0.9169307947158813} 11/07/2021 10:11:59 - INFO - __main__ - Step 92075: {'lr': 0.00016634488710843076, 'samples': 17678400, 'steps': 92074, 'loss/train': 2.1825380325317383} 11/07/2021 10:12:00 - INFO - __main__ - Step 92076: {'lr': 0.00016633988630088747, 'samples': 17678592, 'steps': 92075, 'loss/train': 1.081493616104126} 11/07/2021 10:12:01 - INFO - __main__ - Step 92077: {'lr': 0.00016633488553104015, 'samples': 17678784, 'steps': 92076, 'loss/train': 0.9690409898757935} 11/07/2021 10:12:01 - INFO - __main__ - Step 92078: {'lr': 0.000166329884798891, 'samples': 17678976, 'steps': 92077, 'loss/train': 1.0128252506256104} 11/07/2021 10:12:01 - INFO - __main__ - Step 92079: {'lr': 0.0001663248841044423, 'samples': 17679168, 'steps': 92078, 'loss/train': 1.0008471012115479} 11/07/2021 10:12:02 - INFO - __main__ - Step 92080: {'lr': 0.00016631988344769632, 'samples': 17679360, 'steps': 92079, 'loss/train': 1.203241229057312} 11/07/2021 10:12:03 - INFO - __main__ - Step 92081: {'lr': 0.00016631488282865537, 'samples': 17679552, 'steps': 92080, 'loss/train': 1.4903638362884521} 11/07/2021 10:12:03 - INFO - __main__ - Step 92082: {'lr': 0.00016630988224732157, 'samples': 17679744, 'steps': 92081, 'loss/train': 1.1286550760269165} 11/07/2021 10:12:03 - INFO - __main__ - Step 92083: {'lr': 0.0001663048817036973, 'samples': 17679936, 'steps': 92082, 'loss/train': 1.195517897605896} 11/07/2021 10:12:04 - INFO - __main__ - Step 92084: {'lr': 0.00016629988119778473, 'samples': 17680128, 'steps': 92083, 'loss/train': 1.5007719993591309} 11/07/2021 10:12:04 - INFO - __main__ - Step 92085: {'lr': 0.00016629488072958615, 'samples': 17680320, 'steps': 92084, 'loss/train': 1.4634177684783936} 11/07/2021 10:12:05 - INFO - __main__ - Step 92086: {'lr': 0.00016628988029910381, 'samples': 17680512, 'steps': 92085, 'loss/train': 1.217764139175415} 11/07/2021 10:12:05 - INFO - __main__ - Step 92087: {'lr': 0.00016628487990633995, 'samples': 17680704, 'steps': 92086, 'loss/train': 1.272161841392517} 11/07/2021 10:12:06 - INFO - __main__ - Step 92088: {'lr': 0.00016627987955129692, 'samples': 17680896, 'steps': 92087, 'loss/train': 0.9661315083503723} 11/07/2021 10:12:06 - INFO - __main__ - Step 92089: {'lr': 0.0001662748792339768, 'samples': 17681088, 'steps': 92088, 'loss/train': 1.5804036855697632} 11/07/2021 10:12:06 - INFO - __main__ - Step 92090: {'lr': 0.0001662698789543819, 'samples': 17681280, 'steps': 92089, 'loss/train': 1.0979976654052734} 11/07/2021 10:12:07 - INFO - __main__ - Step 92091: {'lr': 0.00016626487871251457, 'samples': 17681472, 'steps': 92090, 'loss/train': 1.7625802755355835} 11/07/2021 10:12:08 - INFO - __main__ - Step 92092: {'lr': 0.00016625987850837692, 'samples': 17681664, 'steps': 92091, 'loss/train': 0.9621965289115906} 11/07/2021 10:12:08 - INFO - __main__ - Step 92093: {'lr': 0.00016625487834197132, 'samples': 17681856, 'steps': 92092, 'loss/train': 1.2125904560089111} 11/07/2021 10:12:08 - INFO - __main__ - Step 92094: {'lr': 0.00016624987821329995, 'samples': 17682048, 'steps': 92093, 'loss/train': 1.4919517040252686} 11/07/2021 10:12:09 - INFO - __main__ - Step 92095: {'lr': 0.0001662448781223651, 'samples': 17682240, 'steps': 92094, 'loss/train': 1.3892090320587158} 11/07/2021 10:12:10 - INFO - __main__ - Step 92096: {'lr': 0.00016623987806916902, 'samples': 17682432, 'steps': 92095, 'loss/train': 1.4662965536117554} 11/07/2021 10:12:10 - INFO - __main__ - Step 92097: {'lr': 0.00016623487805371396, 'samples': 17682624, 'steps': 92096, 'loss/train': 1.7029953002929688} 11/07/2021 10:12:11 - INFO - __main__ - Step 92098: {'lr': 0.00016622987807600218, 'samples': 17682816, 'steps': 92097, 'loss/train': 1.2044155597686768} 11/07/2021 10:12:11 - INFO - __main__ - Step 92099: {'lr': 0.00016622487813603592, 'samples': 17683008, 'steps': 92098, 'loss/train': 1.1915199756622314} 11/07/2021 10:12:11 - INFO - __main__ - Step 92100: {'lr': 0.00016621987823381743, 'samples': 17683200, 'steps': 92099, 'loss/train': 1.4412692785263062} 11/07/2021 10:12:12 - INFO - __main__ - Step 92101: {'lr': 0.00016621487836934897, 'samples': 17683392, 'steps': 92100, 'loss/train': 1.6835157871246338} 11/07/2021 10:12:13 - INFO - __main__ - Step 92102: {'lr': 0.00016620987854263288, 'samples': 17683584, 'steps': 92101, 'loss/train': 1.4966604709625244} 11/07/2021 10:12:13 - INFO - __main__ - Step 92103: {'lr': 0.00016620487875367124, 'samples': 17683776, 'steps': 92102, 'loss/train': 1.7983207702636719} 11/07/2021 10:12:14 - INFO - __main__ - Step 92104: {'lr': 0.00016619987900246642, 'samples': 17683968, 'steps': 92103, 'loss/train': 0.09434466809034348} 11/07/2021 10:12:14 - INFO - __main__ - Step 92105: {'lr': 0.0001661948792890206, 'samples': 17684160, 'steps': 92104, 'loss/train': 1.4645920991897583} 11/07/2021 10:12:14 - INFO - __main__ - Step 92106: {'lr': 0.0001661898796133361, 'samples': 17684352, 'steps': 92105, 'loss/train': 1.4781665802001953} 11/07/2021 10:12:15 - INFO - __main__ - Step 92107: {'lr': 0.00016618487997541512, 'samples': 17684544, 'steps': 92106, 'loss/train': 1.1564713716506958} 11/07/2021 10:12:16 - INFO - __main__ - Step 92108: {'lr': 0.00016617988037525994, 'samples': 17684736, 'steps': 92107, 'loss/train': 1.5321601629257202} 11/07/2021 10:12:16 - INFO - __main__ - Step 92109: {'lr': 0.00016617488081287286, 'samples': 17684928, 'steps': 92108, 'loss/train': 1.0836044549942017} 11/07/2021 10:12:16 - INFO - __main__ - Step 92110: {'lr': 0.00016616988128825602, 'samples': 17685120, 'steps': 92109, 'loss/train': 1.5246654748916626} 11/07/2021 10:12:17 - INFO - __main__ - Step 92111: {'lr': 0.00016616488180141176, 'samples': 17685312, 'steps': 92110, 'loss/train': 1.9900081157684326} 11/07/2021 10:12:18 - INFO - __main__ - Step 92112: {'lr': 0.00016615988235234235, 'samples': 17685504, 'steps': 92111, 'loss/train': 1.4643583297729492} 11/07/2021 10:12:18 - INFO - __main__ - Step 92113: {'lr': 0.00016615488294104998, 'samples': 17685696, 'steps': 92112, 'loss/train': 0.4253024458885193} 11/07/2021 10:12:18 - INFO - __main__ - Step 92114: {'lr': 0.0001661498835675369, 'samples': 17685888, 'steps': 92113, 'loss/train': 4.162232875823975} 11/07/2021 10:12:19 - INFO - __main__ - Step 92115: {'lr': 0.00016614488423180552, 'samples': 17686080, 'steps': 92114, 'loss/train': 1.4727668762207031} 11/07/2021 10:12:19 - INFO - __main__ - Step 92116: {'lr': 0.00016613988493385784, 'samples': 17686272, 'steps': 92115, 'loss/train': 1.3601945638656616} 11/07/2021 10:12:20 - INFO - __main__ - Step 92117: {'lr': 0.0001661348856736962, 'samples': 17686464, 'steps': 92116, 'loss/train': 1.1360247135162354} 11/07/2021 10:12:21 - INFO - __main__ - Step 92118: {'lr': 0.00016612988645132296, 'samples': 17686656, 'steps': 92117, 'loss/train': 1.395470142364502} 11/07/2021 10:12:21 - INFO - __main__ - Step 92119: {'lr': 0.00016612488726674027, 'samples': 17686848, 'steps': 92118, 'loss/train': 1.686354637145996} 11/07/2021 10:12:21 - INFO - __main__ - Step 92120: {'lr': 0.0001661198881199504, 'samples': 17687040, 'steps': 92119, 'loss/train': 1.3807494640350342} 11/07/2021 10:12:22 - INFO - __main__ - Step 92121: {'lr': 0.00016611488901095562, 'samples': 17687232, 'steps': 92120, 'loss/train': 1.193848967552185} 11/07/2021 10:12:22 - INFO - __main__ - Step 92122: {'lr': 0.00016610988993975818, 'samples': 17687424, 'steps': 92121, 'loss/train': 0.9783124327659607} 11/07/2021 10:12:23 - INFO - __main__ - Step 92123: {'lr': 0.00016610489090636033, 'samples': 17687616, 'steps': 92122, 'loss/train': 1.1788220405578613} 11/07/2021 10:12:23 - INFO - __main__ - Step 92124: {'lr': 0.00016609989191076433, 'samples': 17687808, 'steps': 92123, 'loss/train': 1.2745648622512817} 11/07/2021 10:12:24 - INFO - __main__ - Step 92125: {'lr': 0.00016609489295297243, 'samples': 17688000, 'steps': 92124, 'loss/train': 1.7271391153335571} 11/07/2021 10:12:24 - INFO - __main__ - Step 92126: {'lr': 0.00016608989403298684, 'samples': 17688192, 'steps': 92125, 'loss/train': 1.7592384815216064} 11/07/2021 10:12:24 - INFO - __main__ - Step 92127: {'lr': 0.00016608489515080989, 'samples': 17688384, 'steps': 92126, 'loss/train': 1.1100369691848755} 11/07/2021 10:12:26 - INFO - __main__ - Step 92128: {'lr': 0.00016607989630644385, 'samples': 17688576, 'steps': 92127, 'loss/train': 0.5429094433784485} 11/07/2021 10:12:26 - INFO - __main__ - Step 92129: {'lr': 0.00016607489749989086, 'samples': 17688768, 'steps': 92128, 'loss/train': 0.5170159935951233} 11/07/2021 10:12:26 - INFO - __main__ - Step 92130: {'lr': 0.0001660698987311532, 'samples': 17688960, 'steps': 92129, 'loss/train': 1.5197525024414062} 11/07/2021 10:12:27 - INFO - __main__ - Step 92131: {'lr': 0.0001660649000002331, 'samples': 17689152, 'steps': 92130, 'loss/train': 1.5535978078842163} 11/07/2021 10:12:27 - INFO - __main__ - Step 92132: {'lr': 0.00016605990130713294, 'samples': 17689344, 'steps': 92131, 'loss/train': 1.098056674003601} 11/07/2021 10:12:28 - INFO - __main__ - Step 92133: {'lr': 0.00016605490265185485, 'samples': 17689536, 'steps': 92132, 'loss/train': 1.14873468875885} 11/07/2021 10:12:29 - INFO - __main__ - Step 92134: {'lr': 0.0001660499040344011, 'samples': 17689728, 'steps': 92133, 'loss/train': 1.1941694021224976} 11/07/2021 10:12:29 - INFO - __main__ - Step 92135: {'lr': 0.00016604490545477405, 'samples': 17689920, 'steps': 92134, 'loss/train': 1.133203387260437} 11/07/2021 10:12:29 - INFO - __main__ - Step 92136: {'lr': 0.00016603990691297583, 'samples': 17690112, 'steps': 92135, 'loss/train': 5.035739898681641} 11/07/2021 10:12:30 - INFO - __main__ - Step 92137: {'lr': 0.00016603490840900873, 'samples': 17690304, 'steps': 92136, 'loss/train': 5.462688446044922} 11/07/2021 10:12:30 - INFO - __main__ - Step 92138: {'lr': 0.00016602990994287497, 'samples': 17690496, 'steps': 92137, 'loss/train': 5.291414260864258} 11/07/2021 10:12:31 - INFO - __main__ - Step 92139: {'lr': 0.00016602491151457695, 'samples': 17690688, 'steps': 92138, 'loss/train': 1.523160696029663} 11/07/2021 10:12:32 - INFO - __main__ - Step 92140: {'lr': 0.00016601991312411674, 'samples': 17690880, 'steps': 92139, 'loss/train': 1.775167465209961} 11/07/2021 10:12:32 - INFO - __main__ - Step 92141: {'lr': 0.00016601491477149664, 'samples': 17691072, 'steps': 92140, 'loss/train': 1.1928069591522217} 11/07/2021 10:12:32 - INFO - __main__ - Step 92142: {'lr': 0.00016600991645671897, 'samples': 17691264, 'steps': 92141, 'loss/train': 1.6944715976715088} 11/07/2021 10:12:33 - INFO - __main__ - Step 92143: {'lr': 0.00016600491817978592, 'samples': 17691456, 'steps': 92142, 'loss/train': 1.3250974416732788} 11/07/2021 10:12:33 - INFO - __main__ - Step 92144: {'lr': 0.00016599991994069974, 'samples': 17691648, 'steps': 92143, 'loss/train': 1.6694746017456055} 11/07/2021 10:12:34 - INFO - __main__ - Step 92145: {'lr': 0.00016599492173946268, 'samples': 17691840, 'steps': 92144, 'loss/train': 1.4090993404388428} 11/07/2021 10:12:34 - INFO - __main__ - Step 92146: {'lr': 0.00016598992357607704, 'samples': 17692032, 'steps': 92145, 'loss/train': 1.6423466205596924} 11/07/2021 10:12:35 - INFO - __main__ - Step 92147: {'lr': 0.00016598492545054502, 'samples': 17692224, 'steps': 92146, 'loss/train': 1.2443050146102905} 11/07/2021 10:12:35 - INFO - __main__ - Step 92148: {'lr': 0.00016597992736286894, 'samples': 17692416, 'steps': 92147, 'loss/train': 1.1656140089035034} 11/07/2021 10:12:35 - INFO - __main__ - Step 92149: {'lr': 0.00016597492931305096, 'samples': 17692608, 'steps': 92148, 'loss/train': 1.389173150062561} 11/07/2021 10:12:36 - INFO - __main__ - Step 92150: {'lr': 0.00016596993130109345, 'samples': 17692800, 'steps': 92149, 'loss/train': 1.1632498502731323} 11/07/2021 10:12:37 - INFO - __main__ - Step 92151: {'lr': 0.00016596493332699853, 'samples': 17692992, 'steps': 92150, 'loss/train': 1.474827766418457} 11/07/2021 10:12:37 - INFO - __main__ - Step 92152: {'lr': 0.00016595993539076853, 'samples': 17693184, 'steps': 92151, 'loss/train': 1.461876630783081} 11/07/2021 10:12:37 - INFO - __main__ - Step 92153: {'lr': 0.0001659549374924057, 'samples': 17693376, 'steps': 92152, 'loss/train': 1.3682267665863037} 11/07/2021 10:12:38 - INFO - __main__ - Step 92154: {'lr': 0.00016594993963191224, 'samples': 17693568, 'steps': 92153, 'loss/train': 1.6073150634765625} 11/07/2021 10:12:38 - INFO - __main__ - Step 92155: {'lr': 0.0001659449418092905, 'samples': 17693760, 'steps': 92154, 'loss/train': 1.6961345672607422} 11/07/2021 10:12:39 - INFO - __main__ - Step 92156: {'lr': 0.00016593994402454266, 'samples': 17693952, 'steps': 92155, 'loss/train': 1.603753685951233} 11/07/2021 10:12:40 - INFO - __main__ - Step 92157: {'lr': 0.00016593494627767095, 'samples': 17694144, 'steps': 92156, 'loss/train': 1.3461090326309204} 11/07/2021 10:12:40 - INFO - __main__ - Step 92158: {'lr': 0.00016592994856867767, 'samples': 17694336, 'steps': 92157, 'loss/train': 1.2156342267990112} 11/07/2021 10:12:40 - INFO - __main__ - Step 92159: {'lr': 0.00016592495089756505, 'samples': 17694528, 'steps': 92158, 'loss/train': 1.434714436531067} 11/07/2021 10:12:41 - INFO - __main__ - Step 92160: {'lr': 0.00016591995326433536, 'samples': 17694720, 'steps': 92159, 'loss/train': 1.923326015472412} 11/07/2021 10:12:42 - INFO - __main__ - Step 92161: {'lr': 0.00016591495566899085, 'samples': 17694912, 'steps': 92160, 'loss/train': 1.3811169862747192} 11/07/2021 10:12:42 - INFO - __main__ - Step 92162: {'lr': 0.00016590995811153374, 'samples': 17695104, 'steps': 92161, 'loss/train': 1.3524872064590454} 11/07/2021 10:12:42 - INFO - __main__ - Step 92163: {'lr': 0.0001659049605919663, 'samples': 17695296, 'steps': 92162, 'loss/train': 1.6706165075302124} 11/07/2021 10:12:43 - INFO - __main__ - Step 92164: {'lr': 0.00016589996311029082, 'samples': 17695488, 'steps': 92163, 'loss/train': 1.317513108253479} 11/07/2021 10:12:43 - INFO - __main__ - Step 92165: {'lr': 0.00016589496566650946, 'samples': 17695680, 'steps': 92164, 'loss/train': 1.4680181741714478} 11/07/2021 10:12:44 - INFO - __main__ - Step 92166: {'lr': 0.00016588996826062458, 'samples': 17695872, 'steps': 92165, 'loss/train': 1.4146065711975098} 11/07/2021 10:12:45 - INFO - __main__ - Step 92167: {'lr': 0.00016588497089263838, 'samples': 17696064, 'steps': 92166, 'loss/train': 1.1675868034362793} 11/07/2021 10:12:45 - INFO - __main__ - Step 92168: {'lr': 0.0001658799735625531, 'samples': 17696256, 'steps': 92167, 'loss/train': 1.1041988134384155} 11/07/2021 10:12:45 - INFO - __main__ - Step 92169: {'lr': 0.00016587497627037107, 'samples': 17696448, 'steps': 92168, 'loss/train': 1.440588116645813} 11/07/2021 10:12:46 - INFO - __main__ - Step 92170: {'lr': 0.0001658699790160944, 'samples': 17696640, 'steps': 92169, 'loss/train': 1.2954624891281128} 11/07/2021 10:12:47 - INFO - __main__ - Step 92171: {'lr': 0.00016586498179972545, 'samples': 17696832, 'steps': 92170, 'loss/train': 2.5771007537841797} 11/07/2021 10:12:47 - INFO - __main__ - Step 92172: {'lr': 0.00016585998462126646, 'samples': 17697024, 'steps': 92171, 'loss/train': 1.8174364566802979} 11/07/2021 10:12:47 - INFO - __main__ - Step 92173: {'lr': 0.00016585498748071965, 'samples': 17697216, 'steps': 92172, 'loss/train': 1.2318834066390991} 11/07/2021 10:12:48 - INFO - __main__ - Step 92174: {'lr': 0.00016584999037808727, 'samples': 17697408, 'steps': 92173, 'loss/train': 1.2764123678207397} 11/07/2021 10:12:48 - INFO - __main__ - Step 92175: {'lr': 0.00016584499331337156, 'samples': 17697600, 'steps': 92174, 'loss/train': 5.75645112991333} 11/07/2021 10:12:48 - INFO - __main__ - Step 92176: {'lr': 0.00016583999628657481, 'samples': 17697792, 'steps': 92175, 'loss/train': 1.201303243637085} 11/07/2021 10:12:49 - INFO - __main__ - Step 92177: {'lr': 0.0001658349992976993, 'samples': 17697984, 'steps': 92176, 'loss/train': 1.1644598245620728} 11/07/2021 10:12:50 - INFO - __main__ - Step 92178: {'lr': 0.00016583000234674718, 'samples': 17698176, 'steps': 92177, 'loss/train': 1.4796603918075562} 11/07/2021 10:12:50 - INFO - __main__ - Step 92179: {'lr': 0.0001658250054337208, 'samples': 17698368, 'steps': 92178, 'loss/train': 1.6563578844070435} 11/07/2021 10:12:50 - INFO - __main__ - Step 92180: {'lr': 0.00016582000855862232, 'samples': 17698560, 'steps': 92179, 'loss/train': 1.5356673002243042} 11/07/2021 10:12:51 - INFO - __main__ - Step 92181: {'lr': 0.00016581501172145414, 'samples': 17698752, 'steps': 92180, 'loss/train': 1.2953788042068481} 11/07/2021 10:12:52 - INFO - __main__ - Step 92182: {'lr': 0.0001658100149222184, 'samples': 17698944, 'steps': 92181, 'loss/train': 0.6869189739227295} 11/07/2021 10:12:52 - INFO - __main__ - Step 92183: {'lr': 0.00016580501816091737, 'samples': 17699136, 'steps': 92182, 'loss/train': 1.3701225519180298} 11/07/2021 10:12:53 - INFO - __main__ - Step 92184: {'lr': 0.00016580002143755328, 'samples': 17699328, 'steps': 92183, 'loss/train': 1.825148344039917} 11/07/2021 10:12:53 - INFO - __main__ - Step 92185: {'lr': 0.00016579502475212837, 'samples': 17699520, 'steps': 92184, 'loss/train': 0.6823630928993225} 11/07/2021 10:12:53 - INFO - __main__ - Step 92186: {'lr': 0.00016579002810464494, 'samples': 17699712, 'steps': 92185, 'loss/train': 1.3858530521392822} 11/07/2021 10:12:54 - INFO - __main__ - Step 92187: {'lr': 0.00016578503149510522, 'samples': 17699904, 'steps': 92186, 'loss/train': 1.573978304862976} 11/07/2021 10:12:55 - INFO - __main__ - Step 92188: {'lr': 0.00016578003492351146, 'samples': 17700096, 'steps': 92187, 'loss/train': 1.3890085220336914} 11/07/2021 10:12:55 - INFO - __main__ - Step 92189: {'lr': 0.00016577503838986592, 'samples': 17700288, 'steps': 92188, 'loss/train': 0.8488559722900391} 11/07/2021 10:12:55 - INFO - __main__ - Step 92190: {'lr': 0.00016577004189417084, 'samples': 17700480, 'steps': 92189, 'loss/train': 1.6211507320404053} 11/07/2021 10:12:56 - INFO - __main__ - Step 92191: {'lr': 0.0001657650454364285, 'samples': 17700672, 'steps': 92190, 'loss/train': 1.6216620206832886} 11/07/2021 10:12:56 - INFO - __main__ - Step 92192: {'lr': 0.0001657600490166411, 'samples': 17700864, 'steps': 92191, 'loss/train': 1.5925285816192627} 11/07/2021 10:12:57 - INFO - __main__ - Step 92193: {'lr': 0.00016575505263481094, 'samples': 17701056, 'steps': 92192, 'loss/train': 1.530869960784912} 11/07/2021 10:12:57 - INFO - __main__ - Step 92194: {'lr': 0.00016575005629094024, 'samples': 17701248, 'steps': 92193, 'loss/train': 1.8232022523880005} 11/07/2021 10:12:58 - INFO - __main__ - Step 92195: {'lr': 0.0001657450599850313, 'samples': 17701440, 'steps': 92194, 'loss/train': 1.524389386177063} 11/07/2021 10:12:58 - INFO - __main__ - Step 92196: {'lr': 0.00016574006371708645, 'samples': 17701632, 'steps': 92195, 'loss/train': 1.3165194988250732} 11/07/2021 10:12:58 - INFO - __main__ - Step 92197: {'lr': 0.00016573506748710764, 'samples': 17701824, 'steps': 92196, 'loss/train': 1.6325491666793823} 11/07/2021 10:13:00 - INFO - __main__ - Step 92198: {'lr': 0.00016573007129509738, 'samples': 17702016, 'steps': 92197, 'loss/train': 1.5200084447860718} 11/07/2021 10:13:00 - INFO - __main__ - Step 92199: {'lr': 0.00016572507514105785, 'samples': 17702208, 'steps': 92198, 'loss/train': 0.7703402042388916} 11/07/2021 10:13:00 - INFO - __main__ - Step 92200: {'lr': 0.00016572007902499125, 'samples': 17702400, 'steps': 92199, 'loss/train': 1.3797039985656738} 11/07/2021 10:13:01 - INFO - __main__ - Step 92201: {'lr': 0.0001657150829468999, 'samples': 17702592, 'steps': 92200, 'loss/train': 1.5646997690200806} 11/07/2021 10:13:01 - INFO - __main__ - Step 92202: {'lr': 0.00016571008690678609, 'samples': 17702784, 'steps': 92201, 'loss/train': 1.427489161491394} 11/07/2021 10:13:02 - INFO - __main__ - Step 92203: {'lr': 0.00016570509090465196, 'samples': 17702976, 'steps': 92202, 'loss/train': 1.0219730138778687} 11/07/2021 10:13:02 - INFO - __main__ - Step 92204: {'lr': 0.00016570009494049981, 'samples': 17703168, 'steps': 92203, 'loss/train': 1.046779990196228} 11/07/2021 10:13:03 - INFO - __main__ - Step 92205: {'lr': 0.0001656950990143319, 'samples': 17703360, 'steps': 92204, 'loss/train': 0.7716943025588989} 11/07/2021 10:13:03 - INFO - __main__ - Step 92206: {'lr': 0.00016569010312615052, 'samples': 17703552, 'steps': 92205, 'loss/train': 1.5537039041519165} 11/07/2021 10:13:03 - INFO - __main__ - Step 92207: {'lr': 0.0001656851072759578, 'samples': 17703744, 'steps': 92206, 'loss/train': 1.141514539718628} 11/07/2021 10:13:04 - INFO - __main__ - Step 92208: {'lr': 0.00016568011146375617, 'samples': 17703936, 'steps': 92207, 'loss/train': 1.5408062934875488} 11/07/2021 10:13:05 - INFO - __main__ - Step 92209: {'lr': 0.0001656751156895478, 'samples': 17704128, 'steps': 92208, 'loss/train': 1.5594803094863892} 11/07/2021 10:13:05 - INFO - __main__ - Step 92210: {'lr': 0.00016567011995333487, 'samples': 17704320, 'steps': 92209, 'loss/train': 1.3859676122665405} 11/07/2021 10:13:05 - INFO - __main__ - Step 92211: {'lr': 0.00016566512425511966, 'samples': 17704512, 'steps': 92210, 'loss/train': 1.3215720653533936} 11/07/2021 10:13:06 - INFO - __main__ - Step 92212: {'lr': 0.00016566012859490443, 'samples': 17704704, 'steps': 92211, 'loss/train': 1.4015387296676636} 11/07/2021 10:13:07 - INFO - __main__ - Step 92213: {'lr': 0.00016565513297269146, 'samples': 17704896, 'steps': 92212, 'loss/train': 1.694804072380066} 11/07/2021 10:13:07 - INFO - __main__ - Step 92214: {'lr': 0.000165650137388483, 'samples': 17705088, 'steps': 92213, 'loss/train': 1.3234082460403442} 11/07/2021 10:13:07 - INFO - __main__ - Step 92215: {'lr': 0.00016564514184228124, 'samples': 17705280, 'steps': 92214, 'loss/train': 1.088897466659546} 11/07/2021 10:13:08 - INFO - __main__ - Step 92216: {'lr': 0.00016564014633408853, 'samples': 17705472, 'steps': 92215, 'loss/train': 1.2609424591064453} 11/07/2021 10:13:08 - INFO - __main__ - Step 92217: {'lr': 0.00016563515086390706, 'samples': 17705664, 'steps': 92216, 'loss/train': 1.3093591928482056} 11/07/2021 10:13:08 - INFO - __main__ - Step 92218: {'lr': 0.00016563015543173907, 'samples': 17705856, 'steps': 92217, 'loss/train': 1.34196937084198} 11/07/2021 10:13:10 - INFO - __main__ - Step 92219: {'lr': 0.0001656251600375868, 'samples': 17706048, 'steps': 92218, 'loss/train': 1.1725915670394897} 11/07/2021 10:13:10 - INFO - __main__ - Step 92220: {'lr': 0.00016562016468145261, 'samples': 17706240, 'steps': 92219, 'loss/train': 1.5330439805984497} 11/07/2021 10:13:10 - INFO - __main__ - Step 92221: {'lr': 0.00016561516936333863, 'samples': 17706432, 'steps': 92220, 'loss/train': 1.6134881973266602} 11/07/2021 10:13:11 - INFO - __main__ - Step 92222: {'lr': 0.00016561017408324712, 'samples': 17706624, 'steps': 92221, 'loss/train': 1.4464941024780273} 11/07/2021 10:13:11 - INFO - __main__ - Step 92223: {'lr': 0.00016560517884118054, 'samples': 17706816, 'steps': 92222, 'loss/train': 0.19022728502750397} 11/07/2021 10:13:12 - INFO - __main__ - Step 92224: {'lr': 0.0001656001836371408, 'samples': 17707008, 'steps': 92223, 'loss/train': 0.9435681700706482} 11/07/2021 10:13:13 - INFO - __main__ - Step 92225: {'lr': 0.00016559518847113035, 'samples': 17707200, 'steps': 92224, 'loss/train': 0.8004502654075623} 11/07/2021 10:13:13 - INFO - __main__ - Step 92226: {'lr': 0.00016559019334315138, 'samples': 17707392, 'steps': 92225, 'loss/train': 0.1104946956038475} 11/07/2021 10:13:14 - INFO - __main__ - Step 92227: {'lr': 0.00016558519825320616, 'samples': 17707584, 'steps': 92226, 'loss/train': 0.8894897103309631} 11/07/2021 10:13:14 - INFO - __main__ - Step 92228: {'lr': 0.00016558020320129696, 'samples': 17707776, 'steps': 92227, 'loss/train': 1.449589729309082} 11/07/2021 10:13:15 - INFO - __main__ - Step 92229: {'lr': 0.00016557520818742607, 'samples': 17707968, 'steps': 92228, 'loss/train': 1.3165888786315918} 11/07/2021 10:13:15 - INFO - __main__ - Step 92230: {'lr': 0.0001655702132115956, 'samples': 17708160, 'steps': 92229, 'loss/train': 1.1718388795852661} 11/07/2021 10:13:16 - INFO - __main__ - Step 92231: {'lr': 0.00016556521827380794, 'samples': 17708352, 'steps': 92230, 'loss/train': 1.4697953462600708} 11/07/2021 10:13:16 - INFO - __main__ - Step 92232: {'lr': 0.0001655602233740653, 'samples': 17708544, 'steps': 92231, 'loss/train': 1.8869109153747559} 11/07/2021 10:13:16 - INFO - __main__ - Step 92233: {'lr': 0.00016555522851236987, 'samples': 17708736, 'steps': 92232, 'loss/train': 0.9745898842811584} 11/07/2021 10:13:17 - INFO - __main__ - Step 92234: {'lr': 0.00016555023368872396, 'samples': 17708928, 'steps': 92233, 'loss/train': 0.8397597670555115} 11/07/2021 10:13:18 - INFO - __main__ - Step 92235: {'lr': 0.00016554523890312982, 'samples': 17709120, 'steps': 92234, 'loss/train': 0.9836950898170471} 11/07/2021 10:13:18 - INFO - __main__ - Step 92236: {'lr': 0.00016554024415558983, 'samples': 17709312, 'steps': 92235, 'loss/train': 1.4947245121002197} 11/07/2021 10:13:18 - INFO - __main__ - Step 92237: {'lr': 0.000165535249446106, 'samples': 17709504, 'steps': 92236, 'loss/train': 0.8305336236953735} 11/07/2021 10:13:19 - INFO - __main__ - Step 92238: {'lr': 0.00016553025477468065, 'samples': 17709696, 'steps': 92237, 'loss/train': 1.284397840499878} 11/07/2021 10:13:20 - INFO - __main__ - Step 92239: {'lr': 0.0001655252601413161, 'samples': 17709888, 'steps': 92238, 'loss/train': 1.2645610570907593} 11/07/2021 10:13:20 - INFO - __main__ - Step 92240: {'lr': 0.0001655202655460145, 'samples': 17710080, 'steps': 92239, 'loss/train': 1.7602819204330444} 11/07/2021 10:13:20 - INFO - __main__ - Step 92241: {'lr': 0.00016551527098877821, 'samples': 17710272, 'steps': 92240, 'loss/train': 1.6237367391586304} 11/07/2021 10:13:21 - INFO - __main__ - Step 92242: {'lr': 0.00016551027646960942, 'samples': 17710464, 'steps': 92241, 'loss/train': 1.807320475578308} 11/07/2021 10:13:21 - INFO - __main__ - Step 92243: {'lr': 0.0001655052819885104, 'samples': 17710656, 'steps': 92242, 'loss/train': 0.9534923434257507} 11/07/2021 10:13:21 - INFO - __main__ - Step 92244: {'lr': 0.00016550028754548342, 'samples': 17710848, 'steps': 92243, 'loss/train': 0.2872805595397949} 11/07/2021 10:13:23 - INFO - __main__ - Step 92245: {'lr': 0.0001654952931405307, 'samples': 17711040, 'steps': 92244, 'loss/train': 1.4591186046600342} 11/07/2021 10:13:23 - INFO - __main__ - Step 92246: {'lr': 0.00016549029877365446, 'samples': 17711232, 'steps': 92245, 'loss/train': 1.3925271034240723} 11/07/2021 10:13:23 - INFO - __main__ - Step 92247: {'lr': 0.00016548530444485698, 'samples': 17711424, 'steps': 92246, 'loss/train': 1.3025211095809937} 11/07/2021 10:13:24 - INFO - __main__ - Step 92248: {'lr': 0.00016548031015414056, 'samples': 17711616, 'steps': 92247, 'loss/train': 1.38985013961792} 11/07/2021 10:13:24 - INFO - __main__ - Step 92249: {'lr': 0.0001654753159015074, 'samples': 17711808, 'steps': 92248, 'loss/train': 0.5489757061004639} 11/07/2021 10:13:25 - INFO - __main__ - Step 92250: {'lr': 0.00016547032168695987, 'samples': 17712000, 'steps': 92249, 'loss/train': 1.8265913724899292} 11/07/2021 10:13:26 - INFO - __main__ - Step 92251: {'lr': 0.00016546532751049998, 'samples': 17712192, 'steps': 92250, 'loss/train': 0.625476598739624} 11/07/2021 10:13:26 - INFO - __main__ - Step 92252: {'lr': 0.00016546033337213012, 'samples': 17712384, 'steps': 92251, 'loss/train': 1.3082361221313477} 11/07/2021 10:13:26 - INFO - __main__ - Step 92253: {'lr': 0.00016545533927185254, 'samples': 17712576, 'steps': 92252, 'loss/train': 0.8677536845207214} 11/07/2021 10:13:27 - INFO - __main__ - Step 92254: {'lr': 0.00016545034520966945, 'samples': 17712768, 'steps': 92253, 'loss/train': 1.3076732158660889} 11/07/2021 10:13:28 - INFO - __main__ - Step 92255: {'lr': 0.00016544535118558318, 'samples': 17712960, 'steps': 92254, 'loss/train': 1.493240237236023} 11/07/2021 10:13:28 - INFO - __main__ - Step 92256: {'lr': 0.00016544035719959587, 'samples': 17713152, 'steps': 92255, 'loss/train': 1.8246822357177734} 11/07/2021 10:13:28 - INFO - __main__ - Step 92257: {'lr': 0.00016543536325170987, 'samples': 17713344, 'steps': 92256, 'loss/train': 1.278038740158081} 11/07/2021 10:13:29 - INFO - __main__ - Step 92258: {'lr': 0.0001654303693419274, 'samples': 17713536, 'steps': 92257, 'loss/train': 1.6066726446151733} 11/07/2021 10:13:29 - INFO - __main__ - Step 92259: {'lr': 0.00016542537547025067, 'samples': 17713728, 'steps': 92258, 'loss/train': 1.3859056234359741} 11/07/2021 10:13:30 - INFO - __main__ - Step 92260: {'lr': 0.00016542038163668197, 'samples': 17713920, 'steps': 92259, 'loss/train': 0.8397838473320007} 11/07/2021 10:13:30 - INFO - __main__ - Step 92261: {'lr': 0.00016541538784122357, 'samples': 17714112, 'steps': 92260, 'loss/train': 0.8513132929801941} 11/07/2021 10:13:31 - INFO - __main__ - Step 92262: {'lr': 0.00016541039408387765, 'samples': 17714304, 'steps': 92261, 'loss/train': 1.2739574909210205} 11/07/2021 10:13:31 - INFO - __main__ - Step 92263: {'lr': 0.0001654054003646466, 'samples': 17714496, 'steps': 92262, 'loss/train': 1.2708771228790283} 11/07/2021 10:13:31 - INFO - __main__ - Step 92264: {'lr': 0.0001654004066835325, 'samples': 17714688, 'steps': 92263, 'loss/train': 1.44181227684021} 11/07/2021 10:13:32 - INFO - __main__ - Step 92265: {'lr': 0.00016539541304053766, 'samples': 17714880, 'steps': 92264, 'loss/train': 1.2245934009552002} 11/07/2021 10:13:33 - INFO - __main__ - Step 92266: {'lr': 0.00016539041943566433, 'samples': 17715072, 'steps': 92265, 'loss/train': 2.090813398361206} 11/07/2021 10:13:33 - INFO - __main__ - Step 92267: {'lr': 0.00016538542586891478, 'samples': 17715264, 'steps': 92266, 'loss/train': 1.1563440561294556} 11/07/2021 10:13:33 - INFO - __main__ - Step 92268: {'lr': 0.00016538043234029127, 'samples': 17715456, 'steps': 92267, 'loss/train': 1.3391245603561401} 11/07/2021 10:13:34 - INFO - __main__ - Step 92269: {'lr': 0.000165375438849796, 'samples': 17715648, 'steps': 92268, 'loss/train': 1.595704436302185} 11/07/2021 10:13:35 - INFO - __main__ - Step 92270: {'lr': 0.00016537044539743126, 'samples': 17715840, 'steps': 92269, 'loss/train': 0.786201000213623} 11/07/2021 10:13:35 - INFO - __main__ - Step 92271: {'lr': 0.0001653654519831993, 'samples': 17716032, 'steps': 92270, 'loss/train': 1.2500591278076172} 11/07/2021 10:13:35 - INFO - __main__ - Step 92272: {'lr': 0.00016536045860710236, 'samples': 17716224, 'steps': 92271, 'loss/train': 1.2931023836135864} 11/07/2021 10:13:36 - INFO - __main__ - Step 92273: {'lr': 0.00016535546526914274, 'samples': 17716416, 'steps': 92272, 'loss/train': 1.4418845176696777} 11/07/2021 10:13:36 - INFO - __main__ - Step 92274: {'lr': 0.00016535047196932257, 'samples': 17716608, 'steps': 92273, 'loss/train': 1.5324323177337646} 11/07/2021 10:13:37 - INFO - __main__ - Step 92275: {'lr': 0.00016534547870764423, 'samples': 17716800, 'steps': 92274, 'loss/train': 1.0491214990615845} 11/07/2021 10:13:38 - INFO - __main__ - Step 92276: {'lr': 0.0001653404854841099, 'samples': 17716992, 'steps': 92275, 'loss/train': 1.551565170288086} 11/07/2021 10:13:38 - INFO - __main__ - Step 92277: {'lr': 0.0001653354922987218, 'samples': 17717184, 'steps': 92276, 'loss/train': 0.8192396759986877} 11/07/2021 10:13:38 - INFO - __main__ - Step 92278: {'lr': 0.00016533049915148224, 'samples': 17717376, 'steps': 92277, 'loss/train': 0.14162449538707733} 11/07/2021 10:13:39 - INFO - __main__ - Step 92279: {'lr': 0.00016532550604239345, 'samples': 17717568, 'steps': 92278, 'loss/train': 1.7009376287460327} 11/07/2021 10:13:39 - INFO - __main__ - Step 92280: {'lr': 0.00016532051297145768, 'samples': 17717760, 'steps': 92279, 'loss/train': 1.4512956142425537} 11/07/2021 10:13:40 - INFO - __main__ - Step 92281: {'lr': 0.00016531551993867715, 'samples': 17717952, 'steps': 92280, 'loss/train': 1.1432619094848633} 11/07/2021 10:13:40 - INFO - __main__ - Step 92282: {'lr': 0.00016531052694405417, 'samples': 17718144, 'steps': 92281, 'loss/train': 1.6057041883468628} 11/07/2021 10:13:41 - INFO - __main__ - Step 92283: {'lr': 0.00016530553398759097, 'samples': 17718336, 'steps': 92282, 'loss/train': 1.2269879579544067} 11/07/2021 10:13:41 - INFO - __main__ - Step 92284: {'lr': 0.00016530054106928983, 'samples': 17718528, 'steps': 92283, 'loss/train': 1.5146492719650269} 11/07/2021 10:13:42 - INFO - __main__ - Step 92285: {'lr': 0.00016529554818915288, 'samples': 17718720, 'steps': 92284, 'loss/train': 1.6394001245498657} 11/07/2021 10:13:43 - INFO - __main__ - Step 92286: {'lr': 0.00016529055534718248, 'samples': 17718912, 'steps': 92285, 'loss/train': 1.1594942808151245} 11/07/2021 10:13:43 - INFO - __main__ - Step 92287: {'lr': 0.00016528556254338084, 'samples': 17719104, 'steps': 92286, 'loss/train': 1.1515402793884277} 11/07/2021 10:13:43 - INFO - __main__ - Step 92288: {'lr': 0.00016528056977775023, 'samples': 17719296, 'steps': 92287, 'loss/train': 1.6390122175216675} 11/07/2021 10:13:44 - INFO - __main__ - Step 92289: {'lr': 0.00016527557705029288, 'samples': 17719488, 'steps': 92288, 'loss/train': 1.4803917407989502} 11/07/2021 10:13:44 - INFO - __main__ - Step 92290: {'lr': 0.00016527058436101107, 'samples': 17719680, 'steps': 92289, 'loss/train': 5.704256534576416} 11/07/2021 10:13:44 - INFO - __main__ - Step 92291: {'lr': 0.000165265591709907, 'samples': 17719872, 'steps': 92290, 'loss/train': 1.240840196609497} 11/07/2021 10:13:46 - INFO - __main__ - Step 92292: {'lr': 0.00016526059909698296, 'samples': 17720064, 'steps': 92291, 'loss/train': 1.140265941619873} 11/07/2021 10:13:46 - INFO - __main__ - Step 92293: {'lr': 0.0001652556065222412, 'samples': 17720256, 'steps': 92292, 'loss/train': 1.2274378538131714} 11/07/2021 10:13:46 - INFO - __main__ - Step 92294: {'lr': 0.00016525061398568391, 'samples': 17720448, 'steps': 92293, 'loss/train': 1.5006662607192993} 11/07/2021 10:13:47 - INFO - __main__ - Step 92295: {'lr': 0.00016524562148731347, 'samples': 17720640, 'steps': 92294, 'loss/train': 0.9684673547744751} 11/07/2021 10:13:47 - INFO - __main__ - Step 92296: {'lr': 0.00016524062902713196, 'samples': 17720832, 'steps': 92295, 'loss/train': 1.3109899759292603} 11/07/2021 10:13:48 - INFO - __main__ - Step 92297: {'lr': 0.00016523563660514174, 'samples': 17721024, 'steps': 92296, 'loss/train': 0.9250585436820984} 11/07/2021 10:13:48 - INFO - __main__ - Step 92298: {'lr': 0.00016523064422134504, 'samples': 17721216, 'steps': 92297, 'loss/train': 1.4777281284332275} 11/07/2021 10:13:49 - INFO - __main__ - Step 92299: {'lr': 0.0001652256518757441, 'samples': 17721408, 'steps': 92298, 'loss/train': 1.2965883016586304} 11/07/2021 10:13:49 - INFO - __main__ - Step 92300: {'lr': 0.00016522065956834115, 'samples': 17721600, 'steps': 92299, 'loss/train': 1.4314666986465454} 11/07/2021 10:13:49 - INFO - __main__ - Step 92301: {'lr': 0.0001652156672991385, 'samples': 17721792, 'steps': 92300, 'loss/train': 1.5125066041946411} 11/07/2021 10:13:51 - INFO - __main__ - Step 92302: {'lr': 0.00016521067506813832, 'samples': 17721984, 'steps': 92301, 'loss/train': 0.8272976875305176} 11/07/2021 10:13:51 - INFO - __main__ - Step 92303: {'lr': 0.000165205682875343, 'samples': 17722176, 'steps': 92302, 'loss/train': 1.5545481443405151} 11/07/2021 10:13:51 - INFO - __main__ - Step 92304: {'lr': 0.0001652006907207546, 'samples': 17722368, 'steps': 92303, 'loss/train': 1.3758033514022827} 11/07/2021 10:13:52 - INFO - __main__ - Step 92305: {'lr': 0.00016519569860437547, 'samples': 17722560, 'steps': 92304, 'loss/train': 1.029343843460083} 11/07/2021 10:13:52 - INFO - __main__ - Step 92306: {'lr': 0.0001651907065262079, 'samples': 17722752, 'steps': 92305, 'loss/train': 1.584701418876648} 11/07/2021 10:13:53 - INFO - __main__ - Step 92307: {'lr': 0.00016518571448625405, 'samples': 17722944, 'steps': 92306, 'loss/train': 1.4102747440338135} 11/07/2021 10:13:53 - INFO - __main__ - Step 92308: {'lr': 0.0001651807224845162, 'samples': 17723136, 'steps': 92307, 'loss/train': 1.2972239255905151} 11/07/2021 10:13:54 - INFO - __main__ - Step 92309: {'lr': 0.0001651757305209966, 'samples': 17723328, 'steps': 92308, 'loss/train': 1.3432931900024414} 11/07/2021 10:13:54 - INFO - __main__ - Step 92310: {'lr': 0.00016517073859569753, 'samples': 17723520, 'steps': 92309, 'loss/train': 1.537320852279663} 11/07/2021 10:13:54 - INFO - __main__ - Step 92311: {'lr': 0.0001651657467086212, 'samples': 17723712, 'steps': 92310, 'loss/train': 1.128462314605713} 11/07/2021 10:13:55 - INFO - __main__ - Step 92312: {'lr': 0.0001651607548597699, 'samples': 17723904, 'steps': 92311, 'loss/train': 1.395520567893982} 11/07/2021 10:13:56 - INFO - __main__ - Step 92313: {'lr': 0.00016515576304914581, 'samples': 17724096, 'steps': 92312, 'loss/train': 1.5314165353775024} 11/07/2021 10:13:56 - INFO - __main__ - Step 92314: {'lr': 0.00016515077127675124, 'samples': 17724288, 'steps': 92313, 'loss/train': 2.4677491188049316} 11/07/2021 10:13:56 - INFO - __main__ - Step 92315: {'lr': 0.00016514577954258842, 'samples': 17724480, 'steps': 92314, 'loss/train': 0.5786054730415344} 11/07/2021 10:13:57 - INFO - __main__ - Step 92316: {'lr': 0.0001651407878466596, 'samples': 17724672, 'steps': 92315, 'loss/train': 1.0448167324066162} 11/07/2021 10:13:58 - INFO - __main__ - Step 92317: {'lr': 0.00016513579618896717, 'samples': 17724864, 'steps': 92316, 'loss/train': 1.7216253280639648} 11/07/2021 10:13:58 - INFO - __main__ - Step 92318: {'lr': 0.00016513080456951313, 'samples': 17725056, 'steps': 92317, 'loss/train': 1.4898449182510376} 11/07/2021 10:13:59 - INFO - __main__ - Step 92319: {'lr': 0.00016512581298829982, 'samples': 17725248, 'steps': 92318, 'loss/train': 1.1144697666168213} 11/07/2021 10:13:59 - INFO - __main__ - Step 92320: {'lr': 0.0001651208214453295, 'samples': 17725440, 'steps': 92319, 'loss/train': 1.1547166109085083} 11/07/2021 10:13:59 - INFO - __main__ - Step 92321: {'lr': 0.00016511582994060443, 'samples': 17725632, 'steps': 92320, 'loss/train': 1.1388039588928223} 11/07/2021 10:14:00 - INFO - __main__ - Step 92322: {'lr': 0.00016511083847412688, 'samples': 17725824, 'steps': 92321, 'loss/train': 1.3677726984024048} 11/07/2021 10:14:01 - INFO - __main__ - Step 92323: {'lr': 0.00016510584704589908, 'samples': 17726016, 'steps': 92322, 'loss/train': 1.4949018955230713} 11/07/2021 10:14:01 - INFO - __main__ - Step 92324: {'lr': 0.00016510085565592326, 'samples': 17726208, 'steps': 92323, 'loss/train': 1.2895710468292236} 11/07/2021 10:14:01 - INFO - __main__ - Step 92325: {'lr': 0.00016509586430420164, 'samples': 17726400, 'steps': 92324, 'loss/train': 1.5475112199783325} 11/07/2021 10:14:02 - INFO - __main__ - Step 92326: {'lr': 0.0001650908729907366, 'samples': 17726592, 'steps': 92325, 'loss/train': 1.2026300430297852} 11/07/2021 10:14:02 - INFO - __main__ - Step 92327: {'lr': 0.00016508588171553024, 'samples': 17726784, 'steps': 92326, 'loss/train': 1.3629536628723145} 11/07/2021 10:14:03 - INFO - __main__ - Step 92328: {'lr': 0.00016508089047858487, 'samples': 17726976, 'steps': 92327, 'loss/train': 1.4571216106414795} 11/07/2021 10:14:03 - INFO - __main__ - Step 92329: {'lr': 0.0001650758992799028, 'samples': 17727168, 'steps': 92328, 'loss/train': 1.340246558189392} 11/07/2021 10:14:04 - INFO - __main__ - Step 92330: {'lr': 0.00016507090811948628, 'samples': 17727360, 'steps': 92329, 'loss/train': 1.5709489583969116} 11/07/2021 10:14:04 - INFO - __main__ - Step 92331: {'lr': 0.00016506591699733738, 'samples': 17727552, 'steps': 92330, 'loss/train': 0.8507043719291687} 11/07/2021 10:14:04 - INFO - __main__ - Step 92332: {'lr': 0.0001650609259134585, 'samples': 17727744, 'steps': 92331, 'loss/train': 1.1811835765838623} 11/07/2021 10:14:05 - INFO - __main__ - Step 92333: {'lr': 0.00016505593486785183, 'samples': 17727936, 'steps': 92332, 'loss/train': 1.4410115480422974} 11/07/2021 10:14:06 - INFO - __main__ - Step 92334: {'lr': 0.00016505094386051966, 'samples': 17728128, 'steps': 92333, 'loss/train': 1.458396077156067} 11/07/2021 10:14:06 - INFO - __main__ - Step 92335: {'lr': 0.00016504595289146422, 'samples': 17728320, 'steps': 92334, 'loss/train': 1.5832146406173706} 11/07/2021 10:14:07 - INFO - __main__ - Step 92336: {'lr': 0.00016504096196068776, 'samples': 17728512, 'steps': 92335, 'loss/train': 1.0330393314361572} 11/07/2021 10:14:07 - INFO - __main__ - Step 92337: {'lr': 0.00016503597106819255, 'samples': 17728704, 'steps': 92336, 'loss/train': 0.09111805260181427} 11/07/2021 10:14:08 - INFO - __main__ - Step 92338: {'lr': 0.0001650309802139808, 'samples': 17728896, 'steps': 92337, 'loss/train': 1.063728928565979} 11/07/2021 10:14:08 - INFO - __main__ - Step 92339: {'lr': 0.0001650259893980548, 'samples': 17729088, 'steps': 92338, 'loss/train': 1.3729604482650757} 11/07/2021 10:14:08 - INFO - __main__ - Step 92340: {'lr': 0.00016502099862041676, 'samples': 17729280, 'steps': 92339, 'loss/train': 1.2289997339248657} 11/07/2021 10:14:09 - INFO - __main__ - Step 92341: {'lr': 0.00016501600788106893, 'samples': 17729472, 'steps': 92340, 'loss/train': 1.102745771408081} 11/07/2021 10:14:09 - INFO - __main__ - Step 92342: {'lr': 0.0001650110171800136, 'samples': 17729664, 'steps': 92341, 'loss/train': 1.110007643699646} 11/07/2021 10:14:10 - INFO - __main__ - Step 92343: {'lr': 0.000165006026517253, 'samples': 17729856, 'steps': 92342, 'loss/train': 1.3689398765563965} 11/07/2021 10:14:11 - INFO - __main__ - Step 92344: {'lr': 0.00016500103589278946, 'samples': 17730048, 'steps': 92343, 'loss/train': 1.5997068881988525} 11/07/2021 10:14:11 - INFO - __main__ - Step 92345: {'lr': 0.00016499604530662503, 'samples': 17730240, 'steps': 92344, 'loss/train': 1.3816739320755005} 11/07/2021 10:14:11 - INFO - __main__ - Step 92346: {'lr': 0.00016499105475876208, 'samples': 17730432, 'steps': 92345, 'loss/train': 1.4632225036621094} 11/07/2021 10:14:12 - INFO - __main__ - Step 92347: {'lr': 0.00016498606424920288, 'samples': 17730624, 'steps': 92346, 'loss/train': 0.8161612153053284} 11/07/2021 10:14:13 - INFO - __main__ - Step 92348: {'lr': 0.0001649810737779496, 'samples': 17730816, 'steps': 92347, 'loss/train': 1.6584546566009521} 11/07/2021 10:14:13 - INFO - __main__ - Step 92349: {'lr': 0.0001649760833450046, 'samples': 17731008, 'steps': 92348, 'loss/train': 1.5302263498306274} 11/07/2021 10:14:13 - INFO - __main__ - Step 92350: {'lr': 0.00016497109295037, 'samples': 17731200, 'steps': 92349, 'loss/train': 1.4324556589126587} 11/07/2021 10:14:14 - INFO - __main__ - Step 92351: {'lr': 0.0001649661025940481, 'samples': 17731392, 'steps': 92350, 'loss/train': 1.2318443059921265} 11/07/2021 10:14:14 - INFO - __main__ - Step 92352: {'lr': 0.0001649611122760412, 'samples': 17731584, 'steps': 92351, 'loss/train': 1.8815349340438843} 11/07/2021 10:14:15 - INFO - __main__ - Step 92353: {'lr': 0.0001649561219963515, 'samples': 17731776, 'steps': 92352, 'loss/train': 1.119502305984497} 11/07/2021 10:14:16 - INFO - __main__ - Step 92354: {'lr': 0.0001649511317549813, 'samples': 17731968, 'steps': 92353, 'loss/train': 1.4137024879455566} 11/07/2021 10:14:16 - INFO - __main__ - Step 92355: {'lr': 0.00016494614155193276, 'samples': 17732160, 'steps': 92354, 'loss/train': 1.2783076763153076} 11/07/2021 10:14:16 - INFO - __main__ - Step 92356: {'lr': 0.00016494115138720818, 'samples': 17732352, 'steps': 92355, 'loss/train': 1.2995256185531616} 11/07/2021 10:14:17 - INFO - __main__ - Step 92357: {'lr': 0.00016493616126080993, 'samples': 17732544, 'steps': 92356, 'loss/train': 1.2445465326309204} 11/07/2021 10:14:18 - INFO - __main__ - Step 92358: {'lr': 0.00016493117117274004, 'samples': 17732736, 'steps': 92357, 'loss/train': 1.4685450792312622} 11/07/2021 10:14:18 - INFO - __main__ - Step 92359: {'lr': 0.00016492618112300082, 'samples': 17732928, 'steps': 92358, 'loss/train': 1.6652647256851196} 11/07/2021 10:14:18 - INFO - __main__ - Step 92360: {'lr': 0.00016492119111159454, 'samples': 17733120, 'steps': 92359, 'loss/train': 1.518444538116455} 11/07/2021 10:14:19 - INFO - __main__ - Step 92361: {'lr': 0.00016491620113852348, 'samples': 17733312, 'steps': 92360, 'loss/train': 1.6520750522613525} 11/07/2021 10:14:19 - INFO - __main__ - Step 92362: {'lr': 0.00016491121120378987, 'samples': 17733504, 'steps': 92361, 'loss/train': 1.1977603435516357} 11/07/2021 10:14:19 - INFO - __main__ - Step 92363: {'lr': 0.00016490622130739598, 'samples': 17733696, 'steps': 92362, 'loss/train': 1.346719741821289} 11/07/2021 10:14:20 - INFO - __main__ - Step 92364: {'lr': 0.000164901231449344, 'samples': 17733888, 'steps': 92363, 'loss/train': 1.4700109958648682} 11/07/2021 10:14:21 - INFO - __main__ - Step 92365: {'lr': 0.00016489624162963618, 'samples': 17734080, 'steps': 92364, 'loss/train': 1.447077989578247} 11/07/2021 10:14:21 - INFO - __main__ - Step 92366: {'lr': 0.00016489125184827486, 'samples': 17734272, 'steps': 92365, 'loss/train': 1.2992103099822998} 11/07/2021 10:14:21 - INFO - __main__ - Step 92367: {'lr': 0.00016488626210526218, 'samples': 17734464, 'steps': 92366, 'loss/train': 1.815798282623291} 11/07/2021 10:14:22 - INFO - __main__ - Step 92368: {'lr': 0.00016488127240060047, 'samples': 17734656, 'steps': 92367, 'loss/train': 1.3563541173934937} 11/07/2021 10:14:23 - INFO - __main__ - Step 92369: {'lr': 0.00016487628273429195, 'samples': 17734848, 'steps': 92368, 'loss/train': 1.5400097370147705} 11/07/2021 10:14:23 - INFO - __main__ - Step 92370: {'lr': 0.00016487129310633887, 'samples': 17735040, 'steps': 92369, 'loss/train': 1.036948561668396} 11/07/2021 10:14:24 - INFO - __main__ - Step 92371: {'lr': 0.00016486630351674353, 'samples': 17735232, 'steps': 92370, 'loss/train': 2.4399569034576416} 11/07/2021 10:14:24 - INFO - __main__ - Step 92372: {'lr': 0.00016486131396550803, 'samples': 17735424, 'steps': 92371, 'loss/train': 1.1764767169952393} 11/07/2021 10:14:24 - INFO - __main__ - Step 92373: {'lr': 0.00016485632445263472, 'samples': 17735616, 'steps': 92372, 'loss/train': 1.3192720413208008} 11/07/2021 10:14:25 - INFO - __main__ - Step 92374: {'lr': 0.00016485133497812584, 'samples': 17735808, 'steps': 92373, 'loss/train': 1.4145492315292358} 11/07/2021 10:14:26 - INFO - __main__ - Step 92375: {'lr': 0.00016484634554198363, 'samples': 17736000, 'steps': 92374, 'loss/train': 1.4631788730621338} 11/07/2021 10:14:26 - INFO - __main__ - Step 92376: {'lr': 0.00016484135614421036, 'samples': 17736192, 'steps': 92375, 'loss/train': 1.4870346784591675} 11/07/2021 10:14:26 - INFO - __main__ - Step 92377: {'lr': 0.00016483636678480825, 'samples': 17736384, 'steps': 92376, 'loss/train': 1.3578836917877197} 11/07/2021 10:14:27 - INFO - __main__ - Step 92378: {'lr': 0.00016483137746377952, 'samples': 17736576, 'steps': 92377, 'loss/train': 1.1609461307525635} 11/07/2021 10:14:28 - INFO - __main__ - Step 92379: {'lr': 0.0001648263881811265, 'samples': 17736768, 'steps': 92378, 'loss/train': 1.9044667482376099} 11/07/2021 10:14:28 - INFO - __main__ - Step 92380: {'lr': 0.00016482139893685138, 'samples': 17736960, 'steps': 92379, 'loss/train': 1.198578953742981} 11/07/2021 10:14:28 - INFO - __main__ - Step 92381: {'lr': 0.00016481640973095647, 'samples': 17737152, 'steps': 92380, 'loss/train': 1.08818781375885} 11/07/2021 10:14:29 - INFO - __main__ - Step 92382: {'lr': 0.00016481142056344388, 'samples': 17737344, 'steps': 92381, 'loss/train': 1.0124398469924927} 11/07/2021 10:14:29 - INFO - __main__ - Step 92383: {'lr': 0.00016480643143431601, 'samples': 17737536, 'steps': 92382, 'loss/train': 1.740165114402771} 11/07/2021 10:14:30 - INFO - __main__ - Step 92384: {'lr': 0.00016480144234357514, 'samples': 17737728, 'steps': 92383, 'loss/train': 1.3596529960632324} 11/07/2021 10:14:31 - INFO - __main__ - Step 92385: {'lr': 0.00016479645329122334, 'samples': 17737920, 'steps': 92384, 'loss/train': 1.1651382446289062} 11/07/2021 10:14:31 - INFO - __main__ - Step 92386: {'lr': 0.00016479146427726294, 'samples': 17738112, 'steps': 92385, 'loss/train': 0.6999941468238831} 11/07/2021 10:14:31 - INFO - __main__ - Step 92387: {'lr': 0.00016478647530169616, 'samples': 17738304, 'steps': 92386, 'loss/train': 1.1554617881774902} 11/07/2021 10:14:32 - INFO - __main__ - Step 92388: {'lr': 0.00016478148636452528, 'samples': 17738496, 'steps': 92387, 'loss/train': 1.4149686098098755} 11/07/2021 10:14:33 - INFO - __main__ - Step 92389: {'lr': 0.00016477649746575256, 'samples': 17738688, 'steps': 92388, 'loss/train': 1.3493541479110718} 11/07/2021 10:14:33 - INFO - __main__ - Step 92390: {'lr': 0.00016477150860538025, 'samples': 17738880, 'steps': 92389, 'loss/train': 1.434588074684143} 11/07/2021 10:14:33 - INFO - __main__ - Step 92391: {'lr': 0.00016476651978341057, 'samples': 17739072, 'steps': 92390, 'loss/train': 1.1587297916412354} 11/07/2021 10:14:34 - INFO - __main__ - Step 92392: {'lr': 0.00016476153099984582, 'samples': 17739264, 'steps': 92391, 'loss/train': 1.2528263330459595} 11/07/2021 10:14:34 - INFO - __main__ - Step 92393: {'lr': 0.00016475654225468815, 'samples': 17739456, 'steps': 92392, 'loss/train': 1.4467287063598633} 11/07/2021 10:14:35 - INFO - __main__ - Step 92394: {'lr': 0.0001647515535479399, 'samples': 17739648, 'steps': 92393, 'loss/train': 1.9479113817214966} 11/07/2021 10:14:35 - INFO - __main__ - Step 92395: {'lr': 0.00016474656487960326, 'samples': 17739840, 'steps': 92394, 'loss/train': 1.5651211738586426} 11/07/2021 10:14:36 - INFO - __main__ - Step 92396: {'lr': 0.0001647415762496805, 'samples': 17740032, 'steps': 92395, 'loss/train': 1.2916351556777954} 11/07/2021 10:14:36 - INFO - __main__ - Step 92397: {'lr': 0.000164736587658174, 'samples': 17740224, 'steps': 92396, 'loss/train': 1.3757356405258179} 11/07/2021 10:14:37 - INFO - __main__ - Step 92398: {'lr': 0.0001647315991050858, 'samples': 17740416, 'steps': 92397, 'loss/train': 1.1401844024658203} 11/07/2021 10:14:37 - INFO - __main__ - Step 92399: {'lr': 0.00016472661059041815, 'samples': 17740608, 'steps': 92398, 'loss/train': 1.376753330230713} 11/07/2021 10:14:38 - INFO - __main__ - Step 92400: {'lr': 0.0001647216221141734, 'samples': 17740800, 'steps': 92399, 'loss/train': 1.5785801410675049} 11/07/2021 10:14:38 - INFO - __main__ - Step 92401: {'lr': 0.00016471663367635382, 'samples': 17740992, 'steps': 92400, 'loss/train': 1.657957673072815} 11/07/2021 10:14:39 - INFO - __main__ - Step 92402: {'lr': 0.00016471164527696156, 'samples': 17741184, 'steps': 92401, 'loss/train': 1.4065210819244385} 11/07/2021 10:14:39 - INFO - __main__ - Step 92403: {'lr': 0.00016470665691599892, 'samples': 17741376, 'steps': 92402, 'loss/train': 1.0853384733200073} 11/07/2021 10:14:39 - INFO - __main__ - Step 92404: {'lr': 0.00016470166859346814, 'samples': 17741568, 'steps': 92403, 'loss/train': 0.5279557704925537} 11/07/2021 10:14:40 - INFO - __main__ - Step 92405: {'lr': 0.0001646966803093715, 'samples': 17741760, 'steps': 92404, 'loss/train': 1.60062837600708} 11/07/2021 10:14:41 - INFO - __main__ - Step 92406: {'lr': 0.0001646916920637112, 'samples': 17741952, 'steps': 92405, 'loss/train': 1.603124976158142} 11/07/2021 10:14:41 - INFO - __main__ - Step 92407: {'lr': 0.00016468670385648952, 'samples': 17742144, 'steps': 92406, 'loss/train': 1.6330853700637817} 11/07/2021 10:14:41 - INFO - __main__ - Step 92408: {'lr': 0.00016468171568770874, 'samples': 17742336, 'steps': 92407, 'loss/train': 1.370489478111267} 11/07/2021 10:14:42 - INFO - __main__ - Step 92409: {'lr': 0.000164676727557371, 'samples': 17742528, 'steps': 92408, 'loss/train': 1.4412040710449219} 11/07/2021 10:14:43 - INFO - __main__ - Step 92410: {'lr': 0.00016467173946547865, 'samples': 17742720, 'steps': 92409, 'loss/train': 1.168926477432251} 11/07/2021 10:14:43 - INFO - __main__ - Step 92411: {'lr': 0.0001646667514120339, 'samples': 17742912, 'steps': 92410, 'loss/train': 1.3342344760894775} 11/07/2021 10:14:43 - INFO - __main__ - Step 92412: {'lr': 0.00016466176339703894, 'samples': 17743104, 'steps': 92411, 'loss/train': 1.7208092212677002} 11/07/2021 10:14:44 - INFO - __main__ - Step 92413: {'lr': 0.00016465677542049613, 'samples': 17743296, 'steps': 92412, 'loss/train': 1.2330721616744995} 11/07/2021 10:14:44 - INFO - __main__ - Step 92414: {'lr': 0.0001646517874824076, 'samples': 17743488, 'steps': 92413, 'loss/train': 1.4965553283691406} 11/07/2021 10:14:45 - INFO - __main__ - Step 92415: {'lr': 0.00016464679958277568, 'samples': 17743680, 'steps': 92414, 'loss/train': 1.1678916215896606} 11/07/2021 10:14:46 - INFO - __main__ - Step 92416: {'lr': 0.0001646418117216026, 'samples': 17743872, 'steps': 92415, 'loss/train': 1.9474903345108032} 11/07/2021 10:14:46 - INFO - __main__ - Step 92417: {'lr': 0.00016463682389889059, 'samples': 17744064, 'steps': 92416, 'loss/train': 1.68498694896698} 11/07/2021 10:14:46 - INFO - __main__ - Step 92418: {'lr': 0.00016463183611464195, 'samples': 17744256, 'steps': 92417, 'loss/train': 2.0106983184814453} 11/07/2021 10:14:47 - INFO - __main__ - Step 92419: {'lr': 0.00016462684836885888, 'samples': 17744448, 'steps': 92418, 'loss/train': 1.7548986673355103} 11/07/2021 10:14:47 - INFO - __main__ - Step 92420: {'lr': 0.0001646218606615436, 'samples': 17744640, 'steps': 92419, 'loss/train': 0.9282254576683044} 11/07/2021 10:14:48 - INFO - __main__ - Step 92421: {'lr': 0.00016461687299269842, 'samples': 17744832, 'steps': 92420, 'loss/train': 1.2268790006637573} 11/07/2021 10:14:48 - INFO - __main__ - Step 92422: {'lr': 0.00016461188536232555, 'samples': 17745024, 'steps': 92421, 'loss/train': 1.7485084533691406} 11/07/2021 10:14:49 - INFO - __main__ - Step 92423: {'lr': 0.00016460689777042723, 'samples': 17745216, 'steps': 92422, 'loss/train': 1.6971263885498047} 11/07/2021 10:14:49 - INFO - __main__ - Step 92424: {'lr': 0.00016460191021700578, 'samples': 17745408, 'steps': 92423, 'loss/train': 1.3012244701385498} 11/07/2021 10:14:49 - INFO - __main__ - Step 92425: {'lr': 0.00016459692270206334, 'samples': 17745600, 'steps': 92424, 'loss/train': 1.2942795753479004} 11/07/2021 10:14:50 - INFO - __main__ - Step 92426: {'lr': 0.00016459193522560224, 'samples': 17745792, 'steps': 92425, 'loss/train': 1.891684889793396} 11/07/2021 10:14:51 - INFO - __main__ - Step 92427: {'lr': 0.00016458694778762468, 'samples': 17745984, 'steps': 92426, 'loss/train': 1.5454492568969727} 11/07/2021 10:14:51 - INFO - __main__ - Step 92428: {'lr': 0.0001645819603881329, 'samples': 17746176, 'steps': 92427, 'loss/train': 1.4607973098754883} 11/07/2021 10:14:51 - INFO - __main__ - Step 92429: {'lr': 0.00016457697302712918, 'samples': 17746368, 'steps': 92428, 'loss/train': 1.8193773031234741} 11/07/2021 10:14:52 - INFO - __main__ - Step 92430: {'lr': 0.0001645719857046158, 'samples': 17746560, 'steps': 92429, 'loss/train': 1.518210530281067} 11/07/2021 10:14:53 - INFO - __main__ - Step 92431: {'lr': 0.00016456699842059492, 'samples': 17746752, 'steps': 92430, 'loss/train': 1.186759352684021} 11/07/2021 10:14:53 - INFO - __main__ - Step 92432: {'lr': 0.00016456201117506886, 'samples': 17746944, 'steps': 92431, 'loss/train': 1.0234566926956177} 11/07/2021 10:14:53 - INFO - __main__ - Step 92433: {'lr': 0.0001645570239680398, 'samples': 17747136, 'steps': 92432, 'loss/train': 1.4803746938705444} 11/07/2021 10:14:54 - INFO - __main__ - Step 92434: {'lr': 0.00016455203679951005, 'samples': 17747328, 'steps': 92433, 'loss/train': 1.3241854906082153} 11/07/2021 10:14:54 - INFO - __main__ - Step 92435: {'lr': 0.00016454704966948185, 'samples': 17747520, 'steps': 92434, 'loss/train': 1.3142449855804443} 11/07/2021 10:14:55 - INFO - __main__ - Step 92436: {'lr': 0.0001645420625779574, 'samples': 17747712, 'steps': 92435, 'loss/train': 1.1681838035583496} 11/07/2021 10:14:56 - INFO - __main__ - Step 92437: {'lr': 0.00016453707552493895, 'samples': 17747904, 'steps': 92436, 'loss/train': 1.518028736114502} 11/07/2021 10:14:56 - INFO - __main__ - Step 92438: {'lr': 0.0001645320885104289, 'samples': 17748096, 'steps': 92437, 'loss/train': 0.8928558230400085} 11/07/2021 10:14:56 - INFO - __main__ - Step 92439: {'lr': 0.00016452710153442928, 'samples': 17748288, 'steps': 92438, 'loss/train': 1.4830557107925415} 11/07/2021 10:14:57 - INFO - __main__ - Step 92440: {'lr': 0.00016452211459694243, 'samples': 17748480, 'steps': 92439, 'loss/train': 1.610959529876709} 11/07/2021 10:14:58 - INFO - __main__ - Step 92441: {'lr': 0.00016451712769797067, 'samples': 17748672, 'steps': 92440, 'loss/train': 1.2066062688827515} 11/07/2021 10:14:58 - INFO - __main__ - Step 92442: {'lr': 0.0001645121408375161, 'samples': 17748864, 'steps': 92441, 'loss/train': 1.4849915504455566} 11/07/2021 10:14:59 - INFO - __main__ - Step 92443: {'lr': 0.00016450715401558104, 'samples': 17749056, 'steps': 92442, 'loss/train': 1.3197544813156128} 11/07/2021 10:14:59 - INFO - __main__ - Step 92444: {'lr': 0.00016450216723216775, 'samples': 17749248, 'steps': 92443, 'loss/train': 1.1765373945236206} 11/07/2021 10:14:59 - INFO - __main__ - Step 92445: {'lr': 0.00016449718048727844, 'samples': 17749440, 'steps': 92444, 'loss/train': 0.10436849296092987} 11/07/2021 10:15:00 - INFO - __main__ - Step 92446: {'lr': 0.0001644921937809154, 'samples': 17749632, 'steps': 92445, 'loss/train': 1.2300305366516113} 11/07/2021 10:15:01 - INFO - __main__ - Step 92447: {'lr': 0.00016448720711308086, 'samples': 17749824, 'steps': 92446, 'loss/train': 1.2752968072891235} 11/07/2021 10:15:01 - INFO - __main__ - Step 92448: {'lr': 0.00016448222048377704, 'samples': 17750016, 'steps': 92447, 'loss/train': 2.036445379257202} 11/07/2021 10:15:01 - INFO - __main__ - Step 92449: {'lr': 0.00016447723389300623, 'samples': 17750208, 'steps': 92448, 'loss/train': 1.2738795280456543} 11/07/2021 10:15:02 - INFO - __main__ - Step 92450: {'lr': 0.00016447224734077065, 'samples': 17750400, 'steps': 92449, 'loss/train': 1.4009708166122437} 11/07/2021 10:15:02 - INFO - __main__ - Step 92451: {'lr': 0.0001644672608270727, 'samples': 17750592, 'steps': 92450, 'loss/train': 1.4731484651565552} 11/07/2021 10:15:03 - INFO - __main__ - Step 92452: {'lr': 0.00016446227435191433, 'samples': 17750784, 'steps': 92451, 'loss/train': 1.4990347623825073} 11/07/2021 10:15:03 - INFO - __main__ - Step 92453: {'lr': 0.00016445728791529795, 'samples': 17750976, 'steps': 92452, 'loss/train': 1.1823676824569702} 11/07/2021 10:15:04 - INFO - __main__ - Step 92454: {'lr': 0.00016445230151722578, 'samples': 17751168, 'steps': 92453, 'loss/train': 1.218752145767212} 11/07/2021 10:15:04 - INFO - __main__ - Step 92455: {'lr': 0.00016444731515770011, 'samples': 17751360, 'steps': 92454, 'loss/train': 0.8201834559440613} 11/07/2021 10:15:05 - INFO - __main__ - Step 92456: {'lr': 0.00016444232883672317, 'samples': 17751552, 'steps': 92455, 'loss/train': 2.340329885482788} 11/07/2021 10:15:06 - INFO - __main__ - Step 92457: {'lr': 0.00016443734255429718, 'samples': 17751744, 'steps': 92456, 'loss/train': 1.2150710821151733} 11/07/2021 10:15:06 - INFO - __main__ - Step 92458: {'lr': 0.00016443235631042442, 'samples': 17751936, 'steps': 92457, 'loss/train': 1.4331324100494385} 11/07/2021 10:15:07 - INFO - __main__ - Step 92459: {'lr': 0.0001644273701051071, 'samples': 17752128, 'steps': 92458, 'loss/train': 1.294045090675354} 11/07/2021 10:15:07 - INFO - __main__ - Step 92460: {'lr': 0.00016442238393834746, 'samples': 17752320, 'steps': 92459, 'loss/train': 1.5078010559082031} 11/07/2021 10:15:07 - INFO - __main__ - Step 92461: {'lr': 0.00016441739781014784, 'samples': 17752512, 'steps': 92460, 'loss/train': 1.8301576375961304} 11/07/2021 10:15:09 - INFO - __main__ - Step 92462: {'lr': 0.00016441241172051037, 'samples': 17752704, 'steps': 92461, 'loss/train': 1.0971654653549194} 11/07/2021 10:15:09 - INFO - __main__ - Step 92463: {'lr': 0.00016440742566943737, 'samples': 17752896, 'steps': 92462, 'loss/train': 1.443315029144287} 11/07/2021 10:15:09 - INFO - __main__ - Step 92464: {'lr': 0.00016440243965693105, 'samples': 17753088, 'steps': 92463, 'loss/train': 1.5558788776397705} 11/07/2021 10:15:10 - INFO - __main__ - Step 92465: {'lr': 0.00016439745368299378, 'samples': 17753280, 'steps': 92464, 'loss/train': 1.098452091217041} 11/07/2021 10:15:10 - INFO - __main__ - Step 92466: {'lr': 0.0001643924677476276, 'samples': 17753472, 'steps': 92465, 'loss/train': 0.9036141633987427} 11/07/2021 10:15:10 - INFO - __main__ - Step 92467: {'lr': 0.00016438748185083484, 'samples': 17753664, 'steps': 92466, 'loss/train': 1.9251019954681396} 11/07/2021 10:15:12 - INFO - __main__ - Step 92468: {'lr': 0.00016438249599261772, 'samples': 17753856, 'steps': 92467, 'loss/train': 0.0812501311302185} 11/07/2021 10:15:12 - INFO - __main__ - Step 92469: {'lr': 0.00016437751017297857, 'samples': 17754048, 'steps': 92468, 'loss/train': 0.9289027452468872} 11/07/2021 10:15:12 - INFO - __main__ - Step 92470: {'lr': 0.00016437252439191962, 'samples': 17754240, 'steps': 92469, 'loss/train': 0.9304125308990479} 11/07/2021 10:15:13 - INFO - __main__ - Step 92471: {'lr': 0.00016436753864944304, 'samples': 17754432, 'steps': 92470, 'loss/train': 2.047581195831299} 11/07/2021 10:15:13 - INFO - __main__ - Step 92472: {'lr': 0.00016436255294555117, 'samples': 17754624, 'steps': 92471, 'loss/train': 1.2929966449737549} 11/07/2021 10:15:14 - INFO - __main__ - Step 92473: {'lr': 0.0001643575672802462, 'samples': 17754816, 'steps': 92472, 'loss/train': 2.2780964374542236} 11/07/2021 10:15:14 - INFO - __main__ - Step 92474: {'lr': 0.00016435258165353034, 'samples': 17755008, 'steps': 92473, 'loss/train': 1.1891014575958252} 11/07/2021 10:15:15 - INFO - __main__ - Step 92475: {'lr': 0.00016434759606540595, 'samples': 17755200, 'steps': 92474, 'loss/train': 1.0858054161071777} 11/07/2021 10:15:15 - INFO - __main__ - Step 92476: {'lr': 0.00016434261051587518, 'samples': 17755392, 'steps': 92475, 'loss/train': 0.8350712060928345} 11/07/2021 10:15:16 - INFO - __main__ - Step 92477: {'lr': 0.00016433762500494032, 'samples': 17755584, 'steps': 92476, 'loss/train': 1.6086599826812744} 11/07/2021 10:15:16 - INFO - __main__ - Step 92478: {'lr': 0.00016433263953260368, 'samples': 17755776, 'steps': 92477, 'loss/train': 1.0273383855819702} 11/07/2021 10:15:17 - INFO - __main__ - Step 92479: {'lr': 0.00016432765409886736, 'samples': 17755968, 'steps': 92478, 'loss/train': 1.3279153108596802} 11/07/2021 10:15:17 - INFO - __main__ - Step 92480: {'lr': 0.00016432266870373367, 'samples': 17756160, 'steps': 92479, 'loss/train': 1.3578389883041382} 11/07/2021 10:15:18 - INFO - __main__ - Step 92481: {'lr': 0.00016431768334720484, 'samples': 17756352, 'steps': 92480, 'loss/train': 0.6071287989616394} 11/07/2021 10:15:18 - INFO - __main__ - Step 92482: {'lr': 0.00016431269802928317, 'samples': 17756544, 'steps': 92481, 'loss/train': 1.350475788116455} 11/07/2021 10:15:18 - INFO - __main__ - Step 92483: {'lr': 0.00016430771274997087, 'samples': 17756736, 'steps': 92482, 'loss/train': 1.184206247329712} 11/07/2021 10:15:19 - INFO - __main__ - Step 92484: {'lr': 0.00016430272750927018, 'samples': 17756928, 'steps': 92483, 'loss/train': 1.289831519126892} 11/07/2021 10:15:20 - INFO - __main__ - Step 92485: {'lr': 0.00016429774230718338, 'samples': 17757120, 'steps': 92484, 'loss/train': 0.32511892914772034} 11/07/2021 10:15:20 - INFO - __main__ - Step 92486: {'lr': 0.00016429275714371268, 'samples': 17757312, 'steps': 92485, 'loss/train': 1.5578539371490479} 11/07/2021 10:15:20 - INFO - __main__ - Step 92487: {'lr': 0.0001642877720188603, 'samples': 17757504, 'steps': 92486, 'loss/train': 1.402468204498291} 11/07/2021 10:15:21 - INFO - __main__ - Step 92488: {'lr': 0.00016428278693262857, 'samples': 17757696, 'steps': 92487, 'loss/train': 1.4472410678863525} 11/07/2021 10:15:22 - INFO - __main__ - Step 92489: {'lr': 0.0001642778018850197, 'samples': 17757888, 'steps': 92488, 'loss/train': 1.3542579412460327} 11/07/2021 10:15:22 - INFO - __main__ - Step 92490: {'lr': 0.0001642728168760359, 'samples': 17758080, 'steps': 92489, 'loss/train': 1.5992413759231567} 11/07/2021 10:15:23 - INFO - __main__ - Step 92491: {'lr': 0.0001642678319056795, 'samples': 17758272, 'steps': 92490, 'loss/train': 1.5028337240219116} 11/07/2021 10:15:23 - INFO - __main__ - Step 92492: {'lr': 0.00016426284697395276, 'samples': 17758464, 'steps': 92491, 'loss/train': 1.410550832748413} 11/07/2021 10:15:23 - INFO - __main__ - Step 92493: {'lr': 0.00016425786208085775, 'samples': 17758656, 'steps': 92492, 'loss/train': 1.1731536388397217} 11/07/2021 10:15:24 - INFO - __main__ - Step 92494: {'lr': 0.00016425287722639681, 'samples': 17758848, 'steps': 92493, 'loss/train': 1.5428975820541382} 11/07/2021 10:15:25 - INFO - __main__ - Step 92495: {'lr': 0.00016424789241057224, 'samples': 17759040, 'steps': 92494, 'loss/train': 0.897962749004364} 11/07/2021 10:15:25 - INFO - __main__ - Step 92496: {'lr': 0.00016424290763338622, 'samples': 17759232, 'steps': 92495, 'loss/train': 1.538249135017395} 11/07/2021 10:15:25 - INFO - __main__ - Step 92497: {'lr': 0.00016423792289484103, 'samples': 17759424, 'steps': 92496, 'loss/train': 1.4293814897537231} 11/07/2021 10:15:26 - INFO - __main__ - Step 92498: {'lr': 0.0001642329381949389, 'samples': 17759616, 'steps': 92497, 'loss/train': 1.6229833364486694} 11/07/2021 10:15:26 - INFO - __main__ - Step 92499: {'lr': 0.00016422795353368208, 'samples': 17759808, 'steps': 92498, 'loss/train': 1.301171898841858} 11/07/2021 10:15:27 - INFO - __main__ - Step 92500: {'lr': 0.00016422296891107285, 'samples': 17760000, 'steps': 92499, 'loss/train': 1.6480203866958618} 11/07/2021 10:15:27 - INFO - __main__ - Step 92501: {'lr': 0.00016421798432711345, 'samples': 17760192, 'steps': 92500, 'loss/train': 1.6547322273254395} 11/07/2021 10:15:28 - INFO - __main__ - Step 92502: {'lr': 0.00016421299978180604, 'samples': 17760384, 'steps': 92501, 'loss/train': 1.8240631818771362} 11/07/2021 10:15:28 - INFO - __main__ - Step 92503: {'lr': 0.00016420801527515294, 'samples': 17760576, 'steps': 92502, 'loss/train': 1.467203974723816} 11/07/2021 10:15:28 - INFO - __main__ - Step 92504: {'lr': 0.0001642030308071564, 'samples': 17760768, 'steps': 92503, 'loss/train': 1.368852138519287} 11/07/2021 10:15:30 - INFO - __main__ - Step 92505: {'lr': 0.00016419804637781874, 'samples': 17760960, 'steps': 92504, 'loss/train': 1.6062465906143188} 11/07/2021 10:15:30 - INFO - __main__ - Step 92506: {'lr': 0.00016419306198714201, 'samples': 17761152, 'steps': 92505, 'loss/train': 1.4534540176391602} 11/07/2021 10:15:30 - INFO - __main__ - Step 92507: {'lr': 0.0001641880776351286, 'samples': 17761344, 'steps': 92506, 'loss/train': 1.3531408309936523} 11/07/2021 10:15:31 - INFO - __main__ - Step 92508: {'lr': 0.00016418309332178065, 'samples': 17761536, 'steps': 92507, 'loss/train': 1.1833425760269165} 11/07/2021 10:15:31 - INFO - __main__ - Step 92509: {'lr': 0.00016417810904710057, 'samples': 17761728, 'steps': 92508, 'loss/train': 1.7958470582962036} 11/07/2021 10:15:32 - INFO - __main__ - Step 92510: {'lr': 0.00016417312481109043, 'samples': 17761920, 'steps': 92509, 'loss/train': 1.3125498294830322} 11/07/2021 10:15:32 - INFO - __main__ - Step 92511: {'lr': 0.00016416814061375257, 'samples': 17762112, 'steps': 92510, 'loss/train': 1.380415439605713} 11/07/2021 10:15:33 - INFO - __main__ - Step 92512: {'lr': 0.00016416315645508925, 'samples': 17762304, 'steps': 92511, 'loss/train': 1.4116185903549194} 11/07/2021 10:15:33 - INFO - __main__ - Step 92513: {'lr': 0.00016415817233510267, 'samples': 17762496, 'steps': 92512, 'loss/train': 1.156864881515503} 11/07/2021 10:15:33 - INFO - __main__ - Step 92514: {'lr': 0.0001641531882537951, 'samples': 17762688, 'steps': 92513, 'loss/train': 1.5161670446395874} 11/07/2021 10:15:34 - INFO - __main__ - Step 92515: {'lr': 0.00016414820421116878, 'samples': 17762880, 'steps': 92514, 'loss/train': 1.6957637071609497} 11/07/2021 10:15:35 - INFO - __main__ - Step 92516: {'lr': 0.00016414322020722594, 'samples': 17763072, 'steps': 92515, 'loss/train': 1.4175726175308228} 11/07/2021 10:15:35 - INFO - __main__ - Step 92517: {'lr': 0.00016413823624196884, 'samples': 17763264, 'steps': 92516, 'loss/train': 1.3628429174423218} 11/07/2021 10:15:35 - INFO - __main__ - Step 92518: {'lr': 0.00016413325231539984, 'samples': 17763456, 'steps': 92517, 'loss/train': 1.0999387502670288} 11/07/2021 10:15:36 - INFO - __main__ - Step 92519: {'lr': 0.00016412826842752097, 'samples': 17763648, 'steps': 92518, 'loss/train': 1.3997056484222412} 11/07/2021 10:15:37 - INFO - __main__ - Step 92520: {'lr': 0.00016412328457833457, 'samples': 17763840, 'steps': 92519, 'loss/train': 1.429007649421692} 11/07/2021 10:15:37 - INFO - __main__ - Step 92521: {'lr': 0.00016411830076784289, 'samples': 17764032, 'steps': 92520, 'loss/train': 1.9208552837371826} 11/07/2021 10:15:38 - INFO - __main__ - Step 92522: {'lr': 0.00016411331699604816, 'samples': 17764224, 'steps': 92521, 'loss/train': 1.3085711002349854} 11/07/2021 10:15:38 - INFO - __main__ - Step 92523: {'lr': 0.00016410833326295268, 'samples': 17764416, 'steps': 92522, 'loss/train': 1.495737075805664} 11/07/2021 10:15:38 - INFO - __main__ - Step 92524: {'lr': 0.00016410334956855867, 'samples': 17764608, 'steps': 92523, 'loss/train': 1.2807071208953857} 11/07/2021 10:15:39 - INFO - __main__ - Step 92525: {'lr': 0.0001640983659128683, 'samples': 17764800, 'steps': 92524, 'loss/train': 1.444475769996643} 11/07/2021 10:15:40 - INFO - __main__ - Step 92526: {'lr': 0.00016409338229588394, 'samples': 17764992, 'steps': 92525, 'loss/train': 1.3028889894485474} 11/07/2021 10:15:40 - INFO - __main__ - Step 92527: {'lr': 0.0001640883987176078, 'samples': 17765184, 'steps': 92526, 'loss/train': 1.5629349946975708} 11/07/2021 10:15:41 - INFO - __main__ - Step 92528: {'lr': 0.00016408341517804205, 'samples': 17765376, 'steps': 92527, 'loss/train': 1.3768236637115479} 11/07/2021 10:15:41 - INFO - __main__ - Step 92529: {'lr': 0.00016407843167718896, 'samples': 17765568, 'steps': 92528, 'loss/train': 1.178596019744873} 11/07/2021 10:15:42 - INFO - __main__ - Step 92530: {'lr': 0.00016407344821505086, 'samples': 17765760, 'steps': 92529, 'loss/train': 1.7753881216049194} 11/07/2021 10:15:42 - INFO - __main__ - Step 92531: {'lr': 0.00016406846479162995, 'samples': 17765952, 'steps': 92530, 'loss/train': 1.6058863401412964} 11/07/2021 10:15:43 - INFO - __main__ - Step 92532: {'lr': 0.0001640634814069285, 'samples': 17766144, 'steps': 92531, 'loss/train': 1.059861421585083} 11/07/2021 10:15:43 - INFO - __main__ - Step 92533: {'lr': 0.00016405849806094862, 'samples': 17766336, 'steps': 92532, 'loss/train': 1.4953755140304565} 11/07/2021 10:15:43 - INFO - __main__ - Step 92534: {'lr': 0.0001640535147536927, 'samples': 17766528, 'steps': 92533, 'loss/train': 1.2079702615737915} 11/07/2021 10:15:44 - INFO - __main__ - Step 92535: {'lr': 0.00016404853148516293, 'samples': 17766720, 'steps': 92534, 'loss/train': 1.455661416053772} 11/07/2021 10:15:45 - INFO - __main__ - Step 92536: {'lr': 0.00016404354825536155, 'samples': 17766912, 'steps': 92535, 'loss/train': 1.39882493019104} 11/07/2021 10:15:45 - INFO - __main__ - Step 92537: {'lr': 0.00016403856506429085, 'samples': 17767104, 'steps': 92536, 'loss/train': 1.431244969367981} 11/07/2021 10:15:45 - INFO - __main__ - Step 92538: {'lr': 0.00016403358191195304, 'samples': 17767296, 'steps': 92537, 'loss/train': 1.486861228942871} 11/07/2021 10:15:46 - INFO - __main__ - Step 92539: {'lr': 0.00016402859879835035, 'samples': 17767488, 'steps': 92538, 'loss/train': 1.0543843507766724} 11/07/2021 10:15:46 - INFO - __main__ - Step 92540: {'lr': 0.00016402361572348507, 'samples': 17767680, 'steps': 92539, 'loss/train': 0.6896103024482727} 11/07/2021 10:15:47 - INFO - __main__ - Step 92541: {'lr': 0.00016401863268735939, 'samples': 17767872, 'steps': 92540, 'loss/train': 2.050060272216797} 11/07/2021 10:15:47 - INFO - __main__ - Step 92542: {'lr': 0.00016401364968997566, 'samples': 17768064, 'steps': 92541, 'loss/train': 1.6076635122299194} 11/07/2021 10:15:48 - INFO - __main__ - Step 92543: {'lr': 0.00016400866673133599, 'samples': 17768256, 'steps': 92542, 'loss/train': 1.0837171077728271} 11/07/2021 10:15:48 - INFO - __main__ - Step 92544: {'lr': 0.0001640036838114427, 'samples': 17768448, 'steps': 92543, 'loss/train': 1.2450813055038452} 11/07/2021 10:15:48 - INFO - __main__ - Step 92545: {'lr': 0.00016399870093029807, 'samples': 17768640, 'steps': 92544, 'loss/train': 1.188309669494629} 11/07/2021 10:15:49 - INFO - __main__ - Step 92546: {'lr': 0.00016399371808790424, 'samples': 17768832, 'steps': 92545, 'loss/train': 1.0021189451217651} 11/07/2021 10:15:50 - INFO - __main__ - Step 92547: {'lr': 0.00016398873528426352, 'samples': 17769024, 'steps': 92546, 'loss/train': 0.99781334400177} 11/07/2021 10:15:50 - INFO - __main__ - Step 92548: {'lr': 0.00016398375251937817, 'samples': 17769216, 'steps': 92547, 'loss/train': 1.6183726787567139} 11/07/2021 10:15:51 - INFO - __main__ - Step 92549: {'lr': 0.0001639787697932504, 'samples': 17769408, 'steps': 92548, 'loss/train': 1.4555944204330444} 11/07/2021 10:15:51 - INFO - __main__ - Step 92550: {'lr': 0.00016397378710588246, 'samples': 17769600, 'steps': 92549, 'loss/train': 0.5009363293647766} 11/07/2021 10:15:52 - INFO - __main__ - Step 92551: {'lr': 0.0001639688044572766, 'samples': 17769792, 'steps': 92550, 'loss/train': 0.8349807262420654} 11/07/2021 10:15:52 - INFO - __main__ - Step 92552: {'lr': 0.0001639638218474351, 'samples': 17769984, 'steps': 92551, 'loss/train': 1.7745327949523926} 11/07/2021 10:15:53 - INFO - __main__ - Step 92553: {'lr': 0.00016395883927636018, 'samples': 17770176, 'steps': 92552, 'loss/train': 1.659533977508545} 11/07/2021 10:15:53 - INFO - __main__ - Step 92554: {'lr': 0.00016395385674405406, 'samples': 17770368, 'steps': 92553, 'loss/train': 1.0471147298812866} 11/07/2021 10:15:53 - INFO - __main__ - Step 92555: {'lr': 0.00016394887425051895, 'samples': 17770560, 'steps': 92554, 'loss/train': 1.4798473119735718} 11/07/2021 10:15:54 - INFO - __main__ - Step 92556: {'lr': 0.00016394389179575722, 'samples': 17770752, 'steps': 92555, 'loss/train': 2.1757633686065674} 11/07/2021 10:15:55 - INFO - __main__ - Step 92557: {'lr': 0.000163938909379771, 'samples': 17770944, 'steps': 92556, 'loss/train': 1.5149565935134888} 11/07/2021 10:15:55 - INFO - __main__ - Step 92558: {'lr': 0.0001639339270025626, 'samples': 17771136, 'steps': 92557, 'loss/train': 1.2848501205444336} 11/07/2021 10:15:56 - INFO - __main__ - Step 92559: {'lr': 0.00016392894466413433, 'samples': 17771328, 'steps': 92558, 'loss/train': 1.360984444618225} 11/07/2021 10:15:56 - INFO - __main__ - Step 92560: {'lr': 0.00016392396236448827, 'samples': 17771520, 'steps': 92559, 'loss/train': 1.5760667324066162} 11/07/2021 10:15:56 - INFO - __main__ - Step 92561: {'lr': 0.00016391898010362671, 'samples': 17771712, 'steps': 92560, 'loss/train': 5.759470462799072} 11/07/2021 10:15:57 - INFO - __main__ - Step 92562: {'lr': 0.00016391399788155195, 'samples': 17771904, 'steps': 92561, 'loss/train': 1.1403762102127075} 11/07/2021 10:15:58 - INFO - __main__ - Step 92563: {'lr': 0.0001639090156982662, 'samples': 17772096, 'steps': 92562, 'loss/train': 1.778415322303772} 11/07/2021 10:15:58 - INFO - __main__ - Step 92564: {'lr': 0.0001639040335537718, 'samples': 17772288, 'steps': 92563, 'loss/train': 1.9039665460586548} 11/07/2021 10:15:58 - INFO - __main__ - Step 92565: {'lr': 0.00016389905144807088, 'samples': 17772480, 'steps': 92564, 'loss/train': 1.4065836668014526} 11/07/2021 10:15:59 - INFO - __main__ - Step 92566: {'lr': 0.0001638940693811657, 'samples': 17772672, 'steps': 92565, 'loss/train': 1.4833356142044067} 11/07/2021 10:16:00 - INFO - __main__ - Step 92567: {'lr': 0.0001638890873530585, 'samples': 17772864, 'steps': 92566, 'loss/train': 1.0141485929489136} 11/07/2021 10:16:01 - INFO - __main__ - Step 92568: {'lr': 0.00016388410536375154, 'samples': 17773056, 'steps': 92567, 'loss/train': 1.3122867345809937} 11/07/2021 10:16:01 - INFO - __main__ - Step 92569: {'lr': 0.00016387912341324712, 'samples': 17773248, 'steps': 92568, 'loss/train': 1.293451189994812} 11/07/2021 10:16:01 - INFO - __main__ - Step 92570: {'lr': 0.0001638741415015474, 'samples': 17773440, 'steps': 92569, 'loss/train': 1.150863766670227} 11/07/2021 10:16:02 - INFO - __main__ - Step 92571: {'lr': 0.00016386915962865467, 'samples': 17773632, 'steps': 92570, 'loss/train': 1.4243963956832886} 11/07/2021 10:16:02 - INFO - __main__ - Step 92572: {'lr': 0.00016386417779457125, 'samples': 17773824, 'steps': 92571, 'loss/train': 1.2618415355682373} 11/07/2021 10:16:02 - INFO - __main__ - Step 92573: {'lr': 0.0001638591959992992, 'samples': 17774016, 'steps': 92572, 'loss/train': 0.9450951814651489} 11/07/2021 10:16:03 - INFO - __main__ - Step 92574: {'lr': 0.00016385421424284092, 'samples': 17774208, 'steps': 92573, 'loss/train': 1.7501769065856934} 11/07/2021 10:16:04 - INFO - __main__ - Step 92575: {'lr': 0.00016384923252519862, 'samples': 17774400, 'steps': 92574, 'loss/train': 0.9176738858222961} 11/07/2021 10:16:04 - INFO - __main__ - Step 92576: {'lr': 0.00016384425084637447, 'samples': 17774592, 'steps': 92575, 'loss/train': 1.5424351692199707} 11/07/2021 10:16:04 - INFO - __main__ - Step 92577: {'lr': 0.00016383926920637078, 'samples': 17774784, 'steps': 92576, 'loss/train': 1.348476767539978} 11/07/2021 10:16:05 - INFO - __main__ - Step 92578: {'lr': 0.00016383428760518982, 'samples': 17774976, 'steps': 92577, 'loss/train': 1.3351610898971558} 11/07/2021 10:16:06 - INFO - __main__ - Step 92579: {'lr': 0.00016382930604283375, 'samples': 17775168, 'steps': 92578, 'loss/train': 1.0032165050506592} 11/07/2021 10:16:06 - INFO - __main__ - Step 92580: {'lr': 0.00016382432451930487, 'samples': 17775360, 'steps': 92579, 'loss/train': 1.3549188375473022} 11/07/2021 10:16:07 - INFO - __main__ - Step 92581: {'lr': 0.00016381934303460544, 'samples': 17775552, 'steps': 92580, 'loss/train': 1.1356935501098633} 11/07/2021 10:16:07 - INFO - __main__ - Step 92582: {'lr': 0.00016381436158873769, 'samples': 17775744, 'steps': 92581, 'loss/train': 1.3899332284927368} 11/07/2021 10:16:07 - INFO - __main__ - Step 92583: {'lr': 0.00016380938018170383, 'samples': 17775936, 'steps': 92582, 'loss/train': 1.3464847803115845} 11/07/2021 10:16:08 - INFO - __main__ - Step 92584: {'lr': 0.00016380439881350618, 'samples': 17776128, 'steps': 92583, 'loss/train': 1.238462209701538} 11/07/2021 10:16:09 - INFO - __main__ - Step 92585: {'lr': 0.0001637994174841469, 'samples': 17776320, 'steps': 92584, 'loss/train': 1.3048111200332642} 11/07/2021 10:16:09 - INFO - __main__ - Step 92586: {'lr': 0.00016379443619362837, 'samples': 17776512, 'steps': 92585, 'loss/train': 1.7036755084991455} 11/07/2021 10:16:09 - INFO - __main__ - Step 92587: {'lr': 0.00016378945494195264, 'samples': 17776704, 'steps': 92586, 'loss/train': 1.2686266899108887} 11/07/2021 10:16:10 - INFO - __main__ - Step 92588: {'lr': 0.00016378447372912205, 'samples': 17776896, 'steps': 92587, 'loss/train': 1.628021478652954} 11/07/2021 10:16:10 - INFO - __main__ - Step 92589: {'lr': 0.00016377949255513887, 'samples': 17777088, 'steps': 92588, 'loss/train': 0.9867296814918518} 11/07/2021 10:16:11 - INFO - __main__ - Step 92590: {'lr': 0.0001637745114200053, 'samples': 17777280, 'steps': 92589, 'loss/train': 1.4203896522521973} 11/07/2021 10:16:11 - INFO - __main__ - Step 92591: {'lr': 0.0001637695303237236, 'samples': 17777472, 'steps': 92590, 'loss/train': 1.5193837881088257} 11/07/2021 10:16:12 - INFO - __main__ - Step 92592: {'lr': 0.00016376454926629602, 'samples': 17777664, 'steps': 92591, 'loss/train': 1.4087756872177124} 11/07/2021 10:16:12 - INFO - __main__ - Step 92593: {'lr': 0.0001637595682477248, 'samples': 17777856, 'steps': 92592, 'loss/train': 1.345204472541809} 11/07/2021 10:16:13 - INFO - __main__ - Step 92594: {'lr': 0.00016375458726801225, 'samples': 17778048, 'steps': 92593, 'loss/train': 1.5775920152664185} 11/07/2021 10:16:14 - INFO - __main__ - Step 92595: {'lr': 0.00016374960632716047, 'samples': 17778240, 'steps': 92594, 'loss/train': 1.3985692262649536} 11/07/2021 10:16:14 - INFO - __main__ - Step 92596: {'lr': 0.0001637446254251718, 'samples': 17778432, 'steps': 92595, 'loss/train': 1.3776719570159912} 11/07/2021 10:16:14 - INFO - __main__ - Step 92597: {'lr': 0.00016373964456204852, 'samples': 17778624, 'steps': 92596, 'loss/train': 1.1915762424468994} 11/07/2021 10:16:15 - INFO - __main__ - Step 92598: {'lr': 0.00016373466373779277, 'samples': 17778816, 'steps': 92597, 'loss/train': 1.3603847026824951} 11/07/2021 10:16:15 - INFO - __main__ - Step 92599: {'lr': 0.00016372968295240697, 'samples': 17779008, 'steps': 92598, 'loss/train': 1.2833517789840698} 11/07/2021 10:16:16 - INFO - __main__ - Step 92600: {'lr': 0.00016372470220589317, 'samples': 17779200, 'steps': 92599, 'loss/train': 1.455651044845581} 11/07/2021 10:16:16 - INFO - __main__ - Step 92601: {'lr': 0.00016371972149825366, 'samples': 17779392, 'steps': 92600, 'loss/train': 1.3482835292816162} 11/07/2021 10:16:17 - INFO - __main__ - Step 92602: {'lr': 0.00016371474082949071, 'samples': 17779584, 'steps': 92601, 'loss/train': 1.285388469696045} 11/07/2021 10:16:17 - INFO - __main__ - Step 92603: {'lr': 0.0001637097601996066, 'samples': 17779776, 'steps': 92602, 'loss/train': 1.2972588539123535} 11/07/2021 10:16:17 - INFO - __main__ - Step 92604: {'lr': 0.0001637047796086035, 'samples': 17779968, 'steps': 92603, 'loss/train': 1.2612721920013428} 11/07/2021 10:16:18 - INFO - __main__ - Step 92605: {'lr': 0.0001636997990564837, 'samples': 17780160, 'steps': 92604, 'loss/train': 1.7927533388137817} 11/07/2021 10:16:19 - INFO - __main__ - Step 92606: {'lr': 0.00016369481854324947, 'samples': 17780352, 'steps': 92605, 'loss/train': 1.0596426725387573} 11/07/2021 10:16:19 - INFO - __main__ - Step 92607: {'lr': 0.00016368983806890297, 'samples': 17780544, 'steps': 92606, 'loss/train': 1.378126859664917} 11/07/2021 10:16:19 - INFO - __main__ - Step 92608: {'lr': 0.00016368485763344653, 'samples': 17780736, 'steps': 92607, 'loss/train': 1.4715746641159058} 11/07/2021 10:16:20 - INFO - __main__ - Step 92609: {'lr': 0.00016367987723688238, 'samples': 17780928, 'steps': 92608, 'loss/train': 1.079538345336914} 11/07/2021 10:16:20 - INFO - __main__ - Step 92610: {'lr': 0.0001636748968792127, 'samples': 17781120, 'steps': 92609, 'loss/train': 1.1168562173843384} 11/07/2021 10:16:21 - INFO - __main__ - Step 92611: {'lr': 0.00016366991656043982, 'samples': 17781312, 'steps': 92610, 'loss/train': 1.3254517316818237} 11/07/2021 10:16:22 - INFO - __main__ - Step 92612: {'lr': 0.0001636649362805659, 'samples': 17781504, 'steps': 92611, 'loss/train': 1.362117052078247} 11/07/2021 10:16:22 - INFO - __main__ - Step 92613: {'lr': 0.00016365995603959338, 'samples': 17781696, 'steps': 92612, 'loss/train': 1.318534016609192} 11/07/2021 10:16:22 - INFO - __main__ - Step 92614: {'lr': 0.00016365497583752423, 'samples': 17781888, 'steps': 92613, 'loss/train': 1.036423683166504} 11/07/2021 10:16:23 - INFO - __main__ - Step 92615: {'lr': 0.00016364999567436078, 'samples': 17782080, 'steps': 92614, 'loss/train': 2.0138392448425293} 11/07/2021 10:16:24 - INFO - __main__ - Step 92616: {'lr': 0.00016364501555010536, 'samples': 17782272, 'steps': 92615, 'loss/train': 1.2940796613693237} 11/07/2021 10:16:24 - INFO - __main__ - Step 92617: {'lr': 0.00016364003546476014, 'samples': 17782464, 'steps': 92616, 'loss/train': 1.2055553197860718} 11/07/2021 10:16:24 - INFO - __main__ - Step 92618: {'lr': 0.0001636350554183274, 'samples': 17782656, 'steps': 92617, 'loss/train': 0.6430858969688416} 11/07/2021 10:16:25 - INFO - __main__ - Step 92619: {'lr': 0.00016363007541080938, 'samples': 17782848, 'steps': 92618, 'loss/train': 1.1441137790679932} 11/07/2021 10:16:25 - INFO - __main__ - Step 92620: {'lr': 0.00016362509544220826, 'samples': 17783040, 'steps': 92619, 'loss/train': 1.0445475578308105} 11/07/2021 10:16:26 - INFO - __main__ - Step 92621: {'lr': 0.0001636201155125264, 'samples': 17783232, 'steps': 92620, 'loss/train': 1.646746277809143} 11/07/2021 10:16:26 - INFO - __main__ - Step 92622: {'lr': 0.00016361513562176595, 'samples': 17783424, 'steps': 92621, 'loss/train': 1.007261037826538} 11/07/2021 10:16:27 - INFO - __main__ - Step 92623: {'lr': 0.00016361015576992922, 'samples': 17783616, 'steps': 92622, 'loss/train': 1.5014339685440063} 11/07/2021 10:16:27 - INFO - __main__ - Step 92624: {'lr': 0.00016360517595701837, 'samples': 17783808, 'steps': 92623, 'loss/train': 1.2904982566833496} 11/07/2021 10:16:27 - INFO - __main__ - Step 92625: {'lr': 0.00016360019618303574, 'samples': 17784000, 'steps': 92624, 'loss/train': 1.254024863243103} 11/07/2021 10:16:28 - INFO - __main__ - Step 92626: {'lr': 0.0001635952164479836, 'samples': 17784192, 'steps': 92625, 'loss/train': 1.3332663774490356} 11/07/2021 10:16:29 - INFO - __main__ - Step 92627: {'lr': 0.00016359023675186401, 'samples': 17784384, 'steps': 92626, 'loss/train': 0.7572280168533325} 11/07/2021 10:16:29 - INFO - __main__ - Step 92628: {'lr': 0.00016358525709467937, 'samples': 17784576, 'steps': 92627, 'loss/train': 1.0646765232086182} 11/07/2021 10:16:30 - INFO - __main__ - Step 92629: {'lr': 0.00016358027747643186, 'samples': 17784768, 'steps': 92628, 'loss/train': 1.5591871738433838} 11/07/2021 10:16:30 - INFO - __main__ - Step 92630: {'lr': 0.00016357529789712375, 'samples': 17784960, 'steps': 92629, 'loss/train': 1.4235162734985352} 11/07/2021 10:16:31 - INFO - __main__ - Step 92631: {'lr': 0.00016357031835675728, 'samples': 17785152, 'steps': 92630, 'loss/train': 1.9182500839233398} 11/07/2021 10:16:31 - INFO - __main__ - Step 92632: {'lr': 0.00016356533885533467, 'samples': 17785344, 'steps': 92631, 'loss/train': 1.4920705556869507} 11/07/2021 10:16:32 - INFO - __main__ - Step 92633: {'lr': 0.00016356035939285818, 'samples': 17785536, 'steps': 92632, 'loss/train': 1.4770488739013672} 11/07/2021 10:16:32 - INFO - __main__ - Step 92634: {'lr': 0.00016355537996933008, 'samples': 17785728, 'steps': 92633, 'loss/train': 1.2446842193603516} 11/07/2021 10:16:32 - INFO - __main__ - Step 92635: {'lr': 0.00016355040058475256, 'samples': 17785920, 'steps': 92634, 'loss/train': 1.3018394708633423} 11/07/2021 10:16:33 - INFO - __main__ - Step 92636: {'lr': 0.00016354542123912796, 'samples': 17786112, 'steps': 92635, 'loss/train': 1.1945165395736694} 11/07/2021 10:16:34 - INFO - __main__ - Step 92637: {'lr': 0.0001635404419324584, 'samples': 17786304, 'steps': 92636, 'loss/train': 1.4196317195892334} 11/07/2021 10:16:34 - INFO - __main__ - Step 92638: {'lr': 0.00016353546266474622, 'samples': 17786496, 'steps': 92637, 'loss/train': 1.4237120151519775} 11/07/2021 10:16:35 - INFO - __main__ - Step 92639: {'lr': 0.00016353048343599368, 'samples': 17786688, 'steps': 92638, 'loss/train': 1.2221497297286987} 11/07/2021 10:16:35 - INFO - __main__ - Step 92640: {'lr': 0.0001635255042462029, 'samples': 17786880, 'steps': 92639, 'loss/train': 1.8257728815078735} 11/07/2021 10:16:35 - INFO - __main__ - Step 92641: {'lr': 0.0001635205250953762, 'samples': 17787072, 'steps': 92640, 'loss/train': 1.3901124000549316} 11/07/2021 10:16:36 - INFO - __main__ - Step 92642: {'lr': 0.0001635155459835158, 'samples': 17787264, 'steps': 92641, 'loss/train': 0.42945215106010437} 11/07/2021 10:16:37 - INFO - __main__ - Step 92643: {'lr': 0.00016351056691062398, 'samples': 17787456, 'steps': 92642, 'loss/train': 1.4871892929077148} 11/07/2021 10:16:37 - INFO - __main__ - Step 92644: {'lr': 0.00016350558787670295, 'samples': 17787648, 'steps': 92643, 'loss/train': 1.5652612447738647} 11/07/2021 10:16:37 - INFO - __main__ - Step 92645: {'lr': 0.000163500608881755, 'samples': 17787840, 'steps': 92644, 'loss/train': 0.9351515173912048} 11/07/2021 10:16:38 - INFO - __main__ - Step 92646: {'lr': 0.0001634956299257823, 'samples': 17788032, 'steps': 92645, 'loss/train': 1.4911304712295532} 11/07/2021 10:16:39 - INFO - __main__ - Step 92647: {'lr': 0.00016349065100878713, 'samples': 17788224, 'steps': 92646, 'loss/train': 2.328734874725342} 11/07/2021 10:16:39 - INFO - __main__ - Step 92648: {'lr': 0.00016348567213077175, 'samples': 17788416, 'steps': 92647, 'loss/train': 0.49325793981552124} 11/07/2021 10:16:39 - INFO - __main__ - Step 92649: {'lr': 0.0001634806932917384, 'samples': 17788608, 'steps': 92648, 'loss/train': 0.5787261128425598} 11/07/2021 10:16:40 - INFO - __main__ - Step 92650: {'lr': 0.00016347571449168929, 'samples': 17788800, 'steps': 92649, 'loss/train': 1.4066740274429321} 11/07/2021 10:16:40 - INFO - __main__ - Step 92651: {'lr': 0.0001634707357306267, 'samples': 17788992, 'steps': 92650, 'loss/train': 2.197436571121216} 11/07/2021 10:16:41 - INFO - __main__ - Step 92652: {'lr': 0.00016346575700855288, 'samples': 17789184, 'steps': 92651, 'loss/train': 1.5674848556518555} 11/07/2021 10:16:42 - INFO - __main__ - Step 92653: {'lr': 0.00016346077832547017, 'samples': 17789376, 'steps': 92652, 'loss/train': 0.7921027541160583} 11/07/2021 10:16:42 - INFO - __main__ - Step 92654: {'lr': 0.0001634557996813806, 'samples': 17789568, 'steps': 92653, 'loss/train': 1.4049471616744995} 11/07/2021 10:16:42 - INFO - __main__ - Step 92655: {'lr': 0.00016345082107628646, 'samples': 17789760, 'steps': 92654, 'loss/train': 1.0696755647659302} 11/07/2021 10:16:43 - INFO - __main__ - Step 92656: {'lr': 0.00016344584251019005, 'samples': 17789952, 'steps': 92655, 'loss/train': 1.3799084424972534} 11/07/2021 10:16:43 - INFO - __main__ - Step 92657: {'lr': 0.0001634408639830936, 'samples': 17790144, 'steps': 92656, 'loss/train': 1.3610957860946655} 11/07/2021 10:16:44 - INFO - __main__ - Step 92658: {'lr': 0.0001634358854949994, 'samples': 17790336, 'steps': 92657, 'loss/train': 1.5730022192001343} 11/07/2021 10:16:44 - INFO - __main__ - Step 92659: {'lr': 0.00016343090704590963, 'samples': 17790528, 'steps': 92658, 'loss/train': 1.7026361227035522} 11/07/2021 10:16:45 - INFO - __main__ - Step 92660: {'lr': 0.00016342592863582655, 'samples': 17790720, 'steps': 92659, 'loss/train': 1.4171662330627441} 11/07/2021 10:16:45 - INFO - __main__ - Step 92661: {'lr': 0.00016342095026475244, 'samples': 17790912, 'steps': 92660, 'loss/train': 0.6654432415962219} 11/07/2021 10:16:45 - INFO - __main__ - Step 92662: {'lr': 0.00016341597193268953, 'samples': 17791104, 'steps': 92661, 'loss/train': 1.0429426431655884} 11/07/2021 10:16:46 - INFO - __main__ - Step 92663: {'lr': 0.00016341099363964, 'samples': 17791296, 'steps': 92662, 'loss/train': 1.09403657913208} 11/07/2021 10:16:47 - INFO - __main__ - Step 92664: {'lr': 0.00016340601538560617, 'samples': 17791488, 'steps': 92663, 'loss/train': 1.388963222503662} 11/07/2021 10:16:47 - INFO - __main__ - Step 92665: {'lr': 0.00016340103717059023, 'samples': 17791680, 'steps': 92664, 'loss/train': 1.4588472843170166} 11/07/2021 10:16:47 - INFO - __main__ - Step 92666: {'lr': 0.00016339605899459456, 'samples': 17791872, 'steps': 92665, 'loss/train': 0.9279698729515076} 11/07/2021 10:16:48 - INFO - __main__ - Step 92667: {'lr': 0.0001633910808576212, 'samples': 17792064, 'steps': 92666, 'loss/train': 1.110673189163208} 11/07/2021 10:16:49 - INFO - __main__ - Step 92668: {'lr': 0.00016338610275967247, 'samples': 17792256, 'steps': 92667, 'loss/train': 1.5279338359832764} 11/07/2021 10:16:49 - INFO - __main__ - Step 92669: {'lr': 0.0001633811247007506, 'samples': 17792448, 'steps': 92668, 'loss/train': 1.3412106037139893} 11/07/2021 10:16:50 - INFO - __main__ - Step 92670: {'lr': 0.0001633761466808579, 'samples': 17792640, 'steps': 92669, 'loss/train': 5.4109697341918945} 11/07/2021 10:16:50 - INFO - __main__ - Step 92671: {'lr': 0.00016337116869999654, 'samples': 17792832, 'steps': 92670, 'loss/train': 1.1131126880645752} 11/07/2021 10:16:50 - INFO - __main__ - Step 92672: {'lr': 0.00016336619075816883, 'samples': 17793024, 'steps': 92671, 'loss/train': 1.2721247673034668} 11/07/2021 10:16:52 - INFO - __main__ - Step 92673: {'lr': 0.00016336121285537696, 'samples': 17793216, 'steps': 92672, 'loss/train': 1.036698818206787} 11/07/2021 10:16:52 - INFO - __main__ - Step 92674: {'lr': 0.00016335623499162316, 'samples': 17793408, 'steps': 92673, 'loss/train': 1.3004069328308105} 11/07/2021 10:16:52 - INFO - __main__ - Step 92675: {'lr': 0.00016335125716690973, 'samples': 17793600, 'steps': 92674, 'loss/train': 1.419641375541687} 11/07/2021 10:16:53 - INFO - __main__ - Step 92676: {'lr': 0.0001633462793812389, 'samples': 17793792, 'steps': 92675, 'loss/train': 1.2259224653244019} 11/07/2021 10:16:53 - INFO - __main__ - Step 92677: {'lr': 0.00016334130163461294, 'samples': 17793984, 'steps': 92676, 'loss/train': 1.7580686807632446} 11/07/2021 10:16:54 - INFO - __main__ - Step 92678: {'lr': 0.00016333632392703402, 'samples': 17794176, 'steps': 92677, 'loss/train': 1.5931727886199951} 11/07/2021 10:16:54 - INFO - __main__ - Step 92679: {'lr': 0.00016333134625850438, 'samples': 17794368, 'steps': 92678, 'loss/train': 1.1020079851150513} 11/07/2021 10:16:55 - INFO - __main__ - Step 92680: {'lr': 0.00016332636862902635, 'samples': 17794560, 'steps': 92679, 'loss/train': 1.6454553604125977} 11/07/2021 10:16:55 - INFO - __main__ - Step 92681: {'lr': 0.0001633213910386021, 'samples': 17794752, 'steps': 92680, 'loss/train': 1.2093024253845215} 11/07/2021 10:16:55 - INFO - __main__ - Step 92682: {'lr': 0.00016331641348723387, 'samples': 17794944, 'steps': 92681, 'loss/train': 1.121864676475525} 11/07/2021 10:16:56 - INFO - __main__ - Step 92683: {'lr': 0.00016331143597492394, 'samples': 17795136, 'steps': 92682, 'loss/train': 1.4932925701141357} 11/07/2021 10:16:57 - INFO - __main__ - Step 92684: {'lr': 0.00016330645850167453, 'samples': 17795328, 'steps': 92683, 'loss/train': 0.8460497856140137} 11/07/2021 10:16:57 - INFO - __main__ - Step 92685: {'lr': 0.00016330148106748787, 'samples': 17795520, 'steps': 92684, 'loss/train': 1.3870069980621338} 11/07/2021 10:16:57 - INFO - __main__ - Step 92686: {'lr': 0.00016329650367236627, 'samples': 17795712, 'steps': 92685, 'loss/train': 1.5152980089187622} 11/07/2021 10:16:58 - INFO - __main__ - Step 92687: {'lr': 0.00016329152631631196, 'samples': 17795904, 'steps': 92686, 'loss/train': 1.2023320198059082} 11/07/2021 10:16:58 - INFO - __main__ - Step 92688: {'lr': 0.0001632865489993271, 'samples': 17796096, 'steps': 92687, 'loss/train': 1.4444875717163086} 11/07/2021 10:16:59 - INFO - __main__ - Step 92689: {'lr': 0.000163281571721414, 'samples': 17796288, 'steps': 92688, 'loss/train': 1.85741126537323} 11/07/2021 10:17:00 - INFO - __main__ - Step 92690: {'lr': 0.00016327659448257486, 'samples': 17796480, 'steps': 92689, 'loss/train': 1.4815552234649658} 11/07/2021 10:17:00 - INFO - __main__ - Step 92691: {'lr': 0.00016327161728281196, 'samples': 17796672, 'steps': 92690, 'loss/train': 1.1945337057113647} 11/07/2021 10:17:00 - INFO - __main__ - Step 92692: {'lr': 0.0001632666401221275, 'samples': 17796864, 'steps': 92691, 'loss/train': 1.309108018875122} 11/07/2021 10:17:01 - INFO - __main__ - Step 92693: {'lr': 0.00016326166300052383, 'samples': 17797056, 'steps': 92692, 'loss/train': 0.9956023693084717} 11/07/2021 10:17:01 - INFO - __main__ - Step 92694: {'lr': 0.00016325668591800308, 'samples': 17797248, 'steps': 92693, 'loss/train': 1.0019067525863647} 11/07/2021 10:17:02 - INFO - __main__ - Step 92695: {'lr': 0.00016325170887456752, 'samples': 17797440, 'steps': 92694, 'loss/train': 1.3991477489471436} 11/07/2021 10:17:02 - INFO - __main__ - Step 92696: {'lr': 0.00016324673187021938, 'samples': 17797632, 'steps': 92695, 'loss/train': 1.5124638080596924} 11/07/2021 10:17:03 - INFO - __main__ - Step 92697: {'lr': 0.00016324175490496095, 'samples': 17797824, 'steps': 92696, 'loss/train': 1.6005102396011353} 11/07/2021 10:17:03 - INFO - __main__ - Step 92698: {'lr': 0.00016323677797879448, 'samples': 17798016, 'steps': 92697, 'loss/train': 1.2870374917984009} 11/07/2021 10:17:04 - INFO - __main__ - Step 92699: {'lr': 0.00016323180109172216, 'samples': 17798208, 'steps': 92698, 'loss/train': 1.5232759714126587} 11/07/2021 10:17:05 - INFO - __main__ - Step 92700: {'lr': 0.00016322682424374618, 'samples': 17798400, 'steps': 92699, 'loss/train': 1.1693644523620605} 11/07/2021 10:17:05 - INFO - __main__ - Step 92701: {'lr': 0.00016322184743486893, 'samples': 17798592, 'steps': 92700, 'loss/train': 1.3185725212097168} 11/07/2021 10:17:06 - INFO - __main__ - Step 92702: {'lr': 0.00016321687066509256, 'samples': 17798784, 'steps': 92701, 'loss/train': 1.4869104623794556} 11/07/2021 10:17:06 - INFO - __main__ - Step 92703: {'lr': 0.0001632118939344193, 'samples': 17798976, 'steps': 92702, 'loss/train': 1.5164060592651367} 11/07/2021 10:17:06 - INFO - __main__ - Step 92704: {'lr': 0.0001632069172428514, 'samples': 17799168, 'steps': 92703, 'loss/train': 0.9506267309188843} 11/07/2021 10:17:07 - INFO - __main__ - Step 92705: {'lr': 0.00016320194059039116, 'samples': 17799360, 'steps': 92704, 'loss/train': 1.2475367784500122} 11/07/2021 10:17:08 - INFO - __main__ - Step 92706: {'lr': 0.00016319696397704082, 'samples': 17799552, 'steps': 92705, 'loss/train': 1.6974927186965942} 11/07/2021 10:17:08 - INFO - __main__ - Step 92707: {'lr': 0.00016319198740280262, 'samples': 17799744, 'steps': 92706, 'loss/train': 1.521805763244629} 11/07/2021 10:17:08 - INFO - __main__ - Step 92708: {'lr': 0.00016318701086767869, 'samples': 17799936, 'steps': 92707, 'loss/train': 1.405255675315857} 11/07/2021 10:17:09 - INFO - __main__ - Step 92709: {'lr': 0.0001631820343716714, 'samples': 17800128, 'steps': 92708, 'loss/train': 1.3068495988845825} 11/07/2021 10:17:10 - INFO - __main__ - Step 92710: {'lr': 0.00016317705791478294, 'samples': 17800320, 'steps': 92709, 'loss/train': 1.5754075050354004} 11/07/2021 10:17:10 - INFO - __main__ - Step 92711: {'lr': 0.00016317208149701555, 'samples': 17800512, 'steps': 92710, 'loss/train': 1.6436536312103271} 11/07/2021 10:17:11 - INFO - __main__ - Step 92712: {'lr': 0.00016316710511837145, 'samples': 17800704, 'steps': 92711, 'loss/train': 1.1724507808685303} 11/07/2021 10:17:11 - INFO - __main__ - Step 92713: {'lr': 0.00016316212877885293, 'samples': 17800896, 'steps': 92712, 'loss/train': 1.5961461067199707} 11/07/2021 10:17:11 - INFO - __main__ - Step 92714: {'lr': 0.00016315715247846219, 'samples': 17801088, 'steps': 92713, 'loss/train': 1.4957975149154663} 11/07/2021 10:17:12 - INFO - __main__ - Step 92715: {'lr': 0.00016315217621720152, 'samples': 17801280, 'steps': 92714, 'loss/train': 0.7805781364440918} 11/07/2021 10:17:13 - INFO - __main__ - Step 92716: {'lr': 0.00016314719999507317, 'samples': 17801472, 'steps': 92715, 'loss/train': 1.052405834197998} 11/07/2021 10:17:13 - INFO - __main__ - Step 92717: {'lr': 0.0001631422238120793, 'samples': 17801664, 'steps': 92716, 'loss/train': 0.35964953899383545} 11/07/2021 10:17:13 - INFO - __main__ - Step 92718: {'lr': 0.00016313724766822221, 'samples': 17801856, 'steps': 92717, 'loss/train': 1.6485741138458252} 11/07/2021 10:17:14 - INFO - __main__ - Step 92719: {'lr': 0.00016313227156350416, 'samples': 17802048, 'steps': 92718, 'loss/train': 1.4295376539230347} 11/07/2021 10:17:14 - INFO - __main__ - Step 92720: {'lr': 0.00016312729549792745, 'samples': 17802240, 'steps': 92719, 'loss/train': 1.0209852457046509} 11/07/2021 10:17:15 - INFO - __main__ - Step 92721: {'lr': 0.00016312231947149413, 'samples': 17802432, 'steps': 92720, 'loss/train': 1.8369240760803223} 11/07/2021 10:17:16 - INFO - __main__ - Step 92722: {'lr': 0.0001631173434842066, 'samples': 17802624, 'steps': 92721, 'loss/train': 1.1415077447891235} 11/07/2021 10:17:16 - INFO - __main__ - Step 92723: {'lr': 0.00016311236753606702, 'samples': 17802816, 'steps': 92722, 'loss/train': 1.190447449684143} 11/07/2021 10:17:16 - INFO - __main__ - Step 92724: {'lr': 0.00016310739162707767, 'samples': 17803008, 'steps': 92723, 'loss/train': 1.2611267566680908} 11/07/2021 10:17:17 - INFO - __main__ - Step 92725: {'lr': 0.00016310241575724077, 'samples': 17803200, 'steps': 92724, 'loss/train': 1.2705869674682617} 11/07/2021 10:17:18 - INFO - __main__ - Step 92726: {'lr': 0.00016309743992655863, 'samples': 17803392, 'steps': 92725, 'loss/train': 1.100801944732666} 11/07/2021 10:17:18 - INFO - __main__ - Step 92727: {'lr': 0.0001630924641350334, 'samples': 17803584, 'steps': 92726, 'loss/train': 1.210171103477478} 11/07/2021 10:17:18 - INFO - __main__ - Step 92728: {'lr': 0.00016308748838266736, 'samples': 17803776, 'steps': 92727, 'loss/train': 1.1372392177581787} 11/07/2021 10:17:19 - INFO - __main__ - Step 92729: {'lr': 0.00016308251266946279, 'samples': 17803968, 'steps': 92728, 'loss/train': 1.1886552572250366} 11/07/2021 10:17:19 - INFO - __main__ - Step 92730: {'lr': 0.0001630775369954219, 'samples': 17804160, 'steps': 92729, 'loss/train': 1.3469164371490479} 11/07/2021 10:17:20 - INFO - __main__ - Step 92731: {'lr': 0.0001630725613605469, 'samples': 17804352, 'steps': 92730, 'loss/train': 1.5126031637191772} 11/07/2021 10:17:20 - INFO - __main__ - Step 92732: {'lr': 0.00016306758576484004, 'samples': 17804544, 'steps': 92731, 'loss/train': 1.0912145376205444} 11/07/2021 10:17:21 - INFO - __main__ - Step 92733: {'lr': 0.00016306261020830365, 'samples': 17804736, 'steps': 92732, 'loss/train': 1.5234079360961914} 11/07/2021 10:17:21 - INFO - __main__ - Step 92734: {'lr': 0.00016305763469093998, 'samples': 17804928, 'steps': 92733, 'loss/train': 1.5597866773605347} 11/07/2021 10:17:21 - INFO - __main__ - Step 92735: {'lr': 0.00016305265921275107, 'samples': 17805120, 'steps': 92734, 'loss/train': 1.1682028770446777} 11/07/2021 10:17:24 - INFO - __main__ - Step 92736: {'lr': 0.00016304768377373933, 'samples': 17805312, 'steps': 92735, 'loss/train': 1.0222060680389404} 11/07/2021 10:17:24 - INFO - __main__ - Step 92737: {'lr': 0.00016304270837390694, 'samples': 17805504, 'steps': 92736, 'loss/train': 1.151965618133545} 11/07/2021 10:17:25 - INFO - __main__ - Step 92738: {'lr': 0.00016303773301325618, 'samples': 17805696, 'steps': 92737, 'loss/train': 0.9811510443687439} 11/07/2021 10:17:25 - INFO - __main__ - Step 92739: {'lr': 0.00016303275769178924, 'samples': 17805888, 'steps': 92738, 'loss/train': 1.7618837356567383} 11/07/2021 10:17:25 - INFO - __main__ - Step 92740: {'lr': 0.00016302778240950843, 'samples': 17806080, 'steps': 92739, 'loss/train': 1.7493408918380737} 11/07/2021 10:17:26 - INFO - __main__ - Step 92741: {'lr': 0.00016302280716641593, 'samples': 17806272, 'steps': 92740, 'loss/train': 0.7858427166938782} 11/07/2021 10:17:26 - INFO - __main__ - Step 92742: {'lr': 0.00016301783196251405, 'samples': 17806464, 'steps': 92741, 'loss/train': 1.7594085931777954} 11/07/2021 10:17:26 - INFO - __main__ - Step 92743: {'lr': 0.00016301285679780496, 'samples': 17806656, 'steps': 92742, 'loss/train': 0.8630133271217346} 11/07/2021 10:17:27 - INFO - __main__ - Step 92744: {'lr': 0.00016300788167229098, 'samples': 17806848, 'steps': 92743, 'loss/train': 0.86916583776474} 11/07/2021 10:17:28 - INFO - __main__ - Step 92745: {'lr': 0.00016300290658597427, 'samples': 17807040, 'steps': 92744, 'loss/train': 1.4087207317352295} 11/07/2021 10:17:28 - INFO - __main__ - Step 92746: {'lr': 0.0001629979315388571, 'samples': 17807232, 'steps': 92745, 'loss/train': 1.8687664270401} 11/07/2021 10:17:28 - INFO - __main__ - Step 92747: {'lr': 0.00016299295653094182, 'samples': 17807424, 'steps': 92746, 'loss/train': 1.336948275566101} 11/07/2021 10:17:29 - INFO - __main__ - Step 92748: {'lr': 0.0001629879815622305, 'samples': 17807616, 'steps': 92747, 'loss/train': 1.1478071212768555} 11/07/2021 10:17:30 - INFO - __main__ - Step 92749: {'lr': 0.0001629830066327254, 'samples': 17807808, 'steps': 92748, 'loss/train': 1.282707929611206} 11/07/2021 10:17:30 - INFO - __main__ - Step 92750: {'lr': 0.00016297803174242887, 'samples': 17808000, 'steps': 92749, 'loss/train': 1.1170682907104492} 11/07/2021 10:17:31 - INFO - __main__ - Step 92751: {'lr': 0.0001629730568913431, 'samples': 17808192, 'steps': 92750, 'loss/train': 0.8599065542221069} 11/07/2021 10:17:31 - INFO - __main__ - Step 92752: {'lr': 0.00016296808207947027, 'samples': 17808384, 'steps': 92751, 'loss/train': 1.6623480319976807} 11/07/2021 10:17:31 - INFO - __main__ - Step 92753: {'lr': 0.00016296310730681273, 'samples': 17808576, 'steps': 92752, 'loss/train': 1.196974754333496} 11/07/2021 10:17:32 - INFO - __main__ - Step 92754: {'lr': 0.00016295813257337266, 'samples': 17808768, 'steps': 92753, 'loss/train': 1.196539044380188} 11/07/2021 10:17:33 - INFO - __main__ - Step 92755: {'lr': 0.00016295315787915232, 'samples': 17808960, 'steps': 92754, 'loss/train': 1.18233323097229} 11/07/2021 10:17:33 - INFO - __main__ - Step 92756: {'lr': 0.00016294818322415392, 'samples': 17809152, 'steps': 92755, 'loss/train': 1.0091495513916016} 11/07/2021 10:17:33 - INFO - __main__ - Step 92757: {'lr': 0.00016294320860837976, 'samples': 17809344, 'steps': 92756, 'loss/train': 2.031006097793579} 11/07/2021 10:17:34 - INFO - __main__ - Step 92758: {'lr': 0.000162938234031832, 'samples': 17809536, 'steps': 92757, 'loss/train': 1.5686616897583008} 11/07/2021 10:17:35 - INFO - __main__ - Step 92759: {'lr': 0.00016293325949451293, 'samples': 17809728, 'steps': 92758, 'loss/train': 0.07223021239042282} 11/07/2021 10:17:35 - INFO - __main__ - Step 92760: {'lr': 0.00016292828499642493, 'samples': 17809920, 'steps': 92759, 'loss/train': 1.1719317436218262} 11/07/2021 10:17:35 - INFO - __main__ - Step 92761: {'lr': 0.00016292331053756998, 'samples': 17810112, 'steps': 92760, 'loss/train': 1.4475584030151367} 11/07/2021 10:17:36 - INFO - __main__ - Step 92762: {'lr': 0.00016291833611795046, 'samples': 17810304, 'steps': 92761, 'loss/train': 1.4197282791137695} 11/07/2021 10:17:36 - INFO - __main__ - Step 92763: {'lr': 0.00016291336173756857, 'samples': 17810496, 'steps': 92762, 'loss/train': 1.757225751876831} 11/07/2021 10:17:37 - INFO - __main__ - Step 92764: {'lr': 0.00016290838739642662, 'samples': 17810688, 'steps': 92763, 'loss/train': 0.8610395193099976} 11/07/2021 10:17:38 - INFO - __main__ - Step 92765: {'lr': 0.0001629034130945267, 'samples': 17810880, 'steps': 92764, 'loss/train': 1.3936907052993774} 11/07/2021 10:17:38 - INFO - __main__ - Step 92766: {'lr': 0.00016289843883187128, 'samples': 17811072, 'steps': 92765, 'loss/train': 1.679727554321289} 11/07/2021 10:17:38 - INFO - __main__ - Step 92767: {'lr': 0.0001628934646084624, 'samples': 17811264, 'steps': 92766, 'loss/train': 2.0769309997558594} 11/07/2021 10:17:39 - INFO - __main__ - Step 92768: {'lr': 0.00016288849042430244, 'samples': 17811456, 'steps': 92767, 'loss/train': 1.2694792747497559} 11/07/2021 10:17:39 - INFO - __main__ - Step 92769: {'lr': 0.0001628835162793935, 'samples': 17811648, 'steps': 92768, 'loss/train': 1.7672224044799805} 11/07/2021 10:17:40 - INFO - __main__ - Step 92770: {'lr': 0.000162878542173738, 'samples': 17811840, 'steps': 92769, 'loss/train': 1.620589256286621} 11/07/2021 10:17:40 - INFO - __main__ - Step 92771: {'lr': 0.00016287356810733804, 'samples': 17812032, 'steps': 92770, 'loss/train': 1.599994421005249} 11/07/2021 10:17:41 - INFO - __main__ - Step 92772: {'lr': 0.00016286859408019588, 'samples': 17812224, 'steps': 92771, 'loss/train': 1.0175421237945557} 11/07/2021 10:17:41 - INFO - __main__ - Step 92773: {'lr': 0.0001628636200923138, 'samples': 17812416, 'steps': 92772, 'loss/train': 1.8251111507415771} 11/07/2021 10:17:41 - INFO - __main__ - Step 92774: {'lr': 0.00016285864614369418, 'samples': 17812608, 'steps': 92773, 'loss/train': 1.6992754936218262} 11/07/2021 10:17:42 - INFO - __main__ - Step 92775: {'lr': 0.00016285367223433893, 'samples': 17812800, 'steps': 92774, 'loss/train': 0.8603866696357727} 11/07/2021 10:17:43 - INFO - __main__ - Step 92776: {'lr': 0.00016284869836425054, 'samples': 17812992, 'steps': 92775, 'loss/train': 1.2025773525238037} 11/07/2021 10:17:43 - INFO - __main__ - Step 92777: {'lr': 0.00016284372453343116, 'samples': 17813184, 'steps': 92776, 'loss/train': 1.3616394996643066} 11/07/2021 10:17:43 - INFO - __main__ - Step 92778: {'lr': 0.00016283875074188302, 'samples': 17813376, 'steps': 92777, 'loss/train': 1.796685814857483} 11/07/2021 10:17:44 - INFO - __main__ - Step 92779: {'lr': 0.00016283377698960843, 'samples': 17813568, 'steps': 92778, 'loss/train': 1.064462423324585} 11/07/2021 10:17:45 - INFO - __main__ - Step 92780: {'lr': 0.0001628288032766096, 'samples': 17813760, 'steps': 92779, 'loss/train': 1.2900855541229248} 11/07/2021 10:17:45 - INFO - __main__ - Step 92781: {'lr': 0.00016282382960288873, 'samples': 17813952, 'steps': 92780, 'loss/train': 1.48911714553833} 11/07/2021 10:17:46 - INFO - __main__ - Step 92782: {'lr': 0.00016281885596844812, 'samples': 17814144, 'steps': 92781, 'loss/train': 1.5964186191558838} 11/07/2021 10:17:46 - INFO - __main__ - Step 92783: {'lr': 0.00016281388237328998, 'samples': 17814336, 'steps': 92782, 'loss/train': 1.605318546295166} 11/07/2021 10:17:46 - INFO - __main__ - Step 92784: {'lr': 0.00016280890881741655, 'samples': 17814528, 'steps': 92783, 'loss/train': 1.2801539897918701} 11/07/2021 10:17:47 - INFO - __main__ - Step 92785: {'lr': 0.0001628039353008301, 'samples': 17814720, 'steps': 92784, 'loss/train': 0.9433056116104126} 11/07/2021 10:17:48 - INFO - __main__ - Step 92786: {'lr': 0.00016279896182353284, 'samples': 17814912, 'steps': 92785, 'loss/train': 1.4818699359893799} 11/07/2021 10:17:48 - INFO - __main__ - Step 92787: {'lr': 0.00016279398838552715, 'samples': 17815104, 'steps': 92786, 'loss/train': 0.5980170369148254} 11/07/2021 10:17:48 - INFO - __main__ - Step 92788: {'lr': 0.00016278901498681503, 'samples': 17815296, 'steps': 92787, 'loss/train': 1.213597297668457} 11/07/2021 10:17:49 - INFO - __main__ - Step 92789: {'lr': 0.00016278404162739879, 'samples': 17815488, 'steps': 92788, 'loss/train': 1.3475321531295776} 11/07/2021 10:17:49 - INFO - __main__ - Step 92790: {'lr': 0.00016277906830728078, 'samples': 17815680, 'steps': 92789, 'loss/train': 0.5627545714378357} 11/07/2021 10:17:51 - INFO - __main__ - Step 92791: {'lr': 0.00016277409502646312, 'samples': 17815872, 'steps': 92790, 'loss/train': 1.3474950790405273} 11/07/2021 10:17:51 - INFO - __main__ - Step 92792: {'lr': 0.00016276912178494812, 'samples': 17816064, 'steps': 92791, 'loss/train': 1.3974297046661377} 11/07/2021 10:17:51 - INFO - __main__ - Step 92793: {'lr': 0.00016276414858273802, 'samples': 17816256, 'steps': 92792, 'loss/train': 1.3000733852386475} 11/07/2021 10:17:52 - INFO - __main__ - Step 92794: {'lr': 0.0001627591754198351, 'samples': 17816448, 'steps': 92793, 'loss/train': 1.339980125427246} 11/07/2021 10:17:52 - INFO - __main__ - Step 92795: {'lr': 0.00016275420229624148, 'samples': 17816640, 'steps': 92794, 'loss/train': 1.060488224029541} 11/07/2021 10:17:53 - INFO - __main__ - Step 92796: {'lr': 0.00016274922921195948, 'samples': 17816832, 'steps': 92795, 'loss/train': 1.3935034275054932} 11/07/2021 10:17:53 - INFO - __main__ - Step 92797: {'lr': 0.00016274425616699133, 'samples': 17817024, 'steps': 92796, 'loss/train': 1.819888710975647} 11/07/2021 10:17:54 - INFO - __main__ - Step 92798: {'lr': 0.00016273928316133928, 'samples': 17817216, 'steps': 92797, 'loss/train': 1.029344916343689} 11/07/2021 10:17:54 - INFO - __main__ - Step 92799: {'lr': 0.00016273431019500558, 'samples': 17817408, 'steps': 92798, 'loss/train': 1.946974754333496} 11/07/2021 10:17:55 - INFO - __main__ - Step 92800: {'lr': 0.0001627293372679925, 'samples': 17817600, 'steps': 92799, 'loss/train': 1.8346058130264282} 11/07/2021 10:17:55 - INFO - __main__ - Step 92801: {'lr': 0.00016272436438030219, 'samples': 17817792, 'steps': 92800, 'loss/train': 1.2707542181015015} 11/07/2021 10:17:56 - INFO - __main__ - Step 92802: {'lr': 0.00016271939153193694, 'samples': 17817984, 'steps': 92801, 'loss/train': 1.4852718114852905} 11/07/2021 10:17:57 - INFO - __main__ - Step 92803: {'lr': 0.00016271441872289894, 'samples': 17818176, 'steps': 92802, 'loss/train': 1.150972604751587} 11/07/2021 10:17:57 - INFO - __main__ - Step 92804: {'lr': 0.0001627094459531905, 'samples': 17818368, 'steps': 92803, 'loss/train': 1.9169697761535645} 11/07/2021 10:17:57 - INFO - __main__ - Step 92805: {'lr': 0.00016270447322281383, 'samples': 17818560, 'steps': 92804, 'loss/train': 0.8607460260391235} 11/07/2021 10:17:58 - INFO - __main__ - Step 92806: {'lr': 0.00016269950053177118, 'samples': 17818752, 'steps': 92805, 'loss/train': 1.328283429145813} 11/07/2021 10:17:59 - INFO - __main__ - Step 92807: {'lr': 0.00016269452788006479, 'samples': 17818944, 'steps': 92806, 'loss/train': 0.5115980505943298} 11/07/2021 10:17:59 - INFO - __main__ - Step 92808: {'lr': 0.0001626895552676969, 'samples': 17819136, 'steps': 92807, 'loss/train': 1.5177621841430664} 11/07/2021 10:17:59 - INFO - __main__ - Step 92809: {'lr': 0.00016268458269466974, 'samples': 17819328, 'steps': 92808, 'loss/train': 1.7267208099365234} 11/07/2021 10:18:00 - INFO - __main__ - Step 92810: {'lr': 0.00016267961016098559, 'samples': 17819520, 'steps': 92809, 'loss/train': 0.9924872517585754} 11/07/2021 10:18:00 - INFO - __main__ - Step 92811: {'lr': 0.00016267463766664667, 'samples': 17819712, 'steps': 92810, 'loss/train': 5.679830551147461} 11/07/2021 10:18:01 - INFO - __main__ - Step 92812: {'lr': 0.00016266966521165518, 'samples': 17819904, 'steps': 92811, 'loss/train': 1.3945283889770508} 11/07/2021 10:18:02 - INFO - __main__ - Step 92813: {'lr': 0.00016266469279601337, 'samples': 17820096, 'steps': 92812, 'loss/train': 1.5279738903045654} 11/07/2021 10:18:02 - INFO - __main__ - Step 92814: {'lr': 0.0001626597204197236, 'samples': 17820288, 'steps': 92813, 'loss/train': 1.486737608909607} 11/07/2021 10:18:02 - INFO - __main__ - Step 92815: {'lr': 0.00016265474808278791, 'samples': 17820480, 'steps': 92814, 'loss/train': 1.555320143699646} 11/07/2021 10:18:03 - INFO - __main__ - Step 92816: {'lr': 0.00016264977578520868, 'samples': 17820672, 'steps': 92815, 'loss/train': 1.3335566520690918} 11/07/2021 10:18:03 - INFO - __main__ - Step 92817: {'lr': 0.0001626448035269881, 'samples': 17820864, 'steps': 92816, 'loss/train': 1.427019715309143} 11/07/2021 10:18:04 - INFO - __main__ - Step 92818: {'lr': 0.00016263983130812844, 'samples': 17821056, 'steps': 92817, 'loss/train': 1.5268683433532715} 11/07/2021 10:18:05 - INFO - __main__ - Step 92819: {'lr': 0.00016263485912863189, 'samples': 17821248, 'steps': 92818, 'loss/train': 1.4930006265640259} 11/07/2021 10:18:05 - INFO - __main__ - Step 92820: {'lr': 0.00016262988698850073, 'samples': 17821440, 'steps': 92819, 'loss/train': 0.9576245546340942} 11/07/2021 10:18:05 - INFO - __main__ - Step 92821: {'lr': 0.0001626249148877372, 'samples': 17821632, 'steps': 92820, 'loss/train': 1.5460155010223389} 11/07/2021 10:18:06 - INFO - __main__ - Step 92822: {'lr': 0.0001626199428263436, 'samples': 17821824, 'steps': 92821, 'loss/train': 0.7064564824104309} 11/07/2021 10:18:07 - INFO - __main__ - Step 92823: {'lr': 0.00016261497080432202, 'samples': 17822016, 'steps': 92822, 'loss/train': 1.49299156665802} 11/07/2021 10:18:07 - INFO - __main__ - Step 92824: {'lr': 0.0001626099988216748, 'samples': 17822208, 'steps': 92823, 'loss/train': 1.2293280363082886} 11/07/2021 10:18:08 - INFO - __main__ - Step 92825: {'lr': 0.00016260502687840423, 'samples': 17822400, 'steps': 92824, 'loss/train': 1.246734857559204} 11/07/2021 10:18:08 - INFO - __main__ - Step 92826: {'lr': 0.0001626000549745124, 'samples': 17822592, 'steps': 92825, 'loss/train': 0.7979704141616821} 11/07/2021 10:18:08 - INFO - __main__ - Step 92827: {'lr': 0.00016259508311000168, 'samples': 17822784, 'steps': 92826, 'loss/train': 1.4959642887115479} 11/07/2021 10:18:09 - INFO - __main__ - Step 92828: {'lr': 0.00016259011128487433, 'samples': 17822976, 'steps': 92827, 'loss/train': 1.1155470609664917} 11/07/2021 10:18:10 - INFO - __main__ - Step 92829: {'lr': 0.00016258513949913246, 'samples': 17823168, 'steps': 92828, 'loss/train': 1.1284643411636353} 11/07/2021 10:18:10 - INFO - __main__ - Step 92830: {'lr': 0.00016258016775277833, 'samples': 17823360, 'steps': 92829, 'loss/train': 1.6749136447906494} 11/07/2021 10:18:10 - INFO - __main__ - Step 92831: {'lr': 0.00016257519604581427, 'samples': 17823552, 'steps': 92830, 'loss/train': 1.0404325723648071} 11/07/2021 10:18:11 - INFO - __main__ - Step 92832: {'lr': 0.00016257022437824248, 'samples': 17823744, 'steps': 92831, 'loss/train': 1.458716630935669} 11/07/2021 10:18:12 - INFO - __main__ - Step 92833: {'lr': 0.00016256525275006525, 'samples': 17823936, 'steps': 92832, 'loss/train': 1.4676915407180786} 11/07/2021 10:18:12 - INFO - __main__ - Step 92834: {'lr': 0.0001625602811612847, 'samples': 17824128, 'steps': 92833, 'loss/train': 1.488869309425354} 11/07/2021 10:18:12 - INFO - __main__ - Step 92835: {'lr': 0.0001625553096119032, 'samples': 17824320, 'steps': 92834, 'loss/train': 0.8571949005126953} 11/07/2021 10:18:13 - INFO - __main__ - Step 92836: {'lr': 0.00016255033810192284, 'samples': 17824512, 'steps': 92835, 'loss/train': 0.719855010509491} 11/07/2021 10:18:13 - INFO - __main__ - Step 92837: {'lr': 0.000162545366631346, 'samples': 17824704, 'steps': 92836, 'loss/train': 1.4283756017684937} 11/07/2021 10:18:14 - INFO - __main__ - Step 92838: {'lr': 0.00016254039520017483, 'samples': 17824896, 'steps': 92837, 'loss/train': 0.6838464736938477} 11/07/2021 10:18:14 - INFO - __main__ - Step 92839: {'lr': 0.00016253542380841162, 'samples': 17825088, 'steps': 92838, 'loss/train': 1.4057925939559937} 11/07/2021 10:18:15 - INFO - __main__ - Step 92840: {'lr': 0.00016253045245605863, 'samples': 17825280, 'steps': 92839, 'loss/train': 1.5848804712295532} 11/07/2021 10:18:15 - INFO - __main__ - Step 92841: {'lr': 0.0001625254811431181, 'samples': 17825472, 'steps': 92840, 'loss/train': 1.4569823741912842} 11/07/2021 10:18:16 - INFO - __main__ - Step 92842: {'lr': 0.00016252050986959222, 'samples': 17825664, 'steps': 92841, 'loss/train': 1.8610998392105103} 11/07/2021 10:18:17 - INFO - __main__ - Step 92843: {'lr': 0.00016251553863548318, 'samples': 17825856, 'steps': 92842, 'loss/train': 1.3305528163909912} 11/07/2021 10:18:17 - INFO - __main__ - Step 92844: {'lr': 0.0001625105674407934, 'samples': 17826048, 'steps': 92843, 'loss/train': 1.1113053560256958} 11/07/2021 10:18:17 - INFO - __main__ - Step 92845: {'lr': 0.0001625055962855249, 'samples': 17826240, 'steps': 92844, 'loss/train': 1.3996248245239258} 11/07/2021 10:18:18 - INFO - __main__ - Step 92846: {'lr': 0.00016250062516968007, 'samples': 17826432, 'steps': 92845, 'loss/train': 1.4685817956924438} 11/07/2021 10:18:18 - INFO - __main__ - Step 92847: {'lr': 0.0001624956540932611, 'samples': 17826624, 'steps': 92846, 'loss/train': 1.5586543083190918} 11/07/2021 10:18:19 - INFO - __main__ - Step 92848: {'lr': 0.00016249068305627023, 'samples': 17826816, 'steps': 92847, 'loss/train': 1.4899991750717163} 11/07/2021 10:18:19 - INFO - __main__ - Step 92849: {'lr': 0.0001624857120587097, 'samples': 17827008, 'steps': 92848, 'loss/train': 1.5030750036239624} 11/07/2021 10:18:20 - INFO - __main__ - Step 92850: {'lr': 0.0001624807411005818, 'samples': 17827200, 'steps': 92849, 'loss/train': 1.3902033567428589} 11/07/2021 10:18:20 - INFO - __main__ - Step 92851: {'lr': 0.0001624757701818887, 'samples': 17827392, 'steps': 92850, 'loss/train': 1.1060237884521484} 11/07/2021 10:18:20 - INFO - __main__ - Step 92852: {'lr': 0.00016247079930263266, 'samples': 17827584, 'steps': 92851, 'loss/train': 0.9909173250198364} 11/07/2021 10:18:21 - INFO - __main__ - Step 92853: {'lr': 0.00016246582846281594, 'samples': 17827776, 'steps': 92852, 'loss/train': 1.1881532669067383} 11/07/2021 10:18:22 - INFO - __main__ - Step 92854: {'lr': 0.00016246085766244078, 'samples': 17827968, 'steps': 92853, 'loss/train': 1.3941295146942139} 11/07/2021 10:18:22 - INFO - __main__ - Step 92855: {'lr': 0.00016245588690150947, 'samples': 17828160, 'steps': 92854, 'loss/train': 1.5406414270401} 11/07/2021 10:18:23 - INFO - __main__ - Step 92856: {'lr': 0.00016245091618002412, 'samples': 17828352, 'steps': 92855, 'loss/train': 1.395022988319397} 11/07/2021 10:18:23 - INFO - __main__ - Step 92857: {'lr': 0.00016244594549798703, 'samples': 17828544, 'steps': 92856, 'loss/train': 1.5397053956985474} 11/07/2021 10:18:23 - INFO - __main__ - Step 92858: {'lr': 0.00016244097485540045, 'samples': 17828736, 'steps': 92857, 'loss/train': 1.4053821563720703} 11/07/2021 10:18:25 - INFO - __main__ - Step 92859: {'lr': 0.00016243600425226658, 'samples': 17828928, 'steps': 92858, 'loss/train': 1.3481957912445068} 11/07/2021 10:18:26 - INFO - __main__ - Step 92860: {'lr': 0.0001624310336885877, 'samples': 17829120, 'steps': 92859, 'loss/train': 1.3742564916610718} 11/07/2021 10:18:26 - INFO - __main__ - Step 92861: {'lr': 0.0001624260631643661, 'samples': 17829312, 'steps': 92860, 'loss/train': 2.002256155014038} 11/07/2021 10:18:26 - INFO - __main__ - Step 92862: {'lr': 0.0001624210926796039, 'samples': 17829504, 'steps': 92861, 'loss/train': 1.4486231803894043} 11/07/2021 10:18:27 - INFO - __main__ - Step 92863: {'lr': 0.00016241612223430343, 'samples': 17829696, 'steps': 92862, 'loss/train': 1.460534691810608} 11/07/2021 10:18:27 - INFO - __main__ - Step 92864: {'lr': 0.00016241115182846687, 'samples': 17829888, 'steps': 92863, 'loss/train': 1.4939453601837158} 11/07/2021 10:18:27 - INFO - __main__ - Step 92865: {'lr': 0.00016240618146209657, 'samples': 17830080, 'steps': 92864, 'loss/train': 1.4449938535690308} 11/07/2021 10:18:29 - INFO - __main__ - Step 92866: {'lr': 0.00016240121113519462, 'samples': 17830272, 'steps': 92865, 'loss/train': 1.7392463684082031} 11/07/2021 10:18:29 - INFO - __main__ - Step 92867: {'lr': 0.0001623962408477634, 'samples': 17830464, 'steps': 92866, 'loss/train': 0.8913626074790955} 11/07/2021 10:18:29 - INFO - __main__ - Step 92868: {'lr': 0.00016239127059980513, 'samples': 17830656, 'steps': 92867, 'loss/train': 1.7138371467590332} 11/07/2021 10:18:30 - INFO - __main__ - Step 92869: {'lr': 0.00016238630039132194, 'samples': 17830848, 'steps': 92868, 'loss/train': 1.3301124572753906} 11/07/2021 10:18:30 - INFO - __main__ - Step 92870: {'lr': 0.00016238133022231611, 'samples': 17831040, 'steps': 92869, 'loss/train': 0.8126034736633301} 11/07/2021 10:18:31 - INFO - __main__ - Step 92871: {'lr': 0.0001623763600927899, 'samples': 17831232, 'steps': 92870, 'loss/train': 1.0261448621749878} 11/07/2021 10:18:31 - INFO - __main__ - Step 92872: {'lr': 0.00016237139000274553, 'samples': 17831424, 'steps': 92871, 'loss/train': 1.1925098896026611} 11/07/2021 10:18:32 - INFO - __main__ - Step 92873: {'lr': 0.0001623664199521853, 'samples': 17831616, 'steps': 92872, 'loss/train': 1.7041815519332886} 11/07/2021 10:18:32 - INFO - __main__ - Step 92874: {'lr': 0.0001623614499411114, 'samples': 17831808, 'steps': 92873, 'loss/train': 1.357712984085083} 11/07/2021 10:18:32 - INFO - __main__ - Step 92875: {'lr': 0.00016235647996952604, 'samples': 17832000, 'steps': 92874, 'loss/train': 1.530866265296936} 11/07/2021 10:18:33 - INFO - __main__ - Step 92876: {'lr': 0.00016235151003743154, 'samples': 17832192, 'steps': 92875, 'loss/train': 1.4797018766403198} 11/07/2021 10:18:34 - INFO - __main__ - Step 92877: {'lr': 0.00016234654014483008, 'samples': 17832384, 'steps': 92876, 'loss/train': 1.4827096462249756} 11/07/2021 10:18:34 - INFO - __main__ - Step 92878: {'lr': 0.00016234157029172393, 'samples': 17832576, 'steps': 92877, 'loss/train': 1.6072406768798828} 11/07/2021 10:18:35 - INFO - __main__ - Step 92879: {'lr': 0.00016233660047811527, 'samples': 17832768, 'steps': 92878, 'loss/train': 1.5430214405059814} 11/07/2021 10:18:35 - INFO - __main__ - Step 92880: {'lr': 0.00016233163070400642, 'samples': 17832960, 'steps': 92879, 'loss/train': 1.4540852308273315} 11/07/2021 10:18:35 - INFO - __main__ - Step 92881: {'lr': 0.00016232666096939967, 'samples': 17833152, 'steps': 92880, 'loss/train': 1.8990850448608398} 11/07/2021 10:18:36 - INFO - __main__ - Step 92882: {'lr': 0.0001623216912742971, 'samples': 17833344, 'steps': 92881, 'loss/train': 1.4505845308303833} 11/07/2021 10:18:37 - INFO - __main__ - Step 92883: {'lr': 0.00016231672161870104, 'samples': 17833536, 'steps': 92882, 'loss/train': 1.4914673566818237} 11/07/2021 10:18:37 - INFO - __main__ - Step 92884: {'lr': 0.00016231175200261366, 'samples': 17833728, 'steps': 92883, 'loss/train': 1.216086506843567} 11/07/2021 10:18:37 - INFO - __main__ - Step 92885: {'lr': 0.00016230678242603726, 'samples': 17833920, 'steps': 92884, 'loss/train': 1.4403135776519775} 11/07/2021 10:18:38 - INFO - __main__ - Step 92886: {'lr': 0.0001623018128889741, 'samples': 17834112, 'steps': 92885, 'loss/train': 0.417248010635376} 11/07/2021 10:18:39 - INFO - __main__ - Step 92887: {'lr': 0.00016229684339142636, 'samples': 17834304, 'steps': 92886, 'loss/train': 1.6472203731536865} 11/07/2021 10:18:39 - INFO - __main__ - Step 92888: {'lr': 0.00016229187393339633, 'samples': 17834496, 'steps': 92887, 'loss/train': 1.298148512840271} 11/07/2021 10:18:40 - INFO - __main__ - Step 92889: {'lr': 0.0001622869045148862, 'samples': 17834688, 'steps': 92888, 'loss/train': 1.2257797718048096} 11/07/2021 10:18:40 - INFO - __main__ - Step 92890: {'lr': 0.00016228193513589828, 'samples': 17834880, 'steps': 92889, 'loss/train': 1.7892842292785645} 11/07/2021 10:18:40 - INFO - __main__ - Step 92891: {'lr': 0.00016227696579643476, 'samples': 17835072, 'steps': 92890, 'loss/train': 0.9079868197441101} 11/07/2021 10:18:41 - INFO - __main__ - Step 92892: {'lr': 0.00016227199649649786, 'samples': 17835264, 'steps': 92891, 'loss/train': 0.9882387518882751} 11/07/2021 10:18:42 - INFO - __main__ - Step 92893: {'lr': 0.00016226702723608983, 'samples': 17835456, 'steps': 92892, 'loss/train': 1.5245585441589355} 11/07/2021 10:18:42 - INFO - __main__ - Step 92894: {'lr': 0.00016226205801521295, 'samples': 17835648, 'steps': 92893, 'loss/train': 1.6853054761886597} 11/07/2021 10:18:42 - INFO - __main__ - Step 92895: {'lr': 0.00016225708883386956, 'samples': 17835840, 'steps': 92894, 'loss/train': 1.3022401332855225} 11/07/2021 10:18:43 - INFO - __main__ - Step 92896: {'lr': 0.00016225211969206165, 'samples': 17836032, 'steps': 92895, 'loss/train': 1.2802977561950684} 11/07/2021 10:18:43 - INFO - __main__ - Step 92897: {'lr': 0.00016224715058979155, 'samples': 17836224, 'steps': 92896, 'loss/train': 1.5690721273422241} 11/07/2021 10:18:44 - INFO - __main__ - Step 92898: {'lr': 0.00016224218152706155, 'samples': 17836416, 'steps': 92897, 'loss/train': 1.2642903327941895} 11/07/2021 10:18:45 - INFO - __main__ - Step 92899: {'lr': 0.00016223721250387387, 'samples': 17836608, 'steps': 92898, 'loss/train': 0.9778096675872803} 11/07/2021 10:18:45 - INFO - __main__ - Step 92900: {'lr': 0.00016223224352023076, 'samples': 17836800, 'steps': 92899, 'loss/train': 2.0601091384887695} 11/07/2021 10:18:45 - INFO - __main__ - Step 92901: {'lr': 0.00016222727457613446, 'samples': 17836992, 'steps': 92900, 'loss/train': 1.3706285953521729} 11/07/2021 10:18:46 - INFO - __main__ - Step 92902: {'lr': 0.00016222230567158714, 'samples': 17837184, 'steps': 92901, 'loss/train': 1.3850393295288086} 11/07/2021 10:18:47 - INFO - __main__ - Step 92903: {'lr': 0.00016221733680659112, 'samples': 17837376, 'steps': 92902, 'loss/train': 1.5439468622207642} 11/07/2021 10:18:47 - INFO - __main__ - Step 92904: {'lr': 0.00016221236798114863, 'samples': 17837568, 'steps': 92903, 'loss/train': 1.0731463432312012} 11/07/2021 10:18:47 - INFO - __main__ - Step 92905: {'lr': 0.0001622073991952619, 'samples': 17837760, 'steps': 92904, 'loss/train': 1.6464070081710815} 11/07/2021 10:18:48 - INFO - __main__ - Step 92906: {'lr': 0.00016220243044893313, 'samples': 17837952, 'steps': 92905, 'loss/train': 1.401822566986084} 11/07/2021 10:18:48 - INFO - __main__ - Step 92907: {'lr': 0.0001621974617421646, 'samples': 17838144, 'steps': 92906, 'loss/train': 1.4153062105178833} 11/07/2021 10:18:48 - INFO - __main__ - Step 92908: {'lr': 0.00016219249307495865, 'samples': 17838336, 'steps': 92907, 'loss/train': 1.2426490783691406} 11/07/2021 10:18:49 - INFO - __main__ - Step 92909: {'lr': 0.00016218752444731733, 'samples': 17838528, 'steps': 92908, 'loss/train': 1.6574105024337769} 11/07/2021 10:18:50 - INFO - __main__ - Step 92910: {'lr': 0.0001621825558592429, 'samples': 17838720, 'steps': 92909, 'loss/train': 1.3860893249511719} 11/07/2021 10:18:50 - INFO - __main__ - Step 92911: {'lr': 0.00016217758731073767, 'samples': 17838912, 'steps': 92910, 'loss/train': 1.2182420492172241} 11/07/2021 10:18:51 - INFO - __main__ - Step 92912: {'lr': 0.00016217261880180388, 'samples': 17839104, 'steps': 92911, 'loss/train': 1.2190256118774414} 11/07/2021 10:18:51 - INFO - __main__ - Step 92913: {'lr': 0.00016216765033244377, 'samples': 17839296, 'steps': 92912, 'loss/train': 1.1290414333343506} 11/07/2021 10:18:52 - INFO - __main__ - Step 92914: {'lr': 0.00016216268190265954, 'samples': 17839488, 'steps': 92913, 'loss/train': 1.4379740953445435} 11/07/2021 10:18:52 - INFO - __main__ - Step 92915: {'lr': 0.00016215771351245345, 'samples': 17839680, 'steps': 92914, 'loss/train': 1.5724648237228394} 11/07/2021 10:18:53 - INFO - __main__ - Step 92916: {'lr': 0.00016215274516182774, 'samples': 17839872, 'steps': 92915, 'loss/train': 1.806044340133667} 11/07/2021 10:18:53 - INFO - __main__ - Step 92917: {'lr': 0.00016214777685078465, 'samples': 17840064, 'steps': 92916, 'loss/train': 1.664777159690857} 11/07/2021 10:18:53 - INFO - __main__ - Step 92918: {'lr': 0.0001621428085793264, 'samples': 17840256, 'steps': 92917, 'loss/train': 1.0522607564926147} 11/07/2021 10:18:54 - INFO - __main__ - Step 92919: {'lr': 0.00016213784034745527, 'samples': 17840448, 'steps': 92918, 'loss/train': 1.3633586168289185} 11/07/2021 10:18:55 - INFO - __main__ - Step 92920: {'lr': 0.00016213287215517347, 'samples': 17840640, 'steps': 92919, 'loss/train': 1.2806609869003296} 11/07/2021 10:18:55 - INFO - __main__ - Step 92921: {'lr': 0.00016212790400248322, 'samples': 17840832, 'steps': 92920, 'loss/train': 1.7371959686279297} 11/07/2021 10:18:55 - INFO - __main__ - Step 92922: {'lr': 0.0001621229358893869, 'samples': 17841024, 'steps': 92921, 'loss/train': 1.5513715744018555} 11/07/2021 10:18:56 - INFO - __main__ - Step 92923: {'lr': 0.0001621179678158865, 'samples': 17841216, 'steps': 92922, 'loss/train': 1.4913617372512817} 11/07/2021 10:18:57 - INFO - __main__ - Step 92924: {'lr': 0.00016211299978198442, 'samples': 17841408, 'steps': 92923, 'loss/train': 1.3323869705200195} 11/07/2021 10:18:57 - INFO - __main__ - Step 92925: {'lr': 0.00016210803178768286, 'samples': 17841600, 'steps': 92924, 'loss/train': 1.0679610967636108} 11/07/2021 10:18:58 - INFO - __main__ - Step 92926: {'lr': 0.00016210306383298407, 'samples': 17841792, 'steps': 92925, 'loss/train': 0.8566097617149353} 11/07/2021 10:18:58 - INFO - __main__ - Step 92927: {'lr': 0.00016209809591789025, 'samples': 17841984, 'steps': 92926, 'loss/train': 1.1816767454147339} 11/07/2021 10:18:58 - INFO - __main__ - Step 92928: {'lr': 0.00016209312804240373, 'samples': 17842176, 'steps': 92927, 'loss/train': 1.2649742364883423} 11/07/2021 10:18:59 - INFO - __main__ - Step 92929: {'lr': 0.00016208816020652663, 'samples': 17842368, 'steps': 92928, 'loss/train': 1.4666517972946167} 11/07/2021 10:19:00 - INFO - __main__ - Step 92930: {'lr': 0.0001620831924102613, 'samples': 17842560, 'steps': 92929, 'loss/train': 1.5329163074493408} 11/07/2021 10:19:00 - INFO - __main__ - Step 92931: {'lr': 0.00016207822465360989, 'samples': 17842752, 'steps': 92930, 'loss/train': 1.5269571542739868} 11/07/2021 10:19:00 - INFO - __main__ - Step 92932: {'lr': 0.00016207325693657468, 'samples': 17842944, 'steps': 92931, 'loss/train': 1.416822910308838} 11/07/2021 10:19:01 - INFO - __main__ - Step 92933: {'lr': 0.0001620682892591579, 'samples': 17843136, 'steps': 92932, 'loss/train': 1.3624845743179321} 11/07/2021 10:19:02 - INFO - __main__ - Step 92934: {'lr': 0.00016206332162136186, 'samples': 17843328, 'steps': 92933, 'loss/train': 1.3773221969604492} 11/07/2021 10:19:02 - INFO - __main__ - Step 92935: {'lr': 0.00016205835402318875, 'samples': 17843520, 'steps': 92934, 'loss/train': 1.1736692190170288} 11/07/2021 10:19:02 - INFO - __main__ - Step 92936: {'lr': 0.00016205338646464067, 'samples': 17843712, 'steps': 92935, 'loss/train': 1.2422776222229004} 11/07/2021 10:19:03 - INFO - __main__ - Step 92937: {'lr': 0.00016204841894572003, 'samples': 17843904, 'steps': 92936, 'loss/train': 1.5072143077850342} 11/07/2021 10:19:03 - INFO - __main__ - Step 92938: {'lr': 0.00016204345146642903, 'samples': 17844096, 'steps': 92937, 'loss/train': 1.4998687505722046} 11/07/2021 10:19:04 - INFO - __main__ - Step 92939: {'lr': 0.00016203848402676985, 'samples': 17844288, 'steps': 92938, 'loss/train': 1.3841971158981323} 11/07/2021 10:19:05 - INFO - __main__ - Step 92940: {'lr': 0.0001620335166267448, 'samples': 17844480, 'steps': 92939, 'loss/train': 1.1420154571533203} 11/07/2021 10:19:05 - INFO - __main__ - Step 92941: {'lr': 0.00016202854926635607, 'samples': 17844672, 'steps': 92940, 'loss/train': 1.8408173322677612} 11/07/2021 10:19:05 - INFO - __main__ - Step 92942: {'lr': 0.0001620235819456059, 'samples': 17844864, 'steps': 92941, 'loss/train': 1.2709194421768188} 11/07/2021 10:19:06 - INFO - __main__ - Step 92943: {'lr': 0.00016201861466449657, 'samples': 17845056, 'steps': 92942, 'loss/train': 1.299153208732605} 11/07/2021 10:19:06 - INFO - __main__ - Step 92944: {'lr': 0.00016201364742303033, 'samples': 17845248, 'steps': 92943, 'loss/train': 1.2188100814819336} 11/07/2021 10:19:07 - INFO - __main__ - Step 92945: {'lr': 0.0001620086802212094, 'samples': 17845440, 'steps': 92944, 'loss/train': 1.184290885925293} 11/07/2021 10:19:07 - INFO - __main__ - Step 92946: {'lr': 0.00016200371305903594, 'samples': 17845632, 'steps': 92945, 'loss/train': 1.1842875480651855} 11/07/2021 10:19:08 - INFO - __main__ - Step 92947: {'lr': 0.00016199874593651227, 'samples': 17845824, 'steps': 92946, 'loss/train': 1.4095810651779175} 11/07/2021 10:19:08 - INFO - __main__ - Step 92948: {'lr': 0.00016199377885364058, 'samples': 17846016, 'steps': 92947, 'loss/train': 1.5433671474456787} 11/07/2021 10:19:08 - INFO - __main__ - Step 92949: {'lr': 0.00016198881181042323, 'samples': 17846208, 'steps': 92948, 'loss/train': 1.6563295125961304} 11/07/2021 10:19:09 - INFO - __main__ - Step 92950: {'lr': 0.00016198384480686228, 'samples': 17846400, 'steps': 92949, 'loss/train': 1.7988306283950806} 11/07/2021 10:19:10 - INFO - __main__ - Step 92951: {'lr': 0.00016197887784296007, 'samples': 17846592, 'steps': 92950, 'loss/train': 1.4312572479248047} 11/07/2021 10:19:10 - INFO - __main__ - Step 92952: {'lr': 0.00016197391091871878, 'samples': 17846784, 'steps': 92951, 'loss/train': 1.3392612934112549} 11/07/2021 10:19:10 - INFO - __main__ - Step 92953: {'lr': 0.00016196894403414073, 'samples': 17846976, 'steps': 92952, 'loss/train': 1.3472697734832764} 11/07/2021 10:19:11 - INFO - __main__ - Step 92954: {'lr': 0.0001619639771892281, 'samples': 17847168, 'steps': 92953, 'loss/train': 1.2514128684997559} 11/07/2021 10:19:12 - INFO - __main__ - Step 92955: {'lr': 0.00016195901038398313, 'samples': 17847360, 'steps': 92954, 'loss/train': 0.8364883661270142} 11/07/2021 10:19:12 - INFO - __main__ - Step 92956: {'lr': 0.00016195404361840816, 'samples': 17847552, 'steps': 92955, 'loss/train': 1.5237141847610474} 11/07/2021 10:19:13 - INFO - __main__ - Step 92957: {'lr': 0.00016194907689250524, 'samples': 17847744, 'steps': 92956, 'loss/train': 1.5218851566314697} 11/07/2021 10:19:13 - INFO - __main__ - Step 92958: {'lr': 0.00016194411020627674, 'samples': 17847936, 'steps': 92957, 'loss/train': 0.8968916535377502} 11/07/2021 10:19:13 - INFO - __main__ - Step 92959: {'lr': 0.00016193914355972484, 'samples': 17848128, 'steps': 92958, 'loss/train': 1.3574445247650146} 11/07/2021 10:19:14 - INFO - __main__ - Step 92960: {'lr': 0.00016193417695285184, 'samples': 17848320, 'steps': 92959, 'loss/train': 1.0571173429489136} 11/07/2021 10:19:15 - INFO - __main__ - Step 92961: {'lr': 0.0001619292103856599, 'samples': 17848512, 'steps': 92960, 'loss/train': 1.3229669332504272} 11/07/2021 10:19:15 - INFO - __main__ - Step 92962: {'lr': 0.0001619242438581514, 'samples': 17848704, 'steps': 92961, 'loss/train': 1.5215636491775513} 11/07/2021 10:19:15 - INFO - __main__ - Step 92963: {'lr': 0.00016191927737032834, 'samples': 17848896, 'steps': 92962, 'loss/train': 1.4470242261886597} 11/07/2021 10:19:16 - INFO - __main__ - Step 92964: {'lr': 0.00016191431092219317, 'samples': 17849088, 'steps': 92963, 'loss/train': 1.4357631206512451} 11/07/2021 10:19:16 - INFO - __main__ - Step 92965: {'lr': 0.00016190934451374805, 'samples': 17849280, 'steps': 92964, 'loss/train': 1.323939561843872} 11/07/2021 10:19:17 - INFO - __main__ - Step 92966: {'lr': 0.0001619043781449952, 'samples': 17849472, 'steps': 92965, 'loss/train': 1.7054774761199951} 11/07/2021 10:19:17 - INFO - __main__ - Step 92967: {'lr': 0.00016189941181593692, 'samples': 17849664, 'steps': 92966, 'loss/train': 1.6834040880203247} 11/07/2021 10:19:18 - INFO - __main__ - Step 92968: {'lr': 0.0001618944455265754, 'samples': 17849856, 'steps': 92967, 'loss/train': 0.8403952717781067} 11/07/2021 10:19:18 - INFO - __main__ - Step 92969: {'lr': 0.00016188947927691283, 'samples': 17850048, 'steps': 92968, 'loss/train': 1.6030714511871338} 11/07/2021 10:19:18 - INFO - __main__ - Step 92970: {'lr': 0.00016188451306695152, 'samples': 17850240, 'steps': 92969, 'loss/train': 1.3554037809371948} 11/07/2021 10:19:20 - INFO - __main__ - Step 92971: {'lr': 0.00016187954689669368, 'samples': 17850432, 'steps': 92970, 'loss/train': 1.3548563718795776} 11/07/2021 10:19:20 - INFO - __main__ - Step 92972: {'lr': 0.0001618745807661416, 'samples': 17850624, 'steps': 92971, 'loss/train': 1.1482828855514526} 11/07/2021 10:19:20 - INFO - __main__ - Step 92973: {'lr': 0.0001618696146752974, 'samples': 17850816, 'steps': 92972, 'loss/train': 1.1762728691101074} 11/07/2021 10:19:21 - INFO - __main__ - Step 92974: {'lr': 0.00016186464862416345, 'samples': 17851008, 'steps': 92973, 'loss/train': 1.7850512266159058} 11/07/2021 10:19:21 - INFO - __main__ - Step 92975: {'lr': 0.0001618596826127419, 'samples': 17851200, 'steps': 92974, 'loss/train': 1.2756291627883911} 11/07/2021 10:19:22 - INFO - __main__ - Step 92976: {'lr': 0.00016185471664103507, 'samples': 17851392, 'steps': 92975, 'loss/train': 1.6932138204574585} 11/07/2021 10:19:22 - INFO - __main__ - Step 92977: {'lr': 0.00016184975070904513, 'samples': 17851584, 'steps': 92976, 'loss/train': 1.3656402826309204} 11/07/2021 10:19:23 - INFO - __main__ - Step 92978: {'lr': 0.00016184478481677433, 'samples': 17851776, 'steps': 92977, 'loss/train': 1.1779016256332397} 11/07/2021 10:19:23 - INFO - __main__ - Step 92979: {'lr': 0.0001618398189642249, 'samples': 17851968, 'steps': 92978, 'loss/train': 1.2861199378967285} 11/07/2021 10:19:23 - INFO - __main__ - Step 92980: {'lr': 0.00016183485315139905, 'samples': 17852160, 'steps': 92979, 'loss/train': 1.7166807651519775} 11/07/2021 10:19:24 - INFO - __main__ - Step 92981: {'lr': 0.00016182988737829907, 'samples': 17852352, 'steps': 92980, 'loss/train': 1.2896772623062134} 11/07/2021 10:19:25 - INFO - __main__ - Step 92982: {'lr': 0.00016182492164492718, 'samples': 17852544, 'steps': 92981, 'loss/train': 1.141157627105713} 11/07/2021 10:19:25 - INFO - __main__ - Step 92983: {'lr': 0.00016181995595128564, 'samples': 17852736, 'steps': 92982, 'loss/train': 1.472124695777893} 11/07/2021 10:19:25 - INFO - __main__ - Step 92984: {'lr': 0.0001618149902973767, 'samples': 17852928, 'steps': 92983, 'loss/train': 1.271074891090393} 11/07/2021 10:19:26 - INFO - __main__ - Step 92985: {'lr': 0.0001618100246832025, 'samples': 17853120, 'steps': 92984, 'loss/train': 1.532159686088562} 11/07/2021 10:19:27 - INFO - __main__ - Step 92986: {'lr': 0.00016180505910876533, 'samples': 17853312, 'steps': 92985, 'loss/train': 0.9026973247528076} 11/07/2021 10:19:27 - INFO - __main__ - Step 92987: {'lr': 0.0001618000935740675, 'samples': 17853504, 'steps': 92986, 'loss/train': 1.4743869304656982} 11/07/2021 10:19:28 - INFO - __main__ - Step 92988: {'lr': 0.00016179512807911112, 'samples': 17853696, 'steps': 92987, 'loss/train': 1.229438066482544} 11/07/2021 10:19:28 - INFO - __main__ - Step 92989: {'lr': 0.00016179016262389865, 'samples': 17853888, 'steps': 92988, 'loss/train': 1.3046929836273193} 11/07/2021 10:19:28 - INFO - __main__ - Step 92990: {'lr': 0.00016178519720843205, 'samples': 17854080, 'steps': 92989, 'loss/train': 1.1750215291976929} 11/07/2021 10:19:29 - INFO - __main__ - Step 92991: {'lr': 0.00016178023183271368, 'samples': 17854272, 'steps': 92990, 'loss/train': 1.3834187984466553} 11/07/2021 10:19:30 - INFO - __main__ - Step 92992: {'lr': 0.00016177526649674577, 'samples': 17854464, 'steps': 92991, 'loss/train': 0.8220775723457336} 11/07/2021 10:19:30 - INFO - __main__ - Step 92993: {'lr': 0.0001617703012005306, 'samples': 17854656, 'steps': 92992, 'loss/train': 0.5831640362739563} 11/07/2021 10:19:30 - INFO - __main__ - Step 92994: {'lr': 0.00016176533594407033, 'samples': 17854848, 'steps': 92993, 'loss/train': 1.179703712463379} 11/07/2021 10:19:31 - INFO - __main__ - Step 92995: {'lr': 0.00016176037072736723, 'samples': 17855040, 'steps': 92994, 'loss/train': 1.2740851640701294} 11/07/2021 10:19:32 - INFO - __main__ - Step 92996: {'lr': 0.00016175540555042356, 'samples': 17855232, 'steps': 92995, 'loss/train': 1.395777702331543} 11/07/2021 10:19:32 - INFO - __main__ - Step 92997: {'lr': 0.00016175044041324155, 'samples': 17855424, 'steps': 92996, 'loss/train': 1.7764631509780884} 11/07/2021 10:19:33 - INFO - __main__ - Step 92998: {'lr': 0.00016174547531582346, 'samples': 17855616, 'steps': 92997, 'loss/train': 0.57564777135849} 11/07/2021 10:19:33 - INFO - __main__ - Step 92999: {'lr': 0.00016174051025817144, 'samples': 17855808, 'steps': 92998, 'loss/train': 1.218440294265747} 11/07/2021 10:19:33 - INFO - __main__ - Step 93000: {'lr': 0.00016173554524028782, 'samples': 17856000, 'steps': 92999, 'loss/train': 1.3518935441970825} 11/07/2021 10:19:34 - INFO - __main__ - Step 93001: {'lr': 0.0001617305802621748, 'samples': 17856192, 'steps': 93000, 'loss/train': 1.5502358675003052} 11/07/2021 10:19:35 - INFO - __main__ - Step 93002: {'lr': 0.0001617256153238347, 'samples': 17856384, 'steps': 93001, 'loss/train': 1.0777392387390137} 11/07/2021 10:19:35 - INFO - __main__ - Step 93003: {'lr': 0.0001617206504252696, 'samples': 17856576, 'steps': 93002, 'loss/train': 1.2502235174179077} 11/07/2021 10:19:35 - INFO - __main__ - Step 93004: {'lr': 0.00016171568556648178, 'samples': 17856768, 'steps': 93003, 'loss/train': 1.278792142868042} 11/07/2021 10:19:36 - INFO - __main__ - Step 93005: {'lr': 0.00016171072074747353, 'samples': 17856960, 'steps': 93004, 'loss/train': 1.5784331560134888} 11/07/2021 10:19:36 - INFO - __main__ - Step 93006: {'lr': 0.00016170575596824704, 'samples': 17857152, 'steps': 93005, 'loss/train': 0.9971768260002136} 11/07/2021 10:19:37 - INFO - __main__ - Step 93007: {'lr': 0.00016170079122880462, 'samples': 17857344, 'steps': 93006, 'loss/train': 1.4871909618377686} 11/07/2021 10:19:37 - INFO - __main__ - Step 93008: {'lr': 0.00016169582652914843, 'samples': 17857536, 'steps': 93007, 'loss/train': 1.1553765535354614} 11/07/2021 10:19:38 - INFO - __main__ - Step 93009: {'lr': 0.00016169086186928076, 'samples': 17857728, 'steps': 93008, 'loss/train': 1.2633436918258667} 11/07/2021 10:19:38 - INFO - __main__ - Step 93010: {'lr': 0.0001616858972492038, 'samples': 17857920, 'steps': 93009, 'loss/train': 1.1485201120376587} 11/07/2021 10:19:38 - INFO - __main__ - Step 93011: {'lr': 0.00016168093266891983, 'samples': 17858112, 'steps': 93010, 'loss/train': 1.3982962369918823} 11/07/2021 10:19:39 - INFO - __main__ - Step 93012: {'lr': 0.00016167596812843106, 'samples': 17858304, 'steps': 93011, 'loss/train': 1.5986104011535645} 11/07/2021 10:19:40 - INFO - __main__ - Step 93013: {'lr': 0.00016167100362773974, 'samples': 17858496, 'steps': 93012, 'loss/train': 1.2741353511810303} 11/07/2021 10:19:40 - INFO - __main__ - Step 93014: {'lr': 0.0001616660391668481, 'samples': 17858688, 'steps': 93013, 'loss/train': 1.3574298620224} 11/07/2021 10:19:41 - INFO - __main__ - Step 93015: {'lr': 0.0001616610747457584, 'samples': 17858880, 'steps': 93014, 'loss/train': 1.3594958782196045} 11/07/2021 10:19:41 - INFO - __main__ - Step 93016: {'lr': 0.00016165611036447292, 'samples': 17859072, 'steps': 93015, 'loss/train': 1.4348057508468628} 11/07/2021 10:19:42 - INFO - __main__ - Step 93017: {'lr': 0.00016165114602299373, 'samples': 17859264, 'steps': 93016, 'loss/train': 1.2801979780197144} 11/07/2021 10:19:42 - INFO - __main__ - Step 93018: {'lr': 0.00016164618172132323, 'samples': 17859456, 'steps': 93017, 'loss/train': 1.6087967157363892} 11/07/2021 10:19:42 - INFO - __main__ - Step 93019: {'lr': 0.00016164121745946354, 'samples': 17859648, 'steps': 93018, 'loss/train': 1.546528935432434} 11/07/2021 10:19:43 - INFO - __main__ - Step 93020: {'lr': 0.00016163625323741698, 'samples': 17859840, 'steps': 93019, 'loss/train': 1.5406123399734497} 11/07/2021 10:19:43 - INFO - __main__ - Step 93021: {'lr': 0.00016163128905518576, 'samples': 17860032, 'steps': 93020, 'loss/train': 1.2118154764175415} 11/07/2021 10:19:44 - INFO - __main__ - Step 93022: {'lr': 0.0001616263249127721, 'samples': 17860224, 'steps': 93021, 'loss/train': 1.310768485069275} 11/07/2021 10:19:45 - INFO - __main__ - Step 93023: {'lr': 0.00016162136081017826, 'samples': 17860416, 'steps': 93022, 'loss/train': 1.435797095298767} 11/07/2021 10:19:45 - INFO - __main__ - Step 93024: {'lr': 0.00016161639674740647, 'samples': 17860608, 'steps': 93023, 'loss/train': 1.3541432619094849} 11/07/2021 10:19:45 - INFO - __main__ - Step 93025: {'lr': 0.000161611432724459, 'samples': 17860800, 'steps': 93024, 'loss/train': 1.518418788909912} 11/07/2021 10:19:46 - INFO - __main__ - Step 93026: {'lr': 0.000161606468741338, 'samples': 17860992, 'steps': 93025, 'loss/train': 1.7321072816848755} 11/07/2021 10:19:47 - INFO - __main__ - Step 93027: {'lr': 0.0001616015047980458, 'samples': 17861184, 'steps': 93026, 'loss/train': 2.5479307174682617} 11/07/2021 10:19:47 - INFO - __main__ - Step 93028: {'lr': 0.0001615965408945846, 'samples': 17861376, 'steps': 93027, 'loss/train': 1.0538058280944824} 11/07/2021 10:19:47 - INFO - __main__ - Step 93029: {'lr': 0.00016159157703095673, 'samples': 17861568, 'steps': 93028, 'loss/train': 1.1023133993148804} 11/07/2021 10:19:48 - INFO - __main__ - Step 93030: {'lr': 0.0001615866132071642, 'samples': 17861760, 'steps': 93029, 'loss/train': 1.745273470878601} 11/07/2021 10:19:48 - INFO - __main__ - Step 93031: {'lr': 0.0001615816494232094, 'samples': 17861952, 'steps': 93030, 'loss/train': 1.4695215225219727} 11/07/2021 10:19:49 - INFO - __main__ - Step 93032: {'lr': 0.00016157668567909456, 'samples': 17862144, 'steps': 93031, 'loss/train': 1.2778397798538208} 11/07/2021 10:19:50 - INFO - __main__ - Step 93033: {'lr': 0.0001615717219748219, 'samples': 17862336, 'steps': 93032, 'loss/train': 1.149406909942627} 11/07/2021 10:19:50 - INFO - __main__ - Step 93034: {'lr': 0.00016156675831039362, 'samples': 17862528, 'steps': 93033, 'loss/train': 1.1072328090667725} 11/07/2021 10:19:50 - INFO - __main__ - Step 93035: {'lr': 0.000161561794685812, 'samples': 17862720, 'steps': 93034, 'loss/train': 1.645426869392395} 11/07/2021 10:19:51 - INFO - __main__ - Step 93036: {'lr': 0.0001615568311010793, 'samples': 17862912, 'steps': 93035, 'loss/train': 1.5808169841766357} 11/07/2021 10:19:52 - INFO - __main__ - Step 93037: {'lr': 0.0001615518675561977, 'samples': 17863104, 'steps': 93036, 'loss/train': 1.5621371269226074} 11/07/2021 10:19:52 - INFO - __main__ - Step 93038: {'lr': 0.0001615469040511695, 'samples': 17863296, 'steps': 93037, 'loss/train': 1.5319204330444336} 11/07/2021 10:19:52 - INFO - __main__ - Step 93039: {'lr': 0.00016154194058599686, 'samples': 17863488, 'steps': 93038, 'loss/train': 1.589982509613037} 11/07/2021 10:19:53 - INFO - __main__ - Step 93040: {'lr': 0.00016153697716068212, 'samples': 17863680, 'steps': 93039, 'loss/train': 1.6344257593154907} 11/07/2021 10:19:53 - INFO - __main__ - Step 93041: {'lr': 0.0001615320137752274, 'samples': 17863872, 'steps': 93040, 'loss/train': 1.154547929763794} 11/07/2021 10:19:53 - INFO - __main__ - Step 93042: {'lr': 0.00016152705042963498, 'samples': 17864064, 'steps': 93041, 'loss/train': 1.4842009544372559} 11/07/2021 10:19:55 - INFO - __main__ - Step 93043: {'lr': 0.00016152208712390723, 'samples': 17864256, 'steps': 93042, 'loss/train': 1.5940603017807007} 11/07/2021 10:19:55 - INFO - __main__ - Step 93044: {'lr': 0.00016151712385804615, 'samples': 17864448, 'steps': 93043, 'loss/train': 1.510469675064087} 11/07/2021 10:19:55 - INFO - __main__ - Step 93045: {'lr': 0.0001615121606320541, 'samples': 17864640, 'steps': 93044, 'loss/train': 2.4053890705108643} 11/07/2021 10:19:56 - INFO - __main__ - Step 93046: {'lr': 0.0001615071974459333, 'samples': 17864832, 'steps': 93045, 'loss/train': 1.2693326473236084} 11/07/2021 10:19:56 - INFO - __main__ - Step 93047: {'lr': 0.00016150223429968596, 'samples': 17865024, 'steps': 93046, 'loss/train': 1.1757047176361084} 11/07/2021 10:19:57 - INFO - __main__ - Step 93048: {'lr': 0.00016149727119331442, 'samples': 17865216, 'steps': 93047, 'loss/train': 1.4173372983932495} 11/07/2021 10:19:57 - INFO - __main__ - Step 93049: {'lr': 0.0001614923081268208, 'samples': 17865408, 'steps': 93048, 'loss/train': 1.1900808811187744} 11/07/2021 10:19:58 - INFO - __main__ - Step 93050: {'lr': 0.00016148734510020737, 'samples': 17865600, 'steps': 93049, 'loss/train': 1.9994466304779053} 11/07/2021 10:19:58 - INFO - __main__ - Step 93051: {'lr': 0.00016148238211347637, 'samples': 17865792, 'steps': 93050, 'loss/train': 1.1179845333099365} 11/07/2021 10:19:59 - INFO - __main__ - Step 93052: {'lr': 0.00016147741916663008, 'samples': 17865984, 'steps': 93051, 'loss/train': 1.782984972000122} 11/07/2021 10:20:00 - INFO - __main__ - Step 93053: {'lr': 0.00016147245625967066, 'samples': 17866176, 'steps': 93052, 'loss/train': 1.0559535026550293} 11/07/2021 10:20:00 - INFO - __main__ - Step 93054: {'lr': 0.00016146749339260042, 'samples': 17866368, 'steps': 93053, 'loss/train': 1.4846692085266113} 11/07/2021 10:20:00 - INFO - __main__ - Step 93055: {'lr': 0.00016146253056542153, 'samples': 17866560, 'steps': 93054, 'loss/train': 1.0427216291427612} 11/07/2021 10:20:01 - INFO - __main__ - Step 93056: {'lr': 0.0001614575677781364, 'samples': 17866752, 'steps': 93055, 'loss/train': 1.5161755084991455} 11/07/2021 10:20:01 - INFO - __main__ - Step 93057: {'lr': 0.000161452605030747, 'samples': 17866944, 'steps': 93056, 'loss/train': 1.9383882284164429} 11/07/2021 10:20:02 - INFO - __main__ - Step 93058: {'lr': 0.0001614476423232557, 'samples': 17867136, 'steps': 93057, 'loss/train': 1.460034966468811} 11/07/2021 10:20:02 - INFO - __main__ - Step 93059: {'lr': 0.00016144267965566473, 'samples': 17867328, 'steps': 93058, 'loss/train': 1.522706151008606} 11/07/2021 10:20:03 - INFO - __main__ - Step 93060: {'lr': 0.00016143771702797628, 'samples': 17867520, 'steps': 93059, 'loss/train': 0.9658939838409424} 11/07/2021 10:20:03 - INFO - __main__ - Step 93061: {'lr': 0.00016143275444019267, 'samples': 17867712, 'steps': 93060, 'loss/train': 1.1855847835540771} 11/07/2021 10:20:03 - INFO - __main__ - Step 93062: {'lr': 0.00016142779189231608, 'samples': 17867904, 'steps': 93061, 'loss/train': 1.4555840492248535} 11/07/2021 10:20:04 - INFO - __main__ - Step 93063: {'lr': 0.00016142282938434873, 'samples': 17868096, 'steps': 93062, 'loss/train': 1.520459771156311} 11/07/2021 10:20:05 - INFO - __main__ - Step 93064: {'lr': 0.00016141786691629292, 'samples': 17868288, 'steps': 93063, 'loss/train': 1.647117257118225} 11/07/2021 10:20:05 - INFO - __main__ - Step 93065: {'lr': 0.00016141290448815085, 'samples': 17868480, 'steps': 93064, 'loss/train': 1.2708122730255127} 11/07/2021 10:20:06 - INFO - __main__ - Step 93066: {'lr': 0.00016140794209992476, 'samples': 17868672, 'steps': 93065, 'loss/train': 1.3816633224487305} 11/07/2021 10:20:06 - INFO - __main__ - Step 93067: {'lr': 0.00016140297975161688, 'samples': 17868864, 'steps': 93066, 'loss/train': 1.4737416505813599} 11/07/2021 10:20:06 - INFO - __main__ - Step 93068: {'lr': 0.00016139801744322947, 'samples': 17869056, 'steps': 93067, 'loss/train': 1.1356242895126343} 11/07/2021 10:20:07 - INFO - __main__ - Step 93069: {'lr': 0.00016139305517476476, 'samples': 17869248, 'steps': 93068, 'loss/train': 1.8711906671524048} 11/07/2021 10:20:08 - INFO - __main__ - Step 93070: {'lr': 0.00016138809294622498, 'samples': 17869440, 'steps': 93069, 'loss/train': 1.2920246124267578} 11/07/2021 10:20:08 - INFO - __main__ - Step 93071: {'lr': 0.00016138313075761233, 'samples': 17869632, 'steps': 93070, 'loss/train': 1.8580228090286255} 11/07/2021 10:20:08 - INFO - __main__ - Step 93072: {'lr': 0.00016137816860892906, 'samples': 17869824, 'steps': 93071, 'loss/train': 0.9405996203422546} 11/07/2021 10:20:09 - INFO - __main__ - Step 93073: {'lr': 0.00016137320650017742, 'samples': 17870016, 'steps': 93072, 'loss/train': 1.398747444152832} 11/07/2021 10:20:10 - INFO - __main__ - Step 93074: {'lr': 0.00016136824443135965, 'samples': 17870208, 'steps': 93073, 'loss/train': 1.314123511314392} 11/07/2021 10:20:10 - INFO - __main__ - Step 93075: {'lr': 0.00016136328240247796, 'samples': 17870400, 'steps': 93074, 'loss/train': 1.0347553491592407} 11/07/2021 10:20:11 - INFO - __main__ - Step 93076: {'lr': 0.00016135832041353464, 'samples': 17870592, 'steps': 93075, 'loss/train': 1.5224475860595703} 11/07/2021 10:20:11 - INFO - __main__ - Step 93077: {'lr': 0.00016135335846453186, 'samples': 17870784, 'steps': 93076, 'loss/train': 1.255305528640747} 11/07/2021 10:20:11 - INFO - __main__ - Step 93078: {'lr': 0.0001613483965554719, 'samples': 17870976, 'steps': 93077, 'loss/train': 1.2364205121994019} 11/07/2021 10:20:12 - INFO - __main__ - Step 93079: {'lr': 0.000161343434686357, 'samples': 17871168, 'steps': 93078, 'loss/train': 0.3512703478336334} 11/07/2021 10:20:13 - INFO - __main__ - Step 93080: {'lr': 0.00016133847285718943, 'samples': 17871360, 'steps': 93079, 'loss/train': 0.8548104166984558} 11/07/2021 10:20:13 - INFO - __main__ - Step 93081: {'lr': 0.0001613335110679713, 'samples': 17871552, 'steps': 93080, 'loss/train': 1.3841294050216675} 11/07/2021 10:20:13 - INFO - __main__ - Step 93082: {'lr': 0.00016132854931870494, 'samples': 17871744, 'steps': 93081, 'loss/train': 1.2185804843902588} 11/07/2021 10:20:14 - INFO - __main__ - Step 93083: {'lr': 0.00016132358760939265, 'samples': 17871936, 'steps': 93082, 'loss/train': 1.4162315130233765} 11/07/2021 10:20:15 - INFO - __main__ - Step 93084: {'lr': 0.00016131862594003649, 'samples': 17872128, 'steps': 93083, 'loss/train': 1.5275729894638062} 11/07/2021 10:20:15 - INFO - __main__ - Step 93085: {'lr': 0.00016131366431063876, 'samples': 17872320, 'steps': 93084, 'loss/train': 1.4298765659332275} 11/07/2021 10:20:15 - INFO - __main__ - Step 93086: {'lr': 0.0001613087027212018, 'samples': 17872512, 'steps': 93085, 'loss/train': 1.5937089920043945} 11/07/2021 10:20:16 - INFO - __main__ - Step 93087: {'lr': 0.0001613037411717277, 'samples': 17872704, 'steps': 93086, 'loss/train': 1.7291452884674072} 11/07/2021 10:20:16 - INFO - __main__ - Step 93088: {'lr': 0.0001612987796622188, 'samples': 17872896, 'steps': 93087, 'loss/train': 1.0241918563842773} 11/07/2021 10:20:16 - INFO - __main__ - Step 93089: {'lr': 0.0001612938181926773, 'samples': 17873088, 'steps': 93088, 'loss/train': 1.953621745109558} 11/07/2021 10:20:17 - INFO - __main__ - Step 93090: {'lr': 0.00016128885676310544, 'samples': 17873280, 'steps': 93089, 'loss/train': 1.404950499534607} 11/07/2021 10:20:18 - INFO - __main__ - Step 93091: {'lr': 0.00016128389537350553, 'samples': 17873472, 'steps': 93090, 'loss/train': 1.6888445615768433} 11/07/2021 10:20:18 - INFO - __main__ - Step 93092: {'lr': 0.0001612789340238796, 'samples': 17873664, 'steps': 93091, 'loss/train': 1.1338058710098267} 11/07/2021 10:20:18 - INFO - __main__ - Step 93093: {'lr': 0.00016127397271423007, 'samples': 17873856, 'steps': 93092, 'loss/train': 1.3872150182724} 11/07/2021 10:20:19 - INFO - __main__ - Step 93094: {'lr': 0.00016126901144455913, 'samples': 17874048, 'steps': 93093, 'loss/train': 1.297934889793396} 11/07/2021 10:20:20 - INFO - __main__ - Step 93095: {'lr': 0.00016126405021486896, 'samples': 17874240, 'steps': 93094, 'loss/train': 2.422539710998535} 11/07/2021 10:20:20 - INFO - __main__ - Step 93096: {'lr': 0.00016125908902516186, 'samples': 17874432, 'steps': 93095, 'loss/train': 1.583148717880249} 11/07/2021 10:20:21 - INFO - __main__ - Step 93097: {'lr': 0.0001612541278754401, 'samples': 17874624, 'steps': 93096, 'loss/train': 1.445920467376709} 11/07/2021 10:20:21 - INFO - __main__ - Step 93098: {'lr': 0.00016124916676570582, 'samples': 17874816, 'steps': 93097, 'loss/train': 1.6177935600280762} 11/07/2021 10:20:21 - INFO - __main__ - Step 93099: {'lr': 0.00016124420569596127, 'samples': 17875008, 'steps': 93098, 'loss/train': 1.5710117816925049} 11/07/2021 10:20:22 - INFO - __main__ - Step 93100: {'lr': 0.00016123924466620874, 'samples': 17875200, 'steps': 93099, 'loss/train': 1.19000244140625} 11/07/2021 10:20:23 - INFO - __main__ - Step 93101: {'lr': 0.00016123428367645045, 'samples': 17875392, 'steps': 93100, 'loss/train': 1.270164132118225} 11/07/2021 10:20:23 - INFO - __main__ - Step 93102: {'lr': 0.00016122932272668862, 'samples': 17875584, 'steps': 93101, 'loss/train': 1.6300194263458252} 11/07/2021 10:20:24 - INFO - __main__ - Step 93103: {'lr': 0.00016122436181692545, 'samples': 17875776, 'steps': 93102, 'loss/train': 1.3487128019332886} 11/07/2021 10:20:24 - INFO - __main__ - Step 93104: {'lr': 0.0001612194009471632, 'samples': 17875968, 'steps': 93103, 'loss/train': 1.323673963546753} 11/07/2021 10:20:24 - INFO - __main__ - Step 93105: {'lr': 0.00016121444011740416, 'samples': 17876160, 'steps': 93104, 'loss/train': 1.7621020078659058} 11/07/2021 10:20:25 - INFO - __main__ - Step 93106: {'lr': 0.0001612094793276505, 'samples': 17876352, 'steps': 93105, 'loss/train': 1.5276399850845337} 11/07/2021 10:20:26 - INFO - __main__ - Step 93107: {'lr': 0.00016120451857790446, 'samples': 17876544, 'steps': 93106, 'loss/train': 1.6966276168823242} 11/07/2021 10:20:26 - INFO - __main__ - Step 93108: {'lr': 0.00016119955786816833, 'samples': 17876736, 'steps': 93107, 'loss/train': 1.613688588142395} 11/07/2021 10:20:26 - INFO - __main__ - Step 93109: {'lr': 0.00016119459719844432, 'samples': 17876928, 'steps': 93108, 'loss/train': 1.5535088777542114} 11/07/2021 10:20:27 - INFO - __main__ - Step 93110: {'lr': 0.00016118963656873466, 'samples': 17877120, 'steps': 93109, 'loss/train': 1.4772515296936035} 11/07/2021 10:20:28 - INFO - __main__ - Step 93111: {'lr': 0.00016118467597904158, 'samples': 17877312, 'steps': 93110, 'loss/train': 1.284328818321228} 11/07/2021 10:20:28 - INFO - __main__ - Step 93112: {'lr': 0.00016117971542936732, 'samples': 17877504, 'steps': 93111, 'loss/train': 0.7645211815834045} 11/07/2021 10:20:28 - INFO - __main__ - Step 93113: {'lr': 0.00016117475491971407, 'samples': 17877696, 'steps': 93112, 'loss/train': 1.6644095182418823} 11/07/2021 10:20:29 - INFO - __main__ - Step 93114: {'lr': 0.00016116979445008413, 'samples': 17877888, 'steps': 93113, 'loss/train': 1.2552497386932373} 11/07/2021 10:20:29 - INFO - __main__ - Step 93115: {'lr': 0.00016116483402047965, 'samples': 17878080, 'steps': 93114, 'loss/train': 0.8304169178009033} 11/07/2021 10:20:30 - INFO - __main__ - Step 93116: {'lr': 0.00016115987363090296, 'samples': 17878272, 'steps': 93115, 'loss/train': 3.610405683517456} 11/07/2021 10:20:30 - INFO - __main__ - Step 93117: {'lr': 0.0001611549132813563, 'samples': 17878464, 'steps': 93116, 'loss/train': 1.6784509420394897} 11/07/2021 10:20:31 - INFO - __main__ - Step 93118: {'lr': 0.00016114995297184182, 'samples': 17878656, 'steps': 93117, 'loss/train': 1.583593726158142} 11/07/2021 10:20:31 - INFO - __main__ - Step 93119: {'lr': 0.00016114499270236177, 'samples': 17878848, 'steps': 93118, 'loss/train': 1.356047511100769} 11/07/2021 10:20:32 - INFO - __main__ - Step 93120: {'lr': 0.00016114003247291847, 'samples': 17879040, 'steps': 93119, 'loss/train': 1.4302994012832642} 11/07/2021 10:20:33 - INFO - __main__ - Step 93121: {'lr': 0.0001611350722835141, 'samples': 17879232, 'steps': 93120, 'loss/train': 1.272779107093811} 11/07/2021 10:20:33 - INFO - __main__ - Step 93122: {'lr': 0.00016113011213415084, 'samples': 17879424, 'steps': 93121, 'loss/train': 1.2508338689804077} 11/07/2021 10:20:33 - INFO - __main__ - Step 93123: {'lr': 0.00016112515202483115, 'samples': 17879616, 'steps': 93122, 'loss/train': 1.0881733894348145} 11/07/2021 10:20:34 - INFO - __main__ - Step 93124: {'lr': 0.00016112019195555695, 'samples': 17879808, 'steps': 93123, 'loss/train': 1.6731575727462769} 11/07/2021 10:20:34 - INFO - __main__ - Step 93125: {'lr': 0.00016111523192633066, 'samples': 17880000, 'steps': 93124, 'loss/train': 1.3404185771942139} 11/07/2021 10:20:34 - INFO - __main__ - Step 93126: {'lr': 0.00016111027193715444, 'samples': 17880192, 'steps': 93125, 'loss/train': 1.0891183614730835} 11/07/2021 10:20:35 - INFO - __main__ - Step 93127: {'lr': 0.00016110531198803055, 'samples': 17880384, 'steps': 93126, 'loss/train': 1.352777361869812} 11/07/2021 10:20:36 - INFO - __main__ - Step 93128: {'lr': 0.00016110035207896127, 'samples': 17880576, 'steps': 93127, 'loss/train': 1.313431978225708} 11/07/2021 10:20:36 - INFO - __main__ - Step 93129: {'lr': 0.00016109539220994878, 'samples': 17880768, 'steps': 93128, 'loss/train': 1.8724991083145142} 11/07/2021 10:20:37 - INFO - __main__ - Step 93130: {'lr': 0.00016109043238099534, 'samples': 17880960, 'steps': 93129, 'loss/train': 2.930649518966675} 11/07/2021 10:20:37 - INFO - __main__ - Step 93131: {'lr': 0.00016108547259210317, 'samples': 17881152, 'steps': 93130, 'loss/train': 1.6658477783203125} 11/07/2021 10:20:38 - INFO - __main__ - Step 93132: {'lr': 0.00016108051284327452, 'samples': 17881344, 'steps': 93131, 'loss/train': 1.4390314817428589} 11/07/2021 10:20:38 - INFO - __main__ - Step 93133: {'lr': 0.0001610755531345116, 'samples': 17881536, 'steps': 93132, 'loss/train': 1.4840527772903442} 11/07/2021 10:20:39 - INFO - __main__ - Step 93134: {'lr': 0.0001610705934658167, 'samples': 17881728, 'steps': 93133, 'loss/train': 1.3291875123977661} 11/07/2021 10:20:39 - INFO - __main__ - Step 93135: {'lr': 0.000161065633837192, 'samples': 17881920, 'steps': 93134, 'loss/train': 1.5198088884353638} 11/07/2021 10:20:39 - INFO - __main__ - Step 93136: {'lr': 0.00016106067424863973, 'samples': 17882112, 'steps': 93135, 'loss/train': 1.4371259212493896} 11/07/2021 10:20:40 - INFO - __main__ - Step 93137: {'lr': 0.0001610557147001623, 'samples': 17882304, 'steps': 93136, 'loss/train': 0.1847122311592102} 11/07/2021 10:20:41 - INFO - __main__ - Step 93138: {'lr': 0.00016105075519176165, 'samples': 17882496, 'steps': 93137, 'loss/train': 1.6359118223190308} 11/07/2021 10:20:41 - INFO - __main__ - Step 93139: {'lr': 0.0001610457957234402, 'samples': 17882688, 'steps': 93138, 'loss/train': 1.023189663887024} 11/07/2021 10:20:41 - INFO - __main__ - Step 93140: {'lr': 0.0001610408362952001, 'samples': 17882880, 'steps': 93139, 'loss/train': 1.5436651706695557} 11/07/2021 10:20:42 - INFO - __main__ - Step 93141: {'lr': 0.00016103587690704363, 'samples': 17883072, 'steps': 93140, 'loss/train': 1.6100022792816162} 11/07/2021 10:20:43 - INFO - __main__ - Step 93142: {'lr': 0.00016103091755897302, 'samples': 17883264, 'steps': 93141, 'loss/train': 1.6959989070892334} 11/07/2021 10:20:43 - INFO - __main__ - Step 93143: {'lr': 0.00016102595825099054, 'samples': 17883456, 'steps': 93142, 'loss/train': 1.3578764200210571} 11/07/2021 10:20:43 - INFO - __main__ - Step 93144: {'lr': 0.00016102099898309836, 'samples': 17883648, 'steps': 93143, 'loss/train': 0.8700263500213623} 11/07/2021 10:20:44 - INFO - __main__ - Step 93145: {'lr': 0.00016101603975529873, 'samples': 17883840, 'steps': 93144, 'loss/train': 1.2220473289489746} 11/07/2021 10:20:44 - INFO - __main__ - Step 93146: {'lr': 0.00016101108056759396, 'samples': 17884032, 'steps': 93145, 'loss/train': 0.9046561121940613} 11/07/2021 10:20:45 - INFO - __main__ - Step 93147: {'lr': 0.00016100612141998615, 'samples': 17884224, 'steps': 93146, 'loss/train': 1.3808180093765259} 11/07/2021 10:20:46 - INFO - __main__ - Step 93148: {'lr': 0.00016100116231247764, 'samples': 17884416, 'steps': 93147, 'loss/train': 0.8517068028450012} 11/07/2021 10:20:46 - INFO - __main__ - Step 93149: {'lr': 0.00016099620324507065, 'samples': 17884608, 'steps': 93148, 'loss/train': 1.8621429204940796} 11/07/2021 10:20:46 - INFO - __main__ - Step 93150: {'lr': 0.0001609912442177675, 'samples': 17884800, 'steps': 93149, 'loss/train': 1.395085096359253} 11/07/2021 10:20:47 - INFO - __main__ - Step 93151: {'lr': 0.00016098628523057018, 'samples': 17884992, 'steps': 93150, 'loss/train': 1.4992388486862183} 11/07/2021 10:20:47 - INFO - __main__ - Step 93152: {'lr': 0.00016098132628348112, 'samples': 17885184, 'steps': 93151, 'loss/train': 1.3775737285614014} 11/07/2021 10:20:48 - INFO - __main__ - Step 93153: {'lr': 0.00016097636737650244, 'samples': 17885376, 'steps': 93152, 'loss/train': 0.6494150161743164} 11/07/2021 10:20:48 - INFO - __main__ - Step 93154: {'lr': 0.00016097140850963648, 'samples': 17885568, 'steps': 93153, 'loss/train': 1.552982211112976} 11/07/2021 10:20:49 - INFO - __main__ - Step 93155: {'lr': 0.00016096644968288543, 'samples': 17885760, 'steps': 93154, 'loss/train': 1.2883050441741943} 11/07/2021 10:20:49 - INFO - __main__ - Step 93156: {'lr': 0.0001609614908962515, 'samples': 17885952, 'steps': 93155, 'loss/train': 1.4681860208511353} 11/07/2021 10:20:50 - INFO - __main__ - Step 93157: {'lr': 0.00016095653214973695, 'samples': 17886144, 'steps': 93156, 'loss/train': 1.6694267988204956} 11/07/2021 10:20:50 - INFO - __main__ - Step 93158: {'lr': 0.00016095157344334405, 'samples': 17886336, 'steps': 93157, 'loss/train': 2.279714822769165} 11/07/2021 10:20:51 - INFO - __main__ - Step 93159: {'lr': 0.00016094661477707495, 'samples': 17886528, 'steps': 93158, 'loss/train': 1.7847801446914673} 11/07/2021 10:20:52 - INFO - __main__ - Step 93160: {'lr': 0.00016094165615093193, 'samples': 17886720, 'steps': 93159, 'loss/train': 1.5663644075393677} 11/07/2021 10:20:52 - INFO - __main__ - Step 93161: {'lr': 0.00016093669756491724, 'samples': 17886912, 'steps': 93160, 'loss/train': 0.6009225249290466} 11/07/2021 10:20:52 - INFO - __main__ - Step 93162: {'lr': 0.00016093173901903312, 'samples': 17887104, 'steps': 93161, 'loss/train': 0.5293936133384705} 11/07/2021 10:20:53 - INFO - __main__ - Step 93163: {'lr': 0.00016092678051328178, 'samples': 17887296, 'steps': 93162, 'loss/train': 0.4664562940597534} 11/07/2021 10:20:53 - INFO - __main__ - Step 93164: {'lr': 0.00016092182204766552, 'samples': 17887488, 'steps': 93163, 'loss/train': 0.7845654487609863} 11/07/2021 10:20:53 - INFO - __main__ - Step 93165: {'lr': 0.00016091686362218648, 'samples': 17887680, 'steps': 93164, 'loss/train': 1.626598834991455} 11/07/2021 10:20:54 - INFO - __main__ - Step 93166: {'lr': 0.00016091190523684687, 'samples': 17887872, 'steps': 93165, 'loss/train': 1.1232621669769287} 11/07/2021 10:20:55 - INFO - __main__ - Step 93167: {'lr': 0.000160906946891649, 'samples': 17888064, 'steps': 93166, 'loss/train': 0.8494899868965149} 11/07/2021 10:20:55 - INFO - __main__ - Step 93168: {'lr': 0.00016090198858659507, 'samples': 17888256, 'steps': 93167, 'loss/train': 1.4106558561325073} 11/07/2021 10:20:55 - INFO - __main__ - Step 93169: {'lr': 0.00016089703032168734, 'samples': 17888448, 'steps': 93168, 'loss/train': 1.171697735786438} 11/07/2021 10:20:56 - INFO - __main__ - Step 93170: {'lr': 0.00016089207209692805, 'samples': 17888640, 'steps': 93169, 'loss/train': 1.3820115327835083} 11/07/2021 10:20:57 - INFO - __main__ - Step 93171: {'lr': 0.00016088711391231938, 'samples': 17888832, 'steps': 93170, 'loss/train': 1.4537196159362793} 11/07/2021 10:20:57 - INFO - __main__ - Step 93172: {'lr': 0.00016088215576786364, 'samples': 17889024, 'steps': 93171, 'loss/train': 1.1579866409301758} 11/07/2021 10:20:58 - INFO - __main__ - Step 93173: {'lr': 0.000160877197663563, 'samples': 17889216, 'steps': 93172, 'loss/train': 1.773093342781067} 11/07/2021 10:20:58 - INFO - __main__ - Step 93174: {'lr': 0.00016087223959941973, 'samples': 17889408, 'steps': 93173, 'loss/train': 1.0351881980895996} 11/07/2021 10:20:58 - INFO - __main__ - Step 93175: {'lr': 0.00016086728157543607, 'samples': 17889600, 'steps': 93174, 'loss/train': 1.5402907133102417} 11/07/2021 10:20:59 - INFO - __main__ - Step 93176: {'lr': 0.0001608623235916142, 'samples': 17889792, 'steps': 93175, 'loss/train': 1.0261350870132446} 11/07/2021 10:21:00 - INFO - __main__ - Step 93177: {'lr': 0.0001608573656479565, 'samples': 17889984, 'steps': 93176, 'loss/train': 1.425187110900879} 11/07/2021 10:21:00 - INFO - __main__ - Step 93178: {'lr': 0.00016085240774446502, 'samples': 17890176, 'steps': 93177, 'loss/train': 1.276685357093811} 11/07/2021 10:21:00 - INFO - __main__ - Step 93179: {'lr': 0.00016084744988114206, 'samples': 17890368, 'steps': 93178, 'loss/train': 1.456344723701477} 11/07/2021 10:21:01 - INFO - __main__ - Step 93180: {'lr': 0.00016084249205798983, 'samples': 17890560, 'steps': 93179, 'loss/train': 1.400570034980774} 11/07/2021 10:21:01 - INFO - __main__ - Step 93181: {'lr': 0.00016083753427501064, 'samples': 17890752, 'steps': 93180, 'loss/train': 1.727543830871582} 11/07/2021 10:21:03 - INFO - __main__ - Step 93182: {'lr': 0.00016083257653220668, 'samples': 17890944, 'steps': 93181, 'loss/train': 1.316927194595337} 11/07/2021 10:21:03 - INFO - __main__ - Step 93183: {'lr': 0.0001608276188295802, 'samples': 17891136, 'steps': 93182, 'loss/train': 1.8417750597000122} 11/07/2021 10:21:03 - INFO - __main__ - Step 93184: {'lr': 0.00016082266116713336, 'samples': 17891328, 'steps': 93183, 'loss/train': 1.2924295663833618} 11/07/2021 10:21:04 - INFO - __main__ - Step 93185: {'lr': 0.00016081770354486847, 'samples': 17891520, 'steps': 93184, 'loss/train': 1.8215795755386353} 11/07/2021 10:21:04 - INFO - __main__ - Step 93186: {'lr': 0.00016081274596278777, 'samples': 17891712, 'steps': 93185, 'loss/train': 0.7343735098838806} 11/07/2021 10:21:05 - INFO - __main__ - Step 93187: {'lr': 0.00016080778842089347, 'samples': 17891904, 'steps': 93186, 'loss/train': 1.0810467004776} 11/07/2021 10:21:06 - INFO - __main__ - Step 93188: {'lr': 0.0001608028309191878, 'samples': 17892096, 'steps': 93187, 'loss/train': 1.2343534231185913} 11/07/2021 10:21:06 - INFO - __main__ - Step 93189: {'lr': 0.00016079787345767298, 'samples': 17892288, 'steps': 93188, 'loss/train': 1.5890793800354004} 11/07/2021 10:21:06 - INFO - __main__ - Step 93190: {'lr': 0.00016079291603635128, 'samples': 17892480, 'steps': 93189, 'loss/train': 1.9065614938735962} 11/07/2021 10:21:07 - INFO - __main__ - Step 93191: {'lr': 0.000160787958655225, 'samples': 17892672, 'steps': 93190, 'loss/train': 1.118606448173523} 11/07/2021 10:21:07 - INFO - __main__ - Step 93192: {'lr': 0.0001607830013142962, 'samples': 17892864, 'steps': 93191, 'loss/train': 1.3077480792999268} 11/07/2021 10:21:08 - INFO - __main__ - Step 93193: {'lr': 0.00016077804401356722, 'samples': 17893056, 'steps': 93192, 'loss/train': 1.336204171180725} 11/07/2021 10:21:08 - INFO - __main__ - Step 93194: {'lr': 0.00016077308675304026, 'samples': 17893248, 'steps': 93193, 'loss/train': 1.3620250225067139} 11/07/2021 10:21:09 - INFO - __main__ - Step 93195: {'lr': 0.00016076812953271758, 'samples': 17893440, 'steps': 93194, 'loss/train': 1.5212020874023438} 11/07/2021 10:21:09 - INFO - __main__ - Step 93196: {'lr': 0.00016076317235260137, 'samples': 17893632, 'steps': 93195, 'loss/train': 1.3651883602142334} 11/07/2021 10:21:09 - INFO - __main__ - Step 93197: {'lr': 0.00016075821521269393, 'samples': 17893824, 'steps': 93196, 'loss/train': 1.6721488237380981} 11/07/2021 10:21:10 - INFO - __main__ - Step 93198: {'lr': 0.00016075325811299747, 'samples': 17894016, 'steps': 93197, 'loss/train': 1.3108861446380615} 11/07/2021 10:21:11 - INFO - __main__ - Step 93199: {'lr': 0.00016074830105351418, 'samples': 17894208, 'steps': 93198, 'loss/train': 2.2325923442840576} 11/07/2021 10:21:11 - INFO - __main__ - Step 93200: {'lr': 0.00016074334403424635, 'samples': 17894400, 'steps': 93199, 'loss/train': 1.3853503465652466} 11/07/2021 10:21:11 - INFO - __main__ - Step 93201: {'lr': 0.00016073838705519617, 'samples': 17894592, 'steps': 93200, 'loss/train': 1.0292577743530273} 11/07/2021 10:21:12 - INFO - __main__ - Step 93202: {'lr': 0.00016073343011636593, 'samples': 17894784, 'steps': 93201, 'loss/train': 0.16133911907672882} 11/07/2021 10:21:12 - INFO - __main__ - Step 93203: {'lr': 0.00016072847321775785, 'samples': 17894976, 'steps': 93202, 'loss/train': 1.5050150156021118} 11/07/2021 10:21:13 - INFO - __main__ - Step 93204: {'lr': 0.00016072351635937416, 'samples': 17895168, 'steps': 93203, 'loss/train': 1.3007577657699585} 11/07/2021 10:21:14 - INFO - __main__ - Step 93205: {'lr': 0.000160718559541217, 'samples': 17895360, 'steps': 93204, 'loss/train': 1.0197783708572388} 11/07/2021 10:21:14 - INFO - __main__ - Step 93206: {'lr': 0.00016071360276328874, 'samples': 17895552, 'steps': 93205, 'loss/train': 1.7827701568603516} 11/07/2021 10:21:14 - INFO - __main__ - Step 93207: {'lr': 0.0001607086460255915, 'samples': 17895744, 'steps': 93206, 'loss/train': 1.382276177406311} 11/07/2021 10:21:15 - INFO - __main__ - Step 93208: {'lr': 0.00016070368932812756, 'samples': 17895936, 'steps': 93207, 'loss/train': 1.5211153030395508} 11/07/2021 10:21:16 - INFO - __main__ - Step 93209: {'lr': 0.00016069873267089918, 'samples': 17896128, 'steps': 93208, 'loss/train': 1.425524353981018} 11/07/2021 10:21:16 - INFO - __main__ - Step 93210: {'lr': 0.00016069377605390856, 'samples': 17896320, 'steps': 93209, 'loss/train': 1.418277382850647} 11/07/2021 10:21:16 - INFO - __main__ - Step 93211: {'lr': 0.00016068881947715796, 'samples': 17896512, 'steps': 93210, 'loss/train': 1.9079020023345947} 11/07/2021 10:21:17 - INFO - __main__ - Step 93212: {'lr': 0.00016068386294064964, 'samples': 17896704, 'steps': 93211, 'loss/train': 1.3805071115493774} 11/07/2021 10:21:17 - INFO - __main__ - Step 93213: {'lr': 0.0001606789064443857, 'samples': 17896896, 'steps': 93212, 'loss/train': 0.9089117646217346} 11/07/2021 10:21:18 - INFO - __main__ - Step 93214: {'lr': 0.0001606739499883686, 'samples': 17897088, 'steps': 93213, 'loss/train': 1.5924896001815796} 11/07/2021 10:21:19 - INFO - __main__ - Step 93215: {'lr': 0.00016066899357260035, 'samples': 17897280, 'steps': 93214, 'loss/train': 1.9732553958892822} 11/07/2021 10:21:19 - INFO - __main__ - Step 93216: {'lr': 0.00016066403719708328, 'samples': 17897472, 'steps': 93215, 'loss/train': 1.5107054710388184} 11/07/2021 10:21:19 - INFO - __main__ - Step 93217: {'lr': 0.0001606590808618196, 'samples': 17897664, 'steps': 93216, 'loss/train': 2.0388057231903076} 11/07/2021 10:21:20 - INFO - __main__ - Step 93218: {'lr': 0.00016065412456681163, 'samples': 17897856, 'steps': 93217, 'loss/train': 1.262162446975708} 11/07/2021 10:21:21 - INFO - __main__ - Step 93219: {'lr': 0.0001606491683120615, 'samples': 17898048, 'steps': 93218, 'loss/train': 1.4919339418411255} 11/07/2021 10:21:21 - INFO - __main__ - Step 93220: {'lr': 0.00016064421209757143, 'samples': 17898240, 'steps': 93219, 'loss/train': 1.4804614782333374} 11/07/2021 10:21:21 - INFO - __main__ - Step 93221: {'lr': 0.0001606392559233437, 'samples': 17898432, 'steps': 93220, 'loss/train': 1.1014162302017212} 11/07/2021 10:21:22 - INFO - __main__ - Step 93222: {'lr': 0.0001606342997893806, 'samples': 17898624, 'steps': 93221, 'loss/train': 1.460372805595398} 11/07/2021 10:21:22 - INFO - __main__ - Step 93223: {'lr': 0.00016062934369568427, 'samples': 17898816, 'steps': 93222, 'loss/train': 1.4226783514022827} 11/07/2021 10:21:23 - INFO - __main__ - Step 93224: {'lr': 0.00016062438764225694, 'samples': 17899008, 'steps': 93223, 'loss/train': 1.4209887981414795} 11/07/2021 10:21:23 - INFO - __main__ - Step 93225: {'lr': 0.000160619431629101, 'samples': 17899200, 'steps': 93224, 'loss/train': 1.9462932348251343} 11/07/2021 10:21:24 - INFO - __main__ - Step 93226: {'lr': 0.00016061447565621852, 'samples': 17899392, 'steps': 93225, 'loss/train': 1.7088919878005981} 11/07/2021 10:21:24 - INFO - __main__ - Step 93227: {'lr': 0.0001606095197236117, 'samples': 17899584, 'steps': 93226, 'loss/train': 1.4257094860076904} 11/07/2021 10:21:24 - INFO - __main__ - Step 93228: {'lr': 0.00016060456383128291, 'samples': 17899776, 'steps': 93227, 'loss/train': 1.3136826753616333} 11/07/2021 10:21:26 - INFO - __main__ - Step 93229: {'lr': 0.00016059960797923432, 'samples': 17899968, 'steps': 93228, 'loss/train': 1.137951374053955} 11/07/2021 10:21:26 - INFO - __main__ - Step 93230: {'lr': 0.00016059465216746816, 'samples': 17900160, 'steps': 93229, 'loss/train': 0.955281138420105} 11/07/2021 10:21:26 - INFO - __main__ - Step 93231: {'lr': 0.00016058969639598668, 'samples': 17900352, 'steps': 93230, 'loss/train': 1.2279150485992432} 11/07/2021 10:21:27 - INFO - __main__ - Step 93232: {'lr': 0.0001605847406647921, 'samples': 17900544, 'steps': 93231, 'loss/train': 1.4166818857192993} 11/07/2021 10:21:27 - INFO - __main__ - Step 93233: {'lr': 0.00016057978497388664, 'samples': 17900736, 'steps': 93232, 'loss/train': 0.9796985387802124} 11/07/2021 10:21:27 - INFO - __main__ - Step 93234: {'lr': 0.00016057482932327257, 'samples': 17900928, 'steps': 93233, 'loss/train': 1.261142373085022} 11/07/2021 10:21:28 - INFO - __main__ - Step 93235: {'lr': 0.00016056987371295209, 'samples': 17901120, 'steps': 93234, 'loss/train': 1.253390908241272} 11/07/2021 10:21:29 - INFO - __main__ - Step 93236: {'lr': 0.00016056491814292752, 'samples': 17901312, 'steps': 93235, 'loss/train': 1.3485288619995117} 11/07/2021 10:21:29 - INFO - __main__ - Step 93237: {'lr': 0.0001605599626132009, 'samples': 17901504, 'steps': 93236, 'loss/train': 1.145503044128418} 11/07/2021 10:21:30 - INFO - __main__ - Step 93238: {'lr': 0.00016055500712377463, 'samples': 17901696, 'steps': 93237, 'loss/train': 1.448940634727478} 11/07/2021 10:21:30 - INFO - __main__ - Step 93239: {'lr': 0.00016055005167465089, 'samples': 17901888, 'steps': 93238, 'loss/train': 1.5898252725601196} 11/07/2021 10:21:31 - INFO - __main__ - Step 93240: {'lr': 0.00016054509626583192, 'samples': 17902080, 'steps': 93239, 'loss/train': 1.3969298601150513} 11/07/2021 10:21:31 - INFO - __main__ - Step 93241: {'lr': 0.00016054014089731994, 'samples': 17902272, 'steps': 93240, 'loss/train': 1.3858230113983154} 11/07/2021 10:21:32 - INFO - __main__ - Step 93242: {'lr': 0.00016053518556911718, 'samples': 17902464, 'steps': 93241, 'loss/train': 0.8533219695091248} 11/07/2021 10:21:32 - INFO - __main__ - Step 93243: {'lr': 0.00016053023028122587, 'samples': 17902656, 'steps': 93242, 'loss/train': 1.3268638849258423} 11/07/2021 10:21:32 - INFO - __main__ - Step 93244: {'lr': 0.00016052527503364835, 'samples': 17902848, 'steps': 93243, 'loss/train': 1.2588374614715576} 11/07/2021 10:21:33 - INFO - __main__ - Step 93245: {'lr': 0.00016052031982638672, 'samples': 17903040, 'steps': 93244, 'loss/train': 0.9334554672241211} 11/07/2021 10:21:34 - INFO - __main__ - Step 93246: {'lr': 0.00016051536465944323, 'samples': 17903232, 'steps': 93245, 'loss/train': 1.1952261924743652} 11/07/2021 10:21:34 - INFO - __main__ - Step 93247: {'lr': 0.00016051040953282017, 'samples': 17903424, 'steps': 93246, 'loss/train': 1.4770212173461914} 11/07/2021 10:21:34 - INFO - __main__ - Step 93248: {'lr': 0.00016050545444651972, 'samples': 17903616, 'steps': 93247, 'loss/train': 1.5596964359283447} 11/07/2021 10:21:35 - INFO - __main__ - Step 93249: {'lr': 0.00016050049940054408, 'samples': 17903808, 'steps': 93248, 'loss/train': 1.0605716705322266} 11/07/2021 10:21:36 - INFO - __main__ - Step 93250: {'lr': 0.0001604955443948956, 'samples': 17904000, 'steps': 93249, 'loss/train': 1.4874733686447144} 11/07/2021 10:21:36 - INFO - __main__ - Step 93251: {'lr': 0.00016049058942957639, 'samples': 17904192, 'steps': 93250, 'loss/train': 1.7606418132781982} 11/07/2021 10:21:36 - INFO - __main__ - Step 93252: {'lr': 0.00016048563450458874, 'samples': 17904384, 'steps': 93251, 'loss/train': 1.6199963092803955} 11/07/2021 10:21:37 - INFO - __main__ - Step 93253: {'lr': 0.00016048067961993494, 'samples': 17904576, 'steps': 93252, 'loss/train': 1.7020950317382812} 11/07/2021 10:21:37 - INFO - __main__ - Step 93254: {'lr': 0.0001604757247756171, 'samples': 17904768, 'steps': 93253, 'loss/train': 1.4251713752746582} 11/07/2021 10:21:38 - INFO - __main__ - Step 93255: {'lr': 0.00016047076997163757, 'samples': 17904960, 'steps': 93254, 'loss/train': 0.9139475226402283} 11/07/2021 10:21:39 - INFO - __main__ - Step 93256: {'lr': 0.00016046581520799853, 'samples': 17905152, 'steps': 93255, 'loss/train': 1.677262306213379} 11/07/2021 10:21:39 - INFO - __main__ - Step 93257: {'lr': 0.00016046086048470215, 'samples': 17905344, 'steps': 93256, 'loss/train': 1.609190583229065} 11/07/2021 10:21:39 - INFO - __main__ - Step 93258: {'lr': 0.00016045590580175087, 'samples': 17905536, 'steps': 93257, 'loss/train': 1.4492605924606323} 11/07/2021 10:21:40 - INFO - __main__ - Step 93259: {'lr': 0.00016045095115914667, 'samples': 17905728, 'steps': 93258, 'loss/train': 1.3688777685165405} 11/07/2021 10:21:41 - INFO - __main__ - Step 93260: {'lr': 0.0001604459965568919, 'samples': 17905920, 'steps': 93259, 'loss/train': 0.43749040365219116} 11/07/2021 10:21:41 - INFO - __main__ - Step 93261: {'lr': 0.00016044104199498878, 'samples': 17906112, 'steps': 93260, 'loss/train': 1.3460981845855713} 11/07/2021 10:21:41 - INFO - __main__ - Step 93262: {'lr': 0.0001604360874734395, 'samples': 17906304, 'steps': 93261, 'loss/train': 0.9973796606063843} 11/07/2021 10:21:42 - INFO - __main__ - Step 93263: {'lr': 0.0001604311329922464, 'samples': 17906496, 'steps': 93262, 'loss/train': 1.2385369539260864} 11/07/2021 10:21:42 - INFO - __main__ - Step 93264: {'lr': 0.0001604261785514116, 'samples': 17906688, 'steps': 93263, 'loss/train': 1.4180430173873901} 11/07/2021 10:21:42 - INFO - __main__ - Step 93265: {'lr': 0.0001604212241509374, 'samples': 17906880, 'steps': 93264, 'loss/train': 1.5092118978500366} 11/07/2021 10:21:43 - INFO - __main__ - Step 93266: {'lr': 0.00016041626979082602, 'samples': 17907072, 'steps': 93265, 'loss/train': 1.4003543853759766} 11/07/2021 10:21:44 - INFO - __main__ - Step 93267: {'lr': 0.00016041131547107969, 'samples': 17907264, 'steps': 93266, 'loss/train': 0.6721251010894775} 11/07/2021 10:21:44 - INFO - __main__ - Step 93268: {'lr': 0.00016040636119170066, 'samples': 17907456, 'steps': 93267, 'loss/train': 1.3332117795944214} 11/07/2021 10:21:44 - INFO - __main__ - Step 93269: {'lr': 0.0001604014069526911, 'samples': 17907648, 'steps': 93268, 'loss/train': 1.175777554512024} 11/07/2021 10:21:45 - INFO - __main__ - Step 93270: {'lr': 0.00016039645275405328, 'samples': 17907840, 'steps': 93269, 'loss/train': 1.2156169414520264} 11/07/2021 10:21:46 - INFO - __main__ - Step 93271: {'lr': 0.00016039149859578956, 'samples': 17908032, 'steps': 93270, 'loss/train': 1.3725311756134033} 11/07/2021 10:21:46 - INFO - __main__ - Step 93272: {'lr': 0.00016038654447790197, 'samples': 17908224, 'steps': 93271, 'loss/train': 1.183982014656067} 11/07/2021 10:21:46 - INFO - __main__ - Step 93273: {'lr': 0.00016038159040039277, 'samples': 17908416, 'steps': 93272, 'loss/train': 1.5677870512008667} 11/07/2021 10:21:47 - INFO - __main__ - Step 93274: {'lr': 0.00016037663636326427, 'samples': 17908608, 'steps': 93273, 'loss/train': 1.2680351734161377} 11/07/2021 10:21:47 - INFO - __main__ - Step 93275: {'lr': 0.00016037168236651868, 'samples': 17908800, 'steps': 93274, 'loss/train': 1.2831565141677856} 11/07/2021 10:21:48 - INFO - __main__ - Step 93276: {'lr': 0.0001603667284101582, 'samples': 17908992, 'steps': 93275, 'loss/train': 1.558331847190857} 11/07/2021 10:21:48 - INFO - __main__ - Step 93277: {'lr': 0.0001603617744941851, 'samples': 17909184, 'steps': 93276, 'loss/train': 1.36737859249115} 11/07/2021 10:21:49 - INFO - __main__ - Step 93278: {'lr': 0.00016035682061860162, 'samples': 17909376, 'steps': 93277, 'loss/train': 1.4920933246612549} 11/07/2021 10:21:49 - INFO - __main__ - Step 93279: {'lr': 0.00016035186678340995, 'samples': 17909568, 'steps': 93278, 'loss/train': 1.6672039031982422} 11/07/2021 10:21:49 - INFO - __main__ - Step 93280: {'lr': 0.00016034691298861238, 'samples': 17909760, 'steps': 93279, 'loss/train': 0.5763206481933594} 11/07/2021 10:21:51 - INFO - __main__ - Step 93281: {'lr': 0.00016034195923421104, 'samples': 17909952, 'steps': 93280, 'loss/train': 1.463558554649353} 11/07/2021 10:21:51 - INFO - __main__ - Step 93282: {'lr': 0.0001603370055202083, 'samples': 17910144, 'steps': 93281, 'loss/train': 1.746585488319397} 11/07/2021 10:21:51 - INFO - __main__ - Step 93283: {'lr': 0.00016033205184660625, 'samples': 17910336, 'steps': 93282, 'loss/train': 1.3269169330596924} 11/07/2021 10:21:52 - INFO - __main__ - Step 93284: {'lr': 0.00016032709821340728, 'samples': 17910528, 'steps': 93283, 'loss/train': 1.1931495666503906} 11/07/2021 10:21:52 - INFO - __main__ - Step 93285: {'lr': 0.00016032214462061357, 'samples': 17910720, 'steps': 93284, 'loss/train': 1.069608449935913} 11/07/2021 10:21:53 - INFO - __main__ - Step 93286: {'lr': 0.00016031719106822726, 'samples': 17910912, 'steps': 93285, 'loss/train': 1.5624492168426514} 11/07/2021 10:21:53 - INFO - __main__ - Step 93287: {'lr': 0.00016031223755625062, 'samples': 17911104, 'steps': 93286, 'loss/train': 1.6979893445968628} 11/07/2021 10:21:54 - INFO - __main__ - Step 93288: {'lr': 0.0001603072840846859, 'samples': 17911296, 'steps': 93287, 'loss/train': 1.0517433881759644} 11/07/2021 10:21:54 - INFO - __main__ - Step 93289: {'lr': 0.00016030233065353534, 'samples': 17911488, 'steps': 93288, 'loss/train': 1.0932399034500122} 11/07/2021 10:21:54 - INFO - __main__ - Step 93290: {'lr': 0.00016029737726280113, 'samples': 17911680, 'steps': 93289, 'loss/train': 1.5288504362106323} 11/07/2021 10:21:56 - INFO - __main__ - Step 93291: {'lr': 0.0001602924239124856, 'samples': 17911872, 'steps': 93290, 'loss/train': 1.1393589973449707} 11/07/2021 10:21:56 - INFO - __main__ - Step 93292: {'lr': 0.0001602874706025909, 'samples': 17912064, 'steps': 93291, 'loss/train': 0.802944004535675} 11/07/2021 10:21:56 - INFO - __main__ - Step 93293: {'lr': 0.00016028251733311928, 'samples': 17912256, 'steps': 93292, 'loss/train': 1.0615849494934082} 11/07/2021 10:21:57 - INFO - __main__ - Step 93294: {'lr': 0.00016027756410407293, 'samples': 17912448, 'steps': 93293, 'loss/train': 1.011857032775879} 11/07/2021 10:21:57 - INFO - __main__ - Step 93295: {'lr': 0.00016027261091545417, 'samples': 17912640, 'steps': 93294, 'loss/train': 0.6154887676239014} 11/07/2021 10:21:57 - INFO - __main__ - Step 93296: {'lr': 0.00016026765776726515, 'samples': 17912832, 'steps': 93295, 'loss/train': 1.7939962148666382} 11/07/2021 10:21:59 - INFO - __main__ - Step 93297: {'lr': 0.00016026270465950817, 'samples': 17913024, 'steps': 93296, 'loss/train': 1.0213737487792969} 11/07/2021 10:21:59 - INFO - __main__ - Step 93298: {'lr': 0.00016025775159218554, 'samples': 17913216, 'steps': 93297, 'loss/train': 1.4998806715011597} 11/07/2021 10:21:59 - INFO - __main__ - Step 93299: {'lr': 0.00016025279856529928, 'samples': 17913408, 'steps': 93298, 'loss/train': 1.281728982925415} 11/07/2021 10:22:00 - INFO - __main__ - Step 93300: {'lr': 0.0001602478455788517, 'samples': 17913600, 'steps': 93299, 'loss/train': 0.168174147605896} 11/07/2021 10:22:00 - INFO - __main__ - Step 93301: {'lr': 0.00016024289263284508, 'samples': 17913792, 'steps': 93300, 'loss/train': 1.2583531141281128} 11/07/2021 10:22:01 - INFO - __main__ - Step 93302: {'lr': 0.00016023793972728162, 'samples': 17913984, 'steps': 93301, 'loss/train': 1.0839083194732666} 11/07/2021 10:22:01 - INFO - __main__ - Step 93303: {'lr': 0.00016023298686216353, 'samples': 17914176, 'steps': 93302, 'loss/train': 0.6770657300949097} 11/07/2021 10:22:02 - INFO - __main__ - Step 93304: {'lr': 0.0001602280340374931, 'samples': 17914368, 'steps': 93303, 'loss/train': 1.5233436822891235} 11/07/2021 10:22:02 - INFO - __main__ - Step 93305: {'lr': 0.00016022308125327253, 'samples': 17914560, 'steps': 93304, 'loss/train': 1.4632140398025513} 11/07/2021 10:22:03 - INFO - __main__ - Step 93306: {'lr': 0.00016021812850950407, 'samples': 17914752, 'steps': 93305, 'loss/train': 1.5786868333816528} 11/07/2021 10:22:04 - INFO - __main__ - Step 93307: {'lr': 0.00016021317580618987, 'samples': 17914944, 'steps': 93306, 'loss/train': 0.4922959804534912} 11/07/2021 10:22:04 - INFO - __main__ - Step 93308: {'lr': 0.0001602082231433323, 'samples': 17915136, 'steps': 93307, 'loss/train': 1.166126012802124} 11/07/2021 10:22:05 - INFO - __main__ - Step 93309: {'lr': 0.0001602032705209335, 'samples': 17915328, 'steps': 93308, 'loss/train': 0.8597443699836731} 11/07/2021 10:22:05 - INFO - __main__ - Step 93310: {'lr': 0.0001601983179389957, 'samples': 17915520, 'steps': 93309, 'loss/train': 1.7563496828079224} 11/07/2021 10:22:05 - INFO - __main__ - Step 93311: {'lr': 0.00016019336539752118, 'samples': 17915712, 'steps': 93310, 'loss/train': 0.9670037031173706} 11/07/2021 10:22:06 - INFO - __main__ - Step 93312: {'lr': 0.00016018841289651222, 'samples': 17915904, 'steps': 93311, 'loss/train': 1.7641687393188477} 11/07/2021 10:22:07 - INFO - __main__ - Step 93313: {'lr': 0.0001601834604359709, 'samples': 17916096, 'steps': 93312, 'loss/train': 1.22742760181427} 11/07/2021 10:22:07 - INFO - __main__ - Step 93314: {'lr': 0.0001601785080158995, 'samples': 17916288, 'steps': 93313, 'loss/train': 1.1704646348953247} 11/07/2021 10:22:07 - INFO - __main__ - Step 93315: {'lr': 0.00016017355563630032, 'samples': 17916480, 'steps': 93314, 'loss/train': 1.558660626411438} 11/07/2021 10:22:08 - INFO - __main__ - Step 93316: {'lr': 0.0001601686032971755, 'samples': 17916672, 'steps': 93315, 'loss/train': 1.225998878479004} 11/07/2021 10:22:08 - INFO - __main__ - Step 93317: {'lr': 0.00016016365099852736, 'samples': 17916864, 'steps': 93316, 'loss/train': 1.4611767530441284} 11/07/2021 10:22:09 - INFO - __main__ - Step 93318: {'lr': 0.00016015869874035803, 'samples': 17917056, 'steps': 93317, 'loss/train': 1.508829116821289} 11/07/2021 10:22:10 - INFO - __main__ - Step 93319: {'lr': 0.0001601537465226699, 'samples': 17917248, 'steps': 93318, 'loss/train': 0.6550244688987732} 11/07/2021 10:22:10 - INFO - __main__ - Step 93320: {'lr': 0.00016014879434546504, 'samples': 17917440, 'steps': 93319, 'loss/train': 1.0492080450057983} 11/07/2021 10:22:10 - INFO - __main__ - Step 93321: {'lr': 0.00016014384220874577, 'samples': 17917632, 'steps': 93320, 'loss/train': 1.1862413883209229} 11/07/2021 10:22:11 - INFO - __main__ - Step 93322: {'lr': 0.00016013889011251426, 'samples': 17917824, 'steps': 93321, 'loss/train': 1.218714714050293} 11/07/2021 10:22:11 - INFO - __main__ - Step 93323: {'lr': 0.00016013393805677285, 'samples': 17918016, 'steps': 93322, 'loss/train': 1.0305094718933105} 11/07/2021 10:22:12 - INFO - __main__ - Step 93324: {'lr': 0.00016012898604152366, 'samples': 17918208, 'steps': 93323, 'loss/train': 0.9820907711982727} 11/07/2021 10:22:12 - INFO - __main__ - Step 93325: {'lr': 0.00016012403406676903, 'samples': 17918400, 'steps': 93324, 'loss/train': 1.4340835809707642} 11/07/2021 10:22:13 - INFO - __main__ - Step 93326: {'lr': 0.00016011908213251107, 'samples': 17918592, 'steps': 93325, 'loss/train': 1.158226728439331} 11/07/2021 10:22:13 - INFO - __main__ - Step 93327: {'lr': 0.00016011413023875204, 'samples': 17918784, 'steps': 93326, 'loss/train': 1.1671537160873413} 11/07/2021 10:22:13 - INFO - __main__ - Step 93328: {'lr': 0.00016010917838549422, 'samples': 17918976, 'steps': 93327, 'loss/train': 1.5071735382080078} 11/07/2021 10:22:14 - INFO - __main__ - Step 93329: {'lr': 0.0001601042265727398, 'samples': 17919168, 'steps': 93328, 'loss/train': 1.6509463787078857} 11/07/2021 10:22:15 - INFO - __main__ - Step 93330: {'lr': 0.000160099274800491, 'samples': 17919360, 'steps': 93329, 'loss/train': 1.3389744758605957} 11/07/2021 10:22:15 - INFO - __main__ - Step 93331: {'lr': 0.00016009432306875014, 'samples': 17919552, 'steps': 93330, 'loss/train': 1.3539999723434448} 11/07/2021 10:22:15 - INFO - __main__ - Step 93332: {'lr': 0.00016008937137751935, 'samples': 17919744, 'steps': 93331, 'loss/train': 1.888564109802246} 11/07/2021 10:22:16 - INFO - __main__ - Step 93333: {'lr': 0.00016008441972680093, 'samples': 17919936, 'steps': 93332, 'loss/train': 1.439955711364746} 11/07/2021 10:22:17 - INFO - __main__ - Step 93334: {'lr': 0.00016007946811659704, 'samples': 17920128, 'steps': 93333, 'loss/train': 1.3240231275558472} 11/07/2021 10:22:18 - INFO - __main__ - Step 93335: {'lr': 0.00016007451654691, 'samples': 17920320, 'steps': 93334, 'loss/train': 1.586215615272522} 11/07/2021 10:22:18 - INFO - __main__ - Step 93336: {'lr': 0.00016006956501774195, 'samples': 17920512, 'steps': 93335, 'loss/train': 1.8493716716766357} 11/07/2021 10:22:18 - INFO - __main__ - Step 93337: {'lr': 0.00016006461352909522, 'samples': 17920704, 'steps': 93336, 'loss/train': 1.277133822441101} 11/07/2021 10:22:19 - INFO - __main__ - Step 93338: {'lr': 0.000160059662080972, 'samples': 17920896, 'steps': 93337, 'loss/train': 1.8988744020462036} 11/07/2021 10:22:19 - INFO - __main__ - Step 93339: {'lr': 0.00016005471067337453, 'samples': 17921088, 'steps': 93338, 'loss/train': 1.6285816431045532} 11/07/2021 10:22:20 - INFO - __main__ - Step 93340: {'lr': 0.00016004975930630495, 'samples': 17921280, 'steps': 93339, 'loss/train': 1.1589951515197754} 11/07/2021 10:22:20 - INFO - __main__ - Step 93341: {'lr': 0.00016004480797976556, 'samples': 17921472, 'steps': 93340, 'loss/train': 1.6609057188034058} 11/07/2021 10:22:21 - INFO - __main__ - Step 93342: {'lr': 0.00016003985669375858, 'samples': 17921664, 'steps': 93341, 'loss/train': 1.3204911947250366} 11/07/2021 10:22:21 - INFO - __main__ - Step 93343: {'lr': 0.00016003490544828631, 'samples': 17921856, 'steps': 93342, 'loss/train': 1.5297120809555054} 11/07/2021 10:22:21 - INFO - __main__ - Step 93344: {'lr': 0.00016002995424335088, 'samples': 17922048, 'steps': 93343, 'loss/train': 1.5883945226669312} 11/07/2021 10:22:22 - INFO - __main__ - Step 93345: {'lr': 0.00016002500307895457, 'samples': 17922240, 'steps': 93344, 'loss/train': 1.0378925800323486} 11/07/2021 10:22:23 - INFO - __main__ - Step 93346: {'lr': 0.0001600200519550996, 'samples': 17922432, 'steps': 93345, 'loss/train': 1.2606616020202637} 11/07/2021 10:22:23 - INFO - __main__ - Step 93347: {'lr': 0.0001600151008717882, 'samples': 17922624, 'steps': 93346, 'loss/train': 1.5336722135543823} 11/07/2021 10:22:23 - INFO - __main__ - Step 93348: {'lr': 0.00016001014982902268, 'samples': 17922816, 'steps': 93347, 'loss/train': 0.8376100659370422} 11/07/2021 10:22:24 - INFO - __main__ - Step 93349: {'lr': 0.00016000519882680513, 'samples': 17923008, 'steps': 93348, 'loss/train': 0.9255864024162292} 11/07/2021 10:22:25 - INFO - __main__ - Step 93350: {'lr': 0.00016000024786513782, 'samples': 17923200, 'steps': 93349, 'loss/train': 1.3925141096115112} 11/07/2021 10:22:25 - INFO - __main__ - Step 93351: {'lr': 0.00015999529694402307, 'samples': 17923392, 'steps': 93350, 'loss/train': 1.2785764932632446} 11/07/2021 10:22:26 - INFO - __main__ - Step 93352: {'lr': 0.0001599903460634631, 'samples': 17923584, 'steps': 93351, 'loss/train': 1.3689568042755127} 11/07/2021 10:22:26 - INFO - __main__ - Step 93353: {'lr': 0.00015998539522346, 'samples': 17923776, 'steps': 93352, 'loss/train': 1.537091612815857} 11/07/2021 10:22:26 - INFO - __main__ - Step 93354: {'lr': 0.0001599804444240161, 'samples': 17923968, 'steps': 93353, 'loss/train': 1.583892583847046} 11/07/2021 10:22:27 - INFO - __main__ - Step 93355: {'lr': 0.00015997549366513362, 'samples': 17924160, 'steps': 93354, 'loss/train': 1.5352180004119873} 11/07/2021 10:22:28 - INFO - __main__ - Step 93356: {'lr': 0.0001599705429468148, 'samples': 17924352, 'steps': 93355, 'loss/train': 1.2390722036361694} 11/07/2021 10:22:28 - INFO - __main__ - Step 93357: {'lr': 0.00015996559226906187, 'samples': 17924544, 'steps': 93356, 'loss/train': 1.8095214366912842} 11/07/2021 10:22:28 - INFO - __main__ - Step 93358: {'lr': 0.00015996064163187706, 'samples': 17924736, 'steps': 93357, 'loss/train': 1.2125914096832275} 11/07/2021 10:22:29 - INFO - __main__ - Step 93359: {'lr': 0.00015995569103526263, 'samples': 17924928, 'steps': 93358, 'loss/train': 1.748632788658142} 11/07/2021 10:22:29 - INFO - __main__ - Step 93360: {'lr': 0.00015995074047922073, 'samples': 17925120, 'steps': 93359, 'loss/train': 1.412149429321289} 11/07/2021 10:22:30 - INFO - __main__ - Step 93361: {'lr': 0.00015994578996375363, 'samples': 17925312, 'steps': 93360, 'loss/train': 1.5442421436309814} 11/07/2021 10:22:30 - INFO - __main__ - Step 93362: {'lr': 0.00015994083948886356, 'samples': 17925504, 'steps': 93361, 'loss/train': 0.8046853542327881} 11/07/2021 10:22:31 - INFO - __main__ - Step 93363: {'lr': 0.00015993588905455282, 'samples': 17925696, 'steps': 93362, 'loss/train': 1.5072505474090576} 11/07/2021 10:22:31 - INFO - __main__ - Step 93364: {'lr': 0.00015993093866082354, 'samples': 17925888, 'steps': 93363, 'loss/train': 1.5491082668304443} 11/07/2021 10:22:32 - INFO - __main__ - Step 93365: {'lr': 0.00015992598830767802, 'samples': 17926080, 'steps': 93364, 'loss/train': 1.285848617553711} 11/07/2021 10:22:32 - INFO - __main__ - Step 93366: {'lr': 0.00015992103799511843, 'samples': 17926272, 'steps': 93365, 'loss/train': 1.0556086301803589} 11/07/2021 10:22:33 - INFO - __main__ - Step 93367: {'lr': 0.000159916087723147, 'samples': 17926464, 'steps': 93366, 'loss/train': 0.9493730664253235} 11/07/2021 10:22:33 - INFO - __main__ - Step 93368: {'lr': 0.000159911137491766, 'samples': 17926656, 'steps': 93367, 'loss/train': 1.5883514881134033} 11/07/2021 10:22:33 - INFO - __main__ - Step 93369: {'lr': 0.00015990618730097768, 'samples': 17926848, 'steps': 93368, 'loss/train': 1.477808952331543} 11/07/2021 10:22:34 - INFO - __main__ - Step 93370: {'lr': 0.00015990123715078428, 'samples': 17927040, 'steps': 93369, 'loss/train': 1.3018022775650024} 11/07/2021 10:22:35 - INFO - __main__ - Step 93371: {'lr': 0.00015989628704118794, 'samples': 17927232, 'steps': 93370, 'loss/train': 1.5959234237670898} 11/07/2021 10:22:35 - INFO - __main__ - Step 93372: {'lr': 0.00015989133697219093, 'samples': 17927424, 'steps': 93371, 'loss/train': 1.6739630699157715} 11/07/2021 10:22:36 - INFO - __main__ - Step 93373: {'lr': 0.00015988638694379553, 'samples': 17927616, 'steps': 93372, 'loss/train': 1.6718153953552246} 11/07/2021 10:22:36 - INFO - __main__ - Step 93374: {'lr': 0.0001598814369560039, 'samples': 17927808, 'steps': 93373, 'loss/train': 1.201287865638733} 11/07/2021 10:22:36 - INFO - __main__ - Step 93375: {'lr': 0.0001598764870088183, 'samples': 17928000, 'steps': 93374, 'loss/train': 0.5434019565582275} 11/07/2021 10:22:37 - INFO - __main__ - Step 93376: {'lr': 0.000159871537102241, 'samples': 17928192, 'steps': 93375, 'loss/train': 1.3386708498001099} 11/07/2021 10:22:38 - INFO - __main__ - Step 93377: {'lr': 0.00015986658723627417, 'samples': 17928384, 'steps': 93376, 'loss/train': 1.5463814735412598} 11/07/2021 10:22:38 - INFO - __main__ - Step 93378: {'lr': 0.00015986163741092005, 'samples': 17928576, 'steps': 93377, 'loss/train': 1.1636624336242676} 11/07/2021 10:22:38 - INFO - __main__ - Step 93379: {'lr': 0.00015985668762618095, 'samples': 17928768, 'steps': 93378, 'loss/train': 1.4412678480148315} 11/07/2021 10:22:39 - INFO - __main__ - Step 93380: {'lr': 0.00015985173788205897, 'samples': 17928960, 'steps': 93379, 'loss/train': 1.082363486289978} 11/07/2021 10:22:40 - INFO - __main__ - Step 93381: {'lr': 0.0001598467881785565, 'samples': 17929152, 'steps': 93380, 'loss/train': 1.3985660076141357} 11/07/2021 10:22:40 - INFO - __main__ - Step 93382: {'lr': 0.00015984183851567557, 'samples': 17929344, 'steps': 93381, 'loss/train': 1.4428898096084595} 11/07/2021 10:22:40 - INFO - __main__ - Step 93383: {'lr': 0.00015983688889341857, 'samples': 17929536, 'steps': 93382, 'loss/train': 1.2187068462371826} 11/07/2021 10:22:41 - INFO - __main__ - Step 93384: {'lr': 0.00015983193931178762, 'samples': 17929728, 'steps': 93383, 'loss/train': 1.1490390300750732} 11/07/2021 10:22:41 - INFO - __main__ - Step 93385: {'lr': 0.0001598269897707851, 'samples': 17929920, 'steps': 93384, 'loss/train': 1.269249439239502} 11/07/2021 10:22:41 - INFO - __main__ - Step 93386: {'lr': 0.00015982204027041306, 'samples': 17930112, 'steps': 93385, 'loss/train': 1.0579614639282227} 11/07/2021 10:22:42 - INFO - __main__ - Step 93387: {'lr': 0.00015981709081067382, 'samples': 17930304, 'steps': 93386, 'loss/train': 1.0951753854751587} 11/07/2021 10:22:43 - INFO - __main__ - Step 93388: {'lr': 0.00015981214139156963, 'samples': 17930496, 'steps': 93387, 'loss/train': 0.9540257453918457} 11/07/2021 10:22:43 - INFO - __main__ - Step 93389: {'lr': 0.00015980719201310272, 'samples': 17930688, 'steps': 93388, 'loss/train': 1.613396167755127} 11/07/2021 10:22:43 - INFO - __main__ - Step 93390: {'lr': 0.00015980224267527526, 'samples': 17930880, 'steps': 93389, 'loss/train': 1.3528724908828735} 11/07/2021 10:22:44 - INFO - __main__ - Step 93391: {'lr': 0.00015979729337808955, 'samples': 17931072, 'steps': 93390, 'loss/train': 0.7232038974761963} 11/07/2021 10:22:45 - INFO - __main__ - Step 93392: {'lr': 0.00015979234412154787, 'samples': 17931264, 'steps': 93391, 'loss/train': 1.468213677406311} 11/07/2021 10:22:45 - INFO - __main__ - Step 93393: {'lr': 0.00015978739490565225, 'samples': 17931456, 'steps': 93392, 'loss/train': 1.2982057332992554} 11/07/2021 10:22:46 - INFO - __main__ - Step 93394: {'lr': 0.00015978244573040506, 'samples': 17931648, 'steps': 93393, 'loss/train': 1.5062334537506104} 11/07/2021 10:22:46 - INFO - __main__ - Step 93395: {'lr': 0.0001597774965958085, 'samples': 17931840, 'steps': 93394, 'loss/train': 1.3580207824707031} 11/07/2021 10:22:46 - INFO - __main__ - Step 93396: {'lr': 0.0001597725475018648, 'samples': 17932032, 'steps': 93395, 'loss/train': 1.356416940689087} 11/07/2021 10:22:47 - INFO - __main__ - Step 93397: {'lr': 0.00015976759844857623, 'samples': 17932224, 'steps': 93396, 'loss/train': 0.3429526388645172} 11/07/2021 10:22:48 - INFO - __main__ - Step 93398: {'lr': 0.000159762649435945, 'samples': 17932416, 'steps': 93397, 'loss/train': 1.471126914024353} 11/07/2021 10:22:48 - INFO - __main__ - Step 93399: {'lr': 0.00015975770046397326, 'samples': 17932608, 'steps': 93398, 'loss/train': 1.436238169670105} 11/07/2021 10:22:48 - INFO - __main__ - Step 93400: {'lr': 0.00015975275153266334, 'samples': 17932800, 'steps': 93399, 'loss/train': 1.1517456769943237} 11/07/2021 10:22:49 - INFO - __main__ - Step 93401: {'lr': 0.00015974780264201743, 'samples': 17932992, 'steps': 93400, 'loss/train': 1.236457109451294} 11/07/2021 10:22:50 - INFO - __main__ - Step 93402: {'lr': 0.0001597428537920378, 'samples': 17933184, 'steps': 93401, 'loss/train': 1.346336007118225} 11/07/2021 10:22:50 - INFO - __main__ - Step 93403: {'lr': 0.0001597379049827266, 'samples': 17933376, 'steps': 93402, 'loss/train': 0.8091041445732117} 11/07/2021 10:22:50 - INFO - __main__ - Step 93404: {'lr': 0.00015973295621408615, 'samples': 17933568, 'steps': 93403, 'loss/train': 1.5344388484954834} 11/07/2021 10:22:51 - INFO - __main__ - Step 93405: {'lr': 0.0001597280074861186, 'samples': 17933760, 'steps': 93404, 'loss/train': 1.2135932445526123} 11/07/2021 10:22:51 - INFO - __main__ - Step 93406: {'lr': 0.00015972305879882636, 'samples': 17933952, 'steps': 93405, 'loss/train': 1.181816577911377} 11/07/2021 10:22:52 - INFO - __main__ - Step 93407: {'lr': 0.00015971811015221137, 'samples': 17934144, 'steps': 93406, 'loss/train': 1.2104270458221436} 11/07/2021 10:22:52 - INFO - __main__ - Step 93408: {'lr': 0.00015971316154627605, 'samples': 17934336, 'steps': 93407, 'loss/train': 1.3037474155426025} 11/07/2021 10:22:53 - INFO - __main__ - Step 93409: {'lr': 0.00015970821298102257, 'samples': 17934528, 'steps': 93408, 'loss/train': 1.374690055847168} 11/07/2021 10:22:53 - INFO - __main__ - Step 93410: {'lr': 0.00015970326445645315, 'samples': 17934720, 'steps': 93409, 'loss/train': 1.5245120525360107} 11/07/2021 10:22:54 - INFO - __main__ - Step 93411: {'lr': 0.00015969831597257005, 'samples': 17934912, 'steps': 93410, 'loss/train': 1.1899354457855225} 11/07/2021 10:22:54 - INFO - __main__ - Step 93412: {'lr': 0.0001596933675293755, 'samples': 17935104, 'steps': 93411, 'loss/train': 1.3616931438446045} 11/07/2021 10:22:55 - INFO - __main__ - Step 93413: {'lr': 0.00015968841912687175, 'samples': 17935296, 'steps': 93412, 'loss/train': 1.0466395616531372} 11/07/2021 10:22:55 - INFO - __main__ - Step 93414: {'lr': 0.000159683470765061, 'samples': 17935488, 'steps': 93413, 'loss/train': 1.5804595947265625} 11/07/2021 10:22:56 - INFO - __main__ - Step 93415: {'lr': 0.00015967852244394548, 'samples': 17935680, 'steps': 93414, 'loss/train': 1.2909077405929565} 11/07/2021 10:22:56 - INFO - __main__ - Step 93416: {'lr': 0.00015967357416352742, 'samples': 17935872, 'steps': 93415, 'loss/train': 1.1303985118865967} 11/07/2021 10:22:57 - INFO - __main__ - Step 93417: {'lr': 0.00015966862592380906, 'samples': 17936064, 'steps': 93416, 'loss/train': 1.16240394115448} 11/07/2021 10:22:57 - INFO - __main__ - Step 93418: {'lr': 0.00015966367772479262, 'samples': 17936256, 'steps': 93417, 'loss/train': 1.4809303283691406} 11/07/2021 10:22:58 - INFO - __main__ - Step 93419: {'lr': 0.0001596587295664804, 'samples': 17936448, 'steps': 93418, 'loss/train': 1.0148704051971436} 11/07/2021 10:22:58 - INFO - __main__ - Step 93420: {'lr': 0.00015965378144887455, 'samples': 17936640, 'steps': 93419, 'loss/train': 1.5792804956436157} 11/07/2021 10:22:58 - INFO - __main__ - Step 93421: {'lr': 0.00015964883337197722, 'samples': 17936832, 'steps': 93420, 'loss/train': 1.3006699085235596} 11/07/2021 10:22:59 - INFO - __main__ - Step 93422: {'lr': 0.00015964388533579077, 'samples': 17937024, 'steps': 93421, 'loss/train': 0.1496804803609848} 11/07/2021 10:23:00 - INFO - __main__ - Step 93423: {'lr': 0.0001596389373403174, 'samples': 17937216, 'steps': 93422, 'loss/train': 1.304398775100708} 11/07/2021 10:23:00 - INFO - __main__ - Step 93424: {'lr': 0.0001596339893855593, 'samples': 17937408, 'steps': 93423, 'loss/train': 1.7103400230407715} 11/07/2021 10:23:00 - INFO - __main__ - Step 93425: {'lr': 0.00015962904147151874, 'samples': 17937600, 'steps': 93424, 'loss/train': 1.2906298637390137} 11/07/2021 10:23:01 - INFO - __main__ - Step 93426: {'lr': 0.00015962409359819796, 'samples': 17937792, 'steps': 93425, 'loss/train': 1.4245713949203491} 11/07/2021 10:23:01 - INFO - __main__ - Step 93427: {'lr': 0.00015961914576559917, 'samples': 17937984, 'steps': 93426, 'loss/train': 1.4095485210418701} 11/07/2021 10:23:02 - INFO - __main__ - Step 93428: {'lr': 0.00015961419797372455, 'samples': 17938176, 'steps': 93427, 'loss/train': 1.3050135374069214} 11/07/2021 10:23:03 - INFO - __main__ - Step 93429: {'lr': 0.00015960925022257645, 'samples': 17938368, 'steps': 93428, 'loss/train': 1.4525346755981445} 11/07/2021 10:23:03 - INFO - __main__ - Step 93430: {'lr': 0.00015960430251215697, 'samples': 17938560, 'steps': 93429, 'loss/train': 1.9940526485443115} 11/07/2021 10:23:04 - INFO - __main__ - Step 93431: {'lr': 0.0001595993548424684, 'samples': 17938752, 'steps': 93430, 'loss/train': 1.5119342803955078} 11/07/2021 10:23:04 - INFO - __main__ - Step 93432: {'lr': 0.000159594407213513, 'samples': 17938944, 'steps': 93431, 'loss/train': 0.9699779152870178} 11/07/2021 10:23:05 - INFO - __main__ - Step 93433: {'lr': 0.00015958945962529303, 'samples': 17939136, 'steps': 93432, 'loss/train': 1.0496759414672852} 11/07/2021 10:23:05 - INFO - __main__ - Step 93434: {'lr': 0.0001595845120778106, 'samples': 17939328, 'steps': 93433, 'loss/train': 1.5232505798339844} 11/07/2021 10:23:06 - INFO - __main__ - Step 93435: {'lr': 0.00015957956457106795, 'samples': 17939520, 'steps': 93434, 'loss/train': 1.4756759405136108} 11/07/2021 10:23:06 - INFO - __main__ - Step 93436: {'lr': 0.00015957461710506738, 'samples': 17939712, 'steps': 93435, 'loss/train': 1.3042292594909668} 11/07/2021 10:23:06 - INFO - __main__ - Step 93437: {'lr': 0.00015956966967981107, 'samples': 17939904, 'steps': 93436, 'loss/train': 1.8437833786010742} 11/07/2021 10:23:07 - INFO - __main__ - Step 93438: {'lr': 0.00015956472229530127, 'samples': 17940096, 'steps': 93437, 'loss/train': 1.745011568069458} 11/07/2021 10:23:08 - INFO - __main__ - Step 93439: {'lr': 0.00015955977495154023, 'samples': 17940288, 'steps': 93438, 'loss/train': 0.9665083289146423} 11/07/2021 10:23:08 - INFO - __main__ - Step 93440: {'lr': 0.00015955482764853013, 'samples': 17940480, 'steps': 93439, 'loss/train': 1.3155426979064941} 11/07/2021 10:23:08 - INFO - __main__ - Step 93441: {'lr': 0.00015954988038627328, 'samples': 17940672, 'steps': 93440, 'loss/train': 1.4641633033752441} 11/07/2021 10:23:09 - INFO - __main__ - Step 93442: {'lr': 0.00015954493316477182, 'samples': 17940864, 'steps': 93441, 'loss/train': 1.3709615468978882} 11/07/2021 10:23:09 - INFO - __main__ - Step 93443: {'lr': 0.00015953998598402803, 'samples': 17941056, 'steps': 93442, 'loss/train': 1.519850254058838} 11/07/2021 10:23:10 - INFO - __main__ - Step 93444: {'lr': 0.00015953503884404412, 'samples': 17941248, 'steps': 93443, 'loss/train': 1.31830894947052} 11/07/2021 10:23:10 - INFO - __main__ - Step 93445: {'lr': 0.0001595300917448223, 'samples': 17941440, 'steps': 93444, 'loss/train': 1.3507784605026245} 11/07/2021 10:23:11 - INFO - __main__ - Step 93446: {'lr': 0.00015952514468636498, 'samples': 17941632, 'steps': 93445, 'loss/train': 1.362713098526001} 11/07/2021 10:23:11 - INFO - __main__ - Step 93447: {'lr': 0.0001595201976686741, 'samples': 17941824, 'steps': 93446, 'loss/train': 2.6289491653442383} 11/07/2021 10:23:11 - INFO - __main__ - Step 93448: {'lr': 0.00015951525069175205, 'samples': 17942016, 'steps': 93447, 'loss/train': 1.24222731590271} 11/07/2021 10:23:13 - INFO - __main__ - Step 93449: {'lr': 0.000159510303755601, 'samples': 17942208, 'steps': 93448, 'loss/train': 1.3348506689071655} 11/07/2021 10:23:13 - INFO - __main__ - Step 93450: {'lr': 0.00015950535686022323, 'samples': 17942400, 'steps': 93449, 'loss/train': 1.4460581541061401} 11/07/2021 10:23:13 - INFO - __main__ - Step 93451: {'lr': 0.00015950041000562093, 'samples': 17942592, 'steps': 93450, 'loss/train': 0.9734323620796204} 11/07/2021 10:23:14 - INFO - __main__ - Step 93452: {'lr': 0.00015949546319179636, 'samples': 17942784, 'steps': 93451, 'loss/train': 1.3824489116668701} 11/07/2021 10:23:14 - INFO - __main__ - Step 93453: {'lr': 0.00015949051641875173, 'samples': 17942976, 'steps': 93452, 'loss/train': 0.4425801634788513} 11/07/2021 10:23:15 - INFO - __main__ - Step 93454: {'lr': 0.0001594855696864893, 'samples': 17943168, 'steps': 93453, 'loss/train': 1.3420991897583008} 11/07/2021 10:23:15 - INFO - __main__ - Step 93455: {'lr': 0.00015948062299501125, 'samples': 17943360, 'steps': 93454, 'loss/train': 1.2977830171585083} 11/07/2021 10:23:16 - INFO - __main__ - Step 93456: {'lr': 0.00015947567634431984, 'samples': 17943552, 'steps': 93455, 'loss/train': 0.8303061723709106} 11/07/2021 10:23:16 - INFO - __main__ - Step 93457: {'lr': 0.00015947072973441728, 'samples': 17943744, 'steps': 93456, 'loss/train': 1.4281741380691528} 11/07/2021 10:23:16 - INFO - __main__ - Step 93458: {'lr': 0.00015946578316530585, 'samples': 17943936, 'steps': 93457, 'loss/train': 1.2539581060409546} 11/07/2021 10:23:18 - INFO - __main__ - Step 93459: {'lr': 0.0001594608366369877, 'samples': 17944128, 'steps': 93458, 'loss/train': 1.5316550731658936} 11/07/2021 10:23:18 - INFO - __main__ - Step 93460: {'lr': 0.00015945589014946523, 'samples': 17944320, 'steps': 93459, 'loss/train': 1.7754524946212769} 11/07/2021 10:23:18 - INFO - __main__ - Step 93461: {'lr': 0.00015945094370274044, 'samples': 17944512, 'steps': 93460, 'loss/train': 1.743786334991455} 11/07/2021 10:23:19 - INFO - __main__ - Step 93462: {'lr': 0.00015944599729681563, 'samples': 17944704, 'steps': 93461, 'loss/train': 1.866256594657898} 11/07/2021 10:23:19 - INFO - __main__ - Step 93463: {'lr': 0.0001594410509316931, 'samples': 17944896, 'steps': 93462, 'loss/train': 0.764835000038147} 11/07/2021 10:23:19 - INFO - __main__ - Step 93464: {'lr': 0.000159436104607375, 'samples': 17945088, 'steps': 93463, 'loss/train': 1.469212293624878} 11/07/2021 10:23:20 - INFO - __main__ - Step 93465: {'lr': 0.0001594311583238636, 'samples': 17945280, 'steps': 93464, 'loss/train': 1.6116161346435547} 11/07/2021 10:23:21 - INFO - __main__ - Step 93466: {'lr': 0.00015942621208116112, 'samples': 17945472, 'steps': 93465, 'loss/train': 1.4897562265396118} 11/07/2021 10:23:21 - INFO - __main__ - Step 93467: {'lr': 0.0001594212658792698, 'samples': 17945664, 'steps': 93466, 'loss/train': 1.3315950632095337} 11/07/2021 10:23:21 - INFO - __main__ - Step 93468: {'lr': 0.00015941631971819184, 'samples': 17945856, 'steps': 93467, 'loss/train': 1.586348295211792} 11/07/2021 10:23:22 - INFO - __main__ - Step 93469: {'lr': 0.0001594113735979295, 'samples': 17946048, 'steps': 93468, 'loss/train': 1.297717809677124} 11/07/2021 10:23:23 - INFO - __main__ - Step 93470: {'lr': 0.000159406427518485, 'samples': 17946240, 'steps': 93469, 'loss/train': 1.342858910560608} 11/07/2021 10:23:23 - INFO - __main__ - Step 93471: {'lr': 0.00015940148147986058, 'samples': 17946432, 'steps': 93470, 'loss/train': 1.655980110168457} 11/07/2021 10:23:24 - INFO - __main__ - Step 93472: {'lr': 0.00015939653548205848, 'samples': 17946624, 'steps': 93471, 'loss/train': 0.9325956106185913} 11/07/2021 10:23:24 - INFO - __main__ - Step 93473: {'lr': 0.00015939158952508093, 'samples': 17946816, 'steps': 93472, 'loss/train': 2.023674488067627} 11/07/2021 10:23:24 - INFO - __main__ - Step 93474: {'lr': 0.00015938664360893006, 'samples': 17947008, 'steps': 93473, 'loss/train': 1.2830758094787598} 11/07/2021 10:23:25 - INFO - __main__ - Step 93475: {'lr': 0.00015938169773360817, 'samples': 17947200, 'steps': 93474, 'loss/train': 1.130565881729126} 11/07/2021 10:23:26 - INFO - __main__ - Step 93476: {'lr': 0.00015937675189911749, 'samples': 17947392, 'steps': 93475, 'loss/train': 1.7613223791122437} 11/07/2021 10:23:26 - INFO - __main__ - Step 93477: {'lr': 0.00015937180610546027, 'samples': 17947584, 'steps': 93476, 'loss/train': 1.381459355354309} 11/07/2021 10:23:26 - INFO - __main__ - Step 93478: {'lr': 0.0001593668603526387, 'samples': 17947776, 'steps': 93477, 'loss/train': 1.4063270092010498} 11/07/2021 10:23:27 - INFO - __main__ - Step 93479: {'lr': 0.00015936191464065502, 'samples': 17947968, 'steps': 93478, 'loss/train': 1.4178364276885986} 11/07/2021 10:23:28 - INFO - __main__ - Step 93480: {'lr': 0.00015935696896951146, 'samples': 17948160, 'steps': 93479, 'loss/train': 1.0764294862747192} 11/07/2021 10:23:28 - INFO - __main__ - Step 93481: {'lr': 0.00015935202333921026, 'samples': 17948352, 'steps': 93480, 'loss/train': 1.3927592039108276} 11/07/2021 10:23:28 - INFO - __main__ - Step 93482: {'lr': 0.00015934707774975363, 'samples': 17948544, 'steps': 93481, 'loss/train': 2.1509125232696533} 11/07/2021 10:23:29 - INFO - __main__ - Step 93483: {'lr': 0.00015934213220114386, 'samples': 17948736, 'steps': 93482, 'loss/train': 1.2513784170150757} 11/07/2021 10:23:29 - INFO - __main__ - Step 93484: {'lr': 0.0001593371866933831, 'samples': 17948928, 'steps': 93483, 'loss/train': 1.2201814651489258} 11/07/2021 10:23:30 - INFO - __main__ - Step 93485: {'lr': 0.00015933224122647354, 'samples': 17949120, 'steps': 93484, 'loss/train': 1.594870686531067} 11/07/2021 10:23:31 - INFO - __main__ - Step 93486: {'lr': 0.0001593272958004176, 'samples': 17949312, 'steps': 93485, 'loss/train': 0.9774329662322998} 11/07/2021 10:23:31 - INFO - __main__ - Step 93487: {'lr': 0.0001593223504152173, 'samples': 17949504, 'steps': 93486, 'loss/train': 1.7271301746368408} 11/07/2021 10:23:31 - INFO - __main__ - Step 93488: {'lr': 0.00015931740507087495, 'samples': 17949696, 'steps': 93487, 'loss/train': 1.272484540939331} 11/07/2021 10:23:32 - INFO - __main__ - Step 93489: {'lr': 0.00015931245976739277, 'samples': 17949888, 'steps': 93488, 'loss/train': 1.5391277074813843} 11/07/2021 10:23:32 - INFO - __main__ - Step 93490: {'lr': 0.00015930751450477299, 'samples': 17950080, 'steps': 93489, 'loss/train': 1.4680818319320679} 11/07/2021 10:23:33 - INFO - __main__ - Step 93491: {'lr': 0.00015930256928301783, 'samples': 17950272, 'steps': 93490, 'loss/train': 0.8702667355537415} 11/07/2021 10:23:33 - INFO - __main__ - Step 93492: {'lr': 0.00015929762410212957, 'samples': 17950464, 'steps': 93491, 'loss/train': 1.2433359622955322} 11/07/2021 10:23:34 - INFO - __main__ - Step 93493: {'lr': 0.00015929267896211042, 'samples': 17950656, 'steps': 93492, 'loss/train': 1.5028421878814697} 11/07/2021 10:23:34 - INFO - __main__ - Step 93494: {'lr': 0.00015928773386296257, 'samples': 17950848, 'steps': 93493, 'loss/train': 1.4550092220306396} 11/07/2021 10:23:34 - INFO - __main__ - Step 93495: {'lr': 0.00015928278880468827, 'samples': 17951040, 'steps': 93494, 'loss/train': 1.3855310678482056} 11/07/2021 10:23:36 - INFO - __main__ - Step 93496: {'lr': 0.00015927784378728975, 'samples': 17951232, 'steps': 93495, 'loss/train': 1.392161250114441} 11/07/2021 10:23:36 - INFO - __main__ - Step 93497: {'lr': 0.00015927289881076924, 'samples': 17951424, 'steps': 93496, 'loss/train': 1.3462992906570435} 11/07/2021 10:23:36 - INFO - __main__ - Step 93498: {'lr': 0.0001592679538751289, 'samples': 17951616, 'steps': 93497, 'loss/train': 1.5030436515808105} 11/07/2021 10:23:37 - INFO - __main__ - Step 93499: {'lr': 0.00015926300898037104, 'samples': 17951808, 'steps': 93498, 'loss/train': 1.5950417518615723} 11/07/2021 10:23:37 - INFO - __main__ - Step 93500: {'lr': 0.00015925806412649796, 'samples': 17952000, 'steps': 93499, 'loss/train': 1.5834851264953613} 11/07/2021 10:23:38 - INFO - __main__ - Step 93501: {'lr': 0.00015925311931351172, 'samples': 17952192, 'steps': 93500, 'loss/train': 1.5127534866333008} 11/07/2021 10:23:38 - INFO - __main__ - Step 93502: {'lr': 0.00015924817454141462, 'samples': 17952384, 'steps': 93501, 'loss/train': 1.4863380193710327} 11/07/2021 10:23:39 - INFO - __main__ - Step 93503: {'lr': 0.0001592432298102089, 'samples': 17952576, 'steps': 93502, 'loss/train': 0.9682807326316833} 11/07/2021 10:23:39 - INFO - __main__ - Step 93504: {'lr': 0.0001592382851198968, 'samples': 17952768, 'steps': 93503, 'loss/train': 1.528629183769226} 11/07/2021 10:23:39 - INFO - __main__ - Step 93505: {'lr': 0.00015923334047048056, 'samples': 17952960, 'steps': 93504, 'loss/train': 1.4581791162490845} 11/07/2021 10:23:40 - INFO - __main__ - Step 93506: {'lr': 0.0001592283958619623, 'samples': 17953152, 'steps': 93505, 'loss/train': 0.9015491008758545} 11/07/2021 10:23:41 - INFO - __main__ - Step 93507: {'lr': 0.00015922345129434435, 'samples': 17953344, 'steps': 93506, 'loss/train': 1.3016589879989624} 11/07/2021 10:23:41 - INFO - __main__ - Step 93508: {'lr': 0.00015921850676762892, 'samples': 17953536, 'steps': 93507, 'loss/train': 1.737780213356018} 11/07/2021 10:23:41 - INFO - __main__ - Step 93509: {'lr': 0.0001592135622818182, 'samples': 17953728, 'steps': 93508, 'loss/train': 1.105638861656189} 11/07/2021 10:23:42 - INFO - __main__ - Step 93510: {'lr': 0.00015920861783691448, 'samples': 17953920, 'steps': 93509, 'loss/train': 1.2847329378128052} 11/07/2021 10:23:43 - INFO - __main__ - Step 93511: {'lr': 0.00015920367343291993, 'samples': 17954112, 'steps': 93510, 'loss/train': 1.2546074390411377} 11/07/2021 10:23:43 - INFO - __main__ - Step 93512: {'lr': 0.00015919872906983685, 'samples': 17954304, 'steps': 93511, 'loss/train': 1.006738543510437} 11/07/2021 10:23:44 - INFO - __main__ - Step 93513: {'lr': 0.00015919378474766742, 'samples': 17954496, 'steps': 93512, 'loss/train': 1.3494116067886353} 11/07/2021 10:23:44 - INFO - __main__ - Step 93514: {'lr': 0.00015918884046641383, 'samples': 17954688, 'steps': 93513, 'loss/train': 1.406718134880066} 11/07/2021 10:23:44 - INFO - __main__ - Step 93515: {'lr': 0.0001591838962260784, 'samples': 17954880, 'steps': 93514, 'loss/train': 1.6604782342910767} 11/07/2021 10:23:45 - INFO - __main__ - Step 93516: {'lr': 0.00015917895202666326, 'samples': 17955072, 'steps': 93515, 'loss/train': 1.3102458715438843} 11/07/2021 10:23:46 - INFO - __main__ - Step 93517: {'lr': 0.00015917400786817068, 'samples': 17955264, 'steps': 93516, 'loss/train': 1.5028936862945557} 11/07/2021 10:23:47 - INFO - __main__ - Step 93518: {'lr': 0.0001591690637506029, 'samples': 17955456, 'steps': 93517, 'loss/train': 0.8000648021697998} 11/07/2021 10:23:47 - INFO - __main__ - Step 93519: {'lr': 0.00015916411967396214, 'samples': 17955648, 'steps': 93518, 'loss/train': 1.3989967107772827} 11/07/2021 10:23:47 - INFO - __main__ - Step 93520: {'lr': 0.0001591591756382506, 'samples': 17955840, 'steps': 93519, 'loss/train': 1.7556369304656982} 11/07/2021 10:23:48 - INFO - __main__ - Step 93521: {'lr': 0.00015915423164347055, 'samples': 17956032, 'steps': 93520, 'loss/train': 1.6808514595031738} 11/07/2021 10:23:48 - INFO - __main__ - Step 93522: {'lr': 0.00015914928768962422, 'samples': 17956224, 'steps': 93521, 'loss/train': 0.9285896420478821} 11/07/2021 10:23:49 - INFO - __main__ - Step 93523: {'lr': 0.00015914434377671378, 'samples': 17956416, 'steps': 93522, 'loss/train': 1.3756650686264038} 11/07/2021 10:23:49 - INFO - __main__ - Step 93524: {'lr': 0.00015913939990474152, 'samples': 17956608, 'steps': 93523, 'loss/train': 1.278997540473938} 11/07/2021 10:23:50 - INFO - __main__ - Step 93525: {'lr': 0.00015913445607370963, 'samples': 17956800, 'steps': 93524, 'loss/train': 1.1569586992263794} 11/07/2021 10:23:50 - INFO - __main__ - Step 93526: {'lr': 0.00015912951228362038, 'samples': 17956992, 'steps': 93525, 'loss/train': 1.9706604480743408} 11/07/2021 10:23:51 - INFO - __main__ - Step 93527: {'lr': 0.00015912456853447605, 'samples': 17957184, 'steps': 93526, 'loss/train': 1.3564549684524536} 11/07/2021 10:23:51 - INFO - __main__ - Step 93528: {'lr': 0.0001591196248262787, 'samples': 17957376, 'steps': 93527, 'loss/train': 1.5020854473114014} 11/07/2021 10:23:52 - INFO - __main__ - Step 93529: {'lr': 0.00015911468115903062, 'samples': 17957568, 'steps': 93528, 'loss/train': 1.1356959342956543} 11/07/2021 10:23:53 - INFO - __main__ - Step 93530: {'lr': 0.0001591097375327341, 'samples': 17957760, 'steps': 93529, 'loss/train': 1.6319791078567505} 11/07/2021 10:23:53 - INFO - __main__ - Step 93531: {'lr': 0.0001591047939473913, 'samples': 17957952, 'steps': 93530, 'loss/train': 0.8831084966659546} 11/07/2021 10:23:53 - INFO - __main__ - Step 93532: {'lr': 0.00015909985040300447, 'samples': 17958144, 'steps': 93531, 'loss/train': 1.5024025440216064} 11/07/2021 10:23:54 - INFO - __main__ - Step 93533: {'lr': 0.00015909490689957587, 'samples': 17958336, 'steps': 93532, 'loss/train': 0.675434410572052} 11/07/2021 10:23:55 - INFO - __main__ - Step 93534: {'lr': 0.0001590899634371077, 'samples': 17958528, 'steps': 93533, 'loss/train': 1.5334713459014893} 11/07/2021 10:23:55 - INFO - __main__ - Step 93535: {'lr': 0.00015908502001560216, 'samples': 17958720, 'steps': 93534, 'loss/train': 0.4005618989467621} 11/07/2021 10:23:55 - INFO - __main__ - Step 93536: {'lr': 0.00015908007663506153, 'samples': 17958912, 'steps': 93535, 'loss/train': 1.7427774667739868} 11/07/2021 10:23:56 - INFO - __main__ - Step 93537: {'lr': 0.00015907513329548801, 'samples': 17959104, 'steps': 93536, 'loss/train': 1.5985432863235474} 11/07/2021 10:23:56 - INFO - __main__ - Step 93538: {'lr': 0.00015907018999688382, 'samples': 17959296, 'steps': 93537, 'loss/train': 0.6606783270835876} 11/07/2021 10:23:57 - INFO - __main__ - Step 93539: {'lr': 0.00015906524673925125, 'samples': 17959488, 'steps': 93538, 'loss/train': 1.1665676832199097} 11/07/2021 10:23:57 - INFO - __main__ - Step 93540: {'lr': 0.00015906030352259254, 'samples': 17959680, 'steps': 93539, 'loss/train': 1.3436635732650757} 11/07/2021 10:23:58 - INFO - __main__ - Step 93541: {'lr': 0.00015905536034690977, 'samples': 17959872, 'steps': 93540, 'loss/train': 1.0967670679092407} 11/07/2021 10:23:58 - INFO - __main__ - Step 93542: {'lr': 0.0001590504172122052, 'samples': 17960064, 'steps': 93541, 'loss/train': 1.0460484027862549} 11/07/2021 10:23:58 - INFO - __main__ - Step 93543: {'lr': 0.00015904547411848115, 'samples': 17960256, 'steps': 93542, 'loss/train': 1.563359022140503} 11/07/2021 10:24:00 - INFO - __main__ - Step 93544: {'lr': 0.0001590405310657398, 'samples': 17960448, 'steps': 93543, 'loss/train': 1.3627493381500244} 11/07/2021 10:24:00 - INFO - __main__ - Step 93545: {'lr': 0.00015903558805398338, 'samples': 17960640, 'steps': 93544, 'loss/train': 0.8167644143104553} 11/07/2021 10:24:00 - INFO - __main__ - Step 93546: {'lr': 0.00015903064508321414, 'samples': 17960832, 'steps': 93545, 'loss/train': 1.4707719087600708} 11/07/2021 10:24:01 - INFO - __main__ - Step 93547: {'lr': 0.00015902570215343425, 'samples': 17961024, 'steps': 93546, 'loss/train': 1.3082371950149536} 11/07/2021 10:24:01 - INFO - __main__ - Step 93548: {'lr': 0.000159020759264646, 'samples': 17961216, 'steps': 93547, 'loss/train': 1.6866048574447632} 11/07/2021 10:24:02 - INFO - __main__ - Step 93549: {'lr': 0.00015901581641685158, 'samples': 17961408, 'steps': 93548, 'loss/train': 1.387137770652771} 11/07/2021 10:24:02 - INFO - __main__ - Step 93550: {'lr': 0.00015901087361005326, 'samples': 17961600, 'steps': 93549, 'loss/train': 1.0531535148620605} 11/07/2021 10:24:03 - INFO - __main__ - Step 93551: {'lr': 0.0001590059308442532, 'samples': 17961792, 'steps': 93550, 'loss/train': 1.409508228302002} 11/07/2021 10:24:03 - INFO - __main__ - Step 93552: {'lr': 0.00015900098811945368, 'samples': 17961984, 'steps': 93551, 'loss/train': 1.5655827522277832} 11/07/2021 10:24:03 - INFO - __main__ - Step 93553: {'lr': 0.0001589960454356569, 'samples': 17962176, 'steps': 93552, 'loss/train': 1.028741478919983} 11/07/2021 10:24:04 - INFO - __main__ - Step 93554: {'lr': 0.0001589911027928652, 'samples': 17962368, 'steps': 93553, 'loss/train': 1.3632341623306274} 11/07/2021 10:24:05 - INFO - __main__ - Step 93555: {'lr': 0.00015898616019108065, 'samples': 17962560, 'steps': 93554, 'loss/train': 1.4826210737228394} 11/07/2021 10:24:05 - INFO - __main__ - Step 93556: {'lr': 0.00015898121763030547, 'samples': 17962752, 'steps': 93555, 'loss/train': 1.3715391159057617} 11/07/2021 10:24:05 - INFO - __main__ - Step 93557: {'lr': 0.00015897627511054198, 'samples': 17962944, 'steps': 93556, 'loss/train': 0.696286678314209} 11/07/2021 10:24:06 - INFO - __main__ - Step 93558: {'lr': 0.00015897133263179236, 'samples': 17963136, 'steps': 93557, 'loss/train': 1.7643028497695923} 11/07/2021 10:24:06 - INFO - __main__ - Step 93559: {'lr': 0.00015896639019405884, 'samples': 17963328, 'steps': 93558, 'loss/train': 1.3438175916671753} 11/07/2021 10:24:07 - INFO - __main__ - Step 93560: {'lr': 0.00015896144779734366, 'samples': 17963520, 'steps': 93559, 'loss/train': 1.269461750984192} 11/07/2021 10:24:08 - INFO - __main__ - Step 93561: {'lr': 0.0001589565054416491, 'samples': 17963712, 'steps': 93560, 'loss/train': 1.549894094467163} 11/07/2021 10:24:08 - INFO - __main__ - Step 93562: {'lr': 0.0001589515631269773, 'samples': 17963904, 'steps': 93561, 'loss/train': 1.5400124788284302} 11/07/2021 10:24:08 - INFO - __main__ - Step 93563: {'lr': 0.0001589466208533305, 'samples': 17964096, 'steps': 93562, 'loss/train': 1.6908376216888428} 11/07/2021 10:24:09 - INFO - __main__ - Step 93564: {'lr': 0.00015894167862071098, 'samples': 17964288, 'steps': 93563, 'loss/train': 1.2898799180984497} 11/07/2021 10:24:10 - INFO - __main__ - Step 93565: {'lr': 0.00015893673642912093, 'samples': 17964480, 'steps': 93564, 'loss/train': 0.07542716711759567} 11/07/2021 10:24:10 - INFO - __main__ - Step 93566: {'lr': 0.00015893179427856259, 'samples': 17964672, 'steps': 93565, 'loss/train': 1.508673071861267} 11/07/2021 10:24:10 - INFO - __main__ - Step 93567: {'lr': 0.00015892685216903823, 'samples': 17964864, 'steps': 93566, 'loss/train': 1.7454172372817993} 11/07/2021 10:24:11 - INFO - __main__ - Step 93568: {'lr': 0.00015892191010054995, 'samples': 17965056, 'steps': 93567, 'loss/train': 1.4105597734451294} 11/07/2021 10:24:11 - INFO - __main__ - Step 93569: {'lr': 0.00015891696807310008, 'samples': 17965248, 'steps': 93568, 'loss/train': 1.7878968715667725} 11/07/2021 10:24:12 - INFO - __main__ - Step 93570: {'lr': 0.00015891202608669082, 'samples': 17965440, 'steps': 93569, 'loss/train': 1.0509264469146729} 11/07/2021 10:24:12 - INFO - __main__ - Step 93571: {'lr': 0.00015890708414132435, 'samples': 17965632, 'steps': 93570, 'loss/train': 1.3961340188980103} 11/07/2021 10:24:13 - INFO - __main__ - Step 93572: {'lr': 0.00015890214223700296, 'samples': 17965824, 'steps': 93571, 'loss/train': 1.487597107887268} 11/07/2021 10:24:13 - INFO - __main__ - Step 93573: {'lr': 0.00015889720037372886, 'samples': 17966016, 'steps': 93572, 'loss/train': 1.7988522052764893} 11/07/2021 10:24:13 - INFO - __main__ - Step 93574: {'lr': 0.00015889225855150429, 'samples': 17966208, 'steps': 93573, 'loss/train': 1.464078664779663} 11/07/2021 10:24:15 - INFO - __main__ - Step 93575: {'lr': 0.00015888731677033148, 'samples': 17966400, 'steps': 93574, 'loss/train': 1.460461974143982} 11/07/2021 10:24:15 - INFO - __main__ - Step 93576: {'lr': 0.0001588823750302126, 'samples': 17966592, 'steps': 93575, 'loss/train': 1.6773308515548706} 11/07/2021 10:24:15 - INFO - __main__ - Step 93577: {'lr': 0.0001588774333311499, 'samples': 17966784, 'steps': 93576, 'loss/train': 1.7654770612716675} 11/07/2021 10:24:16 - INFO - __main__ - Step 93578: {'lr': 0.00015887249167314567, 'samples': 17966976, 'steps': 93577, 'loss/train': 1.3830158710479736} 11/07/2021 10:24:16 - INFO - __main__ - Step 93579: {'lr': 0.00015886755005620207, 'samples': 17967168, 'steps': 93578, 'loss/train': 1.2742639780044556} 11/07/2021 10:24:17 - INFO - __main__ - Step 93580: {'lr': 0.00015886260848032134, 'samples': 17967360, 'steps': 93579, 'loss/train': 0.6690245270729065} 11/07/2021 10:24:17 - INFO - __main__ - Step 93581: {'lr': 0.0001588576669455058, 'samples': 17967552, 'steps': 93580, 'loss/train': 1.4414424896240234} 11/07/2021 10:24:18 - INFO - __main__ - Step 93582: {'lr': 0.0001588527254517575, 'samples': 17967744, 'steps': 93581, 'loss/train': 1.3162575960159302} 11/07/2021 10:24:18 - INFO - __main__ - Step 93583: {'lr': 0.00015884778399907878, 'samples': 17967936, 'steps': 93582, 'loss/train': 1.1158249378204346} 11/07/2021 10:24:19 - INFO - __main__ - Step 93584: {'lr': 0.00015884284258747185, 'samples': 17968128, 'steps': 93583, 'loss/train': 1.3358232975006104} 11/07/2021 10:24:19 - INFO - __main__ - Step 93585: {'lr': 0.00015883790121693885, 'samples': 17968320, 'steps': 93584, 'loss/train': 1.3957451581954956} 11/07/2021 10:24:20 - INFO - __main__ - Step 93586: {'lr': 0.00015883295988748213, 'samples': 17968512, 'steps': 93585, 'loss/train': 1.504468321800232} 11/07/2021 10:24:20 - INFO - __main__ - Step 93587: {'lr': 0.00015882801859910388, 'samples': 17968704, 'steps': 93586, 'loss/train': 1.605237603187561} 11/07/2021 10:24:21 - INFO - __main__ - Step 93588: {'lr': 0.00015882307735180635, 'samples': 17968896, 'steps': 93587, 'loss/train': 1.353516697883606} 11/07/2021 10:24:21 - INFO - __main__ - Step 93589: {'lr': 0.0001588181361455917, 'samples': 17969088, 'steps': 93588, 'loss/train': 1.433793067932129} 11/07/2021 10:24:21 - INFO - __main__ - Step 93590: {'lr': 0.00015881319498046217, 'samples': 17969280, 'steps': 93589, 'loss/train': 1.3905352354049683} 11/07/2021 10:24:22 - INFO - __main__ - Step 93591: {'lr': 0.00015880825385642004, 'samples': 17969472, 'steps': 93590, 'loss/train': 1.4709304571151733} 11/07/2021 10:24:23 - INFO - __main__ - Step 93592: {'lr': 0.0001588033127734675, 'samples': 17969664, 'steps': 93591, 'loss/train': 3.0097134113311768} 11/07/2021 10:24:23 - INFO - __main__ - Step 93593: {'lr': 0.00015879837173160677, 'samples': 17969856, 'steps': 93592, 'loss/train': 1.204999327659607} 11/07/2021 10:24:23 - INFO - __main__ - Step 93594: {'lr': 0.0001587934307308402, 'samples': 17970048, 'steps': 93593, 'loss/train': 1.777646541595459} 11/07/2021 10:24:24 - INFO - __main__ - Step 93595: {'lr': 0.00015878848977116979, 'samples': 17970240, 'steps': 93594, 'loss/train': 1.0090229511260986} 11/07/2021 10:24:25 - INFO - __main__ - Step 93596: {'lr': 0.00015878354885259788, 'samples': 17970432, 'steps': 93595, 'loss/train': 1.388160228729248} 11/07/2021 10:24:25 - INFO - __main__ - Step 93597: {'lr': 0.00015877860797512667, 'samples': 17970624, 'steps': 93596, 'loss/train': 0.8242500424385071} 11/07/2021 10:24:26 - INFO - __main__ - Step 93598: {'lr': 0.00015877366713875845, 'samples': 17970816, 'steps': 93597, 'loss/train': 1.5754671096801758} 11/07/2021 10:24:26 - INFO - __main__ - Step 93599: {'lr': 0.00015876872634349538, 'samples': 17971008, 'steps': 93598, 'loss/train': 1.5644978284835815} 11/07/2021 10:24:26 - INFO - __main__ - Step 93600: {'lr': 0.00015876378558933973, 'samples': 17971200, 'steps': 93599, 'loss/train': 1.7799173593521118} 11/07/2021 10:24:27 - INFO - __main__ - Step 93601: {'lr': 0.0001587588448762937, 'samples': 17971392, 'steps': 93600, 'loss/train': 0.5827279090881348} 11/07/2021 10:24:28 - INFO - __main__ - Step 93602: {'lr': 0.00015875390420435953, 'samples': 17971584, 'steps': 93601, 'loss/train': 1.2219834327697754} 11/07/2021 10:24:28 - INFO - __main__ - Step 93603: {'lr': 0.00015874896357353946, 'samples': 17971776, 'steps': 93602, 'loss/train': 1.6181039810180664} 11/07/2021 10:24:28 - INFO - __main__ - Step 93604: {'lr': 0.0001587440229838357, 'samples': 17971968, 'steps': 93603, 'loss/train': 1.5096553564071655} 11/07/2021 10:24:29 - INFO - __main__ - Step 93605: {'lr': 0.00015873908243525047, 'samples': 17972160, 'steps': 93604, 'loss/train': 1.5673425197601318} 11/07/2021 10:24:29 - INFO - __main__ - Step 93606: {'lr': 0.00015873414192778604, 'samples': 17972352, 'steps': 93605, 'loss/train': 1.5894200801849365} 11/07/2021 10:24:30 - INFO - __main__ - Step 93607: {'lr': 0.0001587292014614446, 'samples': 17972544, 'steps': 93606, 'loss/train': 1.3703269958496094} 11/07/2021 10:24:30 - INFO - __main__ - Step 93608: {'lr': 0.00015872426103622834, 'samples': 17972736, 'steps': 93607, 'loss/train': 1.3944952487945557} 11/07/2021 10:24:31 - INFO - __main__ - Step 93609: {'lr': 0.00015871932065213948, 'samples': 17972928, 'steps': 93608, 'loss/train': 1.6256476640701294} 11/07/2021 10:24:31 - INFO - __main__ - Step 93610: {'lr': 0.00015871438030918032, 'samples': 17973120, 'steps': 93609, 'loss/train': 1.654977560043335} 11/07/2021 10:24:32 - INFO - __main__ - Step 93611: {'lr': 0.00015870944000735305, 'samples': 17973312, 'steps': 93610, 'loss/train': 0.39988479018211365} 11/07/2021 10:24:33 - INFO - __main__ - Step 93612: {'lr': 0.00015870449974665987, 'samples': 17973504, 'steps': 93611, 'loss/train': 1.2405273914337158} 11/07/2021 10:24:33 - INFO - __main__ - Step 93613: {'lr': 0.00015869955952710308, 'samples': 17973696, 'steps': 93612, 'loss/train': 1.4247909784317017} 11/07/2021 10:24:33 - INFO - __main__ - Step 93614: {'lr': 0.0001586946193486848, 'samples': 17973888, 'steps': 93613, 'loss/train': 1.5616170167922974} 11/07/2021 10:24:34 - INFO - __main__ - Step 93615: {'lr': 0.00015868967921140736, 'samples': 17974080, 'steps': 93614, 'loss/train': 1.3698898553848267} 11/07/2021 10:24:34 - INFO - __main__ - Step 93616: {'lr': 0.00015868473911527292, 'samples': 17974272, 'steps': 93615, 'loss/train': 1.2779698371887207} 11/07/2021 10:24:35 - INFO - __main__ - Step 93617: {'lr': 0.0001586797990602838, 'samples': 17974464, 'steps': 93616, 'loss/train': 1.3383859395980835} 11/07/2021 10:24:36 - INFO - __main__ - Step 93618: {'lr': 0.0001586748590464421, 'samples': 17974656, 'steps': 93617, 'loss/train': 1.2851048707962036} 11/07/2021 10:24:36 - INFO - __main__ - Step 93619: {'lr': 0.00015866991907375006, 'samples': 17974848, 'steps': 93618, 'loss/train': 1.563151478767395} 11/07/2021 10:24:36 - INFO - __main__ - Step 93620: {'lr': 0.00015866497914220998, 'samples': 17975040, 'steps': 93619, 'loss/train': 1.3432297706604004} 11/07/2021 10:24:37 - INFO - __main__ - Step 93621: {'lr': 0.0001586600392518241, 'samples': 17975232, 'steps': 93620, 'loss/train': 0.14319248497486115} 11/07/2021 10:24:38 - INFO - __main__ - Step 93622: {'lr': 0.00015865509940259453, 'samples': 17975424, 'steps': 93621, 'loss/train': 1.4379374980926514} 11/07/2021 10:24:38 - INFO - __main__ - Step 93623: {'lr': 0.00015865015959452358, 'samples': 17975616, 'steps': 93622, 'loss/train': 1.3696435689926147} 11/07/2021 10:24:38 - INFO - __main__ - Step 93624: {'lr': 0.00015864521982761348, 'samples': 17975808, 'steps': 93623, 'loss/train': 1.541550874710083} 11/07/2021 10:24:39 - INFO - __main__ - Step 93625: {'lr': 0.00015864028010186638, 'samples': 17976000, 'steps': 93624, 'loss/train': 1.3545284271240234} 11/07/2021 10:24:39 - INFO - __main__ - Step 93626: {'lr': 0.0001586353404172846, 'samples': 17976192, 'steps': 93625, 'loss/train': 1.4277340173721313} 11/07/2021 10:24:40 - INFO - __main__ - Step 93627: {'lr': 0.0001586304007738703, 'samples': 17976384, 'steps': 93626, 'loss/train': 1.393221378326416} 11/07/2021 10:24:40 - INFO - __main__ - Step 93628: {'lr': 0.00015862546117162578, 'samples': 17976576, 'steps': 93627, 'loss/train': 1.3071672916412354} 11/07/2021 10:24:41 - INFO - __main__ - Step 93629: {'lr': 0.00015862052161055322, 'samples': 17976768, 'steps': 93628, 'loss/train': 0.16644160449504852} 11/07/2021 10:24:41 - INFO - __main__ - Step 93630: {'lr': 0.0001586155820906548, 'samples': 17976960, 'steps': 93629, 'loss/train': 1.1157498359680176} 11/07/2021 10:24:41 - INFO - __main__ - Step 93631: {'lr': 0.0001586106426119328, 'samples': 17977152, 'steps': 93630, 'loss/train': 1.052596092224121} 11/07/2021 10:24:42 - INFO - __main__ - Step 93632: {'lr': 0.00015860570317438943, 'samples': 17977344, 'steps': 93631, 'loss/train': 0.42541229724884033} 11/07/2021 10:24:43 - INFO - __main__ - Step 93633: {'lr': 0.00015860076377802691, 'samples': 17977536, 'steps': 93632, 'loss/train': 0.6876203417778015} 11/07/2021 10:24:43 - INFO - __main__ - Step 93634: {'lr': 0.00015859582442284754, 'samples': 17977728, 'steps': 93633, 'loss/train': 1.5711079835891724} 11/07/2021 10:24:43 - INFO - __main__ - Step 93635: {'lr': 0.0001585908851088534, 'samples': 17977920, 'steps': 93634, 'loss/train': 1.130753517150879} 11/07/2021 10:24:44 - INFO - __main__ - Step 93636: {'lr': 0.00015858594583604684, 'samples': 17978112, 'steps': 93635, 'loss/train': 1.4591717720031738} 11/07/2021 10:24:44 - INFO - __main__ - Step 93637: {'lr': 0.00015858100660443003, 'samples': 17978304, 'steps': 93636, 'loss/train': 0.05432211980223656} 11/07/2021 10:24:45 - INFO - __main__ - Step 93638: {'lr': 0.0001585760674140052, 'samples': 17978496, 'steps': 93637, 'loss/train': 1.4312399625778198} 11/07/2021 10:24:46 - INFO - __main__ - Step 93639: {'lr': 0.00015857112826477463, 'samples': 17978688, 'steps': 93638, 'loss/train': 1.4199433326721191} 11/07/2021 10:24:46 - INFO - __main__ - Step 93640: {'lr': 0.00015856618915674044, 'samples': 17978880, 'steps': 93639, 'loss/train': 0.6413478255271912} 11/07/2021 10:24:46 - INFO - __main__ - Step 93641: {'lr': 0.00015856125008990494, 'samples': 17979072, 'steps': 93640, 'loss/train': 1.3718042373657227} 11/07/2021 10:24:47 - INFO - __main__ - Step 93642: {'lr': 0.0001585563110642703, 'samples': 17979264, 'steps': 93641, 'loss/train': 5.6694536209106445} 11/07/2021 10:24:48 - INFO - __main__ - Step 93643: {'lr': 0.0001585513720798388, 'samples': 17979456, 'steps': 93642, 'loss/train': 1.2214792966842651} 11/07/2021 10:24:48 - INFO - __main__ - Step 93644: {'lr': 0.00015854643313661265, 'samples': 17979648, 'steps': 93643, 'loss/train': 1.4376392364501953} 11/07/2021 10:24:48 - INFO - __main__ - Step 93645: {'lr': 0.00015854149423459403, 'samples': 17979840, 'steps': 93644, 'loss/train': 1.358774185180664} 11/07/2021 10:24:49 - INFO - __main__ - Step 93646: {'lr': 0.0001585365553737852, 'samples': 17980032, 'steps': 93645, 'loss/train': 1.7956228256225586} 11/07/2021 10:24:49 - INFO - __main__ - Step 93647: {'lr': 0.00015853161655418843, 'samples': 17980224, 'steps': 93646, 'loss/train': 1.7548729181289673} 11/07/2021 10:24:49 - INFO - __main__ - Step 93648: {'lr': 0.00015852667777580592, 'samples': 17980416, 'steps': 93647, 'loss/train': 1.1400045156478882} 11/07/2021 10:24:50 - INFO - __main__ - Step 93649: {'lr': 0.00015852173903863986, 'samples': 17980608, 'steps': 93648, 'loss/train': 1.4247041940689087} 11/07/2021 10:24:51 - INFO - __main__ - Step 93650: {'lr': 0.0001585168003426925, 'samples': 17980800, 'steps': 93649, 'loss/train': 1.371017575263977} 11/07/2021 10:24:51 - INFO - __main__ - Step 93651: {'lr': 0.00015851186168796606, 'samples': 17980992, 'steps': 93650, 'loss/train': 1.5884416103363037} 11/07/2021 10:24:52 - INFO - __main__ - Step 93652: {'lr': 0.00015850692307446272, 'samples': 17981184, 'steps': 93651, 'loss/train': 0.3849007487297058} 11/07/2021 10:24:52 - INFO - __main__ - Step 93653: {'lr': 0.00015850198450218474, 'samples': 17981376, 'steps': 93652, 'loss/train': 1.4164576530456543} 11/07/2021 10:24:53 - INFO - __main__ - Step 93654: {'lr': 0.00015849704597113438, 'samples': 17981568, 'steps': 93653, 'loss/train': 1.8903084993362427} 11/07/2021 10:24:53 - INFO - __main__ - Step 93655: {'lr': 0.00015849210748131382, 'samples': 17981760, 'steps': 93654, 'loss/train': 1.0199027061462402} 11/07/2021 10:24:54 - INFO - __main__ - Step 93656: {'lr': 0.0001584871690327253, 'samples': 17981952, 'steps': 93655, 'loss/train': 1.2898999452590942} 11/07/2021 10:24:54 - INFO - __main__ - Step 93657: {'lr': 0.0001584822306253711, 'samples': 17982144, 'steps': 93656, 'loss/train': 0.1030687466263771} 11/07/2021 10:24:54 - INFO - __main__ - Step 93658: {'lr': 0.00015847729225925333, 'samples': 17982336, 'steps': 93657, 'loss/train': 1.246720790863037} 11/07/2021 10:24:55 - INFO - __main__ - Step 93659: {'lr': 0.00015847235393437435, 'samples': 17982528, 'steps': 93658, 'loss/train': 1.2013229131698608} 11/07/2021 10:24:56 - INFO - __main__ - Step 93660: {'lr': 0.00015846741565073624, 'samples': 17982720, 'steps': 93659, 'loss/train': 0.7659128308296204} 11/07/2021 10:24:56 - INFO - __main__ - Step 93661: {'lr': 0.00015846247740834146, 'samples': 17982912, 'steps': 93660, 'loss/train': 1.410837173461914} 11/07/2021 10:24:56 - INFO - __main__ - Step 93662: {'lr': 0.00015845753920719198, 'samples': 17983104, 'steps': 93661, 'loss/train': 1.6618626117706299} 11/07/2021 10:24:57 - INFO - __main__ - Step 93663: {'lr': 0.00015845260104729007, 'samples': 17983296, 'steps': 93662, 'loss/train': 1.5066297054290771} 11/07/2021 10:24:57 - INFO - __main__ - Step 93664: {'lr': 0.00015844766292863802, 'samples': 17983488, 'steps': 93663, 'loss/train': 1.2236438989639282} 11/07/2021 10:24:58 - INFO - __main__ - Step 93665: {'lr': 0.00015844272485123807, 'samples': 17983680, 'steps': 93664, 'loss/train': 1.3959094285964966} 11/07/2021 10:24:58 - INFO - __main__ - Step 93666: {'lr': 0.00015843778681509234, 'samples': 17983872, 'steps': 93665, 'loss/train': 1.9053785800933838} 11/07/2021 10:24:59 - INFO - __main__ - Step 93667: {'lr': 0.0001584328488202032, 'samples': 17984064, 'steps': 93666, 'loss/train': 1.2196238040924072} 11/07/2021 10:24:59 - INFO - __main__ - Step 93668: {'lr': 0.0001584279108665728, 'samples': 17984256, 'steps': 93667, 'loss/train': 1.2269614934921265} 11/07/2021 10:24:59 - INFO - __main__ - Step 93669: {'lr': 0.00015842297295420336, 'samples': 17984448, 'steps': 93668, 'loss/train': 1.2483165264129639} 11/07/2021 10:25:00 - INFO - __main__ - Step 93670: {'lr': 0.0001584180350830971, 'samples': 17984640, 'steps': 93669, 'loss/train': 1.5584893226623535} 11/07/2021 10:25:01 - INFO - __main__ - Step 93671: {'lr': 0.00015841309725325627, 'samples': 17984832, 'steps': 93670, 'loss/train': 1.374049425125122} 11/07/2021 10:25:01 - INFO - __main__ - Step 93672: {'lr': 0.0001584081594646831, 'samples': 17985024, 'steps': 93671, 'loss/train': 1.487916111946106} 11/07/2021 10:25:02 - INFO - __main__ - Step 93673: {'lr': 0.0001584032217173798, 'samples': 17985216, 'steps': 93672, 'loss/train': 1.4069849252700806} 11/07/2021 10:25:02 - INFO - __main__ - Step 93674: {'lr': 0.0001583982840113486, 'samples': 17985408, 'steps': 93673, 'loss/train': 1.4772261381149292} 11/07/2021 10:25:03 - INFO - __main__ - Step 93675: {'lr': 0.0001583933463465918, 'samples': 17985600, 'steps': 93674, 'loss/train': 1.4879294633865356} 11/07/2021 10:25:03 - INFO - __main__ - Step 93676: {'lr': 0.00015838840872311146, 'samples': 17985792, 'steps': 93675, 'loss/train': 1.1182606220245361} 11/07/2021 10:25:04 - INFO - __main__ - Step 93677: {'lr': 0.00015838347114090985, 'samples': 17985984, 'steps': 93676, 'loss/train': 1.9421530961990356} 11/07/2021 10:25:04 - INFO - __main__ - Step 93678: {'lr': 0.00015837853359998926, 'samples': 17986176, 'steps': 93677, 'loss/train': 0.5344652533531189} 11/07/2021 10:25:04 - INFO - __main__ - Step 93679: {'lr': 0.0001583735961003519, 'samples': 17986368, 'steps': 93678, 'loss/train': 1.377200961112976} 11/07/2021 10:25:05 - INFO - __main__ - Step 93680: {'lr': 0.00015836865864199995, 'samples': 17986560, 'steps': 93679, 'loss/train': 1.1307975053787231} 11/07/2021 10:25:06 - INFO - __main__ - Step 93681: {'lr': 0.0001583637212249357, 'samples': 17986752, 'steps': 93680, 'loss/train': 1.2456804513931274} 11/07/2021 10:25:06 - INFO - __main__ - Step 93682: {'lr': 0.00015835878384916135, 'samples': 17986944, 'steps': 93681, 'loss/train': 1.2421075105667114} 11/07/2021 10:25:06 - INFO - __main__ - Step 93683: {'lr': 0.0001583538465146791, 'samples': 17987136, 'steps': 93682, 'loss/train': 1.5055911540985107} 11/07/2021 10:25:07 - INFO - __main__ - Step 93684: {'lr': 0.0001583489092214912, 'samples': 17987328, 'steps': 93683, 'loss/train': 1.3766611814498901} 11/07/2021 10:25:07 - INFO - __main__ - Step 93685: {'lr': 0.00015834397196959986, 'samples': 17987520, 'steps': 93684, 'loss/train': 0.9632483720779419} 11/07/2021 10:25:08 - INFO - __main__ - Step 93686: {'lr': 0.0001583390347590073, 'samples': 17987712, 'steps': 93685, 'loss/train': 1.4324580430984497} 11/07/2021 10:25:09 - INFO - __main__ - Step 93687: {'lr': 0.0001583340975897158, 'samples': 17987904, 'steps': 93686, 'loss/train': 1.3295005559921265} 11/07/2021 10:25:09 - INFO - __main__ - Step 93688: {'lr': 0.0001583291604617276, 'samples': 17988096, 'steps': 93687, 'loss/train': 1.6983799934387207} 11/07/2021 10:25:09 - INFO - __main__ - Step 93689: {'lr': 0.00015832422337504475, 'samples': 17988288, 'steps': 93688, 'loss/train': 1.76311194896698} 11/07/2021 10:25:10 - INFO - __main__ - Step 93690: {'lr': 0.00015831928632966964, 'samples': 17988480, 'steps': 93689, 'loss/train': 1.5350470542907715} 11/07/2021 10:25:10 - INFO - __main__ - Step 93691: {'lr': 0.00015831434932560442, 'samples': 17988672, 'steps': 93690, 'loss/train': 1.9150681495666504} 11/07/2021 10:25:11 - INFO - __main__ - Step 93692: {'lr': 0.00015830941236285134, 'samples': 17988864, 'steps': 93691, 'loss/train': 0.8271337747573853} 11/07/2021 10:25:11 - INFO - __main__ - Step 93693: {'lr': 0.00015830447544141262, 'samples': 17989056, 'steps': 93692, 'loss/train': 1.8688064813613892} 11/07/2021 10:25:12 - INFO - __main__ - Step 93694: {'lr': 0.00015829953856129052, 'samples': 17989248, 'steps': 93693, 'loss/train': 0.9230802655220032} 11/07/2021 10:25:12 - INFO - __main__ - Step 93695: {'lr': 0.00015829460172248723, 'samples': 17989440, 'steps': 93694, 'loss/train': 0.5888314247131348} 11/07/2021 10:25:12 - INFO - __main__ - Step 93696: {'lr': 0.0001582896649250049, 'samples': 17989632, 'steps': 93695, 'loss/train': 1.5333824157714844} 11/07/2021 10:25:14 - INFO - __main__ - Step 93697: {'lr': 0.00015828472816884593, 'samples': 17989824, 'steps': 93696, 'loss/train': 1.091734766960144} 11/07/2021 10:25:14 - INFO - __main__ - Step 93698: {'lr': 0.0001582797914540124, 'samples': 17990016, 'steps': 93697, 'loss/train': 1.1231532096862793} 11/07/2021 10:25:14 - INFO - __main__ - Step 93699: {'lr': 0.00015827485478050657, 'samples': 17990208, 'steps': 93698, 'loss/train': 1.131947636604309} 11/07/2021 10:25:15 - INFO - __main__ - Step 93700: {'lr': 0.0001582699181483307, 'samples': 17990400, 'steps': 93699, 'loss/train': 5.630252838134766} 11/07/2021 10:25:15 - INFO - __main__ - Step 93701: {'lr': 0.00015826498155748698, 'samples': 17990592, 'steps': 93700, 'loss/train': 1.5572683811187744} 11/07/2021 10:25:16 - INFO - __main__ - Step 93702: {'lr': 0.00015826004500797775, 'samples': 17990784, 'steps': 93701, 'loss/train': 1.7414249181747437} 11/07/2021 10:25:16 - INFO - __main__ - Step 93703: {'lr': 0.000158255108499805, 'samples': 17990976, 'steps': 93702, 'loss/train': 1.2638174295425415} 11/07/2021 10:25:17 - INFO - __main__ - Step 93704: {'lr': 0.0001582501720329711, 'samples': 17991168, 'steps': 93703, 'loss/train': 1.7992987632751465} 11/07/2021 10:25:17 - INFO - __main__ - Step 93705: {'lr': 0.00015824523560747827, 'samples': 17991360, 'steps': 93704, 'loss/train': 1.0713902711868286} 11/07/2021 10:25:17 - INFO - __main__ - Step 93706: {'lr': 0.0001582402992233287, 'samples': 17991552, 'steps': 93705, 'loss/train': 1.4172680377960205} 11/07/2021 10:25:18 - INFO - __main__ - Step 93707: {'lr': 0.00015823536288052465, 'samples': 17991744, 'steps': 93706, 'loss/train': 1.2562377452850342} 11/07/2021 10:25:19 - INFO - __main__ - Step 93708: {'lr': 0.00015823042657906833, 'samples': 17991936, 'steps': 93707, 'loss/train': 1.1600736379623413} 11/07/2021 10:25:19 - INFO - __main__ - Step 93709: {'lr': 0.00015822549031896196, 'samples': 17992128, 'steps': 93708, 'loss/train': 0.5965026617050171} 11/07/2021 10:25:19 - INFO - __main__ - Step 93710: {'lr': 0.0001582205541002078, 'samples': 17992320, 'steps': 93709, 'loss/train': 0.9259901642799377} 11/07/2021 10:25:20 - INFO - __main__ - Step 93711: {'lr': 0.00015821561792280796, 'samples': 17992512, 'steps': 93710, 'loss/train': 1.4537220001220703} 11/07/2021 10:25:20 - INFO - __main__ - Step 93712: {'lr': 0.0001582106817867648, 'samples': 17992704, 'steps': 93711, 'loss/train': 1.511067509651184} 11/07/2021 10:25:21 - INFO - __main__ - Step 93713: {'lr': 0.0001582057456920805, 'samples': 17992896, 'steps': 93712, 'loss/train': 1.5273081064224243} 11/07/2021 10:25:22 - INFO - __main__ - Step 93714: {'lr': 0.00015820080963875727, 'samples': 17993088, 'steps': 93713, 'loss/train': 1.5422968864440918} 11/07/2021 10:25:22 - INFO - __main__ - Step 93715: {'lr': 0.00015819587362679745, 'samples': 17993280, 'steps': 93714, 'loss/train': 1.5006885528564453} 11/07/2021 10:25:22 - INFO - __main__ - Step 93716: {'lr': 0.000158190937656203, 'samples': 17993472, 'steps': 93715, 'loss/train': 1.5652328729629517} 11/07/2021 10:25:23 - INFO - __main__ - Step 93717: {'lr': 0.00015818600172697633, 'samples': 17993664, 'steps': 93716, 'loss/train': 1.3938617706298828} 11/07/2021 10:25:24 - INFO - __main__ - Step 93718: {'lr': 0.00015818106583911963, 'samples': 17993856, 'steps': 93717, 'loss/train': 0.6794837713241577} 11/07/2021 10:25:24 - INFO - __main__ - Step 93719: {'lr': 0.00015817612999263514, 'samples': 17994048, 'steps': 93718, 'loss/train': 1.2365363836288452} 11/07/2021 10:25:25 - INFO - __main__ - Step 93720: {'lr': 0.00015817119418752503, 'samples': 17994240, 'steps': 93719, 'loss/train': 1.5277040004730225} 11/07/2021 10:25:25 - INFO - __main__ - Step 93721: {'lr': 0.0001581662584237916, 'samples': 17994432, 'steps': 93720, 'loss/train': 1.6357738971710205} 11/07/2021 10:25:25 - INFO - __main__ - Step 93722: {'lr': 0.000158161322701437, 'samples': 17994624, 'steps': 93721, 'loss/train': 1.37932288646698} 11/07/2021 10:25:26 - INFO - __main__ - Step 93723: {'lr': 0.00015815638702046354, 'samples': 17994816, 'steps': 93722, 'loss/train': 0.9987938404083252} 11/07/2021 10:25:27 - INFO - __main__ - Step 93724: {'lr': 0.00015815145138087336, 'samples': 17995008, 'steps': 93723, 'loss/train': 1.7110291719436646} 11/07/2021 10:25:27 - INFO - __main__ - Step 93725: {'lr': 0.00015814651578266873, 'samples': 17995200, 'steps': 93724, 'loss/train': 1.774209976196289} 11/07/2021 10:25:27 - INFO - __main__ - Step 93726: {'lr': 0.00015814158022585184, 'samples': 17995392, 'steps': 93725, 'loss/train': 1.6220910549163818} 11/07/2021 10:25:28 - INFO - __main__ - Step 93727: {'lr': 0.00015813664471042498, 'samples': 17995584, 'steps': 93726, 'loss/train': 1.3047436475753784} 11/07/2021 10:25:29 - INFO - __main__ - Step 93728: {'lr': 0.00015813170923639042, 'samples': 17995776, 'steps': 93727, 'loss/train': 1.3754655122756958} 11/07/2021 10:25:29 - INFO - __main__ - Step 93729: {'lr': 0.00015812677380375019, 'samples': 17995968, 'steps': 93728, 'loss/train': 1.194738507270813} 11/07/2021 10:25:30 - INFO - __main__ - Step 93730: {'lr': 0.0001581218384125066, 'samples': 17996160, 'steps': 93729, 'loss/train': 1.3298895359039307} 11/07/2021 10:25:30 - INFO - __main__ - Step 93731: {'lr': 0.00015811690306266187, 'samples': 17996352, 'steps': 93730, 'loss/train': 1.2371900081634521} 11/07/2021 10:25:30 - INFO - __main__ - Step 93732: {'lr': 0.0001581119677542183, 'samples': 17996544, 'steps': 93731, 'loss/train': 1.3667882680892944} 11/07/2021 10:25:31 - INFO - __main__ - Step 93733: {'lr': 0.00015810703248717804, 'samples': 17996736, 'steps': 93732, 'loss/train': 1.4895650148391724} 11/07/2021 10:25:32 - INFO - __main__ - Step 93734: {'lr': 0.00015810209726154333, 'samples': 17996928, 'steps': 93733, 'loss/train': 1.5195062160491943} 11/07/2021 10:25:32 - INFO - __main__ - Step 93735: {'lr': 0.00015809716207731639, 'samples': 17997120, 'steps': 93734, 'loss/train': 1.4734159708023071} 11/07/2021 10:25:32 - INFO - __main__ - Step 93736: {'lr': 0.00015809222693449943, 'samples': 17997312, 'steps': 93735, 'loss/train': 1.1766510009765625} 11/07/2021 10:25:33 - INFO - __main__ - Step 93737: {'lr': 0.00015808729183309472, 'samples': 17997504, 'steps': 93736, 'loss/train': 1.5640254020690918} 11/07/2021 10:25:33 - INFO - __main__ - Step 93738: {'lr': 0.00015808235677310448, 'samples': 17997696, 'steps': 93737, 'loss/train': 1.3538600206375122} 11/07/2021 10:25:34 - INFO - __main__ - Step 93739: {'lr': 0.0001580774217545309, 'samples': 17997888, 'steps': 93738, 'loss/train': 1.4431771039962769} 11/07/2021 10:25:35 - INFO - __main__ - Step 93740: {'lr': 0.00015807248677737618, 'samples': 17998080, 'steps': 93739, 'loss/train': 1.1575485467910767} 11/07/2021 10:25:35 - INFO - __main__ - Step 93741: {'lr': 0.00015806755184164268, 'samples': 17998272, 'steps': 93740, 'loss/train': 0.8976739645004272} 11/07/2021 10:25:35 - INFO - __main__ - Step 93742: {'lr': 0.0001580626169473325, 'samples': 17998464, 'steps': 93741, 'loss/train': 1.5232120752334595} 11/07/2021 10:25:36 - INFO - __main__ - Step 93743: {'lr': 0.0001580576820944478, 'samples': 17998656, 'steps': 93742, 'loss/train': 1.1626477241516113} 11/07/2021 10:25:37 - INFO - __main__ - Step 93744: {'lr': 0.00015805274728299096, 'samples': 17998848, 'steps': 93743, 'loss/train': 1.3675276041030884} 11/07/2021 10:25:37 - INFO - __main__ - Step 93745: {'lr': 0.00015804781251296408, 'samples': 17999040, 'steps': 93744, 'loss/train': 1.2893092632293701} 11/07/2021 10:25:37 - INFO - __main__ - Step 93746: {'lr': 0.00015804287778436947, 'samples': 17999232, 'steps': 93745, 'loss/train': 1.4099785089492798} 11/07/2021 10:25:38 - INFO - __main__ - Step 93747: {'lr': 0.00015803794309720927, 'samples': 17999424, 'steps': 93746, 'loss/train': 1.672642707824707} 11/07/2021 10:25:38 - INFO - __main__ - Step 93748: {'lr': 0.0001580330084514858, 'samples': 17999616, 'steps': 93747, 'loss/train': 2.961601972579956} 11/07/2021 10:25:38 - INFO - __main__ - Step 93749: {'lr': 0.00015802807384720125, 'samples': 17999808, 'steps': 93748, 'loss/train': 1.1657698154449463} 11/07/2021 10:25:39 - INFO - __main__ - Step 93750: {'lr': 0.00015802313928435778, 'samples': 18000000, 'steps': 93749, 'loss/train': 1.6503757238388062} 11/07/2021 10:25:40 - INFO - __main__ - Step 93751: {'lr': 0.0001580182047629577, 'samples': 18000192, 'steps': 93750, 'loss/train': 1.359389305114746} 11/07/2021 10:25:40 - INFO - __main__ - Step 93752: {'lr': 0.0001580132702830032, 'samples': 18000384, 'steps': 93751, 'loss/train': 1.551836609840393} 11/07/2021 10:25:40 - INFO - __main__ - Step 93753: {'lr': 0.00015800833584449654, 'samples': 18000576, 'steps': 93752, 'loss/train': 1.1517324447631836} 11/07/2021 10:25:41 - INFO - __main__ - Step 93754: {'lr': 0.00015800340144743984, 'samples': 18000768, 'steps': 93753, 'loss/train': 1.3097331523895264} 11/07/2021 10:25:42 - INFO - __main__ - Step 93755: {'lr': 0.00015799846709183547, 'samples': 18000960, 'steps': 93754, 'loss/train': 1.7007554769515991} 11/07/2021 10:25:42 - INFO - __main__ - Step 93756: {'lr': 0.00015799353277768546, 'samples': 18001152, 'steps': 93755, 'loss/train': 1.633212685585022} 11/07/2021 10:25:43 - INFO - __main__ - Step 93757: {'lr': 0.0001579885985049922, 'samples': 18001344, 'steps': 93756, 'loss/train': 1.611094355583191} 11/07/2021 10:25:43 - INFO - __main__ - Step 93758: {'lr': 0.00015798366427375785, 'samples': 18001536, 'steps': 93757, 'loss/train': 1.2085425853729248} 11/07/2021 10:25:43 - INFO - __main__ - Step 93759: {'lr': 0.0001579787300839846, 'samples': 18001728, 'steps': 93758, 'loss/train': 1.3026208877563477} 11/07/2021 10:25:44 - INFO - __main__ - Step 93760: {'lr': 0.0001579737959356748, 'samples': 18001920, 'steps': 93759, 'loss/train': 1.53178071975708} 11/07/2021 10:25:45 - INFO - __main__ - Step 93761: {'lr': 0.00015796886182883053, 'samples': 18002112, 'steps': 93760, 'loss/train': 1.9715888500213623} 11/07/2021 10:25:45 - INFO - __main__ - Step 93762: {'lr': 0.00015796392776345412, 'samples': 18002304, 'steps': 93761, 'loss/train': 1.293910264968872} 11/07/2021 10:25:45 - INFO - __main__ - Step 93763: {'lr': 0.0001579589937395477, 'samples': 18002496, 'steps': 93762, 'loss/train': 1.3014581203460693} 11/07/2021 10:25:46 - INFO - __main__ - Step 93764: {'lr': 0.0001579540597571135, 'samples': 18002688, 'steps': 93763, 'loss/train': 1.4446425437927246} 11/07/2021 10:25:47 - INFO - __main__ - Step 93765: {'lr': 0.00015794912581615383, 'samples': 18002880, 'steps': 93764, 'loss/train': 1.4988428354263306} 11/07/2021 10:25:47 - INFO - __main__ - Step 93766: {'lr': 0.00015794419191667087, 'samples': 18003072, 'steps': 93765, 'loss/train': 1.128638505935669} 11/07/2021 10:25:47 - INFO - __main__ - Step 93767: {'lr': 0.00015793925805866684, 'samples': 18003264, 'steps': 93766, 'loss/train': 1.5688177347183228} 11/07/2021 10:25:48 - INFO - __main__ - Step 93768: {'lr': 0.0001579343242421439, 'samples': 18003456, 'steps': 93767, 'loss/train': 1.931429386138916} 11/07/2021 10:25:48 - INFO - __main__ - Step 93769: {'lr': 0.0001579293904671044, 'samples': 18003648, 'steps': 93768, 'loss/train': 1.7112497091293335} 11/07/2021 10:25:49 - INFO - __main__ - Step 93770: {'lr': 0.0001579244567335505, 'samples': 18003840, 'steps': 93769, 'loss/train': 1.6947473287582397} 11/07/2021 10:25:49 - INFO - __main__ - Step 93771: {'lr': 0.00015791952304148438, 'samples': 18004032, 'steps': 93770, 'loss/train': 1.0346083641052246} 11/07/2021 10:25:50 - INFO - __main__ - Step 93772: {'lr': 0.0001579145893909083, 'samples': 18004224, 'steps': 93771, 'loss/train': 1.5943677425384521} 11/07/2021 10:25:50 - INFO - __main__ - Step 93773: {'lr': 0.00015790965578182456, 'samples': 18004416, 'steps': 93772, 'loss/train': 1.5596562623977661} 11/07/2021 10:25:51 - INFO - __main__ - Step 93774: {'lr': 0.00015790472221423525, 'samples': 18004608, 'steps': 93773, 'loss/train': 1.8329639434814453} 11/07/2021 10:25:52 - INFO - __main__ - Step 93775: {'lr': 0.00015789978868814265, 'samples': 18004800, 'steps': 93774, 'loss/train': 1.4313106536865234} 11/07/2021 10:25:52 - INFO - __main__ - Step 93776: {'lr': 0.00015789485520354896, 'samples': 18004992, 'steps': 93775, 'loss/train': 1.3837556838989258} 11/07/2021 10:25:52 - INFO - __main__ - Step 93777: {'lr': 0.00015788992176045643, 'samples': 18005184, 'steps': 93776, 'loss/train': 1.1527583599090576} 11/07/2021 10:25:53 - INFO - __main__ - Step 93778: {'lr': 0.0001578849883588673, 'samples': 18005376, 'steps': 93777, 'loss/train': 1.0597903728485107} 11/07/2021 10:25:53 - INFO - __main__ - Step 93779: {'lr': 0.00015788005499878377, 'samples': 18005568, 'steps': 93778, 'loss/train': 1.6593722105026245} 11/07/2021 10:25:54 - INFO - __main__ - Step 93780: {'lr': 0.00015787512168020807, 'samples': 18005760, 'steps': 93779, 'loss/train': 1.6091891527175903} 11/07/2021 10:25:54 - INFO - __main__ - Step 93781: {'lr': 0.00015787018840314238, 'samples': 18005952, 'steps': 93780, 'loss/train': 0.6423577070236206} 11/07/2021 10:25:55 - INFO - __main__ - Step 93782: {'lr': 0.00015786525516758905, 'samples': 18006144, 'steps': 93781, 'loss/train': 1.0565263032913208} 11/07/2021 10:25:55 - INFO - __main__ - Step 93783: {'lr': 0.00015786032197355015, 'samples': 18006336, 'steps': 93782, 'loss/train': 0.9091300368309021} 11/07/2021 10:25:55 - INFO - __main__ - Step 93784: {'lr': 0.00015785538882102804, 'samples': 18006528, 'steps': 93783, 'loss/train': 1.2870779037475586} 11/07/2021 10:25:56 - INFO - __main__ - Step 93785: {'lr': 0.00015785045571002483, 'samples': 18006720, 'steps': 93784, 'loss/train': 0.6073445677757263} 11/07/2021 10:25:57 - INFO - __main__ - Step 93786: {'lr': 0.00015784552264054273, 'samples': 18006912, 'steps': 93785, 'loss/train': 3.0242369174957275} 11/07/2021 10:25:57 - INFO - __main__ - Step 93787: {'lr': 0.000157840589612584, 'samples': 18007104, 'steps': 93786, 'loss/train': 1.8610985279083252} 11/07/2021 10:25:58 - INFO - __main__ - Step 93788: {'lr': 0.00015783565662615096, 'samples': 18007296, 'steps': 93787, 'loss/train': 0.4957357347011566} 11/07/2021 10:25:58 - INFO - __main__ - Step 93789: {'lr': 0.0001578307236812457, 'samples': 18007488, 'steps': 93788, 'loss/train': 1.337988018989563} 11/07/2021 10:25:58 - INFO - __main__ - Step 93790: {'lr': 0.0001578257907778705, 'samples': 18007680, 'steps': 93789, 'loss/train': 1.016075611114502} 11/07/2021 10:25:59 - INFO - __main__ - Step 93791: {'lr': 0.00015782085791602758, 'samples': 18007872, 'steps': 93790, 'loss/train': 2.632309675216675} 11/07/2021 10:26:00 - INFO - __main__ - Step 93792: {'lr': 0.0001578159250957192, 'samples': 18008064, 'steps': 93791, 'loss/train': 1.5077277421951294} 11/07/2021 10:26:00 - INFO - __main__ - Step 93793: {'lr': 0.00015781099231694745, 'samples': 18008256, 'steps': 93792, 'loss/train': 1.0640690326690674} 11/07/2021 10:26:00 - INFO - __main__ - Step 93794: {'lr': 0.00015780605957971472, 'samples': 18008448, 'steps': 93793, 'loss/train': 0.83598393201828} 11/07/2021 10:26:01 - INFO - __main__ - Step 93795: {'lr': 0.00015780112688402312, 'samples': 18008640, 'steps': 93794, 'loss/train': 1.7604948282241821} 11/07/2021 10:26:02 - INFO - __main__ - Step 93796: {'lr': 0.000157796194229875, 'samples': 18008832, 'steps': 93795, 'loss/train': 1.6466941833496094} 11/07/2021 10:26:02 - INFO - __main__ - Step 93797: {'lr': 0.00015779126161727245, 'samples': 18009024, 'steps': 93796, 'loss/train': 1.1898562908172607} 11/07/2021 10:26:02 - INFO - __main__ - Step 93798: {'lr': 0.0001577863290462177, 'samples': 18009216, 'steps': 93797, 'loss/train': 1.3948744535446167} 11/07/2021 10:26:03 - INFO - __main__ - Step 93799: {'lr': 0.000157781396516713, 'samples': 18009408, 'steps': 93798, 'loss/train': 0.7404584884643555} 11/07/2021 10:26:03 - INFO - __main__ - Step 93800: {'lr': 0.00015777646402876058, 'samples': 18009600, 'steps': 93799, 'loss/train': 1.402199387550354} 11/07/2021 10:26:04 - INFO - __main__ - Step 93801: {'lr': 0.00015777153158236267, 'samples': 18009792, 'steps': 93800, 'loss/train': 1.328943133354187} 11/07/2021 10:26:04 - INFO - __main__ - Step 93802: {'lr': 0.00015776659917752148, 'samples': 18009984, 'steps': 93801, 'loss/train': 1.4821001291275024} 11/07/2021 10:26:05 - INFO - __main__ - Step 93803: {'lr': 0.00015776166681423927, 'samples': 18010176, 'steps': 93802, 'loss/train': 1.2886312007904053} 11/07/2021 10:26:05 - INFO - __main__ - Step 93804: {'lr': 0.00015775673449251816, 'samples': 18010368, 'steps': 93803, 'loss/train': 5.801753044128418} 11/07/2021 10:26:05 - INFO - __main__ - Step 93805: {'lr': 0.00015775180221236048, 'samples': 18010560, 'steps': 93804, 'loss/train': 0.728422224521637} 11/07/2021 10:26:06 - INFO - __main__ - Step 93806: {'lr': 0.0001577468699737684, 'samples': 18010752, 'steps': 93805, 'loss/train': 0.10064979642629623} 11/07/2021 10:26:07 - INFO - __main__ - Step 93807: {'lr': 0.0001577419377767442, 'samples': 18010944, 'steps': 93806, 'loss/train': 1.6417043209075928} 11/07/2021 10:26:07 - INFO - __main__ - Step 93808: {'lr': 0.00015773700562129, 'samples': 18011136, 'steps': 93807, 'loss/train': 1.5194655656814575} 11/07/2021 10:26:08 - INFO - __main__ - Step 93809: {'lr': 0.00015773207350740825, 'samples': 18011328, 'steps': 93808, 'loss/train': 1.2136093378067017} 11/07/2021 10:26:08 - INFO - __main__ - Step 93810: {'lr': 0.00015772714143510086, 'samples': 18011520, 'steps': 93809, 'loss/train': 0.879173755645752} 11/07/2021 10:26:08 - INFO - __main__ - Step 93811: {'lr': 0.0001577222094043702, 'samples': 18011712, 'steps': 93810, 'loss/train': 1.1692790985107422} 11/07/2021 10:26:09 - INFO - __main__ - Step 93812: {'lr': 0.0001577172774152185, 'samples': 18011904, 'steps': 93811, 'loss/train': 1.6956158876419067} 11/07/2021 10:26:10 - INFO - __main__ - Step 93813: {'lr': 0.00015771234546764796, 'samples': 18012096, 'steps': 93812, 'loss/train': 1.1151379346847534} 11/07/2021 10:26:10 - INFO - __main__ - Step 93814: {'lr': 0.0001577074135616608, 'samples': 18012288, 'steps': 93813, 'loss/train': 1.6483283042907715} 11/07/2021 10:26:10 - INFO - __main__ - Step 93815: {'lr': 0.00015770248169725927, 'samples': 18012480, 'steps': 93814, 'loss/train': 0.9818071722984314} 11/07/2021 10:26:11 - INFO - __main__ - Step 93816: {'lr': 0.00015769754987444556, 'samples': 18012672, 'steps': 93815, 'loss/train': 1.2130262851715088} 11/07/2021 10:26:12 - INFO - __main__ - Step 93817: {'lr': 0.00015769261809322194, 'samples': 18012864, 'steps': 93816, 'loss/train': 1.1867073774337769} 11/07/2021 10:26:12 - INFO - __main__ - Step 93818: {'lr': 0.0001576876863535906, 'samples': 18013056, 'steps': 93817, 'loss/train': 1.6901947259902954} 11/07/2021 10:26:13 - INFO - __main__ - Step 93819: {'lr': 0.00015768275465555376, 'samples': 18013248, 'steps': 93818, 'loss/train': 1.5530239343643188} 11/07/2021 10:26:13 - INFO - __main__ - Step 93820: {'lr': 0.00015767782299911366, 'samples': 18013440, 'steps': 93819, 'loss/train': 0.976551353931427} 11/07/2021 10:26:13 - INFO - __main__ - Step 93821: {'lr': 0.00015767289138427247, 'samples': 18013632, 'steps': 93820, 'loss/train': 1.391677737236023} 11/07/2021 10:26:14 - INFO - __main__ - Step 93822: {'lr': 0.00015766795981103247, 'samples': 18013824, 'steps': 93821, 'loss/train': 0.45403799414634705} 11/07/2021 10:26:15 - INFO - __main__ - Step 93823: {'lr': 0.00015766302827939594, 'samples': 18014016, 'steps': 93822, 'loss/train': 1.2289509773254395} 11/07/2021 10:26:15 - INFO - __main__ - Step 93824: {'lr': 0.00015765809678936496, 'samples': 18014208, 'steps': 93823, 'loss/train': 1.4804757833480835} 11/07/2021 10:26:15 - INFO - __main__ - Step 93825: {'lr': 0.00015765316534094181, 'samples': 18014400, 'steps': 93824, 'loss/train': 1.5038379430770874} 11/07/2021 10:26:16 - INFO - __main__ - Step 93826: {'lr': 0.0001576482339341287, 'samples': 18014592, 'steps': 93825, 'loss/train': 1.525024175643921} 11/07/2021 10:26:16 - INFO - __main__ - Step 93827: {'lr': 0.0001576433025689279, 'samples': 18014784, 'steps': 93826, 'loss/train': 1.250496745109558} 11/07/2021 10:26:17 - INFO - __main__ - Step 93828: {'lr': 0.00015763837124534158, 'samples': 18014976, 'steps': 93827, 'loss/train': 0.8176856637001038} 11/07/2021 10:26:18 - INFO - __main__ - Step 93829: {'lr': 0.00015763343996337198, 'samples': 18015168, 'steps': 93828, 'loss/train': 1.2962327003479004} 11/07/2021 10:26:18 - INFO - __main__ - Step 93830: {'lr': 0.00015762850872302135, 'samples': 18015360, 'steps': 93829, 'loss/train': 1.4327753782272339} 11/07/2021 10:26:18 - INFO - __main__ - Step 93831: {'lr': 0.00015762357752429186, 'samples': 18015552, 'steps': 93830, 'loss/train': 1.2599871158599854} 11/07/2021 10:26:19 - INFO - __main__ - Step 93832: {'lr': 0.00015761864636718576, 'samples': 18015744, 'steps': 93831, 'loss/train': 1.7494845390319824} 11/07/2021 10:26:20 - INFO - __main__ - Step 93833: {'lr': 0.00015761371525170533, 'samples': 18015936, 'steps': 93832, 'loss/train': 1.3990832567214966} 11/07/2021 10:26:20 - INFO - __main__ - Step 93834: {'lr': 0.00015760878417785267, 'samples': 18016128, 'steps': 93833, 'loss/train': 1.499989628791809} 11/07/2021 10:26:20 - INFO - __main__ - Step 93835: {'lr': 0.00015760385314563007, 'samples': 18016320, 'steps': 93834, 'loss/train': 1.3601524829864502} 11/07/2021 10:26:21 - INFO - __main__ - Step 93836: {'lr': 0.0001575989221550399, 'samples': 18016512, 'steps': 93835, 'loss/train': 1.5555404424667358} 11/07/2021 10:26:21 - INFO - __main__ - Step 93837: {'lr': 0.0001575939912060841, 'samples': 18016704, 'steps': 93836, 'loss/train': 1.3399769067764282} 11/07/2021 10:26:22 - INFO - __main__ - Step 93838: {'lr': 0.000157589060298765, 'samples': 18016896, 'steps': 93837, 'loss/train': 1.290550947189331} 11/07/2021 10:26:22 - INFO - __main__ - Step 93839: {'lr': 0.00015758412943308486, 'samples': 18017088, 'steps': 93838, 'loss/train': 1.1995640993118286} 11/07/2021 10:26:23 - INFO - __main__ - Step 93840: {'lr': 0.00015757919860904588, 'samples': 18017280, 'steps': 93839, 'loss/train': 0.6965512633323669} 11/07/2021 10:26:23 - INFO - __main__ - Step 93841: {'lr': 0.0001575742678266503, 'samples': 18017472, 'steps': 93840, 'loss/train': 1.7023608684539795} 11/07/2021 10:26:23 - INFO - __main__ - Step 93842: {'lr': 0.00015756933708590033, 'samples': 18017664, 'steps': 93841, 'loss/train': 0.8918753862380981} 11/07/2021 10:26:24 - INFO - __main__ - Step 93843: {'lr': 0.00015756440638679817, 'samples': 18017856, 'steps': 93842, 'loss/train': 1.4483221769332886} 11/07/2021 10:26:25 - INFO - __main__ - Step 93844: {'lr': 0.0001575594757293461, 'samples': 18018048, 'steps': 93843, 'loss/train': 1.3859918117523193} 11/07/2021 10:26:25 - INFO - __main__ - Step 93845: {'lr': 0.00015755454511354625, 'samples': 18018240, 'steps': 93844, 'loss/train': 0.38928744196891785} 11/07/2021 10:26:25 - INFO - __main__ - Step 93846: {'lr': 0.0001575496145394009, 'samples': 18018432, 'steps': 93845, 'loss/train': 1.099596619606018} 11/07/2021 10:26:26 - INFO - __main__ - Step 93847: {'lr': 0.0001575446840069123, 'samples': 18018624, 'steps': 93846, 'loss/train': 0.9288237690925598} 11/07/2021 10:26:27 - INFO - __main__ - Step 93848: {'lr': 0.00015753975351608262, 'samples': 18018816, 'steps': 93847, 'loss/train': 1.6218574047088623} 11/07/2021 10:26:27 - INFO - __main__ - Step 93849: {'lr': 0.00015753482306691424, 'samples': 18019008, 'steps': 93848, 'loss/train': 1.2238517999649048} 11/07/2021 10:26:28 - INFO - __main__ - Step 93850: {'lr': 0.0001575298926594091, 'samples': 18019200, 'steps': 93849, 'loss/train': 1.2920383214950562} 11/07/2021 10:26:28 - INFO - __main__ - Step 93851: {'lr': 0.00015752496229356957, 'samples': 18019392, 'steps': 93850, 'loss/train': 1.1563693284988403} 11/07/2021 10:26:28 - INFO - __main__ - Step 93852: {'lr': 0.00015752003196939788, 'samples': 18019584, 'steps': 93851, 'loss/train': 1.2934354543685913} 11/07/2021 10:26:29 - INFO - __main__ - Step 93853: {'lr': 0.00015751510168689623, 'samples': 18019776, 'steps': 93852, 'loss/train': 1.4778962135314941} 11/07/2021 10:26:30 - INFO - __main__ - Step 93854: {'lr': 0.00015751017144606682, 'samples': 18019968, 'steps': 93853, 'loss/train': 1.2813923358917236} 11/07/2021 10:26:30 - INFO - __main__ - Step 93855: {'lr': 0.00015750524124691196, 'samples': 18020160, 'steps': 93854, 'loss/train': 1.3544812202453613} 11/07/2021 10:26:30 - INFO - __main__ - Step 93856: {'lr': 0.00015750031108943373, 'samples': 18020352, 'steps': 93855, 'loss/train': 1.5310007333755493} 11/07/2021 10:26:31 - INFO - __main__ - Step 93857: {'lr': 0.00015749538097363454, 'samples': 18020544, 'steps': 93856, 'loss/train': 1.4011750221252441} 11/07/2021 10:26:31 - INFO - __main__ - Step 93858: {'lr': 0.0001574904508995164, 'samples': 18020736, 'steps': 93857, 'loss/train': 1.179343581199646} 11/07/2021 10:26:32 - INFO - __main__ - Step 93859: {'lr': 0.00015748552086708169, 'samples': 18020928, 'steps': 93858, 'loss/train': 1.7045865058898926} 11/07/2021 10:26:33 - INFO - __main__ - Step 93860: {'lr': 0.00015748059087633255, 'samples': 18021120, 'steps': 93859, 'loss/train': 1.778473973274231} 11/07/2021 10:26:33 - INFO - __main__ - Step 93861: {'lr': 0.00015747566092727126, 'samples': 18021312, 'steps': 93860, 'loss/train': 1.3628944158554077} 11/07/2021 10:26:33 - INFO - __main__ - Step 93862: {'lr': 0.00015747073101990002, 'samples': 18021504, 'steps': 93861, 'loss/train': 1.181348443031311} 11/07/2021 10:26:34 - INFO - __main__ - Step 93863: {'lr': 0.00015746580115422106, 'samples': 18021696, 'steps': 93862, 'loss/train': 1.742024540901184} 11/07/2021 10:26:35 - INFO - __main__ - Step 93864: {'lr': 0.00015746087133023656, 'samples': 18021888, 'steps': 93863, 'loss/train': 0.5544937252998352} 11/07/2021 10:26:35 - INFO - __main__ - Step 93865: {'lr': 0.00015745594154794874, 'samples': 18022080, 'steps': 93864, 'loss/train': 1.5664384365081787} 11/07/2021 10:26:35 - INFO - __main__ - Step 93866: {'lr': 0.00015745101180735983, 'samples': 18022272, 'steps': 93865, 'loss/train': 1.331313967704773} 11/07/2021 10:26:36 - INFO - __main__ - Step 93867: {'lr': 0.0001574460821084721, 'samples': 18022464, 'steps': 93866, 'loss/train': 1.1309937238693237} 11/07/2021 10:26:36 - INFO - __main__ - Step 93868: {'lr': 0.0001574411524512877, 'samples': 18022656, 'steps': 93867, 'loss/train': 1.274497389793396} 11/07/2021 10:26:37 - INFO - __main__ - Step 93869: {'lr': 0.0001574362228358089, 'samples': 18022848, 'steps': 93868, 'loss/train': 1.6951512098312378} 11/07/2021 10:26:38 - INFO - __main__ - Step 93870: {'lr': 0.00015743129326203792, 'samples': 18023040, 'steps': 93869, 'loss/train': 1.5803457498550415} 11/07/2021 10:26:38 - INFO - __main__ - Step 93871: {'lr': 0.00015742636372997694, 'samples': 18023232, 'steps': 93870, 'loss/train': 1.5771656036376953} 11/07/2021 10:26:38 - INFO - __main__ - Step 93872: {'lr': 0.00015742143423962823, 'samples': 18023424, 'steps': 93871, 'loss/train': 0.20295538008213043} 11/07/2021 10:26:39 - INFO - __main__ - Step 93873: {'lr': 0.000157416504790994, 'samples': 18023616, 'steps': 93872, 'loss/train': 0.18744753301143646} 11/07/2021 10:26:40 - INFO - __main__ - Step 93874: {'lr': 0.00015741157538407647, 'samples': 18023808, 'steps': 93873, 'loss/train': 1.1101993322372437} 11/07/2021 10:26:40 - INFO - __main__ - Step 93875: {'lr': 0.00015740664601887792, 'samples': 18024000, 'steps': 93874, 'loss/train': 1.660034418106079} 11/07/2021 10:26:41 - INFO - __main__ - Step 93876: {'lr': 0.00015740171669540047, 'samples': 18024192, 'steps': 93875, 'loss/train': 1.9222530126571655} 11/07/2021 10:26:41 - INFO - __main__ - Step 93877: {'lr': 0.00015739678741364635, 'samples': 18024384, 'steps': 93876, 'loss/train': 1.5059322118759155} 11/07/2021 10:26:41 - INFO - __main__ - Step 93878: {'lr': 0.0001573918581736178, 'samples': 18024576, 'steps': 93877, 'loss/train': 1.829916000366211} 11/07/2021 10:26:42 - INFO - __main__ - Step 93879: {'lr': 0.00015738692897531706, 'samples': 18024768, 'steps': 93878, 'loss/train': 0.19133485853672028} 11/07/2021 10:26:43 - INFO - __main__ - Step 93880: {'lr': 0.00015738199981874635, 'samples': 18024960, 'steps': 93879, 'loss/train': 1.255771517753601} 11/07/2021 10:26:43 - INFO - __main__ - Step 93881: {'lr': 0.00015737707070390784, 'samples': 18025152, 'steps': 93880, 'loss/train': 1.1032778024673462} 11/07/2021 10:26:44 - INFO - __main__ - Step 93882: {'lr': 0.00015737214163080382, 'samples': 18025344, 'steps': 93881, 'loss/train': 1.1438597440719604} 11/07/2021 10:26:44 - INFO - __main__ - Step 93883: {'lr': 0.00015736721259943648, 'samples': 18025536, 'steps': 93882, 'loss/train': 1.9432874917984009} 11/07/2021 10:26:44 - INFO - __main__ - Step 93884: {'lr': 0.00015736228360980803, 'samples': 18025728, 'steps': 93883, 'loss/train': 0.9504581689834595} 11/07/2021 10:26:45 - INFO - __main__ - Step 93885: {'lr': 0.00015735735466192074, 'samples': 18025920, 'steps': 93884, 'loss/train': 1.2927827835083008} 11/07/2021 10:26:46 - INFO - __main__ - Step 93886: {'lr': 0.00015735242575577683, 'samples': 18026112, 'steps': 93885, 'loss/train': 1.4808294773101807} 11/07/2021 10:26:46 - INFO - __main__ - Step 93887: {'lr': 0.00015734749689137842, 'samples': 18026304, 'steps': 93886, 'loss/train': 1.4391052722930908} 11/07/2021 10:26:46 - INFO - __main__ - Step 93888: {'lr': 0.0001573425680687278, 'samples': 18026496, 'steps': 93887, 'loss/train': 1.369438886642456} 11/07/2021 10:26:47 - INFO - __main__ - Step 93889: {'lr': 0.00015733763928782723, 'samples': 18026688, 'steps': 93888, 'loss/train': 1.4655849933624268} 11/07/2021 10:26:48 - INFO - __main__ - Step 93890: {'lr': 0.00015733271054867889, 'samples': 18026880, 'steps': 93889, 'loss/train': 1.3916592597961426} 11/07/2021 10:26:48 - INFO - __main__ - Step 93891: {'lr': 0.000157327781851285, 'samples': 18027072, 'steps': 93890, 'loss/train': 1.5159516334533691} 11/07/2021 10:26:48 - INFO - __main__ - Step 93892: {'lr': 0.00015732285319564773, 'samples': 18027264, 'steps': 93891, 'loss/train': 1.1493234634399414} 11/07/2021 10:26:49 - INFO - __main__ - Step 93893: {'lr': 0.00015731792458176938, 'samples': 18027456, 'steps': 93892, 'loss/train': 1.3069086074829102} 11/07/2021 10:26:49 - INFO - __main__ - Step 93894: {'lr': 0.00015731299600965214, 'samples': 18027648, 'steps': 93893, 'loss/train': 0.7970845103263855} 11/07/2021 10:26:50 - INFO - __main__ - Step 93895: {'lr': 0.00015730806747929824, 'samples': 18027840, 'steps': 93894, 'loss/train': 1.310730218887329} 11/07/2021 10:26:50 - INFO - __main__ - Step 93896: {'lr': 0.0001573031389907099, 'samples': 18028032, 'steps': 93895, 'loss/train': 1.3133611679077148} 11/07/2021 10:26:51 - INFO - __main__ - Step 93897: {'lr': 0.00015729821054388934, 'samples': 18028224, 'steps': 93896, 'loss/train': 1.2352169752120972} 11/07/2021 10:26:51 - INFO - __main__ - Step 93898: {'lr': 0.00015729328213883877, 'samples': 18028416, 'steps': 93897, 'loss/train': 1.4548518657684326} 11/07/2021 10:26:52 - INFO - __main__ - Step 93899: {'lr': 0.0001572883537755604, 'samples': 18028608, 'steps': 93898, 'loss/train': 1.295471429824829} 11/07/2021 10:26:52 - INFO - __main__ - Step 93900: {'lr': 0.00015728342545405648, 'samples': 18028800, 'steps': 93899, 'loss/train': 1.3536206483840942} 11/07/2021 10:26:53 - INFO - __main__ - Step 93901: {'lr': 0.00015727849717432922, 'samples': 18028992, 'steps': 93900, 'loss/train': 1.131102204322815} 11/07/2021 10:26:53 - INFO - __main__ - Step 93902: {'lr': 0.00015727356893638082, 'samples': 18029184, 'steps': 93901, 'loss/train': 0.652558445930481} 11/07/2021 10:26:54 - INFO - __main__ - Step 93903: {'lr': 0.00015726864074021358, 'samples': 18029376, 'steps': 93902, 'loss/train': 1.8208152055740356} 11/07/2021 10:26:54 - INFO - __main__ - Step 93904: {'lr': 0.0001572637125858296, 'samples': 18029568, 'steps': 93903, 'loss/train': 1.6866692304611206} 11/07/2021 10:26:54 - INFO - __main__ - Step 93905: {'lr': 0.00015725878447323116, 'samples': 18029760, 'steps': 93904, 'loss/train': 1.1424415111541748} 11/07/2021 10:26:55 - INFO - __main__ - Step 93906: {'lr': 0.0001572538564024205, 'samples': 18029952, 'steps': 93905, 'loss/train': 1.4107345342636108} 11/07/2021 10:26:56 - INFO - __main__ - Step 93907: {'lr': 0.0001572489283733998, 'samples': 18030144, 'steps': 93906, 'loss/train': 1.5235559940338135} 11/07/2021 10:26:56 - INFO - __main__ - Step 93908: {'lr': 0.00015724400038617136, 'samples': 18030336, 'steps': 93907, 'loss/train': 1.4597550630569458} 11/07/2021 10:26:56 - INFO - __main__ - Step 93909: {'lr': 0.0001572390724407373, 'samples': 18030528, 'steps': 93908, 'loss/train': 1.2722090482711792} 11/07/2021 10:26:57 - INFO - __main__ - Step 93910: {'lr': 0.00015723414453709986, 'samples': 18030720, 'steps': 93909, 'loss/train': 1.512765645980835} 11/07/2021 10:26:58 - INFO - __main__ - Step 93911: {'lr': 0.0001572292166752613, 'samples': 18030912, 'steps': 93910, 'loss/train': 1.4957220554351807} 11/07/2021 10:26:58 - INFO - __main__ - Step 93912: {'lr': 0.00015722428885522384, 'samples': 18031104, 'steps': 93911, 'loss/train': 1.4924150705337524} 11/07/2021 10:26:58 - INFO - __main__ - Step 93913: {'lr': 0.00015721936107698965, 'samples': 18031296, 'steps': 93912, 'loss/train': 0.6211670637130737} 11/07/2021 10:26:59 - INFO - __main__ - Step 93914: {'lr': 0.000157214433340561, 'samples': 18031488, 'steps': 93913, 'loss/train': 1.034873127937317} 11/07/2021 10:26:59 - INFO - __main__ - Step 93915: {'lr': 0.0001572095056459401, 'samples': 18031680, 'steps': 93914, 'loss/train': 1.2754149436950684} 11/07/2021 10:27:00 - INFO - __main__ - Step 93916: {'lr': 0.00015720457799312914, 'samples': 18031872, 'steps': 93915, 'loss/train': 1.3030487298965454} 11/07/2021 10:27:01 - INFO - __main__ - Step 93917: {'lr': 0.00015719965038213043, 'samples': 18032064, 'steps': 93916, 'loss/train': 1.319772720336914} 11/07/2021 10:27:01 - INFO - __main__ - Step 93918: {'lr': 0.00015719472281294612, 'samples': 18032256, 'steps': 93917, 'loss/train': 1.3200913667678833} 11/07/2021 10:27:01 - INFO - __main__ - Step 93919: {'lr': 0.00015718979528557843, 'samples': 18032448, 'steps': 93918, 'loss/train': 1.1535046100616455} 11/07/2021 10:27:02 - INFO - __main__ - Step 93920: {'lr': 0.00015718486780002955, 'samples': 18032640, 'steps': 93919, 'loss/train': 1.6332566738128662} 11/07/2021 10:27:03 - INFO - __main__ - Step 93921: {'lr': 0.00015717994035630174, 'samples': 18032832, 'steps': 93920, 'loss/train': 1.4838591814041138} 11/07/2021 10:27:03 - INFO - __main__ - Step 93922: {'lr': 0.0001571750129543972, 'samples': 18033024, 'steps': 93921, 'loss/train': 1.594446063041687} 11/07/2021 10:27:03 - INFO - __main__ - Step 93923: {'lr': 0.00015717008559431816, 'samples': 18033216, 'steps': 93922, 'loss/train': 1.4400644302368164} 11/07/2021 10:27:04 - INFO - __main__ - Step 93924: {'lr': 0.00015716515827606688, 'samples': 18033408, 'steps': 93923, 'loss/train': 0.5162572860717773} 11/07/2021 10:27:04 - INFO - __main__ - Step 93925: {'lr': 0.00015716023099964554, 'samples': 18033600, 'steps': 93924, 'loss/train': 1.463836908340454} 11/07/2021 10:27:05 - INFO - __main__ - Step 93926: {'lr': 0.00015715530376505637, 'samples': 18033792, 'steps': 93925, 'loss/train': 1.5203438997268677} 11/07/2021 10:27:06 - INFO - __main__ - Step 93927: {'lr': 0.00015715037657230158, 'samples': 18033984, 'steps': 93926, 'loss/train': 1.645376205444336} 11/07/2021 10:27:06 - INFO - __main__ - Step 93928: {'lr': 0.0001571454494213834, 'samples': 18034176, 'steps': 93927, 'loss/train': 1.0869282484054565} 11/07/2021 10:27:06 - INFO - __main__ - Step 93929: {'lr': 0.00015714052231230403, 'samples': 18034368, 'steps': 93928, 'loss/train': 1.6966975927352905} 11/07/2021 10:27:07 - INFO - __main__ - Step 93930: {'lr': 0.0001571355952450658, 'samples': 18034560, 'steps': 93929, 'loss/train': 1.3790074586868286} 11/07/2021 10:27:07 - INFO - __main__ - Step 93931: {'lr': 0.00015713066821967082, 'samples': 18034752, 'steps': 93930, 'loss/train': 1.3092939853668213} 11/07/2021 10:27:08 - INFO - __main__ - Step 93932: {'lr': 0.0001571257412361212, 'samples': 18034944, 'steps': 93931, 'loss/train': 1.6142915487289429} 11/07/2021 10:27:08 - INFO - __main__ - Step 93933: {'lr': 0.00015712081429441937, 'samples': 18035136, 'steps': 93932, 'loss/train': 0.6472116112709045} 11/07/2021 10:27:09 - INFO - __main__ - Step 93934: {'lr': 0.00015711588739456749, 'samples': 18035328, 'steps': 93933, 'loss/train': 1.818894624710083} 11/07/2021 10:27:09 - INFO - __main__ - Step 93935: {'lr': 0.0001571109605365677, 'samples': 18035520, 'steps': 93934, 'loss/train': 1.6552791595458984} 11/07/2021 10:27:09 - INFO - __main__ - Step 93936: {'lr': 0.00015710603372042232, 'samples': 18035712, 'steps': 93935, 'loss/train': 1.2937393188476562} 11/07/2021 10:27:10 - INFO - __main__ - Step 93937: {'lr': 0.0001571011069461335, 'samples': 18035904, 'steps': 93936, 'loss/train': 1.2997149229049683} 11/07/2021 10:27:11 - INFO - __main__ - Step 93938: {'lr': 0.00015709618021370349, 'samples': 18036096, 'steps': 93937, 'loss/train': 1.4829633235931396} 11/07/2021 10:27:11 - INFO - __main__ - Step 93939: {'lr': 0.00015709125352313452, 'samples': 18036288, 'steps': 93938, 'loss/train': 1.5647720098495483} 11/07/2021 10:27:11 - INFO - __main__ - Step 93940: {'lr': 0.00015708632687442878, 'samples': 18036480, 'steps': 93939, 'loss/train': 1.5140278339385986} 11/07/2021 10:27:12 - INFO - __main__ - Step 93941: {'lr': 0.00015708140026758852, 'samples': 18036672, 'steps': 93940, 'loss/train': 1.2404530048370361} 11/07/2021 10:27:13 - INFO - __main__ - Step 93942: {'lr': 0.00015707647370261595, 'samples': 18036864, 'steps': 93941, 'loss/train': 1.324660301208496} 11/07/2021 10:27:13 - INFO - __main__ - Step 93943: {'lr': 0.00015707154717951326, 'samples': 18037056, 'steps': 93942, 'loss/train': 1.5297679901123047} 11/07/2021 10:27:14 - INFO - __main__ - Step 93944: {'lr': 0.00015706662069828284, 'samples': 18037248, 'steps': 93943, 'loss/train': 1.3743633031845093} 11/07/2021 10:27:14 - INFO - __main__ - Step 93945: {'lr': 0.00015706169425892664, 'samples': 18037440, 'steps': 93944, 'loss/train': 1.2610459327697754} 11/07/2021 10:27:14 - INFO - __main__ - Step 93946: {'lr': 0.00015705676786144702, 'samples': 18037632, 'steps': 93945, 'loss/train': 1.3927253484725952} 11/07/2021 10:27:15 - INFO - __main__ - Step 93947: {'lr': 0.00015705184150584616, 'samples': 18037824, 'steps': 93946, 'loss/train': 1.4281519651412964} 11/07/2021 10:27:16 - INFO - __main__ - Step 93948: {'lr': 0.00015704691519212633, 'samples': 18038016, 'steps': 93947, 'loss/train': 1.1635470390319824} 11/07/2021 10:27:16 - INFO - __main__ - Step 93949: {'lr': 0.00015704198892028972, 'samples': 18038208, 'steps': 93948, 'loss/train': 1.3979966640472412} 11/07/2021 10:27:16 - INFO - __main__ - Step 93950: {'lr': 0.00015703706269033858, 'samples': 18038400, 'steps': 93949, 'loss/train': 1.2188563346862793} 11/07/2021 10:27:17 - INFO - __main__ - Step 93951: {'lr': 0.00015703213650227504, 'samples': 18038592, 'steps': 93950, 'loss/train': 1.517128348350525} 11/07/2021 10:27:18 - INFO - __main__ - Step 93952: {'lr': 0.00015702721035610145, 'samples': 18038784, 'steps': 93951, 'loss/train': 1.7397679090499878} 11/07/2021 10:27:19 - INFO - __main__ - Step 93953: {'lr': 0.00015702228425181993, 'samples': 18038976, 'steps': 93952, 'loss/train': 1.5014806985855103} 11/07/2021 10:27:19 - INFO - __main__ - Step 93954: {'lr': 0.00015701735818943275, 'samples': 18039168, 'steps': 93953, 'loss/train': 1.9555046558380127} 11/07/2021 10:27:19 - INFO - __main__ - Step 93955: {'lr': 0.00015701243216894212, 'samples': 18039360, 'steps': 93954, 'loss/train': 1.4993884563446045} 11/07/2021 10:27:20 - INFO - __main__ - Step 93956: {'lr': 0.00015700750619035024, 'samples': 18039552, 'steps': 93955, 'loss/train': 1.63399076461792} 11/07/2021 10:27:20 - INFO - __main__ - Step 93957: {'lr': 0.00015700258025365944, 'samples': 18039744, 'steps': 93956, 'loss/train': 1.2793656587600708} 11/07/2021 10:27:21 - INFO - __main__ - Step 93958: {'lr': 0.00015699765435887175, 'samples': 18039936, 'steps': 93957, 'loss/train': 1.3090863227844238} 11/07/2021 10:27:22 - INFO - __main__ - Step 93959: {'lr': 0.00015699272850598945, 'samples': 18040128, 'steps': 93958, 'loss/train': 1.2011054754257202} 11/07/2021 10:27:22 - INFO - __main__ - Step 93960: {'lr': 0.00015698780269501485, 'samples': 18040320, 'steps': 93959, 'loss/train': 1.723212480545044} 11/07/2021 10:27:23 - INFO - __main__ - Step 93961: {'lr': 0.00015698287692595005, 'samples': 18040512, 'steps': 93960, 'loss/train': 0.6332077980041504} 11/07/2021 10:27:23 - INFO - __main__ - Step 93962: {'lr': 0.00015697795119879737, 'samples': 18040704, 'steps': 93961, 'loss/train': 1.2540971040725708} 11/07/2021 10:27:23 - INFO - __main__ - Step 93963: {'lr': 0.00015697302551355896, 'samples': 18040896, 'steps': 93962, 'loss/train': 1.3360925912857056} 11/07/2021 10:27:24 - INFO - __main__ - Step 93964: {'lr': 0.0001569680998702371, 'samples': 18041088, 'steps': 93963, 'loss/train': 1.3722106218338013} 11/07/2021 10:27:25 - INFO - __main__ - Step 93965: {'lr': 0.00015696317426883396, 'samples': 18041280, 'steps': 93964, 'loss/train': 1.2705641984939575} 11/07/2021 10:27:25 - INFO - __main__ - Step 93966: {'lr': 0.0001569582487093518, 'samples': 18041472, 'steps': 93965, 'loss/train': 1.7380176782608032} 11/07/2021 10:27:25 - INFO - __main__ - Step 93967: {'lr': 0.00015695332319179279, 'samples': 18041664, 'steps': 93966, 'loss/train': 1.841515064239502} 11/07/2021 10:27:26 - INFO - __main__ - Step 93968: {'lr': 0.0001569483977161592, 'samples': 18041856, 'steps': 93967, 'loss/train': 1.3187578916549683} 11/07/2021 10:27:26 - INFO - __main__ - Step 93969: {'lr': 0.0001569434722824532, 'samples': 18042048, 'steps': 93968, 'loss/train': 1.4125155210494995} 11/07/2021 10:27:27 - INFO - __main__ - Step 93970: {'lr': 0.00015693854689067716, 'samples': 18042240, 'steps': 93969, 'loss/train': 1.14281165599823} 11/07/2021 10:27:27 - INFO - __main__ - Step 93971: {'lr': 0.00015693362154083307, 'samples': 18042432, 'steps': 93970, 'loss/train': 1.5569154024124146} 11/07/2021 10:27:28 - INFO - __main__ - Step 93972: {'lr': 0.00015692869623292326, 'samples': 18042624, 'steps': 93971, 'loss/train': 1.3046122789382935} 11/07/2021 10:27:28 - INFO - __main__ - Step 93973: {'lr': 0.00015692377096694992, 'samples': 18042816, 'steps': 93972, 'loss/train': 1.1529178619384766} 11/07/2021 10:27:28 - INFO - __main__ - Step 93974: {'lr': 0.00015691884574291532, 'samples': 18043008, 'steps': 93973, 'loss/train': 1.2913844585418701} 11/07/2021 10:27:30 - INFO - __main__ - Step 93975: {'lr': 0.00015691392056082162, 'samples': 18043200, 'steps': 93974, 'loss/train': 1.4151484966278076} 11/07/2021 10:27:30 - INFO - __main__ - Step 93976: {'lr': 0.0001569089954206711, 'samples': 18043392, 'steps': 93975, 'loss/train': 1.460153341293335} 11/07/2021 10:27:30 - INFO - __main__ - Step 93977: {'lr': 0.00015690407032246595, 'samples': 18043584, 'steps': 93976, 'loss/train': 0.12990589439868927} 11/07/2021 10:27:31 - INFO - __main__ - Step 93978: {'lr': 0.00015689914526620835, 'samples': 18043776, 'steps': 93977, 'loss/train': 1.434869408607483} 11/07/2021 10:27:31 - INFO - __main__ - Step 93979: {'lr': 0.0001568942202519006, 'samples': 18043968, 'steps': 93978, 'loss/train': 1.2335686683654785} 11/07/2021 10:27:31 - INFO - __main__ - Step 93980: {'lr': 0.00015688929527954488, 'samples': 18044160, 'steps': 93979, 'loss/train': 1.7078793048858643} 11/07/2021 10:27:33 - INFO - __main__ - Step 93981: {'lr': 0.00015688437034914337, 'samples': 18044352, 'steps': 93980, 'loss/train': 1.5085362195968628} 11/07/2021 10:27:33 - INFO - __main__ - Step 93982: {'lr': 0.00015687944546069834, 'samples': 18044544, 'steps': 93981, 'loss/train': 1.567267656326294} 11/07/2021 10:27:33 - INFO - __main__ - Step 93983: {'lr': 0.000156874520614212, 'samples': 18044736, 'steps': 93982, 'loss/train': 0.7204191088676453} 11/07/2021 10:27:34 - INFO - __main__ - Step 93984: {'lr': 0.00015686959580968668, 'samples': 18044928, 'steps': 93983, 'loss/train': 1.0101122856140137} 11/07/2021 10:27:34 - INFO - __main__ - Step 93985: {'lr': 0.00015686467104712438, 'samples': 18045120, 'steps': 93984, 'loss/train': 0.8665518164634705} 11/07/2021 10:27:35 - INFO - __main__ - Step 93986: {'lr': 0.00015685974632652738, 'samples': 18045312, 'steps': 93985, 'loss/train': 1.522761583328247} 11/07/2021 10:27:35 - INFO - __main__ - Step 93987: {'lr': 0.000156854821647898, 'samples': 18045504, 'steps': 93986, 'loss/train': 1.465025782585144} 11/07/2021 10:27:36 - INFO - __main__ - Step 93988: {'lr': 0.00015684989701123837, 'samples': 18045696, 'steps': 93987, 'loss/train': 1.083678960800171} 11/07/2021 10:27:36 - INFO - __main__ - Step 93989: {'lr': 0.00015684497241655072, 'samples': 18045888, 'steps': 93988, 'loss/train': 1.3233212232589722} 11/07/2021 10:27:36 - INFO - __main__ - Step 93990: {'lr': 0.00015684004786383732, 'samples': 18046080, 'steps': 93989, 'loss/train': 1.342332124710083} 11/07/2021 10:27:37 - INFO - __main__ - Step 93991: {'lr': 0.00015683512335310036, 'samples': 18046272, 'steps': 93990, 'loss/train': 1.1099847555160522} 11/07/2021 10:27:38 - INFO - __main__ - Step 93992: {'lr': 0.00015683019888434202, 'samples': 18046464, 'steps': 93991, 'loss/train': 1.5068551301956177} 11/07/2021 10:27:38 - INFO - __main__ - Step 93993: {'lr': 0.00015682527445756456, 'samples': 18046656, 'steps': 93992, 'loss/train': 1.403784990310669} 11/07/2021 10:27:38 - INFO - __main__ - Step 93994: {'lr': 0.00015682035007277023, 'samples': 18046848, 'steps': 93993, 'loss/train': 0.138332337141037} 11/07/2021 10:27:39 - INFO - __main__ - Step 93995: {'lr': 0.0001568154257299612, 'samples': 18047040, 'steps': 93994, 'loss/train': 1.2853931188583374} 11/07/2021 10:27:40 - INFO - __main__ - Step 93996: {'lr': 0.00015681050142913965, 'samples': 18047232, 'steps': 93995, 'loss/train': 1.059644341468811} 11/07/2021 10:27:40 - INFO - __main__ - Step 93997: {'lr': 0.00015680557717030803, 'samples': 18047424, 'steps': 93996, 'loss/train': 1.3493304252624512} 11/07/2021 10:27:40 - INFO - __main__ - Step 93998: {'lr': 0.00015680065295346825, 'samples': 18047616, 'steps': 93997, 'loss/train': 1.1617215871810913} 11/07/2021 10:27:41 - INFO - __main__ - Step 93999: {'lr': 0.00015679572877862265, 'samples': 18047808, 'steps': 93998, 'loss/train': 1.4553899765014648} 11/07/2021 10:27:41 - INFO - __main__ - Step 94000: {'lr': 0.00015679080464577345, 'samples': 18048000, 'steps': 93999, 'loss/train': 1.155993103981018} 11/07/2021 10:27:42 - INFO - __main__ - Step 94001: {'lr': 0.00015678588055492287, 'samples': 18048192, 'steps': 94000, 'loss/train': 1.188683032989502} 11/07/2021 10:27:42 - INFO - __main__ - Step 94002: {'lr': 0.00015678095650607316, 'samples': 18048384, 'steps': 94001, 'loss/train': 1.1624504327774048} 11/07/2021 10:27:43 - INFO - __main__ - Step 94003: {'lr': 0.0001567760324992265, 'samples': 18048576, 'steps': 94002, 'loss/train': 1.2847185134887695} 11/07/2021 10:27:43 - INFO - __main__ - Step 94004: {'lr': 0.00015677110853438509, 'samples': 18048768, 'steps': 94003, 'loss/train': 1.0891252756118774} 11/07/2021 10:27:44 - INFO - __main__ - Step 94005: {'lr': 0.00015676618461155122, 'samples': 18048960, 'steps': 94004, 'loss/train': 0.7076224684715271} 11/07/2021 10:27:45 - INFO - __main__ - Step 94006: {'lr': 0.00015676126073072705, 'samples': 18049152, 'steps': 94005, 'loss/train': 1.4163732528686523} 11/07/2021 10:27:45 - INFO - __main__ - Step 94007: {'lr': 0.0001567563368919148, 'samples': 18049344, 'steps': 94006, 'loss/train': 1.7933865785598755} 11/07/2021 10:27:45 - INFO - __main__ - Step 94008: {'lr': 0.00015675141309511677, 'samples': 18049536, 'steps': 94007, 'loss/train': 1.6742693185806274} 11/07/2021 10:27:46 - INFO - __main__ - Step 94009: {'lr': 0.0001567464893403351, 'samples': 18049728, 'steps': 94008, 'loss/train': 1.1230921745300293} 11/07/2021 10:27:46 - INFO - __main__ - Step 94010: {'lr': 0.00015674156562757202, 'samples': 18049920, 'steps': 94009, 'loss/train': 0.8365665674209595} 11/07/2021 10:27:47 - INFO - __main__ - Step 94011: {'lr': 0.0001567366419568298, 'samples': 18050112, 'steps': 94010, 'loss/train': 1.2602617740631104} 11/07/2021 10:27:47 - INFO - __main__ - Step 94012: {'lr': 0.0001567317183281105, 'samples': 18050304, 'steps': 94011, 'loss/train': 1.6011420488357544} 11/07/2021 10:27:48 - INFO - __main__ - Step 94013: {'lr': 0.0001567267947414165, 'samples': 18050496, 'steps': 94012, 'loss/train': 1.1310467720031738} 11/07/2021 10:27:48 - INFO - __main__ - Step 94014: {'lr': 0.00015672187119674996, 'samples': 18050688, 'steps': 94013, 'loss/train': 0.24971330165863037} 11/07/2021 10:27:48 - INFO - __main__ - Step 94015: {'lr': 0.0001567169476941131, 'samples': 18050880, 'steps': 94014, 'loss/train': 1.3660297393798828} 11/07/2021 10:27:49 - INFO - __main__ - Step 94016: {'lr': 0.00015671202423350814, 'samples': 18051072, 'steps': 94015, 'loss/train': 1.0014395713806152} 11/07/2021 10:27:50 - INFO - __main__ - Step 94017: {'lr': 0.0001567071008149373, 'samples': 18051264, 'steps': 94016, 'loss/train': 1.4520419836044312} 11/07/2021 10:27:50 - INFO - __main__ - Step 94018: {'lr': 0.0001567021774384028, 'samples': 18051456, 'steps': 94017, 'loss/train': 1.3416619300842285} 11/07/2021 10:27:51 - INFO - __main__ - Step 94019: {'lr': 0.00015669725410390688, 'samples': 18051648, 'steps': 94018, 'loss/train': 1.2981969118118286} 11/07/2021 10:27:51 - INFO - __main__ - Step 94020: {'lr': 0.0001566923308114518, 'samples': 18051840, 'steps': 94019, 'loss/train': 1.541890025138855} 11/07/2021 10:27:51 - INFO - __main__ - Step 94021: {'lr': 0.0001566874075610396, 'samples': 18052032, 'steps': 94020, 'loss/train': 1.1034562587738037} 11/07/2021 10:27:52 - INFO - __main__ - Step 94022: {'lr': 0.0001566824843526727, 'samples': 18052224, 'steps': 94021, 'loss/train': 1.2133393287658691} 11/07/2021 10:27:53 - INFO - __main__ - Step 94023: {'lr': 0.00015667756118635314, 'samples': 18052416, 'steps': 94022, 'loss/train': 1.39793860912323} 11/07/2021 10:27:53 - INFO - __main__ - Step 94024: {'lr': 0.00015667263806208335, 'samples': 18052608, 'steps': 94023, 'loss/train': 1.2485431432724} 11/07/2021 10:27:53 - INFO - __main__ - Step 94025: {'lr': 0.00015666771497986533, 'samples': 18052800, 'steps': 94024, 'loss/train': 1.0385323762893677} 11/07/2021 10:27:54 - INFO - __main__ - Step 94026: {'lr': 0.00015666279193970146, 'samples': 18052992, 'steps': 94025, 'loss/train': 0.8081356287002563} 11/07/2021 10:27:55 - INFO - __main__ - Step 94027: {'lr': 0.00015665786894159385, 'samples': 18053184, 'steps': 94026, 'loss/train': 1.4595425128936768} 11/07/2021 10:27:55 - INFO - __main__ - Step 94028: {'lr': 0.00015665294598554474, 'samples': 18053376, 'steps': 94027, 'loss/train': 1.234321117401123} 11/07/2021 10:27:55 - INFO - __main__ - Step 94029: {'lr': 0.00015664802307155642, 'samples': 18053568, 'steps': 94028, 'loss/train': 1.1449036598205566} 11/07/2021 10:27:56 - INFO - __main__ - Step 94030: {'lr': 0.00015664310019963105, 'samples': 18053760, 'steps': 94029, 'loss/train': 1.0473726987838745} 11/07/2021 10:27:56 - INFO - __main__ - Step 94031: {'lr': 0.0001566381773697709, 'samples': 18053952, 'steps': 94030, 'loss/train': 1.4360957145690918} 11/07/2021 10:27:57 - INFO - __main__ - Step 94032: {'lr': 0.0001566332545819781, 'samples': 18054144, 'steps': 94031, 'loss/train': 1.436524510383606} 11/07/2021 10:27:58 - INFO - __main__ - Step 94033: {'lr': 0.00015662833183625492, 'samples': 18054336, 'steps': 94032, 'loss/train': 1.20814049243927} 11/07/2021 10:27:58 - INFO - __main__ - Step 94034: {'lr': 0.00015662340913260358, 'samples': 18054528, 'steps': 94033, 'loss/train': 1.3284111022949219} 11/07/2021 10:27:58 - INFO - __main__ - Step 94035: {'lr': 0.00015661848647102627, 'samples': 18054720, 'steps': 94034, 'loss/train': 1.1222279071807861} 11/07/2021 10:27:59 - INFO - __main__ - Step 94036: {'lr': 0.00015661356385152526, 'samples': 18054912, 'steps': 94035, 'loss/train': 0.822299599647522} 11/07/2021 10:28:00 - INFO - __main__ - Step 94037: {'lr': 0.00015660864127410267, 'samples': 18055104, 'steps': 94036, 'loss/train': 1.861153483390808} 11/07/2021 10:28:00 - INFO - __main__ - Step 94038: {'lr': 0.0001566037187387609, 'samples': 18055296, 'steps': 94037, 'loss/train': 1.624773621559143} 11/07/2021 10:28:00 - INFO - __main__ - Step 94039: {'lr': 0.000156598796245502, 'samples': 18055488, 'steps': 94038, 'loss/train': 1.5207798480987549} 11/07/2021 10:28:01 - INFO - __main__ - Step 94040: {'lr': 0.00015659387379432822, 'samples': 18055680, 'steps': 94039, 'loss/train': 1.3590083122253418} 11/07/2021 10:28:01 - INFO - __main__ - Step 94041: {'lr': 0.00015658895138524179, 'samples': 18055872, 'steps': 94040, 'loss/train': 1.0558942556381226} 11/07/2021 10:28:01 - INFO - __main__ - Step 94042: {'lr': 0.000156584029018245, 'samples': 18056064, 'steps': 94041, 'loss/train': 1.1402606964111328} 11/07/2021 10:28:02 - INFO - __main__ - Step 94043: {'lr': 0.00015657910669333996, 'samples': 18056256, 'steps': 94042, 'loss/train': 1.5886794328689575} 11/07/2021 10:28:03 - INFO - __main__ - Step 94044: {'lr': 0.00015657418441052896, 'samples': 18056448, 'steps': 94043, 'loss/train': 1.5796549320220947} 11/07/2021 10:28:03 - INFO - __main__ - Step 94045: {'lr': 0.00015656926216981416, 'samples': 18056640, 'steps': 94044, 'loss/train': 1.6140432357788086} 11/07/2021 10:28:03 - INFO - __main__ - Step 94046: {'lr': 0.0001565643399711978, 'samples': 18056832, 'steps': 94045, 'loss/train': 2.0864815711975098} 11/07/2021 10:28:04 - INFO - __main__ - Step 94047: {'lr': 0.0001565594178146821, 'samples': 18057024, 'steps': 94046, 'loss/train': 1.1348118782043457} 11/07/2021 10:28:05 - INFO - __main__ - Step 94048: {'lr': 0.00015655449570026932, 'samples': 18057216, 'steps': 94047, 'loss/train': 1.390498161315918} 11/07/2021 10:28:05 - INFO - __main__ - Step 94049: {'lr': 0.0001565495736279616, 'samples': 18057408, 'steps': 94048, 'loss/train': 0.8710553050041199} 11/07/2021 10:28:06 - INFO - __main__ - Step 94050: {'lr': 0.0001565446515977612, 'samples': 18057600, 'steps': 94049, 'loss/train': 1.3970751762390137} 11/07/2021 10:28:06 - INFO - __main__ - Step 94051: {'lr': 0.00015653972960967045, 'samples': 18057792, 'steps': 94050, 'loss/train': 1.2285019159317017} 11/07/2021 10:28:06 - INFO - __main__ - Step 94052: {'lr': 0.00015653480766369135, 'samples': 18057984, 'steps': 94051, 'loss/train': 1.5841948986053467} 11/07/2021 10:28:08 - INFO - __main__ - Step 94053: {'lr': 0.0001565298857598263, 'samples': 18058176, 'steps': 94052, 'loss/train': 0.8942039012908936} 11/07/2021 10:28:08 - INFO - __main__ - Step 94054: {'lr': 0.00015652496389807736, 'samples': 18058368, 'steps': 94053, 'loss/train': 1.257347822189331} 11/07/2021 10:28:08 - INFO - __main__ - Step 94055: {'lr': 0.00015652004207844687, 'samples': 18058560, 'steps': 94054, 'loss/train': 1.3214796781539917} 11/07/2021 10:28:09 - INFO - __main__ - Step 94056: {'lr': 0.00015651512030093697, 'samples': 18058752, 'steps': 94055, 'loss/train': 1.4664572477340698} 11/07/2021 10:28:09 - INFO - __main__ - Step 94057: {'lr': 0.00015651019856554994, 'samples': 18058944, 'steps': 94056, 'loss/train': 0.6826241612434387} 11/07/2021 10:28:10 - INFO - __main__ - Step 94058: {'lr': 0.00015650527687228793, 'samples': 18059136, 'steps': 94057, 'loss/train': 1.3878997564315796} 11/07/2021 10:28:10 - INFO - __main__ - Step 94059: {'lr': 0.00015650035522115326, 'samples': 18059328, 'steps': 94058, 'loss/train': 1.9355506896972656} 11/07/2021 10:28:11 - INFO - __main__ - Step 94060: {'lr': 0.00015649543361214804, 'samples': 18059520, 'steps': 94059, 'loss/train': 1.6951442956924438} 11/07/2021 10:28:11 - INFO - __main__ - Step 94061: {'lr': 0.00015649051204527458, 'samples': 18059712, 'steps': 94060, 'loss/train': 1.1774789094924927} 11/07/2021 10:28:11 - INFO - __main__ - Step 94062: {'lr': 0.00015648559052053502, 'samples': 18059904, 'steps': 94061, 'loss/train': 1.5739716291427612} 11/07/2021 10:28:12 - INFO - __main__ - Step 94063: {'lr': 0.00015648066903793163, 'samples': 18060096, 'steps': 94062, 'loss/train': 0.9726495742797852} 11/07/2021 10:28:13 - INFO - __main__ - Step 94064: {'lr': 0.00015647574759746657, 'samples': 18060288, 'steps': 94063, 'loss/train': 1.5318924188613892} 11/07/2021 10:28:13 - INFO - __main__ - Step 94065: {'lr': 0.00015647082619914222, 'samples': 18060480, 'steps': 94064, 'loss/train': 1.3235958814620972} 11/07/2021 10:28:13 - INFO - __main__ - Step 94066: {'lr': 0.0001564659048429606, 'samples': 18060672, 'steps': 94065, 'loss/train': 1.401048183441162} 11/07/2021 10:28:14 - INFO - __main__ - Step 94067: {'lr': 0.00015646098352892394, 'samples': 18060864, 'steps': 94066, 'loss/train': 1.0776307582855225} 11/07/2021 10:28:15 - INFO - __main__ - Step 94068: {'lr': 0.00015645606225703454, 'samples': 18061056, 'steps': 94067, 'loss/train': 1.6204062700271606} 11/07/2021 10:28:15 - INFO - __main__ - Step 94069: {'lr': 0.0001564511410272946, 'samples': 18061248, 'steps': 94068, 'loss/train': 1.718570351600647} 11/07/2021 10:28:15 - INFO - __main__ - Step 94070: {'lr': 0.00015644621983970636, 'samples': 18061440, 'steps': 94069, 'loss/train': 1.574204444885254} 11/07/2021 10:28:16 - INFO - __main__ - Step 94071: {'lr': 0.00015644129869427198, 'samples': 18061632, 'steps': 94070, 'loss/train': 1.0836347341537476} 11/07/2021 10:28:16 - INFO - __main__ - Step 94072: {'lr': 0.00015643637759099371, 'samples': 18061824, 'steps': 94071, 'loss/train': 1.6669679880142212} 11/07/2021 10:28:17 - INFO - __main__ - Step 94073: {'lr': 0.00015643145652987375, 'samples': 18062016, 'steps': 94072, 'loss/train': 0.837073564529419} 11/07/2021 10:28:18 - INFO - __main__ - Step 94074: {'lr': 0.00015642653551091435, 'samples': 18062208, 'steps': 94073, 'loss/train': 1.2453629970550537} 11/07/2021 10:28:18 - INFO - __main__ - Step 94075: {'lr': 0.00015642161453411772, 'samples': 18062400, 'steps': 94074, 'loss/train': 1.546690583229065} 11/07/2021 10:28:18 - INFO - __main__ - Step 94076: {'lr': 0.00015641669359948605, 'samples': 18062592, 'steps': 94075, 'loss/train': 1.3104852437973022} 11/07/2021 10:28:19 - INFO - __main__ - Step 94077: {'lr': 0.00015641177270702157, 'samples': 18062784, 'steps': 94076, 'loss/train': 1.3420689105987549} 11/07/2021 10:28:20 - INFO - __main__ - Step 94078: {'lr': 0.0001564068518567266, 'samples': 18062976, 'steps': 94077, 'loss/train': 1.0025644302368164} 11/07/2021 10:28:20 - INFO - __main__ - Step 94079: {'lr': 0.00015640193104860317, 'samples': 18063168, 'steps': 94078, 'loss/train': 0.6137741804122925} 11/07/2021 10:28:20 - INFO - __main__ - Step 94080: {'lr': 0.00015639701028265357, 'samples': 18063360, 'steps': 94079, 'loss/train': 1.1707240343093872} 11/07/2021 10:28:21 - INFO - __main__ - Step 94081: {'lr': 0.00015639208955888008, 'samples': 18063552, 'steps': 94080, 'loss/train': 1.2495585680007935} 11/07/2021 10:28:21 - INFO - __main__ - Step 94082: {'lr': 0.00015638716887728482, 'samples': 18063744, 'steps': 94081, 'loss/train': 1.6734542846679688} 11/07/2021 10:28:22 - INFO - __main__ - Step 94083: {'lr': 0.00015638224823787006, 'samples': 18063936, 'steps': 94082, 'loss/train': 1.0378875732421875} 11/07/2021 10:28:23 - INFO - __main__ - Step 94084: {'lr': 0.00015637732764063806, 'samples': 18064128, 'steps': 94083, 'loss/train': 1.308734655380249} 11/07/2021 10:28:23 - INFO - __main__ - Step 94085: {'lr': 0.00015637240708559093, 'samples': 18064320, 'steps': 94084, 'loss/train': 1.2832295894622803} 11/07/2021 10:28:23 - INFO - __main__ - Step 94086: {'lr': 0.00015636748657273098, 'samples': 18064512, 'steps': 94085, 'loss/train': 1.5986496210098267} 11/07/2021 10:28:24 - INFO - __main__ - Step 94087: {'lr': 0.0001563625661020604, 'samples': 18064704, 'steps': 94086, 'loss/train': 1.1508210897445679} 11/07/2021 10:28:24 - INFO - __main__ - Step 94088: {'lr': 0.0001563576456735814, 'samples': 18064896, 'steps': 94087, 'loss/train': 1.2114235162734985} 11/07/2021 10:28:25 - INFO - __main__ - Step 94089: {'lr': 0.0001563527252872962, 'samples': 18065088, 'steps': 94088, 'loss/train': 0.7022836804389954} 11/07/2021 10:28:25 - INFO - __main__ - Step 94090: {'lr': 0.000156347804943207, 'samples': 18065280, 'steps': 94089, 'loss/train': 1.1149137020111084} 11/07/2021 10:28:26 - INFO - __main__ - Step 94091: {'lr': 0.00015634288464131614, 'samples': 18065472, 'steps': 94090, 'loss/train': 0.29197660088539124} 11/07/2021 10:28:26 - INFO - __main__ - Step 94092: {'lr': 0.00015633796438162565, 'samples': 18065664, 'steps': 94091, 'loss/train': 1.3787888288497925} 11/07/2021 10:28:26 - INFO - __main__ - Step 94093: {'lr': 0.0001563330441641378, 'samples': 18065856, 'steps': 94092, 'loss/train': 1.4861682653427124} 11/07/2021 10:28:28 - INFO - __main__ - Step 94094: {'lr': 0.00015632812398885487, 'samples': 18066048, 'steps': 94093, 'loss/train': 1.8430514335632324} 11/07/2021 10:28:28 - INFO - __main__ - Step 94095: {'lr': 0.00015632320385577903, 'samples': 18066240, 'steps': 94094, 'loss/train': 1.5049670934677124} 11/07/2021 10:28:28 - INFO - __main__ - Step 94096: {'lr': 0.00015631828376491246, 'samples': 18066432, 'steps': 94095, 'loss/train': 1.260998010635376} 11/07/2021 10:28:29 - INFO - __main__ - Step 94097: {'lr': 0.0001563133637162575, 'samples': 18066624, 'steps': 94096, 'loss/train': 1.6005651950836182} 11/07/2021 10:28:29 - INFO - __main__ - Step 94098: {'lr': 0.00015630844370981623, 'samples': 18066816, 'steps': 94097, 'loss/train': 1.182023525238037} 11/07/2021 10:28:29 - INFO - __main__ - Step 94099: {'lr': 0.00015630352374559098, 'samples': 18067008, 'steps': 94098, 'loss/train': 2.2051796913146973} 11/07/2021 10:28:30 - INFO - __main__ - Step 94100: {'lr': 0.00015629860382358388, 'samples': 18067200, 'steps': 94099, 'loss/train': 1.6892611980438232} 11/07/2021 10:28:31 - INFO - __main__ - Step 94101: {'lr': 0.0001562936839437972, 'samples': 18067392, 'steps': 94100, 'loss/train': 1.4122458696365356} 11/07/2021 10:28:31 - INFO - __main__ - Step 94102: {'lr': 0.00015628876410623315, 'samples': 18067584, 'steps': 94101, 'loss/train': 1.5505173206329346} 11/07/2021 10:28:32 - INFO - __main__ - Step 94103: {'lr': 0.00015628384431089394, 'samples': 18067776, 'steps': 94102, 'loss/train': 1.3477402925491333} 11/07/2021 10:28:32 - INFO - __main__ - Step 94104: {'lr': 0.00015627892455778174, 'samples': 18067968, 'steps': 94103, 'loss/train': 0.08951704949140549} 11/07/2021 10:28:33 - INFO - __main__ - Step 94105: {'lr': 0.00015627400484689895, 'samples': 18068160, 'steps': 94104, 'loss/train': 1.6605346202850342} 11/07/2021 10:28:33 - INFO - __main__ - Step 94106: {'lr': 0.00015626908517824754, 'samples': 18068352, 'steps': 94105, 'loss/train': 1.5883327722549438} 11/07/2021 10:28:34 - INFO - __main__ - Step 94107: {'lr': 0.00015626416555182982, 'samples': 18068544, 'steps': 94106, 'loss/train': 1.1626933813095093} 11/07/2021 10:28:34 - INFO - __main__ - Step 94108: {'lr': 0.000156259245967648, 'samples': 18068736, 'steps': 94107, 'loss/train': 1.496869683265686} 11/07/2021 10:28:34 - INFO - __main__ - Step 94109: {'lr': 0.00015625432642570435, 'samples': 18068928, 'steps': 94108, 'loss/train': 1.9893238544464111} 11/07/2021 10:28:35 - INFO - __main__ - Step 94110: {'lr': 0.0001562494069260011, 'samples': 18069120, 'steps': 94109, 'loss/train': 1.528653621673584} 11/07/2021 10:28:36 - INFO - __main__ - Step 94111: {'lr': 0.00015624448746854038, 'samples': 18069312, 'steps': 94110, 'loss/train': 0.761365532875061} 11/07/2021 10:28:36 - INFO - __main__ - Step 94112: {'lr': 0.0001562395680533244, 'samples': 18069504, 'steps': 94111, 'loss/train': 1.6985270977020264} 11/07/2021 10:28:37 - INFO - __main__ - Step 94113: {'lr': 0.00015623464868035547, 'samples': 18069696, 'steps': 94112, 'loss/train': 1.5147966146469116} 11/07/2021 10:28:37 - INFO - __main__ - Step 94114: {'lr': 0.00015622972934963575, 'samples': 18069888, 'steps': 94113, 'loss/train': 1.0716454982757568} 11/07/2021 10:28:38 - INFO - __main__ - Step 94115: {'lr': 0.00015622481006116748, 'samples': 18070080, 'steps': 94114, 'loss/train': 1.532805323600769} 11/07/2021 10:28:38 - INFO - __main__ - Step 94116: {'lr': 0.00015621989081495287, 'samples': 18070272, 'steps': 94115, 'loss/train': 0.9033555388450623} 11/07/2021 10:28:39 - INFO - __main__ - Step 94117: {'lr': 0.0001562149716109941, 'samples': 18070464, 'steps': 94116, 'loss/train': 1.6699795722961426} 11/07/2021 10:28:39 - INFO - __main__ - Step 94118: {'lr': 0.00015621005244929355, 'samples': 18070656, 'steps': 94117, 'loss/train': 1.3023916482925415} 11/07/2021 10:28:39 - INFO - __main__ - Step 94119: {'lr': 0.00015620513332985315, 'samples': 18070848, 'steps': 94118, 'loss/train': 1.030634880065918} 11/07/2021 10:28:40 - INFO - __main__ - Step 94120: {'lr': 0.00015620021425267534, 'samples': 18071040, 'steps': 94119, 'loss/train': 0.8447486162185669} 11/07/2021 10:28:41 - INFO - __main__ - Step 94121: {'lr': 0.00015619529521776221, 'samples': 18071232, 'steps': 94120, 'loss/train': 1.0995794534683228} 11/07/2021 10:28:41 - INFO - __main__ - Step 94122: {'lr': 0.00015619037622511606, 'samples': 18071424, 'steps': 94121, 'loss/train': 1.2814899682998657} 11/07/2021 10:28:41 - INFO - __main__ - Step 94123: {'lr': 0.00015618545727473905, 'samples': 18071616, 'steps': 94122, 'loss/train': 0.6058156490325928} 11/07/2021 10:28:42 - INFO - __main__ - Step 94124: {'lr': 0.00015618053836663346, 'samples': 18071808, 'steps': 94123, 'loss/train': 1.5807870626449585} 11/07/2021 10:28:43 - INFO - __main__ - Step 94125: {'lr': 0.00015617561950080145, 'samples': 18072000, 'steps': 94124, 'loss/train': 1.0967170000076294} 11/07/2021 10:28:43 - INFO - __main__ - Step 94126: {'lr': 0.00015617070067724525, 'samples': 18072192, 'steps': 94125, 'loss/train': 1.1366636753082275} 11/07/2021 10:28:43 - INFO - __main__ - Step 94127: {'lr': 0.00015616578189596713, 'samples': 18072384, 'steps': 94126, 'loss/train': 1.5123400688171387} 11/07/2021 10:28:44 - INFO - __main__ - Step 94128: {'lr': 0.0001561608631569692, 'samples': 18072576, 'steps': 94127, 'loss/train': 1.141573429107666} 11/07/2021 10:28:44 - INFO - __main__ - Step 94129: {'lr': 0.00015615594446025376, 'samples': 18072768, 'steps': 94128, 'loss/train': 1.7496733665466309} 11/07/2021 10:28:45 - INFO - __main__ - Step 94130: {'lr': 0.00015615102580582302, 'samples': 18072960, 'steps': 94129, 'loss/train': 0.8007218241691589} 11/07/2021 10:28:46 - INFO - __main__ - Step 94131: {'lr': 0.0001561461071936792, 'samples': 18073152, 'steps': 94130, 'loss/train': 1.6037652492523193} 11/07/2021 10:28:46 - INFO - __main__ - Step 94132: {'lr': 0.00015614118862382456, 'samples': 18073344, 'steps': 94131, 'loss/train': 1.249565839767456} 11/07/2021 10:28:46 - INFO - __main__ - Step 94133: {'lr': 0.00015613627009626116, 'samples': 18073536, 'steps': 94132, 'loss/train': 1.2699618339538574} 11/07/2021 10:28:47 - INFO - __main__ - Step 94134: {'lr': 0.0001561313516109913, 'samples': 18073728, 'steps': 94133, 'loss/train': 1.5189520120620728} 11/07/2021 10:28:47 - INFO - __main__ - Step 94135: {'lr': 0.00015612643316801722, 'samples': 18073920, 'steps': 94134, 'loss/train': 1.5347316265106201} 11/07/2021 10:28:48 - INFO - __main__ - Step 94136: {'lr': 0.0001561215147673411, 'samples': 18074112, 'steps': 94135, 'loss/train': 1.4577456712722778} 11/07/2021 10:28:49 - INFO - __main__ - Step 94137: {'lr': 0.0001561165964089652, 'samples': 18074304, 'steps': 94136, 'loss/train': 1.0860481262207031} 11/07/2021 10:28:49 - INFO - __main__ - Step 94138: {'lr': 0.0001561116780928917, 'samples': 18074496, 'steps': 94137, 'loss/train': 1.2265830039978027} 11/07/2021 10:28:49 - INFO - __main__ - Step 94139: {'lr': 0.00015610675981912283, 'samples': 18074688, 'steps': 94138, 'loss/train': 0.09231266379356384} 11/07/2021 10:28:50 - INFO - __main__ - Step 94140: {'lr': 0.00015610184158766082, 'samples': 18074880, 'steps': 94139, 'loss/train': 1.1233749389648438} 11/07/2021 10:28:51 - INFO - __main__ - Step 94141: {'lr': 0.00015609692339850785, 'samples': 18075072, 'steps': 94140, 'loss/train': 1.5526725053787231} 11/07/2021 10:28:51 - INFO - __main__ - Step 94142: {'lr': 0.00015609200525166616, 'samples': 18075264, 'steps': 94141, 'loss/train': 1.0958524942398071} 11/07/2021 10:28:51 - INFO - __main__ - Step 94143: {'lr': 0.000156087087147138, 'samples': 18075456, 'steps': 94142, 'loss/train': 1.5862771272659302} 11/07/2021 10:28:52 - INFO - __main__ - Step 94144: {'lr': 0.00015608216908492555, 'samples': 18075648, 'steps': 94143, 'loss/train': 1.6878626346588135} 11/07/2021 10:28:52 - INFO - __main__ - Step 94145: {'lr': 0.00015607725106503103, 'samples': 18075840, 'steps': 94144, 'loss/train': 1.2516883611679077} 11/07/2021 10:28:53 - INFO - __main__ - Step 94146: {'lr': 0.00015607233308745662, 'samples': 18076032, 'steps': 94145, 'loss/train': 0.7286818027496338} 11/07/2021 10:28:54 - INFO - __main__ - Step 94147: {'lr': 0.00015606741515220457, 'samples': 18076224, 'steps': 94146, 'loss/train': 1.092632532119751} 11/07/2021 10:28:54 - INFO - __main__ - Step 94148: {'lr': 0.00015606249725927707, 'samples': 18076416, 'steps': 94147, 'loss/train': 1.5735630989074707} 11/07/2021 10:28:54 - INFO - __main__ - Step 94149: {'lr': 0.00015605757940867637, 'samples': 18076608, 'steps': 94148, 'loss/train': 1.4590760469436646} 11/07/2021 10:28:55 - INFO - __main__ - Step 94150: {'lr': 0.00015605266160040467, 'samples': 18076800, 'steps': 94149, 'loss/train': 0.9478268027305603} 11/07/2021 10:28:55 - INFO - __main__ - Step 94151: {'lr': 0.00015604774383446422, 'samples': 18076992, 'steps': 94150, 'loss/train': 2.3773751258850098} 11/07/2021 10:28:56 - INFO - __main__ - Step 94152: {'lr': 0.0001560428261108572, 'samples': 18077184, 'steps': 94151, 'loss/train': 1.4411553144454956} 11/07/2021 10:28:56 - INFO - __main__ - Step 94153: {'lr': 0.00015603790842958582, 'samples': 18077376, 'steps': 94152, 'loss/train': 1.4974771738052368} 11/07/2021 10:28:57 - INFO - __main__ - Step 94154: {'lr': 0.0001560329907906523, 'samples': 18077568, 'steps': 94153, 'loss/train': 1.3003920316696167} 11/07/2021 10:28:57 - INFO - __main__ - Step 94155: {'lr': 0.00015602807319405892, 'samples': 18077760, 'steps': 94154, 'loss/train': 1.2077521085739136} 11/07/2021 10:28:58 - INFO - __main__ - Step 94156: {'lr': 0.0001560231556398078, 'samples': 18077952, 'steps': 94155, 'loss/train': 1.2568470239639282} 11/07/2021 10:28:58 - INFO - __main__ - Step 94157: {'lr': 0.00015601823812790117, 'samples': 18078144, 'steps': 94156, 'loss/train': 1.524804711341858} 11/07/2021 10:28:59 - INFO - __main__ - Step 94158: {'lr': 0.00015601332065834128, 'samples': 18078336, 'steps': 94157, 'loss/train': 1.2073696851730347} 11/07/2021 10:28:59 - INFO - __main__ - Step 94159: {'lr': 0.0001560084032311304, 'samples': 18078528, 'steps': 94158, 'loss/train': 1.0057995319366455} 11/07/2021 10:28:59 - INFO - __main__ - Step 94160: {'lr': 0.00015600348584627068, 'samples': 18078720, 'steps': 94159, 'loss/train': 1.5894598960876465} 11/07/2021 10:29:00 - INFO - __main__ - Step 94161: {'lr': 0.00015599856850376427, 'samples': 18078912, 'steps': 94160, 'loss/train': 1.2661770582199097} 11/07/2021 10:29:01 - INFO - __main__ - Step 94162: {'lr': 0.00015599365120361346, 'samples': 18079104, 'steps': 94161, 'loss/train': 1.4062339067459106} 11/07/2021 10:29:01 - INFO - __main__ - Step 94163: {'lr': 0.00015598873394582046, 'samples': 18079296, 'steps': 94162, 'loss/train': 1.3988823890686035} 11/07/2021 10:29:01 - INFO - __main__ - Step 94164: {'lr': 0.00015598381673038753, 'samples': 18079488, 'steps': 94163, 'loss/train': 1.7735835313796997} 11/07/2021 10:29:02 - INFO - __main__ - Step 94165: {'lr': 0.00015597889955731682, 'samples': 18079680, 'steps': 94164, 'loss/train': 1.3053643703460693} 11/07/2021 10:29:02 - INFO - __main__ - Step 94166: {'lr': 0.00015597398242661058, 'samples': 18079872, 'steps': 94165, 'loss/train': 1.2527071237564087} 11/07/2021 10:29:03 - INFO - __main__ - Step 94167: {'lr': 0.00015596906533827098, 'samples': 18080064, 'steps': 94166, 'loss/train': 1.4439404010772705} 11/07/2021 10:29:04 - INFO - __main__ - Step 94168: {'lr': 0.0001559641482923003, 'samples': 18080256, 'steps': 94167, 'loss/train': 1.576241135597229} 11/07/2021 10:29:04 - INFO - __main__ - Step 94169: {'lr': 0.0001559592312887007, 'samples': 18080448, 'steps': 94168, 'loss/train': 1.5766148567199707} 11/07/2021 10:29:04 - INFO - __main__ - Step 94170: {'lr': 0.00015595431432747443, 'samples': 18080640, 'steps': 94169, 'loss/train': 1.1971951723098755} 11/07/2021 10:29:05 - INFO - __main__ - Step 94171: {'lr': 0.00015594939740862368, 'samples': 18080832, 'steps': 94170, 'loss/train': 1.0684678554534912} 11/07/2021 10:29:06 - INFO - __main__ - Step 94172: {'lr': 0.00015594448053215073, 'samples': 18081024, 'steps': 94171, 'loss/train': 1.553813099861145} 11/07/2021 10:29:06 - INFO - __main__ - Step 94173: {'lr': 0.00015593956369805773, 'samples': 18081216, 'steps': 94172, 'loss/train': 0.8312493562698364} 11/07/2021 10:29:06 - INFO - __main__ - Step 94174: {'lr': 0.00015593464690634687, 'samples': 18081408, 'steps': 94173, 'loss/train': 1.118217945098877} 11/07/2021 10:29:07 - INFO - __main__ - Step 94175: {'lr': 0.00015592973015702042, 'samples': 18081600, 'steps': 94174, 'loss/train': 1.2684319019317627} 11/07/2021 10:29:07 - INFO - __main__ - Step 94176: {'lr': 0.00015592481345008064, 'samples': 18081792, 'steps': 94175, 'loss/train': 1.5556507110595703} 11/07/2021 10:29:08 - INFO - __main__ - Step 94177: {'lr': 0.00015591989678552962, 'samples': 18081984, 'steps': 94176, 'loss/train': 1.5228573083877563} 11/07/2021 10:29:08 - INFO - __main__ - Step 94178: {'lr': 0.00015591498016336966, 'samples': 18082176, 'steps': 94177, 'loss/train': 1.6498432159423828} 11/07/2021 10:29:09 - INFO - __main__ - Step 94179: {'lr': 0.00015591006358360294, 'samples': 18082368, 'steps': 94178, 'loss/train': 0.9181137681007385} 11/07/2021 10:29:09 - INFO - __main__ - Step 94180: {'lr': 0.0001559051470462317, 'samples': 18082560, 'steps': 94179, 'loss/train': 1.712214469909668} 11/07/2021 10:29:10 - INFO - __main__ - Step 94181: {'lr': 0.00015590023055125817, 'samples': 18082752, 'steps': 94180, 'loss/train': 1.2135223150253296} 11/07/2021 10:29:10 - INFO - __main__ - Step 94182: {'lr': 0.0001558953140986845, 'samples': 18082944, 'steps': 94181, 'loss/train': 1.4483124017715454} 11/07/2021 10:29:11 - INFO - __main__ - Step 94183: {'lr': 0.00015589039768851298, 'samples': 18083136, 'steps': 94182, 'loss/train': 1.3850321769714355} 11/07/2021 10:29:11 - INFO - __main__ - Step 94184: {'lr': 0.00015588548132074582, 'samples': 18083328, 'steps': 94183, 'loss/train': 1.1945925951004028} 11/07/2021 10:29:12 - INFO - __main__ - Step 94185: {'lr': 0.00015588056499538517, 'samples': 18083520, 'steps': 94184, 'loss/train': 1.3215761184692383} 11/07/2021 10:29:12 - INFO - __main__ - Step 94186: {'lr': 0.00015587564871243333, 'samples': 18083712, 'steps': 94185, 'loss/train': 1.6098244190216064} 11/07/2021 10:29:12 - INFO - __main__ - Step 94187: {'lr': 0.0001558707324718925, 'samples': 18083904, 'steps': 94186, 'loss/train': 1.2086750268936157} 11/07/2021 10:29:13 - INFO - __main__ - Step 94188: {'lr': 0.00015586581627376482, 'samples': 18084096, 'steps': 94187, 'loss/train': 1.6782516241073608} 11/07/2021 10:29:14 - INFO - __main__ - Step 94189: {'lr': 0.00015586090011805254, 'samples': 18084288, 'steps': 94188, 'loss/train': 1.2272822856903076} 11/07/2021 10:29:14 - INFO - __main__ - Step 94190: {'lr': 0.00015585598400475788, 'samples': 18084480, 'steps': 94189, 'loss/train': 0.949556827545166} 11/07/2021 10:29:14 - INFO - __main__ - Step 94191: {'lr': 0.00015585106793388303, 'samples': 18084672, 'steps': 94190, 'loss/train': 1.422806739807129} 11/07/2021 10:29:15 - INFO - __main__ - Step 94192: {'lr': 0.00015584615190543028, 'samples': 18084864, 'steps': 94191, 'loss/train': 1.6589362621307373} 11/07/2021 10:29:16 - INFO - __main__ - Step 94193: {'lr': 0.00015584123591940179, 'samples': 18085056, 'steps': 94192, 'loss/train': 1.270896315574646} 11/07/2021 10:29:16 - INFO - __main__ - Step 94194: {'lr': 0.00015583631997579978, 'samples': 18085248, 'steps': 94193, 'loss/train': 1.3763431310653687} 11/07/2021 10:29:16 - INFO - __main__ - Step 94195: {'lr': 0.00015583140407462648, 'samples': 18085440, 'steps': 94194, 'loss/train': 1.2198455333709717} 11/07/2021 10:29:17 - INFO - __main__ - Step 94196: {'lr': 0.00015582648821588408, 'samples': 18085632, 'steps': 94195, 'loss/train': 1.8085401058197021} 11/07/2021 10:29:17 - INFO - __main__ - Step 94197: {'lr': 0.0001558215723995748, 'samples': 18085824, 'steps': 94196, 'loss/train': 1.4476770162582397} 11/07/2021 10:29:18 - INFO - __main__ - Step 94198: {'lr': 0.00015581665662570092, 'samples': 18086016, 'steps': 94197, 'loss/train': 0.740347683429718} 11/07/2021 10:29:19 - INFO - __main__ - Step 94199: {'lr': 0.00015581174089426464, 'samples': 18086208, 'steps': 94198, 'loss/train': 1.0008996725082397} 11/07/2021 10:29:19 - INFO - __main__ - Step 94200: {'lr': 0.00015580682520526806, 'samples': 18086400, 'steps': 94199, 'loss/train': 1.3224188089370728} 11/07/2021 10:29:19 - INFO - __main__ - Step 94201: {'lr': 0.00015580190955871348, 'samples': 18086592, 'steps': 94200, 'loss/train': 1.089693307876587} 11/07/2021 10:29:20 - INFO - __main__ - Step 94202: {'lr': 0.00015579699395460312, 'samples': 18086784, 'steps': 94201, 'loss/train': 1.4565012454986572} 11/07/2021 10:29:21 - INFO - __main__ - Step 94203: {'lr': 0.00015579207839293917, 'samples': 18086976, 'steps': 94202, 'loss/train': 1.553073763847351} 11/07/2021 10:29:21 - INFO - __main__ - Step 94204: {'lr': 0.00015578716287372384, 'samples': 18087168, 'steps': 94203, 'loss/train': 1.5959093570709229} 11/07/2021 10:29:21 - INFO - __main__ - Step 94205: {'lr': 0.00015578224739695937, 'samples': 18087360, 'steps': 94204, 'loss/train': 1.382852554321289} 11/07/2021 10:29:22 - INFO - __main__ - Step 94206: {'lr': 0.00015577733196264795, 'samples': 18087552, 'steps': 94205, 'loss/train': 1.6461896896362305} 11/07/2021 10:29:22 - INFO - __main__ - Step 94207: {'lr': 0.00015577241657079184, 'samples': 18087744, 'steps': 94206, 'loss/train': 1.3894344568252563} 11/07/2021 10:29:23 - INFO - __main__ - Step 94208: {'lr': 0.0001557675012213932, 'samples': 18087936, 'steps': 94207, 'loss/train': 2.046487331390381} 11/07/2021 10:29:23 - INFO - __main__ - Step 94209: {'lr': 0.00015576258591445431, 'samples': 18088128, 'steps': 94208, 'loss/train': 1.3821830749511719} 11/07/2021 10:29:24 - INFO - __main__ - Step 94210: {'lr': 0.00015575767064997728, 'samples': 18088320, 'steps': 94209, 'loss/train': 1.3124724626541138} 11/07/2021 10:29:24 - INFO - __main__ - Step 94211: {'lr': 0.00015575275542796443, 'samples': 18088512, 'steps': 94210, 'loss/train': 5.121862888336182} 11/07/2021 10:29:25 - INFO - __main__ - Step 94212: {'lr': 0.00015574784024841804, 'samples': 18088704, 'steps': 94211, 'loss/train': 1.4704170227050781} 11/07/2021 10:29:25 - INFO - __main__ - Step 94213: {'lr': 0.00015574292511134007, 'samples': 18088896, 'steps': 94212, 'loss/train': 1.3244106769561768} 11/07/2021 10:29:26 - INFO - __main__ - Step 94214: {'lr': 0.00015573801001673293, 'samples': 18089088, 'steps': 94213, 'loss/train': 2.6933014392852783} 11/07/2021 10:29:26 - INFO - __main__ - Step 94215: {'lr': 0.00015573309496459881, 'samples': 18089280, 'steps': 94214, 'loss/train': 1.146398663520813} 11/07/2021 10:29:26 - INFO - __main__ - Step 94216: {'lr': 0.00015572817995493986, 'samples': 18089472, 'steps': 94215, 'loss/train': 0.8750593662261963} 11/07/2021 10:29:27 - INFO - __main__ - Step 94217: {'lr': 0.00015572326498775835, 'samples': 18089664, 'steps': 94216, 'loss/train': 1.4270069599151611} 11/07/2021 10:29:27 - INFO - __main__ - Step 94218: {'lr': 0.00015571835006305645, 'samples': 18089856, 'steps': 94217, 'loss/train': 1.574482798576355} 11/07/2021 10:29:28 - INFO - __main__ - Step 94219: {'lr': 0.00015571343518083647, 'samples': 18090048, 'steps': 94218, 'loss/train': 1.2150925397872925} 11/07/2021 10:29:29 - INFO - __main__ - Step 94220: {'lr': 0.0001557085203411005, 'samples': 18090240, 'steps': 94219, 'loss/train': 3.7218847274780273} 11/07/2021 10:29:29 - INFO - __main__ - Step 94221: {'lr': 0.00015570360554385089, 'samples': 18090432, 'steps': 94220, 'loss/train': 1.7215831279754639} 11/07/2021 10:29:29 - INFO - __main__ - Step 94222: {'lr': 0.0001556986907890897, 'samples': 18090624, 'steps': 94221, 'loss/train': 1.2096772193908691} 11/07/2021 10:29:30 - INFO - __main__ - Step 94223: {'lr': 0.00015569377607681928, 'samples': 18090816, 'steps': 94222, 'loss/train': 1.5506367683410645} 11/07/2021 10:29:31 - INFO - __main__ - Step 94224: {'lr': 0.00015568886140704174, 'samples': 18091008, 'steps': 94223, 'loss/train': 1.3696402311325073} 11/07/2021 10:29:31 - INFO - __main__ - Step 94225: {'lr': 0.00015568394677975938, 'samples': 18091200, 'steps': 94224, 'loss/train': 0.5082774758338928} 11/07/2021 10:29:31 - INFO - __main__ - Step 94226: {'lr': 0.00015567903219497448, 'samples': 18091392, 'steps': 94225, 'loss/train': 1.5680100917816162} 11/07/2021 10:29:32 - INFO - __main__ - Step 94227: {'lr': 0.00015567411765268904, 'samples': 18091584, 'steps': 94226, 'loss/train': 1.8419984579086304} 11/07/2021 10:29:32 - INFO - __main__ - Step 94228: {'lr': 0.0001556692031529054, 'samples': 18091776, 'steps': 94227, 'loss/train': 0.7664121389389038} 11/07/2021 10:29:33 - INFO - __main__ - Step 94229: {'lr': 0.00015566428869562575, 'samples': 18091968, 'steps': 94228, 'loss/train': 1.2160968780517578} 11/07/2021 10:29:34 - INFO - __main__ - Step 94230: {'lr': 0.00015565937428085232, 'samples': 18092160, 'steps': 94229, 'loss/train': 1.8065526485443115} 11/07/2021 10:29:34 - INFO - __main__ - Step 94231: {'lr': 0.00015565445990858729, 'samples': 18092352, 'steps': 94230, 'loss/train': 1.405224084854126} 11/07/2021 10:29:34 - INFO - __main__ - Step 94232: {'lr': 0.00015564954557883292, 'samples': 18092544, 'steps': 94231, 'loss/train': 1.4123419523239136} 11/07/2021 10:29:35 - INFO - __main__ - Step 94233: {'lr': 0.00015564463129159146, 'samples': 18092736, 'steps': 94232, 'loss/train': 1.1858956813812256} 11/07/2021 10:29:36 - INFO - __main__ - Step 94234: {'lr': 0.00015563971704686503, 'samples': 18092928, 'steps': 94233, 'loss/train': 1.488772988319397} 11/07/2021 10:29:36 - INFO - __main__ - Step 94235: {'lr': 0.00015563480284465586, 'samples': 18093120, 'steps': 94234, 'loss/train': 1.5682896375656128} 11/07/2021 10:29:36 - INFO - __main__ - Step 94236: {'lr': 0.00015562988868496626, 'samples': 18093312, 'steps': 94235, 'loss/train': 1.2862911224365234} 11/07/2021 10:29:37 - INFO - __main__ - Step 94237: {'lr': 0.00015562497456779833, 'samples': 18093504, 'steps': 94236, 'loss/train': 1.0005046129226685} 11/07/2021 10:29:37 - INFO - __main__ - Step 94238: {'lr': 0.00015562006049315433, 'samples': 18093696, 'steps': 94237, 'loss/train': 1.419135332107544} 11/07/2021 10:29:38 - INFO - __main__ - Step 94239: {'lr': 0.00015561514646103658, 'samples': 18093888, 'steps': 94238, 'loss/train': 1.095747947692871} 11/07/2021 10:29:38 - INFO - __main__ - Step 94240: {'lr': 0.0001556102324714471, 'samples': 18094080, 'steps': 94239, 'loss/train': 1.409970760345459} 11/07/2021 10:29:39 - INFO - __main__ - Step 94241: {'lr': 0.0001556053185243882, 'samples': 18094272, 'steps': 94240, 'loss/train': 1.3866698741912842} 11/07/2021 10:29:39 - INFO - __main__ - Step 94242: {'lr': 0.00015560040461986204, 'samples': 18094464, 'steps': 94241, 'loss/train': 1.417618751525879} 11/07/2021 10:29:39 - INFO - __main__ - Step 94243: {'lr': 0.00015559549075787094, 'samples': 18094656, 'steps': 94242, 'loss/train': 1.8549115657806396} 11/07/2021 10:29:40 - INFO - __main__ - Step 94244: {'lr': 0.000155590576938417, 'samples': 18094848, 'steps': 94243, 'loss/train': 1.2378954887390137} 11/07/2021 10:29:41 - INFO - __main__ - Step 94245: {'lr': 0.00015558566316150251, 'samples': 18095040, 'steps': 94244, 'loss/train': 1.0947202444076538} 11/07/2021 10:29:41 - INFO - __main__ - Step 94246: {'lr': 0.0001555807494271297, 'samples': 18095232, 'steps': 94245, 'loss/train': 0.9918766021728516} 11/07/2021 10:29:41 - INFO - __main__ - Step 94247: {'lr': 0.0001555758357353007, 'samples': 18095424, 'steps': 94246, 'loss/train': 1.172485113143921} 11/07/2021 10:29:42 - INFO - __main__ - Step 94248: {'lr': 0.00015557092208601781, 'samples': 18095616, 'steps': 94247, 'loss/train': 1.6776164770126343} 11/07/2021 10:29:42 - INFO - __main__ - Step 94249: {'lr': 0.0001555660084792832, 'samples': 18095808, 'steps': 94248, 'loss/train': 1.083796501159668} 11/07/2021 10:29:43 - INFO - __main__ - Step 94250: {'lr': 0.00015556109491509908, 'samples': 18096000, 'steps': 94249, 'loss/train': 1.150508165359497} 11/07/2021 10:29:44 - INFO - __main__ - Step 94251: {'lr': 0.00015555618139346763, 'samples': 18096192, 'steps': 94250, 'loss/train': 1.158359408378601} 11/07/2021 10:29:44 - INFO - __main__ - Step 94252: {'lr': 0.00015555126791439114, 'samples': 18096384, 'steps': 94251, 'loss/train': 1.8222185373306274} 11/07/2021 10:29:44 - INFO - __main__ - Step 94253: {'lr': 0.00015554635447787192, 'samples': 18096576, 'steps': 94252, 'loss/train': 1.3009412288665771} 11/07/2021 10:29:45 - INFO - __main__ - Step 94254: {'lr': 0.00015554144108391192, 'samples': 18096768, 'steps': 94253, 'loss/train': 3.3551530838012695} 11/07/2021 10:29:46 - INFO - __main__ - Step 94255: {'lr': 0.0001555365277325135, 'samples': 18096960, 'steps': 94254, 'loss/train': 1.044068455696106} 11/07/2021 10:29:46 - INFO - __main__ - Step 94256: {'lr': 0.00015553161442367886, 'samples': 18097152, 'steps': 94255, 'loss/train': 1.29689359664917} 11/07/2021 10:29:46 - INFO - __main__ - Step 94257: {'lr': 0.00015552670115741022, 'samples': 18097344, 'steps': 94256, 'loss/train': 1.329154372215271} 11/07/2021 10:29:47 - INFO - __main__ - Step 94258: {'lr': 0.0001555217879337098, 'samples': 18097536, 'steps': 94257, 'loss/train': 1.6087898015975952} 11/07/2021 10:29:47 - INFO - __main__ - Step 94259: {'lr': 0.00015551687475257977, 'samples': 18097728, 'steps': 94258, 'loss/train': 1.0478267669677734} 11/07/2021 10:29:48 - INFO - __main__ - Step 94260: {'lr': 0.00015551196161402243, 'samples': 18097920, 'steps': 94259, 'loss/train': 1.5537561178207397} 11/07/2021 10:29:48 - INFO - __main__ - Step 94261: {'lr': 0.00015550704851803991, 'samples': 18098112, 'steps': 94260, 'loss/train': 1.1562894582748413} 11/07/2021 10:29:49 - INFO - __main__ - Step 94262: {'lr': 0.00015550213546463443, 'samples': 18098304, 'steps': 94261, 'loss/train': 1.3634225130081177} 11/07/2021 10:29:49 - INFO - __main__ - Step 94263: {'lr': 0.00015549722245380827, 'samples': 18098496, 'steps': 94262, 'loss/train': 1.564321756362915} 11/07/2021 10:29:49 - INFO - __main__ - Step 94264: {'lr': 0.00015549230948556358, 'samples': 18098688, 'steps': 94263, 'loss/train': 1.4258842468261719} 11/07/2021 10:29:50 - INFO - __main__ - Step 94265: {'lr': 0.00015548739655990262, 'samples': 18098880, 'steps': 94264, 'loss/train': 1.4893176555633545} 11/07/2021 10:29:51 - INFO - __main__ - Step 94266: {'lr': 0.00015548248367682767, 'samples': 18099072, 'steps': 94265, 'loss/train': 1.3613101243972778} 11/07/2021 10:29:51 - INFO - __main__ - Step 94267: {'lr': 0.0001554775708363408, 'samples': 18099264, 'steps': 94266, 'loss/train': 1.321814775466919} 11/07/2021 10:29:52 - INFO - __main__ - Step 94268: {'lr': 0.00015547265803844421, 'samples': 18099456, 'steps': 94267, 'loss/train': 1.260284185409546} 11/07/2021 10:29:52 - INFO - __main__ - Step 94269: {'lr': 0.0001554677452831402, 'samples': 18099648, 'steps': 94268, 'loss/train': 1.5193440914154053} 11/07/2021 10:29:52 - INFO - __main__ - Step 94270: {'lr': 0.00015546283257043098, 'samples': 18099840, 'steps': 94269, 'loss/train': 1.3448704481124878} 11/07/2021 10:29:53 - INFO - __main__ - Step 94271: {'lr': 0.00015545791990031872, 'samples': 18100032, 'steps': 94270, 'loss/train': 1.1999995708465576} 11/07/2021 10:29:54 - INFO - __main__ - Step 94272: {'lr': 0.0001554530072728057, 'samples': 18100224, 'steps': 94271, 'loss/train': 1.604166030883789} 11/07/2021 10:29:54 - INFO - __main__ - Step 94273: {'lr': 0.00015544809468789406, 'samples': 18100416, 'steps': 94272, 'loss/train': 0.9630998373031616} 11/07/2021 10:29:54 - INFO - __main__ - Step 94274: {'lr': 0.00015544318214558606, 'samples': 18100608, 'steps': 94273, 'loss/train': 1.1630381345748901} 11/07/2021 10:29:55 - INFO - __main__ - Step 94275: {'lr': 0.00015543826964588392, 'samples': 18100800, 'steps': 94274, 'loss/train': 1.0049678087234497} 11/07/2021 10:29:56 - INFO - __main__ - Step 94276: {'lr': 0.00015543335718878982, 'samples': 18100992, 'steps': 94275, 'loss/train': 1.4026352167129517} 11/07/2021 10:29:56 - INFO - __main__ - Step 94277: {'lr': 0.000155428444774306, 'samples': 18101184, 'steps': 94276, 'loss/train': 1.662335991859436} 11/07/2021 10:29:56 - INFO - __main__ - Step 94278: {'lr': 0.00015542353240243474, 'samples': 18101376, 'steps': 94277, 'loss/train': 1.0084232091903687} 11/07/2021 10:29:57 - INFO - __main__ - Step 94279: {'lr': 0.00015541862007317807, 'samples': 18101568, 'steps': 94278, 'loss/train': 1.591689944267273} 11/07/2021 10:29:57 - INFO - __main__ - Step 94280: {'lr': 0.00015541370778653841, 'samples': 18101760, 'steps': 94279, 'loss/train': 1.7115002870559692} 11/07/2021 10:29:58 - INFO - __main__ - Step 94281: {'lr': 0.0001554087955425178, 'samples': 18101952, 'steps': 94280, 'loss/train': 1.3433436155319214} 11/07/2021 10:29:58 - INFO - __main__ - Step 94282: {'lr': 0.00015540388334111852, 'samples': 18102144, 'steps': 94281, 'loss/train': 1.2364211082458496} 11/07/2021 10:29:59 - INFO - __main__ - Step 94283: {'lr': 0.0001553989711823428, 'samples': 18102336, 'steps': 94282, 'loss/train': 1.0991977453231812} 11/07/2021 10:29:59 - INFO - __main__ - Step 94284: {'lr': 0.00015539405906619282, 'samples': 18102528, 'steps': 94283, 'loss/train': 1.5131224393844604} 11/07/2021 10:29:59 - INFO - __main__ - Step 94285: {'lr': 0.00015538914699267088, 'samples': 18102720, 'steps': 94284, 'loss/train': 1.0772000551223755} 11/07/2021 10:30:01 - INFO - __main__ - Step 94286: {'lr': 0.00015538423496177907, 'samples': 18102912, 'steps': 94285, 'loss/train': 1.671264886856079} 11/07/2021 10:30:01 - INFO - __main__ - Step 94287: {'lr': 0.0001553793229735197, 'samples': 18103104, 'steps': 94286, 'loss/train': 1.4512181282043457} 11/07/2021 10:30:01 - INFO - __main__ - Step 94288: {'lr': 0.00015537441102789491, 'samples': 18103296, 'steps': 94287, 'loss/train': 1.2959116697311401} 11/07/2021 10:30:02 - INFO - __main__ - Step 94289: {'lr': 0.00015536949912490702, 'samples': 18103488, 'steps': 94288, 'loss/train': 1.40120267868042} 11/07/2021 10:30:02 - INFO - __main__ - Step 94290: {'lr': 0.00015536458726455812, 'samples': 18103680, 'steps': 94289, 'loss/train': 0.9739714860916138} 11/07/2021 10:30:03 - INFO - __main__ - Step 94291: {'lr': 0.00015535967544685048, 'samples': 18103872, 'steps': 94290, 'loss/train': 1.3028825521469116} 11/07/2021 10:30:03 - INFO - __main__ - Step 94292: {'lr': 0.0001553547636717863, 'samples': 18104064, 'steps': 94291, 'loss/train': 1.5983655452728271} 11/07/2021 10:30:04 - INFO - __main__ - Step 94293: {'lr': 0.0001553498519393679, 'samples': 18104256, 'steps': 94292, 'loss/train': 0.8821430206298828} 11/07/2021 10:30:04 - INFO - __main__ - Step 94294: {'lr': 0.00015534494024959728, 'samples': 18104448, 'steps': 94293, 'loss/train': 1.43101966381073} 11/07/2021 10:30:04 - INFO - __main__ - Step 94295: {'lr': 0.00015534002860247682, 'samples': 18104640, 'steps': 94294, 'loss/train': 0.9472008943557739} 11/07/2021 10:30:06 - INFO - __main__ - Step 94296: {'lr': 0.00015533511699800868, 'samples': 18104832, 'steps': 94295, 'loss/train': 1.2910276651382446} 11/07/2021 10:30:06 - INFO - __main__ - Step 94297: {'lr': 0.00015533020543619504, 'samples': 18105024, 'steps': 94296, 'loss/train': 0.8867065906524658} 11/07/2021 10:30:06 - INFO - __main__ - Step 94298: {'lr': 0.00015532529391703814, 'samples': 18105216, 'steps': 94297, 'loss/train': 1.3309804201126099} 11/07/2021 10:30:07 - INFO - __main__ - Step 94299: {'lr': 0.00015532038244054025, 'samples': 18105408, 'steps': 94298, 'loss/train': 1.2945231199264526} 11/07/2021 10:30:07 - INFO - __main__ - Step 94300: {'lr': 0.00015531547100670356, 'samples': 18105600, 'steps': 94299, 'loss/train': 1.6158121824264526} 11/07/2021 10:30:08 - INFO - __main__ - Step 94301: {'lr': 0.0001553105596155302, 'samples': 18105792, 'steps': 94300, 'loss/train': 1.6160049438476562} 11/07/2021 10:30:08 - INFO - __main__ - Step 94302: {'lr': 0.00015530564826702245, 'samples': 18105984, 'steps': 94301, 'loss/train': 1.7289788722991943} 11/07/2021 10:30:09 - INFO - __main__ - Step 94303: {'lr': 0.00015530073696118252, 'samples': 18106176, 'steps': 94302, 'loss/train': 1.4218287467956543} 11/07/2021 10:30:09 - INFO - __main__ - Step 94304: {'lr': 0.0001552958256980126, 'samples': 18106368, 'steps': 94303, 'loss/train': 1.162545919418335} 11/07/2021 10:30:09 - INFO - __main__ - Step 94305: {'lr': 0.00015529091447751494, 'samples': 18106560, 'steps': 94304, 'loss/train': 1.6003642082214355} 11/07/2021 10:30:10 - INFO - __main__ - Step 94306: {'lr': 0.00015528600329969171, 'samples': 18106752, 'steps': 94305, 'loss/train': 0.42050179839134216} 11/07/2021 10:30:11 - INFO - __main__ - Step 94307: {'lr': 0.00015528109216454523, 'samples': 18106944, 'steps': 94306, 'loss/train': 1.1243658065795898} 11/07/2021 10:30:11 - INFO - __main__ - Step 94308: {'lr': 0.00015527618107207756, 'samples': 18107136, 'steps': 94307, 'loss/train': 0.8636630177497864} 11/07/2021 10:30:12 - INFO - __main__ - Step 94309: {'lr': 0.00015527127002229097, 'samples': 18107328, 'steps': 94308, 'loss/train': 1.4110054969787598} 11/07/2021 10:30:12 - INFO - __main__ - Step 94310: {'lr': 0.0001552663590151877, 'samples': 18107520, 'steps': 94309, 'loss/train': 0.579534649848938} 11/07/2021 10:30:12 - INFO - __main__ - Step 94311: {'lr': 0.00015526144805076998, 'samples': 18107712, 'steps': 94310, 'loss/train': 1.4252729415893555} 11/07/2021 10:30:13 - INFO - __main__ - Step 94312: {'lr': 0.00015525653712903994, 'samples': 18107904, 'steps': 94311, 'loss/train': 1.3919661045074463} 11/07/2021 10:30:14 - INFO - __main__ - Step 94313: {'lr': 0.00015525162624999985, 'samples': 18108096, 'steps': 94312, 'loss/train': 1.4198063611984253} 11/07/2021 10:30:14 - INFO - __main__ - Step 94314: {'lr': 0.00015524671541365193, 'samples': 18108288, 'steps': 94313, 'loss/train': 1.8873491287231445} 11/07/2021 10:30:14 - INFO - __main__ - Step 94315: {'lr': 0.00015524180461999837, 'samples': 18108480, 'steps': 94314, 'loss/train': 1.341162085533142} 11/07/2021 10:30:15 - INFO - __main__ - Step 94316: {'lr': 0.0001552368938690414, 'samples': 18108672, 'steps': 94315, 'loss/train': 0.9982638955116272} 11/07/2021 10:30:16 - INFO - __main__ - Step 94317: {'lr': 0.00015523198316078318, 'samples': 18108864, 'steps': 94316, 'loss/train': 1.6903960704803467} 11/07/2021 10:30:17 - INFO - __main__ - Step 94318: {'lr': 0.000155227072495226, 'samples': 18109056, 'steps': 94317, 'loss/train': 1.4444384574890137} 11/07/2021 10:30:17 - INFO - __main__ - Step 94319: {'lr': 0.00015522216187237203, 'samples': 18109248, 'steps': 94318, 'loss/train': 1.0606012344360352} 11/07/2021 10:30:17 - INFO - __main__ - Step 94320: {'lr': 0.00015521725129222352, 'samples': 18109440, 'steps': 94319, 'loss/train': 1.376879096031189} 11/07/2021 10:30:18 - INFO - __main__ - Step 94321: {'lr': 0.00015521234075478263, 'samples': 18109632, 'steps': 94320, 'loss/train': 1.1627073287963867} 11/07/2021 10:30:18 - INFO - __main__ - Step 94322: {'lr': 0.0001552074302600517, 'samples': 18109824, 'steps': 94321, 'loss/train': 2.1733264923095703} 11/07/2021 10:30:20 - INFO - __main__ - Step 94323: {'lr': 0.00015520251980803267, 'samples': 18110016, 'steps': 94322, 'loss/train': 2.336904287338257} 11/07/2021 10:30:20 - INFO - __main__ - Step 94324: {'lr': 0.00015519760939872802, 'samples': 18110208, 'steps': 94323, 'loss/train': 1.2553907632827759} 11/07/2021 10:30:20 - INFO - __main__ - Step 94325: {'lr': 0.00015519269903213983, 'samples': 18110400, 'steps': 94324, 'loss/train': 1.529600739479065} 11/07/2021 10:30:21 - INFO - __main__ - Step 94326: {'lr': 0.00015518778870827031, 'samples': 18110592, 'steps': 94325, 'loss/train': 2.0065677165985107} 11/07/2021 10:30:21 - INFO - __main__ - Step 94327: {'lr': 0.00015518287842712178, 'samples': 18110784, 'steps': 94326, 'loss/train': 1.7899709939956665} 11/07/2021 10:30:21 - INFO - __main__ - Step 94328: {'lr': 0.00015517796818869634, 'samples': 18110976, 'steps': 94327, 'loss/train': 1.5543022155761719} 11/07/2021 10:30:23 - INFO - __main__ - Step 94329: {'lr': 0.00015517305799299624, 'samples': 18111168, 'steps': 94328, 'loss/train': 1.1329413652420044} 11/07/2021 10:30:23 - INFO - __main__ - Step 94330: {'lr': 0.0001551681478400237, 'samples': 18111360, 'steps': 94329, 'loss/train': 1.0688990354537964} 11/07/2021 10:30:23 - INFO - __main__ - Step 94331: {'lr': 0.00015516323772978097, 'samples': 18111552, 'steps': 94330, 'loss/train': 1.347328543663025} 11/07/2021 10:30:24 - INFO - __main__ - Step 94332: {'lr': 0.00015515832766227017, 'samples': 18111744, 'steps': 94331, 'loss/train': 0.08388785272836685} 11/07/2021 10:30:24 - INFO - __main__ - Step 94333: {'lr': 0.00015515341763749368, 'samples': 18111936, 'steps': 94332, 'loss/train': 1.5168812274932861} 11/07/2021 10:30:25 - INFO - __main__ - Step 94334: {'lr': 0.0001551485076554535, 'samples': 18112128, 'steps': 94333, 'loss/train': 1.5664525032043457} 11/07/2021 10:30:25 - INFO - __main__ - Step 94335: {'lr': 0.00015514359771615194, 'samples': 18112320, 'steps': 94334, 'loss/train': 1.0842111110687256} 11/07/2021 10:30:26 - INFO - __main__ - Step 94336: {'lr': 0.00015513868781959122, 'samples': 18112512, 'steps': 94335, 'loss/train': 1.2029787302017212} 11/07/2021 10:30:26 - INFO - __main__ - Step 94337: {'lr': 0.00015513377796577354, 'samples': 18112704, 'steps': 94336, 'loss/train': 1.5790947675704956} 11/07/2021 10:30:26 - INFO - __main__ - Step 94338: {'lr': 0.00015512886815470113, 'samples': 18112896, 'steps': 94337, 'loss/train': 1.169625997543335} 11/07/2021 10:30:27 - INFO - __main__ - Step 94339: {'lr': 0.00015512395838637616, 'samples': 18113088, 'steps': 94338, 'loss/train': 1.4122551679611206} 11/07/2021 10:30:28 - INFO - __main__ - Step 94340: {'lr': 0.00015511904866080084, 'samples': 18113280, 'steps': 94339, 'loss/train': 0.294057160615921} 11/07/2021 10:30:28 - INFO - __main__ - Step 94341: {'lr': 0.0001551141389779775, 'samples': 18113472, 'steps': 94340, 'loss/train': 5.722957134246826} 11/07/2021 10:30:29 - INFO - __main__ - Step 94342: {'lr': 0.00015510922933790818, 'samples': 18113664, 'steps': 94341, 'loss/train': 1.3915493488311768} 11/07/2021 10:30:29 - INFO - __main__ - Step 94343: {'lr': 0.00015510431974059523, 'samples': 18113856, 'steps': 94342, 'loss/train': 1.3192787170410156} 11/07/2021 10:30:29 - INFO - __main__ - Step 94344: {'lr': 0.0001550994101860408, 'samples': 18114048, 'steps': 94343, 'loss/train': 0.28205034136772156} 11/07/2021 10:30:30 - INFO - __main__ - Step 94345: {'lr': 0.0001550945006742471, 'samples': 18114240, 'steps': 94344, 'loss/train': 0.9650419354438782} 11/07/2021 10:30:31 - INFO - __main__ - Step 94346: {'lr': 0.00015508959120521634, 'samples': 18114432, 'steps': 94345, 'loss/train': 1.138118028640747} 11/07/2021 10:30:31 - INFO - __main__ - Step 94347: {'lr': 0.00015508468177895086, 'samples': 18114624, 'steps': 94346, 'loss/train': 1.57643723487854} 11/07/2021 10:30:31 - INFO - __main__ - Step 94348: {'lr': 0.0001550797723954527, 'samples': 18114816, 'steps': 94347, 'loss/train': 1.4552803039550781} 11/07/2021 10:30:32 - INFO - __main__ - Step 94349: {'lr': 0.00015507486305472407, 'samples': 18115008, 'steps': 94348, 'loss/train': 1.0250132083892822} 11/07/2021 10:30:33 - INFO - __main__ - Step 94350: {'lr': 0.00015506995375676725, 'samples': 18115200, 'steps': 94349, 'loss/train': 1.0803306102752686} 11/07/2021 10:30:33 - INFO - __main__ - Step 94351: {'lr': 0.00015506504450158446, 'samples': 18115392, 'steps': 94350, 'loss/train': 1.1450035572052002} 11/07/2021 10:30:33 - INFO - __main__ - Step 94352: {'lr': 0.0001550601352891779, 'samples': 18115584, 'steps': 94351, 'loss/train': 1.208938717842102} 11/07/2021 10:30:34 - INFO - __main__ - Step 94353: {'lr': 0.00015505522611954976, 'samples': 18115776, 'steps': 94352, 'loss/train': 1.2844973802566528} 11/07/2021 10:30:34 - INFO - __main__ - Step 94354: {'lr': 0.00015505031699270227, 'samples': 18115968, 'steps': 94353, 'loss/train': 1.034609079360962} 11/07/2021 10:30:35 - INFO - __main__ - Step 94355: {'lr': 0.00015504540790863764, 'samples': 18116160, 'steps': 94354, 'loss/train': 1.5479092597961426} 11/07/2021 10:30:36 - INFO - __main__ - Step 94356: {'lr': 0.0001550404988673581, 'samples': 18116352, 'steps': 94355, 'loss/train': 1.504126787185669} 11/07/2021 10:30:36 - INFO - __main__ - Step 94357: {'lr': 0.00015503558986886584, 'samples': 18116544, 'steps': 94356, 'loss/train': 1.9678527116775513} 11/07/2021 10:30:36 - INFO - __main__ - Step 94358: {'lr': 0.00015503068091316308, 'samples': 18116736, 'steps': 94357, 'loss/train': 1.3319494724273682} 11/07/2021 10:30:37 - INFO - __main__ - Step 94359: {'lr': 0.00015502577200025204, 'samples': 18116928, 'steps': 94358, 'loss/train': 1.2564914226531982} 11/07/2021 10:30:38 - INFO - __main__ - Step 94360: {'lr': 0.00015502086313013504, 'samples': 18117120, 'steps': 94359, 'loss/train': 1.1396610736846924} 11/07/2021 10:30:38 - INFO - __main__ - Step 94361: {'lr': 0.000155015954302814, 'samples': 18117312, 'steps': 94360, 'loss/train': 1.1625491380691528} 11/07/2021 10:30:38 - INFO - __main__ - Step 94362: {'lr': 0.00015501104551829138, 'samples': 18117504, 'steps': 94361, 'loss/train': 1.103596806526184} 11/07/2021 10:30:39 - INFO - __main__ - Step 94363: {'lr': 0.00015500613677656928, 'samples': 18117696, 'steps': 94362, 'loss/train': 1.1490029096603394} 11/07/2021 10:30:39 - INFO - __main__ - Step 94364: {'lr': 0.00015500122807764994, 'samples': 18117888, 'steps': 94363, 'loss/train': 1.1923936605453491} 11/07/2021 10:30:40 - INFO - __main__ - Step 94365: {'lr': 0.00015499631942153557, 'samples': 18118080, 'steps': 94364, 'loss/train': 1.1567784547805786} 11/07/2021 10:30:40 - INFO - __main__ - Step 94366: {'lr': 0.00015499141080822844, 'samples': 18118272, 'steps': 94365, 'loss/train': 1.1289554834365845} 11/07/2021 10:30:41 - INFO - __main__ - Step 94367: {'lr': 0.0001549865022377307, 'samples': 18118464, 'steps': 94366, 'loss/train': 1.3302520513534546} 11/07/2021 10:30:41 - INFO - __main__ - Step 94368: {'lr': 0.00015498159371004456, 'samples': 18118656, 'steps': 94367, 'loss/train': 1.5250211954116821} 11/07/2021 10:30:41 - INFO - __main__ - Step 94369: {'lr': 0.00015497668522517229, 'samples': 18118848, 'steps': 94368, 'loss/train': 1.2869629859924316} 11/07/2021 10:30:42 - INFO - __main__ - Step 94370: {'lr': 0.000154971776783116, 'samples': 18119040, 'steps': 94369, 'loss/train': 1.3844407796859741} 11/07/2021 10:30:43 - INFO - __main__ - Step 94371: {'lr': 0.00015496686838387797, 'samples': 18119232, 'steps': 94370, 'loss/train': 0.9243103861808777} 11/07/2021 10:30:43 - INFO - __main__ - Step 94372: {'lr': 0.00015496196002746042, 'samples': 18119424, 'steps': 94371, 'loss/train': 1.1124961376190186} 11/07/2021 10:30:44 - INFO - __main__ - Step 94373: {'lr': 0.00015495705171386553, 'samples': 18119616, 'steps': 94372, 'loss/train': 1.6973540782928467} 11/07/2021 10:30:44 - INFO - __main__ - Step 94374: {'lr': 0.00015495214344309565, 'samples': 18119808, 'steps': 94373, 'loss/train': 1.4686920642852783} 11/07/2021 10:30:45 - INFO - __main__ - Step 94375: {'lr': 0.00015494723521515278, 'samples': 18120000, 'steps': 94374, 'loss/train': 1.3999184370040894} 11/07/2021 10:30:45 - INFO - __main__ - Step 94376: {'lr': 0.00015494232703003918, 'samples': 18120192, 'steps': 94375, 'loss/train': 1.3745983839035034} 11/07/2021 10:30:45 - INFO - __main__ - Step 94377: {'lr': 0.00015493741888775715, 'samples': 18120384, 'steps': 94376, 'loss/train': 1.2907198667526245} 11/07/2021 10:30:46 - INFO - __main__ - Step 94378: {'lr': 0.00015493251078830877, 'samples': 18120576, 'steps': 94377, 'loss/train': 1.1857562065124512} 11/07/2021 10:30:46 - INFO - __main__ - Step 94379: {'lr': 0.00015492760273169644, 'samples': 18120768, 'steps': 94378, 'loss/train': 1.4240484237670898} 11/07/2021 10:30:47 - INFO - __main__ - Step 94380: {'lr': 0.00015492269471792218, 'samples': 18120960, 'steps': 94379, 'loss/train': 1.2210936546325684} 11/07/2021 10:30:47 - INFO - __main__ - Step 94381: {'lr': 0.0001549177867469883, 'samples': 18121152, 'steps': 94380, 'loss/train': 1.2921661138534546} 11/07/2021 10:30:48 - INFO - __main__ - Step 94382: {'lr': 0.00015491287881889705, 'samples': 18121344, 'steps': 94381, 'loss/train': 1.3915215730667114} 11/07/2021 10:30:48 - INFO - __main__ - Step 94383: {'lr': 0.00015490797093365054, 'samples': 18121536, 'steps': 94382, 'loss/train': 1.366611361503601} 11/07/2021 10:30:49 - INFO - __main__ - Step 94384: {'lr': 0.00015490306309125102, 'samples': 18121728, 'steps': 94383, 'loss/train': 1.4348080158233643} 11/07/2021 10:30:50 - INFO - __main__ - Step 94385: {'lr': 0.00015489815529170077, 'samples': 18121920, 'steps': 94384, 'loss/train': 1.4102343320846558} 11/07/2021 10:30:50 - INFO - __main__ - Step 94386: {'lr': 0.00015489324753500188, 'samples': 18122112, 'steps': 94385, 'loss/train': 1.7406574487686157} 11/07/2021 10:30:50 - INFO - __main__ - Step 94387: {'lr': 0.00015488833982115675, 'samples': 18122304, 'steps': 94386, 'loss/train': 1.2897876501083374} 11/07/2021 10:30:51 - INFO - __main__ - Step 94388: {'lr': 0.00015488343215016738, 'samples': 18122496, 'steps': 94387, 'loss/train': 1.1554787158966064} 11/07/2021 10:30:51 - INFO - __main__ - Step 94389: {'lr': 0.00015487852452203605, 'samples': 18122688, 'steps': 94388, 'loss/train': 1.0538755655288696} 11/07/2021 10:30:51 - INFO - __main__ - Step 94390: {'lr': 0.000154873616936765, 'samples': 18122880, 'steps': 94389, 'loss/train': 0.8641053438186646} 11/07/2021 10:30:52 - INFO - __main__ - Step 94391: {'lr': 0.00015486870939435644, 'samples': 18123072, 'steps': 94390, 'loss/train': 1.8614084720611572} 11/07/2021 10:30:53 - INFO - __main__ - Step 94392: {'lr': 0.00015486380189481253, 'samples': 18123264, 'steps': 94391, 'loss/train': 0.75803142786026} 11/07/2021 10:30:53 - INFO - __main__ - Step 94393: {'lr': 0.00015485889443813555, 'samples': 18123456, 'steps': 94392, 'loss/train': 1.2375833988189697} 11/07/2021 10:30:53 - INFO - __main__ - Step 94394: {'lr': 0.0001548539870243277, 'samples': 18123648, 'steps': 94393, 'loss/train': 1.5152887105941772} 11/07/2021 10:30:54 - INFO - __main__ - Step 94395: {'lr': 0.00015484907965339118, 'samples': 18123840, 'steps': 94394, 'loss/train': 1.3014910221099854} 11/07/2021 10:30:55 - INFO - __main__ - Step 94396: {'lr': 0.00015484417232532817, 'samples': 18124032, 'steps': 94395, 'loss/train': 1.5231810808181763} 11/07/2021 10:30:55 - INFO - __main__ - Step 94397: {'lr': 0.0001548392650401409, 'samples': 18124224, 'steps': 94396, 'loss/train': 1.2947380542755127} 11/07/2021 10:30:55 - INFO - __main__ - Step 94398: {'lr': 0.0001548343577978316, 'samples': 18124416, 'steps': 94397, 'loss/train': 1.335127353668213} 11/07/2021 10:30:56 - INFO - __main__ - Step 94399: {'lr': 0.00015482945059840247, 'samples': 18124608, 'steps': 94398, 'loss/train': 1.445784091949463} 11/07/2021 10:30:56 - INFO - __main__ - Step 94400: {'lr': 0.00015482454344185575, 'samples': 18124800, 'steps': 94399, 'loss/train': 1.4651689529418945} 11/07/2021 10:30:57 - INFO - __main__ - Step 94401: {'lr': 0.0001548196363281937, 'samples': 18124992, 'steps': 94400, 'loss/train': 1.322120189666748} 11/07/2021 10:30:58 - INFO - __main__ - Step 94402: {'lr': 0.00015481472925741834, 'samples': 18125184, 'steps': 94401, 'loss/train': 1.3088093996047974} 11/07/2021 10:30:58 - INFO - __main__ - Step 94403: {'lr': 0.000154809822229532, 'samples': 18125376, 'steps': 94402, 'loss/train': 1.4578428268432617} 11/07/2021 10:30:58 - INFO - __main__ - Step 94404: {'lr': 0.00015480491524453687, 'samples': 18125568, 'steps': 94403, 'loss/train': 1.3030046224594116} 11/07/2021 10:30:59 - INFO - __main__ - Step 94405: {'lr': 0.00015480000830243523, 'samples': 18125760, 'steps': 94404, 'loss/train': 1.5389385223388672} 11/07/2021 10:30:59 - INFO - __main__ - Step 94406: {'lr': 0.00015479510140322918, 'samples': 18125952, 'steps': 94405, 'loss/train': 1.3512113094329834} 11/07/2021 10:31:00 - INFO - __main__ - Step 94407: {'lr': 0.000154790194546921, 'samples': 18126144, 'steps': 94406, 'loss/train': 1.299500584602356} 11/07/2021 10:31:00 - INFO - __main__ - Step 94408: {'lr': 0.0001547852877335129, 'samples': 18126336, 'steps': 94407, 'loss/train': 1.727208137512207} 11/07/2021 10:31:01 - INFO - __main__ - Step 94409: {'lr': 0.0001547803809630071, 'samples': 18126528, 'steps': 94408, 'loss/train': 1.2301123142242432} 11/07/2021 10:31:01 - INFO - __main__ - Step 94410: {'lr': 0.00015477547423540578, 'samples': 18126720, 'steps': 94409, 'loss/train': 1.5307682752609253} 11/07/2021 10:31:01 - INFO - __main__ - Step 94411: {'lr': 0.00015477056755071114, 'samples': 18126912, 'steps': 94410, 'loss/train': 1.1182609796524048} 11/07/2021 10:31:02 - INFO - __main__ - Step 94412: {'lr': 0.00015476566090892542, 'samples': 18127104, 'steps': 94411, 'loss/train': 1.7619637250900269} 11/07/2021 10:31:03 - INFO - __main__ - Step 94413: {'lr': 0.00015476075431005088, 'samples': 18127296, 'steps': 94412, 'loss/train': 1.3835020065307617} 11/07/2021 10:31:03 - INFO - __main__ - Step 94414: {'lr': 0.00015475584775408968, 'samples': 18127488, 'steps': 94413, 'loss/train': 1.7207826375961304} 11/07/2021 10:31:04 - INFO - __main__ - Step 94415: {'lr': 0.00015475094124104398, 'samples': 18127680, 'steps': 94414, 'loss/train': 1.6865239143371582} 11/07/2021 10:31:04 - INFO - __main__ - Step 94416: {'lr': 0.00015474603477091603, 'samples': 18127872, 'steps': 94415, 'loss/train': 1.025240421295166} 11/07/2021 10:31:05 - INFO - __main__ - Step 94417: {'lr': 0.00015474112834370802, 'samples': 18128064, 'steps': 94416, 'loss/train': 1.2968772649765015} 11/07/2021 10:31:05 - INFO - __main__ - Step 94418: {'lr': 0.0001547362219594222, 'samples': 18128256, 'steps': 94417, 'loss/train': 1.849234700202942} 11/07/2021 10:31:06 - INFO - __main__ - Step 94419: {'lr': 0.00015473131561806081, 'samples': 18128448, 'steps': 94418, 'loss/train': 1.429114580154419} 11/07/2021 10:31:06 - INFO - __main__ - Step 94420: {'lr': 0.00015472640931962599, 'samples': 18128640, 'steps': 94419, 'loss/train': 1.292988657951355} 11/07/2021 10:31:06 - INFO - __main__ - Step 94421: {'lr': 0.00015472150306411998, 'samples': 18128832, 'steps': 94420, 'loss/train': 0.7445428371429443} 11/07/2021 10:31:07 - INFO - __main__ - Step 94422: {'lr': 0.000154716596851545, 'samples': 18129024, 'steps': 94421, 'loss/train': 1.4514334201812744} 11/07/2021 10:31:08 - INFO - __main__ - Step 94423: {'lr': 0.00015471169068190328, 'samples': 18129216, 'steps': 94422, 'loss/train': 1.6642582416534424} 11/07/2021 10:31:08 - INFO - __main__ - Step 94424: {'lr': 0.00015470678455519694, 'samples': 18129408, 'steps': 94423, 'loss/train': 0.8577752113342285} 11/07/2021 10:31:08 - INFO - __main__ - Step 94425: {'lr': 0.00015470187847142829, 'samples': 18129600, 'steps': 94424, 'loss/train': 1.1413013935089111} 11/07/2021 10:31:09 - INFO - __main__ - Step 94426: {'lr': 0.0001546969724305995, 'samples': 18129792, 'steps': 94425, 'loss/train': 1.572954535484314} 11/07/2021 10:31:10 - INFO - __main__ - Step 94427: {'lr': 0.00015469206643271274, 'samples': 18129984, 'steps': 94426, 'loss/train': 1.2406277656555176} 11/07/2021 10:31:10 - INFO - __main__ - Step 94428: {'lr': 0.00015468716047777035, 'samples': 18130176, 'steps': 94427, 'loss/train': 1.6892952919006348} 11/07/2021 10:31:10 - INFO - __main__ - Step 94429: {'lr': 0.0001546822545657744, 'samples': 18130368, 'steps': 94428, 'loss/train': 1.210752248764038} 11/07/2021 10:31:11 - INFO - __main__ - Step 94430: {'lr': 0.00015467734869672716, 'samples': 18130560, 'steps': 94429, 'loss/train': 1.3245649337768555} 11/07/2021 10:31:11 - INFO - __main__ - Step 94431: {'lr': 0.0001546724428706308, 'samples': 18130752, 'steps': 94430, 'loss/train': 1.8185840845108032} 11/07/2021 10:31:11 - INFO - __main__ - Step 94432: {'lr': 0.0001546675370874876, 'samples': 18130944, 'steps': 94431, 'loss/train': 1.594253659248352} 11/07/2021 10:31:13 - INFO - __main__ - Step 94433: {'lr': 0.00015466263134729973, 'samples': 18131136, 'steps': 94432, 'loss/train': 1.1515240669250488} 11/07/2021 10:31:13 - INFO - __main__ - Step 94434: {'lr': 0.00015465772565006946, 'samples': 18131328, 'steps': 94433, 'loss/train': 0.1276903748512268} 11/07/2021 10:31:14 - INFO - __main__ - Step 94435: {'lr': 0.0001546528199957989, 'samples': 18131520, 'steps': 94434, 'loss/train': 1.3040671348571777} 11/07/2021 10:31:14 - INFO - __main__ - Step 94436: {'lr': 0.00015464791438449032, 'samples': 18131712, 'steps': 94435, 'loss/train': 1.4010913372039795} 11/07/2021 10:31:14 - INFO - __main__ - Step 94437: {'lr': 0.0001546430088161459, 'samples': 18131904, 'steps': 94436, 'loss/train': 1.4092985391616821} 11/07/2021 10:31:15 - INFO - __main__ - Step 94438: {'lr': 0.00015463810329076789, 'samples': 18132096, 'steps': 94437, 'loss/train': 0.07710301131010056} 11/07/2021 10:31:16 - INFO - __main__ - Step 94439: {'lr': 0.00015463319780835845, 'samples': 18132288, 'steps': 94438, 'loss/train': 1.4421782493591309} 11/07/2021 10:31:16 - INFO - __main__ - Step 94440: {'lr': 0.00015462829236891984, 'samples': 18132480, 'steps': 94439, 'loss/train': 1.7368944883346558} 11/07/2021 10:31:16 - INFO - __main__ - Step 94441: {'lr': 0.00015462338697245427, 'samples': 18132672, 'steps': 94440, 'loss/train': 1.5250253677368164} 11/07/2021 10:31:17 - INFO - __main__ - Step 94442: {'lr': 0.00015461848161896392, 'samples': 18132864, 'steps': 94441, 'loss/train': 1.6635476350784302} 11/07/2021 10:31:18 - INFO - __main__ - Step 94443: {'lr': 0.00015461357630845097, 'samples': 18133056, 'steps': 94442, 'loss/train': 1.111561894416809} 11/07/2021 10:31:18 - INFO - __main__ - Step 94444: {'lr': 0.0001546086710409177, 'samples': 18133248, 'steps': 94443, 'loss/train': 1.6935395002365112} 11/07/2021 10:31:18 - INFO - __main__ - Step 94445: {'lr': 0.00015460376581636633, 'samples': 18133440, 'steps': 94444, 'loss/train': 1.9515665769577026} 11/07/2021 10:31:19 - INFO - __main__ - Step 94446: {'lr': 0.000154598860634799, 'samples': 18133632, 'steps': 94445, 'loss/train': 1.3432530164718628} 11/07/2021 10:31:19 - INFO - __main__ - Step 94447: {'lr': 0.00015459395549621792, 'samples': 18133824, 'steps': 94446, 'loss/train': 1.34650719165802} 11/07/2021 10:31:20 - INFO - __main__ - Step 94448: {'lr': 0.00015458905040062536, 'samples': 18134016, 'steps': 94447, 'loss/train': 1.1944139003753662} 11/07/2021 10:31:20 - INFO - __main__ - Step 94449: {'lr': 0.00015458414534802348, 'samples': 18134208, 'steps': 94448, 'loss/train': 1.1490732431411743} 11/07/2021 10:31:21 - INFO - __main__ - Step 94450: {'lr': 0.00015457924033841452, 'samples': 18134400, 'steps': 94449, 'loss/train': 5.68433952331543} 11/07/2021 10:31:21 - INFO - __main__ - Step 94451: {'lr': 0.00015457433537180068, 'samples': 18134592, 'steps': 94450, 'loss/train': 1.6763607263565063} 11/07/2021 10:31:22 - INFO - __main__ - Step 94452: {'lr': 0.00015456943044818417, 'samples': 18134784, 'steps': 94451, 'loss/train': 1.2384800910949707} 11/07/2021 10:31:22 - INFO - __main__ - Step 94453: {'lr': 0.00015456452556756722, 'samples': 18134976, 'steps': 94452, 'loss/train': 1.2072147130966187} 11/07/2021 10:31:23 - INFO - __main__ - Step 94454: {'lr': 0.00015455962072995205, 'samples': 18135168, 'steps': 94453, 'loss/train': 1.686795949935913} 11/07/2021 10:31:23 - INFO - __main__ - Step 94455: {'lr': 0.00015455471593534082, 'samples': 18135360, 'steps': 94454, 'loss/train': 1.1919769048690796} 11/07/2021 10:31:24 - INFO - __main__ - Step 94456: {'lr': 0.0001545498111837358, 'samples': 18135552, 'steps': 94455, 'loss/train': 1.1315594911575317} 11/07/2021 10:31:24 - INFO - __main__ - Step 94457: {'lr': 0.00015454490647513907, 'samples': 18135744, 'steps': 94456, 'loss/train': 1.7582473754882812} 11/07/2021 10:31:24 - INFO - __main__ - Step 94458: {'lr': 0.00015454000180955296, 'samples': 18135936, 'steps': 94457, 'loss/train': 1.1326234340667725} 11/07/2021 10:31:25 - INFO - __main__ - Step 94459: {'lr': 0.00015453509718697968, 'samples': 18136128, 'steps': 94458, 'loss/train': 1.5013511180877686} 11/07/2021 10:31:26 - INFO - __main__ - Step 94460: {'lr': 0.0001545301926074214, 'samples': 18136320, 'steps': 94459, 'loss/train': 1.359291911125183} 11/07/2021 10:31:26 - INFO - __main__ - Step 94461: {'lr': 0.0001545252880708803, 'samples': 18136512, 'steps': 94460, 'loss/train': 1.5824211835861206} 11/07/2021 10:31:26 - INFO - __main__ - Step 94462: {'lr': 0.0001545203835773587, 'samples': 18136704, 'steps': 94461, 'loss/train': 1.4088901281356812} 11/07/2021 10:31:27 - INFO - __main__ - Step 94463: {'lr': 0.0001545154791268587, 'samples': 18136896, 'steps': 94462, 'loss/train': 1.4963529109954834} 11/07/2021 10:31:28 - INFO - __main__ - Step 94464: {'lr': 0.00015451057471938258, 'samples': 18137088, 'steps': 94463, 'loss/train': 1.2967545986175537} 11/07/2021 10:31:28 - INFO - __main__ - Step 94465: {'lr': 0.00015450567035493246, 'samples': 18137280, 'steps': 94464, 'loss/train': 1.5055623054504395} 11/07/2021 10:31:29 - INFO - __main__ - Step 94466: {'lr': 0.00015450076603351065, 'samples': 18137472, 'steps': 94465, 'loss/train': 0.9544816017150879} 11/07/2021 10:31:29 - INFO - __main__ - Step 94467: {'lr': 0.00015449586175511932, 'samples': 18137664, 'steps': 94466, 'loss/train': 1.543015718460083} 11/07/2021 10:31:29 - INFO - __main__ - Step 94468: {'lr': 0.00015449095751976077, 'samples': 18137856, 'steps': 94467, 'loss/train': 1.8909202814102173} 11/07/2021 10:31:30 - INFO - __main__ - Step 94469: {'lr': 0.00015448605332743707, 'samples': 18138048, 'steps': 94468, 'loss/train': 1.4653990268707275} 11/07/2021 10:31:31 - INFO - __main__ - Step 94470: {'lr': 0.00015448114917815042, 'samples': 18138240, 'steps': 94469, 'loss/train': 1.8654667139053345} 11/07/2021 10:31:31 - INFO - __main__ - Step 94471: {'lr': 0.00015447624507190314, 'samples': 18138432, 'steps': 94470, 'loss/train': 1.329726219177246} 11/07/2021 10:31:31 - INFO - __main__ - Step 94472: {'lr': 0.00015447134100869737, 'samples': 18138624, 'steps': 94471, 'loss/train': 1.1925225257873535} 11/07/2021 10:31:32 - INFO - __main__ - Step 94473: {'lr': 0.00015446643698853533, 'samples': 18138816, 'steps': 94472, 'loss/train': 1.1990543603897095} 11/07/2021 10:31:33 - INFO - __main__ - Step 94474: {'lr': 0.00015446153301141923, 'samples': 18139008, 'steps': 94473, 'loss/train': 0.8362217545509338} 11/07/2021 10:31:33 - INFO - __main__ - Step 94475: {'lr': 0.0001544566290773513, 'samples': 18139200, 'steps': 94474, 'loss/train': 0.6272976398468018} 11/07/2021 10:31:33 - INFO - __main__ - Step 94476: {'lr': 0.00015445172518633373, 'samples': 18139392, 'steps': 94475, 'loss/train': 1.4587717056274414} 11/07/2021 10:31:34 - INFO - __main__ - Step 94477: {'lr': 0.00015444682133836877, 'samples': 18139584, 'steps': 94476, 'loss/train': 1.389453649520874} 11/07/2021 10:31:34 - INFO - __main__ - Step 94478: {'lr': 0.00015444191753345856, 'samples': 18139776, 'steps': 94477, 'loss/train': 1.3434805870056152} 11/07/2021 10:31:34 - INFO - __main__ - Step 94479: {'lr': 0.00015443701377160538, 'samples': 18139968, 'steps': 94478, 'loss/train': 1.5326130390167236} 11/07/2021 10:31:35 - INFO - __main__ - Step 94480: {'lr': 0.00015443211005281137, 'samples': 18140160, 'steps': 94479, 'loss/train': 0.850435197353363} 11/07/2021 10:31:36 - INFO - __main__ - Step 94481: {'lr': 0.00015442720637707892, 'samples': 18140352, 'steps': 94480, 'loss/train': 1.2627291679382324} 11/07/2021 10:31:36 - INFO - __main__ - Step 94482: {'lr': 0.00015442230274441, 'samples': 18140544, 'steps': 94481, 'loss/train': 1.2850347757339478} 11/07/2021 10:31:37 - INFO - __main__ - Step 94483: {'lr': 0.00015441739915480685, 'samples': 18140736, 'steps': 94482, 'loss/train': 1.1728624105453491} 11/07/2021 10:31:37 - INFO - __main__ - Step 94484: {'lr': 0.0001544124956082718, 'samples': 18140928, 'steps': 94483, 'loss/train': 1.2102290391921997} 11/07/2021 10:31:38 - INFO - __main__ - Step 94485: {'lr': 0.00015440759210480698, 'samples': 18141120, 'steps': 94484, 'loss/train': 1.4980757236480713} 11/07/2021 10:31:38 - INFO - __main__ - Step 94486: {'lr': 0.00015440268864441465, 'samples': 18141312, 'steps': 94485, 'loss/train': 1.4410699605941772} 11/07/2021 10:31:39 - INFO - __main__ - Step 94487: {'lr': 0.00015439778522709696, 'samples': 18141504, 'steps': 94486, 'loss/train': 1.7979762554168701} 11/07/2021 10:31:39 - INFO - __main__ - Step 94488: {'lr': 0.0001543928818528562, 'samples': 18141696, 'steps': 94487, 'loss/train': 1.251320242881775} 11/07/2021 10:31:39 - INFO - __main__ - Step 94489: {'lr': 0.00015438797852169447, 'samples': 18141888, 'steps': 94488, 'loss/train': 1.323302149772644} 11/07/2021 10:31:40 - INFO - __main__ - Step 94490: {'lr': 0.00015438307523361409, 'samples': 18142080, 'steps': 94489, 'loss/train': 1.5169130563735962} 11/07/2021 10:31:41 - INFO - __main__ - Step 94491: {'lr': 0.0001543781719886172, 'samples': 18142272, 'steps': 94490, 'loss/train': 1.0358316898345947} 11/07/2021 10:31:41 - INFO - __main__ - Step 94492: {'lr': 0.00015437326878670605, 'samples': 18142464, 'steps': 94491, 'loss/train': 1.2149053812026978} 11/07/2021 10:31:41 - INFO - __main__ - Step 94493: {'lr': 0.0001543683656278828, 'samples': 18142656, 'steps': 94492, 'loss/train': 0.18921880424022675} 11/07/2021 10:31:42 - INFO - __main__ - Step 94494: {'lr': 0.0001543634625121497, 'samples': 18142848, 'steps': 94493, 'loss/train': 0.6481598019599915} 11/07/2021 10:31:43 - INFO - __main__ - Step 94495: {'lr': 0.00015435855943950904, 'samples': 18143040, 'steps': 94494, 'loss/train': 1.334032654762268} 11/07/2021 10:31:43 - INFO - __main__ - Step 94496: {'lr': 0.00015435365640996285, 'samples': 18143232, 'steps': 94495, 'loss/train': 1.0958709716796875} 11/07/2021 10:31:44 - INFO - __main__ - Step 94497: {'lr': 0.00015434875342351342, 'samples': 18143424, 'steps': 94496, 'loss/train': 1.256848692893982} 11/07/2021 10:31:44 - INFO - __main__ - Step 94498: {'lr': 0.00015434385048016298, 'samples': 18143616, 'steps': 94497, 'loss/train': 1.4415730237960815} 11/07/2021 10:31:44 - INFO - __main__ - Step 94499: {'lr': 0.00015433894757991374, 'samples': 18143808, 'steps': 94498, 'loss/train': 0.4522702395915985} 11/07/2021 10:31:45 - INFO - __main__ - Step 94500: {'lr': 0.00015433404472276786, 'samples': 18144000, 'steps': 94499, 'loss/train': 1.2280486822128296} 11/07/2021 10:31:46 - INFO - __main__ - Step 94501: {'lr': 0.00015432914190872756, 'samples': 18144192, 'steps': 94500, 'loss/train': 1.2407861948013306} 11/07/2021 10:31:46 - INFO - __main__ - Step 94502: {'lr': 0.00015432423913779513, 'samples': 18144384, 'steps': 94501, 'loss/train': 1.6205042600631714} 11/07/2021 10:31:46 - INFO - __main__ - Step 94503: {'lr': 0.0001543193364099727, 'samples': 18144576, 'steps': 94502, 'loss/train': 1.6726235151290894} 11/07/2021 10:31:47 - INFO - __main__ - Step 94504: {'lr': 0.0001543144337252625, 'samples': 18144768, 'steps': 94503, 'loss/train': 1.191165566444397} 11/07/2021 10:31:47 - INFO - __main__ - Step 94505: {'lr': 0.00015430953108366672, 'samples': 18144960, 'steps': 94504, 'loss/train': 1.70658540725708} 11/07/2021 10:31:48 - INFO - __main__ - Step 94506: {'lr': 0.0001543046284851876, 'samples': 18145152, 'steps': 94505, 'loss/train': 1.5981378555297852} 11/07/2021 10:31:48 - INFO - __main__ - Step 94507: {'lr': 0.00015429972592982734, 'samples': 18145344, 'steps': 94506, 'loss/train': 0.764670729637146} 11/07/2021 10:31:49 - INFO - __main__ - Step 94508: {'lr': 0.00015429482341758826, 'samples': 18145536, 'steps': 94507, 'loss/train': 1.5136185884475708} 11/07/2021 10:31:49 - INFO - __main__ - Step 94509: {'lr': 0.00015428992094847232, 'samples': 18145728, 'steps': 94508, 'loss/train': 1.132437825202942} 11/07/2021 10:31:49 - INFO - __main__ - Step 94510: {'lr': 0.0001542850185224819, 'samples': 18145920, 'steps': 94509, 'loss/train': 0.6921151876449585} 11/07/2021 10:31:50 - INFO - __main__ - Step 94511: {'lr': 0.00015428011613961918, 'samples': 18146112, 'steps': 94510, 'loss/train': 1.2721246480941772} 11/07/2021 10:31:51 - INFO - __main__ - Step 94512: {'lr': 0.00015427521379988635, 'samples': 18146304, 'steps': 94511, 'loss/train': 1.2004976272583008} 11/07/2021 10:31:51 - INFO - __main__ - Step 94513: {'lr': 0.00015427031150328562, 'samples': 18146496, 'steps': 94512, 'loss/train': 1.373999834060669} 11/07/2021 10:31:51 - INFO - __main__ - Step 94514: {'lr': 0.00015426540924981923, 'samples': 18146688, 'steps': 94513, 'loss/train': 1.0025087594985962} 11/07/2021 10:31:52 - INFO - __main__ - Step 94515: {'lr': 0.00015426050703948934, 'samples': 18146880, 'steps': 94514, 'loss/train': 1.7721233367919922} 11/07/2021 10:31:53 - INFO - __main__ - Step 94516: {'lr': 0.00015425560487229822, 'samples': 18147072, 'steps': 94515, 'loss/train': 1.417062520980835} 11/07/2021 10:31:53 - INFO - __main__ - Step 94517: {'lr': 0.00015425070274824803, 'samples': 18147264, 'steps': 94516, 'loss/train': 1.4455453157424927} 11/07/2021 10:31:53 - INFO - __main__ - Step 94518: {'lr': 0.000154245800667341, 'samples': 18147456, 'steps': 94517, 'loss/train': 1.2776018381118774} 11/07/2021 10:31:54 - INFO - __main__ - Step 94519: {'lr': 0.00015424089862957932, 'samples': 18147648, 'steps': 94518, 'loss/train': 1.3322381973266602} 11/07/2021 10:31:54 - INFO - __main__ - Step 94520: {'lr': 0.00015423599663496525, 'samples': 18147840, 'steps': 94519, 'loss/train': 1.3668737411499023} 11/07/2021 10:31:55 - INFO - __main__ - Step 94521: {'lr': 0.00015423109468350093, 'samples': 18148032, 'steps': 94520, 'loss/train': 1.5174692869186401} 11/07/2021 10:31:56 - INFO - __main__ - Step 94522: {'lr': 0.0001542261927751887, 'samples': 18148224, 'steps': 94521, 'loss/train': 1.4883911609649658} 11/07/2021 10:31:56 - INFO - __main__ - Step 94523: {'lr': 0.00015422129091003056, 'samples': 18148416, 'steps': 94522, 'loss/train': 1.1928433179855347} 11/07/2021 10:31:56 - INFO - __main__ - Step 94524: {'lr': 0.00015421638908802887, 'samples': 18148608, 'steps': 94523, 'loss/train': 1.2959315776824951} 11/07/2021 10:31:57 - INFO - __main__ - Step 94525: {'lr': 0.00015421148730918578, 'samples': 18148800, 'steps': 94524, 'loss/train': 1.2601845264434814} 11/07/2021 10:31:58 - INFO - __main__ - Step 94526: {'lr': 0.0001542065855735035, 'samples': 18148992, 'steps': 94525, 'loss/train': 1.4086302518844604} 11/07/2021 10:31:58 - INFO - __main__ - Step 94527: {'lr': 0.00015420168388098426, 'samples': 18149184, 'steps': 94526, 'loss/train': 0.9866416454315186} 11/07/2021 10:31:58 - INFO - __main__ - Step 94528: {'lr': 0.00015419678223163027, 'samples': 18149376, 'steps': 94527, 'loss/train': 1.1917661428451538} 11/07/2021 10:31:59 - INFO - __main__ - Step 94529: {'lr': 0.00015419188062544374, 'samples': 18149568, 'steps': 94528, 'loss/train': 2.012903928756714} 11/07/2021 10:31:59 - INFO - __main__ - Step 94530: {'lr': 0.00015418697906242684, 'samples': 18149760, 'steps': 94529, 'loss/train': 1.2664215564727783} 11/07/2021 10:32:00 - INFO - __main__ - Step 94531: {'lr': 0.00015418207754258183, 'samples': 18149952, 'steps': 94530, 'loss/train': 1.329163670539856} 11/07/2021 10:32:01 - INFO - __main__ - Step 94532: {'lr': 0.00015417717606591093, 'samples': 18150144, 'steps': 94531, 'loss/train': 1.294395923614502} 11/07/2021 10:32:01 - INFO - __main__ - Step 94533: {'lr': 0.00015417227463241626, 'samples': 18150336, 'steps': 94532, 'loss/train': 1.2323235273361206} 11/07/2021 10:32:01 - INFO - __main__ - Step 94534: {'lr': 0.00015416737324210013, 'samples': 18150528, 'steps': 94533, 'loss/train': 1.3347607851028442} 11/07/2021 10:32:02 - INFO - __main__ - Step 94535: {'lr': 0.00015416247189496473, 'samples': 18150720, 'steps': 94534, 'loss/train': 1.1978037357330322} 11/07/2021 10:32:03 - INFO - __main__ - Step 94536: {'lr': 0.00015415757059101222, 'samples': 18150912, 'steps': 94535, 'loss/train': 1.396031141281128} 11/07/2021 10:32:03 - INFO - __main__ - Step 94537: {'lr': 0.0001541526693302448, 'samples': 18151104, 'steps': 94536, 'loss/train': 1.4508943557739258} 11/07/2021 10:32:03 - INFO - __main__ - Step 94538: {'lr': 0.00015414776811266471, 'samples': 18151296, 'steps': 94537, 'loss/train': 0.7185602784156799} 11/07/2021 10:32:04 - INFO - __main__ - Step 94539: {'lr': 0.00015414286693827414, 'samples': 18151488, 'steps': 94538, 'loss/train': 1.3179410696029663} 11/07/2021 10:32:04 - INFO - __main__ - Step 94540: {'lr': 0.00015413796580707534, 'samples': 18151680, 'steps': 94539, 'loss/train': 1.645290493965149} 11/07/2021 10:32:05 - INFO - __main__ - Step 94541: {'lr': 0.00015413306471907047, 'samples': 18151872, 'steps': 94540, 'loss/train': 1.5927298069000244} 11/07/2021 10:32:05 - INFO - __main__ - Step 94542: {'lr': 0.0001541281636742618, 'samples': 18152064, 'steps': 94541, 'loss/train': 1.3779398202896118} 11/07/2021 10:32:06 - INFO - __main__ - Step 94543: {'lr': 0.00015412326267265147, 'samples': 18152256, 'steps': 94542, 'loss/train': 0.8975041508674622} 11/07/2021 10:32:06 - INFO - __main__ - Step 94544: {'lr': 0.0001541183617142417, 'samples': 18152448, 'steps': 94543, 'loss/train': 1.1402229070663452} 11/07/2021 10:32:06 - INFO - __main__ - Step 94545: {'lr': 0.00015411346079903477, 'samples': 18152640, 'steps': 94544, 'loss/train': 1.3362202644348145} 11/07/2021 10:32:07 - INFO - __main__ - Step 94546: {'lr': 0.00015410855992703277, 'samples': 18152832, 'steps': 94545, 'loss/train': 1.2904021739959717} 11/07/2021 10:32:08 - INFO - __main__ - Step 94547: {'lr': 0.0001541036590982381, 'samples': 18153024, 'steps': 94546, 'loss/train': 0.5657694935798645} 11/07/2021 10:32:08 - INFO - __main__ - Step 94548: {'lr': 0.00015409875831265274, 'samples': 18153216, 'steps': 94547, 'loss/train': 1.2874438762664795} 11/07/2021 10:32:08 - INFO - __main__ - Step 94549: {'lr': 0.00015409385757027906, 'samples': 18153408, 'steps': 94548, 'loss/train': 1.7800819873809814} 11/07/2021 10:32:09 - INFO - __main__ - Step 94550: {'lr': 0.00015408895687111913, 'samples': 18153600, 'steps': 94549, 'loss/train': 1.3481546640396118} 11/07/2021 10:32:09 - INFO - __main__ - Step 94551: {'lr': 0.00015408405621517528, 'samples': 18153792, 'steps': 94550, 'loss/train': 1.4535576105117798} 11/07/2021 10:32:10 - INFO - __main__ - Step 94552: {'lr': 0.00015407915560244965, 'samples': 18153984, 'steps': 94551, 'loss/train': 1.3911705017089844} 11/07/2021 10:32:11 - INFO - __main__ - Step 94553: {'lr': 0.00015407425503294447, 'samples': 18154176, 'steps': 94552, 'loss/train': 0.9425432682037354} 11/07/2021 10:32:11 - INFO - __main__ - Step 94554: {'lr': 0.000154069354506662, 'samples': 18154368, 'steps': 94553, 'loss/train': 1.7243412733078003} 11/07/2021 10:32:11 - INFO - __main__ - Step 94555: {'lr': 0.0001540644540236043, 'samples': 18154560, 'steps': 94554, 'loss/train': 1.2147177457809448} 11/07/2021 10:32:12 - INFO - __main__ - Step 94556: {'lr': 0.00015405955358377378, 'samples': 18154752, 'steps': 94555, 'loss/train': 1.2418172359466553} 11/07/2021 10:32:13 - INFO - __main__ - Step 94557: {'lr': 0.0001540546531871725, 'samples': 18154944, 'steps': 94556, 'loss/train': 1.3211380243301392} 11/07/2021 10:32:13 - INFO - __main__ - Step 94558: {'lr': 0.00015404975283380275, 'samples': 18155136, 'steps': 94557, 'loss/train': 1.3121153116226196} 11/07/2021 10:32:13 - INFO - __main__ - Step 94559: {'lr': 0.00015404485252366664, 'samples': 18155328, 'steps': 94558, 'loss/train': 1.4019280672073364} 11/07/2021 10:32:14 - INFO - __main__ - Step 94560: {'lr': 0.0001540399522567665, 'samples': 18155520, 'steps': 94559, 'loss/train': 1.2913880348205566} 11/07/2021 10:32:14 - INFO - __main__ - Step 94561: {'lr': 0.00015403505203310443, 'samples': 18155712, 'steps': 94560, 'loss/train': 0.08839049190282822} 11/07/2021 10:32:15 - INFO - __main__ - Step 94562: {'lr': 0.00015403015185268273, 'samples': 18155904, 'steps': 94561, 'loss/train': 1.071553111076355} 11/07/2021 10:32:15 - INFO - __main__ - Step 94563: {'lr': 0.00015402525171550352, 'samples': 18156096, 'steps': 94562, 'loss/train': 1.4904241561889648} 11/07/2021 10:32:16 - INFO - __main__ - Step 94564: {'lr': 0.00015402035162156907, 'samples': 18156288, 'steps': 94563, 'loss/train': 1.1736862659454346} 11/07/2021 10:32:16 - INFO - __main__ - Step 94565: {'lr': 0.00015401545157088154, 'samples': 18156480, 'steps': 94564, 'loss/train': 0.9277775883674622} 11/07/2021 10:32:16 - INFO - __main__ - Step 94566: {'lr': 0.0001540105515634432, 'samples': 18156672, 'steps': 94565, 'loss/train': 0.0696229636669159} 11/07/2021 10:32:17 - INFO - __main__ - Step 94567: {'lr': 0.0001540056515992562, 'samples': 18156864, 'steps': 94566, 'loss/train': 1.1850473880767822} 11/07/2021 10:32:18 - INFO - __main__ - Step 94568: {'lr': 0.00015400075167832278, 'samples': 18157056, 'steps': 94567, 'loss/train': 1.4163274765014648} 11/07/2021 10:32:18 - INFO - __main__ - Step 94569: {'lr': 0.0001539958518006452, 'samples': 18157248, 'steps': 94568, 'loss/train': 1.5758799314498901} 11/07/2021 10:32:18 - INFO - __main__ - Step 94570: {'lr': 0.00015399095196622553, 'samples': 18157440, 'steps': 94569, 'loss/train': 1.5126103162765503} 11/07/2021 10:32:19 - INFO - __main__ - Step 94571: {'lr': 0.00015398605217506605, 'samples': 18157632, 'steps': 94570, 'loss/train': 1.302493691444397} 11/07/2021 10:32:19 - INFO - __main__ - Step 94572: {'lr': 0.000153981152427169, 'samples': 18157824, 'steps': 94571, 'loss/train': 1.7701061964035034} 11/07/2021 10:32:20 - INFO - __main__ - Step 94573: {'lr': 0.00015397625272253656, 'samples': 18158016, 'steps': 94572, 'loss/train': 1.3732186555862427} 11/07/2021 10:32:21 - INFO - __main__ - Step 94574: {'lr': 0.00015397135306117094, 'samples': 18158208, 'steps': 94573, 'loss/train': 1.4599010944366455} 11/07/2021 10:32:21 - INFO - __main__ - Step 94575: {'lr': 0.00015396645344307438, 'samples': 18158400, 'steps': 94574, 'loss/train': 1.2564828395843506} 11/07/2021 10:32:21 - INFO - __main__ - Step 94576: {'lr': 0.00015396155386824902, 'samples': 18158592, 'steps': 94575, 'loss/train': 0.10838284343481064} 11/07/2021 10:32:22 - INFO - __main__ - Step 94577: {'lr': 0.0001539566543366971, 'samples': 18158784, 'steps': 94576, 'loss/train': 1.0912528038024902} 11/07/2021 10:32:23 - INFO - __main__ - Step 94578: {'lr': 0.00015395175484842082, 'samples': 18158976, 'steps': 94577, 'loss/train': 1.5762499570846558} 11/07/2021 10:32:23 - INFO - __main__ - Step 94579: {'lr': 0.0001539468554034224, 'samples': 18159168, 'steps': 94578, 'loss/train': 1.1745665073394775} 11/07/2021 10:32:24 - INFO - __main__ - Step 94580: {'lr': 0.00015394195600170412, 'samples': 18159360, 'steps': 94579, 'loss/train': 1.4364941120147705} 11/07/2021 10:32:24 - INFO - __main__ - Step 94581: {'lr': 0.00015393705664326805, 'samples': 18159552, 'steps': 94580, 'loss/train': 1.344582200050354} 11/07/2021 10:32:24 - INFO - __main__ - Step 94582: {'lr': 0.00015393215732811645, 'samples': 18159744, 'steps': 94581, 'loss/train': 1.4863057136535645} 11/07/2021 10:32:25 - INFO - __main__ - Step 94583: {'lr': 0.00015392725805625152, 'samples': 18159936, 'steps': 94582, 'loss/train': 0.45318251848220825} 11/07/2021 10:32:26 - INFO - __main__ - Step 94584: {'lr': 0.00015392235882767552, 'samples': 18160128, 'steps': 94583, 'loss/train': 1.1413276195526123} 11/07/2021 10:32:26 - INFO - __main__ - Step 94585: {'lr': 0.0001539174596423906, 'samples': 18160320, 'steps': 94584, 'loss/train': 1.1188223361968994} 11/07/2021 10:32:26 - INFO - __main__ - Step 94586: {'lr': 0.00015391256050039897, 'samples': 18160512, 'steps': 94585, 'loss/train': 1.0155173540115356} 11/07/2021 10:32:27 - INFO - __main__ - Step 94587: {'lr': 0.00015390766140170289, 'samples': 18160704, 'steps': 94586, 'loss/train': 0.4918782413005829} 11/07/2021 10:32:28 - INFO - __main__ - Step 94588: {'lr': 0.00015390276234630455, 'samples': 18160896, 'steps': 94587, 'loss/train': 1.2935603857040405} 11/07/2021 10:32:28 - INFO - __main__ - Step 94589: {'lr': 0.00015389786333420616, 'samples': 18161088, 'steps': 94588, 'loss/train': 1.6418617963790894} 11/07/2021 10:32:28 - INFO - __main__ - Step 94590: {'lr': 0.0001538929643654099, 'samples': 18161280, 'steps': 94589, 'loss/train': 1.4514132738113403} 11/07/2021 10:32:29 - INFO - __main__ - Step 94591: {'lr': 0.00015388806543991797, 'samples': 18161472, 'steps': 94590, 'loss/train': 1.3196663856506348} 11/07/2021 10:32:29 - INFO - __main__ - Step 94592: {'lr': 0.0001538831665577326, 'samples': 18161664, 'steps': 94591, 'loss/train': 1.7670987844467163} 11/07/2021 10:32:30 - INFO - __main__ - Step 94593: {'lr': 0.00015387826771885596, 'samples': 18161856, 'steps': 94592, 'loss/train': 1.7553819417953491} 11/07/2021 10:32:31 - INFO - __main__ - Step 94594: {'lr': 0.00015387336892329028, 'samples': 18162048, 'steps': 94593, 'loss/train': 1.2194418907165527} 11/07/2021 10:32:31 - INFO - __main__ - Step 94595: {'lr': 0.00015386847017103783, 'samples': 18162240, 'steps': 94594, 'loss/train': 1.2796574831008911} 11/07/2021 10:32:31 - INFO - __main__ - Step 94596: {'lr': 0.00015386357146210072, 'samples': 18162432, 'steps': 94595, 'loss/train': 1.4697540998458862} 11/07/2021 10:32:32 - INFO - __main__ - Step 94597: {'lr': 0.00015385867279648125, 'samples': 18162624, 'steps': 94596, 'loss/train': 1.506603479385376} 11/07/2021 10:32:33 - INFO - __main__ - Step 94598: {'lr': 0.00015385377417418151, 'samples': 18162816, 'steps': 94597, 'loss/train': 1.247454047203064} 11/07/2021 10:32:33 - INFO - __main__ - Step 94599: {'lr': 0.00015384887559520384, 'samples': 18163008, 'steps': 94598, 'loss/train': 1.3268959522247314} 11/07/2021 10:32:33 - INFO - __main__ - Step 94600: {'lr': 0.00015384397705955034, 'samples': 18163200, 'steps': 94599, 'loss/train': 1.2675858736038208} 11/07/2021 10:32:34 - INFO - __main__ - Step 94601: {'lr': 0.00015383907856722327, 'samples': 18163392, 'steps': 94600, 'loss/train': 2.092703104019165} 11/07/2021 10:32:34 - INFO - __main__ - Step 94602: {'lr': 0.00015383418011822493, 'samples': 18163584, 'steps': 94601, 'loss/train': 1.0554817914962769} 11/07/2021 10:32:34 - INFO - __main__ - Step 94603: {'lr': 0.00015382928171255733, 'samples': 18163776, 'steps': 94602, 'loss/train': 1.0820814371109009} 11/07/2021 10:32:35 - INFO - __main__ - Step 94604: {'lr': 0.00015382438335022276, 'samples': 18163968, 'steps': 94603, 'loss/train': 1.3278779983520508} 11/07/2021 10:32:36 - INFO - __main__ - Step 94605: {'lr': 0.00015381948503122346, 'samples': 18164160, 'steps': 94604, 'loss/train': 1.3753365278244019} 11/07/2021 10:32:36 - INFO - __main__ - Step 94606: {'lr': 0.0001538145867555616, 'samples': 18164352, 'steps': 94605, 'loss/train': 1.277791976928711} 11/07/2021 10:32:36 - INFO - __main__ - Step 94607: {'lr': 0.0001538096885232394, 'samples': 18164544, 'steps': 94606, 'loss/train': 1.0313178300857544} 11/07/2021 10:32:37 - INFO - __main__ - Step 94608: {'lr': 0.00015380479033425906, 'samples': 18164736, 'steps': 94607, 'loss/train': 1.7715847492218018} 11/07/2021 10:32:38 - INFO - __main__ - Step 94609: {'lr': 0.00015379989218862282, 'samples': 18164928, 'steps': 94608, 'loss/train': 0.7301117181777954} 11/07/2021 10:32:38 - INFO - __main__ - Step 94610: {'lr': 0.00015379499408633285, 'samples': 18165120, 'steps': 94609, 'loss/train': 1.2494871616363525} 11/07/2021 10:32:39 - INFO - __main__ - Step 94611: {'lr': 0.00015379009602739136, 'samples': 18165312, 'steps': 94610, 'loss/train': 0.45505398511886597} 11/07/2021 10:32:39 - INFO - __main__ - Step 94612: {'lr': 0.0001537851980118006, 'samples': 18165504, 'steps': 94611, 'loss/train': 1.6626660823822021} 11/07/2021 10:32:39 - INFO - __main__ - Step 94613: {'lr': 0.00015378030003956273, 'samples': 18165696, 'steps': 94612, 'loss/train': 1.4714813232421875} 11/07/2021 10:32:40 - INFO - __main__ - Step 94614: {'lr': 0.00015377540211067997, 'samples': 18165888, 'steps': 94613, 'loss/train': 1.521759271621704} 11/07/2021 10:32:41 - INFO - __main__ - Step 94615: {'lr': 0.00015377050422515454, 'samples': 18166080, 'steps': 94614, 'loss/train': 1.4985712766647339} 11/07/2021 10:32:41 - INFO - __main__ - Step 94616: {'lr': 0.00015376560638298873, 'samples': 18166272, 'steps': 94615, 'loss/train': 1.5686906576156616} 11/07/2021 10:32:41 - INFO - __main__ - Step 94617: {'lr': 0.00015376070858418454, 'samples': 18166464, 'steps': 94616, 'loss/train': 0.16943322122097015} 11/07/2021 10:32:42 - INFO - __main__ - Step 94618: {'lr': 0.00015375581082874428, 'samples': 18166656, 'steps': 94617, 'loss/train': 1.7516815662384033} 11/07/2021 10:32:43 - INFO - __main__ - Step 94619: {'lr': 0.00015375091311667022, 'samples': 18166848, 'steps': 94618, 'loss/train': 0.9955852031707764} 11/07/2021 10:32:43 - INFO - __main__ - Step 94620: {'lr': 0.00015374601544796446, 'samples': 18167040, 'steps': 94619, 'loss/train': 1.0180177688598633} 11/07/2021 10:32:43 - INFO - __main__ - Step 94621: {'lr': 0.00015374111782262927, 'samples': 18167232, 'steps': 94620, 'loss/train': 0.8503137230873108} 11/07/2021 10:32:44 - INFO - __main__ - Step 94622: {'lr': 0.00015373622024066687, 'samples': 18167424, 'steps': 94621, 'loss/train': 0.9812050461769104} 11/07/2021 10:32:44 - INFO - __main__ - Step 94623: {'lr': 0.00015373132270207944, 'samples': 18167616, 'steps': 94622, 'loss/train': 2.6869795322418213} 11/07/2021 10:32:45 - INFO - __main__ - Step 94624: {'lr': 0.00015372642520686917, 'samples': 18167808, 'steps': 94623, 'loss/train': 1.2941384315490723} 11/07/2021 10:32:46 - INFO - __main__ - Step 94625: {'lr': 0.0001537215277550383, 'samples': 18168000, 'steps': 94624, 'loss/train': 1.4012938737869263} 11/07/2021 10:32:46 - INFO - __main__ - Step 94626: {'lr': 0.000153716630346589, 'samples': 18168192, 'steps': 94625, 'loss/train': 1.058111310005188} 11/07/2021 10:32:46 - INFO - __main__ - Step 94627: {'lr': 0.00015371173298152352, 'samples': 18168384, 'steps': 94626, 'loss/train': 1.2366130352020264} 11/07/2021 10:32:47 - INFO - __main__ - Step 94628: {'lr': 0.00015370683565984407, 'samples': 18168576, 'steps': 94627, 'loss/train': 1.2040283679962158} 11/07/2021 10:32:48 - INFO - __main__ - Step 94629: {'lr': 0.00015370193838155292, 'samples': 18168768, 'steps': 94628, 'loss/train': 0.8591702580451965} 11/07/2021 10:32:48 - INFO - __main__ - Step 94630: {'lr': 0.00015369704114665206, 'samples': 18168960, 'steps': 94629, 'loss/train': 0.7579891085624695} 11/07/2021 10:32:48 - INFO - __main__ - Step 94631: {'lr': 0.00015369214395514387, 'samples': 18169152, 'steps': 94630, 'loss/train': 0.8045276999473572} 11/07/2021 10:32:49 - INFO - __main__ - Step 94632: {'lr': 0.0001536872468070305, 'samples': 18169344, 'steps': 94631, 'loss/train': 0.9982059001922607} 11/07/2021 10:32:49 - INFO - __main__ - Step 94633: {'lr': 0.00015368234970231415, 'samples': 18169536, 'steps': 94632, 'loss/train': 1.1420464515686035} 11/07/2021 10:32:50 - INFO - __main__ - Step 94634: {'lr': 0.00015367745264099707, 'samples': 18169728, 'steps': 94633, 'loss/train': 1.894697666168213} 11/07/2021 10:32:51 - INFO - __main__ - Step 94635: {'lr': 0.00015367255562308141, 'samples': 18169920, 'steps': 94634, 'loss/train': 0.6031219959259033} 11/07/2021 10:32:51 - INFO - __main__ - Step 94636: {'lr': 0.00015366765864856946, 'samples': 18170112, 'steps': 94635, 'loss/train': 1.3298453092575073} 11/07/2021 10:32:51 - INFO - __main__ - Step 94637: {'lr': 0.00015366276171746335, 'samples': 18170304, 'steps': 94636, 'loss/train': 0.6896917819976807} 11/07/2021 10:32:52 - INFO - __main__ - Step 94638: {'lr': 0.0001536578648297653, 'samples': 18170496, 'steps': 94637, 'loss/train': 1.3058550357818604} 11/07/2021 10:32:52 - INFO - __main__ - Step 94639: {'lr': 0.00015365296798547755, 'samples': 18170688, 'steps': 94638, 'loss/train': 1.8434875011444092} 11/07/2021 10:32:53 - INFO - __main__ - Step 94640: {'lr': 0.00015364807118460228, 'samples': 18170880, 'steps': 94639, 'loss/train': 0.6093666553497314} 11/07/2021 10:32:53 - INFO - __main__ - Step 94641: {'lr': 0.0001536431744271417, 'samples': 18171072, 'steps': 94640, 'loss/train': 1.7795796394348145} 11/07/2021 10:32:54 - INFO - __main__ - Step 94642: {'lr': 0.000153638277713098, 'samples': 18171264, 'steps': 94641, 'loss/train': 1.2426568269729614} 11/07/2021 10:32:54 - INFO - __main__ - Step 94643: {'lr': 0.00015363338104247353, 'samples': 18171456, 'steps': 94642, 'loss/train': 1.21094810962677} 11/07/2021 10:32:54 - INFO - __main__ - Step 94644: {'lr': 0.00015362848441527027, 'samples': 18171648, 'steps': 94643, 'loss/train': 1.6354868412017822} 11/07/2021 10:32:55 - INFO - __main__ - Step 94645: {'lr': 0.00015362358783149055, 'samples': 18171840, 'steps': 94644, 'loss/train': 1.171695590019226} 11/07/2021 10:32:56 - INFO - __main__ - Step 94646: {'lr': 0.00015361869129113654, 'samples': 18172032, 'steps': 94645, 'loss/train': 1.0304428339004517} 11/07/2021 10:32:56 - INFO - __main__ - Step 94647: {'lr': 0.00015361379479421046, 'samples': 18172224, 'steps': 94646, 'loss/train': 0.5014593601226807} 11/07/2021 10:32:56 - INFO - __main__ - Step 94648: {'lr': 0.00015360889834071452, 'samples': 18172416, 'steps': 94647, 'loss/train': 1.370827078819275} 11/07/2021 10:32:57 - INFO - __main__ - Step 94649: {'lr': 0.00015360400193065087, 'samples': 18172608, 'steps': 94648, 'loss/train': 1.7214049100875854} 11/07/2021 10:32:58 - INFO - __main__ - Step 94650: {'lr': 0.00015359910556402183, 'samples': 18172800, 'steps': 94649, 'loss/train': 0.9766652584075928} 11/07/2021 10:32:58 - INFO - __main__ - Step 94651: {'lr': 0.0001535942092408295, 'samples': 18172992, 'steps': 94650, 'loss/train': 1.3735994100570679} 11/07/2021 10:32:59 - INFO - __main__ - Step 94652: {'lr': 0.00015358931296107617, 'samples': 18173184, 'steps': 94651, 'loss/train': 1.3039087057113647} 11/07/2021 10:32:59 - INFO - __main__ - Step 94653: {'lr': 0.00015358441672476398, 'samples': 18173376, 'steps': 94652, 'loss/train': 1.4708718061447144} 11/07/2021 10:33:00 - INFO - __main__ - Step 94654: {'lr': 0.0001535795205318952, 'samples': 18173568, 'steps': 94653, 'loss/train': 0.09880093485116959} 11/07/2021 10:33:01 - INFO - __main__ - Step 94655: {'lr': 0.00015357462438247196, 'samples': 18173760, 'steps': 94654, 'loss/train': 0.578840434551239} 11/07/2021 10:33:01 - INFO - __main__ - Step 94656: {'lr': 0.0001535697282764966, 'samples': 18173952, 'steps': 94655, 'loss/train': 1.0709969997406006} 11/07/2021 10:33:01 - INFO - __main__ - Step 94657: {'lr': 0.00015356483221397118, 'samples': 18174144, 'steps': 94656, 'loss/train': 1.4789283275604248} 11/07/2021 10:33:02 - INFO - __main__ - Step 94658: {'lr': 0.00015355993619489794, 'samples': 18174336, 'steps': 94657, 'loss/train': 1.420148491859436} 11/07/2021 10:33:02 - INFO - __main__ - Step 94659: {'lr': 0.00015355504021927912, 'samples': 18174528, 'steps': 94658, 'loss/train': 1.379859209060669} 11/07/2021 10:33:03 - INFO - __main__ - Step 94660: {'lr': 0.0001535501442871169, 'samples': 18174720, 'steps': 94659, 'loss/train': 1.4330389499664307} 11/07/2021 10:33:03 - INFO - __main__ - Step 94661: {'lr': 0.00015354524839841346, 'samples': 18174912, 'steps': 94660, 'loss/train': 1.297045111656189} 11/07/2021 10:33:04 - INFO - __main__ - Step 94662: {'lr': 0.00015354035255317106, 'samples': 18175104, 'steps': 94661, 'loss/train': 1.7757058143615723} 11/07/2021 10:33:04 - INFO - __main__ - Step 94663: {'lr': 0.00015353545675139192, 'samples': 18175296, 'steps': 94662, 'loss/train': 1.5695812702178955} 11/07/2021 10:33:04 - INFO - __main__ - Step 94664: {'lr': 0.0001535305609930782, 'samples': 18175488, 'steps': 94663, 'loss/train': 1.4220609664916992} 11/07/2021 10:33:05 - INFO - __main__ - Step 94665: {'lr': 0.00015352566527823209, 'samples': 18175680, 'steps': 94664, 'loss/train': 1.2609889507293701} 11/07/2021 10:33:06 - INFO - __main__ - Step 94666: {'lr': 0.00015352076960685584, 'samples': 18175872, 'steps': 94665, 'loss/train': 0.8783234357833862} 11/07/2021 10:33:06 - INFO - __main__ - Step 94667: {'lr': 0.00015351587397895167, 'samples': 18176064, 'steps': 94666, 'loss/train': 1.1676150560379028} 11/07/2021 10:33:06 - INFO - __main__ - Step 94668: {'lr': 0.00015351097839452177, 'samples': 18176256, 'steps': 94667, 'loss/train': 1.2376881837844849} 11/07/2021 10:33:07 - INFO - __main__ - Step 94669: {'lr': 0.0001535060828535683, 'samples': 18176448, 'steps': 94668, 'loss/train': 1.153954267501831} 11/07/2021 10:33:07 - INFO - __main__ - Step 94670: {'lr': 0.0001535011873560936, 'samples': 18176640, 'steps': 94669, 'loss/train': 1.190148115158081} 11/07/2021 10:33:08 - INFO - __main__ - Step 94671: {'lr': 0.0001534962919020997, 'samples': 18176832, 'steps': 94670, 'loss/train': 1.550238847732544} 11/07/2021 10:33:09 - INFO - __main__ - Step 94672: {'lr': 0.00015349139649158884, 'samples': 18177024, 'steps': 94671, 'loss/train': 1.2617875337600708} 11/07/2021 10:33:09 - INFO - __main__ - Step 94673: {'lr': 0.0001534865011245633, 'samples': 18177216, 'steps': 94672, 'loss/train': 1.3660541772842407} 11/07/2021 10:33:09 - INFO - __main__ - Step 94674: {'lr': 0.00015348160580102525, 'samples': 18177408, 'steps': 94673, 'loss/train': 1.088426113128662} 11/07/2021 10:33:10 - INFO - __main__ - Step 94675: {'lr': 0.0001534767105209769, 'samples': 18177600, 'steps': 94674, 'loss/train': 1.0764501094818115} 11/07/2021 10:33:11 - INFO - __main__ - Step 94676: {'lr': 0.00015347181528442045, 'samples': 18177792, 'steps': 94675, 'loss/train': 1.4656345844268799} 11/07/2021 10:33:11 - INFO - __main__ - Step 94677: {'lr': 0.0001534669200913581, 'samples': 18177984, 'steps': 94676, 'loss/train': 1.3919639587402344} 11/07/2021 10:33:11 - INFO - __main__ - Step 94678: {'lr': 0.00015346202494179206, 'samples': 18178176, 'steps': 94677, 'loss/train': 1.482280969619751} 11/07/2021 10:33:12 - INFO - __main__ - Step 94679: {'lr': 0.00015345712983572457, 'samples': 18178368, 'steps': 94678, 'loss/train': 1.685915231704712} 11/07/2021 10:33:12 - INFO - __main__ - Step 94680: {'lr': 0.00015345223477315778, 'samples': 18178560, 'steps': 94679, 'loss/train': 1.3416879177093506} 11/07/2021 10:33:13 - INFO - __main__ - Step 94681: {'lr': 0.00015344733975409397, 'samples': 18178752, 'steps': 94680, 'loss/train': 1.3261324167251587} 11/07/2021 10:33:14 - INFO - __main__ - Step 94682: {'lr': 0.00015344244477853532, 'samples': 18178944, 'steps': 94681, 'loss/train': 5.718120574951172} 11/07/2021 10:33:14 - INFO - __main__ - Step 94683: {'lr': 0.00015343754984648396, 'samples': 18179136, 'steps': 94682, 'loss/train': 1.4221349954605103} 11/07/2021 10:33:14 - INFO - __main__ - Step 94684: {'lr': 0.0001534326549579422, 'samples': 18179328, 'steps': 94683, 'loss/train': 1.742186188697815} 11/07/2021 10:33:15 - INFO - __main__ - Step 94685: {'lr': 0.0001534277601129121, 'samples': 18179520, 'steps': 94684, 'loss/train': 1.0874239206314087} 11/07/2021 10:33:16 - INFO - __main__ - Step 94686: {'lr': 0.00015342286531139603, 'samples': 18179712, 'steps': 94685, 'loss/train': 1.2687413692474365} 11/07/2021 10:33:16 - INFO - __main__ - Step 94687: {'lr': 0.0001534179705533961, 'samples': 18179904, 'steps': 94686, 'loss/train': 1.0369904041290283} 11/07/2021 10:33:16 - INFO - __main__ - Step 94688: {'lr': 0.00015341307583891455, 'samples': 18180096, 'steps': 94687, 'loss/train': 1.4531841278076172} 11/07/2021 10:33:17 - INFO - __main__ - Step 94689: {'lr': 0.00015340818116795358, 'samples': 18180288, 'steps': 94688, 'loss/train': 1.3460439443588257} 11/07/2021 10:33:17 - INFO - __main__ - Step 94690: {'lr': 0.0001534032865405154, 'samples': 18180480, 'steps': 94689, 'loss/train': 1.1436535120010376} 11/07/2021 10:33:18 - INFO - __main__ - Step 94691: {'lr': 0.00015339839195660217, 'samples': 18180672, 'steps': 94690, 'loss/train': 1.418200135231018} 11/07/2021 10:33:19 - INFO - __main__ - Step 94692: {'lr': 0.00015339349741621622, 'samples': 18180864, 'steps': 94691, 'loss/train': 1.4328364133834839} 11/07/2021 10:33:19 - INFO - __main__ - Step 94693: {'lr': 0.0001533886029193596, 'samples': 18181056, 'steps': 94692, 'loss/train': 0.7134424448013306} 11/07/2021 10:33:19 - INFO - __main__ - Step 94694: {'lr': 0.0001533837084660346, 'samples': 18181248, 'steps': 94693, 'loss/train': 0.9635443091392517} 11/07/2021 10:33:20 - INFO - __main__ - Step 94695: {'lr': 0.0001533788140562434, 'samples': 18181440, 'steps': 94694, 'loss/train': 0.2451763153076172} 11/07/2021 10:33:21 - INFO - __main__ - Step 94696: {'lr': 0.00015337391968998825, 'samples': 18181632, 'steps': 94695, 'loss/train': 1.0906835794448853} 11/07/2021 10:33:21 - INFO - __main__ - Step 94697: {'lr': 0.00015336902536727131, 'samples': 18181824, 'steps': 94696, 'loss/train': 0.8112925291061401} 11/07/2021 10:33:21 - INFO - __main__ - Step 94698: {'lr': 0.00015336413108809477, 'samples': 18182016, 'steps': 94697, 'loss/train': 1.34029221534729} 11/07/2021 10:33:22 - INFO - __main__ - Step 94699: {'lr': 0.00015335923685246087, 'samples': 18182208, 'steps': 94698, 'loss/train': 0.2002914547920227} 11/07/2021 10:33:22 - INFO - __main__ - Step 94700: {'lr': 0.00015335434266037178, 'samples': 18182400, 'steps': 94699, 'loss/train': 1.4260718822479248} 11/07/2021 10:33:22 - INFO - __main__ - Step 94701: {'lr': 0.00015334944851182978, 'samples': 18182592, 'steps': 94700, 'loss/train': 1.2998921871185303} 11/07/2021 10:33:23 - INFO - __main__ - Step 94702: {'lr': 0.000153344554406837, 'samples': 18182784, 'steps': 94701, 'loss/train': 1.2472622394561768} 11/07/2021 10:33:24 - INFO - __main__ - Step 94703: {'lr': 0.00015333966034539575, 'samples': 18182976, 'steps': 94702, 'loss/train': 1.2698659896850586} 11/07/2021 10:33:24 - INFO - __main__ - Step 94704: {'lr': 0.00015333476632750808, 'samples': 18183168, 'steps': 94703, 'loss/train': 1.9904563426971436} 11/07/2021 10:33:24 - INFO - __main__ - Step 94705: {'lr': 0.00015332987235317625, 'samples': 18183360, 'steps': 94704, 'loss/train': 1.5074502229690552} 11/07/2021 10:33:25 - INFO - __main__ - Step 94706: {'lr': 0.00015332497842240252, 'samples': 18183552, 'steps': 94705, 'loss/train': 1.3713760375976562} 11/07/2021 10:33:26 - INFO - __main__ - Step 94707: {'lr': 0.00015332008453518902, 'samples': 18183744, 'steps': 94706, 'loss/train': 1.2809722423553467} 11/07/2021 10:33:26 - INFO - __main__ - Step 94708: {'lr': 0.00015331519069153806, 'samples': 18183936, 'steps': 94707, 'loss/train': 1.1010411977767944} 11/07/2021 10:33:27 - INFO - __main__ - Step 94709: {'lr': 0.00015331029689145175, 'samples': 18184128, 'steps': 94708, 'loss/train': 1.1409239768981934} 11/07/2021 10:33:27 - INFO - __main__ - Step 94710: {'lr': 0.0001533054031349324, 'samples': 18184320, 'steps': 94709, 'loss/train': 1.5480363368988037} 11/07/2021 10:33:27 - INFO - __main__ - Step 94711: {'lr': 0.0001533005094219821, 'samples': 18184512, 'steps': 94710, 'loss/train': 1.3966606855392456} 11/07/2021 10:33:28 - INFO - __main__ - Step 94712: {'lr': 0.00015329561575260303, 'samples': 18184704, 'steps': 94711, 'loss/train': 1.4765396118164062} 11/07/2021 10:33:29 - INFO - __main__ - Step 94713: {'lr': 0.00015329072212679753, 'samples': 18184896, 'steps': 94712, 'loss/train': 1.640913486480713} 11/07/2021 10:33:29 - INFO - __main__ - Step 94714: {'lr': 0.00015328582854456777, 'samples': 18185088, 'steps': 94713, 'loss/train': 1.5753816366195679} 11/07/2021 10:33:29 - INFO - __main__ - Step 94715: {'lr': 0.0001532809350059159, 'samples': 18185280, 'steps': 94714, 'loss/train': 1.3449219465255737} 11/07/2021 10:33:30 - INFO - __main__ - Step 94716: {'lr': 0.0001532760415108441, 'samples': 18185472, 'steps': 94715, 'loss/train': 1.3522682189941406} 11/07/2021 10:33:31 - INFO - __main__ - Step 94717: {'lr': 0.00015327114805935464, 'samples': 18185664, 'steps': 94716, 'loss/train': 1.3153789043426514} 11/07/2021 10:33:31 - INFO - __main__ - Step 94718: {'lr': 0.0001532662546514497, 'samples': 18185856, 'steps': 94717, 'loss/train': 1.5357416868209839} 11/07/2021 10:33:31 - INFO - __main__ - Step 94719: {'lr': 0.00015326136128713153, 'samples': 18186048, 'steps': 94718, 'loss/train': 1.641395092010498} 11/07/2021 10:33:32 - INFO - __main__ - Step 94720: {'lr': 0.00015325646796640225, 'samples': 18186240, 'steps': 94719, 'loss/train': 1.3220727443695068} 11/07/2021 10:33:32 - INFO - __main__ - Step 94721: {'lr': 0.00015325157468926414, 'samples': 18186432, 'steps': 94720, 'loss/train': 1.6459766626358032} 11/07/2021 10:33:33 - INFO - __main__ - Step 94722: {'lr': 0.00015324668145571936, 'samples': 18186624, 'steps': 94721, 'loss/train': 0.9667590856552124} 11/07/2021 10:33:33 - INFO - __main__ - Step 94723: {'lr': 0.0001532417882657702, 'samples': 18186816, 'steps': 94722, 'loss/train': 0.40461570024490356} 11/07/2021 10:33:34 - INFO - __main__ - Step 94724: {'lr': 0.00015323689511941875, 'samples': 18187008, 'steps': 94723, 'loss/train': 1.2521525621414185} 11/07/2021 10:33:34 - INFO - __main__ - Step 94725: {'lr': 0.00015323200201666732, 'samples': 18187200, 'steps': 94724, 'loss/train': 1.4660474061965942} 11/07/2021 10:33:35 - INFO - __main__ - Step 94726: {'lr': 0.00015322710895751795, 'samples': 18187392, 'steps': 94725, 'loss/train': 1.4039674997329712} 11/07/2021 10:33:35 - INFO - __main__ - Step 94727: {'lr': 0.000153222215941973, 'samples': 18187584, 'steps': 94726, 'loss/train': 1.5120385885238647} 11/07/2021 10:33:36 - INFO - __main__ - Step 94728: {'lr': 0.00015321732297003462, 'samples': 18187776, 'steps': 94727, 'loss/train': 1.2414886951446533} 11/07/2021 10:33:36 - INFO - __main__ - Step 94729: {'lr': 0.00015321243004170506, 'samples': 18187968, 'steps': 94728, 'loss/train': 0.5220288038253784} 11/07/2021 10:33:37 - INFO - __main__ - Step 94730: {'lr': 0.00015320753715698644, 'samples': 18188160, 'steps': 94729, 'loss/train': 1.4793201684951782} 11/07/2021 10:33:37 - INFO - __main__ - Step 94731: {'lr': 0.000153202644315881, 'samples': 18188352, 'steps': 94730, 'loss/train': 1.1701823472976685} 11/07/2021 10:33:37 - INFO - __main__ - Step 94732: {'lr': 0.00015319775151839094, 'samples': 18188544, 'steps': 94731, 'loss/train': 1.1604825258255005} 11/07/2021 10:33:38 - INFO - __main__ - Step 94733: {'lr': 0.00015319285876451853, 'samples': 18188736, 'steps': 94732, 'loss/train': 1.4114402532577515} 11/07/2021 10:33:39 - INFO - __main__ - Step 94734: {'lr': 0.00015318796605426588, 'samples': 18188928, 'steps': 94733, 'loss/train': 1.129739761352539} 11/07/2021 10:33:39 - INFO - __main__ - Step 94735: {'lr': 0.00015318307338763526, 'samples': 18189120, 'steps': 94734, 'loss/train': 1.8756712675094604} 11/07/2021 10:33:40 - INFO - __main__ - Step 94736: {'lr': 0.00015317818076462887, 'samples': 18189312, 'steps': 94735, 'loss/train': 1.2366151809692383} 11/07/2021 10:33:40 - INFO - __main__ - Step 94737: {'lr': 0.000153173288185249, 'samples': 18189504, 'steps': 94736, 'loss/train': 1.4354352951049805} 11/07/2021 10:33:41 - INFO - __main__ - Step 94738: {'lr': 0.00015316839564949764, 'samples': 18189696, 'steps': 94737, 'loss/train': 1.4949493408203125} 11/07/2021 10:33:41 - INFO - __main__ - Step 94739: {'lr': 0.0001531635031573771, 'samples': 18189888, 'steps': 94738, 'loss/train': 1.1022835969924927} 11/07/2021 10:33:42 - INFO - __main__ - Step 94740: {'lr': 0.0001531586107088896, 'samples': 18190080, 'steps': 94739, 'loss/train': 1.0874288082122803} 11/07/2021 10:33:42 - INFO - __main__ - Step 94741: {'lr': 0.0001531537183040373, 'samples': 18190272, 'steps': 94740, 'loss/train': 1.2237378358840942} 11/07/2021 10:33:42 - INFO - __main__ - Step 94742: {'lr': 0.00015314882594282247, 'samples': 18190464, 'steps': 94741, 'loss/train': 1.7886266708374023} 11/07/2021 10:33:43 - INFO - __main__ - Step 94743: {'lr': 0.0001531439336252473, 'samples': 18190656, 'steps': 94742, 'loss/train': 0.9303444027900696} 11/07/2021 10:33:44 - INFO - __main__ - Step 94744: {'lr': 0.00015313904135131395, 'samples': 18190848, 'steps': 94743, 'loss/train': 1.2809815406799316} 11/07/2021 10:33:44 - INFO - __main__ - Step 94745: {'lr': 0.00015313414912102464, 'samples': 18191040, 'steps': 94744, 'loss/train': 1.3985487222671509} 11/07/2021 10:33:44 - INFO - __main__ - Step 94746: {'lr': 0.00015312925693438162, 'samples': 18191232, 'steps': 94745, 'loss/train': 1.2727686166763306} 11/07/2021 10:33:45 - INFO - __main__ - Step 94747: {'lr': 0.00015312436479138705, 'samples': 18191424, 'steps': 94746, 'loss/train': 1.1720736026763916} 11/07/2021 10:33:46 - INFO - __main__ - Step 94748: {'lr': 0.00015311947269204315, 'samples': 18191616, 'steps': 94747, 'loss/train': 1.4362972974777222} 11/07/2021 10:33:46 - INFO - __main__ - Step 94749: {'lr': 0.00015311458063635213, 'samples': 18191808, 'steps': 94748, 'loss/train': 1.2518460750579834} 11/07/2021 10:33:47 - INFO - __main__ - Step 94750: {'lr': 0.0001531096886243163, 'samples': 18192000, 'steps': 94749, 'loss/train': 1.3844761848449707} 11/07/2021 10:33:47 - INFO - __main__ - Step 94751: {'lr': 0.0001531047966559376, 'samples': 18192192, 'steps': 94750, 'loss/train': 1.8260971307754517} 11/07/2021 10:33:47 - INFO - __main__ - Step 94752: {'lr': 0.00015309990473121843, 'samples': 18192384, 'steps': 94751, 'loss/train': 1.862185001373291} 11/07/2021 10:33:48 - INFO - __main__ - Step 94753: {'lr': 0.00015309501285016091, 'samples': 18192576, 'steps': 94752, 'loss/train': 1.0346759557724} 11/07/2021 10:33:49 - INFO - __main__ - Step 94754: {'lr': 0.0001530901210127673, 'samples': 18192768, 'steps': 94753, 'loss/train': 0.5253903269767761} 11/07/2021 10:33:49 - INFO - __main__ - Step 94755: {'lr': 0.0001530852292190398, 'samples': 18192960, 'steps': 94754, 'loss/train': 1.171609878540039} 11/07/2021 10:33:49 - INFO - __main__ - Step 94756: {'lr': 0.00015308033746898057, 'samples': 18193152, 'steps': 94755, 'loss/train': 1.3201240301132202} 11/07/2021 10:33:50 - INFO - __main__ - Step 94757: {'lr': 0.00015307544576259187, 'samples': 18193344, 'steps': 94756, 'loss/train': 1.6373051404953003} 11/07/2021 10:33:50 - INFO - __main__ - Step 94758: {'lr': 0.00015307055409987587, 'samples': 18193536, 'steps': 94757, 'loss/train': 1.401605486869812} 11/07/2021 10:33:51 - INFO - __main__ - Step 94759: {'lr': 0.00015306566248083476, 'samples': 18193728, 'steps': 94758, 'loss/train': 1.3286538124084473} 11/07/2021 10:33:51 - INFO - __main__ - Step 94760: {'lr': 0.00015306077090547078, 'samples': 18193920, 'steps': 94759, 'loss/train': 2.9388699531555176} 11/07/2021 10:33:52 - INFO - __main__ - Step 94761: {'lr': 0.00015305587937378611, 'samples': 18194112, 'steps': 94760, 'loss/train': 1.1999794244766235} 11/07/2021 10:33:52 - INFO - __main__ - Step 94762: {'lr': 0.000153050987885783, 'samples': 18194304, 'steps': 94761, 'loss/train': 0.8450825214385986} 11/07/2021 10:33:52 - INFO - __main__ - Step 94763: {'lr': 0.00015304609644146362, 'samples': 18194496, 'steps': 94762, 'loss/train': 1.0443590879440308} 11/07/2021 10:33:54 - INFO - __main__ - Step 94764: {'lr': 0.00015304120504083024, 'samples': 18194688, 'steps': 94763, 'loss/train': 1.3383946418762207} 11/07/2021 10:33:54 - INFO - __main__ - Step 94765: {'lr': 0.00015303631368388494, 'samples': 18194880, 'steps': 94764, 'loss/train': 0.9874789714813232} 11/07/2021 10:33:54 - INFO - __main__ - Step 94766: {'lr': 0.00015303142237062996, 'samples': 18195072, 'steps': 94765, 'loss/train': 1.2474334239959717} 11/07/2021 10:33:55 - INFO - __main__ - Step 94767: {'lr': 0.00015302653110106748, 'samples': 18195264, 'steps': 94766, 'loss/train': 1.3140727281570435} 11/07/2021 10:33:55 - INFO - __main__ - Step 94768: {'lr': 0.0001530216398751998, 'samples': 18195456, 'steps': 94767, 'loss/train': 0.39711007475852966} 11/07/2021 10:33:56 - INFO - __main__ - Step 94769: {'lr': 0.00015301674869302906, 'samples': 18195648, 'steps': 94768, 'loss/train': 1.755843162536621} 11/07/2021 10:33:56 - INFO - __main__ - Step 94770: {'lr': 0.00015301185755455746, 'samples': 18195840, 'steps': 94769, 'loss/train': 1.2750177383422852} 11/07/2021 10:33:57 - INFO - __main__ - Step 94771: {'lr': 0.00015300696645978725, 'samples': 18196032, 'steps': 94770, 'loss/train': 1.864477515220642} 11/07/2021 10:33:57 - INFO - __main__ - Step 94772: {'lr': 0.00015300207540872056, 'samples': 18196224, 'steps': 94771, 'loss/train': 0.9119406342506409} 11/07/2021 10:33:57 - INFO - __main__ - Step 94773: {'lr': 0.00015299718440135967, 'samples': 18196416, 'steps': 94772, 'loss/train': 2.0043301582336426} 11/07/2021 10:33:58 - INFO - __main__ - Step 94774: {'lr': 0.00015299229343770677, 'samples': 18196608, 'steps': 94773, 'loss/train': 1.6882483959197998} 11/07/2021 10:33:59 - INFO - __main__ - Step 94775: {'lr': 0.00015298740251776398, 'samples': 18196800, 'steps': 94774, 'loss/train': 1.5068120956420898} 11/07/2021 10:33:59 - INFO - __main__ - Step 94776: {'lr': 0.00015298251164153366, 'samples': 18196992, 'steps': 94775, 'loss/train': 1.0908368825912476} 11/07/2021 10:34:00 - INFO - __main__ - Step 94777: {'lr': 0.00015297762080901799, 'samples': 18197184, 'steps': 94776, 'loss/train': 1.1135618686676025} 11/07/2021 10:34:00 - INFO - __main__ - Step 94778: {'lr': 0.00015297273002021897, 'samples': 18197376, 'steps': 94777, 'loss/train': 1.2352596521377563} 11/07/2021 10:34:00 - INFO - __main__ - Step 94779: {'lr': 0.00015296783927513897, 'samples': 18197568, 'steps': 94778, 'loss/train': 1.0898089408874512} 11/07/2021 10:34:01 - INFO - __main__ - Step 94780: {'lr': 0.00015296294857378016, 'samples': 18197760, 'steps': 94779, 'loss/train': 1.1395021677017212} 11/07/2021 10:34:02 - INFO - __main__ - Step 94781: {'lr': 0.00015295805791614475, 'samples': 18197952, 'steps': 94780, 'loss/train': 1.2668333053588867} 11/07/2021 10:34:02 - INFO - __main__ - Step 94782: {'lr': 0.00015295316730223494, 'samples': 18198144, 'steps': 94781, 'loss/train': 1.164728045463562} 11/07/2021 10:34:02 - INFO - __main__ - Step 94783: {'lr': 0.0001529482767320529, 'samples': 18198336, 'steps': 94782, 'loss/train': 1.248017430305481} 11/07/2021 10:34:03 - INFO - __main__ - Step 94784: {'lr': 0.00015294338620560095, 'samples': 18198528, 'steps': 94783, 'loss/train': 1.3817580938339233} 11/07/2021 10:34:04 - INFO - __main__ - Step 94785: {'lr': 0.00015293849572288115, 'samples': 18198720, 'steps': 94784, 'loss/train': 0.07827561348676682} 11/07/2021 10:34:04 - INFO - __main__ - Step 94786: {'lr': 0.00015293360528389577, 'samples': 18198912, 'steps': 94785, 'loss/train': 1.4219245910644531} 11/07/2021 10:34:04 - INFO - __main__ - Step 94787: {'lr': 0.00015292871488864702, 'samples': 18199104, 'steps': 94786, 'loss/train': 1.3215702772140503} 11/07/2021 10:34:05 - INFO - __main__ - Step 94788: {'lr': 0.0001529238245371371, 'samples': 18199296, 'steps': 94787, 'loss/train': 1.695469856262207} 11/07/2021 10:34:05 - INFO - __main__ - Step 94789: {'lr': 0.0001529189342293682, 'samples': 18199488, 'steps': 94788, 'loss/train': 1.4594858884811401} 11/07/2021 10:34:06 - INFO - __main__ - Step 94790: {'lr': 0.00015291404396534252, 'samples': 18199680, 'steps': 94789, 'loss/train': 0.9807434678077698} 11/07/2021 10:34:07 - INFO - __main__ - Step 94791: {'lr': 0.0001529091537450624, 'samples': 18199872, 'steps': 94790, 'loss/train': 0.3221157193183899} 11/07/2021 10:34:07 - INFO - __main__ - Step 94792: {'lr': 0.0001529042635685298, 'samples': 18200064, 'steps': 94791, 'loss/train': 1.5563205480575562} 11/07/2021 10:34:07 - INFO - __main__ - Step 94793: {'lr': 0.00015289937343574705, 'samples': 18200256, 'steps': 94792, 'loss/train': 1.2569979429244995} 11/07/2021 10:34:08 - INFO - __main__ - Step 94794: {'lr': 0.00015289448334671632, 'samples': 18200448, 'steps': 94793, 'loss/train': 1.6195731163024902} 11/07/2021 10:34:09 - INFO - __main__ - Step 94795: {'lr': 0.00015288959330143987, 'samples': 18200640, 'steps': 94794, 'loss/train': 1.2089636325836182} 11/07/2021 10:34:09 - INFO - __main__ - Step 94796: {'lr': 0.00015288470329991984, 'samples': 18200832, 'steps': 94795, 'loss/train': 1.3007878065109253} 11/07/2021 10:34:09 - INFO - __main__ - Step 94797: {'lr': 0.00015287981334215851, 'samples': 18201024, 'steps': 94796, 'loss/train': 1.104586124420166} 11/07/2021 10:34:10 - INFO - __main__ - Step 94798: {'lr': 0.00015287492342815797, 'samples': 18201216, 'steps': 94797, 'loss/train': 0.8704171180725098} 11/07/2021 10:34:10 - INFO - __main__ - Step 94799: {'lr': 0.00015287003355792054, 'samples': 18201408, 'steps': 94798, 'loss/train': 1.6937211751937866} 11/07/2021 10:34:11 - INFO - __main__ - Step 94800: {'lr': 0.00015286514373144837, 'samples': 18201600, 'steps': 94799, 'loss/train': 1.5935711860656738} 11/07/2021 10:34:11 - INFO - __main__ - Step 94801: {'lr': 0.00015286025394874365, 'samples': 18201792, 'steps': 94800, 'loss/train': 1.3857147693634033} 11/07/2021 10:34:12 - INFO - __main__ - Step 94802: {'lr': 0.0001528553642098086, 'samples': 18201984, 'steps': 94801, 'loss/train': 1.6221554279327393} 11/07/2021 10:34:12 - INFO - __main__ - Step 94803: {'lr': 0.00015285047451464546, 'samples': 18202176, 'steps': 94802, 'loss/train': 1.5099900960922241} 11/07/2021 10:34:12 - INFO - __main__ - Step 94804: {'lr': 0.00015284558486325644, 'samples': 18202368, 'steps': 94803, 'loss/train': 1.2431195974349976} 11/07/2021 10:34:13 - INFO - __main__ - Step 94805: {'lr': 0.00015284069525564365, 'samples': 18202560, 'steps': 94804, 'loss/train': 1.2795202732086182} 11/07/2021 10:34:14 - INFO - __main__ - Step 94806: {'lr': 0.00015283580569180934, 'samples': 18202752, 'steps': 94805, 'loss/train': 1.8067541122436523} 11/07/2021 10:34:14 - INFO - __main__ - Step 94807: {'lr': 0.0001528309161717557, 'samples': 18202944, 'steps': 94806, 'loss/train': 1.2145131826400757} 11/07/2021 10:34:15 - INFO - __main__ - Step 94808: {'lr': 0.00015282602669548494, 'samples': 18203136, 'steps': 94807, 'loss/train': 1.4428951740264893} 11/07/2021 10:34:15 - INFO - __main__ - Step 94809: {'lr': 0.00015282113726299926, 'samples': 18203328, 'steps': 94808, 'loss/train': 1.3916043043136597} 11/07/2021 10:34:15 - INFO - __main__ - Step 94810: {'lr': 0.0001528162478743009, 'samples': 18203520, 'steps': 94809, 'loss/train': 1.4324913024902344} 11/07/2021 10:34:16 - INFO - __main__ - Step 94811: {'lr': 0.00015281135852939203, 'samples': 18203712, 'steps': 94810, 'loss/train': 1.9379717111587524} 11/07/2021 10:34:17 - INFO - __main__ - Step 94812: {'lr': 0.00015280646922827487, 'samples': 18203904, 'steps': 94811, 'loss/train': 1.5845789909362793} 11/07/2021 10:34:17 - INFO - __main__ - Step 94813: {'lr': 0.00015280157997095162, 'samples': 18204096, 'steps': 94812, 'loss/train': 0.7975922226905823} 11/07/2021 10:34:17 - INFO - __main__ - Step 94814: {'lr': 0.00015279669075742448, 'samples': 18204288, 'steps': 94813, 'loss/train': 1.5504754781723022} 11/07/2021 10:34:18 - INFO - __main__ - Step 94815: {'lr': 0.00015279180158769566, 'samples': 18204480, 'steps': 94814, 'loss/train': 0.8232129812240601} 11/07/2021 10:34:19 - INFO - __main__ - Step 94816: {'lr': 0.00015278691246176738, 'samples': 18204672, 'steps': 94815, 'loss/train': 1.7877916097640991} 11/07/2021 10:34:19 - INFO - __main__ - Step 94817: {'lr': 0.0001527820233796418, 'samples': 18204864, 'steps': 94816, 'loss/train': 1.3199315071105957} 11/07/2021 10:34:20 - INFO - __main__ - Step 94818: {'lr': 0.00015277713434132113, 'samples': 18205056, 'steps': 94817, 'loss/train': 1.4963268041610718} 11/07/2021 10:34:20 - INFO - __main__ - Step 94819: {'lr': 0.00015277224534680756, 'samples': 18205248, 'steps': 94818, 'loss/train': 1.4611648321151733} 11/07/2021 10:34:20 - INFO - __main__ - Step 94820: {'lr': 0.00015276735639610335, 'samples': 18205440, 'steps': 94819, 'loss/train': 1.2757316827774048} 11/07/2021 10:34:21 - INFO - __main__ - Step 94821: {'lr': 0.00015276246748921064, 'samples': 18205632, 'steps': 94820, 'loss/train': 1.061858892440796} 11/07/2021 10:34:22 - INFO - __main__ - Step 94822: {'lr': 0.00015275757862613166, 'samples': 18205824, 'steps': 94821, 'loss/train': 1.6118944883346558} 11/07/2021 10:34:22 - INFO - __main__ - Step 94823: {'lr': 0.00015275268980686864, 'samples': 18206016, 'steps': 94822, 'loss/train': 1.3481394052505493} 11/07/2021 10:34:22 - INFO - __main__ - Step 94824: {'lr': 0.0001527478010314237, 'samples': 18206208, 'steps': 94823, 'loss/train': 0.9344918727874756} 11/07/2021 10:34:23 - INFO - __main__ - Step 94825: {'lr': 0.00015274291229979914, 'samples': 18206400, 'steps': 94824, 'loss/train': 1.2509456872940063} 11/07/2021 10:34:24 - INFO - __main__ - Step 94826: {'lr': 0.00015273802361199712, 'samples': 18206592, 'steps': 94825, 'loss/train': 1.2464550733566284} 11/07/2021 10:34:24 - INFO - __main__ - Step 94827: {'lr': 0.00015273313496801992, 'samples': 18206784, 'steps': 94826, 'loss/train': 1.2161211967468262} 11/07/2021 10:34:24 - INFO - __main__ - Step 94828: {'lr': 0.00015272824636786958, 'samples': 18206976, 'steps': 94827, 'loss/train': 1.3323533535003662} 11/07/2021 10:34:25 - INFO - __main__ - Step 94829: {'lr': 0.00015272335781154838, 'samples': 18207168, 'steps': 94828, 'loss/train': 1.474396824836731} 11/07/2021 10:34:25 - INFO - __main__ - Step 94830: {'lr': 0.00015271846929905858, 'samples': 18207360, 'steps': 94829, 'loss/train': 1.447134256362915} 11/07/2021 10:34:26 - INFO - __main__ - Step 94831: {'lr': 0.00015271358083040237, 'samples': 18207552, 'steps': 94830, 'loss/train': 0.9245665669441223} 11/07/2021 10:34:26 - INFO - __main__ - Step 94832: {'lr': 0.0001527086924055819, 'samples': 18207744, 'steps': 94831, 'loss/train': 1.1680474281311035} 11/07/2021 10:34:27 - INFO - __main__ - Step 94833: {'lr': 0.00015270380402459933, 'samples': 18207936, 'steps': 94832, 'loss/train': 1.4552454948425293} 11/07/2021 10:34:27 - INFO - __main__ - Step 94834: {'lr': 0.00015269891568745698, 'samples': 18208128, 'steps': 94833, 'loss/train': 1.8554184436798096} 11/07/2021 10:34:27 - INFO - __main__ - Step 94835: {'lr': 0.00015269402739415694, 'samples': 18208320, 'steps': 94834, 'loss/train': 1.5434746742248535} 11/07/2021 10:34:28 - INFO - __main__ - Step 94836: {'lr': 0.0001526891391447015, 'samples': 18208512, 'steps': 94835, 'loss/train': 1.3800873756408691} 11/07/2021 10:34:29 - INFO - __main__ - Step 94837: {'lr': 0.00015268425093909287, 'samples': 18208704, 'steps': 94836, 'loss/train': 1.23208749294281} 11/07/2021 10:34:29 - INFO - __main__ - Step 94838: {'lr': 0.00015267936277733318, 'samples': 18208896, 'steps': 94837, 'loss/train': 0.9969526529312134} 11/07/2021 10:34:29 - INFO - __main__ - Step 94839: {'lr': 0.0001526744746594247, 'samples': 18209088, 'steps': 94838, 'loss/train': 0.7842779159545898} 11/07/2021 10:34:30 - INFO - __main__ - Step 94840: {'lr': 0.00015266958658536952, 'samples': 18209280, 'steps': 94839, 'loss/train': 1.1577552556991577} 11/07/2021 10:34:31 - INFO - __main__ - Step 94841: {'lr': 0.00015266469855516998, 'samples': 18209472, 'steps': 94840, 'loss/train': 1.338438868522644} 11/07/2021 10:34:31 - INFO - __main__ - Step 94842: {'lr': 0.0001526598105688282, 'samples': 18209664, 'steps': 94841, 'loss/train': 1.0781418085098267} 11/07/2021 10:34:32 - INFO - __main__ - Step 94843: {'lr': 0.00015265492262634645, 'samples': 18209856, 'steps': 94842, 'loss/train': 1.2198493480682373} 11/07/2021 10:34:32 - INFO - __main__ - Step 94844: {'lr': 0.00015265003472772688, 'samples': 18210048, 'steps': 94843, 'loss/train': 2.1670033931732178} 11/07/2021 10:34:32 - INFO - __main__ - Step 94845: {'lr': 0.0001526451468729717, 'samples': 18210240, 'steps': 94844, 'loss/train': 1.5683889389038086} 11/07/2021 10:34:33 - INFO - __main__ - Step 94846: {'lr': 0.00015264025906208307, 'samples': 18210432, 'steps': 94845, 'loss/train': 1.4700409173965454} 11/07/2021 10:34:34 - INFO - __main__ - Step 94847: {'lr': 0.00015263537129506328, 'samples': 18210624, 'steps': 94846, 'loss/train': 1.0838208198547363} 11/07/2021 10:34:34 - INFO - __main__ - Step 94848: {'lr': 0.0001526304835719145, 'samples': 18210816, 'steps': 94847, 'loss/train': 1.045764446258545} 11/07/2021 10:34:34 - INFO - __main__ - Step 94849: {'lr': 0.00015262559589263893, 'samples': 18211008, 'steps': 94848, 'loss/train': 0.8946524262428284} 11/07/2021 10:34:35 - INFO - __main__ - Step 94850: {'lr': 0.0001526207082572387, 'samples': 18211200, 'steps': 94849, 'loss/train': 1.4202831983566284} 11/07/2021 10:34:35 - INFO - __main__ - Step 94851: {'lr': 0.00015261582066571612, 'samples': 18211392, 'steps': 94850, 'loss/train': 1.312355637550354} 11/07/2021 10:34:36 - INFO - __main__ - Step 94852: {'lr': 0.00015261093311807333, 'samples': 18211584, 'steps': 94851, 'loss/train': 1.5632178783416748} 11/07/2021 10:34:37 - INFO - __main__ - Step 94853: {'lr': 0.00015260604561431255, 'samples': 18211776, 'steps': 94852, 'loss/train': 1.1304261684417725} 11/07/2021 10:34:37 - INFO - __main__ - Step 94854: {'lr': 0.00015260115815443598, 'samples': 18211968, 'steps': 94853, 'loss/train': 0.7929124236106873} 11/07/2021 10:34:37 - INFO - __main__ - Step 94855: {'lr': 0.00015259627073844584, 'samples': 18212160, 'steps': 94854, 'loss/train': 1.234807014465332} 11/07/2021 10:34:38 - INFO - __main__ - Step 94856: {'lr': 0.0001525913833663443, 'samples': 18212352, 'steps': 94855, 'loss/train': 0.818975567817688} 11/07/2021 10:34:39 - INFO - __main__ - Step 94857: {'lr': 0.00015258649603813357, 'samples': 18212544, 'steps': 94856, 'loss/train': 1.3456768989562988} 11/07/2021 10:34:39 - INFO - __main__ - Step 94858: {'lr': 0.00015258160875381593, 'samples': 18212736, 'steps': 94857, 'loss/train': 1.3067234754562378} 11/07/2021 10:34:39 - INFO - __main__ - Step 94859: {'lr': 0.00015257672151339352, 'samples': 18212928, 'steps': 94858, 'loss/train': 1.1216570138931274} 11/07/2021 10:34:40 - INFO - __main__ - Step 94860: {'lr': 0.00015257183431686847, 'samples': 18213120, 'steps': 94859, 'loss/train': 1.340918779373169} 11/07/2021 10:34:40 - INFO - __main__ - Step 94861: {'lr': 0.00015256694716424306, 'samples': 18213312, 'steps': 94860, 'loss/train': 1.3074365854263306} 11/07/2021 10:34:41 - INFO - __main__ - Step 94862: {'lr': 0.00015256206005551947, 'samples': 18213504, 'steps': 94861, 'loss/train': 1.1399540901184082} 11/07/2021 10:34:41 - INFO - __main__ - Step 94863: {'lr': 0.00015255717299069994, 'samples': 18213696, 'steps': 94862, 'loss/train': 1.5488885641098022} 11/07/2021 10:34:42 - INFO - __main__ - Step 94864: {'lr': 0.0001525522859697866, 'samples': 18213888, 'steps': 94863, 'loss/train': 1.353393793106079} 11/07/2021 10:34:42 - INFO - __main__ - Step 94865: {'lr': 0.00015254739899278171, 'samples': 18214080, 'steps': 94864, 'loss/train': 1.4141838550567627} 11/07/2021 10:34:42 - INFO - __main__ - Step 94866: {'lr': 0.0001525425120596875, 'samples': 18214272, 'steps': 94865, 'loss/train': 0.821323573589325} 11/07/2021 10:34:44 - INFO - __main__ - Step 94867: {'lr': 0.00015253762517050605, 'samples': 18214464, 'steps': 94866, 'loss/train': 1.6740221977233887} 11/07/2021 10:34:44 - INFO - __main__ - Step 94868: {'lr': 0.00015253273832523974, 'samples': 18214656, 'steps': 94867, 'loss/train': 1.2995661497116089} 11/07/2021 10:34:44 - INFO - __main__ - Step 94869: {'lr': 0.00015252785152389058, 'samples': 18214848, 'steps': 94868, 'loss/train': 1.6526336669921875} 11/07/2021 10:34:45 - INFO - __main__ - Step 94870: {'lr': 0.00015252296476646094, 'samples': 18215040, 'steps': 94869, 'loss/train': 1.493151307106018} 11/07/2021 10:34:45 - INFO - __main__ - Step 94871: {'lr': 0.000152518078052953, 'samples': 18215232, 'steps': 94870, 'loss/train': 1.0548100471496582} 11/07/2021 10:34:46 - INFO - __main__ - Step 94872: {'lr': 0.00015251319138336882, 'samples': 18215424, 'steps': 94871, 'loss/train': 1.160768985748291} 11/07/2021 10:34:46 - INFO - __main__ - Step 94873: {'lr': 0.00015250830475771072, 'samples': 18215616, 'steps': 94872, 'loss/train': 1.5452289581298828} 11/07/2021 10:34:47 - INFO - __main__ - Step 94874: {'lr': 0.00015250341817598084, 'samples': 18215808, 'steps': 94873, 'loss/train': 1.269293189048767} 11/07/2021 10:34:47 - INFO - __main__ - Step 94875: {'lr': 0.00015249853163818144, 'samples': 18216000, 'steps': 94874, 'loss/train': 1.2510230541229248} 11/07/2021 10:34:47 - INFO - __main__ - Step 94876: {'lr': 0.0001524936451443147, 'samples': 18216192, 'steps': 94875, 'loss/train': 0.9416180849075317} 11/07/2021 10:34:48 - INFO - __main__ - Step 94877: {'lr': 0.00015248875869438278, 'samples': 18216384, 'steps': 94876, 'loss/train': 0.8383516073226929} 11/07/2021 10:34:49 - INFO - __main__ - Step 94878: {'lr': 0.00015248387228838795, 'samples': 18216576, 'steps': 94877, 'loss/train': 1.4275513887405396} 11/07/2021 10:34:49 - INFO - __main__ - Step 94879: {'lr': 0.00015247898592633236, 'samples': 18216768, 'steps': 94878, 'loss/train': 1.6842445135116577} 11/07/2021 10:34:49 - INFO - __main__ - Step 94880: {'lr': 0.00015247409960821828, 'samples': 18216960, 'steps': 94879, 'loss/train': 1.2114832401275635} 11/07/2021 10:34:50 - INFO - __main__ - Step 94881: {'lr': 0.00015246921333404785, 'samples': 18217152, 'steps': 94880, 'loss/train': 1.2613160610198975} 11/07/2021 10:34:50 - INFO - __main__ - Step 94882: {'lr': 0.00015246432710382324, 'samples': 18217344, 'steps': 94881, 'loss/train': 0.7839977741241455} 11/07/2021 10:34:52 - INFO - __main__ - Step 94883: {'lr': 0.00015245944091754675, 'samples': 18217536, 'steps': 94882, 'loss/train': 1.2903729677200317} 11/07/2021 10:34:52 - INFO - __main__ - Step 94884: {'lr': 0.00015245455477522053, 'samples': 18217728, 'steps': 94883, 'loss/train': 1.5382450819015503} 11/07/2021 10:34:52 - INFO - __main__ - Step 94885: {'lr': 0.00015244966867684683, 'samples': 18217920, 'steps': 94884, 'loss/train': 1.5574713945388794} 11/07/2021 10:34:53 - INFO - __main__ - Step 94886: {'lr': 0.00015244478262242775, 'samples': 18218112, 'steps': 94885, 'loss/train': 0.06964508444070816} 11/07/2021 10:34:53 - INFO - __main__ - Step 94887: {'lr': 0.00015243989661196556, 'samples': 18218304, 'steps': 94886, 'loss/train': 1.313795804977417} 11/07/2021 10:34:54 - INFO - __main__ - Step 94888: {'lr': 0.0001524350106454624, 'samples': 18218496, 'steps': 94887, 'loss/train': 1.1813390254974365} 11/07/2021 10:34:54 - INFO - __main__ - Step 94889: {'lr': 0.00015243012472292055, 'samples': 18218688, 'steps': 94888, 'loss/train': 1.1554796695709229} 11/07/2021 10:34:55 - INFO - __main__ - Step 94890: {'lr': 0.00015242523884434218, 'samples': 18218880, 'steps': 94889, 'loss/train': 1.121797800064087} 11/07/2021 10:34:55 - INFO - __main__ - Step 94891: {'lr': 0.00015242035300972945, 'samples': 18219072, 'steps': 94890, 'loss/train': 1.3188711404800415} 11/07/2021 10:34:55 - INFO - __main__ - Step 94892: {'lr': 0.00015241546721908467, 'samples': 18219264, 'steps': 94891, 'loss/train': 0.6869860887527466} 11/07/2021 10:34:56 - INFO - __main__ - Step 94893: {'lr': 0.00015241058147240995, 'samples': 18219456, 'steps': 94892, 'loss/train': 1.3947817087173462} 11/07/2021 10:34:57 - INFO - __main__ - Step 94894: {'lr': 0.0001524056957697075, 'samples': 18219648, 'steps': 94893, 'loss/train': 1.3468462228775024} 11/07/2021 10:34:57 - INFO - __main__ - Step 94895: {'lr': 0.00015240081011097954, 'samples': 18219840, 'steps': 94894, 'loss/train': 1.3183557987213135} 11/07/2021 10:34:57 - INFO - __main__ - Step 94896: {'lr': 0.00015239592449622824, 'samples': 18220032, 'steps': 94895, 'loss/train': 1.4894518852233887} 11/07/2021 10:34:58 - INFO - __main__ - Step 94897: {'lr': 0.0001523910389254559, 'samples': 18220224, 'steps': 94896, 'loss/train': 1.80966055393219} 11/07/2021 10:34:59 - INFO - __main__ - Step 94898: {'lr': 0.00015238615339866472, 'samples': 18220416, 'steps': 94897, 'loss/train': 0.8178684711456299} 11/07/2021 10:34:59 - INFO - __main__ - Step 94899: {'lr': 0.00015238126791585673, 'samples': 18220608, 'steps': 94898, 'loss/train': 1.7767025232315063} 11/07/2021 10:34:59 - INFO - __main__ - Step 94900: {'lr': 0.00015237638247703422, 'samples': 18220800, 'steps': 94899, 'loss/train': 1.3824383020401} 11/07/2021 10:35:00 - INFO - __main__ - Step 94901: {'lr': 0.0001523714970821994, 'samples': 18220992, 'steps': 94900, 'loss/train': 1.1244982481002808} 11/07/2021 10:35:00 - INFO - __main__ - Step 94902: {'lr': 0.00015236661173135453, 'samples': 18221184, 'steps': 94901, 'loss/train': 0.8480095267295837} 11/07/2021 10:35:01 - INFO - __main__ - Step 94903: {'lr': 0.0001523617264245017, 'samples': 18221376, 'steps': 94902, 'loss/train': 1.2408024072647095} 11/07/2021 10:35:02 - INFO - __main__ - Step 94904: {'lr': 0.0001523568411616432, 'samples': 18221568, 'steps': 94903, 'loss/train': 1.4842489957809448} 11/07/2021 10:35:02 - INFO - __main__ - Step 94905: {'lr': 0.0001523519559427812, 'samples': 18221760, 'steps': 94904, 'loss/train': 1.3761130571365356} 11/07/2021 10:35:02 - INFO - __main__ - Step 94906: {'lr': 0.00015234707076791786, 'samples': 18221952, 'steps': 94905, 'loss/train': 1.4471057653427124} 11/07/2021 10:35:03 - INFO - __main__ - Step 94907: {'lr': 0.00015234218563705548, 'samples': 18222144, 'steps': 94906, 'loss/train': 1.487052083015442} 11/07/2021 10:35:04 - INFO - __main__ - Step 94908: {'lr': 0.00015233730055019617, 'samples': 18222336, 'steps': 94907, 'loss/train': 1.462125539779663} 11/07/2021 10:35:04 - INFO - __main__ - Step 94909: {'lr': 0.0001523324155073422, 'samples': 18222528, 'steps': 94908, 'loss/train': 0.8986525535583496} 11/07/2021 10:35:04 - INFO - __main__ - Step 94910: {'lr': 0.0001523275305084957, 'samples': 18222720, 'steps': 94909, 'loss/train': 1.83936607837677} 11/07/2021 10:35:05 - INFO - __main__ - Step 94911: {'lr': 0.00015232264555365893, 'samples': 18222912, 'steps': 94910, 'loss/train': 1.3446348905563354} 11/07/2021 10:35:05 - INFO - __main__ - Step 94912: {'lr': 0.00015231776064283419, 'samples': 18223104, 'steps': 94911, 'loss/train': 1.059542179107666} 11/07/2021 10:35:06 - INFO - __main__ - Step 94913: {'lr': 0.00015231287577602344, 'samples': 18223296, 'steps': 94912, 'loss/train': 1.2090412378311157} 11/07/2021 10:35:06 - INFO - __main__ - Step 94914: {'lr': 0.00015230799095322894, 'samples': 18223488, 'steps': 94913, 'loss/train': 1.5753370523452759} 11/07/2021 10:35:07 - INFO - __main__ - Step 94915: {'lr': 0.00015230310617445303, 'samples': 18223680, 'steps': 94914, 'loss/train': 1.4837568998336792} 11/07/2021 10:35:07 - INFO - __main__ - Step 94916: {'lr': 0.00015229822143969778, 'samples': 18223872, 'steps': 94915, 'loss/train': 0.8911372423171997} 11/07/2021 10:35:07 - INFO - __main__ - Step 94917: {'lr': 0.0001522933367489655, 'samples': 18224064, 'steps': 94916, 'loss/train': 1.5904375314712524} 11/07/2021 10:35:08 - INFO - __main__ - Step 94918: {'lr': 0.0001522884521022583, 'samples': 18224256, 'steps': 94917, 'loss/train': 0.43401503562927246} 11/07/2021 10:35:09 - INFO - __main__ - Step 94919: {'lr': 0.0001522835674995784, 'samples': 18224448, 'steps': 94918, 'loss/train': 1.386494755744934} 11/07/2021 10:35:10 - INFO - __main__ - Step 94920: {'lr': 0.00015227868294092806, 'samples': 18224640, 'steps': 94919, 'loss/train': 1.4065024852752686} 11/07/2021 10:35:10 - INFO - __main__ - Step 94921: {'lr': 0.00015227379842630939, 'samples': 18224832, 'steps': 94920, 'loss/train': 1.5290610790252686} 11/07/2021 10:35:10 - INFO - __main__ - Step 94922: {'lr': 0.0001522689139557247, 'samples': 18225024, 'steps': 94921, 'loss/train': 0.3091946542263031} 11/07/2021 10:35:11 - INFO - __main__ - Step 94923: {'lr': 0.00015226402952917605, 'samples': 18225216, 'steps': 94922, 'loss/train': 1.7083656787872314} 11/07/2021 10:35:12 - INFO - __main__ - Step 94924: {'lr': 0.00015225914514666578, 'samples': 18225408, 'steps': 94923, 'loss/train': 0.9300922751426697} 11/07/2021 10:35:12 - INFO - __main__ - Step 94925: {'lr': 0.00015225426080819614, 'samples': 18225600, 'steps': 94924, 'loss/train': 1.5231188535690308} 11/07/2021 10:35:12 - INFO - __main__ - Step 94926: {'lr': 0.00015224937651376908, 'samples': 18225792, 'steps': 94925, 'loss/train': 1.6064209938049316} 11/07/2021 10:35:13 - INFO - __main__ - Step 94927: {'lr': 0.00015224449226338696, 'samples': 18225984, 'steps': 94926, 'loss/train': 1.307652235031128} 11/07/2021 10:35:13 - INFO - __main__ - Step 94928: {'lr': 0.00015223960805705195, 'samples': 18226176, 'steps': 94927, 'loss/train': 1.178421139717102} 11/07/2021 10:35:14 - INFO - __main__ - Step 94929: {'lr': 0.0001522347238947663, 'samples': 18226368, 'steps': 94928, 'loss/train': 1.1969910860061646} 11/07/2021 10:35:14 - INFO - __main__ - Step 94930: {'lr': 0.00015222983977653215, 'samples': 18226560, 'steps': 94929, 'loss/train': 1.4867993593215942} 11/07/2021 10:35:15 - INFO - __main__ - Step 94931: {'lr': 0.00015222495570235174, 'samples': 18226752, 'steps': 94930, 'loss/train': 1.652389407157898} 11/07/2021 10:35:15 - INFO - __main__ - Step 94932: {'lr': 0.0001522200716722272, 'samples': 18226944, 'steps': 94931, 'loss/train': 1.1312158107757568} 11/07/2021 10:35:15 - INFO - __main__ - Step 94933: {'lr': 0.00015221518768616084, 'samples': 18227136, 'steps': 94932, 'loss/train': 1.2730461359024048} 11/07/2021 10:35:16 - INFO - __main__ - Step 94934: {'lr': 0.00015221030374415478, 'samples': 18227328, 'steps': 94933, 'loss/train': 1.2190849781036377} 11/07/2021 10:35:17 - INFO - __main__ - Step 94935: {'lr': 0.00015220541984621127, 'samples': 18227520, 'steps': 94934, 'loss/train': 1.141896367073059} 11/07/2021 10:35:17 - INFO - __main__ - Step 94936: {'lr': 0.0001522005359923325, 'samples': 18227712, 'steps': 94935, 'loss/train': 1.1453874111175537} 11/07/2021 10:35:18 - INFO - __main__ - Step 94937: {'lr': 0.00015219565218252062, 'samples': 18227904, 'steps': 94936, 'loss/train': 1.3888064622879028} 11/07/2021 10:35:18 - INFO - __main__ - Step 94938: {'lr': 0.000152190768416778, 'samples': 18228096, 'steps': 94937, 'loss/train': 1.2284200191497803} 11/07/2021 10:35:18 - INFO - __main__ - Step 94939: {'lr': 0.0001521858846951066, 'samples': 18228288, 'steps': 94938, 'loss/train': 1.9352948665618896} 11/07/2021 10:35:19 - INFO - __main__ - Step 94940: {'lr': 0.00015218100101750876, 'samples': 18228480, 'steps': 94939, 'loss/train': 0.7015941143035889} 11/07/2021 10:35:20 - INFO - __main__ - Step 94941: {'lr': 0.00015217611738398663, 'samples': 18228672, 'steps': 94940, 'loss/train': 1.1598737239837646} 11/07/2021 10:35:20 - INFO - __main__ - Step 94942: {'lr': 0.0001521712337945424, 'samples': 18228864, 'steps': 94941, 'loss/train': 1.5892434120178223} 11/07/2021 10:35:20 - INFO - __main__ - Step 94943: {'lr': 0.00015216635024917834, 'samples': 18229056, 'steps': 94942, 'loss/train': 0.8135735988616943} 11/07/2021 10:35:21 - INFO - __main__ - Step 94944: {'lr': 0.0001521614667478966, 'samples': 18229248, 'steps': 94943, 'loss/train': 0.9154157638549805} 11/07/2021 10:35:22 - INFO - __main__ - Step 94945: {'lr': 0.0001521565832906994, 'samples': 18229440, 'steps': 94944, 'loss/train': 1.2963225841522217} 11/07/2021 10:35:22 - INFO - __main__ - Step 94946: {'lr': 0.00015215169987758894, 'samples': 18229632, 'steps': 94945, 'loss/train': 1.0688506364822388} 11/07/2021 10:35:22 - INFO - __main__ - Step 94947: {'lr': 0.00015214681650856739, 'samples': 18229824, 'steps': 94946, 'loss/train': 1.3915067911148071} 11/07/2021 10:35:23 - INFO - __main__ - Step 94948: {'lr': 0.000152141933183637, 'samples': 18230016, 'steps': 94947, 'loss/train': 1.7167387008666992} 11/07/2021 10:35:23 - INFO - __main__ - Step 94949: {'lr': 0.0001521370499027999, 'samples': 18230208, 'steps': 94948, 'loss/train': 1.5985316038131714} 11/07/2021 10:35:24 - INFO - __main__ - Step 94950: {'lr': 0.00015213216666605845, 'samples': 18230400, 'steps': 94949, 'loss/train': 1.4045342206954956} 11/07/2021 10:35:24 - INFO - __main__ - Step 94951: {'lr': 0.00015212728347341464, 'samples': 18230592, 'steps': 94950, 'loss/train': 0.977388858795166} 11/07/2021 10:35:25 - INFO - __main__ - Step 94952: {'lr': 0.00015212240032487086, 'samples': 18230784, 'steps': 94951, 'loss/train': 1.4436452388763428} 11/07/2021 10:35:25 - INFO - __main__ - Step 94953: {'lr': 0.0001521175172204291, 'samples': 18230976, 'steps': 94952, 'loss/train': 1.2541251182556152} 11/07/2021 10:35:26 - INFO - __main__ - Step 94954: {'lr': 0.00015211263416009175, 'samples': 18231168, 'steps': 94953, 'loss/train': 0.4007869064807892} 11/07/2021 10:35:27 - INFO - __main__ - Step 94955: {'lr': 0.00015210775114386088, 'samples': 18231360, 'steps': 94954, 'loss/train': 1.4964592456817627} 11/07/2021 10:35:27 - INFO - __main__ - Step 94956: {'lr': 0.00015210286817173875, 'samples': 18231552, 'steps': 94955, 'loss/train': 0.9833523035049438} 11/07/2021 10:35:27 - INFO - __main__ - Step 94957: {'lr': 0.00015209798524372758, 'samples': 18231744, 'steps': 94956, 'loss/train': 1.0694457292556763} 11/07/2021 10:35:28 - INFO - __main__ - Step 94958: {'lr': 0.00015209310235982955, 'samples': 18231936, 'steps': 94957, 'loss/train': 1.2877874374389648} 11/07/2021 10:35:28 - INFO - __main__ - Step 94959: {'lr': 0.00015208821952004685, 'samples': 18232128, 'steps': 94958, 'loss/train': 1.0638408660888672} 11/07/2021 10:35:29 - INFO - __main__ - Step 94960: {'lr': 0.00015208333672438168, 'samples': 18232320, 'steps': 94959, 'loss/train': 1.2711635828018188} 11/07/2021 10:35:29 - INFO - __main__ - Step 94961: {'lr': 0.00015207845397283628, 'samples': 18232512, 'steps': 94960, 'loss/train': 1.4893062114715576} 11/07/2021 10:35:30 - INFO - __main__ - Step 94962: {'lr': 0.00015207357126541281, 'samples': 18232704, 'steps': 94961, 'loss/train': 0.701863706111908} 11/07/2021 10:35:30 - INFO - __main__ - Step 94963: {'lr': 0.00015206868860211345, 'samples': 18232896, 'steps': 94962, 'loss/train': 0.7783507704734802} 11/07/2021 10:35:30 - INFO - __main__ - Step 94964: {'lr': 0.00015206380598294046, 'samples': 18233088, 'steps': 94963, 'loss/train': 1.3876101970672607} 11/07/2021 10:35:31 - INFO - __main__ - Step 94965: {'lr': 0.00015205892340789602, 'samples': 18233280, 'steps': 94964, 'loss/train': 1.4874908924102783} 11/07/2021 10:35:32 - INFO - __main__ - Step 94966: {'lr': 0.00015205404087698226, 'samples': 18233472, 'steps': 94965, 'loss/train': 1.388073205947876} 11/07/2021 10:35:32 - INFO - __main__ - Step 94967: {'lr': 0.00015204915839020147, 'samples': 18233664, 'steps': 94966, 'loss/train': 1.3292853832244873} 11/07/2021 10:35:32 - INFO - __main__ - Step 94968: {'lr': 0.00015204427594755582, 'samples': 18233856, 'steps': 94967, 'loss/train': 1.319311261177063} 11/07/2021 10:35:33 - INFO - __main__ - Step 94969: {'lr': 0.00015203939354904746, 'samples': 18234048, 'steps': 94968, 'loss/train': 1.1438910961151123} 11/07/2021 10:35:34 - INFO - __main__ - Step 94970: {'lr': 0.0001520345111946787, 'samples': 18234240, 'steps': 94969, 'loss/train': 1.3822554349899292} 11/07/2021 10:35:34 - INFO - __main__ - Step 94971: {'lr': 0.00015202962888445165, 'samples': 18234432, 'steps': 94970, 'loss/train': 0.9829167723655701} 11/07/2021 10:35:35 - INFO - __main__ - Step 94972: {'lr': 0.00015202474661836856, 'samples': 18234624, 'steps': 94971, 'loss/train': 0.9644860625267029} 11/07/2021 10:35:35 - INFO - __main__ - Step 94973: {'lr': 0.0001520198643964316, 'samples': 18234816, 'steps': 94972, 'loss/train': 5.044995307922363} 11/07/2021 10:35:36 - INFO - __main__ - Step 94974: {'lr': 0.00015201498221864297, 'samples': 18235008, 'steps': 94973, 'loss/train': 1.4880458116531372} 11/07/2021 10:35:36 - INFO - __main__ - Step 94975: {'lr': 0.00015201010008500488, 'samples': 18235200, 'steps': 94974, 'loss/train': 1.442258358001709} 11/07/2021 10:35:38 - INFO - __main__ - Step 94976: {'lr': 0.00015200521799551948, 'samples': 18235392, 'steps': 94975, 'loss/train': 1.5851435661315918} 11/07/2021 10:35:38 - INFO - __main__ - Step 94977: {'lr': 0.0001520003359501891, 'samples': 18235584, 'steps': 94976, 'loss/train': 1.6977031230926514} 11/07/2021 10:35:38 - INFO - __main__ - Step 94978: {'lr': 0.00015199545394901576, 'samples': 18235776, 'steps': 94977, 'loss/train': 1.3301029205322266} 11/07/2021 10:35:39 - INFO - __main__ - Step 94979: {'lr': 0.00015199057199200187, 'samples': 18235968, 'steps': 94978, 'loss/train': 0.4436163306236267} 11/07/2021 10:35:39 - INFO - __main__ - Step 94980: {'lr': 0.00015198569007914944, 'samples': 18236160, 'steps': 94979, 'loss/train': 0.672435462474823} 11/07/2021 10:35:39 - INFO - __main__ - Step 94981: {'lr': 0.00015198080821046076, 'samples': 18236352, 'steps': 94980, 'loss/train': 1.6461783647537231} 11/07/2021 10:35:40 - INFO - __main__ - Step 94982: {'lr': 0.000151975926385938, 'samples': 18236544, 'steps': 94981, 'loss/train': 1.7227364778518677} 11/07/2021 10:35:41 - INFO - __main__ - Step 94983: {'lr': 0.00015197104460558345, 'samples': 18236736, 'steps': 94982, 'loss/train': 1.5755091905593872} 11/07/2021 10:35:41 - INFO - __main__ - Step 94984: {'lr': 0.0001519661628693992, 'samples': 18236928, 'steps': 94983, 'loss/train': 0.9533984661102295} 11/07/2021 10:35:41 - INFO - __main__ - Step 94985: {'lr': 0.0001519612811773874, 'samples': 18237120, 'steps': 94984, 'loss/train': 0.9313892722129822} 11/07/2021 10:35:42 - INFO - __main__ - Step 94986: {'lr': 0.00015195639952955041, 'samples': 18237312, 'steps': 94985, 'loss/train': 1.4367949962615967} 11/07/2021 10:35:42 - INFO - __main__ - Step 94987: {'lr': 0.00015195151792589035, 'samples': 18237504, 'steps': 94986, 'loss/train': 1.2069705724716187} 11/07/2021 10:35:43 - INFO - __main__ - Step 94988: {'lr': 0.00015194663636640938, 'samples': 18237696, 'steps': 94987, 'loss/train': 1.6541509628295898} 11/07/2021 10:35:44 - INFO - __main__ - Step 94989: {'lr': 0.0001519417548511098, 'samples': 18237888, 'steps': 94988, 'loss/train': 1.2929075956344604} 11/07/2021 10:35:44 - INFO - __main__ - Step 94990: {'lr': 0.00015193687337999368, 'samples': 18238080, 'steps': 94989, 'loss/train': 1.5623177289962769} 11/07/2021 10:35:44 - INFO - __main__ - Step 94991: {'lr': 0.00015193199195306334, 'samples': 18238272, 'steps': 94990, 'loss/train': 1.7604920864105225} 11/07/2021 10:35:45 - INFO - __main__ - Step 94992: {'lr': 0.000151927110570321, 'samples': 18238464, 'steps': 94991, 'loss/train': 0.9027819037437439} 11/07/2021 10:35:46 - INFO - __main__ - Step 94993: {'lr': 0.00015192222923176869, 'samples': 18238656, 'steps': 94992, 'loss/train': 1.1393071413040161} 11/07/2021 10:35:46 - INFO - __main__ - Step 94994: {'lr': 0.0001519173479374088, 'samples': 18238848, 'steps': 94993, 'loss/train': 1.1311471462249756} 11/07/2021 10:35:46 - INFO - __main__ - Step 94995: {'lr': 0.00015191246668724335, 'samples': 18239040, 'steps': 94994, 'loss/train': 1.6713908910751343} 11/07/2021 10:35:47 - INFO - __main__ - Step 94996: {'lr': 0.00015190758548127464, 'samples': 18239232, 'steps': 94995, 'loss/train': 1.1314321756362915} 11/07/2021 10:35:47 - INFO - __main__ - Step 94997: {'lr': 0.00015190270431950488, 'samples': 18239424, 'steps': 94996, 'loss/train': 1.0227398872375488} 11/07/2021 10:35:48 - INFO - __main__ - Step 94998: {'lr': 0.00015189782320193624, 'samples': 18239616, 'steps': 94997, 'loss/train': 1.050968885421753} 11/07/2021 10:35:48 - INFO - __main__ - Step 94999: {'lr': 0.00015189294212857095, 'samples': 18239808, 'steps': 94998, 'loss/train': 1.3514708280563354} 11/07/2021 10:35:49 - INFO - __main__ - Step 95000: {'lr': 0.00015188806109941113, 'samples': 18240000, 'steps': 94999, 'loss/train': 1.3021292686462402} 11/07/2021 10:35:49 - INFO - __main__ - Step 95001: {'lr': 0.00015188318011445906, 'samples': 18240192, 'steps': 95000, 'loss/train': 1.4184300899505615} 11/07/2021 10:35:49 - INFO - __main__ - Step 95002: {'lr': 0.00015187829917371693, 'samples': 18240384, 'steps': 95001, 'loss/train': 1.2803764343261719} 11/07/2021 10:35:51 - INFO - __main__ - Step 95003: {'lr': 0.00015187341827718694, 'samples': 18240576, 'steps': 95002, 'loss/train': 1.47151517868042} 11/07/2021 10:35:51 - INFO - __main__ - Step 95004: {'lr': 0.00015186853742487122, 'samples': 18240768, 'steps': 95003, 'loss/train': 1.1699086427688599} 11/07/2021 10:35:51 - INFO - __main__ - Step 95005: {'lr': 0.00015186365661677207, 'samples': 18240960, 'steps': 95004, 'loss/train': 1.1514084339141846} 11/07/2021 10:35:52 - INFO - __main__ - Step 95006: {'lr': 0.0001518587758528917, 'samples': 18241152, 'steps': 95005, 'loss/train': 1.440003514289856} 11/07/2021 10:35:52 - INFO - __main__ - Step 95007: {'lr': 0.00015185389513323218, 'samples': 18241344, 'steps': 95006, 'loss/train': 2.157426595687866} 11/07/2021 10:35:54 - INFO - __main__ - Step 95008: {'lr': 0.00015184901445779582, 'samples': 18241536, 'steps': 95007, 'loss/train': 1.4058778285980225} 11/07/2021 10:35:54 - INFO - __main__ - Step 95009: {'lr': 0.0001518441338265847, 'samples': 18241728, 'steps': 95008, 'loss/train': 0.9730534553527832} 11/07/2021 10:35:54 - INFO - __main__ - Step 95010: {'lr': 0.00015183925323960113, 'samples': 18241920, 'steps': 95009, 'loss/train': 1.3489850759506226} 11/07/2021 10:35:55 - INFO - __main__ - Step 95011: {'lr': 0.0001518343726968473, 'samples': 18242112, 'steps': 95010, 'loss/train': 1.6076737642288208} 11/07/2021 10:35:55 - INFO - __main__ - Step 95012: {'lr': 0.00015182949219832536, 'samples': 18242304, 'steps': 95011, 'loss/train': 1.6109989881515503} 11/07/2021 10:35:55 - INFO - __main__ - Step 95013: {'lr': 0.00015182461174403756, 'samples': 18242496, 'steps': 95012, 'loss/train': 1.5045394897460938} 11/07/2021 10:35:56 - INFO - __main__ - Step 95014: {'lr': 0.00015181973133398605, 'samples': 18242688, 'steps': 95013, 'loss/train': 1.3197325468063354} 11/07/2021 10:35:57 - INFO - __main__ - Step 95015: {'lr': 0.00015181485096817305, 'samples': 18242880, 'steps': 95014, 'loss/train': 1.292449712753296} 11/07/2021 10:35:57 - INFO - __main__ - Step 95016: {'lr': 0.00015180997064660078, 'samples': 18243072, 'steps': 95015, 'loss/train': 1.4545464515686035} 11/07/2021 10:35:57 - INFO - __main__ - Step 95017: {'lr': 0.00015180509036927142, 'samples': 18243264, 'steps': 95016, 'loss/train': 1.2156366109848022} 11/07/2021 10:35:58 - INFO - __main__ - Step 95018: {'lr': 0.00015180021013618715, 'samples': 18243456, 'steps': 95017, 'loss/train': 1.3187333345413208} 11/07/2021 10:35:58 - INFO - __main__ - Step 95019: {'lr': 0.00015179532994735034, 'samples': 18243648, 'steps': 95018, 'loss/train': 1.2968288660049438} 11/07/2021 10:35:59 - INFO - __main__ - Step 95020: {'lr': 0.00015179044980276292, 'samples': 18243840, 'steps': 95019, 'loss/train': 1.283088207244873} 11/07/2021 10:36:00 - INFO - __main__ - Step 95021: {'lr': 0.00015178556970242717, 'samples': 18244032, 'steps': 95020, 'loss/train': 1.1955527067184448} 11/07/2021 10:36:00 - INFO - __main__ - Step 95022: {'lr': 0.00015178068964634536, 'samples': 18244224, 'steps': 95021, 'loss/train': 1.3401551246643066} 11/07/2021 10:36:00 - INFO - __main__ - Step 95023: {'lr': 0.00015177580963451965, 'samples': 18244416, 'steps': 95022, 'loss/train': 1.5891822576522827} 11/07/2021 10:36:01 - INFO - __main__ - Step 95024: {'lr': 0.00015177092966695225, 'samples': 18244608, 'steps': 95023, 'loss/train': 1.204832673072815} 11/07/2021 10:36:02 - INFO - __main__ - Step 95025: {'lr': 0.00015176604974364533, 'samples': 18244800, 'steps': 95024, 'loss/train': 1.0356159210205078} 11/07/2021 10:36:02 - INFO - __main__ - Step 95026: {'lr': 0.00015176116986460116, 'samples': 18244992, 'steps': 95025, 'loss/train': 0.4596741497516632} 11/07/2021 10:36:02 - INFO - __main__ - Step 95027: {'lr': 0.00015175629002982184, 'samples': 18245184, 'steps': 95026, 'loss/train': 1.052955985069275} 11/07/2021 10:36:03 - INFO - __main__ - Step 95028: {'lr': 0.00015175141023930966, 'samples': 18245376, 'steps': 95027, 'loss/train': 1.4567004442214966} 11/07/2021 10:36:03 - INFO - __main__ - Step 95029: {'lr': 0.00015174653049306676, 'samples': 18245568, 'steps': 95028, 'loss/train': 1.5142616033554077} 11/07/2021 10:36:04 - INFO - __main__ - Step 95030: {'lr': 0.00015174165079109533, 'samples': 18245760, 'steps': 95029, 'loss/train': 1.9585241079330444} 11/07/2021 10:36:04 - INFO - __main__ - Step 95031: {'lr': 0.00015173677113339761, 'samples': 18245952, 'steps': 95030, 'loss/train': 1.1106945276260376} 11/07/2021 10:36:05 - INFO - __main__ - Step 95032: {'lr': 0.00015173189151997582, 'samples': 18246144, 'steps': 95031, 'loss/train': 1.4883805513381958} 11/07/2021 10:36:05 - INFO - __main__ - Step 95033: {'lr': 0.00015172701195083222, 'samples': 18246336, 'steps': 95032, 'loss/train': 1.5648468732833862} 11/07/2021 10:36:06 - INFO - __main__ - Step 95034: {'lr': 0.00015172213242596879, 'samples': 18246528, 'steps': 95033, 'loss/train': 1.4123656749725342} 11/07/2021 10:36:07 - INFO - __main__ - Step 95035: {'lr': 0.00015171725294538786, 'samples': 18246720, 'steps': 95034, 'loss/train': 1.1035311222076416} 11/07/2021 10:36:07 - INFO - __main__ - Step 95036: {'lr': 0.00015171237350909158, 'samples': 18246912, 'steps': 95035, 'loss/train': 1.3187177181243896} 11/07/2021 10:36:07 - INFO - __main__ - Step 95037: {'lr': 0.00015170749411708224, 'samples': 18247104, 'steps': 95036, 'loss/train': 0.7389788031578064} 11/07/2021 10:36:08 - INFO - __main__ - Step 95038: {'lr': 0.00015170261476936194, 'samples': 18247296, 'steps': 95037, 'loss/train': 1.2663536071777344} 11/07/2021 10:36:08 - INFO - __main__ - Step 95039: {'lr': 0.00015169773546593295, 'samples': 18247488, 'steps': 95038, 'loss/train': 1.5119311809539795} 11/07/2021 10:36:09 - INFO - __main__ - Step 95040: {'lr': 0.00015169285620679745, 'samples': 18247680, 'steps': 95039, 'loss/train': 1.5099292993545532} 11/07/2021 10:36:09 - INFO - __main__ - Step 95041: {'lr': 0.00015168797699195764, 'samples': 18247872, 'steps': 95040, 'loss/train': 1.2521592378616333} 11/07/2021 10:36:10 - INFO - __main__ - Step 95042: {'lr': 0.00015168309782141569, 'samples': 18248064, 'steps': 95041, 'loss/train': 1.5198699235916138} 11/07/2021 10:36:10 - INFO - __main__ - Step 95043: {'lr': 0.00015167821869517382, 'samples': 18248256, 'steps': 95042, 'loss/train': 1.3687530755996704} 11/07/2021 10:36:10 - INFO - __main__ - Step 95044: {'lr': 0.00015167333961323425, 'samples': 18248448, 'steps': 95043, 'loss/train': 1.0174058675765991} 11/07/2021 10:36:12 - INFO - __main__ - Step 95045: {'lr': 0.00015166846057559913, 'samples': 18248640, 'steps': 95044, 'loss/train': 1.0701878070831299} 11/07/2021 10:36:12 - INFO - __main__ - Step 95046: {'lr': 0.00015166358158227077, 'samples': 18248832, 'steps': 95045, 'loss/train': 1.1774492263793945} 11/07/2021 10:36:12 - INFO - __main__ - Step 95047: {'lr': 0.00015165870263325121, 'samples': 18249024, 'steps': 95046, 'loss/train': 1.4915438890457153} 11/07/2021 10:36:13 - INFO - __main__ - Step 95048: {'lr': 0.00015165382372854273, 'samples': 18249216, 'steps': 95047, 'loss/train': 1.4800649881362915} 11/07/2021 10:36:13 - INFO - __main__ - Step 95049: {'lr': 0.0001516489448681475, 'samples': 18249408, 'steps': 95048, 'loss/train': 0.9577690958976746} 11/07/2021 10:36:13 - INFO - __main__ - Step 95050: {'lr': 0.00015164406605206777, 'samples': 18249600, 'steps': 95049, 'loss/train': 1.2414621114730835} 11/07/2021 10:36:15 - INFO - __main__ - Step 95051: {'lr': 0.00015163918728030565, 'samples': 18249792, 'steps': 95050, 'loss/train': 1.551863193511963} 11/07/2021 10:36:15 - INFO - __main__ - Step 95052: {'lr': 0.00015163430855286343, 'samples': 18249984, 'steps': 95051, 'loss/train': 0.9081740975379944} 11/07/2021 10:36:15 - INFO - __main__ - Step 95053: {'lr': 0.00015162942986974326, 'samples': 18250176, 'steps': 95052, 'loss/train': 1.2256027460098267} 11/07/2021 10:36:16 - INFO - __main__ - Step 95054: {'lr': 0.00015162455123094736, 'samples': 18250368, 'steps': 95053, 'loss/train': 1.5468080043792725} 11/07/2021 10:36:16 - INFO - __main__ - Step 95055: {'lr': 0.0001516196726364779, 'samples': 18250560, 'steps': 95054, 'loss/train': 5.2531046867370605} 11/07/2021 10:36:16 - INFO - __main__ - Step 95056: {'lr': 0.00015161479408633713, 'samples': 18250752, 'steps': 95055, 'loss/train': 5.36776876449585} 11/07/2021 10:36:17 - INFO - __main__ - Step 95057: {'lr': 0.00015160991558052722, 'samples': 18250944, 'steps': 95056, 'loss/train': 1.0136628150939941} 11/07/2021 10:36:18 - INFO - __main__ - Step 95058: {'lr': 0.00015160503711905032, 'samples': 18251136, 'steps': 95057, 'loss/train': 1.1622527837753296} 11/07/2021 10:36:18 - INFO - __main__ - Step 95059: {'lr': 0.0001516001587019088, 'samples': 18251328, 'steps': 95058, 'loss/train': 1.1355501413345337} 11/07/2021 10:36:19 - INFO - __main__ - Step 95060: {'lr': 0.00015159528032910463, 'samples': 18251520, 'steps': 95059, 'loss/train': 1.5066163539886475} 11/07/2021 10:36:19 - INFO - __main__ - Step 95061: {'lr': 0.0001515904020006401, 'samples': 18251712, 'steps': 95060, 'loss/train': 1.2919896841049194} 11/07/2021 10:36:20 - INFO - __main__ - Step 95062: {'lr': 0.00015158552371651743, 'samples': 18251904, 'steps': 95061, 'loss/train': 1.3136662244796753} 11/07/2021 10:36:20 - INFO - __main__ - Step 95063: {'lr': 0.00015158064547673877, 'samples': 18252096, 'steps': 95062, 'loss/train': 0.6241506934165955} 11/07/2021 10:36:21 - INFO - __main__ - Step 95064: {'lr': 0.0001515757672813064, 'samples': 18252288, 'steps': 95063, 'loss/train': 1.1169489622116089} 11/07/2021 10:36:21 - INFO - __main__ - Step 95065: {'lr': 0.00015157088913022242, 'samples': 18252480, 'steps': 95064, 'loss/train': 0.8920952677726746} 11/07/2021 10:36:21 - INFO - __main__ - Step 95066: {'lr': 0.00015156601102348912, 'samples': 18252672, 'steps': 95065, 'loss/train': 1.5361051559448242} 11/07/2021 10:36:22 - INFO - __main__ - Step 95067: {'lr': 0.00015156113296110866, 'samples': 18252864, 'steps': 95066, 'loss/train': 1.4863437414169312} 11/07/2021 10:36:23 - INFO - __main__ - Step 95068: {'lr': 0.00015155625494308323, 'samples': 18253056, 'steps': 95067, 'loss/train': 1.3291188478469849} 11/07/2021 10:36:23 - INFO - __main__ - Step 95069: {'lr': 0.000151551376969415, 'samples': 18253248, 'steps': 95068, 'loss/train': 1.2780518531799316} 11/07/2021 10:36:23 - INFO - __main__ - Step 95070: {'lr': 0.00015154649904010624, 'samples': 18253440, 'steps': 95069, 'loss/train': 1.2011699676513672} 11/07/2021 10:36:24 - INFO - __main__ - Step 95071: {'lr': 0.00015154162115515907, 'samples': 18253632, 'steps': 95070, 'loss/train': 1.4911253452301025} 11/07/2021 10:36:25 - INFO - __main__ - Step 95072: {'lr': 0.00015153674331457574, 'samples': 18253824, 'steps': 95071, 'loss/train': 1.5037332773208618} 11/07/2021 10:36:25 - INFO - __main__ - Step 95073: {'lr': 0.00015153186551835856, 'samples': 18254016, 'steps': 95072, 'loss/train': 1.2048190832138062} 11/07/2021 10:36:25 - INFO - __main__ - Step 95074: {'lr': 0.00015152698776650948, 'samples': 18254208, 'steps': 95073, 'loss/train': 1.031006097793579} 11/07/2021 10:36:26 - INFO - __main__ - Step 95075: {'lr': 0.00015152211005903084, 'samples': 18254400, 'steps': 95074, 'loss/train': 1.0028035640716553} 11/07/2021 10:36:26 - INFO - __main__ - Step 95076: {'lr': 0.00015151723239592476, 'samples': 18254592, 'steps': 95075, 'loss/train': 1.2924126386642456} 11/07/2021 10:36:27 - INFO - __main__ - Step 95077: {'lr': 0.00015151235477719354, 'samples': 18254784, 'steps': 95076, 'loss/train': 1.6803797483444214} 11/07/2021 10:36:27 - INFO - __main__ - Step 95078: {'lr': 0.00015150747720283934, 'samples': 18254976, 'steps': 95077, 'loss/train': 1.334852933883667} 11/07/2021 10:36:28 - INFO - __main__ - Step 95079: {'lr': 0.00015150259967286434, 'samples': 18255168, 'steps': 95078, 'loss/train': 1.2730895280838013} 11/07/2021 10:36:28 - INFO - __main__ - Step 95080: {'lr': 0.00015149772218727074, 'samples': 18255360, 'steps': 95079, 'loss/train': 0.9980828166007996} 11/07/2021 10:36:28 - INFO - __main__ - Step 95081: {'lr': 0.00015149284474606073, 'samples': 18255552, 'steps': 95080, 'loss/train': 1.1001787185668945} 11/07/2021 10:36:30 - INFO - __main__ - Step 95082: {'lr': 0.00015148796734923656, 'samples': 18255744, 'steps': 95081, 'loss/train': 1.2860511541366577} 11/07/2021 10:36:30 - INFO - __main__ - Step 95083: {'lr': 0.00015148308999680038, 'samples': 18255936, 'steps': 95082, 'loss/train': 1.3522770404815674} 11/07/2021 10:36:30 - INFO - __main__ - Step 95084: {'lr': 0.00015147821268875444, 'samples': 18256128, 'steps': 95083, 'loss/train': 0.6877779364585876} 11/07/2021 10:36:31 - INFO - __main__ - Step 95085: {'lr': 0.0001514733354251009, 'samples': 18256320, 'steps': 95084, 'loss/train': 1.4259333610534668} 11/07/2021 10:36:31 - INFO - __main__ - Step 95086: {'lr': 0.00015146845820584193, 'samples': 18256512, 'steps': 95085, 'loss/train': 1.6326290369033813} 11/07/2021 10:36:32 - INFO - __main__ - Step 95087: {'lr': 0.00015146358103097974, 'samples': 18256704, 'steps': 95086, 'loss/train': 1.0359629392623901} 11/07/2021 10:36:32 - INFO - __main__ - Step 95088: {'lr': 0.00015145870390051653, 'samples': 18256896, 'steps': 95087, 'loss/train': 1.402840495109558} 11/07/2021 10:36:33 - INFO - __main__ - Step 95089: {'lr': 0.0001514538268144545, 'samples': 18257088, 'steps': 95088, 'loss/train': 1.6795768737792969} 11/07/2021 10:36:33 - INFO - __main__ - Step 95090: {'lr': 0.00015144894977279588, 'samples': 18257280, 'steps': 95089, 'loss/train': 0.5299057960510254} 11/07/2021 10:36:33 - INFO - __main__ - Step 95091: {'lr': 0.00015144407277554282, 'samples': 18257472, 'steps': 95090, 'loss/train': 0.8238829374313354} 11/07/2021 10:36:34 - INFO - __main__ - Step 95092: {'lr': 0.00015143919582269756, 'samples': 18257664, 'steps': 95091, 'loss/train': 1.3334320783615112} 11/07/2021 10:36:35 - INFO - __main__ - Step 95093: {'lr': 0.00015143431891426223, 'samples': 18257856, 'steps': 95092, 'loss/train': 0.9620721340179443} 11/07/2021 10:36:35 - INFO - __main__ - Step 95094: {'lr': 0.00015142944205023912, 'samples': 18258048, 'steps': 95093, 'loss/train': 1.1005750894546509} 11/07/2021 10:36:35 - INFO - __main__ - Step 95095: {'lr': 0.0001514245652306304, 'samples': 18258240, 'steps': 95094, 'loss/train': 0.8576415777206421} 11/07/2021 10:36:36 - INFO - __main__ - Step 95096: {'lr': 0.00015141968845543824, 'samples': 18258432, 'steps': 95095, 'loss/train': 1.6786044836044312} 11/07/2021 10:36:37 - INFO - __main__ - Step 95097: {'lr': 0.00015141481172466483, 'samples': 18258624, 'steps': 95096, 'loss/train': 1.4094880819320679} 11/07/2021 10:36:37 - INFO - __main__ - Step 95098: {'lr': 0.0001514099350383124, 'samples': 18258816, 'steps': 95097, 'loss/train': 1.447428584098816} 11/07/2021 10:36:38 - INFO - __main__ - Step 95099: {'lr': 0.0001514050583963831, 'samples': 18259008, 'steps': 95098, 'loss/train': 1.2722363471984863} 11/07/2021 10:36:38 - INFO - __main__ - Step 95100: {'lr': 0.00015140018179887925, 'samples': 18259200, 'steps': 95099, 'loss/train': 0.9350006580352783} 11/07/2021 10:36:38 - INFO - __main__ - Step 95101: {'lr': 0.00015139530524580286, 'samples': 18259392, 'steps': 95100, 'loss/train': 1.512370228767395} 11/07/2021 10:36:39 - INFO - __main__ - Step 95102: {'lr': 0.00015139042873715624, 'samples': 18259584, 'steps': 95101, 'loss/train': 0.05646722391247749} 11/07/2021 10:36:40 - INFO - __main__ - Step 95103: {'lr': 0.0001513855522729416, 'samples': 18259776, 'steps': 95102, 'loss/train': 1.208504319190979} 11/07/2021 10:36:40 - INFO - __main__ - Step 95104: {'lr': 0.00015138067585316107, 'samples': 18259968, 'steps': 95103, 'loss/train': 1.4657922983169556} 11/07/2021 10:36:41 - INFO - __main__ - Step 95105: {'lr': 0.0001513757994778169, 'samples': 18260160, 'steps': 95104, 'loss/train': 1.3780866861343384} 11/07/2021 10:36:41 - INFO - __main__ - Step 95106: {'lr': 0.0001513709231469113, 'samples': 18260352, 'steps': 95105, 'loss/train': 1.3384792804718018} 11/07/2021 10:36:41 - INFO - __main__ - Step 95107: {'lr': 0.00015136604686044643, 'samples': 18260544, 'steps': 95106, 'loss/train': 0.09207836538553238} 11/07/2021 10:36:42 - INFO - __main__ - Step 95108: {'lr': 0.00015136117061842448, 'samples': 18260736, 'steps': 95107, 'loss/train': 1.1760203838348389} 11/07/2021 10:36:43 - INFO - __main__ - Step 95109: {'lr': 0.00015135629442084768, 'samples': 18260928, 'steps': 95108, 'loss/train': 1.1786466836929321} 11/07/2021 10:36:43 - INFO - __main__ - Step 95110: {'lr': 0.0001513514182677182, 'samples': 18261120, 'steps': 95109, 'loss/train': 1.6095315217971802} 11/07/2021 10:36:43 - INFO - __main__ - Step 95111: {'lr': 0.00015134654215903824, 'samples': 18261312, 'steps': 95110, 'loss/train': 1.2441000938415527} 11/07/2021 10:36:44 - INFO - __main__ - Step 95112: {'lr': 0.00015134166609481002, 'samples': 18261504, 'steps': 95111, 'loss/train': 1.1583548784255981} 11/07/2021 10:36:45 - INFO - __main__ - Step 95113: {'lr': 0.00015133679007503577, 'samples': 18261696, 'steps': 95112, 'loss/train': 0.8679611086845398} 11/07/2021 10:36:45 - INFO - __main__ - Step 95114: {'lr': 0.0001513319140997176, 'samples': 18261888, 'steps': 95113, 'loss/train': 2.298478841781616} 11/07/2021 10:36:45 - INFO - __main__ - Step 95115: {'lr': 0.00015132703816885768, 'samples': 18262080, 'steps': 95114, 'loss/train': 1.454143762588501} 11/07/2021 10:36:46 - INFO - __main__ - Step 95116: {'lr': 0.00015132216228245834, 'samples': 18262272, 'steps': 95115, 'loss/train': 1.6140267848968506} 11/07/2021 10:36:46 - INFO - __main__ - Step 95117: {'lr': 0.00015131728644052173, 'samples': 18262464, 'steps': 95116, 'loss/train': 1.305690050125122} 11/07/2021 10:36:47 - INFO - __main__ - Step 95118: {'lr': 0.00015131241064305002, 'samples': 18262656, 'steps': 95117, 'loss/train': 0.9986306428909302} 11/07/2021 10:36:48 - INFO - __main__ - Step 95119: {'lr': 0.0001513075348900454, 'samples': 18262848, 'steps': 95118, 'loss/train': 1.3310158252716064} 11/07/2021 10:36:48 - INFO - __main__ - Step 95120: {'lr': 0.00015130265918151004, 'samples': 18263040, 'steps': 95119, 'loss/train': 0.7121617794036865} 11/07/2021 10:36:48 - INFO - __main__ - Step 95121: {'lr': 0.0001512977835174462, 'samples': 18263232, 'steps': 95120, 'loss/train': 1.6942882537841797} 11/07/2021 10:36:49 - INFO - __main__ - Step 95122: {'lr': 0.0001512929078978561, 'samples': 18263424, 'steps': 95121, 'loss/train': 0.985346794128418} 11/07/2021 10:36:50 - INFO - __main__ - Step 95123: {'lr': 0.00015128803232274186, 'samples': 18263616, 'steps': 95122, 'loss/train': 0.07198744267225266} 11/07/2021 10:36:50 - INFO - __main__ - Step 95124: {'lr': 0.0001512831567921057, 'samples': 18263808, 'steps': 95123, 'loss/train': 1.3095529079437256} 11/07/2021 10:36:50 - INFO - __main__ - Step 95125: {'lr': 0.00015127828130594983, 'samples': 18264000, 'steps': 95124, 'loss/train': 1.5819696187973022} 11/07/2021 10:36:51 - INFO - __main__ - Step 95126: {'lr': 0.00015127340586427646, 'samples': 18264192, 'steps': 95125, 'loss/train': 1.5218706130981445} 11/07/2021 10:36:51 - INFO - __main__ - Step 95127: {'lr': 0.00015126853046708777, 'samples': 18264384, 'steps': 95126, 'loss/train': 1.1592755317687988} 11/07/2021 10:36:51 - INFO - __main__ - Step 95128: {'lr': 0.000151263655114386, 'samples': 18264576, 'steps': 95127, 'loss/train': 0.9829692840576172} 11/07/2021 10:36:52 - INFO - __main__ - Step 95129: {'lr': 0.00015125877980617326, 'samples': 18264768, 'steps': 95128, 'loss/train': 1.2929093837738037} 11/07/2021 10:36:53 - INFO - __main__ - Step 95130: {'lr': 0.00015125390454245177, 'samples': 18264960, 'steps': 95129, 'loss/train': 1.051133155822754} 11/07/2021 10:36:53 - INFO - __main__ - Step 95131: {'lr': 0.00015124902932322376, 'samples': 18265152, 'steps': 95130, 'loss/train': 1.3557498455047607} 11/07/2021 10:36:53 - INFO - __main__ - Step 95132: {'lr': 0.00015124415414849142, 'samples': 18265344, 'steps': 95131, 'loss/train': 1.2309571504592896} 11/07/2021 10:36:54 - INFO - __main__ - Step 95133: {'lr': 0.0001512392790182569, 'samples': 18265536, 'steps': 95132, 'loss/train': 1.5173726081848145} 11/07/2021 10:36:55 - INFO - __main__ - Step 95134: {'lr': 0.00015123440393252248, 'samples': 18265728, 'steps': 95133, 'loss/train': 1.1021764278411865} 11/07/2021 10:36:55 - INFO - __main__ - Step 95135: {'lr': 0.00015122952889129029, 'samples': 18265920, 'steps': 95134, 'loss/train': 1.1774437427520752} 11/07/2021 10:36:55 - INFO - __main__ - Step 95136: {'lr': 0.00015122465389456256, 'samples': 18266112, 'steps': 95135, 'loss/train': 0.7378812432289124} 11/07/2021 10:36:56 - INFO - __main__ - Step 95137: {'lr': 0.00015121977894234145, 'samples': 18266304, 'steps': 95136, 'loss/train': 0.8927878737449646} 11/07/2021 10:36:56 - INFO - __main__ - Step 95138: {'lr': 0.00015121490403462924, 'samples': 18266496, 'steps': 95137, 'loss/train': 1.218191146850586} 11/07/2021 10:36:57 - INFO - __main__ - Step 95139: {'lr': 0.000151210029171428, 'samples': 18266688, 'steps': 95138, 'loss/train': 1.082859754562378} 11/07/2021 10:36:58 - INFO - __main__ - Step 95140: {'lr': 0.00015120515435274018, 'samples': 18266880, 'steps': 95139, 'loss/train': 1.452340006828308} 11/07/2021 10:36:58 - INFO - __main__ - Step 95141: {'lr': 0.00015120027957856764, 'samples': 18267072, 'steps': 95140, 'loss/train': 1.1802855730056763} 11/07/2021 10:36:58 - INFO - __main__ - Step 95142: {'lr': 0.0001511954048489127, 'samples': 18267264, 'steps': 95141, 'loss/train': 1.2270506620407104} 11/07/2021 10:36:59 - INFO - __main__ - Step 95143: {'lr': 0.00015119053016377765, 'samples': 18267456, 'steps': 95142, 'loss/train': 1.7362982034683228} 11/07/2021 10:37:00 - INFO - __main__ - Step 95144: {'lr': 0.0001511856555231646, 'samples': 18267648, 'steps': 95143, 'loss/train': 1.4359638690948486} 11/07/2021 10:37:00 - INFO - __main__ - Step 95145: {'lr': 0.00015118078092707577, 'samples': 18267840, 'steps': 95144, 'loss/train': 1.421521782875061} 11/07/2021 10:37:00 - INFO - __main__ - Step 95146: {'lr': 0.00015117590637551333, 'samples': 18268032, 'steps': 95145, 'loss/train': 1.1570987701416016} 11/07/2021 10:37:01 - INFO - __main__ - Step 95147: {'lr': 0.00015117103186847953, 'samples': 18268224, 'steps': 95146, 'loss/train': 1.994812250137329} 11/07/2021 10:37:01 - INFO - __main__ - Step 95148: {'lr': 0.00015116615740597654, 'samples': 18268416, 'steps': 95147, 'loss/train': 1.047562599182129} 11/07/2021 10:37:02 - INFO - __main__ - Step 95149: {'lr': 0.00015116128298800653, 'samples': 18268608, 'steps': 95148, 'loss/train': 1.426510214805603} 11/07/2021 10:37:02 - INFO - __main__ - Step 95150: {'lr': 0.00015115640861457176, 'samples': 18268800, 'steps': 95149, 'loss/train': 1.112322449684143} 11/07/2021 10:37:03 - INFO - __main__ - Step 95151: {'lr': 0.00015115153428567435, 'samples': 18268992, 'steps': 95150, 'loss/train': 1.2465676069259644} 11/07/2021 10:37:03 - INFO - __main__ - Step 95152: {'lr': 0.00015114666000131652, 'samples': 18269184, 'steps': 95151, 'loss/train': 1.1481095552444458} 11/07/2021 10:37:03 - INFO - __main__ - Step 95153: {'lr': 0.0001511417857615005, 'samples': 18269376, 'steps': 95152, 'loss/train': 1.7525793313980103} 11/07/2021 10:37:04 - INFO - __main__ - Step 95154: {'lr': 0.00015113691156622857, 'samples': 18269568, 'steps': 95153, 'loss/train': 1.3132296800613403} 11/07/2021 10:37:05 - INFO - __main__ - Step 95155: {'lr': 0.00015113203741550275, 'samples': 18269760, 'steps': 95154, 'loss/train': 1.3345966339111328} 11/07/2021 10:37:05 - INFO - __main__ - Step 95156: {'lr': 0.00015112716330932524, 'samples': 18269952, 'steps': 95155, 'loss/train': 0.049902379512786865} 11/07/2021 10:37:06 - INFO - __main__ - Step 95157: {'lr': 0.0001511222892476984, 'samples': 18270144, 'steps': 95156, 'loss/train': 1.0226068496704102} 11/07/2021 10:37:06 - INFO - __main__ - Step 95158: {'lr': 0.00015111741523062423, 'samples': 18270336, 'steps': 95157, 'loss/train': 1.471869945526123} 11/07/2021 10:37:07 - INFO - __main__ - Step 95159: {'lr': 0.00015111254125810508, 'samples': 18270528, 'steps': 95158, 'loss/train': 1.4305390119552612} 11/07/2021 10:37:07 - INFO - __main__ - Step 95160: {'lr': 0.0001511076673301431, 'samples': 18270720, 'steps': 95159, 'loss/train': 1.4236594438552856} 11/07/2021 10:37:08 - INFO - __main__ - Step 95161: {'lr': 0.00015110279344674043, 'samples': 18270912, 'steps': 95160, 'loss/train': 1.2916101217269897} 11/07/2021 10:37:08 - INFO - __main__ - Step 95162: {'lr': 0.00015109791960789937, 'samples': 18271104, 'steps': 95161, 'loss/train': 1.3910728693008423} 11/07/2021 10:37:08 - INFO - __main__ - Step 95163: {'lr': 0.00015109304581362203, 'samples': 18271296, 'steps': 95162, 'loss/train': 1.2314436435699463} 11/07/2021 10:37:09 - INFO - __main__ - Step 95164: {'lr': 0.0001510881720639107, 'samples': 18271488, 'steps': 95163, 'loss/train': 1.0323909521102905} 11/07/2021 10:37:10 - INFO - __main__ - Step 95165: {'lr': 0.00015108329835876745, 'samples': 18271680, 'steps': 95164, 'loss/train': 1.1259486675262451} 11/07/2021 10:37:10 - INFO - __main__ - Step 95166: {'lr': 0.00015107842469819452, 'samples': 18271872, 'steps': 95165, 'loss/train': 1.2758429050445557} 11/07/2021 10:37:10 - INFO - __main__ - Step 95167: {'lr': 0.00015107355108219425, 'samples': 18272064, 'steps': 95166, 'loss/train': 1.2607511281967163} 11/07/2021 10:37:11 - INFO - __main__ - Step 95168: {'lr': 0.00015106867751076865, 'samples': 18272256, 'steps': 95167, 'loss/train': 1.2302613258361816} 11/07/2021 10:37:11 - INFO - __main__ - Step 95169: {'lr': 0.0001510638039839199, 'samples': 18272448, 'steps': 95168, 'loss/train': 1.454773187637329} 11/07/2021 10:37:12 - INFO - __main__ - Step 95170: {'lr': 0.00015105893050165034, 'samples': 18272640, 'steps': 95169, 'loss/train': 1.4137994050979614} 11/07/2021 10:37:12 - INFO - __main__ - Step 95171: {'lr': 0.00015105405706396208, 'samples': 18272832, 'steps': 95170, 'loss/train': 1.3944569826126099} 11/07/2021 10:37:13 - INFO - __main__ - Step 95172: {'lr': 0.00015104918367085736, 'samples': 18273024, 'steps': 95171, 'loss/train': 1.4645366668701172} 11/07/2021 10:37:13 - INFO - __main__ - Step 95173: {'lr': 0.00015104431032233827, 'samples': 18273216, 'steps': 95172, 'loss/train': 1.2490599155426025} 11/07/2021 10:37:14 - INFO - __main__ - Step 95174: {'lr': 0.00015103943701840717, 'samples': 18273408, 'steps': 95173, 'loss/train': 1.4319343566894531} 11/07/2021 10:37:14 - INFO - __main__ - Step 95175: {'lr': 0.00015103456375906613, 'samples': 18273600, 'steps': 95174, 'loss/train': 1.5217441320419312} 11/07/2021 10:37:15 - INFO - __main__ - Step 95176: {'lr': 0.00015102969054431743, 'samples': 18273792, 'steps': 95175, 'loss/train': 1.1956164836883545} 11/07/2021 10:37:15 - INFO - __main__ - Step 95177: {'lr': 0.00015102481737416318, 'samples': 18273984, 'steps': 95176, 'loss/train': 1.35624098777771} 11/07/2021 10:37:16 - INFO - __main__ - Step 95178: {'lr': 0.00015101994424860564, 'samples': 18274176, 'steps': 95177, 'loss/train': 0.7412286996841431} 11/07/2021 10:37:16 - INFO - __main__ - Step 95179: {'lr': 0.00015101507116764695, 'samples': 18274368, 'steps': 95178, 'loss/train': 1.6144076585769653} 11/07/2021 10:37:17 - INFO - __main__ - Step 95180: {'lr': 0.00015101019813128948, 'samples': 18274560, 'steps': 95179, 'loss/train': 1.0913403034210205} 11/07/2021 10:37:17 - INFO - __main__ - Step 95181: {'lr': 0.00015100532513953518, 'samples': 18274752, 'steps': 95180, 'loss/train': 1.4532239437103271} 11/07/2021 10:37:18 - INFO - __main__ - Step 95182: {'lr': 0.00015100045219238636, 'samples': 18274944, 'steps': 95181, 'loss/train': 1.3553847074508667} 11/07/2021 10:37:18 - INFO - __main__ - Step 95183: {'lr': 0.0001509955792898452, 'samples': 18275136, 'steps': 95182, 'loss/train': 1.2059383392333984} 11/07/2021 10:37:18 - INFO - __main__ - Step 95184: {'lr': 0.00015099070643191393, 'samples': 18275328, 'steps': 95183, 'loss/train': 1.7219878435134888} 11/07/2021 10:37:19 - INFO - __main__ - Step 95185: {'lr': 0.0001509858336185947, 'samples': 18275520, 'steps': 95184, 'loss/train': 1.4376801252365112} 11/07/2021 10:37:20 - INFO - __main__ - Step 95186: {'lr': 0.0001509809608498897, 'samples': 18275712, 'steps': 95185, 'loss/train': 0.9128504395484924} 11/07/2021 10:37:20 - INFO - __main__ - Step 95187: {'lr': 0.00015097608812580116, 'samples': 18275904, 'steps': 95186, 'loss/train': 1.8848556280136108} 11/07/2021 10:37:21 - INFO - __main__ - Step 95188: {'lr': 0.0001509712154463313, 'samples': 18276096, 'steps': 95187, 'loss/train': 1.2985308170318604} 11/07/2021 10:37:21 - INFO - __main__ - Step 95189: {'lr': 0.00015096634281148224, 'samples': 18276288, 'steps': 95188, 'loss/train': 0.9041907787322998} 11/07/2021 10:37:21 - INFO - __main__ - Step 95190: {'lr': 0.0001509614702212562, 'samples': 18276480, 'steps': 95189, 'loss/train': 0.7473807334899902} 11/07/2021 10:37:22 - INFO - __main__ - Step 95191: {'lr': 0.00015095659767565546, 'samples': 18276672, 'steps': 95190, 'loss/train': 5.849247932434082} 11/07/2021 10:37:23 - INFO - __main__ - Step 95192: {'lr': 0.00015095172517468213, 'samples': 18276864, 'steps': 95191, 'loss/train': 1.503175973892212} 11/07/2021 10:37:23 - INFO - __main__ - Step 95193: {'lr': 0.0001509468527183384, 'samples': 18277056, 'steps': 95192, 'loss/train': 1.0986080169677734} 11/07/2021 10:37:23 - INFO - __main__ - Step 95194: {'lr': 0.00015094198030662662, 'samples': 18277248, 'steps': 95193, 'loss/train': 1.341729760169983} 11/07/2021 10:37:24 - INFO - __main__ - Step 95195: {'lr': 0.00015093710793954873, 'samples': 18277440, 'steps': 95194, 'loss/train': 1.696678876876831} 11/07/2021 10:37:25 - INFO - __main__ - Step 95196: {'lr': 0.00015093223561710707, 'samples': 18277632, 'steps': 95195, 'loss/train': 1.8405261039733887} 11/07/2021 10:37:25 - INFO - __main__ - Step 95197: {'lr': 0.0001509273633393038, 'samples': 18277824, 'steps': 95196, 'loss/train': 0.9308603405952454} 11/07/2021 10:37:26 - INFO - __main__ - Step 95198: {'lr': 0.00015092249110614114, 'samples': 18278016, 'steps': 95197, 'loss/train': 0.8464120626449585} 11/07/2021 10:37:26 - INFO - __main__ - Step 95199: {'lr': 0.0001509176189176213, 'samples': 18278208, 'steps': 95198, 'loss/train': 1.855670690536499} 11/07/2021 10:37:26 - INFO - __main__ - Step 95200: {'lr': 0.0001509127467737464, 'samples': 18278400, 'steps': 95199, 'loss/train': 1.4713548421859741} 11/07/2021 10:37:27 - INFO - __main__ - Step 95201: {'lr': 0.00015090787467451872, 'samples': 18278592, 'steps': 95200, 'loss/train': 1.410526156425476} 11/07/2021 10:37:28 - INFO - __main__ - Step 95202: {'lr': 0.00015090300261994043, 'samples': 18278784, 'steps': 95201, 'loss/train': 1.619089126586914} 11/07/2021 10:37:28 - INFO - __main__ - Step 95203: {'lr': 0.00015089813061001367, 'samples': 18278976, 'steps': 95202, 'loss/train': 1.5821311473846436} 11/07/2021 10:37:28 - INFO - __main__ - Step 95204: {'lr': 0.00015089325864474075, 'samples': 18279168, 'steps': 95203, 'loss/train': 1.0287022590637207} 11/07/2021 10:37:29 - INFO - __main__ - Step 95205: {'lr': 0.00015088838672412376, 'samples': 18279360, 'steps': 95204, 'loss/train': 1.3128619194030762} 11/07/2021 10:37:30 - INFO - __main__ - Step 95206: {'lr': 0.00015088351484816493, 'samples': 18279552, 'steps': 95205, 'loss/train': 0.6532434821128845} 11/07/2021 10:37:30 - INFO - __main__ - Step 95207: {'lr': 0.00015087864301686657, 'samples': 18279744, 'steps': 95206, 'loss/train': 1.1054195165634155} 11/07/2021 10:37:31 - INFO - __main__ - Step 95208: {'lr': 0.00015087377123023066, 'samples': 18279936, 'steps': 95207, 'loss/train': 1.5287632942199707} 11/07/2021 10:37:31 - INFO - __main__ - Step 95209: {'lr': 0.0001508688994882595, 'samples': 18280128, 'steps': 95208, 'loss/train': 1.742853045463562} 11/07/2021 10:37:31 - INFO - __main__ - Step 95210: {'lr': 0.00015086402779095528, 'samples': 18280320, 'steps': 95209, 'loss/train': 1.451709270477295} 11/07/2021 10:37:32 - INFO - __main__ - Step 95211: {'lr': 0.00015085915613832022, 'samples': 18280512, 'steps': 95210, 'loss/train': 0.9384121298789978} 11/07/2021 10:37:33 - INFO - __main__ - Step 95212: {'lr': 0.00015085428453035646, 'samples': 18280704, 'steps': 95211, 'loss/train': 1.1219099760055542} 11/07/2021 10:37:33 - INFO - __main__ - Step 95213: {'lr': 0.00015084941296706624, 'samples': 18280896, 'steps': 95212, 'loss/train': 1.1480978727340698} 11/07/2021 10:37:33 - INFO - __main__ - Step 95214: {'lr': 0.00015084454144845177, 'samples': 18281088, 'steps': 95213, 'loss/train': 1.5326939821243286} 11/07/2021 10:37:34 - INFO - __main__ - Step 95215: {'lr': 0.0001508396699745152, 'samples': 18281280, 'steps': 95214, 'loss/train': 1.486392855644226} 11/07/2021 10:37:35 - INFO - __main__ - Step 95216: {'lr': 0.00015083479854525875, 'samples': 18281472, 'steps': 95215, 'loss/train': 1.2704161405563354} 11/07/2021 10:37:35 - INFO - __main__ - Step 95217: {'lr': 0.0001508299271606846, 'samples': 18281664, 'steps': 95216, 'loss/train': 1.2278488874435425} 11/07/2021 10:37:35 - INFO - __main__ - Step 95218: {'lr': 0.00015082505582079497, 'samples': 18281856, 'steps': 95217, 'loss/train': 1.494893193244934} 11/07/2021 10:37:36 - INFO - __main__ - Step 95219: {'lr': 0.00015082018452559207, 'samples': 18282048, 'steps': 95218, 'loss/train': 0.8603768348693848} 11/07/2021 10:37:36 - INFO - __main__ - Step 95220: {'lr': 0.00015081531327507802, 'samples': 18282240, 'steps': 95219, 'loss/train': 2.1288342475891113} 11/07/2021 10:37:36 - INFO - __main__ - Step 95221: {'lr': 0.00015081044206925512, 'samples': 18282432, 'steps': 95220, 'loss/train': 1.2151947021484375} 11/07/2021 10:37:37 - INFO - __main__ - Step 95222: {'lr': 0.00015080557090812547, 'samples': 18282624, 'steps': 95221, 'loss/train': 1.0268231630325317} 11/07/2021 10:37:38 - INFO - __main__ - Step 95223: {'lr': 0.00015080069979169126, 'samples': 18282816, 'steps': 95222, 'loss/train': 1.6000860929489136} 11/07/2021 10:37:38 - INFO - __main__ - Step 95224: {'lr': 0.00015079582871995473, 'samples': 18283008, 'steps': 95223, 'loss/train': 1.2900012731552124} 11/07/2021 10:37:38 - INFO - __main__ - Step 95225: {'lr': 0.0001507909576929181, 'samples': 18283200, 'steps': 95224, 'loss/train': 0.997986912727356} 11/07/2021 10:37:39 - INFO - __main__ - Step 95226: {'lr': 0.00015078608671058349, 'samples': 18283392, 'steps': 95225, 'loss/train': 1.289203405380249} 11/07/2021 10:37:40 - INFO - __main__ - Step 95227: {'lr': 0.00015078121577295317, 'samples': 18283584, 'steps': 95226, 'loss/train': 1.3716232776641846} 11/07/2021 10:37:40 - INFO - __main__ - Step 95228: {'lr': 0.00015077634488002927, 'samples': 18283776, 'steps': 95227, 'loss/train': 1.5934234857559204} 11/07/2021 10:37:41 - INFO - __main__ - Step 95229: {'lr': 0.00015077147403181408, 'samples': 18283968, 'steps': 95228, 'loss/train': 2.0980541706085205} 11/07/2021 10:37:41 - INFO - __main__ - Step 95230: {'lr': 0.00015076660322830974, 'samples': 18284160, 'steps': 95229, 'loss/train': 1.583662509918213} 11/07/2021 10:37:41 - INFO - __main__ - Step 95231: {'lr': 0.00015076173246951838, 'samples': 18284352, 'steps': 95230, 'loss/train': 0.8390915989875793} 11/07/2021 10:37:43 - INFO - __main__ - Step 95232: {'lr': 0.00015075686175544228, 'samples': 18284544, 'steps': 95231, 'loss/train': 1.392703652381897} 11/07/2021 10:37:43 - INFO - __main__ - Step 95233: {'lr': 0.00015075199108608356, 'samples': 18284736, 'steps': 95232, 'loss/train': 1.2019821405410767} 11/07/2021 10:37:43 - INFO - __main__ - Step 95234: {'lr': 0.00015074712046144457, 'samples': 18284928, 'steps': 95233, 'loss/train': 1.3409326076507568} 11/07/2021 10:37:44 - INFO - __main__ - Step 95235: {'lr': 0.0001507422498815273, 'samples': 18285120, 'steps': 95234, 'loss/train': 1.2924981117248535} 11/07/2021 10:37:44 - INFO - __main__ - Step 95236: {'lr': 0.0001507373793463341, 'samples': 18285312, 'steps': 95235, 'loss/train': 1.455381155014038} 11/07/2021 10:37:44 - INFO - __main__ - Step 95237: {'lr': 0.00015073250885586702, 'samples': 18285504, 'steps': 95236, 'loss/train': 1.668125033378601} 11/07/2021 10:37:45 - INFO - __main__ - Step 95238: {'lr': 0.00015072763841012841, 'samples': 18285696, 'steps': 95237, 'loss/train': 1.4366408586502075} 11/07/2021 10:37:46 - INFO - __main__ - Step 95239: {'lr': 0.00015072276800912035, 'samples': 18285888, 'steps': 95238, 'loss/train': 1.3423861265182495} 11/07/2021 10:37:46 - INFO - __main__ - Step 95240: {'lr': 0.0001507178976528451, 'samples': 18286080, 'steps': 95239, 'loss/train': 1.5415135622024536} 11/07/2021 10:37:46 - INFO - __main__ - Step 95241: {'lr': 0.00015071302734130488, 'samples': 18286272, 'steps': 95240, 'loss/train': 1.384752869606018} 11/07/2021 10:37:47 - INFO - __main__ - Step 95242: {'lr': 0.0001507081570745018, 'samples': 18286464, 'steps': 95241, 'loss/train': 1.387945532798767} 11/07/2021 10:37:48 - INFO - __main__ - Step 95243: {'lr': 0.00015070328685243807, 'samples': 18286656, 'steps': 95242, 'loss/train': 1.452865719795227} 11/07/2021 10:37:48 - INFO - __main__ - Step 95244: {'lr': 0.0001506984166751159, 'samples': 18286848, 'steps': 95243, 'loss/train': 1.270049810409546} 11/07/2021 10:37:48 - INFO - __main__ - Step 95245: {'lr': 0.00015069354654253752, 'samples': 18287040, 'steps': 95244, 'loss/train': 1.076697826385498} 11/07/2021 10:37:49 - INFO - __main__ - Step 95246: {'lr': 0.00015068867645470508, 'samples': 18287232, 'steps': 95245, 'loss/train': 1.7400137186050415} 11/07/2021 10:37:49 - INFO - __main__ - Step 95247: {'lr': 0.00015068380641162084, 'samples': 18287424, 'steps': 95246, 'loss/train': 1.2770382165908813} 11/07/2021 10:37:50 - INFO - __main__ - Step 95248: {'lr': 0.00015067893641328693, 'samples': 18287616, 'steps': 95247, 'loss/train': 1.3913747072219849} 11/07/2021 10:37:50 - INFO - __main__ - Step 95249: {'lr': 0.0001506740664597055, 'samples': 18287808, 'steps': 95248, 'loss/train': 1.095587968826294} 11/07/2021 10:37:51 - INFO - __main__ - Step 95250: {'lr': 0.00015066919655087885, 'samples': 18288000, 'steps': 95249, 'loss/train': 1.2351995706558228} 11/07/2021 10:37:51 - INFO - __main__ - Step 95251: {'lr': 0.00015066432668680915, 'samples': 18288192, 'steps': 95250, 'loss/train': 1.3040522336959839} 11/07/2021 10:37:51 - INFO - __main__ - Step 95252: {'lr': 0.00015065945686749854, 'samples': 18288384, 'steps': 95251, 'loss/train': 1.5785353183746338} 11/07/2021 10:37:52 - INFO - __main__ - Step 95253: {'lr': 0.00015065458709294922, 'samples': 18288576, 'steps': 95252, 'loss/train': 1.5693329572677612} 11/07/2021 10:37:53 - INFO - __main__ - Step 95254: {'lr': 0.0001506497173631634, 'samples': 18288768, 'steps': 95253, 'loss/train': 1.1000688076019287} 11/07/2021 10:37:53 - INFO - __main__ - Step 95255: {'lr': 0.00015064484767814335, 'samples': 18288960, 'steps': 95254, 'loss/train': 1.4741333723068237} 11/07/2021 10:37:53 - INFO - __main__ - Step 95256: {'lr': 0.00015063997803789115, 'samples': 18289152, 'steps': 95255, 'loss/train': 1.1975828409194946} 11/07/2021 10:37:54 - INFO - __main__ - Step 95257: {'lr': 0.00015063510844240903, 'samples': 18289344, 'steps': 95256, 'loss/train': 1.818924069404602} 11/07/2021 10:37:55 - INFO - __main__ - Step 95258: {'lr': 0.00015063023889169924, 'samples': 18289536, 'steps': 95257, 'loss/train': 1.3406490087509155} 11/07/2021 10:37:55 - INFO - __main__ - Step 95259: {'lr': 0.00015062536938576388, 'samples': 18289728, 'steps': 95258, 'loss/train': 1.2742719650268555} 11/07/2021 10:37:56 - INFO - __main__ - Step 95260: {'lr': 0.00015062049992460526, 'samples': 18289920, 'steps': 95259, 'loss/train': 0.9148750305175781} 11/07/2021 10:37:56 - INFO - __main__ - Step 95261: {'lr': 0.0001506156305082255, 'samples': 18290112, 'steps': 95260, 'loss/train': 1.4508699178695679} 11/07/2021 10:37:56 - INFO - __main__ - Step 95262: {'lr': 0.00015061076113662684, 'samples': 18290304, 'steps': 95261, 'loss/train': 1.450221300125122} 11/07/2021 10:37:57 - INFO - __main__ - Step 95263: {'lr': 0.00015060589180981138, 'samples': 18290496, 'steps': 95262, 'loss/train': 1.5949655771255493} 11/07/2021 10:37:58 - INFO - __main__ - Step 95264: {'lr': 0.00015060102252778136, 'samples': 18290688, 'steps': 95263, 'loss/train': 0.6340125799179077} 11/07/2021 10:37:58 - INFO - __main__ - Step 95265: {'lr': 0.000150596153290539, 'samples': 18290880, 'steps': 95264, 'loss/train': 1.777541160583496} 11/07/2021 10:37:58 - INFO - __main__ - Step 95266: {'lr': 0.00015059128409808641, 'samples': 18291072, 'steps': 95265, 'loss/train': 1.3134160041809082} 11/07/2021 10:37:59 - INFO - __main__ - Step 95267: {'lr': 0.00015058641495042596, 'samples': 18291264, 'steps': 95266, 'loss/train': 1.2295538187026978} 11/07/2021 10:38:00 - INFO - __main__ - Step 95268: {'lr': 0.00015058154584755967, 'samples': 18291456, 'steps': 95267, 'loss/train': 0.9980098009109497} 11/07/2021 10:38:00 - INFO - __main__ - Step 95269: {'lr': 0.00015057667678948982, 'samples': 18291648, 'steps': 95268, 'loss/train': 1.6445368528366089} 11/07/2021 10:38:00 - INFO - __main__ - Step 95270: {'lr': 0.0001505718077762186, 'samples': 18291840, 'steps': 95269, 'loss/train': 1.4989113807678223} 11/07/2021 10:38:01 - INFO - __main__ - Step 95271: {'lr': 0.00015056693880774816, 'samples': 18292032, 'steps': 95270, 'loss/train': 1.7908464670181274} 11/07/2021 10:38:01 - INFO - __main__ - Step 95272: {'lr': 0.00015056206988408075, 'samples': 18292224, 'steps': 95271, 'loss/train': 0.9436767101287842} 11/07/2021 10:38:02 - INFO - __main__ - Step 95273: {'lr': 0.00015055720100521852, 'samples': 18292416, 'steps': 95272, 'loss/train': 1.2743319272994995} 11/07/2021 10:38:03 - INFO - __main__ - Step 95274: {'lr': 0.00015055233217116368, 'samples': 18292608, 'steps': 95273, 'loss/train': 1.4380706548690796} 11/07/2021 10:38:03 - INFO - __main__ - Step 95275: {'lr': 0.00015054746338191854, 'samples': 18292800, 'steps': 95274, 'loss/train': 0.5034703016281128} 11/07/2021 10:38:03 - INFO - __main__ - Step 95276: {'lr': 0.00015054259463748507, 'samples': 18292992, 'steps': 95275, 'loss/train': 1.5190918445587158} 11/07/2021 10:38:04 - INFO - __main__ - Step 95277: {'lr': 0.00015053772593786558, 'samples': 18293184, 'steps': 95276, 'loss/train': 1.0512486696243286} 11/07/2021 10:38:05 - INFO - __main__ - Step 95278: {'lr': 0.00015053285728306224, 'samples': 18293376, 'steps': 95277, 'loss/train': 1.5712456703186035} 11/07/2021 10:38:05 - INFO - __main__ - Step 95279: {'lr': 0.00015052798867307726, 'samples': 18293568, 'steps': 95278, 'loss/train': 1.355783462524414} 11/07/2021 10:38:05 - INFO - __main__ - Step 95280: {'lr': 0.00015052312010791285, 'samples': 18293760, 'steps': 95279, 'loss/train': 1.3883110284805298} 11/07/2021 10:38:06 - INFO - __main__ - Step 95281: {'lr': 0.00015051825158757115, 'samples': 18293952, 'steps': 95280, 'loss/train': 1.2194100618362427} 11/07/2021 10:38:06 - INFO - __main__ - Step 95282: {'lr': 0.00015051338311205444, 'samples': 18294144, 'steps': 95281, 'loss/train': 1.3751616477966309} 11/07/2021 10:38:06 - INFO - __main__ - Step 95283: {'lr': 0.00015050851468136485, 'samples': 18294336, 'steps': 95282, 'loss/train': 1.3278659582138062} 11/07/2021 10:38:07 - INFO - __main__ - Step 95284: {'lr': 0.00015050364629550455, 'samples': 18294528, 'steps': 95283, 'loss/train': 1.6050347089767456} 11/07/2021 10:38:08 - INFO - __main__ - Step 95285: {'lr': 0.00015049877795447582, 'samples': 18294720, 'steps': 95284, 'loss/train': 1.5077862739562988} 11/07/2021 10:38:08 - INFO - __main__ - Step 95286: {'lr': 0.0001504939096582808, 'samples': 18294912, 'steps': 95285, 'loss/train': 1.2653300762176514} 11/07/2021 10:38:08 - INFO - __main__ - Step 95287: {'lr': 0.00015048904140692166, 'samples': 18295104, 'steps': 95286, 'loss/train': 1.367114782333374} 11/07/2021 10:38:09 - INFO - __main__ - Step 95288: {'lr': 0.00015048417320040076, 'samples': 18295296, 'steps': 95287, 'loss/train': 1.6615434885025024} 11/07/2021 10:38:10 - INFO - __main__ - Step 95289: {'lr': 0.00015047930503872003, 'samples': 18295488, 'steps': 95288, 'loss/train': 1.5006133317947388} 11/07/2021 10:38:10 - INFO - __main__ - Step 95290: {'lr': 0.00015047443692188178, 'samples': 18295680, 'steps': 95289, 'loss/train': 1.016829252243042} 11/07/2021 10:38:11 - INFO - __main__ - Step 95291: {'lr': 0.00015046956884988823, 'samples': 18295872, 'steps': 95290, 'loss/train': 1.2303732633590698} 11/07/2021 10:38:11 - INFO - __main__ - Step 95292: {'lr': 0.00015046470082274156, 'samples': 18296064, 'steps': 95291, 'loss/train': 1.4359309673309326} 11/07/2021 10:38:11 - INFO - __main__ - Step 95293: {'lr': 0.00015045983284044397, 'samples': 18296256, 'steps': 95292, 'loss/train': 1.4413111209869385} 11/07/2021 10:38:12 - INFO - __main__ - Step 95294: {'lr': 0.0001504549649029976, 'samples': 18296448, 'steps': 95293, 'loss/train': 1.0547717809677124} 11/07/2021 10:38:13 - INFO - __main__ - Step 95295: {'lr': 0.00015045009701040473, 'samples': 18296640, 'steps': 95294, 'loss/train': 1.8760169744491577} 11/07/2021 10:38:13 - INFO - __main__ - Step 95296: {'lr': 0.00015044522916266747, 'samples': 18296832, 'steps': 95295, 'loss/train': 1.244678258895874} 11/07/2021 10:38:13 - INFO - __main__ - Step 95297: {'lr': 0.0001504403613597881, 'samples': 18297024, 'steps': 95296, 'loss/train': 1.1741278171539307} 11/07/2021 10:38:14 - INFO - __main__ - Step 95298: {'lr': 0.00015043549360176873, 'samples': 18297216, 'steps': 95297, 'loss/train': 0.8845700025558472} 11/07/2021 10:38:15 - INFO - __main__ - Step 95299: {'lr': 0.00015043062588861162, 'samples': 18297408, 'steps': 95298, 'loss/train': 1.305566430091858} 11/07/2021 10:38:15 - INFO - __main__ - Step 95300: {'lr': 0.0001504257582203189, 'samples': 18297600, 'steps': 95299, 'loss/train': 1.4262382984161377} 11/07/2021 10:38:15 - INFO - __main__ - Step 95301: {'lr': 0.0001504208905968929, 'samples': 18297792, 'steps': 95300, 'loss/train': 1.5217643976211548} 11/07/2021 10:38:16 - INFO - __main__ - Step 95302: {'lr': 0.00015041602301833561, 'samples': 18297984, 'steps': 95301, 'loss/train': 1.3591642379760742} 11/07/2021 10:38:16 - INFO - __main__ - Step 95303: {'lr': 0.00015041115548464936, 'samples': 18298176, 'steps': 95302, 'loss/train': 0.9703762531280518} 11/07/2021 10:38:17 - INFO - __main__ - Step 95304: {'lr': 0.00015040628799583628, 'samples': 18298368, 'steps': 95303, 'loss/train': 1.443824291229248} 11/07/2021 10:38:18 - INFO - __main__ - Step 95305: {'lr': 0.0001504014205518986, 'samples': 18298560, 'steps': 95304, 'loss/train': 1.3005961179733276} 11/07/2021 10:38:18 - INFO - __main__ - Step 95306: {'lr': 0.00015039655315283852, 'samples': 18298752, 'steps': 95305, 'loss/train': 1.5780715942382812} 11/07/2021 10:38:18 - INFO - __main__ - Step 95307: {'lr': 0.00015039168579865817, 'samples': 18298944, 'steps': 95306, 'loss/train': 1.1323119401931763} 11/07/2021 10:38:19 - INFO - __main__ - Step 95308: {'lr': 0.0001503868184893598, 'samples': 18299136, 'steps': 95307, 'loss/train': 0.8696303963661194} 11/07/2021 10:38:19 - INFO - __main__ - Step 95309: {'lr': 0.00015038195122494562, 'samples': 18299328, 'steps': 95308, 'loss/train': 1.2941653728485107} 11/07/2021 10:38:20 - INFO - __main__ - Step 95310: {'lr': 0.00015037708400541776, 'samples': 18299520, 'steps': 95309, 'loss/train': 0.6325526237487793} 11/07/2021 10:38:20 - INFO - __main__ - Step 95311: {'lr': 0.0001503722168307785, 'samples': 18299712, 'steps': 95310, 'loss/train': 1.2719974517822266} 11/07/2021 10:38:21 - INFO - __main__ - Step 95312: {'lr': 0.00015036734970102995, 'samples': 18299904, 'steps': 95311, 'loss/train': 1.3413232564926147} 11/07/2021 10:38:21 - INFO - __main__ - Step 95313: {'lr': 0.00015036248261617434, 'samples': 18300096, 'steps': 95312, 'loss/train': 1.238427758216858} 11/07/2021 10:38:21 - INFO - __main__ - Step 95314: {'lr': 0.00015035761557621386, 'samples': 18300288, 'steps': 95313, 'loss/train': 1.5208383798599243} 11/07/2021 10:38:22 - INFO - __main__ - Step 95315: {'lr': 0.0001503527485811508, 'samples': 18300480, 'steps': 95314, 'loss/train': 1.3837451934814453} 11/07/2021 10:38:23 - INFO - __main__ - Step 95316: {'lr': 0.0001503478816309871, 'samples': 18300672, 'steps': 95315, 'loss/train': 0.4713139235973358} 11/07/2021 10:38:23 - INFO - __main__ - Step 95317: {'lr': 0.00015034301472572516, 'samples': 18300864, 'steps': 95316, 'loss/train': 1.1944061517715454} 11/07/2021 10:38:23 - INFO - __main__ - Step 95318: {'lr': 0.00015033814786536714, 'samples': 18301056, 'steps': 95317, 'loss/train': 1.5147607326507568} 11/07/2021 10:38:24 - INFO - __main__ - Step 95319: {'lr': 0.00015033328104991516, 'samples': 18301248, 'steps': 95318, 'loss/train': 0.9084166288375854} 11/07/2021 10:38:25 - INFO - __main__ - Step 95320: {'lr': 0.0001503284142793715, 'samples': 18301440, 'steps': 95319, 'loss/train': 0.9924030303955078} 11/07/2021 10:38:25 - INFO - __main__ - Step 95321: {'lr': 0.00015032354755373833, 'samples': 18301632, 'steps': 95320, 'loss/train': 1.3695135116577148} 11/07/2021 10:38:25 - INFO - __main__ - Step 95322: {'lr': 0.0001503186808730178, 'samples': 18301824, 'steps': 95321, 'loss/train': 1.1534732580184937} 11/07/2021 10:38:26 - INFO - __main__ - Step 95323: {'lr': 0.00015031381423721217, 'samples': 18302016, 'steps': 95322, 'loss/train': 1.8322160243988037} 11/07/2021 10:38:26 - INFO - __main__ - Step 95324: {'lr': 0.00015030894764632357, 'samples': 18302208, 'steps': 95323, 'loss/train': 1.4271429777145386} 11/07/2021 10:38:27 - INFO - __main__ - Step 95325: {'lr': 0.00015030408110035422, 'samples': 18302400, 'steps': 95324, 'loss/train': 1.486147403717041} 11/07/2021 10:38:28 - INFO - __main__ - Step 95326: {'lr': 0.00015029921459930632, 'samples': 18302592, 'steps': 95325, 'loss/train': 1.1791197061538696} 11/07/2021 10:38:28 - INFO - __main__ - Step 95327: {'lr': 0.00015029434814318204, 'samples': 18302784, 'steps': 95326, 'loss/train': 1.176578164100647} 11/07/2021 10:38:28 - INFO - __main__ - Step 95328: {'lr': 0.00015028948173198371, 'samples': 18302976, 'steps': 95327, 'loss/train': 1.4115227460861206} 11/07/2021 10:38:29 - INFO - __main__ - Step 95329: {'lr': 0.0001502846153657133, 'samples': 18303168, 'steps': 95328, 'loss/train': 1.09663987159729} 11/07/2021 10:38:30 - INFO - __main__ - Step 95330: {'lr': 0.0001502797490443731, 'samples': 18303360, 'steps': 95329, 'loss/train': 1.3401896953582764} 11/07/2021 10:38:30 - INFO - __main__ - Step 95331: {'lr': 0.00015027488276796527, 'samples': 18303552, 'steps': 95330, 'loss/train': 1.708470106124878} 11/07/2021 10:38:30 - INFO - __main__ - Step 95332: {'lr': 0.00015027001653649207, 'samples': 18303744, 'steps': 95331, 'loss/train': 1.2834707498550415} 11/07/2021 10:38:31 - INFO - __main__ - Step 95333: {'lr': 0.0001502651503499557, 'samples': 18303936, 'steps': 95332, 'loss/train': 0.7135066390037537} 11/07/2021 10:38:31 - INFO - __main__ - Step 95334: {'lr': 0.00015026028420835825, 'samples': 18304128, 'steps': 95333, 'loss/train': 1.0218181610107422} 11/07/2021 10:38:32 - INFO - __main__ - Step 95335: {'lr': 0.00015025541811170202, 'samples': 18304320, 'steps': 95334, 'loss/train': 1.5796983242034912} 11/07/2021 10:38:32 - INFO - __main__ - Step 95336: {'lr': 0.0001502505520599891, 'samples': 18304512, 'steps': 95335, 'loss/train': 1.2393343448638916} 11/07/2021 10:38:33 - INFO - __main__ - Step 95337: {'lr': 0.0001502456860532218, 'samples': 18304704, 'steps': 95336, 'loss/train': 1.3339077234268188} 11/07/2021 10:38:33 - INFO - __main__ - Step 95338: {'lr': 0.00015024082009140226, 'samples': 18304896, 'steps': 95337, 'loss/train': 1.5886223316192627} 11/07/2021 10:38:33 - INFO - __main__ - Step 95339: {'lr': 0.00015023595417453263, 'samples': 18305088, 'steps': 95338, 'loss/train': 1.282052993774414} 11/07/2021 10:38:34 - INFO - __main__ - Step 95340: {'lr': 0.00015023108830261516, 'samples': 18305280, 'steps': 95339, 'loss/train': 1.2844717502593994} 11/07/2021 10:38:35 - INFO - __main__ - Step 95341: {'lr': 0.00015022622247565202, 'samples': 18305472, 'steps': 95340, 'loss/train': 1.1555452346801758} 11/07/2021 10:38:35 - INFO - __main__ - Step 95342: {'lr': 0.0001502213566936455, 'samples': 18305664, 'steps': 95341, 'loss/train': 1.502536416053772} 11/07/2021 10:38:35 - INFO - __main__ - Step 95343: {'lr': 0.00015021649095659761, 'samples': 18305856, 'steps': 95342, 'loss/train': 1.260650634765625} 11/07/2021 10:38:36 - INFO - __main__ - Step 95344: {'lr': 0.0001502116252645106, 'samples': 18306048, 'steps': 95343, 'loss/train': 0.6812189221382141} 11/07/2021 10:38:36 - INFO - __main__ - Step 95345: {'lr': 0.0001502067596173867, 'samples': 18306240, 'steps': 95344, 'loss/train': 1.6726237535476685} 11/07/2021 10:38:37 - INFO - __main__ - Step 95346: {'lr': 0.00015020189401522812, 'samples': 18306432, 'steps': 95345, 'loss/train': 1.2950952053070068} 11/07/2021 10:38:37 - INFO - __main__ - Step 95347: {'lr': 0.000150197028458037, 'samples': 18306624, 'steps': 95346, 'loss/train': 1.323083519935608} 11/07/2021 10:38:38 - INFO - __main__ - Step 95348: {'lr': 0.0001501921629458156, 'samples': 18306816, 'steps': 95347, 'loss/train': 1.3360350131988525} 11/07/2021 10:38:38 - INFO - __main__ - Step 95349: {'lr': 0.000150187297478566, 'samples': 18307008, 'steps': 95348, 'loss/train': 1.4987680912017822} 11/07/2021 10:38:39 - INFO - __main__ - Step 95350: {'lr': 0.00015018243205629054, 'samples': 18307200, 'steps': 95349, 'loss/train': 1.239450454711914} 11/07/2021 10:38:40 - INFO - __main__ - Step 95351: {'lr': 0.00015017756667899128, 'samples': 18307392, 'steps': 95350, 'loss/train': 1.5887396335601807} 11/07/2021 10:38:40 - INFO - __main__ - Step 95352: {'lr': 0.0001501727013466705, 'samples': 18307584, 'steps': 95351, 'loss/train': 1.4185417890548706} 11/07/2021 10:38:40 - INFO - __main__ - Step 95353: {'lr': 0.0001501678360593304, 'samples': 18307776, 'steps': 95352, 'loss/train': 0.9178551435470581} 11/07/2021 10:38:41 - INFO - __main__ - Step 95354: {'lr': 0.00015016297081697308, 'samples': 18307968, 'steps': 95353, 'loss/train': 1.1968106031417847} 11/07/2021 10:38:41 - INFO - __main__ - Step 95355: {'lr': 0.00015015810561960086, 'samples': 18308160, 'steps': 95354, 'loss/train': 1.503665804862976} 11/07/2021 10:38:42 - INFO - __main__ - Step 95356: {'lr': 0.00015015324046721576, 'samples': 18308352, 'steps': 95355, 'loss/train': 1.4506536722183228} 11/07/2021 10:38:43 - INFO - __main__ - Step 95357: {'lr': 0.0001501483753598201, 'samples': 18308544, 'steps': 95356, 'loss/train': 1.3866173028945923} 11/07/2021 10:38:43 - INFO - __main__ - Step 95358: {'lr': 0.00015014351029741602, 'samples': 18308736, 'steps': 95357, 'loss/train': 1.7518471479415894} 11/07/2021 10:38:43 - INFO - __main__ - Step 95359: {'lr': 0.00015013864528000577, 'samples': 18308928, 'steps': 95358, 'loss/train': 0.08325207233428955} 11/07/2021 10:38:44 - INFO - __main__ - Step 95360: {'lr': 0.00015013378030759146, 'samples': 18309120, 'steps': 95359, 'loss/train': 0.8800830245018005} 11/07/2021 10:38:45 - INFO - __main__ - Step 95361: {'lr': 0.00015012891538017536, 'samples': 18309312, 'steps': 95360, 'loss/train': 1.1712861061096191} 11/07/2021 10:38:45 - INFO - __main__ - Step 95362: {'lr': 0.00015012405049775963, 'samples': 18309504, 'steps': 95361, 'loss/train': 1.1668187379837036} 11/07/2021 10:38:46 - INFO - __main__ - Step 95363: {'lr': 0.00015011918566034643, 'samples': 18309696, 'steps': 95362, 'loss/train': 1.3444228172302246} 11/07/2021 10:38:46 - INFO - __main__ - Step 95364: {'lr': 0.0001501143208679381, 'samples': 18309888, 'steps': 95363, 'loss/train': 0.9231928586959839} 11/07/2021 10:38:46 - INFO - __main__ - Step 95365: {'lr': 0.00015010945612053657, 'samples': 18310080, 'steps': 95364, 'loss/train': 1.2419100999832153} 11/07/2021 10:38:47 - INFO - __main__ - Step 95366: {'lr': 0.00015010459141814425, 'samples': 18310272, 'steps': 95365, 'loss/train': 1.442260980606079} 11/07/2021 10:38:48 - INFO - __main__ - Step 95367: {'lr': 0.00015009972676076322, 'samples': 18310464, 'steps': 95366, 'loss/train': 1.6334824562072754} 11/07/2021 10:38:48 - INFO - __main__ - Step 95368: {'lr': 0.00015009486214839573, 'samples': 18310656, 'steps': 95367, 'loss/train': 1.5761182308197021} 11/07/2021 10:38:48 - INFO - __main__ - Step 95369: {'lr': 0.00015008999758104404, 'samples': 18310848, 'steps': 95368, 'loss/train': 1.397354006767273} 11/07/2021 10:38:49 - INFO - __main__ - Step 95370: {'lr': 0.00015008513305871012, 'samples': 18311040, 'steps': 95369, 'loss/train': 1.4016624689102173} 11/07/2021 10:38:49 - INFO - __main__ - Step 95371: {'lr': 0.00015008026858139638, 'samples': 18311232, 'steps': 95370, 'loss/train': 1.450921654701233} 11/07/2021 10:38:50 - INFO - __main__ - Step 95372: {'lr': 0.0001500754041491049, 'samples': 18311424, 'steps': 95371, 'loss/train': 0.4893127381801605} 11/07/2021 10:38:50 - INFO - __main__ - Step 95373: {'lr': 0.00015007053976183788, 'samples': 18311616, 'steps': 95372, 'loss/train': 1.4265090227127075} 11/07/2021 10:38:51 - INFO - __main__ - Step 95374: {'lr': 0.00015006567541959754, 'samples': 18311808, 'steps': 95373, 'loss/train': 1.3818339109420776} 11/07/2021 10:38:51 - INFO - __main__ - Step 95375: {'lr': 0.00015006081112238612, 'samples': 18312000, 'steps': 95374, 'loss/train': 1.2199454307556152} 11/07/2021 10:38:51 - INFO - __main__ - Step 95376: {'lr': 0.00015005594687020574, 'samples': 18312192, 'steps': 95375, 'loss/train': 0.7833857536315918} 11/07/2021 10:38:53 - INFO - __main__ - Step 95377: {'lr': 0.00015005108266305856, 'samples': 18312384, 'steps': 95376, 'loss/train': 1.2190965414047241} 11/07/2021 10:38:53 - INFO - __main__ - Step 95378: {'lr': 0.00015004621850094686, 'samples': 18312576, 'steps': 95377, 'loss/train': 1.4213746786117554} 11/07/2021 10:38:53 - INFO - __main__ - Step 95379: {'lr': 0.00015004135438387276, 'samples': 18312768, 'steps': 95378, 'loss/train': 1.2561416625976562} 11/07/2021 10:38:54 - INFO - __main__ - Step 95380: {'lr': 0.00015003649031183848, 'samples': 18312960, 'steps': 95379, 'loss/train': 1.2381805181503296} 11/07/2021 10:38:54 - INFO - __main__ - Step 95381: {'lr': 0.00015003162628484624, 'samples': 18313152, 'steps': 95380, 'loss/train': 1.5014196634292603} 11/07/2021 10:38:54 - INFO - __main__ - Step 95382: {'lr': 0.00015002676230289826, 'samples': 18313344, 'steps': 95381, 'loss/train': 1.2605427503585815} 11/07/2021 10:38:55 - INFO - __main__ - Step 95383: {'lr': 0.00015002189836599658, 'samples': 18313536, 'steps': 95382, 'loss/train': 0.0943828746676445} 11/07/2021 10:38:56 - INFO - __main__ - Step 95384: {'lr': 0.00015001703447414352, 'samples': 18313728, 'steps': 95383, 'loss/train': 1.3663173913955688} 11/07/2021 10:38:56 - INFO - __main__ - Step 95385: {'lr': 0.00015001217062734124, 'samples': 18313920, 'steps': 95384, 'loss/train': 1.189072847366333} 11/07/2021 10:38:56 - INFO - __main__ - Step 95386: {'lr': 0.000150007306825592, 'samples': 18314112, 'steps': 95385, 'loss/train': 1.2865104675292969} 11/07/2021 10:38:57 - INFO - __main__ - Step 95387: {'lr': 0.0001500024430688979, 'samples': 18314304, 'steps': 95386, 'loss/train': 0.6391265988349915} 11/07/2021 10:38:58 - INFO - __main__ - Step 95388: {'lr': 0.00014999757935726108, 'samples': 18314496, 'steps': 95387, 'loss/train': 1.0581780672073364} 11/07/2021 10:38:58 - INFO - __main__ - Step 95389: {'lr': 0.00014999271569068385, 'samples': 18314688, 'steps': 95388, 'loss/train': 1.7660224437713623} 11/07/2021 10:38:59 - INFO - __main__ - Step 95390: {'lr': 0.00014998785206916834, 'samples': 18314880, 'steps': 95389, 'loss/train': 1.211909294128418} 11/07/2021 10:38:59 - INFO - __main__ - Step 95391: {'lr': 0.0001499829884927168, 'samples': 18315072, 'steps': 95390, 'loss/train': 1.6129944324493408} 11/07/2021 10:38:59 - INFO - __main__ - Step 95392: {'lr': 0.00014997812496133134, 'samples': 18315264, 'steps': 95391, 'loss/train': 1.5948344469070435} 11/07/2021 10:39:00 - INFO - __main__ - Step 95393: {'lr': 0.00014997326147501422, 'samples': 18315456, 'steps': 95392, 'loss/train': 1.52908194065094} 11/07/2021 10:39:01 - INFO - __main__ - Step 95394: {'lr': 0.00014996839803376762, 'samples': 18315648, 'steps': 95393, 'loss/train': 1.2490668296813965} 11/07/2021 10:39:01 - INFO - __main__ - Step 95395: {'lr': 0.00014996353463759366, 'samples': 18315840, 'steps': 95394, 'loss/train': 1.2839101552963257} 11/07/2021 10:39:01 - INFO - __main__ - Step 95396: {'lr': 0.00014995867128649466, 'samples': 18316032, 'steps': 95395, 'loss/train': 1.3388582468032837} 11/07/2021 10:39:02 - INFO - __main__ - Step 95397: {'lr': 0.00014995380798047276, 'samples': 18316224, 'steps': 95396, 'loss/train': 1.336898922920227} 11/07/2021 10:39:03 - INFO - __main__ - Step 95398: {'lr': 0.00014994894471953007, 'samples': 18316416, 'steps': 95397, 'loss/train': 1.3504478931427002} 11/07/2021 10:39:03 - INFO - __main__ - Step 95399: {'lr': 0.00014994408150366883, 'samples': 18316608, 'steps': 95398, 'loss/train': 1.4911506175994873} 11/07/2021 10:39:03 - INFO - __main__ - Step 95400: {'lr': 0.00014993921833289127, 'samples': 18316800, 'steps': 95399, 'loss/train': 0.7250443696975708} 11/07/2021 10:39:04 - INFO - __main__ - Step 95401: {'lr': 0.00014993435520719954, 'samples': 18316992, 'steps': 95400, 'loss/train': 1.645056962966919} 11/07/2021 10:39:04 - INFO - __main__ - Step 95402: {'lr': 0.00014992949212659586, 'samples': 18317184, 'steps': 95401, 'loss/train': 0.9273329377174377} 11/07/2021 10:39:05 - INFO - __main__ - Step 95403: {'lr': 0.00014992462909108235, 'samples': 18317376, 'steps': 95402, 'loss/train': 1.2135767936706543} 11/07/2021 10:39:06 - INFO - __main__ - Step 95404: {'lr': 0.0001499197661006613, 'samples': 18317568, 'steps': 95403, 'loss/train': 1.3064804077148438} 11/07/2021 10:39:06 - INFO - __main__ - Step 95405: {'lr': 0.00014991490315533485, 'samples': 18317760, 'steps': 95404, 'loss/train': 0.7928490042686462} 11/07/2021 10:39:06 - INFO - __main__ - Step 95406: {'lr': 0.00014991004025510522, 'samples': 18317952, 'steps': 95405, 'loss/train': 1.3571062088012695} 11/07/2021 10:39:07 - INFO - __main__ - Step 95407: {'lr': 0.00014990517739997455, 'samples': 18318144, 'steps': 95406, 'loss/train': 1.2878576517105103} 11/07/2021 10:39:08 - INFO - __main__ - Step 95408: {'lr': 0.00014990031458994506, 'samples': 18318336, 'steps': 95407, 'loss/train': 0.9430919885635376} 11/07/2021 10:39:08 - INFO - __main__ - Step 95409: {'lr': 0.0001498954518250191, 'samples': 18318528, 'steps': 95408, 'loss/train': 1.1271514892578125} 11/07/2021 10:39:08 - INFO - __main__ - Step 95410: {'lr': 0.00014989058910519856, 'samples': 18318720, 'steps': 95409, 'loss/train': 1.120370864868164} 11/07/2021 10:39:09 - INFO - __main__ - Step 95411: {'lr': 0.0001498857264304858, 'samples': 18318912, 'steps': 95410, 'loss/train': 1.2790172100067139} 11/07/2021 10:39:09 - INFO - __main__ - Step 95412: {'lr': 0.00014988086380088295, 'samples': 18319104, 'steps': 95411, 'loss/train': 1.5963962078094482} 11/07/2021 10:39:09 - INFO - __main__ - Step 95413: {'lr': 0.0001498760012163923, 'samples': 18319296, 'steps': 95412, 'loss/train': 1.0195789337158203} 11/07/2021 10:39:11 - INFO - __main__ - Step 95414: {'lr': 0.0001498711386770159, 'samples': 18319488, 'steps': 95413, 'loss/train': 1.4361991882324219} 11/07/2021 10:39:11 - INFO - __main__ - Step 95415: {'lr': 0.00014986627618275605, 'samples': 18319680, 'steps': 95414, 'loss/train': 1.1908724308013916} 11/07/2021 10:39:12 - INFO - __main__ - Step 95416: {'lr': 0.00014986141373361496, 'samples': 18319872, 'steps': 95415, 'loss/train': 0.14821040630340576} 11/07/2021 10:39:12 - INFO - __main__ - Step 95417: {'lr': 0.00014985655132959469, 'samples': 18320064, 'steps': 95416, 'loss/train': 0.29895728826522827} 11/07/2021 10:39:12 - INFO - __main__ - Step 95418: {'lr': 0.00014985168897069758, 'samples': 18320256, 'steps': 95417, 'loss/train': 0.8924899101257324} 11/07/2021 10:39:13 - INFO - __main__ - Step 95419: {'lr': 0.00014984682665692572, 'samples': 18320448, 'steps': 95418, 'loss/train': 1.367430329322815} 11/07/2021 10:39:14 - INFO - __main__ - Step 95420: {'lr': 0.00014984196438828134, 'samples': 18320640, 'steps': 95419, 'loss/train': 1.155451774597168} 11/07/2021 10:39:14 - INFO - __main__ - Step 95421: {'lr': 0.00014983710216476663, 'samples': 18320832, 'steps': 95420, 'loss/train': 1.3783631324768066} 11/07/2021 10:39:14 - INFO - __main__ - Step 95422: {'lr': 0.00014983223998638384, 'samples': 18321024, 'steps': 95421, 'loss/train': 1.408866286277771} 11/07/2021 10:39:15 - INFO - __main__ - Step 95423: {'lr': 0.00014982737785313504, 'samples': 18321216, 'steps': 95422, 'loss/train': 0.725254476070404} 11/07/2021 10:39:16 - INFO - __main__ - Step 95424: {'lr': 0.0001498225157650225, 'samples': 18321408, 'steps': 95423, 'loss/train': 0.6615182757377625} 11/07/2021 10:39:16 - INFO - __main__ - Step 95425: {'lr': 0.00014981765372204834, 'samples': 18321600, 'steps': 95424, 'loss/train': 1.3633216619491577} 11/07/2021 10:39:16 - INFO - __main__ - Step 95426: {'lr': 0.00014981279172421482, 'samples': 18321792, 'steps': 95425, 'loss/train': 1.728379487991333} 11/07/2021 10:39:17 - INFO - __main__ - Step 95427: {'lr': 0.00014980792977152408, 'samples': 18321984, 'steps': 95426, 'loss/train': 1.4678833484649658} 11/07/2021 10:39:17 - INFO - __main__ - Step 95428: {'lr': 0.00014980306786397838, 'samples': 18322176, 'steps': 95427, 'loss/train': 1.4456497430801392} 11/07/2021 10:39:18 - INFO - __main__ - Step 95429: {'lr': 0.00014979820600157984, 'samples': 18322368, 'steps': 95428, 'loss/train': 1.3368200063705444} 11/07/2021 10:39:19 - INFO - __main__ - Step 95430: {'lr': 0.00014979334418433073, 'samples': 18322560, 'steps': 95429, 'loss/train': 1.215494155883789} 11/07/2021 10:39:19 - INFO - __main__ - Step 95431: {'lr': 0.00014978848241223314, 'samples': 18322752, 'steps': 95430, 'loss/train': 1.3804965019226074} 11/07/2021 10:39:19 - INFO - __main__ - Step 95432: {'lr': 0.00014978362068528934, 'samples': 18322944, 'steps': 95431, 'loss/train': 1.2693195343017578} 11/07/2021 10:39:20 - INFO - __main__ - Step 95433: {'lr': 0.0001497787590035015, 'samples': 18323136, 'steps': 95432, 'loss/train': 1.7240864038467407} 11/07/2021 10:39:20 - INFO - __main__ - Step 95434: {'lr': 0.0001497738973668718, 'samples': 18323328, 'steps': 95433, 'loss/train': 1.2246888875961304} 11/07/2021 10:39:21 - INFO - __main__ - Step 95435: {'lr': 0.0001497690357754024, 'samples': 18323520, 'steps': 95434, 'loss/train': 1.843514084815979} 11/07/2021 10:39:21 - INFO - __main__ - Step 95436: {'lr': 0.00014976417422909565, 'samples': 18323712, 'steps': 95435, 'loss/train': 1.6169657707214355} 11/07/2021 10:39:22 - INFO - __main__ - Step 95437: {'lr': 0.00014975931272795355, 'samples': 18323904, 'steps': 95436, 'loss/train': 1.1485844850540161} 11/07/2021 10:39:22 - INFO - __main__ - Step 95438: {'lr': 0.00014975445127197833, 'samples': 18324096, 'steps': 95437, 'loss/train': 1.219923496246338} 11/07/2021 10:39:22 - INFO - __main__ - Step 95439: {'lr': 0.00014974958986117221, 'samples': 18324288, 'steps': 95438, 'loss/train': 1.3687207698822021} 11/07/2021 10:39:23 - INFO - __main__ - Step 95440: {'lr': 0.00014974472849553735, 'samples': 18324480, 'steps': 95439, 'loss/train': 1.8983606100082397} 11/07/2021 10:39:24 - INFO - __main__ - Step 95441: {'lr': 0.000149739867175076, 'samples': 18324672, 'steps': 95440, 'loss/train': 1.4298131465911865} 11/07/2021 10:39:24 - INFO - __main__ - Step 95442: {'lr': 0.00014973500589979033, 'samples': 18324864, 'steps': 95441, 'loss/train': 1.5682339668273926} 11/07/2021 10:39:24 - INFO - __main__ - Step 95443: {'lr': 0.0001497301446696825, 'samples': 18325056, 'steps': 95442, 'loss/train': 0.7759522795677185} 11/07/2021 10:39:25 - INFO - __main__ - Step 95444: {'lr': 0.0001497252834847547, 'samples': 18325248, 'steps': 95443, 'loss/train': 1.159886360168457} 11/07/2021 10:39:26 - INFO - __main__ - Step 95445: {'lr': 0.00014972042234500917, 'samples': 18325440, 'steps': 95444, 'loss/train': 1.2851582765579224} 11/07/2021 10:39:26 - INFO - __main__ - Step 95446: {'lr': 0.00014971556125044805, 'samples': 18325632, 'steps': 95445, 'loss/train': 1.6428630352020264} 11/07/2021 10:39:27 - INFO - __main__ - Step 95447: {'lr': 0.00014971070020107358, 'samples': 18325824, 'steps': 95446, 'loss/train': 1.3125625848770142} 11/07/2021 10:39:27 - INFO - __main__ - Step 95448: {'lr': 0.0001497058391968879, 'samples': 18326016, 'steps': 95447, 'loss/train': 1.005645990371704} 11/07/2021 10:39:27 - INFO - __main__ - Step 95449: {'lr': 0.0001497009782378933, 'samples': 18326208, 'steps': 95448, 'loss/train': 1.5142040252685547} 11/07/2021 10:39:28 - INFO - __main__ - Step 95450: {'lr': 0.00014969611732409182, 'samples': 18326400, 'steps': 95449, 'loss/train': 1.1745163202285767} 11/07/2021 10:39:29 - INFO - __main__ - Step 95451: {'lr': 0.0001496912564554857, 'samples': 18326592, 'steps': 95450, 'loss/train': 1.4360849857330322} 11/07/2021 10:39:29 - INFO - __main__ - Step 95452: {'lr': 0.0001496863956320772, 'samples': 18326784, 'steps': 95451, 'loss/train': 0.5394425392150879} 11/07/2021 10:39:29 - INFO - __main__ - Step 95453: {'lr': 0.00014968153485386842, 'samples': 18326976, 'steps': 95452, 'loss/train': 2.5242137908935547} 11/07/2021 10:39:30 - INFO - __main__ - Step 95454: {'lr': 0.0001496766741208616, 'samples': 18327168, 'steps': 95453, 'loss/train': 1.4638175964355469} 11/07/2021 10:39:30 - INFO - __main__ - Step 95455: {'lr': 0.0001496718134330589, 'samples': 18327360, 'steps': 95454, 'loss/train': 1.1506470441818237} 11/07/2021 10:39:31 - INFO - __main__ - Step 95456: {'lr': 0.0001496669527904626, 'samples': 18327552, 'steps': 95455, 'loss/train': 1.7254490852355957} 11/07/2021 10:39:31 - INFO - __main__ - Step 95457: {'lr': 0.00014966209219307474, 'samples': 18327744, 'steps': 95456, 'loss/train': 1.6805537939071655} 11/07/2021 10:39:32 - INFO - __main__ - Step 95458: {'lr': 0.00014965723164089766, 'samples': 18327936, 'steps': 95457, 'loss/train': 1.3713737726211548} 11/07/2021 10:39:32 - INFO - __main__ - Step 95459: {'lr': 0.00014965237113393346, 'samples': 18328128, 'steps': 95458, 'loss/train': 1.6738622188568115} 11/07/2021 10:39:32 - INFO - __main__ - Step 95460: {'lr': 0.00014964751067218435, 'samples': 18328320, 'steps': 95459, 'loss/train': 1.4878313541412354} 11/07/2021 10:39:34 - INFO - __main__ - Step 95461: {'lr': 0.0001496426502556525, 'samples': 18328512, 'steps': 95460, 'loss/train': 1.43178129196167} 11/07/2021 10:39:34 - INFO - __main__ - Step 95462: {'lr': 0.0001496377898843402, 'samples': 18328704, 'steps': 95461, 'loss/train': 1.2420003414154053} 11/07/2021 10:39:34 - INFO - __main__ - Step 95463: {'lr': 0.0001496329295582496, 'samples': 18328896, 'steps': 95462, 'loss/train': 0.8358434438705444} 11/07/2021 10:39:35 - INFO - __main__ - Step 95464: {'lr': 0.00014962806927738272, 'samples': 18329088, 'steps': 95463, 'loss/train': 1.5980770587921143} 11/07/2021 10:39:35 - INFO - __main__ - Step 95465: {'lr': 0.00014962320904174194, 'samples': 18329280, 'steps': 95464, 'loss/train': 1.3142014741897583} 11/07/2021 10:39:36 - INFO - __main__ - Step 95466: {'lr': 0.00014961834885132938, 'samples': 18329472, 'steps': 95465, 'loss/train': 1.2679588794708252} 11/07/2021 10:39:36 - INFO - __main__ - Step 95467: {'lr': 0.00014961348870614724, 'samples': 18329664, 'steps': 95466, 'loss/train': 1.2645244598388672} 11/07/2021 10:39:37 - INFO - __main__ - Step 95468: {'lr': 0.00014960862860619772, 'samples': 18329856, 'steps': 95467, 'loss/train': 1.1775164604187012} 11/07/2021 10:39:37 - INFO - __main__ - Step 95469: {'lr': 0.000149603768551483, 'samples': 18330048, 'steps': 95468, 'loss/train': 1.098944067955017} 11/07/2021 10:39:37 - INFO - __main__ - Step 95470: {'lr': 0.00014959890854200525, 'samples': 18330240, 'steps': 95469, 'loss/train': 1.1033649444580078} 11/07/2021 10:39:38 - INFO - __main__ - Step 95471: {'lr': 0.00014959404857776672, 'samples': 18330432, 'steps': 95470, 'loss/train': 1.3993592262268066} 11/07/2021 10:39:39 - INFO - __main__ - Step 95472: {'lr': 0.00014958918865876954, 'samples': 18330624, 'steps': 95471, 'loss/train': 1.0101674795150757} 11/07/2021 10:39:39 - INFO - __main__ - Step 95473: {'lr': 0.00014958432878501595, 'samples': 18330816, 'steps': 95472, 'loss/train': 1.4501291513442993} 11/07/2021 10:39:40 - INFO - __main__ - Step 95474: {'lr': 0.00014957946895650807, 'samples': 18331008, 'steps': 95473, 'loss/train': 1.3894020318984985} 11/07/2021 10:39:40 - INFO - __main__ - Step 95475: {'lr': 0.00014957460917324817, 'samples': 18331200, 'steps': 95474, 'loss/train': 1.2827177047729492} 11/07/2021 10:39:40 - INFO - __main__ - Step 95476: {'lr': 0.00014956974943523845, 'samples': 18331392, 'steps': 95475, 'loss/train': 1.5540865659713745} 11/07/2021 10:39:41 - INFO - __main__ - Step 95477: {'lr': 0.000149564889742481, 'samples': 18331584, 'steps': 95476, 'loss/train': 1.3582326173782349} 11/07/2021 10:39:42 - INFO - __main__ - Step 95478: {'lr': 0.00014956003009497805, 'samples': 18331776, 'steps': 95477, 'loss/train': 1.5325331687927246} 11/07/2021 10:39:42 - INFO - __main__ - Step 95479: {'lr': 0.00014955517049273175, 'samples': 18331968, 'steps': 95478, 'loss/train': 1.347690463066101} 11/07/2021 10:39:42 - INFO - __main__ - Step 95480: {'lr': 0.0001495503109357444, 'samples': 18332160, 'steps': 95479, 'loss/train': 1.2970348596572876} 11/07/2021 10:39:43 - INFO - __main__ - Step 95481: {'lr': 0.00014954545142401813, 'samples': 18332352, 'steps': 95480, 'loss/train': 1.4840190410614014} 11/07/2021 10:39:44 - INFO - __main__ - Step 95482: {'lr': 0.0001495405919575551, 'samples': 18332544, 'steps': 95481, 'loss/train': 1.1888285875320435} 11/07/2021 10:39:44 - INFO - __main__ - Step 95483: {'lr': 0.00014953573253635754, 'samples': 18332736, 'steps': 95482, 'loss/train': 1.6431946754455566} 11/07/2021 10:39:44 - INFO - __main__ - Step 95484: {'lr': 0.00014953087316042766, 'samples': 18332928, 'steps': 95483, 'loss/train': 0.6904372572898865} 11/07/2021 10:39:45 - INFO - __main__ - Step 95485: {'lr': 0.00014952601382976758, 'samples': 18333120, 'steps': 95484, 'loss/train': 1.2776588201522827} 11/07/2021 10:39:45 - INFO - __main__ - Step 95486: {'lr': 0.00014952115454437953, 'samples': 18333312, 'steps': 95485, 'loss/train': 1.5655776262283325} 11/07/2021 10:39:46 - INFO - __main__ - Step 95487: {'lr': 0.00014951629530426572, 'samples': 18333504, 'steps': 95486, 'loss/train': 1.3593782186508179} 11/07/2021 10:39:46 - INFO - __main__ - Step 95488: {'lr': 0.00014951143610942837, 'samples': 18333696, 'steps': 95487, 'loss/train': 0.6183043718338013} 11/07/2021 10:39:47 - INFO - __main__ - Step 95489: {'lr': 0.00014950657695986952, 'samples': 18333888, 'steps': 95488, 'loss/train': 1.861047625541687} 11/07/2021 10:39:47 - INFO - __main__ - Step 95490: {'lr': 0.00014950171785559153, 'samples': 18334080, 'steps': 95489, 'loss/train': 1.629355549812317} 11/07/2021 10:39:48 - INFO - __main__ - Step 95491: {'lr': 0.00014949685879659647, 'samples': 18334272, 'steps': 95490, 'loss/train': 1.223937749862671} 11/07/2021 10:39:48 - INFO - __main__ - Step 95492: {'lr': 0.00014949199978288657, 'samples': 18334464, 'steps': 95491, 'loss/train': 1.3264039754867554} 11/07/2021 10:39:49 - INFO - __main__ - Step 95493: {'lr': 0.00014948714081446405, 'samples': 18334656, 'steps': 95492, 'loss/train': 0.9786007404327393} 11/07/2021 10:39:49 - INFO - __main__ - Step 95494: {'lr': 0.00014948228189133107, 'samples': 18334848, 'steps': 95493, 'loss/train': 1.183266043663025} 11/07/2021 10:39:50 - INFO - __main__ - Step 95495: {'lr': 0.00014947742301348976, 'samples': 18335040, 'steps': 95494, 'loss/train': 0.3839772343635559} 11/07/2021 10:39:50 - INFO - __main__ - Step 95496: {'lr': 0.00014947256418094244, 'samples': 18335232, 'steps': 95495, 'loss/train': 0.981548011302948} 11/07/2021 10:39:50 - INFO - __main__ - Step 95497: {'lr': 0.0001494677053936912, 'samples': 18335424, 'steps': 95496, 'loss/train': 0.7279603481292725} 11/07/2021 10:39:51 - INFO - __main__ - Step 95498: {'lr': 0.00014946284665173833, 'samples': 18335616, 'steps': 95497, 'loss/train': 1.614055871963501} 11/07/2021 10:39:52 - INFO - __main__ - Step 95499: {'lr': 0.00014945798795508585, 'samples': 18335808, 'steps': 95498, 'loss/train': 1.3108632564544678} 11/07/2021 10:39:52 - INFO - __main__ - Step 95500: {'lr': 0.00014945312930373611, 'samples': 18336000, 'steps': 95499, 'loss/train': 1.5142303705215454} 11/07/2021 10:39:52 - INFO - __main__ - Step 95501: {'lr': 0.00014944827069769123, 'samples': 18336192, 'steps': 95500, 'loss/train': 1.4987770318984985} 11/07/2021 10:39:53 - INFO - __main__ - Step 95502: {'lr': 0.0001494434121369534, 'samples': 18336384, 'steps': 95501, 'loss/train': 1.4576561450958252} 11/07/2021 10:39:54 - INFO - __main__ - Step 95503: {'lr': 0.00014943855362152485, 'samples': 18336576, 'steps': 95502, 'loss/train': 1.6973724365234375} 11/07/2021 10:39:54 - INFO - __main__ - Step 95504: {'lr': 0.00014943369515140771, 'samples': 18336768, 'steps': 95503, 'loss/train': 0.855810284614563} 11/07/2021 10:39:55 - INFO - __main__ - Step 95505: {'lr': 0.0001494288367266042, 'samples': 18336960, 'steps': 95504, 'loss/train': 1.365056037902832} 11/07/2021 10:39:55 - INFO - __main__ - Step 95506: {'lr': 0.00014942397834711646, 'samples': 18337152, 'steps': 95505, 'loss/train': 1.5089166164398193} 11/07/2021 10:39:55 - INFO - __main__ - Step 95507: {'lr': 0.0001494191200129467, 'samples': 18337344, 'steps': 95506, 'loss/train': 1.2396819591522217} 11/07/2021 10:39:56 - INFO - __main__ - Step 95508: {'lr': 0.00014941426172409723, 'samples': 18337536, 'steps': 95507, 'loss/train': 1.3976138830184937} 11/07/2021 10:39:57 - INFO - __main__ - Step 95509: {'lr': 0.00014940940348057014, 'samples': 18337728, 'steps': 95508, 'loss/train': 1.4526863098144531} 11/07/2021 10:39:57 - INFO - __main__ - Step 95510: {'lr': 0.00014940454528236758, 'samples': 18337920, 'steps': 95509, 'loss/train': 1.4943439960479736} 11/07/2021 10:39:58 - INFO - __main__ - Step 95511: {'lr': 0.00014939968712949174, 'samples': 18338112, 'steps': 95510, 'loss/train': 1.199664831161499} 11/07/2021 10:39:58 - INFO - __main__ - Step 95512: {'lr': 0.0001493948290219449, 'samples': 18338304, 'steps': 95511, 'loss/train': 1.4069451093673706} 11/07/2021 10:39:58 - INFO - __main__ - Step 95513: {'lr': 0.00014938997095972917, 'samples': 18338496, 'steps': 95512, 'loss/train': 1.4853471517562866} 11/07/2021 10:39:59 - INFO - __main__ - Step 95514: {'lr': 0.0001493851129428468, 'samples': 18338688, 'steps': 95513, 'loss/train': 0.9239208698272705} 11/07/2021 10:40:00 - INFO - __main__ - Step 95515: {'lr': 0.0001493802549712999, 'samples': 18338880, 'steps': 95514, 'loss/train': 1.5129776000976562} 11/07/2021 10:40:00 - INFO - __main__ - Step 95516: {'lr': 0.00014937539704509072, 'samples': 18339072, 'steps': 95515, 'loss/train': 1.1636102199554443} 11/07/2021 10:40:00 - INFO - __main__ - Step 95517: {'lr': 0.0001493705391642215, 'samples': 18339264, 'steps': 95516, 'loss/train': 0.6331053376197815} 11/07/2021 10:40:01 - INFO - __main__ - Step 95518: {'lr': 0.0001493656813286943, 'samples': 18339456, 'steps': 95517, 'loss/train': 1.5703586339950562} 11/07/2021 10:40:02 - INFO - __main__ - Step 95519: {'lr': 0.00014936082353851139, 'samples': 18339648, 'steps': 95518, 'loss/train': 2.088531017303467} 11/07/2021 10:40:02 - INFO - __main__ - Step 95520: {'lr': 0.00014935596579367493, 'samples': 18339840, 'steps': 95519, 'loss/train': 1.379672646522522} 11/07/2021 10:40:02 - INFO - __main__ - Step 95521: {'lr': 0.00014935110809418713, 'samples': 18340032, 'steps': 95520, 'loss/train': 1.507258653640747} 11/07/2021 10:40:03 - INFO - __main__ - Step 95522: {'lr': 0.00014934625044005014, 'samples': 18340224, 'steps': 95521, 'loss/train': 1.3037383556365967} 11/07/2021 10:40:03 - INFO - __main__ - Step 95523: {'lr': 0.00014934139283126618, 'samples': 18340416, 'steps': 95522, 'loss/train': 1.5982956886291504} 11/07/2021 10:40:04 - INFO - __main__ - Step 95524: {'lr': 0.00014933653526783748, 'samples': 18340608, 'steps': 95523, 'loss/train': 1.4759068489074707} 11/07/2021 10:40:04 - INFO - __main__ - Step 95525: {'lr': 0.00014933167774976614, 'samples': 18340800, 'steps': 95524, 'loss/train': 1.1864818334579468} 11/07/2021 10:40:05 - INFO - __main__ - Step 95526: {'lr': 0.00014932682027705437, 'samples': 18340992, 'steps': 95525, 'loss/train': 1.5502132177352905} 11/07/2021 10:40:05 - INFO - __main__ - Step 95527: {'lr': 0.0001493219628497044, 'samples': 18341184, 'steps': 95526, 'loss/train': 0.8919107913970947} 11/07/2021 10:40:06 - INFO - __main__ - Step 95528: {'lr': 0.00014931710546771843, 'samples': 18341376, 'steps': 95527, 'loss/train': 1.7268024682998657} 11/07/2021 10:40:06 - INFO - __main__ - Step 95529: {'lr': 0.0001493122481310986, 'samples': 18341568, 'steps': 95528, 'loss/train': 1.5241643190383911} 11/07/2021 10:40:07 - INFO - __main__ - Step 95530: {'lr': 0.00014930739083984714, 'samples': 18341760, 'steps': 95529, 'loss/train': 1.503648042678833} 11/07/2021 10:40:07 - INFO - __main__ - Step 95531: {'lr': 0.00014930253359396627, 'samples': 18341952, 'steps': 95530, 'loss/train': 0.6406209468841553} 11/07/2021 10:40:08 - INFO - __main__ - Step 95532: {'lr': 0.00014929767639345804, 'samples': 18342144, 'steps': 95531, 'loss/train': 1.395004153251648} 11/07/2021 10:40:08 - INFO - __main__ - Step 95533: {'lr': 0.00014929281923832473, 'samples': 18342336, 'steps': 95532, 'loss/train': 1.041250228881836} 11/07/2021 10:40:08 - INFO - __main__ - Step 95534: {'lr': 0.00014928796212856848, 'samples': 18342528, 'steps': 95533, 'loss/train': 1.2510926723480225} 11/07/2021 10:40:09 - INFO - __main__ - Step 95535: {'lr': 0.00014928310506419156, 'samples': 18342720, 'steps': 95534, 'loss/train': 1.6313846111297607} 11/07/2021 10:40:10 - INFO - __main__ - Step 95536: {'lr': 0.00014927824804519612, 'samples': 18342912, 'steps': 95535, 'loss/train': 1.5851398706436157} 11/07/2021 10:40:10 - INFO - __main__ - Step 95537: {'lr': 0.00014927339107158436, 'samples': 18343104, 'steps': 95536, 'loss/train': 1.5254157781600952} 11/07/2021 10:40:11 - INFO - __main__ - Step 95538: {'lr': 0.00014926853414335846, 'samples': 18343296, 'steps': 95537, 'loss/train': 1.6418492794036865} 11/07/2021 10:40:11 - INFO - __main__ - Step 95539: {'lr': 0.00014926367726052057, 'samples': 18343488, 'steps': 95538, 'loss/train': 1.3902854919433594} 11/07/2021 10:40:12 - INFO - __main__ - Step 95540: {'lr': 0.00014925882042307292, 'samples': 18343680, 'steps': 95539, 'loss/train': 1.1305760145187378} 11/07/2021 10:40:12 - INFO - __main__ - Step 95541: {'lr': 0.00014925396363101772, 'samples': 18343872, 'steps': 95540, 'loss/train': 1.2641122341156006} 11/07/2021 10:40:13 - INFO - __main__ - Step 95542: {'lr': 0.0001492491068843571, 'samples': 18344064, 'steps': 95541, 'loss/train': 1.3568586111068726} 11/07/2021 10:40:13 - INFO - __main__ - Step 95543: {'lr': 0.0001492442501830934, 'samples': 18344256, 'steps': 95542, 'loss/train': 1.4244314432144165} 11/07/2021 10:40:13 - INFO - __main__ - Step 95544: {'lr': 0.00014923939352722853, 'samples': 18344448, 'steps': 95543, 'loss/train': 1.8573557138442993} 11/07/2021 10:40:14 - INFO - __main__ - Step 95545: {'lr': 0.00014923453691676492, 'samples': 18344640, 'steps': 95544, 'loss/train': 1.5372729301452637} 11/07/2021 10:40:15 - INFO - __main__ - Step 95546: {'lr': 0.0001492296803517046, 'samples': 18344832, 'steps': 95545, 'loss/train': 0.8549189567565918} 11/07/2021 10:40:15 - INFO - __main__ - Step 95547: {'lr': 0.00014922482383204988, 'samples': 18345024, 'steps': 95546, 'loss/train': 1.3421001434326172} 11/07/2021 10:40:15 - INFO - __main__ - Step 95548: {'lr': 0.00014921996735780285, 'samples': 18345216, 'steps': 95547, 'loss/train': 1.75882887840271} 11/07/2021 10:40:16 - INFO - __main__ - Step 95549: {'lr': 0.0001492151109289658, 'samples': 18345408, 'steps': 95548, 'loss/train': 1.5210354328155518} 11/07/2021 10:40:17 - INFO - __main__ - Step 95550: {'lr': 0.00014921025454554083, 'samples': 18345600, 'steps': 95549, 'loss/train': 1.124350905418396} 11/07/2021 10:40:17 - INFO - __main__ - Step 95551: {'lr': 0.00014920539820753017, 'samples': 18345792, 'steps': 95550, 'loss/train': 1.0696698427200317} 11/07/2021 10:40:18 - INFO - __main__ - Step 95552: {'lr': 0.00014920054191493604, 'samples': 18345984, 'steps': 95551, 'loss/train': 0.8432616591453552} 11/07/2021 10:40:18 - INFO - __main__ - Step 95553: {'lr': 0.00014919568566776055, 'samples': 18346176, 'steps': 95552, 'loss/train': 0.9557512998580933} 11/07/2021 10:40:18 - INFO - __main__ - Step 95554: {'lr': 0.00014919082946600592, 'samples': 18346368, 'steps': 95553, 'loss/train': 1.068395972251892} 11/07/2021 10:40:19 - INFO - __main__ - Step 95555: {'lr': 0.00014918597330967437, 'samples': 18346560, 'steps': 95554, 'loss/train': 1.4499752521514893} 11/07/2021 10:40:20 - INFO - __main__ - Step 95556: {'lr': 0.00014918111719876807, 'samples': 18346752, 'steps': 95555, 'loss/train': 1.2712616920471191} 11/07/2021 10:40:20 - INFO - __main__ - Step 95557: {'lr': 0.0001491762611332893, 'samples': 18346944, 'steps': 95556, 'loss/train': 1.2724682092666626} 11/07/2021 10:40:20 - INFO - __main__ - Step 95558: {'lr': 0.00014917140511324002, 'samples': 18347136, 'steps': 95557, 'loss/train': 1.3114339113235474} 11/07/2021 10:40:21 - INFO - __main__ - Step 95559: {'lr': 0.0001491665491386226, 'samples': 18347328, 'steps': 95558, 'loss/train': 1.3489426374435425} 11/07/2021 10:40:21 - INFO - __main__ - Step 95560: {'lr': 0.00014916169320943913, 'samples': 18347520, 'steps': 95559, 'loss/train': 1.450276494026184} 11/07/2021 10:40:22 - INFO - __main__ - Step 95561: {'lr': 0.00014915683732569186, 'samples': 18347712, 'steps': 95560, 'loss/train': 0.7580868601799011} 11/07/2021 10:40:22 - INFO - __main__ - Step 95562: {'lr': 0.00014915198148738297, 'samples': 18347904, 'steps': 95561, 'loss/train': 1.179430603981018} 11/07/2021 10:40:23 - INFO - __main__ - Step 95563: {'lr': 0.00014914712569451464, 'samples': 18348096, 'steps': 95562, 'loss/train': 1.1956418752670288} 11/07/2021 10:40:23 - INFO - __main__ - Step 95564: {'lr': 0.00014914226994708907, 'samples': 18348288, 'steps': 95563, 'loss/train': 1.1852705478668213} 11/07/2021 10:40:24 - INFO - __main__ - Step 95565: {'lr': 0.0001491374142451084, 'samples': 18348480, 'steps': 95564, 'loss/train': 1.3055469989776611} 11/07/2021 10:40:25 - INFO - __main__ - Step 95566: {'lr': 0.00014913255858857487, 'samples': 18348672, 'steps': 95565, 'loss/train': 1.3990641832351685} 11/07/2021 10:40:25 - INFO - __main__ - Step 95567: {'lr': 0.00014912770297749068, 'samples': 18348864, 'steps': 95566, 'loss/train': 1.5592821836471558} 11/07/2021 10:40:25 - INFO - __main__ - Step 95568: {'lr': 0.00014912284741185798, 'samples': 18349056, 'steps': 95567, 'loss/train': 1.5140306949615479} 11/07/2021 10:40:26 - INFO - __main__ - Step 95569: {'lr': 0.00014911799189167897, 'samples': 18349248, 'steps': 95568, 'loss/train': 1.3722277879714966} 11/07/2021 10:40:26 - INFO - __main__ - Step 95570: {'lr': 0.0001491131364169559, 'samples': 18349440, 'steps': 95569, 'loss/train': 2.233945846557617} 11/07/2021 10:40:27 - INFO - __main__ - Step 95571: {'lr': 0.00014910828098769083, 'samples': 18349632, 'steps': 95570, 'loss/train': 1.5202929973602295} 11/07/2021 10:40:28 - INFO - __main__ - Step 95572: {'lr': 0.00014910342560388602, 'samples': 18349824, 'steps': 95571, 'loss/train': 1.4075636863708496} 11/07/2021 10:40:28 - INFO - __main__ - Step 95573: {'lr': 0.0001490985702655436, 'samples': 18350016, 'steps': 95572, 'loss/train': 1.4453539848327637} 11/07/2021 10:40:28 - INFO - __main__ - Step 95574: {'lr': 0.00014909371497266583, 'samples': 18350208, 'steps': 95573, 'loss/train': 1.5147273540496826} 11/07/2021 10:40:29 - INFO - __main__ - Step 95575: {'lr': 0.0001490888597252549, 'samples': 18350400, 'steps': 95574, 'loss/train': 1.647055745124817} 11/07/2021 10:40:30 - INFO - __main__ - Step 95576: {'lr': 0.00014908400452331294, 'samples': 18350592, 'steps': 95575, 'loss/train': 1.547234296798706} 11/07/2021 10:40:30 - INFO - __main__ - Step 95577: {'lr': 0.0001490791493668422, 'samples': 18350784, 'steps': 95576, 'loss/train': 1.1673592329025269} 11/07/2021 10:40:30 - INFO - __main__ - Step 95578: {'lr': 0.00014907429425584483, 'samples': 18350976, 'steps': 95577, 'loss/train': 1.5990585088729858} 11/07/2021 10:40:31 - INFO - __main__ - Step 95579: {'lr': 0.00014906943919032302, 'samples': 18351168, 'steps': 95578, 'loss/train': 0.2373121678829193} 11/07/2021 10:40:31 - INFO - __main__ - Step 95580: {'lr': 0.00014906458417027896, 'samples': 18351360, 'steps': 95579, 'loss/train': 1.678846001625061} 11/07/2021 10:40:32 - INFO - __main__ - Step 95581: {'lr': 0.00014905972919571485, 'samples': 18351552, 'steps': 95580, 'loss/train': 1.3697482347488403} 11/07/2021 10:40:33 - INFO - __main__ - Step 95582: {'lr': 0.00014905487426663283, 'samples': 18351744, 'steps': 95581, 'loss/train': 0.9072967767715454} 11/07/2021 10:40:33 - INFO - __main__ - Step 95583: {'lr': 0.0001490500193830352, 'samples': 18351936, 'steps': 95582, 'loss/train': 1.7072851657867432} 11/07/2021 10:40:33 - INFO - __main__ - Step 95584: {'lr': 0.00014904516454492412, 'samples': 18352128, 'steps': 95583, 'loss/train': 1.3682001829147339} 11/07/2021 10:40:34 - INFO - __main__ - Step 95585: {'lr': 0.00014904030975230166, 'samples': 18352320, 'steps': 95584, 'loss/train': 1.3910690546035767} 11/07/2021 10:40:34 - INFO - __main__ - Step 95586: {'lr': 0.00014903545500517004, 'samples': 18352512, 'steps': 95585, 'loss/train': 1.276658296585083} 11/07/2021 10:40:35 - INFO - __main__ - Step 95587: {'lr': 0.0001490306003035315, 'samples': 18352704, 'steps': 95586, 'loss/train': 1.0333504676818848} 11/07/2021 10:40:35 - INFO - __main__ - Step 95588: {'lr': 0.00014902574564738824, 'samples': 18352896, 'steps': 95587, 'loss/train': 1.3824745416641235} 11/07/2021 10:40:36 - INFO - __main__ - Step 95589: {'lr': 0.0001490208910367424, 'samples': 18353088, 'steps': 95588, 'loss/train': 1.7650383710861206} 11/07/2021 10:40:36 - INFO - __main__ - Step 95590: {'lr': 0.00014901603647159617, 'samples': 18353280, 'steps': 95589, 'loss/train': 1.3307369947433472} 11/07/2021 10:40:36 - INFO - __main__ - Step 95591: {'lr': 0.0001490111819519518, 'samples': 18353472, 'steps': 95590, 'loss/train': 0.9718762636184692} 11/07/2021 10:40:37 - INFO - __main__ - Step 95592: {'lr': 0.0001490063274778114, 'samples': 18353664, 'steps': 95591, 'loss/train': 1.4712684154510498} 11/07/2021 10:40:38 - INFO - __main__ - Step 95593: {'lr': 0.0001490014730491772, 'samples': 18353856, 'steps': 95592, 'loss/train': 1.5000122785568237} 11/07/2021 10:40:38 - INFO - __main__ - Step 95594: {'lr': 0.00014899661866605137, 'samples': 18354048, 'steps': 95593, 'loss/train': 1.7340396642684937} 11/07/2021 10:40:39 - INFO - __main__ - Step 95595: {'lr': 0.0001489917643284361, 'samples': 18354240, 'steps': 95594, 'loss/train': 1.3917579650878906} 11/07/2021 10:40:39 - INFO - __main__ - Step 95596: {'lr': 0.0001489869100363336, 'samples': 18354432, 'steps': 95595, 'loss/train': 1.9556668996810913} 11/07/2021 10:40:40 - INFO - __main__ - Step 95597: {'lr': 0.00014898205578974617, 'samples': 18354624, 'steps': 95596, 'loss/train': 1.527839183807373} 11/07/2021 10:40:40 - INFO - __main__ - Step 95598: {'lr': 0.0001489772015886757, 'samples': 18354816, 'steps': 95597, 'loss/train': 1.2801332473754883} 11/07/2021 10:40:41 - INFO - __main__ - Step 95599: {'lr': 0.0001489723474331246, 'samples': 18355008, 'steps': 95598, 'loss/train': 1.0463882684707642} 11/07/2021 10:40:41 - INFO - __main__ - Step 95600: {'lr': 0.00014896749332309495, 'samples': 18355200, 'steps': 95599, 'loss/train': 1.2622872591018677} 11/07/2021 10:40:41 - INFO - __main__ - Step 95601: {'lr': 0.00014896263925858903, 'samples': 18355392, 'steps': 95600, 'loss/train': 1.3471146821975708} 11/07/2021 10:40:42 - INFO - __main__ - Step 95602: {'lr': 0.00014895778523960895, 'samples': 18355584, 'steps': 95601, 'loss/train': 1.518966794013977} 11/07/2021 10:40:43 - INFO - __main__ - Step 95603: {'lr': 0.00014895293126615696, 'samples': 18355776, 'steps': 95602, 'loss/train': 1.5340546369552612} 11/07/2021 10:40:43 - INFO - __main__ - Step 95604: {'lr': 0.00014894807733823522, 'samples': 18355968, 'steps': 95603, 'loss/train': 1.222280740737915} 11/07/2021 10:40:43 - INFO - __main__ - Step 95605: {'lr': 0.0001489432234558459, 'samples': 18356160, 'steps': 95604, 'loss/train': 1.7363842725753784} 11/07/2021 10:40:44 - INFO - __main__ - Step 95606: {'lr': 0.00014893836961899122, 'samples': 18356352, 'steps': 95605, 'loss/train': 1.3746315240859985} 11/07/2021 10:40:44 - INFO - __main__ - Step 95607: {'lr': 0.00014893351582767335, 'samples': 18356544, 'steps': 95606, 'loss/train': 1.7216558456420898} 11/07/2021 10:40:45 - INFO - __main__ - Step 95608: {'lr': 0.00014892866208189448, 'samples': 18356736, 'steps': 95607, 'loss/train': 1.6230963468551636} 11/07/2021 10:40:46 - INFO - __main__ - Step 95609: {'lr': 0.00014892380838165678, 'samples': 18356928, 'steps': 95608, 'loss/train': 1.3291656970977783} 11/07/2021 10:40:46 - INFO - __main__ - Step 95610: {'lr': 0.00014891895472696244, 'samples': 18357120, 'steps': 95609, 'loss/train': 1.3352961540222168} 11/07/2021 10:40:46 - INFO - __main__ - Step 95611: {'lr': 0.0001489141011178138, 'samples': 18357312, 'steps': 95610, 'loss/train': 1.2319364547729492} 11/07/2021 10:40:47 - INFO - __main__ - Step 95612: {'lr': 0.00014890924755421277, 'samples': 18357504, 'steps': 95611, 'loss/train': 1.3768357038497925} 11/07/2021 10:40:48 - INFO - __main__ - Step 95613: {'lr': 0.00014890439403616171, 'samples': 18357696, 'steps': 95612, 'loss/train': 1.5073797702789307} 11/07/2021 10:40:48 - INFO - __main__ - Step 95614: {'lr': 0.00014889954056366273, 'samples': 18357888, 'steps': 95613, 'loss/train': 1.510330319404602} 11/07/2021 10:40:48 - INFO - __main__ - Step 95615: {'lr': 0.0001488946871367181, 'samples': 18358080, 'steps': 95614, 'loss/train': 0.8276024460792542} 11/07/2021 10:40:49 - INFO - __main__ - Step 95616: {'lr': 0.00014888983375532994, 'samples': 18358272, 'steps': 95615, 'loss/train': 1.291101336479187} 11/07/2021 10:40:49 - INFO - __main__ - Step 95617: {'lr': 0.00014888498041950045, 'samples': 18358464, 'steps': 95616, 'loss/train': 1.4137531518936157} 11/07/2021 10:40:50 - INFO - __main__ - Step 95618: {'lr': 0.00014888012712923186, 'samples': 18358656, 'steps': 95617, 'loss/train': 1.4937912225723267} 11/07/2021 10:40:50 - INFO - __main__ - Step 95619: {'lr': 0.00014887527388452628, 'samples': 18358848, 'steps': 95618, 'loss/train': 1.4457944631576538} 11/07/2021 10:40:51 - INFO - __main__ - Step 95620: {'lr': 0.00014887042068538597, 'samples': 18359040, 'steps': 95619, 'loss/train': 1.5321391820907593} 11/07/2021 10:40:51 - INFO - __main__ - Step 95621: {'lr': 0.00014886556753181308, 'samples': 18359232, 'steps': 95620, 'loss/train': 1.244670033454895} 11/07/2021 10:40:52 - INFO - __main__ - Step 95622: {'lr': 0.00014886071442380986, 'samples': 18359424, 'steps': 95621, 'loss/train': 1.6679749488830566} 11/07/2021 10:40:53 - INFO - __main__ - Step 95623: {'lr': 0.00014885586136137842, 'samples': 18359616, 'steps': 95622, 'loss/train': 1.0204899311065674} 11/07/2021 10:40:53 - INFO - __main__ - Step 95624: {'lr': 0.00014885100834452099, 'samples': 18359808, 'steps': 95623, 'loss/train': 0.9979268908500671} 11/07/2021 10:40:53 - INFO - __main__ - Step 95625: {'lr': 0.00014884615537323964, 'samples': 18360000, 'steps': 95624, 'loss/train': 0.9393404722213745} 11/07/2021 10:40:54 - INFO - __main__ - Step 95626: {'lr': 0.0001488413024475367, 'samples': 18360192, 'steps': 95625, 'loss/train': 1.4104644060134888} 11/07/2021 10:40:54 - INFO - __main__ - Step 95627: {'lr': 0.00014883644956741428, 'samples': 18360384, 'steps': 95626, 'loss/train': 1.0448057651519775} 11/07/2021 10:40:55 - INFO - __main__ - Step 95628: {'lr': 0.00014883159673287463, 'samples': 18360576, 'steps': 95627, 'loss/train': 0.792184591293335} 11/07/2021 10:40:55 - INFO - __main__ - Step 95629: {'lr': 0.00014882674394391988, 'samples': 18360768, 'steps': 95628, 'loss/train': 1.4040205478668213} 11/07/2021 10:40:56 - INFO - __main__ - Step 95630: {'lr': 0.00014882189120055226, 'samples': 18360960, 'steps': 95629, 'loss/train': 0.9202346801757812} 11/07/2021 10:40:56 - INFO - __main__ - Step 95631: {'lr': 0.00014881703850277392, 'samples': 18361152, 'steps': 95630, 'loss/train': 0.7172181010246277} 11/07/2021 10:40:56 - INFO - __main__ - Step 95632: {'lr': 0.00014881218585058707, 'samples': 18361344, 'steps': 95631, 'loss/train': 0.8293659090995789} 11/07/2021 10:40:57 - INFO - __main__ - Step 95633: {'lr': 0.00014880733324399394, 'samples': 18361536, 'steps': 95632, 'loss/train': 1.2564728260040283} 11/07/2021 10:40:58 - INFO - __main__ - Step 95634: {'lr': 0.00014880248068299657, 'samples': 18361728, 'steps': 95633, 'loss/train': 1.4021741151809692} 11/07/2021 10:40:58 - INFO - __main__ - Step 95635: {'lr': 0.00014879762816759728, 'samples': 18361920, 'steps': 95634, 'loss/train': 1.142830729484558} 11/07/2021 10:40:59 - INFO - __main__ - Step 95636: {'lr': 0.0001487927756977982, 'samples': 18362112, 'steps': 95635, 'loss/train': 1.2419612407684326} 11/07/2021 10:40:59 - INFO - __main__ - Step 95637: {'lr': 0.00014878792327360156, 'samples': 18362304, 'steps': 95636, 'loss/train': 1.3939247131347656} 11/07/2021 10:40:59 - INFO - __main__ - Step 95638: {'lr': 0.00014878307089500952, 'samples': 18362496, 'steps': 95637, 'loss/train': 1.4189287424087524} 11/07/2021 10:41:00 - INFO - __main__ - Step 95639: {'lr': 0.0001487782185620243, 'samples': 18362688, 'steps': 95638, 'loss/train': 1.5333434343338013} 11/07/2021 10:41:01 - INFO - __main__ - Step 95640: {'lr': 0.000148773366274648, 'samples': 18362880, 'steps': 95639, 'loss/train': 1.3674647808074951} 11/07/2021 10:41:01 - INFO - __main__ - Step 95641: {'lr': 0.00014876851403288282, 'samples': 18363072, 'steps': 95640, 'loss/train': 0.9981669783592224} 11/07/2021 10:41:01 - INFO - __main__ - Step 95642: {'lr': 0.00014876366183673106, 'samples': 18363264, 'steps': 95641, 'loss/train': 1.6008996963500977} 11/07/2021 10:41:02 - INFO - __main__ - Step 95643: {'lr': 0.0001487588096861948, 'samples': 18363456, 'steps': 95642, 'loss/train': 1.33907151222229} 11/07/2021 10:41:03 - INFO - __main__ - Step 95644: {'lr': 0.0001487539575812763, 'samples': 18363648, 'steps': 95643, 'loss/train': 1.2434746026992798} 11/07/2021 10:41:03 - INFO - __main__ - Step 95645: {'lr': 0.00014874910552197768, 'samples': 18363840, 'steps': 95644, 'loss/train': 1.2779600620269775} 11/07/2021 10:41:04 - INFO - __main__ - Step 95646: {'lr': 0.00014874425350830113, 'samples': 18364032, 'steps': 95645, 'loss/train': 1.3359966278076172} 11/07/2021 10:41:04 - INFO - __main__ - Step 95647: {'lr': 0.00014873940154024883, 'samples': 18364224, 'steps': 95646, 'loss/train': 1.4438670873641968} 11/07/2021 10:41:04 - INFO - __main__ - Step 95648: {'lr': 0.00014873454961782304, 'samples': 18364416, 'steps': 95647, 'loss/train': 1.6528706550598145} 11/07/2021 10:41:05 - INFO - __main__ - Step 95649: {'lr': 0.00014872969774102588, 'samples': 18364608, 'steps': 95648, 'loss/train': 1.59243905544281} 11/07/2021 10:41:06 - INFO - __main__ - Step 95650: {'lr': 0.00014872484590985956, 'samples': 18364800, 'steps': 95649, 'loss/train': 1.1042866706848145} 11/07/2021 10:41:06 - INFO - __main__ - Step 95651: {'lr': 0.0001487199941243263, 'samples': 18364992, 'steps': 95650, 'loss/train': 1.3124817609786987} 11/07/2021 10:41:06 - INFO - __main__ - Step 95652: {'lr': 0.0001487151423844282, 'samples': 18365184, 'steps': 95651, 'loss/train': 1.6687921285629272} 11/07/2021 10:41:07 - INFO - __main__ - Step 95653: {'lr': 0.0001487102906901675, 'samples': 18365376, 'steps': 95652, 'loss/train': 1.157302737236023} 11/07/2021 10:41:07 - INFO - __main__ - Step 95654: {'lr': 0.00014870543904154637, 'samples': 18365568, 'steps': 95653, 'loss/train': 1.870009422302246} 11/07/2021 10:41:08 - INFO - __main__ - Step 95655: {'lr': 0.00014870058743856706, 'samples': 18365760, 'steps': 95654, 'loss/train': 1.6539498567581177} 11/07/2021 10:41:08 - INFO - __main__ - Step 95656: {'lr': 0.0001486957358812317, 'samples': 18365952, 'steps': 95655, 'loss/train': 1.3006864786148071} 11/07/2021 10:41:09 - INFO - __main__ - Step 95657: {'lr': 0.00014869088436954243, 'samples': 18366144, 'steps': 95656, 'loss/train': 1.4171829223632812} 11/07/2021 10:41:09 - INFO - __main__ - Step 95658: {'lr': 0.00014868603290350146, 'samples': 18366336, 'steps': 95657, 'loss/train': 0.8480222821235657} 11/07/2021 10:41:09 - INFO - __main__ - Step 95659: {'lr': 0.00014868118148311105, 'samples': 18366528, 'steps': 95658, 'loss/train': 1.030495285987854} 11/07/2021 10:41:11 - INFO - __main__ - Step 95660: {'lr': 0.00014867633010837335, 'samples': 18366720, 'steps': 95659, 'loss/train': 1.2781956195831299} 11/07/2021 10:41:11 - INFO - __main__ - Step 95661: {'lr': 0.00014867147877929048, 'samples': 18366912, 'steps': 95660, 'loss/train': 0.589092493057251} 11/07/2021 10:41:11 - INFO - __main__ - Step 95662: {'lr': 0.0001486666274958647, 'samples': 18367104, 'steps': 95661, 'loss/train': 1.3560147285461426} 11/07/2021 10:41:12 - INFO - __main__ - Step 95663: {'lr': 0.00014866177625809818, 'samples': 18367296, 'steps': 95662, 'loss/train': 2.7052664756774902} 11/07/2021 10:41:12 - INFO - __main__ - Step 95664: {'lr': 0.00014865692506599312, 'samples': 18367488, 'steps': 95663, 'loss/train': 1.3568569421768188} 11/07/2021 10:41:13 - INFO - __main__ - Step 95665: {'lr': 0.0001486520739195517, 'samples': 18367680, 'steps': 95664, 'loss/train': 1.4718431234359741} 11/07/2021 10:41:13 - INFO - __main__ - Step 95666: {'lr': 0.00014864722281877609, 'samples': 18367872, 'steps': 95665, 'loss/train': 1.4010646343231201} 11/07/2021 10:41:14 - INFO - __main__ - Step 95667: {'lr': 0.0001486423717636684, 'samples': 18368064, 'steps': 95666, 'loss/train': 1.433397889137268} 11/07/2021 10:41:14 - INFO - __main__ - Step 95668: {'lr': 0.00014863752075423094, 'samples': 18368256, 'steps': 95667, 'loss/train': 0.9826617240905762} 11/07/2021 10:41:14 - INFO - __main__ - Step 95669: {'lr': 0.00014863266979046582, 'samples': 18368448, 'steps': 95668, 'loss/train': 1.4787884950637817} 11/07/2021 10:41:15 - INFO - __main__ - Step 95670: {'lr': 0.00014862781887237532, 'samples': 18368640, 'steps': 95669, 'loss/train': 1.0426298379898071} 11/07/2021 10:41:16 - INFO - __main__ - Step 95671: {'lr': 0.0001486229679999615, 'samples': 18368832, 'steps': 95670, 'loss/train': 1.2226563692092896} 11/07/2021 10:41:16 - INFO - __main__ - Step 95672: {'lr': 0.0001486181171732266, 'samples': 18369024, 'steps': 95671, 'loss/train': 1.622039556503296} 11/07/2021 10:41:16 - INFO - __main__ - Step 95673: {'lr': 0.00014861326639217283, 'samples': 18369216, 'steps': 95672, 'loss/train': 1.7338563203811646} 11/07/2021 10:41:17 - INFO - __main__ - Step 95674: {'lr': 0.00014860841565680235, 'samples': 18369408, 'steps': 95673, 'loss/train': 2.0868351459503174} 11/07/2021 10:41:18 - INFO - __main__ - Step 95675: {'lr': 0.0001486035649671174, 'samples': 18369600, 'steps': 95674, 'loss/train': 0.8725195527076721} 11/07/2021 10:41:18 - INFO - __main__ - Step 95676: {'lr': 0.00014859871432312005, 'samples': 18369792, 'steps': 95675, 'loss/train': 0.7759091258049011} 11/07/2021 10:41:19 - INFO - __main__ - Step 95677: {'lr': 0.0001485938637248126, 'samples': 18369984, 'steps': 95676, 'loss/train': 1.1193805932998657} 11/07/2021 10:41:19 - INFO - __main__ - Step 95678: {'lr': 0.00014858901317219727, 'samples': 18370176, 'steps': 95677, 'loss/train': 1.6475002765655518} 11/07/2021 10:41:19 - INFO - __main__ - Step 95679: {'lr': 0.00014858416266527608, 'samples': 18370368, 'steps': 95678, 'loss/train': 1.7991836071014404} 11/07/2021 10:41:20 - INFO - __main__ - Step 95680: {'lr': 0.0001485793122040513, 'samples': 18370560, 'steps': 95679, 'loss/train': 1.5431784391403198} 11/07/2021 10:41:21 - INFO - __main__ - Step 95681: {'lr': 0.0001485744617885251, 'samples': 18370752, 'steps': 95680, 'loss/train': 1.0692898035049438} 11/07/2021 10:41:21 - INFO - __main__ - Step 95682: {'lr': 0.00014856961141869967, 'samples': 18370944, 'steps': 95681, 'loss/train': 1.6758006811141968} 11/07/2021 10:41:21 - INFO - __main__ - Step 95683: {'lr': 0.00014856476109457726, 'samples': 18371136, 'steps': 95682, 'loss/train': 1.5625708103179932} 11/07/2021 10:41:22 - INFO - __main__ - Step 95684: {'lr': 0.00014855991081616, 'samples': 18371328, 'steps': 95683, 'loss/train': 1.3467353582382202} 11/07/2021 10:41:22 - INFO - __main__ - Step 95685: {'lr': 0.00014855506058345002, 'samples': 18371520, 'steps': 95684, 'loss/train': 1.110032081604004} 11/07/2021 10:41:23 - INFO - __main__ - Step 95686: {'lr': 0.00014855021039644962, 'samples': 18371712, 'steps': 95685, 'loss/train': 1.3082475662231445} 11/07/2021 10:41:24 - INFO - __main__ - Step 95687: {'lr': 0.00014854536025516092, 'samples': 18371904, 'steps': 95686, 'loss/train': 1.4235129356384277} 11/07/2021 10:41:24 - INFO - __main__ - Step 95688: {'lr': 0.00014854051015958608, 'samples': 18372096, 'steps': 95687, 'loss/train': 1.1781338453292847} 11/07/2021 10:41:24 - INFO - __main__ - Step 95689: {'lr': 0.00014853566010972736, 'samples': 18372288, 'steps': 95688, 'loss/train': 1.6964635848999023} 11/07/2021 10:41:25 - INFO - __main__ - Step 95690: {'lr': 0.00014853081010558688, 'samples': 18372480, 'steps': 95689, 'loss/train': 1.2606168985366821} 11/07/2021 10:41:26 - INFO - __main__ - Step 95691: {'lr': 0.00014852596014716695, 'samples': 18372672, 'steps': 95690, 'loss/train': 0.7757840156555176} 11/07/2021 10:41:26 - INFO - __main__ - Step 95692: {'lr': 0.00014852111023446957, 'samples': 18372864, 'steps': 95691, 'loss/train': 0.9002321362495422} 11/07/2021 10:41:26 - INFO - __main__ - Step 95693: {'lr': 0.000148516260367497, 'samples': 18373056, 'steps': 95692, 'loss/train': 1.4651713371276855} 11/07/2021 10:41:27 - INFO - __main__ - Step 95694: {'lr': 0.00014851141054625144, 'samples': 18373248, 'steps': 95693, 'loss/train': 1.362772822380066} 11/07/2021 10:41:27 - INFO - __main__ - Step 95695: {'lr': 0.0001485065607707351, 'samples': 18373440, 'steps': 95694, 'loss/train': 0.7865837812423706} 11/07/2021 10:41:28 - INFO - __main__ - Step 95696: {'lr': 0.00014850171104095012, 'samples': 18373632, 'steps': 95695, 'loss/train': 1.2106659412384033} 11/07/2021 10:41:28 - INFO - __main__ - Step 95697: {'lr': 0.0001484968613568987, 'samples': 18373824, 'steps': 95696, 'loss/train': 1.4586480855941772} 11/07/2021 10:41:29 - INFO - __main__ - Step 95698: {'lr': 0.00014849201171858301, 'samples': 18374016, 'steps': 95697, 'loss/train': 1.3444147109985352} 11/07/2021 10:41:29 - INFO - __main__ - Step 95699: {'lr': 0.00014848716212600526, 'samples': 18374208, 'steps': 95698, 'loss/train': 1.5508862733840942} 11/07/2021 10:41:29 - INFO - __main__ - Step 95700: {'lr': 0.00014848231257916767, 'samples': 18374400, 'steps': 95699, 'loss/train': 1.0984395742416382} 11/07/2021 10:41:30 - INFO - __main__ - Step 95701: {'lr': 0.00014847746307807233, 'samples': 18374592, 'steps': 95700, 'loss/train': 1.82334566116333} 11/07/2021 10:41:31 - INFO - __main__ - Step 95702: {'lr': 0.0001484726136227215, 'samples': 18374784, 'steps': 95701, 'loss/train': 1.3271347284317017} 11/07/2021 10:41:31 - INFO - __main__ - Step 95703: {'lr': 0.00014846776421311738, 'samples': 18374976, 'steps': 95702, 'loss/train': 1.2495605945587158} 11/07/2021 10:41:31 - INFO - __main__ - Step 95704: {'lr': 0.00014846291484926205, 'samples': 18375168, 'steps': 95703, 'loss/train': 1.1101945638656616} 11/07/2021 10:41:32 - INFO - __main__ - Step 95705: {'lr': 0.0001484580655311579, 'samples': 18375360, 'steps': 95704, 'loss/train': 1.3272370100021362} 11/07/2021 10:41:32 - INFO - __main__ - Step 95706: {'lr': 0.00014845321625880687, 'samples': 18375552, 'steps': 95705, 'loss/train': 1.4582364559173584} 11/07/2021 10:41:33 - INFO - __main__ - Step 95707: {'lr': 0.00014844836703221126, 'samples': 18375744, 'steps': 95706, 'loss/train': 1.5602107048034668} 11/07/2021 10:41:33 - INFO - __main__ - Step 95708: {'lr': 0.00014844351785137325, 'samples': 18375936, 'steps': 95707, 'loss/train': 0.889776349067688} 11/07/2021 10:41:34 - INFO - __main__ - Step 95709: {'lr': 0.00014843866871629502, 'samples': 18376128, 'steps': 95708, 'loss/train': 1.9260413646697998} 11/07/2021 10:41:34 - INFO - __main__ - Step 95710: {'lr': 0.00014843381962697876, 'samples': 18376320, 'steps': 95709, 'loss/train': 1.8865585327148438} 11/07/2021 10:41:35 - INFO - __main__ - Step 95711: {'lr': 0.00014842897058342663, 'samples': 18376512, 'steps': 95710, 'loss/train': 1.3418792486190796} 11/07/2021 10:41:36 - INFO - __main__ - Step 95712: {'lr': 0.0001484241215856409, 'samples': 18376704, 'steps': 95711, 'loss/train': 1.047156572341919} 11/07/2021 10:41:36 - INFO - __main__ - Step 95713: {'lr': 0.00014841927263362366, 'samples': 18376896, 'steps': 95712, 'loss/train': 1.5517011880874634} 11/07/2021 10:41:36 - INFO - __main__ - Step 95714: {'lr': 0.0001484144237273771, 'samples': 18377088, 'steps': 95713, 'loss/train': 1.4087649583816528} 11/07/2021 10:41:37 - INFO - __main__ - Step 95715: {'lr': 0.00014840957486690346, 'samples': 18377280, 'steps': 95714, 'loss/train': 1.1143144369125366} 11/07/2021 10:41:37 - INFO - __main__ - Step 95716: {'lr': 0.0001484047260522049, 'samples': 18377472, 'steps': 95715, 'loss/train': 1.213840365409851} 11/07/2021 10:41:38 - INFO - __main__ - Step 95717: {'lr': 0.00014839987728328357, 'samples': 18377664, 'steps': 95716, 'loss/train': 1.4718708992004395} 11/07/2021 10:41:38 - INFO - __main__ - Step 95718: {'lr': 0.00014839502856014183, 'samples': 18377856, 'steps': 95717, 'loss/train': 1.0019372701644897} 11/07/2021 10:41:39 - INFO - __main__ - Step 95719: {'lr': 0.0001483901798827816, 'samples': 18378048, 'steps': 95718, 'loss/train': 0.72139972448349} 11/07/2021 10:41:39 - INFO - __main__ - Step 95720: {'lr': 0.00014838533125120521, 'samples': 18378240, 'steps': 95719, 'loss/train': 1.2014727592468262} 11/07/2021 10:41:39 - INFO - __main__ - Step 95721: {'lr': 0.0001483804826654148, 'samples': 18378432, 'steps': 95720, 'loss/train': 1.1201306581497192} 11/07/2021 10:41:40 - INFO - __main__ - Step 95722: {'lr': 0.0001483756341254126, 'samples': 18378624, 'steps': 95721, 'loss/train': 0.983285129070282} 11/07/2021 10:41:41 - INFO - __main__ - Step 95723: {'lr': 0.00014837078563120074, 'samples': 18378816, 'steps': 95722, 'loss/train': 1.446542501449585} 11/07/2021 10:41:41 - INFO - __main__ - Step 95724: {'lr': 0.00014836593718278146, 'samples': 18379008, 'steps': 95723, 'loss/train': 1.4732285737991333} 11/07/2021 10:41:41 - INFO - __main__ - Step 95725: {'lr': 0.0001483610887801569, 'samples': 18379200, 'steps': 95724, 'loss/train': 1.4466681480407715} 11/07/2021 10:41:42 - INFO - __main__ - Step 95726: {'lr': 0.0001483562404233293, 'samples': 18379392, 'steps': 95725, 'loss/train': 0.8506258130073547} 11/07/2021 10:41:43 - INFO - __main__ - Step 95727: {'lr': 0.00014835139211230076, 'samples': 18379584, 'steps': 95726, 'loss/train': 1.6092114448547363} 11/07/2021 10:41:43 - INFO - __main__ - Step 95728: {'lr': 0.00014834654384707351, 'samples': 18379776, 'steps': 95727, 'loss/train': 0.6228721737861633} 11/07/2021 10:41:43 - INFO - __main__ - Step 95729: {'lr': 0.0001483416956276498, 'samples': 18379968, 'steps': 95728, 'loss/train': 2.2359232902526855} 11/07/2021 10:41:44 - INFO - __main__ - Step 95730: {'lr': 0.0001483368474540317, 'samples': 18380160, 'steps': 95729, 'loss/train': 1.3336783647537231} 11/07/2021 10:41:44 - INFO - __main__ - Step 95731: {'lr': 0.0001483319993262215, 'samples': 18380352, 'steps': 95730, 'loss/train': 1.3635458946228027} 11/07/2021 10:41:45 - INFO - __main__ - Step 95732: {'lr': 0.00014832715124422138, 'samples': 18380544, 'steps': 95731, 'loss/train': 1.0955370664596558} 11/07/2021 10:41:45 - INFO - __main__ - Step 95733: {'lr': 0.0001483223032080334, 'samples': 18380736, 'steps': 95732, 'loss/train': 2.29111909866333} 11/07/2021 10:41:46 - INFO - __main__ - Step 95734: {'lr': 0.00014831745521765981, 'samples': 18380928, 'steps': 95733, 'loss/train': 0.818153440952301} 11/07/2021 10:41:46 - INFO - __main__ - Step 95735: {'lr': 0.00014831260727310284, 'samples': 18381120, 'steps': 95734, 'loss/train': 1.2016946077346802} 11/07/2021 10:41:47 - INFO - __main__ - Step 95736: {'lr': 0.0001483077593743646, 'samples': 18381312, 'steps': 95735, 'loss/train': 1.4947596788406372} 11/07/2021 10:41:47 - INFO - __main__ - Step 95737: {'lr': 0.0001483029115214473, 'samples': 18381504, 'steps': 95736, 'loss/train': 1.1861155033111572} 11/07/2021 10:41:48 - INFO - __main__ - Step 95738: {'lr': 0.0001482980637143532, 'samples': 18381696, 'steps': 95737, 'loss/train': 0.7323434948921204} 11/07/2021 10:41:48 - INFO - __main__ - Step 95739: {'lr': 0.00014829321595308438, 'samples': 18381888, 'steps': 95738, 'loss/train': 1.570775032043457} 11/07/2021 10:41:49 - INFO - __main__ - Step 95740: {'lr': 0.00014828836823764307, 'samples': 18382080, 'steps': 95739, 'loss/train': 1.609140157699585} 11/07/2021 10:41:49 - INFO - __main__ - Step 95741: {'lr': 0.00014828352056803145, 'samples': 18382272, 'steps': 95740, 'loss/train': 0.9423354864120483} 11/07/2021 10:41:49 - INFO - __main__ - Step 95742: {'lr': 0.00014827867294425173, 'samples': 18382464, 'steps': 95741, 'loss/train': 1.323718547821045} 11/07/2021 10:41:50 - INFO - __main__ - Step 95743: {'lr': 0.00014827382536630607, 'samples': 18382656, 'steps': 95742, 'loss/train': 1.107712745666504} 11/07/2021 10:41:51 - INFO - __main__ - Step 95744: {'lr': 0.00014826897783419663, 'samples': 18382848, 'steps': 95743, 'loss/train': 1.5257748365402222} 11/07/2021 10:41:51 - INFO - __main__ - Step 95745: {'lr': 0.00014826413034792573, 'samples': 18383040, 'steps': 95744, 'loss/train': 1.2013133764266968} 11/07/2021 10:41:51 - INFO - __main__ - Step 95746: {'lr': 0.00014825928290749534, 'samples': 18383232, 'steps': 95745, 'loss/train': 1.4383649826049805} 11/07/2021 10:41:52 - INFO - __main__ - Step 95747: {'lr': 0.00014825443551290775, 'samples': 18383424, 'steps': 95746, 'loss/train': 1.4013608694076538} 11/07/2021 10:41:53 - INFO - __main__ - Step 95748: {'lr': 0.00014824958816416517, 'samples': 18383616, 'steps': 95747, 'loss/train': 1.554766058921814} 11/07/2021 10:41:53 - INFO - __main__ - Step 95749: {'lr': 0.00014824474086126972, 'samples': 18383808, 'steps': 95748, 'loss/train': 1.0352659225463867} 11/07/2021 10:41:53 - INFO - __main__ - Step 95750: {'lr': 0.00014823989360422362, 'samples': 18384000, 'steps': 95749, 'loss/train': 1.1671335697174072} 11/07/2021 10:41:54 - INFO - __main__ - Step 95751: {'lr': 0.00014823504639302905, 'samples': 18384192, 'steps': 95750, 'loss/train': 0.904188871383667} 11/07/2021 10:41:54 - INFO - __main__ - Step 95752: {'lr': 0.0001482301992276882, 'samples': 18384384, 'steps': 95751, 'loss/train': 1.4206331968307495} 11/07/2021 10:41:55 - INFO - __main__ - Step 95753: {'lr': 0.00014822535210820326, 'samples': 18384576, 'steps': 95752, 'loss/train': 1.2169923782348633} 11/07/2021 10:41:56 - INFO - __main__ - Step 95754: {'lr': 0.0001482205050345764, 'samples': 18384768, 'steps': 95753, 'loss/train': 1.5191079378128052} 11/07/2021 10:41:56 - INFO - __main__ - Step 95755: {'lr': 0.00014821565800680984, 'samples': 18384960, 'steps': 95754, 'loss/train': 1.2174409627914429} 11/07/2021 10:41:56 - INFO - __main__ - Step 95756: {'lr': 0.00014821081102490575, 'samples': 18385152, 'steps': 95755, 'loss/train': 1.2824580669403076} 11/07/2021 10:41:57 - INFO - __main__ - Step 95757: {'lr': 0.00014820596408886627, 'samples': 18385344, 'steps': 95756, 'loss/train': 1.400281310081482} 11/07/2021 10:41:58 - INFO - __main__ - Step 95758: {'lr': 0.00014820111719869358, 'samples': 18385536, 'steps': 95757, 'loss/train': 1.5498473644256592} 11/07/2021 10:41:58 - INFO - __main__ - Step 95759: {'lr': 0.00014819627035439, 'samples': 18385728, 'steps': 95758, 'loss/train': 1.786362886428833} 11/07/2021 10:41:58 - INFO - __main__ - Step 95760: {'lr': 0.0001481914235559575, 'samples': 18385920, 'steps': 95759, 'loss/train': 1.2058525085449219} 11/07/2021 10:41:59 - INFO - __main__ - Step 95761: {'lr': 0.0001481865768033984, 'samples': 18386112, 'steps': 95760, 'loss/train': 1.356045126914978} 11/07/2021 10:41:59 - INFO - __main__ - Step 95762: {'lr': 0.00014818173009671485, 'samples': 18386304, 'steps': 95761, 'loss/train': 1.3424873352050781} 11/07/2021 10:42:00 - INFO - __main__ - Step 95763: {'lr': 0.00014817688343590903, 'samples': 18386496, 'steps': 95762, 'loss/train': 1.2996078729629517} 11/07/2021 10:42:00 - INFO - __main__ - Step 95764: {'lr': 0.00014817203682098318, 'samples': 18386688, 'steps': 95763, 'loss/train': 1.3018323183059692} 11/07/2021 10:42:01 - INFO - __main__ - Step 95765: {'lr': 0.00014816719025193939, 'samples': 18386880, 'steps': 95764, 'loss/train': 1.121256709098816} 11/07/2021 10:42:01 - INFO - __main__ - Step 95766: {'lr': 0.0001481623437287799, 'samples': 18387072, 'steps': 95765, 'loss/train': 1.5967735052108765} 11/07/2021 10:42:01 - INFO - __main__ - Step 95767: {'lr': 0.00014815749725150695, 'samples': 18387264, 'steps': 95766, 'loss/train': 1.3278133869171143} 11/07/2021 10:42:02 - INFO - __main__ - Step 95768: {'lr': 0.00014815265082012265, 'samples': 18387456, 'steps': 95767, 'loss/train': 1.7030959129333496} 11/07/2021 10:42:03 - INFO - __main__ - Step 95769: {'lr': 0.00014814780443462913, 'samples': 18387648, 'steps': 95768, 'loss/train': 1.6110601425170898} 11/07/2021 10:42:03 - INFO - __main__ - Step 95770: {'lr': 0.00014814295809502864, 'samples': 18387840, 'steps': 95769, 'loss/train': 1.6957393884658813} 11/07/2021 10:42:04 - INFO - __main__ - Step 95771: {'lr': 0.0001481381118013234, 'samples': 18388032, 'steps': 95770, 'loss/train': 1.1188685894012451} 11/07/2021 10:42:04 - INFO - __main__ - Step 95772: {'lr': 0.0001481332655535156, 'samples': 18388224, 'steps': 95771, 'loss/train': 0.7719798684120178} 11/07/2021 10:42:04 - INFO - __main__ - Step 95773: {'lr': 0.00014812841935160731, 'samples': 18388416, 'steps': 95772, 'loss/train': 0.831759512424469} 11/07/2021 10:42:05 - INFO - __main__ - Step 95774: {'lr': 0.00014812357319560077, 'samples': 18388608, 'steps': 95773, 'loss/train': 1.0743032693862915} 11/07/2021 10:42:06 - INFO - __main__ - Step 95775: {'lr': 0.00014811872708549823, 'samples': 18388800, 'steps': 95774, 'loss/train': 1.3304756879806519} 11/07/2021 10:42:06 - INFO - __main__ - Step 95776: {'lr': 0.00014811388102130177, 'samples': 18388992, 'steps': 95775, 'loss/train': 1.575857162475586} 11/07/2021 10:42:06 - INFO - __main__ - Step 95777: {'lr': 0.00014810903500301365, 'samples': 18389184, 'steps': 95776, 'loss/train': 2.2610867023468018} 11/07/2021 10:42:07 - INFO - __main__ - Step 95778: {'lr': 0.00014810418903063604, 'samples': 18389376, 'steps': 95777, 'loss/train': 1.7869781255722046} 11/07/2021 10:42:08 - INFO - __main__ - Step 95779: {'lr': 0.00014809934310417108, 'samples': 18389568, 'steps': 95778, 'loss/train': 1.4325395822525024} 11/07/2021 10:42:08 - INFO - __main__ - Step 95780: {'lr': 0.000148094497223621, 'samples': 18389760, 'steps': 95779, 'loss/train': 1.2449756860733032} 11/07/2021 10:42:09 - INFO - __main__ - Step 95781: {'lr': 0.00014808965138898795, 'samples': 18389952, 'steps': 95780, 'loss/train': 1.6964213848114014} 11/07/2021 10:42:09 - INFO - __main__ - Step 95782: {'lr': 0.00014808480560027414, 'samples': 18390144, 'steps': 95781, 'loss/train': 1.7439361810684204} 11/07/2021 10:42:09 - INFO - __main__ - Step 95783: {'lr': 0.00014807995985748174, 'samples': 18390336, 'steps': 95782, 'loss/train': 1.2510567903518677} 11/07/2021 10:42:10 - INFO - __main__ - Step 95784: {'lr': 0.0001480751141606129, 'samples': 18390528, 'steps': 95783, 'loss/train': 1.5084917545318604} 11/07/2021 10:42:11 - INFO - __main__ - Step 95785: {'lr': 0.00014807026850966994, 'samples': 18390720, 'steps': 95784, 'loss/train': 1.3954168558120728} 11/07/2021 10:42:11 - INFO - __main__ - Step 95786: {'lr': 0.0001480654229046549, 'samples': 18390912, 'steps': 95785, 'loss/train': 1.323442816734314} 11/07/2021 10:42:11 - INFO - __main__ - Step 95787: {'lr': 0.00014806057734557, 'samples': 18391104, 'steps': 95786, 'loss/train': 1.1995940208435059} 11/07/2021 10:42:12 - INFO - __main__ - Step 95788: {'lr': 0.00014805573183241738, 'samples': 18391296, 'steps': 95787, 'loss/train': 1.4381585121154785} 11/07/2021 10:42:13 - INFO - __main__ - Step 95789: {'lr': 0.00014805088636519938, 'samples': 18391488, 'steps': 95788, 'loss/train': 1.302580714225769} 11/07/2021 10:42:13 - INFO - __main__ - Step 95790: {'lr': 0.00014804604094391803, 'samples': 18391680, 'steps': 95789, 'loss/train': 1.7645186185836792} 11/07/2021 10:42:14 - INFO - __main__ - Step 95791: {'lr': 0.00014804119556857554, 'samples': 18391872, 'steps': 95790, 'loss/train': 1.5422266721725464} 11/07/2021 10:42:14 - INFO - __main__ - Step 95792: {'lr': 0.0001480363502391741, 'samples': 18392064, 'steps': 95791, 'loss/train': 1.417860984802246} 11/07/2021 10:42:14 - INFO - __main__ - Step 95793: {'lr': 0.00014803150495571593, 'samples': 18392256, 'steps': 95792, 'loss/train': 1.399399757385254} 11/07/2021 10:42:15 - INFO - __main__ - Step 95794: {'lr': 0.00014802665971820318, 'samples': 18392448, 'steps': 95793, 'loss/train': 0.9073792099952698} 11/07/2021 10:42:16 - INFO - __main__ - Step 95795: {'lr': 0.00014802181452663803, 'samples': 18392640, 'steps': 95794, 'loss/train': 1.31370210647583} 11/07/2021 10:42:16 - INFO - __main__ - Step 95796: {'lr': 0.00014801696938102272, 'samples': 18392832, 'steps': 95795, 'loss/train': 1.315752625465393} 11/07/2021 10:42:16 - INFO - __main__ - Step 95797: {'lr': 0.00014801212428135934, 'samples': 18393024, 'steps': 95796, 'loss/train': 1.2805992364883423} 11/07/2021 10:42:17 - INFO - __main__ - Step 95798: {'lr': 0.00014800727922765016, 'samples': 18393216, 'steps': 95797, 'loss/train': 1.5577119588851929} 11/07/2021 10:42:17 - INFO - __main__ - Step 95799: {'lr': 0.00014800243421989734, 'samples': 18393408, 'steps': 95798, 'loss/train': 1.559122085571289} 11/07/2021 10:42:18 - INFO - __main__ - Step 95800: {'lr': 0.00014799758925810309, 'samples': 18393600, 'steps': 95799, 'loss/train': 1.5253154039382935} 11/07/2021 10:42:18 - INFO - __main__ - Step 95801: {'lr': 0.0001479927443422695, 'samples': 18393792, 'steps': 95800, 'loss/train': 1.5522593259811401} 11/07/2021 10:42:19 - INFO - __main__ - Step 95802: {'lr': 0.00014798789947239878, 'samples': 18393984, 'steps': 95801, 'loss/train': 1.4650177955627441} 11/07/2021 10:42:19 - INFO - __main__ - Step 95803: {'lr': 0.00014798305464849316, 'samples': 18394176, 'steps': 95802, 'loss/train': 1.5946019887924194} 11/07/2021 10:42:19 - INFO - __main__ - Step 95804: {'lr': 0.00014797820987055477, 'samples': 18394368, 'steps': 95803, 'loss/train': 1.018606424331665} 11/07/2021 10:42:20 - INFO - __main__ - Step 95805: {'lr': 0.00014797336513858584, 'samples': 18394560, 'steps': 95804, 'loss/train': 1.7578163146972656} 11/07/2021 10:42:21 - INFO - __main__ - Step 95806: {'lr': 0.00014796852045258855, 'samples': 18394752, 'steps': 95805, 'loss/train': 1.83645761013031} 11/07/2021 10:42:21 - INFO - __main__ - Step 95807: {'lr': 0.00014796367581256507, 'samples': 18394944, 'steps': 95806, 'loss/train': 0.07223570346832275} 11/07/2021 10:42:21 - INFO - __main__ - Step 95808: {'lr': 0.00014795883121851755, 'samples': 18395136, 'steps': 95807, 'loss/train': 1.5147444009780884} 11/07/2021 10:42:22 - INFO - __main__ - Step 95809: {'lr': 0.00014795398667044824, 'samples': 18395328, 'steps': 95808, 'loss/train': 1.4303280115127563} 11/07/2021 10:42:22 - INFO - __main__ - Step 95810: {'lr': 0.00014794914216835928, 'samples': 18395520, 'steps': 95809, 'loss/train': 1.6357002258300781} 11/07/2021 10:42:23 - INFO - __main__ - Step 95811: {'lr': 0.00014794429771225289, 'samples': 18395712, 'steps': 95810, 'loss/train': 1.2810306549072266} 11/07/2021 10:42:24 - INFO - __main__ - Step 95812: {'lr': 0.00014793945330213127, 'samples': 18395904, 'steps': 95811, 'loss/train': 1.6617158651351929} 11/07/2021 10:42:24 - INFO - __main__ - Step 95813: {'lr': 0.00014793460893799647, 'samples': 18396096, 'steps': 95812, 'loss/train': 1.5087082386016846} 11/07/2021 10:42:24 - INFO - __main__ - Step 95814: {'lr': 0.0001479297646198508, 'samples': 18396288, 'steps': 95813, 'loss/train': 1.8967548608779907} 11/07/2021 10:42:25 - INFO - __main__ - Step 95815: {'lr': 0.00014792492034769637, 'samples': 18396480, 'steps': 95814, 'loss/train': 1.5050129890441895} 11/07/2021 10:42:26 - INFO - __main__ - Step 95816: {'lr': 0.0001479200761215354, 'samples': 18396672, 'steps': 95815, 'loss/train': 0.8539024591445923} 11/07/2021 10:42:26 - INFO - __main__ - Step 95817: {'lr': 0.00014791523194137006, 'samples': 18396864, 'steps': 95816, 'loss/train': 1.3360778093338013} 11/07/2021 10:42:26 - INFO - __main__ - Step 95818: {'lr': 0.00014791038780720257, 'samples': 18397056, 'steps': 95817, 'loss/train': 1.4591269493103027} 11/07/2021 10:42:27 - INFO - __main__ - Step 95819: {'lr': 0.00014790554371903503, 'samples': 18397248, 'steps': 95818, 'loss/train': 1.1559598445892334} 11/07/2021 10:42:27 - INFO - __main__ - Step 95820: {'lr': 0.00014790069967686974, 'samples': 18397440, 'steps': 95819, 'loss/train': 1.6518620252609253} 11/07/2021 10:42:28 - INFO - __main__ - Step 95821: {'lr': 0.0001478958556807088, 'samples': 18397632, 'steps': 95820, 'loss/train': 0.8159639239311218} 11/07/2021 10:42:29 - INFO - __main__ - Step 95822: {'lr': 0.0001478910117305544, 'samples': 18397824, 'steps': 95821, 'loss/train': 1.556949496269226} 11/07/2021 10:42:29 - INFO - __main__ - Step 95823: {'lr': 0.00014788616782640874, 'samples': 18398016, 'steps': 95822, 'loss/train': 1.1957985162734985} 11/07/2021 10:42:29 - INFO - __main__ - Step 95824: {'lr': 0.00014788132396827396, 'samples': 18398208, 'steps': 95823, 'loss/train': 1.4898884296417236} 11/07/2021 10:42:30 - INFO - __main__ - Step 95825: {'lr': 0.00014787648015615235, 'samples': 18398400, 'steps': 95824, 'loss/train': 1.1022629737854004} 11/07/2021 10:42:31 - INFO - __main__ - Step 95826: {'lr': 0.00014787163639004607, 'samples': 18398592, 'steps': 95825, 'loss/train': 1.2019702196121216} 11/07/2021 10:42:31 - INFO - __main__ - Step 95827: {'lr': 0.00014786679266995718, 'samples': 18398784, 'steps': 95826, 'loss/train': 1.405630111694336} 11/07/2021 10:42:31 - INFO - __main__ - Step 95828: {'lr': 0.00014786194899588792, 'samples': 18398976, 'steps': 95827, 'loss/train': 1.1731616258621216} 11/07/2021 10:42:32 - INFO - __main__ - Step 95829: {'lr': 0.0001478571053678405, 'samples': 18399168, 'steps': 95828, 'loss/train': 1.7392977476119995} 11/07/2021 10:42:32 - INFO - __main__ - Step 95830: {'lr': 0.00014785226178581708, 'samples': 18399360, 'steps': 95829, 'loss/train': 1.8139430284500122} 11/07/2021 10:42:33 - INFO - __main__ - Step 95831: {'lr': 0.00014784741824981986, 'samples': 18399552, 'steps': 95830, 'loss/train': 0.9945093393325806} 11/07/2021 10:42:33 - INFO - __main__ - Step 95832: {'lr': 0.000147842574759851, 'samples': 18399744, 'steps': 95831, 'loss/train': 1.4129360914230347} 11/07/2021 10:42:34 - INFO - __main__ - Step 95833: {'lr': 0.00014783773131591278, 'samples': 18399936, 'steps': 95832, 'loss/train': 0.9811636209487915} 11/07/2021 10:42:34 - INFO - __main__ - Step 95834: {'lr': 0.00014783288791800722, 'samples': 18400128, 'steps': 95833, 'loss/train': 1.3325814008712769} 11/07/2021 10:42:34 - INFO - __main__ - Step 95835: {'lr': 0.0001478280445661366, 'samples': 18400320, 'steps': 95834, 'loss/train': 1.0128921270370483} 11/07/2021 10:42:35 - INFO - __main__ - Step 95836: {'lr': 0.0001478232012603031, 'samples': 18400512, 'steps': 95835, 'loss/train': 1.504791498184204} 11/07/2021 10:42:36 - INFO - __main__ - Step 95837: {'lr': 0.00014781835800050888, 'samples': 18400704, 'steps': 95836, 'loss/train': 1.9228144884109497} 11/07/2021 10:42:36 - INFO - __main__ - Step 95838: {'lr': 0.00014781351478675614, 'samples': 18400896, 'steps': 95837, 'loss/train': 0.9661620855331421} 11/07/2021 10:42:36 - INFO - __main__ - Step 95839: {'lr': 0.00014780867161904717, 'samples': 18401088, 'steps': 95838, 'loss/train': 1.254683256149292} 11/07/2021 10:42:37 - INFO - __main__ - Step 95840: {'lr': 0.00014780382849738388, 'samples': 18401280, 'steps': 95839, 'loss/train': 0.7286389470100403} 11/07/2021 10:42:37 - INFO - __main__ - Step 95841: {'lr': 0.00014779898542176864, 'samples': 18401472, 'steps': 95840, 'loss/train': 1.3272887468338013} 11/07/2021 10:42:38 - INFO - __main__ - Step 95842: {'lr': 0.00014779414239220363, 'samples': 18401664, 'steps': 95841, 'loss/train': 1.457060694694519} 11/07/2021 10:42:38 - INFO - __main__ - Step 95843: {'lr': 0.00014778929940869096, 'samples': 18401856, 'steps': 95842, 'loss/train': 1.0637156963348389} 11/07/2021 10:42:39 - INFO - __main__ - Step 95844: {'lr': 0.00014778445647123284, 'samples': 18402048, 'steps': 95843, 'loss/train': 0.9747068285942078} 11/07/2021 10:42:39 - INFO - __main__ - Step 95845: {'lr': 0.00014777961357983148, 'samples': 18402240, 'steps': 95844, 'loss/train': 1.4416714906692505} 11/07/2021 10:42:40 - INFO - __main__ - Step 95846: {'lr': 0.00014777477073448907, 'samples': 18402432, 'steps': 95845, 'loss/train': 1.4231929779052734} 11/07/2021 10:42:41 - INFO - __main__ - Step 95847: {'lr': 0.00014776992793520777, 'samples': 18402624, 'steps': 95846, 'loss/train': 0.9713525772094727} 11/07/2021 10:42:41 - INFO - __main__ - Step 95848: {'lr': 0.00014776508518198978, 'samples': 18402816, 'steps': 95847, 'loss/train': 1.0910167694091797} 11/07/2021 10:42:41 - INFO - __main__ - Step 95849: {'lr': 0.00014776024247483725, 'samples': 18403008, 'steps': 95848, 'loss/train': 1.3780393600463867} 11/07/2021 10:42:42 - INFO - __main__ - Step 95850: {'lr': 0.00014775539981375235, 'samples': 18403200, 'steps': 95849, 'loss/train': 1.0883675813674927} 11/07/2021 10:42:42 - INFO - __main__ - Step 95851: {'lr': 0.0001477505571987373, 'samples': 18403392, 'steps': 95850, 'loss/train': 1.1187971830368042} 11/07/2021 10:42:43 - INFO - __main__ - Step 95852: {'lr': 0.0001477457146297943, 'samples': 18403584, 'steps': 95851, 'loss/train': 1.620156168937683} 11/07/2021 10:42:43 - INFO - __main__ - Step 95853: {'lr': 0.00014774087210692557, 'samples': 18403776, 'steps': 95852, 'loss/train': 1.3785029649734497} 11/07/2021 10:42:44 - INFO - __main__ - Step 95854: {'lr': 0.00014773602963013316, 'samples': 18403968, 'steps': 95853, 'loss/train': 1.3456157445907593} 11/07/2021 10:42:44 - INFO - __main__ - Step 95855: {'lr': 0.00014773118719941928, 'samples': 18404160, 'steps': 95854, 'loss/train': 1.2503528594970703} 11/07/2021 10:42:45 - INFO - __main__ - Step 95856: {'lr': 0.00014772634481478617, 'samples': 18404352, 'steps': 95855, 'loss/train': 1.2434630393981934} 11/07/2021 10:42:45 - INFO - __main__ - Step 95857: {'lr': 0.00014772150247623598, 'samples': 18404544, 'steps': 95856, 'loss/train': 1.5629940032958984} 11/07/2021 10:42:46 - INFO - __main__ - Step 95858: {'lr': 0.0001477166601837709, 'samples': 18404736, 'steps': 95857, 'loss/train': 1.2722184658050537} 11/07/2021 10:42:46 - INFO - __main__ - Step 95859: {'lr': 0.00014771181793739313, 'samples': 18404928, 'steps': 95858, 'loss/train': 1.0778391361236572} 11/07/2021 10:42:47 - INFO - __main__ - Step 95860: {'lr': 0.00014770697573710485, 'samples': 18405120, 'steps': 95859, 'loss/train': 0.7862834930419922} 11/07/2021 10:42:47 - INFO - __main__ - Step 95861: {'lr': 0.00014770213358290818, 'samples': 18405312, 'steps': 95860, 'loss/train': 1.4640074968338013} 11/07/2021 10:42:47 - INFO - __main__ - Step 95862: {'lr': 0.00014769729147480538, 'samples': 18405504, 'steps': 95861, 'loss/train': 0.9866035580635071} 11/07/2021 10:42:48 - INFO - __main__ - Step 95863: {'lr': 0.00014769244941279858, 'samples': 18405696, 'steps': 95862, 'loss/train': 0.7885811924934387} 11/07/2021 10:42:49 - INFO - __main__ - Step 95864: {'lr': 0.00014768760739689002, 'samples': 18405888, 'steps': 95863, 'loss/train': 1.2231336832046509} 11/07/2021 10:42:49 - INFO - __main__ - Step 95865: {'lr': 0.00014768276542708182, 'samples': 18406080, 'steps': 95864, 'loss/train': 0.06281472742557526} 11/07/2021 10:42:49 - INFO - __main__ - Step 95866: {'lr': 0.0001476779235033763, 'samples': 18406272, 'steps': 95865, 'loss/train': 1.355322003364563} 11/07/2021 10:42:50 - INFO - __main__ - Step 95867: {'lr': 0.00014767308162577541, 'samples': 18406464, 'steps': 95866, 'loss/train': 1.4989622831344604} 11/07/2021 10:42:51 - INFO - __main__ - Step 95868: {'lr': 0.00014766823979428146, 'samples': 18406656, 'steps': 95867, 'loss/train': 1.0030778646469116} 11/07/2021 10:42:51 - INFO - __main__ - Step 95869: {'lr': 0.00014766339800889665, 'samples': 18406848, 'steps': 95868, 'loss/train': 1.387681484222412} 11/07/2021 10:42:52 - INFO - __main__ - Step 95870: {'lr': 0.0001476585562696231, 'samples': 18407040, 'steps': 95869, 'loss/train': 1.3169804811477661} 11/07/2021 10:42:52 - INFO - __main__ - Step 95871: {'lr': 0.00014765371457646303, 'samples': 18407232, 'steps': 95870, 'loss/train': 1.172415018081665} 11/07/2021 10:42:52 - INFO - __main__ - Step 95872: {'lr': 0.00014764887292941864, 'samples': 18407424, 'steps': 95871, 'loss/train': 0.1278521716594696} 11/07/2021 10:42:53 - INFO - __main__ - Step 95873: {'lr': 0.00014764403132849204, 'samples': 18407616, 'steps': 95872, 'loss/train': 1.2315222024917603} 11/07/2021 10:42:54 - INFO - __main__ - Step 95874: {'lr': 0.0001476391897736855, 'samples': 18407808, 'steps': 95873, 'loss/train': 1.3814603090286255} 11/07/2021 10:42:54 - INFO - __main__ - Step 95875: {'lr': 0.00014763434826500115, 'samples': 18408000, 'steps': 95874, 'loss/train': 1.4088082313537598} 11/07/2021 10:42:54 - INFO - __main__ - Step 95876: {'lr': 0.0001476295068024412, 'samples': 18408192, 'steps': 95875, 'loss/train': 1.4255061149597168} 11/07/2021 10:42:55 - INFO - __main__ - Step 95877: {'lr': 0.00014762466538600777, 'samples': 18408384, 'steps': 95876, 'loss/train': 1.8579330444335938} 11/07/2021 10:42:56 - INFO - __main__ - Step 95878: {'lr': 0.00014761982401570312, 'samples': 18408576, 'steps': 95877, 'loss/train': 0.8281406164169312} 11/07/2021 10:42:56 - INFO - __main__ - Step 95879: {'lr': 0.0001476149826915294, 'samples': 18408768, 'steps': 95878, 'loss/train': 1.362657070159912} 11/07/2021 10:42:57 - INFO - __main__ - Step 95880: {'lr': 0.0001476101414134889, 'samples': 18408960, 'steps': 95879, 'loss/train': 1.3073310852050781} 11/07/2021 10:42:57 - INFO - __main__ - Step 95881: {'lr': 0.0001476053001815836, 'samples': 18409152, 'steps': 95880, 'loss/train': 1.2016621828079224} 11/07/2021 10:42:57 - INFO - __main__ - Step 95882: {'lr': 0.0001476004589958157, 'samples': 18409344, 'steps': 95881, 'loss/train': 1.073306679725647} 11/07/2021 10:42:59 - INFO - __main__ - Step 95883: {'lr': 0.0001475956178561875, 'samples': 18409536, 'steps': 95882, 'loss/train': 1.481302261352539} 11/07/2021 10:42:59 - INFO - __main__ - Step 95884: {'lr': 0.00014759077676270113, 'samples': 18409728, 'steps': 95883, 'loss/train': 1.0347317457199097} 11/07/2021 10:42:59 - INFO - __main__ - Step 95885: {'lr': 0.00014758593571535878, 'samples': 18409920, 'steps': 95884, 'loss/train': 1.4501681327819824} 11/07/2021 10:43:00 - INFO - __main__ - Step 95886: {'lr': 0.00014758109471416263, 'samples': 18410112, 'steps': 95885, 'loss/train': 1.151507019996643} 11/07/2021 10:43:00 - INFO - __main__ - Step 95887: {'lr': 0.00014757625375911486, 'samples': 18410304, 'steps': 95886, 'loss/train': 1.584752082824707} 11/07/2021 10:43:01 - INFO - __main__ - Step 95888: {'lr': 0.00014757141285021762, 'samples': 18410496, 'steps': 95887, 'loss/train': 0.9608004093170166} 11/07/2021 10:43:02 - INFO - __main__ - Step 95889: {'lr': 0.00014756657198747314, 'samples': 18410688, 'steps': 95888, 'loss/train': 1.3934948444366455} 11/07/2021 10:43:02 - INFO - __main__ - Step 95890: {'lr': 0.0001475617311708836, 'samples': 18410880, 'steps': 95889, 'loss/train': 0.4710240364074707} 11/07/2021 10:43:03 - INFO - __main__ - Step 95891: {'lr': 0.00014755689040045117, 'samples': 18411072, 'steps': 95890, 'loss/train': 1.626760482788086} 11/07/2021 10:43:03 - INFO - __main__ - Step 95892: {'lr': 0.00014755204967617803, 'samples': 18411264, 'steps': 95891, 'loss/train': 1.6102244853973389} 11/07/2021 10:43:03 - INFO - __main__ - Step 95893: {'lr': 0.00014754720899806637, 'samples': 18411456, 'steps': 95892, 'loss/train': 1.6116679906845093} 11/07/2021 10:43:04 - INFO - __main__ - Step 95894: {'lr': 0.0001475423683661183, 'samples': 18411648, 'steps': 95893, 'loss/train': 1.6929147243499756} 11/07/2021 10:43:05 - INFO - __main__ - Step 95895: {'lr': 0.00014753752778033608, 'samples': 18411840, 'steps': 95894, 'loss/train': 1.6184513568878174} 11/07/2021 10:43:05 - INFO - __main__ - Step 95896: {'lr': 0.00014753268724072187, 'samples': 18412032, 'steps': 95895, 'loss/train': 0.7942976355552673} 11/07/2021 10:43:05 - INFO - __main__ - Step 95897: {'lr': 0.00014752784674727784, 'samples': 18412224, 'steps': 95896, 'loss/train': 1.5460938215255737} 11/07/2021 10:43:06 - INFO - __main__ - Step 95898: {'lr': 0.00014752300630000616, 'samples': 18412416, 'steps': 95897, 'loss/train': 1.5076730251312256} 11/07/2021 10:43:06 - INFO - __main__ - Step 95899: {'lr': 0.00014751816589890908, 'samples': 18412608, 'steps': 95898, 'loss/train': 1.3738305568695068} 11/07/2021 10:43:07 - INFO - __main__ - Step 95900: {'lr': 0.0001475133255439887, 'samples': 18412800, 'steps': 95899, 'loss/train': 1.393396258354187} 11/07/2021 10:43:07 - INFO - __main__ - Step 95901: {'lr': 0.00014750848523524724, 'samples': 18412992, 'steps': 95900, 'loss/train': 1.410862922668457} 11/07/2021 10:43:08 - INFO - __main__ - Step 95902: {'lr': 0.0001475036449726869, 'samples': 18413184, 'steps': 95901, 'loss/train': 1.5620986223220825} 11/07/2021 10:43:08 - INFO - __main__ - Step 95903: {'lr': 0.00014749880475630983, 'samples': 18413376, 'steps': 95902, 'loss/train': 1.6242953538894653} 11/07/2021 10:43:08 - INFO - __main__ - Step 95904: {'lr': 0.00014749396458611818, 'samples': 18413568, 'steps': 95903, 'loss/train': 0.9633151292800903} 11/07/2021 10:43:09 - INFO - __main__ - Step 95905: {'lr': 0.00014748912446211422, 'samples': 18413760, 'steps': 95904, 'loss/train': 1.8085042238235474} 11/07/2021 10:43:10 - INFO - __main__ - Step 95906: {'lr': 0.0001474842843843001, 'samples': 18413952, 'steps': 95905, 'loss/train': 1.4243634939193726} 11/07/2021 10:43:10 - INFO - __main__ - Step 95907: {'lr': 0.0001474794443526779, 'samples': 18414144, 'steps': 95906, 'loss/train': 1.1696505546569824} 11/07/2021 10:43:11 - INFO - __main__ - Step 95908: {'lr': 0.0001474746043672499, 'samples': 18414336, 'steps': 95907, 'loss/train': 1.5443180799484253} 11/07/2021 10:43:11 - INFO - __main__ - Step 95909: {'lr': 0.0001474697644280183, 'samples': 18414528, 'steps': 95908, 'loss/train': 1.4275400638580322} 11/07/2021 10:43:11 - INFO - __main__ - Step 95910: {'lr': 0.0001474649245349852, 'samples': 18414720, 'steps': 95909, 'loss/train': 1.5386508703231812} 11/07/2021 10:43:12 - INFO - __main__ - Step 95911: {'lr': 0.0001474600846881528, 'samples': 18414912, 'steps': 95910, 'loss/train': 0.7927108407020569} 11/07/2021 10:43:13 - INFO - __main__ - Step 95912: {'lr': 0.00014745524488752343, 'samples': 18415104, 'steps': 95911, 'loss/train': 3.8215084075927734} 11/07/2021 10:43:13 - INFO - __main__ - Step 95913: {'lr': 0.00014745040513309903, 'samples': 18415296, 'steps': 95912, 'loss/train': 1.5134236812591553} 11/07/2021 10:43:13 - INFO - __main__ - Step 95914: {'lr': 0.00014744556542488192, 'samples': 18415488, 'steps': 95913, 'loss/train': 1.8850722312927246} 11/07/2021 10:43:14 - INFO - __main__ - Step 95915: {'lr': 0.00014744072576287426, 'samples': 18415680, 'steps': 95914, 'loss/train': 1.4746726751327515} 11/07/2021 10:43:14 - INFO - __main__ - Step 95916: {'lr': 0.0001474358861470782, 'samples': 18415872, 'steps': 95915, 'loss/train': 1.6526955366134644} 11/07/2021 10:43:15 - INFO - __main__ - Step 95917: {'lr': 0.00014743104657749596, 'samples': 18416064, 'steps': 95916, 'loss/train': 1.2664177417755127} 11/07/2021 10:43:16 - INFO - __main__ - Step 95918: {'lr': 0.00014742620705412974, 'samples': 18416256, 'steps': 95917, 'loss/train': 1.298395037651062} 11/07/2021 10:43:16 - INFO - __main__ - Step 95919: {'lr': 0.00014742136757698164, 'samples': 18416448, 'steps': 95918, 'loss/train': 0.9399012327194214} 11/07/2021 10:43:16 - INFO - __main__ - Step 95920: {'lr': 0.00014741652814605395, 'samples': 18416640, 'steps': 95919, 'loss/train': 1.6642521619796753} 11/07/2021 10:43:17 - INFO - __main__ - Step 95921: {'lr': 0.00014741168876134875, 'samples': 18416832, 'steps': 95920, 'loss/train': 1.3552790880203247} 11/07/2021 10:43:18 - INFO - __main__ - Step 95922: {'lr': 0.00014740684942286824, 'samples': 18417024, 'steps': 95921, 'loss/train': 1.419674038887024} 11/07/2021 10:43:18 - INFO - __main__ - Step 95923: {'lr': 0.00014740201013061473, 'samples': 18417216, 'steps': 95922, 'loss/train': 1.7301967144012451} 11/07/2021 10:43:18 - INFO - __main__ - Step 95924: {'lr': 0.00014739717088459018, 'samples': 18417408, 'steps': 95923, 'loss/train': 1.5733952522277832} 11/07/2021 10:43:19 - INFO - __main__ - Step 95925: {'lr': 0.00014739233168479688, 'samples': 18417600, 'steps': 95924, 'loss/train': 1.5035039186477661} 11/07/2021 10:43:19 - INFO - __main__ - Step 95926: {'lr': 0.00014738749253123706, 'samples': 18417792, 'steps': 95925, 'loss/train': 0.8960781693458557} 11/07/2021 10:43:20 - INFO - __main__ - Step 95927: {'lr': 0.00014738265342391282, 'samples': 18417984, 'steps': 95926, 'loss/train': 1.156923532485962} 11/07/2021 10:43:20 - INFO - __main__ - Step 95928: {'lr': 0.00014737781436282638, 'samples': 18418176, 'steps': 95927, 'loss/train': 1.6303529739379883} 11/07/2021 10:43:21 - INFO - __main__ - Step 95929: {'lr': 0.0001473729753479799, 'samples': 18418368, 'steps': 95928, 'loss/train': 2.05132794380188} 11/07/2021 10:43:21 - INFO - __main__ - Step 95930: {'lr': 0.00014736813637937558, 'samples': 18418560, 'steps': 95929, 'loss/train': 1.6854907274246216} 11/07/2021 10:43:21 - INFO - __main__ - Step 95931: {'lr': 0.0001473632974570156, 'samples': 18418752, 'steps': 95930, 'loss/train': 1.1309560537338257} 11/07/2021 10:43:22 - INFO - __main__ - Step 95932: {'lr': 0.00014735845858090214, 'samples': 18418944, 'steps': 95931, 'loss/train': 1.1393427848815918} 11/07/2021 10:43:23 - INFO - __main__ - Step 95933: {'lr': 0.00014735361975103743, 'samples': 18419136, 'steps': 95932, 'loss/train': 1.2207417488098145} 11/07/2021 10:43:23 - INFO - __main__ - Step 95934: {'lr': 0.00014734878096742357, 'samples': 18419328, 'steps': 95933, 'loss/train': 0.7810671925544739} 11/07/2021 10:43:24 - INFO - __main__ - Step 95935: {'lr': 0.00014734394223006272, 'samples': 18419520, 'steps': 95934, 'loss/train': 1.3639369010925293} 11/07/2021 10:43:24 - INFO - __main__ - Step 95936: {'lr': 0.00014733910353895713, 'samples': 18419712, 'steps': 95935, 'loss/train': 0.9242664575576782} 11/07/2021 10:43:24 - INFO - __main__ - Step 95937: {'lr': 0.00014733426489410895, 'samples': 18419904, 'steps': 95936, 'loss/train': 2.0713539123535156} 11/07/2021 10:43:25 - INFO - __main__ - Step 95938: {'lr': 0.00014732942629552034, 'samples': 18420096, 'steps': 95937, 'loss/train': 1.1358331441879272} 11/07/2021 10:43:26 - INFO - __main__ - Step 95939: {'lr': 0.00014732458774319352, 'samples': 18420288, 'steps': 95938, 'loss/train': 1.5476233959197998} 11/07/2021 10:43:26 - INFO - __main__ - Step 95940: {'lr': 0.00014731974923713065, 'samples': 18420480, 'steps': 95939, 'loss/train': 1.4713212251663208} 11/07/2021 10:43:26 - INFO - __main__ - Step 95941: {'lr': 0.00014731491077733396, 'samples': 18420672, 'steps': 95940, 'loss/train': 1.3995044231414795} 11/07/2021 10:43:27 - INFO - __main__ - Step 95942: {'lr': 0.00014731007236380554, 'samples': 18420864, 'steps': 95941, 'loss/train': 1.5346429347991943} 11/07/2021 10:43:28 - INFO - __main__ - Step 95943: {'lr': 0.00014730523399654762, 'samples': 18421056, 'steps': 95942, 'loss/train': 0.6238558292388916} 11/07/2021 10:43:28 - INFO - __main__ - Step 95944: {'lr': 0.00014730039567556239, 'samples': 18421248, 'steps': 95943, 'loss/train': 1.4600675106048584} 11/07/2021 10:43:28 - INFO - __main__ - Step 95945: {'lr': 0.000147295557400852, 'samples': 18421440, 'steps': 95944, 'loss/train': 1.4656639099121094} 11/07/2021 10:43:29 - INFO - __main__ - Step 95946: {'lr': 0.00014729071917241865, 'samples': 18421632, 'steps': 95945, 'loss/train': 0.3697997033596039} 11/07/2021 10:43:29 - INFO - __main__ - Step 95947: {'lr': 0.00014728588099026464, 'samples': 18421824, 'steps': 95946, 'loss/train': 1.099017858505249} 11/07/2021 10:43:30 - INFO - __main__ - Step 95948: {'lr': 0.0001472810428543919, 'samples': 18422016, 'steps': 95947, 'loss/train': 0.881122350692749} 11/07/2021 10:43:30 - INFO - __main__ - Step 95949: {'lr': 0.00014727620476480275, 'samples': 18422208, 'steps': 95948, 'loss/train': 1.302836537361145} 11/07/2021 10:43:31 - INFO - __main__ - Step 95950: {'lr': 0.00014727136672149937, 'samples': 18422400, 'steps': 95949, 'loss/train': 1.6321163177490234} 11/07/2021 10:43:31 - INFO - __main__ - Step 95951: {'lr': 0.00014726652872448394, 'samples': 18422592, 'steps': 95950, 'loss/train': 1.1988699436187744} 11/07/2021 10:43:31 - INFO - __main__ - Step 95952: {'lr': 0.00014726169077375857, 'samples': 18422784, 'steps': 95951, 'loss/train': 1.467780590057373} 11/07/2021 10:43:33 - INFO - __main__ - Step 95953: {'lr': 0.00014725685286932555, 'samples': 18422976, 'steps': 95952, 'loss/train': 1.4872552156448364} 11/07/2021 10:43:33 - INFO - __main__ - Step 95954: {'lr': 0.00014725201501118696, 'samples': 18423168, 'steps': 95953, 'loss/train': 1.1305299997329712} 11/07/2021 10:43:33 - INFO - __main__ - Step 95955: {'lr': 0.00014724717719934505, 'samples': 18423360, 'steps': 95954, 'loss/train': 1.4427387714385986} 11/07/2021 10:43:34 - INFO - __main__ - Step 95956: {'lr': 0.00014724233943380199, 'samples': 18423552, 'steps': 95955, 'loss/train': 0.9189614653587341} 11/07/2021 10:43:34 - INFO - __main__ - Step 95957: {'lr': 0.00014723750171455994, 'samples': 18423744, 'steps': 95956, 'loss/train': 1.5461560487747192} 11/07/2021 10:43:34 - INFO - __main__ - Step 95958: {'lr': 0.00014723266404162105, 'samples': 18423936, 'steps': 95957, 'loss/train': 1.0707542896270752} 11/07/2021 10:43:35 - INFO - __main__ - Step 95959: {'lr': 0.00014722782641498757, 'samples': 18424128, 'steps': 95958, 'loss/train': 1.6669836044311523} 11/07/2021 10:43:36 - INFO - __main__ - Step 95960: {'lr': 0.00014722298883466177, 'samples': 18424320, 'steps': 95959, 'loss/train': 0.6937898993492126} 11/07/2021 10:43:36 - INFO - __main__ - Step 95961: {'lr': 0.00014721815130064555, 'samples': 18424512, 'steps': 95960, 'loss/train': 1.1384061574935913} 11/07/2021 10:43:36 - INFO - __main__ - Step 95962: {'lr': 0.00014721331381294128, 'samples': 18424704, 'steps': 95961, 'loss/train': 1.279589056968689} 11/07/2021 10:43:37 - INFO - __main__ - Step 95963: {'lr': 0.0001472084763715511, 'samples': 18424896, 'steps': 95962, 'loss/train': 1.6626936197280884} 11/07/2021 10:43:38 - INFO - __main__ - Step 95964: {'lr': 0.00014720363897647722, 'samples': 18425088, 'steps': 95963, 'loss/train': 1.4321118593215942} 11/07/2021 10:43:38 - INFO - __main__ - Step 95965: {'lr': 0.00014719880162772175, 'samples': 18425280, 'steps': 95964, 'loss/train': 1.318002700805664} 11/07/2021 10:43:38 - INFO - __main__ - Step 95966: {'lr': 0.0001471939643252869, 'samples': 18425472, 'steps': 95965, 'loss/train': 0.8918560743331909} 11/07/2021 10:43:39 - INFO - __main__ - Step 95967: {'lr': 0.00014718912706917491, 'samples': 18425664, 'steps': 95966, 'loss/train': 1.4224027395248413} 11/07/2021 10:43:39 - INFO - __main__ - Step 95968: {'lr': 0.0001471842898593879, 'samples': 18425856, 'steps': 95967, 'loss/train': 1.7388060092926025} 11/07/2021 10:43:40 - INFO - __main__ - Step 95969: {'lr': 0.00014717945269592802, 'samples': 18426048, 'steps': 95968, 'loss/train': 1.298101544380188} 11/07/2021 10:43:41 - INFO - __main__ - Step 95970: {'lr': 0.00014717461557879757, 'samples': 18426240, 'steps': 95969, 'loss/train': 1.519294023513794} 11/07/2021 10:43:41 - INFO - __main__ - Step 95971: {'lr': 0.0001471697785079986, 'samples': 18426432, 'steps': 95970, 'loss/train': 0.943438708782196} 11/07/2021 10:43:41 - INFO - __main__ - Step 95972: {'lr': 0.00014716494148353336, 'samples': 18426624, 'steps': 95971, 'loss/train': 1.3626558780670166} 11/07/2021 10:43:42 - INFO - __main__ - Step 95973: {'lr': 0.000147160104505404, 'samples': 18426816, 'steps': 95972, 'loss/train': 1.4006975889205933} 11/07/2021 10:43:43 - INFO - __main__ - Step 95974: {'lr': 0.0001471552675736128, 'samples': 18427008, 'steps': 95973, 'loss/train': 0.8534848093986511} 11/07/2021 10:43:43 - INFO - __main__ - Step 95975: {'lr': 0.00014715043068816176, 'samples': 18427200, 'steps': 95974, 'loss/train': 1.4127840995788574} 11/07/2021 10:43:43 - INFO - __main__ - Step 95976: {'lr': 0.00014714559384905316, 'samples': 18427392, 'steps': 95975, 'loss/train': 1.3619717359542847} 11/07/2021 10:43:44 - INFO - __main__ - Step 95977: {'lr': 0.00014714075705628916, 'samples': 18427584, 'steps': 95976, 'loss/train': 1.627258062362671} 11/07/2021 10:43:44 - INFO - __main__ - Step 95978: {'lr': 0.00014713592030987194, 'samples': 18427776, 'steps': 95977, 'loss/train': 1.2417312860488892} 11/07/2021 10:43:45 - INFO - __main__ - Step 95979: {'lr': 0.0001471310836098037, 'samples': 18427968, 'steps': 95978, 'loss/train': 1.664720058441162} 11/07/2021 10:43:46 - INFO - __main__ - Step 95980: {'lr': 0.0001471262469560866, 'samples': 18428160, 'steps': 95979, 'loss/train': 1.5697808265686035} 11/07/2021 10:43:46 - INFO - __main__ - Step 95981: {'lr': 0.0001471214103487228, 'samples': 18428352, 'steps': 95980, 'loss/train': 1.1386511325836182} 11/07/2021 10:43:46 - INFO - __main__ - Step 95982: {'lr': 0.00014711657378771453, 'samples': 18428544, 'steps': 95981, 'loss/train': 1.4121203422546387} 11/07/2021 10:43:47 - INFO - __main__ - Step 95983: {'lr': 0.00014711173727306395, 'samples': 18428736, 'steps': 95982, 'loss/train': 1.3149206638336182} 11/07/2021 10:43:47 - INFO - __main__ - Step 95984: {'lr': 0.00014710690080477323, 'samples': 18428928, 'steps': 95983, 'loss/train': 3.663846015930176} 11/07/2021 10:43:48 - INFO - __main__ - Step 95985: {'lr': 0.00014710206438284457, 'samples': 18429120, 'steps': 95984, 'loss/train': 1.621858835220337} 11/07/2021 10:43:48 - INFO - __main__ - Step 95986: {'lr': 0.00014709722800728008, 'samples': 18429312, 'steps': 95985, 'loss/train': 1.268349528312683} 11/07/2021 10:43:49 - INFO - __main__ - Step 95987: {'lr': 0.00014709239167808215, 'samples': 18429504, 'steps': 95986, 'loss/train': 1.3842644691467285} 11/07/2021 10:43:49 - INFO - __main__ - Step 95988: {'lr': 0.00014708755539525267, 'samples': 18429696, 'steps': 95987, 'loss/train': 1.3944432735443115} 11/07/2021 10:43:49 - INFO - __main__ - Step 95989: {'lr': 0.00014708271915879394, 'samples': 18429888, 'steps': 95988, 'loss/train': 1.8598932027816772} 11/07/2021 10:43:50 - INFO - __main__ - Step 95990: {'lr': 0.00014707788296870817, 'samples': 18430080, 'steps': 95989, 'loss/train': 1.329305648803711} 11/07/2021 10:43:51 - INFO - __main__ - Step 95991: {'lr': 0.0001470730468249975, 'samples': 18430272, 'steps': 95990, 'loss/train': 1.3672707080841064} 11/07/2021 10:43:51 - INFO - __main__ - Step 95992: {'lr': 0.00014706821072766417, 'samples': 18430464, 'steps': 95991, 'loss/train': 0.5841900110244751} 11/07/2021 10:43:51 - INFO - __main__ - Step 95993: {'lr': 0.00014706337467671027, 'samples': 18430656, 'steps': 95992, 'loss/train': 1.1638258695602417} 11/07/2021 10:43:52 - INFO - __main__ - Step 95994: {'lr': 0.00014705853867213802, 'samples': 18430848, 'steps': 95993, 'loss/train': 1.2642074823379517} 11/07/2021 10:43:53 - INFO - __main__ - Step 95995: {'lr': 0.00014705370271394963, 'samples': 18431040, 'steps': 95994, 'loss/train': 1.43070650100708} 11/07/2021 10:43:53 - INFO - __main__ - Step 95996: {'lr': 0.00014704886680214725, 'samples': 18431232, 'steps': 95995, 'loss/train': 1.4283671379089355} 11/07/2021 10:43:53 - INFO - __main__ - Step 95997: {'lr': 0.00014704403093673308, 'samples': 18431424, 'steps': 95996, 'loss/train': 0.7159470319747925} 11/07/2021 10:43:54 - INFO - __main__ - Step 95998: {'lr': 0.00014703919511770925, 'samples': 18431616, 'steps': 95997, 'loss/train': 1.5411605834960938} 11/07/2021 10:43:54 - INFO - __main__ - Step 95999: {'lr': 0.00014703435934507796, 'samples': 18431808, 'steps': 95998, 'loss/train': 1.467491626739502} 11/07/2021 10:43:55 - INFO - __main__ - Step 96000: {'lr': 0.00014702952361884142, 'samples': 18432000, 'steps': 95999, 'loss/train': 1.7424241304397583} 11/07/2021 10:43:56 - INFO - __main__ - Step 96001: {'lr': 0.00014702468793900187, 'samples': 18432192, 'steps': 96000, 'loss/train': 1.4812955856323242} 11/07/2021 10:43:56 - INFO - __main__ - Step 96002: {'lr': 0.00014701985230556133, 'samples': 18432384, 'steps': 96001, 'loss/train': 1.8811588287353516} 11/07/2021 10:43:56 - INFO - __main__ - Step 96003: {'lr': 0.00014701501671852206, 'samples': 18432576, 'steps': 96002, 'loss/train': 1.3295284509658813} 11/07/2021 10:43:57 - INFO - __main__ - Step 96004: {'lr': 0.00014701018117788621, 'samples': 18432768, 'steps': 96003, 'loss/train': 1.480239748954773} 11/07/2021 10:43:57 - INFO - __main__ - Step 96005: {'lr': 0.00014700534568365598, 'samples': 18432960, 'steps': 96004, 'loss/train': 1.0076711177825928} 11/07/2021 10:43:58 - INFO - __main__ - Step 96006: {'lr': 0.0001470005102358336, 'samples': 18433152, 'steps': 96005, 'loss/train': 0.43740731477737427} 11/07/2021 10:43:59 - INFO - __main__ - Step 96007: {'lr': 0.00014699567483442117, 'samples': 18433344, 'steps': 96006, 'loss/train': 1.8401257991790771} 11/07/2021 10:43:59 - INFO - __main__ - Step 96008: {'lr': 0.00014699083947942087, 'samples': 18433536, 'steps': 96007, 'loss/train': 0.13553543388843536} 11/07/2021 10:43:59 - INFO - __main__ - Step 96009: {'lr': 0.00014698600417083495, 'samples': 18433728, 'steps': 96008, 'loss/train': 0.9359025359153748} 11/07/2021 10:44:00 - INFO - __main__ - Step 96010: {'lr': 0.00014698116890866553, 'samples': 18433920, 'steps': 96009, 'loss/train': 2.130449056625366} 11/07/2021 10:44:01 - INFO - __main__ - Step 96011: {'lr': 0.0001469763336929148, 'samples': 18434112, 'steps': 96010, 'loss/train': 0.2956288456916809} 11/07/2021 10:44:01 - INFO - __main__ - Step 96012: {'lr': 0.00014697149852358493, 'samples': 18434304, 'steps': 96011, 'loss/train': 1.0800424814224243} 11/07/2021 10:44:02 - INFO - __main__ - Step 96013: {'lr': 0.00014696666340067817, 'samples': 18434496, 'steps': 96012, 'loss/train': 1.0415072441101074} 11/07/2021 10:44:02 - INFO - __main__ - Step 96014: {'lr': 0.0001469618283241967, 'samples': 18434688, 'steps': 96013, 'loss/train': 0.9979627132415771} 11/07/2021 10:44:02 - INFO - __main__ - Step 96015: {'lr': 0.00014695699329414253, 'samples': 18434880, 'steps': 96014, 'loss/train': 1.0168800354003906} 11/07/2021 10:44:03 - INFO - __main__ - Step 96016: {'lr': 0.00014695215831051796, 'samples': 18435072, 'steps': 96015, 'loss/train': 1.4355316162109375} 11/07/2021 10:44:04 - INFO - __main__ - Step 96017: {'lr': 0.00014694732337332516, 'samples': 18435264, 'steps': 96016, 'loss/train': 1.3868852853775024} 11/07/2021 10:44:04 - INFO - __main__ - Step 96018: {'lr': 0.0001469424884825663, 'samples': 18435456, 'steps': 96017, 'loss/train': 1.739548683166504} 11/07/2021 10:44:04 - INFO - __main__ - Step 96019: {'lr': 0.00014693765363824358, 'samples': 18435648, 'steps': 96018, 'loss/train': 1.4006768465042114} 11/07/2021 10:44:05 - INFO - __main__ - Step 96020: {'lr': 0.00014693281884035916, 'samples': 18435840, 'steps': 96019, 'loss/train': 1.0565481185913086} 11/07/2021 10:44:06 - INFO - __main__ - Step 96021: {'lr': 0.0001469279840889152, 'samples': 18436032, 'steps': 96020, 'loss/train': 1.3169481754302979} 11/07/2021 10:44:06 - INFO - __main__ - Step 96022: {'lr': 0.00014692314938391393, 'samples': 18436224, 'steps': 96021, 'loss/train': 1.5219323635101318} 11/07/2021 10:44:06 - INFO - __main__ - Step 96023: {'lr': 0.0001469183147253575, 'samples': 18436416, 'steps': 96022, 'loss/train': 1.4127572774887085} 11/07/2021 10:44:07 - INFO - __main__ - Step 96024: {'lr': 0.00014691348011324808, 'samples': 18436608, 'steps': 96023, 'loss/train': 1.0712916851043701} 11/07/2021 10:44:07 - INFO - __main__ - Step 96025: {'lr': 0.00014690864554758786, 'samples': 18436800, 'steps': 96024, 'loss/train': 1.637075424194336} 11/07/2021 10:44:07 - INFO - __main__ - Step 96026: {'lr': 0.00014690381102837902, 'samples': 18436992, 'steps': 96025, 'loss/train': 1.4338204860687256} 11/07/2021 10:44:09 - INFO - __main__ - Step 96027: {'lr': 0.00014689897655562376, 'samples': 18437184, 'steps': 96026, 'loss/train': 1.6930519342422485} 11/07/2021 10:44:09 - INFO - __main__ - Step 96028: {'lr': 0.00014689414212932416, 'samples': 18437376, 'steps': 96027, 'loss/train': 1.4992539882659912} 11/07/2021 10:44:09 - INFO - __main__ - Step 96029: {'lr': 0.0001468893077494825, 'samples': 18437568, 'steps': 96028, 'loss/train': 1.4198342561721802} 11/07/2021 10:44:10 - INFO - __main__ - Step 96030: {'lr': 0.00014688447341610096, 'samples': 18437760, 'steps': 96029, 'loss/train': 1.7828831672668457} 11/07/2021 10:44:10 - INFO - __main__ - Step 96031: {'lr': 0.00014687963912918161, 'samples': 18437952, 'steps': 96030, 'loss/train': 1.117138147354126} 11/07/2021 10:44:11 - INFO - __main__ - Step 96032: {'lr': 0.00014687480488872673, 'samples': 18438144, 'steps': 96031, 'loss/train': 1.6291364431381226} 11/07/2021 10:44:11 - INFO - __main__ - Step 96033: {'lr': 0.00014686997069473848, 'samples': 18438336, 'steps': 96032, 'loss/train': 1.5001444816589355} 11/07/2021 10:44:12 - INFO - __main__ - Step 96034: {'lr': 0.00014686513654721902, 'samples': 18438528, 'steps': 96033, 'loss/train': 1.9159375429153442} 11/07/2021 10:44:12 - INFO - __main__ - Step 96035: {'lr': 0.00014686030244617055, 'samples': 18438720, 'steps': 96034, 'loss/train': 1.2087057828903198} 11/07/2021 10:44:12 - INFO - __main__ - Step 96036: {'lr': 0.0001468554683915953, 'samples': 18438912, 'steps': 96035, 'loss/train': 1.702597737312317} 11/07/2021 10:44:13 - INFO - __main__ - Step 96037: {'lr': 0.0001468506343834953, 'samples': 18439104, 'steps': 96036, 'loss/train': 1.5722757577896118} 11/07/2021 10:44:14 - INFO - __main__ - Step 96038: {'lr': 0.00014684580042187285, 'samples': 18439296, 'steps': 96037, 'loss/train': 1.1262826919555664} 11/07/2021 10:44:14 - INFO - __main__ - Step 96039: {'lr': 0.00014684096650673006, 'samples': 18439488, 'steps': 96038, 'loss/train': 1.2703351974487305} 11/07/2021 10:44:14 - INFO - __main__ - Step 96040: {'lr': 0.00014683613263806914, 'samples': 18439680, 'steps': 96039, 'loss/train': 0.69590824842453} 11/07/2021 10:44:15 - INFO - __main__ - Step 96041: {'lr': 0.00014683129881589232, 'samples': 18439872, 'steps': 96040, 'loss/train': 1.2319341897964478} 11/07/2021 10:44:15 - INFO - __main__ - Step 96042: {'lr': 0.0001468264650402017, 'samples': 18440064, 'steps': 96041, 'loss/train': 1.6718721389770508} 11/07/2021 10:44:16 - INFO - __main__ - Step 96043: {'lr': 0.00014682163131099946, 'samples': 18440256, 'steps': 96042, 'loss/train': 1.1170843839645386} 11/07/2021 10:44:17 - INFO - __main__ - Step 96044: {'lr': 0.00014681679762828777, 'samples': 18440448, 'steps': 96043, 'loss/train': 1.395769715309143} 11/07/2021 10:44:17 - INFO - __main__ - Step 96045: {'lr': 0.0001468119639920689, 'samples': 18440640, 'steps': 96044, 'loss/train': 1.5998897552490234} 11/07/2021 10:44:17 - INFO - __main__ - Step 96046: {'lr': 0.00014680713040234495, 'samples': 18440832, 'steps': 96045, 'loss/train': 1.371938943862915} 11/07/2021 10:44:18 - INFO - __main__ - Step 96047: {'lr': 0.00014680229685911812, 'samples': 18441024, 'steps': 96046, 'loss/train': 1.461309790611267} 11/07/2021 10:44:19 - INFO - __main__ - Step 96048: {'lr': 0.00014679746336239058, 'samples': 18441216, 'steps': 96047, 'loss/train': 1.3720871210098267} 11/07/2021 10:44:19 - INFO - __main__ - Step 96049: {'lr': 0.0001467926299121645, 'samples': 18441408, 'steps': 96048, 'loss/train': 0.9401131272315979} 11/07/2021 10:44:19 - INFO - __main__ - Step 96050: {'lr': 0.00014678779650844205, 'samples': 18441600, 'steps': 96049, 'loss/train': 0.8088008761405945} 11/07/2021 10:44:20 - INFO - __main__ - Step 96051: {'lr': 0.00014678296315122545, 'samples': 18441792, 'steps': 96050, 'loss/train': 1.1939623355865479} 11/07/2021 10:44:20 - INFO - __main__ - Step 96052: {'lr': 0.00014677812984051683, 'samples': 18441984, 'steps': 96051, 'loss/train': 1.1929386854171753} 11/07/2021 10:44:20 - INFO - __main__ - Step 96053: {'lr': 0.0001467732965763184, 'samples': 18442176, 'steps': 96052, 'loss/train': 1.1744439601898193} 11/07/2021 10:44:22 - INFO - __main__ - Step 96054: {'lr': 0.00014676846335863242, 'samples': 18442368, 'steps': 96053, 'loss/train': 0.6500841379165649} 11/07/2021 10:44:22 - INFO - __main__ - Step 96055: {'lr': 0.00014676363018746087, 'samples': 18442560, 'steps': 96054, 'loss/train': 1.290664792060852} 11/07/2021 10:44:22 - INFO - __main__ - Step 96056: {'lr': 0.00014675879706280606, 'samples': 18442752, 'steps': 96055, 'loss/train': 2.579953193664551} 11/07/2021 10:44:23 - INFO - __main__ - Step 96057: {'lr': 0.00014675396398467015, 'samples': 18442944, 'steps': 96056, 'loss/train': 1.292563796043396} 11/07/2021 10:44:23 - INFO - __main__ - Step 96058: {'lr': 0.00014674913095305537, 'samples': 18443136, 'steps': 96057, 'loss/train': 1.2305554151535034} 11/07/2021 10:44:24 - INFO - __main__ - Step 96059: {'lr': 0.00014674429796796373, 'samples': 18443328, 'steps': 96058, 'loss/train': 1.2651270627975464} 11/07/2021 10:44:24 - INFO - __main__ - Step 96060: {'lr': 0.00014673946502939756, 'samples': 18443520, 'steps': 96059, 'loss/train': 1.0760493278503418} 11/07/2021 10:44:25 - INFO - __main__ - Step 96061: {'lr': 0.00014673463213735899, 'samples': 18443712, 'steps': 96060, 'loss/train': 1.613088846206665} 11/07/2021 10:44:25 - INFO - __main__ - Step 96062: {'lr': 0.00014672979929185022, 'samples': 18443904, 'steps': 96061, 'loss/train': 1.6453698873519897} 11/07/2021 10:44:25 - INFO - __main__ - Step 96063: {'lr': 0.00014672496649287338, 'samples': 18444096, 'steps': 96062, 'loss/train': 1.5029277801513672} 11/07/2021 10:44:26 - INFO - __main__ - Step 96064: {'lr': 0.00014672013374043068, 'samples': 18444288, 'steps': 96063, 'loss/train': 1.5296040773391724} 11/07/2021 10:44:27 - INFO - __main__ - Step 96065: {'lr': 0.0001467153010345243, 'samples': 18444480, 'steps': 96064, 'loss/train': 1.3234649896621704} 11/07/2021 10:44:27 - INFO - __main__ - Step 96066: {'lr': 0.00014671046837515646, 'samples': 18444672, 'steps': 96065, 'loss/train': 0.8714377284049988} 11/07/2021 10:44:27 - INFO - __main__ - Step 96067: {'lr': 0.00014670563576232921, 'samples': 18444864, 'steps': 96066, 'loss/train': 1.1386008262634277} 11/07/2021 10:44:28 - INFO - __main__ - Step 96068: {'lr': 0.0001467008031960449, 'samples': 18445056, 'steps': 96067, 'loss/train': 1.6341038942337036} 11/07/2021 10:44:29 - INFO - __main__ - Step 96069: {'lr': 0.00014669597067630557, 'samples': 18445248, 'steps': 96068, 'loss/train': 1.8325108289718628} 11/07/2021 10:44:29 - INFO - __main__ - Step 96070: {'lr': 0.00014669113820311343, 'samples': 18445440, 'steps': 96069, 'loss/train': 1.4754091501235962} 11/07/2021 10:44:30 - INFO - __main__ - Step 96071: {'lr': 0.0001466863057764707, 'samples': 18445632, 'steps': 96070, 'loss/train': 1.1898099184036255} 11/07/2021 10:44:30 - INFO - __main__ - Step 96072: {'lr': 0.00014668147339637946, 'samples': 18445824, 'steps': 96071, 'loss/train': 1.359263300895691} 11/07/2021 10:44:30 - INFO - __main__ - Step 96073: {'lr': 0.00014667664106284201, 'samples': 18446016, 'steps': 96072, 'loss/train': 1.5473493337631226} 11/07/2021 10:44:31 - INFO - __main__ - Step 96074: {'lr': 0.00014667180877586043, 'samples': 18446208, 'steps': 96073, 'loss/train': 0.5613579154014587} 11/07/2021 10:44:32 - INFO - __main__ - Step 96075: {'lr': 0.00014666697653543693, 'samples': 18446400, 'steps': 96074, 'loss/train': 1.280398964881897} 11/07/2021 10:44:32 - INFO - __main__ - Step 96076: {'lr': 0.00014666214434157373, 'samples': 18446592, 'steps': 96075, 'loss/train': 0.35029274225234985} 11/07/2021 10:44:32 - INFO - __main__ - Step 96077: {'lr': 0.00014665731219427297, 'samples': 18446784, 'steps': 96076, 'loss/train': 0.7246399521827698} 11/07/2021 10:44:33 - INFO - __main__ - Step 96078: {'lr': 0.00014665248009353683, 'samples': 18446976, 'steps': 96077, 'loss/train': 1.5462955236434937} 11/07/2021 10:44:33 - INFO - __main__ - Step 96079: {'lr': 0.00014664764803936747, 'samples': 18447168, 'steps': 96078, 'loss/train': 1.2547533512115479} 11/07/2021 10:44:34 - INFO - __main__ - Step 96080: {'lr': 0.0001466428160317671, 'samples': 18447360, 'steps': 96079, 'loss/train': 1.3244996070861816} 11/07/2021 10:44:35 - INFO - __main__ - Step 96081: {'lr': 0.00014663798407073798, 'samples': 18447552, 'steps': 96080, 'loss/train': 1.2214717864990234} 11/07/2021 10:44:35 - INFO - __main__ - Step 96082: {'lr': 0.00014663315215628208, 'samples': 18447744, 'steps': 96081, 'loss/train': 1.4481284618377686} 11/07/2021 10:44:35 - INFO - __main__ - Step 96083: {'lr': 0.00014662832028840167, 'samples': 18447936, 'steps': 96082, 'loss/train': 1.0855945348739624} 11/07/2021 10:44:36 - INFO - __main__ - Step 96084: {'lr': 0.00014662348846709899, 'samples': 18448128, 'steps': 96083, 'loss/train': 1.2995190620422363} 11/07/2021 10:44:37 - INFO - __main__ - Step 96085: {'lr': 0.00014661865669237615, 'samples': 18448320, 'steps': 96084, 'loss/train': 1.584801197052002} 11/07/2021 10:44:37 - INFO - __main__ - Step 96086: {'lr': 0.00014661382496423533, 'samples': 18448512, 'steps': 96085, 'loss/train': 1.2558339834213257} 11/07/2021 10:44:37 - INFO - __main__ - Step 96087: {'lr': 0.00014660899328267874, 'samples': 18448704, 'steps': 96086, 'loss/train': 1.0705841779708862} 11/07/2021 10:44:38 - INFO - __main__ - Step 96088: {'lr': 0.00014660416164770856, 'samples': 18448896, 'steps': 96087, 'loss/train': 1.0132062435150146} 11/07/2021 10:44:38 - INFO - __main__ - Step 96089: {'lr': 0.0001465993300593269, 'samples': 18449088, 'steps': 96088, 'loss/train': 0.9611067771911621} 11/07/2021 10:44:39 - INFO - __main__ - Step 96090: {'lr': 0.00014659449851753603, 'samples': 18449280, 'steps': 96089, 'loss/train': 1.3632612228393555} 11/07/2021 10:44:39 - INFO - __main__ - Step 96091: {'lr': 0.00014658966702233808, 'samples': 18449472, 'steps': 96090, 'loss/train': 1.4681605100631714} 11/07/2021 10:44:40 - INFO - __main__ - Step 96092: {'lr': 0.00014658483557373523, 'samples': 18449664, 'steps': 96091, 'loss/train': 1.3337568044662476} 11/07/2021 10:44:40 - INFO - __main__ - Step 96093: {'lr': 0.00014658000417172964, 'samples': 18449856, 'steps': 96092, 'loss/train': 1.1138471364974976} 11/07/2021 10:44:40 - INFO - __main__ - Step 96094: {'lr': 0.0001465751728163235, 'samples': 18450048, 'steps': 96093, 'loss/train': 1.360614538192749} 11/07/2021 10:44:41 - INFO - __main__ - Step 96095: {'lr': 0.00014657034150751912, 'samples': 18450240, 'steps': 96094, 'loss/train': 4.295828819274902} 11/07/2021 10:44:42 - INFO - __main__ - Step 96096: {'lr': 0.00014656551024531844, 'samples': 18450432, 'steps': 96095, 'loss/train': 1.5813242197036743} 11/07/2021 10:44:42 - INFO - __main__ - Step 96097: {'lr': 0.00014656067902972376, 'samples': 18450624, 'steps': 96096, 'loss/train': 1.1703635454177856} 11/07/2021 10:44:43 - INFO - __main__ - Step 96098: {'lr': 0.0001465558478607372, 'samples': 18450816, 'steps': 96097, 'loss/train': 1.2374827861785889} 11/07/2021 10:44:43 - INFO - __main__ - Step 96099: {'lr': 0.000146551016738361, 'samples': 18451008, 'steps': 96098, 'loss/train': 1.3882862329483032} 11/07/2021 10:44:44 - INFO - __main__ - Step 96100: {'lr': 0.0001465461856625973, 'samples': 18451200, 'steps': 96099, 'loss/train': 0.8549119234085083} 11/07/2021 10:44:44 - INFO - __main__ - Step 96101: {'lr': 0.00014654135463344832, 'samples': 18451392, 'steps': 96100, 'loss/train': 0.29479101300239563} 11/07/2021 10:44:45 - INFO - __main__ - Step 96102: {'lr': 0.00014653652365091618, 'samples': 18451584, 'steps': 96101, 'loss/train': 1.4187546968460083} 11/07/2021 10:44:45 - INFO - __main__ - Step 96103: {'lr': 0.0001465316927150031, 'samples': 18451776, 'steps': 96102, 'loss/train': 1.551200032234192} 11/07/2021 10:44:45 - INFO - __main__ - Step 96104: {'lr': 0.00014652686182571126, 'samples': 18451968, 'steps': 96103, 'loss/train': 1.338752269744873} 11/07/2021 10:44:46 - INFO - __main__ - Step 96105: {'lr': 0.0001465220309830428, 'samples': 18452160, 'steps': 96104, 'loss/train': 1.3857769966125488} 11/07/2021 10:44:47 - INFO - __main__ - Step 96106: {'lr': 0.00014651720018699993, 'samples': 18452352, 'steps': 96105, 'loss/train': 1.4969936609268188} 11/07/2021 10:44:47 - INFO - __main__ - Step 96107: {'lr': 0.00014651236943758478, 'samples': 18452544, 'steps': 96106, 'loss/train': 1.5780600309371948} 11/07/2021 10:44:47 - INFO - __main__ - Step 96108: {'lr': 0.00014650753873479968, 'samples': 18452736, 'steps': 96107, 'loss/train': 1.8395835161209106} 11/07/2021 10:44:48 - INFO - __main__ - Step 96109: {'lr': 0.0001465027080786466, 'samples': 18452928, 'steps': 96108, 'loss/train': 1.6895228624343872} 11/07/2021 10:44:48 - INFO - __main__ - Step 96110: {'lr': 0.00014649787746912778, 'samples': 18453120, 'steps': 96109, 'loss/train': 1.5766735076904297} 11/07/2021 10:44:49 - INFO - __main__ - Step 96111: {'lr': 0.00014649304690624544, 'samples': 18453312, 'steps': 96110, 'loss/train': 2.082451105117798} 11/07/2021 10:44:49 - INFO - __main__ - Step 96112: {'lr': 0.00014648821639000174, 'samples': 18453504, 'steps': 96111, 'loss/train': 3.5027668476104736} 11/07/2021 10:44:50 - INFO - __main__ - Step 96113: {'lr': 0.00014648338592039884, 'samples': 18453696, 'steps': 96112, 'loss/train': 1.0986295938491821} 11/07/2021 10:44:50 - INFO - __main__ - Step 96114: {'lr': 0.00014647855549743892, 'samples': 18453888, 'steps': 96113, 'loss/train': 1.166048526763916} 11/07/2021 10:44:51 - INFO - __main__ - Step 96115: {'lr': 0.00014647372512112416, 'samples': 18454080, 'steps': 96114, 'loss/train': 1.6154382228851318} 11/07/2021 10:44:52 - INFO - __main__ - Step 96116: {'lr': 0.00014646889479145674, 'samples': 18454272, 'steps': 96115, 'loss/train': 1.3849740028381348} 11/07/2021 10:44:52 - INFO - __main__ - Step 96117: {'lr': 0.00014646406450843886, 'samples': 18454464, 'steps': 96116, 'loss/train': 0.6957922577857971} 11/07/2021 10:44:52 - INFO - __main__ - Step 96118: {'lr': 0.0001464592342720727, 'samples': 18454656, 'steps': 96117, 'loss/train': 0.9756510257720947} 11/07/2021 10:44:53 - INFO - __main__ - Step 96119: {'lr': 0.00014645440408236036, 'samples': 18454848, 'steps': 96118, 'loss/train': 1.4549554586410522} 11/07/2021 10:44:53 - INFO - __main__ - Step 96120: {'lr': 0.0001464495739393041, 'samples': 18455040, 'steps': 96119, 'loss/train': 1.4991737604141235} 11/07/2021 10:44:53 - INFO - __main__ - Step 96121: {'lr': 0.00014644474384290605, 'samples': 18455232, 'steps': 96120, 'loss/train': 1.4685903787612915} 11/07/2021 10:44:54 - INFO - __main__ - Step 96122: {'lr': 0.0001464399137931685, 'samples': 18455424, 'steps': 96121, 'loss/train': 1.4153261184692383} 11/07/2021 10:44:55 - INFO - __main__ - Step 96123: {'lr': 0.0001464350837900934, 'samples': 18455616, 'steps': 96122, 'loss/train': 1.1118842363357544} 11/07/2021 10:44:55 - INFO - __main__ - Step 96124: {'lr': 0.00014643025383368307, 'samples': 18455808, 'steps': 96123, 'loss/train': 1.2489529848098755} 11/07/2021 10:44:56 - INFO - __main__ - Step 96125: {'lr': 0.0001464254239239397, 'samples': 18456000, 'steps': 96124, 'loss/train': 1.3732894659042358} 11/07/2021 10:44:56 - INFO - __main__ - Step 96126: {'lr': 0.00014642059406086544, 'samples': 18456192, 'steps': 96125, 'loss/train': 2.0584394931793213} 11/07/2021 10:44:57 - INFO - __main__ - Step 96127: {'lr': 0.00014641576424446242, 'samples': 18456384, 'steps': 96126, 'loss/train': 1.4316753149032593} 11/07/2021 10:44:57 - INFO - __main__ - Step 96128: {'lr': 0.00014641093447473287, 'samples': 18456576, 'steps': 96127, 'loss/train': 1.3540596961975098} 11/07/2021 10:44:58 - INFO - __main__ - Step 96129: {'lr': 0.00014640610475167898, 'samples': 18456768, 'steps': 96128, 'loss/train': 1.6857566833496094} 11/07/2021 10:44:58 - INFO - __main__ - Step 96130: {'lr': 0.00014640127507530286, 'samples': 18456960, 'steps': 96129, 'loss/train': 1.4730006456375122} 11/07/2021 10:44:58 - INFO - __main__ - Step 96131: {'lr': 0.00014639644544560675, 'samples': 18457152, 'steps': 96130, 'loss/train': 1.5937609672546387} 11/07/2021 10:44:59 - INFO - __main__ - Step 96132: {'lr': 0.0001463916158625928, 'samples': 18457344, 'steps': 96131, 'loss/train': 1.2378544807434082} 11/07/2021 10:45:00 - INFO - __main__ - Step 96133: {'lr': 0.0001463867863262632, 'samples': 18457536, 'steps': 96132, 'loss/train': 1.3948254585266113} 11/07/2021 10:45:00 - INFO - __main__ - Step 96134: {'lr': 0.0001463819568366201, 'samples': 18457728, 'steps': 96133, 'loss/train': 1.1067907810211182} 11/07/2021 10:45:01 - INFO - __main__ - Step 96135: {'lr': 0.00014637712739366582, 'samples': 18457920, 'steps': 96134, 'loss/train': 1.9625886678695679} 11/07/2021 10:45:01 - INFO - __main__ - Step 96136: {'lr': 0.00014637229799740225, 'samples': 18458112, 'steps': 96135, 'loss/train': 0.9263071417808533} 11/07/2021 10:45:02 - INFO - __main__ - Step 96137: {'lr': 0.00014636746864783178, 'samples': 18458304, 'steps': 96136, 'loss/train': 1.5603104829788208} 11/07/2021 10:45:02 - INFO - __main__ - Step 96138: {'lr': 0.00014636263934495654, 'samples': 18458496, 'steps': 96137, 'loss/train': 1.4180514812469482} 11/07/2021 10:45:03 - INFO - __main__ - Step 96139: {'lr': 0.00014635781008877862, 'samples': 18458688, 'steps': 96138, 'loss/train': 1.401164174079895} 11/07/2021 10:45:03 - INFO - __main__ - Step 96140: {'lr': 0.00014635298087930032, 'samples': 18458880, 'steps': 96139, 'loss/train': 1.0975141525268555} 11/07/2021 10:45:03 - INFO - __main__ - Step 96141: {'lr': 0.00014634815171652376, 'samples': 18459072, 'steps': 96140, 'loss/train': 1.4611048698425293} 11/07/2021 10:45:04 - INFO - __main__ - Step 96142: {'lr': 0.00014634332260045113, 'samples': 18459264, 'steps': 96141, 'loss/train': 1.9128990173339844} 11/07/2021 10:45:05 - INFO - __main__ - Step 96143: {'lr': 0.00014633849353108458, 'samples': 18459456, 'steps': 96142, 'loss/train': 1.511799931526184} 11/07/2021 10:45:05 - INFO - __main__ - Step 96144: {'lr': 0.00014633366450842632, 'samples': 18459648, 'steps': 96143, 'loss/train': 1.5506463050842285} 11/07/2021 10:45:05 - INFO - __main__ - Step 96145: {'lr': 0.00014632883553247853, 'samples': 18459840, 'steps': 96144, 'loss/train': 1.6013127565383911} 11/07/2021 10:45:06 - INFO - __main__ - Step 96146: {'lr': 0.00014632400660324335, 'samples': 18460032, 'steps': 96145, 'loss/train': 1.227543830871582} 11/07/2021 10:45:06 - INFO - __main__ - Step 96147: {'lr': 0.00014631917772072296, 'samples': 18460224, 'steps': 96146, 'loss/train': 1.3989486694335938} 11/07/2021 10:45:07 - INFO - __main__ - Step 96148: {'lr': 0.0001463143488849197, 'samples': 18460416, 'steps': 96147, 'loss/train': 0.9717071056365967} 11/07/2021 10:45:08 - INFO - __main__ - Step 96149: {'lr': 0.00014630952009583542, 'samples': 18460608, 'steps': 96148, 'loss/train': 1.4258407354354858} 11/07/2021 10:45:08 - INFO - __main__ - Step 96150: {'lr': 0.00014630469135347253, 'samples': 18460800, 'steps': 96149, 'loss/train': 1.267545461654663} 11/07/2021 10:45:08 - INFO - __main__ - Step 96151: {'lr': 0.0001462998626578331, 'samples': 18460992, 'steps': 96150, 'loss/train': 1.222267508506775} 11/07/2021 10:45:09 - INFO - __main__ - Step 96152: {'lr': 0.00014629503400891936, 'samples': 18461184, 'steps': 96151, 'loss/train': 1.4621576070785522} 11/07/2021 10:45:10 - INFO - __main__ - Step 96153: {'lr': 0.0001462902054067335, 'samples': 18461376, 'steps': 96152, 'loss/train': 1.1776609420776367} 11/07/2021 10:45:10 - INFO - __main__ - Step 96154: {'lr': 0.00014628537685127765, 'samples': 18461568, 'steps': 96153, 'loss/train': 0.7019922733306885} 11/07/2021 10:45:10 - INFO - __main__ - Step 96155: {'lr': 0.00014628054834255402, 'samples': 18461760, 'steps': 96154, 'loss/train': 1.0730739831924438} 11/07/2021 10:45:11 - INFO - __main__ - Step 96156: {'lr': 0.0001462757198805648, 'samples': 18461952, 'steps': 96155, 'loss/train': 1.4500706195831299} 11/07/2021 10:45:11 - INFO - __main__ - Step 96157: {'lr': 0.00014627089146531207, 'samples': 18462144, 'steps': 96156, 'loss/train': 1.7557920217514038} 11/07/2021 10:45:12 - INFO - __main__ - Step 96158: {'lr': 0.00014626606309679812, 'samples': 18462336, 'steps': 96157, 'loss/train': 1.6520253419876099} 11/07/2021 10:45:12 - INFO - __main__ - Step 96159: {'lr': 0.00014626123477502517, 'samples': 18462528, 'steps': 96158, 'loss/train': 1.2314369678497314} 11/07/2021 10:45:13 - INFO - __main__ - Step 96160: {'lr': 0.00014625640649999522, 'samples': 18462720, 'steps': 96159, 'loss/train': 1.7084717750549316} 11/07/2021 10:45:13 - INFO - __main__ - Step 96161: {'lr': 0.00014625157827171054, 'samples': 18462912, 'steps': 96160, 'loss/train': 1.5194412469863892} 11/07/2021 10:45:13 - INFO - __main__ - Step 96162: {'lr': 0.00014624675009017332, 'samples': 18463104, 'steps': 96161, 'loss/train': 0.9915881752967834} 11/07/2021 10:45:14 - INFO - __main__ - Step 96163: {'lr': 0.00014624192195538568, 'samples': 18463296, 'steps': 96162, 'loss/train': 1.7823777198791504} 11/07/2021 10:45:15 - INFO - __main__ - Step 96164: {'lr': 0.00014623709386734984, 'samples': 18463488, 'steps': 96163, 'loss/train': 0.8171694874763489} 11/07/2021 10:45:15 - INFO - __main__ - Step 96165: {'lr': 0.00014623226582606796, 'samples': 18463680, 'steps': 96164, 'loss/train': 1.2962608337402344} 11/07/2021 10:45:16 - INFO - __main__ - Step 96166: {'lr': 0.0001462274378315422, 'samples': 18463872, 'steps': 96165, 'loss/train': 1.6049919128417969} 11/07/2021 10:45:16 - INFO - __main__ - Step 96167: {'lr': 0.00014622260988377477, 'samples': 18464064, 'steps': 96166, 'loss/train': 1.5361799001693726} 11/07/2021 10:45:17 - INFO - __main__ - Step 96168: {'lr': 0.00014621778198276787, 'samples': 18464256, 'steps': 96167, 'loss/train': 1.916053056716919} 11/07/2021 10:45:17 - INFO - __main__ - Step 96169: {'lr': 0.0001462129541285236, 'samples': 18464448, 'steps': 96168, 'loss/train': 1.3618402481079102} 11/07/2021 10:45:18 - INFO - __main__ - Step 96170: {'lr': 0.0001462081263210442, 'samples': 18464640, 'steps': 96169, 'loss/train': 1.5447665452957153} 11/07/2021 10:45:18 - INFO - __main__ - Step 96171: {'lr': 0.00014620329856033175, 'samples': 18464832, 'steps': 96170, 'loss/train': 1.5491082668304443} 11/07/2021 10:45:18 - INFO - __main__ - Step 96172: {'lr': 0.00014619847084638854, 'samples': 18465024, 'steps': 96171, 'loss/train': 0.8806294202804565} 11/07/2021 10:45:19 - INFO - __main__ - Step 96173: {'lr': 0.00014619364317921667, 'samples': 18465216, 'steps': 96172, 'loss/train': 1.5469642877578735} 11/07/2021 10:45:20 - INFO - __main__ - Step 96174: {'lr': 0.00014618881555881837, 'samples': 18465408, 'steps': 96173, 'loss/train': 1.3445911407470703} 11/07/2021 10:45:20 - INFO - __main__ - Step 96175: {'lr': 0.00014618398798519583, 'samples': 18465600, 'steps': 96174, 'loss/train': 1.2079211473464966} 11/07/2021 10:45:21 - INFO - __main__ - Step 96176: {'lr': 0.00014617916045835114, 'samples': 18465792, 'steps': 96175, 'loss/train': 1.4924324750900269} 11/07/2021 10:45:21 - INFO - __main__ - Step 96177: {'lr': 0.0001461743329782865, 'samples': 18465984, 'steps': 96176, 'loss/train': 1.5767723321914673} 11/07/2021 10:45:21 - INFO - __main__ - Step 96178: {'lr': 0.00014616950554500414, 'samples': 18466176, 'steps': 96177, 'loss/train': 1.5329711437225342} 11/07/2021 10:45:22 - INFO - __main__ - Step 96179: {'lr': 0.00014616467815850614, 'samples': 18466368, 'steps': 96178, 'loss/train': 1.3521145582199097} 11/07/2021 10:45:23 - INFO - __main__ - Step 96180: {'lr': 0.00014615985081879477, 'samples': 18466560, 'steps': 96179, 'loss/train': 1.2326712608337402} 11/07/2021 10:45:23 - INFO - __main__ - Step 96181: {'lr': 0.0001461550235258722, 'samples': 18466752, 'steps': 96180, 'loss/train': 1.467231035232544} 11/07/2021 10:45:23 - INFO - __main__ - Step 96182: {'lr': 0.00014615019627974054, 'samples': 18466944, 'steps': 96181, 'loss/train': 1.8655766248703003} 11/07/2021 10:45:24 - INFO - __main__ - Step 96183: {'lr': 0.000146145369080402, 'samples': 18467136, 'steps': 96182, 'loss/train': 1.543182134628296} 11/07/2021 10:45:25 - INFO - __main__ - Step 96184: {'lr': 0.00014614054192785874, 'samples': 18467328, 'steps': 96183, 'loss/train': 1.329659104347229} 11/07/2021 10:45:25 - INFO - __main__ - Step 96185: {'lr': 0.00014613571482211297, 'samples': 18467520, 'steps': 96184, 'loss/train': 1.480072259902954} 11/07/2021 10:45:25 - INFO - __main__ - Step 96186: {'lr': 0.00014613088776316684, 'samples': 18467712, 'steps': 96185, 'loss/train': 1.2493102550506592} 11/07/2021 10:45:26 - INFO - __main__ - Step 96187: {'lr': 0.00014612606075102252, 'samples': 18467904, 'steps': 96186, 'loss/train': 1.1208040714263916} 11/07/2021 10:45:26 - INFO - __main__ - Step 96188: {'lr': 0.00014612123378568217, 'samples': 18468096, 'steps': 96187, 'loss/train': 0.9221217632293701} 11/07/2021 10:45:27 - INFO - __main__ - Step 96189: {'lr': 0.00014611640686714805, 'samples': 18468288, 'steps': 96188, 'loss/train': 1.5159716606140137} 11/07/2021 10:45:27 - INFO - __main__ - Step 96190: {'lr': 0.00014611157999542228, 'samples': 18468480, 'steps': 96189, 'loss/train': 1.3497675657272339} 11/07/2021 10:45:28 - INFO - __main__ - Step 96191: {'lr': 0.000146106753170507, 'samples': 18468672, 'steps': 96190, 'loss/train': 1.2850171327590942} 11/07/2021 10:45:28 - INFO - __main__ - Step 96192: {'lr': 0.00014610192639240443, 'samples': 18468864, 'steps': 96191, 'loss/train': 1.9913465976715088} 11/07/2021 10:45:28 - INFO - __main__ - Step 96193: {'lr': 0.00014609709966111666, 'samples': 18469056, 'steps': 96192, 'loss/train': 1.3487275838851929} 11/07/2021 10:45:29 - INFO - __main__ - Step 96194: {'lr': 0.00014609227297664602, 'samples': 18469248, 'steps': 96193, 'loss/train': 1.5470080375671387} 11/07/2021 10:45:30 - INFO - __main__ - Step 96195: {'lr': 0.00014608744633899453, 'samples': 18469440, 'steps': 96194, 'loss/train': 1.4665364027023315} 11/07/2021 10:45:30 - INFO - __main__ - Step 96196: {'lr': 0.00014608261974816445, 'samples': 18469632, 'steps': 96195, 'loss/train': 1.3500529527664185} 11/07/2021 10:45:30 - INFO - __main__ - Step 96197: {'lr': 0.00014607779320415795, 'samples': 18469824, 'steps': 96196, 'loss/train': 2.1246068477630615} 11/07/2021 10:45:31 - INFO - __main__ - Step 96198: {'lr': 0.00014607296670697718, 'samples': 18470016, 'steps': 96197, 'loss/train': 1.3789610862731934} 11/07/2021 10:45:31 - INFO - __main__ - Step 96199: {'lr': 0.00014606814025662436, 'samples': 18470208, 'steps': 96198, 'loss/train': 1.3654154539108276} 11/07/2021 10:45:32 - INFO - __main__ - Step 96200: {'lr': 0.0001460633138531016, 'samples': 18470400, 'steps': 96199, 'loss/train': 2.076655149459839} 11/07/2021 10:45:32 - INFO - __main__ - Step 96201: {'lr': 0.0001460584874964111, 'samples': 18470592, 'steps': 96200, 'loss/train': 1.286881446838379} 11/07/2021 10:45:33 - INFO - __main__ - Step 96202: {'lr': 0.0001460536611865551, 'samples': 18470784, 'steps': 96201, 'loss/train': 1.5288076400756836} 11/07/2021 10:45:33 - INFO - __main__ - Step 96203: {'lr': 0.0001460488349235357, 'samples': 18470976, 'steps': 96202, 'loss/train': 1.2764419317245483} 11/07/2021 10:45:34 - INFO - __main__ - Step 96204: {'lr': 0.00014604400870735508, 'samples': 18471168, 'steps': 96203, 'loss/train': 1.3404667377471924} 11/07/2021 10:45:35 - INFO - __main__ - Step 96205: {'lr': 0.0001460391825380154, 'samples': 18471360, 'steps': 96204, 'loss/train': 1.5187225341796875} 11/07/2021 10:45:35 - INFO - __main__ - Step 96206: {'lr': 0.0001460343564155189, 'samples': 18471552, 'steps': 96205, 'loss/train': 0.8509948253631592} 11/07/2021 10:45:35 - INFO - __main__ - Step 96207: {'lr': 0.00014602953033986766, 'samples': 18471744, 'steps': 96206, 'loss/train': 1.3960094451904297} 11/07/2021 10:45:36 - INFO - __main__ - Step 96208: {'lr': 0.00014602470431106392, 'samples': 18471936, 'steps': 96207, 'loss/train': 1.2818728685379028} 11/07/2021 10:45:36 - INFO - __main__ - Step 96209: {'lr': 0.00014601987832910988, 'samples': 18472128, 'steps': 96208, 'loss/train': 1.170914649963379} 11/07/2021 10:45:37 - INFO - __main__ - Step 96210: {'lr': 0.00014601505239400763, 'samples': 18472320, 'steps': 96209, 'loss/train': 1.4686211347579956} 11/07/2021 10:45:37 - INFO - __main__ - Step 96211: {'lr': 0.00014601022650575943, 'samples': 18472512, 'steps': 96210, 'loss/train': 1.400855302810669} 11/07/2021 10:45:38 - INFO - __main__ - Step 96212: {'lr': 0.0001460054006643674, 'samples': 18472704, 'steps': 96211, 'loss/train': 1.2870274782180786} 11/07/2021 10:45:38 - INFO - __main__ - Step 96213: {'lr': 0.00014600057486983373, 'samples': 18472896, 'steps': 96212, 'loss/train': 1.6550043821334839} 11/07/2021 10:45:38 - INFO - __main__ - Step 96214: {'lr': 0.00014599574912216063, 'samples': 18473088, 'steps': 96213, 'loss/train': 1.574580192565918} 11/07/2021 10:45:39 - INFO - __main__ - Step 96215: {'lr': 0.00014599092342135018, 'samples': 18473280, 'steps': 96214, 'loss/train': 1.590484619140625} 11/07/2021 10:45:40 - INFO - __main__ - Step 96216: {'lr': 0.00014598609776740474, 'samples': 18473472, 'steps': 96215, 'loss/train': 1.0942540168762207} 11/07/2021 10:45:40 - INFO - __main__ - Step 96217: {'lr': 0.00014598127216032628, 'samples': 18473664, 'steps': 96216, 'loss/train': 1.3826584815979004} 11/07/2021 10:45:40 - INFO - __main__ - Step 96218: {'lr': 0.00014597644660011705, 'samples': 18473856, 'steps': 96217, 'loss/train': 1.551374077796936} 11/07/2021 10:45:41 - INFO - __main__ - Step 96219: {'lr': 0.0001459716210867792, 'samples': 18474048, 'steps': 96218, 'loss/train': 1.3268874883651733} 11/07/2021 10:45:42 - INFO - __main__ - Step 96220: {'lr': 0.00014596679562031494, 'samples': 18474240, 'steps': 96219, 'loss/train': 1.4533721208572388} 11/07/2021 10:45:42 - INFO - __main__ - Step 96221: {'lr': 0.0001459619702007264, 'samples': 18474432, 'steps': 96220, 'loss/train': 1.607069730758667} 11/07/2021 10:45:43 - INFO - __main__ - Step 96222: {'lr': 0.00014595714482801587, 'samples': 18474624, 'steps': 96221, 'loss/train': 1.4847971200942993} 11/07/2021 10:45:43 - INFO - __main__ - Step 96223: {'lr': 0.0001459523195021854, 'samples': 18474816, 'steps': 96222, 'loss/train': 1.400352120399475} 11/07/2021 10:45:43 - INFO - __main__ - Step 96224: {'lr': 0.0001459474942232372, 'samples': 18475008, 'steps': 96223, 'loss/train': 1.2750760316848755} 11/07/2021 10:45:44 - INFO - __main__ - Step 96225: {'lr': 0.00014594266899117347, 'samples': 18475200, 'steps': 96224, 'loss/train': 1.5847312211990356} 11/07/2021 10:45:45 - INFO - __main__ - Step 96226: {'lr': 0.00014593784380599638, 'samples': 18475392, 'steps': 96225, 'loss/train': 1.3702564239501953} 11/07/2021 10:45:45 - INFO - __main__ - Step 96227: {'lr': 0.0001459330186677082, 'samples': 18475584, 'steps': 96226, 'loss/train': 1.274289608001709} 11/07/2021 10:45:45 - INFO - __main__ - Step 96228: {'lr': 0.00014592819357631088, 'samples': 18475776, 'steps': 96227, 'loss/train': 1.864167332649231} 11/07/2021 10:45:46 - INFO - __main__ - Step 96229: {'lr': 0.00014592336853180672, 'samples': 18475968, 'steps': 96228, 'loss/train': 1.5285817384719849} 11/07/2021 10:45:46 - INFO - __main__ - Step 96230: {'lr': 0.00014591854353419786, 'samples': 18476160, 'steps': 96229, 'loss/train': 1.4013758897781372} 11/07/2021 10:45:47 - INFO - __main__ - Step 96231: {'lr': 0.0001459137185834865, 'samples': 18476352, 'steps': 96230, 'loss/train': 1.710094690322876} 11/07/2021 10:45:47 - INFO - __main__ - Step 96232: {'lr': 0.00014590889367967482, 'samples': 18476544, 'steps': 96231, 'loss/train': 1.289496660232544} 11/07/2021 10:45:48 - INFO - __main__ - Step 96233: {'lr': 0.00014590406882276504, 'samples': 18476736, 'steps': 96232, 'loss/train': 0.3990049660205841} 11/07/2021 10:45:48 - INFO - __main__ - Step 96234: {'lr': 0.0001458992440127592, 'samples': 18476928, 'steps': 96233, 'loss/train': 2.0954818725585938} 11/07/2021 10:45:48 - INFO - __main__ - Step 96235: {'lr': 0.00014589441924965958, 'samples': 18477120, 'steps': 96234, 'loss/train': 1.262754201889038} 11/07/2021 10:45:49 - INFO - __main__ - Step 96236: {'lr': 0.00014588959453346834, 'samples': 18477312, 'steps': 96235, 'loss/train': 1.2611480951309204} 11/07/2021 10:45:50 - INFO - __main__ - Step 96237: {'lr': 0.00014588476986418774, 'samples': 18477504, 'steps': 96236, 'loss/train': 1.7053589820861816} 11/07/2021 10:45:50 - INFO - __main__ - Step 96238: {'lr': 0.00014587994524181976, 'samples': 18477696, 'steps': 96237, 'loss/train': 1.2711772918701172} 11/07/2021 10:45:50 - INFO - __main__ - Step 96239: {'lr': 0.00014587512066636666, 'samples': 18477888, 'steps': 96238, 'loss/train': 1.1811187267303467} 11/07/2021 10:45:51 - INFO - __main__ - Step 96240: {'lr': 0.00014587029613783063, 'samples': 18478080, 'steps': 96239, 'loss/train': 1.0859289169311523} 11/07/2021 10:45:52 - INFO - __main__ - Step 96241: {'lr': 0.00014586547165621383, 'samples': 18478272, 'steps': 96240, 'loss/train': 0.8493039608001709} 11/07/2021 10:45:52 - INFO - __main__ - Step 96242: {'lr': 0.00014586064722151842, 'samples': 18478464, 'steps': 96241, 'loss/train': 1.2866559028625488} 11/07/2021 10:45:53 - INFO - __main__ - Step 96243: {'lr': 0.00014585582283374666, 'samples': 18478656, 'steps': 96242, 'loss/train': 1.2185157537460327} 11/07/2021 10:45:53 - INFO - __main__ - Step 96244: {'lr': 0.0001458509984929006, 'samples': 18478848, 'steps': 96243, 'loss/train': 1.3658525943756104} 11/07/2021 10:45:53 - INFO - __main__ - Step 96245: {'lr': 0.0001458461741989825, 'samples': 18479040, 'steps': 96244, 'loss/train': 0.959139347076416} 11/07/2021 10:45:54 - INFO - __main__ - Step 96246: {'lr': 0.0001458413499519945, 'samples': 18479232, 'steps': 96245, 'loss/train': 1.7636423110961914} 11/07/2021 10:45:55 - INFO - __main__ - Step 96247: {'lr': 0.00014583652575193877, 'samples': 18479424, 'steps': 96246, 'loss/train': 1.2452751398086548} 11/07/2021 10:45:55 - INFO - __main__ - Step 96248: {'lr': 0.00014583170159881758, 'samples': 18479616, 'steps': 96247, 'loss/train': 1.4049497842788696} 11/07/2021 10:45:55 - INFO - __main__ - Step 96249: {'lr': 0.00014582687749263297, 'samples': 18479808, 'steps': 96248, 'loss/train': 1.093275547027588} 11/07/2021 10:45:56 - INFO - __main__ - Step 96250: {'lr': 0.00014582205343338712, 'samples': 18480000, 'steps': 96249, 'loss/train': 1.3896758556365967} 11/07/2021 10:45:57 - INFO - __main__ - Step 96251: {'lr': 0.00014581722942108227, 'samples': 18480192, 'steps': 96250, 'loss/train': 1.4556965827941895} 11/07/2021 10:45:57 - INFO - __main__ - Step 96252: {'lr': 0.00014581240545572056, 'samples': 18480384, 'steps': 96251, 'loss/train': 1.8682005405426025} 11/07/2021 10:45:57 - INFO - __main__ - Step 96253: {'lr': 0.00014580758153730417, 'samples': 18480576, 'steps': 96252, 'loss/train': 1.490249752998352} 11/07/2021 10:45:58 - INFO - __main__ - Step 96254: {'lr': 0.0001458027576658353, 'samples': 18480768, 'steps': 96253, 'loss/train': 1.2880525588989258} 11/07/2021 10:45:58 - INFO - __main__ - Step 96255: {'lr': 0.00014579793384131607, 'samples': 18480960, 'steps': 96254, 'loss/train': 1.092581868171692} 11/07/2021 10:45:59 - INFO - __main__ - Step 96256: {'lr': 0.0001457931100637487, 'samples': 18481152, 'steps': 96255, 'loss/train': 1.2616482973098755} 11/07/2021 10:45:59 - INFO - __main__ - Step 96257: {'lr': 0.00014578828633313528, 'samples': 18481344, 'steps': 96256, 'loss/train': 1.2365925312042236} 11/07/2021 10:46:00 - INFO - __main__ - Step 96258: {'lr': 0.0001457834626494781, 'samples': 18481536, 'steps': 96257, 'loss/train': 1.4969325065612793} 11/07/2021 10:46:00 - INFO - __main__ - Step 96259: {'lr': 0.00014577863901277943, 'samples': 18481728, 'steps': 96258, 'loss/train': 1.6208412647247314} 11/07/2021 10:46:00 - INFO - __main__ - Step 96260: {'lr': 0.00014577381542304113, 'samples': 18481920, 'steps': 96259, 'loss/train': 1.6468249559402466} 11/07/2021 10:46:01 - INFO - __main__ - Step 96261: {'lr': 0.0001457689918802656, 'samples': 18482112, 'steps': 96260, 'loss/train': 1.2861837148666382} 11/07/2021 10:46:03 - INFO - __main__ - Step 96262: {'lr': 0.0001457641683844549, 'samples': 18482304, 'steps': 96261, 'loss/train': 1.024200677871704} 11/07/2021 10:46:03 - INFO - __main__ - Step 96263: {'lr': 0.00014575934493561127, 'samples': 18482496, 'steps': 96262, 'loss/train': 1.3029310703277588} 11/07/2021 10:46:03 - INFO - __main__ - Step 96264: {'lr': 0.00014575452153373688, 'samples': 18482688, 'steps': 96263, 'loss/train': 1.5221104621887207} 11/07/2021 10:46:04 - INFO - __main__ - Step 96265: {'lr': 0.0001457496981788339, 'samples': 18482880, 'steps': 96264, 'loss/train': 1.7321319580078125} 11/07/2021 10:46:04 - INFO - __main__ - Step 96266: {'lr': 0.0001457448748709045, 'samples': 18483072, 'steps': 96265, 'loss/train': 0.7282781004905701} 11/07/2021 10:46:04 - INFO - __main__ - Step 96267: {'lr': 0.00014574005160995082, 'samples': 18483264, 'steps': 96266, 'loss/train': 1.7588062286376953} 11/07/2021 10:46:05 - INFO - __main__ - Step 96268: {'lr': 0.0001457352283959751, 'samples': 18483456, 'steps': 96267, 'loss/train': 1.0914443731307983} 11/07/2021 10:46:06 - INFO - __main__ - Step 96269: {'lr': 0.00014573040522897944, 'samples': 18483648, 'steps': 96268, 'loss/train': 1.7936806678771973} 11/07/2021 10:46:06 - INFO - __main__ - Step 96270: {'lr': 0.0001457255821089662, 'samples': 18483840, 'steps': 96269, 'loss/train': 1.4603803157806396} 11/07/2021 10:46:06 - INFO - __main__ - Step 96271: {'lr': 0.00014572075903593727, 'samples': 18484032, 'steps': 96270, 'loss/train': 1.0616533756256104} 11/07/2021 10:46:07 - INFO - __main__ - Step 96272: {'lr': 0.00014571593600989495, 'samples': 18484224, 'steps': 96271, 'loss/train': 1.3733456134796143} 11/07/2021 10:46:08 - INFO - __main__ - Step 96273: {'lr': 0.00014571111303084144, 'samples': 18484416, 'steps': 96272, 'loss/train': 1.5881953239440918} 11/07/2021 10:46:08 - INFO - __main__ - Step 96274: {'lr': 0.0001457062900987789, 'samples': 18484608, 'steps': 96273, 'loss/train': 1.5012409687042236} 11/07/2021 10:46:09 - INFO - __main__ - Step 96275: {'lr': 0.00014570146721370946, 'samples': 18484800, 'steps': 96274, 'loss/train': 1.476564884185791} 11/07/2021 10:46:09 - INFO - __main__ - Step 96276: {'lr': 0.00014569664437563535, 'samples': 18484992, 'steps': 96275, 'loss/train': 5.503561496734619} 11/07/2021 10:46:09 - INFO - __main__ - Step 96277: {'lr': 0.00014569182158455873, 'samples': 18485184, 'steps': 96276, 'loss/train': 1.3215045928955078} 11/07/2021 10:46:10 - INFO - __main__ - Step 96278: {'lr': 0.00014568699884048175, 'samples': 18485376, 'steps': 96277, 'loss/train': 1.3533573150634766} 11/07/2021 10:46:11 - INFO - __main__ - Step 96279: {'lr': 0.00014568217614340662, 'samples': 18485568, 'steps': 96278, 'loss/train': 1.4457372426986694} 11/07/2021 10:46:11 - INFO - __main__ - Step 96280: {'lr': 0.00014567735349333547, 'samples': 18485760, 'steps': 96279, 'loss/train': 1.242208480834961} 11/07/2021 10:46:11 - INFO - __main__ - Step 96281: {'lr': 0.0001456725308902705, 'samples': 18485952, 'steps': 96280, 'loss/train': 1.6392590999603271} 11/07/2021 10:46:12 - INFO - __main__ - Step 96282: {'lr': 0.0001456677083342139, 'samples': 18486144, 'steps': 96281, 'loss/train': 0.871557354927063} 11/07/2021 10:46:12 - INFO - __main__ - Step 96283: {'lr': 0.0001456628858251679, 'samples': 18486336, 'steps': 96282, 'loss/train': 0.6537437438964844} 11/07/2021 10:46:13 - INFO - __main__ - Step 96284: {'lr': 0.00014565806336313446, 'samples': 18486528, 'steps': 96283, 'loss/train': 1.4295285940170288} 11/07/2021 10:46:13 - INFO - __main__ - Step 96285: {'lr': 0.00014565324094811593, 'samples': 18486720, 'steps': 96284, 'loss/train': 0.9140001535415649} 11/07/2021 10:46:14 - INFO - __main__ - Step 96286: {'lr': 0.00014564841858011446, 'samples': 18486912, 'steps': 96285, 'loss/train': 1.5429449081420898} 11/07/2021 10:46:14 - INFO - __main__ - Step 96287: {'lr': 0.00014564359625913217, 'samples': 18487104, 'steps': 96286, 'loss/train': 1.01638925075531} 11/07/2021 10:46:14 - INFO - __main__ - Step 96288: {'lr': 0.00014563877398517127, 'samples': 18487296, 'steps': 96287, 'loss/train': 1.375874400138855} 11/07/2021 10:46:16 - INFO - __main__ - Step 96289: {'lr': 0.00014563395175823393, 'samples': 18487488, 'steps': 96288, 'loss/train': 1.0452994108200073} 11/07/2021 10:46:16 - INFO - __main__ - Step 96290: {'lr': 0.0001456291295783223, 'samples': 18487680, 'steps': 96289, 'loss/train': 1.4549400806427002} 11/07/2021 10:46:16 - INFO - __main__ - Step 96291: {'lr': 0.00014562430744543861, 'samples': 18487872, 'steps': 96290, 'loss/train': 1.0705136060714722} 11/07/2021 10:46:17 - INFO - __main__ - Step 96292: {'lr': 0.00014561948535958498, 'samples': 18488064, 'steps': 96291, 'loss/train': 1.5447627305984497} 11/07/2021 10:46:17 - INFO - __main__ - Step 96293: {'lr': 0.00014561466332076362, 'samples': 18488256, 'steps': 96292, 'loss/train': 1.0625755786895752} 11/07/2021 10:46:17 - INFO - __main__ - Step 96294: {'lr': 0.00014560984132897664, 'samples': 18488448, 'steps': 96293, 'loss/train': 1.322605013847351} 11/07/2021 10:46:18 - INFO - __main__ - Step 96295: {'lr': 0.00014560501938422628, 'samples': 18488640, 'steps': 96294, 'loss/train': 1.4439259767532349} 11/07/2021 10:46:19 - INFO - __main__ - Step 96296: {'lr': 0.00014560019748651476, 'samples': 18488832, 'steps': 96295, 'loss/train': 1.3214402198791504} 11/07/2021 10:46:19 - INFO - __main__ - Step 96297: {'lr': 0.00014559537563584412, 'samples': 18489024, 'steps': 96296, 'loss/train': 1.358160376548767} 11/07/2021 10:46:19 - INFO - __main__ - Step 96298: {'lr': 0.0001455905538322166, 'samples': 18489216, 'steps': 96297, 'loss/train': 1.7116316556930542} 11/07/2021 10:46:20 - INFO - __main__ - Step 96299: {'lr': 0.0001455857320756343, 'samples': 18489408, 'steps': 96298, 'loss/train': 1.2702362537384033} 11/07/2021 10:46:21 - INFO - __main__ - Step 96300: {'lr': 0.0001455809103660995, 'samples': 18489600, 'steps': 96299, 'loss/train': 1.24617600440979} 11/07/2021 10:46:21 - INFO - __main__ - Step 96301: {'lr': 0.00014557608870361432, 'samples': 18489792, 'steps': 96300, 'loss/train': 5.763233184814453} 11/07/2021 10:46:22 - INFO - __main__ - Step 96302: {'lr': 0.00014557126708818096, 'samples': 18489984, 'steps': 96301, 'loss/train': 1.260230541229248} 11/07/2021 10:46:22 - INFO - __main__ - Step 96303: {'lr': 0.00014556644551980157, 'samples': 18490176, 'steps': 96302, 'loss/train': 1.07626211643219} 11/07/2021 10:46:22 - INFO - __main__ - Step 96304: {'lr': 0.00014556162399847832, 'samples': 18490368, 'steps': 96303, 'loss/train': 1.7929421663284302} 11/07/2021 10:46:23 - INFO - __main__ - Step 96305: {'lr': 0.0001455568025242134, 'samples': 18490560, 'steps': 96304, 'loss/train': 1.4680291414260864} 11/07/2021 10:46:24 - INFO - __main__ - Step 96306: {'lr': 0.00014555198109700898, 'samples': 18490752, 'steps': 96305, 'loss/train': 1.463605284690857} 11/07/2021 10:46:24 - INFO - __main__ - Step 96307: {'lr': 0.00014554715971686722, 'samples': 18490944, 'steps': 96306, 'loss/train': 1.3336502313613892} 11/07/2021 10:46:24 - INFO - __main__ - Step 96308: {'lr': 0.00014554233838379028, 'samples': 18491136, 'steps': 96307, 'loss/train': 1.3561302423477173} 11/07/2021 10:46:25 - INFO - __main__ - Step 96309: {'lr': 0.00014553751709778037, 'samples': 18491328, 'steps': 96308, 'loss/train': 1.0871540307998657} 11/07/2021 10:46:25 - INFO - __main__ - Step 96310: {'lr': 0.00014553269585883974, 'samples': 18491520, 'steps': 96309, 'loss/train': 1.1888858079910278} 11/07/2021 10:46:26 - INFO - __main__ - Step 96311: {'lr': 0.00014552787466697037, 'samples': 18491712, 'steps': 96310, 'loss/train': 1.1667556762695312} 11/07/2021 10:46:26 - INFO - __main__ - Step 96312: {'lr': 0.0001455230535221745, 'samples': 18491904, 'steps': 96311, 'loss/train': 1.7218098640441895} 11/07/2021 10:46:27 - INFO - __main__ - Step 96313: {'lr': 0.00014551823242445436, 'samples': 18492096, 'steps': 96312, 'loss/train': 1.2924230098724365} 11/07/2021 10:46:27 - INFO - __main__ - Step 96314: {'lr': 0.00014551341137381208, 'samples': 18492288, 'steps': 96313, 'loss/train': 1.3234103918075562} 11/07/2021 10:46:28 - INFO - __main__ - Step 96315: {'lr': 0.00014550859037024981, 'samples': 18492480, 'steps': 96314, 'loss/train': 1.8108196258544922} 11/07/2021 10:46:28 - INFO - __main__ - Step 96316: {'lr': 0.00014550376941376984, 'samples': 18492672, 'steps': 96315, 'loss/train': 1.155555248260498} 11/07/2021 10:46:29 - INFO - __main__ - Step 96317: {'lr': 0.0001454989485043742, 'samples': 18492864, 'steps': 96316, 'loss/train': 1.5525977611541748} 11/07/2021 10:46:29 - INFO - __main__ - Step 96318: {'lr': 0.00014549412764206513, 'samples': 18493056, 'steps': 96317, 'loss/train': 1.2811777591705322} 11/07/2021 10:46:30 - INFO - __main__ - Step 96319: {'lr': 0.0001454893068268448, 'samples': 18493248, 'steps': 96318, 'loss/train': 1.1436582803726196} 11/07/2021 10:46:30 - INFO - __main__ - Step 96320: {'lr': 0.00014548448605871536, 'samples': 18493440, 'steps': 96319, 'loss/train': 1.3710567951202393} 11/07/2021 10:46:31 - INFO - __main__ - Step 96321: {'lr': 0.00014547966533767903, 'samples': 18493632, 'steps': 96320, 'loss/train': 1.4881950616836548} 11/07/2021 10:46:31 - INFO - __main__ - Step 96322: {'lr': 0.00014547484466373792, 'samples': 18493824, 'steps': 96321, 'loss/train': 1.6404551267623901} 11/07/2021 10:46:32 - INFO - __main__ - Step 96323: {'lr': 0.00014547002403689436, 'samples': 18494016, 'steps': 96322, 'loss/train': 0.901675283908844} 11/07/2021 10:46:32 - INFO - __main__ - Step 96324: {'lr': 0.00014546520345715025, 'samples': 18494208, 'steps': 96323, 'loss/train': 0.3155548572540283} 11/07/2021 10:46:32 - INFO - __main__ - Step 96325: {'lr': 0.00014546038292450792, 'samples': 18494400, 'steps': 96324, 'loss/train': 1.301945447921753} 11/07/2021 10:46:33 - INFO - __main__ - Step 96326: {'lr': 0.00014545556243896957, 'samples': 18494592, 'steps': 96325, 'loss/train': 1.3665109872817993} 11/07/2021 10:46:34 - INFO - __main__ - Step 96327: {'lr': 0.00014545074200053728, 'samples': 18494784, 'steps': 96326, 'loss/train': 1.4446340799331665} 11/07/2021 10:46:34 - INFO - __main__ - Step 96328: {'lr': 0.0001454459216092133, 'samples': 18494976, 'steps': 96327, 'loss/train': 0.6939032077789307} 11/07/2021 10:46:34 - INFO - __main__ - Step 96329: {'lr': 0.00014544110126499975, 'samples': 18495168, 'steps': 96328, 'loss/train': 1.639520525932312} 11/07/2021 10:46:35 - INFO - __main__ - Step 96330: {'lr': 0.00014543628096789886, 'samples': 18495360, 'steps': 96329, 'loss/train': 1.4468039274215698} 11/07/2021 10:46:35 - INFO - __main__ - Step 96331: {'lr': 0.00014543146071791275, 'samples': 18495552, 'steps': 96330, 'loss/train': 1.2839871644973755} 11/07/2021 10:46:36 - INFO - __main__ - Step 96332: {'lr': 0.0001454266405150436, 'samples': 18495744, 'steps': 96331, 'loss/train': 1.4001131057739258} 11/07/2021 10:46:37 - INFO - __main__ - Step 96333: {'lr': 0.00014542182035929364, 'samples': 18495936, 'steps': 96332, 'loss/train': 0.8461894392967224} 11/07/2021 10:46:37 - INFO - __main__ - Step 96334: {'lr': 0.00014541700025066495, 'samples': 18496128, 'steps': 96333, 'loss/train': 2.5641181468963623} 11/07/2021 10:46:37 - INFO - __main__ - Step 96335: {'lr': 0.00014541218018915975, 'samples': 18496320, 'steps': 96334, 'loss/train': 1.3664586544036865} 11/07/2021 10:46:38 - INFO - __main__ - Step 96336: {'lr': 0.0001454073601747802, 'samples': 18496512, 'steps': 96335, 'loss/train': 1.6396219730377197} 11/07/2021 10:46:39 - INFO - __main__ - Step 96337: {'lr': 0.00014540254020752857, 'samples': 18496704, 'steps': 96336, 'loss/train': 1.2397915124893188} 11/07/2021 10:46:39 - INFO - __main__ - Step 96338: {'lr': 0.00014539772028740689, 'samples': 18496896, 'steps': 96337, 'loss/train': 1.2906721830368042} 11/07/2021 10:46:40 - INFO - __main__ - Step 96339: {'lr': 0.00014539290041441736, 'samples': 18497088, 'steps': 96338, 'loss/train': 1.1759226322174072} 11/07/2021 10:46:40 - INFO - __main__ - Step 96340: {'lr': 0.00014538808058856217, 'samples': 18497280, 'steps': 96339, 'loss/train': 1.4284024238586426} 11/07/2021 10:46:40 - INFO - __main__ - Step 96341: {'lr': 0.0001453832608098435, 'samples': 18497472, 'steps': 96340, 'loss/train': 1.066781997680664} 11/07/2021 10:46:41 - INFO - __main__ - Step 96342: {'lr': 0.0001453784410782635, 'samples': 18497664, 'steps': 96341, 'loss/train': 1.417333960533142} 11/07/2021 10:46:42 - INFO - __main__ - Step 96343: {'lr': 0.00014537362139382438, 'samples': 18497856, 'steps': 96342, 'loss/train': 0.7830007672309875} 11/07/2021 10:46:42 - INFO - __main__ - Step 96344: {'lr': 0.00014536880175652827, 'samples': 18498048, 'steps': 96343, 'loss/train': 1.398014783859253} 11/07/2021 10:46:42 - INFO - __main__ - Step 96345: {'lr': 0.0001453639821663774, 'samples': 18498240, 'steps': 96344, 'loss/train': 1.8024448156356812} 11/07/2021 10:46:43 - INFO - __main__ - Step 96346: {'lr': 0.0001453591626233739, 'samples': 18498432, 'steps': 96345, 'loss/train': 1.503496766090393} 11/07/2021 10:46:43 - INFO - __main__ - Step 96347: {'lr': 0.00014535434312751993, 'samples': 18498624, 'steps': 96346, 'loss/train': 1.6893366575241089} 11/07/2021 10:46:44 - INFO - __main__ - Step 96348: {'lr': 0.00014534952367881764, 'samples': 18498816, 'steps': 96347, 'loss/train': 3.764418601989746} 11/07/2021 10:46:45 - INFO - __main__ - Step 96349: {'lr': 0.0001453447042772693, 'samples': 18499008, 'steps': 96348, 'loss/train': 1.346003532409668} 11/07/2021 10:46:45 - INFO - __main__ - Step 96350: {'lr': 0.0001453398849228771, 'samples': 18499200, 'steps': 96349, 'loss/train': 0.7234417200088501} 11/07/2021 10:46:45 - INFO - __main__ - Step 96351: {'lr': 0.00014533506561564306, 'samples': 18499392, 'steps': 96350, 'loss/train': 1.105097770690918} 11/07/2021 10:46:46 - INFO - __main__ - Step 96352: {'lr': 0.0001453302463555694, 'samples': 18499584, 'steps': 96351, 'loss/train': 1.7225089073181152} 11/07/2021 10:46:46 - INFO - __main__ - Step 96353: {'lr': 0.0001453254271426583, 'samples': 18499776, 'steps': 96352, 'loss/train': 1.7406976222991943} 11/07/2021 10:46:47 - INFO - __main__ - Step 96354: {'lr': 0.00014532060797691195, 'samples': 18499968, 'steps': 96353, 'loss/train': 1.336310863494873} 11/07/2021 10:46:48 - INFO - __main__ - Step 96355: {'lr': 0.00014531578885833255, 'samples': 18500160, 'steps': 96354, 'loss/train': 1.7610548734664917} 11/07/2021 10:46:48 - INFO - __main__ - Step 96356: {'lr': 0.0001453109697869222, 'samples': 18500352, 'steps': 96355, 'loss/train': 1.1065386533737183} 11/07/2021 10:46:48 - INFO - __main__ - Step 96357: {'lr': 0.00014530615076268317, 'samples': 18500544, 'steps': 96356, 'loss/train': 0.9900149703025818} 11/07/2021 10:46:49 - INFO - __main__ - Step 96358: {'lr': 0.0001453013317856175, 'samples': 18500736, 'steps': 96357, 'loss/train': 1.2345987558364868} 11/07/2021 10:46:50 - INFO - __main__ - Step 96359: {'lr': 0.00014529651285572748, 'samples': 18500928, 'steps': 96358, 'loss/train': 0.5549294948577881} 11/07/2021 10:46:50 - INFO - __main__ - Step 96360: {'lr': 0.00014529169397301523, 'samples': 18501120, 'steps': 96359, 'loss/train': 1.278050422668457} 11/07/2021 10:46:50 - INFO - __main__ - Step 96361: {'lr': 0.00014528687513748294, 'samples': 18501312, 'steps': 96360, 'loss/train': 1.5378637313842773} 11/07/2021 10:46:51 - INFO - __main__ - Step 96362: {'lr': 0.0001452820563491327, 'samples': 18501504, 'steps': 96361, 'loss/train': 1.3689823150634766} 11/07/2021 10:46:51 - INFO - __main__ - Step 96363: {'lr': 0.00014527723760796686, 'samples': 18501696, 'steps': 96362, 'loss/train': 1.0410912036895752} 11/07/2021 10:46:52 - INFO - __main__ - Step 96364: {'lr': 0.0001452724189139875, 'samples': 18501888, 'steps': 96363, 'loss/train': 1.4231880903244019} 11/07/2021 10:46:53 - INFO - __main__ - Step 96365: {'lr': 0.0001452676002671967, 'samples': 18502080, 'steps': 96364, 'loss/train': 1.43134343624115} 11/07/2021 10:46:53 - INFO - __main__ - Step 96366: {'lr': 0.00014526278166759668, 'samples': 18502272, 'steps': 96365, 'loss/train': 1.3601927757263184} 11/07/2021 10:46:53 - INFO - __main__ - Step 96367: {'lr': 0.00014525796311518966, 'samples': 18502464, 'steps': 96366, 'loss/train': 1.0299127101898193} 11/07/2021 10:46:54 - INFO - __main__ - Step 96368: {'lr': 0.00014525314460997777, 'samples': 18502656, 'steps': 96367, 'loss/train': 1.2456773519515991} 11/07/2021 10:46:54 - INFO - __main__ - Step 96369: {'lr': 0.00014524832615196321, 'samples': 18502848, 'steps': 96368, 'loss/train': 1.3911908864974976} 11/07/2021 10:46:55 - INFO - __main__ - Step 96370: {'lr': 0.00014524350774114815, 'samples': 18503040, 'steps': 96369, 'loss/train': 0.7211460471153259} 11/07/2021 10:46:55 - INFO - __main__ - Step 96371: {'lr': 0.0001452386893775347, 'samples': 18503232, 'steps': 96370, 'loss/train': 1.4713484048843384} 11/07/2021 10:46:56 - INFO - __main__ - Step 96372: {'lr': 0.00014523387106112512, 'samples': 18503424, 'steps': 96371, 'loss/train': 0.9763208031654358} 11/07/2021 10:46:56 - INFO - __main__ - Step 96373: {'lr': 0.00014522905279192152, 'samples': 18503616, 'steps': 96372, 'loss/train': 1.7625523805618286} 11/07/2021 10:46:56 - INFO - __main__ - Step 96374: {'lr': 0.00014522423456992612, 'samples': 18503808, 'steps': 96373, 'loss/train': 1.378046989440918} 11/07/2021 10:46:57 - INFO - __main__ - Step 96375: {'lr': 0.00014521941639514103, 'samples': 18504000, 'steps': 96374, 'loss/train': 1.3085824251174927} 11/07/2021 10:46:58 - INFO - __main__ - Step 96376: {'lr': 0.00014521459826756847, 'samples': 18504192, 'steps': 96375, 'loss/train': 1.2738512754440308} 11/07/2021 10:46:58 - INFO - __main__ - Step 96377: {'lr': 0.0001452097801872107, 'samples': 18504384, 'steps': 96376, 'loss/train': 1.4004948139190674} 11/07/2021 10:46:58 - INFO - __main__ - Step 96378: {'lr': 0.0001452049621540697, 'samples': 18504576, 'steps': 96377, 'loss/train': 0.11105266213417053} 11/07/2021 10:46:59 - INFO - __main__ - Step 96379: {'lr': 0.0001452001441681477, 'samples': 18504768, 'steps': 96378, 'loss/train': 0.8773264288902283} 11/07/2021 10:47:00 - INFO - __main__ - Step 96380: {'lr': 0.0001451953262294469, 'samples': 18504960, 'steps': 96379, 'loss/train': 1.9874272346496582} 11/07/2021 10:47:00 - INFO - __main__ - Step 96381: {'lr': 0.0001451905083379695, 'samples': 18505152, 'steps': 96380, 'loss/train': 0.9259125590324402} 11/07/2021 10:47:01 - INFO - __main__ - Step 96382: {'lr': 0.00014518569049371758, 'samples': 18505344, 'steps': 96381, 'loss/train': 1.0839040279388428} 11/07/2021 10:47:01 - INFO - __main__ - Step 96383: {'lr': 0.00014518087269669338, 'samples': 18505536, 'steps': 96382, 'loss/train': 1.1209468841552734} 11/07/2021 10:47:01 - INFO - __main__ - Step 96384: {'lr': 0.00014517605494689912, 'samples': 18505728, 'steps': 96383, 'loss/train': 1.7678771018981934} 11/07/2021 10:47:02 - INFO - __main__ - Step 96385: {'lr': 0.00014517123724433687, 'samples': 18505920, 'steps': 96384, 'loss/train': 1.717936396598816} 11/07/2021 10:47:03 - INFO - __main__ - Step 96386: {'lr': 0.00014516641958900884, 'samples': 18506112, 'steps': 96385, 'loss/train': 1.5018434524536133} 11/07/2021 10:47:03 - INFO - __main__ - Step 96387: {'lr': 0.00014516160198091722, 'samples': 18506304, 'steps': 96386, 'loss/train': 1.4614695310592651} 11/07/2021 10:47:03 - INFO - __main__ - Step 96388: {'lr': 0.00014515678442006416, 'samples': 18506496, 'steps': 96387, 'loss/train': 1.2753691673278809} 11/07/2021 10:47:04 - INFO - __main__ - Step 96389: {'lr': 0.00014515196690645182, 'samples': 18506688, 'steps': 96388, 'loss/train': 1.3755689859390259} 11/07/2021 10:47:05 - INFO - __main__ - Step 96390: {'lr': 0.0001451471494400825, 'samples': 18506880, 'steps': 96389, 'loss/train': 1.384629249572754} 11/07/2021 10:47:05 - INFO - __main__ - Step 96391: {'lr': 0.00014514233202095816, 'samples': 18507072, 'steps': 96390, 'loss/train': 1.2013875246047974} 11/07/2021 10:47:05 - INFO - __main__ - Step 96392: {'lr': 0.0001451375146490811, 'samples': 18507264, 'steps': 96391, 'loss/train': 1.3381396532058716} 11/07/2021 10:47:06 - INFO - __main__ - Step 96393: {'lr': 0.00014513269732445338, 'samples': 18507456, 'steps': 96392, 'loss/train': 1.420705795288086} 11/07/2021 10:47:06 - INFO - __main__ - Step 96394: {'lr': 0.00014512788004707733, 'samples': 18507648, 'steps': 96393, 'loss/train': 1.257611632347107} 11/07/2021 10:47:07 - INFO - __main__ - Step 96395: {'lr': 0.00014512306281695497, 'samples': 18507840, 'steps': 96394, 'loss/train': 1.4834178686141968} 11/07/2021 10:47:08 - INFO - __main__ - Step 96396: {'lr': 0.0001451182456340886, 'samples': 18508032, 'steps': 96395, 'loss/train': 0.6368695497512817} 11/07/2021 10:47:08 - INFO - __main__ - Step 96397: {'lr': 0.0001451134284984803, 'samples': 18508224, 'steps': 96396, 'loss/train': 0.9742182493209839} 11/07/2021 10:47:08 - INFO - __main__ - Step 96398: {'lr': 0.00014510861141013226, 'samples': 18508416, 'steps': 96397, 'loss/train': 1.4836173057556152} 11/07/2021 10:47:09 - INFO - __main__ - Step 96399: {'lr': 0.00014510379436904664, 'samples': 18508608, 'steps': 96398, 'loss/train': 1.1999006271362305} 11/07/2021 10:47:09 - INFO - __main__ - Step 96400: {'lr': 0.00014509897737522567, 'samples': 18508800, 'steps': 96399, 'loss/train': 1.2030586004257202} 11/07/2021 10:47:10 - INFO - __main__ - Step 96401: {'lr': 0.00014509416042867148, 'samples': 18508992, 'steps': 96400, 'loss/train': 1.2689558267593384} 11/07/2021 10:47:10 - INFO - __main__ - Step 96402: {'lr': 0.00014508934352938625, 'samples': 18509184, 'steps': 96401, 'loss/train': 1.5274198055267334} 11/07/2021 10:47:11 - INFO - __main__ - Step 96403: {'lr': 0.00014508452667737212, 'samples': 18509376, 'steps': 96402, 'loss/train': 1.3048312664031982} 11/07/2021 10:47:11 - INFO - __main__ - Step 96404: {'lr': 0.00014507970987263138, 'samples': 18509568, 'steps': 96403, 'loss/train': 1.0878915786743164} 11/07/2021 10:47:11 - INFO - __main__ - Step 96405: {'lr': 0.00014507489311516602, 'samples': 18509760, 'steps': 96404, 'loss/train': 1.4406756162643433} 11/07/2021 10:47:12 - INFO - __main__ - Step 96406: {'lr': 0.00014507007640497828, 'samples': 18509952, 'steps': 96405, 'loss/train': 1.2369357347488403} 11/07/2021 10:47:13 - INFO - __main__ - Step 96407: {'lr': 0.00014506525974207035, 'samples': 18510144, 'steps': 96406, 'loss/train': 1.5692028999328613} 11/07/2021 10:47:13 - INFO - __main__ - Step 96408: {'lr': 0.00014506044312644442, 'samples': 18510336, 'steps': 96407, 'loss/train': 1.550208330154419} 11/07/2021 10:47:14 - INFO - __main__ - Step 96409: {'lr': 0.00014505562655810263, 'samples': 18510528, 'steps': 96408, 'loss/train': 1.3778939247131348} 11/07/2021 10:47:14 - INFO - __main__ - Step 96410: {'lr': 0.00014505081003704712, 'samples': 18510720, 'steps': 96409, 'loss/train': 1.2698806524276733} 11/07/2021 10:47:15 - INFO - __main__ - Step 96411: {'lr': 0.00014504599356328013, 'samples': 18510912, 'steps': 96410, 'loss/train': 1.5276395082473755} 11/07/2021 10:47:15 - INFO - __main__ - Step 96412: {'lr': 0.00014504117713680376, 'samples': 18511104, 'steps': 96411, 'loss/train': 1.1581841707229614} 11/07/2021 10:47:16 - INFO - __main__ - Step 96413: {'lr': 0.00014503636075762025, 'samples': 18511296, 'steps': 96412, 'loss/train': 1.0395253896713257} 11/07/2021 10:47:16 - INFO - __main__ - Step 96414: {'lr': 0.00014503154442573174, 'samples': 18511488, 'steps': 96413, 'loss/train': 1.086053490638733} 11/07/2021 10:47:16 - INFO - __main__ - Step 96415: {'lr': 0.00014502672814114038, 'samples': 18511680, 'steps': 96414, 'loss/train': 1.8177423477172852} 11/07/2021 10:47:17 - INFO - __main__ - Step 96416: {'lr': 0.00014502191190384834, 'samples': 18511872, 'steps': 96415, 'loss/train': 0.8665468096733093} 11/07/2021 10:47:18 - INFO - __main__ - Step 96417: {'lr': 0.0001450170957138579, 'samples': 18512064, 'steps': 96416, 'loss/train': 1.283052682876587} 11/07/2021 10:47:18 - INFO - __main__ - Step 96418: {'lr': 0.0001450122795711711, 'samples': 18512256, 'steps': 96417, 'loss/train': 1.2877187728881836} 11/07/2021 10:47:18 - INFO - __main__ - Step 96419: {'lr': 0.00014500746347579008, 'samples': 18512448, 'steps': 96418, 'loss/train': 1.4263851642608643} 11/07/2021 10:47:19 - INFO - __main__ - Step 96420: {'lr': 0.00014500264742771713, 'samples': 18512640, 'steps': 96419, 'loss/train': 1.3387327194213867} 11/07/2021 10:47:20 - INFO - __main__ - Step 96421: {'lr': 0.00014499783142695434, 'samples': 18512832, 'steps': 96420, 'loss/train': 1.4085536003112793} 11/07/2021 10:47:20 - INFO - __main__ - Step 96422: {'lr': 0.0001449930154735039, 'samples': 18513024, 'steps': 96421, 'loss/train': 1.118165135383606} 11/07/2021 10:47:21 - INFO - __main__ - Step 96423: {'lr': 0.00014498819956736798, 'samples': 18513216, 'steps': 96422, 'loss/train': 1.3927688598632812} 11/07/2021 10:47:21 - INFO - __main__ - Step 96424: {'lr': 0.00014498338370854877, 'samples': 18513408, 'steps': 96423, 'loss/train': 1.6579201221466064} 11/07/2021 10:47:21 - INFO - __main__ - Step 96425: {'lr': 0.00014497856789704843, 'samples': 18513600, 'steps': 96424, 'loss/train': 1.7775044441223145} 11/07/2021 10:47:22 - INFO - __main__ - Step 96426: {'lr': 0.00014497375213286912, 'samples': 18513792, 'steps': 96425, 'loss/train': 1.3332282304763794} 11/07/2021 10:47:23 - INFO - __main__ - Step 96427: {'lr': 0.00014496893641601302, 'samples': 18513984, 'steps': 96426, 'loss/train': 1.641907811164856} 11/07/2021 10:47:23 - INFO - __main__ - Step 96428: {'lr': 0.0001449641207464823, 'samples': 18514176, 'steps': 96427, 'loss/train': 1.7670313119888306} 11/07/2021 10:47:23 - INFO - __main__ - Step 96429: {'lr': 0.00014495930512427912, 'samples': 18514368, 'steps': 96428, 'loss/train': 1.2919012308120728} 11/07/2021 10:47:24 - INFO - __main__ - Step 96430: {'lr': 0.00014495448954940566, 'samples': 18514560, 'steps': 96429, 'loss/train': 1.3700796365737915} 11/07/2021 10:47:24 - INFO - __main__ - Step 96431: {'lr': 0.0001449496740218642, 'samples': 18514752, 'steps': 96430, 'loss/train': 1.4384315013885498} 11/07/2021 10:47:25 - INFO - __main__ - Step 96432: {'lr': 0.00014494485854165667, 'samples': 18514944, 'steps': 96431, 'loss/train': 1.8211997747421265} 11/07/2021 10:47:26 - INFO - __main__ - Step 96433: {'lr': 0.0001449400431087854, 'samples': 18515136, 'steps': 96432, 'loss/train': 1.451266884803772} 11/07/2021 10:47:26 - INFO - __main__ - Step 96434: {'lr': 0.00014493522772325248, 'samples': 18515328, 'steps': 96433, 'loss/train': 1.2398210763931274} 11/07/2021 10:47:26 - INFO - __main__ - Step 96435: {'lr': 0.00014493041238506016, 'samples': 18515520, 'steps': 96434, 'loss/train': 1.2395097017288208} 11/07/2021 10:47:27 - INFO - __main__ - Step 96436: {'lr': 0.00014492559709421054, 'samples': 18515712, 'steps': 96435, 'loss/train': 1.5293692350387573} 11/07/2021 10:47:28 - INFO - __main__ - Step 96437: {'lr': 0.00014492078185070583, 'samples': 18515904, 'steps': 96436, 'loss/train': 0.8256000876426697} 11/07/2021 10:47:28 - INFO - __main__ - Step 96438: {'lr': 0.00014491596665454825, 'samples': 18516096, 'steps': 96437, 'loss/train': 1.7216888666152954} 11/07/2021 10:47:28 - INFO - __main__ - Step 96439: {'lr': 0.00014491115150573985, 'samples': 18516288, 'steps': 96438, 'loss/train': 0.975712239742279} 11/07/2021 10:47:29 - INFO - __main__ - Step 96440: {'lr': 0.00014490633640428291, 'samples': 18516480, 'steps': 96439, 'loss/train': 1.4851362705230713} 11/07/2021 10:47:29 - INFO - __main__ - Step 96441: {'lr': 0.00014490152135017954, 'samples': 18516672, 'steps': 96440, 'loss/train': 1.249961018562317} 11/07/2021 10:47:30 - INFO - __main__ - Step 96442: {'lr': 0.0001448967063434319, 'samples': 18516864, 'steps': 96441, 'loss/train': 0.872055172920227} 11/07/2021 10:47:30 - INFO - __main__ - Step 96443: {'lr': 0.00014489189138404217, 'samples': 18517056, 'steps': 96442, 'loss/train': 1.391629695892334} 11/07/2021 10:47:31 - INFO - __main__ - Step 96444: {'lr': 0.00014488707647201268, 'samples': 18517248, 'steps': 96443, 'loss/train': 1.5287890434265137} 11/07/2021 10:47:31 - INFO - __main__ - Step 96445: {'lr': 0.00014488226160734536, 'samples': 18517440, 'steps': 96444, 'loss/train': 1.058474063873291} 11/07/2021 10:47:32 - INFO - __main__ - Step 96446: {'lr': 0.00014487744679004242, 'samples': 18517632, 'steps': 96445, 'loss/train': 1.5027961730957031} 11/07/2021 10:47:33 - INFO - __main__ - Step 96447: {'lr': 0.00014487263202010608, 'samples': 18517824, 'steps': 96446, 'loss/train': 1.1670621633529663} 11/07/2021 10:47:33 - INFO - __main__ - Step 96448: {'lr': 0.00014486781729753856, 'samples': 18518016, 'steps': 96447, 'loss/train': 1.323168396949768} 11/07/2021 10:47:33 - INFO - __main__ - Step 96449: {'lr': 0.00014486300262234192, 'samples': 18518208, 'steps': 96448, 'loss/train': 1.271837830543518} 11/07/2021 10:47:34 - INFO - __main__ - Step 96450: {'lr': 0.00014485818799451843, 'samples': 18518400, 'steps': 96449, 'loss/train': 1.3119237422943115} 11/07/2021 10:47:34 - INFO - __main__ - Step 96451: {'lr': 0.00014485337341407024, 'samples': 18518592, 'steps': 96450, 'loss/train': 1.640915870666504} 11/07/2021 10:47:35 - INFO - __main__ - Step 96452: {'lr': 0.00014484855888099947, 'samples': 18518784, 'steps': 96451, 'loss/train': 0.337226539850235} 11/07/2021 10:47:35 - INFO - __main__ - Step 96453: {'lr': 0.00014484374439530827, 'samples': 18518976, 'steps': 96452, 'loss/train': 1.7065006494522095} 11/07/2021 10:47:36 - INFO - __main__ - Step 96454: {'lr': 0.0001448389299569989, 'samples': 18519168, 'steps': 96453, 'loss/train': 1.1193040609359741} 11/07/2021 10:47:36 - INFO - __main__ - Step 96455: {'lr': 0.00014483411556607352, 'samples': 18519360, 'steps': 96454, 'loss/train': 1.1237283945083618} 11/07/2021 10:47:37 - INFO - __main__ - Step 96456: {'lr': 0.00014482930122253419, 'samples': 18519552, 'steps': 96455, 'loss/train': 1.2772139310836792} 11/07/2021 10:47:38 - INFO - __main__ - Step 96457: {'lr': 0.0001448244869263832, 'samples': 18519744, 'steps': 96456, 'loss/train': 1.5340607166290283} 11/07/2021 10:47:38 - INFO - __main__ - Step 96458: {'lr': 0.00014481967267762275, 'samples': 18519936, 'steps': 96457, 'loss/train': 1.3657208681106567} 11/07/2021 10:47:38 - INFO - __main__ - Step 96459: {'lr': 0.00014481485847625487, 'samples': 18520128, 'steps': 96458, 'loss/train': 1.2337472438812256} 11/07/2021 10:47:39 - INFO - __main__ - Step 96460: {'lr': 0.00014481004432228176, 'samples': 18520320, 'steps': 96459, 'loss/train': 1.3581887483596802} 11/07/2021 10:47:39 - INFO - __main__ - Step 96461: {'lr': 0.00014480523021570562, 'samples': 18520512, 'steps': 96460, 'loss/train': 1.588034749031067} 11/07/2021 10:47:39 - INFO - __main__ - Step 96462: {'lr': 0.00014480041615652864, 'samples': 18520704, 'steps': 96461, 'loss/train': 1.3955342769622803} 11/07/2021 10:47:40 - INFO - __main__ - Step 96463: {'lr': 0.00014479560214475295, 'samples': 18520896, 'steps': 96462, 'loss/train': 1.565994143486023} 11/07/2021 10:47:41 - INFO - __main__ - Step 96464: {'lr': 0.00014479078818038077, 'samples': 18521088, 'steps': 96463, 'loss/train': 0.9021571278572083} 11/07/2021 10:47:41 - INFO - __main__ - Step 96465: {'lr': 0.00014478597426341422, 'samples': 18521280, 'steps': 96464, 'loss/train': 1.6282039880752563} 11/07/2021 10:47:41 - INFO - __main__ - Step 96466: {'lr': 0.00014478116039385547, 'samples': 18521472, 'steps': 96465, 'loss/train': 1.4934128522872925} 11/07/2021 10:47:42 - INFO - __main__ - Step 96467: {'lr': 0.00014477634657170671, 'samples': 18521664, 'steps': 96466, 'loss/train': 1.098114252090454} 11/07/2021 10:47:43 - INFO - __main__ - Step 96468: {'lr': 0.00014477153279697012, 'samples': 18521856, 'steps': 96467, 'loss/train': 1.326973795890808} 11/07/2021 10:47:43 - INFO - __main__ - Step 96469: {'lr': 0.00014476671906964782, 'samples': 18522048, 'steps': 96468, 'loss/train': 1.4419550895690918} 11/07/2021 10:47:43 - INFO - __main__ - Step 96470: {'lr': 0.00014476190538974205, 'samples': 18522240, 'steps': 96469, 'loss/train': 1.264264702796936} 11/07/2021 10:47:44 - INFO - __main__ - Step 96471: {'lr': 0.00014475709175725506, 'samples': 18522432, 'steps': 96470, 'loss/train': 1.3728322982788086} 11/07/2021 10:47:44 - INFO - __main__ - Step 96472: {'lr': 0.00014475227817218873, 'samples': 18522624, 'steps': 96471, 'loss/train': 1.5077191591262817} 11/07/2021 10:47:45 - INFO - __main__ - Step 96473: {'lr': 0.00014474746463454547, 'samples': 18522816, 'steps': 96472, 'loss/train': 1.417741060256958} 11/07/2021 10:47:46 - INFO - __main__ - Step 96474: {'lr': 0.00014474265114432732, 'samples': 18523008, 'steps': 96473, 'loss/train': 1.5986871719360352} 11/07/2021 10:47:46 - INFO - __main__ - Step 96475: {'lr': 0.00014473783770153654, 'samples': 18523200, 'steps': 96474, 'loss/train': 1.3903063535690308} 11/07/2021 10:47:46 - INFO - __main__ - Step 96476: {'lr': 0.00014473302430617523, 'samples': 18523392, 'steps': 96475, 'loss/train': 1.3466750383377075} 11/07/2021 10:47:47 - INFO - __main__ - Step 96477: {'lr': 0.00014472821095824566, 'samples': 18523584, 'steps': 96476, 'loss/train': 1.5079666376113892} 11/07/2021 10:47:48 - INFO - __main__ - Step 96478: {'lr': 0.00014472339765774989, 'samples': 18523776, 'steps': 96477, 'loss/train': 1.3713042736053467} 11/07/2021 10:47:48 - INFO - __main__ - Step 96479: {'lr': 0.00014471858440469015, 'samples': 18523968, 'steps': 96478, 'loss/train': 1.4155449867248535} 11/07/2021 10:47:48 - INFO - __main__ - Step 96480: {'lr': 0.0001447137711990686, 'samples': 18524160, 'steps': 96479, 'loss/train': 1.5953575372695923} 11/07/2021 10:47:49 - INFO - __main__ - Step 96481: {'lr': 0.00014470895804088736, 'samples': 18524352, 'steps': 96480, 'loss/train': 1.2885632514953613} 11/07/2021 10:47:49 - INFO - __main__ - Step 96482: {'lr': 0.00014470414493014867, 'samples': 18524544, 'steps': 96481, 'loss/train': 1.345041275024414} 11/07/2021 10:47:50 - INFO - __main__ - Step 96483: {'lr': 0.00014469933186685464, 'samples': 18524736, 'steps': 96482, 'loss/train': 1.1248165369033813} 11/07/2021 10:47:51 - INFO - __main__ - Step 96484: {'lr': 0.0001446945188510076, 'samples': 18524928, 'steps': 96483, 'loss/train': 1.7702687978744507} 11/07/2021 10:47:51 - INFO - __main__ - Step 96485: {'lr': 0.00014468970588260945, 'samples': 18525120, 'steps': 96484, 'loss/train': 1.2956340312957764} 11/07/2021 10:47:51 - INFO - __main__ - Step 96486: {'lr': 0.00014468489296166255, 'samples': 18525312, 'steps': 96485, 'loss/train': 1.5179113149642944} 11/07/2021 10:47:52 - INFO - __main__ - Step 96487: {'lr': 0.00014468008008816896, 'samples': 18525504, 'steps': 96486, 'loss/train': 1.2340182065963745} 11/07/2021 10:47:52 - INFO - __main__ - Step 96488: {'lr': 0.00014467526726213092, 'samples': 18525696, 'steps': 96487, 'loss/train': 1.9607264995574951} 11/07/2021 10:47:53 - INFO - __main__ - Step 96489: {'lr': 0.00014467045448355057, 'samples': 18525888, 'steps': 96488, 'loss/train': 1.2446008920669556} 11/07/2021 10:47:53 - INFO - __main__ - Step 96490: {'lr': 0.00014466564175243007, 'samples': 18526080, 'steps': 96489, 'loss/train': 1.8456761837005615} 11/07/2021 10:47:54 - INFO - __main__ - Step 96491: {'lr': 0.00014466082906877166, 'samples': 18526272, 'steps': 96490, 'loss/train': 1.4583966732025146} 11/07/2021 10:47:54 - INFO - __main__ - Step 96492: {'lr': 0.00014465601643257742, 'samples': 18526464, 'steps': 96491, 'loss/train': 1.1520638465881348} 11/07/2021 10:47:54 - INFO - __main__ - Step 96493: {'lr': 0.00014465120384384955, 'samples': 18526656, 'steps': 96492, 'loss/train': 1.2160413265228271} 11/07/2021 10:47:55 - INFO - __main__ - Step 96494: {'lr': 0.00014464639130259022, 'samples': 18526848, 'steps': 96493, 'loss/train': 1.4007371664047241} 11/07/2021 10:47:56 - INFO - __main__ - Step 96495: {'lr': 0.0001446415788088017, 'samples': 18527040, 'steps': 96494, 'loss/train': 1.5938249826431274} 11/07/2021 10:47:56 - INFO - __main__ - Step 96496: {'lr': 0.000144636766362486, 'samples': 18527232, 'steps': 96495, 'loss/train': 1.899671196937561} 11/07/2021 10:47:56 - INFO - __main__ - Step 96497: {'lr': 0.00014463195396364532, 'samples': 18527424, 'steps': 96496, 'loss/train': 1.7788581848144531} 11/07/2021 10:47:57 - INFO - __main__ - Step 96498: {'lr': 0.00014462714161228186, 'samples': 18527616, 'steps': 96497, 'loss/train': 1.4466255903244019} 11/07/2021 10:47:58 - INFO - __main__ - Step 96499: {'lr': 0.00014462232930839776, 'samples': 18527808, 'steps': 96498, 'loss/train': 1.444192886352539} 11/07/2021 10:47:58 - INFO - __main__ - Step 96500: {'lr': 0.00014461751705199523, 'samples': 18528000, 'steps': 96499, 'loss/train': 1.4190274477005005} 11/07/2021 10:47:58 - INFO - __main__ - Step 96501: {'lr': 0.00014461270484307642, 'samples': 18528192, 'steps': 96500, 'loss/train': 1.3151283264160156} 11/07/2021 10:47:59 - INFO - __main__ - Step 96502: {'lr': 0.0001446078926816435, 'samples': 18528384, 'steps': 96501, 'loss/train': 0.6014633774757385} 11/07/2021 10:47:59 - INFO - __main__ - Step 96503: {'lr': 0.0001446030805676986, 'samples': 18528576, 'steps': 96502, 'loss/train': 1.363160490989685} 11/07/2021 10:48:00 - INFO - __main__ - Step 96504: {'lr': 0.00014459826850124396, 'samples': 18528768, 'steps': 96503, 'loss/train': 1.218808889389038} 11/07/2021 10:48:00 - INFO - __main__ - Step 96505: {'lr': 0.00014459345648228173, 'samples': 18528960, 'steps': 96504, 'loss/train': 1.0057330131530762} 11/07/2021 10:48:01 - INFO - __main__ - Step 96506: {'lr': 0.00014458864451081415, 'samples': 18529152, 'steps': 96505, 'loss/train': 1.5544236898422241} 11/07/2021 10:48:01 - INFO - __main__ - Step 96507: {'lr': 0.00014458383258684321, 'samples': 18529344, 'steps': 96506, 'loss/train': 2.1363086700439453} 11/07/2021 10:48:02 - INFO - __main__ - Step 96508: {'lr': 0.00014457902071037115, 'samples': 18529536, 'steps': 96507, 'loss/train': 1.2818728685379028} 11/07/2021 10:48:02 - INFO - __main__ - Step 96509: {'lr': 0.00014457420888140015, 'samples': 18529728, 'steps': 96508, 'loss/train': 1.4510215520858765} 11/07/2021 10:48:03 - INFO - __main__ - Step 96510: {'lr': 0.00014456939709993238, 'samples': 18529920, 'steps': 96509, 'loss/train': 1.2154439687728882} 11/07/2021 10:48:03 - INFO - __main__ - Step 96511: {'lr': 0.00014456458536597005, 'samples': 18530112, 'steps': 96510, 'loss/train': 1.3678334951400757} 11/07/2021 10:48:04 - INFO - __main__ - Step 96512: {'lr': 0.00014455977367951528, 'samples': 18530304, 'steps': 96511, 'loss/train': 0.6212591528892517} 11/07/2021 10:48:04 - INFO - __main__ - Step 96513: {'lr': 0.00014455496204057023, 'samples': 18530496, 'steps': 96512, 'loss/train': 1.5141668319702148} 11/07/2021 10:48:05 - INFO - __main__ - Step 96514: {'lr': 0.0001445501504491371, 'samples': 18530688, 'steps': 96513, 'loss/train': 1.0530532598495483} 11/07/2021 10:48:05 - INFO - __main__ - Step 96515: {'lr': 0.00014454533890521804, 'samples': 18530880, 'steps': 96514, 'loss/train': 1.0149271488189697} 11/07/2021 10:48:06 - INFO - __main__ - Step 96516: {'lr': 0.00014454052740881524, 'samples': 18531072, 'steps': 96515, 'loss/train': 1.3134146928787231} 11/07/2021 10:48:06 - INFO - __main__ - Step 96517: {'lr': 0.00014453571595993093, 'samples': 18531264, 'steps': 96516, 'loss/train': 1.5957555770874023} 11/07/2021 10:48:06 - INFO - __main__ - Step 96518: {'lr': 0.0001445309045585671, 'samples': 18531456, 'steps': 96517, 'loss/train': 0.980661153793335} 11/07/2021 10:48:07 - INFO - __main__ - Step 96519: {'lr': 0.00014452609320472602, 'samples': 18531648, 'steps': 96518, 'loss/train': 1.5319480895996094} 11/07/2021 10:48:08 - INFO - __main__ - Step 96520: {'lr': 0.00014452128189840986, 'samples': 18531840, 'steps': 96519, 'loss/train': 1.2281020879745483} 11/07/2021 10:48:08 - INFO - __main__ - Step 96521: {'lr': 0.00014451647063962075, 'samples': 18532032, 'steps': 96520, 'loss/train': 0.2197187840938568} 11/07/2021 10:48:09 - INFO - __main__ - Step 96522: {'lr': 0.00014451165942836093, 'samples': 18532224, 'steps': 96521, 'loss/train': 2.3008670806884766} 11/07/2021 10:48:09 - INFO - __main__ - Step 96523: {'lr': 0.0001445068482646325, 'samples': 18532416, 'steps': 96522, 'loss/train': 1.4058887958526611} 11/07/2021 10:48:09 - INFO - __main__ - Step 96524: {'lr': 0.0001445020371484377, 'samples': 18532608, 'steps': 96523, 'loss/train': 1.409214973449707} 11/07/2021 10:48:10 - INFO - __main__ - Step 96525: {'lr': 0.00014449722607977862, 'samples': 18532800, 'steps': 96524, 'loss/train': 0.940679132938385} 11/07/2021 10:48:11 - INFO - __main__ - Step 96526: {'lr': 0.00014449241505865745, 'samples': 18532992, 'steps': 96525, 'loss/train': 1.060299038887024} 11/07/2021 10:48:11 - INFO - __main__ - Step 96527: {'lr': 0.00014448760408507642, 'samples': 18533184, 'steps': 96526, 'loss/train': 1.2243677377700806} 11/07/2021 10:48:11 - INFO - __main__ - Step 96528: {'lr': 0.0001444827931590377, 'samples': 18533376, 'steps': 96527, 'loss/train': 1.5939202308654785} 11/07/2021 10:48:12 - INFO - __main__ - Step 96529: {'lr': 0.00014447798228054331, 'samples': 18533568, 'steps': 96528, 'loss/train': 2.0002803802490234} 11/07/2021 10:48:13 - INFO - __main__ - Step 96530: {'lr': 0.00014447317144959554, 'samples': 18533760, 'steps': 96529, 'loss/train': 1.3319815397262573} 11/07/2021 10:48:13 - INFO - __main__ - Step 96531: {'lr': 0.0001444683606661965, 'samples': 18533952, 'steps': 96530, 'loss/train': 1.6102018356323242} 11/07/2021 10:48:14 - INFO - __main__ - Step 96532: {'lr': 0.00014446354993034844, 'samples': 18534144, 'steps': 96531, 'loss/train': 1.3843790292739868} 11/07/2021 10:48:14 - INFO - __main__ - Step 96533: {'lr': 0.00014445873924205343, 'samples': 18534336, 'steps': 96532, 'loss/train': 1.6721380949020386} 11/07/2021 10:48:14 - INFO - __main__ - Step 96534: {'lr': 0.0001444539286013137, 'samples': 18534528, 'steps': 96533, 'loss/train': 1.2431652545928955} 11/07/2021 10:48:15 - INFO - __main__ - Step 96535: {'lr': 0.00014444911800813137, 'samples': 18534720, 'steps': 96534, 'loss/train': 1.925665259361267} 11/07/2021 10:48:16 - INFO - __main__ - Step 96536: {'lr': 0.00014444430746250866, 'samples': 18534912, 'steps': 96535, 'loss/train': 1.4163787364959717} 11/07/2021 10:48:16 - INFO - __main__ - Step 96537: {'lr': 0.00014443949696444776, 'samples': 18535104, 'steps': 96536, 'loss/train': 1.103784441947937} 11/07/2021 10:48:16 - INFO - __main__ - Step 96538: {'lr': 0.00014443468651395073, 'samples': 18535296, 'steps': 96537, 'loss/train': 1.7781298160552979} 11/07/2021 10:48:17 - INFO - __main__ - Step 96539: {'lr': 0.00014442987611101992, 'samples': 18535488, 'steps': 96538, 'loss/train': 0.8611140251159668} 11/07/2021 10:48:17 - INFO - __main__ - Step 96540: {'lr': 0.0001444250657556573, 'samples': 18535680, 'steps': 96539, 'loss/train': 1.2063430547714233} 11/07/2021 10:48:18 - INFO - __main__ - Step 96541: {'lr': 0.00014442025544786507, 'samples': 18535872, 'steps': 96540, 'loss/train': 0.8829474449157715} 11/07/2021 10:48:18 - INFO - __main__ - Step 96542: {'lr': 0.0001444154451876455, 'samples': 18536064, 'steps': 96541, 'loss/train': 1.358393669128418} 11/07/2021 10:48:19 - INFO - __main__ - Step 96543: {'lr': 0.00014441063497500067, 'samples': 18536256, 'steps': 96542, 'loss/train': 1.4912004470825195} 11/07/2021 10:48:19 - INFO - __main__ - Step 96544: {'lr': 0.00014440582480993274, 'samples': 18536448, 'steps': 96543, 'loss/train': 1.7874078750610352} 11/07/2021 10:48:19 - INFO - __main__ - Step 96545: {'lr': 0.000144401014692444, 'samples': 18536640, 'steps': 96544, 'loss/train': 1.0549994707107544} 11/07/2021 10:48:21 - INFO - __main__ - Step 96546: {'lr': 0.0001443962046225365, 'samples': 18536832, 'steps': 96545, 'loss/train': 0.9763548970222473} 11/07/2021 10:48:21 - INFO - __main__ - Step 96547: {'lr': 0.00014439139460021243, 'samples': 18537024, 'steps': 96546, 'loss/train': 1.6802895069122314} 11/07/2021 10:48:21 - INFO - __main__ - Step 96548: {'lr': 0.00014438658462547394, 'samples': 18537216, 'steps': 96547, 'loss/train': 1.0261638164520264} 11/07/2021 10:48:22 - INFO - __main__ - Step 96549: {'lr': 0.00014438177469832324, 'samples': 18537408, 'steps': 96548, 'loss/train': 1.8840060234069824} 11/07/2021 10:48:22 - INFO - __main__ - Step 96550: {'lr': 0.00014437696481876252, 'samples': 18537600, 'steps': 96549, 'loss/train': 1.6057230234146118} 11/07/2021 10:48:23 - INFO - __main__ - Step 96551: {'lr': 0.0001443721549867939, 'samples': 18537792, 'steps': 96550, 'loss/train': 0.9516719579696655} 11/07/2021 10:48:24 - INFO - __main__ - Step 96552: {'lr': 0.0001443673452024196, 'samples': 18537984, 'steps': 96551, 'loss/train': 1.5640937089920044} 11/07/2021 10:48:24 - INFO - __main__ - Step 96553: {'lr': 0.0001443625354656417, 'samples': 18538176, 'steps': 96552, 'loss/train': 1.9539337158203125} 11/07/2021 10:48:24 - INFO - __main__ - Step 96554: {'lr': 0.00014435772577646243, 'samples': 18538368, 'steps': 96553, 'loss/train': 1.7637983560562134} 11/07/2021 10:48:25 - INFO - __main__ - Step 96555: {'lr': 0.0001443529161348839, 'samples': 18538560, 'steps': 96554, 'loss/train': 1.6650826930999756} 11/07/2021 10:48:26 - INFO - __main__ - Step 96556: {'lr': 0.00014434810654090835, 'samples': 18538752, 'steps': 96555, 'loss/train': 1.100806474685669} 11/07/2021 10:48:26 - INFO - __main__ - Step 96557: {'lr': 0.00014434329699453786, 'samples': 18538944, 'steps': 96556, 'loss/train': 1.5353285074234009} 11/07/2021 10:48:26 - INFO - __main__ - Step 96558: {'lr': 0.0001443384874957747, 'samples': 18539136, 'steps': 96557, 'loss/train': 1.1521565914154053} 11/07/2021 10:48:27 - INFO - __main__ - Step 96559: {'lr': 0.00014433367804462095, 'samples': 18539328, 'steps': 96558, 'loss/train': 1.507103443145752} 11/07/2021 10:48:27 - INFO - __main__ - Step 96560: {'lr': 0.00014432886864107884, 'samples': 18539520, 'steps': 96559, 'loss/train': 1.1895369291305542} 11/07/2021 10:48:27 - INFO - __main__ - Step 96561: {'lr': 0.0001443240592851505, 'samples': 18539712, 'steps': 96560, 'loss/train': 1.5462929010391235} 11/07/2021 10:48:28 - INFO - __main__ - Step 96562: {'lr': 0.0001443192499768381, 'samples': 18539904, 'steps': 96561, 'loss/train': 1.620873212814331} 11/07/2021 10:48:29 - INFO - __main__ - Step 96563: {'lr': 0.00014431444071614382, 'samples': 18540096, 'steps': 96562, 'loss/train': 1.506822109222412} 11/07/2021 10:48:29 - INFO - __main__ - Step 96564: {'lr': 0.00014430963150306982, 'samples': 18540288, 'steps': 96563, 'loss/train': 1.4633008241653442} 11/07/2021 10:48:29 - INFO - __main__ - Step 96565: {'lr': 0.00014430482233761838, 'samples': 18540480, 'steps': 96564, 'loss/train': 1.6120513677597046} 11/07/2021 10:48:30 - INFO - __main__ - Step 96566: {'lr': 0.00014430001321979148, 'samples': 18540672, 'steps': 96565, 'loss/train': 1.3613132238388062} 11/07/2021 10:48:31 - INFO - __main__ - Step 96567: {'lr': 0.0001442952041495913, 'samples': 18540864, 'steps': 96566, 'loss/train': 1.342038869857788} 11/07/2021 10:48:31 - INFO - __main__ - Step 96568: {'lr': 0.0001442903951270201, 'samples': 18541056, 'steps': 96567, 'loss/train': 1.5085152387619019} 11/07/2021 10:48:32 - INFO - __main__ - Step 96569: {'lr': 0.00014428558615208004, 'samples': 18541248, 'steps': 96568, 'loss/train': 0.6866540312767029} 11/07/2021 10:48:32 - INFO - __main__ - Step 96570: {'lr': 0.00014428077722477322, 'samples': 18541440, 'steps': 96569, 'loss/train': 1.3400005102157593} 11/07/2021 10:48:32 - INFO - __main__ - Step 96571: {'lr': 0.0001442759683451019, 'samples': 18541632, 'steps': 96570, 'loss/train': 1.3706531524658203} 11/07/2021 10:48:33 - INFO - __main__ - Step 96572: {'lr': 0.0001442711595130682, 'samples': 18541824, 'steps': 96571, 'loss/train': 1.3117741346359253} 11/07/2021 10:48:34 - INFO - __main__ - Step 96573: {'lr': 0.00014426635072867423, 'samples': 18542016, 'steps': 96572, 'loss/train': 1.5371510982513428} 11/07/2021 10:48:34 - INFO - __main__ - Step 96574: {'lr': 0.0001442615419919222, 'samples': 18542208, 'steps': 96573, 'loss/train': 1.9261410236358643} 11/07/2021 10:48:34 - INFO - __main__ - Step 96575: {'lr': 0.00014425673330281435, 'samples': 18542400, 'steps': 96574, 'loss/train': 1.1373943090438843} 11/07/2021 10:48:35 - INFO - __main__ - Step 96576: {'lr': 0.00014425192466135275, 'samples': 18542592, 'steps': 96575, 'loss/train': 1.380812168121338} 11/07/2021 10:48:36 - INFO - __main__ - Step 96577: {'lr': 0.00014424711606753963, 'samples': 18542784, 'steps': 96576, 'loss/train': 1.4137881994247437} 11/07/2021 10:48:36 - INFO - __main__ - Step 96578: {'lr': 0.0001442423075213771, 'samples': 18542976, 'steps': 96577, 'loss/train': 1.0230202674865723} 11/07/2021 10:48:36 - INFO - __main__ - Step 96579: {'lr': 0.00014423749902286746, 'samples': 18543168, 'steps': 96578, 'loss/train': 1.1559031009674072} 11/07/2021 10:48:37 - INFO - __main__ - Step 96580: {'lr': 0.00014423269057201266, 'samples': 18543360, 'steps': 96579, 'loss/train': 1.6503098011016846} 11/07/2021 10:48:37 - INFO - __main__ - Step 96581: {'lr': 0.000144227882168815, 'samples': 18543552, 'steps': 96580, 'loss/train': 1.289286732673645} 11/07/2021 10:48:38 - INFO - __main__ - Step 96582: {'lr': 0.0001442230738132766, 'samples': 18543744, 'steps': 96581, 'loss/train': 5.7266693115234375} 11/07/2021 10:48:39 - INFO - __main__ - Step 96583: {'lr': 0.00014421826550539967, 'samples': 18543936, 'steps': 96582, 'loss/train': 1.5507032871246338} 11/07/2021 10:48:39 - INFO - __main__ - Step 96584: {'lr': 0.00014421345724518637, 'samples': 18544128, 'steps': 96583, 'loss/train': 1.7390238046646118} 11/07/2021 10:48:39 - INFO - __main__ - Step 96585: {'lr': 0.00014420864903263883, 'samples': 18544320, 'steps': 96584, 'loss/train': 0.8585346937179565} 11/07/2021 10:48:40 - INFO - __main__ - Step 96586: {'lr': 0.00014420384086775924, 'samples': 18544512, 'steps': 96585, 'loss/train': 1.296247959136963} 11/07/2021 10:48:40 - INFO - __main__ - Step 96587: {'lr': 0.0001441990327505498, 'samples': 18544704, 'steps': 96586, 'loss/train': 1.078721284866333} 11/07/2021 10:48:41 - INFO - __main__ - Step 96588: {'lr': 0.0001441942246810126, 'samples': 18544896, 'steps': 96587, 'loss/train': 1.6341161727905273} 11/07/2021 10:48:41 - INFO - __main__ - Step 96589: {'lr': 0.00014418941665914986, 'samples': 18545088, 'steps': 96588, 'loss/train': 1.414486289024353} 11/07/2021 10:48:42 - INFO - __main__ - Step 96590: {'lr': 0.00014418460868496376, 'samples': 18545280, 'steps': 96589, 'loss/train': 1.3769580125808716} 11/07/2021 10:48:42 - INFO - __main__ - Step 96591: {'lr': 0.0001441798007584564, 'samples': 18545472, 'steps': 96590, 'loss/train': 1.1804722547531128} 11/07/2021 10:48:42 - INFO - __main__ - Step 96592: {'lr': 0.00014417499287963014, 'samples': 18545664, 'steps': 96591, 'loss/train': 1.0939290523529053} 11/07/2021 10:48:43 - INFO - __main__ - Step 96593: {'lr': 0.00014417018504848684, 'samples': 18545856, 'steps': 96592, 'loss/train': 1.3131550550460815} 11/07/2021 10:48:44 - INFO - __main__ - Step 96594: {'lr': 0.00014416537726502887, 'samples': 18546048, 'steps': 96593, 'loss/train': 1.9434911012649536} 11/07/2021 10:48:44 - INFO - __main__ - Step 96595: {'lr': 0.0001441605695292583, 'samples': 18546240, 'steps': 96594, 'loss/train': 1.208136796951294} 11/07/2021 10:48:44 - INFO - __main__ - Step 96596: {'lr': 0.00014415576184117741, 'samples': 18546432, 'steps': 96595, 'loss/train': 0.6861744523048401} 11/07/2021 10:48:45 - INFO - __main__ - Step 96597: {'lr': 0.00014415095420078822, 'samples': 18546624, 'steps': 96596, 'loss/train': 0.5005611181259155} 11/07/2021 10:48:45 - INFO - __main__ - Step 96598: {'lr': 0.00014414614660809304, 'samples': 18546816, 'steps': 96597, 'loss/train': 1.4793505668640137} 11/07/2021 10:48:47 - INFO - __main__ - Step 96599: {'lr': 0.00014414133906309395, 'samples': 18547008, 'steps': 96598, 'loss/train': 1.0367584228515625} 11/07/2021 10:48:47 - INFO - __main__ - Step 96600: {'lr': 0.00014413653156579315, 'samples': 18547200, 'steps': 96599, 'loss/train': 2.0712835788726807} 11/07/2021 10:48:47 - INFO - __main__ - Step 96601: {'lr': 0.0001441317241161928, 'samples': 18547392, 'steps': 96600, 'loss/train': 1.4500476121902466} 11/07/2021 10:48:48 - INFO - __main__ - Step 96602: {'lr': 0.000144126916714295, 'samples': 18547584, 'steps': 96601, 'loss/train': 1.5390561819076538} 11/07/2021 10:48:48 - INFO - __main__ - Step 96603: {'lr': 0.00014412210936010206, 'samples': 18547776, 'steps': 96602, 'loss/train': 0.6712766289710999} 11/07/2021 10:48:49 - INFO - __main__ - Step 96604: {'lr': 0.000144117302053616, 'samples': 18547968, 'steps': 96603, 'loss/train': 0.34466153383255005} 11/07/2021 10:48:49 - INFO - __main__ - Step 96605: {'lr': 0.00014411249479483909, 'samples': 18548160, 'steps': 96604, 'loss/train': 1.4364513158798218} 11/07/2021 10:48:50 - INFO - __main__ - Step 96606: {'lr': 0.00014410768758377356, 'samples': 18548352, 'steps': 96605, 'loss/train': 1.4142038822174072} 11/07/2021 10:48:50 - INFO - __main__ - Step 96607: {'lr': 0.00014410288042042137, 'samples': 18548544, 'steps': 96606, 'loss/train': 1.0163054466247559} 11/07/2021 10:48:50 - INFO - __main__ - Step 96608: {'lr': 0.00014409807330478474, 'samples': 18548736, 'steps': 96607, 'loss/train': 1.3210376501083374} 11/07/2021 10:48:51 - INFO - __main__ - Step 96609: {'lr': 0.00014409326623686592, 'samples': 18548928, 'steps': 96608, 'loss/train': 1.4044733047485352} 11/07/2021 10:48:52 - INFO - __main__ - Step 96610: {'lr': 0.00014408845921666706, 'samples': 18549120, 'steps': 96609, 'loss/train': 1.7668681144714355} 11/07/2021 10:48:52 - INFO - __main__ - Step 96611: {'lr': 0.00014408365224419028, 'samples': 18549312, 'steps': 96610, 'loss/train': 0.8393039107322693} 11/07/2021 10:48:52 - INFO - __main__ - Step 96612: {'lr': 0.00014407884531943778, 'samples': 18549504, 'steps': 96611, 'loss/train': 1.3018832206726074} 11/07/2021 10:48:53 - INFO - __main__ - Step 96613: {'lr': 0.00014407403844241172, 'samples': 18549696, 'steps': 96612, 'loss/train': 1.2274961471557617} 11/07/2021 10:48:54 - INFO - __main__ - Step 96614: {'lr': 0.00014406923161311425, 'samples': 18549888, 'steps': 96613, 'loss/train': 1.6487013101577759} 11/07/2021 10:48:54 - INFO - __main__ - Step 96615: {'lr': 0.00014406442483154755, 'samples': 18550080, 'steps': 96614, 'loss/train': 1.6254669427871704} 11/07/2021 10:48:54 - INFO - __main__ - Step 96616: {'lr': 0.00014405961809771378, 'samples': 18550272, 'steps': 96615, 'loss/train': 1.3743430376052856} 11/07/2021 10:48:55 - INFO - __main__ - Step 96617: {'lr': 0.00014405481141161513, 'samples': 18550464, 'steps': 96616, 'loss/train': 1.303456425666809} 11/07/2021 10:48:55 - INFO - __main__ - Step 96618: {'lr': 0.00014405000477325376, 'samples': 18550656, 'steps': 96617, 'loss/train': 1.19068443775177} 11/07/2021 10:48:56 - INFO - __main__ - Step 96619: {'lr': 0.0001440451981826319, 'samples': 18550848, 'steps': 96618, 'loss/train': 0.7887080907821655} 11/07/2021 10:48:57 - INFO - __main__ - Step 96620: {'lr': 0.00014404039163975156, 'samples': 18551040, 'steps': 96619, 'loss/train': 1.7698445320129395} 11/07/2021 10:48:57 - INFO - __main__ - Step 96621: {'lr': 0.00014403558514461496, 'samples': 18551232, 'steps': 96620, 'loss/train': 0.9446005821228027} 11/07/2021 10:48:57 - INFO - __main__ - Step 96622: {'lr': 0.0001440307786972243, 'samples': 18551424, 'steps': 96621, 'loss/train': 1.2721858024597168} 11/07/2021 10:48:58 - INFO - __main__ - Step 96623: {'lr': 0.00014402597229758174, 'samples': 18551616, 'steps': 96622, 'loss/train': 1.4324278831481934} 11/07/2021 10:48:59 - INFO - __main__ - Step 96624: {'lr': 0.00014402116594568944, 'samples': 18551808, 'steps': 96623, 'loss/train': 1.221198558807373} 11/07/2021 10:48:59 - INFO - __main__ - Step 96625: {'lr': 0.00014401635964154954, 'samples': 18552000, 'steps': 96624, 'loss/train': 1.7224503755569458} 11/07/2021 10:48:59 - INFO - __main__ - Step 96626: {'lr': 0.00014401155338516426, 'samples': 18552192, 'steps': 96625, 'loss/train': 1.496591567993164} 11/07/2021 10:49:00 - INFO - __main__ - Step 96627: {'lr': 0.00014400674717653572, 'samples': 18552384, 'steps': 96626, 'loss/train': 1.1644856929779053} 11/07/2021 10:49:00 - INFO - __main__ - Step 96628: {'lr': 0.00014400194101566612, 'samples': 18552576, 'steps': 96627, 'loss/train': 0.6551637649536133} 11/07/2021 10:49:01 - INFO - __main__ - Step 96629: {'lr': 0.00014399713490255761, 'samples': 18552768, 'steps': 96628, 'loss/train': 1.2628941535949707} 11/07/2021 10:49:02 - INFO - __main__ - Step 96630: {'lr': 0.00014399232883721236, 'samples': 18552960, 'steps': 96629, 'loss/train': 0.9865018725395203} 11/07/2021 10:49:02 - INFO - __main__ - Step 96631: {'lr': 0.00014398752281963255, 'samples': 18553152, 'steps': 96630, 'loss/train': 0.8951504826545715} 11/07/2021 10:49:02 - INFO - __main__ - Step 96632: {'lr': 0.0001439827168498204, 'samples': 18553344, 'steps': 96631, 'loss/train': 1.4713139533996582} 11/07/2021 10:49:03 - INFO - __main__ - Step 96633: {'lr': 0.0001439779109277779, 'samples': 18553536, 'steps': 96632, 'loss/train': 1.3048957586288452} 11/07/2021 10:49:04 - INFO - __main__ - Step 96634: {'lr': 0.0001439731050535073, 'samples': 18553728, 'steps': 96633, 'loss/train': 1.5986913442611694} 11/07/2021 10:49:04 - INFO - __main__ - Step 96635: {'lr': 0.00014396829922701083, 'samples': 18553920, 'steps': 96634, 'loss/train': 1.3904575109481812} 11/07/2021 10:49:05 - INFO - __main__ - Step 96636: {'lr': 0.00014396349344829057, 'samples': 18554112, 'steps': 96635, 'loss/train': 1.2229586839675903} 11/07/2021 10:49:05 - INFO - __main__ - Step 96637: {'lr': 0.00014395868771734872, 'samples': 18554304, 'steps': 96636, 'loss/train': 1.2951135635375977} 11/07/2021 10:49:05 - INFO - __main__ - Step 96638: {'lr': 0.00014395388203418746, 'samples': 18554496, 'steps': 96637, 'loss/train': 1.8454574346542358} 11/07/2021 10:49:06 - INFO - __main__ - Step 96639: {'lr': 0.00014394907639880895, 'samples': 18554688, 'steps': 96638, 'loss/train': 0.19263145327568054} 11/07/2021 10:49:07 - INFO - __main__ - Step 96640: {'lr': 0.00014394427081121537, 'samples': 18554880, 'steps': 96639, 'loss/train': 1.4576839208602905} 11/07/2021 10:49:07 - INFO - __main__ - Step 96641: {'lr': 0.00014393946527140882, 'samples': 18555072, 'steps': 96640, 'loss/train': 1.269399881362915} 11/07/2021 10:49:07 - INFO - __main__ - Step 96642: {'lr': 0.00014393465977939152, 'samples': 18555264, 'steps': 96641, 'loss/train': 1.6658934354782104} 11/07/2021 10:49:08 - INFO - __main__ - Step 96643: {'lr': 0.00014392985433516565, 'samples': 18555456, 'steps': 96642, 'loss/train': 1.1020851135253906} 11/07/2021 10:49:08 - INFO - __main__ - Step 96644: {'lr': 0.00014392504893873334, 'samples': 18555648, 'steps': 96643, 'loss/train': 1.1772006750106812} 11/07/2021 10:49:09 - INFO - __main__ - Step 96645: {'lr': 0.00014392024359009676, 'samples': 18555840, 'steps': 96644, 'loss/train': 1.6248564720153809} 11/07/2021 10:49:09 - INFO - __main__ - Step 96646: {'lr': 0.00014391543828925818, 'samples': 18556032, 'steps': 96645, 'loss/train': 1.1961292028427124} 11/07/2021 10:49:10 - INFO - __main__ - Step 96647: {'lr': 0.0001439106330362196, 'samples': 18556224, 'steps': 96646, 'loss/train': 1.4722554683685303} 11/07/2021 10:49:10 - INFO - __main__ - Step 96648: {'lr': 0.0001439058278309832, 'samples': 18556416, 'steps': 96647, 'loss/train': 1.4032065868377686} 11/07/2021 10:49:10 - INFO - __main__ - Step 96649: {'lr': 0.00014390102267355123, 'samples': 18556608, 'steps': 96648, 'loss/train': 1.6156861782073975} 11/07/2021 10:49:11 - INFO - __main__ - Step 96650: {'lr': 0.00014389621756392585, 'samples': 18556800, 'steps': 96649, 'loss/train': 1.529538869857788} 11/07/2021 10:49:12 - INFO - __main__ - Step 96651: {'lr': 0.00014389141250210913, 'samples': 18556992, 'steps': 96650, 'loss/train': 1.2411984205245972} 11/07/2021 10:49:12 - INFO - __main__ - Step 96652: {'lr': 0.00014388660748810333, 'samples': 18557184, 'steps': 96651, 'loss/train': 1.5239108800888062} 11/07/2021 10:49:13 - INFO - __main__ - Step 96653: {'lr': 0.0001438818025219106, 'samples': 18557376, 'steps': 96652, 'loss/train': 1.2908239364624023} 11/07/2021 10:49:13 - INFO - __main__ - Step 96654: {'lr': 0.00014387699760353307, 'samples': 18557568, 'steps': 96653, 'loss/train': 1.3153177499771118} 11/07/2021 10:49:14 - INFO - __main__ - Step 96655: {'lr': 0.00014387219273297297, 'samples': 18557760, 'steps': 96654, 'loss/train': 1.6407325267791748} 11/07/2021 10:49:14 - INFO - __main__ - Step 96656: {'lr': 0.0001438673879102324, 'samples': 18557952, 'steps': 96655, 'loss/train': 1.1685446500778198} 11/07/2021 10:49:15 - INFO - __main__ - Step 96657: {'lr': 0.00014386258313531353, 'samples': 18558144, 'steps': 96656, 'loss/train': 1.0668857097625732} 11/07/2021 10:49:15 - INFO - __main__ - Step 96658: {'lr': 0.00014385777840821853, 'samples': 18558336, 'steps': 96657, 'loss/train': 1.4292662143707275} 11/07/2021 10:49:15 - INFO - __main__ - Step 96659: {'lr': 0.00014385297372894972, 'samples': 18558528, 'steps': 96658, 'loss/train': 1.2059227228164673} 11/07/2021 10:49:16 - INFO - __main__ - Step 96660: {'lr': 0.00014384816909750897, 'samples': 18558720, 'steps': 96659, 'loss/train': 1.185364842414856} 11/07/2021 10:49:17 - INFO - __main__ - Step 96661: {'lr': 0.00014384336451389864, 'samples': 18558912, 'steps': 96660, 'loss/train': 0.8865295648574829} 11/07/2021 10:49:17 - INFO - __main__ - Step 96662: {'lr': 0.00014383855997812084, 'samples': 18559104, 'steps': 96661, 'loss/train': 1.0663386583328247} 11/07/2021 10:49:17 - INFO - __main__ - Step 96663: {'lr': 0.00014383375549017774, 'samples': 18559296, 'steps': 96662, 'loss/train': 1.4585031270980835} 11/07/2021 10:49:18 - INFO - __main__ - Step 96664: {'lr': 0.00014382895105007155, 'samples': 18559488, 'steps': 96663, 'loss/train': 1.3593597412109375} 11/07/2021 10:49:19 - INFO - __main__ - Step 96665: {'lr': 0.00014382414665780436, 'samples': 18559680, 'steps': 96664, 'loss/train': 1.5287026166915894} 11/07/2021 10:49:19 - INFO - __main__ - Step 96666: {'lr': 0.00014381934231337835, 'samples': 18559872, 'steps': 96665, 'loss/train': 1.1729612350463867} 11/07/2021 10:49:20 - INFO - __main__ - Step 96667: {'lr': 0.00014381453801679572, 'samples': 18560064, 'steps': 96666, 'loss/train': 1.63247811794281} 11/07/2021 10:49:20 - INFO - __main__ - Step 96668: {'lr': 0.00014380973376805866, 'samples': 18560256, 'steps': 96667, 'loss/train': 1.7809252738952637} 11/07/2021 10:49:20 - INFO - __main__ - Step 96669: {'lr': 0.00014380492956716926, 'samples': 18560448, 'steps': 96668, 'loss/train': 1.5757273435592651} 11/07/2021 10:49:21 - INFO - __main__ - Step 96670: {'lr': 0.00014380012541412974, 'samples': 18560640, 'steps': 96669, 'loss/train': 1.635115385055542} 11/07/2021 10:49:22 - INFO - __main__ - Step 96671: {'lr': 0.00014379532130894224, 'samples': 18560832, 'steps': 96670, 'loss/train': 1.143057942390442} 11/07/2021 10:49:22 - INFO - __main__ - Step 96672: {'lr': 0.0001437905172516089, 'samples': 18561024, 'steps': 96671, 'loss/train': 1.3601032495498657} 11/07/2021 10:49:22 - INFO - __main__ - Step 96673: {'lr': 0.00014378571324213203, 'samples': 18561216, 'steps': 96672, 'loss/train': 1.5419459342956543} 11/07/2021 10:49:23 - INFO - __main__ - Step 96674: {'lr': 0.0001437809092805136, 'samples': 18561408, 'steps': 96673, 'loss/train': 1.4085042476654053} 11/07/2021 10:49:23 - INFO - __main__ - Step 96675: {'lr': 0.00014377610536675585, 'samples': 18561600, 'steps': 96674, 'loss/train': 1.1174367666244507} 11/07/2021 10:49:24 - INFO - __main__ - Step 96676: {'lr': 0.00014377130150086093, 'samples': 18561792, 'steps': 96675, 'loss/train': 1.2206729650497437} 11/07/2021 10:49:24 - INFO - __main__ - Step 96677: {'lr': 0.00014376649768283101, 'samples': 18561984, 'steps': 96676, 'loss/train': 0.48307013511657715} 11/07/2021 10:49:25 - INFO - __main__ - Step 96678: {'lr': 0.0001437616939126683, 'samples': 18562176, 'steps': 96677, 'loss/train': 0.5794355273246765} 11/07/2021 10:49:25 - INFO - __main__ - Step 96679: {'lr': 0.0001437568901903749, 'samples': 18562368, 'steps': 96678, 'loss/train': 1.2109427452087402} 11/07/2021 10:49:25 - INFO - __main__ - Step 96680: {'lr': 0.00014375208651595304, 'samples': 18562560, 'steps': 96679, 'loss/train': 1.3236925601959229} 11/07/2021 10:49:26 - INFO - __main__ - Step 96681: {'lr': 0.0001437472828894048, 'samples': 18562752, 'steps': 96680, 'loss/train': 1.4680933952331543} 11/07/2021 10:49:27 - INFO - __main__ - Step 96682: {'lr': 0.00014374247931073244, 'samples': 18562944, 'steps': 96681, 'loss/train': 1.8492215871810913} 11/07/2021 10:49:27 - INFO - __main__ - Step 96683: {'lr': 0.00014373767577993807, 'samples': 18563136, 'steps': 96682, 'loss/train': 1.31661057472229} 11/07/2021 10:49:28 - INFO - __main__ - Step 96684: {'lr': 0.00014373287229702388, 'samples': 18563328, 'steps': 96683, 'loss/train': 1.3185498714447021} 11/07/2021 10:49:28 - INFO - __main__ - Step 96685: {'lr': 0.00014372806886199196, 'samples': 18563520, 'steps': 96684, 'loss/train': 1.4503873586654663} 11/07/2021 10:49:29 - INFO - __main__ - Step 96686: {'lr': 0.00014372326547484472, 'samples': 18563712, 'steps': 96685, 'loss/train': 1.3370939493179321} 11/07/2021 10:49:29 - INFO - __main__ - Step 96687: {'lr': 0.00014371846213558396, 'samples': 18563904, 'steps': 96686, 'loss/train': 1.4264849424362183} 11/07/2021 10:49:30 - INFO - __main__ - Step 96688: {'lr': 0.00014371365884421205, 'samples': 18564096, 'steps': 96687, 'loss/train': 1.3936076164245605} 11/07/2021 10:49:30 - INFO - __main__ - Step 96689: {'lr': 0.0001437088556007311, 'samples': 18564288, 'steps': 96688, 'loss/train': 1.2654392719268799} 11/07/2021 10:49:30 - INFO - __main__ - Step 96690: {'lr': 0.00014370405240514333, 'samples': 18564480, 'steps': 96689, 'loss/train': 1.724951148033142} 11/07/2021 10:49:31 - INFO - __main__ - Step 96691: {'lr': 0.00014369924925745087, 'samples': 18564672, 'steps': 96690, 'loss/train': 1.2280241250991821} 11/07/2021 10:49:32 - INFO - __main__ - Step 96692: {'lr': 0.0001436944461576559, 'samples': 18564864, 'steps': 96691, 'loss/train': 1.1721243858337402} 11/07/2021 10:49:32 - INFO - __main__ - Step 96693: {'lr': 0.00014368964310576055, 'samples': 18565056, 'steps': 96692, 'loss/train': 1.16357421875} 11/07/2021 10:49:32 - INFO - __main__ - Step 96694: {'lr': 0.00014368484010176703, 'samples': 18565248, 'steps': 96693, 'loss/train': 1.8345168828964233} 11/07/2021 10:49:33 - INFO - __main__ - Step 96695: {'lr': 0.00014368003714567746, 'samples': 18565440, 'steps': 96694, 'loss/train': 1.8518425226211548} 11/07/2021 10:49:34 - INFO - __main__ - Step 96696: {'lr': 0.00014367523423749402, 'samples': 18565632, 'steps': 96695, 'loss/train': 1.3966052532196045} 11/07/2021 10:49:34 - INFO - __main__ - Step 96697: {'lr': 0.00014367043137721887, 'samples': 18565824, 'steps': 96696, 'loss/train': 1.0974897146224976} 11/07/2021 10:49:34 - INFO - __main__ - Step 96698: {'lr': 0.0001436656285648542, 'samples': 18566016, 'steps': 96697, 'loss/train': 1.3950695991516113} 11/07/2021 10:49:35 - INFO - __main__ - Step 96699: {'lr': 0.00014366082580040214, 'samples': 18566208, 'steps': 96698, 'loss/train': 1.304900884628296} 11/07/2021 10:49:35 - INFO - __main__ - Step 96700: {'lr': 0.000143656023083865, 'samples': 18566400, 'steps': 96699, 'loss/train': 1.315292477607727} 11/07/2021 10:49:35 - INFO - __main__ - Step 96701: {'lr': 0.0001436512204152447, 'samples': 18566592, 'steps': 96700, 'loss/train': 1.2848620414733887} 11/07/2021 10:49:36 - INFO - __main__ - Step 96702: {'lr': 0.00014364641779454352, 'samples': 18566784, 'steps': 96701, 'loss/train': 1.4115365743637085} 11/07/2021 10:49:37 - INFO - __main__ - Step 96703: {'lr': 0.00014364161522176363, 'samples': 18566976, 'steps': 96702, 'loss/train': 1.4892115592956543} 11/07/2021 10:49:37 - INFO - __main__ - Step 96704: {'lr': 0.0001436368126969072, 'samples': 18567168, 'steps': 96703, 'loss/train': 1.551031470298767} 11/07/2021 10:49:37 - INFO - __main__ - Step 96705: {'lr': 0.00014363201021997635, 'samples': 18567360, 'steps': 96704, 'loss/train': 1.1971346139907837} 11/07/2021 10:49:38 - INFO - __main__ - Step 96706: {'lr': 0.00014362720779097327, 'samples': 18567552, 'steps': 96705, 'loss/train': 0.7371227741241455} 11/07/2021 10:49:39 - INFO - __main__ - Step 96707: {'lr': 0.0001436224054099002, 'samples': 18567744, 'steps': 96706, 'loss/train': 1.6593608856201172} 11/07/2021 10:49:39 - INFO - __main__ - Step 96708: {'lr': 0.00014361760307675915, 'samples': 18567936, 'steps': 96707, 'loss/train': 1.551672101020813} 11/07/2021 10:49:40 - INFO - __main__ - Step 96709: {'lr': 0.00014361280079155237, 'samples': 18568128, 'steps': 96708, 'loss/train': 1.4056552648544312} 11/07/2021 10:49:40 - INFO - __main__ - Step 96710: {'lr': 0.00014360799855428206, 'samples': 18568320, 'steps': 96709, 'loss/train': 0.9020358920097351} 11/07/2021 10:49:40 - INFO - __main__ - Step 96711: {'lr': 0.00014360319636495033, 'samples': 18568512, 'steps': 96710, 'loss/train': 1.7546859979629517} 11/07/2021 10:49:41 - INFO - __main__ - Step 96712: {'lr': 0.00014359839422355936, 'samples': 18568704, 'steps': 96711, 'loss/train': 1.4441646337509155} 11/07/2021 10:49:42 - INFO - __main__ - Step 96713: {'lr': 0.00014359359213011143, 'samples': 18568896, 'steps': 96712, 'loss/train': 1.3249338865280151} 11/07/2021 10:49:42 - INFO - __main__ - Step 96714: {'lr': 0.00014358879008460846, 'samples': 18569088, 'steps': 96713, 'loss/train': 1.1671093702316284} 11/07/2021 10:49:42 - INFO - __main__ - Step 96715: {'lr': 0.0001435839880870527, 'samples': 18569280, 'steps': 96714, 'loss/train': 1.023710012435913} 11/07/2021 10:49:43 - INFO - __main__ - Step 96716: {'lr': 0.00014357918613744643, 'samples': 18569472, 'steps': 96715, 'loss/train': 1.483246922492981} 11/07/2021 10:49:44 - INFO - __main__ - Step 96717: {'lr': 0.0001435743842357917, 'samples': 18569664, 'steps': 96716, 'loss/train': 0.07272008806467056} 11/07/2021 10:49:44 - INFO - __main__ - Step 96718: {'lr': 0.0001435695823820907, 'samples': 18569856, 'steps': 96717, 'loss/train': 1.5996429920196533} 11/07/2021 10:49:45 - INFO - __main__ - Step 96719: {'lr': 0.0001435647805763456, 'samples': 18570048, 'steps': 96718, 'loss/train': 1.484320878982544} 11/07/2021 10:49:45 - INFO - __main__ - Step 96720: {'lr': 0.0001435599788185586, 'samples': 18570240, 'steps': 96719, 'loss/train': 1.4497565031051636} 11/07/2021 10:49:45 - INFO - __main__ - Step 96721: {'lr': 0.00014355517710873183, 'samples': 18570432, 'steps': 96720, 'loss/train': 1.6415495872497559} 11/07/2021 10:49:46 - INFO - __main__ - Step 96722: {'lr': 0.00014355037544686744, 'samples': 18570624, 'steps': 96721, 'loss/train': 1.2875169515609741} 11/07/2021 10:49:47 - INFO - __main__ - Step 96723: {'lr': 0.0001435455738329676, 'samples': 18570816, 'steps': 96722, 'loss/train': 1.1585949659347534} 11/07/2021 10:49:47 - INFO - __main__ - Step 96724: {'lr': 0.0001435407722670345, 'samples': 18571008, 'steps': 96723, 'loss/train': 1.2635077238082886} 11/07/2021 10:49:47 - INFO - __main__ - Step 96725: {'lr': 0.00014353597074907027, 'samples': 18571200, 'steps': 96724, 'loss/train': 1.0437324047088623} 11/07/2021 10:49:48 - INFO - __main__ - Step 96726: {'lr': 0.00014353116927907708, 'samples': 18571392, 'steps': 96725, 'loss/train': 1.836078405380249} 11/07/2021 10:49:49 - INFO - __main__ - Step 96727: {'lr': 0.00014352636785705723, 'samples': 18571584, 'steps': 96726, 'loss/train': 1.3907339572906494} 11/07/2021 10:49:49 - INFO - __main__ - Step 96728: {'lr': 0.00014352156648301262, 'samples': 18571776, 'steps': 96727, 'loss/train': 0.7376115918159485} 11/07/2021 10:49:50 - INFO - __main__ - Step 96729: {'lr': 0.0001435167651569456, 'samples': 18571968, 'steps': 96728, 'loss/train': 1.4601596593856812} 11/07/2021 10:49:50 - INFO - __main__ - Step 96730: {'lr': 0.00014351196387885824, 'samples': 18572160, 'steps': 96729, 'loss/train': 1.74794602394104} 11/07/2021 10:49:50 - INFO - __main__ - Step 96731: {'lr': 0.00014350716264875275, 'samples': 18572352, 'steps': 96730, 'loss/train': 0.9516466856002808} 11/07/2021 10:49:51 - INFO - __main__ - Step 96732: {'lr': 0.0001435023614666313, 'samples': 18572544, 'steps': 96731, 'loss/train': 1.4421683549880981} 11/07/2021 10:49:51 - INFO - __main__ - Step 96733: {'lr': 0.00014349756033249606, 'samples': 18572736, 'steps': 96732, 'loss/train': 1.3837186098098755} 11/07/2021 10:49:52 - INFO - __main__ - Step 96734: {'lr': 0.00014349275924634914, 'samples': 18572928, 'steps': 96733, 'loss/train': 1.4583169221878052} 11/07/2021 10:49:53 - INFO - __main__ - Step 96735: {'lr': 0.00014348795820819278, 'samples': 18573120, 'steps': 96734, 'loss/train': 1.2589871883392334} 11/07/2021 10:49:53 - INFO - __main__ - Step 96736: {'lr': 0.00014348315721802906, 'samples': 18573312, 'steps': 96735, 'loss/train': 1.5310050249099731} 11/07/2021 10:49:53 - INFO - __main__ - Step 96737: {'lr': 0.0001434783562758602, 'samples': 18573504, 'steps': 96736, 'loss/train': 1.1243287324905396} 11/07/2021 10:49:54 - INFO - __main__ - Step 96738: {'lr': 0.00014347355538168837, 'samples': 18573696, 'steps': 96737, 'loss/train': 1.3236174583435059} 11/07/2021 10:49:55 - INFO - __main__ - Step 96739: {'lr': 0.00014346875453551567, 'samples': 18573888, 'steps': 96738, 'loss/train': 1.5234570503234863} 11/07/2021 10:49:55 - INFO - __main__ - Step 96740: {'lr': 0.00014346395373734445, 'samples': 18574080, 'steps': 96739, 'loss/train': 1.5059890747070312} 11/07/2021 10:49:55 - INFO - __main__ - Step 96741: {'lr': 0.0001434591529871766, 'samples': 18574272, 'steps': 96740, 'loss/train': 1.3425875902175903} 11/07/2021 10:49:56 - INFO - __main__ - Step 96742: {'lr': 0.00014345435228501442, 'samples': 18574464, 'steps': 96741, 'loss/train': 1.2676676511764526} 11/07/2021 10:49:56 - INFO - __main__ - Step 96743: {'lr': 0.00014344955163086008, 'samples': 18574656, 'steps': 96742, 'loss/train': 1.549575924873352} 11/07/2021 10:49:57 - INFO - __main__ - Step 96744: {'lr': 0.0001434447510247157, 'samples': 18574848, 'steps': 96743, 'loss/train': 0.6670247912406921} 11/07/2021 10:49:57 - INFO - __main__ - Step 96745: {'lr': 0.00014343995046658348, 'samples': 18575040, 'steps': 96744, 'loss/train': 1.217962622642517} 11/07/2021 10:49:58 - INFO - __main__ - Step 96746: {'lr': 0.0001434351499564656, 'samples': 18575232, 'steps': 96745, 'loss/train': 0.9671699404716492} 11/07/2021 10:49:58 - INFO - __main__ - Step 96747: {'lr': 0.00014343034949436417, 'samples': 18575424, 'steps': 96746, 'loss/train': 1.339741587638855} 11/07/2021 10:49:58 - INFO - __main__ - Step 96748: {'lr': 0.00014342554908028138, 'samples': 18575616, 'steps': 96747, 'loss/train': 1.3415828943252563} 11/07/2021 10:50:00 - INFO - __main__ - Step 96749: {'lr': 0.0001434207487142194, 'samples': 18575808, 'steps': 96748, 'loss/train': 0.9309265613555908} 11/07/2021 10:50:00 - INFO - __main__ - Step 96750: {'lr': 0.0001434159483961804, 'samples': 18576000, 'steps': 96749, 'loss/train': 1.4841759204864502} 11/07/2021 10:50:00 - INFO - __main__ - Step 96751: {'lr': 0.0001434111481261665, 'samples': 18576192, 'steps': 96750, 'loss/train': 1.3275142908096313} 11/07/2021 10:50:01 - INFO - __main__ - Step 96752: {'lr': 0.0001434063479041799, 'samples': 18576384, 'steps': 96751, 'loss/train': 1.0633825063705444} 11/07/2021 10:50:01 - INFO - __main__ - Step 96753: {'lr': 0.00014340154773022284, 'samples': 18576576, 'steps': 96752, 'loss/train': 1.3324304819107056} 11/07/2021 10:50:02 - INFO - __main__ - Step 96754: {'lr': 0.00014339674760429732, 'samples': 18576768, 'steps': 96753, 'loss/train': 1.7280439138412476} 11/07/2021 10:50:02 - INFO - __main__ - Step 96755: {'lr': 0.0001433919475264056, 'samples': 18576960, 'steps': 96754, 'loss/train': 0.8314580917358398} 11/07/2021 10:50:03 - INFO - __main__ - Step 96756: {'lr': 0.0001433871474965498, 'samples': 18577152, 'steps': 96755, 'loss/train': 1.7353380918502808} 11/07/2021 10:50:03 - INFO - __main__ - Step 96757: {'lr': 0.0001433823475147321, 'samples': 18577344, 'steps': 96756, 'loss/train': 1.9466301202774048} 11/07/2021 10:50:03 - INFO - __main__ - Step 96758: {'lr': 0.00014337754758095468, 'samples': 18577536, 'steps': 96757, 'loss/train': 1.667772889137268} 11/07/2021 10:50:05 - INFO - __main__ - Step 96759: {'lr': 0.00014337274769521969, 'samples': 18577728, 'steps': 96758, 'loss/train': 1.0347962379455566} 11/07/2021 10:50:05 - INFO - __main__ - Step 96760: {'lr': 0.0001433679478575293, 'samples': 18577920, 'steps': 96759, 'loss/train': 0.6775725483894348} 11/07/2021 10:50:05 - INFO - __main__ - Step 96761: {'lr': 0.00014336314806788565, 'samples': 18578112, 'steps': 96760, 'loss/train': 1.5255271196365356} 11/07/2021 10:50:06 - INFO - __main__ - Step 96762: {'lr': 0.0001433583483262909, 'samples': 18578304, 'steps': 96761, 'loss/train': 1.379685640335083} 11/07/2021 10:50:06 - INFO - __main__ - Step 96763: {'lr': 0.00014335354863274729, 'samples': 18578496, 'steps': 96762, 'loss/train': 1.1695640087127686} 11/07/2021 10:50:07 - INFO - __main__ - Step 96764: {'lr': 0.000143348748987257, 'samples': 18578688, 'steps': 96763, 'loss/train': 1.3894469738006592} 11/07/2021 10:50:07 - INFO - __main__ - Step 96765: {'lr': 0.00014334394938982203, 'samples': 18578880, 'steps': 96764, 'loss/train': 0.937260627746582} 11/07/2021 10:50:08 - INFO - __main__ - Step 96766: {'lr': 0.0001433391498404446, 'samples': 18579072, 'steps': 96765, 'loss/train': 1.0789687633514404} 11/07/2021 10:50:08 - INFO - __main__ - Step 96767: {'lr': 0.0001433343503391269, 'samples': 18579264, 'steps': 96766, 'loss/train': 0.9343979954719543} 11/07/2021 10:50:08 - INFO - __main__ - Step 96768: {'lr': 0.00014332955088587114, 'samples': 18579456, 'steps': 96767, 'loss/train': 1.4975244998931885} 11/07/2021 10:50:09 - INFO - __main__ - Step 96769: {'lr': 0.00014332475148067943, 'samples': 18579648, 'steps': 96768, 'loss/train': 1.0462629795074463} 11/07/2021 10:50:10 - INFO - __main__ - Step 96770: {'lr': 0.00014331995212355392, 'samples': 18579840, 'steps': 96769, 'loss/train': 1.055599331855774} 11/07/2021 10:50:10 - INFO - __main__ - Step 96771: {'lr': 0.00014331515281449682, 'samples': 18580032, 'steps': 96770, 'loss/train': 1.3952010869979858} 11/07/2021 10:50:10 - INFO - __main__ - Step 96772: {'lr': 0.0001433103535535102, 'samples': 18580224, 'steps': 96771, 'loss/train': 1.319065809249878} 11/07/2021 10:50:11 - INFO - __main__ - Step 96773: {'lr': 0.00014330555434059633, 'samples': 18580416, 'steps': 96772, 'loss/train': 1.6926589012145996} 11/07/2021 10:50:11 - INFO - __main__ - Step 96774: {'lr': 0.00014330075517575736, 'samples': 18580608, 'steps': 96773, 'loss/train': 0.9066418409347534} 11/07/2021 10:50:12 - INFO - __main__ - Step 96775: {'lr': 0.00014329595605899547, 'samples': 18580800, 'steps': 96774, 'loss/train': 1.2762404680252075} 11/07/2021 10:50:12 - INFO - __main__ - Step 96776: {'lr': 0.00014329115699031274, 'samples': 18580992, 'steps': 96775, 'loss/train': 1.2743713855743408} 11/07/2021 10:50:13 - INFO - __main__ - Step 96777: {'lr': 0.0001432863579697113, 'samples': 18581184, 'steps': 96776, 'loss/train': 0.8654906153678894} 11/07/2021 10:50:13 - INFO - __main__ - Step 96778: {'lr': 0.0001432815589971934, 'samples': 18581376, 'steps': 96777, 'loss/train': 1.350236177444458} 11/07/2021 10:50:13 - INFO - __main__ - Step 96779: {'lr': 0.00014327676007276123, 'samples': 18581568, 'steps': 96778, 'loss/train': 1.2088829278945923} 11/07/2021 10:50:15 - INFO - __main__ - Step 96780: {'lr': 0.00014327196119641686, 'samples': 18581760, 'steps': 96779, 'loss/train': 0.9609159231185913} 11/07/2021 10:50:15 - INFO - __main__ - Step 96781: {'lr': 0.00014326716236816252, 'samples': 18581952, 'steps': 96780, 'loss/train': 1.240177869796753} 11/07/2021 10:50:15 - INFO - __main__ - Step 96782: {'lr': 0.00014326236358800032, 'samples': 18582144, 'steps': 96781, 'loss/train': 1.234963059425354} 11/07/2021 10:50:16 - INFO - __main__ - Step 96783: {'lr': 0.00014325756485593247, 'samples': 18582336, 'steps': 96782, 'loss/train': 1.310650110244751} 11/07/2021 10:50:16 - INFO - __main__ - Step 96784: {'lr': 0.0001432527661719611, 'samples': 18582528, 'steps': 96783, 'loss/train': 0.9417352676391602} 11/07/2021 10:50:17 - INFO - __main__ - Step 96785: {'lr': 0.0001432479675360884, 'samples': 18582720, 'steps': 96784, 'loss/train': 1.4149078130722046} 11/07/2021 10:50:18 - INFO - __main__ - Step 96786: {'lr': 0.00014324316894831664, 'samples': 18582912, 'steps': 96785, 'loss/train': 1.0433359146118164} 11/07/2021 10:50:18 - INFO - __main__ - Step 96787: {'lr': 0.00014323837040864772, 'samples': 18583104, 'steps': 96786, 'loss/train': 1.042996883392334} 11/07/2021 10:50:18 - INFO - __main__ - Step 96788: {'lr': 0.00014323357191708397, 'samples': 18583296, 'steps': 96787, 'loss/train': 2.6076812744140625} 11/07/2021 10:50:19 - INFO - __main__ - Step 96789: {'lr': 0.0001432287734736275, 'samples': 18583488, 'steps': 96788, 'loss/train': 1.0182981491088867} 11/07/2021 10:50:19 - INFO - __main__ - Step 96790: {'lr': 0.0001432239750782805, 'samples': 18583680, 'steps': 96789, 'loss/train': 1.5631024837493896} 11/07/2021 10:50:20 - INFO - __main__ - Step 96791: {'lr': 0.00014321917673104518, 'samples': 18583872, 'steps': 96790, 'loss/train': 1.2029321193695068} 11/07/2021 10:50:20 - INFO - __main__ - Step 96792: {'lr': 0.0001432143784319236, 'samples': 18584064, 'steps': 96791, 'loss/train': 1.726266860961914} 11/07/2021 10:50:21 - INFO - __main__ - Step 96793: {'lr': 0.00014320958018091797, 'samples': 18584256, 'steps': 96792, 'loss/train': 1.358493447303772} 11/07/2021 10:50:21 - INFO - __main__ - Step 96794: {'lr': 0.0001432047819780305, 'samples': 18584448, 'steps': 96793, 'loss/train': 0.9969381093978882} 11/07/2021 10:50:21 - INFO - __main__ - Step 96795: {'lr': 0.00014319998382326328, 'samples': 18584640, 'steps': 96794, 'loss/train': 1.0360180139541626} 11/07/2021 10:50:23 - INFO - __main__ - Step 96796: {'lr': 0.0001431951857166185, 'samples': 18584832, 'steps': 96795, 'loss/train': 1.0526949167251587} 11/07/2021 10:50:23 - INFO - __main__ - Step 96797: {'lr': 0.00014319038765809837, 'samples': 18585024, 'steps': 96796, 'loss/train': 1.407855749130249} 11/07/2021 10:50:23 - INFO - __main__ - Step 96798: {'lr': 0.00014318558964770498, 'samples': 18585216, 'steps': 96797, 'loss/train': 1.3971238136291504} 11/07/2021 10:50:24 - INFO - __main__ - Step 96799: {'lr': 0.00014318079168544048, 'samples': 18585408, 'steps': 96798, 'loss/train': 1.9295473098754883} 11/07/2021 10:50:24 - INFO - __main__ - Step 96800: {'lr': 0.00014317599377130708, 'samples': 18585600, 'steps': 96799, 'loss/train': 1.6410863399505615} 11/07/2021 10:50:24 - INFO - __main__ - Step 96801: {'lr': 0.0001431711959053069, 'samples': 18585792, 'steps': 96800, 'loss/train': 1.645283818244934} 11/07/2021 10:50:25 - INFO - __main__ - Step 96802: {'lr': 0.0001431663980874422, 'samples': 18585984, 'steps': 96801, 'loss/train': 1.9307122230529785} 11/07/2021 10:50:26 - INFO - __main__ - Step 96803: {'lr': 0.00014316160031771502, 'samples': 18586176, 'steps': 96802, 'loss/train': 2.1640779972076416} 11/07/2021 10:50:26 - INFO - __main__ - Step 96804: {'lr': 0.00014315680259612758, 'samples': 18586368, 'steps': 96803, 'loss/train': 1.513556718826294} 11/07/2021 10:50:26 - INFO - __main__ - Step 96805: {'lr': 0.00014315200492268201, 'samples': 18586560, 'steps': 96804, 'loss/train': 1.5838309526443481} 11/07/2021 10:50:27 - INFO - __main__ - Step 96806: {'lr': 0.00014314720729738053, 'samples': 18586752, 'steps': 96805, 'loss/train': 1.8150309324264526} 11/07/2021 10:50:28 - INFO - __main__ - Step 96807: {'lr': 0.00014314240972022527, 'samples': 18586944, 'steps': 96806, 'loss/train': 1.508135437965393} 11/07/2021 10:50:28 - INFO - __main__ - Step 96808: {'lr': 0.00014313761219121848, 'samples': 18587136, 'steps': 96807, 'loss/train': 1.5114126205444336} 11/07/2021 10:50:29 - INFO - __main__ - Step 96809: {'lr': 0.00014313281471036216, 'samples': 18587328, 'steps': 96808, 'loss/train': 1.3057975769042969} 11/07/2021 10:50:29 - INFO - __main__ - Step 96810: {'lr': 0.00014312801727765851, 'samples': 18587520, 'steps': 96809, 'loss/train': 2.338048219680786} 11/07/2021 10:50:29 - INFO - __main__ - Step 96811: {'lr': 0.00014312321989310973, 'samples': 18587712, 'steps': 96810, 'loss/train': 1.8520188331604004} 11/07/2021 10:50:30 - INFO - __main__ - Step 96812: {'lr': 0.00014311842255671796, 'samples': 18587904, 'steps': 96811, 'loss/train': 1.6994658708572388} 11/07/2021 10:50:31 - INFO - __main__ - Step 96813: {'lr': 0.00014311362526848542, 'samples': 18588096, 'steps': 96812, 'loss/train': 1.345668911933899} 11/07/2021 10:50:31 - INFO - __main__ - Step 96814: {'lr': 0.00014310882802841425, 'samples': 18588288, 'steps': 96813, 'loss/train': 1.4109930992126465} 11/07/2021 10:50:31 - INFO - __main__ - Step 96815: {'lr': 0.00014310403083650654, 'samples': 18588480, 'steps': 96814, 'loss/train': 1.3580695390701294} 11/07/2021 10:50:32 - INFO - __main__ - Step 96816: {'lr': 0.00014309923369276454, 'samples': 18588672, 'steps': 96815, 'loss/train': 1.3108824491500854} 11/07/2021 10:50:32 - INFO - __main__ - Step 96817: {'lr': 0.00014309443659719034, 'samples': 18588864, 'steps': 96816, 'loss/train': 1.23439359664917} 11/07/2021 10:50:33 - INFO - __main__ - Step 96818: {'lr': 0.00014308963954978615, 'samples': 18589056, 'steps': 96817, 'loss/train': 1.1587783098220825} 11/07/2021 10:50:33 - INFO - __main__ - Step 96819: {'lr': 0.00014308484255055415, 'samples': 18589248, 'steps': 96818, 'loss/train': 0.8393577933311462} 11/07/2021 10:50:34 - INFO - __main__ - Step 96820: {'lr': 0.00014308004559949645, 'samples': 18589440, 'steps': 96819, 'loss/train': 0.9974262118339539} 11/07/2021 10:50:34 - INFO - __main__ - Step 96821: {'lr': 0.00014307524869661533, 'samples': 18589632, 'steps': 96820, 'loss/train': 1.4300284385681152} 11/07/2021 10:50:34 - INFO - __main__ - Step 96822: {'lr': 0.00014307045184191276, 'samples': 18589824, 'steps': 96821, 'loss/train': 0.8930279612541199} 11/07/2021 10:50:35 - INFO - __main__ - Step 96823: {'lr': 0.00014306565503539097, 'samples': 18590016, 'steps': 96822, 'loss/train': 1.3819799423217773} 11/07/2021 10:50:36 - INFO - __main__ - Step 96824: {'lr': 0.0001430608582770522, 'samples': 18590208, 'steps': 96823, 'loss/train': 1.1275321245193481} 11/07/2021 10:50:36 - INFO - __main__ - Step 96825: {'lr': 0.0001430560615668985, 'samples': 18590400, 'steps': 96824, 'loss/train': 1.3509072065353394} 11/07/2021 10:50:37 - INFO - __main__ - Step 96826: {'lr': 0.00014305126490493208, 'samples': 18590592, 'steps': 96825, 'loss/train': 1.518880844116211} 11/07/2021 10:50:37 - INFO - __main__ - Step 96827: {'lr': 0.00014304646829115515, 'samples': 18590784, 'steps': 96826, 'loss/train': 1.4431906938552856} 11/07/2021 10:50:38 - INFO - __main__ - Step 96828: {'lr': 0.0001430416717255698, 'samples': 18590976, 'steps': 96827, 'loss/train': 1.4569504261016846} 11/07/2021 10:50:38 - INFO - __main__ - Step 96829: {'lr': 0.00014303687520817826, 'samples': 18591168, 'steps': 96828, 'loss/train': 0.7373239994049072} 11/07/2021 10:50:39 - INFO - __main__ - Step 96830: {'lr': 0.00014303207873898261, 'samples': 18591360, 'steps': 96829, 'loss/train': 1.2411147356033325} 11/07/2021 10:50:39 - INFO - __main__ - Step 96831: {'lr': 0.0001430272823179851, 'samples': 18591552, 'steps': 96830, 'loss/train': 1.0890586376190186} 11/07/2021 10:50:39 - INFO - __main__ - Step 96832: {'lr': 0.0001430224859451878, 'samples': 18591744, 'steps': 96831, 'loss/train': 1.2816243171691895} 11/07/2021 10:50:40 - INFO - __main__ - Step 96833: {'lr': 0.00014301768962059295, 'samples': 18591936, 'steps': 96832, 'loss/train': 1.4572458267211914} 11/07/2021 10:50:41 - INFO - __main__ - Step 96834: {'lr': 0.00014301289334420276, 'samples': 18592128, 'steps': 96833, 'loss/train': 1.316428303718567} 11/07/2021 10:50:41 - INFO - __main__ - Step 96835: {'lr': 0.00014300809711601922, 'samples': 18592320, 'steps': 96834, 'loss/train': 1.3327741622924805} 11/07/2021 10:50:41 - INFO - __main__ - Step 96836: {'lr': 0.00014300330093604458, 'samples': 18592512, 'steps': 96835, 'loss/train': 0.07695842534303665} 11/07/2021 10:50:42 - INFO - __main__ - Step 96837: {'lr': 0.000142998504804281, 'samples': 18592704, 'steps': 96836, 'loss/train': 1.3876534700393677} 11/07/2021 10:50:43 - INFO - __main__ - Step 96838: {'lr': 0.00014299370872073065, 'samples': 18592896, 'steps': 96837, 'loss/train': 1.183514952659607} 11/07/2021 10:50:43 - INFO - __main__ - Step 96839: {'lr': 0.00014298891268539566, 'samples': 18593088, 'steps': 96838, 'loss/train': 1.9815294742584229} 11/07/2021 10:50:43 - INFO - __main__ - Step 96840: {'lr': 0.00014298411669827826, 'samples': 18593280, 'steps': 96839, 'loss/train': 1.558397889137268} 11/07/2021 10:50:44 - INFO - __main__ - Step 96841: {'lr': 0.00014297932075938054, 'samples': 18593472, 'steps': 96840, 'loss/train': 1.3115341663360596} 11/07/2021 10:50:44 - INFO - __main__ - Step 96842: {'lr': 0.00014297452486870465, 'samples': 18593664, 'steps': 96841, 'loss/train': 1.3169987201690674} 11/07/2021 10:50:45 - INFO - __main__ - Step 96843: {'lr': 0.00014296972902625284, 'samples': 18593856, 'steps': 96842, 'loss/train': 1.1100066900253296} 11/07/2021 10:50:46 - INFO - __main__ - Step 96844: {'lr': 0.0001429649332320272, 'samples': 18594048, 'steps': 96843, 'loss/train': 1.4304863214492798} 11/07/2021 10:50:46 - INFO - __main__ - Step 96845: {'lr': 0.0001429601374860299, 'samples': 18594240, 'steps': 96844, 'loss/train': 0.9437782168388367} 11/07/2021 10:50:46 - INFO - __main__ - Step 96846: {'lr': 0.00014295534178826314, 'samples': 18594432, 'steps': 96845, 'loss/train': 1.206633448600769} 11/07/2021 10:50:47 - INFO - __main__ - Step 96847: {'lr': 0.00014295054613872903, 'samples': 18594624, 'steps': 96846, 'loss/train': 1.2319146394729614} 11/07/2021 10:50:48 - INFO - __main__ - Step 96848: {'lr': 0.00014294575053742985, 'samples': 18594816, 'steps': 96847, 'loss/train': 1.2004276514053345} 11/07/2021 10:50:48 - INFO - __main__ - Step 96849: {'lr': 0.00014294095498436756, 'samples': 18595008, 'steps': 96848, 'loss/train': 1.3557624816894531} 11/07/2021 10:50:48 - INFO - __main__ - Step 96850: {'lr': 0.00014293615947954443, 'samples': 18595200, 'steps': 96849, 'loss/train': 1.6099940538406372} 11/07/2021 10:50:49 - INFO - __main__ - Step 96851: {'lr': 0.0001429313640229626, 'samples': 18595392, 'steps': 96850, 'loss/train': 1.4682512283325195} 11/07/2021 10:50:49 - INFO - __main__ - Step 96852: {'lr': 0.00014292656861462428, 'samples': 18595584, 'steps': 96851, 'loss/train': 1.5187643766403198} 11/07/2021 10:50:50 - INFO - __main__ - Step 96853: {'lr': 0.00014292177325453157, 'samples': 18595776, 'steps': 96852, 'loss/train': 1.2702962160110474} 11/07/2021 10:50:50 - INFO - __main__ - Step 96854: {'lr': 0.00014291697794268667, 'samples': 18595968, 'steps': 96853, 'loss/train': 0.9196322560310364} 11/07/2021 10:50:51 - INFO - __main__ - Step 96855: {'lr': 0.0001429121826790917, 'samples': 18596160, 'steps': 96854, 'loss/train': 1.1187494993209839} 11/07/2021 10:50:51 - INFO - __main__ - Step 96856: {'lr': 0.00014290738746374886, 'samples': 18596352, 'steps': 96855, 'loss/train': 0.4792807400226593} 11/07/2021 10:50:52 - INFO - __main__ - Step 96857: {'lr': 0.0001429025922966603, 'samples': 18596544, 'steps': 96856, 'loss/train': 1.2875685691833496} 11/07/2021 10:50:53 - INFO - __main__ - Step 96858: {'lr': 0.00014289779717782818, 'samples': 18596736, 'steps': 96857, 'loss/train': 1.3617751598358154} 11/07/2021 10:50:53 - INFO - __main__ - Step 96859: {'lr': 0.0001428930021072547, 'samples': 18596928, 'steps': 96858, 'loss/train': 0.5470477938652039} 11/07/2021 10:50:53 - INFO - __main__ - Step 96860: {'lr': 0.00014288820708494195, 'samples': 18597120, 'steps': 96859, 'loss/train': 1.5877784490585327} 11/07/2021 10:50:54 - INFO - __main__ - Step 96861: {'lr': 0.00014288341211089219, 'samples': 18597312, 'steps': 96860, 'loss/train': 1.0645523071289062} 11/07/2021 10:50:54 - INFO - __main__ - Step 96862: {'lr': 0.00014287861718510745, 'samples': 18597504, 'steps': 96861, 'loss/train': 1.317458152770996} 11/07/2021 10:50:55 - INFO - __main__ - Step 96863: {'lr': 0.00014287382230758995, 'samples': 18597696, 'steps': 96862, 'loss/train': 1.4175783395767212} 11/07/2021 10:50:55 - INFO - __main__ - Step 96864: {'lr': 0.00014286902747834182, 'samples': 18597888, 'steps': 96863, 'loss/train': 1.210575819015503} 11/07/2021 10:50:56 - INFO - __main__ - Step 96865: {'lr': 0.00014286423269736526, 'samples': 18598080, 'steps': 96864, 'loss/train': 1.3832366466522217} 11/07/2021 10:50:56 - INFO - __main__ - Step 96866: {'lr': 0.00014285943796466243, 'samples': 18598272, 'steps': 96865, 'loss/train': 1.2089651823043823} 11/07/2021 10:50:56 - INFO - __main__ - Step 96867: {'lr': 0.00014285464328023551, 'samples': 18598464, 'steps': 96866, 'loss/train': 1.4213402271270752} 11/07/2021 10:50:57 - INFO - __main__ - Step 96868: {'lr': 0.00014284984864408663, 'samples': 18598656, 'steps': 96867, 'loss/train': 1.4396238327026367} 11/07/2021 10:50:58 - INFO - __main__ - Step 96869: {'lr': 0.00014284505405621795, 'samples': 18598848, 'steps': 96868, 'loss/train': 1.2316983938217163} 11/07/2021 10:50:58 - INFO - __main__ - Step 96870: {'lr': 0.0001428402595166316, 'samples': 18599040, 'steps': 96869, 'loss/train': 5.838191032409668} 11/07/2021 10:50:59 - INFO - __main__ - Step 96871: {'lr': 0.00014283546502532983, 'samples': 18599232, 'steps': 96870, 'loss/train': 1.6514455080032349} 11/07/2021 10:50:59 - INFO - __main__ - Step 96872: {'lr': 0.00014283067058231468, 'samples': 18599424, 'steps': 96871, 'loss/train': 1.4461969137191772} 11/07/2021 10:50:59 - INFO - __main__ - Step 96873: {'lr': 0.00014282587618758843, 'samples': 18599616, 'steps': 96872, 'loss/train': 1.6640419960021973} 11/07/2021 10:51:00 - INFO - __main__ - Step 96874: {'lr': 0.00014282108184115316, 'samples': 18599808, 'steps': 96873, 'loss/train': 1.2097934484481812} 11/07/2021 10:51:01 - INFO - __main__ - Step 96875: {'lr': 0.0001428162875430112, 'samples': 18600000, 'steps': 96874, 'loss/train': 0.29522404074668884} 11/07/2021 10:51:01 - INFO - __main__ - Step 96876: {'lr': 0.0001428114932931644, 'samples': 18600192, 'steps': 96875, 'loss/train': 1.1848024129867554} 11/07/2021 10:51:01 - INFO - __main__ - Step 96877: {'lr': 0.00014280669909161515, 'samples': 18600384, 'steps': 96876, 'loss/train': 1.3267314434051514} 11/07/2021 10:51:02 - INFO - __main__ - Step 96878: {'lr': 0.00014280190493836552, 'samples': 18600576, 'steps': 96877, 'loss/train': 1.3595741987228394} 11/07/2021 10:51:03 - INFO - __main__ - Step 96879: {'lr': 0.00014279711083341767, 'samples': 18600768, 'steps': 96878, 'loss/train': 2.8833067417144775} 11/07/2021 10:51:03 - INFO - __main__ - Step 96880: {'lr': 0.00014279231677677385, 'samples': 18600960, 'steps': 96879, 'loss/train': 1.5729330778121948} 11/07/2021 10:51:03 - INFO - __main__ - Step 96881: {'lr': 0.00014278752276843608, 'samples': 18601152, 'steps': 96880, 'loss/train': 1.3156126737594604} 11/07/2021 10:51:04 - INFO - __main__ - Step 96882: {'lr': 0.00014278272880840668, 'samples': 18601344, 'steps': 96881, 'loss/train': 1.3204537630081177} 11/07/2021 10:51:04 - INFO - __main__ - Step 96883: {'lr': 0.00014277793489668767, 'samples': 18601536, 'steps': 96882, 'loss/train': 0.120759516954422} 11/07/2021 10:51:04 - INFO - __main__ - Step 96884: {'lr': 0.00014277314103328128, 'samples': 18601728, 'steps': 96883, 'loss/train': 1.2026506662368774} 11/07/2021 10:51:05 - INFO - __main__ - Step 96885: {'lr': 0.00014276834721818968, 'samples': 18601920, 'steps': 96884, 'loss/train': 1.5561413764953613} 11/07/2021 10:51:06 - INFO - __main__ - Step 96886: {'lr': 0.000142763553451415, 'samples': 18602112, 'steps': 96885, 'loss/train': 1.3122398853302002} 11/07/2021 10:51:06 - INFO - __main__ - Step 96887: {'lr': 0.00014275875973295937, 'samples': 18602304, 'steps': 96886, 'loss/train': 1.3368357419967651} 11/07/2021 10:51:06 - INFO - __main__ - Step 96888: {'lr': 0.00014275396606282513, 'samples': 18602496, 'steps': 96887, 'loss/train': 1.0176128149032593} 11/07/2021 10:51:07 - INFO - __main__ - Step 96889: {'lr': 0.0001427491724410142, 'samples': 18602688, 'steps': 96888, 'loss/train': 1.3579884767532349} 11/07/2021 10:51:08 - INFO - __main__ - Step 96890: {'lr': 0.00014274437886752884, 'samples': 18602880, 'steps': 96889, 'loss/train': 1.0474284887313843} 11/07/2021 10:51:08 - INFO - __main__ - Step 96891: {'lr': 0.00014273958534237116, 'samples': 18603072, 'steps': 96890, 'loss/train': 1.23057222366333} 11/07/2021 10:51:09 - INFO - __main__ - Step 96892: {'lr': 0.0001427347918655434, 'samples': 18603264, 'steps': 96891, 'loss/train': 1.093157172203064} 11/07/2021 10:51:09 - INFO - __main__ - Step 96893: {'lr': 0.00014272999843704771, 'samples': 18603456, 'steps': 96892, 'loss/train': 1.4377212524414062} 11/07/2021 10:51:09 - INFO - __main__ - Step 96894: {'lr': 0.0001427252050568862, 'samples': 18603648, 'steps': 96893, 'loss/train': 1.8680959939956665} 11/07/2021 10:51:10 - INFO - __main__ - Step 96895: {'lr': 0.00014272041172506107, 'samples': 18603840, 'steps': 96894, 'loss/train': 1.6073225736618042} 11/07/2021 10:51:11 - INFO - __main__ - Step 96896: {'lr': 0.00014271561844157445, 'samples': 18604032, 'steps': 96895, 'loss/train': 1.5862412452697754} 11/07/2021 10:51:11 - INFO - __main__ - Step 96897: {'lr': 0.00014271082520642852, 'samples': 18604224, 'steps': 96896, 'loss/train': 0.8862264752388} 11/07/2021 10:51:11 - INFO - __main__ - Step 96898: {'lr': 0.00014270603201962546, 'samples': 18604416, 'steps': 96897, 'loss/train': 1.0378894805908203} 11/07/2021 10:51:12 - INFO - __main__ - Step 96899: {'lr': 0.00014270123888116738, 'samples': 18604608, 'steps': 96898, 'loss/train': 1.3962129354476929} 11/07/2021 10:51:13 - INFO - __main__ - Step 96900: {'lr': 0.00014269644579105646, 'samples': 18604800, 'steps': 96899, 'loss/train': 1.5691362619400024} 11/07/2021 10:51:13 - INFO - __main__ - Step 96901: {'lr': 0.00014269165274929496, 'samples': 18604992, 'steps': 96900, 'loss/train': 1.8668227195739746} 11/07/2021 10:51:13 - INFO - __main__ - Step 96902: {'lr': 0.0001426868597558849, 'samples': 18605184, 'steps': 96901, 'loss/train': 1.271572232246399} 11/07/2021 10:51:14 - INFO - __main__ - Step 96903: {'lr': 0.00014268206681082842, 'samples': 18605376, 'steps': 96902, 'loss/train': 1.8048714399337769} 11/07/2021 10:51:14 - INFO - __main__ - Step 96904: {'lr': 0.00014267727391412778, 'samples': 18605568, 'steps': 96903, 'loss/train': 1.6342308521270752} 11/07/2021 10:51:15 - INFO - __main__ - Step 96905: {'lr': 0.00014267248106578513, 'samples': 18605760, 'steps': 96904, 'loss/train': 1.3579559326171875} 11/07/2021 10:51:15 - INFO - __main__ - Step 96906: {'lr': 0.00014266768826580255, 'samples': 18605952, 'steps': 96905, 'loss/train': 1.4990063905715942} 11/07/2021 10:51:16 - INFO - __main__ - Step 96907: {'lr': 0.00014266289551418226, 'samples': 18606144, 'steps': 96906, 'loss/train': 1.3454833030700684} 11/07/2021 10:51:16 - INFO - __main__ - Step 96908: {'lr': 0.00014265810281092645, 'samples': 18606336, 'steps': 96907, 'loss/train': 1.3276399374008179} 11/07/2021 10:51:16 - INFO - __main__ - Step 96909: {'lr': 0.0001426533101560372, 'samples': 18606528, 'steps': 96908, 'loss/train': 1.4484161138534546} 11/07/2021 10:51:18 - INFO - __main__ - Step 96910: {'lr': 0.00014264851754951675, 'samples': 18606720, 'steps': 96909, 'loss/train': 1.1603469848632812} 11/07/2021 10:51:18 - INFO - __main__ - Step 96911: {'lr': 0.0001426437249913672, 'samples': 18606912, 'steps': 96910, 'loss/train': 1.357857346534729} 11/07/2021 10:51:18 - INFO - __main__ - Step 96912: {'lr': 0.00014263893248159078, 'samples': 18607104, 'steps': 96911, 'loss/train': 1.2657092809677124} 11/07/2021 10:51:19 - INFO - __main__ - Step 96913: {'lr': 0.00014263414002018955, 'samples': 18607296, 'steps': 96912, 'loss/train': 1.4072625637054443} 11/07/2021 10:51:19 - INFO - __main__ - Step 96914: {'lr': 0.0001426293476071657, 'samples': 18607488, 'steps': 96913, 'loss/train': 1.8752623796463013} 11/07/2021 10:51:20 - INFO - __main__ - Step 96915: {'lr': 0.00014262455524252155, 'samples': 18607680, 'steps': 96914, 'loss/train': 0.791739284992218} 11/07/2021 10:51:20 - INFO - __main__ - Step 96916: {'lr': 0.000142619762926259, 'samples': 18607872, 'steps': 96915, 'loss/train': 1.4091169834136963} 11/07/2021 10:51:21 - INFO - __main__ - Step 96917: {'lr': 0.0001426149706583803, 'samples': 18608064, 'steps': 96916, 'loss/train': 1.3065354824066162} 11/07/2021 10:51:21 - INFO - __main__ - Step 96918: {'lr': 0.00014261017843888768, 'samples': 18608256, 'steps': 96917, 'loss/train': 1.0451124906539917} 11/07/2021 10:51:21 - INFO - __main__ - Step 96919: {'lr': 0.00014260538626778324, 'samples': 18608448, 'steps': 96918, 'loss/train': 1.5635446310043335} 11/07/2021 10:51:22 - INFO - __main__ - Step 96920: {'lr': 0.0001426005941450692, 'samples': 18608640, 'steps': 96919, 'loss/train': 1.3608789443969727} 11/07/2021 10:51:23 - INFO - __main__ - Step 96921: {'lr': 0.00014259580207074763, 'samples': 18608832, 'steps': 96920, 'loss/train': 0.8022283315658569} 11/07/2021 10:51:23 - INFO - __main__ - Step 96922: {'lr': 0.00014259101004482073, 'samples': 18609024, 'steps': 96921, 'loss/train': 0.364937424659729} 11/07/2021 10:51:24 - INFO - __main__ - Step 96923: {'lr': 0.00014258621806729067, 'samples': 18609216, 'steps': 96922, 'loss/train': 1.020399570465088} 11/07/2021 10:51:24 - INFO - __main__ - Step 96924: {'lr': 0.0001425814261381596, 'samples': 18609408, 'steps': 96923, 'loss/train': 1.4295649528503418} 11/07/2021 10:51:25 - INFO - __main__ - Step 96925: {'lr': 0.0001425766342574297, 'samples': 18609600, 'steps': 96924, 'loss/train': 1.806294560432434} 11/07/2021 10:51:25 - INFO - __main__ - Step 96926: {'lr': 0.0001425718424251031, 'samples': 18609792, 'steps': 96925, 'loss/train': 1.3354928493499756} 11/07/2021 10:51:26 - INFO - __main__ - Step 96927: {'lr': 0.00014256705064118197, 'samples': 18609984, 'steps': 96926, 'loss/train': 0.7307353019714355} 11/07/2021 10:51:26 - INFO - __main__ - Step 96928: {'lr': 0.00014256225890566857, 'samples': 18610176, 'steps': 96927, 'loss/train': 1.157839059829712} 11/07/2021 10:51:26 - INFO - __main__ - Step 96929: {'lr': 0.00014255746721856486, 'samples': 18610368, 'steps': 96928, 'loss/train': 1.3182153701782227} 11/07/2021 10:51:27 - INFO - __main__ - Step 96930: {'lr': 0.00014255267557987308, 'samples': 18610560, 'steps': 96929, 'loss/train': 1.490689754486084} 11/07/2021 10:51:28 - INFO - __main__ - Step 96931: {'lr': 0.00014254788398959542, 'samples': 18610752, 'steps': 96930, 'loss/train': 1.0727794170379639} 11/07/2021 10:51:28 - INFO - __main__ - Step 96932: {'lr': 0.00014254309244773403, 'samples': 18610944, 'steps': 96931, 'loss/train': 1.492811918258667} 11/07/2021 10:51:28 - INFO - __main__ - Step 96933: {'lr': 0.00014253830095429108, 'samples': 18611136, 'steps': 96932, 'loss/train': 1.4233696460723877} 11/07/2021 10:51:29 - INFO - __main__ - Step 96934: {'lr': 0.00014253350950926868, 'samples': 18611328, 'steps': 96933, 'loss/train': 1.4954111576080322} 11/07/2021 10:51:29 - INFO - __main__ - Step 96935: {'lr': 0.00014252871811266905, 'samples': 18611520, 'steps': 96934, 'loss/train': 1.0981223583221436} 11/07/2021 10:51:30 - INFO - __main__ - Step 96936: {'lr': 0.0001425239267644943, 'samples': 18611712, 'steps': 96935, 'loss/train': 1.7079960107803345} 11/07/2021 10:51:30 - INFO - __main__ - Step 96937: {'lr': 0.0001425191354647466, 'samples': 18611904, 'steps': 96936, 'loss/train': 1.415586233139038} 11/07/2021 10:51:31 - INFO - __main__ - Step 96938: {'lr': 0.00014251434421342816, 'samples': 18612096, 'steps': 96937, 'loss/train': 1.2549446821212769} 11/07/2021 10:51:31 - INFO - __main__ - Step 96939: {'lr': 0.0001425095530105411, 'samples': 18612288, 'steps': 96938, 'loss/train': 1.1504825353622437} 11/07/2021 10:51:32 - INFO - __main__ - Step 96940: {'lr': 0.00014250476185608752, 'samples': 18612480, 'steps': 96939, 'loss/train': 2.6814324855804443} 11/07/2021 10:51:32 - INFO - __main__ - Step 96941: {'lr': 0.00014249997075006966, 'samples': 18612672, 'steps': 96940, 'loss/train': 0.8600540161132812} 11/07/2021 10:51:33 - INFO - __main__ - Step 96942: {'lr': 0.00014249517969248975, 'samples': 18612864, 'steps': 96941, 'loss/train': 1.3532122373580933} 11/07/2021 10:51:33 - INFO - __main__ - Step 96943: {'lr': 0.0001424903886833498, 'samples': 18613056, 'steps': 96942, 'loss/train': 1.1485735177993774} 11/07/2021 10:51:34 - INFO - __main__ - Step 96944: {'lr': 0.00014248559772265195, 'samples': 18613248, 'steps': 96943, 'loss/train': 0.9501208662986755} 11/07/2021 10:51:34 - INFO - __main__ - Step 96945: {'lr': 0.0001424808068103985, 'samples': 18613440, 'steps': 96944, 'loss/train': 0.5226346254348755} 11/07/2021 10:51:35 - INFO - __main__ - Step 96946: {'lr': 0.00014247601594659148, 'samples': 18613632, 'steps': 96945, 'loss/train': 1.439677119255066} 11/07/2021 10:51:35 - INFO - __main__ - Step 96947: {'lr': 0.00014247122513123315, 'samples': 18613824, 'steps': 96946, 'loss/train': 1.489092469215393} 11/07/2021 10:51:36 - INFO - __main__ - Step 96948: {'lr': 0.0001424664343643256, 'samples': 18614016, 'steps': 96947, 'loss/train': 1.0780726671218872} 11/07/2021 10:51:36 - INFO - __main__ - Step 96949: {'lr': 0.00014246164364587103, 'samples': 18614208, 'steps': 96948, 'loss/train': 1.690037488937378} 11/07/2021 10:51:36 - INFO - __main__ - Step 96950: {'lr': 0.00014245685297587158, 'samples': 18614400, 'steps': 96949, 'loss/train': 1.7919425964355469} 11/07/2021 10:51:37 - INFO - __main__ - Step 96951: {'lr': 0.00014245206235432938, 'samples': 18614592, 'steps': 96950, 'loss/train': 0.3575851619243622} 11/07/2021 10:51:38 - INFO - __main__ - Step 96952: {'lr': 0.00014244727178124668, 'samples': 18614784, 'steps': 96951, 'loss/train': 1.3335679769515991} 11/07/2021 10:51:38 - INFO - __main__ - Step 96953: {'lr': 0.00014244248125662556, 'samples': 18614976, 'steps': 96952, 'loss/train': 1.2653449773788452} 11/07/2021 10:51:38 - INFO - __main__ - Step 96954: {'lr': 0.0001424376907804682, 'samples': 18615168, 'steps': 96953, 'loss/train': 1.5444930791854858} 11/07/2021 10:51:39 - INFO - __main__ - Step 96955: {'lr': 0.00014243290035277685, 'samples': 18615360, 'steps': 96954, 'loss/train': 1.1584159135818481} 11/07/2021 10:51:40 - INFO - __main__ - Step 96956: {'lr': 0.00014242810997355346, 'samples': 18615552, 'steps': 96955, 'loss/train': 0.7131778597831726} 11/07/2021 10:51:40 - INFO - __main__ - Step 96957: {'lr': 0.00014242331964280032, 'samples': 18615744, 'steps': 96956, 'loss/train': 1.2213788032531738} 11/07/2021 10:51:40 - INFO - __main__ - Step 96958: {'lr': 0.0001424185293605196, 'samples': 18615936, 'steps': 96957, 'loss/train': 1.279188632965088} 11/07/2021 10:51:41 - INFO - __main__ - Step 96959: {'lr': 0.00014241373912671337, 'samples': 18616128, 'steps': 96958, 'loss/train': 1.3409785032272339} 11/07/2021 10:51:41 - INFO - __main__ - Step 96960: {'lr': 0.0001424089489413839, 'samples': 18616320, 'steps': 96959, 'loss/train': 1.32374107837677} 11/07/2021 10:51:42 - INFO - __main__ - Step 96961: {'lr': 0.00014240415880453324, 'samples': 18616512, 'steps': 96960, 'loss/train': 1.651672124862671} 11/07/2021 10:51:43 - INFO - __main__ - Step 96962: {'lr': 0.00014239936871616367, 'samples': 18616704, 'steps': 96961, 'loss/train': 1.3609200716018677} 11/07/2021 10:51:43 - INFO - __main__ - Step 96963: {'lr': 0.00014239457867627726, 'samples': 18616896, 'steps': 96962, 'loss/train': 0.9024490118026733} 11/07/2021 10:51:43 - INFO - __main__ - Step 96964: {'lr': 0.00014238978868487618, 'samples': 18617088, 'steps': 96963, 'loss/train': 1.2413660287857056} 11/07/2021 10:51:44 - INFO - __main__ - Step 96965: {'lr': 0.0001423849987419626, 'samples': 18617280, 'steps': 96964, 'loss/train': 1.4211790561676025} 11/07/2021 10:51:45 - INFO - __main__ - Step 96966: {'lr': 0.00014238020884753868, 'samples': 18617472, 'steps': 96965, 'loss/train': 0.860150933265686} 11/07/2021 10:51:45 - INFO - __main__ - Step 96967: {'lr': 0.0001423754190016066, 'samples': 18617664, 'steps': 96966, 'loss/train': 1.058793067932129} 11/07/2021 10:51:45 - INFO - __main__ - Step 96968: {'lr': 0.00014237062920416848, 'samples': 18617856, 'steps': 96967, 'loss/train': 1.4090474843978882} 11/07/2021 10:51:46 - INFO - __main__ - Step 96969: {'lr': 0.0001423658394552266, 'samples': 18618048, 'steps': 96968, 'loss/train': 1.234145164489746} 11/07/2021 10:51:46 - INFO - __main__ - Step 96970: {'lr': 0.00014236104975478292, 'samples': 18618240, 'steps': 96969, 'loss/train': 1.5715210437774658} 11/07/2021 10:51:48 - INFO - __main__ - Step 96971: {'lr': 0.00014235626010283963, 'samples': 18618432, 'steps': 96970, 'loss/train': 0.4199790358543396} 11/07/2021 10:51:48 - INFO - __main__ - Step 96972: {'lr': 0.000142351470499399, 'samples': 18618624, 'steps': 96971, 'loss/train': 1.2796205282211304} 11/07/2021 10:51:48 - INFO - __main__ - Step 96973: {'lr': 0.00014234668094446315, 'samples': 18618816, 'steps': 96972, 'loss/train': 1.3784011602401733} 11/07/2021 10:51:49 - INFO - __main__ - Step 96974: {'lr': 0.00014234189143803421, 'samples': 18619008, 'steps': 96973, 'loss/train': 1.1185632944107056} 11/07/2021 10:51:49 - INFO - __main__ - Step 96975: {'lr': 0.00014233710198011435, 'samples': 18619200, 'steps': 96974, 'loss/train': 1.2683249711990356} 11/07/2021 10:51:49 - INFO - __main__ - Step 96976: {'lr': 0.00014233231257070573, 'samples': 18619392, 'steps': 96975, 'loss/train': 1.170230746269226} 11/07/2021 10:51:51 - INFO - __main__ - Step 96977: {'lr': 0.00014232752320981053, 'samples': 18619584, 'steps': 96976, 'loss/train': 0.7575505971908569} 11/07/2021 10:51:51 - INFO - __main__ - Step 96978: {'lr': 0.00014232273389743085, 'samples': 18619776, 'steps': 96977, 'loss/train': 1.3367784023284912} 11/07/2021 10:51:51 - INFO - __main__ - Step 96979: {'lr': 0.0001423179446335689, 'samples': 18619968, 'steps': 96978, 'loss/train': 0.34017112851142883} 11/07/2021 10:51:52 - INFO - __main__ - Step 96980: {'lr': 0.00014231315541822682, 'samples': 18620160, 'steps': 96979, 'loss/train': 0.5941416621208191} 11/07/2021 10:51:52 - INFO - __main__ - Step 96981: {'lr': 0.00014230836625140676, 'samples': 18620352, 'steps': 96980, 'loss/train': 1.1126594543457031} 11/07/2021 10:51:53 - INFO - __main__ - Step 96982: {'lr': 0.00014230357713311098, 'samples': 18620544, 'steps': 96981, 'loss/train': 1.4044373035430908} 11/07/2021 10:51:54 - INFO - __main__ - Step 96983: {'lr': 0.00014229878806334148, 'samples': 18620736, 'steps': 96982, 'loss/train': 1.5063772201538086} 11/07/2021 10:51:54 - INFO - __main__ - Step 96984: {'lr': 0.00014229399904210047, 'samples': 18620928, 'steps': 96983, 'loss/train': 1.0367956161499023} 11/07/2021 10:51:54 - INFO - __main__ - Step 96985: {'lr': 0.0001422892100693901, 'samples': 18621120, 'steps': 96984, 'loss/train': 1.2588468790054321} 11/07/2021 10:51:55 - INFO - __main__ - Step 96986: {'lr': 0.00014228442114521262, 'samples': 18621312, 'steps': 96985, 'loss/train': 5.724117279052734} 11/07/2021 10:51:55 - INFO - __main__ - Step 96987: {'lr': 0.00014227963226957004, 'samples': 18621504, 'steps': 96986, 'loss/train': 1.5157259702682495} 11/07/2021 10:51:56 - INFO - __main__ - Step 96988: {'lr': 0.00014227484344246465, 'samples': 18621696, 'steps': 96987, 'loss/train': 0.8829978704452515} 11/07/2021 10:51:57 - INFO - __main__ - Step 96989: {'lr': 0.00014227005466389852, 'samples': 18621888, 'steps': 96988, 'loss/train': 0.9983264207839966} 11/07/2021 10:51:57 - INFO - __main__ - Step 96990: {'lr': 0.00014226526593387383, 'samples': 18622080, 'steps': 96989, 'loss/train': 1.0537010431289673} 11/07/2021 10:51:57 - INFO - __main__ - Step 96991: {'lr': 0.00014226047725239278, 'samples': 18622272, 'steps': 96990, 'loss/train': 1.338580846786499} 11/07/2021 10:51:58 - INFO - __main__ - Step 96992: {'lr': 0.00014225568861945748, 'samples': 18622464, 'steps': 96991, 'loss/train': 1.1074100732803345} 11/07/2021 10:51:59 - INFO - __main__ - Step 96993: {'lr': 0.00014225090003507014, 'samples': 18622656, 'steps': 96992, 'loss/train': 1.5994257926940918} 11/07/2021 10:51:59 - INFO - __main__ - Step 96994: {'lr': 0.00014224611149923284, 'samples': 18622848, 'steps': 96993, 'loss/train': 0.930855929851532} 11/07/2021 10:51:59 - INFO - __main__ - Step 96995: {'lr': 0.00014224132301194776, 'samples': 18623040, 'steps': 96994, 'loss/train': 1.4181386232376099} 11/07/2021 10:52:00 - INFO - __main__ - Step 96996: {'lr': 0.00014223653457321722, 'samples': 18623232, 'steps': 96995, 'loss/train': 1.4133702516555786} 11/07/2021 10:52:00 - INFO - __main__ - Step 96997: {'lr': 0.00014223174618304313, 'samples': 18623424, 'steps': 96996, 'loss/train': 0.9111426472663879} 11/07/2021 10:52:00 - INFO - __main__ - Step 96998: {'lr': 0.00014222695784142775, 'samples': 18623616, 'steps': 96997, 'loss/train': 0.9588895440101624} 11/07/2021 10:52:02 - INFO - __main__ - Step 96999: {'lr': 0.00014222216954837323, 'samples': 18623808, 'steps': 96998, 'loss/train': 1.5057387351989746} 11/07/2021 10:52:02 - INFO - __main__ - Step 97000: {'lr': 0.00014221738130388174, 'samples': 18624000, 'steps': 96999, 'loss/train': 0.9033804535865784} 11/07/2021 10:52:02 - INFO - __main__ - Step 97001: {'lr': 0.00014221259310795542, 'samples': 18624192, 'steps': 97000, 'loss/train': 1.3194373846054077} 11/07/2021 10:52:03 - INFO - __main__ - Step 97002: {'lr': 0.00014220780496059646, 'samples': 18624384, 'steps': 97001, 'loss/train': 1.5574055910110474} 11/07/2021 10:52:03 - INFO - __main__ - Step 97003: {'lr': 0.000142203016861807, 'samples': 18624576, 'steps': 97002, 'loss/train': 1.4970512390136719} 11/07/2021 10:52:04 - INFO - __main__ - Step 97004: {'lr': 0.0001421982288115892, 'samples': 18624768, 'steps': 97003, 'loss/train': 0.9533843994140625} 11/07/2021 10:52:04 - INFO - __main__ - Step 97005: {'lr': 0.0001421934408099452, 'samples': 18624960, 'steps': 97004, 'loss/train': 0.7760515809059143} 11/07/2021 10:52:05 - INFO - __main__ - Step 97006: {'lr': 0.0001421886528568772, 'samples': 18625152, 'steps': 97005, 'loss/train': 1.077505111694336} 11/07/2021 10:52:05 - INFO - __main__ - Step 97007: {'lr': 0.00014218386495238727, 'samples': 18625344, 'steps': 97006, 'loss/train': 1.556117057800293} 11/07/2021 10:52:05 - INFO - __main__ - Step 97008: {'lr': 0.0001421790770964777, 'samples': 18625536, 'steps': 97007, 'loss/train': 1.173203945159912} 11/07/2021 10:52:06 - INFO - __main__ - Step 97009: {'lr': 0.00014217428928915064, 'samples': 18625728, 'steps': 97008, 'loss/train': 0.8844147324562073} 11/07/2021 10:52:07 - INFO - __main__ - Step 97010: {'lr': 0.0001421695015304081, 'samples': 18625920, 'steps': 97009, 'loss/train': 1.1304500102996826} 11/07/2021 10:52:07 - INFO - __main__ - Step 97011: {'lr': 0.00014216471382025225, 'samples': 18626112, 'steps': 97010, 'loss/train': 0.805219292640686} 11/07/2021 10:52:07 - INFO - __main__ - Step 97012: {'lr': 0.00014215992615868538, 'samples': 18626304, 'steps': 97011, 'loss/train': 1.480625033378601} 11/07/2021 10:52:08 - INFO - __main__ - Step 97013: {'lr': 0.00014215513854570958, 'samples': 18626496, 'steps': 97012, 'loss/train': 1.3457088470458984} 11/07/2021 10:52:09 - INFO - __main__ - Step 97014: {'lr': 0.000142150350981327, 'samples': 18626688, 'steps': 97013, 'loss/train': 1.293805718421936} 11/07/2021 10:52:09 - INFO - __main__ - Step 97015: {'lr': 0.0001421455634655398, 'samples': 18626880, 'steps': 97014, 'loss/train': 1.3314851522445679} 11/07/2021 10:52:10 - INFO - __main__ - Step 97016: {'lr': 0.00014214077599835018, 'samples': 18627072, 'steps': 97015, 'loss/train': 1.2718336582183838} 11/07/2021 10:52:10 - INFO - __main__ - Step 97017: {'lr': 0.00014213598857976023, 'samples': 18627264, 'steps': 97016, 'loss/train': 1.2637996673583984} 11/07/2021 10:52:10 - INFO - __main__ - Step 97018: {'lr': 0.00014213120120977214, 'samples': 18627456, 'steps': 97017, 'loss/train': 1.157542109489441} 11/07/2021 10:52:11 - INFO - __main__ - Step 97019: {'lr': 0.00014212641388838807, 'samples': 18627648, 'steps': 97018, 'loss/train': 1.3133645057678223} 11/07/2021 10:52:12 - INFO - __main__ - Step 97020: {'lr': 0.00014212162661561017, 'samples': 18627840, 'steps': 97019, 'loss/train': 1.4253581762313843} 11/07/2021 10:52:12 - INFO - __main__ - Step 97021: {'lr': 0.0001421168393914406, 'samples': 18628032, 'steps': 97020, 'loss/train': 1.1146409511566162} 11/07/2021 10:52:12 - INFO - __main__ - Step 97022: {'lr': 0.00014211205221588164, 'samples': 18628224, 'steps': 97021, 'loss/train': 1.2110263109207153} 11/07/2021 10:52:13 - INFO - __main__ - Step 97023: {'lr': 0.0001421072650889352, 'samples': 18628416, 'steps': 97022, 'loss/train': 1.1194970607757568} 11/07/2021 10:52:14 - INFO - __main__ - Step 97024: {'lr': 0.00014210247801060355, 'samples': 18628608, 'steps': 97023, 'loss/train': 0.9497612118721008} 11/07/2021 10:52:14 - INFO - __main__ - Step 97025: {'lr': 0.00014209769098088887, 'samples': 18628800, 'steps': 97024, 'loss/train': 1.0730462074279785} 11/07/2021 10:52:14 - INFO - __main__ - Step 97026: {'lr': 0.00014209290399979334, 'samples': 18628992, 'steps': 97025, 'loss/train': 1.4657433032989502} 11/07/2021 10:52:15 - INFO - __main__ - Step 97027: {'lr': 0.00014208811706731907, 'samples': 18629184, 'steps': 97026, 'loss/train': 1.153592586517334} 11/07/2021 10:52:15 - INFO - __main__ - Step 97028: {'lr': 0.0001420833301834682, 'samples': 18629376, 'steps': 97027, 'loss/train': 1.8321298360824585} 11/07/2021 10:52:16 - INFO - __main__ - Step 97029: {'lr': 0.00014207854334824294, 'samples': 18629568, 'steps': 97028, 'loss/train': 0.9147504568099976} 11/07/2021 10:52:17 - INFO - __main__ - Step 97030: {'lr': 0.00014207375656164538, 'samples': 18629760, 'steps': 97029, 'loss/train': 1.0121194124221802} 11/07/2021 10:52:17 - INFO - __main__ - Step 97031: {'lr': 0.0001420689698236778, 'samples': 18629952, 'steps': 97030, 'loss/train': 0.8516036868095398} 11/07/2021 10:52:17 - INFO - __main__ - Step 97032: {'lr': 0.00014206418313434218, 'samples': 18630144, 'steps': 97031, 'loss/train': 1.5461219549179077} 11/07/2021 10:52:18 - INFO - __main__ - Step 97033: {'lr': 0.00014205939649364094, 'samples': 18630336, 'steps': 97032, 'loss/train': 1.6103535890579224} 11/07/2021 10:52:18 - INFO - __main__ - Step 97034: {'lr': 0.00014205460990157596, 'samples': 18630528, 'steps': 97033, 'loss/train': 1.7576271295547485} 11/07/2021 10:52:19 - INFO - __main__ - Step 97035: {'lr': 0.00014204982335814948, 'samples': 18630720, 'steps': 97034, 'loss/train': 1.119375467300415} 11/07/2021 10:52:19 - INFO - __main__ - Step 97036: {'lr': 0.00014204503686336372, 'samples': 18630912, 'steps': 97035, 'loss/train': 1.0828230381011963} 11/07/2021 10:52:20 - INFO - __main__ - Step 97037: {'lr': 0.0001420402504172208, 'samples': 18631104, 'steps': 97036, 'loss/train': 1.181477665901184} 11/07/2021 10:52:20 - INFO - __main__ - Step 97038: {'lr': 0.00014203546401972284, 'samples': 18631296, 'steps': 97037, 'loss/train': 1.2885370254516602} 11/07/2021 10:52:20 - INFO - __main__ - Step 97039: {'lr': 0.00014203067767087208, 'samples': 18631488, 'steps': 97038, 'loss/train': 1.2169519662857056} 11/07/2021 10:52:21 - INFO - __main__ - Step 97040: {'lr': 0.00014202589137067058, 'samples': 18631680, 'steps': 97039, 'loss/train': 1.0694258213043213} 11/07/2021 10:52:22 - INFO - __main__ - Step 97041: {'lr': 0.0001420211051191206, 'samples': 18631872, 'steps': 97040, 'loss/train': 1.5923422574996948} 11/07/2021 10:52:22 - INFO - __main__ - Step 97042: {'lr': 0.00014201631891622418, 'samples': 18632064, 'steps': 97041, 'loss/train': 1.3725115060806274} 11/07/2021 10:52:22 - INFO - __main__ - Step 97043: {'lr': 0.00014201153276198358, 'samples': 18632256, 'steps': 97042, 'loss/train': 0.9936412572860718} 11/07/2021 10:52:23 - INFO - __main__ - Step 97044: {'lr': 0.000142006746656401, 'samples': 18632448, 'steps': 97043, 'loss/train': 1.4150298833847046} 11/07/2021 10:52:24 - INFO - __main__ - Step 97045: {'lr': 0.00014200196059947846, 'samples': 18632640, 'steps': 97044, 'loss/train': 1.6581709384918213} 11/07/2021 10:52:24 - INFO - __main__ - Step 97046: {'lr': 0.00014199717459121813, 'samples': 18632832, 'steps': 97045, 'loss/train': 1.537085771560669} 11/07/2021 10:52:25 - INFO - __main__ - Step 97047: {'lr': 0.00014199238863162224, 'samples': 18633024, 'steps': 97046, 'loss/train': 1.366891622543335} 11/07/2021 10:52:25 - INFO - __main__ - Step 97048: {'lr': 0.00014198760272069285, 'samples': 18633216, 'steps': 97047, 'loss/train': 1.5101730823516846} 11/07/2021 10:52:25 - INFO - __main__ - Step 97049: {'lr': 0.00014198281685843224, 'samples': 18633408, 'steps': 97048, 'loss/train': 0.7974488139152527} 11/07/2021 10:52:26 - INFO - __main__ - Step 97050: {'lr': 0.00014197803104484247, 'samples': 18633600, 'steps': 97049, 'loss/train': 1.5186537504196167} 11/07/2021 10:52:27 - INFO - __main__ - Step 97051: {'lr': 0.00014197324527992576, 'samples': 18633792, 'steps': 97050, 'loss/train': 1.0150624513626099} 11/07/2021 10:52:27 - INFO - __main__ - Step 97052: {'lr': 0.0001419684595636842, 'samples': 18633984, 'steps': 97051, 'loss/train': 1.2976723909378052} 11/07/2021 10:52:27 - INFO - __main__ - Step 97053: {'lr': 0.00014196367389612003, 'samples': 18634176, 'steps': 97052, 'loss/train': 1.2677325010299683} 11/07/2021 10:52:28 - INFO - __main__ - Step 97054: {'lr': 0.00014195888827723535, 'samples': 18634368, 'steps': 97053, 'loss/train': 1.5783785581588745} 11/07/2021 10:52:29 - INFO - __main__ - Step 97055: {'lr': 0.0001419541027070324, 'samples': 18634560, 'steps': 97054, 'loss/train': 0.8300455808639526} 11/07/2021 10:52:29 - INFO - __main__ - Step 97056: {'lr': 0.00014194931718551317, 'samples': 18634752, 'steps': 97055, 'loss/train': 1.3580594062805176} 11/07/2021 10:52:30 - INFO - __main__ - Step 97057: {'lr': 0.00014194453171267996, 'samples': 18634944, 'steps': 97056, 'loss/train': 1.875908613204956} 11/07/2021 10:52:30 - INFO - __main__ - Step 97058: {'lr': 0.00014193974628853482, 'samples': 18635136, 'steps': 97057, 'loss/train': 1.223982572555542} 11/07/2021 10:52:30 - INFO - __main__ - Step 97059: {'lr': 0.00014193496091307998, 'samples': 18635328, 'steps': 97058, 'loss/train': 1.1644065380096436} 11/07/2021 10:52:31 - INFO - __main__ - Step 97060: {'lr': 0.0001419301755863176, 'samples': 18635520, 'steps': 97059, 'loss/train': 0.6084378361701965} 11/07/2021 10:52:32 - INFO - __main__ - Step 97061: {'lr': 0.00014192539030824977, 'samples': 18635712, 'steps': 97060, 'loss/train': 1.7300693988800049} 11/07/2021 10:52:32 - INFO - __main__ - Step 97062: {'lr': 0.0001419206050788787, 'samples': 18635904, 'steps': 97061, 'loss/train': 1.2424854040145874} 11/07/2021 10:52:32 - INFO - __main__ - Step 97063: {'lr': 0.00014191581989820656, 'samples': 18636096, 'steps': 97062, 'loss/train': 1.3037852048873901} 11/07/2021 10:52:33 - INFO - __main__ - Step 97064: {'lr': 0.0001419110347662355, 'samples': 18636288, 'steps': 97063, 'loss/train': 1.1914960145950317} 11/07/2021 10:52:33 - INFO - __main__ - Step 97065: {'lr': 0.00014190624968296765, 'samples': 18636480, 'steps': 97064, 'loss/train': 1.1287533044815063} 11/07/2021 10:52:34 - INFO - __main__ - Step 97066: {'lr': 0.00014190146464840525, 'samples': 18636672, 'steps': 97065, 'loss/train': 1.370485782623291} 11/07/2021 10:52:35 - INFO - __main__ - Step 97067: {'lr': 0.00014189667966255033, 'samples': 18636864, 'steps': 97066, 'loss/train': 1.2154847383499146} 11/07/2021 10:52:35 - INFO - __main__ - Step 97068: {'lr': 0.00014189189472540504, 'samples': 18637056, 'steps': 97067, 'loss/train': 0.8915582895278931} 11/07/2021 10:52:36 - INFO - __main__ - Step 97069: {'lr': 0.00014188710983697162, 'samples': 18637248, 'steps': 97068, 'loss/train': 1.549246072769165} 11/07/2021 10:52:36 - INFO - __main__ - Step 97070: {'lr': 0.0001418823249972522, 'samples': 18637440, 'steps': 97069, 'loss/train': 0.4890837073326111} 11/07/2021 10:52:37 - INFO - __main__ - Step 97071: {'lr': 0.00014187754020624893, 'samples': 18637632, 'steps': 97070, 'loss/train': 1.4468309879302979} 11/07/2021 10:52:37 - INFO - __main__ - Step 97072: {'lr': 0.000141872755463964, 'samples': 18637824, 'steps': 97071, 'loss/train': 1.4854148626327515} 11/07/2021 10:52:38 - INFO - __main__ - Step 97073: {'lr': 0.00014186797077039948, 'samples': 18638016, 'steps': 97072, 'loss/train': 1.3142551183700562} 11/07/2021 10:52:38 - INFO - __main__ - Step 97074: {'lr': 0.00014186318612555764, 'samples': 18638208, 'steps': 97073, 'loss/train': 1.2459839582443237} 11/07/2021 10:52:38 - INFO - __main__ - Step 97075: {'lr': 0.00014185840152944058, 'samples': 18638400, 'steps': 97074, 'loss/train': 0.876483142375946} 11/07/2021 10:52:39 - INFO - __main__ - Step 97076: {'lr': 0.00014185361698205052, 'samples': 18638592, 'steps': 97075, 'loss/train': 0.6312455534934998} 11/07/2021 10:52:40 - INFO - __main__ - Step 97077: {'lr': 0.00014184883248338946, 'samples': 18638784, 'steps': 97076, 'loss/train': 1.7304065227508545} 11/07/2021 10:52:40 - INFO - __main__ - Step 97078: {'lr': 0.00014184404803345963, 'samples': 18638976, 'steps': 97077, 'loss/train': 1.1536592245101929} 11/07/2021 10:52:40 - INFO - __main__ - Step 97079: {'lr': 0.00014183926363226323, 'samples': 18639168, 'steps': 97078, 'loss/train': 1.2260633707046509} 11/07/2021 10:52:41 - INFO - __main__ - Step 97080: {'lr': 0.0001418344792798024, 'samples': 18639360, 'steps': 97079, 'loss/train': 2.3019535541534424} 11/07/2021 10:52:42 - INFO - __main__ - Step 97081: {'lr': 0.0001418296949760793, 'samples': 18639552, 'steps': 97080, 'loss/train': 1.3043440580368042} 11/07/2021 10:52:42 - INFO - __main__ - Step 97082: {'lr': 0.00014182491072109598, 'samples': 18639744, 'steps': 97081, 'loss/train': 0.899023175239563} 11/07/2021 10:52:43 - INFO - __main__ - Step 97083: {'lr': 0.00014182012651485477, 'samples': 18639936, 'steps': 97082, 'loss/train': 0.7675520181655884} 11/07/2021 10:52:43 - INFO - __main__ - Step 97084: {'lr': 0.0001418153423573577, 'samples': 18640128, 'steps': 97083, 'loss/train': 1.4592307806015015} 11/07/2021 10:52:43 - INFO - __main__ - Step 97085: {'lr': 0.000141810558248607, 'samples': 18640320, 'steps': 97084, 'loss/train': 1.108165979385376} 11/07/2021 10:52:44 - INFO - __main__ - Step 97086: {'lr': 0.0001418057741886048, 'samples': 18640512, 'steps': 97085, 'loss/train': 1.7934693098068237} 11/07/2021 10:52:45 - INFO - __main__ - Step 97087: {'lr': 0.0001418009901773532, 'samples': 18640704, 'steps': 97086, 'loss/train': 1.5691499710083008} 11/07/2021 10:52:45 - INFO - __main__ - Step 97088: {'lr': 0.00014179620621485446, 'samples': 18640896, 'steps': 97087, 'loss/train': 1.560156226158142} 11/07/2021 10:52:45 - INFO - __main__ - Step 97089: {'lr': 0.00014179142230111064, 'samples': 18641088, 'steps': 97088, 'loss/train': 1.286841869354248} 11/07/2021 10:52:46 - INFO - __main__ - Step 97090: {'lr': 0.00014178663843612404, 'samples': 18641280, 'steps': 97089, 'loss/train': 1.1403855085372925} 11/07/2021 10:52:46 - INFO - __main__ - Step 97091: {'lr': 0.0001417818546198966, 'samples': 18641472, 'steps': 97090, 'loss/train': 1.1788575649261475} 11/07/2021 10:52:47 - INFO - __main__ - Step 97092: {'lr': 0.0001417770708524306, 'samples': 18641664, 'steps': 97091, 'loss/train': 1.2933038473129272} 11/07/2021 10:52:47 - INFO - __main__ - Step 97093: {'lr': 0.0001417722871337282, 'samples': 18641856, 'steps': 97092, 'loss/train': 1.0197471380233765} 11/07/2021 10:52:48 - INFO - __main__ - Step 97094: {'lr': 0.00014176750346379152, 'samples': 18642048, 'steps': 97093, 'loss/train': 1.4075511693954468} 11/07/2021 10:52:48 - INFO - __main__ - Step 97095: {'lr': 0.00014176271984262274, 'samples': 18642240, 'steps': 97094, 'loss/train': 1.6849037408828735} 11/07/2021 10:52:48 - INFO - __main__ - Step 97096: {'lr': 0.00014175793627022398, 'samples': 18642432, 'steps': 97095, 'loss/train': 1.5892398357391357} 11/07/2021 10:52:50 - INFO - __main__ - Step 97097: {'lr': 0.00014175315274659746, 'samples': 18642624, 'steps': 97096, 'loss/train': 1.1260898113250732} 11/07/2021 10:52:50 - INFO - __main__ - Step 97098: {'lr': 0.0001417483692717453, 'samples': 18642816, 'steps': 97097, 'loss/train': 1.2768328189849854} 11/07/2021 10:52:50 - INFO - __main__ - Step 97099: {'lr': 0.00014174358584566964, 'samples': 18643008, 'steps': 97098, 'loss/train': 1.5796757936477661} 11/07/2021 10:52:51 - INFO - __main__ - Step 97100: {'lr': 0.00014173880246837263, 'samples': 18643200, 'steps': 97099, 'loss/train': 1.399497151374817} 11/07/2021 10:52:51 - INFO - __main__ - Step 97101: {'lr': 0.00014173401913985644, 'samples': 18643392, 'steps': 97100, 'loss/train': 0.09694083034992218} 11/07/2021 10:52:52 - INFO - __main__ - Step 97102: {'lr': 0.00014172923586012326, 'samples': 18643584, 'steps': 97101, 'loss/train': 1.238054633140564} 11/07/2021 10:52:53 - INFO - __main__ - Step 97103: {'lr': 0.00014172445262917532, 'samples': 18643776, 'steps': 97102, 'loss/train': 1.7582237720489502} 11/07/2021 10:52:53 - INFO - __main__ - Step 97104: {'lr': 0.0001417196694470146, 'samples': 18643968, 'steps': 97103, 'loss/train': 0.45950809121131897} 11/07/2021 10:52:53 - INFO - __main__ - Step 97105: {'lr': 0.00014171488631364328, 'samples': 18644160, 'steps': 97104, 'loss/train': 1.3428436517715454} 11/07/2021 10:52:54 - INFO - __main__ - Step 97106: {'lr': 0.00014171010322906356, 'samples': 18644352, 'steps': 97105, 'loss/train': 1.4241942167282104} 11/07/2021 10:52:55 - INFO - __main__ - Step 97107: {'lr': 0.0001417053201932776, 'samples': 18644544, 'steps': 97106, 'loss/train': 2.091219902038574} 11/07/2021 10:52:55 - INFO - __main__ - Step 97108: {'lr': 0.00014170053720628757, 'samples': 18644736, 'steps': 97107, 'loss/train': 0.9533679485321045} 11/07/2021 10:52:55 - INFO - __main__ - Step 97109: {'lr': 0.00014169575426809558, 'samples': 18644928, 'steps': 97108, 'loss/train': 1.1645545959472656} 11/07/2021 10:52:56 - INFO - __main__ - Step 97110: {'lr': 0.00014169097137870383, 'samples': 18645120, 'steps': 97109, 'loss/train': 1.0740469694137573} 11/07/2021 10:52:56 - INFO - __main__ - Step 97111: {'lr': 0.00014168618853811443, 'samples': 18645312, 'steps': 97110, 'loss/train': 1.3595551252365112} 11/07/2021 10:52:57 - INFO - __main__ - Step 97112: {'lr': 0.0001416814057463296, 'samples': 18645504, 'steps': 97111, 'loss/train': 1.5321857929229736} 11/07/2021 10:52:58 - INFO - __main__ - Step 97113: {'lr': 0.00014167662300335144, 'samples': 18645696, 'steps': 97112, 'loss/train': 1.0198092460632324} 11/07/2021 10:52:58 - INFO - __main__ - Step 97114: {'lr': 0.00014167184030918213, 'samples': 18645888, 'steps': 97113, 'loss/train': 1.6063740253448486} 11/07/2021 10:52:58 - INFO - __main__ - Step 97115: {'lr': 0.00014166705766382383, 'samples': 18646080, 'steps': 97114, 'loss/train': 1.4886322021484375} 11/07/2021 10:52:59 - INFO - __main__ - Step 97116: {'lr': 0.00014166227506727863, 'samples': 18646272, 'steps': 97115, 'loss/train': 1.456660509109497} 11/07/2021 10:52:59 - INFO - __main__ - Step 97117: {'lr': 0.00014165749251954888, 'samples': 18646464, 'steps': 97116, 'loss/train': 1.2119734287261963} 11/07/2021 10:53:00 - INFO - __main__ - Step 97118: {'lr': 0.00014165271002063647, 'samples': 18646656, 'steps': 97117, 'loss/train': 1.593920350074768} 11/07/2021 10:53:00 - INFO - __main__ - Step 97119: {'lr': 0.0001416479275705437, 'samples': 18646848, 'steps': 97118, 'loss/train': 1.419183611869812} 11/07/2021 10:53:01 - INFO - __main__ - Step 97120: {'lr': 0.00014164314516927268, 'samples': 18647040, 'steps': 97119, 'loss/train': 0.9917078614234924} 11/07/2021 10:53:01 - INFO - __main__ - Step 97121: {'lr': 0.00014163836281682563, 'samples': 18647232, 'steps': 97120, 'loss/train': 1.3151754140853882} 11/07/2021 10:53:01 - INFO - __main__ - Step 97122: {'lr': 0.00014163358051320462, 'samples': 18647424, 'steps': 97121, 'loss/train': 0.8597118854522705} 11/07/2021 10:53:03 - INFO - __main__ - Step 97123: {'lr': 0.00014162879825841185, 'samples': 18647616, 'steps': 97122, 'loss/train': 1.2756222486495972} 11/07/2021 10:53:03 - INFO - __main__ - Step 97124: {'lr': 0.00014162401605244946, 'samples': 18647808, 'steps': 97123, 'loss/train': 1.0563228130340576} 11/07/2021 10:53:03 - INFO - __main__ - Step 97125: {'lr': 0.00014161923389531967, 'samples': 18648000, 'steps': 97124, 'loss/train': 1.329170823097229} 11/07/2021 10:53:04 - INFO - __main__ - Step 97126: {'lr': 0.00014161445178702454, 'samples': 18648192, 'steps': 97125, 'loss/train': 1.3340609073638916} 11/07/2021 10:53:04 - INFO - __main__ - Step 97127: {'lr': 0.00014160966972756624, 'samples': 18648384, 'steps': 97126, 'loss/train': 1.0831894874572754} 11/07/2021 10:53:05 - INFO - __main__ - Step 97128: {'lr': 0.000141604887716947, 'samples': 18648576, 'steps': 97127, 'loss/train': 1.050214409828186} 11/07/2021 10:53:05 - INFO - __main__ - Step 97129: {'lr': 0.00014160010575516892, 'samples': 18648768, 'steps': 97128, 'loss/train': 1.218207836151123} 11/07/2021 10:53:06 - INFO - __main__ - Step 97130: {'lr': 0.00014159532384223423, 'samples': 18648960, 'steps': 97129, 'loss/train': 0.8761935830116272} 11/07/2021 10:53:06 - INFO - __main__ - Step 97131: {'lr': 0.0001415905419781449, 'samples': 18649152, 'steps': 97130, 'loss/train': 0.6399369835853577} 11/07/2021 10:53:06 - INFO - __main__ - Step 97132: {'lr': 0.00014158576016290325, 'samples': 18649344, 'steps': 97131, 'loss/train': 1.1259785890579224} 11/07/2021 10:53:07 - INFO - __main__ - Step 97133: {'lr': 0.00014158097839651136, 'samples': 18649536, 'steps': 97132, 'loss/train': 1.2092440128326416} 11/07/2021 10:53:08 - INFO - __main__ - Step 97134: {'lr': 0.00014157619667897142, 'samples': 18649728, 'steps': 97133, 'loss/train': 1.2659257650375366} 11/07/2021 10:53:08 - INFO - __main__ - Step 97135: {'lr': 0.00014157141501028553, 'samples': 18649920, 'steps': 97134, 'loss/train': 1.6232376098632812} 11/07/2021 10:53:08 - INFO - __main__ - Step 97136: {'lr': 0.00014156663339045595, 'samples': 18650112, 'steps': 97135, 'loss/train': 1.2715120315551758} 11/07/2021 10:53:09 - INFO - __main__ - Step 97137: {'lr': 0.00014156185181948472, 'samples': 18650304, 'steps': 97136, 'loss/train': 1.3242243528366089} 11/07/2021 10:53:10 - INFO - __main__ - Step 97138: {'lr': 0.00014155707029737407, 'samples': 18650496, 'steps': 97137, 'loss/train': 1.5034526586532593} 11/07/2021 10:53:10 - INFO - __main__ - Step 97139: {'lr': 0.00014155228882412613, 'samples': 18650688, 'steps': 97138, 'loss/train': 1.2193069458007812} 11/07/2021 10:53:11 - INFO - __main__ - Step 97140: {'lr': 0.00014154750739974305, 'samples': 18650880, 'steps': 97139, 'loss/train': 1.483091950416565} 11/07/2021 10:53:11 - INFO - __main__ - Step 97141: {'lr': 0.000141542726024227, 'samples': 18651072, 'steps': 97140, 'loss/train': 1.4997375011444092} 11/07/2021 10:53:11 - INFO - __main__ - Step 97142: {'lr': 0.00014153794469758013, 'samples': 18651264, 'steps': 97141, 'loss/train': 1.4099162817001343} 11/07/2021 10:53:12 - INFO - __main__ - Step 97143: {'lr': 0.00014153316341980465, 'samples': 18651456, 'steps': 97142, 'loss/train': 0.5445997714996338} 11/07/2021 10:53:13 - INFO - __main__ - Step 97144: {'lr': 0.00014152838219090257, 'samples': 18651648, 'steps': 97143, 'loss/train': 1.5449503660202026} 11/07/2021 10:53:13 - INFO - __main__ - Step 97145: {'lr': 0.00014152360101087614, 'samples': 18651840, 'steps': 97144, 'loss/train': 1.3773393630981445} 11/07/2021 10:53:14 - INFO - __main__ - Step 97146: {'lr': 0.00014151881987972751, 'samples': 18652032, 'steps': 97145, 'loss/train': 0.3157970607280731} 11/07/2021 10:53:14 - INFO - __main__ - Step 97147: {'lr': 0.00014151403879745882, 'samples': 18652224, 'steps': 97146, 'loss/train': 1.3475518226623535} 11/07/2021 10:53:14 - INFO - __main__ - Step 97148: {'lr': 0.0001415092577640722, 'samples': 18652416, 'steps': 97147, 'loss/train': 1.8809878826141357} 11/07/2021 10:53:15 - INFO - __main__ - Step 97149: {'lr': 0.00014150447677956988, 'samples': 18652608, 'steps': 97148, 'loss/train': 1.2152049541473389} 11/07/2021 10:53:16 - INFO - __main__ - Step 97150: {'lr': 0.00014149969584395394, 'samples': 18652800, 'steps': 97149, 'loss/train': 1.529721975326538} 11/07/2021 10:53:16 - INFO - __main__ - Step 97151: {'lr': 0.00014149491495722656, 'samples': 18652992, 'steps': 97150, 'loss/train': 0.5738968253135681} 11/07/2021 10:53:16 - INFO - __main__ - Step 97152: {'lr': 0.0001414901341193899, 'samples': 18653184, 'steps': 97151, 'loss/train': 0.9989297389984131} 11/07/2021 10:53:17 - INFO - __main__ - Step 97153: {'lr': 0.00014148535333044612, 'samples': 18653376, 'steps': 97152, 'loss/train': 0.040355224162340164} 11/07/2021 10:53:18 - INFO - __main__ - Step 97154: {'lr': 0.00014148057259039736, 'samples': 18653568, 'steps': 97153, 'loss/train': 1.4408671855926514} 11/07/2021 10:53:18 - INFO - __main__ - Step 97155: {'lr': 0.0001414757918992458, 'samples': 18653760, 'steps': 97154, 'loss/train': 1.759022831916809} 11/07/2021 10:53:18 - INFO - __main__ - Step 97156: {'lr': 0.00014147101125699355, 'samples': 18653952, 'steps': 97155, 'loss/train': 1.4514158964157104} 11/07/2021 10:53:19 - INFO - __main__ - Step 97157: {'lr': 0.0001414662306636429, 'samples': 18654144, 'steps': 97156, 'loss/train': 1.2622238397598267} 11/07/2021 10:53:19 - INFO - __main__ - Step 97158: {'lr': 0.00014146145011919575, 'samples': 18654336, 'steps': 97157, 'loss/train': 0.36365872621536255} 11/07/2021 10:53:20 - INFO - __main__ - Step 97159: {'lr': 0.00014145666962365444, 'samples': 18654528, 'steps': 97158, 'loss/train': 1.59822678565979} 11/07/2021 10:53:21 - INFO - __main__ - Step 97160: {'lr': 0.00014145188917702106, 'samples': 18654720, 'steps': 97159, 'loss/train': 1.3061034679412842} 11/07/2021 10:53:21 - INFO - __main__ - Step 97161: {'lr': 0.0001414471087792978, 'samples': 18654912, 'steps': 97160, 'loss/train': 1.5278687477111816} 11/07/2021 10:53:21 - INFO - __main__ - Step 97162: {'lr': 0.00014144232843048683, 'samples': 18655104, 'steps': 97161, 'loss/train': 1.5679924488067627} 11/07/2021 10:53:22 - INFO - __main__ - Step 97163: {'lr': 0.00014143754813059021, 'samples': 18655296, 'steps': 97162, 'loss/train': 1.1820178031921387} 11/07/2021 10:53:23 - INFO - __main__ - Step 97164: {'lr': 0.00014143276787961017, 'samples': 18655488, 'steps': 97163, 'loss/train': 1.3523889780044556} 11/07/2021 10:53:23 - INFO - __main__ - Step 97165: {'lr': 0.00014142798767754886, 'samples': 18655680, 'steps': 97164, 'loss/train': 0.5203116536140442} 11/07/2021 10:53:23 - INFO - __main__ - Step 97166: {'lr': 0.00014142320752440842, 'samples': 18655872, 'steps': 97165, 'loss/train': 1.3836369514465332} 11/07/2021 10:53:24 - INFO - __main__ - Step 97167: {'lr': 0.00014141842742019102, 'samples': 18656064, 'steps': 97166, 'loss/train': 1.0270535945892334} 11/07/2021 10:53:24 - INFO - __main__ - Step 97168: {'lr': 0.00014141364736489878, 'samples': 18656256, 'steps': 97167, 'loss/train': 1.2985206842422485} 11/07/2021 10:53:25 - INFO - __main__ - Step 97169: {'lr': 0.00014140886735853386, 'samples': 18656448, 'steps': 97168, 'loss/train': 1.4899989366531372} 11/07/2021 10:53:26 - INFO - __main__ - Step 97170: {'lr': 0.0001414040874010986, 'samples': 18656640, 'steps': 97169, 'loss/train': 1.7568604946136475} 11/07/2021 10:53:26 - INFO - __main__ - Step 97171: {'lr': 0.00014139930749259484, 'samples': 18656832, 'steps': 97170, 'loss/train': 1.169816493988037} 11/07/2021 10:53:26 - INFO - __main__ - Step 97172: {'lr': 0.00014139452763302485, 'samples': 18657024, 'steps': 97171, 'loss/train': 1.9697881937026978} 11/07/2021 10:53:27 - INFO - __main__ - Step 97173: {'lr': 0.00014138974782239083, 'samples': 18657216, 'steps': 97172, 'loss/train': 0.629660964012146} 11/07/2021 10:53:27 - INFO - __main__ - Step 97174: {'lr': 0.0001413849680606949, 'samples': 18657408, 'steps': 97173, 'loss/train': 0.05560450255870819} 11/07/2021 10:53:28 - INFO - __main__ - Step 97175: {'lr': 0.00014138018834793925, 'samples': 18657600, 'steps': 97174, 'loss/train': 1.6251940727233887} 11/07/2021 10:53:28 - INFO - __main__ - Step 97176: {'lr': 0.00014137540868412602, 'samples': 18657792, 'steps': 97175, 'loss/train': 1.373569369316101} 11/07/2021 10:53:29 - INFO - __main__ - Step 97177: {'lr': 0.00014137062906925733, 'samples': 18657984, 'steps': 97176, 'loss/train': 1.3590412139892578} 11/07/2021 10:53:29 - INFO - __main__ - Step 97178: {'lr': 0.00014136584950333536, 'samples': 18658176, 'steps': 97177, 'loss/train': 1.343339204788208} 11/07/2021 10:53:29 - INFO - __main__ - Step 97179: {'lr': 0.00014136106998636228, 'samples': 18658368, 'steps': 97178, 'loss/train': 1.4410829544067383} 11/07/2021 10:53:30 - INFO - __main__ - Step 97180: {'lr': 0.0001413562905183402, 'samples': 18658560, 'steps': 97179, 'loss/train': 0.6880878806114197} 11/07/2021 10:53:31 - INFO - __main__ - Step 97181: {'lr': 0.0001413515110992713, 'samples': 18658752, 'steps': 97180, 'loss/train': 1.5160058736801147} 11/07/2021 10:53:31 - INFO - __main__ - Step 97182: {'lr': 0.00014134673172915777, 'samples': 18658944, 'steps': 97181, 'loss/train': 1.3335849046707153} 11/07/2021 10:53:31 - INFO - __main__ - Step 97183: {'lr': 0.00014134195240800168, 'samples': 18659136, 'steps': 97182, 'loss/train': 1.0200544595718384} 11/07/2021 10:53:32 - INFO - __main__ - Step 97184: {'lr': 0.00014133717313580534, 'samples': 18659328, 'steps': 97183, 'loss/train': 1.4884105920791626} 11/07/2021 10:53:33 - INFO - __main__ - Step 97185: {'lr': 0.00014133239391257076, 'samples': 18659520, 'steps': 97184, 'loss/train': 0.9859971404075623} 11/07/2021 10:53:33 - INFO - __main__ - Step 97186: {'lr': 0.00014132761473830002, 'samples': 18659712, 'steps': 97185, 'loss/train': 0.991918683052063} 11/07/2021 10:53:33 - INFO - __main__ - Step 97187: {'lr': 0.00014132283561299548, 'samples': 18659904, 'steps': 97186, 'loss/train': 1.1762229204177856} 11/07/2021 10:53:34 - INFO - __main__ - Step 97188: {'lr': 0.00014131805653665912, 'samples': 18660096, 'steps': 97187, 'loss/train': 1.3427823781967163} 11/07/2021 10:53:34 - INFO - __main__ - Step 97189: {'lr': 0.0001413132775092932, 'samples': 18660288, 'steps': 97188, 'loss/train': 1.4302902221679688} 11/07/2021 10:53:35 - INFO - __main__ - Step 97190: {'lr': 0.00014130849853089984, 'samples': 18660480, 'steps': 97189, 'loss/train': 1.366604208946228} 11/07/2021 10:53:36 - INFO - __main__ - Step 97191: {'lr': 0.00014130371960148117, 'samples': 18660672, 'steps': 97190, 'loss/train': 1.6745973825454712} 11/07/2021 10:53:36 - INFO - __main__ - Step 97192: {'lr': 0.00014129894072103938, 'samples': 18660864, 'steps': 97191, 'loss/train': 1.9897103309631348} 11/07/2021 10:53:36 - INFO - __main__ - Step 97193: {'lr': 0.00014129416188957662, 'samples': 18661056, 'steps': 97192, 'loss/train': 2.194835901260376} 11/07/2021 10:53:37 - INFO - __main__ - Step 97194: {'lr': 0.000141289383107095, 'samples': 18661248, 'steps': 97193, 'loss/train': 1.344855785369873} 11/07/2021 10:53:38 - INFO - __main__ - Step 97195: {'lr': 0.00014128460437359675, 'samples': 18661440, 'steps': 97194, 'loss/train': 1.570114254951477} 11/07/2021 10:53:38 - INFO - __main__ - Step 97196: {'lr': 0.00014127982568908393, 'samples': 18661632, 'steps': 97195, 'loss/train': 1.5002670288085938} 11/07/2021 10:53:38 - INFO - __main__ - Step 97197: {'lr': 0.0001412750470535589, 'samples': 18661824, 'steps': 97196, 'loss/train': 1.4482625722885132} 11/07/2021 10:53:39 - INFO - __main__ - Step 97198: {'lr': 0.00014127026846702352, 'samples': 18662016, 'steps': 97197, 'loss/train': 1.4909709692001343} 11/07/2021 10:53:39 - INFO - __main__ - Step 97199: {'lr': 0.00014126548992948008, 'samples': 18662208, 'steps': 97198, 'loss/train': 1.652223825454712} 11/07/2021 10:53:40 - INFO - __main__ - Step 97200: {'lr': 0.00014126071144093076, 'samples': 18662400, 'steps': 97199, 'loss/train': 1.0166797637939453} 11/07/2021 10:53:41 - INFO - __main__ - Step 97201: {'lr': 0.00014125593300137764, 'samples': 18662592, 'steps': 97200, 'loss/train': 1.4610958099365234} 11/07/2021 10:53:41 - INFO - __main__ - Step 97202: {'lr': 0.00014125115461082293, 'samples': 18662784, 'steps': 97201, 'loss/train': 1.486548662185669} 11/07/2021 10:53:41 - INFO - __main__ - Step 97203: {'lr': 0.00014124637626926882, 'samples': 18662976, 'steps': 97202, 'loss/train': 1.588222622871399} 11/07/2021 10:53:42 - INFO - __main__ - Step 97204: {'lr': 0.00014124159797671736, 'samples': 18663168, 'steps': 97203, 'loss/train': 1.170328140258789} 11/07/2021 10:53:43 - INFO - __main__ - Step 97205: {'lr': 0.0001412368197331708, 'samples': 18663360, 'steps': 97204, 'loss/train': 1.513592004776001} 11/07/2021 10:53:43 - INFO - __main__ - Step 97206: {'lr': 0.00014123204153863124, 'samples': 18663552, 'steps': 97205, 'loss/train': 1.460324764251709} 11/07/2021 10:53:43 - INFO - __main__ - Step 97207: {'lr': 0.00014122726339310082, 'samples': 18663744, 'steps': 97206, 'loss/train': 1.8560230731964111} 11/07/2021 10:53:44 - INFO - __main__ - Step 97208: {'lr': 0.0001412224852965817, 'samples': 18663936, 'steps': 97207, 'loss/train': 1.131264567375183} 11/07/2021 10:53:44 - INFO - __main__ - Step 97209: {'lr': 0.00014121770724907613, 'samples': 18664128, 'steps': 97208, 'loss/train': 1.258697509765625} 11/07/2021 10:53:45 - INFO - __main__ - Step 97210: {'lr': 0.0001412129292505861, 'samples': 18664320, 'steps': 97209, 'loss/train': 1.419224500656128} 11/07/2021 10:53:46 - INFO - __main__ - Step 97211: {'lr': 0.00014120815130111398, 'samples': 18664512, 'steps': 97210, 'loss/train': 1.6946969032287598} 11/07/2021 10:53:46 - INFO - __main__ - Step 97212: {'lr': 0.0001412033734006617, 'samples': 18664704, 'steps': 97211, 'loss/train': 1.471980333328247} 11/07/2021 10:53:46 - INFO - __main__ - Step 97213: {'lr': 0.00014119859554923147, 'samples': 18664896, 'steps': 97212, 'loss/train': 2.2173166275024414} 11/07/2021 10:53:47 - INFO - __main__ - Step 97214: {'lr': 0.00014119381774682548, 'samples': 18665088, 'steps': 97213, 'loss/train': 0.446033775806427} 11/07/2021 10:53:47 - INFO - __main__ - Step 97215: {'lr': 0.0001411890399934459, 'samples': 18665280, 'steps': 97214, 'loss/train': 1.5435478687286377} 11/07/2021 10:53:48 - INFO - __main__ - Step 97216: {'lr': 0.00014118426228909486, 'samples': 18665472, 'steps': 97215, 'loss/train': 1.0847662687301636} 11/07/2021 10:53:49 - INFO - __main__ - Step 97217: {'lr': 0.0001411794846337745, 'samples': 18665664, 'steps': 97216, 'loss/train': 1.22297203540802} 11/07/2021 10:53:49 - INFO - __main__ - Step 97218: {'lr': 0.00014117470702748697, 'samples': 18665856, 'steps': 97217, 'loss/train': 2.4537570476531982} 11/07/2021 10:53:49 - INFO - __main__ - Step 97219: {'lr': 0.00014116992947023444, 'samples': 18666048, 'steps': 97218, 'loss/train': 1.458282470703125} 11/07/2021 10:53:50 - INFO - __main__ - Step 97220: {'lr': 0.0001411651519620191, 'samples': 18666240, 'steps': 97219, 'loss/train': 1.1309168338775635} 11/07/2021 10:53:51 - INFO - __main__ - Step 97221: {'lr': 0.00014116037450284303, 'samples': 18666432, 'steps': 97220, 'loss/train': 1.4374829530715942} 11/07/2021 10:53:51 - INFO - __main__ - Step 97222: {'lr': 0.00014115559709270843, 'samples': 18666624, 'steps': 97221, 'loss/train': 1.3310604095458984} 11/07/2021 10:53:52 - INFO - __main__ - Step 97223: {'lr': 0.00014115081973161743, 'samples': 18666816, 'steps': 97222, 'loss/train': 1.269149661064148} 11/07/2021 10:53:52 - INFO - __main__ - Step 97224: {'lr': 0.00014114604241957226, 'samples': 18667008, 'steps': 97223, 'loss/train': 1.310687780380249} 11/07/2021 10:53:52 - INFO - __main__ - Step 97225: {'lr': 0.00014114126515657493, 'samples': 18667200, 'steps': 97224, 'loss/train': 0.6575723886489868} 11/07/2021 10:53:53 - INFO - __main__ - Step 97226: {'lr': 0.00014113648794262767, 'samples': 18667392, 'steps': 97225, 'loss/train': 1.1741058826446533} 11/07/2021 10:53:54 - INFO - __main__ - Step 97227: {'lr': 0.00014113171077773267, 'samples': 18667584, 'steps': 97226, 'loss/train': 1.6211436986923218} 11/07/2021 10:53:54 - INFO - __main__ - Step 97228: {'lr': 0.00014112693366189196, 'samples': 18667776, 'steps': 97227, 'loss/train': 1.3363929986953735} 11/07/2021 10:53:54 - INFO - __main__ - Step 97229: {'lr': 0.00014112215659510782, 'samples': 18667968, 'steps': 97228, 'loss/train': 0.9704346656799316} 11/07/2021 10:53:55 - INFO - __main__ - Step 97230: {'lr': 0.00014111737957738237, 'samples': 18668160, 'steps': 97229, 'loss/train': 1.3703559637069702} 11/07/2021 10:53:57 - INFO - __main__ - Step 97231: {'lr': 0.00014111260260871771, 'samples': 18668352, 'steps': 97230, 'loss/train': 1.267173171043396} 11/07/2021 10:53:58 - INFO - __main__ - Step 97232: {'lr': 0.00014110782568911605, 'samples': 18668544, 'steps': 97231, 'loss/train': 1.768989086151123} 11/07/2021 10:53:58 - INFO - __main__ - Step 97233: {'lr': 0.00014110304881857956, 'samples': 18668736, 'steps': 97232, 'loss/train': 1.7568644285202026} 11/07/2021 10:53:58 - INFO - __main__ - Step 97234: {'lr': 0.00014109827199711028, 'samples': 18668928, 'steps': 97233, 'loss/train': 1.7570271492004395} 11/07/2021 10:53:59 - INFO - __main__ - Step 97235: {'lr': 0.00014109349522471048, 'samples': 18669120, 'steps': 97234, 'loss/train': 1.7587710618972778} 11/07/2021 10:53:59 - INFO - __main__ - Step 97236: {'lr': 0.00014108871850138227, 'samples': 18669312, 'steps': 97235, 'loss/train': 1.1312425136566162} 11/07/2021 10:53:59 - INFO - __main__ - Step 97237: {'lr': 0.0001410839418271278, 'samples': 18669504, 'steps': 97236, 'loss/train': 1.1843026876449585} 11/07/2021 10:54:00 - INFO - __main__ - Step 97238: {'lr': 0.00014107916520194932, 'samples': 18669696, 'steps': 97237, 'loss/train': 1.605806827545166} 11/07/2021 10:54:01 - INFO - __main__ - Step 97239: {'lr': 0.00014107438862584883, 'samples': 18669888, 'steps': 97238, 'loss/train': 1.0674402713775635} 11/07/2021 10:54:01 - INFO - __main__ - Step 97240: {'lr': 0.00014106961209882845, 'samples': 18670080, 'steps': 97239, 'loss/train': 1.4443695545196533} 11/07/2021 10:54:01 - INFO - __main__ - Step 97241: {'lr': 0.0001410648356208905, 'samples': 18670272, 'steps': 97240, 'loss/train': 1.183830976486206} 11/07/2021 10:54:02 - INFO - __main__ - Step 97242: {'lr': 0.00014106005919203702, 'samples': 18670464, 'steps': 97241, 'loss/train': 1.3071393966674805} 11/07/2021 10:54:02 - INFO - __main__ - Step 97243: {'lr': 0.0001410552828122702, 'samples': 18670656, 'steps': 97242, 'loss/train': 1.0516068935394287} 11/07/2021 10:54:03 - INFO - __main__ - Step 97244: {'lr': 0.0001410505064815922, 'samples': 18670848, 'steps': 97243, 'loss/train': 1.4050132036209106} 11/07/2021 10:54:03 - INFO - __main__ - Step 97245: {'lr': 0.00014104573020000516, 'samples': 18671040, 'steps': 97244, 'loss/train': 1.428529143333435} 11/07/2021 10:54:04 - INFO - __main__ - Step 97246: {'lr': 0.0001410409539675112, 'samples': 18671232, 'steps': 97245, 'loss/train': 1.4804877042770386} 11/07/2021 10:54:04 - INFO - __main__ - Step 97247: {'lr': 0.00014103617778411253, 'samples': 18671424, 'steps': 97246, 'loss/train': 1.4037420749664307} 11/07/2021 10:54:04 - INFO - __main__ - Step 97248: {'lr': 0.00014103140164981132, 'samples': 18671616, 'steps': 97247, 'loss/train': 1.4251792430877686} 11/07/2021 10:54:05 - INFO - __main__ - Step 97249: {'lr': 0.0001410266255646096, 'samples': 18671808, 'steps': 97248, 'loss/train': 1.4252629280090332} 11/07/2021 10:54:06 - INFO - __main__ - Step 97250: {'lr': 0.00014102184952850965, 'samples': 18672000, 'steps': 97249, 'loss/train': 1.3631038665771484} 11/07/2021 10:54:06 - INFO - __main__ - Step 97251: {'lr': 0.00014101707354151365, 'samples': 18672192, 'steps': 97250, 'loss/train': 1.3624248504638672} 11/07/2021 10:54:06 - INFO - __main__ - Step 97252: {'lr': 0.0001410122976036236, 'samples': 18672384, 'steps': 97251, 'loss/train': 0.5840388536453247} 11/07/2021 10:54:07 - INFO - __main__ - Step 97253: {'lr': 0.00014100752171484172, 'samples': 18672576, 'steps': 97252, 'loss/train': 1.5904892683029175} 11/07/2021 10:54:08 - INFO - __main__ - Step 97254: {'lr': 0.00014100274587517016, 'samples': 18672768, 'steps': 97253, 'loss/train': 1.4942902326583862} 11/07/2021 10:54:09 - INFO - __main__ - Step 97255: {'lr': 0.00014099797008461108, 'samples': 18672960, 'steps': 97254, 'loss/train': 0.2327282577753067} 11/07/2021 10:54:09 - INFO - __main__ - Step 97256: {'lr': 0.00014099319434316665, 'samples': 18673152, 'steps': 97255, 'loss/train': 1.4512337446212769} 11/07/2021 10:54:09 - INFO - __main__ - Step 97257: {'lr': 0.00014098841865083897, 'samples': 18673344, 'steps': 97256, 'loss/train': 0.7555103898048401} 11/07/2021 10:54:10 - INFO - __main__ - Step 97258: {'lr': 0.00014098364300763026, 'samples': 18673536, 'steps': 97257, 'loss/train': 1.3726600408554077} 11/07/2021 10:54:11 - INFO - __main__ - Step 97259: {'lr': 0.0001409788674135426, 'samples': 18673728, 'steps': 97258, 'loss/train': 0.5448576807975769} 11/07/2021 10:54:11 - INFO - __main__ - Step 97260: {'lr': 0.00014097409186857824, 'samples': 18673920, 'steps': 97259, 'loss/train': 1.4209706783294678} 11/07/2021 10:54:11 - INFO - __main__ - Step 97261: {'lr': 0.00014096931637273922, 'samples': 18674112, 'steps': 97260, 'loss/train': 1.048996090888977} 11/07/2021 10:54:12 - INFO - __main__ - Step 97262: {'lr': 0.00014096454092602775, 'samples': 18674304, 'steps': 97261, 'loss/train': 1.2779291868209839} 11/07/2021 10:54:12 - INFO - __main__ - Step 97263: {'lr': 0.000140959765528446, 'samples': 18674496, 'steps': 97262, 'loss/train': 1.103119969367981} 11/07/2021 10:54:12 - INFO - __main__ - Step 97264: {'lr': 0.0001409549901799962, 'samples': 18674688, 'steps': 97263, 'loss/train': 0.9447553157806396} 11/07/2021 10:54:13 - INFO - __main__ - Step 97265: {'lr': 0.00014095021488068026, 'samples': 18674880, 'steps': 97264, 'loss/train': 1.9935020208358765} 11/07/2021 10:54:14 - INFO - __main__ - Step 97266: {'lr': 0.0001409454396305005, 'samples': 18675072, 'steps': 97265, 'loss/train': 1.3314523696899414} 11/07/2021 10:54:14 - INFO - __main__ - Step 97267: {'lr': 0.00014094066442945903, 'samples': 18675264, 'steps': 97266, 'loss/train': 1.5446460247039795} 11/07/2021 10:54:14 - INFO - __main__ - Step 97268: {'lr': 0.00014093588927755802, 'samples': 18675456, 'steps': 97267, 'loss/train': 1.470815658569336} 11/07/2021 10:54:15 - INFO - __main__ - Step 97269: {'lr': 0.0001409311141747996, 'samples': 18675648, 'steps': 97268, 'loss/train': 0.9994629621505737} 11/07/2021 10:54:16 - INFO - __main__ - Step 97270: {'lr': 0.00014092633912118595, 'samples': 18675840, 'steps': 97269, 'loss/train': 1.114827275276184} 11/07/2021 10:54:16 - INFO - __main__ - Step 97271: {'lr': 0.0001409215641167192, 'samples': 18676032, 'steps': 97270, 'loss/train': 1.3475080728530884} 11/07/2021 10:54:17 - INFO - __main__ - Step 97272: {'lr': 0.00014091678916140153, 'samples': 18676224, 'steps': 97271, 'loss/train': 1.473704218864441} 11/07/2021 10:54:17 - INFO - __main__ - Step 97273: {'lr': 0.00014091201425523505, 'samples': 18676416, 'steps': 97272, 'loss/train': 0.9233793616294861} 11/07/2021 10:54:17 - INFO - __main__ - Step 97274: {'lr': 0.00014090723939822196, 'samples': 18676608, 'steps': 97273, 'loss/train': 1.89191472530365} 11/07/2021 10:54:18 - INFO - __main__ - Step 97275: {'lr': 0.00014090246459036435, 'samples': 18676800, 'steps': 97274, 'loss/train': 1.7350107431411743} 11/07/2021 10:54:19 - INFO - __main__ - Step 97276: {'lr': 0.00014089768983166444, 'samples': 18676992, 'steps': 97275, 'loss/train': 1.209798812866211} 11/07/2021 10:54:19 - INFO - __main__ - Step 97277: {'lr': 0.0001408929151221243, 'samples': 18677184, 'steps': 97276, 'loss/train': 1.4837254285812378} 11/07/2021 10:54:19 - INFO - __main__ - Step 97278: {'lr': 0.00014088814046174628, 'samples': 18677376, 'steps': 97277, 'loss/train': 1.25111985206604} 11/07/2021 10:54:20 - INFO - __main__ - Step 97279: {'lr': 0.00014088336585053223, 'samples': 18677568, 'steps': 97278, 'loss/train': 1.578052282333374} 11/07/2021 10:54:21 - INFO - __main__ - Step 97280: {'lr': 0.00014087859128848453, 'samples': 18677760, 'steps': 97279, 'loss/train': 1.3724349737167358} 11/07/2021 10:54:21 - INFO - __main__ - Step 97281: {'lr': 0.00014087381677560518, 'samples': 18677952, 'steps': 97280, 'loss/train': 0.1401650607585907} 11/07/2021 10:54:21 - INFO - __main__ - Step 97282: {'lr': 0.00014086904231189643, 'samples': 18678144, 'steps': 97281, 'loss/train': 1.4872071743011475} 11/07/2021 10:54:22 - INFO - __main__ - Step 97283: {'lr': 0.0001408642678973604, 'samples': 18678336, 'steps': 97282, 'loss/train': 0.8928919434547424} 11/07/2021 10:54:22 - INFO - __main__ - Step 97284: {'lr': 0.00014085949353199925, 'samples': 18678528, 'steps': 97283, 'loss/train': 1.6594539880752563} 11/07/2021 10:54:23 - INFO - __main__ - Step 97285: {'lr': 0.00014085471921581515, 'samples': 18678720, 'steps': 97284, 'loss/train': 1.4184894561767578} 11/07/2021 10:54:24 - INFO - __main__ - Step 97286: {'lr': 0.0001408499449488102, 'samples': 18678912, 'steps': 97285, 'loss/train': 1.6220651865005493} 11/07/2021 10:54:24 - INFO - __main__ - Step 97287: {'lr': 0.00014084517073098657, 'samples': 18679104, 'steps': 97286, 'loss/train': 0.9388483166694641} 11/07/2021 10:54:24 - INFO - __main__ - Step 97288: {'lr': 0.00014084039656234642, 'samples': 18679296, 'steps': 97287, 'loss/train': 1.3478325605392456} 11/07/2021 10:54:25 - INFO - __main__ - Step 97289: {'lr': 0.00014083562244289195, 'samples': 18679488, 'steps': 97288, 'loss/train': 1.3675532341003418} 11/07/2021 10:54:25 - INFO - __main__ - Step 97290: {'lr': 0.0001408308483726252, 'samples': 18679680, 'steps': 97289, 'loss/train': 1.7481321096420288} 11/07/2021 10:54:26 - INFO - __main__ - Step 97291: {'lr': 0.00014082607435154856, 'samples': 18679872, 'steps': 97290, 'loss/train': 1.711805820465088} 11/07/2021 10:54:26 - INFO - __main__ - Step 97292: {'lr': 0.00014082130037966386, 'samples': 18680064, 'steps': 97291, 'loss/train': 1.0215696096420288} 11/07/2021 10:54:27 - INFO - __main__ - Step 97293: {'lr': 0.0001408165264569734, 'samples': 18680256, 'steps': 97292, 'loss/train': 0.9504858255386353} 11/07/2021 10:54:27 - INFO - __main__ - Step 97294: {'lr': 0.00014081175258347933, 'samples': 18680448, 'steps': 97293, 'loss/train': 1.3886889219284058} 11/07/2021 10:54:28 - INFO - __main__ - Step 97295: {'lr': 0.00014080697875918383, 'samples': 18680640, 'steps': 97294, 'loss/train': 0.9733586311340332} 11/07/2021 10:54:29 - INFO - __main__ - Step 97296: {'lr': 0.00014080220498408896, 'samples': 18680832, 'steps': 97295, 'loss/train': 1.5000560283660889} 11/07/2021 10:54:29 - INFO - __main__ - Step 97297: {'lr': 0.000140797431258197, 'samples': 18681024, 'steps': 97296, 'loss/train': 1.5802682638168335} 11/07/2021 10:54:29 - INFO - __main__ - Step 97298: {'lr': 0.00014079265758150999, 'samples': 18681216, 'steps': 97297, 'loss/train': 1.796121597290039} 11/07/2021 10:54:30 - INFO - __main__ - Step 97299: {'lr': 0.00014078788395403014, 'samples': 18681408, 'steps': 97298, 'loss/train': 1.1879709959030151} 11/07/2021 10:54:30 - INFO - __main__ - Step 97300: {'lr': 0.0001407831103757596, 'samples': 18681600, 'steps': 97299, 'loss/train': 1.1459978818893433} 11/07/2021 10:54:31 - INFO - __main__ - Step 97301: {'lr': 0.00014077833684670045, 'samples': 18681792, 'steps': 97300, 'loss/train': 1.4396120309829712} 11/07/2021 10:54:31 - INFO - __main__ - Step 97302: {'lr': 0.00014077356336685503, 'samples': 18681984, 'steps': 97301, 'loss/train': 1.4633322954177856} 11/07/2021 10:54:32 - INFO - __main__ - Step 97303: {'lr': 0.00014076878993622526, 'samples': 18682176, 'steps': 97302, 'loss/train': 1.2483277320861816} 11/07/2021 10:54:32 - INFO - __main__ - Step 97304: {'lr': 0.00014076401655481336, 'samples': 18682368, 'steps': 97303, 'loss/train': 1.325385570526123} 11/07/2021 10:54:32 - INFO - __main__ - Step 97305: {'lr': 0.00014075924322262155, 'samples': 18682560, 'steps': 97304, 'loss/train': 1.086150884628296} 11/07/2021 10:54:34 - INFO - __main__ - Step 97306: {'lr': 0.0001407544699396519, 'samples': 18682752, 'steps': 97305, 'loss/train': 1.2253395318984985} 11/07/2021 10:54:34 - INFO - __main__ - Step 97307: {'lr': 0.00014074969670590663, 'samples': 18682944, 'steps': 97306, 'loss/train': 1.2742822170257568} 11/07/2021 10:54:34 - INFO - __main__ - Step 97308: {'lr': 0.00014074492352138786, 'samples': 18683136, 'steps': 97307, 'loss/train': 1.4703775644302368} 11/07/2021 10:54:35 - INFO - __main__ - Step 97309: {'lr': 0.0001407401503860977, 'samples': 18683328, 'steps': 97308, 'loss/train': 0.23013313114643097} 11/07/2021 10:54:35 - INFO - __main__ - Step 97310: {'lr': 0.0001407353773000384, 'samples': 18683520, 'steps': 97309, 'loss/train': 1.4899193048477173} 11/07/2021 10:54:36 - INFO - __main__ - Step 97311: {'lr': 0.00014073060426321202, 'samples': 18683712, 'steps': 97310, 'loss/train': 1.284063458442688} 11/07/2021 10:54:36 - INFO - __main__ - Step 97312: {'lr': 0.00014072583127562084, 'samples': 18683904, 'steps': 97311, 'loss/train': 1.1242194175720215} 11/07/2021 10:54:37 - INFO - __main__ - Step 97313: {'lr': 0.00014072105833726683, 'samples': 18684096, 'steps': 97312, 'loss/train': 0.9433461427688599} 11/07/2021 10:54:37 - INFO - __main__ - Step 97314: {'lr': 0.00014071628544815224, 'samples': 18684288, 'steps': 97313, 'loss/train': 1.251171588897705} 11/07/2021 10:54:37 - INFO - __main__ - Step 97315: {'lr': 0.00014071151260827916, 'samples': 18684480, 'steps': 97314, 'loss/train': 1.046154260635376} 11/07/2021 10:54:38 - INFO - __main__ - Step 97316: {'lr': 0.00014070673981764981, 'samples': 18684672, 'steps': 97315, 'loss/train': 1.4424760341644287} 11/07/2021 10:54:39 - INFO - __main__ - Step 97317: {'lr': 0.0001407019670762663, 'samples': 18684864, 'steps': 97316, 'loss/train': 1.7423644065856934} 11/07/2021 10:54:39 - INFO - __main__ - Step 97318: {'lr': 0.00014069719438413085, 'samples': 18685056, 'steps': 97317, 'loss/train': 0.7762000560760498} 11/07/2021 10:54:39 - INFO - __main__ - Step 97319: {'lr': 0.00014069242174124554, 'samples': 18685248, 'steps': 97318, 'loss/train': 1.3707094192504883} 11/07/2021 10:54:40 - INFO - __main__ - Step 97320: {'lr': 0.0001406876491476125, 'samples': 18685440, 'steps': 97319, 'loss/train': 1.1660076379776} 11/07/2021 10:54:41 - INFO - __main__ - Step 97321: {'lr': 0.00014068287660323392, 'samples': 18685632, 'steps': 97320, 'loss/train': 1.6355195045471191} 11/07/2021 10:54:41 - INFO - __main__ - Step 97322: {'lr': 0.00014067810410811198, 'samples': 18685824, 'steps': 97321, 'loss/train': 0.8503224849700928} 11/07/2021 10:54:41 - INFO - __main__ - Step 97323: {'lr': 0.0001406733316622489, 'samples': 18686016, 'steps': 97322, 'loss/train': 1.1131600141525269} 11/07/2021 10:54:42 - INFO - __main__ - Step 97324: {'lr': 0.00014066855926564659, 'samples': 18686208, 'steps': 97323, 'loss/train': 1.2552061080932617} 11/07/2021 10:54:42 - INFO - __main__ - Step 97325: {'lr': 0.0001406637869183074, 'samples': 18686400, 'steps': 97324, 'loss/train': 1.4354274272918701} 11/07/2021 10:54:43 - INFO - __main__ - Step 97326: {'lr': 0.00014065901462023336, 'samples': 18686592, 'steps': 97325, 'loss/train': 1.3566521406173706} 11/07/2021 10:54:44 - INFO - __main__ - Step 97327: {'lr': 0.0001406542423714267, 'samples': 18686784, 'steps': 97326, 'loss/train': 1.1516444683074951} 11/07/2021 10:54:44 - INFO - __main__ - Step 97328: {'lr': 0.00014064947017188956, 'samples': 18686976, 'steps': 97327, 'loss/train': 1.2287802696228027} 11/07/2021 10:54:44 - INFO - __main__ - Step 97329: {'lr': 0.0001406446980216241, 'samples': 18687168, 'steps': 97328, 'loss/train': 1.4546494483947754} 11/07/2021 10:54:45 - INFO - __main__ - Step 97330: {'lr': 0.0001406399259206324, 'samples': 18687360, 'steps': 97329, 'loss/train': 1.7638511657714844} 11/07/2021 10:54:45 - INFO - __main__ - Step 97331: {'lr': 0.00014063515386891672, 'samples': 18687552, 'steps': 97330, 'loss/train': 1.1211371421813965} 11/07/2021 10:54:46 - INFO - __main__ - Step 97332: {'lr': 0.00014063038186647913, 'samples': 18687744, 'steps': 97331, 'loss/train': 1.2985637187957764} 11/07/2021 10:54:46 - INFO - __main__ - Step 97333: {'lr': 0.0001406256099133218, 'samples': 18687936, 'steps': 97332, 'loss/train': 1.3164901733398438} 11/07/2021 10:54:47 - INFO - __main__ - Step 97334: {'lr': 0.00014062083800944698, 'samples': 18688128, 'steps': 97333, 'loss/train': 1.074642539024353} 11/07/2021 10:54:47 - INFO - __main__ - Step 97335: {'lr': 0.00014061606615485661, 'samples': 18688320, 'steps': 97334, 'loss/train': 1.7666833400726318} 11/07/2021 10:54:47 - INFO - __main__ - Step 97336: {'lr': 0.00014061129434955296, 'samples': 18688512, 'steps': 97335, 'loss/train': 1.218088984489441} 11/07/2021 10:54:49 - INFO - __main__ - Step 97337: {'lr': 0.00014060652259353817, 'samples': 18688704, 'steps': 97336, 'loss/train': 0.6257885098457336} 11/07/2021 10:54:49 - INFO - __main__ - Step 97338: {'lr': 0.00014060175088681441, 'samples': 18688896, 'steps': 97337, 'loss/train': 1.1323949098587036} 11/07/2021 10:54:49 - INFO - __main__ - Step 97339: {'lr': 0.0001405969792293838, 'samples': 18689088, 'steps': 97338, 'loss/train': 1.4566940069198608} 11/07/2021 10:54:50 - INFO - __main__ - Step 97340: {'lr': 0.00014059220762124852, 'samples': 18689280, 'steps': 97339, 'loss/train': 0.9306468963623047} 11/07/2021 10:54:50 - INFO - __main__ - Step 97341: {'lr': 0.0001405874360624107, 'samples': 18689472, 'steps': 97340, 'loss/train': 1.1129984855651855} 11/07/2021 10:54:51 - INFO - __main__ - Step 97342: {'lr': 0.00014058266455287247, 'samples': 18689664, 'steps': 97341, 'loss/train': 1.19082772731781} 11/07/2021 10:54:51 - INFO - __main__ - Step 97343: {'lr': 0.00014057789309263602, 'samples': 18689856, 'steps': 97342, 'loss/train': 1.2915658950805664} 11/07/2021 10:54:52 - INFO - __main__ - Step 97344: {'lr': 0.00014057312168170346, 'samples': 18690048, 'steps': 97343, 'loss/train': 1.4304757118225098} 11/07/2021 10:54:52 - INFO - __main__ - Step 97345: {'lr': 0.00014056835032007708, 'samples': 18690240, 'steps': 97344, 'loss/train': 1.0797678232192993} 11/07/2021 10:54:52 - INFO - __main__ - Step 97346: {'lr': 0.00014056357900775886, 'samples': 18690432, 'steps': 97345, 'loss/train': 1.4205642938613892} 11/07/2021 10:54:53 - INFO - __main__ - Step 97347: {'lr': 0.00014055880774475093, 'samples': 18690624, 'steps': 97346, 'loss/train': 1.498970627784729} 11/07/2021 10:54:54 - INFO - __main__ - Step 97348: {'lr': 0.00014055403653105553, 'samples': 18690816, 'steps': 97347, 'loss/train': 1.5450520515441895} 11/07/2021 10:54:54 - INFO - __main__ - Step 97349: {'lr': 0.0001405492653666748, 'samples': 18691008, 'steps': 97348, 'loss/train': 1.3487434387207031} 11/07/2021 10:54:54 - INFO - __main__ - Step 97350: {'lr': 0.0001405444942516109, 'samples': 18691200, 'steps': 97349, 'loss/train': 1.4693301916122437} 11/07/2021 10:54:55 - INFO - __main__ - Step 97351: {'lr': 0.00014053972318586595, 'samples': 18691392, 'steps': 97350, 'loss/train': 0.9529745578765869} 11/07/2021 10:54:56 - INFO - __main__ - Step 97352: {'lr': 0.00014053495216944208, 'samples': 18691584, 'steps': 97351, 'loss/train': 1.6427353620529175} 11/07/2021 10:54:56 - INFO - __main__ - Step 97353: {'lr': 0.0001405301812023415, 'samples': 18691776, 'steps': 97352, 'loss/train': 1.7012444734573364} 11/07/2021 10:54:57 - INFO - __main__ - Step 97354: {'lr': 0.00014052541028456635, 'samples': 18691968, 'steps': 97353, 'loss/train': 1.0429880619049072} 11/07/2021 10:54:57 - INFO - __main__ - Step 97355: {'lr': 0.00014052063941611876, 'samples': 18692160, 'steps': 97354, 'loss/train': 1.1790931224822998} 11/07/2021 10:54:57 - INFO - __main__ - Step 97356: {'lr': 0.00014051586859700082, 'samples': 18692352, 'steps': 97355, 'loss/train': 1.5618913173675537} 11/07/2021 10:54:58 - INFO - __main__ - Step 97357: {'lr': 0.0001405110978272148, 'samples': 18692544, 'steps': 97356, 'loss/train': 1.228217601776123} 11/07/2021 10:54:59 - INFO - __main__ - Step 97358: {'lr': 0.00014050632710676275, 'samples': 18692736, 'steps': 97357, 'loss/train': 1.4683605432510376} 11/07/2021 10:54:59 - INFO - __main__ - Step 97359: {'lr': 0.000140501556435647, 'samples': 18692928, 'steps': 97358, 'loss/train': 1.3177844285964966} 11/07/2021 10:54:59 - INFO - __main__ - Step 97360: {'lr': 0.00014049678581386942, 'samples': 18693120, 'steps': 97359, 'loss/train': 1.5145220756530762} 11/07/2021 10:55:00 - INFO - __main__ - Step 97361: {'lr': 0.00014049201524143234, 'samples': 18693312, 'steps': 97360, 'loss/train': 1.450724482536316} 11/07/2021 10:55:01 - INFO - __main__ - Step 97362: {'lr': 0.00014048724471833784, 'samples': 18693504, 'steps': 97361, 'loss/train': 1.2787243127822876} 11/07/2021 10:55:01 - INFO - __main__ - Step 97363: {'lr': 0.00014048247424458809, 'samples': 18693696, 'steps': 97362, 'loss/train': 1.2547509670257568} 11/07/2021 10:55:01 - INFO - __main__ - Step 97364: {'lr': 0.00014047770382018526, 'samples': 18693888, 'steps': 97363, 'loss/train': 1.2808761596679688} 11/07/2021 10:55:02 - INFO - __main__ - Step 97365: {'lr': 0.0001404729334451315, 'samples': 18694080, 'steps': 97364, 'loss/train': 1.437983512878418} 11/07/2021 10:55:02 - INFO - __main__ - Step 97366: {'lr': 0.00014046816311942895, 'samples': 18694272, 'steps': 97365, 'loss/train': 1.2802072763442993} 11/07/2021 10:55:03 - INFO - __main__ - Step 97367: {'lr': 0.00014046339284307975, 'samples': 18694464, 'steps': 97366, 'loss/train': 1.6192176342010498} 11/07/2021 10:55:03 - INFO - __main__ - Step 97368: {'lr': 0.00014045862261608604, 'samples': 18694656, 'steps': 97367, 'loss/train': 1.7494237422943115} 11/07/2021 10:55:04 - INFO - __main__ - Step 97369: {'lr': 0.00014045385243844998, 'samples': 18694848, 'steps': 97368, 'loss/train': 1.0735359191894531} 11/07/2021 10:55:04 - INFO - __main__ - Step 97370: {'lr': 0.00014044908231017372, 'samples': 18695040, 'steps': 97369, 'loss/train': 1.5011833906173706} 11/07/2021 10:55:05 - INFO - __main__ - Step 97371: {'lr': 0.00014044431223125941, 'samples': 18695232, 'steps': 97370, 'loss/train': 1.4969977140426636} 11/07/2021 10:55:05 - INFO - __main__ - Step 97372: {'lr': 0.00014043954220170935, 'samples': 18695424, 'steps': 97371, 'loss/train': 1.2477322816848755} 11/07/2021 10:55:06 - INFO - __main__ - Step 97373: {'lr': 0.0001404347722215254, 'samples': 18695616, 'steps': 97372, 'loss/train': 1.1765257120132446} 11/07/2021 10:55:06 - INFO - __main__ - Step 97374: {'lr': 0.00014043000229070984, 'samples': 18695808, 'steps': 97373, 'loss/train': 1.5129443407058716} 11/07/2021 10:55:06 - INFO - __main__ - Step 97375: {'lr': 0.00014042523240926486, 'samples': 18696000, 'steps': 97374, 'loss/train': 1.6331045627593994} 11/07/2021 10:55:07 - INFO - __main__ - Step 97376: {'lr': 0.0001404204625771926, 'samples': 18696192, 'steps': 97375, 'loss/train': 1.598926067352295} 11/07/2021 10:55:07 - INFO - __main__ - Step 97377: {'lr': 0.00014041569279449513, 'samples': 18696384, 'steps': 97376, 'loss/train': 1.2563599348068237} 11/07/2021 10:55:08 - INFO - __main__ - Step 97378: {'lr': 0.0001404109230611747, 'samples': 18696576, 'steps': 97377, 'loss/train': 1.5991896390914917} 11/07/2021 10:55:09 - INFO - __main__ - Step 97379: {'lr': 0.0001404061533772334, 'samples': 18696768, 'steps': 97378, 'loss/train': 1.0994014739990234} 11/07/2021 10:55:09 - INFO - __main__ - Step 97380: {'lr': 0.00014040138374267342, 'samples': 18696960, 'steps': 97379, 'loss/train': 1.3012919425964355} 11/07/2021 10:55:09 - INFO - __main__ - Step 97381: {'lr': 0.00014039661415749682, 'samples': 18697152, 'steps': 97380, 'loss/train': 1.3204481601715088} 11/07/2021 10:55:10 - INFO - __main__ - Step 97382: {'lr': 0.0001403918446217059, 'samples': 18697344, 'steps': 97381, 'loss/train': 0.9502792954444885} 11/07/2021 10:55:11 - INFO - __main__ - Step 97383: {'lr': 0.00014038707513530267, 'samples': 18697536, 'steps': 97382, 'loss/train': 1.3893383741378784} 11/07/2021 10:55:11 - INFO - __main__ - Step 97384: {'lr': 0.00014038230569828937, 'samples': 18697728, 'steps': 97383, 'loss/train': 1.064415693283081} 11/07/2021 10:55:11 - INFO - __main__ - Step 97385: {'lr': 0.00014037753631066815, 'samples': 18697920, 'steps': 97384, 'loss/train': 1.6302522420883179} 11/07/2021 10:55:12 - INFO - __main__ - Step 97386: {'lr': 0.00014037276697244106, 'samples': 18698112, 'steps': 97385, 'loss/train': 1.27461576461792} 11/07/2021 10:55:12 - INFO - __main__ - Step 97387: {'lr': 0.0001403679976836103, 'samples': 18698304, 'steps': 97386, 'loss/train': 1.5970542430877686} 11/07/2021 10:55:13 - INFO - __main__ - Step 97388: {'lr': 0.00014036322844417803, 'samples': 18698496, 'steps': 97387, 'loss/train': 1.229178786277771} 11/07/2021 10:55:14 - INFO - __main__ - Step 97389: {'lr': 0.00014035845925414642, 'samples': 18698688, 'steps': 97388, 'loss/train': 1.2312350273132324} 11/07/2021 10:55:14 - INFO - __main__ - Step 97390: {'lr': 0.00014035369011351756, 'samples': 18698880, 'steps': 97389, 'loss/train': 1.7119767665863037} 11/07/2021 10:55:14 - INFO - __main__ - Step 97391: {'lr': 0.0001403489210222937, 'samples': 18699072, 'steps': 97390, 'loss/train': 1.5562995672225952} 11/07/2021 10:55:15 - INFO - __main__ - Step 97392: {'lr': 0.00014034415198047685, 'samples': 18699264, 'steps': 97391, 'loss/train': 1.2968653440475464} 11/07/2021 10:55:16 - INFO - __main__ - Step 97393: {'lr': 0.00014033938298806925, 'samples': 18699456, 'steps': 97392, 'loss/train': 0.08629168570041656} 11/07/2021 10:55:16 - INFO - __main__ - Step 97394: {'lr': 0.00014033461404507305, 'samples': 18699648, 'steps': 97393, 'loss/train': 1.3378912210464478} 11/07/2021 10:55:16 - INFO - __main__ - Step 97395: {'lr': 0.0001403298451514904, 'samples': 18699840, 'steps': 97394, 'loss/train': 1.1550168991088867} 11/07/2021 10:55:17 - INFO - __main__ - Step 97396: {'lr': 0.0001403250763073234, 'samples': 18700032, 'steps': 97395, 'loss/train': 0.6819843649864197} 11/07/2021 10:55:17 - INFO - __main__ - Step 97397: {'lr': 0.0001403203075125742, 'samples': 18700224, 'steps': 97396, 'loss/train': 1.438965916633606} 11/07/2021 10:55:18 - INFO - __main__ - Step 97398: {'lr': 0.000140315538767245, 'samples': 18700416, 'steps': 97397, 'loss/train': 1.015559434890747} 11/07/2021 10:55:19 - INFO - __main__ - Step 97399: {'lr': 0.00014031077007133807, 'samples': 18700608, 'steps': 97398, 'loss/train': 0.6253157258033752} 11/07/2021 10:55:19 - INFO - __main__ - Step 97400: {'lr': 0.00014030600142485528, 'samples': 18700800, 'steps': 97399, 'loss/train': 1.4191771745681763} 11/07/2021 10:55:19 - INFO - __main__ - Step 97401: {'lr': 0.00014030123282779888, 'samples': 18700992, 'steps': 97400, 'loss/train': 0.12248192727565765} 11/07/2021 10:55:20 - INFO - __main__ - Step 97402: {'lr': 0.00014029646428017113, 'samples': 18701184, 'steps': 97401, 'loss/train': 1.4160741567611694} 11/07/2021 10:55:21 - INFO - __main__ - Step 97403: {'lr': 0.00014029169578197404, 'samples': 18701376, 'steps': 97402, 'loss/train': 1.479399561882019} 11/07/2021 10:55:21 - INFO - __main__ - Step 97404: {'lr': 0.00014028692733320983, 'samples': 18701568, 'steps': 97403, 'loss/train': 1.2969008684158325} 11/07/2021 10:55:21 - INFO - __main__ - Step 97405: {'lr': 0.00014028215893388063, 'samples': 18701760, 'steps': 97404, 'loss/train': 1.14283287525177} 11/07/2021 10:55:22 - INFO - __main__ - Step 97406: {'lr': 0.0001402773905839886, 'samples': 18701952, 'steps': 97405, 'loss/train': 1.1341277360916138} 11/07/2021 10:55:22 - INFO - __main__ - Step 97407: {'lr': 0.0001402726222835359, 'samples': 18702144, 'steps': 97406, 'loss/train': 1.305964469909668} 11/07/2021 10:55:23 - INFO - __main__ - Step 97408: {'lr': 0.00014026785403252468, 'samples': 18702336, 'steps': 97407, 'loss/train': 0.8848766684532166} 11/07/2021 10:55:24 - INFO - __main__ - Step 97409: {'lr': 0.00014026308583095704, 'samples': 18702528, 'steps': 97408, 'loss/train': 1.295719027519226} 11/07/2021 10:55:24 - INFO - __main__ - Step 97410: {'lr': 0.00014025831767883515, 'samples': 18702720, 'steps': 97409, 'loss/train': 1.2258368730545044} 11/07/2021 10:55:24 - INFO - __main__ - Step 97411: {'lr': 0.0001402535495761612, 'samples': 18702912, 'steps': 97410, 'loss/train': 0.9865967035293579} 11/07/2021 10:55:25 - INFO - __main__ - Step 97412: {'lr': 0.0001402487815229374, 'samples': 18703104, 'steps': 97411, 'loss/train': 1.3954378366470337} 11/07/2021 10:55:26 - INFO - __main__ - Step 97413: {'lr': 0.0001402440135191657, 'samples': 18703296, 'steps': 97412, 'loss/train': 1.304990291595459} 11/07/2021 10:55:26 - INFO - __main__ - Step 97414: {'lr': 0.00014023924556484836, 'samples': 18703488, 'steps': 97413, 'loss/train': 1.5356653928756714} 11/07/2021 10:55:26 - INFO - __main__ - Step 97415: {'lr': 0.00014023447765998748, 'samples': 18703680, 'steps': 97414, 'loss/train': 1.4085525274276733} 11/07/2021 10:55:27 - INFO - __main__ - Step 97416: {'lr': 0.00014022970980458527, 'samples': 18703872, 'steps': 97415, 'loss/train': 1.39313805103302} 11/07/2021 10:55:27 - INFO - __main__ - Step 97417: {'lr': 0.00014022494199864387, 'samples': 18704064, 'steps': 97416, 'loss/train': 1.6490107774734497} 11/07/2021 10:55:28 - INFO - __main__ - Step 97418: {'lr': 0.00014022017424216544, 'samples': 18704256, 'steps': 97417, 'loss/train': 1.3086639642715454} 11/07/2021 10:55:28 - INFO - __main__ - Step 97419: {'lr': 0.00014021540653515207, 'samples': 18704448, 'steps': 97418, 'loss/train': 1.3838804960250854} 11/07/2021 10:55:29 - INFO - __main__ - Step 97420: {'lr': 0.000140210638877606, 'samples': 18704640, 'steps': 97419, 'loss/train': 1.6233221292495728} 11/07/2021 10:55:29 - INFO - __main__ - Step 97421: {'lr': 0.00014020587126952928, 'samples': 18704832, 'steps': 97420, 'loss/train': 1.221342921257019} 11/07/2021 10:55:29 - INFO - __main__ - Step 97422: {'lr': 0.0001402011037109241, 'samples': 18705024, 'steps': 97421, 'loss/train': 1.396509051322937} 11/07/2021 10:55:30 - INFO - __main__ - Step 97423: {'lr': 0.0001401963362017926, 'samples': 18705216, 'steps': 97422, 'loss/train': 0.9107914566993713} 11/07/2021 10:55:31 - INFO - __main__ - Step 97424: {'lr': 0.00014019156874213695, 'samples': 18705408, 'steps': 97423, 'loss/train': 1.5563420057296753} 11/07/2021 10:55:31 - INFO - __main__ - Step 97425: {'lr': 0.00014018680133195927, 'samples': 18705600, 'steps': 97424, 'loss/train': 0.6336289048194885} 11/07/2021 10:55:32 - INFO - __main__ - Step 97426: {'lr': 0.00014018203397126185, 'samples': 18705792, 'steps': 97425, 'loss/train': 0.14683571457862854} 11/07/2021 10:55:32 - INFO - __main__ - Step 97427: {'lr': 0.0001401772666600466, 'samples': 18705984, 'steps': 97426, 'loss/train': 1.0432415008544922} 11/07/2021 10:55:32 - INFO - __main__ - Step 97428: {'lr': 0.0001401724993983158, 'samples': 18706176, 'steps': 97427, 'loss/train': 1.5797154903411865} 11/07/2021 10:55:33 - INFO - __main__ - Step 97429: {'lr': 0.0001401677321860715, 'samples': 18706368, 'steps': 97428, 'loss/train': 1.2251951694488525} 11/07/2021 10:55:34 - INFO - __main__ - Step 97430: {'lr': 0.000140162965023316, 'samples': 18706560, 'steps': 97429, 'loss/train': 1.3914060592651367} 11/07/2021 10:55:34 - INFO - __main__ - Step 97431: {'lr': 0.00014015819791005137, 'samples': 18706752, 'steps': 97430, 'loss/train': 1.3297207355499268} 11/07/2021 10:55:34 - INFO - __main__ - Step 97432: {'lr': 0.0001401534308462797, 'samples': 18706944, 'steps': 97431, 'loss/train': 1.2700903415679932} 11/07/2021 10:55:35 - INFO - __main__ - Step 97433: {'lr': 0.00014014866383200324, 'samples': 18707136, 'steps': 97432, 'loss/train': 0.8947109580039978} 11/07/2021 10:55:36 - INFO - __main__ - Step 97434: {'lr': 0.0001401438968672241, 'samples': 18707328, 'steps': 97433, 'loss/train': 1.084048867225647} 11/07/2021 10:55:36 - INFO - __main__ - Step 97435: {'lr': 0.00014013912995194445, 'samples': 18707520, 'steps': 97434, 'loss/train': 1.0484346151351929} 11/07/2021 10:55:36 - INFO - __main__ - Step 97436: {'lr': 0.00014013436308616634, 'samples': 18707712, 'steps': 97435, 'loss/train': 1.1960138082504272} 11/07/2021 10:55:37 - INFO - __main__ - Step 97437: {'lr': 0.00014012959626989206, 'samples': 18707904, 'steps': 97436, 'loss/train': 1.4703593254089355} 11/07/2021 10:55:37 - INFO - __main__ - Step 97438: {'lr': 0.00014012482950312368, 'samples': 18708096, 'steps': 97437, 'loss/train': 1.47111976146698} 11/07/2021 10:55:38 - INFO - __main__ - Step 97439: {'lr': 0.00014012006278586343, 'samples': 18708288, 'steps': 97438, 'loss/train': 1.0988796949386597} 11/07/2021 10:55:38 - INFO - __main__ - Step 97440: {'lr': 0.0001401152961181133, 'samples': 18708480, 'steps': 97439, 'loss/train': 1.239790678024292} 11/07/2021 10:55:39 - INFO - __main__ - Step 97441: {'lr': 0.0001401105294998755, 'samples': 18708672, 'steps': 97440, 'loss/train': 1.9497935771942139} 11/07/2021 10:55:39 - INFO - __main__ - Step 97442: {'lr': 0.00014010576293115222, 'samples': 18708864, 'steps': 97441, 'loss/train': 1.431652545928955} 11/07/2021 10:55:39 - INFO - __main__ - Step 97443: {'lr': 0.00014010099641194556, 'samples': 18709056, 'steps': 97442, 'loss/train': 1.4283591508865356} 11/07/2021 10:55:41 - INFO - __main__ - Step 97444: {'lr': 0.00014009622994225773, 'samples': 18709248, 'steps': 97443, 'loss/train': 1.0231252908706665} 11/07/2021 10:55:41 - INFO - __main__ - Step 97445: {'lr': 0.00014009146352209084, 'samples': 18709440, 'steps': 97444, 'loss/train': 1.5631712675094604} 11/07/2021 10:55:41 - INFO - __main__ - Step 97446: {'lr': 0.00014008669715144702, 'samples': 18709632, 'steps': 97445, 'loss/train': 1.164747714996338} 11/07/2021 10:55:42 - INFO - __main__ - Step 97447: {'lr': 0.00014008193083032844, 'samples': 18709824, 'steps': 97446, 'loss/train': 1.3546353578567505} 11/07/2021 10:55:42 - INFO - __main__ - Step 97448: {'lr': 0.00014007716455873725, 'samples': 18710016, 'steps': 97447, 'loss/train': 1.4339606761932373} 11/07/2021 10:55:43 - INFO - __main__ - Step 97449: {'lr': 0.0001400723983366756, 'samples': 18710208, 'steps': 97448, 'loss/train': 1.5567598342895508} 11/07/2021 10:55:43 - INFO - __main__ - Step 97450: {'lr': 0.00014006763216414564, 'samples': 18710400, 'steps': 97449, 'loss/train': 1.5237456560134888} 11/07/2021 10:55:44 - INFO - __main__ - Step 97451: {'lr': 0.0001400628660411495, 'samples': 18710592, 'steps': 97450, 'loss/train': 1.386210560798645} 11/07/2021 10:55:44 - INFO - __main__ - Step 97452: {'lr': 0.00014005809996768935, 'samples': 18710784, 'steps': 97451, 'loss/train': 1.4372365474700928} 11/07/2021 10:55:44 - INFO - __main__ - Step 97453: {'lr': 0.0001400533339437674, 'samples': 18710976, 'steps': 97452, 'loss/train': 0.5957849621772766} 11/07/2021 10:55:46 - INFO - __main__ - Step 97454: {'lr': 0.00014004856796938565, 'samples': 18711168, 'steps': 97453, 'loss/train': 1.3125821352005005} 11/07/2021 10:55:46 - INFO - __main__ - Step 97455: {'lr': 0.00014004380204454627, 'samples': 18711360, 'steps': 97454, 'loss/train': 1.1409317255020142} 11/07/2021 10:55:47 - INFO - __main__ - Step 97456: {'lr': 0.00014003903616925152, 'samples': 18711552, 'steps': 97455, 'loss/train': 1.4259625673294067} 11/07/2021 10:55:47 - INFO - __main__ - Step 97457: {'lr': 0.00014003427034350342, 'samples': 18711744, 'steps': 97456, 'loss/train': 1.6239895820617676} 11/07/2021 10:55:47 - INFO - __main__ - Step 97458: {'lr': 0.0001400295045673042, 'samples': 18711936, 'steps': 97457, 'loss/train': 0.9547484517097473} 11/07/2021 10:55:48 - INFO - __main__ - Step 97459: {'lr': 0.00014002473884065601, 'samples': 18712128, 'steps': 97458, 'loss/train': 1.4337025880813599} 11/07/2021 10:55:49 - INFO - __main__ - Step 97460: {'lr': 0.00014001997316356095, 'samples': 18712320, 'steps': 97459, 'loss/train': 0.12713727355003357} 11/07/2021 10:55:49 - INFO - __main__ - Step 97461: {'lr': 0.0001400152075360212, 'samples': 18712512, 'steps': 97460, 'loss/train': 1.3183939456939697} 11/07/2021 10:55:49 - INFO - __main__ - Step 97462: {'lr': 0.00014001044195803891, 'samples': 18712704, 'steps': 97461, 'loss/train': 1.0331637859344482} 11/07/2021 10:55:50 - INFO - __main__ - Step 97463: {'lr': 0.00014000567642961622, 'samples': 18712896, 'steps': 97462, 'loss/train': 1.3528541326522827} 11/07/2021 10:55:50 - INFO - __main__ - Step 97464: {'lr': 0.0001400009109507553, 'samples': 18713088, 'steps': 97463, 'loss/train': 1.5821572542190552} 11/07/2021 10:55:51 - INFO - __main__ - Step 97465: {'lr': 0.00013999614552145823, 'samples': 18713280, 'steps': 97464, 'loss/train': 2.125688076019287} 11/07/2021 10:55:51 - INFO - __main__ - Step 97466: {'lr': 0.0001399913801417273, 'samples': 18713472, 'steps': 97465, 'loss/train': 1.1009105443954468} 11/07/2021 10:55:52 - INFO - __main__ - Step 97467: {'lr': 0.00013998661481156446, 'samples': 18713664, 'steps': 97466, 'loss/train': 1.6595715284347534} 11/07/2021 10:55:52 - INFO - __main__ - Step 97468: {'lr': 0.00013998184953097195, 'samples': 18713856, 'steps': 97467, 'loss/train': 1.0934065580368042} 11/07/2021 10:55:52 - INFO - __main__ - Step 97469: {'lr': 0.00013997708429995193, 'samples': 18714048, 'steps': 97468, 'loss/train': 0.645461916923523} 11/07/2021 10:55:54 - INFO - __main__ - Step 97470: {'lr': 0.00013997231911850656, 'samples': 18714240, 'steps': 97469, 'loss/train': 1.1757524013519287} 11/07/2021 10:55:54 - INFO - __main__ - Step 97471: {'lr': 0.00013996755398663793, 'samples': 18714432, 'steps': 97470, 'loss/train': 1.2248892784118652} 11/07/2021 10:55:55 - INFO - __main__ - Step 97472: {'lr': 0.00013996278890434825, 'samples': 18714624, 'steps': 97471, 'loss/train': 1.8889424800872803} 11/07/2021 10:55:55 - INFO - __main__ - Step 97473: {'lr': 0.00013995802387163964, 'samples': 18714816, 'steps': 97472, 'loss/train': 1.4174329042434692} 11/07/2021 10:55:55 - INFO - __main__ - Step 97474: {'lr': 0.0001399532588885142, 'samples': 18715008, 'steps': 97473, 'loss/train': 1.0070725679397583} 11/07/2021 10:55:56 - INFO - __main__ - Step 97475: {'lr': 0.00013994849395497415, 'samples': 18715200, 'steps': 97474, 'loss/train': 1.369762897491455} 11/07/2021 10:55:57 - INFO - __main__ - Step 97476: {'lr': 0.00013994372907102167, 'samples': 18715392, 'steps': 97475, 'loss/train': 1.232983112335205} 11/07/2021 10:55:57 - INFO - __main__ - Step 97477: {'lr': 0.00013993896423665874, 'samples': 18715584, 'steps': 97476, 'loss/train': 1.765738606452942} 11/07/2021 10:55:58 - INFO - __main__ - Step 97478: {'lr': 0.00013993419945188768, 'samples': 18715776, 'steps': 97477, 'loss/train': 0.06984374672174454} 11/07/2021 10:55:58 - INFO - __main__ - Step 97479: {'lr': 0.00013992943471671055, 'samples': 18715968, 'steps': 97478, 'loss/train': 1.262378215789795} 11/07/2021 10:55:58 - INFO - __main__ - Step 97480: {'lr': 0.00013992467003112963, 'samples': 18716160, 'steps': 97479, 'loss/train': 1.5354286432266235} 11/07/2021 10:55:59 - INFO - __main__ - Step 97481: {'lr': 0.00013991990539514686, 'samples': 18716352, 'steps': 97480, 'loss/train': 1.4380537271499634} 11/07/2021 10:56:00 - INFO - __main__ - Step 97482: {'lr': 0.0001399151408087645, 'samples': 18716544, 'steps': 97481, 'loss/train': 1.2973610162734985} 11/07/2021 10:56:00 - INFO - __main__ - Step 97483: {'lr': 0.00013991037627198463, 'samples': 18716736, 'steps': 97482, 'loss/train': 1.868369460105896} 11/07/2021 10:56:00 - INFO - __main__ - Step 97484: {'lr': 0.00013990561178480948, 'samples': 18716928, 'steps': 97483, 'loss/train': 1.7293143272399902} 11/07/2021 10:56:01 - INFO - __main__ - Step 97485: {'lr': 0.00013990084734724116, 'samples': 18717120, 'steps': 97484, 'loss/train': 1.0767765045166016} 11/07/2021 10:56:02 - INFO - __main__ - Step 97486: {'lr': 0.0001398960829592818, 'samples': 18717312, 'steps': 97485, 'loss/train': 1.6852149963378906} 11/07/2021 10:56:02 - INFO - __main__ - Step 97487: {'lr': 0.00013989131862093357, 'samples': 18717504, 'steps': 97486, 'loss/train': 1.2216343879699707} 11/07/2021 10:56:03 - INFO - __main__ - Step 97488: {'lr': 0.0001398865543321986, 'samples': 18717696, 'steps': 97487, 'loss/train': 0.8284114599227905} 11/07/2021 10:56:03 - INFO - __main__ - Step 97489: {'lr': 0.0001398817900930791, 'samples': 18717888, 'steps': 97488, 'loss/train': 1.617833137512207} 11/07/2021 10:56:03 - INFO - __main__ - Step 97490: {'lr': 0.0001398770259035771, 'samples': 18718080, 'steps': 97489, 'loss/train': 0.9977625012397766} 11/07/2021 10:56:04 - INFO - __main__ - Step 97491: {'lr': 0.00013987226176369487, 'samples': 18718272, 'steps': 97490, 'loss/train': 1.1735291481018066} 11/07/2021 10:56:05 - INFO - __main__ - Step 97492: {'lr': 0.00013986749767343448, 'samples': 18718464, 'steps': 97491, 'loss/train': 0.9957885146141052} 11/07/2021 10:56:05 - INFO - __main__ - Step 97493: {'lr': 0.00013986273363279818, 'samples': 18718656, 'steps': 97492, 'loss/train': 1.2280470132827759} 11/07/2021 10:56:05 - INFO - __main__ - Step 97494: {'lr': 0.00013985796964178796, 'samples': 18718848, 'steps': 97493, 'loss/train': 1.326210856437683} 11/07/2021 10:56:06 - INFO - __main__ - Step 97495: {'lr': 0.000139853205700406, 'samples': 18719040, 'steps': 97494, 'loss/train': 1.4393994808197021} 11/07/2021 10:56:06 - INFO - __main__ - Step 97496: {'lr': 0.00013984844180865453, 'samples': 18719232, 'steps': 97495, 'loss/train': 1.5373814105987549} 11/07/2021 10:56:08 - INFO - __main__ - Step 97497: {'lr': 0.00013984367796653562, 'samples': 18719424, 'steps': 97496, 'loss/train': 1.0722167491912842} 11/07/2021 10:56:08 - INFO - __main__ - Step 97498: {'lr': 0.00013983891417405147, 'samples': 18719616, 'steps': 97497, 'loss/train': 1.7138910293579102} 11/07/2021 10:56:08 - INFO - __main__ - Step 97499: {'lr': 0.00013983415043120423, 'samples': 18719808, 'steps': 97498, 'loss/train': 1.0641450881958008} 11/07/2021 10:56:09 - INFO - __main__ - Step 97500: {'lr': 0.00013982938673799596, 'samples': 18720000, 'steps': 97499, 'loss/train': 1.2569489479064941} 11/07/2021 10:56:09 - INFO - __main__ - Step 97501: {'lr': 0.0001398246230944289, 'samples': 18720192, 'steps': 97500, 'loss/train': 0.1224503442645073} 11/07/2021 10:56:10 - INFO - __main__ - Step 97502: {'lr': 0.00013981985950050518, 'samples': 18720384, 'steps': 97501, 'loss/train': 1.2391152381896973} 11/07/2021 10:56:10 - INFO - __main__ - Step 97503: {'lr': 0.0001398150959562269, 'samples': 18720576, 'steps': 97502, 'loss/train': 1.8002578020095825} 11/07/2021 10:56:11 - INFO - __main__ - Step 97504: {'lr': 0.00013981033246159624, 'samples': 18720768, 'steps': 97503, 'loss/train': 1.3819923400878906} 11/07/2021 10:56:11 - INFO - __main__ - Step 97505: {'lr': 0.0001398055690166154, 'samples': 18720960, 'steps': 97504, 'loss/train': 1.4179717302322388} 11/07/2021 10:56:11 - INFO - __main__ - Step 97506: {'lr': 0.0001398008056212865, 'samples': 18721152, 'steps': 97505, 'loss/train': 1.4359991550445557} 11/07/2021 10:56:13 - INFO - __main__ - Step 97507: {'lr': 0.0001397960422756116, 'samples': 18721344, 'steps': 97506, 'loss/train': 1.0742369890213013} 11/07/2021 10:56:13 - INFO - __main__ - Step 97508: {'lr': 0.00013979127897959288, 'samples': 18721536, 'steps': 97507, 'loss/train': 1.0306785106658936} 11/07/2021 10:56:13 - INFO - __main__ - Step 97509: {'lr': 0.0001397865157332325, 'samples': 18721728, 'steps': 97508, 'loss/train': 1.0248005390167236} 11/07/2021 10:56:14 - INFO - __main__ - Step 97510: {'lr': 0.00013978175253653264, 'samples': 18721920, 'steps': 97509, 'loss/train': 2.2519962787628174} 11/07/2021 10:56:14 - INFO - __main__ - Step 97511: {'lr': 0.0001397769893894954, 'samples': 18722112, 'steps': 97510, 'loss/train': 1.56768000125885} 11/07/2021 10:56:15 - INFO - __main__ - Step 97512: {'lr': 0.00013977222629212296, 'samples': 18722304, 'steps': 97511, 'loss/train': 1.232927680015564} 11/07/2021 10:56:15 - INFO - __main__ - Step 97513: {'lr': 0.00013976746324441747, 'samples': 18722496, 'steps': 97512, 'loss/train': 1.144498586654663} 11/07/2021 10:56:16 - INFO - __main__ - Step 97514: {'lr': 0.00013976270024638104, 'samples': 18722688, 'steps': 97513, 'loss/train': 1.3737106323242188} 11/07/2021 10:56:16 - INFO - __main__ - Step 97515: {'lr': 0.00013975793729801582, 'samples': 18722880, 'steps': 97514, 'loss/train': 1.4895460605621338} 11/07/2021 10:56:16 - INFO - __main__ - Step 97516: {'lr': 0.000139753174399324, 'samples': 18723072, 'steps': 97515, 'loss/train': 0.9630944132804871} 11/07/2021 10:56:17 - INFO - __main__ - Step 97517: {'lr': 0.0001397484115503077, 'samples': 18723264, 'steps': 97516, 'loss/train': 1.0955883264541626} 11/07/2021 10:56:18 - INFO - __main__ - Step 97518: {'lr': 0.00013974364875096905, 'samples': 18723456, 'steps': 97517, 'loss/train': 1.4521002769470215} 11/07/2021 10:56:18 - INFO - __main__ - Step 97519: {'lr': 0.00013973888600131022, 'samples': 18723648, 'steps': 97518, 'loss/train': 1.1687673330307007} 11/07/2021 10:56:19 - INFO - __main__ - Step 97520: {'lr': 0.00013973412330133345, 'samples': 18723840, 'steps': 97519, 'loss/train': 1.1962472200393677} 11/07/2021 10:56:19 - INFO - __main__ - Step 97521: {'lr': 0.00013972936065104064, 'samples': 18724032, 'steps': 97520, 'loss/train': 1.5027949810028076} 11/07/2021 10:56:20 - INFO - __main__ - Step 97522: {'lr': 0.00013972459805043413, 'samples': 18724224, 'steps': 97521, 'loss/train': 1.3768941164016724} 11/07/2021 10:56:20 - INFO - __main__ - Step 97523: {'lr': 0.000139719835499516, 'samples': 18724416, 'steps': 97522, 'loss/train': 0.08196950703859329} 11/07/2021 10:56:21 - INFO - __main__ - Step 97524: {'lr': 0.0001397150729982884, 'samples': 18724608, 'steps': 97523, 'loss/train': 1.876459002494812} 11/07/2021 10:56:21 - INFO - __main__ - Step 97525: {'lr': 0.0001397103105467535, 'samples': 18724800, 'steps': 97524, 'loss/train': 1.0152281522750854} 11/07/2021 10:56:21 - INFO - __main__ - Step 97526: {'lr': 0.00013970554814491344, 'samples': 18724992, 'steps': 97525, 'loss/train': 1.1313546895980835} 11/07/2021 10:56:22 - INFO - __main__ - Step 97527: {'lr': 0.00013970078579277032, 'samples': 18725184, 'steps': 97526, 'loss/train': 1.767148494720459} 11/07/2021 10:56:23 - INFO - __main__ - Step 97528: {'lr': 0.00013969602349032633, 'samples': 18725376, 'steps': 97527, 'loss/train': 1.4497555494308472} 11/07/2021 10:56:23 - INFO - __main__ - Step 97529: {'lr': 0.00013969126123758362, 'samples': 18725568, 'steps': 97528, 'loss/train': 1.3096553087234497} 11/07/2021 10:56:23 - INFO - __main__ - Step 97530: {'lr': 0.00013968649903454435, 'samples': 18725760, 'steps': 97529, 'loss/train': 1.3119460344314575} 11/07/2021 10:56:24 - INFO - __main__ - Step 97531: {'lr': 0.00013968173688121062, 'samples': 18725952, 'steps': 97530, 'loss/train': 1.5309481620788574} 11/07/2021 10:56:25 - INFO - __main__ - Step 97532: {'lr': 0.00013967697477758461, 'samples': 18726144, 'steps': 97531, 'loss/train': 1.0778049230575562} 11/07/2021 10:56:25 - INFO - __main__ - Step 97533: {'lr': 0.00013967221272366854, 'samples': 18726336, 'steps': 97532, 'loss/train': 0.9772436618804932} 11/07/2021 10:56:26 - INFO - __main__ - Step 97534: {'lr': 0.00013966745071946439, 'samples': 18726528, 'steps': 97533, 'loss/train': 1.1907647848129272} 11/07/2021 10:56:26 - INFO - __main__ - Step 97535: {'lr': 0.00013966268876497434, 'samples': 18726720, 'steps': 97534, 'loss/train': 1.3947205543518066} 11/07/2021 10:56:26 - INFO - __main__ - Step 97536: {'lr': 0.00013965792686020063, 'samples': 18726912, 'steps': 97535, 'loss/train': 1.3548665046691895} 11/07/2021 10:56:27 - INFO - __main__ - Step 97537: {'lr': 0.00013965316500514532, 'samples': 18727104, 'steps': 97536, 'loss/train': 0.6151019334793091} 11/07/2021 10:56:28 - INFO - __main__ - Step 97538: {'lr': 0.0001396484031998106, 'samples': 18727296, 'steps': 97537, 'loss/train': 1.4832209348678589} 11/07/2021 10:56:28 - INFO - __main__ - Step 97539: {'lr': 0.0001396436414441986, 'samples': 18727488, 'steps': 97538, 'loss/train': 1.374806523323059} 11/07/2021 10:56:29 - INFO - __main__ - Step 97540: {'lr': 0.00013963887973831153, 'samples': 18727680, 'steps': 97539, 'loss/train': 1.447018027305603} 11/07/2021 10:56:29 - INFO - __main__ - Step 97541: {'lr': 0.0001396341180821514, 'samples': 18727872, 'steps': 97540, 'loss/train': 1.329587697982788} 11/07/2021 10:56:29 - INFO - __main__ - Step 97542: {'lr': 0.00013962935647572044, 'samples': 18728064, 'steps': 97541, 'loss/train': 1.9066271781921387} 11/07/2021 10:56:30 - INFO - __main__ - Step 97543: {'lr': 0.00013962459491902084, 'samples': 18728256, 'steps': 97542, 'loss/train': 1.0598937273025513} 11/07/2021 10:56:31 - INFO - __main__ - Step 97544: {'lr': 0.00013961983341205465, 'samples': 18728448, 'steps': 97543, 'loss/train': 1.4376786947250366} 11/07/2021 10:56:31 - INFO - __main__ - Step 97545: {'lr': 0.0001396150719548241, 'samples': 18728640, 'steps': 97544, 'loss/train': 1.29905366897583} 11/07/2021 10:56:31 - INFO - __main__ - Step 97546: {'lr': 0.00013961031054733126, 'samples': 18728832, 'steps': 97545, 'loss/train': 1.5888330936431885} 11/07/2021 10:56:32 - INFO - __main__ - Step 97547: {'lr': 0.00013960554918957842, 'samples': 18729024, 'steps': 97546, 'loss/train': 1.238223671913147} 11/07/2021 10:56:33 - INFO - __main__ - Step 97548: {'lr': 0.00013960078788156753, 'samples': 18729216, 'steps': 97547, 'loss/train': 1.5152124166488647} 11/07/2021 10:56:33 - INFO - __main__ - Step 97549: {'lr': 0.00013959602662330078, 'samples': 18729408, 'steps': 97548, 'loss/train': 1.5553427934646606} 11/07/2021 10:56:33 - INFO - __main__ - Step 97550: {'lr': 0.0001395912654147804, 'samples': 18729600, 'steps': 97549, 'loss/train': 1.369454264640808} 11/07/2021 10:56:34 - INFO - __main__ - Step 97551: {'lr': 0.0001395865042560085, 'samples': 18729792, 'steps': 97550, 'loss/train': 1.2961795330047607} 11/07/2021 10:56:34 - INFO - __main__ - Step 97552: {'lr': 0.00013958174314698718, 'samples': 18729984, 'steps': 97551, 'loss/train': 0.9892671704292297} 11/07/2021 10:56:35 - INFO - __main__ - Step 97553: {'lr': 0.00013957698208771864, 'samples': 18730176, 'steps': 97552, 'loss/train': 0.7850678563117981} 11/07/2021 10:56:35 - INFO - __main__ - Step 97554: {'lr': 0.000139572221078205, 'samples': 18730368, 'steps': 97553, 'loss/train': 1.1345356702804565} 11/07/2021 10:56:36 - INFO - __main__ - Step 97555: {'lr': 0.00013956746011844842, 'samples': 18730560, 'steps': 97554, 'loss/train': 1.5273668766021729} 11/07/2021 10:56:36 - INFO - __main__ - Step 97556: {'lr': 0.00013956269920845104, 'samples': 18730752, 'steps': 97555, 'loss/train': 1.5433309078216553} 11/07/2021 10:56:36 - INFO - __main__ - Step 97557: {'lr': 0.000139557938348215, 'samples': 18730944, 'steps': 97556, 'loss/train': 1.459364652633667} 11/07/2021 10:56:37 - INFO - __main__ - Step 97558: {'lr': 0.00013955317753774243, 'samples': 18731136, 'steps': 97557, 'loss/train': 1.4516520500183105} 11/07/2021 10:56:38 - INFO - __main__ - Step 97559: {'lr': 0.00013954841677703565, 'samples': 18731328, 'steps': 97558, 'loss/train': 1.2722599506378174} 11/07/2021 10:56:38 - INFO - __main__ - Step 97560: {'lr': 0.00013954365606609647, 'samples': 18731520, 'steps': 97559, 'loss/train': 0.5654886960983276} 11/07/2021 10:56:38 - INFO - __main__ - Step 97561: {'lr': 0.0001395388954049273, 'samples': 18731712, 'steps': 97560, 'loss/train': 1.1512260437011719} 11/07/2021 10:56:39 - INFO - __main__ - Step 97562: {'lr': 0.00013953413479353015, 'samples': 18731904, 'steps': 97561, 'loss/train': 1.5319050550460815} 11/07/2021 10:56:40 - INFO - __main__ - Step 97563: {'lr': 0.0001395293742319072, 'samples': 18732096, 'steps': 97562, 'loss/train': 1.0811069011688232} 11/07/2021 10:56:40 - INFO - __main__ - Step 97564: {'lr': 0.00013952461372006064, 'samples': 18732288, 'steps': 97563, 'loss/train': 0.795385479927063} 11/07/2021 10:56:41 - INFO - __main__ - Step 97565: {'lr': 0.00013951985325799259, 'samples': 18732480, 'steps': 97564, 'loss/train': 1.7864118814468384} 11/07/2021 10:56:41 - INFO - __main__ - Step 97566: {'lr': 0.00013951509284570516, 'samples': 18732672, 'steps': 97565, 'loss/train': 1.2672295570373535} 11/07/2021 10:56:41 - INFO - __main__ - Step 97567: {'lr': 0.00013951033248320056, 'samples': 18732864, 'steps': 97566, 'loss/train': 1.1778881549835205} 11/07/2021 10:56:42 - INFO - __main__ - Step 97568: {'lr': 0.0001395055721704809, 'samples': 18733056, 'steps': 97567, 'loss/train': 0.9899225234985352} 11/07/2021 10:56:43 - INFO - __main__ - Step 97569: {'lr': 0.00013950081190754828, 'samples': 18733248, 'steps': 97568, 'loss/train': 1.7904956340789795} 11/07/2021 10:56:43 - INFO - __main__ - Step 97570: {'lr': 0.000139496051694405, 'samples': 18733440, 'steps': 97569, 'loss/train': 1.4672465324401855} 11/07/2021 10:56:43 - INFO - __main__ - Step 97571: {'lr': 0.000139491291531053, 'samples': 18733632, 'steps': 97570, 'loss/train': 1.1072239875793457} 11/07/2021 10:56:44 - INFO - __main__ - Step 97572: {'lr': 0.0001394865314174945, 'samples': 18733824, 'steps': 97571, 'loss/train': 1.222100853919983} 11/07/2021 10:56:45 - INFO - __main__ - Step 97573: {'lr': 0.0001394817713537317, 'samples': 18734016, 'steps': 97572, 'loss/train': 1.4673426151275635} 11/07/2021 10:56:45 - INFO - __main__ - Step 97574: {'lr': 0.0001394770113397667, 'samples': 18734208, 'steps': 97573, 'loss/train': 1.3703473806381226} 11/07/2021 10:56:45 - INFO - __main__ - Step 97575: {'lr': 0.00013947225137560164, 'samples': 18734400, 'steps': 97574, 'loss/train': 1.1056625843048096} 11/07/2021 10:56:46 - INFO - __main__ - Step 97576: {'lr': 0.0001394674914612387, 'samples': 18734592, 'steps': 97575, 'loss/train': 1.2097054719924927} 11/07/2021 10:56:46 - INFO - __main__ - Step 97577: {'lr': 0.00013946273159668, 'samples': 18734784, 'steps': 97576, 'loss/train': 1.2043582201004028} 11/07/2021 10:56:47 - INFO - __main__ - Step 97578: {'lr': 0.00013945797178192766, 'samples': 18734976, 'steps': 97577, 'loss/train': 0.9873724579811096} 11/07/2021 10:56:47 - INFO - __main__ - Step 97579: {'lr': 0.00013945321201698385, 'samples': 18735168, 'steps': 97578, 'loss/train': 1.4141356945037842} 11/07/2021 10:56:48 - INFO - __main__ - Step 97580: {'lr': 0.00013944845230185078, 'samples': 18735360, 'steps': 97579, 'loss/train': 1.356389045715332} 11/07/2021 10:56:48 - INFO - __main__ - Step 97581: {'lr': 0.00013944369263653057, 'samples': 18735552, 'steps': 97580, 'loss/train': 1.7102653980255127} 11/07/2021 10:56:49 - INFO - __main__ - Step 97582: {'lr': 0.00013943893302102522, 'samples': 18735744, 'steps': 97581, 'loss/train': 1.3543272018432617} 11/07/2021 10:56:50 - INFO - __main__ - Step 97583: {'lr': 0.00013943417345533703, 'samples': 18735936, 'steps': 97582, 'loss/train': 1.4833709001541138} 11/07/2021 10:56:51 - INFO - __main__ - Step 97584: {'lr': 0.00013942941393946807, 'samples': 18736128, 'steps': 97583, 'loss/train': 1.0838696956634521} 11/07/2021 10:56:51 - INFO - __main__ - Step 97585: {'lr': 0.0001394246544734205, 'samples': 18736320, 'steps': 97584, 'loss/train': 1.0340803861618042} 11/07/2021 10:56:51 - INFO - __main__ - Step 97586: {'lr': 0.0001394198950571965, 'samples': 18736512, 'steps': 97585, 'loss/train': 1.745996356010437} 11/07/2021 10:56:52 - INFO - __main__ - Step 97587: {'lr': 0.00013941513569079816, 'samples': 18736704, 'steps': 97586, 'loss/train': 1.7546037435531616} 11/07/2021 10:56:52 - INFO - __main__ - Step 97588: {'lr': 0.00013941037637422765, 'samples': 18736896, 'steps': 97587, 'loss/train': 0.8320348262786865} 11/07/2021 10:56:53 - INFO - __main__ - Step 97589: {'lr': 0.00013940561710748715, 'samples': 18737088, 'steps': 97588, 'loss/train': 0.8793270587921143} 11/07/2021 10:56:53 - INFO - __main__ - Step 97590: {'lr': 0.00013940085789057875, 'samples': 18737280, 'steps': 97589, 'loss/train': 1.4261178970336914} 11/07/2021 10:56:54 - INFO - __main__ - Step 97591: {'lr': 0.00013939609872350462, 'samples': 18737472, 'steps': 97590, 'loss/train': 1.2950708866119385} 11/07/2021 10:56:54 - INFO - __main__ - Step 97592: {'lr': 0.00013939133960626698, 'samples': 18737664, 'steps': 97591, 'loss/train': 1.2686654329299927} 11/07/2021 10:56:55 - INFO - __main__ - Step 97593: {'lr': 0.00013938658053886782, 'samples': 18737856, 'steps': 97592, 'loss/train': 1.2752152681350708} 11/07/2021 10:56:55 - INFO - __main__ - Step 97594: {'lr': 0.00013938182152130937, 'samples': 18738048, 'steps': 97593, 'loss/train': 1.4094116687774658} 11/07/2021 10:56:56 - INFO - __main__ - Step 97595: {'lr': 0.00013937706255359378, 'samples': 18738240, 'steps': 97594, 'loss/train': 1.5139414072036743} 11/07/2021 10:56:56 - INFO - __main__ - Step 97596: {'lr': 0.0001393723036357231, 'samples': 18738432, 'steps': 97595, 'loss/train': 0.999030590057373} 11/07/2021 10:56:57 - INFO - __main__ - Step 97597: {'lr': 0.00013936754476769964, 'samples': 18738624, 'steps': 97596, 'loss/train': 1.6407370567321777} 11/07/2021 10:56:57 - INFO - __main__ - Step 97598: {'lr': 0.00013936278594952543, 'samples': 18738816, 'steps': 97597, 'loss/train': 0.6952365040779114} 11/07/2021 10:56:57 - INFO - __main__ - Step 97599: {'lr': 0.00013935802718120262, 'samples': 18739008, 'steps': 97598, 'loss/train': 1.3830242156982422} 11/07/2021 10:56:58 - INFO - __main__ - Step 97600: {'lr': 0.00013935326846273337, 'samples': 18739200, 'steps': 97599, 'loss/train': 0.7406180500984192} 11/07/2021 10:56:59 - INFO - __main__ - Step 97601: {'lr': 0.0001393485097941199, 'samples': 18739392, 'steps': 97600, 'loss/train': 1.6972581148147583} 11/07/2021 10:56:59 - INFO - __main__ - Step 97602: {'lr': 0.0001393437511753642, 'samples': 18739584, 'steps': 97601, 'loss/train': 1.3014674186706543} 11/07/2021 10:56:59 - INFO - __main__ - Step 97603: {'lr': 0.00013933899260646864, 'samples': 18739776, 'steps': 97602, 'loss/train': 0.07625014334917068} 11/07/2021 10:57:00 - INFO - __main__ - Step 97604: {'lr': 0.0001393342340874351, 'samples': 18739968, 'steps': 97603, 'loss/train': 1.4043763875961304} 11/07/2021 10:57:00 - INFO - __main__ - Step 97605: {'lr': 0.00013932947561826588, 'samples': 18740160, 'steps': 97604, 'loss/train': 1.7186225652694702} 11/07/2021 10:57:01 - INFO - __main__ - Step 97606: {'lr': 0.00013932471719896306, 'samples': 18740352, 'steps': 97605, 'loss/train': 0.054254043847322464} 11/07/2021 10:57:02 - INFO - __main__ - Step 97607: {'lr': 0.00013931995882952882, 'samples': 18740544, 'steps': 97606, 'loss/train': 1.216517686843872} 11/07/2021 10:57:02 - INFO - __main__ - Step 97608: {'lr': 0.0001393152005099653, 'samples': 18740736, 'steps': 97607, 'loss/train': 1.4286657571792603} 11/07/2021 10:57:02 - INFO - __main__ - Step 97609: {'lr': 0.00013931044224027467, 'samples': 18740928, 'steps': 97608, 'loss/train': 1.0502307415008545} 11/07/2021 10:57:03 - INFO - __main__ - Step 97610: {'lr': 0.000139305684020459, 'samples': 18741120, 'steps': 97609, 'loss/train': 1.6288443803787231} 11/07/2021 10:57:04 - INFO - __main__ - Step 97611: {'lr': 0.00013930092585052052, 'samples': 18741312, 'steps': 97610, 'loss/train': 1.9059209823608398} 11/07/2021 10:57:04 - INFO - __main__ - Step 97612: {'lr': 0.00013929616773046135, 'samples': 18741504, 'steps': 97611, 'loss/train': 1.6274856328964233} 11/07/2021 10:57:04 - INFO - __main__ - Step 97613: {'lr': 0.00013929140966028355, 'samples': 18741696, 'steps': 97612, 'loss/train': 0.6690410375595093} 11/07/2021 10:57:05 - INFO - __main__ - Step 97614: {'lr': 0.0001392866516399895, 'samples': 18741888, 'steps': 97613, 'loss/train': 0.07569380849599838} 11/07/2021 10:57:05 - INFO - __main__ - Step 97615: {'lr': 0.00013928189366958101, 'samples': 18742080, 'steps': 97614, 'loss/train': 1.2961562871932983} 11/07/2021 10:57:06 - INFO - __main__ - Step 97616: {'lr': 0.00013927713574906042, 'samples': 18742272, 'steps': 97615, 'loss/train': 1.4161089658737183} 11/07/2021 10:57:06 - INFO - __main__ - Step 97617: {'lr': 0.00013927237787842987, 'samples': 18742464, 'steps': 97616, 'loss/train': 1.5609482526779175} 11/07/2021 10:57:07 - INFO - __main__ - Step 97618: {'lr': 0.00013926762005769144, 'samples': 18742656, 'steps': 97617, 'loss/train': 1.2912408113479614} 11/07/2021 10:57:07 - INFO - __main__ - Step 97619: {'lr': 0.00013926286228684734, 'samples': 18742848, 'steps': 97618, 'loss/train': 1.1344492435455322} 11/07/2021 10:57:07 - INFO - __main__ - Step 97620: {'lr': 0.00013925810456589968, 'samples': 18743040, 'steps': 97619, 'loss/train': 1.8807318210601807} 11/07/2021 10:57:09 - INFO - __main__ - Step 97621: {'lr': 0.00013925334689485062, 'samples': 18743232, 'steps': 97620, 'loss/train': 1.8133480548858643} 11/07/2021 10:57:09 - INFO - __main__ - Step 97622: {'lr': 0.00013924858927370225, 'samples': 18743424, 'steps': 97621, 'loss/train': 1.40426504611969} 11/07/2021 10:57:10 - INFO - __main__ - Step 97623: {'lr': 0.0001392438317024568, 'samples': 18743616, 'steps': 97622, 'loss/train': 0.8882939219474792} 11/07/2021 10:57:10 - INFO - __main__ - Step 97624: {'lr': 0.00013923907418111637, 'samples': 18743808, 'steps': 97623, 'loss/train': 1.5379343032836914} 11/07/2021 10:57:10 - INFO - __main__ - Step 97625: {'lr': 0.00013923431670968307, 'samples': 18744000, 'steps': 97624, 'loss/train': 0.04709068313241005} 11/07/2021 10:57:11 - INFO - __main__ - Step 97626: {'lr': 0.00013922955928815913, 'samples': 18744192, 'steps': 97625, 'loss/train': 1.0719939470291138} 11/07/2021 10:57:12 - INFO - __main__ - Step 97627: {'lr': 0.0001392248019165467, 'samples': 18744384, 'steps': 97626, 'loss/train': 1.6489514112472534} 11/07/2021 10:57:12 - INFO - __main__ - Step 97628: {'lr': 0.00013922004459484774, 'samples': 18744576, 'steps': 97627, 'loss/train': 1.3750640153884888} 11/07/2021 10:57:12 - INFO - __main__ - Step 97629: {'lr': 0.00013921528732306455, 'samples': 18744768, 'steps': 97628, 'loss/train': 1.3567818403244019} 11/07/2021 10:57:13 - INFO - __main__ - Step 97630: {'lr': 0.00013921053010119928, 'samples': 18744960, 'steps': 97629, 'loss/train': 1.4993786811828613} 11/07/2021 10:57:14 - INFO - __main__ - Step 97631: {'lr': 0.00013920577292925396, 'samples': 18745152, 'steps': 97630, 'loss/train': 1.4595096111297607} 11/07/2021 10:57:14 - INFO - __main__ - Step 97632: {'lr': 0.0001392010158072309, 'samples': 18745344, 'steps': 97631, 'loss/train': 1.4855294227600098} 11/07/2021 10:57:14 - INFO - __main__ - Step 97633: {'lr': 0.00013919625873513205, 'samples': 18745536, 'steps': 97632, 'loss/train': 1.3545936346054077} 11/07/2021 10:57:15 - INFO - __main__ - Step 97634: {'lr': 0.00013919150171295971, 'samples': 18745728, 'steps': 97633, 'loss/train': 0.8753668665885925} 11/07/2021 10:57:15 - INFO - __main__ - Step 97635: {'lr': 0.00013918674474071597, 'samples': 18745920, 'steps': 97634, 'loss/train': 0.08373796939849854} 11/07/2021 10:57:16 - INFO - __main__ - Step 97636: {'lr': 0.00013918198781840297, 'samples': 18746112, 'steps': 97635, 'loss/train': 1.3063126802444458} 11/07/2021 10:57:17 - INFO - __main__ - Step 97637: {'lr': 0.00013917723094602287, 'samples': 18746304, 'steps': 97636, 'loss/train': 0.8966081738471985} 11/07/2021 10:57:17 - INFO - __main__ - Step 97638: {'lr': 0.00013917247412357776, 'samples': 18746496, 'steps': 97637, 'loss/train': 0.8988198041915894} 11/07/2021 10:57:17 - INFO - __main__ - Step 97639: {'lr': 0.00013916771735106987, 'samples': 18746688, 'steps': 97638, 'loss/train': 1.3314586877822876} 11/07/2021 10:57:18 - INFO - __main__ - Step 97640: {'lr': 0.00013916296062850125, 'samples': 18746880, 'steps': 97639, 'loss/train': 1.2634917497634888} 11/07/2021 10:57:18 - INFO - __main__ - Step 97641: {'lr': 0.00013915820395587423, 'samples': 18747072, 'steps': 97640, 'loss/train': 1.4670294523239136} 11/07/2021 10:57:19 - INFO - __main__ - Step 97642: {'lr': 0.00013915344733319069, 'samples': 18747264, 'steps': 97641, 'loss/train': 1.2802152633666992} 11/07/2021 10:57:19 - INFO - __main__ - Step 97643: {'lr': 0.0001391486907604529, 'samples': 18747456, 'steps': 97642, 'loss/train': 1.2723978757858276} 11/07/2021 10:57:20 - INFO - __main__ - Step 97644: {'lr': 0.000139143934237663, 'samples': 18747648, 'steps': 97643, 'loss/train': 1.3853482007980347} 11/07/2021 10:57:20 - INFO - __main__ - Step 97645: {'lr': 0.00013913917776482316, 'samples': 18747840, 'steps': 97644, 'loss/train': 1.1805591583251953} 11/07/2021 10:57:20 - INFO - __main__ - Step 97646: {'lr': 0.00013913442134193545, 'samples': 18748032, 'steps': 97645, 'loss/train': 1.143354892730713} 11/07/2021 10:57:21 - INFO - __main__ - Step 97647: {'lr': 0.00013912966496900208, 'samples': 18748224, 'steps': 97646, 'loss/train': 1.3973604440689087} 11/07/2021 10:57:22 - INFO - __main__ - Step 97648: {'lr': 0.00013912490864602517, 'samples': 18748416, 'steps': 97647, 'loss/train': 1.1751329898834229} 11/07/2021 10:57:22 - INFO - __main__ - Step 97649: {'lr': 0.00013912015237300688, 'samples': 18748608, 'steps': 97648, 'loss/train': 1.0514097213745117} 11/07/2021 10:57:22 - INFO - __main__ - Step 97650: {'lr': 0.0001391153961499493, 'samples': 18748800, 'steps': 97649, 'loss/train': 1.3411997556686401} 11/07/2021 10:57:23 - INFO - __main__ - Step 97651: {'lr': 0.00013911063997685465, 'samples': 18748992, 'steps': 97650, 'loss/train': 1.1353542804718018} 11/07/2021 10:57:24 - INFO - __main__ - Step 97652: {'lr': 0.00013910588385372504, 'samples': 18749184, 'steps': 97651, 'loss/train': 1.1537376642227173} 11/07/2021 10:57:24 - INFO - __main__ - Step 97653: {'lr': 0.00013910112778056256, 'samples': 18749376, 'steps': 97652, 'loss/train': 1.4766467809677124} 11/07/2021 10:57:25 - INFO - __main__ - Step 97654: {'lr': 0.00013909637175736956, 'samples': 18749568, 'steps': 97653, 'loss/train': 1.1913396120071411} 11/07/2021 10:57:25 - INFO - __main__ - Step 97655: {'lr': 0.00013909161578414786, 'samples': 18749760, 'steps': 97654, 'loss/train': 1.6282063722610474} 11/07/2021 10:57:25 - INFO - __main__ - Step 97656: {'lr': 0.0001390868598608998, 'samples': 18749952, 'steps': 97655, 'loss/train': 1.2447046041488647} 11/07/2021 10:57:26 - INFO - __main__ - Step 97657: {'lr': 0.0001390821039876275, 'samples': 18750144, 'steps': 97656, 'loss/train': 1.933139681816101} 11/07/2021 10:57:27 - INFO - __main__ - Step 97658: {'lr': 0.0001390773481643331, 'samples': 18750336, 'steps': 97657, 'loss/train': 1.243727445602417} 11/07/2021 10:57:27 - INFO - __main__ - Step 97659: {'lr': 0.0001390725923910187, 'samples': 18750528, 'steps': 97658, 'loss/train': 1.0365955829620361} 11/07/2021 10:57:27 - INFO - __main__ - Step 97660: {'lr': 0.00013906783666768648, 'samples': 18750720, 'steps': 97659, 'loss/train': 1.51374351978302} 11/07/2021 10:57:28 - INFO - __main__ - Step 97661: {'lr': 0.00013906308099433863, 'samples': 18750912, 'steps': 97660, 'loss/train': 1.4730310440063477} 11/07/2021 10:57:29 - INFO - __main__ - Step 97662: {'lr': 0.0001390583253709772, 'samples': 18751104, 'steps': 97661, 'loss/train': 1.4511882066726685} 11/07/2021 10:57:29 - INFO - __main__ - Step 97663: {'lr': 0.00013905356979760438, 'samples': 18751296, 'steps': 97662, 'loss/train': 1.6566131114959717} 11/07/2021 10:57:29 - INFO - __main__ - Step 97664: {'lr': 0.0001390488142742223, 'samples': 18751488, 'steps': 97663, 'loss/train': 1.204357624053955} 11/07/2021 10:57:30 - INFO - __main__ - Step 97665: {'lr': 0.00013904405880083316, 'samples': 18751680, 'steps': 97664, 'loss/train': 1.4444326162338257} 11/07/2021 10:57:30 - INFO - __main__ - Step 97666: {'lr': 0.000139039303377439, 'samples': 18751872, 'steps': 97665, 'loss/train': 1.1565755605697632} 11/07/2021 10:57:31 - INFO - __main__ - Step 97667: {'lr': 0.00013903454800404203, 'samples': 18752064, 'steps': 97666, 'loss/train': 0.8943421840667725} 11/07/2021 10:57:31 - INFO - __main__ - Step 97668: {'lr': 0.0001390297926806445, 'samples': 18752256, 'steps': 97667, 'loss/train': 1.5312024354934692} 11/07/2021 10:57:32 - INFO - __main__ - Step 97669: {'lr': 0.0001390250374072483, 'samples': 18752448, 'steps': 97668, 'loss/train': 1.1790367364883423} 11/07/2021 10:57:32 - INFO - __main__ - Step 97670: {'lr': 0.00013902028218385577, 'samples': 18752640, 'steps': 97669, 'loss/train': 1.1026115417480469} 11/07/2021 10:57:32 - INFO - __main__ - Step 97671: {'lr': 0.00013901552701046894, 'samples': 18752832, 'steps': 97670, 'loss/train': 1.8291152715682983} 11/07/2021 10:57:34 - INFO - __main__ - Step 97672: {'lr': 0.00013901077188708998, 'samples': 18753024, 'steps': 97671, 'loss/train': 0.80275559425354} 11/07/2021 10:57:34 - INFO - __main__ - Step 97673: {'lr': 0.0001390060168137211, 'samples': 18753216, 'steps': 97672, 'loss/train': 0.43765413761138916} 11/07/2021 10:57:34 - INFO - __main__ - Step 97674: {'lr': 0.00013900126179036438, 'samples': 18753408, 'steps': 97673, 'loss/train': 1.443755030632019} 11/07/2021 10:57:35 - INFO - __main__ - Step 97675: {'lr': 0.00013899650681702198, 'samples': 18753600, 'steps': 97674, 'loss/train': 1.3758437633514404} 11/07/2021 10:57:35 - INFO - __main__ - Step 97676: {'lr': 0.00013899175189369603, 'samples': 18753792, 'steps': 97675, 'loss/train': 1.3381612300872803} 11/07/2021 10:57:35 - INFO - __main__ - Step 97677: {'lr': 0.0001389869970203887, 'samples': 18753984, 'steps': 97676, 'loss/train': 1.3061752319335938} 11/07/2021 10:57:36 - INFO - __main__ - Step 97678: {'lr': 0.0001389822421971021, 'samples': 18754176, 'steps': 97677, 'loss/train': 0.4754769206047058} 11/07/2021 10:57:37 - INFO - __main__ - Step 97679: {'lr': 0.0001389774874238384, 'samples': 18754368, 'steps': 97678, 'loss/train': 0.906046986579895} 11/07/2021 10:57:37 - INFO - __main__ - Step 97680: {'lr': 0.00013897273270059975, 'samples': 18754560, 'steps': 97679, 'loss/train': 2.6299078464508057} 11/07/2021 10:57:38 - INFO - __main__ - Step 97681: {'lr': 0.0001389679780273883, 'samples': 18754752, 'steps': 97680, 'loss/train': 1.2001314163208008} 11/07/2021 10:57:38 - INFO - __main__ - Step 97682: {'lr': 0.00013896322340420614, 'samples': 18754944, 'steps': 97681, 'loss/train': 1.0547173023223877} 11/07/2021 10:57:39 - INFO - __main__ - Step 97683: {'lr': 0.0001389584688310554, 'samples': 18755136, 'steps': 97682, 'loss/train': 1.5996836423873901} 11/07/2021 10:57:39 - INFO - __main__ - Step 97684: {'lr': 0.0001389537143079383, 'samples': 18755328, 'steps': 97683, 'loss/train': 1.5943596363067627} 11/07/2021 10:57:40 - INFO - __main__ - Step 97685: {'lr': 0.0001389489598348569, 'samples': 18755520, 'steps': 97684, 'loss/train': 1.429070234298706} 11/07/2021 10:57:40 - INFO - __main__ - Step 97686: {'lr': 0.0001389442054118134, 'samples': 18755712, 'steps': 97685, 'loss/train': 1.2315887212753296} 11/07/2021 10:57:40 - INFO - __main__ - Step 97687: {'lr': 0.00013893945103880996, 'samples': 18755904, 'steps': 97686, 'loss/train': 1.339682936668396} 11/07/2021 10:57:41 - INFO - __main__ - Step 97688: {'lr': 0.00013893469671584862, 'samples': 18756096, 'steps': 97687, 'loss/train': 1.3345526456832886} 11/07/2021 10:57:42 - INFO - __main__ - Step 97689: {'lr': 0.00013892994244293168, 'samples': 18756288, 'steps': 97688, 'loss/train': 1.2060128450393677} 11/07/2021 10:57:42 - INFO - __main__ - Step 97690: {'lr': 0.00013892518822006112, 'samples': 18756480, 'steps': 97689, 'loss/train': 1.0855151414871216} 11/07/2021 10:57:42 - INFO - __main__ - Step 97691: {'lr': 0.0001389204340472392, 'samples': 18756672, 'steps': 97690, 'loss/train': 1.192405104637146} 11/07/2021 10:57:43 - INFO - __main__ - Step 97692: {'lr': 0.00013891567992446797, 'samples': 18756864, 'steps': 97691, 'loss/train': 1.3824615478515625} 11/07/2021 10:57:44 - INFO - __main__ - Step 97693: {'lr': 0.00013891092585174966, 'samples': 18757056, 'steps': 97692, 'loss/train': 1.297284483909607} 11/07/2021 10:57:44 - INFO - __main__ - Step 97694: {'lr': 0.0001389061718290864, 'samples': 18757248, 'steps': 97693, 'loss/train': 1.2934437990188599} 11/07/2021 10:57:44 - INFO - __main__ - Step 97695: {'lr': 0.00013890141785648032, 'samples': 18757440, 'steps': 97694, 'loss/train': 1.2409346103668213} 11/07/2021 10:57:45 - INFO - __main__ - Step 97696: {'lr': 0.00013889666393393353, 'samples': 18757632, 'steps': 97695, 'loss/train': 1.022942066192627} 11/07/2021 10:57:45 - INFO - __main__ - Step 97697: {'lr': 0.00013889191006144814, 'samples': 18757824, 'steps': 97696, 'loss/train': 1.435518741607666} 11/07/2021 10:57:46 - INFO - __main__ - Step 97698: {'lr': 0.00013888715623902633, 'samples': 18758016, 'steps': 97697, 'loss/train': 1.3365099430084229} 11/07/2021 10:57:46 - INFO - __main__ - Step 97699: {'lr': 0.00013888240246667026, 'samples': 18758208, 'steps': 97698, 'loss/train': 1.2471529245376587} 11/07/2021 10:57:47 - INFO - __main__ - Step 97700: {'lr': 0.00013887764874438214, 'samples': 18758400, 'steps': 97699, 'loss/train': 1.147447943687439} 11/07/2021 10:57:47 - INFO - __main__ - Step 97701: {'lr': 0.00013887289507216394, 'samples': 18758592, 'steps': 97700, 'loss/train': 1.3646489381790161} 11/07/2021 10:57:47 - INFO - __main__ - Step 97702: {'lr': 0.00013886814145001796, 'samples': 18758784, 'steps': 97701, 'loss/train': 0.8928151726722717} 11/07/2021 10:57:48 - INFO - __main__ - Step 97703: {'lr': 0.00013886338787794626, 'samples': 18758976, 'steps': 97702, 'loss/train': 1.055133581161499} 11/07/2021 10:57:49 - INFO - __main__ - Step 97704: {'lr': 0.00013885863435595096, 'samples': 18759168, 'steps': 97703, 'loss/train': 1.4654619693756104} 11/07/2021 10:57:49 - INFO - __main__ - Step 97705: {'lr': 0.00013885388088403434, 'samples': 18759360, 'steps': 97704, 'loss/train': 1.0675325393676758} 11/07/2021 10:57:50 - INFO - __main__ - Step 97706: {'lr': 0.00013884912746219835, 'samples': 18759552, 'steps': 97705, 'loss/train': 1.4126904010772705} 11/07/2021 10:57:50 - INFO - __main__ - Step 97707: {'lr': 0.00013884437409044528, 'samples': 18759744, 'steps': 97706, 'loss/train': 1.1306843757629395} 11/07/2021 10:57:50 - INFO - __main__ - Step 97708: {'lr': 0.00013883962076877731, 'samples': 18759936, 'steps': 97707, 'loss/train': 1.4923830032348633} 11/07/2021 10:57:51 - INFO - __main__ - Step 97709: {'lr': 0.0001388348674971964, 'samples': 18760128, 'steps': 97708, 'loss/train': 1.296794056892395} 11/07/2021 10:57:52 - INFO - __main__ - Step 97710: {'lr': 0.00013883011427570478, 'samples': 18760320, 'steps': 97709, 'loss/train': 1.4990310668945312} 11/07/2021 10:57:52 - INFO - __main__ - Step 97711: {'lr': 0.00013882536110430458, 'samples': 18760512, 'steps': 97710, 'loss/train': 1.742354393005371} 11/07/2021 10:57:52 - INFO - __main__ - Step 97712: {'lr': 0.00013882060798299796, 'samples': 18760704, 'steps': 97711, 'loss/train': 1.0949538946151733} 11/07/2021 10:57:53 - INFO - __main__ - Step 97713: {'lr': 0.00013881585491178707, 'samples': 18760896, 'steps': 97712, 'loss/train': 1.418515920639038} 11/07/2021 10:57:54 - INFO - __main__ - Step 97714: {'lr': 0.00013881110189067404, 'samples': 18761088, 'steps': 97713, 'loss/train': 1.1943320035934448} 11/07/2021 10:57:54 - INFO - __main__ - Step 97715: {'lr': 0.00013880634891966099, 'samples': 18761280, 'steps': 97714, 'loss/train': 0.4869949519634247} 11/07/2021 10:57:54 - INFO - __main__ - Step 97716: {'lr': 0.00013880159599875008, 'samples': 18761472, 'steps': 97715, 'loss/train': 1.3742411136627197} 11/07/2021 10:57:55 - INFO - __main__ - Step 97717: {'lr': 0.0001387968431279435, 'samples': 18761664, 'steps': 97716, 'loss/train': 1.348921775817871} 11/07/2021 10:57:55 - INFO - __main__ - Step 97718: {'lr': 0.00013879209030724331, 'samples': 18761856, 'steps': 97717, 'loss/train': 1.5578818321228027} 11/07/2021 10:57:56 - INFO - __main__ - Step 97719: {'lr': 0.0001387873375366517, 'samples': 18762048, 'steps': 97718, 'loss/train': 1.0411326885223389} 11/07/2021 10:57:57 - INFO - __main__ - Step 97720: {'lr': 0.00013878258481617078, 'samples': 18762240, 'steps': 97719, 'loss/train': 1.3457250595092773} 11/07/2021 10:57:57 - INFO - __main__ - Step 97721: {'lr': 0.00013877783214580276, 'samples': 18762432, 'steps': 97720, 'loss/train': 0.9105295538902283} 11/07/2021 10:57:57 - INFO - __main__ - Step 97722: {'lr': 0.0001387730795255498, 'samples': 18762624, 'steps': 97721, 'loss/train': 1.2610852718353271} 11/07/2021 10:57:58 - INFO - __main__ - Step 97723: {'lr': 0.00013876832695541386, 'samples': 18762816, 'steps': 97722, 'loss/train': 1.4529049396514893} 11/07/2021 10:57:59 - INFO - __main__ - Step 97724: {'lr': 0.00013876357443539722, 'samples': 18763008, 'steps': 97723, 'loss/train': 1.4909645318984985} 11/07/2021 10:58:00 - INFO - __main__ - Step 97725: {'lr': 0.00013875882196550199, 'samples': 18763200, 'steps': 97724, 'loss/train': 1.2468308210372925} 11/07/2021 10:58:00 - INFO - __main__ - Step 97726: {'lr': 0.00013875406954573033, 'samples': 18763392, 'steps': 97725, 'loss/train': 0.11288827657699585} 11/07/2021 10:58:00 - INFO - __main__ - Step 97727: {'lr': 0.00013874931717608436, 'samples': 18763584, 'steps': 97726, 'loss/train': 1.031337022781372} 11/07/2021 10:58:01 - INFO - __main__ - Step 97728: {'lr': 0.00013874456485656622, 'samples': 18763776, 'steps': 97727, 'loss/train': 1.2865047454833984} 11/07/2021 10:58:02 - INFO - __main__ - Step 97729: {'lr': 0.00013873981258717805, 'samples': 18763968, 'steps': 97728, 'loss/train': 1.4176887273788452} 11/07/2021 10:58:02 - INFO - __main__ - Step 97730: {'lr': 0.00013873506036792205, 'samples': 18764160, 'steps': 97729, 'loss/train': 1.4891890287399292} 11/07/2021 10:58:02 - INFO - __main__ - Step 97731: {'lr': 0.00013873030819880027, 'samples': 18764352, 'steps': 97730, 'loss/train': 1.258371114730835} 11/07/2021 10:58:03 - INFO - __main__ - Step 97732: {'lr': 0.0001387255560798149, 'samples': 18764544, 'steps': 97731, 'loss/train': 1.4618405103683472} 11/07/2021 10:58:03 - INFO - __main__ - Step 97733: {'lr': 0.0001387208040109681, 'samples': 18764736, 'steps': 97732, 'loss/train': 1.0841776132583618} 11/07/2021 10:58:04 - INFO - __main__ - Step 97734: {'lr': 0.000138716051992262, 'samples': 18764928, 'steps': 97733, 'loss/train': 1.225343108177185} 11/07/2021 10:58:05 - INFO - __main__ - Step 97735: {'lr': 0.0001387113000236988, 'samples': 18765120, 'steps': 97734, 'loss/train': 1.2021269798278809} 11/07/2021 10:58:05 - INFO - __main__ - Step 97736: {'lr': 0.0001387065481052805, 'samples': 18765312, 'steps': 97735, 'loss/train': 1.2443253993988037} 11/07/2021 10:58:05 - INFO - __main__ - Step 97737: {'lr': 0.00013870179623700927, 'samples': 18765504, 'steps': 97736, 'loss/train': 1.2780922651290894} 11/07/2021 10:58:06 - INFO - __main__ - Step 97738: {'lr': 0.00013869704441888731, 'samples': 18765696, 'steps': 97737, 'loss/train': 0.14031903445720673} 11/07/2021 10:58:06 - INFO - __main__ - Step 97739: {'lr': 0.00013869229265091676, 'samples': 18765888, 'steps': 97738, 'loss/train': 1.5273605585098267} 11/07/2021 10:58:07 - INFO - __main__ - Step 97740: {'lr': 0.00013868754093309974, 'samples': 18766080, 'steps': 97739, 'loss/train': 1.2378721237182617} 11/07/2021 10:58:08 - INFO - __main__ - Step 97741: {'lr': 0.00013868278926543838, 'samples': 18766272, 'steps': 97740, 'loss/train': 0.665865421295166} 11/07/2021 10:58:08 - INFO - __main__ - Step 97742: {'lr': 0.00013867803764793486, 'samples': 18766464, 'steps': 97741, 'loss/train': 1.0721839666366577} 11/07/2021 10:58:08 - INFO - __main__ - Step 97743: {'lr': 0.00013867328608059126, 'samples': 18766656, 'steps': 97742, 'loss/train': 1.334787368774414} 11/07/2021 10:58:09 - INFO - __main__ - Step 97744: {'lr': 0.0001386685345634098, 'samples': 18766848, 'steps': 97743, 'loss/train': 0.12306948006153107} 11/07/2021 10:58:10 - INFO - __main__ - Step 97745: {'lr': 0.00013866378309639258, 'samples': 18767040, 'steps': 97744, 'loss/train': 1.3754245042800903} 11/07/2021 10:58:10 - INFO - __main__ - Step 97746: {'lr': 0.0001386590316795417, 'samples': 18767232, 'steps': 97745, 'loss/train': 1.2482469081878662} 11/07/2021 10:58:10 - INFO - __main__ - Step 97747: {'lr': 0.0001386542803128594, 'samples': 18767424, 'steps': 97746, 'loss/train': 0.9897463321685791} 11/07/2021 10:58:11 - INFO - __main__ - Step 97748: {'lr': 0.00013864952899634783, 'samples': 18767616, 'steps': 97747, 'loss/train': 1.541203498840332} 11/07/2021 10:58:11 - INFO - __main__ - Step 97749: {'lr': 0.00013864477773000897, 'samples': 18767808, 'steps': 97748, 'loss/train': 1.1011250019073486} 11/07/2021 10:58:12 - INFO - __main__ - Step 97750: {'lr': 0.00013864002651384506, 'samples': 18768000, 'steps': 97749, 'loss/train': 0.995804488658905} 11/07/2021 10:58:12 - INFO - __main__ - Step 97751: {'lr': 0.00013863527534785822, 'samples': 18768192, 'steps': 97750, 'loss/train': 1.6146491765975952} 11/07/2021 10:58:13 - INFO - __main__ - Step 97752: {'lr': 0.00013863052423205064, 'samples': 18768384, 'steps': 97751, 'loss/train': 1.1744351387023926} 11/07/2021 10:58:13 - INFO - __main__ - Step 97753: {'lr': 0.00013862577316642438, 'samples': 18768576, 'steps': 97752, 'loss/train': 1.198941707611084} 11/07/2021 10:58:13 - INFO - __main__ - Step 97754: {'lr': 0.00013862102215098166, 'samples': 18768768, 'steps': 97753, 'loss/train': 1.2872271537780762} 11/07/2021 10:58:15 - INFO - __main__ - Step 97755: {'lr': 0.00013861627118572455, 'samples': 18768960, 'steps': 97754, 'loss/train': 0.9288464188575745} 11/07/2021 10:58:15 - INFO - __main__ - Step 97756: {'lr': 0.00013861152027065527, 'samples': 18769152, 'steps': 97755, 'loss/train': 0.9306029081344604} 11/07/2021 10:58:15 - INFO - __main__ - Step 97757: {'lr': 0.00013860676940577593, 'samples': 18769344, 'steps': 97756, 'loss/train': 0.906252920627594} 11/07/2021 10:58:16 - INFO - __main__ - Step 97758: {'lr': 0.00013860201859108861, 'samples': 18769536, 'steps': 97757, 'loss/train': 1.3887377977371216} 11/07/2021 10:58:16 - INFO - __main__ - Step 97759: {'lr': 0.00013859726782659555, 'samples': 18769728, 'steps': 97758, 'loss/train': 1.0511966943740845} 11/07/2021 10:58:17 - INFO - __main__ - Step 97760: {'lr': 0.0001385925171122988, 'samples': 18769920, 'steps': 97759, 'loss/train': 1.3939402103424072} 11/07/2021 10:58:17 - INFO - __main__ - Step 97761: {'lr': 0.00013858776644820058, 'samples': 18770112, 'steps': 97760, 'loss/train': 0.9793931245803833} 11/07/2021 10:58:18 - INFO - __main__ - Step 97762: {'lr': 0.0001385830158343031, 'samples': 18770304, 'steps': 97761, 'loss/train': 1.3455511331558228} 11/07/2021 10:58:18 - INFO - __main__ - Step 97763: {'lr': 0.00013857826527060823, 'samples': 18770496, 'steps': 97762, 'loss/train': 0.800258457660675} 11/07/2021 10:58:18 - INFO - __main__ - Step 97764: {'lr': 0.00013857351475711832, 'samples': 18770688, 'steps': 97763, 'loss/train': 1.1189422607421875} 11/07/2021 10:58:20 - INFO - __main__ - Step 97765: {'lr': 0.00013856876429383546, 'samples': 18770880, 'steps': 97764, 'loss/train': 1.29975426197052} 11/07/2021 10:58:20 - INFO - __main__ - Step 97766: {'lr': 0.00013856401388076184, 'samples': 18771072, 'steps': 97765, 'loss/train': 1.2204722166061401} 11/07/2021 10:58:20 - INFO - __main__ - Step 97767: {'lr': 0.0001385592635178995, 'samples': 18771264, 'steps': 97766, 'loss/train': 1.0780177116394043} 11/07/2021 10:58:21 - INFO - __main__ - Step 97768: {'lr': 0.00013855451320525064, 'samples': 18771456, 'steps': 97767, 'loss/train': 1.0636643171310425} 11/07/2021 10:58:21 - INFO - __main__ - Step 97769: {'lr': 0.0001385497629428174, 'samples': 18771648, 'steps': 97768, 'loss/train': 1.4031038284301758} 11/07/2021 10:58:22 - INFO - __main__ - Step 97770: {'lr': 0.00013854501273060193, 'samples': 18771840, 'steps': 97769, 'loss/train': 1.5529146194458008} 11/07/2021 10:58:22 - INFO - __main__ - Step 97771: {'lr': 0.00013854026256860635, 'samples': 18772032, 'steps': 97770, 'loss/train': 1.3972556591033936} 11/07/2021 10:58:23 - INFO - __main__ - Step 97772: {'lr': 0.00013853551245683282, 'samples': 18772224, 'steps': 97771, 'loss/train': 1.314024806022644} 11/07/2021 10:58:23 - INFO - __main__ - Step 97773: {'lr': 0.00013853076239528345, 'samples': 18772416, 'steps': 97772, 'loss/train': 1.403512716293335} 11/07/2021 10:58:23 - INFO - __main__ - Step 97774: {'lr': 0.0001385260123839604, 'samples': 18772608, 'steps': 97773, 'loss/train': 1.0143383741378784} 11/07/2021 10:58:24 - INFO - __main__ - Step 97775: {'lr': 0.00013852126242286592, 'samples': 18772800, 'steps': 97774, 'loss/train': 1.4825924634933472} 11/07/2021 10:58:25 - INFO - __main__ - Step 97776: {'lr': 0.00013851651251200193, 'samples': 18772992, 'steps': 97775, 'loss/train': 1.1358917951583862} 11/07/2021 10:58:25 - INFO - __main__ - Step 97777: {'lr': 0.00013851176265137067, 'samples': 18773184, 'steps': 97776, 'loss/train': 0.1959705799818039} 11/07/2021 10:58:26 - INFO - __main__ - Step 97778: {'lr': 0.0001385070128409743, 'samples': 18773376, 'steps': 97777, 'loss/train': 1.6044483184814453} 11/07/2021 10:58:26 - INFO - __main__ - Step 97779: {'lr': 0.00013850226308081498, 'samples': 18773568, 'steps': 97778, 'loss/train': 1.5680464506149292} 11/07/2021 10:58:26 - INFO - __main__ - Step 97780: {'lr': 0.00013849751337089477, 'samples': 18773760, 'steps': 97779, 'loss/train': 1.502947449684143} 11/07/2021 10:58:27 - INFO - __main__ - Step 97781: {'lr': 0.0001384927637112159, 'samples': 18773952, 'steps': 97780, 'loss/train': 1.6958340406417847} 11/07/2021 10:58:28 - INFO - __main__ - Step 97782: {'lr': 0.0001384880141017804, 'samples': 18774144, 'steps': 97781, 'loss/train': 1.433398962020874} 11/07/2021 10:58:28 - INFO - __main__ - Step 97783: {'lr': 0.0001384832645425906, 'samples': 18774336, 'steps': 97782, 'loss/train': 1.931803584098816} 11/07/2021 10:58:28 - INFO - __main__ - Step 97784: {'lr': 0.00013847851503364842, 'samples': 18774528, 'steps': 97783, 'loss/train': 1.5326464176177979} 11/07/2021 10:58:29 - INFO - __main__ - Step 97785: {'lr': 0.00013847376557495612, 'samples': 18774720, 'steps': 97784, 'loss/train': 1.7706862688064575} 11/07/2021 10:58:30 - INFO - __main__ - Step 97786: {'lr': 0.00013846901616651583, 'samples': 18774912, 'steps': 97785, 'loss/train': 1.4897940158843994} 11/07/2021 10:58:30 - INFO - __main__ - Step 97787: {'lr': 0.0001384642668083297, 'samples': 18775104, 'steps': 97786, 'loss/train': 1.601722240447998} 11/07/2021 10:58:31 - INFO - __main__ - Step 97788: {'lr': 0.0001384595175003998, 'samples': 18775296, 'steps': 97787, 'loss/train': 1.3602982759475708} 11/07/2021 10:58:31 - INFO - __main__ - Step 97789: {'lr': 0.00013845476824272845, 'samples': 18775488, 'steps': 97788, 'loss/train': 0.04993930086493492} 11/07/2021 10:58:31 - INFO - __main__ - Step 97790: {'lr': 0.00013845001903531757, 'samples': 18775680, 'steps': 97789, 'loss/train': 1.4351909160614014} 11/07/2021 10:58:32 - INFO - __main__ - Step 97791: {'lr': 0.0001384452698781694, 'samples': 18775872, 'steps': 97790, 'loss/train': 1.0333470106124878} 11/07/2021 10:58:33 - INFO - __main__ - Step 97792: {'lr': 0.00013844052077128605, 'samples': 18776064, 'steps': 97791, 'loss/train': 1.3434137105941772} 11/07/2021 10:58:33 - INFO - __main__ - Step 97793: {'lr': 0.00013843577171466966, 'samples': 18776256, 'steps': 97792, 'loss/train': 1.64992356300354} 11/07/2021 10:58:33 - INFO - __main__ - Step 97794: {'lr': 0.00013843102270832242, 'samples': 18776448, 'steps': 97793, 'loss/train': 0.9748687744140625} 11/07/2021 10:58:34 - INFO - __main__ - Step 97795: {'lr': 0.00013842627375224644, 'samples': 18776640, 'steps': 97794, 'loss/train': 1.5992052555084229} 11/07/2021 10:58:35 - INFO - __main__ - Step 97796: {'lr': 0.00013842152484644385, 'samples': 18776832, 'steps': 97795, 'loss/train': 1.7630709409713745} 11/07/2021 10:58:35 - INFO - __main__ - Step 97797: {'lr': 0.0001384167759909168, 'samples': 18777024, 'steps': 97796, 'loss/train': 0.24894589185714722} 11/07/2021 10:58:36 - INFO - __main__ - Step 97798: {'lr': 0.00013841202718566743, 'samples': 18777216, 'steps': 97797, 'loss/train': 1.6531360149383545} 11/07/2021 10:58:36 - INFO - __main__ - Step 97799: {'lr': 0.00013840727843069788, 'samples': 18777408, 'steps': 97798, 'loss/train': 1.9782946109771729} 11/07/2021 10:58:37 - INFO - __main__ - Step 97800: {'lr': 0.00013840252972601027, 'samples': 18777600, 'steps': 97799, 'loss/train': 0.7635474801063538} 11/07/2021 10:58:38 - INFO - __main__ - Step 97801: {'lr': 0.0001383977810716068, 'samples': 18777792, 'steps': 97800, 'loss/train': 1.2788256406784058} 11/07/2021 10:58:38 - INFO - __main__ - Step 97802: {'lr': 0.00013839303246748964, 'samples': 18777984, 'steps': 97801, 'loss/train': 1.5144842863082886} 11/07/2021 10:58:38 - INFO - __main__ - Step 97803: {'lr': 0.00013838828391366076, 'samples': 18778176, 'steps': 97802, 'loss/train': 1.4288488626480103} 11/07/2021 10:58:39 - INFO - __main__ - Step 97804: {'lr': 0.00013838353541012239, 'samples': 18778368, 'steps': 97803, 'loss/train': 1.6507028341293335} 11/07/2021 10:58:39 - INFO - __main__ - Step 97805: {'lr': 0.00013837878695687668, 'samples': 18778560, 'steps': 97804, 'loss/train': 1.4010801315307617} 11/07/2021 10:58:40 - INFO - __main__ - Step 97806: {'lr': 0.00013837403855392579, 'samples': 18778752, 'steps': 97805, 'loss/train': 0.9837906360626221} 11/07/2021 10:58:40 - INFO - __main__ - Step 97807: {'lr': 0.0001383692902012718, 'samples': 18778944, 'steps': 97806, 'loss/train': 1.400866985321045} 11/07/2021 10:58:41 - INFO - __main__ - Step 97808: {'lr': 0.00013836454189891689, 'samples': 18779136, 'steps': 97807, 'loss/train': 1.52085542678833} 11/07/2021 10:58:41 - INFO - __main__ - Step 97809: {'lr': 0.0001383597936468632, 'samples': 18779328, 'steps': 97808, 'loss/train': 0.9794755578041077} 11/07/2021 10:58:42 - INFO - __main__ - Step 97810: {'lr': 0.00013835504544511284, 'samples': 18779520, 'steps': 97809, 'loss/train': 1.2478214502334595} 11/07/2021 10:58:42 - INFO - __main__ - Step 97811: {'lr': 0.00013835029729366804, 'samples': 18779712, 'steps': 97810, 'loss/train': 1.365483045578003} 11/07/2021 10:58:43 - INFO - __main__ - Step 97812: {'lr': 0.00013834554919253084, 'samples': 18779904, 'steps': 97811, 'loss/train': 1.8968948125839233} 11/07/2021 10:58:43 - INFO - __main__ - Step 97813: {'lr': 0.00013834080114170339, 'samples': 18780096, 'steps': 97812, 'loss/train': 1.5305899381637573} 11/07/2021 10:58:44 - INFO - __main__ - Step 97814: {'lr': 0.00013833605314118785, 'samples': 18780288, 'steps': 97813, 'loss/train': 1.295364260673523} 11/07/2021 10:58:44 - INFO - __main__ - Step 97815: {'lr': 0.00013833130519098642, 'samples': 18780480, 'steps': 97814, 'loss/train': 1.5833625793457031} 11/07/2021 10:58:45 - INFO - __main__ - Step 97816: {'lr': 0.0001383265572911012, 'samples': 18780672, 'steps': 97815, 'loss/train': 0.09961318224668503} 11/07/2021 10:58:45 - INFO - __main__ - Step 97817: {'lr': 0.00013832180944153429, 'samples': 18780864, 'steps': 97816, 'loss/train': 1.2971062660217285} 11/07/2021 10:58:46 - INFO - __main__ - Step 97818: {'lr': 0.0001383170616422878, 'samples': 18781056, 'steps': 97817, 'loss/train': 1.301012396812439} 11/07/2021 10:58:46 - INFO - __main__ - Step 97819: {'lr': 0.00013831231389336394, 'samples': 18781248, 'steps': 97818, 'loss/train': 1.3533813953399658} 11/07/2021 10:58:46 - INFO - __main__ - Step 97820: {'lr': 0.00013830756619476482, 'samples': 18781440, 'steps': 97819, 'loss/train': 1.4317160844802856} 11/07/2021 10:58:47 - INFO - __main__ - Step 97821: {'lr': 0.00013830281854649258, 'samples': 18781632, 'steps': 97820, 'loss/train': 1.955767035484314} 11/07/2021 10:58:48 - INFO - __main__ - Step 97822: {'lr': 0.00013829807094854936, 'samples': 18781824, 'steps': 97821, 'loss/train': 1.4740169048309326} 11/07/2021 10:58:48 - INFO - __main__ - Step 97823: {'lr': 0.00013829332340093732, 'samples': 18782016, 'steps': 97822, 'loss/train': 1.5796066522598267} 11/07/2021 10:58:48 - INFO - __main__ - Step 97824: {'lr': 0.00013828857590365856, 'samples': 18782208, 'steps': 97823, 'loss/train': 1.502173662185669} 11/07/2021 10:58:49 - INFO - __main__ - Step 97825: {'lr': 0.0001382838284567153, 'samples': 18782400, 'steps': 97824, 'loss/train': 1.8072059154510498} 11/07/2021 10:58:50 - INFO - __main__ - Step 97826: {'lr': 0.00013827908106010955, 'samples': 18782592, 'steps': 97825, 'loss/train': 1.7217576503753662} 11/07/2021 10:58:50 - INFO - __main__ - Step 97827: {'lr': 0.00013827433371384356, 'samples': 18782784, 'steps': 97826, 'loss/train': 1.8410152196884155} 11/07/2021 10:58:50 - INFO - __main__ - Step 97828: {'lr': 0.00013826958641791957, 'samples': 18782976, 'steps': 97827, 'loss/train': 0.9201439023017883} 11/07/2021 10:58:51 - INFO - __main__ - Step 97829: {'lr': 0.00013826483917233945, 'samples': 18783168, 'steps': 97828, 'loss/train': 1.293258786201477} 11/07/2021 10:58:51 - INFO - __main__ - Step 97830: {'lr': 0.00013826009197710542, 'samples': 18783360, 'steps': 97829, 'loss/train': 1.6194086074829102} 11/07/2021 10:58:51 - INFO - __main__ - Step 97831: {'lr': 0.00013825534483221974, 'samples': 18783552, 'steps': 97830, 'loss/train': 1.4288361072540283} 11/07/2021 10:58:52 - INFO - __main__ - Step 97832: {'lr': 0.00013825059773768444, 'samples': 18783744, 'steps': 97831, 'loss/train': 1.5133665800094604} 11/07/2021 10:58:53 - INFO - __main__ - Step 97833: {'lr': 0.0001382458506935017, 'samples': 18783936, 'steps': 97832, 'loss/train': 1.5327401161193848} 11/07/2021 10:58:53 - INFO - __main__ - Step 97834: {'lr': 0.00013824110369967365, 'samples': 18784128, 'steps': 97833, 'loss/train': 1.1842252016067505} 11/07/2021 10:58:54 - INFO - __main__ - Step 97835: {'lr': 0.00013823635675620243, 'samples': 18784320, 'steps': 97834, 'loss/train': 1.4824907779693604} 11/07/2021 10:58:54 - INFO - __main__ - Step 97836: {'lr': 0.00013823160986309023, 'samples': 18784512, 'steps': 97835, 'loss/train': 1.2288199663162231} 11/07/2021 10:58:55 - INFO - __main__ - Step 97837: {'lr': 0.0001382268630203391, 'samples': 18784704, 'steps': 97836, 'loss/train': 1.1993662118911743} 11/07/2021 10:58:55 - INFO - __main__ - Step 97838: {'lr': 0.00013822211622795122, 'samples': 18784896, 'steps': 97837, 'loss/train': 1.360798954963684} 11/07/2021 10:58:56 - INFO - __main__ - Step 97839: {'lr': 0.00013821736948592883, 'samples': 18785088, 'steps': 97838, 'loss/train': 1.213954210281372} 11/07/2021 10:58:56 - INFO - __main__ - Step 97840: {'lr': 0.00013821262279427389, 'samples': 18785280, 'steps': 97839, 'loss/train': 1.5854350328445435} 11/07/2021 10:58:56 - INFO - __main__ - Step 97841: {'lr': 0.0001382078761529886, 'samples': 18785472, 'steps': 97840, 'loss/train': 1.218981385231018} 11/07/2021 10:58:57 - INFO - __main__ - Step 97842: {'lr': 0.00013820312956207512, 'samples': 18785664, 'steps': 97841, 'loss/train': 1.3757715225219727} 11/07/2021 10:58:58 - INFO - __main__ - Step 97843: {'lr': 0.0001381983830215356, 'samples': 18785856, 'steps': 97842, 'loss/train': 1.1398608684539795} 11/07/2021 10:58:58 - INFO - __main__ - Step 97844: {'lr': 0.00013819363653137212, 'samples': 18786048, 'steps': 97843, 'loss/train': 1.692622423171997} 11/07/2021 10:58:58 - INFO - __main__ - Step 97845: {'lr': 0.00013818889009158691, 'samples': 18786240, 'steps': 97844, 'loss/train': 1.1936395168304443} 11/07/2021 10:58:59 - INFO - __main__ - Step 97846: {'lr': 0.000138184143702182, 'samples': 18786432, 'steps': 97845, 'loss/train': 1.5855779647827148} 11/07/2021 10:59:00 - INFO - __main__ - Step 97847: {'lr': 0.00013817939736315965, 'samples': 18786624, 'steps': 97846, 'loss/train': 1.6251189708709717} 11/07/2021 10:59:00 - INFO - __main__ - Step 97848: {'lr': 0.00013817465107452193, 'samples': 18786816, 'steps': 97847, 'loss/train': 1.4240458011627197} 11/07/2021 10:59:01 - INFO - __main__ - Step 97849: {'lr': 0.00013816990483627098, 'samples': 18787008, 'steps': 97848, 'loss/train': 1.5555897951126099} 11/07/2021 10:59:01 - INFO - __main__ - Step 97850: {'lr': 0.00013816515864840904, 'samples': 18787200, 'steps': 97849, 'loss/train': 1.1922905445098877} 11/07/2021 10:59:01 - INFO - __main__ - Step 97851: {'lr': 0.00013816041251093805, 'samples': 18787392, 'steps': 97850, 'loss/train': 1.203029751777649} 11/07/2021 10:59:02 - INFO - __main__ - Step 97852: {'lr': 0.00013815566642386026, 'samples': 18787584, 'steps': 97851, 'loss/train': 1.1277598142623901} 11/07/2021 10:59:03 - INFO - __main__ - Step 97853: {'lr': 0.0001381509203871778, 'samples': 18787776, 'steps': 97852, 'loss/train': 1.4370912313461304} 11/07/2021 10:59:03 - INFO - __main__ - Step 97854: {'lr': 0.0001381461744008928, 'samples': 18787968, 'steps': 97853, 'loss/train': 1.0843935012817383} 11/07/2021 10:59:03 - INFO - __main__ - Step 97855: {'lr': 0.00013814142846500744, 'samples': 18788160, 'steps': 97854, 'loss/train': 1.2191779613494873} 11/07/2021 10:59:04 - INFO - __main__ - Step 97856: {'lr': 0.00013813668257952377, 'samples': 18788352, 'steps': 97855, 'loss/train': 0.8912981748580933} 11/07/2021 10:59:04 - INFO - __main__ - Step 97857: {'lr': 0.00013813193674444403, 'samples': 18788544, 'steps': 97856, 'loss/train': 1.2472200393676758} 11/07/2021 10:59:05 - INFO - __main__ - Step 97858: {'lr': 0.00013812719095977028, 'samples': 18788736, 'steps': 97857, 'loss/train': 1.0802514553070068} 11/07/2021 10:59:05 - INFO - __main__ - Step 97859: {'lr': 0.0001381224452255047, 'samples': 18788928, 'steps': 97858, 'loss/train': 0.9028496146202087} 11/07/2021 10:59:06 - INFO - __main__ - Step 97860: {'lr': 0.00013811769954164943, 'samples': 18789120, 'steps': 97859, 'loss/train': 1.486567735671997} 11/07/2021 10:59:06 - INFO - __main__ - Step 97861: {'lr': 0.0001381129539082067, 'samples': 18789312, 'steps': 97860, 'loss/train': 1.4750523567199707} 11/07/2021 10:59:06 - INFO - __main__ - Step 97862: {'lr': 0.00013810820832517846, 'samples': 18789504, 'steps': 97861, 'loss/train': 1.1642111539840698} 11/07/2021 10:59:08 - INFO - __main__ - Step 97863: {'lr': 0.00013810346279256693, 'samples': 18789696, 'steps': 97862, 'loss/train': 4.129947185516357} 11/07/2021 10:59:08 - INFO - __main__ - Step 97864: {'lr': 0.0001380987173103742, 'samples': 18789888, 'steps': 97863, 'loss/train': 1.798799991607666} 11/07/2021 10:59:08 - INFO - __main__ - Step 97865: {'lr': 0.00013809397187860255, 'samples': 18790080, 'steps': 97864, 'loss/train': 1.1477553844451904} 11/07/2021 10:59:09 - INFO - __main__ - Step 97866: {'lr': 0.00013808922649725396, 'samples': 18790272, 'steps': 97865, 'loss/train': 1.1030899286270142} 11/07/2021 10:59:09 - INFO - __main__ - Step 97867: {'lr': 0.00013808448116633064, 'samples': 18790464, 'steps': 97866, 'loss/train': 1.54781174659729} 11/07/2021 10:59:10 - INFO - __main__ - Step 97868: {'lr': 0.0001380797358858348, 'samples': 18790656, 'steps': 97867, 'loss/train': 0.5958777666091919} 11/07/2021 10:59:10 - INFO - __main__ - Step 97869: {'lr': 0.00013807499065576843, 'samples': 18790848, 'steps': 97868, 'loss/train': 1.3827437162399292} 11/07/2021 10:59:11 - INFO - __main__ - Step 97870: {'lr': 0.00013807024547613376, 'samples': 18791040, 'steps': 97869, 'loss/train': 1.420095443725586} 11/07/2021 10:59:11 - INFO - __main__ - Step 97871: {'lr': 0.0001380655003469329, 'samples': 18791232, 'steps': 97870, 'loss/train': 1.7174363136291504} 11/07/2021 10:59:11 - INFO - __main__ - Step 97872: {'lr': 0.00013806075526816815, 'samples': 18791424, 'steps': 97871, 'loss/train': 1.7303167581558228} 11/07/2021 10:59:12 - INFO - __main__ - Step 97873: {'lr': 0.00013805601023984132, 'samples': 18791616, 'steps': 97872, 'loss/train': 1.1169127225875854} 11/07/2021 10:59:13 - INFO - __main__ - Step 97874: {'lr': 0.00013805126526195477, 'samples': 18791808, 'steps': 97873, 'loss/train': 1.412811040878296} 11/07/2021 10:59:13 - INFO - __main__ - Step 97875: {'lr': 0.0001380465203345106, 'samples': 18792000, 'steps': 97874, 'loss/train': 0.7753411531448364} 11/07/2021 10:59:14 - INFO - __main__ - Step 97876: {'lr': 0.0001380417754575109, 'samples': 18792192, 'steps': 97875, 'loss/train': 1.2581636905670166} 11/07/2021 10:59:14 - INFO - __main__ - Step 97877: {'lr': 0.00013803703063095787, 'samples': 18792384, 'steps': 97876, 'loss/train': 0.8276164531707764} 11/07/2021 10:59:14 - INFO - __main__ - Step 97878: {'lr': 0.00013803228585485363, 'samples': 18792576, 'steps': 97877, 'loss/train': 1.1302471160888672} 11/07/2021 10:59:15 - INFO - __main__ - Step 97879: {'lr': 0.0001380275411292003, 'samples': 18792768, 'steps': 97878, 'loss/train': 1.2809308767318726} 11/07/2021 10:59:16 - INFO - __main__ - Step 97880: {'lr': 0.00013802279645400007, 'samples': 18792960, 'steps': 97879, 'loss/train': 1.409908413887024} 11/07/2021 10:59:16 - INFO - __main__ - Step 97881: {'lr': 0.000138018051829255, 'samples': 18793152, 'steps': 97880, 'loss/train': 1.3189129829406738} 11/07/2021 10:59:16 - INFO - __main__ - Step 97882: {'lr': 0.0001380133072549673, 'samples': 18793344, 'steps': 97881, 'loss/train': 1.226173996925354} 11/07/2021 10:59:17 - INFO - __main__ - Step 97883: {'lr': 0.00013800856273113915, 'samples': 18793536, 'steps': 97882, 'loss/train': 0.897771418094635} 11/07/2021 10:59:18 - INFO - __main__ - Step 97884: {'lr': 0.00013800381825777253, 'samples': 18793728, 'steps': 97883, 'loss/train': 1.6261558532714844} 11/07/2021 10:59:18 - INFO - __main__ - Step 97885: {'lr': 0.00013799907383486965, 'samples': 18793920, 'steps': 97884, 'loss/train': 0.7996679544448853} 11/07/2021 10:59:18 - INFO - __main__ - Step 97886: {'lr': 0.00013799432946243266, 'samples': 18794112, 'steps': 97885, 'loss/train': 1.4203550815582275} 11/07/2021 10:59:19 - INFO - __main__ - Step 97887: {'lr': 0.0001379895851404637, 'samples': 18794304, 'steps': 97886, 'loss/train': 1.858672857284546} 11/07/2021 10:59:19 - INFO - __main__ - Step 97888: {'lr': 0.0001379848408689649, 'samples': 18794496, 'steps': 97887, 'loss/train': 1.2493802309036255} 11/07/2021 10:59:20 - INFO - __main__ - Step 97889: {'lr': 0.0001379800966479384, 'samples': 18794688, 'steps': 97888, 'loss/train': 2.1303203105926514} 11/07/2021 10:59:21 - INFO - __main__ - Step 97890: {'lr': 0.00013797535247738634, 'samples': 18794880, 'steps': 97889, 'loss/train': 1.3971189260482788} 11/07/2021 10:59:21 - INFO - __main__ - Step 97891: {'lr': 0.00013797060835731088, 'samples': 18795072, 'steps': 97890, 'loss/train': 1.10506272315979} 11/07/2021 10:59:21 - INFO - __main__ - Step 97892: {'lr': 0.00013796586428771414, 'samples': 18795264, 'steps': 97891, 'loss/train': 1.051766276359558} 11/07/2021 10:59:22 - INFO - __main__ - Step 97893: {'lr': 0.0001379611202685982, 'samples': 18795456, 'steps': 97892, 'loss/train': 1.4958891868591309} 11/07/2021 10:59:23 - INFO - __main__ - Step 97894: {'lr': 0.00013795637629996526, 'samples': 18795648, 'steps': 97893, 'loss/train': 1.1478934288024902} 11/07/2021 10:59:23 - INFO - __main__ - Step 97895: {'lr': 0.0001379516323818175, 'samples': 18795840, 'steps': 97894, 'loss/train': 1.6249783039093018} 11/07/2021 10:59:23 - INFO - __main__ - Step 97896: {'lr': 0.00013794688851415706, 'samples': 18796032, 'steps': 97895, 'loss/train': 1.509881615638733} 11/07/2021 10:59:24 - INFO - __main__ - Step 97897: {'lr': 0.00013794214469698595, 'samples': 18796224, 'steps': 97896, 'loss/train': 1.7273247241973877} 11/07/2021 10:59:24 - INFO - __main__ - Step 97898: {'lr': 0.00013793740093030637, 'samples': 18796416, 'steps': 97897, 'loss/train': 1.3640038967132568} 11/07/2021 10:59:25 - INFO - __main__ - Step 97899: {'lr': 0.00013793265721412045, 'samples': 18796608, 'steps': 97898, 'loss/train': 1.2182294130325317} 11/07/2021 10:59:26 - INFO - __main__ - Step 97900: {'lr': 0.00013792791354843038, 'samples': 18796800, 'steps': 97899, 'loss/train': 1.0897995233535767} 11/07/2021 10:59:26 - INFO - __main__ - Step 97901: {'lr': 0.00013792316993323822, 'samples': 18796992, 'steps': 97900, 'loss/train': 1.3906240463256836} 11/07/2021 10:59:26 - INFO - __main__ - Step 97902: {'lr': 0.00013791842636854619, 'samples': 18797184, 'steps': 97901, 'loss/train': 1.6059024333953857} 11/07/2021 10:59:27 - INFO - __main__ - Step 97903: {'lr': 0.00013791368285435637, 'samples': 18797376, 'steps': 97902, 'loss/train': 1.1681106090545654} 11/07/2021 10:59:27 - INFO - __main__ - Step 97904: {'lr': 0.00013790893939067092, 'samples': 18797568, 'steps': 97903, 'loss/train': 2.099257707595825} 11/07/2021 10:59:28 - INFO - __main__ - Step 97905: {'lr': 0.000137904195977492, 'samples': 18797760, 'steps': 97904, 'loss/train': 0.9773300886154175} 11/07/2021 10:59:28 - INFO - __main__ - Step 97906: {'lr': 0.00013789945261482168, 'samples': 18797952, 'steps': 97905, 'loss/train': 1.238354206085205} 11/07/2021 10:59:29 - INFO - __main__ - Step 97907: {'lr': 0.00013789470930266213, 'samples': 18798144, 'steps': 97906, 'loss/train': 1.2257192134857178} 11/07/2021 10:59:29 - INFO - __main__ - Step 97908: {'lr': 0.0001378899660410155, 'samples': 18798336, 'steps': 97907, 'loss/train': 2.029127836227417} 11/07/2021 10:59:29 - INFO - __main__ - Step 97909: {'lr': 0.0001378852228298839, 'samples': 18798528, 'steps': 97908, 'loss/train': 1.2220932245254517} 11/07/2021 10:59:30 - INFO - __main__ - Step 97910: {'lr': 0.00013788047966926964, 'samples': 18798720, 'steps': 97909, 'loss/train': 1.4461597204208374} 11/07/2021 10:59:31 - INFO - __main__ - Step 97911: {'lr': 0.0001378757365591746, 'samples': 18798912, 'steps': 97910, 'loss/train': 1.3019038438796997} 11/07/2021 10:59:31 - INFO - __main__ - Step 97912: {'lr': 0.000137870993499601, 'samples': 18799104, 'steps': 97911, 'loss/train': 1.2164301872253418} 11/07/2021 10:59:32 - INFO - __main__ - Step 97913: {'lr': 0.00013786625049055102, 'samples': 18799296, 'steps': 97912, 'loss/train': 1.4358528852462769} 11/07/2021 10:59:32 - INFO - __main__ - Step 97914: {'lr': 0.00013786150753202674, 'samples': 18799488, 'steps': 97913, 'loss/train': 0.7736479640007019} 11/07/2021 10:59:33 - INFO - __main__ - Step 97915: {'lr': 0.00013785676462403038, 'samples': 18799680, 'steps': 97914, 'loss/train': 1.3547940254211426} 11/07/2021 10:59:33 - INFO - __main__ - Step 97916: {'lr': 0.00013785202176656402, 'samples': 18799872, 'steps': 97915, 'loss/train': 1.8016774654388428} 11/07/2021 10:59:34 - INFO - __main__ - Step 97917: {'lr': 0.00013784727895962978, 'samples': 18800064, 'steps': 97916, 'loss/train': 1.4731502532958984} 11/07/2021 10:59:34 - INFO - __main__ - Step 97918: {'lr': 0.00013784253620322985, 'samples': 18800256, 'steps': 97917, 'loss/train': 1.206916332244873} 11/07/2021 10:59:35 - INFO - __main__ - Step 97919: {'lr': 0.00013783779349736637, 'samples': 18800448, 'steps': 97918, 'loss/train': 1.5051246881484985} 11/07/2021 10:59:35 - INFO - __main__ - Step 97920: {'lr': 0.0001378330508420414, 'samples': 18800640, 'steps': 97919, 'loss/train': 1.1892300844192505} 11/07/2021 10:59:36 - INFO - __main__ - Step 97921: {'lr': 0.0001378283082372571, 'samples': 18800832, 'steps': 97920, 'loss/train': 1.0947426557540894} 11/07/2021 10:59:36 - INFO - __main__ - Step 97922: {'lr': 0.0001378235656830157, 'samples': 18801024, 'steps': 97921, 'loss/train': 1.094844937324524} 11/07/2021 10:59:37 - INFO - __main__ - Step 97923: {'lr': 0.0001378188231793194, 'samples': 18801216, 'steps': 97922, 'loss/train': 0.16133753955364227} 11/07/2021 10:59:37 - INFO - __main__ - Step 97924: {'lr': 0.00013781408072617002, 'samples': 18801408, 'steps': 97923, 'loss/train': 1.558539867401123} 11/07/2021 10:59:37 - INFO - __main__ - Step 97925: {'lr': 0.0001378093383235699, 'samples': 18801600, 'steps': 97924, 'loss/train': 1.392905831336975} 11/07/2021 10:59:38 - INFO - __main__ - Step 97926: {'lr': 0.00013780459597152118, 'samples': 18801792, 'steps': 97925, 'loss/train': 0.5555901527404785} 11/07/2021 10:59:39 - INFO - __main__ - Step 97927: {'lr': 0.00013779985367002597, 'samples': 18801984, 'steps': 97926, 'loss/train': 1.3495688438415527} 11/07/2021 10:59:39 - INFO - __main__ - Step 97928: {'lr': 0.00013779511141908643, 'samples': 18802176, 'steps': 97927, 'loss/train': 1.4086682796478271} 11/07/2021 10:59:39 - INFO - __main__ - Step 97929: {'lr': 0.0001377903692187047, 'samples': 18802368, 'steps': 97928, 'loss/train': 1.4663580656051636} 11/07/2021 10:59:40 - INFO - __main__ - Step 97930: {'lr': 0.00013778562706888287, 'samples': 18802560, 'steps': 97929, 'loss/train': 1.333853006362915} 11/07/2021 10:59:41 - INFO - __main__ - Step 97931: {'lr': 0.0001377808849696231, 'samples': 18802752, 'steps': 97930, 'loss/train': 1.1229866743087769} 11/07/2021 10:59:41 - INFO - __main__ - Step 97932: {'lr': 0.00013777614292092752, 'samples': 18802944, 'steps': 97931, 'loss/train': 1.5771446228027344} 11/07/2021 10:59:42 - INFO - __main__ - Step 97933: {'lr': 0.0001377714009227983, 'samples': 18803136, 'steps': 97932, 'loss/train': 1.1416761875152588} 11/07/2021 10:59:42 - INFO - __main__ - Step 97934: {'lr': 0.00013776665897523755, 'samples': 18803328, 'steps': 97933, 'loss/train': 1.4297106266021729} 11/07/2021 10:59:42 - INFO - __main__ - Step 97935: {'lr': 0.00013776191707824743, 'samples': 18803520, 'steps': 97934, 'loss/train': 1.3807815313339233} 11/07/2021 10:59:43 - INFO - __main__ - Step 97936: {'lr': 0.00013775717523183, 'samples': 18803712, 'steps': 97935, 'loss/train': 0.9880009293556213} 11/07/2021 10:59:44 - INFO - __main__ - Step 97937: {'lr': 0.00013775243343598761, 'samples': 18803904, 'steps': 97936, 'loss/train': 1.2115733623504639} 11/07/2021 10:59:44 - INFO - __main__ - Step 97938: {'lr': 0.00013774769169072216, 'samples': 18804096, 'steps': 97937, 'loss/train': 1.6954556703567505} 11/07/2021 10:59:44 - INFO - __main__ - Step 97939: {'lr': 0.00013774294999603583, 'samples': 18804288, 'steps': 97938, 'loss/train': 1.0860213041305542} 11/07/2021 10:59:45 - INFO - __main__ - Step 97940: {'lr': 0.0001377382083519308, 'samples': 18804480, 'steps': 97939, 'loss/train': 1.4621607065200806} 11/07/2021 10:59:45 - INFO - __main__ - Step 97941: {'lr': 0.0001377334667584092, 'samples': 18804672, 'steps': 97940, 'loss/train': 1.4304559230804443} 11/07/2021 10:59:46 - INFO - __main__ - Step 97942: {'lr': 0.00013772872521547314, 'samples': 18804864, 'steps': 97941, 'loss/train': 1.317905306816101} 11/07/2021 10:59:47 - INFO - __main__ - Step 97943: {'lr': 0.00013772398372312485, 'samples': 18805056, 'steps': 97942, 'loss/train': 0.5804967880249023} 11/07/2021 10:59:47 - INFO - __main__ - Step 97944: {'lr': 0.00013771924228136634, 'samples': 18805248, 'steps': 97943, 'loss/train': 1.2946875095367432} 11/07/2021 10:59:47 - INFO - __main__ - Step 97945: {'lr': 0.00013771450089019983, 'samples': 18805440, 'steps': 97944, 'loss/train': 0.048488449305295944} 11/07/2021 10:59:48 - INFO - __main__ - Step 97946: {'lr': 0.00013770975954962745, 'samples': 18805632, 'steps': 97945, 'loss/train': 1.9570931196212769} 11/07/2021 10:59:49 - INFO - __main__ - Step 97947: {'lr': 0.0001377050182596513, 'samples': 18805824, 'steps': 97946, 'loss/train': 1.2465076446533203} 11/07/2021 10:59:49 - INFO - __main__ - Step 97948: {'lr': 0.00013770027702027351, 'samples': 18806016, 'steps': 97947, 'loss/train': 1.2163889408111572} 11/07/2021 10:59:50 - INFO - __main__ - Step 97949: {'lr': 0.0001376955358314963, 'samples': 18806208, 'steps': 97948, 'loss/train': 1.7068240642547607} 11/07/2021 10:59:50 - INFO - __main__ - Step 97950: {'lr': 0.0001376907946933218, 'samples': 18806400, 'steps': 97949, 'loss/train': 1.552969217300415} 11/07/2021 10:59:50 - INFO - __main__ - Step 97951: {'lr': 0.000137686053605752, 'samples': 18806592, 'steps': 97950, 'loss/train': 1.2447131872177124} 11/07/2021 10:59:51 - INFO - __main__ - Step 97952: {'lr': 0.00013768131256878917, 'samples': 18806784, 'steps': 97951, 'loss/train': 1.3704659938812256} 11/07/2021 10:59:52 - INFO - __main__ - Step 97953: {'lr': 0.00013767657158243534, 'samples': 18806976, 'steps': 97952, 'loss/train': 1.2953479290008545} 11/07/2021 10:59:52 - INFO - __main__ - Step 97954: {'lr': 0.00013767183064669278, 'samples': 18807168, 'steps': 97953, 'loss/train': 1.1354866027832031} 11/07/2021 10:59:52 - INFO - __main__ - Step 97955: {'lr': 0.00013766708976156356, 'samples': 18807360, 'steps': 97954, 'loss/train': 1.138401985168457} 11/07/2021 10:59:53 - INFO - __main__ - Step 97956: {'lr': 0.00013766234892704975, 'samples': 18807552, 'steps': 97955, 'loss/train': 0.8968766331672668} 11/07/2021 10:59:54 - INFO - __main__ - Step 97957: {'lr': 0.0001376576081431536, 'samples': 18807744, 'steps': 97956, 'loss/train': 1.2034783363342285} 11/07/2021 10:59:54 - INFO - __main__ - Step 97958: {'lr': 0.0001376528674098772, 'samples': 18807936, 'steps': 97957, 'loss/train': 1.1831673383712769} 11/07/2021 10:59:54 - INFO - __main__ - Step 97959: {'lr': 0.0001376481267272227, 'samples': 18808128, 'steps': 97958, 'loss/train': 0.4660540223121643} 11/07/2021 10:59:55 - INFO - __main__ - Step 97960: {'lr': 0.00013764338609519218, 'samples': 18808320, 'steps': 97959, 'loss/train': 0.9728555083274841} 11/07/2021 10:59:55 - INFO - __main__ - Step 97961: {'lr': 0.00013763864551378786, 'samples': 18808512, 'steps': 97960, 'loss/train': 1.3093805313110352} 11/07/2021 10:59:56 - INFO - __main__ - Step 97962: {'lr': 0.00013763390498301178, 'samples': 18808704, 'steps': 97961, 'loss/train': 1.4483087062835693} 11/07/2021 10:59:56 - INFO - __main__ - Step 97963: {'lr': 0.00013762916450286617, 'samples': 18808896, 'steps': 97962, 'loss/train': 1.3094000816345215} 11/07/2021 10:59:57 - INFO - __main__ - Step 97964: {'lr': 0.00013762442407335318, 'samples': 18809088, 'steps': 97963, 'loss/train': 0.5466161966323853} 11/07/2021 10:59:57 - INFO - __main__ - Step 97965: {'lr': 0.00013761968369447483, 'samples': 18809280, 'steps': 97964, 'loss/train': 1.0292245149612427} 11/07/2021 10:59:58 - INFO - __main__ - Step 97966: {'lr': 0.00013761494336623332, 'samples': 18809472, 'steps': 97965, 'loss/train': 1.4472204446792603} 11/07/2021 10:59:59 - INFO - __main__ - Step 97967: {'lr': 0.00013761020308863077, 'samples': 18809664, 'steps': 97966, 'loss/train': 1.3624882698059082} 11/07/2021 10:59:59 - INFO - __main__ - Step 97968: {'lr': 0.0001376054628616693, 'samples': 18809856, 'steps': 97967, 'loss/train': 1.326590895652771} 11/07/2021 10:59:59 - INFO - __main__ - Step 97969: {'lr': 0.0001376007226853511, 'samples': 18810048, 'steps': 97968, 'loss/train': 1.1963027715682983} 11/07/2021 11:00:00 - INFO - __main__ - Step 97970: {'lr': 0.0001375959825596783, 'samples': 18810240, 'steps': 97969, 'loss/train': 1.1871880292892456} 11/07/2021 11:00:00 - INFO - __main__ - Step 97971: {'lr': 0.000137591242484653, 'samples': 18810432, 'steps': 97970, 'loss/train': 1.2261455059051514} 11/07/2021 11:00:00 - INFO - __main__ - Step 97972: {'lr': 0.00013758650246027733, 'samples': 18810624, 'steps': 97971, 'loss/train': 0.778890073299408} 11/07/2021 11:00:01 - INFO - __main__ - Step 97973: {'lr': 0.00013758176248655345, 'samples': 18810816, 'steps': 97972, 'loss/train': 1.298000454902649} 11/07/2021 11:00:02 - INFO - __main__ - Step 97974: {'lr': 0.00013757702256348353, 'samples': 18811008, 'steps': 97973, 'loss/train': 1.4401856660842896} 11/07/2021 11:00:02 - INFO - __main__ - Step 97975: {'lr': 0.00013757228269106964, 'samples': 18811200, 'steps': 97974, 'loss/train': 1.0659441947937012} 11/07/2021 11:00:02 - INFO - __main__ - Step 97976: {'lr': 0.00013756754286931393, 'samples': 18811392, 'steps': 97975, 'loss/train': 0.3663313090801239} 11/07/2021 11:00:03 - INFO - __main__ - Step 97977: {'lr': 0.00013756280309821869, 'samples': 18811584, 'steps': 97976, 'loss/train': 0.7057287693023682} 11/07/2021 11:00:04 - INFO - __main__ - Step 97978: {'lr': 0.00013755806337778582, 'samples': 18811776, 'steps': 97977, 'loss/train': 1.3955940008163452} 11/07/2021 11:00:04 - INFO - __main__ - Step 97979: {'lr': 0.0001375533237080175, 'samples': 18811968, 'steps': 97978, 'loss/train': 1.6705032587051392} 11/07/2021 11:00:04 - INFO - __main__ - Step 97980: {'lr': 0.00013754858408891596, 'samples': 18812160, 'steps': 97979, 'loss/train': 1.366942048072815} 11/07/2021 11:00:05 - INFO - __main__ - Step 97981: {'lr': 0.00013754384452048328, 'samples': 18812352, 'steps': 97980, 'loss/train': 1.3105031251907349} 11/07/2021 11:00:05 - INFO - __main__ - Step 97982: {'lr': 0.0001375391050027216, 'samples': 18812544, 'steps': 97981, 'loss/train': 1.379152536392212} 11/07/2021 11:00:06 - INFO - __main__ - Step 97983: {'lr': 0.0001375343655356331, 'samples': 18812736, 'steps': 97982, 'loss/train': 1.2457787990570068} 11/07/2021 11:00:06 - INFO - __main__ - Step 97984: {'lr': 0.00013752962611921982, 'samples': 18812928, 'steps': 97983, 'loss/train': 1.500241994857788} 11/07/2021 11:00:07 - INFO - __main__ - Step 97985: {'lr': 0.00013752488675348402, 'samples': 18813120, 'steps': 97984, 'loss/train': 1.0642542839050293} 11/07/2021 11:00:07 - INFO - __main__ - Step 97986: {'lr': 0.00013752014743842773, 'samples': 18813312, 'steps': 97985, 'loss/train': 1.2495195865631104} 11/07/2021 11:00:08 - INFO - __main__ - Step 97987: {'lr': 0.00013751540817405312, 'samples': 18813504, 'steps': 97986, 'loss/train': 0.974490225315094} 11/07/2021 11:00:09 - INFO - __main__ - Step 97988: {'lr': 0.00013751066896036234, 'samples': 18813696, 'steps': 97987, 'loss/train': 1.3094278573989868} 11/07/2021 11:00:09 - INFO - __main__ - Step 97989: {'lr': 0.00013750592979735752, 'samples': 18813888, 'steps': 97988, 'loss/train': 1.406957983970642} 11/07/2021 11:00:09 - INFO - __main__ - Step 97990: {'lr': 0.0001375011906850409, 'samples': 18814080, 'steps': 97989, 'loss/train': 1.0131348371505737} 11/07/2021 11:00:10 - INFO - __main__ - Step 97991: {'lr': 0.0001374964516234144, 'samples': 18814272, 'steps': 97990, 'loss/train': 1.2109510898590088} 11/07/2021 11:00:10 - INFO - __main__ - Step 97992: {'lr': 0.00013749171261248026, 'samples': 18814464, 'steps': 97991, 'loss/train': 1.4428791999816895} 11/07/2021 11:00:11 - INFO - __main__ - Step 97993: {'lr': 0.0001374869736522406, 'samples': 18814656, 'steps': 97992, 'loss/train': 1.2612000703811646} 11/07/2021 11:00:11 - INFO - __main__ - Step 97994: {'lr': 0.0001374822347426976, 'samples': 18814848, 'steps': 97993, 'loss/train': 1.475068211555481} 11/07/2021 11:00:12 - INFO - __main__ - Step 97995: {'lr': 0.00013747749588385335, 'samples': 18815040, 'steps': 97994, 'loss/train': 0.0453745573759079} 11/07/2021 11:00:12 - INFO - __main__ - Step 97996: {'lr': 0.00013747275707571, 'samples': 18815232, 'steps': 97995, 'loss/train': 0.9514252543449402} 11/07/2021 11:00:12 - INFO - __main__ - Step 97997: {'lr': 0.00013746801831826974, 'samples': 18815424, 'steps': 97996, 'loss/train': 1.3430659770965576} 11/07/2021 11:00:13 - INFO - __main__ - Step 97998: {'lr': 0.00013746327961153463, 'samples': 18815616, 'steps': 97997, 'loss/train': 1.4145783185958862} 11/07/2021 11:00:14 - INFO - __main__ - Step 97999: {'lr': 0.00013745854095550681, 'samples': 18815808, 'steps': 97998, 'loss/train': 0.6278250217437744} 11/07/2021 11:00:14 - INFO - __main__ - Step 98000: {'lr': 0.00013745380235018846, 'samples': 18816000, 'steps': 97999, 'loss/train': 1.3948689699172974} 11/07/2021 11:00:15 - INFO - __main__ - Step 98001: {'lr': 0.00013744906379558163, 'samples': 18816192, 'steps': 98000, 'loss/train': 1.585827350616455} 11/07/2021 11:00:15 - INFO - __main__ - Step 98002: {'lr': 0.0001374443252916886, 'samples': 18816384, 'steps': 98001, 'loss/train': 1.0892486572265625} 11/07/2021 11:00:16 - INFO - __main__ - Step 98003: {'lr': 0.00013743958683851138, 'samples': 18816576, 'steps': 98002, 'loss/train': 1.103209137916565} 11/07/2021 11:00:16 - INFO - __main__ - Step 98004: {'lr': 0.00013743484843605226, 'samples': 18816768, 'steps': 98003, 'loss/train': 1.362790584564209} 11/07/2021 11:00:17 - INFO - __main__ - Step 98005: {'lr': 0.0001374301100843131, 'samples': 18816960, 'steps': 98004, 'loss/train': 1.4693541526794434} 11/07/2021 11:00:17 - INFO - __main__ - Step 98006: {'lr': 0.00013742537178329628, 'samples': 18817152, 'steps': 98005, 'loss/train': 1.327939748764038} 11/07/2021 11:00:17 - INFO - __main__ - Step 98007: {'lr': 0.0001374206335330038, 'samples': 18817344, 'steps': 98006, 'loss/train': 1.9875253438949585} 11/07/2021 11:00:18 - INFO - __main__ - Step 98008: {'lr': 0.00013741589533343784, 'samples': 18817536, 'steps': 98007, 'loss/train': 2.1174371242523193} 11/07/2021 11:00:19 - INFO - __main__ - Step 98009: {'lr': 0.00013741115718460056, 'samples': 18817728, 'steps': 98008, 'loss/train': 1.740727424621582} 11/07/2021 11:00:19 - INFO - __main__ - Step 98010: {'lr': 0.0001374064190864941, 'samples': 18817920, 'steps': 98009, 'loss/train': 1.4012267589569092} 11/07/2021 11:00:20 - INFO - __main__ - Step 98011: {'lr': 0.00013740168103912055, 'samples': 18818112, 'steps': 98010, 'loss/train': 1.2585618495941162} 11/07/2021 11:00:20 - INFO - __main__ - Step 98012: {'lr': 0.00013739694304248202, 'samples': 18818304, 'steps': 98011, 'loss/train': 1.3120198249816895} 11/07/2021 11:00:20 - INFO - __main__ - Step 98013: {'lr': 0.00013739220509658074, 'samples': 18818496, 'steps': 98012, 'loss/train': 1.2285609245300293} 11/07/2021 11:00:22 - INFO - __main__ - Step 98014: {'lr': 0.0001373874672014188, 'samples': 18818688, 'steps': 98013, 'loss/train': 1.4460241794586182} 11/07/2021 11:00:22 - INFO - __main__ - Step 98015: {'lr': 0.0001373827293569983, 'samples': 18818880, 'steps': 98014, 'loss/train': 1.1810353994369507} 11/07/2021 11:00:22 - INFO - __main__ - Step 98016: {'lr': 0.00013737799156332144, 'samples': 18819072, 'steps': 98015, 'loss/train': 0.3856896758079529} 11/07/2021 11:00:23 - INFO - __main__ - Step 98017: {'lr': 0.00013737325382039037, 'samples': 18819264, 'steps': 98016, 'loss/train': 1.0032588243484497} 11/07/2021 11:00:23 - INFO - __main__ - Step 98018: {'lr': 0.0001373685161282071, 'samples': 18819456, 'steps': 98017, 'loss/train': 1.2812327146530151} 11/07/2021 11:00:24 - INFO - __main__ - Step 98019: {'lr': 0.00013736377848677384, 'samples': 18819648, 'steps': 98018, 'loss/train': 1.5073835849761963} 11/07/2021 11:00:24 - INFO - __main__ - Step 98020: {'lr': 0.00013735904089609273, 'samples': 18819840, 'steps': 98019, 'loss/train': 1.5010744333267212} 11/07/2021 11:00:25 - INFO - __main__ - Step 98021: {'lr': 0.00013735430335616588, 'samples': 18820032, 'steps': 98020, 'loss/train': 1.7061617374420166} 11/07/2021 11:00:25 - INFO - __main__ - Step 98022: {'lr': 0.00013734956586699542, 'samples': 18820224, 'steps': 98021, 'loss/train': 1.1746795177459717} 11/07/2021 11:00:25 - INFO - __main__ - Step 98023: {'lr': 0.00013734482842858356, 'samples': 18820416, 'steps': 98022, 'loss/train': 1.1624377965927124} 11/07/2021 11:00:26 - INFO - __main__ - Step 98024: {'lr': 0.00013734009104093237, 'samples': 18820608, 'steps': 98023, 'loss/train': 1.2508314847946167} 11/07/2021 11:00:27 - INFO - __main__ - Step 98025: {'lr': 0.00013733535370404399, 'samples': 18820800, 'steps': 98024, 'loss/train': 1.5049504041671753} 11/07/2021 11:00:27 - INFO - __main__ - Step 98026: {'lr': 0.00013733061641792055, 'samples': 18820992, 'steps': 98025, 'loss/train': 1.2732926607131958} 11/07/2021 11:00:27 - INFO - __main__ - Step 98027: {'lr': 0.0001373258791825642, 'samples': 18821184, 'steps': 98026, 'loss/train': 1.390015959739685} 11/07/2021 11:00:28 - INFO - __main__ - Step 98028: {'lr': 0.00013732114199797708, 'samples': 18821376, 'steps': 98027, 'loss/train': 1.3668346405029297} 11/07/2021 11:00:28 - INFO - __main__ - Step 98029: {'lr': 0.0001373164048641613, 'samples': 18821568, 'steps': 98028, 'loss/train': 1.2617733478546143} 11/07/2021 11:00:29 - INFO - __main__ - Step 98030: {'lr': 0.00013731166778111904, 'samples': 18821760, 'steps': 98029, 'loss/train': 1.4639861583709717} 11/07/2021 11:00:29 - INFO - __main__ - Step 98031: {'lr': 0.00013730693074885246, 'samples': 18821952, 'steps': 98030, 'loss/train': 1.2570405006408691} 11/07/2021 11:00:30 - INFO - __main__ - Step 98032: {'lr': 0.00013730219376736357, 'samples': 18822144, 'steps': 98031, 'loss/train': 1.3805707693099976} 11/07/2021 11:00:30 - INFO - __main__ - Step 98033: {'lr': 0.00013729745683665456, 'samples': 18822336, 'steps': 98032, 'loss/train': 1.003005862236023} 11/07/2021 11:00:31 - INFO - __main__ - Step 98034: {'lr': 0.0001372927199567276, 'samples': 18822528, 'steps': 98033, 'loss/train': 1.281646966934204} 11/07/2021 11:00:32 - INFO - __main__ - Step 98035: {'lr': 0.00013728798312758478, 'samples': 18822720, 'steps': 98034, 'loss/train': 1.3416993618011475} 11/07/2021 11:00:32 - INFO - __main__ - Step 98036: {'lr': 0.00013728324634922824, 'samples': 18822912, 'steps': 98035, 'loss/train': 0.9867538213729858} 11/07/2021 11:00:32 - INFO - __main__ - Step 98037: {'lr': 0.00013727850962166015, 'samples': 18823104, 'steps': 98036, 'loss/train': 1.102973222732544} 11/07/2021 11:00:33 - INFO - __main__ - Step 98038: {'lr': 0.00013727377294488262, 'samples': 18823296, 'steps': 98037, 'loss/train': 1.795297622680664} 11/07/2021 11:00:33 - INFO - __main__ - Step 98039: {'lr': 0.0001372690363188978, 'samples': 18823488, 'steps': 98038, 'loss/train': 1.5661981105804443} 11/07/2021 11:00:34 - INFO - __main__ - Step 98040: {'lr': 0.0001372642997437078, 'samples': 18823680, 'steps': 98039, 'loss/train': 1.5269871950149536} 11/07/2021 11:00:34 - INFO - __main__ - Step 98041: {'lr': 0.00013725956321931475, 'samples': 18823872, 'steps': 98040, 'loss/train': 0.9419553875923157} 11/07/2021 11:00:35 - INFO - __main__ - Step 98042: {'lr': 0.00013725482674572083, 'samples': 18824064, 'steps': 98041, 'loss/train': 1.4447282552719116} 11/07/2021 11:00:35 - INFO - __main__ - Step 98043: {'lr': 0.00013725009032292812, 'samples': 18824256, 'steps': 98042, 'loss/train': 1.317029356956482} 11/07/2021 11:00:35 - INFO - __main__ - Step 98044: {'lr': 0.0001372453539509389, 'samples': 18824448, 'steps': 98043, 'loss/train': 1.1180362701416016} 11/07/2021 11:00:36 - INFO - __main__ - Step 98045: {'lr': 0.0001372406176297551, 'samples': 18824640, 'steps': 98044, 'loss/train': 1.4829435348510742} 11/07/2021 11:00:37 - INFO - __main__ - Step 98046: {'lr': 0.00013723588135937888, 'samples': 18824832, 'steps': 98045, 'loss/train': 1.0876822471618652} 11/07/2021 11:00:37 - INFO - __main__ - Step 98047: {'lr': 0.0001372311451398125, 'samples': 18825024, 'steps': 98046, 'loss/train': 0.8481073975563049} 11/07/2021 11:00:38 - INFO - __main__ - Step 98048: {'lr': 0.00013722640897105798, 'samples': 18825216, 'steps': 98047, 'loss/train': 1.5964598655700684} 11/07/2021 11:00:38 - INFO - __main__ - Step 98049: {'lr': 0.0001372216728531175, 'samples': 18825408, 'steps': 98048, 'loss/train': 1.4253547191619873} 11/07/2021 11:00:39 - INFO - __main__ - Step 98050: {'lr': 0.00013721693678599324, 'samples': 18825600, 'steps': 98049, 'loss/train': 1.5845574140548706} 11/07/2021 11:00:39 - INFO - __main__ - Step 98051: {'lr': 0.00013721220076968723, 'samples': 18825792, 'steps': 98050, 'loss/train': 1.483945369720459} 11/07/2021 11:00:40 - INFO - __main__ - Step 98052: {'lr': 0.0001372074648042017, 'samples': 18825984, 'steps': 98051, 'loss/train': 1.3950508832931519} 11/07/2021 11:00:40 - INFO - __main__ - Step 98053: {'lr': 0.0001372027288895387, 'samples': 18826176, 'steps': 98052, 'loss/train': 1.1730071306228638} 11/07/2021 11:00:40 - INFO - __main__ - Step 98054: {'lr': 0.00013719799302570047, 'samples': 18826368, 'steps': 98053, 'loss/train': 1.6236075162887573} 11/07/2021 11:00:41 - INFO - __main__ - Step 98055: {'lr': 0.00013719325721268905, 'samples': 18826560, 'steps': 98054, 'loss/train': 1.3381351232528687} 11/07/2021 11:00:42 - INFO - __main__ - Step 98056: {'lr': 0.0001371885214505066, 'samples': 18826752, 'steps': 98055, 'loss/train': 1.5718590021133423} 11/07/2021 11:00:42 - INFO - __main__ - Step 98057: {'lr': 0.0001371837857391553, 'samples': 18826944, 'steps': 98056, 'loss/train': 1.413114309310913} 11/07/2021 11:00:42 - INFO - __main__ - Step 98058: {'lr': 0.00013717905007863728, 'samples': 18827136, 'steps': 98057, 'loss/train': 1.1471487283706665} 11/07/2021 11:00:43 - INFO - __main__ - Step 98059: {'lr': 0.00013717431446895462, 'samples': 18827328, 'steps': 98058, 'loss/train': 0.6427567005157471} 11/07/2021 11:00:44 - INFO - __main__ - Step 98060: {'lr': 0.0001371695789101094, 'samples': 18827520, 'steps': 98059, 'loss/train': 1.005519986152649} 11/07/2021 11:00:44 - INFO - __main__ - Step 98061: {'lr': 0.00013716484340210388, 'samples': 18827712, 'steps': 98060, 'loss/train': 0.7686497569084167} 11/07/2021 11:00:44 - INFO - __main__ - Step 98062: {'lr': 0.00013716010794494012, 'samples': 18827904, 'steps': 98061, 'loss/train': 0.8491660952568054} 11/07/2021 11:00:45 - INFO - __main__ - Step 98063: {'lr': 0.00013715537253862026, 'samples': 18828096, 'steps': 98062, 'loss/train': 1.3710412979125977} 11/07/2021 11:00:45 - INFO - __main__ - Step 98064: {'lr': 0.00013715063718314647, 'samples': 18828288, 'steps': 98063, 'loss/train': 0.9453704357147217} 11/07/2021 11:00:46 - INFO - __main__ - Step 98065: {'lr': 0.00013714590187852087, 'samples': 18828480, 'steps': 98064, 'loss/train': 1.0830597877502441} 11/07/2021 11:00:47 - INFO - __main__ - Step 98066: {'lr': 0.00013714116662474554, 'samples': 18828672, 'steps': 98065, 'loss/train': 1.2532203197479248} 11/07/2021 11:00:47 - INFO - __main__ - Step 98067: {'lr': 0.0001371364314218227, 'samples': 18828864, 'steps': 98066, 'loss/train': 1.2749899625778198} 11/07/2021 11:00:47 - INFO - __main__ - Step 98068: {'lr': 0.00013713169626975442, 'samples': 18829056, 'steps': 98067, 'loss/train': 1.2299675941467285} 11/07/2021 11:00:48 - INFO - __main__ - Step 98069: {'lr': 0.00013712696116854287, 'samples': 18829248, 'steps': 98068, 'loss/train': 1.1537275314331055} 11/07/2021 11:00:48 - INFO - __main__ - Step 98070: {'lr': 0.00013712222611819016, 'samples': 18829440, 'steps': 98069, 'loss/train': 0.6181566715240479} 11/07/2021 11:00:49 - INFO - __main__ - Step 98071: {'lr': 0.00013711749111869855, 'samples': 18829632, 'steps': 98070, 'loss/train': 1.5091743469238281} 11/07/2021 11:00:49 - INFO - __main__ - Step 98072: {'lr': 0.00013711275617006994, 'samples': 18829824, 'steps': 98071, 'loss/train': 1.483686089515686} 11/07/2021 11:00:50 - INFO - __main__ - Step 98073: {'lr': 0.0001371080212723066, 'samples': 18830016, 'steps': 98072, 'loss/train': 1.7835650444030762} 11/07/2021 11:00:50 - INFO - __main__ - Step 98074: {'lr': 0.00013710328642541062, 'samples': 18830208, 'steps': 98073, 'loss/train': 1.3729221820831299} 11/07/2021 11:00:50 - INFO - __main__ - Step 98075: {'lr': 0.00013709855162938417, 'samples': 18830400, 'steps': 98074, 'loss/train': 1.2692526578903198} 11/07/2021 11:00:52 - INFO - __main__ - Step 98076: {'lr': 0.00013709381688422934, 'samples': 18830592, 'steps': 98075, 'loss/train': 1.2369714975357056} 11/07/2021 11:00:52 - INFO - __main__ - Step 98077: {'lr': 0.00013708908218994833, 'samples': 18830784, 'steps': 98076, 'loss/train': 1.2256709337234497} 11/07/2021 11:00:53 - INFO - __main__ - Step 98078: {'lr': 0.00013708434754654324, 'samples': 18830976, 'steps': 98077, 'loss/train': 1.5363329648971558} 11/07/2021 11:00:53 - INFO - __main__ - Step 98079: {'lr': 0.00013707961295401618, 'samples': 18831168, 'steps': 98078, 'loss/train': 1.333873987197876} 11/07/2021 11:00:53 - INFO - __main__ - Step 98080: {'lr': 0.00013707487841236931, 'samples': 18831360, 'steps': 98079, 'loss/train': 1.258007526397705} 11/07/2021 11:00:54 - INFO - __main__ - Step 98081: {'lr': 0.00013707014392160476, 'samples': 18831552, 'steps': 98080, 'loss/train': 0.5257357954978943} 11/07/2021 11:00:55 - INFO - __main__ - Step 98082: {'lr': 0.00013706540948172467, 'samples': 18831744, 'steps': 98081, 'loss/train': 0.33341068029403687} 11/07/2021 11:00:55 - INFO - __main__ - Step 98083: {'lr': 0.0001370606750927311, 'samples': 18831936, 'steps': 98082, 'loss/train': 1.2138630151748657} 11/07/2021 11:00:55 - INFO - __main__ - Step 98084: {'lr': 0.00013705594075462635, 'samples': 18832128, 'steps': 98083, 'loss/train': 1.470373511314392} 11/07/2021 11:00:56 - INFO - __main__ - Step 98085: {'lr': 0.0001370512064674125, 'samples': 18832320, 'steps': 98084, 'loss/train': 0.04296347498893738} 11/07/2021 11:00:56 - INFO - __main__ - Step 98086: {'lr': 0.0001370464722310915, 'samples': 18832512, 'steps': 98085, 'loss/train': 1.0410486459732056} 11/07/2021 11:00:57 - INFO - __main__ - Step 98087: {'lr': 0.00013704173804566567, 'samples': 18832704, 'steps': 98086, 'loss/train': 1.1566439867019653} 11/07/2021 11:00:58 - INFO - __main__ - Step 98088: {'lr': 0.00013703700391113708, 'samples': 18832896, 'steps': 98087, 'loss/train': 1.121147632598877} 11/07/2021 11:00:58 - INFO - __main__ - Step 98089: {'lr': 0.00013703226982750784, 'samples': 18833088, 'steps': 98088, 'loss/train': 1.3291963338851929} 11/07/2021 11:00:58 - INFO - __main__ - Step 98090: {'lr': 0.00013702753579478017, 'samples': 18833280, 'steps': 98089, 'loss/train': 1.2017964124679565} 11/07/2021 11:00:59 - INFO - __main__ - Step 98091: {'lr': 0.0001370228018129561, 'samples': 18833472, 'steps': 98090, 'loss/train': 1.1315079927444458} 11/07/2021 11:01:00 - INFO - __main__ - Step 98092: {'lr': 0.00013701806788203786, 'samples': 18833664, 'steps': 98091, 'loss/train': 1.3548601865768433} 11/07/2021 11:01:00 - INFO - __main__ - Step 98093: {'lr': 0.0001370133340020275, 'samples': 18833856, 'steps': 98092, 'loss/train': 1.4411709308624268} 11/07/2021 11:01:00 - INFO - __main__ - Step 98094: {'lr': 0.00013700860017292716, 'samples': 18834048, 'steps': 98093, 'loss/train': 1.2220146656036377} 11/07/2021 11:01:01 - INFO - __main__ - Step 98095: {'lr': 0.00013700386639473906, 'samples': 18834240, 'steps': 98094, 'loss/train': 1.2954658269882202} 11/07/2021 11:01:01 - INFO - __main__ - Step 98096: {'lr': 0.0001369991326674652, 'samples': 18834432, 'steps': 98095, 'loss/train': 1.2489913702011108} 11/07/2021 11:01:02 - INFO - __main__ - Step 98097: {'lr': 0.00013699439899110799, 'samples': 18834624, 'steps': 98096, 'loss/train': 1.3401840925216675} 11/07/2021 11:01:02 - INFO - __main__ - Step 98098: {'lr': 0.0001369896653656692, 'samples': 18834816, 'steps': 98097, 'loss/train': 1.1635990142822266} 11/07/2021 11:01:03 - INFO - __main__ - Step 98099: {'lr': 0.00013698493179115112, 'samples': 18835008, 'steps': 98098, 'loss/train': 1.4056934118270874} 11/07/2021 11:01:03 - INFO - __main__ - Step 98100: {'lr': 0.0001369801982675559, 'samples': 18835200, 'steps': 98099, 'loss/train': 1.8061299324035645} 11/07/2021 11:01:03 - INFO - __main__ - Step 98101: {'lr': 0.00013697546479488564, 'samples': 18835392, 'steps': 98100, 'loss/train': 1.103545904159546} 11/07/2021 11:01:04 - INFO - __main__ - Step 98102: {'lr': 0.00013697073137314253, 'samples': 18835584, 'steps': 98101, 'loss/train': 1.3755519390106201} 11/07/2021 11:01:05 - INFO - __main__ - Step 98103: {'lr': 0.0001369659980023286, 'samples': 18835776, 'steps': 98102, 'loss/train': 1.2623051404953003} 11/07/2021 11:01:05 - INFO - __main__ - Step 98104: {'lr': 0.00013696126468244613, 'samples': 18835968, 'steps': 98103, 'loss/train': 1.3572046756744385} 11/07/2021 11:01:05 - INFO - __main__ - Step 98105: {'lr': 0.00013695653141349712, 'samples': 18836160, 'steps': 98104, 'loss/train': 1.1272742748260498} 11/07/2021 11:01:06 - INFO - __main__ - Step 98106: {'lr': 0.00013695179819548376, 'samples': 18836352, 'steps': 98105, 'loss/train': 1.5791044235229492} 11/07/2021 11:01:07 - INFO - __main__ - Step 98107: {'lr': 0.00013694706502840814, 'samples': 18836544, 'steps': 98106, 'loss/train': 1.105332374572754} 11/07/2021 11:01:07 - INFO - __main__ - Step 98108: {'lr': 0.00013694233191227257, 'samples': 18836736, 'steps': 98107, 'loss/train': 1.2527142763137817} 11/07/2021 11:01:07 - INFO - __main__ - Step 98109: {'lr': 0.00013693759884707895, 'samples': 18836928, 'steps': 98108, 'loss/train': 1.4682958126068115} 11/07/2021 11:01:08 - INFO - __main__ - Step 98110: {'lr': 0.0001369328658328295, 'samples': 18837120, 'steps': 98109, 'loss/train': 1.5581170320510864} 11/07/2021 11:01:08 - INFO - __main__ - Step 98111: {'lr': 0.00013692813286952634, 'samples': 18837312, 'steps': 98110, 'loss/train': 1.3063780069351196} 11/07/2021 11:01:09 - INFO - __main__ - Step 98112: {'lr': 0.00013692339995717163, 'samples': 18837504, 'steps': 98111, 'loss/train': 1.731896996498108} 11/07/2021 11:01:09 - INFO - __main__ - Step 98113: {'lr': 0.00013691866709576744, 'samples': 18837696, 'steps': 98112, 'loss/train': 1.0470411777496338} 11/07/2021 11:01:10 - INFO - __main__ - Step 98114: {'lr': 0.000136913934285316, 'samples': 18837888, 'steps': 98113, 'loss/train': 1.6575977802276611} 11/07/2021 11:01:10 - INFO - __main__ - Step 98115: {'lr': 0.0001369092015258194, 'samples': 18838080, 'steps': 98114, 'loss/train': 1.1341944932937622} 11/07/2021 11:01:10 - INFO - __main__ - Step 98116: {'lr': 0.00013690446881727976, 'samples': 18838272, 'steps': 98115, 'loss/train': 1.6487345695495605} 11/07/2021 11:01:12 - INFO - __main__ - Step 98117: {'lr': 0.00013689973615969923, 'samples': 18838464, 'steps': 98116, 'loss/train': 1.264796257019043} 11/07/2021 11:01:12 - INFO - __main__ - Step 98118: {'lr': 0.00013689500355307995, 'samples': 18838656, 'steps': 98117, 'loss/train': 1.5363763570785522} 11/07/2021 11:01:12 - INFO - __main__ - Step 98119: {'lr': 0.00013689027099742407, 'samples': 18838848, 'steps': 98118, 'loss/train': 1.0338398218154907} 11/07/2021 11:01:13 - INFO - __main__ - Step 98120: {'lr': 0.00013688553849273364, 'samples': 18839040, 'steps': 98119, 'loss/train': 1.423329472541809} 11/07/2021 11:01:13 - INFO - __main__ - Step 98121: {'lr': 0.00013688080603901082, 'samples': 18839232, 'steps': 98120, 'loss/train': 1.5390938520431519} 11/07/2021 11:01:14 - INFO - __main__ - Step 98122: {'lr': 0.00013687607363625779, 'samples': 18839424, 'steps': 98121, 'loss/train': 1.3432033061981201} 11/07/2021 11:01:14 - INFO - __main__ - Step 98123: {'lr': 0.00013687134128447664, 'samples': 18839616, 'steps': 98122, 'loss/train': 1.459742546081543} 11/07/2021 11:01:15 - INFO - __main__ - Step 98124: {'lr': 0.0001368666089836695, 'samples': 18839808, 'steps': 98123, 'loss/train': 1.4407471418380737} 11/07/2021 11:01:15 - INFO - __main__ - Step 98125: {'lr': 0.00013686187673383855, 'samples': 18840000, 'steps': 98124, 'loss/train': 1.127366065979004} 11/07/2021 11:01:16 - INFO - __main__ - Step 98126: {'lr': 0.0001368571445349859, 'samples': 18840192, 'steps': 98125, 'loss/train': 1.0226839780807495} 11/07/2021 11:01:16 - INFO - __main__ - Step 98127: {'lr': 0.00013685241238711366, 'samples': 18840384, 'steps': 98126, 'loss/train': 1.1314934492111206} 11/07/2021 11:01:17 - INFO - __main__ - Step 98128: {'lr': 0.00013684768029022392, 'samples': 18840576, 'steps': 98127, 'loss/train': 1.0784434080123901} 11/07/2021 11:01:17 - INFO - __main__ - Step 98129: {'lr': 0.00013684294824431895, 'samples': 18840768, 'steps': 98128, 'loss/train': 0.5114068984985352} 11/07/2021 11:01:18 - INFO - __main__ - Step 98130: {'lr': 0.00013683821624940087, 'samples': 18840960, 'steps': 98129, 'loss/train': 1.5649020671844482} 11/07/2021 11:01:18 - INFO - __main__ - Step 98131: {'lr': 0.00013683348430547164, 'samples': 18841152, 'steps': 98130, 'loss/train': 1.5982780456542969} 11/07/2021 11:01:18 - INFO - __main__ - Step 98132: {'lr': 0.0001368287524125335, 'samples': 18841344, 'steps': 98131, 'loss/train': 1.4651628732681274} 11/07/2021 11:01:19 - INFO - __main__ - Step 98133: {'lr': 0.00013682402057058857, 'samples': 18841536, 'steps': 98132, 'loss/train': 1.2851916551589966} 11/07/2021 11:01:20 - INFO - __main__ - Step 98134: {'lr': 0.000136819288779639, 'samples': 18841728, 'steps': 98133, 'loss/train': 0.7473536133766174} 11/07/2021 11:01:20 - INFO - __main__ - Step 98135: {'lr': 0.00013681455703968691, 'samples': 18841920, 'steps': 98134, 'loss/train': 0.8848841786384583} 11/07/2021 11:01:20 - INFO - __main__ - Step 98136: {'lr': 0.00013680982535073445, 'samples': 18842112, 'steps': 98135, 'loss/train': 1.1952072381973267} 11/07/2021 11:01:21 - INFO - __main__ - Step 98137: {'lr': 0.00013680509371278372, 'samples': 18842304, 'steps': 98136, 'loss/train': 1.2158563137054443} 11/07/2021 11:01:22 - INFO - __main__ - Step 98138: {'lr': 0.00013680036212583688, 'samples': 18842496, 'steps': 98137, 'loss/train': 1.1901265382766724} 11/07/2021 11:01:22 - INFO - __main__ - Step 98139: {'lr': 0.00013679563058989602, 'samples': 18842688, 'steps': 98138, 'loss/train': 1.2093582153320312} 11/07/2021 11:01:22 - INFO - __main__ - Step 98140: {'lr': 0.00013679089910496344, 'samples': 18842880, 'steps': 98139, 'loss/train': 1.1616300344467163} 11/07/2021 11:01:23 - INFO - __main__ - Step 98141: {'lr': 0.00013678616767104102, 'samples': 18843072, 'steps': 98140, 'loss/train': 1.717421531677246} 11/07/2021 11:01:23 - INFO - __main__ - Step 98142: {'lr': 0.000136781436288131, 'samples': 18843264, 'steps': 98141, 'loss/train': 1.2710694074630737} 11/07/2021 11:01:24 - INFO - __main__ - Step 98143: {'lr': 0.0001367767049562355, 'samples': 18843456, 'steps': 98142, 'loss/train': 1.1877682209014893} 11/07/2021 11:01:25 - INFO - __main__ - Step 98144: {'lr': 0.0001367719736753567, 'samples': 18843648, 'steps': 98143, 'loss/train': 1.835487723350525} 11/07/2021 11:01:25 - INFO - __main__ - Step 98145: {'lr': 0.00013676724244549672, 'samples': 18843840, 'steps': 98144, 'loss/train': 1.4056628942489624} 11/07/2021 11:01:25 - INFO - __main__ - Step 98146: {'lr': 0.0001367625112666576, 'samples': 18844032, 'steps': 98145, 'loss/train': 1.3432896137237549} 11/07/2021 11:01:26 - INFO - __main__ - Step 98147: {'lr': 0.0001367577801388416, 'samples': 18844224, 'steps': 98146, 'loss/train': 1.2417876720428467} 11/07/2021 11:01:27 - INFO - __main__ - Step 98148: {'lr': 0.0001367530490620508, 'samples': 18844416, 'steps': 98147, 'loss/train': 1.4339865446090698} 11/07/2021 11:01:27 - INFO - __main__ - Step 98149: {'lr': 0.0001367483180362873, 'samples': 18844608, 'steps': 98148, 'loss/train': 1.6044551134109497} 11/07/2021 11:01:27 - INFO - __main__ - Step 98150: {'lr': 0.00013674358706155328, 'samples': 18844800, 'steps': 98149, 'loss/train': 1.5370733737945557} 11/07/2021 11:01:28 - INFO - __main__ - Step 98151: {'lr': 0.00013673885613785087, 'samples': 18844992, 'steps': 98150, 'loss/train': 1.17228102684021} 11/07/2021 11:01:28 - INFO - __main__ - Step 98152: {'lr': 0.00013673412526518224, 'samples': 18845184, 'steps': 98151, 'loss/train': 1.2534254789352417} 11/07/2021 11:01:29 - INFO - __main__ - Step 98153: {'lr': 0.00013672939444354937, 'samples': 18845376, 'steps': 98152, 'loss/train': 1.8693023920059204} 11/07/2021 11:01:29 - INFO - __main__ - Step 98154: {'lr': 0.0001367246636729545, 'samples': 18845568, 'steps': 98153, 'loss/train': 1.6180143356323242} 11/07/2021 11:01:30 - INFO - __main__ - Step 98155: {'lr': 0.00013671993295339977, 'samples': 18845760, 'steps': 98154, 'loss/train': 1.2751206159591675} 11/07/2021 11:01:30 - INFO - __main__ - Step 98156: {'lr': 0.00013671520228488725, 'samples': 18845952, 'steps': 98155, 'loss/train': 1.4378468990325928} 11/07/2021 11:01:30 - INFO - __main__ - Step 98157: {'lr': 0.00013671047166741916, 'samples': 18846144, 'steps': 98156, 'loss/train': 1.8064110279083252} 11/07/2021 11:01:32 - INFO - __main__ - Step 98158: {'lr': 0.00013670574110099753, 'samples': 18846336, 'steps': 98157, 'loss/train': 1.4654172658920288} 11/07/2021 11:01:32 - INFO - __main__ - Step 98159: {'lr': 0.00013670101058562459, 'samples': 18846528, 'steps': 98158, 'loss/train': 1.3031961917877197} 11/07/2021 11:01:32 - INFO - __main__ - Step 98160: {'lr': 0.0001366962801213024, 'samples': 18846720, 'steps': 98159, 'loss/train': 0.8540191650390625} 11/07/2021 11:01:33 - INFO - __main__ - Step 98161: {'lr': 0.00013669154970803312, 'samples': 18846912, 'steps': 98160, 'loss/train': 1.2357975244522095} 11/07/2021 11:01:33 - INFO - __main__ - Step 98162: {'lr': 0.00013668681934581888, 'samples': 18847104, 'steps': 98161, 'loss/train': 1.3094475269317627} 11/07/2021 11:01:33 - INFO - __main__ - Step 98163: {'lr': 0.00013668208903466184, 'samples': 18847296, 'steps': 98162, 'loss/train': 1.6687726974487305} 11/07/2021 11:01:34 - INFO - __main__ - Step 98164: {'lr': 0.00013667735877456405, 'samples': 18847488, 'steps': 98163, 'loss/train': 1.1727123260498047} 11/07/2021 11:01:35 - INFO - __main__ - Step 98165: {'lr': 0.00013667262856552784, 'samples': 18847680, 'steps': 98164, 'loss/train': 1.5419600009918213} 11/07/2021 11:01:35 - INFO - __main__ - Step 98166: {'lr': 0.00013666789840755507, 'samples': 18847872, 'steps': 98165, 'loss/train': 1.1603021621704102} 11/07/2021 11:01:35 - INFO - __main__ - Step 98167: {'lr': 0.000136663168300648, 'samples': 18848064, 'steps': 98166, 'loss/train': 1.3493287563323975} 11/07/2021 11:01:36 - INFO - __main__ - Step 98168: {'lr': 0.00013665843824480877, 'samples': 18848256, 'steps': 98167, 'loss/train': 1.4083366394042969} 11/07/2021 11:01:37 - INFO - __main__ - Step 98169: {'lr': 0.00013665370824003949, 'samples': 18848448, 'steps': 98168, 'loss/train': 1.328662633895874} 11/07/2021 11:01:37 - INFO - __main__ - Step 98170: {'lr': 0.0001366489782863423, 'samples': 18848640, 'steps': 98169, 'loss/train': 1.5924129486083984} 11/07/2021 11:01:37 - INFO - __main__ - Step 98171: {'lr': 0.0001366442483837193, 'samples': 18848832, 'steps': 98170, 'loss/train': 1.7653379440307617} 11/07/2021 11:01:38 - INFO - __main__ - Step 98172: {'lr': 0.00013663951853217272, 'samples': 18849024, 'steps': 98171, 'loss/train': 1.2363669872283936} 11/07/2021 11:01:38 - INFO - __main__ - Step 98173: {'lr': 0.00013663478873170458, 'samples': 18849216, 'steps': 98172, 'loss/train': 1.6211578845977783} 11/07/2021 11:01:39 - INFO - __main__ - Step 98174: {'lr': 0.00013663005898231708, 'samples': 18849408, 'steps': 98173, 'loss/train': 1.2682156562805176} 11/07/2021 11:01:40 - INFO - __main__ - Step 98175: {'lr': 0.00013662532928401228, 'samples': 18849600, 'steps': 98174, 'loss/train': 1.2748891115188599} 11/07/2021 11:01:40 - INFO - __main__ - Step 98176: {'lr': 0.00013662059963679237, 'samples': 18849792, 'steps': 98175, 'loss/train': 1.2157633304595947} 11/07/2021 11:01:40 - INFO - __main__ - Step 98177: {'lr': 0.0001366158700406595, 'samples': 18849984, 'steps': 98176, 'loss/train': 1.4336516857147217} 11/07/2021 11:01:41 - INFO - __main__ - Step 98178: {'lr': 0.00013661114049561574, 'samples': 18850176, 'steps': 98177, 'loss/train': 1.3100422620773315} 11/07/2021 11:01:42 - INFO - __main__ - Step 98179: {'lr': 0.00013660641100166337, 'samples': 18850368, 'steps': 98178, 'loss/train': 0.9519297480583191} 11/07/2021 11:01:42 - INFO - __main__ - Step 98180: {'lr': 0.0001366016815588043, 'samples': 18850560, 'steps': 98179, 'loss/train': 0.7190936803817749} 11/07/2021 11:01:42 - INFO - __main__ - Step 98181: {'lr': 0.00013659695216704075, 'samples': 18850752, 'steps': 98180, 'loss/train': 1.4756046533584595} 11/07/2021 11:01:43 - INFO - __main__ - Step 98182: {'lr': 0.00013659222282637483, 'samples': 18850944, 'steps': 98181, 'loss/train': 1.57320237159729} 11/07/2021 11:01:43 - INFO - __main__ - Step 98183: {'lr': 0.00013658749353680878, 'samples': 18851136, 'steps': 98182, 'loss/train': 1.5064048767089844} 11/07/2021 11:01:44 - INFO - __main__ - Step 98184: {'lr': 0.00013658276429834459, 'samples': 18851328, 'steps': 98183, 'loss/train': 1.4113014936447144} 11/07/2021 11:01:44 - INFO - __main__ - Step 98185: {'lr': 0.00013657803511098448, 'samples': 18851520, 'steps': 98184, 'loss/train': 0.5150919556617737} 11/07/2021 11:01:45 - INFO - __main__ - Step 98186: {'lr': 0.0001365733059747306, 'samples': 18851712, 'steps': 98185, 'loss/train': 1.3303419351577759} 11/07/2021 11:01:45 - INFO - __main__ - Step 98187: {'lr': 0.00013656857688958498, 'samples': 18851904, 'steps': 98186, 'loss/train': 0.956798791885376} 11/07/2021 11:01:46 - INFO - __main__ - Step 98188: {'lr': 0.00013656384785554985, 'samples': 18852096, 'steps': 98187, 'loss/train': 0.6436994075775146} 11/07/2021 11:01:47 - INFO - __main__ - Step 98189: {'lr': 0.00013655911887262728, 'samples': 18852288, 'steps': 98188, 'loss/train': 1.7194836139678955} 11/07/2021 11:01:47 - INFO - __main__ - Step 98190: {'lr': 0.00013655438994081943, 'samples': 18852480, 'steps': 98189, 'loss/train': 1.3362051248550415} 11/07/2021 11:01:47 - INFO - __main__ - Step 98191: {'lr': 0.0001365496610601284, 'samples': 18852672, 'steps': 98190, 'loss/train': 0.9202824234962463} 11/07/2021 11:01:48 - INFO - __main__ - Step 98192: {'lr': 0.00013654493223055645, 'samples': 18852864, 'steps': 98191, 'loss/train': 1.0820772647857666} 11/07/2021 11:01:48 - INFO - __main__ - Step 98193: {'lr': 0.0001365402034521055, 'samples': 18853056, 'steps': 98192, 'loss/train': 0.820099413394928} 11/07/2021 11:01:49 - INFO - __main__ - Step 98194: {'lr': 0.0001365354747247778, 'samples': 18853248, 'steps': 98193, 'loss/train': 1.6653265953063965} 11/07/2021 11:01:49 - INFO - __main__ - Step 98195: {'lr': 0.00013653074604857542, 'samples': 18853440, 'steps': 98194, 'loss/train': 1.3924847841262817} 11/07/2021 11:01:50 - INFO - __main__ - Step 98196: {'lr': 0.00013652601742350056, 'samples': 18853632, 'steps': 98195, 'loss/train': 1.7576950788497925} 11/07/2021 11:01:50 - INFO - __main__ - Step 98197: {'lr': 0.00013652128884955537, 'samples': 18853824, 'steps': 98196, 'loss/train': 1.5002152919769287} 11/07/2021 11:01:50 - INFO - __main__ - Step 98198: {'lr': 0.0001365165603267419, 'samples': 18854016, 'steps': 98197, 'loss/train': 1.6739773750305176} 11/07/2021 11:01:52 - INFO - __main__ - Step 98199: {'lr': 0.0001365118318550623, 'samples': 18854208, 'steps': 98198, 'loss/train': 1.5208790302276611} 11/07/2021 11:01:52 - INFO - __main__ - Step 98200: {'lr': 0.00013650710343451872, 'samples': 18854400, 'steps': 98199, 'loss/train': 1.4415384531021118} 11/07/2021 11:01:52 - INFO - __main__ - Step 98201: {'lr': 0.00013650237506511331, 'samples': 18854592, 'steps': 98200, 'loss/train': 1.337920904159546} 11/07/2021 11:01:53 - INFO - __main__ - Step 98202: {'lr': 0.00013649764674684818, 'samples': 18854784, 'steps': 98201, 'loss/train': 1.3379019498825073} 11/07/2021 11:01:53 - INFO - __main__ - Step 98203: {'lr': 0.00013649291847972546, 'samples': 18854976, 'steps': 98202, 'loss/train': 1.155429720878601} 11/07/2021 11:01:53 - INFO - __main__ - Step 98204: {'lr': 0.00013648819026374726, 'samples': 18855168, 'steps': 98203, 'loss/train': 1.411332130432129} 11/07/2021 11:01:54 - INFO - __main__ - Step 98205: {'lr': 0.00013648346209891573, 'samples': 18855360, 'steps': 98204, 'loss/train': 1.4838238954544067} 11/07/2021 11:01:55 - INFO - __main__ - Step 98206: {'lr': 0.00013647873398523312, 'samples': 18855552, 'steps': 98205, 'loss/train': 1.4891375303268433} 11/07/2021 11:01:55 - INFO - __main__ - Step 98207: {'lr': 0.00013647400592270133, 'samples': 18855744, 'steps': 98206, 'loss/train': 1.3814823627471924} 11/07/2021 11:01:55 - INFO - __main__ - Step 98208: {'lr': 0.0001364692779113226, 'samples': 18855936, 'steps': 98207, 'loss/train': 1.6171714067459106} 11/07/2021 11:01:56 - INFO - __main__ - Step 98209: {'lr': 0.00013646454995109905, 'samples': 18856128, 'steps': 98208, 'loss/train': 0.8161880373954773} 11/07/2021 11:01:57 - INFO - __main__ - Step 98210: {'lr': 0.00013645982204203282, 'samples': 18856320, 'steps': 98209, 'loss/train': 1.1847299337387085} 11/07/2021 11:01:57 - INFO - __main__ - Step 98211: {'lr': 0.00013645509418412608, 'samples': 18856512, 'steps': 98210, 'loss/train': 0.727622926235199} 11/07/2021 11:01:58 - INFO - __main__ - Step 98212: {'lr': 0.0001364503663773809, 'samples': 18856704, 'steps': 98211, 'loss/train': 0.5876040458679199} 11/07/2021 11:01:58 - INFO - __main__ - Step 98213: {'lr': 0.00013644563862179942, 'samples': 18856896, 'steps': 98212, 'loss/train': 1.3514961004257202} 11/07/2021 11:01:58 - INFO - __main__ - Step 98214: {'lr': 0.0001364409109173838, 'samples': 18857088, 'steps': 98213, 'loss/train': 1.8937050104141235} 11/07/2021 11:02:00 - INFO - __main__ - Step 98215: {'lr': 0.00013643618326413616, 'samples': 18857280, 'steps': 98214, 'loss/train': 1.0097769498825073} 11/07/2021 11:02:00 - INFO - __main__ - Step 98216: {'lr': 0.0001364314556620586, 'samples': 18857472, 'steps': 98215, 'loss/train': 1.4009876251220703} 11/07/2021 11:02:00 - INFO - __main__ - Step 98217: {'lr': 0.00013642672811115328, 'samples': 18857664, 'steps': 98216, 'loss/train': 1.1297192573547363} 11/07/2021 11:02:01 - INFO - __main__ - Step 98218: {'lr': 0.00013642200061142235, 'samples': 18857856, 'steps': 98217, 'loss/train': 0.6479000449180603} 11/07/2021 11:02:01 - INFO - __main__ - Step 98219: {'lr': 0.00013641727316286798, 'samples': 18858048, 'steps': 98218, 'loss/train': 1.334229588508606} 11/07/2021 11:02:02 - INFO - __main__ - Step 98220: {'lr': 0.00013641254576549213, 'samples': 18858240, 'steps': 98219, 'loss/train': 0.6478684544563293} 11/07/2021 11:02:02 - INFO - __main__ - Step 98221: {'lr': 0.00013640781841929705, 'samples': 18858432, 'steps': 98220, 'loss/train': 1.3716034889221191} 11/07/2021 11:02:03 - INFO - __main__ - Step 98222: {'lr': 0.00013640309112428488, 'samples': 18858624, 'steps': 98221, 'loss/train': 1.2046267986297607} 11/07/2021 11:02:03 - INFO - __main__ - Step 98223: {'lr': 0.00013639836388045767, 'samples': 18858816, 'steps': 98222, 'loss/train': 1.258004903793335} 11/07/2021 11:02:03 - INFO - __main__ - Step 98224: {'lr': 0.00013639363668781765, 'samples': 18859008, 'steps': 98223, 'loss/train': 1.5853060483932495} 11/07/2021 11:02:05 - INFO - __main__ - Step 98225: {'lr': 0.0001363889095463669, 'samples': 18859200, 'steps': 98224, 'loss/train': 1.40349280834198} 11/07/2021 11:02:05 - INFO - __main__ - Step 98226: {'lr': 0.00013638418245610751, 'samples': 18859392, 'steps': 98225, 'loss/train': 1.373964548110962} 11/07/2021 11:02:05 - INFO - __main__ - Step 98227: {'lr': 0.00013637945541704173, 'samples': 18859584, 'steps': 98226, 'loss/train': 1.4245285987854004} 11/07/2021 11:02:06 - INFO - __main__ - Step 98228: {'lr': 0.00013637472842917153, 'samples': 18859776, 'steps': 98227, 'loss/train': 1.1207438707351685} 11/07/2021 11:02:06 - INFO - __main__ - Step 98229: {'lr': 0.00013637000149249918, 'samples': 18859968, 'steps': 98228, 'loss/train': 1.2738454341888428} 11/07/2021 11:02:06 - INFO - __main__ - Step 98230: {'lr': 0.00013636527460702673, 'samples': 18860160, 'steps': 98229, 'loss/train': 1.7585163116455078} 11/07/2021 11:02:07 - INFO - __main__ - Step 98231: {'lr': 0.00013636054777275636, 'samples': 18860352, 'steps': 98230, 'loss/train': 1.6093682050704956} 11/07/2021 11:02:08 - INFO - __main__ - Step 98232: {'lr': 0.00013635582098969024, 'samples': 18860544, 'steps': 98231, 'loss/train': 0.8761515021324158} 11/07/2021 11:02:08 - INFO - __main__ - Step 98233: {'lr': 0.00013635109425783035, 'samples': 18860736, 'steps': 98232, 'loss/train': 1.2762106657028198} 11/07/2021 11:02:08 - INFO - __main__ - Step 98234: {'lr': 0.0001363463675771789, 'samples': 18860928, 'steps': 98233, 'loss/train': 0.7235524654388428} 11/07/2021 11:02:09 - INFO - __main__ - Step 98235: {'lr': 0.00013634164094773805, 'samples': 18861120, 'steps': 98234, 'loss/train': 1.6394826173782349} 11/07/2021 11:02:10 - INFO - __main__ - Step 98236: {'lr': 0.00013633691436950985, 'samples': 18861312, 'steps': 98235, 'loss/train': 1.4443308115005493} 11/07/2021 11:02:10 - INFO - __main__ - Step 98237: {'lr': 0.00013633218784249652, 'samples': 18861504, 'steps': 98236, 'loss/train': 1.3200751543045044} 11/07/2021 11:02:11 - INFO - __main__ - Step 98238: {'lr': 0.00013632746136670016, 'samples': 18861696, 'steps': 98237, 'loss/train': 1.034289002418518} 11/07/2021 11:02:11 - INFO - __main__ - Step 98239: {'lr': 0.00013632273494212287, 'samples': 18861888, 'steps': 98238, 'loss/train': 0.9937560558319092} 11/07/2021 11:02:11 - INFO - __main__ - Step 98240: {'lr': 0.0001363180085687668, 'samples': 18862080, 'steps': 98239, 'loss/train': 1.1152644157409668} 11/07/2021 11:02:12 - INFO - __main__ - Step 98241: {'lr': 0.00013631328224663407, 'samples': 18862272, 'steps': 98240, 'loss/train': 1.4385548830032349} 11/07/2021 11:02:13 - INFO - __main__ - Step 98242: {'lr': 0.00013630855597572683, 'samples': 18862464, 'steps': 98241, 'loss/train': 1.338181972503662} 11/07/2021 11:02:13 - INFO - __main__ - Step 98243: {'lr': 0.0001363038297560472, 'samples': 18862656, 'steps': 98242, 'loss/train': 1.2778514623641968} 11/07/2021 11:02:13 - INFO - __main__ - Step 98244: {'lr': 0.00013629910358759734, 'samples': 18862848, 'steps': 98243, 'loss/train': 0.9097602367401123} 11/07/2021 11:02:14 - INFO - __main__ - Step 98245: {'lr': 0.00013629437747037933, 'samples': 18863040, 'steps': 98244, 'loss/train': 1.074336290359497} 11/07/2021 11:02:14 - INFO - __main__ - Step 98246: {'lr': 0.00013628965140439543, 'samples': 18863232, 'steps': 98245, 'loss/train': 1.2224403619766235} 11/07/2021 11:02:15 - INFO - __main__ - Step 98247: {'lr': 0.00013628492538964753, 'samples': 18863424, 'steps': 98246, 'loss/train': 2.048401117324829} 11/07/2021 11:02:15 - INFO - __main__ - Step 98248: {'lr': 0.0001362801994261379, 'samples': 18863616, 'steps': 98247, 'loss/train': 1.0882370471954346} 11/07/2021 11:02:16 - INFO - __main__ - Step 98249: {'lr': 0.00013627547351386865, 'samples': 18863808, 'steps': 98248, 'loss/train': 1.5052485466003418} 11/07/2021 11:02:16 - INFO - __main__ - Step 98250: {'lr': 0.00013627074765284192, 'samples': 18864000, 'steps': 98249, 'loss/train': 1.6777364015579224} 11/07/2021 11:02:16 - INFO - __main__ - Step 98251: {'lr': 0.00013626602184305987, 'samples': 18864192, 'steps': 98250, 'loss/train': 1.5419598817825317} 11/07/2021 11:02:18 - INFO - __main__ - Step 98252: {'lr': 0.00013626129608452454, 'samples': 18864384, 'steps': 98251, 'loss/train': 1.1501816511154175} 11/07/2021 11:02:18 - INFO - __main__ - Step 98253: {'lr': 0.00013625657037723816, 'samples': 18864576, 'steps': 98252, 'loss/train': 1.468721628189087} 11/07/2021 11:02:18 - INFO - __main__ - Step 98254: {'lr': 0.00013625184472120278, 'samples': 18864768, 'steps': 98253, 'loss/train': 1.4757745265960693} 11/07/2021 11:02:19 - INFO - __main__ - Step 98255: {'lr': 0.00013624711911642057, 'samples': 18864960, 'steps': 98254, 'loss/train': 1.1073095798492432} 11/07/2021 11:02:19 - INFO - __main__ - Step 98256: {'lr': 0.0001362423935628937, 'samples': 18865152, 'steps': 98255, 'loss/train': 1.228054404258728} 11/07/2021 11:02:21 - INFO - __main__ - Step 98257: {'lr': 0.0001362376680606242, 'samples': 18865344, 'steps': 98256, 'loss/train': 3.518496036529541} 11/07/2021 11:02:21 - INFO - __main__ - Step 98258: {'lr': 0.00013623294260961427, 'samples': 18865536, 'steps': 98257, 'loss/train': 0.9195997714996338} 11/07/2021 11:02:21 - INFO - __main__ - Step 98259: {'lr': 0.00013622821720986613, 'samples': 18865728, 'steps': 98258, 'loss/train': 1.6158796548843384} 11/07/2021 11:02:22 - INFO - __main__ - Step 98260: {'lr': 0.00013622349186138166, 'samples': 18865920, 'steps': 98259, 'loss/train': 0.1321917623281479} 11/07/2021 11:02:22 - INFO - __main__ - Step 98261: {'lr': 0.00013621876656416316, 'samples': 18866112, 'steps': 98260, 'loss/train': 1.4726394414901733} 11/07/2021 11:02:22 - INFO - __main__ - Step 98262: {'lr': 0.00013621404131821275, 'samples': 18866304, 'steps': 98261, 'loss/train': 0.9648634791374207} 11/07/2021 11:02:23 - INFO - __main__ - Step 98263: {'lr': 0.0001362093161235325, 'samples': 18866496, 'steps': 98262, 'loss/train': 1.3749955892562866} 11/07/2021 11:02:23 - INFO - __main__ - Step 98264: {'lr': 0.00013620459098012458, 'samples': 18866688, 'steps': 98263, 'loss/train': 1.3424737453460693} 11/07/2021 11:02:24 - INFO - __main__ - Step 98265: {'lr': 0.0001361998658879911, 'samples': 18866880, 'steps': 98264, 'loss/train': 1.2725343704223633} 11/07/2021 11:02:25 - INFO - __main__ - Step 98266: {'lr': 0.00013619514084713426, 'samples': 18867072, 'steps': 98265, 'loss/train': 1.3228312730789185} 11/07/2021 11:02:25 - INFO - __main__ - Step 98267: {'lr': 0.00013619041585755608, 'samples': 18867264, 'steps': 98266, 'loss/train': 1.4501442909240723} 11/07/2021 11:02:25 - INFO - __main__ - Step 98268: {'lr': 0.00013618569091925875, 'samples': 18867456, 'steps': 98267, 'loss/train': 1.1219112873077393} 11/07/2021 11:02:26 - INFO - __main__ - Step 98269: {'lr': 0.00013618096603224442, 'samples': 18867648, 'steps': 98268, 'loss/train': 1.311540961265564} 11/07/2021 11:02:27 - INFO - __main__ - Step 98270: {'lr': 0.00013617624119651516, 'samples': 18867840, 'steps': 98269, 'loss/train': 1.3641808032989502} 11/07/2021 11:02:27 - INFO - __main__ - Step 98271: {'lr': 0.00013617151641207316, 'samples': 18868032, 'steps': 98270, 'loss/train': 1.3044123649597168} 11/07/2021 11:02:27 - INFO - __main__ - Step 98272: {'lr': 0.00013616679167892048, 'samples': 18868224, 'steps': 98271, 'loss/train': 1.0545995235443115} 11/07/2021 11:02:28 - INFO - __main__ - Step 98273: {'lr': 0.00013616206699705943, 'samples': 18868416, 'steps': 98272, 'loss/train': 1.2607026100158691} 11/07/2021 11:02:28 - INFO - __main__ - Step 98274: {'lr': 0.00013615734236649188, 'samples': 18868608, 'steps': 98273, 'loss/train': 1.2605818510055542} 11/07/2021 11:02:29 - INFO - __main__ - Step 98275: {'lr': 0.00013615261778722007, 'samples': 18868800, 'steps': 98274, 'loss/train': 1.5185550451278687} 11/07/2021 11:02:29 - INFO - __main__ - Step 98276: {'lr': 0.00013614789325924615, 'samples': 18868992, 'steps': 98275, 'loss/train': 0.6767735481262207} 11/07/2021 11:02:30 - INFO - __main__ - Step 98277: {'lr': 0.0001361431687825722, 'samples': 18869184, 'steps': 98276, 'loss/train': 1.3042576313018799} 11/07/2021 11:02:30 - INFO - __main__ - Step 98278: {'lr': 0.0001361384443572004, 'samples': 18869376, 'steps': 98277, 'loss/train': 1.2759649753570557} 11/07/2021 11:02:31 - INFO - __main__ - Step 98279: {'lr': 0.00013613371998313285, 'samples': 18869568, 'steps': 98278, 'loss/train': 1.7559747695922852} 11/07/2021 11:02:32 - INFO - __main__ - Step 98280: {'lr': 0.0001361289956603717, 'samples': 18869760, 'steps': 98279, 'loss/train': 1.1619873046875} 11/07/2021 11:02:32 - INFO - __main__ - Step 98281: {'lr': 0.00013612427138891907, 'samples': 18869952, 'steps': 98280, 'loss/train': 1.2447764873504639} 11/07/2021 11:02:32 - INFO - __main__ - Step 98282: {'lr': 0.00013611954716877706, 'samples': 18870144, 'steps': 98281, 'loss/train': 1.2438536882400513} 11/07/2021 11:02:33 - INFO - __main__ - Step 98283: {'lr': 0.00013611482299994787, 'samples': 18870336, 'steps': 98282, 'loss/train': 1.684382677078247} 11/07/2021 11:02:33 - INFO - __main__ - Step 98284: {'lr': 0.00013611009888243354, 'samples': 18870528, 'steps': 98283, 'loss/train': 1.5858134031295776} 11/07/2021 11:02:34 - INFO - __main__ - Step 98285: {'lr': 0.00013610537481623626, 'samples': 18870720, 'steps': 98284, 'loss/train': 1.2503626346588135} 11/07/2021 11:02:34 - INFO - __main__ - Step 98286: {'lr': 0.00013610065080135825, 'samples': 18870912, 'steps': 98285, 'loss/train': 1.5916770696640015} 11/07/2021 11:02:35 - INFO - __main__ - Step 98287: {'lr': 0.00013609592683780142, 'samples': 18871104, 'steps': 98286, 'loss/train': 1.7432996034622192} 11/07/2021 11:02:35 - INFO - __main__ - Step 98288: {'lr': 0.000136091202925568, 'samples': 18871296, 'steps': 98287, 'loss/train': 1.2596049308776855} 11/07/2021 11:02:35 - INFO - __main__ - Step 98289: {'lr': 0.00013608647906466015, 'samples': 18871488, 'steps': 98288, 'loss/train': 1.214677095413208} 11/07/2021 11:02:36 - INFO - __main__ - Step 98290: {'lr': 0.00013608175525507994, 'samples': 18871680, 'steps': 98289, 'loss/train': 0.8529773950576782} 11/07/2021 11:02:37 - INFO - __main__ - Step 98291: {'lr': 0.00013607703149682955, 'samples': 18871872, 'steps': 98290, 'loss/train': 1.1300022602081299} 11/07/2021 11:02:37 - INFO - __main__ - Step 98292: {'lr': 0.0001360723077899111, 'samples': 18872064, 'steps': 98291, 'loss/train': 1.473792552947998} 11/07/2021 11:02:37 - INFO - __main__ - Step 98293: {'lr': 0.00013606758413432668, 'samples': 18872256, 'steps': 98292, 'loss/train': 0.9928355813026428} 11/07/2021 11:02:38 - INFO - __main__ - Step 98294: {'lr': 0.00013606286053007848, 'samples': 18872448, 'steps': 98293, 'loss/train': 0.8371829390525818} 11/07/2021 11:02:39 - INFO - __main__ - Step 98295: {'lr': 0.0001360581369771686, 'samples': 18872640, 'steps': 98294, 'loss/train': 1.355940818786621} 11/07/2021 11:02:39 - INFO - __main__ - Step 98296: {'lr': 0.00013605341347559916, 'samples': 18872832, 'steps': 98295, 'loss/train': 1.037961721420288} 11/07/2021 11:02:39 - INFO - __main__ - Step 98297: {'lr': 0.00013604869002537229, 'samples': 18873024, 'steps': 98296, 'loss/train': 1.2920479774475098} 11/07/2021 11:02:40 - INFO - __main__ - Step 98298: {'lr': 0.0001360439666264901, 'samples': 18873216, 'steps': 98297, 'loss/train': 1.2554845809936523} 11/07/2021 11:02:40 - INFO - __main__ - Step 98299: {'lr': 0.00013603924327895478, 'samples': 18873408, 'steps': 98298, 'loss/train': 1.37446129322052} 11/07/2021 11:02:41 - INFO - __main__ - Step 98300: {'lr': 0.0001360345199827685, 'samples': 18873600, 'steps': 98299, 'loss/train': 1.493644118309021} 11/07/2021 11:02:42 - INFO - __main__ - Step 98301: {'lr': 0.0001360297967379332, 'samples': 18873792, 'steps': 98300, 'loss/train': 1.070837140083313} 11/07/2021 11:02:42 - INFO - __main__ - Step 98302: {'lr': 0.00013602507354445111, 'samples': 18873984, 'steps': 98301, 'loss/train': 2.84089732170105} 11/07/2021 11:02:42 - INFO - __main__ - Step 98303: {'lr': 0.00013602035040232439, 'samples': 18874176, 'steps': 98302, 'loss/train': 1.425254464149475} 11/07/2021 11:02:43 - INFO - __main__ - Step 98304: {'lr': 0.00013601562731155512, 'samples': 18874368, 'steps': 98303, 'loss/train': 1.0423390865325928} 11/07/2021 11:02:43 - INFO - __main__ - Step 98305: {'lr': 0.00013601090427214547, 'samples': 18874560, 'steps': 98304, 'loss/train': 1.5289756059646606} 11/07/2021 11:02:44 - INFO - __main__ - Step 98306: {'lr': 0.0001360061812840975, 'samples': 18874752, 'steps': 98305, 'loss/train': 1.3939839601516724} 11/07/2021 11:02:44 - INFO - __main__ - Step 98307: {'lr': 0.00013600145834741342, 'samples': 18874944, 'steps': 98306, 'loss/train': 2.083261013031006} 11/07/2021 11:02:45 - INFO - __main__ - Step 98308: {'lr': 0.00013599673546209535, 'samples': 18875136, 'steps': 98307, 'loss/train': 1.2419558763504028} 11/07/2021 11:02:45 - INFO - __main__ - Step 98309: {'lr': 0.00013599201262814534, 'samples': 18875328, 'steps': 98308, 'loss/train': 1.2548686265945435} 11/07/2021 11:02:45 - INFO - __main__ - Step 98310: {'lr': 0.00013598728984556558, 'samples': 18875520, 'steps': 98309, 'loss/train': 1.5182770490646362} 11/07/2021 11:02:47 - INFO - __main__ - Step 98311: {'lr': 0.0001359825671143582, 'samples': 18875712, 'steps': 98310, 'loss/train': 2.02710223197937} 11/07/2021 11:02:47 - INFO - __main__ - Step 98312: {'lr': 0.00013597784443452533, 'samples': 18875904, 'steps': 98311, 'loss/train': 1.4744961261749268} 11/07/2021 11:02:48 - INFO - __main__ - Step 98313: {'lr': 0.00013597312180606917, 'samples': 18876096, 'steps': 98312, 'loss/train': 1.599023699760437} 11/07/2021 11:02:48 - INFO - __main__ - Step 98314: {'lr': 0.00013596839922899165, 'samples': 18876288, 'steps': 98313, 'loss/train': 0.7850956320762634} 11/07/2021 11:02:48 - INFO - __main__ - Step 98315: {'lr': 0.000135963676703295, 'samples': 18876480, 'steps': 98314, 'loss/train': 2.1155142784118652} 11/07/2021 11:02:49 - INFO - __main__ - Step 98316: {'lr': 0.0001359589542289814, 'samples': 18876672, 'steps': 98315, 'loss/train': 1.8481587171554565} 11/07/2021 11:02:50 - INFO - __main__ - Step 98317: {'lr': 0.00013595423180605293, 'samples': 18876864, 'steps': 98316, 'loss/train': 0.6052528619766235} 11/07/2021 11:02:50 - INFO - __main__ - Step 98318: {'lr': 0.0001359495094345117, 'samples': 18877056, 'steps': 98317, 'loss/train': 1.5446165800094604} 11/07/2021 11:02:51 - INFO - __main__ - Step 98319: {'lr': 0.00013594478711435987, 'samples': 18877248, 'steps': 98318, 'loss/train': 1.3887382745742798} 11/07/2021 11:02:51 - INFO - __main__ - Step 98320: {'lr': 0.00013594006484559957, 'samples': 18877440, 'steps': 98319, 'loss/train': 1.373978614807129} 11/07/2021 11:02:51 - INFO - __main__ - Step 98321: {'lr': 0.00013593534262823287, 'samples': 18877632, 'steps': 98320, 'loss/train': 1.2018088102340698} 11/07/2021 11:02:52 - INFO - __main__ - Step 98322: {'lr': 0.000135930620462262, 'samples': 18877824, 'steps': 98321, 'loss/train': 1.8976179361343384} 11/07/2021 11:02:53 - INFO - __main__ - Step 98323: {'lr': 0.000135925898347689, 'samples': 18878016, 'steps': 98322, 'loss/train': 1.7614325284957886} 11/07/2021 11:02:53 - INFO - __main__ - Step 98324: {'lr': 0.00013592117628451607, 'samples': 18878208, 'steps': 98323, 'loss/train': 1.3271616697311401} 11/07/2021 11:02:54 - INFO - __main__ - Step 98325: {'lr': 0.00013591645427274524, 'samples': 18878400, 'steps': 98324, 'loss/train': 1.4071228504180908} 11/07/2021 11:02:54 - INFO - __main__ - Step 98326: {'lr': 0.00013591173231237874, 'samples': 18878592, 'steps': 98325, 'loss/train': 1.3830368518829346} 11/07/2021 11:02:54 - INFO - __main__ - Step 98327: {'lr': 0.00013590701040341874, 'samples': 18878784, 'steps': 98326, 'loss/train': 1.5571703910827637} 11/07/2021 11:02:55 - INFO - __main__ - Step 98328: {'lr': 0.00013590228854586716, 'samples': 18878976, 'steps': 98327, 'loss/train': 1.3600624799728394} 11/07/2021 11:02:56 - INFO - __main__ - Step 98329: {'lr': 0.00013589756673972628, 'samples': 18879168, 'steps': 98328, 'loss/train': 1.3532968759536743} 11/07/2021 11:02:56 - INFO - __main__ - Step 98330: {'lr': 0.00013589284498499818, 'samples': 18879360, 'steps': 98329, 'loss/train': 1.6616936922073364} 11/07/2021 11:02:56 - INFO - __main__ - Step 98331: {'lr': 0.000135888123281685, 'samples': 18879552, 'steps': 98330, 'loss/train': 1.6683658361434937} 11/07/2021 11:02:57 - INFO - __main__ - Step 98332: {'lr': 0.0001358834016297889, 'samples': 18879744, 'steps': 98331, 'loss/train': 1.3869045972824097} 11/07/2021 11:02:58 - INFO - __main__ - Step 98333: {'lr': 0.00013587868002931192, 'samples': 18879936, 'steps': 98332, 'loss/train': 1.001996636390686} 11/07/2021 11:02:58 - INFO - __main__ - Step 98334: {'lr': 0.0001358739584802563, 'samples': 18880128, 'steps': 98333, 'loss/train': 1.3335449695587158} 11/07/2021 11:02:58 - INFO - __main__ - Step 98335: {'lr': 0.0001358692369826241, 'samples': 18880320, 'steps': 98334, 'loss/train': 1.4720847606658936} 11/07/2021 11:02:59 - INFO - __main__ - Step 98336: {'lr': 0.00013586451553641743, 'samples': 18880512, 'steps': 98335, 'loss/train': 1.3318308591842651} 11/07/2021 11:02:59 - INFO - __main__ - Step 98337: {'lr': 0.00013585979414163846, 'samples': 18880704, 'steps': 98336, 'loss/train': 1.500773549079895} 11/07/2021 11:03:00 - INFO - __main__ - Step 98338: {'lr': 0.00013585507279828933, 'samples': 18880896, 'steps': 98337, 'loss/train': 1.214516520500183} 11/07/2021 11:03:01 - INFO - __main__ - Step 98339: {'lr': 0.00013585035150637215, 'samples': 18881088, 'steps': 98338, 'loss/train': 1.1015985012054443} 11/07/2021 11:03:01 - INFO - __main__ - Step 98340: {'lr': 0.0001358456302658891, 'samples': 18881280, 'steps': 98339, 'loss/train': 1.7815322875976562} 11/07/2021 11:03:01 - INFO - __main__ - Step 98341: {'lr': 0.00013584090907684215, 'samples': 18881472, 'steps': 98340, 'loss/train': 1.174264669418335} 11/07/2021 11:03:02 - INFO - __main__ - Step 98342: {'lr': 0.00013583618793923358, 'samples': 18881664, 'steps': 98341, 'loss/train': 1.5911731719970703} 11/07/2021 11:03:03 - INFO - __main__ - Step 98343: {'lr': 0.00013583146685306542, 'samples': 18881856, 'steps': 98342, 'loss/train': 1.3426573276519775} 11/07/2021 11:03:03 - INFO - __main__ - Step 98344: {'lr': 0.0001358267458183398, 'samples': 18882048, 'steps': 98343, 'loss/train': 1.5939886569976807} 11/07/2021 11:03:03 - INFO - __main__ - Step 98345: {'lr': 0.00013582202483505896, 'samples': 18882240, 'steps': 98344, 'loss/train': 1.5695134401321411} 11/07/2021 11:03:04 - INFO - __main__ - Step 98346: {'lr': 0.00013581730390322495, 'samples': 18882432, 'steps': 98345, 'loss/train': 1.0651081800460815} 11/07/2021 11:03:04 - INFO - __main__ - Step 98347: {'lr': 0.00013581258302283985, 'samples': 18882624, 'steps': 98346, 'loss/train': 1.6244902610778809} 11/07/2021 11:03:04 - INFO - __main__ - Step 98348: {'lr': 0.00013580786219390587, 'samples': 18882816, 'steps': 98347, 'loss/train': 1.8999954462051392} 11/07/2021 11:03:05 - INFO - __main__ - Step 98349: {'lr': 0.00013580314141642508, 'samples': 18883008, 'steps': 98348, 'loss/train': 1.4261064529418945} 11/07/2021 11:03:06 - INFO - __main__ - Step 98350: {'lr': 0.00013579842069039966, 'samples': 18883200, 'steps': 98349, 'loss/train': 1.0020737648010254} 11/07/2021 11:03:06 - INFO - __main__ - Step 98351: {'lr': 0.0001357937000158317, 'samples': 18883392, 'steps': 98350, 'loss/train': 1.4614806175231934} 11/07/2021 11:03:06 - INFO - __main__ - Step 98352: {'lr': 0.00013578897939272333, 'samples': 18883584, 'steps': 98351, 'loss/train': 1.4418045282363892} 11/07/2021 11:03:07 - INFO - __main__ - Step 98353: {'lr': 0.0001357842588210768, 'samples': 18883776, 'steps': 98352, 'loss/train': 0.833089292049408} 11/07/2021 11:03:08 - INFO - __main__ - Step 98354: {'lr': 0.000135779538300894, 'samples': 18883968, 'steps': 98353, 'loss/train': 1.305370807647705} 11/07/2021 11:03:08 - INFO - __main__ - Step 98355: {'lr': 0.00013577481783217722, 'samples': 18884160, 'steps': 98354, 'loss/train': 1.4666707515716553} 11/07/2021 11:03:08 - INFO - __main__ - Step 98356: {'lr': 0.00013577009741492848, 'samples': 18884352, 'steps': 98355, 'loss/train': 1.2275768518447876} 11/07/2021 11:03:09 - INFO - __main__ - Step 98357: {'lr': 0.00013576537704915003, 'samples': 18884544, 'steps': 98356, 'loss/train': 1.2418649196624756} 11/07/2021 11:03:09 - INFO - __main__ - Step 98358: {'lr': 0.0001357606567348439, 'samples': 18884736, 'steps': 98357, 'loss/train': 0.7887827754020691} 11/07/2021 11:03:10 - INFO - __main__ - Step 98359: {'lr': 0.00013575593647201226, 'samples': 18884928, 'steps': 98358, 'loss/train': 1.146718144416809} 11/07/2021 11:03:11 - INFO - __main__ - Step 98360: {'lr': 0.00013575121626065723, 'samples': 18885120, 'steps': 98359, 'loss/train': 1.5055015087127686} 11/07/2021 11:03:11 - INFO - __main__ - Step 98361: {'lr': 0.00013574649610078096, 'samples': 18885312, 'steps': 98360, 'loss/train': 1.3043992519378662} 11/07/2021 11:03:11 - INFO - __main__ - Step 98362: {'lr': 0.00013574177599238554, 'samples': 18885504, 'steps': 98361, 'loss/train': 1.6219667196273804} 11/07/2021 11:03:12 - INFO - __main__ - Step 98363: {'lr': 0.00013573705593547314, 'samples': 18885696, 'steps': 98362, 'loss/train': 1.4256881475448608} 11/07/2021 11:03:12 - INFO - __main__ - Step 98364: {'lr': 0.0001357323359300458, 'samples': 18885888, 'steps': 98363, 'loss/train': 0.8361854553222656} 11/07/2021 11:03:13 - INFO - __main__ - Step 98365: {'lr': 0.00013572761597610577, 'samples': 18886080, 'steps': 98364, 'loss/train': 5.212090015411377} 11/07/2021 11:03:13 - INFO - __main__ - Step 98366: {'lr': 0.00013572289607365518, 'samples': 18886272, 'steps': 98365, 'loss/train': 1.4121781587600708} 11/07/2021 11:03:14 - INFO - __main__ - Step 98367: {'lr': 0.000135718176222696, 'samples': 18886464, 'steps': 98366, 'loss/train': 1.1155071258544922} 11/07/2021 11:03:14 - INFO - __main__ - Step 98368: {'lr': 0.00013571345642323043, 'samples': 18886656, 'steps': 98367, 'loss/train': 1.230709433555603} 11/07/2021 11:03:14 - INFO - __main__ - Step 98369: {'lr': 0.00013570873667526062, 'samples': 18886848, 'steps': 98368, 'loss/train': 1.3635631799697876} 11/07/2021 11:03:15 - INFO - __main__ - Step 98370: {'lr': 0.0001357040169787887, 'samples': 18887040, 'steps': 98369, 'loss/train': 2.030616283416748} 11/07/2021 11:03:16 - INFO - __main__ - Step 98371: {'lr': 0.00013569929733381678, 'samples': 18887232, 'steps': 98370, 'loss/train': 1.458173394203186} 11/07/2021 11:03:16 - INFO - __main__ - Step 98372: {'lr': 0.000135694577740347, 'samples': 18887424, 'steps': 98371, 'loss/train': 0.4790641963481903} 11/07/2021 11:03:16 - INFO - __main__ - Step 98373: {'lr': 0.00013568985819838148, 'samples': 18887616, 'steps': 98372, 'loss/train': 0.8819230198860168} 11/07/2021 11:03:17 - INFO - __main__ - Step 98374: {'lr': 0.00013568513870792232, 'samples': 18887808, 'steps': 98373, 'loss/train': 1.5821757316589355} 11/07/2021 11:03:18 - INFO - __main__ - Step 98375: {'lr': 0.00013568041926897168, 'samples': 18888000, 'steps': 98374, 'loss/train': 1.3714760541915894} 11/07/2021 11:03:18 - INFO - __main__ - Step 98376: {'lr': 0.00013567569988153172, 'samples': 18888192, 'steps': 98375, 'loss/train': 1.909122109413147} 11/07/2021 11:03:19 - INFO - __main__ - Step 98377: {'lr': 0.00013567098054560457, 'samples': 18888384, 'steps': 98376, 'loss/train': 1.325333833694458} 11/07/2021 11:03:19 - INFO - __main__ - Step 98378: {'lr': 0.00013566626126119226, 'samples': 18888576, 'steps': 98377, 'loss/train': 1.6655287742614746} 11/07/2021 11:03:19 - INFO - __main__ - Step 98379: {'lr': 0.00013566154202829695, 'samples': 18888768, 'steps': 98378, 'loss/train': 1.6103006601333618} 11/07/2021 11:03:20 - INFO - __main__ - Step 98380: {'lr': 0.00013565682284692076, 'samples': 18888960, 'steps': 98379, 'loss/train': 1.3262858390808105} 11/07/2021 11:03:21 - INFO - __main__ - Step 98381: {'lr': 0.00013565210371706588, 'samples': 18889152, 'steps': 98380, 'loss/train': 1.3674496412277222} 11/07/2021 11:03:21 - INFO - __main__ - Step 98382: {'lr': 0.00013564738463873438, 'samples': 18889344, 'steps': 98381, 'loss/train': 1.4288396835327148} 11/07/2021 11:03:21 - INFO - __main__ - Step 98383: {'lr': 0.0001356426656119284, 'samples': 18889536, 'steps': 98382, 'loss/train': 0.3656274378299713} 11/07/2021 11:03:22 - INFO - __main__ - Step 98384: {'lr': 0.00013563794663665007, 'samples': 18889728, 'steps': 98383, 'loss/train': 1.1525236368179321} 11/07/2021 11:03:22 - INFO - __main__ - Step 98385: {'lr': 0.0001356332277129015, 'samples': 18889920, 'steps': 98384, 'loss/train': 1.157060980796814} 11/07/2021 11:03:23 - INFO - __main__ - Step 98386: {'lr': 0.00013562850884068486, 'samples': 18890112, 'steps': 98385, 'loss/train': 1.7386648654937744} 11/07/2021 11:03:23 - INFO - __main__ - Step 98387: {'lr': 0.00013562379002000235, 'samples': 18890304, 'steps': 98386, 'loss/train': 1.4637274742126465} 11/07/2021 11:03:24 - INFO - __main__ - Step 98388: {'lr': 0.00013561907125085587, 'samples': 18890496, 'steps': 98387, 'loss/train': 0.7730493545532227} 11/07/2021 11:03:24 - INFO - __main__ - Step 98389: {'lr': 0.00013561435253324773, 'samples': 18890688, 'steps': 98388, 'loss/train': 1.2755545377731323} 11/07/2021 11:03:24 - INFO - __main__ - Step 98390: {'lr': 0.00013560963386717996, 'samples': 18890880, 'steps': 98389, 'loss/train': 1.517714500427246} 11/07/2021 11:03:25 - INFO - __main__ - Step 98391: {'lr': 0.00013560491525265467, 'samples': 18891072, 'steps': 98390, 'loss/train': 1.2441823482513428} 11/07/2021 11:03:26 - INFO - __main__ - Step 98392: {'lr': 0.0001356001966896741, 'samples': 18891264, 'steps': 98391, 'loss/train': 1.550074577331543} 11/07/2021 11:03:26 - INFO - __main__ - Step 98393: {'lr': 0.0001355954781782403, 'samples': 18891456, 'steps': 98392, 'loss/train': 1.2642529010772705} 11/07/2021 11:03:27 - INFO - __main__ - Step 98394: {'lr': 0.00013559075971835544, 'samples': 18891648, 'steps': 98393, 'loss/train': 0.8906887769699097} 11/07/2021 11:03:27 - INFO - __main__ - Step 98395: {'lr': 0.0001355860413100216, 'samples': 18891840, 'steps': 98394, 'loss/train': 1.0988144874572754} 11/07/2021 11:03:28 - INFO - __main__ - Step 98396: {'lr': 0.0001355813229532409, 'samples': 18892032, 'steps': 98395, 'loss/train': 1.5470179319381714} 11/07/2021 11:03:28 - INFO - __main__ - Step 98397: {'lr': 0.0001355766046480155, 'samples': 18892224, 'steps': 98396, 'loss/train': 1.0371147394180298} 11/07/2021 11:03:29 - INFO - __main__ - Step 98398: {'lr': 0.00013557188639434764, 'samples': 18892416, 'steps': 98397, 'loss/train': 1.7130364179611206} 11/07/2021 11:03:29 - INFO - __main__ - Step 98399: {'lr': 0.00013556716819223923, 'samples': 18892608, 'steps': 98398, 'loss/train': 1.3864307403564453} 11/07/2021 11:03:29 - INFO - __main__ - Step 98400: {'lr': 0.00013556245004169246, 'samples': 18892800, 'steps': 98399, 'loss/train': 1.0736411809921265} 11/07/2021 11:03:30 - INFO - __main__ - Step 98401: {'lr': 0.00013555773194270948, 'samples': 18892992, 'steps': 98400, 'loss/train': 1.5206451416015625} 11/07/2021 11:03:31 - INFO - __main__ - Step 98402: {'lr': 0.00013555301389529245, 'samples': 18893184, 'steps': 98401, 'loss/train': 1.0626158714294434} 11/07/2021 11:03:31 - INFO - __main__ - Step 98403: {'lr': 0.00013554829589944344, 'samples': 18893376, 'steps': 98402, 'loss/train': 1.4164586067199707} 11/07/2021 11:03:31 - INFO - __main__ - Step 98404: {'lr': 0.00013554357795516462, 'samples': 18893568, 'steps': 98403, 'loss/train': 1.553225040435791} 11/07/2021 11:03:32 - INFO - __main__ - Step 98405: {'lr': 0.0001355388600624581, 'samples': 18893760, 'steps': 98404, 'loss/train': 1.4028819799423218} 11/07/2021 11:03:32 - INFO - __main__ - Step 98406: {'lr': 0.00013553414222132598, 'samples': 18893952, 'steps': 98405, 'loss/train': 0.98814857006073} 11/07/2021 11:03:33 - INFO - __main__ - Step 98407: {'lr': 0.00013552942443177042, 'samples': 18894144, 'steps': 98406, 'loss/train': 1.1964280605316162} 11/07/2021 11:03:33 - INFO - __main__ - Step 98408: {'lr': 0.00013552470669379353, 'samples': 18894336, 'steps': 98407, 'loss/train': 1.2374588251113892} 11/07/2021 11:03:34 - INFO - __main__ - Step 98409: {'lr': 0.00013551998900739753, 'samples': 18894528, 'steps': 98408, 'loss/train': 1.3886756896972656} 11/07/2021 11:03:34 - INFO - __main__ - Step 98410: {'lr': 0.0001355152713725844, 'samples': 18894720, 'steps': 98409, 'loss/train': 1.0392868518829346} 11/07/2021 11:03:34 - INFO - __main__ - Step 98411: {'lr': 0.0001355105537893563, 'samples': 18894912, 'steps': 98410, 'loss/train': 1.569361925125122} 11/07/2021 11:03:35 - INFO - __main__ - Step 98412: {'lr': 0.00013550583625771535, 'samples': 18895104, 'steps': 98411, 'loss/train': 1.4471443891525269} 11/07/2021 11:03:36 - INFO - __main__ - Step 98413: {'lr': 0.00013550111877766373, 'samples': 18895296, 'steps': 98412, 'loss/train': 1.1909029483795166} 11/07/2021 11:03:36 - INFO - __main__ - Step 98414: {'lr': 0.00013549640134920355, 'samples': 18895488, 'steps': 98413, 'loss/train': 1.3917717933654785} 11/07/2021 11:03:37 - INFO - __main__ - Step 98415: {'lr': 0.00013549168397233692, 'samples': 18895680, 'steps': 98414, 'loss/train': 0.5137991309165955} 11/07/2021 11:03:37 - INFO - __main__ - Step 98416: {'lr': 0.00013548696664706595, 'samples': 18895872, 'steps': 98415, 'loss/train': 1.624289631843567} 11/07/2021 11:03:38 - INFO - __main__ - Step 98417: {'lr': 0.00013548224937339276, 'samples': 18896064, 'steps': 98416, 'loss/train': 1.0909003019332886} 11/07/2021 11:03:38 - INFO - __main__ - Step 98418: {'lr': 0.00013547753215131954, 'samples': 18896256, 'steps': 98417, 'loss/train': 1.3917680978775024} 11/07/2021 11:03:39 - INFO - __main__ - Step 98419: {'lr': 0.0001354728149808484, 'samples': 18896448, 'steps': 98418, 'loss/train': 2.132930278778076} 11/07/2021 11:03:39 - INFO - __main__ - Step 98420: {'lr': 0.00013546809786198137, 'samples': 18896640, 'steps': 98419, 'loss/train': 1.3054800033569336} 11/07/2021 11:03:39 - INFO - __main__ - Step 98421: {'lr': 0.00013546338079472082, 'samples': 18896832, 'steps': 98420, 'loss/train': 1.2370983362197876} 11/07/2021 11:03:40 - INFO - __main__ - Step 98422: {'lr': 0.00013545866377906858, 'samples': 18897024, 'steps': 98421, 'loss/train': 1.2229751348495483} 11/07/2021 11:03:41 - INFO - __main__ - Step 98423: {'lr': 0.00013545394681502689, 'samples': 18897216, 'steps': 98422, 'loss/train': 1.2306723594665527} 11/07/2021 11:03:41 - INFO - __main__ - Step 98424: {'lr': 0.0001354492299025979, 'samples': 18897408, 'steps': 98423, 'loss/train': 0.7076557278633118} 11/07/2021 11:03:41 - INFO - __main__ - Step 98425: {'lr': 0.0001354445130417837, 'samples': 18897600, 'steps': 98424, 'loss/train': 0.9742152690887451} 11/07/2021 11:03:42 - INFO - __main__ - Step 98426: {'lr': 0.00013543979623258646, 'samples': 18897792, 'steps': 98425, 'loss/train': 1.3980671167373657} 11/07/2021 11:03:43 - INFO - __main__ - Step 98427: {'lr': 0.00013543507947500825, 'samples': 18897984, 'steps': 98426, 'loss/train': 1.449747085571289} 11/07/2021 11:03:43 - INFO - __main__ - Step 98428: {'lr': 0.00013543036276905123, 'samples': 18898176, 'steps': 98427, 'loss/train': 1.5316102504730225} 11/07/2021 11:03:44 - INFO - __main__ - Step 98429: {'lr': 0.00013542564611471753, 'samples': 18898368, 'steps': 98428, 'loss/train': 1.2447521686553955} 11/07/2021 11:03:44 - INFO - __main__ - Step 98430: {'lr': 0.00013542092951200927, 'samples': 18898560, 'steps': 98429, 'loss/train': 1.1844550371170044} 11/07/2021 11:03:44 - INFO - __main__ - Step 98431: {'lr': 0.00013541621296092856, 'samples': 18898752, 'steps': 98430, 'loss/train': 1.2383227348327637} 11/07/2021 11:03:45 - INFO - __main__ - Step 98432: {'lr': 0.00013541149646147755, 'samples': 18898944, 'steps': 98431, 'loss/train': 1.2685805559158325} 11/07/2021 11:03:46 - INFO - __main__ - Step 98433: {'lr': 0.00013540678001365837, 'samples': 18899136, 'steps': 98432, 'loss/train': 0.7527052164077759} 11/07/2021 11:03:46 - INFO - __main__ - Step 98434: {'lr': 0.00013540206361747318, 'samples': 18899328, 'steps': 98433, 'loss/train': 2.2276628017425537} 11/07/2021 11:03:46 - INFO - __main__ - Step 98435: {'lr': 0.00013539734727292398, 'samples': 18899520, 'steps': 98434, 'loss/train': 1.3349436521530151} 11/07/2021 11:03:47 - INFO - __main__ - Step 98436: {'lr': 0.00013539263098001294, 'samples': 18899712, 'steps': 98435, 'loss/train': 1.5253632068634033} 11/07/2021 11:03:48 - INFO - __main__ - Step 98437: {'lr': 0.00013538791473874224, 'samples': 18899904, 'steps': 98436, 'loss/train': 0.5066120624542236} 11/07/2021 11:03:48 - INFO - __main__ - Step 98438: {'lr': 0.00013538319854911396, 'samples': 18900096, 'steps': 98437, 'loss/train': 1.4423346519470215} 11/07/2021 11:03:48 - INFO - __main__ - Step 98439: {'lr': 0.00013537848241113027, 'samples': 18900288, 'steps': 98438, 'loss/train': 1.5204437971115112} 11/07/2021 11:03:49 - INFO - __main__ - Step 98440: {'lr': 0.00013537376632479325, 'samples': 18900480, 'steps': 98439, 'loss/train': 1.1633957624435425} 11/07/2021 11:03:49 - INFO - __main__ - Step 98441: {'lr': 0.00013536905029010505, 'samples': 18900672, 'steps': 98440, 'loss/train': 1.5238350629806519} 11/07/2021 11:03:50 - INFO - __main__ - Step 98442: {'lr': 0.00013536433430706775, 'samples': 18900864, 'steps': 98441, 'loss/train': 1.319195032119751} 11/07/2021 11:03:50 - INFO - __main__ - Step 98443: {'lr': 0.00013535961837568355, 'samples': 18901056, 'steps': 98442, 'loss/train': 1.203751802444458} 11/07/2021 11:03:51 - INFO - __main__ - Step 98444: {'lr': 0.0001353549024959545, 'samples': 18901248, 'steps': 98443, 'loss/train': 1.323792576789856} 11/07/2021 11:03:51 - INFO - __main__ - Step 98445: {'lr': 0.0001353501866678828, 'samples': 18901440, 'steps': 98444, 'loss/train': 1.113924503326416} 11/07/2021 11:03:52 - INFO - __main__ - Step 98446: {'lr': 0.00013534547089147052, 'samples': 18901632, 'steps': 98445, 'loss/train': 1.5060700178146362} 11/07/2021 11:03:52 - INFO - __main__ - Step 98447: {'lr': 0.0001353407551667198, 'samples': 18901824, 'steps': 98446, 'loss/train': 1.4778257608413696} 11/07/2021 11:03:53 - INFO - __main__ - Step 98448: {'lr': 0.00013533603949363287, 'samples': 18902016, 'steps': 98447, 'loss/train': 1.6437896490097046} 11/07/2021 11:03:53 - INFO - __main__ - Step 98449: {'lr': 0.00013533132387221166, 'samples': 18902208, 'steps': 98448, 'loss/train': 1.286073923110962} 11/07/2021 11:03:54 - INFO - __main__ - Step 98450: {'lr': 0.0001353266083024584, 'samples': 18902400, 'steps': 98449, 'loss/train': 1.5943832397460938} 11/07/2021 11:03:54 - INFO - __main__ - Step 98451: {'lr': 0.00013532189278437517, 'samples': 18902592, 'steps': 98450, 'loss/train': 1.144266128540039} 11/07/2021 11:03:55 - INFO - __main__ - Step 98452: {'lr': 0.00013531717731796414, 'samples': 18902784, 'steps': 98451, 'loss/train': 1.6232521533966064} 11/07/2021 11:03:55 - INFO - __main__ - Step 98453: {'lr': 0.00013531246190322743, 'samples': 18902976, 'steps': 98452, 'loss/train': 1.3400344848632812} 11/07/2021 11:03:56 - INFO - __main__ - Step 98454: {'lr': 0.00013530774654016715, 'samples': 18903168, 'steps': 98453, 'loss/train': 1.3465652465820312} 11/07/2021 11:03:56 - INFO - __main__ - Step 98455: {'lr': 0.0001353030312287854, 'samples': 18903360, 'steps': 98454, 'loss/train': 0.8966878652572632} 11/07/2021 11:03:56 - INFO - __main__ - Step 98456: {'lr': 0.00013529831596908434, 'samples': 18903552, 'steps': 98455, 'loss/train': 0.9571955800056458} 11/07/2021 11:03:57 - INFO - __main__ - Step 98457: {'lr': 0.00013529360076106612, 'samples': 18903744, 'steps': 98456, 'loss/train': 1.40067458152771} 11/07/2021 11:03:58 - INFO - __main__ - Step 98458: {'lr': 0.00013528888560473281, 'samples': 18903936, 'steps': 98457, 'loss/train': 1.265067458152771} 11/07/2021 11:03:58 - INFO - __main__ - Step 98459: {'lr': 0.00013528417050008657, 'samples': 18904128, 'steps': 98458, 'loss/train': 1.223770022392273} 11/07/2021 11:03:58 - INFO - __main__ - Step 98460: {'lr': 0.0001352794554471295, 'samples': 18904320, 'steps': 98459, 'loss/train': 1.2474489212036133} 11/07/2021 11:03:59 - INFO - __main__ - Step 98461: {'lr': 0.00013527474044586386, 'samples': 18904512, 'steps': 98460, 'loss/train': 1.0798609256744385} 11/07/2021 11:03:59 - INFO - __main__ - Step 98462: {'lr': 0.00013527002549629152, 'samples': 18904704, 'steps': 98461, 'loss/train': 0.9390119314193726} 11/07/2021 11:04:00 - INFO - __main__ - Step 98463: {'lr': 0.00013526531059841477, 'samples': 18904896, 'steps': 98462, 'loss/train': 1.3549518585205078} 11/07/2021 11:04:00 - INFO - __main__ - Step 98464: {'lr': 0.0001352605957522357, 'samples': 18905088, 'steps': 98463, 'loss/train': 0.6062745451927185} 11/07/2021 11:04:01 - INFO - __main__ - Step 98465: {'lr': 0.0001352558809577564, 'samples': 18905280, 'steps': 98464, 'loss/train': 1.1475313901901245} 11/07/2021 11:04:01 - INFO - __main__ - Step 98466: {'lr': 0.00013525116621497903, 'samples': 18905472, 'steps': 98465, 'loss/train': 1.6195857524871826} 11/07/2021 11:04:01 - INFO - __main__ - Step 98467: {'lr': 0.00013524645152390575, 'samples': 18905664, 'steps': 98466, 'loss/train': 1.3565113544464111} 11/07/2021 11:04:03 - INFO - __main__ - Step 98468: {'lr': 0.0001352417368845386, 'samples': 18905856, 'steps': 98467, 'loss/train': 1.252115249633789} 11/07/2021 11:04:03 - INFO - __main__ - Step 98469: {'lr': 0.00013523702229687978, 'samples': 18906048, 'steps': 98468, 'loss/train': 1.503517508506775} 11/07/2021 11:04:03 - INFO - __main__ - Step 98470: {'lr': 0.00013523230776093143, 'samples': 18906240, 'steps': 98469, 'loss/train': 1.2968119382858276} 11/07/2021 11:04:04 - INFO - __main__ - Step 98471: {'lr': 0.0001352275932766956, 'samples': 18906432, 'steps': 98470, 'loss/train': 1.4609096050262451} 11/07/2021 11:04:04 - INFO - __main__ - Step 98472: {'lr': 0.0001352228788441744, 'samples': 18906624, 'steps': 98471, 'loss/train': 1.1662620306015015} 11/07/2021 11:04:05 - INFO - __main__ - Step 98473: {'lr': 0.00013521816446337005, 'samples': 18906816, 'steps': 98472, 'loss/train': 1.9209717512130737} 11/07/2021 11:04:05 - INFO - __main__ - Step 98474: {'lr': 0.0001352134501342847, 'samples': 18907008, 'steps': 98473, 'loss/train': 1.2796759605407715} 11/07/2021 11:04:06 - INFO - __main__ - Step 98475: {'lr': 0.00013520873585692032, 'samples': 18907200, 'steps': 98474, 'loss/train': 1.1335170269012451} 11/07/2021 11:04:06 - INFO - __main__ - Step 98476: {'lr': 0.00013520402163127909, 'samples': 18907392, 'steps': 98475, 'loss/train': 1.6689958572387695} 11/07/2021 11:04:06 - INFO - __main__ - Step 98477: {'lr': 0.00013519930745736316, 'samples': 18907584, 'steps': 98476, 'loss/train': 1.550864577293396} 11/07/2021 11:04:07 - INFO - __main__ - Step 98478: {'lr': 0.00013519459333517466, 'samples': 18907776, 'steps': 98477, 'loss/train': 1.5978097915649414} 11/07/2021 11:04:08 - INFO - __main__ - Step 98479: {'lr': 0.00013518987926471572, 'samples': 18907968, 'steps': 98478, 'loss/train': 1.3556678295135498} 11/07/2021 11:04:08 - INFO - __main__ - Step 98480: {'lr': 0.00013518516524598843, 'samples': 18908160, 'steps': 98479, 'loss/train': 1.4471088647842407} 11/07/2021 11:04:08 - INFO - __main__ - Step 98481: {'lr': 0.00013518045127899493, 'samples': 18908352, 'steps': 98480, 'loss/train': 1.3678088188171387} 11/07/2021 11:04:09 - INFO - __main__ - Step 98482: {'lr': 0.00013517573736373734, 'samples': 18908544, 'steps': 98481, 'loss/train': 1.6164947748184204} 11/07/2021 11:04:10 - INFO - __main__ - Step 98483: {'lr': 0.00013517102350021781, 'samples': 18908736, 'steps': 98482, 'loss/train': 1.1319737434387207} 11/07/2021 11:04:10 - INFO - __main__ - Step 98484: {'lr': 0.00013516630968843843, 'samples': 18908928, 'steps': 98483, 'loss/train': 1.0361262559890747} 11/07/2021 11:04:11 - INFO - __main__ - Step 98485: {'lr': 0.00013516159592840131, 'samples': 18909120, 'steps': 98484, 'loss/train': 0.9352841377258301} 11/07/2021 11:04:11 - INFO - __main__ - Step 98486: {'lr': 0.0001351568822201087, 'samples': 18909312, 'steps': 98485, 'loss/train': 1.2863751649856567} 11/07/2021 11:04:11 - INFO - __main__ - Step 98487: {'lr': 0.00013515216856356256, 'samples': 18909504, 'steps': 98486, 'loss/train': 1.4121408462524414} 11/07/2021 11:04:12 - INFO - __main__ - Step 98488: {'lr': 0.00013514745495876515, 'samples': 18909696, 'steps': 98487, 'loss/train': 1.02507483959198} 11/07/2021 11:04:13 - INFO - __main__ - Step 98489: {'lr': 0.00013514274140571846, 'samples': 18909888, 'steps': 98488, 'loss/train': 1.8220487833023071} 11/07/2021 11:04:13 - INFO - __main__ - Step 98490: {'lr': 0.0001351380279044247, 'samples': 18910080, 'steps': 98489, 'loss/train': 1.4175180196762085} 11/07/2021 11:04:13 - INFO - __main__ - Step 98491: {'lr': 0.00013513331445488594, 'samples': 18910272, 'steps': 98490, 'loss/train': 1.2164909839630127} 11/07/2021 11:04:14 - INFO - __main__ - Step 98492: {'lr': 0.00013512860105710433, 'samples': 18910464, 'steps': 98491, 'loss/train': 1.650642991065979} 11/07/2021 11:04:14 - INFO - __main__ - Step 98493: {'lr': 0.00013512388771108204, 'samples': 18910656, 'steps': 98492, 'loss/train': 1.5294599533081055} 11/07/2021 11:04:15 - INFO - __main__ - Step 98494: {'lr': 0.0001351191744168211, 'samples': 18910848, 'steps': 98493, 'loss/train': 1.3459570407867432} 11/07/2021 11:04:15 - INFO - __main__ - Step 98495: {'lr': 0.00013511446117432375, 'samples': 18911040, 'steps': 98494, 'loss/train': 1.2502626180648804} 11/07/2021 11:04:16 - INFO - __main__ - Step 98496: {'lr': 0.00013510974798359199, 'samples': 18911232, 'steps': 98495, 'loss/train': 1.2792185544967651} 11/07/2021 11:04:16 - INFO - __main__ - Step 98497: {'lr': 0.00013510503484462805, 'samples': 18911424, 'steps': 98496, 'loss/train': 1.4117307662963867} 11/07/2021 11:04:16 - INFO - __main__ - Step 98498: {'lr': 0.000135100321757434, 'samples': 18911616, 'steps': 98497, 'loss/train': 1.4133306741714478} 11/07/2021 11:04:17 - INFO - __main__ - Step 98499: {'lr': 0.00013509560872201193, 'samples': 18911808, 'steps': 98498, 'loss/train': 1.8434267044067383} 11/07/2021 11:04:18 - INFO - __main__ - Step 98500: {'lr': 0.00013509089573836405, 'samples': 18912000, 'steps': 98499, 'loss/train': 1.9759706258773804} 11/07/2021 11:04:18 - INFO - __main__ - Step 98501: {'lr': 0.00013508618280649255, 'samples': 18912192, 'steps': 98500, 'loss/train': 0.8136519193649292} 11/07/2021 11:04:19 - INFO - __main__ - Step 98502: {'lr': 0.0001350814699263993, 'samples': 18912384, 'steps': 98501, 'loss/train': 1.3866060972213745} 11/07/2021 11:04:19 - INFO - __main__ - Step 98503: {'lr': 0.0001350767570980866, 'samples': 18912576, 'steps': 98502, 'loss/train': 1.5908640623092651} 11/07/2021 11:04:20 - INFO - __main__ - Step 98504: {'lr': 0.0001350720443215565, 'samples': 18912768, 'steps': 98503, 'loss/train': 1.2332552671432495} 11/07/2021 11:04:20 - INFO - __main__ - Step 98505: {'lr': 0.00013506733159681123, 'samples': 18912960, 'steps': 98504, 'loss/train': 1.011268138885498} 11/07/2021 11:04:21 - INFO - __main__ - Step 98506: {'lr': 0.0001350626189238528, 'samples': 18913152, 'steps': 98505, 'loss/train': 0.9520596861839294} 11/07/2021 11:04:21 - INFO - __main__ - Step 98507: {'lr': 0.00013505790630268338, 'samples': 18913344, 'steps': 98506, 'loss/train': 1.027584433555603} 11/07/2021 11:04:21 - INFO - __main__ - Step 98508: {'lr': 0.0001350531937333051, 'samples': 18913536, 'steps': 98507, 'loss/train': 1.0417027473449707} 11/07/2021 11:04:22 - INFO - __main__ - Step 98509: {'lr': 0.00013504848121572005, 'samples': 18913728, 'steps': 98508, 'loss/train': 1.6066093444824219} 11/07/2021 11:04:23 - INFO - __main__ - Step 98510: {'lr': 0.00013504376874993044, 'samples': 18913920, 'steps': 98509, 'loss/train': 1.0593034029006958} 11/07/2021 11:04:23 - INFO - __main__ - Step 98511: {'lr': 0.00013503905633593827, 'samples': 18914112, 'steps': 98510, 'loss/train': 1.3226196765899658} 11/07/2021 11:04:23 - INFO - __main__ - Step 98512: {'lr': 0.00013503434397374578, 'samples': 18914304, 'steps': 98511, 'loss/train': 1.86878502368927} 11/07/2021 11:04:24 - INFO - __main__ - Step 98513: {'lr': 0.00013502963166335503, 'samples': 18914496, 'steps': 98512, 'loss/train': 1.4431031942367554} 11/07/2021 11:04:25 - INFO - __main__ - Step 98514: {'lr': 0.00013502491940476814, 'samples': 18914688, 'steps': 98513, 'loss/train': 1.6303400993347168} 11/07/2021 11:04:25 - INFO - __main__ - Step 98515: {'lr': 0.00013502020719798736, 'samples': 18914880, 'steps': 98514, 'loss/train': 2.150179147720337} 11/07/2021 11:04:25 - INFO - __main__ - Step 98516: {'lr': 0.00013501549504301458, 'samples': 18915072, 'steps': 98515, 'loss/train': 1.4482630491256714} 11/07/2021 11:04:26 - INFO - __main__ - Step 98517: {'lr': 0.00013501078293985205, 'samples': 18915264, 'steps': 98516, 'loss/train': 1.3541122674942017} 11/07/2021 11:04:26 - INFO - __main__ - Step 98518: {'lr': 0.0001350060708885019, 'samples': 18915456, 'steps': 98517, 'loss/train': 1.528324007987976} 11/07/2021 11:04:27 - INFO - __main__ - Step 98519: {'lr': 0.00013500135888896622, 'samples': 18915648, 'steps': 98518, 'loss/train': 1.1746481657028198} 11/07/2021 11:04:27 - INFO - __main__ - Step 98520: {'lr': 0.0001349966469412472, 'samples': 18915840, 'steps': 98519, 'loss/train': 1.4708575010299683} 11/07/2021 11:04:28 - INFO - __main__ - Step 98521: {'lr': 0.00013499193504534684, 'samples': 18916032, 'steps': 98520, 'loss/train': 2.532139778137207} 11/07/2021 11:04:28 - INFO - __main__ - Step 98522: {'lr': 0.00013498722320126738, 'samples': 18916224, 'steps': 98521, 'loss/train': 1.6278483867645264} 11/07/2021 11:04:29 - INFO - __main__ - Step 98523: {'lr': 0.0001349825114090109, 'samples': 18916416, 'steps': 98522, 'loss/train': 1.415604829788208} 11/07/2021 11:04:30 - INFO - __main__ - Step 98524: {'lr': 0.00013497779966857953, 'samples': 18916608, 'steps': 98523, 'loss/train': 1.4306175708770752} 11/07/2021 11:04:30 - INFO - __main__ - Step 98525: {'lr': 0.0001349730879799754, 'samples': 18916800, 'steps': 98524, 'loss/train': 1.56378972530365} 11/07/2021 11:04:30 - INFO - __main__ - Step 98526: {'lr': 0.00013496837634320062, 'samples': 18916992, 'steps': 98525, 'loss/train': 1.3748835325241089} 11/07/2021 11:04:31 - INFO - __main__ - Step 98527: {'lr': 0.0001349636647582573, 'samples': 18917184, 'steps': 98526, 'loss/train': 1.3802285194396973} 11/07/2021 11:04:31 - INFO - __main__ - Step 98528: {'lr': 0.00013495895322514768, 'samples': 18917376, 'steps': 98527, 'loss/train': 1.2598539590835571} 11/07/2021 11:04:31 - INFO - __main__ - Step 98529: {'lr': 0.00013495424174387365, 'samples': 18917568, 'steps': 98528, 'loss/train': 1.5147782564163208} 11/07/2021 11:04:32 - INFO - __main__ - Step 98530: {'lr': 0.00013494953031443753, 'samples': 18917760, 'steps': 98529, 'loss/train': 1.9637401103973389} 11/07/2021 11:04:33 - INFO - __main__ - Step 98531: {'lr': 0.00013494481893684134, 'samples': 18917952, 'steps': 98530, 'loss/train': 1.2943000793457031} 11/07/2021 11:04:33 - INFO - __main__ - Step 98532: {'lr': 0.00013494010761108726, 'samples': 18918144, 'steps': 98531, 'loss/train': 1.0780843496322632} 11/07/2021 11:04:33 - INFO - __main__ - Step 98533: {'lr': 0.00013493539633717736, 'samples': 18918336, 'steps': 98532, 'loss/train': 0.8606361150741577} 11/07/2021 11:04:34 - INFO - __main__ - Step 98534: {'lr': 0.0001349306851151138, 'samples': 18918528, 'steps': 98533, 'loss/train': 1.431397795677185} 11/07/2021 11:04:35 - INFO - __main__ - Step 98535: {'lr': 0.0001349259739448987, 'samples': 18918720, 'steps': 98534, 'loss/train': 1.5628408193588257} 11/07/2021 11:04:35 - INFO - __main__ - Step 98536: {'lr': 0.0001349212628265342, 'samples': 18918912, 'steps': 98535, 'loss/train': 1.4936060905456543} 11/07/2021 11:04:36 - INFO - __main__ - Step 98537: {'lr': 0.0001349165517600224, 'samples': 18919104, 'steps': 98536, 'loss/train': 1.464402198791504} 11/07/2021 11:04:36 - INFO - __main__ - Step 98538: {'lr': 0.0001349118407453654, 'samples': 18919296, 'steps': 98537, 'loss/train': 1.1122130155563354} 11/07/2021 11:04:36 - INFO - __main__ - Step 98539: {'lr': 0.00013490712978256537, 'samples': 18919488, 'steps': 98538, 'loss/train': 1.4234387874603271} 11/07/2021 11:04:37 - INFO - __main__ - Step 98540: {'lr': 0.0001349024188716244, 'samples': 18919680, 'steps': 98539, 'loss/train': 1.3386741876602173} 11/07/2021 11:04:38 - INFO - __main__ - Step 98541: {'lr': 0.00013489770801254465, 'samples': 18919872, 'steps': 98540, 'loss/train': 1.3507503271102905} 11/07/2021 11:04:38 - INFO - __main__ - Step 98542: {'lr': 0.00013489299720532828, 'samples': 18920064, 'steps': 98541, 'loss/train': 1.6293385028839111} 11/07/2021 11:04:38 - INFO - __main__ - Step 98543: {'lr': 0.00013488828644997724, 'samples': 18920256, 'steps': 98542, 'loss/train': 0.36781924962997437} 11/07/2021 11:04:39 - INFO - __main__ - Step 98544: {'lr': 0.0001348835757464938, 'samples': 18920448, 'steps': 98543, 'loss/train': 1.2132822275161743} 11/07/2021 11:04:40 - INFO - __main__ - Step 98545: {'lr': 0.00013487886509488001, 'samples': 18920640, 'steps': 98544, 'loss/train': 1.1205358505249023} 11/07/2021 11:04:40 - INFO - __main__ - Step 98546: {'lr': 0.00013487415449513806, 'samples': 18920832, 'steps': 98545, 'loss/train': 1.3448694944381714} 11/07/2021 11:04:40 - INFO - __main__ - Step 98547: {'lr': 0.00013486944394727003, 'samples': 18921024, 'steps': 98546, 'loss/train': 1.4649658203125} 11/07/2021 11:04:41 - INFO - __main__ - Step 98548: {'lr': 0.00013486473345127804, 'samples': 18921216, 'steps': 98547, 'loss/train': 0.9993444085121155} 11/07/2021 11:04:41 - INFO - __main__ - Step 98549: {'lr': 0.00013486002300716423, 'samples': 18921408, 'steps': 98548, 'loss/train': 1.2818840742111206} 11/07/2021 11:04:42 - INFO - __main__ - Step 98550: {'lr': 0.00013485531261493074, 'samples': 18921600, 'steps': 98549, 'loss/train': 0.9386177062988281} 11/07/2021 11:04:43 - INFO - __main__ - Step 98551: {'lr': 0.00013485060227457965, 'samples': 18921792, 'steps': 98550, 'loss/train': 1.7065277099609375} 11/07/2021 11:04:43 - INFO - __main__ - Step 98552: {'lr': 0.00013484589198611306, 'samples': 18921984, 'steps': 98551, 'loss/train': 1.328627586364746} 11/07/2021 11:04:43 - INFO - __main__ - Step 98553: {'lr': 0.00013484118174953322, 'samples': 18922176, 'steps': 98552, 'loss/train': 1.575774073600769} 11/07/2021 11:04:44 - INFO - __main__ - Step 98554: {'lr': 0.00013483647156484213, 'samples': 18922368, 'steps': 98553, 'loss/train': 1.2847265005111694} 11/07/2021 11:04:44 - INFO - __main__ - Step 98555: {'lr': 0.000134831761432042, 'samples': 18922560, 'steps': 98554, 'loss/train': 1.1623845100402832} 11/07/2021 11:04:45 - INFO - __main__ - Step 98556: {'lr': 0.00013482705135113487, 'samples': 18922752, 'steps': 98555, 'loss/train': 1.1513434648513794} 11/07/2021 11:04:45 - INFO - __main__ - Step 98557: {'lr': 0.00013482234132212287, 'samples': 18922944, 'steps': 98556, 'loss/train': 1.4431698322296143} 11/07/2021 11:04:46 - INFO - __main__ - Step 98558: {'lr': 0.00013481763134500814, 'samples': 18923136, 'steps': 98557, 'loss/train': 1.1348109245300293} 11/07/2021 11:04:46 - INFO - __main__ - Step 98559: {'lr': 0.00013481292141979278, 'samples': 18923328, 'steps': 98558, 'loss/train': 1.5346879959106445} 11/07/2021 11:04:46 - INFO - __main__ - Step 98560: {'lr': 0.000134808211546479, 'samples': 18923520, 'steps': 98559, 'loss/train': 1.3050888776779175} 11/07/2021 11:04:47 - INFO - __main__ - Step 98561: {'lr': 0.00013480350172506884, 'samples': 18923712, 'steps': 98560, 'loss/train': 1.5590519905090332} 11/07/2021 11:04:48 - INFO - __main__ - Step 98562: {'lr': 0.00013479879195556443, 'samples': 18923904, 'steps': 98561, 'loss/train': 0.2811278998851776} 11/07/2021 11:04:48 - INFO - __main__ - Step 98563: {'lr': 0.0001347940822379679, 'samples': 18924096, 'steps': 98562, 'loss/train': 1.5153565406799316} 11/07/2021 11:04:48 - INFO - __main__ - Step 98564: {'lr': 0.00013478937257228142, 'samples': 18924288, 'steps': 98563, 'loss/train': 0.9965879321098328} 11/07/2021 11:04:49 - INFO - __main__ - Step 98565: {'lr': 0.00013478466295850704, 'samples': 18924480, 'steps': 98564, 'loss/train': 1.460830569267273} 11/07/2021 11:04:50 - INFO - __main__ - Step 98566: {'lr': 0.00013477995339664689, 'samples': 18924672, 'steps': 98565, 'loss/train': 1.4697974920272827} 11/07/2021 11:04:50 - INFO - __main__ - Step 98567: {'lr': 0.00013477524388670316, 'samples': 18924864, 'steps': 98566, 'loss/train': 1.5351368188858032} 11/07/2021 11:04:51 - INFO - __main__ - Step 98568: {'lr': 0.0001347705344286779, 'samples': 18925056, 'steps': 98567, 'loss/train': 1.942835807800293} 11/07/2021 11:04:51 - INFO - __main__ - Step 98569: {'lr': 0.00013476582502257336, 'samples': 18925248, 'steps': 98568, 'loss/train': 1.1334362030029297} 11/07/2021 11:04:51 - INFO - __main__ - Step 98570: {'lr': 0.00013476111566839148, 'samples': 18925440, 'steps': 98569, 'loss/train': 1.5577706098556519} 11/07/2021 11:04:52 - INFO - __main__ - Step 98571: {'lr': 0.00013475640636613446, 'samples': 18925632, 'steps': 98570, 'loss/train': 1.5464377403259277} 11/07/2021 11:04:53 - INFO - __main__ - Step 98572: {'lr': 0.0001347516971158044, 'samples': 18925824, 'steps': 98571, 'loss/train': 1.3249952793121338} 11/07/2021 11:04:53 - INFO - __main__ - Step 98573: {'lr': 0.00013474698791740347, 'samples': 18926016, 'steps': 98572, 'loss/train': 1.6088650226593018} 11/07/2021 11:04:53 - INFO - __main__ - Step 98574: {'lr': 0.00013474227877093375, 'samples': 18926208, 'steps': 98573, 'loss/train': 1.2779675722122192} 11/07/2021 11:04:54 - INFO - __main__ - Step 98575: {'lr': 0.0001347375696763974, 'samples': 18926400, 'steps': 98574, 'loss/train': 1.2839831113815308} 11/07/2021 11:04:55 - INFO - __main__ - Step 98576: {'lr': 0.00013473286063379653, 'samples': 18926592, 'steps': 98575, 'loss/train': 1.5929737091064453} 11/07/2021 11:04:55 - INFO - __main__ - Step 98577: {'lr': 0.00013472815164313325, 'samples': 18926784, 'steps': 98576, 'loss/train': 1.3175699710845947} 11/07/2021 11:04:56 - INFO - __main__ - Step 98578: {'lr': 0.00013472344270440965, 'samples': 18926976, 'steps': 98577, 'loss/train': 1.4246878623962402} 11/07/2021 11:04:56 - INFO - __main__ - Step 98579: {'lr': 0.0001347187338176279, 'samples': 18927168, 'steps': 98578, 'loss/train': 0.6282012462615967} 11/07/2021 11:04:56 - INFO - __main__ - Step 98580: {'lr': 0.0001347140249827901, 'samples': 18927360, 'steps': 98579, 'loss/train': 1.4416236877441406} 11/07/2021 11:04:57 - INFO - __main__ - Step 98581: {'lr': 0.00013470931619989846, 'samples': 18927552, 'steps': 98580, 'loss/train': 1.5833542346954346} 11/07/2021 11:04:58 - INFO - __main__ - Step 98582: {'lr': 0.00013470460746895505, 'samples': 18927744, 'steps': 98581, 'loss/train': 1.104358434677124} 11/07/2021 11:04:58 - INFO - __main__ - Step 98583: {'lr': 0.00013469989878996187, 'samples': 18927936, 'steps': 98582, 'loss/train': 1.29217529296875} 11/07/2021 11:04:58 - INFO - __main__ - Step 98584: {'lr': 0.00013469519016292113, 'samples': 18928128, 'steps': 98583, 'loss/train': 1.9965591430664062} 11/07/2021 11:04:59 - INFO - __main__ - Step 98585: {'lr': 0.000134690481587835, 'samples': 18928320, 'steps': 98584, 'loss/train': 1.2877286672592163} 11/07/2021 11:04:59 - INFO - __main__ - Step 98586: {'lr': 0.00013468577306470554, 'samples': 18928512, 'steps': 98585, 'loss/train': 1.358564019203186} 11/07/2021 11:05:00 - INFO - __main__ - Step 98587: {'lr': 0.0001346810645935349, 'samples': 18928704, 'steps': 98586, 'loss/train': 1.4120063781738281} 11/07/2021 11:05:00 - INFO - __main__ - Step 98588: {'lr': 0.00013467635617432516, 'samples': 18928896, 'steps': 98587, 'loss/train': 1.2753301858901978} 11/07/2021 11:05:01 - INFO - __main__ - Step 98589: {'lr': 0.00013467164780707849, 'samples': 18929088, 'steps': 98588, 'loss/train': 0.8075432777404785} 11/07/2021 11:05:01 - INFO - __main__ - Step 98590: {'lr': 0.000134666939491797, 'samples': 18929280, 'steps': 98589, 'loss/train': 1.2452013492584229} 11/07/2021 11:05:01 - INFO - __main__ - Step 98591: {'lr': 0.0001346622312284828, 'samples': 18929472, 'steps': 98590, 'loss/train': 1.8459932804107666} 11/07/2021 11:05:03 - INFO - __main__ - Step 98592: {'lr': 0.00013465752301713806, 'samples': 18929664, 'steps': 98591, 'loss/train': 0.9406793117523193} 11/07/2021 11:05:03 - INFO - __main__ - Step 98593: {'lr': 0.0001346528148577648, 'samples': 18929856, 'steps': 98592, 'loss/train': 1.3703773021697998} 11/07/2021 11:05:03 - INFO - __main__ - Step 98594: {'lr': 0.00013464810675036526, 'samples': 18930048, 'steps': 98593, 'loss/train': 1.4487603902816772} 11/07/2021 11:05:04 - INFO - __main__ - Step 98595: {'lr': 0.00013464339869494155, 'samples': 18930240, 'steps': 98594, 'loss/train': 1.1084033250808716} 11/07/2021 11:05:04 - INFO - __main__ - Step 98596: {'lr': 0.00013463869069149566, 'samples': 18930432, 'steps': 98595, 'loss/train': 1.6132479906082153} 11/07/2021 11:05:05 - INFO - __main__ - Step 98597: {'lr': 0.0001346339827400298, 'samples': 18930624, 'steps': 98596, 'loss/train': 1.3423261642456055} 11/07/2021 11:05:05 - INFO - __main__ - Step 98598: {'lr': 0.0001346292748405461, 'samples': 18930816, 'steps': 98597, 'loss/train': 1.262235164642334} 11/07/2021 11:05:06 - INFO - __main__ - Step 98599: {'lr': 0.00013462456699304666, 'samples': 18931008, 'steps': 98598, 'loss/train': 1.4353877305984497} 11/07/2021 11:05:06 - INFO - __main__ - Step 98600: {'lr': 0.00013461985919753362, 'samples': 18931200, 'steps': 98599, 'loss/train': 1.4643163681030273} 11/07/2021 11:05:06 - INFO - __main__ - Step 98601: {'lr': 0.00013461515145400907, 'samples': 18931392, 'steps': 98600, 'loss/train': 1.4581241607666016} 11/07/2021 11:05:07 - INFO - __main__ - Step 98602: {'lr': 0.00013461044376247516, 'samples': 18931584, 'steps': 98601, 'loss/train': 1.3136805295944214} 11/07/2021 11:05:08 - INFO - __main__ - Step 98603: {'lr': 0.000134605736122934, 'samples': 18931776, 'steps': 98602, 'loss/train': 0.8281056880950928} 11/07/2021 11:05:08 - INFO - __main__ - Step 98604: {'lr': 0.0001346010285353877, 'samples': 18931968, 'steps': 98603, 'loss/train': 1.524132251739502} 11/07/2021 11:05:09 - INFO - __main__ - Step 98605: {'lr': 0.00013459632099983843, 'samples': 18932160, 'steps': 98604, 'loss/train': 1.2791244983673096} 11/07/2021 11:05:09 - INFO - __main__ - Step 98606: {'lr': 0.00013459161351628827, 'samples': 18932352, 'steps': 98605, 'loss/train': 1.4113832712173462} 11/07/2021 11:05:09 - INFO - __main__ - Step 98607: {'lr': 0.00013458690608473934, 'samples': 18932544, 'steps': 98606, 'loss/train': 1.286653757095337} 11/07/2021 11:05:10 - INFO - __main__ - Step 98608: {'lr': 0.00013458219870519377, 'samples': 18932736, 'steps': 98607, 'loss/train': 1.1678448915481567} 11/07/2021 11:05:11 - INFO - __main__ - Step 98609: {'lr': 0.0001345774913776538, 'samples': 18932928, 'steps': 98608, 'loss/train': 1.527572751045227} 11/07/2021 11:05:11 - INFO - __main__ - Step 98610: {'lr': 0.0001345727841021213, 'samples': 18933120, 'steps': 98609, 'loss/train': 1.1922667026519775} 11/07/2021 11:05:11 - INFO - __main__ - Step 98611: {'lr': 0.00013456807687859852, 'samples': 18933312, 'steps': 98610, 'loss/train': 1.3829959630966187} 11/07/2021 11:05:12 - INFO - __main__ - Step 98612: {'lr': 0.0001345633697070876, 'samples': 18933504, 'steps': 98611, 'loss/train': 1.2869248390197754} 11/07/2021 11:05:13 - INFO - __main__ - Step 98613: {'lr': 0.00013455866258759065, 'samples': 18933696, 'steps': 98612, 'loss/train': 1.1261502504348755} 11/07/2021 11:05:13 - INFO - __main__ - Step 98614: {'lr': 0.00013455395552010977, 'samples': 18933888, 'steps': 98613, 'loss/train': 1.6028447151184082} 11/07/2021 11:05:13 - INFO - __main__ - Step 98615: {'lr': 0.00013454924850464712, 'samples': 18934080, 'steps': 98614, 'loss/train': 1.4574110507965088} 11/07/2021 11:05:14 - INFO - __main__ - Step 98616: {'lr': 0.00013454454154120476, 'samples': 18934272, 'steps': 98615, 'loss/train': 0.9393197894096375} 11/07/2021 11:05:14 - INFO - __main__ - Step 98617: {'lr': 0.00013453983462978486, 'samples': 18934464, 'steps': 98616, 'loss/train': 1.3137191534042358} 11/07/2021 11:05:15 - INFO - __main__ - Step 98618: {'lr': 0.00013453512777038954, 'samples': 18934656, 'steps': 98617, 'loss/train': 0.7710111141204834} 11/07/2021 11:05:16 - INFO - __main__ - Step 98619: {'lr': 0.0001345304209630209, 'samples': 18934848, 'steps': 98618, 'loss/train': 1.3179579973220825} 11/07/2021 11:05:16 - INFO - __main__ - Step 98620: {'lr': 0.00013452571420768106, 'samples': 18935040, 'steps': 98619, 'loss/train': 1.6623945236206055} 11/07/2021 11:05:16 - INFO - __main__ - Step 98621: {'lr': 0.00013452100750437217, 'samples': 18935232, 'steps': 98620, 'loss/train': 1.3745497465133667} 11/07/2021 11:05:17 - INFO - __main__ - Step 98622: {'lr': 0.00013451630085309647, 'samples': 18935424, 'steps': 98621, 'loss/train': 1.5499961376190186} 11/07/2021 11:05:18 - INFO - __main__ - Step 98623: {'lr': 0.00013451159425385579, 'samples': 18935616, 'steps': 98622, 'loss/train': 1.071232795715332} 11/07/2021 11:05:18 - INFO - __main__ - Step 98624: {'lr': 0.00013450688770665244, 'samples': 18935808, 'steps': 98623, 'loss/train': 1.635992169380188} 11/07/2021 11:05:18 - INFO - __main__ - Step 98625: {'lr': 0.00013450218121148844, 'samples': 18936000, 'steps': 98624, 'loss/train': 1.1024539470672607} 11/07/2021 11:05:19 - INFO - __main__ - Step 98626: {'lr': 0.00013449747476836603, 'samples': 18936192, 'steps': 98625, 'loss/train': 1.3310885429382324} 11/07/2021 11:05:19 - INFO - __main__ - Step 98627: {'lr': 0.00013449276837728725, 'samples': 18936384, 'steps': 98626, 'loss/train': 1.334861397743225} 11/07/2021 11:05:19 - INFO - __main__ - Step 98628: {'lr': 0.00013448806203825424, 'samples': 18936576, 'steps': 98627, 'loss/train': 1.3654171228408813} 11/07/2021 11:05:20 - INFO - __main__ - Step 98629: {'lr': 0.00013448335575126915, 'samples': 18936768, 'steps': 98628, 'loss/train': 0.9716129302978516} 11/07/2021 11:05:21 - INFO - __main__ - Step 98630: {'lr': 0.0001344786495163341, 'samples': 18936960, 'steps': 98629, 'loss/train': 0.8663007616996765} 11/07/2021 11:05:21 - INFO - __main__ - Step 98631: {'lr': 0.00013447394333345115, 'samples': 18937152, 'steps': 98630, 'loss/train': 1.869653582572937} 11/07/2021 11:05:22 - INFO - __main__ - Step 98632: {'lr': 0.00013446923720262244, 'samples': 18937344, 'steps': 98631, 'loss/train': 1.7710411548614502} 11/07/2021 11:05:22 - INFO - __main__ - Step 98633: {'lr': 0.00013446453112385016, 'samples': 18937536, 'steps': 98632, 'loss/train': 0.09693453460931778} 11/07/2021 11:05:23 - INFO - __main__ - Step 98634: {'lr': 0.00013445982509713644, 'samples': 18937728, 'steps': 98633, 'loss/train': 1.2823964357376099} 11/07/2021 11:05:23 - INFO - __main__ - Step 98635: {'lr': 0.00013445511912248327, 'samples': 18937920, 'steps': 98634, 'loss/train': 1.4661797285079956} 11/07/2021 11:05:24 - INFO - __main__ - Step 98636: {'lr': 0.00013445041319989283, 'samples': 18938112, 'steps': 98635, 'loss/train': 1.2863768339157104} 11/07/2021 11:05:24 - INFO - __main__ - Step 98637: {'lr': 0.00013444570732936721, 'samples': 18938304, 'steps': 98636, 'loss/train': 1.1518174409866333} 11/07/2021 11:05:24 - INFO - __main__ - Step 98638: {'lr': 0.00013444100151090865, 'samples': 18938496, 'steps': 98637, 'loss/train': 0.8608997464179993} 11/07/2021 11:05:25 - INFO - __main__ - Step 98639: {'lr': 0.00013443629574451916, 'samples': 18938688, 'steps': 98638, 'loss/train': 1.2221826314926147} 11/07/2021 11:05:26 - INFO - __main__ - Step 98640: {'lr': 0.00013443159003020087, 'samples': 18938880, 'steps': 98639, 'loss/train': 1.2423344850540161} 11/07/2021 11:05:26 - INFO - __main__ - Step 98641: {'lr': 0.00013442688436795592, 'samples': 18939072, 'steps': 98640, 'loss/train': 1.7515476942062378} 11/07/2021 11:05:26 - INFO - __main__ - Step 98642: {'lr': 0.00013442217875778644, 'samples': 18939264, 'steps': 98641, 'loss/train': 1.19896399974823} 11/07/2021 11:05:27 - INFO - __main__ - Step 98643: {'lr': 0.00013441747319969455, 'samples': 18939456, 'steps': 98642, 'loss/train': 1.1609183549880981} 11/07/2021 11:05:29 - INFO - __main__ - Step 98644: {'lr': 0.00013441276769368237, 'samples': 18939648, 'steps': 98643, 'loss/train': 1.3403464555740356} 11/07/2021 11:05:29 - INFO - __main__ - Step 98645: {'lr': 0.0001344080622397521, 'samples': 18939840, 'steps': 98644, 'loss/train': 1.6220948696136475} 11/07/2021 11:05:30 - INFO - __main__ - Step 98646: {'lr': 0.00013440335683790567, 'samples': 18940032, 'steps': 98645, 'loss/train': 1.2281389236450195} 11/07/2021 11:05:30 - INFO - __main__ - Step 98647: {'lr': 0.00013439865148814534, 'samples': 18940224, 'steps': 98646, 'loss/train': 1.1178312301635742} 11/07/2021 11:05:30 - INFO - __main__ - Step 98648: {'lr': 0.00013439394619047315, 'samples': 18940416, 'steps': 98647, 'loss/train': 0.8308822512626648} 11/07/2021 11:05:31 - INFO - __main__ - Step 98649: {'lr': 0.0001343892409448913, 'samples': 18940608, 'steps': 98648, 'loss/train': 1.7047758102416992} 11/07/2021 11:05:31 - INFO - __main__ - Step 98650: {'lr': 0.00013438453575140183, 'samples': 18940800, 'steps': 98649, 'loss/train': 1.7467089891433716} 11/07/2021 11:05:31 - INFO - __main__ - Step 98651: {'lr': 0.00013437983061000694, 'samples': 18940992, 'steps': 98650, 'loss/train': 1.757638931274414} 11/07/2021 11:05:32 - INFO - __main__ - Step 98652: {'lr': 0.00013437512552070868, 'samples': 18941184, 'steps': 98651, 'loss/train': 0.5607432723045349} 11/07/2021 11:05:33 - INFO - __main__ - Step 98653: {'lr': 0.00013437042048350923, 'samples': 18941376, 'steps': 98652, 'loss/train': 1.4252263307571411} 11/07/2021 11:05:33 - INFO - __main__ - Step 98654: {'lr': 0.00013436571549841071, 'samples': 18941568, 'steps': 98653, 'loss/train': 1.381831169128418} 11/07/2021 11:05:33 - INFO - __main__ - Step 98655: {'lr': 0.00013436101056541516, 'samples': 18941760, 'steps': 98654, 'loss/train': 1.2580598592758179} 11/07/2021 11:05:34 - INFO - __main__ - Step 98656: {'lr': 0.0001343563056845249, 'samples': 18941952, 'steps': 98655, 'loss/train': 0.9640718102455139} 11/07/2021 11:05:35 - INFO - __main__ - Step 98657: {'lr': 0.00013435160085574176, 'samples': 18942144, 'steps': 98656, 'loss/train': 1.4260221719741821} 11/07/2021 11:05:35 - INFO - __main__ - Step 98658: {'lr': 0.00013434689607906802, 'samples': 18942336, 'steps': 98657, 'loss/train': 1.3028737306594849} 11/07/2021 11:05:36 - INFO - __main__ - Step 98659: {'lr': 0.0001343421913545058, 'samples': 18942528, 'steps': 98658, 'loss/train': 1.7812740802764893} 11/07/2021 11:05:36 - INFO - __main__ - Step 98660: {'lr': 0.0001343374866820572, 'samples': 18942720, 'steps': 98659, 'loss/train': 1.1194465160369873} 11/07/2021 11:05:36 - INFO - __main__ - Step 98661: {'lr': 0.00013433278206172433, 'samples': 18942912, 'steps': 98660, 'loss/train': 1.9537456035614014} 11/07/2021 11:05:37 - INFO - __main__ - Step 98662: {'lr': 0.00013432807749350935, 'samples': 18943104, 'steps': 98661, 'loss/train': 0.45448586344718933} 11/07/2021 11:05:38 - INFO - __main__ - Step 98663: {'lr': 0.00013432337297741436, 'samples': 18943296, 'steps': 98662, 'loss/train': 1.4617420434951782} 11/07/2021 11:05:38 - INFO - __main__ - Step 98664: {'lr': 0.00013431866851344143, 'samples': 18943488, 'steps': 98663, 'loss/train': 1.2633477449417114} 11/07/2021 11:05:39 - INFO - __main__ - Step 98665: {'lr': 0.00013431396410159275, 'samples': 18943680, 'steps': 98664, 'loss/train': 1.3954282999038696} 11/07/2021 11:05:39 - INFO - __main__ - Step 98666: {'lr': 0.00013430925974187042, 'samples': 18943872, 'steps': 98665, 'loss/train': 1.5361640453338623} 11/07/2021 11:05:40 - INFO - __main__ - Step 98667: {'lr': 0.0001343045554342766, 'samples': 18944064, 'steps': 98666, 'loss/train': 1.3835631608963013} 11/07/2021 11:05:40 - INFO - __main__ - Step 98668: {'lr': 0.00013429985117881333, 'samples': 18944256, 'steps': 98667, 'loss/train': 1.5889272689819336} 11/07/2021 11:05:41 - INFO - __main__ - Step 98669: {'lr': 0.00013429514697548274, 'samples': 18944448, 'steps': 98668, 'loss/train': 1.5402586460113525} 11/07/2021 11:05:41 - INFO - __main__ - Step 98670: {'lr': 0.00013429044282428694, 'samples': 18944640, 'steps': 98669, 'loss/train': 2.4692530632019043} 11/07/2021 11:05:41 - INFO - __main__ - Step 98671: {'lr': 0.0001342857387252281, 'samples': 18944832, 'steps': 98670, 'loss/train': 1.4750124216079712} 11/07/2021 11:05:43 - INFO - __main__ - Step 98672: {'lr': 0.00013428103467830833, 'samples': 18945024, 'steps': 98671, 'loss/train': 0.8325183987617493} 11/07/2021 11:05:43 - INFO - __main__ - Step 98673: {'lr': 0.00013427633068352973, 'samples': 18945216, 'steps': 98672, 'loss/train': 1.4484590291976929} 11/07/2021 11:05:43 - INFO - __main__ - Step 98674: {'lr': 0.00013427162674089444, 'samples': 18945408, 'steps': 98673, 'loss/train': 1.6519277095794678} 11/07/2021 11:05:44 - INFO - __main__ - Step 98675: {'lr': 0.00013426692285040454, 'samples': 18945600, 'steps': 98674, 'loss/train': 1.8496516942977905} 11/07/2021 11:05:44 - INFO - __main__ - Step 98676: {'lr': 0.0001342622190120622, 'samples': 18945792, 'steps': 98675, 'loss/train': 1.5077877044677734} 11/07/2021 11:05:45 - INFO - __main__ - Step 98677: {'lr': 0.00013425751522586955, 'samples': 18945984, 'steps': 98676, 'loss/train': 1.4077728986740112} 11/07/2021 11:05:45 - INFO - __main__ - Step 98678: {'lr': 0.00013425281149182872, 'samples': 18946176, 'steps': 98677, 'loss/train': 1.1860836744308472} 11/07/2021 11:05:46 - INFO - __main__ - Step 98679: {'lr': 0.00013424810780994173, 'samples': 18946368, 'steps': 98678, 'loss/train': 1.4605777263641357} 11/07/2021 11:05:46 - INFO - __main__ - Step 98680: {'lr': 0.00013424340418021074, 'samples': 18946560, 'steps': 98679, 'loss/train': 1.2482258081436157} 11/07/2021 11:05:46 - INFO - __main__ - Step 98681: {'lr': 0.00013423870060263787, 'samples': 18946752, 'steps': 98680, 'loss/train': 0.8105692863464355} 11/07/2021 11:05:47 - INFO - __main__ - Step 98682: {'lr': 0.00013423399707722527, 'samples': 18946944, 'steps': 98681, 'loss/train': 0.8157587647438049} 11/07/2021 11:05:48 - INFO - __main__ - Step 98683: {'lr': 0.00013422929360397507, 'samples': 18947136, 'steps': 98682, 'loss/train': 1.578786015510559} 11/07/2021 11:05:48 - INFO - __main__ - Step 98684: {'lr': 0.00013422459018288936, 'samples': 18947328, 'steps': 98683, 'loss/train': 1.2584004402160645} 11/07/2021 11:05:48 - INFO - __main__ - Step 98685: {'lr': 0.00013421988681397022, 'samples': 18947520, 'steps': 98684, 'loss/train': 1.2543683052062988} 11/07/2021 11:05:49 - INFO - __main__ - Step 98686: {'lr': 0.00013421518349721983, 'samples': 18947712, 'steps': 98685, 'loss/train': 1.3977375030517578} 11/07/2021 11:05:49 - INFO - __main__ - Step 98687: {'lr': 0.00013421048023264028, 'samples': 18947904, 'steps': 98686, 'loss/train': 1.3181413412094116} 11/07/2021 11:05:50 - INFO - __main__ - Step 98688: {'lr': 0.00013420577702023373, 'samples': 18948096, 'steps': 98687, 'loss/train': 1.3255921602249146} 11/07/2021 11:05:51 - INFO - __main__ - Step 98689: {'lr': 0.00013420107386000226, 'samples': 18948288, 'steps': 98688, 'loss/train': 1.5171387195587158} 11/07/2021 11:05:51 - INFO - __main__ - Step 98690: {'lr': 0.0001341963707519481, 'samples': 18948480, 'steps': 98689, 'loss/train': 1.635765790939331} 11/07/2021 11:05:51 - INFO - __main__ - Step 98691: {'lr': 0.00013419166769607316, 'samples': 18948672, 'steps': 98690, 'loss/train': 1.3939487934112549} 11/07/2021 11:05:52 - INFO - __main__ - Step 98692: {'lr': 0.00013418696469237967, 'samples': 18948864, 'steps': 98691, 'loss/train': 1.085745096206665} 11/07/2021 11:05:53 - INFO - __main__ - Step 98693: {'lr': 0.00013418226174086975, 'samples': 18949056, 'steps': 98692, 'loss/train': 1.9596433639526367} 11/07/2021 11:05:53 - INFO - __main__ - Step 98694: {'lr': 0.00013417755884154552, 'samples': 18949248, 'steps': 98693, 'loss/train': 0.890134334564209} 11/07/2021 11:05:53 - INFO - __main__ - Step 98695: {'lr': 0.0001341728559944091, 'samples': 18949440, 'steps': 98694, 'loss/train': 0.5208776593208313} 11/07/2021 11:05:54 - INFO - __main__ - Step 98696: {'lr': 0.00013416815319946258, 'samples': 18949632, 'steps': 98695, 'loss/train': 1.4723776578903198} 11/07/2021 11:05:54 - INFO - __main__ - Step 98697: {'lr': 0.00013416345045670814, 'samples': 18949824, 'steps': 98696, 'loss/train': 1.4136862754821777} 11/07/2021 11:05:54 - INFO - __main__ - Step 98698: {'lr': 0.00013415874776614783, 'samples': 18950016, 'steps': 98697, 'loss/train': 1.189691185951233} 11/07/2021 11:05:55 - INFO - __main__ - Step 98699: {'lr': 0.00013415404512778382, 'samples': 18950208, 'steps': 98698, 'loss/train': 0.6608111262321472} 11/07/2021 11:05:56 - INFO - __main__ - Step 98700: {'lr': 0.0001341493425416182, 'samples': 18950400, 'steps': 98699, 'loss/train': 0.675640344619751} 11/07/2021 11:05:56 - INFO - __main__ - Step 98701: {'lr': 0.0001341446400076531, 'samples': 18950592, 'steps': 98700, 'loss/train': 1.6060291528701782} 11/07/2021 11:05:57 - INFO - __main__ - Step 98702: {'lr': 0.00013413993752589063, 'samples': 18950784, 'steps': 98701, 'loss/train': 1.737141489982605} 11/07/2021 11:05:57 - INFO - __main__ - Step 98703: {'lr': 0.00013413523509633301, 'samples': 18950976, 'steps': 98702, 'loss/train': 0.9259989261627197} 11/07/2021 11:05:58 - INFO - __main__ - Step 98704: {'lr': 0.00013413053271898217, 'samples': 18951168, 'steps': 98703, 'loss/train': 1.4720195531845093} 11/07/2021 11:05:58 - INFO - __main__ - Step 98705: {'lr': 0.00013412583039384035, 'samples': 18951360, 'steps': 98704, 'loss/train': 1.4817739725112915} 11/07/2021 11:05:59 - INFO - __main__ - Step 98706: {'lr': 0.0001341211281209096, 'samples': 18951552, 'steps': 98705, 'loss/train': 1.2302230596542358} 11/07/2021 11:05:59 - INFO - __main__ - Step 98707: {'lr': 0.00013411642590019214, 'samples': 18951744, 'steps': 98706, 'loss/train': 1.2927075624465942} 11/07/2021 11:05:59 - INFO - __main__ - Step 98708: {'lr': 0.00013411172373168997, 'samples': 18951936, 'steps': 98707, 'loss/train': 0.6082497835159302} 11/07/2021 11:06:00 - INFO - __main__ - Step 98709: {'lr': 0.00013410702161540528, 'samples': 18952128, 'steps': 98708, 'loss/train': 1.4942846298217773} 11/07/2021 11:06:01 - INFO - __main__ - Step 98710: {'lr': 0.00013410231955134023, 'samples': 18952320, 'steps': 98709, 'loss/train': 1.0536251068115234} 11/07/2021 11:06:01 - INFO - __main__ - Step 98711: {'lr': 0.00013409761753949685, 'samples': 18952512, 'steps': 98710, 'loss/train': 0.7793208360671997} 11/07/2021 11:06:01 - INFO - __main__ - Step 98712: {'lr': 0.00013409291557987726, 'samples': 18952704, 'steps': 98711, 'loss/train': 1.3187243938446045} 11/07/2021 11:06:02 - INFO - __main__ - Step 98713: {'lr': 0.00013408821367248363, 'samples': 18952896, 'steps': 98712, 'loss/train': 0.9888020753860474} 11/07/2021 11:06:03 - INFO - __main__ - Step 98714: {'lr': 0.00013408351181731808, 'samples': 18953088, 'steps': 98713, 'loss/train': 0.866351842880249} 11/07/2021 11:06:03 - INFO - __main__ - Step 98715: {'lr': 0.00013407881001438273, 'samples': 18953280, 'steps': 98714, 'loss/train': 1.1849193572998047} 11/07/2021 11:06:03 - INFO - __main__ - Step 98716: {'lr': 0.00013407410826367976, 'samples': 18953472, 'steps': 98715, 'loss/train': 1.7343283891677856} 11/07/2021 11:06:04 - INFO - __main__ - Step 98717: {'lr': 0.0001340694065652111, 'samples': 18953664, 'steps': 98716, 'loss/train': 1.0550068616867065} 11/07/2021 11:06:04 - INFO - __main__ - Step 98718: {'lr': 0.000134064704918979, 'samples': 18953856, 'steps': 98717, 'loss/train': 0.906725287437439} 11/07/2021 11:06:05 - INFO - __main__ - Step 98719: {'lr': 0.00013406000332498552, 'samples': 18954048, 'steps': 98718, 'loss/train': 0.5252923369407654} 11/07/2021 11:06:06 - INFO - __main__ - Step 98720: {'lr': 0.00013405530178323282, 'samples': 18954240, 'steps': 98719, 'loss/train': 1.6318130493164062} 11/07/2021 11:06:06 - INFO - __main__ - Step 98721: {'lr': 0.00013405060029372307, 'samples': 18954432, 'steps': 98720, 'loss/train': 1.4475773572921753} 11/07/2021 11:06:06 - INFO - __main__ - Step 98722: {'lr': 0.00013404589885645827, 'samples': 18954624, 'steps': 98721, 'loss/train': 1.5581598281860352} 11/07/2021 11:06:07 - INFO - __main__ - Step 98723: {'lr': 0.00013404119747144062, 'samples': 18954816, 'steps': 98722, 'loss/train': 1.4973070621490479} 11/07/2021 11:06:08 - INFO - __main__ - Step 98724: {'lr': 0.0001340364961386722, 'samples': 18955008, 'steps': 98723, 'loss/train': 0.11995498090982437} 11/07/2021 11:06:08 - INFO - __main__ - Step 98725: {'lr': 0.00013403179485815513, 'samples': 18955200, 'steps': 98724, 'loss/train': 0.9577521085739136} 11/07/2021 11:06:09 - INFO - __main__ - Step 98726: {'lr': 0.0001340270936298916, 'samples': 18955392, 'steps': 98725, 'loss/train': 1.3208978176116943} 11/07/2021 11:06:09 - INFO - __main__ - Step 98727: {'lr': 0.00013402239245388365, 'samples': 18955584, 'steps': 98726, 'loss/train': 0.7239967584609985} 11/07/2021 11:06:09 - INFO - __main__ - Step 98728: {'lr': 0.0001340176913301334, 'samples': 18955776, 'steps': 98727, 'loss/train': 1.6383947134017944} 11/07/2021 11:06:10 - INFO - __main__ - Step 98729: {'lr': 0.000134012990258643, 'samples': 18955968, 'steps': 98728, 'loss/train': 1.4401752948760986} 11/07/2021 11:06:11 - INFO - __main__ - Step 98730: {'lr': 0.00013400828923941467, 'samples': 18956160, 'steps': 98729, 'loss/train': 1.8410325050354004} 11/07/2021 11:06:11 - INFO - __main__ - Step 98731: {'lr': 0.00013400358827245028, 'samples': 18956352, 'steps': 98730, 'loss/train': 0.9787253737449646} 11/07/2021 11:06:11 - INFO - __main__ - Step 98732: {'lr': 0.0001339988873577521, 'samples': 18956544, 'steps': 98731, 'loss/train': 1.401559829711914} 11/07/2021 11:06:12 - INFO - __main__ - Step 98733: {'lr': 0.00013399418649532224, 'samples': 18956736, 'steps': 98732, 'loss/train': 1.59177565574646} 11/07/2021 11:06:13 - INFO - __main__ - Step 98734: {'lr': 0.00013398948568516284, 'samples': 18956928, 'steps': 98733, 'loss/train': 1.1800169944763184} 11/07/2021 11:06:13 - INFO - __main__ - Step 98735: {'lr': 0.00013398478492727595, 'samples': 18957120, 'steps': 98734, 'loss/train': 1.5041781663894653} 11/07/2021 11:06:13 - INFO - __main__ - Step 98736: {'lr': 0.00013398008422166373, 'samples': 18957312, 'steps': 98735, 'loss/train': 1.8988463878631592} 11/07/2021 11:06:14 - INFO - __main__ - Step 98737: {'lr': 0.00013397538356832827, 'samples': 18957504, 'steps': 98736, 'loss/train': 1.1313494443893433} 11/07/2021 11:06:14 - INFO - __main__ - Step 98738: {'lr': 0.00013397068296727173, 'samples': 18957696, 'steps': 98737, 'loss/train': 0.9202689528465271} 11/07/2021 11:06:14 - INFO - __main__ - Step 98739: {'lr': 0.0001339659824184962, 'samples': 18957888, 'steps': 98738, 'loss/train': 1.1398214101791382} 11/07/2021 11:06:15 - INFO - __main__ - Step 98740: {'lr': 0.00013396128192200385, 'samples': 18958080, 'steps': 98739, 'loss/train': 1.2093034982681274} 11/07/2021 11:06:16 - INFO - __main__ - Step 98741: {'lr': 0.0001339565814777967, 'samples': 18958272, 'steps': 98740, 'loss/train': 1.219545841217041} 11/07/2021 11:06:16 - INFO - __main__ - Step 98742: {'lr': 0.00013395188108587697, 'samples': 18958464, 'steps': 98741, 'loss/train': 1.5238360166549683} 11/07/2021 11:06:16 - INFO - __main__ - Step 98743: {'lr': 0.00013394718074624684, 'samples': 18958656, 'steps': 98742, 'loss/train': 1.543504238128662} 11/07/2021 11:06:17 - INFO - __main__ - Step 98744: {'lr': 0.00013394248045890816, 'samples': 18958848, 'steps': 98743, 'loss/train': 1.3256710767745972} 11/07/2021 11:06:18 - INFO - __main__ - Step 98745: {'lr': 0.00013393778022386326, 'samples': 18959040, 'steps': 98744, 'loss/train': 1.2968547344207764} 11/07/2021 11:06:18 - INFO - __main__ - Step 98746: {'lr': 0.0001339330800411142, 'samples': 18959232, 'steps': 98745, 'loss/train': 1.0549952983856201} 11/07/2021 11:06:19 - INFO - __main__ - Step 98747: {'lr': 0.00013392837991066308, 'samples': 18959424, 'steps': 98746, 'loss/train': 1.396835207939148} 11/07/2021 11:06:19 - INFO - __main__ - Step 98748: {'lr': 0.00013392367983251205, 'samples': 18959616, 'steps': 98747, 'loss/train': 0.7862650156021118} 11/07/2021 11:06:19 - INFO - __main__ - Step 98749: {'lr': 0.00013391897980666323, 'samples': 18959808, 'steps': 98748, 'loss/train': 1.5466150045394897} 11/07/2021 11:06:20 - INFO - __main__ - Step 98750: {'lr': 0.0001339142798331187, 'samples': 18960000, 'steps': 98749, 'loss/train': 1.3743772506713867} 11/07/2021 11:06:21 - INFO - __main__ - Step 98751: {'lr': 0.00013390957991188062, 'samples': 18960192, 'steps': 98750, 'loss/train': 1.7422993183135986} 11/07/2021 11:06:21 - INFO - __main__ - Step 98752: {'lr': 0.0001339048800429511, 'samples': 18960384, 'steps': 98751, 'loss/train': 1.2171097993850708} 11/07/2021 11:06:21 - INFO - __main__ - Step 98753: {'lr': 0.00013390018022633223, 'samples': 18960576, 'steps': 98752, 'loss/train': 1.5349522829055786} 11/07/2021 11:06:22 - INFO - __main__ - Step 98754: {'lr': 0.00013389548046202615, 'samples': 18960768, 'steps': 98753, 'loss/train': 0.8951400518417358} 11/07/2021 11:06:23 - INFO - __main__ - Step 98755: {'lr': 0.000133890780750035, 'samples': 18960960, 'steps': 98754, 'loss/train': 1.6464160680770874} 11/07/2021 11:06:23 - INFO - __main__ - Step 98756: {'lr': 0.00013388608109036085, 'samples': 18961152, 'steps': 98755, 'loss/train': 1.7121446132659912} 11/07/2021 11:06:24 - INFO - __main__ - Step 98757: {'lr': 0.00013388138148300594, 'samples': 18961344, 'steps': 98756, 'loss/train': 1.4987651109695435} 11/07/2021 11:06:24 - INFO - __main__ - Step 98758: {'lr': 0.0001338766819279722, 'samples': 18961536, 'steps': 98757, 'loss/train': 1.466565489768982} 11/07/2021 11:06:24 - INFO - __main__ - Step 98759: {'lr': 0.00013387198242526183, 'samples': 18961728, 'steps': 98758, 'loss/train': 1.6043872833251953} 11/07/2021 11:06:25 - INFO - __main__ - Step 98760: {'lr': 0.00013386728297487693, 'samples': 18961920, 'steps': 98759, 'loss/train': 1.0249414443969727} 11/07/2021 11:06:26 - INFO - __main__ - Step 98761: {'lr': 0.00013386258357681968, 'samples': 18962112, 'steps': 98760, 'loss/train': 0.6261943578720093} 11/07/2021 11:06:26 - INFO - __main__ - Step 98762: {'lr': 0.00013385788423109213, 'samples': 18962304, 'steps': 98761, 'loss/train': 1.3574062585830688} 11/07/2021 11:06:26 - INFO - __main__ - Step 98763: {'lr': 0.00013385318493769644, 'samples': 18962496, 'steps': 98762, 'loss/train': 1.2589325904846191} 11/07/2021 11:06:27 - INFO - __main__ - Step 98764: {'lr': 0.0001338484856966347, 'samples': 18962688, 'steps': 98763, 'loss/train': 0.9914019107818604} 11/07/2021 11:06:27 - INFO - __main__ - Step 98765: {'lr': 0.00013384378650790907, 'samples': 18962880, 'steps': 98764, 'loss/train': 1.9698034524917603} 11/07/2021 11:06:28 - INFO - __main__ - Step 98766: {'lr': 0.00013383908737152163, 'samples': 18963072, 'steps': 98765, 'loss/train': 0.8805155754089355} 11/07/2021 11:06:28 - INFO - __main__ - Step 98767: {'lr': 0.00013383438828747446, 'samples': 18963264, 'steps': 98766, 'loss/train': 1.4417974948883057} 11/07/2021 11:06:29 - INFO - __main__ - Step 98768: {'lr': 0.00013382968925576978, 'samples': 18963456, 'steps': 98767, 'loss/train': 1.441521406173706} 11/07/2021 11:06:29 - INFO - __main__ - Step 98769: {'lr': 0.0001338249902764096, 'samples': 18963648, 'steps': 98768, 'loss/train': 1.0816618204116821} 11/07/2021 11:06:30 - INFO - __main__ - Step 98770: {'lr': 0.0001338202913493962, 'samples': 18963840, 'steps': 98769, 'loss/train': 2.2670013904571533} 11/07/2021 11:06:31 - INFO - __main__ - Step 98771: {'lr': 0.0001338155924747315, 'samples': 18964032, 'steps': 98770, 'loss/train': 1.4098728895187378} 11/07/2021 11:06:31 - INFO - __main__ - Step 98772: {'lr': 0.00013381089365241769, 'samples': 18964224, 'steps': 98771, 'loss/train': 1.401306390762329} 11/07/2021 11:06:31 - INFO - __main__ - Step 98773: {'lr': 0.00013380619488245692, 'samples': 18964416, 'steps': 98772, 'loss/train': 1.4458569288253784} 11/07/2021 11:06:32 - INFO - __main__ - Step 98774: {'lr': 0.00013380149616485127, 'samples': 18964608, 'steps': 98773, 'loss/train': 1.1713227033615112} 11/07/2021 11:06:32 - INFO - __main__ - Step 98775: {'lr': 0.00013379679749960286, 'samples': 18964800, 'steps': 98774, 'loss/train': 1.356184720993042} 11/07/2021 11:06:32 - INFO - __main__ - Step 98776: {'lr': 0.00013379209888671385, 'samples': 18964992, 'steps': 98775, 'loss/train': 1.9158776998519897} 11/07/2021 11:06:33 - INFO - __main__ - Step 98777: {'lr': 0.00013378740032618627, 'samples': 18965184, 'steps': 98776, 'loss/train': 1.7483781576156616} 11/07/2021 11:06:34 - INFO - __main__ - Step 98778: {'lr': 0.00013378270181802233, 'samples': 18965376, 'steps': 98777, 'loss/train': 1.3530482053756714} 11/07/2021 11:06:34 - INFO - __main__ - Step 98779: {'lr': 0.00013377800336222413, 'samples': 18965568, 'steps': 98778, 'loss/train': 1.7070297002792358} 11/07/2021 11:06:34 - INFO - __main__ - Step 98780: {'lr': 0.00013377330495879374, 'samples': 18965760, 'steps': 98779, 'loss/train': 1.2155473232269287} 11/07/2021 11:06:35 - INFO - __main__ - Step 98781: {'lr': 0.00013376860660773332, 'samples': 18965952, 'steps': 98780, 'loss/train': 1.3019492626190186} 11/07/2021 11:06:36 - INFO - __main__ - Step 98782: {'lr': 0.00013376390830904496, 'samples': 18966144, 'steps': 98781, 'loss/train': 0.6799478530883789} 11/07/2021 11:06:36 - INFO - __main__ - Step 98783: {'lr': 0.0001337592100627308, 'samples': 18966336, 'steps': 98782, 'loss/train': 1.4510352611541748} 11/07/2021 11:06:36 - INFO - __main__ - Step 98784: {'lr': 0.00013375451186879307, 'samples': 18966528, 'steps': 98783, 'loss/train': 1.4570521116256714} 11/07/2021 11:06:37 - INFO - __main__ - Step 98785: {'lr': 0.0001337498137272336, 'samples': 18966720, 'steps': 98784, 'loss/train': 1.3084620237350464} 11/07/2021 11:06:37 - INFO - __main__ - Step 98786: {'lr': 0.00013374511563805472, 'samples': 18966912, 'steps': 98785, 'loss/train': 1.0036935806274414} 11/07/2021 11:06:38 - INFO - __main__ - Step 98787: {'lr': 0.00013374041760125848, 'samples': 18967104, 'steps': 98786, 'loss/train': 1.1004550457000732} 11/07/2021 11:06:39 - INFO - __main__ - Step 98788: {'lr': 0.00013373571961684702, 'samples': 18967296, 'steps': 98787, 'loss/train': 1.2726954221725464} 11/07/2021 11:06:39 - INFO - __main__ - Step 98789: {'lr': 0.00013373102168482245, 'samples': 18967488, 'steps': 98788, 'loss/train': 2.00880765914917} 11/07/2021 11:06:39 - INFO - __main__ - Step 98790: {'lr': 0.0001337263238051869, 'samples': 18967680, 'steps': 98789, 'loss/train': 1.2703149318695068} 11/07/2021 11:06:40 - INFO - __main__ - Step 98791: {'lr': 0.00013372162597794247, 'samples': 18967872, 'steps': 98790, 'loss/train': 1.268029808998108} 11/07/2021 11:06:41 - INFO - __main__ - Step 98792: {'lr': 0.00013371692820309124, 'samples': 18968064, 'steps': 98791, 'loss/train': 1.2850496768951416} 11/07/2021 11:06:41 - INFO - __main__ - Step 98793: {'lr': 0.00013371223048063541, 'samples': 18968256, 'steps': 98792, 'loss/train': 1.6684579849243164} 11/07/2021 11:06:42 - INFO - __main__ - Step 98794: {'lr': 0.00013370753281057704, 'samples': 18968448, 'steps': 98793, 'loss/train': 1.143832802772522} 11/07/2021 11:06:42 - INFO - __main__ - Step 98795: {'lr': 0.00013370283519291827, 'samples': 18968640, 'steps': 98794, 'loss/train': 1.336119532585144} 11/07/2021 11:06:42 - INFO - __main__ - Step 98796: {'lr': 0.00013369813762766119, 'samples': 18968832, 'steps': 98795, 'loss/train': 1.8452199697494507} 11/07/2021 11:06:43 - INFO - __main__ - Step 98797: {'lr': 0.00013369344011480806, 'samples': 18969024, 'steps': 98796, 'loss/train': 1.8293145895004272} 11/07/2021 11:06:44 - INFO - __main__ - Step 98798: {'lr': 0.00013368874265436075, 'samples': 18969216, 'steps': 98797, 'loss/train': 1.4946632385253906} 11/07/2021 11:06:44 - INFO - __main__ - Step 98799: {'lr': 0.0001336840452463215, 'samples': 18969408, 'steps': 98798, 'loss/train': 1.3776699304580688} 11/07/2021 11:06:45 - INFO - __main__ - Step 98800: {'lr': 0.00013367934789069246, 'samples': 18969600, 'steps': 98799, 'loss/train': 0.6545036435127258} 11/07/2021 11:06:45 - INFO - __main__ - Step 98801: {'lr': 0.00013367465058747565, 'samples': 18969792, 'steps': 98800, 'loss/train': 1.4942432641983032} 11/07/2021 11:06:45 - INFO - __main__ - Step 98802: {'lr': 0.0001336699533366733, 'samples': 18969984, 'steps': 98801, 'loss/train': 1.9809859991073608} 11/07/2021 11:06:47 - INFO - __main__ - Step 98803: {'lr': 0.00013366525613828746, 'samples': 18970176, 'steps': 98802, 'loss/train': 1.3460417985916138} 11/07/2021 11:06:47 - INFO - __main__ - Step 98804: {'lr': 0.00013366055899232025, 'samples': 18970368, 'steps': 98803, 'loss/train': 5.392951488494873} 11/07/2021 11:06:48 - INFO - __main__ - Step 98805: {'lr': 0.00013365586189877378, 'samples': 18970560, 'steps': 98804, 'loss/train': 5.409395694732666} 11/07/2021 11:06:48 - INFO - __main__ - Step 98806: {'lr': 0.00013365116485765022, 'samples': 18970752, 'steps': 98805, 'loss/train': 5.48921537399292} 11/07/2021 11:06:48 - INFO - __main__ - Step 98807: {'lr': 0.00013364646786895163, 'samples': 18970944, 'steps': 98806, 'loss/train': 1.4246535301208496} 11/07/2021 11:06:49 - INFO - __main__ - Step 98808: {'lr': 0.00013364177093268015, 'samples': 18971136, 'steps': 98807, 'loss/train': 1.036902904510498} 11/07/2021 11:06:49 - INFO - __main__ - Step 98809: {'lr': 0.0001336370740488379, 'samples': 18971328, 'steps': 98808, 'loss/train': 1.6191954612731934} 11/07/2021 11:06:50 - INFO - __main__ - Step 98810: {'lr': 0.00013363237721742696, 'samples': 18971520, 'steps': 98809, 'loss/train': 1.5064979791641235} 11/07/2021 11:06:51 - INFO - __main__ - Step 98811: {'lr': 0.00013362768043844958, 'samples': 18971712, 'steps': 98810, 'loss/train': 0.15023018419742584} 11/07/2021 11:06:51 - INFO - __main__ - Step 98812: {'lr': 0.0001336229837119077, 'samples': 18971904, 'steps': 98811, 'loss/train': 1.0602070093154907} 11/07/2021 11:06:51 - INFO - __main__ - Step 98813: {'lr': 0.0001336182870378035, 'samples': 18972096, 'steps': 98812, 'loss/train': 1.6279008388519287} 11/07/2021 11:06:52 - INFO - __main__ - Step 98814: {'lr': 0.0001336135904161391, 'samples': 18972288, 'steps': 98813, 'loss/train': 1.0561387538909912} 11/07/2021 11:06:53 - INFO - __main__ - Step 98815: {'lr': 0.0001336088938469166, 'samples': 18972480, 'steps': 98814, 'loss/train': 1.5785083770751953} 11/07/2021 11:06:53 - INFO - __main__ - Step 98816: {'lr': 0.00013360419733013818, 'samples': 18972672, 'steps': 98815, 'loss/train': 1.3869513273239136} 11/07/2021 11:06:53 - INFO - __main__ - Step 98817: {'lr': 0.0001335995008658059, 'samples': 18972864, 'steps': 98816, 'loss/train': 1.445678949356079} 11/07/2021 11:06:54 - INFO - __main__ - Step 98818: {'lr': 0.00013359480445392186, 'samples': 18973056, 'steps': 98817, 'loss/train': 1.6802738904953003} 11/07/2021 11:06:54 - INFO - __main__ - Step 98819: {'lr': 0.00013359010809448825, 'samples': 18973248, 'steps': 98818, 'loss/train': 1.1624425649642944} 11/07/2021 11:06:56 - INFO - __main__ - Step 98820: {'lr': 0.00013358541178750712, 'samples': 18973440, 'steps': 98819, 'loss/train': 1.4052388668060303} 11/07/2021 11:06:56 - INFO - __main__ - Step 98821: {'lr': 0.00013358071553298055, 'samples': 18973632, 'steps': 98820, 'loss/train': 1.3414870500564575} 11/07/2021 11:06:56 - INFO - __main__ - Step 98822: {'lr': 0.00013357601933091078, 'samples': 18973824, 'steps': 98821, 'loss/train': 1.8461042642593384} 11/07/2021 11:06:57 - INFO - __main__ - Step 98823: {'lr': 0.00013357132318129984, 'samples': 18974016, 'steps': 98822, 'loss/train': 1.3568247556686401} 11/07/2021 11:06:57 - INFO - __main__ - Step 98824: {'lr': 0.00013356662708414996, 'samples': 18974208, 'steps': 98823, 'loss/train': 1.6505045890808105} 11/07/2021 11:06:58 - INFO - __main__ - Step 98825: {'lr': 0.00013356193103946306, 'samples': 18974400, 'steps': 98824, 'loss/train': 0.24194471538066864} 11/07/2021 11:06:58 - INFO - __main__ - Step 98826: {'lr': 0.00013355723504724138, 'samples': 18974592, 'steps': 98825, 'loss/train': 1.4591789245605469} 11/07/2021 11:06:59 - INFO - __main__ - Step 98827: {'lr': 0.000133552539107487, 'samples': 18974784, 'steps': 98826, 'loss/train': 0.955756425857544} 11/07/2021 11:06:59 - INFO - __main__ - Step 98828: {'lr': 0.00013354784322020202, 'samples': 18974976, 'steps': 98827, 'loss/train': 1.099303960800171} 11/07/2021 11:06:59 - INFO - __main__ - Step 98829: {'lr': 0.00013354314738538863, 'samples': 18975168, 'steps': 98828, 'loss/train': 1.3938204050064087} 11/07/2021 11:07:00 - INFO - __main__ - Step 98830: {'lr': 0.0001335384516030489, 'samples': 18975360, 'steps': 98829, 'loss/train': 1.169142723083496} 11/07/2021 11:07:01 - INFO - __main__ - Step 98831: {'lr': 0.00013353375587318492, 'samples': 18975552, 'steps': 98830, 'loss/train': 1.5019887685775757} 11/07/2021 11:07:01 - INFO - __main__ - Step 98832: {'lr': 0.00013352906019579885, 'samples': 18975744, 'steps': 98831, 'loss/train': 1.4732635021209717} 11/07/2021 11:07:01 - INFO - __main__ - Step 98833: {'lr': 0.00013352436457089278, 'samples': 18975936, 'steps': 98832, 'loss/train': 1.0529882907867432} 11/07/2021 11:07:02 - INFO - __main__ - Step 98834: {'lr': 0.00013351966899846884, 'samples': 18976128, 'steps': 98833, 'loss/train': 1.4983659982681274} 11/07/2021 11:07:02 - INFO - __main__ - Step 98835: {'lr': 0.00013351497347852912, 'samples': 18976320, 'steps': 98834, 'loss/train': 5.829855442047119} 11/07/2021 11:07:03 - INFO - __main__ - Step 98836: {'lr': 0.0001335102780110758, 'samples': 18976512, 'steps': 98835, 'loss/train': 2.0175769329071045} 11/07/2021 11:07:04 - INFO - __main__ - Step 98837: {'lr': 0.00013350558259611102, 'samples': 18976704, 'steps': 98836, 'loss/train': 1.6692309379577637} 11/07/2021 11:07:04 - INFO - __main__ - Step 98838: {'lr': 0.00013350088723363668, 'samples': 18976896, 'steps': 98837, 'loss/train': 1.2457144260406494} 11/07/2021 11:07:04 - INFO - __main__ - Step 98839: {'lr': 0.00013349619192365512, 'samples': 18977088, 'steps': 98838, 'loss/train': 0.812110424041748} 11/07/2021 11:07:05 - INFO - __main__ - Step 98840: {'lr': 0.00013349149666616833, 'samples': 18977280, 'steps': 98839, 'loss/train': 1.152550220489502} 11/07/2021 11:07:06 - INFO - __main__ - Step 98841: {'lr': 0.0001334868014611785, 'samples': 18977472, 'steps': 98840, 'loss/train': 1.305818796157837} 11/07/2021 11:07:06 - INFO - __main__ - Step 98842: {'lr': 0.00013348210630868772, 'samples': 18977664, 'steps': 98841, 'loss/train': 1.4927036762237549} 11/07/2021 11:07:07 - INFO - __main__ - Step 98843: {'lr': 0.0001334774112086981, 'samples': 18977856, 'steps': 98842, 'loss/train': 1.5124403238296509} 11/07/2021 11:07:07 - INFO - __main__ - Step 98844: {'lr': 0.00013347271616121175, 'samples': 18978048, 'steps': 98843, 'loss/train': 1.117116928100586} 11/07/2021 11:07:07 - INFO - __main__ - Step 98845: {'lr': 0.0001334680211662308, 'samples': 18978240, 'steps': 98844, 'loss/train': 1.6971997022628784} 11/07/2021 11:07:08 - INFO - __main__ - Step 98846: {'lr': 0.00013346332622375735, 'samples': 18978432, 'steps': 98845, 'loss/train': 1.2651689052581787} 11/07/2021 11:07:08 - INFO - __main__ - Step 98847: {'lr': 0.00013345863133379355, 'samples': 18978624, 'steps': 98846, 'loss/train': 1.551964282989502} 11/07/2021 11:07:09 - INFO - __main__ - Step 98848: {'lr': 0.0001334539364963415, 'samples': 18978816, 'steps': 98847, 'loss/train': 1.740263819694519} 11/07/2021 11:07:09 - INFO - __main__ - Step 98849: {'lr': 0.00013344924171140326, 'samples': 18979008, 'steps': 98848, 'loss/train': 1.7990092039108276} 11/07/2021 11:07:10 - INFO - __main__ - Step 98850: {'lr': 0.00013344454697898108, 'samples': 18979200, 'steps': 98849, 'loss/train': 1.1353286504745483} 11/07/2021 11:07:10 - INFO - __main__ - Step 98851: {'lr': 0.00013343985229907703, 'samples': 18979392, 'steps': 98850, 'loss/train': 1.6245990991592407} 11/07/2021 11:07:10 - INFO - __main__ - Step 98852: {'lr': 0.00013343515767169306, 'samples': 18979584, 'steps': 98851, 'loss/train': 1.3029429912567139} 11/07/2021 11:07:12 - INFO - __main__ - Step 98853: {'lr': 0.00013343046309683145, 'samples': 18979776, 'steps': 98852, 'loss/train': 1.4289370775222778} 11/07/2021 11:07:12 - INFO - __main__ - Step 98854: {'lr': 0.00013342576857449423, 'samples': 18979968, 'steps': 98853, 'loss/train': 1.5419708490371704} 11/07/2021 11:07:12 - INFO - __main__ - Step 98855: {'lr': 0.0001334210741046836, 'samples': 18980160, 'steps': 98854, 'loss/train': 1.265575885772705} 11/07/2021 11:07:13 - INFO - __main__ - Step 98856: {'lr': 0.0001334163796874016, 'samples': 18980352, 'steps': 98855, 'loss/train': 1.2394353151321411} 11/07/2021 11:07:13 - INFO - __main__ - Step 98857: {'lr': 0.00013341168532265044, 'samples': 18980544, 'steps': 98856, 'loss/train': 1.1663039922714233} 11/07/2021 11:07:13 - INFO - __main__ - Step 98858: {'lr': 0.00013340699101043214, 'samples': 18980736, 'steps': 98857, 'loss/train': 1.4848909378051758} 11/07/2021 11:07:14 - INFO - __main__ - Step 98859: {'lr': 0.00013340229675074883, 'samples': 18980928, 'steps': 98858, 'loss/train': 1.7018697261810303} 11/07/2021 11:07:15 - INFO - __main__ - Step 98860: {'lr': 0.00013339760254360268, 'samples': 18981120, 'steps': 98859, 'loss/train': 1.463150143623352} 11/07/2021 11:07:15 - INFO - __main__ - Step 98861: {'lr': 0.00013339290838899575, 'samples': 18981312, 'steps': 98860, 'loss/train': 1.1831055879592896} 11/07/2021 11:07:15 - INFO - __main__ - Step 98862: {'lr': 0.0001333882142869302, 'samples': 18981504, 'steps': 98861, 'loss/train': 1.5491455793380737} 11/07/2021 11:07:16 - INFO - __main__ - Step 98863: {'lr': 0.0001333835202374081, 'samples': 18981696, 'steps': 98862, 'loss/train': 1.5236234664916992} 11/07/2021 11:07:17 - INFO - __main__ - Step 98864: {'lr': 0.00013337882624043167, 'samples': 18981888, 'steps': 98863, 'loss/train': 1.1546951532363892} 11/07/2021 11:07:17 - INFO - __main__ - Step 98865: {'lr': 0.0001333741322960029, 'samples': 18982080, 'steps': 98864, 'loss/train': 1.1393694877624512} 11/07/2021 11:07:17 - INFO - __main__ - Step 98866: {'lr': 0.00013336943840412392, 'samples': 18982272, 'steps': 98865, 'loss/train': 1.4670021533966064} 11/07/2021 11:07:18 - INFO - __main__ - Step 98867: {'lr': 0.00013336474456479685, 'samples': 18982464, 'steps': 98866, 'loss/train': 1.7206683158874512} 11/07/2021 11:07:18 - INFO - __main__ - Step 98868: {'lr': 0.00013336005077802383, 'samples': 18982656, 'steps': 98867, 'loss/train': 1.6796690225601196} 11/07/2021 11:07:19 - INFO - __main__ - Step 98869: {'lr': 0.00013335535704380697, 'samples': 18982848, 'steps': 98868, 'loss/train': 1.2066653966903687} 11/07/2021 11:07:19 - INFO - __main__ - Step 98870: {'lr': 0.0001333506633621484, 'samples': 18983040, 'steps': 98869, 'loss/train': 1.3114200830459595} 11/07/2021 11:07:20 - INFO - __main__ - Step 98871: {'lr': 0.00013334596973305025, 'samples': 18983232, 'steps': 98870, 'loss/train': 1.4587035179138184} 11/07/2021 11:07:20 - INFO - __main__ - Step 98872: {'lr': 0.00013334127615651452, 'samples': 18983424, 'steps': 98871, 'loss/train': 1.1591172218322754} 11/07/2021 11:07:21 - INFO - __main__ - Step 98873: {'lr': 0.00013333658263254351, 'samples': 18983616, 'steps': 98872, 'loss/train': 1.422946572303772} 11/07/2021 11:07:22 - INFO - __main__ - Step 98874: {'lr': 0.00013333188916113918, 'samples': 18983808, 'steps': 98873, 'loss/train': 1.4018237590789795} 11/07/2021 11:07:22 - INFO - __main__ - Step 98875: {'lr': 0.0001333271957423037, 'samples': 18984000, 'steps': 98874, 'loss/train': 0.8683300614356995} 11/07/2021 11:07:22 - INFO - __main__ - Step 98876: {'lr': 0.00013332250237603921, 'samples': 18984192, 'steps': 98875, 'loss/train': 1.5728108882904053} 11/07/2021 11:07:23 - INFO - __main__ - Step 98877: {'lr': 0.00013331780906234775, 'samples': 18984384, 'steps': 98876, 'loss/train': 1.0625882148742676} 11/07/2021 11:07:23 - INFO - __main__ - Step 98878: {'lr': 0.00013331311580123162, 'samples': 18984576, 'steps': 98877, 'loss/train': 0.953425407409668} 11/07/2021 11:07:24 - INFO - __main__ - Step 98879: {'lr': 0.00013330842259269272, 'samples': 18984768, 'steps': 98878, 'loss/train': 1.6611846685409546} 11/07/2021 11:07:24 - INFO - __main__ - Step 98880: {'lr': 0.00013330372943673322, 'samples': 18984960, 'steps': 98879, 'loss/train': 0.8614981770515442} 11/07/2021 11:07:25 - INFO - __main__ - Step 98881: {'lr': 0.00013329903633335527, 'samples': 18985152, 'steps': 98880, 'loss/train': 1.934775710105896} 11/07/2021 11:07:25 - INFO - __main__ - Step 98882: {'lr': 0.00013329434328256096, 'samples': 18985344, 'steps': 98881, 'loss/train': 1.0834996700286865} 11/07/2021 11:07:25 - INFO - __main__ - Step 98883: {'lr': 0.0001332896502843524, 'samples': 18985536, 'steps': 98882, 'loss/train': 1.4074729681015015} 11/07/2021 11:07:26 - INFO - __main__ - Step 98884: {'lr': 0.00013328495733873176, 'samples': 18985728, 'steps': 98883, 'loss/train': 1.2623623609542847} 11/07/2021 11:07:27 - INFO - __main__ - Step 98885: {'lr': 0.00013328026444570112, 'samples': 18985920, 'steps': 98884, 'loss/train': 1.3718767166137695} 11/07/2021 11:07:27 - INFO - __main__ - Step 98886: {'lr': 0.00013327557160526255, 'samples': 18986112, 'steps': 98885, 'loss/train': 1.0808050632476807} 11/07/2021 11:07:27 - INFO - __main__ - Step 98887: {'lr': 0.00013327087881741823, 'samples': 18986304, 'steps': 98886, 'loss/train': 1.692151665687561} 11/07/2021 11:07:28 - INFO - __main__ - Step 98888: {'lr': 0.00013326618608217028, 'samples': 18986496, 'steps': 98887, 'loss/train': 1.3719494342803955} 11/07/2021 11:07:29 - INFO - __main__ - Step 98889: {'lr': 0.00013326149339952075, 'samples': 18986688, 'steps': 98888, 'loss/train': 1.4654566049575806} 11/07/2021 11:07:29 - INFO - __main__ - Step 98890: {'lr': 0.00013325680076947178, 'samples': 18986880, 'steps': 98889, 'loss/train': 1.0779192447662354} 11/07/2021 11:07:30 - INFO - __main__ - Step 98891: {'lr': 0.0001332521081920256, 'samples': 18987072, 'steps': 98890, 'loss/train': 1.2335752248764038} 11/07/2021 11:07:30 - INFO - __main__ - Step 98892: {'lr': 0.00013324741566718415, 'samples': 18987264, 'steps': 98891, 'loss/train': 1.4358158111572266} 11/07/2021 11:07:30 - INFO - __main__ - Step 98893: {'lr': 0.0001332427231949496, 'samples': 18987456, 'steps': 98892, 'loss/train': 1.638058066368103} 11/07/2021 11:07:31 - INFO - __main__ - Step 98894: {'lr': 0.00013323803077532406, 'samples': 18987648, 'steps': 98893, 'loss/train': 1.17609441280365} 11/07/2021 11:07:32 - INFO - __main__ - Step 98895: {'lr': 0.00013323333840830967, 'samples': 18987840, 'steps': 98894, 'loss/train': 1.6074554920196533} 11/07/2021 11:07:32 - INFO - __main__ - Step 98896: {'lr': 0.00013322864609390856, 'samples': 18988032, 'steps': 98895, 'loss/train': 1.4181350469589233} 11/07/2021 11:07:32 - INFO - __main__ - Step 98897: {'lr': 0.00013322395383212276, 'samples': 18988224, 'steps': 98896, 'loss/train': 1.134456753730774} 11/07/2021 11:07:33 - INFO - __main__ - Step 98898: {'lr': 0.00013321926162295451, 'samples': 18988416, 'steps': 98897, 'loss/train': 1.4034196138381958} 11/07/2021 11:07:34 - INFO - __main__ - Step 98899: {'lr': 0.00013321456946640582, 'samples': 18988608, 'steps': 98898, 'loss/train': 1.361863374710083} 11/07/2021 11:07:34 - INFO - __main__ - Step 98900: {'lr': 0.00013320987736247886, 'samples': 18988800, 'steps': 98899, 'loss/train': 1.4545414447784424} 11/07/2021 11:07:34 - INFO - __main__ - Step 98901: {'lr': 0.0001332051853111757, 'samples': 18988992, 'steps': 98900, 'loss/train': 1.532667636871338} 11/07/2021 11:07:35 - INFO - __main__ - Step 98902: {'lr': 0.0001332004933124985, 'samples': 18989184, 'steps': 98901, 'loss/train': 0.6742056012153625} 11/07/2021 11:07:35 - INFO - __main__ - Step 98903: {'lr': 0.00013319580136644948, 'samples': 18989376, 'steps': 98902, 'loss/train': 1.474034070968628} 11/07/2021 11:07:36 - INFO - __main__ - Step 98904: {'lr': 0.00013319110947303047, 'samples': 18989568, 'steps': 98903, 'loss/train': 1.772411823272705} 11/07/2021 11:07:37 - INFO - __main__ - Step 98905: {'lr': 0.00013318641763224382, 'samples': 18989760, 'steps': 98904, 'loss/train': 1.4050512313842773} 11/07/2021 11:07:37 - INFO - __main__ - Step 98906: {'lr': 0.0001331817258440915, 'samples': 18989952, 'steps': 98905, 'loss/train': 1.2343024015426636} 11/07/2021 11:07:37 - INFO - __main__ - Step 98907: {'lr': 0.00013317703410857572, 'samples': 18990144, 'steps': 98906, 'loss/train': 1.3367795944213867} 11/07/2021 11:07:38 - INFO - __main__ - Step 98908: {'lr': 0.0001331723424256986, 'samples': 18990336, 'steps': 98907, 'loss/train': 1.4437364339828491} 11/07/2021 11:07:39 - INFO - __main__ - Step 98909: {'lr': 0.00013316765079546218, 'samples': 18990528, 'steps': 98908, 'loss/train': 1.5868799686431885} 11/07/2021 11:07:39 - INFO - __main__ - Step 98910: {'lr': 0.00013316295921786858, 'samples': 18990720, 'steps': 98909, 'loss/train': 1.2452229261398315} 11/07/2021 11:07:39 - INFO - __main__ - Step 98911: {'lr': 0.00013315826769292, 'samples': 18990912, 'steps': 98910, 'loss/train': 1.4686626195907593} 11/07/2021 11:07:40 - INFO - __main__ - Step 98912: {'lr': 0.0001331535762206185, 'samples': 18991104, 'steps': 98911, 'loss/train': 1.5746128559112549} 11/07/2021 11:07:40 - INFO - __main__ - Step 98913: {'lr': 0.00013314888480096617, 'samples': 18991296, 'steps': 98912, 'loss/train': 1.1579476594924927} 11/07/2021 11:07:41 - INFO - __main__ - Step 98914: {'lr': 0.00013314419343396527, 'samples': 18991488, 'steps': 98913, 'loss/train': 1.3822352886199951} 11/07/2021 11:07:41 - INFO - __main__ - Step 98915: {'lr': 0.00013313950211961767, 'samples': 18991680, 'steps': 98914, 'loss/train': 1.4555784463882446} 11/07/2021 11:07:42 - INFO - __main__ - Step 98916: {'lr': 0.00013313481085792565, 'samples': 18991872, 'steps': 98915, 'loss/train': 1.4017186164855957} 11/07/2021 11:07:42 - INFO - __main__ - Step 98917: {'lr': 0.00013313011964889124, 'samples': 18992064, 'steps': 98916, 'loss/train': 1.5393035411834717} 11/07/2021 11:07:42 - INFO - __main__ - Step 98918: {'lr': 0.00013312542849251664, 'samples': 18992256, 'steps': 98917, 'loss/train': 1.3945894241333008} 11/07/2021 11:07:43 - INFO - __main__ - Step 98919: {'lr': 0.00013312073738880388, 'samples': 18992448, 'steps': 98918, 'loss/train': 1.3436654806137085} 11/07/2021 11:07:44 - INFO - __main__ - Step 98920: {'lr': 0.0001331160463377551, 'samples': 18992640, 'steps': 98919, 'loss/train': 1.231615662574768} 11/07/2021 11:07:44 - INFO - __main__ - Step 98921: {'lr': 0.00013311135533937248, 'samples': 18992832, 'steps': 98920, 'loss/train': 1.403511881828308} 11/07/2021 11:07:44 - INFO - __main__ - Step 98922: {'lr': 0.000133106664393658, 'samples': 18993024, 'steps': 98921, 'loss/train': 1.7495125532150269} 11/07/2021 11:07:45 - INFO - __main__ - Step 98923: {'lr': 0.00013310197350061391, 'samples': 18993216, 'steps': 98922, 'loss/train': 1.618675947189331} 11/07/2021 11:07:46 - INFO - __main__ - Step 98924: {'lr': 0.00013309728266024223, 'samples': 18993408, 'steps': 98923, 'loss/train': 1.1150457859039307} 11/07/2021 11:07:46 - INFO - __main__ - Step 98925: {'lr': 0.00013309259187254524, 'samples': 18993600, 'steps': 98924, 'loss/train': 1.188405156135559} 11/07/2021 11:07:46 - INFO - __main__ - Step 98926: {'lr': 0.00013308790113752484, 'samples': 18993792, 'steps': 98925, 'loss/train': 1.397242784500122} 11/07/2021 11:07:47 - INFO - __main__ - Step 98927: {'lr': 0.00013308321045518321, 'samples': 18993984, 'steps': 98926, 'loss/train': 0.9505614638328552} 11/07/2021 11:07:47 - INFO - __main__ - Step 98928: {'lr': 0.0001330785198255225, 'samples': 18994176, 'steps': 98927, 'loss/train': 1.5267914533615112} 11/07/2021 11:07:48 - INFO - __main__ - Step 98929: {'lr': 0.00013307382924854477, 'samples': 18994368, 'steps': 98928, 'loss/train': 1.4826369285583496} 11/07/2021 11:07:49 - INFO - __main__ - Step 98930: {'lr': 0.00013306913872425217, 'samples': 18994560, 'steps': 98929, 'loss/train': 1.2125869989395142} 11/07/2021 11:07:49 - INFO - __main__ - Step 98931: {'lr': 0.00013306444825264682, 'samples': 18994752, 'steps': 98930, 'loss/train': 1.4922040700912476} 11/07/2021 11:07:49 - INFO - __main__ - Step 98932: {'lr': 0.00013305975783373082, 'samples': 18994944, 'steps': 98931, 'loss/train': 1.2531720399856567} 11/07/2021 11:07:50 - INFO - __main__ - Step 98933: {'lr': 0.0001330550674675063, 'samples': 18995136, 'steps': 98932, 'loss/train': 1.601885437965393} 11/07/2021 11:07:50 - INFO - __main__ - Step 98934: {'lr': 0.00013305037715397535, 'samples': 18995328, 'steps': 98933, 'loss/train': 1.5709223747253418} 11/07/2021 11:07:51 - INFO - __main__ - Step 98935: {'lr': 0.00013304568689314012, 'samples': 18995520, 'steps': 98934, 'loss/train': 1.4046754837036133} 11/07/2021 11:07:51 - INFO - __main__ - Step 98936: {'lr': 0.0001330409966850028, 'samples': 18995712, 'steps': 98935, 'loss/train': 1.4583287239074707} 11/07/2021 11:07:52 - INFO - __main__ - Step 98937: {'lr': 0.00013303630652956527, 'samples': 18995904, 'steps': 98936, 'loss/train': 1.0391227006912231} 11/07/2021 11:07:52 - INFO - __main__ - Step 98938: {'lr': 0.00013303161642682978, 'samples': 18996096, 'steps': 98937, 'loss/train': 1.2989174127578735} 11/07/2021 11:07:52 - INFO - __main__ - Step 98939: {'lr': 0.00013302692637679847, 'samples': 18996288, 'steps': 98938, 'loss/train': 1.3374288082122803} 11/07/2021 11:07:54 - INFO - __main__ - Step 98940: {'lr': 0.0001330222363794734, 'samples': 18996480, 'steps': 98939, 'loss/train': 1.2169417142868042} 11/07/2021 11:07:54 - INFO - __main__ - Step 98941: {'lr': 0.0001330175464348567, 'samples': 18996672, 'steps': 98940, 'loss/train': 1.1693474054336548} 11/07/2021 11:07:54 - INFO - __main__ - Step 98942: {'lr': 0.00013301285654295048, 'samples': 18996864, 'steps': 98941, 'loss/train': 1.3135582208633423} 11/07/2021 11:07:55 - INFO - __main__ - Step 98943: {'lr': 0.00013300816670375686, 'samples': 18997056, 'steps': 98942, 'loss/train': 1.221986174583435} 11/07/2021 11:07:55 - INFO - __main__ - Step 98944: {'lr': 0.000133003476917278, 'samples': 18997248, 'steps': 98943, 'loss/train': 1.1261842250823975} 11/07/2021 11:07:56 - INFO - __main__ - Step 98945: {'lr': 0.00013299878718351594, 'samples': 18997440, 'steps': 98944, 'loss/train': 0.44292888045310974} 11/07/2021 11:07:56 - INFO - __main__ - Step 98946: {'lr': 0.00013299409750247283, 'samples': 18997632, 'steps': 98945, 'loss/train': 1.36735200881958} 11/07/2021 11:07:57 - INFO - __main__ - Step 98947: {'lr': 0.00013298940787415087, 'samples': 18997824, 'steps': 98946, 'loss/train': 1.5495185852050781} 11/07/2021 11:07:57 - INFO - __main__ - Step 98948: {'lr': 0.00013298471829855196, 'samples': 18998016, 'steps': 98947, 'loss/train': 0.0884665995836258} 11/07/2021 11:07:57 - INFO - __main__ - Step 98949: {'lr': 0.00013298002877567834, 'samples': 18998208, 'steps': 98948, 'loss/train': 1.2932748794555664} 11/07/2021 11:07:58 - INFO - __main__ - Step 98950: {'lr': 0.00013297533930553212, 'samples': 18998400, 'steps': 98949, 'loss/train': 1.364320993423462} 11/07/2021 11:07:59 - INFO - __main__ - Step 98951: {'lr': 0.0001329706498881154, 'samples': 18998592, 'steps': 98950, 'loss/train': 1.365957498550415} 11/07/2021 11:07:59 - INFO - __main__ - Step 98952: {'lr': 0.0001329659605234303, 'samples': 18998784, 'steps': 98951, 'loss/train': 1.4456590414047241} 11/07/2021 11:08:00 - INFO - __main__ - Step 98953: {'lr': 0.00013296127121147894, 'samples': 18998976, 'steps': 98952, 'loss/train': 1.1648809909820557} 11/07/2021 11:08:00 - INFO - __main__ - Step 98954: {'lr': 0.0001329565819522634, 'samples': 18999168, 'steps': 98953, 'loss/train': 1.4762670993804932} 11/07/2021 11:08:01 - INFO - __main__ - Step 98955: {'lr': 0.00013295189274578585, 'samples': 18999360, 'steps': 98954, 'loss/train': 1.301537036895752} 11/07/2021 11:08:01 - INFO - __main__ - Step 98956: {'lr': 0.00013294720359204837, 'samples': 18999552, 'steps': 98955, 'loss/train': 1.4801439046859741} 11/07/2021 11:08:02 - INFO - __main__ - Step 98957: {'lr': 0.00013294251449105305, 'samples': 18999744, 'steps': 98956, 'loss/train': 0.4797080159187317} 11/07/2021 11:08:02 - INFO - __main__ - Step 98958: {'lr': 0.00013293782544280213, 'samples': 18999936, 'steps': 98957, 'loss/train': 1.9054423570632935} 11/07/2021 11:08:02 - INFO - __main__ - Step 98959: {'lr': 0.00013293313644729753, 'samples': 19000128, 'steps': 98958, 'loss/train': 0.8895014524459839} 11/07/2021 11:08:03 - INFO - __main__ - Step 98960: {'lr': 0.00013292844750454144, 'samples': 19000320, 'steps': 98959, 'loss/train': 2.0311172008514404} 11/07/2021 11:08:04 - INFO - __main__ - Step 98961: {'lr': 0.00013292375861453598, 'samples': 19000512, 'steps': 98960, 'loss/train': 1.2724263668060303} 11/07/2021 11:08:04 - INFO - __main__ - Step 98962: {'lr': 0.0001329190697772833, 'samples': 19000704, 'steps': 98961, 'loss/train': 1.4839279651641846} 11/07/2021 11:08:04 - INFO - __main__ - Step 98963: {'lr': 0.00013291438099278548, 'samples': 19000896, 'steps': 98962, 'loss/train': 1.1840283870697021} 11/07/2021 11:08:05 - INFO - __main__ - Step 98964: {'lr': 0.00013290969226104461, 'samples': 19001088, 'steps': 98963, 'loss/train': 1.5307929515838623} 11/07/2021 11:08:06 - INFO - __main__ - Step 98965: {'lr': 0.00013290500358206282, 'samples': 19001280, 'steps': 98964, 'loss/train': 0.5942177176475525} 11/07/2021 11:08:06 - INFO - __main__ - Step 98966: {'lr': 0.00013290031495584225, 'samples': 19001472, 'steps': 98965, 'loss/train': 1.5449353456497192} 11/07/2021 11:08:07 - INFO - __main__ - Step 98967: {'lr': 0.000132895626382385, 'samples': 19001664, 'steps': 98966, 'loss/train': 1.3344509601593018} 11/07/2021 11:08:07 - INFO - __main__ - Step 98968: {'lr': 0.00013289093786169316, 'samples': 19001856, 'steps': 98967, 'loss/train': 1.1592888832092285} 11/07/2021 11:08:07 - INFO - __main__ - Step 98969: {'lr': 0.00013288624939376882, 'samples': 19002048, 'steps': 98968, 'loss/train': 0.9644434452056885} 11/07/2021 11:08:08 - INFO - __main__ - Step 98970: {'lr': 0.00013288156097861415, 'samples': 19002240, 'steps': 98969, 'loss/train': 1.5489603281021118} 11/07/2021 11:08:09 - INFO - __main__ - Step 98971: {'lr': 0.00013287687261623126, 'samples': 19002432, 'steps': 98970, 'loss/train': 1.6382497549057007} 11/07/2021 11:08:09 - INFO - __main__ - Step 98972: {'lr': 0.00013287218430662234, 'samples': 19002624, 'steps': 98971, 'loss/train': 0.46486666798591614} 11/07/2021 11:08:09 - INFO - __main__ - Step 98973: {'lr': 0.00013286749604978933, 'samples': 19002816, 'steps': 98972, 'loss/train': 1.374839425086975} 11/07/2021 11:08:10 - INFO - __main__ - Step 98974: {'lr': 0.00013286280784573435, 'samples': 19003008, 'steps': 98973, 'loss/train': 1.405431866645813} 11/07/2021 11:08:10 - INFO - __main__ - Step 98975: {'lr': 0.00013285811969445966, 'samples': 19003200, 'steps': 98974, 'loss/train': 1.3674850463867188} 11/07/2021 11:08:11 - INFO - __main__ - Step 98976: {'lr': 0.00013285343159596724, 'samples': 19003392, 'steps': 98975, 'loss/train': 1.5220357179641724} 11/07/2021 11:08:11 - INFO - __main__ - Step 98977: {'lr': 0.00013284874355025929, 'samples': 19003584, 'steps': 98976, 'loss/train': 1.5318806171417236} 11/07/2021 11:08:12 - INFO - __main__ - Step 98978: {'lr': 0.00013284405555733785, 'samples': 19003776, 'steps': 98977, 'loss/train': 1.229413628578186} 11/07/2021 11:08:12 - INFO - __main__ - Step 98979: {'lr': 0.0001328393676172051, 'samples': 19003968, 'steps': 98978, 'loss/train': 1.3811395168304443} 11/07/2021 11:08:13 - INFO - __main__ - Step 98980: {'lr': 0.0001328346797298631, 'samples': 19004160, 'steps': 98979, 'loss/train': 0.7766314744949341} 11/07/2021 11:08:13 - INFO - __main__ - Step 98981: {'lr': 0.000132829991895314, 'samples': 19004352, 'steps': 98980, 'loss/train': 5.797438621520996} 11/07/2021 11:08:14 - INFO - __main__ - Step 98982: {'lr': 0.0001328253041135599, 'samples': 19004544, 'steps': 98981, 'loss/train': 1.551023244857788} 11/07/2021 11:08:14 - INFO - __main__ - Step 98983: {'lr': 0.0001328206163846029, 'samples': 19004736, 'steps': 98982, 'loss/train': 1.2468234300613403} 11/07/2021 11:08:15 - INFO - __main__ - Step 98984: {'lr': 0.00013281592870844513, 'samples': 19004928, 'steps': 98983, 'loss/train': 1.4411767721176147} 11/07/2021 11:08:15 - INFO - __main__ - Step 98985: {'lr': 0.0001328112410850888, 'samples': 19005120, 'steps': 98984, 'loss/train': 1.9022855758666992} 11/07/2021 11:08:15 - INFO - __main__ - Step 98986: {'lr': 0.0001328065535145358, 'samples': 19005312, 'steps': 98985, 'loss/train': 0.9041839838027954} 11/07/2021 11:08:16 - INFO - __main__ - Step 98987: {'lr': 0.00013280186599678838, 'samples': 19005504, 'steps': 98986, 'loss/train': 1.0723118782043457} 11/07/2021 11:08:17 - INFO - __main__ - Step 98988: {'lr': 0.0001327971785318486, 'samples': 19005696, 'steps': 98987, 'loss/train': 1.0079271793365479} 11/07/2021 11:08:17 - INFO - __main__ - Step 98989: {'lr': 0.00013279249111971864, 'samples': 19005888, 'steps': 98988, 'loss/train': 1.26547110080719} 11/07/2021 11:08:18 - INFO - __main__ - Step 98990: {'lr': 0.00013278780376040056, 'samples': 19006080, 'steps': 98989, 'loss/train': 1.063883900642395} 11/07/2021 11:08:18 - INFO - __main__ - Step 98991: {'lr': 0.00013278311645389645, 'samples': 19006272, 'steps': 98990, 'loss/train': 1.8589107990264893} 11/07/2021 11:08:19 - INFO - __main__ - Step 98992: {'lr': 0.00013277842920020853, 'samples': 19006464, 'steps': 98991, 'loss/train': 0.5853973031044006} 11/07/2021 11:08:19 - INFO - __main__ - Step 98993: {'lr': 0.00013277374199933877, 'samples': 19006656, 'steps': 98992, 'loss/train': 1.4912532567977905} 11/07/2021 11:08:20 - INFO - __main__ - Step 98994: {'lr': 0.00013276905485128942, 'samples': 19006848, 'steps': 98993, 'loss/train': 1.6649229526519775} 11/07/2021 11:08:20 - INFO - __main__ - Step 98995: {'lr': 0.00013276436775606248, 'samples': 19007040, 'steps': 98994, 'loss/train': 1.2664657831192017} 11/07/2021 11:08:20 - INFO - __main__ - Step 98996: {'lr': 0.00013275968071366012, 'samples': 19007232, 'steps': 98995, 'loss/train': 1.498641014099121} 11/07/2021 11:08:21 - INFO - __main__ - Step 98997: {'lr': 0.00013275499372408445, 'samples': 19007424, 'steps': 98996, 'loss/train': 1.624529480934143} 11/07/2021 11:08:22 - INFO - __main__ - Step 98998: {'lr': 0.00013275030678733753, 'samples': 19007616, 'steps': 98997, 'loss/train': 0.6487733125686646} 11/07/2021 11:08:22 - INFO - __main__ - Step 98999: {'lr': 0.00013274561990342165, 'samples': 19007808, 'steps': 98998, 'loss/train': 1.4626076221466064} 11/07/2021 11:08:22 - INFO - __main__ - Step 99000: {'lr': 0.00013274093307233867, 'samples': 19008000, 'steps': 98999, 'loss/train': 1.2587906122207642} 11/07/2021 11:08:23 - INFO - __main__ - Step 99001: {'lr': 0.0001327362462940908, 'samples': 19008192, 'steps': 99000, 'loss/train': 1.7214598655700684} 11/07/2021 11:08:23 - INFO - __main__ - Step 99002: {'lr': 0.0001327315595686802, 'samples': 19008384, 'steps': 99001, 'loss/train': 1.2022664546966553} 11/07/2021 11:08:24 - INFO - __main__ - Step 99003: {'lr': 0.00013272687289610897, 'samples': 19008576, 'steps': 99002, 'loss/train': 1.561203956604004} 11/07/2021 11:08:25 - INFO - __main__ - Step 99004: {'lr': 0.00013272218627637916, 'samples': 19008768, 'steps': 99003, 'loss/train': 0.8320217728614807} 11/07/2021 11:08:25 - INFO - __main__ - Step 99005: {'lr': 0.00013271749970949294, 'samples': 19008960, 'steps': 99004, 'loss/train': 0.8975833654403687} 11/07/2021 11:08:25 - INFO - __main__ - Step 99006: {'lr': 0.00013271281319545235, 'samples': 19009152, 'steps': 99005, 'loss/train': 1.2298403978347778} 11/07/2021 11:08:26 - INFO - __main__ - Step 99007: {'lr': 0.00013270812673425963, 'samples': 19009344, 'steps': 99006, 'loss/train': 1.0511279106140137} 11/07/2021 11:08:27 - INFO - __main__ - Step 99008: {'lr': 0.0001327034403259168, 'samples': 19009536, 'steps': 99007, 'loss/train': 1.6199111938476562} 11/07/2021 11:08:27 - INFO - __main__ - Step 99009: {'lr': 0.00013269875397042596, 'samples': 19009728, 'steps': 99008, 'loss/train': 3.2414145469665527} 11/07/2021 11:08:27 - INFO - __main__ - Step 99010: {'lr': 0.0001326940676677893, 'samples': 19009920, 'steps': 99009, 'loss/train': 1.4422303438186646} 11/07/2021 11:08:28 - INFO - __main__ - Step 99011: {'lr': 0.00013268938141800885, 'samples': 19010112, 'steps': 99010, 'loss/train': 0.9778432846069336} 11/07/2021 11:08:28 - INFO - __main__ - Step 99012: {'lr': 0.00013268469522108685, 'samples': 19010304, 'steps': 99011, 'loss/train': 1.0936095714569092} 11/07/2021 11:08:29 - INFO - __main__ - Step 99013: {'lr': 0.00013268000907702525, 'samples': 19010496, 'steps': 99012, 'loss/train': 0.8807874321937561} 11/07/2021 11:08:29 - INFO - __main__ - Step 99014: {'lr': 0.0001326753229858262, 'samples': 19010688, 'steps': 99013, 'loss/train': 1.2837226390838623} 11/07/2021 11:08:30 - INFO - __main__ - Step 99015: {'lr': 0.0001326706369474918, 'samples': 19010880, 'steps': 99014, 'loss/train': 1.2837414741516113} 11/07/2021 11:08:30 - INFO - __main__ - Step 99016: {'lr': 0.0001326659509620243, 'samples': 19011072, 'steps': 99015, 'loss/train': 1.4485821723937988} 11/07/2021 11:08:31 - INFO - __main__ - Step 99017: {'lr': 0.00013266126502942563, 'samples': 19011264, 'steps': 99016, 'loss/train': 1.1620439291000366} 11/07/2021 11:08:32 - INFO - __main__ - Step 99018: {'lr': 0.00013265657914969802, 'samples': 19011456, 'steps': 99017, 'loss/train': 0.5843575596809387} 11/07/2021 11:08:32 - INFO - __main__ - Step 99019: {'lr': 0.00013265189332284353, 'samples': 19011648, 'steps': 99018, 'loss/train': 1.2108781337738037} 11/07/2021 11:08:32 - INFO - __main__ - Step 99020: {'lr': 0.00013264720754886428, 'samples': 19011840, 'steps': 99019, 'loss/train': 1.1889883279800415} 11/07/2021 11:08:33 - INFO - __main__ - Step 99021: {'lr': 0.0001326425218277624, 'samples': 19012032, 'steps': 99020, 'loss/train': 1.2064703702926636} 11/07/2021 11:08:33 - INFO - __main__ - Step 99022: {'lr': 0.00013263783615954, 'samples': 19012224, 'steps': 99021, 'loss/train': 1.5990792512893677} 11/07/2021 11:08:34 - INFO - __main__ - Step 99023: {'lr': 0.00013263315054419918, 'samples': 19012416, 'steps': 99022, 'loss/train': 0.8491818904876709} 11/07/2021 11:08:34 - INFO - __main__ - Step 99024: {'lr': 0.00013262846498174203, 'samples': 19012608, 'steps': 99023, 'loss/train': 1.4039644002914429} 11/07/2021 11:08:35 - INFO - __main__ - Step 99025: {'lr': 0.00013262377947217068, 'samples': 19012800, 'steps': 99024, 'loss/train': 1.4865801334381104} 11/07/2021 11:08:35 - INFO - __main__ - Step 99026: {'lr': 0.00013261909401548737, 'samples': 19012992, 'steps': 99025, 'loss/train': 1.1498991250991821} 11/07/2021 11:08:35 - INFO - __main__ - Step 99027: {'lr': 0.00013261440861169393, 'samples': 19013184, 'steps': 99026, 'loss/train': 1.8508203029632568} 11/07/2021 11:08:36 - INFO - __main__ - Step 99028: {'lr': 0.00013260972326079268, 'samples': 19013376, 'steps': 99027, 'loss/train': 1.8564770221710205} 11/07/2021 11:08:37 - INFO - __main__ - Step 99029: {'lr': 0.00013260503796278566, 'samples': 19013568, 'steps': 99028, 'loss/train': 1.3818626403808594} 11/07/2021 11:08:37 - INFO - __main__ - Step 99030: {'lr': 0.000132600352717675, 'samples': 19013760, 'steps': 99029, 'loss/train': 1.7155894041061401} 11/07/2021 11:08:37 - INFO - __main__ - Step 99031: {'lr': 0.0001325956675254628, 'samples': 19013952, 'steps': 99030, 'loss/train': 1.5783929824829102} 11/07/2021 11:08:38 - INFO - __main__ - Step 99032: {'lr': 0.0001325909823861512, 'samples': 19014144, 'steps': 99031, 'loss/train': 1.664642572402954} 11/07/2021 11:08:38 - INFO - __main__ - Step 99033: {'lr': 0.0001325862972997423, 'samples': 19014336, 'steps': 99032, 'loss/train': 0.1005573496222496} 11/07/2021 11:08:39 - INFO - __main__ - Step 99034: {'lr': 0.00013258161226623817, 'samples': 19014528, 'steps': 99033, 'loss/train': 1.532130479812622} 11/07/2021 11:08:40 - INFO - __main__ - Step 99035: {'lr': 0.00013257692728564096, 'samples': 19014720, 'steps': 99034, 'loss/train': 1.3578473329544067} 11/07/2021 11:08:40 - INFO - __main__ - Step 99036: {'lr': 0.0001325722423579528, 'samples': 19014912, 'steps': 99035, 'loss/train': 1.4610308408737183} 11/07/2021 11:08:40 - INFO - __main__ - Step 99037: {'lr': 0.00013256755748317575, 'samples': 19015104, 'steps': 99036, 'loss/train': 1.5215333700180054} 11/07/2021 11:08:41 - INFO - __main__ - Step 99038: {'lr': 0.00013256287266131194, 'samples': 19015296, 'steps': 99037, 'loss/train': 0.779082715511322} 11/07/2021 11:08:42 - INFO - __main__ - Step 99039: {'lr': 0.00013255818789236363, 'samples': 19015488, 'steps': 99038, 'loss/train': 1.4568880796432495} 11/07/2021 11:08:42 - INFO - __main__ - Step 99040: {'lr': 0.00013255350317633265, 'samples': 19015680, 'steps': 99039, 'loss/train': 1.413082480430603} 11/07/2021 11:08:42 - INFO - __main__ - Step 99041: {'lr': 0.00013254881851322125, 'samples': 19015872, 'steps': 99040, 'loss/train': 1.4286746978759766} 11/07/2021 11:08:43 - INFO - __main__ - Step 99042: {'lr': 0.00013254413390303155, 'samples': 19016064, 'steps': 99041, 'loss/train': 0.5948489308357239} 11/07/2021 11:08:43 - INFO - __main__ - Step 99043: {'lr': 0.00013253944934576566, 'samples': 19016256, 'steps': 99042, 'loss/train': 1.3819798231124878} 11/07/2021 11:08:44 - INFO - __main__ - Step 99044: {'lr': 0.00013253476484142567, 'samples': 19016448, 'steps': 99043, 'loss/train': 0.0484987273812294} 11/07/2021 11:08:44 - INFO - __main__ - Step 99045: {'lr': 0.00013253008039001372, 'samples': 19016640, 'steps': 99044, 'loss/train': 1.1481281518936157} 11/07/2021 11:08:45 - INFO - __main__ - Step 99046: {'lr': 0.00013252539599153187, 'samples': 19016832, 'steps': 99045, 'loss/train': 1.214617371559143} 11/07/2021 11:08:45 - INFO - __main__ - Step 99047: {'lr': 0.00013252071164598228, 'samples': 19017024, 'steps': 99046, 'loss/train': 1.435469388961792} 11/07/2021 11:08:46 - INFO - __main__ - Step 99048: {'lr': 0.00013251602735336705, 'samples': 19017216, 'steps': 99047, 'loss/train': 1.2453235387802124} 11/07/2021 11:08:46 - INFO - __main__ - Step 99049: {'lr': 0.0001325113431136883, 'samples': 19017408, 'steps': 99048, 'loss/train': 1.227342963218689} 11/07/2021 11:08:47 - INFO - __main__ - Step 99050: {'lr': 0.00013250665892694812, 'samples': 19017600, 'steps': 99049, 'loss/train': 1.4541386365890503} 11/07/2021 11:08:47 - INFO - __main__ - Step 99051: {'lr': 0.00013250197479314858, 'samples': 19017792, 'steps': 99050, 'loss/train': 1.2571548223495483} 11/07/2021 11:08:48 - INFO - __main__ - Step 99052: {'lr': 0.0001324972907122919, 'samples': 19017984, 'steps': 99051, 'loss/train': 1.0458701848983765} 11/07/2021 11:08:48 - INFO - __main__ - Step 99053: {'lr': 0.00013249260668438017, 'samples': 19018176, 'steps': 99052, 'loss/train': 1.696810007095337} 11/07/2021 11:08:48 - INFO - __main__ - Step 99054: {'lr': 0.0001324879227094154, 'samples': 19018368, 'steps': 99053, 'loss/train': 0.9368258118629456} 11/07/2021 11:08:50 - INFO - __main__ - Step 99055: {'lr': 0.00013248323878739974, 'samples': 19018560, 'steps': 99054, 'loss/train': 1.0564484596252441} 11/07/2021 11:08:50 - INFO - __main__ - Step 99056: {'lr': 0.00013247855491833532, 'samples': 19018752, 'steps': 99055, 'loss/train': 1.6282762289047241} 11/07/2021 11:08:50 - INFO - __main__ - Step 99057: {'lr': 0.00013247387110222427, 'samples': 19018944, 'steps': 99056, 'loss/train': 1.5381407737731934} 11/07/2021 11:08:51 - INFO - __main__ - Step 99058: {'lr': 0.00013246918733906865, 'samples': 19019136, 'steps': 99057, 'loss/train': 0.2033185064792633} 11/07/2021 11:08:51 - INFO - __main__ - Step 99059: {'lr': 0.00013246450362887065, 'samples': 19019328, 'steps': 99058, 'loss/train': 1.0942268371582031} 11/07/2021 11:08:53 - INFO - __main__ - Step 99060: {'lr': 0.00013245981997163226, 'samples': 19019520, 'steps': 99059, 'loss/train': 1.140099287033081} 11/07/2021 11:08:53 - INFO - __main__ - Step 99061: {'lr': 0.0001324551363673557, 'samples': 19019712, 'steps': 99060, 'loss/train': 1.3079530000686646} 11/07/2021 11:08:53 - INFO - __main__ - Step 99062: {'lr': 0.00013245045281604304, 'samples': 19019904, 'steps': 99061, 'loss/train': 1.1049312353134155} 11/07/2021 11:08:54 - INFO - __main__ - Step 99063: {'lr': 0.0001324457693176964, 'samples': 19020096, 'steps': 99062, 'loss/train': 1.0381827354431152} 11/07/2021 11:08:54 - INFO - __main__ - Step 99064: {'lr': 0.00013244108587231784, 'samples': 19020288, 'steps': 99063, 'loss/train': 1.1649279594421387} 11/07/2021 11:08:54 - INFO - __main__ - Step 99065: {'lr': 0.00013243640247990958, 'samples': 19020480, 'steps': 99064, 'loss/train': 1.1823351383209229} 11/07/2021 11:08:56 - INFO - __main__ - Step 99066: {'lr': 0.00013243171914047373, 'samples': 19020672, 'steps': 99065, 'loss/train': 1.2950831651687622} 11/07/2021 11:08:56 - INFO - __main__ - Step 99067: {'lr': 0.00013242703585401223, 'samples': 19020864, 'steps': 99066, 'loss/train': 1.3973206281661987} 11/07/2021 11:08:56 - INFO - __main__ - Step 99068: {'lr': 0.0001324223526205273, 'samples': 19021056, 'steps': 99067, 'loss/train': 0.47868889570236206} 11/07/2021 11:08:57 - INFO - __main__ - Step 99069: {'lr': 0.00013241766944002104, 'samples': 19021248, 'steps': 99068, 'loss/train': 1.2806930541992188} 11/07/2021 11:08:57 - INFO - __main__ - Step 99070: {'lr': 0.00013241298631249554, 'samples': 19021440, 'steps': 99069, 'loss/train': 1.4263584613800049} 11/07/2021 11:08:58 - INFO - __main__ - Step 99071: {'lr': 0.00013240830323795296, 'samples': 19021632, 'steps': 99070, 'loss/train': 1.921589732170105} 11/07/2021 11:08:58 - INFO - __main__ - Step 99072: {'lr': 0.0001324036202163954, 'samples': 19021824, 'steps': 99071, 'loss/train': 1.4479035139083862} 11/07/2021 11:08:59 - INFO - __main__ - Step 99073: {'lr': 0.0001323989372478249, 'samples': 19022016, 'steps': 99072, 'loss/train': 0.9023018479347229} 11/07/2021 11:08:59 - INFO - __main__ - Step 99074: {'lr': 0.00013239425433224367, 'samples': 19022208, 'steps': 99073, 'loss/train': 1.5586512088775635} 11/07/2021 11:08:59 - INFO - __main__ - Step 99075: {'lr': 0.00013238957146965378, 'samples': 19022400, 'steps': 99074, 'loss/train': 0.34215807914733887} 11/07/2021 11:09:00 - INFO - __main__ - Step 99076: {'lr': 0.00013238488866005734, 'samples': 19022592, 'steps': 99075, 'loss/train': 1.7429448366165161} 11/07/2021 11:09:01 - INFO - __main__ - Step 99077: {'lr': 0.0001323802059034564, 'samples': 19022784, 'steps': 99076, 'loss/train': 1.2925759553909302} 11/07/2021 11:09:01 - INFO - __main__ - Step 99078: {'lr': 0.00013237552319985316, 'samples': 19022976, 'steps': 99077, 'loss/train': 1.663686752319336} 11/07/2021 11:09:02 - INFO - __main__ - Step 99079: {'lr': 0.0001323708405492498, 'samples': 19023168, 'steps': 99078, 'loss/train': 1.418136715888977} 11/07/2021 11:09:02 - INFO - __main__ - Step 99080: {'lr': 0.00013236615795164818, 'samples': 19023360, 'steps': 99079, 'loss/train': 1.4458521604537964} 11/07/2021 11:09:02 - INFO - __main__ - Step 99081: {'lr': 0.00013236147540705062, 'samples': 19023552, 'steps': 99080, 'loss/train': 1.4900814294815063} 11/07/2021 11:09:04 - INFO - __main__ - Step 99082: {'lr': 0.00013235679291545913, 'samples': 19023744, 'steps': 99081, 'loss/train': 0.20367954671382904} 11/07/2021 11:09:04 - INFO - __main__ - Step 99083: {'lr': 0.00013235211047687585, 'samples': 19023936, 'steps': 99082, 'loss/train': 1.253386378288269} 11/07/2021 11:09:04 - INFO - __main__ - Step 99084: {'lr': 0.0001323474280913029, 'samples': 19024128, 'steps': 99083, 'loss/train': 1.431296706199646} 11/07/2021 11:09:05 - INFO - __main__ - Step 99085: {'lr': 0.00013234274575874239, 'samples': 19024320, 'steps': 99084, 'loss/train': 1.3388757705688477} 11/07/2021 11:09:05 - INFO - __main__ - Step 99086: {'lr': 0.00013233806347919642, 'samples': 19024512, 'steps': 99085, 'loss/train': 1.59433913230896} 11/07/2021 11:09:06 - INFO - __main__ - Step 99087: {'lr': 0.00013233338125266707, 'samples': 19024704, 'steps': 99086, 'loss/train': 1.6595157384872437} 11/07/2021 11:09:06 - INFO - __main__ - Step 99088: {'lr': 0.0001323286990791565, 'samples': 19024896, 'steps': 99087, 'loss/train': 1.4551652669906616} 11/07/2021 11:09:07 - INFO - __main__ - Step 99089: {'lr': 0.00013232401695866685, 'samples': 19025088, 'steps': 99088, 'loss/train': 1.3776016235351562} 11/07/2021 11:09:07 - INFO - __main__ - Step 99090: {'lr': 0.00013231933489120013, 'samples': 19025280, 'steps': 99089, 'loss/train': 0.720068097114563} 11/07/2021 11:09:07 - INFO - __main__ - Step 99091: {'lr': 0.00013231465287675854, 'samples': 19025472, 'steps': 99090, 'loss/train': 1.2489351034164429} 11/07/2021 11:09:08 - INFO - __main__ - Step 99092: {'lr': 0.00013230997091534413, 'samples': 19025664, 'steps': 99091, 'loss/train': 1.274825096130371} 11/07/2021 11:09:09 - INFO - __main__ - Step 99093: {'lr': 0.0001323052890069591, 'samples': 19025856, 'steps': 99092, 'loss/train': 1.8213391304016113} 11/07/2021 11:09:09 - INFO - __main__ - Step 99094: {'lr': 0.00013230060715160543, 'samples': 19026048, 'steps': 99093, 'loss/train': 1.5027180910110474} 11/07/2021 11:09:09 - INFO - __main__ - Step 99095: {'lr': 0.0001322959253492853, 'samples': 19026240, 'steps': 99094, 'loss/train': 1.0930858850479126} 11/07/2021 11:09:10 - INFO - __main__ - Step 99096: {'lr': 0.00013229124360000078, 'samples': 19026432, 'steps': 99095, 'loss/train': 1.3457118272781372} 11/07/2021 11:09:11 - INFO - __main__ - Step 99097: {'lr': 0.00013228656190375404, 'samples': 19026624, 'steps': 99096, 'loss/train': 1.5018900632858276} 11/07/2021 11:09:11 - INFO - __main__ - Step 99098: {'lr': 0.00013228188026054711, 'samples': 19026816, 'steps': 99097, 'loss/train': 1.2301214933395386} 11/07/2021 11:09:11 - INFO - __main__ - Step 99099: {'lr': 0.00013227719867038218, 'samples': 19027008, 'steps': 99098, 'loss/train': 1.0280168056488037} 11/07/2021 11:09:12 - INFO - __main__ - Step 99100: {'lr': 0.00013227251713326133, 'samples': 19027200, 'steps': 99099, 'loss/train': 1.4814577102661133} 11/07/2021 11:09:12 - INFO - __main__ - Step 99101: {'lr': 0.00013226783564918666, 'samples': 19027392, 'steps': 99100, 'loss/train': 1.7252299785614014} 11/07/2021 11:09:13 - INFO - __main__ - Step 99102: {'lr': 0.0001322631542181603, 'samples': 19027584, 'steps': 99101, 'loss/train': 1.4352391958236694} 11/07/2021 11:09:14 - INFO - __main__ - Step 99103: {'lr': 0.00013225847284018433, 'samples': 19027776, 'steps': 99102, 'loss/train': 1.5150824785232544} 11/07/2021 11:09:14 - INFO - __main__ - Step 99104: {'lr': 0.0001322537915152609, 'samples': 19027968, 'steps': 99103, 'loss/train': 1.5813186168670654} 11/07/2021 11:09:14 - INFO - __main__ - Step 99105: {'lr': 0.00013224911024339205, 'samples': 19028160, 'steps': 99104, 'loss/train': 1.4790033102035522} 11/07/2021 11:09:15 - INFO - __main__ - Step 99106: {'lr': 0.00013224442902458005, 'samples': 19028352, 'steps': 99105, 'loss/train': 0.901845395565033} 11/07/2021 11:09:15 - INFO - __main__ - Step 99107: {'lr': 0.00013223974785882682, 'samples': 19028544, 'steps': 99106, 'loss/train': 1.3871361017227173} 11/07/2021 11:09:16 - INFO - __main__ - Step 99108: {'lr': 0.0001322350667461345, 'samples': 19028736, 'steps': 99107, 'loss/train': 1.3340007066726685} 11/07/2021 11:09:16 - INFO - __main__ - Step 99109: {'lr': 0.0001322303856865053, 'samples': 19028928, 'steps': 99108, 'loss/train': 0.9933347702026367} 11/07/2021 11:09:17 - INFO - __main__ - Step 99110: {'lr': 0.00013222570467994122, 'samples': 19029120, 'steps': 99109, 'loss/train': 0.8707031607627869} 11/07/2021 11:09:17 - INFO - __main__ - Step 99111: {'lr': 0.00013222102372644447, 'samples': 19029312, 'steps': 99110, 'loss/train': 1.7166447639465332} 11/07/2021 11:09:17 - INFO - __main__ - Step 99112: {'lr': 0.00013221634282601706, 'samples': 19029504, 'steps': 99111, 'loss/train': 1.6895257234573364} 11/07/2021 11:09:19 - INFO - __main__ - Step 99113: {'lr': 0.00013221166197866112, 'samples': 19029696, 'steps': 99112, 'loss/train': 1.341438889503479} 11/07/2021 11:09:19 - INFO - __main__ - Step 99114: {'lr': 0.00013220698118437884, 'samples': 19029888, 'steps': 99113, 'loss/train': 1.4684395790100098} 11/07/2021 11:09:19 - INFO - __main__ - Step 99115: {'lr': 0.00013220230044317229, 'samples': 19030080, 'steps': 99114, 'loss/train': 0.9399117231369019} 11/07/2021 11:09:20 - INFO - __main__ - Step 99116: {'lr': 0.00013219761975504356, 'samples': 19030272, 'steps': 99115, 'loss/train': 1.317287564277649} 11/07/2021 11:09:20 - INFO - __main__ - Step 99117: {'lr': 0.00013219293911999474, 'samples': 19030464, 'steps': 99116, 'loss/train': 1.1503344774246216} 11/07/2021 11:09:20 - INFO - __main__ - Step 99118: {'lr': 0.00013218825853802797, 'samples': 19030656, 'steps': 99117, 'loss/train': 0.8036368489265442} 11/07/2021 11:09:22 - INFO - __main__ - Step 99119: {'lr': 0.00013218357800914534, 'samples': 19030848, 'steps': 99118, 'loss/train': 0.2576850652694702} 11/07/2021 11:09:22 - INFO - __main__ - Step 99120: {'lr': 0.0001321788975333491, 'samples': 19031040, 'steps': 99119, 'loss/train': 0.7269474267959595} 11/07/2021 11:09:22 - INFO - __main__ - Step 99121: {'lr': 0.0001321742171106411, 'samples': 19031232, 'steps': 99120, 'loss/train': 1.2958383560180664} 11/07/2021 11:09:23 - INFO - __main__ - Step 99122: {'lr': 0.0001321695367410236, 'samples': 19031424, 'steps': 99121, 'loss/train': 1.1475261449813843} 11/07/2021 11:09:23 - INFO - __main__ - Step 99123: {'lr': 0.00013216485642449872, 'samples': 19031616, 'steps': 99122, 'loss/train': 1.2434579133987427} 11/07/2021 11:09:24 - INFO - __main__ - Step 99124: {'lr': 0.0001321601761610685, 'samples': 19031808, 'steps': 99123, 'loss/train': 1.1177388429641724} 11/07/2021 11:09:24 - INFO - __main__ - Step 99125: {'lr': 0.00013215549595073505, 'samples': 19032000, 'steps': 99124, 'loss/train': 1.166322946548462} 11/07/2021 11:09:25 - INFO - __main__ - Step 99126: {'lr': 0.00013215081579350058, 'samples': 19032192, 'steps': 99125, 'loss/train': 0.9434316754341125} 11/07/2021 11:09:25 - INFO - __main__ - Step 99127: {'lr': 0.0001321461356893671, 'samples': 19032384, 'steps': 99126, 'loss/train': 1.8343068361282349} 11/07/2021 11:09:25 - INFO - __main__ - Step 99128: {'lr': 0.0001321414556383368, 'samples': 19032576, 'steps': 99127, 'loss/train': 1.2470821142196655} 11/07/2021 11:09:26 - INFO - __main__ - Step 99129: {'lr': 0.0001321367756404117, 'samples': 19032768, 'steps': 99128, 'loss/train': 1.2974028587341309} 11/07/2021 11:09:27 - INFO - __main__ - Step 99130: {'lr': 0.00013213209569559392, 'samples': 19032960, 'steps': 99129, 'loss/train': 1.5909029245376587} 11/07/2021 11:09:27 - INFO - __main__ - Step 99131: {'lr': 0.00013212741580388566, 'samples': 19033152, 'steps': 99130, 'loss/train': 1.6495434045791626} 11/07/2021 11:09:27 - INFO - __main__ - Step 99132: {'lr': 0.00013212273596528894, 'samples': 19033344, 'steps': 99131, 'loss/train': 1.609675407409668} 11/07/2021 11:09:28 - INFO - __main__ - Step 99133: {'lr': 0.00013211805617980598, 'samples': 19033536, 'steps': 99132, 'loss/train': 1.5705078840255737} 11/07/2021 11:09:29 - INFO - __main__ - Step 99134: {'lr': 0.0001321133764474387, 'samples': 19033728, 'steps': 99133, 'loss/train': 0.9997004866600037} 11/07/2021 11:09:29 - INFO - __main__ - Step 99135: {'lr': 0.00013210869676818935, 'samples': 19033920, 'steps': 99134, 'loss/train': 1.3410483598709106} 11/07/2021 11:09:29 - INFO - __main__ - Step 99136: {'lr': 0.00013210401714205998, 'samples': 19034112, 'steps': 99135, 'loss/train': 0.9671399593353271} 11/07/2021 11:09:30 - INFO - __main__ - Step 99137: {'lr': 0.00013209933756905273, 'samples': 19034304, 'steps': 99136, 'loss/train': 1.5316345691680908} 11/07/2021 11:09:30 - INFO - __main__ - Step 99138: {'lr': 0.0001320946580491697, 'samples': 19034496, 'steps': 99137, 'loss/train': 1.6884245872497559} 11/07/2021 11:09:31 - INFO - __main__ - Step 99139: {'lr': 0.000132089978582413, 'samples': 19034688, 'steps': 99138, 'loss/train': 1.3161945343017578} 11/07/2021 11:09:32 - INFO - __main__ - Step 99140: {'lr': 0.00013208529916878474, 'samples': 19034880, 'steps': 99139, 'loss/train': 1.4098676443099976} 11/07/2021 11:09:32 - INFO - __main__ - Step 99141: {'lr': 0.000132080619808287, 'samples': 19035072, 'steps': 99140, 'loss/train': 1.2595137357711792} 11/07/2021 11:09:32 - INFO - __main__ - Step 99142: {'lr': 0.00013207594050092193, 'samples': 19035264, 'steps': 99141, 'loss/train': 1.024368166923523} 11/07/2021 11:09:33 - INFO - __main__ - Step 99143: {'lr': 0.00013207126124669161, 'samples': 19035456, 'steps': 99142, 'loss/train': 1.6849192380905151} 11/07/2021 11:09:33 - INFO - __main__ - Step 99144: {'lr': 0.00013206658204559818, 'samples': 19035648, 'steps': 99143, 'loss/train': 1.1509120464324951} 11/07/2021 11:09:34 - INFO - __main__ - Step 99145: {'lr': 0.0001320619028976437, 'samples': 19035840, 'steps': 99144, 'loss/train': 1.1410478353500366} 11/07/2021 11:09:34 - INFO - __main__ - Step 99146: {'lr': 0.00013205722380283034, 'samples': 19036032, 'steps': 99145, 'loss/train': 1.041632056236267} 11/07/2021 11:09:35 - INFO - __main__ - Step 99147: {'lr': 0.00013205254476116024, 'samples': 19036224, 'steps': 99146, 'loss/train': 1.080368995666504} 11/07/2021 11:09:35 - INFO - __main__ - Step 99148: {'lr': 0.00013204786577263538, 'samples': 19036416, 'steps': 99147, 'loss/train': 1.8648576736450195} 11/07/2021 11:09:35 - INFO - __main__ - Step 99149: {'lr': 0.00013204318683725791, 'samples': 19036608, 'steps': 99148, 'loss/train': 1.170263648033142} 11/07/2021 11:09:36 - INFO - __main__ - Step 99150: {'lr': 0.00013203850795502997, 'samples': 19036800, 'steps': 99149, 'loss/train': 1.259636640548706} 11/07/2021 11:09:37 - INFO - __main__ - Step 99151: {'lr': 0.00013203382912595362, 'samples': 19036992, 'steps': 99150, 'loss/train': 1.420289397239685} 11/07/2021 11:09:37 - INFO - __main__ - Step 99152: {'lr': 0.00013202915035003104, 'samples': 19037184, 'steps': 99151, 'loss/train': 0.733376145362854} 11/07/2021 11:09:37 - INFO - __main__ - Step 99153: {'lr': 0.00013202447162726432, 'samples': 19037376, 'steps': 99152, 'loss/train': 1.4302018880844116} 11/07/2021 11:09:38 - INFO - __main__ - Step 99154: {'lr': 0.00013201979295765555, 'samples': 19037568, 'steps': 99153, 'loss/train': 1.3441728353500366} 11/07/2021 11:09:39 - INFO - __main__ - Step 99155: {'lr': 0.00013201511434120683, 'samples': 19037760, 'steps': 99154, 'loss/train': 1.5080088376998901} 11/07/2021 11:09:39 - INFO - __main__ - Step 99156: {'lr': 0.00013201043577792026, 'samples': 19037952, 'steps': 99155, 'loss/train': 1.3811109066009521} 11/07/2021 11:09:40 - INFO - __main__ - Step 99157: {'lr': 0.00013200575726779798, 'samples': 19038144, 'steps': 99156, 'loss/train': 1.6074426174163818} 11/07/2021 11:09:40 - INFO - __main__ - Step 99158: {'lr': 0.0001320010788108421, 'samples': 19038336, 'steps': 99157, 'loss/train': 1.8779852390289307} 11/07/2021 11:09:40 - INFO - __main__ - Step 99159: {'lr': 0.00013199640040705468, 'samples': 19038528, 'steps': 99158, 'loss/train': 1.5435420274734497} 11/07/2021 11:09:41 - INFO - __main__ - Step 99160: {'lr': 0.000131991722056438, 'samples': 19038720, 'steps': 99159, 'loss/train': 1.0757509469985962} 11/07/2021 11:09:42 - INFO - __main__ - Step 99161: {'lr': 0.0001319870437589939, 'samples': 19038912, 'steps': 99160, 'loss/train': 1.1544222831726074} 11/07/2021 11:09:42 - INFO - __main__ - Step 99162: {'lr': 0.00013198236551472463, 'samples': 19039104, 'steps': 99161, 'loss/train': 1.3784968852996826} 11/07/2021 11:09:42 - INFO - __main__ - Step 99163: {'lr': 0.0001319776873236323, 'samples': 19039296, 'steps': 99162, 'loss/train': 2.4107284545898438} 11/07/2021 11:09:43 - INFO - __main__ - Step 99164: {'lr': 0.00013197300918571896, 'samples': 19039488, 'steps': 99163, 'loss/train': 1.4772766828536987} 11/07/2021 11:09:43 - INFO - __main__ - Step 99165: {'lr': 0.00013196833110098676, 'samples': 19039680, 'steps': 99164, 'loss/train': 1.5267924070358276} 11/07/2021 11:09:44 - INFO - __main__ - Step 99166: {'lr': 0.00013196365306943785, 'samples': 19039872, 'steps': 99165, 'loss/train': 1.3389232158660889} 11/07/2021 11:09:45 - INFO - __main__ - Step 99167: {'lr': 0.0001319589750910743, 'samples': 19040064, 'steps': 99166, 'loss/train': 1.1198781728744507} 11/07/2021 11:09:45 - INFO - __main__ - Step 99168: {'lr': 0.0001319542971658982, 'samples': 19040256, 'steps': 99167, 'loss/train': 1.5280810594558716} 11/07/2021 11:09:45 - INFO - __main__ - Step 99169: {'lr': 0.00013194961929391166, 'samples': 19040448, 'steps': 99168, 'loss/train': 1.0872315168380737} 11/07/2021 11:09:46 - INFO - __main__ - Step 99170: {'lr': 0.00013194494147511683, 'samples': 19040640, 'steps': 99169, 'loss/train': 0.6281700730323792} 11/07/2021 11:09:47 - INFO - __main__ - Step 99171: {'lr': 0.00013194026370951572, 'samples': 19040832, 'steps': 99170, 'loss/train': 1.3633341789245605} 11/07/2021 11:09:47 - INFO - __main__ - Step 99172: {'lr': 0.00013193558599711066, 'samples': 19041024, 'steps': 99171, 'loss/train': 1.508082628250122} 11/07/2021 11:09:47 - INFO - __main__ - Step 99173: {'lr': 0.0001319309083379035, 'samples': 19041216, 'steps': 99172, 'loss/train': 1.3049283027648926} 11/07/2021 11:09:48 - INFO - __main__ - Step 99174: {'lr': 0.00013192623073189644, 'samples': 19041408, 'steps': 99173, 'loss/train': 1.4826982021331787} 11/07/2021 11:09:48 - INFO - __main__ - Step 99175: {'lr': 0.0001319215531790916, 'samples': 19041600, 'steps': 99174, 'loss/train': 1.4919836521148682} 11/07/2021 11:09:49 - INFO - __main__ - Step 99176: {'lr': 0.0001319168756794911, 'samples': 19041792, 'steps': 99175, 'loss/train': 1.5072795152664185} 11/07/2021 11:09:49 - INFO - __main__ - Step 99177: {'lr': 0.00013191219823309702, 'samples': 19041984, 'steps': 99176, 'loss/train': 0.8602758049964905} 11/07/2021 11:09:50 - INFO - __main__ - Step 99178: {'lr': 0.00013190752083991147, 'samples': 19042176, 'steps': 99177, 'loss/train': 1.4675147533416748} 11/07/2021 11:09:50 - INFO - __main__ - Step 99179: {'lr': 0.00013190284349993658, 'samples': 19042368, 'steps': 99178, 'loss/train': 1.0268223285675049} 11/07/2021 11:09:50 - INFO - __main__ - Step 99180: {'lr': 0.00013189816621317447, 'samples': 19042560, 'steps': 99179, 'loss/train': 1.1526552438735962} 11/07/2021 11:09:51 - INFO - __main__ - Step 99181: {'lr': 0.00013189348897962722, 'samples': 19042752, 'steps': 99180, 'loss/train': 0.6205053925514221} 11/07/2021 11:09:52 - INFO - __main__ - Step 99182: {'lr': 0.0001318888117992969, 'samples': 19042944, 'steps': 99181, 'loss/train': 1.6335893869400024} 11/07/2021 11:09:52 - INFO - __main__ - Step 99183: {'lr': 0.00013188413467218578, 'samples': 19043136, 'steps': 99182, 'loss/train': 0.4908795952796936} 11/07/2021 11:09:52 - INFO - __main__ - Step 99184: {'lr': 0.00013187945759829576, 'samples': 19043328, 'steps': 99183, 'loss/train': 1.4916070699691772} 11/07/2021 11:09:53 - INFO - __main__ - Step 99185: {'lr': 0.00013187478057762901, 'samples': 19043520, 'steps': 99184, 'loss/train': 0.9850095510482788} 11/07/2021 11:09:54 - INFO - __main__ - Step 99186: {'lr': 0.0001318701036101877, 'samples': 19043712, 'steps': 99185, 'loss/train': 0.7332355976104736} 11/07/2021 11:09:54 - INFO - __main__ - Step 99187: {'lr': 0.00013186542669597385, 'samples': 19043904, 'steps': 99186, 'loss/train': 1.5296458005905151} 11/07/2021 11:09:55 - INFO - __main__ - Step 99188: {'lr': 0.00013186074983498965, 'samples': 19044096, 'steps': 99187, 'loss/train': 1.3575332164764404} 11/07/2021 11:09:55 - INFO - __main__ - Step 99189: {'lr': 0.00013185607302723716, 'samples': 19044288, 'steps': 99188, 'loss/train': 1.085240125656128} 11/07/2021 11:09:55 - INFO - __main__ - Step 99190: {'lr': 0.0001318513962727185, 'samples': 19044480, 'steps': 99189, 'loss/train': 1.3193092346191406} 11/07/2021 11:09:56 - INFO - __main__ - Step 99191: {'lr': 0.0001318467195714358, 'samples': 19044672, 'steps': 99190, 'loss/train': 1.5892958641052246} 11/07/2021 11:09:57 - INFO - __main__ - Step 99192: {'lr': 0.0001318420429233911, 'samples': 19044864, 'steps': 99191, 'loss/train': 1.2180628776550293} 11/07/2021 11:09:57 - INFO - __main__ - Step 99193: {'lr': 0.00013183736632858657, 'samples': 19045056, 'steps': 99192, 'loss/train': 0.578855037689209} 11/07/2021 11:09:57 - INFO - __main__ - Step 99194: {'lr': 0.0001318326897870244, 'samples': 19045248, 'steps': 99193, 'loss/train': 1.2081619501113892} 11/07/2021 11:09:58 - INFO - __main__ - Step 99195: {'lr': 0.00013182801329870652, 'samples': 19045440, 'steps': 99194, 'loss/train': 1.4688968658447266} 11/07/2021 11:09:59 - INFO - __main__ - Step 99196: {'lr': 0.00013182333686363506, 'samples': 19045632, 'steps': 99195, 'loss/train': 0.6648030281066895} 11/07/2021 11:09:59 - INFO - __main__ - Step 99197: {'lr': 0.00013181866048181225, 'samples': 19045824, 'steps': 99196, 'loss/train': 1.4440268278121948} 11/07/2021 11:09:59 - INFO - __main__ - Step 99198: {'lr': 0.0001318139841532401, 'samples': 19046016, 'steps': 99197, 'loss/train': 1.1808890104293823} 11/07/2021 11:10:00 - INFO - __main__ - Step 99199: {'lr': 0.00013180930787792073, 'samples': 19046208, 'steps': 99198, 'loss/train': 1.2709592580795288} 11/07/2021 11:10:00 - INFO - __main__ - Step 99200: {'lr': 0.00013180463165585627, 'samples': 19046400, 'steps': 99199, 'loss/train': 1.4668654203414917} 11/07/2021 11:10:00 - INFO - __main__ - Step 99201: {'lr': 0.00013179995548704882, 'samples': 19046592, 'steps': 99200, 'loss/train': 0.897213876247406} 11/07/2021 11:10:02 - INFO - __main__ - Step 99202: {'lr': 0.0001317952793715005, 'samples': 19046784, 'steps': 99201, 'loss/train': 1.3757461309432983} 11/07/2021 11:10:02 - INFO - __main__ - Step 99203: {'lr': 0.0001317906033092134, 'samples': 19046976, 'steps': 99202, 'loss/train': 0.7196676731109619} 11/07/2021 11:10:02 - INFO - __main__ - Step 99204: {'lr': 0.0001317859273001896, 'samples': 19047168, 'steps': 99203, 'loss/train': 1.4054222106933594} 11/07/2021 11:10:03 - INFO - __main__ - Step 99205: {'lr': 0.00013178125134443136, 'samples': 19047360, 'steps': 99204, 'loss/train': 1.1186803579330444} 11/07/2021 11:10:03 - INFO - __main__ - Step 99206: {'lr': 0.00013177657544194055, 'samples': 19047552, 'steps': 99205, 'loss/train': 1.6362552642822266} 11/07/2021 11:10:04 - INFO - __main__ - Step 99207: {'lr': 0.0001317718995927194, 'samples': 19047744, 'steps': 99206, 'loss/train': 1.464148998260498} 11/07/2021 11:10:04 - INFO - __main__ - Step 99208: {'lr': 0.00013176722379677004, 'samples': 19047936, 'steps': 99207, 'loss/train': 0.7106667160987854} 11/07/2021 11:10:05 - INFO - __main__ - Step 99209: {'lr': 0.0001317625480540945, 'samples': 19048128, 'steps': 99208, 'loss/train': 1.3519160747528076} 11/07/2021 11:10:05 - INFO - __main__ - Step 99210: {'lr': 0.00013175787236469495, 'samples': 19048320, 'steps': 99209, 'loss/train': 1.3284562826156616} 11/07/2021 11:10:05 - INFO - __main__ - Step 99211: {'lr': 0.00013175319672857348, 'samples': 19048512, 'steps': 99210, 'loss/train': 1.1010433435440063} 11/07/2021 11:10:06 - INFO - __main__ - Step 99212: {'lr': 0.00013174852114573215, 'samples': 19048704, 'steps': 99211, 'loss/train': 1.5386666059494019} 11/07/2021 11:10:07 - INFO - __main__ - Step 99213: {'lr': 0.0001317438456161732, 'samples': 19048896, 'steps': 99212, 'loss/train': 1.0104142427444458} 11/07/2021 11:10:07 - INFO - __main__ - Step 99214: {'lr': 0.00013173917013989856, 'samples': 19049088, 'steps': 99213, 'loss/train': 1.3873614072799683} 11/07/2021 11:10:07 - INFO - __main__ - Step 99215: {'lr': 0.00013173449471691058, 'samples': 19049280, 'steps': 99214, 'loss/train': 1.3756864070892334} 11/07/2021 11:10:08 - INFO - __main__ - Step 99216: {'lr': 0.0001317298193472111, 'samples': 19049472, 'steps': 99215, 'loss/train': 0.44544944167137146} 11/07/2021 11:10:09 - INFO - __main__ - Step 99217: {'lr': 0.00013172514403080233, 'samples': 19049664, 'steps': 99216, 'loss/train': 2.1251697540283203} 11/07/2021 11:10:09 - INFO - __main__ - Step 99218: {'lr': 0.0001317204687676864, 'samples': 19049856, 'steps': 99217, 'loss/train': 1.5476388931274414} 11/07/2021 11:10:10 - INFO - __main__ - Step 99219: {'lr': 0.00013171579355786538, 'samples': 19050048, 'steps': 99218, 'loss/train': 1.149437427520752} 11/07/2021 11:10:10 - INFO - __main__ - Step 99220: {'lr': 0.00013171111840134142, 'samples': 19050240, 'steps': 99219, 'loss/train': 0.972144603729248} 11/07/2021 11:10:10 - INFO - __main__ - Step 99221: {'lr': 0.0001317064432981166, 'samples': 19050432, 'steps': 99220, 'loss/train': 1.070694923400879} 11/07/2021 11:10:11 - INFO - __main__ - Step 99222: {'lr': 0.00013170176824819303, 'samples': 19050624, 'steps': 99221, 'loss/train': 1.2001172304153442} 11/07/2021 11:10:12 - INFO - __main__ - Step 99223: {'lr': 0.0001316970932515728, 'samples': 19050816, 'steps': 99222, 'loss/train': 1.080682396888733} 11/07/2021 11:10:12 - INFO - __main__ - Step 99224: {'lr': 0.00013169241830825803, 'samples': 19051008, 'steps': 99223, 'loss/train': 1.1477242708206177} 11/07/2021 11:10:12 - INFO - __main__ - Step 99225: {'lr': 0.00013168774341825086, 'samples': 19051200, 'steps': 99224, 'loss/train': 1.4078203439712524} 11/07/2021 11:10:13 - INFO - __main__ - Step 99226: {'lr': 0.00013168306858155334, 'samples': 19051392, 'steps': 99225, 'loss/train': 1.1664385795593262} 11/07/2021 11:10:13 - INFO - __main__ - Step 99227: {'lr': 0.0001316783937981677, 'samples': 19051584, 'steps': 99226, 'loss/train': 1.544101357460022} 11/07/2021 11:10:14 - INFO - __main__ - Step 99228: {'lr': 0.00013167371906809588, 'samples': 19051776, 'steps': 99227, 'loss/train': 1.110032081604004} 11/07/2021 11:10:15 - INFO - __main__ - Step 99229: {'lr': 0.00013166904439134005, 'samples': 19051968, 'steps': 99228, 'loss/train': 0.7313032150268555} 11/07/2021 11:10:15 - INFO - __main__ - Step 99230: {'lr': 0.0001316643697679023, 'samples': 19052160, 'steps': 99229, 'loss/train': 1.4421284198760986} 11/07/2021 11:10:15 - INFO - __main__ - Step 99231: {'lr': 0.00013165969519778482, 'samples': 19052352, 'steps': 99230, 'loss/train': 1.3671717643737793} 11/07/2021 11:10:16 - INFO - __main__ - Step 99232: {'lr': 0.00013165502068098958, 'samples': 19052544, 'steps': 99231, 'loss/train': 0.9192132353782654} 11/07/2021 11:10:17 - INFO - __main__ - Step 99233: {'lr': 0.00013165034621751882, 'samples': 19052736, 'steps': 99232, 'loss/train': 1.2843306064605713} 11/07/2021 11:10:17 - INFO - __main__ - Step 99234: {'lr': 0.00013164567180737452, 'samples': 19052928, 'steps': 99233, 'loss/train': 1.3859902620315552} 11/07/2021 11:10:17 - INFO - __main__ - Step 99235: {'lr': 0.0001316409974505589, 'samples': 19053120, 'steps': 99234, 'loss/train': 0.9959202408790588} 11/07/2021 11:10:18 - INFO - __main__ - Step 99236: {'lr': 0.000131636323147074, 'samples': 19053312, 'steps': 99235, 'loss/train': 1.2117668390274048} 11/07/2021 11:10:18 - INFO - __main__ - Step 99237: {'lr': 0.00013163164889692198, 'samples': 19053504, 'steps': 99236, 'loss/train': 1.1814903020858765} 11/07/2021 11:10:19 - INFO - __main__ - Step 99238: {'lr': 0.0001316269747001049, 'samples': 19053696, 'steps': 99237, 'loss/train': 1.1231194734573364} 11/07/2021 11:10:20 - INFO - __main__ - Step 99239: {'lr': 0.00013162230055662488, 'samples': 19053888, 'steps': 99238, 'loss/train': 1.135668158531189} 11/07/2021 11:10:20 - INFO - __main__ - Step 99240: {'lr': 0.00013161762646648402, 'samples': 19054080, 'steps': 99239, 'loss/train': 1.3212611675262451} 11/07/2021 11:10:20 - INFO - __main__ - Step 99241: {'lr': 0.00013161295242968452, 'samples': 19054272, 'steps': 99240, 'loss/train': 1.3997892141342163} 11/07/2021 11:10:21 - INFO - __main__ - Step 99242: {'lr': 0.0001316082784462283, 'samples': 19054464, 'steps': 99241, 'loss/train': 1.4315836429595947} 11/07/2021 11:10:22 - INFO - __main__ - Step 99243: {'lr': 0.00013160360451611758, 'samples': 19054656, 'steps': 99242, 'loss/train': 1.395176887512207} 11/07/2021 11:10:22 - INFO - __main__ - Step 99244: {'lr': 0.00013159893063935442, 'samples': 19054848, 'steps': 99243, 'loss/train': 1.3890644311904907} 11/07/2021 11:10:22 - INFO - __main__ - Step 99245: {'lr': 0.00013159425681594098, 'samples': 19055040, 'steps': 99244, 'loss/train': 0.9853406548500061} 11/07/2021 11:10:23 - INFO - __main__ - Step 99246: {'lr': 0.0001315895830458793, 'samples': 19055232, 'steps': 99245, 'loss/train': 1.245302438735962} 11/07/2021 11:10:23 - INFO - __main__ - Step 99247: {'lr': 0.0001315849093291716, 'samples': 19055424, 'steps': 99246, 'loss/train': 0.5893731713294983} 11/07/2021 11:10:24 - INFO - __main__ - Step 99248: {'lr': 0.00013158023566581988, 'samples': 19055616, 'steps': 99247, 'loss/train': 1.2200599908828735} 11/07/2021 11:10:25 - INFO - __main__ - Step 99249: {'lr': 0.00013157556205582626, 'samples': 19055808, 'steps': 99248, 'loss/train': 1.725691795349121} 11/07/2021 11:10:25 - INFO - __main__ - Step 99250: {'lr': 0.00013157088849919286, 'samples': 19056000, 'steps': 99249, 'loss/train': 1.1597974300384521} 11/07/2021 11:10:25 - INFO - __main__ - Step 99251: {'lr': 0.00013156621499592182, 'samples': 19056192, 'steps': 99250, 'loss/train': 1.6575803756713867} 11/07/2021 11:10:26 - INFO - __main__ - Step 99252: {'lr': 0.00013156154154601518, 'samples': 19056384, 'steps': 99251, 'loss/train': 1.409142255783081} 11/07/2021 11:10:27 - INFO - __main__ - Step 99253: {'lr': 0.0001315568681494751, 'samples': 19056576, 'steps': 99252, 'loss/train': 1.3468693494796753} 11/07/2021 11:10:27 - INFO - __main__ - Step 99254: {'lr': 0.00013155219480630377, 'samples': 19056768, 'steps': 99253, 'loss/train': 1.2698646783828735} 11/07/2021 11:10:27 - INFO - __main__ - Step 99255: {'lr': 0.00013154752151650308, 'samples': 19056960, 'steps': 99254, 'loss/train': 1.2496873140335083} 11/07/2021 11:10:28 - INFO - __main__ - Step 99256: {'lr': 0.0001315428482800753, 'samples': 19057152, 'steps': 99255, 'loss/train': 1.3475888967514038} 11/07/2021 11:10:28 - INFO - __main__ - Step 99257: {'lr': 0.00013153817509702244, 'samples': 19057344, 'steps': 99256, 'loss/train': 1.41683828830719} 11/07/2021 11:10:29 - INFO - __main__ - Step 99258: {'lr': 0.00013153350196734665, 'samples': 19057536, 'steps': 99257, 'loss/train': 1.3426908254623413} 11/07/2021 11:10:29 - INFO - __main__ - Step 99259: {'lr': 0.00013152882889105007, 'samples': 19057728, 'steps': 99258, 'loss/train': 1.8256868124008179} 11/07/2021 11:10:30 - INFO - __main__ - Step 99260: {'lr': 0.00013152415586813472, 'samples': 19057920, 'steps': 99259, 'loss/train': 2.1853883266448975} 11/07/2021 11:10:30 - INFO - __main__ - Step 99261: {'lr': 0.00013151948289860278, 'samples': 19058112, 'steps': 99260, 'loss/train': 0.5181527733802795} 11/07/2021 11:10:31 - INFO - __main__ - Step 99262: {'lr': 0.00013151480998245633, 'samples': 19058304, 'steps': 99261, 'loss/train': 1.1504853963851929} 11/07/2021 11:10:31 - INFO - __main__ - Step 99263: {'lr': 0.00013151013711969748, 'samples': 19058496, 'steps': 99262, 'loss/train': 1.328800916671753} 11/07/2021 11:10:32 - INFO - __main__ - Step 99264: {'lr': 0.00013150546431032833, 'samples': 19058688, 'steps': 99263, 'loss/train': 0.8205128908157349} 11/07/2021 11:10:32 - INFO - __main__ - Step 99265: {'lr': 0.000131500791554351, 'samples': 19058880, 'steps': 99264, 'loss/train': 1.33895742893219} 11/07/2021 11:10:33 - INFO - __main__ - Step 99266: {'lr': 0.0001314961188517676, 'samples': 19059072, 'steps': 99265, 'loss/train': 1.2250699996948242} 11/07/2021 11:10:33 - INFO - __main__ - Step 99267: {'lr': 0.0001314914462025802, 'samples': 19059264, 'steps': 99266, 'loss/train': 1.3752590417861938} 11/07/2021 11:10:33 - INFO - __main__ - Step 99268: {'lr': 0.000131486773606791, 'samples': 19059456, 'steps': 99267, 'loss/train': 1.6986596584320068} 11/07/2021 11:10:34 - INFO - __main__ - Step 99269: {'lr': 0.00013148210106440195, 'samples': 19059648, 'steps': 99268, 'loss/train': 1.5916181802749634} 11/07/2021 11:10:35 - INFO - __main__ - Step 99270: {'lr': 0.00013147742857541524, 'samples': 19059840, 'steps': 99269, 'loss/train': 1.3646811246871948} 11/07/2021 11:10:35 - INFO - __main__ - Step 99271: {'lr': 0.000131472756139833, 'samples': 19060032, 'steps': 99270, 'loss/train': 0.5462033748626709} 11/07/2021 11:10:35 - INFO - __main__ - Step 99272: {'lr': 0.00013146808375765729, 'samples': 19060224, 'steps': 99271, 'loss/train': 0.9779263734817505} 11/07/2021 11:10:36 - INFO - __main__ - Step 99273: {'lr': 0.0001314634114288902, 'samples': 19060416, 'steps': 99272, 'loss/train': 1.147074818611145} 11/07/2021 11:10:37 - INFO - __main__ - Step 99274: {'lr': 0.0001314587391535339, 'samples': 19060608, 'steps': 99273, 'loss/train': 1.3102679252624512} 11/07/2021 11:10:37 - INFO - __main__ - Step 99275: {'lr': 0.00013145406693159046, 'samples': 19060800, 'steps': 99274, 'loss/train': 1.6111125946044922} 11/07/2021 11:10:38 - INFO - __main__ - Step 99276: {'lr': 0.00013144939476306198, 'samples': 19060992, 'steps': 99275, 'loss/train': 1.7426350116729736} 11/07/2021 11:10:38 - INFO - __main__ - Step 99277: {'lr': 0.00013144472264795058, 'samples': 19061184, 'steps': 99276, 'loss/train': 1.2516379356384277} 11/07/2021 11:10:38 - INFO - __main__ - Step 99278: {'lr': 0.00013144005058625836, 'samples': 19061376, 'steps': 99277, 'loss/train': 1.0858455896377563} 11/07/2021 11:10:39 - INFO - __main__ - Step 99279: {'lr': 0.0001314353785779874, 'samples': 19061568, 'steps': 99278, 'loss/train': 1.167590618133545} 11/07/2021 11:10:40 - INFO - __main__ - Step 99280: {'lr': 0.00013143070662313986, 'samples': 19061760, 'steps': 99279, 'loss/train': 1.0269287824630737} 11/07/2021 11:10:40 - INFO - __main__ - Step 99281: {'lr': 0.00013142603472171788, 'samples': 19061952, 'steps': 99280, 'loss/train': 1.2913434505462646} 11/07/2021 11:10:41 - INFO - __main__ - Step 99282: {'lr': 0.00013142136287372342, 'samples': 19062144, 'steps': 99281, 'loss/train': 1.1871638298034668} 11/07/2021 11:10:41 - INFO - __main__ - Step 99283: {'lr': 0.0001314166910791587, 'samples': 19062336, 'steps': 99282, 'loss/train': 0.7020700573921204} 11/07/2021 11:10:41 - INFO - __main__ - Step 99284: {'lr': 0.00013141201933802575, 'samples': 19062528, 'steps': 99283, 'loss/train': 1.212551474571228} 11/07/2021 11:10:42 - INFO - __main__ - Step 99285: {'lr': 0.00013140734765032668, 'samples': 19062720, 'steps': 99284, 'loss/train': 1.3836674690246582} 11/07/2021 11:10:43 - INFO - __main__ - Step 99286: {'lr': 0.0001314026760160637, 'samples': 19062912, 'steps': 99285, 'loss/train': 1.4563300609588623} 11/07/2021 11:10:43 - INFO - __main__ - Step 99287: {'lr': 0.00013139800443523882, 'samples': 19063104, 'steps': 99286, 'loss/train': 1.2649110555648804} 11/07/2021 11:10:43 - INFO - __main__ - Step 99288: {'lr': 0.00013139333290785416, 'samples': 19063296, 'steps': 99287, 'loss/train': 0.9192425012588501} 11/07/2021 11:10:44 - INFO - __main__ - Step 99289: {'lr': 0.00013138866143391182, 'samples': 19063488, 'steps': 99288, 'loss/train': 1.8009499311447144} 11/07/2021 11:10:45 - INFO - __main__ - Step 99290: {'lr': 0.00013138399001341394, 'samples': 19063680, 'steps': 99289, 'loss/train': 1.797187328338623} 11/07/2021 11:10:45 - INFO - __main__ - Step 99291: {'lr': 0.0001313793186463626, 'samples': 19063872, 'steps': 99290, 'loss/train': 1.4904983043670654} 11/07/2021 11:10:45 - INFO - __main__ - Step 99292: {'lr': 0.0001313746473327599, 'samples': 19064064, 'steps': 99291, 'loss/train': 0.2775900661945343} 11/07/2021 11:10:46 - INFO - __main__ - Step 99293: {'lr': 0.00013136997607260796, 'samples': 19064256, 'steps': 99292, 'loss/train': 1.4333659410476685} 11/07/2021 11:10:46 - INFO - __main__ - Step 99294: {'lr': 0.00013136530486590887, 'samples': 19064448, 'steps': 99293, 'loss/train': 1.6220754384994507} 11/07/2021 11:10:47 - INFO - __main__ - Step 99295: {'lr': 0.00013136063371266485, 'samples': 19064640, 'steps': 99294, 'loss/train': 1.6685516834259033} 11/07/2021 11:10:48 - INFO - __main__ - Step 99296: {'lr': 0.0001313559626128778, 'samples': 19064832, 'steps': 99295, 'loss/train': 1.4096847772598267} 11/07/2021 11:10:48 - INFO - __main__ - Step 99297: {'lr': 0.0001313512915665499, 'samples': 19065024, 'steps': 99296, 'loss/train': 1.4723012447357178} 11/07/2021 11:10:48 - INFO - __main__ - Step 99298: {'lr': 0.0001313466205736833, 'samples': 19065216, 'steps': 99297, 'loss/train': 1.4711822271347046} 11/07/2021 11:10:49 - INFO - __main__ - Step 99299: {'lr': 0.00013134194963428008, 'samples': 19065408, 'steps': 99298, 'loss/train': 1.7092984914779663} 11/07/2021 11:10:50 - INFO - __main__ - Step 99300: {'lr': 0.00013133727874834237, 'samples': 19065600, 'steps': 99299, 'loss/train': 1.367281198501587} 11/07/2021 11:10:50 - INFO - __main__ - Step 99301: {'lr': 0.0001313326079158722, 'samples': 19065792, 'steps': 99300, 'loss/train': 1.6397696733474731} 11/07/2021 11:10:50 - INFO - __main__ - Step 99302: {'lr': 0.00013132793713687178, 'samples': 19065984, 'steps': 99301, 'loss/train': 1.5111901760101318} 11/07/2021 11:10:51 - INFO - __main__ - Step 99303: {'lr': 0.00013132326641134313, 'samples': 19066176, 'steps': 99302, 'loss/train': 1.4038728475570679} 11/07/2021 11:10:51 - INFO - __main__ - Step 99304: {'lr': 0.0001313185957392884, 'samples': 19066368, 'steps': 99303, 'loss/train': 1.4661777019500732} 11/07/2021 11:10:52 - INFO - __main__ - Step 99305: {'lr': 0.00013131392512070967, 'samples': 19066560, 'steps': 99304, 'loss/train': 1.345839262008667} 11/07/2021 11:10:53 - INFO - __main__ - Step 99306: {'lr': 0.00013130925455560904, 'samples': 19066752, 'steps': 99305, 'loss/train': 1.8718583583831787} 11/07/2021 11:10:53 - INFO - __main__ - Step 99307: {'lr': 0.00013130458404398866, 'samples': 19066944, 'steps': 99306, 'loss/train': 1.3972997665405273} 11/07/2021 11:10:53 - INFO - __main__ - Step 99308: {'lr': 0.00013129991358585064, 'samples': 19067136, 'steps': 99307, 'loss/train': 1.1682147979736328} 11/07/2021 11:10:54 - INFO - __main__ - Step 99309: {'lr': 0.00013129524318119702, 'samples': 19067328, 'steps': 99308, 'loss/train': 1.5470389127731323} 11/07/2021 11:10:55 - INFO - __main__ - Step 99310: {'lr': 0.00013129057283002988, 'samples': 19067520, 'steps': 99309, 'loss/train': 0.7491765022277832} 11/07/2021 11:10:55 - INFO - __main__ - Step 99311: {'lr': 0.0001312859025323514, 'samples': 19067712, 'steps': 99310, 'loss/train': 1.4087899923324585} 11/07/2021 11:10:55 - INFO - __main__ - Step 99312: {'lr': 0.00013128123228816366, 'samples': 19067904, 'steps': 99311, 'loss/train': 1.2940953969955444} 11/07/2021 11:10:56 - INFO - __main__ - Step 99313: {'lr': 0.00013127656209746874, 'samples': 19068096, 'steps': 99312, 'loss/train': 1.2606215476989746} 11/07/2021 11:10:56 - INFO - __main__ - Step 99314: {'lr': 0.00013127189196026883, 'samples': 19068288, 'steps': 99313, 'loss/train': 1.0305227041244507} 11/07/2021 11:10:57 - INFO - __main__ - Step 99315: {'lr': 0.00013126722187656594, 'samples': 19068480, 'steps': 99314, 'loss/train': 1.1364128589630127} 11/07/2021 11:10:58 - INFO - __main__ - Step 99316: {'lr': 0.0001312625518463622, 'samples': 19068672, 'steps': 99315, 'loss/train': 1.4794118404388428} 11/07/2021 11:10:58 - INFO - __main__ - Step 99317: {'lr': 0.0001312578818696597, 'samples': 19068864, 'steps': 99316, 'loss/train': 1.4269133806228638} 11/07/2021 11:10:58 - INFO - __main__ - Step 99318: {'lr': 0.0001312532119464606, 'samples': 19069056, 'steps': 99317, 'loss/train': 1.376615047454834} 11/07/2021 11:10:59 - INFO - __main__ - Step 99319: {'lr': 0.00013124854207676695, 'samples': 19069248, 'steps': 99318, 'loss/train': 1.2518869638442993} 11/07/2021 11:10:59 - INFO - __main__ - Step 99320: {'lr': 0.0001312438722605809, 'samples': 19069440, 'steps': 99319, 'loss/train': 1.2197816371917725} 11/07/2021 11:11:00 - INFO - __main__ - Step 99321: {'lr': 0.0001312392024979046, 'samples': 19069632, 'steps': 99320, 'loss/train': 1.5723621845245361} 11/07/2021 11:11:00 - INFO - __main__ - Step 99322: {'lr': 0.00013123453278874, 'samples': 19069824, 'steps': 99321, 'loss/train': 1.3916746377944946} 11/07/2021 11:11:01 - INFO - __main__ - Step 99323: {'lr': 0.0001312298631330893, 'samples': 19070016, 'steps': 99322, 'loss/train': 0.5027155876159668} 11/07/2021 11:11:01 - INFO - __main__ - Step 99324: {'lr': 0.00013122519353095459, 'samples': 19070208, 'steps': 99323, 'loss/train': 1.2235846519470215} 11/07/2021 11:11:01 - INFO - __main__ - Step 99325: {'lr': 0.00013122052398233794, 'samples': 19070400, 'steps': 99324, 'loss/train': 1.0842313766479492} 11/07/2021 11:11:03 - INFO - __main__ - Step 99326: {'lr': 0.0001312158544872415, 'samples': 19070592, 'steps': 99325, 'loss/train': 1.5753188133239746} 11/07/2021 11:11:03 - INFO - __main__ - Step 99327: {'lr': 0.00013121118504566738, 'samples': 19070784, 'steps': 99326, 'loss/train': 1.6636455059051514} 11/07/2021 11:11:03 - INFO - __main__ - Step 99328: {'lr': 0.00013120651565761766, 'samples': 19070976, 'steps': 99327, 'loss/train': 1.3496052026748657} 11/07/2021 11:11:04 - INFO - __main__ - Step 99329: {'lr': 0.00013120184632309446, 'samples': 19071168, 'steps': 99328, 'loss/train': 1.4619916677474976} 11/07/2021 11:11:04 - INFO - __main__ - Step 99330: {'lr': 0.00013119717704209986, 'samples': 19071360, 'steps': 99329, 'loss/train': 1.1465250253677368} 11/07/2021 11:11:05 - INFO - __main__ - Step 99331: {'lr': 0.000131192507814636, 'samples': 19071552, 'steps': 99330, 'loss/train': 1.4225298166275024} 11/07/2021 11:11:05 - INFO - __main__ - Step 99332: {'lr': 0.00013118783864070493, 'samples': 19071744, 'steps': 99331, 'loss/train': 1.4140865802764893} 11/07/2021 11:11:06 - INFO - __main__ - Step 99333: {'lr': 0.00013118316952030878, 'samples': 19071936, 'steps': 99332, 'loss/train': 1.2703343629837036} 11/07/2021 11:11:06 - INFO - __main__ - Step 99334: {'lr': 0.0001311785004534497, 'samples': 19072128, 'steps': 99333, 'loss/train': 1.0953712463378906} 11/07/2021 11:11:06 - INFO - __main__ - Step 99335: {'lr': 0.00013117383144012985, 'samples': 19072320, 'steps': 99334, 'loss/train': 1.0371730327606201} 11/07/2021 11:11:07 - INFO - __main__ - Step 99336: {'lr': 0.0001311691624803511, 'samples': 19072512, 'steps': 99335, 'loss/train': 1.2960822582244873} 11/07/2021 11:11:08 - INFO - __main__ - Step 99337: {'lr': 0.00013116449357411574, 'samples': 19072704, 'steps': 99336, 'loss/train': 1.161980152130127} 11/07/2021 11:11:08 - INFO - __main__ - Step 99338: {'lr': 0.0001311598247214258, 'samples': 19072896, 'steps': 99337, 'loss/train': 1.37434720993042} 11/07/2021 11:11:08 - INFO - __main__ - Step 99339: {'lr': 0.0001311551559222834, 'samples': 19073088, 'steps': 99338, 'loss/train': 1.5316916704177856} 11/07/2021 11:11:09 - INFO - __main__ - Step 99340: {'lr': 0.00013115048717669063, 'samples': 19073280, 'steps': 99339, 'loss/train': 1.4222913980484009} 11/07/2021 11:11:10 - INFO - __main__ - Step 99341: {'lr': 0.00013114581848464968, 'samples': 19073472, 'steps': 99340, 'loss/train': 1.2545838356018066} 11/07/2021 11:11:10 - INFO - __main__ - Step 99342: {'lr': 0.00013114114984616256, 'samples': 19073664, 'steps': 99341, 'loss/train': 1.4260179996490479} 11/07/2021 11:11:10 - INFO - __main__ - Step 99343: {'lr': 0.0001311364812612314, 'samples': 19073856, 'steps': 99342, 'loss/train': 1.4871115684509277} 11/07/2021 11:11:11 - INFO - __main__ - Step 99344: {'lr': 0.00013113181272985834, 'samples': 19074048, 'steps': 99343, 'loss/train': 1.7406561374664307} 11/07/2021 11:11:11 - INFO - __main__ - Step 99345: {'lr': 0.00013112714425204543, 'samples': 19074240, 'steps': 99344, 'loss/train': 1.691622257232666} 11/07/2021 11:11:12 - INFO - __main__ - Step 99346: {'lr': 0.00013112247582779476, 'samples': 19074432, 'steps': 99345, 'loss/train': 1.3145670890808105} 11/07/2021 11:11:13 - INFO - __main__ - Step 99347: {'lr': 0.00013111780745710849, 'samples': 19074624, 'steps': 99346, 'loss/train': 1.7891666889190674} 11/07/2021 11:11:13 - INFO - __main__ - Step 99348: {'lr': 0.0001311131391399888, 'samples': 19074816, 'steps': 99347, 'loss/train': 1.5657298564910889} 11/07/2021 11:11:13 - INFO - __main__ - Step 99349: {'lr': 0.00013110847087643762, 'samples': 19075008, 'steps': 99348, 'loss/train': 1.2388414144515991} 11/07/2021 11:11:14 - INFO - __main__ - Step 99350: {'lr': 0.0001311038026664571, 'samples': 19075200, 'steps': 99349, 'loss/train': 1.3495584726333618} 11/07/2021 11:11:14 - INFO - __main__ - Step 99351: {'lr': 0.0001310991345100494, 'samples': 19075392, 'steps': 99350, 'loss/train': 1.403319001197815} 11/07/2021 11:11:15 - INFO - __main__ - Step 99352: {'lr': 0.00013109446640721656, 'samples': 19075584, 'steps': 99351, 'loss/train': 1.7625887393951416} 11/07/2021 11:11:15 - INFO - __main__ - Step 99353: {'lr': 0.00013108979835796075, 'samples': 19075776, 'steps': 99352, 'loss/train': 1.2156931161880493} 11/07/2021 11:11:16 - INFO - __main__ - Step 99354: {'lr': 0.00013108513036228403, 'samples': 19075968, 'steps': 99353, 'loss/train': 1.6394203901290894} 11/07/2021 11:11:16 - INFO - __main__ - Step 99355: {'lr': 0.0001310804624201885, 'samples': 19076160, 'steps': 99354, 'loss/train': 1.4215375185012817} 11/07/2021 11:11:16 - INFO - __main__ - Step 99356: {'lr': 0.00013107579453167632, 'samples': 19076352, 'steps': 99355, 'loss/train': 1.6010329723358154} 11/07/2021 11:11:18 - INFO - __main__ - Step 99357: {'lr': 0.0001310711266967495, 'samples': 19076544, 'steps': 99356, 'loss/train': 1.1389410495758057} 11/07/2021 11:11:18 - INFO - __main__ - Step 99358: {'lr': 0.00013106645891541025, 'samples': 19076736, 'steps': 99357, 'loss/train': 1.8023372888565063} 11/07/2021 11:11:18 - INFO - __main__ - Step 99359: {'lr': 0.00013106179118766058, 'samples': 19076928, 'steps': 99358, 'loss/train': 1.2104657888412476} 11/07/2021 11:11:19 - INFO - __main__ - Step 99360: {'lr': 0.00013105712351350264, 'samples': 19077120, 'steps': 99359, 'loss/train': 1.180750846862793} 11/07/2021 11:11:19 - INFO - __main__ - Step 99361: {'lr': 0.00013105245589293852, 'samples': 19077312, 'steps': 99360, 'loss/train': 0.8092520236968994} 11/07/2021 11:11:20 - INFO - __main__ - Step 99362: {'lr': 0.00013104778832597041, 'samples': 19077504, 'steps': 99361, 'loss/train': 1.179702639579773} 11/07/2021 11:11:21 - INFO - __main__ - Step 99363: {'lr': 0.00013104312081260028, 'samples': 19077696, 'steps': 99362, 'loss/train': 1.1085615158081055} 11/07/2021 11:11:21 - INFO - __main__ - Step 99364: {'lr': 0.00013103845335283023, 'samples': 19077888, 'steps': 99363, 'loss/train': 1.5269782543182373} 11/07/2021 11:11:21 - INFO - __main__ - Step 99365: {'lr': 0.00013103378594666245, 'samples': 19078080, 'steps': 99364, 'loss/train': 1.7336132526397705} 11/07/2021 11:11:22 - INFO - __main__ - Step 99366: {'lr': 0.000131029118594099, 'samples': 19078272, 'steps': 99365, 'loss/train': 1.744337797164917} 11/07/2021 11:11:22 - INFO - __main__ - Step 99367: {'lr': 0.000131024451295142, 'samples': 19078464, 'steps': 99366, 'loss/train': 0.6080161333084106} 11/07/2021 11:11:23 - INFO - __main__ - Step 99368: {'lr': 0.00013101978404979353, 'samples': 19078656, 'steps': 99367, 'loss/train': 1.112125277519226} 11/07/2021 11:11:23 - INFO - __main__ - Step 99369: {'lr': 0.00013101511685805574, 'samples': 19078848, 'steps': 99368, 'loss/train': 1.2148616313934326} 11/07/2021 11:11:24 - INFO - __main__ - Step 99370: {'lr': 0.0001310104497199307, 'samples': 19079040, 'steps': 99369, 'loss/train': 1.5656108856201172} 11/07/2021 11:11:24 - INFO - __main__ - Step 99371: {'lr': 0.0001310057826354205, 'samples': 19079232, 'steps': 99370, 'loss/train': 1.3553098440170288} 11/07/2021 11:11:24 - INFO - __main__ - Step 99372: {'lr': 0.00013100111560452725, 'samples': 19079424, 'steps': 99371, 'loss/train': 1.3321503400802612} 11/07/2021 11:11:26 - INFO - __main__ - Step 99373: {'lr': 0.00013099644862725308, 'samples': 19079616, 'steps': 99372, 'loss/train': 1.171237826347351} 11/07/2021 11:11:26 - INFO - __main__ - Step 99374: {'lr': 0.00013099178170360005, 'samples': 19079808, 'steps': 99373, 'loss/train': 1.7502145767211914} 11/07/2021 11:11:26 - INFO - __main__ - Step 99375: {'lr': 0.00013098711483357039, 'samples': 19080000, 'steps': 99374, 'loss/train': 1.322843074798584} 11/07/2021 11:11:27 - INFO - __main__ - Step 99376: {'lr': 0.000130982448017166, 'samples': 19080192, 'steps': 99375, 'loss/train': 1.809155821800232} 11/07/2021 11:11:27 - INFO - __main__ - Step 99377: {'lr': 0.00013097778125438915, 'samples': 19080384, 'steps': 99376, 'loss/train': 1.3080328702926636} 11/07/2021 11:11:28 - INFO - __main__ - Step 99378: {'lr': 0.0001309731145452418, 'samples': 19080576, 'steps': 99377, 'loss/train': 1.8561499118804932} 11/07/2021 11:11:28 - INFO - __main__ - Step 99379: {'lr': 0.00013096844788972612, 'samples': 19080768, 'steps': 99378, 'loss/train': 0.46239036321640015} 11/07/2021 11:11:29 - INFO - __main__ - Step 99380: {'lr': 0.00013096378128784426, 'samples': 19080960, 'steps': 99379, 'loss/train': 1.3366564512252808} 11/07/2021 11:11:29 - INFO - __main__ - Step 99381: {'lr': 0.00013095911473959827, 'samples': 19081152, 'steps': 99380, 'loss/train': 1.1611515283584595} 11/07/2021 11:11:29 - INFO - __main__ - Step 99382: {'lr': 0.00013095444824499025, 'samples': 19081344, 'steps': 99381, 'loss/train': 0.8364989757537842} 11/07/2021 11:11:30 - INFO - __main__ - Step 99383: {'lr': 0.00013094978180402234, 'samples': 19081536, 'steps': 99382, 'loss/train': 1.3543758392333984} 11/07/2021 11:11:31 - INFO - __main__ - Step 99384: {'lr': 0.00013094511541669661, 'samples': 19081728, 'steps': 99383, 'loss/train': 1.7040479183197021} 11/07/2021 11:11:31 - INFO - __main__ - Step 99385: {'lr': 0.0001309404490830152, 'samples': 19081920, 'steps': 99384, 'loss/train': 0.5549300312995911} 11/07/2021 11:11:31 - INFO - __main__ - Step 99386: {'lr': 0.00013093578280298017, 'samples': 19082112, 'steps': 99385, 'loss/train': 2.17063570022583} 11/07/2021 11:11:32 - INFO - __main__ - Step 99387: {'lr': 0.00013093111657659363, 'samples': 19082304, 'steps': 99386, 'loss/train': 1.0618807077407837} 11/07/2021 11:11:33 - INFO - __main__ - Step 99388: {'lr': 0.0001309264504038577, 'samples': 19082496, 'steps': 99387, 'loss/train': 1.157123327255249} 11/07/2021 11:11:33 - INFO - __main__ - Step 99389: {'lr': 0.0001309217842847746, 'samples': 19082688, 'steps': 99388, 'loss/train': 1.198432207107544} 11/07/2021 11:11:34 - INFO - __main__ - Step 99390: {'lr': 0.00013091711821934616, 'samples': 19082880, 'steps': 99389, 'loss/train': 2.25274395942688} 11/07/2021 11:11:34 - INFO - __main__ - Step 99391: {'lr': 0.00013091245220757465, 'samples': 19083072, 'steps': 99390, 'loss/train': 1.2081515789031982} 11/07/2021 11:11:34 - INFO - __main__ - Step 99392: {'lr': 0.00013090778624946211, 'samples': 19083264, 'steps': 99391, 'loss/train': 0.5832856297492981} 11/07/2021 11:11:36 - INFO - __main__ - Step 99393: {'lr': 0.00013090312034501073, 'samples': 19083456, 'steps': 99392, 'loss/train': 1.382662296295166} 11/07/2021 11:11:36 - INFO - __main__ - Step 99394: {'lr': 0.00013089845449422256, 'samples': 19083648, 'steps': 99393, 'loss/train': 1.2457172870635986} 11/07/2021 11:11:36 - INFO - __main__ - Step 99395: {'lr': 0.00013089378869709972, 'samples': 19083840, 'steps': 99394, 'loss/train': 1.568896770477295} 11/07/2021 11:11:37 - INFO - __main__ - Step 99396: {'lr': 0.00013088912295364428, 'samples': 19084032, 'steps': 99395, 'loss/train': 1.8238142728805542} 11/07/2021 11:11:37 - INFO - __main__ - Step 99397: {'lr': 0.00013088445726385837, 'samples': 19084224, 'steps': 99396, 'loss/train': 0.8509801030158997} 11/07/2021 11:11:37 - INFO - __main__ - Step 99398: {'lr': 0.00013087979162774407, 'samples': 19084416, 'steps': 99397, 'loss/train': 1.211933970451355} 11/07/2021 11:11:38 - INFO - __main__ - Step 99399: {'lr': 0.00013087512604530353, 'samples': 19084608, 'steps': 99398, 'loss/train': 1.3319599628448486} 11/07/2021 11:11:39 - INFO - __main__ - Step 99400: {'lr': 0.00013087046051653877, 'samples': 19084800, 'steps': 99399, 'loss/train': 1.7427095174789429} 11/07/2021 11:11:39 - INFO - __main__ - Step 99401: {'lr': 0.00013086579504145203, 'samples': 19084992, 'steps': 99400, 'loss/train': 1.5472619533538818} 11/07/2021 11:11:39 - INFO - __main__ - Step 99402: {'lr': 0.00013086112962004535, 'samples': 19085184, 'steps': 99401, 'loss/train': 1.3460277318954468} 11/07/2021 11:11:40 - INFO - __main__ - Step 99403: {'lr': 0.00013085646425232072, 'samples': 19085376, 'steps': 99402, 'loss/train': 1.1169695854187012} 11/07/2021 11:11:41 - INFO - __main__ - Step 99404: {'lr': 0.00013085179893828033, 'samples': 19085568, 'steps': 99403, 'loss/train': 0.9163830876350403} 11/07/2021 11:11:41 - INFO - __main__ - Step 99405: {'lr': 0.00013084713367792628, 'samples': 19085760, 'steps': 99404, 'loss/train': 1.7505402565002441} 11/07/2021 11:11:42 - INFO - __main__ - Step 99406: {'lr': 0.0001308424684712607, 'samples': 19085952, 'steps': 99405, 'loss/train': 2.9980809688568115} 11/07/2021 11:11:42 - INFO - __main__ - Step 99407: {'lr': 0.00013083780331828564, 'samples': 19086144, 'steps': 99406, 'loss/train': 1.344980001449585} 11/07/2021 11:11:42 - INFO - __main__ - Step 99408: {'lr': 0.00013083313821900323, 'samples': 19086336, 'steps': 99407, 'loss/train': 1.2352845668792725} 11/07/2021 11:11:43 - INFO - __main__ - Step 99409: {'lr': 0.00013082847317341556, 'samples': 19086528, 'steps': 99408, 'loss/train': 1.1511167287826538} 11/07/2021 11:11:44 - INFO - __main__ - Step 99410: {'lr': 0.00013082380818152476, 'samples': 19086720, 'steps': 99409, 'loss/train': 1.2060296535491943} 11/07/2021 11:11:44 - INFO - __main__ - Step 99411: {'lr': 0.0001308191432433329, 'samples': 19086912, 'steps': 99410, 'loss/train': 1.3949754238128662} 11/07/2021 11:11:44 - INFO - __main__ - Step 99412: {'lr': 0.00013081447835884208, 'samples': 19087104, 'steps': 99411, 'loss/train': 1.726470708847046} 11/07/2021 11:11:45 - INFO - __main__ - Step 99413: {'lr': 0.00013080981352805445, 'samples': 19087296, 'steps': 99412, 'loss/train': 1.3874880075454712} 11/07/2021 11:11:46 - INFO - __main__ - Step 99414: {'lr': 0.00013080514875097208, 'samples': 19087488, 'steps': 99413, 'loss/train': 0.15981781482696533} 11/07/2021 11:11:46 - INFO - __main__ - Step 99415: {'lr': 0.00013080048402759704, 'samples': 19087680, 'steps': 99414, 'loss/train': 1.3325176239013672} 11/07/2021 11:11:46 - INFO - __main__ - Step 99416: {'lr': 0.00013079581935793158, 'samples': 19087872, 'steps': 99415, 'loss/train': 1.4094611406326294} 11/07/2021 11:11:47 - INFO - __main__ - Step 99417: {'lr': 0.0001307911547419776, 'samples': 19088064, 'steps': 99416, 'loss/train': 1.0921225547790527} 11/07/2021 11:11:47 - INFO - __main__ - Step 99418: {'lr': 0.00013078649017973727, 'samples': 19088256, 'steps': 99417, 'loss/train': 1.240180253982544} 11/07/2021 11:11:48 - INFO - __main__ - Step 99419: {'lr': 0.0001307818256712127, 'samples': 19088448, 'steps': 99418, 'loss/train': 1.2502890825271606} 11/07/2021 11:11:49 - INFO - __main__ - Step 99420: {'lr': 0.00013077716121640597, 'samples': 19088640, 'steps': 99419, 'loss/train': 1.1870628595352173} 11/07/2021 11:11:49 - INFO - __main__ - Step 99421: {'lr': 0.00013077249681531927, 'samples': 19088832, 'steps': 99420, 'loss/train': 0.8261114358901978} 11/07/2021 11:11:49 - INFO - __main__ - Step 99422: {'lr': 0.00013076783246795463, 'samples': 19089024, 'steps': 99421, 'loss/train': 2.1595160961151123} 11/07/2021 11:11:50 - INFO - __main__ - Step 99423: {'lr': 0.00013076316817431415, 'samples': 19089216, 'steps': 99422, 'loss/train': 1.5947266817092896} 11/07/2021 11:11:50 - INFO - __main__ - Step 99424: {'lr': 0.00013075850393439996, 'samples': 19089408, 'steps': 99423, 'loss/train': 1.5265811681747437} 11/07/2021 11:11:51 - INFO - __main__ - Step 99425: {'lr': 0.00013075383974821413, 'samples': 19089600, 'steps': 99424, 'loss/train': 1.5409172773361206} 11/07/2021 11:11:51 - INFO - __main__ - Step 99426: {'lr': 0.00013074917561575877, 'samples': 19089792, 'steps': 99425, 'loss/train': 1.1056419610977173} 11/07/2021 11:11:52 - INFO - __main__ - Step 99427: {'lr': 0.00013074451153703603, 'samples': 19089984, 'steps': 99426, 'loss/train': 1.0843311548233032} 11/07/2021 11:11:52 - INFO - __main__ - Step 99428: {'lr': 0.00013073984751204795, 'samples': 19090176, 'steps': 99427, 'loss/train': 1.5876904726028442} 11/07/2021 11:11:52 - INFO - __main__ - Step 99429: {'lr': 0.00013073518354079678, 'samples': 19090368, 'steps': 99428, 'loss/train': 1.6367419958114624} 11/07/2021 11:11:53 - INFO - __main__ - Step 99430: {'lr': 0.00013073051962328436, 'samples': 19090560, 'steps': 99429, 'loss/train': 1.6081771850585938} 11/07/2021 11:11:54 - INFO - __main__ - Step 99431: {'lr': 0.00013072585575951297, 'samples': 19090752, 'steps': 99430, 'loss/train': 1.2192082405090332} 11/07/2021 11:11:54 - INFO - __main__ - Step 99432: {'lr': 0.0001307211919494846, 'samples': 19090944, 'steps': 99431, 'loss/train': 1.58661687374115} 11/07/2021 11:11:54 - INFO - __main__ - Step 99433: {'lr': 0.00013071652819320146, 'samples': 19091136, 'steps': 99432, 'loss/train': 1.503124475479126} 11/07/2021 11:11:55 - INFO - __main__ - Step 99434: {'lr': 0.00013071186449066562, 'samples': 19091328, 'steps': 99433, 'loss/train': 0.9777485132217407} 11/07/2021 11:11:56 - INFO - __main__ - Step 99435: {'lr': 0.0001307072008418792, 'samples': 19091520, 'steps': 99434, 'loss/train': 1.4821609258651733} 11/07/2021 11:11:56 - INFO - __main__ - Step 99436: {'lr': 0.00013070253724684422, 'samples': 19091712, 'steps': 99435, 'loss/train': 0.9692549705505371} 11/07/2021 11:11:56 - INFO - __main__ - Step 99437: {'lr': 0.00013069787370556285, 'samples': 19091904, 'steps': 99436, 'loss/train': 1.4845385551452637} 11/07/2021 11:11:57 - INFO - __main__ - Step 99438: {'lr': 0.00013069321021803718, 'samples': 19092096, 'steps': 99437, 'loss/train': 1.299461841583252} 11/07/2021 11:11:57 - INFO - __main__ - Step 99439: {'lr': 0.00013068854678426934, 'samples': 19092288, 'steps': 99438, 'loss/train': 1.437882661819458} 11/07/2021 11:11:58 - INFO - __main__ - Step 99440: {'lr': 0.00013068388340426135, 'samples': 19092480, 'steps': 99439, 'loss/train': 1.2650448083877563} 11/07/2021 11:11:59 - INFO - __main__ - Step 99441: {'lr': 0.00013067922007801546, 'samples': 19092672, 'steps': 99440, 'loss/train': 1.3683764934539795} 11/07/2021 11:11:59 - INFO - __main__ - Step 99442: {'lr': 0.00013067455680553362, 'samples': 19092864, 'steps': 99441, 'loss/train': 1.4852979183197021} 11/07/2021 11:11:59 - INFO - __main__ - Step 99443: {'lr': 0.00013066989358681796, 'samples': 19093056, 'steps': 99442, 'loss/train': 0.9239675998687744} 11/07/2021 11:12:00 - INFO - __main__ - Step 99444: {'lr': 0.0001306652304218706, 'samples': 19093248, 'steps': 99443, 'loss/train': 1.4853785037994385} 11/07/2021 11:12:01 - INFO - __main__ - Step 99445: {'lr': 0.00013066056731069365, 'samples': 19093440, 'steps': 99444, 'loss/train': 0.9202119708061218} 11/07/2021 11:12:01 - INFO - __main__ - Step 99446: {'lr': 0.00013065590425328922, 'samples': 19093632, 'steps': 99445, 'loss/train': 1.1114507913589478} 11/07/2021 11:12:01 - INFO - __main__ - Step 99447: {'lr': 0.00013065124124965938, 'samples': 19093824, 'steps': 99446, 'loss/train': 1.3301492929458618} 11/07/2021 11:12:02 - INFO - __main__ - Step 99448: {'lr': 0.00013064657829980626, 'samples': 19094016, 'steps': 99447, 'loss/train': 1.472541093826294} 11/07/2021 11:12:02 - INFO - __main__ - Step 99449: {'lr': 0.00013064191540373193, 'samples': 19094208, 'steps': 99448, 'loss/train': 1.307182788848877} 11/07/2021 11:12:02 - INFO - __main__ - Step 99450: {'lr': 0.00013063725256143852, 'samples': 19094400, 'steps': 99449, 'loss/train': 1.0777900218963623} 11/07/2021 11:12:03 - INFO - __main__ - Step 99451: {'lr': 0.00013063258977292813, 'samples': 19094592, 'steps': 99450, 'loss/train': 1.4737929105758667} 11/07/2021 11:12:04 - INFO - __main__ - Step 99452: {'lr': 0.00013062792703820292, 'samples': 19094784, 'steps': 99451, 'loss/train': 1.609189510345459} 11/07/2021 11:12:04 - INFO - __main__ - Step 99453: {'lr': 0.00013062326435726485, 'samples': 19094976, 'steps': 99452, 'loss/train': 1.580535650253296} 11/07/2021 11:12:04 - INFO - __main__ - Step 99454: {'lr': 0.0001306186017301161, 'samples': 19095168, 'steps': 99453, 'loss/train': 1.3486965894699097} 11/07/2021 11:12:05 - INFO - __main__ - Step 99455: {'lr': 0.00013061393915675878, 'samples': 19095360, 'steps': 99454, 'loss/train': 1.4805850982666016} 11/07/2021 11:12:06 - INFO - __main__ - Step 99456: {'lr': 0.00013060927663719496, 'samples': 19095552, 'steps': 99455, 'loss/train': 1.052220106124878} 11/07/2021 11:12:06 - INFO - __main__ - Step 99457: {'lr': 0.00013060461417142678, 'samples': 19095744, 'steps': 99456, 'loss/train': 1.1985210180282593} 11/07/2021 11:12:07 - INFO - __main__ - Step 99458: {'lr': 0.00013059995175945628, 'samples': 19095936, 'steps': 99457, 'loss/train': 1.6090598106384277} 11/07/2021 11:12:07 - INFO - __main__ - Step 99459: {'lr': 0.00013059528940128563, 'samples': 19096128, 'steps': 99458, 'loss/train': 1.6659141778945923} 11/07/2021 11:12:07 - INFO - __main__ - Step 99460: {'lr': 0.00013059062709691688, 'samples': 19096320, 'steps': 99459, 'loss/train': 1.5269490480422974} 11/07/2021 11:12:08 - INFO - __main__ - Step 99461: {'lr': 0.00013058596484635216, 'samples': 19096512, 'steps': 99460, 'loss/train': 1.376032829284668} 11/07/2021 11:12:09 - INFO - __main__ - Step 99462: {'lr': 0.00013058130264959365, 'samples': 19096704, 'steps': 99461, 'loss/train': 1.0771610736846924} 11/07/2021 11:12:09 - INFO - __main__ - Step 99463: {'lr': 0.00013057664050664325, 'samples': 19096896, 'steps': 99462, 'loss/train': 1.7822294235229492} 11/07/2021 11:12:09 - INFO - __main__ - Step 99464: {'lr': 0.00013057197841750322, 'samples': 19097088, 'steps': 99463, 'loss/train': 1.2491772174835205} 11/07/2021 11:12:10 - INFO - __main__ - Step 99465: {'lr': 0.00013056731638217556, 'samples': 19097280, 'steps': 99464, 'loss/train': 1.1818455457687378} 11/07/2021 11:12:11 - INFO - __main__ - Step 99466: {'lr': 0.00013056265440066246, 'samples': 19097472, 'steps': 99465, 'loss/train': 2.3628270626068115} 11/07/2021 11:12:11 - INFO - __main__ - Step 99467: {'lr': 0.00013055799247296598, 'samples': 19097664, 'steps': 99466, 'loss/train': 1.5544198751449585} 11/07/2021 11:12:12 - INFO - __main__ - Step 99468: {'lr': 0.00013055333059908822, 'samples': 19097856, 'steps': 99467, 'loss/train': 1.4422255754470825} 11/07/2021 11:12:12 - INFO - __main__ - Step 99469: {'lr': 0.00013054866877903128, 'samples': 19098048, 'steps': 99468, 'loss/train': 1.7222586870193481} 11/07/2021 11:12:12 - INFO - __main__ - Step 99470: {'lr': 0.0001305440070127973, 'samples': 19098240, 'steps': 99469, 'loss/train': 1.5192755460739136} 11/07/2021 11:12:13 - INFO - __main__ - Step 99471: {'lr': 0.0001305393453003883, 'samples': 19098432, 'steps': 99470, 'loss/train': 1.4166969060897827} 11/07/2021 11:12:14 - INFO - __main__ - Step 99472: {'lr': 0.00013053468364180646, 'samples': 19098624, 'steps': 99471, 'loss/train': 1.1996318101882935} 11/07/2021 11:12:14 - INFO - __main__ - Step 99473: {'lr': 0.00013053002203705394, 'samples': 19098816, 'steps': 99472, 'loss/train': 1.3674613237380981} 11/07/2021 11:12:14 - INFO - __main__ - Step 99474: {'lr': 0.00013052536048613263, 'samples': 19099008, 'steps': 99473, 'loss/train': 1.1185663938522339} 11/07/2021 11:12:15 - INFO - __main__ - Step 99475: {'lr': 0.00013052069898904478, 'samples': 19099200, 'steps': 99474, 'loss/train': 1.091049313545227} 11/07/2021 11:12:15 - INFO - __main__ - Step 99476: {'lr': 0.00013051603754579244, 'samples': 19099392, 'steps': 99475, 'loss/train': 2.8716444969177246} 11/07/2021 11:12:16 - INFO - __main__ - Step 99477: {'lr': 0.00013051137615637773, 'samples': 19099584, 'steps': 99476, 'loss/train': 1.4490257501602173} 11/07/2021 11:12:16 - INFO - __main__ - Step 99478: {'lr': 0.00013050671482080277, 'samples': 19099776, 'steps': 99477, 'loss/train': 1.2428442239761353} 11/07/2021 11:12:17 - INFO - __main__ - Step 99479: {'lr': 0.00013050205353906964, 'samples': 19099968, 'steps': 99478, 'loss/train': 1.5699082612991333} 11/07/2021 11:12:17 - INFO - __main__ - Step 99480: {'lr': 0.0001304973923111804, 'samples': 19100160, 'steps': 99479, 'loss/train': 1.3459457159042358} 11/07/2021 11:12:17 - INFO - __main__ - Step 99481: {'lr': 0.00013049273113713723, 'samples': 19100352, 'steps': 99480, 'loss/train': 1.257822871208191} 11/07/2021 11:12:19 - INFO - __main__ - Step 99482: {'lr': 0.00013048807001694217, 'samples': 19100544, 'steps': 99481, 'loss/train': 0.6315867304801941} 11/07/2021 11:12:19 - INFO - __main__ - Step 99483: {'lr': 0.00013048340895059735, 'samples': 19100736, 'steps': 99482, 'loss/train': 1.189638614654541} 11/07/2021 11:12:19 - INFO - __main__ - Step 99484: {'lr': 0.00013047874793810493, 'samples': 19100928, 'steps': 99483, 'loss/train': 1.761759638786316} 11/07/2021 11:12:20 - INFO - __main__ - Step 99485: {'lr': 0.0001304740869794669, 'samples': 19101120, 'steps': 99484, 'loss/train': 1.2116836309432983} 11/07/2021 11:12:20 - INFO - __main__ - Step 99486: {'lr': 0.00013046942607468538, 'samples': 19101312, 'steps': 99485, 'loss/train': 1.0947846174240112} 11/07/2021 11:12:20 - INFO - __main__ - Step 99487: {'lr': 0.0001304647652237625, 'samples': 19101504, 'steps': 99486, 'loss/train': 1.5204366445541382} 11/07/2021 11:12:22 - INFO - __main__ - Step 99488: {'lr': 0.0001304601044267003, 'samples': 19101696, 'steps': 99487, 'loss/train': 1.1515028476715088} 11/07/2021 11:12:22 - INFO - __main__ - Step 99489: {'lr': 0.000130455443683501, 'samples': 19101888, 'steps': 99488, 'loss/train': 1.3197928667068481} 11/07/2021 11:12:22 - INFO - __main__ - Step 99490: {'lr': 0.00013045078299416657, 'samples': 19102080, 'steps': 99489, 'loss/train': 1.0662791728973389} 11/07/2021 11:12:23 - INFO - __main__ - Step 99491: {'lr': 0.00013044612235869923, 'samples': 19102272, 'steps': 99490, 'loss/train': 1.4002193212509155} 11/07/2021 11:12:23 - INFO - __main__ - Step 99492: {'lr': 0.00013044146177710098, 'samples': 19102464, 'steps': 99491, 'loss/train': 0.46036723256111145} 11/07/2021 11:12:24 - INFO - __main__ - Step 99493: {'lr': 0.00013043680124937397, 'samples': 19102656, 'steps': 99492, 'loss/train': 0.43013253808021545} 11/07/2021 11:12:25 - INFO - __main__ - Step 99494: {'lr': 0.00013043214077552035, 'samples': 19102848, 'steps': 99493, 'loss/train': 1.18547523021698} 11/07/2021 11:12:25 - INFO - __main__ - Step 99495: {'lr': 0.0001304274803555421, 'samples': 19103040, 'steps': 99494, 'loss/train': 1.4336518049240112} 11/07/2021 11:12:25 - INFO - __main__ - Step 99496: {'lr': 0.0001304228199894415, 'samples': 19103232, 'steps': 99495, 'loss/train': 0.9725179076194763} 11/07/2021 11:12:26 - INFO - __main__ - Step 99497: {'lr': 0.00013041815967722043, 'samples': 19103424, 'steps': 99496, 'loss/train': 0.05559651926159859} 11/07/2021 11:12:27 - INFO - __main__ - Step 99498: {'lr': 0.0001304134994188811, 'samples': 19103616, 'steps': 99497, 'loss/train': 1.6228277683258057} 11/07/2021 11:12:27 - INFO - __main__ - Step 99499: {'lr': 0.0001304088392144256, 'samples': 19103808, 'steps': 99498, 'loss/train': 1.5682942867279053} 11/07/2021 11:12:27 - INFO - __main__ - Step 99500: {'lr': 0.00013040417906385598, 'samples': 19104000, 'steps': 99499, 'loss/train': 1.2365241050720215} 11/07/2021 11:12:28 - INFO - __main__ - Step 99501: {'lr': 0.00013039951896717445, 'samples': 19104192, 'steps': 99500, 'loss/train': 1.554211974143982} 11/07/2021 11:12:28 - INFO - __main__ - Step 99502: {'lr': 0.00013039485892438305, 'samples': 19104384, 'steps': 99501, 'loss/train': 1.2598894834518433} 11/07/2021 11:12:29 - INFO - __main__ - Step 99503: {'lr': 0.00013039019893548387, 'samples': 19104576, 'steps': 99502, 'loss/train': 1.1289232969284058} 11/07/2021 11:12:29 - INFO - __main__ - Step 99504: {'lr': 0.000130385539000479, 'samples': 19104768, 'steps': 99503, 'loss/train': 1.5786142349243164} 11/07/2021 11:12:30 - INFO - __main__ - Step 99505: {'lr': 0.00013038087911937057, 'samples': 19104960, 'steps': 99504, 'loss/train': 1.064719319343567} 11/07/2021 11:12:30 - INFO - __main__ - Step 99506: {'lr': 0.0001303762192921607, 'samples': 19105152, 'steps': 99505, 'loss/train': 1.27829909324646} 11/07/2021 11:12:31 - INFO - __main__ - Step 99507: {'lr': 0.00013037155951885145, 'samples': 19105344, 'steps': 99506, 'loss/train': 1.242706537246704} 11/07/2021 11:12:32 - INFO - __main__ - Step 99508: {'lr': 0.00013036689979944492, 'samples': 19105536, 'steps': 99507, 'loss/train': 2.0829052925109863} 11/07/2021 11:12:32 - INFO - __main__ - Step 99509: {'lr': 0.00013036224013394322, 'samples': 19105728, 'steps': 99508, 'loss/train': 1.4908504486083984} 11/07/2021 11:12:32 - INFO - __main__ - Step 99510: {'lr': 0.00013035758052234853, 'samples': 19105920, 'steps': 99509, 'loss/train': 1.4294613599777222} 11/07/2021 11:12:33 - INFO - __main__ - Step 99511: {'lr': 0.00013035292096466277, 'samples': 19106112, 'steps': 99510, 'loss/train': 1.7449078559875488} 11/07/2021 11:12:33 - INFO - __main__ - Step 99512: {'lr': 0.0001303482614608882, 'samples': 19106304, 'steps': 99511, 'loss/train': 1.5774052143096924} 11/07/2021 11:12:34 - INFO - __main__ - Step 99513: {'lr': 0.0001303436020110268, 'samples': 19106496, 'steps': 99512, 'loss/train': 2.1330134868621826} 11/07/2021 11:12:34 - INFO - __main__ - Step 99514: {'lr': 0.00013033894261508071, 'samples': 19106688, 'steps': 99513, 'loss/train': 1.3004897832870483} 11/07/2021 11:12:35 - INFO - __main__ - Step 99515: {'lr': 0.00013033428327305209, 'samples': 19106880, 'steps': 99514, 'loss/train': 1.593347191810608} 11/07/2021 11:12:35 - INFO - __main__ - Step 99516: {'lr': 0.00013032962398494297, 'samples': 19107072, 'steps': 99515, 'loss/train': 1.3778257369995117} 11/07/2021 11:12:36 - INFO - __main__ - Step 99517: {'lr': 0.0001303249647507555, 'samples': 19107264, 'steps': 99516, 'loss/train': 1.1708794832229614} 11/07/2021 11:12:36 - INFO - __main__ - Step 99518: {'lr': 0.00013032030557049172, 'samples': 19107456, 'steps': 99517, 'loss/train': 1.1249195337295532} 11/07/2021 11:12:37 - INFO - __main__ - Step 99519: {'lr': 0.00013031564644415378, 'samples': 19107648, 'steps': 99518, 'loss/train': 1.025881290435791} 11/07/2021 11:12:37 - INFO - __main__ - Step 99520: {'lr': 0.00013031098737174374, 'samples': 19107840, 'steps': 99519, 'loss/train': 1.4086214303970337} 11/07/2021 11:12:38 - INFO - __main__ - Step 99521: {'lr': 0.00013030632835326378, 'samples': 19108032, 'steps': 99520, 'loss/train': 1.3969788551330566} 11/07/2021 11:12:38 - INFO - __main__ - Step 99522: {'lr': 0.0001303016693887159, 'samples': 19108224, 'steps': 99521, 'loss/train': 1.0869660377502441} 11/07/2021 11:12:38 - INFO - __main__ - Step 99523: {'lr': 0.00013029701047810233, 'samples': 19108416, 'steps': 99522, 'loss/train': 1.1191335916519165} 11/07/2021 11:12:39 - INFO - __main__ - Step 99524: {'lr': 0.000130292351621425, 'samples': 19108608, 'steps': 99523, 'loss/train': 1.3124682903289795} 11/07/2021 11:12:40 - INFO - __main__ - Step 99525: {'lr': 0.00013028769281868608, 'samples': 19108800, 'steps': 99524, 'loss/train': 1.4691604375839233} 11/07/2021 11:12:40 - INFO - __main__ - Step 99526: {'lr': 0.00013028303406988767, 'samples': 19108992, 'steps': 99525, 'loss/train': 1.2494410276412964} 11/07/2021 11:12:41 - INFO - __main__ - Step 99527: {'lr': 0.0001302783753750319, 'samples': 19109184, 'steps': 99526, 'loss/train': 1.0189743041992188} 11/07/2021 11:12:41 - INFO - __main__ - Step 99528: {'lr': 0.00013027371673412087, 'samples': 19109376, 'steps': 99527, 'loss/train': 1.1618123054504395} 11/07/2021 11:12:42 - INFO - __main__ - Step 99529: {'lr': 0.00013026905814715663, 'samples': 19109568, 'steps': 99528, 'loss/train': 1.3257108926773071} 11/07/2021 11:12:42 - INFO - __main__ - Step 99530: {'lr': 0.00013026439961414128, 'samples': 19109760, 'steps': 99529, 'loss/train': 1.0133743286132812} 11/07/2021 11:12:43 - INFO - __main__ - Step 99531: {'lr': 0.00013025974113507695, 'samples': 19109952, 'steps': 99530, 'loss/train': 1.0541722774505615} 11/07/2021 11:12:43 - INFO - __main__ - Step 99532: {'lr': 0.00013025508270996574, 'samples': 19110144, 'steps': 99531, 'loss/train': 1.4856802225112915} 11/07/2021 11:12:43 - INFO - __main__ - Step 99533: {'lr': 0.00013025042433880977, 'samples': 19110336, 'steps': 99532, 'loss/train': 1.5706994533538818} 11/07/2021 11:12:44 - INFO - __main__ - Step 99534: {'lr': 0.0001302457660216111, 'samples': 19110528, 'steps': 99533, 'loss/train': 0.5230199098587036} 11/07/2021 11:12:45 - INFO - __main__ - Step 99535: {'lr': 0.0001302411077583718, 'samples': 19110720, 'steps': 99534, 'loss/train': 1.529754638671875} 11/07/2021 11:12:45 - INFO - __main__ - Step 99536: {'lr': 0.00013023644954909404, 'samples': 19110912, 'steps': 99535, 'loss/train': 1.0844800472259521} 11/07/2021 11:12:45 - INFO - __main__ - Step 99537: {'lr': 0.00013023179139377998, 'samples': 19111104, 'steps': 99536, 'loss/train': 1.166160225868225} 11/07/2021 11:12:46 - INFO - __main__ - Step 99538: {'lr': 0.00013022713329243152, 'samples': 19111296, 'steps': 99537, 'loss/train': 1.4152792692184448} 11/07/2021 11:12:46 - INFO - __main__ - Step 99539: {'lr': 0.0001302224752450509, 'samples': 19111488, 'steps': 99538, 'loss/train': 1.254137635231018} 11/07/2021 11:12:47 - INFO - __main__ - Step 99540: {'lr': 0.00013021781725164016, 'samples': 19111680, 'steps': 99539, 'loss/train': 1.2735601663589478} 11/07/2021 11:12:47 - INFO - __main__ - Step 99541: {'lr': 0.00013021315931220143, 'samples': 19111872, 'steps': 99540, 'loss/train': 1.6306679248809814} 11/07/2021 11:12:48 - INFO - __main__ - Step 99542: {'lr': 0.00013020850142673679, 'samples': 19112064, 'steps': 99541, 'loss/train': 1.0266660451889038} 11/07/2021 11:12:48 - INFO - __main__ - Step 99543: {'lr': 0.00013020384359524833, 'samples': 19112256, 'steps': 99542, 'loss/train': 1.099049687385559} 11/07/2021 11:12:48 - INFO - __main__ - Step 99544: {'lr': 0.0001301991858177382, 'samples': 19112448, 'steps': 99543, 'loss/train': 1.4960018396377563} 11/07/2021 11:12:49 - INFO - __main__ - Step 99545: {'lr': 0.0001301945280942085, 'samples': 19112640, 'steps': 99544, 'loss/train': 1.4878028631210327} 11/07/2021 11:12:50 - INFO - __main__ - Step 99546: {'lr': 0.00013018987042466123, 'samples': 19112832, 'steps': 99545, 'loss/train': 1.1798017024993896} 11/07/2021 11:12:50 - INFO - __main__ - Step 99547: {'lr': 0.00013018521280909863, 'samples': 19113024, 'steps': 99546, 'loss/train': 1.2770863771438599} 11/07/2021 11:12:51 - INFO - __main__ - Step 99548: {'lr': 0.00013018055524752266, 'samples': 19113216, 'steps': 99547, 'loss/train': 1.3063549995422363} 11/07/2021 11:12:51 - INFO - __main__ - Step 99549: {'lr': 0.00013017589773993548, 'samples': 19113408, 'steps': 99548, 'loss/train': 1.566179871559143} 11/07/2021 11:12:52 - INFO - __main__ - Step 99550: {'lr': 0.00013017124028633933, 'samples': 19113600, 'steps': 99549, 'loss/train': 1.1569463014602661} 11/07/2021 11:12:52 - INFO - __main__ - Step 99551: {'lr': 0.00013016658288673606, 'samples': 19113792, 'steps': 99550, 'loss/train': 1.2523283958435059} 11/07/2021 11:12:53 - INFO - __main__ - Step 99552: {'lr': 0.00013016192554112787, 'samples': 19113984, 'steps': 99551, 'loss/train': 1.7444602251052856} 11/07/2021 11:12:53 - INFO - __main__ - Step 99553: {'lr': 0.0001301572682495169, 'samples': 19114176, 'steps': 99552, 'loss/train': 1.5929239988327026} 11/07/2021 11:12:53 - INFO - __main__ - Step 99554: {'lr': 0.00013015261101190519, 'samples': 19114368, 'steps': 99553, 'loss/train': 1.6067273616790771} 11/07/2021 11:12:54 - INFO - __main__ - Step 99555: {'lr': 0.00013014795382829486, 'samples': 19114560, 'steps': 99554, 'loss/train': 2.2781755924224854} 11/07/2021 11:12:55 - INFO - __main__ - Step 99556: {'lr': 0.00013014329669868802, 'samples': 19114752, 'steps': 99555, 'loss/train': 1.3366997241973877} 11/07/2021 11:12:55 - INFO - __main__ - Step 99557: {'lr': 0.00013013863962308675, 'samples': 19114944, 'steps': 99556, 'loss/train': 1.2392700910568237} 11/07/2021 11:12:55 - INFO - __main__ - Step 99558: {'lr': 0.00013013398260149317, 'samples': 19115136, 'steps': 99557, 'loss/train': 1.2603482007980347} 11/07/2021 11:12:56 - INFO - __main__ - Step 99559: {'lr': 0.00013012932563390934, 'samples': 19115328, 'steps': 99558, 'loss/train': 1.4517686367034912} 11/07/2021 11:12:57 - INFO - __main__ - Step 99560: {'lr': 0.0001301246687203374, 'samples': 19115520, 'steps': 99559, 'loss/train': 0.9610048532485962} 11/07/2021 11:12:57 - INFO - __main__ - Step 99561: {'lr': 0.00013012001186077946, 'samples': 19115712, 'steps': 99560, 'loss/train': 1.3061448335647583} 11/07/2021 11:12:58 - INFO - __main__ - Step 99562: {'lr': 0.00013011535505523758, 'samples': 19115904, 'steps': 99561, 'loss/train': 0.941681444644928} 11/07/2021 11:12:58 - INFO - __main__ - Step 99563: {'lr': 0.00013011069830371397, 'samples': 19116096, 'steps': 99562, 'loss/train': 1.3255438804626465} 11/07/2021 11:12:58 - INFO - __main__ - Step 99564: {'lr': 0.00013010604160621053, 'samples': 19116288, 'steps': 99563, 'loss/train': 1.3871718645095825} 11/07/2021 11:12:59 - INFO - __main__ - Step 99565: {'lr': 0.00013010138496272945, 'samples': 19116480, 'steps': 99564, 'loss/train': 1.4642747640609741} 11/07/2021 11:13:00 - INFO - __main__ - Step 99566: {'lr': 0.00013009672837327287, 'samples': 19116672, 'steps': 99565, 'loss/train': 1.434306025505066} 11/07/2021 11:13:00 - INFO - __main__ - Step 99567: {'lr': 0.00013009207183784278, 'samples': 19116864, 'steps': 99566, 'loss/train': 0.9700750708580017} 11/07/2021 11:13:00 - INFO - __main__ - Step 99568: {'lr': 0.0001300874153564414, 'samples': 19117056, 'steps': 99567, 'loss/train': 1.4112331867218018} 11/07/2021 11:13:01 - INFO - __main__ - Step 99569: {'lr': 0.0001300827589290708, 'samples': 19117248, 'steps': 99568, 'loss/train': 1.5632964372634888} 11/07/2021 11:13:01 - INFO - __main__ - Step 99570: {'lr': 0.00013007810255573303, 'samples': 19117440, 'steps': 99569, 'loss/train': 1.1576807498931885} 11/07/2021 11:13:02 - INFO - __main__ - Step 99571: {'lr': 0.00013007344623643019, 'samples': 19117632, 'steps': 99570, 'loss/train': 1.1744294166564941} 11/07/2021 11:13:02 - INFO - __main__ - Step 99572: {'lr': 0.00013006878997116444, 'samples': 19117824, 'steps': 99571, 'loss/train': 1.4891384840011597} 11/07/2021 11:13:03 - INFO - __main__ - Step 99573: {'lr': 0.00013006413375993785, 'samples': 19118016, 'steps': 99572, 'loss/train': 1.231576919555664} 11/07/2021 11:13:03 - INFO - __main__ - Step 99574: {'lr': 0.0001300594776027525, 'samples': 19118208, 'steps': 99573, 'loss/train': 1.303639531135559} 11/07/2021 11:13:03 - INFO - __main__ - Step 99575: {'lr': 0.0001300548214996105, 'samples': 19118400, 'steps': 99574, 'loss/train': 1.2882115840911865} 11/07/2021 11:13:05 - INFO - __main__ - Step 99576: {'lr': 0.00013005016545051396, 'samples': 19118592, 'steps': 99575, 'loss/train': 1.032795786857605} 11/07/2021 11:13:05 - INFO - __main__ - Step 99577: {'lr': 0.00013004550945546503, 'samples': 19118784, 'steps': 99576, 'loss/train': 1.4213122129440308} 11/07/2021 11:13:05 - INFO - __main__ - Step 99578: {'lr': 0.00013004085351446564, 'samples': 19118976, 'steps': 99577, 'loss/train': 1.1368235349655151} 11/07/2021 11:13:06 - INFO - __main__ - Step 99579: {'lr': 0.00013003619762751804, 'samples': 19119168, 'steps': 99578, 'loss/train': 1.166676640510559} 11/07/2021 11:13:06 - INFO - __main__ - Step 99580: {'lr': 0.00013003154179462424, 'samples': 19119360, 'steps': 99579, 'loss/train': 1.3695106506347656} 11/07/2021 11:13:07 - INFO - __main__ - Step 99581: {'lr': 0.0001300268860157864, 'samples': 19119552, 'steps': 99580, 'loss/train': 1.2992883920669556} 11/07/2021 11:13:07 - INFO - __main__ - Step 99582: {'lr': 0.00013002223029100657, 'samples': 19119744, 'steps': 99581, 'loss/train': 1.128021001815796} 11/07/2021 11:13:08 - INFO - __main__ - Step 99583: {'lr': 0.00013001757462028688, 'samples': 19119936, 'steps': 99582, 'loss/train': 1.4588830471038818} 11/07/2021 11:13:08 - INFO - __main__ - Step 99584: {'lr': 0.00013001291900362945, 'samples': 19120128, 'steps': 99583, 'loss/train': 1.3340187072753906} 11/07/2021 11:13:08 - INFO - __main__ - Step 99585: {'lr': 0.00013000826344103627, 'samples': 19120320, 'steps': 99584, 'loss/train': 0.9188142418861389} 11/07/2021 11:13:10 - INFO - __main__ - Step 99586: {'lr': 0.0001300036079325096, 'samples': 19120512, 'steps': 99585, 'loss/train': 0.9676076769828796} 11/07/2021 11:13:10 - INFO - __main__ - Step 99587: {'lr': 0.00012999895247805138, 'samples': 19120704, 'steps': 99586, 'loss/train': 1.312303066253662} 11/07/2021 11:13:10 - INFO - __main__ - Step 99588: {'lr': 0.00012999429707766382, 'samples': 19120896, 'steps': 99587, 'loss/train': 1.4529744386672974} 11/07/2021 11:13:11 - INFO - __main__ - Step 99589: {'lr': 0.00012998964173134897, 'samples': 19121088, 'steps': 99588, 'loss/train': 1.5305782556533813} 11/07/2021 11:13:11 - INFO - __main__ - Step 99590: {'lr': 0.00012998498643910906, 'samples': 19121280, 'steps': 99589, 'loss/train': 0.4941423833370209} 11/07/2021 11:13:12 - INFO - __main__ - Step 99591: {'lr': 0.00012998033120094593, 'samples': 19121472, 'steps': 99590, 'loss/train': 1.1420764923095703} 11/07/2021 11:13:12 - INFO - __main__ - Step 99592: {'lr': 0.00012997567601686182, 'samples': 19121664, 'steps': 99591, 'loss/train': 1.6041041612625122} 11/07/2021 11:13:13 - INFO - __main__ - Step 99593: {'lr': 0.00012997102088685883, 'samples': 19121856, 'steps': 99592, 'loss/train': 1.1585355997085571} 11/07/2021 11:13:13 - INFO - __main__ - Step 99594: {'lr': 0.00012996636581093904, 'samples': 19122048, 'steps': 99593, 'loss/train': 1.3959218263626099} 11/07/2021 11:13:13 - INFO - __main__ - Step 99595: {'lr': 0.00012996171078910457, 'samples': 19122240, 'steps': 99594, 'loss/train': 1.2802988290786743} 11/07/2021 11:13:14 - INFO - __main__ - Step 99596: {'lr': 0.00012995705582135748, 'samples': 19122432, 'steps': 99595, 'loss/train': 1.4587621688842773} 11/07/2021 11:13:15 - INFO - __main__ - Step 99597: {'lr': 0.00012995240090769988, 'samples': 19122624, 'steps': 99596, 'loss/train': 1.5921533107757568} 11/07/2021 11:13:15 - INFO - __main__ - Step 99598: {'lr': 0.00012994774604813386, 'samples': 19122816, 'steps': 99597, 'loss/train': 1.2471044063568115} 11/07/2021 11:13:15 - INFO - __main__ - Step 99599: {'lr': 0.00012994309124266158, 'samples': 19123008, 'steps': 99598, 'loss/train': 1.0203458070755005} 11/07/2021 11:13:16 - INFO - __main__ - Step 99600: {'lr': 0.00012993843649128505, 'samples': 19123200, 'steps': 99599, 'loss/train': 1.8083455562591553} 11/07/2021 11:13:17 - INFO - __main__ - Step 99601: {'lr': 0.00012993378179400645, 'samples': 19123392, 'steps': 99600, 'loss/train': 1.479810357093811} 11/07/2021 11:13:17 - INFO - __main__ - Step 99602: {'lr': 0.0001299291271508278, 'samples': 19123584, 'steps': 99601, 'loss/train': 1.198373556137085} 11/07/2021 11:13:18 - INFO - __main__ - Step 99603: {'lr': 0.00012992447256175124, 'samples': 19123776, 'steps': 99602, 'loss/train': 1.5792347192764282} 11/07/2021 11:13:18 - INFO - __main__ - Step 99604: {'lr': 0.00012991981802677898, 'samples': 19123968, 'steps': 99603, 'loss/train': 1.5824885368347168} 11/07/2021 11:13:18 - INFO - __main__ - Step 99605: {'lr': 0.00012991516354591287, 'samples': 19124160, 'steps': 99604, 'loss/train': 0.7116523385047913} 11/07/2021 11:13:19 - INFO - __main__ - Step 99606: {'lr': 0.00012991050911915513, 'samples': 19124352, 'steps': 99605, 'loss/train': 1.281195878982544} 11/07/2021 11:13:20 - INFO - __main__ - Step 99607: {'lr': 0.0001299058547465079, 'samples': 19124544, 'steps': 99606, 'loss/train': 1.6377439498901367} 11/07/2021 11:13:20 - INFO - __main__ - Step 99608: {'lr': 0.0001299012004279732, 'samples': 19124736, 'steps': 99607, 'loss/train': 1.016700029373169} 11/07/2021 11:13:21 - INFO - __main__ - Step 99609: {'lr': 0.00012989654616355316, 'samples': 19124928, 'steps': 99608, 'loss/train': 1.4571706056594849} 11/07/2021 11:13:21 - INFO - __main__ - Step 99610: {'lr': 0.00012989189195324993, 'samples': 19125120, 'steps': 99609, 'loss/train': 1.104705810546875} 11/07/2021 11:13:21 - INFO - __main__ - Step 99611: {'lr': 0.00012988723779706554, 'samples': 19125312, 'steps': 99610, 'loss/train': 2.103031873703003} 11/07/2021 11:13:22 - INFO - __main__ - Step 99612: {'lr': 0.0001298825836950021, 'samples': 19125504, 'steps': 99611, 'loss/train': 1.5135834217071533} 11/07/2021 11:13:23 - INFO - __main__ - Step 99613: {'lr': 0.00012987792964706175, 'samples': 19125696, 'steps': 99612, 'loss/train': 1.0823694467544556} 11/07/2021 11:13:23 - INFO - __main__ - Step 99614: {'lr': 0.0001298732756532465, 'samples': 19125888, 'steps': 99613, 'loss/train': 1.4249919652938843} 11/07/2021 11:13:23 - INFO - __main__ - Step 99615: {'lr': 0.0001298686217135585, 'samples': 19126080, 'steps': 99614, 'loss/train': 1.5699092149734497} 11/07/2021 11:13:24 - INFO - __main__ - Step 99616: {'lr': 0.00012986396782799987, 'samples': 19126272, 'steps': 99615, 'loss/train': 1.243037223815918} 11/07/2021 11:13:25 - INFO - __main__ - Step 99617: {'lr': 0.00012985931399657277, 'samples': 19126464, 'steps': 99616, 'loss/train': 0.9886820316314697} 11/07/2021 11:13:25 - INFO - __main__ - Step 99618: {'lr': 0.00012985466021927912, 'samples': 19126656, 'steps': 99617, 'loss/train': 1.2005876302719116} 11/07/2021 11:13:25 - INFO - __main__ - Step 99619: {'lr': 0.00012985000649612112, 'samples': 19126848, 'steps': 99618, 'loss/train': 1.3679872751235962} 11/07/2021 11:13:26 - INFO - __main__ - Step 99620: {'lr': 0.0001298453528271008, 'samples': 19127040, 'steps': 99619, 'loss/train': 1.1691551208496094} 11/07/2021 11:13:26 - INFO - __main__ - Step 99621: {'lr': 0.00012984069921222037, 'samples': 19127232, 'steps': 99620, 'loss/train': 1.4387210607528687} 11/07/2021 11:13:27 - INFO - __main__ - Step 99622: {'lr': 0.00012983604565148182, 'samples': 19127424, 'steps': 99621, 'loss/train': 0.8686162829399109} 11/07/2021 11:13:27 - INFO - __main__ - Step 99623: {'lr': 0.00012983139214488732, 'samples': 19127616, 'steps': 99622, 'loss/train': 1.1217694282531738} 11/07/2021 11:13:28 - INFO - __main__ - Step 99624: {'lr': 0.00012982673869243894, 'samples': 19127808, 'steps': 99623, 'loss/train': 1.9725182056427002} 11/07/2021 11:13:28 - INFO - __main__ - Step 99625: {'lr': 0.00012982208529413875, 'samples': 19128000, 'steps': 99624, 'loss/train': 0.8246272206306458} 11/07/2021 11:13:29 - INFO - __main__ - Step 99626: {'lr': 0.00012981743194998891, 'samples': 19128192, 'steps': 99625, 'loss/train': 1.6752445697784424} 11/07/2021 11:13:30 - INFO - __main__ - Step 99627: {'lr': 0.00012981277865999145, 'samples': 19128384, 'steps': 99626, 'loss/train': 1.0437171459197998} 11/07/2021 11:13:30 - INFO - __main__ - Step 99628: {'lr': 0.0001298081254241485, 'samples': 19128576, 'steps': 99627, 'loss/train': 1.0192748308181763} 11/07/2021 11:13:30 - INFO - __main__ - Step 99629: {'lr': 0.0001298034722424622, 'samples': 19128768, 'steps': 99628, 'loss/train': 1.6014412641525269} 11/07/2021 11:13:31 - INFO - __main__ - Step 99630: {'lr': 0.00012979881911493455, 'samples': 19128960, 'steps': 99629, 'loss/train': 0.7714021801948547} 11/07/2021 11:13:31 - INFO - __main__ - Step 99631: {'lr': 0.0001297941660415678, 'samples': 19129152, 'steps': 99630, 'loss/train': 1.7593778371810913} 11/07/2021 11:13:33 - INFO - __main__ - Step 99632: {'lr': 0.00012978951302236385, 'samples': 19129344, 'steps': 99631, 'loss/train': 0.79290771484375} 11/07/2021 11:13:33 - INFO - __main__ - Step 99633: {'lr': 0.00012978486005732492, 'samples': 19129536, 'steps': 99632, 'loss/train': 1.165493130683899} 11/07/2021 11:13:33 - INFO - __main__ - Step 99634: {'lr': 0.00012978020714645306, 'samples': 19129728, 'steps': 99633, 'loss/train': 1.5164276361465454} 11/07/2021 11:13:34 - INFO - __main__ - Step 99635: {'lr': 0.00012977555428975035, 'samples': 19129920, 'steps': 99634, 'loss/train': 1.747701644897461} 11/07/2021 11:13:34 - INFO - __main__ - Step 99636: {'lr': 0.00012977090148721897, 'samples': 19130112, 'steps': 99635, 'loss/train': 1.689514398574829} 11/07/2021 11:13:34 - INFO - __main__ - Step 99637: {'lr': 0.00012976624873886096, 'samples': 19130304, 'steps': 99636, 'loss/train': 1.4372185468673706} 11/07/2021 11:13:35 - INFO - __main__ - Step 99638: {'lr': 0.00012976159604467837, 'samples': 19130496, 'steps': 99637, 'loss/train': 0.7794243693351746} 11/07/2021 11:13:36 - INFO - __main__ - Step 99639: {'lr': 0.00012975694340467341, 'samples': 19130688, 'steps': 99638, 'loss/train': 0.8345756530761719} 11/07/2021 11:13:36 - INFO - __main__ - Step 99640: {'lr': 0.0001297522908188481, 'samples': 19130880, 'steps': 99639, 'loss/train': 1.4992769956588745} 11/07/2021 11:13:37 - INFO - __main__ - Step 99641: {'lr': 0.00012974763828720455, 'samples': 19131072, 'steps': 99640, 'loss/train': 1.1894001960754395} 11/07/2021 11:13:37 - INFO - __main__ - Step 99642: {'lr': 0.00012974298580974484, 'samples': 19131264, 'steps': 99641, 'loss/train': 1.631463646888733} 11/07/2021 11:13:37 - INFO - __main__ - Step 99643: {'lr': 0.00012973833338647108, 'samples': 19131456, 'steps': 99642, 'loss/train': 1.5755860805511475} 11/07/2021 11:13:38 - INFO - __main__ - Step 99644: {'lr': 0.0001297336810173855, 'samples': 19131648, 'steps': 99643, 'loss/train': 1.1299324035644531} 11/07/2021 11:13:39 - INFO - __main__ - Step 99645: {'lr': 0.00012972902870248996, 'samples': 19131840, 'steps': 99644, 'loss/train': 1.5307141542434692} 11/07/2021 11:13:39 - INFO - __main__ - Step 99646: {'lr': 0.00012972437644178666, 'samples': 19132032, 'steps': 99645, 'loss/train': 1.4141945838928223} 11/07/2021 11:13:39 - INFO - __main__ - Step 99647: {'lr': 0.0001297197242352777, 'samples': 19132224, 'steps': 99646, 'loss/train': 0.7351295351982117} 11/07/2021 11:13:40 - INFO - __main__ - Step 99648: {'lr': 0.00012971507208296517, 'samples': 19132416, 'steps': 99647, 'loss/train': 1.340069055557251} 11/07/2021 11:13:41 - INFO - __main__ - Step 99649: {'lr': 0.0001297104199848512, 'samples': 19132608, 'steps': 99648, 'loss/train': 1.453932762145996} 11/07/2021 11:13:41 - INFO - __main__ - Step 99650: {'lr': 0.00012970576794093784, 'samples': 19132800, 'steps': 99649, 'loss/train': 1.3367562294006348} 11/07/2021 11:13:41 - INFO - __main__ - Step 99651: {'lr': 0.0001297011159512272, 'samples': 19132992, 'steps': 99650, 'loss/train': 1.3057100772857666} 11/07/2021 11:13:42 - INFO - __main__ - Step 99652: {'lr': 0.00012969646401572138, 'samples': 19133184, 'steps': 99651, 'loss/train': 1.431180477142334} 11/07/2021 11:13:42 - INFO - __main__ - Step 99653: {'lr': 0.00012969181213442249, 'samples': 19133376, 'steps': 99652, 'loss/train': 1.8231817483901978} 11/07/2021 11:13:43 - INFO - __main__ - Step 99654: {'lr': 0.00012968716030733261, 'samples': 19133568, 'steps': 99653, 'loss/train': 1.0981284379959106} 11/07/2021 11:13:44 - INFO - __main__ - Step 99655: {'lr': 0.00012968250853445383, 'samples': 19133760, 'steps': 99654, 'loss/train': 1.2722516059875488} 11/07/2021 11:13:44 - INFO - __main__ - Step 99656: {'lr': 0.00012967785681578824, 'samples': 19133952, 'steps': 99655, 'loss/train': 1.4641454219818115} 11/07/2021 11:13:44 - INFO - __main__ - Step 99657: {'lr': 0.00012967320515133796, 'samples': 19134144, 'steps': 99656, 'loss/train': 0.6836469173431396} 11/07/2021 11:13:45 - INFO - __main__ - Step 99658: {'lr': 0.00012966855354110517, 'samples': 19134336, 'steps': 99657, 'loss/train': 0.3220337927341461} 11/07/2021 11:13:45 - INFO - __main__ - Step 99659: {'lr': 0.0001296639019850918, 'samples': 19134528, 'steps': 99658, 'loss/train': 1.1853151321411133} 11/07/2021 11:13:46 - INFO - __main__ - Step 99660: {'lr': 0.00012965925048330002, 'samples': 19134720, 'steps': 99659, 'loss/train': 0.6701598763465881} 11/07/2021 11:13:46 - INFO - __main__ - Step 99661: {'lr': 0.0001296545990357319, 'samples': 19134912, 'steps': 99660, 'loss/train': 1.642677903175354} 11/07/2021 11:13:47 - INFO - __main__ - Step 99662: {'lr': 0.00012964994764238957, 'samples': 19135104, 'steps': 99661, 'loss/train': 0.9762499332427979} 11/07/2021 11:13:47 - INFO - __main__ - Step 99663: {'lr': 0.00012964529630327514, 'samples': 19135296, 'steps': 99662, 'loss/train': 1.0847690105438232} 11/07/2021 11:13:47 - INFO - __main__ - Step 99664: {'lr': 0.00012964064501839068, 'samples': 19135488, 'steps': 99663, 'loss/train': 1.1790841817855835} 11/07/2021 11:13:48 - INFO - __main__ - Step 99665: {'lr': 0.00012963599378773826, 'samples': 19135680, 'steps': 99664, 'loss/train': 1.5381412506103516} 11/07/2021 11:13:49 - INFO - __main__ - Step 99666: {'lr': 0.00012963134261132002, 'samples': 19135872, 'steps': 99665, 'loss/train': 1.462415337562561} 11/07/2021 11:13:49 - INFO - __main__ - Step 99667: {'lr': 0.00012962669148913804, 'samples': 19136064, 'steps': 99666, 'loss/train': 1.5833687782287598} 11/07/2021 11:13:50 - INFO - __main__ - Step 99668: {'lr': 0.0001296220404211944, 'samples': 19136256, 'steps': 99667, 'loss/train': 1.6778277158737183} 11/07/2021 11:13:50 - INFO - __main__ - Step 99669: {'lr': 0.00012961738940749123, 'samples': 19136448, 'steps': 99668, 'loss/train': 1.447661280632019} 11/07/2021 11:13:51 - INFO - __main__ - Step 99670: {'lr': 0.00012961273844803057, 'samples': 19136640, 'steps': 99669, 'loss/train': 1.1019049882888794} 11/07/2021 11:13:51 - INFO - __main__ - Step 99671: {'lr': 0.00012960808754281468, 'samples': 19136832, 'steps': 99670, 'loss/train': 1.2858762741088867} 11/07/2021 11:13:52 - INFO - __main__ - Step 99672: {'lr': 0.00012960343669184544, 'samples': 19137024, 'steps': 99671, 'loss/train': 2.0773122310638428} 11/07/2021 11:13:52 - INFO - __main__ - Step 99673: {'lr': 0.00012959878589512502, 'samples': 19137216, 'steps': 99672, 'loss/train': 1.5947965383529663} 11/07/2021 11:13:53 - INFO - __main__ - Step 99674: {'lr': 0.00012959413515265553, 'samples': 19137408, 'steps': 99673, 'loss/train': 0.9156692028045654} 11/07/2021 11:13:53 - INFO - __main__ - Step 99675: {'lr': 0.00012958948446443907, 'samples': 19137600, 'steps': 99674, 'loss/train': 1.4688918590545654} 11/07/2021 11:13:54 - INFO - __main__ - Step 99676: {'lr': 0.00012958483383047773, 'samples': 19137792, 'steps': 99675, 'loss/train': 1.6040067672729492} 11/07/2021 11:13:54 - INFO - __main__ - Step 99677: {'lr': 0.0001295801832507736, 'samples': 19137984, 'steps': 99676, 'loss/train': 1.0703015327453613} 11/07/2021 11:13:55 - INFO - __main__ - Step 99678: {'lr': 0.0001295755327253288, 'samples': 19138176, 'steps': 99677, 'loss/train': 1.4852948188781738} 11/07/2021 11:13:55 - INFO - __main__ - Step 99679: {'lr': 0.00012957088225414539, 'samples': 19138368, 'steps': 99678, 'loss/train': 1.642937421798706} 11/07/2021 11:13:55 - INFO - __main__ - Step 99680: {'lr': 0.00012956623183722543, 'samples': 19138560, 'steps': 99679, 'loss/train': 0.9804261326789856} 11/07/2021 11:13:56 - INFO - __main__ - Step 99681: {'lr': 0.00012956158147457115, 'samples': 19138752, 'steps': 99680, 'loss/train': 1.057584524154663} 11/07/2021 11:13:57 - INFO - __main__ - Step 99682: {'lr': 0.00012955693116618451, 'samples': 19138944, 'steps': 99681, 'loss/train': 1.3627829551696777} 11/07/2021 11:13:57 - INFO - __main__ - Step 99683: {'lr': 0.0001295522809120677, 'samples': 19139136, 'steps': 99682, 'loss/train': 1.5234582424163818} 11/07/2021 11:13:57 - INFO - __main__ - Step 99684: {'lr': 0.00012954763071222286, 'samples': 19139328, 'steps': 99683, 'loss/train': 0.9081193804740906} 11/07/2021 11:13:58 - INFO - __main__ - Step 99685: {'lr': 0.00012954298056665187, 'samples': 19139520, 'steps': 99684, 'loss/train': 0.9612407684326172} 11/07/2021 11:13:59 - INFO - __main__ - Step 99686: {'lr': 0.000129538330475357, 'samples': 19139712, 'steps': 99685, 'loss/train': 0.8116819262504578} 11/07/2021 11:13:59 - INFO - __main__ - Step 99687: {'lr': 0.00012953368043834023, 'samples': 19139904, 'steps': 99686, 'loss/train': 0.9009511470794678} 11/07/2021 11:13:59 - INFO - __main__ - Step 99688: {'lr': 0.0001295290304556038, 'samples': 19140096, 'steps': 99687, 'loss/train': 1.0739623308181763} 11/07/2021 11:14:00 - INFO - __main__ - Step 99689: {'lr': 0.00012952438052714972, 'samples': 19140288, 'steps': 99688, 'loss/train': 1.318109393119812} 11/07/2021 11:14:00 - INFO - __main__ - Step 99690: {'lr': 0.00012951973065298007, 'samples': 19140480, 'steps': 99689, 'loss/train': 1.4280033111572266} 11/07/2021 11:14:01 - INFO - __main__ - Step 99691: {'lr': 0.00012951508083309697, 'samples': 19140672, 'steps': 99690, 'loss/train': 1.1909208297729492} 11/07/2021 11:14:02 - INFO - __main__ - Step 99692: {'lr': 0.00012951043106750252, 'samples': 19140864, 'steps': 99691, 'loss/train': 1.1801596879959106} 11/07/2021 11:14:02 - INFO - __main__ - Step 99693: {'lr': 0.00012950578135619882, 'samples': 19141056, 'steps': 99692, 'loss/train': 4.806519985198975} 11/07/2021 11:14:02 - INFO - __main__ - Step 99694: {'lr': 0.00012950113169918792, 'samples': 19141248, 'steps': 99693, 'loss/train': 1.1422621011734009} 11/07/2021 11:14:03 - INFO - __main__ - Step 99695: {'lr': 0.000129496482096472, 'samples': 19141440, 'steps': 99694, 'loss/train': 2.0147249698638916} 11/07/2021 11:14:03 - INFO - __main__ - Step 99696: {'lr': 0.0001294918325480531, 'samples': 19141632, 'steps': 99695, 'loss/train': 1.5300147533416748} 11/07/2021 11:14:04 - INFO - __main__ - Step 99697: {'lr': 0.00012948718305393327, 'samples': 19141824, 'steps': 99696, 'loss/train': 1.0128235816955566} 11/07/2021 11:14:04 - INFO - __main__ - Step 99698: {'lr': 0.0001294825336141148, 'samples': 19142016, 'steps': 99697, 'loss/train': 1.4615590572357178} 11/07/2021 11:14:05 - INFO - __main__ - Step 99699: {'lr': 0.00012947788422859951, 'samples': 19142208, 'steps': 99698, 'loss/train': 1.7287898063659668} 11/07/2021 11:14:05 - INFO - __main__ - Step 99700: {'lr': 0.00012947323489738966, 'samples': 19142400, 'steps': 99699, 'loss/train': 1.337225317955017} 11/07/2021 11:14:05 - INFO - __main__ - Step 99701: {'lr': 0.0001294685856204873, 'samples': 19142592, 'steps': 99700, 'loss/train': 0.5416224598884583} 11/07/2021 11:14:06 - INFO - __main__ - Step 99702: {'lr': 0.00012946393639789452, 'samples': 19142784, 'steps': 99701, 'loss/train': 1.3768703937530518} 11/07/2021 11:14:07 - INFO - __main__ - Step 99703: {'lr': 0.00012945928722961347, 'samples': 19142976, 'steps': 99702, 'loss/train': 1.5666502714157104} 11/07/2021 11:14:07 - INFO - __main__ - Step 99704: {'lr': 0.00012945463811564616, 'samples': 19143168, 'steps': 99703, 'loss/train': 1.1517456769943237} 11/07/2021 11:14:07 - INFO - __main__ - Step 99705: {'lr': 0.00012944998905599475, 'samples': 19143360, 'steps': 99704, 'loss/train': 1.2105001211166382} 11/07/2021 11:14:08 - INFO - __main__ - Step 99706: {'lr': 0.00012944534005066133, 'samples': 19143552, 'steps': 99705, 'loss/train': 0.8412739038467407} 11/07/2021 11:14:09 - INFO - __main__ - Step 99707: {'lr': 0.00012944069109964795, 'samples': 19143744, 'steps': 99706, 'loss/train': 1.2599642276763916} 11/07/2021 11:14:09 - INFO - __main__ - Step 99708: {'lr': 0.00012943604220295673, 'samples': 19143936, 'steps': 99707, 'loss/train': 1.468335747718811} 11/07/2021 11:14:10 - INFO - __main__ - Step 99709: {'lr': 0.0001294313933605899, 'samples': 19144128, 'steps': 99708, 'loss/train': 1.3792284727096558} 11/07/2021 11:14:10 - INFO - __main__ - Step 99710: {'lr': 0.0001294267445725493, 'samples': 19144320, 'steps': 99709, 'loss/train': 1.4081404209136963} 11/07/2021 11:14:10 - INFO - __main__ - Step 99711: {'lr': 0.00012942209583883716, 'samples': 19144512, 'steps': 99710, 'loss/train': 1.4500527381896973} 11/07/2021 11:14:11 - INFO - __main__ - Step 99712: {'lr': 0.00012941744715945557, 'samples': 19144704, 'steps': 99711, 'loss/train': 0.32748332619667053} 11/07/2021 11:14:12 - INFO - __main__ - Step 99713: {'lr': 0.0001294127985344066, 'samples': 19144896, 'steps': 99712, 'loss/train': 0.9565947651863098} 11/07/2021 11:14:12 - INFO - __main__ - Step 99714: {'lr': 0.0001294081499636924, 'samples': 19145088, 'steps': 99713, 'loss/train': 1.2659810781478882} 11/07/2021 11:14:12 - INFO - __main__ - Step 99715: {'lr': 0.00012940350144731495, 'samples': 19145280, 'steps': 99714, 'loss/train': 1.5313658714294434} 11/07/2021 11:14:13 - INFO - __main__ - Step 99716: {'lr': 0.00012939885298527648, 'samples': 19145472, 'steps': 99715, 'loss/train': 1.6976629495620728} 11/07/2021 11:14:13 - INFO - __main__ - Step 99717: {'lr': 0.000129394204577579, 'samples': 19145664, 'steps': 99716, 'loss/train': 1.521910548210144} 11/07/2021 11:14:14 - INFO - __main__ - Step 99718: {'lr': 0.00012938955622422466, 'samples': 19145856, 'steps': 99717, 'loss/train': 1.1844192743301392} 11/07/2021 11:14:15 - INFO - __main__ - Step 99719: {'lr': 0.0001293849079252155, 'samples': 19146048, 'steps': 99718, 'loss/train': 1.5137403011322021} 11/07/2021 11:14:15 - INFO - __main__ - Step 99720: {'lr': 0.00012938025968055376, 'samples': 19146240, 'steps': 99719, 'loss/train': 0.4820452034473419} 11/07/2021 11:14:15 - INFO - __main__ - Step 99721: {'lr': 0.0001293756114902413, 'samples': 19146432, 'steps': 99720, 'loss/train': 1.5648049116134644} 11/07/2021 11:14:16 - INFO - __main__ - Step 99722: {'lr': 0.00012937096335428034, 'samples': 19146624, 'steps': 99721, 'loss/train': 1.3987045288085938} 11/07/2021 11:14:16 - INFO - __main__ - Step 99723: {'lr': 0.00012936631527267294, 'samples': 19146816, 'steps': 99722, 'loss/train': 0.8937495350837708} 11/07/2021 11:14:17 - INFO - __main__ - Step 99724: {'lr': 0.00012936166724542123, 'samples': 19147008, 'steps': 99723, 'loss/train': 1.0709699392318726} 11/07/2021 11:14:17 - INFO - __main__ - Step 99725: {'lr': 0.0001293570192725273, 'samples': 19147200, 'steps': 99724, 'loss/train': 1.6692699193954468} 11/07/2021 11:14:18 - INFO - __main__ - Step 99726: {'lr': 0.00012935237135399321, 'samples': 19147392, 'steps': 99725, 'loss/train': 1.3875759840011597} 11/07/2021 11:14:18 - INFO - __main__ - Step 99727: {'lr': 0.0001293477234898211, 'samples': 19147584, 'steps': 99726, 'loss/train': 1.290049433708191} 11/07/2021 11:14:18 - INFO - __main__ - Step 99728: {'lr': 0.00012934307568001304, 'samples': 19147776, 'steps': 99727, 'loss/train': 0.939960241317749} 11/07/2021 11:14:19 - INFO - __main__ - Step 99729: {'lr': 0.00012933842792457113, 'samples': 19147968, 'steps': 99728, 'loss/train': 1.2170937061309814} 11/07/2021 11:14:20 - INFO - __main__ - Step 99730: {'lr': 0.00012933378022349747, 'samples': 19148160, 'steps': 99729, 'loss/train': 1.0106863975524902} 11/07/2021 11:14:20 - INFO - __main__ - Step 99731: {'lr': 0.00012932913257679424, 'samples': 19148352, 'steps': 99730, 'loss/train': 1.138999342918396} 11/07/2021 11:14:20 - INFO - __main__ - Step 99732: {'lr': 0.0001293244849844633, 'samples': 19148544, 'steps': 99731, 'loss/train': 1.2931325435638428} 11/07/2021 11:14:21 - INFO - __main__ - Step 99733: {'lr': 0.00012931983744650694, 'samples': 19148736, 'steps': 99732, 'loss/train': 1.33684504032135} 11/07/2021 11:14:22 - INFO - __main__ - Step 99734: {'lr': 0.0001293151899629272, 'samples': 19148928, 'steps': 99733, 'loss/train': 0.9752723574638367} 11/07/2021 11:14:22 - INFO - __main__ - Step 99735: {'lr': 0.00012931054253372616, 'samples': 19149120, 'steps': 99734, 'loss/train': 1.5804922580718994} 11/07/2021 11:14:22 - INFO - __main__ - Step 99736: {'lr': 0.0001293058951589059, 'samples': 19149312, 'steps': 99735, 'loss/train': 0.8821382522583008} 11/07/2021 11:14:23 - INFO - __main__ - Step 99737: {'lr': 0.0001293012478384686, 'samples': 19149504, 'steps': 99736, 'loss/train': 0.9026747345924377} 11/07/2021 11:14:23 - INFO - __main__ - Step 99738: {'lr': 0.00012929660057241622, 'samples': 19149696, 'steps': 99737, 'loss/train': 1.9623489379882812} 11/07/2021 11:14:24 - INFO - __main__ - Step 99739: {'lr': 0.00012929195336075099, 'samples': 19149888, 'steps': 99738, 'loss/train': 1.444045901298523} 11/07/2021 11:14:25 - INFO - __main__ - Step 99740: {'lr': 0.00012928730620347489, 'samples': 19150080, 'steps': 99739, 'loss/train': 1.038154125213623} 11/07/2021 11:14:25 - INFO - __main__ - Step 99741: {'lr': 0.00012928265910059012, 'samples': 19150272, 'steps': 99740, 'loss/train': 1.1706938743591309} 11/07/2021 11:14:25 - INFO - __main__ - Step 99742: {'lr': 0.00012927801205209877, 'samples': 19150464, 'steps': 99741, 'loss/train': 1.11711847782135} 11/07/2021 11:14:26 - INFO - __main__ - Step 99743: {'lr': 0.00012927336505800282, 'samples': 19150656, 'steps': 99742, 'loss/train': 1.4665297269821167} 11/07/2021 11:14:27 - INFO - __main__ - Step 99744: {'lr': 0.00012926871811830444, 'samples': 19150848, 'steps': 99743, 'loss/train': 1.7213983535766602} 11/07/2021 11:14:27 - INFO - __main__ - Step 99745: {'lr': 0.00012926407123300571, 'samples': 19151040, 'steps': 99744, 'loss/train': 1.4479992389678955} 11/07/2021 11:14:27 - INFO - __main__ - Step 99746: {'lr': 0.0001292594244021087, 'samples': 19151232, 'steps': 99745, 'loss/train': 1.4970048666000366} 11/07/2021 11:14:28 - INFO - __main__ - Step 99747: {'lr': 0.00012925477762561554, 'samples': 19151424, 'steps': 99746, 'loss/train': 2.1119370460510254} 11/07/2021 11:14:28 - INFO - __main__ - Step 99748: {'lr': 0.00012925013090352833, 'samples': 19151616, 'steps': 99747, 'loss/train': 0.9309127926826477} 11/07/2021 11:14:29 - INFO - __main__ - Step 99749: {'lr': 0.00012924548423584912, 'samples': 19151808, 'steps': 99748, 'loss/train': 1.6516119241714478} 11/07/2021 11:14:29 - INFO - __main__ - Step 99750: {'lr': 0.00012924083762258005, 'samples': 19152000, 'steps': 99749, 'loss/train': 1.5621763467788696} 11/07/2021 11:14:30 - INFO - __main__ - Step 99751: {'lr': 0.00012923619106372319, 'samples': 19152192, 'steps': 99750, 'loss/train': 2.235356569290161} 11/07/2021 11:14:30 - INFO - __main__ - Step 99752: {'lr': 0.00012923154455928064, 'samples': 19152384, 'steps': 99751, 'loss/train': 1.1958879232406616} 11/07/2021 11:14:30 - INFO - __main__ - Step 99753: {'lr': 0.00012922689810925458, 'samples': 19152576, 'steps': 99752, 'loss/train': 1.2427592277526855} 11/07/2021 11:14:32 - INFO - __main__ - Step 99754: {'lr': 0.00012922225171364693, 'samples': 19152768, 'steps': 99753, 'loss/train': 1.3146698474884033} 11/07/2021 11:14:32 - INFO - __main__ - Step 99755: {'lr': 0.00012921760537245986, 'samples': 19152960, 'steps': 99754, 'loss/train': 0.6468143463134766} 11/07/2021 11:14:32 - INFO - __main__ - Step 99756: {'lr': 0.00012921295908569546, 'samples': 19153152, 'steps': 99755, 'loss/train': 1.0843164920806885} 11/07/2021 11:14:33 - INFO - __main__ - Step 99757: {'lr': 0.0001292083128533559, 'samples': 19153344, 'steps': 99756, 'loss/train': 1.0642870664596558} 11/07/2021 11:14:33 - INFO - __main__ - Step 99758: {'lr': 0.00012920366667544314, 'samples': 19153536, 'steps': 99757, 'loss/train': 1.4833970069885254} 11/07/2021 11:14:33 - INFO - __main__ - Step 99759: {'lr': 0.00012919902055195937, 'samples': 19153728, 'steps': 99758, 'loss/train': 1.6108616590499878} 11/07/2021 11:14:34 - INFO - __main__ - Step 99760: {'lr': 0.00012919437448290666, 'samples': 19153920, 'steps': 99759, 'loss/train': 1.4684091806411743} 11/07/2021 11:14:35 - INFO - __main__ - Step 99761: {'lr': 0.00012918972846828712, 'samples': 19154112, 'steps': 99760, 'loss/train': 1.1055725812911987} 11/07/2021 11:14:35 - INFO - __main__ - Step 99762: {'lr': 0.00012918508250810278, 'samples': 19154304, 'steps': 99761, 'loss/train': 1.2296342849731445} 11/07/2021 11:14:35 - INFO - __main__ - Step 99763: {'lr': 0.0001291804366023558, 'samples': 19154496, 'steps': 99762, 'loss/train': 1.1031019687652588} 11/07/2021 11:14:36 - INFO - __main__ - Step 99764: {'lr': 0.00012917579075104825, 'samples': 19154688, 'steps': 99763, 'loss/train': 1.3444689512252808} 11/07/2021 11:14:37 - INFO - __main__ - Step 99765: {'lr': 0.00012917114495418237, 'samples': 19154880, 'steps': 99764, 'loss/train': 1.3555657863616943} 11/07/2021 11:14:37 - INFO - __main__ - Step 99766: {'lr': 0.00012916649921175993, 'samples': 19155072, 'steps': 99765, 'loss/train': 1.1906741857528687} 11/07/2021 11:14:38 - INFO - __main__ - Step 99767: {'lr': 0.00012916185352378323, 'samples': 19155264, 'steps': 99766, 'loss/train': 1.2434802055358887} 11/07/2021 11:14:38 - INFO - __main__ - Step 99768: {'lr': 0.00012915720789025438, 'samples': 19155456, 'steps': 99767, 'loss/train': 1.1375396251678467} 11/07/2021 11:14:38 - INFO - __main__ - Step 99769: {'lr': 0.00012915256231117532, 'samples': 19155648, 'steps': 99768, 'loss/train': 1.1217751502990723} 11/07/2021 11:14:39 - INFO - __main__ - Step 99770: {'lr': 0.00012914791678654834, 'samples': 19155840, 'steps': 99769, 'loss/train': 0.9467723965644836} 11/07/2021 11:14:40 - INFO - __main__ - Step 99771: {'lr': 0.00012914327131637542, 'samples': 19156032, 'steps': 99770, 'loss/train': 0.841533899307251} 11/07/2021 11:14:40 - INFO - __main__ - Step 99772: {'lr': 0.0001291386259006587, 'samples': 19156224, 'steps': 99771, 'loss/train': 1.7370984554290771} 11/07/2021 11:14:40 - INFO - __main__ - Step 99773: {'lr': 0.00012913398053940024, 'samples': 19156416, 'steps': 99772, 'loss/train': 1.4597140550613403} 11/07/2021 11:14:41 - INFO - __main__ - Step 99774: {'lr': 0.0001291293352326021, 'samples': 19156608, 'steps': 99773, 'loss/train': 1.4565125703811646} 11/07/2021 11:14:42 - INFO - __main__ - Step 99775: {'lr': 0.00012912468998026644, 'samples': 19156800, 'steps': 99774, 'loss/train': 0.920987069606781} 11/07/2021 11:14:42 - INFO - __main__ - Step 99776: {'lr': 0.00012912004478239536, 'samples': 19156992, 'steps': 99775, 'loss/train': 1.453853726387024} 11/07/2021 11:14:42 - INFO - __main__ - Step 99777: {'lr': 0.00012911539963899089, 'samples': 19157184, 'steps': 99776, 'loss/train': 1.281753420829773} 11/07/2021 11:14:43 - INFO - __main__ - Step 99778: {'lr': 0.00012911075455005516, 'samples': 19157376, 'steps': 99777, 'loss/train': 1.61434006690979} 11/07/2021 11:14:43 - INFO - __main__ - Step 99779: {'lr': 0.00012910610951559037, 'samples': 19157568, 'steps': 99778, 'loss/train': 1.5853945016860962} 11/07/2021 11:14:44 - INFO - __main__ - Step 99780: {'lr': 0.0001291014645355984, 'samples': 19157760, 'steps': 99779, 'loss/train': 1.3071086406707764} 11/07/2021 11:14:44 - INFO - __main__ - Step 99781: {'lr': 0.00012909681961008142, 'samples': 19157952, 'steps': 99780, 'loss/train': 1.1577363014221191} 11/07/2021 11:14:45 - INFO - __main__ - Step 99782: {'lr': 0.00012909217473904157, 'samples': 19158144, 'steps': 99781, 'loss/train': 1.650977373123169} 11/07/2021 11:14:45 - INFO - __main__ - Step 99783: {'lr': 0.00012908752992248093, 'samples': 19158336, 'steps': 99782, 'loss/train': 1.1066713333129883} 11/07/2021 11:14:45 - INFO - __main__ - Step 99784: {'lr': 0.00012908288516040155, 'samples': 19158528, 'steps': 99783, 'loss/train': 1.5809214115142822} 11/07/2021 11:14:46 - INFO - __main__ - Step 99785: {'lr': 0.0001290782404528056, 'samples': 19158720, 'steps': 99784, 'loss/train': 1.2185108661651611} 11/07/2021 11:14:47 - INFO - __main__ - Step 99786: {'lr': 0.0001290735957996951, 'samples': 19158912, 'steps': 99785, 'loss/train': 1.3529245853424072} 11/07/2021 11:14:47 - INFO - __main__ - Step 99787: {'lr': 0.0001290689512010722, 'samples': 19159104, 'steps': 99786, 'loss/train': 1.181493878364563} 11/07/2021 11:14:48 - INFO - __main__ - Step 99788: {'lr': 0.0001290643066569389, 'samples': 19159296, 'steps': 99787, 'loss/train': 1.3470009565353394} 11/07/2021 11:14:48 - INFO - __main__ - Step 99789: {'lr': 0.00012905966216729742, 'samples': 19159488, 'steps': 99788, 'loss/train': 1.122991919517517} 11/07/2021 11:14:48 - INFO - __main__ - Step 99790: {'lr': 0.00012905501773214978, 'samples': 19159680, 'steps': 99789, 'loss/train': 1.5542975664138794} 11/07/2021 11:14:50 - INFO - __main__ - Step 99791: {'lr': 0.00012905037335149804, 'samples': 19159872, 'steps': 99790, 'loss/train': 1.0700522661209106} 11/07/2021 11:14:50 - INFO - __main__ - Step 99792: {'lr': 0.0001290457290253445, 'samples': 19160064, 'steps': 99791, 'loss/train': 1.1856613159179688} 11/07/2021 11:14:51 - INFO - __main__ - Step 99793: {'lr': 0.00012904108475369095, 'samples': 19160256, 'steps': 99792, 'loss/train': 0.3109249174594879} 11/07/2021 11:14:51 - INFO - __main__ - Step 99794: {'lr': 0.0001290364405365396, 'samples': 19160448, 'steps': 99793, 'loss/train': 0.6655128002166748} 11/07/2021 11:14:51 - INFO - __main__ - Step 99795: {'lr': 0.00012903179637389263, 'samples': 19160640, 'steps': 99794, 'loss/train': 1.5862308740615845} 11/07/2021 11:14:53 - INFO - __main__ - Step 99796: {'lr': 0.00012902715226575202, 'samples': 19160832, 'steps': 99795, 'loss/train': 1.4952268600463867} 11/07/2021 11:14:53 - INFO - __main__ - Step 99797: {'lr': 0.00012902250821211992, 'samples': 19161024, 'steps': 99796, 'loss/train': 1.2629293203353882} 11/07/2021 11:14:53 - INFO - __main__ - Step 99798: {'lr': 0.00012901786421299838, 'samples': 19161216, 'steps': 99797, 'loss/train': 0.6785764098167419} 11/07/2021 11:14:54 - INFO - __main__ - Step 99799: {'lr': 0.00012901322026838958, 'samples': 19161408, 'steps': 99798, 'loss/train': 1.6183356046676636} 11/07/2021 11:14:54 - INFO - __main__ - Step 99800: {'lr': 0.0001290085763782955, 'samples': 19161600, 'steps': 99799, 'loss/train': 1.3889670372009277} 11/07/2021 11:14:54 - INFO - __main__ - Step 99801: {'lr': 0.0001290039325427183, 'samples': 19161792, 'steps': 99800, 'loss/train': 1.3537391424179077} 11/07/2021 11:14:56 - INFO - __main__ - Step 99802: {'lr': 0.0001289992887616601, 'samples': 19161984, 'steps': 99801, 'loss/train': 0.4716154932975769} 11/07/2021 11:14:56 - INFO - __main__ - Step 99803: {'lr': 0.00012899464503512292, 'samples': 19162176, 'steps': 99802, 'loss/train': 1.4524071216583252} 11/07/2021 11:14:56 - INFO - __main__ - Step 99804: {'lr': 0.0001289900013631089, 'samples': 19162368, 'steps': 99803, 'loss/train': 2.172276020050049} 11/07/2021 11:14:57 - INFO - __main__ - Step 99805: {'lr': 0.0001289853577456202, 'samples': 19162560, 'steps': 99804, 'loss/train': 1.3955937623977661} 11/07/2021 11:14:57 - INFO - __main__ - Step 99806: {'lr': 0.00012898071418265876, 'samples': 19162752, 'steps': 99805, 'loss/train': 1.1120935678482056} 11/07/2021 11:14:57 - INFO - __main__ - Step 99807: {'lr': 0.0001289760706742267, 'samples': 19162944, 'steps': 99806, 'loss/train': 1.769041895866394} 11/07/2021 11:14:59 - INFO - __main__ - Step 99808: {'lr': 0.00012897142722032617, 'samples': 19163136, 'steps': 99807, 'loss/train': 1.5509344339370728} 11/07/2021 11:14:59 - INFO - __main__ - Step 99809: {'lr': 0.00012896678382095928, 'samples': 19163328, 'steps': 99808, 'loss/train': 1.045911431312561} 11/07/2021 11:14:59 - INFO - __main__ - Step 99810: {'lr': 0.00012896214047612806, 'samples': 19163520, 'steps': 99809, 'loss/train': 1.3919957876205444} 11/07/2021 11:15:00 - INFO - __main__ - Step 99811: {'lr': 0.00012895749718583462, 'samples': 19163712, 'steps': 99810, 'loss/train': 1.2906382083892822} 11/07/2021 11:15:00 - INFO - __main__ - Step 99812: {'lr': 0.0001289528539500811, 'samples': 19163904, 'steps': 99811, 'loss/train': 1.721644639968872} 11/07/2021 11:15:00 - INFO - __main__ - Step 99813: {'lr': 0.00012894821076886955, 'samples': 19164096, 'steps': 99812, 'loss/train': 1.684823751449585} 11/07/2021 11:15:01 - INFO - __main__ - Step 99814: {'lr': 0.00012894356764220206, 'samples': 19164288, 'steps': 99813, 'loss/train': 1.1192450523376465} 11/07/2021 11:15:02 - INFO - __main__ - Step 99815: {'lr': 0.00012893892457008072, 'samples': 19164480, 'steps': 99814, 'loss/train': 1.0040501356124878} 11/07/2021 11:15:02 - INFO - __main__ - Step 99816: {'lr': 0.00012893428155250764, 'samples': 19164672, 'steps': 99815, 'loss/train': 0.11251061409711838} 11/07/2021 11:15:03 - INFO - __main__ - Step 99817: {'lr': 0.0001289296385894849, 'samples': 19164864, 'steps': 99816, 'loss/train': 1.396661639213562} 11/07/2021 11:15:03 - INFO - __main__ - Step 99818: {'lr': 0.0001289249956810146, 'samples': 19165056, 'steps': 99817, 'loss/train': 0.7405217289924622} 11/07/2021 11:15:04 - INFO - __main__ - Step 99819: {'lr': 0.0001289203528270989, 'samples': 19165248, 'steps': 99818, 'loss/train': 0.5585747361183167} 11/07/2021 11:15:04 - INFO - __main__ - Step 99820: {'lr': 0.00012891571002773976, 'samples': 19165440, 'steps': 99819, 'loss/train': 1.6264357566833496} 11/07/2021 11:15:05 - INFO - __main__ - Step 99821: {'lr': 0.00012891106728293934, 'samples': 19165632, 'steps': 99820, 'loss/train': 1.3797492980957031} 11/07/2021 11:15:05 - INFO - __main__ - Step 99822: {'lr': 0.00012890642459269968, 'samples': 19165824, 'steps': 99821, 'loss/train': 1.3755820989608765} 11/07/2021 11:15:05 - INFO - __main__ - Step 99823: {'lr': 0.00012890178195702295, 'samples': 19166016, 'steps': 99822, 'loss/train': 0.435686856508255} 11/07/2021 11:15:06 - INFO - __main__ - Step 99824: {'lr': 0.00012889713937591123, 'samples': 19166208, 'steps': 99823, 'loss/train': 0.8462382555007935} 11/07/2021 11:15:07 - INFO - __main__ - Step 99825: {'lr': 0.00012889249684936655, 'samples': 19166400, 'steps': 99824, 'loss/train': 1.7744495868682861} 11/07/2021 11:15:07 - INFO - __main__ - Step 99826: {'lr': 0.00012888785437739102, 'samples': 19166592, 'steps': 99825, 'loss/train': 1.618177056312561} 11/07/2021 11:15:07 - INFO - __main__ - Step 99827: {'lr': 0.0001288832119599868, 'samples': 19166784, 'steps': 99826, 'loss/train': 1.4991806745529175} 11/07/2021 11:15:08 - INFO - __main__ - Step 99828: {'lr': 0.00012887856959715595, 'samples': 19166976, 'steps': 99827, 'loss/train': 1.2951897382736206} 11/07/2021 11:15:09 - INFO - __main__ - Step 99829: {'lr': 0.00012887392728890053, 'samples': 19167168, 'steps': 99828, 'loss/train': 1.527355670928955} 11/07/2021 11:15:09 - INFO - __main__ - Step 99830: {'lr': 0.0001288692850352226, 'samples': 19167360, 'steps': 99829, 'loss/train': 1.7238833904266357} 11/07/2021 11:15:10 - INFO - __main__ - Step 99831: {'lr': 0.00012886464283612436, 'samples': 19167552, 'steps': 99830, 'loss/train': 1.4712799787521362} 11/07/2021 11:15:10 - INFO - __main__ - Step 99832: {'lr': 0.0001288600006916079, 'samples': 19167744, 'steps': 99831, 'loss/train': 3.041618824005127} 11/07/2021 11:15:10 - INFO - __main__ - Step 99833: {'lr': 0.0001288553586016752, 'samples': 19167936, 'steps': 99832, 'loss/train': 4.4410881996154785} 11/07/2021 11:15:11 - INFO - __main__ - Step 99834: {'lr': 0.0001288507165663284, 'samples': 19168128, 'steps': 99833, 'loss/train': 1.3463245630264282} 11/07/2021 11:15:12 - INFO - __main__ - Step 99835: {'lr': 0.00012884607458556958, 'samples': 19168320, 'steps': 99834, 'loss/train': 1.6445159912109375} 11/07/2021 11:15:12 - INFO - __main__ - Step 99836: {'lr': 0.00012884143265940086, 'samples': 19168512, 'steps': 99835, 'loss/train': 1.5450348854064941} 11/07/2021 11:15:12 - INFO - __main__ - Step 99837: {'lr': 0.00012883679078782429, 'samples': 19168704, 'steps': 99836, 'loss/train': 1.2354274988174438} 11/07/2021 11:15:13 - INFO - __main__ - Step 99838: {'lr': 0.00012883214897084204, 'samples': 19168896, 'steps': 99837, 'loss/train': 1.2175498008728027} 11/07/2021 11:15:13 - INFO - __main__ - Step 99839: {'lr': 0.0001288275072084561, 'samples': 19169088, 'steps': 99838, 'loss/train': 1.5376003980636597} 11/07/2021 11:15:14 - INFO - __main__ - Step 99840: {'lr': 0.00012882286550066865, 'samples': 19169280, 'steps': 99839, 'loss/train': 0.48129066824913025} 11/07/2021 11:15:14 - INFO - __main__ - Step 99841: {'lr': 0.00012881822384748176, 'samples': 19169472, 'steps': 99840, 'loss/train': 1.4934301376342773} 11/07/2021 11:15:15 - INFO - __main__ - Step 99842: {'lr': 0.0001288135822488975, 'samples': 19169664, 'steps': 99841, 'loss/train': 0.7169214487075806} 11/07/2021 11:15:15 - INFO - __main__ - Step 99843: {'lr': 0.00012880894070491794, 'samples': 19169856, 'steps': 99842, 'loss/train': 1.110724925994873} 11/07/2021 11:15:15 - INFO - __main__ - Step 99844: {'lr': 0.0001288042992155452, 'samples': 19170048, 'steps': 99843, 'loss/train': 1.717084527015686} 11/07/2021 11:15:16 - INFO - __main__ - Step 99845: {'lr': 0.0001287996577807814, 'samples': 19170240, 'steps': 99844, 'loss/train': 0.9387722015380859} 11/07/2021 11:15:17 - INFO - __main__ - Step 99846: {'lr': 0.0001287950164006287, 'samples': 19170432, 'steps': 99845, 'loss/train': 1.215179681777954} 11/07/2021 11:15:17 - INFO - __main__ - Step 99847: {'lr': 0.000128790375075089, 'samples': 19170624, 'steps': 99846, 'loss/train': 1.2712520360946655} 11/07/2021 11:15:18 - INFO - __main__ - Step 99848: {'lr': 0.00012878573380416448, 'samples': 19170816, 'steps': 99847, 'loss/train': 0.6767792105674744} 11/07/2021 11:15:18 - INFO - __main__ - Step 99849: {'lr': 0.00012878109258785726, 'samples': 19171008, 'steps': 99848, 'loss/train': 1.7320449352264404} 11/07/2021 11:15:19 - INFO - __main__ - Step 99850: {'lr': 0.00012877645142616936, 'samples': 19171200, 'steps': 99849, 'loss/train': 1.5783408880233765} 11/07/2021 11:15:19 - INFO - __main__ - Step 99851: {'lr': 0.00012877181031910296, 'samples': 19171392, 'steps': 99850, 'loss/train': 0.9086865782737732} 11/07/2021 11:15:20 - INFO - __main__ - Step 99852: {'lr': 0.0001287671692666601, 'samples': 19171584, 'steps': 99851, 'loss/train': 1.2101329565048218} 11/07/2021 11:15:20 - INFO - __main__ - Step 99853: {'lr': 0.00012876252826884288, 'samples': 19171776, 'steps': 99852, 'loss/train': 1.7974975109100342} 11/07/2021 11:15:20 - INFO - __main__ - Step 99854: {'lr': 0.00012875788732565337, 'samples': 19171968, 'steps': 99853, 'loss/train': 1.0017246007919312} 11/07/2021 11:15:21 - INFO - __main__ - Step 99855: {'lr': 0.00012875324643709375, 'samples': 19172160, 'steps': 99854, 'loss/train': 1.4122955799102783} 11/07/2021 11:15:22 - INFO - __main__ - Step 99856: {'lr': 0.00012874860560316598, 'samples': 19172352, 'steps': 99855, 'loss/train': 0.9655036330223083} 11/07/2021 11:15:22 - INFO - __main__ - Step 99857: {'lr': 0.00012874396482387223, 'samples': 19172544, 'steps': 99856, 'loss/train': 1.0150150060653687} 11/07/2021 11:15:22 - INFO - __main__ - Step 99858: {'lr': 0.0001287393240992146, 'samples': 19172736, 'steps': 99857, 'loss/train': 1.6685283184051514} 11/07/2021 11:15:23 - INFO - __main__ - Step 99859: {'lr': 0.00012873468342919527, 'samples': 19172928, 'steps': 99858, 'loss/train': 1.3689769506454468} 11/07/2021 11:15:23 - INFO - __main__ - Step 99860: {'lr': 0.0001287300428138161, 'samples': 19173120, 'steps': 99859, 'loss/train': 5.601853370666504} 11/07/2021 11:15:24 - INFO - __main__ - Step 99861: {'lr': 0.00012872540225307926, 'samples': 19173312, 'steps': 99860, 'loss/train': 1.6887156963348389} 11/07/2021 11:15:25 - INFO - __main__ - Step 99862: {'lr': 0.00012872076174698694, 'samples': 19173504, 'steps': 99861, 'loss/train': 1.3536051511764526} 11/07/2021 11:15:25 - INFO - __main__ - Step 99863: {'lr': 0.00012871612129554118, 'samples': 19173696, 'steps': 99862, 'loss/train': 1.4586340188980103} 11/07/2021 11:15:25 - INFO - __main__ - Step 99864: {'lr': 0.00012871148089874403, 'samples': 19173888, 'steps': 99863, 'loss/train': 0.25478899478912354} 11/07/2021 11:15:26 - INFO - __main__ - Step 99865: {'lr': 0.00012870684055659766, 'samples': 19174080, 'steps': 99864, 'loss/train': 1.0876471996307373} 11/07/2021 11:15:27 - INFO - __main__ - Step 99866: {'lr': 0.00012870220026910405, 'samples': 19174272, 'steps': 99865, 'loss/train': 1.1669049263000488} 11/07/2021 11:15:27 - INFO - __main__ - Step 99867: {'lr': 0.0001286975600362654, 'samples': 19174464, 'steps': 99866, 'loss/train': 0.9150713682174683} 11/07/2021 11:15:27 - INFO - __main__ - Step 99868: {'lr': 0.00012869291985808374, 'samples': 19174656, 'steps': 99867, 'loss/train': 1.3241147994995117} 11/07/2021 11:15:28 - INFO - __main__ - Step 99869: {'lr': 0.0001286882797345612, 'samples': 19174848, 'steps': 99868, 'loss/train': 1.362058401107788} 11/07/2021 11:15:28 - INFO - __main__ - Step 99870: {'lr': 0.00012868363966569984, 'samples': 19175040, 'steps': 99869, 'loss/train': 0.7408824563026428} 11/07/2021 11:15:28 - INFO - __main__ - Step 99871: {'lr': 0.00012867899965150176, 'samples': 19175232, 'steps': 99870, 'loss/train': 1.5323783159255981} 11/07/2021 11:15:29 - INFO - __main__ - Step 99872: {'lr': 0.00012867435969196903, 'samples': 19175424, 'steps': 99871, 'loss/train': 0.5870841145515442} 11/07/2021 11:15:30 - INFO - __main__ - Step 99873: {'lr': 0.0001286697197871039, 'samples': 19175616, 'steps': 99872, 'loss/train': 1.1972203254699707} 11/07/2021 11:15:30 - INFO - __main__ - Step 99874: {'lr': 0.00012866507993690817, 'samples': 19175808, 'steps': 99873, 'loss/train': 1.5116145610809326} 11/07/2021 11:15:30 - INFO - __main__ - Step 99875: {'lr': 0.00012866044014138412, 'samples': 19176000, 'steps': 99874, 'loss/train': 1.3592785596847534} 11/07/2021 11:15:31 - INFO - __main__ - Step 99876: {'lr': 0.0001286558004005338, 'samples': 19176192, 'steps': 99875, 'loss/train': 1.4928697347640991} 11/07/2021 11:15:32 - INFO - __main__ - Step 99877: {'lr': 0.00012865116071435927, 'samples': 19176384, 'steps': 99876, 'loss/train': 1.0702846050262451} 11/07/2021 11:15:32 - INFO - __main__ - Step 99878: {'lr': 0.00012864652108286273, 'samples': 19176576, 'steps': 99877, 'loss/train': 1.3312352895736694} 11/07/2021 11:15:33 - INFO - __main__ - Step 99879: {'lr': 0.00012864188150604614, 'samples': 19176768, 'steps': 99878, 'loss/train': 1.5963878631591797} 11/07/2021 11:15:33 - INFO - __main__ - Step 99880: {'lr': 0.00012863724198391164, 'samples': 19176960, 'steps': 99879, 'loss/train': 1.4560528993606567} 11/07/2021 11:15:33 - INFO - __main__ - Step 99881: {'lr': 0.00012863260251646136, 'samples': 19177152, 'steps': 99880, 'loss/train': 1.3619632720947266} 11/07/2021 11:15:34 - INFO - __main__ - Step 99882: {'lr': 0.00012862796310369735, 'samples': 19177344, 'steps': 99881, 'loss/train': 1.7678886651992798} 11/07/2021 11:15:35 - INFO - __main__ - Step 99883: {'lr': 0.0001286233237456217, 'samples': 19177536, 'steps': 99882, 'loss/train': 1.1020559072494507} 11/07/2021 11:15:35 - INFO - __main__ - Step 99884: {'lr': 0.00012861868444223644, 'samples': 19177728, 'steps': 99883, 'loss/train': 1.4464011192321777} 11/07/2021 11:15:35 - INFO - __main__ - Step 99885: {'lr': 0.0001286140451935438, 'samples': 19177920, 'steps': 99884, 'loss/train': 1.1684197187423706} 11/07/2021 11:15:36 - INFO - __main__ - Step 99886: {'lr': 0.0001286094059995459, 'samples': 19178112, 'steps': 99885, 'loss/train': 1.9417990446090698} 11/07/2021 11:15:37 - INFO - __main__ - Step 99887: {'lr': 0.00012860476686024465, 'samples': 19178304, 'steps': 99886, 'loss/train': 1.6553213596343994} 11/07/2021 11:15:37 - INFO - __main__ - Step 99888: {'lr': 0.00012860012777564218, 'samples': 19178496, 'steps': 99887, 'loss/train': 1.0698927640914917} 11/07/2021 11:15:37 - INFO - __main__ - Step 99889: {'lr': 0.0001285954887457406, 'samples': 19178688, 'steps': 99888, 'loss/train': 1.538619041442871} 11/07/2021 11:15:38 - INFO - __main__ - Step 99890: {'lr': 0.00012859084977054203, 'samples': 19178880, 'steps': 99889, 'loss/train': 1.0120488405227661} 11/07/2021 11:15:38 - INFO - __main__ - Step 99891: {'lr': 0.00012858621085004858, 'samples': 19179072, 'steps': 99890, 'loss/train': 1.4833464622497559} 11/07/2021 11:15:39 - INFO - __main__ - Step 99892: {'lr': 0.0001285815719842623, 'samples': 19179264, 'steps': 99891, 'loss/train': 1.2556387186050415} 11/07/2021 11:15:40 - INFO - __main__ - Step 99893: {'lr': 0.00012857693317318527, 'samples': 19179456, 'steps': 99892, 'loss/train': 1.2771600484848022} 11/07/2021 11:15:40 - INFO - __main__ - Step 99894: {'lr': 0.00012857229441681962, 'samples': 19179648, 'steps': 99893, 'loss/train': 1.5086427927017212} 11/07/2021 11:15:40 - INFO - __main__ - Step 99895: {'lr': 0.00012856765571516744, 'samples': 19179840, 'steps': 99894, 'loss/train': 1.851070761680603} 11/07/2021 11:15:41 - INFO - __main__ - Step 99896: {'lr': 0.00012856301706823075, 'samples': 19180032, 'steps': 99895, 'loss/train': 1.0583425760269165} 11/07/2021 11:15:41 - INFO - __main__ - Step 99897: {'lr': 0.0001285583784760117, 'samples': 19180224, 'steps': 99896, 'loss/train': 1.108759880065918} 11/07/2021 11:15:42 - INFO - __main__ - Step 99898: {'lr': 0.00012855373993851237, 'samples': 19180416, 'steps': 99897, 'loss/train': 0.8924877047538757} 11/07/2021 11:15:42 - INFO - __main__ - Step 99899: {'lr': 0.0001285491014557349, 'samples': 19180608, 'steps': 99898, 'loss/train': 1.1226590871810913} 11/07/2021 11:15:43 - INFO - __main__ - Step 99900: {'lr': 0.00012854446302768138, 'samples': 19180800, 'steps': 99899, 'loss/train': 1.292602300643921} 11/07/2021 11:15:43 - INFO - __main__ - Step 99901: {'lr': 0.00012853982465435375, 'samples': 19180992, 'steps': 99900, 'loss/train': 1.358789086341858} 11/07/2021 11:15:43 - INFO - __main__ - Step 99902: {'lr': 0.00012853518633575421, 'samples': 19181184, 'steps': 99901, 'loss/train': 0.9761576056480408} 11/07/2021 11:15:45 - INFO - __main__ - Step 99903: {'lr': 0.00012853054807188488, 'samples': 19181376, 'steps': 99902, 'loss/train': 0.6807472109794617} 11/07/2021 11:15:45 - INFO - __main__ - Step 99904: {'lr': 0.00012852590986274777, 'samples': 19181568, 'steps': 99903, 'loss/train': 0.9773818850517273} 11/07/2021 11:15:45 - INFO - __main__ - Step 99905: {'lr': 0.00012852127170834504, 'samples': 19181760, 'steps': 99904, 'loss/train': 1.638071060180664} 11/07/2021 11:15:46 - INFO - __main__ - Step 99906: {'lr': 0.00012851663360867872, 'samples': 19181952, 'steps': 99905, 'loss/train': 1.3133898973464966} 11/07/2021 11:15:46 - INFO - __main__ - Step 99907: {'lr': 0.00012851199556375095, 'samples': 19182144, 'steps': 99906, 'loss/train': 1.4155277013778687} 11/07/2021 11:15:47 - INFO - __main__ - Step 99908: {'lr': 0.0001285073575735638, 'samples': 19182336, 'steps': 99907, 'loss/train': 1.6784000396728516} 11/07/2021 11:15:47 - INFO - __main__ - Step 99909: {'lr': 0.00012850271963811932, 'samples': 19182528, 'steps': 99908, 'loss/train': 1.6383332014083862} 11/07/2021 11:15:48 - INFO - __main__ - Step 99910: {'lr': 0.0001284980817574197, 'samples': 19182720, 'steps': 99909, 'loss/train': 0.6470781564712524} 11/07/2021 11:15:48 - INFO - __main__ - Step 99911: {'lr': 0.00012849344393146695, 'samples': 19182912, 'steps': 99910, 'loss/train': 1.0668801069259644} 11/07/2021 11:15:48 - INFO - __main__ - Step 99912: {'lr': 0.00012848880616026315, 'samples': 19183104, 'steps': 99911, 'loss/train': 1.21638023853302} 11/07/2021 11:15:50 - INFO - __main__ - Step 99913: {'lr': 0.00012848416844381055, 'samples': 19183296, 'steps': 99912, 'loss/train': 1.3579556941986084} 11/07/2021 11:15:50 - INFO - __main__ - Step 99914: {'lr': 0.000128479530782111, 'samples': 19183488, 'steps': 99913, 'loss/train': 0.5455717444419861} 11/07/2021 11:15:50 - INFO - __main__ - Step 99915: {'lr': 0.0001284748931751667, 'samples': 19183680, 'steps': 99914, 'loss/train': 1.4105679988861084} 11/07/2021 11:15:51 - INFO - __main__ - Step 99916: {'lr': 0.0001284702556229797, 'samples': 19183872, 'steps': 99915, 'loss/train': 0.9492663145065308} 11/07/2021 11:15:51 - INFO - __main__ - Step 99917: {'lr': 0.00012846561812555218, 'samples': 19184064, 'steps': 99916, 'loss/train': 1.184866189956665} 11/07/2021 11:15:52 - INFO - __main__ - Step 99918: {'lr': 0.00012846098068288614, 'samples': 19184256, 'steps': 99917, 'loss/train': 1.4692776203155518} 11/07/2021 11:15:52 - INFO - __main__ - Step 99919: {'lr': 0.00012845634329498374, 'samples': 19184448, 'steps': 99918, 'loss/train': 1.6404054164886475} 11/07/2021 11:15:53 - INFO - __main__ - Step 99920: {'lr': 0.00012845170596184703, 'samples': 19184640, 'steps': 99919, 'loss/train': 1.2929213047027588} 11/07/2021 11:15:53 - INFO - __main__ - Step 99921: {'lr': 0.0001284470686834781, 'samples': 19184832, 'steps': 99920, 'loss/train': 1.3654181957244873} 11/07/2021 11:15:54 - INFO - __main__ - Step 99922: {'lr': 0.00012844243145987902, 'samples': 19185024, 'steps': 99921, 'loss/train': 0.09121531248092651} 11/07/2021 11:15:54 - INFO - __main__ - Step 99923: {'lr': 0.00012843779429105192, 'samples': 19185216, 'steps': 99922, 'loss/train': 1.159901738166809} 11/07/2021 11:15:55 - INFO - __main__ - Step 99924: {'lr': 0.00012843315717699888, 'samples': 19185408, 'steps': 99923, 'loss/train': 1.684001088142395} 11/07/2021 11:15:55 - INFO - __main__ - Step 99925: {'lr': 0.000128428520117722, 'samples': 19185600, 'steps': 99924, 'loss/train': 1.3319934606552124} 11/07/2021 11:15:56 - INFO - __main__ - Step 99926: {'lr': 0.00012842388311322346, 'samples': 19185792, 'steps': 99925, 'loss/train': 1.4066599607467651} 11/07/2021 11:15:56 - INFO - __main__ - Step 99927: {'lr': 0.00012841924616350509, 'samples': 19185984, 'steps': 99926, 'loss/train': 1.3330224752426147} 11/07/2021 11:15:56 - INFO - __main__ - Step 99928: {'lr': 0.00012841460926856917, 'samples': 19186176, 'steps': 99927, 'loss/train': 1.7161089181900024} 11/07/2021 11:15:57 - INFO - __main__ - Step 99929: {'lr': 0.00012840997242841772, 'samples': 19186368, 'steps': 99928, 'loss/train': 1.1155798435211182} 11/07/2021 11:15:58 - INFO - __main__ - Step 99930: {'lr': 0.0001284053356430529, 'samples': 19186560, 'steps': 99929, 'loss/train': 1.1044566631317139} 11/07/2021 11:15:58 - INFO - __main__ - Step 99931: {'lr': 0.00012840069891247675, 'samples': 19186752, 'steps': 99930, 'loss/train': 1.1718884706497192} 11/07/2021 11:15:58 - INFO - __main__ - Step 99932: {'lr': 0.00012839606223669135, 'samples': 19186944, 'steps': 99931, 'loss/train': 1.517983078956604} 11/07/2021 11:15:59 - INFO - __main__ - Step 99933: {'lr': 0.00012839142561569882, 'samples': 19187136, 'steps': 99932, 'loss/train': 0.9763840436935425} 11/07/2021 11:16:00 - INFO - __main__ - Step 99934: {'lr': 0.00012838678904950125, 'samples': 19187328, 'steps': 99933, 'loss/train': 1.8462350368499756} 11/07/2021 11:16:00 - INFO - __main__ - Step 99935: {'lr': 0.00012838215253810069, 'samples': 19187520, 'steps': 99934, 'loss/train': 0.9383664131164551} 11/07/2021 11:16:00 - INFO - __main__ - Step 99936: {'lr': 0.00012837751608149925, 'samples': 19187712, 'steps': 99935, 'loss/train': 1.164741039276123} 11/07/2021 11:16:01 - INFO - __main__ - Step 99937: {'lr': 0.00012837287967969904, 'samples': 19187904, 'steps': 99936, 'loss/train': 1.4513864517211914} 11/07/2021 11:16:01 - INFO - __main__ - Step 99938: {'lr': 0.00012836824333270215, 'samples': 19188096, 'steps': 99937, 'loss/train': 1.4346915483474731} 11/07/2021 11:16:02 - INFO - __main__ - Step 99939: {'lr': 0.00012836360704051065, 'samples': 19188288, 'steps': 99938, 'loss/train': 1.3132206201553345} 11/07/2021 11:16:02 - INFO - __main__ - Step 99940: {'lr': 0.00012835897080312668, 'samples': 19188480, 'steps': 99939, 'loss/train': 1.2757151126861572} 11/07/2021 11:16:03 - INFO - __main__ - Step 99941: {'lr': 0.00012835433462055223, 'samples': 19188672, 'steps': 99940, 'loss/train': 1.6491996049880981} 11/07/2021 11:16:03 - INFO - __main__ - Step 99942: {'lr': 0.00012834969849278945, 'samples': 19188864, 'steps': 99941, 'loss/train': 1.0890833139419556} 11/07/2021 11:16:03 - INFO - __main__ - Step 99943: {'lr': 0.0001283450624198404, 'samples': 19189056, 'steps': 99942, 'loss/train': 1.1059411764144897} 11/07/2021 11:16:05 - INFO - __main__ - Step 99944: {'lr': 0.0001283404264017072, 'samples': 19189248, 'steps': 99943, 'loss/train': 1.2986537218093872} 11/07/2021 11:16:05 - INFO - __main__ - Step 99945: {'lr': 0.0001283357904383919, 'samples': 19189440, 'steps': 99944, 'loss/train': 1.259939193725586} 11/07/2021 11:16:05 - INFO - __main__ - Step 99946: {'lr': 0.0001283311545298966, 'samples': 19189632, 'steps': 99945, 'loss/train': 1.3964414596557617} 11/07/2021 11:16:06 - INFO - __main__ - Step 99947: {'lr': 0.00012832651867622345, 'samples': 19189824, 'steps': 99946, 'loss/train': 0.9355164170265198} 11/07/2021 11:16:06 - INFO - __main__ - Step 99948: {'lr': 0.00012832188287737446, 'samples': 19190016, 'steps': 99947, 'loss/train': 1.7511619329452515} 11/07/2021 11:16:06 - INFO - __main__ - Step 99949: {'lr': 0.00012831724713335179, 'samples': 19190208, 'steps': 99948, 'loss/train': 1.270477294921875} 11/07/2021 11:16:07 - INFO - __main__ - Step 99950: {'lr': 0.00012831261144415746, 'samples': 19190400, 'steps': 99949, 'loss/train': 0.6926747560501099} 11/07/2021 11:16:08 - INFO - __main__ - Step 99951: {'lr': 0.0001283079758097936, 'samples': 19190592, 'steps': 99950, 'loss/train': 0.6936349272727966} 11/07/2021 11:16:08 - INFO - __main__ - Step 99952: {'lr': 0.00012830334023026228, 'samples': 19190784, 'steps': 99951, 'loss/train': 1.0891342163085938} 11/07/2021 11:16:08 - INFO - __main__ - Step 99953: {'lr': 0.0001282987047055657, 'samples': 19190976, 'steps': 99952, 'loss/train': 0.21213218569755554} 11/07/2021 11:16:09 - INFO - __main__ - Step 99954: {'lr': 0.00012829406923570575, 'samples': 19191168, 'steps': 99953, 'loss/train': 1.2220518589019775} 11/07/2021 11:16:10 - INFO - __main__ - Step 99955: {'lr': 0.00012828943382068458, 'samples': 19191360, 'steps': 99954, 'loss/train': 1.4520586729049683} 11/07/2021 11:16:10 - INFO - __main__ - Step 99956: {'lr': 0.00012828479846050436, 'samples': 19191552, 'steps': 99955, 'loss/train': 1.4244593381881714} 11/07/2021 11:16:11 - INFO - __main__ - Step 99957: {'lr': 0.0001282801631551671, 'samples': 19191744, 'steps': 99956, 'loss/train': 1.2319992780685425} 11/07/2021 11:16:11 - INFO - __main__ - Step 99958: {'lr': 0.00012827552790467496, 'samples': 19191936, 'steps': 99957, 'loss/train': 1.2586638927459717} 11/07/2021 11:16:11 - INFO - __main__ - Step 99959: {'lr': 0.00012827089270902998, 'samples': 19192128, 'steps': 99958, 'loss/train': 1.4279600381851196} 11/07/2021 11:16:12 - INFO - __main__ - Step 99960: {'lr': 0.00012826625756823425, 'samples': 19192320, 'steps': 99959, 'loss/train': 1.389358401298523} 11/07/2021 11:16:13 - INFO - __main__ - Step 99961: {'lr': 0.00012826162248228985, 'samples': 19192512, 'steps': 99960, 'loss/train': 1.1533615589141846} 11/07/2021 11:16:13 - INFO - __main__ - Step 99962: {'lr': 0.0001282569874511989, 'samples': 19192704, 'steps': 99961, 'loss/train': 0.10397496074438095} 11/07/2021 11:16:13 - INFO - __main__ - Step 99963: {'lr': 0.00012825235247496347, 'samples': 19192896, 'steps': 99962, 'loss/train': 1.720096468925476} 11/07/2021 11:16:14 - INFO - __main__ - Step 99964: {'lr': 0.00012824771755358565, 'samples': 19193088, 'steps': 99963, 'loss/train': 1.270076036453247} 11/07/2021 11:16:15 - INFO - __main__ - Step 99965: {'lr': 0.00012824308268706753, 'samples': 19193280, 'steps': 99964, 'loss/train': 0.9555913805961609} 11/07/2021 11:16:15 - INFO - __main__ - Step 99966: {'lr': 0.00012823844787541116, 'samples': 19193472, 'steps': 99965, 'loss/train': 1.281333327293396} 11/07/2021 11:16:15 - INFO - __main__ - Step 99967: {'lr': 0.00012823381311861883, 'samples': 19193664, 'steps': 99966, 'loss/train': 1.3881382942199707} 11/07/2021 11:16:16 - INFO - __main__ - Step 99968: {'lr': 0.00012822917841669233, 'samples': 19193856, 'steps': 99967, 'loss/train': 1.637995719909668} 11/07/2021 11:16:16 - INFO - __main__ - Step 99969: {'lr': 0.0001282245437696339, 'samples': 19194048, 'steps': 99968, 'loss/train': 0.9154284596443176} 11/07/2021 11:16:17 - INFO - __main__ - Step 99970: {'lr': 0.00012821990917744557, 'samples': 19194240, 'steps': 99969, 'loss/train': 1.0070785284042358} 11/07/2021 11:16:18 - INFO - __main__ - Step 99971: {'lr': 0.0001282152746401295, 'samples': 19194432, 'steps': 99970, 'loss/train': 1.5794538259506226} 11/07/2021 11:16:18 - INFO - __main__ - Step 99972: {'lr': 0.00012821064015768776, 'samples': 19194624, 'steps': 99971, 'loss/train': 1.6185334920883179} 11/07/2021 11:16:18 - INFO - __main__ - Step 99973: {'lr': 0.00012820600573012242, 'samples': 19194816, 'steps': 99972, 'loss/train': 1.3304331302642822} 11/07/2021 11:16:19 - INFO - __main__ - Step 99974: {'lr': 0.0001282013713574356, 'samples': 19195008, 'steps': 99973, 'loss/train': 1.3095077276229858} 11/07/2021 11:16:19 - INFO - __main__ - Step 99975: {'lr': 0.0001281967370396293, 'samples': 19195200, 'steps': 99974, 'loss/train': 1.387181043624878} 11/07/2021 11:16:20 - INFO - __main__ - Step 99976: {'lr': 0.0001281921027767057, 'samples': 19195392, 'steps': 99975, 'loss/train': 1.3248534202575684} 11/07/2021 11:16:21 - INFO - __main__ - Step 99977: {'lr': 0.00012818746856866687, 'samples': 19195584, 'steps': 99976, 'loss/train': 2.0135583877563477} 11/07/2021 11:16:21 - INFO - __main__ - Step 99978: {'lr': 0.00012818283441551497, 'samples': 19195776, 'steps': 99977, 'loss/train': 1.5000567436218262} 11/07/2021 11:16:21 - INFO - __main__ - Step 99979: {'lr': 0.0001281782003172519, 'samples': 19195968, 'steps': 99978, 'loss/train': 1.5795745849609375} 11/07/2021 11:16:22 - INFO - __main__ - Step 99980: {'lr': 0.00012817356627387987, 'samples': 19196160, 'steps': 99979, 'loss/train': 1.7154347896575928} 11/07/2021 11:16:23 - INFO - __main__ - Step 99981: {'lr': 0.00012816893228540096, 'samples': 19196352, 'steps': 99980, 'loss/train': 0.11059805005788803} 11/07/2021 11:16:23 - INFO - __main__ - Step 99982: {'lr': 0.00012816429835181727, 'samples': 19196544, 'steps': 99981, 'loss/train': 1.3716168403625488} 11/07/2021 11:16:23 - INFO - __main__ - Step 99983: {'lr': 0.00012815966447313082, 'samples': 19196736, 'steps': 99982, 'loss/train': 1.4705363512039185} 11/07/2021 11:16:24 - INFO - __main__ - Step 99984: {'lr': 0.00012815503064934376, 'samples': 19196928, 'steps': 99983, 'loss/train': 1.360336422920227} 11/07/2021 11:16:24 - INFO - __main__ - Step 99985: {'lr': 0.00012815039688045816, 'samples': 19197120, 'steps': 99984, 'loss/train': 0.796242892742157} 11/07/2021 11:16:25 - INFO - __main__ - Step 99986: {'lr': 0.00012814576316647611, 'samples': 19197312, 'steps': 99985, 'loss/train': 1.2831625938415527} 11/07/2021 11:16:25 - INFO - __main__ - Step 99987: {'lr': 0.0001281411295073997, 'samples': 19197504, 'steps': 99986, 'loss/train': 1.5431480407714844} 11/07/2021 11:16:26 - INFO - __main__ - Step 99988: {'lr': 0.00012813649590323102, 'samples': 19197696, 'steps': 99987, 'loss/train': 1.419144630432129} 11/07/2021 11:16:26 - INFO - __main__ - Step 99989: {'lr': 0.00012813186235397224, 'samples': 19197888, 'steps': 99988, 'loss/train': 1.294065237045288} 11/07/2021 11:16:27 - INFO - __main__ - Step 99990: {'lr': 0.0001281272288596253, 'samples': 19198080, 'steps': 99989, 'loss/train': 1.5755294561386108} 11/07/2021 11:16:27 - INFO - __main__ - Step 99991: {'lr': 0.00012812259542019234, 'samples': 19198272, 'steps': 99990, 'loss/train': 1.153651475906372} 11/07/2021 11:16:28 - INFO - __main__ - Step 99992: {'lr': 0.00012811796203567543, 'samples': 19198464, 'steps': 99991, 'loss/train': 1.4147253036499023} 11/07/2021 11:16:28 - INFO - __main__ - Step 99993: {'lr': 0.00012811332870607667, 'samples': 19198656, 'steps': 99992, 'loss/train': 1.0042076110839844} 11/07/2021 11:16:29 - INFO - __main__ - Step 99994: {'lr': 0.0001281086954313982, 'samples': 19198848, 'steps': 99993, 'loss/train': 0.8240808248519897} 11/07/2021 11:16:29 - INFO - __main__ - Step 99995: {'lr': 0.00012810406221164207, 'samples': 19199040, 'steps': 99994, 'loss/train': 1.617904782295227} 11/07/2021 11:16:30 - INFO - __main__ - Step 99996: {'lr': 0.00012809942904681038, 'samples': 19199232, 'steps': 99995, 'loss/train': 1.3491076231002808} 11/07/2021 11:16:30 - INFO - __main__ - Step 99997: {'lr': 0.00012809479593690518, 'samples': 19199424, 'steps': 99996, 'loss/train': 1.399559497833252} 11/07/2021 11:16:31 - INFO - __main__ - Step 99998: {'lr': 0.0001280901628819286, 'samples': 19199616, 'steps': 99997, 'loss/train': 1.2998442649841309} 11/07/2021 11:16:31 - INFO - __main__ - Step 99999: {'lr': 0.0001280855298818827, 'samples': 19199808, 'steps': 99998, 'loss/train': 1.2118370532989502} 11/07/2021 11:16:31 - INFO - __main__ - Step 100000: {'lr': 0.00012808089693676966, 'samples': 19200000, 'steps': 99999, 'loss/train': 0.8527107834815979} 11/07/2021 11:16:32 - INFO - __main__ - Step 100001: {'lr': 0.00012807626404659142, 'samples': 19200192, 'steps': 100000, 'loss/train': 1.5344045162200928} 11/07/2021 11:16:33 - INFO - __main__ - Step 100002: {'lr': 0.00012807163121135012, 'samples': 19200384, 'steps': 100001, 'loss/train': 1.3423640727996826} 11/07/2021 11:16:33 - INFO - __main__ - Step 100003: {'lr': 0.00012806699843104786, 'samples': 19200576, 'steps': 100002, 'loss/train': 1.2706276178359985} 11/07/2021 11:16:33 - INFO - __main__ - Step 100004: {'lr': 0.00012806236570568676, 'samples': 19200768, 'steps': 100003, 'loss/train': 1.3466306924819946} 11/07/2021 11:16:34 - INFO - __main__ - Step 100005: {'lr': 0.00012805773303526885, 'samples': 19200960, 'steps': 100004, 'loss/train': 2.1164753437042236} 11/07/2021 11:16:34 - INFO - __main__ - Step 100006: {'lr': 0.00012805310041979622, 'samples': 19201152, 'steps': 100005, 'loss/train': 1.03085458278656} 11/07/2021 11:16:35 - INFO - __main__ - Step 100007: {'lr': 0.000128048467859271, 'samples': 19201344, 'steps': 100006, 'loss/train': 0.4732477068901062} 11/07/2021 11:16:36 - INFO - __main__ - Step 100008: {'lr': 0.00012804383535369528, 'samples': 19201536, 'steps': 100007, 'loss/train': 1.4057893753051758} 11/07/2021 11:16:36 - INFO - __main__ - Step 100009: {'lr': 0.00012803920290307112, 'samples': 19201728, 'steps': 100008, 'loss/train': 1.1577683687210083} 11/07/2021 11:16:36 - INFO - __main__ - Step 100010: {'lr': 0.00012803457050740059, 'samples': 19201920, 'steps': 100009, 'loss/train': 1.6503721475601196} 11/07/2021 11:16:37 - INFO - __main__ - Step 100011: {'lr': 0.0001280299381666859, 'samples': 19202112, 'steps': 100010, 'loss/train': 1.2273681163787842} 11/07/2021 11:16:38 - INFO - __main__ - Step 100012: {'lr': 0.00012802530588092897, 'samples': 19202304, 'steps': 100011, 'loss/train': 1.5430158376693726} 11/07/2021 11:16:38 - INFO - __main__ - Step 100013: {'lr': 0.00012802067365013192, 'samples': 19202496, 'steps': 100012, 'loss/train': 0.9877128601074219} 11/07/2021 11:16:38 - INFO - __main__ - Step 100014: {'lr': 0.0001280160414742969, 'samples': 19202688, 'steps': 100013, 'loss/train': 1.4542646408081055} 11/07/2021 11:16:39 - INFO - __main__ - Step 100015: {'lr': 0.00012801140935342594, 'samples': 19202880, 'steps': 100014, 'loss/train': 1.504393219947815} 11/07/2021 11:16:39 - INFO - __main__ - Step 100016: {'lr': 0.0001280067772875212, 'samples': 19203072, 'steps': 100015, 'loss/train': 1.4153082370758057} 11/07/2021 11:16:40 - INFO - __main__ - Step 100017: {'lr': 0.00012800214527658468, 'samples': 19203264, 'steps': 100016, 'loss/train': 1.4625693559646606} 11/07/2021 11:16:40 - INFO - __main__ - Step 100018: {'lr': 0.00012799751332061854, 'samples': 19203456, 'steps': 100017, 'loss/train': 1.345199465751648} 11/07/2021 11:16:41 - INFO - __main__ - Step 100019: {'lr': 0.00012799288141962485, 'samples': 19203648, 'steps': 100018, 'loss/train': 1.2434011697769165} 11/07/2021 11:16:41 - INFO - __main__ - Step 100020: {'lr': 0.00012798824957360565, 'samples': 19203840, 'steps': 100019, 'loss/train': 0.748465895652771} 11/07/2021 11:16:41 - INFO - __main__ - Step 100021: {'lr': 0.00012798361778256306, 'samples': 19204032, 'steps': 100020, 'loss/train': 1.2691473960876465} 11/07/2021 11:16:43 - INFO - __main__ - Step 100022: {'lr': 0.00012797898604649928, 'samples': 19204224, 'steps': 100021, 'loss/train': 0.06232396140694618} 11/07/2021 11:16:43 - INFO - __main__ - Step 100023: {'lr': 0.00012797435436541618, 'samples': 19204416, 'steps': 100022, 'loss/train': 1.0832494497299194} 11/07/2021 11:16:43 - INFO - __main__ - Step 100024: {'lr': 0.00012796972273931595, 'samples': 19204608, 'steps': 100023, 'loss/train': 0.8338790535926819} 11/07/2021 11:16:44 - INFO - __main__ - Step 100025: {'lr': 0.00012796509116820071, 'samples': 19204800, 'steps': 100024, 'loss/train': 1.30027437210083} 11/07/2021 11:16:44 - INFO - __main__ - Step 100026: {'lr': 0.00012796045965207247, 'samples': 19204992, 'steps': 100025, 'loss/train': 1.508851408958435} 11/07/2021 11:16:45 - INFO - __main__ - Step 100027: {'lr': 0.00012795582819093344, 'samples': 19205184, 'steps': 100026, 'loss/train': 0.8331226110458374} 11/07/2021 11:16:45 - INFO - __main__ - Step 100028: {'lr': 0.00012795119678478555, 'samples': 19205376, 'steps': 100027, 'loss/train': 1.700709581375122} 11/07/2021 11:16:46 - INFO - __main__ - Step 100029: {'lr': 0.00012794656543363103, 'samples': 19205568, 'steps': 100028, 'loss/train': 1.8317041397094727} 11/07/2021 11:16:46 - INFO - __main__ - Step 100030: {'lr': 0.00012794193413747184, 'samples': 19205760, 'steps': 100029, 'loss/train': 1.2800105810165405} 11/07/2021 11:16:46 - INFO - __main__ - Step 100031: {'lr': 0.00012793730289631017, 'samples': 19205952, 'steps': 100030, 'loss/train': 0.8351216316223145} 11/07/2021 11:16:47 - INFO - __main__ - Step 100032: {'lr': 0.00012793267171014807, 'samples': 19206144, 'steps': 100031, 'loss/train': 1.2279270887374878} 11/07/2021 11:16:48 - INFO - __main__ - Step 100033: {'lr': 0.00012792804057898762, 'samples': 19206336, 'steps': 100032, 'loss/train': 1.4895700216293335} 11/07/2021 11:16:48 - INFO - __main__ - Step 100034: {'lr': 0.00012792340950283098, 'samples': 19206528, 'steps': 100033, 'loss/train': 1.0563573837280273} 11/07/2021 11:16:48 - INFO - __main__ - Step 100035: {'lr': 0.00012791877848168014, 'samples': 19206720, 'steps': 100034, 'loss/train': 1.3344018459320068} 11/07/2021 11:16:49 - INFO - __main__ - Step 100036: {'lr': 0.00012791414751553716, 'samples': 19206912, 'steps': 100035, 'loss/train': 1.4820091724395752} 11/07/2021 11:16:49 - INFO - __main__ - Step 100037: {'lr': 0.0001279095166044042, 'samples': 19207104, 'steps': 100036, 'loss/train': 1.5965746641159058} 11/07/2021 11:16:50 - INFO - __main__ - Step 100038: {'lr': 0.00012790488574828329, 'samples': 19207296, 'steps': 100037, 'loss/train': 3.035846710205078} 11/07/2021 11:16:50 - INFO - __main__ - Step 100039: {'lr': 0.00012790025494717662, 'samples': 19207488, 'steps': 100038, 'loss/train': 1.3040705919265747} 11/07/2021 11:16:51 - INFO - __main__ - Step 100040: {'lr': 0.00012789562420108616, 'samples': 19207680, 'steps': 100039, 'loss/train': 1.0905941724777222} 11/07/2021 11:16:51 - INFO - __main__ - Step 100041: {'lr': 0.00012789099351001408, 'samples': 19207872, 'steps': 100040, 'loss/train': 1.4945976734161377} 11/07/2021 11:16:52 - INFO - __main__ - Step 100042: {'lr': 0.00012788636287396242, 'samples': 19208064, 'steps': 100041, 'loss/train': 1.2963753938674927} 11/07/2021 11:16:52 - INFO - __main__ - Step 100043: {'lr': 0.00012788173229293326, 'samples': 19208256, 'steps': 100042, 'loss/train': 1.6770446300506592} 11/07/2021 11:16:53 - INFO - __main__ - Step 100044: {'lr': 0.00012787710176692874, 'samples': 19208448, 'steps': 100043, 'loss/train': 1.614502191543579} 11/07/2021 11:16:53 - INFO - __main__ - Step 100045: {'lr': 0.00012787247129595087, 'samples': 19208640, 'steps': 100044, 'loss/train': 1.2816299200057983} 11/07/2021 11:16:54 - INFO - __main__ - Step 100046: {'lr': 0.00012786784088000182, 'samples': 19208832, 'steps': 100045, 'loss/train': 1.3318833112716675} 11/07/2021 11:16:54 - INFO - __main__ - Step 100047: {'lr': 0.00012786321051908372, 'samples': 19209024, 'steps': 100046, 'loss/train': 1.1421175003051758} 11/07/2021 11:16:55 - INFO - __main__ - Step 100048: {'lr': 0.00012785858021319846, 'samples': 19209216, 'steps': 100047, 'loss/train': 1.7295717000961304} 11/07/2021 11:16:55 - INFO - __main__ - Step 100049: {'lr': 0.00012785394996234827, 'samples': 19209408, 'steps': 100048, 'loss/train': 1.2409034967422485} 11/07/2021 11:16:56 - INFO - __main__ - Step 100050: {'lr': 0.0001278493197665352, 'samples': 19209600, 'steps': 100049, 'loss/train': 1.7475048303604126} 11/07/2021 11:16:56 - INFO - __main__ - Step 100051: {'lr': 0.00012784468962576134, 'samples': 19209792, 'steps': 100050, 'loss/train': 0.8095641136169434} 11/07/2021 11:16:56 - INFO - __main__ - Step 100052: {'lr': 0.00012784005954002875, 'samples': 19209984, 'steps': 100051, 'loss/train': 1.096522331237793} 11/07/2021 11:16:57 - INFO - __main__ - Step 100053: {'lr': 0.00012783542950933958, 'samples': 19210176, 'steps': 100052, 'loss/train': 0.8502666354179382} 11/07/2021 11:16:58 - INFO - __main__ - Step 100054: {'lr': 0.00012783079953369587, 'samples': 19210368, 'steps': 100053, 'loss/train': 1.3456114530563354} 11/07/2021 11:16:58 - INFO - __main__ - Step 100055: {'lr': 0.0001278261696130997, 'samples': 19210560, 'steps': 100054, 'loss/train': 0.8814296126365662} 11/07/2021 11:16:58 - INFO - __main__ - Step 100056: {'lr': 0.00012782153974755318, 'samples': 19210752, 'steps': 100055, 'loss/train': 1.5439120531082153} 11/07/2021 11:16:59 - INFO - __main__ - Step 100057: {'lr': 0.00012781690993705843, 'samples': 19210944, 'steps': 100056, 'loss/train': 1.0307914018630981} 11/07/2021 11:17:00 - INFO - __main__ - Step 100058: {'lr': 0.00012781228018161745, 'samples': 19211136, 'steps': 100057, 'loss/train': 1.1543430089950562} 11/07/2021 11:17:00 - INFO - __main__ - Step 100059: {'lr': 0.00012780765048123235, 'samples': 19211328, 'steps': 100058, 'loss/train': 1.3315621614456177} 11/07/2021 11:17:00 - INFO - __main__ - Step 100060: {'lr': 0.00012780302083590528, 'samples': 19211520, 'steps': 100059, 'loss/train': 1.0987226963043213} 11/07/2021 11:17:01 - INFO - __main__ - Step 100061: {'lr': 0.00012779839124563836, 'samples': 19211712, 'steps': 100060, 'loss/train': 1.3563554286956787} 11/07/2021 11:17:01 - INFO - __main__ - Step 100062: {'lr': 0.00012779376171043348, 'samples': 19211904, 'steps': 100061, 'loss/train': 1.627705454826355} 11/07/2021 11:17:02 - INFO - __main__ - Step 100063: {'lr': 0.00012778913223029294, 'samples': 19212096, 'steps': 100062, 'loss/train': 0.9297937154769897} 11/07/2021 11:17:03 - INFO - __main__ - Step 100064: {'lr': 0.00012778450280521864, 'samples': 19212288, 'steps': 100063, 'loss/train': 2.3176944255828857} 11/07/2021 11:17:03 - INFO - __main__ - Step 100065: {'lr': 0.00012777987343521278, 'samples': 19212480, 'steps': 100064, 'loss/train': 1.5731794834136963} 11/07/2021 11:17:03 - INFO - __main__ - Step 100066: {'lr': 0.0001277752441202774, 'samples': 19212672, 'steps': 100065, 'loss/train': 1.3812528848648071} 11/07/2021 11:17:04 - INFO - __main__ - Step 100067: {'lr': 0.00012777061486041468, 'samples': 19212864, 'steps': 100066, 'loss/train': 0.7287965416908264} 11/07/2021 11:17:05 - INFO - __main__ - Step 100068: {'lr': 0.00012776598565562657, 'samples': 19213056, 'steps': 100067, 'loss/train': 1.7360318899154663} 11/07/2021 11:17:05 - INFO - __main__ - Step 100069: {'lr': 0.00012776135650591526, 'samples': 19213248, 'steps': 100068, 'loss/train': 1.144724726676941} 11/07/2021 11:17:05 - INFO - __main__ - Step 100070: {'lr': 0.00012775672741128274, 'samples': 19213440, 'steps': 100069, 'loss/train': 2.249786853790283} 11/07/2021 11:17:06 - INFO - __main__ - Step 100071: {'lr': 0.00012775209837173122, 'samples': 19213632, 'steps': 100070, 'loss/train': 1.4100741147994995} 11/07/2021 11:17:06 - INFO - __main__ - Step 100072: {'lr': 0.00012774746938726267, 'samples': 19213824, 'steps': 100071, 'loss/train': 1.433464527130127} 11/07/2021 11:17:06 - INFO - __main__ - Step 100073: {'lr': 0.00012774284045787926, 'samples': 19214016, 'steps': 100072, 'loss/train': 0.1571558117866516} 11/07/2021 11:17:08 - INFO - __main__ - Step 100074: {'lr': 0.0001277382115835831, 'samples': 19214208, 'steps': 100073, 'loss/train': 1.182675838470459} 11/07/2021 11:17:08 - INFO - __main__ - Step 100075: {'lr': 0.00012773358276437614, 'samples': 19214400, 'steps': 100074, 'loss/train': 1.449845790863037} 11/07/2021 11:17:08 - INFO - __main__ - Step 100076: {'lr': 0.0001277289540002605, 'samples': 19214592, 'steps': 100075, 'loss/train': 1.2772555351257324} 11/07/2021 11:17:09 - INFO - __main__ - Step 100077: {'lr': 0.0001277243252912384, 'samples': 19214784, 'steps': 100076, 'loss/train': 1.4918584823608398} 11/07/2021 11:17:09 - INFO - __main__ - Step 100078: {'lr': 0.00012771969663731176, 'samples': 19214976, 'steps': 100077, 'loss/train': 2.8248674869537354} 11/07/2021 11:17:10 - INFO - __main__ - Step 100079: {'lr': 0.00012771506803848276, 'samples': 19215168, 'steps': 100078, 'loss/train': 1.2574117183685303} 11/07/2021 11:17:10 - INFO - __main__ - Step 100080: {'lr': 0.00012771043949475345, 'samples': 19215360, 'steps': 100079, 'loss/train': 1.659069299697876} 11/07/2021 11:17:11 - INFO - __main__ - Step 100081: {'lr': 0.00012770581100612593, 'samples': 19215552, 'steps': 100080, 'loss/train': 0.9816200137138367} 11/07/2021 11:17:11 - INFO - __main__ - Step 100082: {'lr': 0.00012770118257260228, 'samples': 19215744, 'steps': 100081, 'loss/train': 1.7910232543945312} 11/07/2021 11:17:11 - INFO - __main__ - Step 100083: {'lr': 0.0001276965541941846, 'samples': 19215936, 'steps': 100082, 'loss/train': 1.2283165454864502} 11/07/2021 11:17:12 - INFO - __main__ - Step 100084: {'lr': 0.00012769192587087496, 'samples': 19216128, 'steps': 100083, 'loss/train': 1.2340351343154907} 11/07/2021 11:17:13 - INFO - __main__ - Step 100085: {'lr': 0.00012768729760267547, 'samples': 19216320, 'steps': 100084, 'loss/train': 1.0672510862350464} 11/07/2021 11:17:13 - INFO - __main__ - Step 100086: {'lr': 0.00012768266938958817, 'samples': 19216512, 'steps': 100085, 'loss/train': 1.2922344207763672} 11/07/2021 11:17:13 - INFO - __main__ - Step 100087: {'lr': 0.0001276780412316152, 'samples': 19216704, 'steps': 100086, 'loss/train': 0.46512770652770996} 11/07/2021 11:17:14 - INFO - __main__ - Step 100088: {'lr': 0.00012767341312875868, 'samples': 19216896, 'steps': 100087, 'loss/train': 0.9167482852935791} 11/07/2021 11:17:15 - INFO - __main__ - Step 100089: {'lr': 0.0001276687850810206, 'samples': 19217088, 'steps': 100088, 'loss/train': 1.4646302461624146} 11/07/2021 11:17:15 - INFO - __main__ - Step 100090: {'lr': 0.000127664157088403, 'samples': 19217280, 'steps': 100089, 'loss/train': 1.8565146923065186} 11/07/2021 11:17:16 - INFO - __main__ - Step 100091: {'lr': 0.00012765952915090806, 'samples': 19217472, 'steps': 100090, 'loss/train': 1.4352848529815674} 11/07/2021 11:17:16 - INFO - __main__ - Step 100092: {'lr': 0.00012765490126853788, 'samples': 19217664, 'steps': 100091, 'loss/train': 1.2517671585083008} 11/07/2021 11:17:16 - INFO - __main__ - Step 100093: {'lr': 0.0001276502734412945, 'samples': 19217856, 'steps': 100092, 'loss/train': 1.5441906452178955} 11/07/2021 11:17:17 - INFO - __main__ - Step 100094: {'lr': 0.00012764564566918003, 'samples': 19218048, 'steps': 100093, 'loss/train': 1.4432777166366577} 11/07/2021 11:17:18 - INFO - __main__ - Step 100095: {'lr': 0.0001276410179521965, 'samples': 19218240, 'steps': 100094, 'loss/train': 1.3177493810653687} 11/07/2021 11:17:18 - INFO - __main__ - Step 100096: {'lr': 0.00012763639029034609, 'samples': 19218432, 'steps': 100095, 'loss/train': 1.5060609579086304} 11/07/2021 11:17:18 - INFO - __main__ - Step 100097: {'lr': 0.0001276317626836308, 'samples': 19218624, 'steps': 100096, 'loss/train': 1.7425113916397095} 11/07/2021 11:17:19 - INFO - __main__ - Step 100098: {'lr': 0.00012762713513205277, 'samples': 19218816, 'steps': 100097, 'loss/train': 1.189474105834961} 11/07/2021 11:17:19 - INFO - __main__ - Step 100099: {'lr': 0.00012762250763561405, 'samples': 19219008, 'steps': 100098, 'loss/train': 1.2872142791748047} 11/07/2021 11:17:20 - INFO - __main__ - Step 100100: {'lr': 0.00012761788019431675, 'samples': 19219200, 'steps': 100099, 'loss/train': 1.229419469833374} 11/07/2021 11:17:21 - INFO - __main__ - Step 100101: {'lr': 0.00012761325280816305, 'samples': 19219392, 'steps': 100100, 'loss/train': 1.5595403909683228} 11/07/2021 11:17:21 - INFO - __main__ - Step 100102: {'lr': 0.0001276086254771548, 'samples': 19219584, 'steps': 100101, 'loss/train': 1.8476800918579102} 11/07/2021 11:17:21 - INFO - __main__ - Step 100103: {'lr': 0.00012760399820129425, 'samples': 19219776, 'steps': 100102, 'loss/train': 1.7321486473083496} 11/07/2021 11:17:22 - INFO - __main__ - Step 100104: {'lr': 0.00012759937098058343, 'samples': 19219968, 'steps': 100103, 'loss/train': 1.3412619829177856} 11/07/2021 11:17:23 - INFO - __main__ - Step 100105: {'lr': 0.00012759474381502444, 'samples': 19220160, 'steps': 100104, 'loss/train': 1.630109429359436} 11/07/2021 11:17:23 - INFO - __main__ - Step 100106: {'lr': 0.0001275901167046194, 'samples': 19220352, 'steps': 100105, 'loss/train': 1.7400039434432983} 11/07/2021 11:17:23 - INFO - __main__ - Step 100107: {'lr': 0.00012758548964937033, 'samples': 19220544, 'steps': 100106, 'loss/train': 1.6068923473358154} 11/07/2021 11:17:24 - INFO - __main__ - Step 100108: {'lr': 0.00012758086264927937, 'samples': 19220736, 'steps': 100107, 'loss/train': 1.0210758447647095} 11/07/2021 11:17:24 - INFO - __main__ - Step 100109: {'lr': 0.00012757623570434858, 'samples': 19220928, 'steps': 100108, 'loss/train': 1.0222126245498657} 11/07/2021 11:17:25 - INFO - __main__ - Step 100110: {'lr': 0.00012757160881458004, 'samples': 19221120, 'steps': 100109, 'loss/train': 1.7747185230255127} 11/07/2021 11:17:25 - INFO - __main__ - Step 100111: {'lr': 0.00012756698197997584, 'samples': 19221312, 'steps': 100110, 'loss/train': 1.0321334600448608} 11/07/2021 11:17:26 - INFO - __main__ - Step 100112: {'lr': 0.0001275623552005381, 'samples': 19221504, 'steps': 100111, 'loss/train': 1.3898733854293823} 11/07/2021 11:17:26 - INFO - __main__ - Step 100113: {'lr': 0.00012755772847626885, 'samples': 19221696, 'steps': 100112, 'loss/train': 1.3302336931228638} 11/07/2021 11:17:26 - INFO - __main__ - Step 100114: {'lr': 0.0001275531018071702, 'samples': 19221888, 'steps': 100113, 'loss/train': 1.781481146812439} 11/07/2021 11:17:27 - INFO - __main__ - Step 100115: {'lr': 0.00012754847519324432, 'samples': 19222080, 'steps': 100114, 'loss/train': 1.7240746021270752} 11/07/2021 11:17:28 - INFO - __main__ - Step 100116: {'lr': 0.00012754384863449314, 'samples': 19222272, 'steps': 100115, 'loss/train': 1.2935420274734497} 11/07/2021 11:17:28 - INFO - __main__ - Step 100117: {'lr': 0.00012753922213091877, 'samples': 19222464, 'steps': 100116, 'loss/train': 1.4220808744430542} 11/07/2021 11:17:28 - INFO - __main__ - Step 100118: {'lr': 0.00012753459568252338, 'samples': 19222656, 'steps': 100117, 'loss/train': 1.3253391981124878} 11/07/2021 11:17:29 - INFO - __main__ - Step 100119: {'lr': 0.00012752996928930898, 'samples': 19222848, 'steps': 100118, 'loss/train': 0.5927543044090271} 11/07/2021 11:17:29 - INFO - __main__ - Step 100120: {'lr': 0.00012752534295127772, 'samples': 19223040, 'steps': 100119, 'loss/train': 1.520364761352539} 11/07/2021 11:17:30 - INFO - __main__ - Step 100121: {'lr': 0.00012752071666843163, 'samples': 19223232, 'steps': 100120, 'loss/train': 1.7797472476959229} 11/07/2021 11:17:30 - INFO - __main__ - Step 100122: {'lr': 0.00012751609044077278, 'samples': 19223424, 'steps': 100121, 'loss/train': 1.2606381177902222} 11/07/2021 11:17:31 - INFO - __main__ - Step 100123: {'lr': 0.00012751146426830335, 'samples': 19223616, 'steps': 100122, 'loss/train': 1.4353313446044922} 11/07/2021 11:17:31 - INFO - __main__ - Step 100124: {'lr': 0.0001275068381510253, 'samples': 19223808, 'steps': 100123, 'loss/train': 1.014177918434143} 11/07/2021 11:17:32 - INFO - __main__ - Step 100125: {'lr': 0.00012750221208894085, 'samples': 19224000, 'steps': 100124, 'loss/train': 1.1198550462722778} 11/07/2021 11:17:33 - INFO - __main__ - Step 100126: {'lr': 0.00012749758608205197, 'samples': 19224192, 'steps': 100125, 'loss/train': 1.546004295349121} 11/07/2021 11:17:33 - INFO - __main__ - Step 100127: {'lr': 0.0001274929601303608, 'samples': 19224384, 'steps': 100126, 'loss/train': 1.3368033170700073} 11/07/2021 11:17:33 - INFO - __main__ - Step 100128: {'lr': 0.00012748833423386951, 'samples': 19224576, 'steps': 100127, 'loss/train': 0.23513904213905334} 11/07/2021 11:17:34 - INFO - __main__ - Step 100129: {'lr': 0.00012748370839258, 'samples': 19224768, 'steps': 100128, 'loss/train': 1.4989598989486694} 11/07/2021 11:17:34 - INFO - __main__ - Step 100130: {'lr': 0.0001274790826064944, 'samples': 19224960, 'steps': 100129, 'loss/train': 1.1886307001113892} 11/07/2021 11:17:35 - INFO - __main__ - Step 100131: {'lr': 0.00012747445687561487, 'samples': 19225152, 'steps': 100130, 'loss/train': 1.4599733352661133} 11/07/2021 11:17:35 - INFO - __main__ - Step 100132: {'lr': 0.00012746983119994344, 'samples': 19225344, 'steps': 100131, 'loss/train': 0.4667408764362335} 11/07/2021 11:17:36 - INFO - __main__ - Step 100133: {'lr': 0.0001274652055794822, 'samples': 19225536, 'steps': 100132, 'loss/train': 1.2509442567825317} 11/07/2021 11:17:36 - INFO - __main__ - Step 100134: {'lr': 0.0001274605800142333, 'samples': 19225728, 'steps': 100133, 'loss/train': 1.7465637922286987} 11/07/2021 11:17:36 - INFO - __main__ - Step 100135: {'lr': 0.00012745595450419872, 'samples': 19225920, 'steps': 100134, 'loss/train': 1.5694063901901245} 11/07/2021 11:17:37 - INFO - __main__ - Step 100136: {'lr': 0.00012745132904938062, 'samples': 19226112, 'steps': 100135, 'loss/train': 1.1650161743164062} 11/07/2021 11:17:38 - INFO - __main__ - Step 100137: {'lr': 0.00012744670364978105, 'samples': 19226304, 'steps': 100136, 'loss/train': 2.289146900177002} 11/07/2021 11:17:38 - INFO - __main__ - Step 100138: {'lr': 0.00012744207830540214, 'samples': 19226496, 'steps': 100137, 'loss/train': 1.9931751489639282} 11/07/2021 11:17:38 - INFO - __main__ - Step 100139: {'lr': 0.0001274374530162459, 'samples': 19226688, 'steps': 100138, 'loss/train': 1.010164499282837} 11/07/2021 11:17:39 - INFO - __main__ - Step 100140: {'lr': 0.00012743282778231445, 'samples': 19226880, 'steps': 100139, 'loss/train': 1.182600498199463} 11/07/2021 11:17:40 - INFO - __main__ - Step 100141: {'lr': 0.0001274282026036099, 'samples': 19227072, 'steps': 100140, 'loss/train': 2.0171585083007812} 11/07/2021 11:17:40 - INFO - __main__ - Step 100142: {'lr': 0.0001274235774801344, 'samples': 19227264, 'steps': 100141, 'loss/train': 1.4699828624725342} 11/07/2021 11:17:41 - INFO - __main__ - Step 100143: {'lr': 0.00012741895241188982, 'samples': 19227456, 'steps': 100142, 'loss/train': 1.8140921592712402} 11/07/2021 11:17:41 - INFO - __main__ - Step 100144: {'lr': 0.00012741432739887842, 'samples': 19227648, 'steps': 100143, 'loss/train': 1.6687713861465454} 11/07/2021 11:17:41 - INFO - __main__ - Step 100145: {'lr': 0.0001274097024411022, 'samples': 19227840, 'steps': 100144, 'loss/train': 1.1207159757614136} 11/07/2021 11:17:42 - INFO - __main__ - Step 100146: {'lr': 0.00012740507753856327, 'samples': 19228032, 'steps': 100145, 'loss/train': 1.4506263732910156} 11/07/2021 11:17:43 - INFO - __main__ - Step 100147: {'lr': 0.00012740045269126374, 'samples': 19228224, 'steps': 100146, 'loss/train': 1.0461225509643555} 11/07/2021 11:17:43 - INFO - __main__ - Step 100148: {'lr': 0.00012739582789920566, 'samples': 19228416, 'steps': 100147, 'loss/train': 1.197244644165039} 11/07/2021 11:17:43 - INFO - __main__ - Step 100149: {'lr': 0.00012739120316239113, 'samples': 19228608, 'steps': 100148, 'loss/train': 0.8144459128379822} 11/07/2021 11:17:44 - INFO - __main__ - Step 100150: {'lr': 0.00012738657848082225, 'samples': 19228800, 'steps': 100149, 'loss/train': 1.5934377908706665} 11/07/2021 11:17:44 - INFO - __main__ - Step 100151: {'lr': 0.0001273819538545011, 'samples': 19228992, 'steps': 100150, 'loss/train': 0.06417189538478851} 11/07/2021 11:17:45 - INFO - __main__ - Step 100152: {'lr': 0.00012737732928342968, 'samples': 19229184, 'steps': 100151, 'loss/train': 0.9148614406585693} 11/07/2021 11:17:46 - INFO - __main__ - Step 100153: {'lr': 0.0001273727047676102, 'samples': 19229376, 'steps': 100152, 'loss/train': 0.970016598701477} 11/07/2021 11:17:46 - INFO - __main__ - Step 100154: {'lr': 0.00012736808030704467, 'samples': 19229568, 'steps': 100153, 'loss/train': 1.0909523963928223} 11/07/2021 11:17:46 - INFO - __main__ - Step 100155: {'lr': 0.00012736345590173525, 'samples': 19229760, 'steps': 100154, 'loss/train': 1.0720117092132568} 11/07/2021 11:17:47 - INFO - __main__ - Step 100156: {'lr': 0.00012735883155168392, 'samples': 19229952, 'steps': 100155, 'loss/train': 1.2528585195541382} 11/07/2021 11:17:48 - INFO - __main__ - Step 100157: {'lr': 0.00012735420725689281, 'samples': 19230144, 'steps': 100156, 'loss/train': 1.4533116817474365} 11/07/2021 11:17:48 - INFO - __main__ - Step 100158: {'lr': 0.00012734958301736398, 'samples': 19230336, 'steps': 100157, 'loss/train': 1.3053332567214966} 11/07/2021 11:17:49 - INFO - __main__ - Step 100159: {'lr': 0.00012734495883309955, 'samples': 19230528, 'steps': 100158, 'loss/train': 1.169669270515442} 11/07/2021 11:17:49 - INFO - __main__ - Step 100160: {'lr': 0.00012734033470410155, 'samples': 19230720, 'steps': 100159, 'loss/train': 1.4930622577667236} 11/07/2021 11:17:49 - INFO - __main__ - Step 100161: {'lr': 0.00012733571063037213, 'samples': 19230912, 'steps': 100160, 'loss/train': 1.4523788690567017} 11/07/2021 11:17:50 - INFO - __main__ - Step 100162: {'lr': 0.0001273310866119134, 'samples': 19231104, 'steps': 100161, 'loss/train': 1.4381790161132812} 11/07/2021 11:17:51 - INFO - __main__ - Step 100163: {'lr': 0.00012732646264872733, 'samples': 19231296, 'steps': 100162, 'loss/train': 1.4416253566741943} 11/07/2021 11:17:51 - INFO - __main__ - Step 100164: {'lr': 0.00012732183874081604, 'samples': 19231488, 'steps': 100163, 'loss/train': 1.316738247871399} 11/07/2021 11:17:51 - INFO - __main__ - Step 100165: {'lr': 0.00012731721488818169, 'samples': 19231680, 'steps': 100164, 'loss/train': 1.4303683042526245} 11/07/2021 11:17:52 - INFO - __main__ - Step 100166: {'lr': 0.00012731259109082627, 'samples': 19231872, 'steps': 100165, 'loss/train': 0.9265416860580444} 11/07/2021 11:17:52 - INFO - __main__ - Step 100167: {'lr': 0.00012730796734875194, 'samples': 19232064, 'steps': 100166, 'loss/train': 1.3903676271438599} 11/07/2021 11:17:53 - INFO - __main__ - Step 100168: {'lr': 0.0001273033436619608, 'samples': 19232256, 'steps': 100167, 'loss/train': 1.3413152694702148} 11/07/2021 11:17:54 - INFO - __main__ - Step 100169: {'lr': 0.0001272987200304548, 'samples': 19232448, 'steps': 100168, 'loss/train': 0.968889594078064} 11/07/2021 11:17:54 - INFO - __main__ - Step 100170: {'lr': 0.0001272940964542361, 'samples': 19232640, 'steps': 100169, 'loss/train': 1.520232915878296} 11/07/2021 11:17:54 - INFO - __main__ - Step 100171: {'lr': 0.00012728947293330685, 'samples': 19232832, 'steps': 100170, 'loss/train': 1.1486279964447021} 11/07/2021 11:17:55 - INFO - __main__ - Step 100172: {'lr': 0.000127284849467669, 'samples': 19233024, 'steps': 100171, 'loss/train': 1.0032702684402466} 11/07/2021 11:17:55 - INFO - __main__ - Step 100173: {'lr': 0.0001272802260573247, 'samples': 19233216, 'steps': 100172, 'loss/train': 1.2515721321105957} 11/07/2021 11:17:56 - INFO - __main__ - Step 100174: {'lr': 0.00012727560270227607, 'samples': 19233408, 'steps': 100173, 'loss/train': 1.6070619821548462} 11/07/2021 11:17:56 - INFO - __main__ - Step 100175: {'lr': 0.00012727097940252514, 'samples': 19233600, 'steps': 100174, 'loss/train': 1.0884590148925781} 11/07/2021 11:17:57 - INFO - __main__ - Step 100176: {'lr': 0.00012726635615807402, 'samples': 19233792, 'steps': 100175, 'loss/train': 1.3788849115371704} 11/07/2021 11:17:57 - INFO - __main__ - Step 100177: {'lr': 0.0001272617329689248, 'samples': 19233984, 'steps': 100176, 'loss/train': 1.3712371587753296} 11/07/2021 11:17:57 - INFO - __main__ - Step 100178: {'lr': 0.00012725710983507954, 'samples': 19234176, 'steps': 100177, 'loss/train': 1.358227252960205} 11/07/2021 11:17:58 - INFO - __main__ - Step 100179: {'lr': 0.0001272524867565403, 'samples': 19234368, 'steps': 100178, 'loss/train': 0.8664436340332031} 11/07/2021 11:17:59 - INFO - __main__ - Step 100180: {'lr': 0.00012724786373330922, 'samples': 19234560, 'steps': 100179, 'loss/train': 1.3704476356506348} 11/07/2021 11:17:59 - INFO - __main__ - Step 100181: {'lr': 0.00012724324076538837, 'samples': 19234752, 'steps': 100180, 'loss/train': 1.302425742149353} 11/07/2021 11:18:00 - INFO - __main__ - Step 100182: {'lr': 0.0001272386178527799, 'samples': 19234944, 'steps': 100181, 'loss/train': 1.3771412372589111} 11/07/2021 11:18:00 - INFO - __main__ - Step 100183: {'lr': 0.00012723399499548575, 'samples': 19235136, 'steps': 100182, 'loss/train': 2.0331192016601562} 11/07/2021 11:18:01 - INFO - __main__ - Step 100184: {'lr': 0.00012722937219350803, 'samples': 19235328, 'steps': 100183, 'loss/train': 1.571998953819275} 11/07/2021 11:18:01 - INFO - __main__ - Step 100185: {'lr': 0.00012722474944684887, 'samples': 19235520, 'steps': 100184, 'loss/train': 1.34979248046875} 11/07/2021 11:18:02 - INFO - __main__ - Step 100186: {'lr': 0.00012722012675551038, 'samples': 19235712, 'steps': 100185, 'loss/train': 1.3958392143249512} 11/07/2021 11:18:02 - INFO - __main__ - Step 100187: {'lr': 0.00012721550411949457, 'samples': 19235904, 'steps': 100186, 'loss/train': 1.773400068283081} 11/07/2021 11:18:02 - INFO - __main__ - Step 100188: {'lr': 0.00012721088153880357, 'samples': 19236096, 'steps': 100187, 'loss/train': 0.7812200784683228} 11/07/2021 11:18:03 - INFO - __main__ - Step 100189: {'lr': 0.0001272062590134394, 'samples': 19236288, 'steps': 100188, 'loss/train': 1.3435593843460083} 11/07/2021 11:18:04 - INFO - __main__ - Step 100190: {'lr': 0.00012720163654340424, 'samples': 19236480, 'steps': 100189, 'loss/train': 1.3980190753936768} 11/07/2021 11:18:04 - INFO - __main__ - Step 100191: {'lr': 0.00012719701412870014, 'samples': 19236672, 'steps': 100190, 'loss/train': 1.292944312095642} 11/07/2021 11:18:04 - INFO - __main__ - Step 100192: {'lr': 0.00012719239176932917, 'samples': 19236864, 'steps': 100191, 'loss/train': 1.452890157699585} 11/07/2021 11:18:05 - INFO - __main__ - Step 100193: {'lr': 0.00012718776946529336, 'samples': 19237056, 'steps': 100192, 'loss/train': 1.370568871498108} 11/07/2021 11:18:06 - INFO - __main__ - Step 100194: {'lr': 0.0001271831472165949, 'samples': 19237248, 'steps': 100193, 'loss/train': 1.7193063497543335} 11/07/2021 11:18:06 - INFO - __main__ - Step 100195: {'lr': 0.0001271785250232359, 'samples': 19237440, 'steps': 100194, 'loss/train': 1.594557762145996} 11/07/2021 11:18:06 - INFO - __main__ - Step 100196: {'lr': 0.0001271739028852183, 'samples': 19237632, 'steps': 100195, 'loss/train': 1.0631937980651855} 11/07/2021 11:18:07 - INFO - __main__ - Step 100197: {'lr': 0.0001271692808025442, 'samples': 19237824, 'steps': 100196, 'loss/train': 1.1757129430770874} 11/07/2021 11:18:07 - INFO - __main__ - Step 100198: {'lr': 0.00012716465877521572, 'samples': 19238016, 'steps': 100197, 'loss/train': 1.2878644466400146} 11/07/2021 11:18:08 - INFO - __main__ - Step 100199: {'lr': 0.000127160036803235, 'samples': 19238208, 'steps': 100198, 'loss/train': 1.457122564315796} 11/07/2021 11:18:09 - INFO - __main__ - Step 100200: {'lr': 0.00012715541488660405, 'samples': 19238400, 'steps': 100199, 'loss/train': 1.4968267679214478} 11/07/2021 11:18:09 - INFO - __main__ - Step 100201: {'lr': 0.00012715079302532496, 'samples': 19238592, 'steps': 100200, 'loss/train': 0.8394119739532471} 11/07/2021 11:18:09 - INFO - __main__ - Step 100202: {'lr': 0.00012714617121939982, 'samples': 19238784, 'steps': 100201, 'loss/train': 1.3686108589172363} 11/07/2021 11:18:10 - INFO - __main__ - Step 100203: {'lr': 0.00012714154946883073, 'samples': 19238976, 'steps': 100202, 'loss/train': 1.7416640520095825} 11/07/2021 11:18:11 - INFO - __main__ - Step 100204: {'lr': 0.00012713692777361973, 'samples': 19239168, 'steps': 100203, 'loss/train': 1.3754518032073975} 11/07/2021 11:18:11 - INFO - __main__ - Step 100205: {'lr': 0.00012713230613376896, 'samples': 19239360, 'steps': 100204, 'loss/train': 1.182961106300354} 11/07/2021 11:18:12 - INFO - __main__ - Step 100206: {'lr': 0.0001271276845492805, 'samples': 19239552, 'steps': 100205, 'loss/train': 3.1932754516601562} 11/07/2021 11:18:12 - INFO - __main__ - Step 100207: {'lr': 0.0001271230630201564, 'samples': 19239744, 'steps': 100206, 'loss/train': 1.1472960710525513} 11/07/2021 11:18:12 - INFO - __main__ - Step 100208: {'lr': 0.00012711844154639874, 'samples': 19239936, 'steps': 100207, 'loss/train': 1.2937239408493042} 11/07/2021 11:18:13 - INFO - __main__ - Step 100209: {'lr': 0.0001271138201280097, 'samples': 19240128, 'steps': 100208, 'loss/train': 0.8062604665756226} 11/07/2021 11:18:14 - INFO - __main__ - Step 100210: {'lr': 0.0001271091987649912, 'samples': 19240320, 'steps': 100209, 'loss/train': 1.154120922088623} 11/07/2021 11:18:14 - INFO - __main__ - Step 100211: {'lr': 0.0001271045774573454, 'samples': 19240512, 'steps': 100210, 'loss/train': 1.6436059474945068} 11/07/2021 11:18:14 - INFO - __main__ - Step 100212: {'lr': 0.00012709995620507436, 'samples': 19240704, 'steps': 100211, 'loss/train': 1.2885316610336304} 11/07/2021 11:18:15 - INFO - __main__ - Step 100213: {'lr': 0.00012709533500818022, 'samples': 19240896, 'steps': 100212, 'loss/train': 1.0317541360855103} 11/07/2021 11:18:16 - INFO - __main__ - Step 100214: {'lr': 0.00012709071386666498, 'samples': 19241088, 'steps': 100213, 'loss/train': 1.439855933189392} 11/07/2021 11:18:16 - INFO - __main__ - Step 100215: {'lr': 0.00012708609278053079, 'samples': 19241280, 'steps': 100214, 'loss/train': 1.3268566131591797} 11/07/2021 11:18:16 - INFO - __main__ - Step 100216: {'lr': 0.00012708147174977976, 'samples': 19241472, 'steps': 100215, 'loss/train': 1.0018354654312134} 11/07/2021 11:18:17 - INFO - __main__ - Step 100217: {'lr': 0.00012707685077441384, 'samples': 19241664, 'steps': 100216, 'loss/train': 1.765580177307129} 11/07/2021 11:18:17 - INFO - __main__ - Step 100218: {'lr': 0.00012707222985443523, 'samples': 19241856, 'steps': 100217, 'loss/train': 1.4721593856811523} 11/07/2021 11:18:17 - INFO - __main__ - Step 100219: {'lr': 0.000127067608989846, 'samples': 19242048, 'steps': 100218, 'loss/train': 1.5144715309143066} 11/07/2021 11:18:19 - INFO - __main__ - Step 100220: {'lr': 0.00012706298818064815, 'samples': 19242240, 'steps': 100219, 'loss/train': 0.7025138735771179} 11/07/2021 11:18:19 - INFO - __main__ - Step 100221: {'lr': 0.00012705836742684385, 'samples': 19242432, 'steps': 100220, 'loss/train': 1.519779920578003} 11/07/2021 11:18:19 - INFO - __main__ - Step 100222: {'lr': 0.0001270537467284353, 'samples': 19242624, 'steps': 100221, 'loss/train': 1.541776418685913} 11/07/2021 11:18:20 - INFO - __main__ - Step 100223: {'lr': 0.00012704912608542423, 'samples': 19242816, 'steps': 100222, 'loss/train': 1.6696245670318604} 11/07/2021 11:18:20 - INFO - __main__ - Step 100224: {'lr': 0.000127044505497813, 'samples': 19243008, 'steps': 100223, 'loss/train': 1.678124189376831} 11/07/2021 11:18:21 - INFO - __main__ - Step 100225: {'lr': 0.0001270398849656036, 'samples': 19243200, 'steps': 100224, 'loss/train': 1.2004472017288208} 11/07/2021 11:18:21 - INFO - __main__ - Step 100226: {'lr': 0.00012703526448879816, 'samples': 19243392, 'steps': 100225, 'loss/train': 1.5841786861419678} 11/07/2021 11:18:22 - INFO - __main__ - Step 100227: {'lr': 0.0001270306440673987, 'samples': 19243584, 'steps': 100226, 'loss/train': 1.0157396793365479} 11/07/2021 11:18:22 - INFO - __main__ - Step 100228: {'lr': 0.00012702602370140735, 'samples': 19243776, 'steps': 100227, 'loss/train': 0.8395498991012573} 11/07/2021 11:18:22 - INFO - __main__ - Step 100229: {'lr': 0.00012702140339082617, 'samples': 19243968, 'steps': 100228, 'loss/train': 1.3680669069290161} 11/07/2021 11:18:23 - INFO - __main__ - Step 100230: {'lr': 0.00012701678313565724, 'samples': 19244160, 'steps': 100229, 'loss/train': 1.3165168762207031} 11/07/2021 11:18:24 - INFO - __main__ - Step 100231: {'lr': 0.00012701216293590264, 'samples': 19244352, 'steps': 100230, 'loss/train': 1.400101900100708} 11/07/2021 11:18:24 - INFO - __main__ - Step 100232: {'lr': 0.0001270075427915645, 'samples': 19244544, 'steps': 100231, 'loss/train': 1.7809717655181885} 11/07/2021 11:18:24 - INFO - __main__ - Step 100233: {'lr': 0.00012700292270264481, 'samples': 19244736, 'steps': 100232, 'loss/train': 1.294349193572998} 11/07/2021 11:18:25 - INFO - __main__ - Step 100234: {'lr': 0.00012699830266914576, 'samples': 19244928, 'steps': 100233, 'loss/train': 1.200217843055725} 11/07/2021 11:18:26 - INFO - __main__ - Step 100235: {'lr': 0.00012699368269106933, 'samples': 19245120, 'steps': 100234, 'loss/train': 1.2448655366897583} 11/07/2021 11:18:26 - INFO - __main__ - Step 100236: {'lr': 0.00012698906276841776, 'samples': 19245312, 'steps': 100235, 'loss/train': 1.35373854637146} 11/07/2021 11:18:27 - INFO - __main__ - Step 100237: {'lr': 0.00012698444290119292, 'samples': 19245504, 'steps': 100236, 'loss/train': 0.9993017911911011} 11/07/2021 11:18:27 - INFO - __main__ - Step 100238: {'lr': 0.00012697982308939703, 'samples': 19245696, 'steps': 100237, 'loss/train': 0.8026155829429626} 11/07/2021 11:18:27 - INFO - __main__ - Step 100239: {'lr': 0.00012697520333303212, 'samples': 19245888, 'steps': 100238, 'loss/train': 1.1558650732040405} 11/07/2021 11:18:29 - INFO - __main__ - Step 100240: {'lr': 0.00012697058363210026, 'samples': 19246080, 'steps': 100239, 'loss/train': 1.2322440147399902} 11/07/2021 11:18:30 - INFO - __main__ - Step 100241: {'lr': 0.00012696596398660356, 'samples': 19246272, 'steps': 100240, 'loss/train': 1.2330563068389893} 11/07/2021 11:18:30 - INFO - __main__ - Step 100242: {'lr': 0.0001269613443965441, 'samples': 19246464, 'steps': 100241, 'loss/train': 1.8882745504379272} 11/07/2021 11:18:30 - INFO - __main__ - Step 100243: {'lr': 0.00012695672486192397, 'samples': 19246656, 'steps': 100242, 'loss/train': 1.808809757232666} 11/07/2021 11:18:31 - INFO - __main__ - Step 100244: {'lr': 0.00012695210538274525, 'samples': 19246848, 'steps': 100243, 'loss/train': 1.501885175704956} 11/07/2021 11:18:31 - INFO - __main__ - Step 100245: {'lr': 0.00012694748595901001, 'samples': 19247040, 'steps': 100244, 'loss/train': 1.228358507156372} 11/07/2021 11:18:31 - INFO - __main__ - Step 100246: {'lr': 0.0001269428665907203, 'samples': 19247232, 'steps': 100245, 'loss/train': 1.1924949884414673} 11/07/2021 11:18:32 - INFO - __main__ - Step 100247: {'lr': 0.00012693824727787837, 'samples': 19247424, 'steps': 100246, 'loss/train': 1.1909537315368652} 11/07/2021 11:18:33 - INFO - __main__ - Step 100248: {'lr': 0.00012693362802048606, 'samples': 19247616, 'steps': 100247, 'loss/train': 1.5644508600234985} 11/07/2021 11:18:33 - INFO - __main__ - Step 100249: {'lr': 0.00012692900881854552, 'samples': 19247808, 'steps': 100248, 'loss/train': 1.497756838798523} 11/07/2021 11:18:33 - INFO - __main__ - Step 100250: {'lr': 0.00012692438967205894, 'samples': 19248000, 'steps': 100249, 'loss/train': 1.3487058877944946} 11/07/2021 11:18:34 - INFO - __main__ - Step 100251: {'lr': 0.00012691977058102826, 'samples': 19248192, 'steps': 100250, 'loss/train': 1.6234220266342163} 11/07/2021 11:18:35 - INFO - __main__ - Step 100252: {'lr': 0.0001269151515454557, 'samples': 19248384, 'steps': 100251, 'loss/train': 2.587872266769409} 11/07/2021 11:18:35 - INFO - __main__ - Step 100253: {'lr': 0.00012691053256534324, 'samples': 19248576, 'steps': 100252, 'loss/train': 1.1801577806472778} 11/07/2021 11:18:36 - INFO - __main__ - Step 100254: {'lr': 0.000126905913640693, 'samples': 19248768, 'steps': 100253, 'loss/train': 1.2391607761383057} 11/07/2021 11:18:36 - INFO - __main__ - Step 100255: {'lr': 0.00012690129477150702, 'samples': 19248960, 'steps': 100254, 'loss/train': 1.4306164979934692} 11/07/2021 11:18:36 - INFO - __main__ - Step 100256: {'lr': 0.00012689667595778745, 'samples': 19249152, 'steps': 100255, 'loss/train': 1.4120819568634033} 11/07/2021 11:18:37 - INFO - __main__ - Step 100257: {'lr': 0.00012689205719953634, 'samples': 19249344, 'steps': 100256, 'loss/train': 1.738603949546814} 11/07/2021 11:18:38 - INFO - __main__ - Step 100258: {'lr': 0.00012688743849675584, 'samples': 19249536, 'steps': 100257, 'loss/train': 1.2153220176696777} 11/07/2021 11:18:38 - INFO - __main__ - Step 100259: {'lr': 0.0001268828198494479, 'samples': 19249728, 'steps': 100258, 'loss/train': 1.4982984066009521} 11/07/2021 11:18:38 - INFO - __main__ - Step 100260: {'lr': 0.00012687820125761466, 'samples': 19249920, 'steps': 100259, 'loss/train': 1.2920604944229126} 11/07/2021 11:18:39 - INFO - __main__ - Step 100261: {'lr': 0.00012687358272125819, 'samples': 19250112, 'steps': 100260, 'loss/train': 1.3524116277694702} 11/07/2021 11:18:40 - INFO - __main__ - Step 100262: {'lr': 0.00012686896424038058, 'samples': 19250304, 'steps': 100261, 'loss/train': 1.1856024265289307} 11/07/2021 11:18:40 - INFO - __main__ - Step 100263: {'lr': 0.0001268643458149839, 'samples': 19250496, 'steps': 100262, 'loss/train': 1.358160376548767} 11/07/2021 11:18:40 - INFO - __main__ - Step 100264: {'lr': 0.00012685972744507027, 'samples': 19250688, 'steps': 100263, 'loss/train': 1.2827417850494385} 11/07/2021 11:18:41 - INFO - __main__ - Step 100265: {'lr': 0.00012685510913064174, 'samples': 19250880, 'steps': 100264, 'loss/train': 1.94098699092865} 11/07/2021 11:18:41 - INFO - __main__ - Step 100266: {'lr': 0.00012685049087170043, 'samples': 19251072, 'steps': 100265, 'loss/train': 1.3215036392211914} 11/07/2021 11:18:42 - INFO - __main__ - Step 100267: {'lr': 0.00012684587266824832, 'samples': 19251264, 'steps': 100266, 'loss/train': 1.2905973196029663} 11/07/2021 11:18:42 - INFO - __main__ - Step 100268: {'lr': 0.00012684125452028762, 'samples': 19251456, 'steps': 100267, 'loss/train': 1.6654707193374634} 11/07/2021 11:18:43 - INFO - __main__ - Step 100269: {'lr': 0.0001268366364278204, 'samples': 19251648, 'steps': 100268, 'loss/train': 1.6380385160446167} 11/07/2021 11:18:43 - INFO - __main__ - Step 100270: {'lr': 0.0001268320183908486, 'samples': 19251840, 'steps': 100269, 'loss/train': 1.0567479133605957} 11/07/2021 11:18:44 - INFO - __main__ - Step 100271: {'lr': 0.00012682740040937442, 'samples': 19252032, 'steps': 100270, 'loss/train': 0.9389281868934631} 11/07/2021 11:18:44 - INFO - __main__ - Step 100272: {'lr': 0.0001268227824833999, 'samples': 19252224, 'steps': 100271, 'loss/train': 1.4034098386764526} 11/07/2021 11:18:46 - INFO - __main__ - Step 100273: {'lr': 0.00012681816461292713, 'samples': 19252416, 'steps': 100272, 'loss/train': 1.355263352394104} 11/07/2021 11:18:46 - INFO - __main__ - Step 100274: {'lr': 0.0001268135467979582, 'samples': 19252608, 'steps': 100273, 'loss/train': 1.150221347808838} 11/07/2021 11:18:46 - INFO - __main__ - Step 100275: {'lr': 0.0001268089290384952, 'samples': 19252800, 'steps': 100274, 'loss/train': 1.4736082553863525} 11/07/2021 11:18:47 - INFO - __main__ - Step 100276: {'lr': 0.00012680431133454018, 'samples': 19252992, 'steps': 100275, 'loss/train': 1.6791898012161255} 11/07/2021 11:18:47 - INFO - __main__ - Step 100277: {'lr': 0.00012679969368609522, 'samples': 19253184, 'steps': 100276, 'loss/train': 1.3336697816848755} 11/07/2021 11:18:47 - INFO - __main__ - Step 100278: {'lr': 0.00012679507609316242, 'samples': 19253376, 'steps': 100277, 'loss/train': 0.9131267666816711} 11/07/2021 11:18:48 - INFO - __main__ - Step 100279: {'lr': 0.00012679045855574388, 'samples': 19253568, 'steps': 100278, 'loss/train': 0.8045585751533508} 11/07/2021 11:18:49 - INFO - __main__ - Step 100280: {'lr': 0.00012678584107384178, 'samples': 19253760, 'steps': 100279, 'loss/train': 1.2487775087356567} 11/07/2021 11:18:49 - INFO - __main__ - Step 100281: {'lr': 0.0001267812236474579, 'samples': 19253952, 'steps': 100280, 'loss/train': 1.232207179069519} 11/07/2021 11:18:50 - INFO - __main__ - Step 100282: {'lr': 0.00012677660627659457, 'samples': 19254144, 'steps': 100281, 'loss/train': 1.5665245056152344} 11/07/2021 11:18:50 - INFO - __main__ - Step 100283: {'lr': 0.0001267719889612538, 'samples': 19254336, 'steps': 100282, 'loss/train': 1.5141373872756958} 11/07/2021 11:18:50 - INFO - __main__ - Step 100284: {'lr': 0.00012676737170143763, 'samples': 19254528, 'steps': 100283, 'loss/train': 1.4705880880355835} 11/07/2021 11:18:51 - INFO - __main__ - Step 100285: {'lr': 0.00012676275449714818, 'samples': 19254720, 'steps': 100284, 'loss/train': 1.6832624673843384} 11/07/2021 11:18:52 - INFO - __main__ - Step 100286: {'lr': 0.00012675813734838755, 'samples': 19254912, 'steps': 100285, 'loss/train': 0.8376666307449341} 11/07/2021 11:18:52 - INFO - __main__ - Step 100287: {'lr': 0.00012675352025515779, 'samples': 19255104, 'steps': 100286, 'loss/train': 1.37075936794281} 11/07/2021 11:18:52 - INFO - __main__ - Step 100288: {'lr': 0.00012674890321746102, 'samples': 19255296, 'steps': 100287, 'loss/train': 1.310511827468872} 11/07/2021 11:18:53 - INFO - __main__ - Step 100289: {'lr': 0.00012674428623529928, 'samples': 19255488, 'steps': 100288, 'loss/train': 1.2676069736480713} 11/07/2021 11:18:53 - INFO - __main__ - Step 100290: {'lr': 0.00012673966930867476, 'samples': 19255680, 'steps': 100289, 'loss/train': 1.4922893047332764} 11/07/2021 11:18:54 - INFO - __main__ - Step 100291: {'lr': 0.00012673505243758932, 'samples': 19255872, 'steps': 100290, 'loss/train': 1.0572495460510254} 11/07/2021 11:18:54 - INFO - __main__ - Step 100292: {'lr': 0.0001267304356220452, 'samples': 19256064, 'steps': 100291, 'loss/train': 0.6240859031677246} 11/07/2021 11:18:55 - INFO - __main__ - Step 100293: {'lr': 0.00012672581886204442, 'samples': 19256256, 'steps': 100292, 'loss/train': 1.3065521717071533} 11/07/2021 11:18:55 - INFO - __main__ - Step 100294: {'lr': 0.00012672120215758909, 'samples': 19256448, 'steps': 100293, 'loss/train': 1.5738798379898071} 11/07/2021 11:18:55 - INFO - __main__ - Step 100295: {'lr': 0.00012671658550868128, 'samples': 19256640, 'steps': 100294, 'loss/train': 1.4586714506149292} 11/07/2021 11:18:57 - INFO - __main__ - Step 100296: {'lr': 0.00012671196891532308, 'samples': 19256832, 'steps': 100295, 'loss/train': 1.4467051029205322} 11/07/2021 11:18:57 - INFO - __main__ - Step 100297: {'lr': 0.00012670735237751656, 'samples': 19257024, 'steps': 100296, 'loss/train': 1.639136791229248} 11/07/2021 11:18:57 - INFO - __main__ - Step 100298: {'lr': 0.00012670273589526383, 'samples': 19257216, 'steps': 100297, 'loss/train': 1.583533525466919} 11/07/2021 11:18:58 - INFO - __main__ - Step 100299: {'lr': 0.00012669811946856691, 'samples': 19257408, 'steps': 100298, 'loss/train': 6.097108364105225} 11/07/2021 11:18:58 - INFO - __main__ - Step 100300: {'lr': 0.0001266935030974279, 'samples': 19257600, 'steps': 100299, 'loss/train': 1.5414139032363892} 11/07/2021 11:18:58 - INFO - __main__ - Step 100301: {'lr': 0.00012668888678184892, 'samples': 19257792, 'steps': 100300, 'loss/train': 2.3618369102478027} 11/07/2021 11:18:59 - INFO - __main__ - Step 100302: {'lr': 0.00012668427052183208, 'samples': 19257984, 'steps': 100301, 'loss/train': 1.6352787017822266} 11/07/2021 11:19:00 - INFO - __main__ - Step 100303: {'lr': 0.00012667965431737942, 'samples': 19258176, 'steps': 100302, 'loss/train': 1.1656068563461304} 11/07/2021 11:19:00 - INFO - __main__ - Step 100304: {'lr': 0.00012667503816849295, 'samples': 19258368, 'steps': 100303, 'loss/train': 1.6797451972961426} 11/07/2021 11:19:00 - INFO - __main__ - Step 100305: {'lr': 0.00012667042207517476, 'samples': 19258560, 'steps': 100304, 'loss/train': 1.557108998298645} 11/07/2021 11:19:01 - INFO - __main__ - Step 100306: {'lr': 0.000126665806037427, 'samples': 19258752, 'steps': 100305, 'loss/train': 1.1176350116729736} 11/07/2021 11:19:02 - INFO - __main__ - Step 100307: {'lr': 0.00012666119005525173, 'samples': 19258944, 'steps': 100306, 'loss/train': 1.1570649147033691} 11/07/2021 11:19:02 - INFO - __main__ - Step 100308: {'lr': 0.00012665657412865106, 'samples': 19259136, 'steps': 100307, 'loss/train': 1.2561804056167603} 11/07/2021 11:19:02 - INFO - __main__ - Step 100309: {'lr': 0.00012665195825762698, 'samples': 19259328, 'steps': 100308, 'loss/train': 1.6244279146194458} 11/07/2021 11:19:03 - INFO - __main__ - Step 100310: {'lr': 0.00012664734244218165, 'samples': 19259520, 'steps': 100309, 'loss/train': 1.5695656538009644} 11/07/2021 11:19:03 - INFO - __main__ - Step 100311: {'lr': 0.0001266427266823171, 'samples': 19259712, 'steps': 100310, 'loss/train': 1.6711543798446655} 11/07/2021 11:19:04 - INFO - __main__ - Step 100312: {'lr': 0.00012663811097803545, 'samples': 19259904, 'steps': 100311, 'loss/train': 1.2051650285720825} 11/07/2021 11:19:04 - INFO - __main__ - Step 100313: {'lr': 0.00012663349532933876, 'samples': 19260096, 'steps': 100312, 'loss/train': 1.0649467706680298} 11/07/2021 11:19:05 - INFO - __main__ - Step 100314: {'lr': 0.00012662887973622914, 'samples': 19260288, 'steps': 100313, 'loss/train': 1.7289044857025146} 11/07/2021 11:19:05 - INFO - __main__ - Step 100315: {'lr': 0.00012662426419870863, 'samples': 19260480, 'steps': 100314, 'loss/train': 1.662404179573059} 11/07/2021 11:19:06 - INFO - __main__ - Step 100316: {'lr': 0.0001266196487167794, 'samples': 19260672, 'steps': 100315, 'loss/train': 1.3780014514923096} 11/07/2021 11:19:07 - INFO - __main__ - Step 100317: {'lr': 0.00012661503329044338, 'samples': 19260864, 'steps': 100316, 'loss/train': 1.1333699226379395} 11/07/2021 11:19:07 - INFO - __main__ - Step 100318: {'lr': 0.00012661041791970268, 'samples': 19261056, 'steps': 100317, 'loss/train': 1.4219547510147095} 11/07/2021 11:19:07 - INFO - __main__ - Step 100319: {'lr': 0.00012660580260455946, 'samples': 19261248, 'steps': 100318, 'loss/train': 1.1336127519607544} 11/07/2021 11:19:08 - INFO - __main__ - Step 100320: {'lr': 0.00012660118734501575, 'samples': 19261440, 'steps': 100319, 'loss/train': 0.6706944704055786} 11/07/2021 11:19:08 - INFO - __main__ - Step 100321: {'lr': 0.00012659657214107365, 'samples': 19261632, 'steps': 100320, 'loss/train': 1.8498775959014893} 11/07/2021 11:19:09 - INFO - __main__ - Step 100322: {'lr': 0.00012659195699273523, 'samples': 19261824, 'steps': 100321, 'loss/train': 1.9904897212982178} 11/07/2021 11:19:09 - INFO - __main__ - Step 100323: {'lr': 0.00012658734190000253, 'samples': 19262016, 'steps': 100322, 'loss/train': 1.681283712387085} 11/07/2021 11:19:10 - INFO - __main__ - Step 100324: {'lr': 0.00012658272686287772, 'samples': 19262208, 'steps': 100323, 'loss/train': 1.442529320716858} 11/07/2021 11:19:10 - INFO - __main__ - Step 100325: {'lr': 0.0001265781118813628, 'samples': 19262400, 'steps': 100324, 'loss/train': 1.2341029644012451} 11/07/2021 11:19:10 - INFO - __main__ - Step 100326: {'lr': 0.00012657349695545988, 'samples': 19262592, 'steps': 100325, 'loss/train': 1.4783682823181152} 11/07/2021 11:19:11 - INFO - __main__ - Step 100327: {'lr': 0.00012656888208517107, 'samples': 19262784, 'steps': 100326, 'loss/train': 1.4943069219589233} 11/07/2021 11:19:12 - INFO - __main__ - Step 100328: {'lr': 0.0001265642672704984, 'samples': 19262976, 'steps': 100327, 'loss/train': 0.8843269944190979} 11/07/2021 11:19:12 - INFO - __main__ - Step 100329: {'lr': 0.00012655965251144396, 'samples': 19263168, 'steps': 100328, 'loss/train': 1.3027839660644531} 11/07/2021 11:19:13 - INFO - __main__ - Step 100330: {'lr': 0.0001265550378080099, 'samples': 19263360, 'steps': 100329, 'loss/train': 1.8100504875183105} 11/07/2021 11:19:13 - INFO - __main__ - Step 100331: {'lr': 0.00012655042316019822, 'samples': 19263552, 'steps': 100330, 'loss/train': 0.24423694610595703} 11/07/2021 11:19:13 - INFO - __main__ - Step 100332: {'lr': 0.00012654580856801096, 'samples': 19263744, 'steps': 100331, 'loss/train': 1.0811340808868408} 11/07/2021 11:19:15 - INFO - __main__ - Step 100333: {'lr': 0.00012654119403145026, 'samples': 19263936, 'steps': 100332, 'loss/train': 0.9994770884513855} 11/07/2021 11:19:15 - INFO - __main__ - Step 100334: {'lr': 0.00012653657955051818, 'samples': 19264128, 'steps': 100333, 'loss/train': 1.6383143663406372} 11/07/2021 11:19:15 - INFO - __main__ - Step 100335: {'lr': 0.00012653196512521682, 'samples': 19264320, 'steps': 100334, 'loss/train': 1.468465805053711} 11/07/2021 11:19:16 - INFO - __main__ - Step 100336: {'lr': 0.00012652735075554825, 'samples': 19264512, 'steps': 100335, 'loss/train': 1.3999956846237183} 11/07/2021 11:19:16 - INFO - __main__ - Step 100337: {'lr': 0.00012652273644151457, 'samples': 19264704, 'steps': 100336, 'loss/train': 1.5894901752471924} 11/07/2021 11:19:17 - INFO - __main__ - Step 100338: {'lr': 0.00012651812218311781, 'samples': 19264896, 'steps': 100337, 'loss/train': 1.2276924848556519} 11/07/2021 11:19:17 - INFO - __main__ - Step 100339: {'lr': 0.0001265135079803601, 'samples': 19265088, 'steps': 100338, 'loss/train': 1.270487666130066} 11/07/2021 11:19:18 - INFO - __main__ - Step 100340: {'lr': 0.00012650889383324348, 'samples': 19265280, 'steps': 100339, 'loss/train': 0.970559298992157} 11/07/2021 11:19:18 - INFO - __main__ - Step 100341: {'lr': 0.00012650427974177005, 'samples': 19265472, 'steps': 100340, 'loss/train': 1.3132671117782593} 11/07/2021 11:19:18 - INFO - __main__ - Step 100342: {'lr': 0.0001264996657059419, 'samples': 19265664, 'steps': 100341, 'loss/train': 1.6789841651916504} 11/07/2021 11:19:19 - INFO - __main__ - Step 100343: {'lr': 0.0001264950517257612, 'samples': 19265856, 'steps': 100342, 'loss/train': 1.0108649730682373} 11/07/2021 11:19:20 - INFO - __main__ - Step 100344: {'lr': 0.00012649043780122983, 'samples': 19266048, 'steps': 100343, 'loss/train': 1.2954777479171753} 11/07/2021 11:19:20 - INFO - __main__ - Step 100345: {'lr': 0.0001264858239323499, 'samples': 19266240, 'steps': 100344, 'loss/train': 1.3261662721633911} 11/07/2021 11:19:20 - INFO - __main__ - Step 100346: {'lr': 0.0001264812101191236, 'samples': 19266432, 'steps': 100345, 'loss/train': 1.0848276615142822} 11/07/2021 11:19:21 - INFO - __main__ - Step 100347: {'lr': 0.00012647659636155298, 'samples': 19266624, 'steps': 100346, 'loss/train': 0.6615179777145386} 11/07/2021 11:19:22 - INFO - __main__ - Step 100348: {'lr': 0.0001264719826596401, 'samples': 19266816, 'steps': 100347, 'loss/train': 1.145164966583252} 11/07/2021 11:19:22 - INFO - __main__ - Step 100349: {'lr': 0.00012646736901338702, 'samples': 19267008, 'steps': 100348, 'loss/train': 1.5261024236679077} 11/07/2021 11:19:23 - INFO - __main__ - Step 100350: {'lr': 0.0001264627554227958, 'samples': 19267200, 'steps': 100349, 'loss/train': 1.6953697204589844} 11/07/2021 11:19:23 - INFO - __main__ - Step 100351: {'lr': 0.0001264581418878686, 'samples': 19267392, 'steps': 100350, 'loss/train': 1.0178005695343018} 11/07/2021 11:19:23 - INFO - __main__ - Step 100352: {'lr': 0.00012645352840860743, 'samples': 19267584, 'steps': 100351, 'loss/train': 1.3451720476150513} 11/07/2021 11:19:24 - INFO - __main__ - Step 100353: {'lr': 0.00012644891498501444, 'samples': 19267776, 'steps': 100352, 'loss/train': 1.0394785404205322} 11/07/2021 11:19:25 - INFO - __main__ - Step 100354: {'lr': 0.00012644430161709162, 'samples': 19267968, 'steps': 100353, 'loss/train': 1.1861199140548706} 11/07/2021 11:19:25 - INFO - __main__ - Step 100355: {'lr': 0.0001264396883048411, 'samples': 19268160, 'steps': 100354, 'loss/train': 1.647955060005188} 11/07/2021 11:19:25 - INFO - __main__ - Step 100356: {'lr': 0.00012643507504826496, 'samples': 19268352, 'steps': 100355, 'loss/train': 1.4517043828964233} 11/07/2021 11:19:26 - INFO - __main__ - Step 100357: {'lr': 0.00012643046184736533, 'samples': 19268544, 'steps': 100356, 'loss/train': 1.0652014017105103} 11/07/2021 11:19:26 - INFO - __main__ - Step 100358: {'lr': 0.00012642584870214418, 'samples': 19268736, 'steps': 100357, 'loss/train': 1.3126636743545532} 11/07/2021 11:19:28 - INFO - __main__ - Step 100359: {'lr': 0.0001264212356126036, 'samples': 19268928, 'steps': 100358, 'loss/train': 0.9878736138343811} 11/07/2021 11:19:28 - INFO - __main__ - Step 100360: {'lr': 0.00012641662257874574, 'samples': 19269120, 'steps': 100359, 'loss/train': 0.89615398645401} 11/07/2021 11:19:28 - INFO - __main__ - Step 100361: {'lr': 0.0001264120096005726, 'samples': 19269312, 'steps': 100360, 'loss/train': 1.4599796533584595} 11/07/2021 11:19:29 - INFO - __main__ - Step 100362: {'lr': 0.0001264073966780863, 'samples': 19269504, 'steps': 100361, 'loss/train': 0.25144556164741516} 11/07/2021 11:19:29 - INFO - __main__ - Step 100363: {'lr': 0.00012640278381128895, 'samples': 19269696, 'steps': 100362, 'loss/train': 0.5397301912307739} 11/07/2021 11:19:30 - INFO - __main__ - Step 100364: {'lr': 0.0001263981710001826, 'samples': 19269888, 'steps': 100363, 'loss/train': 1.4874157905578613} 11/07/2021 11:19:30 - INFO - __main__ - Step 100365: {'lr': 0.00012639355824476935, 'samples': 19270080, 'steps': 100364, 'loss/train': 1.7724372148513794} 11/07/2021 11:19:31 - INFO - __main__ - Step 100366: {'lr': 0.0001263889455450512, 'samples': 19270272, 'steps': 100365, 'loss/train': 1.4571887254714966} 11/07/2021 11:19:31 - INFO - __main__ - Step 100367: {'lr': 0.00012638433290103028, 'samples': 19270464, 'steps': 100366, 'loss/train': 1.0102996826171875} 11/07/2021 11:19:31 - INFO - __main__ - Step 100368: {'lr': 0.00012637972031270874, 'samples': 19270656, 'steps': 100367, 'loss/train': 1.1596580743789673} 11/07/2021 11:19:32 - INFO - __main__ - Step 100369: {'lr': 0.00012637510778008853, 'samples': 19270848, 'steps': 100368, 'loss/train': 1.476831078529358} 11/07/2021 11:19:33 - INFO - __main__ - Step 100370: {'lr': 0.0001263704953031719, 'samples': 19271040, 'steps': 100369, 'loss/train': 1.40144944190979} 11/07/2021 11:19:33 - INFO - __main__ - Step 100371: {'lr': 0.0001263658828819607, 'samples': 19271232, 'steps': 100370, 'loss/train': 1.431461215019226} 11/07/2021 11:19:33 - INFO - __main__ - Step 100372: {'lr': 0.00012636127051645718, 'samples': 19271424, 'steps': 100371, 'loss/train': 1.2417387962341309} 11/07/2021 11:19:34 - INFO - __main__ - Step 100373: {'lr': 0.00012635665820666332, 'samples': 19271616, 'steps': 100372, 'loss/train': 0.8203567862510681} 11/07/2021 11:19:35 - INFO - __main__ - Step 100374: {'lr': 0.00012635204595258127, 'samples': 19271808, 'steps': 100373, 'loss/train': 1.2804419994354248} 11/07/2021 11:19:35 - INFO - __main__ - Step 100375: {'lr': 0.00012634743375421306, 'samples': 19272000, 'steps': 100374, 'loss/train': 1.4157460927963257} 11/07/2021 11:19:35 - INFO - __main__ - Step 100376: {'lr': 0.0001263428216115608, 'samples': 19272192, 'steps': 100375, 'loss/train': 0.09315863996744156} 11/07/2021 11:19:36 - INFO - __main__ - Step 100377: {'lr': 0.00012633820952462655, 'samples': 19272384, 'steps': 100376, 'loss/train': 1.1100754737854004} 11/07/2021 11:19:36 - INFO - __main__ - Step 100378: {'lr': 0.0001263335974934124, 'samples': 19272576, 'steps': 100377, 'loss/train': 1.155129313468933} 11/07/2021 11:19:37 - INFO - __main__ - Step 100379: {'lr': 0.0001263289855179204, 'samples': 19272768, 'steps': 100378, 'loss/train': 1.1384118795394897} 11/07/2021 11:19:38 - INFO - __main__ - Step 100380: {'lr': 0.0001263243735981527, 'samples': 19272960, 'steps': 100379, 'loss/train': 1.274102807044983} 11/07/2021 11:19:38 - INFO - __main__ - Step 100381: {'lr': 0.00012631976173411126, 'samples': 19273152, 'steps': 100380, 'loss/train': 0.8243192434310913} 11/07/2021 11:19:38 - INFO - __main__ - Step 100382: {'lr': 0.00012631514992579828, 'samples': 19273344, 'steps': 100381, 'loss/train': 1.4893518686294556} 11/07/2021 11:19:39 - INFO - __main__ - Step 100383: {'lr': 0.00012631053817321574, 'samples': 19273536, 'steps': 100382, 'loss/train': 1.3512262105941772} 11/07/2021 11:19:40 - INFO - __main__ - Step 100384: {'lr': 0.0001263059264763659, 'samples': 19273728, 'steps': 100383, 'loss/train': 0.9560406804084778} 11/07/2021 11:19:40 - INFO - __main__ - Step 100385: {'lr': 0.00012630131483525058, 'samples': 19273920, 'steps': 100384, 'loss/train': 1.1115448474884033} 11/07/2021 11:19:40 - INFO - __main__ - Step 100386: {'lr': 0.00012629670324987202, 'samples': 19274112, 'steps': 100385, 'loss/train': 1.1837029457092285} 11/07/2021 11:19:41 - INFO - __main__ - Step 100387: {'lr': 0.0001262920917202322, 'samples': 19274304, 'steps': 100386, 'loss/train': 0.8137231469154358} 11/07/2021 11:19:41 - INFO - __main__ - Step 100388: {'lr': 0.0001262874802463333, 'samples': 19274496, 'steps': 100387, 'loss/train': 1.4100369215011597} 11/07/2021 11:19:41 - INFO - __main__ - Step 100389: {'lr': 0.00012628286882817737, 'samples': 19274688, 'steps': 100388, 'loss/train': 0.09346193820238113} 11/07/2021 11:19:42 - INFO - __main__ - Step 100390: {'lr': 0.0001262782574657664, 'samples': 19274880, 'steps': 100389, 'loss/train': 2.142881393432617} 11/07/2021 11:19:43 - INFO - __main__ - Step 100391: {'lr': 0.00012627364615910259, 'samples': 19275072, 'steps': 100390, 'loss/train': 1.5748493671417236} 11/07/2021 11:19:43 - INFO - __main__ - Step 100392: {'lr': 0.00012626903490818792, 'samples': 19275264, 'steps': 100391, 'loss/train': 1.2669057846069336} 11/07/2021 11:19:44 - INFO - __main__ - Step 100393: {'lr': 0.00012626442371302456, 'samples': 19275456, 'steps': 100392, 'loss/train': 1.0850775241851807} 11/07/2021 11:19:44 - INFO - __main__ - Step 100394: {'lr': 0.00012625981257361453, 'samples': 19275648, 'steps': 100393, 'loss/train': 1.5147004127502441} 11/07/2021 11:19:45 - INFO - __main__ - Step 100395: {'lr': 0.0001262552014899599, 'samples': 19275840, 'steps': 100394, 'loss/train': 1.359411597251892} 11/07/2021 11:19:45 - INFO - __main__ - Step 100396: {'lr': 0.00012625059046206277, 'samples': 19276032, 'steps': 100395, 'loss/train': 1.2739650011062622} 11/07/2021 11:19:46 - INFO - __main__ - Step 100397: {'lr': 0.00012624597948992532, 'samples': 19276224, 'steps': 100396, 'loss/train': 1.419702172279358} 11/07/2021 11:19:46 - INFO - __main__ - Step 100398: {'lr': 0.00012624136857354945, 'samples': 19276416, 'steps': 100397, 'loss/train': 1.1871908903121948} 11/07/2021 11:19:46 - INFO - __main__ - Step 100399: {'lr': 0.00012623675771293726, 'samples': 19276608, 'steps': 100398, 'loss/train': 1.3926188945770264} 11/07/2021 11:19:47 - INFO - __main__ - Step 100400: {'lr': 0.00012623214690809094, 'samples': 19276800, 'steps': 100399, 'loss/train': 1.5254899263381958} 11/07/2021 11:19:48 - INFO - __main__ - Step 100401: {'lr': 0.00012622753615901245, 'samples': 19276992, 'steps': 100400, 'loss/train': 1.3553409576416016} 11/07/2021 11:19:48 - INFO - __main__ - Step 100402: {'lr': 0.00012622292546570393, 'samples': 19277184, 'steps': 100401, 'loss/train': 1.3264625072479248} 11/07/2021 11:19:48 - INFO - __main__ - Step 100403: {'lr': 0.00012621831482816749, 'samples': 19277376, 'steps': 100402, 'loss/train': 1.150076985359192} 11/07/2021 11:19:49 - INFO - __main__ - Step 100404: {'lr': 0.0001262137042464051, 'samples': 19277568, 'steps': 100403, 'loss/train': 0.9690337777137756} 11/07/2021 11:19:50 - INFO - __main__ - Step 100405: {'lr': 0.00012620909372041894, 'samples': 19277760, 'steps': 100404, 'loss/train': 1.8505274057388306} 11/07/2021 11:19:50 - INFO - __main__ - Step 100406: {'lr': 0.00012620448325021105, 'samples': 19277952, 'steps': 100405, 'loss/train': 1.3347463607788086} 11/07/2021 11:19:50 - INFO - __main__ - Step 100407: {'lr': 0.0001261998728357835, 'samples': 19278144, 'steps': 100406, 'loss/train': 1.4470462799072266} 11/07/2021 11:19:51 - INFO - __main__ - Step 100408: {'lr': 0.00012619526247713842, 'samples': 19278336, 'steps': 100407, 'loss/train': 2.546436309814453} 11/07/2021 11:19:51 - INFO - __main__ - Step 100409: {'lr': 0.0001261906521742778, 'samples': 19278528, 'steps': 100408, 'loss/train': 1.1476244926452637} 11/07/2021 11:19:51 - INFO - __main__ - Step 100410: {'lr': 0.0001261860419272039, 'samples': 19278720, 'steps': 100409, 'loss/train': 1.3484514951705933} 11/07/2021 11:19:52 - INFO - __main__ - Step 100411: {'lr': 0.0001261814317359185, 'samples': 19278912, 'steps': 100410, 'loss/train': 1.0936322212219238} 11/07/2021 11:19:53 - INFO - __main__ - Step 100412: {'lr': 0.00012617682160042388, 'samples': 19279104, 'steps': 100411, 'loss/train': 1.0359768867492676} 11/07/2021 11:19:53 - INFO - __main__ - Step 100413: {'lr': 0.00012617221152072205, 'samples': 19279296, 'steps': 100412, 'loss/train': 0.6829764246940613} 11/07/2021 11:19:53 - INFO - __main__ - Step 100414: {'lr': 0.00012616760149681517, 'samples': 19279488, 'steps': 100413, 'loss/train': 1.6831802129745483} 11/07/2021 11:19:54 - INFO - __main__ - Step 100415: {'lr': 0.0001261629915287052, 'samples': 19279680, 'steps': 100414, 'loss/train': 1.3999533653259277} 11/07/2021 11:19:55 - INFO - __main__ - Step 100416: {'lr': 0.0001261583816163943, 'samples': 19279872, 'steps': 100415, 'loss/train': 0.9790731072425842} 11/07/2021 11:19:55 - INFO - __main__ - Step 100417: {'lr': 0.00012615377175988448, 'samples': 19280064, 'steps': 100416, 'loss/train': 0.8034746646881104} 11/07/2021 11:19:56 - INFO - __main__ - Step 100418: {'lr': 0.0001261491619591779, 'samples': 19280256, 'steps': 100417, 'loss/train': 1.094143033027649} 11/07/2021 11:19:56 - INFO - __main__ - Step 100419: {'lr': 0.0001261445522142766, 'samples': 19280448, 'steps': 100418, 'loss/train': 1.6704366207122803} 11/07/2021 11:19:56 - INFO - __main__ - Step 100420: {'lr': 0.00012613994252518262, 'samples': 19280640, 'steps': 100419, 'loss/train': 1.3803844451904297} 11/07/2021 11:19:57 - INFO - __main__ - Step 100421: {'lr': 0.0001261353328918981, 'samples': 19280832, 'steps': 100420, 'loss/train': 1.4110705852508545} 11/07/2021 11:19:58 - INFO - __main__ - Step 100422: {'lr': 0.00012613072331442508, 'samples': 19281024, 'steps': 100421, 'loss/train': 1.8391096591949463} 11/07/2021 11:19:58 - INFO - __main__ - Step 100423: {'lr': 0.0001261261137927656, 'samples': 19281216, 'steps': 100422, 'loss/train': 1.4452272653579712} 11/07/2021 11:19:58 - INFO - __main__ - Step 100424: {'lr': 0.00012612150432692195, 'samples': 19281408, 'steps': 100423, 'loss/train': 1.4589450359344482} 11/07/2021 11:19:59 - INFO - __main__ - Step 100425: {'lr': 0.00012611689491689594, 'samples': 19281600, 'steps': 100424, 'loss/train': 1.2923258543014526} 11/07/2021 11:20:00 - INFO - __main__ - Step 100426: {'lr': 0.00012611228556268973, 'samples': 19281792, 'steps': 100425, 'loss/train': 1.2166956663131714} 11/07/2021 11:20:00 - INFO - __main__ - Step 100427: {'lr': 0.00012610767626430536, 'samples': 19281984, 'steps': 100426, 'loss/train': 1.3784773349761963} 11/07/2021 11:20:00 - INFO - __main__ - Step 100428: {'lr': 0.000126103067021745, 'samples': 19282176, 'steps': 100427, 'loss/train': 0.832977831363678} 11/07/2021 11:20:01 - INFO - __main__ - Step 100429: {'lr': 0.0001260984578350107, 'samples': 19282368, 'steps': 100428, 'loss/train': 1.1977354288101196} 11/07/2021 11:20:01 - INFO - __main__ - Step 100430: {'lr': 0.0001260938487041045, 'samples': 19282560, 'steps': 100429, 'loss/train': 1.8757686614990234} 11/07/2021 11:20:02 - INFO - __main__ - Step 100431: {'lr': 0.00012608923962902853, 'samples': 19282752, 'steps': 100430, 'loss/train': 0.8524510264396667} 11/07/2021 11:20:03 - INFO - __main__ - Step 100432: {'lr': 0.00012608463060978482, 'samples': 19282944, 'steps': 100431, 'loss/train': 1.4107894897460938} 11/07/2021 11:20:03 - INFO - __main__ - Step 100433: {'lr': 0.00012608002164637543, 'samples': 19283136, 'steps': 100432, 'loss/train': 1.6675622463226318} 11/07/2021 11:20:03 - INFO - __main__ - Step 100434: {'lr': 0.00012607541273880251, 'samples': 19283328, 'steps': 100433, 'loss/train': 1.497431755065918} 11/07/2021 11:20:04 - INFO - __main__ - Step 100435: {'lr': 0.0001260708038870681, 'samples': 19283520, 'steps': 100434, 'loss/train': 1.4987082481384277} 11/07/2021 11:20:04 - INFO - __main__ - Step 100436: {'lr': 0.00012606619509117424, 'samples': 19283712, 'steps': 100435, 'loss/train': 1.2248409986495972} 11/07/2021 11:20:05 - INFO - __main__ - Step 100437: {'lr': 0.00012606158635112316, 'samples': 19283904, 'steps': 100436, 'loss/train': 0.8256961703300476} 11/07/2021 11:20:05 - INFO - __main__ - Step 100438: {'lr': 0.0001260569776669167, 'samples': 19284096, 'steps': 100437, 'loss/train': 0.6873278617858887} 11/07/2021 11:20:06 - INFO - __main__ - Step 100439: {'lr': 0.0001260523690385571, 'samples': 19284288, 'steps': 100438, 'loss/train': 1.6121145486831665} 11/07/2021 11:20:06 - INFO - __main__ - Step 100440: {'lr': 0.00012604776046604634, 'samples': 19284480, 'steps': 100439, 'loss/train': 1.297318935394287} 11/07/2021 11:20:06 - INFO - __main__ - Step 100441: {'lr': 0.00012604315194938658, 'samples': 19284672, 'steps': 100440, 'loss/train': 1.1815484762191772} 11/07/2021 11:20:08 - INFO - __main__ - Step 100442: {'lr': 0.00012603854348857985, 'samples': 19284864, 'steps': 100441, 'loss/train': 0.9458504319190979} 11/07/2021 11:20:08 - INFO - __main__ - Step 100443: {'lr': 0.00012603393508362824, 'samples': 19285056, 'steps': 100442, 'loss/train': 1.0646049976348877} 11/07/2021 11:20:08 - INFO - __main__ - Step 100444: {'lr': 0.00012602932673453382, 'samples': 19285248, 'steps': 100443, 'loss/train': 1.6320868730545044} 11/07/2021 11:20:09 - INFO - __main__ - Step 100445: {'lr': 0.00012602471844129867, 'samples': 19285440, 'steps': 100444, 'loss/train': 1.357630729675293} 11/07/2021 11:20:09 - INFO - __main__ - Step 100446: {'lr': 0.0001260201102039249, 'samples': 19285632, 'steps': 100445, 'loss/train': 0.7048534750938416} 11/07/2021 11:20:10 - INFO - __main__ - Step 100447: {'lr': 0.00012601550202241452, 'samples': 19285824, 'steps': 100446, 'loss/train': 1.3291090726852417} 11/07/2021 11:20:10 - INFO - __main__ - Step 100448: {'lr': 0.00012601089389676964, 'samples': 19286016, 'steps': 100447, 'loss/train': 1.1501315832138062} 11/07/2021 11:20:11 - INFO - __main__ - Step 100449: {'lr': 0.00012600628582699235, 'samples': 19286208, 'steps': 100448, 'loss/train': 1.5682519674301147} 11/07/2021 11:20:11 - INFO - __main__ - Step 100450: {'lr': 0.00012600167781308473, 'samples': 19286400, 'steps': 100449, 'loss/train': 1.4837199449539185} 11/07/2021 11:20:11 - INFO - __main__ - Step 100451: {'lr': 0.00012599706985504892, 'samples': 19286592, 'steps': 100450, 'loss/train': 0.7879465818405151} 11/07/2021 11:20:12 - INFO - __main__ - Step 100452: {'lr': 0.00012599246195288681, 'samples': 19286784, 'steps': 100451, 'loss/train': 1.2285337448120117} 11/07/2021 11:20:13 - INFO - __main__ - Step 100453: {'lr': 0.00012598785410660056, 'samples': 19286976, 'steps': 100452, 'loss/train': 1.2068431377410889} 11/07/2021 11:20:13 - INFO - __main__ - Step 100454: {'lr': 0.00012598324631619235, 'samples': 19287168, 'steps': 100453, 'loss/train': 1.4114793539047241} 11/07/2021 11:20:13 - INFO - __main__ - Step 100455: {'lr': 0.00012597863858166412, 'samples': 19287360, 'steps': 100454, 'loss/train': 1.4540272951126099} 11/07/2021 11:20:14 - INFO - __main__ - Step 100456: {'lr': 0.00012597403090301802, 'samples': 19287552, 'steps': 100455, 'loss/train': 0.9569156765937805} 11/07/2021 11:20:15 - INFO - __main__ - Step 100457: {'lr': 0.00012596942328025606, 'samples': 19287744, 'steps': 100456, 'loss/train': 1.5344346761703491} 11/07/2021 11:20:15 - INFO - __main__ - Step 100458: {'lr': 0.00012596481571338042, 'samples': 19287936, 'steps': 100457, 'loss/train': 1.2446739673614502} 11/07/2021 11:20:15 - INFO - __main__ - Step 100459: {'lr': 0.00012596020820239312, 'samples': 19288128, 'steps': 100458, 'loss/train': 1.5227681398391724} 11/07/2021 11:20:16 - INFO - __main__ - Step 100460: {'lr': 0.00012595560074729622, 'samples': 19288320, 'steps': 100459, 'loss/train': 1.1203174591064453} 11/07/2021 11:20:16 - INFO - __main__ - Step 100461: {'lr': 0.0001259509933480918, 'samples': 19288512, 'steps': 100460, 'loss/train': 1.3327611684799194} 11/07/2021 11:20:17 - INFO - __main__ - Step 100462: {'lr': 0.00012594638600478197, 'samples': 19288704, 'steps': 100461, 'loss/train': 1.0106005668640137} 11/07/2021 11:20:17 - INFO - __main__ - Step 100463: {'lr': 0.0001259417787173688, 'samples': 19288896, 'steps': 100462, 'loss/train': 1.3239730596542358} 11/07/2021 11:20:18 - INFO - __main__ - Step 100464: {'lr': 0.00012593717148585436, 'samples': 19289088, 'steps': 100463, 'loss/train': 1.356748342514038} 11/07/2021 11:20:18 - INFO - __main__ - Step 100465: {'lr': 0.0001259325643102407, 'samples': 19289280, 'steps': 100464, 'loss/train': 1.3903419971466064} 11/07/2021 11:20:19 - INFO - __main__ - Step 100466: {'lr': 0.0001259279571905299, 'samples': 19289472, 'steps': 100465, 'loss/train': 1.232366681098938} 11/07/2021 11:20:20 - INFO - __main__ - Step 100467: {'lr': 0.00012592335012672403, 'samples': 19289664, 'steps': 100466, 'loss/train': 1.139418601989746} 11/07/2021 11:20:20 - INFO - __main__ - Step 100468: {'lr': 0.0001259187431188252, 'samples': 19289856, 'steps': 100467, 'loss/train': 1.730873942375183} 11/07/2021 11:20:20 - INFO - __main__ - Step 100469: {'lr': 0.00012591413616683548, 'samples': 19290048, 'steps': 100468, 'loss/train': 1.1963046789169312} 11/07/2021 11:20:21 - INFO - __main__ - Step 100470: {'lr': 0.00012590952927075692, 'samples': 19290240, 'steps': 100469, 'loss/train': 1.4935221672058105} 11/07/2021 11:20:21 - INFO - __main__ - Step 100471: {'lr': 0.0001259049224305916, 'samples': 19290432, 'steps': 100470, 'loss/train': 1.5800951719284058} 11/07/2021 11:20:21 - INFO - __main__ - Step 100472: {'lr': 0.00012590031564634164, 'samples': 19290624, 'steps': 100471, 'loss/train': 1.6048283576965332} 11/07/2021 11:20:22 - INFO - __main__ - Step 100473: {'lr': 0.00012589570891800907, 'samples': 19290816, 'steps': 100472, 'loss/train': 0.6016839146614075} 11/07/2021 11:20:23 - INFO - __main__ - Step 100474: {'lr': 0.00012589110224559593, 'samples': 19291008, 'steps': 100473, 'loss/train': 1.6247849464416504} 11/07/2021 11:20:23 - INFO - __main__ - Step 100475: {'lr': 0.0001258864956291044, 'samples': 19291200, 'steps': 100474, 'loss/train': 1.2649589776992798} 11/07/2021 11:20:24 - INFO - __main__ - Step 100476: {'lr': 0.00012588188906853648, 'samples': 19291392, 'steps': 100475, 'loss/train': 1.2202389240264893} 11/07/2021 11:20:24 - INFO - __main__ - Step 100477: {'lr': 0.00012587728256389425, 'samples': 19291584, 'steps': 100476, 'loss/train': 1.1366794109344482} 11/07/2021 11:20:25 - INFO - __main__ - Step 100478: {'lr': 0.00012587267611517995, 'samples': 19291776, 'steps': 100477, 'loss/train': 1.2825672626495361} 11/07/2021 11:20:26 - INFO - __main__ - Step 100479: {'lr': 0.00012586806972239535, 'samples': 19291968, 'steps': 100478, 'loss/train': 0.9839758276939392} 11/07/2021 11:20:26 - INFO - __main__ - Step 100480: {'lr': 0.00012586346338554273, 'samples': 19292160, 'steps': 100479, 'loss/train': 0.06606881320476532} 11/07/2021 11:20:26 - INFO - __main__ - Step 100481: {'lr': 0.00012585885710462408, 'samples': 19292352, 'steps': 100480, 'loss/train': 1.3643447160720825} 11/07/2021 11:20:27 - INFO - __main__ - Step 100482: {'lr': 0.00012585425087964153, 'samples': 19292544, 'steps': 100481, 'loss/train': 0.4255905747413635} 11/07/2021 11:20:28 - INFO - __main__ - Step 100483: {'lr': 0.00012584964471059712, 'samples': 19292736, 'steps': 100482, 'loss/train': 1.0706002712249756} 11/07/2021 11:20:28 - INFO - __main__ - Step 100484: {'lr': 0.00012584503859749296, 'samples': 19292928, 'steps': 100483, 'loss/train': 0.21099810302257538} 11/07/2021 11:20:29 - INFO - __main__ - Step 100485: {'lr': 0.0001258404325403311, 'samples': 19293120, 'steps': 100484, 'loss/train': 0.6027982831001282} 11/07/2021 11:20:29 - INFO - __main__ - Step 100486: {'lr': 0.00012583582653911369, 'samples': 19293312, 'steps': 100485, 'loss/train': 1.3702703714370728} 11/07/2021 11:20:29 - INFO - __main__ - Step 100487: {'lr': 0.00012583122059384267, 'samples': 19293504, 'steps': 100486, 'loss/train': 1.499953031539917} 11/07/2021 11:20:30 - INFO - __main__ - Step 100488: {'lr': 0.00012582661470452022, 'samples': 19293696, 'steps': 100487, 'loss/train': 1.7224851846694946} 11/07/2021 11:20:31 - INFO - __main__ - Step 100489: {'lr': 0.00012582200887114835, 'samples': 19293888, 'steps': 100488, 'loss/train': 1.4748557806015015} 11/07/2021 11:20:31 - INFO - __main__ - Step 100490: {'lr': 0.00012581740309372918, 'samples': 19294080, 'steps': 100489, 'loss/train': 1.3825725317001343} 11/07/2021 11:20:31 - INFO - __main__ - Step 100491: {'lr': 0.00012581279737226486, 'samples': 19294272, 'steps': 100490, 'loss/train': 0.8254987001419067} 11/07/2021 11:20:32 - INFO - __main__ - Step 100492: {'lr': 0.0001258081917067573, 'samples': 19294464, 'steps': 100491, 'loss/train': 1.1646864414215088} 11/07/2021 11:20:33 - INFO - __main__ - Step 100493: {'lr': 0.00012580358609720865, 'samples': 19294656, 'steps': 100492, 'loss/train': 1.127892255783081} 11/07/2021 11:20:33 - INFO - __main__ - Step 100494: {'lr': 0.00012579898054362098, 'samples': 19294848, 'steps': 100493, 'loss/train': 1.9319523572921753} 11/07/2021 11:20:33 - INFO - __main__ - Step 100495: {'lr': 0.00012579437504599638, 'samples': 19295040, 'steps': 100494, 'loss/train': 1.1528338193893433} 11/07/2021 11:20:34 - INFO - __main__ - Step 100496: {'lr': 0.00012578976960433692, 'samples': 19295232, 'steps': 100495, 'loss/train': 1.3215867280960083} 11/07/2021 11:20:34 - INFO - __main__ - Step 100497: {'lr': 0.00012578516421864465, 'samples': 19295424, 'steps': 100496, 'loss/train': 1.608789086341858} 11/07/2021 11:20:35 - INFO - __main__ - Step 100498: {'lr': 0.0001257805588889217, 'samples': 19295616, 'steps': 100497, 'loss/train': 1.1894968748092651} 11/07/2021 11:20:36 - INFO - __main__ - Step 100499: {'lr': 0.00012577595361517007, 'samples': 19295808, 'steps': 100498, 'loss/train': 0.8696349263191223} 11/07/2021 11:20:36 - INFO - __main__ - Step 100500: {'lr': 0.0001257713483973919, 'samples': 19296000, 'steps': 100499, 'loss/train': 1.279352068901062} 11/07/2021 11:20:36 - INFO - __main__ - Step 100501: {'lr': 0.00012576674323558929, 'samples': 19296192, 'steps': 100500, 'loss/train': 1.3656493425369263} 11/07/2021 11:20:37 - INFO - __main__ - Step 100502: {'lr': 0.00012576213812976424, 'samples': 19296384, 'steps': 100501, 'loss/train': 0.7428439259529114} 11/07/2021 11:20:37 - INFO - __main__ - Step 100503: {'lr': 0.00012575753307991883, 'samples': 19296576, 'steps': 100502, 'loss/train': 1.4109607934951782} 11/07/2021 11:20:38 - INFO - __main__ - Step 100504: {'lr': 0.00012575292808605516, 'samples': 19296768, 'steps': 100503, 'loss/train': 1.3837980031967163} 11/07/2021 11:20:38 - INFO - __main__ - Step 100505: {'lr': 0.00012574832314817542, 'samples': 19296960, 'steps': 100504, 'loss/train': 1.2384237051010132} 11/07/2021 11:20:39 - INFO - __main__ - Step 100506: {'lr': 0.00012574371826628146, 'samples': 19297152, 'steps': 100505, 'loss/train': 1.710884690284729} 11/07/2021 11:20:39 - INFO - __main__ - Step 100507: {'lr': 0.00012573911344037546, 'samples': 19297344, 'steps': 100506, 'loss/train': 1.3132280111312866} 11/07/2021 11:20:39 - INFO - __main__ - Step 100508: {'lr': 0.0001257345086704595, 'samples': 19297536, 'steps': 100507, 'loss/train': 1.5906952619552612} 11/07/2021 11:20:41 - INFO - __main__ - Step 100509: {'lr': 0.00012572990395653567, 'samples': 19297728, 'steps': 100508, 'loss/train': 1.5853703022003174} 11/07/2021 11:20:41 - INFO - __main__ - Step 100510: {'lr': 0.00012572529929860598, 'samples': 19297920, 'steps': 100509, 'loss/train': 1.391985297203064} 11/07/2021 11:20:41 - INFO - __main__ - Step 100511: {'lr': 0.00012572069469667257, 'samples': 19298112, 'steps': 100510, 'loss/train': 1.2949861288070679} 11/07/2021 11:20:42 - INFO - __main__ - Step 100512: {'lr': 0.00012571609015073754, 'samples': 19298304, 'steps': 100511, 'loss/train': 0.8308529257774353} 11/07/2021 11:20:42 - INFO - __main__ - Step 100513: {'lr': 0.00012571148566080286, 'samples': 19298496, 'steps': 100512, 'loss/train': 1.3190876245498657} 11/07/2021 11:20:43 - INFO - __main__ - Step 100514: {'lr': 0.00012570688122687075, 'samples': 19298688, 'steps': 100513, 'loss/train': 1.057621955871582} 11/07/2021 11:20:43 - INFO - __main__ - Step 100515: {'lr': 0.00012570227684894315, 'samples': 19298880, 'steps': 100514, 'loss/train': 1.500719428062439} 11/07/2021 11:20:44 - INFO - __main__ - Step 100516: {'lr': 0.00012569767252702227, 'samples': 19299072, 'steps': 100515, 'loss/train': 1.2579792737960815} 11/07/2021 11:20:44 - INFO - __main__ - Step 100517: {'lr': 0.00012569306826111003, 'samples': 19299264, 'steps': 100516, 'loss/train': 1.4720076322555542} 11/07/2021 11:20:45 - INFO - __main__ - Step 100518: {'lr': 0.00012568846405120853, 'samples': 19299456, 'steps': 100517, 'loss/train': 1.4263023138046265} 11/07/2021 11:20:46 - INFO - __main__ - Step 100519: {'lr': 0.00012568385989731996, 'samples': 19299648, 'steps': 100518, 'loss/train': 1.1015963554382324} 11/07/2021 11:20:46 - INFO - __main__ - Step 100520: {'lr': 0.00012567925579944628, 'samples': 19299840, 'steps': 100519, 'loss/train': 1.3819044828414917} 11/07/2021 11:20:46 - INFO - __main__ - Step 100521: {'lr': 0.0001256746517575896, 'samples': 19300032, 'steps': 100520, 'loss/train': 1.3832008838653564} 11/07/2021 11:20:47 - INFO - __main__ - Step 100522: {'lr': 0.00012567004777175203, 'samples': 19300224, 'steps': 100521, 'loss/train': 1.3085567951202393} 11/07/2021 11:20:47 - INFO - __main__ - Step 100523: {'lr': 0.00012566544384193563, 'samples': 19300416, 'steps': 100522, 'loss/train': 1.5899468660354614} 11/07/2021 11:20:48 - INFO - __main__ - Step 100524: {'lr': 0.00012566083996814242, 'samples': 19300608, 'steps': 100523, 'loss/train': 1.122707486152649} 11/07/2021 11:20:48 - INFO - __main__ - Step 100525: {'lr': 0.00012565623615037451, 'samples': 19300800, 'steps': 100524, 'loss/train': 0.6491504311561584} 11/07/2021 11:20:49 - INFO - __main__ - Step 100526: {'lr': 0.00012565163238863403, 'samples': 19300992, 'steps': 100525, 'loss/train': 1.2262423038482666} 11/07/2021 11:20:49 - INFO - __main__ - Step 100527: {'lr': 0.00012564702868292311, 'samples': 19301184, 'steps': 100526, 'loss/train': 1.0670444965362549} 11/07/2021 11:20:49 - INFO - __main__ - Step 100528: {'lr': 0.00012564242503324357, 'samples': 19301376, 'steps': 100527, 'loss/train': 1.2443130016326904} 11/07/2021 11:20:50 - INFO - __main__ - Step 100529: {'lr': 0.0001256378214395977, 'samples': 19301568, 'steps': 100528, 'loss/train': 1.2881920337677002} 11/07/2021 11:20:51 - INFO - __main__ - Step 100530: {'lr': 0.00012563321790198746, 'samples': 19301760, 'steps': 100529, 'loss/train': 1.0791001319885254} 11/07/2021 11:20:51 - INFO - __main__ - Step 100531: {'lr': 0.00012562861442041496, 'samples': 19301952, 'steps': 100530, 'loss/train': 0.9833387732505798} 11/07/2021 11:20:51 - INFO - __main__ - Step 100532: {'lr': 0.0001256240109948823, 'samples': 19302144, 'steps': 100531, 'loss/train': 1.3593987226486206} 11/07/2021 11:20:52 - INFO - __main__ - Step 100533: {'lr': 0.00012561940762539155, 'samples': 19302336, 'steps': 100532, 'loss/train': 1.6798357963562012} 11/07/2021 11:20:52 - INFO - __main__ - Step 100534: {'lr': 0.00012561480431194479, 'samples': 19302528, 'steps': 100533, 'loss/train': 1.0353020429611206} 11/07/2021 11:20:53 - INFO - __main__ - Step 100535: {'lr': 0.00012561020105454406, 'samples': 19302720, 'steps': 100534, 'loss/train': 1.4039465188980103} 11/07/2021 11:20:54 - INFO - __main__ - Step 100536: {'lr': 0.00012560559785319145, 'samples': 19302912, 'steps': 100535, 'loss/train': 1.2419886589050293} 11/07/2021 11:20:54 - INFO - __main__ - Step 100537: {'lr': 0.00012560099470788915, 'samples': 19303104, 'steps': 100536, 'loss/train': 1.1644549369812012} 11/07/2021 11:20:54 - INFO - __main__ - Step 100538: {'lr': 0.000125596391618639, 'samples': 19303296, 'steps': 100537, 'loss/train': 1.1820532083511353} 11/07/2021 11:20:55 - INFO - __main__ - Step 100539: {'lr': 0.00012559178858544324, 'samples': 19303488, 'steps': 100538, 'loss/train': 1.7523338794708252} 11/07/2021 11:20:56 - INFO - __main__ - Step 100540: {'lr': 0.00012558718560830388, 'samples': 19303680, 'steps': 100539, 'loss/train': 1.7296671867370605} 11/07/2021 11:20:56 - INFO - __main__ - Step 100541: {'lr': 0.000125582582687223, 'samples': 19303872, 'steps': 100540, 'loss/train': 1.5348795652389526} 11/07/2021 11:20:56 - INFO - __main__ - Step 100542: {'lr': 0.0001255779798222027, 'samples': 19304064, 'steps': 100541, 'loss/train': 1.6431158781051636} 11/07/2021 11:20:57 - INFO - __main__ - Step 100543: {'lr': 0.00012557337701324503, 'samples': 19304256, 'steps': 100542, 'loss/train': 1.6667805910110474} 11/07/2021 11:20:57 - INFO - __main__ - Step 100544: {'lr': 0.0001255687742603521, 'samples': 19304448, 'steps': 100543, 'loss/train': 1.3406554460525513} 11/07/2021 11:20:58 - INFO - __main__ - Step 100545: {'lr': 0.00012556417156352597, 'samples': 19304640, 'steps': 100544, 'loss/train': 1.1406803131103516} 11/07/2021 11:20:59 - INFO - __main__ - Step 100546: {'lr': 0.00012555956892276865, 'samples': 19304832, 'steps': 100545, 'loss/train': 1.717518925666809} 11/07/2021 11:20:59 - INFO - __main__ - Step 100547: {'lr': 0.0001255549663380823, 'samples': 19305024, 'steps': 100546, 'loss/train': 1.2895175218582153} 11/07/2021 11:20:59 - INFO - __main__ - Step 100548: {'lr': 0.00012555036380946906, 'samples': 19305216, 'steps': 100547, 'loss/train': 1.16202712059021} 11/07/2021 11:21:00 - INFO - __main__ - Step 100549: {'lr': 0.0001255457613369308, 'samples': 19305408, 'steps': 100548, 'loss/train': 1.02744722366333} 11/07/2021 11:21:01 - INFO - __main__ - Step 100550: {'lr': 0.00012554115892046973, 'samples': 19305600, 'steps': 100549, 'loss/train': 1.5271711349487305} 11/07/2021 11:21:01 - INFO - __main__ - Step 100551: {'lr': 0.00012553655656008782, 'samples': 19305792, 'steps': 100550, 'loss/train': 1.2179840803146362} 11/07/2021 11:21:02 - INFO - __main__ - Step 100552: {'lr': 0.00012553195425578728, 'samples': 19305984, 'steps': 100551, 'loss/train': 0.7690213322639465} 11/07/2021 11:21:02 - INFO - __main__ - Step 100553: {'lr': 0.00012552735200757013, 'samples': 19306176, 'steps': 100552, 'loss/train': 1.0193122625350952} 11/07/2021 11:21:02 - INFO - __main__ - Step 100554: {'lr': 0.00012552274981543843, 'samples': 19306368, 'steps': 100553, 'loss/train': 0.585444450378418} 11/07/2021 11:21:03 - INFO - __main__ - Step 100555: {'lr': 0.00012551814767939424, 'samples': 19306560, 'steps': 100554, 'loss/train': 1.0996699333190918} 11/07/2021 11:21:04 - INFO - __main__ - Step 100556: {'lr': 0.00012551354559943963, 'samples': 19306752, 'steps': 100555, 'loss/train': 1.1834967136383057} 11/07/2021 11:21:04 - INFO - __main__ - Step 100557: {'lr': 0.00012550894357557673, 'samples': 19306944, 'steps': 100556, 'loss/train': 0.6747562289237976} 11/07/2021 11:21:04 - INFO - __main__ - Step 100558: {'lr': 0.00012550434160780755, 'samples': 19307136, 'steps': 100557, 'loss/train': 0.6406893730163574} 11/07/2021 11:21:05 - INFO - __main__ - Step 100559: {'lr': 0.0001254997396961343, 'samples': 19307328, 'steps': 100558, 'loss/train': 1.343943476676941} 11/07/2021 11:21:06 - INFO - __main__ - Step 100560: {'lr': 0.0001254951378405589, 'samples': 19307520, 'steps': 100559, 'loss/train': 1.861322045326233} 11/07/2021 11:21:06 - INFO - __main__ - Step 100561: {'lr': 0.0001254905360410834, 'samples': 19307712, 'steps': 100560, 'loss/train': 1.502820372581482} 11/07/2021 11:21:06 - INFO - __main__ - Step 100562: {'lr': 0.00012548593429770997, 'samples': 19307904, 'steps': 100561, 'loss/train': 1.116566777229309} 11/07/2021 11:21:07 - INFO - __main__ - Step 100563: {'lr': 0.00012548133261044064, 'samples': 19308096, 'steps': 100562, 'loss/train': 1.2414616346359253} 11/07/2021 11:21:07 - INFO - __main__ - Step 100564: {'lr': 0.00012547673097927753, 'samples': 19308288, 'steps': 100563, 'loss/train': 1.2975021600723267} 11/07/2021 11:21:08 - INFO - __main__ - Step 100565: {'lr': 0.00012547212940422264, 'samples': 19308480, 'steps': 100564, 'loss/train': 2.2042059898376465} 11/07/2021 11:21:09 - INFO - __main__ - Step 100566: {'lr': 0.00012546752788527814, 'samples': 19308672, 'steps': 100565, 'loss/train': 1.1695637702941895} 11/07/2021 11:21:09 - INFO - __main__ - Step 100567: {'lr': 0.000125462926422446, 'samples': 19308864, 'steps': 100566, 'loss/train': 1.1906951665878296} 11/07/2021 11:21:09 - INFO - __main__ - Step 100568: {'lr': 0.0001254583250157284, 'samples': 19309056, 'steps': 100567, 'loss/train': 1.3453141450881958} 11/07/2021 11:21:10 - INFO - __main__ - Step 100569: {'lr': 0.0001254537236651273, 'samples': 19309248, 'steps': 100568, 'loss/train': 1.1183042526245117} 11/07/2021 11:21:10 - INFO - __main__ - Step 100570: {'lr': 0.00012544912237064486, 'samples': 19309440, 'steps': 100569, 'loss/train': 1.0271931886672974} 11/07/2021 11:21:11 - INFO - __main__ - Step 100571: {'lr': 0.00012544452113228313, 'samples': 19309632, 'steps': 100570, 'loss/train': 1.2944204807281494} 11/07/2021 11:21:11 - INFO - __main__ - Step 100572: {'lr': 0.0001254399199500443, 'samples': 19309824, 'steps': 100571, 'loss/train': 1.6301113367080688} 11/07/2021 11:21:12 - INFO - __main__ - Step 100573: {'lr': 0.00012543531882393017, 'samples': 19310016, 'steps': 100572, 'loss/train': 1.1393530368804932} 11/07/2021 11:21:12 - INFO - __main__ - Step 100574: {'lr': 0.00012543071775394297, 'samples': 19310208, 'steps': 100573, 'loss/train': 1.217557668685913} 11/07/2021 11:21:12 - INFO - __main__ - Step 100575: {'lr': 0.00012542611674008476, 'samples': 19310400, 'steps': 100574, 'loss/train': 1.4889075756072998} 11/07/2021 11:21:13 - INFO - __main__ - Step 100576: {'lr': 0.00012542151578235762, 'samples': 19310592, 'steps': 100575, 'loss/train': 1.0121265649795532} 11/07/2021 11:21:14 - INFO - __main__ - Step 100577: {'lr': 0.00012541691488076367, 'samples': 19310784, 'steps': 100576, 'loss/train': 1.6277475357055664} 11/07/2021 11:21:14 - INFO - __main__ - Step 100578: {'lr': 0.0001254123140353049, 'samples': 19310976, 'steps': 100577, 'loss/train': 1.4860360622406006} 11/07/2021 11:21:14 - INFO - __main__ - Step 100579: {'lr': 0.00012540771324598345, 'samples': 19311168, 'steps': 100578, 'loss/train': 1.3057551383972168} 11/07/2021 11:21:15 - INFO - __main__ - Step 100580: {'lr': 0.0001254031125128013, 'samples': 19311360, 'steps': 100579, 'loss/train': 1.4169167280197144} 11/07/2021 11:21:16 - INFO - __main__ - Step 100581: {'lr': 0.00012539851183576063, 'samples': 19311552, 'steps': 100580, 'loss/train': 1.3658286333084106} 11/07/2021 11:21:16 - INFO - __main__ - Step 100582: {'lr': 0.00012539391121486342, 'samples': 19311744, 'steps': 100581, 'loss/train': 1.6470754146575928} 11/07/2021 11:21:16 - INFO - __main__ - Step 100583: {'lr': 0.00012538931065011186, 'samples': 19311936, 'steps': 100582, 'loss/train': 1.2963316440582275} 11/07/2021 11:21:17 - INFO - __main__ - Step 100584: {'lr': 0.00012538471014150794, 'samples': 19312128, 'steps': 100583, 'loss/train': 1.2800688743591309} 11/07/2021 11:21:17 - INFO - __main__ - Step 100585: {'lr': 0.00012538010968905382, 'samples': 19312320, 'steps': 100584, 'loss/train': 1.2274959087371826} 11/07/2021 11:21:18 - INFO - __main__ - Step 100586: {'lr': 0.0001253755092927514, 'samples': 19312512, 'steps': 100585, 'loss/train': 1.320258378982544} 11/07/2021 11:21:18 - INFO - __main__ - Step 100587: {'lr': 0.00012537090895260283, 'samples': 19312704, 'steps': 100586, 'loss/train': 1.4725959300994873} 11/07/2021 11:21:19 - INFO - __main__ - Step 100588: {'lr': 0.00012536630866861028, 'samples': 19312896, 'steps': 100587, 'loss/train': 1.02378511428833} 11/07/2021 11:21:19 - INFO - __main__ - Step 100589: {'lr': 0.00012536170844077568, 'samples': 19313088, 'steps': 100588, 'loss/train': 1.2397220134735107} 11/07/2021 11:21:19 - INFO - __main__ - Step 100590: {'lr': 0.0001253571082691012, 'samples': 19313280, 'steps': 100589, 'loss/train': 1.3278712034225464} 11/07/2021 11:21:21 - INFO - __main__ - Step 100591: {'lr': 0.00012535250815358888, 'samples': 19313472, 'steps': 100590, 'loss/train': 1.5375958681106567} 11/07/2021 11:21:21 - INFO - __main__ - Step 100592: {'lr': 0.0001253479080942408, 'samples': 19313664, 'steps': 100591, 'loss/train': 1.1272344589233398} 11/07/2021 11:21:21 - INFO - __main__ - Step 100593: {'lr': 0.00012534330809105902, 'samples': 19313856, 'steps': 100592, 'loss/train': 1.3158907890319824} 11/07/2021 11:21:22 - INFO - __main__ - Step 100594: {'lr': 0.00012533870814404564, 'samples': 19314048, 'steps': 100593, 'loss/train': 0.652056097984314} 11/07/2021 11:21:22 - INFO - __main__ - Step 100595: {'lr': 0.00012533410825320268, 'samples': 19314240, 'steps': 100594, 'loss/train': 1.6004512310028076} 11/07/2021 11:21:23 - INFO - __main__ - Step 100596: {'lr': 0.00012532950841853227, 'samples': 19314432, 'steps': 100595, 'loss/train': 1.3339436054229736} 11/07/2021 11:21:23 - INFO - __main__ - Step 100597: {'lr': 0.00012532490864003646, 'samples': 19314624, 'steps': 100596, 'loss/train': 0.8209711909294128} 11/07/2021 11:21:24 - INFO - __main__ - Step 100598: {'lr': 0.0001253203089177173, 'samples': 19314816, 'steps': 100597, 'loss/train': 1.2380229234695435} 11/07/2021 11:21:24 - INFO - __main__ - Step 100599: {'lr': 0.000125315709251577, 'samples': 19315008, 'steps': 100598, 'loss/train': 1.6003737449645996} 11/07/2021 11:21:24 - INFO - __main__ - Step 100600: {'lr': 0.00012531110964161742, 'samples': 19315200, 'steps': 100599, 'loss/train': 1.6604552268981934} 11/07/2021 11:21:25 - INFO - __main__ - Step 100601: {'lr': 0.00012530651008784075, 'samples': 19315392, 'steps': 100600, 'loss/train': 0.47445008158683777} 11/07/2021 11:21:26 - INFO - __main__ - Step 100602: {'lr': 0.00012530191059024904, 'samples': 19315584, 'steps': 100601, 'loss/train': 1.5166183710098267} 11/07/2021 11:21:26 - INFO - __main__ - Step 100603: {'lr': 0.00012529731114884436, 'samples': 19315776, 'steps': 100602, 'loss/train': 1.146264910697937} 11/07/2021 11:21:26 - INFO - __main__ - Step 100604: {'lr': 0.00012529271176362874, 'samples': 19315968, 'steps': 100603, 'loss/train': 1.4381730556488037} 11/07/2021 11:21:27 - INFO - __main__ - Step 100605: {'lr': 0.00012528811243460436, 'samples': 19316160, 'steps': 100604, 'loss/train': 1.797170639038086} 11/07/2021 11:21:28 - INFO - __main__ - Step 100606: {'lr': 0.0001252835131617732, 'samples': 19316352, 'steps': 100605, 'loss/train': 1.283431053161621} 11/07/2021 11:21:28 - INFO - __main__ - Step 100607: {'lr': 0.00012527891394513736, 'samples': 19316544, 'steps': 100606, 'loss/train': 1.1802252531051636} 11/07/2021 11:21:29 - INFO - __main__ - Step 100608: {'lr': 0.0001252743147846989, 'samples': 19316736, 'steps': 100607, 'loss/train': 1.3997348546981812} 11/07/2021 11:21:29 - INFO - __main__ - Step 100609: {'lr': 0.00012526971568045997, 'samples': 19316928, 'steps': 100608, 'loss/train': 1.2195626497268677} 11/07/2021 11:21:29 - INFO - __main__ - Step 100610: {'lr': 0.00012526511663242258, 'samples': 19317120, 'steps': 100609, 'loss/train': 1.5265231132507324} 11/07/2021 11:21:30 - INFO - __main__ - Step 100611: {'lr': 0.00012526051764058876, 'samples': 19317312, 'steps': 100610, 'loss/train': 1.158758521080017} 11/07/2021 11:21:31 - INFO - __main__ - Step 100612: {'lr': 0.00012525591870496072, 'samples': 19317504, 'steps': 100611, 'loss/train': 1.4764920473098755} 11/07/2021 11:21:32 - INFO - __main__ - Step 100613: {'lr': 0.00012525131982554037, 'samples': 19317696, 'steps': 100612, 'loss/train': 1.34004545211792} 11/07/2021 11:21:32 - INFO - __main__ - Step 100614: {'lr': 0.0001252467210023298, 'samples': 19317888, 'steps': 100613, 'loss/train': 1.2834464311599731} 11/07/2021 11:21:32 - INFO - __main__ - Step 100615: {'lr': 0.00012524212223533122, 'samples': 19318080, 'steps': 100614, 'loss/train': 0.33001020550727844} 11/07/2021 11:21:33 - INFO - __main__ - Step 100616: {'lr': 0.00012523752352454654, 'samples': 19318272, 'steps': 100615, 'loss/train': 1.097434401512146} 11/07/2021 11:21:34 - INFO - __main__ - Step 100617: {'lr': 0.00012523292486997794, 'samples': 19318464, 'steps': 100616, 'loss/train': 1.278036117553711} 11/07/2021 11:21:34 - INFO - __main__ - Step 100618: {'lr': 0.00012522832627162743, 'samples': 19318656, 'steps': 100617, 'loss/train': 1.1267671585083008} 11/07/2021 11:21:34 - INFO - __main__ - Step 100619: {'lr': 0.00012522372772949715, 'samples': 19318848, 'steps': 100618, 'loss/train': 0.9717576503753662} 11/07/2021 11:21:35 - INFO - __main__ - Step 100620: {'lr': 0.00012521912924358912, 'samples': 19319040, 'steps': 100619, 'loss/train': 1.4131824970245361} 11/07/2021 11:21:35 - INFO - __main__ - Step 100621: {'lr': 0.0001252145308139054, 'samples': 19319232, 'steps': 100620, 'loss/train': 1.38057541847229} 11/07/2021 11:21:36 - INFO - __main__ - Step 100622: {'lr': 0.0001252099324404481, 'samples': 19319424, 'steps': 100621, 'loss/train': 1.278188943862915} 11/07/2021 11:21:36 - INFO - __main__ - Step 100623: {'lr': 0.0001252053341232193, 'samples': 19319616, 'steps': 100622, 'loss/train': 1.3660459518432617} 11/07/2021 11:21:37 - INFO - __main__ - Step 100624: {'lr': 0.00012520073586222102, 'samples': 19319808, 'steps': 100623, 'loss/train': 1.0613547563552856} 11/07/2021 11:21:37 - INFO - __main__ - Step 100625: {'lr': 0.00012519613765745542, 'samples': 19320000, 'steps': 100624, 'loss/train': 1.4221079349517822} 11/07/2021 11:21:38 - INFO - __main__ - Step 100626: {'lr': 0.00012519153950892454, 'samples': 19320192, 'steps': 100625, 'loss/train': 0.6932984590530396} 11/07/2021 11:21:39 - INFO - __main__ - Step 100627: {'lr': 0.00012518694141663036, 'samples': 19320384, 'steps': 100626, 'loss/train': 1.370640754699707} 11/07/2021 11:21:39 - INFO - __main__ - Step 100628: {'lr': 0.00012518234338057503, 'samples': 19320576, 'steps': 100627, 'loss/train': 1.6611807346343994} 11/07/2021 11:21:39 - INFO - __main__ - Step 100629: {'lr': 0.0001251777454007606, 'samples': 19320768, 'steps': 100628, 'loss/train': 1.606951117515564} 11/07/2021 11:21:40 - INFO - __main__ - Step 100630: {'lr': 0.00012517314747718914, 'samples': 19320960, 'steps': 100629, 'loss/train': 1.4266983270645142} 11/07/2021 11:21:40 - INFO - __main__ - Step 100631: {'lr': 0.00012516854960986274, 'samples': 19321152, 'steps': 100630, 'loss/train': 1.051461935043335} 11/07/2021 11:21:41 - INFO - __main__ - Step 100632: {'lr': 0.00012516395179878347, 'samples': 19321344, 'steps': 100631, 'loss/train': 0.7687021493911743} 11/07/2021 11:21:41 - INFO - __main__ - Step 100633: {'lr': 0.0001251593540439534, 'samples': 19321536, 'steps': 100632, 'loss/train': 1.4125795364379883} 11/07/2021 11:21:42 - INFO - __main__ - Step 100634: {'lr': 0.0001251547563453746, 'samples': 19321728, 'steps': 100633, 'loss/train': 1.3428523540496826} 11/07/2021 11:21:42 - INFO - __main__ - Step 100635: {'lr': 0.00012515015870304914, 'samples': 19321920, 'steps': 100634, 'loss/train': 1.3761411905288696} 11/07/2021 11:21:42 - INFO - __main__ - Step 100636: {'lr': 0.00012514556111697906, 'samples': 19322112, 'steps': 100635, 'loss/train': 1.3365490436553955} 11/07/2021 11:21:43 - INFO - __main__ - Step 100637: {'lr': 0.00012514096358716648, 'samples': 19322304, 'steps': 100636, 'loss/train': 1.496516466140747} 11/07/2021 11:21:44 - INFO - __main__ - Step 100638: {'lr': 0.00012513636611361347, 'samples': 19322496, 'steps': 100637, 'loss/train': 1.6292662620544434} 11/07/2021 11:21:44 - INFO - __main__ - Step 100639: {'lr': 0.0001251317686963222, 'samples': 19322688, 'steps': 100638, 'loss/train': 1.5316894054412842} 11/07/2021 11:21:44 - INFO - __main__ - Step 100640: {'lr': 0.0001251271713352945, 'samples': 19322880, 'steps': 100639, 'loss/train': 1.201317548751831} 11/07/2021 11:21:45 - INFO - __main__ - Step 100641: {'lr': 0.00012512257403053255, 'samples': 19323072, 'steps': 100640, 'loss/train': 1.5486046075820923} 11/07/2021 11:21:45 - INFO - __main__ - Step 100642: {'lr': 0.0001251179767820385, 'samples': 19323264, 'steps': 100641, 'loss/train': 1.182454228401184} 11/07/2021 11:21:46 - INFO - __main__ - Step 100643: {'lr': 0.00012511337958981433, 'samples': 19323456, 'steps': 100642, 'loss/train': 1.144260048866272} 11/07/2021 11:21:47 - INFO - __main__ - Step 100644: {'lr': 0.00012510878245386214, 'samples': 19323648, 'steps': 100643, 'loss/train': 1.5611202716827393} 11/07/2021 11:21:47 - INFO - __main__ - Step 100645: {'lr': 0.000125104185374184, 'samples': 19323840, 'steps': 100644, 'loss/train': 1.2516261339187622} 11/07/2021 11:21:47 - INFO - __main__ - Step 100646: {'lr': 0.000125099588350782, 'samples': 19324032, 'steps': 100645, 'loss/train': 1.545809030532837} 11/07/2021 11:21:48 - INFO - __main__ - Step 100647: {'lr': 0.0001250949913836582, 'samples': 19324224, 'steps': 100646, 'loss/train': 1.3593811988830566} 11/07/2021 11:21:49 - INFO - __main__ - Step 100648: {'lr': 0.00012509039447281467, 'samples': 19324416, 'steps': 100647, 'loss/train': 1.052096962928772} 11/07/2021 11:21:49 - INFO - __main__ - Step 100649: {'lr': 0.0001250857976182535, 'samples': 19324608, 'steps': 100648, 'loss/train': 1.382841944694519} 11/07/2021 11:21:50 - INFO - __main__ - Step 100650: {'lr': 0.0001250812008199767, 'samples': 19324800, 'steps': 100649, 'loss/train': 1.0178351402282715} 11/07/2021 11:21:50 - INFO - __main__ - Step 100651: {'lr': 0.0001250766040779864, 'samples': 19324992, 'steps': 100650, 'loss/train': 1.4368808269500732} 11/07/2021 11:21:50 - INFO - __main__ - Step 100652: {'lr': 0.00012507200739228475, 'samples': 19325184, 'steps': 100651, 'loss/train': 1.8341132402420044} 11/07/2021 11:21:51 - INFO - __main__ - Step 100653: {'lr': 0.00012506741076287364, 'samples': 19325376, 'steps': 100652, 'loss/train': 1.2527415752410889} 11/07/2021 11:21:52 - INFO - __main__ - Step 100654: {'lr': 0.00012506281418975522, 'samples': 19325568, 'steps': 100653, 'loss/train': 0.9546559453010559} 11/07/2021 11:21:52 - INFO - __main__ - Step 100655: {'lr': 0.00012505821767293157, 'samples': 19325760, 'steps': 100654, 'loss/train': 1.6223740577697754} 11/07/2021 11:21:52 - INFO - __main__ - Step 100656: {'lr': 0.00012505362121240476, 'samples': 19325952, 'steps': 100655, 'loss/train': 1.36245858669281} 11/07/2021 11:21:53 - INFO - __main__ - Step 100657: {'lr': 0.00012504902480817688, 'samples': 19326144, 'steps': 100656, 'loss/train': 1.2804726362228394} 11/07/2021 11:21:54 - INFO - __main__ - Step 100658: {'lr': 0.00012504442846024994, 'samples': 19326336, 'steps': 100657, 'loss/train': 2.5754570960998535} 11/07/2021 11:21:54 - INFO - __main__ - Step 100659: {'lr': 0.00012503983216862607, 'samples': 19326528, 'steps': 100658, 'loss/train': 1.0639057159423828} 11/07/2021 11:21:55 - INFO - __main__ - Step 100660: {'lr': 0.00012503523593330733, 'samples': 19326720, 'steps': 100659, 'loss/train': 1.263903021812439} 11/07/2021 11:21:55 - INFO - __main__ - Step 100661: {'lr': 0.00012503063975429578, 'samples': 19326912, 'steps': 100660, 'loss/train': 1.180719017982483} 11/07/2021 11:21:55 - INFO - __main__ - Step 100662: {'lr': 0.0001250260436315935, 'samples': 19327104, 'steps': 100661, 'loss/train': 1.391923427581787} 11/07/2021 11:21:56 - INFO - __main__ - Step 100663: {'lr': 0.00012502144756520255, 'samples': 19327296, 'steps': 100662, 'loss/train': 1.2539050579071045} 11/07/2021 11:21:57 - INFO - __main__ - Step 100664: {'lr': 0.00012501685155512498, 'samples': 19327488, 'steps': 100663, 'loss/train': 0.9426024556159973} 11/07/2021 11:21:57 - INFO - __main__ - Step 100665: {'lr': 0.0001250122556013629, 'samples': 19327680, 'steps': 100664, 'loss/train': 1.6647104024887085} 11/07/2021 11:21:57 - INFO - __main__ - Step 100666: {'lr': 0.00012500765970391853, 'samples': 19327872, 'steps': 100665, 'loss/train': 1.3072991371154785} 11/07/2021 11:21:58 - INFO - __main__ - Step 100667: {'lr': 0.0001250030638627936, 'samples': 19328064, 'steps': 100666, 'loss/train': 1.422594666481018} 11/07/2021 11:21:58 - INFO - __main__ - Step 100668: {'lr': 0.00012499846807799043, 'samples': 19328256, 'steps': 100667, 'loss/train': 1.0569790601730347} 11/07/2021 11:21:59 - INFO - __main__ - Step 100669: {'lr': 0.00012499387234951096, 'samples': 19328448, 'steps': 100668, 'loss/train': 1.284818410873413} 11/07/2021 11:21:59 - INFO - __main__ - Step 100670: {'lr': 0.00012498927667735734, 'samples': 19328640, 'steps': 100669, 'loss/train': 0.8325573801994324} 11/07/2021 11:22:00 - INFO - __main__ - Step 100671: {'lr': 0.00012498468106153166, 'samples': 19328832, 'steps': 100670, 'loss/train': 1.0454072952270508} 11/07/2021 11:22:00 - INFO - __main__ - Step 100672: {'lr': 0.0001249800855020359, 'samples': 19329024, 'steps': 100671, 'loss/train': 0.9320375919342041} 11/07/2021 11:22:00 - INFO - __main__ - Step 100673: {'lr': 0.00012497548999887222, 'samples': 19329216, 'steps': 100672, 'loss/train': 1.2908449172973633} 11/07/2021 11:22:01 - INFO - __main__ - Step 100674: {'lr': 0.00012497089455204265, 'samples': 19329408, 'steps': 100673, 'loss/train': 1.6215307712554932} 11/07/2021 11:22:02 - INFO - __main__ - Step 100675: {'lr': 0.00012496629916154925, 'samples': 19329600, 'steps': 100674, 'loss/train': 1.6498994827270508} 11/07/2021 11:22:02 - INFO - __main__ - Step 100676: {'lr': 0.00012496170382739414, 'samples': 19329792, 'steps': 100675, 'loss/train': 1.2867368459701538} 11/07/2021 11:22:02 - INFO - __main__ - Step 100677: {'lr': 0.00012495710854957932, 'samples': 19329984, 'steps': 100676, 'loss/train': 1.249769926071167} 11/07/2021 11:22:03 - INFO - __main__ - Step 100678: {'lr': 0.0001249525133281069, 'samples': 19330176, 'steps': 100677, 'loss/train': 0.8907338976860046} 11/07/2021 11:22:04 - INFO - __main__ - Step 100679: {'lr': 0.00012494791816297906, 'samples': 19330368, 'steps': 100678, 'loss/train': 1.154523253440857} 11/07/2021 11:22:04 - INFO - __main__ - Step 100680: {'lr': 0.00012494332305419765, 'samples': 19330560, 'steps': 100679, 'loss/train': 1.6271674633026123} 11/07/2021 11:22:05 - INFO - __main__ - Step 100681: {'lr': 0.00012493872800176486, 'samples': 19330752, 'steps': 100680, 'loss/train': 1.131747841835022} 11/07/2021 11:22:05 - INFO - __main__ - Step 100682: {'lr': 0.00012493413300568274, 'samples': 19330944, 'steps': 100681, 'loss/train': 1.8452390432357788} 11/07/2021 11:22:05 - INFO - __main__ - Step 100683: {'lr': 0.0001249295380659534, 'samples': 19331136, 'steps': 100682, 'loss/train': 1.5238783359527588} 11/07/2021 11:22:06 - INFO - __main__ - Step 100684: {'lr': 0.00012492494318257883, 'samples': 19331328, 'steps': 100683, 'loss/train': 1.5208805799484253} 11/07/2021 11:22:07 - INFO - __main__ - Step 100685: {'lr': 0.00012492034835556118, 'samples': 19331520, 'steps': 100684, 'loss/train': 1.8329921960830688} 11/07/2021 11:22:07 - INFO - __main__ - Step 100686: {'lr': 0.00012491575358490248, 'samples': 19331712, 'steps': 100685, 'loss/train': 1.0029029846191406} 11/07/2021 11:22:07 - INFO - __main__ - Step 100687: {'lr': 0.00012491115887060483, 'samples': 19331904, 'steps': 100686, 'loss/train': 1.0119448900222778} 11/07/2021 11:22:08 - INFO - __main__ - Step 100688: {'lr': 0.00012490656421267028, 'samples': 19332096, 'steps': 100687, 'loss/train': 1.32854425907135} 11/07/2021 11:22:09 - INFO - __main__ - Step 100689: {'lr': 0.00012490196961110087, 'samples': 19332288, 'steps': 100688, 'loss/train': 1.4400948286056519} 11/07/2021 11:22:09 - INFO - __main__ - Step 100690: {'lr': 0.00012489737506589873, 'samples': 19332480, 'steps': 100689, 'loss/train': 1.4900004863739014} 11/07/2021 11:22:09 - INFO - __main__ - Step 100691: {'lr': 0.0001248927805770659, 'samples': 19332672, 'steps': 100690, 'loss/train': 1.1083405017852783} 11/07/2021 11:22:10 - INFO - __main__ - Step 100692: {'lr': 0.00012488818614460445, 'samples': 19332864, 'steps': 100691, 'loss/train': 1.2111918926239014} 11/07/2021 11:22:10 - INFO - __main__ - Step 100693: {'lr': 0.00012488359176851654, 'samples': 19333056, 'steps': 100692, 'loss/train': 0.45349422097206116} 11/07/2021 11:22:10 - INFO - __main__ - Step 100694: {'lr': 0.00012487899744880406, 'samples': 19333248, 'steps': 100693, 'loss/train': 1.3603914976119995} 11/07/2021 11:22:12 - INFO - __main__ - Step 100695: {'lr': 0.0001248744031854692, 'samples': 19333440, 'steps': 100694, 'loss/train': 1.1397830247879028} 11/07/2021 11:22:12 - INFO - __main__ - Step 100696: {'lr': 0.00012486980897851398, 'samples': 19333632, 'steps': 100695, 'loss/train': 1.202148675918579} 11/07/2021 11:22:12 - INFO - __main__ - Step 100697: {'lr': 0.00012486521482794048, 'samples': 19333824, 'steps': 100696, 'loss/train': 1.2595832347869873} 11/07/2021 11:22:13 - INFO - __main__ - Step 100698: {'lr': 0.0001248606207337508, 'samples': 19334016, 'steps': 100697, 'loss/train': 1.0712664127349854} 11/07/2021 11:22:13 - INFO - __main__ - Step 100699: {'lr': 0.00012485602669594698, 'samples': 19334208, 'steps': 100698, 'loss/train': 1.2148308753967285} 11/07/2021 11:22:14 - INFO - __main__ - Step 100700: {'lr': 0.0001248514327145311, 'samples': 19334400, 'steps': 100699, 'loss/train': 1.5582435131072998} 11/07/2021 11:22:14 - INFO - __main__ - Step 100701: {'lr': 0.00012484683878950526, 'samples': 19334592, 'steps': 100700, 'loss/train': 1.3650866746902466} 11/07/2021 11:22:15 - INFO - __main__ - Step 100702: {'lr': 0.0001248422449208715, 'samples': 19334784, 'steps': 100701, 'loss/train': 1.2849466800689697} 11/07/2021 11:22:15 - INFO - __main__ - Step 100703: {'lr': 0.00012483765110863187, 'samples': 19334976, 'steps': 100702, 'loss/train': 1.3958947658538818} 11/07/2021 11:22:15 - INFO - __main__ - Step 100704: {'lr': 0.00012483305735278846, 'samples': 19335168, 'steps': 100703, 'loss/train': 1.201257348060608} 11/07/2021 11:22:17 - INFO - __main__ - Step 100705: {'lr': 0.00012482846365334337, 'samples': 19335360, 'steps': 100704, 'loss/train': 1.8055274486541748} 11/07/2021 11:22:17 - INFO - __main__ - Step 100706: {'lr': 0.00012482387001029873, 'samples': 19335552, 'steps': 100705, 'loss/train': 1.0853408575057983} 11/07/2021 11:22:17 - INFO - __main__ - Step 100707: {'lr': 0.00012481927642365642, 'samples': 19335744, 'steps': 100706, 'loss/train': 1.1674808263778687} 11/07/2021 11:22:18 - INFO - __main__ - Step 100708: {'lr': 0.00012481468289341863, 'samples': 19335936, 'steps': 100707, 'loss/train': 0.6476646065711975} 11/07/2021 11:22:18 - INFO - __main__ - Step 100709: {'lr': 0.00012481008941958737, 'samples': 19336128, 'steps': 100708, 'loss/train': 1.4757256507873535} 11/07/2021 11:22:18 - INFO - __main__ - Step 100710: {'lr': 0.0001248054960021648, 'samples': 19336320, 'steps': 100709, 'loss/train': 1.5747418403625488} 11/07/2021 11:22:19 - INFO - __main__ - Step 100711: {'lr': 0.00012480090264115293, 'samples': 19336512, 'steps': 100710, 'loss/train': 1.9368906021118164} 11/07/2021 11:22:20 - INFO - __main__ - Step 100712: {'lr': 0.0001247963093365538, 'samples': 19336704, 'steps': 100711, 'loss/train': 0.9354365468025208} 11/07/2021 11:22:20 - INFO - __main__ - Step 100713: {'lr': 0.00012479171608836958, 'samples': 19336896, 'steps': 100712, 'loss/train': 1.4941902160644531} 11/07/2021 11:22:20 - INFO - __main__ - Step 100714: {'lr': 0.00012478712289660225, 'samples': 19337088, 'steps': 100713, 'loss/train': 1.650427222251892} 11/07/2021 11:22:21 - INFO - __main__ - Step 100715: {'lr': 0.00012478252976125392, 'samples': 19337280, 'steps': 100714, 'loss/train': 1.3341615200042725} 11/07/2021 11:22:22 - INFO - __main__ - Step 100716: {'lr': 0.00012477793668232666, 'samples': 19337472, 'steps': 100715, 'loss/train': 1.055092692375183} 11/07/2021 11:22:22 - INFO - __main__ - Step 100717: {'lr': 0.00012477334365982248, 'samples': 19337664, 'steps': 100716, 'loss/train': 5.693792819976807} 11/07/2021 11:22:23 - INFO - __main__ - Step 100718: {'lr': 0.00012476875069374356, 'samples': 19337856, 'steps': 100717, 'loss/train': 1.2158024311065674} 11/07/2021 11:22:23 - INFO - __main__ - Step 100719: {'lr': 0.00012476415778409186, 'samples': 19338048, 'steps': 100718, 'loss/train': 1.7165148258209229} 11/07/2021 11:22:23 - INFO - __main__ - Step 100720: {'lr': 0.0001247595649308696, 'samples': 19338240, 'steps': 100719, 'loss/train': 1.1698188781738281} 11/07/2021 11:22:24 - INFO - __main__ - Step 100721: {'lr': 0.0001247549721340787, 'samples': 19338432, 'steps': 100720, 'loss/train': 1.5429959297180176} 11/07/2021 11:22:25 - INFO - __main__ - Step 100722: {'lr': 0.00012475037939372124, 'samples': 19338624, 'steps': 100721, 'loss/train': 1.3416260480880737} 11/07/2021 11:22:25 - INFO - __main__ - Step 100723: {'lr': 0.00012474578670979933, 'samples': 19338816, 'steps': 100722, 'loss/train': 1.4641047716140747} 11/07/2021 11:22:25 - INFO - __main__ - Step 100724: {'lr': 0.00012474119408231504, 'samples': 19339008, 'steps': 100723, 'loss/train': 1.242475152015686} 11/07/2021 11:22:26 - INFO - __main__ - Step 100725: {'lr': 0.00012473660151127042, 'samples': 19339200, 'steps': 100724, 'loss/train': 1.3778830766677856} 11/07/2021 11:22:27 - INFO - __main__ - Step 100726: {'lr': 0.00012473200899666757, 'samples': 19339392, 'steps': 100725, 'loss/train': 1.4994219541549683} 11/07/2021 11:22:27 - INFO - __main__ - Step 100727: {'lr': 0.00012472741653850856, 'samples': 19339584, 'steps': 100726, 'loss/train': 1.1600403785705566} 11/07/2021 11:22:28 - INFO - __main__ - Step 100728: {'lr': 0.0001247228241367954, 'samples': 19339776, 'steps': 100727, 'loss/train': 1.2948319911956787} 11/07/2021 11:22:28 - INFO - __main__ - Step 100729: {'lr': 0.0001247182317915302, 'samples': 19339968, 'steps': 100728, 'loss/train': 2.0578255653381348} 11/07/2021 11:22:28 - INFO - __main__ - Step 100730: {'lr': 0.0001247136395027151, 'samples': 19340160, 'steps': 100729, 'loss/train': 1.1573355197906494} 11/07/2021 11:22:29 - INFO - __main__ - Step 100731: {'lr': 0.00012470904727035205, 'samples': 19340352, 'steps': 100730, 'loss/train': 1.4588371515274048} 11/07/2021 11:22:30 - INFO - __main__ - Step 100732: {'lr': 0.00012470445509444317, 'samples': 19340544, 'steps': 100731, 'loss/train': 1.2883707284927368} 11/07/2021 11:22:30 - INFO - __main__ - Step 100733: {'lr': 0.00012469986297499063, 'samples': 19340736, 'steps': 100732, 'loss/train': 1.6817952394485474} 11/07/2021 11:22:30 - INFO - __main__ - Step 100734: {'lr': 0.0001246952709119963, 'samples': 19340928, 'steps': 100733, 'loss/train': 1.3686343431472778} 11/07/2021 11:22:31 - INFO - __main__ - Step 100735: {'lr': 0.00012469067890546234, 'samples': 19341120, 'steps': 100734, 'loss/train': 1.821969985961914} 11/07/2021 11:22:31 - INFO - __main__ - Step 100736: {'lr': 0.00012468608695539085, 'samples': 19341312, 'steps': 100735, 'loss/train': 1.50774085521698} 11/07/2021 11:22:32 - INFO - __main__ - Step 100737: {'lr': 0.00012468149506178385, 'samples': 19341504, 'steps': 100736, 'loss/train': 1.418441653251648} 11/07/2021 11:22:32 - INFO - __main__ - Step 100738: {'lr': 0.00012467690322464349, 'samples': 19341696, 'steps': 100737, 'loss/train': 1.6571662425994873} 11/07/2021 11:22:33 - INFO - __main__ - Step 100739: {'lr': 0.00012467231144397173, 'samples': 19341888, 'steps': 100738, 'loss/train': 0.9195892214775085} 11/07/2021 11:22:33 - INFO - __main__ - Step 100740: {'lr': 0.0001246677197197707, 'samples': 19342080, 'steps': 100739, 'loss/train': 1.3962374925613403} 11/07/2021 11:22:33 - INFO - __main__ - Step 100741: {'lr': 0.00012466312805204248, 'samples': 19342272, 'steps': 100740, 'loss/train': 0.9302431344985962} 11/07/2021 11:22:34 - INFO - __main__ - Step 100742: {'lr': 0.0001246585364407891, 'samples': 19342464, 'steps': 100741, 'loss/train': 1.3744926452636719} 11/07/2021 11:22:35 - INFO - __main__ - Step 100743: {'lr': 0.00012465394488601265, 'samples': 19342656, 'steps': 100742, 'loss/train': 1.358878493309021} 11/07/2021 11:22:35 - INFO - __main__ - Step 100744: {'lr': 0.00012464935338771517, 'samples': 19342848, 'steps': 100743, 'loss/train': 1.393215298652649} 11/07/2021 11:22:36 - INFO - __main__ - Step 100745: {'lr': 0.00012464476194589883, 'samples': 19343040, 'steps': 100744, 'loss/train': 0.625195324420929} 11/07/2021 11:22:36 - INFO - __main__ - Step 100746: {'lr': 0.00012464017056056556, 'samples': 19343232, 'steps': 100745, 'loss/train': 0.8458629250526428} 11/07/2021 11:22:37 - INFO - __main__ - Step 100747: {'lr': 0.00012463557923171763, 'samples': 19343424, 'steps': 100746, 'loss/train': 1.8504990339279175} 11/07/2021 11:22:38 - INFO - __main__ - Step 100748: {'lr': 0.00012463098795935688, 'samples': 19343616, 'steps': 100747, 'loss/train': 1.22519052028656} 11/07/2021 11:22:38 - INFO - __main__ - Step 100749: {'lr': 0.00012462639674348545, 'samples': 19343808, 'steps': 100748, 'loss/train': 1.2033312320709229} 11/07/2021 11:22:38 - INFO - __main__ - Step 100750: {'lr': 0.00012462180558410544, 'samples': 19344000, 'steps': 100749, 'loss/train': 0.9587706327438354} 11/07/2021 11:22:39 - INFO - __main__ - Step 100751: {'lr': 0.0001246172144812189, 'samples': 19344192, 'steps': 100750, 'loss/train': 0.9626662731170654} 11/07/2021 11:22:40 - INFO - __main__ - Step 100752: {'lr': 0.0001246126234348279, 'samples': 19344384, 'steps': 100751, 'loss/train': 1.5016874074935913} 11/07/2021 11:22:40 - INFO - __main__ - Step 100753: {'lr': 0.00012460803244493455, 'samples': 19344576, 'steps': 100752, 'loss/train': 1.3073548078536987} 11/07/2021 11:22:40 - INFO - __main__ - Step 100754: {'lr': 0.00012460344151154088, 'samples': 19344768, 'steps': 100753, 'loss/train': 1.1301435232162476} 11/07/2021 11:22:41 - INFO - __main__ - Step 100755: {'lr': 0.00012459885063464894, 'samples': 19344960, 'steps': 100754, 'loss/train': 1.3400644063949585} 11/07/2021 11:22:41 - INFO - __main__ - Step 100756: {'lr': 0.00012459425981426085, 'samples': 19345152, 'steps': 100755, 'loss/train': 1.5780456066131592} 11/07/2021 11:22:41 - INFO - __main__ - Step 100757: {'lr': 0.00012458966905037864, 'samples': 19345344, 'steps': 100756, 'loss/train': 1.682469367980957} 11/07/2021 11:22:42 - INFO - __main__ - Step 100758: {'lr': 0.00012458507834300437, 'samples': 19345536, 'steps': 100757, 'loss/train': 1.1270774602890015} 11/07/2021 11:22:43 - INFO - __main__ - Step 100759: {'lr': 0.00012458048769214015, 'samples': 19345728, 'steps': 100758, 'loss/train': 1.5311429500579834} 11/07/2021 11:22:43 - INFO - __main__ - Step 100760: {'lr': 0.00012457589709778812, 'samples': 19345920, 'steps': 100759, 'loss/train': 1.2133023738861084} 11/07/2021 11:22:43 - INFO - __main__ - Step 100761: {'lr': 0.00012457130655995017, 'samples': 19346112, 'steps': 100760, 'loss/train': 1.6170716285705566} 11/07/2021 11:22:44 - INFO - __main__ - Step 100762: {'lr': 0.00012456671607862844, 'samples': 19346304, 'steps': 100761, 'loss/train': 1.2543604373931885} 11/07/2021 11:22:45 - INFO - __main__ - Step 100763: {'lr': 0.00012456212565382498, 'samples': 19346496, 'steps': 100762, 'loss/train': 1.718665361404419} 11/07/2021 11:22:45 - INFO - __main__ - Step 100764: {'lr': 0.00012455753528554196, 'samples': 19346688, 'steps': 100763, 'loss/train': 1.1514593362808228} 11/07/2021 11:22:46 - INFO - __main__ - Step 100765: {'lr': 0.00012455294497378132, 'samples': 19346880, 'steps': 100764, 'loss/train': 1.5372271537780762} 11/07/2021 11:22:46 - INFO - __main__ - Step 100766: {'lr': 0.00012454835471854521, 'samples': 19347072, 'steps': 100765, 'loss/train': 1.7105607986450195} 11/07/2021 11:22:46 - INFO - __main__ - Step 100767: {'lr': 0.00012454376451983567, 'samples': 19347264, 'steps': 100766, 'loss/train': 1.6285982131958008} 11/07/2021 11:22:47 - INFO - __main__ - Step 100768: {'lr': 0.0001245391743776548, 'samples': 19347456, 'steps': 100767, 'loss/train': 1.4043021202087402} 11/07/2021 11:22:48 - INFO - __main__ - Step 100769: {'lr': 0.00012453458429200463, 'samples': 19347648, 'steps': 100768, 'loss/train': 1.2641820907592773} 11/07/2021 11:22:48 - INFO - __main__ - Step 100770: {'lr': 0.00012452999426288723, 'samples': 19347840, 'steps': 100769, 'loss/train': 1.4915541410446167} 11/07/2021 11:22:48 - INFO - __main__ - Step 100771: {'lr': 0.0001245254042903047, 'samples': 19348032, 'steps': 100770, 'loss/train': 1.3669463396072388} 11/07/2021 11:22:49 - INFO - __main__ - Step 100772: {'lr': 0.00012452081437425906, 'samples': 19348224, 'steps': 100771, 'loss/train': 1.0574010610580444} 11/07/2021 11:22:50 - INFO - __main__ - Step 100773: {'lr': 0.0001245162245147525, 'samples': 19348416, 'steps': 100772, 'loss/train': 1.8647284507751465} 11/07/2021 11:22:50 - INFO - __main__ - Step 100774: {'lr': 0.0001245116347117869, 'samples': 19348608, 'steps': 100773, 'loss/train': 2.166860580444336} 11/07/2021 11:22:50 - INFO - __main__ - Step 100775: {'lr': 0.0001245070449653644, 'samples': 19348800, 'steps': 100774, 'loss/train': 1.1552445888519287} 11/07/2021 11:22:51 - INFO - __main__ - Step 100776: {'lr': 0.00012450245527548715, 'samples': 19348992, 'steps': 100775, 'loss/train': 1.1600522994995117} 11/07/2021 11:22:51 - INFO - __main__ - Step 100777: {'lr': 0.00012449786564215713, 'samples': 19349184, 'steps': 100776, 'loss/train': 0.6752856373786926} 11/07/2021 11:22:52 - INFO - __main__ - Step 100778: {'lr': 0.0001244932760653764, 'samples': 19349376, 'steps': 100777, 'loss/train': 1.1337370872497559} 11/07/2021 11:22:53 - INFO - __main__ - Step 100779: {'lr': 0.0001244886865451471, 'samples': 19349568, 'steps': 100778, 'loss/train': 1.1428333520889282} 11/07/2021 11:22:53 - INFO - __main__ - Step 100780: {'lr': 0.00012448409708147126, 'samples': 19349760, 'steps': 100779, 'loss/train': 1.0130465030670166} 11/07/2021 11:22:53 - INFO - __main__ - Step 100781: {'lr': 0.00012447950767435092, 'samples': 19349952, 'steps': 100780, 'loss/train': 1.4470858573913574} 11/07/2021 11:22:54 - INFO - __main__ - Step 100782: {'lr': 0.0001244749183237882, 'samples': 19350144, 'steps': 100781, 'loss/train': 1.4713517427444458} 11/07/2021 11:22:54 - INFO - __main__ - Step 100783: {'lr': 0.00012447032902978517, 'samples': 19350336, 'steps': 100782, 'loss/train': 1.7492226362228394} 11/07/2021 11:22:55 - INFO - __main__ - Step 100784: {'lr': 0.00012446573979234393, 'samples': 19350528, 'steps': 100783, 'loss/train': 1.7181638479232788} 11/07/2021 11:22:55 - INFO - __main__ - Step 100785: {'lr': 0.0001244611506114664, 'samples': 19350720, 'steps': 100784, 'loss/train': 1.5673068761825562} 11/07/2021 11:22:56 - INFO - __main__ - Step 100786: {'lr': 0.00012445656148715476, 'samples': 19350912, 'steps': 100785, 'loss/train': 1.547480583190918} 11/07/2021 11:22:56 - INFO - __main__ - Step 100787: {'lr': 0.00012445197241941103, 'samples': 19351104, 'steps': 100786, 'loss/train': 1.0306757688522339} 11/07/2021 11:22:56 - INFO - __main__ - Step 100788: {'lr': 0.0001244473834082373, 'samples': 19351296, 'steps': 100787, 'loss/train': 0.9413776397705078} 11/07/2021 11:22:57 - INFO - __main__ - Step 100789: {'lr': 0.00012444279445363566, 'samples': 19351488, 'steps': 100788, 'loss/train': 1.4152511358261108} 11/07/2021 11:22:58 - INFO - __main__ - Step 100790: {'lr': 0.00012443820555560817, 'samples': 19351680, 'steps': 100789, 'loss/train': 1.704057216644287} 11/07/2021 11:22:58 - INFO - __main__ - Step 100791: {'lr': 0.00012443361671415687, 'samples': 19351872, 'steps': 100790, 'loss/train': 1.2139861583709717} 11/07/2021 11:22:59 - INFO - __main__ - Step 100792: {'lr': 0.00012442902792928384, 'samples': 19352064, 'steps': 100791, 'loss/train': 0.9350465536117554} 11/07/2021 11:22:59 - INFO - __main__ - Step 100793: {'lr': 0.00012442443920099118, 'samples': 19352256, 'steps': 100792, 'loss/train': 1.4309396743774414} 11/07/2021 11:23:00 - INFO - __main__ - Step 100794: {'lr': 0.0001244198505292809, 'samples': 19352448, 'steps': 100793, 'loss/train': 1.4384156465530396} 11/07/2021 11:23:00 - INFO - __main__ - Step 100795: {'lr': 0.0001244152619141552, 'samples': 19352640, 'steps': 100794, 'loss/train': 1.4425697326660156} 11/07/2021 11:23:01 - INFO - __main__ - Step 100796: {'lr': 0.00012441067335561596, 'samples': 19352832, 'steps': 100795, 'loss/train': 1.4251142740249634} 11/07/2021 11:23:01 - INFO - __main__ - Step 100797: {'lr': 0.0001244060848536653, 'samples': 19353024, 'steps': 100796, 'loss/train': 1.3624476194381714} 11/07/2021 11:23:01 - INFO - __main__ - Step 100798: {'lr': 0.00012440149640830536, 'samples': 19353216, 'steps': 100797, 'loss/train': 1.339455008506775} 11/07/2021 11:23:02 - INFO - __main__ - Step 100799: {'lr': 0.00012439690801953815, 'samples': 19353408, 'steps': 100798, 'loss/train': 0.7365450859069824} 11/07/2021 11:23:03 - INFO - __main__ - Step 100800: {'lr': 0.00012439231968736574, 'samples': 19353600, 'steps': 100799, 'loss/train': 1.7160664796829224} 11/07/2021 11:23:03 - INFO - __main__ - Step 100801: {'lr': 0.00012438773141179024, 'samples': 19353792, 'steps': 100800, 'loss/train': 1.4173452854156494} 11/07/2021 11:23:03 - INFO - __main__ - Step 100802: {'lr': 0.0001243831431928137, 'samples': 19353984, 'steps': 100801, 'loss/train': 1.0978822708129883} 11/07/2021 11:23:04 - INFO - __main__ - Step 100803: {'lr': 0.00012437855503043813, 'samples': 19354176, 'steps': 100802, 'loss/train': 1.4194430112838745} 11/07/2021 11:23:05 - INFO - __main__ - Step 100804: {'lr': 0.00012437396692466568, 'samples': 19354368, 'steps': 100803, 'loss/train': 1.253901720046997} 11/07/2021 11:23:05 - INFO - __main__ - Step 100805: {'lr': 0.00012436937887549837, 'samples': 19354560, 'steps': 100804, 'loss/train': 1.318796157836914} 11/07/2021 11:23:05 - INFO - __main__ - Step 100806: {'lr': 0.0001243647908829384, 'samples': 19354752, 'steps': 100805, 'loss/train': 1.2825548648834229} 11/07/2021 11:23:06 - INFO - __main__ - Step 100807: {'lr': 0.00012436020294698757, 'samples': 19354944, 'steps': 100806, 'loss/train': 1.8905922174453735} 11/07/2021 11:23:06 - INFO - __main__ - Step 100808: {'lr': 0.00012435561506764814, 'samples': 19355136, 'steps': 100807, 'loss/train': 1.6168184280395508} 11/07/2021 11:23:07 - INFO - __main__ - Step 100809: {'lr': 0.00012435102724492211, 'samples': 19355328, 'steps': 100808, 'loss/train': 1.4683820009231567} 11/07/2021 11:23:07 - INFO - __main__ - Step 100810: {'lr': 0.00012434643947881158, 'samples': 19355520, 'steps': 100809, 'loss/train': 0.8399803638458252} 11/07/2021 11:23:08 - INFO - __main__ - Step 100811: {'lr': 0.00012434185176931858, 'samples': 19355712, 'steps': 100810, 'loss/train': 1.1399149894714355} 11/07/2021 11:23:08 - INFO - __main__ - Step 100812: {'lr': 0.0001243372641164452, 'samples': 19355904, 'steps': 100811, 'loss/train': 1.421855092048645} 11/07/2021 11:23:09 - INFO - __main__ - Step 100813: {'lr': 0.00012433267652019357, 'samples': 19356096, 'steps': 100812, 'loss/train': 0.9872385263442993} 11/07/2021 11:23:10 - INFO - __main__ - Step 100814: {'lr': 0.00012432808898056567, 'samples': 19356288, 'steps': 100813, 'loss/train': 1.4577745199203491} 11/07/2021 11:23:10 - INFO - __main__ - Step 100815: {'lr': 0.00012432350149756355, 'samples': 19356480, 'steps': 100814, 'loss/train': 1.0616358518600464} 11/07/2021 11:23:10 - INFO - __main__ - Step 100816: {'lr': 0.00012431891407118937, 'samples': 19356672, 'steps': 100815, 'loss/train': 1.5570052862167358} 11/07/2021 11:23:11 - INFO - __main__ - Step 100817: {'lr': 0.0001243143267014452, 'samples': 19356864, 'steps': 100816, 'loss/train': 1.139508605003357} 11/07/2021 11:23:11 - INFO - __main__ - Step 100818: {'lr': 0.00012430973938833302, 'samples': 19357056, 'steps': 100817, 'loss/train': 0.9441393613815308} 11/07/2021 11:23:11 - INFO - __main__ - Step 100819: {'lr': 0.0001243051521318549, 'samples': 19357248, 'steps': 100818, 'loss/train': 1.1271060705184937} 11/07/2021 11:23:12 - INFO - __main__ - Step 100820: {'lr': 0.0001243005649320129, 'samples': 19357440, 'steps': 100819, 'loss/train': 0.982189416885376} 11/07/2021 11:23:13 - INFO - __main__ - Step 100821: {'lr': 0.0001242959777888092, 'samples': 19357632, 'steps': 100820, 'loss/train': 0.24034327268600464} 11/07/2021 11:23:13 - INFO - __main__ - Step 100822: {'lr': 0.00012429139070224574, 'samples': 19357824, 'steps': 100821, 'loss/train': 1.3343807458877563} 11/07/2021 11:23:13 - INFO - __main__ - Step 100823: {'lr': 0.00012428680367232464, 'samples': 19358016, 'steps': 100822, 'loss/train': 1.200548768043518} 11/07/2021 11:23:14 - INFO - __main__ - Step 100824: {'lr': 0.000124282216699048, 'samples': 19358208, 'steps': 100823, 'loss/train': 0.05032085254788399} 11/07/2021 11:23:15 - INFO - __main__ - Step 100825: {'lr': 0.00012427762978241781, 'samples': 19358400, 'steps': 100824, 'loss/train': 1.0701831579208374} 11/07/2021 11:23:15 - INFO - __main__ - Step 100826: {'lr': 0.00012427304292243622, 'samples': 19358592, 'steps': 100825, 'loss/train': 0.6465757489204407} 11/07/2021 11:23:15 - INFO - __main__ - Step 100827: {'lr': 0.00012426845611910524, 'samples': 19358784, 'steps': 100826, 'loss/train': 1.5561997890472412} 11/07/2021 11:23:16 - INFO - __main__ - Step 100828: {'lr': 0.00012426386937242705, 'samples': 19358976, 'steps': 100827, 'loss/train': 1.6103545427322388} 11/07/2021 11:23:16 - INFO - __main__ - Step 100829: {'lr': 0.00012425928268240352, 'samples': 19359168, 'steps': 100828, 'loss/train': 1.740670919418335} 11/07/2021 11:23:17 - INFO - __main__ - Step 100830: {'lr': 0.00012425469604903681, 'samples': 19359360, 'steps': 100829, 'loss/train': 1.387315273284912} 11/07/2021 11:23:18 - INFO - __main__ - Step 100831: {'lr': 0.000124250109472329, 'samples': 19359552, 'steps': 100830, 'loss/train': 1.0874335765838623} 11/07/2021 11:23:18 - INFO - __main__ - Step 100832: {'lr': 0.00012424552295228216, 'samples': 19359744, 'steps': 100831, 'loss/train': 1.834947109222412} 11/07/2021 11:23:18 - INFO - __main__ - Step 100833: {'lr': 0.00012424093648889833, 'samples': 19359936, 'steps': 100832, 'loss/train': 0.9294775724411011} 11/07/2021 11:23:19 - INFO - __main__ - Step 100834: {'lr': 0.00012423635008217962, 'samples': 19360128, 'steps': 100833, 'loss/train': 1.7475941181182861} 11/07/2021 11:23:20 - INFO - __main__ - Step 100835: {'lr': 0.00012423176373212806, 'samples': 19360320, 'steps': 100834, 'loss/train': 1.1109586954116821} 11/07/2021 11:23:20 - INFO - __main__ - Step 100836: {'lr': 0.0001242271774387457, 'samples': 19360512, 'steps': 100835, 'loss/train': 1.4446711540222168} 11/07/2021 11:23:20 - INFO - __main__ - Step 100837: {'lr': 0.00012422259120203465, 'samples': 19360704, 'steps': 100836, 'loss/train': 1.5789209604263306} 11/07/2021 11:23:21 - INFO - __main__ - Step 100838: {'lr': 0.00012421800502199697, 'samples': 19360896, 'steps': 100837, 'loss/train': 1.6429297924041748} 11/07/2021 11:23:21 - INFO - __main__ - Step 100839: {'lr': 0.00012421341889863472, 'samples': 19361088, 'steps': 100838, 'loss/train': 1.1959969997406006} 11/07/2021 11:23:22 - INFO - __main__ - Step 100840: {'lr': 0.00012420883283194994, 'samples': 19361280, 'steps': 100839, 'loss/train': 1.4376301765441895} 11/07/2021 11:23:22 - INFO - __main__ - Step 100841: {'lr': 0.00012420424682194485, 'samples': 19361472, 'steps': 100840, 'loss/train': 1.8020806312561035} 11/07/2021 11:23:23 - INFO - __main__ - Step 100842: {'lr': 0.00012419966086862124, 'samples': 19361664, 'steps': 100841, 'loss/train': 1.39808189868927} 11/07/2021 11:23:23 - INFO - __main__ - Step 100843: {'lr': 0.00012419507497198138, 'samples': 19361856, 'steps': 100842, 'loss/train': 1.2363258600234985} 11/07/2021 11:23:24 - INFO - __main__ - Step 100844: {'lr': 0.00012419048913202724, 'samples': 19362048, 'steps': 100843, 'loss/train': 0.875817596912384} 11/07/2021 11:23:25 - INFO - __main__ - Step 100845: {'lr': 0.00012418590334876094, 'samples': 19362240, 'steps': 100844, 'loss/train': 1.426274299621582} 11/07/2021 11:23:25 - INFO - __main__ - Step 100846: {'lr': 0.0001241813176221845, 'samples': 19362432, 'steps': 100845, 'loss/train': 1.1213957071304321} 11/07/2021 11:23:25 - INFO - __main__ - Step 100847: {'lr': 0.00012417673195230002, 'samples': 19362624, 'steps': 100846, 'loss/train': 1.5214027166366577} 11/07/2021 11:23:26 - INFO - __main__ - Step 100848: {'lr': 0.00012417214633910962, 'samples': 19362816, 'steps': 100847, 'loss/train': 1.313966155052185} 11/07/2021 11:23:26 - INFO - __main__ - Step 100849: {'lr': 0.00012416756078261526, 'samples': 19363008, 'steps': 100848, 'loss/train': 1.4994348287582397} 11/07/2021 11:23:27 - INFO - __main__ - Step 100850: {'lr': 0.00012416297528281906, 'samples': 19363200, 'steps': 100849, 'loss/train': 0.672645628452301} 11/07/2021 11:23:27 - INFO - __main__ - Step 100851: {'lr': 0.00012415838983972308, 'samples': 19363392, 'steps': 100850, 'loss/train': 1.5875734090805054} 11/07/2021 11:23:28 - INFO - __main__ - Step 100852: {'lr': 0.00012415380445332942, 'samples': 19363584, 'steps': 100851, 'loss/train': 0.9894003868103027} 11/07/2021 11:23:28 - INFO - __main__ - Step 100853: {'lr': 0.00012414921912364007, 'samples': 19363776, 'steps': 100852, 'loss/train': 1.347084641456604} 11/07/2021 11:23:28 - INFO - __main__ - Step 100854: {'lr': 0.00012414463385065723, 'samples': 19363968, 'steps': 100853, 'loss/train': 1.3816511631011963} 11/07/2021 11:23:29 - INFO - __main__ - Step 100855: {'lr': 0.00012414004863438283, 'samples': 19364160, 'steps': 100854, 'loss/train': 1.3087624311447144} 11/07/2021 11:23:30 - INFO - __main__ - Step 100856: {'lr': 0.00012413546347481895, 'samples': 19364352, 'steps': 100855, 'loss/train': 0.6135240197181702} 11/07/2021 11:23:30 - INFO - __main__ - Step 100857: {'lr': 0.00012413087837196768, 'samples': 19364544, 'steps': 100856, 'loss/train': 0.7991507649421692} 11/07/2021 11:23:31 - INFO - __main__ - Step 100858: {'lr': 0.0001241262933258311, 'samples': 19364736, 'steps': 100857, 'loss/train': 1.2277441024780273} 11/07/2021 11:23:31 - INFO - __main__ - Step 100859: {'lr': 0.0001241217083364113, 'samples': 19364928, 'steps': 100858, 'loss/train': 1.2013975381851196} 11/07/2021 11:23:31 - INFO - __main__ - Step 100860: {'lr': 0.0001241171234037103, 'samples': 19365120, 'steps': 100859, 'loss/train': 1.2935534715652466} 11/07/2021 11:23:33 - INFO - __main__ - Step 100861: {'lr': 0.00012411253852773017, 'samples': 19365312, 'steps': 100860, 'loss/train': 0.5587404370307922} 11/07/2021 11:23:33 - INFO - __main__ - Step 100862: {'lr': 0.000124107953708473, 'samples': 19365504, 'steps': 100861, 'loss/train': 0.9746382236480713} 11/07/2021 11:23:33 - INFO - __main__ - Step 100863: {'lr': 0.00012410336894594083, 'samples': 19365696, 'steps': 100862, 'loss/train': 0.9832454323768616} 11/07/2021 11:23:34 - INFO - __main__ - Step 100864: {'lr': 0.00012409878424013573, 'samples': 19365888, 'steps': 100863, 'loss/train': 1.0840879678726196} 11/07/2021 11:23:34 - INFO - __main__ - Step 100865: {'lr': 0.0001240941995910598, 'samples': 19366080, 'steps': 100864, 'loss/train': 1.3789476156234741} 11/07/2021 11:23:35 - INFO - __main__ - Step 100866: {'lr': 0.00012408961499871506, 'samples': 19366272, 'steps': 100865, 'loss/train': 1.2277238368988037} 11/07/2021 11:23:35 - INFO - __main__ - Step 100867: {'lr': 0.00012408503046310363, 'samples': 19366464, 'steps': 100866, 'loss/train': 1.527236819267273} 11/07/2021 11:23:36 - INFO - __main__ - Step 100868: {'lr': 0.0001240804459842276, 'samples': 19366656, 'steps': 100867, 'loss/train': 1.5909299850463867} 11/07/2021 11:23:36 - INFO - __main__ - Step 100869: {'lr': 0.00012407586156208892, 'samples': 19366848, 'steps': 100868, 'loss/train': 1.4436904191970825} 11/07/2021 11:23:36 - INFO - __main__ - Step 100870: {'lr': 0.00012407127719668969, 'samples': 19367040, 'steps': 100869, 'loss/train': 1.2709051370620728} 11/07/2021 11:23:39 - INFO - __main__ - Step 100871: {'lr': 0.00012406669288803199, 'samples': 19367232, 'steps': 100870, 'loss/train': 0.7600957751274109} 11/07/2021 11:23:39 - INFO - __main__ - Step 100872: {'lr': 0.0001240621086361179, 'samples': 19367424, 'steps': 100871, 'loss/train': 1.214964747428894} 11/07/2021 11:23:40 - INFO - __main__ - Step 100873: {'lr': 0.0001240575244409495, 'samples': 19367616, 'steps': 100872, 'loss/train': 1.4132111072540283} 11/07/2021 11:23:40 - INFO - __main__ - Step 100874: {'lr': 0.0001240529403025288, 'samples': 19367808, 'steps': 100873, 'loss/train': 1.3951184749603271} 11/07/2021 11:23:40 - INFO - __main__ - Step 100875: {'lr': 0.00012404835622085793, 'samples': 19368000, 'steps': 100874, 'loss/train': 1.0843206644058228} 11/07/2021 11:23:41 - INFO - __main__ - Step 100876: {'lr': 0.00012404377219593892, 'samples': 19368192, 'steps': 100875, 'loss/train': 1.9092919826507568} 11/07/2021 11:23:41 - INFO - __main__ - Step 100877: {'lr': 0.00012403918822777386, 'samples': 19368384, 'steps': 100876, 'loss/train': 0.8570210933685303} 11/07/2021 11:23:41 - INFO - __main__ - Step 100878: {'lr': 0.00012403460431636477, 'samples': 19368576, 'steps': 100877, 'loss/train': 1.7510682344436646} 11/07/2021 11:23:42 - INFO - __main__ - Step 100879: {'lr': 0.00012403002046171377, 'samples': 19368768, 'steps': 100878, 'loss/train': 1.7625173330307007} 11/07/2021 11:23:43 - INFO - __main__ - Step 100880: {'lr': 0.00012402543666382288, 'samples': 19368960, 'steps': 100879, 'loss/train': 0.181128591299057} 11/07/2021 11:23:43 - INFO - __main__ - Step 100881: {'lr': 0.00012402085292269427, 'samples': 19369152, 'steps': 100880, 'loss/train': 1.6888482570648193} 11/07/2021 11:23:43 - INFO - __main__ - Step 100882: {'lr': 0.00012401626923832983, 'samples': 19369344, 'steps': 100881, 'loss/train': 1.1535791158676147} 11/07/2021 11:23:44 - INFO - __main__ - Step 100883: {'lr': 0.00012401168561073175, 'samples': 19369536, 'steps': 100882, 'loss/train': 1.4298293590545654} 11/07/2021 11:23:44 - INFO - __main__ - Step 100884: {'lr': 0.00012400710203990203, 'samples': 19369728, 'steps': 100883, 'loss/train': 1.3068846464157104} 11/07/2021 11:23:45 - INFO - __main__ - Step 100885: {'lr': 0.00012400251852584277, 'samples': 19369920, 'steps': 100884, 'loss/train': 1.360678791999817} 11/07/2021 11:23:45 - INFO - __main__ - Step 100886: {'lr': 0.00012399793506855602, 'samples': 19370112, 'steps': 100885, 'loss/train': 0.872994601726532} 11/07/2021 11:23:46 - INFO - __main__ - Step 100887: {'lr': 0.00012399335166804386, 'samples': 19370304, 'steps': 100886, 'loss/train': 1.3181447982788086} 11/07/2021 11:23:46 - INFO - __main__ - Step 100888: {'lr': 0.00012398876832430837, 'samples': 19370496, 'steps': 100887, 'loss/train': 1.4015183448791504} 11/07/2021 11:23:46 - INFO - __main__ - Step 100889: {'lr': 0.0001239841850373516, 'samples': 19370688, 'steps': 100888, 'loss/train': 1.3266847133636475} 11/07/2021 11:23:47 - INFO - __main__ - Step 100890: {'lr': 0.00012397960180717557, 'samples': 19370880, 'steps': 100889, 'loss/train': 1.0048167705535889} 11/07/2021 11:23:48 - INFO - __main__ - Step 100891: {'lr': 0.00012397501863378244, 'samples': 19371072, 'steps': 100890, 'loss/train': 1.3734694719314575} 11/07/2021 11:23:48 - INFO - __main__ - Step 100892: {'lr': 0.00012397043551717418, 'samples': 19371264, 'steps': 100891, 'loss/train': 1.4816036224365234} 11/07/2021 11:23:49 - INFO - __main__ - Step 100893: {'lr': 0.0001239658524573529, 'samples': 19371456, 'steps': 100892, 'loss/train': 1.293898582458496} 11/07/2021 11:23:49 - INFO - __main__ - Step 100894: {'lr': 0.0001239612694543208, 'samples': 19371648, 'steps': 100893, 'loss/train': 1.2780381441116333} 11/07/2021 11:23:50 - INFO - __main__ - Step 100895: {'lr': 0.00012395668650807968, 'samples': 19371840, 'steps': 100894, 'loss/train': 1.2928389310836792} 11/07/2021 11:23:50 - INFO - __main__ - Step 100896: {'lr': 0.00012395210361863172, 'samples': 19372032, 'steps': 100895, 'loss/train': 1.59525465965271} 11/07/2021 11:23:51 - INFO - __main__ - Step 100897: {'lr': 0.00012394752078597902, 'samples': 19372224, 'steps': 100896, 'loss/train': 1.1552672386169434} 11/07/2021 11:23:51 - INFO - __main__ - Step 100898: {'lr': 0.0001239429380101236, 'samples': 19372416, 'steps': 100897, 'loss/train': 1.2730035781860352} 11/07/2021 11:23:51 - INFO - __main__ - Step 100899: {'lr': 0.00012393835529106757, 'samples': 19372608, 'steps': 100898, 'loss/train': 1.0699580907821655} 11/07/2021 11:23:52 - INFO - __main__ - Step 100900: {'lr': 0.00012393377262881296, 'samples': 19372800, 'steps': 100899, 'loss/train': 0.9854642748832703} 11/07/2021 11:23:53 - INFO - __main__ - Step 100901: {'lr': 0.00012392919002336184, 'samples': 19372992, 'steps': 100900, 'loss/train': 1.1870864629745483} 11/07/2021 11:23:53 - INFO - __main__ - Step 100902: {'lr': 0.00012392460747471628, 'samples': 19373184, 'steps': 100901, 'loss/train': 1.338470220565796} 11/07/2021 11:23:53 - INFO - __main__ - Step 100903: {'lr': 0.00012392002498287836, 'samples': 19373376, 'steps': 100902, 'loss/train': 0.7509637475013733} 11/07/2021 11:23:54 - INFO - __main__ - Step 100904: {'lr': 0.0001239154425478501, 'samples': 19373568, 'steps': 100903, 'loss/train': 1.2675825357437134} 11/07/2021 11:23:55 - INFO - __main__ - Step 100905: {'lr': 0.00012391086016963365, 'samples': 19373760, 'steps': 100904, 'loss/train': 1.2297288179397583} 11/07/2021 11:23:55 - INFO - __main__ - Step 100906: {'lr': 0.00012390627784823098, 'samples': 19373952, 'steps': 100905, 'loss/train': 1.072208046913147} 11/07/2021 11:23:55 - INFO - __main__ - Step 100907: {'lr': 0.00012390169558364422, 'samples': 19374144, 'steps': 100906, 'loss/train': 1.566315770149231} 11/07/2021 11:23:56 - INFO - __main__ - Step 100908: {'lr': 0.0001238971133758755, 'samples': 19374336, 'steps': 100907, 'loss/train': 1.4367316961288452} 11/07/2021 11:23:56 - INFO - __main__ - Step 100909: {'lr': 0.0001238925312249267, 'samples': 19374528, 'steps': 100908, 'loss/train': 1.2471667528152466} 11/07/2021 11:23:57 - INFO - __main__ - Step 100910: {'lr': 0.00012388794913079996, 'samples': 19374720, 'steps': 100909, 'loss/train': 0.5476423501968384} 11/07/2021 11:23:58 - INFO - __main__ - Step 100911: {'lr': 0.00012388336709349737, 'samples': 19374912, 'steps': 100910, 'loss/train': 1.616876482963562} 11/07/2021 11:23:58 - INFO - __main__ - Step 100912: {'lr': 0.000123878785113021, 'samples': 19375104, 'steps': 100911, 'loss/train': 1.3758981227874756} 11/07/2021 11:23:58 - INFO - __main__ - Step 100913: {'lr': 0.0001238742031893729, 'samples': 19375296, 'steps': 100912, 'loss/train': 1.7632100582122803} 11/07/2021 11:23:59 - INFO - __main__ - Step 100914: {'lr': 0.00012386962132255515, 'samples': 19375488, 'steps': 100913, 'loss/train': 0.633457362651825} 11/07/2021 11:23:59 - INFO - __main__ - Step 100915: {'lr': 0.00012386503951256978, 'samples': 19375680, 'steps': 100914, 'loss/train': 1.3253756761550903} 11/07/2021 11:24:00 - INFO - __main__ - Step 100916: {'lr': 0.0001238604577594189, 'samples': 19375872, 'steps': 100915, 'loss/train': 1.2676864862442017} 11/07/2021 11:24:00 - INFO - __main__ - Step 100917: {'lr': 0.00012385587606310452, 'samples': 19376064, 'steps': 100916, 'loss/train': 1.643951654434204} 11/07/2021 11:24:01 - INFO - __main__ - Step 100918: {'lr': 0.00012385129442362878, 'samples': 19376256, 'steps': 100917, 'loss/train': 1.296842098236084} 11/07/2021 11:24:01 - INFO - __main__ - Step 100919: {'lr': 0.00012384671284099366, 'samples': 19376448, 'steps': 100918, 'loss/train': 1.1884866952896118} 11/07/2021 11:24:02 - INFO - __main__ - Step 100920: {'lr': 0.0001238421313152013, 'samples': 19376640, 'steps': 100919, 'loss/train': 0.12427399307489395} 11/07/2021 11:24:03 - INFO - __main__ - Step 100921: {'lr': 0.00012383754984625377, 'samples': 19376832, 'steps': 100920, 'loss/train': 0.9360403418540955} 11/07/2021 11:24:03 - INFO - __main__ - Step 100922: {'lr': 0.00012383296843415304, 'samples': 19377024, 'steps': 100921, 'loss/train': 1.1205503940582275} 11/07/2021 11:24:03 - INFO - __main__ - Step 100923: {'lr': 0.0001238283870789012, 'samples': 19377216, 'steps': 100922, 'loss/train': 1.3578407764434814} 11/07/2021 11:24:04 - INFO - __main__ - Step 100924: {'lr': 0.00012382380578050036, 'samples': 19377408, 'steps': 100923, 'loss/train': 1.1578923463821411} 11/07/2021 11:24:04 - INFO - __main__ - Step 100925: {'lr': 0.0001238192245389526, 'samples': 19377600, 'steps': 100924, 'loss/train': 1.1647037267684937} 11/07/2021 11:24:05 - INFO - __main__ - Step 100926: {'lr': 0.0001238146433542599, 'samples': 19377792, 'steps': 100925, 'loss/train': 1.6630936861038208} 11/07/2021 11:24:05 - INFO - __main__ - Step 100927: {'lr': 0.0001238100622264244, 'samples': 19377984, 'steps': 100926, 'loss/train': 0.8802040219306946} 11/07/2021 11:24:06 - INFO - __main__ - Step 100928: {'lr': 0.00012380548115544814, 'samples': 19378176, 'steps': 100927, 'loss/train': 1.49116849899292} 11/07/2021 11:24:06 - INFO - __main__ - Step 100929: {'lr': 0.00012380090014133316, 'samples': 19378368, 'steps': 100928, 'loss/train': 1.4770269393920898} 11/07/2021 11:24:06 - INFO - __main__ - Step 100930: {'lr': 0.00012379631918408156, 'samples': 19378560, 'steps': 100929, 'loss/train': 1.2286046743392944} 11/07/2021 11:24:07 - INFO - __main__ - Step 100931: {'lr': 0.00012379173828369538, 'samples': 19378752, 'steps': 100930, 'loss/train': 1.345186710357666} 11/07/2021 11:24:08 - INFO - __main__ - Step 100932: {'lr': 0.0001237871574401767, 'samples': 19378944, 'steps': 100931, 'loss/train': 1.0929968357086182} 11/07/2021 11:24:08 - INFO - __main__ - Step 100933: {'lr': 0.0001237825766535276, 'samples': 19379136, 'steps': 100932, 'loss/train': 1.1312841176986694} 11/07/2021 11:24:08 - INFO - __main__ - Step 100934: {'lr': 0.00012377799592375012, 'samples': 19379328, 'steps': 100933, 'loss/train': 1.3377472162246704} 11/07/2021 11:24:09 - INFO - __main__ - Step 100935: {'lr': 0.0001237734152508464, 'samples': 19379520, 'steps': 100934, 'loss/train': 1.040326714515686} 11/07/2021 11:24:10 - INFO - __main__ - Step 100936: {'lr': 0.00012376883463481833, 'samples': 19379712, 'steps': 100935, 'loss/train': 1.2255698442459106} 11/07/2021 11:24:10 - INFO - __main__ - Step 100937: {'lr': 0.00012376425407566811, 'samples': 19379904, 'steps': 100936, 'loss/train': 1.0356863737106323} 11/07/2021 11:24:11 - INFO - __main__ - Step 100938: {'lr': 0.00012375967357339775, 'samples': 19380096, 'steps': 100937, 'loss/train': 0.7204293012619019} 11/07/2021 11:24:11 - INFO - __main__ - Step 100939: {'lr': 0.00012375509312800934, 'samples': 19380288, 'steps': 100938, 'loss/train': 1.5089302062988281} 11/07/2021 11:24:11 - INFO - __main__ - Step 100940: {'lr': 0.0001237505127395049, 'samples': 19380480, 'steps': 100939, 'loss/train': 1.2544991970062256} 11/07/2021 11:24:12 - INFO - __main__ - Step 100941: {'lr': 0.00012374593240788658, 'samples': 19380672, 'steps': 100940, 'loss/train': 1.3061274290084839} 11/07/2021 11:24:13 - INFO - __main__ - Step 100942: {'lr': 0.00012374135213315637, 'samples': 19380864, 'steps': 100941, 'loss/train': 1.4526660442352295} 11/07/2021 11:24:13 - INFO - __main__ - Step 100943: {'lr': 0.00012373677191531638, 'samples': 19381056, 'steps': 100942, 'loss/train': 1.3112258911132812} 11/07/2021 11:24:13 - INFO - __main__ - Step 100944: {'lr': 0.0001237321917543686, 'samples': 19381248, 'steps': 100943, 'loss/train': 1.117801547050476} 11/07/2021 11:24:14 - INFO - __main__ - Step 100945: {'lr': 0.0001237276116503152, 'samples': 19381440, 'steps': 100944, 'loss/train': 1.6611802577972412} 11/07/2021 11:24:14 - INFO - __main__ - Step 100946: {'lr': 0.00012372303160315817, 'samples': 19381632, 'steps': 100945, 'loss/train': 1.352929711341858} 11/07/2021 11:24:15 - INFO - __main__ - Step 100947: {'lr': 0.0001237184516128996, 'samples': 19381824, 'steps': 100946, 'loss/train': 1.2248053550720215} 11/07/2021 11:24:15 - INFO - __main__ - Step 100948: {'lr': 0.00012371387167954166, 'samples': 19382016, 'steps': 100947, 'loss/train': 1.9153258800506592} 11/07/2021 11:24:16 - INFO - __main__ - Step 100949: {'lr': 0.00012370929180308617, 'samples': 19382208, 'steps': 100948, 'loss/train': 1.0016943216323853} 11/07/2021 11:24:16 - INFO - __main__ - Step 100950: {'lr': 0.00012370471198353534, 'samples': 19382400, 'steps': 100949, 'loss/train': 1.6378109455108643} 11/07/2021 11:24:16 - INFO - __main__ - Step 100951: {'lr': 0.00012370013222089122, 'samples': 19382592, 'steps': 100950, 'loss/train': 1.9177656173706055} 11/07/2021 11:24:18 - INFO - __main__ - Step 100952: {'lr': 0.0001236955525151559, 'samples': 19382784, 'steps': 100951, 'loss/train': 0.906964898109436} 11/07/2021 11:24:18 - INFO - __main__ - Step 100953: {'lr': 0.00012369097286633136, 'samples': 19382976, 'steps': 100952, 'loss/train': 1.0383532047271729} 11/07/2021 11:24:18 - INFO - __main__ - Step 100954: {'lr': 0.00012368639327441975, 'samples': 19383168, 'steps': 100953, 'loss/train': 1.6048253774642944} 11/07/2021 11:24:19 - INFO - __main__ - Step 100955: {'lr': 0.0001236818137394231, 'samples': 19383360, 'steps': 100954, 'loss/train': 1.363606333732605} 11/07/2021 11:24:19 - INFO - __main__ - Step 100956: {'lr': 0.00012367723426134344, 'samples': 19383552, 'steps': 100955, 'loss/train': 1.4633402824401855} 11/07/2021 11:24:20 - INFO - __main__ - Step 100957: {'lr': 0.00012367265484018288, 'samples': 19383744, 'steps': 100956, 'loss/train': 1.1106756925582886} 11/07/2021 11:24:20 - INFO - __main__ - Step 100958: {'lr': 0.00012366807547594354, 'samples': 19383936, 'steps': 100957, 'loss/train': 0.9621509909629822} 11/07/2021 11:24:21 - INFO - __main__ - Step 100959: {'lr': 0.00012366349616862735, 'samples': 19384128, 'steps': 100958, 'loss/train': 1.3786094188690186} 11/07/2021 11:24:21 - INFO - __main__ - Step 100960: {'lr': 0.00012365891691823645, 'samples': 19384320, 'steps': 100959, 'loss/train': 1.3454402685165405} 11/07/2021 11:24:21 - INFO - __main__ - Step 100961: {'lr': 0.00012365433772477288, 'samples': 19384512, 'steps': 100960, 'loss/train': 0.9910200238227844} 11/07/2021 11:24:22 - INFO - __main__ - Step 100962: {'lr': 0.00012364975858823884, 'samples': 19384704, 'steps': 100961, 'loss/train': 1.0215650796890259} 11/07/2021 11:24:23 - INFO - __main__ - Step 100963: {'lr': 0.00012364517950863615, 'samples': 19384896, 'steps': 100962, 'loss/train': 1.2992273569107056} 11/07/2021 11:24:23 - INFO - __main__ - Step 100964: {'lr': 0.000123640600485967, 'samples': 19385088, 'steps': 100963, 'loss/train': 1.3447680473327637} 11/07/2021 11:24:24 - INFO - __main__ - Step 100965: {'lr': 0.00012363602152023348, 'samples': 19385280, 'steps': 100964, 'loss/train': 1.4009175300598145} 11/07/2021 11:24:24 - INFO - __main__ - Step 100966: {'lr': 0.00012363144261143757, 'samples': 19385472, 'steps': 100965, 'loss/train': 1.4657591581344604} 11/07/2021 11:24:25 - INFO - __main__ - Step 100967: {'lr': 0.0001236268637595814, 'samples': 19385664, 'steps': 100966, 'loss/train': 1.432775616645813} 11/07/2021 11:24:25 - INFO - __main__ - Step 100968: {'lr': 0.00012362228496466703, 'samples': 19385856, 'steps': 100967, 'loss/train': 1.2718968391418457} 11/07/2021 11:24:26 - INFO - __main__ - Step 100969: {'lr': 0.0001236177062266965, 'samples': 19386048, 'steps': 100968, 'loss/train': 1.3284324407577515} 11/07/2021 11:24:26 - INFO - __main__ - Step 100970: {'lr': 0.00012361312754567187, 'samples': 19386240, 'steps': 100969, 'loss/train': 0.8932575583457947} 11/07/2021 11:24:26 - INFO - __main__ - Step 100971: {'lr': 0.00012360854892159523, 'samples': 19386432, 'steps': 100970, 'loss/train': 1.9352235794067383} 11/07/2021 11:24:27 - INFO - __main__ - Step 100972: {'lr': 0.0001236039703544686, 'samples': 19386624, 'steps': 100971, 'loss/train': 1.320311188697815} 11/07/2021 11:24:28 - INFO - __main__ - Step 100973: {'lr': 0.0001235993918442941, 'samples': 19386816, 'steps': 100972, 'loss/train': 1.2456300258636475} 11/07/2021 11:24:28 - INFO - __main__ - Step 100974: {'lr': 0.00012359481339107377, 'samples': 19387008, 'steps': 100973, 'loss/train': 1.4477320909500122} 11/07/2021 11:24:28 - INFO - __main__ - Step 100975: {'lr': 0.00012359023499480972, 'samples': 19387200, 'steps': 100974, 'loss/train': 1.1330276727676392} 11/07/2021 11:24:29 - INFO - __main__ - Step 100976: {'lr': 0.0001235856566555039, 'samples': 19387392, 'steps': 100975, 'loss/train': 1.6509379148483276} 11/07/2021 11:24:30 - INFO - __main__ - Step 100977: {'lr': 0.0001235810783731584, 'samples': 19387584, 'steps': 100976, 'loss/train': 1.1300523281097412} 11/07/2021 11:24:30 - INFO - __main__ - Step 100978: {'lr': 0.00012357650014777535, 'samples': 19387776, 'steps': 100977, 'loss/train': 1.3555777072906494} 11/07/2021 11:24:30 - INFO - __main__ - Step 100979: {'lr': 0.00012357192197935677, 'samples': 19387968, 'steps': 100978, 'loss/train': 1.65764582157135} 11/07/2021 11:24:31 - INFO - __main__ - Step 100980: {'lr': 0.0001235673438679047, 'samples': 19388160, 'steps': 100979, 'loss/train': 1.264717698097229} 11/07/2021 11:24:31 - INFO - __main__ - Step 100981: {'lr': 0.00012356276581342127, 'samples': 19388352, 'steps': 100980, 'loss/train': 1.6091142892837524} 11/07/2021 11:24:32 - INFO - __main__ - Step 100982: {'lr': 0.0001235581878159085, 'samples': 19388544, 'steps': 100981, 'loss/train': 0.9377366900444031} 11/07/2021 11:24:33 - INFO - __main__ - Step 100983: {'lr': 0.00012355360987536846, 'samples': 19388736, 'steps': 100982, 'loss/train': 1.432859182357788} 11/07/2021 11:24:33 - INFO - __main__ - Step 100984: {'lr': 0.0001235490319918032, 'samples': 19388928, 'steps': 100983, 'loss/train': 0.9410620331764221} 11/07/2021 11:24:33 - INFO - __main__ - Step 100985: {'lr': 0.0001235444541652148, 'samples': 19389120, 'steps': 100984, 'loss/train': 0.4208478033542633} 11/07/2021 11:24:34 - INFO - __main__ - Step 100986: {'lr': 0.00012353987639560532, 'samples': 19389312, 'steps': 100985, 'loss/train': 1.278558373451233} 11/07/2021 11:24:34 - INFO - __main__ - Step 100987: {'lr': 0.00012353529868297685, 'samples': 19389504, 'steps': 100986, 'loss/train': 1.194886326789856} 11/07/2021 11:24:35 - INFO - __main__ - Step 100988: {'lr': 0.00012353072102733138, 'samples': 19389696, 'steps': 100987, 'loss/train': 1.3738093376159668} 11/07/2021 11:24:35 - INFO - __main__ - Step 100989: {'lr': 0.00012352614342867114, 'samples': 19389888, 'steps': 100988, 'loss/train': 1.306220531463623} 11/07/2021 11:24:36 - INFO - __main__ - Step 100990: {'lr': 0.00012352156588699796, 'samples': 19390080, 'steps': 100989, 'loss/train': 1.1224075555801392} 11/07/2021 11:24:36 - INFO - __main__ - Step 100991: {'lr': 0.000123516988402314, 'samples': 19390272, 'steps': 100990, 'loss/train': 1.2469922304153442} 11/07/2021 11:24:37 - INFO - __main__ - Step 100992: {'lr': 0.00012351241097462132, 'samples': 19390464, 'steps': 100991, 'loss/train': 1.8231850862503052} 11/07/2021 11:24:38 - INFO - __main__ - Step 100993: {'lr': 0.000123507833603922, 'samples': 19390656, 'steps': 100992, 'loss/train': 1.5751792192459106} 11/07/2021 11:24:38 - INFO - __main__ - Step 100994: {'lr': 0.00012350325629021815, 'samples': 19390848, 'steps': 100993, 'loss/train': 1.6766897439956665} 11/07/2021 11:24:38 - INFO - __main__ - Step 100995: {'lr': 0.00012349867903351173, 'samples': 19391040, 'steps': 100994, 'loss/train': 1.028996229171753} 11/07/2021 11:24:39 - INFO - __main__ - Step 100996: {'lr': 0.00012349410183380488, 'samples': 19391232, 'steps': 100995, 'loss/train': 1.5374937057495117} 11/07/2021 11:24:39 - INFO - __main__ - Step 100997: {'lr': 0.0001234895246910996, 'samples': 19391424, 'steps': 100996, 'loss/train': 0.8482545614242554} 11/07/2021 11:24:40 - INFO - __main__ - Step 100998: {'lr': 0.00012348494760539802, 'samples': 19391616, 'steps': 100997, 'loss/train': 1.3090623617172241} 11/07/2021 11:24:41 - INFO - __main__ - Step 100999: {'lr': 0.00012348037057670217, 'samples': 19391808, 'steps': 100998, 'loss/train': 1.2294365167617798} 11/07/2021 11:24:41 - INFO - __main__ - Step 101000: {'lr': 0.0001234757936050141, 'samples': 19392000, 'steps': 100999, 'loss/train': 1.491008996963501} 11/07/2021 11:24:41 - INFO - __main__ - Step 101001: {'lr': 0.0001234712166903359, 'samples': 19392192, 'steps': 101000, 'loss/train': 1.1657425165176392} 11/07/2021 11:24:42 - INFO - __main__ - Step 101002: {'lr': 0.0001234666398326697, 'samples': 19392384, 'steps': 101001, 'loss/train': 1.0371068716049194} 11/07/2021 11:24:43 - INFO - __main__ - Step 101003: {'lr': 0.0001234620630320174, 'samples': 19392576, 'steps': 101002, 'loss/train': 1.0176554918289185} 11/07/2021 11:24:43 - INFO - __main__ - Step 101004: {'lr': 0.00012345748628838114, 'samples': 19392768, 'steps': 101003, 'loss/train': 1.179574728012085} 11/07/2021 11:24:44 - INFO - __main__ - Step 101005: {'lr': 0.00012345290960176294, 'samples': 19392960, 'steps': 101004, 'loss/train': 1.2481472492218018} 11/07/2021 11:24:44 - INFO - __main__ - Step 101006: {'lr': 0.00012344833297216496, 'samples': 19393152, 'steps': 101005, 'loss/train': 1.1538740396499634} 11/07/2021 11:24:44 - INFO - __main__ - Step 101007: {'lr': 0.0001234437563995892, 'samples': 19393344, 'steps': 101006, 'loss/train': 1.3307300806045532} 11/07/2021 11:24:45 - INFO - __main__ - Step 101008: {'lr': 0.0001234391798840377, 'samples': 19393536, 'steps': 101007, 'loss/train': 1.5135107040405273} 11/07/2021 11:24:46 - INFO - __main__ - Step 101009: {'lr': 0.00012343460342551259, 'samples': 19393728, 'steps': 101008, 'loss/train': 1.1055783033370972} 11/07/2021 11:24:46 - INFO - __main__ - Step 101010: {'lr': 0.00012343002702401584, 'samples': 19393920, 'steps': 101009, 'loss/train': 1.9361051321029663} 11/07/2021 11:24:46 - INFO - __main__ - Step 101011: {'lr': 0.00012342545067954965, 'samples': 19394112, 'steps': 101010, 'loss/train': 1.4514005184173584} 11/07/2021 11:24:47 - INFO - __main__ - Step 101012: {'lr': 0.0001234208743921159, 'samples': 19394304, 'steps': 101011, 'loss/train': 1.2968463897705078} 11/07/2021 11:24:49 - INFO - __main__ - Step 101013: {'lr': 0.0001234162981617168, 'samples': 19394496, 'steps': 101012, 'loss/train': 0.9418055415153503} 11/07/2021 11:24:49 - INFO - __main__ - Step 101014: {'lr': 0.00012341172198835438, 'samples': 19394688, 'steps': 101013, 'loss/train': 1.3907924890518188} 11/07/2021 11:24:50 - INFO - __main__ - Step 101015: {'lr': 0.00012340714587203078, 'samples': 19394880, 'steps': 101014, 'loss/train': 1.1215565204620361} 11/07/2021 11:24:50 - INFO - __main__ - Step 101016: {'lr': 0.00012340256981274787, 'samples': 19395072, 'steps': 101015, 'loss/train': 1.3779529333114624} 11/07/2021 11:24:50 - INFO - __main__ - Step 101017: {'lr': 0.0001233979938105078, 'samples': 19395264, 'steps': 101016, 'loss/train': 1.761574625968933} 11/07/2021 11:24:51 - INFO - __main__ - Step 101018: {'lr': 0.0001233934178653126, 'samples': 19395456, 'steps': 101017, 'loss/train': 1.279549479484558} 11/07/2021 11:24:51 - INFO - __main__ - Step 101019: {'lr': 0.00012338884197716441, 'samples': 19395648, 'steps': 101018, 'loss/train': 0.6046305894851685} 11/07/2021 11:24:51 - INFO - __main__ - Step 101020: {'lr': 0.00012338426614606527, 'samples': 19395840, 'steps': 101019, 'loss/train': 0.5748765468597412} 11/07/2021 11:24:52 - INFO - __main__ - Step 101021: {'lr': 0.0001233796903720172, 'samples': 19396032, 'steps': 101020, 'loss/train': 0.5280951857566833} 11/07/2021 11:24:53 - INFO - __main__ - Step 101022: {'lr': 0.0001233751146550223, 'samples': 19396224, 'steps': 101021, 'loss/train': 1.463753581047058} 11/07/2021 11:24:53 - INFO - __main__ - Step 101023: {'lr': 0.0001233705389950826, 'samples': 19396416, 'steps': 101022, 'loss/train': 0.6799025535583496} 11/07/2021 11:24:53 - INFO - __main__ - Step 101024: {'lr': 0.0001233659633922002, 'samples': 19396608, 'steps': 101023, 'loss/train': 1.3963996171951294} 11/07/2021 11:24:54 - INFO - __main__ - Step 101025: {'lr': 0.00012336138784637714, 'samples': 19396800, 'steps': 101024, 'loss/train': 1.3855838775634766} 11/07/2021 11:24:55 - INFO - __main__ - Step 101026: {'lr': 0.00012335681235761548, 'samples': 19396992, 'steps': 101025, 'loss/train': 1.3290833234786987} 11/07/2021 11:24:55 - INFO - __main__ - Step 101027: {'lr': 0.0001233522369259173, 'samples': 19397184, 'steps': 101026, 'loss/train': 1.2451189756393433} 11/07/2021 11:24:56 - INFO - __main__ - Step 101028: {'lr': 0.00012334766155128462, 'samples': 19397376, 'steps': 101027, 'loss/train': 1.5353847742080688} 11/07/2021 11:24:56 - INFO - __main__ - Step 101029: {'lr': 0.00012334308623371964, 'samples': 19397568, 'steps': 101028, 'loss/train': 1.5031373500823975} 11/07/2021 11:24:56 - INFO - __main__ - Step 101030: {'lr': 0.00012333851097322423, 'samples': 19397760, 'steps': 101029, 'loss/train': 1.2725337743759155} 11/07/2021 11:24:57 - INFO - __main__ - Step 101031: {'lr': 0.0001233339357698005, 'samples': 19397952, 'steps': 101030, 'loss/train': 1.1661778688430786} 11/07/2021 11:24:58 - INFO - __main__ - Step 101032: {'lr': 0.00012332936062345057, 'samples': 19398144, 'steps': 101031, 'loss/train': 1.122286081314087} 11/07/2021 11:24:58 - INFO - __main__ - Step 101033: {'lr': 0.00012332478553417648, 'samples': 19398336, 'steps': 101032, 'loss/train': 1.4330142736434937} 11/07/2021 11:24:58 - INFO - __main__ - Step 101034: {'lr': 0.00012332021050198027, 'samples': 19398528, 'steps': 101033, 'loss/train': 1.496986985206604} 11/07/2021 11:24:59 - INFO - __main__ - Step 101035: {'lr': 0.00012331563552686403, 'samples': 19398720, 'steps': 101034, 'loss/train': 1.0491373538970947} 11/07/2021 11:25:00 - INFO - __main__ - Step 101036: {'lr': 0.0001233110606088298, 'samples': 19398912, 'steps': 101035, 'loss/train': 4.53875207901001} 11/07/2021 11:25:00 - INFO - __main__ - Step 101037: {'lr': 0.00012330648574787964, 'samples': 19399104, 'steps': 101036, 'loss/train': 1.462079405784607} 11/07/2021 11:25:00 - INFO - __main__ - Step 101038: {'lr': 0.00012330191094401567, 'samples': 19399296, 'steps': 101037, 'loss/train': 1.5491865873336792} 11/07/2021 11:25:01 - INFO - __main__ - Step 101039: {'lr': 0.00012329733619723986, 'samples': 19399488, 'steps': 101038, 'loss/train': 1.4131757020950317} 11/07/2021 11:25:01 - INFO - __main__ - Step 101040: {'lr': 0.0001232927615075543, 'samples': 19399680, 'steps': 101039, 'loss/train': 1.376551866531372} 11/07/2021 11:25:02 - INFO - __main__ - Step 101041: {'lr': 0.0001232881868749611, 'samples': 19399872, 'steps': 101040, 'loss/train': 1.1917078495025635} 11/07/2021 11:25:03 - INFO - __main__ - Step 101042: {'lr': 0.0001232836122994624, 'samples': 19400064, 'steps': 101041, 'loss/train': 1.1695894002914429} 11/07/2021 11:25:03 - INFO - __main__ - Step 101043: {'lr': 0.00012327903778106, 'samples': 19400256, 'steps': 101042, 'loss/train': 1.6759567260742188} 11/07/2021 11:25:03 - INFO - __main__ - Step 101044: {'lr': 0.00012327446331975616, 'samples': 19400448, 'steps': 101043, 'loss/train': 0.07513879984617233} 11/07/2021 11:25:04 - INFO - __main__ - Step 101045: {'lr': 0.0001232698889155529, 'samples': 19400640, 'steps': 101044, 'loss/train': 1.4574638605117798} 11/07/2021 11:25:04 - INFO - __main__ - Step 101046: {'lr': 0.0001232653145684522, 'samples': 19400832, 'steps': 101045, 'loss/train': 1.0700314044952393} 11/07/2021 11:25:05 - INFO - __main__ - Step 101047: {'lr': 0.00012326074027845625, 'samples': 19401024, 'steps': 101046, 'loss/train': 1.1945794820785522} 11/07/2021 11:25:06 - INFO - __main__ - Step 101048: {'lr': 0.00012325616604556705, 'samples': 19401216, 'steps': 101047, 'loss/train': 1.148174524307251} 11/07/2021 11:25:06 - INFO - __main__ - Step 101049: {'lr': 0.00012325159186978666, 'samples': 19401408, 'steps': 101048, 'loss/train': 0.9929760694503784} 11/07/2021 11:25:06 - INFO - __main__ - Step 101050: {'lr': 0.00012324701775111714, 'samples': 19401600, 'steps': 101049, 'loss/train': 0.16593840718269348} 11/07/2021 11:25:07 - INFO - __main__ - Step 101051: {'lr': 0.00012324244368956057, 'samples': 19401792, 'steps': 101050, 'loss/train': 1.1906460523605347} 11/07/2021 11:25:08 - INFO - __main__ - Step 101052: {'lr': 0.00012323786968511898, 'samples': 19401984, 'steps': 101051, 'loss/train': 1.3941667079925537} 11/07/2021 11:25:08 - INFO - __main__ - Step 101053: {'lr': 0.00012323329573779454, 'samples': 19402176, 'steps': 101052, 'loss/train': 1.0055392980575562} 11/07/2021 11:25:08 - INFO - __main__ - Step 101054: {'lr': 0.00012322872184758916, 'samples': 19402368, 'steps': 101053, 'loss/train': 1.499235987663269} 11/07/2021 11:25:09 - INFO - __main__ - Step 101055: {'lr': 0.00012322414801450493, 'samples': 19402560, 'steps': 101054, 'loss/train': 1.5559340715408325} 11/07/2021 11:25:09 - INFO - __main__ - Step 101056: {'lr': 0.00012321957423854396, 'samples': 19402752, 'steps': 101055, 'loss/train': 1.0083836317062378} 11/07/2021 11:25:10 - INFO - __main__ - Step 101057: {'lr': 0.0001232150005197083, 'samples': 19402944, 'steps': 101056, 'loss/train': 0.775593101978302} 11/07/2021 11:25:10 - INFO - __main__ - Step 101058: {'lr': 0.000123210426858, 'samples': 19403136, 'steps': 101057, 'loss/train': 1.4224143028259277} 11/07/2021 11:25:11 - INFO - __main__ - Step 101059: {'lr': 0.0001232058532534211, 'samples': 19403328, 'steps': 101058, 'loss/train': 1.3674229383468628} 11/07/2021 11:25:11 - INFO - __main__ - Step 101060: {'lr': 0.00012320127970597372, 'samples': 19403520, 'steps': 101059, 'loss/train': 1.303696870803833} 11/07/2021 11:25:11 - INFO - __main__ - Step 101061: {'lr': 0.00012319670621565988, 'samples': 19403712, 'steps': 101060, 'loss/train': 0.954888105392456} 11/07/2021 11:25:12 - INFO - __main__ - Step 101062: {'lr': 0.00012319213278248162, 'samples': 19403904, 'steps': 101061, 'loss/train': 1.243297815322876} 11/07/2021 11:25:13 - INFO - __main__ - Step 101063: {'lr': 0.00012318755940644106, 'samples': 19404096, 'steps': 101062, 'loss/train': 0.5518665313720703} 11/07/2021 11:25:13 - INFO - __main__ - Step 101064: {'lr': 0.00012318298608754032, 'samples': 19404288, 'steps': 101063, 'loss/train': 1.3589632511138916} 11/07/2021 11:25:13 - INFO - __main__ - Step 101065: {'lr': 0.00012317841282578123, 'samples': 19404480, 'steps': 101064, 'loss/train': 1.4208481311798096} 11/07/2021 11:25:14 - INFO - __main__ - Step 101066: {'lr': 0.00012317383962116604, 'samples': 19404672, 'steps': 101065, 'loss/train': 1.2124651670455933} 11/07/2021 11:25:15 - INFO - __main__ - Step 101067: {'lr': 0.00012316926647369675, 'samples': 19404864, 'steps': 101066, 'loss/train': 1.4535208940505981} 11/07/2021 11:25:15 - INFO - __main__ - Step 101068: {'lr': 0.00012316469338337544, 'samples': 19405056, 'steps': 101067, 'loss/train': 1.3484869003295898} 11/07/2021 11:25:15 - INFO - __main__ - Step 101069: {'lr': 0.00012316012035020415, 'samples': 19405248, 'steps': 101068, 'loss/train': 1.4881798028945923} 11/07/2021 11:25:16 - INFO - __main__ - Step 101070: {'lr': 0.00012315554737418494, 'samples': 19405440, 'steps': 101069, 'loss/train': 1.4282864332199097} 11/07/2021 11:25:16 - INFO - __main__ - Step 101071: {'lr': 0.0001231509744553199, 'samples': 19405632, 'steps': 101070, 'loss/train': 0.946509599685669} 11/07/2021 11:25:17 - INFO - __main__ - Step 101072: {'lr': 0.00012314640159361107, 'samples': 19405824, 'steps': 101071, 'loss/train': 0.49485746026039124} 11/07/2021 11:25:18 - INFO - __main__ - Step 101073: {'lr': 0.00012314182878906053, 'samples': 19406016, 'steps': 101072, 'loss/train': 0.7440809607505798} 11/07/2021 11:25:18 - INFO - __main__ - Step 101074: {'lr': 0.0001231372560416703, 'samples': 19406208, 'steps': 101073, 'loss/train': 1.586151123046875} 11/07/2021 11:25:19 - INFO - __main__ - Step 101075: {'lr': 0.00012313268335144257, 'samples': 19406400, 'steps': 101074, 'loss/train': 1.2028634548187256} 11/07/2021 11:25:19 - INFO - __main__ - Step 101076: {'lr': 0.0001231281107183792, 'samples': 19406592, 'steps': 101075, 'loss/train': 1.2668862342834473} 11/07/2021 11:25:20 - INFO - __main__ - Step 101077: {'lr': 0.00012312353814248234, 'samples': 19406784, 'steps': 101076, 'loss/train': 1.554170846939087} 11/07/2021 11:25:20 - INFO - __main__ - Step 101078: {'lr': 0.00012311896562375405, 'samples': 19406976, 'steps': 101077, 'loss/train': 1.6028900146484375} 11/07/2021 11:25:20 - INFO - __main__ - Step 101079: {'lr': 0.00012311439316219642, 'samples': 19407168, 'steps': 101078, 'loss/train': 1.2968031167984009} 11/07/2021 11:25:21 - INFO - __main__ - Step 101080: {'lr': 0.00012310982075781148, 'samples': 19407360, 'steps': 101079, 'loss/train': 1.2412669658660889} 11/07/2021 11:25:21 - INFO - __main__ - Step 101081: {'lr': 0.0001231052484106013, 'samples': 19407552, 'steps': 101080, 'loss/train': 0.951116681098938} 11/07/2021 11:25:22 - INFO - __main__ - Step 101082: {'lr': 0.0001231006761205679, 'samples': 19407744, 'steps': 101081, 'loss/train': 1.2734335660934448} 11/07/2021 11:25:23 - INFO - __main__ - Step 101083: {'lr': 0.0001230961038877134, 'samples': 19407936, 'steps': 101082, 'loss/train': 1.8784774541854858} 11/07/2021 11:25:23 - INFO - __main__ - Step 101084: {'lr': 0.00012309153171203985, 'samples': 19408128, 'steps': 101083, 'loss/train': 1.50171959400177} 11/07/2021 11:25:23 - INFO - __main__ - Step 101085: {'lr': 0.0001230869595935493, 'samples': 19408320, 'steps': 101084, 'loss/train': 1.5703177452087402} 11/07/2021 11:25:24 - INFO - __main__ - Step 101086: {'lr': 0.00012308238753224387, 'samples': 19408512, 'steps': 101085, 'loss/train': 1.274253010749817} 11/07/2021 11:25:24 - INFO - __main__ - Step 101087: {'lr': 0.0001230778155281255, 'samples': 19408704, 'steps': 101086, 'loss/train': 1.4444211721420288} 11/07/2021 11:25:25 - INFO - __main__ - Step 101088: {'lr': 0.00012307324358119628, 'samples': 19408896, 'steps': 101087, 'loss/train': 0.9242881536483765} 11/07/2021 11:25:25 - INFO - __main__ - Step 101089: {'lr': 0.0001230686716914583, 'samples': 19409088, 'steps': 101088, 'loss/train': 1.1277236938476562} 11/07/2021 11:25:26 - INFO - __main__ - Step 101090: {'lr': 0.00012306409985891363, 'samples': 19409280, 'steps': 101089, 'loss/train': 1.5940223932266235} 11/07/2021 11:25:26 - INFO - __main__ - Step 101091: {'lr': 0.00012305952808356433, 'samples': 19409472, 'steps': 101090, 'loss/train': 1.5563995838165283} 11/07/2021 11:25:27 - INFO - __main__ - Step 101092: {'lr': 0.00012305495636541242, 'samples': 19409664, 'steps': 101091, 'loss/train': 1.2844457626342773} 11/07/2021 11:25:28 - INFO - __main__ - Step 101093: {'lr': 0.00012305038470446, 'samples': 19409856, 'steps': 101092, 'loss/train': 0.6638543009757996} 11/07/2021 11:25:28 - INFO - __main__ - Step 101094: {'lr': 0.00012304581310070912, 'samples': 19410048, 'steps': 101093, 'loss/train': 0.952721357345581} 11/07/2021 11:25:29 - INFO - __main__ - Step 101095: {'lr': 0.00012304124155416182, 'samples': 19410240, 'steps': 101094, 'loss/train': 0.20187200605869293} 11/07/2021 11:25:29 - INFO - __main__ - Step 101096: {'lr': 0.0001230366700648202, 'samples': 19410432, 'steps': 101095, 'loss/train': 1.8681073188781738} 11/07/2021 11:25:29 - INFO - __main__ - Step 101097: {'lr': 0.00012303209863268638, 'samples': 19410624, 'steps': 101096, 'loss/train': 1.5422295331954956} 11/07/2021 11:25:30 - INFO - __main__ - Step 101098: {'lr': 0.00012302752725776224, 'samples': 19410816, 'steps': 101097, 'loss/train': 1.6373928785324097} 11/07/2021 11:25:31 - INFO - __main__ - Step 101099: {'lr': 0.00012302295594004997, 'samples': 19411008, 'steps': 101098, 'loss/train': 1.2941604852676392} 11/07/2021 11:25:31 - INFO - __main__ - Step 101100: {'lr': 0.00012301838467955155, 'samples': 19411200, 'steps': 101099, 'loss/train': 1.1627202033996582} 11/07/2021 11:25:31 - INFO - __main__ - Step 101101: {'lr': 0.00012301381347626912, 'samples': 19411392, 'steps': 101100, 'loss/train': 1.086397409439087} 11/07/2021 11:25:32 - INFO - __main__ - Step 101102: {'lr': 0.0001230092423302047, 'samples': 19411584, 'steps': 101101, 'loss/train': 1.4785364866256714} 11/07/2021 11:25:33 - INFO - __main__ - Step 101103: {'lr': 0.00012300467124136034, 'samples': 19411776, 'steps': 101102, 'loss/train': 1.506253957748413} 11/07/2021 11:25:33 - INFO - __main__ - Step 101104: {'lr': 0.0001230001002097381, 'samples': 19411968, 'steps': 101103, 'loss/train': 1.4004981517791748} 11/07/2021 11:25:33 - INFO - __main__ - Step 101105: {'lr': 0.0001229955292353401, 'samples': 19412160, 'steps': 101104, 'loss/train': 1.2961472272872925} 11/07/2021 11:25:34 - INFO - __main__ - Step 101106: {'lr': 0.00012299095831816835, 'samples': 19412352, 'steps': 101105, 'loss/train': 1.2648273706436157} 11/07/2021 11:25:34 - INFO - __main__ - Step 101107: {'lr': 0.00012298638745822488, 'samples': 19412544, 'steps': 101106, 'loss/train': 1.4179465770721436} 11/07/2021 11:25:35 - INFO - __main__ - Step 101108: {'lr': 0.0001229818166555118, 'samples': 19412736, 'steps': 101107, 'loss/train': 1.7470250129699707} 11/07/2021 11:25:36 - INFO - __main__ - Step 101109: {'lr': 0.0001229772459100312, 'samples': 19412928, 'steps': 101108, 'loss/train': 0.9628351926803589} 11/07/2021 11:25:36 - INFO - __main__ - Step 101110: {'lr': 0.00012297267522178512, 'samples': 19413120, 'steps': 101109, 'loss/train': 1.3098143339157104} 11/07/2021 11:25:36 - INFO - __main__ - Step 101111: {'lr': 0.0001229681045907755, 'samples': 19413312, 'steps': 101110, 'loss/train': 1.5222183465957642} 11/07/2021 11:25:37 - INFO - __main__ - Step 101112: {'lr': 0.00012296353401700452, 'samples': 19413504, 'steps': 101111, 'loss/train': 1.534665584564209} 11/07/2021 11:25:37 - INFO - __main__ - Step 101113: {'lr': 0.00012295896350047423, 'samples': 19413696, 'steps': 101112, 'loss/train': 1.5810965299606323} 11/07/2021 11:25:38 - INFO - __main__ - Step 101114: {'lr': 0.00012295439304118664, 'samples': 19413888, 'steps': 101113, 'loss/train': 1.3497761487960815} 11/07/2021 11:25:39 - INFO - __main__ - Step 101115: {'lr': 0.00012294982263914383, 'samples': 19414080, 'steps': 101114, 'loss/train': 0.07158996909856796} 11/07/2021 11:25:39 - INFO - __main__ - Step 101116: {'lr': 0.00012294525229434788, 'samples': 19414272, 'steps': 101115, 'loss/train': 0.05828341469168663} 11/07/2021 11:25:40 - INFO - __main__ - Step 101117: {'lr': 0.00012294068200680087, 'samples': 19414464, 'steps': 101116, 'loss/train': 1.0973765850067139} 11/07/2021 11:25:40 - INFO - __main__ - Step 101118: {'lr': 0.0001229361117765048, 'samples': 19414656, 'steps': 101117, 'loss/train': 1.2876300811767578} 11/07/2021 11:25:40 - INFO - __main__ - Step 101119: {'lr': 0.00012293154160346174, 'samples': 19414848, 'steps': 101118, 'loss/train': 0.8111661076545715} 11/07/2021 11:25:41 - INFO - __main__ - Step 101120: {'lr': 0.0001229269714876738, 'samples': 19415040, 'steps': 101119, 'loss/train': 1.7941445112228394} 11/07/2021 11:25:42 - INFO - __main__ - Step 101121: {'lr': 0.000122922401429143, 'samples': 19415232, 'steps': 101120, 'loss/train': 1.7985575199127197} 11/07/2021 11:25:42 - INFO - __main__ - Step 101122: {'lr': 0.00012291783142787138, 'samples': 19415424, 'steps': 101121, 'loss/train': 1.738523244857788} 11/07/2021 11:25:42 - INFO - __main__ - Step 101123: {'lr': 0.00012291326148386114, 'samples': 19415616, 'steps': 101122, 'loss/train': 1.4922082424163818} 11/07/2021 11:25:43 - INFO - __main__ - Step 101124: {'lr': 0.00012290869159711413, 'samples': 19415808, 'steps': 101123, 'loss/train': 0.7211400866508484} 11/07/2021 11:25:44 - INFO - __main__ - Step 101125: {'lr': 0.0001229041217676325, 'samples': 19416000, 'steps': 101124, 'loss/train': 1.5801281929016113} 11/07/2021 11:25:44 - INFO - __main__ - Step 101126: {'lr': 0.0001228995519954183, 'samples': 19416192, 'steps': 101125, 'loss/train': 1.982628583908081} 11/07/2021 11:25:45 - INFO - __main__ - Step 101127: {'lr': 0.00012289498228047361, 'samples': 19416384, 'steps': 101126, 'loss/train': 0.8924160003662109} 11/07/2021 11:25:45 - INFO - __main__ - Step 101128: {'lr': 0.00012289041262280047, 'samples': 19416576, 'steps': 101127, 'loss/train': 1.29863440990448} 11/07/2021 11:25:45 - INFO - __main__ - Step 101129: {'lr': 0.00012288584302240098, 'samples': 19416768, 'steps': 101128, 'loss/train': 1.369320273399353} 11/07/2021 11:25:46 - INFO - __main__ - Step 101130: {'lr': 0.00012288127347927712, 'samples': 19416960, 'steps': 101129, 'loss/train': 1.0787321329116821} 11/07/2021 11:25:47 - INFO - __main__ - Step 101131: {'lr': 0.00012287670399343102, 'samples': 19417152, 'steps': 101130, 'loss/train': 1.2912522554397583} 11/07/2021 11:25:47 - INFO - __main__ - Step 101132: {'lr': 0.0001228721345648647, 'samples': 19417344, 'steps': 101131, 'loss/train': 1.5248675346374512} 11/07/2021 11:25:47 - INFO - __main__ - Step 101133: {'lr': 0.00012286756519358028, 'samples': 19417536, 'steps': 101132, 'loss/train': 1.332856297492981} 11/07/2021 11:25:48 - INFO - __main__ - Step 101134: {'lr': 0.00012286299587957973, 'samples': 19417728, 'steps': 101133, 'loss/train': 1.5314805507659912} 11/07/2021 11:25:48 - INFO - __main__ - Step 101135: {'lr': 0.00012285842662286518, 'samples': 19417920, 'steps': 101134, 'loss/train': 1.3297075033187866} 11/07/2021 11:25:49 - INFO - __main__ - Step 101136: {'lr': 0.00012285385742343875, 'samples': 19418112, 'steps': 101135, 'loss/train': 0.6978374123573303} 11/07/2021 11:25:49 - INFO - __main__ - Step 101137: {'lr': 0.0001228492882813023, 'samples': 19418304, 'steps': 101136, 'loss/train': 2.3628273010253906} 11/07/2021 11:25:50 - INFO - __main__ - Step 101138: {'lr': 0.000122844719196458, 'samples': 19418496, 'steps': 101137, 'loss/train': 1.7012813091278076} 11/07/2021 11:25:50 - INFO - __main__ - Step 101139: {'lr': 0.0001228401501689079, 'samples': 19418688, 'steps': 101138, 'loss/train': 1.5085904598236084} 11/07/2021 11:25:50 - INFO - __main__ - Step 101140: {'lr': 0.0001228355811986541, 'samples': 19418880, 'steps': 101139, 'loss/train': 1.1147241592407227} 11/07/2021 11:25:52 - INFO - __main__ - Step 101141: {'lr': 0.00012283101228569859, 'samples': 19419072, 'steps': 101140, 'loss/train': 1.3938943147659302} 11/07/2021 11:25:52 - INFO - __main__ - Step 101142: {'lr': 0.00012282644343004348, 'samples': 19419264, 'steps': 101141, 'loss/train': 0.8841127753257751} 11/07/2021 11:25:52 - INFO - __main__ - Step 101143: {'lr': 0.0001228218746316908, 'samples': 19419456, 'steps': 101142, 'loss/train': 1.790623426437378} 11/07/2021 11:25:53 - INFO - __main__ - Step 101144: {'lr': 0.00012281730589064262, 'samples': 19419648, 'steps': 101143, 'loss/train': 1.4045161008834839} 11/07/2021 11:25:53 - INFO - __main__ - Step 101145: {'lr': 0.00012281273720690102, 'samples': 19419840, 'steps': 101144, 'loss/train': 1.1124836206436157} 11/07/2021 11:25:53 - INFO - __main__ - Step 101146: {'lr': 0.00012280816858046802, 'samples': 19420032, 'steps': 101145, 'loss/train': 1.4528666734695435} 11/07/2021 11:25:54 - INFO - __main__ - Step 101147: {'lr': 0.00012280360001134573, 'samples': 19420224, 'steps': 101146, 'loss/train': 1.5984073877334595} 11/07/2021 11:25:55 - INFO - __main__ - Step 101148: {'lr': 0.00012279903149953615, 'samples': 19420416, 'steps': 101147, 'loss/train': 1.539445400238037} 11/07/2021 11:25:55 - INFO - __main__ - Step 101149: {'lr': 0.00012279446304504135, 'samples': 19420608, 'steps': 101148, 'loss/train': 1.5173977613449097} 11/07/2021 11:25:55 - INFO - __main__ - Step 101150: {'lr': 0.00012278989464786352, 'samples': 19420800, 'steps': 101149, 'loss/train': 1.4856493473052979} 11/07/2021 11:25:56 - INFO - __main__ - Step 101151: {'lr': 0.00012278532630800447, 'samples': 19420992, 'steps': 101150, 'loss/train': 1.1233985424041748} 11/07/2021 11:25:57 - INFO - __main__ - Step 101152: {'lr': 0.00012278075802546647, 'samples': 19421184, 'steps': 101151, 'loss/train': 1.6590189933776855} 11/07/2021 11:25:58 - INFO - __main__ - Step 101153: {'lr': 0.0001227761898002514, 'samples': 19421376, 'steps': 101152, 'loss/train': 1.2825342416763306} 11/07/2021 11:25:58 - INFO - __main__ - Step 101154: {'lr': 0.00012277162163236148, 'samples': 19421568, 'steps': 101153, 'loss/train': 1.021215796470642} 11/07/2021 11:25:58 - INFO - __main__ - Step 101155: {'lr': 0.00012276705352179867, 'samples': 19421760, 'steps': 101154, 'loss/train': 1.842391014099121} 11/07/2021 11:25:59 - INFO - __main__ - Step 101156: {'lr': 0.0001227624854685651, 'samples': 19421952, 'steps': 101155, 'loss/train': 1.0546869039535522} 11/07/2021 11:26:00 - INFO - __main__ - Step 101157: {'lr': 0.00012275791747266273, 'samples': 19422144, 'steps': 101156, 'loss/train': 1.4206103086471558} 11/07/2021 11:26:00 - INFO - __main__ - Step 101158: {'lr': 0.00012275334953409372, 'samples': 19422336, 'steps': 101157, 'loss/train': 1.4165072441101074} 11/07/2021 11:26:00 - INFO - __main__ - Step 101159: {'lr': 0.0001227487816528601, 'samples': 19422528, 'steps': 101158, 'loss/train': 1.494648814201355} 11/07/2021 11:26:01 - INFO - __main__ - Step 101160: {'lr': 0.00012274421382896388, 'samples': 19422720, 'steps': 101159, 'loss/train': 1.0176959037780762} 11/07/2021 11:26:01 - INFO - __main__ - Step 101161: {'lr': 0.00012273964606240718, 'samples': 19422912, 'steps': 101160, 'loss/train': 1.2973239421844482} 11/07/2021 11:26:02 - INFO - __main__ - Step 101162: {'lr': 0.00012273507835319203, 'samples': 19423104, 'steps': 101161, 'loss/train': 0.846174955368042} 11/07/2021 11:26:02 - INFO - __main__ - Step 101163: {'lr': 0.00012273051070132057, 'samples': 19423296, 'steps': 101162, 'loss/train': 1.824385643005371} 11/07/2021 11:26:03 - INFO - __main__ - Step 101164: {'lr': 0.0001227259431067947, 'samples': 19423488, 'steps': 101163, 'loss/train': 1.9680858850479126} 11/07/2021 11:26:03 - INFO - __main__ - Step 101165: {'lr': 0.00012272137556961654, 'samples': 19423680, 'steps': 101164, 'loss/train': 1.3907363414764404} 11/07/2021 11:26:03 - INFO - __main__ - Step 101166: {'lr': 0.00012271680808978815, 'samples': 19423872, 'steps': 101165, 'loss/train': 1.4342036247253418} 11/07/2021 11:26:04 - INFO - __main__ - Step 101167: {'lr': 0.00012271224066731163, 'samples': 19424064, 'steps': 101166, 'loss/train': 1.3718867301940918} 11/07/2021 11:26:05 - INFO - __main__ - Step 101168: {'lr': 0.00012270767330218902, 'samples': 19424256, 'steps': 101167, 'loss/train': 1.0462911128997803} 11/07/2021 11:26:05 - INFO - __main__ - Step 101169: {'lr': 0.00012270310599442233, 'samples': 19424448, 'steps': 101168, 'loss/train': 1.3631781339645386} 11/07/2021 11:26:06 - INFO - __main__ - Step 101170: {'lr': 0.00012269853874401367, 'samples': 19424640, 'steps': 101169, 'loss/train': 1.6306991577148438} 11/07/2021 11:26:06 - INFO - __main__ - Step 101171: {'lr': 0.00012269397155096508, 'samples': 19424832, 'steps': 101170, 'loss/train': 1.600846529006958} 11/07/2021 11:26:07 - INFO - __main__ - Step 101172: {'lr': 0.00012268940441527865, 'samples': 19425024, 'steps': 101171, 'loss/train': 1.0634068250656128} 11/07/2021 11:26:07 - INFO - __main__ - Step 101173: {'lr': 0.0001226848373369564, 'samples': 19425216, 'steps': 101172, 'loss/train': 1.5077860355377197} 11/07/2021 11:26:08 - INFO - __main__ - Step 101174: {'lr': 0.00012268027031600036, 'samples': 19425408, 'steps': 101173, 'loss/train': 0.9721436500549316} 11/07/2021 11:26:08 - INFO - __main__ - Step 101175: {'lr': 0.00012267570335241268, 'samples': 19425600, 'steps': 101174, 'loss/train': 1.4825137853622437} 11/07/2021 11:26:08 - INFO - __main__ - Step 101176: {'lr': 0.00012267113644619536, 'samples': 19425792, 'steps': 101175, 'loss/train': 1.1404653787612915} 11/07/2021 11:26:09 - INFO - __main__ - Step 101177: {'lr': 0.0001226665695973505, 'samples': 19425984, 'steps': 101176, 'loss/train': 1.6528396606445312} 11/07/2021 11:26:10 - INFO - __main__ - Step 101178: {'lr': 0.0001226620028058801, 'samples': 19426176, 'steps': 101177, 'loss/train': 0.9242750406265259} 11/07/2021 11:26:10 - INFO - __main__ - Step 101179: {'lr': 0.00012265743607178616, 'samples': 19426368, 'steps': 101178, 'loss/train': 1.1657816171646118} 11/07/2021 11:26:10 - INFO - __main__ - Step 101180: {'lr': 0.00012265286939507086, 'samples': 19426560, 'steps': 101179, 'loss/train': 1.3812332153320312} 11/07/2021 11:26:11 - INFO - __main__ - Step 101181: {'lr': 0.00012264830277573623, 'samples': 19426752, 'steps': 101180, 'loss/train': 1.261064052581787} 11/07/2021 11:26:11 - INFO - __main__ - Step 101182: {'lr': 0.00012264373621378424, 'samples': 19426944, 'steps': 101181, 'loss/train': 0.7107449769973755} 11/07/2021 11:26:12 - INFO - __main__ - Step 101183: {'lr': 0.00012263916970921707, 'samples': 19427136, 'steps': 101182, 'loss/train': 1.5221978425979614} 11/07/2021 11:26:13 - INFO - __main__ - Step 101184: {'lr': 0.0001226346032620367, 'samples': 19427328, 'steps': 101183, 'loss/train': 1.4384034872055054} 11/07/2021 11:26:13 - INFO - __main__ - Step 101185: {'lr': 0.00012263003687224526, 'samples': 19427520, 'steps': 101184, 'loss/train': 1.183737874031067} 11/07/2021 11:26:13 - INFO - __main__ - Step 101186: {'lr': 0.0001226254705398447, 'samples': 19427712, 'steps': 101185, 'loss/train': 1.763066291809082} 11/07/2021 11:26:14 - INFO - __main__ - Step 101187: {'lr': 0.00012262090426483718, 'samples': 19427904, 'steps': 101186, 'loss/train': 0.9274158477783203} 11/07/2021 11:26:15 - INFO - __main__ - Step 101188: {'lr': 0.0001226163380472247, 'samples': 19428096, 'steps': 101187, 'loss/train': 2.5921013355255127} 11/07/2021 11:26:15 - INFO - __main__ - Step 101189: {'lr': 0.00012261177188700932, 'samples': 19428288, 'steps': 101188, 'loss/train': 1.5328885316848755} 11/07/2021 11:26:15 - INFO - __main__ - Step 101190: {'lr': 0.0001226072057841932, 'samples': 19428480, 'steps': 101189, 'loss/train': 1.51652991771698} 11/07/2021 11:26:16 - INFO - __main__ - Step 101191: {'lr': 0.00012260263973877826, 'samples': 19428672, 'steps': 101190, 'loss/train': 1.255226731300354} 11/07/2021 11:26:16 - INFO - __main__ - Step 101192: {'lr': 0.00012259807375076656, 'samples': 19428864, 'steps': 101191, 'loss/train': 1.3803253173828125} 11/07/2021 11:26:17 - INFO - __main__ - Step 101193: {'lr': 0.00012259350782016021, 'samples': 19429056, 'steps': 101192, 'loss/train': 1.2510195970535278} 11/07/2021 11:26:18 - INFO - __main__ - Step 101194: {'lr': 0.00012258894194696127, 'samples': 19429248, 'steps': 101193, 'loss/train': 1.539054274559021} 11/07/2021 11:26:18 - INFO - __main__ - Step 101195: {'lr': 0.0001225843761311718, 'samples': 19429440, 'steps': 101194, 'loss/train': 1.2933895587921143} 11/07/2021 11:26:18 - INFO - __main__ - Step 101196: {'lr': 0.00012257981037279382, 'samples': 19429632, 'steps': 101195, 'loss/train': 1.6280089616775513} 11/07/2021 11:26:19 - INFO - __main__ - Step 101197: {'lr': 0.0001225752446718294, 'samples': 19429824, 'steps': 101196, 'loss/train': 1.7523319721221924} 11/07/2021 11:26:20 - INFO - __main__ - Step 101198: {'lr': 0.0001225706790282806, 'samples': 19430016, 'steps': 101197, 'loss/train': 1.6798473596572876} 11/07/2021 11:26:20 - INFO - __main__ - Step 101199: {'lr': 0.00012256611344214956, 'samples': 19430208, 'steps': 101198, 'loss/train': 1.3637406826019287} 11/07/2021 11:26:21 - INFO - __main__ - Step 101200: {'lr': 0.00012256154791343818, 'samples': 19430400, 'steps': 101199, 'loss/train': 1.441860318183899} 11/07/2021 11:26:21 - INFO - __main__ - Step 101201: {'lr': 0.00012255698244214864, 'samples': 19430592, 'steps': 101200, 'loss/train': 1.3610256910324097} 11/07/2021 11:26:22 - INFO - __main__ - Step 101202: {'lr': 0.00012255241702828295, 'samples': 19430784, 'steps': 101201, 'loss/train': 1.4584577083587646} 11/07/2021 11:26:22 - INFO - __main__ - Step 101203: {'lr': 0.0001225478516718432, 'samples': 19430976, 'steps': 101202, 'loss/train': 1.6261252164840698} 11/07/2021 11:26:22 - INFO - __main__ - Step 101204: {'lr': 0.00012254328637283148, 'samples': 19431168, 'steps': 101203, 'loss/train': 1.5397460460662842} 11/07/2021 11:26:23 - INFO - __main__ - Step 101205: {'lr': 0.0001225387211312497, 'samples': 19431360, 'steps': 101204, 'loss/train': 0.8801894187927246} 11/07/2021 11:26:24 - INFO - __main__ - Step 101206: {'lr': 0.0001225341559471, 'samples': 19431552, 'steps': 101205, 'loss/train': 1.3686389923095703} 11/07/2021 11:26:24 - INFO - __main__ - Step 101207: {'lr': 0.00012252959082038444, 'samples': 19431744, 'steps': 101206, 'loss/train': 1.4649724960327148} 11/07/2021 11:26:24 - INFO - __main__ - Step 101208: {'lr': 0.00012252502575110512, 'samples': 19431936, 'steps': 101207, 'loss/train': 1.299210548400879} 11/07/2021 11:26:25 - INFO - __main__ - Step 101209: {'lr': 0.000122520460739264, 'samples': 19432128, 'steps': 101208, 'loss/train': 1.3618206977844238} 11/07/2021 11:26:26 - INFO - __main__ - Step 101210: {'lr': 0.0001225158957848632, 'samples': 19432320, 'steps': 101209, 'loss/train': 1.2846951484680176} 11/07/2021 11:26:26 - INFO - __main__ - Step 101211: {'lr': 0.0001225113308879048, 'samples': 19432512, 'steps': 101210, 'loss/train': 1.4302319288253784} 11/07/2021 11:26:27 - INFO - __main__ - Step 101212: {'lr': 0.00012250676604839083, 'samples': 19432704, 'steps': 101211, 'loss/train': 1.3744616508483887} 11/07/2021 11:26:27 - INFO - __main__ - Step 101213: {'lr': 0.00012250220126632332, 'samples': 19432896, 'steps': 101212, 'loss/train': 1.4148085117340088} 11/07/2021 11:26:27 - INFO - __main__ - Step 101214: {'lr': 0.00012249763654170436, 'samples': 19433088, 'steps': 101213, 'loss/train': 1.8064080476760864} 11/07/2021 11:26:28 - INFO - __main__ - Step 101215: {'lr': 0.00012249307187453605, 'samples': 19433280, 'steps': 101214, 'loss/train': 1.3372564315795898} 11/07/2021 11:26:29 - INFO - __main__ - Step 101216: {'lr': 0.00012248850726482034, 'samples': 19433472, 'steps': 101215, 'loss/train': 1.6402256488800049} 11/07/2021 11:26:29 - INFO - __main__ - Step 101217: {'lr': 0.0001224839427125594, 'samples': 19433664, 'steps': 101216, 'loss/train': 1.2066577672958374} 11/07/2021 11:26:29 - INFO - __main__ - Step 101218: {'lr': 0.0001224793782177552, 'samples': 19433856, 'steps': 101217, 'loss/train': 1.5026732683181763} 11/07/2021 11:26:30 - INFO - __main__ - Step 101219: {'lr': 0.00012247481378040978, 'samples': 19434048, 'steps': 101218, 'loss/train': 1.469998836517334} 11/07/2021 11:26:31 - INFO - __main__ - Step 101220: {'lr': 0.00012247024940052525, 'samples': 19434240, 'steps': 101219, 'loss/train': 1.327674150466919} 11/07/2021 11:26:31 - INFO - __main__ - Step 101221: {'lr': 0.0001224656850781037, 'samples': 19434432, 'steps': 101220, 'loss/train': 1.353926420211792} 11/07/2021 11:26:32 - INFO - __main__ - Step 101222: {'lr': 0.0001224611208131471, 'samples': 19434624, 'steps': 101221, 'loss/train': 0.9063267707824707} 11/07/2021 11:26:32 - INFO - __main__ - Step 101223: {'lr': 0.00012245655660565754, 'samples': 19434816, 'steps': 101222, 'loss/train': 1.3300790786743164} 11/07/2021 11:26:32 - INFO - __main__ - Step 101224: {'lr': 0.00012245199245563713, 'samples': 19435008, 'steps': 101223, 'loss/train': 1.3407707214355469} 11/07/2021 11:26:33 - INFO - __main__ - Step 101225: {'lr': 0.00012244742836308787, 'samples': 19435200, 'steps': 101224, 'loss/train': 1.1268430948257446} 11/07/2021 11:26:34 - INFO - __main__ - Step 101226: {'lr': 0.00012244286432801184, 'samples': 19435392, 'steps': 101225, 'loss/train': 1.5467814207077026} 11/07/2021 11:26:34 - INFO - __main__ - Step 101227: {'lr': 0.00012243830035041104, 'samples': 19435584, 'steps': 101226, 'loss/train': 1.2552826404571533} 11/07/2021 11:26:34 - INFO - __main__ - Step 101228: {'lr': 0.0001224337364302876, 'samples': 19435776, 'steps': 101227, 'loss/train': 0.9073700904846191} 11/07/2021 11:26:35 - INFO - __main__ - Step 101229: {'lr': 0.00012242917256764354, 'samples': 19435968, 'steps': 101228, 'loss/train': 1.5969407558441162} 11/07/2021 11:26:35 - INFO - __main__ - Step 101230: {'lr': 0.00012242460876248095, 'samples': 19436160, 'steps': 101229, 'loss/train': 1.0604459047317505} 11/07/2021 11:26:36 - INFO - __main__ - Step 101231: {'lr': 0.00012242004501480198, 'samples': 19436352, 'steps': 101230, 'loss/train': 1.1925344467163086} 11/07/2021 11:26:37 - INFO - __main__ - Step 101232: {'lr': 0.0001224154813246084, 'samples': 19436544, 'steps': 101231, 'loss/train': 0.5589534044265747} 11/07/2021 11:26:37 - INFO - __main__ - Step 101233: {'lr': 0.0001224109176919025, 'samples': 19436736, 'steps': 101232, 'loss/train': 1.2006711959838867} 11/07/2021 11:26:37 - INFO - __main__ - Step 101234: {'lr': 0.00012240635411668623, 'samples': 19436928, 'steps': 101233, 'loss/train': 1.652377963066101} 11/07/2021 11:26:38 - INFO - __main__ - Step 101235: {'lr': 0.00012240179059896171, 'samples': 19437120, 'steps': 101234, 'loss/train': 1.2489838600158691} 11/07/2021 11:26:39 - INFO - __main__ - Step 101236: {'lr': 0.000122397227138731, 'samples': 19437312, 'steps': 101235, 'loss/train': 1.286239743232727} 11/07/2021 11:26:39 - INFO - __main__ - Step 101237: {'lr': 0.00012239266373599607, 'samples': 19437504, 'steps': 101236, 'loss/train': 1.410408616065979} 11/07/2021 11:26:39 - INFO - __main__ - Step 101238: {'lr': 0.0001223881003907591, 'samples': 19437696, 'steps': 101237, 'loss/train': 1.341553807258606} 11/07/2021 11:26:40 - INFO - __main__ - Step 101239: {'lr': 0.00012238353710302202, 'samples': 19437888, 'steps': 101238, 'loss/train': 1.453397512435913} 11/07/2021 11:26:40 - INFO - __main__ - Step 101240: {'lr': 0.000122378973872787, 'samples': 19438080, 'steps': 101239, 'loss/train': 1.366095781326294} 11/07/2021 11:26:41 - INFO - __main__ - Step 101241: {'lr': 0.00012237441070005604, 'samples': 19438272, 'steps': 101240, 'loss/train': 0.5344693064689636} 11/07/2021 11:26:41 - INFO - __main__ - Step 101242: {'lr': 0.00012236984758483117, 'samples': 19438464, 'steps': 101241, 'loss/train': 1.362892985343933} 11/07/2021 11:26:42 - INFO - __main__ - Step 101243: {'lr': 0.00012236528452711447, 'samples': 19438656, 'steps': 101242, 'loss/train': 1.5588220357894897} 11/07/2021 11:26:42 - INFO - __main__ - Step 101244: {'lr': 0.00012236072152690814, 'samples': 19438848, 'steps': 101243, 'loss/train': 1.7059507369995117} 11/07/2021 11:26:42 - INFO - __main__ - Step 101245: {'lr': 0.000122356158584214, 'samples': 19439040, 'steps': 101244, 'loss/train': 1.4351974725723267} 11/07/2021 11:26:44 - INFO - __main__ - Step 101246: {'lr': 0.00012235159569903416, 'samples': 19439232, 'steps': 101245, 'loss/train': 1.300586223602295} 11/07/2021 11:26:44 - INFO - __main__ - Step 101247: {'lr': 0.00012234703287137077, 'samples': 19439424, 'steps': 101246, 'loss/train': 0.8415517210960388} 11/07/2021 11:26:44 - INFO - __main__ - Step 101248: {'lr': 0.00012234247010122583, 'samples': 19439616, 'steps': 101247, 'loss/train': 1.3683750629425049} 11/07/2021 11:26:45 - INFO - __main__ - Step 101249: {'lr': 0.0001223379073886014, 'samples': 19439808, 'steps': 101248, 'loss/train': 1.4726701974868774} 11/07/2021 11:26:45 - INFO - __main__ - Step 101250: {'lr': 0.00012233334473349953, 'samples': 19440000, 'steps': 101249, 'loss/train': 1.164370059967041} 11/07/2021 11:26:46 - INFO - __main__ - Step 101251: {'lr': 0.00012232878213592227, 'samples': 19440192, 'steps': 101250, 'loss/train': 1.7194451093673706} 11/07/2021 11:26:46 - INFO - __main__ - Step 101252: {'lr': 0.0001223242195958717, 'samples': 19440384, 'steps': 101251, 'loss/train': 1.447556734085083} 11/07/2021 11:26:47 - INFO - __main__ - Step 101253: {'lr': 0.0001223196571133499, 'samples': 19440576, 'steps': 101252, 'loss/train': 1.3979809284210205} 11/07/2021 11:26:47 - INFO - __main__ - Step 101254: {'lr': 0.00012231509468835886, 'samples': 19440768, 'steps': 101253, 'loss/train': 1.2759636640548706} 11/07/2021 11:26:47 - INFO - __main__ - Step 101255: {'lr': 0.00012231053232090067, 'samples': 19440960, 'steps': 101254, 'loss/train': 1.2700797319412231} 11/07/2021 11:26:48 - INFO - __main__ - Step 101256: {'lr': 0.00012230597001097737, 'samples': 19441152, 'steps': 101255, 'loss/train': 1.0776116847991943} 11/07/2021 11:26:49 - INFO - __main__ - Step 101257: {'lr': 0.00012230140775859117, 'samples': 19441344, 'steps': 101256, 'loss/train': 1.321317195892334} 11/07/2021 11:26:49 - INFO - __main__ - Step 101258: {'lr': 0.00012229684556374384, 'samples': 19441536, 'steps': 101257, 'loss/train': 1.2945237159729004} 11/07/2021 11:26:49 - INFO - __main__ - Step 101259: {'lr': 0.00012229228342643762, 'samples': 19441728, 'steps': 101258, 'loss/train': 0.4369581937789917} 11/07/2021 11:26:50 - INFO - __main__ - Step 101260: {'lr': 0.0001222877213466745, 'samples': 19441920, 'steps': 101259, 'loss/train': 1.0955170392990112} 11/07/2021 11:26:51 - INFO - __main__ - Step 101261: {'lr': 0.0001222831593244566, 'samples': 19442112, 'steps': 101260, 'loss/train': 1.6079045534133911} 11/07/2021 11:26:51 - INFO - __main__ - Step 101262: {'lr': 0.00012227859735978587, 'samples': 19442304, 'steps': 101261, 'loss/train': 1.4797120094299316} 11/07/2021 11:26:52 - INFO - __main__ - Step 101263: {'lr': 0.0001222740354526645, 'samples': 19442496, 'steps': 101262, 'loss/train': 1.1269845962524414} 11/07/2021 11:26:52 - INFO - __main__ - Step 101264: {'lr': 0.00012226947360309442, 'samples': 19442688, 'steps': 101263, 'loss/train': 1.2453464269638062} 11/07/2021 11:26:52 - INFO - __main__ - Step 101265: {'lr': 0.0001222649118110778, 'samples': 19442880, 'steps': 101264, 'loss/train': 1.3305848836898804} 11/07/2021 11:26:53 - INFO - __main__ - Step 101266: {'lr': 0.0001222603500766166, 'samples': 19443072, 'steps': 101265, 'loss/train': 1.4527479410171509} 11/07/2021 11:26:54 - INFO - __main__ - Step 101267: {'lr': 0.00012225578839971293, 'samples': 19443264, 'steps': 101266, 'loss/train': 1.4282265901565552} 11/07/2021 11:26:54 - INFO - __main__ - Step 101268: {'lr': 0.00012225122678036885, 'samples': 19443456, 'steps': 101267, 'loss/train': 1.466986894607544} 11/07/2021 11:26:54 - INFO - __main__ - Step 101269: {'lr': 0.00012224666521858636, 'samples': 19443648, 'steps': 101268, 'loss/train': 1.1248159408569336} 11/07/2021 11:26:55 - INFO - __main__ - Step 101270: {'lr': 0.00012224210371436755, 'samples': 19443840, 'steps': 101269, 'loss/train': 1.2633976936340332} 11/07/2021 11:26:55 - INFO - __main__ - Step 101271: {'lr': 0.00012223754226771462, 'samples': 19444032, 'steps': 101270, 'loss/train': 1.3676156997680664} 11/07/2021 11:26:56 - INFO - __main__ - Step 101272: {'lr': 0.00012223298087862936, 'samples': 19444224, 'steps': 101271, 'loss/train': 0.7444912791252136} 11/07/2021 11:26:56 - INFO - __main__ - Step 101273: {'lr': 0.00012222841954711395, 'samples': 19444416, 'steps': 101272, 'loss/train': 1.5508164167404175} 11/07/2021 11:26:57 - INFO - __main__ - Step 101274: {'lr': 0.00012222385827317041, 'samples': 19444608, 'steps': 101273, 'loss/train': 1.2023595571517944} 11/07/2021 11:26:57 - INFO - __main__ - Step 101275: {'lr': 0.00012221929705680086, 'samples': 19444800, 'steps': 101274, 'loss/train': 1.0778768062591553} 11/07/2021 11:26:57 - INFO - __main__ - Step 101276: {'lr': 0.00012221473589800732, 'samples': 19444992, 'steps': 101275, 'loss/train': 1.6313016414642334} 11/07/2021 11:26:59 - INFO - __main__ - Step 101277: {'lr': 0.00012221017479679182, 'samples': 19445184, 'steps': 101276, 'loss/train': 0.08390719443559647} 11/07/2021 11:26:59 - INFO - __main__ - Step 101278: {'lr': 0.0001222056137531565, 'samples': 19445376, 'steps': 101277, 'loss/train': 1.5787321329116821} 11/07/2021 11:26:59 - INFO - __main__ - Step 101279: {'lr': 0.00012220105276710333, 'samples': 19445568, 'steps': 101278, 'loss/train': 1.1105891466140747} 11/07/2021 11:27:00 - INFO - __main__ - Step 101280: {'lr': 0.0001221964918386344, 'samples': 19445760, 'steps': 101279, 'loss/train': 1.2421479225158691} 11/07/2021 11:27:00 - INFO - __main__ - Step 101281: {'lr': 0.0001221919309677517, 'samples': 19445952, 'steps': 101280, 'loss/train': 1.0593217611312866} 11/07/2021 11:27:01 - INFO - __main__ - Step 101282: {'lr': 0.0001221873701544574, 'samples': 19446144, 'steps': 101281, 'loss/train': 0.660508394241333} 11/07/2021 11:27:01 - INFO - __main__ - Step 101283: {'lr': 0.0001221828093987535, 'samples': 19446336, 'steps': 101282, 'loss/train': 1.3701599836349487} 11/07/2021 11:27:02 - INFO - __main__ - Step 101284: {'lr': 0.00012217824870064216, 'samples': 19446528, 'steps': 101283, 'loss/train': 1.141298532485962} 11/07/2021 11:27:02 - INFO - __main__ - Step 101285: {'lr': 0.0001221736880601252, 'samples': 19446720, 'steps': 101284, 'loss/train': 1.1886085271835327} 11/07/2021 11:27:02 - INFO - __main__ - Step 101286: {'lr': 0.00012216912747720483, 'samples': 19446912, 'steps': 101285, 'loss/train': 1.3639178276062012} 11/07/2021 11:27:03 - INFO - __main__ - Step 101287: {'lr': 0.00012216456695188306, 'samples': 19447104, 'steps': 101286, 'loss/train': 1.0865598917007446} 11/07/2021 11:27:04 - INFO - __main__ - Step 101288: {'lr': 0.00012216000648416199, 'samples': 19447296, 'steps': 101287, 'loss/train': 0.9509721994400024} 11/07/2021 11:27:04 - INFO - __main__ - Step 101289: {'lr': 0.0001221554460740436, 'samples': 19447488, 'steps': 101288, 'loss/train': 1.2690616846084595} 11/07/2021 11:27:04 - INFO - __main__ - Step 101290: {'lr': 0.00012215088572153002, 'samples': 19447680, 'steps': 101289, 'loss/train': 1.5789374113082886} 11/07/2021 11:27:05 - INFO - __main__ - Step 101291: {'lr': 0.0001221463254266233, 'samples': 19447872, 'steps': 101290, 'loss/train': 0.9096230268478394} 11/07/2021 11:27:06 - INFO - __main__ - Step 101292: {'lr': 0.00012214176518932543, 'samples': 19448064, 'steps': 101291, 'loss/train': 1.2193411588668823} 11/07/2021 11:27:06 - INFO - __main__ - Step 101293: {'lr': 0.00012213720500963855, 'samples': 19448256, 'steps': 101292, 'loss/train': 1.1847176551818848} 11/07/2021 11:27:07 - INFO - __main__ - Step 101294: {'lr': 0.00012213264488756466, 'samples': 19448448, 'steps': 101293, 'loss/train': 1.3884031772613525} 11/07/2021 11:27:07 - INFO - __main__ - Step 101295: {'lr': 0.0001221280848231058, 'samples': 19448640, 'steps': 101294, 'loss/train': 1.610235571861267} 11/07/2021 11:27:07 - INFO - __main__ - Step 101296: {'lr': 0.00012212352481626407, 'samples': 19448832, 'steps': 101295, 'loss/train': 1.2318921089172363} 11/07/2021 11:27:08 - INFO - __main__ - Step 101297: {'lr': 0.00012211896486704151, 'samples': 19449024, 'steps': 101296, 'loss/train': 0.5148300528526306} 11/07/2021 11:27:09 - INFO - __main__ - Step 101298: {'lr': 0.00012211440497544027, 'samples': 19449216, 'steps': 101297, 'loss/train': 1.6484218835830688} 11/07/2021 11:27:09 - INFO - __main__ - Step 101299: {'lr': 0.0001221098451414622, 'samples': 19449408, 'steps': 101298, 'loss/train': 0.8639827966690063} 11/07/2021 11:27:09 - INFO - __main__ - Step 101300: {'lr': 0.00012210528536510948, 'samples': 19449600, 'steps': 101299, 'loss/train': 1.4384722709655762} 11/07/2021 11:27:10 - INFO - __main__ - Step 101301: {'lr': 0.0001221007256463841, 'samples': 19449792, 'steps': 101300, 'loss/train': 1.581348180770874} 11/07/2021 11:27:10 - INFO - __main__ - Step 101302: {'lr': 0.0001220961659852882, 'samples': 19449984, 'steps': 101301, 'loss/train': 1.0717953443527222} 11/07/2021 11:27:11 - INFO - __main__ - Step 101303: {'lr': 0.00012209160638182378, 'samples': 19450176, 'steps': 101302, 'loss/train': 1.6713885068893433} 11/07/2021 11:27:11 - INFO - __main__ - Step 101304: {'lr': 0.00012208704683599293, 'samples': 19450368, 'steps': 101303, 'loss/train': 1.0750707387924194} 11/07/2021 11:27:12 - INFO - __main__ - Step 101305: {'lr': 0.00012208248734779767, 'samples': 19450560, 'steps': 101304, 'loss/train': 1.105109453201294} 11/07/2021 11:27:12 - INFO - __main__ - Step 101306: {'lr': 0.00012207792791724004, 'samples': 19450752, 'steps': 101305, 'loss/train': 0.6176126599311829} 11/07/2021 11:27:12 - INFO - __main__ - Step 101307: {'lr': 0.00012207336854432217, 'samples': 19450944, 'steps': 101306, 'loss/train': 0.48122313618659973} 11/07/2021 11:27:13 - INFO - __main__ - Step 101308: {'lr': 0.00012206880922904603, 'samples': 19451136, 'steps': 101307, 'loss/train': 1.8669558763504028} 11/07/2021 11:27:14 - INFO - __main__ - Step 101309: {'lr': 0.00012206424997141371, 'samples': 19451328, 'steps': 101308, 'loss/train': 0.9174730181694031} 11/07/2021 11:27:14 - INFO - __main__ - Step 101310: {'lr': 0.00012205969077142729, 'samples': 19451520, 'steps': 101309, 'loss/train': 1.1259130239486694} 11/07/2021 11:27:15 - INFO - __main__ - Step 101311: {'lr': 0.00012205513162908888, 'samples': 19451712, 'steps': 101310, 'loss/train': 1.297237515449524} 11/07/2021 11:27:15 - INFO - __main__ - Step 101312: {'lr': 0.00012205057254440036, 'samples': 19451904, 'steps': 101311, 'loss/train': 1.2124184370040894} 11/07/2021 11:27:16 - INFO - __main__ - Step 101313: {'lr': 0.00012204601351736385, 'samples': 19452096, 'steps': 101312, 'loss/train': 1.5318907499313354} 11/07/2021 11:27:16 - INFO - __main__ - Step 101314: {'lr': 0.00012204145454798147, 'samples': 19452288, 'steps': 101313, 'loss/train': 0.5102567076683044} 11/07/2021 11:27:17 - INFO - __main__ - Step 101315: {'lr': 0.00012203689563625522, 'samples': 19452480, 'steps': 101314, 'loss/train': 1.4345731735229492} 11/07/2021 11:27:17 - INFO - __main__ - Step 101316: {'lr': 0.00012203233678218717, 'samples': 19452672, 'steps': 101315, 'loss/train': 1.3523664474487305} 11/07/2021 11:27:18 - INFO - __main__ - Step 101317: {'lr': 0.00012202777798577938, 'samples': 19452864, 'steps': 101316, 'loss/train': 1.3811012506484985} 11/07/2021 11:27:19 - INFO - __main__ - Step 101318: {'lr': 0.00012202321924703388, 'samples': 19453056, 'steps': 101317, 'loss/train': 1.6355777978897095} 11/07/2021 11:27:19 - INFO - __main__ - Step 101319: {'lr': 0.00012201866056595279, 'samples': 19453248, 'steps': 101318, 'loss/train': 1.284055471420288} 11/07/2021 11:27:19 - INFO - __main__ - Step 101320: {'lr': 0.00012201410194253806, 'samples': 19453440, 'steps': 101319, 'loss/train': 1.7360575199127197} 11/07/2021 11:27:20 - INFO - __main__ - Step 101321: {'lr': 0.00012200954337679185, 'samples': 19453632, 'steps': 101320, 'loss/train': 1.4220184087753296} 11/07/2021 11:27:20 - INFO - __main__ - Step 101322: {'lr': 0.00012200498486871622, 'samples': 19453824, 'steps': 101321, 'loss/train': 1.4509003162384033} 11/07/2021 11:27:20 - INFO - __main__ - Step 101323: {'lr': 0.0001220004264183131, 'samples': 19454016, 'steps': 101322, 'loss/train': 1.4320480823516846} 11/07/2021 11:27:21 - INFO - __main__ - Step 101324: {'lr': 0.0001219958680255846, 'samples': 19454208, 'steps': 101323, 'loss/train': 1.2435252666473389} 11/07/2021 11:27:22 - INFO - __main__ - Step 101325: {'lr': 0.0001219913096905328, 'samples': 19454400, 'steps': 101324, 'loss/train': 1.7590205669403076} 11/07/2021 11:27:22 - INFO - __main__ - Step 101326: {'lr': 0.0001219867514131597, 'samples': 19454592, 'steps': 101325, 'loss/train': 1.016432285308838} 11/07/2021 11:27:22 - INFO - __main__ - Step 101327: {'lr': 0.00012198219319346743, 'samples': 19454784, 'steps': 101326, 'loss/train': 1.8464399576187134} 11/07/2021 11:27:23 - INFO - __main__ - Step 101328: {'lr': 0.000121977635031458, 'samples': 19454976, 'steps': 101327, 'loss/train': 1.4440869092941284} 11/07/2021 11:27:24 - INFO - __main__ - Step 101329: {'lr': 0.00012197307692713347, 'samples': 19455168, 'steps': 101328, 'loss/train': 1.7582616806030273} 11/07/2021 11:27:24 - INFO - __main__ - Step 101330: {'lr': 0.00012196851888049592, 'samples': 19455360, 'steps': 101329, 'loss/train': 1.1633861064910889} 11/07/2021 11:27:25 - INFO - __main__ - Step 101331: {'lr': 0.00012196396089154734, 'samples': 19455552, 'steps': 101330, 'loss/train': 1.6656498908996582} 11/07/2021 11:27:25 - INFO - __main__ - Step 101332: {'lr': 0.00012195940296028984, 'samples': 19455744, 'steps': 101331, 'loss/train': 0.20168174803256989} 11/07/2021 11:27:25 - INFO - __main__ - Step 101333: {'lr': 0.00012195484508672558, 'samples': 19455936, 'steps': 101332, 'loss/train': 1.2036792039871216} 11/07/2021 11:27:26 - INFO - __main__ - Step 101334: {'lr': 0.00012195028727085636, 'samples': 19456128, 'steps': 101333, 'loss/train': 0.39160385727882385} 11/07/2021 11:27:27 - INFO - __main__ - Step 101335: {'lr': 0.00012194572951268438, 'samples': 19456320, 'steps': 101334, 'loss/train': 1.068116545677185} 11/07/2021 11:27:27 - INFO - __main__ - Step 101336: {'lr': 0.00012194117181221168, 'samples': 19456512, 'steps': 101335, 'loss/train': 1.046228051185608} 11/07/2021 11:27:27 - INFO - __main__ - Step 101337: {'lr': 0.0001219366141694403, 'samples': 19456704, 'steps': 101336, 'loss/train': 1.4916361570358276} 11/07/2021 11:27:28 - INFO - __main__ - Step 101338: {'lr': 0.00012193205658437232, 'samples': 19456896, 'steps': 101337, 'loss/train': 1.3939441442489624} 11/07/2021 11:27:29 - INFO - __main__ - Step 101339: {'lr': 0.00012192749905700976, 'samples': 19457088, 'steps': 101338, 'loss/train': 1.19486403465271} 11/07/2021 11:27:29 - INFO - __main__ - Step 101340: {'lr': 0.0001219229415873547, 'samples': 19457280, 'steps': 101339, 'loss/train': 1.2700765132904053} 11/07/2021 11:27:29 - INFO - __main__ - Step 101341: {'lr': 0.00012191838417540921, 'samples': 19457472, 'steps': 101340, 'loss/train': 1.4088376760482788} 11/07/2021 11:27:30 - INFO - __main__ - Step 101342: {'lr': 0.0001219138268211753, 'samples': 19457664, 'steps': 101341, 'loss/train': 1.2342462539672852} 11/07/2021 11:27:30 - INFO - __main__ - Step 101343: {'lr': 0.00012190926952465504, 'samples': 19457856, 'steps': 101342, 'loss/train': 1.3421458005905151} 11/07/2021 11:27:31 - INFO - __main__ - Step 101344: {'lr': 0.00012190471228585057, 'samples': 19458048, 'steps': 101343, 'loss/train': 1.4025328159332275} 11/07/2021 11:27:31 - INFO - __main__ - Step 101345: {'lr': 0.0001219001551047638, 'samples': 19458240, 'steps': 101344, 'loss/train': 1.3209097385406494} 11/07/2021 11:27:32 - INFO - __main__ - Step 101346: {'lr': 0.00012189559798139682, 'samples': 19458432, 'steps': 101345, 'loss/train': 1.3473972082138062} 11/07/2021 11:27:32 - INFO - __main__ - Step 101347: {'lr': 0.0001218910409157517, 'samples': 19458624, 'steps': 101346, 'loss/train': 1.3859890699386597} 11/07/2021 11:27:33 - INFO - __main__ - Step 101348: {'lr': 0.00012188648390783049, 'samples': 19458816, 'steps': 101347, 'loss/train': 1.343144178390503} 11/07/2021 11:27:33 - INFO - __main__ - Step 101349: {'lr': 0.00012188192695763528, 'samples': 19459008, 'steps': 101348, 'loss/train': 1.3729497194290161} 11/07/2021 11:27:34 - INFO - __main__ - Step 101350: {'lr': 0.00012187737006516811, 'samples': 19459200, 'steps': 101349, 'loss/train': 1.4835797548294067} 11/07/2021 11:27:34 - INFO - __main__ - Step 101351: {'lr': 0.00012187281323043098, 'samples': 19459392, 'steps': 101350, 'loss/train': 1.1938303709030151} 11/07/2021 11:27:35 - INFO - __main__ - Step 101352: {'lr': 0.000121868256453426, 'samples': 19459584, 'steps': 101351, 'loss/train': 1.514434814453125} 11/07/2021 11:27:35 - INFO - __main__ - Step 101353: {'lr': 0.00012186369973415523, 'samples': 19459776, 'steps': 101352, 'loss/train': 1.4192053079605103} 11/07/2021 11:27:35 - INFO - __main__ - Step 101354: {'lr': 0.00012185914307262066, 'samples': 19459968, 'steps': 101353, 'loss/train': 1.3225945234298706} 11/07/2021 11:27:36 - INFO - __main__ - Step 101355: {'lr': 0.00012185458646882453, 'samples': 19460160, 'steps': 101354, 'loss/train': 1.161056399345398} 11/07/2021 11:27:37 - INFO - __main__ - Step 101356: {'lr': 0.00012185002992276859, 'samples': 19460352, 'steps': 101355, 'loss/train': 1.724161148071289} 11/07/2021 11:27:37 - INFO - __main__ - Step 101357: {'lr': 0.0001218454734344551, 'samples': 19460544, 'steps': 101356, 'loss/train': 1.3761242628097534} 11/07/2021 11:27:37 - INFO - __main__ - Step 101358: {'lr': 0.00012184091700388603, 'samples': 19460736, 'steps': 101357, 'loss/train': 0.5391926169395447} 11/07/2021 11:27:38 - INFO - __main__ - Step 101359: {'lr': 0.00012183636063106345, 'samples': 19460928, 'steps': 101358, 'loss/train': 1.6339879035949707} 11/07/2021 11:27:39 - INFO - __main__ - Step 101360: {'lr': 0.00012183180431598947, 'samples': 19461120, 'steps': 101359, 'loss/train': 1.169582486152649} 11/07/2021 11:27:39 - INFO - __main__ - Step 101361: {'lr': 0.00012182724805866607, 'samples': 19461312, 'steps': 101360, 'loss/train': 1.3853405714035034} 11/07/2021 11:27:39 - INFO - __main__ - Step 101362: {'lr': 0.00012182269185909536, 'samples': 19461504, 'steps': 101361, 'loss/train': 1.4525963068008423} 11/07/2021 11:27:40 - INFO - __main__ - Step 101363: {'lr': 0.00012181813571727937, 'samples': 19461696, 'steps': 101362, 'loss/train': 1.2796928882598877} 11/07/2021 11:27:40 - INFO - __main__ - Step 101364: {'lr': 0.00012181357963322012, 'samples': 19461888, 'steps': 101363, 'loss/train': 1.197716236114502} 11/07/2021 11:27:41 - INFO - __main__ - Step 101365: {'lr': 0.00012180902360691982, 'samples': 19462080, 'steps': 101364, 'loss/train': 1.2218133211135864} 11/07/2021 11:27:41 - INFO - __main__ - Step 101366: {'lr': 0.00012180446763838026, 'samples': 19462272, 'steps': 101365, 'loss/train': 0.8332762718200684} 11/07/2021 11:27:42 - INFO - __main__ - Step 101367: {'lr': 0.00012179991172760366, 'samples': 19462464, 'steps': 101366, 'loss/train': 0.7473368644714355} 11/07/2021 11:27:42 - INFO - __main__ - Step 101368: {'lr': 0.00012179535587459204, 'samples': 19462656, 'steps': 101367, 'loss/train': 1.501504898071289} 11/07/2021 11:27:43 - INFO - __main__ - Step 101369: {'lr': 0.00012179080007934746, 'samples': 19462848, 'steps': 101368, 'loss/train': 1.4032315015792847} 11/07/2021 11:27:44 - INFO - __main__ - Step 101370: {'lr': 0.00012178624434187193, 'samples': 19463040, 'steps': 101369, 'loss/train': 1.4872366189956665} 11/07/2021 11:27:44 - INFO - __main__ - Step 101371: {'lr': 0.00012178168866216757, 'samples': 19463232, 'steps': 101370, 'loss/train': 1.4036071300506592} 11/07/2021 11:27:44 - INFO - __main__ - Step 101372: {'lr': 0.0001217771330402364, 'samples': 19463424, 'steps': 101371, 'loss/train': 0.8709986805915833} 11/07/2021 11:27:45 - INFO - __main__ - Step 101373: {'lr': 0.00012177257747608048, 'samples': 19463616, 'steps': 101372, 'loss/train': 1.6144028902053833} 11/07/2021 11:27:45 - INFO - __main__ - Step 101374: {'lr': 0.00012176802196970186, 'samples': 19463808, 'steps': 101373, 'loss/train': 1.3233317136764526} 11/07/2021 11:27:46 - INFO - __main__ - Step 101375: {'lr': 0.00012176346652110257, 'samples': 19464000, 'steps': 101374, 'loss/train': 1.2651222944259644} 11/07/2021 11:27:46 - INFO - __main__ - Step 101376: {'lr': 0.0001217589111302847, 'samples': 19464192, 'steps': 101375, 'loss/train': 1.2235544919967651} 11/07/2021 11:27:47 - INFO - __main__ - Step 101377: {'lr': 0.00012175435579725028, 'samples': 19464384, 'steps': 101376, 'loss/train': 1.3525779247283936} 11/07/2021 11:27:47 - INFO - __main__ - Step 101378: {'lr': 0.00012174980052200146, 'samples': 19464576, 'steps': 101377, 'loss/train': 1.4210253953933716} 11/07/2021 11:27:47 - INFO - __main__ - Step 101379: {'lr': 0.00012174524530454012, 'samples': 19464768, 'steps': 101378, 'loss/train': 1.5144857168197632} 11/07/2021 11:27:48 - INFO - __main__ - Step 101380: {'lr': 0.00012174069014486839, 'samples': 19464960, 'steps': 101379, 'loss/train': 1.2330979108810425} 11/07/2021 11:27:49 - INFO - __main__ - Step 101381: {'lr': 0.00012173613504298831, 'samples': 19465152, 'steps': 101380, 'loss/train': 1.594601035118103} 11/07/2021 11:27:49 - INFO - __main__ - Step 101382: {'lr': 0.00012173157999890194, 'samples': 19465344, 'steps': 101381, 'loss/train': 0.1988002508878708} 11/07/2021 11:27:50 - INFO - __main__ - Step 101383: {'lr': 0.00012172702501261138, 'samples': 19465536, 'steps': 101382, 'loss/train': 0.7698079943656921} 11/07/2021 11:27:50 - INFO - __main__ - Step 101384: {'lr': 0.0001217224700841186, 'samples': 19465728, 'steps': 101383, 'loss/train': 1.1883580684661865} 11/07/2021 11:27:51 - INFO - __main__ - Step 101385: {'lr': 0.00012171791521342573, 'samples': 19465920, 'steps': 101384, 'loss/train': 1.2227933406829834} 11/07/2021 11:27:51 - INFO - __main__ - Step 101386: {'lr': 0.00012171336040053477, 'samples': 19466112, 'steps': 101385, 'loss/train': 1.461118221282959} 11/07/2021 11:27:52 - INFO - __main__ - Step 101387: {'lr': 0.00012170880564544778, 'samples': 19466304, 'steps': 101386, 'loss/train': 0.7040649652481079} 11/07/2021 11:27:52 - INFO - __main__ - Step 101388: {'lr': 0.00012170425094816687, 'samples': 19466496, 'steps': 101387, 'loss/train': 1.3770909309387207} 11/07/2021 11:27:52 - INFO - __main__ - Step 101389: {'lr': 0.00012169969630869399, 'samples': 19466688, 'steps': 101388, 'loss/train': 0.8945060968399048} 11/07/2021 11:27:53 - INFO - __main__ - Step 101390: {'lr': 0.00012169514172703128, 'samples': 19466880, 'steps': 101389, 'loss/train': 1.310351848602295} 11/07/2021 11:27:54 - INFO - __main__ - Step 101391: {'lr': 0.00012169058720318074, 'samples': 19467072, 'steps': 101390, 'loss/train': 1.1244922876358032} 11/07/2021 11:27:54 - INFO - __main__ - Step 101392: {'lr': 0.00012168603273714454, 'samples': 19467264, 'steps': 101391, 'loss/train': 1.4645586013793945} 11/07/2021 11:27:54 - INFO - __main__ - Step 101393: {'lr': 0.00012168147832892457, 'samples': 19467456, 'steps': 101392, 'loss/train': 1.018115758895874} 11/07/2021 11:27:55 - INFO - __main__ - Step 101394: {'lr': 0.0001216769239785229, 'samples': 19467648, 'steps': 101393, 'loss/train': 1.4568254947662354} 11/07/2021 11:27:55 - INFO - __main__ - Step 101395: {'lr': 0.00012167236968594165, 'samples': 19467840, 'steps': 101394, 'loss/train': 1.25934636592865} 11/07/2021 11:27:56 - INFO - __main__ - Step 101396: {'lr': 0.00012166781545118286, 'samples': 19468032, 'steps': 101395, 'loss/train': 1.5433648824691772} 11/07/2021 11:27:57 - INFO - __main__ - Step 101397: {'lr': 0.00012166326127424854, 'samples': 19468224, 'steps': 101396, 'loss/train': 1.0326343774795532} 11/07/2021 11:27:57 - INFO - __main__ - Step 101398: {'lr': 0.00012165870715514079, 'samples': 19468416, 'steps': 101397, 'loss/train': 0.9405523538589478} 11/07/2021 11:27:57 - INFO - __main__ - Step 101399: {'lr': 0.00012165415309386166, 'samples': 19468608, 'steps': 101398, 'loss/train': 1.3253085613250732} 11/07/2021 11:27:58 - INFO - __main__ - Step 101400: {'lr': 0.00012164959909041318, 'samples': 19468800, 'steps': 101399, 'loss/train': 1.340395212173462} 11/07/2021 11:27:59 - INFO - __main__ - Step 101401: {'lr': 0.00012164504514479741, 'samples': 19468992, 'steps': 101400, 'loss/train': 1.2853037118911743} 11/07/2021 11:27:59 - INFO - __main__ - Step 101402: {'lr': 0.0001216404912570164, 'samples': 19469184, 'steps': 101401, 'loss/train': 1.3257758617401123} 11/07/2021 11:27:59 - INFO - __main__ - Step 101403: {'lr': 0.00012163593742707222, 'samples': 19469376, 'steps': 101402, 'loss/train': 1.6441493034362793} 11/07/2021 11:28:00 - INFO - __main__ - Step 101404: {'lr': 0.00012163138365496687, 'samples': 19469568, 'steps': 101403, 'loss/train': 1.3154609203338623} 11/07/2021 11:28:00 - INFO - __main__ - Step 101405: {'lr': 0.00012162682994070257, 'samples': 19469760, 'steps': 101404, 'loss/train': 1.1809074878692627} 11/07/2021 11:28:01 - INFO - __main__ - Step 101406: {'lr': 0.0001216222762842811, 'samples': 19469952, 'steps': 101405, 'loss/train': 1.1491410732269287} 11/07/2021 11:28:01 - INFO - __main__ - Step 101407: {'lr': 0.00012161772268570471, 'samples': 19470144, 'steps': 101406, 'loss/train': 1.908919095993042} 11/07/2021 11:28:02 - INFO - __main__ - Step 101408: {'lr': 0.00012161316914497533, 'samples': 19470336, 'steps': 101407, 'loss/train': 1.2935210466384888} 11/07/2021 11:28:02 - INFO - __main__ - Step 101409: {'lr': 0.00012160861566209511, 'samples': 19470528, 'steps': 101408, 'loss/train': 1.4566062688827515} 11/07/2021 11:28:02 - INFO - __main__ - Step 101410: {'lr': 0.00012160406223706608, 'samples': 19470720, 'steps': 101409, 'loss/train': 1.288155436515808} 11/07/2021 11:28:03 - INFO - __main__ - Step 101411: {'lr': 0.00012159950886989024, 'samples': 19470912, 'steps': 101410, 'loss/train': 1.3913092613220215} 11/07/2021 11:28:04 - INFO - __main__ - Step 101412: {'lr': 0.0001215949555605697, 'samples': 19471104, 'steps': 101411, 'loss/train': 1.0347298383712769} 11/07/2021 11:28:04 - INFO - __main__ - Step 101413: {'lr': 0.00012159040230910651, 'samples': 19471296, 'steps': 101412, 'loss/train': 1.504852533340454} 11/07/2021 11:28:05 - INFO - __main__ - Step 101414: {'lr': 0.00012158584911550269, 'samples': 19471488, 'steps': 101413, 'loss/train': 1.483489751815796} 11/07/2021 11:28:05 - INFO - __main__ - Step 101415: {'lr': 0.00012158129597976029, 'samples': 19471680, 'steps': 101414, 'loss/train': 1.1846433877944946} 11/07/2021 11:28:05 - INFO - __main__ - Step 101416: {'lr': 0.0001215767429018814, 'samples': 19471872, 'steps': 101415, 'loss/train': 1.5302720069885254} 11/07/2021 11:28:06 - INFO - __main__ - Step 101417: {'lr': 0.00012157218988186802, 'samples': 19472064, 'steps': 101416, 'loss/train': 1.4060320854187012} 11/07/2021 11:28:07 - INFO - __main__ - Step 101418: {'lr': 0.00012156763691972226, 'samples': 19472256, 'steps': 101417, 'loss/train': 0.7293546199798584} 11/07/2021 11:28:07 - INFO - __main__ - Step 101419: {'lr': 0.00012156308401544621, 'samples': 19472448, 'steps': 101418, 'loss/train': 1.2125252485275269} 11/07/2021 11:28:07 - INFO - __main__ - Step 101420: {'lr': 0.00012155853116904178, 'samples': 19472640, 'steps': 101419, 'loss/train': 1.3955535888671875} 11/07/2021 11:28:08 - INFO - __main__ - Step 101421: {'lr': 0.00012155397838051108, 'samples': 19472832, 'steps': 101420, 'loss/train': 1.347899317741394} 11/07/2021 11:28:09 - INFO - __main__ - Step 101422: {'lr': 0.00012154942564985617, 'samples': 19473024, 'steps': 101421, 'loss/train': 1.3924529552459717} 11/07/2021 11:28:09 - INFO - __main__ - Step 101423: {'lr': 0.00012154487297707911, 'samples': 19473216, 'steps': 101422, 'loss/train': 0.8564873337745667} 11/07/2021 11:28:09 - INFO - __main__ - Step 101424: {'lr': 0.00012154032036218196, 'samples': 19473408, 'steps': 101423, 'loss/train': 1.311434268951416} 11/07/2021 11:28:10 - INFO - __main__ - Step 101425: {'lr': 0.00012153576780516673, 'samples': 19473600, 'steps': 101424, 'loss/train': 1.248645305633545} 11/07/2021 11:28:10 - INFO - __main__ - Step 101426: {'lr': 0.00012153121530603553, 'samples': 19473792, 'steps': 101425, 'loss/train': 1.2834768295288086} 11/07/2021 11:28:11 - INFO - __main__ - Step 101427: {'lr': 0.00012152666286479039, 'samples': 19473984, 'steps': 101426, 'loss/train': 1.278848648071289} 11/07/2021 11:28:11 - INFO - __main__ - Step 101428: {'lr': 0.00012152211048143333, 'samples': 19474176, 'steps': 101427, 'loss/train': 1.0938340425491333} 11/07/2021 11:28:12 - INFO - __main__ - Step 101429: {'lr': 0.00012151755815596643, 'samples': 19474368, 'steps': 101428, 'loss/train': 1.0509587526321411} 11/07/2021 11:28:12 - INFO - __main__ - Step 101430: {'lr': 0.00012151300588839173, 'samples': 19474560, 'steps': 101429, 'loss/train': 0.9636073112487793} 11/07/2021 11:28:12 - INFO - __main__ - Step 101431: {'lr': 0.0001215084536787113, 'samples': 19474752, 'steps': 101430, 'loss/train': 1.5259968042373657} 11/07/2021 11:28:13 - INFO - __main__ - Step 101432: {'lr': 0.00012150390152692728, 'samples': 19474944, 'steps': 101431, 'loss/train': 1.5241773128509521} 11/07/2021 11:28:14 - INFO - __main__ - Step 101433: {'lr': 0.0001214993494330415, 'samples': 19475136, 'steps': 101432, 'loss/train': 1.4321998357772827} 11/07/2021 11:28:14 - INFO - __main__ - Step 101434: {'lr': 0.00012149479739705613, 'samples': 19475328, 'steps': 101433, 'loss/train': 1.5620213747024536} 11/07/2021 11:28:14 - INFO - __main__ - Step 101435: {'lr': 0.00012149024541897325, 'samples': 19475520, 'steps': 101434, 'loss/train': 1.4634751081466675} 11/07/2021 11:28:15 - INFO - __main__ - Step 101436: {'lr': 0.00012148569349879484, 'samples': 19475712, 'steps': 101435, 'loss/train': 0.8615953922271729} 11/07/2021 11:28:15 - INFO - __main__ - Step 101437: {'lr': 0.00012148114163652305, 'samples': 19475904, 'steps': 101436, 'loss/train': 1.450104832649231} 11/07/2021 11:28:16 - INFO - __main__ - Step 101438: {'lr': 0.00012147658983215984, 'samples': 19476096, 'steps': 101437, 'loss/train': 1.070731520652771} 11/07/2021 11:28:17 - INFO - __main__ - Step 101439: {'lr': 0.00012147203808570728, 'samples': 19476288, 'steps': 101438, 'loss/train': 1.9339704513549805} 11/07/2021 11:28:17 - INFO - __main__ - Step 101440: {'lr': 0.00012146748639716745, 'samples': 19476480, 'steps': 101439, 'loss/train': 1.3621159791946411} 11/07/2021 11:28:17 - INFO - __main__ - Step 101441: {'lr': 0.0001214629347665424, 'samples': 19476672, 'steps': 101440, 'loss/train': 1.8407988548278809} 11/07/2021 11:28:18 - INFO - __main__ - Step 101442: {'lr': 0.00012145838319383418, 'samples': 19476864, 'steps': 101441, 'loss/train': 1.5115315914154053} 11/07/2021 11:28:19 - INFO - __main__ - Step 101443: {'lr': 0.00012145383167904481, 'samples': 19477056, 'steps': 101442, 'loss/train': 0.9618353843688965} 11/07/2021 11:28:19 - INFO - __main__ - Step 101444: {'lr': 0.00012144928022217635, 'samples': 19477248, 'steps': 101443, 'loss/train': 1.554343819618225} 11/07/2021 11:28:19 - INFO - __main__ - Step 101445: {'lr': 0.00012144472882323088, 'samples': 19477440, 'steps': 101444, 'loss/train': 0.9058675169944763} 11/07/2021 11:28:20 - INFO - __main__ - Step 101446: {'lr': 0.0001214401774822105, 'samples': 19477632, 'steps': 101445, 'loss/train': 1.2678899765014648} 11/07/2021 11:28:20 - INFO - __main__ - Step 101447: {'lr': 0.0001214356261991171, 'samples': 19477824, 'steps': 101446, 'loss/train': 1.2618036270141602} 11/07/2021 11:28:21 - INFO - __main__ - Step 101448: {'lr': 0.00012143107497395286, 'samples': 19478016, 'steps': 101447, 'loss/train': 1.2557196617126465} 11/07/2021 11:28:22 - INFO - __main__ - Step 101449: {'lr': 0.00012142652380671976, 'samples': 19478208, 'steps': 101448, 'loss/train': 1.2766857147216797} 11/07/2021 11:28:22 - INFO - __main__ - Step 101450: {'lr': 0.00012142197269741989, 'samples': 19478400, 'steps': 101449, 'loss/train': 1.4663594961166382} 11/07/2021 11:28:22 - INFO - __main__ - Step 101451: {'lr': 0.00012141742164605532, 'samples': 19478592, 'steps': 101450, 'loss/train': 2.1644675731658936} 11/07/2021 11:28:23 - INFO - __main__ - Step 101452: {'lr': 0.00012141287065262805, 'samples': 19478784, 'steps': 101451, 'loss/train': 1.4452106952667236} 11/07/2021 11:28:24 - INFO - __main__ - Step 101453: {'lr': 0.00012140831971714017, 'samples': 19478976, 'steps': 101452, 'loss/train': 1.136212944984436} 11/07/2021 11:28:24 - INFO - __main__ - Step 101454: {'lr': 0.00012140376883959369, 'samples': 19479168, 'steps': 101453, 'loss/train': 0.8170756697654724} 11/07/2021 11:28:24 - INFO - __main__ - Step 101455: {'lr': 0.00012139921801999071, 'samples': 19479360, 'steps': 101454, 'loss/train': 1.1802316904067993} 11/07/2021 11:28:25 - INFO - __main__ - Step 101456: {'lr': 0.00012139466725833326, 'samples': 19479552, 'steps': 101455, 'loss/train': 1.4095011949539185} 11/07/2021 11:28:25 - INFO - __main__ - Step 101457: {'lr': 0.00012139011655462338, 'samples': 19479744, 'steps': 101456, 'loss/train': 1.5888687372207642} 11/07/2021 11:28:26 - INFO - __main__ - Step 101458: {'lr': 0.00012138556590886312, 'samples': 19479936, 'steps': 101457, 'loss/train': 1.4267717599868774} 11/07/2021 11:28:26 - INFO - __main__ - Step 101459: {'lr': 0.00012138101532105467, 'samples': 19480128, 'steps': 101458, 'loss/train': 1.1556349992752075} 11/07/2021 11:28:27 - INFO - __main__ - Step 101460: {'lr': 0.00012137646479119982, 'samples': 19480320, 'steps': 101459, 'loss/train': 1.6247633695602417} 11/07/2021 11:28:27 - INFO - __main__ - Step 101461: {'lr': 0.00012137191431930075, 'samples': 19480512, 'steps': 101460, 'loss/train': 1.276069164276123} 11/07/2021 11:28:28 - INFO - __main__ - Step 101462: {'lr': 0.00012136736390535952, 'samples': 19480704, 'steps': 101461, 'loss/train': 1.3474265336990356} 11/07/2021 11:28:29 - INFO - __main__ - Step 101463: {'lr': 0.00012136281354937817, 'samples': 19480896, 'steps': 101462, 'loss/train': 1.0056970119476318} 11/07/2021 11:28:29 - INFO - __main__ - Step 101464: {'lr': 0.00012135826325135877, 'samples': 19481088, 'steps': 101463, 'loss/train': 1.1711318492889404} 11/07/2021 11:28:29 - INFO - __main__ - Step 101465: {'lr': 0.00012135371301130332, 'samples': 19481280, 'steps': 101464, 'loss/train': 1.0059560537338257} 11/07/2021 11:28:30 - INFO - __main__ - Step 101466: {'lr': 0.00012134916282921393, 'samples': 19481472, 'steps': 101465, 'loss/train': 1.488689661026001} 11/07/2021 11:28:30 - INFO - __main__ - Step 101467: {'lr': 0.00012134461270509259, 'samples': 19481664, 'steps': 101466, 'loss/train': 0.9754695296287537} 11/07/2021 11:28:30 - INFO - __main__ - Step 101468: {'lr': 0.0001213400626389414, 'samples': 19481856, 'steps': 101467, 'loss/train': 1.1037694215774536} 11/07/2021 11:28:31 - INFO - __main__ - Step 101469: {'lr': 0.0001213355126307624, 'samples': 19482048, 'steps': 101468, 'loss/train': 1.294596791267395} 11/07/2021 11:28:32 - INFO - __main__ - Step 101470: {'lr': 0.00012133096268055763, 'samples': 19482240, 'steps': 101469, 'loss/train': 1.2861746549606323} 11/07/2021 11:28:32 - INFO - __main__ - Step 101471: {'lr': 0.00012132641278832915, 'samples': 19482432, 'steps': 101470, 'loss/train': 1.336371660232544} 11/07/2021 11:28:32 - INFO - __main__ - Step 101472: {'lr': 0.00012132186295407899, 'samples': 19482624, 'steps': 101471, 'loss/train': 1.9227213859558105} 11/07/2021 11:28:33 - INFO - __main__ - Step 101473: {'lr': 0.0001213173131778093, 'samples': 19482816, 'steps': 101472, 'loss/train': 1.5872811079025269} 11/07/2021 11:28:34 - INFO - __main__ - Step 101474: {'lr': 0.00012131276345952197, 'samples': 19483008, 'steps': 101473, 'loss/train': 1.6346526145935059} 11/07/2021 11:28:34 - INFO - __main__ - Step 101475: {'lr': 0.0001213082137992191, 'samples': 19483200, 'steps': 101474, 'loss/train': 1.3455204963684082} 11/07/2021 11:28:35 - INFO - __main__ - Step 101476: {'lr': 0.00012130366419690277, 'samples': 19483392, 'steps': 101475, 'loss/train': 0.1082928255200386} 11/07/2021 11:28:35 - INFO - __main__ - Step 101477: {'lr': 0.00012129911465257504, 'samples': 19483584, 'steps': 101476, 'loss/train': 1.742929220199585} 11/07/2021 11:28:35 - INFO - __main__ - Step 101478: {'lr': 0.00012129456516623791, 'samples': 19483776, 'steps': 101477, 'loss/train': 1.1165751218795776} 11/07/2021 11:28:37 - INFO - __main__ - Step 101479: {'lr': 0.0001212900157378935, 'samples': 19483968, 'steps': 101478, 'loss/train': 1.2501299381256104} 11/07/2021 11:28:37 - INFO - __main__ - Step 101480: {'lr': 0.00012128546636754379, 'samples': 19484160, 'steps': 101479, 'loss/train': 1.263221025466919} 11/07/2021 11:28:37 - INFO - __main__ - Step 101481: {'lr': 0.00012128091705519086, 'samples': 19484352, 'steps': 101480, 'loss/train': 0.9899769425392151} 11/07/2021 11:28:38 - INFO - __main__ - Step 101482: {'lr': 0.0001212763678008368, 'samples': 19484544, 'steps': 101481, 'loss/train': 1.2671083211898804} 11/07/2021 11:28:38 - INFO - __main__ - Step 101483: {'lr': 0.00012127181860448361, 'samples': 19484736, 'steps': 101482, 'loss/train': 1.062066674232483} 11/07/2021 11:28:39 - INFO - __main__ - Step 101484: {'lr': 0.00012126726946613334, 'samples': 19484928, 'steps': 101483, 'loss/train': 1.2306877374649048} 11/07/2021 11:28:39 - INFO - __main__ - Step 101485: {'lr': 0.00012126272038578806, 'samples': 19485120, 'steps': 101484, 'loss/train': 1.972767949104309} 11/07/2021 11:28:40 - INFO - __main__ - Step 101486: {'lr': 0.00012125817136344992, 'samples': 19485312, 'steps': 101485, 'loss/train': 1.791816234588623} 11/07/2021 11:28:40 - INFO - __main__ - Step 101487: {'lr': 0.00012125362239912071, 'samples': 19485504, 'steps': 101486, 'loss/train': 1.060530185699463} 11/07/2021 11:28:40 - INFO - __main__ - Step 101488: {'lr': 0.00012124907349280268, 'samples': 19485696, 'steps': 101487, 'loss/train': 1.4140886068344116} 11/07/2021 11:28:41 - INFO - __main__ - Step 101489: {'lr': 0.00012124452464449784, 'samples': 19485888, 'steps': 101488, 'loss/train': 1.5666296482086182} 11/07/2021 11:28:42 - INFO - __main__ - Step 101490: {'lr': 0.0001212399758542082, 'samples': 19486080, 'steps': 101489, 'loss/train': 1.1528871059417725} 11/07/2021 11:28:42 - INFO - __main__ - Step 101491: {'lr': 0.00012123542712193586, 'samples': 19486272, 'steps': 101490, 'loss/train': 1.1768789291381836} 11/07/2021 11:28:42 - INFO - __main__ - Step 101492: {'lr': 0.00012123087844768283, 'samples': 19486464, 'steps': 101491, 'loss/train': 1.2712907791137695} 11/07/2021 11:28:43 - INFO - __main__ - Step 101493: {'lr': 0.00012122632983145118, 'samples': 19486656, 'steps': 101492, 'loss/train': 0.7651158571243286} 11/07/2021 11:28:44 - INFO - __main__ - Step 101494: {'lr': 0.00012122178127324298, 'samples': 19486848, 'steps': 101493, 'loss/train': 1.516926646232605} 11/07/2021 11:28:44 - INFO - __main__ - Step 101495: {'lr': 0.00012121723277306024, 'samples': 19487040, 'steps': 101494, 'loss/train': 0.8453449010848999} 11/07/2021 11:28:44 - INFO - __main__ - Step 101496: {'lr': 0.00012121268433090504, 'samples': 19487232, 'steps': 101495, 'loss/train': 1.2101143598556519} 11/07/2021 11:28:45 - INFO - __main__ - Step 101497: {'lr': 0.00012120813594677942, 'samples': 19487424, 'steps': 101496, 'loss/train': 1.0635172128677368} 11/07/2021 11:28:45 - INFO - __main__ - Step 101498: {'lr': 0.00012120358762068543, 'samples': 19487616, 'steps': 101497, 'loss/train': 1.0677815675735474} 11/07/2021 11:28:46 - INFO - __main__ - Step 101499: {'lr': 0.00012119903935262508, 'samples': 19487808, 'steps': 101498, 'loss/train': 1.3835512399673462} 11/07/2021 11:28:47 - INFO - __main__ - Step 101500: {'lr': 0.0001211944911426006, 'samples': 19488000, 'steps': 101499, 'loss/train': 1.323217749595642} 11/07/2021 11:28:47 - INFO - __main__ - Step 101501: {'lr': 0.00012118994299061376, 'samples': 19488192, 'steps': 101500, 'loss/train': 1.124829888343811} 11/07/2021 11:28:48 - INFO - __main__ - Step 101502: {'lr': 0.00012118539489666674, 'samples': 19488384, 'steps': 101501, 'loss/train': 5.386704921722412} 11/07/2021 11:28:48 - INFO - __main__ - Step 101503: {'lr': 0.00012118084686076164, 'samples': 19488576, 'steps': 101502, 'loss/train': 1.3780854940414429} 11/07/2021 11:28:48 - INFO - __main__ - Step 101504: {'lr': 0.00012117629888290044, 'samples': 19488768, 'steps': 101503, 'loss/train': 1.3773183822631836} 11/07/2021 11:28:49 - INFO - __main__ - Step 101505: {'lr': 0.0001211717509630852, 'samples': 19488960, 'steps': 101504, 'loss/train': 5.852468967437744} 11/07/2021 11:28:50 - INFO - __main__ - Step 101506: {'lr': 0.00012116720310131799, 'samples': 19489152, 'steps': 101505, 'loss/train': 5.733624458312988} 11/07/2021 11:28:50 - INFO - __main__ - Step 101507: {'lr': 0.00012116265529760084, 'samples': 19489344, 'steps': 101506, 'loss/train': 1.1787569522857666} 11/07/2021 11:28:50 - INFO - __main__ - Step 101508: {'lr': 0.00012115810755193582, 'samples': 19489536, 'steps': 101507, 'loss/train': 3.8576014041900635} 11/07/2021 11:28:51 - INFO - __main__ - Step 101509: {'lr': 0.00012115355986432497, 'samples': 19489728, 'steps': 101508, 'loss/train': 1.0902458429336548} 11/07/2021 11:28:51 - INFO - __main__ - Step 101510: {'lr': 0.00012114901223477031, 'samples': 19489920, 'steps': 101509, 'loss/train': 1.9750250577926636} 11/07/2021 11:28:52 - INFO - __main__ - Step 101511: {'lr': 0.00012114446466327394, 'samples': 19490112, 'steps': 101510, 'loss/train': 1.229740023612976} 11/07/2021 11:28:52 - INFO - __main__ - Step 101512: {'lr': 0.00012113991714983791, 'samples': 19490304, 'steps': 101511, 'loss/train': 0.9845739006996155} 11/07/2021 11:28:53 - INFO - __main__ - Step 101513: {'lr': 0.00012113536969446432, 'samples': 19490496, 'steps': 101512, 'loss/train': 1.3053691387176514} 11/07/2021 11:28:53 - INFO - __main__ - Step 101514: {'lr': 0.00012113082229715502, 'samples': 19490688, 'steps': 101513, 'loss/train': 0.6290778517723083} 11/07/2021 11:28:53 - INFO - __main__ - Step 101515: {'lr': 0.00012112627495791222, 'samples': 19490880, 'steps': 101514, 'loss/train': 1.417150616645813} 11/07/2021 11:28:55 - INFO - __main__ - Step 101516: {'lr': 0.00012112172767673793, 'samples': 19491072, 'steps': 101515, 'loss/train': 1.3634425401687622} 11/07/2021 11:28:55 - INFO - __main__ - Step 101517: {'lr': 0.00012111718045363419, 'samples': 19491264, 'steps': 101516, 'loss/train': 1.5972731113433838} 11/07/2021 11:28:55 - INFO - __main__ - Step 101518: {'lr': 0.00012111263328860305, 'samples': 19491456, 'steps': 101517, 'loss/train': 1.5952904224395752} 11/07/2021 11:28:56 - INFO - __main__ - Step 101519: {'lr': 0.0001211080861816466, 'samples': 19491648, 'steps': 101518, 'loss/train': 1.4557735919952393} 11/07/2021 11:28:56 - INFO - __main__ - Step 101520: {'lr': 0.00012110353913276681, 'samples': 19491840, 'steps': 101519, 'loss/train': 0.45468446612358093} 11/07/2021 11:28:57 - INFO - __main__ - Step 101521: {'lr': 0.00012109899214196582, 'samples': 19492032, 'steps': 101520, 'loss/train': 0.7153116464614868} 11/07/2021 11:28:58 - INFO - __main__ - Step 101522: {'lr': 0.00012109444520924561, 'samples': 19492224, 'steps': 101521, 'loss/train': 1.170771598815918} 11/07/2021 11:28:58 - INFO - __main__ - Step 101523: {'lr': 0.00012108989833460826, 'samples': 19492416, 'steps': 101522, 'loss/train': 1.5314483642578125} 11/07/2021 11:28:58 - INFO - __main__ - Step 101524: {'lr': 0.0001210853515180558, 'samples': 19492608, 'steps': 101523, 'loss/train': 1.323093295097351} 11/07/2021 11:28:59 - INFO - __main__ - Step 101525: {'lr': 0.00012108080475959032, 'samples': 19492800, 'steps': 101524, 'loss/train': 1.448489785194397} 11/07/2021 11:28:59 - INFO - __main__ - Step 101526: {'lr': 0.00012107625805921391, 'samples': 19492992, 'steps': 101525, 'loss/train': 1.32510507106781} 11/07/2021 11:29:00 - INFO - __main__ - Step 101527: {'lr': 0.00012107171141692847, 'samples': 19493184, 'steps': 101526, 'loss/train': 1.5142477750778198} 11/07/2021 11:29:00 - INFO - __main__ - Step 101528: {'lr': 0.00012106716483273614, 'samples': 19493376, 'steps': 101527, 'loss/train': 1.483933448791504} 11/07/2021 11:29:01 - INFO - __main__ - Step 101529: {'lr': 0.00012106261830663892, 'samples': 19493568, 'steps': 101528, 'loss/train': 0.8413668870925903} 11/07/2021 11:29:01 - INFO - __main__ - Step 101530: {'lr': 0.0001210580718386389, 'samples': 19493760, 'steps': 101529, 'loss/train': 1.4191949367523193} 11/07/2021 11:29:01 - INFO - __main__ - Step 101531: {'lr': 0.00012105352542873815, 'samples': 19493952, 'steps': 101530, 'loss/train': 1.302381157875061} 11/07/2021 11:29:02 - INFO - __main__ - Step 101532: {'lr': 0.00012104897907693869, 'samples': 19494144, 'steps': 101531, 'loss/train': 2.09073805809021} 11/07/2021 11:29:03 - INFO - __main__ - Step 101533: {'lr': 0.00012104443278324254, 'samples': 19494336, 'steps': 101532, 'loss/train': 1.6573543548583984} 11/07/2021 11:29:03 - INFO - __main__ - Step 101534: {'lr': 0.0001210398865476518, 'samples': 19494528, 'steps': 101533, 'loss/train': 1.2373849153518677} 11/07/2021 11:29:03 - INFO - __main__ - Step 101535: {'lr': 0.0001210353403701685, 'samples': 19494720, 'steps': 101534, 'loss/train': 1.3892375230789185} 11/07/2021 11:29:04 - INFO - __main__ - Step 101536: {'lr': 0.00012103079425079466, 'samples': 19494912, 'steps': 101535, 'loss/train': 1.5014960765838623} 11/07/2021 11:29:05 - INFO - __main__ - Step 101537: {'lr': 0.00012102624818953239, 'samples': 19495104, 'steps': 101536, 'loss/train': 1.4013605117797852} 11/07/2021 11:29:05 - INFO - __main__ - Step 101538: {'lr': 0.00012102170218638367, 'samples': 19495296, 'steps': 101537, 'loss/train': 1.3084635734558105} 11/07/2021 11:29:05 - INFO - __main__ - Step 101539: {'lr': 0.0001210171562413506, 'samples': 19495488, 'steps': 101538, 'loss/train': 1.092158555984497} 11/07/2021 11:29:06 - INFO - __main__ - Step 101540: {'lr': 0.00012101261035443531, 'samples': 19495680, 'steps': 101539, 'loss/train': 0.8409503102302551} 11/07/2021 11:29:06 - INFO - __main__ - Step 101541: {'lr': 0.00012100806452563965, 'samples': 19495872, 'steps': 101540, 'loss/train': 0.6800819635391235} 11/07/2021 11:29:07 - INFO - __main__ - Step 101542: {'lr': 0.00012100351875496573, 'samples': 19496064, 'steps': 101541, 'loss/train': 0.8568156361579895} 11/07/2021 11:29:07 - INFO - __main__ - Step 101543: {'lr': 0.00012099897304241567, 'samples': 19496256, 'steps': 101542, 'loss/train': 1.3215548992156982} 11/07/2021 11:29:08 - INFO - __main__ - Step 101544: {'lr': 0.0001209944273879915, 'samples': 19496448, 'steps': 101543, 'loss/train': 1.3416786193847656} 11/07/2021 11:29:08 - INFO - __main__ - Step 101545: {'lr': 0.00012098988179169521, 'samples': 19496640, 'steps': 101544, 'loss/train': 1.1919176578521729} 11/07/2021 11:29:09 - INFO - __main__ - Step 101546: {'lr': 0.0001209853362535289, 'samples': 19496832, 'steps': 101545, 'loss/train': 1.6274733543395996} 11/07/2021 11:29:10 - INFO - __main__ - Step 101547: {'lr': 0.00012098079077349462, 'samples': 19497024, 'steps': 101546, 'loss/train': 1.3964918851852417} 11/07/2021 11:29:10 - INFO - __main__ - Step 101548: {'lr': 0.00012097624535159438, 'samples': 19497216, 'steps': 101547, 'loss/train': 1.5987346172332764} 11/07/2021 11:29:10 - INFO - __main__ - Step 101549: {'lr': 0.00012097169998783025, 'samples': 19497408, 'steps': 101548, 'loss/train': 1.082443356513977} 11/07/2021 11:29:11 - INFO - __main__ - Step 101550: {'lr': 0.00012096715468220431, 'samples': 19497600, 'steps': 101549, 'loss/train': 1.3111296892166138} 11/07/2021 11:29:11 - INFO - __main__ - Step 101551: {'lr': 0.00012096260943471856, 'samples': 19497792, 'steps': 101550, 'loss/train': 1.4010956287384033} 11/07/2021 11:29:12 - INFO - __main__ - Step 101552: {'lr': 0.00012095806424537508, 'samples': 19497984, 'steps': 101551, 'loss/train': 1.124786376953125} 11/07/2021 11:29:12 - INFO - __main__ - Step 101553: {'lr': 0.00012095351911417598, 'samples': 19498176, 'steps': 101552, 'loss/train': 1.5391700267791748} 11/07/2021 11:29:13 - INFO - __main__ - Step 101554: {'lr': 0.00012094897404112317, 'samples': 19498368, 'steps': 101553, 'loss/train': 0.8208988904953003} 11/07/2021 11:29:13 - INFO - __main__ - Step 101555: {'lr': 0.00012094442902621874, 'samples': 19498560, 'steps': 101554, 'loss/train': 0.9298396110534668} 11/07/2021 11:29:13 - INFO - __main__ - Step 101556: {'lr': 0.00012093988406946477, 'samples': 19498752, 'steps': 101555, 'loss/train': 0.9375703930854797} 11/07/2021 11:29:14 - INFO - __main__ - Step 101557: {'lr': 0.00012093533917086328, 'samples': 19498944, 'steps': 101556, 'loss/train': 0.9158123135566711} 11/07/2021 11:29:15 - INFO - __main__ - Step 101558: {'lr': 0.00012093079433041634, 'samples': 19499136, 'steps': 101557, 'loss/train': 0.9471313953399658} 11/07/2021 11:29:15 - INFO - __main__ - Step 101559: {'lr': 0.000120926249548126, 'samples': 19499328, 'steps': 101558, 'loss/train': 1.445584774017334} 11/07/2021 11:29:15 - INFO - __main__ - Step 101560: {'lr': 0.00012092170482399431, 'samples': 19499520, 'steps': 101559, 'loss/train': 1.2717481851577759} 11/07/2021 11:29:16 - INFO - __main__ - Step 101561: {'lr': 0.00012091716015802329, 'samples': 19499712, 'steps': 101560, 'loss/train': 1.6393005847930908} 11/07/2021 11:29:16 - INFO - __main__ - Step 101562: {'lr': 0.00012091261555021499, 'samples': 19499904, 'steps': 101561, 'loss/train': 1.9293545484542847} 11/07/2021 11:29:17 - INFO - __main__ - Step 101563: {'lr': 0.0001209080710005715, 'samples': 19500096, 'steps': 101562, 'loss/train': 0.7173073291778564} 11/07/2021 11:29:17 - INFO - __main__ - Step 101564: {'lr': 0.00012090352650909483, 'samples': 19500288, 'steps': 101563, 'loss/train': 1.2261123657226562} 11/07/2021 11:29:18 - INFO - __main__ - Step 101565: {'lr': 0.00012089898207578706, 'samples': 19500480, 'steps': 101564, 'loss/train': 1.696694016456604} 11/07/2021 11:29:18 - INFO - __main__ - Step 101566: {'lr': 0.0001208944377006502, 'samples': 19500672, 'steps': 101565, 'loss/train': 1.1167770624160767} 11/07/2021 11:29:19 - INFO - __main__ - Step 101567: {'lr': 0.00012088989338368639, 'samples': 19500864, 'steps': 101566, 'loss/train': 1.477180004119873} 11/07/2021 11:29:20 - INFO - __main__ - Step 101568: {'lr': 0.00012088534912489754, 'samples': 19501056, 'steps': 101567, 'loss/train': 1.4413740634918213} 11/07/2021 11:29:20 - INFO - __main__ - Step 101569: {'lr': 0.00012088080492428575, 'samples': 19501248, 'steps': 101568, 'loss/train': 0.8487969040870667} 11/07/2021 11:29:20 - INFO - __main__ - Step 101570: {'lr': 0.00012087626078185307, 'samples': 19501440, 'steps': 101569, 'loss/train': 1.61789870262146} 11/07/2021 11:29:21 - INFO - __main__ - Step 101571: {'lr': 0.00012087171669760155, 'samples': 19501632, 'steps': 101570, 'loss/train': 1.246108055114746} 11/07/2021 11:29:21 - INFO - __main__ - Step 101572: {'lr': 0.00012086717267153325, 'samples': 19501824, 'steps': 101571, 'loss/train': 1.0097432136535645} 11/07/2021 11:29:22 - INFO - __main__ - Step 101573: {'lr': 0.0001208626287036502, 'samples': 19502016, 'steps': 101572, 'loss/train': 1.1854790449142456} 11/07/2021 11:29:23 - INFO - __main__ - Step 101574: {'lr': 0.00012085808479395446, 'samples': 19502208, 'steps': 101573, 'loss/train': 0.901157796382904} 11/07/2021 11:29:23 - INFO - __main__ - Step 101575: {'lr': 0.00012085354094244808, 'samples': 19502400, 'steps': 101574, 'loss/train': 1.6978267431259155} 11/07/2021 11:29:23 - INFO - __main__ - Step 101576: {'lr': 0.00012084899714913311, 'samples': 19502592, 'steps': 101575, 'loss/train': 1.3805408477783203} 11/07/2021 11:29:24 - INFO - __main__ - Step 101577: {'lr': 0.00012084445341401157, 'samples': 19502784, 'steps': 101576, 'loss/train': 1.5961415767669678} 11/07/2021 11:29:25 - INFO - __main__ - Step 101578: {'lr': 0.00012083990973708554, 'samples': 19502976, 'steps': 101577, 'loss/train': 1.049340009689331} 11/07/2021 11:29:25 - INFO - __main__ - Step 101579: {'lr': 0.00012083536611835704, 'samples': 19503168, 'steps': 101578, 'loss/train': 1.5052772760391235} 11/07/2021 11:29:25 - INFO - __main__ - Step 101580: {'lr': 0.00012083082255782824, 'samples': 19503360, 'steps': 101579, 'loss/train': 1.1080831289291382} 11/07/2021 11:29:26 - INFO - __main__ - Step 101581: {'lr': 0.00012082627905550098, 'samples': 19503552, 'steps': 101580, 'loss/train': 1.3122150897979736} 11/07/2021 11:29:26 - INFO - __main__ - Step 101582: {'lr': 0.00012082173561137741, 'samples': 19503744, 'steps': 101581, 'loss/train': 1.4085087776184082} 11/07/2021 11:29:26 - INFO - __main__ - Step 101583: {'lr': 0.00012081719222545955, 'samples': 19503936, 'steps': 101582, 'loss/train': 1.553123116493225} 11/07/2021 11:29:27 - INFO - __main__ - Step 101584: {'lr': 0.00012081264889774948, 'samples': 19504128, 'steps': 101583, 'loss/train': 1.238189458847046} 11/07/2021 11:29:28 - INFO - __main__ - Step 101585: {'lr': 0.00012080810562824926, 'samples': 19504320, 'steps': 101584, 'loss/train': 1.6686677932739258} 11/07/2021 11:29:28 - INFO - __main__ - Step 101586: {'lr': 0.00012080356241696089, 'samples': 19504512, 'steps': 101585, 'loss/train': 0.7979247570037842} 11/07/2021 11:29:28 - INFO - __main__ - Step 101587: {'lr': 0.00012079901926388645, 'samples': 19504704, 'steps': 101586, 'loss/train': 1.1355736255645752} 11/07/2021 11:29:29 - INFO - __main__ - Step 101588: {'lr': 0.00012079447616902798, 'samples': 19504896, 'steps': 101587, 'loss/train': 0.7179237604141235} 11/07/2021 11:29:30 - INFO - __main__ - Step 101589: {'lr': 0.0001207899331323875, 'samples': 19505088, 'steps': 101588, 'loss/train': 1.7639328241348267} 11/07/2021 11:29:30 - INFO - __main__ - Step 101590: {'lr': 0.0001207853901539671, 'samples': 19505280, 'steps': 101589, 'loss/train': 1.469345211982727} 11/07/2021 11:29:30 - INFO - __main__ - Step 101591: {'lr': 0.00012078084723376892, 'samples': 19505472, 'steps': 101590, 'loss/train': 1.1630111932754517} 11/07/2021 11:29:31 - INFO - __main__ - Step 101592: {'lr': 0.00012077630437179479, 'samples': 19505664, 'steps': 101591, 'loss/train': 1.0202275514602661} 11/07/2021 11:29:31 - INFO - __main__ - Step 101593: {'lr': 0.00012077176156804687, 'samples': 19505856, 'steps': 101592, 'loss/train': 0.9454383254051208} 11/07/2021 11:29:32 - INFO - __main__ - Step 101594: {'lr': 0.0001207672188225272, 'samples': 19506048, 'steps': 101593, 'loss/train': 1.3460341691970825} 11/07/2021 11:29:33 - INFO - __main__ - Step 101595: {'lr': 0.00012076267613523781, 'samples': 19506240, 'steps': 101594, 'loss/train': 1.4381632804870605} 11/07/2021 11:29:33 - INFO - __main__ - Step 101596: {'lr': 0.00012075813350618079, 'samples': 19506432, 'steps': 101595, 'loss/train': 0.9442541003227234} 11/07/2021 11:29:33 - INFO - __main__ - Step 101597: {'lr': 0.00012075359093535812, 'samples': 19506624, 'steps': 101596, 'loss/train': 1.2160303592681885} 11/07/2021 11:29:34 - INFO - __main__ - Step 101598: {'lr': 0.00012074904842277193, 'samples': 19506816, 'steps': 101597, 'loss/train': 1.0905345678329468} 11/07/2021 11:29:35 - INFO - __main__ - Step 101599: {'lr': 0.0001207445059684242, 'samples': 19507008, 'steps': 101598, 'loss/train': 1.2893790006637573} 11/07/2021 11:29:35 - INFO - __main__ - Step 101600: {'lr': 0.00012073996357231701, 'samples': 19507200, 'steps': 101599, 'loss/train': 1.6244579553604126} 11/07/2021 11:29:35 - INFO - __main__ - Step 101601: {'lr': 0.00012073542123445239, 'samples': 19507392, 'steps': 101600, 'loss/train': 1.3317735195159912} 11/07/2021 11:29:36 - INFO - __main__ - Step 101602: {'lr': 0.0001207308789548325, 'samples': 19507584, 'steps': 101601, 'loss/train': 1.0219528675079346} 11/07/2021 11:29:36 - INFO - __main__ - Step 101603: {'lr': 0.00012072633673345917, 'samples': 19507776, 'steps': 101602, 'loss/train': 0.7133709788322449} 11/07/2021 11:29:37 - INFO - __main__ - Step 101604: {'lr': 0.00012072179457033458, 'samples': 19507968, 'steps': 101603, 'loss/train': 1.9419687986373901} 11/07/2021 11:29:37 - INFO - __main__ - Step 101605: {'lr': 0.00012071725246546073, 'samples': 19508160, 'steps': 101604, 'loss/train': 1.2629625797271729} 11/07/2021 11:29:38 - INFO - __main__ - Step 101606: {'lr': 0.00012071271041883971, 'samples': 19508352, 'steps': 101605, 'loss/train': 1.317296028137207} 11/07/2021 11:29:38 - INFO - __main__ - Step 101607: {'lr': 0.00012070816843047356, 'samples': 19508544, 'steps': 101606, 'loss/train': 1.566638708114624} 11/07/2021 11:29:38 - INFO - __main__ - Step 101608: {'lr': 0.0001207036265003643, 'samples': 19508736, 'steps': 101607, 'loss/train': 1.509781837463379} 11/07/2021 11:29:40 - INFO - __main__ - Step 101609: {'lr': 0.00012069908462851394, 'samples': 19508928, 'steps': 101608, 'loss/train': 1.4074256420135498} 11/07/2021 11:29:40 - INFO - __main__ - Step 101610: {'lr': 0.00012069454281492465, 'samples': 19509120, 'steps': 101609, 'loss/train': 1.188350796699524} 11/07/2021 11:29:41 - INFO - __main__ - Step 101611: {'lr': 0.00012069000105959837, 'samples': 19509312, 'steps': 101610, 'loss/train': 1.6692931652069092} 11/07/2021 11:29:41 - INFO - __main__ - Step 101612: {'lr': 0.00012068545936253728, 'samples': 19509504, 'steps': 101611, 'loss/train': 0.2661668658256531} 11/07/2021 11:29:41 - INFO - __main__ - Step 101613: {'lr': 0.00012068091772374323, 'samples': 19509696, 'steps': 101612, 'loss/train': 1.5952606201171875} 11/07/2021 11:29:42 - INFO - __main__ - Step 101614: {'lr': 0.00012067637614321839, 'samples': 19509888, 'steps': 101613, 'loss/train': 1.4724256992340088} 11/07/2021 11:29:43 - INFO - __main__ - Step 101615: {'lr': 0.00012067183462096473, 'samples': 19510080, 'steps': 101614, 'loss/train': 1.169934868812561} 11/07/2021 11:29:43 - INFO - __main__ - Step 101616: {'lr': 0.00012066729315698438, 'samples': 19510272, 'steps': 101615, 'loss/train': 1.3742239475250244} 11/07/2021 11:29:43 - INFO - __main__ - Step 101617: {'lr': 0.00012066275175127935, 'samples': 19510464, 'steps': 101616, 'loss/train': 1.3655316829681396} 11/07/2021 11:29:44 - INFO - __main__ - Step 101618: {'lr': 0.00012065821040385169, 'samples': 19510656, 'steps': 101617, 'loss/train': 1.3469009399414062} 11/07/2021 11:29:45 - INFO - __main__ - Step 101619: {'lr': 0.00012065366911470343, 'samples': 19510848, 'steps': 101618, 'loss/train': 1.2583030462265015} 11/07/2021 11:29:45 - INFO - __main__ - Step 101620: {'lr': 0.00012064912788383663, 'samples': 19511040, 'steps': 101619, 'loss/train': 1.4929759502410889} 11/07/2021 11:29:45 - INFO - __main__ - Step 101621: {'lr': 0.00012064458671125336, 'samples': 19511232, 'steps': 101620, 'loss/train': 0.3383299708366394} 11/07/2021 11:29:46 - INFO - __main__ - Step 101622: {'lr': 0.00012064004559695562, 'samples': 19511424, 'steps': 101621, 'loss/train': 0.5188581347465515} 11/07/2021 11:29:46 - INFO - __main__ - Step 101623: {'lr': 0.00012063550454094558, 'samples': 19511616, 'steps': 101622, 'loss/train': 1.4245326519012451} 11/07/2021 11:29:47 - INFO - __main__ - Step 101624: {'lr': 0.00012063096354322508, 'samples': 19511808, 'steps': 101623, 'loss/train': 1.3202937841415405} 11/07/2021 11:29:48 - INFO - __main__ - Step 101625: {'lr': 0.0001206264226037963, 'samples': 19512000, 'steps': 101624, 'loss/train': 0.9840372800827026} 11/07/2021 11:29:48 - INFO - __main__ - Step 101626: {'lr': 0.00012062188172266123, 'samples': 19512192, 'steps': 101625, 'loss/train': 0.7343273758888245} 11/07/2021 11:29:48 - INFO - __main__ - Step 101627: {'lr': 0.00012061734089982196, 'samples': 19512384, 'steps': 101626, 'loss/train': 1.0608192682266235} 11/07/2021 11:29:49 - INFO - __main__ - Step 101628: {'lr': 0.00012061280013528053, 'samples': 19512576, 'steps': 101627, 'loss/train': 0.8607659339904785} 11/07/2021 11:29:49 - INFO - __main__ - Step 101629: {'lr': 0.00012060825942903894, 'samples': 19512768, 'steps': 101628, 'loss/train': 0.6824367046356201} 11/07/2021 11:29:50 - INFO - __main__ - Step 101630: {'lr': 0.0001206037187810993, 'samples': 19512960, 'steps': 101629, 'loss/train': 1.834912896156311} 11/07/2021 11:29:50 - INFO - __main__ - Step 101631: {'lr': 0.00012059917819146362, 'samples': 19513152, 'steps': 101630, 'loss/train': 0.8665695786476135} 11/07/2021 11:29:51 - INFO - __main__ - Step 101632: {'lr': 0.00012059463766013396, 'samples': 19513344, 'steps': 101631, 'loss/train': 1.3208975791931152} 11/07/2021 11:29:51 - INFO - __main__ - Step 101633: {'lr': 0.00012059009718711233, 'samples': 19513536, 'steps': 101632, 'loss/train': 1.754284381866455} 11/07/2021 11:29:51 - INFO - __main__ - Step 101634: {'lr': 0.00012058555677240093, 'samples': 19513728, 'steps': 101633, 'loss/train': 1.5284525156021118} 11/07/2021 11:29:52 - INFO - __main__ - Step 101635: {'lr': 0.00012058101641600158, 'samples': 19513920, 'steps': 101634, 'loss/train': 1.3947169780731201} 11/07/2021 11:29:53 - INFO - __main__ - Step 101636: {'lr': 0.00012057647611791645, 'samples': 19514112, 'steps': 101635, 'loss/train': 1.7242491245269775} 11/07/2021 11:29:53 - INFO - __main__ - Step 101637: {'lr': 0.00012057193587814752, 'samples': 19514304, 'steps': 101636, 'loss/train': 1.707757592201233} 11/07/2021 11:29:53 - INFO - __main__ - Step 101638: {'lr': 0.00012056739569669688, 'samples': 19514496, 'steps': 101637, 'loss/train': 1.3556504249572754} 11/07/2021 11:29:54 - INFO - __main__ - Step 101639: {'lr': 0.00012056285557356661, 'samples': 19514688, 'steps': 101638, 'loss/train': 1.361133337020874} 11/07/2021 11:29:55 - INFO - __main__ - Step 101640: {'lr': 0.0001205583155087587, 'samples': 19514880, 'steps': 101639, 'loss/train': 1.220310926437378} 11/07/2021 11:29:55 - INFO - __main__ - Step 101641: {'lr': 0.00012055377550227523, 'samples': 19515072, 'steps': 101640, 'loss/train': 1.0809787511825562} 11/07/2021 11:29:56 - INFO - __main__ - Step 101642: {'lr': 0.0001205492355541182, 'samples': 19515264, 'steps': 101641, 'loss/train': 1.2920339107513428} 11/07/2021 11:29:56 - INFO - __main__ - Step 101643: {'lr': 0.00012054469566428971, 'samples': 19515456, 'steps': 101642, 'loss/train': 1.31638503074646} 11/07/2021 11:29:56 - INFO - __main__ - Step 101644: {'lr': 0.00012054015583279179, 'samples': 19515648, 'steps': 101643, 'loss/train': 1.3766744136810303} 11/07/2021 11:29:57 - INFO - __main__ - Step 101645: {'lr': 0.00012053561605962646, 'samples': 19515840, 'steps': 101644, 'loss/train': 1.7784583568572998} 11/07/2021 11:29:58 - INFO - __main__ - Step 101646: {'lr': 0.00012053107634479579, 'samples': 19516032, 'steps': 101645, 'loss/train': 0.8352782726287842} 11/07/2021 11:29:58 - INFO - __main__ - Step 101647: {'lr': 0.0001205265366883019, 'samples': 19516224, 'steps': 101646, 'loss/train': 1.3794269561767578} 11/07/2021 11:29:58 - INFO - __main__ - Step 101648: {'lr': 0.00012052199709014669, 'samples': 19516416, 'steps': 101647, 'loss/train': 1.2024751901626587} 11/07/2021 11:29:59 - INFO - __main__ - Step 101649: {'lr': 0.00012051745755033224, 'samples': 19516608, 'steps': 101648, 'loss/train': 1.3997093439102173} 11/07/2021 11:29:59 - INFO - __main__ - Step 101650: {'lr': 0.00012051291806886067, 'samples': 19516800, 'steps': 101649, 'loss/train': 1.166671872138977} 11/07/2021 11:30:00 - INFO - __main__ - Step 101651: {'lr': 0.00012050837864573394, 'samples': 19516992, 'steps': 101650, 'loss/train': 1.1302663087844849} 11/07/2021 11:30:00 - INFO - __main__ - Step 101652: {'lr': 0.00012050383928095415, 'samples': 19517184, 'steps': 101651, 'loss/train': 1.4004770517349243} 11/07/2021 11:30:01 - INFO - __main__ - Step 101653: {'lr': 0.00012049929997452333, 'samples': 19517376, 'steps': 101652, 'loss/train': 1.0751570463180542} 11/07/2021 11:30:01 - INFO - __main__ - Step 101654: {'lr': 0.00012049476072644352, 'samples': 19517568, 'steps': 101653, 'loss/train': 1.2734516859054565} 11/07/2021 11:30:01 - INFO - __main__ - Step 101655: {'lr': 0.00012049022153671677, 'samples': 19517760, 'steps': 101654, 'loss/train': 1.1067816019058228} 11/07/2021 11:30:03 - INFO - __main__ - Step 101656: {'lr': 0.00012048568240534513, 'samples': 19517952, 'steps': 101655, 'loss/train': 0.9169591069221497} 11/07/2021 11:30:03 - INFO - __main__ - Step 101657: {'lr': 0.00012048114333233065, 'samples': 19518144, 'steps': 101656, 'loss/train': 1.570241093635559} 11/07/2021 11:30:03 - INFO - __main__ - Step 101658: {'lr': 0.00012047660431767537, 'samples': 19518336, 'steps': 101657, 'loss/train': 1.3621925115585327} 11/07/2021 11:30:04 - INFO - __main__ - Step 101659: {'lr': 0.00012047206536138133, 'samples': 19518528, 'steps': 101658, 'loss/train': 1.5165202617645264} 11/07/2021 11:30:04 - INFO - __main__ - Step 101660: {'lr': 0.00012046752646345058, 'samples': 19518720, 'steps': 101659, 'loss/train': 0.06262029707431793} 11/07/2021 11:30:04 - INFO - __main__ - Step 101661: {'lr': 0.00012046298762388527, 'samples': 19518912, 'steps': 101660, 'loss/train': 1.2969911098480225} 11/07/2021 11:30:05 - INFO - __main__ - Step 101662: {'lr': 0.00012045844884268722, 'samples': 19519104, 'steps': 101661, 'loss/train': 1.1789630651474} 11/07/2021 11:30:06 - INFO - __main__ - Step 101663: {'lr': 0.00012045391011985862, 'samples': 19519296, 'steps': 101662, 'loss/train': 1.5958768129348755} 11/07/2021 11:30:06 - INFO - __main__ - Step 101664: {'lr': 0.00012044937145540147, 'samples': 19519488, 'steps': 101663, 'loss/train': 1.4118643999099731} 11/07/2021 11:30:06 - INFO - __main__ - Step 101665: {'lr': 0.00012044483284931785, 'samples': 19519680, 'steps': 101664, 'loss/train': 1.4952062368392944} 11/07/2021 11:30:07 - INFO - __main__ - Step 101666: {'lr': 0.00012044029430160977, 'samples': 19519872, 'steps': 101665, 'loss/train': 1.2834768295288086} 11/07/2021 11:30:08 - INFO - __main__ - Step 101667: {'lr': 0.00012043575581227928, 'samples': 19520064, 'steps': 101666, 'loss/train': 1.356339454650879} 11/07/2021 11:30:08 - INFO - __main__ - Step 101668: {'lr': 0.00012043121738132847, 'samples': 19520256, 'steps': 101667, 'loss/train': 1.017645239830017} 11/07/2021 11:30:09 - INFO - __main__ - Step 101669: {'lr': 0.00012042667900875934, 'samples': 19520448, 'steps': 101668, 'loss/train': 0.696768581867218} 11/07/2021 11:30:09 - INFO - __main__ - Step 101670: {'lr': 0.00012042214069457397, 'samples': 19520640, 'steps': 101669, 'loss/train': 1.0168360471725464} 11/07/2021 11:30:09 - INFO - __main__ - Step 101671: {'lr': 0.00012041760243877436, 'samples': 19520832, 'steps': 101670, 'loss/train': 1.3991388082504272} 11/07/2021 11:30:10 - INFO - __main__ - Step 101672: {'lr': 0.00012041306424136258, 'samples': 19521024, 'steps': 101671, 'loss/train': 1.378111720085144} 11/07/2021 11:30:11 - INFO - __main__ - Step 101673: {'lr': 0.00012040852610234068, 'samples': 19521216, 'steps': 101672, 'loss/train': 1.853860855102539} 11/07/2021 11:30:11 - INFO - __main__ - Step 101674: {'lr': 0.0001204039880217108, 'samples': 19521408, 'steps': 101673, 'loss/train': 1.7430802583694458} 11/07/2021 11:30:11 - INFO - __main__ - Step 101675: {'lr': 0.00012039944999947477, 'samples': 19521600, 'steps': 101674, 'loss/train': 1.124882459640503} 11/07/2021 11:30:12 - INFO - __main__ - Step 101676: {'lr': 0.00012039491203563477, 'samples': 19521792, 'steps': 101675, 'loss/train': 1.660386323928833} 11/07/2021 11:30:13 - INFO - __main__ - Step 101677: {'lr': 0.00012039037413019283, 'samples': 19521984, 'steps': 101676, 'loss/train': 1.099653959274292} 11/07/2021 11:30:13 - INFO - __main__ - Step 101678: {'lr': 0.00012038583628315097, 'samples': 19522176, 'steps': 101677, 'loss/train': 0.8626993894577026} 11/07/2021 11:30:13 - INFO - __main__ - Step 101679: {'lr': 0.00012038129849451124, 'samples': 19522368, 'steps': 101678, 'loss/train': 1.1109845638275146} 11/07/2021 11:30:14 - INFO - __main__ - Step 101680: {'lr': 0.0001203767607642757, 'samples': 19522560, 'steps': 101679, 'loss/train': 1.4386966228485107} 11/07/2021 11:30:14 - INFO - __main__ - Step 101681: {'lr': 0.00012037222309244642, 'samples': 19522752, 'steps': 101680, 'loss/train': 1.6819778680801392} 11/07/2021 11:30:15 - INFO - __main__ - Step 101682: {'lr': 0.00012036768547902538, 'samples': 19522944, 'steps': 101681, 'loss/train': 1.330645203590393} 11/07/2021 11:30:16 - INFO - __main__ - Step 101683: {'lr': 0.00012036314792401467, 'samples': 19523136, 'steps': 101682, 'loss/train': 1.3834847211837769} 11/07/2021 11:30:16 - INFO - __main__ - Step 101684: {'lr': 0.00012035861042741635, 'samples': 19523328, 'steps': 101683, 'loss/train': 0.7844523787498474} 11/07/2021 11:30:16 - INFO - __main__ - Step 101685: {'lr': 0.00012035407298923242, 'samples': 19523520, 'steps': 101684, 'loss/train': 1.3659939765930176} 11/07/2021 11:30:17 - INFO - __main__ - Step 101686: {'lr': 0.00012034953560946497, 'samples': 19523712, 'steps': 101685, 'loss/train': 1.5509204864501953} 11/07/2021 11:30:18 - INFO - __main__ - Step 101687: {'lr': 0.00012034499828811599, 'samples': 19523904, 'steps': 101686, 'loss/train': 1.3843713998794556} 11/07/2021 11:30:18 - INFO - __main__ - Step 101688: {'lr': 0.00012034046102518765, 'samples': 19524096, 'steps': 101687, 'loss/train': 1.069615125656128} 11/07/2021 11:30:18 - INFO - __main__ - Step 101689: {'lr': 0.00012033592382068178, 'samples': 19524288, 'steps': 101688, 'loss/train': 1.203984260559082} 11/07/2021 11:30:19 - INFO - __main__ - Step 101690: {'lr': 0.00012033138667460058, 'samples': 19524480, 'steps': 101689, 'loss/train': 1.744284987449646} 11/07/2021 11:30:19 - INFO - __main__ - Step 101691: {'lr': 0.00012032684958694604, 'samples': 19524672, 'steps': 101690, 'loss/train': 1.5116146802902222} 11/07/2021 11:30:19 - INFO - __main__ - Step 101692: {'lr': 0.00012032231255772022, 'samples': 19524864, 'steps': 101691, 'loss/train': 1.344815969467163} 11/07/2021 11:30:20 - INFO - __main__ - Step 101693: {'lr': 0.00012031777558692516, 'samples': 19525056, 'steps': 101692, 'loss/train': 1.7722703218460083} 11/07/2021 11:30:21 - INFO - __main__ - Step 101694: {'lr': 0.00012031323867456293, 'samples': 19525248, 'steps': 101693, 'loss/train': 1.1879749298095703} 11/07/2021 11:30:21 - INFO - __main__ - Step 101695: {'lr': 0.00012030870182063556, 'samples': 19525440, 'steps': 101694, 'loss/train': 1.3945860862731934} 11/07/2021 11:30:22 - INFO - __main__ - Step 101696: {'lr': 0.00012030416502514504, 'samples': 19525632, 'steps': 101695, 'loss/train': 1.2623049020767212} 11/07/2021 11:30:22 - INFO - __main__ - Step 101697: {'lr': 0.00012029962828809352, 'samples': 19525824, 'steps': 101696, 'loss/train': 1.3673449754714966} 11/07/2021 11:30:23 - INFO - __main__ - Step 101698: {'lr': 0.00012029509160948294, 'samples': 19526016, 'steps': 101697, 'loss/train': 0.7168515920639038} 11/07/2021 11:30:23 - INFO - __main__ - Step 101699: {'lr': 0.0001202905549893154, 'samples': 19526208, 'steps': 101698, 'loss/train': 1.4214731454849243} 11/07/2021 11:30:24 - INFO - __main__ - Step 101700: {'lr': 0.00012028601842759295, 'samples': 19526400, 'steps': 101699, 'loss/train': 2.1640450954437256} 11/07/2021 11:30:24 - INFO - __main__ - Step 101701: {'lr': 0.00012028148192431771, 'samples': 19526592, 'steps': 101700, 'loss/train': 0.41969749331474304} 11/07/2021 11:30:24 - INFO - __main__ - Step 101702: {'lr': 0.00012027694547949153, 'samples': 19526784, 'steps': 101701, 'loss/train': 1.4800668954849243} 11/07/2021 11:30:25 - INFO - __main__ - Step 101703: {'lr': 0.00012027240909311656, 'samples': 19526976, 'steps': 101702, 'loss/train': 1.1571052074432373} 11/07/2021 11:30:26 - INFO - __main__ - Step 101704: {'lr': 0.00012026787276519485, 'samples': 19527168, 'steps': 101703, 'loss/train': 1.7514870166778564} 11/07/2021 11:30:26 - INFO - __main__ - Step 101705: {'lr': 0.0001202633364957284, 'samples': 19527360, 'steps': 101704, 'loss/train': 1.2548916339874268} 11/07/2021 11:30:26 - INFO - __main__ - Step 101706: {'lr': 0.00012025880028471934, 'samples': 19527552, 'steps': 101705, 'loss/train': 1.4776819944381714} 11/07/2021 11:30:27 - INFO - __main__ - Step 101707: {'lr': 0.00012025426413216963, 'samples': 19527744, 'steps': 101706, 'loss/train': 1.4115800857543945} 11/07/2021 11:30:28 - INFO - __main__ - Step 101708: {'lr': 0.00012024972803808135, 'samples': 19527936, 'steps': 101707, 'loss/train': 0.9203571081161499} 11/07/2021 11:30:28 - INFO - __main__ - Step 101709: {'lr': 0.00012024519200245653, 'samples': 19528128, 'steps': 101708, 'loss/train': 1.5872982740402222} 11/07/2021 11:30:29 - INFO - __main__ - Step 101710: {'lr': 0.00012024065602529724, 'samples': 19528320, 'steps': 101709, 'loss/train': 1.3702741861343384} 11/07/2021 11:30:29 - INFO - __main__ - Step 101711: {'lr': 0.00012023612010660551, 'samples': 19528512, 'steps': 101710, 'loss/train': 1.6273910999298096} 11/07/2021 11:30:29 - INFO - __main__ - Step 101712: {'lr': 0.00012023158424638339, 'samples': 19528704, 'steps': 101711, 'loss/train': 1.0407708883285522} 11/07/2021 11:30:30 - INFO - __main__ - Step 101713: {'lr': 0.0001202270484446329, 'samples': 19528896, 'steps': 101712, 'loss/train': 0.9991216063499451} 11/07/2021 11:30:31 - INFO - __main__ - Step 101714: {'lr': 0.00012022251270135609, 'samples': 19529088, 'steps': 101713, 'loss/train': 1.5596179962158203} 11/07/2021 11:30:31 - INFO - __main__ - Step 101715: {'lr': 0.00012021797701655512, 'samples': 19529280, 'steps': 101714, 'loss/train': 1.4701738357543945} 11/07/2021 11:30:31 - INFO - __main__ - Step 101716: {'lr': 0.00012021344139023186, 'samples': 19529472, 'steps': 101715, 'loss/train': 1.4104139804840088} 11/07/2021 11:30:32 - INFO - __main__ - Step 101717: {'lr': 0.00012020890582238838, 'samples': 19529664, 'steps': 101716, 'loss/train': 1.410091519355774} 11/07/2021 11:30:33 - INFO - __main__ - Step 101718: {'lr': 0.00012020437031302677, 'samples': 19529856, 'steps': 101717, 'loss/train': 1.4052821397781372} 11/07/2021 11:30:33 - INFO - __main__ - Step 101719: {'lr': 0.00012019983486214908, 'samples': 19530048, 'steps': 101718, 'loss/train': 1.218915581703186} 11/07/2021 11:30:34 - INFO - __main__ - Step 101720: {'lr': 0.00012019529946975733, 'samples': 19530240, 'steps': 101719, 'loss/train': 1.1883697509765625} 11/07/2021 11:30:34 - INFO - __main__ - Step 101721: {'lr': 0.00012019076413585359, 'samples': 19530432, 'steps': 101720, 'loss/train': 1.3242861032485962} 11/07/2021 11:30:34 - INFO - __main__ - Step 101722: {'lr': 0.00012018622886043987, 'samples': 19530624, 'steps': 101721, 'loss/train': 1.3958263397216797} 11/07/2021 11:30:35 - INFO - __main__ - Step 101723: {'lr': 0.00012018169364351824, 'samples': 19530816, 'steps': 101722, 'loss/train': 1.1716638803482056} 11/07/2021 11:30:36 - INFO - __main__ - Step 101724: {'lr': 0.00012017715848509076, 'samples': 19531008, 'steps': 101723, 'loss/train': 1.6820766925811768} 11/07/2021 11:30:36 - INFO - __main__ - Step 101725: {'lr': 0.00012017262338515941, 'samples': 19531200, 'steps': 101724, 'loss/train': 0.9223925471305847} 11/07/2021 11:30:36 - INFO - __main__ - Step 101726: {'lr': 0.0001201680883437263, 'samples': 19531392, 'steps': 101725, 'loss/train': 1.4137554168701172} 11/07/2021 11:30:37 - INFO - __main__ - Step 101727: {'lr': 0.00012016355336079343, 'samples': 19531584, 'steps': 101726, 'loss/train': 1.8294931650161743} 11/07/2021 11:30:37 - INFO - __main__ - Step 101728: {'lr': 0.00012015901843636295, 'samples': 19531776, 'steps': 101727, 'loss/train': 0.9100181460380554} 11/07/2021 11:30:37 - INFO - __main__ - Step 101729: {'lr': 0.00012015448357043673, 'samples': 19531968, 'steps': 101728, 'loss/train': 1.296576976776123} 11/07/2021 11:30:38 - INFO - __main__ - Step 101730: {'lr': 0.00012014994876301691, 'samples': 19532160, 'steps': 101729, 'loss/train': 1.3983951807022095} 11/07/2021 11:30:39 - INFO - __main__ - Step 101731: {'lr': 0.0001201454140141055, 'samples': 19532352, 'steps': 101730, 'loss/train': 1.2777348756790161} 11/07/2021 11:30:39 - INFO - __main__ - Step 101732: {'lr': 0.00012014087932370457, 'samples': 19532544, 'steps': 101731, 'loss/train': 1.306418776512146} 11/07/2021 11:30:39 - INFO - __main__ - Step 101733: {'lr': 0.00012013634469181614, 'samples': 19532736, 'steps': 101732, 'loss/train': 1.006213665008545} 11/07/2021 11:30:40 - INFO - __main__ - Step 101734: {'lr': 0.00012013181011844229, 'samples': 19532928, 'steps': 101733, 'loss/train': 1.0211716890335083} 11/07/2021 11:30:41 - INFO - __main__ - Step 101735: {'lr': 0.00012012727560358502, 'samples': 19533120, 'steps': 101734, 'loss/train': 1.3672387599945068} 11/07/2021 11:30:41 - INFO - __main__ - Step 101736: {'lr': 0.00012012274114724641, 'samples': 19533312, 'steps': 101735, 'loss/train': 1.2993792295455933} 11/07/2021 11:30:42 - INFO - __main__ - Step 101737: {'lr': 0.0001201182067494285, 'samples': 19533504, 'steps': 101736, 'loss/train': 1.541221022605896} 11/07/2021 11:30:42 - INFO - __main__ - Step 101738: {'lr': 0.00012011367241013329, 'samples': 19533696, 'steps': 101737, 'loss/train': 1.165432095527649} 11/07/2021 11:30:42 - INFO - __main__ - Step 101739: {'lr': 0.00012010913812936289, 'samples': 19533888, 'steps': 101738, 'loss/train': 1.2568132877349854} 11/07/2021 11:30:43 - INFO - __main__ - Step 101740: {'lr': 0.00012010460390711927, 'samples': 19534080, 'steps': 101739, 'loss/train': 1.3802480697631836} 11/07/2021 11:30:44 - INFO - __main__ - Step 101741: {'lr': 0.00012010006974340454, 'samples': 19534272, 'steps': 101740, 'loss/train': 5.6946539878845215} 11/07/2021 11:30:44 - INFO - __main__ - Step 101742: {'lr': 0.00012009553563822081, 'samples': 19534464, 'steps': 101741, 'loss/train': 1.2843644618988037} 11/07/2021 11:30:44 - INFO - __main__ - Step 101743: {'lr': 0.00012009100159156993, 'samples': 19534656, 'steps': 101742, 'loss/train': 1.3331208229064941} 11/07/2021 11:30:45 - INFO - __main__ - Step 101744: {'lr': 0.00012008646760345405, 'samples': 19534848, 'steps': 101743, 'loss/train': 1.5226422548294067} 11/07/2021 11:30:45 - INFO - __main__ - Step 101745: {'lr': 0.00012008193367387518, 'samples': 19535040, 'steps': 101744, 'loss/train': 1.5754610300064087} 11/07/2021 11:30:46 - INFO - __main__ - Step 101746: {'lr': 0.00012007739980283539, 'samples': 19535232, 'steps': 101745, 'loss/train': 1.3124932050704956} 11/07/2021 11:30:47 - INFO - __main__ - Step 101747: {'lr': 0.0001200728659903367, 'samples': 19535424, 'steps': 101746, 'loss/train': 0.953819215297699} 11/07/2021 11:30:47 - INFO - __main__ - Step 101748: {'lr': 0.00012006833223638122, 'samples': 19535616, 'steps': 101747, 'loss/train': 1.3394216299057007} 11/07/2021 11:30:47 - INFO - __main__ - Step 101749: {'lr': 0.0001200637985409709, 'samples': 19535808, 'steps': 101748, 'loss/train': 1.4828088283538818} 11/07/2021 11:30:48 - INFO - __main__ - Step 101750: {'lr': 0.00012005926490410784, 'samples': 19536000, 'steps': 101749, 'loss/train': 0.9128353595733643} 11/07/2021 11:30:49 - INFO - __main__ - Step 101751: {'lr': 0.00012005473132579409, 'samples': 19536192, 'steps': 101750, 'loss/train': 1.6653947830200195} 11/07/2021 11:30:49 - INFO - __main__ - Step 101752: {'lr': 0.00012005019780603166, 'samples': 19536384, 'steps': 101751, 'loss/train': 1.5827263593673706} 11/07/2021 11:30:49 - INFO - __main__ - Step 101753: {'lr': 0.00012004566434482261, 'samples': 19536576, 'steps': 101752, 'loss/train': 1.7789597511291504} 11/07/2021 11:30:50 - INFO - __main__ - Step 101754: {'lr': 0.00012004113094216898, 'samples': 19536768, 'steps': 101753, 'loss/train': 2.1444358825683594} 11/07/2021 11:30:50 - INFO - __main__ - Step 101755: {'lr': 0.00012003659759807289, 'samples': 19536960, 'steps': 101754, 'loss/train': 1.3387045860290527} 11/07/2021 11:30:51 - INFO - __main__ - Step 101756: {'lr': 0.00012003206431253622, 'samples': 19537152, 'steps': 101755, 'loss/train': 0.5140122771263123} 11/07/2021 11:30:52 - INFO - __main__ - Step 101757: {'lr': 0.00012002753108556108, 'samples': 19537344, 'steps': 101756, 'loss/train': 1.3464858531951904} 11/07/2021 11:30:52 - INFO - __main__ - Step 101758: {'lr': 0.00012002299791714955, 'samples': 19537536, 'steps': 101757, 'loss/train': 1.3301571607589722} 11/07/2021 11:30:52 - INFO - __main__ - Step 101759: {'lr': 0.00012001846480730366, 'samples': 19537728, 'steps': 101758, 'loss/train': 0.8282419443130493} 11/07/2021 11:30:53 - INFO - __main__ - Step 101760: {'lr': 0.00012001393175602543, 'samples': 19537920, 'steps': 101759, 'loss/train': 1.393606424331665} 11/07/2021 11:30:54 - INFO - __main__ - Step 101761: {'lr': 0.0001200093987633169, 'samples': 19538112, 'steps': 101760, 'loss/train': 1.6905252933502197} 11/07/2021 11:30:54 - INFO - __main__ - Step 101762: {'lr': 0.00012000486582918013, 'samples': 19538304, 'steps': 101761, 'loss/train': 1.7033089399337769} 11/07/2021 11:30:54 - INFO - __main__ - Step 101763: {'lr': 0.00012000033295361721, 'samples': 19538496, 'steps': 101762, 'loss/train': 1.446099042892456} 11/07/2021 11:30:55 - INFO - __main__ - Step 101764: {'lr': 0.00011999580013663008, 'samples': 19538688, 'steps': 101763, 'loss/train': 1.3005326986312866} 11/07/2021 11:30:55 - INFO - __main__ - Step 101765: {'lr': 0.00011999126737822085, 'samples': 19538880, 'steps': 101764, 'loss/train': 1.0414793491363525} 11/07/2021 11:30:56 - INFO - __main__ - Step 101766: {'lr': 0.00011998673467839155, 'samples': 19539072, 'steps': 101765, 'loss/train': 1.2631210088729858} 11/07/2021 11:30:56 - INFO - __main__ - Step 101767: {'lr': 0.00011998220203714425, 'samples': 19539264, 'steps': 101766, 'loss/train': 1.4106441736221313} 11/07/2021 11:30:57 - INFO - __main__ - Step 101768: {'lr': 0.00011997766945448102, 'samples': 19539456, 'steps': 101767, 'loss/train': 1.0861055850982666} 11/07/2021 11:30:57 - INFO - __main__ - Step 101769: {'lr': 0.00011997313693040377, 'samples': 19539648, 'steps': 101768, 'loss/train': 1.554802417755127} 11/07/2021 11:30:58 - INFO - __main__ - Step 101770: {'lr': 0.00011996860446491462, 'samples': 19539840, 'steps': 101769, 'loss/train': 1.5067238807678223} 11/07/2021 11:30:59 - INFO - __main__ - Step 101771: {'lr': 0.0001199640720580156, 'samples': 19540032, 'steps': 101770, 'loss/train': 1.2515686750411987} 11/07/2021 11:30:59 - INFO - __main__ - Step 101772: {'lr': 0.00011995953970970878, 'samples': 19540224, 'steps': 101771, 'loss/train': 1.586508870124817} 11/07/2021 11:30:59 - INFO - __main__ - Step 101773: {'lr': 0.00011995500741999615, 'samples': 19540416, 'steps': 101772, 'loss/train': 1.153633713722229} 11/07/2021 11:31:00 - INFO - __main__ - Step 101774: {'lr': 0.00011995047518887981, 'samples': 19540608, 'steps': 101773, 'loss/train': 0.6359546184539795} 11/07/2021 11:31:00 - INFO - __main__ - Step 101775: {'lr': 0.00011994594301636178, 'samples': 19540800, 'steps': 101774, 'loss/train': 1.2437653541564941} 11/07/2021 11:31:00 - INFO - __main__ - Step 101776: {'lr': 0.00011994141090244409, 'samples': 19540992, 'steps': 101775, 'loss/train': 1.5538767576217651} 11/07/2021 11:31:01 - INFO - __main__ - Step 101777: {'lr': 0.0001199368788471288, 'samples': 19541184, 'steps': 101776, 'loss/train': 1.759419560432434} 11/07/2021 11:31:02 - INFO - __main__ - Step 101778: {'lr': 0.00011993234685041795, 'samples': 19541376, 'steps': 101777, 'loss/train': 1.1766448020935059} 11/07/2021 11:31:02 - INFO - __main__ - Step 101779: {'lr': 0.00011992781491231358, 'samples': 19541568, 'steps': 101778, 'loss/train': 1.426324725151062} 11/07/2021 11:31:02 - INFO - __main__ - Step 101780: {'lr': 0.00011992328303281772, 'samples': 19541760, 'steps': 101779, 'loss/train': 1.073301076889038} 11/07/2021 11:31:03 - INFO - __main__ - Step 101781: {'lr': 0.00011991875121193241, 'samples': 19541952, 'steps': 101780, 'loss/train': 1.5482536554336548} 11/07/2021 11:31:04 - INFO - __main__ - Step 101782: {'lr': 0.00011991421944965982, 'samples': 19542144, 'steps': 101781, 'loss/train': 0.5929391384124756} 11/07/2021 11:31:04 - INFO - __main__ - Step 101783: {'lr': 0.00011990968774600178, 'samples': 19542336, 'steps': 101782, 'loss/train': 1.3752254247665405} 11/07/2021 11:31:05 - INFO - __main__ - Step 101784: {'lr': 0.00011990515610096042, 'samples': 19542528, 'steps': 101783, 'loss/train': 1.299018383026123} 11/07/2021 11:31:05 - INFO - __main__ - Step 101785: {'lr': 0.00011990062451453778, 'samples': 19542720, 'steps': 101784, 'loss/train': 1.6415208578109741} 11/07/2021 11:31:05 - INFO - __main__ - Step 101786: {'lr': 0.00011989609298673592, 'samples': 19542912, 'steps': 101785, 'loss/train': 1.1804986000061035} 11/07/2021 11:31:06 - INFO - __main__ - Step 101787: {'lr': 0.00011989156151755689, 'samples': 19543104, 'steps': 101786, 'loss/train': 1.3953447341918945} 11/07/2021 11:31:07 - INFO - __main__ - Step 101788: {'lr': 0.00011988703010700269, 'samples': 19543296, 'steps': 101787, 'loss/train': 1.2027992010116577} 11/07/2021 11:31:07 - INFO - __main__ - Step 101789: {'lr': 0.0001198824987550754, 'samples': 19543488, 'steps': 101788, 'loss/train': 1.3936644792556763} 11/07/2021 11:31:07 - INFO - __main__ - Step 101790: {'lr': 0.00011987796746177704, 'samples': 19543680, 'steps': 101789, 'loss/train': 1.3300570249557495} 11/07/2021 11:31:08 - INFO - __main__ - Step 101791: {'lr': 0.00011987343622710966, 'samples': 19543872, 'steps': 101790, 'loss/train': 1.4936546087265015} 11/07/2021 11:31:09 - INFO - __main__ - Step 101792: {'lr': 0.00011986890505107531, 'samples': 19544064, 'steps': 101791, 'loss/train': 1.1869399547576904} 11/07/2021 11:31:09 - INFO - __main__ - Step 101793: {'lr': 0.00011986437393367602, 'samples': 19544256, 'steps': 101792, 'loss/train': 2.0799012184143066} 11/07/2021 11:31:09 - INFO - __main__ - Step 101794: {'lr': 0.00011985984287491383, 'samples': 19544448, 'steps': 101793, 'loss/train': 1.8558603525161743} 11/07/2021 11:31:10 - INFO - __main__ - Step 101795: {'lr': 0.0001198553118747909, 'samples': 19544640, 'steps': 101794, 'loss/train': 1.1746582984924316} 11/07/2021 11:31:10 - INFO - __main__ - Step 101796: {'lr': 0.00011985078093330904, 'samples': 19544832, 'steps': 101795, 'loss/train': 1.6059083938598633} 11/07/2021 11:31:11 - INFO - __main__ - Step 101797: {'lr': 0.00011984625005047042, 'samples': 19545024, 'steps': 101796, 'loss/train': 1.2802637815475464} 11/07/2021 11:31:12 - INFO - __main__ - Step 101798: {'lr': 0.00011984171922627707, 'samples': 19545216, 'steps': 101797, 'loss/train': 1.2187808752059937} 11/07/2021 11:31:12 - INFO - __main__ - Step 101799: {'lr': 0.00011983718846073103, 'samples': 19545408, 'steps': 101798, 'loss/train': 0.8423688411712646} 11/07/2021 11:31:12 - INFO - __main__ - Step 101800: {'lr': 0.00011983265775383434, 'samples': 19545600, 'steps': 101799, 'loss/train': 1.2026000022888184} 11/07/2021 11:31:13 - INFO - __main__ - Step 101801: {'lr': 0.00011982812710558905, 'samples': 19545792, 'steps': 101800, 'loss/train': 1.676497220993042} 11/07/2021 11:31:14 - INFO - __main__ - Step 101802: {'lr': 0.0001198235965159972, 'samples': 19545984, 'steps': 101801, 'loss/train': 1.1696051359176636} 11/07/2021 11:31:14 - INFO - __main__ - Step 101803: {'lr': 0.00011981906598506084, 'samples': 19546176, 'steps': 101802, 'loss/train': 1.4289164543151855} 11/07/2021 11:31:14 - INFO - __main__ - Step 101804: {'lr': 0.00011981453551278199, 'samples': 19546368, 'steps': 101803, 'loss/train': 1.4961183071136475} 11/07/2021 11:31:15 - INFO - __main__ - Step 101805: {'lr': 0.0001198100050991627, 'samples': 19546560, 'steps': 101804, 'loss/train': 1.2819713354110718} 11/07/2021 11:31:15 - INFO - __main__ - Step 101806: {'lr': 0.000119805474744205, 'samples': 19546752, 'steps': 101805, 'loss/train': 1.3431729078292847} 11/07/2021 11:31:16 - INFO - __main__ - Step 101807: {'lr': 0.00011980094444791095, 'samples': 19546944, 'steps': 101806, 'loss/train': 1.5141892433166504} 11/07/2021 11:31:16 - INFO - __main__ - Step 101808: {'lr': 0.00011979641421028261, 'samples': 19547136, 'steps': 101807, 'loss/train': 0.9060997366905212} 11/07/2021 11:31:17 - INFO - __main__ - Step 101809: {'lr': 0.00011979188403132207, 'samples': 19547328, 'steps': 101808, 'loss/train': 0.8946255445480347} 11/07/2021 11:31:17 - INFO - __main__ - Step 101810: {'lr': 0.00011978735391103122, 'samples': 19547520, 'steps': 101809, 'loss/train': 1.438600778579712} 11/07/2021 11:31:17 - INFO - __main__ - Step 101811: {'lr': 0.00011978282384941214, 'samples': 19547712, 'steps': 101810, 'loss/train': 1.127354383468628} 11/07/2021 11:31:18 - INFO - __main__ - Step 101812: {'lr': 0.00011977829384646694, 'samples': 19547904, 'steps': 101811, 'loss/train': 1.2009832859039307} 11/07/2021 11:31:19 - INFO - __main__ - Step 101813: {'lr': 0.00011977376390219764, 'samples': 19548096, 'steps': 101812, 'loss/train': 1.4671692848205566} 11/07/2021 11:31:19 - INFO - __main__ - Step 101814: {'lr': 0.00011976923401660625, 'samples': 19548288, 'steps': 101813, 'loss/train': 1.8987374305725098} 11/07/2021 11:31:19 - INFO - __main__ - Step 101815: {'lr': 0.00011976470418969485, 'samples': 19548480, 'steps': 101814, 'loss/train': 1.5553100109100342} 11/07/2021 11:31:20 - INFO - __main__ - Step 101816: {'lr': 0.00011976017442146545, 'samples': 19548672, 'steps': 101815, 'loss/train': 1.6069191694259644} 11/07/2021 11:31:21 - INFO - __main__ - Step 101817: {'lr': 0.0001197556447119201, 'samples': 19548864, 'steps': 101816, 'loss/train': 1.4737755060195923} 11/07/2021 11:31:21 - INFO - __main__ - Step 101818: {'lr': 0.00011975111506106087, 'samples': 19549056, 'steps': 101817, 'loss/train': 1.6116032600402832} 11/07/2021 11:31:22 - INFO - __main__ - Step 101819: {'lr': 0.00011974658546888977, 'samples': 19549248, 'steps': 101818, 'loss/train': 1.386729121208191} 11/07/2021 11:31:22 - INFO - __main__ - Step 101820: {'lr': 0.00011974205593540884, 'samples': 19549440, 'steps': 101819, 'loss/train': 1.2159456014633179} 11/07/2021 11:31:22 - INFO - __main__ - Step 101821: {'lr': 0.00011973752646062014, 'samples': 19549632, 'steps': 101820, 'loss/train': 0.877809464931488} 11/07/2021 11:31:24 - INFO - __main__ - Step 101822: {'lr': 0.00011973299704452581, 'samples': 19549824, 'steps': 101821, 'loss/train': 1.5040616989135742} 11/07/2021 11:31:24 - INFO - __main__ - Step 101823: {'lr': 0.00011972846768712764, 'samples': 19550016, 'steps': 101822, 'loss/train': 1.3940372467041016} 11/07/2021 11:31:24 - INFO - __main__ - Step 101824: {'lr': 0.00011972393838842785, 'samples': 19550208, 'steps': 101823, 'loss/train': 1.4139653444290161} 11/07/2021 11:31:25 - INFO - __main__ - Step 101825: {'lr': 0.00011971940914842843, 'samples': 19550400, 'steps': 101824, 'loss/train': 1.5498199462890625} 11/07/2021 11:31:25 - INFO - __main__ - Step 101826: {'lr': 0.00011971487996713146, 'samples': 19550592, 'steps': 101825, 'loss/train': 1.5713034868240356} 11/07/2021 11:31:25 - INFO - __main__ - Step 101827: {'lr': 0.0001197103508445389, 'samples': 19550784, 'steps': 101826, 'loss/train': 1.5927979946136475} 11/07/2021 11:31:27 - INFO - __main__ - Step 101828: {'lr': 0.00011970582178065289, 'samples': 19550976, 'steps': 101827, 'loss/train': 1.4247033596038818} 11/07/2021 11:31:27 - INFO - __main__ - Step 101829: {'lr': 0.00011970129277547542, 'samples': 19551168, 'steps': 101828, 'loss/train': 1.0762418508529663} 11/07/2021 11:31:28 - INFO - __main__ - Step 101830: {'lr': 0.00011969676382900852, 'samples': 19551360, 'steps': 101829, 'loss/train': 0.1914094090461731} 11/07/2021 11:31:28 - INFO - __main__ - Step 101831: {'lr': 0.00011969223494125425, 'samples': 19551552, 'steps': 101830, 'loss/train': 0.15115824341773987} 11/07/2021 11:31:28 - INFO - __main__ - Step 101832: {'lr': 0.00011968770611221466, 'samples': 19551744, 'steps': 101831, 'loss/train': 1.9044339656829834} 11/07/2021 11:31:29 - INFO - __main__ - Step 101833: {'lr': 0.0001196831773418918, 'samples': 19551936, 'steps': 101832, 'loss/train': 0.6365652680397034} 11/07/2021 11:31:30 - INFO - __main__ - Step 101834: {'lr': 0.00011967864863028765, 'samples': 19552128, 'steps': 101833, 'loss/train': 1.6439549922943115} 11/07/2021 11:31:30 - INFO - __main__ - Step 101835: {'lr': 0.00011967411997740429, 'samples': 19552320, 'steps': 101834, 'loss/train': 1.8165404796600342} 11/07/2021 11:31:30 - INFO - __main__ - Step 101836: {'lr': 0.00011966959138324387, 'samples': 19552512, 'steps': 101835, 'loss/train': 0.633165717124939} 11/07/2021 11:31:31 - INFO - __main__ - Step 101837: {'lr': 0.00011966506284780823, 'samples': 19552704, 'steps': 101836, 'loss/train': 2.1793503761291504} 11/07/2021 11:31:31 - INFO - __main__ - Step 101838: {'lr': 0.0001196605343710995, 'samples': 19552896, 'steps': 101837, 'loss/train': 1.2271347045898438} 11/07/2021 11:31:32 - INFO - __main__ - Step 101839: {'lr': 0.00011965600595311973, 'samples': 19553088, 'steps': 101838, 'loss/train': 1.0310977697372437} 11/07/2021 11:31:32 - INFO - __main__ - Step 101840: {'lr': 0.00011965147759387093, 'samples': 19553280, 'steps': 101839, 'loss/train': 1.0632938146591187} 11/07/2021 11:31:33 - INFO - __main__ - Step 101841: {'lr': 0.00011964694929335517, 'samples': 19553472, 'steps': 101840, 'loss/train': 1.3816932439804077} 11/07/2021 11:31:33 - INFO - __main__ - Step 101842: {'lr': 0.0001196424210515745, 'samples': 19553664, 'steps': 101841, 'loss/train': 1.1873723268508911} 11/07/2021 11:31:33 - INFO - __main__ - Step 101843: {'lr': 0.00011963789286853093, 'samples': 19553856, 'steps': 101842, 'loss/train': 1.9928758144378662} 11/07/2021 11:31:34 - INFO - __main__ - Step 101844: {'lr': 0.0001196333647442265, 'samples': 19554048, 'steps': 101843, 'loss/train': 0.922770082950592} 11/07/2021 11:31:35 - INFO - __main__ - Step 101845: {'lr': 0.00011962883667866328, 'samples': 19554240, 'steps': 101844, 'loss/train': 1.5050996541976929} 11/07/2021 11:31:35 - INFO - __main__ - Step 101846: {'lr': 0.00011962430867184329, 'samples': 19554432, 'steps': 101845, 'loss/train': 1.533813714981079} 11/07/2021 11:31:36 - INFO - __main__ - Step 101847: {'lr': 0.00011961978072376859, 'samples': 19554624, 'steps': 101846, 'loss/train': 0.8797428011894226} 11/07/2021 11:31:36 - INFO - __main__ - Step 101848: {'lr': 0.0001196152528344413, 'samples': 19554816, 'steps': 101847, 'loss/train': 1.5321941375732422} 11/07/2021 11:31:38 - INFO - __main__ - Step 101849: {'lr': 0.00011961072500386325, 'samples': 19555008, 'steps': 101848, 'loss/train': 1.3899486064910889} 11/07/2021 11:31:38 - INFO - __main__ - Step 101850: {'lr': 0.00011960619723203662, 'samples': 19555200, 'steps': 101849, 'loss/train': 0.8288902044296265} 11/07/2021 11:31:38 - INFO - __main__ - Step 101851: {'lr': 0.00011960166951896339, 'samples': 19555392, 'steps': 101850, 'loss/train': 0.9907975792884827} 11/07/2021 11:31:39 - INFO - __main__ - Step 101852: {'lr': 0.00011959714186464566, 'samples': 19555584, 'steps': 101851, 'loss/train': 1.5990701913833618} 11/07/2021 11:31:39 - INFO - __main__ - Step 101853: {'lr': 0.00011959261426908544, 'samples': 19555776, 'steps': 101852, 'loss/train': 1.056114673614502} 11/07/2021 11:31:39 - INFO - __main__ - Step 101854: {'lr': 0.00011958808673228477, 'samples': 19555968, 'steps': 101853, 'loss/train': 0.9487912654876709} 11/07/2021 11:31:40 - INFO - __main__ - Step 101855: {'lr': 0.0001195835592542457, 'samples': 19556160, 'steps': 101854, 'loss/train': 1.423301339149475} 11/07/2021 11:31:41 - INFO - __main__ - Step 101856: {'lr': 0.00011957903183497026, 'samples': 19556352, 'steps': 101855, 'loss/train': 1.021880030632019} 11/07/2021 11:31:41 - INFO - __main__ - Step 101857: {'lr': 0.0001195745044744605, 'samples': 19556544, 'steps': 101856, 'loss/train': 1.938745141029358} 11/07/2021 11:31:41 - INFO - __main__ - Step 101858: {'lr': 0.00011956997717271848, 'samples': 19556736, 'steps': 101857, 'loss/train': 1.1731173992156982} 11/07/2021 11:31:42 - INFO - __main__ - Step 101859: {'lr': 0.00011956544992974628, 'samples': 19556928, 'steps': 101858, 'loss/train': 1.17055344581604} 11/07/2021 11:31:42 - INFO - __main__ - Step 101860: {'lr': 0.00011956092274554579, 'samples': 19557120, 'steps': 101859, 'loss/train': 1.3958666324615479} 11/07/2021 11:31:43 - INFO - __main__ - Step 101861: {'lr': 0.00011955639562011914, 'samples': 19557312, 'steps': 101860, 'loss/train': 1.2830239534378052} 11/07/2021 11:31:44 - INFO - __main__ - Step 101862: {'lr': 0.00011955186855346836, 'samples': 19557504, 'steps': 101861, 'loss/train': 1.3382385969161987} 11/07/2021 11:31:44 - INFO - __main__ - Step 101863: {'lr': 0.00011954734154559549, 'samples': 19557696, 'steps': 101862, 'loss/train': 1.1190376281738281} 11/07/2021 11:31:44 - INFO - __main__ - Step 101864: {'lr': 0.00011954281459650257, 'samples': 19557888, 'steps': 101863, 'loss/train': 0.697787344455719} 11/07/2021 11:31:45 - INFO - __main__ - Step 101865: {'lr': 0.00011953828770619165, 'samples': 19558080, 'steps': 101864, 'loss/train': 0.4776252806186676} 11/07/2021 11:31:46 - INFO - __main__ - Step 101866: {'lr': 0.00011953376087466478, 'samples': 19558272, 'steps': 101865, 'loss/train': 1.6381574869155884} 11/07/2021 11:31:46 - INFO - __main__ - Step 101867: {'lr': 0.00011952923410192399, 'samples': 19558464, 'steps': 101866, 'loss/train': 0.1368051916360855} 11/07/2021 11:31:47 - INFO - __main__ - Step 101868: {'lr': 0.00011952470738797128, 'samples': 19558656, 'steps': 101867, 'loss/train': 1.3164265155792236} 11/07/2021 11:31:47 - INFO - __main__ - Step 101869: {'lr': 0.00011952018073280873, 'samples': 19558848, 'steps': 101868, 'loss/train': 0.824339747428894} 11/07/2021 11:31:47 - INFO - __main__ - Step 101870: {'lr': 0.0001195156541364385, 'samples': 19559040, 'steps': 101869, 'loss/train': 1.2416677474975586} 11/07/2021 11:31:48 - INFO - __main__ - Step 101871: {'lr': 0.00011951112759886237, 'samples': 19559232, 'steps': 101870, 'loss/train': 1.2399725914001465} 11/07/2021 11:31:49 - INFO - __main__ - Step 101872: {'lr': 0.00011950660112008255, 'samples': 19559424, 'steps': 101871, 'loss/train': 1.4219894409179688} 11/07/2021 11:31:49 - INFO - __main__ - Step 101873: {'lr': 0.00011950207470010103, 'samples': 19559616, 'steps': 101872, 'loss/train': 1.462199330329895} 11/07/2021 11:31:49 - INFO - __main__ - Step 101874: {'lr': 0.00011949754833891981, 'samples': 19559808, 'steps': 101873, 'loss/train': 0.8418935537338257} 11/07/2021 11:31:50 - INFO - __main__ - Step 101875: {'lr': 0.00011949302203654105, 'samples': 19560000, 'steps': 101874, 'loss/train': 1.2953927516937256} 11/07/2021 11:31:51 - INFO - __main__ - Step 101876: {'lr': 0.00011948849579296669, 'samples': 19560192, 'steps': 101875, 'loss/train': 0.8591344952583313} 11/07/2021 11:31:51 - INFO - __main__ - Step 101877: {'lr': 0.00011948396960819879, 'samples': 19560384, 'steps': 101876, 'loss/train': 1.2084928750991821} 11/07/2021 11:31:51 - INFO - __main__ - Step 101878: {'lr': 0.00011947944348223944, 'samples': 19560576, 'steps': 101877, 'loss/train': 1.5609800815582275} 11/07/2021 11:31:52 - INFO - __main__ - Step 101879: {'lr': 0.00011947491741509059, 'samples': 19560768, 'steps': 101878, 'loss/train': 1.6024792194366455} 11/07/2021 11:31:52 - INFO - __main__ - Step 101880: {'lr': 0.00011947039140675436, 'samples': 19560960, 'steps': 101879, 'loss/train': 1.4753680229187012} 11/07/2021 11:31:52 - INFO - __main__ - Step 101881: {'lr': 0.00011946586545723284, 'samples': 19561152, 'steps': 101880, 'loss/train': 1.4710391759872437} 11/07/2021 11:31:54 - INFO - __main__ - Step 101882: {'lr': 0.00011946133956652788, 'samples': 19561344, 'steps': 101881, 'loss/train': 1.4457299709320068} 11/07/2021 11:31:54 - INFO - __main__ - Step 101883: {'lr': 0.00011945681373464166, 'samples': 19561536, 'steps': 101882, 'loss/train': 1.352506399154663} 11/07/2021 11:31:54 - INFO - __main__ - Step 101884: {'lr': 0.00011945228796157614, 'samples': 19561728, 'steps': 101883, 'loss/train': 1.2272123098373413} 11/07/2021 11:31:55 - INFO - __main__ - Step 101885: {'lr': 0.00011944776224733345, 'samples': 19561920, 'steps': 101884, 'loss/train': 0.5435953736305237} 11/07/2021 11:31:55 - INFO - __main__ - Step 101886: {'lr': 0.00011944323659191556, 'samples': 19562112, 'steps': 101885, 'loss/train': 1.5438389778137207} 11/07/2021 11:31:56 - INFO - __main__ - Step 101887: {'lr': 0.00011943871099532455, 'samples': 19562304, 'steps': 101886, 'loss/train': 0.9318904876708984} 11/07/2021 11:31:56 - INFO - __main__ - Step 101888: {'lr': 0.0001194341854575624, 'samples': 19562496, 'steps': 101887, 'loss/train': 1.3874648809432983} 11/07/2021 11:31:57 - INFO - __main__ - Step 101889: {'lr': 0.00011942965997863123, 'samples': 19562688, 'steps': 101888, 'loss/train': 1.213281512260437} 11/07/2021 11:31:57 - INFO - __main__ - Step 101890: {'lr': 0.00011942513455853305, 'samples': 19562880, 'steps': 101889, 'loss/train': 0.8998444080352783} 11/07/2021 11:31:57 - INFO - __main__ - Step 101891: {'lr': 0.00011942060919726985, 'samples': 19563072, 'steps': 101890, 'loss/train': 1.1555395126342773} 11/07/2021 11:31:58 - INFO - __main__ - Step 101892: {'lr': 0.00011941608389484381, 'samples': 19563264, 'steps': 101891, 'loss/train': 1.3718852996826172} 11/07/2021 11:31:59 - INFO - __main__ - Step 101893: {'lr': 0.0001194115586512568, 'samples': 19563456, 'steps': 101892, 'loss/train': 1.5145459175109863} 11/07/2021 11:31:59 - INFO - __main__ - Step 101894: {'lr': 0.00011940703346651091, 'samples': 19563648, 'steps': 101893, 'loss/train': 1.5319373607635498} 11/07/2021 11:31:59 - INFO - __main__ - Step 101895: {'lr': 0.00011940250834060821, 'samples': 19563840, 'steps': 101894, 'loss/train': 1.2096049785614014} 11/07/2021 11:32:00 - INFO - __main__ - Step 101896: {'lr': 0.0001193979832735507, 'samples': 19564032, 'steps': 101895, 'loss/train': 1.545196771621704} 11/07/2021 11:32:01 - INFO - __main__ - Step 101897: {'lr': 0.00011939345826534046, 'samples': 19564224, 'steps': 101896, 'loss/train': 1.4869285821914673} 11/07/2021 11:32:01 - INFO - __main__ - Step 101898: {'lr': 0.00011938893331597953, 'samples': 19564416, 'steps': 101897, 'loss/train': 1.6270148754119873} 11/07/2021 11:32:02 - INFO - __main__ - Step 101899: {'lr': 0.00011938440842546991, 'samples': 19564608, 'steps': 101898, 'loss/train': 1.3414220809936523} 11/07/2021 11:32:02 - INFO - __main__ - Step 101900: {'lr': 0.00011937988359381363, 'samples': 19564800, 'steps': 101899, 'loss/train': 1.2120311260223389} 11/07/2021 11:32:02 - INFO - __main__ - Step 101901: {'lr': 0.00011937535882101281, 'samples': 19564992, 'steps': 101900, 'loss/train': 1.2678056955337524} 11/07/2021 11:32:03 - INFO - __main__ - Step 101902: {'lr': 0.00011937083410706942, 'samples': 19565184, 'steps': 101901, 'loss/train': 1.3874191045761108} 11/07/2021 11:32:04 - INFO - __main__ - Step 101903: {'lr': 0.0001193663094519856, 'samples': 19565376, 'steps': 101902, 'loss/train': 1.6700141429901123} 11/07/2021 11:32:04 - INFO - __main__ - Step 101904: {'lr': 0.00011936178485576321, 'samples': 19565568, 'steps': 101903, 'loss/train': 1.2878649234771729} 11/07/2021 11:32:04 - INFO - __main__ - Step 101905: {'lr': 0.00011935726031840441, 'samples': 19565760, 'steps': 101904, 'loss/train': 1.3891143798828125} 11/07/2021 11:32:05 - INFO - __main__ - Step 101906: {'lr': 0.00011935273583991118, 'samples': 19565952, 'steps': 101905, 'loss/train': 1.4987810850143433} 11/07/2021 11:32:06 - INFO - __main__ - Step 101907: {'lr': 0.0001193482114202856, 'samples': 19566144, 'steps': 101906, 'loss/train': 1.0013906955718994} 11/07/2021 11:32:06 - INFO - __main__ - Step 101908: {'lr': 0.00011934368705952972, 'samples': 19566336, 'steps': 101907, 'loss/train': 1.2025082111358643} 11/07/2021 11:32:07 - INFO - __main__ - Step 101909: {'lr': 0.00011933916275764553, 'samples': 19566528, 'steps': 101908, 'loss/train': 1.4752764701843262} 11/07/2021 11:32:07 - INFO - __main__ - Step 101910: {'lr': 0.0001193346385146351, 'samples': 19566720, 'steps': 101909, 'loss/train': 1.210837721824646} 11/07/2021 11:32:07 - INFO - __main__ - Step 101911: {'lr': 0.00011933011433050051, 'samples': 19566912, 'steps': 101910, 'loss/train': 1.1019856929779053} 11/07/2021 11:32:09 - INFO - __main__ - Step 101912: {'lr': 0.0001193255902052437, 'samples': 19567104, 'steps': 101911, 'loss/train': 1.5232738256454468} 11/07/2021 11:32:09 - INFO - __main__ - Step 101913: {'lr': 0.00011932106613886678, 'samples': 19567296, 'steps': 101912, 'loss/train': 0.8150825500488281} 11/07/2021 11:32:09 - INFO - __main__ - Step 101914: {'lr': 0.00011931654213137177, 'samples': 19567488, 'steps': 101913, 'loss/train': 1.4142147302627563} 11/07/2021 11:32:10 - INFO - __main__ - Step 101915: {'lr': 0.00011931201818276072, 'samples': 19567680, 'steps': 101914, 'loss/train': 1.3311035633087158} 11/07/2021 11:32:10 - INFO - __main__ - Step 101916: {'lr': 0.00011930749429303576, 'samples': 19567872, 'steps': 101915, 'loss/train': 1.515618920326233} 11/07/2021 11:32:10 - INFO - __main__ - Step 101917: {'lr': 0.00011930297046219871, 'samples': 19568064, 'steps': 101916, 'loss/train': 1.6618969440460205} 11/07/2021 11:32:11 - INFO - __main__ - Step 101918: {'lr': 0.00011929844669025172, 'samples': 19568256, 'steps': 101917, 'loss/train': 0.33965590596199036} 11/07/2021 11:32:12 - INFO - __main__ - Step 101919: {'lr': 0.00011929392297719685, 'samples': 19568448, 'steps': 101918, 'loss/train': 1.1508355140686035} 11/07/2021 11:32:12 - INFO - __main__ - Step 101920: {'lr': 0.00011928939932303612, 'samples': 19568640, 'steps': 101919, 'loss/train': 0.4492985010147095} 11/07/2021 11:32:12 - INFO - __main__ - Step 101921: {'lr': 0.00011928487572777158, 'samples': 19568832, 'steps': 101920, 'loss/train': 1.235152006149292} 11/07/2021 11:32:13 - INFO - __main__ - Step 101922: {'lr': 0.00011928035219140523, 'samples': 19569024, 'steps': 101921, 'loss/train': 1.347999095916748} 11/07/2021 11:32:14 - INFO - __main__ - Step 101923: {'lr': 0.00011927582871393916, 'samples': 19569216, 'steps': 101922, 'loss/train': 1.4457767009735107} 11/07/2021 11:32:14 - INFO - __main__ - Step 101924: {'lr': 0.00011927130529537538, 'samples': 19569408, 'steps': 101923, 'loss/train': 1.6052261590957642} 11/07/2021 11:32:14 - INFO - __main__ - Step 101925: {'lr': 0.00011926678193571592, 'samples': 19569600, 'steps': 101924, 'loss/train': 1.1223331689834595} 11/07/2021 11:32:15 - INFO - __main__ - Step 101926: {'lr': 0.00011926225863496285, 'samples': 19569792, 'steps': 101925, 'loss/train': 1.3010679483413696} 11/07/2021 11:32:15 - INFO - __main__ - Step 101927: {'lr': 0.00011925773539311816, 'samples': 19569984, 'steps': 101926, 'loss/train': 1.3394083976745605} 11/07/2021 11:32:15 - INFO - __main__ - Step 101928: {'lr': 0.00011925321221018396, 'samples': 19570176, 'steps': 101927, 'loss/train': 1.2485389709472656} 11/07/2021 11:32:16 - INFO - __main__ - Step 101929: {'lr': 0.00011924868908616222, 'samples': 19570368, 'steps': 101928, 'loss/train': 1.521597981452942} 11/07/2021 11:32:17 - INFO - __main__ - Step 101930: {'lr': 0.00011924416602105508, 'samples': 19570560, 'steps': 101929, 'loss/train': 1.1919288635253906} 11/07/2021 11:32:17 - INFO - __main__ - Step 101931: {'lr': 0.00011923964301486442, 'samples': 19570752, 'steps': 101930, 'loss/train': 1.4923688173294067} 11/07/2021 11:32:18 - INFO - __main__ - Step 101932: {'lr': 0.00011923512006759238, 'samples': 19570944, 'steps': 101931, 'loss/train': 1.2121652364730835} 11/07/2021 11:32:18 - INFO - __main__ - Step 101933: {'lr': 0.00011923059717924095, 'samples': 19571136, 'steps': 101932, 'loss/train': 1.2952700853347778} 11/07/2021 11:32:19 - INFO - __main__ - Step 101934: {'lr': 0.0001192260743498122, 'samples': 19571328, 'steps': 101933, 'loss/train': 1.213035225868225} 11/07/2021 11:32:19 - INFO - __main__ - Step 101935: {'lr': 0.00011922155157930816, 'samples': 19571520, 'steps': 101934, 'loss/train': 1.2083505392074585} 11/07/2021 11:32:20 - INFO - __main__ - Step 101936: {'lr': 0.00011921702886773089, 'samples': 19571712, 'steps': 101935, 'loss/train': 0.5249482989311218} 11/07/2021 11:32:20 - INFO - __main__ - Step 101937: {'lr': 0.0001192125062150824, 'samples': 19571904, 'steps': 101936, 'loss/train': 1.2633756399154663} 11/07/2021 11:32:20 - INFO - __main__ - Step 101938: {'lr': 0.00011920798362136472, 'samples': 19572096, 'steps': 101937, 'loss/train': 0.9046249389648438} 11/07/2021 11:32:21 - INFO - __main__ - Step 101939: {'lr': 0.00011920346108657992, 'samples': 19572288, 'steps': 101938, 'loss/train': 0.8801046013832092} 11/07/2021 11:32:22 - INFO - __main__ - Step 101940: {'lr': 0.00011919893861073003, 'samples': 19572480, 'steps': 101939, 'loss/train': 1.139893889427185} 11/07/2021 11:32:22 - INFO - __main__ - Step 101941: {'lr': 0.00011919441619381708, 'samples': 19572672, 'steps': 101940, 'loss/train': 1.7410529851913452} 11/07/2021 11:32:22 - INFO - __main__ - Step 101942: {'lr': 0.00011918989383584308, 'samples': 19572864, 'steps': 101941, 'loss/train': 1.0774506330490112} 11/07/2021 11:32:23 - INFO - __main__ - Step 101943: {'lr': 0.00011918537153681022, 'samples': 19573056, 'steps': 101942, 'loss/train': 1.2291368246078491} 11/07/2021 11:32:24 - INFO - __main__ - Step 101944: {'lr': 0.00011918084929672029, 'samples': 19573248, 'steps': 101943, 'loss/train': 1.2513099908828735} 11/07/2021 11:32:24 - INFO - __main__ - Step 101945: {'lr': 0.00011917632711557547, 'samples': 19573440, 'steps': 101944, 'loss/train': 0.5847854018211365} 11/07/2021 11:32:25 - INFO - __main__ - Step 101946: {'lr': 0.0001191718049933778, 'samples': 19573632, 'steps': 101945, 'loss/train': 1.4246597290039062} 11/07/2021 11:32:25 - INFO - __main__ - Step 101947: {'lr': 0.00011916728293012927, 'samples': 19573824, 'steps': 101946, 'loss/train': 0.5384560823440552} 11/07/2021 11:32:25 - INFO - __main__ - Step 101948: {'lr': 0.00011916276092583191, 'samples': 19574016, 'steps': 101947, 'loss/train': 1.1555274724960327} 11/07/2021 11:32:26 - INFO - __main__ - Step 101949: {'lr': 0.00011915823898048784, 'samples': 19574208, 'steps': 101948, 'loss/train': 0.9255381226539612} 11/07/2021 11:32:27 - INFO - __main__ - Step 101950: {'lr': 0.00011915371709409903, 'samples': 19574400, 'steps': 101949, 'loss/train': 0.6264830231666565} 11/07/2021 11:32:27 - INFO - __main__ - Step 101951: {'lr': 0.00011914919526666753, 'samples': 19574592, 'steps': 101950, 'loss/train': 1.9121060371398926} 11/07/2021 11:32:27 - INFO - __main__ - Step 101952: {'lr': 0.00011914467349819542, 'samples': 19574784, 'steps': 101951, 'loss/train': 1.5866620540618896} 11/07/2021 11:32:28 - INFO - __main__ - Step 101953: {'lr': 0.00011914015178868468, 'samples': 19574976, 'steps': 101952, 'loss/train': 1.315491795539856} 11/07/2021 11:32:28 - INFO - __main__ - Step 101954: {'lr': 0.00011913563013813735, 'samples': 19575168, 'steps': 101953, 'loss/train': 1.6105118989944458} 11/07/2021 11:32:29 - INFO - __main__ - Step 101955: {'lr': 0.00011913110854655549, 'samples': 19575360, 'steps': 101954, 'loss/train': 1.431010127067566} 11/07/2021 11:32:30 - INFO - __main__ - Step 101956: {'lr': 0.00011912658701394113, 'samples': 19575552, 'steps': 101955, 'loss/train': 1.039467692375183} 11/07/2021 11:32:30 - INFO - __main__ - Step 101957: {'lr': 0.00011912206554029645, 'samples': 19575744, 'steps': 101956, 'loss/train': 1.1742432117462158} 11/07/2021 11:32:30 - INFO - __main__ - Step 101958: {'lr': 0.0001191175441256232, 'samples': 19575936, 'steps': 101957, 'loss/train': 1.615080714225769} 11/07/2021 11:32:31 - INFO - __main__ - Step 101959: {'lr': 0.00011911302276992358, 'samples': 19576128, 'steps': 101958, 'loss/train': 1.6576664447784424} 11/07/2021 11:32:32 - INFO - __main__ - Step 101960: {'lr': 0.00011910850147319962, 'samples': 19576320, 'steps': 101959, 'loss/train': 1.7335491180419922} 11/07/2021 11:32:32 - INFO - __main__ - Step 101961: {'lr': 0.00011910398023545336, 'samples': 19576512, 'steps': 101960, 'loss/train': 1.4316929578781128} 11/07/2021 11:32:32 - INFO - __main__ - Step 101962: {'lr': 0.00011909945905668682, 'samples': 19576704, 'steps': 101961, 'loss/train': 1.1760823726654053} 11/07/2021 11:32:33 - INFO - __main__ - Step 101963: {'lr': 0.00011909493793690201, 'samples': 19576896, 'steps': 101962, 'loss/train': 0.8606479167938232} 11/07/2021 11:32:33 - INFO - __main__ - Step 101964: {'lr': 0.00011909041687610104, 'samples': 19577088, 'steps': 101963, 'loss/train': 1.7035784721374512} 11/07/2021 11:32:35 - INFO - __main__ - Step 101965: {'lr': 0.00011908589587428589, 'samples': 19577280, 'steps': 101964, 'loss/train': 1.182175636291504} 11/07/2021 11:32:35 - INFO - __main__ - Step 101966: {'lr': 0.00011908137493145862, 'samples': 19577472, 'steps': 101965, 'loss/train': 1.296228051185608} 11/07/2021 11:32:35 - INFO - __main__ - Step 101967: {'lr': 0.00011907685404762128, 'samples': 19577664, 'steps': 101966, 'loss/train': 1.6398694515228271} 11/07/2021 11:32:36 - INFO - __main__ - Step 101968: {'lr': 0.00011907233322277586, 'samples': 19577856, 'steps': 101967, 'loss/train': 1.3429021835327148} 11/07/2021 11:32:36 - INFO - __main__ - Step 101969: {'lr': 0.00011906781245692444, 'samples': 19578048, 'steps': 101968, 'loss/train': 1.4680198431015015} 11/07/2021 11:32:36 - INFO - __main__ - Step 101970: {'lr': 0.00011906329175006914, 'samples': 19578240, 'steps': 101969, 'loss/train': 1.9244999885559082} 11/07/2021 11:32:38 - INFO - __main__ - Step 101971: {'lr': 0.00011905877110221181, 'samples': 19578432, 'steps': 101970, 'loss/train': 0.54759281873703} 11/07/2021 11:32:38 - INFO - __main__ - Step 101972: {'lr': 0.00011905425051335456, 'samples': 19578624, 'steps': 101971, 'loss/train': 1.303914189338684} 11/07/2021 11:32:38 - INFO - __main__ - Step 101973: {'lr': 0.00011904972998349945, 'samples': 19578816, 'steps': 101972, 'loss/train': 0.6394945979118347} 11/07/2021 11:32:39 - INFO - __main__ - Step 101974: {'lr': 0.00011904520951264852, 'samples': 19579008, 'steps': 101973, 'loss/train': 1.3729348182678223} 11/07/2021 11:32:39 - INFO - __main__ - Step 101975: {'lr': 0.00011904068910080379, 'samples': 19579200, 'steps': 101974, 'loss/train': 1.2234660387039185} 11/07/2021 11:32:40 - INFO - __main__ - Step 101976: {'lr': 0.0001190361687479673, 'samples': 19579392, 'steps': 101975, 'loss/train': 1.283174991607666} 11/07/2021 11:32:40 - INFO - __main__ - Step 101977: {'lr': 0.00011903164845414111, 'samples': 19579584, 'steps': 101976, 'loss/train': 1.0252811908721924} 11/07/2021 11:32:41 - INFO - __main__ - Step 101978: {'lr': 0.0001190271282193272, 'samples': 19579776, 'steps': 101977, 'loss/train': 0.9133988618850708} 11/07/2021 11:32:41 - INFO - __main__ - Step 101979: {'lr': 0.00011902260804352769, 'samples': 19579968, 'steps': 101978, 'loss/train': 0.9898113012313843} 11/07/2021 11:32:41 - INFO - __main__ - Step 101980: {'lr': 0.00011901808792674457, 'samples': 19580160, 'steps': 101979, 'loss/train': 1.4467355012893677} 11/07/2021 11:32:42 - INFO - __main__ - Step 101981: {'lr': 0.00011901356786897985, 'samples': 19580352, 'steps': 101980, 'loss/train': 1.7011704444885254} 11/07/2021 11:32:43 - INFO - __main__ - Step 101982: {'lr': 0.00011900904787023562, 'samples': 19580544, 'steps': 101981, 'loss/train': 1.2236876487731934} 11/07/2021 11:32:43 - INFO - __main__ - Step 101983: {'lr': 0.00011900452793051387, 'samples': 19580736, 'steps': 101982, 'loss/train': 1.5334433317184448} 11/07/2021 11:32:43 - INFO - __main__ - Step 101984: {'lr': 0.00011900000804981676, 'samples': 19580928, 'steps': 101983, 'loss/train': 0.8715554475784302} 11/07/2021 11:32:44 - INFO - __main__ - Step 101985: {'lr': 0.00011899548822814613, 'samples': 19581120, 'steps': 101984, 'loss/train': 1.1034934520721436} 11/07/2021 11:32:44 - INFO - __main__ - Step 101986: {'lr': 0.00011899096846550412, 'samples': 19581312, 'steps': 101985, 'loss/train': 1.505855679512024} 11/07/2021 11:32:45 - INFO - __main__ - Step 101987: {'lr': 0.00011898644876189275, 'samples': 19581504, 'steps': 101986, 'loss/train': 1.6157296895980835} 11/07/2021 11:32:46 - INFO - __main__ - Step 101988: {'lr': 0.00011898192911731407, 'samples': 19581696, 'steps': 101987, 'loss/train': 0.6131436824798584} 11/07/2021 11:32:46 - INFO - __main__ - Step 101989: {'lr': 0.0001189774095317701, 'samples': 19581888, 'steps': 101988, 'loss/train': 1.4838342666625977} 11/07/2021 11:32:46 - INFO - __main__ - Step 101990: {'lr': 0.0001189728900052629, 'samples': 19582080, 'steps': 101989, 'loss/train': 1.4662169218063354} 11/07/2021 11:32:47 - INFO - __main__ - Step 101991: {'lr': 0.00011896837053779447, 'samples': 19582272, 'steps': 101990, 'loss/train': 2.156440496444702} 11/07/2021 11:32:48 - INFO - __main__ - Step 101992: {'lr': 0.00011896385112936689, 'samples': 19582464, 'steps': 101991, 'loss/train': 1.4784681797027588} 11/07/2021 11:32:48 - INFO - __main__ - Step 101993: {'lr': 0.00011895933177998219, 'samples': 19582656, 'steps': 101992, 'loss/train': 1.7477153539657593} 11/07/2021 11:32:48 - INFO - __main__ - Step 101994: {'lr': 0.00011895481248964238, 'samples': 19582848, 'steps': 101993, 'loss/train': 1.2556759119033813} 11/07/2021 11:32:49 - INFO - __main__ - Step 101995: {'lr': 0.0001189502932583495, 'samples': 19583040, 'steps': 101994, 'loss/train': 1.34673011302948} 11/07/2021 11:32:49 - INFO - __main__ - Step 101996: {'lr': 0.0001189457740861056, 'samples': 19583232, 'steps': 101995, 'loss/train': 1.2625361680984497} 11/07/2021 11:32:50 - INFO - __main__ - Step 101997: {'lr': 0.0001189412549729128, 'samples': 19583424, 'steps': 101996, 'loss/train': 1.1187825202941895} 11/07/2021 11:32:50 - INFO - __main__ - Step 101998: {'lr': 0.00011893673591877297, 'samples': 19583616, 'steps': 101997, 'loss/train': 1.643775224685669} 11/07/2021 11:32:51 - INFO - __main__ - Step 101999: {'lr': 0.00011893221692368822, 'samples': 19583808, 'steps': 101998, 'loss/train': 1.0523601770401} 11/07/2021 11:32:51 - INFO - __main__ - Step 102000: {'lr': 0.0001189276979876606, 'samples': 19584000, 'steps': 101999, 'loss/train': 1.4326636791229248} 11/07/2021 11:32:51 - INFO - __main__ - Step 102001: {'lr': 0.00011892317911069211, 'samples': 19584192, 'steps': 102000, 'loss/train': 1.583204746246338} 11/07/2021 11:32:52 - INFO - __main__ - Step 102002: {'lr': 0.00011891866029278483, 'samples': 19584384, 'steps': 102001, 'loss/train': 1.031388759613037} 11/07/2021 11:32:53 - INFO - __main__ - Step 102003: {'lr': 0.00011891414153394078, 'samples': 19584576, 'steps': 102002, 'loss/train': 1.4684607982635498} 11/07/2021 11:32:53 - INFO - __main__ - Step 102004: {'lr': 0.00011890962283416198, 'samples': 19584768, 'steps': 102003, 'loss/train': 1.7611174583435059} 11/07/2021 11:32:54 - INFO - __main__ - Step 102005: {'lr': 0.00011890510419345049, 'samples': 19584960, 'steps': 102004, 'loss/train': 1.4575424194335938} 11/07/2021 11:32:54 - INFO - __main__ - Step 102006: {'lr': 0.00011890058561180836, 'samples': 19585152, 'steps': 102005, 'loss/train': 1.0098702907562256} 11/07/2021 11:32:54 - INFO - __main__ - Step 102007: {'lr': 0.00011889606708923759, 'samples': 19585344, 'steps': 102006, 'loss/train': 0.9316961765289307} 11/07/2021 11:32:55 - INFO - __main__ - Step 102008: {'lr': 0.0001188915486257402, 'samples': 19585536, 'steps': 102007, 'loss/train': 0.8670324683189392} 11/07/2021 11:32:56 - INFO - __main__ - Step 102009: {'lr': 0.00011888703022131827, 'samples': 19585728, 'steps': 102008, 'loss/train': 1.6394069194793701} 11/07/2021 11:32:56 - INFO - __main__ - Step 102010: {'lr': 0.00011888251187597393, 'samples': 19585920, 'steps': 102009, 'loss/train': 2.1763927936553955} 11/07/2021 11:32:56 - INFO - __main__ - Step 102011: {'lr': 0.00011887799358970902, 'samples': 19586112, 'steps': 102010, 'loss/train': 2.136354446411133} 11/07/2021 11:32:57 - INFO - __main__ - Step 102012: {'lr': 0.00011887347536252565, 'samples': 19586304, 'steps': 102011, 'loss/train': 0.9820921421051025} 11/07/2021 11:32:58 - INFO - __main__ - Step 102013: {'lr': 0.00011886895719442587, 'samples': 19586496, 'steps': 102012, 'loss/train': 1.5362489223480225} 11/07/2021 11:32:58 - INFO - __main__ - Step 102014: {'lr': 0.0001188644390854117, 'samples': 19586688, 'steps': 102013, 'loss/train': 1.3481334447860718} 11/07/2021 11:32:58 - INFO - __main__ - Step 102015: {'lr': 0.0001188599210354852, 'samples': 19586880, 'steps': 102014, 'loss/train': 1.44599187374115} 11/07/2021 11:32:59 - INFO - __main__ - Step 102016: {'lr': 0.0001188554030446484, 'samples': 19587072, 'steps': 102015, 'loss/train': 0.9680827260017395} 11/07/2021 11:32:59 - INFO - __main__ - Step 102017: {'lr': 0.00011885088511290332, 'samples': 19587264, 'steps': 102016, 'loss/train': 1.5620003938674927} 11/07/2021 11:33:00 - INFO - __main__ - Step 102018: {'lr': 0.00011884636724025202, 'samples': 19587456, 'steps': 102017, 'loss/train': 1.7865349054336548} 11/07/2021 11:33:01 - INFO - __main__ - Step 102019: {'lr': 0.00011884184942669651, 'samples': 19587648, 'steps': 102018, 'loss/train': 1.9214707612991333} 11/07/2021 11:33:01 - INFO - __main__ - Step 102020: {'lr': 0.00011883733167223887, 'samples': 19587840, 'steps': 102019, 'loss/train': 1.264249324798584} 11/07/2021 11:33:01 - INFO - __main__ - Step 102021: {'lr': 0.00011883281397688109, 'samples': 19588032, 'steps': 102020, 'loss/train': 1.4549715518951416} 11/07/2021 11:33:02 - INFO - __main__ - Step 102022: {'lr': 0.0001188282963406252, 'samples': 19588224, 'steps': 102021, 'loss/train': 1.6413310766220093} 11/07/2021 11:33:02 - INFO - __main__ - Step 102023: {'lr': 0.00011882377876347327, 'samples': 19588416, 'steps': 102022, 'loss/train': 1.5001435279846191} 11/07/2021 11:33:03 - INFO - __main__ - Step 102024: {'lr': 0.0001188192612454274, 'samples': 19588608, 'steps': 102023, 'loss/train': 1.2040361166000366} 11/07/2021 11:33:03 - INFO - __main__ - Step 102025: {'lr': 0.00011881474378648949, 'samples': 19588800, 'steps': 102024, 'loss/train': 1.437766671180725} 11/07/2021 11:33:04 - INFO - __main__ - Step 102026: {'lr': 0.00011881022638666158, 'samples': 19588992, 'steps': 102025, 'loss/train': 1.583815097808838} 11/07/2021 11:33:04 - INFO - __main__ - Step 102027: {'lr': 0.00011880570904594582, 'samples': 19589184, 'steps': 102026, 'loss/train': 1.5151596069335938} 11/07/2021 11:33:04 - INFO - __main__ - Step 102028: {'lr': 0.00011880119176434411, 'samples': 19589376, 'steps': 102027, 'loss/train': 1.5066347122192383} 11/07/2021 11:33:06 - INFO - __main__ - Step 102029: {'lr': 0.0001187966745418586, 'samples': 19589568, 'steps': 102028, 'loss/train': 1.5210418701171875} 11/07/2021 11:33:06 - INFO - __main__ - Step 102030: {'lr': 0.00011879215737849131, 'samples': 19589760, 'steps': 102029, 'loss/train': 1.5943197011947632} 11/07/2021 11:33:06 - INFO - __main__ - Step 102031: {'lr': 0.00011878764027424421, 'samples': 19589952, 'steps': 102030, 'loss/train': 1.7166959047317505} 11/07/2021 11:33:07 - INFO - __main__ - Step 102032: {'lr': 0.00011878312322911938, 'samples': 19590144, 'steps': 102031, 'loss/train': 1.444366216659546} 11/07/2021 11:33:07 - INFO - __main__ - Step 102033: {'lr': 0.00011877860624311886, 'samples': 19590336, 'steps': 102032, 'loss/train': 0.5609753131866455} 11/07/2021 11:33:08 - INFO - __main__ - Step 102034: {'lr': 0.00011877408931624467, 'samples': 19590528, 'steps': 102033, 'loss/train': 0.8688574433326721} 11/07/2021 11:33:09 - INFO - __main__ - Step 102035: {'lr': 0.00011876957244849884, 'samples': 19590720, 'steps': 102034, 'loss/train': 1.394185185432434} 11/07/2021 11:33:09 - INFO - __main__ - Step 102036: {'lr': 0.00011876505563988344, 'samples': 19590912, 'steps': 102035, 'loss/train': 1.7535878419876099} 11/07/2021 11:33:09 - INFO - __main__ - Step 102037: {'lr': 0.00011876053889040056, 'samples': 19591104, 'steps': 102036, 'loss/train': 1.4878109693527222} 11/07/2021 11:33:10 - INFO - __main__ - Step 102038: {'lr': 0.00011875602220005204, 'samples': 19591296, 'steps': 102037, 'loss/train': 1.7310230731964111} 11/07/2021 11:33:10 - INFO - __main__ - Step 102039: {'lr': 0.00011875150556884006, 'samples': 19591488, 'steps': 102038, 'loss/train': 1.2807016372680664} 11/07/2021 11:33:11 - INFO - __main__ - Step 102040: {'lr': 0.00011874698899676665, 'samples': 19591680, 'steps': 102039, 'loss/train': 0.8907776474952698} 11/07/2021 11:33:11 - INFO - __main__ - Step 102041: {'lr': 0.00011874247248383376, 'samples': 19591872, 'steps': 102040, 'loss/train': 1.4287959337234497} 11/07/2021 11:33:12 - INFO - __main__ - Step 102042: {'lr': 0.00011873795603004353, 'samples': 19592064, 'steps': 102041, 'loss/train': 1.1584445238113403} 11/07/2021 11:33:12 - INFO - __main__ - Step 102043: {'lr': 0.00011873343963539795, 'samples': 19592256, 'steps': 102042, 'loss/train': 1.1831058263778687} 11/07/2021 11:33:12 - INFO - __main__ - Step 102044: {'lr': 0.00011872892329989904, 'samples': 19592448, 'steps': 102043, 'loss/train': 1.2182902097702026} 11/07/2021 11:33:14 - INFO - __main__ - Step 102045: {'lr': 0.00011872440702354887, 'samples': 19592640, 'steps': 102044, 'loss/train': 1.6218160390853882} 11/07/2021 11:33:14 - INFO - __main__ - Step 102046: {'lr': 0.00011871989080634943, 'samples': 19592832, 'steps': 102045, 'loss/train': 1.0441070795059204} 11/07/2021 11:33:14 - INFO - __main__ - Step 102047: {'lr': 0.00011871537464830278, 'samples': 19593024, 'steps': 102046, 'loss/train': 1.271759033203125} 11/07/2021 11:33:15 - INFO - __main__ - Step 102048: {'lr': 0.00011871085854941099, 'samples': 19593216, 'steps': 102047, 'loss/train': 1.7573003768920898} 11/07/2021 11:33:15 - INFO - __main__ - Step 102049: {'lr': 0.00011870634250967604, 'samples': 19593408, 'steps': 102048, 'loss/train': 1.3595495223999023} 11/07/2021 11:33:15 - INFO - __main__ - Step 102050: {'lr': 0.0001187018265291, 'samples': 19593600, 'steps': 102049, 'loss/train': 1.4571330547332764} 11/07/2021 11:33:17 - INFO - __main__ - Step 102051: {'lr': 0.00011869731060768496, 'samples': 19593792, 'steps': 102050, 'loss/train': 1.0398119688034058} 11/07/2021 11:33:17 - INFO - __main__ - Step 102052: {'lr': 0.00011869279474543282, 'samples': 19593984, 'steps': 102051, 'loss/train': 1.0795756578445435} 11/07/2021 11:33:17 - INFO - __main__ - Step 102053: {'lr': 0.00011868827894234566, 'samples': 19594176, 'steps': 102052, 'loss/train': 1.3441764116287231} 11/07/2021 11:33:18 - INFO - __main__ - Step 102054: {'lr': 0.00011868376319842552, 'samples': 19594368, 'steps': 102053, 'loss/train': 0.9405523538589478} 11/07/2021 11:33:18 - INFO - __main__ - Step 102055: {'lr': 0.00011867924751367448, 'samples': 19594560, 'steps': 102054, 'loss/train': 1.222645878791809} 11/07/2021 11:33:19 - INFO - __main__ - Step 102056: {'lr': 0.00011867473188809455, 'samples': 19594752, 'steps': 102055, 'loss/train': 0.09407708048820496} 11/07/2021 11:33:20 - INFO - __main__ - Step 102057: {'lr': 0.00011867021632168774, 'samples': 19594944, 'steps': 102056, 'loss/train': 2.461456060409546} 11/07/2021 11:33:20 - INFO - __main__ - Step 102058: {'lr': 0.0001186657008144561, 'samples': 19595136, 'steps': 102057, 'loss/train': 0.7057860493659973} 11/07/2021 11:33:20 - INFO - __main__ - Step 102059: {'lr': 0.00011866118536640169, 'samples': 19595328, 'steps': 102058, 'loss/train': 2.089855194091797} 11/07/2021 11:33:21 - INFO - __main__ - Step 102060: {'lr': 0.0001186566699775265, 'samples': 19595520, 'steps': 102059, 'loss/train': 1.3916925191879272} 11/07/2021 11:33:22 - INFO - __main__ - Step 102061: {'lr': 0.0001186521546478326, 'samples': 19595712, 'steps': 102060, 'loss/train': 1.328039526939392} 11/07/2021 11:33:22 - INFO - __main__ - Step 102062: {'lr': 0.00011864763937732203, 'samples': 19595904, 'steps': 102061, 'loss/train': 1.3900386095046997} 11/07/2021 11:33:22 - INFO - __main__ - Step 102063: {'lr': 0.00011864312416599679, 'samples': 19596096, 'steps': 102062, 'loss/train': 1.0423777103424072} 11/07/2021 11:33:23 - INFO - __main__ - Step 102064: {'lr': 0.00011863860901385901, 'samples': 19596288, 'steps': 102063, 'loss/train': 0.5821849703788757} 11/07/2021 11:33:23 - INFO - __main__ - Step 102065: {'lr': 0.00011863409392091056, 'samples': 19596480, 'steps': 102064, 'loss/train': 0.5446491241455078} 11/07/2021 11:33:24 - INFO - __main__ - Step 102066: {'lr': 0.00011862957888715359, 'samples': 19596672, 'steps': 102065, 'loss/train': 1.3294527530670166} 11/07/2021 11:33:24 - INFO - __main__ - Step 102067: {'lr': 0.00011862506391259006, 'samples': 19596864, 'steps': 102066, 'loss/train': 1.1237123012542725} 11/07/2021 11:33:25 - INFO - __main__ - Step 102068: {'lr': 0.00011862054899722207, 'samples': 19597056, 'steps': 102067, 'loss/train': 0.852388322353363} 11/07/2021 11:33:25 - INFO - __main__ - Step 102069: {'lr': 0.00011861603414105163, 'samples': 19597248, 'steps': 102068, 'loss/train': 1.3625762462615967} 11/07/2021 11:33:26 - INFO - __main__ - Step 102070: {'lr': 0.0001186115193440808, 'samples': 19597440, 'steps': 102069, 'loss/train': 1.309332013130188} 11/07/2021 11:33:27 - INFO - __main__ - Step 102071: {'lr': 0.00011860700460631155, 'samples': 19597632, 'steps': 102070, 'loss/train': 1.2867941856384277} 11/07/2021 11:33:27 - INFO - __main__ - Step 102072: {'lr': 0.000118602489927746, 'samples': 19597824, 'steps': 102071, 'loss/train': 1.2269381284713745} 11/07/2021 11:33:27 - INFO - __main__ - Step 102073: {'lr': 0.00011859797530838611, 'samples': 19598016, 'steps': 102072, 'loss/train': 1.2176246643066406} 11/07/2021 11:33:28 - INFO - __main__ - Step 102074: {'lr': 0.00011859346074823397, 'samples': 19598208, 'steps': 102073, 'loss/train': 1.0901600122451782} 11/07/2021 11:33:28 - INFO - __main__ - Step 102075: {'lr': 0.00011858894624729155, 'samples': 19598400, 'steps': 102074, 'loss/train': 1.3578321933746338} 11/07/2021 11:33:29 - INFO - __main__ - Step 102076: {'lr': 0.00011858443180556094, 'samples': 19598592, 'steps': 102075, 'loss/train': 1.1341487169265747} 11/07/2021 11:33:29 - INFO - __main__ - Step 102077: {'lr': 0.00011857991742304417, 'samples': 19598784, 'steps': 102076, 'loss/train': 1.6641494035720825} 11/07/2021 11:33:30 - INFO - __main__ - Step 102078: {'lr': 0.00011857540309974335, 'samples': 19598976, 'steps': 102077, 'loss/train': 1.295502781867981} 11/07/2021 11:33:30 - INFO - __main__ - Step 102079: {'lr': 0.00011857088883566033, 'samples': 19599168, 'steps': 102078, 'loss/train': 1.7939642667770386} 11/07/2021 11:33:30 - INFO - __main__ - Step 102080: {'lr': 0.00011856637463079723, 'samples': 19599360, 'steps': 102079, 'loss/train': 1.5644391775131226} 11/07/2021 11:33:31 - INFO - __main__ - Step 102081: {'lr': 0.0001185618604851561, 'samples': 19599552, 'steps': 102080, 'loss/train': 1.4494080543518066} 11/07/2021 11:33:32 - INFO - __main__ - Step 102082: {'lr': 0.00011855734639873897, 'samples': 19599744, 'steps': 102081, 'loss/train': 1.246366262435913} 11/07/2021 11:33:32 - INFO - __main__ - Step 102083: {'lr': 0.00011855283237154788, 'samples': 19599936, 'steps': 102082, 'loss/train': 1.485161304473877} 11/07/2021 11:33:33 - INFO - __main__ - Step 102084: {'lr': 0.00011854831840358485, 'samples': 19600128, 'steps': 102083, 'loss/train': 1.5706098079681396} 11/07/2021 11:33:33 - INFO - __main__ - Step 102085: {'lr': 0.00011854380449485191, 'samples': 19600320, 'steps': 102084, 'loss/train': 1.085910677909851} 11/07/2021 11:33:33 - INFO - __main__ - Step 102086: {'lr': 0.0001185392906453511, 'samples': 19600512, 'steps': 102085, 'loss/train': 1.1394593715667725} 11/07/2021 11:33:34 - INFO - __main__ - Step 102087: {'lr': 0.00011853477685508445, 'samples': 19600704, 'steps': 102086, 'loss/train': 1.415819764137268} 11/07/2021 11:33:35 - INFO - __main__ - Step 102088: {'lr': 0.00011853026312405404, 'samples': 19600896, 'steps': 102087, 'loss/train': 1.563494324684143} 11/07/2021 11:33:35 - INFO - __main__ - Step 102089: {'lr': 0.00011852574945226183, 'samples': 19601088, 'steps': 102088, 'loss/train': 1.6670862436294556} 11/07/2021 11:33:35 - INFO - __main__ - Step 102090: {'lr': 0.00011852123583970992, 'samples': 19601280, 'steps': 102089, 'loss/train': 0.9076662659645081} 11/07/2021 11:33:36 - INFO - __main__ - Step 102091: {'lr': 0.00011851672228640037, 'samples': 19601472, 'steps': 102090, 'loss/train': 1.4842113256454468} 11/07/2021 11:33:37 - INFO - __main__ - Step 102092: {'lr': 0.0001185122087923351, 'samples': 19601664, 'steps': 102091, 'loss/train': 1.2249138355255127} 11/07/2021 11:33:37 - INFO - __main__ - Step 102093: {'lr': 0.00011850769535751615, 'samples': 19601856, 'steps': 102092, 'loss/train': 1.0383362770080566} 11/07/2021 11:33:38 - INFO - __main__ - Step 102094: {'lr': 0.00011850318198194565, 'samples': 19602048, 'steps': 102093, 'loss/train': 1.9595730304718018} 11/07/2021 11:33:38 - INFO - __main__ - Step 102095: {'lr': 0.00011849866866562556, 'samples': 19602240, 'steps': 102094, 'loss/train': 1.4590457677841187} 11/07/2021 11:33:38 - INFO - __main__ - Step 102096: {'lr': 0.00011849415540855795, 'samples': 19602432, 'steps': 102095, 'loss/train': 1.7666720151901245} 11/07/2021 11:33:39 - INFO - __main__ - Step 102097: {'lr': 0.00011848964221074485, 'samples': 19602624, 'steps': 102096, 'loss/train': 1.3342540264129639} 11/07/2021 11:33:40 - INFO - __main__ - Step 102098: {'lr': 0.00011848512907218828, 'samples': 19602816, 'steps': 102097, 'loss/train': 1.5422577857971191} 11/07/2021 11:33:40 - INFO - __main__ - Step 102099: {'lr': 0.00011848061599289029, 'samples': 19603008, 'steps': 102098, 'loss/train': 0.727121889591217} 11/07/2021 11:33:41 - INFO - __main__ - Step 102100: {'lr': 0.00011847610297285288, 'samples': 19603200, 'steps': 102099, 'loss/train': 1.5066097974777222} 11/07/2021 11:33:41 - INFO - __main__ - Step 102101: {'lr': 0.00011847159001207813, 'samples': 19603392, 'steps': 102100, 'loss/train': 1.2890253067016602} 11/07/2021 11:33:41 - INFO - __main__ - Step 102102: {'lr': 0.00011846707711056806, 'samples': 19603584, 'steps': 102101, 'loss/train': 1.660158634185791} 11/07/2021 11:33:42 - INFO - __main__ - Step 102103: {'lr': 0.00011846256426832466, 'samples': 19603776, 'steps': 102102, 'loss/train': 1.1805930137634277} 11/07/2021 11:33:43 - INFO - __main__ - Step 102104: {'lr': 0.00011845805148535005, 'samples': 19603968, 'steps': 102103, 'loss/train': 1.1851638555526733} 11/07/2021 11:33:43 - INFO - __main__ - Step 102105: {'lr': 0.00011845353876164627, 'samples': 19604160, 'steps': 102104, 'loss/train': 0.9792910814285278} 11/07/2021 11:33:43 - INFO - __main__ - Step 102106: {'lr': 0.0001184490260972152, 'samples': 19604352, 'steps': 102105, 'loss/train': 1.3051059246063232} 11/07/2021 11:33:44 - INFO - __main__ - Step 102107: {'lr': 0.00011844451349205898, 'samples': 19604544, 'steps': 102106, 'loss/train': 0.9681282043457031} 11/07/2021 11:33:45 - INFO - __main__ - Step 102108: {'lr': 0.00011844000094617963, 'samples': 19604736, 'steps': 102107, 'loss/train': 1.3090310096740723} 11/07/2021 11:33:45 - INFO - __main__ - Step 102109: {'lr': 0.0001184354884595792, 'samples': 19604928, 'steps': 102108, 'loss/train': 1.168015480041504} 11/07/2021 11:33:45 - INFO - __main__ - Step 102110: {'lr': 0.0001184309760322597, 'samples': 19605120, 'steps': 102109, 'loss/train': 1.6785130500793457} 11/07/2021 11:33:46 - INFO - __main__ - Step 102111: {'lr': 0.00011842646366422317, 'samples': 19605312, 'steps': 102110, 'loss/train': 1.08878493309021} 11/07/2021 11:33:46 - INFO - __main__ - Step 102112: {'lr': 0.00011842195135547162, 'samples': 19605504, 'steps': 102111, 'loss/train': 1.1508371829986572} 11/07/2021 11:33:46 - INFO - __main__ - Step 102113: {'lr': 0.00011841743910600713, 'samples': 19605696, 'steps': 102112, 'loss/train': 1.6124448776245117} 11/07/2021 11:33:48 - INFO - __main__ - Step 102114: {'lr': 0.00011841292691583172, 'samples': 19605888, 'steps': 102113, 'loss/train': 1.7864978313446045} 11/07/2021 11:33:48 - INFO - __main__ - Step 102115: {'lr': 0.0001184084147849474, 'samples': 19606080, 'steps': 102114, 'loss/train': 1.910715937614441} 11/07/2021 11:33:48 - INFO - __main__ - Step 102116: {'lr': 0.00011840390271335624, 'samples': 19606272, 'steps': 102115, 'loss/train': 1.211980938911438} 11/07/2021 11:33:49 - INFO - __main__ - Step 102117: {'lr': 0.00011839939070106032, 'samples': 19606464, 'steps': 102116, 'loss/train': 1.6363883018493652} 11/07/2021 11:33:49 - INFO - __main__ - Step 102118: {'lr': 0.00011839487874806152, 'samples': 19606656, 'steps': 102117, 'loss/train': 1.3956481218338013} 11/07/2021 11:33:50 - INFO - __main__ - Step 102119: {'lr': 0.00011839036685436198, 'samples': 19606848, 'steps': 102118, 'loss/train': 1.5180331468582153} 11/07/2021 11:33:51 - INFO - __main__ - Step 102120: {'lr': 0.00011838585501996366, 'samples': 19607040, 'steps': 102119, 'loss/train': 1.4733465909957886} 11/07/2021 11:33:51 - INFO - __main__ - Step 102121: {'lr': 0.00011838134324486869, 'samples': 19607232, 'steps': 102120, 'loss/train': 0.14303696155548096} 11/07/2021 11:33:51 - INFO - __main__ - Step 102122: {'lr': 0.00011837683152907902, 'samples': 19607424, 'steps': 102121, 'loss/train': 1.3340239524841309} 11/07/2021 11:33:52 - INFO - __main__ - Step 102123: {'lr': 0.00011837231987259672, 'samples': 19607616, 'steps': 102122, 'loss/train': 1.4670480489730835} 11/07/2021 11:33:53 - INFO - __main__ - Step 102124: {'lr': 0.00011836780827542385, 'samples': 19607808, 'steps': 102123, 'loss/train': 1.2717679738998413} 11/07/2021 11:33:53 - INFO - __main__ - Step 102125: {'lr': 0.00011836329673756238, 'samples': 19608000, 'steps': 102124, 'loss/train': 1.0293292999267578} 11/07/2021 11:33:54 - INFO - __main__ - Step 102126: {'lr': 0.00011835878525901442, 'samples': 19608192, 'steps': 102125, 'loss/train': 1.5234235525131226} 11/07/2021 11:33:54 - INFO - __main__ - Step 102127: {'lr': 0.00011835427383978192, 'samples': 19608384, 'steps': 102126, 'loss/train': 1.7840124368667603} 11/07/2021 11:33:54 - INFO - __main__ - Step 102128: {'lr': 0.00011834976247986706, 'samples': 19608576, 'steps': 102127, 'loss/train': 1.6133649349212646} 11/07/2021 11:33:55 - INFO - __main__ - Step 102129: {'lr': 0.0001183452511792717, 'samples': 19608768, 'steps': 102128, 'loss/train': 1.7376123666763306} 11/07/2021 11:33:56 - INFO - __main__ - Step 102130: {'lr': 0.0001183407399379979, 'samples': 19608960, 'steps': 102129, 'loss/train': 1.3051105737686157} 11/07/2021 11:33:56 - INFO - __main__ - Step 102131: {'lr': 0.00011833622875604774, 'samples': 19609152, 'steps': 102130, 'loss/train': 0.8114134073257446} 11/07/2021 11:33:56 - INFO - __main__ - Step 102132: {'lr': 0.00011833171763342324, 'samples': 19609344, 'steps': 102131, 'loss/train': 1.0677529573440552} 11/07/2021 11:33:57 - INFO - __main__ - Step 102133: {'lr': 0.00011832720657012644, 'samples': 19609536, 'steps': 102132, 'loss/train': 1.3927536010742188} 11/07/2021 11:33:58 - INFO - __main__ - Step 102134: {'lr': 0.00011832269556615938, 'samples': 19609728, 'steps': 102133, 'loss/train': 1.3751760721206665} 11/07/2021 11:33:58 - INFO - __main__ - Step 102135: {'lr': 0.00011831818462152408, 'samples': 19609920, 'steps': 102134, 'loss/train': 1.231802225112915} 11/07/2021 11:33:59 - INFO - __main__ - Step 102136: {'lr': 0.00011831367373622256, 'samples': 19610112, 'steps': 102135, 'loss/train': 1.716078519821167} 11/07/2021 11:33:59 - INFO - __main__ - Step 102137: {'lr': 0.00011830916291025687, 'samples': 19610304, 'steps': 102136, 'loss/train': 1.764298677444458} 11/07/2021 11:33:59 - INFO - __main__ - Step 102138: {'lr': 0.00011830465214362907, 'samples': 19610496, 'steps': 102137, 'loss/train': 1.1540642976760864} 11/07/2021 11:34:00 - INFO - __main__ - Step 102139: {'lr': 0.00011830014143634121, 'samples': 19610688, 'steps': 102138, 'loss/train': 0.36864566802978516} 11/07/2021 11:34:01 - INFO - __main__ - Step 102140: {'lr': 0.0001182956307883952, 'samples': 19610880, 'steps': 102139, 'loss/train': 1.526396632194519} 11/07/2021 11:34:01 - INFO - __main__ - Step 102141: {'lr': 0.00011829112019979316, 'samples': 19611072, 'steps': 102140, 'loss/train': 1.487018346786499} 11/07/2021 11:34:01 - INFO - __main__ - Step 102142: {'lr': 0.00011828660967053709, 'samples': 19611264, 'steps': 102141, 'loss/train': 1.1893435716629028} 11/07/2021 11:34:02 - INFO - __main__ - Step 102143: {'lr': 0.00011828209920062905, 'samples': 19611456, 'steps': 102142, 'loss/train': 1.4158143997192383} 11/07/2021 11:34:02 - INFO - __main__ - Step 102144: {'lr': 0.00011827758879007105, 'samples': 19611648, 'steps': 102143, 'loss/train': 1.859777569770813} 11/07/2021 11:34:03 - INFO - __main__ - Step 102145: {'lr': 0.00011827307843886514, 'samples': 19611840, 'steps': 102144, 'loss/train': 0.9460919499397278} 11/07/2021 11:34:04 - INFO - __main__ - Step 102146: {'lr': 0.00011826856814701336, 'samples': 19612032, 'steps': 102145, 'loss/train': 0.35454511642456055} 11/07/2021 11:34:04 - INFO - __main__ - Step 102147: {'lr': 0.00011826405791451772, 'samples': 19612224, 'steps': 102146, 'loss/train': 1.2467687129974365} 11/07/2021 11:34:04 - INFO - __main__ - Step 102148: {'lr': 0.00011825954774138025, 'samples': 19612416, 'steps': 102147, 'loss/train': 1.376586675643921} 11/07/2021 11:34:05 - INFO - __main__ - Step 102149: {'lr': 0.00011825503762760303, 'samples': 19612608, 'steps': 102148, 'loss/train': 1.468748688697815} 11/07/2021 11:34:06 - INFO - __main__ - Step 102150: {'lr': 0.00011825052757318813, 'samples': 19612800, 'steps': 102149, 'loss/train': 1.3582197427749634} 11/07/2021 11:34:06 - INFO - __main__ - Step 102151: {'lr': 0.00011824601757813741, 'samples': 19612992, 'steps': 102150, 'loss/train': 1.021825909614563} 11/07/2021 11:34:06 - INFO - __main__ - Step 102152: {'lr': 0.00011824150764245301, 'samples': 19613184, 'steps': 102151, 'loss/train': 1.0874744653701782} 11/07/2021 11:34:07 - INFO - __main__ - Step 102153: {'lr': 0.00011823699776613698, 'samples': 19613376, 'steps': 102152, 'loss/train': 1.2012673616409302} 11/07/2021 11:34:07 - INFO - __main__ - Step 102154: {'lr': 0.00011823248794919128, 'samples': 19613568, 'steps': 102153, 'loss/train': 1.5170531272888184} 11/07/2021 11:34:08 - INFO - __main__ - Step 102155: {'lr': 0.00011822797819161802, 'samples': 19613760, 'steps': 102154, 'loss/train': 1.2236672639846802} 11/07/2021 11:34:08 - INFO - __main__ - Step 102156: {'lr': 0.00011822346849341917, 'samples': 19613952, 'steps': 102155, 'loss/train': 1.633689045906067} 11/07/2021 11:34:09 - INFO - __main__ - Step 102157: {'lr': 0.0001182189588545968, 'samples': 19614144, 'steps': 102156, 'loss/train': 1.4057704210281372} 11/07/2021 11:34:09 - INFO - __main__ - Step 102158: {'lr': 0.00011821444927515296, 'samples': 19614336, 'steps': 102157, 'loss/train': 2.0702521800994873} 11/07/2021 11:34:09 - INFO - __main__ - Step 102159: {'lr': 0.00011820993975508962, 'samples': 19614528, 'steps': 102158, 'loss/train': 1.4755806922912598} 11/07/2021 11:34:10 - INFO - __main__ - Step 102160: {'lr': 0.00011820543029440887, 'samples': 19614720, 'steps': 102159, 'loss/train': 1.066344141960144} 11/07/2021 11:34:11 - INFO - __main__ - Step 102161: {'lr': 0.0001182009208931128, 'samples': 19614912, 'steps': 102160, 'loss/train': 1.6506186723709106} 11/07/2021 11:34:11 - INFO - __main__ - Step 102162: {'lr': 0.00011819641155120328, 'samples': 19615104, 'steps': 102161, 'loss/train': 0.7008488178253174} 11/07/2021 11:34:12 - INFO - __main__ - Step 102163: {'lr': 0.00011819190226868242, 'samples': 19615296, 'steps': 102162, 'loss/train': 1.470747709274292} 11/07/2021 11:34:12 - INFO - __main__ - Step 102164: {'lr': 0.00011818739304555227, 'samples': 19615488, 'steps': 102163, 'loss/train': 1.0236279964447021} 11/07/2021 11:34:12 - INFO - __main__ - Step 102165: {'lr': 0.0001181828838818148, 'samples': 19615680, 'steps': 102164, 'loss/train': 1.1078416109085083} 11/07/2021 11:34:13 - INFO - __main__ - Step 102166: {'lr': 0.00011817837477747212, 'samples': 19615872, 'steps': 102165, 'loss/train': 1.106970191001892} 11/07/2021 11:34:14 - INFO - __main__ - Step 102167: {'lr': 0.00011817386573252623, 'samples': 19616064, 'steps': 102166, 'loss/train': 1.5874117612838745} 11/07/2021 11:34:14 - INFO - __main__ - Step 102168: {'lr': 0.00011816935674697918, 'samples': 19616256, 'steps': 102167, 'loss/train': 1.4178863763809204} 11/07/2021 11:34:14 - INFO - __main__ - Step 102169: {'lr': 0.00011816484782083295, 'samples': 19616448, 'steps': 102168, 'loss/train': 1.1615993976593018} 11/07/2021 11:34:15 - INFO - __main__ - Step 102170: {'lr': 0.00011816033895408962, 'samples': 19616640, 'steps': 102169, 'loss/train': 0.9611237049102783} 11/07/2021 11:34:16 - INFO - __main__ - Step 102171: {'lr': 0.00011815583014675121, 'samples': 19616832, 'steps': 102170, 'loss/train': 0.9670572280883789} 11/07/2021 11:34:16 - INFO - __main__ - Step 102172: {'lr': 0.00011815132139881984, 'samples': 19617024, 'steps': 102171, 'loss/train': 1.5054211616516113} 11/07/2021 11:34:17 - INFO - __main__ - Step 102173: {'lr': 0.00011814681271029734, 'samples': 19617216, 'steps': 102172, 'loss/train': 1.3661881685256958} 11/07/2021 11:34:17 - INFO - __main__ - Step 102174: {'lr': 0.00011814230408118587, 'samples': 19617408, 'steps': 102173, 'loss/train': 1.3891531229019165} 11/07/2021 11:34:17 - INFO - __main__ - Step 102175: {'lr': 0.00011813779551148745, 'samples': 19617600, 'steps': 102174, 'loss/train': 1.4348845481872559} 11/07/2021 11:34:18 - INFO - __main__ - Step 102176: {'lr': 0.0001181332870012041, 'samples': 19617792, 'steps': 102175, 'loss/train': 1.2881965637207031} 11/07/2021 11:34:19 - INFO - __main__ - Step 102177: {'lr': 0.00011812877855033782, 'samples': 19617984, 'steps': 102176, 'loss/train': 1.135132908821106} 11/07/2021 11:34:19 - INFO - __main__ - Step 102178: {'lr': 0.00011812427015889071, 'samples': 19618176, 'steps': 102177, 'loss/train': 1.329923152923584} 11/07/2021 11:34:19 - INFO - __main__ - Step 102179: {'lr': 0.00011811976182686479, 'samples': 19618368, 'steps': 102178, 'loss/train': 1.4544795751571655} 11/07/2021 11:34:20 - INFO - __main__ - Step 102180: {'lr': 0.00011811525355426204, 'samples': 19618560, 'steps': 102179, 'loss/train': 1.5149328708648682} 11/07/2021 11:34:21 - INFO - __main__ - Step 102181: {'lr': 0.00011811074534108451, 'samples': 19618752, 'steps': 102180, 'loss/train': 1.262332558631897} 11/07/2021 11:34:21 - INFO - __main__ - Step 102182: {'lr': 0.00011810623718733426, 'samples': 19618944, 'steps': 102181, 'loss/train': 1.3752212524414062} 11/07/2021 11:34:22 - INFO - __main__ - Step 102183: {'lr': 0.00011810172909301331, 'samples': 19619136, 'steps': 102182, 'loss/train': 1.2206593751907349} 11/07/2021 11:34:22 - INFO - __main__ - Step 102184: {'lr': 0.00011809722105812367, 'samples': 19619328, 'steps': 102183, 'loss/train': 0.7191754579544067} 11/07/2021 11:34:22 - INFO - __main__ - Step 102185: {'lr': 0.0001180927130826675, 'samples': 19619520, 'steps': 102184, 'loss/train': 1.1161072254180908} 11/07/2021 11:34:23 - INFO - __main__ - Step 102186: {'lr': 0.00011808820516664662, 'samples': 19619712, 'steps': 102185, 'loss/train': 1.2352454662322998} 11/07/2021 11:34:24 - INFO - __main__ - Step 102187: {'lr': 0.00011808369731006315, 'samples': 19619904, 'steps': 102186, 'loss/train': 0.7559825778007507} 11/07/2021 11:34:24 - INFO - __main__ - Step 102188: {'lr': 0.00011807918951291916, 'samples': 19620096, 'steps': 102187, 'loss/train': 1.3782036304473877} 11/07/2021 11:34:24 - INFO - __main__ - Step 102189: {'lr': 0.0001180746817752166, 'samples': 19620288, 'steps': 102188, 'loss/train': 0.9898638129234314} 11/07/2021 11:34:25 - INFO - __main__ - Step 102190: {'lr': 0.00011807017409695758, 'samples': 19620480, 'steps': 102189, 'loss/train': 1.2362226247787476} 11/07/2021 11:34:26 - INFO - __main__ - Step 102191: {'lr': 0.00011806566647814412, 'samples': 19620672, 'steps': 102190, 'loss/train': 1.3159438371658325} 11/07/2021 11:34:26 - INFO - __main__ - Step 102192: {'lr': 0.00011806115891877822, 'samples': 19620864, 'steps': 102191, 'loss/train': 1.1066577434539795} 11/07/2021 11:34:27 - INFO - __main__ - Step 102193: {'lr': 0.00011805665141886191, 'samples': 19621056, 'steps': 102192, 'loss/train': 1.297023057937622} 11/07/2021 11:34:27 - INFO - __main__ - Step 102194: {'lr': 0.00011805214397839725, 'samples': 19621248, 'steps': 102193, 'loss/train': 0.9547217488288879} 11/07/2021 11:34:27 - INFO - __main__ - Step 102195: {'lr': 0.00011804763659738626, 'samples': 19621440, 'steps': 102194, 'loss/train': 1.4266992807388306} 11/07/2021 11:34:28 - INFO - __main__ - Step 102196: {'lr': 0.00011804312927583097, 'samples': 19621632, 'steps': 102195, 'loss/train': 1.303557276725769} 11/07/2021 11:34:29 - INFO - __main__ - Step 102197: {'lr': 0.00011803862201373342, 'samples': 19621824, 'steps': 102196, 'loss/train': 1.2874891757965088} 11/07/2021 11:34:29 - INFO - __main__ - Step 102198: {'lr': 0.00011803411481109561, 'samples': 19622016, 'steps': 102197, 'loss/train': 1.326424241065979} 11/07/2021 11:34:29 - INFO - __main__ - Step 102199: {'lr': 0.0001180296076679197, 'samples': 19622208, 'steps': 102198, 'loss/train': 0.7824151515960693} 11/07/2021 11:34:30 - INFO - __main__ - Step 102200: {'lr': 0.00011802510058420752, 'samples': 19622400, 'steps': 102199, 'loss/train': 1.196159839630127} 11/07/2021 11:34:30 - INFO - __main__ - Step 102201: {'lr': 0.00011802059355996118, 'samples': 19622592, 'steps': 102200, 'loss/train': 1.3319640159606934} 11/07/2021 11:34:31 - INFO - __main__ - Step 102202: {'lr': 0.0001180160865951827, 'samples': 19622784, 'steps': 102201, 'loss/train': 1.164056658744812} 11/07/2021 11:34:31 - INFO - __main__ - Step 102203: {'lr': 0.00011801157968987417, 'samples': 19622976, 'steps': 102202, 'loss/train': 1.6577365398406982} 11/07/2021 11:34:32 - INFO - __main__ - Step 102204: {'lr': 0.00011800707284403759, 'samples': 19623168, 'steps': 102203, 'loss/train': 1.245571494102478} 11/07/2021 11:34:32 - INFO - __main__ - Step 102205: {'lr': 0.00011800256605767498, 'samples': 19623360, 'steps': 102204, 'loss/train': 1.1609227657318115} 11/07/2021 11:34:32 - INFO - __main__ - Step 102206: {'lr': 0.00011799805933078836, 'samples': 19623552, 'steps': 102205, 'loss/train': 1.0493412017822266} 11/07/2021 11:34:34 - INFO - __main__ - Step 102207: {'lr': 0.0001179935526633798, 'samples': 19623744, 'steps': 102206, 'loss/train': 1.5684678554534912} 11/07/2021 11:34:34 - INFO - __main__ - Step 102208: {'lr': 0.0001179890460554513, 'samples': 19623936, 'steps': 102207, 'loss/train': 1.3105913400650024} 11/07/2021 11:34:34 - INFO - __main__ - Step 102209: {'lr': 0.00011798453950700488, 'samples': 19624128, 'steps': 102208, 'loss/train': 1.7416672706604004} 11/07/2021 11:34:35 - INFO - __main__ - Step 102210: {'lr': 0.00011798003301804261, 'samples': 19624320, 'steps': 102209, 'loss/train': 0.8074084520339966} 11/07/2021 11:34:35 - INFO - __main__ - Step 102211: {'lr': 0.0001179755265885665, 'samples': 19624512, 'steps': 102210, 'loss/train': 0.9952569007873535} 11/07/2021 11:34:36 - INFO - __main__ - Step 102212: {'lr': 0.00011797102021857867, 'samples': 19624704, 'steps': 102211, 'loss/train': 1.5465195178985596} 11/07/2021 11:34:36 - INFO - __main__ - Step 102213: {'lr': 0.00011796651390808097, 'samples': 19624896, 'steps': 102212, 'loss/train': 0.8715502023696899} 11/07/2021 11:34:37 - INFO - __main__ - Step 102214: {'lr': 0.00011796200765707551, 'samples': 19625088, 'steps': 102213, 'loss/train': 0.7835828065872192} 11/07/2021 11:34:37 - INFO - __main__ - Step 102215: {'lr': 0.00011795750146556433, 'samples': 19625280, 'steps': 102214, 'loss/train': 1.3076646327972412} 11/07/2021 11:34:37 - INFO - __main__ - Step 102216: {'lr': 0.00011795299533354948, 'samples': 19625472, 'steps': 102215, 'loss/train': 1.4015411138534546} 11/07/2021 11:34:38 - INFO - __main__ - Step 102217: {'lr': 0.00011794848926103296, 'samples': 19625664, 'steps': 102216, 'loss/train': 1.2652181386947632} 11/07/2021 11:34:39 - INFO - __main__ - Step 102218: {'lr': 0.00011794398324801684, 'samples': 19625856, 'steps': 102217, 'loss/train': 1.4103142023086548} 11/07/2021 11:34:39 - INFO - __main__ - Step 102219: {'lr': 0.00011793947729450311, 'samples': 19626048, 'steps': 102218, 'loss/train': 1.581594705581665} 11/07/2021 11:34:40 - INFO - __main__ - Step 102220: {'lr': 0.00011793497140049377, 'samples': 19626240, 'steps': 102219, 'loss/train': 1.1332134008407593} 11/07/2021 11:34:40 - INFO - __main__ - Step 102221: {'lr': 0.00011793046556599094, 'samples': 19626432, 'steps': 102220, 'loss/train': 0.8639228940010071} 11/07/2021 11:34:41 - INFO - __main__ - Step 102222: {'lr': 0.0001179259597909966, 'samples': 19626624, 'steps': 102221, 'loss/train': 1.4980429410934448} 11/07/2021 11:34:41 - INFO - __main__ - Step 102223: {'lr': 0.00011792145407551277, 'samples': 19626816, 'steps': 102222, 'loss/train': 1.2179256677627563} 11/07/2021 11:34:42 - INFO - __main__ - Step 102224: {'lr': 0.00011791694841954151, 'samples': 19627008, 'steps': 102223, 'loss/train': 0.9225218892097473} 11/07/2021 11:34:42 - INFO - __main__ - Step 102225: {'lr': 0.00011791244282308483, 'samples': 19627200, 'steps': 102224, 'loss/train': 1.3468998670578003} 11/07/2021 11:34:42 - INFO - __main__ - Step 102226: {'lr': 0.00011790793728614485, 'samples': 19627392, 'steps': 102225, 'loss/train': 1.6591202020645142} 11/07/2021 11:34:43 - INFO - __main__ - Step 102227: {'lr': 0.00011790343180872342, 'samples': 19627584, 'steps': 102226, 'loss/train': 1.2864181995391846} 11/07/2021 11:34:44 - INFO - __main__ - Step 102228: {'lr': 0.0001178989263908227, 'samples': 19627776, 'steps': 102227, 'loss/train': 1.3412597179412842} 11/07/2021 11:34:44 - INFO - __main__ - Step 102229: {'lr': 0.00011789442103244466, 'samples': 19627968, 'steps': 102228, 'loss/train': 1.3936172723770142} 11/07/2021 11:34:44 - INFO - __main__ - Step 102230: {'lr': 0.00011788991573359134, 'samples': 19628160, 'steps': 102229, 'loss/train': 1.6047478914260864} 11/07/2021 11:34:45 - INFO - __main__ - Step 102231: {'lr': 0.0001178854104942648, 'samples': 19628352, 'steps': 102230, 'loss/train': 1.716235876083374} 11/07/2021 11:34:45 - INFO - __main__ - Step 102232: {'lr': 0.00011788090531446704, 'samples': 19628544, 'steps': 102231, 'loss/train': 1.9181785583496094} 11/07/2021 11:34:46 - INFO - __main__ - Step 102233: {'lr': 0.00011787640019420012, 'samples': 19628736, 'steps': 102232, 'loss/train': 1.5519530773162842} 11/07/2021 11:34:46 - INFO - __main__ - Step 102234: {'lr': 0.00011787189513346607, 'samples': 19628928, 'steps': 102233, 'loss/train': 1.0589860677719116} 11/07/2021 11:34:47 - INFO - __main__ - Step 102235: {'lr': 0.00011786739013226688, 'samples': 19629120, 'steps': 102234, 'loss/train': 1.2136462926864624} 11/07/2021 11:34:47 - INFO - __main__ - Step 102236: {'lr': 0.00011786288519060462, 'samples': 19629312, 'steps': 102235, 'loss/train': 1.4117114543914795} 11/07/2021 11:34:48 - INFO - __main__ - Step 102237: {'lr': 0.00011785838030848132, 'samples': 19629504, 'steps': 102236, 'loss/train': 1.5099806785583496} 11/07/2021 11:34:48 - INFO - __main__ - Step 102238: {'lr': 0.00011785387548589896, 'samples': 19629696, 'steps': 102237, 'loss/train': 1.2346327304840088} 11/07/2021 11:34:49 - INFO - __main__ - Step 102239: {'lr': 0.00011784937072285972, 'samples': 19629888, 'steps': 102238, 'loss/train': 1.5620752573013306} 11/07/2021 11:34:49 - INFO - __main__ - Step 102240: {'lr': 0.00011784486601936542, 'samples': 19630080, 'steps': 102239, 'loss/train': 1.56527841091156} 11/07/2021 11:34:50 - INFO - __main__ - Step 102241: {'lr': 0.00011784036137541818, 'samples': 19630272, 'steps': 102240, 'loss/train': 1.2475793361663818} 11/07/2021 11:34:50 - INFO - __main__ - Step 102242: {'lr': 0.00011783585679102002, 'samples': 19630464, 'steps': 102241, 'loss/train': 0.9302211999893188} 11/07/2021 11:34:51 - INFO - __main__ - Step 102243: {'lr': 0.00011783135226617301, 'samples': 19630656, 'steps': 102242, 'loss/train': 1.3541474342346191} 11/07/2021 11:34:51 - INFO - __main__ - Step 102244: {'lr': 0.00011782684780087912, 'samples': 19630848, 'steps': 102243, 'loss/train': 1.2156074047088623} 11/07/2021 11:34:52 - INFO - __main__ - Step 102245: {'lr': 0.00011782234339514045, 'samples': 19631040, 'steps': 102244, 'loss/train': 1.1749540567398071} 11/07/2021 11:34:52 - INFO - __main__ - Step 102246: {'lr': 0.00011781783904895896, 'samples': 19631232, 'steps': 102245, 'loss/train': 1.459730863571167} 11/07/2021 11:34:52 - INFO - __main__ - Step 102247: {'lr': 0.00011781333476233674, 'samples': 19631424, 'steps': 102246, 'loss/train': 1.1277897357940674} 11/07/2021 11:34:53 - INFO - __main__ - Step 102248: {'lr': 0.00011780883053527577, 'samples': 19631616, 'steps': 102247, 'loss/train': 1.1861095428466797} 11/07/2021 11:34:54 - INFO - __main__ - Step 102249: {'lr': 0.0001178043263677781, 'samples': 19631808, 'steps': 102248, 'loss/train': 1.1221923828125} 11/07/2021 11:34:54 - INFO - __main__ - Step 102250: {'lr': 0.00011779982225984578, 'samples': 19632000, 'steps': 102249, 'loss/train': 1.8829020261764526} 11/07/2021 11:34:54 - INFO - __main__ - Step 102251: {'lr': 0.00011779531821148081, 'samples': 19632192, 'steps': 102250, 'loss/train': 1.2030473947525024} 11/07/2021 11:34:55 - INFO - __main__ - Step 102252: {'lr': 0.00011779081422268531, 'samples': 19632384, 'steps': 102251, 'loss/train': 1.2703578472137451} 11/07/2021 11:34:56 - INFO - __main__ - Step 102253: {'lr': 0.00011778631029346115, 'samples': 19632576, 'steps': 102252, 'loss/train': 1.7078051567077637} 11/07/2021 11:34:56 - INFO - __main__ - Step 102254: {'lr': 0.00011778180642381045, 'samples': 19632768, 'steps': 102253, 'loss/train': 1.0089306831359863} 11/07/2021 11:34:57 - INFO - __main__ - Step 102255: {'lr': 0.0001177773026137352, 'samples': 19632960, 'steps': 102254, 'loss/train': 1.4337867498397827} 11/07/2021 11:34:57 - INFO - __main__ - Step 102256: {'lr': 0.00011777279886323747, 'samples': 19633152, 'steps': 102255, 'loss/train': 1.1483893394470215} 11/07/2021 11:34:57 - INFO - __main__ - Step 102257: {'lr': 0.0001177682951723193, 'samples': 19633344, 'steps': 102256, 'loss/train': 1.356605887413025} 11/07/2021 11:34:58 - INFO - __main__ - Step 102258: {'lr': 0.00011776379154098265, 'samples': 19633536, 'steps': 102257, 'loss/train': 1.1336538791656494} 11/07/2021 11:35:00 - INFO - __main__ - Step 102259: {'lr': 0.00011775928796922963, 'samples': 19633728, 'steps': 102258, 'loss/train': 0.8060869574546814} 11/07/2021 11:35:01 - INFO - __main__ - Step 102260: {'lr': 0.00011775478445706223, 'samples': 19633920, 'steps': 102259, 'loss/train': 1.364347219467163} 11/07/2021 11:35:01 - INFO - __main__ - Step 102261: {'lr': 0.00011775028100448246, 'samples': 19634112, 'steps': 102260, 'loss/train': 1.268092155456543} 11/07/2021 11:35:01 - INFO - __main__ - Step 102262: {'lr': 0.00011774577761149241, 'samples': 19634304, 'steps': 102261, 'loss/train': 0.3837076425552368} 11/07/2021 11:35:02 - INFO - __main__ - Step 102263: {'lr': 0.00011774127427809403, 'samples': 19634496, 'steps': 102262, 'loss/train': 1.2143820524215698} 11/07/2021 11:35:02 - INFO - __main__ - Step 102264: {'lr': 0.00011773677100428942, 'samples': 19634688, 'steps': 102263, 'loss/train': 1.7549690008163452} 11/07/2021 11:35:02 - INFO - __main__ - Step 102265: {'lr': 0.00011773226779008056, 'samples': 19634880, 'steps': 102264, 'loss/train': 1.7619929313659668} 11/07/2021 11:35:03 - INFO - __main__ - Step 102266: {'lr': 0.00011772776463546961, 'samples': 19635072, 'steps': 102265, 'loss/train': 1.7530453205108643} 11/07/2021 11:35:04 - INFO - __main__ - Step 102267: {'lr': 0.0001177232615404584, 'samples': 19635264, 'steps': 102266, 'loss/train': 1.7538927793502808} 11/07/2021 11:35:04 - INFO - __main__ - Step 102268: {'lr': 0.00011771875850504904, 'samples': 19635456, 'steps': 102267, 'loss/train': 1.3253511190414429} 11/07/2021 11:35:04 - INFO - __main__ - Step 102269: {'lr': 0.00011771425552924356, 'samples': 19635648, 'steps': 102268, 'loss/train': 1.1393659114837646} 11/07/2021 11:35:05 - INFO - __main__ - Step 102270: {'lr': 0.00011770975261304401, 'samples': 19635840, 'steps': 102269, 'loss/train': 1.3360038995742798} 11/07/2021 11:35:05 - INFO - __main__ - Step 102271: {'lr': 0.00011770524975645239, 'samples': 19636032, 'steps': 102270, 'loss/train': 1.4558089971542358} 11/07/2021 11:35:06 - INFO - __main__ - Step 102272: {'lr': 0.00011770074695947072, 'samples': 19636224, 'steps': 102271, 'loss/train': 1.132535457611084} 11/07/2021 11:35:07 - INFO - __main__ - Step 102273: {'lr': 0.0001176962442221011, 'samples': 19636416, 'steps': 102272, 'loss/train': 0.7121001482009888} 11/07/2021 11:35:07 - INFO - __main__ - Step 102274: {'lr': 0.00011769174154434548, 'samples': 19636608, 'steps': 102273, 'loss/train': 1.4853301048278809} 11/07/2021 11:35:07 - INFO - __main__ - Step 102275: {'lr': 0.00011768723892620591, 'samples': 19636800, 'steps': 102274, 'loss/train': 0.23960061371326447} 11/07/2021 11:35:08 - INFO - __main__ - Step 102276: {'lr': 0.00011768273636768446, 'samples': 19636992, 'steps': 102275, 'loss/train': 1.2536284923553467} 11/07/2021 11:35:09 - INFO - __main__ - Step 102277: {'lr': 0.00011767823386878312, 'samples': 19637184, 'steps': 102276, 'loss/train': 1.650179147720337} 11/07/2021 11:35:10 - INFO - __main__ - Step 102278: {'lr': 0.00011767373142950392, 'samples': 19637376, 'steps': 102277, 'loss/train': 1.2378435134887695} 11/07/2021 11:35:10 - INFO - __main__ - Step 102279: {'lr': 0.00011766922904984898, 'samples': 19637568, 'steps': 102278, 'loss/train': 0.7855846881866455} 11/07/2021 11:35:10 - INFO - __main__ - Step 102280: {'lr': 0.00011766472672982015, 'samples': 19637760, 'steps': 102279, 'loss/train': 1.7919517755508423} 11/07/2021 11:35:11 - INFO - __main__ - Step 102281: {'lr': 0.00011766022446941957, 'samples': 19637952, 'steps': 102280, 'loss/train': 1.4670095443725586} 11/07/2021 11:35:11 - INFO - __main__ - Step 102282: {'lr': 0.00011765572226864924, 'samples': 19638144, 'steps': 102281, 'loss/train': 1.3441742658615112} 11/07/2021 11:35:12 - INFO - __main__ - Step 102283: {'lr': 0.0001176512201275112, 'samples': 19638336, 'steps': 102282, 'loss/train': 1.0001269578933716} 11/07/2021 11:35:12 - INFO - __main__ - Step 102284: {'lr': 0.00011764671804600746, 'samples': 19638528, 'steps': 102283, 'loss/train': 0.8596161007881165} 11/07/2021 11:35:13 - INFO - __main__ - Step 102285: {'lr': 0.0001176422160241401, 'samples': 19638720, 'steps': 102284, 'loss/train': 0.4333227276802063} 11/07/2021 11:35:13 - INFO - __main__ - Step 102286: {'lr': 0.0001176377140619111, 'samples': 19638912, 'steps': 102285, 'loss/train': 1.3458071947097778} 11/07/2021 11:35:13 - INFO - __main__ - Step 102287: {'lr': 0.00011763321215932249, 'samples': 19639104, 'steps': 102286, 'loss/train': 1.8160730600357056} 11/07/2021 11:35:15 - INFO - __main__ - Step 102288: {'lr': 0.00011762871031637631, 'samples': 19639296, 'steps': 102287, 'loss/train': 1.3415073156356812} 11/07/2021 11:35:15 - INFO - __main__ - Step 102289: {'lr': 0.00011762420853307462, 'samples': 19639488, 'steps': 102288, 'loss/train': 1.5865209102630615} 11/07/2021 11:35:15 - INFO - __main__ - Step 102290: {'lr': 0.00011761970680941941, 'samples': 19639680, 'steps': 102289, 'loss/train': 1.6945385932922363} 11/07/2021 11:35:16 - INFO - __main__ - Step 102291: {'lr': 0.0001176152051454127, 'samples': 19639872, 'steps': 102290, 'loss/train': 1.1607511043548584} 11/07/2021 11:35:16 - INFO - __main__ - Step 102292: {'lr': 0.00011761070354105654, 'samples': 19640064, 'steps': 102291, 'loss/train': 1.3689546585083008} 11/07/2021 11:35:17 - INFO - __main__ - Step 102293: {'lr': 0.00011760620199635307, 'samples': 19640256, 'steps': 102292, 'loss/train': 1.2579126358032227} 11/07/2021 11:35:17 - INFO - __main__ - Step 102294: {'lr': 0.00011760170051130409, 'samples': 19640448, 'steps': 102293, 'loss/train': 1.240069031715393} 11/07/2021 11:35:18 - INFO - __main__ - Step 102295: {'lr': 0.00011759719908591174, 'samples': 19640640, 'steps': 102294, 'loss/train': 1.6688284873962402} 11/07/2021 11:35:18 - INFO - __main__ - Step 102296: {'lr': 0.00011759269772017806, 'samples': 19640832, 'steps': 102295, 'loss/train': 1.3100054264068604} 11/07/2021 11:35:19 - INFO - __main__ - Step 102297: {'lr': 0.00011758819641410506, 'samples': 19641024, 'steps': 102296, 'loss/train': 1.0485928058624268} 11/07/2021 11:35:19 - INFO - __main__ - Step 102298: {'lr': 0.00011758369516769476, 'samples': 19641216, 'steps': 102297, 'loss/train': 1.7640576362609863} 11/07/2021 11:35:20 - INFO - __main__ - Step 102299: {'lr': 0.00011757919398094924, 'samples': 19641408, 'steps': 102298, 'loss/train': 1.572244644165039} 11/07/2021 11:35:20 - INFO - __main__ - Step 102300: {'lr': 0.00011757469285387046, 'samples': 19641600, 'steps': 102299, 'loss/train': 0.9783884882926941} 11/07/2021 11:35:21 - INFO - __main__ - Step 102301: {'lr': 0.0001175701917864605, 'samples': 19641792, 'steps': 102300, 'loss/train': 0.9724418520927429} 11/07/2021 11:35:21 - INFO - __main__ - Step 102302: {'lr': 0.00011756569077872136, 'samples': 19641984, 'steps': 102301, 'loss/train': 1.3498846292495728} 11/07/2021 11:35:21 - INFO - __main__ - Step 102303: {'lr': 0.00011756118983065506, 'samples': 19642176, 'steps': 102302, 'loss/train': 1.3808872699737549} 11/07/2021 11:35:22 - INFO - __main__ - Step 102304: {'lr': 0.00011755668894226368, 'samples': 19642368, 'steps': 102303, 'loss/train': 1.526334285736084} 11/07/2021 11:35:23 - INFO - __main__ - Step 102305: {'lr': 0.00011755218811354918, 'samples': 19642560, 'steps': 102304, 'loss/train': 1.4058135747909546} 11/07/2021 11:35:23 - INFO - __main__ - Step 102306: {'lr': 0.00011754768734451373, 'samples': 19642752, 'steps': 102305, 'loss/train': 1.1698057651519775} 11/07/2021 11:35:23 - INFO - __main__ - Step 102307: {'lr': 0.00011754318663515915, 'samples': 19642944, 'steps': 102306, 'loss/train': 1.5021286010742188} 11/07/2021 11:35:24 - INFO - __main__ - Step 102308: {'lr': 0.00011753868598548756, 'samples': 19643136, 'steps': 102307, 'loss/train': 3.684033155441284} 11/07/2021 11:35:25 - INFO - __main__ - Step 102309: {'lr': 0.00011753418539550101, 'samples': 19643328, 'steps': 102308, 'loss/train': 2.0992586612701416} 11/07/2021 11:35:25 - INFO - __main__ - Step 102310: {'lr': 0.00011752968486520149, 'samples': 19643520, 'steps': 102309, 'loss/train': 1.2839291095733643} 11/07/2021 11:35:26 - INFO - __main__ - Step 102311: {'lr': 0.00011752518439459106, 'samples': 19643712, 'steps': 102310, 'loss/train': 1.4715105295181274} 11/07/2021 11:35:26 - INFO - __main__ - Step 102312: {'lr': 0.00011752068398367174, 'samples': 19643904, 'steps': 102311, 'loss/train': 1.2786718606948853} 11/07/2021 11:35:26 - INFO - __main__ - Step 102313: {'lr': 0.00011751618363244557, 'samples': 19644096, 'steps': 102312, 'loss/train': 1.5262730121612549} 11/07/2021 11:35:27 - INFO - __main__ - Step 102314: {'lr': 0.00011751168334091455, 'samples': 19644288, 'steps': 102313, 'loss/train': 1.2525984048843384} 11/07/2021 11:35:28 - INFO - __main__ - Step 102315: {'lr': 0.00011750718310908071, 'samples': 19644480, 'steps': 102314, 'loss/train': 1.4703062772750854} 11/07/2021 11:35:28 - INFO - __main__ - Step 102316: {'lr': 0.0001175026829369461, 'samples': 19644672, 'steps': 102315, 'loss/train': 1.3287404775619507} 11/07/2021 11:35:28 - INFO - __main__ - Step 102317: {'lr': 0.00011749818282451275, 'samples': 19644864, 'steps': 102316, 'loss/train': 2.034003973007202} 11/07/2021 11:35:29 - INFO - __main__ - Step 102318: {'lr': 0.00011749368277178266, 'samples': 19645056, 'steps': 102317, 'loss/train': 1.3231713771820068} 11/07/2021 11:35:30 - INFO - __main__ - Step 102319: {'lr': 0.00011748918277875787, 'samples': 19645248, 'steps': 102318, 'loss/train': 0.28738933801651} 11/07/2021 11:35:30 - INFO - __main__ - Step 102320: {'lr': 0.0001174846828454405, 'samples': 19645440, 'steps': 102319, 'loss/train': 1.5266647338867188} 11/07/2021 11:35:30 - INFO - __main__ - Step 102321: {'lr': 0.00011748018297183238, 'samples': 19645632, 'steps': 102320, 'loss/train': 1.2023694515228271} 11/07/2021 11:35:31 - INFO - __main__ - Step 102322: {'lr': 0.00011747568315793567, 'samples': 19645824, 'steps': 102321, 'loss/train': 0.1296757310628891} 11/07/2021 11:35:31 - INFO - __main__ - Step 102323: {'lr': 0.00011747118340375238, 'samples': 19646016, 'steps': 102322, 'loss/train': 1.2983412742614746} 11/07/2021 11:35:32 - INFO - __main__ - Step 102324: {'lr': 0.00011746668370928452, 'samples': 19646208, 'steps': 102323, 'loss/train': 1.3144570589065552} 11/07/2021 11:35:33 - INFO - __main__ - Step 102325: {'lr': 0.0001174621840745341, 'samples': 19646400, 'steps': 102324, 'loss/train': 1.3803412914276123} 11/07/2021 11:35:33 - INFO - __main__ - Step 102326: {'lr': 0.0001174576844995032, 'samples': 19646592, 'steps': 102325, 'loss/train': 1.3872599601745605} 11/07/2021 11:35:33 - INFO - __main__ - Step 102327: {'lr': 0.00011745318498419383, 'samples': 19646784, 'steps': 102326, 'loss/train': 0.9449789524078369} 11/07/2021 11:35:34 - INFO - __main__ - Step 102328: {'lr': 0.00011744868552860799, 'samples': 19646976, 'steps': 102327, 'loss/train': 1.4150348901748657} 11/07/2021 11:35:35 - INFO - __main__ - Step 102329: {'lr': 0.00011744418613274773, 'samples': 19647168, 'steps': 102328, 'loss/train': 1.0733563899993896} 11/07/2021 11:35:35 - INFO - __main__ - Step 102330: {'lr': 0.00011743968679661507, 'samples': 19647360, 'steps': 102329, 'loss/train': 1.5335696935653687} 11/07/2021 11:35:35 - INFO - __main__ - Step 102331: {'lr': 0.00011743518752021206, 'samples': 19647552, 'steps': 102330, 'loss/train': 0.938067615032196} 11/07/2021 11:35:36 - INFO - __main__ - Step 102332: {'lr': 0.0001174306883035407, 'samples': 19647744, 'steps': 102331, 'loss/train': 1.5566368103027344} 11/07/2021 11:35:36 - INFO - __main__ - Step 102333: {'lr': 0.00011742618914660311, 'samples': 19647936, 'steps': 102332, 'loss/train': 1.3766061067581177} 11/07/2021 11:35:37 - INFO - __main__ - Step 102334: {'lr': 0.00011742169004940115, 'samples': 19648128, 'steps': 102333, 'loss/train': 0.3349030911922455} 11/07/2021 11:35:37 - INFO - __main__ - Step 102335: {'lr': 0.00011741719101193693, 'samples': 19648320, 'steps': 102334, 'loss/train': 1.0992225408554077} 11/07/2021 11:35:38 - INFO - __main__ - Step 102336: {'lr': 0.00011741269203421248, 'samples': 19648512, 'steps': 102335, 'loss/train': 1.1979687213897705} 11/07/2021 11:35:38 - INFO - __main__ - Step 102337: {'lr': 0.00011740819311622983, 'samples': 19648704, 'steps': 102336, 'loss/train': 1.4213173389434814} 11/07/2021 11:35:38 - INFO - __main__ - Step 102338: {'lr': 0.000117403694257991, 'samples': 19648896, 'steps': 102337, 'loss/train': 1.2319668531417847} 11/07/2021 11:35:40 - INFO - __main__ - Step 102339: {'lr': 0.00011739919545949801, 'samples': 19649088, 'steps': 102338, 'loss/train': 1.1941969394683838} 11/07/2021 11:35:40 - INFO - __main__ - Step 102340: {'lr': 0.0001173946967207529, 'samples': 19649280, 'steps': 102339, 'loss/train': 1.1271440982818604} 11/07/2021 11:35:40 - INFO - __main__ - Step 102341: {'lr': 0.00011739019804175769, 'samples': 19649472, 'steps': 102340, 'loss/train': 1.1969788074493408} 11/07/2021 11:35:41 - INFO - __main__ - Step 102342: {'lr': 0.00011738569942251443, 'samples': 19649664, 'steps': 102341, 'loss/train': 1.5955750942230225} 11/07/2021 11:35:41 - INFO - __main__ - Step 102343: {'lr': 0.00011738120086302509, 'samples': 19649856, 'steps': 102342, 'loss/train': 1.253129005432129} 11/07/2021 11:35:41 - INFO - __main__ - Step 102344: {'lr': 0.00011737670236329176, 'samples': 19650048, 'steps': 102343, 'loss/train': 0.933851420879364} 11/07/2021 11:35:43 - INFO - __main__ - Step 102345: {'lr': 0.00011737220392331644, 'samples': 19650240, 'steps': 102344, 'loss/train': 1.2328660488128662} 11/07/2021 11:35:43 - INFO - __main__ - Step 102346: {'lr': 0.00011736770554310117, 'samples': 19650432, 'steps': 102345, 'loss/train': 0.9045242071151733} 11/07/2021 11:35:43 - INFO - __main__ - Step 102347: {'lr': 0.00011736320722264804, 'samples': 19650624, 'steps': 102346, 'loss/train': 1.457698941230774} 11/07/2021 11:35:44 - INFO - __main__ - Step 102348: {'lr': 0.0001173587089619589, 'samples': 19650816, 'steps': 102347, 'loss/train': 1.695117712020874} 11/07/2021 11:35:44 - INFO - __main__ - Step 102349: {'lr': 0.00011735421076103589, 'samples': 19651008, 'steps': 102348, 'loss/train': 0.9800054430961609} 11/07/2021 11:35:45 - INFO - __main__ - Step 102350: {'lr': 0.00011734971261988104, 'samples': 19651200, 'steps': 102349, 'loss/train': 1.1522936820983887} 11/07/2021 11:35:45 - INFO - __main__ - Step 102351: {'lr': 0.00011734521453849634, 'samples': 19651392, 'steps': 102350, 'loss/train': 1.168721318244934} 11/07/2021 11:35:46 - INFO - __main__ - Step 102352: {'lr': 0.00011734071651688385, 'samples': 19651584, 'steps': 102351, 'loss/train': 0.4351664185523987} 11/07/2021 11:35:46 - INFO - __main__ - Step 102353: {'lr': 0.00011733621855504559, 'samples': 19651776, 'steps': 102352, 'loss/train': 1.5880337953567505} 11/07/2021 11:35:46 - INFO - __main__ - Step 102354: {'lr': 0.00011733172065298358, 'samples': 19651968, 'steps': 102353, 'loss/train': 1.28263521194458} 11/07/2021 11:35:47 - INFO - __main__ - Step 102355: {'lr': 0.00011732722281069985, 'samples': 19652160, 'steps': 102354, 'loss/train': 1.4682564735412598} 11/07/2021 11:35:48 - INFO - __main__ - Step 102356: {'lr': 0.00011732272502819644, 'samples': 19652352, 'steps': 102355, 'loss/train': 1.4500658512115479} 11/07/2021 11:35:48 - INFO - __main__ - Step 102357: {'lr': 0.00011731822730547534, 'samples': 19652544, 'steps': 102356, 'loss/train': 1.574950098991394} 11/07/2021 11:35:48 - INFO - __main__ - Step 102358: {'lr': 0.00011731372964253861, 'samples': 19652736, 'steps': 102357, 'loss/train': 1.3303706645965576} 11/07/2021 11:35:49 - INFO - __main__ - Step 102359: {'lr': 0.00011730923203938826, 'samples': 19652928, 'steps': 102358, 'loss/train': 1.1268219947814941} 11/07/2021 11:35:50 - INFO - __main__ - Step 102360: {'lr': 0.0001173047344960264, 'samples': 19653120, 'steps': 102359, 'loss/train': 0.9632179737091064} 11/07/2021 11:35:50 - INFO - __main__ - Step 102361: {'lr': 0.00011730023701245493, 'samples': 19653312, 'steps': 102360, 'loss/train': 1.147180438041687} 11/07/2021 11:35:50 - INFO - __main__ - Step 102362: {'lr': 0.0001172957395886759, 'samples': 19653504, 'steps': 102361, 'loss/train': 1.0031520128250122} 11/07/2021 11:35:51 - INFO - __main__ - Step 102363: {'lr': 0.00011729124222469134, 'samples': 19653696, 'steps': 102362, 'loss/train': 1.1670762300491333} 11/07/2021 11:35:51 - INFO - __main__ - Step 102364: {'lr': 0.00011728674492050333, 'samples': 19653888, 'steps': 102363, 'loss/train': 0.774925947189331} 11/07/2021 11:35:52 - INFO - __main__ - Step 102365: {'lr': 0.00011728224767611386, 'samples': 19654080, 'steps': 102364, 'loss/train': 1.361011028289795} 11/07/2021 11:35:53 - INFO - __main__ - Step 102366: {'lr': 0.00011727775049152495, 'samples': 19654272, 'steps': 102365, 'loss/train': 1.9479496479034424} 11/07/2021 11:35:53 - INFO - __main__ - Step 102367: {'lr': 0.00011727325336673864, 'samples': 19654464, 'steps': 102366, 'loss/train': 1.411200761795044} 11/07/2021 11:35:53 - INFO - __main__ - Step 102368: {'lr': 0.00011726875630175696, 'samples': 19654656, 'steps': 102367, 'loss/train': 1.5843698978424072} 11/07/2021 11:35:54 - INFO - __main__ - Step 102369: {'lr': 0.00011726425929658193, 'samples': 19654848, 'steps': 102368, 'loss/train': 1.266187071800232} 11/07/2021 11:35:54 - INFO - __main__ - Step 102370: {'lr': 0.00011725976235121557, 'samples': 19655040, 'steps': 102369, 'loss/train': 1.6743392944335938} 11/07/2021 11:35:55 - INFO - __main__ - Step 102371: {'lr': 0.00011725526546565993, 'samples': 19655232, 'steps': 102370, 'loss/train': 1.228100061416626} 11/07/2021 11:35:55 - INFO - __main__ - Step 102372: {'lr': 0.00011725076863991699, 'samples': 19655424, 'steps': 102371, 'loss/train': 1.301822543144226} 11/07/2021 11:35:56 - INFO - __main__ - Step 102373: {'lr': 0.00011724627187398892, 'samples': 19655616, 'steps': 102372, 'loss/train': 1.1536041498184204} 11/07/2021 11:35:56 - INFO - __main__ - Step 102374: {'lr': 0.00011724177516787754, 'samples': 19655808, 'steps': 102373, 'loss/train': 1.6722662448883057} 11/07/2021 11:35:57 - INFO - __main__ - Step 102375: {'lr': 0.00011723727852158495, 'samples': 19656000, 'steps': 102374, 'loss/train': 1.7286159992218018} 11/07/2021 11:35:58 - INFO - __main__ - Step 102376: {'lr': 0.00011723278193511322, 'samples': 19656192, 'steps': 102375, 'loss/train': 1.2804968357086182} 11/07/2021 11:35:58 - INFO - __main__ - Step 102377: {'lr': 0.00011722828540846434, 'samples': 19656384, 'steps': 102376, 'loss/train': 0.8286502957344055} 11/07/2021 11:35:58 - INFO - __main__ - Step 102378: {'lr': 0.00011722378894164031, 'samples': 19656576, 'steps': 102377, 'loss/train': 1.5824158191680908} 11/07/2021 11:35:59 - INFO - __main__ - Step 102379: {'lr': 0.00011721929253464323, 'samples': 19656768, 'steps': 102378, 'loss/train': 1.4049830436706543} 11/07/2021 11:35:59 - INFO - __main__ - Step 102380: {'lr': 0.00011721479618747507, 'samples': 19656960, 'steps': 102379, 'loss/train': 1.056426763534546} 11/07/2021 11:36:00 - INFO - __main__ - Step 102381: {'lr': 0.0001172102999001379, 'samples': 19657152, 'steps': 102380, 'loss/train': 1.5346195697784424} 11/07/2021 11:36:01 - INFO - __main__ - Step 102382: {'lr': 0.0001172058036726337, 'samples': 19657344, 'steps': 102381, 'loss/train': 1.1007028818130493} 11/07/2021 11:36:01 - INFO - __main__ - Step 102383: {'lr': 0.00011720130750496452, 'samples': 19657536, 'steps': 102382, 'loss/train': 1.409245252609253} 11/07/2021 11:36:01 - INFO - __main__ - Step 102384: {'lr': 0.00011719681139713237, 'samples': 19657728, 'steps': 102383, 'loss/train': 1.4253367185592651} 11/07/2021 11:36:02 - INFO - __main__ - Step 102385: {'lr': 0.00011719231534913932, 'samples': 19657920, 'steps': 102384, 'loss/train': 1.3273037672042847} 11/07/2021 11:36:02 - INFO - __main__ - Step 102386: {'lr': 0.00011718781936098744, 'samples': 19658112, 'steps': 102385, 'loss/train': 1.2957042455673218} 11/07/2021 11:36:03 - INFO - __main__ - Step 102387: {'lr': 0.00011718332343267857, 'samples': 19658304, 'steps': 102386, 'loss/train': 0.2352754920721054} 11/07/2021 11:36:03 - INFO - __main__ - Step 102388: {'lr': 0.00011717882756421485, 'samples': 19658496, 'steps': 102387, 'loss/train': 1.447097897529602} 11/07/2021 11:36:04 - INFO - __main__ - Step 102389: {'lr': 0.00011717433175559831, 'samples': 19658688, 'steps': 102388, 'loss/train': 1.5590667724609375} 11/07/2021 11:36:04 - INFO - __main__ - Step 102390: {'lr': 0.00011716983600683096, 'samples': 19658880, 'steps': 102389, 'loss/train': 1.2128804922103882} 11/07/2021 11:36:04 - INFO - __main__ - Step 102391: {'lr': 0.00011716534031791485, 'samples': 19659072, 'steps': 102390, 'loss/train': 1.609163761138916} 11/07/2021 11:36:05 - INFO - __main__ - Step 102392: {'lr': 0.00011716084468885197, 'samples': 19659264, 'steps': 102391, 'loss/train': 1.658158540725708} 11/07/2021 11:36:06 - INFO - __main__ - Step 102393: {'lr': 0.00011715634911964434, 'samples': 19659456, 'steps': 102392, 'loss/train': 1.2879419326782227} 11/07/2021 11:36:06 - INFO - __main__ - Step 102394: {'lr': 0.00011715185361029404, 'samples': 19659648, 'steps': 102393, 'loss/train': 1.0519917011260986} 11/07/2021 11:36:06 - INFO - __main__ - Step 102395: {'lr': 0.00011714735816080308, 'samples': 19659840, 'steps': 102394, 'loss/train': 0.998603880405426} 11/07/2021 11:36:07 - INFO - __main__ - Step 102396: {'lr': 0.00011714286277117344, 'samples': 19660032, 'steps': 102395, 'loss/train': 1.4371542930603027} 11/07/2021 11:36:08 - INFO - __main__ - Step 102397: {'lr': 0.00011713836744140727, 'samples': 19660224, 'steps': 102396, 'loss/train': 1.0269442796707153} 11/07/2021 11:36:08 - INFO - __main__ - Step 102398: {'lr': 0.00011713387217150642, 'samples': 19660416, 'steps': 102397, 'loss/train': 1.476564884185791} 11/07/2021 11:36:09 - INFO - __main__ - Step 102399: {'lr': 0.00011712937696147299, 'samples': 19660608, 'steps': 102398, 'loss/train': 1.759273648262024} 11/07/2021 11:36:09 - INFO - __main__ - Step 102400: {'lr': 0.00011712488181130903, 'samples': 19660800, 'steps': 102399, 'loss/train': 1.2605152130126953} 11/07/2021 11:36:09 - INFO - __main__ - Step 102401: {'lr': 0.00011712038672101654, 'samples': 19660992, 'steps': 102400, 'loss/train': 0.7883184552192688} 11/07/2021 11:36:10 - INFO - __main__ - Step 102402: {'lr': 0.00011711589169059756, 'samples': 19661184, 'steps': 102401, 'loss/train': 1.5040602684020996} 11/07/2021 11:36:11 - INFO - __main__ - Step 102403: {'lr': 0.00011711139672005408, 'samples': 19661376, 'steps': 102402, 'loss/train': 1.662667989730835} 11/07/2021 11:36:11 - INFO - __main__ - Step 102404: {'lr': 0.00011710690180938818, 'samples': 19661568, 'steps': 102403, 'loss/train': 1.4898165464401245} 11/07/2021 11:36:11 - INFO - __main__ - Step 102405: {'lr': 0.00011710240695860183, 'samples': 19661760, 'steps': 102404, 'loss/train': 1.5901751518249512} 11/07/2021 11:36:12 - INFO - __main__ - Step 102406: {'lr': 0.00011709791216769711, 'samples': 19661952, 'steps': 102405, 'loss/train': 1.3292949199676514} 11/07/2021 11:36:13 - INFO - __main__ - Step 102407: {'lr': 0.000117093417436676, 'samples': 19662144, 'steps': 102406, 'loss/train': 1.4611244201660156} 11/07/2021 11:36:13 - INFO - __main__ - Step 102408: {'lr': 0.00011708892276554067, 'samples': 19662336, 'steps': 102407, 'loss/train': 0.8538132905960083} 11/07/2021 11:36:13 - INFO - __main__ - Step 102409: {'lr': 0.00011708442815429291, 'samples': 19662528, 'steps': 102408, 'loss/train': 1.7311077117919922} 11/07/2021 11:36:14 - INFO - __main__ - Step 102410: {'lr': 0.00011707993360293486, 'samples': 19662720, 'steps': 102409, 'loss/train': 1.255387306213379} 11/07/2021 11:36:14 - INFO - __main__ - Step 102411: {'lr': 0.00011707543911146854, 'samples': 19662912, 'steps': 102410, 'loss/train': 1.7556601762771606} 11/07/2021 11:36:15 - INFO - __main__ - Step 102412: {'lr': 0.00011707094467989598, 'samples': 19663104, 'steps': 102411, 'loss/train': 1.1063693761825562} 11/07/2021 11:36:16 - INFO - __main__ - Step 102413: {'lr': 0.00011706645030821919, 'samples': 19663296, 'steps': 102412, 'loss/train': 1.6914258003234863} 11/07/2021 11:36:16 - INFO - __main__ - Step 102414: {'lr': 0.00011706195599644021, 'samples': 19663488, 'steps': 102413, 'loss/train': 1.2046442031860352} 11/07/2021 11:36:16 - INFO - __main__ - Step 102415: {'lr': 0.00011705746174456106, 'samples': 19663680, 'steps': 102414, 'loss/train': 1.5591143369674683} 11/07/2021 11:36:17 - INFO - __main__ - Step 102416: {'lr': 0.00011705296755258376, 'samples': 19663872, 'steps': 102415, 'loss/train': 1.4933804273605347} 11/07/2021 11:36:17 - INFO - __main__ - Step 102417: {'lr': 0.00011704847342051036, 'samples': 19664064, 'steps': 102416, 'loss/train': 1.497816801071167} 11/07/2021 11:36:18 - INFO - __main__ - Step 102418: {'lr': 0.00011704397934834284, 'samples': 19664256, 'steps': 102417, 'loss/train': 1.3561780452728271} 11/07/2021 11:36:18 - INFO - __main__ - Step 102419: {'lr': 0.00011703948533608339, 'samples': 19664448, 'steps': 102418, 'loss/train': 1.4551830291748047} 11/07/2021 11:36:19 - INFO - __main__ - Step 102420: {'lr': 0.00011703499138373375, 'samples': 19664640, 'steps': 102419, 'loss/train': 1.3957302570343018} 11/07/2021 11:36:19 - INFO - __main__ - Step 102421: {'lr': 0.00011703049749129613, 'samples': 19664832, 'steps': 102420, 'loss/train': 1.5921494960784912} 11/07/2021 11:36:19 - INFO - __main__ - Step 102422: {'lr': 0.0001170260036587725, 'samples': 19665024, 'steps': 102421, 'loss/train': 0.8697566390037537} 11/07/2021 11:36:20 - INFO - __main__ - Step 102423: {'lr': 0.0001170215098861649, 'samples': 19665216, 'steps': 102422, 'loss/train': 1.5434134006500244} 11/07/2021 11:36:21 - INFO - __main__ - Step 102424: {'lr': 0.00011701701617347535, 'samples': 19665408, 'steps': 102423, 'loss/train': 1.1544955968856812} 11/07/2021 11:36:21 - INFO - __main__ - Step 102425: {'lr': 0.00011701252252070587, 'samples': 19665600, 'steps': 102424, 'loss/train': 1.4973171949386597} 11/07/2021 11:36:22 - INFO - __main__ - Step 102426: {'lr': 0.00011700802892785852, 'samples': 19665792, 'steps': 102425, 'loss/train': 1.2965126037597656} 11/07/2021 11:36:22 - INFO - __main__ - Step 102427: {'lr': 0.0001170035353949353, 'samples': 19665984, 'steps': 102426, 'loss/train': 1.4242808818817139} 11/07/2021 11:36:23 - INFO - __main__ - Step 102428: {'lr': 0.00011699904192193822, 'samples': 19666176, 'steps': 102427, 'loss/train': 1.153289794921875} 11/07/2021 11:36:23 - INFO - __main__ - Step 102429: {'lr': 0.00011699454850886935, 'samples': 19666368, 'steps': 102428, 'loss/train': 1.6129342317581177} 11/07/2021 11:36:24 - INFO - __main__ - Step 102430: {'lr': 0.00011699005515573075, 'samples': 19666560, 'steps': 102429, 'loss/train': 1.1347030401229858} 11/07/2021 11:36:24 - INFO - __main__ - Step 102431: {'lr': 0.00011698556186252429, 'samples': 19666752, 'steps': 102430, 'loss/train': 1.1077336072921753} 11/07/2021 11:36:24 - INFO - __main__ - Step 102432: {'lr': 0.00011698106862925206, 'samples': 19666944, 'steps': 102431, 'loss/train': 0.9597427845001221} 11/07/2021 11:36:25 - INFO - __main__ - Step 102433: {'lr': 0.00011697657545591614, 'samples': 19667136, 'steps': 102432, 'loss/train': 1.1193993091583252} 11/07/2021 11:36:26 - INFO - __main__ - Step 102434: {'lr': 0.00011697208234251852, 'samples': 19667328, 'steps': 102433, 'loss/train': 1.4679522514343262} 11/07/2021 11:36:26 - INFO - __main__ - Step 102435: {'lr': 0.00011696758928906123, 'samples': 19667520, 'steps': 102434, 'loss/train': 1.5650266408920288} 11/07/2021 11:36:27 - INFO - __main__ - Step 102436: {'lr': 0.00011696309629554627, 'samples': 19667712, 'steps': 102435, 'loss/train': 1.1927984952926636} 11/07/2021 11:36:27 - INFO - __main__ - Step 102437: {'lr': 0.0001169586033619757, 'samples': 19667904, 'steps': 102436, 'loss/train': 0.9996128082275391} 11/07/2021 11:36:27 - INFO - __main__ - Step 102438: {'lr': 0.00011695411048835153, 'samples': 19668096, 'steps': 102437, 'loss/train': 0.8837153315544128} 11/07/2021 11:36:28 - INFO - __main__ - Step 102439: {'lr': 0.00011694961767467576, 'samples': 19668288, 'steps': 102438, 'loss/train': 1.4927375316619873} 11/07/2021 11:36:29 - INFO - __main__ - Step 102440: {'lr': 0.00011694512492095047, 'samples': 19668480, 'steps': 102439, 'loss/train': 1.4599404335021973} 11/07/2021 11:36:29 - INFO - __main__ - Step 102441: {'lr': 0.00011694063222717774, 'samples': 19668672, 'steps': 102440, 'loss/train': 2.021909475326538} 11/07/2021 11:36:29 - INFO - __main__ - Step 102442: {'lr': 0.00011693613959335942, 'samples': 19668864, 'steps': 102441, 'loss/train': 1.3170689344406128} 11/07/2021 11:36:30 - INFO - __main__ - Step 102443: {'lr': 0.00011693164701949763, 'samples': 19669056, 'steps': 102442, 'loss/train': 1.2789441347122192} 11/07/2021 11:36:31 - INFO - __main__ - Step 102444: {'lr': 0.00011692715450559435, 'samples': 19669248, 'steps': 102443, 'loss/train': 0.3715358078479767} 11/07/2021 11:36:31 - INFO - __main__ - Step 102445: {'lr': 0.00011692266205165166, 'samples': 19669440, 'steps': 102444, 'loss/train': 0.8324541449546814} 11/07/2021 11:36:31 - INFO - __main__ - Step 102446: {'lr': 0.00011691816965767157, 'samples': 19669632, 'steps': 102445, 'loss/train': 1.0388332605361938} 11/07/2021 11:36:32 - INFO - __main__ - Step 102447: {'lr': 0.0001169136773236561, 'samples': 19669824, 'steps': 102446, 'loss/train': 1.2313752174377441} 11/07/2021 11:36:32 - INFO - __main__ - Step 102448: {'lr': 0.00011690918504960726, 'samples': 19670016, 'steps': 102447, 'loss/train': 1.8179396390914917} 11/07/2021 11:36:33 - INFO - __main__ - Step 102449: {'lr': 0.00011690469283552713, 'samples': 19670208, 'steps': 102448, 'loss/train': 1.590407133102417} 11/07/2021 11:36:34 - INFO - __main__ - Step 102450: {'lr': 0.00011690020068141766, 'samples': 19670400, 'steps': 102449, 'loss/train': 1.4335838556289673} 11/07/2021 11:36:34 - INFO - __main__ - Step 102451: {'lr': 0.00011689570858728088, 'samples': 19670592, 'steps': 102450, 'loss/train': 1.541591763496399} 11/07/2021 11:36:34 - INFO - __main__ - Step 102452: {'lr': 0.00011689121655311888, 'samples': 19670784, 'steps': 102451, 'loss/train': 1.1869481801986694} 11/07/2021 11:36:35 - INFO - __main__ - Step 102453: {'lr': 0.00011688672457893363, 'samples': 19670976, 'steps': 102452, 'loss/train': 1.7463619709014893} 11/07/2021 11:36:36 - INFO - __main__ - Step 102454: {'lr': 0.00011688223266472726, 'samples': 19671168, 'steps': 102453, 'loss/train': 1.2800500392913818} 11/07/2021 11:36:36 - INFO - __main__ - Step 102455: {'lr': 0.00011687774081050159, 'samples': 19671360, 'steps': 102454, 'loss/train': 1.3029332160949707} 11/07/2021 11:36:36 - INFO - __main__ - Step 102456: {'lr': 0.00011687324901625879, 'samples': 19671552, 'steps': 102455, 'loss/train': 0.7231261730194092} 11/07/2021 11:36:37 - INFO - __main__ - Step 102457: {'lr': 0.00011686875728200083, 'samples': 19671744, 'steps': 102456, 'loss/train': 1.1553137302398682} 11/07/2021 11:36:37 - INFO - __main__ - Step 102458: {'lr': 0.00011686426560772975, 'samples': 19671936, 'steps': 102457, 'loss/train': 1.4326266050338745} 11/07/2021 11:36:38 - INFO - __main__ - Step 102459: {'lr': 0.00011685977399344758, 'samples': 19672128, 'steps': 102458, 'loss/train': 1.1534734964370728} 11/07/2021 11:36:38 - INFO - __main__ - Step 102460: {'lr': 0.00011685528243915635, 'samples': 19672320, 'steps': 102459, 'loss/train': 0.8089714050292969} 11/07/2021 11:36:39 - INFO - __main__ - Step 102461: {'lr': 0.00011685079094485807, 'samples': 19672512, 'steps': 102460, 'loss/train': 1.600786566734314} 11/07/2021 11:36:39 - INFO - __main__ - Step 102462: {'lr': 0.00011684629951055478, 'samples': 19672704, 'steps': 102461, 'loss/train': 0.7297754883766174} 11/07/2021 11:36:39 - INFO - __main__ - Step 102463: {'lr': 0.00011684180813624847, 'samples': 19672896, 'steps': 102462, 'loss/train': 1.3452883958816528} 11/07/2021 11:36:40 - INFO - __main__ - Step 102464: {'lr': 0.0001168373168219412, 'samples': 19673088, 'steps': 102463, 'loss/train': 1.3001964092254639} 11/07/2021 11:36:41 - INFO - __main__ - Step 102465: {'lr': 0.000116832825567635, 'samples': 19673280, 'steps': 102464, 'loss/train': 1.3869822025299072} 11/07/2021 11:36:41 - INFO - __main__ - Step 102466: {'lr': 0.00011682833437333185, 'samples': 19673472, 'steps': 102465, 'loss/train': 0.6343256831169128} 11/07/2021 11:36:42 - INFO - __main__ - Step 102467: {'lr': 0.0001168238432390338, 'samples': 19673664, 'steps': 102466, 'loss/train': 1.3549989461898804} 11/07/2021 11:36:42 - INFO - __main__ - Step 102468: {'lr': 0.00011681935216474296, 'samples': 19673856, 'steps': 102467, 'loss/train': 1.1777565479278564} 11/07/2021 11:36:43 - INFO - __main__ - Step 102469: {'lr': 0.00011681486115046117, 'samples': 19674048, 'steps': 102468, 'loss/train': 1.3768961429595947} 11/07/2021 11:36:44 - INFO - __main__ - Step 102470: {'lr': 0.00011681037019619056, 'samples': 19674240, 'steps': 102469, 'loss/train': 1.4276481866836548} 11/07/2021 11:36:44 - INFO - __main__ - Step 102471: {'lr': 0.00011680587930193315, 'samples': 19674432, 'steps': 102470, 'loss/train': 1.4819254875183105} 11/07/2021 11:36:44 - INFO - __main__ - Step 102472: {'lr': 0.00011680138846769093, 'samples': 19674624, 'steps': 102471, 'loss/train': 1.009633183479309} 11/07/2021 11:36:45 - INFO - __main__ - Step 102473: {'lr': 0.00011679689769346596, 'samples': 19674816, 'steps': 102472, 'loss/train': 1.4052072763442993} 11/07/2021 11:36:45 - INFO - __main__ - Step 102474: {'lr': 0.00011679240697926027, 'samples': 19675008, 'steps': 102473, 'loss/train': 1.863527536392212} 11/07/2021 11:36:46 - INFO - __main__ - Step 102475: {'lr': 0.00011678791632507585, 'samples': 19675200, 'steps': 102474, 'loss/train': 1.3405487537384033} 11/07/2021 11:36:46 - INFO - __main__ - Step 102476: {'lr': 0.00011678342573091474, 'samples': 19675392, 'steps': 102475, 'loss/train': 0.7493880987167358} 11/07/2021 11:36:47 - INFO - __main__ - Step 102477: {'lr': 0.00011677893519677896, 'samples': 19675584, 'steps': 102476, 'loss/train': 1.3320567607879639} 11/07/2021 11:36:47 - INFO - __main__ - Step 102478: {'lr': 0.00011677444472267054, 'samples': 19675776, 'steps': 102477, 'loss/train': 1.377241611480713} 11/07/2021 11:36:47 - INFO - __main__ - Step 102479: {'lr': 0.00011676995430859149, 'samples': 19675968, 'steps': 102478, 'loss/train': 0.897741436958313} 11/07/2021 11:36:48 - INFO - __main__ - Step 102480: {'lr': 0.00011676546395454385, 'samples': 19676160, 'steps': 102479, 'loss/train': 1.5055809020996094} 11/07/2021 11:36:49 - INFO - __main__ - Step 102481: {'lr': 0.00011676097366052974, 'samples': 19676352, 'steps': 102480, 'loss/train': 1.2630559206008911} 11/07/2021 11:36:49 - INFO - __main__ - Step 102482: {'lr': 0.00011675648342655095, 'samples': 19676544, 'steps': 102481, 'loss/train': 1.3685567378997803} 11/07/2021 11:36:49 - INFO - __main__ - Step 102483: {'lr': 0.00011675199325260968, 'samples': 19676736, 'steps': 102482, 'loss/train': 1.2760089635849} 11/07/2021 11:36:50 - INFO - __main__ - Step 102484: {'lr': 0.00011674750313870789, 'samples': 19676928, 'steps': 102483, 'loss/train': 0.38437193632125854} 11/07/2021 11:36:50 - INFO - __main__ - Step 102485: {'lr': 0.00011674301308484761, 'samples': 19677120, 'steps': 102484, 'loss/train': 1.0375893115997314} 11/07/2021 11:36:51 - INFO - __main__ - Step 102486: {'lr': 0.00011673852309103086, 'samples': 19677312, 'steps': 102485, 'loss/train': 1.452390432357788} 11/07/2021 11:36:52 - INFO - __main__ - Step 102487: {'lr': 0.00011673403315725969, 'samples': 19677504, 'steps': 102486, 'loss/train': 1.4168782234191895} 11/07/2021 11:36:52 - INFO - __main__ - Step 102488: {'lr': 0.0001167295432835361, 'samples': 19677696, 'steps': 102487, 'loss/train': 1.3967397212982178} 11/07/2021 11:36:52 - INFO - __main__ - Step 102489: {'lr': 0.00011672505346986214, 'samples': 19677888, 'steps': 102488, 'loss/train': 1.3894259929656982} 11/07/2021 11:36:53 - INFO - __main__ - Step 102490: {'lr': 0.00011672056371623982, 'samples': 19678080, 'steps': 102489, 'loss/train': 1.287712574005127} 11/07/2021 11:36:54 - INFO - __main__ - Step 102491: {'lr': 0.00011671607402267112, 'samples': 19678272, 'steps': 102490, 'loss/train': 0.8732655048370361} 11/07/2021 11:36:54 - INFO - __main__ - Step 102492: {'lr': 0.00011671158438915813, 'samples': 19678464, 'steps': 102491, 'loss/train': 1.4934160709381104} 11/07/2021 11:36:54 - INFO - __main__ - Step 102493: {'lr': 0.00011670709481570285, 'samples': 19678656, 'steps': 102492, 'loss/train': 1.1882187128067017} 11/07/2021 11:36:55 - INFO - __main__ - Step 102494: {'lr': 0.00011670260530230736, 'samples': 19678848, 'steps': 102493, 'loss/train': 1.0222828388214111} 11/07/2021 11:36:55 - INFO - __main__ - Step 102495: {'lr': 0.00011669811584897355, 'samples': 19679040, 'steps': 102494, 'loss/train': 1.114283800125122} 11/07/2021 11:36:56 - INFO - __main__ - Step 102496: {'lr': 0.0001166936264557035, 'samples': 19679232, 'steps': 102495, 'loss/train': 1.7363789081573486} 11/07/2021 11:36:57 - INFO - __main__ - Step 102497: {'lr': 0.00011668913712249923, 'samples': 19679424, 'steps': 102496, 'loss/train': 1.232516884803772} 11/07/2021 11:36:57 - INFO - __main__ - Step 102498: {'lr': 0.0001166846478493628, 'samples': 19679616, 'steps': 102497, 'loss/train': 1.189178466796875} 11/07/2021 11:36:57 - INFO - __main__ - Step 102499: {'lr': 0.00011668015863629623, 'samples': 19679808, 'steps': 102498, 'loss/train': 1.3461463451385498} 11/07/2021 11:36:58 - INFO - __main__ - Step 102500: {'lr': 0.0001166756694833015, 'samples': 19680000, 'steps': 102499, 'loss/train': 1.1028077602386475} 11/07/2021 11:36:58 - INFO - __main__ - Step 102501: {'lr': 0.00011667118039038063, 'samples': 19680192, 'steps': 102500, 'loss/train': 1.4306340217590332} 11/07/2021 11:36:59 - INFO - __main__ - Step 102502: {'lr': 0.00011666669135753571, 'samples': 19680384, 'steps': 102501, 'loss/train': 1.7292520999908447} 11/07/2021 11:36:59 - INFO - __main__ - Step 102503: {'lr': 0.00011666220238476871, 'samples': 19680576, 'steps': 102502, 'loss/train': 1.2933146953582764} 11/07/2021 11:37:00 - INFO - __main__ - Step 102504: {'lr': 0.00011665771347208164, 'samples': 19680768, 'steps': 102503, 'loss/train': 1.0603724718093872} 11/07/2021 11:37:00 - INFO - __main__ - Step 102505: {'lr': 0.00011665322461947658, 'samples': 19680960, 'steps': 102504, 'loss/train': 1.4564142227172852} 11/07/2021 11:37:00 - INFO - __main__ - Step 102506: {'lr': 0.0001166487358269555, 'samples': 19681152, 'steps': 102505, 'loss/train': 1.5569164752960205} 11/07/2021 11:37:01 - INFO - __main__ - Step 102507: {'lr': 0.00011664424709452045, 'samples': 19681344, 'steps': 102506, 'loss/train': 1.2582489252090454} 11/07/2021 11:37:02 - INFO - __main__ - Step 102508: {'lr': 0.00011663975842217353, 'samples': 19681536, 'steps': 102507, 'loss/train': 0.357313871383667} 11/07/2021 11:37:02 - INFO - __main__ - Step 102509: {'lr': 0.0001166352698099166, 'samples': 19681728, 'steps': 102508, 'loss/train': 1.482441782951355} 11/07/2021 11:37:02 - INFO - __main__ - Step 102510: {'lr': 0.00011663078125775173, 'samples': 19681920, 'steps': 102509, 'loss/train': 0.7512518167495728} 11/07/2021 11:37:03 - INFO - __main__ - Step 102511: {'lr': 0.00011662629276568099, 'samples': 19682112, 'steps': 102510, 'loss/train': 1.8054088354110718} 11/07/2021 11:37:04 - INFO - __main__ - Step 102512: {'lr': 0.00011662180433370639, 'samples': 19682304, 'steps': 102511, 'loss/train': 1.2350441217422485} 11/07/2021 11:37:04 - INFO - __main__ - Step 102513: {'lr': 0.00011661731596182995, 'samples': 19682496, 'steps': 102512, 'loss/train': 1.2529898881912231} 11/07/2021 11:37:05 - INFO - __main__ - Step 102514: {'lr': 0.00011661282765005368, 'samples': 19682688, 'steps': 102513, 'loss/train': 0.6463530659675598} 11/07/2021 11:37:05 - INFO - __main__ - Step 102515: {'lr': 0.00011660833939837962, 'samples': 19682880, 'steps': 102514, 'loss/train': 1.4847896099090576} 11/07/2021 11:37:05 - INFO - __main__ - Step 102516: {'lr': 0.00011660385120680977, 'samples': 19683072, 'steps': 102515, 'loss/train': 1.3181971311569214} 11/07/2021 11:37:07 - INFO - __main__ - Step 102517: {'lr': 0.00011659936307534615, 'samples': 19683264, 'steps': 102516, 'loss/train': 0.9258237481117249} 11/07/2021 11:37:07 - INFO - __main__ - Step 102518: {'lr': 0.00011659487500399083, 'samples': 19683456, 'steps': 102517, 'loss/train': 1.3002541065216064} 11/07/2021 11:37:07 - INFO - __main__ - Step 102519: {'lr': 0.0001165903869927458, 'samples': 19683648, 'steps': 102518, 'loss/train': 1.3400708436965942} 11/07/2021 11:37:08 - INFO - __main__ - Step 102520: {'lr': 0.00011658589904161307, 'samples': 19683840, 'steps': 102519, 'loss/train': 1.6637581586837769} 11/07/2021 11:37:08 - INFO - __main__ - Step 102521: {'lr': 0.00011658141115059479, 'samples': 19684032, 'steps': 102520, 'loss/train': 1.5743627548217773} 11/07/2021 11:37:09 - INFO - __main__ - Step 102522: {'lr': 0.00011657692331969275, 'samples': 19684224, 'steps': 102521, 'loss/train': 1.3336764574050903} 11/07/2021 11:37:09 - INFO - __main__ - Step 102523: {'lr': 0.0001165724355489091, 'samples': 19684416, 'steps': 102522, 'loss/train': 1.3903629779815674} 11/07/2021 11:37:10 - INFO - __main__ - Step 102524: {'lr': 0.00011656794783824584, 'samples': 19684608, 'steps': 102523, 'loss/train': 1.650667667388916} 11/07/2021 11:37:10 - INFO - __main__ - Step 102525: {'lr': 0.000116563460187705, 'samples': 19684800, 'steps': 102524, 'loss/train': 1.5443904399871826} 11/07/2021 11:37:10 - INFO - __main__ - Step 102526: {'lr': 0.00011655897259728863, 'samples': 19684992, 'steps': 102525, 'loss/train': 0.994145393371582} 11/07/2021 11:37:11 - INFO - __main__ - Step 102527: {'lr': 0.0001165544850669987, 'samples': 19685184, 'steps': 102526, 'loss/train': 1.026153802871704} 11/07/2021 11:37:12 - INFO - __main__ - Step 102528: {'lr': 0.00011654999759683729, 'samples': 19685376, 'steps': 102527, 'loss/train': 1.3352991342544556} 11/07/2021 11:37:12 - INFO - __main__ - Step 102529: {'lr': 0.00011654551018680637, 'samples': 19685568, 'steps': 102528, 'loss/train': 1.1444138288497925} 11/07/2021 11:37:13 - INFO - __main__ - Step 102530: {'lr': 0.00011654102283690798, 'samples': 19685760, 'steps': 102529, 'loss/train': 0.8959442377090454} 11/07/2021 11:37:13 - INFO - __main__ - Step 102531: {'lr': 0.00011653653554714416, 'samples': 19685952, 'steps': 102530, 'loss/train': 1.155552864074707} 11/07/2021 11:37:13 - INFO - __main__ - Step 102532: {'lr': 0.00011653204831751693, 'samples': 19686144, 'steps': 102531, 'loss/train': 1.2795273065567017} 11/07/2021 11:37:14 - INFO - __main__ - Step 102533: {'lr': 0.00011652756114802829, 'samples': 19686336, 'steps': 102532, 'loss/train': 1.4135655164718628} 11/07/2021 11:37:15 - INFO - __main__ - Step 102534: {'lr': 0.00011652307403868027, 'samples': 19686528, 'steps': 102533, 'loss/train': 0.7151769995689392} 11/07/2021 11:37:15 - INFO - __main__ - Step 102535: {'lr': 0.00011651858698947496, 'samples': 19686720, 'steps': 102534, 'loss/train': 1.394880771636963} 11/07/2021 11:37:15 - INFO - __main__ - Step 102536: {'lr': 0.00011651410000041423, 'samples': 19686912, 'steps': 102535, 'loss/train': 1.0252431631088257} 11/07/2021 11:37:16 - INFO - __main__ - Step 102537: {'lr': 0.00011650961307150021, 'samples': 19687104, 'steps': 102536, 'loss/train': 1.4905333518981934} 11/07/2021 11:37:17 - INFO - __main__ - Step 102538: {'lr': 0.0001165051262027349, 'samples': 19687296, 'steps': 102537, 'loss/train': 1.4505988359451294} 11/07/2021 11:37:17 - INFO - __main__ - Step 102539: {'lr': 0.00011650063939412032, 'samples': 19687488, 'steps': 102538, 'loss/train': 1.5896031856536865} 11/07/2021 11:37:17 - INFO - __main__ - Step 102540: {'lr': 0.00011649615264565846, 'samples': 19687680, 'steps': 102539, 'loss/train': 0.745124340057373} 11/07/2021 11:37:18 - INFO - __main__ - Step 102541: {'lr': 0.00011649166595735139, 'samples': 19687872, 'steps': 102540, 'loss/train': 0.31559616327285767} 11/07/2021 11:37:18 - INFO - __main__ - Step 102542: {'lr': 0.00011648717932920113, 'samples': 19688064, 'steps': 102541, 'loss/train': 1.2888070344924927} 11/07/2021 11:37:19 - INFO - __main__ - Step 102543: {'lr': 0.00011648269276120969, 'samples': 19688256, 'steps': 102542, 'loss/train': 1.3163830041885376} 11/07/2021 11:37:20 - INFO - __main__ - Step 102544: {'lr': 0.00011647820625337905, 'samples': 19688448, 'steps': 102543, 'loss/train': 1.2570923566818237} 11/07/2021 11:37:20 - INFO - __main__ - Step 102545: {'lr': 0.00011647371980571131, 'samples': 19688640, 'steps': 102544, 'loss/train': 1.2790814638137817} 11/07/2021 11:37:20 - INFO - __main__ - Step 102546: {'lr': 0.00011646923341820843, 'samples': 19688832, 'steps': 102545, 'loss/train': 1.3899096250534058} 11/07/2021 11:37:21 - INFO - __main__ - Step 102547: {'lr': 0.00011646474709087246, 'samples': 19689024, 'steps': 102546, 'loss/train': 0.7949965000152588} 11/07/2021 11:37:22 - INFO - __main__ - Step 102548: {'lr': 0.00011646026082370551, 'samples': 19689216, 'steps': 102547, 'loss/train': 2.4167747497558594} 11/07/2021 11:37:22 - INFO - __main__ - Step 102549: {'lr': 0.0001164557746167094, 'samples': 19689408, 'steps': 102548, 'loss/train': 0.79839026927948} 11/07/2021 11:37:22 - INFO - __main__ - Step 102550: {'lr': 0.00011645128846988626, 'samples': 19689600, 'steps': 102549, 'loss/train': 1.1735327243804932} 11/07/2021 11:37:23 - INFO - __main__ - Step 102551: {'lr': 0.00011644680238323813, 'samples': 19689792, 'steps': 102550, 'loss/train': 1.2386184930801392} 11/07/2021 11:37:23 - INFO - __main__ - Step 102552: {'lr': 0.00011644231635676698, 'samples': 19689984, 'steps': 102551, 'loss/train': 1.1614255905151367} 11/07/2021 11:37:24 - INFO - __main__ - Step 102553: {'lr': 0.00011643783039047487, 'samples': 19690176, 'steps': 102552, 'loss/train': 1.2878693342208862} 11/07/2021 11:37:25 - INFO - __main__ - Step 102554: {'lr': 0.00011643334448436382, 'samples': 19690368, 'steps': 102553, 'loss/train': 1.4312494993209839} 11/07/2021 11:37:25 - INFO - __main__ - Step 102555: {'lr': 0.00011642885863843586, 'samples': 19690560, 'steps': 102554, 'loss/train': 1.2297217845916748} 11/07/2021 11:37:26 - INFO - __main__ - Step 102556: {'lr': 0.00011642437285269297, 'samples': 19690752, 'steps': 102555, 'loss/train': 1.11662757396698} 11/07/2021 11:37:26 - INFO - __main__ - Step 102557: {'lr': 0.0001164198871271372, 'samples': 19690944, 'steps': 102556, 'loss/train': 0.1725337952375412} 11/07/2021 11:37:26 - INFO - __main__ - Step 102558: {'lr': 0.00011641540146177057, 'samples': 19691136, 'steps': 102557, 'loss/train': 1.6428309679031372} 11/07/2021 11:37:27 - INFO - __main__ - Step 102559: {'lr': 0.0001164109158565951, 'samples': 19691328, 'steps': 102558, 'loss/train': 1.501121163368225} 11/07/2021 11:37:28 - INFO - __main__ - Step 102560: {'lr': 0.0001164064303116128, 'samples': 19691520, 'steps': 102559, 'loss/train': 1.3679012060165405} 11/07/2021 11:37:28 - INFO - __main__ - Step 102561: {'lr': 0.00011640194482682573, 'samples': 19691712, 'steps': 102560, 'loss/train': 1.5099073648452759} 11/07/2021 11:37:28 - INFO - __main__ - Step 102562: {'lr': 0.00011639745940223596, 'samples': 19691904, 'steps': 102561, 'loss/train': 0.9861010313034058} 11/07/2021 11:37:29 - INFO - __main__ - Step 102563: {'lr': 0.00011639297403784533, 'samples': 19692096, 'steps': 102562, 'loss/train': 1.1846859455108643} 11/07/2021 11:37:29 - INFO - __main__ - Step 102564: {'lr': 0.00011638848873365596, 'samples': 19692288, 'steps': 102563, 'loss/train': 1.6499930620193481} 11/07/2021 11:37:30 - INFO - __main__ - Step 102565: {'lr': 0.0001163840034896699, 'samples': 19692480, 'steps': 102564, 'loss/train': 1.347918152809143} 11/07/2021 11:37:30 - INFO - __main__ - Step 102566: {'lr': 0.00011637951830588914, 'samples': 19692672, 'steps': 102565, 'loss/train': 1.5131465196609497} 11/07/2021 11:37:31 - INFO - __main__ - Step 102567: {'lr': 0.00011637503318231568, 'samples': 19692864, 'steps': 102566, 'loss/train': 1.3843870162963867} 11/07/2021 11:37:31 - INFO - __main__ - Step 102568: {'lr': 0.00011637054811895159, 'samples': 19693056, 'steps': 102567, 'loss/train': 1.358803391456604} 11/07/2021 11:37:31 - INFO - __main__ - Step 102569: {'lr': 0.00011636606311579886, 'samples': 19693248, 'steps': 102568, 'loss/train': 1.4943801164627075} 11/07/2021 11:37:33 - INFO - __main__ - Step 102570: {'lr': 0.00011636157817285953, 'samples': 19693440, 'steps': 102569, 'loss/train': 1.2909249067306519} 11/07/2021 11:37:33 - INFO - __main__ - Step 102571: {'lr': 0.00011635709329013561, 'samples': 19693632, 'steps': 102570, 'loss/train': 1.3042842149734497} 11/07/2021 11:37:33 - INFO - __main__ - Step 102572: {'lr': 0.00011635260846762913, 'samples': 19693824, 'steps': 102571, 'loss/train': 0.8814629912376404} 11/07/2021 11:37:34 - INFO - __main__ - Step 102573: {'lr': 0.00011634812370534209, 'samples': 19694016, 'steps': 102572, 'loss/train': 1.4303313493728638} 11/07/2021 11:37:34 - INFO - __main__ - Step 102574: {'lr': 0.00011634363900327652, 'samples': 19694208, 'steps': 102573, 'loss/train': 1.5593444108963013} 11/07/2021 11:37:34 - INFO - __main__ - Step 102575: {'lr': 0.00011633915436143452, 'samples': 19694400, 'steps': 102574, 'loss/train': 0.8073808550834656} 11/07/2021 11:37:35 - INFO - __main__ - Step 102576: {'lr': 0.00011633466977981797, 'samples': 19694592, 'steps': 102575, 'loss/train': 1.2901527881622314} 11/07/2021 11:37:36 - INFO - __main__ - Step 102577: {'lr': 0.00011633018525842895, 'samples': 19694784, 'steps': 102576, 'loss/train': 0.8559610843658447} 11/07/2021 11:37:36 - INFO - __main__ - Step 102578: {'lr': 0.00011632570079726948, 'samples': 19694976, 'steps': 102577, 'loss/train': 1.7287038564682007} 11/07/2021 11:37:36 - INFO - __main__ - Step 102579: {'lr': 0.0001163212163963416, 'samples': 19695168, 'steps': 102578, 'loss/train': 1.7591508626937866} 11/07/2021 11:37:37 - INFO - __main__ - Step 102580: {'lr': 0.0001163167320556473, 'samples': 19695360, 'steps': 102579, 'loss/train': 1.2519943714141846} 11/07/2021 11:37:38 - INFO - __main__ - Step 102581: {'lr': 0.00011631224777518861, 'samples': 19695552, 'steps': 102580, 'loss/train': 1.7420185804367065} 11/07/2021 11:37:38 - INFO - __main__ - Step 102582: {'lr': 0.00011630776355496758, 'samples': 19695744, 'steps': 102581, 'loss/train': 0.46920692920684814} 11/07/2021 11:37:39 - INFO - __main__ - Step 102583: {'lr': 0.00011630327939498622, 'samples': 19695936, 'steps': 102582, 'loss/train': 1.4239894151687622} 11/07/2021 11:37:39 - INFO - __main__ - Step 102584: {'lr': 0.0001162987952952465, 'samples': 19696128, 'steps': 102583, 'loss/train': 0.189786896109581} 11/07/2021 11:37:39 - INFO - __main__ - Step 102585: {'lr': 0.00011629431125575051, 'samples': 19696320, 'steps': 102584, 'loss/train': 1.3730112314224243} 11/07/2021 11:37:41 - INFO - __main__ - Step 102586: {'lr': 0.00011628982727650023, 'samples': 19696512, 'steps': 102585, 'loss/train': 1.1820056438446045} 11/07/2021 11:37:41 - INFO - __main__ - Step 102587: {'lr': 0.00011628534335749768, 'samples': 19696704, 'steps': 102586, 'loss/train': 1.1603105068206787} 11/07/2021 11:37:41 - INFO - __main__ - Step 102588: {'lr': 0.00011628085949874493, 'samples': 19696896, 'steps': 102587, 'loss/train': 1.3937369585037231} 11/07/2021 11:37:42 - INFO - __main__ - Step 102589: {'lr': 0.00011627637570024402, 'samples': 19697088, 'steps': 102588, 'loss/train': 1.3353227376937866} 11/07/2021 11:37:42 - INFO - __main__ - Step 102590: {'lr': 0.00011627189196199683, 'samples': 19697280, 'steps': 102589, 'loss/train': 1.191152572631836} 11/07/2021 11:37:43 - INFO - __main__ - Step 102591: {'lr': 0.00011626740828400544, 'samples': 19697472, 'steps': 102590, 'loss/train': 1.4841721057891846} 11/07/2021 11:37:44 - INFO - __main__ - Step 102592: {'lr': 0.00011626292466627192, 'samples': 19697664, 'steps': 102591, 'loss/train': 1.2506285905838013} 11/07/2021 11:37:44 - INFO - __main__ - Step 102593: {'lr': 0.00011625844110879823, 'samples': 19697856, 'steps': 102592, 'loss/train': 1.24861741065979} 11/07/2021 11:37:44 - INFO - __main__ - Step 102594: {'lr': 0.00011625395761158646, 'samples': 19698048, 'steps': 102593, 'loss/train': 1.7989517450332642} 11/07/2021 11:37:45 - INFO - __main__ - Step 102595: {'lr': 0.00011624947417463858, 'samples': 19698240, 'steps': 102594, 'loss/train': 1.6905481815338135} 11/07/2021 11:37:46 - INFO - __main__ - Step 102596: {'lr': 0.00011624499079795661, 'samples': 19698432, 'steps': 102595, 'loss/train': 1.4331682920455933} 11/07/2021 11:37:46 - INFO - __main__ - Step 102597: {'lr': 0.00011624050748154261, 'samples': 19698624, 'steps': 102596, 'loss/train': 1.2084527015686035} 11/07/2021 11:37:47 - INFO - __main__ - Step 102598: {'lr': 0.00011623602422539856, 'samples': 19698816, 'steps': 102597, 'loss/train': 1.5356950759887695} 11/07/2021 11:37:47 - INFO - __main__ - Step 102599: {'lr': 0.00011623154102952648, 'samples': 19699008, 'steps': 102598, 'loss/train': 1.675042986869812} 11/07/2021 11:37:47 - INFO - __main__ - Step 102600: {'lr': 0.0001162270578939284, 'samples': 19699200, 'steps': 102599, 'loss/train': 0.602933406829834} 11/07/2021 11:37:48 - INFO - __main__ - Step 102601: {'lr': 0.00011622257481860637, 'samples': 19699392, 'steps': 102600, 'loss/train': 1.2694206237792969} 11/07/2021 11:37:49 - INFO - __main__ - Step 102602: {'lr': 0.00011621809180356246, 'samples': 19699584, 'steps': 102601, 'loss/train': 1.9783799648284912} 11/07/2021 11:37:49 - INFO - __main__ - Step 102603: {'lr': 0.00011621360884879853, 'samples': 19699776, 'steps': 102602, 'loss/train': 1.4487541913986206} 11/07/2021 11:37:49 - INFO - __main__ - Step 102604: {'lr': 0.00011620912595431668, 'samples': 19699968, 'steps': 102603, 'loss/train': 1.5044909715652466} 11/07/2021 11:37:50 - INFO - __main__ - Step 102605: {'lr': 0.00011620464312011894, 'samples': 19700160, 'steps': 102604, 'loss/train': 0.5456273555755615} 11/07/2021 11:37:50 - INFO - __main__ - Step 102606: {'lr': 0.0001162001603462073, 'samples': 19700352, 'steps': 102605, 'loss/train': 1.4215008020401} 11/07/2021 11:37:51 - INFO - __main__ - Step 102607: {'lr': 0.00011619567763258382, 'samples': 19700544, 'steps': 102606, 'loss/train': 1.4704086780548096} 11/07/2021 11:37:52 - INFO - __main__ - Step 102608: {'lr': 0.00011619119497925049, 'samples': 19700736, 'steps': 102607, 'loss/train': 1.2370693683624268} 11/07/2021 11:37:52 - INFO - __main__ - Step 102609: {'lr': 0.00011618671238620936, 'samples': 19700928, 'steps': 102608, 'loss/train': 1.326306939125061} 11/07/2021 11:37:52 - INFO - __main__ - Step 102610: {'lr': 0.00011618222985346244, 'samples': 19701120, 'steps': 102609, 'loss/train': 1.0907737016677856} 11/07/2021 11:37:53 - INFO - __main__ - Step 102611: {'lr': 0.00011617774738101172, 'samples': 19701312, 'steps': 102610, 'loss/train': 1.4031468629837036} 11/07/2021 11:37:54 - INFO - __main__ - Step 102612: {'lr': 0.00011617326496885925, 'samples': 19701504, 'steps': 102611, 'loss/train': 1.38301420211792} 11/07/2021 11:37:54 - INFO - __main__ - Step 102613: {'lr': 0.00011616878261700702, 'samples': 19701696, 'steps': 102612, 'loss/train': 1.2921967506408691} 11/07/2021 11:37:55 - INFO - __main__ - Step 102614: {'lr': 0.00011616430032545711, 'samples': 19701888, 'steps': 102613, 'loss/train': 1.4002403020858765} 11/07/2021 11:37:55 - INFO - __main__ - Step 102615: {'lr': 0.00011615981809421158, 'samples': 19702080, 'steps': 102614, 'loss/train': 1.1185861825942993} 11/07/2021 11:37:55 - INFO - __main__ - Step 102616: {'lr': 0.00011615533592327226, 'samples': 19702272, 'steps': 102615, 'loss/train': 0.06597813963890076} 11/07/2021 11:37:56 - INFO - __main__ - Step 102617: {'lr': 0.00011615085381264132, 'samples': 19702464, 'steps': 102616, 'loss/train': 1.0624111890792847} 11/07/2021 11:37:57 - INFO - __main__ - Step 102618: {'lr': 0.00011614637176232071, 'samples': 19702656, 'steps': 102617, 'loss/train': 1.2289725542068481} 11/07/2021 11:37:57 - INFO - __main__ - Step 102619: {'lr': 0.00011614188977231247, 'samples': 19702848, 'steps': 102618, 'loss/train': 1.225174069404602} 11/07/2021 11:37:57 - INFO - __main__ - Step 102620: {'lr': 0.00011613740784261865, 'samples': 19703040, 'steps': 102619, 'loss/train': 1.2563927173614502} 11/07/2021 11:37:58 - INFO - __main__ - Step 102621: {'lr': 0.00011613292597324123, 'samples': 19703232, 'steps': 102620, 'loss/train': 1.555833339691162} 11/07/2021 11:37:59 - INFO - __main__ - Step 102622: {'lr': 0.00011612844416418228, 'samples': 19703424, 'steps': 102621, 'loss/train': 1.0591182708740234} 11/07/2021 11:37:59 - INFO - __main__ - Step 102623: {'lr': 0.00011612396241544377, 'samples': 19703616, 'steps': 102622, 'loss/train': 1.436214566230774} 11/07/2021 11:37:59 - INFO - __main__ - Step 102624: {'lr': 0.00011611948072702772, 'samples': 19703808, 'steps': 102623, 'loss/train': 1.4227265119552612} 11/07/2021 11:38:00 - INFO - __main__ - Step 102625: {'lr': 0.00011611499909893616, 'samples': 19704000, 'steps': 102624, 'loss/train': 0.9453253149986267} 11/07/2021 11:38:00 - INFO - __main__ - Step 102626: {'lr': 0.00011611051753117115, 'samples': 19704192, 'steps': 102625, 'loss/train': 1.100676417350769} 11/07/2021 11:38:01 - INFO - __main__ - Step 102627: {'lr': 0.00011610603602373466, 'samples': 19704384, 'steps': 102626, 'loss/train': 1.0800682306289673} 11/07/2021 11:38:02 - INFO - __main__ - Step 102628: {'lr': 0.00011610155457662871, 'samples': 19704576, 'steps': 102627, 'loss/train': 1.9179202318191528} 11/07/2021 11:38:02 - INFO - __main__ - Step 102629: {'lr': 0.00011609707318985546, 'samples': 19704768, 'steps': 102628, 'loss/train': 1.4344770908355713} 11/07/2021 11:38:02 - INFO - __main__ - Step 102630: {'lr': 0.00011609259186341667, 'samples': 19704960, 'steps': 102629, 'loss/train': 0.47595158219337463} 11/07/2021 11:38:03 - INFO - __main__ - Step 102631: {'lr': 0.00011608811059731453, 'samples': 19705152, 'steps': 102630, 'loss/train': 1.2475903034210205} 11/07/2021 11:38:03 - INFO - __main__ - Step 102632: {'lr': 0.00011608362939155098, 'samples': 19705344, 'steps': 102631, 'loss/train': 1.111308217048645} 11/07/2021 11:38:04 - INFO - __main__ - Step 102633: {'lr': 0.00011607914824612811, 'samples': 19705536, 'steps': 102632, 'loss/train': 1.5920684337615967} 11/07/2021 11:38:04 - INFO - __main__ - Step 102634: {'lr': 0.00011607466716104792, 'samples': 19705728, 'steps': 102633, 'loss/train': 1.3904722929000854} 11/07/2021 11:38:05 - INFO - __main__ - Step 102635: {'lr': 0.0001160701861363124, 'samples': 19705920, 'steps': 102634, 'loss/train': 1.1364413499832153} 11/07/2021 11:38:05 - INFO - __main__ - Step 102636: {'lr': 0.00011606570517192357, 'samples': 19706112, 'steps': 102635, 'loss/train': 1.5734738111495972} 11/07/2021 11:38:05 - INFO - __main__ - Step 102637: {'lr': 0.00011606122426788349, 'samples': 19706304, 'steps': 102636, 'loss/train': 1.352369785308838} 11/07/2021 11:38:06 - INFO - __main__ - Step 102638: {'lr': 0.00011605674342419414, 'samples': 19706496, 'steps': 102637, 'loss/train': 0.5971621870994568} 11/07/2021 11:38:07 - INFO - __main__ - Step 102639: {'lr': 0.00011605226264085758, 'samples': 19706688, 'steps': 102638, 'loss/train': 1.4703949689865112} 11/07/2021 11:38:07 - INFO - __main__ - Step 102640: {'lr': 0.00011604778191787579, 'samples': 19706880, 'steps': 102639, 'loss/train': 1.5936592817306519} 11/07/2021 11:38:08 - INFO - __main__ - Step 102641: {'lr': 0.00011604330125525078, 'samples': 19707072, 'steps': 102640, 'loss/train': 1.415256381034851} 11/07/2021 11:38:08 - INFO - __main__ - Step 102642: {'lr': 0.00011603882065298471, 'samples': 19707264, 'steps': 102641, 'loss/train': 1.6754474639892578} 11/07/2021 11:38:09 - INFO - __main__ - Step 102643: {'lr': 0.00011603434011107939, 'samples': 19707456, 'steps': 102642, 'loss/train': 0.8479533195495605} 11/07/2021 11:38:09 - INFO - __main__ - Step 102644: {'lr': 0.00011602985962953692, 'samples': 19707648, 'steps': 102643, 'loss/train': 1.0244876146316528} 11/07/2021 11:38:10 - INFO - __main__ - Step 102645: {'lr': 0.00011602537920835932, 'samples': 19707840, 'steps': 102644, 'loss/train': 1.3946669101715088} 11/07/2021 11:38:10 - INFO - __main__ - Step 102646: {'lr': 0.00011602089884754863, 'samples': 19708032, 'steps': 102645, 'loss/train': 0.6151880025863647} 11/07/2021 11:38:10 - INFO - __main__ - Step 102647: {'lr': 0.00011601641854710684, 'samples': 19708224, 'steps': 102646, 'loss/train': 1.4425837993621826} 11/07/2021 11:38:11 - INFO - __main__ - Step 102648: {'lr': 0.00011601193830703602, 'samples': 19708416, 'steps': 102647, 'loss/train': 0.3117143213748932} 11/07/2021 11:38:12 - INFO - __main__ - Step 102649: {'lr': 0.00011600745812733812, 'samples': 19708608, 'steps': 102648, 'loss/train': 1.622767686843872} 11/07/2021 11:38:12 - INFO - __main__ - Step 102650: {'lr': 0.00011600297800801521, 'samples': 19708800, 'steps': 102649, 'loss/train': 2.205110788345337} 11/07/2021 11:38:13 - INFO - __main__ - Step 102651: {'lr': 0.00011599849794906928, 'samples': 19708992, 'steps': 102650, 'loss/train': 1.0191504955291748} 11/07/2021 11:38:13 - INFO - __main__ - Step 102652: {'lr': 0.00011599401795050235, 'samples': 19709184, 'steps': 102651, 'loss/train': 1.3990370035171509} 11/07/2021 11:38:13 - INFO - __main__ - Step 102653: {'lr': 0.00011598953801231646, 'samples': 19709376, 'steps': 102652, 'loss/train': 1.2157775163650513} 11/07/2021 11:38:14 - INFO - __main__ - Step 102654: {'lr': 0.0001159850581345136, 'samples': 19709568, 'steps': 102653, 'loss/train': 1.476638674736023} 11/07/2021 11:38:15 - INFO - __main__ - Step 102655: {'lr': 0.00011598057831709591, 'samples': 19709760, 'steps': 102654, 'loss/train': 1.2464877367019653} 11/07/2021 11:38:15 - INFO - __main__ - Step 102656: {'lr': 0.00011597609856006522, 'samples': 19709952, 'steps': 102655, 'loss/train': 1.8963195085525513} 11/07/2021 11:38:15 - INFO - __main__ - Step 102657: {'lr': 0.0001159716188634236, 'samples': 19710144, 'steps': 102656, 'loss/train': 1.1771830320358276} 11/07/2021 11:38:16 - INFO - __main__ - Step 102658: {'lr': 0.00011596713922717314, 'samples': 19710336, 'steps': 102657, 'loss/train': 1.8506317138671875} 11/07/2021 11:38:17 - INFO - __main__ - Step 102659: {'lr': 0.0001159626596513158, 'samples': 19710528, 'steps': 102658, 'loss/train': 1.2484617233276367} 11/07/2021 11:38:17 - INFO - __main__ - Step 102660: {'lr': 0.00011595818013585362, 'samples': 19710720, 'steps': 102659, 'loss/train': 1.497196912765503} 11/07/2021 11:38:18 - INFO - __main__ - Step 102661: {'lr': 0.00011595370068078861, 'samples': 19710912, 'steps': 102660, 'loss/train': 0.8185768127441406} 11/07/2021 11:38:18 - INFO - __main__ - Step 102662: {'lr': 0.00011594922128612282, 'samples': 19711104, 'steps': 102661, 'loss/train': 0.9135160446166992} 11/07/2021 11:38:18 - INFO - __main__ - Step 102663: {'lr': 0.00011594474195185823, 'samples': 19711296, 'steps': 102662, 'loss/train': 1.6415969133377075} 11/07/2021 11:38:19 - INFO - __main__ - Step 102664: {'lr': 0.00011594026267799684, 'samples': 19711488, 'steps': 102663, 'loss/train': 1.3335235118865967} 11/07/2021 11:38:20 - INFO - __main__ - Step 102665: {'lr': 0.00011593578346454073, 'samples': 19711680, 'steps': 102664, 'loss/train': 1.0602774620056152} 11/07/2021 11:38:20 - INFO - __main__ - Step 102666: {'lr': 0.00011593130431149199, 'samples': 19711872, 'steps': 102665, 'loss/train': 1.4880911111831665} 11/07/2021 11:38:20 - INFO - __main__ - Step 102667: {'lr': 0.00011592682521885243, 'samples': 19712064, 'steps': 102666, 'loss/train': 1.1689987182617188} 11/07/2021 11:38:21 - INFO - __main__ - Step 102668: {'lr': 0.00011592234618662415, 'samples': 19712256, 'steps': 102667, 'loss/train': 2.184819221496582} 11/07/2021 11:38:22 - INFO - __main__ - Step 102669: {'lr': 0.00011591786721480921, 'samples': 19712448, 'steps': 102668, 'loss/train': 1.2803561687469482} 11/07/2021 11:38:22 - INFO - __main__ - Step 102670: {'lr': 0.00011591338830340961, 'samples': 19712640, 'steps': 102669, 'loss/train': 1.2183668613433838} 11/07/2021 11:38:23 - INFO - __main__ - Step 102671: {'lr': 0.00011590890945242738, 'samples': 19712832, 'steps': 102670, 'loss/train': 1.356734037399292} 11/07/2021 11:38:23 - INFO - __main__ - Step 102672: {'lr': 0.00011590443066186451, 'samples': 19713024, 'steps': 102671, 'loss/train': 1.4014626741409302} 11/07/2021 11:38:23 - INFO - __main__ - Step 102673: {'lr': 0.00011589995193172303, 'samples': 19713216, 'steps': 102672, 'loss/train': 0.9903932213783264} 11/07/2021 11:38:24 - INFO - __main__ - Step 102674: {'lr': 0.00011589547326200497, 'samples': 19713408, 'steps': 102673, 'loss/train': 0.9124348163604736} 11/07/2021 11:38:24 - INFO - __main__ - Step 102675: {'lr': 0.00011589099465271233, 'samples': 19713600, 'steps': 102674, 'loss/train': 1.340050458908081} 11/07/2021 11:38:25 - INFO - __main__ - Step 102676: {'lr': 0.00011588651610384715, 'samples': 19713792, 'steps': 102675, 'loss/train': 1.3311338424682617} 11/07/2021 11:38:25 - INFO - __main__ - Step 102677: {'lr': 0.00011588203761541154, 'samples': 19713984, 'steps': 102676, 'loss/train': 1.4520550966262817} 11/07/2021 11:38:26 - INFO - __main__ - Step 102678: {'lr': 0.0001158775591874073, 'samples': 19714176, 'steps': 102677, 'loss/train': 1.7232475280761719} 11/07/2021 11:38:26 - INFO - __main__ - Step 102679: {'lr': 0.00011587308081983657, 'samples': 19714368, 'steps': 102678, 'loss/train': 1.6058779954910278} 11/07/2021 11:38:27 - INFO - __main__ - Step 102680: {'lr': 0.00011586860251270137, 'samples': 19714560, 'steps': 102679, 'loss/train': 1.312191128730774} 11/07/2021 11:38:27 - INFO - __main__ - Step 102681: {'lr': 0.00011586412426600371, 'samples': 19714752, 'steps': 102680, 'loss/train': 1.606053113937378} 11/07/2021 11:38:28 - INFO - __main__ - Step 102682: {'lr': 0.00011585964607974559, 'samples': 19714944, 'steps': 102681, 'loss/train': 1.3257185220718384} 11/07/2021 11:38:28 - INFO - __main__ - Step 102683: {'lr': 0.00011585516795392905, 'samples': 19715136, 'steps': 102682, 'loss/train': 1.4284789562225342} 11/07/2021 11:38:28 - INFO - __main__ - Step 102684: {'lr': 0.0001158506898885561, 'samples': 19715328, 'steps': 102683, 'loss/train': 1.1203874349594116} 11/07/2021 11:38:30 - INFO - __main__ - Step 102685: {'lr': 0.00011584621188362875, 'samples': 19715520, 'steps': 102684, 'loss/train': 0.8465638756752014} 11/07/2021 11:38:30 - INFO - __main__ - Step 102686: {'lr': 0.00011584173393914904, 'samples': 19715712, 'steps': 102685, 'loss/train': 1.413559079170227} 11/07/2021 11:38:30 - INFO - __main__ - Step 102687: {'lr': 0.00011583725605511908, 'samples': 19715904, 'steps': 102686, 'loss/train': 0.2176589071750641} 11/07/2021 11:38:31 - INFO - __main__ - Step 102688: {'lr': 0.00011583277823154068, 'samples': 19716096, 'steps': 102687, 'loss/train': 1.050429344177246} 11/07/2021 11:38:32 - INFO - __main__ - Step 102689: {'lr': 0.00011582830046841594, 'samples': 19716288, 'steps': 102688, 'loss/train': 0.6126284599304199} 11/07/2021 11:38:32 - INFO - __main__ - Step 102690: {'lr': 0.00011582382276574691, 'samples': 19716480, 'steps': 102689, 'loss/train': 1.0931700468063354} 11/07/2021 11:38:33 - INFO - __main__ - Step 102691: {'lr': 0.0001158193451235356, 'samples': 19716672, 'steps': 102690, 'loss/train': 1.5521838665008545} 11/07/2021 11:38:33 - INFO - __main__ - Step 102692: {'lr': 0.00011581486754178403, 'samples': 19716864, 'steps': 102691, 'loss/train': 0.5393563508987427} 11/07/2021 11:38:33 - INFO - __main__ - Step 102693: {'lr': 0.00011581039002049418, 'samples': 19717056, 'steps': 102692, 'loss/train': 1.0900758504867554} 11/07/2021 11:38:34 - INFO - __main__ - Step 102694: {'lr': 0.00011580591255966812, 'samples': 19717248, 'steps': 102693, 'loss/train': 1.4786405563354492} 11/07/2021 11:38:35 - INFO - __main__ - Step 102695: {'lr': 0.00011580143515930785, 'samples': 19717440, 'steps': 102694, 'loss/train': 1.3051283359527588} 11/07/2021 11:38:35 - INFO - __main__ - Step 102696: {'lr': 0.00011579695781941538, 'samples': 19717632, 'steps': 102695, 'loss/train': 1.3382846117019653} 11/07/2021 11:38:35 - INFO - __main__ - Step 102697: {'lr': 0.00011579248053999272, 'samples': 19717824, 'steps': 102696, 'loss/train': 1.689946174621582} 11/07/2021 11:38:36 - INFO - __main__ - Step 102698: {'lr': 0.000115788003321042, 'samples': 19718016, 'steps': 102697, 'loss/train': 1.407332420349121} 11/07/2021 11:38:36 - INFO - __main__ - Step 102699: {'lr': 0.00011578352616256501, 'samples': 19718208, 'steps': 102698, 'loss/train': 1.3819090127944946} 11/07/2021 11:38:37 - INFO - __main__ - Step 102700: {'lr': 0.00011577904906456394, 'samples': 19718400, 'steps': 102699, 'loss/train': 1.1182457208633423} 11/07/2021 11:38:37 - INFO - __main__ - Step 102701: {'lr': 0.00011577457202704073, 'samples': 19718592, 'steps': 102700, 'loss/train': 1.6219910383224487} 11/07/2021 11:38:38 - INFO - __main__ - Step 102702: {'lr': 0.00011577009504999744, 'samples': 19718784, 'steps': 102701, 'loss/train': 1.3956875801086426} 11/07/2021 11:38:38 - INFO - __main__ - Step 102703: {'lr': 0.00011576561813343605, 'samples': 19718976, 'steps': 102702, 'loss/train': 1.227952241897583} 11/07/2021 11:38:38 - INFO - __main__ - Step 102704: {'lr': 0.00011576114127735862, 'samples': 19719168, 'steps': 102703, 'loss/train': 1.2663452625274658} 11/07/2021 11:38:40 - INFO - __main__ - Step 102705: {'lr': 0.00011575666448176717, 'samples': 19719360, 'steps': 102704, 'loss/train': 1.466339111328125} 11/07/2021 11:38:40 - INFO - __main__ - Step 102706: {'lr': 0.00011575218774666366, 'samples': 19719552, 'steps': 102705, 'loss/train': 1.2324153184890747} 11/07/2021 11:38:40 - INFO - __main__ - Step 102707: {'lr': 0.00011574771107205015, 'samples': 19719744, 'steps': 102706, 'loss/train': 1.0168867111206055} 11/07/2021 11:38:41 - INFO - __main__ - Step 102708: {'lr': 0.00011574323445792866, 'samples': 19719936, 'steps': 102707, 'loss/train': 1.2563458681106567} 11/07/2021 11:38:41 - INFO - __main__ - Step 102709: {'lr': 0.00011573875790430119, 'samples': 19720128, 'steps': 102708, 'loss/train': 1.770261526107788} 11/07/2021 11:38:42 - INFO - __main__ - Step 102710: {'lr': 0.00011573428141116987, 'samples': 19720320, 'steps': 102709, 'loss/train': 0.9210589528083801} 11/07/2021 11:38:42 - INFO - __main__ - Step 102711: {'lr': 0.0001157298049785365, 'samples': 19720512, 'steps': 102710, 'loss/train': 1.4455710649490356} 11/07/2021 11:38:43 - INFO - __main__ - Step 102712: {'lr': 0.00011572532860640322, 'samples': 19720704, 'steps': 102711, 'loss/train': 1.4296574592590332} 11/07/2021 11:38:43 - INFO - __main__ - Step 102713: {'lr': 0.00011572085229477203, 'samples': 19720896, 'steps': 102712, 'loss/train': 1.0875312089920044} 11/07/2021 11:38:43 - INFO - __main__ - Step 102714: {'lr': 0.00011571637604364493, 'samples': 19721088, 'steps': 102713, 'loss/train': 1.3934286832809448} 11/07/2021 11:38:45 - INFO - __main__ - Step 102715: {'lr': 0.000115711899853024, 'samples': 19721280, 'steps': 102714, 'loss/train': 1.435342788696289} 11/07/2021 11:38:45 - INFO - __main__ - Step 102716: {'lr': 0.00011570742372291118, 'samples': 19721472, 'steps': 102715, 'loss/train': 1.3199607133865356} 11/07/2021 11:38:45 - INFO - __main__ - Step 102717: {'lr': 0.00011570294765330855, 'samples': 19721664, 'steps': 102716, 'loss/train': 1.4961237907409668} 11/07/2021 11:38:46 - INFO - __main__ - Step 102718: {'lr': 0.0001156984716442181, 'samples': 19721856, 'steps': 102717, 'loss/train': 1.5195404291152954} 11/07/2021 11:38:46 - INFO - __main__ - Step 102719: {'lr': 0.00011569399569564181, 'samples': 19722048, 'steps': 102718, 'loss/train': 2.0864059925079346} 11/07/2021 11:38:47 - INFO - __main__ - Step 102720: {'lr': 0.00011568951980758177, 'samples': 19722240, 'steps': 102719, 'loss/train': 1.1441255807876587} 11/07/2021 11:38:47 - INFO - __main__ - Step 102721: {'lr': 0.00011568504398003996, 'samples': 19722432, 'steps': 102720, 'loss/train': 1.5987565517425537} 11/07/2021 11:38:48 - INFO - __main__ - Step 102722: {'lr': 0.00011568056821301836, 'samples': 19722624, 'steps': 102721, 'loss/train': 0.5540527105331421} 11/07/2021 11:38:48 - INFO - __main__ - Step 102723: {'lr': 0.00011567609250651914, 'samples': 19722816, 'steps': 102722, 'loss/train': 1.2304446697235107} 11/07/2021 11:38:48 - INFO - __main__ - Step 102724: {'lr': 0.00011567161686054411, 'samples': 19723008, 'steps': 102723, 'loss/train': 1.672547698020935} 11/07/2021 11:38:49 - INFO - __main__ - Step 102725: {'lr': 0.0001156671412750954, 'samples': 19723200, 'steps': 102724, 'loss/train': 1.2073005437850952} 11/07/2021 11:38:50 - INFO - __main__ - Step 102726: {'lr': 0.00011566266575017495, 'samples': 19723392, 'steps': 102725, 'loss/train': 1.4759340286254883} 11/07/2021 11:38:50 - INFO - __main__ - Step 102727: {'lr': 0.00011565819028578486, 'samples': 19723584, 'steps': 102726, 'loss/train': 1.9360098838806152} 11/07/2021 11:38:50 - INFO - __main__ - Step 102728: {'lr': 0.0001156537148819271, 'samples': 19723776, 'steps': 102727, 'loss/train': 0.4830029010772705} 11/07/2021 11:38:51 - INFO - __main__ - Step 102729: {'lr': 0.00011564923953860373, 'samples': 19723968, 'steps': 102728, 'loss/train': 1.3661513328552246} 11/07/2021 11:38:51 - INFO - __main__ - Step 102730: {'lr': 0.00011564476425581675, 'samples': 19724160, 'steps': 102729, 'loss/train': 1.7246900796890259} 11/07/2021 11:38:52 - INFO - __main__ - Step 102731: {'lr': 0.00011564028903356813, 'samples': 19724352, 'steps': 102730, 'loss/train': 1.2096542119979858} 11/07/2021 11:38:53 - INFO - __main__ - Step 102732: {'lr': 0.00011563581387185992, 'samples': 19724544, 'steps': 102731, 'loss/train': 1.1770639419555664} 11/07/2021 11:38:53 - INFO - __main__ - Step 102733: {'lr': 0.00011563133877069415, 'samples': 19724736, 'steps': 102732, 'loss/train': 1.6289945840835571} 11/07/2021 11:38:53 - INFO - __main__ - Step 102734: {'lr': 0.00011562686373007284, 'samples': 19724928, 'steps': 102733, 'loss/train': 1.303132176399231} 11/07/2021 11:38:54 - INFO - __main__ - Step 102735: {'lr': 0.00011562238874999797, 'samples': 19725120, 'steps': 102734, 'loss/train': 1.3691954612731934} 11/07/2021 11:38:54 - INFO - __main__ - Step 102736: {'lr': 0.00011561791383047168, 'samples': 19725312, 'steps': 102735, 'loss/train': 1.4454894065856934} 11/07/2021 11:38:55 - INFO - __main__ - Step 102737: {'lr': 0.00011561343897149581, 'samples': 19725504, 'steps': 102736, 'loss/train': 1.2282594442367554} 11/07/2021 11:38:55 - INFO - __main__ - Step 102738: {'lr': 0.00011560896417307243, 'samples': 19725696, 'steps': 102737, 'loss/train': 1.5291059017181396} 11/07/2021 11:38:56 - INFO - __main__ - Step 102739: {'lr': 0.00011560448943520357, 'samples': 19725888, 'steps': 102738, 'loss/train': 1.3602294921875} 11/07/2021 11:38:56 - INFO - __main__ - Step 102740: {'lr': 0.00011560001475789128, 'samples': 19726080, 'steps': 102739, 'loss/train': 1.9824014902114868} 11/07/2021 11:38:56 - INFO - __main__ - Step 102741: {'lr': 0.00011559554014113751, 'samples': 19726272, 'steps': 102740, 'loss/train': 0.845895528793335} 11/07/2021 11:38:58 - INFO - __main__ - Step 102742: {'lr': 0.00011559106558494433, 'samples': 19726464, 'steps': 102741, 'loss/train': 1.489071011543274} 11/07/2021 11:38:58 - INFO - __main__ - Step 102743: {'lr': 0.00011558659108931377, 'samples': 19726656, 'steps': 102742, 'loss/train': 1.3704346418380737} 11/07/2021 11:38:58 - INFO - __main__ - Step 102744: {'lr': 0.0001155821166542478, 'samples': 19726848, 'steps': 102743, 'loss/train': 1.1266621351242065} 11/07/2021 11:38:59 - INFO - __main__ - Step 102745: {'lr': 0.00011557764227974845, 'samples': 19727040, 'steps': 102744, 'loss/train': 1.2107712030410767} 11/07/2021 11:38:59 - INFO - __main__ - Step 102746: {'lr': 0.00011557316796581774, 'samples': 19727232, 'steps': 102745, 'loss/train': 0.5364639163017273} 11/07/2021 11:39:00 - INFO - __main__ - Step 102747: {'lr': 0.0001155686937124577, 'samples': 19727424, 'steps': 102746, 'loss/train': 1.3218528032302856} 11/07/2021 11:39:01 - INFO - __main__ - Step 102748: {'lr': 0.00011556421951967031, 'samples': 19727616, 'steps': 102747, 'loss/train': 1.1782493591308594} 11/07/2021 11:39:01 - INFO - __main__ - Step 102749: {'lr': 0.00011555974538745762, 'samples': 19727808, 'steps': 102748, 'loss/train': 0.8442487120628357} 11/07/2021 11:39:01 - INFO - __main__ - Step 102750: {'lr': 0.00011555527131582173, 'samples': 19728000, 'steps': 102749, 'loss/train': 1.1629987955093384} 11/07/2021 11:39:02 - INFO - __main__ - Step 102751: {'lr': 0.00011555079730476448, 'samples': 19728192, 'steps': 102750, 'loss/train': 1.5565497875213623} 11/07/2021 11:39:02 - INFO - __main__ - Step 102752: {'lr': 0.00011554632335428795, 'samples': 19728384, 'steps': 102751, 'loss/train': 1.5278316736221313} 11/07/2021 11:39:03 - INFO - __main__ - Step 102753: {'lr': 0.00011554184946439417, 'samples': 19728576, 'steps': 102752, 'loss/train': 0.647628903388977} 11/07/2021 11:39:03 - INFO - __main__ - Step 102754: {'lr': 0.00011553737563508515, 'samples': 19728768, 'steps': 102753, 'loss/train': 1.3357783555984497} 11/07/2021 11:39:04 - INFO - __main__ - Step 102755: {'lr': 0.00011553290186636293, 'samples': 19728960, 'steps': 102754, 'loss/train': 1.4217138290405273} 11/07/2021 11:39:04 - INFO - __main__ - Step 102756: {'lr': 0.00011552842815822947, 'samples': 19729152, 'steps': 102755, 'loss/train': 1.5391336679458618} 11/07/2021 11:39:04 - INFO - __main__ - Step 102757: {'lr': 0.00011552395451068686, 'samples': 19729344, 'steps': 102756, 'loss/train': 1.0376707315444946} 11/07/2021 11:39:06 - INFO - __main__ - Step 102758: {'lr': 0.0001155194809237371, 'samples': 19729536, 'steps': 102757, 'loss/train': 1.7768622636795044} 11/07/2021 11:39:06 - INFO - __main__ - Step 102759: {'lr': 0.00011551500739738217, 'samples': 19729728, 'steps': 102758, 'loss/train': 1.417374610900879} 11/07/2021 11:39:06 - INFO - __main__ - Step 102760: {'lr': 0.00011551053393162409, 'samples': 19729920, 'steps': 102759, 'loss/train': 1.2359521389007568} 11/07/2021 11:39:07 - INFO - __main__ - Step 102761: {'lr': 0.00011550606052646489, 'samples': 19730112, 'steps': 102760, 'loss/train': 1.8374155759811401} 11/07/2021 11:39:07 - INFO - __main__ - Step 102762: {'lr': 0.00011550158718190659, 'samples': 19730304, 'steps': 102761, 'loss/train': 1.318320870399475} 11/07/2021 11:39:08 - INFO - __main__ - Step 102763: {'lr': 0.00011549711389795129, 'samples': 19730496, 'steps': 102762, 'loss/train': 1.044413447380066} 11/07/2021 11:39:08 - INFO - __main__ - Step 102764: {'lr': 0.00011549264067460083, 'samples': 19730688, 'steps': 102763, 'loss/train': 1.1128249168395996} 11/07/2021 11:39:09 - INFO - __main__ - Step 102765: {'lr': 0.00011548816751185731, 'samples': 19730880, 'steps': 102764, 'loss/train': 1.3251676559448242} 11/07/2021 11:39:09 - INFO - __main__ - Step 102766: {'lr': 0.00011548369440972272, 'samples': 19731072, 'steps': 102765, 'loss/train': 1.2280908823013306} 11/07/2021 11:39:09 - INFO - __main__ - Step 102767: {'lr': 0.00011547922136819913, 'samples': 19731264, 'steps': 102766, 'loss/train': 1.3636692762374878} 11/07/2021 11:39:10 - INFO - __main__ - Step 102768: {'lr': 0.00011547474838728853, 'samples': 19731456, 'steps': 102767, 'loss/train': 1.0249441862106323} 11/07/2021 11:39:11 - INFO - __main__ - Step 102769: {'lr': 0.00011547027546699293, 'samples': 19731648, 'steps': 102768, 'loss/train': 1.7394630908966064} 11/07/2021 11:39:11 - INFO - __main__ - Step 102770: {'lr': 0.00011546580260731432, 'samples': 19731840, 'steps': 102769, 'loss/train': 1.2900031805038452} 11/07/2021 11:39:11 - INFO - __main__ - Step 102771: {'lr': 0.00011546132980825477, 'samples': 19732032, 'steps': 102770, 'loss/train': 1.2914445400238037} 11/07/2021 11:39:12 - INFO - __main__ - Step 102772: {'lr': 0.00011545685706981627, 'samples': 19732224, 'steps': 102771, 'loss/train': 1.3841561079025269} 11/07/2021 11:39:13 - INFO - __main__ - Step 102773: {'lr': 0.00011545238439200082, 'samples': 19732416, 'steps': 102772, 'loss/train': 1.9513505697250366} 11/07/2021 11:39:13 - INFO - __main__ - Step 102774: {'lr': 0.00011544791177481046, 'samples': 19732608, 'steps': 102773, 'loss/train': 1.1450830698013306} 11/07/2021 11:39:14 - INFO - __main__ - Step 102775: {'lr': 0.00011544343921824718, 'samples': 19732800, 'steps': 102774, 'loss/train': 1.1838682889938354} 11/07/2021 11:39:14 - INFO - __main__ - Step 102776: {'lr': 0.00011543896672231301, 'samples': 19732992, 'steps': 102775, 'loss/train': 1.7773964405059814} 11/07/2021 11:39:15 - INFO - __main__ - Step 102777: {'lr': 0.00011543449428701008, 'samples': 19733184, 'steps': 102776, 'loss/train': 1.8263493776321411} 11/07/2021 11:39:16 - INFO - __main__ - Step 102778: {'lr': 0.0001154300219123402, 'samples': 19733376, 'steps': 102777, 'loss/train': 1.113008975982666} 11/07/2021 11:39:16 - INFO - __main__ - Step 102779: {'lr': 0.00011542554959830545, 'samples': 19733568, 'steps': 102778, 'loss/train': 1.4837173223495483} 11/07/2021 11:39:16 - INFO - __main__ - Step 102780: {'lr': 0.0001154210773449079, 'samples': 19733760, 'steps': 102779, 'loss/train': 0.8281930685043335} 11/07/2021 11:39:17 - INFO - __main__ - Step 102781: {'lr': 0.0001154166051521495, 'samples': 19733952, 'steps': 102780, 'loss/train': 1.0921305418014526} 11/07/2021 11:39:17 - INFO - __main__ - Step 102782: {'lr': 0.00011541213302003231, 'samples': 19734144, 'steps': 102781, 'loss/train': 1.5762865543365479} 11/07/2021 11:39:18 - INFO - __main__ - Step 102783: {'lr': 0.00011540766094855834, 'samples': 19734336, 'steps': 102782, 'loss/train': 1.354197382926941} 11/07/2021 11:39:19 - INFO - __main__ - Step 102784: {'lr': 0.0001154031889377296, 'samples': 19734528, 'steps': 102783, 'loss/train': 1.1669279336929321} 11/07/2021 11:39:19 - INFO - __main__ - Step 102785: {'lr': 0.00011539871698754814, 'samples': 19734720, 'steps': 102784, 'loss/train': 1.3436262607574463} 11/07/2021 11:39:19 - INFO - __main__ - Step 102786: {'lr': 0.00011539424509801591, 'samples': 19734912, 'steps': 102785, 'loss/train': 1.3901572227478027} 11/07/2021 11:39:20 - INFO - __main__ - Step 102787: {'lr': 0.00011538977326913494, 'samples': 19735104, 'steps': 102786, 'loss/train': 1.262436866760254} 11/07/2021 11:39:20 - INFO - __main__ - Step 102788: {'lr': 0.00011538530150090728, 'samples': 19735296, 'steps': 102787, 'loss/train': 0.17552945017814636} 11/07/2021 11:39:21 - INFO - __main__ - Step 102789: {'lr': 0.00011538082979333495, 'samples': 19735488, 'steps': 102788, 'loss/train': 1.3084080219268799} 11/07/2021 11:39:21 - INFO - __main__ - Step 102790: {'lr': 0.00011537635814642, 'samples': 19735680, 'steps': 102789, 'loss/train': 1.151594638824463} 11/07/2021 11:39:22 - INFO - __main__ - Step 102791: {'lr': 0.00011537188656016432, 'samples': 19735872, 'steps': 102790, 'loss/train': 1.5409233570098877} 11/07/2021 11:39:22 - INFO - __main__ - Step 102792: {'lr': 0.00011536741503456997, 'samples': 19736064, 'steps': 102791, 'loss/train': 1.5501198768615723} 11/07/2021 11:39:22 - INFO - __main__ - Step 102793: {'lr': 0.00011536294356963898, 'samples': 19736256, 'steps': 102792, 'loss/train': 1.1032698154449463} 11/07/2021 11:39:24 - INFO - __main__ - Step 102794: {'lr': 0.00011535847216537337, 'samples': 19736448, 'steps': 102793, 'loss/train': 1.191391944885254} 11/07/2021 11:39:24 - INFO - __main__ - Step 102795: {'lr': 0.00011535400082177516, 'samples': 19736640, 'steps': 102794, 'loss/train': 1.0186656713485718} 11/07/2021 11:39:24 - INFO - __main__ - Step 102796: {'lr': 0.00011534952953884636, 'samples': 19736832, 'steps': 102795, 'loss/train': 1.5419156551361084} 11/07/2021 11:39:25 - INFO - __main__ - Step 102797: {'lr': 0.00011534505831658899, 'samples': 19737024, 'steps': 102796, 'loss/train': 1.0396455526351929} 11/07/2021 11:39:25 - INFO - __main__ - Step 102798: {'lr': 0.00011534058715500507, 'samples': 19737216, 'steps': 102797, 'loss/train': 1.0101232528686523} 11/07/2021 11:39:27 - INFO - __main__ - Step 102799: {'lr': 0.00011533611605409658, 'samples': 19737408, 'steps': 102798, 'loss/train': 1.542673945426941} 11/07/2021 11:39:28 - INFO - __main__ - Step 102800: {'lr': 0.00011533164501386556, 'samples': 19737600, 'steps': 102799, 'loss/train': 1.506532073020935} 11/07/2021 11:39:28 - INFO - __main__ - Step 102801: {'lr': 0.00011532717403431403, 'samples': 19737792, 'steps': 102800, 'loss/train': 0.942855179309845} 11/07/2021 11:39:28 - INFO - __main__ - Step 102802: {'lr': 0.00011532270311544399, 'samples': 19737984, 'steps': 102801, 'loss/train': 1.0808570384979248} 11/07/2021 11:39:29 - INFO - __main__ - Step 102803: {'lr': 0.00011531823225725749, 'samples': 19738176, 'steps': 102802, 'loss/train': 1.7645456790924072} 11/07/2021 11:39:29 - INFO - __main__ - Step 102804: {'lr': 0.00011531376145975659, 'samples': 19738368, 'steps': 102803, 'loss/train': 1.7632535696029663} 11/07/2021 11:39:29 - INFO - __main__ - Step 102805: {'lr': 0.00011530929072294313, 'samples': 19738560, 'steps': 102804, 'loss/train': 1.7728123664855957} 11/07/2021 11:39:30 - INFO - __main__ - Step 102806: {'lr': 0.00011530482004681922, 'samples': 19738752, 'steps': 102805, 'loss/train': 0.5977140665054321} 11/07/2021 11:39:31 - INFO - __main__ - Step 102807: {'lr': 0.0001153003494313869, 'samples': 19738944, 'steps': 102806, 'loss/train': 1.5752661228179932} 11/07/2021 11:39:31 - INFO - __main__ - Step 102808: {'lr': 0.00011529587887664816, 'samples': 19739136, 'steps': 102807, 'loss/train': 1.5652540922164917} 11/07/2021 11:39:32 - INFO - __main__ - Step 102809: {'lr': 0.00011529140838260501, 'samples': 19739328, 'steps': 102808, 'loss/train': 0.32257330417633057} 11/07/2021 11:39:32 - INFO - __main__ - Step 102810: {'lr': 0.00011528693794925949, 'samples': 19739520, 'steps': 102809, 'loss/train': 2.052493095397949} 11/07/2021 11:39:32 - INFO - __main__ - Step 102811: {'lr': 0.00011528246757661356, 'samples': 19739712, 'steps': 102810, 'loss/train': 1.1594616174697876} 11/07/2021 11:39:34 - INFO - __main__ - Step 102812: {'lr': 0.00011527799726466931, 'samples': 19739904, 'steps': 102811, 'loss/train': 1.278700828552246} 11/07/2021 11:39:34 - INFO - __main__ - Step 102813: {'lr': 0.0001152735270134287, 'samples': 19740096, 'steps': 102812, 'loss/train': 1.255552887916565} 11/07/2021 11:39:34 - INFO - __main__ - Step 102814: {'lr': 0.00011526905682289373, 'samples': 19740288, 'steps': 102813, 'loss/train': 0.7558601498603821} 11/07/2021 11:39:35 - INFO - __main__ - Step 102815: {'lr': 0.0001152645866930665, 'samples': 19740480, 'steps': 102814, 'loss/train': 1.2085198163986206} 11/07/2021 11:39:35 - INFO - __main__ - Step 102816: {'lr': 0.00011526011662394895, 'samples': 19740672, 'steps': 102815, 'loss/train': 1.2924779653549194} 11/07/2021 11:39:36 - INFO - __main__ - Step 102817: {'lr': 0.0001152556466155432, 'samples': 19740864, 'steps': 102816, 'loss/train': 1.4342164993286133} 11/07/2021 11:39:36 - INFO - __main__ - Step 102818: {'lr': 0.00011525117666785107, 'samples': 19741056, 'steps': 102817, 'loss/train': 1.2825238704681396} 11/07/2021 11:39:37 - INFO - __main__ - Step 102819: {'lr': 0.00011524670678087467, 'samples': 19741248, 'steps': 102818, 'loss/train': 1.078122854232788} 11/07/2021 11:39:37 - INFO - __main__ - Step 102820: {'lr': 0.00011524223695461605, 'samples': 19741440, 'steps': 102819, 'loss/train': 0.9179550409317017} 11/07/2021 11:39:37 - INFO - __main__ - Step 102821: {'lr': 0.0001152377671890772, 'samples': 19741632, 'steps': 102820, 'loss/train': 1.4047572612762451} 11/07/2021 11:39:38 - INFO - __main__ - Step 102822: {'lr': 0.00011523329748426013, 'samples': 19741824, 'steps': 102821, 'loss/train': 1.1781136989593506} 11/07/2021 11:39:39 - INFO - __main__ - Step 102823: {'lr': 0.00011522882784016686, 'samples': 19742016, 'steps': 102822, 'loss/train': 1.3602421283721924} 11/07/2021 11:39:39 - INFO - __main__ - Step 102824: {'lr': 0.00011522435825679938, 'samples': 19742208, 'steps': 102823, 'loss/train': 1.1141449213027954} 11/07/2021 11:39:40 - INFO - __main__ - Step 102825: {'lr': 0.00011521988873415976, 'samples': 19742400, 'steps': 102824, 'loss/train': 1.6370902061462402} 11/07/2021 11:39:40 - INFO - __main__ - Step 102826: {'lr': 0.00011521541927224994, 'samples': 19742592, 'steps': 102825, 'loss/train': 1.2428052425384521} 11/07/2021 11:39:40 - INFO - __main__ - Step 102827: {'lr': 0.00011521094987107198, 'samples': 19742784, 'steps': 102826, 'loss/train': 1.161551833152771} 11/07/2021 11:39:42 - INFO - __main__ - Step 102828: {'lr': 0.00011520648053062791, 'samples': 19742976, 'steps': 102827, 'loss/train': 1.2016602754592896} 11/07/2021 11:39:42 - INFO - __main__ - Step 102829: {'lr': 0.0001152020112509197, 'samples': 19743168, 'steps': 102828, 'loss/train': 1.147251009941101} 11/07/2021 11:39:43 - INFO - __main__ - Step 102830: {'lr': 0.00011519754203194938, 'samples': 19743360, 'steps': 102829, 'loss/train': 1.428360104560852} 11/07/2021 11:39:43 - INFO - __main__ - Step 102831: {'lr': 0.00011519307287371908, 'samples': 19743552, 'steps': 102830, 'loss/train': 1.1366701126098633} 11/07/2021 11:39:43 - INFO - __main__ - Step 102832: {'lr': 0.00011518860377623059, 'samples': 19743744, 'steps': 102831, 'loss/train': 0.16930843889713287} 11/07/2021 11:39:44 - INFO - __main__ - Step 102833: {'lr': 0.00011518413473948605, 'samples': 19743936, 'steps': 102832, 'loss/train': 1.4170031547546387} 11/07/2021 11:39:45 - INFO - __main__ - Step 102834: {'lr': 0.00011517966576348746, 'samples': 19744128, 'steps': 102833, 'loss/train': 1.721602201461792} 11/07/2021 11:39:45 - INFO - __main__ - Step 102835: {'lr': 0.0001151751968482368, 'samples': 19744320, 'steps': 102834, 'loss/train': 1.078048825263977} 11/07/2021 11:39:45 - INFO - __main__ - Step 102836: {'lr': 0.00011517072799373615, 'samples': 19744512, 'steps': 102835, 'loss/train': 1.190798044204712} 11/07/2021 11:39:46 - INFO - __main__ - Step 102837: {'lr': 0.00011516625919998747, 'samples': 19744704, 'steps': 102836, 'loss/train': 1.4839286804199219} 11/07/2021 11:39:47 - INFO - __main__ - Step 102838: {'lr': 0.0001151617904669928, 'samples': 19744896, 'steps': 102837, 'loss/train': 1.2441399097442627} 11/07/2021 11:39:47 - INFO - __main__ - Step 102839: {'lr': 0.00011515732179475416, 'samples': 19745088, 'steps': 102838, 'loss/train': 2.0961873531341553} 11/07/2021 11:39:47 - INFO - __main__ - Step 102840: {'lr': 0.00011515285318327354, 'samples': 19745280, 'steps': 102839, 'loss/train': 1.4559988975524902} 11/07/2021 11:39:48 - INFO - __main__ - Step 102841: {'lr': 0.00011514838463255294, 'samples': 19745472, 'steps': 102840, 'loss/train': 1.327752709388733} 11/07/2021 11:39:48 - INFO - __main__ - Step 102842: {'lr': 0.00011514391614259442, 'samples': 19745664, 'steps': 102841, 'loss/train': 0.9877285361289978} 11/07/2021 11:39:49 - INFO - __main__ - Step 102843: {'lr': 0.00011513944771339999, 'samples': 19745856, 'steps': 102842, 'loss/train': 1.6606940031051636} 11/07/2021 11:39:50 - INFO - __main__ - Step 102844: {'lr': 0.0001151349793449717, 'samples': 19746048, 'steps': 102843, 'loss/train': 1.45623779296875} 11/07/2021 11:39:50 - INFO - __main__ - Step 102845: {'lr': 0.00011513051103731143, 'samples': 19746240, 'steps': 102844, 'loss/train': 1.7033312320709229} 11/07/2021 11:39:50 - INFO - __main__ - Step 102846: {'lr': 0.00011512604279042127, 'samples': 19746432, 'steps': 102845, 'loss/train': 1.4164270162582397} 11/07/2021 11:39:51 - INFO - __main__ - Step 102847: {'lr': 0.00011512157460430322, 'samples': 19746624, 'steps': 102846, 'loss/train': 1.4339759349822998} 11/07/2021 11:39:51 - INFO - __main__ - Step 102848: {'lr': 0.00011511710647895935, 'samples': 19746816, 'steps': 102847, 'loss/train': 1.5791285037994385} 11/07/2021 11:39:52 - INFO - __main__ - Step 102849: {'lr': 0.0001151126384143916, 'samples': 19747008, 'steps': 102848, 'loss/train': 1.070741057395935} 11/07/2021 11:39:52 - INFO - __main__ - Step 102850: {'lr': 0.00011510817041060201, 'samples': 19747200, 'steps': 102849, 'loss/train': 1.2470788955688477} 11/07/2021 11:39:53 - INFO - __main__ - Step 102851: {'lr': 0.00011510370246759258, 'samples': 19747392, 'steps': 102850, 'loss/train': 0.7805832028388977} 11/07/2021 11:39:53 - INFO - __main__ - Step 102852: {'lr': 0.00011509923458536536, 'samples': 19747584, 'steps': 102851, 'loss/train': 1.4098520278930664} 11/07/2021 11:39:53 - INFO - __main__ - Step 102853: {'lr': 0.00011509476676392235, 'samples': 19747776, 'steps': 102852, 'loss/train': 1.4140222072601318} 11/07/2021 11:39:55 - INFO - __main__ - Step 102854: {'lr': 0.00011509029900326556, 'samples': 19747968, 'steps': 102853, 'loss/train': 1.5576881170272827} 11/07/2021 11:39:55 - INFO - __main__ - Step 102855: {'lr': 0.00011508583130339701, 'samples': 19748160, 'steps': 102854, 'loss/train': 1.3304986953735352} 11/07/2021 11:39:55 - INFO - __main__ - Step 102856: {'lr': 0.00011508136366431868, 'samples': 19748352, 'steps': 102855, 'loss/train': 0.5054476857185364} 11/07/2021 11:39:56 - INFO - __main__ - Step 102857: {'lr': 0.0001150768960860327, 'samples': 19748544, 'steps': 102856, 'loss/train': 1.5969425439834595} 11/07/2021 11:39:56 - INFO - __main__ - Step 102858: {'lr': 0.00011507242856854088, 'samples': 19748736, 'steps': 102857, 'loss/train': 0.4675535559654236} 11/07/2021 11:39:57 - INFO - __main__ - Step 102859: {'lr': 0.00011506796111184537, 'samples': 19748928, 'steps': 102858, 'loss/train': 1.3232784271240234} 11/07/2021 11:39:57 - INFO - __main__ - Step 102860: {'lr': 0.00011506349371594815, 'samples': 19749120, 'steps': 102859, 'loss/train': 0.4568232595920563} 11/07/2021 11:39:58 - INFO - __main__ - Step 102861: {'lr': 0.00011505902638085122, 'samples': 19749312, 'steps': 102860, 'loss/train': 1.5592161417007446} 11/07/2021 11:39:58 - INFO - __main__ - Step 102862: {'lr': 0.00011505455910655663, 'samples': 19749504, 'steps': 102861, 'loss/train': 1.2203030586242676} 11/07/2021 11:39:58 - INFO - __main__ - Step 102863: {'lr': 0.00011505009189306636, 'samples': 19749696, 'steps': 102862, 'loss/train': 1.214787244796753} 11/07/2021 11:39:59 - INFO - __main__ - Step 102864: {'lr': 0.00011504562474038244, 'samples': 19749888, 'steps': 102863, 'loss/train': 0.70206218957901} 11/07/2021 11:40:00 - INFO - __main__ - Step 102865: {'lr': 0.00011504115764850689, 'samples': 19750080, 'steps': 102864, 'loss/train': 1.5564109086990356} 11/07/2021 11:40:00 - INFO - __main__ - Step 102866: {'lr': 0.00011503669061744171, 'samples': 19750272, 'steps': 102865, 'loss/train': 1.6168969869613647} 11/07/2021 11:40:00 - INFO - __main__ - Step 102867: {'lr': 0.00011503222364718891, 'samples': 19750464, 'steps': 102866, 'loss/train': 1.3453593254089355} 11/07/2021 11:40:01 - INFO - __main__ - Step 102868: {'lr': 0.00011502775673775049, 'samples': 19750656, 'steps': 102867, 'loss/train': 1.5448545217514038} 11/07/2021 11:40:02 - INFO - __main__ - Step 102869: {'lr': 0.0001150232898891285, 'samples': 19750848, 'steps': 102868, 'loss/train': 1.5264973640441895} 11/07/2021 11:40:02 - INFO - __main__ - Step 102870: {'lr': 0.00011501882310132492, 'samples': 19751040, 'steps': 102869, 'loss/train': 0.6621879935264587} 11/07/2021 11:40:03 - INFO - __main__ - Step 102871: {'lr': 0.00011501435637434188, 'samples': 19751232, 'steps': 102870, 'loss/train': 1.613366723060608} 11/07/2021 11:40:03 - INFO - __main__ - Step 102872: {'lr': 0.00011500988970818119, 'samples': 19751424, 'steps': 102871, 'loss/train': 1.4134525060653687} 11/07/2021 11:40:03 - INFO - __main__ - Step 102873: {'lr': 0.00011500542310284496, 'samples': 19751616, 'steps': 102872, 'loss/train': 2.247835159301758} 11/07/2021 11:40:04 - INFO - __main__ - Step 102874: {'lr': 0.0001150009565583352, 'samples': 19751808, 'steps': 102873, 'loss/train': 1.4928559064865112} 11/07/2021 11:40:05 - INFO - __main__ - Step 102875: {'lr': 0.00011499649007465394, 'samples': 19752000, 'steps': 102874, 'loss/train': 1.2874064445495605} 11/07/2021 11:40:05 - INFO - __main__ - Step 102876: {'lr': 0.00011499202365180317, 'samples': 19752192, 'steps': 102875, 'loss/train': 1.397176742553711} 11/07/2021 11:40:05 - INFO - __main__ - Step 102877: {'lr': 0.0001149875572897849, 'samples': 19752384, 'steps': 102876, 'loss/train': 1.143891453742981} 11/07/2021 11:40:06 - INFO - __main__ - Step 102878: {'lr': 0.00011498309098860115, 'samples': 19752576, 'steps': 102877, 'loss/train': 1.249078392982483} 11/07/2021 11:40:06 - INFO - __main__ - Step 102879: {'lr': 0.00011497862474825397, 'samples': 19752768, 'steps': 102878, 'loss/train': 1.2377533912658691} 11/07/2021 11:40:07 - INFO - __main__ - Step 102880: {'lr': 0.00011497415856874532, 'samples': 19752960, 'steps': 102879, 'loss/train': 1.5782933235168457} 11/07/2021 11:40:08 - INFO - __main__ - Step 102881: {'lr': 0.00011496969245007721, 'samples': 19753152, 'steps': 102880, 'loss/train': 1.3314234018325806} 11/07/2021 11:40:08 - INFO - __main__ - Step 102882: {'lr': 0.00011496522639225171, 'samples': 19753344, 'steps': 102881, 'loss/train': 1.611701250076294} 11/07/2021 11:40:08 - INFO - __main__ - Step 102883: {'lr': 0.00011496076039527075, 'samples': 19753536, 'steps': 102882, 'loss/train': 1.3434277772903442} 11/07/2021 11:40:09 - INFO - __main__ - Step 102884: {'lr': 0.00011495629445913652, 'samples': 19753728, 'steps': 102883, 'loss/train': 1.5226317644119263} 11/07/2021 11:40:10 - INFO - __main__ - Step 102885: {'lr': 0.00011495182858385081, 'samples': 19753920, 'steps': 102884, 'loss/train': 1.5069724321365356} 11/07/2021 11:40:10 - INFO - __main__ - Step 102886: {'lr': 0.0001149473627694157, 'samples': 19754112, 'steps': 102885, 'loss/train': 1.697304606437683} 11/07/2021 11:40:10 - INFO - __main__ - Step 102887: {'lr': 0.00011494289701583322, 'samples': 19754304, 'steps': 102886, 'loss/train': 1.0703359842300415} 11/07/2021 11:40:11 - INFO - __main__ - Step 102888: {'lr': 0.00011493843132310541, 'samples': 19754496, 'steps': 102887, 'loss/train': 1.0656208992004395} 11/07/2021 11:40:11 - INFO - __main__ - Step 102889: {'lr': 0.00011493396569123423, 'samples': 19754688, 'steps': 102888, 'loss/train': 1.0727683305740356} 11/07/2021 11:40:12 - INFO - __main__ - Step 102890: {'lr': 0.00011492950012022174, 'samples': 19754880, 'steps': 102889, 'loss/train': 1.4749441146850586} 11/07/2021 11:40:12 - INFO - __main__ - Step 102891: {'lr': 0.00011492503461006993, 'samples': 19755072, 'steps': 102890, 'loss/train': 1.306970477104187} 11/07/2021 11:40:13 - INFO - __main__ - Step 102892: {'lr': 0.0001149205691607808, 'samples': 19755264, 'steps': 102891, 'loss/train': 1.0648764371871948} 11/07/2021 11:40:13 - INFO - __main__ - Step 102893: {'lr': 0.0001149161037723564, 'samples': 19755456, 'steps': 102892, 'loss/train': 1.496872067451477} 11/07/2021 11:40:13 - INFO - __main__ - Step 102894: {'lr': 0.00011491163844479871, 'samples': 19755648, 'steps': 102893, 'loss/train': 1.614839792251587} 11/07/2021 11:40:15 - INFO - __main__ - Step 102895: {'lr': 0.00011490717317810975, 'samples': 19755840, 'steps': 102894, 'loss/train': 1.6052460670471191} 11/07/2021 11:40:15 - INFO - __main__ - Step 102896: {'lr': 0.00011490270797229154, 'samples': 19756032, 'steps': 102895, 'loss/train': 0.7233445048332214} 11/07/2021 11:40:15 - INFO - __main__ - Step 102897: {'lr': 0.00011489824282734609, 'samples': 19756224, 'steps': 102896, 'loss/train': 0.9956556558609009} 11/07/2021 11:40:16 - INFO - __main__ - Step 102898: {'lr': 0.00011489377774327548, 'samples': 19756416, 'steps': 102897, 'loss/train': 0.8636860251426697} 11/07/2021 11:40:16 - INFO - __main__ - Step 102899: {'lr': 0.00011488931272008158, 'samples': 19756608, 'steps': 102898, 'loss/train': 1.1190297603607178} 11/07/2021 11:40:18 - INFO - __main__ - Step 102900: {'lr': 0.00011488484775776645, 'samples': 19756800, 'steps': 102899, 'loss/train': 1.3876450061798096} 11/07/2021 11:40:18 - INFO - __main__ - Step 102901: {'lr': 0.00011488038285633213, 'samples': 19756992, 'steps': 102900, 'loss/train': 1.4074639081954956} 11/07/2021 11:40:18 - INFO - __main__ - Step 102902: {'lr': 0.00011487591801578062, 'samples': 19757184, 'steps': 102901, 'loss/train': 1.4429941177368164} 11/07/2021 11:40:19 - INFO - __main__ - Step 102903: {'lr': 0.00011487145323611396, 'samples': 19757376, 'steps': 102902, 'loss/train': 0.1340218335390091} 11/07/2021 11:40:19 - INFO - __main__ - Step 102904: {'lr': 0.0001148669885173341, 'samples': 19757568, 'steps': 102903, 'loss/train': 1.3531943559646606} 11/07/2021 11:40:19 - INFO - __main__ - Step 102905: {'lr': 0.0001148625238594431, 'samples': 19757760, 'steps': 102904, 'loss/train': 1.1915607452392578} 11/07/2021 11:40:21 - INFO - __main__ - Step 102906: {'lr': 0.00011485805926244297, 'samples': 19757952, 'steps': 102905, 'loss/train': 1.387157678604126} 11/07/2021 11:40:21 - INFO - __main__ - Step 102907: {'lr': 0.00011485359472633572, 'samples': 19758144, 'steps': 102906, 'loss/train': 1.406618595123291} 11/07/2021 11:40:21 - INFO - __main__ - Step 102908: {'lr': 0.00011484913025112333, 'samples': 19758336, 'steps': 102907, 'loss/train': 1.3058884143829346} 11/07/2021 11:40:22 - INFO - __main__ - Step 102909: {'lr': 0.00011484466583680786, 'samples': 19758528, 'steps': 102908, 'loss/train': 1.0970064401626587} 11/07/2021 11:40:22 - INFO - __main__ - Step 102910: {'lr': 0.00011484020148339131, 'samples': 19758720, 'steps': 102909, 'loss/train': 1.3857609033584595} 11/07/2021 11:40:23 - INFO - __main__ - Step 102911: {'lr': 0.00011483573719087573, 'samples': 19758912, 'steps': 102910, 'loss/train': 1.6140023469924927} 11/07/2021 11:40:23 - INFO - __main__ - Step 102912: {'lr': 0.00011483127295926302, 'samples': 19759104, 'steps': 102911, 'loss/train': 1.252984642982483} 11/07/2021 11:40:24 - INFO - __main__ - Step 102913: {'lr': 0.00011482680878855525, 'samples': 19759296, 'steps': 102912, 'loss/train': 0.9347763657569885} 11/07/2021 11:40:24 - INFO - __main__ - Step 102914: {'lr': 0.00011482234467875444, 'samples': 19759488, 'steps': 102913, 'loss/train': 1.4813573360443115} 11/07/2021 11:40:24 - INFO - __main__ - Step 102915: {'lr': 0.00011481788062986257, 'samples': 19759680, 'steps': 102914, 'loss/train': 1.6292141675949097} 11/07/2021 11:40:25 - INFO - __main__ - Step 102916: {'lr': 0.0001148134166418817, 'samples': 19759872, 'steps': 102915, 'loss/train': 1.1985403299331665} 11/07/2021 11:40:26 - INFO - __main__ - Step 102917: {'lr': 0.00011480895271481381, 'samples': 19760064, 'steps': 102916, 'loss/train': 1.3950409889221191} 11/07/2021 11:40:26 - INFO - __main__ - Step 102918: {'lr': 0.0001148044888486609, 'samples': 19760256, 'steps': 102917, 'loss/train': 1.3704650402069092} 11/07/2021 11:40:27 - INFO - __main__ - Step 102919: {'lr': 0.00011480002504342504, 'samples': 19760448, 'steps': 102918, 'loss/train': 1.548534631729126} 11/07/2021 11:40:27 - INFO - __main__ - Step 102920: {'lr': 0.00011479556129910817, 'samples': 19760640, 'steps': 102919, 'loss/train': 1.2995107173919678} 11/07/2021 11:40:28 - INFO - __main__ - Step 102921: {'lr': 0.00011479109761571235, 'samples': 19760832, 'steps': 102920, 'loss/train': 1.6707243919372559} 11/07/2021 11:40:28 - INFO - __main__ - Step 102922: {'lr': 0.00011478663399323958, 'samples': 19761024, 'steps': 102921, 'loss/train': 1.2375919818878174} 11/07/2021 11:40:29 - INFO - __main__ - Step 102923: {'lr': 0.00011478217043169195, 'samples': 19761216, 'steps': 102922, 'loss/train': 0.9629243016242981} 11/07/2021 11:40:29 - INFO - __main__ - Step 102924: {'lr': 0.00011477770693107129, 'samples': 19761408, 'steps': 102923, 'loss/train': 1.4517985582351685} 11/07/2021 11:40:29 - INFO - __main__ - Step 102925: {'lr': 0.00011477324349137971, 'samples': 19761600, 'steps': 102924, 'loss/train': 1.665656566619873} 11/07/2021 11:40:30 - INFO - __main__ - Step 102926: {'lr': 0.00011476878011261923, 'samples': 19761792, 'steps': 102925, 'loss/train': 0.7798741459846497} 11/07/2021 11:40:31 - INFO - __main__ - Step 102927: {'lr': 0.00011476431679479186, 'samples': 19761984, 'steps': 102926, 'loss/train': 1.8197576999664307} 11/07/2021 11:40:31 - INFO - __main__ - Step 102928: {'lr': 0.00011475985353789958, 'samples': 19762176, 'steps': 102927, 'loss/train': 1.279689908027649} 11/07/2021 11:40:32 - INFO - __main__ - Step 102929: {'lr': 0.00011475539034194443, 'samples': 19762368, 'steps': 102928, 'loss/train': 1.6283948421478271} 11/07/2021 11:40:32 - INFO - __main__ - Step 102930: {'lr': 0.0001147509272069284, 'samples': 19762560, 'steps': 102929, 'loss/train': 1.2042630910873413} 11/07/2021 11:40:32 - INFO - __main__ - Step 102931: {'lr': 0.00011474646413285353, 'samples': 19762752, 'steps': 102930, 'loss/train': 1.1959055662155151} 11/07/2021 11:40:33 - INFO - __main__ - Step 102932: {'lr': 0.00011474200111972182, 'samples': 19762944, 'steps': 102931, 'loss/train': 0.5200999975204468} 11/07/2021 11:40:34 - INFO - __main__ - Step 102933: {'lr': 0.00011473753816753527, 'samples': 19763136, 'steps': 102932, 'loss/train': 1.4544318914413452} 11/07/2021 11:40:34 - INFO - __main__ - Step 102934: {'lr': 0.00011473307527629601, 'samples': 19763328, 'steps': 102933, 'loss/train': 1.6489646434783936} 11/07/2021 11:40:34 - INFO - __main__ - Step 102935: {'lr': 0.00011472861244600582, 'samples': 19763520, 'steps': 102934, 'loss/train': 1.329077959060669} 11/07/2021 11:40:35 - INFO - __main__ - Step 102936: {'lr': 0.00011472414967666683, 'samples': 19763712, 'steps': 102935, 'loss/train': 1.4902366399765015} 11/07/2021 11:40:36 - INFO - __main__ - Step 102937: {'lr': 0.00011471968696828106, 'samples': 19763904, 'steps': 102936, 'loss/train': 1.1471621990203857} 11/07/2021 11:40:37 - INFO - __main__ - Step 102938: {'lr': 0.00011471522432085053, 'samples': 19764096, 'steps': 102937, 'loss/train': 2.2497432231903076} 11/07/2021 11:40:37 - INFO - __main__ - Step 102939: {'lr': 0.0001147107617343772, 'samples': 19764288, 'steps': 102938, 'loss/train': 1.8484097719192505} 11/07/2021 11:40:37 - INFO - __main__ - Step 102940: {'lr': 0.00011470629920886314, 'samples': 19764480, 'steps': 102939, 'loss/train': 1.508772373199463} 11/07/2021 11:40:38 - INFO - __main__ - Step 102941: {'lr': 0.00011470183674431031, 'samples': 19764672, 'steps': 102940, 'loss/train': 1.3798143863677979} 11/07/2021 11:40:39 - INFO - __main__ - Step 102942: {'lr': 0.00011469737434072075, 'samples': 19764864, 'steps': 102941, 'loss/train': 1.559367299079895} 11/07/2021 11:40:39 - INFO - __main__ - Step 102943: {'lr': 0.00011469291199809647, 'samples': 19765056, 'steps': 102942, 'loss/train': 1.127718210220337} 11/07/2021 11:40:40 - INFO - __main__ - Step 102944: {'lr': 0.00011468844971643949, 'samples': 19765248, 'steps': 102943, 'loss/train': 1.4024591445922852} 11/07/2021 11:40:40 - INFO - __main__ - Step 102945: {'lr': 0.00011468398749575188, 'samples': 19765440, 'steps': 102944, 'loss/train': 1.914021611213684} 11/07/2021 11:40:40 - INFO - __main__ - Step 102946: {'lr': 0.00011467952533603549, 'samples': 19765632, 'steps': 102945, 'loss/train': 0.506438672542572} 11/07/2021 11:40:41 - INFO - __main__ - Step 102947: {'lr': 0.00011467506323729243, 'samples': 19765824, 'steps': 102946, 'loss/train': 1.1829652786254883} 11/07/2021 11:40:42 - INFO - __main__ - Step 102948: {'lr': 0.00011467060119952469, 'samples': 19766016, 'steps': 102947, 'loss/train': 0.4816042482852936} 11/07/2021 11:40:42 - INFO - __main__ - Step 102949: {'lr': 0.00011466613922273428, 'samples': 19766208, 'steps': 102948, 'loss/train': 1.241376280784607} 11/07/2021 11:40:43 - INFO - __main__ - Step 102950: {'lr': 0.00011466167730692323, 'samples': 19766400, 'steps': 102949, 'loss/train': 1.2558972835540771} 11/07/2021 11:40:43 - INFO - __main__ - Step 102951: {'lr': 0.00011465721545209354, 'samples': 19766592, 'steps': 102950, 'loss/train': 1.074413537979126} 11/07/2021 11:40:43 - INFO - __main__ - Step 102952: {'lr': 0.0001146527536582472, 'samples': 19766784, 'steps': 102951, 'loss/train': 1.256377935409546} 11/07/2021 11:40:44 - INFO - __main__ - Step 102953: {'lr': 0.00011464829192538625, 'samples': 19766976, 'steps': 102952, 'loss/train': 0.9386645555496216} 11/07/2021 11:40:45 - INFO - __main__ - Step 102954: {'lr': 0.00011464383025351272, 'samples': 19767168, 'steps': 102953, 'loss/train': 1.108354091644287} 11/07/2021 11:40:45 - INFO - __main__ - Step 102955: {'lr': 0.00011463936864262856, 'samples': 19767360, 'steps': 102954, 'loss/train': 1.3759241104125977} 11/07/2021 11:40:45 - INFO - __main__ - Step 102956: {'lr': 0.00011463490709273591, 'samples': 19767552, 'steps': 102955, 'loss/train': 1.2876403331756592} 11/07/2021 11:40:46 - INFO - __main__ - Step 102957: {'lr': 0.00011463044560383659, 'samples': 19767744, 'steps': 102956, 'loss/train': 1.3518657684326172} 11/07/2021 11:40:46 - INFO - __main__ - Step 102958: {'lr': 0.0001146259841759327, 'samples': 19767936, 'steps': 102957, 'loss/train': 1.4282313585281372} 11/07/2021 11:40:47 - INFO - __main__ - Step 102959: {'lr': 0.00011462152280902627, 'samples': 19768128, 'steps': 102958, 'loss/train': 1.1798101663589478} 11/07/2021 11:40:47 - INFO - __main__ - Step 102960: {'lr': 0.00011461706150311927, 'samples': 19768320, 'steps': 102959, 'loss/train': 1.1111589670181274} 11/07/2021 11:40:48 - INFO - __main__ - Step 102961: {'lr': 0.00011461260025821373, 'samples': 19768512, 'steps': 102960, 'loss/train': 1.6773933172225952} 11/07/2021 11:40:48 - INFO - __main__ - Step 102962: {'lr': 0.00011460813907431169, 'samples': 19768704, 'steps': 102961, 'loss/train': 1.4627740383148193} 11/07/2021 11:40:49 - INFO - __main__ - Step 102963: {'lr': 0.00011460367795141513, 'samples': 19768896, 'steps': 102962, 'loss/train': 0.6764463186264038} 11/07/2021 11:40:50 - INFO - __main__ - Step 102964: {'lr': 0.00011459921688952604, 'samples': 19769088, 'steps': 102963, 'loss/train': 1.1458094120025635} 11/07/2021 11:40:50 - INFO - __main__ - Step 102965: {'lr': 0.00011459475588864646, 'samples': 19769280, 'steps': 102964, 'loss/train': 1.3289813995361328} 11/07/2021 11:40:51 - INFO - __main__ - Step 102966: {'lr': 0.0001145902949487784, 'samples': 19769472, 'steps': 102965, 'loss/train': 0.4042385518550873} 11/07/2021 11:40:51 - INFO - __main__ - Step 102967: {'lr': 0.00011458583406992396, 'samples': 19769664, 'steps': 102966, 'loss/train': 1.3231216669082642} 11/07/2021 11:40:51 - INFO - __main__ - Step 102968: {'lr': 0.00011458137325208495, 'samples': 19769856, 'steps': 102967, 'loss/train': 1.469338059425354} 11/07/2021 11:40:52 - INFO - __main__ - Step 102969: {'lr': 0.00011457691249526351, 'samples': 19770048, 'steps': 102968, 'loss/train': 1.6689780950546265} 11/07/2021 11:40:53 - INFO - __main__ - Step 102970: {'lr': 0.0001145724517994616, 'samples': 19770240, 'steps': 102969, 'loss/train': 1.2180029153823853} 11/07/2021 11:40:53 - INFO - __main__ - Step 102971: {'lr': 0.00011456799116468126, 'samples': 19770432, 'steps': 102970, 'loss/train': 1.1594914197921753} 11/07/2021 11:40:53 - INFO - __main__ - Step 102972: {'lr': 0.00011456353059092448, 'samples': 19770624, 'steps': 102971, 'loss/train': 1.539237380027771} 11/07/2021 11:40:54 - INFO - __main__ - Step 102973: {'lr': 0.0001145590700781933, 'samples': 19770816, 'steps': 102972, 'loss/train': 1.0608904361724854} 11/07/2021 11:40:55 - INFO - __main__ - Step 102974: {'lr': 0.0001145546096264897, 'samples': 19771008, 'steps': 102973, 'loss/train': 1.6255220174789429} 11/07/2021 11:40:55 - INFO - __main__ - Step 102975: {'lr': 0.00011455014923581571, 'samples': 19771200, 'steps': 102974, 'loss/train': 1.1929895877838135} 11/07/2021 11:40:56 - INFO - __main__ - Step 102976: {'lr': 0.00011454568890617334, 'samples': 19771392, 'steps': 102975, 'loss/train': 1.3824838399887085} 11/07/2021 11:40:56 - INFO - __main__ - Step 102977: {'lr': 0.00011454122863756458, 'samples': 19771584, 'steps': 102976, 'loss/train': 1.0516568422317505} 11/07/2021 11:40:56 - INFO - __main__ - Step 102978: {'lr': 0.00011453676842999156, 'samples': 19771776, 'steps': 102977, 'loss/train': 1.4717503786087036} 11/07/2021 11:40:57 - INFO - __main__ - Step 102979: {'lr': 0.00011453230828345606, 'samples': 19771968, 'steps': 102978, 'loss/train': 1.2499499320983887} 11/07/2021 11:40:58 - INFO - __main__ - Step 102980: {'lr': 0.00011452784819796026, 'samples': 19772160, 'steps': 102979, 'loss/train': 1.2573716640472412} 11/07/2021 11:40:58 - INFO - __main__ - Step 102981: {'lr': 0.00011452338817350608, 'samples': 19772352, 'steps': 102980, 'loss/train': 0.9679294228553772} 11/07/2021 11:40:58 - INFO - __main__ - Step 102982: {'lr': 0.00011451892821009557, 'samples': 19772544, 'steps': 102981, 'loss/train': 2.429189920425415} 11/07/2021 11:40:59 - INFO - __main__ - Step 102983: {'lr': 0.00011451446830773076, 'samples': 19772736, 'steps': 102982, 'loss/train': 1.3935623168945312} 11/07/2021 11:40:59 - INFO - __main__ - Step 102984: {'lr': 0.00011451000846641363, 'samples': 19772928, 'steps': 102983, 'loss/train': 1.433619737625122} 11/07/2021 11:41:00 - INFO - __main__ - Step 102985: {'lr': 0.00011450554868614622, 'samples': 19773120, 'steps': 102984, 'loss/train': 1.4184870719909668} 11/07/2021 11:41:00 - INFO - __main__ - Step 102986: {'lr': 0.00011450108896693048, 'samples': 19773312, 'steps': 102985, 'loss/train': 1.383633017539978} 11/07/2021 11:41:01 - INFO - __main__ - Step 102987: {'lr': 0.00011449662930876848, 'samples': 19773504, 'steps': 102986, 'loss/train': 0.895272970199585} 11/07/2021 11:41:01 - INFO - __main__ - Step 102988: {'lr': 0.0001144921697116622, 'samples': 19773696, 'steps': 102987, 'loss/train': 1.582015872001648} 11/07/2021 11:41:01 - INFO - __main__ - Step 102989: {'lr': 0.00011448771017561369, 'samples': 19773888, 'steps': 102988, 'loss/train': 1.2323715686798096} 11/07/2021 11:41:03 - INFO - __main__ - Step 102990: {'lr': 0.0001144832507006249, 'samples': 19774080, 'steps': 102989, 'loss/train': 1.3940925598144531} 11/07/2021 11:41:03 - INFO - __main__ - Step 102991: {'lr': 0.00011447879128669787, 'samples': 19774272, 'steps': 102990, 'loss/train': 1.4562972784042358} 11/07/2021 11:41:03 - INFO - __main__ - Step 102992: {'lr': 0.0001144743319338347, 'samples': 19774464, 'steps': 102991, 'loss/train': 1.3325176239013672} 11/07/2021 11:41:04 - INFO - __main__ - Step 102993: {'lr': 0.00011446987264203721, 'samples': 19774656, 'steps': 102992, 'loss/train': 1.328263759613037} 11/07/2021 11:41:04 - INFO - __main__ - Step 102994: {'lr': 0.00011446541341130748, 'samples': 19774848, 'steps': 102993, 'loss/train': 1.6301909685134888} 11/07/2021 11:41:05 - INFO - __main__ - Step 102995: {'lr': 0.00011446095424164757, 'samples': 19775040, 'steps': 102994, 'loss/train': 1.6363188028335571} 11/07/2021 11:41:05 - INFO - __main__ - Step 102996: {'lr': 0.00011445649513305945, 'samples': 19775232, 'steps': 102995, 'loss/train': 1.1943224668502808} 11/07/2021 11:41:06 - INFO - __main__ - Step 102997: {'lr': 0.00011445203608554516, 'samples': 19775424, 'steps': 102996, 'loss/train': 1.189819574356079} 11/07/2021 11:41:06 - INFO - __main__ - Step 102998: {'lr': 0.00011444757709910666, 'samples': 19775616, 'steps': 102997, 'loss/train': 0.9147663712501526} 11/07/2021 11:41:06 - INFO - __main__ - Step 102999: {'lr': 0.00011444311817374603, 'samples': 19775808, 'steps': 102998, 'loss/train': 1.331072449684143} 11/07/2021 11:41:08 - INFO - __main__ - Step 103000: {'lr': 0.00011443865930946521, 'samples': 19776000, 'steps': 102999, 'loss/train': 0.9491423964500427} 11/07/2021 11:41:08 - INFO - __main__ - Step 103001: {'lr': 0.00011443420050626624, 'samples': 19776192, 'steps': 103000, 'loss/train': 1.5342967510223389} 11/07/2021 11:41:08 - INFO - __main__ - Step 103002: {'lr': 0.00011442974176415113, 'samples': 19776384, 'steps': 103001, 'loss/train': 1.4296404123306274} 11/07/2021 11:41:09 - INFO - __main__ - Step 103003: {'lr': 0.00011442528308312192, 'samples': 19776576, 'steps': 103002, 'loss/train': 2.1652965545654297} 11/07/2021 11:41:09 - INFO - __main__ - Step 103004: {'lr': 0.00011442082446318055, 'samples': 19776768, 'steps': 103003, 'loss/train': 1.1496471166610718} 11/07/2021 11:41:10 - INFO - __main__ - Step 103005: {'lr': 0.00011441636590432916, 'samples': 19776960, 'steps': 103004, 'loss/train': 1.3705095052719116} 11/07/2021 11:41:10 - INFO - __main__ - Step 103006: {'lr': 0.00011441190740656956, 'samples': 19777152, 'steps': 103005, 'loss/train': 1.6542900800704956} 11/07/2021 11:41:11 - INFO - __main__ - Step 103007: {'lr': 0.00011440744896990387, 'samples': 19777344, 'steps': 103006, 'loss/train': 1.1673628091812134} 11/07/2021 11:41:11 - INFO - __main__ - Step 103008: {'lr': 0.00011440299059433412, 'samples': 19777536, 'steps': 103007, 'loss/train': 1.1267107725143433} 11/07/2021 11:41:11 - INFO - __main__ - Step 103009: {'lr': 0.00011439853227986227, 'samples': 19777728, 'steps': 103008, 'loss/train': 1.1731904745101929} 11/07/2021 11:41:12 - INFO - __main__ - Step 103010: {'lr': 0.00011439407402649036, 'samples': 19777920, 'steps': 103009, 'loss/train': 1.4327476024627686} 11/07/2021 11:41:13 - INFO - __main__ - Step 103011: {'lr': 0.00011438961583422036, 'samples': 19778112, 'steps': 103010, 'loss/train': 1.170328974723816} 11/07/2021 11:41:13 - INFO - __main__ - Step 103012: {'lr': 0.00011438515770305432, 'samples': 19778304, 'steps': 103011, 'loss/train': 0.7356514930725098} 11/07/2021 11:41:14 - INFO - __main__ - Step 103013: {'lr': 0.00011438069963299425, 'samples': 19778496, 'steps': 103012, 'loss/train': 2.142953872680664} 11/07/2021 11:41:14 - INFO - __main__ - Step 103014: {'lr': 0.00011437624162404212, 'samples': 19778688, 'steps': 103013, 'loss/train': 1.5675543546676636} 11/07/2021 11:41:14 - INFO - __main__ - Step 103015: {'lr': 0.00011437178367619996, 'samples': 19778880, 'steps': 103014, 'loss/train': 1.8247593641281128} 11/07/2021 11:41:15 - INFO - __main__ - Step 103016: {'lr': 0.00011436732578946982, 'samples': 19779072, 'steps': 103015, 'loss/train': 1.1115866899490356} 11/07/2021 11:41:16 - INFO - __main__ - Step 103017: {'lr': 0.00011436286796385362, 'samples': 19779264, 'steps': 103016, 'loss/train': 1.3424043655395508} 11/07/2021 11:41:16 - INFO - __main__ - Step 103018: {'lr': 0.00011435841019935345, 'samples': 19779456, 'steps': 103017, 'loss/train': 1.249899983406067} 11/07/2021 11:41:16 - INFO - __main__ - Step 103019: {'lr': 0.00011435395249597139, 'samples': 19779648, 'steps': 103018, 'loss/train': 1.2747935056686401} 11/07/2021 11:41:17 - INFO - __main__ - Step 103020: {'lr': 0.00011434949485370921, 'samples': 19779840, 'steps': 103019, 'loss/train': 1.713108777999878} 11/07/2021 11:41:18 - INFO - __main__ - Step 103021: {'lr': 0.0001143450372725691, 'samples': 19780032, 'steps': 103020, 'loss/train': 1.3611499071121216} 11/07/2021 11:41:18 - INFO - __main__ - Step 103022: {'lr': 0.00011434057975255299, 'samples': 19780224, 'steps': 103021, 'loss/train': 1.4215264320373535} 11/07/2021 11:41:18 - INFO - __main__ - Step 103023: {'lr': 0.00011433612229366295, 'samples': 19780416, 'steps': 103022, 'loss/train': 1.6677734851837158} 11/07/2021 11:41:19 - INFO - __main__ - Step 103024: {'lr': 0.00011433166489590094, 'samples': 19780608, 'steps': 103023, 'loss/train': 2.9146974086761475} 11/07/2021 11:41:19 - INFO - __main__ - Step 103025: {'lr': 0.00011432720755926898, 'samples': 19780800, 'steps': 103024, 'loss/train': 0.8969225287437439} 11/07/2021 11:41:20 - INFO - __main__ - Step 103026: {'lr': 0.00011432275028376912, 'samples': 19780992, 'steps': 103025, 'loss/train': 1.2096765041351318} 11/07/2021 11:41:21 - INFO - __main__ - Step 103027: {'lr': 0.00011431829306940331, 'samples': 19781184, 'steps': 103026, 'loss/train': 0.7962602376937866} 11/07/2021 11:41:21 - INFO - __main__ - Step 103028: {'lr': 0.00011431383591617359, 'samples': 19781376, 'steps': 103027, 'loss/train': 1.6826908588409424} 11/07/2021 11:41:21 - INFO - __main__ - Step 103029: {'lr': 0.00011430937882408196, 'samples': 19781568, 'steps': 103028, 'loss/train': 1.4292014837265015} 11/07/2021 11:41:22 - INFO - __main__ - Step 103030: {'lr': 0.00011430492179313043, 'samples': 19781760, 'steps': 103029, 'loss/train': 1.4295305013656616} 11/07/2021 11:41:22 - INFO - __main__ - Step 103031: {'lr': 0.00011430046482332101, 'samples': 19781952, 'steps': 103030, 'loss/train': 1.7772220373153687} 11/07/2021 11:41:23 - INFO - __main__ - Step 103032: {'lr': 0.0001142960079146558, 'samples': 19782144, 'steps': 103031, 'loss/train': 5.512005805969238} 11/07/2021 11:41:24 - INFO - __main__ - Step 103033: {'lr': 0.00011429155106713662, 'samples': 19782336, 'steps': 103032, 'loss/train': 0.9567467570304871} 11/07/2021 11:41:24 - INFO - __main__ - Step 103034: {'lr': 0.00011428709428076555, 'samples': 19782528, 'steps': 103033, 'loss/train': 1.3799035549163818} 11/07/2021 11:41:24 - INFO - __main__ - Step 103035: {'lr': 0.00011428263755554465, 'samples': 19782720, 'steps': 103034, 'loss/train': 1.1778219938278198} 11/07/2021 11:41:25 - INFO - __main__ - Step 103036: {'lr': 0.0001142781808914759, 'samples': 19782912, 'steps': 103035, 'loss/train': 1.3339861631393433} 11/07/2021 11:41:25 - INFO - __main__ - Step 103037: {'lr': 0.0001142737242885613, 'samples': 19783104, 'steps': 103036, 'loss/train': 1.2148970365524292} 11/07/2021 11:41:26 - INFO - __main__ - Step 103038: {'lr': 0.00011426926774680288, 'samples': 19783296, 'steps': 103037, 'loss/train': 1.4391112327575684} 11/07/2021 11:41:26 - INFO - __main__ - Step 103039: {'lr': 0.00011426481126620262, 'samples': 19783488, 'steps': 103038, 'loss/train': 1.2598249912261963} 11/07/2021 11:41:27 - INFO - __main__ - Step 103040: {'lr': 0.00011426035484676254, 'samples': 19783680, 'steps': 103039, 'loss/train': 0.983742892742157} 11/07/2021 11:41:27 - INFO - __main__ - Step 103041: {'lr': 0.00011425589848848463, 'samples': 19783872, 'steps': 103040, 'loss/train': 1.4021449089050293} 11/07/2021 11:41:27 - INFO - __main__ - Step 103042: {'lr': 0.00011425144219137096, 'samples': 19784064, 'steps': 103041, 'loss/train': 1.192582368850708} 11/07/2021 11:41:28 - INFO - __main__ - Step 103043: {'lr': 0.00011424698595542346, 'samples': 19784256, 'steps': 103042, 'loss/train': 1.3645325899124146} 11/07/2021 11:41:29 - INFO - __main__ - Step 103044: {'lr': 0.0001142425297806442, 'samples': 19784448, 'steps': 103043, 'loss/train': 1.639849066734314} 11/07/2021 11:41:29 - INFO - __main__ - Step 103045: {'lr': 0.00011423807366703515, 'samples': 19784640, 'steps': 103044, 'loss/train': 2.7623605728149414} 11/07/2021 11:41:29 - INFO - __main__ - Step 103046: {'lr': 0.00011423361761459841, 'samples': 19784832, 'steps': 103045, 'loss/train': 1.2895442247390747} 11/07/2021 11:41:30 - INFO - __main__ - Step 103047: {'lr': 0.00011422916162333583, 'samples': 19785024, 'steps': 103046, 'loss/train': 1.598486065864563} 11/07/2021 11:41:31 - INFO - __main__ - Step 103048: {'lr': 0.00011422470569324949, 'samples': 19785216, 'steps': 103047, 'loss/train': 1.640025019645691} 11/07/2021 11:41:31 - INFO - __main__ - Step 103049: {'lr': 0.0001142202498243414, 'samples': 19785408, 'steps': 103048, 'loss/train': 1.4445502758026123} 11/07/2021 11:41:32 - INFO - __main__ - Step 103050: {'lr': 0.00011421579401661356, 'samples': 19785600, 'steps': 103049, 'loss/train': 1.438758373260498} 11/07/2021 11:41:32 - INFO - __main__ - Step 103051: {'lr': 0.00011421133827006802, 'samples': 19785792, 'steps': 103050, 'loss/train': 0.3457473814487457} 11/07/2021 11:41:32 - INFO - __main__ - Step 103052: {'lr': 0.00011420688258470672, 'samples': 19785984, 'steps': 103051, 'loss/train': 1.2088816165924072} 11/07/2021 11:41:33 - INFO - __main__ - Step 103053: {'lr': 0.00011420242696053174, 'samples': 19786176, 'steps': 103052, 'loss/train': 1.280731439590454} 11/07/2021 11:41:34 - INFO - __main__ - Step 103054: {'lr': 0.00011419797139754501, 'samples': 19786368, 'steps': 103053, 'loss/train': 1.4485971927642822} 11/07/2021 11:41:34 - INFO - __main__ - Step 103055: {'lr': 0.00011419351589574862, 'samples': 19786560, 'steps': 103054, 'loss/train': 1.0931155681610107} 11/07/2021 11:41:34 - INFO - __main__ - Step 103056: {'lr': 0.00011418906045514449, 'samples': 19786752, 'steps': 103055, 'loss/train': 1.3021401166915894} 11/07/2021 11:41:35 - INFO - __main__ - Step 103057: {'lr': 0.00011418460507573469, 'samples': 19786944, 'steps': 103056, 'loss/train': 1.3429028987884521} 11/07/2021 11:41:36 - INFO - __main__ - Step 103058: {'lr': 0.00011418014975752122, 'samples': 19787136, 'steps': 103057, 'loss/train': 1.362148404121399} 11/07/2021 11:41:36 - INFO - __main__ - Step 103059: {'lr': 0.00011417569450050619, 'samples': 19787328, 'steps': 103058, 'loss/train': 1.4268276691436768} 11/07/2021 11:41:36 - INFO - __main__ - Step 103060: {'lr': 0.00011417123930469137, 'samples': 19787520, 'steps': 103059, 'loss/train': 1.7118514776229858} 11/07/2021 11:41:37 - INFO - __main__ - Step 103061: {'lr': 0.00011416678417007892, 'samples': 19787712, 'steps': 103060, 'loss/train': 1.1670992374420166} 11/07/2021 11:41:37 - INFO - __main__ - Step 103062: {'lr': 0.0001141623290966708, 'samples': 19787904, 'steps': 103061, 'loss/train': 1.0685694217681885} 11/07/2021 11:41:38 - INFO - __main__ - Step 103063: {'lr': 0.00011415787408446904, 'samples': 19788096, 'steps': 103062, 'loss/train': 1.6380784511566162} 11/07/2021 11:41:39 - INFO - __main__ - Step 103064: {'lr': 0.00011415341913347565, 'samples': 19788288, 'steps': 103063, 'loss/train': 1.3427015542984009} 11/07/2021 11:41:39 - INFO - __main__ - Step 103065: {'lr': 0.00011414896424369264, 'samples': 19788480, 'steps': 103064, 'loss/train': 1.7798317670822144} 11/07/2021 11:41:39 - INFO - __main__ - Step 103066: {'lr': 0.000114144509415122, 'samples': 19788672, 'steps': 103065, 'loss/train': 1.6536169052124023} 11/07/2021 11:41:40 - INFO - __main__ - Step 103067: {'lr': 0.00011414005464776578, 'samples': 19788864, 'steps': 103066, 'loss/train': 1.3110994100570679} 11/07/2021 11:41:40 - INFO - __main__ - Step 103068: {'lr': 0.00011413559994162592, 'samples': 19789056, 'steps': 103067, 'loss/train': 1.721137285232544} 11/07/2021 11:41:41 - INFO - __main__ - Step 103069: {'lr': 0.00011413114529670446, 'samples': 19789248, 'steps': 103068, 'loss/train': 1.0790681838989258} 11/07/2021 11:41:41 - INFO - __main__ - Step 103070: {'lr': 0.00011412669071300343, 'samples': 19789440, 'steps': 103069, 'loss/train': 1.1797236204147339} 11/07/2021 11:41:42 - INFO - __main__ - Step 103071: {'lr': 0.00011412223619052481, 'samples': 19789632, 'steps': 103070, 'loss/train': 1.3322229385375977} 11/07/2021 11:41:42 - INFO - __main__ - Step 103072: {'lr': 0.0001141177817292706, 'samples': 19789824, 'steps': 103071, 'loss/train': 1.1667377948760986} 11/07/2021 11:41:42 - INFO - __main__ - Step 103073: {'lr': 0.00011411332732924293, 'samples': 19790016, 'steps': 103072, 'loss/train': 1.2233341932296753} 11/07/2021 11:41:43 - INFO - __main__ - Step 103074: {'lr': 0.00011410887299044359, 'samples': 19790208, 'steps': 103073, 'loss/train': 1.453630805015564} 11/07/2021 11:41:44 - INFO - __main__ - Step 103075: {'lr': 0.00011410441871287472, 'samples': 19790400, 'steps': 103074, 'loss/train': 1.3002407550811768} 11/07/2021 11:41:44 - INFO - __main__ - Step 103076: {'lr': 0.00011409996449653828, 'samples': 19790592, 'steps': 103075, 'loss/train': 1.4282079935073853} 11/07/2021 11:41:44 - INFO - __main__ - Step 103077: {'lr': 0.0001140955103414363, 'samples': 19790784, 'steps': 103076, 'loss/train': 1.0945494174957275} 11/07/2021 11:41:45 - INFO - __main__ - Step 103078: {'lr': 0.0001140910562475708, 'samples': 19790976, 'steps': 103077, 'loss/train': 1.5372180938720703} 11/07/2021 11:41:46 - INFO - __main__ - Step 103079: {'lr': 0.00011408660221494377, 'samples': 19791168, 'steps': 103078, 'loss/train': 0.40724071860313416} 11/07/2021 11:41:47 - INFO - __main__ - Step 103080: {'lr': 0.0001140821482435572, 'samples': 19791360, 'steps': 103079, 'loss/train': 1.3460294008255005} 11/07/2021 11:41:47 - INFO - __main__ - Step 103081: {'lr': 0.00011407769433341314, 'samples': 19791552, 'steps': 103080, 'loss/train': 1.4426895380020142} 11/07/2021 11:41:47 - INFO - __main__ - Step 103082: {'lr': 0.00011407324048451356, 'samples': 19791744, 'steps': 103081, 'loss/train': 1.8963589668273926} 11/07/2021 11:41:48 - INFO - __main__ - Step 103083: {'lr': 0.00011406878669686047, 'samples': 19791936, 'steps': 103082, 'loss/train': 1.595716953277588} 11/07/2021 11:41:48 - INFO - __main__ - Step 103084: {'lr': 0.0001140643329704559, 'samples': 19792128, 'steps': 103083, 'loss/train': 1.1143461465835571} 11/07/2021 11:41:50 - INFO - __main__ - Step 103085: {'lr': 0.00011405987930530184, 'samples': 19792320, 'steps': 103084, 'loss/train': 0.8980448246002197} 11/07/2021 11:41:50 - INFO - __main__ - Step 103086: {'lr': 0.0001140554257014004, 'samples': 19792512, 'steps': 103085, 'loss/train': 1.6581122875213623} 11/07/2021 11:41:50 - INFO - __main__ - Step 103087: {'lr': 0.00011405097215875341, 'samples': 19792704, 'steps': 103086, 'loss/train': 0.7737617492675781} 11/07/2021 11:41:51 - INFO - __main__ - Step 103088: {'lr': 0.00011404651867736293, 'samples': 19792896, 'steps': 103087, 'loss/train': 1.4385591745376587} 11/07/2021 11:41:51 - INFO - __main__ - Step 103089: {'lr': 0.00011404206525723102, 'samples': 19793088, 'steps': 103088, 'loss/train': 0.18542082607746124} 11/07/2021 11:41:52 - INFO - __main__ - Step 103090: {'lr': 0.00011403761189835962, 'samples': 19793280, 'steps': 103089, 'loss/train': 1.3928333520889282} 11/07/2021 11:41:52 - INFO - __main__ - Step 103091: {'lr': 0.00011403315860075078, 'samples': 19793472, 'steps': 103090, 'loss/train': 1.366689682006836} 11/07/2021 11:41:53 - INFO - __main__ - Step 103092: {'lr': 0.00011402870536440652, 'samples': 19793664, 'steps': 103091, 'loss/train': 1.4505634307861328} 11/07/2021 11:41:53 - INFO - __main__ - Step 103093: {'lr': 0.00011402425218932883, 'samples': 19793856, 'steps': 103092, 'loss/train': 1.4185949563980103} 11/07/2021 11:41:53 - INFO - __main__ - Step 103094: {'lr': 0.00011401979907551968, 'samples': 19794048, 'steps': 103093, 'loss/train': 1.5503751039505005} 11/07/2021 11:41:54 - INFO - __main__ - Step 103095: {'lr': 0.00011401534602298114, 'samples': 19794240, 'steps': 103094, 'loss/train': 1.4453643560409546} 11/07/2021 11:41:55 - INFO - __main__ - Step 103096: {'lr': 0.00011401089303171516, 'samples': 19794432, 'steps': 103095, 'loss/train': 1.5160413980484009} 11/07/2021 11:41:55 - INFO - __main__ - Step 103097: {'lr': 0.00011400644010172381, 'samples': 19794624, 'steps': 103096, 'loss/train': 1.1245840787887573} 11/07/2021 11:41:55 - INFO - __main__ - Step 103098: {'lr': 0.00011400198723300903, 'samples': 19794816, 'steps': 103097, 'loss/train': 1.2342878580093384} 11/07/2021 11:41:56 - INFO - __main__ - Step 103099: {'lr': 0.00011399753442557298, 'samples': 19795008, 'steps': 103098, 'loss/train': 1.3277220726013184} 11/07/2021 11:41:57 - INFO - __main__ - Step 103100: {'lr': 0.00011399308167941741, 'samples': 19795200, 'steps': 103099, 'loss/train': 1.7389905452728271} 11/07/2021 11:41:57 - INFO - __main__ - Step 103101: {'lr': 0.00011398862899454449, 'samples': 19795392, 'steps': 103100, 'loss/train': 0.7351633310317993} 11/07/2021 11:41:58 - INFO - __main__ - Step 103102: {'lr': 0.00011398417637095618, 'samples': 19795584, 'steps': 103101, 'loss/train': 1.4166475534439087} 11/07/2021 11:41:58 - INFO - __main__ - Step 103103: {'lr': 0.0001139797238086545, 'samples': 19795776, 'steps': 103102, 'loss/train': 1.562617301940918} 11/07/2021 11:41:58 - INFO - __main__ - Step 103104: {'lr': 0.00011397527130764147, 'samples': 19795968, 'steps': 103103, 'loss/train': 1.1107184886932373} 11/07/2021 11:41:59 - INFO - __main__ - Step 103105: {'lr': 0.00011397081886791908, 'samples': 19796160, 'steps': 103104, 'loss/train': 1.2625455856323242} 11/07/2021 11:42:00 - INFO - __main__ - Step 103106: {'lr': 0.00011396636648948932, 'samples': 19796352, 'steps': 103105, 'loss/train': 1.3271129131317139} 11/07/2021 11:42:00 - INFO - __main__ - Step 103107: {'lr': 0.00011396191417235425, 'samples': 19796544, 'steps': 103106, 'loss/train': 0.7690003514289856} 11/07/2021 11:42:00 - INFO - __main__ - Step 103108: {'lr': 0.00011395746191651581, 'samples': 19796736, 'steps': 103107, 'loss/train': 0.8270094394683838} 11/07/2021 11:42:01 - INFO - __main__ - Step 103109: {'lr': 0.00011395300972197606, 'samples': 19796928, 'steps': 103108, 'loss/train': 1.1740403175354004} 11/07/2021 11:42:01 - INFO - __main__ - Step 103110: {'lr': 0.00011394855758873696, 'samples': 19797120, 'steps': 103109, 'loss/train': 1.3106062412261963} 11/07/2021 11:42:03 - INFO - __main__ - Step 103111: {'lr': 0.00011394410551680057, 'samples': 19797312, 'steps': 103110, 'loss/train': 1.2312707901000977} 11/07/2021 11:42:03 - INFO - __main__ - Step 103112: {'lr': 0.00011393965350616887, 'samples': 19797504, 'steps': 103111, 'loss/train': 1.1523469686508179} 11/07/2021 11:42:04 - INFO - __main__ - Step 103113: {'lr': 0.00011393520155684391, 'samples': 19797696, 'steps': 103112, 'loss/train': 1.334336757659912} 11/07/2021 11:42:04 - INFO - __main__ - Step 103114: {'lr': 0.0001139307496688276, 'samples': 19797888, 'steps': 103113, 'loss/train': 1.7814445495605469} 11/07/2021 11:42:04 - INFO - __main__ - Step 103115: {'lr': 0.00011392629784212197, 'samples': 19798080, 'steps': 103114, 'loss/train': 1.7563570737838745} 11/07/2021 11:42:05 - INFO - __main__ - Step 103116: {'lr': 0.00011392184607672906, 'samples': 19798272, 'steps': 103115, 'loss/train': 1.762119174003601} 11/07/2021 11:42:05 - INFO - __main__ - Step 103117: {'lr': 0.00011391739437265086, 'samples': 19798464, 'steps': 103116, 'loss/train': 1.438971996307373} 11/07/2021 11:42:06 - INFO - __main__ - Step 103118: {'lr': 0.0001139129427298894, 'samples': 19798656, 'steps': 103117, 'loss/train': 1.5460362434387207} 11/07/2021 11:42:07 - INFO - __main__ - Step 103119: {'lr': 0.00011390849114844664, 'samples': 19798848, 'steps': 103118, 'loss/train': 1.5523170232772827} 11/07/2021 11:42:07 - INFO - __main__ - Step 103120: {'lr': 0.00011390403962832466, 'samples': 19799040, 'steps': 103119, 'loss/train': 1.761840581893921} 11/07/2021 11:42:07 - INFO - __main__ - Step 103121: {'lr': 0.00011389958816952536, 'samples': 19799232, 'steps': 103120, 'loss/train': 0.6877008676528931} 11/07/2021 11:42:08 - INFO - __main__ - Step 103122: {'lr': 0.00011389513677205084, 'samples': 19799424, 'steps': 103121, 'loss/train': 1.521618127822876} 11/07/2021 11:42:09 - INFO - __main__ - Step 103123: {'lr': 0.00011389068543590309, 'samples': 19799616, 'steps': 103122, 'loss/train': 1.666577935218811} 11/07/2021 11:42:09 - INFO - __main__ - Step 103124: {'lr': 0.00011388623416108406, 'samples': 19799808, 'steps': 103123, 'loss/train': 1.3620541095733643} 11/07/2021 11:42:09 - INFO - __main__ - Step 103125: {'lr': 0.0001138817829475958, 'samples': 19800000, 'steps': 103124, 'loss/train': 1.8319367170333862} 11/07/2021 11:42:10 - INFO - __main__ - Step 103126: {'lr': 0.00011387733179544041, 'samples': 19800192, 'steps': 103125, 'loss/train': 1.4707441329956055} 11/07/2021 11:42:10 - INFO - __main__ - Step 103127: {'lr': 0.0001138728807046197, 'samples': 19800384, 'steps': 103126, 'loss/train': 1.588003158569336} 11/07/2021 11:42:11 - INFO - __main__ - Step 103128: {'lr': 0.00011386842967513578, 'samples': 19800576, 'steps': 103127, 'loss/train': 1.740525484085083} 11/07/2021 11:42:12 - INFO - __main__ - Step 103129: {'lr': 0.00011386397870699062, 'samples': 19800768, 'steps': 103128, 'loss/train': 1.3476063013076782} 11/07/2021 11:42:12 - INFO - __main__ - Step 103130: {'lr': 0.00011385952780018627, 'samples': 19800960, 'steps': 103129, 'loss/train': 1.662906527519226} 11/07/2021 11:42:12 - INFO - __main__ - Step 103131: {'lr': 0.00011385507695472468, 'samples': 19801152, 'steps': 103130, 'loss/train': 1.6209700107574463} 11/07/2021 11:42:13 - INFO - __main__ - Step 103132: {'lr': 0.00011385062617060793, 'samples': 19801344, 'steps': 103131, 'loss/train': 1.2594728469848633} 11/07/2021 11:42:14 - INFO - __main__ - Step 103133: {'lr': 0.00011384617544783799, 'samples': 19801536, 'steps': 103132, 'loss/train': 1.400557518005371} 11/07/2021 11:42:14 - INFO - __main__ - Step 103134: {'lr': 0.00011384172478641686, 'samples': 19801728, 'steps': 103133, 'loss/train': 1.3884779214859009} 11/07/2021 11:42:14 - INFO - __main__ - Step 103135: {'lr': 0.00011383727418634653, 'samples': 19801920, 'steps': 103134, 'loss/train': 1.2791752815246582} 11/07/2021 11:42:15 - INFO - __main__ - Step 103136: {'lr': 0.00011383282364762904, 'samples': 19802112, 'steps': 103135, 'loss/train': 1.2555259466171265} 11/07/2021 11:42:15 - INFO - __main__ - Step 103137: {'lr': 0.00011382837317026637, 'samples': 19802304, 'steps': 103136, 'loss/train': 1.240815281867981} 11/07/2021 11:42:16 - INFO - __main__ - Step 103138: {'lr': 0.00011382392275426052, 'samples': 19802496, 'steps': 103137, 'loss/train': 1.6657476425170898} 11/07/2021 11:42:16 - INFO - __main__ - Step 103139: {'lr': 0.00011381947239961352, 'samples': 19802688, 'steps': 103138, 'loss/train': 1.2332175970077515} 11/07/2021 11:42:17 - INFO - __main__ - Step 103140: {'lr': 0.00011381502210632746, 'samples': 19802880, 'steps': 103139, 'loss/train': 1.841446876525879} 11/07/2021 11:42:17 - INFO - __main__ - Step 103141: {'lr': 0.00011381057187440416, 'samples': 19803072, 'steps': 103140, 'loss/train': 1.1378140449523926} 11/07/2021 11:42:17 - INFO - __main__ - Step 103142: {'lr': 0.00011380612170384572, 'samples': 19803264, 'steps': 103141, 'loss/train': 1.3479254245758057} 11/07/2021 11:42:18 - INFO - __main__ - Step 103143: {'lr': 0.00011380167159465413, 'samples': 19803456, 'steps': 103142, 'loss/train': 1.7374801635742188} 11/07/2021 11:42:19 - INFO - __main__ - Step 103144: {'lr': 0.0001137972215468314, 'samples': 19803648, 'steps': 103143, 'loss/train': 0.7768900990486145} 11/07/2021 11:42:19 - INFO - __main__ - Step 103145: {'lr': 0.00011379277156037954, 'samples': 19803840, 'steps': 103144, 'loss/train': 1.2423423528671265} 11/07/2021 11:42:19 - INFO - __main__ - Step 103146: {'lr': 0.00011378832163530056, 'samples': 19804032, 'steps': 103145, 'loss/train': 1.343618631362915} 11/07/2021 11:42:20 - INFO - __main__ - Step 103147: {'lr': 0.00011378387177159646, 'samples': 19804224, 'steps': 103146, 'loss/train': 1.9558559656143188} 11/07/2021 11:42:20 - INFO - __main__ - Step 103148: {'lr': 0.00011377942196926924, 'samples': 19804416, 'steps': 103147, 'loss/train': 0.13936784863471985} 11/07/2021 11:42:21 - INFO - __main__ - Step 103149: {'lr': 0.00011377497222832092, 'samples': 19804608, 'steps': 103148, 'loss/train': 1.1489757299423218} 11/07/2021 11:42:22 - INFO - __main__ - Step 103150: {'lr': 0.0001137705225487535, 'samples': 19804800, 'steps': 103149, 'loss/train': 0.49721023440361023} 11/07/2021 11:42:22 - INFO - __main__ - Step 103151: {'lr': 0.00011376607293056898, 'samples': 19804992, 'steps': 103150, 'loss/train': 1.286930799484253} 11/07/2021 11:42:22 - INFO - __main__ - Step 103152: {'lr': 0.00011376162337376936, 'samples': 19805184, 'steps': 103151, 'loss/train': 1.4736636877059937} 11/07/2021 11:42:23 - INFO - __main__ - Step 103153: {'lr': 0.00011375717387835674, 'samples': 19805376, 'steps': 103152, 'loss/train': 1.6074599027633667} 11/07/2021 11:42:24 - INFO - __main__ - Step 103154: {'lr': 0.00011375272444433294, 'samples': 19805568, 'steps': 103153, 'loss/train': 1.2522364854812622} 11/07/2021 11:42:24 - INFO - __main__ - Step 103155: {'lr': 0.00011374827507170005, 'samples': 19805760, 'steps': 103154, 'loss/train': 1.3497314453125} 11/07/2021 11:42:24 - INFO - __main__ - Step 103156: {'lr': 0.0001137438257604601, 'samples': 19805952, 'steps': 103155, 'loss/train': 1.3182071447372437} 11/07/2021 11:42:25 - INFO - __main__ - Step 103157: {'lr': 0.00011373937651061509, 'samples': 19806144, 'steps': 103156, 'loss/train': 1.2039138078689575} 11/07/2021 11:42:25 - INFO - __main__ - Step 103158: {'lr': 0.000113734927322167, 'samples': 19806336, 'steps': 103157, 'loss/train': 1.4187883138656616} 11/07/2021 11:42:26 - INFO - __main__ - Step 103159: {'lr': 0.00011373047819511783, 'samples': 19806528, 'steps': 103158, 'loss/train': 1.5846365690231323} 11/07/2021 11:42:27 - INFO - __main__ - Step 103160: {'lr': 0.00011372602912946964, 'samples': 19806720, 'steps': 103159, 'loss/train': 1.899271011352539} 11/07/2021 11:42:27 - INFO - __main__ - Step 103161: {'lr': 0.00011372158012522438, 'samples': 19806912, 'steps': 103160, 'loss/train': 1.3187918663024902} 11/07/2021 11:42:27 - INFO - __main__ - Step 103162: {'lr': 0.00011371713118238408, 'samples': 19807104, 'steps': 103161, 'loss/train': 1.4875434637069702} 11/07/2021 11:42:28 - INFO - __main__ - Step 103163: {'lr': 0.00011371268230095075, 'samples': 19807296, 'steps': 103162, 'loss/train': 1.5725361108779907} 11/07/2021 11:42:29 - INFO - __main__ - Step 103164: {'lr': 0.00011370823348092635, 'samples': 19807488, 'steps': 103163, 'loss/train': 1.053952932357788} 11/07/2021 11:42:29 - INFO - __main__ - Step 103165: {'lr': 0.00011370378472231293, 'samples': 19807680, 'steps': 103164, 'loss/train': 1.4646031856536865} 11/07/2021 11:42:29 - INFO - __main__ - Step 103166: {'lr': 0.00011369933602511248, 'samples': 19807872, 'steps': 103165, 'loss/train': 1.6709203720092773} 11/07/2021 11:42:30 - INFO - __main__ - Step 103167: {'lr': 0.00011369488738932713, 'samples': 19808064, 'steps': 103166, 'loss/train': 1.5182298421859741} 11/07/2021 11:42:30 - INFO - __main__ - Step 103168: {'lr': 0.00011369043881495863, 'samples': 19808256, 'steps': 103167, 'loss/train': 1.565725326538086} 11/07/2021 11:42:31 - INFO - __main__ - Step 103169: {'lr': 0.00011368599030200913, 'samples': 19808448, 'steps': 103168, 'loss/train': 1.150775671005249} 11/07/2021 11:42:32 - INFO - __main__ - Step 103170: {'lr': 0.0001136815418504806, 'samples': 19808640, 'steps': 103169, 'loss/train': 1.8277660608291626} 11/07/2021 11:42:32 - INFO - __main__ - Step 103171: {'lr': 0.00011367709346037508, 'samples': 19808832, 'steps': 103170, 'loss/train': 1.2468018531799316} 11/07/2021 11:42:32 - INFO - __main__ - Step 103172: {'lr': 0.00011367264513169456, 'samples': 19809024, 'steps': 103171, 'loss/train': 1.1301496028900146} 11/07/2021 11:42:33 - INFO - __main__ - Step 103173: {'lr': 0.00011366819686444105, 'samples': 19809216, 'steps': 103172, 'loss/train': 1.2773770093917847} 11/07/2021 11:42:33 - INFO - __main__ - Step 103174: {'lr': 0.00011366374865861653, 'samples': 19809408, 'steps': 103173, 'loss/train': 1.2744719982147217} 11/07/2021 11:42:34 - INFO - __main__ - Step 103175: {'lr': 0.00011365930051422305, 'samples': 19809600, 'steps': 103174, 'loss/train': 1.162096381187439} 11/07/2021 11:42:34 - INFO - __main__ - Step 103176: {'lr': 0.00011365485243126256, 'samples': 19809792, 'steps': 103175, 'loss/train': 1.3380061388015747} 11/07/2021 11:42:35 - INFO - __main__ - Step 103177: {'lr': 0.00011365040440973709, 'samples': 19809984, 'steps': 103176, 'loss/train': 1.3624835014343262} 11/07/2021 11:42:35 - INFO - __main__ - Step 103178: {'lr': 0.00011364595644964865, 'samples': 19810176, 'steps': 103177, 'loss/train': 0.9043922424316406} 11/07/2021 11:42:35 - INFO - __main__ - Step 103179: {'lr': 0.00011364150855099922, 'samples': 19810368, 'steps': 103178, 'loss/train': 1.0339137315750122} 11/07/2021 11:42:37 - INFO - __main__ - Step 103180: {'lr': 0.00011363706071379092, 'samples': 19810560, 'steps': 103179, 'loss/train': 1.221262812614441} 11/07/2021 11:42:37 - INFO - __main__ - Step 103181: {'lr': 0.00011363261293802557, 'samples': 19810752, 'steps': 103180, 'loss/train': 1.1172735691070557} 11/07/2021 11:42:37 - INFO - __main__ - Step 103182: {'lr': 0.00011362816522370529, 'samples': 19810944, 'steps': 103181, 'loss/train': 1.630450963973999} 11/07/2021 11:42:38 - INFO - __main__ - Step 103183: {'lr': 0.00011362371757083201, 'samples': 19811136, 'steps': 103182, 'loss/train': 1.176442265510559} 11/07/2021 11:42:38 - INFO - __main__ - Step 103184: {'lr': 0.0001136192699794078, 'samples': 19811328, 'steps': 103183, 'loss/train': 1.5883612632751465} 11/07/2021 11:42:39 - INFO - __main__ - Step 103185: {'lr': 0.00011361482244943463, 'samples': 19811520, 'steps': 103184, 'loss/train': 1.666293740272522} 11/07/2021 11:42:39 - INFO - __main__ - Step 103186: {'lr': 0.00011361037498091453, 'samples': 19811712, 'steps': 103185, 'loss/train': 0.9714822173118591} 11/07/2021 11:42:40 - INFO - __main__ - Step 103187: {'lr': 0.0001136059275738495, 'samples': 19811904, 'steps': 103186, 'loss/train': 1.7077635526657104} 11/07/2021 11:42:40 - INFO - __main__ - Step 103188: {'lr': 0.00011360148022824152, 'samples': 19812096, 'steps': 103187, 'loss/train': 1.4369757175445557} 11/07/2021 11:42:40 - INFO - __main__ - Step 103189: {'lr': 0.0001135970329440926, 'samples': 19812288, 'steps': 103188, 'loss/train': 1.7830064296722412} 11/07/2021 11:42:41 - INFO - __main__ - Step 103190: {'lr': 0.00011359258572140477, 'samples': 19812480, 'steps': 103189, 'loss/train': 0.8533931374549866} 11/07/2021 11:42:42 - INFO - __main__ - Step 103191: {'lr': 0.00011358813856018, 'samples': 19812672, 'steps': 103190, 'loss/train': 1.783892035484314} 11/07/2021 11:42:42 - INFO - __main__ - Step 103192: {'lr': 0.00011358369146042042, 'samples': 19812864, 'steps': 103191, 'loss/train': 1.107818841934204} 11/07/2021 11:42:42 - INFO - __main__ - Step 103193: {'lr': 0.0001135792444221278, 'samples': 19813056, 'steps': 103192, 'loss/train': 1.3904643058776855} 11/07/2021 11:42:43 - INFO - __main__ - Step 103194: {'lr': 0.00011357479744530427, 'samples': 19813248, 'steps': 103193, 'loss/train': 1.3997527360916138} 11/07/2021 11:42:44 - INFO - __main__ - Step 103195: {'lr': 0.00011357035052995188, 'samples': 19813440, 'steps': 103194, 'loss/train': 1.3317821025848389} 11/07/2021 11:42:44 - INFO - __main__ - Step 103196: {'lr': 0.00011356590367607253, 'samples': 19813632, 'steps': 103195, 'loss/train': 1.6223698854446411} 11/07/2021 11:42:45 - INFO - __main__ - Step 103197: {'lr': 0.00011356145688366831, 'samples': 19813824, 'steps': 103196, 'loss/train': 1.5365108251571655} 11/07/2021 11:42:45 - INFO - __main__ - Step 103198: {'lr': 0.00011355701015274117, 'samples': 19814016, 'steps': 103197, 'loss/train': 1.6401373147964478} 11/07/2021 11:42:45 - INFO - __main__ - Step 103199: {'lr': 0.00011355256348329315, 'samples': 19814208, 'steps': 103198, 'loss/train': 0.7646520733833313} 11/07/2021 11:42:46 - INFO - __main__ - Step 103200: {'lr': 0.00011354811687532626, 'samples': 19814400, 'steps': 103199, 'loss/train': 1.282882809638977} 11/07/2021 11:42:47 - INFO - __main__ - Step 103201: {'lr': 0.00011354367032884244, 'samples': 19814592, 'steps': 103200, 'loss/train': 0.7418365478515625} 11/07/2021 11:42:47 - INFO - __main__ - Step 103202: {'lr': 0.00011353922384384377, 'samples': 19814784, 'steps': 103201, 'loss/train': 1.5332905054092407} 11/07/2021 11:42:47 - INFO - __main__ - Step 103203: {'lr': 0.00011353477742033231, 'samples': 19814976, 'steps': 103202, 'loss/train': 1.2523250579833984} 11/07/2021 11:42:48 - INFO - __main__ - Step 103204: {'lr': 0.00011353033105830987, 'samples': 19815168, 'steps': 103203, 'loss/train': 1.118735671043396} 11/07/2021 11:42:48 - INFO - __main__ - Step 103205: {'lr': 0.00011352588475777856, 'samples': 19815360, 'steps': 103204, 'loss/train': 0.7803447842597961} 11/07/2021 11:42:49 - INFO - __main__ - Step 103206: {'lr': 0.00011352143851874036, 'samples': 19815552, 'steps': 103205, 'loss/train': 1.2271672487258911} 11/07/2021 11:42:50 - INFO - __main__ - Step 103207: {'lr': 0.00011351699234119734, 'samples': 19815744, 'steps': 103206, 'loss/train': 1.4392682313919067} 11/07/2021 11:42:50 - INFO - __main__ - Step 103208: {'lr': 0.00011351254622515142, 'samples': 19815936, 'steps': 103207, 'loss/train': 0.9781469702720642} 11/07/2021 11:42:50 - INFO - __main__ - Step 103209: {'lr': 0.00011350810017060464, 'samples': 19816128, 'steps': 103208, 'loss/train': 0.469216525554657} 11/07/2021 11:42:51 - INFO - __main__ - Step 103210: {'lr': 0.00011350365417755901, 'samples': 19816320, 'steps': 103209, 'loss/train': 1.0092623233795166} 11/07/2021 11:42:52 - INFO - __main__ - Step 103211: {'lr': 0.00011349920824601653, 'samples': 19816512, 'steps': 103210, 'loss/train': 1.5442891120910645} 11/07/2021 11:42:52 - INFO - __main__ - Step 103212: {'lr': 0.00011349476237597922, 'samples': 19816704, 'steps': 103211, 'loss/train': 0.26341861486434937} 11/07/2021 11:42:53 - INFO - __main__ - Step 103213: {'lr': 0.00011349031656744904, 'samples': 19816896, 'steps': 103212, 'loss/train': 1.2888489961624146} 11/07/2021 11:42:53 - INFO - __main__ - Step 103214: {'lr': 0.00011348587082042811, 'samples': 19817088, 'steps': 103213, 'loss/train': 0.971868634223938} 11/07/2021 11:42:53 - INFO - __main__ - Step 103215: {'lr': 0.00011348142513491824, 'samples': 19817280, 'steps': 103214, 'loss/train': 1.391381859779358} 11/07/2021 11:42:54 - INFO - __main__ - Step 103216: {'lr': 0.00011347697951092156, 'samples': 19817472, 'steps': 103215, 'loss/train': 1.4750111103057861} 11/07/2021 11:42:55 - INFO - __main__ - Step 103217: {'lr': 0.00011347253394844004, 'samples': 19817664, 'steps': 103216, 'loss/train': 1.8499959707260132} 11/07/2021 11:42:55 - INFO - __main__ - Step 103218: {'lr': 0.00011346808844747567, 'samples': 19817856, 'steps': 103217, 'loss/train': 1.386900782585144} 11/07/2021 11:42:55 - INFO - __main__ - Step 103219: {'lr': 0.0001134636430080305, 'samples': 19818048, 'steps': 103218, 'loss/train': 0.833393931388855} 11/07/2021 11:42:56 - INFO - __main__ - Step 103220: {'lr': 0.00011345919763010648, 'samples': 19818240, 'steps': 103219, 'loss/train': 1.3414591550827026} 11/07/2021 11:42:57 - INFO - __main__ - Step 103221: {'lr': 0.00011345475231370564, 'samples': 19818432, 'steps': 103220, 'loss/train': 1.4656710624694824} 11/07/2021 11:42:57 - INFO - __main__ - Step 103222: {'lr': 0.00011345030705883, 'samples': 19818624, 'steps': 103221, 'loss/train': 1.6474837064743042} 11/07/2021 11:42:57 - INFO - __main__ - Step 103223: {'lr': 0.00011344586186548153, 'samples': 19818816, 'steps': 103222, 'loss/train': 0.8852766752243042} 11/07/2021 11:42:58 - INFO - __main__ - Step 103224: {'lr': 0.00011344141673366227, 'samples': 19819008, 'steps': 103223, 'loss/train': 1.7579281330108643} 11/07/2021 11:42:58 - INFO - __main__ - Step 103225: {'lr': 0.00011343697166337425, 'samples': 19819200, 'steps': 103224, 'loss/train': 1.4136427640914917} 11/07/2021 11:42:59 - INFO - __main__ - Step 103226: {'lr': 0.00011343252665461936, 'samples': 19819392, 'steps': 103225, 'loss/train': 0.4442676305770874} 11/07/2021 11:42:59 - INFO - __main__ - Step 103227: {'lr': 0.00011342808170739966, 'samples': 19819584, 'steps': 103226, 'loss/train': 1.1007394790649414} 11/07/2021 11:43:00 - INFO - __main__ - Step 103228: {'lr': 0.00011342363682171716, 'samples': 19819776, 'steps': 103227, 'loss/train': 1.4291328191757202} 11/07/2021 11:43:00 - INFO - __main__ - Step 103229: {'lr': 0.00011341919199757387, 'samples': 19819968, 'steps': 103228, 'loss/train': 1.5005377531051636} 11/07/2021 11:43:01 - INFO - __main__ - Step 103230: {'lr': 0.00011341474723497178, 'samples': 19820160, 'steps': 103229, 'loss/train': 1.1028790473937988} 11/07/2021 11:43:01 - INFO - __main__ - Step 103231: {'lr': 0.00011341030253391288, 'samples': 19820352, 'steps': 103230, 'loss/train': 0.829859733581543} 11/07/2021 11:43:02 - INFO - __main__ - Step 103232: {'lr': 0.0001134058578943992, 'samples': 19820544, 'steps': 103231, 'loss/train': 1.087868094444275} 11/07/2021 11:43:02 - INFO - __main__ - Step 103233: {'lr': 0.00011340141331643275, 'samples': 19820736, 'steps': 103232, 'loss/train': 1.6078702211380005} 11/07/2021 11:43:03 - INFO - __main__ - Step 103234: {'lr': 0.00011339696880001548, 'samples': 19820928, 'steps': 103233, 'loss/train': 1.4303631782531738} 11/07/2021 11:43:03 - INFO - __main__ - Step 103235: {'lr': 0.00011339252434514947, 'samples': 19821120, 'steps': 103234, 'loss/train': 0.7963549494743347} 11/07/2021 11:43:03 - INFO - __main__ - Step 103236: {'lr': 0.00011338807995183676, 'samples': 19821312, 'steps': 103235, 'loss/train': 1.31649911403656} 11/07/2021 11:43:04 - INFO - __main__ - Step 103237: {'lr': 0.00011338363562007916, 'samples': 19821504, 'steps': 103236, 'loss/train': 1.5558719635009766} 11/07/2021 11:43:05 - INFO - __main__ - Step 103238: {'lr': 0.00011337919134987881, 'samples': 19821696, 'steps': 103237, 'loss/train': 0.9628690481185913} 11/07/2021 11:43:05 - INFO - __main__ - Step 103239: {'lr': 0.00011337474714123766, 'samples': 19821888, 'steps': 103238, 'loss/train': 1.749988079071045} 11/07/2021 11:43:05 - INFO - __main__ - Step 103240: {'lr': 0.00011337030299415777, 'samples': 19822080, 'steps': 103239, 'loss/train': 1.488995909690857} 11/07/2021 11:43:06 - INFO - __main__ - Step 103241: {'lr': 0.00011336585890864109, 'samples': 19822272, 'steps': 103240, 'loss/train': 1.2773091793060303} 11/07/2021 11:43:07 - INFO - __main__ - Step 103242: {'lr': 0.00011336141488468967, 'samples': 19822464, 'steps': 103241, 'loss/train': 1.629799723625183} 11/07/2021 11:43:07 - INFO - __main__ - Step 103243: {'lr': 0.00011335697092230546, 'samples': 19822656, 'steps': 103242, 'loss/train': 0.567603588104248} 11/07/2021 11:43:08 - INFO - __main__ - Step 103244: {'lr': 0.00011335252702149052, 'samples': 19822848, 'steps': 103243, 'loss/train': 0.9592872858047485} 11/07/2021 11:43:08 - INFO - __main__ - Step 103245: {'lr': 0.00011334808318224679, 'samples': 19823040, 'steps': 103244, 'loss/train': 1.2911165952682495} 11/07/2021 11:43:08 - INFO - __main__ - Step 103246: {'lr': 0.00011334363940457634, 'samples': 19823232, 'steps': 103245, 'loss/train': 1.2485884428024292} 11/07/2021 11:43:09 - INFO - __main__ - Step 103247: {'lr': 0.00011333919568848123, 'samples': 19823424, 'steps': 103246, 'loss/train': 1.3350917100906372} 11/07/2021 11:43:10 - INFO - __main__ - Step 103248: {'lr': 0.00011333475203396323, 'samples': 19823616, 'steps': 103247, 'loss/train': 1.7756367921829224} 11/07/2021 11:43:10 - INFO - __main__ - Step 103249: {'lr': 0.00011333030844102451, 'samples': 19823808, 'steps': 103248, 'loss/train': 1.4510691165924072} 11/07/2021 11:43:11 - INFO - __main__ - Step 103250: {'lr': 0.00011332586490966707, 'samples': 19824000, 'steps': 103249, 'loss/train': 1.5348072052001953} 11/07/2021 11:43:11 - INFO - __main__ - Step 103251: {'lr': 0.00011332142143989285, 'samples': 19824192, 'steps': 103250, 'loss/train': 1.6042907238006592} 11/07/2021 11:43:11 - INFO - __main__ - Step 103252: {'lr': 0.00011331697803170391, 'samples': 19824384, 'steps': 103251, 'loss/train': 1.1206883192062378} 11/07/2021 11:43:12 - INFO - __main__ - Step 103253: {'lr': 0.00011331253468510223, 'samples': 19824576, 'steps': 103252, 'loss/train': 1.6910287141799927} 11/07/2021 11:43:13 - INFO - __main__ - Step 103254: {'lr': 0.0001133080914000898, 'samples': 19824768, 'steps': 103253, 'loss/train': 1.471782922744751} 11/07/2021 11:43:13 - INFO - __main__ - Step 103255: {'lr': 0.00011330364817666864, 'samples': 19824960, 'steps': 103254, 'loss/train': 1.3524940013885498} 11/07/2021 11:43:13 - INFO - __main__ - Step 103256: {'lr': 0.00011329920501484073, 'samples': 19825152, 'steps': 103255, 'loss/train': 1.218902587890625} 11/07/2021 11:43:14 - INFO - __main__ - Step 103257: {'lr': 0.00011329476191460811, 'samples': 19825344, 'steps': 103256, 'loss/train': 1.1074683666229248} 11/07/2021 11:43:15 - INFO - __main__ - Step 103258: {'lr': 0.00011329031887597274, 'samples': 19825536, 'steps': 103257, 'loss/train': 1.5404751300811768} 11/07/2021 11:43:15 - INFO - __main__ - Step 103259: {'lr': 0.00011328587589893666, 'samples': 19825728, 'steps': 103258, 'loss/train': 1.4907366037368774} 11/07/2021 11:43:15 - INFO - __main__ - Step 103260: {'lr': 0.00011328143298350185, 'samples': 19825920, 'steps': 103259, 'loss/train': 1.6717115640640259} 11/07/2021 11:43:16 - INFO - __main__ - Step 103261: {'lr': 0.00011327699012967041, 'samples': 19826112, 'steps': 103260, 'loss/train': 1.4702398777008057} 11/07/2021 11:43:16 - INFO - __main__ - Step 103262: {'lr': 0.00011327254733744416, 'samples': 19826304, 'steps': 103261, 'loss/train': 1.8702692985534668} 11/07/2021 11:43:17 - INFO - __main__ - Step 103263: {'lr': 0.00011326810460682518, 'samples': 19826496, 'steps': 103262, 'loss/train': 1.6358416080474854} 11/07/2021 11:43:17 - INFO - __main__ - Step 103264: {'lr': 0.0001132636619378155, 'samples': 19826688, 'steps': 103263, 'loss/train': 1.276210904121399} 11/07/2021 11:43:18 - INFO - __main__ - Step 103265: {'lr': 0.0001132592193304171, 'samples': 19826880, 'steps': 103264, 'loss/train': 1.1619106531143188} 11/07/2021 11:43:18 - INFO - __main__ - Step 103266: {'lr': 0.00011325477678463198, 'samples': 19827072, 'steps': 103265, 'loss/train': 1.5774726867675781} 11/07/2021 11:43:19 - INFO - __main__ - Step 103267: {'lr': 0.00011325033430046214, 'samples': 19827264, 'steps': 103266, 'loss/train': 1.3350142240524292} 11/07/2021 11:43:20 - INFO - __main__ - Step 103268: {'lr': 0.0001132458918779096, 'samples': 19827456, 'steps': 103267, 'loss/train': 1.7503809928894043} 11/07/2021 11:43:20 - INFO - __main__ - Step 103269: {'lr': 0.00011324144951697634, 'samples': 19827648, 'steps': 103268, 'loss/train': 1.2830983400344849} 11/07/2021 11:43:20 - INFO - __main__ - Step 103270: {'lr': 0.00011323700721766439, 'samples': 19827840, 'steps': 103269, 'loss/train': 0.9423120021820068} 11/07/2021 11:43:21 - INFO - __main__ - Step 103271: {'lr': 0.00011323256497997572, 'samples': 19828032, 'steps': 103270, 'loss/train': 1.6428449153900146} 11/07/2021 11:43:21 - INFO - __main__ - Step 103272: {'lr': 0.00011322812280391234, 'samples': 19828224, 'steps': 103271, 'loss/train': 1.1164186000823975} 11/07/2021 11:43:22 - INFO - __main__ - Step 103273: {'lr': 0.00011322368068947627, 'samples': 19828416, 'steps': 103272, 'loss/train': 1.348711371421814} 11/07/2021 11:43:23 - INFO - __main__ - Step 103274: {'lr': 0.0001132192386366696, 'samples': 19828608, 'steps': 103273, 'loss/train': 1.6492253541946411} 11/07/2021 11:43:23 - INFO - __main__ - Step 103275: {'lr': 0.00011321479664549414, 'samples': 19828800, 'steps': 103274, 'loss/train': 1.139201045036316} 11/07/2021 11:43:23 - INFO - __main__ - Step 103276: {'lr': 0.00011321035471595195, 'samples': 19828992, 'steps': 103275, 'loss/train': 1.1673964262008667} 11/07/2021 11:43:24 - INFO - __main__ - Step 103277: {'lr': 0.00011320591284804508, 'samples': 19829184, 'steps': 103276, 'loss/train': 1.2510912418365479} 11/07/2021 11:43:25 - INFO - __main__ - Step 103278: {'lr': 0.00011320147104177553, 'samples': 19829376, 'steps': 103277, 'loss/train': 1.4538319110870361} 11/07/2021 11:43:25 - INFO - __main__ - Step 103279: {'lr': 0.00011319702929714526, 'samples': 19829568, 'steps': 103278, 'loss/train': 1.1388602256774902} 11/07/2021 11:43:25 - INFO - __main__ - Step 103280: {'lr': 0.00011319258761415632, 'samples': 19829760, 'steps': 103279, 'loss/train': 2.6604859828948975} 11/07/2021 11:43:26 - INFO - __main__ - Step 103281: {'lr': 0.00011318814599281068, 'samples': 19829952, 'steps': 103280, 'loss/train': 1.4088786840438843} 11/07/2021 11:43:26 - INFO - __main__ - Step 103282: {'lr': 0.00011318370443311036, 'samples': 19830144, 'steps': 103281, 'loss/train': 1.563450574874878} 11/07/2021 11:43:26 - INFO - __main__ - Step 103283: {'lr': 0.00011317926293505732, 'samples': 19830336, 'steps': 103282, 'loss/train': 1.6695231199264526} 11/07/2021 11:43:27 - INFO - __main__ - Step 103284: {'lr': 0.00011317482149865363, 'samples': 19830528, 'steps': 103283, 'loss/train': 1.4727674722671509} 11/07/2021 11:43:28 - INFO - __main__ - Step 103285: {'lr': 0.00011317038012390124, 'samples': 19830720, 'steps': 103284, 'loss/train': 1.5867283344268799} 11/07/2021 11:43:28 - INFO - __main__ - Step 103286: {'lr': 0.00011316593881080215, 'samples': 19830912, 'steps': 103285, 'loss/train': 1.4674383401870728} 11/07/2021 11:43:29 - INFO - __main__ - Step 103287: {'lr': 0.00011316149755935839, 'samples': 19831104, 'steps': 103286, 'loss/train': 1.4584927558898926} 11/07/2021 11:43:29 - INFO - __main__ - Step 103288: {'lr': 0.00011315705636957204, 'samples': 19831296, 'steps': 103287, 'loss/train': 1.8205814361572266} 11/07/2021 11:43:30 - INFO - __main__ - Step 103289: {'lr': 0.00011315261524144491, 'samples': 19831488, 'steps': 103288, 'loss/train': 1.2451022863388062} 11/07/2021 11:43:31 - INFO - __main__ - Step 103290: {'lr': 0.00011314817417497911, 'samples': 19831680, 'steps': 103289, 'loss/train': 1.5885133743286133} 11/07/2021 11:43:31 - INFO - __main__ - Step 103291: {'lr': 0.00011314373317017663, 'samples': 19831872, 'steps': 103290, 'loss/train': 1.069899320602417} 11/07/2021 11:43:31 - INFO - __main__ - Step 103292: {'lr': 0.00011313929222703947, 'samples': 19832064, 'steps': 103291, 'loss/train': 1.4079958200454712} 11/07/2021 11:43:32 - INFO - __main__ - Step 103293: {'lr': 0.00011313485134556963, 'samples': 19832256, 'steps': 103292, 'loss/train': 1.334843635559082} 11/07/2021 11:43:33 - INFO - __main__ - Step 103294: {'lr': 0.00011313041052576911, 'samples': 19832448, 'steps': 103293, 'loss/train': 1.3147192001342773} 11/07/2021 11:43:33 - INFO - __main__ - Step 103295: {'lr': 0.00011312596976763991, 'samples': 19832640, 'steps': 103294, 'loss/train': 1.8449199199676514} 11/07/2021 11:43:34 - INFO - __main__ - Step 103296: {'lr': 0.00011312152907118406, 'samples': 19832832, 'steps': 103295, 'loss/train': 1.4760688543319702} 11/07/2021 11:43:34 - INFO - __main__ - Step 103297: {'lr': 0.00011311708843640353, 'samples': 19833024, 'steps': 103296, 'loss/train': 1.8855985403060913} 11/07/2021 11:43:34 - INFO - __main__ - Step 103298: {'lr': 0.00011311264786330033, 'samples': 19833216, 'steps': 103297, 'loss/train': 1.8137320280075073} 11/07/2021 11:43:35 - INFO - __main__ - Step 103299: {'lr': 0.00011310820735187643, 'samples': 19833408, 'steps': 103298, 'loss/train': 1.8230185508728027} 11/07/2021 11:43:36 - INFO - __main__ - Step 103300: {'lr': 0.0001131037669021339, 'samples': 19833600, 'steps': 103299, 'loss/train': 1.0206059217453003} 11/07/2021 11:43:36 - INFO - __main__ - Step 103301: {'lr': 0.00011309932651407475, 'samples': 19833792, 'steps': 103300, 'loss/train': 1.1714781522750854} 11/07/2021 11:43:37 - INFO - __main__ - Step 103302: {'lr': 0.00011309488618770086, 'samples': 19833984, 'steps': 103301, 'loss/train': 1.2806828022003174} 11/07/2021 11:43:37 - INFO - __main__ - Step 103303: {'lr': 0.00011309044592301432, 'samples': 19834176, 'steps': 103302, 'loss/train': 1.694885492324829} 11/07/2021 11:43:37 - INFO - __main__ - Step 103304: {'lr': 0.00011308600572001709, 'samples': 19834368, 'steps': 103303, 'loss/train': 1.0081464052200317} 11/07/2021 11:43:38 - INFO - __main__ - Step 103305: {'lr': 0.00011308156557871118, 'samples': 19834560, 'steps': 103304, 'loss/train': 1.5390923023223877} 11/07/2021 11:43:39 - INFO - __main__ - Step 103306: {'lr': 0.00011307712549909865, 'samples': 19834752, 'steps': 103305, 'loss/train': 0.9407469630241394} 11/07/2021 11:43:39 - INFO - __main__ - Step 103307: {'lr': 0.00011307268548118141, 'samples': 19834944, 'steps': 103306, 'loss/train': 1.1716443300247192} 11/07/2021 11:43:39 - INFO - __main__ - Step 103308: {'lr': 0.00011306824552496154, 'samples': 19835136, 'steps': 103307, 'loss/train': 1.5861679315567017} 11/07/2021 11:43:40 - INFO - __main__ - Step 103309: {'lr': 0.00011306380563044096, 'samples': 19835328, 'steps': 103308, 'loss/train': 1.325718641281128} 11/07/2021 11:43:40 - INFO - __main__ - Step 103310: {'lr': 0.00011305936579762174, 'samples': 19835520, 'steps': 103309, 'loss/train': 1.6108418703079224} 11/07/2021 11:43:41 - INFO - __main__ - Step 103311: {'lr': 0.00011305492602650589, 'samples': 19835712, 'steps': 103310, 'loss/train': 1.2184748649597168} 11/07/2021 11:43:41 - INFO - __main__ - Step 103312: {'lr': 0.00011305048631709533, 'samples': 19835904, 'steps': 103311, 'loss/train': 1.0811980962753296} 11/07/2021 11:43:42 - INFO - __main__ - Step 103313: {'lr': 0.00011304604666939213, 'samples': 19836096, 'steps': 103312, 'loss/train': 1.7696197032928467} 11/07/2021 11:43:42 - INFO - __main__ - Step 103314: {'lr': 0.00011304160708339825, 'samples': 19836288, 'steps': 103313, 'loss/train': 0.6115454435348511} 11/07/2021 11:43:42 - INFO - __main__ - Step 103315: {'lr': 0.00011303716755911583, 'samples': 19836480, 'steps': 103314, 'loss/train': 0.5964123010635376} 11/07/2021 11:43:43 - INFO - __main__ - Step 103316: {'lr': 0.00011303272809654663, 'samples': 19836672, 'steps': 103315, 'loss/train': 0.988227128982544} 11/07/2021 11:43:44 - INFO - __main__ - Step 103317: {'lr': 0.00011302828869569279, 'samples': 19836864, 'steps': 103316, 'loss/train': 1.604321002960205} 11/07/2021 11:43:44 - INFO - __main__ - Step 103318: {'lr': 0.00011302384935655627, 'samples': 19837056, 'steps': 103317, 'loss/train': 1.8787071704864502} 11/07/2021 11:43:45 - INFO - __main__ - Step 103319: {'lr': 0.0001130194100791391, 'samples': 19837248, 'steps': 103318, 'loss/train': 1.6259052753448486} 11/07/2021 11:43:45 - INFO - __main__ - Step 103320: {'lr': 0.00011301497086344325, 'samples': 19837440, 'steps': 103319, 'loss/train': 1.3016480207443237} 11/07/2021 11:43:46 - INFO - __main__ - Step 103321: {'lr': 0.00011301053170947078, 'samples': 19837632, 'steps': 103320, 'loss/train': 1.5656388998031616} 11/07/2021 11:43:46 - INFO - __main__ - Step 103322: {'lr': 0.00011300609261722363, 'samples': 19837824, 'steps': 103321, 'loss/train': 0.9842595458030701} 11/07/2021 11:43:47 - INFO - __main__ - Step 103323: {'lr': 0.00011300165358670381, 'samples': 19838016, 'steps': 103322, 'loss/train': 1.6134848594665527} 11/07/2021 11:43:47 - INFO - __main__ - Step 103324: {'lr': 0.00011299721461791334, 'samples': 19838208, 'steps': 103323, 'loss/train': 1.1973531246185303} 11/07/2021 11:43:47 - INFO - __main__ - Step 103325: {'lr': 0.00011299277571085423, 'samples': 19838400, 'steps': 103324, 'loss/train': 1.5129700899124146} 11/07/2021 11:43:48 - INFO - __main__ - Step 103326: {'lr': 0.00011298833686552843, 'samples': 19838592, 'steps': 103325, 'loss/train': 1.8539942502975464} 11/07/2021 11:43:49 - INFO - __main__ - Step 103327: {'lr': 0.00011298389808193798, 'samples': 19838784, 'steps': 103326, 'loss/train': 1.0086719989776611} 11/07/2021 11:43:49 - INFO - __main__ - Step 103328: {'lr': 0.00011297945936008497, 'samples': 19838976, 'steps': 103327, 'loss/train': 1.3934829235076904} 11/07/2021 11:43:49 - INFO - __main__ - Step 103329: {'lr': 0.0001129750206999712, 'samples': 19839168, 'steps': 103328, 'loss/train': 1.199813723564148} 11/07/2021 11:43:50 - INFO - __main__ - Step 103330: {'lr': 0.00011297058210159877, 'samples': 19839360, 'steps': 103329, 'loss/train': 1.669423222541809} 11/07/2021 11:43:50 - INFO - __main__ - Step 103331: {'lr': 0.0001129661435649697, 'samples': 19839552, 'steps': 103330, 'loss/train': 1.3847593069076538} 11/07/2021 11:43:51 - INFO - __main__ - Step 103332: {'lr': 0.00011296170509008596, 'samples': 19839744, 'steps': 103331, 'loss/train': 1.0988963842391968} 11/07/2021 11:43:52 - INFO - __main__ - Step 103333: {'lr': 0.00011295726667694955, 'samples': 19839936, 'steps': 103332, 'loss/train': 1.386522889137268} 11/07/2021 11:43:52 - INFO - __main__ - Step 103334: {'lr': 0.0001129528283255625, 'samples': 19840128, 'steps': 103333, 'loss/train': 1.1605536937713623} 11/07/2021 11:43:52 - INFO - __main__ - Step 103335: {'lr': 0.0001129483900359268, 'samples': 19840320, 'steps': 103334, 'loss/train': 1.3222286701202393} 11/07/2021 11:43:53 - INFO - __main__ - Step 103336: {'lr': 0.00011294395180804443, 'samples': 19840512, 'steps': 103335, 'loss/train': 1.4546657800674438} 11/07/2021 11:43:54 - INFO - __main__ - Step 103337: {'lr': 0.00011293951364191738, 'samples': 19840704, 'steps': 103336, 'loss/train': 1.6003376245498657} 11/07/2021 11:43:54 - INFO - __main__ - Step 103338: {'lr': 0.00011293507553754767, 'samples': 19840896, 'steps': 103337, 'loss/train': 0.7585793137550354} 11/07/2021 11:43:55 - INFO - __main__ - Step 103339: {'lr': 0.00011293063749493731, 'samples': 19841088, 'steps': 103338, 'loss/train': 1.9999620914459229} 11/07/2021 11:43:55 - INFO - __main__ - Step 103340: {'lr': 0.00011292619951408831, 'samples': 19841280, 'steps': 103339, 'loss/train': 1.4652659893035889} 11/07/2021 11:43:55 - INFO - __main__ - Step 103341: {'lr': 0.00011292176159500272, 'samples': 19841472, 'steps': 103340, 'loss/train': 0.09712259471416473} 11/07/2021 11:43:56 - INFO - __main__ - Step 103342: {'lr': 0.00011291732373768238, 'samples': 19841664, 'steps': 103341, 'loss/train': 1.0300999879837036} 11/07/2021 11:43:57 - INFO - __main__ - Step 103343: {'lr': 0.0001129128859421294, 'samples': 19841856, 'steps': 103342, 'loss/train': 1.255236029624939} 11/07/2021 11:43:57 - INFO - __main__ - Step 103344: {'lr': 0.00011290844820834572, 'samples': 19842048, 'steps': 103343, 'loss/train': 1.310532569885254} 11/07/2021 11:43:57 - INFO - __main__ - Step 103345: {'lr': 0.00011290401053633339, 'samples': 19842240, 'steps': 103344, 'loss/train': 1.2026396989822388} 11/07/2021 11:43:58 - INFO - __main__ - Step 103346: {'lr': 0.00011289957292609443, 'samples': 19842432, 'steps': 103345, 'loss/train': 0.8874821066856384} 11/07/2021 11:43:59 - INFO - __main__ - Step 103347: {'lr': 0.00011289513537763077, 'samples': 19842624, 'steps': 103346, 'loss/train': 1.3796244859695435} 11/07/2021 11:43:59 - INFO - __main__ - Step 103348: {'lr': 0.00011289069789094444, 'samples': 19842816, 'steps': 103347, 'loss/train': 0.5161005258560181} 11/07/2021 11:44:00 - INFO - __main__ - Step 103349: {'lr': 0.00011288626046603748, 'samples': 19843008, 'steps': 103348, 'loss/train': 1.5150254964828491} 11/07/2021 11:44:00 - INFO - __main__ - Step 103350: {'lr': 0.00011288182310291184, 'samples': 19843200, 'steps': 103349, 'loss/train': 1.4731711149215698} 11/07/2021 11:44:00 - INFO - __main__ - Step 103351: {'lr': 0.00011287738580156953, 'samples': 19843392, 'steps': 103350, 'loss/train': 1.3334417343139648} 11/07/2021 11:44:01 - INFO - __main__ - Step 103352: {'lr': 0.00011287294856201255, 'samples': 19843584, 'steps': 103351, 'loss/train': 1.1528481245040894} 11/07/2021 11:44:02 - INFO - __main__ - Step 103353: {'lr': 0.00011286851138424293, 'samples': 19843776, 'steps': 103352, 'loss/train': 1.2813174724578857} 11/07/2021 11:44:02 - INFO - __main__ - Step 103354: {'lr': 0.00011286407426826262, 'samples': 19843968, 'steps': 103353, 'loss/train': 0.8959428071975708} 11/07/2021 11:44:02 - INFO - __main__ - Step 103355: {'lr': 0.00011285963721407371, 'samples': 19844160, 'steps': 103354, 'loss/train': 1.0493067502975464} 11/07/2021 11:44:03 - INFO - __main__ - Step 103356: {'lr': 0.00011285520022167808, 'samples': 19844352, 'steps': 103355, 'loss/train': 1.3966008424758911} 11/07/2021 11:44:03 - INFO - __main__ - Step 103357: {'lr': 0.00011285076329107777, 'samples': 19844544, 'steps': 103356, 'loss/train': 1.1630282402038574} 11/07/2021 11:44:04 - INFO - __main__ - Step 103358: {'lr': 0.0001128463264222748, 'samples': 19844736, 'steps': 103357, 'loss/train': 0.7458139657974243} 11/07/2021 11:44:05 - INFO - __main__ - Step 103359: {'lr': 0.00011284188961527114, 'samples': 19844928, 'steps': 103358, 'loss/train': 1.2258485555648804} 11/07/2021 11:44:05 - INFO - __main__ - Step 103360: {'lr': 0.0001128374528700688, 'samples': 19845120, 'steps': 103359, 'loss/train': 1.2158430814743042} 11/07/2021 11:44:05 - INFO - __main__ - Step 103361: {'lr': 0.0001128330161866698, 'samples': 19845312, 'steps': 103360, 'loss/train': 1.3910471200942993} 11/07/2021 11:44:06 - INFO - __main__ - Step 103362: {'lr': 0.00011282857956507615, 'samples': 19845504, 'steps': 103361, 'loss/train': 1.5887959003448486} 11/07/2021 11:44:07 - INFO - __main__ - Step 103363: {'lr': 0.00011282414300528978, 'samples': 19845696, 'steps': 103362, 'loss/train': 1.0082666873931885} 11/07/2021 11:44:07 - INFO - __main__ - Step 103364: {'lr': 0.00011281970650731277, 'samples': 19845888, 'steps': 103363, 'loss/train': 1.4649986028671265} 11/07/2021 11:44:07 - INFO - __main__ - Step 103365: {'lr': 0.00011281527007114706, 'samples': 19846080, 'steps': 103364, 'loss/train': 1.2809163331985474} 11/07/2021 11:44:08 - INFO - __main__ - Step 103366: {'lr': 0.00011281083369679468, 'samples': 19846272, 'steps': 103365, 'loss/train': 1.193663239479065} 11/07/2021 11:44:08 - INFO - __main__ - Step 103367: {'lr': 0.00011280639738425762, 'samples': 19846464, 'steps': 103366, 'loss/train': 1.1811621189117432} 11/07/2021 11:44:09 - INFO - __main__ - Step 103368: {'lr': 0.000112801961133538, 'samples': 19846656, 'steps': 103367, 'loss/train': 1.2415803670883179} 11/07/2021 11:44:09 - INFO - __main__ - Step 103369: {'lr': 0.00011279752494463757, 'samples': 19846848, 'steps': 103368, 'loss/train': 1.3373680114746094} 11/07/2021 11:44:10 - INFO - __main__ - Step 103370: {'lr': 0.00011279308881755845, 'samples': 19847040, 'steps': 103369, 'loss/train': 1.158776879310608} 11/07/2021 11:44:10 - INFO - __main__ - Step 103371: {'lr': 0.00011278865275230268, 'samples': 19847232, 'steps': 103370, 'loss/train': 1.436722993850708} 11/07/2021 11:44:11 - INFO - __main__ - Step 103372: {'lr': 0.00011278421674887221, 'samples': 19847424, 'steps': 103371, 'loss/train': 1.488995909690857} 11/07/2021 11:44:12 - INFO - __main__ - Step 103373: {'lr': 0.00011277978080726906, 'samples': 19847616, 'steps': 103372, 'loss/train': 1.0597271919250488} 11/07/2021 11:44:12 - INFO - __main__ - Step 103374: {'lr': 0.00011277534492749522, 'samples': 19847808, 'steps': 103373, 'loss/train': 1.3466465473175049} 11/07/2021 11:44:12 - INFO - __main__ - Step 103375: {'lr': 0.0001127709091095527, 'samples': 19848000, 'steps': 103374, 'loss/train': 1.2967329025268555} 11/07/2021 11:44:13 - INFO - __main__ - Step 103376: {'lr': 0.00011276647335344348, 'samples': 19848192, 'steps': 103375, 'loss/train': 0.969178318977356} 11/07/2021 11:44:13 - INFO - __main__ - Step 103377: {'lr': 0.0001127620376591696, 'samples': 19848384, 'steps': 103376, 'loss/train': 1.071413516998291} 11/07/2021 11:44:14 - INFO - __main__ - Step 103378: {'lr': 0.000112757602026733, 'samples': 19848576, 'steps': 103377, 'loss/train': 1.0624316930770874} 11/07/2021 11:44:15 - INFO - __main__ - Step 103379: {'lr': 0.00011275316645613571, 'samples': 19848768, 'steps': 103378, 'loss/train': 1.4844117164611816} 11/07/2021 11:44:15 - INFO - __main__ - Step 103380: {'lr': 0.0001127487309473797, 'samples': 19848960, 'steps': 103379, 'loss/train': 1.1165425777435303} 11/07/2021 11:44:16 - INFO - __main__ - Step 103381: {'lr': 0.00011274429550046702, 'samples': 19849152, 'steps': 103380, 'loss/train': 2.295464515686035} 11/07/2021 11:44:16 - INFO - __main__ - Step 103382: {'lr': 0.00011273986011539974, 'samples': 19849344, 'steps': 103381, 'loss/train': 2.1608786582946777} 11/07/2021 11:44:16 - INFO - __main__ - Step 103383: {'lr': 0.00011273542479217966, 'samples': 19849536, 'steps': 103382, 'loss/train': 1.324515700340271} 11/07/2021 11:44:17 - INFO - __main__ - Step 103384: {'lr': 0.00011273098953080887, 'samples': 19849728, 'steps': 103383, 'loss/train': 0.6006392240524292} 11/07/2021 11:44:18 - INFO - __main__ - Step 103385: {'lr': 0.0001127265543312894, 'samples': 19849920, 'steps': 103384, 'loss/train': 1.6054588556289673} 11/07/2021 11:44:18 - INFO - __main__ - Step 103386: {'lr': 0.00011272211919362322, 'samples': 19850112, 'steps': 103385, 'loss/train': 1.218793272972107} 11/07/2021 11:44:18 - INFO - __main__ - Step 103387: {'lr': 0.00011271768411781232, 'samples': 19850304, 'steps': 103386, 'loss/train': 1.1986582279205322} 11/07/2021 11:44:19 - INFO - __main__ - Step 103388: {'lr': 0.00011271324910385875, 'samples': 19850496, 'steps': 103387, 'loss/train': 1.4874944686889648} 11/07/2021 11:44:19 - INFO - __main__ - Step 103389: {'lr': 0.00011270881415176443, 'samples': 19850688, 'steps': 103388, 'loss/train': 1.9048218727111816} 11/07/2021 11:44:20 - INFO - __main__ - Step 103390: {'lr': 0.0001127043792615314, 'samples': 19850880, 'steps': 103389, 'loss/train': 1.1672725677490234} 11/07/2021 11:44:20 - INFO - __main__ - Step 103391: {'lr': 0.0001126999444331617, 'samples': 19851072, 'steps': 103390, 'loss/train': 1.0395742654800415} 11/07/2021 11:44:21 - INFO - __main__ - Step 103392: {'lr': 0.00011269550966665726, 'samples': 19851264, 'steps': 103391, 'loss/train': 1.3753390312194824} 11/07/2021 11:44:21 - INFO - __main__ - Step 103393: {'lr': 0.00011269107496202008, 'samples': 19851456, 'steps': 103392, 'loss/train': 1.2638658285140991} 11/07/2021 11:44:22 - INFO - __main__ - Step 103394: {'lr': 0.00011268664031925221, 'samples': 19851648, 'steps': 103393, 'loss/train': 1.5103144645690918} 11/07/2021 11:44:23 - INFO - __main__ - Step 103395: {'lr': 0.00011268220573835572, 'samples': 19851840, 'steps': 103394, 'loss/train': 1.543209195137024} 11/07/2021 11:44:23 - INFO - __main__ - Step 103396: {'lr': 0.00011267777121933239, 'samples': 19852032, 'steps': 103395, 'loss/train': 1.2991914749145508} 11/07/2021 11:44:23 - INFO - __main__ - Step 103397: {'lr': 0.00011267333676218437, 'samples': 19852224, 'steps': 103396, 'loss/train': 1.0840914249420166} 11/07/2021 11:44:24 - INFO - __main__ - Step 103398: {'lr': 0.0001126689023669136, 'samples': 19852416, 'steps': 103397, 'loss/train': 0.7526399493217468} 11/07/2021 11:44:24 - INFO - __main__ - Step 103399: {'lr': 0.00011266446803352213, 'samples': 19852608, 'steps': 103398, 'loss/train': 1.5086727142333984} 11/07/2021 11:44:25 - INFO - __main__ - Step 103400: {'lr': 0.0001126600337620119, 'samples': 19852800, 'steps': 103399, 'loss/train': 1.151409387588501} 11/07/2021 11:44:25 - INFO - __main__ - Step 103401: {'lr': 0.00011265559955238496, 'samples': 19852992, 'steps': 103400, 'loss/train': 1.0893192291259766} 11/07/2021 11:44:26 - INFO - __main__ - Step 103402: {'lr': 0.00011265116540464329, 'samples': 19853184, 'steps': 103401, 'loss/train': 1.2209488153457642} 11/07/2021 11:44:26 - INFO - __main__ - Step 103403: {'lr': 0.00011264673131878886, 'samples': 19853376, 'steps': 103402, 'loss/train': 1.7515249252319336} 11/07/2021 11:44:26 - INFO - __main__ - Step 103404: {'lr': 0.00011264229729482372, 'samples': 19853568, 'steps': 103403, 'loss/train': 1.4328944683074951} 11/07/2021 11:44:27 - INFO - __main__ - Step 103405: {'lr': 0.00011263786333274984, 'samples': 19853760, 'steps': 103404, 'loss/train': 1.2373111248016357} 11/07/2021 11:44:28 - INFO - __main__ - Step 103406: {'lr': 0.0001126334294325692, 'samples': 19853952, 'steps': 103405, 'loss/train': 0.9724913239479065} 11/07/2021 11:44:28 - INFO - __main__ - Step 103407: {'lr': 0.00011262899559428383, 'samples': 19854144, 'steps': 103406, 'loss/train': 1.247044324874878} 11/07/2021 11:44:28 - INFO - __main__ - Step 103408: {'lr': 0.00011262456181789571, 'samples': 19854336, 'steps': 103407, 'loss/train': 1.091044306755066} 11/07/2021 11:44:29 - INFO - __main__ - Step 103409: {'lr': 0.00011262012810340694, 'samples': 19854528, 'steps': 103408, 'loss/train': 1.5378228425979614} 11/07/2021 11:44:30 - INFO - __main__ - Step 103410: {'lr': 0.00011261569445081932, 'samples': 19854720, 'steps': 103409, 'loss/train': 1.424614429473877} 11/07/2021 11:44:30 - INFO - __main__ - Step 103411: {'lr': 0.00011261126086013496, 'samples': 19854912, 'steps': 103410, 'loss/train': 1.2805107831954956} 11/07/2021 11:44:31 - INFO - __main__ - Step 103412: {'lr': 0.00011260682733135582, 'samples': 19855104, 'steps': 103411, 'loss/train': 1.3336529731750488} 11/07/2021 11:44:31 - INFO - __main__ - Step 103413: {'lr': 0.00011260239386448396, 'samples': 19855296, 'steps': 103412, 'loss/train': 1.101731777191162} 11/07/2021 11:44:31 - INFO - __main__ - Step 103414: {'lr': 0.00011259796045952134, 'samples': 19855488, 'steps': 103413, 'loss/train': 1.5745359659194946} 11/07/2021 11:44:32 - INFO - __main__ - Step 103415: {'lr': 0.00011259352711646992, 'samples': 19855680, 'steps': 103414, 'loss/train': 1.7002058029174805} 11/07/2021 11:44:33 - INFO - __main__ - Step 103416: {'lr': 0.00011258909383533177, 'samples': 19855872, 'steps': 103415, 'loss/train': 1.6850446462631226} 11/07/2021 11:44:33 - INFO - __main__ - Step 103417: {'lr': 0.00011258466061610883, 'samples': 19856064, 'steps': 103416, 'loss/train': 1.3189245462417603} 11/07/2021 11:44:33 - INFO - __main__ - Step 103418: {'lr': 0.00011258022745880315, 'samples': 19856256, 'steps': 103417, 'loss/train': 1.7069885730743408} 11/07/2021 11:44:34 - INFO - __main__ - Step 103419: {'lr': 0.00011257579436341666, 'samples': 19856448, 'steps': 103418, 'loss/train': 1.49540114402771} 11/07/2021 11:44:35 - INFO - __main__ - Step 103420: {'lr': 0.00011257136132995144, 'samples': 19856640, 'steps': 103419, 'loss/train': 1.0735423564910889} 11/07/2021 11:44:35 - INFO - __main__ - Step 103421: {'lr': 0.00011256692835840943, 'samples': 19856832, 'steps': 103420, 'loss/train': 1.304594874382019} 11/07/2021 11:44:36 - INFO - __main__ - Step 103422: {'lr': 0.00011256249544879271, 'samples': 19857024, 'steps': 103421, 'loss/train': 1.6006667613983154} 11/07/2021 11:44:36 - INFO - __main__ - Step 103423: {'lr': 0.00011255806260110315, 'samples': 19857216, 'steps': 103422, 'loss/train': 1.0200037956237793} 11/07/2021 11:44:36 - INFO - __main__ - Step 103424: {'lr': 0.00011255362981534279, 'samples': 19857408, 'steps': 103423, 'loss/train': 1.5868607759475708} 11/07/2021 11:44:37 - INFO - __main__ - Step 103425: {'lr': 0.00011254919709151365, 'samples': 19857600, 'steps': 103424, 'loss/train': 1.4941990375518799} 11/07/2021 11:44:38 - INFO - __main__ - Step 103426: {'lr': 0.0001125447644296177, 'samples': 19857792, 'steps': 103425, 'loss/train': 1.5732152462005615} 11/07/2021 11:44:38 - INFO - __main__ - Step 103427: {'lr': 0.00011254033182965698, 'samples': 19857984, 'steps': 103426, 'loss/train': 0.5451501607894897} 11/07/2021 11:44:39 - INFO - __main__ - Step 103428: {'lr': 0.00011253589929163346, 'samples': 19858176, 'steps': 103427, 'loss/train': 1.3761216402053833} 11/07/2021 11:44:39 - INFO - __main__ - Step 103429: {'lr': 0.00011253146681554913, 'samples': 19858368, 'steps': 103428, 'loss/train': 1.6675951480865479} 11/07/2021 11:44:39 - INFO - __main__ - Step 103430: {'lr': 0.00011252703440140602, 'samples': 19858560, 'steps': 103429, 'loss/train': 1.4239929914474487} 11/07/2021 11:44:40 - INFO - __main__ - Step 103431: {'lr': 0.00011252260204920608, 'samples': 19858752, 'steps': 103430, 'loss/train': 1.1931400299072266} 11/07/2021 11:44:41 - INFO - __main__ - Step 103432: {'lr': 0.00011251816975895137, 'samples': 19858944, 'steps': 103431, 'loss/train': 1.1050888299942017} 11/07/2021 11:44:41 - INFO - __main__ - Step 103433: {'lr': 0.00011251373753064384, 'samples': 19859136, 'steps': 103432, 'loss/train': 0.8931412696838379} 11/07/2021 11:44:42 - INFO - __main__ - Step 103434: {'lr': 0.00011250930536428547, 'samples': 19859328, 'steps': 103433, 'loss/train': 1.9671369791030884} 11/07/2021 11:44:42 - INFO - __main__ - Step 103435: {'lr': 0.00011250487325987831, 'samples': 19859520, 'steps': 103434, 'loss/train': 0.9443691372871399} 11/07/2021 11:44:42 - INFO - __main__ - Step 103436: {'lr': 0.00011250044121742442, 'samples': 19859712, 'steps': 103435, 'loss/train': 1.3947514295578003} 11/07/2021 11:44:44 - INFO - __main__ - Step 103437: {'lr': 0.00011249600923692562, 'samples': 19859904, 'steps': 103436, 'loss/train': 1.4509873390197754} 11/07/2021 11:44:44 - INFO - __main__ - Step 103438: {'lr': 0.000112491577318384, 'samples': 19860096, 'steps': 103437, 'loss/train': 1.3253345489501953} 11/07/2021 11:44:44 - INFO - __main__ - Step 103439: {'lr': 0.00011248714546180155, 'samples': 19860288, 'steps': 103438, 'loss/train': 0.8934639096260071} 11/07/2021 11:44:45 - INFO - __main__ - Step 103440: {'lr': 0.00011248271366718027, 'samples': 19860480, 'steps': 103439, 'loss/train': 1.5257482528686523} 11/07/2021 11:44:45 - INFO - __main__ - Step 103441: {'lr': 0.00011247828193452215, 'samples': 19860672, 'steps': 103440, 'loss/train': 1.5409198999404907} 11/07/2021 11:44:46 - INFO - __main__ - Step 103442: {'lr': 0.0001124738502638292, 'samples': 19860864, 'steps': 103441, 'loss/train': 1.4682472944259644} 11/07/2021 11:44:47 - INFO - __main__ - Step 103443: {'lr': 0.0001124694186551034, 'samples': 19861056, 'steps': 103442, 'loss/train': 1.4633604288101196} 11/07/2021 11:44:47 - INFO - __main__ - Step 103444: {'lr': 0.00011246498710834679, 'samples': 19861248, 'steps': 103443, 'loss/train': 3.2669858932495117} 11/07/2021 11:44:47 - INFO - __main__ - Step 103445: {'lr': 0.00011246055562356131, 'samples': 19861440, 'steps': 103444, 'loss/train': 1.3715300559997559} 11/07/2021 11:44:48 - INFO - __main__ - Step 103446: {'lr': 0.00011245612420074896, 'samples': 19861632, 'steps': 103445, 'loss/train': 0.11420916020870209} 11/07/2021 11:44:48 - INFO - __main__ - Step 103447: {'lr': 0.0001124516928399118, 'samples': 19861824, 'steps': 103446, 'loss/train': 1.4640450477600098} 11/07/2021 11:44:49 - INFO - __main__ - Step 103448: {'lr': 0.00011244726154105179, 'samples': 19862016, 'steps': 103447, 'loss/train': 1.4532325267791748} 11/07/2021 11:44:49 - INFO - __main__ - Step 103449: {'lr': 0.00011244283030417096, 'samples': 19862208, 'steps': 103448, 'loss/train': 1.2010058164596558} 11/07/2021 11:44:50 - INFO - __main__ - Step 103450: {'lr': 0.00011243839912927123, 'samples': 19862400, 'steps': 103449, 'loss/train': 2.0593228340148926} 11/07/2021 11:44:50 - INFO - __main__ - Step 103451: {'lr': 0.00011243396801635461, 'samples': 19862592, 'steps': 103450, 'loss/train': 1.4077694416046143} 11/07/2021 11:44:50 - INFO - __main__ - Step 103452: {'lr': 0.0001124295369654231, 'samples': 19862784, 'steps': 103451, 'loss/train': 0.9361749291419983} 11/07/2021 11:44:51 - INFO - __main__ - Step 103453: {'lr': 0.00011242510597647875, 'samples': 19862976, 'steps': 103452, 'loss/train': 1.5510607957839966} 11/07/2021 11:44:52 - INFO - __main__ - Step 103454: {'lr': 0.00011242067504952352, 'samples': 19863168, 'steps': 103453, 'loss/train': 1.5476042032241821} 11/07/2021 11:44:52 - INFO - __main__ - Step 103455: {'lr': 0.0001124162441845594, 'samples': 19863360, 'steps': 103454, 'loss/train': 1.5196843147277832} 11/07/2021 11:44:53 - INFO - __main__ - Step 103456: {'lr': 0.0001124118133815884, 'samples': 19863552, 'steps': 103455, 'loss/train': 1.6957018375396729} 11/07/2021 11:44:53 - INFO - __main__ - Step 103457: {'lr': 0.00011240738264061251, 'samples': 19863744, 'steps': 103456, 'loss/train': 1.2255724668502808} 11/07/2021 11:44:53 - INFO - __main__ - Step 103458: {'lr': 0.00011240295196163375, 'samples': 19863936, 'steps': 103457, 'loss/train': 1.3835526704788208} 11/07/2021 11:44:54 - INFO - __main__ - Step 103459: {'lr': 0.00011239852134465408, 'samples': 19864128, 'steps': 103458, 'loss/train': 1.014214038848877} 11/07/2021 11:44:55 - INFO - __main__ - Step 103460: {'lr': 0.00011239409078967552, 'samples': 19864320, 'steps': 103459, 'loss/train': 1.5096908807754517} 11/07/2021 11:44:55 - INFO - __main__ - Step 103461: {'lr': 0.00011238966029670014, 'samples': 19864512, 'steps': 103460, 'loss/train': 1.3006986379623413} 11/07/2021 11:44:56 - INFO - __main__ - Step 103462: {'lr': 0.00011238522986572977, 'samples': 19864704, 'steps': 103461, 'loss/train': 1.1597539186477661} 11/07/2021 11:44:56 - INFO - __main__ - Step 103463: {'lr': 0.0001123807994967665, 'samples': 19864896, 'steps': 103462, 'loss/train': 1.1696891784667969} 11/07/2021 11:44:57 - INFO - __main__ - Step 103464: {'lr': 0.00011237636918981232, 'samples': 19865088, 'steps': 103463, 'loss/train': 1.4106025695800781} 11/07/2021 11:44:57 - INFO - __main__ - Step 103465: {'lr': 0.00011237193894486919, 'samples': 19865280, 'steps': 103464, 'loss/train': 1.0798577070236206} 11/07/2021 11:44:58 - INFO - __main__ - Step 103466: {'lr': 0.00011236750876193918, 'samples': 19865472, 'steps': 103465, 'loss/train': 1.2990269660949707} 11/07/2021 11:44:58 - INFO - __main__ - Step 103467: {'lr': 0.00011236307864102424, 'samples': 19865664, 'steps': 103466, 'loss/train': 1.4244071245193481} 11/07/2021 11:44:58 - INFO - __main__ - Step 103468: {'lr': 0.00011235864858212636, 'samples': 19865856, 'steps': 103467, 'loss/train': 1.2469083070755005} 11/07/2021 11:44:59 - INFO - __main__ - Step 103469: {'lr': 0.00011235421858524755, 'samples': 19866048, 'steps': 103468, 'loss/train': 1.5442918539047241} 11/07/2021 11:45:00 - INFO - __main__ - Step 103470: {'lr': 0.0001123497886503898, 'samples': 19866240, 'steps': 103469, 'loss/train': 1.0200334787368774} 11/07/2021 11:45:00 - INFO - __main__ - Step 103471: {'lr': 0.00011234535877755515, 'samples': 19866432, 'steps': 103470, 'loss/train': 1.4422664642333984} 11/07/2021 11:45:00 - INFO - __main__ - Step 103472: {'lr': 0.00011234092896674561, 'samples': 19866624, 'steps': 103471, 'loss/train': 1.6661145687103271} 11/07/2021 11:45:01 - INFO - __main__ - Step 103473: {'lr': 0.00011233649921796305, 'samples': 19866816, 'steps': 103472, 'loss/train': 1.0618696212768555} 11/07/2021 11:45:02 - INFO - __main__ - Step 103474: {'lr': 0.0001123320695312095, 'samples': 19867008, 'steps': 103473, 'loss/train': 1.5295113325119019} 11/07/2021 11:45:02 - INFO - __main__ - Step 103475: {'lr': 0.00011232763990648704, 'samples': 19867200, 'steps': 103474, 'loss/train': 1.2921648025512695} 11/07/2021 11:45:02 - INFO - __main__ - Step 103476: {'lr': 0.00011232321034379761, 'samples': 19867392, 'steps': 103475, 'loss/train': 1.4357720613479614} 11/07/2021 11:45:03 - INFO - __main__ - Step 103477: {'lr': 0.0001123187808431432, 'samples': 19867584, 'steps': 103476, 'loss/train': 1.4317560195922852} 11/07/2021 11:45:03 - INFO - __main__ - Step 103478: {'lr': 0.00011231435140452583, 'samples': 19867776, 'steps': 103477, 'loss/train': 0.876252293586731} 11/07/2021 11:45:05 - INFO - __main__ - Step 103479: {'lr': 0.00011230992202794752, 'samples': 19867968, 'steps': 103478, 'loss/train': 1.4903762340545654} 11/07/2021 11:45:05 - INFO - __main__ - Step 103480: {'lr': 0.0001123054927134102, 'samples': 19868160, 'steps': 103479, 'loss/train': 0.8036170601844788} 11/07/2021 11:45:05 - INFO - __main__ - Step 103481: {'lr': 0.00011230106346091589, 'samples': 19868352, 'steps': 103480, 'loss/train': 0.6164864301681519} 11/07/2021 11:45:06 - INFO - __main__ - Step 103482: {'lr': 0.00011229663427046663, 'samples': 19868544, 'steps': 103481, 'loss/train': 1.2791571617126465} 11/07/2021 11:45:06 - INFO - __main__ - Step 103483: {'lr': 0.00011229220514206446, 'samples': 19868736, 'steps': 103482, 'loss/train': 1.1823080778121948} 11/07/2021 11:45:07 - INFO - __main__ - Step 103484: {'lr': 0.0001122877760757112, 'samples': 19868928, 'steps': 103483, 'loss/train': 1.3148841857910156} 11/07/2021 11:45:08 - INFO - __main__ - Step 103485: {'lr': 0.00011228334707140898, 'samples': 19869120, 'steps': 103484, 'loss/train': 1.3299099206924438} 11/07/2021 11:45:08 - INFO - __main__ - Step 103486: {'lr': 0.00011227891812915969, 'samples': 19869312, 'steps': 103485, 'loss/train': 1.6222728490829468} 11/07/2021 11:45:08 - INFO - __main__ - Step 103487: {'lr': 0.00011227448924896544, 'samples': 19869504, 'steps': 103486, 'loss/train': 1.6943429708480835} 11/07/2021 11:45:09 - INFO - __main__ - Step 103488: {'lr': 0.00011227006043082818, 'samples': 19869696, 'steps': 103487, 'loss/train': 1.5692262649536133} 11/07/2021 11:45:09 - INFO - __main__ - Step 103489: {'lr': 0.0001122656316747499, 'samples': 19869888, 'steps': 103488, 'loss/train': 1.2798787355422974} 11/07/2021 11:45:10 - INFO - __main__ - Step 103490: {'lr': 0.00011226120298073258, 'samples': 19870080, 'steps': 103489, 'loss/train': 1.2533113956451416} 11/07/2021 11:45:10 - INFO - __main__ - Step 103491: {'lr': 0.00011225677434877826, 'samples': 19870272, 'steps': 103490, 'loss/train': 1.8284850120544434} 11/07/2021 11:45:11 - INFO - __main__ - Step 103492: {'lr': 0.0001122523457788889, 'samples': 19870464, 'steps': 103491, 'loss/train': 1.4373706579208374} 11/07/2021 11:45:11 - INFO - __main__ - Step 103493: {'lr': 0.0001122479172710665, 'samples': 19870656, 'steps': 103492, 'loss/train': 1.3417445421218872} 11/07/2021 11:45:12 - INFO - __main__ - Step 103494: {'lr': 0.00011224348882531318, 'samples': 19870848, 'steps': 103493, 'loss/train': 1.4132466316223145} 11/07/2021 11:45:13 - INFO - __main__ - Step 103495: {'lr': 0.00011223906044163074, 'samples': 19871040, 'steps': 103494, 'loss/train': 1.529997706413269} 11/07/2021 11:45:13 - INFO - __main__ - Step 103496: {'lr': 0.00011223463212002121, 'samples': 19871232, 'steps': 103495, 'loss/train': 1.902406096458435} 11/07/2021 11:45:13 - INFO - __main__ - Step 103497: {'lr': 0.00011223020386048665, 'samples': 19871424, 'steps': 103496, 'loss/train': 1.2834758758544922} 11/07/2021 11:45:14 - INFO - __main__ - Step 103498: {'lr': 0.00011222577566302902, 'samples': 19871616, 'steps': 103497, 'loss/train': 1.428093433380127} 11/07/2021 11:45:14 - INFO - __main__ - Step 103499: {'lr': 0.00011222134752765034, 'samples': 19871808, 'steps': 103498, 'loss/train': 1.5312623977661133} 11/07/2021 11:45:15 - INFO - __main__ - Step 103500: {'lr': 0.00011221691945435262, 'samples': 19872000, 'steps': 103499, 'loss/train': 1.5042120218276978} 11/07/2021 11:45:16 - INFO - __main__ - Step 103501: {'lr': 0.00011221249144313777, 'samples': 19872192, 'steps': 103500, 'loss/train': 1.0710581541061401} 11/07/2021 11:45:16 - INFO - __main__ - Step 103502: {'lr': 0.00011220806349400788, 'samples': 19872384, 'steps': 103501, 'loss/train': 1.3487237691879272} 11/07/2021 11:45:16 - INFO - __main__ - Step 103503: {'lr': 0.00011220363560696492, 'samples': 19872576, 'steps': 103502, 'loss/train': 1.4483742713928223} 11/07/2021 11:45:17 - INFO - __main__ - Step 103504: {'lr': 0.00011219920778201088, 'samples': 19872768, 'steps': 103503, 'loss/train': 1.2219126224517822} 11/07/2021 11:45:17 - INFO - __main__ - Step 103505: {'lr': 0.00011219478001914781, 'samples': 19872960, 'steps': 103504, 'loss/train': 1.3108950853347778} 11/07/2021 11:45:18 - INFO - __main__ - Step 103506: {'lr': 0.00011219035231837759, 'samples': 19873152, 'steps': 103505, 'loss/train': 1.3110682964324951} 11/07/2021 11:45:19 - INFO - __main__ - Step 103507: {'lr': 0.00011218592467970226, 'samples': 19873344, 'steps': 103506, 'loss/train': 1.8492438793182373} 11/07/2021 11:45:19 - INFO - __main__ - Step 103508: {'lr': 0.0001121814971031238, 'samples': 19873536, 'steps': 103507, 'loss/train': 1.6967800855636597} 11/07/2021 11:45:19 - INFO - __main__ - Step 103509: {'lr': 0.00011217706958864426, 'samples': 19873728, 'steps': 103508, 'loss/train': 1.5282385349273682} 11/07/2021 11:45:20 - INFO - __main__ - Step 103510: {'lr': 0.0001121726421362656, 'samples': 19873920, 'steps': 103509, 'loss/train': 1.639284610748291} 11/07/2021 11:45:21 - INFO - __main__ - Step 103511: {'lr': 0.00011216821474598982, 'samples': 19874112, 'steps': 103510, 'loss/train': 1.7087503671646118} 11/07/2021 11:45:21 - INFO - __main__ - Step 103512: {'lr': 0.00011216378741781891, 'samples': 19874304, 'steps': 103511, 'loss/train': 1.5386751890182495} 11/07/2021 11:45:21 - INFO - __main__ - Step 103513: {'lr': 0.00011215936015175488, 'samples': 19874496, 'steps': 103512, 'loss/train': 1.5037175416946411} 11/07/2021 11:45:22 - INFO - __main__ - Step 103514: {'lr': 0.00011215493294779969, 'samples': 19874688, 'steps': 103513, 'loss/train': 1.2855743169784546} 11/07/2021 11:45:22 - INFO - __main__ - Step 103515: {'lr': 0.00011215050580595538, 'samples': 19874880, 'steps': 103514, 'loss/train': 1.0921547412872314} 11/07/2021 11:45:22 - INFO - __main__ - Step 103516: {'lr': 0.000112146078726224, 'samples': 19875072, 'steps': 103515, 'loss/train': 1.3297497034072876} 11/07/2021 11:45:23 - INFO - __main__ - Step 103517: {'lr': 0.0001121416517086074, 'samples': 19875264, 'steps': 103516, 'loss/train': 1.6807334423065186} 11/07/2021 11:45:24 - INFO - __main__ - Step 103518: {'lr': 0.00011213722475310765, 'samples': 19875456, 'steps': 103517, 'loss/train': 1.1686667203903198} 11/07/2021 11:45:24 - INFO - __main__ - Step 103519: {'lr': 0.00011213279785972672, 'samples': 19875648, 'steps': 103518, 'loss/train': 1.3814483880996704} 11/07/2021 11:45:25 - INFO - __main__ - Step 103520: {'lr': 0.00011212837102846663, 'samples': 19875840, 'steps': 103519, 'loss/train': 1.530495524406433} 11/07/2021 11:45:25 - INFO - __main__ - Step 103521: {'lr': 0.00011212394425932938, 'samples': 19876032, 'steps': 103520, 'loss/train': 1.1751341819763184} 11/07/2021 11:45:26 - INFO - __main__ - Step 103522: {'lr': 0.00011211951755231692, 'samples': 19876224, 'steps': 103521, 'loss/train': 1.5960978269577026} 11/07/2021 11:45:26 - INFO - __main__ - Step 103523: {'lr': 0.0001121150909074313, 'samples': 19876416, 'steps': 103522, 'loss/train': 1.4452177286148071} 11/07/2021 11:45:27 - INFO - __main__ - Step 103524: {'lr': 0.00011211066432467448, 'samples': 19876608, 'steps': 103523, 'loss/train': 0.5913840532302856} 11/07/2021 11:45:27 - INFO - __main__ - Step 103525: {'lr': 0.0001121062378040485, 'samples': 19876800, 'steps': 103524, 'loss/train': 1.3029851913452148} 11/07/2021 11:45:27 - INFO - __main__ - Step 103526: {'lr': 0.0001121018113455553, 'samples': 19876992, 'steps': 103525, 'loss/train': 1.4705100059509277} 11/07/2021 11:45:28 - INFO - __main__ - Step 103527: {'lr': 0.00011209738494919689, 'samples': 19877184, 'steps': 103526, 'loss/train': 1.2598137855529785} 11/07/2021 11:45:29 - INFO - __main__ - Step 103528: {'lr': 0.00011209295861497529, 'samples': 19877376, 'steps': 103527, 'loss/train': 1.4939032793045044} 11/07/2021 11:45:29 - INFO - __main__ - Step 103529: {'lr': 0.00011208853234289245, 'samples': 19877568, 'steps': 103528, 'loss/train': 1.2203469276428223} 11/07/2021 11:45:29 - INFO - __main__ - Step 103530: {'lr': 0.00011208410613295047, 'samples': 19877760, 'steps': 103529, 'loss/train': 1.4320732355117798} 11/07/2021 11:45:30 - INFO - __main__ - Step 103531: {'lr': 0.0001120796799851512, 'samples': 19877952, 'steps': 103530, 'loss/train': 1.158941626548767} 11/07/2021 11:45:31 - INFO - __main__ - Step 103532: {'lr': 0.00011207525389949671, 'samples': 19878144, 'steps': 103531, 'loss/train': 1.7673412561416626} 11/07/2021 11:45:31 - INFO - __main__ - Step 103533: {'lr': 0.00011207082787598896, 'samples': 19878336, 'steps': 103532, 'loss/train': 1.0762611627578735} 11/07/2021 11:45:32 - INFO - __main__ - Step 103534: {'lr': 0.00011206640191462996, 'samples': 19878528, 'steps': 103533, 'loss/train': 0.8787805438041687} 11/07/2021 11:45:32 - INFO - __main__ - Step 103535: {'lr': 0.00011206197601542173, 'samples': 19878720, 'steps': 103534, 'loss/train': 1.56797456741333} 11/07/2021 11:45:32 - INFO - __main__ - Step 103536: {'lr': 0.00011205755017836625, 'samples': 19878912, 'steps': 103535, 'loss/train': 1.6438498497009277} 11/07/2021 11:45:33 - INFO - __main__ - Step 103537: {'lr': 0.0001120531244034655, 'samples': 19879104, 'steps': 103536, 'loss/train': 0.519074022769928} 11/07/2021 11:45:34 - INFO - __main__ - Step 103538: {'lr': 0.00011204869869072146, 'samples': 19879296, 'steps': 103537, 'loss/train': 1.436423897743225} 11/07/2021 11:45:34 - INFO - __main__ - Step 103539: {'lr': 0.00011204427304013617, 'samples': 19879488, 'steps': 103538, 'loss/train': 1.2949882745742798} 11/07/2021 11:45:34 - INFO - __main__ - Step 103540: {'lr': 0.00011203984745171159, 'samples': 19879680, 'steps': 103539, 'loss/train': 1.2782175540924072} 11/07/2021 11:45:35 - INFO - __main__ - Step 103541: {'lr': 0.00011203542192544975, 'samples': 19879872, 'steps': 103540, 'loss/train': 1.2835838794708252} 11/07/2021 11:45:35 - INFO - __main__ - Step 103542: {'lr': 0.0001120309964613526, 'samples': 19880064, 'steps': 103541, 'loss/train': 1.3383831977844238} 11/07/2021 11:45:36 - INFO - __main__ - Step 103543: {'lr': 0.00011202657105942224, 'samples': 19880256, 'steps': 103542, 'loss/train': 1.0858490467071533} 11/07/2021 11:45:36 - INFO - __main__ - Step 103544: {'lr': 0.00011202214571966049, 'samples': 19880448, 'steps': 103543, 'loss/train': 1.071030855178833} 11/07/2021 11:45:37 - INFO - __main__ - Step 103545: {'lr': 0.00011201772044206945, 'samples': 19880640, 'steps': 103544, 'loss/train': 1.471193790435791} 11/07/2021 11:45:37 - INFO - __main__ - Step 103546: {'lr': 0.00011201329522665107, 'samples': 19880832, 'steps': 103545, 'loss/train': 1.308884859085083} 11/07/2021 11:45:37 - INFO - __main__ - Step 103547: {'lr': 0.00011200887007340741, 'samples': 19881024, 'steps': 103546, 'loss/train': 1.0795762538909912} 11/07/2021 11:45:39 - INFO - __main__ - Step 103548: {'lr': 0.00011200444498234038, 'samples': 19881216, 'steps': 103547, 'loss/train': 1.471771478652954} 11/07/2021 11:45:39 - INFO - __main__ - Step 103549: {'lr': 0.00011200001995345204, 'samples': 19881408, 'steps': 103548, 'loss/train': 1.6450897455215454} 11/07/2021 11:45:39 - INFO - __main__ - Step 103550: {'lr': 0.00011199559498674436, 'samples': 19881600, 'steps': 103549, 'loss/train': 1.6313869953155518} 11/07/2021 11:45:40 - INFO - __main__ - Step 103551: {'lr': 0.00011199117008221932, 'samples': 19881792, 'steps': 103550, 'loss/train': 1.3925292491912842} 11/07/2021 11:45:40 - INFO - __main__ - Step 103552: {'lr': 0.00011198674523987896, 'samples': 19881984, 'steps': 103551, 'loss/train': 1.6900883913040161} 11/07/2021 11:45:41 - INFO - __main__ - Step 103553: {'lr': 0.00011198232045972523, 'samples': 19882176, 'steps': 103552, 'loss/train': 1.4366722106933594} 11/07/2021 11:45:41 - INFO - __main__ - Step 103554: {'lr': 0.00011197789574176012, 'samples': 19882368, 'steps': 103553, 'loss/train': 0.07977203279733658} 11/07/2021 11:45:42 - INFO - __main__ - Step 103555: {'lr': 0.00011197347108598566, 'samples': 19882560, 'steps': 103554, 'loss/train': 1.3383969068527222} 11/07/2021 11:45:42 - INFO - __main__ - Step 103556: {'lr': 0.0001119690464924038, 'samples': 19882752, 'steps': 103555, 'loss/train': 0.379334419965744} 11/07/2021 11:45:42 - INFO - __main__ - Step 103557: {'lr': 0.00011196462196101667, 'samples': 19882944, 'steps': 103556, 'loss/train': 1.9220612049102783} 11/07/2021 11:45:43 - INFO - __main__ - Step 103558: {'lr': 0.00011196019749182607, 'samples': 19883136, 'steps': 103557, 'loss/train': 1.5613523721694946} 11/07/2021 11:45:44 - INFO - __main__ - Step 103559: {'lr': 0.00011195577308483405, 'samples': 19883328, 'steps': 103558, 'loss/train': 1.2881819009780884} 11/07/2021 11:45:44 - INFO - __main__ - Step 103560: {'lr': 0.00011195134874004265, 'samples': 19883520, 'steps': 103559, 'loss/train': 1.8111246824264526} 11/07/2021 11:45:45 - INFO - __main__ - Step 103561: {'lr': 0.0001119469244574538, 'samples': 19883712, 'steps': 103560, 'loss/train': 1.4503343105316162} 11/07/2021 11:45:45 - INFO - __main__ - Step 103562: {'lr': 0.00011194250023706959, 'samples': 19883904, 'steps': 103561, 'loss/train': 1.8518776893615723} 11/07/2021 11:45:45 - INFO - __main__ - Step 103563: {'lr': 0.00011193807607889192, 'samples': 19884096, 'steps': 103562, 'loss/train': 1.4688103199005127} 11/07/2021 11:45:46 - INFO - __main__ - Step 103564: {'lr': 0.00011193365198292285, 'samples': 19884288, 'steps': 103563, 'loss/train': 5.710091590881348} 11/07/2021 11:45:47 - INFO - __main__ - Step 103565: {'lr': 0.00011192922794916432, 'samples': 19884480, 'steps': 103564, 'loss/train': 1.2298632860183716} 11/07/2021 11:45:47 - INFO - __main__ - Step 103566: {'lr': 0.00011192480397761836, 'samples': 19884672, 'steps': 103565, 'loss/train': 1.511082649230957} 11/07/2021 11:45:47 - INFO - __main__ - Step 103567: {'lr': 0.00011192038006828698, 'samples': 19884864, 'steps': 103566, 'loss/train': 1.3130242824554443} 11/07/2021 11:45:48 - INFO - __main__ - Step 103568: {'lr': 0.0001119159562211721, 'samples': 19885056, 'steps': 103567, 'loss/train': 1.412919521331787} 11/07/2021 11:45:49 - INFO - __main__ - Step 103569: {'lr': 0.00011191153243627578, 'samples': 19885248, 'steps': 103568, 'loss/train': 0.8157038688659668} 11/07/2021 11:45:49 - INFO - __main__ - Step 103570: {'lr': 0.0001119071087136001, 'samples': 19885440, 'steps': 103569, 'loss/train': 1.494933843612671} 11/07/2021 11:45:50 - INFO - __main__ - Step 103571: {'lr': 0.00011190268505314682, 'samples': 19885632, 'steps': 103570, 'loss/train': 1.227675437927246} 11/07/2021 11:45:50 - INFO - __main__ - Step 103572: {'lr': 0.0001118982614549181, 'samples': 19885824, 'steps': 103571, 'loss/train': 1.4703822135925293} 11/07/2021 11:45:50 - INFO - __main__ - Step 103573: {'lr': 0.00011189383791891586, 'samples': 19886016, 'steps': 103572, 'loss/train': 0.5340999960899353} 11/07/2021 11:45:51 - INFO - __main__ - Step 103574: {'lr': 0.00011188941444514214, 'samples': 19886208, 'steps': 103573, 'loss/train': 0.9948744177818298} 11/07/2021 11:45:52 - INFO - __main__ - Step 103575: {'lr': 0.00011188499103359892, 'samples': 19886400, 'steps': 103574, 'loss/train': 2.0192315578460693} 11/07/2021 11:45:52 - INFO - __main__ - Step 103576: {'lr': 0.00011188056768428817, 'samples': 19886592, 'steps': 103575, 'loss/train': 1.500478744506836} 11/07/2021 11:45:53 - INFO - __main__ - Step 103577: {'lr': 0.00011187614439721194, 'samples': 19886784, 'steps': 103576, 'loss/train': 1.0128124952316284} 11/07/2021 11:45:53 - INFO - __main__ - Step 103578: {'lr': 0.00011187172117237216, 'samples': 19886976, 'steps': 103577, 'loss/train': 1.420758605003357} 11/07/2021 11:45:53 - INFO - __main__ - Step 103579: {'lr': 0.00011186729800977085, 'samples': 19887168, 'steps': 103578, 'loss/train': 1.4511831998825073} 11/07/2021 11:45:55 - INFO - __main__ - Step 103580: {'lr': 0.00011186287490941002, 'samples': 19887360, 'steps': 103579, 'loss/train': 1.900110125541687} 11/07/2021 11:45:55 - INFO - __main__ - Step 103581: {'lr': 0.00011185845187129164, 'samples': 19887552, 'steps': 103580, 'loss/train': 1.3089851140975952} 11/07/2021 11:45:56 - INFO - __main__ - Step 103582: {'lr': 0.0001118540288954177, 'samples': 19887744, 'steps': 103581, 'loss/train': 1.765537142753601} 11/07/2021 11:45:56 - INFO - __main__ - Step 103583: {'lr': 0.00011184960598179033, 'samples': 19887936, 'steps': 103582, 'loss/train': 1.5762927532196045} 11/07/2021 11:45:56 - INFO - __main__ - Step 103584: {'lr': 0.00011184518313041128, 'samples': 19888128, 'steps': 103583, 'loss/train': 1.3751003742218018} 11/07/2021 11:45:57 - INFO - __main__ - Step 103585: {'lr': 0.00011184076034128265, 'samples': 19888320, 'steps': 103584, 'loss/train': 1.307672381401062} 11/07/2021 11:45:58 - INFO - __main__ - Step 103586: {'lr': 0.00011183633761440645, 'samples': 19888512, 'steps': 103585, 'loss/train': 0.37513843178749084} 11/07/2021 11:45:58 - INFO - __main__ - Step 103587: {'lr': 0.00011183191494978467, 'samples': 19888704, 'steps': 103586, 'loss/train': 1.0875298976898193} 11/07/2021 11:45:59 - INFO - __main__ - Step 103588: {'lr': 0.00011182749234741929, 'samples': 19888896, 'steps': 103587, 'loss/train': 1.3890061378479004} 11/07/2021 11:45:59 - INFO - __main__ - Step 103589: {'lr': 0.0001118230698073123, 'samples': 19889088, 'steps': 103588, 'loss/train': 1.2266162633895874} 11/07/2021 11:45:59 - INFO - __main__ - Step 103590: {'lr': 0.00011181864732946573, 'samples': 19889280, 'steps': 103589, 'loss/train': 1.3832817077636719} 11/07/2021 11:46:00 - INFO - __main__ - Step 103591: {'lr': 0.00011181422491388152, 'samples': 19889472, 'steps': 103590, 'loss/train': 1.5921989679336548} 11/07/2021 11:46:01 - INFO - __main__ - Step 103592: {'lr': 0.0001118098025605617, 'samples': 19889664, 'steps': 103591, 'loss/train': 1.2805507183074951} 11/07/2021 11:46:01 - INFO - __main__ - Step 103593: {'lr': 0.00011180538026950826, 'samples': 19889856, 'steps': 103592, 'loss/train': 1.4286116361618042} 11/07/2021 11:46:01 - INFO - __main__ - Step 103594: {'lr': 0.00011180095804072315, 'samples': 19890048, 'steps': 103593, 'loss/train': 0.9020841121673584} 11/07/2021 11:46:02 - INFO - __main__ - Step 103595: {'lr': 0.00011179653587420844, 'samples': 19890240, 'steps': 103594, 'loss/train': 1.1676967144012451} 11/07/2021 11:46:03 - INFO - __main__ - Step 103596: {'lr': 0.00011179211376996604, 'samples': 19890432, 'steps': 103595, 'loss/train': 1.3514758348464966} 11/07/2021 11:46:03 - INFO - __main__ - Step 103597: {'lr': 0.0001117876917279981, 'samples': 19890624, 'steps': 103596, 'loss/train': 1.452530860900879} 11/07/2021 11:46:03 - INFO - __main__ - Step 103598: {'lr': 0.00011178326974830638, 'samples': 19890816, 'steps': 103597, 'loss/train': 1.1221131086349487} 11/07/2021 11:46:04 - INFO - __main__ - Step 103599: {'lr': 0.00011177884783089299, 'samples': 19891008, 'steps': 103598, 'loss/train': 1.4749382734298706} 11/07/2021 11:46:04 - INFO - __main__ - Step 103600: {'lr': 0.00011177442597575993, 'samples': 19891200, 'steps': 103599, 'loss/train': 1.1146756410598755} 11/07/2021 11:46:04 - INFO - __main__ - Step 103601: {'lr': 0.00011177000418290917, 'samples': 19891392, 'steps': 103600, 'loss/train': 1.7034821510314941} 11/07/2021 11:46:05 - INFO - __main__ - Step 103602: {'lr': 0.00011176558245234273, 'samples': 19891584, 'steps': 103601, 'loss/train': 1.5903819799423218} 11/07/2021 11:46:06 - INFO - __main__ - Step 103603: {'lr': 0.00011176116078406257, 'samples': 19891776, 'steps': 103602, 'loss/train': 1.0249879360198975} 11/07/2021 11:46:06 - INFO - __main__ - Step 103604: {'lr': 0.0001117567391780707, 'samples': 19891968, 'steps': 103603, 'loss/train': 1.5126820802688599} 11/07/2021 11:46:06 - INFO - __main__ - Step 103605: {'lr': 0.00011175231763436911, 'samples': 19892160, 'steps': 103604, 'loss/train': 1.2981257438659668} 11/07/2021 11:46:07 - INFO - __main__ - Step 103606: {'lr': 0.0001117478961529598, 'samples': 19892352, 'steps': 103605, 'loss/train': 1.041835904121399} 11/07/2021 11:46:08 - INFO - __main__ - Step 103607: {'lr': 0.00011174347473384474, 'samples': 19892544, 'steps': 103606, 'loss/train': 1.7937846183776855} 11/07/2021 11:46:08 - INFO - __main__ - Step 103608: {'lr': 0.00011173905337702594, 'samples': 19892736, 'steps': 103607, 'loss/train': 1.2605048418045044} 11/07/2021 11:46:09 - INFO - __main__ - Step 103609: {'lr': 0.0001117346320825054, 'samples': 19892928, 'steps': 103608, 'loss/train': 0.6409690380096436} 11/07/2021 11:46:09 - INFO - __main__ - Step 103610: {'lr': 0.00011173021085028518, 'samples': 19893120, 'steps': 103609, 'loss/train': 1.3553581237792969} 11/07/2021 11:46:09 - INFO - __main__ - Step 103611: {'lr': 0.00011172578968036712, 'samples': 19893312, 'steps': 103610, 'loss/train': 0.9258628487586975} 11/07/2021 11:46:10 - INFO - __main__ - Step 103612: {'lr': 0.00011172136857275325, 'samples': 19893504, 'steps': 103611, 'loss/train': 1.129651427268982} 11/07/2021 11:46:11 - INFO - __main__ - Step 103613: {'lr': 0.00011171694752744562, 'samples': 19893696, 'steps': 103612, 'loss/train': 1.521697759628296} 11/07/2021 11:46:11 - INFO - __main__ - Step 103614: {'lr': 0.00011171252654444622, 'samples': 19893888, 'steps': 103613, 'loss/train': 1.100273847579956} 11/07/2021 11:46:11 - INFO - __main__ - Step 103615: {'lr': 0.000111708105623757, 'samples': 19894080, 'steps': 103614, 'loss/train': 1.2673842906951904} 11/07/2021 11:46:12 - INFO - __main__ - Step 103616: {'lr': 0.00011170368476537998, 'samples': 19894272, 'steps': 103615, 'loss/train': 1.4489359855651855} 11/07/2021 11:46:13 - INFO - __main__ - Step 103617: {'lr': 0.00011169926396931712, 'samples': 19894464, 'steps': 103616, 'loss/train': 1.2483983039855957} 11/07/2021 11:46:13 - INFO - __main__ - Step 103618: {'lr': 0.00011169484323557047, 'samples': 19894656, 'steps': 103617, 'loss/train': 1.6420103311538696} 11/07/2021 11:46:13 - INFO - __main__ - Step 103619: {'lr': 0.00011169042256414197, 'samples': 19894848, 'steps': 103618, 'loss/train': 1.299409031867981} 11/07/2021 11:46:14 - INFO - __main__ - Step 103620: {'lr': 0.00011168600195503364, 'samples': 19895040, 'steps': 103619, 'loss/train': 2.259232759475708} 11/07/2021 11:46:14 - INFO - __main__ - Step 103621: {'lr': 0.00011168158140824746, 'samples': 19895232, 'steps': 103620, 'loss/train': 0.7387141585350037} 11/07/2021 11:46:15 - INFO - __main__ - Step 103622: {'lr': 0.00011167716092378544, 'samples': 19895424, 'steps': 103621, 'loss/train': 1.4461313486099243} 11/07/2021 11:46:16 - INFO - __main__ - Step 103623: {'lr': 0.00011167274050164955, 'samples': 19895616, 'steps': 103622, 'loss/train': 1.3367385864257812} 11/07/2021 11:46:16 - INFO - __main__ - Step 103624: {'lr': 0.00011166832014184186, 'samples': 19895808, 'steps': 103623, 'loss/train': 1.0695902109146118} 11/07/2021 11:46:16 - INFO - __main__ - Step 103625: {'lr': 0.00011166389984436423, 'samples': 19896000, 'steps': 103624, 'loss/train': 1.0267678499221802} 11/07/2021 11:46:17 - INFO - __main__ - Step 103626: {'lr': 0.00011165947960921868, 'samples': 19896192, 'steps': 103625, 'loss/train': 1.6273659467697144} 11/07/2021 11:46:17 - INFO - __main__ - Step 103627: {'lr': 0.00011165505943640725, 'samples': 19896384, 'steps': 103626, 'loss/train': 0.9761868715286255} 11/07/2021 11:46:18 - INFO - __main__ - Step 103628: {'lr': 0.00011165063932593192, 'samples': 19896576, 'steps': 103627, 'loss/train': 1.55350661277771} 11/07/2021 11:46:18 - INFO - __main__ - Step 103629: {'lr': 0.00011164621927779467, 'samples': 19896768, 'steps': 103628, 'loss/train': 1.4230796098709106} 11/07/2021 11:46:19 - INFO - __main__ - Step 103630: {'lr': 0.0001116417992919975, 'samples': 19896960, 'steps': 103629, 'loss/train': 1.1945124864578247} 11/07/2021 11:46:19 - INFO - __main__ - Step 103631: {'lr': 0.0001116373793685424, 'samples': 19897152, 'steps': 103630, 'loss/train': 1.394078254699707} 11/07/2021 11:46:20 - INFO - __main__ - Step 103632: {'lr': 0.00011163295950743139, 'samples': 19897344, 'steps': 103631, 'loss/train': 2.046485424041748} 11/07/2021 11:46:21 - INFO - __main__ - Step 103633: {'lr': 0.00011162853970866637, 'samples': 19897536, 'steps': 103632, 'loss/train': 1.3086556196212769} 11/07/2021 11:46:21 - INFO - __main__ - Step 103634: {'lr': 0.00011162411997224945, 'samples': 19897728, 'steps': 103633, 'loss/train': 1.51875901222229} 11/07/2021 11:46:22 - INFO - __main__ - Step 103635: {'lr': 0.00011161970029818255, 'samples': 19897920, 'steps': 103634, 'loss/train': 1.2690402269363403} 11/07/2021 11:46:22 - INFO - __main__ - Step 103636: {'lr': 0.00011161528068646767, 'samples': 19898112, 'steps': 103635, 'loss/train': 1.589772343635559} 11/07/2021 11:46:22 - INFO - __main__ - Step 103637: {'lr': 0.00011161086113710692, 'samples': 19898304, 'steps': 103636, 'loss/train': 1.9269084930419922} 11/07/2021 11:46:23 - INFO - __main__ - Step 103638: {'lr': 0.00011160644165010206, 'samples': 19898496, 'steps': 103637, 'loss/train': 2.7560877799987793} 11/07/2021 11:46:24 - INFO - __main__ - Step 103639: {'lr': 0.0001116020222254552, 'samples': 19898688, 'steps': 103638, 'loss/train': 1.2975192070007324} 11/07/2021 11:46:24 - INFO - __main__ - Step 103640: {'lr': 0.00011159760286316836, 'samples': 19898880, 'steps': 103639, 'loss/train': 1.0360136032104492} 11/07/2021 11:46:25 - INFO - __main__ - Step 103641: {'lr': 0.00011159318356324349, 'samples': 19899072, 'steps': 103640, 'loss/train': 1.0889780521392822} 11/07/2021 11:46:25 - INFO - __main__ - Step 103642: {'lr': 0.0001115887643256826, 'samples': 19899264, 'steps': 103641, 'loss/train': 0.9909052848815918} 11/07/2021 11:46:25 - INFO - __main__ - Step 103643: {'lr': 0.00011158434515048768, 'samples': 19899456, 'steps': 103642, 'loss/train': 0.2775697112083435} 11/07/2021 11:46:26 - INFO - __main__ - Step 103644: {'lr': 0.00011157992603766073, 'samples': 19899648, 'steps': 103643, 'loss/train': 0.8877982497215271} 11/07/2021 11:46:27 - INFO - __main__ - Step 103645: {'lr': 0.0001115755069872037, 'samples': 19899840, 'steps': 103644, 'loss/train': 0.9077609181404114} 11/07/2021 11:46:27 - INFO - __main__ - Step 103646: {'lr': 0.00011157108799911863, 'samples': 19900032, 'steps': 103645, 'loss/train': 1.3381093740463257} 11/07/2021 11:46:27 - INFO - __main__ - Step 103647: {'lr': 0.00011156666907340749, 'samples': 19900224, 'steps': 103646, 'loss/train': 1.2782872915267944} 11/07/2021 11:46:28 - INFO - __main__ - Step 103648: {'lr': 0.00011156225021007227, 'samples': 19900416, 'steps': 103647, 'loss/train': 1.3189188241958618} 11/07/2021 11:46:29 - INFO - __main__ - Step 103649: {'lr': 0.00011155783140911496, 'samples': 19900608, 'steps': 103648, 'loss/train': 2.2053558826446533} 11/07/2021 11:46:29 - INFO - __main__ - Step 103650: {'lr': 0.00011155341267053756, 'samples': 19900800, 'steps': 103649, 'loss/train': 1.4147861003875732} 11/07/2021 11:46:30 - INFO - __main__ - Step 103651: {'lr': 0.00011154899399434215, 'samples': 19900992, 'steps': 103650, 'loss/train': 1.384127140045166} 11/07/2021 11:46:30 - INFO - __main__ - Step 103652: {'lr': 0.00011154457538053054, 'samples': 19901184, 'steps': 103651, 'loss/train': 1.8423964977264404} 11/07/2021 11:46:30 - INFO - __main__ - Step 103653: {'lr': 0.00011154015682910479, 'samples': 19901376, 'steps': 103652, 'loss/train': 1.0919398069381714} 11/07/2021 11:46:31 - INFO - __main__ - Step 103654: {'lr': 0.00011153573834006691, 'samples': 19901568, 'steps': 103653, 'loss/train': 1.4428809881210327} 11/07/2021 11:46:32 - INFO - __main__ - Step 103655: {'lr': 0.00011153131991341889, 'samples': 19901760, 'steps': 103654, 'loss/train': 1.5735043287277222} 11/07/2021 11:46:32 - INFO - __main__ - Step 103656: {'lr': 0.00011152690154916273, 'samples': 19901952, 'steps': 103655, 'loss/train': 1.1566307544708252} 11/07/2021 11:46:32 - INFO - __main__ - Step 103657: {'lr': 0.0001115224832473004, 'samples': 19902144, 'steps': 103656, 'loss/train': 1.3583693504333496} 11/07/2021 11:46:33 - INFO - __main__ - Step 103658: {'lr': 0.00011151806500783393, 'samples': 19902336, 'steps': 103657, 'loss/train': 1.105594277381897} 11/07/2021 11:46:33 - INFO - __main__ - Step 103659: {'lr': 0.00011151364683076526, 'samples': 19902528, 'steps': 103658, 'loss/train': 1.3023415803909302} 11/07/2021 11:46:34 - INFO - __main__ - Step 103660: {'lr': 0.00011150922871609639, 'samples': 19902720, 'steps': 103659, 'loss/train': 1.776994228363037} 11/07/2021 11:46:34 - INFO - __main__ - Step 103661: {'lr': 0.00011150481066382937, 'samples': 19902912, 'steps': 103660, 'loss/train': 0.8916557431221008} 11/07/2021 11:46:35 - INFO - __main__ - Step 103662: {'lr': 0.00011150039267396611, 'samples': 19903104, 'steps': 103661, 'loss/train': 1.4739433526992798} 11/07/2021 11:46:35 - INFO - __main__ - Step 103663: {'lr': 0.00011149597474650864, 'samples': 19903296, 'steps': 103662, 'loss/train': 1.351214051246643} 11/07/2021 11:46:35 - INFO - __main__ - Step 103664: {'lr': 0.00011149155688145904, 'samples': 19903488, 'steps': 103663, 'loss/train': 1.5012391805648804} 11/07/2021 11:46:37 - INFO - __main__ - Step 103665: {'lr': 0.00011148713907881914, 'samples': 19903680, 'steps': 103664, 'loss/train': 1.3192552328109741} 11/07/2021 11:46:37 - INFO - __main__ - Step 103666: {'lr': 0.00011148272133859096, 'samples': 19903872, 'steps': 103665, 'loss/train': 1.07534658908844} 11/07/2021 11:46:37 - INFO - __main__ - Step 103667: {'lr': 0.00011147830366077654, 'samples': 19904064, 'steps': 103666, 'loss/train': 0.6812326908111572} 11/07/2021 11:46:38 - INFO - __main__ - Step 103668: {'lr': 0.00011147388604537786, 'samples': 19904256, 'steps': 103667, 'loss/train': 1.681624412536621} 11/07/2021 11:46:38 - INFO - __main__ - Step 103669: {'lr': 0.00011146946849239692, 'samples': 19904448, 'steps': 103668, 'loss/train': 1.1901522874832153} 11/07/2021 11:46:40 - INFO - __main__ - Step 103670: {'lr': 0.00011146505100183568, 'samples': 19904640, 'steps': 103669, 'loss/train': 1.4969819784164429} 11/07/2021 11:46:40 - INFO - __main__ - Step 103671: {'lr': 0.00011146063357369619, 'samples': 19904832, 'steps': 103670, 'loss/train': 1.4594005346298218} 11/07/2021 11:46:40 - INFO - __main__ - Step 103672: {'lr': 0.00011145621620798035, 'samples': 19905024, 'steps': 103671, 'loss/train': 0.47039303183555603} 11/07/2021 11:46:41 - INFO - __main__ - Step 103673: {'lr': 0.00011145179890469023, 'samples': 19905216, 'steps': 103672, 'loss/train': 1.0462963581085205} 11/07/2021 11:46:41 - INFO - __main__ - Step 103674: {'lr': 0.00011144738166382779, 'samples': 19905408, 'steps': 103673, 'loss/train': 1.289834976196289} 11/07/2021 11:46:41 - INFO - __main__ - Step 103675: {'lr': 0.00011144296448539501, 'samples': 19905600, 'steps': 103674, 'loss/train': 1.360195517539978} 11/07/2021 11:46:42 - INFO - __main__ - Step 103676: {'lr': 0.00011143854736939391, 'samples': 19905792, 'steps': 103675, 'loss/train': 1.3860715627670288} 11/07/2021 11:46:43 - INFO - __main__ - Step 103677: {'lr': 0.00011143413031582644, 'samples': 19905984, 'steps': 103676, 'loss/train': 1.303189754486084} 11/07/2021 11:46:43 - INFO - __main__ - Step 103678: {'lr': 0.0001114297133246947, 'samples': 19906176, 'steps': 103677, 'loss/train': 1.4287433624267578} 11/07/2021 11:46:43 - INFO - __main__ - Step 103679: {'lr': 0.00011142529639600051, 'samples': 19906368, 'steps': 103678, 'loss/train': 1.4124735593795776} 11/07/2021 11:46:44 - INFO - __main__ - Step 103680: {'lr': 0.00011142087952974598, 'samples': 19906560, 'steps': 103679, 'loss/train': 1.4111964702606201} 11/07/2021 11:46:45 - INFO - __main__ - Step 103681: {'lr': 0.00011141646272593303, 'samples': 19906752, 'steps': 103680, 'loss/train': 1.2282302379608154} 11/07/2021 11:46:45 - INFO - __main__ - Step 103682: {'lr': 0.00011141204598456367, 'samples': 19906944, 'steps': 103681, 'loss/train': 1.5095072984695435} 11/07/2021 11:46:46 - INFO - __main__ - Step 103683: {'lr': 0.00011140762930563995, 'samples': 19907136, 'steps': 103682, 'loss/train': 1.3188961744308472} 11/07/2021 11:46:46 - INFO - __main__ - Step 103684: {'lr': 0.00011140321268916376, 'samples': 19907328, 'steps': 103683, 'loss/train': 0.7785894870758057} 11/07/2021 11:46:46 - INFO - __main__ - Step 103685: {'lr': 0.00011139879613513718, 'samples': 19907520, 'steps': 103684, 'loss/train': 1.3089395761489868} 11/07/2021 11:46:47 - INFO - __main__ - Step 103686: {'lr': 0.00011139437964356214, 'samples': 19907712, 'steps': 103685, 'loss/train': 1.2382142543792725} 11/07/2021 11:46:48 - INFO - __main__ - Step 103687: {'lr': 0.00011138996321444068, 'samples': 19907904, 'steps': 103686, 'loss/train': 1.4269641637802124} 11/07/2021 11:46:48 - INFO - __main__ - Step 103688: {'lr': 0.00011138554684777475, 'samples': 19908096, 'steps': 103687, 'loss/train': 1.1000748872756958} 11/07/2021 11:46:48 - INFO - __main__ - Step 103689: {'lr': 0.00011138113054356632, 'samples': 19908288, 'steps': 103688, 'loss/train': 0.7610182166099548} 11/07/2021 11:46:49 - INFO - __main__ - Step 103690: {'lr': 0.00011137671430181746, 'samples': 19908480, 'steps': 103689, 'loss/train': 1.2255258560180664} 11/07/2021 11:46:49 - INFO - __main__ - Step 103691: {'lr': 0.00011137229812253019, 'samples': 19908672, 'steps': 103690, 'loss/train': 1.1616665124893188} 11/07/2021 11:46:50 - INFO - __main__ - Step 103692: {'lr': 0.00011136788200570632, 'samples': 19908864, 'steps': 103691, 'loss/train': 1.4324544668197632} 11/07/2021 11:46:50 - INFO - __main__ - Step 103693: {'lr': 0.00011136346595134796, 'samples': 19909056, 'steps': 103692, 'loss/train': 1.0578582286834717} 11/07/2021 11:46:51 - INFO - __main__ - Step 103694: {'lr': 0.00011135904995945709, 'samples': 19909248, 'steps': 103693, 'loss/train': 1.3745137453079224} 11/07/2021 11:46:51 - INFO - __main__ - Step 103695: {'lr': 0.00011135463403003567, 'samples': 19909440, 'steps': 103694, 'loss/train': 1.5452337265014648} 11/07/2021 11:46:52 - INFO - __main__ - Step 103696: {'lr': 0.00011135021816308574, 'samples': 19909632, 'steps': 103695, 'loss/train': 1.1516547203063965} 11/07/2021 11:46:53 - INFO - __main__ - Step 103697: {'lr': 0.00011134580235860925, 'samples': 19909824, 'steps': 103696, 'loss/train': 1.5122846364974976} 11/07/2021 11:46:53 - INFO - __main__ - Step 103698: {'lr': 0.0001113413866166082, 'samples': 19910016, 'steps': 103697, 'loss/train': 1.598258137702942} 11/07/2021 11:46:53 - INFO - __main__ - Step 103699: {'lr': 0.00011133697093708456, 'samples': 19910208, 'steps': 103698, 'loss/train': 1.587246060371399} 11/07/2021 11:46:54 - INFO - __main__ - Step 103700: {'lr': 0.00011133255532004036, 'samples': 19910400, 'steps': 103699, 'loss/train': 1.4389457702636719} 11/07/2021 11:46:54 - INFO - __main__ - Step 103701: {'lr': 0.0001113281397654776, 'samples': 19910592, 'steps': 103700, 'loss/train': 1.035618543624878} 11/07/2021 11:46:55 - INFO - __main__ - Step 103702: {'lr': 0.0001113237242733982, 'samples': 19910784, 'steps': 103701, 'loss/train': 1.2335435152053833} 11/07/2021 11:46:55 - INFO - __main__ - Step 103703: {'lr': 0.0001113193088438042, 'samples': 19910976, 'steps': 103702, 'loss/train': 1.124046802520752} 11/07/2021 11:46:56 - INFO - __main__ - Step 103704: {'lr': 0.00011131489347669768, 'samples': 19911168, 'steps': 103703, 'loss/train': 1.3974087238311768} 11/07/2021 11:46:56 - INFO - __main__ - Step 103705: {'lr': 0.00011131047817208043, 'samples': 19911360, 'steps': 103704, 'loss/train': 1.5020387172698975} 11/07/2021 11:46:56 - INFO - __main__ - Step 103706: {'lr': 0.00011130606292995451, 'samples': 19911552, 'steps': 103705, 'loss/train': 0.7124864459037781} 11/07/2021 11:46:58 - INFO - __main__ - Step 103707: {'lr': 0.00011130164775032198, 'samples': 19911744, 'steps': 103706, 'loss/train': 1.1950111389160156} 11/07/2021 11:46:58 - INFO - __main__ - Step 103708: {'lr': 0.00011129723263318479, 'samples': 19911936, 'steps': 103707, 'loss/train': 1.6018891334533691} 11/07/2021 11:46:58 - INFO - __main__ - Step 103709: {'lr': 0.0001112928175785449, 'samples': 19912128, 'steps': 103708, 'loss/train': 1.6386319398880005} 11/07/2021 11:46:59 - INFO - __main__ - Step 103710: {'lr': 0.00011128840258640433, 'samples': 19912320, 'steps': 103709, 'loss/train': 1.2942230701446533} 11/07/2021 11:46:59 - INFO - __main__ - Step 103711: {'lr': 0.00011128398765676509, 'samples': 19912512, 'steps': 103710, 'loss/train': 1.4044464826583862} 11/07/2021 11:46:59 - INFO - __main__ - Step 103712: {'lr': 0.00011127957278962911, 'samples': 19912704, 'steps': 103711, 'loss/train': 1.3098249435424805} 11/07/2021 11:47:00 - INFO - __main__ - Step 103713: {'lr': 0.00011127515798499844, 'samples': 19912896, 'steps': 103712, 'loss/train': 1.3751587867736816} 11/07/2021 11:47:01 - INFO - __main__ - Step 103714: {'lr': 0.00011127074324287504, 'samples': 19913088, 'steps': 103713, 'loss/train': 1.5923113822937012} 11/07/2021 11:47:01 - INFO - __main__ - Step 103715: {'lr': 0.00011126632856326088, 'samples': 19913280, 'steps': 103714, 'loss/train': 0.6358866691589355} 11/07/2021 11:47:01 - INFO - __main__ - Step 103716: {'lr': 0.000111261913946158, 'samples': 19913472, 'steps': 103715, 'loss/train': 1.2322807312011719} 11/07/2021 11:47:02 - INFO - __main__ - Step 103717: {'lr': 0.00011125749939156835, 'samples': 19913664, 'steps': 103716, 'loss/train': 1.0605570077896118} 11/07/2021 11:47:03 - INFO - __main__ - Step 103718: {'lr': 0.00011125308489949401, 'samples': 19913856, 'steps': 103717, 'loss/train': 1.445562481880188} 11/07/2021 11:47:03 - INFO - __main__ - Step 103719: {'lr': 0.0001112486704699368, 'samples': 19914048, 'steps': 103718, 'loss/train': 1.4582868814468384} 11/07/2021 11:47:03 - INFO - __main__ - Step 103720: {'lr': 0.00011124425610289881, 'samples': 19914240, 'steps': 103719, 'loss/train': 1.3926222324371338} 11/07/2021 11:47:04 - INFO - __main__ - Step 103721: {'lr': 0.000111239841798382, 'samples': 19914432, 'steps': 103720, 'loss/train': 1.3902069330215454} 11/07/2021 11:47:04 - INFO - __main__ - Step 103722: {'lr': 0.00011123542755638841, 'samples': 19914624, 'steps': 103721, 'loss/train': 1.4209126234054565} 11/07/2021 11:47:05 - INFO - __main__ - Step 103723: {'lr': 0.00011123101337691995, 'samples': 19914816, 'steps': 103722, 'loss/train': 0.7838994860649109} 11/07/2021 11:47:06 - INFO - __main__ - Step 103724: {'lr': 0.00011122659925997868, 'samples': 19915008, 'steps': 103723, 'loss/train': 1.5039433240890503} 11/07/2021 11:47:06 - INFO - __main__ - Step 103725: {'lr': 0.00011122218520556657, 'samples': 19915200, 'steps': 103724, 'loss/train': 1.0362496376037598} 11/07/2021 11:47:06 - INFO - __main__ - Step 103726: {'lr': 0.0001112177712136856, 'samples': 19915392, 'steps': 103725, 'loss/train': 1.2080662250518799} 11/07/2021 11:47:07 - INFO - __main__ - Step 103727: {'lr': 0.00011121335728433776, 'samples': 19915584, 'steps': 103726, 'loss/train': 1.7813520431518555} 11/07/2021 11:47:08 - INFO - __main__ - Step 103728: {'lr': 0.00011120894341752502, 'samples': 19915776, 'steps': 103727, 'loss/train': 1.578770637512207} 11/07/2021 11:47:08 - INFO - __main__ - Step 103729: {'lr': 0.00011120452961324939, 'samples': 19915968, 'steps': 103728, 'loss/train': 1.2197245359420776} 11/07/2021 11:47:08 - INFO - __main__ - Step 103730: {'lr': 0.00011120011587151297, 'samples': 19916160, 'steps': 103729, 'loss/train': 0.7584170699119568} 11/07/2021 11:47:09 - INFO - __main__ - Step 103731: {'lr': 0.00011119570219231754, 'samples': 19916352, 'steps': 103730, 'loss/train': 0.9556599855422974} 11/07/2021 11:47:09 - INFO - __main__ - Step 103732: {'lr': 0.00011119128857566518, 'samples': 19916544, 'steps': 103731, 'loss/train': 0.8816158175468445} 11/07/2021 11:47:10 - INFO - __main__ - Step 103733: {'lr': 0.00011118687502155789, 'samples': 19916736, 'steps': 103732, 'loss/train': 1.1762232780456543} 11/07/2021 11:47:11 - INFO - __main__ - Step 103734: {'lr': 0.00011118246152999764, 'samples': 19916928, 'steps': 103733, 'loss/train': 1.5070390701293945} 11/07/2021 11:47:11 - INFO - __main__ - Step 103735: {'lr': 0.00011117804810098642, 'samples': 19917120, 'steps': 103734, 'loss/train': 1.4591054916381836} 11/07/2021 11:47:11 - INFO - __main__ - Step 103736: {'lr': 0.00011117363473452624, 'samples': 19917312, 'steps': 103735, 'loss/train': 1.0793311595916748} 11/07/2021 11:47:12 - INFO - __main__ - Step 103737: {'lr': 0.00011116922143061911, 'samples': 19917504, 'steps': 103736, 'loss/train': 0.7943126559257507} 11/07/2021 11:47:13 - INFO - __main__ - Step 103738: {'lr': 0.00011116480818926694, 'samples': 19917696, 'steps': 103737, 'loss/train': 1.4205293655395508} 11/07/2021 11:47:13 - INFO - __main__ - Step 103739: {'lr': 0.00011116039501047179, 'samples': 19917888, 'steps': 103738, 'loss/train': 1.2593598365783691} 11/07/2021 11:47:13 - INFO - __main__ - Step 103740: {'lr': 0.00011115598189423563, 'samples': 19918080, 'steps': 103739, 'loss/train': 1.3470966815948486} 11/07/2021 11:47:14 - INFO - __main__ - Step 103741: {'lr': 0.00011115156884056052, 'samples': 19918272, 'steps': 103740, 'loss/train': 1.5129297971725464} 11/07/2021 11:47:14 - INFO - __main__ - Step 103742: {'lr': 0.00011114715584944827, 'samples': 19918464, 'steps': 103741, 'loss/train': 1.3259367942810059} 11/07/2021 11:47:14 - INFO - __main__ - Step 103743: {'lr': 0.00011114274292090099, 'samples': 19918656, 'steps': 103742, 'loss/train': 1.4979592561721802} 11/07/2021 11:47:15 - INFO - __main__ - Step 103744: {'lr': 0.00011113833005492063, 'samples': 19918848, 'steps': 103743, 'loss/train': 1.428663730621338} 11/07/2021 11:47:16 - INFO - __main__ - Step 103745: {'lr': 0.0001111339172515092, 'samples': 19919040, 'steps': 103744, 'loss/train': 1.142960548400879} 11/07/2021 11:47:16 - INFO - __main__ - Step 103746: {'lr': 0.00011112950451066869, 'samples': 19919232, 'steps': 103745, 'loss/train': 1.5477536916732788} 11/07/2021 11:47:16 - INFO - __main__ - Step 103747: {'lr': 0.00011112509183240108, 'samples': 19919424, 'steps': 103746, 'loss/train': 1.1885169744491577} 11/07/2021 11:47:17 - INFO - __main__ - Step 103748: {'lr': 0.00011112067921670834, 'samples': 19919616, 'steps': 103747, 'loss/train': 1.1621595621109009} 11/07/2021 11:47:18 - INFO - __main__ - Step 103749: {'lr': 0.0001111162666635925, 'samples': 19919808, 'steps': 103748, 'loss/train': 1.5842595100402832} 11/07/2021 11:47:18 - INFO - __main__ - Step 103750: {'lr': 0.00011111185417305553, 'samples': 19920000, 'steps': 103749, 'loss/train': 1.802696943283081} 11/07/2021 11:47:19 - INFO - __main__ - Step 103751: {'lr': 0.00011110744174509952, 'samples': 19920192, 'steps': 103750, 'loss/train': 0.7983982563018799} 11/07/2021 11:47:19 - INFO - __main__ - Step 103752: {'lr': 0.00011110302937972624, 'samples': 19920384, 'steps': 103751, 'loss/train': 1.0936673879623413} 11/07/2021 11:47:19 - INFO - __main__ - Step 103753: {'lr': 0.0001110986170769378, 'samples': 19920576, 'steps': 103752, 'loss/train': 1.3770694732666016} 11/07/2021 11:47:20 - INFO - __main__ - Step 103754: {'lr': 0.00011109420483673616, 'samples': 19920768, 'steps': 103753, 'loss/train': 1.6136184930801392} 11/07/2021 11:47:21 - INFO - __main__ - Step 103755: {'lr': 0.00011108979265912336, 'samples': 19920960, 'steps': 103754, 'loss/train': 1.034071683883667} 11/07/2021 11:47:21 - INFO - __main__ - Step 103756: {'lr': 0.00011108538054410133, 'samples': 19921152, 'steps': 103755, 'loss/train': 1.3531948328018188} 11/07/2021 11:47:21 - INFO - __main__ - Step 103757: {'lr': 0.0001110809684916721, 'samples': 19921344, 'steps': 103756, 'loss/train': 1.13620126247406} 11/07/2021 11:47:22 - INFO - __main__ - Step 103758: {'lr': 0.00011107655650183762, 'samples': 19921536, 'steps': 103757, 'loss/train': 1.3966734409332275} 11/07/2021 11:47:23 - INFO - __main__ - Step 103759: {'lr': 0.00011107214457459991, 'samples': 19921728, 'steps': 103758, 'loss/train': 1.56337571144104} 11/07/2021 11:47:23 - INFO - __main__ - Step 103760: {'lr': 0.00011106773270996095, 'samples': 19921920, 'steps': 103759, 'loss/train': 1.5520998239517212} 11/07/2021 11:47:24 - INFO - __main__ - Step 103761: {'lr': 0.00011106332090792273, 'samples': 19922112, 'steps': 103760, 'loss/train': 1.280505895614624} 11/07/2021 11:47:24 - INFO - __main__ - Step 103762: {'lr': 0.00011105890916848735, 'samples': 19922304, 'steps': 103761, 'loss/train': 1.1209053993225098} 11/07/2021 11:47:24 - INFO - __main__ - Step 103763: {'lr': 0.00011105449749165655, 'samples': 19922496, 'steps': 103762, 'loss/train': 1.2529950141906738} 11/07/2021 11:47:26 - INFO - __main__ - Step 103764: {'lr': 0.00011105008587743246, 'samples': 19922688, 'steps': 103763, 'loss/train': 1.5664584636688232} 11/07/2021 11:47:26 - INFO - __main__ - Step 103765: {'lr': 0.00011104567432581709, 'samples': 19922880, 'steps': 103764, 'loss/train': 0.6253330111503601} 11/07/2021 11:47:26 - INFO - __main__ - Step 103766: {'lr': 0.00011104126283681234, 'samples': 19923072, 'steps': 103765, 'loss/train': 1.0257740020751953} 11/07/2021 11:47:27 - INFO - __main__ - Step 103767: {'lr': 0.00011103685141042027, 'samples': 19923264, 'steps': 103766, 'loss/train': 1.4143011569976807} 11/07/2021 11:47:27 - INFO - __main__ - Step 103768: {'lr': 0.00011103244004664287, 'samples': 19923456, 'steps': 103767, 'loss/train': 1.360237956047058} 11/07/2021 11:47:27 - INFO - __main__ - Step 103769: {'lr': 0.00011102802874548209, 'samples': 19923648, 'steps': 103768, 'loss/train': 0.9323886632919312} 11/07/2021 11:47:28 - INFO - __main__ - Step 103770: {'lr': 0.00011102361750693996, 'samples': 19923840, 'steps': 103769, 'loss/train': 1.4432138204574585} 11/07/2021 11:47:29 - INFO - __main__ - Step 103771: {'lr': 0.00011101920633101842, 'samples': 19924032, 'steps': 103770, 'loss/train': 1.6669955253601074} 11/07/2021 11:47:29 - INFO - __main__ - Step 103772: {'lr': 0.00011101479521771948, 'samples': 19924224, 'steps': 103771, 'loss/train': 1.54078209400177} 11/07/2021 11:47:29 - INFO - __main__ - Step 103773: {'lr': 0.00011101038416704523, 'samples': 19924416, 'steps': 103772, 'loss/train': 1.4778294563293457} 11/07/2021 11:47:30 - INFO - __main__ - Step 103774: {'lr': 0.00011100597317899747, 'samples': 19924608, 'steps': 103773, 'loss/train': 1.3346340656280518} 11/07/2021 11:47:31 - INFO - __main__ - Step 103775: {'lr': 0.00011100156225357827, 'samples': 19924800, 'steps': 103774, 'loss/train': 1.3121240139007568} 11/07/2021 11:47:31 - INFO - __main__ - Step 103776: {'lr': 0.00011099715139078962, 'samples': 19924992, 'steps': 103775, 'loss/train': 1.6867188215255737} 11/07/2021 11:47:32 - INFO - __main__ - Step 103777: {'lr': 0.0001109927405906335, 'samples': 19925184, 'steps': 103776, 'loss/train': 1.5110735893249512} 11/07/2021 11:47:32 - INFO - __main__ - Step 103778: {'lr': 0.00011098832985311191, 'samples': 19925376, 'steps': 103777, 'loss/train': 0.998872697353363} 11/07/2021 11:47:32 - INFO - __main__ - Step 103779: {'lr': 0.00011098391917822684, 'samples': 19925568, 'steps': 103778, 'loss/train': 1.3682920932769775} 11/07/2021 11:47:33 - INFO - __main__ - Step 103780: {'lr': 0.00011097950856598024, 'samples': 19925760, 'steps': 103779, 'loss/train': 1.3828922510147095} 11/07/2021 11:47:34 - INFO - __main__ - Step 103781: {'lr': 0.00011097509801637418, 'samples': 19925952, 'steps': 103780, 'loss/train': 1.233459711074829} 11/07/2021 11:47:34 - INFO - __main__ - Step 103782: {'lr': 0.00011097068752941056, 'samples': 19926144, 'steps': 103781, 'loss/train': 1.4109103679656982} 11/07/2021 11:47:34 - INFO - __main__ - Step 103783: {'lr': 0.00011096627710509142, 'samples': 19926336, 'steps': 103782, 'loss/train': 0.5662245750427246} 11/07/2021 11:47:35 - INFO - __main__ - Step 103784: {'lr': 0.0001109618667434187, 'samples': 19926528, 'steps': 103783, 'loss/train': 1.5358136892318726} 11/07/2021 11:47:36 - INFO - __main__ - Step 103785: {'lr': 0.00011095745644439453, 'samples': 19926720, 'steps': 103784, 'loss/train': 1.3222674131393433} 11/07/2021 11:47:36 - INFO - __main__ - Step 103786: {'lr': 0.00011095304620802072, 'samples': 19926912, 'steps': 103785, 'loss/train': 0.8800380825996399} 11/07/2021 11:47:36 - INFO - __main__ - Step 103787: {'lr': 0.00011094863603429928, 'samples': 19927104, 'steps': 103786, 'loss/train': 0.7556708455085754} 11/07/2021 11:47:37 - INFO - __main__ - Step 103788: {'lr': 0.00011094422592323224, 'samples': 19927296, 'steps': 103787, 'loss/train': 1.3272593021392822} 11/07/2021 11:47:37 - INFO - __main__ - Step 103789: {'lr': 0.00011093981587482163, 'samples': 19927488, 'steps': 103788, 'loss/train': 1.1647734642028809} 11/07/2021 11:47:38 - INFO - __main__ - Step 103790: {'lr': 0.00011093540588906936, 'samples': 19927680, 'steps': 103789, 'loss/train': 1.530499815940857} 11/07/2021 11:47:39 - INFO - __main__ - Step 103791: {'lr': 0.00011093099596597744, 'samples': 19927872, 'steps': 103790, 'loss/train': 1.3175071477890015} 11/07/2021 11:47:39 - INFO - __main__ - Step 103792: {'lr': 0.0001109265861055479, 'samples': 19928064, 'steps': 103791, 'loss/train': 1.414028286933899} 11/07/2021 11:47:39 - INFO - __main__ - Step 103793: {'lr': 0.00011092217630778267, 'samples': 19928256, 'steps': 103792, 'loss/train': 1.1419272422790527} 11/07/2021 11:47:40 - INFO - __main__ - Step 103794: {'lr': 0.00011091776657268377, 'samples': 19928448, 'steps': 103793, 'loss/train': 1.698318600654602} 11/07/2021 11:47:41 - INFO - __main__ - Step 103795: {'lr': 0.00011091335690025317, 'samples': 19928640, 'steps': 103794, 'loss/train': 1.4672094583511353} 11/07/2021 11:47:41 - INFO - __main__ - Step 103796: {'lr': 0.00011090894729049286, 'samples': 19928832, 'steps': 103795, 'loss/train': 1.4627314805984497} 11/07/2021 11:47:42 - INFO - __main__ - Step 103797: {'lr': 0.00011090453774340484, 'samples': 19929024, 'steps': 103796, 'loss/train': 1.290267825126648} 11/07/2021 11:47:42 - INFO - __main__ - Step 103798: {'lr': 0.0001109001282589911, 'samples': 19929216, 'steps': 103797, 'loss/train': 1.2503595352172852} 11/07/2021 11:47:42 - INFO - __main__ - Step 103799: {'lr': 0.00011089571883725369, 'samples': 19929408, 'steps': 103798, 'loss/train': 1.4684691429138184} 11/07/2021 11:47:43 - INFO - __main__ - Step 103800: {'lr': 0.00011089130947819445, 'samples': 19929600, 'steps': 103799, 'loss/train': 1.2927687168121338} 11/07/2021 11:47:44 - INFO - __main__ - Step 103801: {'lr': 0.00011088690018181544, 'samples': 19929792, 'steps': 103800, 'loss/train': 0.5427897572517395} 11/07/2021 11:47:44 - INFO - __main__ - Step 103802: {'lr': 0.00011088249094811861, 'samples': 19929984, 'steps': 103801, 'loss/train': 1.3410667181015015} 11/07/2021 11:47:45 - INFO - __main__ - Step 103803: {'lr': 0.00011087808177710603, 'samples': 19930176, 'steps': 103802, 'loss/train': 1.4821665287017822} 11/07/2021 11:47:45 - INFO - __main__ - Step 103804: {'lr': 0.00011087367266877963, 'samples': 19930368, 'steps': 103803, 'loss/train': 1.026607871055603} 11/07/2021 11:47:45 - INFO - __main__ - Step 103805: {'lr': 0.00011086926362314137, 'samples': 19930560, 'steps': 103804, 'loss/train': 1.2490233182907104} 11/07/2021 11:47:46 - INFO - __main__ - Step 103806: {'lr': 0.0001108648546401933, 'samples': 19930752, 'steps': 103805, 'loss/train': 1.6869242191314697} 11/07/2021 11:47:47 - INFO - __main__ - Step 103807: {'lr': 0.00011086044571993739, 'samples': 19930944, 'steps': 103806, 'loss/train': 1.3210375308990479} 11/07/2021 11:47:47 - INFO - __main__ - Step 103808: {'lr': 0.00011085603686237558, 'samples': 19931136, 'steps': 103807, 'loss/train': 1.1924118995666504} 11/07/2021 11:47:47 - INFO - __main__ - Step 103809: {'lr': 0.00011085162806750992, 'samples': 19931328, 'steps': 103808, 'loss/train': 0.6171709299087524} 11/07/2021 11:47:48 - INFO - __main__ - Step 103810: {'lr': 0.00011084721933534236, 'samples': 19931520, 'steps': 103809, 'loss/train': 0.8496022820472717} 11/07/2021 11:47:49 - INFO - __main__ - Step 103811: {'lr': 0.0001108428106658749, 'samples': 19931712, 'steps': 103810, 'loss/train': 0.946782112121582} 11/07/2021 11:47:49 - INFO - __main__ - Step 103812: {'lr': 0.00011083840205910964, 'samples': 19931904, 'steps': 103811, 'loss/train': 0.9129055142402649} 11/07/2021 11:47:50 - INFO - __main__ - Step 103813: {'lr': 0.00011083399351504834, 'samples': 19932096, 'steps': 103812, 'loss/train': 0.7940320372581482} 11/07/2021 11:47:50 - INFO - __main__ - Step 103814: {'lr': 0.00011082958503369306, 'samples': 19932288, 'steps': 103813, 'loss/train': 1.5248404741287231} 11/07/2021 11:47:50 - INFO - __main__ - Step 103815: {'lr': 0.00011082517661504584, 'samples': 19932480, 'steps': 103814, 'loss/train': 1.0704607963562012} 11/07/2021 11:47:52 - INFO - __main__ - Step 103816: {'lr': 0.00011082076825910867, 'samples': 19932672, 'steps': 103815, 'loss/train': 0.6968886852264404} 11/07/2021 11:47:52 - INFO - __main__ - Step 103817: {'lr': 0.0001108163599658835, 'samples': 19932864, 'steps': 103816, 'loss/train': 1.534516453742981} 11/07/2021 11:47:53 - INFO - __main__ - Step 103818: {'lr': 0.00011081195173537231, 'samples': 19933056, 'steps': 103817, 'loss/train': 1.3250176906585693} 11/07/2021 11:47:53 - INFO - __main__ - Step 103819: {'lr': 0.00011080754356757714, 'samples': 19933248, 'steps': 103818, 'loss/train': 1.370077133178711} 11/07/2021 11:47:53 - INFO - __main__ - Step 103820: {'lr': 0.00011080313546249993, 'samples': 19933440, 'steps': 103819, 'loss/train': 1.2255717515945435} 11/07/2021 11:47:54 - INFO - __main__ - Step 103821: {'lr': 0.00011079872742014268, 'samples': 19933632, 'steps': 103820, 'loss/train': 0.9692107439041138} 11/07/2021 11:47:54 - INFO - __main__ - Step 103822: {'lr': 0.00011079431944050738, 'samples': 19933824, 'steps': 103821, 'loss/train': 1.0629663467407227} 11/07/2021 11:47:55 - INFO - __main__ - Step 103823: {'lr': 0.000110789911523596, 'samples': 19934016, 'steps': 103822, 'loss/train': 0.996589720249176} 11/07/2021 11:47:55 - INFO - __main__ - Step 103824: {'lr': 0.00011078550366941053, 'samples': 19934208, 'steps': 103823, 'loss/train': 1.5035897493362427} 11/07/2021 11:47:56 - INFO - __main__ - Step 103825: {'lr': 0.0001107810958779531, 'samples': 19934400, 'steps': 103824, 'loss/train': 0.9934889674186707} 11/07/2021 11:47:56 - INFO - __main__ - Step 103826: {'lr': 0.00011077668814922543, 'samples': 19934592, 'steps': 103825, 'loss/train': 1.3523845672607422} 11/07/2021 11:47:56 - INFO - __main__ - Step 103827: {'lr': 0.00011077228048322962, 'samples': 19934784, 'steps': 103826, 'loss/train': 1.3012744188308716} 11/07/2021 11:47:58 - INFO - __main__ - Step 103828: {'lr': 0.0001107678728799677, 'samples': 19934976, 'steps': 103827, 'loss/train': 1.3863359689712524} 11/07/2021 11:47:58 - INFO - __main__ - Step 103829: {'lr': 0.00011076346533944162, 'samples': 19935168, 'steps': 103828, 'loss/train': 1.2967673540115356} 11/07/2021 11:47:58 - INFO - __main__ - Step 103830: {'lr': 0.00011075905786165339, 'samples': 19935360, 'steps': 103829, 'loss/train': 0.6655264496803284} 11/07/2021 11:47:59 - INFO - __main__ - Step 103831: {'lr': 0.00011075465044660496, 'samples': 19935552, 'steps': 103830, 'loss/train': 1.361082673072815} 11/07/2021 11:47:59 - INFO - __main__ - Step 103832: {'lr': 0.00011075024309429835, 'samples': 19935744, 'steps': 103831, 'loss/train': 0.6047565340995789} 11/07/2021 11:48:00 - INFO - __main__ - Step 103833: {'lr': 0.00011074583580473552, 'samples': 19935936, 'steps': 103832, 'loss/train': 1.7172678709030151} 11/07/2021 11:48:00 - INFO - __main__ - Step 103834: {'lr': 0.00011074142857791846, 'samples': 19936128, 'steps': 103833, 'loss/train': 1.4199906587600708} 11/07/2021 11:48:01 - INFO - __main__ - Step 103835: {'lr': 0.0001107370214138492, 'samples': 19936320, 'steps': 103834, 'loss/train': 0.6033650636672974} 11/07/2021 11:48:01 - INFO - __main__ - Step 103836: {'lr': 0.00011073261431252965, 'samples': 19936512, 'steps': 103835, 'loss/train': 0.8268667459487915} 11/07/2021 11:48:01 - INFO - __main__ - Step 103837: {'lr': 0.00011072820727396186, 'samples': 19936704, 'steps': 103836, 'loss/train': 1.2043073177337646} 11/07/2021 11:48:02 - INFO - __main__ - Step 103838: {'lr': 0.00011072380029814777, 'samples': 19936896, 'steps': 103837, 'loss/train': 1.1268049478530884} 11/07/2021 11:48:03 - INFO - __main__ - Step 103839: {'lr': 0.00011071939338508949, 'samples': 19937088, 'steps': 103838, 'loss/train': 1.319192886352539} 11/07/2021 11:48:03 - INFO - __main__ - Step 103840: {'lr': 0.00011071498653478881, 'samples': 19937280, 'steps': 103839, 'loss/train': 1.3833353519439697} 11/07/2021 11:48:03 - INFO - __main__ - Step 103841: {'lr': 0.00011071057974724782, 'samples': 19937472, 'steps': 103840, 'loss/train': 1.3015050888061523} 11/07/2021 11:48:04 - INFO - __main__ - Step 103842: {'lr': 0.00011070617302246847, 'samples': 19937664, 'steps': 103841, 'loss/train': 0.9973832368850708} 11/07/2021 11:48:05 - INFO - __main__ - Step 103843: {'lr': 0.00011070176636045278, 'samples': 19937856, 'steps': 103842, 'loss/train': 1.5051382780075073} 11/07/2021 11:48:05 - INFO - __main__ - Step 103844: {'lr': 0.00011069735976120274, 'samples': 19938048, 'steps': 103843, 'loss/train': 1.2897393703460693} 11/07/2021 11:48:06 - INFO - __main__ - Step 103845: {'lr': 0.00011069295322472028, 'samples': 19938240, 'steps': 103844, 'loss/train': 1.6857430934906006} 11/07/2021 11:48:06 - INFO - __main__ - Step 103846: {'lr': 0.00011068854675100745, 'samples': 19938432, 'steps': 103845, 'loss/train': 1.271488070487976} 11/07/2021 11:48:06 - INFO - __main__ - Step 103847: {'lr': 0.00011068414034006621, 'samples': 19938624, 'steps': 103846, 'loss/train': 5.667080879211426} 11/07/2021 11:48:07 - INFO - __main__ - Step 103848: {'lr': 0.00011067973399189857, 'samples': 19938816, 'steps': 103847, 'loss/train': 1.3967965841293335} 11/07/2021 11:48:08 - INFO - __main__ - Step 103849: {'lr': 0.00011067532770650646, 'samples': 19939008, 'steps': 103848, 'loss/train': 0.9282191395759583} 11/07/2021 11:48:08 - INFO - __main__ - Step 103850: {'lr': 0.0001106709214838919, 'samples': 19939200, 'steps': 103849, 'loss/train': 1.6468900442123413} 11/07/2021 11:48:09 - INFO - __main__ - Step 103851: {'lr': 0.00011066651532405689, 'samples': 19939392, 'steps': 103850, 'loss/train': 1.4206246137619019} 11/07/2021 11:48:09 - INFO - __main__ - Step 103852: {'lr': 0.00011066210922700348, 'samples': 19939584, 'steps': 103851, 'loss/train': 0.8706030249595642} 11/07/2021 11:48:09 - INFO - __main__ - Step 103853: {'lr': 0.00011065770319273346, 'samples': 19939776, 'steps': 103852, 'loss/train': 1.3656362295150757} 11/07/2021 11:48:10 - INFO - __main__ - Step 103854: {'lr': 0.00011065329722124898, 'samples': 19939968, 'steps': 103853, 'loss/train': 1.1663007736206055} 11/07/2021 11:48:11 - INFO - __main__ - Step 103855: {'lr': 0.00011064889131255192, 'samples': 19940160, 'steps': 103854, 'loss/train': 1.0257171392440796} 11/07/2021 11:48:11 - INFO - __main__ - Step 103856: {'lr': 0.00011064448546664435, 'samples': 19940352, 'steps': 103855, 'loss/train': 1.1032283306121826} 11/07/2021 11:48:11 - INFO - __main__ - Step 103857: {'lr': 0.0001106400796835282, 'samples': 19940544, 'steps': 103856, 'loss/train': 1.0807843208312988} 11/07/2021 11:48:12 - INFO - __main__ - Step 103858: {'lr': 0.0001106356739632055, 'samples': 19940736, 'steps': 103857, 'loss/train': 1.4582703113555908} 11/07/2021 11:48:14 - INFO - __main__ - Step 103859: {'lr': 0.00011063126830567824, 'samples': 19940928, 'steps': 103858, 'loss/train': 1.1187913417816162} 11/07/2021 11:48:14 - INFO - __main__ - Step 103860: {'lr': 0.00011062686271094836, 'samples': 19941120, 'steps': 103859, 'loss/train': 1.3706616163253784} 11/07/2021 11:48:14 - INFO - __main__ - Step 103861: {'lr': 0.00011062245717901784, 'samples': 19941312, 'steps': 103860, 'loss/train': 1.5577117204666138} 11/07/2021 11:48:15 - INFO - __main__ - Step 103862: {'lr': 0.0001106180517098887, 'samples': 19941504, 'steps': 103861, 'loss/train': 1.311091661453247} 11/07/2021 11:48:15 - INFO - __main__ - Step 103863: {'lr': 0.00011061364630356293, 'samples': 19941696, 'steps': 103862, 'loss/train': 0.8648540377616882} 11/07/2021 11:48:15 - INFO - __main__ - Step 103864: {'lr': 0.00011060924096004248, 'samples': 19941888, 'steps': 103863, 'loss/train': 1.1617764234542847} 11/07/2021 11:48:16 - INFO - __main__ - Step 103865: {'lr': 0.00011060483567932938, 'samples': 19942080, 'steps': 103864, 'loss/train': 2.877915382385254} 11/07/2021 11:48:16 - INFO - __main__ - Step 103866: {'lr': 0.00011060043046142568, 'samples': 19942272, 'steps': 103865, 'loss/train': 2.7305493354797363} 11/07/2021 11:48:17 - INFO - __main__ - Step 103867: {'lr': 0.00011059602530633317, 'samples': 19942464, 'steps': 103866, 'loss/train': 2.737440824508667} 11/07/2021 11:48:18 - INFO - __main__ - Step 103868: {'lr': 0.00011059162021405394, 'samples': 19942656, 'steps': 103867, 'loss/train': 1.2719608545303345} 11/07/2021 11:48:18 - INFO - __main__ - Step 103869: {'lr': 0.00011058721518458997, 'samples': 19942848, 'steps': 103868, 'loss/train': 0.8276745676994324} 11/07/2021 11:48:18 - INFO - __main__ - Step 103870: {'lr': 0.00011058281021794325, 'samples': 19943040, 'steps': 103869, 'loss/train': 1.144775629043579} 11/07/2021 11:48:19 - INFO - __main__ - Step 103871: {'lr': 0.00011057840531411578, 'samples': 19943232, 'steps': 103870, 'loss/train': 1.097304344177246} 11/07/2021 11:48:20 - INFO - __main__ - Step 103872: {'lr': 0.00011057400047310954, 'samples': 19943424, 'steps': 103871, 'loss/train': 1.611721158027649} 11/07/2021 11:48:20 - INFO - __main__ - Step 103873: {'lr': 0.00011056959569492647, 'samples': 19943616, 'steps': 103872, 'loss/train': 1.6025313138961792} 11/07/2021 11:48:20 - INFO - __main__ - Step 103874: {'lr': 0.00011056519097956861, 'samples': 19943808, 'steps': 103873, 'loss/train': 1.3818567991256714} 11/07/2021 11:48:21 - INFO - __main__ - Step 103875: {'lr': 0.00011056078632703789, 'samples': 19944000, 'steps': 103874, 'loss/train': 1.566216230392456} 11/07/2021 11:48:21 - INFO - __main__ - Step 103876: {'lr': 0.00011055638173733637, 'samples': 19944192, 'steps': 103875, 'loss/train': 0.19929833710193634} 11/07/2021 11:48:22 - INFO - __main__ - Step 103877: {'lr': 0.00011055197721046598, 'samples': 19944384, 'steps': 103876, 'loss/train': 1.5519341230392456} 11/07/2021 11:48:23 - INFO - __main__ - Step 103878: {'lr': 0.0001105475727464287, 'samples': 19944576, 'steps': 103877, 'loss/train': 1.3096612691879272} 11/07/2021 11:48:23 - INFO - __main__ - Step 103879: {'lr': 0.00011054316834522665, 'samples': 19944768, 'steps': 103878, 'loss/train': 1.6539596319198608} 11/07/2021 11:48:23 - INFO - __main__ - Step 103880: {'lr': 0.00011053876400686158, 'samples': 19944960, 'steps': 103879, 'loss/train': 1.2958388328552246} 11/07/2021 11:48:24 - INFO - __main__ - Step 103881: {'lr': 0.0001105343597313356, 'samples': 19945152, 'steps': 103880, 'loss/train': 1.0750818252563477} 11/07/2021 11:48:24 - INFO - __main__ - Step 103882: {'lr': 0.00011052995551865069, 'samples': 19945344, 'steps': 103881, 'loss/train': 1.8185957670211792} 11/07/2021 11:48:26 - INFO - __main__ - Step 103883: {'lr': 0.00011052555136880885, 'samples': 19945536, 'steps': 103882, 'loss/train': 1.3406965732574463} 11/07/2021 11:48:26 - INFO - __main__ - Step 103884: {'lr': 0.00011052114728181201, 'samples': 19945728, 'steps': 103883, 'loss/train': 1.8308563232421875} 11/07/2021 11:48:27 - INFO - __main__ - Step 103885: {'lr': 0.0001105167432576622, 'samples': 19945920, 'steps': 103884, 'loss/train': 1.7633910179138184} 11/07/2021 11:48:27 - INFO - __main__ - Step 103886: {'lr': 0.0001105123392963614, 'samples': 19946112, 'steps': 103885, 'loss/train': 1.629191517829895} 11/07/2021 11:48:27 - INFO - __main__ - Step 103887: {'lr': 0.00011050793539791157, 'samples': 19946304, 'steps': 103886, 'loss/train': 1.2527148723602295} 11/07/2021 11:48:28 - INFO - __main__ - Step 103888: {'lr': 0.00011050353156231474, 'samples': 19946496, 'steps': 103887, 'loss/train': 1.3265963792800903} 11/07/2021 11:48:28 - INFO - __main__ - Step 103889: {'lr': 0.00011049912778957283, 'samples': 19946688, 'steps': 103888, 'loss/train': 3.6018686294555664} 11/07/2021 11:48:29 - INFO - __main__ - Step 103890: {'lr': 0.00011049472407968788, 'samples': 19946880, 'steps': 103889, 'loss/train': 1.1555038690567017} 11/07/2021 11:48:29 - INFO - __main__ - Step 103891: {'lr': 0.00011049032043266186, 'samples': 19947072, 'steps': 103890, 'loss/train': 4.7647905349731445} 11/07/2021 11:48:30 - INFO - __main__ - Step 103892: {'lr': 0.00011048591684849677, 'samples': 19947264, 'steps': 103891, 'loss/train': 3.3075904846191406} 11/07/2021 11:48:30 - INFO - __main__ - Step 103893: {'lr': 0.00011048151332719461, 'samples': 19947456, 'steps': 103892, 'loss/train': 1.4908908605575562} 11/07/2021 11:48:31 - INFO - __main__ - Step 103894: {'lr': 0.00011047710986875729, 'samples': 19947648, 'steps': 103893, 'loss/train': 0.974383533000946} 11/07/2021 11:48:32 - INFO - __main__ - Step 103895: {'lr': 0.0001104727064731868, 'samples': 19947840, 'steps': 103894, 'loss/train': 1.4146214723587036} 11/07/2021 11:48:32 - INFO - __main__ - Step 103896: {'lr': 0.00011046830314048514, 'samples': 19948032, 'steps': 103895, 'loss/train': 1.0569868087768555} 11/07/2021 11:48:32 - INFO - __main__ - Step 103897: {'lr': 0.00011046389987065433, 'samples': 19948224, 'steps': 103896, 'loss/train': 0.46709394454956055} 11/07/2021 11:48:33 - INFO - __main__ - Step 103898: {'lr': 0.00011045949666369634, 'samples': 19948416, 'steps': 103897, 'loss/train': 1.657537817955017} 11/07/2021 11:48:33 - INFO - __main__ - Step 103899: {'lr': 0.00011045509351961314, 'samples': 19948608, 'steps': 103898, 'loss/train': 1.4910712242126465} 11/07/2021 11:48:33 - INFO - __main__ - Step 103900: {'lr': 0.00011045069043840673, 'samples': 19948800, 'steps': 103899, 'loss/train': 1.59012770652771} 11/07/2021 11:48:35 - INFO - __main__ - Step 103901: {'lr': 0.00011044628742007909, 'samples': 19948992, 'steps': 103900, 'loss/train': 1.2026313543319702} 11/07/2021 11:48:35 - INFO - __main__ - Step 103902: {'lr': 0.00011044188446463218, 'samples': 19949184, 'steps': 103901, 'loss/train': 0.47478020191192627} 11/07/2021 11:48:35 - INFO - __main__ - Step 103903: {'lr': 0.00011043748157206802, 'samples': 19949376, 'steps': 103902, 'loss/train': 1.3340866565704346} 11/07/2021 11:48:36 - INFO - __main__ - Step 103904: {'lr': 0.00011043307874238856, 'samples': 19949568, 'steps': 103903, 'loss/train': 1.5510618686676025} 11/07/2021 11:48:36 - INFO - __main__ - Step 103905: {'lr': 0.0001104286759755958, 'samples': 19949760, 'steps': 103904, 'loss/train': 1.0644856691360474} 11/07/2021 11:48:37 - INFO - __main__ - Step 103906: {'lr': 0.00011042427327169183, 'samples': 19949952, 'steps': 103905, 'loss/train': 1.4947491884231567} 11/07/2021 11:48:37 - INFO - __main__ - Step 103907: {'lr': 0.00011041987063067843, 'samples': 19950144, 'steps': 103906, 'loss/train': 1.1867115497589111} 11/07/2021 11:48:38 - INFO - __main__ - Step 103908: {'lr': 0.0001104154680525577, 'samples': 19950336, 'steps': 103907, 'loss/train': 2.0267019271850586} 11/07/2021 11:48:38 - INFO - __main__ - Step 103909: {'lr': 0.00011041106553733157, 'samples': 19950528, 'steps': 103908, 'loss/train': 0.9283954501152039} 11/07/2021 11:48:38 - INFO - __main__ - Step 103910: {'lr': 0.00011040666308500211, 'samples': 19950720, 'steps': 103909, 'loss/train': 1.584511160850525} 11/07/2021 11:48:40 - INFO - __main__ - Step 103911: {'lr': 0.00011040226069557121, 'samples': 19950912, 'steps': 103910, 'loss/train': 1.4128155708312988} 11/07/2021 11:48:40 - INFO - __main__ - Step 103912: {'lr': 0.0001103978583690409, 'samples': 19951104, 'steps': 103911, 'loss/train': 1.3501760959625244} 11/07/2021 11:48:40 - INFO - __main__ - Step 103913: {'lr': 0.00011039345610541317, 'samples': 19951296, 'steps': 103912, 'loss/train': 0.29783615469932556} 11/07/2021 11:48:41 - INFO - __main__ - Step 103914: {'lr': 0.00011038905390469, 'samples': 19951488, 'steps': 103913, 'loss/train': 1.1670342683792114} 11/07/2021 11:48:41 - INFO - __main__ - Step 103915: {'lr': 0.00011038465176687337, 'samples': 19951680, 'steps': 103914, 'loss/train': 1.1689666509628296} 11/07/2021 11:48:42 - INFO - __main__ - Step 103916: {'lr': 0.00011038024969196528, 'samples': 19951872, 'steps': 103915, 'loss/train': 1.2667491436004639} 11/07/2021 11:48:42 - INFO - __main__ - Step 103917: {'lr': 0.00011037584767996767, 'samples': 19952064, 'steps': 103916, 'loss/train': 1.4659700393676758} 11/07/2021 11:48:43 - INFO - __main__ - Step 103918: {'lr': 0.00011037144573088253, 'samples': 19952256, 'steps': 103917, 'loss/train': 1.2464721202850342} 11/07/2021 11:48:43 - INFO - __main__ - Step 103919: {'lr': 0.00011036704384471189, 'samples': 19952448, 'steps': 103918, 'loss/train': 0.9725680947303772} 11/07/2021 11:48:43 - INFO - __main__ - Step 103920: {'lr': 0.00011036264202145779, 'samples': 19952640, 'steps': 103919, 'loss/train': 1.4758316278457642} 11/07/2021 11:48:44 - INFO - __main__ - Step 103921: {'lr': 0.00011035824026112204, 'samples': 19952832, 'steps': 103920, 'loss/train': 1.3732175827026367} 11/07/2021 11:48:45 - INFO - __main__ - Step 103922: {'lr': 0.0001103538385637067, 'samples': 19953024, 'steps': 103921, 'loss/train': 1.3775641918182373} 11/07/2021 11:48:45 - INFO - __main__ - Step 103923: {'lr': 0.00011034943692921378, 'samples': 19953216, 'steps': 103922, 'loss/train': 1.4441722631454468} 11/07/2021 11:48:45 - INFO - __main__ - Step 103924: {'lr': 0.00011034503535764525, 'samples': 19953408, 'steps': 103923, 'loss/train': 1.180473804473877} 11/07/2021 11:48:46 - INFO - __main__ - Step 103925: {'lr': 0.00011034063384900309, 'samples': 19953600, 'steps': 103924, 'loss/train': 1.2199026346206665} 11/07/2021 11:48:46 - INFO - __main__ - Step 103926: {'lr': 0.00011033623240328928, 'samples': 19953792, 'steps': 103925, 'loss/train': 1.3243792057037354} 11/07/2021 11:48:47 - INFO - __main__ - Step 103927: {'lr': 0.00011033183102050581, 'samples': 19953984, 'steps': 103926, 'loss/train': 0.7184540033340454} 11/07/2021 11:48:48 - INFO - __main__ - Step 103928: {'lr': 0.00011032742970065466, 'samples': 19954176, 'steps': 103927, 'loss/train': 1.4413946866989136} 11/07/2021 11:48:48 - INFO - __main__ - Step 103929: {'lr': 0.00011032302844373781, 'samples': 19954368, 'steps': 103928, 'loss/train': 1.3074171543121338} 11/07/2021 11:48:48 - INFO - __main__ - Step 103930: {'lr': 0.00011031862724975724, 'samples': 19954560, 'steps': 103929, 'loss/train': 0.770973801612854} 11/07/2021 11:48:49 - INFO - __main__ - Step 103931: {'lr': 0.00011031422611871497, 'samples': 19954752, 'steps': 103930, 'loss/train': 1.5245229005813599} 11/07/2021 11:48:50 - INFO - __main__ - Step 103932: {'lr': 0.00011030982505061293, 'samples': 19954944, 'steps': 103931, 'loss/train': 1.1849013566970825} 11/07/2021 11:48:50 - INFO - __main__ - Step 103933: {'lr': 0.00011030542404545325, 'samples': 19955136, 'steps': 103932, 'loss/train': 1.4389927387237549} 11/07/2021 11:48:50 - INFO - __main__ - Step 103934: {'lr': 0.00011030102310323767, 'samples': 19955328, 'steps': 103933, 'loss/train': 1.5066173076629639} 11/07/2021 11:48:51 - INFO - __main__ - Step 103935: {'lr': 0.0001102966222239683, 'samples': 19955520, 'steps': 103934, 'loss/train': 1.3338687419891357} 11/07/2021 11:48:51 - INFO - __main__ - Step 103936: {'lr': 0.0001102922214076471, 'samples': 19955712, 'steps': 103935, 'loss/train': 1.913368821144104} 11/07/2021 11:48:52 - INFO - __main__ - Step 103937: {'lr': 0.00011028782065427608, 'samples': 19955904, 'steps': 103936, 'loss/train': 1.1990153789520264} 11/07/2021 11:48:53 - INFO - __main__ - Step 103938: {'lr': 0.00011028341996385724, 'samples': 19956096, 'steps': 103937, 'loss/train': 0.48697733879089355} 11/07/2021 11:48:53 - INFO - __main__ - Step 103939: {'lr': 0.0001102790193363925, 'samples': 19956288, 'steps': 103938, 'loss/train': 1.4015612602233887} 11/07/2021 11:48:53 - INFO - __main__ - Step 103940: {'lr': 0.00011027461877188388, 'samples': 19956480, 'steps': 103939, 'loss/train': 1.552123785018921} 11/07/2021 11:48:54 - INFO - __main__ - Step 103941: {'lr': 0.00011027021827033337, 'samples': 19956672, 'steps': 103940, 'loss/train': 1.2799793481826782} 11/07/2021 11:48:56 - INFO - __main__ - Step 103942: {'lr': 0.00011026581783174298, 'samples': 19956864, 'steps': 103941, 'loss/train': 1.1646884679794312} 11/07/2021 11:48:56 - INFO - __main__ - Step 103943: {'lr': 0.00011026141745611459, 'samples': 19957056, 'steps': 103942, 'loss/train': 1.6559498310089111} 11/07/2021 11:48:57 - INFO - __main__ - Step 103944: {'lr': 0.0001102570171434503, 'samples': 19957248, 'steps': 103943, 'loss/train': 1.1381891965866089} 11/07/2021 11:48:57 - INFO - __main__ - Step 103945: {'lr': 0.00011025261689375201, 'samples': 19957440, 'steps': 103944, 'loss/train': 0.9685118794441223} 11/07/2021 11:48:57 - INFO - __main__ - Step 103946: {'lr': 0.00011024821670702184, 'samples': 19957632, 'steps': 103945, 'loss/train': 1.807383418083191} 11/07/2021 11:48:58 - INFO - __main__ - Step 103947: {'lr': 0.00011024381658326158, 'samples': 19957824, 'steps': 103946, 'loss/train': 1.7758039236068726} 11/07/2021 11:48:58 - INFO - __main__ - Step 103948: {'lr': 0.00011023941652247329, 'samples': 19958016, 'steps': 103947, 'loss/train': 1.7864831686019897} 11/07/2021 11:48:58 - INFO - __main__ - Step 103949: {'lr': 0.00011023501652465895, 'samples': 19958208, 'steps': 103948, 'loss/train': 0.7325311899185181} 11/07/2021 11:48:59 - INFO - __main__ - Step 103950: {'lr': 0.00011023061658982059, 'samples': 19958400, 'steps': 103949, 'loss/train': 1.0219156742095947} 11/07/2021 11:49:00 - INFO - __main__ - Step 103951: {'lr': 0.00011022621671796013, 'samples': 19958592, 'steps': 103950, 'loss/train': 0.1728283315896988} 11/07/2021 11:49:00 - INFO - __main__ - Step 103952: {'lr': 0.0001102218169090796, 'samples': 19958784, 'steps': 103951, 'loss/train': 1.4761135578155518} 11/07/2021 11:49:00 - INFO - __main__ - Step 103953: {'lr': 0.00011021741716318093, 'samples': 19958976, 'steps': 103952, 'loss/train': 1.4499515295028687} 11/07/2021 11:49:01 - INFO - __main__ - Step 103954: {'lr': 0.00011021301748026616, 'samples': 19959168, 'steps': 103953, 'loss/train': 1.5901745557785034} 11/07/2021 11:49:02 - INFO - __main__ - Step 103955: {'lr': 0.00011020861786033723, 'samples': 19959360, 'steps': 103954, 'loss/train': 1.0347315073013306} 11/07/2021 11:49:02 - INFO - __main__ - Step 103956: {'lr': 0.00011020421830339617, 'samples': 19959552, 'steps': 103955, 'loss/train': 1.2114272117614746} 11/07/2021 11:49:02 - INFO - __main__ - Step 103957: {'lr': 0.00011019981880944491, 'samples': 19959744, 'steps': 103956, 'loss/train': 1.1727778911590576} 11/07/2021 11:49:03 - INFO - __main__ - Step 103958: {'lr': 0.00011019541937848546, 'samples': 19959936, 'steps': 103957, 'loss/train': 2.6294491291046143} 11/07/2021 11:49:03 - INFO - __main__ - Step 103959: {'lr': 0.00011019102001051979, 'samples': 19960128, 'steps': 103958, 'loss/train': 1.2780323028564453} 11/07/2021 11:49:04 - INFO - __main__ - Step 103960: {'lr': 0.00011018662070554999, 'samples': 19960320, 'steps': 103959, 'loss/train': 1.4908136129379272} 11/07/2021 11:49:05 - INFO - __main__ - Step 103961: {'lr': 0.00011018222146357785, 'samples': 19960512, 'steps': 103960, 'loss/train': 1.289031982421875} 11/07/2021 11:49:05 - INFO - __main__ - Step 103962: {'lr': 0.00011017782228460544, 'samples': 19960704, 'steps': 103961, 'loss/train': 1.8795804977416992} 11/07/2021 11:49:05 - INFO - __main__ - Step 103963: {'lr': 0.00011017342316863474, 'samples': 19960896, 'steps': 103962, 'loss/train': 1.1500341892242432} 11/07/2021 11:49:06 - INFO - __main__ - Step 103964: {'lr': 0.00011016902411566774, 'samples': 19961088, 'steps': 103963, 'loss/train': 1.1782867908477783} 11/07/2021 11:49:07 - INFO - __main__ - Step 103965: {'lr': 0.0001101646251257064, 'samples': 19961280, 'steps': 103964, 'loss/train': 1.6129286289215088} 11/07/2021 11:49:07 - INFO - __main__ - Step 103966: {'lr': 0.00011016022619875276, 'samples': 19961472, 'steps': 103965, 'loss/train': 1.3444950580596924} 11/07/2021 11:49:07 - INFO - __main__ - Step 103967: {'lr': 0.00011015582733480875, 'samples': 19961664, 'steps': 103966, 'loss/train': 1.2989766597747803} 11/07/2021 11:49:08 - INFO - __main__ - Step 103968: {'lr': 0.00011015142853387636, 'samples': 19961856, 'steps': 103967, 'loss/train': 1.2952637672424316} 11/07/2021 11:49:08 - INFO - __main__ - Step 103969: {'lr': 0.00011014702979595759, 'samples': 19962048, 'steps': 103968, 'loss/train': 1.301087737083435} 11/07/2021 11:49:08 - INFO - __main__ - Step 103970: {'lr': 0.00011014263112105441, 'samples': 19962240, 'steps': 103969, 'loss/train': 1.4563137292861938} 11/07/2021 11:49:10 - INFO - __main__ - Step 103971: {'lr': 0.0001101382325091688, 'samples': 19962432, 'steps': 103970, 'loss/train': 1.6084028482437134} 11/07/2021 11:49:10 - INFO - __main__ - Step 103972: {'lr': 0.00011013383396030271, 'samples': 19962624, 'steps': 103971, 'loss/train': 0.8607473373413086} 11/07/2021 11:49:11 - INFO - __main__ - Step 103973: {'lr': 0.00011012943547445828, 'samples': 19962816, 'steps': 103972, 'loss/train': 1.552879810333252} 11/07/2021 11:49:11 - INFO - __main__ - Step 103974: {'lr': 0.00011012503705163729, 'samples': 19963008, 'steps': 103973, 'loss/train': 1.3010551929473877} 11/07/2021 11:49:11 - INFO - __main__ - Step 103975: {'lr': 0.00011012063869184177, 'samples': 19963200, 'steps': 103974, 'loss/train': 1.0514533519744873} 11/07/2021 11:49:12 - INFO - __main__ - Step 103976: {'lr': 0.00011011624039507376, 'samples': 19963392, 'steps': 103975, 'loss/train': 1.1363252401351929} 11/07/2021 11:49:13 - INFO - __main__ - Step 103977: {'lr': 0.00011011184216133518, 'samples': 19963584, 'steps': 103976, 'loss/train': 0.49220186471939087} 11/07/2021 11:49:13 - INFO - __main__ - Step 103978: {'lr': 0.00011010744399062808, 'samples': 19963776, 'steps': 103977, 'loss/train': 1.4255353212356567} 11/07/2021 11:49:13 - INFO - __main__ - Step 103979: {'lr': 0.00011010304588295439, 'samples': 19963968, 'steps': 103978, 'loss/train': 1.4839015007019043} 11/07/2021 11:49:14 - INFO - __main__ - Step 103980: {'lr': 0.0001100986478383161, 'samples': 19964160, 'steps': 103979, 'loss/train': 1.486770749092102} 11/07/2021 11:49:16 - INFO - __main__ - Step 103981: {'lr': 0.00011009424985671521, 'samples': 19964352, 'steps': 103980, 'loss/train': 1.307554841041565} 11/07/2021 11:49:16 - INFO - __main__ - Step 103982: {'lr': 0.00011008985193815371, 'samples': 19964544, 'steps': 103981, 'loss/train': 1.3632370233535767} 11/07/2021 11:49:16 - INFO - __main__ - Step 103983: {'lr': 0.00011008545408263354, 'samples': 19964736, 'steps': 103982, 'loss/train': 1.5257076025009155} 11/07/2021 11:49:17 - INFO - __main__ - Step 103984: {'lr': 0.00011008105629015672, 'samples': 19964928, 'steps': 103983, 'loss/train': 1.6855570077896118} 11/07/2021 11:49:17 - INFO - __main__ - Step 103985: {'lr': 0.00011007665856072521, 'samples': 19965120, 'steps': 103984, 'loss/train': 1.7849643230438232} 11/07/2021 11:49:17 - INFO - __main__ - Step 103986: {'lr': 0.000110072260894341, 'samples': 19965312, 'steps': 103985, 'loss/train': 1.7594873905181885} 11/07/2021 11:49:18 - INFO - __main__ - Step 103987: {'lr': 0.00011006786329100615, 'samples': 19965504, 'steps': 103986, 'loss/train': 1.2588051557540894} 11/07/2021 11:49:19 - INFO - __main__ - Step 103988: {'lr': 0.00011006346575072249, 'samples': 19965696, 'steps': 103987, 'loss/train': 1.8301599025726318} 11/07/2021 11:49:19 - INFO - __main__ - Step 103989: {'lr': 0.00011005906827349204, 'samples': 19965888, 'steps': 103988, 'loss/train': 1.5129715204238892} 11/07/2021 11:49:19 - INFO - __main__ - Step 103990: {'lr': 0.00011005467085931683, 'samples': 19966080, 'steps': 103989, 'loss/train': 0.6917821764945984} 11/07/2021 11:49:20 - INFO - __main__ - Step 103991: {'lr': 0.00011005027350819886, 'samples': 19966272, 'steps': 103990, 'loss/train': 1.4422520399093628} 11/07/2021 11:49:20 - INFO - __main__ - Step 103992: {'lr': 0.00011004587622014003, 'samples': 19966464, 'steps': 103991, 'loss/train': 1.289534091949463} 11/07/2021 11:49:21 - INFO - __main__ - Step 103993: {'lr': 0.00011004147899514239, 'samples': 19966656, 'steps': 103992, 'loss/train': 1.6603928804397583} 11/07/2021 11:49:21 - INFO - __main__ - Step 103994: {'lr': 0.0001100370818332079, 'samples': 19966848, 'steps': 103993, 'loss/train': 1.0218983888626099} 11/07/2021 11:49:22 - INFO - __main__ - Step 103995: {'lr': 0.00011003268473433853, 'samples': 19967040, 'steps': 103994, 'loss/train': 1.7138926982879639} 11/07/2021 11:49:22 - INFO - __main__ - Step 103996: {'lr': 0.00011002828769853628, 'samples': 19967232, 'steps': 103995, 'loss/train': 1.283647060394287} 11/07/2021 11:49:22 - INFO - __main__ - Step 103997: {'lr': 0.00011002389072580313, 'samples': 19967424, 'steps': 103996, 'loss/train': 0.9840849041938782} 11/07/2021 11:49:24 - INFO - __main__ - Step 103998: {'lr': 0.00011001949381614115, 'samples': 19967616, 'steps': 103997, 'loss/train': 1.4713859558105469} 11/07/2021 11:49:24 - INFO - __main__ - Step 103999: {'lr': 0.00011001509696955211, 'samples': 19967808, 'steps': 103998, 'loss/train': 1.5012617111206055} 11/07/2021 11:49:24 - INFO - __main__ - Step 104000: {'lr': 0.00011001070018603815, 'samples': 19968000, 'steps': 103999, 'loss/train': 1.4423047304153442} 11/07/2021 11:49:25 - INFO - __main__ - Step 104001: {'lr': 0.00011000630346560118, 'samples': 19968192, 'steps': 104000, 'loss/train': 1.1032440662384033} 11/07/2021 11:49:25 - INFO - __main__ - Step 104002: {'lr': 0.0001100019068082432, 'samples': 19968384, 'steps': 104001, 'loss/train': 1.2874641418457031} 11/07/2021 11:49:26 - INFO - __main__ - Step 104003: {'lr': 0.00010999751021396621, 'samples': 19968576, 'steps': 104002, 'loss/train': 1.0378963947296143} 11/07/2021 11:49:26 - INFO - __main__ - Step 104004: {'lr': 0.00010999311368277218, 'samples': 19968768, 'steps': 104003, 'loss/train': 1.5462541580200195} 11/07/2021 11:49:27 - INFO - __main__ - Step 104005: {'lr': 0.00010998871721466311, 'samples': 19968960, 'steps': 104004, 'loss/train': 1.3279646635055542} 11/07/2021 11:49:27 - INFO - __main__ - Step 104006: {'lr': 0.00010998432080964093, 'samples': 19969152, 'steps': 104005, 'loss/train': 1.5993155241012573} 11/07/2021 11:49:27 - INFO - __main__ - Step 104007: {'lr': 0.00010997992446770769, 'samples': 19969344, 'steps': 104006, 'loss/train': 1.0990047454833984} 11/07/2021 11:49:28 - INFO - __main__ - Step 104008: {'lr': 0.0001099755281888653, 'samples': 19969536, 'steps': 104007, 'loss/train': 1.1149640083312988} 11/07/2021 11:49:29 - INFO - __main__ - Step 104009: {'lr': 0.0001099711319731159, 'samples': 19969728, 'steps': 104008, 'loss/train': 1.7463573217391968} 11/07/2021 11:49:29 - INFO - __main__ - Step 104010: {'lr': 0.00010996673582046124, 'samples': 19969920, 'steps': 104009, 'loss/train': 0.4511600732803345} 11/07/2021 11:49:30 - INFO - __main__ - Step 104011: {'lr': 0.00010996233973090342, 'samples': 19970112, 'steps': 104010, 'loss/train': 0.9459747076034546} 11/07/2021 11:49:30 - INFO - __main__ - Step 104012: {'lr': 0.0001099579437044444, 'samples': 19970304, 'steps': 104011, 'loss/train': 1.2441935539245605} 11/07/2021 11:49:30 - INFO - __main__ - Step 104013: {'lr': 0.00010995354774108615, 'samples': 19970496, 'steps': 104012, 'loss/train': 1.6098898649215698} 11/07/2021 11:49:31 - INFO - __main__ - Step 104014: {'lr': 0.00010994915184083071, 'samples': 19970688, 'steps': 104013, 'loss/train': 1.6942055225372314} 11/07/2021 11:49:32 - INFO - __main__ - Step 104015: {'lr': 0.00010994475600367998, 'samples': 19970880, 'steps': 104014, 'loss/train': 1.0025547742843628} 11/07/2021 11:49:32 - INFO - __main__ - Step 104016: {'lr': 0.00010994036022963602, 'samples': 19971072, 'steps': 104015, 'loss/train': 1.5515185594558716} 11/07/2021 11:49:32 - INFO - __main__ - Step 104017: {'lr': 0.00010993596451870074, 'samples': 19971264, 'steps': 104016, 'loss/train': 1.0768991708755493} 11/07/2021 11:49:33 - INFO - __main__ - Step 104018: {'lr': 0.00010993156887087619, 'samples': 19971456, 'steps': 104017, 'loss/train': 1.3496489524841309} 11/07/2021 11:49:34 - INFO - __main__ - Step 104019: {'lr': 0.00010992717328616427, 'samples': 19971648, 'steps': 104018, 'loss/train': 1.7557848691940308} 11/07/2021 11:49:34 - INFO - __main__ - Step 104020: {'lr': 0.00010992277776456713, 'samples': 19971840, 'steps': 104019, 'loss/train': 1.0311187505722046} 11/07/2021 11:49:34 - INFO - __main__ - Step 104021: {'lr': 0.0001099183823060865, 'samples': 19972032, 'steps': 104020, 'loss/train': 1.4238556623458862} 11/07/2021 11:49:35 - INFO - __main__ - Step 104022: {'lr': 0.00010991398691072452, 'samples': 19972224, 'steps': 104021, 'loss/train': 1.039174199104309} 11/07/2021 11:49:35 - INFO - __main__ - Step 104023: {'lr': 0.00010990959157848312, 'samples': 19972416, 'steps': 104022, 'loss/train': 1.4485337734222412} 11/07/2021 11:49:36 - INFO - __main__ - Step 104024: {'lr': 0.0001099051963093643, 'samples': 19972608, 'steps': 104023, 'loss/train': 1.2526535987854004} 11/07/2021 11:49:37 - INFO - __main__ - Step 104025: {'lr': 0.00010990080110337003, 'samples': 19972800, 'steps': 104024, 'loss/train': 0.7102669477462769} 11/07/2021 11:49:37 - INFO - __main__ - Step 104026: {'lr': 0.0001098964059605023, 'samples': 19972992, 'steps': 104025, 'loss/train': 1.1083474159240723} 11/07/2021 11:49:37 - INFO - __main__ - Step 104027: {'lr': 0.00010989201088076308, 'samples': 19973184, 'steps': 104026, 'loss/train': 1.305702567100525} 11/07/2021 11:49:38 - INFO - __main__ - Step 104028: {'lr': 0.00010988761586415438, 'samples': 19973376, 'steps': 104027, 'loss/train': 5.718654632568359} 11/07/2021 11:49:38 - INFO - __main__ - Step 104029: {'lr': 0.00010988322091067816, 'samples': 19973568, 'steps': 104028, 'loss/train': 1.1691783666610718} 11/07/2021 11:49:39 - INFO - __main__ - Step 104030: {'lr': 0.00010987882602033635, 'samples': 19973760, 'steps': 104029, 'loss/train': 1.6384570598602295} 11/07/2021 11:49:40 - INFO - __main__ - Step 104031: {'lr': 0.00010987443119313111, 'samples': 19973952, 'steps': 104030, 'loss/train': 1.4869298934936523} 11/07/2021 11:49:40 - INFO - __main__ - Step 104032: {'lr': 0.00010987003642906421, 'samples': 19974144, 'steps': 104031, 'loss/train': 1.5071070194244385} 11/07/2021 11:49:40 - INFO - __main__ - Step 104033: {'lr': 0.00010986564172813768, 'samples': 19974336, 'steps': 104032, 'loss/train': 1.4546509981155396} 11/07/2021 11:49:41 - INFO - __main__ - Step 104034: {'lr': 0.00010986124709035356, 'samples': 19974528, 'steps': 104033, 'loss/train': 1.5828566551208496} 11/07/2021 11:49:42 - INFO - __main__ - Step 104035: {'lr': 0.00010985685251571376, 'samples': 19974720, 'steps': 104034, 'loss/train': 1.386606216430664} 11/07/2021 11:49:42 - INFO - __main__ - Step 104036: {'lr': 0.00010985245800422033, 'samples': 19974912, 'steps': 104035, 'loss/train': 1.403132438659668} 11/07/2021 11:49:42 - INFO - __main__ - Step 104037: {'lr': 0.0001098480635558752, 'samples': 19975104, 'steps': 104036, 'loss/train': 1.362494945526123} 11/07/2021 11:49:43 - INFO - __main__ - Step 104038: {'lr': 0.0001098436691706804, 'samples': 19975296, 'steps': 104037, 'loss/train': 1.2406632900238037} 11/07/2021 11:49:43 - INFO - __main__ - Step 104039: {'lr': 0.00010983927484863784, 'samples': 19975488, 'steps': 104038, 'loss/train': 1.476192593574524} 11/07/2021 11:49:44 - INFO - __main__ - Step 104040: {'lr': 0.00010983488058974955, 'samples': 19975680, 'steps': 104039, 'loss/train': 1.4532225131988525} 11/07/2021 11:49:44 - INFO - __main__ - Step 104041: {'lr': 0.00010983048639401752, 'samples': 19975872, 'steps': 104040, 'loss/train': 1.063288927078247} 11/07/2021 11:49:45 - INFO - __main__ - Step 104042: {'lr': 0.0001098260922614438, 'samples': 19976064, 'steps': 104041, 'loss/train': 1.6520652770996094} 11/07/2021 11:49:45 - INFO - __main__ - Step 104043: {'lr': 0.00010982169819203017, 'samples': 19976256, 'steps': 104042, 'loss/train': 1.398737907409668} 11/07/2021 11:49:46 - INFO - __main__ - Step 104044: {'lr': 0.00010981730418577873, 'samples': 19976448, 'steps': 104043, 'loss/train': 1.0980838537216187} 11/07/2021 11:49:47 - INFO - __main__ - Step 104045: {'lr': 0.00010981291024269144, 'samples': 19976640, 'steps': 104044, 'loss/train': 0.918048620223999} 11/07/2021 11:49:47 - INFO - __main__ - Step 104046: {'lr': 0.00010980851636277031, 'samples': 19976832, 'steps': 104045, 'loss/train': 0.5622278451919556} 11/07/2021 11:49:47 - INFO - __main__ - Step 104047: {'lr': 0.00010980412254601729, 'samples': 19977024, 'steps': 104046, 'loss/train': 1.3863811492919922} 11/07/2021 11:49:48 - INFO - __main__ - Step 104048: {'lr': 0.00010979972879243436, 'samples': 19977216, 'steps': 104047, 'loss/train': 1.4961199760437012} 11/07/2021 11:49:48 - INFO - __main__ - Step 104049: {'lr': 0.0001097953351020235, 'samples': 19977408, 'steps': 104048, 'loss/train': 1.3803975582122803} 11/07/2021 11:49:48 - INFO - __main__ - Step 104050: {'lr': 0.00010979094147478671, 'samples': 19977600, 'steps': 104049, 'loss/train': 1.4306349754333496} 11/07/2021 11:49:50 - INFO - __main__ - Step 104051: {'lr': 0.00010978654791072598, 'samples': 19977792, 'steps': 104050, 'loss/train': 1.3256738185882568} 11/07/2021 11:49:50 - INFO - __main__ - Step 104052: {'lr': 0.00010978215440984324, 'samples': 19977984, 'steps': 104051, 'loss/train': 1.452977180480957} 11/07/2021 11:49:50 - INFO - __main__ - Step 104053: {'lr': 0.00010977776097214051, 'samples': 19978176, 'steps': 104052, 'loss/train': 1.1177600622177124} 11/07/2021 11:49:51 - INFO - __main__ - Step 104054: {'lr': 0.00010977336759761986, 'samples': 19978368, 'steps': 104053, 'loss/train': 1.317831039428711} 11/07/2021 11:49:51 - INFO - __main__ - Step 104055: {'lr': 0.00010976897428628305, 'samples': 19978560, 'steps': 104054, 'loss/train': 1.0590976476669312} 11/07/2021 11:49:52 - INFO - __main__ - Step 104056: {'lr': 0.00010976458103813219, 'samples': 19978752, 'steps': 104055, 'loss/train': 0.9518942832946777} 11/07/2021 11:49:52 - INFO - __main__ - Step 104057: {'lr': 0.00010976018785316924, 'samples': 19978944, 'steps': 104056, 'loss/train': 1.4541805982589722} 11/07/2021 11:49:53 - INFO - __main__ - Step 104058: {'lr': 0.00010975579473139618, 'samples': 19979136, 'steps': 104057, 'loss/train': 1.0007691383361816} 11/07/2021 11:49:53 - INFO - __main__ - Step 104059: {'lr': 0.000109751401672815, 'samples': 19979328, 'steps': 104058, 'loss/train': 1.2407829761505127} 11/07/2021 11:49:53 - INFO - __main__ - Step 104060: {'lr': 0.00010974700867742768, 'samples': 19979520, 'steps': 104059, 'loss/train': 1.1936328411102295} 11/07/2021 11:49:54 - INFO - __main__ - Step 104061: {'lr': 0.00010974261574523619, 'samples': 19979712, 'steps': 104060, 'loss/train': 1.6046035289764404} 11/07/2021 11:49:55 - INFO - __main__ - Step 104062: {'lr': 0.00010973822287624253, 'samples': 19979904, 'steps': 104061, 'loss/train': 1.4659372568130493} 11/07/2021 11:49:55 - INFO - __main__ - Step 104063: {'lr': 0.00010973383007044863, 'samples': 19980096, 'steps': 104062, 'loss/train': 1.2990601062774658} 11/07/2021 11:49:55 - INFO - __main__ - Step 104064: {'lr': 0.00010972943732785654, 'samples': 19980288, 'steps': 104063, 'loss/train': 1.745099663734436} 11/07/2021 11:49:56 - INFO - __main__ - Step 104065: {'lr': 0.00010972504464846816, 'samples': 19980480, 'steps': 104064, 'loss/train': 1.406613826751709} 11/07/2021 11:49:57 - INFO - __main__ - Step 104066: {'lr': 0.00010972065203228555, 'samples': 19980672, 'steps': 104065, 'loss/train': 1.3353251218795776} 11/07/2021 11:49:57 - INFO - __main__ - Step 104067: {'lr': 0.00010971625947931068, 'samples': 19980864, 'steps': 104066, 'loss/train': 1.296183705329895} 11/07/2021 11:49:58 - INFO - __main__ - Step 104068: {'lr': 0.00010971186698954547, 'samples': 19981056, 'steps': 104067, 'loss/train': 1.1284408569335938} 11/07/2021 11:49:58 - INFO - __main__ - Step 104069: {'lr': 0.0001097074745629919, 'samples': 19981248, 'steps': 104068, 'loss/train': 1.7664519548416138} 11/07/2021 11:49:58 - INFO - __main__ - Step 104070: {'lr': 0.000109703082199652, 'samples': 19981440, 'steps': 104069, 'loss/train': 1.4670312404632568} 11/07/2021 11:49:59 - INFO - __main__ - Step 104071: {'lr': 0.00010969868989952769, 'samples': 19981632, 'steps': 104070, 'loss/train': 1.1833122968673706} 11/07/2021 11:50:00 - INFO - __main__ - Step 104072: {'lr': 0.00010969429766262102, 'samples': 19981824, 'steps': 104071, 'loss/train': 0.6875371336936951} 11/07/2021 11:50:00 - INFO - __main__ - Step 104073: {'lr': 0.0001096899054889339, 'samples': 19982016, 'steps': 104072, 'loss/train': 1.7656772136688232} 11/07/2021 11:50:00 - INFO - __main__ - Step 104074: {'lr': 0.00010968551337846838, 'samples': 19982208, 'steps': 104073, 'loss/train': 1.6924160718917847} 11/07/2021 11:50:01 - INFO - __main__ - Step 104075: {'lr': 0.00010968112133122638, 'samples': 19982400, 'steps': 104074, 'loss/train': 1.6004464626312256} 11/07/2021 11:50:01 - INFO - __main__ - Step 104076: {'lr': 0.0001096767293472099, 'samples': 19982592, 'steps': 104075, 'loss/train': 1.3258482217788696} 11/07/2021 11:50:02 - INFO - __main__ - Step 104077: {'lr': 0.00010967233742642094, 'samples': 19982784, 'steps': 104076, 'loss/train': 1.7048261165618896} 11/07/2021 11:50:03 - INFO - __main__ - Step 104078: {'lr': 0.00010966794556886142, 'samples': 19982976, 'steps': 104077, 'loss/train': 1.508461594581604} 11/07/2021 11:50:03 - INFO - __main__ - Step 104079: {'lr': 0.00010966355377453341, 'samples': 19983168, 'steps': 104078, 'loss/train': 1.604310154914856} 11/07/2021 11:50:03 - INFO - __main__ - Step 104080: {'lr': 0.00010965916204343878, 'samples': 19983360, 'steps': 104079, 'loss/train': 1.10773766040802} 11/07/2021 11:50:04 - INFO - __main__ - Step 104081: {'lr': 0.00010965477037557973, 'samples': 19983552, 'steps': 104080, 'loss/train': 1.2140675783157349} 11/07/2021 11:50:05 - INFO - __main__ - Step 104082: {'lr': 0.00010965037877095793, 'samples': 19983744, 'steps': 104081, 'loss/train': 1.2046843767166138} 11/07/2021 11:50:05 - INFO - __main__ - Step 104083: {'lr': 0.0001096459872295755, 'samples': 19983936, 'steps': 104082, 'loss/train': 1.2150593996047974} 11/07/2021 11:50:05 - INFO - __main__ - Step 104084: {'lr': 0.00010964159575143445, 'samples': 19984128, 'steps': 104083, 'loss/train': 2.5144989490509033} 11/07/2021 11:50:06 - INFO - __main__ - Step 104085: {'lr': 0.00010963720433653671, 'samples': 19984320, 'steps': 104084, 'loss/train': 1.4770385026931763} 11/07/2021 11:50:06 - INFO - __main__ - Step 104086: {'lr': 0.00010963281298488428, 'samples': 19984512, 'steps': 104085, 'loss/train': 1.278486728668213} 11/07/2021 11:50:07 - INFO - __main__ - Step 104087: {'lr': 0.00010962842169647916, 'samples': 19984704, 'steps': 104086, 'loss/train': 0.8803451657295227} 11/07/2021 11:50:07 - INFO - __main__ - Step 104088: {'lr': 0.0001096240304713233, 'samples': 19984896, 'steps': 104087, 'loss/train': 1.1787244081497192} 11/07/2021 11:50:08 - INFO - __main__ - Step 104089: {'lr': 0.00010961963930941867, 'samples': 19985088, 'steps': 104088, 'loss/train': 1.3199658393859863} 11/07/2021 11:50:08 - INFO - __main__ - Step 104090: {'lr': 0.00010961524821076726, 'samples': 19985280, 'steps': 104089, 'loss/train': 0.7812177538871765} 11/07/2021 11:50:08 - INFO - __main__ - Step 104091: {'lr': 0.00010961085717537109, 'samples': 19985472, 'steps': 104090, 'loss/train': 1.6062918901443481} 11/07/2021 11:50:10 - INFO - __main__ - Step 104092: {'lr': 0.00010960646620323209, 'samples': 19985664, 'steps': 104091, 'loss/train': 0.9636266827583313} 11/07/2021 11:50:10 - INFO - __main__ - Step 104093: {'lr': 0.00010960207529435223, 'samples': 19985856, 'steps': 104092, 'loss/train': 1.1440621614456177} 11/07/2021 11:50:10 - INFO - __main__ - Step 104094: {'lr': 0.00010959768444873361, 'samples': 19986048, 'steps': 104093, 'loss/train': 1.5009279251098633} 11/07/2021 11:50:11 - INFO - __main__ - Step 104095: {'lr': 0.00010959329366637802, 'samples': 19986240, 'steps': 104094, 'loss/train': 1.7251732349395752} 11/07/2021 11:50:11 - INFO - __main__ - Step 104096: {'lr': 0.00010958890294728752, 'samples': 19986432, 'steps': 104095, 'loss/train': 1.4761427640914917} 11/07/2021 11:50:12 - INFO - __main__ - Step 104097: {'lr': 0.00010958451229146408, 'samples': 19986624, 'steps': 104096, 'loss/train': 1.1387977600097656} 11/07/2021 11:50:12 - INFO - __main__ - Step 104098: {'lr': 0.00010958012169890972, 'samples': 19986816, 'steps': 104097, 'loss/train': 1.2417019605636597} 11/07/2021 11:50:13 - INFO - __main__ - Step 104099: {'lr': 0.00010957573116962636, 'samples': 19987008, 'steps': 104098, 'loss/train': 1.2372419834136963} 11/07/2021 11:50:13 - INFO - __main__ - Step 104100: {'lr': 0.00010957134070361602, 'samples': 19987200, 'steps': 104099, 'loss/train': 1.319576621055603} 11/07/2021 11:50:13 - INFO - __main__ - Step 104101: {'lr': 0.00010956695030088069, 'samples': 19987392, 'steps': 104100, 'loss/train': 1.4265856742858887} 11/07/2021 11:50:14 - INFO - __main__ - Step 104102: {'lr': 0.00010956255996142227, 'samples': 19987584, 'steps': 104101, 'loss/train': 1.0667662620544434} 11/07/2021 11:50:15 - INFO - __main__ - Step 104103: {'lr': 0.00010955816968524285, 'samples': 19987776, 'steps': 104102, 'loss/train': 0.7404369115829468} 11/07/2021 11:50:15 - INFO - __main__ - Step 104104: {'lr': 0.00010955377947234432, 'samples': 19987968, 'steps': 104103, 'loss/train': 1.1202958822250366} 11/07/2021 11:50:16 - INFO - __main__ - Step 104105: {'lr': 0.00010954938932272871, 'samples': 19988160, 'steps': 104104, 'loss/train': 1.4560136795043945} 11/07/2021 11:50:16 - INFO - __main__ - Step 104106: {'lr': 0.00010954499923639796, 'samples': 19988352, 'steps': 104105, 'loss/train': 1.3814395666122437} 11/07/2021 11:50:16 - INFO - __main__ - Step 104107: {'lr': 0.00010954060921335409, 'samples': 19988544, 'steps': 104106, 'loss/train': 1.026045560836792} 11/07/2021 11:50:17 - INFO - __main__ - Step 104108: {'lr': 0.00010953621925359914, 'samples': 19988736, 'steps': 104107, 'loss/train': 1.4994513988494873} 11/07/2021 11:50:18 - INFO - __main__ - Step 104109: {'lr': 0.00010953182935713488, 'samples': 19988928, 'steps': 104108, 'loss/train': 0.9846181869506836} 11/07/2021 11:50:18 - INFO - __main__ - Step 104110: {'lr': 0.00010952743952396343, 'samples': 19989120, 'steps': 104109, 'loss/train': 1.400067687034607} 11/07/2021 11:50:18 - INFO - __main__ - Step 104111: {'lr': 0.00010952304975408676, 'samples': 19989312, 'steps': 104110, 'loss/train': 1.5176138877868652} 11/07/2021 11:50:19 - INFO - __main__ - Step 104112: {'lr': 0.00010951866004750682, 'samples': 19989504, 'steps': 104111, 'loss/train': 1.0242868661880493} 11/07/2021 11:50:20 - INFO - __main__ - Step 104113: {'lr': 0.00010951427040422563, 'samples': 19989696, 'steps': 104112, 'loss/train': 1.6949868202209473} 11/07/2021 11:50:20 - INFO - __main__ - Step 104114: {'lr': 0.0001095098808242451, 'samples': 19989888, 'steps': 104113, 'loss/train': 1.177956223487854} 11/07/2021 11:50:20 - INFO - __main__ - Step 104115: {'lr': 0.00010950549130756726, 'samples': 19990080, 'steps': 104114, 'loss/train': 1.657029151916504} 11/07/2021 11:50:21 - INFO - __main__ - Step 104116: {'lr': 0.0001095011018541941, 'samples': 19990272, 'steps': 104115, 'loss/train': 1.5441941022872925} 11/07/2021 11:50:21 - INFO - __main__ - Step 104117: {'lr': 0.00010949671246412757, 'samples': 19990464, 'steps': 104116, 'loss/train': 1.3136502504348755} 11/07/2021 11:50:22 - INFO - __main__ - Step 104118: {'lr': 0.00010949232313736965, 'samples': 19990656, 'steps': 104117, 'loss/train': 1.5295560359954834} 11/07/2021 11:50:23 - INFO - __main__ - Step 104119: {'lr': 0.0001094879338739223, 'samples': 19990848, 'steps': 104118, 'loss/train': 1.4818817377090454} 11/07/2021 11:50:23 - INFO - __main__ - Step 104120: {'lr': 0.00010948354467378754, 'samples': 19991040, 'steps': 104119, 'loss/train': 0.6683568954467773} 11/07/2021 11:50:23 - INFO - __main__ - Step 104121: {'lr': 0.0001094791555369674, 'samples': 19991232, 'steps': 104120, 'loss/train': 0.2989262044429779} 11/07/2021 11:50:24 - INFO - __main__ - Step 104122: {'lr': 0.00010947476646346374, 'samples': 19991424, 'steps': 104121, 'loss/train': 1.342953085899353} 11/07/2021 11:50:24 - INFO - __main__ - Step 104123: {'lr': 0.00010947037745327853, 'samples': 19991616, 'steps': 104122, 'loss/train': 1.446966528892517} 11/07/2021 11:50:25 - INFO - __main__ - Step 104124: {'lr': 0.00010946598850641385, 'samples': 19991808, 'steps': 104123, 'loss/train': 1.4454156160354614} 11/07/2021 11:50:26 - INFO - __main__ - Step 104125: {'lr': 0.00010946159962287158, 'samples': 19992000, 'steps': 104124, 'loss/train': 1.7137383222579956} 11/07/2021 11:50:26 - INFO - __main__ - Step 104126: {'lr': 0.00010945721080265375, 'samples': 19992192, 'steps': 104125, 'loss/train': 1.5432953834533691} 11/07/2021 11:50:26 - INFO - __main__ - Step 104127: {'lr': 0.00010945282204576235, 'samples': 19992384, 'steps': 104126, 'loss/train': 1.2827292680740356} 11/07/2021 11:50:27 - INFO - __main__ - Step 104128: {'lr': 0.00010944843335219934, 'samples': 19992576, 'steps': 104127, 'loss/train': 1.11814284324646} 11/07/2021 11:50:28 - INFO - __main__ - Step 104129: {'lr': 0.00010944404472196667, 'samples': 19992768, 'steps': 104128, 'loss/train': 1.332349181175232} 11/07/2021 11:50:28 - INFO - __main__ - Step 104130: {'lr': 0.00010943965615506638, 'samples': 19992960, 'steps': 104129, 'loss/train': 1.5525728464126587} 11/07/2021 11:50:28 - INFO - __main__ - Step 104131: {'lr': 0.00010943526765150038, 'samples': 19993152, 'steps': 104130, 'loss/train': 1.605204463005066} 11/07/2021 11:50:29 - INFO - __main__ - Step 104132: {'lr': 0.00010943087921127071, 'samples': 19993344, 'steps': 104131, 'loss/train': 1.211431860923767} 11/07/2021 11:50:29 - INFO - __main__ - Step 104133: {'lr': 0.0001094264908343793, 'samples': 19993536, 'steps': 104132, 'loss/train': 1.6634521484375} 11/07/2021 11:50:30 - INFO - __main__ - Step 104134: {'lr': 0.00010942210252082815, 'samples': 19993728, 'steps': 104133, 'loss/train': 1.3841338157653809} 11/07/2021 11:50:30 - INFO - __main__ - Step 104135: {'lr': 0.00010941771427061931, 'samples': 19993920, 'steps': 104134, 'loss/train': 1.4625473022460938} 11/07/2021 11:50:31 - INFO - __main__ - Step 104136: {'lr': 0.0001094133260837546, 'samples': 19994112, 'steps': 104135, 'loss/train': 1.6979669332504272} 11/07/2021 11:50:31 - INFO - __main__ - Step 104137: {'lr': 0.00010940893796023607, 'samples': 19994304, 'steps': 104136, 'loss/train': 1.1647791862487793} 11/07/2021 11:50:32 - INFO - __main__ - Step 104138: {'lr': 0.00010940454990006571, 'samples': 19994496, 'steps': 104137, 'loss/train': 1.3541759252548218} 11/07/2021 11:50:33 - INFO - __main__ - Step 104139: {'lr': 0.00010940016190324548, 'samples': 19994688, 'steps': 104138, 'loss/train': 1.3378013372421265} 11/07/2021 11:50:33 - INFO - __main__ - Step 104140: {'lr': 0.00010939577396977738, 'samples': 19994880, 'steps': 104139, 'loss/train': 1.3547005653381348} 11/07/2021 11:50:33 - INFO - __main__ - Step 104141: {'lr': 0.00010939138609966337, 'samples': 19995072, 'steps': 104140, 'loss/train': 1.661720871925354} 11/07/2021 11:50:34 - INFO - __main__ - Step 104142: {'lr': 0.00010938699829290541, 'samples': 19995264, 'steps': 104141, 'loss/train': 1.2158019542694092} 11/07/2021 11:50:34 - INFO - __main__ - Step 104143: {'lr': 0.00010938261054950552, 'samples': 19995456, 'steps': 104142, 'loss/train': 1.193090558052063} 11/07/2021 11:50:34 - INFO - __main__ - Step 104144: {'lr': 0.00010937822286946566, 'samples': 19995648, 'steps': 104143, 'loss/train': 1.1906546354293823} 11/07/2021 11:50:35 - INFO - __main__ - Step 104145: {'lr': 0.00010937383525278779, 'samples': 19995840, 'steps': 104144, 'loss/train': 1.2904720306396484} 11/07/2021 11:50:36 - INFO - __main__ - Step 104146: {'lr': 0.0001093694476994739, 'samples': 19996032, 'steps': 104145, 'loss/train': 1.3957597017288208} 11/07/2021 11:50:36 - INFO - __main__ - Step 104147: {'lr': 0.000109365060209526, 'samples': 19996224, 'steps': 104146, 'loss/train': 1.1968377828598022} 11/07/2021 11:50:37 - INFO - __main__ - Step 104148: {'lr': 0.00010936067278294609, 'samples': 19996416, 'steps': 104147, 'loss/train': 1.555348515510559} 11/07/2021 11:50:37 - INFO - __main__ - Step 104149: {'lr': 0.000109356285419736, 'samples': 19996608, 'steps': 104148, 'loss/train': 0.5632756948471069} 11/07/2021 11:50:38 - INFO - __main__ - Step 104150: {'lr': 0.00010935189811989782, 'samples': 19996800, 'steps': 104149, 'loss/train': 1.3555327653884888} 11/07/2021 11:50:38 - INFO - __main__ - Step 104151: {'lr': 0.00010934751088343348, 'samples': 19996992, 'steps': 104150, 'loss/train': 1.2425802946090698} 11/07/2021 11:50:39 - INFO - __main__ - Step 104152: {'lr': 0.00010934312371034499, 'samples': 19997184, 'steps': 104151, 'loss/train': 0.3486744463443756} 11/07/2021 11:50:39 - INFO - __main__ - Step 104153: {'lr': 0.00010933873660063432, 'samples': 19997376, 'steps': 104152, 'loss/train': 1.4071922302246094} 11/07/2021 11:50:39 - INFO - __main__ - Step 104154: {'lr': 0.00010933434955430344, 'samples': 19997568, 'steps': 104153, 'loss/train': 1.0258845090866089} 11/07/2021 11:50:41 - INFO - __main__ - Step 104155: {'lr': 0.00010932996257135433, 'samples': 19997760, 'steps': 104154, 'loss/train': 1.2579991817474365} 11/07/2021 11:50:41 - INFO - __main__ - Step 104156: {'lr': 0.000109325575651789, 'samples': 19997952, 'steps': 104155, 'loss/train': 1.2719449996948242} 11/07/2021 11:50:41 - INFO - __main__ - Step 104157: {'lr': 0.00010932118879560935, 'samples': 19998144, 'steps': 104156, 'loss/train': 1.395355224609375} 11/07/2021 11:50:42 - INFO - __main__ - Step 104158: {'lr': 0.00010931680200281741, 'samples': 19998336, 'steps': 104157, 'loss/train': 1.3346034288406372} 11/07/2021 11:50:42 - INFO - __main__ - Step 104159: {'lr': 0.00010931241527341518, 'samples': 19998528, 'steps': 104158, 'loss/train': 1.3066699504852295} 11/07/2021 11:50:43 - INFO - __main__ - Step 104160: {'lr': 0.00010930802860740458, 'samples': 19998720, 'steps': 104159, 'loss/train': 1.5067569017410278} 11/07/2021 11:50:43 - INFO - __main__ - Step 104161: {'lr': 0.0001093036420047876, 'samples': 19998912, 'steps': 104160, 'loss/train': 1.2222448587417603} 11/07/2021 11:50:44 - INFO - __main__ - Step 104162: {'lr': 0.00010929925546556636, 'samples': 19999104, 'steps': 104161, 'loss/train': 1.2610201835632324} 11/07/2021 11:50:44 - INFO - __main__ - Step 104163: {'lr': 0.00010929486898974255, 'samples': 19999296, 'steps': 104162, 'loss/train': 1.0135433673858643} 11/07/2021 11:50:44 - INFO - __main__ - Step 104164: {'lr': 0.00010929048257731836, 'samples': 19999488, 'steps': 104163, 'loss/train': 1.5236717462539673} 11/07/2021 11:50:45 - INFO - __main__ - Step 104165: {'lr': 0.00010928609622829566, 'samples': 19999680, 'steps': 104164, 'loss/train': 1.389060378074646} 11/07/2021 11:50:46 - INFO - __main__ - Step 104166: {'lr': 0.0001092817099426765, 'samples': 19999872, 'steps': 104165, 'loss/train': 5.956432342529297} 11/07/2021 11:50:46 - INFO - __main__ - Step 104167: {'lr': 0.00010927732372046283, 'samples': 20000064, 'steps': 104166, 'loss/train': 0.7848170399665833} 11/07/2021 11:50:47 - INFO - __main__ - Step 104168: {'lr': 0.00010927293756165663, 'samples': 20000256, 'steps': 104167, 'loss/train': 1.1203908920288086} 11/07/2021 11:50:47 - INFO - __main__ - Step 104169: {'lr': 0.00010926855146625986, 'samples': 20000448, 'steps': 104168, 'loss/train': 1.3026748895645142} 11/07/2021 11:50:47 - INFO - __main__ - Step 104170: {'lr': 0.00010926416543427453, 'samples': 20000640, 'steps': 104169, 'loss/train': 1.8049389123916626} 11/07/2021 11:50:49 - INFO - __main__ - Step 104171: {'lr': 0.00010925977946570256, 'samples': 20000832, 'steps': 104170, 'loss/train': 1.4258702993392944} 11/07/2021 11:50:49 - INFO - __main__ - Step 104172: {'lr': 0.000109255393560546, 'samples': 20001024, 'steps': 104171, 'loss/train': 1.273722529411316} 11/07/2021 11:50:49 - INFO - __main__ - Step 104173: {'lr': 0.00010925100771880678, 'samples': 20001216, 'steps': 104172, 'loss/train': 1.3526246547698975} 11/07/2021 11:50:50 - INFO - __main__ - Step 104174: {'lr': 0.00010924662194048687, 'samples': 20001408, 'steps': 104173, 'loss/train': 1.1574785709381104} 11/07/2021 11:50:50 - INFO - __main__ - Step 104175: {'lr': 0.00010924223622558835, 'samples': 20001600, 'steps': 104174, 'loss/train': 1.307482123374939} 11/07/2021 11:50:50 - INFO - __main__ - Step 104176: {'lr': 0.00010923785057411304, 'samples': 20001792, 'steps': 104175, 'loss/train': 0.8649108409881592} 11/07/2021 11:50:51 - INFO - __main__ - Step 104177: {'lr': 0.00010923346498606296, 'samples': 20001984, 'steps': 104176, 'loss/train': 1.3926721811294556} 11/07/2021 11:50:52 - INFO - __main__ - Step 104178: {'lr': 0.0001092290794614401, 'samples': 20002176, 'steps': 104177, 'loss/train': 1.235058069229126} 11/07/2021 11:50:52 - INFO - __main__ - Step 104179: {'lr': 0.00010922469400024645, 'samples': 20002368, 'steps': 104178, 'loss/train': 1.1678071022033691} 11/07/2021 11:50:52 - INFO - __main__ - Step 104180: {'lr': 0.000109220308602484, 'samples': 20002560, 'steps': 104179, 'loss/train': 1.6907564401626587} 11/07/2021 11:50:53 - INFO - __main__ - Step 104181: {'lr': 0.00010921592326815468, 'samples': 20002752, 'steps': 104180, 'loss/train': 1.3068736791610718} 11/07/2021 11:50:54 - INFO - __main__ - Step 104182: {'lr': 0.00010921153799726052, 'samples': 20002944, 'steps': 104181, 'loss/train': 1.2782374620437622} 11/07/2021 11:50:54 - INFO - __main__ - Step 104183: {'lr': 0.00010920715278980345, 'samples': 20003136, 'steps': 104182, 'loss/train': 0.6687610149383545} 11/07/2021 11:50:54 - INFO - __main__ - Step 104184: {'lr': 0.00010920276764578545, 'samples': 20003328, 'steps': 104183, 'loss/train': 1.1940089464187622} 11/07/2021 11:50:55 - INFO - __main__ - Step 104185: {'lr': 0.00010919838256520856, 'samples': 20003520, 'steps': 104184, 'loss/train': 1.700769305229187} 11/07/2021 11:50:55 - INFO - __main__ - Step 104186: {'lr': 0.00010919399754807466, 'samples': 20003712, 'steps': 104185, 'loss/train': 1.3194068670272827} 11/07/2021 11:50:56 - INFO - __main__ - Step 104187: {'lr': 0.00010918961259438578, 'samples': 20003904, 'steps': 104186, 'loss/train': 1.1561964750289917} 11/07/2021 11:50:57 - INFO - __main__ - Step 104188: {'lr': 0.00010918522770414398, 'samples': 20004096, 'steps': 104187, 'loss/train': 1.186884880065918} 11/07/2021 11:50:57 - INFO - __main__ - Step 104189: {'lr': 0.00010918084287735108, 'samples': 20004288, 'steps': 104188, 'loss/train': 1.2461469173431396} 11/07/2021 11:50:57 - INFO - __main__ - Step 104190: {'lr': 0.00010917645811400909, 'samples': 20004480, 'steps': 104189, 'loss/train': 1.8054662942886353} 11/07/2021 11:50:58 - INFO - __main__ - Step 104191: {'lr': 0.00010917207341412003, 'samples': 20004672, 'steps': 104190, 'loss/train': 0.7257376909255981} 11/07/2021 11:50:58 - INFO - __main__ - Step 104192: {'lr': 0.00010916768877768585, 'samples': 20004864, 'steps': 104191, 'loss/train': 1.6638168096542358} 11/07/2021 11:50:59 - INFO - __main__ - Step 104193: {'lr': 0.00010916330420470854, 'samples': 20005056, 'steps': 104192, 'loss/train': 0.8417352437973022} 11/07/2021 11:50:59 - INFO - __main__ - Step 104194: {'lr': 0.00010915891969519007, 'samples': 20005248, 'steps': 104193, 'loss/train': 1.703566551208496} 11/07/2021 11:51:00 - INFO - __main__ - Step 104195: {'lr': 0.00010915453524913243, 'samples': 20005440, 'steps': 104194, 'loss/train': 1.339650273323059} 11/07/2021 11:51:00 - INFO - __main__ - Step 104196: {'lr': 0.00010915015086653756, 'samples': 20005632, 'steps': 104195, 'loss/train': 1.406711220741272} 11/07/2021 11:51:00 - INFO - __main__ - Step 104197: {'lr': 0.00010914576654740748, 'samples': 20005824, 'steps': 104196, 'loss/train': 1.2513401508331299} 11/07/2021 11:51:02 - INFO - __main__ - Step 104198: {'lr': 0.00010914138229174414, 'samples': 20006016, 'steps': 104197, 'loss/train': 1.2944691181182861} 11/07/2021 11:51:02 - INFO - __main__ - Step 104199: {'lr': 0.00010913699809954952, 'samples': 20006208, 'steps': 104198, 'loss/train': 1.3278870582580566} 11/07/2021 11:51:02 - INFO - __main__ - Step 104200: {'lr': 0.00010913261397082558, 'samples': 20006400, 'steps': 104199, 'loss/train': 1.5218652486801147} 11/07/2021 11:51:03 - INFO - __main__ - Step 104201: {'lr': 0.0001091282299055743, 'samples': 20006592, 'steps': 104200, 'loss/train': 1.4365577697753906} 11/07/2021 11:51:03 - INFO - __main__ - Step 104202: {'lr': 0.00010912384590379779, 'samples': 20006784, 'steps': 104201, 'loss/train': 1.3547018766403198} 11/07/2021 11:51:04 - INFO - __main__ - Step 104203: {'lr': 0.0001091194619654978, 'samples': 20006976, 'steps': 104202, 'loss/train': 1.7243088483810425} 11/07/2021 11:51:04 - INFO - __main__ - Step 104204: {'lr': 0.00010911507809067642, 'samples': 20007168, 'steps': 104203, 'loss/train': 1.3056211471557617} 11/07/2021 11:51:05 - INFO - __main__ - Step 104205: {'lr': 0.00010911069427933559, 'samples': 20007360, 'steps': 104204, 'loss/train': 1.5905263423919678} 11/07/2021 11:51:05 - INFO - __main__ - Step 104206: {'lr': 0.00010910631053147729, 'samples': 20007552, 'steps': 104205, 'loss/train': 1.4583981037139893} 11/07/2021 11:51:05 - INFO - __main__ - Step 104207: {'lr': 0.00010910192684710354, 'samples': 20007744, 'steps': 104206, 'loss/train': 1.5439295768737793} 11/07/2021 11:51:06 - INFO - __main__ - Step 104208: {'lr': 0.00010909754322621629, 'samples': 20007936, 'steps': 104207, 'loss/train': 1.5273866653442383} 11/07/2021 11:51:07 - INFO - __main__ - Step 104209: {'lr': 0.0001090931596688175, 'samples': 20008128, 'steps': 104208, 'loss/train': 0.7327815890312195} 11/07/2021 11:51:07 - INFO - __main__ - Step 104210: {'lr': 0.00010908877617490917, 'samples': 20008320, 'steps': 104209, 'loss/train': 1.5109357833862305} 11/07/2021 11:51:07 - INFO - __main__ - Step 104211: {'lr': 0.00010908439274449325, 'samples': 20008512, 'steps': 104210, 'loss/train': 1.2662289142608643} 11/07/2021 11:51:08 - INFO - __main__ - Step 104212: {'lr': 0.00010908000937757174, 'samples': 20008704, 'steps': 104211, 'loss/train': 2.0620357990264893} 11/07/2021 11:51:09 - INFO - __main__ - Step 104213: {'lr': 0.0001090756260741466, 'samples': 20008896, 'steps': 104212, 'loss/train': 1.2062901258468628} 11/07/2021 11:51:09 - INFO - __main__ - Step 104214: {'lr': 0.00010907124283421981, 'samples': 20009088, 'steps': 104213, 'loss/train': 1.452959656715393} 11/07/2021 11:51:10 - INFO - __main__ - Step 104215: {'lr': 0.00010906685965779343, 'samples': 20009280, 'steps': 104214, 'loss/train': 1.4771069288253784} 11/07/2021 11:51:10 - INFO - __main__ - Step 104216: {'lr': 0.00010906247654486926, 'samples': 20009472, 'steps': 104215, 'loss/train': 1.1904984712600708} 11/07/2021 11:51:10 - INFO - __main__ - Step 104217: {'lr': 0.00010905809349544935, 'samples': 20009664, 'steps': 104216, 'loss/train': 0.95797199010849} 11/07/2021 11:51:11 - INFO - __main__ - Step 104218: {'lr': 0.00010905371050953569, 'samples': 20009856, 'steps': 104217, 'loss/train': 1.1087826490402222} 11/07/2021 11:51:12 - INFO - __main__ - Step 104219: {'lr': 0.00010904932758713027, 'samples': 20010048, 'steps': 104218, 'loss/train': 1.2525445222854614} 11/07/2021 11:51:12 - INFO - __main__ - Step 104220: {'lr': 0.00010904494472823504, 'samples': 20010240, 'steps': 104219, 'loss/train': 1.731771469116211} 11/07/2021 11:51:12 - INFO - __main__ - Step 104221: {'lr': 0.000109040561932852, 'samples': 20010432, 'steps': 104220, 'loss/train': 1.0392470359802246} 11/07/2021 11:51:13 - INFO - __main__ - Step 104222: {'lr': 0.00010903617920098308, 'samples': 20010624, 'steps': 104221, 'loss/train': 1.4185415506362915} 11/07/2021 11:51:13 - INFO - __main__ - Step 104223: {'lr': 0.00010903179653263029, 'samples': 20010816, 'steps': 104222, 'loss/train': 1.1783984899520874} 11/07/2021 11:51:14 - INFO - __main__ - Step 104224: {'lr': 0.00010902741392779562, 'samples': 20011008, 'steps': 104223, 'loss/train': 1.235437273979187} 11/07/2021 11:51:15 - INFO - __main__ - Step 104225: {'lr': 0.00010902303138648098, 'samples': 20011200, 'steps': 104224, 'loss/train': 1.7157504558563232} 11/07/2021 11:51:15 - INFO - __main__ - Step 104226: {'lr': 0.00010901864890868843, 'samples': 20011392, 'steps': 104225, 'loss/train': 1.0675256252288818} 11/07/2021 11:51:15 - INFO - __main__ - Step 104227: {'lr': 0.00010901426649441987, 'samples': 20011584, 'steps': 104226, 'loss/train': 1.345070719718933} 11/07/2021 11:51:16 - INFO - __main__ - Step 104228: {'lr': 0.00010900988414367732, 'samples': 20011776, 'steps': 104227, 'loss/train': 1.8450373411178589} 11/07/2021 11:51:17 - INFO - __main__ - Step 104229: {'lr': 0.00010900550185646283, 'samples': 20011968, 'steps': 104228, 'loss/train': 1.1905072927474976} 11/07/2021 11:51:17 - INFO - __main__ - Step 104230: {'lr': 0.00010900111963277817, 'samples': 20012160, 'steps': 104229, 'loss/train': 1.3756922483444214} 11/07/2021 11:51:17 - INFO - __main__ - Step 104231: {'lr': 0.00010899673747262545, 'samples': 20012352, 'steps': 104230, 'loss/train': 1.292345643043518} 11/07/2021 11:51:18 - INFO - __main__ - Step 104232: {'lr': 0.00010899235537600663, 'samples': 20012544, 'steps': 104231, 'loss/train': 1.5321463346481323} 11/07/2021 11:51:18 - INFO - __main__ - Step 104233: {'lr': 0.00010898797334292368, 'samples': 20012736, 'steps': 104232, 'loss/train': 1.850608229637146} 11/07/2021 11:51:19 - INFO - __main__ - Step 104234: {'lr': 0.00010898359137337857, 'samples': 20012928, 'steps': 104233, 'loss/train': 1.273004174232483} 11/07/2021 11:51:19 - INFO - __main__ - Step 104235: {'lr': 0.00010897920946737327, 'samples': 20013120, 'steps': 104234, 'loss/train': 1.6646921634674072} 11/07/2021 11:51:20 - INFO - __main__ - Step 104236: {'lr': 0.00010897482762490978, 'samples': 20013312, 'steps': 104235, 'loss/train': 1.2004519701004028} 11/07/2021 11:51:20 - INFO - __main__ - Step 104237: {'lr': 0.00010897044584599003, 'samples': 20013504, 'steps': 104236, 'loss/train': 1.7261232137680054} 11/07/2021 11:51:20 - INFO - __main__ - Step 104238: {'lr': 0.00010896606413061605, 'samples': 20013696, 'steps': 104237, 'loss/train': 0.36385393142700195} 11/07/2021 11:51:22 - INFO - __main__ - Step 104239: {'lr': 0.00010896168247878977, 'samples': 20013888, 'steps': 104238, 'loss/train': 1.3568999767303467} 11/07/2021 11:51:22 - INFO - __main__ - Step 104240: {'lr': 0.00010895730089051317, 'samples': 20014080, 'steps': 104239, 'loss/train': 0.17428581416606903} 11/07/2021 11:51:22 - INFO - __main__ - Step 104241: {'lr': 0.00010895291936578825, 'samples': 20014272, 'steps': 104240, 'loss/train': 1.2157477140426636} 11/07/2021 11:51:23 - INFO - __main__ - Step 104242: {'lr': 0.00010894853790461706, 'samples': 20014464, 'steps': 104241, 'loss/train': 1.873067021369934} 11/07/2021 11:51:23 - INFO - __main__ - Step 104243: {'lr': 0.00010894415650700138, 'samples': 20014656, 'steps': 104242, 'loss/train': 1.4273643493652344} 11/07/2021 11:51:24 - INFO - __main__ - Step 104244: {'lr': 0.00010893977517294329, 'samples': 20014848, 'steps': 104243, 'loss/train': 1.1433268785476685} 11/07/2021 11:51:25 - INFO - __main__ - Step 104245: {'lr': 0.00010893539390244475, 'samples': 20015040, 'steps': 104244, 'loss/train': 1.1594388484954834} 11/07/2021 11:51:25 - INFO - __main__ - Step 104246: {'lr': 0.00010893101269550776, 'samples': 20015232, 'steps': 104245, 'loss/train': 1.291096806526184} 11/07/2021 11:51:25 - INFO - __main__ - Step 104247: {'lr': 0.00010892663155213429, 'samples': 20015424, 'steps': 104246, 'loss/train': 1.2633394002914429} 11/07/2021 11:51:26 - INFO - __main__ - Step 104248: {'lr': 0.0001089222504723263, 'samples': 20015616, 'steps': 104247, 'loss/train': 1.407812476158142} 11/07/2021 11:51:27 - INFO - __main__ - Step 104249: {'lr': 0.00010891786945608573, 'samples': 20015808, 'steps': 104248, 'loss/train': 1.2696559429168701} 11/07/2021 11:51:27 - INFO - __main__ - Step 104250: {'lr': 0.00010891348850341462, 'samples': 20016000, 'steps': 104249, 'loss/train': 1.410223126411438} 11/07/2021 11:51:27 - INFO - __main__ - Step 104251: {'lr': 0.00010890910761431492, 'samples': 20016192, 'steps': 104250, 'loss/train': 1.2927058935165405} 11/07/2021 11:51:28 - INFO - __main__ - Step 104252: {'lr': 0.00010890472678878858, 'samples': 20016384, 'steps': 104251, 'loss/train': 1.0804413557052612} 11/07/2021 11:51:28 - INFO - __main__ - Step 104253: {'lr': 0.0001089003460268376, 'samples': 20016576, 'steps': 104252, 'loss/train': 1.3223198652267456} 11/07/2021 11:51:29 - INFO - __main__ - Step 104254: {'lr': 0.00010889596532846397, 'samples': 20016768, 'steps': 104253, 'loss/train': 1.065435767173767} 11/07/2021 11:51:29 - INFO - __main__ - Step 104255: {'lr': 0.0001088915846936696, 'samples': 20016960, 'steps': 104254, 'loss/train': 0.8274714946746826} 11/07/2021 11:51:30 - INFO - __main__ - Step 104256: {'lr': 0.00010888720412245661, 'samples': 20017152, 'steps': 104255, 'loss/train': 1.6051222085952759} 11/07/2021 11:51:30 - INFO - __main__ - Step 104257: {'lr': 0.00010888282361482679, 'samples': 20017344, 'steps': 104256, 'loss/train': 1.3373234272003174} 11/07/2021 11:51:31 - INFO - __main__ - Step 104258: {'lr': 0.00010887844317078219, 'samples': 20017536, 'steps': 104257, 'loss/train': 1.3428641557693481} 11/07/2021 11:51:31 - INFO - __main__ - Step 104259: {'lr': 0.00010887406279032478, 'samples': 20017728, 'steps': 104258, 'loss/train': 1.2192376852035522} 11/07/2021 11:51:32 - INFO - __main__ - Step 104260: {'lr': 0.00010886968247345655, 'samples': 20017920, 'steps': 104259, 'loss/train': 1.3632051944732666} 11/07/2021 11:51:32 - INFO - __main__ - Step 104261: {'lr': 0.00010886530222017943, 'samples': 20018112, 'steps': 104260, 'loss/train': 1.5878303050994873} 11/07/2021 11:51:33 - INFO - __main__ - Step 104262: {'lr': 0.00010886092203049546, 'samples': 20018304, 'steps': 104261, 'loss/train': 1.340364933013916} 11/07/2021 11:51:33 - INFO - __main__ - Step 104263: {'lr': 0.00010885654190440658, 'samples': 20018496, 'steps': 104262, 'loss/train': 0.9740169644355774} 11/07/2021 11:51:33 - INFO - __main__ - Step 104264: {'lr': 0.00010885216184191474, 'samples': 20018688, 'steps': 104263, 'loss/train': 1.369388222694397} 11/07/2021 11:51:34 - INFO - __main__ - Step 104265: {'lr': 0.00010884778184302196, 'samples': 20018880, 'steps': 104264, 'loss/train': 1.5451115369796753} 11/07/2021 11:51:35 - INFO - __main__ - Step 104266: {'lr': 0.00010884340190773017, 'samples': 20019072, 'steps': 104265, 'loss/train': 1.1240061521530151} 11/07/2021 11:51:35 - INFO - __main__ - Step 104267: {'lr': 0.00010883902203604148, 'samples': 20019264, 'steps': 104266, 'loss/train': 1.854454517364502} 11/07/2021 11:51:35 - INFO - __main__ - Step 104268: {'lr': 0.00010883464222795766, 'samples': 20019456, 'steps': 104267, 'loss/train': 1.3228224515914917} 11/07/2021 11:51:36 - INFO - __main__ - Step 104269: {'lr': 0.00010883026248348076, 'samples': 20019648, 'steps': 104268, 'loss/train': 1.331530213356018} 11/07/2021 11:51:37 - INFO - __main__ - Step 104270: {'lr': 0.00010882588280261277, 'samples': 20019840, 'steps': 104269, 'loss/train': 1.596388578414917} 11/07/2021 11:51:37 - INFO - __main__ - Step 104271: {'lr': 0.00010882150318535564, 'samples': 20020032, 'steps': 104270, 'loss/train': 1.2471473217010498} 11/07/2021 11:51:37 - INFO - __main__ - Step 104272: {'lr': 0.0001088171236317114, 'samples': 20020224, 'steps': 104271, 'loss/train': 1.022462248802185} 11/07/2021 11:51:38 - INFO - __main__ - Step 104273: {'lr': 0.00010881274414168194, 'samples': 20020416, 'steps': 104272, 'loss/train': 1.8239716291427612} 11/07/2021 11:51:38 - INFO - __main__ - Step 104274: {'lr': 0.0001088083647152693, 'samples': 20020608, 'steps': 104273, 'loss/train': 1.6430399417877197} 11/07/2021 11:51:39 - INFO - __main__ - Step 104275: {'lr': 0.00010880398535247543, 'samples': 20020800, 'steps': 104274, 'loss/train': 2.0751616954803467} 11/07/2021 11:51:40 - INFO - __main__ - Step 104276: {'lr': 0.0001087996060533023, 'samples': 20020992, 'steps': 104275, 'loss/train': 1.6237397193908691} 11/07/2021 11:51:40 - INFO - __main__ - Step 104277: {'lr': 0.00010879522681775192, 'samples': 20021184, 'steps': 104276, 'loss/train': 0.9934767484664917} 11/07/2021 11:51:40 - INFO - __main__ - Step 104278: {'lr': 0.00010879084764582629, 'samples': 20021376, 'steps': 104277, 'loss/train': 0.8517100811004639} 11/07/2021 11:51:41 - INFO - __main__ - Step 104279: {'lr': 0.00010878646853752724, 'samples': 20021568, 'steps': 104278, 'loss/train': 1.4619115591049194} 11/07/2021 11:51:42 - INFO - __main__ - Step 104280: {'lr': 0.00010878208949285684, 'samples': 20021760, 'steps': 104279, 'loss/train': 1.5574103593826294} 11/07/2021 11:51:42 - INFO - __main__ - Step 104281: {'lr': 0.00010877771051181703, 'samples': 20021952, 'steps': 104280, 'loss/train': 1.8624916076660156} 11/07/2021 11:51:42 - INFO - __main__ - Step 104282: {'lr': 0.00010877333159440983, 'samples': 20022144, 'steps': 104281, 'loss/train': 0.7258989810943604} 11/07/2021 11:51:43 - INFO - __main__ - Step 104283: {'lr': 0.00010876895274063717, 'samples': 20022336, 'steps': 104282, 'loss/train': 1.4413658380508423} 11/07/2021 11:51:43 - INFO - __main__ - Step 104284: {'lr': 0.00010876457395050105, 'samples': 20022528, 'steps': 104283, 'loss/train': 1.2626336812973022} 11/07/2021 11:51:44 - INFO - __main__ - Step 104285: {'lr': 0.00010876019522400344, 'samples': 20022720, 'steps': 104284, 'loss/train': 1.4085112810134888} 11/07/2021 11:51:44 - INFO - __main__ - Step 104286: {'lr': 0.00010875581656114628, 'samples': 20022912, 'steps': 104285, 'loss/train': 1.3463683128356934} 11/07/2021 11:51:45 - INFO - __main__ - Step 104287: {'lr': 0.0001087514379619316, 'samples': 20023104, 'steps': 104286, 'loss/train': 1.138630986213684} 11/07/2021 11:51:45 - INFO - __main__ - Step 104288: {'lr': 0.00010874705942636131, 'samples': 20023296, 'steps': 104287, 'loss/train': 1.2391993999481201} 11/07/2021 11:51:46 - INFO - __main__ - Step 104289: {'lr': 0.00010874268095443754, 'samples': 20023488, 'steps': 104288, 'loss/train': 1.476576805114746} 11/07/2021 11:51:46 - INFO - __main__ - Step 104290: {'lr': 0.00010873830254616202, 'samples': 20023680, 'steps': 104289, 'loss/train': 1.1362181901931763} 11/07/2021 11:51:47 - INFO - __main__ - Step 104291: {'lr': 0.00010873392420153685, 'samples': 20023872, 'steps': 104290, 'loss/train': 1.2445920705795288} 11/07/2021 11:51:47 - INFO - __main__ - Step 104292: {'lr': 0.000108729545920564, 'samples': 20024064, 'steps': 104291, 'loss/train': 1.3899860382080078} 11/07/2021 11:51:48 - INFO - __main__ - Step 104293: {'lr': 0.00010872516770324544, 'samples': 20024256, 'steps': 104292, 'loss/train': 1.3217240571975708} 11/07/2021 11:51:48 - INFO - __main__ - Step 104294: {'lr': 0.00010872078954958315, 'samples': 20024448, 'steps': 104293, 'loss/train': 1.364881157875061} 11/07/2021 11:51:48 - INFO - __main__ - Step 104295: {'lr': 0.00010871641145957906, 'samples': 20024640, 'steps': 104294, 'loss/train': 1.3443316221237183} 11/07/2021 11:51:49 - INFO - __main__ - Step 104296: {'lr': 0.00010871203343323518, 'samples': 20024832, 'steps': 104295, 'loss/train': 1.530199646949768} 11/07/2021 11:51:50 - INFO - __main__ - Step 104297: {'lr': 0.0001087076554705535, 'samples': 20025024, 'steps': 104296, 'loss/train': 0.9850212931632996} 11/07/2021 11:51:50 - INFO - __main__ - Step 104298: {'lr': 0.00010870327757153595, 'samples': 20025216, 'steps': 104297, 'loss/train': 1.6311228275299072} 11/07/2021 11:51:50 - INFO - __main__ - Step 104299: {'lr': 0.00010869889973618452, 'samples': 20025408, 'steps': 104298, 'loss/train': 1.5444906949996948} 11/07/2021 11:51:51 - INFO - __main__ - Step 104300: {'lr': 0.0001086945219645013, 'samples': 20025600, 'steps': 104299, 'loss/train': 1.4397492408752441} 11/07/2021 11:51:52 - INFO - __main__ - Step 104301: {'lr': 0.00010869014425648804, 'samples': 20025792, 'steps': 104300, 'loss/train': 2.142334461212158} 11/07/2021 11:51:52 - INFO - __main__ - Step 104302: {'lr': 0.00010868576661214683, 'samples': 20025984, 'steps': 104301, 'loss/train': 1.395731806755066} 11/07/2021 11:51:52 - INFO - __main__ - Step 104303: {'lr': 0.00010868138903147961, 'samples': 20026176, 'steps': 104302, 'loss/train': 1.345402479171753} 11/07/2021 11:51:53 - INFO - __main__ - Step 104304: {'lr': 0.00010867701151448842, 'samples': 20026368, 'steps': 104303, 'loss/train': 1.6502219438552856} 11/07/2021 11:51:53 - INFO - __main__ - Step 104305: {'lr': 0.00010867263406117514, 'samples': 20026560, 'steps': 104304, 'loss/train': 1.2288154363632202} 11/07/2021 11:51:54 - INFO - __main__ - Step 104306: {'lr': 0.00010866825667154182, 'samples': 20026752, 'steps': 104305, 'loss/train': 1.3475065231323242} 11/07/2021 11:51:55 - INFO - __main__ - Step 104307: {'lr': 0.00010866387934559039, 'samples': 20026944, 'steps': 104306, 'loss/train': 1.3034230470657349} 11/07/2021 11:51:55 - INFO - __main__ - Step 104308: {'lr': 0.00010865950208332284, 'samples': 20027136, 'steps': 104307, 'loss/train': 1.2842754125595093} 11/07/2021 11:51:55 - INFO - __main__ - Step 104309: {'lr': 0.00010865512488474113, 'samples': 20027328, 'steps': 104308, 'loss/train': 1.1655292510986328} 11/07/2021 11:51:56 - INFO - __main__ - Step 104310: {'lr': 0.00010865074774984723, 'samples': 20027520, 'steps': 104309, 'loss/train': 1.340327262878418} 11/07/2021 11:51:57 - INFO - __main__ - Step 104311: {'lr': 0.00010864637067864325, 'samples': 20027712, 'steps': 104310, 'loss/train': 1.3194206953048706} 11/07/2021 11:51:57 - INFO - __main__ - Step 104312: {'lr': 0.00010864199367113092, 'samples': 20027904, 'steps': 104311, 'loss/train': 1.138992428779602} 11/07/2021 11:51:57 - INFO - __main__ - Step 104313: {'lr': 0.00010863761672731231, 'samples': 20028096, 'steps': 104312, 'loss/train': 3.6361894607543945} 11/07/2021 11:51:58 - INFO - __main__ - Step 104314: {'lr': 0.00010863323984718945, 'samples': 20028288, 'steps': 104313, 'loss/train': 1.6123887300491333} 11/07/2021 11:51:58 - INFO - __main__ - Step 104315: {'lr': 0.00010862886303076425, 'samples': 20028480, 'steps': 104314, 'loss/train': 1.4477245807647705} 11/07/2021 11:51:58 - INFO - __main__ - Step 104316: {'lr': 0.00010862448627803869, 'samples': 20028672, 'steps': 104315, 'loss/train': 1.2267658710479736} 11/07/2021 11:52:00 - INFO - __main__ - Step 104317: {'lr': 0.00010862010958901474, 'samples': 20028864, 'steps': 104316, 'loss/train': 1.07862389087677} 11/07/2021 11:52:00 - INFO - __main__ - Step 104318: {'lr': 0.00010861573296369442, 'samples': 20029056, 'steps': 104317, 'loss/train': 1.4064724445343018} 11/07/2021 11:52:00 - INFO - __main__ - Step 104319: {'lr': 0.00010861135640207966, 'samples': 20029248, 'steps': 104318, 'loss/train': 1.4794392585754395} 11/07/2021 11:52:01 - INFO - __main__ - Step 104320: {'lr': 0.00010860697990417245, 'samples': 20029440, 'steps': 104319, 'loss/train': 3.3669748306274414} 11/07/2021 11:52:01 - INFO - __main__ - Step 104321: {'lr': 0.00010860260346997474, 'samples': 20029632, 'steps': 104320, 'loss/train': 1.6413233280181885} 11/07/2021 11:52:02 - INFO - __main__ - Step 104322: {'lr': 0.00010859822709948853, 'samples': 20029824, 'steps': 104321, 'loss/train': 1.5922307968139648} 11/07/2021 11:52:02 - INFO - __main__ - Step 104323: {'lr': 0.00010859385079271586, 'samples': 20030016, 'steps': 104322, 'loss/train': 1.2395524978637695} 11/07/2021 11:52:03 - INFO - __main__ - Step 104324: {'lr': 0.00010858947454965853, 'samples': 20030208, 'steps': 104323, 'loss/train': 1.3528494834899902} 11/07/2021 11:52:03 - INFO - __main__ - Step 104325: {'lr': 0.0001085850983703186, 'samples': 20030400, 'steps': 104324, 'loss/train': 1.6340487003326416} 11/07/2021 11:52:03 - INFO - __main__ - Step 104326: {'lr': 0.00010858072225469803, 'samples': 20030592, 'steps': 104325, 'loss/train': 1.4418123960494995} 11/07/2021 11:52:05 - INFO - __main__ - Step 104327: {'lr': 0.0001085763462027988, 'samples': 20030784, 'steps': 104326, 'loss/train': 1.233813762664795} 11/07/2021 11:52:05 - INFO - __main__ - Step 104328: {'lr': 0.00010857197021462292, 'samples': 20030976, 'steps': 104327, 'loss/train': 2.5255486965179443} 11/07/2021 11:52:05 - INFO - __main__ - Step 104329: {'lr': 0.0001085675942901723, 'samples': 20031168, 'steps': 104328, 'loss/train': 1.439594030380249} 11/07/2021 11:52:06 - INFO - __main__ - Step 104330: {'lr': 0.00010856321842944894, 'samples': 20031360, 'steps': 104329, 'loss/train': 0.5749872326850891} 11/07/2021 11:52:06 - INFO - __main__ - Step 104331: {'lr': 0.00010855884263245483, 'samples': 20031552, 'steps': 104330, 'loss/train': 1.5576821565628052} 11/07/2021 11:52:07 - INFO - __main__ - Step 104332: {'lr': 0.00010855446689919191, 'samples': 20031744, 'steps': 104331, 'loss/train': 1.2615933418273926} 11/07/2021 11:52:07 - INFO - __main__ - Step 104333: {'lr': 0.00010855009122966217, 'samples': 20031936, 'steps': 104332, 'loss/train': 1.321292757987976} 11/07/2021 11:52:08 - INFO - __main__ - Step 104334: {'lr': 0.00010854571562386756, 'samples': 20032128, 'steps': 104333, 'loss/train': 1.5461745262145996} 11/07/2021 11:52:08 - INFO - __main__ - Step 104335: {'lr': 0.00010854134008181007, 'samples': 20032320, 'steps': 104334, 'loss/train': 1.2995072603225708} 11/07/2021 11:52:08 - INFO - __main__ - Step 104336: {'lr': 0.00010853696460349177, 'samples': 20032512, 'steps': 104335, 'loss/train': 1.5651453733444214} 11/07/2021 11:52:09 - INFO - __main__ - Step 104337: {'lr': 0.00010853258918891445, 'samples': 20032704, 'steps': 104336, 'loss/train': 1.168761968612671} 11/07/2021 11:52:10 - INFO - __main__ - Step 104338: {'lr': 0.00010852821383808015, 'samples': 20032896, 'steps': 104337, 'loss/train': 1.2276599407196045} 11/07/2021 11:52:10 - INFO - __main__ - Step 104339: {'lr': 0.00010852383855099086, 'samples': 20033088, 'steps': 104338, 'loss/train': 1.2774399518966675} 11/07/2021 11:52:11 - INFO - __main__ - Step 104340: {'lr': 0.00010851946332764853, 'samples': 20033280, 'steps': 104339, 'loss/train': 1.7416644096374512} 11/07/2021 11:52:11 - INFO - __main__ - Step 104341: {'lr': 0.00010851508816805516, 'samples': 20033472, 'steps': 104340, 'loss/train': 1.6030163764953613} 11/07/2021 11:52:11 - INFO - __main__ - Step 104342: {'lr': 0.00010851071307221272, 'samples': 20033664, 'steps': 104341, 'loss/train': 1.703507900238037} 11/07/2021 11:52:12 - INFO - __main__ - Step 104343: {'lr': 0.00010850633804012314, 'samples': 20033856, 'steps': 104342, 'loss/train': 1.068310260772705} 11/07/2021 11:52:13 - INFO - __main__ - Step 104344: {'lr': 0.00010850196307178844, 'samples': 20034048, 'steps': 104343, 'loss/train': 1.3756155967712402} 11/07/2021 11:52:13 - INFO - __main__ - Step 104345: {'lr': 0.00010849758816721056, 'samples': 20034240, 'steps': 104344, 'loss/train': 1.6528675556182861} 11/07/2021 11:52:13 - INFO - __main__ - Step 104346: {'lr': 0.00010849321332639151, 'samples': 20034432, 'steps': 104345, 'loss/train': 1.47083580493927} 11/07/2021 11:52:14 - INFO - __main__ - Step 104347: {'lr': 0.0001084888385493332, 'samples': 20034624, 'steps': 104346, 'loss/train': 1.4112498760223389} 11/07/2021 11:52:15 - INFO - __main__ - Step 104348: {'lr': 0.00010848446383603767, 'samples': 20034816, 'steps': 104347, 'loss/train': 1.665765643119812} 11/07/2021 11:52:15 - INFO - __main__ - Step 104349: {'lr': 0.00010848008918650682, 'samples': 20035008, 'steps': 104348, 'loss/train': 1.865490198135376} 11/07/2021 11:52:16 - INFO - __main__ - Step 104350: {'lr': 0.00010847571460074276, 'samples': 20035200, 'steps': 104349, 'loss/train': 1.520905613899231} 11/07/2021 11:52:16 - INFO - __main__ - Step 104351: {'lr': 0.0001084713400787473, 'samples': 20035392, 'steps': 104350, 'loss/train': 1.1681945323944092} 11/07/2021 11:52:16 - INFO - __main__ - Step 104352: {'lr': 0.00010846696562052241, 'samples': 20035584, 'steps': 104351, 'loss/train': 1.4526021480560303} 11/07/2021 11:52:17 - INFO - __main__ - Step 104353: {'lr': 0.00010846259122607016, 'samples': 20035776, 'steps': 104352, 'loss/train': 1.3508059978485107} 11/07/2021 11:52:18 - INFO - __main__ - Step 104354: {'lr': 0.00010845821689539249, 'samples': 20035968, 'steps': 104353, 'loss/train': 0.5585866570472717} 11/07/2021 11:52:18 - INFO - __main__ - Step 104355: {'lr': 0.00010845384262849134, 'samples': 20036160, 'steps': 104354, 'loss/train': 1.232789397239685} 11/07/2021 11:52:18 - INFO - __main__ - Step 104356: {'lr': 0.00010844946842536873, 'samples': 20036352, 'steps': 104355, 'loss/train': 1.5398839712142944} 11/07/2021 11:52:19 - INFO - __main__ - Step 104357: {'lr': 0.00010844509428602659, 'samples': 20036544, 'steps': 104356, 'loss/train': 1.0198559761047363} 11/07/2021 11:52:20 - INFO - __main__ - Step 104358: {'lr': 0.00010844072021046692, 'samples': 20036736, 'steps': 104357, 'loss/train': 0.6496318578720093} 11/07/2021 11:52:20 - INFO - __main__ - Step 104359: {'lr': 0.00010843634619869167, 'samples': 20036928, 'steps': 104358, 'loss/train': 1.182130217552185} 11/07/2021 11:52:20 - INFO - __main__ - Step 104360: {'lr': 0.0001084319722507028, 'samples': 20037120, 'steps': 104359, 'loss/train': 1.1974695920944214} 11/07/2021 11:52:21 - INFO - __main__ - Step 104361: {'lr': 0.00010842759836650231, 'samples': 20037312, 'steps': 104360, 'loss/train': 1.085732340812683} 11/07/2021 11:52:21 - INFO - __main__ - Step 104362: {'lr': 0.00010842322454609216, 'samples': 20037504, 'steps': 104361, 'loss/train': 1.3604927062988281} 11/07/2021 11:52:21 - INFO - __main__ - Step 104363: {'lr': 0.00010841885078947441, 'samples': 20037696, 'steps': 104362, 'loss/train': 0.6246560215950012} 11/07/2021 11:52:23 - INFO - __main__ - Step 104364: {'lr': 0.00010841447709665086, 'samples': 20037888, 'steps': 104363, 'loss/train': 1.2982014417648315} 11/07/2021 11:52:23 - INFO - __main__ - Step 104365: {'lr': 0.00010841010346762356, 'samples': 20038080, 'steps': 104364, 'loss/train': 1.5955638885498047} 11/07/2021 11:52:23 - INFO - __main__ - Step 104366: {'lr': 0.00010840572990239447, 'samples': 20038272, 'steps': 104365, 'loss/train': 1.173125147819519} 11/07/2021 11:52:24 - INFO - __main__ - Step 104367: {'lr': 0.00010840135640096558, 'samples': 20038464, 'steps': 104366, 'loss/train': 1.332020878791809} 11/07/2021 11:52:24 - INFO - __main__ - Step 104368: {'lr': 0.00010839698296333885, 'samples': 20038656, 'steps': 104367, 'loss/train': 1.5510574579238892} 11/07/2021 11:52:25 - INFO - __main__ - Step 104369: {'lr': 0.00010839260958951628, 'samples': 20038848, 'steps': 104368, 'loss/train': 1.1854603290557861} 11/07/2021 11:52:25 - INFO - __main__ - Step 104370: {'lr': 0.00010838823627949978, 'samples': 20039040, 'steps': 104369, 'loss/train': 4.683155059814453} 11/07/2021 11:52:26 - INFO - __main__ - Step 104371: {'lr': 0.00010838386303329137, 'samples': 20039232, 'steps': 104370, 'loss/train': 1.032496690750122} 11/07/2021 11:52:26 - INFO - __main__ - Step 104372: {'lr': 0.00010837948985089299, 'samples': 20039424, 'steps': 104371, 'loss/train': 1.1809035539627075} 11/07/2021 11:52:26 - INFO - __main__ - Step 104373: {'lr': 0.00010837511673230666, 'samples': 20039616, 'steps': 104372, 'loss/train': 1.6558948755264282} 11/07/2021 11:52:28 - INFO - __main__ - Step 104374: {'lr': 0.0001083707436775343, 'samples': 20039808, 'steps': 104373, 'loss/train': 1.3768800497055054} 11/07/2021 11:52:28 - INFO - __main__ - Step 104375: {'lr': 0.00010836637068657787, 'samples': 20040000, 'steps': 104374, 'loss/train': 1.078749418258667} 11/07/2021 11:52:28 - INFO - __main__ - Step 104376: {'lr': 0.00010836199775943942, 'samples': 20040192, 'steps': 104375, 'loss/train': 1.2261637449264526} 11/07/2021 11:52:29 - INFO - __main__ - Step 104377: {'lr': 0.00010835762489612091, 'samples': 20040384, 'steps': 104376, 'loss/train': 1.4767171144485474} 11/07/2021 11:52:29 - INFO - __main__ - Step 104378: {'lr': 0.00010835325209662423, 'samples': 20040576, 'steps': 104377, 'loss/train': 1.5187716484069824} 11/07/2021 11:52:29 - INFO - __main__ - Step 104379: {'lr': 0.00010834887936095134, 'samples': 20040768, 'steps': 104378, 'loss/train': 1.1440186500549316} 11/07/2021 11:52:30 - INFO - __main__ - Step 104380: {'lr': 0.00010834450668910428, 'samples': 20040960, 'steps': 104379, 'loss/train': 1.4623923301696777} 11/07/2021 11:52:31 - INFO - __main__ - Step 104381: {'lr': 0.000108340134081085, 'samples': 20041152, 'steps': 104380, 'loss/train': 1.8423218727111816} 11/07/2021 11:52:31 - INFO - __main__ - Step 104382: {'lr': 0.00010833576153689547, 'samples': 20041344, 'steps': 104381, 'loss/train': 2.091845750808716} 11/07/2021 11:52:31 - INFO - __main__ - Step 104383: {'lr': 0.00010833138905653767, 'samples': 20041536, 'steps': 104382, 'loss/train': 1.4969546794891357} 11/07/2021 11:52:32 - INFO - __main__ - Step 104384: {'lr': 0.00010832701664001354, 'samples': 20041728, 'steps': 104383, 'loss/train': 1.406753659248352} 11/07/2021 11:52:33 - INFO - __main__ - Step 104385: {'lr': 0.00010832264428732508, 'samples': 20041920, 'steps': 104384, 'loss/train': 1.2350823879241943} 11/07/2021 11:52:33 - INFO - __main__ - Step 104386: {'lr': 0.00010831827199847424, 'samples': 20042112, 'steps': 104385, 'loss/train': 0.9705185890197754} 11/07/2021 11:52:33 - INFO - __main__ - Step 104387: {'lr': 0.000108313899773463, 'samples': 20042304, 'steps': 104386, 'loss/train': 1.5211206674575806} 11/07/2021 11:52:34 - INFO - __main__ - Step 104388: {'lr': 0.00010830952761229334, 'samples': 20042496, 'steps': 104387, 'loss/train': 1.541641354560852} 11/07/2021 11:52:34 - INFO - __main__ - Step 104389: {'lr': 0.00010830515551496722, 'samples': 20042688, 'steps': 104388, 'loss/train': 1.1161741018295288} 11/07/2021 11:52:36 - INFO - __main__ - Step 104390: {'lr': 0.0001083007834814867, 'samples': 20042880, 'steps': 104389, 'loss/train': 1.3102720975875854} 11/07/2021 11:52:36 - INFO - __main__ - Step 104391: {'lr': 0.00010829641151185357, 'samples': 20043072, 'steps': 104390, 'loss/train': 1.3978849649429321} 11/07/2021 11:52:36 - INFO - __main__ - Step 104392: {'lr': 0.0001082920396060699, 'samples': 20043264, 'steps': 104391, 'loss/train': 1.5822778940200806} 11/07/2021 11:52:37 - INFO - __main__ - Step 104393: {'lr': 0.00010828766776413762, 'samples': 20043456, 'steps': 104392, 'loss/train': 1.3831270933151245} 11/07/2021 11:52:37 - INFO - __main__ - Step 104394: {'lr': 0.00010828329598605876, 'samples': 20043648, 'steps': 104393, 'loss/train': 1.494424819946289} 11/07/2021 11:52:38 - INFO - __main__ - Step 104395: {'lr': 0.00010827892427183525, 'samples': 20043840, 'steps': 104394, 'loss/train': 0.7058231830596924} 11/07/2021 11:52:38 - INFO - __main__ - Step 104396: {'lr': 0.00010827455262146907, 'samples': 20044032, 'steps': 104395, 'loss/train': 1.6381181478500366} 11/07/2021 11:52:39 - INFO - __main__ - Step 104397: {'lr': 0.0001082701810349622, 'samples': 20044224, 'steps': 104396, 'loss/train': 1.0696884393692017} 11/07/2021 11:52:39 - INFO - __main__ - Step 104398: {'lr': 0.0001082658095123166, 'samples': 20044416, 'steps': 104397, 'loss/train': 1.5318368673324585} 11/07/2021 11:52:40 - INFO - __main__ - Step 104399: {'lr': 0.00010826143805353423, 'samples': 20044608, 'steps': 104398, 'loss/train': 1.3210091590881348} 11/07/2021 11:52:40 - INFO - __main__ - Step 104400: {'lr': 0.00010825706665861707, 'samples': 20044800, 'steps': 104399, 'loss/train': 1.423685908317566} 11/07/2021 11:52:41 - INFO - __main__ - Step 104401: {'lr': 0.00010825269532756707, 'samples': 20044992, 'steps': 104400, 'loss/train': 1.6528027057647705} 11/07/2021 11:52:41 - INFO - __main__ - Step 104402: {'lr': 0.00010824832406038623, 'samples': 20045184, 'steps': 104401, 'loss/train': 1.1971163749694824} 11/07/2021 11:52:42 - INFO - __main__ - Step 104403: {'lr': 0.00010824395285707653, 'samples': 20045376, 'steps': 104402, 'loss/train': 1.2351202964782715} 11/07/2021 11:52:42 - INFO - __main__ - Step 104404: {'lr': 0.00010823958171763998, 'samples': 20045568, 'steps': 104403, 'loss/train': 1.5163434743881226} 11/07/2021 11:52:42 - INFO - __main__ - Step 104405: {'lr': 0.00010823521064207839, 'samples': 20045760, 'steps': 104404, 'loss/train': 1.5413423776626587} 11/07/2021 11:52:43 - INFO - __main__ - Step 104406: {'lr': 0.00010823083963039384, 'samples': 20045952, 'steps': 104405, 'loss/train': 1.3630238771438599} 11/07/2021 11:52:44 - INFO - __main__ - Step 104407: {'lr': 0.00010822646868258831, 'samples': 20046144, 'steps': 104406, 'loss/train': 0.4023608863353729} 11/07/2021 11:52:44 - INFO - __main__ - Step 104408: {'lr': 0.00010822209779866371, 'samples': 20046336, 'steps': 104407, 'loss/train': 1.6685600280761719} 11/07/2021 11:52:45 - INFO - __main__ - Step 104409: {'lr': 0.00010821772697862206, 'samples': 20046528, 'steps': 104408, 'loss/train': 1.2475121021270752} 11/07/2021 11:52:45 - INFO - __main__ - Step 104410: {'lr': 0.0001082133562224653, 'samples': 20046720, 'steps': 104409, 'loss/train': 1.2313474416732788} 11/07/2021 11:52:46 - INFO - __main__ - Step 104411: {'lr': 0.00010820898553019545, 'samples': 20046912, 'steps': 104410, 'loss/train': 1.4591537714004517} 11/07/2021 11:52:46 - INFO - __main__ - Step 104412: {'lr': 0.00010820461490181441, 'samples': 20047104, 'steps': 104411, 'loss/train': 1.402093529701233} 11/07/2021 11:52:47 - INFO - __main__ - Step 104413: {'lr': 0.0001082002443373242, 'samples': 20047296, 'steps': 104412, 'loss/train': 0.6686480045318604} 11/07/2021 11:52:47 - INFO - __main__ - Step 104414: {'lr': 0.00010819587383672678, 'samples': 20047488, 'steps': 104413, 'loss/train': 1.474524736404419} 11/07/2021 11:52:47 - INFO - __main__ - Step 104415: {'lr': 0.0001081915034000241, 'samples': 20047680, 'steps': 104414, 'loss/train': 1.4241960048675537} 11/07/2021 11:52:48 - INFO - __main__ - Step 104416: {'lr': 0.0001081871330272181, 'samples': 20047872, 'steps': 104415, 'loss/train': 1.7358061075210571} 11/07/2021 11:52:49 - INFO - __main__ - Step 104417: {'lr': 0.00010818276271831093, 'samples': 20048064, 'steps': 104416, 'loss/train': 1.85403573513031} 11/07/2021 11:52:49 - INFO - __main__ - Step 104418: {'lr': 0.00010817839247330432, 'samples': 20048256, 'steps': 104417, 'loss/train': 1.3121845722198486} 11/07/2021 11:52:49 - INFO - __main__ - Step 104419: {'lr': 0.00010817402229220032, 'samples': 20048448, 'steps': 104418, 'loss/train': 1.2136777639389038} 11/07/2021 11:52:50 - INFO - __main__ - Step 104420: {'lr': 0.00010816965217500093, 'samples': 20048640, 'steps': 104419, 'loss/train': 1.5322459936141968} 11/07/2021 11:52:51 - INFO - __main__ - Step 104421: {'lr': 0.00010816528212170812, 'samples': 20048832, 'steps': 104420, 'loss/train': 1.2683758735656738} 11/07/2021 11:52:51 - INFO - __main__ - Step 104422: {'lr': 0.00010816091213232385, 'samples': 20049024, 'steps': 104421, 'loss/train': 1.212889313697815} 11/07/2021 11:52:51 - INFO - __main__ - Step 104423: {'lr': 0.00010815654220685006, 'samples': 20049216, 'steps': 104422, 'loss/train': 1.0805492401123047} 11/07/2021 11:52:52 - INFO - __main__ - Step 104424: {'lr': 0.00010815217234528873, 'samples': 20049408, 'steps': 104423, 'loss/train': 1.3389360904693604} 11/07/2021 11:52:52 - INFO - __main__ - Step 104425: {'lr': 0.00010814780254764186, 'samples': 20049600, 'steps': 104424, 'loss/train': 0.9207272529602051} 11/07/2021 11:52:52 - INFO - __main__ - Step 104426: {'lr': 0.00010814343281391143, 'samples': 20049792, 'steps': 104425, 'loss/train': 1.5237395763397217} 11/07/2021 11:52:54 - INFO - __main__ - Step 104427: {'lr': 0.00010813906314409933, 'samples': 20049984, 'steps': 104426, 'loss/train': 1.523102879524231} 11/07/2021 11:52:54 - INFO - __main__ - Step 104428: {'lr': 0.0001081346935382076, 'samples': 20050176, 'steps': 104427, 'loss/train': 1.4679287672042847} 11/07/2021 11:52:54 - INFO - __main__ - Step 104429: {'lr': 0.0001081303239962382, 'samples': 20050368, 'steps': 104428, 'loss/train': 1.3961488008499146} 11/07/2021 11:52:55 - INFO - __main__ - Step 104430: {'lr': 0.00010812595451819315, 'samples': 20050560, 'steps': 104429, 'loss/train': 1.5175589323043823} 11/07/2021 11:52:55 - INFO - __main__ - Step 104431: {'lr': 0.0001081215851040743, 'samples': 20050752, 'steps': 104430, 'loss/train': 0.47755733132362366} 11/07/2021 11:52:56 - INFO - __main__ - Step 104432: {'lr': 0.00010811721575388364, 'samples': 20050944, 'steps': 104431, 'loss/train': 2.0397887229919434} 11/07/2021 11:52:56 - INFO - __main__ - Step 104433: {'lr': 0.0001081128464676232, 'samples': 20051136, 'steps': 104432, 'loss/train': 1.1092721223831177} 11/07/2021 11:52:57 - INFO - __main__ - Step 104434: {'lr': 0.00010810847724529491, 'samples': 20051328, 'steps': 104433, 'loss/train': 1.2408804893493652} 11/07/2021 11:52:57 - INFO - __main__ - Step 104435: {'lr': 0.00010810410808690076, 'samples': 20051520, 'steps': 104434, 'loss/train': 0.9635074734687805} 11/07/2021 11:52:57 - INFO - __main__ - Step 104436: {'lr': 0.00010809973899244269, 'samples': 20051712, 'steps': 104435, 'loss/train': 1.2586045265197754} 11/07/2021 11:52:59 - INFO - __main__ - Step 104437: {'lr': 0.00010809536996192271, 'samples': 20051904, 'steps': 104436, 'loss/train': 1.7243294715881348} 11/07/2021 11:52:59 - INFO - __main__ - Step 104438: {'lr': 0.00010809100099534274, 'samples': 20052096, 'steps': 104437, 'loss/train': 0.6808053255081177} 11/07/2021 11:52:59 - INFO - __main__ - Step 104439: {'lr': 0.0001080866320927048, 'samples': 20052288, 'steps': 104438, 'loss/train': 1.6648552417755127} 11/07/2021 11:53:00 - INFO - __main__ - Step 104440: {'lr': 0.00010808226325401082, 'samples': 20052480, 'steps': 104439, 'loss/train': 1.4778292179107666} 11/07/2021 11:53:00 - INFO - __main__ - Step 104441: {'lr': 0.00010807789447926281, 'samples': 20052672, 'steps': 104440, 'loss/train': 1.0705015659332275} 11/07/2021 11:53:02 - INFO - __main__ - Step 104442: {'lr': 0.00010807352576846268, 'samples': 20052864, 'steps': 104441, 'loss/train': 1.3219146728515625} 11/07/2021 11:53:02 - INFO - __main__ - Step 104443: {'lr': 0.00010806915712161244, 'samples': 20053056, 'steps': 104442, 'loss/train': 0.8676849007606506} 11/07/2021 11:53:02 - INFO - __main__ - Step 104444: {'lr': 0.00010806478853871413, 'samples': 20053248, 'steps': 104443, 'loss/train': 1.175565481185913} 11/07/2021 11:53:03 - INFO - __main__ - Step 104445: {'lr': 0.00010806042001976954, 'samples': 20053440, 'steps': 104444, 'loss/train': 0.9195734262466431} 11/07/2021 11:53:03 - INFO - __main__ - Step 104446: {'lr': 0.00010805605156478076, 'samples': 20053632, 'steps': 104445, 'loss/train': 1.7699306011199951} 11/07/2021 11:53:03 - INFO - __main__ - Step 104447: {'lr': 0.00010805168317374972, 'samples': 20053824, 'steps': 104446, 'loss/train': 0.8120316863059998} 11/07/2021 11:53:05 - INFO - __main__ - Step 104448: {'lr': 0.0001080473148466784, 'samples': 20054016, 'steps': 104447, 'loss/train': 1.736064076423645} 11/07/2021 11:53:05 - INFO - __main__ - Step 104449: {'lr': 0.00010804294658356875, 'samples': 20054208, 'steps': 104448, 'loss/train': 1.3042430877685547} 11/07/2021 11:53:05 - INFO - __main__ - Step 104450: {'lr': 0.00010803857838442279, 'samples': 20054400, 'steps': 104449, 'loss/train': 1.155276894569397} 11/07/2021 11:53:06 - INFO - __main__ - Step 104451: {'lr': 0.00010803421024924246, 'samples': 20054592, 'steps': 104450, 'loss/train': 4.493049144744873} 11/07/2021 11:53:06 - INFO - __main__ - Step 104452: {'lr': 0.00010802984217802968, 'samples': 20054784, 'steps': 104451, 'loss/train': 1.5454277992248535} 11/07/2021 11:53:06 - INFO - __main__ - Step 104453: {'lr': 0.00010802547417078651, 'samples': 20054976, 'steps': 104452, 'loss/train': 1.6374576091766357} 11/07/2021 11:53:07 - INFO - __main__ - Step 104454: {'lr': 0.00010802110622751485, 'samples': 20055168, 'steps': 104453, 'loss/train': 1.337654709815979} 11/07/2021 11:53:07 - INFO - __main__ - Step 104455: {'lr': 0.00010801673834821668, 'samples': 20055360, 'steps': 104454, 'loss/train': 0.7054550647735596} 11/07/2021 11:53:09 - INFO - __main__ - Step 104456: {'lr': 0.00010801237053289398, 'samples': 20055552, 'steps': 104455, 'loss/train': 0.6118089556694031} 11/07/2021 11:53:09 - INFO - __main__ - Step 104457: {'lr': 0.00010800800278154882, 'samples': 20055744, 'steps': 104456, 'loss/train': 1.3689255714416504} 11/07/2021 11:53:09 - INFO - __main__ - Step 104458: {'lr': 0.00010800363509418296, 'samples': 20055936, 'steps': 104457, 'loss/train': 1.5835285186767578} 11/07/2021 11:53:10 - INFO - __main__ - Step 104459: {'lr': 0.00010799926747079847, 'samples': 20056128, 'steps': 104458, 'loss/train': 1.3359017372131348} 11/07/2021 11:53:10 - INFO - __main__ - Step 104460: {'lr': 0.00010799489991139732, 'samples': 20056320, 'steps': 104459, 'loss/train': 1.1633434295654297} 11/07/2021 11:53:11 - INFO - __main__ - Step 104461: {'lr': 0.00010799053241598147, 'samples': 20056512, 'steps': 104460, 'loss/train': 1.3850809335708618} 11/07/2021 11:53:11 - INFO - __main__ - Step 104462: {'lr': 0.00010798616498455291, 'samples': 20056704, 'steps': 104461, 'loss/train': 1.1695358753204346} 11/07/2021 11:53:12 - INFO - __main__ - Step 104463: {'lr': 0.0001079817976171136, 'samples': 20056896, 'steps': 104462, 'loss/train': 1.1453553438186646} 11/07/2021 11:53:12 - INFO - __main__ - Step 104464: {'lr': 0.00010797743031366546, 'samples': 20057088, 'steps': 104463, 'loss/train': 1.4616777896881104} 11/07/2021 11:53:12 - INFO - __main__ - Step 104465: {'lr': 0.00010797306307421053, 'samples': 20057280, 'steps': 104464, 'loss/train': 1.714019775390625} 11/07/2021 11:53:13 - INFO - __main__ - Step 104466: {'lr': 0.00010796869589875077, 'samples': 20057472, 'steps': 104465, 'loss/train': 1.5072637796401978} 11/07/2021 11:53:14 - INFO - __main__ - Step 104467: {'lr': 0.0001079643287872881, 'samples': 20057664, 'steps': 104466, 'loss/train': 1.5732896327972412} 11/07/2021 11:53:14 - INFO - __main__ - Step 104468: {'lr': 0.0001079599617398245, 'samples': 20057856, 'steps': 104467, 'loss/train': 1.3857676982879639} 11/07/2021 11:53:14 - INFO - __main__ - Step 104469: {'lr': 0.00010795559475636196, 'samples': 20058048, 'steps': 104468, 'loss/train': 1.4284855127334595} 11/07/2021 11:53:15 - INFO - __main__ - Step 104470: {'lr': 0.00010795122783690242, 'samples': 20058240, 'steps': 104469, 'loss/train': 1.5117906332015991} 11/07/2021 11:53:16 - INFO - __main__ - Step 104471: {'lr': 0.00010794686098144799, 'samples': 20058432, 'steps': 104470, 'loss/train': 1.5629998445510864} 11/07/2021 11:53:16 - INFO - __main__ - Step 104472: {'lr': 0.00010794249419000038, 'samples': 20058624, 'steps': 104471, 'loss/train': 1.4183629751205444} 11/07/2021 11:53:17 - INFO - __main__ - Step 104473: {'lr': 0.00010793812746256171, 'samples': 20058816, 'steps': 104472, 'loss/train': 1.3496639728546143} 11/07/2021 11:53:17 - INFO - __main__ - Step 104474: {'lr': 0.00010793376079913395, 'samples': 20059008, 'steps': 104473, 'loss/train': 1.1291488409042358} 11/07/2021 11:53:17 - INFO - __main__ - Step 104475: {'lr': 0.000107929394199719, 'samples': 20059200, 'steps': 104474, 'loss/train': 1.0548397302627563} 11/07/2021 11:53:18 - INFO - __main__ - Step 104476: {'lr': 0.00010792502766431891, 'samples': 20059392, 'steps': 104475, 'loss/train': 1.1949361562728882} 11/07/2021 11:53:19 - INFO - __main__ - Step 104477: {'lr': 0.00010792066119293559, 'samples': 20059584, 'steps': 104476, 'loss/train': 1.4604343175888062} 11/07/2021 11:53:19 - INFO - __main__ - Step 104478: {'lr': 0.00010791629478557105, 'samples': 20059776, 'steps': 104477, 'loss/train': 1.5209431648254395} 11/07/2021 11:53:19 - INFO - __main__ - Step 104479: {'lr': 0.00010791192844222722, 'samples': 20059968, 'steps': 104478, 'loss/train': 1.3515795469284058} 11/07/2021 11:53:20 - INFO - __main__ - Step 104480: {'lr': 0.00010790756216290606, 'samples': 20060160, 'steps': 104479, 'loss/train': 1.4357855319976807} 11/07/2021 11:53:21 - INFO - __main__ - Step 104481: {'lr': 0.00010790319594760958, 'samples': 20060352, 'steps': 104480, 'loss/train': 2.239941120147705} 11/07/2021 11:53:21 - INFO - __main__ - Step 104482: {'lr': 0.00010789882979633974, 'samples': 20060544, 'steps': 104481, 'loss/train': 1.1555496454238892} 11/07/2021 11:53:22 - INFO - __main__ - Step 104483: {'lr': 0.0001078944637090985, 'samples': 20060736, 'steps': 104482, 'loss/train': 2.480792999267578} 11/07/2021 11:53:22 - INFO - __main__ - Step 104484: {'lr': 0.0001078900976858879, 'samples': 20060928, 'steps': 104483, 'loss/train': 1.9061124324798584} 11/07/2021 11:53:22 - INFO - __main__ - Step 104485: {'lr': 0.00010788573172670973, 'samples': 20061120, 'steps': 104484, 'loss/train': 1.2726057767868042} 11/07/2021 11:53:23 - INFO - __main__ - Step 104486: {'lr': 0.00010788136583156604, 'samples': 20061312, 'steps': 104485, 'loss/train': 1.2946405410766602} 11/07/2021 11:53:24 - INFO - __main__ - Step 104487: {'lr': 0.00010787700000045886, 'samples': 20061504, 'steps': 104486, 'loss/train': 1.1187068223953247} 11/07/2021 11:53:24 - INFO - __main__ - Step 104488: {'lr': 0.00010787263423339008, 'samples': 20061696, 'steps': 104487, 'loss/train': 1.0246511697769165} 11/07/2021 11:53:24 - INFO - __main__ - Step 104489: {'lr': 0.00010786826853036169, 'samples': 20061888, 'steps': 104488, 'loss/train': 1.7434769868850708} 11/07/2021 11:53:25 - INFO - __main__ - Step 104490: {'lr': 0.00010786390289137569, 'samples': 20062080, 'steps': 104489, 'loss/train': 1.4443455934524536} 11/07/2021 11:53:25 - INFO - __main__ - Step 104491: {'lr': 0.000107859537316434, 'samples': 20062272, 'steps': 104490, 'loss/train': 0.9510664939880371} 11/07/2021 11:53:26 - INFO - __main__ - Step 104492: {'lr': 0.00010785517180553864, 'samples': 20062464, 'steps': 104491, 'loss/train': 1.2539623975753784} 11/07/2021 11:53:26 - INFO - __main__ - Step 104493: {'lr': 0.00010785080635869152, 'samples': 20062656, 'steps': 104492, 'loss/train': 1.5163381099700928} 11/07/2021 11:53:27 - INFO - __main__ - Step 104494: {'lr': 0.00010784644097589463, 'samples': 20062848, 'steps': 104493, 'loss/train': 1.9949193000793457} 11/07/2021 11:53:27 - INFO - __main__ - Step 104495: {'lr': 0.00010784207565714995, 'samples': 20063040, 'steps': 104494, 'loss/train': 1.3396819829940796} 11/07/2021 11:53:27 - INFO - __main__ - Step 104496: {'lr': 0.00010783771040245944, 'samples': 20063232, 'steps': 104495, 'loss/train': 0.7166416049003601} 11/07/2021 11:53:29 - INFO - __main__ - Step 104497: {'lr': 0.00010783334521182505, 'samples': 20063424, 'steps': 104496, 'loss/train': 1.4308408498764038} 11/07/2021 11:53:29 - INFO - __main__ - Step 104498: {'lr': 0.00010782898008524885, 'samples': 20063616, 'steps': 104497, 'loss/train': 1.400026798248291} 11/07/2021 11:53:29 - INFO - __main__ - Step 104499: {'lr': 0.00010782461502273267, 'samples': 20063808, 'steps': 104498, 'loss/train': 1.346091389656067} 11/07/2021 11:53:30 - INFO - __main__ - Step 104500: {'lr': 0.00010782025002427848, 'samples': 20064000, 'steps': 104499, 'loss/train': 1.0753788948059082} 11/07/2021 11:53:30 - INFO - __main__ - Step 104501: {'lr': 0.0001078158850898883, 'samples': 20064192, 'steps': 104500, 'loss/train': 1.0790363550186157} 11/07/2021 11:53:31 - INFO - __main__ - Step 104502: {'lr': 0.00010781152021956408, 'samples': 20064384, 'steps': 104501, 'loss/train': 1.4209321737289429} 11/07/2021 11:53:31 - INFO - __main__ - Step 104503: {'lr': 0.00010780715541330783, 'samples': 20064576, 'steps': 104502, 'loss/train': 1.369503378868103} 11/07/2021 11:53:32 - INFO - __main__ - Step 104504: {'lr': 0.00010780279067112145, 'samples': 20064768, 'steps': 104503, 'loss/train': 1.3571271896362305} 11/07/2021 11:53:32 - INFO - __main__ - Step 104505: {'lr': 0.00010779842599300696, 'samples': 20064960, 'steps': 104504, 'loss/train': 0.6836594343185425} 11/07/2021 11:53:32 - INFO - __main__ - Step 104506: {'lr': 0.00010779406137896627, 'samples': 20065152, 'steps': 104505, 'loss/train': 1.0455989837646484} 11/07/2021 11:53:34 - INFO - __main__ - Step 104507: {'lr': 0.00010778969682900141, 'samples': 20065344, 'steps': 104506, 'loss/train': 1.1811708211898804} 11/07/2021 11:53:34 - INFO - __main__ - Step 104508: {'lr': 0.00010778533234311433, 'samples': 20065536, 'steps': 104507, 'loss/train': 1.607795000076294} 11/07/2021 11:53:34 - INFO - __main__ - Step 104509: {'lr': 0.00010778096792130695, 'samples': 20065728, 'steps': 104508, 'loss/train': 1.6323331594467163} 11/07/2021 11:53:35 - INFO - __main__ - Step 104510: {'lr': 0.0001077766035635813, 'samples': 20065920, 'steps': 104509, 'loss/train': 1.3246349096298218} 11/07/2021 11:53:35 - INFO - __main__ - Step 104511: {'lr': 0.0001077722392699394, 'samples': 20066112, 'steps': 104510, 'loss/train': 2.0842294692993164} 11/07/2021 11:53:35 - INFO - __main__ - Step 104512: {'lr': 0.00010776787504038305, 'samples': 20066304, 'steps': 104511, 'loss/train': 1.1568633317947388} 11/07/2021 11:53:36 - INFO - __main__ - Step 104513: {'lr': 0.00010776351087491426, 'samples': 20066496, 'steps': 104512, 'loss/train': 1.6304038763046265} 11/07/2021 11:53:37 - INFO - __main__ - Step 104514: {'lr': 0.00010775914677353507, 'samples': 20066688, 'steps': 104513, 'loss/train': 0.6740509271621704} 11/07/2021 11:53:37 - INFO - __main__ - Step 104515: {'lr': 0.00010775478273624743, 'samples': 20066880, 'steps': 104514, 'loss/train': 1.4455993175506592} 11/07/2021 11:53:37 - INFO - __main__ - Step 104516: {'lr': 0.00010775041876305328, 'samples': 20067072, 'steps': 104515, 'loss/train': 1.5396957397460938} 11/07/2021 11:53:38 - INFO - __main__ - Step 104517: {'lr': 0.00010774605485395458, 'samples': 20067264, 'steps': 104516, 'loss/train': 0.6117398142814636} 11/07/2021 11:53:39 - INFO - __main__ - Step 104518: {'lr': 0.00010774169100895332, 'samples': 20067456, 'steps': 104517, 'loss/train': 1.4858359098434448} 11/07/2021 11:53:39 - INFO - __main__ - Step 104519: {'lr': 0.00010773732722805146, 'samples': 20067648, 'steps': 104518, 'loss/train': 0.8113764524459839} 11/07/2021 11:53:39 - INFO - __main__ - Step 104520: {'lr': 0.00010773296351125095, 'samples': 20067840, 'steps': 104519, 'loss/train': 1.2594982385635376} 11/07/2021 11:53:40 - INFO - __main__ - Step 104521: {'lr': 0.00010772859985855379, 'samples': 20068032, 'steps': 104520, 'loss/train': 0.7455692887306213} 11/07/2021 11:53:40 - INFO - __main__ - Step 104522: {'lr': 0.00010772423626996192, 'samples': 20068224, 'steps': 104521, 'loss/train': 1.255312442779541} 11/07/2021 11:53:41 - INFO - __main__ - Step 104523: {'lr': 0.00010771987274547732, 'samples': 20068416, 'steps': 104522, 'loss/train': 1.560581088066101} 11/07/2021 11:53:42 - INFO - __main__ - Step 104524: {'lr': 0.00010771550928510196, 'samples': 20068608, 'steps': 104523, 'loss/train': 1.369981288909912} 11/07/2021 11:53:42 - INFO - __main__ - Step 104525: {'lr': 0.00010771114588883787, 'samples': 20068800, 'steps': 104524, 'loss/train': 1.3748503923416138} 11/07/2021 11:53:42 - INFO - __main__ - Step 104526: {'lr': 0.00010770678255668684, 'samples': 20068992, 'steps': 104525, 'loss/train': 1.8447716236114502} 11/07/2021 11:53:43 - INFO - __main__ - Step 104527: {'lr': 0.00010770241928865097, 'samples': 20069184, 'steps': 104526, 'loss/train': 1.0718239545822144} 11/07/2021 11:53:44 - INFO - __main__ - Step 104528: {'lr': 0.00010769805608473218, 'samples': 20069376, 'steps': 104527, 'loss/train': 1.4329333305358887} 11/07/2021 11:53:44 - INFO - __main__ - Step 104529: {'lr': 0.00010769369294493245, 'samples': 20069568, 'steps': 104528, 'loss/train': 1.2209084033966064} 11/07/2021 11:53:45 - INFO - __main__ - Step 104530: {'lr': 0.00010768932986925373, 'samples': 20069760, 'steps': 104529, 'loss/train': 1.4502317905426025} 11/07/2021 11:53:45 - INFO - __main__ - Step 104531: {'lr': 0.00010768496685769802, 'samples': 20069952, 'steps': 104530, 'loss/train': 1.2387391328811646} 11/07/2021 11:53:45 - INFO - __main__ - Step 104532: {'lr': 0.00010768060391026727, 'samples': 20070144, 'steps': 104531, 'loss/train': 1.536970615386963} 11/07/2021 11:53:46 - INFO - __main__ - Step 104533: {'lr': 0.0001076762410269634, 'samples': 20070336, 'steps': 104532, 'loss/train': 1.0961147546768188} 11/07/2021 11:53:47 - INFO - __main__ - Step 104534: {'lr': 0.00010767187820778848, 'samples': 20070528, 'steps': 104533, 'loss/train': 1.5517573356628418} 11/07/2021 11:53:47 - INFO - __main__ - Step 104535: {'lr': 0.0001076675154527444, 'samples': 20070720, 'steps': 104534, 'loss/train': 1.1573545932769775} 11/07/2021 11:53:47 - INFO - __main__ - Step 104536: {'lr': 0.00010766315276183323, 'samples': 20070912, 'steps': 104535, 'loss/train': 1.6657580137252808} 11/07/2021 11:53:48 - INFO - __main__ - Step 104537: {'lr': 0.00010765879013505673, 'samples': 20071104, 'steps': 104536, 'loss/train': 1.278095006942749} 11/07/2021 11:53:49 - INFO - __main__ - Step 104538: {'lr': 0.000107654427572417, 'samples': 20071296, 'steps': 104537, 'loss/train': 1.440406084060669} 11/07/2021 11:53:49 - INFO - __main__ - Step 104539: {'lr': 0.00010765006507391601, 'samples': 20071488, 'steps': 104538, 'loss/train': 0.7795607447624207} 11/07/2021 11:53:49 - INFO - __main__ - Step 104540: {'lr': 0.00010764570263955567, 'samples': 20071680, 'steps': 104539, 'loss/train': 0.5472241640090942} 11/07/2021 11:53:50 - INFO - __main__ - Step 104541: {'lr': 0.000107641340269338, 'samples': 20071872, 'steps': 104540, 'loss/train': 1.4128048419952393} 11/07/2021 11:53:50 - INFO - __main__ - Step 104542: {'lr': 0.00010763697796326493, 'samples': 20072064, 'steps': 104541, 'loss/train': 1.264930248260498} 11/07/2021 11:53:51 - INFO - __main__ - Step 104543: {'lr': 0.00010763261572133845, 'samples': 20072256, 'steps': 104542, 'loss/train': 1.440892219543457} 11/07/2021 11:53:52 - INFO - __main__ - Step 104544: {'lr': 0.00010762825354356054, 'samples': 20072448, 'steps': 104543, 'loss/train': 1.2383081912994385} 11/07/2021 11:53:52 - INFO - __main__ - Step 104545: {'lr': 0.00010762389142993312, 'samples': 20072640, 'steps': 104544, 'loss/train': 4.091971397399902} 11/07/2021 11:53:53 - INFO - __main__ - Step 104546: {'lr': 0.00010761952938045816, 'samples': 20072832, 'steps': 104545, 'loss/train': 1.4222962856292725} 11/07/2021 11:53:53 - INFO - __main__ - Step 104547: {'lr': 0.00010761516739513777, 'samples': 20073024, 'steps': 104546, 'loss/train': 0.41833773255348206} 11/07/2021 11:53:53 - INFO - __main__ - Step 104548: {'lr': 0.00010761080547397367, 'samples': 20073216, 'steps': 104547, 'loss/train': 1.0273083448410034} 11/07/2021 11:53:55 - INFO - __main__ - Step 104549: {'lr': 0.00010760644361696795, 'samples': 20073408, 'steps': 104548, 'loss/train': 0.8370867371559143} 11/07/2021 11:53:55 - INFO - __main__ - Step 104550: {'lr': 0.00010760208182412257, 'samples': 20073600, 'steps': 104549, 'loss/train': 1.7767856121063232} 11/07/2021 11:53:55 - INFO - __main__ - Step 104551: {'lr': 0.0001075977200954395, 'samples': 20073792, 'steps': 104550, 'loss/train': 1.2640318870544434} 11/07/2021 11:53:56 - INFO - __main__ - Step 104552: {'lr': 0.00010759335843092068, 'samples': 20073984, 'steps': 104551, 'loss/train': 1.1244008541107178} 11/07/2021 11:53:56 - INFO - __main__ - Step 104553: {'lr': 0.00010758899683056811, 'samples': 20074176, 'steps': 104552, 'loss/train': 1.0124846696853638} 11/07/2021 11:53:57 - INFO - __main__ - Step 104554: {'lr': 0.00010758463529438376, 'samples': 20074368, 'steps': 104553, 'loss/train': 1.3366224765777588} 11/07/2021 11:53:57 - INFO - __main__ - Step 104555: {'lr': 0.00010758027382236953, 'samples': 20074560, 'steps': 104554, 'loss/train': 1.1557995080947876} 11/07/2021 11:53:58 - INFO - __main__ - Step 104556: {'lr': 0.00010757591241452749, 'samples': 20074752, 'steps': 104555, 'loss/train': 1.513776421546936} 11/07/2021 11:53:58 - INFO - __main__ - Step 104557: {'lr': 0.00010757155107085951, 'samples': 20074944, 'steps': 104556, 'loss/train': 1.3966730833053589} 11/07/2021 11:53:58 - INFO - __main__ - Step 104558: {'lr': 0.00010756718979136768, 'samples': 20075136, 'steps': 104557, 'loss/train': 1.5087344646453857} 11/07/2021 11:53:59 - INFO - __main__ - Step 104559: {'lr': 0.0001075628285760538, 'samples': 20075328, 'steps': 104558, 'loss/train': 0.7827159762382507} 11/07/2021 11:54:00 - INFO - __main__ - Step 104560: {'lr': 0.0001075584674249199, 'samples': 20075520, 'steps': 104559, 'loss/train': 0.9660841226577759} 11/07/2021 11:54:00 - INFO - __main__ - Step 104561: {'lr': 0.00010755410633796797, 'samples': 20075712, 'steps': 104560, 'loss/train': 1.2142914533615112} 11/07/2021 11:54:00 - INFO - __main__ - Step 104562: {'lr': 0.00010754974531519995, 'samples': 20075904, 'steps': 104561, 'loss/train': 1.4702919721603394} 11/07/2021 11:54:01 - INFO - __main__ - Step 104563: {'lr': 0.00010754538435661781, 'samples': 20076096, 'steps': 104562, 'loss/train': 1.5192183256149292} 11/07/2021 11:54:02 - INFO - __main__ - Step 104564: {'lr': 0.00010754102346222353, 'samples': 20076288, 'steps': 104563, 'loss/train': 1.565626621246338} 11/07/2021 11:54:02 - INFO - __main__ - Step 104565: {'lr': 0.00010753666263201906, 'samples': 20076480, 'steps': 104564, 'loss/train': 1.9008270502090454} 11/07/2021 11:54:03 - INFO - __main__ - Step 104566: {'lr': 0.00010753230186600638, 'samples': 20076672, 'steps': 104565, 'loss/train': 1.380635380744934} 11/07/2021 11:54:03 - INFO - __main__ - Step 104567: {'lr': 0.00010752794116418745, 'samples': 20076864, 'steps': 104566, 'loss/train': 0.23818282783031464} 11/07/2021 11:54:03 - INFO - __main__ - Step 104568: {'lr': 0.00010752358052656422, 'samples': 20077056, 'steps': 104567, 'loss/train': 0.7991412878036499} 11/07/2021 11:54:04 - INFO - __main__ - Step 104569: {'lr': 0.00010751921995313876, 'samples': 20077248, 'steps': 104568, 'loss/train': 1.6321765184402466} 11/07/2021 11:54:05 - INFO - __main__ - Step 104570: {'lr': 0.00010751485944391288, 'samples': 20077440, 'steps': 104569, 'loss/train': 1.1716187000274658} 11/07/2021 11:54:05 - INFO - __main__ - Step 104571: {'lr': 0.00010751049899888856, 'samples': 20077632, 'steps': 104570, 'loss/train': 1.0605783462524414} 11/07/2021 11:54:05 - INFO - __main__ - Step 104572: {'lr': 0.00010750613861806783, 'samples': 20077824, 'steps': 104571, 'loss/train': 1.1866053342819214} 11/07/2021 11:54:06 - INFO - __main__ - Step 104573: {'lr': 0.00010750177830145264, 'samples': 20078016, 'steps': 104572, 'loss/train': 1.0997644662857056} 11/07/2021 11:54:06 - INFO - __main__ - Step 104574: {'lr': 0.00010749741804904494, 'samples': 20078208, 'steps': 104573, 'loss/train': 1.6367374658584595} 11/07/2021 11:54:07 - INFO - __main__ - Step 104575: {'lr': 0.00010749305786084671, 'samples': 20078400, 'steps': 104574, 'loss/train': 1.1759318113327026} 11/07/2021 11:54:07 - INFO - __main__ - Step 104576: {'lr': 0.0001074886977368599, 'samples': 20078592, 'steps': 104575, 'loss/train': 1.4354910850524902} 11/07/2021 11:54:08 - INFO - __main__ - Step 104577: {'lr': 0.00010748433767708649, 'samples': 20078784, 'steps': 104576, 'loss/train': 1.0284799337387085} 11/07/2021 11:54:08 - INFO - __main__ - Step 104578: {'lr': 0.00010747997768152845, 'samples': 20078976, 'steps': 104577, 'loss/train': 1.6953941583633423} 11/07/2021 11:54:08 - INFO - __main__ - Step 104579: {'lr': 0.00010747561775018771, 'samples': 20079168, 'steps': 104578, 'loss/train': 1.0245128870010376} 11/07/2021 11:54:09 - INFO - __main__ - Step 104580: {'lr': 0.00010747125788306636, 'samples': 20079360, 'steps': 104579, 'loss/train': 1.5820012092590332} 11/07/2021 11:54:10 - INFO - __main__ - Step 104581: {'lr': 0.00010746689808016619, 'samples': 20079552, 'steps': 104580, 'loss/train': 1.3112726211547852} 11/07/2021 11:54:10 - INFO - __main__ - Step 104582: {'lr': 0.0001074625383414892, 'samples': 20079744, 'steps': 104581, 'loss/train': 1.142626166343689} 11/07/2021 11:54:11 - INFO - __main__ - Step 104583: {'lr': 0.00010745817866703741, 'samples': 20079936, 'steps': 104582, 'loss/train': 1.437633752822876} 11/07/2021 11:54:11 - INFO - __main__ - Step 104584: {'lr': 0.00010745381905681276, 'samples': 20080128, 'steps': 104583, 'loss/train': 1.2578068971633911} 11/07/2021 11:54:12 - INFO - __main__ - Step 104585: {'lr': 0.00010744945951081722, 'samples': 20080320, 'steps': 104584, 'loss/train': 1.0696380138397217} 11/07/2021 11:54:12 - INFO - __main__ - Step 104586: {'lr': 0.00010744510002905278, 'samples': 20080512, 'steps': 104585, 'loss/train': 1.1305371522903442} 11/07/2021 11:54:13 - INFO - __main__ - Step 104587: {'lr': 0.00010744074061152134, 'samples': 20080704, 'steps': 104586, 'loss/train': 1.6456869840621948} 11/07/2021 11:54:13 - INFO - __main__ - Step 104588: {'lr': 0.00010743638125822491, 'samples': 20080896, 'steps': 104587, 'loss/train': 1.6650640964508057} 11/07/2021 11:54:13 - INFO - __main__ - Step 104589: {'lr': 0.00010743202196916546, 'samples': 20081088, 'steps': 104588, 'loss/train': 1.5230541229248047} 11/07/2021 11:54:14 - INFO - __main__ - Step 104590: {'lr': 0.00010742766274434493, 'samples': 20081280, 'steps': 104589, 'loss/train': 1.2497365474700928} 11/07/2021 11:54:15 - INFO - __main__ - Step 104591: {'lr': 0.00010742330358376531, 'samples': 20081472, 'steps': 104590, 'loss/train': 1.165547251701355} 11/07/2021 11:54:15 - INFO - __main__ - Step 104592: {'lr': 0.00010741894448742865, 'samples': 20081664, 'steps': 104591, 'loss/train': 1.646746277809143} 11/07/2021 11:54:15 - INFO - __main__ - Step 104593: {'lr': 0.00010741458545533669, 'samples': 20081856, 'steps': 104592, 'loss/train': 0.7073861360549927} 11/07/2021 11:54:16 - INFO - __main__ - Step 104594: {'lr': 0.00010741022648749153, 'samples': 20082048, 'steps': 104593, 'loss/train': 1.1805400848388672} 11/07/2021 11:54:16 - INFO - __main__ - Step 104595: {'lr': 0.00010740586758389511, 'samples': 20082240, 'steps': 104594, 'loss/train': 0.7972999811172485} 11/07/2021 11:54:17 - INFO - __main__ - Step 104596: {'lr': 0.00010740150874454943, 'samples': 20082432, 'steps': 104595, 'loss/train': 1.3062050342559814} 11/07/2021 11:54:18 - INFO - __main__ - Step 104597: {'lr': 0.0001073971499694564, 'samples': 20082624, 'steps': 104596, 'loss/train': 1.3323259353637695} 11/07/2021 11:54:18 - INFO - __main__ - Step 104598: {'lr': 0.00010739279125861807, 'samples': 20082816, 'steps': 104597, 'loss/train': 1.4323184490203857} 11/07/2021 11:54:18 - INFO - __main__ - Step 104599: {'lr': 0.00010738843261203629, 'samples': 20083008, 'steps': 104598, 'loss/train': 1.6667473316192627} 11/07/2021 11:54:19 - INFO - __main__ - Step 104600: {'lr': 0.00010738407402971309, 'samples': 20083200, 'steps': 104599, 'loss/train': 1.350697636604309} 11/07/2021 11:54:20 - INFO - __main__ - Step 104601: {'lr': 0.00010737971551165043, 'samples': 20083392, 'steps': 104600, 'loss/train': 1.146470069885254} 11/07/2021 11:54:20 - INFO - __main__ - Step 104602: {'lr': 0.00010737535705785028, 'samples': 20083584, 'steps': 104601, 'loss/train': 1.7446863651275635} 11/07/2021 11:54:20 - INFO - __main__ - Step 104603: {'lr': 0.0001073709986683146, 'samples': 20083776, 'steps': 104602, 'loss/train': 1.4877008199691772} 11/07/2021 11:54:21 - INFO - __main__ - Step 104604: {'lr': 0.00010736664034304533, 'samples': 20083968, 'steps': 104603, 'loss/train': 1.2185994386672974} 11/07/2021 11:54:21 - INFO - __main__ - Step 104605: {'lr': 0.00010736228208204454, 'samples': 20084160, 'steps': 104604, 'loss/train': 1.4522202014923096} 11/07/2021 11:54:22 - INFO - __main__ - Step 104606: {'lr': 0.00010735792388531405, 'samples': 20084352, 'steps': 104605, 'loss/train': 0.5924108624458313} 11/07/2021 11:54:22 - INFO - __main__ - Step 104607: {'lr': 0.00010735356575285585, 'samples': 20084544, 'steps': 104606, 'loss/train': 1.1370670795440674} 11/07/2021 11:54:23 - INFO - __main__ - Step 104608: {'lr': 0.0001073492076846719, 'samples': 20084736, 'steps': 104607, 'loss/train': 1.123236060142517} 11/07/2021 11:54:23 - INFO - __main__ - Step 104609: {'lr': 0.00010734484968076425, 'samples': 20084928, 'steps': 104608, 'loss/train': 1.5375429391860962} 11/07/2021 11:54:24 - INFO - __main__ - Step 104610: {'lr': 0.00010734049174113478, 'samples': 20085120, 'steps': 104609, 'loss/train': 1.316432237625122} 11/07/2021 11:54:25 - INFO - __main__ - Step 104611: {'lr': 0.0001073361338657855, 'samples': 20085312, 'steps': 104610, 'loss/train': 0.8831010460853577} 11/07/2021 11:54:25 - INFO - __main__ - Step 104612: {'lr': 0.00010733177605471834, 'samples': 20085504, 'steps': 104611, 'loss/train': 1.0069884061813354} 11/07/2021 11:54:25 - INFO - __main__ - Step 104613: {'lr': 0.0001073274183079353, 'samples': 20085696, 'steps': 104612, 'loss/train': 1.296041488647461} 11/07/2021 11:54:26 - INFO - __main__ - Step 104614: {'lr': 0.0001073230606254383, 'samples': 20085888, 'steps': 104613, 'loss/train': 1.558160662651062} 11/07/2021 11:54:26 - INFO - __main__ - Step 104615: {'lr': 0.00010731870300722934, 'samples': 20086080, 'steps': 104614, 'loss/train': 1.2740039825439453} 11/07/2021 11:54:27 - INFO - __main__ - Step 104616: {'lr': 0.00010731434545331036, 'samples': 20086272, 'steps': 104615, 'loss/train': 0.7662404179573059} 11/07/2021 11:54:28 - INFO - __main__ - Step 104617: {'lr': 0.00010730998796368336, 'samples': 20086464, 'steps': 104616, 'loss/train': 1.45698881149292} 11/07/2021 11:54:28 - INFO - __main__ - Step 104618: {'lr': 0.00010730563053835024, 'samples': 20086656, 'steps': 104617, 'loss/train': 1.5027574300765991} 11/07/2021 11:54:28 - INFO - __main__ - Step 104619: {'lr': 0.00010730127317731311, 'samples': 20086848, 'steps': 104618, 'loss/train': 0.8776001930236816} 11/07/2021 11:54:29 - INFO - __main__ - Step 104620: {'lr': 0.00010729691588057374, 'samples': 20087040, 'steps': 104619, 'loss/train': 1.3849742412567139} 11/07/2021 11:54:29 - INFO - __main__ - Step 104621: {'lr': 0.0001072925586481342, 'samples': 20087232, 'steps': 104620, 'loss/train': 1.2492402791976929} 11/07/2021 11:54:30 - INFO - __main__ - Step 104622: {'lr': 0.00010728820147999638, 'samples': 20087424, 'steps': 104621, 'loss/train': 1.6309314966201782} 11/07/2021 11:54:31 - INFO - __main__ - Step 104623: {'lr': 0.00010728384437616234, 'samples': 20087616, 'steps': 104622, 'loss/train': 1.1219061613082886} 11/07/2021 11:54:31 - INFO - __main__ - Step 104624: {'lr': 0.00010727948733663398, 'samples': 20087808, 'steps': 104623, 'loss/train': 1.4723485708236694} 11/07/2021 11:54:31 - INFO - __main__ - Step 104625: {'lr': 0.00010727513036141327, 'samples': 20088000, 'steps': 104624, 'loss/train': 1.7935796976089478} 11/07/2021 11:54:32 - INFO - __main__ - Step 104626: {'lr': 0.00010727077345050218, 'samples': 20088192, 'steps': 104625, 'loss/train': 0.5765485167503357} 11/07/2021 11:54:33 - INFO - __main__ - Step 104627: {'lr': 0.00010726641660390269, 'samples': 20088384, 'steps': 104626, 'loss/train': 1.235879898071289} 11/07/2021 11:54:33 - INFO - __main__ - Step 104628: {'lr': 0.00010726205982161674, 'samples': 20088576, 'steps': 104627, 'loss/train': 1.6717249155044556} 11/07/2021 11:54:33 - INFO - __main__ - Step 104629: {'lr': 0.00010725770310364633, 'samples': 20088768, 'steps': 104628, 'loss/train': 1.623910665512085} 11/07/2021 11:54:34 - INFO - __main__ - Step 104630: {'lr': 0.00010725334644999338, 'samples': 20088960, 'steps': 104629, 'loss/train': 1.4257056713104248} 11/07/2021 11:54:34 - INFO - __main__ - Step 104631: {'lr': 0.00010724898986065987, 'samples': 20089152, 'steps': 104630, 'loss/train': 1.365080714225769} 11/07/2021 11:54:35 - INFO - __main__ - Step 104632: {'lr': 0.00010724463333564785, 'samples': 20089344, 'steps': 104631, 'loss/train': 1.5907479524612427} 11/07/2021 11:54:36 - INFO - __main__ - Step 104633: {'lr': 0.0001072402768749591, 'samples': 20089536, 'steps': 104632, 'loss/train': 1.2299845218658447} 11/07/2021 11:54:36 - INFO - __main__ - Step 104634: {'lr': 0.00010723592047859567, 'samples': 20089728, 'steps': 104633, 'loss/train': 1.4583590030670166} 11/07/2021 11:54:36 - INFO - __main__ - Step 104635: {'lr': 0.00010723156414655955, 'samples': 20089920, 'steps': 104634, 'loss/train': 1.7274686098098755} 11/07/2021 11:54:37 - INFO - __main__ - Step 104636: {'lr': 0.00010722720787885268, 'samples': 20090112, 'steps': 104635, 'loss/train': 0.5385296940803528} 11/07/2021 11:54:37 - INFO - __main__ - Step 104637: {'lr': 0.00010722285167547702, 'samples': 20090304, 'steps': 104636, 'loss/train': 1.19523024559021} 11/07/2021 11:54:38 - INFO - __main__ - Step 104638: {'lr': 0.00010721849553643456, 'samples': 20090496, 'steps': 104637, 'loss/train': 1.0089384317398071} 11/07/2021 11:54:38 - INFO - __main__ - Step 104639: {'lr': 0.00010721413946172722, 'samples': 20090688, 'steps': 104638, 'loss/train': 1.028079867362976} 11/07/2021 11:54:39 - INFO - __main__ - Step 104640: {'lr': 0.00010720978345135698, 'samples': 20090880, 'steps': 104639, 'loss/train': 1.3473496437072754} 11/07/2021 11:54:39 - INFO - __main__ - Step 104641: {'lr': 0.00010720542750532583, 'samples': 20091072, 'steps': 104640, 'loss/train': 1.3377842903137207} 11/07/2021 11:54:40 - INFO - __main__ - Step 104642: {'lr': 0.00010720107162363571, 'samples': 20091264, 'steps': 104641, 'loss/train': 1.0975511074066162} 11/07/2021 11:54:41 - INFO - __main__ - Step 104643: {'lr': 0.00010719671580628856, 'samples': 20091456, 'steps': 104642, 'loss/train': 1.1689467430114746} 11/07/2021 11:54:41 - INFO - __main__ - Step 104644: {'lr': 0.00010719236005328639, 'samples': 20091648, 'steps': 104643, 'loss/train': 0.9434003233909607} 11/07/2021 11:54:41 - INFO - __main__ - Step 104645: {'lr': 0.00010718800436463114, 'samples': 20091840, 'steps': 104644, 'loss/train': 1.5041662454605103} 11/07/2021 11:54:42 - INFO - __main__ - Step 104646: {'lr': 0.00010718364874032485, 'samples': 20092032, 'steps': 104645, 'loss/train': 1.175686001777649} 11/07/2021 11:54:42 - INFO - __main__ - Step 104647: {'lr': 0.00010717929318036932, 'samples': 20092224, 'steps': 104646, 'loss/train': 1.523377537727356} 11/07/2021 11:54:43 - INFO - __main__ - Step 104648: {'lr': 0.0001071749376847666, 'samples': 20092416, 'steps': 104647, 'loss/train': 0.9406245350837708} 11/07/2021 11:54:43 - INFO - __main__ - Step 104649: {'lr': 0.00010717058225351864, 'samples': 20092608, 'steps': 104648, 'loss/train': 1.3794994354248047} 11/07/2021 11:54:44 - INFO - __main__ - Step 104650: {'lr': 0.00010716622688662742, 'samples': 20092800, 'steps': 104649, 'loss/train': 1.5396687984466553} 11/07/2021 11:54:44 - INFO - __main__ - Step 104651: {'lr': 0.00010716187158409488, 'samples': 20092992, 'steps': 104650, 'loss/train': 1.2820186614990234} 11/07/2021 11:54:44 - INFO - __main__ - Step 104652: {'lr': 0.000107157516345923, 'samples': 20093184, 'steps': 104651, 'loss/train': 1.5264177322387695} 11/07/2021 11:54:45 - INFO - __main__ - Step 104653: {'lr': 0.00010715316117211376, 'samples': 20093376, 'steps': 104652, 'loss/train': 1.4558809995651245} 11/07/2021 11:54:46 - INFO - __main__ - Step 104654: {'lr': 0.00010714880606266908, 'samples': 20093568, 'steps': 104653, 'loss/train': 1.5336343050003052} 11/07/2021 11:54:46 - INFO - __main__ - Step 104655: {'lr': 0.00010714445101759097, 'samples': 20093760, 'steps': 104654, 'loss/train': 1.4339569807052612} 11/07/2021 11:54:47 - INFO - __main__ - Step 104656: {'lr': 0.00010714009603688132, 'samples': 20093952, 'steps': 104655, 'loss/train': 1.6349050998687744} 11/07/2021 11:54:47 - INFO - __main__ - Step 104657: {'lr': 0.00010713574112054217, 'samples': 20094144, 'steps': 104656, 'loss/train': 1.8145390748977661} 11/07/2021 11:54:48 - INFO - __main__ - Step 104658: {'lr': 0.00010713138626857544, 'samples': 20094336, 'steps': 104657, 'loss/train': 1.3434455394744873} 11/07/2021 11:54:48 - INFO - __main__ - Step 104659: {'lr': 0.00010712703148098322, 'samples': 20094528, 'steps': 104658, 'loss/train': 1.3069086074829102} 11/07/2021 11:54:49 - INFO - __main__ - Step 104660: {'lr': 0.00010712267675776721, 'samples': 20094720, 'steps': 104659, 'loss/train': 1.654327392578125} 11/07/2021 11:54:49 - INFO - __main__ - Step 104661: {'lr': 0.00010711832209892955, 'samples': 20094912, 'steps': 104660, 'loss/train': 1.3336124420166016} 11/07/2021 11:54:49 - INFO - __main__ - Step 104662: {'lr': 0.00010711396750447217, 'samples': 20095104, 'steps': 104661, 'loss/train': 1.311354398727417} 11/07/2021 11:54:50 - INFO - __main__ - Step 104663: {'lr': 0.00010710961297439702, 'samples': 20095296, 'steps': 104662, 'loss/train': 1.0149381160736084} 11/07/2021 11:54:51 - INFO - __main__ - Step 104664: {'lr': 0.00010710525850870608, 'samples': 20095488, 'steps': 104663, 'loss/train': 1.202792763710022} 11/07/2021 11:54:51 - INFO - __main__ - Step 104665: {'lr': 0.00010710090410740133, 'samples': 20095680, 'steps': 104664, 'loss/train': 1.766806960105896} 11/07/2021 11:54:51 - INFO - __main__ - Step 104666: {'lr': 0.00010709654977048469, 'samples': 20095872, 'steps': 104665, 'loss/train': 1.395330786705017} 11/07/2021 11:54:52 - INFO - __main__ - Step 104667: {'lr': 0.00010709219549795812, 'samples': 20096064, 'steps': 104666, 'loss/train': 1.528607964515686} 11/07/2021 11:54:52 - INFO - __main__ - Step 104668: {'lr': 0.00010708784128982363, 'samples': 20096256, 'steps': 104667, 'loss/train': 1.5141938924789429} 11/07/2021 11:54:54 - INFO - __main__ - Step 104669: {'lr': 0.00010708348714608312, 'samples': 20096448, 'steps': 104668, 'loss/train': 1.1519516706466675} 11/07/2021 11:54:54 - INFO - __main__ - Step 104670: {'lr': 0.00010707913306673861, 'samples': 20096640, 'steps': 104669, 'loss/train': 2.052382707595825} 11/07/2021 11:54:54 - INFO - __main__ - Step 104671: {'lr': 0.00010707477905179206, 'samples': 20096832, 'steps': 104670, 'loss/train': 1.1590603590011597} 11/07/2021 11:54:55 - INFO - __main__ - Step 104672: {'lr': 0.00010707042510124545, 'samples': 20097024, 'steps': 104671, 'loss/train': 1.441550850868225} 11/07/2021 11:54:55 - INFO - __main__ - Step 104673: {'lr': 0.00010706607121510065, 'samples': 20097216, 'steps': 104672, 'loss/train': 1.2516435384750366} 11/07/2021 11:54:56 - INFO - __main__ - Step 104674: {'lr': 0.00010706171739335965, 'samples': 20097408, 'steps': 104673, 'loss/train': 0.7277579307556152} 11/07/2021 11:54:56 - INFO - __main__ - Step 104675: {'lr': 0.00010705736363602445, 'samples': 20097600, 'steps': 104674, 'loss/train': 1.119154691696167} 11/07/2021 11:54:57 - INFO - __main__ - Step 104676: {'lr': 0.00010705300994309697, 'samples': 20097792, 'steps': 104675, 'loss/train': 0.685964822769165} 11/07/2021 11:54:57 - INFO - __main__ - Step 104677: {'lr': 0.00010704865631457922, 'samples': 20097984, 'steps': 104676, 'loss/train': 0.9932401776313782} 11/07/2021 11:54:57 - INFO - __main__ - Step 104678: {'lr': 0.00010704430275047314, 'samples': 20098176, 'steps': 104677, 'loss/train': 2.3464725017547607} 11/07/2021 11:54:58 - INFO - __main__ - Step 104679: {'lr': 0.00010703994925078067, 'samples': 20098368, 'steps': 104678, 'loss/train': 1.677834153175354} 11/07/2021 11:54:59 - INFO - __main__ - Step 104680: {'lr': 0.00010703559581550382, 'samples': 20098560, 'steps': 104679, 'loss/train': 1.3263862133026123} 11/07/2021 11:54:59 - INFO - __main__ - Step 104681: {'lr': 0.00010703124244464452, 'samples': 20098752, 'steps': 104680, 'loss/train': 1.5476118326187134} 11/07/2021 11:54:59 - INFO - __main__ - Step 104682: {'lr': 0.00010702688913820471, 'samples': 20098944, 'steps': 104681, 'loss/train': 1.548412561416626} 11/07/2021 11:55:00 - INFO - __main__ - Step 104683: {'lr': 0.0001070225358961864, 'samples': 20099136, 'steps': 104682, 'loss/train': 1.4786938428878784} 11/07/2021 11:55:00 - INFO - __main__ - Step 104684: {'lr': 0.00010701818271859154, 'samples': 20099328, 'steps': 104683, 'loss/train': 1.3616509437561035} 11/07/2021 11:55:01 - INFO - __main__ - Step 104685: {'lr': 0.00010701382960542205, 'samples': 20099520, 'steps': 104684, 'loss/train': 1.3915194272994995} 11/07/2021 11:55:02 - INFO - __main__ - Step 104686: {'lr': 0.00010700947655668003, 'samples': 20099712, 'steps': 104685, 'loss/train': 1.3233305215835571} 11/07/2021 11:55:02 - INFO - __main__ - Step 104687: {'lr': 0.00010700512357236725, 'samples': 20099904, 'steps': 104686, 'loss/train': 1.2220014333724976} 11/07/2021 11:55:02 - INFO - __main__ - Step 104688: {'lr': 0.00010700077065248573, 'samples': 20100096, 'steps': 104687, 'loss/train': 1.180167555809021} 11/07/2021 11:55:03 - INFO - __main__ - Step 104689: {'lr': 0.00010699641779703748, 'samples': 20100288, 'steps': 104688, 'loss/train': 0.9564568996429443} 11/07/2021 11:55:04 - INFO - __main__ - Step 104690: {'lr': 0.00010699206500602443, 'samples': 20100480, 'steps': 104689, 'loss/train': 1.1300300359725952} 11/07/2021 11:55:04 - INFO - __main__ - Step 104691: {'lr': 0.00010698771227944857, 'samples': 20100672, 'steps': 104690, 'loss/train': 1.5626345872879028} 11/07/2021 11:55:04 - INFO - __main__ - Step 104692: {'lr': 0.00010698335961731179, 'samples': 20100864, 'steps': 104691, 'loss/train': 0.8084231019020081} 11/07/2021 11:55:05 - INFO - __main__ - Step 104693: {'lr': 0.00010697900701961614, 'samples': 20101056, 'steps': 104692, 'loss/train': 1.3016706705093384} 11/07/2021 11:55:05 - INFO - __main__ - Step 104694: {'lr': 0.00010697465448636354, 'samples': 20101248, 'steps': 104693, 'loss/train': 1.2417651414871216} 11/07/2021 11:55:06 - INFO - __main__ - Step 104695: {'lr': 0.00010697030201755592, 'samples': 20101440, 'steps': 104694, 'loss/train': 1.291439414024353} 11/07/2021 11:55:06 - INFO - __main__ - Step 104696: {'lr': 0.0001069659496131953, 'samples': 20101632, 'steps': 104695, 'loss/train': 1.1586934328079224} 11/07/2021 11:55:07 - INFO - __main__ - Step 104697: {'lr': 0.00010696159727328364, 'samples': 20101824, 'steps': 104696, 'loss/train': 1.549648404121399} 11/07/2021 11:55:07 - INFO - __main__ - Step 104698: {'lr': 0.00010695724499782284, 'samples': 20102016, 'steps': 104697, 'loss/train': 1.117824912071228} 11/07/2021 11:55:07 - INFO - __main__ - Step 104699: {'lr': 0.00010695289278681499, 'samples': 20102208, 'steps': 104698, 'loss/train': 1.6542773246765137} 11/07/2021 11:55:08 - INFO - __main__ - Step 104700: {'lr': 0.00010694854064026191, 'samples': 20102400, 'steps': 104699, 'loss/train': 1.5370680093765259} 11/07/2021 11:55:09 - INFO - __main__ - Step 104701: {'lr': 0.00010694418855816557, 'samples': 20102592, 'steps': 104700, 'loss/train': 1.6700760126113892} 11/07/2021 11:55:09 - INFO - __main__ - Step 104702: {'lr': 0.00010693983654052797, 'samples': 20102784, 'steps': 104701, 'loss/train': 1.638590693473816} 11/07/2021 11:55:10 - INFO - __main__ - Step 104703: {'lr': 0.00010693548458735109, 'samples': 20102976, 'steps': 104702, 'loss/train': 0.9951467514038086} 11/07/2021 11:55:10 - INFO - __main__ - Step 104704: {'lr': 0.00010693113269863688, 'samples': 20103168, 'steps': 104703, 'loss/train': 1.0140604972839355} 11/07/2021 11:55:11 - INFO - __main__ - Step 104705: {'lr': 0.00010692678087438729, 'samples': 20103360, 'steps': 104704, 'loss/train': 1.1339772939682007} 11/07/2021 11:55:11 - INFO - __main__ - Step 104706: {'lr': 0.00010692242911460426, 'samples': 20103552, 'steps': 104705, 'loss/train': 1.4175264835357666} 11/07/2021 11:55:12 - INFO - __main__ - Step 104707: {'lr': 0.0001069180774192898, 'samples': 20103744, 'steps': 104706, 'loss/train': 1.2646173238754272} 11/07/2021 11:55:12 - INFO - __main__ - Step 104708: {'lr': 0.00010691372578844582, 'samples': 20103936, 'steps': 104707, 'loss/train': 0.39238426089286804} 11/07/2021 11:55:12 - INFO - __main__ - Step 104709: {'lr': 0.00010690937422207434, 'samples': 20104128, 'steps': 104708, 'loss/train': 1.217407464981079} 11/07/2021 11:55:14 - INFO - __main__ - Step 104710: {'lr': 0.00010690502272017727, 'samples': 20104320, 'steps': 104709, 'loss/train': 1.307305932044983} 11/07/2021 11:55:14 - INFO - __main__ - Step 104711: {'lr': 0.0001069006712827566, 'samples': 20104512, 'steps': 104710, 'loss/train': 1.718228816986084} 11/07/2021 11:55:14 - INFO - __main__ - Step 104712: {'lr': 0.00010689631990981427, 'samples': 20104704, 'steps': 104711, 'loss/train': 1.5445446968078613} 11/07/2021 11:55:15 - INFO - __main__ - Step 104713: {'lr': 0.00010689196860135234, 'samples': 20104896, 'steps': 104712, 'loss/train': 1.3064231872558594} 11/07/2021 11:55:15 - INFO - __main__ - Step 104714: {'lr': 0.0001068876173573726, 'samples': 20105088, 'steps': 104713, 'loss/train': 1.3213518857955933} 11/07/2021 11:55:15 - INFO - __main__ - Step 104715: {'lr': 0.00010688326617787705, 'samples': 20105280, 'steps': 104714, 'loss/train': 1.7465349435806274} 11/07/2021 11:55:16 - INFO - __main__ - Step 104716: {'lr': 0.00010687891506286773, 'samples': 20105472, 'steps': 104715, 'loss/train': 0.9790735244750977} 11/07/2021 11:55:17 - INFO - __main__ - Step 104717: {'lr': 0.00010687456401234657, 'samples': 20105664, 'steps': 104716, 'loss/train': 1.7955447435379028} 11/07/2021 11:55:17 - INFO - __main__ - Step 104718: {'lr': 0.0001068702130263155, 'samples': 20105856, 'steps': 104717, 'loss/train': 1.54996919631958} 11/07/2021 11:55:17 - INFO - __main__ - Step 104719: {'lr': 0.00010686586210477652, 'samples': 20106048, 'steps': 104718, 'loss/train': 1.1291382312774658} 11/07/2021 11:55:18 - INFO - __main__ - Step 104720: {'lr': 0.00010686151124773157, 'samples': 20106240, 'steps': 104719, 'loss/train': 1.1718355417251587} 11/07/2021 11:55:19 - INFO - __main__ - Step 104721: {'lr': 0.00010685716045518263, 'samples': 20106432, 'steps': 104720, 'loss/train': 1.2124685049057007} 11/07/2021 11:55:19 - INFO - __main__ - Step 104722: {'lr': 0.0001068528097271316, 'samples': 20106624, 'steps': 104721, 'loss/train': 1.4331601858139038} 11/07/2021 11:55:19 - INFO - __main__ - Step 104723: {'lr': 0.00010684845906358052, 'samples': 20106816, 'steps': 104722, 'loss/train': 1.6675713062286377} 11/07/2021 11:55:20 - INFO - __main__ - Step 104724: {'lr': 0.0001068441084645313, 'samples': 20107008, 'steps': 104723, 'loss/train': 1.4688842296600342} 11/07/2021 11:55:20 - INFO - __main__ - Step 104725: {'lr': 0.00010683975792998593, 'samples': 20107200, 'steps': 104724, 'loss/train': 1.272539734840393} 11/07/2021 11:55:21 - INFO - __main__ - Step 104726: {'lr': 0.00010683540745994644, 'samples': 20107392, 'steps': 104725, 'loss/train': 1.7268115282058716} 11/07/2021 11:55:21 - INFO - __main__ - Step 104727: {'lr': 0.00010683105705441463, 'samples': 20107584, 'steps': 104726, 'loss/train': 1.305754542350769} 11/07/2021 11:55:22 - INFO - __main__ - Step 104728: {'lr': 0.0001068267067133925, 'samples': 20107776, 'steps': 104727, 'loss/train': 1.6455577611923218} 11/07/2021 11:55:22 - INFO - __main__ - Step 104729: {'lr': 0.00010682235643688207, 'samples': 20107968, 'steps': 104728, 'loss/train': 1.221250295639038} 11/07/2021 11:55:23 - INFO - __main__ - Step 104730: {'lr': 0.00010681800622488528, 'samples': 20108160, 'steps': 104729, 'loss/train': 1.0779035091400146} 11/07/2021 11:55:24 - INFO - __main__ - Step 104731: {'lr': 0.00010681365607740407, 'samples': 20108352, 'steps': 104730, 'loss/train': 3.6886277198791504} 11/07/2021 11:55:24 - INFO - __main__ - Step 104732: {'lr': 0.00010680930599444044, 'samples': 20108544, 'steps': 104731, 'loss/train': 1.5321719646453857} 11/07/2021 11:55:24 - INFO - __main__ - Step 104733: {'lr': 0.00010680495597599632, 'samples': 20108736, 'steps': 104732, 'loss/train': 1.263384222984314} 11/07/2021 11:55:25 - INFO - __main__ - Step 104734: {'lr': 0.00010680060602207368, 'samples': 20108928, 'steps': 104733, 'loss/train': 1.4073082208633423} 11/07/2021 11:55:25 - INFO - __main__ - Step 104735: {'lr': 0.00010679625613267446, 'samples': 20109120, 'steps': 104734, 'loss/train': 0.8543431162834167} 11/07/2021 11:55:25 - INFO - __main__ - Step 104736: {'lr': 0.00010679190630780065, 'samples': 20109312, 'steps': 104735, 'loss/train': 0.9853021502494812} 11/07/2021 11:55:26 - INFO - __main__ - Step 104737: {'lr': 0.00010678755654745418, 'samples': 20109504, 'steps': 104736, 'loss/train': 1.333220362663269} 11/07/2021 11:55:27 - INFO - __main__ - Step 104738: {'lr': 0.00010678320685163707, 'samples': 20109696, 'steps': 104737, 'loss/train': 0.9309832453727722} 11/07/2021 11:55:27 - INFO - __main__ - Step 104739: {'lr': 0.0001067788572203512, 'samples': 20109888, 'steps': 104738, 'loss/train': 1.3846101760864258} 11/07/2021 11:55:27 - INFO - __main__ - Step 104740: {'lr': 0.00010677450765359865, 'samples': 20110080, 'steps': 104739, 'loss/train': 1.0394078493118286} 11/07/2021 11:55:28 - INFO - __main__ - Step 104741: {'lr': 0.00010677015815138121, 'samples': 20110272, 'steps': 104740, 'loss/train': 1.3420183658599854} 11/07/2021 11:55:29 - INFO - __main__ - Step 104742: {'lr': 0.00010676580871370096, 'samples': 20110464, 'steps': 104741, 'loss/train': 0.9305358529090881} 11/07/2021 11:55:29 - INFO - __main__ - Step 104743: {'lr': 0.00010676145934055981, 'samples': 20110656, 'steps': 104742, 'loss/train': 1.5070528984069824} 11/07/2021 11:55:30 - INFO - __main__ - Step 104744: {'lr': 0.00010675711003195973, 'samples': 20110848, 'steps': 104743, 'loss/train': 1.4471001625061035} 11/07/2021 11:55:30 - INFO - __main__ - Step 104745: {'lr': 0.0001067527607879027, 'samples': 20111040, 'steps': 104744, 'loss/train': 1.2644000053405762} 11/07/2021 11:55:30 - INFO - __main__ - Step 104746: {'lr': 0.00010674841160839063, 'samples': 20111232, 'steps': 104745, 'loss/train': 1.3819893598556519} 11/07/2021 11:55:31 - INFO - __main__ - Step 104747: {'lr': 0.00010674406249342555, 'samples': 20111424, 'steps': 104746, 'loss/train': 1.3442611694335938} 11/07/2021 11:55:32 - INFO - __main__ - Step 104748: {'lr': 0.00010673971344300936, 'samples': 20111616, 'steps': 104747, 'loss/train': 1.6395232677459717} 11/07/2021 11:55:32 - INFO - __main__ - Step 104749: {'lr': 0.00010673536445714407, 'samples': 20111808, 'steps': 104748, 'loss/train': 1.6121317148208618} 11/07/2021 11:55:32 - INFO - __main__ - Step 104750: {'lr': 0.00010673101553583159, 'samples': 20112000, 'steps': 104749, 'loss/train': 1.609340786933899} 11/07/2021 11:55:33 - INFO - __main__ - Step 104751: {'lr': 0.0001067266666790739, 'samples': 20112192, 'steps': 104750, 'loss/train': 1.273408055305481} 11/07/2021 11:55:34 - INFO - __main__ - Step 104752: {'lr': 0.000106722317886873, 'samples': 20112384, 'steps': 104751, 'loss/train': 1.5366266965866089} 11/07/2021 11:55:34 - INFO - __main__ - Step 104753: {'lr': 0.00010671796915923087, 'samples': 20112576, 'steps': 104752, 'loss/train': 0.8258659243583679} 11/07/2021 11:55:35 - INFO - __main__ - Step 104754: {'lr': 0.00010671362049614933, 'samples': 20112768, 'steps': 104753, 'loss/train': 1.4903554916381836} 11/07/2021 11:55:35 - INFO - __main__ - Step 104755: {'lr': 0.00010670927189763044, 'samples': 20112960, 'steps': 104754, 'loss/train': 1.7278833389282227} 11/07/2021 11:55:35 - INFO - __main__ - Step 104756: {'lr': 0.00010670492336367613, 'samples': 20113152, 'steps': 104755, 'loss/train': 1.732537865638733} 11/07/2021 11:55:36 - INFO - __main__ - Step 104757: {'lr': 0.00010670057489428836, 'samples': 20113344, 'steps': 104756, 'loss/train': 0.7908181548118591} 11/07/2021 11:55:37 - INFO - __main__ - Step 104758: {'lr': 0.00010669622648946912, 'samples': 20113536, 'steps': 104757, 'loss/train': 0.060806240886449814} 11/07/2021 11:55:37 - INFO - __main__ - Step 104759: {'lr': 0.00010669187814922032, 'samples': 20113728, 'steps': 104758, 'loss/train': 1.0603548288345337} 11/07/2021 11:55:37 - INFO - __main__ - Step 104760: {'lr': 0.00010668752987354397, 'samples': 20113920, 'steps': 104759, 'loss/train': 1.7073588371276855} 11/07/2021 11:55:38 - INFO - __main__ - Step 104761: {'lr': 0.00010668318166244197, 'samples': 20114112, 'steps': 104760, 'loss/train': 0.997665524482727} 11/07/2021 11:55:38 - INFO - __main__ - Step 104762: {'lr': 0.00010667883351591637, 'samples': 20114304, 'steps': 104761, 'loss/train': 1.2115739583969116} 11/07/2021 11:55:39 - INFO - __main__ - Step 104763: {'lr': 0.00010667448543396904, 'samples': 20114496, 'steps': 104762, 'loss/train': 1.3254938125610352} 11/07/2021 11:55:40 - INFO - __main__ - Step 104764: {'lr': 0.000106670137416602, 'samples': 20114688, 'steps': 104763, 'loss/train': 1.1983991861343384} 11/07/2021 11:55:40 - INFO - __main__ - Step 104765: {'lr': 0.00010666578946381716, 'samples': 20114880, 'steps': 104764, 'loss/train': 1.3538399934768677} 11/07/2021 11:55:40 - INFO - __main__ - Step 104766: {'lr': 0.00010666144157561653, 'samples': 20115072, 'steps': 104765, 'loss/train': 1.4246861934661865} 11/07/2021 11:55:41 - INFO - __main__ - Step 104767: {'lr': 0.00010665709375200211, 'samples': 20115264, 'steps': 104766, 'loss/train': 1.4720115661621094} 11/07/2021 11:55:42 - INFO - __main__ - Step 104768: {'lr': 0.00010665274599297572, 'samples': 20115456, 'steps': 104767, 'loss/train': 0.9919803738594055} 11/07/2021 11:55:42 - INFO - __main__ - Step 104769: {'lr': 0.0001066483982985394, 'samples': 20115648, 'steps': 104768, 'loss/train': 1.5655369758605957} 11/07/2021 11:55:42 - INFO - __main__ - Step 104770: {'lr': 0.00010664405066869506, 'samples': 20115840, 'steps': 104769, 'loss/train': 1.163657784461975} 11/07/2021 11:55:43 - INFO - __main__ - Step 104771: {'lr': 0.00010663970310344474, 'samples': 20116032, 'steps': 104770, 'loss/train': 1.139891266822815} 11/07/2021 11:55:43 - INFO - __main__ - Step 104772: {'lr': 0.00010663535560279031, 'samples': 20116224, 'steps': 104771, 'loss/train': 1.5029587745666504} 11/07/2021 11:55:44 - INFO - __main__ - Step 104773: {'lr': 0.00010663100816673383, 'samples': 20116416, 'steps': 104772, 'loss/train': 1.3972018957138062} 11/07/2021 11:55:44 - INFO - __main__ - Step 104774: {'lr': 0.00010662666079527716, 'samples': 20116608, 'steps': 104773, 'loss/train': 1.2045565843582153} 11/07/2021 11:55:45 - INFO - __main__ - Step 104775: {'lr': 0.00010662231348842232, 'samples': 20116800, 'steps': 104774, 'loss/train': 1.4167135953903198} 11/07/2021 11:55:45 - INFO - __main__ - Step 104776: {'lr': 0.00010661796624617126, 'samples': 20116992, 'steps': 104775, 'loss/train': 1.524343729019165} 11/07/2021 11:55:45 - INFO - __main__ - Step 104777: {'lr': 0.00010661361906852592, 'samples': 20117184, 'steps': 104776, 'loss/train': 1.4291003942489624} 11/07/2021 11:55:47 - INFO - __main__ - Step 104778: {'lr': 0.00010660927195548828, 'samples': 20117376, 'steps': 104777, 'loss/train': 1.5663281679153442} 11/07/2021 11:55:47 - INFO - __main__ - Step 104779: {'lr': 0.00010660492490706031, 'samples': 20117568, 'steps': 104778, 'loss/train': 1.5643082857131958} 11/07/2021 11:55:47 - INFO - __main__ - Step 104780: {'lr': 0.00010660057792324401, 'samples': 20117760, 'steps': 104779, 'loss/train': 1.5052493810653687} 11/07/2021 11:55:48 - INFO - __main__ - Step 104781: {'lr': 0.0001065962310040412, 'samples': 20117952, 'steps': 104780, 'loss/train': 1.5245277881622314} 11/07/2021 11:55:48 - INFO - __main__ - Step 104782: {'lr': 0.0001065918841494539, 'samples': 20118144, 'steps': 104781, 'loss/train': 1.0371458530426025} 11/07/2021 11:55:48 - INFO - __main__ - Step 104783: {'lr': 0.0001065875373594841, 'samples': 20118336, 'steps': 104782, 'loss/train': 1.6731202602386475} 11/07/2021 11:55:49 - INFO - __main__ - Step 104784: {'lr': 0.00010658319063413372, 'samples': 20118528, 'steps': 104783, 'loss/train': 1.3199481964111328} 11/07/2021 11:55:50 - INFO - __main__ - Step 104785: {'lr': 0.00010657884397340475, 'samples': 20118720, 'steps': 104784, 'loss/train': 1.4837146997451782} 11/07/2021 11:55:50 - INFO - __main__ - Step 104786: {'lr': 0.00010657449737729915, 'samples': 20118912, 'steps': 104785, 'loss/train': 1.4058083295822144} 11/07/2021 11:55:50 - INFO - __main__ - Step 104787: {'lr': 0.00010657015084581886, 'samples': 20119104, 'steps': 104786, 'loss/train': 1.7916730642318726} 11/07/2021 11:55:51 - INFO - __main__ - Step 104788: {'lr': 0.00010656580437896588, 'samples': 20119296, 'steps': 104787, 'loss/train': 1.465476155281067} 11/07/2021 11:55:52 - INFO - __main__ - Step 104789: {'lr': 0.0001065614579767421, 'samples': 20119488, 'steps': 104788, 'loss/train': 1.371553897857666} 11/07/2021 11:55:52 - INFO - __main__ - Step 104790: {'lr': 0.00010655711163914952, 'samples': 20119680, 'steps': 104789, 'loss/train': 1.0457390546798706} 11/07/2021 11:55:52 - INFO - __main__ - Step 104791: {'lr': 0.0001065527653661901, 'samples': 20119872, 'steps': 104790, 'loss/train': 1.1897791624069214} 11/07/2021 11:55:53 - INFO - __main__ - Step 104792: {'lr': 0.00010654841915786579, 'samples': 20120064, 'steps': 104791, 'loss/train': 1.4564460515975952} 11/07/2021 11:55:53 - INFO - __main__ - Step 104793: {'lr': 0.00010654407301417862, 'samples': 20120256, 'steps': 104792, 'loss/train': 1.1839368343353271} 11/07/2021 11:55:54 - INFO - __main__ - Step 104794: {'lr': 0.0001065397269351304, 'samples': 20120448, 'steps': 104793, 'loss/train': 1.4067360162734985} 11/07/2021 11:55:55 - INFO - __main__ - Step 104795: {'lr': 0.00010653538092072316, 'samples': 20120640, 'steps': 104794, 'loss/train': 1.865673303604126} 11/07/2021 11:55:55 - INFO - __main__ - Step 104796: {'lr': 0.00010653103497095887, 'samples': 20120832, 'steps': 104795, 'loss/train': 1.5089120864868164} 11/07/2021 11:55:55 - INFO - __main__ - Step 104797: {'lr': 0.00010652668908583949, 'samples': 20121024, 'steps': 104796, 'loss/train': 0.6405284404754639} 11/07/2021 11:55:56 - INFO - __main__ - Step 104798: {'lr': 0.00010652234326536694, 'samples': 20121216, 'steps': 104797, 'loss/train': 1.4117261171340942} 11/07/2021 11:55:57 - INFO - __main__ - Step 104799: {'lr': 0.00010651799750954322, 'samples': 20121408, 'steps': 104798, 'loss/train': 1.2655311822891235} 11/07/2021 11:55:57 - INFO - __main__ - Step 104800: {'lr': 0.0001065136518183703, 'samples': 20121600, 'steps': 104799, 'loss/train': 1.456134557723999} 11/07/2021 11:55:57 - INFO - __main__ - Step 104801: {'lr': 0.00010650930619185009, 'samples': 20121792, 'steps': 104800, 'loss/train': 0.2743145525455475} 11/07/2021 11:55:58 - INFO - __main__ - Step 104802: {'lr': 0.00010650496062998457, 'samples': 20121984, 'steps': 104801, 'loss/train': 1.2476773262023926} 11/07/2021 11:55:58 - INFO - __main__ - Step 104803: {'lr': 0.00010650061513277573, 'samples': 20122176, 'steps': 104802, 'loss/train': 1.6581228971481323} 11/07/2021 11:55:59 - INFO - __main__ - Step 104804: {'lr': 0.00010649626970022547, 'samples': 20122368, 'steps': 104803, 'loss/train': 1.1453379392623901} 11/07/2021 11:55:59 - INFO - __main__ - Step 104805: {'lr': 0.00010649192433233587, 'samples': 20122560, 'steps': 104804, 'loss/train': 1.3723164796829224} 11/07/2021 11:56:00 - INFO - __main__ - Step 104806: {'lr': 0.00010648757902910872, 'samples': 20122752, 'steps': 104805, 'loss/train': 0.893791913986206} 11/07/2021 11:56:00 - INFO - __main__ - Step 104807: {'lr': 0.00010648323379054606, 'samples': 20122944, 'steps': 104806, 'loss/train': 1.0816106796264648} 11/07/2021 11:56:01 - INFO - __main__ - Step 104808: {'lr': 0.0001064788886166498, 'samples': 20123136, 'steps': 104807, 'loss/train': 1.4560985565185547} 11/07/2021 11:56:02 - INFO - __main__ - Step 104809: {'lr': 0.00010647454350742197, 'samples': 20123328, 'steps': 104808, 'loss/train': 1.4199752807617188} 11/07/2021 11:56:02 - INFO - __main__ - Step 104810: {'lr': 0.00010647019846286448, 'samples': 20123520, 'steps': 104809, 'loss/train': 1.7140111923217773} 11/07/2021 11:56:02 - INFO - __main__ - Step 104811: {'lr': 0.00010646585348297932, 'samples': 20123712, 'steps': 104810, 'loss/train': 1.6024303436279297} 11/07/2021 11:56:03 - INFO - __main__ - Step 104812: {'lr': 0.00010646150856776843, 'samples': 20123904, 'steps': 104811, 'loss/train': 1.2322471141815186} 11/07/2021 11:56:03 - INFO - __main__ - Step 104813: {'lr': 0.00010645716371723374, 'samples': 20124096, 'steps': 104812, 'loss/train': 1.4718750715255737} 11/07/2021 11:56:04 - INFO - __main__ - Step 104814: {'lr': 0.00010645281893137726, 'samples': 20124288, 'steps': 104813, 'loss/train': 1.2955008745193481} 11/07/2021 11:56:04 - INFO - __main__ - Step 104815: {'lr': 0.00010644847421020093, 'samples': 20124480, 'steps': 104814, 'loss/train': 1.020352840423584} 11/07/2021 11:56:05 - INFO - __main__ - Step 104816: {'lr': 0.0001064441295537068, 'samples': 20124672, 'steps': 104815, 'loss/train': 0.5434743165969849} 11/07/2021 11:56:05 - INFO - __main__ - Step 104817: {'lr': 0.00010643978496189663, 'samples': 20124864, 'steps': 104816, 'loss/train': 0.8320887088775635} 11/07/2021 11:56:05 - INFO - __main__ - Step 104818: {'lr': 0.00010643544043477247, 'samples': 20125056, 'steps': 104817, 'loss/train': 1.5240561962127686} 11/07/2021 11:56:06 - INFO - __main__ - Step 104819: {'lr': 0.00010643109597233628, 'samples': 20125248, 'steps': 104818, 'loss/train': 1.3303532600402832} 11/07/2021 11:56:07 - INFO - __main__ - Step 104820: {'lr': 0.00010642675157459003, 'samples': 20125440, 'steps': 104819, 'loss/train': 1.4501234292984009} 11/07/2021 11:56:07 - INFO - __main__ - Step 104821: {'lr': 0.00010642240724153568, 'samples': 20125632, 'steps': 104820, 'loss/train': 1.9116206169128418} 11/07/2021 11:56:07 - INFO - __main__ - Step 104822: {'lr': 0.00010641806297317516, 'samples': 20125824, 'steps': 104821, 'loss/train': 1.0827382802963257} 11/07/2021 11:56:08 - INFO - __main__ - Step 104823: {'lr': 0.00010641371876951045, 'samples': 20126016, 'steps': 104822, 'loss/train': 1.5514087677001953} 11/07/2021 11:56:08 - INFO - __main__ - Step 104824: {'lr': 0.00010640937463054351, 'samples': 20126208, 'steps': 104823, 'loss/train': 1.1947860717773438} 11/07/2021 11:56:09 - INFO - __main__ - Step 104825: {'lr': 0.0001064050305562763, 'samples': 20126400, 'steps': 104824, 'loss/train': 1.2205884456634521} 11/07/2021 11:56:10 - INFO - __main__ - Step 104826: {'lr': 0.00010640068654671084, 'samples': 20126592, 'steps': 104825, 'loss/train': 1.5556061267852783} 11/07/2021 11:56:10 - INFO - __main__ - Step 104827: {'lr': 0.00010639634260184894, 'samples': 20126784, 'steps': 104826, 'loss/train': 1.5255162715911865} 11/07/2021 11:56:10 - INFO - __main__ - Step 104828: {'lr': 0.00010639199872169262, 'samples': 20126976, 'steps': 104827, 'loss/train': 1.7246757745742798} 11/07/2021 11:56:11 - INFO - __main__ - Step 104829: {'lr': 0.00010638765490624383, 'samples': 20127168, 'steps': 104828, 'loss/train': 0.7464619874954224} 11/07/2021 11:56:12 - INFO - __main__ - Step 104830: {'lr': 0.00010638331115550459, 'samples': 20127360, 'steps': 104829, 'loss/train': 1.2285963296890259} 11/07/2021 11:56:12 - INFO - __main__ - Step 104831: {'lr': 0.00010637896746947678, 'samples': 20127552, 'steps': 104830, 'loss/train': 1.3937339782714844} 11/07/2021 11:56:12 - INFO - __main__ - Step 104832: {'lr': 0.0001063746238481624, 'samples': 20127744, 'steps': 104831, 'loss/train': 1.3573373556137085} 11/07/2021 11:56:13 - INFO - __main__ - Step 104833: {'lr': 0.0001063702802915634, 'samples': 20127936, 'steps': 104832, 'loss/train': 1.2807562351226807} 11/07/2021 11:56:13 - INFO - __main__ - Step 104834: {'lr': 0.00010636593679968173, 'samples': 20128128, 'steps': 104833, 'loss/train': 1.2189064025878906} 11/07/2021 11:56:14 - INFO - __main__ - Step 104835: {'lr': 0.00010636159337251938, 'samples': 20128320, 'steps': 104834, 'loss/train': 1.9115676879882812} 11/07/2021 11:56:14 - INFO - __main__ - Step 104836: {'lr': 0.00010635725001007826, 'samples': 20128512, 'steps': 104835, 'loss/train': 1.2985293865203857} 11/07/2021 11:56:15 - INFO - __main__ - Step 104837: {'lr': 0.00010635290671236041, 'samples': 20128704, 'steps': 104836, 'loss/train': 1.564009189605713} 11/07/2021 11:56:15 - INFO - __main__ - Step 104838: {'lr': 0.00010634856347936766, 'samples': 20128896, 'steps': 104837, 'loss/train': 1.3638392686843872} 11/07/2021 11:56:15 - INFO - __main__ - Step 104839: {'lr': 0.00010634422031110205, 'samples': 20129088, 'steps': 104838, 'loss/train': 1.4583121538162231} 11/07/2021 11:56:16 - INFO - __main__ - Step 104840: {'lr': 0.0001063398772075655, 'samples': 20129280, 'steps': 104839, 'loss/train': 1.1887428760528564} 11/07/2021 11:56:17 - INFO - __main__ - Step 104841: {'lr': 0.00010633553416875996, 'samples': 20129472, 'steps': 104840, 'loss/train': 1.2590070962905884} 11/07/2021 11:56:17 - INFO - __main__ - Step 104842: {'lr': 0.00010633119119468745, 'samples': 20129664, 'steps': 104841, 'loss/train': 0.8512518405914307} 11/07/2021 11:56:17 - INFO - __main__ - Step 104843: {'lr': 0.00010632684828534985, 'samples': 20129856, 'steps': 104842, 'loss/train': 1.4789592027664185} 11/07/2021 11:56:18 - INFO - __main__ - Step 104844: {'lr': 0.00010632250544074921, 'samples': 20130048, 'steps': 104843, 'loss/train': 1.580299735069275} 11/07/2021 11:56:19 - INFO - __main__ - Step 104845: {'lr': 0.00010631816266088737, 'samples': 20130240, 'steps': 104844, 'loss/train': 1.4069210290908813} 11/07/2021 11:56:19 - INFO - __main__ - Step 104846: {'lr': 0.00010631381994576639, 'samples': 20130432, 'steps': 104845, 'loss/train': 2.5670061111450195} 11/07/2021 11:56:20 - INFO - __main__ - Step 104847: {'lr': 0.00010630947729538818, 'samples': 20130624, 'steps': 104846, 'loss/train': 1.4394910335540771} 11/07/2021 11:56:20 - INFO - __main__ - Step 104848: {'lr': 0.00010630513470975478, 'samples': 20130816, 'steps': 104847, 'loss/train': 1.5127116441726685} 11/07/2021 11:56:20 - INFO - __main__ - Step 104849: {'lr': 0.00010630079218886798, 'samples': 20131008, 'steps': 104848, 'loss/train': 1.0222678184509277} 11/07/2021 11:56:21 - INFO - __main__ - Step 104850: {'lr': 0.00010629644973272984, 'samples': 20131200, 'steps': 104849, 'loss/train': 1.6259193420410156} 11/07/2021 11:56:22 - INFO - __main__ - Step 104851: {'lr': 0.00010629210734134228, 'samples': 20131392, 'steps': 104850, 'loss/train': 1.343137264251709} 11/07/2021 11:56:22 - INFO - __main__ - Step 104852: {'lr': 0.0001062877650147073, 'samples': 20131584, 'steps': 104851, 'loss/train': 1.1302530765533447} 11/07/2021 11:56:23 - INFO - __main__ - Step 104853: {'lr': 0.00010628342275282682, 'samples': 20131776, 'steps': 104852, 'loss/train': 1.2125836610794067} 11/07/2021 11:56:23 - INFO - __main__ - Step 104854: {'lr': 0.00010627908055570282, 'samples': 20131968, 'steps': 104853, 'loss/train': 1.4891841411590576} 11/07/2021 11:56:23 - INFO - __main__ - Step 104855: {'lr': 0.00010627473842333724, 'samples': 20132160, 'steps': 104854, 'loss/train': 1.4593063592910767} 11/07/2021 11:56:25 - INFO - __main__ - Step 104856: {'lr': 0.00010627039635573205, 'samples': 20132352, 'steps': 104855, 'loss/train': 1.4223923683166504} 11/07/2021 11:56:25 - INFO - __main__ - Step 104857: {'lr': 0.00010626605435288919, 'samples': 20132544, 'steps': 104856, 'loss/train': 1.421972393989563} 11/07/2021 11:56:25 - INFO - __main__ - Step 104858: {'lr': 0.00010626171241481067, 'samples': 20132736, 'steps': 104857, 'loss/train': 1.5233181715011597} 11/07/2021 11:56:26 - INFO - __main__ - Step 104859: {'lr': 0.00010625737054149836, 'samples': 20132928, 'steps': 104858, 'loss/train': 0.3320901095867157} 11/07/2021 11:56:26 - INFO - __main__ - Step 104860: {'lr': 0.00010625302873295428, 'samples': 20133120, 'steps': 104859, 'loss/train': 0.8120563626289368} 11/07/2021 11:56:27 - INFO - __main__ - Step 104861: {'lr': 0.00010624868698918044, 'samples': 20133312, 'steps': 104860, 'loss/train': 1.7003878355026245} 11/07/2021 11:56:28 - INFO - __main__ - Step 104862: {'lr': 0.00010624434531017865, 'samples': 20133504, 'steps': 104861, 'loss/train': 1.5524111986160278} 11/07/2021 11:56:28 - INFO - __main__ - Step 104863: {'lr': 0.00010624000369595093, 'samples': 20133696, 'steps': 104862, 'loss/train': 1.641858458518982} 11/07/2021 11:56:28 - INFO - __main__ - Step 104864: {'lr': 0.00010623566214649927, 'samples': 20133888, 'steps': 104863, 'loss/train': 1.3154776096343994} 11/07/2021 11:56:29 - INFO - __main__ - Step 104865: {'lr': 0.00010623132066182559, 'samples': 20134080, 'steps': 104864, 'loss/train': 0.5925225615501404} 11/07/2021 11:56:30 - INFO - __main__ - Step 104866: {'lr': 0.00010622697924193184, 'samples': 20134272, 'steps': 104865, 'loss/train': 1.1940665245056152} 11/07/2021 11:56:30 - INFO - __main__ - Step 104867: {'lr': 0.00010622263788682001, 'samples': 20134464, 'steps': 104866, 'loss/train': 1.3652362823486328} 11/07/2021 11:56:30 - INFO - __main__ - Step 104868: {'lr': 0.00010621829659649204, 'samples': 20134656, 'steps': 104867, 'loss/train': 1.7799382209777832} 11/07/2021 11:56:31 - INFO - __main__ - Step 104869: {'lr': 0.00010621395537094988, 'samples': 20134848, 'steps': 104868, 'loss/train': 1.4448399543762207} 11/07/2021 11:56:31 - INFO - __main__ - Step 104870: {'lr': 0.00010620961421019551, 'samples': 20135040, 'steps': 104869, 'loss/train': 1.3912022113800049} 11/07/2021 11:56:32 - INFO - __main__ - Step 104871: {'lr': 0.00010620527311423083, 'samples': 20135232, 'steps': 104870, 'loss/train': 1.386415719985962} 11/07/2021 11:56:32 - INFO - __main__ - Step 104872: {'lr': 0.00010620093208305789, 'samples': 20135424, 'steps': 104871, 'loss/train': 1.2662230730056763} 11/07/2021 11:56:33 - INFO - __main__ - Step 104873: {'lr': 0.00010619659111667857, 'samples': 20135616, 'steps': 104872, 'loss/train': 0.6944148540496826} 11/07/2021 11:56:33 - INFO - __main__ - Step 104874: {'lr': 0.0001061922502150949, 'samples': 20135808, 'steps': 104873, 'loss/train': 5.191986560821533} 11/07/2021 11:56:33 - INFO - __main__ - Step 104875: {'lr': 0.00010618790937830874, 'samples': 20136000, 'steps': 104874, 'loss/train': 1.6090492010116577} 11/07/2021 11:56:35 - INFO - __main__ - Step 104876: {'lr': 0.00010618356860632208, 'samples': 20136192, 'steps': 104875, 'loss/train': 1.0964504480361938} 11/07/2021 11:56:35 - INFO - __main__ - Step 104877: {'lr': 0.00010617922789913686, 'samples': 20136384, 'steps': 104876, 'loss/train': 1.3904436826705933} 11/07/2021 11:56:35 - INFO - __main__ - Step 104878: {'lr': 0.00010617488725675509, 'samples': 20136576, 'steps': 104877, 'loss/train': 1.2690129280090332} 11/07/2021 11:56:36 - INFO - __main__ - Step 104879: {'lr': 0.00010617054667917869, 'samples': 20136768, 'steps': 104878, 'loss/train': 1.455122709274292} 11/07/2021 11:56:36 - INFO - __main__ - Step 104880: {'lr': 0.0001061662061664096, 'samples': 20136960, 'steps': 104879, 'loss/train': 0.31418123841285706} 11/07/2021 11:56:36 - INFO - __main__ - Step 104881: {'lr': 0.00010616186571844982, 'samples': 20137152, 'steps': 104880, 'loss/train': 1.386563777923584} 11/07/2021 11:56:37 - INFO - __main__ - Step 104882: {'lr': 0.0001061575253353013, 'samples': 20137344, 'steps': 104881, 'loss/train': 1.41497004032135} 11/07/2021 11:56:38 - INFO - __main__ - Step 104883: {'lr': 0.00010615318501696594, 'samples': 20137536, 'steps': 104882, 'loss/train': 1.5751307010650635} 11/07/2021 11:56:38 - INFO - __main__ - Step 104884: {'lr': 0.00010614884476344575, 'samples': 20137728, 'steps': 104883, 'loss/train': 0.9283605813980103} 11/07/2021 11:56:38 - INFO - __main__ - Step 104885: {'lr': 0.00010614450457474267, 'samples': 20137920, 'steps': 104884, 'loss/train': 1.2172988653182983} 11/07/2021 11:56:39 - INFO - __main__ - Step 104886: {'lr': 0.00010614016445085866, 'samples': 20138112, 'steps': 104885, 'loss/train': 1.181249737739563} 11/07/2021 11:56:40 - INFO - __main__ - Step 104887: {'lr': 0.00010613582439179567, 'samples': 20138304, 'steps': 104886, 'loss/train': 1.9265460968017578} 11/07/2021 11:56:41 - INFO - __main__ - Step 104888: {'lr': 0.00010613148439755576, 'samples': 20138496, 'steps': 104887, 'loss/train': 1.1063858270645142} 11/07/2021 11:56:41 - INFO - __main__ - Step 104889: {'lr': 0.00010612714446814068, 'samples': 20138688, 'steps': 104888, 'loss/train': 1.985280156135559} 11/07/2021 11:56:41 - INFO - __main__ - Step 104890: {'lr': 0.00010612280460355247, 'samples': 20138880, 'steps': 104889, 'loss/train': 1.2826319932937622} 11/07/2021 11:56:42 - INFO - __main__ - Step 104891: {'lr': 0.00010611846480379314, 'samples': 20139072, 'steps': 104890, 'loss/train': 1.3986899852752686} 11/07/2021 11:56:43 - INFO - __main__ - Step 104892: {'lr': 0.00010611412506886459, 'samples': 20139264, 'steps': 104891, 'loss/train': 0.6522814035415649} 11/07/2021 11:56:43 - INFO - __main__ - Step 104893: {'lr': 0.0001061097853987688, 'samples': 20139456, 'steps': 104892, 'loss/train': 1.6089067459106445} 11/07/2021 11:56:43 - INFO - __main__ - Step 104894: {'lr': 0.00010610544579350773, 'samples': 20139648, 'steps': 104893, 'loss/train': 0.7180695533752441} 11/07/2021 11:56:44 - INFO - __main__ - Step 104895: {'lr': 0.00010610110625308331, 'samples': 20139840, 'steps': 104894, 'loss/train': 0.9121277332305908} 11/07/2021 11:56:44 - INFO - __main__ - Step 104896: {'lr': 0.00010609676677749752, 'samples': 20140032, 'steps': 104895, 'loss/train': 0.950313150882721} 11/07/2021 11:56:45 - INFO - __main__ - Step 104897: {'lr': 0.00010609242736675231, 'samples': 20140224, 'steps': 104896, 'loss/train': 1.2308640480041504} 11/07/2021 11:56:45 - INFO - __main__ - Step 104898: {'lr': 0.00010608808802084963, 'samples': 20140416, 'steps': 104897, 'loss/train': 1.3872698545455933} 11/07/2021 11:56:46 - INFO - __main__ - Step 104899: {'lr': 0.00010608374873979143, 'samples': 20140608, 'steps': 104898, 'loss/train': 1.3103611469268799} 11/07/2021 11:56:46 - INFO - __main__ - Step 104900: {'lr': 0.00010607940952357966, 'samples': 20140800, 'steps': 104899, 'loss/train': 1.422998070716858} 11/07/2021 11:56:47 - INFO - __main__ - Step 104901: {'lr': 0.0001060750703722164, 'samples': 20140992, 'steps': 104900, 'loss/train': 1.6083855628967285} 11/07/2021 11:56:48 - INFO - __main__ - Step 104902: {'lr': 0.00010607073128570339, 'samples': 20141184, 'steps': 104901, 'loss/train': 1.269245982170105} 11/07/2021 11:56:48 - INFO - __main__ - Step 104903: {'lr': 0.00010606639226404268, 'samples': 20141376, 'steps': 104902, 'loss/train': 1.5452532768249512} 11/07/2021 11:56:49 - INFO - __main__ - Step 104904: {'lr': 0.00010606205330723626, 'samples': 20141568, 'steps': 104903, 'loss/train': 1.1115001440048218} 11/07/2021 11:56:49 - INFO - __main__ - Step 104905: {'lr': 0.00010605771441528602, 'samples': 20141760, 'steps': 104904, 'loss/train': 4.1161885261535645} 11/07/2021 11:56:49 - INFO - __main__ - Step 104906: {'lr': 0.00010605337558819398, 'samples': 20141952, 'steps': 104905, 'loss/train': 5.393091678619385} 11/07/2021 11:56:50 - INFO - __main__ - Step 104907: {'lr': 0.00010604903682596207, 'samples': 20142144, 'steps': 104906, 'loss/train': 0.7278186678886414} 11/07/2021 11:56:50 - INFO - __main__ - Step 104908: {'lr': 0.00010604469812859224, 'samples': 20142336, 'steps': 104907, 'loss/train': 1.551966667175293} 11/07/2021 11:56:51 - INFO - __main__ - Step 104909: {'lr': 0.00010604035949608643, 'samples': 20142528, 'steps': 104908, 'loss/train': 1.1183736324310303} 11/07/2021 11:56:51 - INFO - __main__ - Step 104910: {'lr': 0.00010603602092844664, 'samples': 20142720, 'steps': 104909, 'loss/train': 1.5466928482055664} 11/07/2021 11:56:52 - INFO - __main__ - Step 104911: {'lr': 0.00010603168242567477, 'samples': 20142912, 'steps': 104910, 'loss/train': 1.2179648876190186} 11/07/2021 11:56:52 - INFO - __main__ - Step 104912: {'lr': 0.0001060273439877728, 'samples': 20143104, 'steps': 104911, 'loss/train': 1.1633070707321167} 11/07/2021 11:56:52 - INFO - __main__ - Step 104913: {'lr': 0.0001060230056147427, 'samples': 20143296, 'steps': 104912, 'loss/train': 1.7215162515640259} 11/07/2021 11:56:54 - INFO - __main__ - Step 104914: {'lr': 0.00010601866730658652, 'samples': 20143488, 'steps': 104913, 'loss/train': 1.0904191732406616} 11/07/2021 11:56:54 - INFO - __main__ - Step 104915: {'lr': 0.00010601432906330599, 'samples': 20143680, 'steps': 104914, 'loss/train': 1.426405429840088} 11/07/2021 11:56:54 - INFO - __main__ - Step 104916: {'lr': 0.0001060099908849032, 'samples': 20143872, 'steps': 104915, 'loss/train': 1.0203241109848022} 11/07/2021 11:56:55 - INFO - __main__ - Step 104917: {'lr': 0.00010600565277138008, 'samples': 20144064, 'steps': 104916, 'loss/train': 1.6755871772766113} 11/07/2021 11:56:55 - INFO - __main__ - Step 104918: {'lr': 0.00010600131472273858, 'samples': 20144256, 'steps': 104917, 'loss/train': 1.297507643699646} 11/07/2021 11:56:56 - INFO - __main__ - Step 104919: {'lr': 0.00010599697673898068, 'samples': 20144448, 'steps': 104918, 'loss/train': 1.1861162185668945} 11/07/2021 11:56:56 - INFO - __main__ - Step 104920: {'lr': 0.00010599263882010831, 'samples': 20144640, 'steps': 104919, 'loss/train': 0.9334940910339355} 11/07/2021 11:56:57 - INFO - __main__ - Step 104921: {'lr': 0.00010598830096612344, 'samples': 20144832, 'steps': 104920, 'loss/train': 1.4056018590927124} 11/07/2021 11:56:57 - INFO - __main__ - Step 104922: {'lr': 0.00010598396317702802, 'samples': 20145024, 'steps': 104921, 'loss/train': 1.3131749629974365} 11/07/2021 11:56:57 - INFO - __main__ - Step 104923: {'lr': 0.000105979625452824, 'samples': 20145216, 'steps': 104922, 'loss/train': 1.2857203483581543} 11/07/2021 11:56:58 - INFO - __main__ - Step 104924: {'lr': 0.00010597528779351335, 'samples': 20145408, 'steps': 104923, 'loss/train': 1.4795944690704346} 11/07/2021 11:56:59 - INFO - __main__ - Step 104925: {'lr': 0.000105970950199098, 'samples': 20145600, 'steps': 104924, 'loss/train': 1.5794765949249268} 11/07/2021 11:56:59 - INFO - __main__ - Step 104926: {'lr': 0.00010596661266957991, 'samples': 20145792, 'steps': 104925, 'loss/train': 1.3765596151351929} 11/07/2021 11:56:59 - INFO - __main__ - Step 104927: {'lr': 0.00010596227520496107, 'samples': 20145984, 'steps': 104926, 'loss/train': 1.4092113971710205} 11/07/2021 11:57:00 - INFO - __main__ - Step 104928: {'lr': 0.00010595793780524346, 'samples': 20146176, 'steps': 104927, 'loss/train': 1.2185992002487183} 11/07/2021 11:57:00 - INFO - __main__ - Step 104929: {'lr': 0.00010595360047042893, 'samples': 20146368, 'steps': 104928, 'loss/train': 1.0931273698806763} 11/07/2021 11:57:01 - INFO - __main__ - Step 104930: {'lr': 0.00010594926320051946, 'samples': 20146560, 'steps': 104929, 'loss/train': 1.066316843032837} 11/07/2021 11:57:02 - INFO - __main__ - Step 104931: {'lr': 0.00010594492599551703, 'samples': 20146752, 'steps': 104930, 'loss/train': 1.549232840538025} 11/07/2021 11:57:02 - INFO - __main__ - Step 104932: {'lr': 0.0001059405888554236, 'samples': 20146944, 'steps': 104931, 'loss/train': 1.5493650436401367} 11/07/2021 11:57:02 - INFO - __main__ - Step 104933: {'lr': 0.0001059362517802411, 'samples': 20147136, 'steps': 104932, 'loss/train': 1.4159595966339111} 11/07/2021 11:57:03 - INFO - __main__ - Step 104934: {'lr': 0.00010593191476997152, 'samples': 20147328, 'steps': 104933, 'loss/train': 1.136521816253662} 11/07/2021 11:57:04 - INFO - __main__ - Step 104935: {'lr': 0.00010592757782461679, 'samples': 20147520, 'steps': 104934, 'loss/train': 1.234717845916748} 11/07/2021 11:57:04 - INFO - __main__ - Step 104936: {'lr': 0.00010592324094417888, 'samples': 20147712, 'steps': 104935, 'loss/train': 1.5974518060684204} 11/07/2021 11:57:04 - INFO - __main__ - Step 104937: {'lr': 0.00010591890412865973, 'samples': 20147904, 'steps': 104936, 'loss/train': 1.0428496599197388} 11/07/2021 11:57:05 - INFO - __main__ - Step 104938: {'lr': 0.0001059145673780613, 'samples': 20148096, 'steps': 104937, 'loss/train': 1.2990401983261108} 11/07/2021 11:57:05 - INFO - __main__ - Step 104939: {'lr': 0.00010591023069238553, 'samples': 20148288, 'steps': 104938, 'loss/train': 1.458294153213501} 11/07/2021 11:57:06 - INFO - __main__ - Step 104940: {'lr': 0.00010590589407163439, 'samples': 20148480, 'steps': 104939, 'loss/train': 1.104562759399414} 11/07/2021 11:57:07 - INFO - __main__ - Step 104941: {'lr': 0.00010590155751580993, 'samples': 20148672, 'steps': 104940, 'loss/train': 1.2601901292800903} 11/07/2021 11:57:07 - INFO - __main__ - Step 104942: {'lr': 0.00010589722102491393, 'samples': 20148864, 'steps': 104941, 'loss/train': 1.7631659507751465} 11/07/2021 11:57:07 - INFO - __main__ - Step 104943: {'lr': 0.00010589288459894838, 'samples': 20149056, 'steps': 104942, 'loss/train': 1.5327945947647095} 11/07/2021 11:57:08 - INFO - __main__ - Step 104944: {'lr': 0.0001058885482379153, 'samples': 20149248, 'steps': 104943, 'loss/train': 1.6202201843261719} 11/07/2021 11:57:08 - INFO - __main__ - Step 104945: {'lr': 0.0001058842119418166, 'samples': 20149440, 'steps': 104944, 'loss/train': 1.3520119190216064} 11/07/2021 11:57:09 - INFO - __main__ - Step 104946: {'lr': 0.00010587987571065427, 'samples': 20149632, 'steps': 104945, 'loss/train': 1.3653489351272583} 11/07/2021 11:57:09 - INFO - __main__ - Step 104947: {'lr': 0.00010587553954443021, 'samples': 20149824, 'steps': 104946, 'loss/train': 1.5457180738449097} 11/07/2021 11:57:10 - INFO - __main__ - Step 104948: {'lr': 0.00010587120344314643, 'samples': 20150016, 'steps': 104947, 'loss/train': 1.3389081954956055} 11/07/2021 11:57:10 - INFO - __main__ - Step 104949: {'lr': 0.00010586686740680488, 'samples': 20150208, 'steps': 104948, 'loss/train': 1.2388657331466675} 11/07/2021 11:57:10 - INFO - __main__ - Step 104950: {'lr': 0.00010586253143540748, 'samples': 20150400, 'steps': 104949, 'loss/train': 1.2308317422866821} 11/07/2021 11:57:12 - INFO - __main__ - Step 104951: {'lr': 0.00010585819552895617, 'samples': 20150592, 'steps': 104950, 'loss/train': 1.3460760116577148} 11/07/2021 11:57:13 - INFO - __main__ - Step 104952: {'lr': 0.00010585385968745298, 'samples': 20150784, 'steps': 104951, 'loss/train': 1.757079839706421} 11/07/2021 11:57:13 - INFO - __main__ - Step 104953: {'lr': 0.00010584952391089981, 'samples': 20150976, 'steps': 104952, 'loss/train': 1.7229543924331665} 11/07/2021 11:57:13 - INFO - __main__ - Step 104954: {'lr': 0.00010584518819929858, 'samples': 20151168, 'steps': 104953, 'loss/train': 1.7494665384292603} 11/07/2021 11:57:14 - INFO - __main__ - Step 104955: {'lr': 0.00010584085255265142, 'samples': 20151360, 'steps': 104954, 'loss/train': 1.4709985256195068} 11/07/2021 11:57:14 - INFO - __main__ - Step 104956: {'lr': 0.00010583651697096003, 'samples': 20151552, 'steps': 104955, 'loss/train': 1.5309165716171265} 11/07/2021 11:57:14 - INFO - __main__ - Step 104957: {'lr': 0.0001058321814542265, 'samples': 20151744, 'steps': 104956, 'loss/train': 1.500573992729187} 11/07/2021 11:57:15 - INFO - __main__ - Step 104958: {'lr': 0.00010582784600245273, 'samples': 20151936, 'steps': 104957, 'loss/train': 1.504917860031128} 11/07/2021 11:57:16 - INFO - __main__ - Step 104959: {'lr': 0.00010582351061564075, 'samples': 20152128, 'steps': 104958, 'loss/train': 1.393662929534912} 11/07/2021 11:57:16 - INFO - __main__ - Step 104960: {'lr': 0.00010581917529379242, 'samples': 20152320, 'steps': 104959, 'loss/train': 1.3953139781951904} 11/07/2021 11:57:17 - INFO - __main__ - Step 104961: {'lr': 0.0001058148400369098, 'samples': 20152512, 'steps': 104960, 'loss/train': 0.983253002166748} 11/07/2021 11:57:17 - INFO - __main__ - Step 104962: {'lr': 0.00010581050484499477, 'samples': 20152704, 'steps': 104961, 'loss/train': 1.115997076034546} 11/07/2021 11:57:18 - INFO - __main__ - Step 104963: {'lr': 0.00010580616971804929, 'samples': 20152896, 'steps': 104962, 'loss/train': 1.8152427673339844} 11/07/2021 11:57:18 - INFO - __main__ - Step 104964: {'lr': 0.00010580183465607532, 'samples': 20153088, 'steps': 104963, 'loss/train': 1.3032490015029907} 11/07/2021 11:57:19 - INFO - __main__ - Step 104965: {'lr': 0.00010579749965907485, 'samples': 20153280, 'steps': 104964, 'loss/train': 1.3466532230377197} 11/07/2021 11:57:19 - INFO - __main__ - Step 104966: {'lr': 0.00010579316472704974, 'samples': 20153472, 'steps': 104965, 'loss/train': 1.4380515813827515} 11/07/2021 11:57:19 - INFO - __main__ - Step 104967: {'lr': 0.00010578882986000207, 'samples': 20153664, 'steps': 104966, 'loss/train': 1.1889660358428955} 11/07/2021 11:57:20 - INFO - __main__ - Step 104968: {'lr': 0.00010578449505793378, 'samples': 20153856, 'steps': 104967, 'loss/train': 1.2020961046218872} 11/07/2021 11:57:21 - INFO - __main__ - Step 104969: {'lr': 0.00010578016032084669, 'samples': 20154048, 'steps': 104968, 'loss/train': 1.0836623907089233} 11/07/2021 11:57:21 - INFO - __main__ - Step 104970: {'lr': 0.00010577582564874285, 'samples': 20154240, 'steps': 104969, 'loss/train': 1.5210328102111816} 11/07/2021 11:57:22 - INFO - __main__ - Step 104971: {'lr': 0.0001057714910416242, 'samples': 20154432, 'steps': 104970, 'loss/train': 1.5236109495162964} 11/07/2021 11:57:22 - INFO - __main__ - Step 104972: {'lr': 0.00010576715649949268, 'samples': 20154624, 'steps': 104971, 'loss/train': 1.2337347269058228} 11/07/2021 11:57:23 - INFO - __main__ - Step 104973: {'lr': 0.00010576282202235023, 'samples': 20154816, 'steps': 104972, 'loss/train': 1.0733662843704224} 11/07/2021 11:57:23 - INFO - __main__ - Step 104974: {'lr': 0.00010575848761019885, 'samples': 20155008, 'steps': 104973, 'loss/train': 1.2200565338134766} 11/07/2021 11:57:24 - INFO - __main__ - Step 104975: {'lr': 0.00010575415326304047, 'samples': 20155200, 'steps': 104974, 'loss/train': 1.6513078212738037} 11/07/2021 11:57:24 - INFO - __main__ - Step 104976: {'lr': 0.00010574981898087705, 'samples': 20155392, 'steps': 104975, 'loss/train': 1.6886781454086304} 11/07/2021 11:57:24 - INFO - __main__ - Step 104977: {'lr': 0.00010574548476371051, 'samples': 20155584, 'steps': 104976, 'loss/train': 1.6324406862258911} 11/07/2021 11:57:26 - INFO - __main__ - Step 104978: {'lr': 0.00010574115061154285, 'samples': 20155776, 'steps': 104977, 'loss/train': 1.1235828399658203} 11/07/2021 11:57:26 - INFO - __main__ - Step 104979: {'lr': 0.000105736816524376, 'samples': 20155968, 'steps': 104978, 'loss/train': 1.761168360710144} 11/07/2021 11:57:26 - INFO - __main__ - Step 104980: {'lr': 0.0001057324825022119, 'samples': 20156160, 'steps': 104979, 'loss/train': 1.4728481769561768} 11/07/2021 11:57:27 - INFO - __main__ - Step 104981: {'lr': 0.00010572814854505252, 'samples': 20156352, 'steps': 104980, 'loss/train': 0.3743956685066223} 11/07/2021 11:57:27 - INFO - __main__ - Step 104982: {'lr': 0.0001057238146528999, 'samples': 20156544, 'steps': 104981, 'loss/train': 1.493688941001892} 11/07/2021 11:57:27 - INFO - __main__ - Step 104983: {'lr': 0.00010571948082575583, 'samples': 20156736, 'steps': 104982, 'loss/train': 5.7229323387146} 11/07/2021 11:57:28 - INFO - __main__ - Step 104984: {'lr': 0.00010571514706362231, 'samples': 20156928, 'steps': 104983, 'loss/train': 1.2686561346054077} 11/07/2021 11:57:29 - INFO - __main__ - Step 104985: {'lr': 0.00010571081336650135, 'samples': 20157120, 'steps': 104984, 'loss/train': 1.4059149026870728} 11/07/2021 11:57:29 - INFO - __main__ - Step 104986: {'lr': 0.00010570647973439485, 'samples': 20157312, 'steps': 104985, 'loss/train': 0.8529099822044373} 11/07/2021 11:57:29 - INFO - __main__ - Step 104987: {'lr': 0.00010570214616730478, 'samples': 20157504, 'steps': 104986, 'loss/train': 1.4483401775360107} 11/07/2021 11:57:30 - INFO - __main__ - Step 104988: {'lr': 0.00010569781266523312, 'samples': 20157696, 'steps': 104987, 'loss/train': 1.1512740850448608} 11/07/2021 11:57:31 - INFO - __main__ - Step 104989: {'lr': 0.00010569347922818179, 'samples': 20157888, 'steps': 104988, 'loss/train': 1.3668138980865479} 11/07/2021 11:57:31 - INFO - __main__ - Step 104990: {'lr': 0.00010568914585615274, 'samples': 20158080, 'steps': 104989, 'loss/train': 1.2828694581985474} 11/07/2021 11:57:31 - INFO - __main__ - Step 104991: {'lr': 0.00010568481254914793, 'samples': 20158272, 'steps': 104990, 'loss/train': 1.296118974685669} 11/07/2021 11:57:32 - INFO - __main__ - Step 104992: {'lr': 0.00010568047930716932, 'samples': 20158464, 'steps': 104991, 'loss/train': 1.4046721458435059} 11/07/2021 11:57:32 - INFO - __main__ - Step 104993: {'lr': 0.00010567614613021887, 'samples': 20158656, 'steps': 104992, 'loss/train': 1.5121885538101196} 11/07/2021 11:57:33 - INFO - __main__ - Step 104994: {'lr': 0.0001056718130182985, 'samples': 20158848, 'steps': 104993, 'loss/train': 0.9731578230857849} 11/07/2021 11:57:34 - INFO - __main__ - Step 104995: {'lr': 0.00010566747997141029, 'samples': 20159040, 'steps': 104994, 'loss/train': 1.6322054862976074} 11/07/2021 11:57:34 - INFO - __main__ - Step 104996: {'lr': 0.000105663146989556, 'samples': 20159232, 'steps': 104995, 'loss/train': 0.08865385502576828} 11/07/2021 11:57:35 - INFO - __main__ - Step 104997: {'lr': 0.00010565881407273767, 'samples': 20159424, 'steps': 104996, 'loss/train': 2.0278515815734863} 11/07/2021 11:57:35 - INFO - __main__ - Step 104998: {'lr': 0.00010565448122095725, 'samples': 20159616, 'steps': 104997, 'loss/train': 1.1247024536132812} 11/07/2021 11:57:35 - INFO - __main__ - Step 104999: {'lr': 0.0001056501484342167, 'samples': 20159808, 'steps': 104998, 'loss/train': 1.529325246810913} 11/07/2021 11:57:36 - INFO - __main__ - Step 105000: {'lr': 0.00010564581571251794, 'samples': 20160000, 'steps': 104999, 'loss/train': 1.339005708694458} 11/07/2021 11:57:36 - INFO - __main__ - Evaluating and saving model checkpoint 11/07/2021 12:00:51 - INFO - __main__ - Step 105000: {'loss/eval': 1.2772471904754639, 'perplexity': 3.5867526531219482} 11/07/2021 12:01:08 - WARNING - huggingface_hub.repository - Several commits (7) will be pushed upstream. 11/07/2021 12:01:08 - WARNING - huggingface_hub.repository - The progress bars may be unreliable. 11/07/2021 12:01:39 - WARNING - huggingface_hub.repository - To https://huggingface.co/lvwerra/codeparrot-small 73e0bf4..c93cc06 proud-haze-135 -> proud-haze-135 11/07/2021 12:01:40 - INFO - __main__ - Step 105001: {'lr': 0.00010564148305586297, 'samples': 20160192, 'steps': 105000, 'loss/train': 1.3280091285705566} 11/07/2021 12:01:40 - INFO - __main__ - Step 105002: {'lr': 0.0001056371504642537, 'samples': 20160384, 'steps': 105001, 'loss/train': 1.503422737121582} 11/07/2021 12:01:41 - INFO - __main__ - Step 105003: {'lr': 0.00010563281793769211, 'samples': 20160576, 'steps': 105002, 'loss/train': 1.3766703605651855} 11/07/2021 12:01:41 - INFO - __main__ - Step 105004: {'lr': 0.00010562848547618017, 'samples': 20160768, 'steps': 105003, 'loss/train': 1.1379599571228027} 11/07/2021 12:01:42 - INFO - __main__ - Step 105005: {'lr': 0.00010562415307971979, 'samples': 20160960, 'steps': 105004, 'loss/train': 1.2535990476608276} 11/07/2021 12:01:42 - INFO - __main__ - Step 105006: {'lr': 0.00010561982074831292, 'samples': 20161152, 'steps': 105005, 'loss/train': 1.662860631942749} 11/07/2021 12:01:43 - INFO - __main__ - Step 105007: {'lr': 0.00010561548848196157, 'samples': 20161344, 'steps': 105006, 'loss/train': 1.1748247146606445} 11/07/2021 12:01:43 - INFO - __main__ - Step 105008: {'lr': 0.00010561115628066761, 'samples': 20161536, 'steps': 105007, 'loss/train': 0.5016255378723145} 11/07/2021 12:01:43 - INFO - __main__ - Step 105009: {'lr': 0.00010560682414443315, 'samples': 20161728, 'steps': 105008, 'loss/train': 0.4462848901748657} 11/07/2021 12:01:45 - INFO - __main__ - Step 105010: {'lr': 0.00010560249207325992, 'samples': 20161920, 'steps': 105009, 'loss/train': 1.1868723630905151} 11/07/2021 12:01:45 - INFO - __main__ - Step 105011: {'lr': 0.00010559816006715, 'samples': 20162112, 'steps': 105010, 'loss/train': 0.4240592122077942} 11/07/2021 12:01:45 - INFO - __main__ - Step 105012: {'lr': 0.00010559382812610529, 'samples': 20162304, 'steps': 105011, 'loss/train': 1.2222750186920166} 11/07/2021 12:01:46 - INFO - __main__ - Step 105013: {'lr': 0.00010558949625012782, 'samples': 20162496, 'steps': 105012, 'loss/train': 1.3152060508728027} 11/07/2021 12:01:46 - INFO - __main__ - Step 105014: {'lr': 0.00010558516443921946, 'samples': 20162688, 'steps': 105013, 'loss/train': 1.3860414028167725} 11/07/2021 12:01:48 - INFO - __main__ - Step 105015: {'lr': 0.0001055808326933822, 'samples': 20162880, 'steps': 105014, 'loss/train': 0.8549039959907532} 11/07/2021 12:01:48 - INFO - __main__ - Step 105016: {'lr': 0.00010557650101261798, 'samples': 20163072, 'steps': 105015, 'loss/train': 1.5119417905807495} 11/07/2021 12:01:48 - INFO - __main__ - Step 105017: {'lr': 0.00010557216939692879, 'samples': 20163264, 'steps': 105016, 'loss/train': 1.4512944221496582} 11/07/2021 12:01:49 - INFO - __main__ - Step 105018: {'lr': 0.00010556783784631651, 'samples': 20163456, 'steps': 105017, 'loss/train': 1.4606760740280151} 11/07/2021 12:01:49 - INFO - __main__ - Step 105019: {'lr': 0.00010556350636078318, 'samples': 20163648, 'steps': 105018, 'loss/train': 1.5298304557800293} 11/07/2021 12:01:49 - INFO - __main__ - Step 105020: {'lr': 0.00010555917494033068, 'samples': 20163840, 'steps': 105019, 'loss/train': 1.2514508962631226} 11/07/2021 12:01:50 - INFO - __main__ - Step 105021: {'lr': 0.00010555484358496099, 'samples': 20164032, 'steps': 105020, 'loss/train': 1.3810738325119019} 11/07/2021 12:01:51 - INFO - __main__ - Step 105022: {'lr': 0.00010555051229467613, 'samples': 20164224, 'steps': 105021, 'loss/train': 1.2553422451019287} 11/07/2021 12:01:51 - INFO - __main__ - Step 105023: {'lr': 0.00010554618106947792, 'samples': 20164416, 'steps': 105022, 'loss/train': 1.3239033222198486} 11/07/2021 12:01:52 - INFO - __main__ - Step 105024: {'lr': 0.00010554184990936836, 'samples': 20164608, 'steps': 105023, 'loss/train': 1.3184541463851929} 11/07/2021 12:01:52 - INFO - __main__ - Step 105025: {'lr': 0.00010553751881434942, 'samples': 20164800, 'steps': 105024, 'loss/train': 1.3790009021759033} 11/07/2021 12:01:52 - INFO - __main__ - Step 105026: {'lr': 0.00010553318778442303, 'samples': 20164992, 'steps': 105025, 'loss/train': 1.4604458808898926} 11/07/2021 12:01:54 - INFO - __main__ - Step 105027: {'lr': 0.00010552885681959119, 'samples': 20165184, 'steps': 105026, 'loss/train': 1.0467679500579834} 11/07/2021 12:01:54 - INFO - __main__ - Step 105028: {'lr': 0.00010552452591985579, 'samples': 20165376, 'steps': 105027, 'loss/train': 0.7043009996414185} 11/07/2021 12:01:55 - INFO - __main__ - Step 105029: {'lr': 0.00010552019508521879, 'samples': 20165568, 'steps': 105028, 'loss/train': 1.8611435890197754} 11/07/2021 12:01:55 - INFO - __main__ - Step 105030: {'lr': 0.00010551586431568219, 'samples': 20165760, 'steps': 105029, 'loss/train': 1.6543200016021729} 11/07/2021 12:01:55 - INFO - __main__ - Step 105031: {'lr': 0.00010551153361124791, 'samples': 20165952, 'steps': 105030, 'loss/train': 1.1091355085372925} 11/07/2021 12:01:56 - INFO - __main__ - Step 105032: {'lr': 0.00010550720297191787, 'samples': 20166144, 'steps': 105031, 'loss/train': 1.4152735471725464} 11/07/2021 12:01:56 - INFO - __main__ - Step 105033: {'lr': 0.0001055028723976941, 'samples': 20166336, 'steps': 105032, 'loss/train': 1.557138442993164} 11/07/2021 12:01:57 - INFO - __main__ - Step 105034: {'lr': 0.0001054985418885785, 'samples': 20166528, 'steps': 105033, 'loss/train': 0.06153500825166702} 11/07/2021 12:01:58 - INFO - __main__ - Step 105035: {'lr': 0.0001054942114445731, 'samples': 20166720, 'steps': 105034, 'loss/train': 1.5621901750564575} 11/07/2021 12:01:58 - INFO - __main__ - Step 105036: {'lr': 0.00010548988106567969, 'samples': 20166912, 'steps': 105035, 'loss/train': 1.3393617868423462} 11/07/2021 12:01:58 - INFO - __main__ - Step 105037: {'lr': 0.0001054855507519003, 'samples': 20167104, 'steps': 105036, 'loss/train': 1.0561460256576538} 11/07/2021 12:01:59 - INFO - __main__ - Step 105038: {'lr': 0.00010548122050323691, 'samples': 20167296, 'steps': 105037, 'loss/train': 1.633404016494751} 11/07/2021 12:02:00 - INFO - __main__ - Step 105039: {'lr': 0.00010547689031969146, 'samples': 20167488, 'steps': 105038, 'loss/train': 1.4779616594314575} 11/07/2021 12:02:00 - INFO - __main__ - Step 105040: {'lr': 0.00010547256020126589, 'samples': 20167680, 'steps': 105039, 'loss/train': 1.363181471824646} 11/07/2021 12:02:00 - INFO - __main__ - Step 105041: {'lr': 0.00010546823014796214, 'samples': 20167872, 'steps': 105040, 'loss/train': 0.4573241174221039} 11/07/2021 12:02:01 - INFO - __main__ - Step 105042: {'lr': 0.00010546390015978217, 'samples': 20168064, 'steps': 105041, 'loss/train': 1.2871180772781372} 11/07/2021 12:02:01 - INFO - __main__ - Step 105043: {'lr': 0.00010545957023672795, 'samples': 20168256, 'steps': 105042, 'loss/train': 0.7643063068389893} 11/07/2021 12:02:02 - INFO - __main__ - Step 105044: {'lr': 0.00010545524037880142, 'samples': 20168448, 'steps': 105043, 'loss/train': 1.2746649980545044} 11/07/2021 12:02:02 - INFO - __main__ - Step 105045: {'lr': 0.00010545091058600451, 'samples': 20168640, 'steps': 105044, 'loss/train': 1.3556270599365234} 11/07/2021 12:02:03 - INFO - __main__ - Step 105046: {'lr': 0.00010544658085833919, 'samples': 20168832, 'steps': 105045, 'loss/train': 0.8504562973976135} 11/07/2021 12:02:03 - INFO - __main__ - Step 105047: {'lr': 0.00010544225119580741, 'samples': 20169024, 'steps': 105046, 'loss/train': 2.2061657905578613} 11/07/2021 12:02:03 - INFO - __main__ - Step 105048: {'lr': 0.00010543792159841115, 'samples': 20169216, 'steps': 105047, 'loss/train': 1.443334698677063} 11/07/2021 12:02:04 - INFO - __main__ - Step 105049: {'lr': 0.00010543359206615242, 'samples': 20169408, 'steps': 105048, 'loss/train': 1.6078681945800781} 11/07/2021 12:02:05 - INFO - __main__ - Step 105050: {'lr': 0.00010542926259903296, 'samples': 20169600, 'steps': 105049, 'loss/train': 0.996918797492981} 11/07/2021 12:02:05 - INFO - __main__ - Step 105051: {'lr': 0.00010542493319705484, 'samples': 20169792, 'steps': 105050, 'loss/train': 1.6084650754928589} 11/07/2021 12:02:06 - INFO - __main__ - Step 105052: {'lr': 0.00010542060386022004, 'samples': 20169984, 'steps': 105051, 'loss/train': 0.7865095734596252} 11/07/2021 12:02:06 - INFO - __main__ - Step 105053: {'lr': 0.00010541627458853048, 'samples': 20170176, 'steps': 105052, 'loss/train': 0.8976998925209045} 11/07/2021 12:02:07 - INFO - __main__ - Step 105054: {'lr': 0.00010541194538198812, 'samples': 20170368, 'steps': 105053, 'loss/train': 1.170937418937683} 11/07/2021 12:02:07 - INFO - __main__ - Step 105055: {'lr': 0.00010540761624059489, 'samples': 20170560, 'steps': 105054, 'loss/train': 1.703934669494629} 11/07/2021 12:02:08 - INFO - __main__ - Step 105056: {'lr': 0.00010540328716435277, 'samples': 20170752, 'steps': 105055, 'loss/train': 1.500565767288208} 11/07/2021 12:02:08 - INFO - __main__ - Step 105057: {'lr': 0.00010539895815326369, 'samples': 20170944, 'steps': 105056, 'loss/train': 1.199719786643982} 11/07/2021 12:02:08 - INFO - __main__ - Step 105058: {'lr': 0.00010539462920732962, 'samples': 20171136, 'steps': 105057, 'loss/train': 1.3752644062042236} 11/07/2021 12:02:09 - INFO - __main__ - Step 105059: {'lr': 0.0001053903003265525, 'samples': 20171328, 'steps': 105058, 'loss/train': 1.7006720304489136} 11/07/2021 12:02:10 - INFO - __main__ - Step 105060: {'lr': 0.00010538597151093426, 'samples': 20171520, 'steps': 105059, 'loss/train': 1.9461288452148438} 11/07/2021 12:02:10 - INFO - __main__ - Step 105061: {'lr': 0.00010538164276047688, 'samples': 20171712, 'steps': 105060, 'loss/train': 1.7788728475570679} 11/07/2021 12:02:11 - INFO - __main__ - Step 105062: {'lr': 0.00010537731407518238, 'samples': 20171904, 'steps': 105061, 'loss/train': 1.4132206439971924} 11/07/2021 12:02:11 - INFO - __main__ - Step 105063: {'lr': 0.00010537298545505256, 'samples': 20172096, 'steps': 105062, 'loss/train': 1.2432252168655396} 11/07/2021 12:02:12 - INFO - __main__ - Step 105064: {'lr': 0.00010536865690008943, 'samples': 20172288, 'steps': 105063, 'loss/train': 1.418238639831543} 11/07/2021 12:02:12 - INFO - __main__ - Step 105065: {'lr': 0.00010536432841029497, 'samples': 20172480, 'steps': 105064, 'loss/train': 0.9640728235244751} 11/07/2021 12:02:13 - INFO - __main__ - Step 105066: {'lr': 0.00010535999998567108, 'samples': 20172672, 'steps': 105065, 'loss/train': 0.8571939468383789} 11/07/2021 12:02:13 - INFO - __main__ - Step 105067: {'lr': 0.00010535567162621975, 'samples': 20172864, 'steps': 105066, 'loss/train': 1.485898494720459} 11/07/2021 12:02:14 - INFO - __main__ - Step 105068: {'lr': 0.00010535134333194293, 'samples': 20173056, 'steps': 105067, 'loss/train': 1.5903958082199097} 11/07/2021 12:02:14 - INFO - __main__ - Step 105069: {'lr': 0.00010534701510284258, 'samples': 20173248, 'steps': 105068, 'loss/train': 1.7522355318069458} 11/07/2021 12:02:14 - INFO - __main__ - Step 105070: {'lr': 0.0001053426869389206, 'samples': 20173440, 'steps': 105069, 'loss/train': 1.0979504585266113} 11/07/2021 12:02:15 - INFO - __main__ - Step 105071: {'lr': 0.000105338358840179, 'samples': 20173632, 'steps': 105070, 'loss/train': 1.3428528308868408} 11/07/2021 12:02:16 - INFO - __main__ - Step 105072: {'lr': 0.00010533403080661968, 'samples': 20173824, 'steps': 105071, 'loss/train': 1.5073422193527222} 11/07/2021 12:02:16 - INFO - __main__ - Step 105073: {'lr': 0.00010532970283824472, 'samples': 20174016, 'steps': 105072, 'loss/train': 1.5116487741470337} 11/07/2021 12:02:16 - INFO - __main__ - Step 105074: {'lr': 0.00010532537493505587, 'samples': 20174208, 'steps': 105073, 'loss/train': 1.4128687381744385} 11/07/2021 12:02:17 - INFO - __main__ - Step 105075: {'lr': 0.00010532104709705517, 'samples': 20174400, 'steps': 105074, 'loss/train': 1.3349261283874512} 11/07/2021 12:02:18 - INFO - __main__ - Step 105076: {'lr': 0.00010531671932424458, 'samples': 20174592, 'steps': 105075, 'loss/train': 1.5720938444137573} 11/07/2021 12:02:18 - INFO - __main__ - Step 105077: {'lr': 0.00010531239161662603, 'samples': 20174784, 'steps': 105076, 'loss/train': 1.2263199090957642} 11/07/2021 12:02:18 - INFO - __main__ - Step 105078: {'lr': 0.00010530806397420151, 'samples': 20174976, 'steps': 105077, 'loss/train': 1.229313850402832} 11/07/2021 12:02:19 - INFO - __main__ - Step 105079: {'lr': 0.0001053037363969729, 'samples': 20175168, 'steps': 105078, 'loss/train': 0.7607823610305786} 11/07/2021 12:02:19 - INFO - __main__ - Step 105080: {'lr': 0.00010529940888494224, 'samples': 20175360, 'steps': 105079, 'loss/train': 1.747586965560913} 11/07/2021 12:02:20 - INFO - __main__ - Step 105081: {'lr': 0.00010529508143811142, 'samples': 20175552, 'steps': 105080, 'loss/train': 1.3711330890655518} 11/07/2021 12:02:21 - INFO - __main__ - Step 105082: {'lr': 0.00010529075405648239, 'samples': 20175744, 'steps': 105081, 'loss/train': 1.6606082916259766} 11/07/2021 12:02:21 - INFO - __main__ - Step 105083: {'lr': 0.00010528642674005712, 'samples': 20175936, 'steps': 105082, 'loss/train': 1.4554394483566284} 11/07/2021 12:02:21 - INFO - __main__ - Step 105084: {'lr': 0.00010528209948883763, 'samples': 20176128, 'steps': 105083, 'loss/train': 1.3915492296218872} 11/07/2021 12:02:22 - INFO - __main__ - Step 105085: {'lr': 0.00010527777230282572, 'samples': 20176320, 'steps': 105084, 'loss/train': 0.7161414623260498} 11/07/2021 12:02:23 - INFO - __main__ - Step 105086: {'lr': 0.00010527344518202341, 'samples': 20176512, 'steps': 105085, 'loss/train': 1.4949864149093628} 11/07/2021 12:02:23 - INFO - __main__ - Step 105087: {'lr': 0.00010526911812643266, 'samples': 20176704, 'steps': 105086, 'loss/train': 0.9578923583030701} 11/07/2021 12:02:23 - INFO - __main__ - Step 105088: {'lr': 0.00010526479113605539, 'samples': 20176896, 'steps': 105087, 'loss/train': 1.4732357263565063} 11/07/2021 12:02:24 - INFO - __main__ - Step 105089: {'lr': 0.00010526046421089358, 'samples': 20177088, 'steps': 105088, 'loss/train': 1.2827796936035156} 11/07/2021 12:02:24 - INFO - __main__ - Step 105090: {'lr': 0.00010525613735094919, 'samples': 20177280, 'steps': 105089, 'loss/train': 2.015859603881836} 11/07/2021 12:02:24 - INFO - __main__ - Step 105091: {'lr': 0.00010525181055622412, 'samples': 20177472, 'steps': 105090, 'loss/train': 1.0808829069137573} 11/07/2021 12:02:26 - INFO - __main__ - Step 105092: {'lr': 0.00010524748382672039, 'samples': 20177664, 'steps': 105091, 'loss/train': 0.4429701864719391} 11/07/2021 12:02:26 - INFO - __main__ - Step 105093: {'lr': 0.00010524315716243988, 'samples': 20177856, 'steps': 105092, 'loss/train': 1.24095618724823} 11/07/2021 12:02:26 - INFO - __main__ - Step 105094: {'lr': 0.00010523883056338457, 'samples': 20178048, 'steps': 105093, 'loss/train': 1.1591477394104004} 11/07/2021 12:02:27 - INFO - __main__ - Step 105095: {'lr': 0.00010523450402955651, 'samples': 20178240, 'steps': 105094, 'loss/train': 1.4304174184799194} 11/07/2021 12:02:27 - INFO - __main__ - Step 105096: {'lr': 0.00010523017756095745, 'samples': 20178432, 'steps': 105095, 'loss/train': 1.642323613166809} 11/07/2021 12:02:28 - INFO - __main__ - Step 105097: {'lr': 0.00010522585115758945, 'samples': 20178624, 'steps': 105096, 'loss/train': 0.8037461042404175} 11/07/2021 12:02:28 - INFO - __main__ - Step 105098: {'lr': 0.00010522152481945443, 'samples': 20178816, 'steps': 105097, 'loss/train': 1.5301772356033325} 11/07/2021 12:02:29 - INFO - __main__ - Step 105099: {'lr': 0.00010521719854655437, 'samples': 20179008, 'steps': 105098, 'loss/train': 1.4099189043045044} 11/07/2021 12:02:29 - INFO - __main__ - Step 105100: {'lr': 0.00010521287233889121, 'samples': 20179200, 'steps': 105099, 'loss/train': 1.5031651258468628} 11/07/2021 12:02:29 - INFO - __main__ - Step 105101: {'lr': 0.00010520854619646689, 'samples': 20179392, 'steps': 105100, 'loss/train': 1.4944875240325928} 11/07/2021 12:02:30 - INFO - __main__ - Step 105102: {'lr': 0.00010520422011928337, 'samples': 20179584, 'steps': 105101, 'loss/train': 1.5171186923980713} 11/07/2021 12:02:31 - INFO - __main__ - Step 105103: {'lr': 0.0001051998941073426, 'samples': 20179776, 'steps': 105102, 'loss/train': 1.290889024734497} 11/07/2021 12:02:31 - INFO - __main__ - Step 105104: {'lr': 0.00010519556816064649, 'samples': 20179968, 'steps': 105103, 'loss/train': 1.6403645277023315} 11/07/2021 12:02:31 - INFO - __main__ - Step 105105: {'lr': 0.00010519124227919705, 'samples': 20180160, 'steps': 105104, 'loss/train': 1.062207579612732} 11/07/2021 12:02:32 - INFO - __main__ - Step 105106: {'lr': 0.00010518691646299628, 'samples': 20180352, 'steps': 105105, 'loss/train': 1.4207466840744019} 11/07/2021 12:02:33 - INFO - __main__ - Step 105107: {'lr': 0.00010518259071204597, 'samples': 20180544, 'steps': 105106, 'loss/train': 1.4661411046981812} 11/07/2021 12:02:33 - INFO - __main__ - Step 105108: {'lr': 0.00010517826502634815, 'samples': 20180736, 'steps': 105107, 'loss/train': 1.1734308004379272} 11/07/2021 12:02:34 - INFO - __main__ - Step 105109: {'lr': 0.00010517393940590475, 'samples': 20180928, 'steps': 105108, 'loss/train': 1.4842277765274048} 11/07/2021 12:02:34 - INFO - __main__ - Step 105110: {'lr': 0.00010516961385071777, 'samples': 20181120, 'steps': 105109, 'loss/train': 1.163368582725525} 11/07/2021 12:02:34 - INFO - __main__ - Step 105111: {'lr': 0.00010516528836078912, 'samples': 20181312, 'steps': 105110, 'loss/train': 0.655147135257721} 11/07/2021 12:02:36 - INFO - __main__ - Step 105112: {'lr': 0.00010516096293612074, 'samples': 20181504, 'steps': 105111, 'loss/train': 1.0645720958709717} 11/07/2021 12:02:36 - INFO - __main__ - Step 105113: {'lr': 0.00010515663757671459, 'samples': 20181696, 'steps': 105112, 'loss/train': 1.4538705348968506} 11/07/2021 12:02:36 - INFO - __main__ - Step 105114: {'lr': 0.00010515231228257263, 'samples': 20181888, 'steps': 105113, 'loss/train': 0.6843137741088867} 11/07/2021 12:02:37 - INFO - __main__ - Step 105115: {'lr': 0.00010514798705369682, 'samples': 20182080, 'steps': 105114, 'loss/train': 1.088660717010498} 11/07/2021 12:02:37 - INFO - __main__ - Step 105116: {'lr': 0.00010514366189008909, 'samples': 20182272, 'steps': 105115, 'loss/train': 1.3550965785980225} 11/07/2021 12:02:38 - INFO - __main__ - Step 105117: {'lr': 0.00010513933679175147, 'samples': 20182464, 'steps': 105116, 'loss/train': 1.5137184858322144} 11/07/2021 12:02:38 - INFO - __main__ - Step 105118: {'lr': 0.00010513501175868573, 'samples': 20182656, 'steps': 105117, 'loss/train': 1.5540255308151245} 11/07/2021 12:02:39 - INFO - __main__ - Step 105119: {'lr': 0.00010513068679089394, 'samples': 20182848, 'steps': 105118, 'loss/train': 1.2911876440048218} 11/07/2021 12:02:39 - INFO - __main__ - Step 105120: {'lr': 0.00010512636188837801, 'samples': 20183040, 'steps': 105119, 'loss/train': 1.1869521141052246} 11/07/2021 12:02:39 - INFO - __main__ - Step 105121: {'lr': 0.00010512203705113991, 'samples': 20183232, 'steps': 105120, 'loss/train': 1.0379079580307007} 11/07/2021 12:02:41 - INFO - __main__ - Step 105122: {'lr': 0.0001051177122791816, 'samples': 20183424, 'steps': 105121, 'loss/train': 1.032241940498352} 11/07/2021 12:02:41 - INFO - __main__ - Step 105123: {'lr': 0.000105113387572505, 'samples': 20183616, 'steps': 105122, 'loss/train': 1.400663137435913} 11/07/2021 12:02:41 - INFO - __main__ - Step 105124: {'lr': 0.00010510906293111205, 'samples': 20183808, 'steps': 105123, 'loss/train': 1.2532705068588257} 11/07/2021 12:02:42 - INFO - __main__ - Step 105125: {'lr': 0.00010510473835500476, 'samples': 20184000, 'steps': 105124, 'loss/train': 1.4348926544189453} 11/07/2021 12:02:42 - INFO - __main__ - Step 105126: {'lr': 0.000105100413844185, 'samples': 20184192, 'steps': 105125, 'loss/train': 1.569088339805603} 11/07/2021 12:02:42 - INFO - __main__ - Step 105127: {'lr': 0.00010509608939865478, 'samples': 20184384, 'steps': 105126, 'loss/train': 0.9251816272735596} 11/07/2021 12:02:43 - INFO - __main__ - Step 105128: {'lr': 0.00010509176501841602, 'samples': 20184576, 'steps': 105127, 'loss/train': 1.5226824283599854} 11/07/2021 12:02:44 - INFO - __main__ - Step 105129: {'lr': 0.00010508744070347071, 'samples': 20184768, 'steps': 105128, 'loss/train': 1.5890439748764038} 11/07/2021 12:02:44 - INFO - __main__ - Step 105130: {'lr': 0.00010508311645382083, 'samples': 20184960, 'steps': 105129, 'loss/train': 1.3478693962097168} 11/07/2021 12:02:44 - INFO - __main__ - Step 105131: {'lr': 0.00010507879226946814, 'samples': 20185152, 'steps': 105130, 'loss/train': 1.1998802423477173} 11/07/2021 12:02:45 - INFO - __main__ - Step 105132: {'lr': 0.00010507446815041475, 'samples': 20185344, 'steps': 105131, 'loss/train': 1.3999155759811401} 11/07/2021 12:02:46 - INFO - __main__ - Step 105133: {'lr': 0.00010507014409666255, 'samples': 20185536, 'steps': 105132, 'loss/train': 1.5095202922821045} 11/07/2021 12:02:46 - INFO - __main__ - Step 105134: {'lr': 0.0001050658201082135, 'samples': 20185728, 'steps': 105133, 'loss/train': 1.3282712697982788} 11/07/2021 12:02:46 - INFO - __main__ - Step 105135: {'lr': 0.00010506149618506956, 'samples': 20185920, 'steps': 105134, 'loss/train': 1.8129079341888428} 11/07/2021 12:02:47 - INFO - __main__ - Step 105136: {'lr': 0.00010505717232723266, 'samples': 20186112, 'steps': 105135, 'loss/train': 1.670695424079895} 11/07/2021 12:02:47 - INFO - __main__ - Step 105137: {'lr': 0.00010505284853470479, 'samples': 20186304, 'steps': 105136, 'loss/train': 1.617851734161377} 11/07/2021 12:02:48 - INFO - __main__ - Step 105138: {'lr': 0.00010504852480748786, 'samples': 20186496, 'steps': 105137, 'loss/train': 0.2950579524040222} 11/07/2021 12:02:49 - INFO - __main__ - Step 105139: {'lr': 0.00010504420114558382, 'samples': 20186688, 'steps': 105138, 'loss/train': 1.7878975868225098} 11/07/2021 12:02:49 - INFO - __main__ - Step 105140: {'lr': 0.00010503987754899463, 'samples': 20186880, 'steps': 105139, 'loss/train': 1.2107300758361816} 11/07/2021 12:02:49 - INFO - __main__ - Step 105141: {'lr': 0.00010503555401772224, 'samples': 20187072, 'steps': 105140, 'loss/train': 1.5671298503875732} 11/07/2021 12:02:50 - INFO - __main__ - Step 105142: {'lr': 0.00010503123055176861, 'samples': 20187264, 'steps': 105141, 'loss/train': 1.4845006465911865} 11/07/2021 12:02:51 - INFO - __main__ - Step 105143: {'lr': 0.00010502690715113572, 'samples': 20187456, 'steps': 105142, 'loss/train': 1.3060476779937744} 11/07/2021 12:02:51 - INFO - __main__ - Step 105144: {'lr': 0.00010502258381582541, 'samples': 20187648, 'steps': 105143, 'loss/train': 1.3702003955841064} 11/07/2021 12:02:51 - INFO - __main__ - Step 105145: {'lr': 0.00010501826054583968, 'samples': 20187840, 'steps': 105144, 'loss/train': 1.3618391752243042} 11/07/2021 12:02:52 - INFO - __main__ - Step 105146: {'lr': 0.00010501393734118048, 'samples': 20188032, 'steps': 105145, 'loss/train': 1.224135160446167} 11/07/2021 12:02:52 - INFO - __main__ - Step 105147: {'lr': 0.00010500961420184976, 'samples': 20188224, 'steps': 105146, 'loss/train': 0.5495425462722778} 11/07/2021 12:02:53 - INFO - __main__ - Step 105148: {'lr': 0.00010500529112784945, 'samples': 20188416, 'steps': 105147, 'loss/train': 1.5578124523162842} 11/07/2021 12:02:54 - INFO - __main__ - Step 105149: {'lr': 0.00010500096811918156, 'samples': 20188608, 'steps': 105148, 'loss/train': 1.2051723003387451} 11/07/2021 12:02:54 - INFO - __main__ - Step 105150: {'lr': 0.00010499664517584798, 'samples': 20188800, 'steps': 105149, 'loss/train': 1.5320667028427124} 11/07/2021 12:02:54 - INFO - __main__ - Step 105151: {'lr': 0.00010499232229785067, 'samples': 20188992, 'steps': 105150, 'loss/train': 1.3384296894073486} 11/07/2021 12:02:55 - INFO - __main__ - Step 105152: {'lr': 0.00010498799948519158, 'samples': 20189184, 'steps': 105151, 'loss/train': 1.8737810850143433} 11/07/2021 12:02:55 - INFO - __main__ - Step 105153: {'lr': 0.00010498367673787266, 'samples': 20189376, 'steps': 105152, 'loss/train': 1.5140174627304077} 11/07/2021 12:02:56 - INFO - __main__ - Step 105154: {'lr': 0.00010497935405589587, 'samples': 20189568, 'steps': 105153, 'loss/train': 1.333654761314392} 11/07/2021 12:02:56 - INFO - __main__ - Step 105155: {'lr': 0.00010497503143926313, 'samples': 20189760, 'steps': 105154, 'loss/train': 0.11989620327949524} 11/07/2021 12:02:57 - INFO - __main__ - Step 105156: {'lr': 0.0001049707088879765, 'samples': 20189952, 'steps': 105155, 'loss/train': 1.167956829071045} 11/07/2021 12:02:57 - INFO - __main__ - Step 105157: {'lr': 0.00010496638640203774, 'samples': 20190144, 'steps': 105156, 'loss/train': 0.6522610783576965} 11/07/2021 12:02:57 - INFO - __main__ - Step 105158: {'lr': 0.00010496206398144888, 'samples': 20190336, 'steps': 105157, 'loss/train': 0.914226233959198} 11/07/2021 12:02:58 - INFO - __main__ - Step 105159: {'lr': 0.00010495774162621189, 'samples': 20190528, 'steps': 105158, 'loss/train': 1.3843944072723389} 11/07/2021 12:02:59 - INFO - __main__ - Step 105160: {'lr': 0.0001049534193363287, 'samples': 20190720, 'steps': 105159, 'loss/train': 1.5493844747543335} 11/07/2021 12:02:59 - INFO - __main__ - Step 105161: {'lr': 0.00010494909711180125, 'samples': 20190912, 'steps': 105160, 'loss/train': 1.0805574655532837} 11/07/2021 12:03:00 - INFO - __main__ - Step 105162: {'lr': 0.00010494477495263152, 'samples': 20191104, 'steps': 105161, 'loss/train': 1.4902329444885254} 11/07/2021 12:03:00 - INFO - __main__ - Step 105163: {'lr': 0.00010494045285882143, 'samples': 20191296, 'steps': 105162, 'loss/train': 1.4568257331848145} 11/07/2021 12:03:01 - INFO - __main__ - Step 105164: {'lr': 0.0001049361308303729, 'samples': 20191488, 'steps': 105163, 'loss/train': 2.532902717590332} 11/07/2021 12:03:01 - INFO - __main__ - Step 105165: {'lr': 0.00010493180886728796, 'samples': 20191680, 'steps': 105164, 'loss/train': 1.720020055770874} 11/07/2021 12:03:02 - INFO - __main__ - Step 105166: {'lr': 0.00010492748696956846, 'samples': 20191872, 'steps': 105165, 'loss/train': 1.3036680221557617} 11/07/2021 12:03:02 - INFO - __main__ - Step 105167: {'lr': 0.00010492316513721645, 'samples': 20192064, 'steps': 105166, 'loss/train': 1.0634613037109375} 11/07/2021 12:03:02 - INFO - __main__ - Step 105168: {'lr': 0.00010491884337023377, 'samples': 20192256, 'steps': 105167, 'loss/train': 1.1922494173049927} 11/07/2021 12:03:03 - INFO - __main__ - Step 105169: {'lr': 0.00010491452166862245, 'samples': 20192448, 'steps': 105168, 'loss/train': 1.8387924432754517} 11/07/2021 12:03:04 - INFO - __main__ - Step 105170: {'lr': 0.00010491020003238449, 'samples': 20192640, 'steps': 105169, 'loss/train': 1.5018280744552612} 11/07/2021 12:03:04 - INFO - __main__ - Step 105171: {'lr': 0.00010490587846152166, 'samples': 20192832, 'steps': 105170, 'loss/train': 1.4759548902511597} 11/07/2021 12:03:04 - INFO - __main__ - Step 105172: {'lr': 0.00010490155695603604, 'samples': 20193024, 'steps': 105171, 'loss/train': 1.7894482612609863} 11/07/2021 12:03:05 - INFO - __main__ - Step 105173: {'lr': 0.0001048972355159295, 'samples': 20193216, 'steps': 105172, 'loss/train': 1.7407283782958984} 11/07/2021 12:03:05 - INFO - __main__ - Step 105174: {'lr': 0.00010489291414120403, 'samples': 20193408, 'steps': 105173, 'loss/train': 1.763314962387085} 11/07/2021 12:03:06 - INFO - __main__ - Step 105175: {'lr': 0.00010488859283186158, 'samples': 20193600, 'steps': 105174, 'loss/train': 1.3555115461349487} 11/07/2021 12:03:07 - INFO - __main__ - Step 105176: {'lr': 0.00010488427158790408, 'samples': 20193792, 'steps': 105175, 'loss/train': 1.2630841732025146} 11/07/2021 12:03:07 - INFO - __main__ - Step 105177: {'lr': 0.00010487995040933352, 'samples': 20193984, 'steps': 105176, 'loss/train': 1.2503676414489746} 11/07/2021 12:03:07 - INFO - __main__ - Step 105178: {'lr': 0.0001048756292961518, 'samples': 20194176, 'steps': 105177, 'loss/train': 1.2867112159729004} 11/07/2021 12:03:08 - INFO - __main__ - Step 105179: {'lr': 0.00010487130824836086, 'samples': 20194368, 'steps': 105178, 'loss/train': 1.2445173263549805} 11/07/2021 12:03:09 - INFO - __main__ - Step 105180: {'lr': 0.00010486698726596269, 'samples': 20194560, 'steps': 105179, 'loss/train': 1.4286704063415527} 11/07/2021 12:03:09 - INFO - __main__ - Step 105181: {'lr': 0.00010486266634895922, 'samples': 20194752, 'steps': 105180, 'loss/train': 1.322928786277771} 11/07/2021 12:03:09 - INFO - __main__ - Step 105182: {'lr': 0.00010485834549735237, 'samples': 20194944, 'steps': 105181, 'loss/train': 0.4020659625530243} 11/07/2021 12:03:10 - INFO - __main__ - Step 105183: {'lr': 0.00010485402471114422, 'samples': 20195136, 'steps': 105182, 'loss/train': 1.1652309894561768} 11/07/2021 12:03:10 - INFO - __main__ - Step 105184: {'lr': 0.0001048497039903365, 'samples': 20195328, 'steps': 105183, 'loss/train': 1.3581548929214478} 11/07/2021 12:03:11 - INFO - __main__ - Step 105185: {'lr': 0.00010484538333493127, 'samples': 20195520, 'steps': 105184, 'loss/train': 1.4009581804275513} 11/07/2021 12:03:12 - INFO - __main__ - Step 105186: {'lr': 0.00010484106274493049, 'samples': 20195712, 'steps': 105185, 'loss/train': 0.07511761784553528} 11/07/2021 12:03:12 - INFO - __main__ - Step 105187: {'lr': 0.00010483674222033607, 'samples': 20195904, 'steps': 105186, 'loss/train': 1.4837279319763184} 11/07/2021 12:03:12 - INFO - __main__ - Step 105188: {'lr': 0.00010483242176114996, 'samples': 20196096, 'steps': 105187, 'loss/train': 1.1974601745605469} 11/07/2021 12:03:13 - INFO - __main__ - Step 105189: {'lr': 0.00010482810136737414, 'samples': 20196288, 'steps': 105188, 'loss/train': 0.5883745551109314} 11/07/2021 12:03:14 - INFO - __main__ - Step 105190: {'lr': 0.00010482378103901052, 'samples': 20196480, 'steps': 105189, 'loss/train': 1.5742195844650269} 11/07/2021 12:03:15 - INFO - __main__ - Step 105191: {'lr': 0.00010481946077606108, 'samples': 20196672, 'steps': 105190, 'loss/train': 0.8789882659912109} 11/07/2021 12:03:15 - INFO - __main__ - Step 105192: {'lr': 0.00010481514057852776, 'samples': 20196864, 'steps': 105191, 'loss/train': 0.7889737486839294} 11/07/2021 12:03:15 - INFO - __main__ - Step 105193: {'lr': 0.00010481082044641249, 'samples': 20197056, 'steps': 105192, 'loss/train': 1.2612054347991943} 11/07/2021 12:03:16 - INFO - __main__ - Step 105194: {'lr': 0.00010480650037971723, 'samples': 20197248, 'steps': 105193, 'loss/train': 1.7807546854019165} 11/07/2021 12:03:16 - INFO - __main__ - Step 105195: {'lr': 0.00010480218037844389, 'samples': 20197440, 'steps': 105194, 'loss/train': 1.9074182510375977} 11/07/2021 12:03:17 - INFO - __main__ - Step 105196: {'lr': 0.00010479786044259449, 'samples': 20197632, 'steps': 105195, 'loss/train': 0.06175177916884422} 11/07/2021 12:03:17 - INFO - __main__ - Step 105197: {'lr': 0.000104793540572171, 'samples': 20197824, 'steps': 105196, 'loss/train': 1.4095972776412964} 11/07/2021 12:03:18 - INFO - __main__ - Step 105198: {'lr': 0.0001047892207671752, 'samples': 20198016, 'steps': 105197, 'loss/train': 1.6221141815185547} 11/07/2021 12:03:18 - INFO - __main__ - Step 105199: {'lr': 0.00010478490102760915, 'samples': 20198208, 'steps': 105198, 'loss/train': 1.3667834997177124} 11/07/2021 12:03:18 - INFO - __main__ - Step 105200: {'lr': 0.00010478058135347477, 'samples': 20198400, 'steps': 105199, 'loss/train': 1.6517963409423828} 11/07/2021 12:03:19 - INFO - __main__ - Step 105201: {'lr': 0.00010477626174477404, 'samples': 20198592, 'steps': 105200, 'loss/train': 1.3608551025390625} 11/07/2021 12:03:20 - INFO - __main__ - Step 105202: {'lr': 0.00010477194220150887, 'samples': 20198784, 'steps': 105201, 'loss/train': 1.3901206254959106} 11/07/2021 12:03:20 - INFO - __main__ - Step 105203: {'lr': 0.00010476762272368124, 'samples': 20198976, 'steps': 105202, 'loss/train': 1.5333130359649658} 11/07/2021 12:03:20 - INFO - __main__ - Step 105204: {'lr': 0.00010476330331129305, 'samples': 20199168, 'steps': 105203, 'loss/train': 1.632289171218872} 11/07/2021 12:03:21 - INFO - __main__ - Step 105205: {'lr': 0.00010475898396434627, 'samples': 20199360, 'steps': 105204, 'loss/train': 1.3958488702774048} 11/07/2021 12:03:22 - INFO - __main__ - Step 105206: {'lr': 0.0001047546646828429, 'samples': 20199552, 'steps': 105205, 'loss/train': 1.2416660785675049} 11/07/2021 12:03:22 - INFO - __main__ - Step 105207: {'lr': 0.00010475034546678478, 'samples': 20199744, 'steps': 105206, 'loss/train': 1.3470453023910522} 11/07/2021 12:03:23 - INFO - __main__ - Step 105208: {'lr': 0.00010474602631617395, 'samples': 20199936, 'steps': 105207, 'loss/train': 1.5574469566345215} 11/07/2021 12:03:23 - INFO - __main__ - Step 105209: {'lr': 0.00010474170723101231, 'samples': 20200128, 'steps': 105208, 'loss/train': 0.8991352319717407} 11/07/2021 12:03:23 - INFO - __main__ - Step 105210: {'lr': 0.00010473738821130191, 'samples': 20200320, 'steps': 105209, 'loss/train': 2.136735439300537} 11/07/2021 12:03:24 - INFO - __main__ - Step 105211: {'lr': 0.00010473306925704448, 'samples': 20200512, 'steps': 105210, 'loss/train': 1.0650830268859863} 11/07/2021 12:03:25 - INFO - __main__ - Step 105212: {'lr': 0.00010472875036824211, 'samples': 20200704, 'steps': 105211, 'loss/train': 1.150038480758667} 11/07/2021 12:03:25 - INFO - __main__ - Step 105213: {'lr': 0.00010472443154489675, 'samples': 20200896, 'steps': 105212, 'loss/train': 1.4317610263824463} 11/07/2021 12:03:25 - INFO - __main__ - Step 105214: {'lr': 0.0001047201127870103, 'samples': 20201088, 'steps': 105213, 'loss/train': 1.2806487083435059} 11/07/2021 12:03:26 - INFO - __main__ - Step 105215: {'lr': 0.0001047157940945847, 'samples': 20201280, 'steps': 105214, 'loss/train': 1.4228496551513672} 11/07/2021 12:03:27 - INFO - __main__ - Step 105216: {'lr': 0.00010471147546762195, 'samples': 20201472, 'steps': 105215, 'loss/train': 1.3843590021133423} 11/07/2021 12:03:27 - INFO - __main__ - Step 105217: {'lr': 0.00010470715690612396, 'samples': 20201664, 'steps': 105216, 'loss/train': 1.1484838724136353} 11/07/2021 12:03:28 - INFO - __main__ - Step 105218: {'lr': 0.00010470283841009268, 'samples': 20201856, 'steps': 105217, 'loss/train': 1.2357078790664673} 11/07/2021 12:03:28 - INFO - __main__ - Step 105219: {'lr': 0.00010469851997953006, 'samples': 20202048, 'steps': 105218, 'loss/train': 1.4249495267868042} 11/07/2021 12:03:28 - INFO - __main__ - Step 105220: {'lr': 0.00010469420161443805, 'samples': 20202240, 'steps': 105219, 'loss/train': 1.2102819681167603} 11/07/2021 12:03:29 - INFO - __main__ - Step 105221: {'lr': 0.00010468988331481857, 'samples': 20202432, 'steps': 105220, 'loss/train': 1.2540209293365479} 11/07/2021 12:03:30 - INFO - __main__ - Step 105222: {'lr': 0.00010468556508067361, 'samples': 20202624, 'steps': 105221, 'loss/train': 1.308869481086731} 11/07/2021 12:03:30 - INFO - __main__ - Step 105223: {'lr': 0.00010468124691200509, 'samples': 20202816, 'steps': 105222, 'loss/train': 1.5228389501571655} 11/07/2021 12:03:30 - INFO - __main__ - Step 105224: {'lr': 0.00010467692880881504, 'samples': 20203008, 'steps': 105223, 'loss/train': 1.6155362129211426} 11/07/2021 12:03:31 - INFO - __main__ - Step 105225: {'lr': 0.00010467261077110523, 'samples': 20203200, 'steps': 105224, 'loss/train': 1.883145809173584} 11/07/2021 12:03:31 - INFO - __main__ - Step 105226: {'lr': 0.00010466829279887771, 'samples': 20203392, 'steps': 105225, 'loss/train': 1.5463091135025024} 11/07/2021 12:03:32 - INFO - __main__ - Step 105227: {'lr': 0.00010466397489213441, 'samples': 20203584, 'steps': 105226, 'loss/train': 1.401657223701477} 11/07/2021 12:03:33 - INFO - __main__ - Step 105228: {'lr': 0.00010465965705087726, 'samples': 20203776, 'steps': 105227, 'loss/train': 1.3694971799850464} 11/07/2021 12:03:33 - INFO - __main__ - Step 105229: {'lr': 0.00010465533927510826, 'samples': 20203968, 'steps': 105228, 'loss/train': 1.268822431564331} 11/07/2021 12:03:33 - INFO - __main__ - Step 105230: {'lr': 0.0001046510215648293, 'samples': 20204160, 'steps': 105229, 'loss/train': 1.1745753288269043} 11/07/2021 12:03:34 - INFO - __main__ - Step 105231: {'lr': 0.00010464670392004236, 'samples': 20204352, 'steps': 105230, 'loss/train': 0.4086626172065735} 11/07/2021 12:03:35 - INFO - __main__ - Step 105232: {'lr': 0.00010464238634074938, 'samples': 20204544, 'steps': 105231, 'loss/train': 1.3139711618423462} 11/07/2021 12:03:35 - INFO - __main__ - Step 105233: {'lr': 0.00010463806882695229, 'samples': 20204736, 'steps': 105232, 'loss/train': 1.6659736633300781} 11/07/2021 12:03:35 - INFO - __main__ - Step 105234: {'lr': 0.00010463375137865302, 'samples': 20204928, 'steps': 105233, 'loss/train': 1.8222514390945435} 11/07/2021 12:03:36 - INFO - __main__ - Step 105235: {'lr': 0.00010462943399585357, 'samples': 20205120, 'steps': 105234, 'loss/train': 0.7289140820503235} 11/07/2021 12:03:36 - INFO - __main__ - Step 105236: {'lr': 0.00010462511667855581, 'samples': 20205312, 'steps': 105235, 'loss/train': 0.19791853427886963} 11/07/2021 12:03:37 - INFO - __main__ - Step 105237: {'lr': 0.00010462079942676186, 'samples': 20205504, 'steps': 105236, 'loss/train': 1.3523070812225342} 11/07/2021 12:03:38 - INFO - __main__ - Step 105238: {'lr': 0.00010461648224047343, 'samples': 20205696, 'steps': 105237, 'loss/train': 0.9799742102622986} 11/07/2021 12:03:38 - INFO - __main__ - Step 105239: {'lr': 0.00010461216511969257, 'samples': 20205888, 'steps': 105238, 'loss/train': 1.7807281017303467} 11/07/2021 12:03:38 - INFO - __main__ - Step 105240: {'lr': 0.00010460784806442123, 'samples': 20206080, 'steps': 105239, 'loss/train': 1.4118411540985107} 11/07/2021 12:03:39 - INFO - __main__ - Step 105241: {'lr': 0.00010460353107466137, 'samples': 20206272, 'steps': 105240, 'loss/train': 1.2354499101638794} 11/07/2021 12:03:39 - INFO - __main__ - Step 105242: {'lr': 0.00010459921415041487, 'samples': 20206464, 'steps': 105241, 'loss/train': 1.3453601598739624} 11/07/2021 12:03:40 - INFO - __main__ - Step 105243: {'lr': 0.00010459489729168375, 'samples': 20206656, 'steps': 105242, 'loss/train': 1.4993454217910767} 11/07/2021 12:03:41 - INFO - __main__ - Step 105244: {'lr': 0.00010459058049846992, 'samples': 20206848, 'steps': 105243, 'loss/train': 1.498604655265808} 11/07/2021 12:03:41 - INFO - __main__ - Step 105245: {'lr': 0.00010458626377077531, 'samples': 20207040, 'steps': 105244, 'loss/train': 1.171682596206665} 11/07/2021 12:03:41 - INFO - __main__ - Step 105246: {'lr': 0.00010458194710860192, 'samples': 20207232, 'steps': 105245, 'loss/train': 1.4389511346817017} 11/07/2021 12:03:42 - INFO - __main__ - Step 105247: {'lr': 0.00010457763051195165, 'samples': 20207424, 'steps': 105246, 'loss/train': 1.3793363571166992} 11/07/2021 12:03:43 - INFO - __main__ - Step 105248: {'lr': 0.00010457331398082645, 'samples': 20207616, 'steps': 105247, 'loss/train': 1.282370686531067} 11/07/2021 12:03:43 - INFO - __main__ - Step 105249: {'lr': 0.00010456899751522827, 'samples': 20207808, 'steps': 105248, 'loss/train': 1.0810985565185547} 11/07/2021 12:03:43 - INFO - __main__ - Step 105250: {'lr': 0.00010456468111515905, 'samples': 20208000, 'steps': 105249, 'loss/train': 1.4644087553024292} 11/07/2021 12:03:44 - INFO - __main__ - Step 105251: {'lr': 0.00010456036478062083, 'samples': 20208192, 'steps': 105250, 'loss/train': 1.3682259321212769} 11/07/2021 12:03:44 - INFO - __main__ - Step 105252: {'lr': 0.0001045560485116154, 'samples': 20208384, 'steps': 105251, 'loss/train': 0.8912917375564575} 11/07/2021 12:03:45 - INFO - __main__ - Step 105253: {'lr': 0.00010455173230814474, 'samples': 20208576, 'steps': 105252, 'loss/train': 1.6646363735198975} 11/07/2021 12:03:45 - INFO - __main__ - Step 105254: {'lr': 0.00010454741617021086, 'samples': 20208768, 'steps': 105253, 'loss/train': 1.2245420217514038} 11/07/2021 12:03:46 - INFO - __main__ - Step 105255: {'lr': 0.00010454310009781565, 'samples': 20208960, 'steps': 105254, 'loss/train': 1.3495712280273438} 11/07/2021 12:03:46 - INFO - __main__ - Step 105256: {'lr': 0.00010453878409096107, 'samples': 20209152, 'steps': 105255, 'loss/train': 1.4333537817001343} 11/07/2021 12:03:47 - INFO - __main__ - Step 105257: {'lr': 0.00010453446814964906, 'samples': 20209344, 'steps': 105256, 'loss/train': 0.7993539571762085} 11/07/2021 12:03:48 - INFO - __main__ - Step 105258: {'lr': 0.0001045301522738816, 'samples': 20209536, 'steps': 105257, 'loss/train': 1.4865093231201172} 11/07/2021 12:03:48 - INFO - __main__ - Step 105259: {'lr': 0.00010452583646366059, 'samples': 20209728, 'steps': 105258, 'loss/train': 0.21880432963371277} 11/07/2021 12:03:48 - INFO - __main__ - Step 105260: {'lr': 0.00010452152071898799, 'samples': 20209920, 'steps': 105259, 'loss/train': 1.5082000494003296} 11/07/2021 12:03:49 - INFO - __main__ - Step 105261: {'lr': 0.00010451720503986576, 'samples': 20210112, 'steps': 105260, 'loss/train': 1.3885254859924316} 11/07/2021 12:03:49 - INFO - __main__ - Step 105262: {'lr': 0.00010451288942629583, 'samples': 20210304, 'steps': 105261, 'loss/train': 1.2246261835098267} 11/07/2021 12:03:50 - INFO - __main__ - Step 105263: {'lr': 0.00010450857387828014, 'samples': 20210496, 'steps': 105262, 'loss/train': 1.6494895219802856} 11/07/2021 12:03:50 - INFO - __main__ - Step 105264: {'lr': 0.00010450425839582073, 'samples': 20210688, 'steps': 105263, 'loss/train': 1.5644958019256592} 11/07/2021 12:03:51 - INFO - __main__ - Step 105265: {'lr': 0.00010449994297891937, 'samples': 20210880, 'steps': 105264, 'loss/train': 1.151308298110962} 11/07/2021 12:03:51 - INFO - __main__ - Step 105266: {'lr': 0.0001044956276275781, 'samples': 20211072, 'steps': 105265, 'loss/train': 1.4079251289367676} 11/07/2021 12:03:51 - INFO - __main__ - Step 105267: {'lr': 0.00010449131234179884, 'samples': 20211264, 'steps': 105266, 'loss/train': 0.9569866061210632} 11/07/2021 12:03:52 - INFO - __main__ - Step 105268: {'lr': 0.00010448699712158357, 'samples': 20211456, 'steps': 105267, 'loss/train': 1.847949504852295} 11/07/2021 12:03:53 - INFO - __main__ - Step 105269: {'lr': 0.0001044826819669342, 'samples': 20211648, 'steps': 105268, 'loss/train': 1.3025673627853394} 11/07/2021 12:03:53 - INFO - __main__ - Step 105270: {'lr': 0.0001044783668778527, 'samples': 20211840, 'steps': 105269, 'loss/train': 1.4202735424041748} 11/07/2021 12:03:54 - INFO - __main__ - Step 105271: {'lr': 0.00010447405185434097, 'samples': 20212032, 'steps': 105270, 'loss/train': 1.5817373991012573} 11/07/2021 12:03:54 - INFO - __main__ - Step 105272: {'lr': 0.00010446973689640101, 'samples': 20212224, 'steps': 105271, 'loss/train': 1.7027908563613892} 11/07/2021 12:03:54 - INFO - __main__ - Step 105273: {'lr': 0.00010446542200403475, 'samples': 20212416, 'steps': 105272, 'loss/train': 1.1494115591049194} 11/07/2021 12:03:55 - INFO - __main__ - Step 105274: {'lr': 0.0001044611071772441, 'samples': 20212608, 'steps': 105273, 'loss/train': 1.0198898315429688} 11/07/2021 12:03:56 - INFO - __main__ - Step 105275: {'lr': 0.00010445679241603107, 'samples': 20212800, 'steps': 105274, 'loss/train': 0.66750168800354} 11/07/2021 12:03:56 - INFO - __main__ - Step 105276: {'lr': 0.00010445247772039754, 'samples': 20212992, 'steps': 105275, 'loss/train': 1.3887532949447632} 11/07/2021 12:03:57 - INFO - __main__ - Step 105277: {'lr': 0.00010444816309034555, 'samples': 20213184, 'steps': 105276, 'loss/train': 1.414869785308838} 11/07/2021 12:03:57 - INFO - __main__ - Step 105278: {'lr': 0.00010444384852587691, 'samples': 20213376, 'steps': 105277, 'loss/train': 1.341550588607788} 11/07/2021 12:03:58 - INFO - __main__ - Step 105279: {'lr': 0.0001044395340269936, 'samples': 20213568, 'steps': 105278, 'loss/train': 0.8494886755943298} 11/07/2021 12:03:58 - INFO - __main__ - Step 105280: {'lr': 0.0001044352195936976, 'samples': 20213760, 'steps': 105279, 'loss/train': 5.764904499053955} 11/07/2021 12:03:59 - INFO - __main__ - Step 105281: {'lr': 0.00010443090522599086, 'samples': 20213952, 'steps': 105280, 'loss/train': 1.4944238662719727} 11/07/2021 12:03:59 - INFO - __main__ - Step 105282: {'lr': 0.00010442659092387527, 'samples': 20214144, 'steps': 105281, 'loss/train': 1.2581933736801147} 11/07/2021 12:03:59 - INFO - __main__ - Step 105283: {'lr': 0.00010442227668735285, 'samples': 20214336, 'steps': 105282, 'loss/train': 1.33547043800354} 11/07/2021 12:04:00 - INFO - __main__ - Step 105284: {'lr': 0.00010441796251642549, 'samples': 20214528, 'steps': 105283, 'loss/train': 1.5573132038116455} 11/07/2021 12:04:01 - INFO - __main__ - Step 105285: {'lr': 0.00010441364841109515, 'samples': 20214720, 'steps': 105284, 'loss/train': 1.2738016843795776} 11/07/2021 12:04:01 - INFO - __main__ - Step 105286: {'lr': 0.00010440933437136376, 'samples': 20214912, 'steps': 105285, 'loss/train': 1.2555787563323975} 11/07/2021 12:04:02 - INFO - __main__ - Step 105287: {'lr': 0.00010440502039723331, 'samples': 20215104, 'steps': 105286, 'loss/train': 1.2714954614639282} 11/07/2021 12:04:02 - INFO - __main__ - Step 105288: {'lr': 0.0001044007064887057, 'samples': 20215296, 'steps': 105287, 'loss/train': 1.079890251159668} 11/07/2021 12:04:02 - INFO - __main__ - Step 105289: {'lr': 0.00010439639264578288, 'samples': 20215488, 'steps': 105288, 'loss/train': 1.2503001689910889} 11/07/2021 12:04:03 - INFO - __main__ - Step 105290: {'lr': 0.00010439207886846677, 'samples': 20215680, 'steps': 105289, 'loss/train': 0.8144906163215637} 11/07/2021 12:04:04 - INFO - __main__ - Step 105291: {'lr': 0.00010438776515675946, 'samples': 20215872, 'steps': 105290, 'loss/train': 1.0660017728805542} 11/07/2021 12:04:04 - INFO - __main__ - Step 105292: {'lr': 0.0001043834515106627, 'samples': 20216064, 'steps': 105291, 'loss/train': 1.1994737386703491} 11/07/2021 12:04:04 - INFO - __main__ - Step 105293: {'lr': 0.00010437913793017851, 'samples': 20216256, 'steps': 105292, 'loss/train': 0.8626495599746704} 11/07/2021 12:04:05 - INFO - __main__ - Step 105294: {'lr': 0.0001043748244153088, 'samples': 20216448, 'steps': 105293, 'loss/train': 0.7784091830253601} 11/07/2021 12:04:06 - INFO - __main__ - Step 105295: {'lr': 0.00010437051096605556, 'samples': 20216640, 'steps': 105294, 'loss/train': 0.8340812921524048} 11/07/2021 12:04:06 - INFO - __main__ - Step 105296: {'lr': 0.00010436619758242072, 'samples': 20216832, 'steps': 105295, 'loss/train': 1.4273356199264526} 11/07/2021 12:04:07 - INFO - __main__ - Step 105297: {'lr': 0.00010436188426440623, 'samples': 20217024, 'steps': 105296, 'loss/train': 1.5223190784454346} 11/07/2021 12:04:07 - INFO - __main__ - Step 105298: {'lr': 0.00010435757101201404, 'samples': 20217216, 'steps': 105297, 'loss/train': 1.702336072921753} 11/07/2021 12:04:07 - INFO - __main__ - Step 105299: {'lr': 0.00010435325782524608, 'samples': 20217408, 'steps': 105298, 'loss/train': 1.404015302658081} 11/07/2021 12:04:08 - INFO - __main__ - Step 105300: {'lr': 0.00010434894470410428, 'samples': 20217600, 'steps': 105299, 'loss/train': 1.2108582258224487} 11/07/2021 12:04:09 - INFO - __main__ - Step 105301: {'lr': 0.0001043446316485906, 'samples': 20217792, 'steps': 105300, 'loss/train': 1.4937701225280762} 11/07/2021 12:04:09 - INFO - __main__ - Step 105302: {'lr': 0.00010434031865870697, 'samples': 20217984, 'steps': 105301, 'loss/train': 1.4145612716674805} 11/07/2021 12:04:09 - INFO - __main__ - Step 105303: {'lr': 0.00010433600573445538, 'samples': 20218176, 'steps': 105302, 'loss/train': 0.8798491358757019} 11/07/2021 12:04:10 - INFO - __main__ - Step 105304: {'lr': 0.0001043316928758378, 'samples': 20218368, 'steps': 105303, 'loss/train': 1.0549216270446777} 11/07/2021 12:04:10 - INFO - __main__ - Step 105305: {'lr': 0.00010432738008285602, 'samples': 20218560, 'steps': 105304, 'loss/train': 1.5560330152511597} 11/07/2021 12:04:11 - INFO - __main__ - Step 105306: {'lr': 0.00010432306735551209, 'samples': 20218752, 'steps': 105305, 'loss/train': 1.3976892232894897} 11/07/2021 12:04:12 - INFO - __main__ - Step 105307: {'lr': 0.00010431875469380792, 'samples': 20218944, 'steps': 105306, 'loss/train': 1.6255565881729126} 11/07/2021 12:04:12 - INFO - __main__ - Step 105308: {'lr': 0.00010431444209774549, 'samples': 20219136, 'steps': 105307, 'loss/train': 1.1432349681854248} 11/07/2021 12:04:12 - INFO - __main__ - Step 105309: {'lr': 0.00010431012956732668, 'samples': 20219328, 'steps': 105308, 'loss/train': 0.8261331915855408} 11/07/2021 12:04:13 - INFO - __main__ - Step 105310: {'lr': 0.00010430581710255354, 'samples': 20219520, 'steps': 105309, 'loss/train': 1.2342051267623901} 11/07/2021 12:04:14 - INFO - __main__ - Step 105311: {'lr': 0.00010430150470342793, 'samples': 20219712, 'steps': 105310, 'loss/train': 1.214442491531372} 11/07/2021 12:04:14 - INFO - __main__ - Step 105312: {'lr': 0.0001042971923699518, 'samples': 20219904, 'steps': 105311, 'loss/train': 1.5669602155685425} 11/07/2021 12:04:14 - INFO - __main__ - Step 105313: {'lr': 0.00010429288010212712, 'samples': 20220096, 'steps': 105312, 'loss/train': 0.8048188090324402} 11/07/2021 12:04:15 - INFO - __main__ - Step 105314: {'lr': 0.00010428856789995581, 'samples': 20220288, 'steps': 105313, 'loss/train': 1.0100947618484497} 11/07/2021 12:04:15 - INFO - __main__ - Step 105315: {'lr': 0.00010428425576343981, 'samples': 20220480, 'steps': 105314, 'loss/train': 1.5519784688949585} 11/07/2021 12:04:16 - INFO - __main__ - Step 105316: {'lr': 0.00010427994369258109, 'samples': 20220672, 'steps': 105315, 'loss/train': 1.0066519975662231} 11/07/2021 12:04:16 - INFO - __main__ - Step 105317: {'lr': 0.00010427563168738157, 'samples': 20220864, 'steps': 105316, 'loss/train': 0.9657079577445984} 11/07/2021 12:04:17 - INFO - __main__ - Step 105318: {'lr': 0.00010427131974784332, 'samples': 20221056, 'steps': 105317, 'loss/train': 0.7557847499847412} 11/07/2021 12:04:17 - INFO - __main__ - Step 105319: {'lr': 0.00010426700787396806, 'samples': 20221248, 'steps': 105318, 'loss/train': 0.960444986820221} 11/07/2021 12:04:18 - INFO - __main__ - Step 105320: {'lr': 0.00010426269606575783, 'samples': 20221440, 'steps': 105319, 'loss/train': 1.447075605392456} 11/07/2021 12:04:19 - INFO - __main__ - Step 105321: {'lr': 0.00010425838432321457, 'samples': 20221632, 'steps': 105320, 'loss/train': 1.7017905712127686} 11/07/2021 12:04:19 - INFO - __main__ - Step 105322: {'lr': 0.00010425407264634026, 'samples': 20221824, 'steps': 105321, 'loss/train': 1.056805968284607} 11/07/2021 12:04:19 - INFO - __main__ - Step 105323: {'lr': 0.0001042497610351368, 'samples': 20222016, 'steps': 105322, 'loss/train': 1.235877275466919} 11/07/2021 12:04:20 - INFO - __main__ - Step 105324: {'lr': 0.00010424544948960616, 'samples': 20222208, 'steps': 105323, 'loss/train': 1.3377493619918823} 11/07/2021 12:04:20 - INFO - __main__ - Step 105325: {'lr': 0.00010424113800975027, 'samples': 20222400, 'steps': 105324, 'loss/train': 1.1839382648468018} 11/07/2021 12:04:21 - INFO - __main__ - Step 105326: {'lr': 0.00010423682659557107, 'samples': 20222592, 'steps': 105325, 'loss/train': 1.0617351531982422} 11/07/2021 12:04:22 - INFO - __main__ - Step 105327: {'lr': 0.0001042325152470705, 'samples': 20222784, 'steps': 105326, 'loss/train': 1.0341832637786865} 11/07/2021 12:04:22 - INFO - __main__ - Step 105328: {'lr': 0.00010422820396425051, 'samples': 20222976, 'steps': 105327, 'loss/train': 1.2037121057510376} 11/07/2021 12:04:22 - INFO - __main__ - Step 105329: {'lr': 0.00010422389274711305, 'samples': 20223168, 'steps': 105328, 'loss/train': 0.8037098050117493} 11/07/2021 12:04:23 - INFO - __main__ - Step 105330: {'lr': 0.00010421958159566006, 'samples': 20223360, 'steps': 105329, 'loss/train': 1.2393686771392822} 11/07/2021 12:04:23 - INFO - __main__ - Step 105331: {'lr': 0.00010421527050989354, 'samples': 20223552, 'steps': 105330, 'loss/train': 0.677667498588562} 11/07/2021 12:04:24 - INFO - __main__ - Step 105332: {'lr': 0.0001042109594898153, 'samples': 20223744, 'steps': 105331, 'loss/train': 1.294662356376648} 11/07/2021 12:04:24 - INFO - __main__ - Step 105333: {'lr': 0.00010420664853542736, 'samples': 20223936, 'steps': 105332, 'loss/train': 1.4346178770065308} 11/07/2021 12:04:25 - INFO - __main__ - Step 105334: {'lr': 0.00010420233764673162, 'samples': 20224128, 'steps': 105333, 'loss/train': 0.9793675541877747} 11/07/2021 12:04:25 - INFO - __main__ - Step 105335: {'lr': 0.00010419802682373008, 'samples': 20224320, 'steps': 105334, 'loss/train': 1.65200674533844} 11/07/2021 12:04:25 - INFO - __main__ - Step 105336: {'lr': 0.00010419371606642467, 'samples': 20224512, 'steps': 105335, 'loss/train': 1.1995121240615845} 11/07/2021 12:04:26 - INFO - __main__ - Step 105337: {'lr': 0.0001041894053748173, 'samples': 20224704, 'steps': 105336, 'loss/train': 1.586211919784546} 11/07/2021 12:04:27 - INFO - __main__ - Step 105338: {'lr': 0.00010418509474890994, 'samples': 20224896, 'steps': 105337, 'loss/train': 2.04872727394104} 11/07/2021 12:04:27 - INFO - __main__ - Step 105339: {'lr': 0.00010418078418870455, 'samples': 20225088, 'steps': 105338, 'loss/train': 1.479591965675354} 11/07/2021 12:04:28 - INFO - __main__ - Step 105340: {'lr': 0.00010417647369420302, 'samples': 20225280, 'steps': 105339, 'loss/train': 1.6206464767456055} 11/07/2021 12:04:28 - INFO - __main__ - Step 105341: {'lr': 0.00010417216326540732, 'samples': 20225472, 'steps': 105340, 'loss/train': 0.7490705251693726} 11/07/2021 12:04:29 - INFO - __main__ - Step 105342: {'lr': 0.00010416785290231951, 'samples': 20225664, 'steps': 105341, 'loss/train': 1.834619164466858} 11/07/2021 12:04:29 - INFO - __main__ - Step 105343: {'lr': 0.0001041635426049413, 'samples': 20225856, 'steps': 105342, 'loss/train': 0.8802608251571655} 11/07/2021 12:04:30 - INFO - __main__ - Step 105344: {'lr': 0.00010415923237327476, 'samples': 20226048, 'steps': 105343, 'loss/train': 1.6505863666534424} 11/07/2021 12:04:30 - INFO - __main__ - Step 105345: {'lr': 0.00010415492220732181, 'samples': 20226240, 'steps': 105344, 'loss/train': 1.0190131664276123} 11/07/2021 12:04:30 - INFO - __main__ - Step 105346: {'lr': 0.0001041506121070844, 'samples': 20226432, 'steps': 105345, 'loss/train': 1.3311489820480347} 11/07/2021 12:04:31 - INFO - __main__ - Step 105347: {'lr': 0.00010414630207256447, 'samples': 20226624, 'steps': 105346, 'loss/train': 1.3356469869613647} 11/07/2021 12:04:32 - INFO - __main__ - Step 105348: {'lr': 0.00010414199210376399, 'samples': 20226816, 'steps': 105347, 'loss/train': 1.3136873245239258} 11/07/2021 12:04:32 - INFO - __main__ - Step 105349: {'lr': 0.00010413768220068487, 'samples': 20227008, 'steps': 105348, 'loss/train': 1.739066243171692} 11/07/2021 12:04:32 - INFO - __main__ - Step 105350: {'lr': 0.00010413337236332907, 'samples': 20227200, 'steps': 105349, 'loss/train': 1.4907729625701904} 11/07/2021 12:04:33 - INFO - __main__ - Step 105351: {'lr': 0.0001041290625916985, 'samples': 20227392, 'steps': 105350, 'loss/train': 1.1718751192092896} 11/07/2021 12:04:33 - INFO - __main__ - Step 105352: {'lr': 0.00010412475288579512, 'samples': 20227584, 'steps': 105351, 'loss/train': 1.5820667743682861} 11/07/2021 12:04:34 - INFO - __main__ - Step 105353: {'lr': 0.00010412044324562098, 'samples': 20227776, 'steps': 105352, 'loss/train': 1.3261305093765259} 11/07/2021 12:04:35 - INFO - __main__ - Step 105354: {'lr': 0.00010411613367117781, 'samples': 20227968, 'steps': 105353, 'loss/train': 1.7672667503356934} 11/07/2021 12:04:35 - INFO - __main__ - Step 105355: {'lr': 0.00010411182416246768, 'samples': 20228160, 'steps': 105354, 'loss/train': 1.3590116500854492} 11/07/2021 12:04:35 - INFO - __main__ - Step 105356: {'lr': 0.00010410751471949248, 'samples': 20228352, 'steps': 105355, 'loss/train': 1.298156499862671} 11/07/2021 12:04:36 - INFO - __main__ - Step 105357: {'lr': 0.0001041032053422542, 'samples': 20228544, 'steps': 105356, 'loss/train': 1.2358107566833496} 11/07/2021 12:04:37 - INFO - __main__ - Step 105358: {'lr': 0.00010409889603075478, 'samples': 20228736, 'steps': 105357, 'loss/train': 1.3090134859085083} 11/07/2021 12:04:37 - INFO - __main__ - Step 105359: {'lr': 0.00010409458678499615, 'samples': 20228928, 'steps': 105358, 'loss/train': 1.3999916315078735} 11/07/2021 12:04:37 - INFO - __main__ - Step 105360: {'lr': 0.00010409027760498021, 'samples': 20229120, 'steps': 105359, 'loss/train': 1.2762749195098877} 11/07/2021 12:04:38 - INFO - __main__ - Step 105361: {'lr': 0.00010408596849070898, 'samples': 20229312, 'steps': 105360, 'loss/train': 1.2154127359390259} 11/07/2021 12:04:38 - INFO - __main__ - Step 105362: {'lr': 0.00010408165944218431, 'samples': 20229504, 'steps': 105361, 'loss/train': 1.3185149431228638} 11/07/2021 12:04:39 - INFO - __main__ - Step 105363: {'lr': 0.00010407735045940825, 'samples': 20229696, 'steps': 105362, 'loss/train': 1.4381431341171265} 11/07/2021 12:04:40 - INFO - __main__ - Step 105364: {'lr': 0.00010407304154238272, 'samples': 20229888, 'steps': 105363, 'loss/train': 1.8308597803115845} 11/07/2021 12:04:40 - INFO - __main__ - Step 105365: {'lr': 0.00010406873269110959, 'samples': 20230080, 'steps': 105364, 'loss/train': 1.0814141035079956} 11/07/2021 12:04:40 - INFO - __main__ - Step 105366: {'lr': 0.00010406442390559082, 'samples': 20230272, 'steps': 105365, 'loss/train': 1.0640982389450073} 11/07/2021 12:04:41 - INFO - __main__ - Step 105367: {'lr': 0.00010406011518582834, 'samples': 20230464, 'steps': 105366, 'loss/train': 0.9651253819465637} 11/07/2021 12:04:42 - INFO - __main__ - Step 105368: {'lr': 0.00010405580653182415, 'samples': 20230656, 'steps': 105367, 'loss/train': 0.3412196636199951} 11/07/2021 12:04:42 - INFO - __main__ - Step 105369: {'lr': 0.00010405149794358015, 'samples': 20230848, 'steps': 105368, 'loss/train': 1.1942867040634155} 11/07/2021 12:04:42 - INFO - __main__ - Step 105370: {'lr': 0.00010404718942109829, 'samples': 20231040, 'steps': 105369, 'loss/train': 0.6120254397392273} 11/07/2021 12:04:43 - INFO - __main__ - Step 105371: {'lr': 0.00010404288096438052, 'samples': 20231232, 'steps': 105370, 'loss/train': 1.0552129745483398} 11/07/2021 12:04:43 - INFO - __main__ - Step 105372: {'lr': 0.00010403857257342877, 'samples': 20231424, 'steps': 105371, 'loss/train': 1.7474420070648193} 11/07/2021 12:04:44 - INFO - __main__ - Step 105373: {'lr': 0.000104034264248245, 'samples': 20231616, 'steps': 105372, 'loss/train': 1.2984530925750732} 11/07/2021 12:04:44 - INFO - __main__ - Step 105374: {'lr': 0.00010402995598883111, 'samples': 20231808, 'steps': 105373, 'loss/train': 0.9440544843673706} 11/07/2021 12:04:45 - INFO - __main__ - Step 105375: {'lr': 0.00010402564779518919, 'samples': 20232000, 'steps': 105374, 'loss/train': 1.3251346349716187} 11/07/2021 12:04:45 - INFO - __main__ - Step 105376: {'lr': 0.00010402133966732098, 'samples': 20232192, 'steps': 105375, 'loss/train': 1.0830986499786377} 11/07/2021 12:04:46 - INFO - __main__ - Step 105377: {'lr': 0.00010401703160522846, 'samples': 20232384, 'steps': 105376, 'loss/train': 1.3236939907073975} 11/07/2021 12:04:47 - INFO - __main__ - Step 105378: {'lr': 0.00010401272360891364, 'samples': 20232576, 'steps': 105377, 'loss/train': 1.498174786567688} 11/07/2021 12:04:47 - INFO - __main__ - Step 105379: {'lr': 0.00010400841567837843, 'samples': 20232768, 'steps': 105378, 'loss/train': 1.3327261209487915} 11/07/2021 12:04:47 - INFO - __main__ - Step 105380: {'lr': 0.00010400410781362477, 'samples': 20232960, 'steps': 105379, 'loss/train': 1.396457314491272} 11/07/2021 12:04:48 - INFO - __main__ - Step 105381: {'lr': 0.00010399980001465461, 'samples': 20233152, 'steps': 105380, 'loss/train': 0.8505367636680603} 11/07/2021 12:04:48 - INFO - __main__ - Step 105382: {'lr': 0.00010399549228146987, 'samples': 20233344, 'steps': 105381, 'loss/train': 1.1569147109985352} 11/07/2021 12:04:49 - INFO - __main__ - Step 105383: {'lr': 0.00010399118461407254, 'samples': 20233536, 'steps': 105382, 'loss/train': 1.3533635139465332} 11/07/2021 12:04:49 - INFO - __main__ - Step 105384: {'lr': 0.00010398687701246451, 'samples': 20233728, 'steps': 105383, 'loss/train': 1.4275213479995728} 11/07/2021 12:04:50 - INFO - __main__ - Step 105385: {'lr': 0.00010398256947664774, 'samples': 20233920, 'steps': 105384, 'loss/train': 1.335425615310669} 11/07/2021 12:04:50 - INFO - __main__ - Step 105386: {'lr': 0.00010397826200662427, 'samples': 20234112, 'steps': 105385, 'loss/train': 1.1338030099868774} 11/07/2021 12:04:50 - INFO - __main__ - Step 105387: {'lr': 0.00010397395460239583, 'samples': 20234304, 'steps': 105386, 'loss/train': 1.407522201538086} 11/07/2021 12:04:52 - INFO - __main__ - Step 105388: {'lr': 0.00010396964726396452, 'samples': 20234496, 'steps': 105387, 'loss/train': 0.8276821374893188} 11/07/2021 12:04:52 - INFO - __main__ - Step 105389: {'lr': 0.00010396533999133218, 'samples': 20234688, 'steps': 105388, 'loss/train': 1.2787816524505615} 11/07/2021 12:04:52 - INFO - __main__ - Step 105390: {'lr': 0.00010396103278450084, 'samples': 20234880, 'steps': 105389, 'loss/train': 1.3538411855697632} 11/07/2021 12:04:53 - INFO - __main__ - Step 105391: {'lr': 0.00010395672564347239, 'samples': 20235072, 'steps': 105390, 'loss/train': 1.1657015085220337} 11/07/2021 12:04:53 - INFO - __main__ - Step 105392: {'lr': 0.00010395241856824877, 'samples': 20235264, 'steps': 105391, 'loss/train': 1.5381826162338257} 11/07/2021 12:04:53 - INFO - __main__ - Step 105393: {'lr': 0.00010394811155883197, 'samples': 20235456, 'steps': 105392, 'loss/train': 2.9542667865753174} 11/07/2021 12:04:54 - INFO - __main__ - Step 105394: {'lr': 0.00010394380461522387, 'samples': 20235648, 'steps': 105393, 'loss/train': 1.2970671653747559} 11/07/2021 12:04:55 - INFO - __main__ - Step 105395: {'lr': 0.00010393949773742648, 'samples': 20235840, 'steps': 105394, 'loss/train': 1.6042373180389404} 11/07/2021 12:04:55 - INFO - __main__ - Step 105396: {'lr': 0.00010393519092544165, 'samples': 20236032, 'steps': 105395, 'loss/train': 1.3007521629333496} 11/07/2021 12:04:55 - INFO - __main__ - Step 105397: {'lr': 0.00010393088417927137, 'samples': 20236224, 'steps': 105396, 'loss/train': 1.564125657081604} 11/07/2021 12:04:56 - INFO - __main__ - Step 105398: {'lr': 0.00010392657749891771, 'samples': 20236416, 'steps': 105397, 'loss/train': 1.025750994682312} 11/07/2021 12:04:57 - INFO - __main__ - Step 105399: {'lr': 0.00010392227088438236, 'samples': 20236608, 'steps': 105398, 'loss/train': 1.6376559734344482} 11/07/2021 12:04:57 - INFO - __main__ - Step 105400: {'lr': 0.00010391796433566739, 'samples': 20236800, 'steps': 105399, 'loss/train': 1.2885078191757202} 11/07/2021 12:04:58 - INFO - __main__ - Step 105401: {'lr': 0.00010391365785277473, 'samples': 20236992, 'steps': 105400, 'loss/train': 1.3195323944091797} 11/07/2021 12:04:58 - INFO - __main__ - Step 105402: {'lr': 0.00010390935143570631, 'samples': 20237184, 'steps': 105401, 'loss/train': 1.5289183855056763} 11/07/2021 12:04:58 - INFO - __main__ - Step 105403: {'lr': 0.0001039050450844641, 'samples': 20237376, 'steps': 105402, 'loss/train': 0.17540261149406433} 11/07/2021 12:05:00 - INFO - __main__ - Step 105404: {'lr': 0.00010390073879905002, 'samples': 20237568, 'steps': 105403, 'loss/train': 0.9762536287307739} 11/07/2021 12:05:00 - INFO - __main__ - Step 105405: {'lr': 0.00010389643257946602, 'samples': 20237760, 'steps': 105404, 'loss/train': 1.5843273401260376} 11/07/2021 12:05:00 - INFO - __main__ - Step 105406: {'lr': 0.000103892126425714, 'samples': 20237952, 'steps': 105405, 'loss/train': 1.776750087738037} 11/07/2021 12:05:01 - INFO - __main__ - Step 105407: {'lr': 0.00010388782033779595, 'samples': 20238144, 'steps': 105406, 'loss/train': 1.3156721591949463} 11/07/2021 12:05:01 - INFO - __main__ - Step 105408: {'lr': 0.0001038835143157138, 'samples': 20238336, 'steps': 105407, 'loss/train': 1.3293555974960327} 11/07/2021 12:05:02 - INFO - __main__ - Step 105409: {'lr': 0.00010387920835946949, 'samples': 20238528, 'steps': 105408, 'loss/train': 0.9218058586120605} 11/07/2021 12:05:03 - INFO - __main__ - Step 105410: {'lr': 0.00010387490246906495, 'samples': 20238720, 'steps': 105409, 'loss/train': 1.1939440965652466} 11/07/2021 12:05:03 - INFO - __main__ - Step 105411: {'lr': 0.00010387059664450211, 'samples': 20238912, 'steps': 105410, 'loss/train': 1.4347281455993652} 11/07/2021 12:05:03 - INFO - __main__ - Step 105412: {'lr': 0.00010386629088578303, 'samples': 20239104, 'steps': 105411, 'loss/train': 0.7545430660247803} 11/07/2021 12:05:04 - INFO - __main__ - Step 105413: {'lr': 0.00010386198519290943, 'samples': 20239296, 'steps': 105412, 'loss/train': 1.3203730583190918} 11/07/2021 12:05:04 - INFO - __main__ - Step 105414: {'lr': 0.00010385767956588338, 'samples': 20239488, 'steps': 105413, 'loss/train': 0.8770759701728821} 11/07/2021 12:05:05 - INFO - __main__ - Step 105415: {'lr': 0.00010385337400470681, 'samples': 20239680, 'steps': 105414, 'loss/train': 1.651685118675232} 11/07/2021 12:05:05 - INFO - __main__ - Step 105416: {'lr': 0.00010384906850938167, 'samples': 20239872, 'steps': 105415, 'loss/train': 1.6134700775146484} 11/07/2021 12:05:06 - INFO - __main__ - Step 105417: {'lr': 0.00010384476307990987, 'samples': 20240064, 'steps': 105416, 'loss/train': 0.8137121200561523} 11/07/2021 12:05:06 - INFO - __main__ - Step 105418: {'lr': 0.00010384045771629333, 'samples': 20240256, 'steps': 105417, 'loss/train': 1.942958950996399} 11/07/2021 12:05:06 - INFO - __main__ - Step 105419: {'lr': 0.00010383615241853405, 'samples': 20240448, 'steps': 105418, 'loss/train': 1.0881150960922241} 11/07/2021 12:05:07 - INFO - __main__ - Step 105420: {'lr': 0.00010383184718663397, 'samples': 20240640, 'steps': 105419, 'loss/train': 1.2517846822738647} 11/07/2021 12:05:08 - INFO - __main__ - Step 105421: {'lr': 0.00010382754202059497, 'samples': 20240832, 'steps': 105420, 'loss/train': 1.2911711931228638} 11/07/2021 12:05:08 - INFO - __main__ - Step 105422: {'lr': 0.00010382323692041903, 'samples': 20241024, 'steps': 105421, 'loss/train': 1.0918693542480469} 11/07/2021 12:05:08 - INFO - __main__ - Step 105423: {'lr': 0.0001038189318861081, 'samples': 20241216, 'steps': 105422, 'loss/train': 1.3088916540145874} 11/07/2021 12:05:09 - INFO - __main__ - Step 105424: {'lr': 0.00010381462691766411, 'samples': 20241408, 'steps': 105423, 'loss/train': 1.2743396759033203} 11/07/2021 12:05:10 - INFO - __main__ - Step 105425: {'lr': 0.00010381032201508906, 'samples': 20241600, 'steps': 105424, 'loss/train': 1.4931720495224} 11/07/2021 12:05:10 - INFO - __main__ - Step 105426: {'lr': 0.00010380601717838472, 'samples': 20241792, 'steps': 105425, 'loss/train': 1.2009376287460327} 11/07/2021 12:05:10 - INFO - __main__ - Step 105427: {'lr': 0.00010380171240755317, 'samples': 20241984, 'steps': 105426, 'loss/train': 1.64360511302948} 11/07/2021 12:05:11 - INFO - __main__ - Step 105428: {'lr': 0.0001037974077025963, 'samples': 20242176, 'steps': 105427, 'loss/train': 1.596316933631897} 11/07/2021 12:05:11 - INFO - __main__ - Step 105429: {'lr': 0.00010379310306351606, 'samples': 20242368, 'steps': 105428, 'loss/train': 1.1352571249008179} 11/07/2021 12:05:12 - INFO - __main__ - Step 105430: {'lr': 0.00010378879849031439, 'samples': 20242560, 'steps': 105429, 'loss/train': 1.880556583404541} 11/07/2021 12:05:13 - INFO - __main__ - Step 105431: {'lr': 0.00010378449398299322, 'samples': 20242752, 'steps': 105430, 'loss/train': 1.0709176063537598} 11/07/2021 12:05:13 - INFO - __main__ - Step 105432: {'lr': 0.00010378018954155452, 'samples': 20242944, 'steps': 105431, 'loss/train': 0.9870719909667969} 11/07/2021 12:05:13 - INFO - __main__ - Step 105433: {'lr': 0.0001037758851660002, 'samples': 20243136, 'steps': 105432, 'loss/train': 0.9409540295600891} 11/07/2021 12:05:14 - INFO - __main__ - Step 105434: {'lr': 0.00010377158085633221, 'samples': 20243328, 'steps': 105433, 'loss/train': 1.1451669931411743} 11/07/2021 12:05:15 - INFO - __main__ - Step 105435: {'lr': 0.00010376727661255247, 'samples': 20243520, 'steps': 105434, 'loss/train': 1.040324091911316} 11/07/2021 12:05:15 - INFO - __main__ - Step 105436: {'lr': 0.00010376297243466299, 'samples': 20243712, 'steps': 105435, 'loss/train': 1.0458934307098389} 11/07/2021 12:05:15 - INFO - __main__ - Step 105437: {'lr': 0.00010375866832266562, 'samples': 20243904, 'steps': 105436, 'loss/train': 1.2096365690231323} 11/07/2021 12:05:16 - INFO - __main__ - Step 105438: {'lr': 0.00010375436427656235, 'samples': 20244096, 'steps': 105437, 'loss/train': 0.8665537238121033} 11/07/2021 12:05:16 - INFO - __main__ - Step 105439: {'lr': 0.00010375006029635518, 'samples': 20244288, 'steps': 105438, 'loss/train': 0.9303382039070129} 11/07/2021 12:05:17 - INFO - __main__ - Step 105440: {'lr': 0.0001037457563820459, 'samples': 20244480, 'steps': 105439, 'loss/train': 1.773106575012207} 11/07/2021 12:05:17 - INFO - __main__ - Step 105441: {'lr': 0.00010374145253363651, 'samples': 20244672, 'steps': 105440, 'loss/train': 1.1353776454925537} 11/07/2021 12:05:18 - INFO - __main__ - Step 105442: {'lr': 0.00010373714875112897, 'samples': 20244864, 'steps': 105441, 'loss/train': 1.9148916006088257} 11/07/2021 12:05:18 - INFO - __main__ - Step 105443: {'lr': 0.00010373284503452524, 'samples': 20245056, 'steps': 105442, 'loss/train': 1.4212729930877686} 11/07/2021 12:05:18 - INFO - __main__ - Step 105444: {'lr': 0.00010372854138382721, 'samples': 20245248, 'steps': 105443, 'loss/train': 1.2615203857421875} 11/07/2021 12:05:20 - INFO - __main__ - Step 105445: {'lr': 0.00010372423779903683, 'samples': 20245440, 'steps': 105444, 'loss/train': 1.754805326461792} 11/07/2021 12:05:20 - INFO - __main__ - Step 105446: {'lr': 0.00010371993428015608, 'samples': 20245632, 'steps': 105445, 'loss/train': 1.2597272396087646} 11/07/2021 12:05:20 - INFO - __main__ - Step 105447: {'lr': 0.00010371563082718685, 'samples': 20245824, 'steps': 105446, 'loss/train': 1.1997426748275757} 11/07/2021 12:05:21 - INFO - __main__ - Step 105448: {'lr': 0.00010371132744013112, 'samples': 20246016, 'steps': 105447, 'loss/train': 1.491919994354248} 11/07/2021 12:05:21 - INFO - __main__ - Step 105449: {'lr': 0.0001037070241189908, 'samples': 20246208, 'steps': 105448, 'loss/train': 1.6273729801177979} 11/07/2021 12:05:22 - INFO - __main__ - Step 105450: {'lr': 0.00010370272086376784, 'samples': 20246400, 'steps': 105449, 'loss/train': 0.6862915754318237} 11/07/2021 12:05:22 - INFO - __main__ - Step 105451: {'lr': 0.00010369841767446414, 'samples': 20246592, 'steps': 105450, 'loss/train': 1.369989275932312} 11/07/2021 12:05:23 - INFO - __main__ - Step 105452: {'lr': 0.0001036941145510818, 'samples': 20246784, 'steps': 105451, 'loss/train': 1.3624920845031738} 11/07/2021 12:05:23 - INFO - __main__ - Step 105453: {'lr': 0.00010368981149362256, 'samples': 20246976, 'steps': 105452, 'loss/train': 1.8089706897735596} 11/07/2021 12:05:23 - INFO - __main__ - Step 105454: {'lr': 0.00010368550850208841, 'samples': 20247168, 'steps': 105453, 'loss/train': 1.3322569131851196} 11/07/2021 12:05:24 - INFO - __main__ - Step 105455: {'lr': 0.0001036812055764813, 'samples': 20247360, 'steps': 105454, 'loss/train': 1.2068718671798706} 11/07/2021 12:05:25 - INFO - __main__ - Step 105456: {'lr': 0.00010367690271680319, 'samples': 20247552, 'steps': 105455, 'loss/train': 1.3997535705566406} 11/07/2021 12:05:25 - INFO - __main__ - Step 105457: {'lr': 0.00010367259992305602, 'samples': 20247744, 'steps': 105456, 'loss/train': 1.373322606086731} 11/07/2021 12:05:26 - INFO - __main__ - Step 105458: {'lr': 0.00010366829719524173, 'samples': 20247936, 'steps': 105457, 'loss/train': 1.325236201286316} 11/07/2021 12:05:26 - INFO - __main__ - Step 105459: {'lr': 0.00010366399453336223, 'samples': 20248128, 'steps': 105458, 'loss/train': 1.3621885776519775} 11/07/2021 12:05:26 - INFO - __main__ - Step 105460: {'lr': 0.00010365969193741948, 'samples': 20248320, 'steps': 105459, 'loss/train': 1.1869431734085083} 11/07/2021 12:05:27 - INFO - __main__ - Step 105461: {'lr': 0.0001036553894074154, 'samples': 20248512, 'steps': 105460, 'loss/train': 1.1256749629974365} 11/07/2021 12:05:28 - INFO - __main__ - Step 105462: {'lr': 0.00010365108694335196, 'samples': 20248704, 'steps': 105461, 'loss/train': 2.296013832092285} 11/07/2021 12:05:28 - INFO - __main__ - Step 105463: {'lr': 0.00010364678454523107, 'samples': 20248896, 'steps': 105462, 'loss/train': 1.555535078048706} 11/07/2021 12:05:28 - INFO - __main__ - Step 105464: {'lr': 0.00010364248221305469, 'samples': 20249088, 'steps': 105463, 'loss/train': 0.34064680337905884} 11/07/2021 12:05:29 - INFO - __main__ - Step 105465: {'lr': 0.00010363817994682476, 'samples': 20249280, 'steps': 105464, 'loss/train': 1.1041771173477173} 11/07/2021 12:05:30 - INFO - __main__ - Step 105466: {'lr': 0.00010363387774654326, 'samples': 20249472, 'steps': 105465, 'loss/train': 1.6158801317214966} 11/07/2021 12:05:30 - INFO - __main__ - Step 105467: {'lr': 0.00010362957561221204, 'samples': 20249664, 'steps': 105466, 'loss/train': 0.9935389161109924} 11/07/2021 12:05:30 - INFO - __main__ - Step 105468: {'lr': 0.00010362527354383302, 'samples': 20249856, 'steps': 105467, 'loss/train': 1.61991548538208} 11/07/2021 12:05:31 - INFO - __main__ - Step 105469: {'lr': 0.00010362097154140824, 'samples': 20250048, 'steps': 105468, 'loss/train': 2.727529525756836} 11/07/2021 12:05:31 - INFO - __main__ - Step 105470: {'lr': 0.00010361666960493956, 'samples': 20250240, 'steps': 105469, 'loss/train': 5.710907459259033} 11/07/2021 12:05:32 - INFO - __main__ - Step 105471: {'lr': 0.00010361236773442895, 'samples': 20250432, 'steps': 105470, 'loss/train': 1.4975711107254028} 11/07/2021 12:05:33 - INFO - __main__ - Step 105472: {'lr': 0.00010360806592987837, 'samples': 20250624, 'steps': 105471, 'loss/train': 1.2584984302520752} 11/07/2021 12:05:33 - INFO - __main__ - Step 105473: {'lr': 0.00010360376419128972, 'samples': 20250816, 'steps': 105472, 'loss/train': 1.6634010076522827} 11/07/2021 12:05:33 - INFO - __main__ - Step 105474: {'lr': 0.00010359946251866495, 'samples': 20251008, 'steps': 105473, 'loss/train': 1.648511528968811} 11/07/2021 12:05:34 - INFO - __main__ - Step 105475: {'lr': 0.00010359516091200602, 'samples': 20251200, 'steps': 105474, 'loss/train': 1.500343680381775} 11/07/2021 12:05:34 - INFO - __main__ - Step 105476: {'lr': 0.00010359085937131485, 'samples': 20251392, 'steps': 105475, 'loss/train': 1.6799229383468628} 11/07/2021 12:05:35 - INFO - __main__ - Step 105477: {'lr': 0.00010358655789659335, 'samples': 20251584, 'steps': 105476, 'loss/train': 1.0409706830978394} 11/07/2021 12:05:35 - INFO - __main__ - Step 105478: {'lr': 0.00010358225648784354, 'samples': 20251776, 'steps': 105477, 'loss/train': 1.462394118309021} 11/07/2021 12:05:36 - INFO - __main__ - Step 105479: {'lr': 0.00010357795514506734, 'samples': 20251968, 'steps': 105478, 'loss/train': 1.3127533197402954} 11/07/2021 12:05:36 - INFO - __main__ - Step 105480: {'lr': 0.00010357365386826658, 'samples': 20252160, 'steps': 105479, 'loss/train': 1.526505470275879} 11/07/2021 12:05:36 - INFO - __main__ - Step 105481: {'lr': 0.0001035693526574433, 'samples': 20252352, 'steps': 105480, 'loss/train': 1.1079456806182861} 11/07/2021 12:05:37 - INFO - __main__ - Step 105482: {'lr': 0.0001035650515125994, 'samples': 20252544, 'steps': 105481, 'loss/train': 0.2390187829732895} 11/07/2021 12:05:38 - INFO - __main__ - Step 105483: {'lr': 0.0001035607504337368, 'samples': 20252736, 'steps': 105482, 'loss/train': 1.4692264795303345} 11/07/2021 12:05:38 - INFO - __main__ - Step 105484: {'lr': 0.00010355644942085751, 'samples': 20252928, 'steps': 105483, 'loss/train': 1.3289188146591187} 11/07/2021 12:05:38 - INFO - __main__ - Step 105485: {'lr': 0.00010355214847396338, 'samples': 20253120, 'steps': 105484, 'loss/train': 1.7304816246032715} 11/07/2021 12:05:39 - INFO - __main__ - Step 105486: {'lr': 0.00010354784759305644, 'samples': 20253312, 'steps': 105485, 'loss/train': 1.3225631713867188} 11/07/2021 12:05:40 - INFO - __main__ - Step 105487: {'lr': 0.00010354354677813855, 'samples': 20253504, 'steps': 105486, 'loss/train': 1.0747653245925903} 11/07/2021 12:05:40 - INFO - __main__ - Step 105488: {'lr': 0.00010353924602921166, 'samples': 20253696, 'steps': 105487, 'loss/train': 1.315434217453003} 11/07/2021 12:05:41 - INFO - __main__ - Step 105489: {'lr': 0.00010353494534627774, 'samples': 20253888, 'steps': 105488, 'loss/train': 1.4769155979156494} 11/07/2021 12:05:41 - INFO - __main__ - Step 105490: {'lr': 0.00010353064472933873, 'samples': 20254080, 'steps': 105489, 'loss/train': 1.386688232421875} 11/07/2021 12:05:41 - INFO - __main__ - Step 105491: {'lr': 0.00010352634417839654, 'samples': 20254272, 'steps': 105490, 'loss/train': 0.7272307276725769} 11/07/2021 12:05:42 - INFO - __main__ - Step 105492: {'lr': 0.00010352204369345314, 'samples': 20254464, 'steps': 105491, 'loss/train': 1.456168293952942} 11/07/2021 12:05:43 - INFO - __main__ - Step 105493: {'lr': 0.0001035177432745105, 'samples': 20254656, 'steps': 105492, 'loss/train': 1.535865068435669} 11/07/2021 12:05:43 - INFO - __main__ - Step 105494: {'lr': 0.00010351344292157044, 'samples': 20254848, 'steps': 105493, 'loss/train': 1.073330044746399} 11/07/2021 12:05:43 - INFO - __main__ - Step 105495: {'lr': 0.00010350914263463495, 'samples': 20255040, 'steps': 105494, 'loss/train': 1.8548039197921753} 11/07/2021 12:05:44 - INFO - __main__ - Step 105496: {'lr': 0.00010350484241370598, 'samples': 20255232, 'steps': 105495, 'loss/train': 1.2444919347763062} 11/07/2021 12:05:44 - INFO - __main__ - Step 105497: {'lr': 0.00010350054225878546, 'samples': 20255424, 'steps': 105496, 'loss/train': 1.0105223655700684} 11/07/2021 12:05:45 - INFO - __main__ - Step 105498: {'lr': 0.00010349624216987535, 'samples': 20255616, 'steps': 105497, 'loss/train': 0.9471797943115234} 11/07/2021 12:05:46 - INFO - __main__ - Step 105499: {'lr': 0.00010349194214697757, 'samples': 20255808, 'steps': 105498, 'loss/train': 1.539065957069397} 11/07/2021 12:05:46 - INFO - __main__ - Step 105500: {'lr': 0.00010348764219009408, 'samples': 20256000, 'steps': 105499, 'loss/train': 1.6833252906799316} 11/07/2021 12:05:46 - INFO - __main__ - Step 105501: {'lr': 0.00010348334229922676, 'samples': 20256192, 'steps': 105500, 'loss/train': 1.9729291200637817} 11/07/2021 12:05:47 - INFO - __main__ - Step 105502: {'lr': 0.00010347904247437762, 'samples': 20256384, 'steps': 105501, 'loss/train': 1.3552870750427246} 11/07/2021 12:05:48 - INFO - __main__ - Step 105503: {'lr': 0.00010347474271554855, 'samples': 20256576, 'steps': 105502, 'loss/train': 1.6267638206481934} 11/07/2021 12:05:48 - INFO - __main__ - Step 105504: {'lr': 0.00010347044302274147, 'samples': 20256768, 'steps': 105503, 'loss/train': 1.1766754388809204} 11/07/2021 12:05:48 - INFO - __main__ - Step 105505: {'lr': 0.00010346614339595839, 'samples': 20256960, 'steps': 105504, 'loss/train': 1.4914803504943848} 11/07/2021 12:05:49 - INFO - __main__ - Step 105506: {'lr': 0.00010346184383520126, 'samples': 20257152, 'steps': 105505, 'loss/train': 1.2623112201690674} 11/07/2021 12:05:49 - INFO - __main__ - Step 105507: {'lr': 0.00010345754434047189, 'samples': 20257344, 'steps': 105506, 'loss/train': 1.385140299797058} 11/07/2021 12:05:50 - INFO - __main__ - Step 105508: {'lr': 0.0001034532449117723, 'samples': 20257536, 'steps': 105507, 'loss/train': 1.2770330905914307} 11/07/2021 12:05:51 - INFO - __main__ - Step 105509: {'lr': 0.00010344894554910439, 'samples': 20257728, 'steps': 105508, 'loss/train': 1.1968834400177002} 11/07/2021 12:05:51 - INFO - __main__ - Step 105510: {'lr': 0.00010344464625247014, 'samples': 20257920, 'steps': 105509, 'loss/train': 0.9552929401397705} 11/07/2021 12:05:51 - INFO - __main__ - Step 105511: {'lr': 0.00010344034702187147, 'samples': 20258112, 'steps': 105510, 'loss/train': 1.324398398399353} 11/07/2021 12:05:52 - INFO - __main__ - Step 105512: {'lr': 0.00010343604785731031, 'samples': 20258304, 'steps': 105511, 'loss/train': 1.3896631002426147} 11/07/2021 12:05:52 - INFO - __main__ - Step 105513: {'lr': 0.0001034317487587886, 'samples': 20258496, 'steps': 105512, 'loss/train': 1.7134907245635986} 11/07/2021 12:05:52 - INFO - __main__ - Step 105514: {'lr': 0.00010342744972630833, 'samples': 20258688, 'steps': 105513, 'loss/train': 1.6230376958847046} 11/07/2021 12:05:54 - INFO - __main__ - Step 105515: {'lr': 0.00010342315075987133, 'samples': 20258880, 'steps': 105514, 'loss/train': 0.8055984973907471} 11/07/2021 12:05:54 - INFO - __main__ - Step 105516: {'lr': 0.00010341885185947964, 'samples': 20259072, 'steps': 105515, 'loss/train': 1.4774972200393677} 11/07/2021 12:05:54 - INFO - __main__ - Step 105517: {'lr': 0.00010341455302513511, 'samples': 20259264, 'steps': 105516, 'loss/train': 0.6641988754272461} 11/07/2021 12:05:55 - INFO - __main__ - Step 105518: {'lr': 0.00010341025425683976, 'samples': 20259456, 'steps': 105517, 'loss/train': 1.2660683393478394} 11/07/2021 12:05:55 - INFO - __main__ - Step 105519: {'lr': 0.00010340595555459556, 'samples': 20259648, 'steps': 105518, 'loss/train': 0.05904363468289375} 11/07/2021 12:05:56 - INFO - __main__ - Step 105520: {'lr': 0.00010340165691840428, 'samples': 20259840, 'steps': 105519, 'loss/train': 1.3504177331924438} 11/07/2021 12:05:57 - INFO - __main__ - Step 105521: {'lr': 0.00010339735834826797, 'samples': 20260032, 'steps': 105520, 'loss/train': 3.171208381652832} 11/07/2021 12:05:57 - INFO - __main__ - Step 105522: {'lr': 0.00010339305984418854, 'samples': 20260224, 'steps': 105521, 'loss/train': 1.2060655355453491} 11/07/2021 12:05:57 - INFO - __main__ - Step 105523: {'lr': 0.0001033887614061679, 'samples': 20260416, 'steps': 105522, 'loss/train': 1.4247961044311523} 11/07/2021 12:05:58 - INFO - __main__ - Step 105524: {'lr': 0.00010338446303420806, 'samples': 20260608, 'steps': 105523, 'loss/train': 1.1885920763015747} 11/07/2021 12:05:59 - INFO - __main__ - Step 105525: {'lr': 0.00010338016472831091, 'samples': 20260800, 'steps': 105524, 'loss/train': 1.4575613737106323} 11/07/2021 12:05:59 - INFO - __main__ - Step 105526: {'lr': 0.00010337586648847841, 'samples': 20260992, 'steps': 105525, 'loss/train': 1.1828291416168213} 11/07/2021 12:05:59 - INFO - __main__ - Step 105527: {'lr': 0.00010337156831471245, 'samples': 20261184, 'steps': 105526, 'loss/train': 0.877295732498169} 11/07/2021 12:06:00 - INFO - __main__ - Step 105528: {'lr': 0.00010336727020701504, 'samples': 20261376, 'steps': 105527, 'loss/train': 2.181227684020996} 11/07/2021 12:06:00 - INFO - __main__ - Step 105529: {'lr': 0.00010336297216538803, 'samples': 20261568, 'steps': 105528, 'loss/train': 1.3243598937988281} 11/07/2021 12:06:01 - INFO - __main__ - Step 105530: {'lr': 0.00010335867418983345, 'samples': 20261760, 'steps': 105529, 'loss/train': 0.9805232286453247} 11/07/2021 12:06:01 - INFO - __main__ - Step 105531: {'lr': 0.00010335437628035314, 'samples': 20261952, 'steps': 105530, 'loss/train': 1.526213526725769} 11/07/2021 12:06:02 - INFO - __main__ - Step 105532: {'lr': 0.0001033500784369491, 'samples': 20262144, 'steps': 105531, 'loss/train': 1.4128681421279907} 11/07/2021 12:06:02 - INFO - __main__ - Step 105533: {'lr': 0.00010334578065962335, 'samples': 20262336, 'steps': 105532, 'loss/train': 0.9563823342323303} 11/07/2021 12:06:03 - INFO - __main__ - Step 105534: {'lr': 0.00010334148294837764, 'samples': 20262528, 'steps': 105533, 'loss/train': 0.992871105670929} 11/07/2021 12:06:04 - INFO - __main__ - Step 105535: {'lr': 0.000103337185303214, 'samples': 20262720, 'steps': 105534, 'loss/train': 1.2662967443466187} 11/07/2021 12:06:04 - INFO - __main__ - Step 105536: {'lr': 0.00010333288772413435, 'samples': 20262912, 'steps': 105535, 'loss/train': 1.7394973039627075} 11/07/2021 12:06:04 - INFO - __main__ - Step 105537: {'lr': 0.00010332859021114063, 'samples': 20263104, 'steps': 105536, 'loss/train': 1.418251633644104} 11/07/2021 12:06:05 - INFO - __main__ - Step 105538: {'lr': 0.0001033242927642348, 'samples': 20263296, 'steps': 105537, 'loss/train': 1.0301356315612793} 11/07/2021 12:06:05 - INFO - __main__ - Step 105539: {'lr': 0.00010331999538341877, 'samples': 20263488, 'steps': 105538, 'loss/train': 1.36501944065094} 11/07/2021 12:06:06 - INFO - __main__ - Step 105540: {'lr': 0.0001033156980686945, 'samples': 20263680, 'steps': 105539, 'loss/train': 1.096165418624878} 11/07/2021 12:06:06 - INFO - __main__ - Step 105541: {'lr': 0.00010331140082006391, 'samples': 20263872, 'steps': 105540, 'loss/train': 1.4117103815078735} 11/07/2021 12:06:07 - INFO - __main__ - Step 105542: {'lr': 0.00010330710363752893, 'samples': 20264064, 'steps': 105541, 'loss/train': 1.697403073310852} 11/07/2021 12:06:07 - INFO - __main__ - Step 105543: {'lr': 0.0001033028065210915, 'samples': 20264256, 'steps': 105542, 'loss/train': 1.1199848651885986} 11/07/2021 12:06:07 - INFO - __main__ - Step 105544: {'lr': 0.00010329850947075359, 'samples': 20264448, 'steps': 105543, 'loss/train': 1.9459431171417236} 11/07/2021 12:06:08 - INFO - __main__ - Step 105545: {'lr': 0.00010329421248651707, 'samples': 20264640, 'steps': 105544, 'loss/train': 1.149633765220642} 11/07/2021 12:06:09 - INFO - __main__ - Step 105546: {'lr': 0.00010328991556838401, 'samples': 20264832, 'steps': 105545, 'loss/train': 1.2495008707046509} 11/07/2021 12:06:09 - INFO - __main__ - Step 105547: {'lr': 0.0001032856187163562, 'samples': 20265024, 'steps': 105546, 'loss/train': 0.8596476316452026} 11/07/2021 12:06:10 - INFO - __main__ - Step 105548: {'lr': 0.0001032813219304356, 'samples': 20265216, 'steps': 105547, 'loss/train': 1.3537871837615967} 11/07/2021 12:06:10 - INFO - __main__ - Step 105549: {'lr': 0.00010327702521062415, 'samples': 20265408, 'steps': 105548, 'loss/train': 1.0981343984603882} 11/07/2021 12:06:10 - INFO - __main__ - Step 105550: {'lr': 0.00010327272855692385, 'samples': 20265600, 'steps': 105549, 'loss/train': 1.3093687295913696} 11/07/2021 12:06:11 - INFO - __main__ - Step 105551: {'lr': 0.00010326843196933658, 'samples': 20265792, 'steps': 105550, 'loss/train': 1.2504197359085083} 11/07/2021 12:06:12 - INFO - __main__ - Step 105552: {'lr': 0.00010326413544786429, 'samples': 20265984, 'steps': 105551, 'loss/train': 1.6319226026535034} 11/07/2021 12:06:12 - INFO - __main__ - Step 105553: {'lr': 0.0001032598389925089, 'samples': 20266176, 'steps': 105552, 'loss/train': 1.1955783367156982} 11/07/2021 12:06:12 - INFO - __main__ - Step 105554: {'lr': 0.00010325554260327239, 'samples': 20266368, 'steps': 105553, 'loss/train': 1.4402669668197632} 11/07/2021 12:06:13 - INFO - __main__ - Step 105555: {'lr': 0.00010325124628015665, 'samples': 20266560, 'steps': 105554, 'loss/train': 1.3196899890899658} 11/07/2021 12:06:14 - INFO - __main__ - Step 105556: {'lr': 0.00010324695002316362, 'samples': 20266752, 'steps': 105555, 'loss/train': 1.3182960748672485} 11/07/2021 12:06:14 - INFO - __main__ - Step 105557: {'lr': 0.00010324265383229526, 'samples': 20266944, 'steps': 105556, 'loss/train': 1.3344199657440186} 11/07/2021 12:06:14 - INFO - __main__ - Step 105558: {'lr': 0.0001032383577075535, 'samples': 20267136, 'steps': 105557, 'loss/train': 1.0984288454055786} 11/07/2021 12:06:15 - INFO - __main__ - Step 105559: {'lr': 0.00010323406164894027, 'samples': 20267328, 'steps': 105558, 'loss/train': 1.2410061359405518} 11/07/2021 12:06:15 - INFO - __main__ - Step 105560: {'lr': 0.00010322976565645761, 'samples': 20267520, 'steps': 105559, 'loss/train': 1.4046430587768555} 11/07/2021 12:06:16 - INFO - __main__ - Step 105561: {'lr': 0.00010322546973010724, 'samples': 20267712, 'steps': 105560, 'loss/train': 1.3595689535140991} 11/07/2021 12:06:17 - INFO - __main__ - Step 105562: {'lr': 0.00010322117386989122, 'samples': 20267904, 'steps': 105561, 'loss/train': 1.1497108936309814} 11/07/2021 12:06:17 - INFO - __main__ - Step 105563: {'lr': 0.00010321687807581148, 'samples': 20268096, 'steps': 105562, 'loss/train': 1.7569166421890259} 11/07/2021 12:06:17 - INFO - __main__ - Step 105564: {'lr': 0.00010321258234786996, 'samples': 20268288, 'steps': 105563, 'loss/train': 1.5004655122756958} 11/07/2021 12:06:18 - INFO - __main__ - Step 105565: {'lr': 0.00010320828668606855, 'samples': 20268480, 'steps': 105564, 'loss/train': 1.1334751844406128} 11/07/2021 12:06:18 - INFO - __main__ - Step 105566: {'lr': 0.00010320399109040927, 'samples': 20268672, 'steps': 105565, 'loss/train': 1.329032301902771} 11/07/2021 12:06:19 - INFO - __main__ - Step 105567: {'lr': 0.00010319969556089396, 'samples': 20268864, 'steps': 105566, 'loss/train': 0.931068480014801} 11/07/2021 12:06:20 - INFO - __main__ - Step 105568: {'lr': 0.00010319540009752463, 'samples': 20269056, 'steps': 105567, 'loss/train': 1.3297373056411743} 11/07/2021 12:06:20 - INFO - __main__ - Step 105569: {'lr': 0.00010319110470030315, 'samples': 20269248, 'steps': 105568, 'loss/train': 1.5599391460418701} 11/07/2021 12:06:20 - INFO - __main__ - Step 105570: {'lr': 0.00010318680936923153, 'samples': 20269440, 'steps': 105569, 'loss/train': 0.9496349692344666} 11/07/2021 12:06:21 - INFO - __main__ - Step 105571: {'lr': 0.00010318251410431164, 'samples': 20269632, 'steps': 105570, 'loss/train': 1.2620066404342651} 11/07/2021 12:06:22 - INFO - __main__ - Step 105572: {'lr': 0.00010317821890554549, 'samples': 20269824, 'steps': 105571, 'loss/train': 1.5928422212600708} 11/07/2021 12:06:22 - INFO - __main__ - Step 105573: {'lr': 0.00010317392377293503, 'samples': 20270016, 'steps': 105572, 'loss/train': 1.058362364768982} 11/07/2021 12:06:22 - INFO - __main__ - Step 105574: {'lr': 0.00010316962870648203, 'samples': 20270208, 'steps': 105573, 'loss/train': 1.193011999130249} 11/07/2021 12:06:23 - INFO - __main__ - Step 105575: {'lr': 0.00010316533370618856, 'samples': 20270400, 'steps': 105574, 'loss/train': 1.352323055267334} 11/07/2021 12:06:23 - INFO - __main__ - Step 105576: {'lr': 0.00010316103877205649, 'samples': 20270592, 'steps': 105575, 'loss/train': 1.4720104932785034} 11/07/2021 12:06:24 - INFO - __main__ - Step 105577: {'lr': 0.0001031567439040878, 'samples': 20270784, 'steps': 105576, 'loss/train': 1.0651406049728394} 11/07/2021 12:06:24 - INFO - __main__ - Step 105578: {'lr': 0.00010315244910228445, 'samples': 20270976, 'steps': 105577, 'loss/train': 1.3225407600402832} 11/07/2021 12:06:25 - INFO - __main__ - Step 105579: {'lr': 0.00010314815436664828, 'samples': 20271168, 'steps': 105578, 'loss/train': 1.4422154426574707} 11/07/2021 12:06:25 - INFO - __main__ - Step 105580: {'lr': 0.00010314385969718135, 'samples': 20271360, 'steps': 105579, 'loss/train': 1.4184362888336182} 11/07/2021 12:06:25 - INFO - __main__ - Step 105581: {'lr': 0.0001031395650938855, 'samples': 20271552, 'steps': 105580, 'loss/train': 1.1060231924057007} 11/07/2021 12:06:26 - INFO - __main__ - Step 105582: {'lr': 0.0001031352705567627, 'samples': 20271744, 'steps': 105581, 'loss/train': 1.4861047267913818} 11/07/2021 12:06:27 - INFO - __main__ - Step 105583: {'lr': 0.00010313097608581487, 'samples': 20271936, 'steps': 105582, 'loss/train': 1.3110417127609253} 11/07/2021 12:06:27 - INFO - __main__ - Step 105584: {'lr': 0.00010312668168104397, 'samples': 20272128, 'steps': 105583, 'loss/train': 1.2172430753707886} 11/07/2021 12:06:28 - INFO - __main__ - Step 105585: {'lr': 0.00010312238734245192, 'samples': 20272320, 'steps': 105584, 'loss/train': 1.4915778636932373} 11/07/2021 12:06:28 - INFO - __main__ - Step 105586: {'lr': 0.00010311809307004063, 'samples': 20272512, 'steps': 105585, 'loss/train': 1.1894770860671997} 11/07/2021 12:06:28 - INFO - __main__ - Step 105587: {'lr': 0.00010311379886381216, 'samples': 20272704, 'steps': 105586, 'loss/train': 1.3942461013793945} 11/07/2021 12:06:29 - INFO - __main__ - Step 105588: {'lr': 0.00010310950472376829, 'samples': 20272896, 'steps': 105587, 'loss/train': 1.0737781524658203} 11/07/2021 12:06:30 - INFO - __main__ - Step 105589: {'lr': 0.00010310521064991096, 'samples': 20273088, 'steps': 105588, 'loss/train': 1.7462506294250488} 11/07/2021 12:06:30 - INFO - __main__ - Step 105590: {'lr': 0.0001031009166422422, 'samples': 20273280, 'steps': 105589, 'loss/train': 1.4255107641220093} 11/07/2021 12:06:30 - INFO - __main__ - Step 105591: {'lr': 0.0001030966227007639, 'samples': 20273472, 'steps': 105590, 'loss/train': 1.229748249053955} 11/07/2021 12:06:31 - INFO - __main__ - Step 105592: {'lr': 0.00010309232882547795, 'samples': 20273664, 'steps': 105591, 'loss/train': 1.1284033060073853} 11/07/2021 12:06:32 - INFO - __main__ - Step 105593: {'lr': 0.00010308803501638636, 'samples': 20273856, 'steps': 105592, 'loss/train': 0.6305789351463318} 11/07/2021 12:06:32 - INFO - __main__ - Step 105594: {'lr': 0.00010308374127349104, 'samples': 20274048, 'steps': 105593, 'loss/train': 1.4934682846069336} 11/07/2021 12:06:32 - INFO - __main__ - Step 105595: {'lr': 0.00010307944759679392, 'samples': 20274240, 'steps': 105594, 'loss/train': 1.2898812294006348} 11/07/2021 12:06:33 - INFO - __main__ - Step 105596: {'lr': 0.00010307515398629691, 'samples': 20274432, 'steps': 105595, 'loss/train': 1.3989410400390625} 11/07/2021 12:06:33 - INFO - __main__ - Step 105597: {'lr': 0.00010307086044200198, 'samples': 20274624, 'steps': 105596, 'loss/train': 1.345393180847168} 11/07/2021 12:06:34 - INFO - __main__ - Step 105598: {'lr': 0.00010306656696391106, 'samples': 20274816, 'steps': 105597, 'loss/train': 1.2001131772994995} 11/07/2021 12:06:35 - INFO - __main__ - Step 105599: {'lr': 0.00010306227355202608, 'samples': 20275008, 'steps': 105598, 'loss/train': 1.3107614517211914} 11/07/2021 12:06:35 - INFO - __main__ - Step 105600: {'lr': 0.00010305798020634904, 'samples': 20275200, 'steps': 105599, 'loss/train': 1.5565766096115112} 11/07/2021 12:06:35 - INFO - __main__ - Step 105601: {'lr': 0.00010305368692688174, 'samples': 20275392, 'steps': 105600, 'loss/train': 1.1424075365066528} 11/07/2021 12:06:36 - INFO - __main__ - Step 105602: {'lr': 0.00010304939371362618, 'samples': 20275584, 'steps': 105601, 'loss/train': 1.4525169134140015} 11/07/2021 12:06:37 - INFO - __main__ - Step 105603: {'lr': 0.00010304510056658428, 'samples': 20275776, 'steps': 105602, 'loss/train': 1.5931110382080078} 11/07/2021 12:06:37 - INFO - __main__ - Step 105604: {'lr': 0.000103040807485758, 'samples': 20275968, 'steps': 105603, 'loss/train': 0.8545600175857544} 11/07/2021 12:06:37 - INFO - __main__ - Step 105605: {'lr': 0.00010303651447114926, 'samples': 20276160, 'steps': 105604, 'loss/train': 1.6433473825454712} 11/07/2021 12:06:38 - INFO - __main__ - Step 105606: {'lr': 0.00010303222152276001, 'samples': 20276352, 'steps': 105605, 'loss/train': 1.389098048210144} 11/07/2021 12:06:38 - INFO - __main__ - Step 105607: {'lr': 0.00010302792864059216, 'samples': 20276544, 'steps': 105606, 'loss/train': 0.06187174469232559} 11/07/2021 12:06:39 - INFO - __main__ - Step 105608: {'lr': 0.00010302363582464766, 'samples': 20276736, 'steps': 105607, 'loss/train': 1.6253803968429565} 11/07/2021 12:06:39 - INFO - __main__ - Step 105609: {'lr': 0.00010301934307492844, 'samples': 20276928, 'steps': 105608, 'loss/train': 1.3836230039596558} 11/07/2021 12:06:40 - INFO - __main__ - Step 105610: {'lr': 0.00010301505039143644, 'samples': 20277120, 'steps': 105609, 'loss/train': 1.1918634176254272} 11/07/2021 12:06:40 - INFO - __main__ - Step 105611: {'lr': 0.00010301075777417368, 'samples': 20277312, 'steps': 105610, 'loss/train': 1.2472835779190063} 11/07/2021 12:06:40 - INFO - __main__ - Step 105612: {'lr': 0.00010300646522314192, 'samples': 20277504, 'steps': 105611, 'loss/train': 1.635624647140503} 11/07/2021 12:06:42 - INFO - __main__ - Step 105613: {'lr': 0.00010300217273834317, 'samples': 20277696, 'steps': 105612, 'loss/train': 1.6223064661026} 11/07/2021 12:06:42 - INFO - __main__ - Step 105614: {'lr': 0.00010299788031977938, 'samples': 20277888, 'steps': 105613, 'loss/train': 1.7944903373718262} 11/07/2021 12:06:42 - INFO - __main__ - Step 105615: {'lr': 0.00010299358796745248, 'samples': 20278080, 'steps': 105614, 'loss/train': 1.3070764541625977} 11/07/2021 12:06:43 - INFO - __main__ - Step 105616: {'lr': 0.00010298929568136439, 'samples': 20278272, 'steps': 105615, 'loss/train': 1.3411873579025269} 11/07/2021 12:06:43 - INFO - __main__ - Step 105617: {'lr': 0.00010298500346151707, 'samples': 20278464, 'steps': 105616, 'loss/train': 1.44691801071167} 11/07/2021 12:06:43 - INFO - __main__ - Step 105618: {'lr': 0.00010298071130791243, 'samples': 20278656, 'steps': 105617, 'loss/train': 1.2691041231155396} 11/07/2021 12:06:44 - INFO - __main__ - Step 105619: {'lr': 0.0001029764192205524, 'samples': 20278848, 'steps': 105618, 'loss/train': 0.6909158825874329} 11/07/2021 12:06:45 - INFO - __main__ - Step 105620: {'lr': 0.00010297212719943893, 'samples': 20279040, 'steps': 105619, 'loss/train': 1.5588514804840088} 11/07/2021 12:06:45 - INFO - __main__ - Step 105621: {'lr': 0.00010296783524457395, 'samples': 20279232, 'steps': 105620, 'loss/train': 0.9952182173728943} 11/07/2021 12:06:45 - INFO - __main__ - Step 105622: {'lr': 0.0001029635433559595, 'samples': 20279424, 'steps': 105621, 'loss/train': 1.4159111976623535} 11/07/2021 12:06:46 - INFO - __main__ - Step 105623: {'lr': 0.00010295925153359731, 'samples': 20279616, 'steps': 105622, 'loss/train': 1.1800111532211304} 11/07/2021 12:06:47 - INFO - __main__ - Step 105624: {'lr': 0.00010295495977748942, 'samples': 20279808, 'steps': 105623, 'loss/train': 2.0892879962921143} 11/07/2021 12:06:47 - INFO - __main__ - Step 105625: {'lr': 0.00010295066808763775, 'samples': 20280000, 'steps': 105624, 'loss/train': 1.2920665740966797} 11/07/2021 12:06:48 - INFO - __main__ - Step 105626: {'lr': 0.00010294637646404422, 'samples': 20280192, 'steps': 105625, 'loss/train': 1.0806605815887451} 11/07/2021 12:06:48 - INFO - __main__ - Step 105627: {'lr': 0.0001029420849067108, 'samples': 20280384, 'steps': 105626, 'loss/train': 1.4385145902633667} 11/07/2021 12:06:48 - INFO - __main__ - Step 105628: {'lr': 0.00010293779341563942, 'samples': 20280576, 'steps': 105627, 'loss/train': 1.3716057538986206} 11/07/2021 12:06:49 - INFO - __main__ - Step 105629: {'lr': 0.00010293350199083198, 'samples': 20280768, 'steps': 105628, 'loss/train': 1.007691502571106} 11/07/2021 12:06:50 - INFO - __main__ - Step 105630: {'lr': 0.00010292921063229046, 'samples': 20280960, 'steps': 105629, 'loss/train': 1.059717059135437} 11/07/2021 12:06:50 - INFO - __main__ - Step 105631: {'lr': 0.00010292491934001674, 'samples': 20281152, 'steps': 105630, 'loss/train': 1.3486661911010742} 11/07/2021 12:06:50 - INFO - __main__ - Step 105632: {'lr': 0.00010292062811401281, 'samples': 20281344, 'steps': 105631, 'loss/train': 1.0539524555206299} 11/07/2021 12:06:51 - INFO - __main__ - Step 105633: {'lr': 0.00010291633695428066, 'samples': 20281536, 'steps': 105632, 'loss/train': 0.7712005972862244} 11/07/2021 12:06:52 - INFO - __main__ - Step 105634: {'lr': 0.00010291204586082204, 'samples': 20281728, 'steps': 105633, 'loss/train': 1.3171730041503906} 11/07/2021 12:06:52 - INFO - __main__ - Step 105635: {'lr': 0.00010290775483363899, 'samples': 20281920, 'steps': 105634, 'loss/train': 0.5209256410598755} 11/07/2021 12:06:53 - INFO - __main__ - Step 105636: {'lr': 0.00010290346387273341, 'samples': 20282112, 'steps': 105635, 'loss/train': 0.4937751293182373} 11/07/2021 12:06:53 - INFO - __main__ - Step 105637: {'lr': 0.00010289917297810728, 'samples': 20282304, 'steps': 105636, 'loss/train': 1.3597716093063354} 11/07/2021 12:06:53 - INFO - __main__ - Step 105638: {'lr': 0.0001028948821497625, 'samples': 20282496, 'steps': 105637, 'loss/train': 1.1029547452926636} 11/07/2021 12:06:54 - INFO - __main__ - Step 105639: {'lr': 0.00010289059138770104, 'samples': 20282688, 'steps': 105638, 'loss/train': 0.9900223016738892} 11/07/2021 12:06:55 - INFO - __main__ - Step 105640: {'lr': 0.00010288630069192479, 'samples': 20282880, 'steps': 105639, 'loss/train': 1.558945894241333} 11/07/2021 12:06:55 - INFO - __main__ - Step 105641: {'lr': 0.00010288201006243572, 'samples': 20283072, 'steps': 105640, 'loss/train': 1.709099531173706} 11/07/2021 12:06:56 - INFO - __main__ - Step 105642: {'lr': 0.00010287771949923571, 'samples': 20283264, 'steps': 105641, 'loss/train': 1.3179343938827515} 11/07/2021 12:06:56 - INFO - __main__ - Step 105643: {'lr': 0.00010287342900232676, 'samples': 20283456, 'steps': 105642, 'loss/train': 1.6443090438842773} 11/07/2021 12:06:56 - INFO - __main__ - Step 105644: {'lr': 0.00010286913857171088, 'samples': 20283648, 'steps': 105643, 'loss/train': 0.49808773398399353} 11/07/2021 12:06:57 - INFO - __main__ - Step 105645: {'lr': 0.00010286484820738975, 'samples': 20283840, 'steps': 105644, 'loss/train': 1.5603147745132446} 11/07/2021 12:06:58 - INFO - __main__ - Step 105646: {'lr': 0.00010286055790936549, 'samples': 20284032, 'steps': 105645, 'loss/train': 1.206398367881775} 11/07/2021 12:06:58 - INFO - __main__ - Step 105647: {'lr': 0.00010285626767764, 'samples': 20284224, 'steps': 105646, 'loss/train': 1.2388396263122559} 11/07/2021 12:06:58 - INFO - __main__ - Step 105648: {'lr': 0.00010285197751221517, 'samples': 20284416, 'steps': 105647, 'loss/train': 1.5965512990951538} 11/07/2021 12:06:59 - INFO - __main__ - Step 105649: {'lr': 0.000102847687413093, 'samples': 20284608, 'steps': 105648, 'loss/train': 0.8742455244064331} 11/07/2021 12:07:00 - INFO - __main__ - Step 105650: {'lr': 0.00010284339738027538, 'samples': 20284800, 'steps': 105649, 'loss/train': 1.0577226877212524} 11/07/2021 12:07:00 - INFO - __main__ - Step 105651: {'lr': 0.00010283910741376427, 'samples': 20284992, 'steps': 105650, 'loss/train': 1.0638498067855835} 11/07/2021 12:07:00 - INFO - __main__ - Step 105652: {'lr': 0.00010283481751356155, 'samples': 20285184, 'steps': 105651, 'loss/train': 1.5202038288116455} 11/07/2021 12:07:01 - INFO - __main__ - Step 105653: {'lr': 0.00010283052767966922, 'samples': 20285376, 'steps': 105652, 'loss/train': 1.0886237621307373} 11/07/2021 12:07:01 - INFO - __main__ - Step 105654: {'lr': 0.00010282623791208917, 'samples': 20285568, 'steps': 105653, 'loss/train': 0.8728607892990112} 11/07/2021 12:07:02 - INFO - __main__ - Step 105655: {'lr': 0.00010282194821082344, 'samples': 20285760, 'steps': 105654, 'loss/train': 1.3663177490234375} 11/07/2021 12:07:02 - INFO - __main__ - Step 105656: {'lr': 0.00010281765857587377, 'samples': 20285952, 'steps': 105655, 'loss/train': 1.4297096729278564} 11/07/2021 12:07:03 - INFO - __main__ - Step 105657: {'lr': 0.0001028133690072422, 'samples': 20286144, 'steps': 105656, 'loss/train': 1.4963372945785522} 11/07/2021 12:07:03 - INFO - __main__ - Step 105658: {'lr': 0.00010280907950493066, 'samples': 20286336, 'steps': 105657, 'loss/train': 1.6893200874328613} 11/07/2021 12:07:04 - INFO - __main__ - Step 105659: {'lr': 0.00010280479006894108, 'samples': 20286528, 'steps': 105658, 'loss/train': 0.9596771597862244} 11/07/2021 12:07:04 - INFO - __main__ - Step 105660: {'lr': 0.00010280050069927538, 'samples': 20286720, 'steps': 105659, 'loss/train': 1.3507534265518188} 11/07/2021 12:07:05 - INFO - __main__ - Step 105661: {'lr': 0.0001027962113959355, 'samples': 20286912, 'steps': 105660, 'loss/train': 1.0262669324874878} 11/07/2021 12:07:05 - INFO - __main__ - Step 105662: {'lr': 0.00010279192215892339, 'samples': 20287104, 'steps': 105661, 'loss/train': 0.8345233798027039} 11/07/2021 12:07:06 - INFO - __main__ - Step 105663: {'lr': 0.00010278763298824096, 'samples': 20287296, 'steps': 105662, 'loss/train': 1.3201940059661865} 11/07/2021 12:07:06 - INFO - __main__ - Step 105664: {'lr': 0.00010278334388389016, 'samples': 20287488, 'steps': 105663, 'loss/train': 2.190007209777832} 11/07/2021 12:07:07 - INFO - __main__ - Step 105665: {'lr': 0.00010277905484587288, 'samples': 20287680, 'steps': 105664, 'loss/train': 1.6991323232650757} 11/07/2021 12:07:07 - INFO - __main__ - Step 105666: {'lr': 0.00010277476587419112, 'samples': 20287872, 'steps': 105665, 'loss/train': 1.2177413702011108} 11/07/2021 12:07:08 - INFO - __main__ - Step 105667: {'lr': 0.00010277047696884687, 'samples': 20288064, 'steps': 105666, 'loss/train': 1.819709062576294} 11/07/2021 12:07:08 - INFO - __main__ - Step 105668: {'lr': 0.00010276618812984188, 'samples': 20288256, 'steps': 105667, 'loss/train': 0.989504873752594} 11/07/2021 12:07:09 - INFO - __main__ - Step 105669: {'lr': 0.00010276189935717814, 'samples': 20288448, 'steps': 105668, 'loss/train': 1.304787516593933} 11/07/2021 12:07:09 - INFO - __main__ - Step 105670: {'lr': 0.00010275761065085764, 'samples': 20288640, 'steps': 105669, 'loss/train': 1.50724196434021} 11/07/2021 12:07:10 - INFO - __main__ - Step 105671: {'lr': 0.00010275332201088231, 'samples': 20288832, 'steps': 105670, 'loss/train': 0.9976056218147278} 11/07/2021 12:07:10 - INFO - __main__ - Step 105672: {'lr': 0.00010274903343725403, 'samples': 20289024, 'steps': 105671, 'loss/train': 1.6215322017669678} 11/07/2021 12:07:11 - INFO - __main__ - Step 105673: {'lr': 0.00010274474492997477, 'samples': 20289216, 'steps': 105672, 'loss/train': 1.4421403408050537} 11/07/2021 12:07:11 - INFO - __main__ - Step 105674: {'lr': 0.00010274045648904646, 'samples': 20289408, 'steps': 105673, 'loss/train': 1.592057466506958} 11/07/2021 12:07:11 - INFO - __main__ - Step 105675: {'lr': 0.00010273616811447103, 'samples': 20289600, 'steps': 105674, 'loss/train': 1.1676006317138672} 11/07/2021 12:07:12 - INFO - __main__ - Step 105676: {'lr': 0.0001027318798062504, 'samples': 20289792, 'steps': 105675, 'loss/train': 1.1865125894546509} 11/07/2021 12:07:13 - INFO - __main__ - Step 105677: {'lr': 0.00010272759156438651, 'samples': 20289984, 'steps': 105676, 'loss/train': 1.164299726486206} 11/07/2021 12:07:13 - INFO - __main__ - Step 105678: {'lr': 0.0001027233033888813, 'samples': 20290176, 'steps': 105677, 'loss/train': 1.5551775693893433} 11/07/2021 12:07:14 - INFO - __main__ - Step 105679: {'lr': 0.00010271901527973671, 'samples': 20290368, 'steps': 105678, 'loss/train': 1.3400053977966309} 11/07/2021 12:07:14 - INFO - __main__ - Step 105680: {'lr': 0.00010271472723695466, 'samples': 20290560, 'steps': 105679, 'loss/train': 1.5202378034591675} 11/07/2021 12:07:15 - INFO - __main__ - Step 105681: {'lr': 0.00010271043926053716, 'samples': 20290752, 'steps': 105680, 'loss/train': 1.5521771907806396} 11/07/2021 12:07:15 - INFO - __main__ - Step 105682: {'lr': 0.00010270615135048597, 'samples': 20290944, 'steps': 105681, 'loss/train': 1.1645479202270508} 11/07/2021 12:07:16 - INFO - __main__ - Step 105683: {'lr': 0.00010270186350680314, 'samples': 20291136, 'steps': 105682, 'loss/train': 0.9790390729904175} 11/07/2021 12:07:16 - INFO - __main__ - Step 105684: {'lr': 0.00010269757572949054, 'samples': 20291328, 'steps': 105683, 'loss/train': 1.5701559782028198} 11/07/2021 12:07:16 - INFO - __main__ - Step 105685: {'lr': 0.00010269328801855016, 'samples': 20291520, 'steps': 105684, 'loss/train': 1.284644603729248} 11/07/2021 12:07:17 - INFO - __main__ - Step 105686: {'lr': 0.0001026890003739839, 'samples': 20291712, 'steps': 105685, 'loss/train': 1.2740343809127808} 11/07/2021 12:07:18 - INFO - __main__ - Step 105687: {'lr': 0.00010268471279579372, 'samples': 20291904, 'steps': 105686, 'loss/train': 1.4526315927505493} 11/07/2021 12:07:18 - INFO - __main__ - Step 105688: {'lr': 0.00010268042528398153, 'samples': 20292096, 'steps': 105687, 'loss/train': 1.2724488973617554} 11/07/2021 12:07:19 - INFO - __main__ - Step 105689: {'lr': 0.00010267613783854925, 'samples': 20292288, 'steps': 105688, 'loss/train': 1.1726940870285034} 11/07/2021 12:07:19 - INFO - __main__ - Step 105690: {'lr': 0.00010267185045949884, 'samples': 20292480, 'steps': 105689, 'loss/train': 1.5907998085021973} 11/07/2021 12:07:19 - INFO - __main__ - Step 105691: {'lr': 0.00010266756314683224, 'samples': 20292672, 'steps': 105690, 'loss/train': 1.4560770988464355} 11/07/2021 12:07:20 - INFO - __main__ - Step 105692: {'lr': 0.00010266327590055132, 'samples': 20292864, 'steps': 105691, 'loss/train': 0.7935671210289001} 11/07/2021 12:07:21 - INFO - __main__ - Step 105693: {'lr': 0.0001026589887206581, 'samples': 20293056, 'steps': 105692, 'loss/train': 1.1738500595092773} 11/07/2021 12:07:21 - INFO - __main__ - Step 105694: {'lr': 0.00010265470160715453, 'samples': 20293248, 'steps': 105693, 'loss/train': 0.90053391456604} 11/07/2021 12:07:21 - INFO - __main__ - Step 105695: {'lr': 0.0001026504145600424, 'samples': 20293440, 'steps': 105694, 'loss/train': 1.023656964302063} 11/07/2021 12:07:22 - INFO - __main__ - Step 105696: {'lr': 0.00010264612757932371, 'samples': 20293632, 'steps': 105695, 'loss/train': 1.130550503730774} 11/07/2021 12:07:23 - INFO - __main__ - Step 105697: {'lr': 0.0001026418406650004, 'samples': 20293824, 'steps': 105696, 'loss/train': 1.617490291595459} 11/07/2021 12:07:23 - INFO - __main__ - Step 105698: {'lr': 0.00010263755381707441, 'samples': 20294016, 'steps': 105697, 'loss/train': 2.0588841438293457} 11/07/2021 12:07:23 - INFO - __main__ - Step 105699: {'lr': 0.00010263326703554765, 'samples': 20294208, 'steps': 105698, 'loss/train': 1.2022944688796997} 11/07/2021 12:07:24 - INFO - __main__ - Step 105700: {'lr': 0.00010262898032042208, 'samples': 20294400, 'steps': 105699, 'loss/train': 1.4205271005630493} 11/07/2021 12:07:24 - INFO - __main__ - Step 105701: {'lr': 0.00010262469367169963, 'samples': 20294592, 'steps': 105700, 'loss/train': 0.055775534361600876} 11/07/2021 12:07:25 - INFO - __main__ - Step 105702: {'lr': 0.0001026204070893822, 'samples': 20294784, 'steps': 105701, 'loss/train': 1.3685481548309326} 11/07/2021 12:07:26 - INFO - __main__ - Step 105703: {'lr': 0.00010261612057347175, 'samples': 20294976, 'steps': 105702, 'loss/train': 1.278602123260498} 11/07/2021 12:07:26 - INFO - __main__ - Step 105704: {'lr': 0.00010261183412397018, 'samples': 20295168, 'steps': 105703, 'loss/train': 1.1558564901351929} 11/07/2021 12:07:26 - INFO - __main__ - Step 105705: {'lr': 0.00010260754774087947, 'samples': 20295360, 'steps': 105704, 'loss/train': 1.071705937385559} 11/07/2021 12:07:27 - INFO - __main__ - Step 105706: {'lr': 0.00010260326142420151, 'samples': 20295552, 'steps': 105705, 'loss/train': 1.4937596321105957} 11/07/2021 12:07:28 - INFO - __main__ - Step 105707: {'lr': 0.00010259897517393826, 'samples': 20295744, 'steps': 105706, 'loss/train': 1.5457464456558228} 11/07/2021 12:07:28 - INFO - __main__ - Step 105708: {'lr': 0.00010259468899009172, 'samples': 20295936, 'steps': 105707, 'loss/train': 1.279968500137329} 11/07/2021 12:07:28 - INFO - __main__ - Step 105709: {'lr': 0.00010259040287266363, 'samples': 20296128, 'steps': 105708, 'loss/train': 1.1010032892227173} 11/07/2021 12:07:29 - INFO - __main__ - Step 105710: {'lr': 0.00010258611682165605, 'samples': 20296320, 'steps': 105709, 'loss/train': 1.7725484371185303} 11/07/2021 12:07:29 - INFO - __main__ - Step 105711: {'lr': 0.0001025818308370709, 'samples': 20296512, 'steps': 105710, 'loss/train': 1.6109014749526978} 11/07/2021 12:07:30 - INFO - __main__ - Step 105712: {'lr': 0.00010257754491891009, 'samples': 20296704, 'steps': 105711, 'loss/train': 1.0382897853851318} 11/07/2021 12:07:31 - INFO - __main__ - Step 105713: {'lr': 0.00010257325906717554, 'samples': 20296896, 'steps': 105712, 'loss/train': 1.3258330821990967} 11/07/2021 12:07:31 - INFO - __main__ - Step 105714: {'lr': 0.00010256897328186923, 'samples': 20297088, 'steps': 105713, 'loss/train': 1.3670029640197754} 11/07/2021 12:07:31 - INFO - __main__ - Step 105715: {'lr': 0.00010256468756299306, 'samples': 20297280, 'steps': 105714, 'loss/train': 1.1290488243103027} 11/07/2021 12:07:32 - INFO - __main__ - Step 105716: {'lr': 0.00010256040191054897, 'samples': 20297472, 'steps': 105715, 'loss/train': 1.1464537382125854} 11/07/2021 12:07:33 - INFO - __main__ - Step 105717: {'lr': 0.0001025561163245389, 'samples': 20297664, 'steps': 105716, 'loss/train': 1.6800117492675781} 11/07/2021 12:07:33 - INFO - __main__ - Step 105718: {'lr': 0.00010255183080496474, 'samples': 20297856, 'steps': 105717, 'loss/train': 1.361243486404419} 11/07/2021 12:07:33 - INFO - __main__ - Step 105719: {'lr': 0.00010254754535182848, 'samples': 20298048, 'steps': 105718, 'loss/train': 1.2919870615005493} 11/07/2021 12:07:34 - INFO - __main__ - Step 105720: {'lr': 0.000102543259965132, 'samples': 20298240, 'steps': 105719, 'loss/train': 0.6002485752105713} 11/07/2021 12:07:34 - INFO - __main__ - Step 105721: {'lr': 0.00010253897464487735, 'samples': 20298432, 'steps': 105720, 'loss/train': 1.4805138111114502} 11/07/2021 12:07:35 - INFO - __main__ - Step 105722: {'lr': 0.00010253468939106628, 'samples': 20298624, 'steps': 105721, 'loss/train': 1.4421236515045166} 11/07/2021 12:07:36 - INFO - __main__ - Step 105723: {'lr': 0.00010253040420370077, 'samples': 20298816, 'steps': 105722, 'loss/train': 1.4242634773254395} 11/07/2021 12:07:36 - INFO - __main__ - Step 105724: {'lr': 0.0001025261190827828, 'samples': 20299008, 'steps': 105723, 'loss/train': 1.3211793899536133} 11/07/2021 12:07:36 - INFO - __main__ - Step 105725: {'lr': 0.00010252183402831431, 'samples': 20299200, 'steps': 105724, 'loss/train': 1.3489842414855957} 11/07/2021 12:07:37 - INFO - __main__ - Step 105726: {'lr': 0.0001025175490402972, 'samples': 20299392, 'steps': 105725, 'loss/train': 1.1906063556671143} 11/07/2021 12:07:37 - INFO - __main__ - Step 105727: {'lr': 0.00010251326411873338, 'samples': 20299584, 'steps': 105726, 'loss/train': 1.5664016008377075} 11/07/2021 12:07:38 - INFO - __main__ - Step 105728: {'lr': 0.00010250897926362482, 'samples': 20299776, 'steps': 105727, 'loss/train': 0.07800061255693436} 11/07/2021 12:07:38 - INFO - __main__ - Step 105729: {'lr': 0.00010250469447497345, 'samples': 20299968, 'steps': 105728, 'loss/train': 1.052217721939087} 11/07/2021 12:07:39 - INFO - __main__ - Step 105730: {'lr': 0.00010250040975278118, 'samples': 20300160, 'steps': 105729, 'loss/train': 1.5202805995941162} 11/07/2021 12:07:39 - INFO - __main__ - Step 105731: {'lr': 0.00010249612509704995, 'samples': 20300352, 'steps': 105730, 'loss/train': 0.967068612575531} 11/07/2021 12:07:39 - INFO - __main__ - Step 105732: {'lr': 0.00010249184050778168, 'samples': 20300544, 'steps': 105731, 'loss/train': 1.4447766542434692} 11/07/2021 12:07:41 - INFO - __main__ - Step 105733: {'lr': 0.00010248755598497834, 'samples': 20300736, 'steps': 105732, 'loss/train': 1.531379222869873} 11/07/2021 12:07:41 - INFO - __main__ - Step 105734: {'lr': 0.00010248327152864179, 'samples': 20300928, 'steps': 105733, 'loss/train': 1.3378159999847412} 11/07/2021 12:07:41 - INFO - __main__ - Step 105735: {'lr': 0.00010247898713877413, 'samples': 20301120, 'steps': 105734, 'loss/train': 1.4467484951019287} 11/07/2021 12:07:42 - INFO - __main__ - Step 105736: {'lr': 0.00010247470281537704, 'samples': 20301312, 'steps': 105735, 'loss/train': 1.7716659307479858} 11/07/2021 12:07:42 - INFO - __main__ - Step 105737: {'lr': 0.00010247041855845257, 'samples': 20301504, 'steps': 105736, 'loss/train': 0.24729910492897034} 11/07/2021 12:07:43 - INFO - __main__ - Step 105738: {'lr': 0.00010246613436800268, 'samples': 20301696, 'steps': 105737, 'loss/train': 0.9493271112442017} 11/07/2021 12:07:44 - INFO - __main__ - Step 105739: {'lr': 0.00010246185024402927, 'samples': 20301888, 'steps': 105738, 'loss/train': 1.0605465173721313} 11/07/2021 12:07:44 - INFO - __main__ - Step 105740: {'lr': 0.00010245756618653426, 'samples': 20302080, 'steps': 105739, 'loss/train': 1.5132535696029663} 11/07/2021 12:07:44 - INFO - __main__ - Step 105741: {'lr': 0.00010245328219551961, 'samples': 20302272, 'steps': 105740, 'loss/train': 1.2510275840759277} 11/07/2021 12:07:45 - INFO - __main__ - Step 105742: {'lr': 0.00010244899827098721, 'samples': 20302464, 'steps': 105741, 'loss/train': 1.2569518089294434} 11/07/2021 12:07:45 - INFO - __main__ - Step 105743: {'lr': 0.00010244471441293904, 'samples': 20302656, 'steps': 105742, 'loss/train': 1.4124962091445923} 11/07/2021 12:07:46 - INFO - __main__ - Step 105744: {'lr': 0.00010244043062137698, 'samples': 20302848, 'steps': 105743, 'loss/train': 1.4791673421859741} 11/07/2021 12:07:46 - INFO - __main__ - Step 105745: {'lr': 0.00010243614689630302, 'samples': 20303040, 'steps': 105744, 'loss/train': 1.1116631031036377} 11/07/2021 12:07:47 - INFO - __main__ - Step 105746: {'lr': 0.00010243186323771903, 'samples': 20303232, 'steps': 105745, 'loss/train': 1.1797356605529785} 11/07/2021 12:07:47 - INFO - __main__ - Step 105747: {'lr': 0.00010242757964562696, 'samples': 20303424, 'steps': 105746, 'loss/train': 1.100906252861023} 11/07/2021 12:07:47 - INFO - __main__ - Step 105748: {'lr': 0.00010242329612002885, 'samples': 20303616, 'steps': 105747, 'loss/train': 1.760467529296875} 11/07/2021 12:07:48 - INFO - __main__ - Step 105749: {'lr': 0.00010241901266092644, 'samples': 20303808, 'steps': 105748, 'loss/train': 1.2864749431610107} 11/07/2021 12:07:49 - INFO - __main__ - Step 105750: {'lr': 0.00010241472926832171, 'samples': 20304000, 'steps': 105749, 'loss/train': 1.345544695854187} 11/07/2021 12:07:49 - INFO - __main__ - Step 105751: {'lr': 0.00010241044594221666, 'samples': 20304192, 'steps': 105750, 'loss/train': 1.2466678619384766} 11/07/2021 12:07:50 - INFO - __main__ - Step 105752: {'lr': 0.00010240616268261318, 'samples': 20304384, 'steps': 105751, 'loss/train': 1.2672441005706787} 11/07/2021 12:07:50 - INFO - __main__ - Step 105753: {'lr': 0.00010240187948951318, 'samples': 20304576, 'steps': 105752, 'loss/train': 1.14738929271698} 11/07/2021 12:07:51 - INFO - __main__ - Step 105754: {'lr': 0.00010239759636291864, 'samples': 20304768, 'steps': 105753, 'loss/train': 0.8529652953147888} 11/07/2021 12:07:51 - INFO - __main__ - Step 105755: {'lr': 0.00010239331330283147, 'samples': 20304960, 'steps': 105754, 'loss/train': 1.1619516611099243} 11/07/2021 12:07:52 - INFO - __main__ - Step 105756: {'lr': 0.00010238903030925359, 'samples': 20305152, 'steps': 105755, 'loss/train': 0.055113259702920914} 11/07/2021 12:07:52 - INFO - __main__ - Step 105757: {'lr': 0.00010238474738218696, 'samples': 20305344, 'steps': 105756, 'loss/train': 0.7104566097259521} 11/07/2021 12:07:52 - INFO - __main__ - Step 105758: {'lr': 0.00010238046452163343, 'samples': 20305536, 'steps': 105757, 'loss/train': 1.5868254899978638} 11/07/2021 12:07:53 - INFO - __main__ - Step 105759: {'lr': 0.00010237618172759502, 'samples': 20305728, 'steps': 105758, 'loss/train': 1.314861536026001} 11/07/2021 12:07:54 - INFO - __main__ - Step 105760: {'lr': 0.00010237189900007363, 'samples': 20305920, 'steps': 105759, 'loss/train': 1.3252842426300049} 11/07/2021 12:07:54 - INFO - __main__ - Step 105761: {'lr': 0.00010236761633907124, 'samples': 20306112, 'steps': 105760, 'loss/train': 1.0846025943756104} 11/07/2021 12:07:54 - INFO - __main__ - Step 105762: {'lr': 0.00010236333374458967, 'samples': 20306304, 'steps': 105761, 'loss/train': 1.011492133140564} 11/07/2021 12:07:55 - INFO - __main__ - Step 105763: {'lr': 0.00010235905121663089, 'samples': 20306496, 'steps': 105762, 'loss/train': 1.586780309677124} 11/07/2021 12:07:56 - INFO - __main__ - Step 105764: {'lr': 0.00010235476875519683, 'samples': 20306688, 'steps': 105763, 'loss/train': 1.0319819450378418} 11/07/2021 12:07:56 - INFO - __main__ - Step 105765: {'lr': 0.00010235048636028945, 'samples': 20306880, 'steps': 105764, 'loss/train': 1.3241872787475586} 11/07/2021 12:07:57 - INFO - __main__ - Step 105766: {'lr': 0.00010234620403191067, 'samples': 20307072, 'steps': 105765, 'loss/train': 1.3382501602172852} 11/07/2021 12:07:57 - INFO - __main__ - Step 105767: {'lr': 0.00010234192177006241, 'samples': 20307264, 'steps': 105766, 'loss/train': 1.3921090364456177} 11/07/2021 12:07:57 - INFO - __main__ - Step 105768: {'lr': 0.00010233763957474656, 'samples': 20307456, 'steps': 105767, 'loss/train': 1.498682975769043} 11/07/2021 12:07:58 - INFO - __main__ - Step 105769: {'lr': 0.00010233335744596514, 'samples': 20307648, 'steps': 105768, 'loss/train': 2.2820751667022705} 11/07/2021 12:07:59 - INFO - __main__ - Step 105770: {'lr': 0.00010232907538372002, 'samples': 20307840, 'steps': 105769, 'loss/train': 1.2579059600830078} 11/07/2021 12:07:59 - INFO - __main__ - Step 105771: {'lr': 0.00010232479338801312, 'samples': 20308032, 'steps': 105770, 'loss/train': 0.8955056071281433} 11/07/2021 12:07:59 - INFO - __main__ - Step 105772: {'lr': 0.00010232051145884641, 'samples': 20308224, 'steps': 105771, 'loss/train': 1.4583910703659058} 11/07/2021 12:08:00 - INFO - __main__ - Step 105773: {'lr': 0.00010231622959622181, 'samples': 20308416, 'steps': 105772, 'loss/train': 1.2351053953170776} 11/07/2021 12:08:00 - INFO - __main__ - Step 105774: {'lr': 0.00010231194780014119, 'samples': 20308608, 'steps': 105773, 'loss/train': 1.2799357175827026} 11/07/2021 12:08:01 - INFO - __main__ - Step 105775: {'lr': 0.00010230766607060665, 'samples': 20308800, 'steps': 105774, 'loss/train': 1.198398470878601} 11/07/2021 12:08:01 - INFO - __main__ - Step 105776: {'lr': 0.00010230338440761991, 'samples': 20308992, 'steps': 105775, 'loss/train': 1.5537965297698975} 11/07/2021 12:08:02 - INFO - __main__ - Step 105777: {'lr': 0.00010229910281118299, 'samples': 20309184, 'steps': 105776, 'loss/train': 1.3464974164962769} 11/07/2021 12:08:02 - INFO - __main__ - Step 105778: {'lr': 0.0001022948212812978, 'samples': 20309376, 'steps': 105777, 'loss/train': 1.413591980934143} 11/07/2021 12:08:02 - INFO - __main__ - Step 105779: {'lr': 0.0001022905398179663, 'samples': 20309568, 'steps': 105778, 'loss/train': 1.4486544132232666} 11/07/2021 12:08:04 - INFO - __main__ - Step 105780: {'lr': 0.00010228625842119039, 'samples': 20309760, 'steps': 105779, 'loss/train': 4.197091102600098} 11/07/2021 12:08:04 - INFO - __main__ - Step 105781: {'lr': 0.00010228197709097201, 'samples': 20309952, 'steps': 105780, 'loss/train': 1.0486321449279785} 11/07/2021 12:08:04 - INFO - __main__ - Step 105782: {'lr': 0.00010227769582731308, 'samples': 20310144, 'steps': 105781, 'loss/train': 1.5127613544464111} 11/07/2021 12:08:05 - INFO - __main__ - Step 105783: {'lr': 0.00010227341463021555, 'samples': 20310336, 'steps': 105782, 'loss/train': 1.5343399047851562} 11/07/2021 12:08:05 - INFO - __main__ - Step 105784: {'lr': 0.00010226913349968137, 'samples': 20310528, 'steps': 105783, 'loss/train': 0.8271974921226501} 11/07/2021 12:08:06 - INFO - __main__ - Step 105785: {'lr': 0.0001022648524357124, 'samples': 20310720, 'steps': 105784, 'loss/train': 1.065266728401184} 11/07/2021 12:08:06 - INFO - __main__ - Step 105786: {'lr': 0.00010226057143831064, 'samples': 20310912, 'steps': 105785, 'loss/train': 1.2800335884094238} 11/07/2021 12:08:07 - INFO - __main__ - Step 105787: {'lr': 0.00010225629050747796, 'samples': 20311104, 'steps': 105786, 'loss/train': 1.5458567142486572} 11/07/2021 12:08:07 - INFO - __main__ - Step 105788: {'lr': 0.00010225200964321644, 'samples': 20311296, 'steps': 105787, 'loss/train': 1.1556147336959839} 11/07/2021 12:08:07 - INFO - __main__ - Step 105789: {'lr': 0.00010224772884552774, 'samples': 20311488, 'steps': 105788, 'loss/train': 1.145427942276001} 11/07/2021 12:08:09 - INFO - __main__ - Step 105790: {'lr': 0.00010224344811441397, 'samples': 20311680, 'steps': 105789, 'loss/train': 1.1679595708847046} 11/07/2021 12:08:09 - INFO - __main__ - Step 105791: {'lr': 0.00010223916744987702, 'samples': 20311872, 'steps': 105790, 'loss/train': 0.9736291766166687} 11/07/2021 12:08:09 - INFO - __main__ - Step 105792: {'lr': 0.00010223488685191882, 'samples': 20312064, 'steps': 105791, 'loss/train': 1.332542061805725} 11/07/2021 12:08:10 - INFO - __main__ - Step 105793: {'lr': 0.00010223060632054129, 'samples': 20312256, 'steps': 105792, 'loss/train': 1.3444654941558838} 11/07/2021 12:08:10 - INFO - __main__ - Step 105794: {'lr': 0.00010222632585574638, 'samples': 20312448, 'steps': 105793, 'loss/train': 1.318865418434143} 11/07/2021 12:08:10 - INFO - __main__ - Step 105795: {'lr': 0.000102222045457536, 'samples': 20312640, 'steps': 105794, 'loss/train': 1.0033857822418213} 11/07/2021 12:08:11 - INFO - __main__ - Step 105796: {'lr': 0.0001022177651259121, 'samples': 20312832, 'steps': 105795, 'loss/train': 1.4028249979019165} 11/07/2021 12:08:12 - INFO - __main__ - Step 105797: {'lr': 0.00010221348486087659, 'samples': 20313024, 'steps': 105796, 'loss/train': 1.5332705974578857} 11/07/2021 12:08:12 - INFO - __main__ - Step 105798: {'lr': 0.00010220920466243138, 'samples': 20313216, 'steps': 105797, 'loss/train': 1.6705231666564941} 11/07/2021 12:08:13 - INFO - __main__ - Step 105799: {'lr': 0.00010220492453057845, 'samples': 20313408, 'steps': 105798, 'loss/train': 1.6467831134796143} 11/07/2021 12:08:13 - INFO - __main__ - Step 105800: {'lr': 0.00010220064446531968, 'samples': 20313600, 'steps': 105799, 'loss/train': 1.3805426359176636} 11/07/2021 12:08:14 - INFO - __main__ - Step 105801: {'lr': 0.00010219636446665703, 'samples': 20313792, 'steps': 105800, 'loss/train': 1.212359070777893} 11/07/2021 12:08:14 - INFO - __main__ - Step 105802: {'lr': 0.0001021920845345925, 'samples': 20313984, 'steps': 105801, 'loss/train': 1.2707452774047852} 11/07/2021 12:08:15 - INFO - __main__ - Step 105803: {'lr': 0.00010218780466912785, 'samples': 20314176, 'steps': 105802, 'loss/train': 1.5553326606750488} 11/07/2021 12:08:15 - INFO - __main__ - Step 105804: {'lr': 0.00010218352487026511, 'samples': 20314368, 'steps': 105803, 'loss/train': 0.8076202273368835} 11/07/2021 12:08:15 - INFO - __main__ - Step 105805: {'lr': 0.00010217924513800617, 'samples': 20314560, 'steps': 105804, 'loss/train': 1.0777294635772705} 11/07/2021 12:08:16 - INFO - __main__ - Step 105806: {'lr': 0.000102174965472353, 'samples': 20314752, 'steps': 105805, 'loss/train': 0.931740939617157} 11/07/2021 12:08:17 - INFO - __main__ - Step 105807: {'lr': 0.0001021706858733075, 'samples': 20314944, 'steps': 105806, 'loss/train': 1.4417657852172852} 11/07/2021 12:08:17 - INFO - __main__ - Step 105808: {'lr': 0.00010216640634087159, 'samples': 20315136, 'steps': 105807, 'loss/train': 1.561349868774414} 11/07/2021 12:08:18 - INFO - __main__ - Step 105809: {'lr': 0.00010216212687504725, 'samples': 20315328, 'steps': 105808, 'loss/train': 0.8886064887046814} 11/07/2021 12:08:18 - INFO - __main__ - Step 105810: {'lr': 0.00010215784747583634, 'samples': 20315520, 'steps': 105809, 'loss/train': 1.9425277709960938} 11/07/2021 12:08:19 - INFO - __main__ - Step 105811: {'lr': 0.00010215356814324084, 'samples': 20315712, 'steps': 105810, 'loss/train': 1.4022938013076782} 11/07/2021 12:08:19 - INFO - __main__ - Step 105812: {'lr': 0.00010214928887726266, 'samples': 20315904, 'steps': 105811, 'loss/train': 1.3671809434890747} 11/07/2021 12:08:20 - INFO - __main__ - Step 105813: {'lr': 0.00010214500967790375, 'samples': 20316096, 'steps': 105812, 'loss/train': 1.4751296043395996} 11/07/2021 12:08:20 - INFO - __main__ - Step 105814: {'lr': 0.00010214073054516598, 'samples': 20316288, 'steps': 105813, 'loss/train': 1.4759472608566284} 11/07/2021 12:08:20 - INFO - __main__ - Step 105815: {'lr': 0.00010213645147905142, 'samples': 20316480, 'steps': 105814, 'loss/train': 2.0975277423858643} 11/07/2021 12:08:21 - INFO - __main__ - Step 105816: {'lr': 0.0001021321724795618, 'samples': 20316672, 'steps': 105815, 'loss/train': 1.6257905960083008} 11/07/2021 12:08:22 - INFO - __main__ - Step 105817: {'lr': 0.00010212789354669916, 'samples': 20316864, 'steps': 105816, 'loss/train': 1.27703058719635} 11/07/2021 12:08:22 - INFO - __main__ - Step 105818: {'lr': 0.0001021236146804654, 'samples': 20317056, 'steps': 105817, 'loss/train': 1.1563360691070557} 11/07/2021 12:08:22 - INFO - __main__ - Step 105819: {'lr': 0.00010211933588086245, 'samples': 20317248, 'steps': 105818, 'loss/train': 1.1123604774475098} 11/07/2021 12:08:23 - INFO - __main__ - Step 105820: {'lr': 0.00010211505714789223, 'samples': 20317440, 'steps': 105819, 'loss/train': 1.4822075366973877} 11/07/2021 12:08:24 - INFO - __main__ - Step 105821: {'lr': 0.00010211077848155672, 'samples': 20317632, 'steps': 105820, 'loss/train': 1.422587513923645} 11/07/2021 12:08:24 - INFO - __main__ - Step 105822: {'lr': 0.0001021064998818578, 'samples': 20317824, 'steps': 105821, 'loss/train': 1.5798242092132568} 11/07/2021 12:08:24 - INFO - __main__ - Step 105823: {'lr': 0.0001021022213487974, 'samples': 20318016, 'steps': 105822, 'loss/train': 1.1728495359420776} 11/07/2021 12:08:25 - INFO - __main__ - Step 105824: {'lr': 0.00010209794288237745, 'samples': 20318208, 'steps': 105823, 'loss/train': 3.1273913383483887} 11/07/2021 12:08:25 - INFO - __main__ - Step 105825: {'lr': 0.00010209366448259991, 'samples': 20318400, 'steps': 105824, 'loss/train': 1.0942806005477905} 11/07/2021 12:08:25 - INFO - __main__ - Step 105826: {'lr': 0.00010208938614946667, 'samples': 20318592, 'steps': 105825, 'loss/train': 0.7564681768417358} 11/07/2021 12:08:26 - INFO - __main__ - Step 105827: {'lr': 0.00010208510788297965, 'samples': 20318784, 'steps': 105826, 'loss/train': 1.7159886360168457} 11/07/2021 12:08:27 - INFO - __main__ - Step 105828: {'lr': 0.0001020808296831408, 'samples': 20318976, 'steps': 105827, 'loss/train': 1.2655143737792969} 11/07/2021 12:08:27 - INFO - __main__ - Step 105829: {'lr': 0.00010207655154995216, 'samples': 20319168, 'steps': 105828, 'loss/train': 1.3409003019332886} 11/07/2021 12:08:27 - INFO - __main__ - Step 105830: {'lr': 0.00010207227348341545, 'samples': 20319360, 'steps': 105829, 'loss/train': 1.4969691038131714} 11/07/2021 12:08:28 - INFO - __main__ - Step 105831: {'lr': 0.00010206799548353268, 'samples': 20319552, 'steps': 105830, 'loss/train': 1.0106748342514038} 11/07/2021 12:08:29 - INFO - __main__ - Step 105832: {'lr': 0.0001020637175503058, 'samples': 20319744, 'steps': 105831, 'loss/train': 1.4502732753753662} 11/07/2021 12:08:30 - INFO - __main__ - Step 105833: {'lr': 0.0001020594396837367, 'samples': 20319936, 'steps': 105832, 'loss/train': 0.6791005730628967} 11/07/2021 12:08:30 - INFO - __main__ - Step 105834: {'lr': 0.00010205516188382736, 'samples': 20320128, 'steps': 105833, 'loss/train': 1.0916210412979126} 11/07/2021 12:08:30 - INFO - __main__ - Step 105835: {'lr': 0.00010205088415057967, 'samples': 20320320, 'steps': 105834, 'loss/train': 1.1654739379882812} 11/07/2021 12:08:31 - INFO - __main__ - Step 105836: {'lr': 0.00010204660648399558, 'samples': 20320512, 'steps': 105835, 'loss/train': 1.429628610610962} 11/07/2021 12:08:32 - INFO - __main__ - Step 105837: {'lr': 0.00010204232888407699, 'samples': 20320704, 'steps': 105836, 'loss/train': 0.985609769821167} 11/07/2021 12:08:32 - INFO - __main__ - Step 105838: {'lr': 0.00010203805135082586, 'samples': 20320896, 'steps': 105837, 'loss/train': 1.512082576751709} 11/07/2021 12:08:32 - INFO - __main__ - Step 105839: {'lr': 0.00010203377388424409, 'samples': 20321088, 'steps': 105838, 'loss/train': 1.332951545715332} 11/07/2021 12:08:33 - INFO - __main__ - Step 105840: {'lr': 0.00010202949648433363, 'samples': 20321280, 'steps': 105839, 'loss/train': 1.1129547357559204} 11/07/2021 12:08:33 - INFO - __main__ - Step 105841: {'lr': 0.00010202521915109639, 'samples': 20321472, 'steps': 105840, 'loss/train': 1.4545576572418213} 11/07/2021 12:08:34 - INFO - __main__ - Step 105842: {'lr': 0.00010202094188453437, 'samples': 20321664, 'steps': 105841, 'loss/train': 1.4068865776062012} 11/07/2021 12:08:35 - INFO - __main__ - Step 105843: {'lr': 0.00010201666468464937, 'samples': 20321856, 'steps': 105842, 'loss/train': 1.0901504755020142} 11/07/2021 12:08:35 - INFO - __main__ - Step 105844: {'lr': 0.00010201238755144337, 'samples': 20322048, 'steps': 105843, 'loss/train': 1.4156898260116577} 11/07/2021 12:08:35 - INFO - __main__ - Step 105845: {'lr': 0.00010200811048491828, 'samples': 20322240, 'steps': 105844, 'loss/train': 1.3481626510620117} 11/07/2021 12:08:36 - INFO - __main__ - Step 105846: {'lr': 0.00010200383348507607, 'samples': 20322432, 'steps': 105845, 'loss/train': 1.3735212087631226} 11/07/2021 12:08:37 - INFO - __main__ - Step 105847: {'lr': 0.00010199955655191867, 'samples': 20322624, 'steps': 105846, 'loss/train': 1.4197207689285278} 11/07/2021 12:08:37 - INFO - __main__ - Step 105848: {'lr': 0.00010199527968544797, 'samples': 20322816, 'steps': 105847, 'loss/train': 1.4864683151245117} 11/07/2021 12:08:38 - INFO - __main__ - Step 105849: {'lr': 0.0001019910028856659, 'samples': 20323008, 'steps': 105848, 'loss/train': 1.1744974851608276} 11/07/2021 12:08:38 - INFO - __main__ - Step 105850: {'lr': 0.00010198672615257443, 'samples': 20323200, 'steps': 105849, 'loss/train': 1.36332106590271} 11/07/2021 12:08:38 - INFO - __main__ - Step 105851: {'lr': 0.00010198244948617544, 'samples': 20323392, 'steps': 105850, 'loss/train': 1.0481611490249634} 11/07/2021 12:08:40 - INFO - __main__ - Step 105852: {'lr': 0.00010197817288647085, 'samples': 20323584, 'steps': 105851, 'loss/train': 1.3738207817077637} 11/07/2021 12:08:40 - INFO - __main__ - Step 105853: {'lr': 0.00010197389635346263, 'samples': 20323776, 'steps': 105852, 'loss/train': 2.044325828552246} 11/07/2021 12:08:41 - INFO - __main__ - Step 105854: {'lr': 0.0001019696198871527, 'samples': 20323968, 'steps': 105853, 'loss/train': 1.1495224237442017} 11/07/2021 12:08:41 - INFO - __main__ - Step 105855: {'lr': 0.00010196534348754296, 'samples': 20324160, 'steps': 105854, 'loss/train': 0.6140874624252319} 11/07/2021 12:08:42 - INFO - __main__ - Step 105856: {'lr': 0.00010196106715463546, 'samples': 20324352, 'steps': 105855, 'loss/train': 0.7351579070091248} 11/07/2021 12:08:42 - INFO - __main__ - Step 105857: {'lr': 0.0001019567908884319, 'samples': 20324544, 'steps': 105856, 'loss/train': 1.2897790670394897} 11/07/2021 12:08:42 - INFO - __main__ - Step 105858: {'lr': 0.00010195251468893435, 'samples': 20324736, 'steps': 105857, 'loss/train': 1.3912986516952515} 11/07/2021 12:08:43 - INFO - __main__ - Step 105859: {'lr': 0.00010194823855614471, 'samples': 20324928, 'steps': 105858, 'loss/train': 1.502103328704834} 11/07/2021 12:08:44 - INFO - __main__ - Step 105860: {'lr': 0.00010194396249006491, 'samples': 20325120, 'steps': 105859, 'loss/train': 1.2539705038070679} 11/07/2021 12:08:44 - INFO - __main__ - Step 105861: {'lr': 0.00010193968649069688, 'samples': 20325312, 'steps': 105860, 'loss/train': 1.5670719146728516} 11/07/2021 12:08:44 - INFO - __main__ - Step 105862: {'lr': 0.00010193541055804254, 'samples': 20325504, 'steps': 105861, 'loss/train': 1.0414844751358032} 11/07/2021 12:08:45 - INFO - __main__ - Step 105863: {'lr': 0.0001019311346921038, 'samples': 20325696, 'steps': 105862, 'loss/train': 1.6873475313186646} 11/07/2021 12:08:45 - INFO - __main__ - Step 105864: {'lr': 0.00010192685889288261, 'samples': 20325888, 'steps': 105863, 'loss/train': 1.4440747499465942} 11/07/2021 12:08:46 - INFO - __main__ - Step 105865: {'lr': 0.00010192258316038091, 'samples': 20326080, 'steps': 105864, 'loss/train': 0.7586330771446228} 11/07/2021 12:08:46 - INFO - __main__ - Step 105866: {'lr': 0.00010191830749460059, 'samples': 20326272, 'steps': 105865, 'loss/train': 1.5164260864257812} 11/07/2021 12:08:47 - INFO - __main__ - Step 105867: {'lr': 0.00010191403189554361, 'samples': 20326464, 'steps': 105866, 'loss/train': 0.8721860647201538} 11/07/2021 12:08:47 - INFO - __main__ - Step 105868: {'lr': 0.00010190975636321187, 'samples': 20326656, 'steps': 105867, 'loss/train': 1.1660031080245972} 11/07/2021 12:08:48 - INFO - __main__ - Step 105869: {'lr': 0.00010190548089760743, 'samples': 20326848, 'steps': 105868, 'loss/train': 1.220047950744629} 11/07/2021 12:08:48 - INFO - __main__ - Step 105870: {'lr': 0.00010190120549873198, 'samples': 20327040, 'steps': 105869, 'loss/train': 1.2691189050674438} 11/07/2021 12:08:49 - INFO - __main__ - Step 105871: {'lr': 0.00010189693016658755, 'samples': 20327232, 'steps': 105870, 'loss/train': 1.3235019445419312} 11/07/2021 12:08:49 - INFO - __main__ - Step 105872: {'lr': 0.00010189265490117607, 'samples': 20327424, 'steps': 105871, 'loss/train': 1.3636276721954346} 11/07/2021 12:08:50 - INFO - __main__ - Step 105873: {'lr': 0.0001018883797024995, 'samples': 20327616, 'steps': 105872, 'loss/train': 1.3590353727340698} 11/07/2021 12:08:50 - INFO - __main__ - Step 105874: {'lr': 0.00010188410457055975, 'samples': 20327808, 'steps': 105873, 'loss/train': 0.6866306662559509} 11/07/2021 12:08:51 - INFO - __main__ - Step 105875: {'lr': 0.00010187982950535873, 'samples': 20328000, 'steps': 105874, 'loss/train': 0.5100377798080444} 11/07/2021 12:08:51 - INFO - __main__ - Step 105876: {'lr': 0.00010187555450689836, 'samples': 20328192, 'steps': 105875, 'loss/train': 1.3001506328582764} 11/07/2021 12:08:52 - INFO - __main__ - Step 105877: {'lr': 0.00010187127957518058, 'samples': 20328384, 'steps': 105876, 'loss/train': 1.6972612142562866} 11/07/2021 12:08:52 - INFO - __main__ - Step 105878: {'lr': 0.00010186700471020733, 'samples': 20328576, 'steps': 105877, 'loss/train': 2.084892511367798} 11/07/2021 12:08:52 - INFO - __main__ - Step 105879: {'lr': 0.0001018627299119805, 'samples': 20328768, 'steps': 105878, 'loss/train': 1.5476332902908325} 11/07/2021 12:08:53 - INFO - __main__ - Step 105880: {'lr': 0.00010185845518050216, 'samples': 20328960, 'steps': 105879, 'loss/train': 1.249822735786438} 11/07/2021 12:08:54 - INFO - __main__ - Step 105881: {'lr': 0.00010185418051577399, 'samples': 20329152, 'steps': 105880, 'loss/train': 0.860331118106842} 11/07/2021 12:08:54 - INFO - __main__ - Step 105882: {'lr': 0.00010184990591779805, 'samples': 20329344, 'steps': 105881, 'loss/train': 1.080199122428894} 11/07/2021 12:08:55 - INFO - __main__ - Step 105883: {'lr': 0.00010184563138657627, 'samples': 20329536, 'steps': 105882, 'loss/train': 1.379610538482666} 11/07/2021 12:08:55 - INFO - __main__ - Step 105884: {'lr': 0.00010184135692211055, 'samples': 20329728, 'steps': 105883, 'loss/train': 0.953360378742218} 11/07/2021 12:08:55 - INFO - __main__ - Step 105885: {'lr': 0.00010183708252440282, 'samples': 20329920, 'steps': 105884, 'loss/train': 0.5537049174308777} 11/07/2021 12:08:56 - INFO - __main__ - Step 105886: {'lr': 0.00010183280819345503, 'samples': 20330112, 'steps': 105885, 'loss/train': 1.2041425704956055} 11/07/2021 12:08:57 - INFO - __main__ - Step 105887: {'lr': 0.00010182853392926909, 'samples': 20330304, 'steps': 105886, 'loss/train': 1.2371985912322998} 11/07/2021 12:08:57 - INFO - __main__ - Step 105888: {'lr': 0.00010182425973184692, 'samples': 20330496, 'steps': 105887, 'loss/train': 1.537117838859558} 11/07/2021 12:08:57 - INFO - __main__ - Step 105889: {'lr': 0.00010181998560119046, 'samples': 20330688, 'steps': 105888, 'loss/train': 1.3120719194412231} 11/07/2021 12:08:58 - INFO - __main__ - Step 105890: {'lr': 0.00010181571153730163, 'samples': 20330880, 'steps': 105889, 'loss/train': 1.5681203603744507} 11/07/2021 12:08:59 - INFO - __main__ - Step 105891: {'lr': 0.00010181143754018243, 'samples': 20331072, 'steps': 105890, 'loss/train': 1.130089282989502} 11/07/2021 12:08:59 - INFO - __main__ - Step 105892: {'lr': 0.00010180716360983463, 'samples': 20331264, 'steps': 105891, 'loss/train': 0.9365535974502563} 11/07/2021 12:08:59 - INFO - __main__ - Step 105893: {'lr': 0.00010180288974626023, 'samples': 20331456, 'steps': 105892, 'loss/train': 1.4721333980560303} 11/07/2021 12:09:00 - INFO - __main__ - Step 105894: {'lr': 0.00010179861594946116, 'samples': 20331648, 'steps': 105893, 'loss/train': 1.732568621635437} 11/07/2021 12:09:00 - INFO - __main__ - Step 105895: {'lr': 0.00010179434221943935, 'samples': 20331840, 'steps': 105894, 'loss/train': 1.3183519840240479} 11/07/2021 12:09:01 - INFO - __main__ - Step 105896: {'lr': 0.00010179006855619672, 'samples': 20332032, 'steps': 105895, 'loss/train': 0.83222895860672} 11/07/2021 12:09:02 - INFO - __main__ - Step 105897: {'lr': 0.0001017857949597352, 'samples': 20332224, 'steps': 105896, 'loss/train': 1.1552621126174927} 11/07/2021 12:09:02 - INFO - __main__ - Step 105898: {'lr': 0.0001017815214300567, 'samples': 20332416, 'steps': 105897, 'loss/train': 1.000456690788269} 11/07/2021 12:09:02 - INFO - __main__ - Step 105899: {'lr': 0.00010177724796716317, 'samples': 20332608, 'steps': 105898, 'loss/train': 2.78682804107666} 11/07/2021 12:09:03 - INFO - __main__ - Step 105900: {'lr': 0.00010177297457105656, 'samples': 20332800, 'steps': 105899, 'loss/train': 1.3914852142333984} 11/07/2021 12:09:04 - INFO - __main__ - Step 105901: {'lr': 0.00010176870124173878, 'samples': 20332992, 'steps': 105900, 'loss/train': 1.8365306854248047} 11/07/2021 12:09:04 - INFO - __main__ - Step 105902: {'lr': 0.0001017644279792117, 'samples': 20333184, 'steps': 105901, 'loss/train': 1.2272613048553467} 11/07/2021 12:09:04 - INFO - __main__ - Step 105903: {'lr': 0.00010176015478347728, 'samples': 20333376, 'steps': 105902, 'loss/train': 1.1453508138656616} 11/07/2021 12:09:05 - INFO - __main__ - Step 105904: {'lr': 0.00010175588165453741, 'samples': 20333568, 'steps': 105903, 'loss/train': 1.1500743627548218} 11/07/2021 12:09:05 - INFO - __main__ - Step 105905: {'lr': 0.00010175160859239407, 'samples': 20333760, 'steps': 105904, 'loss/train': 0.9259235858917236} 11/07/2021 12:09:06 - INFO - __main__ - Step 105906: {'lr': 0.00010174733559704919, 'samples': 20333952, 'steps': 105905, 'loss/train': 1.5276838541030884} 11/07/2021 12:09:06 - INFO - __main__ - Step 105907: {'lr': 0.00010174306266850464, 'samples': 20334144, 'steps': 105906, 'loss/train': 1.5215234756469727} 11/07/2021 12:09:07 - INFO - __main__ - Step 105908: {'lr': 0.00010173878980676241, 'samples': 20334336, 'steps': 105907, 'loss/train': 0.9085949659347534} 11/07/2021 12:09:07 - INFO - __main__ - Step 105909: {'lr': 0.00010173451701182437, 'samples': 20334528, 'steps': 105908, 'loss/train': 1.2885444164276123} 11/07/2021 12:09:07 - INFO - __main__ - Step 105910: {'lr': 0.00010173024428369245, 'samples': 20334720, 'steps': 105909, 'loss/train': 1.2850793600082397} 11/07/2021 12:09:09 - INFO - __main__ - Step 105911: {'lr': 0.00010172597162236863, 'samples': 20334912, 'steps': 105910, 'loss/train': 1.48818838596344} 11/07/2021 12:09:09 - INFO - __main__ - Step 105912: {'lr': 0.00010172169902785488, 'samples': 20335104, 'steps': 105911, 'loss/train': 1.638323426246643} 11/07/2021 12:09:09 - INFO - __main__ - Step 105913: {'lr': 0.00010171742650015295, 'samples': 20335296, 'steps': 105912, 'loss/train': 1.6280124187469482} 11/07/2021 12:09:10 - INFO - __main__ - Step 105914: {'lr': 0.00010171315403926487, 'samples': 20335488, 'steps': 105913, 'loss/train': 1.5106719732284546} 11/07/2021 12:09:10 - INFO - __main__ - Step 105915: {'lr': 0.00010170888164519254, 'samples': 20335680, 'steps': 105914, 'loss/train': 1.2259453535079956} 11/07/2021 12:09:11 - INFO - __main__ - Step 105916: {'lr': 0.0001017046093179379, 'samples': 20335872, 'steps': 105915, 'loss/train': 1.320255994796753} 11/07/2021 12:09:11 - INFO - __main__ - Step 105917: {'lr': 0.0001017003370575029, 'samples': 20336064, 'steps': 105916, 'loss/train': 1.209667444229126} 11/07/2021 12:09:12 - INFO - __main__ - Step 105918: {'lr': 0.0001016960648638894, 'samples': 20336256, 'steps': 105917, 'loss/train': 1.2780213356018066} 11/07/2021 12:09:12 - INFO - __main__ - Step 105919: {'lr': 0.00010169179273709942, 'samples': 20336448, 'steps': 105918, 'loss/train': 1.9071487188339233} 11/07/2021 12:09:12 - INFO - __main__ - Step 105920: {'lr': 0.00010168752067713477, 'samples': 20336640, 'steps': 105919, 'loss/train': 1.0061920881271362} 11/07/2021 12:09:14 - INFO - __main__ - Step 105921: {'lr': 0.00010168324868399748, 'samples': 20336832, 'steps': 105920, 'loss/train': 1.5487806797027588} 11/07/2021 12:09:14 - INFO - __main__ - Step 105922: {'lr': 0.00010167897675768939, 'samples': 20337024, 'steps': 105921, 'loss/train': 1.3313709497451782} 11/07/2021 12:09:14 - INFO - __main__ - Step 105923: {'lr': 0.00010167470489821257, 'samples': 20337216, 'steps': 105922, 'loss/train': 1.2495988607406616} 11/07/2021 12:09:15 - INFO - __main__ - Step 105924: {'lr': 0.00010167043310556875, 'samples': 20337408, 'steps': 105923, 'loss/train': 1.4566208124160767} 11/07/2021 12:09:15 - INFO - __main__ - Step 105925: {'lr': 0.00010166616137975995, 'samples': 20337600, 'steps': 105924, 'loss/train': 1.378374695777893} 11/07/2021 12:09:16 - INFO - __main__ - Step 105926: {'lr': 0.00010166188972078811, 'samples': 20337792, 'steps': 105925, 'loss/train': 1.4703203439712524} 11/07/2021 12:09:16 - INFO - __main__ - Step 105927: {'lr': 0.00010165761812865509, 'samples': 20337984, 'steps': 105926, 'loss/train': 1.0587258338928223} 11/07/2021 12:09:17 - INFO - __main__ - Step 105928: {'lr': 0.00010165334660336287, 'samples': 20338176, 'steps': 105927, 'loss/train': 1.1391342878341675} 11/07/2021 12:09:17 - INFO - __main__ - Step 105929: {'lr': 0.00010164907514491337, 'samples': 20338368, 'steps': 105928, 'loss/train': 1.189100742340088} 11/07/2021 12:09:17 - INFO - __main__ - Step 105930: {'lr': 0.0001016448037533085, 'samples': 20338560, 'steps': 105929, 'loss/train': 1.3260457515716553} 11/07/2021 12:09:18 - INFO - __main__ - Step 105931: {'lr': 0.0001016405324285502, 'samples': 20338752, 'steps': 105930, 'loss/train': 1.0419381856918335} 11/07/2021 12:09:19 - INFO - __main__ - Step 105932: {'lr': 0.0001016362611706404, 'samples': 20338944, 'steps': 105931, 'loss/train': 1.2862792015075684} 11/07/2021 12:09:19 - INFO - __main__ - Step 105933: {'lr': 0.00010163198997958101, 'samples': 20339136, 'steps': 105932, 'loss/train': 1.322963833808899} 11/07/2021 12:09:20 - INFO - __main__ - Step 105934: {'lr': 0.00010162771885537392, 'samples': 20339328, 'steps': 105933, 'loss/train': 0.9305149912834167} 11/07/2021 12:09:20 - INFO - __main__ - Step 105935: {'lr': 0.00010162344779802113, 'samples': 20339520, 'steps': 105934, 'loss/train': 1.6553027629852295} 11/07/2021 12:09:20 - INFO - __main__ - Step 105936: {'lr': 0.00010161917680752458, 'samples': 20339712, 'steps': 105935, 'loss/train': 1.3893983364105225} 11/07/2021 12:09:21 - INFO - __main__ - Step 105937: {'lr': 0.00010161490588388609, 'samples': 20339904, 'steps': 105936, 'loss/train': 1.1700836420059204} 11/07/2021 12:09:22 - INFO - __main__ - Step 105938: {'lr': 0.0001016106350271076, 'samples': 20340096, 'steps': 105937, 'loss/train': 1.1480729579925537} 11/07/2021 12:09:22 - INFO - __main__ - Step 105939: {'lr': 0.00010160636423719108, 'samples': 20340288, 'steps': 105938, 'loss/train': 1.2884619235992432} 11/07/2021 12:09:22 - INFO - __main__ - Step 105940: {'lr': 0.00010160209351413843, 'samples': 20340480, 'steps': 105939, 'loss/train': 1.251705288887024} 11/07/2021 12:09:23 - INFO - __main__ - Step 105941: {'lr': 0.0001015978228579516, 'samples': 20340672, 'steps': 105940, 'loss/train': 1.2282034158706665} 11/07/2021 12:09:24 - INFO - __main__ - Step 105942: {'lr': 0.0001015935522686325, 'samples': 20340864, 'steps': 105941, 'loss/train': 1.3378044366836548} 11/07/2021 12:09:24 - INFO - __main__ - Step 105943: {'lr': 0.00010158928174618307, 'samples': 20341056, 'steps': 105942, 'loss/train': 1.514535903930664} 11/07/2021 12:09:24 - INFO - __main__ - Step 105944: {'lr': 0.00010158501129060521, 'samples': 20341248, 'steps': 105943, 'loss/train': 1.2121695280075073} 11/07/2021 12:09:25 - INFO - __main__ - Step 105945: {'lr': 0.00010158074090190084, 'samples': 20341440, 'steps': 105944, 'loss/train': 1.503862977027893} 11/07/2021 12:09:25 - INFO - __main__ - Step 105946: {'lr': 0.00010157647058007192, 'samples': 20341632, 'steps': 105945, 'loss/train': 1.176325798034668} 11/07/2021 12:09:26 - INFO - __main__ - Step 105947: {'lr': 0.00010157220032512033, 'samples': 20341824, 'steps': 105946, 'loss/train': 1.6308730840682983} 11/07/2021 12:09:27 - INFO - __main__ - Step 105948: {'lr': 0.00010156793013704802, 'samples': 20342016, 'steps': 105947, 'loss/train': 1.1308567523956299} 11/07/2021 12:09:27 - INFO - __main__ - Step 105949: {'lr': 0.0001015636600158569, 'samples': 20342208, 'steps': 105948, 'loss/train': 1.6202155351638794} 11/07/2021 12:09:27 - INFO - __main__ - Step 105950: {'lr': 0.00010155938996154904, 'samples': 20342400, 'steps': 105949, 'loss/train': 1.1088217496871948} 11/07/2021 12:09:28 - INFO - __main__ - Step 105951: {'lr': 0.00010155511997412609, 'samples': 20342592, 'steps': 105950, 'loss/train': 0.8021571040153503} 11/07/2021 12:09:29 - INFO - __main__ - Step 105952: {'lr': 0.00010155085005359013, 'samples': 20342784, 'steps': 105951, 'loss/train': 1.2563962936401367} 11/07/2021 12:09:29 - INFO - __main__ - Step 105953: {'lr': 0.00010154658019994306, 'samples': 20342976, 'steps': 105952, 'loss/train': 1.218091368675232} 11/07/2021 12:09:30 - INFO - __main__ - Step 105954: {'lr': 0.00010154231041318681, 'samples': 20343168, 'steps': 105953, 'loss/train': 1.4001494646072388} 11/07/2021 12:09:30 - INFO - __main__ - Step 105955: {'lr': 0.00010153804069332332, 'samples': 20343360, 'steps': 105954, 'loss/train': 1.1200474500656128} 11/07/2021 12:09:31 - INFO - __main__ - Step 105956: {'lr': 0.0001015337710403545, 'samples': 20343552, 'steps': 105955, 'loss/train': 1.8005386590957642} 11/07/2021 12:09:31 - INFO - __main__ - Step 105957: {'lr': 0.00010152950145428224, 'samples': 20343744, 'steps': 105956, 'loss/train': 1.298087477684021} 11/07/2021 12:09:32 - INFO - __main__ - Step 105958: {'lr': 0.00010152523193510852, 'samples': 20343936, 'steps': 105957, 'loss/train': 1.4536689519882202} 11/07/2021 12:09:32 - INFO - __main__ - Step 105959: {'lr': 0.00010152096248283524, 'samples': 20344128, 'steps': 105958, 'loss/train': 0.973820149898529} 11/07/2021 12:09:33 - INFO - __main__ - Step 105960: {'lr': 0.0001015166930974643, 'samples': 20344320, 'steps': 105959, 'loss/train': 1.8159568309783936} 11/07/2021 12:09:33 - INFO - __main__ - Step 105961: {'lr': 0.00010151242377899769, 'samples': 20344512, 'steps': 105960, 'loss/train': 1.1748615503311157} 11/07/2021 12:09:33 - INFO - __main__ - Step 105962: {'lr': 0.00010150815452743725, 'samples': 20344704, 'steps': 105961, 'loss/train': 0.9481344223022461} 11/07/2021 12:09:35 - INFO - __main__ - Step 105963: {'lr': 0.00010150388534278507, 'samples': 20344896, 'steps': 105962, 'loss/train': 1.166107177734375} 11/07/2021 12:09:35 - INFO - __main__ - Step 105964: {'lr': 0.00010149961622504283, 'samples': 20345088, 'steps': 105963, 'loss/train': 1.5864297151565552} 11/07/2021 12:09:35 - INFO - __main__ - Step 105965: {'lr': 0.00010149534717421257, 'samples': 20345280, 'steps': 105964, 'loss/train': 1.4385626316070557} 11/07/2021 12:09:36 - INFO - __main__ - Step 105966: {'lr': 0.00010149107819029623, 'samples': 20345472, 'steps': 105965, 'loss/train': 0.04468440264463425} 11/07/2021 12:09:36 - INFO - __main__ - Step 105967: {'lr': 0.0001014868092732957, 'samples': 20345664, 'steps': 105966, 'loss/train': 1.5519191026687622} 11/07/2021 12:09:37 - INFO - __main__ - Step 105968: {'lr': 0.00010148254042321295, 'samples': 20345856, 'steps': 105967, 'loss/train': 1.4420760869979858} 11/07/2021 12:09:37 - INFO - __main__ - Step 105969: {'lr': 0.00010147827164004986, 'samples': 20346048, 'steps': 105968, 'loss/train': 1.7802268266677856} 11/07/2021 12:09:38 - INFO - __main__ - Step 105970: {'lr': 0.00010147400292380837, 'samples': 20346240, 'steps': 105969, 'loss/train': 1.754907488822937} 11/07/2021 12:09:38 - INFO - __main__ - Step 105971: {'lr': 0.00010146973427449039, 'samples': 20346432, 'steps': 105970, 'loss/train': 0.8086148500442505} 11/07/2021 12:09:39 - INFO - __main__ - Step 105972: {'lr': 0.00010146546569209789, 'samples': 20346624, 'steps': 105971, 'loss/train': 0.731637179851532} 11/07/2021 12:09:40 - INFO - __main__ - Step 105973: {'lr': 0.00010146119717663271, 'samples': 20346816, 'steps': 105972, 'loss/train': 0.12082432210445404} 11/07/2021 12:09:40 - INFO - __main__ - Step 105974: {'lr': 0.00010145692872809687, 'samples': 20347008, 'steps': 105973, 'loss/train': 1.122592568397522} 11/07/2021 12:09:40 - INFO - __main__ - Step 105975: {'lr': 0.00010145266034649223, 'samples': 20347200, 'steps': 105974, 'loss/train': 0.9176387786865234} 11/07/2021 12:09:41 - INFO - __main__ - Step 105976: {'lr': 0.00010144839203182071, 'samples': 20347392, 'steps': 105975, 'loss/train': 1.4806787967681885} 11/07/2021 12:09:41 - INFO - __main__ - Step 105977: {'lr': 0.00010144412378408436, 'samples': 20347584, 'steps': 105976, 'loss/train': 1.011250376701355} 11/07/2021 12:09:41 - INFO - __main__ - Step 105978: {'lr': 0.00010143985560328489, 'samples': 20347776, 'steps': 105977, 'loss/train': 1.6514376401901245} 11/07/2021 12:09:42 - INFO - __main__ - Step 105979: {'lr': 0.00010143558748942433, 'samples': 20347968, 'steps': 105978, 'loss/train': 1.5853275060653687} 11/07/2021 12:09:43 - INFO - __main__ - Step 105980: {'lr': 0.00010143131944250463, 'samples': 20348160, 'steps': 105979, 'loss/train': 1.2075831890106201} 11/07/2021 12:09:43 - INFO - __main__ - Step 105981: {'lr': 0.00010142705146252764, 'samples': 20348352, 'steps': 105980, 'loss/train': 0.4572444558143616} 11/07/2021 12:09:43 - INFO - __main__ - Step 105982: {'lr': 0.00010142278354949538, 'samples': 20348544, 'steps': 105981, 'loss/train': 1.2626323699951172} 11/07/2021 12:09:44 - INFO - __main__ - Step 105983: {'lr': 0.00010141851570340967, 'samples': 20348736, 'steps': 105982, 'loss/train': 0.3277093172073364} 11/07/2021 12:09:45 - INFO - __main__ - Step 105984: {'lr': 0.00010141424792427253, 'samples': 20348928, 'steps': 105983, 'loss/train': 1.3231370449066162} 11/07/2021 12:09:45 - INFO - __main__ - Step 105985: {'lr': 0.0001014099802120858, 'samples': 20349120, 'steps': 105984, 'loss/train': 1.3648535013198853} 11/07/2021 12:09:46 - INFO - __main__ - Step 105986: {'lr': 0.00010140571256685147, 'samples': 20349312, 'steps': 105985, 'loss/train': 0.7157310843467712} 11/07/2021 12:09:46 - INFO - __main__ - Step 105987: {'lr': 0.00010140144498857142, 'samples': 20349504, 'steps': 105986, 'loss/train': 1.4822605848312378} 11/07/2021 12:09:46 - INFO - __main__ - Step 105988: {'lr': 0.00010139717747724758, 'samples': 20349696, 'steps': 105987, 'loss/train': 1.2525075674057007} 11/07/2021 12:09:47 - INFO - __main__ - Step 105989: {'lr': 0.00010139291003288189, 'samples': 20349888, 'steps': 105988, 'loss/train': 1.4310872554779053} 11/07/2021 12:09:48 - INFO - __main__ - Step 105990: {'lr': 0.00010138864265547635, 'samples': 20350080, 'steps': 105989, 'loss/train': 1.4678986072540283} 11/07/2021 12:09:48 - INFO - __main__ - Step 105991: {'lr': 0.0001013843753450327, 'samples': 20350272, 'steps': 105990, 'loss/train': 0.9256752729415894} 11/07/2021 12:09:48 - INFO - __main__ - Step 105992: {'lr': 0.00010138010810155296, 'samples': 20350464, 'steps': 105991, 'loss/train': 1.1573508977890015} 11/07/2021 12:09:49 - INFO - __main__ - Step 105993: {'lr': 0.00010137584092503905, 'samples': 20350656, 'steps': 105992, 'loss/train': 1.3786461353302002} 11/07/2021 12:09:50 - INFO - __main__ - Step 105994: {'lr': 0.00010137157381549289, 'samples': 20350848, 'steps': 105993, 'loss/train': 1.3898900747299194} 11/07/2021 12:09:50 - INFO - __main__ - Step 105995: {'lr': 0.00010136730677291639, 'samples': 20351040, 'steps': 105994, 'loss/train': 1.2567260265350342} 11/07/2021 12:09:50 - INFO - __main__ - Step 105996: {'lr': 0.00010136303979731151, 'samples': 20351232, 'steps': 105995, 'loss/train': 1.329642653465271} 11/07/2021 12:09:51 - INFO - __main__ - Step 105997: {'lr': 0.00010135877288868014, 'samples': 20351424, 'steps': 105996, 'loss/train': 1.6831454038619995} 11/07/2021 12:09:51 - INFO - __main__ - Step 105998: {'lr': 0.00010135450604702424, 'samples': 20351616, 'steps': 105997, 'loss/train': 1.3038170337677002} 11/07/2021 12:09:52 - INFO - __main__ - Step 105999: {'lr': 0.00010135023927234565, 'samples': 20351808, 'steps': 105998, 'loss/train': 0.7861827611923218} 11/07/2021 12:09:53 - INFO - __main__ - Step 106000: {'lr': 0.0001013459725646464, 'samples': 20352000, 'steps': 105999, 'loss/train': 1.2327505350112915} 11/07/2021 12:09:53 - INFO - __main__ - Step 106001: {'lr': 0.00010134170592392835, 'samples': 20352192, 'steps': 106000, 'loss/train': 1.3172284364700317} 11/07/2021 12:09:53 - INFO - __main__ - Step 106002: {'lr': 0.00010133743935019343, 'samples': 20352384, 'steps': 106001, 'loss/train': 1.8217920064926147} 11/07/2021 12:09:54 - INFO - __main__ - Step 106003: {'lr': 0.00010133317284344365, 'samples': 20352576, 'steps': 106002, 'loss/train': 1.3415366411209106} 11/07/2021 12:09:55 - INFO - __main__ - Step 106004: {'lr': 0.00010132890640368075, 'samples': 20352768, 'steps': 106003, 'loss/train': 1.4401901960372925} 11/07/2021 12:09:55 - INFO - __main__ - Step 106005: {'lr': 0.00010132464003090677, 'samples': 20352960, 'steps': 106004, 'loss/train': 1.1874464750289917} 11/07/2021 12:09:55 - INFO - __main__ - Step 106006: {'lr': 0.00010132037372512359, 'samples': 20353152, 'steps': 106005, 'loss/train': 1.6817961931228638} 11/07/2021 12:09:56 - INFO - __main__ - Step 106007: {'lr': 0.00010131610748633319, 'samples': 20353344, 'steps': 106006, 'loss/train': 1.0898407697677612} 11/07/2021 12:09:56 - INFO - __main__ - Step 106008: {'lr': 0.00010131184131453741, 'samples': 20353536, 'steps': 106007, 'loss/train': 1.289971947669983} 11/07/2021 12:09:57 - INFO - __main__ - Step 106009: {'lr': 0.00010130757520973826, 'samples': 20353728, 'steps': 106008, 'loss/train': 1.3249289989471436} 11/07/2021 12:09:58 - INFO - __main__ - Step 106010: {'lr': 0.00010130330917193762, 'samples': 20353920, 'steps': 106009, 'loss/train': 1.0469329357147217} 11/07/2021 12:09:58 - INFO - __main__ - Step 106011: {'lr': 0.00010129904320113739, 'samples': 20354112, 'steps': 106010, 'loss/train': 2.3218536376953125} 11/07/2021 12:09:58 - INFO - __main__ - Step 106012: {'lr': 0.00010129477729733951, 'samples': 20354304, 'steps': 106011, 'loss/train': 0.40888142585754395} 11/07/2021 12:09:59 - INFO - __main__ - Step 106013: {'lr': 0.00010129051146054594, 'samples': 20354496, 'steps': 106012, 'loss/train': 1.006661057472229} 11/07/2021 12:09:59 - INFO - __main__ - Step 106014: {'lr': 0.00010128624569075856, 'samples': 20354688, 'steps': 106013, 'loss/train': 1.1657304763793945} 11/07/2021 12:10:00 - INFO - __main__ - Step 106015: {'lr': 0.00010128197998797931, 'samples': 20354880, 'steps': 106014, 'loss/train': 1.2502378225326538} 11/07/2021 12:10:00 - INFO - __main__ - Step 106016: {'lr': 0.00010127771435221009, 'samples': 20355072, 'steps': 106015, 'loss/train': 1.6123942136764526} 11/07/2021 12:10:01 - INFO - __main__ - Step 106017: {'lr': 0.00010127344878345294, 'samples': 20355264, 'steps': 106016, 'loss/train': 0.8869097232818604} 11/07/2021 12:10:01 - INFO - __main__ - Step 106018: {'lr': 0.00010126918328170959, 'samples': 20355456, 'steps': 106017, 'loss/train': 1.4764161109924316} 11/07/2021 12:10:02 - INFO - __main__ - Step 106019: {'lr': 0.00010126491784698202, 'samples': 20355648, 'steps': 106018, 'loss/train': 1.3046215772628784} 11/07/2021 12:10:03 - INFO - __main__ - Step 106020: {'lr': 0.00010126065247927222, 'samples': 20355840, 'steps': 106019, 'loss/train': 2.0521490573883057} 11/07/2021 12:10:03 - INFO - __main__ - Step 106021: {'lr': 0.00010125638717858208, 'samples': 20356032, 'steps': 106020, 'loss/train': 1.8223464488983154} 11/07/2021 12:10:04 - INFO - __main__ - Step 106022: {'lr': 0.00010125212194491349, 'samples': 20356224, 'steps': 106021, 'loss/train': 0.9262349009513855} 11/07/2021 12:10:04 - INFO - __main__ - Step 106023: {'lr': 0.0001012478567782684, 'samples': 20356416, 'steps': 106022, 'loss/train': 0.7356613278388977} 11/07/2021 12:10:04 - INFO - __main__ - Step 106024: {'lr': 0.00010124359167864875, 'samples': 20356608, 'steps': 106023, 'loss/train': 1.6594256162643433} 11/07/2021 12:10:05 - INFO - __main__ - Step 106025: {'lr': 0.00010123932664605642, 'samples': 20356800, 'steps': 106024, 'loss/train': 1.402768850326538} 11/07/2021 12:10:06 - INFO - __main__ - Step 106026: {'lr': 0.00010123506168049334, 'samples': 20356992, 'steps': 106025, 'loss/train': 0.15449462831020355} 11/07/2021 12:10:06 - INFO - __main__ - Step 106027: {'lr': 0.00010123079678196149, 'samples': 20357184, 'steps': 106026, 'loss/train': 1.642093539237976} 11/07/2021 12:10:07 - INFO - __main__ - Step 106028: {'lr': 0.00010122653195046272, 'samples': 20357376, 'steps': 106027, 'loss/train': 0.7418267130851746} 11/07/2021 12:10:07 - INFO - __main__ - Step 106029: {'lr': 0.00010122226718599901, 'samples': 20357568, 'steps': 106028, 'loss/train': 0.5483262538909912} 11/07/2021 12:10:07 - INFO - __main__ - Step 106030: {'lr': 0.0001012180024885723, 'samples': 20357760, 'steps': 106029, 'loss/train': 1.0606938600540161} 11/07/2021 12:10:08 - INFO - __main__ - Step 106031: {'lr': 0.00010121373785818439, 'samples': 20357952, 'steps': 106030, 'loss/train': 1.2843387126922607} 11/07/2021 12:10:09 - INFO - __main__ - Step 106032: {'lr': 0.00010120947329483727, 'samples': 20358144, 'steps': 106031, 'loss/train': 0.9893138408660889} 11/07/2021 12:10:09 - INFO - __main__ - Step 106033: {'lr': 0.00010120520879853287, 'samples': 20358336, 'steps': 106032, 'loss/train': 2.1156952381134033} 11/07/2021 12:10:09 - INFO - __main__ - Step 106034: {'lr': 0.0001012009443692731, 'samples': 20358528, 'steps': 106033, 'loss/train': 1.0208821296691895} 11/07/2021 12:10:10 - INFO - __main__ - Step 106035: {'lr': 0.0001011966800070599, 'samples': 20358720, 'steps': 106034, 'loss/train': 1.5287131071090698} 11/07/2021 12:10:10 - INFO - __main__ - Step 106036: {'lr': 0.00010119241571189517, 'samples': 20358912, 'steps': 106035, 'loss/train': 0.9596022963523865} 11/07/2021 12:10:11 - INFO - __main__ - Step 106037: {'lr': 0.00010118815148378082, 'samples': 20359104, 'steps': 106036, 'loss/train': 1.3211767673492432} 11/07/2021 12:10:11 - INFO - __main__ - Step 106038: {'lr': 0.00010118388732271882, 'samples': 20359296, 'steps': 106037, 'loss/train': 1.178674340248108} 11/07/2021 12:10:12 - INFO - __main__ - Step 106039: {'lr': 0.00010117962322871107, 'samples': 20359488, 'steps': 106038, 'loss/train': 1.2066490650177002} 11/07/2021 12:10:12 - INFO - __main__ - Step 106040: {'lr': 0.00010117535920175946, 'samples': 20359680, 'steps': 106039, 'loss/train': 1.0469250679016113} 11/07/2021 12:10:12 - INFO - __main__ - Step 106041: {'lr': 0.00010117109524186596, 'samples': 20359872, 'steps': 106040, 'loss/train': 1.4127917289733887} 11/07/2021 12:10:14 - INFO - __main__ - Step 106042: {'lr': 0.00010116683134903246, 'samples': 20360064, 'steps': 106041, 'loss/train': 0.9833055138587952} 11/07/2021 12:10:14 - INFO - __main__ - Step 106043: {'lr': 0.0001011625675232609, 'samples': 20360256, 'steps': 106042, 'loss/train': 1.3946014642715454} 11/07/2021 12:10:14 - INFO - __main__ - Step 106044: {'lr': 0.00010115830376455326, 'samples': 20360448, 'steps': 106043, 'loss/train': 1.8544737100601196} 11/07/2021 12:10:15 - INFO - __main__ - Step 106045: {'lr': 0.00010115404007291131, 'samples': 20360640, 'steps': 106044, 'loss/train': 1.411908745765686} 11/07/2021 12:10:15 - INFO - __main__ - Step 106046: {'lr': 0.00010114977644833707, 'samples': 20360832, 'steps': 106045, 'loss/train': 4.914192199707031} 11/07/2021 12:10:16 - INFO - __main__ - Step 106047: {'lr': 0.0001011455128908324, 'samples': 20361024, 'steps': 106046, 'loss/train': 0.79730623960495} 11/07/2021 12:10:16 - INFO - __main__ - Step 106048: {'lr': 0.00010114124940039931, 'samples': 20361216, 'steps': 106047, 'loss/train': 1.4773688316345215} 11/07/2021 12:10:17 - INFO - __main__ - Step 106049: {'lr': 0.00010113698597703965, 'samples': 20361408, 'steps': 106048, 'loss/train': 3.1597883701324463} 11/07/2021 12:10:17 - INFO - __main__ - Step 106050: {'lr': 0.00010113272262075537, 'samples': 20361600, 'steps': 106049, 'loss/train': 1.082383394241333} 11/07/2021 12:10:17 - INFO - __main__ - Step 106051: {'lr': 0.00010112845933154841, 'samples': 20361792, 'steps': 106050, 'loss/train': 1.6031233072280884} 11/07/2021 12:10:18 - INFO - __main__ - Step 106052: {'lr': 0.00010112419610942064, 'samples': 20361984, 'steps': 106051, 'loss/train': 0.8830000758171082} 11/07/2021 12:10:19 - INFO - __main__ - Step 106053: {'lr': 0.00010111993295437403, 'samples': 20362176, 'steps': 106052, 'loss/train': 1.2806833982467651} 11/07/2021 12:10:19 - INFO - __main__ - Step 106054: {'lr': 0.00010111566986641047, 'samples': 20362368, 'steps': 106053, 'loss/train': 1.687402606010437} 11/07/2021 12:10:19 - INFO - __main__ - Step 106055: {'lr': 0.00010111140684553192, 'samples': 20362560, 'steps': 106054, 'loss/train': 1.659553050994873} 11/07/2021 12:10:20 - INFO - __main__ - Step 106056: {'lr': 0.00010110714389174022, 'samples': 20362752, 'steps': 106055, 'loss/train': 1.2777928113937378} 11/07/2021 12:10:21 - INFO - __main__ - Step 106057: {'lr': 0.00010110288100503747, 'samples': 20362944, 'steps': 106056, 'loss/train': 1.2348202466964722} 11/07/2021 12:10:21 - INFO - __main__ - Step 106058: {'lr': 0.00010109861818542538, 'samples': 20363136, 'steps': 106057, 'loss/train': 1.2031151056289673} 11/07/2021 12:10:22 - INFO - __main__ - Step 106059: {'lr': 0.00010109435543290593, 'samples': 20363328, 'steps': 106058, 'loss/train': 1.6603498458862305} 11/07/2021 12:10:22 - INFO - __main__ - Step 106060: {'lr': 0.00010109009274748108, 'samples': 20363520, 'steps': 106059, 'loss/train': 1.589762568473816} 11/07/2021 12:10:22 - INFO - __main__ - Step 106061: {'lr': 0.00010108583012915274, 'samples': 20363712, 'steps': 106060, 'loss/train': 1.6771682500839233} 11/07/2021 12:10:23 - INFO - __main__ - Step 106062: {'lr': 0.0001010815675779228, 'samples': 20363904, 'steps': 106061, 'loss/train': 1.5802503824234009} 11/07/2021 12:10:24 - INFO - __main__ - Step 106063: {'lr': 0.00010107730509379323, 'samples': 20364096, 'steps': 106062, 'loss/train': 1.6016173362731934} 11/07/2021 12:10:24 - INFO - __main__ - Step 106064: {'lr': 0.00010107304267676593, 'samples': 20364288, 'steps': 106063, 'loss/train': 1.7397949695587158} 11/07/2021 12:10:24 - INFO - __main__ - Step 106065: {'lr': 0.0001010687803268428, 'samples': 20364480, 'steps': 106064, 'loss/train': 1.766513705253601} 11/07/2021 12:10:25 - INFO - __main__ - Step 106066: {'lr': 0.0001010645180440258, 'samples': 20364672, 'steps': 106065, 'loss/train': 1.1614910364151} 11/07/2021 12:10:25 - INFO - __main__ - Step 106067: {'lr': 0.00010106025582831682, 'samples': 20364864, 'steps': 106066, 'loss/train': 1.5164341926574707} 11/07/2021 12:10:26 - INFO - __main__ - Step 106068: {'lr': 0.0001010559936797178, 'samples': 20365056, 'steps': 106067, 'loss/train': 1.546983242034912} 11/07/2021 12:10:27 - INFO - __main__ - Step 106069: {'lr': 0.00010105173159823064, 'samples': 20365248, 'steps': 106068, 'loss/train': 1.5900741815567017} 11/07/2021 12:10:27 - INFO - __main__ - Step 106070: {'lr': 0.00010104746958385727, 'samples': 20365440, 'steps': 106069, 'loss/train': 0.4990811049938202} 11/07/2021 12:10:27 - INFO - __main__ - Step 106071: {'lr': 0.00010104320763659971, 'samples': 20365632, 'steps': 106070, 'loss/train': 1.5838987827301025} 11/07/2021 12:10:28 - INFO - __main__ - Step 106072: {'lr': 0.0001010389457564597, 'samples': 20365824, 'steps': 106071, 'loss/train': 1.2605459690093994} 11/07/2021 12:10:29 - INFO - __main__ - Step 106073: {'lr': 0.00010103468394343923, 'samples': 20366016, 'steps': 106072, 'loss/train': 0.6001273393630981} 11/07/2021 12:10:29 - INFO - __main__ - Step 106074: {'lr': 0.00010103042219754025, 'samples': 20366208, 'steps': 106073, 'loss/train': 1.5777875185012817} 11/07/2021 12:10:30 - INFO - __main__ - Step 106075: {'lr': 0.00010102616051876465, 'samples': 20366400, 'steps': 106074, 'loss/train': 1.3593525886535645} 11/07/2021 12:10:30 - INFO - __main__ - Step 106076: {'lr': 0.00010102189890711436, 'samples': 20366592, 'steps': 106075, 'loss/train': 1.115018367767334} 11/07/2021 12:10:30 - INFO - __main__ - Step 106077: {'lr': 0.00010101763736259129, 'samples': 20366784, 'steps': 106076, 'loss/train': 1.4932814836502075} 11/07/2021 12:10:32 - INFO - __main__ - Step 106078: {'lr': 0.00010101337588519737, 'samples': 20366976, 'steps': 106077, 'loss/train': 1.6846777200698853} 11/07/2021 12:10:32 - INFO - __main__ - Step 106079: {'lr': 0.00010100911447493454, 'samples': 20367168, 'steps': 106078, 'loss/train': 1.0418161153793335} 11/07/2021 12:10:33 - INFO - __main__ - Step 106080: {'lr': 0.00010100485313180474, 'samples': 20367360, 'steps': 106079, 'loss/train': 0.8193515539169312} 11/07/2021 12:10:33 - INFO - __main__ - Step 106081: {'lr': 0.00010100059185580982, 'samples': 20367552, 'steps': 106080, 'loss/train': 0.38567012548446655} 11/07/2021 12:10:33 - INFO - __main__ - Step 106082: {'lr': 0.0001009963306469517, 'samples': 20367744, 'steps': 106081, 'loss/train': 0.9672616720199585} 11/07/2021 12:10:35 - INFO - __main__ - Step 106083: {'lr': 0.0001009920695052324, 'samples': 20367936, 'steps': 106082, 'loss/train': 1.311873197555542} 11/07/2021 12:10:35 - INFO - __main__ - Step 106084: {'lr': 0.00010098780843065383, 'samples': 20368128, 'steps': 106083, 'loss/train': 1.348230242729187} 11/07/2021 12:10:35 - INFO - __main__ - Step 106085: {'lr': 0.00010098354742321778, 'samples': 20368320, 'steps': 106084, 'loss/train': 1.2666618824005127} 11/07/2021 12:10:36 - INFO - __main__ - Step 106086: {'lr': 0.0001009792864829262, 'samples': 20368512, 'steps': 106085, 'loss/train': 1.4309583902359009} 11/07/2021 12:10:36 - INFO - __main__ - Step 106087: {'lr': 0.00010097502560978109, 'samples': 20368704, 'steps': 106086, 'loss/train': 1.38627028465271} 11/07/2021 12:10:36 - INFO - __main__ - Step 106088: {'lr': 0.00010097076480378434, 'samples': 20368896, 'steps': 106087, 'loss/train': 1.4466279745101929} 11/07/2021 12:10:38 - INFO - __main__ - Step 106089: {'lr': 0.00010096650406493784, 'samples': 20369088, 'steps': 106088, 'loss/train': 0.657318651676178} 11/07/2021 12:10:38 - INFO - __main__ - Step 106090: {'lr': 0.00010096224339324356, 'samples': 20369280, 'steps': 106089, 'loss/train': 1.134100079536438} 11/07/2021 12:10:39 - INFO - __main__ - Step 106091: {'lr': 0.00010095798278870338, 'samples': 20369472, 'steps': 106090, 'loss/train': 0.21147111058235168} 11/07/2021 12:10:39 - INFO - __main__ - Step 106092: {'lr': 0.00010095372225131924, 'samples': 20369664, 'steps': 106091, 'loss/train': 1.423964500427246} 11/07/2021 12:10:40 - INFO - __main__ - Step 106093: {'lr': 0.00010094946178109304, 'samples': 20369856, 'steps': 106092, 'loss/train': 1.1531659364700317} 11/07/2021 12:10:40 - INFO - __main__ - Step 106094: {'lr': 0.00010094520137802671, 'samples': 20370048, 'steps': 106093, 'loss/train': 1.5660775899887085} 11/07/2021 12:10:40 - INFO - __main__ - Step 106095: {'lr': 0.00010094094104212218, 'samples': 20370240, 'steps': 106094, 'loss/train': 1.4697341918945312} 11/07/2021 12:10:41 - INFO - __main__ - Step 106096: {'lr': 0.00010093668077338136, 'samples': 20370432, 'steps': 106095, 'loss/train': 1.5202305316925049} 11/07/2021 12:10:42 - INFO - __main__ - Step 106097: {'lr': 0.00010093242057180618, 'samples': 20370624, 'steps': 106096, 'loss/train': 0.8316763043403625} 11/07/2021 12:10:42 - INFO - __main__ - Step 106098: {'lr': 0.00010092816043739863, 'samples': 20370816, 'steps': 106097, 'loss/train': 1.0426157712936401} 11/07/2021 12:10:42 - INFO - __main__ - Step 106099: {'lr': 0.00010092390037016048, 'samples': 20371008, 'steps': 106098, 'loss/train': 1.528842806816101} 11/07/2021 12:10:43 - INFO - __main__ - Step 106100: {'lr': 0.00010091964037009369, 'samples': 20371200, 'steps': 106099, 'loss/train': 1.2665252685546875} 11/07/2021 12:10:44 - INFO - __main__ - Step 106101: {'lr': 0.00010091538043720022, 'samples': 20371392, 'steps': 106100, 'loss/train': 1.4561225175857544} 11/07/2021 12:10:44 - INFO - __main__ - Step 106102: {'lr': 0.000100911120571482, 'samples': 20371584, 'steps': 106101, 'loss/train': 1.321009874343872} 11/07/2021 12:10:44 - INFO - __main__ - Step 106103: {'lr': 0.0001009068607729409, 'samples': 20371776, 'steps': 106102, 'loss/train': 1.3950788974761963} 11/07/2021 12:10:45 - INFO - __main__ - Step 106104: {'lr': 0.00010090260104157888, 'samples': 20371968, 'steps': 106103, 'loss/train': 1.3776835203170776} 11/07/2021 12:10:45 - INFO - __main__ - Step 106105: {'lr': 0.00010089834137739783, 'samples': 20372160, 'steps': 106104, 'loss/train': 1.2228528261184692} 11/07/2021 12:10:46 - INFO - __main__ - Step 106106: {'lr': 0.00010089408178039971, 'samples': 20372352, 'steps': 106105, 'loss/train': 1.682555079460144} 11/07/2021 12:10:47 - INFO - __main__ - Step 106107: {'lr': 0.00010088982225058643, 'samples': 20372544, 'steps': 106106, 'loss/train': 1.215510368347168} 11/07/2021 12:10:47 - INFO - __main__ - Step 106108: {'lr': 0.00010088556278795988, 'samples': 20372736, 'steps': 106107, 'loss/train': 1.4911084175109863} 11/07/2021 12:10:47 - INFO - __main__ - Step 106109: {'lr': 0.000100881303392522, 'samples': 20372928, 'steps': 106108, 'loss/train': 1.5170437097549438} 11/07/2021 12:10:48 - INFO - __main__ - Step 106110: {'lr': 0.00010087704406427467, 'samples': 20373120, 'steps': 106109, 'loss/train': 1.5819047689437866} 11/07/2021 12:10:49 - INFO - __main__ - Step 106111: {'lr': 0.00010087278480321996, 'samples': 20373312, 'steps': 106110, 'loss/train': 0.8748830556869507} 11/07/2021 12:10:49 - INFO - __main__ - Step 106112: {'lr': 0.00010086852560935958, 'samples': 20373504, 'steps': 106111, 'loss/train': 1.2667959928512573} 11/07/2021 12:10:49 - INFO - __main__ - Step 106113: {'lr': 0.00010086426648269553, 'samples': 20373696, 'steps': 106112, 'loss/train': 1.3020312786102295} 11/07/2021 12:10:50 - INFO - __main__ - Step 106114: {'lr': 0.00010086000742322976, 'samples': 20373888, 'steps': 106113, 'loss/train': 1.7252708673477173} 11/07/2021 12:10:50 - INFO - __main__ - Step 106115: {'lr': 0.00010085574843096414, 'samples': 20374080, 'steps': 106114, 'loss/train': 1.669666051864624} 11/07/2021 12:10:51 - INFO - __main__ - Step 106116: {'lr': 0.00010085148950590064, 'samples': 20374272, 'steps': 106115, 'loss/train': 1.3053641319274902} 11/07/2021 12:10:51 - INFO - __main__ - Step 106117: {'lr': 0.00010084723064804116, 'samples': 20374464, 'steps': 106116, 'loss/train': 1.5410641431808472} 11/07/2021 12:10:52 - INFO - __main__ - Step 106118: {'lr': 0.00010084297185738761, 'samples': 20374656, 'steps': 106117, 'loss/train': 1.0059541463851929} 11/07/2021 12:10:52 - INFO - __main__ - Step 106119: {'lr': 0.00010083871313394191, 'samples': 20374848, 'steps': 106118, 'loss/train': 1.4201791286468506} 11/07/2021 12:10:52 - INFO - __main__ - Step 106120: {'lr': 0.000100834454477706, 'samples': 20375040, 'steps': 106119, 'loss/train': 1.6922001838684082} 11/07/2021 12:10:54 - INFO - __main__ - Step 106121: {'lr': 0.00010083019588868178, 'samples': 20375232, 'steps': 106120, 'loss/train': 1.4432789087295532} 11/07/2021 12:10:54 - INFO - __main__ - Step 106122: {'lr': 0.00010082593736687115, 'samples': 20375424, 'steps': 106121, 'loss/train': 1.2599177360534668} 11/07/2021 12:10:54 - INFO - __main__ - Step 106123: {'lr': 0.00010082167891227609, 'samples': 20375616, 'steps': 106122, 'loss/train': 0.8947304487228394} 11/07/2021 12:10:55 - INFO - __main__ - Step 106124: {'lr': 0.00010081742052489845, 'samples': 20375808, 'steps': 106123, 'loss/train': 1.171705722808838} 11/07/2021 12:10:55 - INFO - __main__ - Step 106125: {'lr': 0.00010081316220474027, 'samples': 20376000, 'steps': 106124, 'loss/train': 1.812322735786438} 11/07/2021 12:10:56 - INFO - __main__ - Step 106126: {'lr': 0.00010080890395180328, 'samples': 20376192, 'steps': 106125, 'loss/train': 1.7048453092575073} 11/07/2021 12:10:56 - INFO - __main__ - Step 106127: {'lr': 0.00010080464576608952, 'samples': 20376384, 'steps': 106126, 'loss/train': 1.2784723043441772} 11/07/2021 12:10:57 - INFO - __main__ - Step 106128: {'lr': 0.00010080038764760085, 'samples': 20376576, 'steps': 106127, 'loss/train': 1.131177306175232} 11/07/2021 12:10:57 - INFO - __main__ - Step 106129: {'lr': 0.00010079612959633926, 'samples': 20376768, 'steps': 106128, 'loss/train': 1.1779786348342896} 11/07/2021 12:10:57 - INFO - __main__ - Step 106130: {'lr': 0.0001007918716123066, 'samples': 20376960, 'steps': 106129, 'loss/train': 1.2557692527770996} 11/07/2021 12:10:58 - INFO - __main__ - Step 106131: {'lr': 0.00010078761369550485, 'samples': 20377152, 'steps': 106130, 'loss/train': 1.8126264810562134} 11/07/2021 12:10:59 - INFO - __main__ - Step 106132: {'lr': 0.0001007833558459359, 'samples': 20377344, 'steps': 106131, 'loss/train': 1.5299068689346313} 11/07/2021 12:10:59 - INFO - __main__ - Step 106133: {'lr': 0.00010077909806360164, 'samples': 20377536, 'steps': 106132, 'loss/train': 1.6819663047790527} 11/07/2021 12:11:00 - INFO - __main__ - Step 106134: {'lr': 0.00010077484034850403, 'samples': 20377728, 'steps': 106133, 'loss/train': 1.3830897808074951} 11/07/2021 12:11:00 - INFO - __main__ - Step 106135: {'lr': 0.00010077058270064496, 'samples': 20377920, 'steps': 106134, 'loss/train': 1.2294044494628906} 11/07/2021 12:11:01 - INFO - __main__ - Step 106136: {'lr': 0.00010076632512002636, 'samples': 20378112, 'steps': 106135, 'loss/train': 1.1718759536743164} 11/07/2021 12:11:01 - INFO - __main__ - Step 106137: {'lr': 0.00010076206760665019, 'samples': 20378304, 'steps': 106136, 'loss/train': 1.4192752838134766} 11/07/2021 12:11:02 - INFO - __main__ - Step 106138: {'lr': 0.00010075781016051838, 'samples': 20378496, 'steps': 106137, 'loss/train': 0.9775136709213257} 11/07/2021 12:11:02 - INFO - __main__ - Step 106139: {'lr': 0.00010075355278163273, 'samples': 20378688, 'steps': 106138, 'loss/train': 0.9809014797210693} 11/07/2021 12:11:02 - INFO - __main__ - Step 106140: {'lr': 0.00010074929546999523, 'samples': 20378880, 'steps': 106139, 'loss/train': 0.057632479816675186} 11/07/2021 12:11:03 - INFO - __main__ - Step 106141: {'lr': 0.00010074503822560776, 'samples': 20379072, 'steps': 106140, 'loss/train': 0.7139931321144104} 11/07/2021 12:11:04 - INFO - __main__ - Step 106142: {'lr': 0.00010074078104847232, 'samples': 20379264, 'steps': 106141, 'loss/train': 1.1021766662597656} 11/07/2021 12:11:05 - INFO - __main__ - Step 106143: {'lr': 0.00010073652393859076, 'samples': 20379456, 'steps': 106142, 'loss/train': 2.3058955669403076} 11/07/2021 12:11:05 - INFO - __main__ - Step 106144: {'lr': 0.00010073226689596502, 'samples': 20379648, 'steps': 106143, 'loss/train': 0.8319715857505798} 11/07/2021 12:11:05 - INFO - __main__ - Step 106145: {'lr': 0.00010072800992059699, 'samples': 20379840, 'steps': 106144, 'loss/train': 0.8071039319038391} 11/07/2021 12:11:06 - INFO - __main__ - Step 106146: {'lr': 0.00010072375301248864, 'samples': 20380032, 'steps': 106145, 'loss/train': 1.4470703601837158} 11/07/2021 12:11:07 - INFO - __main__ - Step 106147: {'lr': 0.00010071949617164185, 'samples': 20380224, 'steps': 106146, 'loss/train': 0.14044396579265594} 11/07/2021 12:11:07 - INFO - __main__ - Step 106148: {'lr': 0.00010071523939805868, 'samples': 20380416, 'steps': 106147, 'loss/train': 1.1031724214553833} 11/07/2021 12:11:07 - INFO - __main__ - Step 106149: {'lr': 0.00010071098269174078, 'samples': 20380608, 'steps': 106148, 'loss/train': 0.8661872148513794} 11/07/2021 12:11:08 - INFO - __main__ - Step 106150: {'lr': 0.00010070672605269024, 'samples': 20380800, 'steps': 106149, 'loss/train': 1.3139729499816895} 11/07/2021 12:11:08 - INFO - __main__ - Step 106151: {'lr': 0.00010070246948090894, 'samples': 20380992, 'steps': 106150, 'loss/train': 1.1242281198501587} 11/07/2021 12:11:08 - INFO - __main__ - Step 106152: {'lr': 0.00010069821297639881, 'samples': 20381184, 'steps': 106151, 'loss/train': 1.4370640516281128} 11/07/2021 12:11:09 - INFO - __main__ - Step 106153: {'lr': 0.00010069395653916174, 'samples': 20381376, 'steps': 106152, 'loss/train': 1.5267654657363892} 11/07/2021 12:11:10 - INFO - __main__ - Step 106154: {'lr': 0.00010068970016919968, 'samples': 20381568, 'steps': 106153, 'loss/train': 1.659874439239502} 11/07/2021 12:11:10 - INFO - __main__ - Step 106155: {'lr': 0.00010068544386651454, 'samples': 20381760, 'steps': 106154, 'loss/train': 1.5797795057296753} 11/07/2021 12:11:11 - INFO - __main__ - Step 106156: {'lr': 0.00010068118763110824, 'samples': 20381952, 'steps': 106155, 'loss/train': 1.2715065479278564} 11/07/2021 12:11:11 - INFO - __main__ - Step 106157: {'lr': 0.00010067693146298268, 'samples': 20382144, 'steps': 106156, 'loss/train': 1.108883023262024} 11/07/2021 12:11:12 - INFO - __main__ - Step 106158: {'lr': 0.00010067267536213978, 'samples': 20382336, 'steps': 106157, 'loss/train': 1.5449353456497192} 11/07/2021 12:11:12 - INFO - __main__ - Step 106159: {'lr': 0.00010066841932858159, 'samples': 20382528, 'steps': 106158, 'loss/train': 1.4740303754806519} 11/07/2021 12:11:13 - INFO - __main__ - Step 106160: {'lr': 0.0001006641633623098, 'samples': 20382720, 'steps': 106159, 'loss/train': 1.0056453943252563} 11/07/2021 12:11:13 - INFO - __main__ - Step 106161: {'lr': 0.00010065990746332643, 'samples': 20382912, 'steps': 106160, 'loss/train': 1.2774198055267334} 11/07/2021 12:11:13 - INFO - __main__ - Step 106162: {'lr': 0.0001006556516316334, 'samples': 20383104, 'steps': 106161, 'loss/train': 1.52596914768219} 11/07/2021 12:11:15 - INFO - __main__ - Step 106163: {'lr': 0.00010065139586723263, 'samples': 20383296, 'steps': 106162, 'loss/train': 1.529809832572937} 11/07/2021 12:11:16 - INFO - __main__ - Step 106164: {'lr': 0.00010064714017012605, 'samples': 20383488, 'steps': 106163, 'loss/train': 0.7303493618965149} 11/07/2021 12:11:16 - INFO - __main__ - Step 106165: {'lr': 0.00010064288454031554, 'samples': 20383680, 'steps': 106164, 'loss/train': 0.4884365200996399} 11/07/2021 12:11:16 - INFO - __main__ - Step 106166: {'lr': 0.00010063862897780308, 'samples': 20383872, 'steps': 106165, 'loss/train': 0.6546041965484619} 11/07/2021 12:11:17 - INFO - __main__ - Step 106167: {'lr': 0.0001006343734825905, 'samples': 20384064, 'steps': 106166, 'loss/train': 0.6390520334243774} 11/07/2021 12:11:17 - INFO - __main__ - Step 106168: {'lr': 0.00010063011805467978, 'samples': 20384256, 'steps': 106167, 'loss/train': 1.2569777965545654} 11/07/2021 12:11:17 - INFO - __main__ - Step 106169: {'lr': 0.00010062586269407284, 'samples': 20384448, 'steps': 106168, 'loss/train': 1.4577795267105103} 11/07/2021 12:11:18 - INFO - __main__ - Step 106170: {'lr': 0.00010062160740077167, 'samples': 20384640, 'steps': 106169, 'loss/train': 1.4158117771148682} 11/07/2021 12:11:19 - INFO - __main__ - Step 106171: {'lr': 0.00010061735217477799, 'samples': 20384832, 'steps': 106170, 'loss/train': 2.0080349445343018} 11/07/2021 12:11:19 - INFO - __main__ - Step 106172: {'lr': 0.00010061309701609387, 'samples': 20385024, 'steps': 106171, 'loss/train': 1.3777964115142822} 11/07/2021 12:11:19 - INFO - __main__ - Step 106173: {'lr': 0.00010060884192472114, 'samples': 20385216, 'steps': 106172, 'loss/train': 1.5272417068481445} 11/07/2021 12:11:20 - INFO - __main__ - Step 106174: {'lr': 0.00010060458690066176, 'samples': 20385408, 'steps': 106173, 'loss/train': 1.0603532791137695} 11/07/2021 12:11:21 - INFO - __main__ - Step 106175: {'lr': 0.00010060033194391768, 'samples': 20385600, 'steps': 106174, 'loss/train': 1.2659269571304321} 11/07/2021 12:11:21 - INFO - __main__ - Step 106176: {'lr': 0.00010059607705449076, 'samples': 20385792, 'steps': 106175, 'loss/train': 1.4743019342422485} 11/07/2021 12:11:22 - INFO - __main__ - Step 106177: {'lr': 0.00010059182223238294, 'samples': 20385984, 'steps': 106176, 'loss/train': 1.3545424938201904} 11/07/2021 12:11:22 - INFO - __main__ - Step 106178: {'lr': 0.00010058756747759614, 'samples': 20386176, 'steps': 106177, 'loss/train': 0.6867586970329285} 11/07/2021 12:11:22 - INFO - __main__ - Step 106179: {'lr': 0.0001005833127901323, 'samples': 20386368, 'steps': 106178, 'loss/train': 1.4631078243255615} 11/07/2021 12:11:23 - INFO - __main__ - Step 106180: {'lr': 0.0001005790581699933, 'samples': 20386560, 'steps': 106179, 'loss/train': 1.2487783432006836} 11/07/2021 12:11:24 - INFO - __main__ - Step 106181: {'lr': 0.00010057480361718116, 'samples': 20386752, 'steps': 106180, 'loss/train': 0.9729073643684387} 11/07/2021 12:11:24 - INFO - __main__ - Step 106182: {'lr': 0.00010057054913169761, 'samples': 20386944, 'steps': 106181, 'loss/train': 1.0078784227371216} 11/07/2021 12:11:24 - INFO - __main__ - Step 106183: {'lr': 0.00010056629471354466, 'samples': 20387136, 'steps': 106182, 'loss/train': 1.364461898803711} 11/07/2021 12:11:25 - INFO - __main__ - Step 106184: {'lr': 0.00010056204036272426, 'samples': 20387328, 'steps': 106183, 'loss/train': 1.286117434501648} 11/07/2021 12:11:25 - INFO - __main__ - Step 106185: {'lr': 0.00010055778607923829, 'samples': 20387520, 'steps': 106184, 'loss/train': 1.1167562007904053} 11/07/2021 12:11:26 - INFO - __main__ - Step 106186: {'lr': 0.00010055353186308866, 'samples': 20387712, 'steps': 106185, 'loss/train': 1.061708688735962} 11/07/2021 12:11:26 - INFO - __main__ - Step 106187: {'lr': 0.00010054927771427733, 'samples': 20387904, 'steps': 106186, 'loss/train': 1.2086057662963867} 11/07/2021 12:11:27 - INFO - __main__ - Step 106188: {'lr': 0.00010054502363280615, 'samples': 20388096, 'steps': 106187, 'loss/train': 1.202980637550354} 11/07/2021 12:11:27 - INFO - __main__ - Step 106189: {'lr': 0.00010054076961867708, 'samples': 20388288, 'steps': 106188, 'loss/train': 1.3366341590881348} 11/07/2021 12:11:27 - INFO - __main__ - Step 106190: {'lr': 0.00010053651567189207, 'samples': 20388480, 'steps': 106189, 'loss/train': 0.7930883765220642} 11/07/2021 12:11:29 - INFO - __main__ - Step 106191: {'lr': 0.00010053226179245298, 'samples': 20388672, 'steps': 106190, 'loss/train': 1.485855221748352} 11/07/2021 12:11:29 - INFO - __main__ - Step 106192: {'lr': 0.00010052800798036182, 'samples': 20388864, 'steps': 106191, 'loss/train': 1.6950434446334839} 11/07/2021 12:11:29 - INFO - __main__ - Step 106193: {'lr': 0.00010052375423562038, 'samples': 20389056, 'steps': 106192, 'loss/train': 1.5211801528930664} 11/07/2021 12:11:30 - INFO - __main__ - Step 106194: {'lr': 0.00010051950055823058, 'samples': 20389248, 'steps': 106193, 'loss/train': 1.3170356750488281} 11/07/2021 12:11:30 - INFO - __main__ - Step 106195: {'lr': 0.00010051524694819442, 'samples': 20389440, 'steps': 106194, 'loss/train': 1.31413996219635} 11/07/2021 12:11:31 - INFO - __main__ - Step 106196: {'lr': 0.00010051099340551379, 'samples': 20389632, 'steps': 106195, 'loss/train': 0.8715009093284607} 11/07/2021 12:11:31 - INFO - __main__ - Step 106197: {'lr': 0.00010050673993019058, 'samples': 20389824, 'steps': 106196, 'loss/train': 1.2211523056030273} 11/07/2021 12:11:32 - INFO - __main__ - Step 106198: {'lr': 0.00010050248652222674, 'samples': 20390016, 'steps': 106197, 'loss/train': 1.2028027772903442} 11/07/2021 12:11:32 - INFO - __main__ - Step 106199: {'lr': 0.00010049823318162416, 'samples': 20390208, 'steps': 106198, 'loss/train': 1.4481488466262817} 11/07/2021 12:11:32 - INFO - __main__ - Step 106200: {'lr': 0.00010049397990838477, 'samples': 20390400, 'steps': 106199, 'loss/train': 1.1490000486373901} 11/07/2021 12:11:33 - INFO - __main__ - Step 106201: {'lr': 0.0001004897267025105, 'samples': 20390592, 'steps': 106200, 'loss/train': 1.3017690181732178} 11/07/2021 12:11:34 - INFO - __main__ - Step 106202: {'lr': 0.00010048547356400322, 'samples': 20390784, 'steps': 106201, 'loss/train': 1.6667355298995972} 11/07/2021 12:11:34 - INFO - __main__ - Step 106203: {'lr': 0.00010048122049286492, 'samples': 20390976, 'steps': 106202, 'loss/train': 1.388939380645752} 11/07/2021 12:11:35 - INFO - __main__ - Step 106204: {'lr': 0.00010047696748909745, 'samples': 20391168, 'steps': 106203, 'loss/train': 1.0950647592544556} 11/07/2021 12:11:35 - INFO - __main__ - Step 106205: {'lr': 0.00010047271455270285, 'samples': 20391360, 'steps': 106204, 'loss/train': 1.5453120470046997} 11/07/2021 12:11:35 - INFO - __main__ - Step 106206: {'lr': 0.00010046846168368285, 'samples': 20391552, 'steps': 106205, 'loss/train': 1.360041618347168} 11/07/2021 12:11:37 - INFO - __main__ - Step 106207: {'lr': 0.00010046420888203944, 'samples': 20391744, 'steps': 106206, 'loss/train': 1.170130968093872} 11/07/2021 12:11:37 - INFO - __main__ - Step 106208: {'lr': 0.00010045995614777456, 'samples': 20391936, 'steps': 106207, 'loss/train': 1.522104024887085} 11/07/2021 12:11:37 - INFO - __main__ - Step 106209: {'lr': 0.00010045570348089012, 'samples': 20392128, 'steps': 106208, 'loss/train': 1.095491886138916} 11/07/2021 12:11:38 - INFO - __main__ - Step 106210: {'lr': 0.00010045145088138802, 'samples': 20392320, 'steps': 106209, 'loss/train': 1.4404757022857666} 11/07/2021 12:11:38 - INFO - __main__ - Step 106211: {'lr': 0.00010044719834927019, 'samples': 20392512, 'steps': 106210, 'loss/train': 1.5588247776031494} 11/07/2021 12:11:39 - INFO - __main__ - Step 106212: {'lr': 0.00010044294588453857, 'samples': 20392704, 'steps': 106211, 'loss/train': 0.8487818241119385} 11/07/2021 12:11:39 - INFO - __main__ - Step 106213: {'lr': 0.000100438693487195, 'samples': 20392896, 'steps': 106212, 'loss/train': 1.5328294038772583} 11/07/2021 12:11:40 - INFO - __main__ - Step 106214: {'lr': 0.00010043444115724148, 'samples': 20393088, 'steps': 106213, 'loss/train': 1.1664808988571167} 11/07/2021 12:11:40 - INFO - __main__ - Step 106215: {'lr': 0.00010043018889467991, 'samples': 20393280, 'steps': 106214, 'loss/train': 1.1502082347869873} 11/07/2021 12:11:40 - INFO - __main__ - Step 106216: {'lr': 0.00010042593669951216, 'samples': 20393472, 'steps': 106215, 'loss/train': 1.3139395713806152} 11/07/2021 12:11:42 - INFO - __main__ - Step 106217: {'lr': 0.00010042168457174019, 'samples': 20393664, 'steps': 106216, 'loss/train': 1.1184334754943848} 11/07/2021 12:11:42 - INFO - __main__ - Step 106218: {'lr': 0.0001004174325113659, 'samples': 20393856, 'steps': 106217, 'loss/train': 1.525259017944336} 11/07/2021 12:11:42 - INFO - __main__ - Step 106219: {'lr': 0.0001004131805183913, 'samples': 20394048, 'steps': 106218, 'loss/train': 1.0418522357940674} 11/07/2021 12:11:43 - INFO - __main__ - Step 106220: {'lr': 0.00010040892859281811, 'samples': 20394240, 'steps': 106219, 'loss/train': 1.3019341230392456} 11/07/2021 12:11:43 - INFO - __main__ - Step 106221: {'lr': 0.00010040467673464834, 'samples': 20394432, 'steps': 106220, 'loss/train': 1.1281007528305054} 11/07/2021 12:11:44 - INFO - __main__ - Step 106222: {'lr': 0.00010040042494388393, 'samples': 20394624, 'steps': 106221, 'loss/train': 1.3102757930755615} 11/07/2021 12:11:44 - INFO - __main__ - Step 106223: {'lr': 0.00010039617322052677, 'samples': 20394816, 'steps': 106222, 'loss/train': 1.5020464658737183} 11/07/2021 12:11:45 - INFO - __main__ - Step 106224: {'lr': 0.0001003919215645788, 'samples': 20395008, 'steps': 106223, 'loss/train': 1.454315423965454} 11/07/2021 12:11:45 - INFO - __main__ - Step 106225: {'lr': 0.0001003876699760419, 'samples': 20395200, 'steps': 106224, 'loss/train': 0.7745739817619324} 11/07/2021 12:11:45 - INFO - __main__ - Step 106226: {'lr': 0.00010038341845491802, 'samples': 20395392, 'steps': 106225, 'loss/train': 1.4994094371795654} 11/07/2021 12:11:46 - INFO - __main__ - Step 106227: {'lr': 0.00010037916700120908, 'samples': 20395584, 'steps': 106226, 'loss/train': 1.1833051443099976} 11/07/2021 12:11:47 - INFO - __main__ - Step 106228: {'lr': 0.00010037491561491696, 'samples': 20395776, 'steps': 106227, 'loss/train': 1.3953856229782104} 11/07/2021 12:11:47 - INFO - __main__ - Step 106229: {'lr': 0.0001003706642960436, 'samples': 20395968, 'steps': 106228, 'loss/train': 0.5269531011581421} 11/07/2021 12:11:48 - INFO - __main__ - Step 106230: {'lr': 0.0001003664130445909, 'samples': 20396160, 'steps': 106229, 'loss/train': 0.7943627834320068} 11/07/2021 12:11:48 - INFO - __main__ - Step 106231: {'lr': 0.0001003621618605608, 'samples': 20396352, 'steps': 106230, 'loss/train': 1.402671456336975} 11/07/2021 12:11:49 - INFO - __main__ - Step 106232: {'lr': 0.00010035791074395528, 'samples': 20396544, 'steps': 106231, 'loss/train': 1.2817479372024536} 11/07/2021 12:11:49 - INFO - __main__ - Step 106233: {'lr': 0.00010035365969477608, 'samples': 20396736, 'steps': 106232, 'loss/train': 0.8994754552841187} 11/07/2021 12:11:50 - INFO - __main__ - Step 106234: {'lr': 0.00010034940871302523, 'samples': 20396928, 'steps': 106233, 'loss/train': 1.0478293895721436} 11/07/2021 12:11:50 - INFO - __main__ - Step 106235: {'lr': 0.00010034515779870462, 'samples': 20397120, 'steps': 106234, 'loss/train': 1.666935682296753} 11/07/2021 12:11:50 - INFO - __main__ - Step 106236: {'lr': 0.00010034090695181616, 'samples': 20397312, 'steps': 106235, 'loss/train': 0.7889676094055176} 11/07/2021 12:11:51 - INFO - __main__ - Step 106237: {'lr': 0.00010033665617236179, 'samples': 20397504, 'steps': 106236, 'loss/train': 1.2454681396484375} 11/07/2021 12:11:52 - INFO - __main__ - Step 106238: {'lr': 0.00010033240546034342, 'samples': 20397696, 'steps': 106237, 'loss/train': 1.252166986465454} 11/07/2021 12:11:52 - INFO - __main__ - Step 106239: {'lr': 0.00010032815481576296, 'samples': 20397888, 'steps': 106238, 'loss/train': 1.2561492919921875} 11/07/2021 12:11:53 - INFO - __main__ - Step 106240: {'lr': 0.00010032390423862231, 'samples': 20398080, 'steps': 106239, 'loss/train': 1.0283199548721313} 11/07/2021 12:11:53 - INFO - __main__ - Step 106241: {'lr': 0.00010031965372892341, 'samples': 20398272, 'steps': 106240, 'loss/train': 1.3018174171447754} 11/07/2021 12:11:53 - INFO - __main__ - Step 106242: {'lr': 0.00010031540328666816, 'samples': 20398464, 'steps': 106241, 'loss/train': 1.2410324811935425} 11/07/2021 12:11:54 - INFO - __main__ - Step 106243: {'lr': 0.00010031115291185846, 'samples': 20398656, 'steps': 106242, 'loss/train': 1.4529504776000977} 11/07/2021 12:11:55 - INFO - __main__ - Step 106244: {'lr': 0.00010030690260449627, 'samples': 20398848, 'steps': 106243, 'loss/train': 1.5179442167282104} 11/07/2021 12:11:55 - INFO - __main__ - Step 106245: {'lr': 0.00010030265236458347, 'samples': 20399040, 'steps': 106244, 'loss/train': 0.656610906124115} 11/07/2021 12:11:55 - INFO - __main__ - Step 106246: {'lr': 0.00010029840219212208, 'samples': 20399232, 'steps': 106245, 'loss/train': 1.3998979330062866} 11/07/2021 12:11:56 - INFO - __main__ - Step 106247: {'lr': 0.00010029415208711382, 'samples': 20399424, 'steps': 106246, 'loss/train': 1.4487810134887695} 11/07/2021 12:11:57 - INFO - __main__ - Step 106248: {'lr': 0.0001002899020495607, 'samples': 20399616, 'steps': 106247, 'loss/train': 1.1557215452194214} 11/07/2021 12:11:57 - INFO - __main__ - Step 106249: {'lr': 0.00010028565207946466, 'samples': 20399808, 'steps': 106248, 'loss/train': 1.365678071975708} 11/07/2021 12:11:57 - INFO - __main__ - Step 106250: {'lr': 0.0001002814021768276, 'samples': 20400000, 'steps': 106249, 'loss/train': 1.449090838432312} 11/07/2021 12:11:58 - INFO - __main__ - Step 106251: {'lr': 0.00010027715234165141, 'samples': 20400192, 'steps': 106250, 'loss/train': 1.0455963611602783} 11/07/2021 12:11:58 - INFO - __main__ - Step 106252: {'lr': 0.00010027290257393804, 'samples': 20400384, 'steps': 106251, 'loss/train': 0.9967241883277893} 11/07/2021 12:11:59 - INFO - __main__ - Step 106253: {'lr': 0.00010026865287368939, 'samples': 20400576, 'steps': 106252, 'loss/train': 1.4319478273391724} 11/07/2021 12:11:59 - INFO - __main__ - Step 106254: {'lr': 0.00010026440324090735, 'samples': 20400768, 'steps': 106253, 'loss/train': 1.555721640586853} 11/07/2021 12:12:00 - INFO - __main__ - Step 106255: {'lr': 0.00010026015367559388, 'samples': 20400960, 'steps': 106254, 'loss/train': 1.070121169090271} 11/07/2021 12:12:00 - INFO - __main__ - Step 106256: {'lr': 0.00010025590417775085, 'samples': 20401152, 'steps': 106255, 'loss/train': 1.6127605438232422} 11/07/2021 12:12:00 - INFO - __main__ - Step 106257: {'lr': 0.00010025165474738024, 'samples': 20401344, 'steps': 106256, 'loss/train': 1.4422093629837036} 11/07/2021 12:12:01 - INFO - __main__ - Step 106258: {'lr': 0.0001002474053844839, 'samples': 20401536, 'steps': 106257, 'loss/train': 1.206650733947754} 11/07/2021 12:12:02 - INFO - __main__ - Step 106259: {'lr': 0.00010024315608906384, 'samples': 20401728, 'steps': 106258, 'loss/train': 1.4854257106781006} 11/07/2021 12:12:02 - INFO - __main__ - Step 106260: {'lr': 0.00010023890686112183, 'samples': 20401920, 'steps': 106259, 'loss/train': 1.687363624572754} 11/07/2021 12:12:03 - INFO - __main__ - Step 106261: {'lr': 0.00010023465770065987, 'samples': 20402112, 'steps': 106260, 'loss/train': 1.683732032775879} 11/07/2021 12:12:03 - INFO - __main__ - Step 106262: {'lr': 0.00010023040860767984, 'samples': 20402304, 'steps': 106261, 'loss/train': 1.6337592601776123} 11/07/2021 12:12:04 - INFO - __main__ - Step 106263: {'lr': 0.0001002261595821837, 'samples': 20402496, 'steps': 106262, 'loss/train': 1.4718822240829468} 11/07/2021 12:12:04 - INFO - __main__ - Step 106264: {'lr': 0.00010022191062417332, 'samples': 20402688, 'steps': 106263, 'loss/train': 1.3516380786895752} 11/07/2021 12:12:05 - INFO - __main__ - Step 106265: {'lr': 0.00010021766173365066, 'samples': 20402880, 'steps': 106264, 'loss/train': 1.5830334424972534} 11/07/2021 12:12:05 - INFO - __main__ - Step 106266: {'lr': 0.00010021341291061761, 'samples': 20403072, 'steps': 106265, 'loss/train': 1.2159863710403442} 11/07/2021 12:12:05 - INFO - __main__ - Step 106267: {'lr': 0.00010020916415507605, 'samples': 20403264, 'steps': 106266, 'loss/train': 1.0664206743240356} 11/07/2021 12:12:07 - INFO - __main__ - Step 106268: {'lr': 0.00010020491546702795, 'samples': 20403456, 'steps': 106267, 'loss/train': 1.3199230432510376} 11/07/2021 12:12:07 - INFO - __main__ - Step 106269: {'lr': 0.00010020066684647522, 'samples': 20403648, 'steps': 106268, 'loss/train': 1.4341771602630615} 11/07/2021 12:12:07 - INFO - __main__ - Step 106270: {'lr': 0.00010019641829341975, 'samples': 20403840, 'steps': 106269, 'loss/train': 0.8712256550788879} 11/07/2021 12:12:08 - INFO - __main__ - Step 106271: {'lr': 0.00010019216980786344, 'samples': 20404032, 'steps': 106270, 'loss/train': 1.7416701316833496} 11/07/2021 12:12:08 - INFO - __main__ - Step 106272: {'lr': 0.00010018792138980834, 'samples': 20404224, 'steps': 106271, 'loss/train': 1.0134024620056152} 11/07/2021 12:12:08 - INFO - __main__ - Step 106273: {'lr': 0.00010018367303925616, 'samples': 20404416, 'steps': 106272, 'loss/train': 1.464575171470642} 11/07/2021 12:12:09 - INFO - __main__ - Step 106274: {'lr': 0.00010017942475620889, 'samples': 20404608, 'steps': 106273, 'loss/train': 0.09661776572465897} 11/07/2021 12:12:10 - INFO - __main__ - Step 106275: {'lr': 0.00010017517654066846, 'samples': 20404800, 'steps': 106274, 'loss/train': 1.2311593294143677} 11/07/2021 12:12:10 - INFO - __main__ - Step 106276: {'lr': 0.00010017092839263677, 'samples': 20404992, 'steps': 106275, 'loss/train': 1.502898931503296} 11/07/2021 12:12:10 - INFO - __main__ - Step 106277: {'lr': 0.0001001666803121158, 'samples': 20405184, 'steps': 106276, 'loss/train': 0.788158118724823} 11/07/2021 12:12:11 - INFO - __main__ - Step 106278: {'lr': 0.00010016243229910738, 'samples': 20405376, 'steps': 106277, 'loss/train': 1.6011512279510498} 11/07/2021 12:12:11 - INFO - __main__ - Step 106279: {'lr': 0.00010015818435361346, 'samples': 20405568, 'steps': 106278, 'loss/train': 1.2006542682647705} 11/07/2021 12:12:12 - INFO - __main__ - Step 106280: {'lr': 0.00010015393647563592, 'samples': 20405760, 'steps': 106279, 'loss/train': 1.3685449361801147} 11/07/2021 12:12:12 - INFO - __main__ - Step 106281: {'lr': 0.00010014968866517673, 'samples': 20405952, 'steps': 106280, 'loss/train': 1.207958698272705} 11/07/2021 12:12:13 - INFO - __main__ - Step 106282: {'lr': 0.00010014544092223779, 'samples': 20406144, 'steps': 106281, 'loss/train': 1.2306599617004395} 11/07/2021 12:12:13 - INFO - __main__ - Step 106283: {'lr': 0.000100141193246821, 'samples': 20406336, 'steps': 106282, 'loss/train': 1.5953996181488037} 11/07/2021 12:12:14 - INFO - __main__ - Step 106284: {'lr': 0.00010013694563892825, 'samples': 20406528, 'steps': 106283, 'loss/train': 1.5727252960205078} 11/07/2021 12:12:15 - INFO - __main__ - Step 106285: {'lr': 0.00010013269809856148, 'samples': 20406720, 'steps': 106284, 'loss/train': 1.1802356243133545} 11/07/2021 12:12:15 - INFO - __main__ - Step 106286: {'lr': 0.00010012845062572273, 'samples': 20406912, 'steps': 106285, 'loss/train': 1.4309217929840088} 11/07/2021 12:12:15 - INFO - __main__ - Step 106287: {'lr': 0.00010012420322041369, 'samples': 20407104, 'steps': 106286, 'loss/train': 1.0106662511825562} 11/07/2021 12:12:16 - INFO - __main__ - Step 106288: {'lr': 0.00010011995588263633, 'samples': 20407296, 'steps': 106287, 'loss/train': 1.3055087327957153} 11/07/2021 12:12:16 - INFO - __main__ - Step 106289: {'lr': 0.00010011570861239264, 'samples': 20407488, 'steps': 106288, 'loss/train': 0.7381813526153564} 11/07/2021 12:12:17 - INFO - __main__ - Step 106290: {'lr': 0.0001001114614096845, 'samples': 20407680, 'steps': 106289, 'loss/train': 1.6076875925064087} 11/07/2021 12:12:17 - INFO - __main__ - Step 106291: {'lr': 0.0001001072142745138, 'samples': 20407872, 'steps': 106290, 'loss/train': 0.991110622882843} 11/07/2021 12:12:18 - INFO - __main__ - Step 106292: {'lr': 0.0001001029672068825, 'samples': 20408064, 'steps': 106291, 'loss/train': 1.048858642578125} 11/07/2021 12:12:18 - INFO - __main__ - Step 106293: {'lr': 0.00010009872020679248, 'samples': 20408256, 'steps': 106292, 'loss/train': 0.8192484378814697} 11/07/2021 12:12:19 - INFO - __main__ - Step 106294: {'lr': 0.00010009447327424567, 'samples': 20408448, 'steps': 106293, 'loss/train': 1.2475744485855103} 11/07/2021 12:12:20 - INFO - __main__ - Step 106295: {'lr': 0.00010009022640924395, 'samples': 20408640, 'steps': 106294, 'loss/train': 1.3146930932998657} 11/07/2021 12:12:20 - INFO - __main__ - Step 106296: {'lr': 0.00010008597961178931, 'samples': 20408832, 'steps': 106295, 'loss/train': 1.2089062929153442} 11/07/2021 12:12:20 - INFO - __main__ - Step 106297: {'lr': 0.00010008173288188358, 'samples': 20409024, 'steps': 106296, 'loss/train': 0.6000612378120422} 11/07/2021 12:12:21 - INFO - __main__ - Step 106298: {'lr': 0.0001000774862195287, 'samples': 20409216, 'steps': 106297, 'loss/train': 0.7842581868171692} 11/07/2021 12:12:21 - INFO - __main__ - Step 106299: {'lr': 0.00010007323962472669, 'samples': 20409408, 'steps': 106298, 'loss/train': 1.3589733839035034} 11/07/2021 12:12:21 - INFO - __main__ - Step 106300: {'lr': 0.0001000689930974793, 'samples': 20409600, 'steps': 106299, 'loss/train': 0.9655224084854126} 11/07/2021 12:12:22 - INFO - __main__ - Step 106301: {'lr': 0.00010006474663778847, 'samples': 20409792, 'steps': 106300, 'loss/train': 1.2372331619262695} 11/07/2021 12:12:23 - INFO - __main__ - Step 106302: {'lr': 0.00010006050024565619, 'samples': 20409984, 'steps': 106301, 'loss/train': 0.4349133372306824} 11/07/2021 12:12:23 - INFO - __main__ - Step 106303: {'lr': 0.0001000562539210843, 'samples': 20410176, 'steps': 106302, 'loss/train': 1.651429295539856} 11/07/2021 12:12:24 - INFO - __main__ - Step 106304: {'lr': 0.00010005200766407476, 'samples': 20410368, 'steps': 106303, 'loss/train': 1.1096410751342773} 11/07/2021 12:12:24 - INFO - __main__ - Step 106305: {'lr': 0.00010004776147462946, 'samples': 20410560, 'steps': 106304, 'loss/train': 1.2512274980545044} 11/07/2021 12:12:25 - INFO - __main__ - Step 106306: {'lr': 0.00010004351535275036, 'samples': 20410752, 'steps': 106305, 'loss/train': 1.1054760217666626} 11/07/2021 12:12:25 - INFO - __main__ - Step 106307: {'lr': 0.00010003926929843931, 'samples': 20410944, 'steps': 106306, 'loss/train': 1.2045958042144775} 11/07/2021 12:12:26 - INFO - __main__ - Step 106308: {'lr': 0.00010003502331169825, 'samples': 20411136, 'steps': 106307, 'loss/train': 1.5673600435256958} 11/07/2021 12:12:26 - INFO - __main__ - Step 106309: {'lr': 0.00010003077739252911, 'samples': 20411328, 'steps': 106308, 'loss/train': 0.7638879418373108} 11/07/2021 12:12:26 - INFO - __main__ - Step 106310: {'lr': 0.00010002653154093378, 'samples': 20411520, 'steps': 106309, 'loss/train': 0.5977858304977417} 11/07/2021 12:12:27 - INFO - __main__ - Step 106311: {'lr': 0.00010002228575691418, 'samples': 20411712, 'steps': 106310, 'loss/train': 1.219445824623108} 11/07/2021 12:12:28 - INFO - __main__ - Step 106312: {'lr': 0.00010001804004047222, 'samples': 20411904, 'steps': 106311, 'loss/train': 1.2780416011810303} 11/07/2021 12:12:28 - INFO - __main__ - Step 106313: {'lr': 0.0001000137943916099, 'samples': 20412096, 'steps': 106312, 'loss/train': 1.3930914402008057} 11/07/2021 12:12:28 - INFO - __main__ - Step 106314: {'lr': 0.00010000954881032898, 'samples': 20412288, 'steps': 106313, 'loss/train': 1.039530634880066} 11/07/2021 12:12:29 - INFO - __main__ - Step 106315: {'lr': 0.00010000530329663144, 'samples': 20412480, 'steps': 106314, 'loss/train': 1.2939367294311523} 11/07/2021 12:12:30 - INFO - __main__ - Step 106316: {'lr': 0.0001000010578505192, 'samples': 20412672, 'steps': 106315, 'loss/train': 0.6617066860198975} 11/07/2021 12:12:30 - INFO - __main__ - Step 106317: {'lr': 9.999681247199415e-05, 'samples': 20412864, 'steps': 106316, 'loss/train': 1.4153947830200195} 11/07/2021 12:12:30 - INFO - __main__ - Step 106318: {'lr': 9.999256716105823e-05, 'samples': 20413056, 'steps': 106317, 'loss/train': 0.9468328952789307} 11/07/2021 12:12:31 - INFO - __main__ - Step 106319: {'lr': 9.998832191771334e-05, 'samples': 20413248, 'steps': 106318, 'loss/train': 1.5213872194290161} 11/07/2021 12:12:31 - INFO - __main__ - Step 106320: {'lr': 9.998407674196142e-05, 'samples': 20413440, 'steps': 106319, 'loss/train': 1.4505072832107544} 11/07/2021 12:12:32 - INFO - __main__ - Step 106321: {'lr': 9.997983163380434e-05, 'samples': 20413632, 'steps': 106320, 'loss/train': 1.2304024696350098} 11/07/2021 12:12:33 - INFO - __main__ - Step 106322: {'lr': 9.997558659324402e-05, 'samples': 20413824, 'steps': 106321, 'loss/train': 1.285844087600708} 11/07/2021 12:12:33 - INFO - __main__ - Step 106323: {'lr': 9.99713416202824e-05, 'samples': 20414016, 'steps': 106322, 'loss/train': 1.0114476680755615} 11/07/2021 12:12:33 - INFO - __main__ - Step 106324: {'lr': 9.996709671492138e-05, 'samples': 20414208, 'steps': 106323, 'loss/train': 5.693900108337402} 11/07/2021 12:12:34 - INFO - __main__ - Step 106325: {'lr': 9.996285187716286e-05, 'samples': 20414400, 'steps': 106324, 'loss/train': 1.1377824544906616} 11/07/2021 12:12:34 - INFO - __main__ - Step 106326: {'lr': 9.995860710700888e-05, 'samples': 20414592, 'steps': 106325, 'loss/train': 1.3161213397979736} 11/07/2021 12:12:35 - INFO - __main__ - Step 106327: {'lr': 9.995436240446112e-05, 'samples': 20414784, 'steps': 106326, 'loss/train': 1.4825575351715088} 11/07/2021 12:12:36 - INFO - __main__ - Step 106328: {'lr': 9.995011776952162e-05, 'samples': 20414976, 'steps': 106327, 'loss/train': 1.4844409227371216} 11/07/2021 12:12:36 - INFO - __main__ - Step 106329: {'lr': 9.994587320219228e-05, 'samples': 20415168, 'steps': 106328, 'loss/train': 1.3300940990447998} 11/07/2021 12:12:36 - INFO - __main__ - Step 106330: {'lr': 9.9941628702475e-05, 'samples': 20415360, 'steps': 106329, 'loss/train': 1.451361060142517} 11/07/2021 12:12:37 - INFO - __main__ - Step 106331: {'lr': 9.993738427037175e-05, 'samples': 20415552, 'steps': 106330, 'loss/train': 1.2461464405059814} 11/07/2021 12:12:38 - INFO - __main__ - Step 106332: {'lr': 9.993313990588434e-05, 'samples': 20415744, 'steps': 106331, 'loss/train': 1.504154086112976} 11/07/2021 12:12:38 - INFO - __main__ - Step 106333: {'lr': 9.992889560901478e-05, 'samples': 20415936, 'steps': 106332, 'loss/train': 1.2463843822479248} 11/07/2021 12:12:38 - INFO - __main__ - Step 106334: {'lr': 9.992465137976495e-05, 'samples': 20416128, 'steps': 106333, 'loss/train': 1.248310923576355} 11/07/2021 12:12:39 - INFO - __main__ - Step 106335: {'lr': 9.992040721813673e-05, 'samples': 20416320, 'steps': 106334, 'loss/train': 1.4471616744995117} 11/07/2021 12:12:39 - INFO - __main__ - Step 106336: {'lr': 9.991616312413206e-05, 'samples': 20416512, 'steps': 106335, 'loss/train': 1.7511701583862305} 11/07/2021 12:12:40 - INFO - __main__ - Step 106337: {'lr': 9.991191909775287e-05, 'samples': 20416704, 'steps': 106336, 'loss/train': 1.3047993183135986} 11/07/2021 12:12:40 - INFO - __main__ - Step 106338: {'lr': 9.990767513900107e-05, 'samples': 20416896, 'steps': 106337, 'loss/train': 0.16784578561782837} 11/07/2021 12:12:41 - INFO - __main__ - Step 106339: {'lr': 9.990343124787851e-05, 'samples': 20417088, 'steps': 106338, 'loss/train': 1.39777672290802} 11/07/2021 12:12:41 - INFO - __main__ - Step 106340: {'lr': 9.989918742438725e-05, 'samples': 20417280, 'steps': 106339, 'loss/train': 1.3029602766036987} 11/07/2021 12:12:41 - INFO - __main__ - Step 106341: {'lr': 9.989494366852902e-05, 'samples': 20417472, 'steps': 106340, 'loss/train': 1.466521978378296} 11/07/2021 12:12:43 - INFO - __main__ - Step 106342: {'lr': 9.989069998030581e-05, 'samples': 20417664, 'steps': 106341, 'loss/train': 1.470411777496338} 11/07/2021 12:12:43 - INFO - __main__ - Step 106343: {'lr': 9.988645635971954e-05, 'samples': 20417856, 'steps': 106342, 'loss/train': 0.41887590289115906} 11/07/2021 12:12:43 - INFO - __main__ - Step 106344: {'lr': 9.988221280677213e-05, 'samples': 20418048, 'steps': 106343, 'loss/train': 1.2943665981292725} 11/07/2021 12:12:44 - INFO - __main__ - Step 106345: {'lr': 9.987796932146545e-05, 'samples': 20418240, 'steps': 106344, 'loss/train': 0.8354166150093079} 11/07/2021 12:12:44 - INFO - __main__ - Step 106346: {'lr': 9.987372590380145e-05, 'samples': 20418432, 'steps': 106345, 'loss/train': 0.611535906791687} 11/07/2021 12:12:45 - INFO - __main__ - Step 106347: {'lr': 9.986948255378204e-05, 'samples': 20418624, 'steps': 106346, 'loss/train': 0.9613615870475769} 11/07/2021 12:12:45 - INFO - __main__ - Step 106348: {'lr': 9.986523927140909e-05, 'samples': 20418816, 'steps': 106347, 'loss/train': 0.9167613983154297} 11/07/2021 12:12:46 - INFO - __main__ - Step 106349: {'lr': 9.986099605668458e-05, 'samples': 20419008, 'steps': 106348, 'loss/train': 1.198395013809204} 11/07/2021 12:12:46 - INFO - __main__ - Step 106350: {'lr': 9.985675290961038e-05, 'samples': 20419200, 'steps': 106349, 'loss/train': 1.4023329019546509} 11/07/2021 12:12:46 - INFO - __main__ - Step 106351: {'lr': 9.98525098301884e-05, 'samples': 20419392, 'steps': 106350, 'loss/train': 1.3679540157318115} 11/07/2021 12:12:47 - INFO - __main__ - Step 106352: {'lr': 9.984826681842057e-05, 'samples': 20419584, 'steps': 106351, 'loss/train': 1.4141767024993896} 11/07/2021 12:12:48 - INFO - __main__ - Step 106353: {'lr': 9.98440238743089e-05, 'samples': 20419776, 'steps': 106352, 'loss/train': 1.0172114372253418} 11/07/2021 12:12:48 - INFO - __main__ - Step 106354: {'lr': 9.98397809978551e-05, 'samples': 20419968, 'steps': 106353, 'loss/train': 1.3712198734283447} 11/07/2021 12:12:49 - INFO - __main__ - Step 106355: {'lr': 9.983553818906116e-05, 'samples': 20420160, 'steps': 106354, 'loss/train': 1.3200993537902832} 11/07/2021 12:12:49 - INFO - __main__ - Step 106356: {'lr': 9.983129544792902e-05, 'samples': 20420352, 'steps': 106355, 'loss/train': 1.2947713136672974} 11/07/2021 12:12:50 - INFO - __main__ - Step 106357: {'lr': 9.982705277446057e-05, 'samples': 20420544, 'steps': 106356, 'loss/train': 0.935691237449646} 11/07/2021 12:12:50 - INFO - __main__ - Step 106358: {'lr': 9.982281016865777e-05, 'samples': 20420736, 'steps': 106357, 'loss/train': 2.0151538848876953} 11/07/2021 12:12:51 - INFO - __main__ - Step 106359: {'lr': 9.981856763052247e-05, 'samples': 20420928, 'steps': 106358, 'loss/train': 1.1899731159210205} 11/07/2021 12:12:51 - INFO - __main__ - Step 106360: {'lr': 9.981432516005658e-05, 'samples': 20421120, 'steps': 106359, 'loss/train': 1.5142709016799927} 11/07/2021 12:12:51 - INFO - __main__ - Step 106361: {'lr': 9.981008275726208e-05, 'samples': 20421312, 'steps': 106360, 'loss/train': 1.2643169164657593} 11/07/2021 12:12:52 - INFO - __main__ - Step 106362: {'lr': 9.980584042214083e-05, 'samples': 20421504, 'steps': 106361, 'loss/train': 1.3319324254989624} 11/07/2021 12:12:53 - INFO - __main__ - Step 106363: {'lr': 9.980159815469472e-05, 'samples': 20421696, 'steps': 106362, 'loss/train': 1.397775650024414} 11/07/2021 12:12:53 - INFO - __main__ - Step 106364: {'lr': 9.979735595492573e-05, 'samples': 20421888, 'steps': 106363, 'loss/train': 1.1940819025039673} 11/07/2021 12:12:53 - INFO - __main__ - Step 106365: {'lr': 9.97931138228357e-05, 'samples': 20422080, 'steps': 106364, 'loss/train': 1.19482421875} 11/07/2021 12:12:54 - INFO - __main__ - Step 106366: {'lr': 9.97888717584266e-05, 'samples': 20422272, 'steps': 106365, 'loss/train': 1.5417606830596924} 11/07/2021 12:12:55 - INFO - __main__ - Step 106367: {'lr': 9.978462976170041e-05, 'samples': 20422464, 'steps': 106366, 'loss/train': 1.5434985160827637} 11/07/2021 12:12:55 - INFO - __main__ - Step 106368: {'lr': 9.978038783265883e-05, 'samples': 20422656, 'steps': 106367, 'loss/train': 1.3238669633865356} 11/07/2021 12:12:56 - INFO - __main__ - Step 106369: {'lr': 9.977614597130391e-05, 'samples': 20422848, 'steps': 106368, 'loss/train': 1.7387518882751465} 11/07/2021 12:12:56 - INFO - __main__ - Step 106370: {'lr': 9.977190417763754e-05, 'samples': 20423040, 'steps': 106369, 'loss/train': 1.1015359163284302} 11/07/2021 12:12:56 - INFO - __main__ - Step 106371: {'lr': 9.976766245166164e-05, 'samples': 20423232, 'steps': 106370, 'loss/train': 2.531430721282959} 11/07/2021 12:12:57 - INFO - __main__ - Step 106372: {'lr': 9.97634207933781e-05, 'samples': 20423424, 'steps': 106371, 'loss/train': 1.2855688333511353} 11/07/2021 12:12:58 - INFO - __main__ - Step 106373: {'lr': 9.975917920278884e-05, 'samples': 20423616, 'steps': 106372, 'loss/train': 1.3966418504714966} 11/07/2021 12:12:58 - INFO - __main__ - Step 106374: {'lr': 9.97549376798958e-05, 'samples': 20423808, 'steps': 106373, 'loss/train': 1.3641459941864014} 11/07/2021 12:12:58 - INFO - __main__ - Step 106375: {'lr': 9.975069622470084e-05, 'samples': 20424000, 'steps': 106374, 'loss/train': 1.4503862857818604} 11/07/2021 12:12:59 - INFO - __main__ - Step 106376: {'lr': 9.974645483720591e-05, 'samples': 20424192, 'steps': 106375, 'loss/train': 1.9607363939285278} 11/07/2021 12:12:59 - INFO - __main__ - Step 106377: {'lr': 9.974221351741289e-05, 'samples': 20424384, 'steps': 106376, 'loss/train': 0.9940716028213501} 11/07/2021 12:13:00 - INFO - __main__ - Step 106378: {'lr': 9.973797226532372e-05, 'samples': 20424576, 'steps': 106377, 'loss/train': 0.8951369524002075} 11/07/2021 12:13:00 - INFO - __main__ - Step 106379: {'lr': 9.973373108094031e-05, 'samples': 20424768, 'steps': 106378, 'loss/train': 1.6449919939041138} 11/07/2021 12:13:01 - INFO - __main__ - Step 106380: {'lr': 9.972948996426464e-05, 'samples': 20424960, 'steps': 106379, 'loss/train': 1.1644750833511353} 11/07/2021 12:13:01 - INFO - __main__ - Step 106381: {'lr': 9.972524891529847e-05, 'samples': 20425152, 'steps': 106380, 'loss/train': 1.9210715293884277} 11/07/2021 12:13:01 - INFO - __main__ - Step 106382: {'lr': 9.972100793404377e-05, 'samples': 20425344, 'steps': 106381, 'loss/train': 1.3507176637649536} 11/07/2021 12:13:02 - INFO - __main__ - Step 106383: {'lr': 9.971676702050247e-05, 'samples': 20425536, 'steps': 106382, 'loss/train': 1.2880730628967285} 11/07/2021 12:13:03 - INFO - __main__ - Step 106384: {'lr': 9.97125261746765e-05, 'samples': 20425728, 'steps': 106383, 'loss/train': 1.307837724685669} 11/07/2021 12:13:03 - INFO - __main__ - Step 106385: {'lr': 9.970828539656771e-05, 'samples': 20425920, 'steps': 106384, 'loss/train': 1.3818550109863281} 11/07/2021 12:13:04 - INFO - __main__ - Step 106386: {'lr': 9.970404468617805e-05, 'samples': 20426112, 'steps': 106385, 'loss/train': 1.0798205137252808} 11/07/2021 12:13:04 - INFO - __main__ - Step 106387: {'lr': 9.969980404350945e-05, 'samples': 20426304, 'steps': 106386, 'loss/train': 1.2067021131515503} 11/07/2021 12:13:05 - INFO - __main__ - Step 106388: {'lr': 9.969556346856379e-05, 'samples': 20426496, 'steps': 106387, 'loss/train': 1.1976757049560547} 11/07/2021 12:13:05 - INFO - __main__ - Step 106389: {'lr': 9.969132296134298e-05, 'samples': 20426688, 'steps': 106388, 'loss/train': 1.1593685150146484} 11/07/2021 12:13:06 - INFO - __main__ - Step 106390: {'lr': 9.968708252184894e-05, 'samples': 20426880, 'steps': 106389, 'loss/train': 1.2124824523925781} 11/07/2021 12:13:06 - INFO - __main__ - Step 106391: {'lr': 9.968284215008358e-05, 'samples': 20427072, 'steps': 106390, 'loss/train': 1.7975618839263916} 11/07/2021 12:13:06 - INFO - __main__ - Step 106392: {'lr': 9.967860184604882e-05, 'samples': 20427264, 'steps': 106391, 'loss/train': 1.1540435552597046} 11/07/2021 12:13:07 - INFO - __main__ - Step 106393: {'lr': 9.967436160974666e-05, 'samples': 20427456, 'steps': 106392, 'loss/train': 1.5553056001663208} 11/07/2021 12:13:08 - INFO - __main__ - Step 106394: {'lr': 9.967012144117882e-05, 'samples': 20427648, 'steps': 106393, 'loss/train': 1.2260417938232422} 11/07/2021 12:13:08 - INFO - __main__ - Step 106395: {'lr': 9.966588134034729e-05, 'samples': 20427840, 'steps': 106394, 'loss/train': 1.4758062362670898} 11/07/2021 12:13:08 - INFO - __main__ - Step 106396: {'lr': 9.9661641307254e-05, 'samples': 20428032, 'steps': 106395, 'loss/train': 0.972430408000946} 11/07/2021 12:13:09 - INFO - __main__ - Step 106397: {'lr': 9.965740134190087e-05, 'samples': 20428224, 'steps': 106396, 'loss/train': 1.6715682744979858} 11/07/2021 12:13:09 - INFO - __main__ - Step 106398: {'lr': 9.96531614442898e-05, 'samples': 20428416, 'steps': 106397, 'loss/train': 1.2136670351028442} 11/07/2021 12:13:11 - INFO - __main__ - Step 106399: {'lr': 9.964892161442265e-05, 'samples': 20428608, 'steps': 106398, 'loss/train': 1.349844217300415} 11/07/2021 12:13:11 - INFO - __main__ - Step 106400: {'lr': 9.964468185230141e-05, 'samples': 20428800, 'steps': 106399, 'loss/train': 1.1384109258651733} 11/07/2021 12:13:11 - INFO - __main__ - Step 106401: {'lr': 9.964044215792795e-05, 'samples': 20428992, 'steps': 106400, 'loss/train': 1.694475769996643} 11/07/2021 12:13:12 - INFO - __main__ - Step 106402: {'lr': 9.963620253130418e-05, 'samples': 20429184, 'steps': 106401, 'loss/train': 1.4886027574539185} 11/07/2021 12:13:12 - INFO - __main__ - Step 106403: {'lr': 9.963196297243204e-05, 'samples': 20429376, 'steps': 106402, 'loss/train': 1.047844648361206} 11/07/2021 12:13:12 - INFO - __main__ - Step 106404: {'lr': 9.962772348131338e-05, 'samples': 20429568, 'steps': 106403, 'loss/train': 1.7524747848510742} 11/07/2021 12:13:14 - INFO - __main__ - Step 106405: {'lr': 9.962348405795018e-05, 'samples': 20429760, 'steps': 106404, 'loss/train': 1.7029167413711548} 11/07/2021 12:13:14 - INFO - __main__ - Step 106406: {'lr': 9.96192447023444e-05, 'samples': 20429952, 'steps': 106405, 'loss/train': 1.4610165357589722} 11/07/2021 12:13:14 - INFO - __main__ - Step 106407: {'lr': 9.961500541449778e-05, 'samples': 20430144, 'steps': 106406, 'loss/train': 1.0175890922546387} 11/07/2021 12:13:15 - INFO - __main__ - Step 106408: {'lr': 9.961076619441231e-05, 'samples': 20430336, 'steps': 106407, 'loss/train': 1.1377298831939697} 11/07/2021 12:13:15 - INFO - __main__ - Step 106409: {'lr': 9.960652704208988e-05, 'samples': 20430528, 'steps': 106408, 'loss/train': 1.4111690521240234} 11/07/2021 12:13:16 - INFO - __main__ - Step 106410: {'lr': 9.960228795753248e-05, 'samples': 20430720, 'steps': 106409, 'loss/train': 1.3916847705841064} 11/07/2021 12:13:16 - INFO - __main__ - Step 106411: {'lr': 9.959804894074195e-05, 'samples': 20430912, 'steps': 106410, 'loss/train': 0.9060037732124329} 11/07/2021 12:13:17 - INFO - __main__ - Step 106412: {'lr': 9.959380999172021e-05, 'samples': 20431104, 'steps': 106411, 'loss/train': 1.197373867034912} 11/07/2021 12:13:17 - INFO - __main__ - Step 106413: {'lr': 9.958957111046918e-05, 'samples': 20431296, 'steps': 106412, 'loss/train': 1.2787989377975464} 11/07/2021 12:13:17 - INFO - __main__ - Step 106414: {'lr': 9.958533229699076e-05, 'samples': 20431488, 'steps': 106413, 'loss/train': 0.9870840311050415} 11/07/2021 12:13:18 - INFO - __main__ - Step 106415: {'lr': 9.958109355128688e-05, 'samples': 20431680, 'steps': 106414, 'loss/train': 1.4503458738327026} 11/07/2021 12:13:19 - INFO - __main__ - Step 106416: {'lr': 9.957685487335946e-05, 'samples': 20431872, 'steps': 106415, 'loss/train': 1.0733083486557007} 11/07/2021 12:13:19 - INFO - __main__ - Step 106417: {'lr': 9.957261626321045e-05, 'samples': 20432064, 'steps': 106416, 'loss/train': 1.3162708282470703} 11/07/2021 12:13:19 - INFO - __main__ - Step 106418: {'lr': 9.956837772084159e-05, 'samples': 20432256, 'steps': 106417, 'loss/train': 1.3108164072036743} 11/07/2021 12:13:20 - INFO - __main__ - Step 106419: {'lr': 9.956413924625493e-05, 'samples': 20432448, 'steps': 106418, 'loss/train': 1.7638791799545288} 11/07/2021 12:13:21 - INFO - __main__ - Step 106420: {'lr': 9.955990083945235e-05, 'samples': 20432640, 'steps': 106419, 'loss/train': 1.68954336643219} 11/07/2021 12:13:21 - INFO - __main__ - Step 106421: {'lr': 9.955566250043574e-05, 'samples': 20432832, 'steps': 106420, 'loss/train': 1.5297019481658936} 11/07/2021 12:13:22 - INFO - __main__ - Step 106422: {'lr': 9.955142422920704e-05, 'samples': 20433024, 'steps': 106421, 'loss/train': 0.6790896058082581} 11/07/2021 12:13:22 - INFO - __main__ - Step 106423: {'lr': 9.954718602576815e-05, 'samples': 20433216, 'steps': 106422, 'loss/train': 1.4037100076675415} 11/07/2021 12:13:22 - INFO - __main__ - Step 106424: {'lr': 9.954294789012094e-05, 'samples': 20433408, 'steps': 106423, 'loss/train': 1.4512680768966675} 11/07/2021 12:13:23 - INFO - __main__ - Step 106425: {'lr': 9.953870982226739e-05, 'samples': 20433600, 'steps': 106424, 'loss/train': 0.6314427852630615} 11/07/2021 12:13:24 - INFO - __main__ - Step 106426: {'lr': 9.953447182220937e-05, 'samples': 20433792, 'steps': 106425, 'loss/train': 1.284536600112915} 11/07/2021 12:13:24 - INFO - __main__ - Step 106427: {'lr': 9.953023388994881e-05, 'samples': 20433984, 'steps': 106426, 'loss/train': 1.0182015895843506} 11/07/2021 12:13:24 - INFO - __main__ - Step 106428: {'lr': 9.952599602548765e-05, 'samples': 20434176, 'steps': 106427, 'loss/train': 1.56458580493927} 11/07/2021 12:13:25 - INFO - __main__ - Step 106429: {'lr': 9.952175822882769e-05, 'samples': 20434368, 'steps': 106428, 'loss/train': 1.271517038345337} 11/07/2021 12:13:25 - INFO - __main__ - Step 106430: {'lr': 9.951752049997093e-05, 'samples': 20434560, 'steps': 106429, 'loss/train': 1.7395315170288086} 11/07/2021 12:13:26 - INFO - __main__ - Step 106431: {'lr': 9.951328283891922e-05, 'samples': 20434752, 'steps': 106430, 'loss/train': 5.356851577758789} 11/07/2021 12:13:27 - INFO - __main__ - Step 106432: {'lr': 9.95090452456745e-05, 'samples': 20434944, 'steps': 106431, 'loss/train': 1.3204426765441895} 11/07/2021 12:13:27 - INFO - __main__ - Step 106433: {'lr': 9.95048077202387e-05, 'samples': 20435136, 'steps': 106432, 'loss/train': 1.4071487188339233} 11/07/2021 12:13:27 - INFO - __main__ - Step 106434: {'lr': 9.95005702626137e-05, 'samples': 20435328, 'steps': 106433, 'loss/train': 1.669852375984192} 11/07/2021 12:13:28 - INFO - __main__ - Step 106435: {'lr': 9.949633287280144e-05, 'samples': 20435520, 'steps': 106434, 'loss/train': 1.4247382879257202} 11/07/2021 12:13:28 - INFO - __main__ - Step 106436: {'lr': 9.949209555080379e-05, 'samples': 20435712, 'steps': 106435, 'loss/train': 0.9050887823104858} 11/07/2021 12:13:29 - INFO - __main__ - Step 106437: {'lr': 9.948785829662269e-05, 'samples': 20435904, 'steps': 106436, 'loss/train': 1.6037713289260864} 11/07/2021 12:13:30 - INFO - __main__ - Step 106438: {'lr': 9.948362111026002e-05, 'samples': 20436096, 'steps': 106437, 'loss/train': 1.1339439153671265} 11/07/2021 12:13:30 - INFO - __main__ - Step 106439: {'lr': 9.947938399171783e-05, 'samples': 20436288, 'steps': 106438, 'loss/train': 2.090672016143799} 11/07/2021 12:13:30 - INFO - __main__ - Step 106440: {'lr': 9.947514694099777e-05, 'samples': 20436480, 'steps': 106439, 'loss/train': 1.2425881624221802} 11/07/2021 12:13:31 - INFO - __main__ - Step 106441: {'lr': 9.947090995810193e-05, 'samples': 20436672, 'steps': 106440, 'loss/train': 1.0640398263931274} 11/07/2021 12:13:32 - INFO - __main__ - Step 106442: {'lr': 9.946667304303214e-05, 'samples': 20436864, 'steps': 106441, 'loss/train': 1.2306190729141235} 11/07/2021 12:13:32 - INFO - __main__ - Step 106443: {'lr': 9.946243619579038e-05, 'samples': 20437056, 'steps': 106442, 'loss/train': 1.290689468383789} 11/07/2021 12:13:32 - INFO - __main__ - Step 106444: {'lr': 9.945819941637852e-05, 'samples': 20437248, 'steps': 106443, 'loss/train': 1.2943918704986572} 11/07/2021 12:13:33 - INFO - __main__ - Step 106445: {'lr': 9.945396270479845e-05, 'samples': 20437440, 'steps': 106444, 'loss/train': 1.171091914176941} 11/07/2021 12:13:33 - INFO - __main__ - Step 106446: {'lr': 9.94497260610521e-05, 'samples': 20437632, 'steps': 106445, 'loss/train': 1.5072581768035889} 11/07/2021 12:13:34 - INFO - __main__ - Step 106447: {'lr': 9.94454894851414e-05, 'samples': 20437824, 'steps': 106446, 'loss/train': 1.1541849374771118} 11/07/2021 12:13:34 - INFO - __main__ - Step 106448: {'lr': 9.944125297706822e-05, 'samples': 20438016, 'steps': 106447, 'loss/train': 0.968512773513794} 11/07/2021 12:13:35 - INFO - __main__ - Step 106449: {'lr': 9.943701653683449e-05, 'samples': 20438208, 'steps': 106448, 'loss/train': 1.1546909809112549} 11/07/2021 12:13:35 - INFO - __main__ - Step 106450: {'lr': 9.943278016444221e-05, 'samples': 20438400, 'steps': 106449, 'loss/train': 1.3083460330963135} 11/07/2021 12:13:36 - INFO - __main__ - Step 106451: {'lr': 9.942854385989311e-05, 'samples': 20438592, 'steps': 106450, 'loss/train': 0.8149770498275757} 11/07/2021 12:13:37 - INFO - __main__ - Step 106452: {'lr': 9.942430762318919e-05, 'samples': 20438784, 'steps': 106451, 'loss/train': 1.2285902500152588} 11/07/2021 12:13:37 - INFO - __main__ - Step 106453: {'lr': 9.942007145433235e-05, 'samples': 20438976, 'steps': 106452, 'loss/train': 1.4055700302124023} 11/07/2021 12:13:38 - INFO - __main__ - Step 106454: {'lr': 9.941583535332451e-05, 'samples': 20439168, 'steps': 106453, 'loss/train': 1.2014104127883911} 11/07/2021 12:13:38 - INFO - __main__ - Step 106455: {'lr': 9.941159932016755e-05, 'samples': 20439360, 'steps': 106454, 'loss/train': 1.559401035308838} 11/07/2021 12:13:38 - INFO - __main__ - Step 106456: {'lr': 9.940736335486341e-05, 'samples': 20439552, 'steps': 106455, 'loss/train': 1.1795976161956787} 11/07/2021 12:13:40 - INFO - __main__ - Step 106457: {'lr': 9.9403127457414e-05, 'samples': 20439744, 'steps': 106456, 'loss/train': 0.4014163911342621} 11/07/2021 12:13:40 - INFO - __main__ - Step 106458: {'lr': 9.93988916278212e-05, 'samples': 20439936, 'steps': 106457, 'loss/train': 1.2339179515838623} 11/07/2021 12:13:40 - INFO - __main__ - Step 106459: {'lr': 9.939465586608695e-05, 'samples': 20440128, 'steps': 106458, 'loss/train': 0.7126978635787964} 11/07/2021 12:13:41 - INFO - __main__ - Step 106460: {'lr': 9.939042017221314e-05, 'samples': 20440320, 'steps': 106459, 'loss/train': 1.145280361175537} 11/07/2021 12:13:41 - INFO - __main__ - Step 106461: {'lr': 9.938618454620177e-05, 'samples': 20440512, 'steps': 106460, 'loss/train': 1.0447863340377808} 11/07/2021 12:13:41 - INFO - __main__ - Step 106462: {'lr': 9.938194898805455e-05, 'samples': 20440704, 'steps': 106461, 'loss/train': 1.5336852073669434} 11/07/2021 12:13:42 - INFO - __main__ - Step 106463: {'lr': 9.937771349777353e-05, 'samples': 20440896, 'steps': 106462, 'loss/train': 1.2737689018249512} 11/07/2021 12:13:43 - INFO - __main__ - Step 106464: {'lr': 9.937347807536056e-05, 'samples': 20441088, 'steps': 106463, 'loss/train': 1.0670725107192993} 11/07/2021 12:13:43 - INFO - __main__ - Step 106465: {'lr': 9.936924272081762e-05, 'samples': 20441280, 'steps': 106464, 'loss/train': 1.925568699836731} 11/07/2021 12:13:43 - INFO - __main__ - Step 106466: {'lr': 9.936500743414653e-05, 'samples': 20441472, 'steps': 106465, 'loss/train': 1.1221617460250854} 11/07/2021 12:13:44 - INFO - __main__ - Step 106467: {'lr': 9.936077221534928e-05, 'samples': 20441664, 'steps': 106466, 'loss/train': 1.3899644613265991} 11/07/2021 12:13:45 - INFO - __main__ - Step 106468: {'lr': 9.935653706442771e-05, 'samples': 20441856, 'steps': 106467, 'loss/train': 1.3145173788070679} 11/07/2021 12:13:45 - INFO - __main__ - Step 106469: {'lr': 9.935230198138378e-05, 'samples': 20442048, 'steps': 106468, 'loss/train': 1.2035226821899414} 11/07/2021 12:13:45 - INFO - __main__ - Step 106470: {'lr': 9.934806696621937e-05, 'samples': 20442240, 'steps': 106469, 'loss/train': 1.2243810892105103} 11/07/2021 12:13:46 - INFO - __main__ - Step 106471: {'lr': 9.93438320189364e-05, 'samples': 20442432, 'steps': 106470, 'loss/train': 1.2172906398773193} 11/07/2021 12:13:46 - INFO - __main__ - Step 106472: {'lr': 9.933959713953677e-05, 'samples': 20442624, 'steps': 106471, 'loss/train': 1.4012150764465332} 11/07/2021 12:13:47 - INFO - __main__ - Step 106473: {'lr': 9.93353623280224e-05, 'samples': 20442816, 'steps': 106472, 'loss/train': 2.6289331912994385} 11/07/2021 12:13:48 - INFO - __main__ - Step 106474: {'lr': 9.933112758439528e-05, 'samples': 20443008, 'steps': 106473, 'loss/train': 1.2571073770523071} 11/07/2021 12:13:48 - INFO - __main__ - Step 106475: {'lr': 9.932689290865712e-05, 'samples': 20443200, 'steps': 106474, 'loss/train': 1.5841946601867676} 11/07/2021 12:13:48 - INFO - __main__ - Step 106476: {'lr': 9.932265830080998e-05, 'samples': 20443392, 'steps': 106475, 'loss/train': 1.727122187614441} 11/07/2021 12:13:49 - INFO - __main__ - Step 106477: {'lr': 9.931842376085567e-05, 'samples': 20443584, 'steps': 106476, 'loss/train': 1.5977890491485596} 11/07/2021 12:13:49 - INFO - __main__ - Step 106478: {'lr': 9.931418928879618e-05, 'samples': 20443776, 'steps': 106477, 'loss/train': 1.547093152999878} 11/07/2021 12:13:50 - INFO - __main__ - Step 106479: {'lr': 9.930995488463341e-05, 'samples': 20443968, 'steps': 106478, 'loss/train': 1.5427567958831787} 11/07/2021 12:13:50 - INFO - __main__ - Step 106480: {'lr': 9.930572054836923e-05, 'samples': 20444160, 'steps': 106479, 'loss/train': 1.383000135421753} 11/07/2021 12:13:51 - INFO - __main__ - Step 106481: {'lr': 9.930148628000556e-05, 'samples': 20444352, 'steps': 106480, 'loss/train': 1.7151117324829102} 11/07/2021 12:13:51 - INFO - __main__ - Step 106482: {'lr': 9.929725207954433e-05, 'samples': 20444544, 'steps': 106481, 'loss/train': 0.897086501121521} 11/07/2021 12:13:52 - INFO - __main__ - Step 106483: {'lr': 9.929301794698742e-05, 'samples': 20444736, 'steps': 106482, 'loss/train': 1.1285324096679688} 11/07/2021 12:13:52 - INFO - __main__ - Step 106484: {'lr': 9.928878388233676e-05, 'samples': 20444928, 'steps': 106483, 'loss/train': 1.559794306755066} 11/07/2021 12:13:53 - INFO - __main__ - Step 106485: {'lr': 9.928454988559423e-05, 'samples': 20445120, 'steps': 106484, 'loss/train': 1.6323342323303223} 11/07/2021 12:13:53 - INFO - __main__ - Step 106486: {'lr': 9.928031595676177e-05, 'samples': 20445312, 'steps': 106485, 'loss/train': 1.5941283702850342} 11/07/2021 12:13:54 - INFO - __main__ - Step 106487: {'lr': 9.927608209584126e-05, 'samples': 20445504, 'steps': 106486, 'loss/train': 1.0588557720184326} 11/07/2021 12:13:54 - INFO - __main__ - Step 106488: {'lr': 9.927184830283476e-05, 'samples': 20445696, 'steps': 106487, 'loss/train': 1.5022122859954834} 11/07/2021 12:13:55 - INFO - __main__ - Step 106489: {'lr': 9.926761457774389e-05, 'samples': 20445888, 'steps': 106488, 'loss/train': 1.4870917797088623} 11/07/2021 12:13:55 - INFO - __main__ - Step 106490: {'lr': 9.926338092057075e-05, 'samples': 20446080, 'steps': 106489, 'loss/train': 1.4090031385421753} 11/07/2021 12:13:56 - INFO - __main__ - Step 106491: {'lr': 9.92591473313172e-05, 'samples': 20446272, 'steps': 106490, 'loss/train': 1.5319632291793823} 11/07/2021 12:13:56 - INFO - __main__ - Step 106492: {'lr': 9.925491380998511e-05, 'samples': 20446464, 'steps': 106491, 'loss/train': 1.5235764980316162} 11/07/2021 12:13:56 - INFO - __main__ - Step 106493: {'lr': 9.925068035657647e-05, 'samples': 20446656, 'steps': 106492, 'loss/train': 1.1593379974365234} 11/07/2021 12:13:58 - INFO - __main__ - Step 106494: {'lr': 9.924644697109314e-05, 'samples': 20446848, 'steps': 106493, 'loss/train': 1.426497220993042} 11/07/2021 12:13:58 - INFO - __main__ - Step 106495: {'lr': 9.924221365353702e-05, 'samples': 20447040, 'steps': 106494, 'loss/train': 1.360576868057251} 11/07/2021 12:13:58 - INFO - __main__ - Step 106496: {'lr': 9.923798040391005e-05, 'samples': 20447232, 'steps': 106495, 'loss/train': 1.3764740228652954} 11/07/2021 12:13:59 - INFO - __main__ - Step 106497: {'lr': 9.923374722221409e-05, 'samples': 20447424, 'steps': 106496, 'loss/train': 0.8292177319526672} 11/07/2021 12:13:59 - INFO - __main__ - Step 106498: {'lr': 9.92295141084511e-05, 'samples': 20447616, 'steps': 106497, 'loss/train': 1.206242561340332} 11/07/2021 12:13:59 - INFO - __main__ - Step 106499: {'lr': 9.922528106262296e-05, 'samples': 20447808, 'steps': 106498, 'loss/train': 0.8596234321594238} 11/07/2021 12:14:00 - INFO - __main__ - Step 106500: {'lr': 9.92210480847316e-05, 'samples': 20448000, 'steps': 106499, 'loss/train': 1.1286698579788208} 11/07/2021 12:14:01 - INFO - __main__ - Step 106501: {'lr': 9.921681517477899e-05, 'samples': 20448192, 'steps': 106500, 'loss/train': 1.5689997673034668} 11/07/2021 12:14:01 - INFO - __main__ - Step 106502: {'lr': 9.921258233276687e-05, 'samples': 20448384, 'steps': 106501, 'loss/train': 1.400510311126709} 11/07/2021 12:14:01 - INFO - __main__ - Step 106503: {'lr': 9.92083495586972e-05, 'samples': 20448576, 'steps': 106502, 'loss/train': 1.2231347560882568} 11/07/2021 12:14:02 - INFO - __main__ - Step 106504: {'lr': 9.920411685257194e-05, 'samples': 20448768, 'steps': 106503, 'loss/train': 1.3074358701705933} 11/07/2021 12:14:03 - INFO - __main__ - Step 106505: {'lr': 9.9199884214393e-05, 'samples': 20448960, 'steps': 106504, 'loss/train': 1.2554391622543335} 11/07/2021 12:14:03 - INFO - __main__ - Step 106506: {'lr': 9.919565164416224e-05, 'samples': 20449152, 'steps': 106505, 'loss/train': 1.2774736881256104} 11/07/2021 12:14:03 - INFO - __main__ - Step 106507: {'lr': 9.91914191418816e-05, 'samples': 20449344, 'steps': 106506, 'loss/train': 1.3729764223098755} 11/07/2021 12:14:04 - INFO - __main__ - Step 106508: {'lr': 9.918718670755297e-05, 'samples': 20449536, 'steps': 106507, 'loss/train': 1.403830885887146} 11/07/2021 12:14:04 - INFO - __main__ - Step 106509: {'lr': 9.91829543411783e-05, 'samples': 20449728, 'steps': 106508, 'loss/train': 1.489646077156067} 11/07/2021 12:14:05 - INFO - __main__ - Step 106510: {'lr': 9.917872204275944e-05, 'samples': 20449920, 'steps': 106509, 'loss/train': 0.8211663961410522} 11/07/2021 12:14:06 - INFO - __main__ - Step 106511: {'lr': 9.917448981229832e-05, 'samples': 20450112, 'steps': 106510, 'loss/train': 1.3580266237258911} 11/07/2021 12:14:06 - INFO - __main__ - Step 106512: {'lr': 9.917025764979684e-05, 'samples': 20450304, 'steps': 106511, 'loss/train': 0.856960654258728} 11/07/2021 12:14:06 - INFO - __main__ - Step 106513: {'lr': 9.916602555525692e-05, 'samples': 20450496, 'steps': 106512, 'loss/train': 1.4313750267028809} 11/07/2021 12:14:07 - INFO - __main__ - Step 106514: {'lr': 9.916179352868054e-05, 'samples': 20450688, 'steps': 106513, 'loss/train': 0.9347929954528809} 11/07/2021 12:14:08 - INFO - __main__ - Step 106515: {'lr': 9.915756157006947e-05, 'samples': 20450880, 'steps': 106514, 'loss/train': 1.0796517133712769} 11/07/2021 12:14:08 - INFO - __main__ - Step 106516: {'lr': 9.915332967942564e-05, 'samples': 20451072, 'steps': 106515, 'loss/train': 1.603251338005066} 11/07/2021 12:14:09 - INFO - __main__ - Step 106517: {'lr': 9.914909785675102e-05, 'samples': 20451264, 'steps': 106516, 'loss/train': 1.5108057260513306} 11/07/2021 12:14:09 - INFO - __main__ - Step 106518: {'lr': 9.914486610204749e-05, 'samples': 20451456, 'steps': 106517, 'loss/train': 1.4905656576156616} 11/07/2021 12:14:09 - INFO - __main__ - Step 106519: {'lr': 9.914063441531693e-05, 'samples': 20451648, 'steps': 106518, 'loss/train': 1.1953202486038208} 11/07/2021 12:14:10 - INFO - __main__ - Step 106520: {'lr': 9.913640279656128e-05, 'samples': 20451840, 'steps': 106519, 'loss/train': 0.6406259536743164} 11/07/2021 12:14:11 - INFO - __main__ - Step 106521: {'lr': 9.913217124578245e-05, 'samples': 20452032, 'steps': 106520, 'loss/train': 1.1510887145996094} 11/07/2021 12:14:11 - INFO - __main__ - Step 106522: {'lr': 9.912793976298235e-05, 'samples': 20452224, 'steps': 106521, 'loss/train': 1.479303002357483} 11/07/2021 12:14:11 - INFO - __main__ - Step 106523: {'lr': 9.912370834816283e-05, 'samples': 20452416, 'steps': 106522, 'loss/train': 1.501641869544983} 11/07/2021 12:14:12 - INFO - __main__ - Step 106524: {'lr': 9.911947700132587e-05, 'samples': 20452608, 'steps': 106523, 'loss/train': 1.1744109392166138} 11/07/2021 12:14:13 - INFO - __main__ - Step 106525: {'lr': 9.911524572247332e-05, 'samples': 20452800, 'steps': 106524, 'loss/train': 1.7909499406814575} 11/07/2021 12:14:13 - INFO - __main__ - Step 106526: {'lr': 9.911101451160715e-05, 'samples': 20452992, 'steps': 106525, 'loss/train': 1.7810138463974} 11/07/2021 12:14:13 - INFO - __main__ - Step 106527: {'lr': 9.910678336872919e-05, 'samples': 20453184, 'steps': 106526, 'loss/train': 1.6125972270965576} 11/07/2021 12:14:14 - INFO - __main__ - Step 106528: {'lr': 9.91025522938415e-05, 'samples': 20453376, 'steps': 106527, 'loss/train': 1.5949596166610718} 11/07/2021 12:14:14 - INFO - __main__ - Step 106529: {'lr': 9.909832128694577e-05, 'samples': 20453568, 'steps': 106528, 'loss/train': 1.2089784145355225} 11/07/2021 12:14:15 - INFO - __main__ - Step 106530: {'lr': 9.909409034804401e-05, 'samples': 20453760, 'steps': 106529, 'loss/train': 1.5122387409210205} 11/07/2021 12:14:16 - INFO - __main__ - Step 106531: {'lr': 9.908985947713814e-05, 'samples': 20453952, 'steps': 106530, 'loss/train': 1.5090852975845337} 11/07/2021 12:14:16 - INFO - __main__ - Step 106532: {'lr': 9.908562867423002e-05, 'samples': 20454144, 'steps': 106531, 'loss/train': 1.5094486474990845} 11/07/2021 12:14:16 - INFO - __main__ - Step 106533: {'lr': 9.908139793932161e-05, 'samples': 20454336, 'steps': 106532, 'loss/train': 1.1664060354232788} 11/07/2021 12:14:17 - INFO - __main__ - Step 106534: {'lr': 9.907716727241478e-05, 'samples': 20454528, 'steps': 106533, 'loss/train': 1.2241822481155396} 11/07/2021 12:14:18 - INFO - __main__ - Step 106535: {'lr': 9.907293667351148e-05, 'samples': 20454720, 'steps': 106534, 'loss/train': 1.1754454374313354} 11/07/2021 12:14:18 - INFO - __main__ - Step 106536: {'lr': 9.906870614261355e-05, 'samples': 20454912, 'steps': 106535, 'loss/train': 1.5143991708755493} 11/07/2021 12:14:18 - INFO - __main__ - Step 106537: {'lr': 9.906447567972293e-05, 'samples': 20455104, 'steps': 106536, 'loss/train': 1.2436896562576294} 11/07/2021 12:14:19 - INFO - __main__ - Step 106538: {'lr': 9.906024528484155e-05, 'samples': 20455296, 'steps': 106537, 'loss/train': 1.6273927688598633} 11/07/2021 12:14:19 - INFO - __main__ - Step 106539: {'lr': 9.905601495797128e-05, 'samples': 20455488, 'steps': 106538, 'loss/train': 1.1320502758026123} 11/07/2021 12:14:19 - INFO - __main__ - Step 106540: {'lr': 9.905178469911405e-05, 'samples': 20455680, 'steps': 106539, 'loss/train': 1.3389345407485962} 11/07/2021 12:14:20 - INFO - __main__ - Step 106541: {'lr': 9.904755450827185e-05, 'samples': 20455872, 'steps': 106540, 'loss/train': 1.0551937818527222} 11/07/2021 12:14:21 - INFO - __main__ - Step 106542: {'lr': 9.904332438544638e-05, 'samples': 20456064, 'steps': 106541, 'loss/train': 0.10035217553377151} 11/07/2021 12:14:21 - INFO - __main__ - Step 106543: {'lr': 9.903909433063968e-05, 'samples': 20456256, 'steps': 106542, 'loss/train': 1.2173659801483154} 11/07/2021 12:14:21 - INFO - __main__ - Step 106544: {'lr': 9.903486434385364e-05, 'samples': 20456448, 'steps': 106543, 'loss/train': 1.0192368030548096} 11/07/2021 12:14:22 - INFO - __main__ - Step 106545: {'lr': 9.903063442509013e-05, 'samples': 20456640, 'steps': 106544, 'loss/train': 1.4144068956375122} 11/07/2021 12:14:23 - INFO - __main__ - Step 106546: {'lr': 9.902640457435111e-05, 'samples': 20456832, 'steps': 106545, 'loss/train': 1.419664740562439} 11/07/2021 12:14:23 - INFO - __main__ - Step 106547: {'lr': 9.902217479163847e-05, 'samples': 20457024, 'steps': 106546, 'loss/train': 1.2897405624389648} 11/07/2021 12:14:24 - INFO - __main__ - Step 106548: {'lr': 9.90179450769541e-05, 'samples': 20457216, 'steps': 106547, 'loss/train': 1.3580940961837769} 11/07/2021 12:14:24 - INFO - __main__ - Step 106549: {'lr': 9.90137154302999e-05, 'samples': 20457408, 'steps': 106548, 'loss/train': 1.4785064458847046} 11/07/2021 12:14:24 - INFO - __main__ - Step 106550: {'lr': 9.900948585167782e-05, 'samples': 20457600, 'steps': 106549, 'loss/train': 1.6296594142913818} 11/07/2021 12:14:25 - INFO - __main__ - Step 106551: {'lr': 9.90052563410897e-05, 'samples': 20457792, 'steps': 106550, 'loss/train': 1.6220356225967407} 11/07/2021 12:14:26 - INFO - __main__ - Step 106552: {'lr': 9.900102689853751e-05, 'samples': 20457984, 'steps': 106551, 'loss/train': 1.1343470811843872} 11/07/2021 12:14:26 - INFO - __main__ - Step 106553: {'lr': 9.89967975240231e-05, 'samples': 20458176, 'steps': 106552, 'loss/train': 1.1221957206726074} 11/07/2021 12:14:26 - INFO - __main__ - Step 106554: {'lr': 9.899256821754843e-05, 'samples': 20458368, 'steps': 106553, 'loss/train': 0.5341590046882629} 11/07/2021 12:14:27 - INFO - __main__ - Step 106555: {'lr': 9.898833897911547e-05, 'samples': 20458560, 'steps': 106554, 'loss/train': 1.3027368783950806} 11/07/2021 12:14:28 - INFO - __main__ - Step 106556: {'lr': 9.898410980872591e-05, 'samples': 20458752, 'steps': 106555, 'loss/train': 0.9096299409866333} 11/07/2021 12:14:28 - INFO - __main__ - Step 106557: {'lr': 9.897988070638181e-05, 'samples': 20458944, 'steps': 106556, 'loss/train': 0.9684380292892456} 11/07/2021 12:14:29 - INFO - __main__ - Step 106558: {'lr': 9.897565167208502e-05, 'samples': 20459136, 'steps': 106557, 'loss/train': 0.8152120113372803} 11/07/2021 12:14:29 - INFO - __main__ - Step 106559: {'lr': 9.897142270583751e-05, 'samples': 20459328, 'steps': 106558, 'loss/train': 1.423860788345337} 11/07/2021 12:14:29 - INFO - __main__ - Step 106560: {'lr': 9.896719380764114e-05, 'samples': 20459520, 'steps': 106559, 'loss/train': 1.6708855628967285} 11/07/2021 12:14:31 - INFO - __main__ - Step 106561: {'lr': 9.896296497749779e-05, 'samples': 20459712, 'steps': 106560, 'loss/train': 1.4386003017425537} 11/07/2021 12:14:31 - INFO - __main__ - Step 106562: {'lr': 9.89587362154094e-05, 'samples': 20459904, 'steps': 106561, 'loss/train': 1.2772568464279175} 11/07/2021 12:14:31 - INFO - __main__ - Step 106563: {'lr': 9.895450752137788e-05, 'samples': 20460096, 'steps': 106562, 'loss/train': 1.5980315208435059} 11/07/2021 12:14:32 - INFO - __main__ - Step 106564: {'lr': 9.895027889540515e-05, 'samples': 20460288, 'steps': 106563, 'loss/train': 1.1004319190979004} 11/07/2021 12:14:32 - INFO - __main__ - Step 106565: {'lr': 9.894605033749307e-05, 'samples': 20460480, 'steps': 106564, 'loss/train': 1.506199836730957} 11/07/2021 12:14:32 - INFO - __main__ - Step 106566: {'lr': 9.894182184764358e-05, 'samples': 20460672, 'steps': 106565, 'loss/train': 1.3747105598449707} 11/07/2021 12:14:33 - INFO - __main__ - Step 106567: {'lr': 9.893759342585856e-05, 'samples': 20460864, 'steps': 106566, 'loss/train': 1.0701323747634888} 11/07/2021 12:14:34 - INFO - __main__ - Step 106568: {'lr': 9.893336507214004e-05, 'samples': 20461056, 'steps': 106567, 'loss/train': 1.254708170890808} 11/07/2021 12:14:35 - INFO - __main__ - Step 106569: {'lr': 9.892913678648972e-05, 'samples': 20461248, 'steps': 106568, 'loss/train': 1.4905035495758057} 11/07/2021 12:14:35 - INFO - __main__ - Step 106570: {'lr': 9.892490856890959e-05, 'samples': 20461440, 'steps': 106569, 'loss/train': 1.3412615060806274} 11/07/2021 12:14:35 - INFO - __main__ - Step 106571: {'lr': 9.892068041940155e-05, 'samples': 20461632, 'steps': 106570, 'loss/train': 0.07734131813049316} 11/07/2021 12:14:36 - INFO - __main__ - Step 106572: {'lr': 9.891645233796756e-05, 'samples': 20461824, 'steps': 106571, 'loss/train': 1.5028550624847412} 11/07/2021 12:14:37 - INFO - __main__ - Step 106573: {'lr': 9.891222432460947e-05, 'samples': 20462016, 'steps': 106572, 'loss/train': 0.944835364818573} 11/07/2021 12:14:37 - INFO - __main__ - Step 106574: {'lr': 9.890799637932918e-05, 'samples': 20462208, 'steps': 106573, 'loss/train': 1.4534173011779785} 11/07/2021 12:14:37 - INFO - __main__ - Step 106575: {'lr': 9.890376850212865e-05, 'samples': 20462400, 'steps': 106574, 'loss/train': 1.6150357723236084} 11/07/2021 12:14:38 - INFO - __main__ - Step 106576: {'lr': 9.889954069300971e-05, 'samples': 20462592, 'steps': 106575, 'loss/train': 1.2419025897979736} 11/07/2021 12:14:38 - INFO - __main__ - Step 106577: {'lr': 9.889531295197432e-05, 'samples': 20462784, 'steps': 106576, 'loss/train': 1.8837790489196777} 11/07/2021 12:14:39 - INFO - __main__ - Step 106578: {'lr': 9.889108527902437e-05, 'samples': 20462976, 'steps': 106577, 'loss/train': 1.3614081144332886} 11/07/2021 12:14:39 - INFO - __main__ - Step 106579: {'lr': 9.888685767416178e-05, 'samples': 20463168, 'steps': 106578, 'loss/train': 1.3929082155227661} 11/07/2021 12:14:40 - INFO - __main__ - Step 106580: {'lr': 9.888263013738843e-05, 'samples': 20463360, 'steps': 106579, 'loss/train': 1.4609944820404053} 11/07/2021 12:14:40 - INFO - __main__ - Step 106581: {'lr': 9.887840266870624e-05, 'samples': 20463552, 'steps': 106580, 'loss/train': 1.2907179594039917} 11/07/2021 12:14:40 - INFO - __main__ - Step 106582: {'lr': 9.88741752681172e-05, 'samples': 20463744, 'steps': 106581, 'loss/train': 0.8153521418571472} 11/07/2021 12:14:42 - INFO - __main__ - Step 106583: {'lr': 9.886994793562304e-05, 'samples': 20463936, 'steps': 106582, 'loss/train': 1.149641513824463} 11/07/2021 12:14:42 - INFO - __main__ - Step 106584: {'lr': 9.886572067122574e-05, 'samples': 20464128, 'steps': 106583, 'loss/train': 1.4065338373184204} 11/07/2021 12:14:42 - INFO - __main__ - Step 106585: {'lr': 9.886149347492721e-05, 'samples': 20464320, 'steps': 106584, 'loss/train': 1.3719035387039185} 11/07/2021 12:14:43 - INFO - __main__ - Step 106586: {'lr': 9.885726634672937e-05, 'samples': 20464512, 'steps': 106585, 'loss/train': 0.8750010132789612} 11/07/2021 12:14:43 - INFO - __main__ - Step 106587: {'lr': 9.885303928663411e-05, 'samples': 20464704, 'steps': 106586, 'loss/train': 1.2261841297149658} 11/07/2021 12:14:44 - INFO - __main__ - Step 106588: {'lr': 9.884881229464332e-05, 'samples': 20464896, 'steps': 106587, 'loss/train': 1.7015161514282227} 11/07/2021 12:14:44 - INFO - __main__ - Step 106589: {'lr': 9.884458537075894e-05, 'samples': 20465088, 'steps': 106588, 'loss/train': 1.5294957160949707} 11/07/2021 12:14:45 - INFO - __main__ - Step 106590: {'lr': 9.884035851498285e-05, 'samples': 20465280, 'steps': 106589, 'loss/train': 1.2235382795333862} 11/07/2021 12:14:45 - INFO - __main__ - Step 106591: {'lr': 9.883613172731698e-05, 'samples': 20465472, 'steps': 106590, 'loss/train': 1.2582917213439941} 11/07/2021 12:14:45 - INFO - __main__ - Step 106592: {'lr': 9.88319050077632e-05, 'samples': 20465664, 'steps': 106591, 'loss/train': 1.4230834245681763} 11/07/2021 12:14:46 - INFO - __main__ - Step 106593: {'lr': 9.882767835632342e-05, 'samples': 20465856, 'steps': 106592, 'loss/train': 1.5913738012313843} 11/07/2021 12:14:47 - INFO - __main__ - Step 106594: {'lr': 9.882345177299958e-05, 'samples': 20466048, 'steps': 106593, 'loss/train': 1.018288493156433} 11/07/2021 12:14:47 - INFO - __main__ - Step 106595: {'lr': 9.881922525779364e-05, 'samples': 20466240, 'steps': 106594, 'loss/train': 1.353868842124939} 11/07/2021 12:14:47 - INFO - __main__ - Step 106596: {'lr': 9.881499881070733e-05, 'samples': 20466432, 'steps': 106595, 'loss/train': 1.3440886735916138} 11/07/2021 12:14:48 - INFO - __main__ - Step 106597: {'lr': 9.881077243174266e-05, 'samples': 20466624, 'steps': 106596, 'loss/train': 0.7776225805282593} 11/07/2021 12:14:49 - INFO - __main__ - Step 106598: {'lr': 9.880654612090151e-05, 'samples': 20466816, 'steps': 106597, 'loss/train': 1.6494170427322388} 11/07/2021 12:14:49 - INFO - __main__ - Step 106599: {'lr': 9.880231987818581e-05, 'samples': 20467008, 'steps': 106598, 'loss/train': 0.9581094980239868} 11/07/2021 12:14:50 - INFO - __main__ - Step 106600: {'lr': 9.879809370359744e-05, 'samples': 20467200, 'steps': 106599, 'loss/train': 1.4733513593673706} 11/07/2021 12:14:50 - INFO - __main__ - Step 106601: {'lr': 9.879386759713833e-05, 'samples': 20467392, 'steps': 106600, 'loss/train': 1.3070825338363647} 11/07/2021 12:14:50 - INFO - __main__ - Step 106602: {'lr': 9.878964155881034e-05, 'samples': 20467584, 'steps': 106601, 'loss/train': 0.1171899139881134} 11/07/2021 12:14:51 - INFO - __main__ - Step 106603: {'lr': 9.878541558861543e-05, 'samples': 20467776, 'steps': 106602, 'loss/train': 1.4327950477600098} 11/07/2021 12:14:52 - INFO - __main__ - Step 106604: {'lr': 9.878118968655547e-05, 'samples': 20467968, 'steps': 106603, 'loss/train': 1.2387579679489136} 11/07/2021 12:14:52 - INFO - __main__ - Step 106605: {'lr': 9.877696385263238e-05, 'samples': 20468160, 'steps': 106604, 'loss/train': 0.5897341966629028} 11/07/2021 12:14:52 - INFO - __main__ - Step 106606: {'lr': 9.877273808684805e-05, 'samples': 20468352, 'steps': 106605, 'loss/train': 1.221697211265564} 11/07/2021 12:14:53 - INFO - __main__ - Step 106607: {'lr': 9.876851238920439e-05, 'samples': 20468544, 'steps': 106606, 'loss/train': 1.1679624319076538} 11/07/2021 12:14:53 - INFO - __main__ - Step 106608: {'lr': 9.876428675970331e-05, 'samples': 20468736, 'steps': 106607, 'loss/train': 0.3744039237499237} 11/07/2021 12:14:54 - INFO - __main__ - Step 106609: {'lr': 9.87600611983468e-05, 'samples': 20468928, 'steps': 106608, 'loss/train': 1.4966646432876587} 11/07/2021 12:14:54 - INFO - __main__ - Step 106610: {'lr': 9.875583570513657e-05, 'samples': 20469120, 'steps': 106609, 'loss/train': 1.3563268184661865} 11/07/2021 12:14:55 - INFO - __main__ - Step 106611: {'lr': 9.875161028007465e-05, 'samples': 20469312, 'steps': 106610, 'loss/train': 1.2957432270050049} 11/07/2021 12:14:55 - INFO - __main__ - Step 106612: {'lr': 9.87473849231629e-05, 'samples': 20469504, 'steps': 106611, 'loss/train': 0.8913089632987976} 11/07/2021 12:14:56 - INFO - __main__ - Step 106613: {'lr': 9.874315963440326e-05, 'samples': 20469696, 'steps': 106612, 'loss/train': 1.4075102806091309} 11/07/2021 12:14:56 - INFO - __main__ - Step 106614: {'lr': 9.873893441379759e-05, 'samples': 20469888, 'steps': 106613, 'loss/train': 0.994307816028595} 11/07/2021 12:14:57 - INFO - __main__ - Step 106615: {'lr': 9.873470926134783e-05, 'samples': 20470080, 'steps': 106614, 'loss/train': 1.3122174739837646} 11/07/2021 12:14:57 - INFO - __main__ - Step 106616: {'lr': 9.873048417705591e-05, 'samples': 20470272, 'steps': 106615, 'loss/train': 1.331268072128296} 11/07/2021 12:14:58 - INFO - __main__ - Step 106617: {'lr': 9.872625916092365e-05, 'samples': 20470464, 'steps': 106616, 'loss/train': 1.6004356145858765} 11/07/2021 12:14:58 - INFO - __main__ - Step 106618: {'lr': 9.872203421295304e-05, 'samples': 20470656, 'steps': 106617, 'loss/train': 1.4408891201019287} 11/07/2021 12:14:59 - INFO - __main__ - Step 106619: {'lr': 9.871780933314595e-05, 'samples': 20470848, 'steps': 106618, 'loss/train': 0.5594514608383179} 11/07/2021 12:14:59 - INFO - __main__ - Step 106620: {'lr': 9.871358452150425e-05, 'samples': 20471040, 'steps': 106619, 'loss/train': 1.3368128538131714} 11/07/2021 12:15:00 - INFO - __main__ - Step 106621: {'lr': 9.870935977802988e-05, 'samples': 20471232, 'steps': 106620, 'loss/train': 1.221927285194397} 11/07/2021 12:15:00 - INFO - __main__ - Step 106622: {'lr': 9.870513510272486e-05, 'samples': 20471424, 'steps': 106621, 'loss/train': 0.5547413229942322} 11/07/2021 12:15:00 - INFO - __main__ - Step 106623: {'lr': 9.870091049559086e-05, 'samples': 20471616, 'steps': 106622, 'loss/train': 1.1757442951202393} 11/07/2021 12:15:01 - INFO - __main__ - Step 106624: {'lr': 9.86966859566299e-05, 'samples': 20471808, 'steps': 106623, 'loss/train': 1.4771056175231934} 11/07/2021 12:15:02 - INFO - __main__ - Step 106625: {'lr': 9.869246148584385e-05, 'samples': 20472000, 'steps': 106624, 'loss/train': 1.3459234237670898} 11/07/2021 12:15:02 - INFO - __main__ - Step 106626: {'lr': 9.868823708323468e-05, 'samples': 20472192, 'steps': 106625, 'loss/train': 0.9859408140182495} 11/07/2021 12:15:02 - INFO - __main__ - Step 106627: {'lr': 9.868401274880423e-05, 'samples': 20472384, 'steps': 106626, 'loss/train': 1.2844023704528809} 11/07/2021 12:15:03 - INFO - __main__ - Step 106628: {'lr': 9.867978848255443e-05, 'samples': 20472576, 'steps': 106627, 'loss/train': 0.9855440855026245} 11/07/2021 12:15:04 - INFO - __main__ - Step 106629: {'lr': 9.86755642844872e-05, 'samples': 20472768, 'steps': 106628, 'loss/train': 0.9332770705223083} 11/07/2021 12:15:04 - INFO - __main__ - Step 106630: {'lr': 9.867134015460444e-05, 'samples': 20472960, 'steps': 106629, 'loss/train': 1.3388278484344482} 11/07/2021 12:15:04 - INFO - __main__ - Step 106631: {'lr': 9.866711609290801e-05, 'samples': 20473152, 'steps': 106630, 'loss/train': 1.314597487449646} 11/07/2021 12:15:05 - INFO - __main__ - Step 106632: {'lr': 9.866289209939986e-05, 'samples': 20473344, 'steps': 106631, 'loss/train': 0.41897666454315186} 11/07/2021 12:15:05 - INFO - __main__ - Step 106633: {'lr': 9.865866817408184e-05, 'samples': 20473536, 'steps': 106632, 'loss/train': 1.352107286453247} 11/07/2021 12:15:06 - INFO - __main__ - Step 106634: {'lr': 9.865444431695592e-05, 'samples': 20473728, 'steps': 106633, 'loss/train': 1.6553571224212646} 11/07/2021 12:15:07 - INFO - __main__ - Step 106635: {'lr': 9.865022052802405e-05, 'samples': 20473920, 'steps': 106634, 'loss/train': 0.7804974317550659} 11/07/2021 12:15:07 - INFO - __main__ - Step 106636: {'lr': 9.864599680728797e-05, 'samples': 20474112, 'steps': 106635, 'loss/train': 0.9619958400726318} 11/07/2021 12:15:07 - INFO - __main__ - Step 106637: {'lr': 9.864177315474967e-05, 'samples': 20474304, 'steps': 106636, 'loss/train': 1.4504376649856567} 11/07/2021 12:15:08 - INFO - __main__ - Step 106638: {'lr': 9.863754957041104e-05, 'samples': 20474496, 'steps': 106637, 'loss/train': 1.5570251941680908} 11/07/2021 12:15:09 - INFO - __main__ - Step 106639: {'lr': 9.863332605427402e-05, 'samples': 20474688, 'steps': 106638, 'loss/train': 2.0342302322387695} 11/07/2021 12:15:09 - INFO - __main__ - Step 106640: {'lr': 9.862910260634048e-05, 'samples': 20474880, 'steps': 106639, 'loss/train': 0.9917137622833252} 11/07/2021 12:15:10 - INFO - __main__ - Step 106641: {'lr': 9.862487922661232e-05, 'samples': 20475072, 'steps': 106640, 'loss/train': 0.889668881893158} 11/07/2021 12:15:10 - INFO - __main__ - Step 106642: {'lr': 9.862065591509145e-05, 'samples': 20475264, 'steps': 106641, 'loss/train': 1.5730468034744263} 11/07/2021 12:15:10 - INFO - __main__ - Step 106643: {'lr': 9.861643267177977e-05, 'samples': 20475456, 'steps': 106642, 'loss/train': 1.2805827856063843} 11/07/2021 12:15:11 - INFO - __main__ - Step 106644: {'lr': 9.861220949667924e-05, 'samples': 20475648, 'steps': 106643, 'loss/train': 0.6663666367530823} 11/07/2021 12:15:12 - INFO - __main__ - Step 106645: {'lr': 9.860798638979165e-05, 'samples': 20475840, 'steps': 106644, 'loss/train': 1.1124236583709717} 11/07/2021 12:15:12 - INFO - __main__ - Step 106646: {'lr': 9.860376335111901e-05, 'samples': 20476032, 'steps': 106645, 'loss/train': 1.3484867811203003} 11/07/2021 12:15:12 - INFO - __main__ - Step 106647: {'lr': 9.859954038066316e-05, 'samples': 20476224, 'steps': 106646, 'loss/train': 1.7473291158676147} 11/07/2021 12:15:13 - INFO - __main__ - Step 106648: {'lr': 9.859531747842601e-05, 'samples': 20476416, 'steps': 106647, 'loss/train': 1.392471194267273} 11/07/2021 12:15:14 - INFO - __main__ - Step 106649: {'lr': 9.859109464440957e-05, 'samples': 20476608, 'steps': 106648, 'loss/train': 1.4989867210388184} 11/07/2021 12:15:14 - INFO - __main__ - Step 106650: {'lr': 9.858687187861556e-05, 'samples': 20476800, 'steps': 106649, 'loss/train': 1.4315993785858154} 11/07/2021 12:15:15 - INFO - __main__ - Step 106651: {'lr': 9.858264918104595e-05, 'samples': 20476992, 'steps': 106650, 'loss/train': 1.2813842296600342} 11/07/2021 12:15:15 - INFO - __main__ - Step 106652: {'lr': 9.857842655170268e-05, 'samples': 20477184, 'steps': 106651, 'loss/train': 1.2125511169433594} 11/07/2021 12:15:15 - INFO - __main__ - Step 106653: {'lr': 9.857420399058764e-05, 'samples': 20477376, 'steps': 106652, 'loss/train': 1.5108530521392822} 11/07/2021 12:15:16 - INFO - __main__ - Step 106654: {'lr': 9.856998149770274e-05, 'samples': 20477568, 'steps': 106653, 'loss/train': 1.5902149677276611} 11/07/2021 12:15:17 - INFO - __main__ - Step 106655: {'lr': 9.856575907304985e-05, 'samples': 20477760, 'steps': 106654, 'loss/train': 1.5880554914474487} 11/07/2021 12:15:17 - INFO - __main__ - Step 106656: {'lr': 9.856153671663088e-05, 'samples': 20477952, 'steps': 106655, 'loss/train': 1.5357658863067627} 11/07/2021 12:15:17 - INFO - __main__ - Step 106657: {'lr': 9.855731442844775e-05, 'samples': 20478144, 'steps': 106656, 'loss/train': 1.208988070487976} 11/07/2021 12:15:18 - INFO - __main__ - Step 106658: {'lr': 9.855309220850237e-05, 'samples': 20478336, 'steps': 106657, 'loss/train': 1.6197686195373535} 11/07/2021 12:15:18 - INFO - __main__ - Step 106659: {'lr': 9.854887005679663e-05, 'samples': 20478528, 'steps': 106658, 'loss/train': 1.084691047668457} 11/07/2021 12:15:19 - INFO - __main__ - Step 106660: {'lr': 9.854464797333243e-05, 'samples': 20478720, 'steps': 106659, 'loss/train': 1.3878037929534912} 11/07/2021 12:15:19 - INFO - __main__ - Step 106661: {'lr': 9.854042595811167e-05, 'samples': 20478912, 'steps': 106660, 'loss/train': 1.2043637037277222} 11/07/2021 12:15:20 - INFO - __main__ - Step 106662: {'lr': 9.853620401113636e-05, 'samples': 20479104, 'steps': 106661, 'loss/train': 1.1817052364349365} 11/07/2021 12:15:20 - INFO - __main__ - Step 106663: {'lr': 9.853198213240819e-05, 'samples': 20479296, 'steps': 106662, 'loss/train': 1.1516095399856567} 11/07/2021 12:15:21 - INFO - __main__ - Step 106664: {'lr': 9.852776032192917e-05, 'samples': 20479488, 'steps': 106663, 'loss/train': 1.3972612619400024} 11/07/2021 12:15:21 - INFO - __main__ - Step 106665: {'lr': 9.852353857970123e-05, 'samples': 20479680, 'steps': 106664, 'loss/train': 1.2213020324707031} 11/07/2021 12:15:22 - INFO - __main__ - Step 106666: {'lr': 9.851931690572621e-05, 'samples': 20479872, 'steps': 106665, 'loss/train': 1.5424211025238037} 11/07/2021 12:15:22 - INFO - __main__ - Step 106667: {'lr': 9.851509530000607e-05, 'samples': 20480064, 'steps': 106666, 'loss/train': 1.4968587160110474} 11/07/2021 12:15:23 - INFO - __main__ - Step 106668: {'lr': 9.85108737625427e-05, 'samples': 20480256, 'steps': 106667, 'loss/train': 1.5342116355895996} 11/07/2021 12:15:23 - INFO - __main__ - Step 106669: {'lr': 9.850665229333796e-05, 'samples': 20480448, 'steps': 106668, 'loss/train': 1.1542397737503052} 11/07/2021 12:15:24 - INFO - __main__ - Step 106670: {'lr': 9.850243089239383e-05, 'samples': 20480640, 'steps': 106669, 'loss/train': 1.2562053203582764} 11/07/2021 12:15:24 - INFO - __main__ - Step 106671: {'lr': 9.849820955971214e-05, 'samples': 20480832, 'steps': 106670, 'loss/train': 1.6135478019714355} 11/07/2021 12:15:25 - INFO - __main__ - Step 106672: {'lr': 9.849398829529482e-05, 'samples': 20481024, 'steps': 106671, 'loss/train': 1.130021095275879} 11/07/2021 12:15:25 - INFO - __main__ - Step 106673: {'lr': 9.848976709914375e-05, 'samples': 20481216, 'steps': 106672, 'loss/train': 1.2999248504638672} 11/07/2021 12:15:25 - INFO - __main__ - Step 106674: {'lr': 9.848554597126088e-05, 'samples': 20481408, 'steps': 106673, 'loss/train': 1.2906755208969116} 11/07/2021 12:15:26 - INFO - __main__ - Step 106675: {'lr': 9.848132491164819e-05, 'samples': 20481600, 'steps': 106674, 'loss/train': 1.4823576211929321} 11/07/2021 12:15:27 - INFO - __main__ - Step 106676: {'lr': 9.847710392030738e-05, 'samples': 20481792, 'steps': 106675, 'loss/train': 1.6728087663650513} 11/07/2021 12:15:27 - INFO - __main__ - Step 106677: {'lr': 9.847288299724042e-05, 'samples': 20481984, 'steps': 106676, 'loss/train': 1.7930147647857666} 11/07/2021 12:15:27 - INFO - __main__ - Step 106678: {'lr': 9.846866214244926e-05, 'samples': 20482176, 'steps': 106677, 'loss/train': 1.1882784366607666} 11/07/2021 12:15:28 - INFO - __main__ - Step 106679: {'lr': 9.846444135593576e-05, 'samples': 20482368, 'steps': 106678, 'loss/train': 1.1966639757156372} 11/07/2021 12:15:29 - INFO - __main__ - Step 106680: {'lr': 9.846022063770188e-05, 'samples': 20482560, 'steps': 106679, 'loss/train': 1.1287271976470947} 11/07/2021 12:15:29 - INFO - __main__ - Step 106681: {'lr': 9.845599998774946e-05, 'samples': 20482752, 'steps': 106680, 'loss/train': 1.0973814725875854} 11/07/2021 12:15:30 - INFO - __main__ - Step 106682: {'lr': 9.845177940608044e-05, 'samples': 20482944, 'steps': 106681, 'loss/train': 1.3974648714065552} 11/07/2021 12:15:30 - INFO - __main__ - Step 106683: {'lr': 9.844755889269672e-05, 'samples': 20483136, 'steps': 106682, 'loss/train': 1.6639783382415771} 11/07/2021 12:15:30 - INFO - __main__ - Step 106684: {'lr': 9.844333844760018e-05, 'samples': 20483328, 'steps': 106683, 'loss/train': 1.5822867155075073} 11/07/2021 12:15:31 - INFO - __main__ - Step 106685: {'lr': 9.843911807079272e-05, 'samples': 20483520, 'steps': 106684, 'loss/train': 1.376253366470337} 11/07/2021 12:15:32 - INFO - __main__ - Step 106686: {'lr': 9.843489776227634e-05, 'samples': 20483712, 'steps': 106685, 'loss/train': 1.1172969341278076} 11/07/2021 12:15:33 - INFO - __main__ - Step 106687: {'lr': 9.843067752205279e-05, 'samples': 20483904, 'steps': 106686, 'loss/train': 4.491657733917236} 11/07/2021 12:15:33 - INFO - __main__ - Step 106688: {'lr': 9.842645735012404e-05, 'samples': 20484096, 'steps': 106687, 'loss/train': 4.3788323402404785} 11/07/2021 12:15:33 - INFO - __main__ - Step 106689: {'lr': 9.8422237246492e-05, 'samples': 20484288, 'steps': 106688, 'loss/train': 4.157343864440918} 11/07/2021 12:15:34 - INFO - __main__ - Step 106690: {'lr': 9.841801721115853e-05, 'samples': 20484480, 'steps': 106689, 'loss/train': 2.4821367263793945} 11/07/2021 12:15:34 - INFO - __main__ - Step 106691: {'lr': 9.841379724412556e-05, 'samples': 20484672, 'steps': 106690, 'loss/train': 1.1633317470550537} 11/07/2021 12:15:34 - INFO - __main__ - Step 106692: {'lr': 9.840957734539502e-05, 'samples': 20484864, 'steps': 106691, 'loss/train': 1.2615035772323608} 11/07/2021 12:15:36 - INFO - __main__ - Step 106693: {'lr': 9.840535751496876e-05, 'samples': 20485056, 'steps': 106692, 'loss/train': 1.0257023572921753} 11/07/2021 12:15:36 - INFO - __main__ - Step 106694: {'lr': 9.840113775284873e-05, 'samples': 20485248, 'steps': 106693, 'loss/train': 1.5009307861328125} 11/07/2021 12:15:36 - INFO - __main__ - Step 106695: {'lr': 9.839691805903678e-05, 'samples': 20485440, 'steps': 106694, 'loss/train': 1.6782392263412476} 11/07/2021 12:15:37 - INFO - __main__ - Step 106696: {'lr': 9.839269843353487e-05, 'samples': 20485632, 'steps': 106695, 'loss/train': 1.513706088066101} 11/07/2021 12:15:37 - INFO - __main__ - Step 106697: {'lr': 9.838847887634494e-05, 'samples': 20485824, 'steps': 106696, 'loss/train': 1.5356714725494385} 11/07/2021 12:15:38 - INFO - __main__ - Step 106698: {'lr': 9.838425938746875e-05, 'samples': 20486016, 'steps': 106697, 'loss/train': 5.577164649963379} 11/07/2021 12:15:38 - INFO - __main__ - Step 106699: {'lr': 9.838003996690826e-05, 'samples': 20486208, 'steps': 106698, 'loss/train': 1.4006801843643188} 11/07/2021 12:15:39 - INFO - __main__ - Step 106700: {'lr': 9.837582061466538e-05, 'samples': 20486400, 'steps': 106699, 'loss/train': 1.8468527793884277} 11/07/2021 12:15:39 - INFO - __main__ - Step 106701: {'lr': 9.837160133074202e-05, 'samples': 20486592, 'steps': 106700, 'loss/train': 1.219414472579956} 11/07/2021 12:15:39 - INFO - __main__ - Step 106702: {'lr': 9.836738211514007e-05, 'samples': 20486784, 'steps': 106701, 'loss/train': 1.4161202907562256} 11/07/2021 12:15:41 - INFO - __main__ - Step 106703: {'lr': 9.836316296786146e-05, 'samples': 20486976, 'steps': 106702, 'loss/train': 1.3526257276535034} 11/07/2021 12:15:41 - INFO - __main__ - Step 106704: {'lr': 9.835894388890807e-05, 'samples': 20487168, 'steps': 106703, 'loss/train': 1.2549223899841309} 11/07/2021 12:15:41 - INFO - __main__ - Step 106705: {'lr': 9.835472487828176e-05, 'samples': 20487360, 'steps': 106704, 'loss/train': 1.9749412536621094} 11/07/2021 12:15:42 - INFO - __main__ - Step 106706: {'lr': 9.835050593598452e-05, 'samples': 20487552, 'steps': 106705, 'loss/train': 1.174180269241333} 11/07/2021 12:15:42 - INFO - __main__ - Step 106707: {'lr': 9.834628706201817e-05, 'samples': 20487744, 'steps': 106706, 'loss/train': 1.2438604831695557} 11/07/2021 12:15:43 - INFO - __main__ - Step 106708: {'lr': 9.834206825638473e-05, 'samples': 20487936, 'steps': 106707, 'loss/train': 1.3125897645950317} 11/07/2021 12:15:44 - INFO - __main__ - Step 106709: {'lr': 9.833784951908595e-05, 'samples': 20488128, 'steps': 106708, 'loss/train': 1.464727759361267} 11/07/2021 12:15:44 - INFO - __main__ - Step 106710: {'lr': 9.833363085012376e-05, 'samples': 20488320, 'steps': 106709, 'loss/train': 1.4776155948638916} 11/07/2021 12:15:44 - INFO - __main__ - Step 106711: {'lr': 9.832941224950012e-05, 'samples': 20488512, 'steps': 106710, 'loss/train': 1.6694334745407104} 11/07/2021 12:15:45 - INFO - __main__ - Step 106712: {'lr': 9.83251937172169e-05, 'samples': 20488704, 'steps': 106711, 'loss/train': 1.3359276056289673} 11/07/2021 12:15:45 - INFO - __main__ - Step 106713: {'lr': 9.832097525327601e-05, 'samples': 20488896, 'steps': 106712, 'loss/train': 0.48581865429878235} 11/07/2021 12:15:46 - INFO - __main__ - Step 106714: {'lr': 9.831675685767935e-05, 'samples': 20489088, 'steps': 106713, 'loss/train': 0.3291153311729431} 11/07/2021 12:15:46 - INFO - __main__ - Step 106715: {'lr': 9.831253853042882e-05, 'samples': 20489280, 'steps': 106714, 'loss/train': 1.2078778743743896} 11/07/2021 12:15:47 - INFO - __main__ - Step 106716: {'lr': 9.830832027152631e-05, 'samples': 20489472, 'steps': 106715, 'loss/train': 1.5422579050064087} 11/07/2021 12:15:47 - INFO - __main__ - Step 106717: {'lr': 9.830410208097373e-05, 'samples': 20489664, 'steps': 106716, 'loss/train': 1.3047012090682983} 11/07/2021 12:15:47 - INFO - __main__ - Step 106718: {'lr': 9.829988395877299e-05, 'samples': 20489856, 'steps': 106717, 'loss/train': 1.406893253326416} 11/07/2021 12:15:49 - INFO - __main__ - Step 106719: {'lr': 9.829566590492606e-05, 'samples': 20490048, 'steps': 106718, 'loss/train': 1.1893991231918335} 11/07/2021 12:15:49 - INFO - __main__ - Step 106720: {'lr': 9.82914479194347e-05, 'samples': 20490240, 'steps': 106719, 'loss/train': 0.8458820581436157} 11/07/2021 12:15:49 - INFO - __main__ - Step 106721: {'lr': 9.828723000230083e-05, 'samples': 20490432, 'steps': 106720, 'loss/train': 1.4567793607711792} 11/07/2021 12:15:50 - INFO - __main__ - Step 106722: {'lr': 9.828301215352642e-05, 'samples': 20490624, 'steps': 106721, 'loss/train': 1.4382686614990234} 11/07/2021 12:15:50 - INFO - __main__ - Step 106723: {'lr': 9.827879437311335e-05, 'samples': 20490816, 'steps': 106722, 'loss/train': 1.3178085088729858} 11/07/2021 12:15:51 - INFO - __main__ - Step 106724: {'lr': 9.827457666106349e-05, 'samples': 20491008, 'steps': 106723, 'loss/train': 1.418015956878662} 11/07/2021 12:15:51 - INFO - __main__ - Step 106725: {'lr': 9.82703590173788e-05, 'samples': 20491200, 'steps': 106724, 'loss/train': 1.1897108554840088} 11/07/2021 12:15:52 - INFO - __main__ - Step 106726: {'lr': 9.826614144206112e-05, 'samples': 20491392, 'steps': 106725, 'loss/train': 1.6372708082199097} 11/07/2021 12:15:52 - INFO - __main__ - Step 106727: {'lr': 9.826192393511235e-05, 'samples': 20491584, 'steps': 106726, 'loss/train': 1.5238312482833862} 11/07/2021 12:15:52 - INFO - __main__ - Step 106728: {'lr': 9.825770649653446e-05, 'samples': 20491776, 'steps': 106727, 'loss/train': 1.5173434019088745} 11/07/2021 12:15:54 - INFO - __main__ - Step 106729: {'lr': 9.825348912632928e-05, 'samples': 20491968, 'steps': 106728, 'loss/train': 1.2779289484024048} 11/07/2021 12:15:54 - INFO - __main__ - Step 106730: {'lr': 9.824927182449884e-05, 'samples': 20492160, 'steps': 106729, 'loss/train': 1.3809583187103271} 11/07/2021 12:15:54 - INFO - __main__ - Step 106731: {'lr': 9.824505459104485e-05, 'samples': 20492352, 'steps': 106730, 'loss/train': 1.1698280572891235} 11/07/2021 12:15:55 - INFO - __main__ - Step 106732: {'lr': 9.824083742596929e-05, 'samples': 20492544, 'steps': 106731, 'loss/train': 1.8710256814956665} 11/07/2021 12:15:55 - INFO - __main__ - Step 106733: {'lr': 9.823662032927404e-05, 'samples': 20492736, 'steps': 106732, 'loss/train': 1.2747578620910645} 11/07/2021 12:15:56 - INFO - __main__ - Step 106734: {'lr': 9.823240330096106e-05, 'samples': 20492928, 'steps': 106733, 'loss/train': 1.430285930633545} 11/07/2021 12:15:56 - INFO - __main__ - Step 106735: {'lr': 9.822818634103223e-05, 'samples': 20493120, 'steps': 106734, 'loss/train': 1.3271198272705078} 11/07/2021 12:15:57 - INFO - __main__ - Step 106736: {'lr': 9.822396944948942e-05, 'samples': 20493312, 'steps': 106735, 'loss/train': 1.6103737354278564} 11/07/2021 12:15:57 - INFO - __main__ - Step 106737: {'lr': 9.821975262633453e-05, 'samples': 20493504, 'steps': 106736, 'loss/train': 1.4209154844284058} 11/07/2021 12:15:57 - INFO - __main__ - Step 106738: {'lr': 9.821553587156948e-05, 'samples': 20493696, 'steps': 106737, 'loss/train': 1.508849024772644} 11/07/2021 12:15:58 - INFO - __main__ - Step 106739: {'lr': 9.821131918519619e-05, 'samples': 20493888, 'steps': 106738, 'loss/train': 1.1179157495498657} 11/07/2021 12:15:59 - INFO - __main__ - Step 106740: {'lr': 9.820710256721651e-05, 'samples': 20494080, 'steps': 106739, 'loss/train': 1.1243817806243896} 11/07/2021 12:15:59 - INFO - __main__ - Step 106741: {'lr': 9.820288601763238e-05, 'samples': 20494272, 'steps': 106740, 'loss/train': 1.2982304096221924} 11/07/2021 12:15:59 - INFO - __main__ - Step 106742: {'lr': 9.81986695364457e-05, 'samples': 20494464, 'steps': 106741, 'loss/train': 0.8849895596504211} 11/07/2021 12:16:00 - INFO - __main__ - Step 106743: {'lr': 9.819445312365843e-05, 'samples': 20494656, 'steps': 106742, 'loss/train': 1.151196837425232} 11/07/2021 12:16:00 - INFO - __main__ - Step 106744: {'lr': 9.819023677927233e-05, 'samples': 20494848, 'steps': 106743, 'loss/train': 1.2739753723144531} 11/07/2021 12:16:01 - INFO - __main__ - Step 106745: {'lr': 9.818602050328934e-05, 'samples': 20495040, 'steps': 106744, 'loss/train': 1.7594584226608276} 11/07/2021 12:16:02 - INFO - __main__ - Step 106746: {'lr': 9.818180429571141e-05, 'samples': 20495232, 'steps': 106745, 'loss/train': 1.6547560691833496} 11/07/2021 12:16:02 - INFO - __main__ - Step 106747: {'lr': 9.81775881565404e-05, 'samples': 20495424, 'steps': 106746, 'loss/train': 1.636313796043396} 11/07/2021 12:16:02 - INFO - __main__ - Step 106748: {'lr': 9.817337208577823e-05, 'samples': 20495616, 'steps': 106747, 'loss/train': 1.2975438833236694} 11/07/2021 12:16:03 - INFO - __main__ - Step 106749: {'lr': 9.81691560834268e-05, 'samples': 20495808, 'steps': 106748, 'loss/train': 1.5288276672363281} 11/07/2021 12:16:04 - INFO - __main__ - Step 106750: {'lr': 9.816494014948798e-05, 'samples': 20496000, 'steps': 106749, 'loss/train': 1.274942398071289} 11/07/2021 12:16:04 - INFO - __main__ - Step 106751: {'lr': 9.816072428396375e-05, 'samples': 20496192, 'steps': 106750, 'loss/train': 1.1929404735565186} 11/07/2021 12:16:04 - INFO - __main__ - Step 106752: {'lr': 9.815650848685589e-05, 'samples': 20496384, 'steps': 106751, 'loss/train': 1.5385971069335938} 11/07/2021 12:16:05 - INFO - __main__ - Step 106753: {'lr': 9.815229275816643e-05, 'samples': 20496576, 'steps': 106752, 'loss/train': 1.0312212705612183} 11/07/2021 12:16:05 - INFO - __main__ - Step 106754: {'lr': 9.814807709789714e-05, 'samples': 20496768, 'steps': 106753, 'loss/train': 1.5489921569824219} 11/07/2021 12:16:06 - INFO - __main__ - Step 106755: {'lr': 9.814386150605001e-05, 'samples': 20496960, 'steps': 106754, 'loss/train': 0.5329238176345825} 11/07/2021 12:16:06 - INFO - __main__ - Step 106756: {'lr': 9.813964598262701e-05, 'samples': 20497152, 'steps': 106755, 'loss/train': 0.7848790287971497} 11/07/2021 12:16:07 - INFO - __main__ - Step 106757: {'lr': 9.813543052762986e-05, 'samples': 20497344, 'steps': 106756, 'loss/train': 1.0321904420852661} 11/07/2021 12:16:07 - INFO - __main__ - Step 106758: {'lr': 9.813121514106052e-05, 'samples': 20497536, 'steps': 106757, 'loss/train': 1.6618925333023071} 11/07/2021 12:16:08 - INFO - __main__ - Step 106759: {'lr': 9.812699982292092e-05, 'samples': 20497728, 'steps': 106758, 'loss/train': 1.2350926399230957} 11/07/2021 12:16:09 - INFO - __main__ - Step 106760: {'lr': 9.812278457321294e-05, 'samples': 20497920, 'steps': 106759, 'loss/train': 1.4274678230285645} 11/07/2021 12:16:09 - INFO - __main__ - Step 106761: {'lr': 9.81185693919385e-05, 'samples': 20498112, 'steps': 106760, 'loss/train': 1.4772779941558838} 11/07/2021 12:16:09 - INFO - __main__ - Step 106762: {'lr': 9.811435427909951e-05, 'samples': 20498304, 'steps': 106761, 'loss/train': 1.3667991161346436} 11/07/2021 12:16:10 - INFO - __main__ - Step 106763: {'lr': 9.811013923469781e-05, 'samples': 20498496, 'steps': 106762, 'loss/train': 1.2696985006332397} 11/07/2021 12:16:10 - INFO - __main__ - Step 106764: {'lr': 9.810592425873535e-05, 'samples': 20498688, 'steps': 106763, 'loss/train': 1.4604235887527466} 11/07/2021 12:16:10 - INFO - __main__ - Step 106765: {'lr': 9.810170935121404e-05, 'samples': 20498880, 'steps': 106764, 'loss/train': 1.221975564956665} 11/07/2021 12:16:11 - INFO - __main__ - Step 106766: {'lr': 9.809749451213573e-05, 'samples': 20499072, 'steps': 106765, 'loss/train': 1.5322445631027222} 11/07/2021 12:16:12 - INFO - __main__ - Step 106767: {'lr': 9.809327974150234e-05, 'samples': 20499264, 'steps': 106766, 'loss/train': 1.2879159450531006} 11/07/2021 12:16:12 - INFO - __main__ - Step 106768: {'lr': 9.808906503931577e-05, 'samples': 20499456, 'steps': 106767, 'loss/train': 1.4359856843948364} 11/07/2021 12:16:12 - INFO - __main__ - Step 106769: {'lr': 9.808485040557797e-05, 'samples': 20499648, 'steps': 106768, 'loss/train': 1.4224284887313843} 11/07/2021 12:16:13 - INFO - __main__ - Step 106770: {'lr': 9.808063584029084e-05, 'samples': 20499840, 'steps': 106769, 'loss/train': 1.2833278179168701} 11/07/2021 12:16:14 - INFO - __main__ - Step 106771: {'lr': 9.807642134345615e-05, 'samples': 20500032, 'steps': 106770, 'loss/train': 1.6892458200454712} 11/07/2021 12:16:14 - INFO - __main__ - Step 106772: {'lr': 9.807220691507587e-05, 'samples': 20500224, 'steps': 106771, 'loss/train': 1.359013557434082} 11/07/2021 12:16:15 - INFO - __main__ - Step 106773: {'lr': 9.806799255515194e-05, 'samples': 20500416, 'steps': 106772, 'loss/train': 1.3059706687927246} 11/07/2021 12:16:15 - INFO - __main__ - Step 106774: {'lr': 9.80637782636862e-05, 'samples': 20500608, 'steps': 106773, 'loss/train': 1.227962851524353} 11/07/2021 12:16:15 - INFO - __main__ - Step 106775: {'lr': 9.80595640406806e-05, 'samples': 20500800, 'steps': 106774, 'loss/train': 1.3363006114959717} 11/07/2021 12:16:16 - INFO - __main__ - Step 106776: {'lr': 9.805534988613699e-05, 'samples': 20500992, 'steps': 106775, 'loss/train': 1.3897607326507568} 11/07/2021 12:16:17 - INFO - __main__ - Step 106777: {'lr': 9.805113580005732e-05, 'samples': 20501184, 'steps': 106776, 'loss/train': 0.8454599976539612} 11/07/2021 12:16:17 - INFO - __main__ - Step 106778: {'lr': 9.804692178244345e-05, 'samples': 20501376, 'steps': 106777, 'loss/train': 0.11990431696176529} 11/07/2021 12:16:17 - INFO - __main__ - Step 106779: {'lr': 9.804270783329727e-05, 'samples': 20501568, 'steps': 106778, 'loss/train': 1.018586277961731} 11/07/2021 12:16:18 - INFO - __main__ - Step 106780: {'lr': 9.803849395262073e-05, 'samples': 20501760, 'steps': 106779, 'loss/train': 1.1111159324645996} 11/07/2021 12:16:19 - INFO - __main__ - Step 106781: {'lr': 9.803428014041571e-05, 'samples': 20501952, 'steps': 106780, 'loss/train': 1.20730459690094} 11/07/2021 12:16:19 - INFO - __main__ - Step 106782: {'lr': 9.803006639668407e-05, 'samples': 20502144, 'steps': 106781, 'loss/train': 0.8993940353393555} 11/07/2021 12:16:20 - INFO - __main__ - Step 106783: {'lr': 9.802585272142784e-05, 'samples': 20502336, 'steps': 106782, 'loss/train': 0.6335535645484924} 11/07/2021 12:16:20 - INFO - __main__ - Step 106784: {'lr': 9.802163911464873e-05, 'samples': 20502528, 'steps': 106783, 'loss/train': 1.0665385723114014} 11/07/2021 12:16:20 - INFO - __main__ - Step 106785: {'lr': 9.801742557634872e-05, 'samples': 20502720, 'steps': 106784, 'loss/train': 1.203334927558899} 11/07/2021 12:16:22 - INFO - __main__ - Step 106786: {'lr': 9.801321210652973e-05, 'samples': 20502912, 'steps': 106785, 'loss/train': 0.1413404941558838} 11/07/2021 12:16:22 - INFO - __main__ - Step 106787: {'lr': 9.800899870519362e-05, 'samples': 20503104, 'steps': 106786, 'loss/train': 1.352601408958435} 11/07/2021 12:16:22 - INFO - __main__ - Step 106788: {'lr': 9.800478537234231e-05, 'samples': 20503296, 'steps': 106787, 'loss/train': 0.922402024269104} 11/07/2021 12:16:23 - INFO - __main__ - Step 106789: {'lr': 9.800057210797769e-05, 'samples': 20503488, 'steps': 106788, 'loss/train': 1.0772035121917725} 11/07/2021 12:16:23 - INFO - __main__ - Step 106790: {'lr': 9.799635891210167e-05, 'samples': 20503680, 'steps': 106789, 'loss/train': 1.5417747497558594} 11/07/2021 12:16:24 - INFO - __main__ - Step 106791: {'lr': 9.799214578471616e-05, 'samples': 20503872, 'steps': 106790, 'loss/train': 0.6818938851356506} 11/07/2021 12:16:24 - INFO - __main__ - Step 106792: {'lr': 9.798793272582305e-05, 'samples': 20504064, 'steps': 106791, 'loss/train': 1.5350090265274048} 11/07/2021 12:16:25 - INFO - __main__ - Step 106793: {'lr': 9.798371973542419e-05, 'samples': 20504256, 'steps': 106792, 'loss/train': 1.690872311592102} 11/07/2021 12:16:25 - INFO - __main__ - Step 106794: {'lr': 9.797950681352155e-05, 'samples': 20504448, 'steps': 106793, 'loss/train': 1.2616868019104004} 11/07/2021 12:16:25 - INFO - __main__ - Step 106795: {'lr': 9.7975293960117e-05, 'samples': 20504640, 'steps': 106794, 'loss/train': 1.5465337038040161} 11/07/2021 12:16:26 - INFO - __main__ - Step 106796: {'lr': 9.797108117521244e-05, 'samples': 20504832, 'steps': 106795, 'loss/train': 1.2056523561477661} 11/07/2021 12:16:27 - INFO - __main__ - Step 106797: {'lr': 9.796686845880983e-05, 'samples': 20505024, 'steps': 106796, 'loss/train': 0.8592513799667358} 11/07/2021 12:16:27 - INFO - __main__ - Step 106798: {'lr': 9.796265581091096e-05, 'samples': 20505216, 'steps': 106797, 'loss/train': 1.5724087953567505} 11/07/2021 12:16:27 - INFO - __main__ - Step 106799: {'lr': 9.795844323151773e-05, 'samples': 20505408, 'steps': 106798, 'loss/train': 1.178631067276001} 11/07/2021 12:16:28 - INFO - __main__ - Step 106800: {'lr': 9.795423072063208e-05, 'samples': 20505600, 'steps': 106799, 'loss/train': 1.2282689809799194} 11/07/2021 12:16:28 - INFO - __main__ - Step 106801: {'lr': 9.795001827825595e-05, 'samples': 20505792, 'steps': 106800, 'loss/train': 0.9612773656845093} 11/07/2021 12:16:29 - INFO - __main__ - Step 106802: {'lr': 9.794580590439114e-05, 'samples': 20505984, 'steps': 106801, 'loss/train': 1.2105857133865356} 11/07/2021 12:16:30 - INFO - __main__ - Step 106803: {'lr': 9.794159359903965e-05, 'samples': 20506176, 'steps': 106802, 'loss/train': 1.493708610534668} 11/07/2021 12:16:30 - INFO - __main__ - Step 106804: {'lr': 9.793738136220329e-05, 'samples': 20506368, 'steps': 106803, 'loss/train': 1.3209675550460815} 11/07/2021 12:16:30 - INFO - __main__ - Step 106805: {'lr': 9.793316919388404e-05, 'samples': 20506560, 'steps': 106804, 'loss/train': 1.413799524307251} 11/07/2021 12:16:31 - INFO - __main__ - Step 106806: {'lr': 9.792895709408373e-05, 'samples': 20506752, 'steps': 106805, 'loss/train': 1.3354763984680176} 11/07/2021 12:16:32 - INFO - __main__ - Step 106807: {'lr': 9.79247450628043e-05, 'samples': 20506944, 'steps': 106806, 'loss/train': 1.2102240324020386} 11/07/2021 12:16:32 - INFO - __main__ - Step 106808: {'lr': 9.792053310004761e-05, 'samples': 20507136, 'steps': 106807, 'loss/train': 1.0475244522094727} 11/07/2021 12:16:32 - INFO - __main__ - Step 106809: {'lr': 9.791632120581558e-05, 'samples': 20507328, 'steps': 106808, 'loss/train': 1.2339308261871338} 11/07/2021 12:16:33 - INFO - __main__ - Step 106810: {'lr': 9.791210938011021e-05, 'samples': 20507520, 'steps': 106809, 'loss/train': 1.2069374322891235} 11/07/2021 12:16:33 - INFO - __main__ - Step 106811: {'lr': 9.790789762293323e-05, 'samples': 20507712, 'steps': 106810, 'loss/train': 1.3504732847213745} 11/07/2021 12:16:34 - INFO - __main__ - Step 106812: {'lr': 9.790368593428659e-05, 'samples': 20507904, 'steps': 106811, 'loss/train': 1.4461439847946167} 11/07/2021 12:16:35 - INFO - __main__ - Step 106813: {'lr': 9.78994743141722e-05, 'samples': 20508096, 'steps': 106812, 'loss/train': 1.3333327770233154} 11/07/2021 12:16:35 - INFO - __main__ - Step 106814: {'lr': 9.789526276259194e-05, 'samples': 20508288, 'steps': 106813, 'loss/train': 1.2014070749282837} 11/07/2021 12:16:35 - INFO - __main__ - Step 106815: {'lr': 9.789105127954775e-05, 'samples': 20508480, 'steps': 106814, 'loss/train': 1.3865147829055786} 11/07/2021 12:16:36 - INFO - __main__ - Step 106816: {'lr': 9.788683986504152e-05, 'samples': 20508672, 'steps': 106815, 'loss/train': 1.9176790714263916} 11/07/2021 12:16:37 - INFO - __main__ - Step 106817: {'lr': 9.788262851907512e-05, 'samples': 20508864, 'steps': 106816, 'loss/train': 1.6487501859664917} 11/07/2021 12:16:37 - INFO - __main__ - Step 106818: {'lr': 9.787841724165045e-05, 'samples': 20509056, 'steps': 106817, 'loss/train': 1.2179760932922363} 11/07/2021 12:16:37 - INFO - __main__ - Step 106819: {'lr': 9.787420603276942e-05, 'samples': 20509248, 'steps': 106818, 'loss/train': 1.4517138004302979} 11/07/2021 12:16:38 - INFO - __main__ - Step 106820: {'lr': 9.786999489243392e-05, 'samples': 20509440, 'steps': 106819, 'loss/train': 1.1827870607376099} 11/07/2021 12:16:38 - INFO - __main__ - Step 106821: {'lr': 9.786578382064587e-05, 'samples': 20509632, 'steps': 106820, 'loss/train': 1.6803207397460938} 11/07/2021 12:16:39 - INFO - __main__ - Step 106822: {'lr': 9.786157281740712e-05, 'samples': 20509824, 'steps': 106821, 'loss/train': 1.3745743036270142} 11/07/2021 12:16:39 - INFO - __main__ - Step 106823: {'lr': 9.785736188271963e-05, 'samples': 20510016, 'steps': 106822, 'loss/train': 1.3926290273666382} 11/07/2021 12:16:40 - INFO - __main__ - Step 106824: {'lr': 9.785315101658535e-05, 'samples': 20510208, 'steps': 106823, 'loss/train': 1.353623628616333} 11/07/2021 12:16:40 - INFO - __main__ - Step 106825: {'lr': 9.784894021900598e-05, 'samples': 20510400, 'steps': 106824, 'loss/train': 1.0223565101623535} 11/07/2021 12:16:40 - INFO - __main__ - Step 106826: {'lr': 9.784472948998355e-05, 'samples': 20510592, 'steps': 106825, 'loss/train': 1.444017767906189} 11/07/2021 12:16:41 - INFO - __main__ - Step 106827: {'lr': 9.784051882951994e-05, 'samples': 20510784, 'steps': 106826, 'loss/train': 1.2453289031982422} 11/07/2021 12:16:42 - INFO - __main__ - Step 106828: {'lr': 9.783630823761702e-05, 'samples': 20510976, 'steps': 106827, 'loss/train': 1.4784539937973022} 11/07/2021 12:16:42 - INFO - __main__ - Step 106829: {'lr': 9.783209771427673e-05, 'samples': 20511168, 'steps': 106828, 'loss/train': 1.6270039081573486} 11/07/2021 12:16:43 - INFO - __main__ - Step 106830: {'lr': 9.782788725950095e-05, 'samples': 20511360, 'steps': 106829, 'loss/train': 1.7976009845733643} 11/07/2021 12:16:43 - INFO - __main__ - Step 106831: {'lr': 9.782367687329158e-05, 'samples': 20511552, 'steps': 106830, 'loss/train': 0.99161297082901} 11/07/2021 12:16:44 - INFO - __main__ - Step 106832: {'lr': 9.78194665556505e-05, 'samples': 20511744, 'steps': 106831, 'loss/train': 1.265519380569458} 11/07/2021 12:16:44 - INFO - __main__ - Step 106833: {'lr': 9.781525630657964e-05, 'samples': 20511936, 'steps': 106832, 'loss/train': 1.731584072113037} 11/07/2021 12:16:45 - INFO - __main__ - Step 106834: {'lr': 9.781104612608085e-05, 'samples': 20512128, 'steps': 106833, 'loss/train': 1.1366450786590576} 11/07/2021 12:16:45 - INFO - __main__ - Step 106835: {'lr': 9.780683601415607e-05, 'samples': 20512320, 'steps': 106834, 'loss/train': 1.4100940227508545} 11/07/2021 12:16:45 - INFO - __main__ - Step 106836: {'lr': 9.780262597080717e-05, 'samples': 20512512, 'steps': 106835, 'loss/train': 1.2374188899993896} 11/07/2021 12:16:46 - INFO - __main__ - Step 106837: {'lr': 9.779841599603618e-05, 'samples': 20512704, 'steps': 106836, 'loss/train': 1.4882562160491943} 11/07/2021 12:16:47 - INFO - __main__ - Step 106838: {'lr': 9.779420608984474e-05, 'samples': 20512896, 'steps': 106837, 'loss/train': 1.4105747938156128} 11/07/2021 12:16:47 - INFO - __main__ - Step 106839: {'lr': 9.778999625223489e-05, 'samples': 20513088, 'steps': 106838, 'loss/train': 1.2533472776412964} 11/07/2021 12:16:47 - INFO - __main__ - Step 106840: {'lr': 9.778578648320854e-05, 'samples': 20513280, 'steps': 106839, 'loss/train': 1.2132883071899414} 11/07/2021 12:16:48 - INFO - __main__ - Step 106841: {'lr': 9.778157678276756e-05, 'samples': 20513472, 'steps': 106840, 'loss/train': 1.6671408414840698} 11/07/2021 12:16:48 - INFO - __main__ - Step 106842: {'lr': 9.777736715091385e-05, 'samples': 20513664, 'steps': 106841, 'loss/train': 0.9929407835006714} 11/07/2021 12:16:49 - INFO - __main__ - Step 106843: {'lr': 9.777315758764932e-05, 'samples': 20513856, 'steps': 106842, 'loss/train': 0.5898365378379822} 11/07/2021 12:16:49 - INFO - __main__ - Step 106844: {'lr': 9.776894809297585e-05, 'samples': 20514048, 'steps': 106843, 'loss/train': 0.9302205443382263} 11/07/2021 12:16:50 - INFO - __main__ - Step 106845: {'lr': 9.776473866689533e-05, 'samples': 20514240, 'steps': 106844, 'loss/train': 1.2586772441864014} 11/07/2021 12:16:50 - INFO - __main__ - Step 106846: {'lr': 9.776052930940968e-05, 'samples': 20514432, 'steps': 106845, 'loss/train': 1.1666369438171387} 11/07/2021 12:16:51 - INFO - __main__ - Step 106847: {'lr': 9.775632002052079e-05, 'samples': 20514624, 'steps': 106846, 'loss/train': 1.4472200870513916} 11/07/2021 12:16:52 - INFO - __main__ - Step 106848: {'lr': 9.775211080023056e-05, 'samples': 20514816, 'steps': 106847, 'loss/train': 1.465148687362671} 11/07/2021 12:16:52 - INFO - __main__ - Step 106849: {'lr': 9.774790164854086e-05, 'samples': 20515008, 'steps': 106848, 'loss/train': 1.2930617332458496} 11/07/2021 12:16:52 - INFO - __main__ - Step 106850: {'lr': 9.77436925654536e-05, 'samples': 20515200, 'steps': 106849, 'loss/train': 2.5331575870513916} 11/07/2021 12:16:53 - INFO - __main__ - Step 106851: {'lr': 9.773948355097078e-05, 'samples': 20515392, 'steps': 106850, 'loss/train': 0.7574418187141418} 11/07/2021 12:16:53 - INFO - __main__ - Step 106852: {'lr': 9.773527460509413e-05, 'samples': 20515584, 'steps': 106851, 'loss/train': 1.3895485401153564} 11/07/2021 12:16:54 - INFO - __main__ - Step 106853: {'lr': 9.773106572782558e-05, 'samples': 20515776, 'steps': 106852, 'loss/train': 1.6088244915008545} 11/07/2021 12:16:54 - INFO - __main__ - Step 106854: {'lr': 9.77268569191671e-05, 'samples': 20515968, 'steps': 106853, 'loss/train': 1.4229179620742798} 11/07/2021 12:16:55 - INFO - __main__ - Step 106855: {'lr': 9.772264817912052e-05, 'samples': 20516160, 'steps': 106854, 'loss/train': 1.6988755464553833} 11/07/2021 12:16:55 - INFO - __main__ - Step 106856: {'lr': 9.77184395076878e-05, 'samples': 20516352, 'steps': 106855, 'loss/train': 1.5798333883285522} 11/07/2021 12:16:55 - INFO - __main__ - Step 106857: {'lr': 9.771423090487078e-05, 'samples': 20516544, 'steps': 106856, 'loss/train': 1.392296552658081} 11/07/2021 12:16:56 - INFO - __main__ - Step 106858: {'lr': 9.771002237067137e-05, 'samples': 20516736, 'steps': 106857, 'loss/train': 1.504844307899475} 11/07/2021 12:16:57 - INFO - __main__ - Step 106859: {'lr': 9.770581390509148e-05, 'samples': 20516928, 'steps': 106858, 'loss/train': 1.0795012712478638} 11/07/2021 12:16:57 - INFO - __main__ - Step 106860: {'lr': 9.770160550813298e-05, 'samples': 20517120, 'steps': 106859, 'loss/train': 1.7493014335632324} 11/07/2021 12:16:57 - INFO - __main__ - Step 106861: {'lr': 9.769739717979781e-05, 'samples': 20517312, 'steps': 106860, 'loss/train': 1.3657678365707397} 11/07/2021 12:16:58 - INFO - __main__ - Step 106862: {'lr': 9.769318892008785e-05, 'samples': 20517504, 'steps': 106861, 'loss/train': 0.8868981599807739} 11/07/2021 12:16:59 - INFO - __main__ - Step 106863: {'lr': 9.768898072900498e-05, 'samples': 20517696, 'steps': 106862, 'loss/train': 1.0807743072509766} 11/07/2021 12:16:59 - INFO - __main__ - Step 106864: {'lr': 9.768477260655117e-05, 'samples': 20517888, 'steps': 106863, 'loss/train': 1.0928493738174438} 11/07/2021 12:17:00 - INFO - __main__ - Step 106865: {'lr': 9.76805645527282e-05, 'samples': 20518080, 'steps': 106864, 'loss/train': 1.3322678804397583} 11/07/2021 12:17:00 - INFO - __main__ - Step 106866: {'lr': 9.7676356567538e-05, 'samples': 20518272, 'steps': 106865, 'loss/train': 1.4284546375274658} 11/07/2021 12:17:00 - INFO - __main__ - Step 106867: {'lr': 9.767214865098248e-05, 'samples': 20518464, 'steps': 106866, 'loss/train': 1.3317996263504028} 11/07/2021 12:17:01 - INFO - __main__ - Step 106868: {'lr': 9.766794080306354e-05, 'samples': 20518656, 'steps': 106867, 'loss/train': 2.1348354816436768} 11/07/2021 12:17:02 - INFO - __main__ - Step 106869: {'lr': 9.766373302378309e-05, 'samples': 20518848, 'steps': 106868, 'loss/train': 1.492166519165039} 11/07/2021 12:17:02 - INFO - __main__ - Step 106870: {'lr': 9.765952531314301e-05, 'samples': 20519040, 'steps': 106869, 'loss/train': 1.23270845413208} 11/07/2021 12:17:02 - INFO - __main__ - Step 106871: {'lr': 9.76553176711452e-05, 'samples': 20519232, 'steps': 106870, 'loss/train': 1.009442687034607} 11/07/2021 12:17:03 - INFO - __main__ - Step 106872: {'lr': 9.765111009779151e-05, 'samples': 20519424, 'steps': 106871, 'loss/train': 1.700212001800537} 11/07/2021 12:17:04 - INFO - __main__ - Step 106873: {'lr': 9.764690259308393e-05, 'samples': 20519616, 'steps': 106872, 'loss/train': 1.3574376106262207} 11/07/2021 12:17:04 - INFO - __main__ - Step 106874: {'lr': 9.764269515702425e-05, 'samples': 20519808, 'steps': 106873, 'loss/train': 1.0813391208648682} 11/07/2021 12:17:04 - INFO - __main__ - Step 106875: {'lr': 9.763848778961449e-05, 'samples': 20520000, 'steps': 106874, 'loss/train': 1.3764653205871582} 11/07/2021 12:17:05 - INFO - __main__ - Step 106876: {'lr': 9.763428049085643e-05, 'samples': 20520192, 'steps': 106875, 'loss/train': 1.0517008304595947} 11/07/2021 12:17:05 - INFO - __main__ - Step 106877: {'lr': 9.76300732607521e-05, 'samples': 20520384, 'steps': 106876, 'loss/train': 0.931036651134491} 11/07/2021 12:17:06 - INFO - __main__ - Step 106878: {'lr': 9.762586609930324e-05, 'samples': 20520576, 'steps': 106877, 'loss/train': 1.339484691619873} 11/07/2021 12:17:07 - INFO - __main__ - Step 106879: {'lr': 9.762165900651179e-05, 'samples': 20520768, 'steps': 106878, 'loss/train': 2.0166056156158447} 11/07/2021 12:17:07 - INFO - __main__ - Step 106880: {'lr': 9.76174519823797e-05, 'samples': 20520960, 'steps': 106879, 'loss/train': 1.5992014408111572} 11/07/2021 12:17:07 - INFO - __main__ - Step 106881: {'lr': 9.76132450269088e-05, 'samples': 20521152, 'steps': 106880, 'loss/train': 2.5431268215179443} 11/07/2021 12:17:08 - INFO - __main__ - Step 106882: {'lr': 9.760903814010106e-05, 'samples': 20521344, 'steps': 106881, 'loss/train': 1.0503690242767334} 11/07/2021 12:17:08 - INFO - __main__ - Step 106883: {'lr': 9.76048313219583e-05, 'samples': 20521536, 'steps': 106882, 'loss/train': 1.4854764938354492} 11/07/2021 12:17:09 - INFO - __main__ - Step 106884: {'lr': 9.760062457248248e-05, 'samples': 20521728, 'steps': 106883, 'loss/train': 1.5432064533233643} 11/07/2021 12:17:09 - INFO - __main__ - Step 106885: {'lr': 9.759641789167545e-05, 'samples': 20521920, 'steps': 106884, 'loss/train': 1.3619946241378784} 11/07/2021 12:17:10 - INFO - __main__ - Step 106886: {'lr': 9.759221127953912e-05, 'samples': 20522112, 'steps': 106885, 'loss/train': 1.3334503173828125} 11/07/2021 12:17:10 - INFO - __main__ - Step 106887: {'lr': 9.758800473607537e-05, 'samples': 20522304, 'steps': 106886, 'loss/train': 1.0566767454147339} 11/07/2021 12:17:10 - INFO - __main__ - Step 106888: {'lr': 9.758379826128616e-05, 'samples': 20522496, 'steps': 106887, 'loss/train': 1.467769980430603} 11/07/2021 12:17:12 - INFO - __main__ - Step 106889: {'lr': 9.757959185517332e-05, 'samples': 20522688, 'steps': 106888, 'loss/train': 1.4068595170974731} 11/07/2021 12:17:12 - INFO - __main__ - Step 106890: {'lr': 9.757538551773876e-05, 'samples': 20522880, 'steps': 106889, 'loss/train': 1.72011137008667} 11/07/2021 12:17:12 - INFO - __main__ - Step 106891: {'lr': 9.757117924898445e-05, 'samples': 20523072, 'steps': 106890, 'loss/train': 1.3485817909240723} 11/07/2021 12:17:13 - INFO - __main__ - Step 106892: {'lr': 9.756697304891215e-05, 'samples': 20523264, 'steps': 106891, 'loss/train': 1.2596744298934937} 11/07/2021 12:17:13 - INFO - __main__ - Step 106893: {'lr': 9.75627669175238e-05, 'samples': 20523456, 'steps': 106892, 'loss/train': 1.020559310913086} 11/07/2021 12:17:14 - INFO - __main__ - Step 106894: {'lr': 9.755856085482134e-05, 'samples': 20523648, 'steps': 106893, 'loss/train': 1.4202744960784912} 11/07/2021 12:17:14 - INFO - __main__ - Step 106895: {'lr': 9.755435486080663e-05, 'samples': 20523840, 'steps': 106894, 'loss/train': 1.4934031963348389} 11/07/2021 12:17:15 - INFO - __main__ - Step 106896: {'lr': 9.755014893548156e-05, 'samples': 20524032, 'steps': 106895, 'loss/train': 1.4622217416763306} 11/07/2021 12:17:15 - INFO - __main__ - Step 106897: {'lr': 9.754594307884806e-05, 'samples': 20524224, 'steps': 106896, 'loss/train': 1.0765550136566162} 11/07/2021 12:17:15 - INFO - __main__ - Step 106898: {'lr': 9.754173729090798e-05, 'samples': 20524416, 'steps': 106897, 'loss/train': 1.3558101654052734} 11/07/2021 12:17:16 - INFO - __main__ - Step 106899: {'lr': 9.753753157166329e-05, 'samples': 20524608, 'steps': 106898, 'loss/train': 1.6831390857696533} 11/07/2021 12:17:17 - INFO - __main__ - Step 106900: {'lr': 9.753332592111577e-05, 'samples': 20524800, 'steps': 106899, 'loss/train': 0.8236832022666931} 11/07/2021 12:17:17 - INFO - __main__ - Step 106901: {'lr': 9.752912033926742e-05, 'samples': 20524992, 'steps': 106900, 'loss/train': 1.3133220672607422} 11/07/2021 12:17:17 - INFO - __main__ - Step 106902: {'lr': 9.752491482612011e-05, 'samples': 20525184, 'steps': 106901, 'loss/train': 1.8823071718215942} 11/07/2021 12:17:18 - INFO - __main__ - Step 106903: {'lr': 9.75207093816757e-05, 'samples': 20525376, 'steps': 106902, 'loss/train': 1.575466275215149} 11/07/2021 12:17:19 - INFO - __main__ - Step 106904: {'lr': 9.751650400593621e-05, 'samples': 20525568, 'steps': 106903, 'loss/train': 1.4213610887527466} 11/07/2021 12:17:19 - INFO - __main__ - Step 106905: {'lr': 9.75122986989033e-05, 'samples': 20525760, 'steps': 106904, 'loss/train': 1.4164226055145264} 11/07/2021 12:17:20 - INFO - __main__ - Step 106906: {'lr': 9.750809346057904e-05, 'samples': 20525952, 'steps': 106905, 'loss/train': 1.2196232080459595} 11/07/2021 12:17:20 - INFO - __main__ - Step 106907: {'lr': 9.750388829096524e-05, 'samples': 20526144, 'steps': 106906, 'loss/train': 1.1823532581329346} 11/07/2021 12:17:20 - INFO - __main__ - Step 106908: {'lr': 9.749968319006386e-05, 'samples': 20526336, 'steps': 106907, 'loss/train': 1.5777901411056519} 11/07/2021 12:17:21 - INFO - __main__ - Step 106909: {'lr': 9.749547815787677e-05, 'samples': 20526528, 'steps': 106908, 'loss/train': 1.4120023250579834} 11/07/2021 12:17:22 - INFO - __main__ - Step 106910: {'lr': 9.749127319440588e-05, 'samples': 20526720, 'steps': 106909, 'loss/train': 1.1770962476730347} 11/07/2021 12:17:22 - INFO - __main__ - Step 106911: {'lr': 9.748706829965304e-05, 'samples': 20526912, 'steps': 106910, 'loss/train': 1.3104737997055054} 11/07/2021 12:17:22 - INFO - __main__ - Step 106912: {'lr': 9.748286347362017e-05, 'samples': 20527104, 'steps': 106911, 'loss/train': 0.6715050935745239} 11/07/2021 12:17:23 - INFO - __main__ - Step 106913: {'lr': 9.747865871630918e-05, 'samples': 20527296, 'steps': 106912, 'loss/train': 0.5509892702102661} 11/07/2021 12:17:24 - INFO - __main__ - Step 106914: {'lr': 9.747445402772195e-05, 'samples': 20527488, 'steps': 106913, 'loss/train': 1.1029411554336548} 11/07/2021 12:17:24 - INFO - __main__ - Step 106915: {'lr': 9.74702494078604e-05, 'samples': 20527680, 'steps': 106914, 'loss/train': 1.3999745845794678} 11/07/2021 12:17:25 - INFO - __main__ - Step 106916: {'lr': 9.746604485672638e-05, 'samples': 20527872, 'steps': 106915, 'loss/train': 1.6389156579971313} 11/07/2021 12:17:25 - INFO - __main__ - Step 106917: {'lr': 9.746184037432182e-05, 'samples': 20528064, 'steps': 106916, 'loss/train': 1.1924619674682617} 11/07/2021 12:17:25 - INFO - __main__ - Step 106918: {'lr': 9.745763596064866e-05, 'samples': 20528256, 'steps': 106917, 'loss/train': 1.0601948499679565} 11/07/2021 12:17:26 - INFO - __main__ - Step 106919: {'lr': 9.745343161570869e-05, 'samples': 20528448, 'steps': 106918, 'loss/train': 1.6713323593139648} 11/07/2021 12:17:27 - INFO - __main__ - Step 106920: {'lr': 9.744922733950381e-05, 'samples': 20528640, 'steps': 106919, 'loss/train': 1.230093240737915} 11/07/2021 12:17:27 - INFO - __main__ - Step 106921: {'lr': 9.744502313203596e-05, 'samples': 20528832, 'steps': 106920, 'loss/train': 1.4393728971481323} 11/07/2021 12:17:27 - INFO - __main__ - Step 106922: {'lr': 9.744081899330707e-05, 'samples': 20529024, 'steps': 106921, 'loss/train': 1.3886727094650269} 11/07/2021 12:17:28 - INFO - __main__ - Step 106923: {'lr': 9.743661492331895e-05, 'samples': 20529216, 'steps': 106922, 'loss/train': 1.4304143190383911} 11/07/2021 12:17:28 - INFO - __main__ - Step 106924: {'lr': 9.743241092207356e-05, 'samples': 20529408, 'steps': 106923, 'loss/train': 1.7008384466171265} 11/07/2021 12:17:29 - INFO - __main__ - Step 106925: {'lr': 9.742820698957274e-05, 'samples': 20529600, 'steps': 106924, 'loss/train': 1.5282024145126343} 11/07/2021 12:17:30 - INFO - __main__ - Step 106926: {'lr': 9.742400312581845e-05, 'samples': 20529792, 'steps': 106925, 'loss/train': 1.7700902223587036} 11/07/2021 12:17:30 - INFO - __main__ - Step 106927: {'lr': 9.741979933081251e-05, 'samples': 20529984, 'steps': 106926, 'loss/train': 1.484237551689148} 11/07/2021 12:17:30 - INFO - __main__ - Step 106928: {'lr': 9.74155956045569e-05, 'samples': 20530176, 'steps': 106927, 'loss/train': 1.0962918996810913} 11/07/2021 12:17:31 - INFO - __main__ - Step 106929: {'lr': 9.741139194705345e-05, 'samples': 20530368, 'steps': 106928, 'loss/train': 1.6525167226791382} 11/07/2021 12:17:32 - INFO - __main__ - Step 106930: {'lr': 9.740718835830406e-05, 'samples': 20530560, 'steps': 106929, 'loss/train': 0.2099197655916214} 11/07/2021 12:17:32 - INFO - __main__ - Step 106931: {'lr': 9.740298483831073e-05, 'samples': 20530752, 'steps': 106930, 'loss/train': 1.2841405868530273} 11/07/2021 12:17:32 - INFO - __main__ - Step 106932: {'lr': 9.739878138707517e-05, 'samples': 20530944, 'steps': 106931, 'loss/train': 0.5377114415168762} 11/07/2021 12:17:33 - INFO - __main__ - Step 106933: {'lr': 9.739457800459939e-05, 'samples': 20531136, 'steps': 106932, 'loss/train': 1.4179980754852295} 11/07/2021 12:17:33 - INFO - __main__ - Step 106934: {'lr': 9.739037469088524e-05, 'samples': 20531328, 'steps': 106933, 'loss/train': 1.0948771238327026} 11/07/2021 12:17:34 - INFO - __main__ - Step 106935: {'lr': 9.738617144593462e-05, 'samples': 20531520, 'steps': 106934, 'loss/train': 1.0433377027511597} 11/07/2021 12:17:35 - INFO - __main__ - Step 106936: {'lr': 9.738196826974946e-05, 'samples': 20531712, 'steps': 106935, 'loss/train': 1.4617919921875} 11/07/2021 12:17:35 - INFO - __main__ - Step 106937: {'lr': 9.737776516233159e-05, 'samples': 20531904, 'steps': 106936, 'loss/train': 1.7044050693511963} 11/07/2021 12:17:35 - INFO - __main__ - Step 106938: {'lr': 9.737356212368297e-05, 'samples': 20532096, 'steps': 106937, 'loss/train': 2.479689359664917} 11/07/2021 12:17:36 - INFO - __main__ - Step 106939: {'lr': 9.736935915380549e-05, 'samples': 20532288, 'steps': 106938, 'loss/train': 1.1942187547683716} 11/07/2021 12:17:36 - INFO - __main__ - Step 106940: {'lr': 9.7365156252701e-05, 'samples': 20532480, 'steps': 106939, 'loss/train': 1.5756094455718994} 11/07/2021 12:17:37 - INFO - __main__ - Step 106941: {'lr': 9.736095342037141e-05, 'samples': 20532672, 'steps': 106940, 'loss/train': 1.6436595916748047} 11/07/2021 12:17:37 - INFO - __main__ - Step 106942: {'lr': 9.735675065681863e-05, 'samples': 20532864, 'steps': 106941, 'loss/train': 1.6392675638198853} 11/07/2021 12:17:38 - INFO - __main__ - Step 106943: {'lr': 9.735254796204454e-05, 'samples': 20533056, 'steps': 106942, 'loss/train': 0.938705563545227} 11/07/2021 12:17:38 - INFO - __main__ - Step 106944: {'lr': 9.734834533605114e-05, 'samples': 20533248, 'steps': 106943, 'loss/train': 1.0202337503433228} 11/07/2021 12:17:38 - INFO - __main__ - Step 106945: {'lr': 9.73441427788401e-05, 'samples': 20533440, 'steps': 106944, 'loss/train': 0.732130229473114} 11/07/2021 12:17:39 - INFO - __main__ - Step 106946: {'lr': 9.733994029041343e-05, 'samples': 20533632, 'steps': 106945, 'loss/train': 1.381514549255371} 11/07/2021 12:17:40 - INFO - __main__ - Step 106947: {'lr': 9.733573787077307e-05, 'samples': 20533824, 'steps': 106946, 'loss/train': 1.0587632656097412} 11/07/2021 12:17:40 - INFO - __main__ - Step 106948: {'lr': 9.733153551992083e-05, 'samples': 20534016, 'steps': 106947, 'loss/train': 1.3420085906982422} 11/07/2021 12:17:40 - INFO - __main__ - Step 106949: {'lr': 9.732733323785867e-05, 'samples': 20534208, 'steps': 106948, 'loss/train': 1.5053552389144897} 11/07/2021 12:17:41 - INFO - __main__ - Step 106950: {'lr': 9.732313102458845e-05, 'samples': 20534400, 'steps': 106949, 'loss/train': 1.3939746618270874} 11/07/2021 12:17:42 - INFO - __main__ - Step 106951: {'lr': 9.731892888011206e-05, 'samples': 20534592, 'steps': 106950, 'loss/train': 1.2800348997116089} 11/07/2021 12:17:42 - INFO - __main__ - Step 106952: {'lr': 9.731472680443143e-05, 'samples': 20534784, 'steps': 106951, 'loss/train': 0.8816567659378052} 11/07/2021 12:17:42 - INFO - __main__ - Step 106953: {'lr': 9.73105247975484e-05, 'samples': 20534976, 'steps': 106952, 'loss/train': 1.1187968254089355} 11/07/2021 12:17:43 - INFO - __main__ - Step 106954: {'lr': 9.730632285946489e-05, 'samples': 20535168, 'steps': 106953, 'loss/train': 1.3134312629699707} 11/07/2021 12:17:43 - INFO - __main__ - Step 106955: {'lr': 9.73021209901829e-05, 'samples': 20535360, 'steps': 106954, 'loss/train': 1.3031909465789795} 11/07/2021 12:17:44 - INFO - __main__ - Step 106956: {'lr': 9.729791918970413e-05, 'samples': 20535552, 'steps': 106955, 'loss/train': 1.4135689735412598} 11/07/2021 12:17:45 - INFO - __main__ - Step 106957: {'lr': 9.729371745803056e-05, 'samples': 20535744, 'steps': 106956, 'loss/train': 1.288293719291687} 11/07/2021 12:17:45 - INFO - __main__ - Step 106958: {'lr': 9.728951579516409e-05, 'samples': 20535936, 'steps': 106957, 'loss/train': 1.6167999505996704} 11/07/2021 12:17:45 - INFO - __main__ - Step 106959: {'lr': 9.72853142011066e-05, 'samples': 20536128, 'steps': 106958, 'loss/train': 1.2318464517593384} 11/07/2021 12:17:46 - INFO - __main__ - Step 106960: {'lr': 9.728111267585998e-05, 'samples': 20536320, 'steps': 106959, 'loss/train': 1.1938745975494385} 11/07/2021 12:17:47 - INFO - __main__ - Step 106961: {'lr': 9.727691121942614e-05, 'samples': 20536512, 'steps': 106960, 'loss/train': 1.304604411125183} 11/07/2021 12:17:47 - INFO - __main__ - Step 106962: {'lr': 9.727270983180697e-05, 'samples': 20536704, 'steps': 106961, 'loss/train': 1.1171993017196655} 11/07/2021 12:17:47 - INFO - __main__ - Step 106963: {'lr': 9.726850851300436e-05, 'samples': 20536896, 'steps': 106962, 'loss/train': 0.8618493676185608} 11/07/2021 12:17:48 - INFO - __main__ - Step 106964: {'lr': 9.726430726302021e-05, 'samples': 20537088, 'steps': 106963, 'loss/train': 1.5198718309402466} 11/07/2021 12:17:48 - INFO - __main__ - Step 106965: {'lr': 9.72601060818564e-05, 'samples': 20537280, 'steps': 106964, 'loss/train': 1.337241768836975} 11/07/2021 12:17:49 - INFO - __main__ - Step 106966: {'lr': 9.725590496951492e-05, 'samples': 20537472, 'steps': 106965, 'loss/train': 1.1766246557235718} 11/07/2021 12:17:49 - INFO - __main__ - Step 106967: {'lr': 9.72517039259975e-05, 'samples': 20537664, 'steps': 106966, 'loss/train': 1.3758784532546997} 11/07/2021 12:17:50 - INFO - __main__ - Step 106968: {'lr': 9.724750295130608e-05, 'samples': 20537856, 'steps': 106967, 'loss/train': 1.7622257471084595} 11/07/2021 12:17:50 - INFO - __main__ - Step 106969: {'lr': 9.724330204544258e-05, 'samples': 20538048, 'steps': 106968, 'loss/train': 1.5060784816741943} 11/07/2021 12:17:51 - INFO - __main__ - Step 106970: {'lr': 9.72391012084089e-05, 'samples': 20538240, 'steps': 106969, 'loss/train': 1.6232659816741943} 11/07/2021 12:17:52 - INFO - __main__ - Step 106971: {'lr': 9.723490044020692e-05, 'samples': 20538432, 'steps': 106970, 'loss/train': 1.849905014038086} 11/07/2021 12:17:52 - INFO - __main__ - Step 106972: {'lr': 9.723069974083853e-05, 'samples': 20538624, 'steps': 106971, 'loss/train': 2.2669179439544678} 11/07/2021 12:17:52 - INFO - __main__ - Step 106973: {'lr': 9.722649911030565e-05, 'samples': 20538816, 'steps': 106972, 'loss/train': 1.3156019449234009} 11/07/2021 12:17:53 - INFO - __main__ - Step 106974: {'lr': 9.722229854861015e-05, 'samples': 20539008, 'steps': 106973, 'loss/train': 1.0345739126205444} 11/07/2021 12:17:53 - INFO - __main__ - Step 106975: {'lr': 9.721809805575391e-05, 'samples': 20539200, 'steps': 106974, 'loss/train': 1.404309630393982} 11/07/2021 12:17:53 - INFO - __main__ - Step 106976: {'lr': 9.721389763173891e-05, 'samples': 20539392, 'steps': 106975, 'loss/train': 0.9125785827636719} 11/07/2021 12:17:55 - INFO - __main__ - Step 106977: {'lr': 9.72096972765669e-05, 'samples': 20539584, 'steps': 106976, 'loss/train': 1.3645974397659302} 11/07/2021 12:17:55 - INFO - __main__ - Step 106978: {'lr': 9.720549699023984e-05, 'samples': 20539776, 'steps': 106977, 'loss/train': 1.0777167081832886} 11/07/2021 12:17:55 - INFO - __main__ - Step 106979: {'lr': 9.720129677275963e-05, 'samples': 20539968, 'steps': 106978, 'loss/train': 1.527126431465149} 11/07/2021 12:17:56 - INFO - __main__ - Step 106980: {'lr': 9.719709662412818e-05, 'samples': 20540160, 'steps': 106979, 'loss/train': 0.8821859359741211} 11/07/2021 12:17:56 - INFO - __main__ - Step 106981: {'lr': 9.719289654434732e-05, 'samples': 20540352, 'steps': 106980, 'loss/train': 0.7669743895530701} 11/07/2021 12:17:57 - INFO - __main__ - Step 106982: {'lr': 9.7188696533419e-05, 'samples': 20540544, 'steps': 106981, 'loss/train': 1.4184659719467163} 11/07/2021 12:17:57 - INFO - __main__ - Step 106983: {'lr': 9.71844965913451e-05, 'samples': 20540736, 'steps': 106982, 'loss/train': 1.070011854171753} 11/07/2021 12:17:58 - INFO - __main__ - Step 106984: {'lr': 9.71802967181275e-05, 'samples': 20540928, 'steps': 106983, 'loss/train': 1.5341473817825317} 11/07/2021 12:17:58 - INFO - __main__ - Step 106985: {'lr': 9.717609691376811e-05, 'samples': 20541120, 'steps': 106984, 'loss/train': 1.4134502410888672} 11/07/2021 12:17:58 - INFO - __main__ - Step 106986: {'lr': 9.717189717826879e-05, 'samples': 20541312, 'steps': 106985, 'loss/train': 1.1910676956176758} 11/07/2021 12:18:00 - INFO - __main__ - Step 106987: {'lr': 9.716769751163157e-05, 'samples': 20541504, 'steps': 106986, 'loss/train': 1.8501245975494385} 11/07/2021 12:18:00 - INFO - __main__ - Step 106988: {'lr': 9.716349791385811e-05, 'samples': 20541696, 'steps': 106987, 'loss/train': 1.5312479734420776} 11/07/2021 12:18:00 - INFO - __main__ - Step 106989: {'lr': 9.715929838495047e-05, 'samples': 20541888, 'steps': 106988, 'loss/train': 1.0477327108383179} 11/07/2021 12:18:01 - INFO - __main__ - Step 106990: {'lr': 9.715509892491045e-05, 'samples': 20542080, 'steps': 106989, 'loss/train': 1.256164312362671} 11/07/2021 12:18:01 - INFO - __main__ - Step 106991: {'lr': 9.715089953373998e-05, 'samples': 20542272, 'steps': 106990, 'loss/train': 1.235459566116333} 11/07/2021 12:18:02 - INFO - __main__ - Step 106992: {'lr': 9.714670021144097e-05, 'samples': 20542464, 'steps': 106991, 'loss/train': 1.4991352558135986} 11/07/2021 12:18:02 - INFO - __main__ - Step 106993: {'lr': 9.71425009580153e-05, 'samples': 20542656, 'steps': 106992, 'loss/train': 1.4185316562652588} 11/07/2021 12:18:03 - INFO - __main__ - Step 106994: {'lr': 9.713830177346486e-05, 'samples': 20542848, 'steps': 106993, 'loss/train': 0.7957563400268555} 11/07/2021 12:18:03 - INFO - __main__ - Step 106995: {'lr': 9.713410265779155e-05, 'samples': 20543040, 'steps': 106994, 'loss/train': 1.242128610610962} 11/07/2021 12:18:03 - INFO - __main__ - Step 106996: {'lr': 9.712990361099725e-05, 'samples': 20543232, 'steps': 106995, 'loss/train': 1.0033670663833618} 11/07/2021 12:18:04 - INFO - __main__ - Step 106997: {'lr': 9.712570463308384e-05, 'samples': 20543424, 'steps': 106996, 'loss/train': 1.7294964790344238} 11/07/2021 12:18:05 - INFO - __main__ - Step 106998: {'lr': 9.712150572405331e-05, 'samples': 20543616, 'steps': 106997, 'loss/train': 0.9286695122718811} 11/07/2021 12:18:05 - INFO - __main__ - Step 106999: {'lr': 9.71173068839074e-05, 'samples': 20543808, 'steps': 106998, 'loss/train': 1.8029141426086426} 11/07/2021 12:18:06 - INFO - __main__ - Step 107000: {'lr': 9.71131081126481e-05, 'samples': 20544000, 'steps': 106999, 'loss/train': 1.6696100234985352} 11/07/2021 12:18:06 - INFO - __main__ - Step 107001: {'lr': 9.710890941027722e-05, 'samples': 20544192, 'steps': 107000, 'loss/train': 1.498238444328308} 11/07/2021 12:18:07 - INFO - __main__ - Step 107002: {'lr': 9.710471077679675e-05, 'samples': 20544384, 'steps': 107001, 'loss/train': 1.551005244255066} 11/07/2021 12:18:07 - INFO - __main__ - Step 107003: {'lr': 9.710051221220851e-05, 'samples': 20544576, 'steps': 107002, 'loss/train': 0.7980526685714722} 11/07/2021 12:18:08 - INFO - __main__ - Step 107004: {'lr': 9.709631371651443e-05, 'samples': 20544768, 'steps': 107003, 'loss/train': 1.2320231199264526} 11/07/2021 12:18:08 - INFO - __main__ - Step 107005: {'lr': 9.70921152897164e-05, 'samples': 20544960, 'steps': 107004, 'loss/train': 1.7008620500564575} 11/07/2021 12:18:08 - INFO - __main__ - Step 107006: {'lr': 9.708791693181628e-05, 'samples': 20545152, 'steps': 107005, 'loss/train': 1.660915493965149} 11/07/2021 12:18:09 - INFO - __main__ - Step 107007: {'lr': 9.708371864281601e-05, 'samples': 20545344, 'steps': 107006, 'loss/train': 1.528140902519226} 11/07/2021 12:18:10 - INFO - __main__ - Step 107008: {'lr': 9.707952042271745e-05, 'samples': 20545536, 'steps': 107007, 'loss/train': 1.2850297689437866} 11/07/2021 12:18:10 - INFO - __main__ - Step 107009: {'lr': 9.70753222715225e-05, 'samples': 20545728, 'steps': 107008, 'loss/train': 1.3133118152618408} 11/07/2021 12:18:10 - INFO - __main__ - Step 107010: {'lr': 9.707112418923303e-05, 'samples': 20545920, 'steps': 107009, 'loss/train': 1.3471366167068481} 11/07/2021 12:18:11 - INFO - __main__ - Step 107011: {'lr': 9.706692617585097e-05, 'samples': 20546112, 'steps': 107010, 'loss/train': 1.1808322668075562} 11/07/2021 12:18:11 - INFO - __main__ - Step 107012: {'lr': 9.706272823137829e-05, 'samples': 20546304, 'steps': 107011, 'loss/train': 0.38934609293937683} 11/07/2021 12:18:12 - INFO - __main__ - Step 107013: {'lr': 9.705853035581668e-05, 'samples': 20546496, 'steps': 107012, 'loss/train': 0.5779416561126709} 11/07/2021 12:18:13 - INFO - __main__ - Step 107014: {'lr': 9.705433254916814e-05, 'samples': 20546688, 'steps': 107013, 'loss/train': 1.4823623895645142} 11/07/2021 12:18:13 - INFO - __main__ - Step 107015: {'lr': 9.705013481143457e-05, 'samples': 20546880, 'steps': 107014, 'loss/train': 1.2548608779907227} 11/07/2021 12:18:13 - INFO - __main__ - Step 107016: {'lr': 9.704593714261784e-05, 'samples': 20547072, 'steps': 107015, 'loss/train': 1.9544785022735596} 11/07/2021 12:18:14 - INFO - __main__ - Step 107017: {'lr': 9.704173954271984e-05, 'samples': 20547264, 'steps': 107016, 'loss/train': 1.6113299131393433} 11/07/2021 12:18:15 - INFO - __main__ - Step 107018: {'lr': 9.703754201174248e-05, 'samples': 20547456, 'steps': 107017, 'loss/train': 1.2184638977050781} 11/07/2021 12:18:15 - INFO - __main__ - Step 107019: {'lr': 9.703334454968765e-05, 'samples': 20547648, 'steps': 107018, 'loss/train': 1.2466211318969727} 11/07/2021 12:18:15 - INFO - __main__ - Step 107020: {'lr': 9.702914715655723e-05, 'samples': 20547840, 'steps': 107019, 'loss/train': 1.19423246383667} 11/07/2021 12:18:16 - INFO - __main__ - Step 107021: {'lr': 9.702494983235311e-05, 'samples': 20548032, 'steps': 107020, 'loss/train': 1.4154038429260254} 11/07/2021 12:18:16 - INFO - __main__ - Step 107022: {'lr': 9.702075257707718e-05, 'samples': 20548224, 'steps': 107021, 'loss/train': 1.453759789466858} 11/07/2021 12:18:17 - INFO - __main__ - Step 107023: {'lr': 9.701655539073134e-05, 'samples': 20548416, 'steps': 107022, 'loss/train': 1.7419041395187378} 11/07/2021 12:18:18 - INFO - __main__ - Step 107024: {'lr': 9.70123582733175e-05, 'samples': 20548608, 'steps': 107023, 'loss/train': 1.0486923456192017} 11/07/2021 12:18:18 - INFO - __main__ - Step 107025: {'lr': 9.70081612248376e-05, 'samples': 20548800, 'steps': 107024, 'loss/train': 1.1918071508407593} 11/07/2021 12:18:18 - INFO - __main__ - Step 107026: {'lr': 9.70039642452934e-05, 'samples': 20548992, 'steps': 107025, 'loss/train': 1.517500638961792} 11/07/2021 12:18:19 - INFO - __main__ - Step 107027: {'lr': 9.699976733468682e-05, 'samples': 20549184, 'steps': 107026, 'loss/train': 1.5636237859725952} 11/07/2021 12:18:20 - INFO - __main__ - Step 107028: {'lr': 9.69955704930198e-05, 'samples': 20549376, 'steps': 107027, 'loss/train': 1.7391308546066284} 11/07/2021 12:18:20 - INFO - __main__ - Step 107029: {'lr': 9.69913737202942e-05, 'samples': 20549568, 'steps': 107028, 'loss/train': 1.02311372756958} 11/07/2021 12:18:20 - INFO - __main__ - Step 107030: {'lr': 9.698717701651193e-05, 'samples': 20549760, 'steps': 107029, 'loss/train': 1.488446831703186} 11/07/2021 12:18:21 - INFO - __main__ - Step 107031: {'lr': 9.698298038167492e-05, 'samples': 20549952, 'steps': 107030, 'loss/train': 1.7324365377426147} 11/07/2021 12:18:21 - INFO - __main__ - Step 107032: {'lr': 9.697878381578499e-05, 'samples': 20550144, 'steps': 107031, 'loss/train': 1.8129013776779175} 11/07/2021 12:18:22 - INFO - __main__ - Step 107033: {'lr': 9.697458731884404e-05, 'samples': 20550336, 'steps': 107032, 'loss/train': 1.2868351936340332} 11/07/2021 12:18:22 - INFO - __main__ - Step 107034: {'lr': 9.697039089085399e-05, 'samples': 20550528, 'steps': 107033, 'loss/train': 1.4274853467941284} 11/07/2021 12:18:23 - INFO - __main__ - Step 107035: {'lr': 9.696619453181671e-05, 'samples': 20550720, 'steps': 107034, 'loss/train': 1.260695457458496} 11/07/2021 12:18:23 - INFO - __main__ - Step 107036: {'lr': 9.696199824173413e-05, 'samples': 20550912, 'steps': 107035, 'loss/train': 1.6148154735565186} 11/07/2021 12:18:23 - INFO - __main__ - Step 107037: {'lr': 9.695780202060809e-05, 'samples': 20551104, 'steps': 107036, 'loss/train': 1.3375192880630493} 11/07/2021 12:18:24 - INFO - __main__ - Step 107038: {'lr': 9.695360586844052e-05, 'samples': 20551296, 'steps': 107037, 'loss/train': 1.448743462562561} 11/07/2021 12:18:25 - INFO - __main__ - Step 107039: {'lr': 9.694940978523337e-05, 'samples': 20551488, 'steps': 107038, 'loss/train': 1.2448511123657227} 11/07/2021 12:18:26 - INFO - __main__ - Step 107040: {'lr': 9.694521377098837e-05, 'samples': 20551680, 'steps': 107039, 'loss/train': 1.6092666387557983} 11/07/2021 12:18:26 - INFO - __main__ - Step 107041: {'lr': 9.694101782570747e-05, 'samples': 20551872, 'steps': 107040, 'loss/train': 0.12252122163772583} 11/07/2021 12:18:26 - INFO - __main__ - Step 107042: {'lr': 9.69368219493926e-05, 'samples': 20552064, 'steps': 107041, 'loss/train': 1.2351692914962769} 11/07/2021 12:18:27 - INFO - __main__ - Step 107043: {'lr': 9.693262614204566e-05, 'samples': 20552256, 'steps': 107042, 'loss/train': 1.4362648725509644} 11/07/2021 12:18:28 - INFO - __main__ - Step 107044: {'lr': 9.69284304036685e-05, 'samples': 20552448, 'steps': 107043, 'loss/train': 1.5338854789733887} 11/07/2021 12:18:28 - INFO - __main__ - Step 107045: {'lr': 9.692423473426302e-05, 'samples': 20552640, 'steps': 107044, 'loss/train': 1.0726382732391357} 11/07/2021 12:18:28 - INFO - __main__ - Step 107046: {'lr': 9.692003913383113e-05, 'samples': 20552832, 'steps': 107045, 'loss/train': 1.120802879333496} 11/07/2021 12:18:29 - INFO - __main__ - Step 107047: {'lr': 9.691584360237471e-05, 'samples': 20553024, 'steps': 107046, 'loss/train': 1.500553846359253} 11/07/2021 12:18:29 - INFO - __main__ - Step 107048: {'lr': 9.691164813989564e-05, 'samples': 20553216, 'steps': 107047, 'loss/train': 1.3935937881469727} 11/07/2021 12:18:30 - INFO - __main__ - Step 107049: {'lr': 9.690745274639582e-05, 'samples': 20553408, 'steps': 107048, 'loss/train': 1.3646330833435059} 11/07/2021 12:18:30 - INFO - __main__ - Step 107050: {'lr': 9.690325742187714e-05, 'samples': 20553600, 'steps': 107049, 'loss/train': 1.522588849067688} 11/07/2021 12:18:31 - INFO - __main__ - Step 107051: {'lr': 9.689906216634147e-05, 'samples': 20553792, 'steps': 107050, 'loss/train': 0.9323347210884094} 11/07/2021 12:18:31 - INFO - __main__ - Step 107052: {'lr': 9.689486697979083e-05, 'samples': 20553984, 'steps': 107051, 'loss/train': 1.572167158126831} 11/07/2021 12:18:32 - INFO - __main__ - Step 107053: {'lr': 9.689067186222692e-05, 'samples': 20554176, 'steps': 107052, 'loss/train': 1.1111913919448853} 11/07/2021 12:18:32 - INFO - __main__ - Step 107054: {'lr': 9.68864768136517e-05, 'samples': 20554368, 'steps': 107053, 'loss/train': 1.4658352136611938} 11/07/2021 12:18:33 - INFO - __main__ - Step 107055: {'lr': 9.688228183406706e-05, 'samples': 20554560, 'steps': 107054, 'loss/train': 1.323430061340332} 11/07/2021 12:18:33 - INFO - __main__ - Step 107056: {'lr': 9.687808692347492e-05, 'samples': 20554752, 'steps': 107055, 'loss/train': 1.3056073188781738} 11/07/2021 12:18:34 - INFO - __main__ - Step 107057: {'lr': 9.687389208187714e-05, 'samples': 20554944, 'steps': 107056, 'loss/train': 1.157211184501648} 11/07/2021 12:18:34 - INFO - __main__ - Step 107058: {'lr': 9.686969730927564e-05, 'samples': 20555136, 'steps': 107057, 'loss/train': 1.5509698390960693} 11/07/2021 12:18:35 - INFO - __main__ - Step 107059: {'lr': 9.686550260567225e-05, 'samples': 20555328, 'steps': 107058, 'loss/train': 1.582858920097351} 11/07/2021 12:18:35 - INFO - __main__ - Step 107060: {'lr': 9.686130797106896e-05, 'samples': 20555520, 'steps': 107059, 'loss/train': 1.3128430843353271} 11/07/2021 12:18:36 - INFO - __main__ - Step 107061: {'lr': 9.685711340546755e-05, 'samples': 20555712, 'steps': 107060, 'loss/train': 0.7471470236778259} 11/07/2021 12:18:36 - INFO - __main__ - Step 107062: {'lr': 9.685291890886999e-05, 'samples': 20555904, 'steps': 107061, 'loss/train': 1.1756658554077148} 11/07/2021 12:18:36 - INFO - __main__ - Step 107063: {'lr': 9.684872448127813e-05, 'samples': 20556096, 'steps': 107062, 'loss/train': 1.5393139123916626} 11/07/2021 12:18:38 - INFO - __main__ - Step 107064: {'lr': 9.684453012269387e-05, 'samples': 20556288, 'steps': 107063, 'loss/train': 1.4698631763458252} 11/07/2021 12:18:38 - INFO - __main__ - Step 107065: {'lr': 9.68403358331191e-05, 'samples': 20556480, 'steps': 107064, 'loss/train': 1.5452265739440918} 11/07/2021 12:18:38 - INFO - __main__ - Step 107066: {'lr': 9.68361416125558e-05, 'samples': 20556672, 'steps': 107065, 'loss/train': 1.4262909889221191} 11/07/2021 12:18:39 - INFO - __main__ - Step 107067: {'lr': 9.683194746100571e-05, 'samples': 20556864, 'steps': 107066, 'loss/train': 1.442919135093689} 11/07/2021 12:18:39 - INFO - __main__ - Step 107068: {'lr': 9.682775337847075e-05, 'samples': 20557056, 'steps': 107067, 'loss/train': 1.2326353788375854} 11/07/2021 12:18:39 - INFO - __main__ - Step 107069: {'lr': 9.682355936495285e-05, 'samples': 20557248, 'steps': 107068, 'loss/train': 1.1508525609970093} 11/07/2021 12:18:40 - INFO - __main__ - Step 107070: {'lr': 9.681936542045389e-05, 'samples': 20557440, 'steps': 107069, 'loss/train': 1.8246917724609375} 11/07/2021 12:18:41 - INFO - __main__ - Step 107071: {'lr': 9.681517154497576e-05, 'samples': 20557632, 'steps': 107070, 'loss/train': 1.1511647701263428} 11/07/2021 12:18:41 - INFO - __main__ - Step 107072: {'lr': 9.681097773852034e-05, 'samples': 20557824, 'steps': 107071, 'loss/train': 0.7406731843948364} 11/07/2021 12:18:41 - INFO - __main__ - Step 107073: {'lr': 9.680678400108955e-05, 'samples': 20558016, 'steps': 107072, 'loss/train': 1.0005333423614502} 11/07/2021 12:18:42 - INFO - __main__ - Step 107074: {'lr': 9.680259033268524e-05, 'samples': 20558208, 'steps': 107073, 'loss/train': 1.1824487447738647} 11/07/2021 12:18:43 - INFO - __main__ - Step 107075: {'lr': 9.679839673330934e-05, 'samples': 20558400, 'steps': 107074, 'loss/train': 0.985297441482544} 11/07/2021 12:18:43 - INFO - __main__ - Step 107076: {'lr': 9.67942032029637e-05, 'samples': 20558592, 'steps': 107075, 'loss/train': 1.3738188743591309} 11/07/2021 12:18:44 - INFO - __main__ - Step 107077: {'lr': 9.679000974165022e-05, 'samples': 20558784, 'steps': 107076, 'loss/train': 1.2190759181976318} 11/07/2021 12:18:44 - INFO - __main__ - Step 107078: {'lr': 9.678581634937084e-05, 'samples': 20558976, 'steps': 107077, 'loss/train': 1.4857593774795532} 11/07/2021 12:18:44 - INFO - __main__ - Step 107079: {'lr': 9.678162302612744e-05, 'samples': 20559168, 'steps': 107078, 'loss/train': 1.0692037343978882} 11/07/2021 12:18:45 - INFO - __main__ - Step 107080: {'lr': 9.677742977192184e-05, 'samples': 20559360, 'steps': 107079, 'loss/train': 1.2257617712020874} 11/07/2021 12:18:46 - INFO - __main__ - Step 107081: {'lr': 9.677323658675594e-05, 'samples': 20559552, 'steps': 107080, 'loss/train': 1.3742444515228271} 11/07/2021 12:18:46 - INFO - __main__ - Step 107082: {'lr': 9.676904347063164e-05, 'samples': 20559744, 'steps': 107081, 'loss/train': 1.247454047203064} 11/07/2021 12:18:47 - INFO - __main__ - Step 107083: {'lr': 9.676485042355087e-05, 'samples': 20559936, 'steps': 107082, 'loss/train': 1.2033183574676514} 11/07/2021 12:18:47 - INFO - __main__ - Step 107084: {'lr': 9.676065744551548e-05, 'samples': 20560128, 'steps': 107083, 'loss/train': 1.3738372325897217} 11/07/2021 12:18:47 - INFO - __main__ - Step 107085: {'lr': 9.675646453652736e-05, 'samples': 20560320, 'steps': 107084, 'loss/train': 1.6596852540969849} 11/07/2021 12:18:48 - INFO - __main__ - Step 107086: {'lr': 9.675227169658846e-05, 'samples': 20560512, 'steps': 107085, 'loss/train': 1.5428760051727295} 11/07/2021 12:18:49 - INFO - __main__ - Step 107087: {'lr': 9.674807892570059e-05, 'samples': 20560704, 'steps': 107086, 'loss/train': 1.159543752670288} 11/07/2021 12:18:49 - INFO - __main__ - Step 107088: {'lr': 9.674388622386565e-05, 'samples': 20560896, 'steps': 107087, 'loss/train': 0.9245731830596924} 11/07/2021 12:18:49 - INFO - __main__ - Step 107089: {'lr': 9.673969359108559e-05, 'samples': 20561088, 'steps': 107088, 'loss/train': 1.9793362617492676} 11/07/2021 12:18:50 - INFO - __main__ - Step 107090: {'lr': 9.673550102736223e-05, 'samples': 20561280, 'steps': 107089, 'loss/train': 1.075053095817566} 11/07/2021 12:18:50 - INFO - __main__ - Step 107091: {'lr': 9.673130853269751e-05, 'samples': 20561472, 'steps': 107090, 'loss/train': 1.8548078536987305} 11/07/2021 12:18:51 - INFO - __main__ - Step 107092: {'lr': 9.672711610709328e-05, 'samples': 20561664, 'steps': 107091, 'loss/train': 1.3310846090316772} 11/07/2021 12:18:52 - INFO - __main__ - Step 107093: {'lr': 9.672292375055156e-05, 'samples': 20561856, 'steps': 107092, 'loss/train': 1.4227099418640137} 11/07/2021 12:18:52 - INFO - __main__ - Step 107094: {'lr': 9.6718731463074e-05, 'samples': 20562048, 'steps': 107093, 'loss/train': 1.5589336156845093} 11/07/2021 12:18:52 - INFO - __main__ - Step 107095: {'lr': 9.671453924466264e-05, 'samples': 20562240, 'steps': 107094, 'loss/train': 1.1573771238327026} 11/07/2021 12:18:53 - INFO - __main__ - Step 107096: {'lr': 9.671034709531934e-05, 'samples': 20562432, 'steps': 107095, 'loss/train': 1.8756418228149414} 11/07/2021 12:18:54 - INFO - __main__ - Step 107097: {'lr': 9.670615501504598e-05, 'samples': 20562624, 'steps': 107096, 'loss/train': 1.4335120916366577} 11/07/2021 12:18:54 - INFO - __main__ - Step 107098: {'lr': 9.670196300384445e-05, 'samples': 20562816, 'steps': 107097, 'loss/train': 1.6827113628387451} 11/07/2021 12:18:54 - INFO - __main__ - Step 107099: {'lr': 9.669777106171667e-05, 'samples': 20563008, 'steps': 107098, 'loss/train': 0.08443847298622131} 11/07/2021 12:18:55 - INFO - __main__ - Step 107100: {'lr': 9.669357918866454e-05, 'samples': 20563200, 'steps': 107099, 'loss/train': 0.7517154812812805} 11/07/2021 12:18:55 - INFO - __main__ - Step 107101: {'lr': 9.668938738468989e-05, 'samples': 20563392, 'steps': 107100, 'loss/train': 1.0998722314834595} 11/07/2021 12:18:56 - INFO - __main__ - Step 107102: {'lr': 9.668519564979461e-05, 'samples': 20563584, 'steps': 107101, 'loss/train': 1.4401010274887085} 11/07/2021 12:18:56 - INFO - __main__ - Step 107103: {'lr': 9.668100398398063e-05, 'samples': 20563776, 'steps': 107102, 'loss/train': 1.2640860080718994} 11/07/2021 12:18:57 - INFO - __main__ - Step 107104: {'lr': 9.667681238724985e-05, 'samples': 20563968, 'steps': 107103, 'loss/train': 1.0856621265411377} 11/07/2021 12:18:57 - INFO - __main__ - Step 107105: {'lr': 9.66726208596041e-05, 'samples': 20564160, 'steps': 107104, 'loss/train': 1.591874361038208} 11/07/2021 12:18:57 - INFO - __main__ - Step 107106: {'lr': 9.666842940104539e-05, 'samples': 20564352, 'steps': 107105, 'loss/train': 1.2305644750595093} 11/07/2021 12:18:58 - INFO - __main__ - Step 107107: {'lr': 9.666423801157545e-05, 'samples': 20564544, 'steps': 107106, 'loss/train': 1.2405366897583008} 11/07/2021 12:18:59 - INFO - __main__ - Step 107108: {'lr': 9.666004669119624e-05, 'samples': 20564736, 'steps': 107107, 'loss/train': 1.3948696851730347} 11/07/2021 12:18:59 - INFO - __main__ - Step 107109: {'lr': 9.665585543990965e-05, 'samples': 20564928, 'steps': 107108, 'loss/train': 0.9160419702529907} 11/07/2021 12:19:00 - INFO - __main__ - Step 107110: {'lr': 9.665166425771754e-05, 'samples': 20565120, 'steps': 107109, 'loss/train': 1.1229053735733032} 11/07/2021 12:19:00 - INFO - __main__ - Step 107111: {'lr': 9.664747314462186e-05, 'samples': 20565312, 'steps': 107110, 'loss/train': 1.233916997909546} 11/07/2021 12:19:00 - INFO - __main__ - Step 107112: {'lr': 9.664328210062443e-05, 'samples': 20565504, 'steps': 107111, 'loss/train': 1.3189692497253418} 11/07/2021 12:19:01 - INFO - __main__ - Step 107113: {'lr': 9.663909112572716e-05, 'samples': 20565696, 'steps': 107112, 'loss/train': 1.07125985622406} 11/07/2021 12:19:02 - INFO - __main__ - Step 107114: {'lr': 9.663490021993199e-05, 'samples': 20565888, 'steps': 107113, 'loss/train': 1.6576025485992432} 11/07/2021 12:19:02 - INFO - __main__ - Step 107115: {'lr': 9.663070938324075e-05, 'samples': 20566080, 'steps': 107114, 'loss/train': 0.7271732091903687} 11/07/2021 12:19:02 - INFO - __main__ - Step 107116: {'lr': 9.662651861565532e-05, 'samples': 20566272, 'steps': 107115, 'loss/train': 0.6186001300811768} 11/07/2021 12:19:03 - INFO - __main__ - Step 107117: {'lr': 9.662232791717765e-05, 'samples': 20566464, 'steps': 107116, 'loss/train': 1.5344477891921997} 11/07/2021 12:19:04 - INFO - __main__ - Step 107118: {'lr': 9.661813728780958e-05, 'samples': 20566656, 'steps': 107117, 'loss/train': 1.0164209604263306} 11/07/2021 12:19:04 - INFO - __main__ - Step 107119: {'lr': 9.661394672755311e-05, 'samples': 20566848, 'steps': 107118, 'loss/train': 1.1961421966552734} 11/07/2021 12:19:05 - INFO - __main__ - Step 107120: {'lr': 9.660975623640992e-05, 'samples': 20567040, 'steps': 107119, 'loss/train': 1.3392945528030396} 11/07/2021 12:19:05 - INFO - __main__ - Step 107121: {'lr': 9.660556581438202e-05, 'samples': 20567232, 'steps': 107120, 'loss/train': 1.2652815580368042} 11/07/2021 12:19:05 - INFO - __main__ - Step 107122: {'lr': 9.660137546147127e-05, 'samples': 20567424, 'steps': 107121, 'loss/train': 1.425223708152771} 11/07/2021 12:19:06 - INFO - __main__ - Step 107123: {'lr': 9.65971851776796e-05, 'samples': 20567616, 'steps': 107122, 'loss/train': 1.1377111673355103} 11/07/2021 12:19:07 - INFO - __main__ - Step 107124: {'lr': 9.659299496300883e-05, 'samples': 20567808, 'steps': 107123, 'loss/train': 1.1978602409362793} 11/07/2021 12:19:07 - INFO - __main__ - Step 107125: {'lr': 9.658880481746093e-05, 'samples': 20568000, 'steps': 107124, 'loss/train': 1.3309458494186401} 11/07/2021 12:19:07 - INFO - __main__ - Step 107126: {'lr': 9.658461474103772e-05, 'samples': 20568192, 'steps': 107125, 'loss/train': 1.4705805778503418} 11/07/2021 12:19:08 - INFO - __main__ - Step 107127: {'lr': 9.658042473374113e-05, 'samples': 20568384, 'steps': 107126, 'loss/train': 0.9519587755203247} 11/07/2021 12:19:09 - INFO - __main__ - Step 107128: {'lr': 9.657623479557303e-05, 'samples': 20568576, 'steps': 107127, 'loss/train': 0.8378567099571228} 11/07/2021 12:19:09 - INFO - __main__ - Step 107129: {'lr': 9.657204492653532e-05, 'samples': 20568768, 'steps': 107128, 'loss/train': 0.8759094476699829} 11/07/2021 12:19:09 - INFO - __main__ - Step 107130: {'lr': 9.656785512662985e-05, 'samples': 20568960, 'steps': 107129, 'loss/train': 1.0476579666137695} 11/07/2021 12:19:10 - INFO - __main__ - Step 107131: {'lr': 9.656366539585856e-05, 'samples': 20569152, 'steps': 107130, 'loss/train': 1.5098843574523926} 11/07/2021 12:19:10 - INFO - __main__ - Step 107132: {'lr': 9.655947573422333e-05, 'samples': 20569344, 'steps': 107131, 'loss/train': 1.8707661628723145} 11/07/2021 12:19:11 - INFO - __main__ - Step 107133: {'lr': 9.655528614172609e-05, 'samples': 20569536, 'steps': 107132, 'loss/train': 1.8668255805969238} 11/07/2021 12:19:11 - INFO - __main__ - Step 107134: {'lr': 9.655109661836861e-05, 'samples': 20569728, 'steps': 107133, 'loss/train': 1.3517788648605347} 11/07/2021 12:19:12 - INFO - __main__ - Step 107135: {'lr': 9.654690716415282e-05, 'samples': 20569920, 'steps': 107134, 'loss/train': 1.3112215995788574} 11/07/2021 12:19:12 - INFO - __main__ - Step 107136: {'lr': 9.654271777908061e-05, 'samples': 20570112, 'steps': 107135, 'loss/train': 1.6119524240493774} 11/07/2021 12:19:13 - INFO - __main__ - Step 107137: {'lr': 9.653852846315392e-05, 'samples': 20570304, 'steps': 107136, 'loss/train': 1.3108372688293457} 11/07/2021 12:19:14 - INFO - __main__ - Step 107138: {'lr': 9.653433921637459e-05, 'samples': 20570496, 'steps': 107137, 'loss/train': 1.713242530822754} 11/07/2021 12:19:14 - INFO - __main__ - Step 107139: {'lr': 9.653015003874449e-05, 'samples': 20570688, 'steps': 107138, 'loss/train': 0.9783931970596313} 11/07/2021 12:19:14 - INFO - __main__ - Step 107140: {'lr': 9.652596093026555e-05, 'samples': 20570880, 'steps': 107139, 'loss/train': 1.6829190254211426} 11/07/2021 12:19:15 - INFO - __main__ - Step 107141: {'lr': 9.652177189093967e-05, 'samples': 20571072, 'steps': 107140, 'loss/train': 1.4248883724212646} 11/07/2021 12:19:15 - INFO - __main__ - Step 107142: {'lr': 9.651758292076867e-05, 'samples': 20571264, 'steps': 107141, 'loss/train': 0.6256065964698792} 11/07/2021 12:19:15 - INFO - __main__ - Step 107143: {'lr': 9.65133940197545e-05, 'samples': 20571456, 'steps': 107142, 'loss/train': 1.7246582508087158} 11/07/2021 12:19:16 - INFO - __main__ - Step 107144: {'lr': 9.650920518789904e-05, 'samples': 20571648, 'steps': 107143, 'loss/train': 1.1941659450531006} 11/07/2021 12:19:17 - INFO - __main__ - Step 107145: {'lr': 9.650501642520415e-05, 'samples': 20571840, 'steps': 107144, 'loss/train': 1.5177175998687744} 11/07/2021 12:19:17 - INFO - __main__ - Step 107146: {'lr': 9.650082773167182e-05, 'samples': 20572032, 'steps': 107145, 'loss/train': 1.6700563430786133} 11/07/2021 12:19:17 - INFO - __main__ - Step 107147: {'lr': 9.649663910730377e-05, 'samples': 20572224, 'steps': 107146, 'loss/train': 0.8734719753265381} 11/07/2021 12:19:18 - INFO - __main__ - Step 107148: {'lr': 9.649245055210196e-05, 'samples': 20572416, 'steps': 107147, 'loss/train': 1.2435535192489624} 11/07/2021 12:19:19 - INFO - __main__ - Step 107149: {'lr': 9.648826206606826e-05, 'samples': 20572608, 'steps': 107148, 'loss/train': 0.6141279935836792} 11/07/2021 12:19:19 - INFO - __main__ - Step 107150: {'lr': 9.648407364920461e-05, 'samples': 20572800, 'steps': 107149, 'loss/train': 1.07974112033844} 11/07/2021 12:19:20 - INFO - __main__ - Step 107151: {'lr': 9.647988530151285e-05, 'samples': 20572992, 'steps': 107150, 'loss/train': 1.3540714979171753} 11/07/2021 12:19:20 - INFO - __main__ - Step 107152: {'lr': 9.647569702299489e-05, 'samples': 20573184, 'steps': 107151, 'loss/train': 0.9584633111953735} 11/07/2021 12:19:20 - INFO - __main__ - Step 107153: {'lr': 9.647150881365263e-05, 'samples': 20573376, 'steps': 107152, 'loss/train': 1.4245274066925049} 11/07/2021 12:19:21 - INFO - __main__ - Step 107154: {'lr': 9.64673206734879e-05, 'samples': 20573568, 'steps': 107153, 'loss/train': 1.5260471105575562} 11/07/2021 12:19:22 - INFO - __main__ - Step 107155: {'lr': 9.646313260250267e-05, 'samples': 20573760, 'steps': 107154, 'loss/train': 1.2451823949813843} 11/07/2021 12:19:22 - INFO - __main__ - Step 107156: {'lr': 9.645894460069876e-05, 'samples': 20573952, 'steps': 107155, 'loss/train': 0.9413198828697205} 11/07/2021 12:19:22 - INFO - __main__ - Step 107157: {'lr': 9.645475666807807e-05, 'samples': 20574144, 'steps': 107156, 'loss/train': 1.361348032951355} 11/07/2021 12:19:23 - INFO - __main__ - Step 107158: {'lr': 9.64505688046425e-05, 'samples': 20574336, 'steps': 107157, 'loss/train': 1.5701494216918945} 11/07/2021 12:19:24 - INFO - __main__ - Step 107159: {'lr': 9.644638101039396e-05, 'samples': 20574528, 'steps': 107158, 'loss/train': 1.524310827255249} 11/07/2021 12:19:24 - INFO - __main__ - Step 107160: {'lr': 9.644219328533438e-05, 'samples': 20574720, 'steps': 107159, 'loss/train': 1.2504353523254395} 11/07/2021 12:19:24 - INFO - __main__ - Step 107161: {'lr': 9.643800562946551e-05, 'samples': 20574912, 'steps': 107160, 'loss/train': 1.0121276378631592} 11/07/2021 12:19:25 - INFO - __main__ - Step 107162: {'lr': 9.643381804278927e-05, 'samples': 20575104, 'steps': 107161, 'loss/train': 0.6060599088668823} 11/07/2021 12:19:25 - INFO - __main__ - Step 107163: {'lr': 9.642963052530759e-05, 'samples': 20575296, 'steps': 107162, 'loss/train': 1.7241419553756714} 11/07/2021 12:19:25 - INFO - __main__ - Step 107164: {'lr': 9.642544307702236e-05, 'samples': 20575488, 'steps': 107163, 'loss/train': 1.0504204034805298} 11/07/2021 12:19:27 - INFO - __main__ - Step 107165: {'lr': 9.642125569793547e-05, 'samples': 20575680, 'steps': 107164, 'loss/train': 1.2689231634140015} 11/07/2021 12:19:27 - INFO - __main__ - Step 107166: {'lr': 9.641706838804879e-05, 'samples': 20575872, 'steps': 107165, 'loss/train': 1.278436541557312} 11/07/2021 12:19:27 - INFO - __main__ - Step 107167: {'lr': 9.641288114736418e-05, 'samples': 20576064, 'steps': 107166, 'loss/train': 1.082615852355957} 11/07/2021 12:19:28 - INFO - __main__ - Step 107168: {'lr': 9.640869397588356e-05, 'samples': 20576256, 'steps': 107167, 'loss/train': 1.5297256708145142} 11/07/2021 12:19:28 - INFO - __main__ - Step 107169: {'lr': 9.640450687360882e-05, 'samples': 20576448, 'steps': 107168, 'loss/train': 0.09307963401079178} 11/07/2021 12:19:30 - INFO - __main__ - Step 107170: {'lr': 9.640031984054184e-05, 'samples': 20576640, 'steps': 107169, 'loss/train': 1.471591830253601} 11/07/2021 12:19:30 - INFO - __main__ - Step 107171: {'lr': 9.639613287668453e-05, 'samples': 20576832, 'steps': 107170, 'loss/train': 1.4765872955322266} 11/07/2021 12:19:30 - INFO - __main__ - Step 107172: {'lr': 9.639194598203873e-05, 'samples': 20577024, 'steps': 107171, 'loss/train': 0.048212889581918716} 11/07/2021 12:19:31 - INFO - __main__ - Step 107173: {'lr': 9.638775915660644e-05, 'samples': 20577216, 'steps': 107172, 'loss/train': 0.6115336418151855} 11/07/2021 12:19:31 - INFO - __main__ - Step 107174: {'lr': 9.638357240038936e-05, 'samples': 20577408, 'steps': 107173, 'loss/train': 1.0920225381851196} 11/07/2021 12:19:31 - INFO - __main__ - Step 107175: {'lr': 9.637938571338947e-05, 'samples': 20577600, 'steps': 107174, 'loss/train': 1.158583164215088} 11/07/2021 12:19:32 - INFO - __main__ - Step 107176: {'lr': 9.637519909560869e-05, 'samples': 20577792, 'steps': 107175, 'loss/train': 1.3850821256637573} 11/07/2021 12:19:33 - INFO - __main__ - Step 107177: {'lr': 9.637101254704882e-05, 'samples': 20577984, 'steps': 107176, 'loss/train': 1.1193475723266602} 11/07/2021 12:19:33 - INFO - __main__ - Step 107178: {'lr': 9.636682606771185e-05, 'samples': 20578176, 'steps': 107177, 'loss/train': 0.8607202172279358} 11/07/2021 12:19:33 - INFO - __main__ - Step 107179: {'lr': 9.636263965759959e-05, 'samples': 20578368, 'steps': 107178, 'loss/train': 1.4523723125457764} 11/07/2021 12:19:34 - INFO - __main__ - Step 107180: {'lr': 9.635845331671397e-05, 'samples': 20578560, 'steps': 107179, 'loss/train': 1.2991759777069092} 11/07/2021 12:19:35 - INFO - __main__ - Step 107181: {'lr': 9.635426704505684e-05, 'samples': 20578752, 'steps': 107180, 'loss/train': 1.5680257081985474} 11/07/2021 12:19:35 - INFO - __main__ - Step 107182: {'lr': 9.635008084263014e-05, 'samples': 20578944, 'steps': 107181, 'loss/train': 1.2753654718399048} 11/07/2021 12:19:35 - INFO - __main__ - Step 107183: {'lr': 9.634589470943569e-05, 'samples': 20579136, 'steps': 107182, 'loss/train': 1.0320011377334595} 11/07/2021 12:19:36 - INFO - __main__ - Step 107184: {'lr': 9.634170864547542e-05, 'samples': 20579328, 'steps': 107183, 'loss/train': 1.1060771942138672} 11/07/2021 12:19:36 - INFO - __main__ - Step 107185: {'lr': 9.633752265075122e-05, 'samples': 20579520, 'steps': 107184, 'loss/train': 1.3500454425811768} 11/07/2021 12:19:37 - INFO - __main__ - Step 107186: {'lr': 9.633333672526493e-05, 'samples': 20579712, 'steps': 107185, 'loss/train': 1.1761083602905273} 11/07/2021 12:19:38 - INFO - __main__ - Step 107187: {'lr': 9.632915086901858e-05, 'samples': 20579904, 'steps': 107186, 'loss/train': 0.692209005355835} 11/07/2021 12:19:38 - INFO - __main__ - Step 107188: {'lr': 9.632496508201382e-05, 'samples': 20580096, 'steps': 107187, 'loss/train': 1.4420448541641235} 11/07/2021 12:19:38 - INFO - __main__ - Step 107189: {'lr': 9.632077936425271e-05, 'samples': 20580288, 'steps': 107188, 'loss/train': 1.154921531677246} 11/07/2021 12:19:39 - INFO - __main__ - Step 107190: {'lr': 9.631659371573706e-05, 'samples': 20580480, 'steps': 107189, 'loss/train': 1.6022285223007202} 11/07/2021 12:19:39 - INFO - __main__ - Step 107191: {'lr': 9.631240813646878e-05, 'samples': 20580672, 'steps': 107190, 'loss/train': 1.7194974422454834} 11/07/2021 12:19:40 - INFO - __main__ - Step 107192: {'lr': 9.630822262644976e-05, 'samples': 20580864, 'steps': 107191, 'loss/train': 1.4478813409805298} 11/07/2021 12:19:40 - INFO - __main__ - Step 107193: {'lr': 9.630403718568187e-05, 'samples': 20581056, 'steps': 107192, 'loss/train': 0.4400597810745239} 11/07/2021 12:19:41 - INFO - __main__ - Step 107194: {'lr': 9.629985181416703e-05, 'samples': 20581248, 'steps': 107193, 'loss/train': 1.3612574338912964} 11/07/2021 12:19:41 - INFO - __main__ - Step 107195: {'lr': 9.62956665119071e-05, 'samples': 20581440, 'steps': 107194, 'loss/train': 1.4402204751968384} 11/07/2021 12:19:41 - INFO - __main__ - Step 107196: {'lr': 9.629148127890397e-05, 'samples': 20581632, 'steps': 107195, 'loss/train': 1.5134503841400146} 11/07/2021 12:19:43 - INFO - __main__ - Step 107197: {'lr': 9.628729611515951e-05, 'samples': 20581824, 'steps': 107196, 'loss/train': 1.5725375413894653} 11/07/2021 12:19:43 - INFO - __main__ - Step 107198: {'lr': 9.628311102067566e-05, 'samples': 20582016, 'steps': 107197, 'loss/train': 1.4012877941131592} 11/07/2021 12:19:43 - INFO - __main__ - Step 107199: {'lr': 9.627892599545424e-05, 'samples': 20582208, 'steps': 107198, 'loss/train': 1.2915940284729004} 11/07/2021 12:19:44 - INFO - __main__ - Step 107200: {'lr': 9.627474103949727e-05, 'samples': 20582400, 'steps': 107199, 'loss/train': 1.3727110624313354} 11/07/2021 12:19:44 - INFO - __main__ - Step 107201: {'lr': 9.627055615280641e-05, 'samples': 20582592, 'steps': 107200, 'loss/train': 1.251523494720459} 11/07/2021 12:19:45 - INFO - __main__ - Step 107202: {'lr': 9.626637133538368e-05, 'samples': 20582784, 'steps': 107201, 'loss/train': 1.4146682024002075} 11/07/2021 12:19:46 - INFO - __main__ - Step 107203: {'lr': 9.626218658723096e-05, 'samples': 20582976, 'steps': 107202, 'loss/train': 1.2899764776229858} 11/07/2021 12:19:46 - INFO - __main__ - Step 107204: {'lr': 9.625800190835013e-05, 'samples': 20583168, 'steps': 107203, 'loss/train': 0.8480256795883179} 11/07/2021 12:19:46 - INFO - __main__ - Step 107205: {'lr': 9.625381729874308e-05, 'samples': 20583360, 'steps': 107204, 'loss/train': 1.5200258493423462} 11/07/2021 12:19:47 - INFO - __main__ - Step 107206: {'lr': 9.624963275841167e-05, 'samples': 20583552, 'steps': 107205, 'loss/train': 1.3786777257919312} 11/07/2021 12:19:48 - INFO - __main__ - Step 107207: {'lr': 9.62454482873578e-05, 'samples': 20583744, 'steps': 107206, 'loss/train': 1.1373553276062012} 11/07/2021 12:19:48 - INFO - __main__ - Step 107208: {'lr': 9.624126388558335e-05, 'samples': 20583936, 'steps': 107207, 'loss/train': 1.4176992177963257} 11/07/2021 12:19:48 - INFO - __main__ - Step 107209: {'lr': 9.623707955309025e-05, 'samples': 20584128, 'steps': 107208, 'loss/train': 0.846172571182251} 11/07/2021 12:19:49 - INFO - __main__ - Step 107210: {'lr': 9.62328952898803e-05, 'samples': 20584320, 'steps': 107209, 'loss/train': 1.4506512880325317} 11/07/2021 12:19:49 - INFO - __main__ - Step 107211: {'lr': 9.622871109595546e-05, 'samples': 20584512, 'steps': 107210, 'loss/train': 1.4889612197875977} 11/07/2021 12:19:50 - INFO - __main__ - Step 107212: {'lr': 9.62245269713176e-05, 'samples': 20584704, 'steps': 107211, 'loss/train': 1.3805192708969116} 11/07/2021 12:19:51 - INFO - __main__ - Step 107213: {'lr': 9.622034291596868e-05, 'samples': 20584896, 'steps': 107212, 'loss/train': 1.2802011966705322} 11/07/2021 12:19:51 - INFO - __main__ - Step 107214: {'lr': 9.62161589299104e-05, 'samples': 20585088, 'steps': 107213, 'loss/train': 1.0274782180786133} 11/07/2021 12:19:51 - INFO - __main__ - Step 107215: {'lr': 9.621197501314474e-05, 'samples': 20585280, 'steps': 107214, 'loss/train': 0.9079883098602295} 11/07/2021 12:19:52 - INFO - __main__ - Step 107216: {'lr': 9.62077911656736e-05, 'samples': 20585472, 'steps': 107215, 'loss/train': 1.3784680366516113} 11/07/2021 12:19:52 - INFO - __main__ - Step 107217: {'lr': 9.620360738749886e-05, 'samples': 20585664, 'steps': 107216, 'loss/train': 0.8400434255599976} 11/07/2021 12:19:53 - INFO - __main__ - Step 107218: {'lr': 9.619942367862241e-05, 'samples': 20585856, 'steps': 107217, 'loss/train': 1.1230416297912598} 11/07/2021 12:19:53 - INFO - __main__ - Step 107219: {'lr': 9.619524003904612e-05, 'samples': 20586048, 'steps': 107218, 'loss/train': 1.4267948865890503} 11/07/2021 12:19:54 - INFO - __main__ - Step 107220: {'lr': 9.619105646877188e-05, 'samples': 20586240, 'steps': 107219, 'loss/train': 1.4499256610870361} 11/07/2021 12:19:54 - INFO - __main__ - Step 107221: {'lr': 9.618687296780157e-05, 'samples': 20586432, 'steps': 107220, 'loss/train': 1.5640833377838135} 11/07/2021 12:19:54 - INFO - __main__ - Step 107222: {'lr': 9.61826895361371e-05, 'samples': 20586624, 'steps': 107221, 'loss/train': 1.2500163316726685} 11/07/2021 12:19:56 - INFO - __main__ - Step 107223: {'lr': 9.617850617378041e-05, 'samples': 20586816, 'steps': 107222, 'loss/train': 0.9623971581459045} 11/07/2021 12:19:56 - INFO - __main__ - Step 107224: {'lr': 9.617432288073322e-05, 'samples': 20587008, 'steps': 107223, 'loss/train': 2.445042610168457} 11/07/2021 12:19:56 - INFO - __main__ - Step 107225: {'lr': 9.61701396569975e-05, 'samples': 20587200, 'steps': 107224, 'loss/train': 1.3849287033081055} 11/07/2021 12:19:57 - INFO - __main__ - Step 107226: {'lr': 9.616595650257514e-05, 'samples': 20587392, 'steps': 107225, 'loss/train': 1.2431451082229614} 11/07/2021 12:19:57 - INFO - __main__ - Step 107227: {'lr': 9.616177341746807e-05, 'samples': 20587584, 'steps': 107226, 'loss/train': 0.892318069934845} 11/07/2021 12:19:58 - INFO - __main__ - Step 107228: {'lr': 9.61575904016781e-05, 'samples': 20587776, 'steps': 107227, 'loss/train': 1.2967692613601685} 11/07/2021 12:19:58 - INFO - __main__ - Step 107229: {'lr': 9.615340745520712e-05, 'samples': 20587968, 'steps': 107228, 'loss/train': 1.6126527786254883} 11/07/2021 12:19:59 - INFO - __main__ - Step 107230: {'lr': 9.614922457805708e-05, 'samples': 20588160, 'steps': 107229, 'loss/train': 0.8086864352226257} 11/07/2021 12:19:59 - INFO - __main__ - Step 107231: {'lr': 9.614504177022981e-05, 'samples': 20588352, 'steps': 107230, 'loss/train': 1.1572521924972534} 11/07/2021 12:19:59 - INFO - __main__ - Step 107232: {'lr': 9.61408590317272e-05, 'samples': 20588544, 'steps': 107231, 'loss/train': 1.4963102340698242} 11/07/2021 12:20:01 - INFO - __main__ - Step 107233: {'lr': 9.613667636255116e-05, 'samples': 20588736, 'steps': 107232, 'loss/train': 1.4880386590957642} 11/07/2021 12:20:01 - INFO - __main__ - Step 107234: {'lr': 9.613249376270364e-05, 'samples': 20588928, 'steps': 107233, 'loss/train': 1.62791907787323} 11/07/2021 12:20:01 - INFO - __main__ - Step 107235: {'lr': 9.612831123218638e-05, 'samples': 20589120, 'steps': 107234, 'loss/train': 1.7870662212371826} 11/07/2021 12:20:02 - INFO - __main__ - Step 107236: {'lr': 9.61241287710013e-05, 'samples': 20589312, 'steps': 107235, 'loss/train': 4.983157634735107} 11/07/2021 12:20:02 - INFO - __main__ - Step 107237: {'lr': 9.61199463791503e-05, 'samples': 20589504, 'steps': 107236, 'loss/train': 1.4331364631652832} 11/07/2021 12:20:02 - INFO - __main__ - Step 107238: {'lr': 9.611576405663533e-05, 'samples': 20589696, 'steps': 107237, 'loss/train': 5.683313846588135} 11/07/2021 12:20:03 - INFO - __main__ - Step 107239: {'lr': 9.61115818034582e-05, 'samples': 20589888, 'steps': 107238, 'loss/train': 1.2796381711959839} 11/07/2021 12:20:04 - INFO - __main__ - Step 107240: {'lr': 9.610739961962078e-05, 'samples': 20590080, 'steps': 107239, 'loss/train': 1.601725459098816} 11/07/2021 12:20:04 - INFO - __main__ - Step 107241: {'lr': 9.610321750512502e-05, 'samples': 20590272, 'steps': 107240, 'loss/train': 1.425428867340088} 11/07/2021 12:20:05 - INFO - __main__ - Step 107242: {'lr': 9.609903545997278e-05, 'samples': 20590464, 'steps': 107241, 'loss/train': 1.5431160926818848} 11/07/2021 12:20:05 - INFO - __main__ - Step 107243: {'lr': 9.609485348416594e-05, 'samples': 20590656, 'steps': 107242, 'loss/train': 1.6872004270553589} 11/07/2021 12:20:05 - INFO - __main__ - Step 107244: {'lr': 9.609067157770638e-05, 'samples': 20590848, 'steps': 107243, 'loss/train': 0.96229487657547} 11/07/2021 12:20:06 - INFO - __main__ - Step 107245: {'lr': 9.608648974059606e-05, 'samples': 20591040, 'steps': 107244, 'loss/train': 1.496787428855896} 11/07/2021 12:20:07 - INFO - __main__ - Step 107246: {'lr': 9.608230797283673e-05, 'samples': 20591232, 'steps': 107245, 'loss/train': 1.3918991088867188} 11/07/2021 12:20:07 - INFO - __main__ - Step 107247: {'lr': 9.607812627443032e-05, 'samples': 20591424, 'steps': 107246, 'loss/train': 0.9004878997802734} 11/07/2021 12:20:07 - INFO - __main__ - Step 107248: {'lr': 9.607394464537875e-05, 'samples': 20591616, 'steps': 107247, 'loss/train': 1.079949975013733} 11/07/2021 12:20:08 - INFO - __main__ - Step 107249: {'lr': 9.606976308568385e-05, 'samples': 20591808, 'steps': 107248, 'loss/train': 1.4776335954666138} 11/07/2021 12:20:09 - INFO - __main__ - Step 107250: {'lr': 9.606558159534756e-05, 'samples': 20592000, 'steps': 107249, 'loss/train': 1.2627326250076294} 11/07/2021 12:20:09 - INFO - __main__ - Step 107251: {'lr': 9.606140017437176e-05, 'samples': 20592192, 'steps': 107250, 'loss/train': 1.3409780263900757} 11/07/2021 12:20:10 - INFO - __main__ - Step 107252: {'lr': 9.605721882275831e-05, 'samples': 20592384, 'steps': 107251, 'loss/train': 1.4769923686981201} 11/07/2021 12:20:10 - INFO - __main__ - Step 107253: {'lr': 9.60530375405091e-05, 'samples': 20592576, 'steps': 107252, 'loss/train': 1.5291857719421387} 11/07/2021 12:20:10 - INFO - __main__ - Step 107254: {'lr': 9.6048856327626e-05, 'samples': 20592768, 'steps': 107253, 'loss/train': 1.0251829624176025} 11/07/2021 12:20:11 - INFO - __main__ - Step 107255: {'lr': 9.604467518411092e-05, 'samples': 20592960, 'steps': 107254, 'loss/train': 1.606110692024231} 11/07/2021 12:20:12 - INFO - __main__ - Step 107256: {'lr': 9.604049410996582e-05, 'samples': 20593152, 'steps': 107255, 'loss/train': 1.2824679613113403} 11/07/2021 12:20:12 - INFO - __main__ - Step 107257: {'lr': 9.60363131051924e-05, 'samples': 20593344, 'steps': 107256, 'loss/train': 0.8905511498451233} 11/07/2021 12:20:12 - INFO - __main__ - Step 107258: {'lr': 9.603213216979268e-05, 'samples': 20593536, 'steps': 107257, 'loss/train': 1.1611367464065552} 11/07/2021 12:20:13 - INFO - __main__ - Step 107259: {'lr': 9.602795130376846e-05, 'samples': 20593728, 'steps': 107258, 'loss/train': 1.6161725521087646} 11/07/2021 12:20:14 - INFO - __main__ - Step 107260: {'lr': 9.602377050712169e-05, 'samples': 20593920, 'steps': 107259, 'loss/train': 1.4435462951660156} 11/07/2021 12:20:14 - INFO - __main__ - Step 107261: {'lr': 9.601958977985423e-05, 'samples': 20594112, 'steps': 107260, 'loss/train': 1.6753019094467163} 11/07/2021 12:20:14 - INFO - __main__ - Step 107262: {'lr': 9.601540912196796e-05, 'samples': 20594304, 'steps': 107261, 'loss/train': 1.288649320602417} 11/07/2021 12:20:15 - INFO - __main__ - Step 107263: {'lr': 9.601122853346478e-05, 'samples': 20594496, 'steps': 107262, 'loss/train': 1.3823508024215698} 11/07/2021 12:20:15 - INFO - __main__ - Step 107264: {'lr': 9.600704801434657e-05, 'samples': 20594688, 'steps': 107263, 'loss/train': 1.4625802040100098} 11/07/2021 12:20:16 - INFO - __main__ - Step 107265: {'lr': 9.600286756461521e-05, 'samples': 20594880, 'steps': 107264, 'loss/train': 1.4259002208709717} 11/07/2021 12:20:17 - INFO - __main__ - Step 107266: {'lr': 9.599868718427256e-05, 'samples': 20595072, 'steps': 107265, 'loss/train': 0.6631815433502197} 11/07/2021 12:20:17 - INFO - __main__ - Step 107267: {'lr': 9.599450687332062e-05, 'samples': 20595264, 'steps': 107266, 'loss/train': 1.5132358074188232} 11/07/2021 12:20:17 - INFO - __main__ - Step 107268: {'lr': 9.59903266317611e-05, 'samples': 20595456, 'steps': 107267, 'loss/train': 1.4387085437774658} 11/07/2021 12:20:18 - INFO - __main__ - Step 107269: {'lr': 9.598614645959597e-05, 'samples': 20595648, 'steps': 107268, 'loss/train': 1.257895588874817} 11/07/2021 12:20:19 - INFO - __main__ - Step 107270: {'lr': 9.598196635682707e-05, 'samples': 20595840, 'steps': 107269, 'loss/train': 2.3514134883880615} 11/07/2021 12:20:19 - INFO - __main__ - Step 107271: {'lr': 9.597778632345636e-05, 'samples': 20596032, 'steps': 107270, 'loss/train': 1.7813018560409546} 11/07/2021 12:20:19 - INFO - __main__ - Step 107272: {'lr': 9.597360635948565e-05, 'samples': 20596224, 'steps': 107271, 'loss/train': 1.1184436082839966} 11/07/2021 12:20:20 - INFO - __main__ - Step 107273: {'lr': 9.596942646491688e-05, 'samples': 20596416, 'steps': 107272, 'loss/train': 1.341354250907898} 11/07/2021 12:20:20 - INFO - __main__ - Step 107274: {'lr': 9.59652466397519e-05, 'samples': 20596608, 'steps': 107273, 'loss/train': 1.1636401414871216} 11/07/2021 12:20:21 - INFO - __main__ - Step 107275: {'lr': 9.596106688399262e-05, 'samples': 20596800, 'steps': 107274, 'loss/train': 0.9919711947441101} 11/07/2021 12:20:21 - INFO - __main__ - Step 107276: {'lr': 9.595688719764087e-05, 'samples': 20596992, 'steps': 107275, 'loss/train': 1.4070559740066528} 11/07/2021 12:20:22 - INFO - __main__ - Step 107277: {'lr': 9.59527075806986e-05, 'samples': 20597184, 'steps': 107276, 'loss/train': 1.4098877906799316} 11/07/2021 12:20:22 - INFO - __main__ - Step 107278: {'lr': 9.594852803316764e-05, 'samples': 20597376, 'steps': 107277, 'loss/train': 1.2486088275909424} 11/07/2021 12:20:22 - INFO - __main__ - Step 107279: {'lr': 9.594434855504991e-05, 'samples': 20597568, 'steps': 107278, 'loss/train': 0.6597025990486145} 11/07/2021 12:20:24 - INFO - __main__ - Step 107280: {'lr': 9.594016914634728e-05, 'samples': 20597760, 'steps': 107279, 'loss/train': 1.5647239685058594} 11/07/2021 12:20:24 - INFO - __main__ - Step 107281: {'lr': 9.593598980706172e-05, 'samples': 20597952, 'steps': 107280, 'loss/train': 1.7433769702911377} 11/07/2021 12:20:24 - INFO - __main__ - Step 107282: {'lr': 9.593181053719494e-05, 'samples': 20598144, 'steps': 107281, 'loss/train': 1.2441400289535522} 11/07/2021 12:20:25 - INFO - __main__ - Step 107283: {'lr': 9.592763133674892e-05, 'samples': 20598336, 'steps': 107282, 'loss/train': 1.4276244640350342} 11/07/2021 12:20:25 - INFO - __main__ - Step 107284: {'lr': 9.592345220572551e-05, 'samples': 20598528, 'steps': 107283, 'loss/train': 1.3312956094741821} 11/07/2021 12:20:25 - INFO - __main__ - Step 107285: {'lr': 9.591927314412663e-05, 'samples': 20598720, 'steps': 107284, 'loss/train': 0.5307242274284363} 11/07/2021 12:20:26 - INFO - __main__ - Step 107286: {'lr': 9.591509415195413e-05, 'samples': 20598912, 'steps': 107285, 'loss/train': 1.2973579168319702} 11/07/2021 12:20:27 - INFO - __main__ - Step 107287: {'lr': 9.591091522920992e-05, 'samples': 20599104, 'steps': 107286, 'loss/train': 1.4926562309265137} 11/07/2021 12:20:27 - INFO - __main__ - Step 107288: {'lr': 9.590673637589586e-05, 'samples': 20599296, 'steps': 107287, 'loss/train': 0.9402908086776733} 11/07/2021 12:20:27 - INFO - __main__ - Step 107289: {'lr': 9.590255759201388e-05, 'samples': 20599488, 'steps': 107288, 'loss/train': 0.7516981363296509} 11/07/2021 12:20:28 - INFO - __main__ - Step 107290: {'lr': 9.58983788775658e-05, 'samples': 20599680, 'steps': 107289, 'loss/train': 1.0248734951019287} 11/07/2021 12:20:29 - INFO - __main__ - Step 107291: {'lr': 9.589420023255355e-05, 'samples': 20599872, 'steps': 107290, 'loss/train': 1.458762764930725} 11/07/2021 12:20:29 - INFO - __main__ - Step 107292: {'lr': 9.589002165697899e-05, 'samples': 20600064, 'steps': 107291, 'loss/train': 1.2252706289291382} 11/07/2021 12:20:29 - INFO - __main__ - Step 107293: {'lr': 9.5885843150844e-05, 'samples': 20600256, 'steps': 107292, 'loss/train': 1.4706848859786987} 11/07/2021 12:20:30 - INFO - __main__ - Step 107294: {'lr': 9.588166471415058e-05, 'samples': 20600448, 'steps': 107293, 'loss/train': 1.6438279151916504} 11/07/2021 12:20:30 - INFO - __main__ - Step 107295: {'lr': 9.58774863469004e-05, 'samples': 20600640, 'steps': 107294, 'loss/train': 1.573608636856079} 11/07/2021 12:20:31 - INFO - __main__ - Step 107296: {'lr': 9.587330804909544e-05, 'samples': 20600832, 'steps': 107295, 'loss/train': 1.4336079359054565} 11/07/2021 12:20:32 - INFO - __main__ - Step 107297: {'lr': 9.586912982073762e-05, 'samples': 20601024, 'steps': 107296, 'loss/train': 1.0817562341690063} 11/07/2021 12:20:32 - INFO - __main__ - Step 107298: {'lr': 9.586495166182877e-05, 'samples': 20601216, 'steps': 107297, 'loss/train': 1.5986346006393433} 11/07/2021 12:20:32 - INFO - __main__ - Step 107299: {'lr': 9.586077357237077e-05, 'samples': 20601408, 'steps': 107298, 'loss/train': 0.26396721601486206} 11/07/2021 12:20:33 - INFO - __main__ - Step 107300: {'lr': 9.585659555236556e-05, 'samples': 20601600, 'steps': 107299, 'loss/train': 1.4883291721343994} 11/07/2021 12:20:34 - INFO - __main__ - Step 107301: {'lr': 9.585241760181499e-05, 'samples': 20601792, 'steps': 107300, 'loss/train': 1.30563223361969} 11/07/2021 12:20:34 - INFO - __main__ - Step 107302: {'lr': 9.584823972072093e-05, 'samples': 20601984, 'steps': 107301, 'loss/train': 1.3040467500686646} 11/07/2021 12:20:34 - INFO - __main__ - Step 107303: {'lr': 9.584406190908527e-05, 'samples': 20602176, 'steps': 107302, 'loss/train': 0.9310728907585144} 11/07/2021 12:20:35 - INFO - __main__ - Step 107304: {'lr': 9.58398841669099e-05, 'samples': 20602368, 'steps': 107303, 'loss/train': 0.6637000441551208} 11/07/2021 12:20:35 - INFO - __main__ - Step 107305: {'lr': 9.58357064941967e-05, 'samples': 20602560, 'steps': 107304, 'loss/train': 1.5332497358322144} 11/07/2021 12:20:36 - INFO - __main__ - Step 107306: {'lr': 9.583152889094757e-05, 'samples': 20602752, 'steps': 107305, 'loss/train': 1.4002043008804321} 11/07/2021 12:20:36 - INFO - __main__ - Step 107307: {'lr': 9.582735135716437e-05, 'samples': 20602944, 'steps': 107306, 'loss/train': 1.3669893741607666} 11/07/2021 12:20:37 - INFO - __main__ - Step 107308: {'lr': 9.582317389284903e-05, 'samples': 20603136, 'steps': 107307, 'loss/train': 1.7829375267028809} 11/07/2021 12:20:37 - INFO - __main__ - Step 107309: {'lr': 9.581899649800335e-05, 'samples': 20603328, 'steps': 107308, 'loss/train': 0.7193642258644104} 11/07/2021 12:20:37 - INFO - __main__ - Step 107310: {'lr': 9.581481917262924e-05, 'samples': 20603520, 'steps': 107309, 'loss/train': 1.6104660034179688} 11/07/2021 12:20:39 - INFO - __main__ - Step 107311: {'lr': 9.581064191672859e-05, 'samples': 20603712, 'steps': 107310, 'loss/train': 0.2627597153186798} 11/07/2021 12:20:39 - INFO - __main__ - Step 107312: {'lr': 9.580646473030327e-05, 'samples': 20603904, 'steps': 107311, 'loss/train': 1.455505132675171} 11/07/2021 12:20:39 - INFO - __main__ - Step 107313: {'lr': 9.580228761335519e-05, 'samples': 20604096, 'steps': 107312, 'loss/train': 1.5611659288406372} 11/07/2021 12:20:40 - INFO - __main__ - Step 107314: {'lr': 9.579811056588622e-05, 'samples': 20604288, 'steps': 107313, 'loss/train': 0.13782203197479248} 11/07/2021 12:20:40 - INFO - __main__ - Step 107315: {'lr': 9.579393358789825e-05, 'samples': 20604480, 'steps': 107314, 'loss/train': 1.3394923210144043} 11/07/2021 12:20:40 - INFO - __main__ - Step 107316: {'lr': 9.578975667939316e-05, 'samples': 20604672, 'steps': 107315, 'loss/train': 0.895201563835144} 11/07/2021 12:20:42 - INFO - __main__ - Step 107317: {'lr': 9.578557984037281e-05, 'samples': 20604864, 'steps': 107316, 'loss/train': 1.0318148136138916} 11/07/2021 12:20:42 - INFO - __main__ - Step 107318: {'lr': 9.57814030708391e-05, 'samples': 20605056, 'steps': 107317, 'loss/train': 1.4497287273406982} 11/07/2021 12:20:42 - INFO - __main__ - Step 107319: {'lr': 9.57772263707939e-05, 'samples': 20605248, 'steps': 107318, 'loss/train': 1.5497924089431763} 11/07/2021 12:20:43 - INFO - __main__ - Step 107320: {'lr': 9.577304974023911e-05, 'samples': 20605440, 'steps': 107319, 'loss/train': 1.089977741241455} 11/07/2021 12:20:43 - INFO - __main__ - Step 107321: {'lr': 9.57688731791767e-05, 'samples': 20605632, 'steps': 107320, 'loss/train': 0.8861398100852966} 11/07/2021 12:20:44 - INFO - __main__ - Step 107322: {'lr': 9.576469668760837e-05, 'samples': 20605824, 'steps': 107321, 'loss/train': 1.3983078002929688} 11/07/2021 12:20:45 - INFO - __main__ - Step 107323: {'lr': 9.576052026553609e-05, 'samples': 20606016, 'steps': 107322, 'loss/train': 1.371787428855896} 11/07/2021 12:20:45 - INFO - __main__ - Step 107324: {'lr': 9.57563439129617e-05, 'samples': 20606208, 'steps': 107323, 'loss/train': 1.4617819786071777} 11/07/2021 12:20:45 - INFO - __main__ - Step 107325: {'lr': 9.575216762988717e-05, 'samples': 20606400, 'steps': 107324, 'loss/train': 1.2111475467681885} 11/07/2021 12:20:46 - INFO - __main__ - Step 107326: {'lr': 9.574799141631432e-05, 'samples': 20606592, 'steps': 107325, 'loss/train': 1.743431568145752} 11/07/2021 12:20:47 - INFO - __main__ - Step 107327: {'lr': 9.574381527224501e-05, 'samples': 20606784, 'steps': 107326, 'loss/train': 1.4494572877883911} 11/07/2021 12:20:47 - INFO - __main__ - Step 107328: {'lr': 9.57396391976812e-05, 'samples': 20606976, 'steps': 107327, 'loss/train': 1.8317075967788696} 11/07/2021 12:20:47 - INFO - __main__ - Step 107329: {'lr': 9.573546319262472e-05, 'samples': 20607168, 'steps': 107328, 'loss/train': 0.7331166863441467} 11/07/2021 12:20:48 - INFO - __main__ - Step 107330: {'lr': 9.573128725707744e-05, 'samples': 20607360, 'steps': 107329, 'loss/train': 0.9391469359397888} 11/07/2021 12:20:48 - INFO - __main__ - Step 107331: {'lr': 9.572711139104129e-05, 'samples': 20607552, 'steps': 107330, 'loss/train': 1.7270458936691284} 11/07/2021 12:20:49 - INFO - __main__ - Step 107332: {'lr': 9.572293559451811e-05, 'samples': 20607744, 'steps': 107331, 'loss/train': 1.0872670412063599} 11/07/2021 12:20:50 - INFO - __main__ - Step 107333: {'lr': 9.57187598675098e-05, 'samples': 20607936, 'steps': 107332, 'loss/train': 1.2612922191619873} 11/07/2021 12:20:50 - INFO - __main__ - Step 107334: {'lr': 9.571458421001822e-05, 'samples': 20608128, 'steps': 107333, 'loss/train': 1.6463611125946045} 11/07/2021 12:20:50 - INFO - __main__ - Step 107335: {'lr': 9.571040862204536e-05, 'samples': 20608320, 'steps': 107334, 'loss/train': 1.3187963962554932} 11/07/2021 12:20:51 - INFO - __main__ - Step 107336: {'lr': 9.570623310359291e-05, 'samples': 20608512, 'steps': 107335, 'loss/train': 0.6526712775230408} 11/07/2021 12:20:51 - INFO - __main__ - Step 107337: {'lr': 9.570205765466289e-05, 'samples': 20608704, 'steps': 107336, 'loss/train': 0.5152400732040405} 11/07/2021 12:20:52 - INFO - __main__ - Step 107338: {'lr': 9.569788227525711e-05, 'samples': 20608896, 'steps': 107337, 'loss/train': 1.036765217781067} 11/07/2021 12:20:52 - INFO - __main__ - Step 107339: {'lr': 9.56937069653775e-05, 'samples': 20609088, 'steps': 107338, 'loss/train': 1.6235923767089844} 11/07/2021 12:20:53 - INFO - __main__ - Step 107340: {'lr': 9.568953172502589e-05, 'samples': 20609280, 'steps': 107339, 'loss/train': 1.0540646314620972} 11/07/2021 12:20:53 - INFO - __main__ - Step 107341: {'lr': 9.568535655420424e-05, 'samples': 20609472, 'steps': 107340, 'loss/train': 2.162351369857788} 11/07/2021 12:20:53 - INFO - __main__ - Step 107342: {'lr': 9.568118145291437e-05, 'samples': 20609664, 'steps': 107341, 'loss/train': 1.3694220781326294} 11/07/2021 12:20:54 - INFO - __main__ - Step 107343: {'lr': 9.567700642115817e-05, 'samples': 20609856, 'steps': 107342, 'loss/train': 0.9888833165168762} 11/07/2021 12:20:55 - INFO - __main__ - Step 107344: {'lr': 9.567283145893754e-05, 'samples': 20610048, 'steps': 107343, 'loss/train': 1.0382084846496582} 11/07/2021 12:20:55 - INFO - __main__ - Step 107345: {'lr': 9.566865656625435e-05, 'samples': 20610240, 'steps': 107344, 'loss/train': 0.8174879550933838} 11/07/2021 12:20:56 - INFO - __main__ - Step 107346: {'lr': 9.566448174311049e-05, 'samples': 20610432, 'steps': 107345, 'loss/train': 1.024678349494934} 11/07/2021 12:20:56 - INFO - __main__ - Step 107347: {'lr': 9.56603069895078e-05, 'samples': 20610624, 'steps': 107346, 'loss/train': 1.1329008340835571} 11/07/2021 12:20:57 - INFO - __main__ - Step 107348: {'lr': 9.565613230544832e-05, 'samples': 20610816, 'steps': 107347, 'loss/train': 0.17963895201683044} 11/07/2021 12:20:57 - INFO - __main__ - Step 107349: {'lr': 9.565195769093371e-05, 'samples': 20611008, 'steps': 107348, 'loss/train': 1.3743785619735718} 11/07/2021 12:20:58 - INFO - __main__ - Step 107350: {'lr': 9.564778314596592e-05, 'samples': 20611200, 'steps': 107349, 'loss/train': 0.45712170004844666} 11/07/2021 12:20:58 - INFO - __main__ - Step 107351: {'lr': 9.564360867054689e-05, 'samples': 20611392, 'steps': 107350, 'loss/train': 1.3102657794952393} 11/07/2021 12:20:58 - INFO - __main__ - Step 107352: {'lr': 9.563943426467844e-05, 'samples': 20611584, 'steps': 107351, 'loss/train': 1.3147263526916504} 11/07/2021 12:21:00 - INFO - __main__ - Step 107353: {'lr': 9.56352599283625e-05, 'samples': 20611776, 'steps': 107352, 'loss/train': 1.546716570854187} 11/07/2021 12:21:00 - INFO - __main__ - Step 107354: {'lr': 9.563108566160092e-05, 'samples': 20611968, 'steps': 107353, 'loss/train': 1.509513258934021} 11/07/2021 12:21:00 - INFO - __main__ - Step 107355: {'lr': 9.562691146439559e-05, 'samples': 20612160, 'steps': 107354, 'loss/train': 1.3518158197402954} 11/07/2021 12:21:01 - INFO - __main__ - Step 107356: {'lr': 9.56227373367484e-05, 'samples': 20612352, 'steps': 107355, 'loss/train': 1.4659587144851685} 11/07/2021 12:21:01 - INFO - __main__ - Step 107357: {'lr': 9.56185632786612e-05, 'samples': 20612544, 'steps': 107356, 'loss/train': 1.3083428144454956} 11/07/2021 12:21:02 - INFO - __main__ - Step 107358: {'lr': 9.561438929013592e-05, 'samples': 20612736, 'steps': 107357, 'loss/train': 1.3390588760375977} 11/07/2021 12:21:02 - INFO - __main__ - Step 107359: {'lr': 9.56102153711744e-05, 'samples': 20612928, 'steps': 107358, 'loss/train': 1.5644161701202393} 11/07/2021 12:21:03 - INFO - __main__ - Step 107360: {'lr': 9.560604152177855e-05, 'samples': 20613120, 'steps': 107359, 'loss/train': 1.3198485374450684} 11/07/2021 12:21:03 - INFO - __main__ - Step 107361: {'lr': 9.560186774195028e-05, 'samples': 20613312, 'steps': 107360, 'loss/train': 1.391990065574646} 11/07/2021 12:21:03 - INFO - __main__ - Step 107362: {'lr': 9.559769403169138e-05, 'samples': 20613504, 'steps': 107361, 'loss/train': 1.369698405265808} 11/07/2021 12:21:04 - INFO - __main__ - Step 107363: {'lr': 9.559352039100377e-05, 'samples': 20613696, 'steps': 107362, 'loss/train': 1.3217496871948242} 11/07/2021 12:21:05 - INFO - __main__ - Step 107364: {'lr': 9.558934681988935e-05, 'samples': 20613888, 'steps': 107363, 'loss/train': 1.6532548666000366} 11/07/2021 12:21:05 - INFO - __main__ - Step 107365: {'lr': 9.558517331834996e-05, 'samples': 20614080, 'steps': 107364, 'loss/train': 1.4274871349334717} 11/07/2021 12:21:05 - INFO - __main__ - Step 107366: {'lr': 9.558099988638752e-05, 'samples': 20614272, 'steps': 107365, 'loss/train': 1.3938707113265991} 11/07/2021 12:21:06 - INFO - __main__ - Step 107367: {'lr': 9.55768265240039e-05, 'samples': 20614464, 'steps': 107366, 'loss/train': 1.2056691646575928} 11/07/2021 12:21:07 - INFO - __main__ - Step 107368: {'lr': 9.557265323120096e-05, 'samples': 20614656, 'steps': 107367, 'loss/train': 1.3790380954742432} 11/07/2021 12:21:07 - INFO - __main__ - Step 107369: {'lr': 9.556848000798063e-05, 'samples': 20614848, 'steps': 107368, 'loss/train': 1.3919137716293335} 11/07/2021 12:21:08 - INFO - __main__ - Step 107370: {'lr': 9.556430685434474e-05, 'samples': 20615040, 'steps': 107369, 'loss/train': 1.7112927436828613} 11/07/2021 12:21:08 - INFO - __main__ - Step 107371: {'lr': 9.556013377029519e-05, 'samples': 20615232, 'steps': 107370, 'loss/train': 1.6820627450942993} 11/07/2021 12:21:08 - INFO - __main__ - Step 107372: {'lr': 9.555596075583386e-05, 'samples': 20615424, 'steps': 107371, 'loss/train': 0.984386682510376} 11/07/2021 12:21:09 - INFO - __main__ - Step 107373: {'lr': 9.555178781096266e-05, 'samples': 20615616, 'steps': 107372, 'loss/train': 1.862674593925476} 11/07/2021 12:21:10 - INFO - __main__ - Step 107374: {'lr': 9.55476149356834e-05, 'samples': 20615808, 'steps': 107373, 'loss/train': 1.391951322555542} 11/07/2021 12:21:10 - INFO - __main__ - Step 107375: {'lr': 9.55434421299981e-05, 'samples': 20616000, 'steps': 107374, 'loss/train': 1.1977970600128174} 11/07/2021 12:21:10 - INFO - __main__ - Step 107376: {'lr': 9.553926939390847e-05, 'samples': 20616192, 'steps': 107375, 'loss/train': 0.9902551174163818} 11/07/2021 12:21:11 - INFO - __main__ - Step 107377: {'lr': 9.553509672741645e-05, 'samples': 20616384, 'steps': 107376, 'loss/train': 1.252519130706787} 11/07/2021 12:21:11 - INFO - __main__ - Step 107378: {'lr': 9.553092413052394e-05, 'samples': 20616576, 'steps': 107377, 'loss/train': 1.1978703737258911} 11/07/2021 12:21:12 - INFO - __main__ - Step 107379: {'lr': 9.552675160323282e-05, 'samples': 20616768, 'steps': 107378, 'loss/train': 1.1620148420333862} 11/07/2021 12:21:13 - INFO - __main__ - Step 107380: {'lr': 9.552257914554494e-05, 'samples': 20616960, 'steps': 107379, 'loss/train': 1.487878680229187} 11/07/2021 12:21:13 - INFO - __main__ - Step 107381: {'lr': 9.55184067574622e-05, 'samples': 20617152, 'steps': 107380, 'loss/train': 1.5090844631195068} 11/07/2021 12:21:13 - INFO - __main__ - Step 107382: {'lr': 9.551423443898649e-05, 'samples': 20617344, 'steps': 107381, 'loss/train': 1.6123782396316528} 11/07/2021 12:21:14 - INFO - __main__ - Step 107383: {'lr': 9.551006219011971e-05, 'samples': 20617536, 'steps': 107382, 'loss/train': 1.8844830989837646} 11/07/2021 12:21:15 - INFO - __main__ - Step 107384: {'lr': 9.550589001086369e-05, 'samples': 20617728, 'steps': 107383, 'loss/train': 1.358521580696106} 11/07/2021 12:21:15 - INFO - __main__ - Step 107385: {'lr': 9.55017179012203e-05, 'samples': 20617920, 'steps': 107384, 'loss/train': 1.1439869403839111} 11/07/2021 12:21:15 - INFO - __main__ - Step 107386: {'lr': 9.54975458611915e-05, 'samples': 20618112, 'steps': 107385, 'loss/train': 1.2638994455337524} 11/07/2021 12:21:16 - INFO - __main__ - Step 107387: {'lr': 9.549337389077908e-05, 'samples': 20618304, 'steps': 107386, 'loss/train': 1.219038724899292} 11/07/2021 12:21:16 - INFO - __main__ - Step 107388: {'lr': 9.548920198998509e-05, 'samples': 20618496, 'steps': 107387, 'loss/train': 1.3344266414642334} 11/07/2021 12:21:17 - INFO - __main__ - Step 107389: {'lr': 9.548503015881118e-05, 'samples': 20618688, 'steps': 107388, 'loss/train': 1.2983713150024414} 11/07/2021 12:21:17 - INFO - __main__ - Step 107390: {'lr': 9.548085839725931e-05, 'samples': 20618880, 'steps': 107389, 'loss/train': 0.8447322249412537} 11/07/2021 12:21:18 - INFO - __main__ - Step 107391: {'lr': 9.547668670533141e-05, 'samples': 20619072, 'steps': 107390, 'loss/train': 1.418135166168213} 11/07/2021 12:21:18 - INFO - __main__ - Step 107392: {'lr': 9.547251508302931e-05, 'samples': 20619264, 'steps': 107391, 'loss/train': 0.6296589970588684} 11/07/2021 12:21:19 - INFO - __main__ - Step 107393: {'lr': 9.546834353035491e-05, 'samples': 20619456, 'steps': 107392, 'loss/train': 1.3298003673553467} 11/07/2021 12:21:19 - INFO - __main__ - Step 107394: {'lr': 9.546417204731012e-05, 'samples': 20619648, 'steps': 107393, 'loss/train': 1.593458652496338} 11/07/2021 12:21:20 - INFO - __main__ - Step 107395: {'lr': 9.546000063389675e-05, 'samples': 20619840, 'steps': 107394, 'loss/train': 1.124403953552246} 11/07/2021 12:21:20 - INFO - __main__ - Step 107396: {'lr': 9.545582929011676e-05, 'samples': 20620032, 'steps': 107395, 'loss/train': 1.5424760580062866} 11/07/2021 12:21:21 - INFO - __main__ - Step 107397: {'lr': 9.545165801597194e-05, 'samples': 20620224, 'steps': 107396, 'loss/train': 1.052950143814087} 11/07/2021 12:21:21 - INFO - __main__ - Step 107398: {'lr': 9.544748681146425e-05, 'samples': 20620416, 'steps': 107397, 'loss/train': 1.3784046173095703} 11/07/2021 12:21:21 - INFO - __main__ - Step 107399: {'lr': 9.544331567659553e-05, 'samples': 20620608, 'steps': 107398, 'loss/train': 1.2189011573791504} 11/07/2021 12:21:22 - INFO - __main__ - Step 107400: {'lr': 9.543914461136769e-05, 'samples': 20620800, 'steps': 107399, 'loss/train': 1.264886736869812} 11/07/2021 12:21:23 - INFO - __main__ - Step 107401: {'lr': 9.543497361578254e-05, 'samples': 20620992, 'steps': 107400, 'loss/train': 1.3822391033172607} 11/07/2021 12:21:23 - INFO - __main__ - Step 107402: {'lr': 9.543080268984211e-05, 'samples': 20621184, 'steps': 107401, 'loss/train': 1.5468289852142334} 11/07/2021 12:21:24 - INFO - __main__ - Step 107403: {'lr': 9.54266318335481e-05, 'samples': 20621376, 'steps': 107402, 'loss/train': 1.3926324844360352} 11/07/2021 12:21:24 - INFO - __main__ - Step 107404: {'lr': 9.542246104690247e-05, 'samples': 20621568, 'steps': 107403, 'loss/train': 1.0311797857284546} 11/07/2021 12:21:25 - INFO - __main__ - Step 107405: {'lr': 9.541829032990709e-05, 'samples': 20621760, 'steps': 107404, 'loss/train': 1.4189708232879639} 11/07/2021 12:21:26 - INFO - __main__ - Step 107406: {'lr': 9.541411968256383e-05, 'samples': 20621952, 'steps': 107405, 'loss/train': 1.2329210042953491} 11/07/2021 12:21:26 - INFO - __main__ - Step 107407: {'lr': 9.540994910487457e-05, 'samples': 20622144, 'steps': 107406, 'loss/train': 2.362774133682251} 11/07/2021 12:21:26 - INFO - __main__ - Step 107408: {'lr': 9.540577859684124e-05, 'samples': 20622336, 'steps': 107407, 'loss/train': 2.419036388397217} 11/07/2021 12:21:27 - INFO - __main__ - Step 107409: {'lr': 9.540160815846566e-05, 'samples': 20622528, 'steps': 107408, 'loss/train': 1.4531168937683105} 11/07/2021 12:21:28 - INFO - __main__ - Step 107410: {'lr': 9.539743778974975e-05, 'samples': 20622720, 'steps': 107409, 'loss/train': 1.3672817945480347} 11/07/2021 12:21:28 - INFO - __main__ - Step 107411: {'lr': 9.539326749069532e-05, 'samples': 20622912, 'steps': 107410, 'loss/train': 1.3618253469467163} 11/07/2021 12:21:28 - INFO - __main__ - Step 107412: {'lr': 9.538909726130435e-05, 'samples': 20623104, 'steps': 107411, 'loss/train': 1.1488800048828125} 11/07/2021 12:21:29 - INFO - __main__ - Step 107413: {'lr': 9.538492710157865e-05, 'samples': 20623296, 'steps': 107412, 'loss/train': 1.0438427925109863} 11/07/2021 12:21:29 - INFO - __main__ - Step 107414: {'lr': 9.53807570115201e-05, 'samples': 20623488, 'steps': 107413, 'loss/train': 1.2645299434661865} 11/07/2021 12:21:29 - INFO - __main__ - Step 107415: {'lr': 9.537658699113069e-05, 'samples': 20623680, 'steps': 107414, 'loss/train': 1.1380655765533447} 11/07/2021 12:21:31 - INFO - __main__ - Step 107416: {'lr': 9.537241704041211e-05, 'samples': 20623872, 'steps': 107415, 'loss/train': 1.397681713104248} 11/07/2021 12:21:31 - INFO - __main__ - Step 107417: {'lr': 9.536824715936635e-05, 'samples': 20624064, 'steps': 107416, 'loss/train': 2.263239860534668} 11/07/2021 12:21:31 - INFO - __main__ - Step 107418: {'lr': 9.536407734799527e-05, 'samples': 20624256, 'steps': 107417, 'loss/train': 0.857033371925354} 11/07/2021 12:21:32 - INFO - __main__ - Step 107419: {'lr': 9.535990760630072e-05, 'samples': 20624448, 'steps': 107418, 'loss/train': 1.2593283653259277} 11/07/2021 12:21:32 - INFO - __main__ - Step 107420: {'lr': 9.535573793428465e-05, 'samples': 20624640, 'steps': 107419, 'loss/train': 0.9860390424728394} 11/07/2021 12:21:34 - INFO - __main__ - Step 107421: {'lr': 9.535156833194889e-05, 'samples': 20624832, 'steps': 107420, 'loss/train': 1.0543113946914673} 11/07/2021 12:21:34 - INFO - __main__ - Step 107422: {'lr': 9.53473987992953e-05, 'samples': 20625024, 'steps': 107421, 'loss/train': 1.520618200302124} 11/07/2021 12:21:34 - INFO - __main__ - Step 107423: {'lr': 9.534322933632581e-05, 'samples': 20625216, 'steps': 107422, 'loss/train': 1.7776871919631958} 11/07/2021 12:21:35 - INFO - __main__ - Step 107424: {'lr': 9.533905994304226e-05, 'samples': 20625408, 'steps': 107423, 'loss/train': 1.681547999382019} 11/07/2021 12:21:35 - INFO - __main__ - Step 107425: {'lr': 9.533489061944655e-05, 'samples': 20625600, 'steps': 107424, 'loss/train': 1.506897211074829} 11/07/2021 12:21:35 - INFO - __main__ - Step 107426: {'lr': 9.533072136554058e-05, 'samples': 20625792, 'steps': 107425, 'loss/train': 1.3938766717910767} 11/07/2021 12:21:37 - INFO - __main__ - Step 107427: {'lr': 9.532655218132616e-05, 'samples': 20625984, 'steps': 107426, 'loss/train': 1.479050874710083} 11/07/2021 12:21:37 - INFO - __main__ - Step 107428: {'lr': 9.532238306680521e-05, 'samples': 20626176, 'steps': 107427, 'loss/train': 1.1741611957550049} 11/07/2021 12:21:38 - INFO - __main__ - Step 107429: {'lr': 9.531821402197972e-05, 'samples': 20626368, 'steps': 107428, 'loss/train': 1.2562456130981445} 11/07/2021 12:21:38 - INFO - __main__ - Step 107430: {'lr': 9.531404504685134e-05, 'samples': 20626560, 'steps': 107429, 'loss/train': 1.2389647960662842} 11/07/2021 12:21:38 - INFO - __main__ - Step 107431: {'lr': 9.530987614142209e-05, 'samples': 20626752, 'steps': 107430, 'loss/train': 5.432201385498047} 11/07/2021 12:21:39 - INFO - __main__ - Step 107432: {'lr': 9.530570730569383e-05, 'samples': 20626944, 'steps': 107431, 'loss/train': 5.511879920959473} 11/07/2021 12:21:40 - INFO - __main__ - Step 107433: {'lr': 9.530153853966841e-05, 'samples': 20627136, 'steps': 107432, 'loss/train': 2.005730390548706} 11/07/2021 12:21:40 - INFO - __main__ - Step 107434: {'lr': 9.529736984334773e-05, 'samples': 20627328, 'steps': 107433, 'loss/train': 1.6817957162857056} 11/07/2021 12:21:40 - INFO - __main__ - Step 107435: {'lr': 9.529320121673369e-05, 'samples': 20627520, 'steps': 107434, 'loss/train': 0.9208579659461975} 11/07/2021 12:21:41 - INFO - __main__ - Step 107436: {'lr': 9.52890326598281e-05, 'samples': 20627712, 'steps': 107435, 'loss/train': 1.0676395893096924} 11/07/2021 12:21:41 - INFO - __main__ - Step 107437: {'lr': 9.528486417263294e-05, 'samples': 20627904, 'steps': 107436, 'loss/train': 1.3269826173782349} 11/07/2021 12:21:42 - INFO - __main__ - Step 107438: {'lr': 9.528069575515e-05, 'samples': 20628096, 'steps': 107437, 'loss/train': 1.224660038948059} 11/07/2021 12:21:42 - INFO - __main__ - Step 107439: {'lr': 9.527652740738118e-05, 'samples': 20628288, 'steps': 107438, 'loss/train': 1.3121466636657715} 11/07/2021 12:21:43 - INFO - __main__ - Step 107440: {'lr': 9.527235912932839e-05, 'samples': 20628480, 'steps': 107439, 'loss/train': 1.6492106914520264} 11/07/2021 12:21:43 - INFO - __main__ - Step 107441: {'lr': 9.526819092099348e-05, 'samples': 20628672, 'steps': 107440, 'loss/train': 0.986684262752533} 11/07/2021 12:21:43 - INFO - __main__ - Step 107442: {'lr': 9.526402278237842e-05, 'samples': 20628864, 'steps': 107441, 'loss/train': 1.2516591548919678} 11/07/2021 12:21:45 - INFO - __main__ - Step 107443: {'lr': 9.525985471348491e-05, 'samples': 20629056, 'steps': 107442, 'loss/train': 1.300490379333496} 11/07/2021 12:21:45 - INFO - __main__ - Step 107444: {'lr': 9.525568671431495e-05, 'samples': 20629248, 'steps': 107443, 'loss/train': 0.9222065806388855} 11/07/2021 12:21:45 - INFO - __main__ - Step 107445: {'lr': 9.525151878487037e-05, 'samples': 20629440, 'steps': 107444, 'loss/train': 1.1998380422592163} 11/07/2021 12:21:46 - INFO - __main__ - Step 107446: {'lr': 9.524735092515308e-05, 'samples': 20629632, 'steps': 107445, 'loss/train': 1.2915589809417725} 11/07/2021 12:21:46 - INFO - __main__ - Step 107447: {'lr': 9.524318313516495e-05, 'samples': 20629824, 'steps': 107446, 'loss/train': 1.249467134475708} 11/07/2021 12:21:47 - INFO - __main__ - Step 107448: {'lr': 9.523901541490781e-05, 'samples': 20630016, 'steps': 107447, 'loss/train': 1.6038906574249268} 11/07/2021 12:21:48 - INFO - __main__ - Step 107449: {'lr': 9.523484776438363e-05, 'samples': 20630208, 'steps': 107448, 'loss/train': 1.9301977157592773} 11/07/2021 12:21:48 - INFO - __main__ - Step 107450: {'lr': 9.52306801835942e-05, 'samples': 20630400, 'steps': 107449, 'loss/train': 1.4750542640686035} 11/07/2021 12:21:48 - INFO - __main__ - Step 107451: {'lr': 9.522651267254148e-05, 'samples': 20630592, 'steps': 107450, 'loss/train': 1.5731334686279297} 11/07/2021 12:21:49 - INFO - __main__ - Step 107452: {'lr': 9.522234523122728e-05, 'samples': 20630784, 'steps': 107451, 'loss/train': 1.0596282482147217} 11/07/2021 12:21:49 - INFO - __main__ - Step 107453: {'lr': 9.52181778596535e-05, 'samples': 20630976, 'steps': 107452, 'loss/train': 1.1032906770706177} 11/07/2021 12:21:50 - INFO - __main__ - Step 107454: {'lr': 9.521401055782203e-05, 'samples': 20631168, 'steps': 107453, 'loss/train': 1.4896913766860962} 11/07/2021 12:21:50 - INFO - __main__ - Step 107455: {'lr': 9.520984332573474e-05, 'samples': 20631360, 'steps': 107454, 'loss/train': 0.831501305103302} 11/07/2021 12:21:51 - INFO - __main__ - Step 107456: {'lr': 9.52056761633936e-05, 'samples': 20631552, 'steps': 107455, 'loss/train': 1.7128692865371704} 11/07/2021 12:21:51 - INFO - __main__ - Step 107457: {'lr': 9.520150907080027e-05, 'samples': 20631744, 'steps': 107456, 'loss/train': 1.5051120519638062} 11/07/2021 12:21:51 - INFO - __main__ - Step 107458: {'lr': 9.51973420479568e-05, 'samples': 20631936, 'steps': 107457, 'loss/train': 1.695736050605774} 11/07/2021 12:21:53 - INFO - __main__ - Step 107459: {'lr': 9.5193175094865e-05, 'samples': 20632128, 'steps': 107458, 'loss/train': 1.3299919366836548} 11/07/2021 12:21:53 - INFO - __main__ - Step 107460: {'lr': 9.518900821152677e-05, 'samples': 20632320, 'steps': 107459, 'loss/train': 1.0990642309188843} 11/07/2021 12:21:54 - INFO - __main__ - Step 107461: {'lr': 9.518484139794396e-05, 'samples': 20632512, 'steps': 107460, 'loss/train': 1.0392773151397705} 11/07/2021 12:21:54 - INFO - __main__ - Step 107462: {'lr': 9.518067465411848e-05, 'samples': 20632704, 'steps': 107461, 'loss/train': 2.2471539974212646} 11/07/2021 12:21:54 - INFO - __main__ - Step 107463: {'lr': 9.51765079800522e-05, 'samples': 20632896, 'steps': 107462, 'loss/train': 0.9547715187072754} 11/07/2021 12:21:55 - INFO - __main__ - Step 107464: {'lr': 9.517234137574701e-05, 'samples': 20633088, 'steps': 107463, 'loss/train': 1.0756821632385254} 11/07/2021 12:21:56 - INFO - __main__ - Step 107465: {'lr': 9.516817484120477e-05, 'samples': 20633280, 'steps': 107464, 'loss/train': 1.3626644611358643} 11/07/2021 12:21:56 - INFO - __main__ - Step 107466: {'lr': 9.516400837642736e-05, 'samples': 20633472, 'steps': 107465, 'loss/train': 1.202424168586731} 11/07/2021 12:21:57 - INFO - __main__ - Step 107467: {'lr': 9.515984198141666e-05, 'samples': 20633664, 'steps': 107466, 'loss/train': 0.8449694514274597} 11/07/2021 12:21:57 - INFO - __main__ - Step 107468: {'lr': 9.515567565617453e-05, 'samples': 20633856, 'steps': 107467, 'loss/train': 1.2341660261154175} 11/07/2021 12:21:57 - INFO - __main__ - Step 107469: {'lr': 9.515150940070297e-05, 'samples': 20634048, 'steps': 107468, 'loss/train': 1.288142442703247} 11/07/2021 12:21:58 - INFO - __main__ - Step 107470: {'lr': 9.514734321500365e-05, 'samples': 20634240, 'steps': 107469, 'loss/train': 0.9022871255874634} 11/07/2021 12:21:59 - INFO - __main__ - Step 107471: {'lr': 9.514317709907857e-05, 'samples': 20634432, 'steps': 107470, 'loss/train': 1.3279508352279663} 11/07/2021 12:21:59 - INFO - __main__ - Step 107472: {'lr': 9.513901105292958e-05, 'samples': 20634624, 'steps': 107471, 'loss/train': 0.939070463180542} 11/07/2021 12:21:59 - INFO - __main__ - Step 107473: {'lr': 9.513484507655854e-05, 'samples': 20634816, 'steps': 107472, 'loss/train': 1.30000901222229} 11/07/2021 12:22:00 - INFO - __main__ - Step 107474: {'lr': 9.513067916996734e-05, 'samples': 20635008, 'steps': 107473, 'loss/train': 1.4096348285675049} 11/07/2021 12:22:01 - INFO - __main__ - Step 107475: {'lr': 9.512651333315789e-05, 'samples': 20635200, 'steps': 107474, 'loss/train': 2.478377103805542} 11/07/2021 12:22:01 - INFO - __main__ - Step 107476: {'lr': 9.512234756613206e-05, 'samples': 20635392, 'steps': 107475, 'loss/train': 1.1266189813613892} 11/07/2021 12:22:01 - INFO - __main__ - Step 107477: {'lr': 9.511818186889168e-05, 'samples': 20635584, 'steps': 107476, 'loss/train': 0.9254646301269531} 11/07/2021 12:22:02 - INFO - __main__ - Step 107478: {'lr': 9.511401624143867e-05, 'samples': 20635776, 'steps': 107477, 'loss/train': 1.2419527769088745} 11/07/2021 12:22:02 - INFO - __main__ - Step 107479: {'lr': 9.51098506837749e-05, 'samples': 20635968, 'steps': 107478, 'loss/train': 1.3965392112731934} 11/07/2021 12:22:03 - INFO - __main__ - Step 107480: {'lr': 9.51056851959022e-05, 'samples': 20636160, 'steps': 107479, 'loss/train': 1.5037509202957153} 11/07/2021 12:22:03 - INFO - __main__ - Step 107481: {'lr': 9.510151977782264e-05, 'samples': 20636352, 'steps': 107480, 'loss/train': 1.292834758758545} 11/07/2021 12:22:04 - INFO - __main__ - Step 107482: {'lr': 9.509735442953782e-05, 'samples': 20636544, 'steps': 107481, 'loss/train': 1.2124059200286865} 11/07/2021 12:22:04 - INFO - __main__ - Step 107483: {'lr': 9.509318915104976e-05, 'samples': 20636736, 'steps': 107482, 'loss/train': 1.2736321687698364} 11/07/2021 12:22:04 - INFO - __main__ - Step 107484: {'lr': 9.50890239423603e-05, 'samples': 20636928, 'steps': 107483, 'loss/train': 1.0926849842071533} 11/07/2021 12:22:06 - INFO - __main__ - Step 107485: {'lr': 9.508485880347132e-05, 'samples': 20637120, 'steps': 107484, 'loss/train': 1.3165885210037231} 11/07/2021 12:22:06 - INFO - __main__ - Step 107486: {'lr': 9.508069373438475e-05, 'samples': 20637312, 'steps': 107485, 'loss/train': 1.4640170335769653} 11/07/2021 12:22:06 - INFO - __main__ - Step 107487: {'lr': 9.50765287351024e-05, 'samples': 20637504, 'steps': 107486, 'loss/train': 1.570160984992981} 11/07/2021 12:22:07 - INFO - __main__ - Step 107488: {'lr': 9.50723638056262e-05, 'samples': 20637696, 'steps': 107487, 'loss/train': 1.2074174880981445} 11/07/2021 12:22:07 - INFO - __main__ - Step 107489: {'lr': 9.506819894595798e-05, 'samples': 20637888, 'steps': 107488, 'loss/train': 1.3205180168151855} 11/07/2021 12:22:08 - INFO - __main__ - Step 107490: {'lr': 9.506403415609966e-05, 'samples': 20638080, 'steps': 107489, 'loss/train': 1.6252857446670532} 11/07/2021 12:22:08 - INFO - __main__ - Step 107491: {'lr': 9.505986943605307e-05, 'samples': 20638272, 'steps': 107490, 'loss/train': 0.8755864500999451} 11/07/2021 12:22:09 - INFO - __main__ - Step 107492: {'lr': 9.50557047858202e-05, 'samples': 20638464, 'steps': 107491, 'loss/train': 1.6031314134597778} 11/07/2021 12:22:09 - INFO - __main__ - Step 107493: {'lr': 9.505154020540277e-05, 'samples': 20638656, 'steps': 107492, 'loss/train': 1.4392824172973633} 11/07/2021 12:22:09 - INFO - __main__ - Step 107494: {'lr': 9.504737569480273e-05, 'samples': 20638848, 'steps': 107493, 'loss/train': 1.556113839149475} 11/07/2021 12:22:10 - INFO - __main__ - Step 107495: {'lr': 9.504321125402193e-05, 'samples': 20639040, 'steps': 107494, 'loss/train': 0.8657276630401611} 11/07/2021 12:22:11 - INFO - __main__ - Step 107496: {'lr': 9.503904688306227e-05, 'samples': 20639232, 'steps': 107495, 'loss/train': 1.2932443618774414} 11/07/2021 12:22:11 - INFO - __main__ - Step 107497: {'lr': 9.503488258192566e-05, 'samples': 20639424, 'steps': 107496, 'loss/train': 1.8010929822921753} 11/07/2021 12:22:12 - INFO - __main__ - Step 107498: {'lr': 9.503071835061391e-05, 'samples': 20639616, 'steps': 107497, 'loss/train': 1.3772432804107666} 11/07/2021 12:22:12 - INFO - __main__ - Step 107499: {'lr': 9.502655418912892e-05, 'samples': 20639808, 'steps': 107498, 'loss/train': 0.7937059998512268} 11/07/2021 12:22:12 - INFO - __main__ - Step 107500: {'lr': 9.50223900974726e-05, 'samples': 20640000, 'steps': 107499, 'loss/train': 1.0852283239364624} 11/07/2021 12:22:13 - INFO - __main__ - Step 107501: {'lr': 9.501822607564677e-05, 'samples': 20640192, 'steps': 107500, 'loss/train': 1.1952239274978638} 11/07/2021 12:22:14 - INFO - __main__ - Step 107502: {'lr': 9.501406212365334e-05, 'samples': 20640384, 'steps': 107501, 'loss/train': 1.569092035293579} 11/07/2021 12:22:14 - INFO - __main__ - Step 107503: {'lr': 9.500989824149428e-05, 'samples': 20640576, 'steps': 107502, 'loss/train': 1.5940128564834595} 11/07/2021 12:22:14 - INFO - __main__ - Step 107504: {'lr': 9.500573442917129e-05, 'samples': 20640768, 'steps': 107503, 'loss/train': 1.4989341497421265} 11/07/2021 12:22:15 - INFO - __main__ - Step 107505: {'lr': 9.500157068668632e-05, 'samples': 20640960, 'steps': 107504, 'loss/train': 1.3625893592834473} 11/07/2021 12:22:16 - INFO - __main__ - Step 107506: {'lr': 9.499740701404124e-05, 'samples': 20641152, 'steps': 107505, 'loss/train': 1.5507193803787231} 11/07/2021 12:22:16 - INFO - __main__ - Step 107507: {'lr': 9.499324341123793e-05, 'samples': 20641344, 'steps': 107506, 'loss/train': 1.6170618534088135} 11/07/2021 12:22:16 - INFO - __main__ - Step 107508: {'lr': 9.498907987827829e-05, 'samples': 20641536, 'steps': 107507, 'loss/train': 1.536784291267395} 11/07/2021 12:22:17 - INFO - __main__ - Step 107509: {'lr': 9.498491641516418e-05, 'samples': 20641728, 'steps': 107508, 'loss/train': 1.1018437147140503} 11/07/2021 12:22:17 - INFO - __main__ - Step 107510: {'lr': 9.498075302189746e-05, 'samples': 20641920, 'steps': 107509, 'loss/train': 1.753290057182312} 11/07/2021 12:22:17 - INFO - __main__ - Step 107511: {'lr': 9.497658969848002e-05, 'samples': 20642112, 'steps': 107510, 'loss/train': 1.3020565509796143} 11/07/2021 12:22:18 - INFO - __main__ - Step 107512: {'lr': 9.497242644491375e-05, 'samples': 20642304, 'steps': 107511, 'loss/train': 1.0212690830230713} 11/07/2021 12:22:19 - INFO - __main__ - Step 107513: {'lr': 9.496826326120051e-05, 'samples': 20642496, 'steps': 107512, 'loss/train': 1.181544542312622} 11/07/2021 12:22:19 - INFO - __main__ - Step 107514: {'lr': 9.496410014734228e-05, 'samples': 20642688, 'steps': 107513, 'loss/train': 0.841107189655304} 11/07/2021 12:22:20 - INFO - __main__ - Step 107515: {'lr': 9.495993710334072e-05, 'samples': 20642880, 'steps': 107514, 'loss/train': 1.1003283262252808} 11/07/2021 12:22:21 - INFO - __main__ - Step 107516: {'lr': 9.495577412919781e-05, 'samples': 20643072, 'steps': 107515, 'loss/train': 0.9300312399864197} 11/07/2021 12:22:21 - INFO - __main__ - Step 107517: {'lr': 9.495161122491547e-05, 'samples': 20643264, 'steps': 107516, 'loss/train': 0.9668824076652527} 11/07/2021 12:22:21 - INFO - __main__ - Step 107518: {'lr': 9.494744839049552e-05, 'samples': 20643456, 'steps': 107517, 'loss/train': 1.2367366552352905} 11/07/2021 12:22:22 - INFO - __main__ - Step 107519: {'lr': 9.494328562593987e-05, 'samples': 20643648, 'steps': 107518, 'loss/train': 1.6410863399505615} 11/07/2021 12:22:22 - INFO - __main__ - Step 107520: {'lr': 9.493912293125038e-05, 'samples': 20643840, 'steps': 107519, 'loss/train': 1.165464997291565} 11/07/2021 12:22:22 - INFO - __main__ - Step 107521: {'lr': 9.493496030642892e-05, 'samples': 20644032, 'steps': 107520, 'loss/train': 1.2029746770858765} 11/07/2021 12:22:24 - INFO - __main__ - Step 107522: {'lr': 9.493079775147736e-05, 'samples': 20644224, 'steps': 107521, 'loss/train': 1.101172924041748} 11/07/2021 12:22:24 - INFO - __main__ - Step 107523: {'lr': 9.492663526639761e-05, 'samples': 20644416, 'steps': 107522, 'loss/train': 1.423072338104248} 11/07/2021 12:22:24 - INFO - __main__ - Step 107524: {'lr': 9.492247285119155e-05, 'samples': 20644608, 'steps': 107523, 'loss/train': 1.0898269414901733} 11/07/2021 12:22:25 - INFO - __main__ - Step 107525: {'lr': 9.491831050586108e-05, 'samples': 20644800, 'steps': 107524, 'loss/train': 0.14372235536575317} 11/07/2021 12:22:25 - INFO - __main__ - Step 107526: {'lr': 9.491414823040795e-05, 'samples': 20644992, 'steps': 107525, 'loss/train': 1.1253252029418945} 11/07/2021 12:22:26 - INFO - __main__ - Step 107527: {'lr': 9.490998602483411e-05, 'samples': 20645184, 'steps': 107526, 'loss/train': 1.255549669265747} 11/07/2021 12:22:26 - INFO - __main__ - Step 107528: {'lr': 9.490582388914143e-05, 'samples': 20645376, 'steps': 107527, 'loss/train': 1.832959771156311} 11/07/2021 12:22:27 - INFO - __main__ - Step 107529: {'lr': 9.490166182333182e-05, 'samples': 20645568, 'steps': 107528, 'loss/train': 1.5211374759674072} 11/07/2021 12:22:27 - INFO - __main__ - Step 107530: {'lr': 9.489749982740711e-05, 'samples': 20645760, 'steps': 107529, 'loss/train': 1.541837215423584} 11/07/2021 12:22:27 - INFO - __main__ - Step 107531: {'lr': 9.489333790136917e-05, 'samples': 20645952, 'steps': 107530, 'loss/train': 0.9854202270507812} 11/07/2021 12:22:28 - INFO - __main__ - Step 107532: {'lr': 9.488917604521994e-05, 'samples': 20646144, 'steps': 107531, 'loss/train': 1.2732455730438232} 11/07/2021 12:22:29 - INFO - __main__ - Step 107533: {'lr': 9.488501425896124e-05, 'samples': 20646336, 'steps': 107532, 'loss/train': 1.571411371231079} 11/07/2021 12:22:29 - INFO - __main__ - Step 107534: {'lr': 9.488085254259494e-05, 'samples': 20646528, 'steps': 107533, 'loss/train': 1.281514048576355} 11/07/2021 12:22:29 - INFO - __main__ - Step 107535: {'lr': 9.487669089612294e-05, 'samples': 20646720, 'steps': 107534, 'loss/train': 1.6547895669937134} 11/07/2021 12:22:30 - INFO - __main__ - Step 107536: {'lr': 9.48725293195472e-05, 'samples': 20646912, 'steps': 107535, 'loss/train': 1.311187505722046} 11/07/2021 12:22:30 - INFO - __main__ - Step 107537: {'lr': 9.486836781286945e-05, 'samples': 20647104, 'steps': 107536, 'loss/train': 1.6267040967941284} 11/07/2021 12:22:31 - INFO - __main__ - Step 107538: {'lr': 9.486420637609158e-05, 'samples': 20647296, 'steps': 107537, 'loss/train': 1.8585913181304932} 11/07/2021 12:22:32 - INFO - __main__ - Step 107539: {'lr': 9.486004500921552e-05, 'samples': 20647488, 'steps': 107538, 'loss/train': 1.3354110717773438} 11/07/2021 12:22:32 - INFO - __main__ - Step 107540: {'lr': 9.485588371224313e-05, 'samples': 20647680, 'steps': 107539, 'loss/train': 1.390745759010315} 11/07/2021 12:22:32 - INFO - __main__ - Step 107541: {'lr': 9.485172248517626e-05, 'samples': 20647872, 'steps': 107540, 'loss/train': 1.2066930532455444} 11/07/2021 12:22:33 - INFO - __main__ - Step 107542: {'lr': 9.484756132801683e-05, 'samples': 20648064, 'steps': 107541, 'loss/train': 1.780356764793396} 11/07/2021 12:22:34 - INFO - __main__ - Step 107543: {'lr': 9.48434002407667e-05, 'samples': 20648256, 'steps': 107542, 'loss/train': 1.118993878364563} 11/07/2021 12:22:34 - INFO - __main__ - Step 107544: {'lr': 9.483923922342775e-05, 'samples': 20648448, 'steps': 107543, 'loss/train': 1.1551319360733032} 11/07/2021 12:22:34 - INFO - __main__ - Step 107545: {'lr': 9.483507827600182e-05, 'samples': 20648640, 'steps': 107544, 'loss/train': 1.7156283855438232} 11/07/2021 12:22:35 - INFO - __main__ - Step 107546: {'lr': 9.483091739849082e-05, 'samples': 20648832, 'steps': 107545, 'loss/train': 1.4610872268676758} 11/07/2021 12:22:35 - INFO - __main__ - Step 107547: {'lr': 9.482675659089663e-05, 'samples': 20649024, 'steps': 107546, 'loss/train': 1.2214455604553223} 11/07/2021 12:22:36 - INFO - __main__ - Step 107548: {'lr': 9.482259585322109e-05, 'samples': 20649216, 'steps': 107547, 'loss/train': 1.5168730020523071} 11/07/2021 12:22:36 - INFO - __main__ - Step 107549: {'lr': 9.48184351854661e-05, 'samples': 20649408, 'steps': 107548, 'loss/train': 1.5307636260986328} 11/07/2021 12:22:37 - INFO - __main__ - Step 107550: {'lr': 9.481427458763359e-05, 'samples': 20649600, 'steps': 107549, 'loss/train': 1.1744756698608398} 11/07/2021 12:22:37 - INFO - __main__ - Step 107551: {'lr': 9.481011405972531e-05, 'samples': 20649792, 'steps': 107550, 'loss/train': 1.2515318393707275} 11/07/2021 12:22:37 - INFO - __main__ - Step 107552: {'lr': 9.480595360174321e-05, 'samples': 20649984, 'steps': 107551, 'loss/train': 1.208611011505127} 11/07/2021 12:22:39 - INFO - __main__ - Step 107553: {'lr': 9.480179321368912e-05, 'samples': 20650176, 'steps': 107552, 'loss/train': 1.3025519847869873} 11/07/2021 12:22:39 - INFO - __main__ - Step 107554: {'lr': 9.479763289556498e-05, 'samples': 20650368, 'steps': 107553, 'loss/train': 0.37006106972694397} 11/07/2021 12:22:39 - INFO - __main__ - Step 107555: {'lr': 9.479347264737261e-05, 'samples': 20650560, 'steps': 107554, 'loss/train': 1.0160930156707764} 11/07/2021 12:22:40 - INFO - __main__ - Step 107556: {'lr': 9.47893124691139e-05, 'samples': 20650752, 'steps': 107555, 'loss/train': 0.9967358112335205} 11/07/2021 12:22:40 - INFO - __main__ - Step 107557: {'lr': 9.478515236079077e-05, 'samples': 20650944, 'steps': 107556, 'loss/train': 0.8635376691818237} 11/07/2021 12:22:41 - INFO - __main__ - Step 107558: {'lr': 9.478099232240503e-05, 'samples': 20651136, 'steps': 107557, 'loss/train': 1.102769374847412} 11/07/2021 12:22:41 - INFO - __main__ - Step 107559: {'lr': 9.477683235395856e-05, 'samples': 20651328, 'steps': 107558, 'loss/train': 1.1742961406707764} 11/07/2021 12:22:42 - INFO - __main__ - Step 107560: {'lr': 9.47726724554533e-05, 'samples': 20651520, 'steps': 107559, 'loss/train': 0.9953301548957825} 11/07/2021 12:22:42 - INFO - __main__ - Step 107561: {'lr': 9.476851262689103e-05, 'samples': 20651712, 'steps': 107560, 'loss/train': 1.4606690406799316} 11/07/2021 12:22:42 - INFO - __main__ - Step 107562: {'lr': 9.476435286827371e-05, 'samples': 20651904, 'steps': 107561, 'loss/train': 1.644731044769287} 11/07/2021 12:22:43 - INFO - __main__ - Step 107563: {'lr': 9.476019317960325e-05, 'samples': 20652096, 'steps': 107562, 'loss/train': 1.309924840927124} 11/07/2021 12:22:44 - INFO - __main__ - Step 107564: {'lr': 9.475603356088135e-05, 'samples': 20652288, 'steps': 107563, 'loss/train': 1.2664250135421753} 11/07/2021 12:22:44 - INFO - __main__ - Step 107565: {'lr': 9.475187401211e-05, 'samples': 20652480, 'steps': 107564, 'loss/train': 1.0078390836715698} 11/07/2021 12:22:44 - INFO - __main__ - Step 107566: {'lr': 9.474771453329106e-05, 'samples': 20652672, 'steps': 107565, 'loss/train': 1.1863776445388794} 11/07/2021 12:22:45 - INFO - __main__ - Step 107567: {'lr': 9.474355512442639e-05, 'samples': 20652864, 'steps': 107566, 'loss/train': 0.5882775187492371} 11/07/2021 12:22:46 - INFO - __main__ - Step 107568: {'lr': 9.47393957855179e-05, 'samples': 20653056, 'steps': 107567, 'loss/train': 1.4268425703048706} 11/07/2021 12:22:46 - INFO - __main__ - Step 107569: {'lr': 9.473523651656743e-05, 'samples': 20653248, 'steps': 107568, 'loss/train': 1.5360623598098755} 11/07/2021 12:22:46 - INFO - __main__ - Step 107570: {'lr': 9.473107731757689e-05, 'samples': 20653440, 'steps': 107569, 'loss/train': 1.0649189949035645} 11/07/2021 12:22:47 - INFO - __main__ - Step 107571: {'lr': 9.472691818854809e-05, 'samples': 20653632, 'steps': 107570, 'loss/train': 1.3728270530700684} 11/07/2021 12:22:47 - INFO - __main__ - Step 107572: {'lr': 9.472275912948297e-05, 'samples': 20653824, 'steps': 107571, 'loss/train': 0.9244504570960999} 11/07/2021 12:22:48 - INFO - __main__ - Step 107573: {'lr': 9.471860014038336e-05, 'samples': 20654016, 'steps': 107572, 'loss/train': 1.249650239944458} 11/07/2021 12:22:48 - INFO - __main__ - Step 107574: {'lr': 9.471444122125117e-05, 'samples': 20654208, 'steps': 107573, 'loss/train': 0.5265253186225891} 11/07/2021 12:22:49 - INFO - __main__ - Step 107575: {'lr': 9.471028237208826e-05, 'samples': 20654400, 'steps': 107574, 'loss/train': 1.4894977807998657} 11/07/2021 12:22:49 - INFO - __main__ - Step 107576: {'lr': 9.470612359289648e-05, 'samples': 20654592, 'steps': 107575, 'loss/train': 1.5766178369522095} 11/07/2021 12:22:50 - INFO - __main__ - Step 107577: {'lr': 9.470196488367785e-05, 'samples': 20654784, 'steps': 107576, 'loss/train': 1.443886160850525} 11/07/2021 12:22:51 - INFO - __main__ - Step 107578: {'lr': 9.4697806244434e-05, 'samples': 20654976, 'steps': 107577, 'loss/train': 1.807378888130188} 11/07/2021 12:22:51 - INFO - __main__ - Step 107579: {'lr': 9.469364767516691e-05, 'samples': 20655168, 'steps': 107578, 'loss/train': 1.1803724765777588} 11/07/2021 12:22:51 - INFO - __main__ - Step 107580: {'lr': 9.46894891758785e-05, 'samples': 20655360, 'steps': 107579, 'loss/train': 1.6110702753067017} 11/07/2021 12:22:52 - INFO - __main__ - Step 107581: {'lr': 9.46853307465706e-05, 'samples': 20655552, 'steps': 107580, 'loss/train': 1.16302490234375} 11/07/2021 12:22:52 - INFO - __main__ - Step 107582: {'lr': 9.468117238724507e-05, 'samples': 20655744, 'steps': 107581, 'loss/train': 0.8927728533744812} 11/07/2021 12:22:53 - INFO - __main__ - Step 107583: {'lr': 9.467701409790384e-05, 'samples': 20655936, 'steps': 107582, 'loss/train': 1.3389619588851929} 11/07/2021 12:22:54 - INFO - __main__ - Step 107584: {'lr': 9.467285587854874e-05, 'samples': 20656128, 'steps': 107583, 'loss/train': 0.5619633793830872} 11/07/2021 12:22:54 - INFO - __main__ - Step 107585: {'lr': 9.466869772918162e-05, 'samples': 20656320, 'steps': 107584, 'loss/train': 1.3230311870574951} 11/07/2021 12:22:54 - INFO - __main__ - Step 107586: {'lr': 9.466453964980443e-05, 'samples': 20656512, 'steps': 107585, 'loss/train': 1.099129319190979} 11/07/2021 12:22:55 - INFO - __main__ - Step 107587: {'lr': 9.466038164041898e-05, 'samples': 20656704, 'steps': 107586, 'loss/train': 1.4915237426757812} 11/07/2021 12:22:55 - INFO - __main__ - Step 107588: {'lr': 9.46562237010272e-05, 'samples': 20656896, 'steps': 107587, 'loss/train': 1.488452434539795} 11/07/2021 12:22:56 - INFO - __main__ - Step 107589: {'lr': 9.465206583163088e-05, 'samples': 20657088, 'steps': 107588, 'loss/train': 1.3748420476913452} 11/07/2021 12:22:57 - INFO - __main__ - Step 107590: {'lr': 9.464790803223205e-05, 'samples': 20657280, 'steps': 107589, 'loss/train': 1.673203468322754} 11/07/2021 12:22:57 - INFO - __main__ - Step 107591: {'lr': 9.46437503028324e-05, 'samples': 20657472, 'steps': 107590, 'loss/train': 1.2478879690170288} 11/07/2021 12:22:57 - INFO - __main__ - Step 107592: {'lr': 9.463959264343385e-05, 'samples': 20657664, 'steps': 107591, 'loss/train': 1.8862205743789673} 11/07/2021 12:22:58 - INFO - __main__ - Step 107593: {'lr': 9.463543505403834e-05, 'samples': 20657856, 'steps': 107592, 'loss/train': 1.543741226196289} 11/07/2021 12:22:58 - INFO - __main__ - Step 107594: {'lr': 9.463127753464767e-05, 'samples': 20658048, 'steps': 107593, 'loss/train': 1.6696488857269287} 11/07/2021 12:22:59 - INFO - __main__ - Step 107595: {'lr': 9.462712008526378e-05, 'samples': 20658240, 'steps': 107594, 'loss/train': 0.1263873279094696} 11/07/2021 12:23:00 - INFO - __main__ - Step 107596: {'lr': 9.46229627058885e-05, 'samples': 20658432, 'steps': 107595, 'loss/train': 1.2542755603790283} 11/07/2021 12:23:00 - INFO - __main__ - Step 107597: {'lr': 9.46188053965237e-05, 'samples': 20658624, 'steps': 107596, 'loss/train': 0.7959179878234863} 11/07/2021 12:23:00 - INFO - __main__ - Step 107598: {'lr': 9.46146481571713e-05, 'samples': 20658816, 'steps': 107597, 'loss/train': 1.403794765472412} 11/07/2021 12:23:01 - INFO - __main__ - Step 107599: {'lr': 9.461049098783312e-05, 'samples': 20659008, 'steps': 107598, 'loss/train': 1.6777749061584473} 11/07/2021 12:23:02 - INFO - __main__ - Step 107600: {'lr': 9.460633388851106e-05, 'samples': 20659200, 'steps': 107599, 'loss/train': 1.3268190622329712} 11/07/2021 12:23:02 - INFO - __main__ - Step 107601: {'lr': 9.460217685920697e-05, 'samples': 20659392, 'steps': 107600, 'loss/train': 1.2645655870437622} 11/07/2021 12:23:02 - INFO - __main__ - Step 107602: {'lr': 9.459801989992275e-05, 'samples': 20659584, 'steps': 107601, 'loss/train': 1.7157779932022095} 11/07/2021 12:23:03 - INFO - __main__ - Step 107603: {'lr': 9.459386301066036e-05, 'samples': 20659776, 'steps': 107602, 'loss/train': 1.3485462665557861} 11/07/2021 12:23:03 - INFO - __main__ - Step 107604: {'lr': 9.458970619142149e-05, 'samples': 20659968, 'steps': 107603, 'loss/train': 1.3818130493164062} 11/07/2021 12:23:04 - INFO - __main__ - Step 107605: {'lr': 9.45855494422081e-05, 'samples': 20660160, 'steps': 107604, 'loss/train': 1.073848009109497} 11/07/2021 12:23:04 - INFO - __main__ - Step 107606: {'lr': 9.458139276302208e-05, 'samples': 20660352, 'steps': 107605, 'loss/train': 1.82857346534729} 11/07/2021 12:23:05 - INFO - __main__ - Step 107607: {'lr': 9.457723615386526e-05, 'samples': 20660544, 'steps': 107606, 'loss/train': 0.9573533535003662} 11/07/2021 12:23:05 - INFO - __main__ - Step 107608: {'lr': 9.457307961473954e-05, 'samples': 20660736, 'steps': 107607, 'loss/train': 1.4884384870529175} 11/07/2021 12:23:05 - INFO - __main__ - Step 107609: {'lr': 9.45689231456468e-05, 'samples': 20660928, 'steps': 107608, 'loss/train': 1.4611185789108276} 11/07/2021 12:23:06 - INFO - __main__ - Step 107610: {'lr': 9.45647667465889e-05, 'samples': 20661120, 'steps': 107609, 'loss/train': 1.5548405647277832} 11/07/2021 12:23:07 - INFO - __main__ - Step 107611: {'lr': 9.456061041756772e-05, 'samples': 20661312, 'steps': 107610, 'loss/train': 1.3791019916534424} 11/07/2021 12:23:07 - INFO - __main__ - Step 107612: {'lr': 9.455645415858514e-05, 'samples': 20661504, 'steps': 107611, 'loss/train': 1.3689656257629395} 11/07/2021 12:23:08 - INFO - __main__ - Step 107613: {'lr': 9.455229796964302e-05, 'samples': 20661696, 'steps': 107612, 'loss/train': 1.4261682033538818} 11/07/2021 12:23:08 - INFO - __main__ - Step 107614: {'lr': 9.454814185074323e-05, 'samples': 20661888, 'steps': 107613, 'loss/train': 1.260317325592041} 11/07/2021 12:23:08 - INFO - __main__ - Step 107615: {'lr': 9.454398580188764e-05, 'samples': 20662080, 'steps': 107614, 'loss/train': 1.3578966856002808} 11/07/2021 12:23:09 - INFO - __main__ - Step 107616: {'lr': 9.453982982307816e-05, 'samples': 20662272, 'steps': 107615, 'loss/train': 1.3268775939941406} 11/07/2021 12:23:10 - INFO - __main__ - Step 107617: {'lr': 9.45356739143167e-05, 'samples': 20662464, 'steps': 107616, 'loss/train': 1.5364162921905518} 11/07/2021 12:23:10 - INFO - __main__ - Step 107618: {'lr': 9.453151807560498e-05, 'samples': 20662656, 'steps': 107617, 'loss/train': 1.2805452346801758} 11/07/2021 12:23:10 - INFO - __main__ - Step 107619: {'lr': 9.452736230694494e-05, 'samples': 20662848, 'steps': 107618, 'loss/train': 1.04176926612854} 11/07/2021 12:23:11 - INFO - __main__ - Step 107620: {'lr': 9.452320660833849e-05, 'samples': 20663040, 'steps': 107619, 'loss/train': 1.5010401010513306} 11/07/2021 12:23:12 - INFO - __main__ - Step 107621: {'lr': 9.45190509797875e-05, 'samples': 20663232, 'steps': 107620, 'loss/train': 1.3067547082901} 11/07/2021 12:23:12 - INFO - __main__ - Step 107622: {'lr': 9.451489542129379e-05, 'samples': 20663424, 'steps': 107621, 'loss/train': 1.2726246118545532} 11/07/2021 12:23:13 - INFO - __main__ - Step 107623: {'lr': 9.45107399328593e-05, 'samples': 20663616, 'steps': 107622, 'loss/train': 1.4574735164642334} 11/07/2021 12:23:13 - INFO - __main__ - Step 107624: {'lr': 9.450658451448588e-05, 'samples': 20663808, 'steps': 107623, 'loss/train': 1.4379088878631592} 11/07/2021 12:23:13 - INFO - __main__ - Step 107625: {'lr': 9.450242916617535e-05, 'samples': 20664000, 'steps': 107624, 'loss/train': 1.5565006732940674} 11/07/2021 12:23:14 - INFO - __main__ - Step 107626: {'lr': 9.449827388792967e-05, 'samples': 20664192, 'steps': 107625, 'loss/train': 1.372884750366211} 11/07/2021 12:23:15 - INFO - __main__ - Step 107627: {'lr': 9.449411867975063e-05, 'samples': 20664384, 'steps': 107626, 'loss/train': 0.9404796361923218} 11/07/2021 12:23:15 - INFO - __main__ - Step 107628: {'lr': 9.448996354164016e-05, 'samples': 20664576, 'steps': 107627, 'loss/train': 1.388559103012085} 11/07/2021 12:23:15 - INFO - __main__ - Step 107629: {'lr': 9.448580847360013e-05, 'samples': 20664768, 'steps': 107628, 'loss/train': 1.063719391822815} 11/07/2021 12:23:16 - INFO - __main__ - Step 107630: {'lr': 9.448165347563244e-05, 'samples': 20664960, 'steps': 107629, 'loss/train': 1.2597023248672485} 11/07/2021 12:23:17 - INFO - __main__ - Step 107631: {'lr': 9.447749854773888e-05, 'samples': 20665152, 'steps': 107630, 'loss/train': 1.136778712272644} 11/07/2021 12:23:17 - INFO - __main__ - Step 107632: {'lr': 9.447334368992133e-05, 'samples': 20665344, 'steps': 107631, 'loss/train': 1.7267584800720215} 11/07/2021 12:23:18 - INFO - __main__ - Step 107633: {'lr': 9.44691889021817e-05, 'samples': 20665536, 'steps': 107632, 'loss/train': 1.7030079364776611} 11/07/2021 12:23:18 - INFO - __main__ - Step 107634: {'lr': 9.446503418452184e-05, 'samples': 20665728, 'steps': 107633, 'loss/train': 1.527644157409668} 11/07/2021 12:23:19 - INFO - __main__ - Step 107635: {'lr': 9.446087953694366e-05, 'samples': 20665920, 'steps': 107634, 'loss/train': 1.3193635940551758} 11/07/2021 12:23:19 - INFO - __main__ - Step 107636: {'lr': 9.445672495944899e-05, 'samples': 20666112, 'steps': 107635, 'loss/train': 1.3556904792785645} 11/07/2021 12:23:19 - INFO - __main__ - Step 107637: {'lr': 9.44525704520397e-05, 'samples': 20666304, 'steps': 107636, 'loss/train': 1.3840986490249634} 11/07/2021 12:23:20 - INFO - __main__ - Step 107638: {'lr': 9.444841601471771e-05, 'samples': 20666496, 'steps': 107637, 'loss/train': 1.0747829675674438} 11/07/2021 12:23:21 - INFO - __main__ - Step 107639: {'lr': 9.444426164748485e-05, 'samples': 20666688, 'steps': 107638, 'loss/train': 1.43145751953125} 11/07/2021 12:23:21 - INFO - __main__ - Step 107640: {'lr': 9.444010735034304e-05, 'samples': 20666880, 'steps': 107639, 'loss/train': 1.484588861465454} 11/07/2021 12:23:21 - INFO - __main__ - Step 107641: {'lr': 9.443595312329406e-05, 'samples': 20667072, 'steps': 107640, 'loss/train': 1.487840175628662} 11/07/2021 12:23:22 - INFO - __main__ - Step 107642: {'lr': 9.443179896633988e-05, 'samples': 20667264, 'steps': 107641, 'loss/train': 1.402938961982727} 11/07/2021 12:23:23 - INFO - __main__ - Step 107643: {'lr': 9.442764487948233e-05, 'samples': 20667456, 'steps': 107642, 'loss/train': 1.5048675537109375} 11/07/2021 12:23:23 - INFO - __main__ - Step 107644: {'lr': 9.442349086272334e-05, 'samples': 20667648, 'steps': 107643, 'loss/train': 1.1789549589157104} 11/07/2021 12:23:23 - INFO - __main__ - Step 107645: {'lr': 9.441933691606466e-05, 'samples': 20667840, 'steps': 107644, 'loss/train': 1.3102213144302368} 11/07/2021 12:23:24 - INFO - __main__ - Step 107646: {'lr': 9.441518303950822e-05, 'samples': 20668032, 'steps': 107645, 'loss/train': 1.206657886505127} 11/07/2021 12:23:24 - INFO - __main__ - Step 107647: {'lr': 9.441102923305589e-05, 'samples': 20668224, 'steps': 107646, 'loss/train': 1.1570717096328735} 11/07/2021 12:23:25 - INFO - __main__ - Step 107648: {'lr': 9.440687549670957e-05, 'samples': 20668416, 'steps': 107647, 'loss/train': 1.7150744199752808} 11/07/2021 12:23:26 - INFO - __main__ - Step 107649: {'lr': 9.440272183047111e-05, 'samples': 20668608, 'steps': 107648, 'loss/train': 1.1351161003112793} 11/07/2021 12:23:26 - INFO - __main__ - Step 107650: {'lr': 9.439856823434236e-05, 'samples': 20668800, 'steps': 107649, 'loss/train': 1.5025326013565063} 11/07/2021 12:23:26 - INFO - __main__ - Step 107651: {'lr': 9.439441470832525e-05, 'samples': 20668992, 'steps': 107650, 'loss/train': 0.9080139398574829} 11/07/2021 12:23:27 - INFO - __main__ - Step 107652: {'lr': 9.439026125242156e-05, 'samples': 20669184, 'steps': 107651, 'loss/train': 0.8908434510231018} 11/07/2021 12:23:28 - INFO - __main__ - Step 107653: {'lr': 9.438610786663327e-05, 'samples': 20669376, 'steps': 107652, 'loss/train': 1.6327085494995117} 11/07/2021 12:23:28 - INFO - __main__ - Step 107654: {'lr': 9.438195455096216e-05, 'samples': 20669568, 'steps': 107653, 'loss/train': 1.594856858253479} 11/07/2021 12:23:28 - INFO - __main__ - Step 107655: {'lr': 9.437780130541015e-05, 'samples': 20669760, 'steps': 107654, 'loss/train': 1.2759156227111816} 11/07/2021 12:23:29 - INFO - __main__ - Step 107656: {'lr': 9.437364812997912e-05, 'samples': 20669952, 'steps': 107655, 'loss/train': 0.9825313091278076} 11/07/2021 12:23:29 - INFO - __main__ - Step 107657: {'lr': 9.436949502467101e-05, 'samples': 20670144, 'steps': 107656, 'loss/train': 0.8859226107597351} 11/07/2021 12:23:30 - INFO - __main__ - Step 107658: {'lr': 9.436534198948752e-05, 'samples': 20670336, 'steps': 107657, 'loss/train': 1.1667321920394897} 11/07/2021 12:23:31 - INFO - __main__ - Step 107659: {'lr': 9.436118902443059e-05, 'samples': 20670528, 'steps': 107658, 'loss/train': 1.4485656023025513} 11/07/2021 12:23:31 - INFO - __main__ - Step 107660: {'lr': 9.435703612950208e-05, 'samples': 20670720, 'steps': 107659, 'loss/train': 1.0111892223358154} 11/07/2021 12:23:31 - INFO - __main__ - Step 107661: {'lr': 9.435288330470392e-05, 'samples': 20670912, 'steps': 107660, 'loss/train': 1.0172206163406372} 11/07/2021 12:23:32 - INFO - __main__ - Step 107662: {'lr': 9.434873055003796e-05, 'samples': 20671104, 'steps': 107661, 'loss/train': 1.5986926555633545} 11/07/2021 12:23:33 - INFO - __main__ - Step 107663: {'lr': 9.434457786550605e-05, 'samples': 20671296, 'steps': 107662, 'loss/train': 0.26022759079933167} 11/07/2021 12:23:33 - INFO - __main__ - Step 107664: {'lr': 9.434042525111006e-05, 'samples': 20671488, 'steps': 107663, 'loss/train': 1.157070517539978} 11/07/2021 12:23:33 - INFO - __main__ - Step 107665: {'lr': 9.433627270685185e-05, 'samples': 20671680, 'steps': 107664, 'loss/train': 1.4337581396102905} 11/07/2021 12:23:34 - INFO - __main__ - Step 107666: {'lr': 9.433212023273336e-05, 'samples': 20671872, 'steps': 107665, 'loss/train': 1.6754108667373657} 11/07/2021 12:23:34 - INFO - __main__ - Step 107667: {'lr': 9.432796782875638e-05, 'samples': 20672064, 'steps': 107666, 'loss/train': 1.906338095664978} 11/07/2021 12:23:35 - INFO - __main__ - Step 107668: {'lr': 9.432381549492284e-05, 'samples': 20672256, 'steps': 107667, 'loss/train': 1.3213119506835938} 11/07/2021 12:23:36 - INFO - __main__ - Step 107669: {'lr': 9.431966323123458e-05, 'samples': 20672448, 'steps': 107668, 'loss/train': 1.43049955368042} 11/07/2021 12:23:36 - INFO - __main__ - Step 107670: {'lr': 9.431551103769348e-05, 'samples': 20672640, 'steps': 107669, 'loss/train': 1.3053157329559326} 11/07/2021 12:23:36 - INFO - __main__ - Step 107671: {'lr': 9.431135891430148e-05, 'samples': 20672832, 'steps': 107670, 'loss/train': 1.4066518545150757} 11/07/2021 12:23:37 - INFO - __main__ - Step 107672: {'lr': 9.430720686106031e-05, 'samples': 20673024, 'steps': 107671, 'loss/train': 1.4669252634048462} 11/07/2021 12:23:37 - INFO - __main__ - Step 107673: {'lr': 9.430305487797191e-05, 'samples': 20673216, 'steps': 107672, 'loss/train': 1.4393614530563354} 11/07/2021 12:23:38 - INFO - __main__ - Step 107674: {'lr': 9.429890296503815e-05, 'samples': 20673408, 'steps': 107673, 'loss/train': 1.3405325412750244} 11/07/2021 12:23:38 - INFO - __main__ - Step 107675: {'lr': 9.429475112226088e-05, 'samples': 20673600, 'steps': 107674, 'loss/train': 1.3284974098205566} 11/07/2021 12:23:39 - INFO - __main__ - Step 107676: {'lr': 9.429059934964201e-05, 'samples': 20673792, 'steps': 107675, 'loss/train': 1.1470284461975098} 11/07/2021 12:23:39 - INFO - __main__ - Step 107677: {'lr': 9.428644764718338e-05, 'samples': 20673984, 'steps': 107676, 'loss/train': 0.9174728989601135} 11/07/2021 12:23:39 - INFO - __main__ - Step 107678: {'lr': 9.428229601488691e-05, 'samples': 20674176, 'steps': 107677, 'loss/train': 1.3109198808670044} 11/07/2021 12:23:41 - INFO - __main__ - Step 107679: {'lr': 9.42781444527544e-05, 'samples': 20674368, 'steps': 107678, 'loss/train': 1.18946373462677} 11/07/2021 12:23:41 - INFO - __main__ - Step 107680: {'lr': 9.427399296078775e-05, 'samples': 20674560, 'steps': 107679, 'loss/train': 1.270875334739685} 11/07/2021 12:23:41 - INFO - __main__ - Step 107681: {'lr': 9.426984153898887e-05, 'samples': 20674752, 'steps': 107680, 'loss/train': 1.3005149364471436} 11/07/2021 12:23:42 - INFO - __main__ - Step 107682: {'lr': 9.426569018735958e-05, 'samples': 20674944, 'steps': 107681, 'loss/train': 0.6959660053253174} 11/07/2021 12:23:42 - INFO - __main__ - Step 107683: {'lr': 9.426153890590175e-05, 'samples': 20675136, 'steps': 107682, 'loss/train': 1.1924773454666138} 11/07/2021 12:23:43 - INFO - __main__ - Step 107684: {'lr': 9.425738769461739e-05, 'samples': 20675328, 'steps': 107683, 'loss/train': 1.2449449300765991} 11/07/2021 12:23:44 - INFO - __main__ - Step 107685: {'lr': 9.425323655350813e-05, 'samples': 20675520, 'steps': 107684, 'loss/train': 1.1270219087600708} 11/07/2021 12:23:44 - INFO - __main__ - Step 107686: {'lr': 9.424908548257596e-05, 'samples': 20675712, 'steps': 107685, 'loss/train': 1.5717113018035889} 11/07/2021 12:23:44 - INFO - __main__ - Step 107687: {'lr': 9.424493448182275e-05, 'samples': 20675904, 'steps': 107686, 'loss/train': 1.369620442390442} 11/07/2021 12:23:45 - INFO - __main__ - Step 107688: {'lr': 9.424078355125038e-05, 'samples': 20676096, 'steps': 107687, 'loss/train': 1.4311316013336182} 11/07/2021 12:23:46 - INFO - __main__ - Step 107689: {'lr': 9.423663269086072e-05, 'samples': 20676288, 'steps': 107688, 'loss/train': 1.2145655155181885} 11/07/2021 12:23:46 - INFO - __main__ - Step 107690: {'lr': 9.423248190065561e-05, 'samples': 20676480, 'steps': 107689, 'loss/train': 1.5318180322647095} 11/07/2021 12:23:46 - INFO - __main__ - Step 107691: {'lr': 9.422833118063694e-05, 'samples': 20676672, 'steps': 107690, 'loss/train': 0.6367395520210266} 11/07/2021 12:23:47 - INFO - __main__ - Step 107692: {'lr': 9.422418053080658e-05, 'samples': 20676864, 'steps': 107691, 'loss/train': 1.411977767944336} 11/07/2021 12:23:47 - INFO - __main__ - Step 107693: {'lr': 9.422002995116641e-05, 'samples': 20677056, 'steps': 107692, 'loss/train': 1.2638932466506958} 11/07/2021 12:23:48 - INFO - __main__ - Step 107694: {'lr': 9.421587944171828e-05, 'samples': 20677248, 'steps': 107693, 'loss/train': 1.9491839408874512} 11/07/2021 12:23:48 - INFO - __main__ - Step 107695: {'lr': 9.421172900246408e-05, 'samples': 20677440, 'steps': 107694, 'loss/train': 0.8334288597106934} 11/07/2021 12:23:49 - INFO - __main__ - Step 107696: {'lr': 9.420757863340568e-05, 'samples': 20677632, 'steps': 107695, 'loss/train': 0.8937883377075195} 11/07/2021 12:23:49 - INFO - __main__ - Step 107697: {'lr': 9.420342833454492e-05, 'samples': 20677824, 'steps': 107696, 'loss/train': 1.2617402076721191} 11/07/2021 12:23:49 - INFO - __main__ - Step 107698: {'lr': 9.41992781058838e-05, 'samples': 20678016, 'steps': 107697, 'loss/train': 1.041164517402649} 11/07/2021 12:23:51 - INFO - __main__ - Step 107699: {'lr': 9.419512794742397e-05, 'samples': 20678208, 'steps': 107698, 'loss/train': 1.3624892234802246} 11/07/2021 12:23:51 - INFO - __main__ - Step 107700: {'lr': 9.419097785916741e-05, 'samples': 20678400, 'steps': 107699, 'loss/train': 1.0729280710220337} 11/07/2021 12:23:51 - INFO - __main__ - Step 107701: {'lr': 9.418682784111601e-05, 'samples': 20678592, 'steps': 107700, 'loss/train': 1.5774157047271729} 11/07/2021 12:23:52 - INFO - __main__ - Step 107702: {'lr': 9.418267789327161e-05, 'samples': 20678784, 'steps': 107701, 'loss/train': 1.4786040782928467} 11/07/2021 12:23:52 - INFO - __main__ - Step 107703: {'lr': 9.417852801563612e-05, 'samples': 20678976, 'steps': 107702, 'loss/train': 5.1037163734436035} 11/07/2021 12:23:52 - INFO - __main__ - Step 107704: {'lr': 9.417437820821134e-05, 'samples': 20679168, 'steps': 107703, 'loss/train': 0.38111257553100586} 11/07/2021 12:23:53 - INFO - __main__ - Step 107705: {'lr': 9.417022847099921e-05, 'samples': 20679360, 'steps': 107704, 'loss/train': 0.4561055600643158} 11/07/2021 12:23:54 - INFO - __main__ - Step 107706: {'lr': 9.416607880400155e-05, 'samples': 20679552, 'steps': 107705, 'loss/train': 1.2391388416290283} 11/07/2021 12:23:54 - INFO - __main__ - Step 107707: {'lr': 9.416192920722025e-05, 'samples': 20679744, 'steps': 107706, 'loss/train': 2.0651488304138184} 11/07/2021 12:23:55 - INFO - __main__ - Step 107708: {'lr': 9.41577796806572e-05, 'samples': 20679936, 'steps': 107707, 'loss/train': 1.17367684841156} 11/07/2021 12:23:55 - INFO - __main__ - Step 107709: {'lr': 9.415363022431423e-05, 'samples': 20680128, 'steps': 107708, 'loss/train': 2.0584282875061035} 11/07/2021 12:23:55 - INFO - __main__ - Step 107710: {'lr': 9.414948083819325e-05, 'samples': 20680320, 'steps': 107709, 'loss/train': 0.9726949334144592} 11/07/2021 12:23:56 - INFO - __main__ - Step 107711: {'lr': 9.414533152229617e-05, 'samples': 20680512, 'steps': 107710, 'loss/train': 1.0897740125656128} 11/07/2021 12:23:57 - INFO - __main__ - Step 107712: {'lr': 9.414118227662472e-05, 'samples': 20680704, 'steps': 107711, 'loss/train': 0.963969349861145} 11/07/2021 12:23:57 - INFO - __main__ - Step 107713: {'lr': 9.413703310118085e-05, 'samples': 20680896, 'steps': 107712, 'loss/train': 1.895407795906067} 11/07/2021 12:23:57 - INFO - __main__ - Step 107714: {'lr': 9.413288399596642e-05, 'samples': 20681088, 'steps': 107713, 'loss/train': 0.9017595052719116} 11/07/2021 12:23:58 - INFO - __main__ - Step 107715: {'lr': 9.412873496098334e-05, 'samples': 20681280, 'steps': 107714, 'loss/train': 1.316055417060852} 11/07/2021 12:23:59 - INFO - __main__ - Step 107716: {'lr': 9.412458599623341e-05, 'samples': 20681472, 'steps': 107715, 'loss/train': 1.4263765811920166} 11/07/2021 12:23:59 - INFO - __main__ - Step 107717: {'lr': 9.412043710171855e-05, 'samples': 20681664, 'steps': 107716, 'loss/train': 1.343899130821228} 11/07/2021 12:23:59 - INFO - __main__ - Step 107718: {'lr': 9.411628827744062e-05, 'samples': 20681856, 'steps': 107717, 'loss/train': 1.0365629196166992} 11/07/2021 12:24:00 - INFO - __main__ - Step 107719: {'lr': 9.411213952340147e-05, 'samples': 20682048, 'steps': 107718, 'loss/train': 1.119144320487976} 11/07/2021 12:24:00 - INFO - __main__ - Step 107720: {'lr': 9.4107990839603e-05, 'samples': 20682240, 'steps': 107719, 'loss/train': 1.368788719177246} 11/07/2021 12:24:01 - INFO - __main__ - Step 107721: {'lr': 9.410384222604706e-05, 'samples': 20682432, 'steps': 107720, 'loss/train': 1.2211236953735352} 11/07/2021 12:24:02 - INFO - __main__ - Step 107722: {'lr': 9.409969368273552e-05, 'samples': 20682624, 'steps': 107721, 'loss/train': 1.3701133728027344} 11/07/2021 12:24:02 - INFO - __main__ - Step 107723: {'lr': 9.409554520967026e-05, 'samples': 20682816, 'steps': 107722, 'loss/train': 0.974332869052887} 11/07/2021 12:24:02 - INFO - __main__ - Step 107724: {'lr': 9.409139680685322e-05, 'samples': 20683008, 'steps': 107723, 'loss/train': 1.49467933177948} 11/07/2021 12:24:03 - INFO - __main__ - Step 107725: {'lr': 9.408724847428612e-05, 'samples': 20683200, 'steps': 107724, 'loss/train': 0.7886728644371033} 11/07/2021 12:24:04 - INFO - __main__ - Step 107726: {'lr': 9.40831002119709e-05, 'samples': 20683392, 'steps': 107725, 'loss/train': 1.529140830039978} 11/07/2021 12:24:04 - INFO - __main__ - Step 107727: {'lr': 9.40789520199094e-05, 'samples': 20683584, 'steps': 107726, 'loss/train': 1.3529208898544312} 11/07/2021 12:24:05 - INFO - __main__ - Step 107728: {'lr': 9.407480389810356e-05, 'samples': 20683776, 'steps': 107727, 'loss/train': 1.732130765914917} 11/07/2021 12:24:05 - INFO - __main__ - Step 107729: {'lr': 9.407065584655516e-05, 'samples': 20683968, 'steps': 107728, 'loss/train': 1.2927658557891846} 11/07/2021 12:24:05 - INFO - __main__ - Step 107730: {'lr': 9.406650786526613e-05, 'samples': 20684160, 'steps': 107729, 'loss/train': 1.172090768814087} 11/07/2021 12:24:07 - INFO - __main__ - Step 107731: {'lr': 9.406235995423834e-05, 'samples': 20684352, 'steps': 107730, 'loss/train': 1.3344048261642456} 11/07/2021 12:24:07 - INFO - __main__ - Step 107732: {'lr': 9.405821211347365e-05, 'samples': 20684544, 'steps': 107731, 'loss/train': 1.151700496673584} 11/07/2021 12:24:07 - INFO - __main__ - Step 107733: {'lr': 9.405406434297389e-05, 'samples': 20684736, 'steps': 107732, 'loss/train': 0.8313395380973816} 11/07/2021 12:24:08 - INFO - __main__ - Step 107734: {'lr': 9.404991664274098e-05, 'samples': 20684928, 'steps': 107733, 'loss/train': 1.7941924333572388} 11/07/2021 12:24:08 - INFO - __main__ - Step 107735: {'lr': 9.404576901277678e-05, 'samples': 20685120, 'steps': 107734, 'loss/train': 1.197549819946289} 11/07/2021 12:24:08 - INFO - __main__ - Step 107736: {'lr': 9.404162145308314e-05, 'samples': 20685312, 'steps': 107735, 'loss/train': 1.0760133266448975} 11/07/2021 12:24:09 - INFO - __main__ - Step 107737: {'lr': 9.403747396366197e-05, 'samples': 20685504, 'steps': 107736, 'loss/train': 1.1055902242660522} 11/07/2021 12:24:10 - INFO - __main__ - Step 107738: {'lr': 9.403332654451515e-05, 'samples': 20685696, 'steps': 107737, 'loss/train': 1.1666696071624756} 11/07/2021 12:24:10 - INFO - __main__ - Step 107739: {'lr': 9.402917919564444e-05, 'samples': 20685888, 'steps': 107738, 'loss/train': 1.4453917741775513} 11/07/2021 12:24:10 - INFO - __main__ - Step 107740: {'lr': 9.402503191705177e-05, 'samples': 20686080, 'steps': 107739, 'loss/train': 0.7419111132621765} 11/07/2021 12:24:11 - INFO - __main__ - Step 107741: {'lr': 9.402088470873902e-05, 'samples': 20686272, 'steps': 107740, 'loss/train': 1.5362712144851685} 11/07/2021 12:24:12 - INFO - __main__ - Step 107742: {'lr': 9.401673757070806e-05, 'samples': 20686464, 'steps': 107741, 'loss/train': 1.311923861503601} 11/07/2021 12:24:12 - INFO - __main__ - Step 107743: {'lr': 9.401259050296073e-05, 'samples': 20686656, 'steps': 107742, 'loss/train': 1.1026203632354736} 11/07/2021 12:24:13 - INFO - __main__ - Step 107744: {'lr': 9.400844350549893e-05, 'samples': 20686848, 'steps': 107743, 'loss/train': 1.2078156471252441} 11/07/2021 12:24:13 - INFO - __main__ - Step 107745: {'lr': 9.400429657832451e-05, 'samples': 20687040, 'steps': 107744, 'loss/train': 1.2440526485443115} 11/07/2021 12:24:13 - INFO - __main__ - Step 107746: {'lr': 9.400014972143936e-05, 'samples': 20687232, 'steps': 107745, 'loss/train': 1.0771116018295288} 11/07/2021 12:24:14 - INFO - __main__ - Step 107747: {'lr': 9.399600293484533e-05, 'samples': 20687424, 'steps': 107746, 'loss/train': 0.7519904375076294} 11/07/2021 12:24:15 - INFO - __main__ - Step 107748: {'lr': 9.399185621854428e-05, 'samples': 20687616, 'steps': 107747, 'loss/train': 1.525578498840332} 11/07/2021 12:24:15 - INFO - __main__ - Step 107749: {'lr': 9.39877095725381e-05, 'samples': 20687808, 'steps': 107748, 'loss/train': 0.9208197593688965} 11/07/2021 12:24:15 - INFO - __main__ - Step 107750: {'lr': 9.398356299682875e-05, 'samples': 20688000, 'steps': 107749, 'loss/train': 0.8822522163391113} 11/07/2021 12:24:16 - INFO - __main__ - Step 107751: {'lr': 9.397941649141792e-05, 'samples': 20688192, 'steps': 107750, 'loss/train': 1.2623391151428223} 11/07/2021 12:24:17 - INFO - __main__ - Step 107752: {'lr': 9.397527005630754e-05, 'samples': 20688384, 'steps': 107751, 'loss/train': 0.7303262948989868} 11/07/2021 12:24:17 - INFO - __main__ - Step 107753: {'lr': 9.397112369149949e-05, 'samples': 20688576, 'steps': 107752, 'loss/train': 1.1275506019592285} 11/07/2021 12:24:17 - INFO - __main__ - Step 107754: {'lr': 9.396697739699567e-05, 'samples': 20688768, 'steps': 107753, 'loss/train': 1.5500010251998901} 11/07/2021 12:24:18 - INFO - __main__ - Step 107755: {'lr': 9.396283117279788e-05, 'samples': 20688960, 'steps': 107754, 'loss/train': 1.2644314765930176} 11/07/2021 12:24:18 - INFO - __main__ - Step 107756: {'lr': 9.395868501890806e-05, 'samples': 20689152, 'steps': 107755, 'loss/train': 1.6614086627960205} 11/07/2021 12:24:19 - INFO - __main__ - Step 107757: {'lr': 9.395453893532805e-05, 'samples': 20689344, 'steps': 107756, 'loss/train': 1.3322429656982422} 11/07/2021 12:24:20 - INFO - __main__ - Step 107758: {'lr': 9.39503929220597e-05, 'samples': 20689536, 'steps': 107757, 'loss/train': 1.3707770109176636} 11/07/2021 12:24:20 - INFO - __main__ - Step 107759: {'lr': 9.394624697910492e-05, 'samples': 20689728, 'steps': 107758, 'loss/train': 1.0253204107284546} 11/07/2021 12:24:20 - INFO - __main__ - Step 107760: {'lr': 9.394210110646553e-05, 'samples': 20689920, 'steps': 107759, 'loss/train': 0.682023823261261} 11/07/2021 12:24:21 - INFO - __main__ - Step 107761: {'lr': 9.393795530414354e-05, 'samples': 20690112, 'steps': 107760, 'loss/train': 1.0543981790542603} 11/07/2021 12:24:21 - INFO - __main__ - Step 107762: {'lr': 9.393380957214056e-05, 'samples': 20690304, 'steps': 107761, 'loss/train': 1.481751561164856} 11/07/2021 12:24:22 - INFO - __main__ - Step 107763: {'lr': 9.392966391045862e-05, 'samples': 20690496, 'steps': 107762, 'loss/train': 1.487864375114441} 11/07/2021 12:24:22 - INFO - __main__ - Step 107764: {'lr': 9.39255183190996e-05, 'samples': 20690688, 'steps': 107763, 'loss/train': 1.3916946649551392} 11/07/2021 12:24:23 - INFO - __main__ - Step 107765: {'lr': 9.392137279806528e-05, 'samples': 20690880, 'steps': 107764, 'loss/train': 1.273819088935852} 11/07/2021 12:24:23 - INFO - __main__ - Step 107766: {'lr': 9.39172273473576e-05, 'samples': 20691072, 'steps': 107765, 'loss/train': 1.3730859756469727} 11/07/2021 12:24:24 - INFO - __main__ - Step 107767: {'lr': 9.391308196697843e-05, 'samples': 20691264, 'steps': 107766, 'loss/train': 1.4608395099639893} 11/07/2021 12:24:25 - INFO - __main__ - Step 107768: {'lr': 9.39089366569296e-05, 'samples': 20691456, 'steps': 107767, 'loss/train': 1.3385789394378662} 11/07/2021 12:24:25 - INFO - __main__ - Step 107769: {'lr': 9.3904791417213e-05, 'samples': 20691648, 'steps': 107768, 'loss/train': 1.4828089475631714} 11/07/2021 12:24:25 - INFO - __main__ - Step 107770: {'lr': 9.390064624783048e-05, 'samples': 20691840, 'steps': 107769, 'loss/train': 1.434881329536438} 11/07/2021 12:24:26 - INFO - __main__ - Step 107771: {'lr': 9.389650114878393e-05, 'samples': 20692032, 'steps': 107770, 'loss/train': 1.3765039443969727} 11/07/2021 12:24:26 - INFO - __main__ - Step 107772: {'lr': 9.38923561200753e-05, 'samples': 20692224, 'steps': 107771, 'loss/train': 1.6093028783798218} 11/07/2021 12:24:27 - INFO - __main__ - Step 107773: {'lr': 9.388821116170626e-05, 'samples': 20692416, 'steps': 107772, 'loss/train': 1.1464576721191406} 11/07/2021 12:24:27 - INFO - __main__ - Step 107774: {'lr': 9.388406627367879e-05, 'samples': 20692608, 'steps': 107773, 'loss/train': 1.3143792152404785} 11/07/2021 12:24:28 - INFO - __main__ - Step 107775: {'lr': 9.387992145599477e-05, 'samples': 20692800, 'steps': 107774, 'loss/train': 1.5411806106567383} 11/07/2021 12:24:28 - INFO - __main__ - Step 107776: {'lr': 9.387577670865601e-05, 'samples': 20692992, 'steps': 107775, 'loss/train': 1.6310948133468628} 11/07/2021 12:24:28 - INFO - __main__ - Step 107777: {'lr': 9.387163203166445e-05, 'samples': 20693184, 'steps': 107776, 'loss/train': 1.3401504755020142} 11/07/2021 12:24:29 - INFO - __main__ - Step 107778: {'lr': 9.386748742502191e-05, 'samples': 20693376, 'steps': 107777, 'loss/train': 1.2609549760818481} 11/07/2021 12:24:30 - INFO - __main__ - Step 107779: {'lr': 9.386334288873027e-05, 'samples': 20693568, 'steps': 107778, 'loss/train': 1.5444164276123047} 11/07/2021 12:24:30 - INFO - __main__ - Step 107780: {'lr': 9.385919842279142e-05, 'samples': 20693760, 'steps': 107779, 'loss/train': 1.344778299331665} 11/07/2021 12:24:31 - INFO - __main__ - Step 107781: {'lr': 9.385505402720718e-05, 'samples': 20693952, 'steps': 107780, 'loss/train': 1.3927606344223022} 11/07/2021 12:24:31 - INFO - __main__ - Step 107782: {'lr': 9.385090970197945e-05, 'samples': 20694144, 'steps': 107781, 'loss/train': 1.0509028434753418} 11/07/2021 12:24:31 - INFO - __main__ - Step 107783: {'lr': 9.384676544711018e-05, 'samples': 20694336, 'steps': 107782, 'loss/train': 1.40292227268219} 11/07/2021 12:24:32 - INFO - __main__ - Step 107784: {'lr': 9.384262126260107e-05, 'samples': 20694528, 'steps': 107783, 'loss/train': 1.201288104057312} 11/07/2021 12:24:33 - INFO - __main__ - Step 107785: {'lr': 9.383847714845403e-05, 'samples': 20694720, 'steps': 107784, 'loss/train': 1.143218755722046} 11/07/2021 12:24:33 - INFO - __main__ - Step 107786: {'lr': 9.383433310467099e-05, 'samples': 20694912, 'steps': 107785, 'loss/train': 1.737776279449463} 11/07/2021 12:24:33 - INFO - __main__ - Step 107787: {'lr': 9.383018913125379e-05, 'samples': 20695104, 'steps': 107786, 'loss/train': 1.333134651184082} 11/07/2021 12:24:34 - INFO - __main__ - Step 107788: {'lr': 9.382604522820429e-05, 'samples': 20695296, 'steps': 107787, 'loss/train': 1.1556005477905273} 11/07/2021 12:24:35 - INFO - __main__ - Step 107789: {'lr': 9.382190139552438e-05, 'samples': 20695488, 'steps': 107788, 'loss/train': 1.0933586359024048} 11/07/2021 12:24:35 - INFO - __main__ - Step 107790: {'lr': 9.38177576332159e-05, 'samples': 20695680, 'steps': 107789, 'loss/train': 1.201055645942688} 11/07/2021 12:24:36 - INFO - __main__ - Step 107791: {'lr': 9.381361394128071e-05, 'samples': 20695872, 'steps': 107790, 'loss/train': 1.2703081369400024} 11/07/2021 12:24:36 - INFO - __main__ - Step 107792: {'lr': 9.380947031972073e-05, 'samples': 20696064, 'steps': 107791, 'loss/train': 1.3988330364227295} 11/07/2021 12:24:36 - INFO - __main__ - Step 107793: {'lr': 9.380532676853775e-05, 'samples': 20696256, 'steps': 107792, 'loss/train': 1.3323054313659668} 11/07/2021 12:24:37 - INFO - __main__ - Step 107794: {'lr': 9.380118328773382e-05, 'samples': 20696448, 'steps': 107793, 'loss/train': 0.31565189361572266} 11/07/2021 12:24:38 - INFO - __main__ - Step 107795: {'lr': 9.379703987731053e-05, 'samples': 20696640, 'steps': 107794, 'loss/train': 1.0839812755584717} 11/07/2021 12:24:38 - INFO - __main__ - Step 107796: {'lr': 9.37928965372699e-05, 'samples': 20696832, 'steps': 107795, 'loss/train': 1.3610063791275024} 11/07/2021 12:24:38 - INFO - __main__ - Step 107797: {'lr': 9.37887532676138e-05, 'samples': 20697024, 'steps': 107796, 'loss/train': 1.2191792726516724} 11/07/2021 12:24:39 - INFO - __main__ - Step 107798: {'lr': 9.378461006834408e-05, 'samples': 20697216, 'steps': 107797, 'loss/train': 0.9705038666725159} 11/07/2021 12:24:40 - INFO - __main__ - Step 107799: {'lr': 9.378046693946257e-05, 'samples': 20697408, 'steps': 107798, 'loss/train': 1.1020292043685913} 11/07/2021 12:24:40 - INFO - __main__ - Step 107800: {'lr': 9.377632388097119e-05, 'samples': 20697600, 'steps': 107799, 'loss/train': 1.6801238059997559} 11/07/2021 12:24:41 - INFO - __main__ - Step 107801: {'lr': 9.377218089287179e-05, 'samples': 20697792, 'steps': 107800, 'loss/train': 1.4135985374450684} 11/07/2021 12:24:41 - INFO - __main__ - Step 107802: {'lr': 9.376803797516623e-05, 'samples': 20697984, 'steps': 107801, 'loss/train': 1.186326026916504} 11/07/2021 12:24:41 - INFO - __main__ - Step 107803: {'lr': 9.376389512785638e-05, 'samples': 20698176, 'steps': 107802, 'loss/train': 1.7005053758621216} 11/07/2021 12:24:42 - INFO - __main__ - Step 107804: {'lr': 9.375975235094411e-05, 'samples': 20698368, 'steps': 107803, 'loss/train': 1.2201945781707764} 11/07/2021 12:24:43 - INFO - __main__ - Step 107805: {'lr': 9.375560964443136e-05, 'samples': 20698560, 'steps': 107804, 'loss/train': 1.297858476638794} 11/07/2021 12:24:43 - INFO - __main__ - Step 107806: {'lr': 9.375146700831985e-05, 'samples': 20698752, 'steps': 107805, 'loss/train': 1.0508259534835815} 11/07/2021 12:24:43 - INFO - __main__ - Step 107807: {'lr': 9.37473244426115e-05, 'samples': 20698944, 'steps': 107806, 'loss/train': 1.2437573671340942} 11/07/2021 12:24:44 - INFO - __main__ - Step 107808: {'lr': 9.374318194730821e-05, 'samples': 20699136, 'steps': 107807, 'loss/train': 1.2152953147888184} 11/07/2021 12:24:45 - INFO - __main__ - Step 107809: {'lr': 9.373903952241184e-05, 'samples': 20699328, 'steps': 107808, 'loss/train': 1.6891520023345947} 11/07/2021 12:24:45 - INFO - __main__ - Step 107810: {'lr': 9.373489716792422e-05, 'samples': 20699520, 'steps': 107809, 'loss/train': 1.1031584739685059} 11/07/2021 12:24:45 - INFO - __main__ - Step 107811: {'lr': 9.373075488384727e-05, 'samples': 20699712, 'steps': 107810, 'loss/train': 2.0034334659576416} 11/07/2021 12:24:46 - INFO - __main__ - Step 107812: {'lr': 9.372661267018282e-05, 'samples': 20699904, 'steps': 107811, 'loss/train': 1.2462395429611206} 11/07/2021 12:24:46 - INFO - __main__ - Step 107813: {'lr': 9.372247052693275e-05, 'samples': 20700096, 'steps': 107812, 'loss/train': 2.0148534774780273} 11/07/2021 12:24:46 - INFO - __main__ - Step 107814: {'lr': 9.371832845409892e-05, 'samples': 20700288, 'steps': 107813, 'loss/train': 1.3834092617034912} 11/07/2021 12:24:47 - INFO - __main__ - Step 107815: {'lr': 9.37141864516832e-05, 'samples': 20700480, 'steps': 107814, 'loss/train': 1.3206197023391724} 11/07/2021 12:24:48 - INFO - __main__ - Step 107816: {'lr': 9.371004451968745e-05, 'samples': 20700672, 'steps': 107815, 'loss/train': 1.6711905002593994} 11/07/2021 12:24:48 - INFO - __main__ - Step 107817: {'lr': 9.370590265811355e-05, 'samples': 20700864, 'steps': 107816, 'loss/train': 1.2632386684417725} 11/07/2021 12:24:48 - INFO - __main__ - Step 107818: {'lr': 9.370176086696336e-05, 'samples': 20701056, 'steps': 107817, 'loss/train': 1.4610562324523926} 11/07/2021 12:24:49 - INFO - __main__ - Step 107819: {'lr': 9.369761914623884e-05, 'samples': 20701248, 'steps': 107818, 'loss/train': 0.9102050065994263} 11/07/2021 12:24:50 - INFO - __main__ - Step 107820: {'lr': 9.369347749594164e-05, 'samples': 20701440, 'steps': 107819, 'loss/train': 0.709972620010376} 11/07/2021 12:24:50 - INFO - __main__ - Step 107821: {'lr': 9.368933591607378e-05, 'samples': 20701632, 'steps': 107820, 'loss/train': 0.5036253333091736} 11/07/2021 12:24:51 - INFO - __main__ - Step 107822: {'lr': 9.368519440663709e-05, 'samples': 20701824, 'steps': 107821, 'loss/train': 0.9071140885353088} 11/07/2021 12:24:51 - INFO - __main__ - Step 107823: {'lr': 9.368105296763344e-05, 'samples': 20702016, 'steps': 107822, 'loss/train': 0.9716953039169312} 11/07/2021 12:24:51 - INFO - __main__ - Step 107824: {'lr': 9.367691159906466e-05, 'samples': 20702208, 'steps': 107823, 'loss/train': 1.2389118671417236} 11/07/2021 12:24:52 - INFO - __main__ - Step 107825: {'lr': 9.367277030093268e-05, 'samples': 20702400, 'steps': 107824, 'loss/train': 1.3601864576339722} 11/07/2021 12:24:53 - INFO - __main__ - Step 107826: {'lr': 9.366862907323934e-05, 'samples': 20702592, 'steps': 107825, 'loss/train': 1.0284702777862549} 11/07/2021 12:24:53 - INFO - __main__ - Step 107827: {'lr': 9.36644879159865e-05, 'samples': 20702784, 'steps': 107826, 'loss/train': 1.2611069679260254} 11/07/2021 12:24:53 - INFO - __main__ - Step 107828: {'lr': 9.366034682917604e-05, 'samples': 20702976, 'steps': 107827, 'loss/train': 1.409935712814331} 11/07/2021 12:24:54 - INFO - __main__ - Step 107829: {'lr': 9.365620581280979e-05, 'samples': 20703168, 'steps': 107828, 'loss/train': 1.9971991777420044} 11/07/2021 12:24:55 - INFO - __main__ - Step 107830: {'lr': 9.365206486688965e-05, 'samples': 20703360, 'steps': 107829, 'loss/train': 1.2270090579986572} 11/07/2021 12:24:55 - INFO - __main__ - Step 107831: {'lr': 9.36479239914175e-05, 'samples': 20703552, 'steps': 107830, 'loss/train': 1.5547841787338257} 11/07/2021 12:24:56 - INFO - __main__ - Step 107832: {'lr': 9.364378318639524e-05, 'samples': 20703744, 'steps': 107831, 'loss/train': 1.1817069053649902} 11/07/2021 12:24:56 - INFO - __main__ - Step 107833: {'lr': 9.36396424518246e-05, 'samples': 20703936, 'steps': 107832, 'loss/train': 1.285109281539917} 11/07/2021 12:24:56 - INFO - __main__ - Step 107834: {'lr': 9.363550178770754e-05, 'samples': 20704128, 'steps': 107833, 'loss/train': 0.6606548428535461} 11/07/2021 12:24:57 - INFO - __main__ - Step 107835: {'lr': 9.363136119404589e-05, 'samples': 20704320, 'steps': 107834, 'loss/train': 1.4841492176055908} 11/07/2021 12:24:58 - INFO - __main__ - Step 107836: {'lr': 9.362722067084156e-05, 'samples': 20704512, 'steps': 107835, 'loss/train': 0.5371600389480591} 11/07/2021 12:24:58 - INFO - __main__ - Step 107837: {'lr': 9.362308021809637e-05, 'samples': 20704704, 'steps': 107836, 'loss/train': 1.421675205230713} 11/07/2021 12:24:58 - INFO - __main__ - Step 107838: {'lr': 9.361893983581221e-05, 'samples': 20704896, 'steps': 107837, 'loss/train': 2.0219433307647705} 11/07/2021 12:24:59 - INFO - __main__ - Step 107839: {'lr': 9.361479952399093e-05, 'samples': 20705088, 'steps': 107838, 'loss/train': 1.335172176361084} 11/07/2021 12:24:59 - INFO - __main__ - Step 107840: {'lr': 9.361065928263443e-05, 'samples': 20705280, 'steps': 107839, 'loss/train': 0.7633854746818542} 11/07/2021 12:25:00 - INFO - __main__ - Step 107841: {'lr': 9.360651911174455e-05, 'samples': 20705472, 'steps': 107840, 'loss/train': 1.6135374307632446} 11/07/2021 12:25:00 - INFO - __main__ - Step 107842: {'lr': 9.360237901132316e-05, 'samples': 20705664, 'steps': 107841, 'loss/train': 1.4460698366165161} 11/07/2021 12:25:01 - INFO - __main__ - Step 107843: {'lr': 9.359823898137212e-05, 'samples': 20705856, 'steps': 107842, 'loss/train': 0.5256249308586121} 11/07/2021 12:25:01 - INFO - __main__ - Step 107844: {'lr': 9.35940990218933e-05, 'samples': 20706048, 'steps': 107843, 'loss/train': 1.3261126279830933} 11/07/2021 12:25:01 - INFO - __main__ - Step 107845: {'lr': 9.358995913288865e-05, 'samples': 20706240, 'steps': 107844, 'loss/train': 1.07716965675354} 11/07/2021 12:25:03 - INFO - __main__ - Step 107846: {'lr': 9.358581931435987e-05, 'samples': 20706432, 'steps': 107845, 'loss/train': 1.3760478496551514} 11/07/2021 12:25:03 - INFO - __main__ - Step 107847: {'lr': 9.358167956630889e-05, 'samples': 20706624, 'steps': 107846, 'loss/train': 1.2555288076400757} 11/07/2021 12:25:03 - INFO - __main__ - Step 107848: {'lr': 9.357753988873763e-05, 'samples': 20706816, 'steps': 107847, 'loss/train': 1.2422322034835815} 11/07/2021 12:25:04 - INFO - __main__ - Step 107849: {'lr': 9.357340028164787e-05, 'samples': 20707008, 'steps': 107848, 'loss/train': 1.2605531215667725} 11/07/2021 12:25:04 - INFO - __main__ - Step 107850: {'lr': 9.356926074504155e-05, 'samples': 20707200, 'steps': 107849, 'loss/train': 1.2684392929077148} 11/07/2021 12:25:05 - INFO - __main__ - Step 107851: {'lr': 9.35651212789205e-05, 'samples': 20707392, 'steps': 107850, 'loss/train': 1.2386751174926758} 11/07/2021 12:25:05 - INFO - __main__ - Step 107852: {'lr': 9.35609818832866e-05, 'samples': 20707584, 'steps': 107851, 'loss/train': 1.2622259855270386} 11/07/2021 12:25:06 - INFO - __main__ - Step 107853: {'lr': 9.35568425581417e-05, 'samples': 20707776, 'steps': 107852, 'loss/train': 1.4591217041015625} 11/07/2021 12:25:06 - INFO - __main__ - Step 107854: {'lr': 9.355270330348767e-05, 'samples': 20707968, 'steps': 107853, 'loss/train': 1.3256834745407104} 11/07/2021 12:25:06 - INFO - __main__ - Step 107855: {'lr': 9.354856411932639e-05, 'samples': 20708160, 'steps': 107854, 'loss/train': 1.119855523109436} 11/07/2021 12:25:08 - INFO - __main__ - Step 107856: {'lr': 9.354442500565968e-05, 'samples': 20708352, 'steps': 107855, 'loss/train': 1.4773911237716675} 11/07/2021 12:25:08 - INFO - __main__ - Step 107857: {'lr': 9.354028596248948e-05, 'samples': 20708544, 'steps': 107856, 'loss/train': 1.7157175540924072} 11/07/2021 12:25:08 - INFO - __main__ - Step 107858: {'lr': 9.353614698981761e-05, 'samples': 20708736, 'steps': 107857, 'loss/train': 1.3560137748718262} 11/07/2021 12:25:09 - INFO - __main__ - Step 107859: {'lr': 9.3532008087646e-05, 'samples': 20708928, 'steps': 107858, 'loss/train': 0.9941315054893494} 11/07/2021 12:25:09 - INFO - __main__ - Step 107860: {'lr': 9.352786925597636e-05, 'samples': 20709120, 'steps': 107859, 'loss/train': 1.516483187675476} 11/07/2021 12:25:10 - INFO - __main__ - Step 107861: {'lr': 9.352373049481067e-05, 'samples': 20709312, 'steps': 107860, 'loss/train': 1.3721060752868652} 11/07/2021 12:25:10 - INFO - __main__ - Step 107862: {'lr': 9.351959180415077e-05, 'samples': 20709504, 'steps': 107861, 'loss/train': 0.9944309592247009} 11/07/2021 12:25:11 - INFO - __main__ - Step 107863: {'lr': 9.351545318399851e-05, 'samples': 20709696, 'steps': 107862, 'loss/train': 1.3503587245941162} 11/07/2021 12:25:11 - INFO - __main__ - Step 107864: {'lr': 9.351131463435581e-05, 'samples': 20709888, 'steps': 107863, 'loss/train': 1.2682923078536987} 11/07/2021 12:25:11 - INFO - __main__ - Step 107865: {'lr': 9.350717615522444e-05, 'samples': 20710080, 'steps': 107864, 'loss/train': 1.4292038679122925} 11/07/2021 12:25:12 - INFO - __main__ - Step 107866: {'lr': 9.350303774660637e-05, 'samples': 20710272, 'steps': 107865, 'loss/train': 1.0334349870681763} 11/07/2021 12:25:13 - INFO - __main__ - Step 107867: {'lr': 9.34988994085034e-05, 'samples': 20710464, 'steps': 107866, 'loss/train': 1.4666415452957153} 11/07/2021 12:25:13 - INFO - __main__ - Step 107868: {'lr': 9.34947611409174e-05, 'samples': 20710656, 'steps': 107867, 'loss/train': 1.2044650316238403} 11/07/2021 12:25:14 - INFO - __main__ - Step 107869: {'lr': 9.349062294385027e-05, 'samples': 20710848, 'steps': 107868, 'loss/train': 1.1277378797531128} 11/07/2021 12:25:14 - INFO - __main__ - Step 107870: {'lr': 9.348648481730382e-05, 'samples': 20711040, 'steps': 107869, 'loss/train': 1.471817135810852} 11/07/2021 12:25:14 - INFO - __main__ - Step 107871: {'lr': 9.348234676127998e-05, 'samples': 20711232, 'steps': 107870, 'loss/train': 1.2165684700012207} 11/07/2021 12:25:15 - INFO - __main__ - Step 107872: {'lr': 9.347820877578064e-05, 'samples': 20711424, 'steps': 107871, 'loss/train': 0.10217444598674774} 11/07/2021 12:25:16 - INFO - __main__ - Step 107873: {'lr': 9.347407086080753e-05, 'samples': 20711616, 'steps': 107872, 'loss/train': 1.4057539701461792} 11/07/2021 12:25:16 - INFO - __main__ - Step 107874: {'lr': 9.346993301636256e-05, 'samples': 20711808, 'steps': 107873, 'loss/train': 1.556358814239502} 11/07/2021 12:25:17 - INFO - __main__ - Step 107875: {'lr': 9.346579524244767e-05, 'samples': 20712000, 'steps': 107874, 'loss/train': 1.1894478797912598} 11/07/2021 12:25:17 - INFO - __main__ - Step 107876: {'lr': 9.346165753906464e-05, 'samples': 20712192, 'steps': 107875, 'loss/train': 1.3047250509262085} 11/07/2021 12:25:18 - INFO - __main__ - Step 107877: {'lr': 9.345751990621537e-05, 'samples': 20712384, 'steps': 107876, 'loss/train': 1.182763695716858} 11/07/2021 12:25:18 - INFO - __main__ - Step 107878: {'lr': 9.345338234390174e-05, 'samples': 20712576, 'steps': 107877, 'loss/train': 1.3833993673324585} 11/07/2021 12:25:19 - INFO - __main__ - Step 107879: {'lr': 9.344924485212561e-05, 'samples': 20712768, 'steps': 107878, 'loss/train': 0.7837182283401489} 11/07/2021 12:25:19 - INFO - __main__ - Step 107880: {'lr': 9.344510743088882e-05, 'samples': 20712960, 'steps': 107879, 'loss/train': 0.7335546016693115} 11/07/2021 12:25:19 - INFO - __main__ - Step 107881: {'lr': 9.344097008019326e-05, 'samples': 20713152, 'steps': 107880, 'loss/train': 1.0599346160888672} 11/07/2021 12:25:21 - INFO - __main__ - Step 107882: {'lr': 9.343683280004078e-05, 'samples': 20713344, 'steps': 107881, 'loss/train': 2.1308703422546387} 11/07/2021 12:25:21 - INFO - __main__ - Step 107883: {'lr': 9.343269559043324e-05, 'samples': 20713536, 'steps': 107882, 'loss/train': 1.479851484298706} 11/07/2021 12:25:21 - INFO - __main__ - Step 107884: {'lr': 9.342855845137252e-05, 'samples': 20713728, 'steps': 107883, 'loss/train': 1.9554352760314941} 11/07/2021 12:25:22 - INFO - __main__ - Step 107885: {'lr': 9.342442138286048e-05, 'samples': 20713920, 'steps': 107884, 'loss/train': 1.5484787225723267} 11/07/2021 12:25:22 - INFO - __main__ - Step 107886: {'lr': 9.342028438489905e-05, 'samples': 20714112, 'steps': 107885, 'loss/train': 1.5930277109146118} 11/07/2021 12:25:23 - INFO - __main__ - Step 107887: {'lr': 9.341614745748995e-05, 'samples': 20714304, 'steps': 107886, 'loss/train': 1.4718036651611328} 11/07/2021 12:25:23 - INFO - __main__ - Step 107888: {'lr': 9.34120106006351e-05, 'samples': 20714496, 'steps': 107887, 'loss/train': 0.3395828604698181} 11/07/2021 12:25:24 - INFO - __main__ - Step 107889: {'lr': 9.340787381433638e-05, 'samples': 20714688, 'steps': 107888, 'loss/train': 1.1698247194290161} 11/07/2021 12:25:24 - INFO - __main__ - Step 107890: {'lr': 9.340373709859567e-05, 'samples': 20714880, 'steps': 107889, 'loss/train': 1.1474895477294922} 11/07/2021 12:25:24 - INFO - __main__ - Step 107891: {'lr': 9.339960045341483e-05, 'samples': 20715072, 'steps': 107890, 'loss/train': 1.8113093376159668} 11/07/2021 12:25:26 - INFO - __main__ - Step 107892: {'lr': 9.339546387879568e-05, 'samples': 20715264, 'steps': 107891, 'loss/train': 0.16035214066505432} 11/07/2021 12:25:26 - INFO - __main__ - Step 107893: {'lr': 9.339132737474015e-05, 'samples': 20715456, 'steps': 107892, 'loss/train': 1.265616536140442} 11/07/2021 12:25:26 - INFO - __main__ - Step 107894: {'lr': 9.338719094125007e-05, 'samples': 20715648, 'steps': 107893, 'loss/train': 1.381485104560852} 11/07/2021 12:25:27 - INFO - __main__ - Step 107895: {'lr': 9.33830545783273e-05, 'samples': 20715840, 'steps': 107894, 'loss/train': 0.6154116988182068} 11/07/2021 12:25:27 - INFO - __main__ - Step 107896: {'lr': 9.33789182859737e-05, 'samples': 20716032, 'steps': 107895, 'loss/train': 1.0629322528839111} 11/07/2021 12:25:28 - INFO - __main__ - Step 107897: {'lr': 9.337478206419115e-05, 'samples': 20716224, 'steps': 107896, 'loss/train': 1.1512749195098877} 11/07/2021 12:25:29 - INFO - __main__ - Step 107898: {'lr': 9.33706459129815e-05, 'samples': 20716416, 'steps': 107897, 'loss/train': 1.145039677619934} 11/07/2021 12:25:29 - INFO - __main__ - Step 107899: {'lr': 9.336650983234671e-05, 'samples': 20716608, 'steps': 107898, 'loss/train': 1.0342243909835815} 11/07/2021 12:25:29 - INFO - __main__ - Step 107900: {'lr': 9.336237382228846e-05, 'samples': 20716800, 'steps': 107899, 'loss/train': 1.5204002857208252} 11/07/2021 12:25:30 - INFO - __main__ - Step 107901: {'lr': 9.335823788280873e-05, 'samples': 20716992, 'steps': 107900, 'loss/train': 0.443486750125885} 11/07/2021 12:25:31 - INFO - __main__ - Step 107902: {'lr': 9.335410201390934e-05, 'samples': 20717184, 'steps': 107901, 'loss/train': 1.6502881050109863} 11/07/2021 12:25:31 - INFO - __main__ - Step 107903: {'lr': 9.334996621559219e-05, 'samples': 20717376, 'steps': 107902, 'loss/train': 1.161240577697754} 11/07/2021 12:25:31 - INFO - __main__ - Step 107904: {'lr': 9.33458304878591e-05, 'samples': 20717568, 'steps': 107903, 'loss/train': 1.467363953590393} 11/07/2021 12:25:32 - INFO - __main__ - Step 107905: {'lr': 9.334169483071201e-05, 'samples': 20717760, 'steps': 107904, 'loss/train': 0.9648863077163696} 11/07/2021 12:25:32 - INFO - __main__ - Step 107906: {'lr': 9.333755924415269e-05, 'samples': 20717952, 'steps': 107905, 'loss/train': 1.7008779048919678} 11/07/2021 12:25:32 - INFO - __main__ - Step 107907: {'lr': 9.333342372818307e-05, 'samples': 20718144, 'steps': 107906, 'loss/train': 1.2170090675354004} 11/07/2021 12:25:33 - INFO - __main__ - Step 107908: {'lr': 9.332928828280499e-05, 'samples': 20718336, 'steps': 107907, 'loss/train': 1.3436702489852905} 11/07/2021 12:25:34 - INFO - __main__ - Step 107909: {'lr': 9.332515290802029e-05, 'samples': 20718528, 'steps': 107908, 'loss/train': 1.8727067708969116} 11/07/2021 12:25:34 - INFO - __main__ - Step 107910: {'lr': 9.332101760383088e-05, 'samples': 20718720, 'steps': 107909, 'loss/train': 1.7114132642745972} 11/07/2021 12:25:35 - INFO - __main__ - Step 107911: {'lr': 9.331688237023861e-05, 'samples': 20718912, 'steps': 107910, 'loss/train': 1.2484219074249268} 11/07/2021 12:25:35 - INFO - __main__ - Step 107912: {'lr': 9.331274720724531e-05, 'samples': 20719104, 'steps': 107911, 'loss/train': 1.1808760166168213} 11/07/2021 12:25:36 - INFO - __main__ - Step 107913: {'lr': 9.330861211485298e-05, 'samples': 20719296, 'steps': 107912, 'loss/train': 1.1000326871871948} 11/07/2021 12:25:36 - INFO - __main__ - Step 107914: {'lr': 9.330447709306328e-05, 'samples': 20719488, 'steps': 107913, 'loss/train': 1.4758144617080688} 11/07/2021 12:25:37 - INFO - __main__ - Step 107915: {'lr': 9.330034214187816e-05, 'samples': 20719680, 'steps': 107914, 'loss/train': 1.3068596124649048} 11/07/2021 12:25:37 - INFO - __main__ - Step 107916: {'lr': 9.329620726129948e-05, 'samples': 20719872, 'steps': 107915, 'loss/train': 1.4766052961349487} 11/07/2021 12:25:37 - INFO - __main__ - Step 107917: {'lr': 9.329207245132912e-05, 'samples': 20720064, 'steps': 107916, 'loss/train': 1.0065937042236328} 11/07/2021 12:25:38 - INFO - __main__ - Step 107918: {'lr': 9.328793771196892e-05, 'samples': 20720256, 'steps': 107917, 'loss/train': 0.9692122936248779} 11/07/2021 12:25:39 - INFO - __main__ - Step 107919: {'lr': 9.328380304322078e-05, 'samples': 20720448, 'steps': 107918, 'loss/train': 1.0606292486190796} 11/07/2021 12:25:39 - INFO - __main__ - Step 107920: {'lr': 9.327966844508654e-05, 'samples': 20720640, 'steps': 107919, 'loss/train': 0.055504001677036285} 11/07/2021 12:25:40 - INFO - __main__ - Step 107921: {'lr': 9.327553391756804e-05, 'samples': 20720832, 'steps': 107920, 'loss/train': 1.3343132734298706} 11/07/2021 12:25:40 - INFO - __main__ - Step 107922: {'lr': 9.327139946066718e-05, 'samples': 20721024, 'steps': 107921, 'loss/train': 0.7159186601638794} 11/07/2021 12:25:40 - INFO - __main__ - Step 107923: {'lr': 9.32672650743858e-05, 'samples': 20721216, 'steps': 107922, 'loss/train': 1.0699976682662964} 11/07/2021 12:25:41 - INFO - __main__ - Step 107924: {'lr': 9.326313075872578e-05, 'samples': 20721408, 'steps': 107923, 'loss/train': 0.322318434715271} 11/07/2021 12:25:42 - INFO - __main__ - Step 107925: {'lr': 9.325899651368897e-05, 'samples': 20721600, 'steps': 107924, 'loss/train': 0.7999330163002014} 11/07/2021 12:25:42 - INFO - __main__ - Step 107926: {'lr': 9.325486233927732e-05, 'samples': 20721792, 'steps': 107925, 'loss/train': 0.7400789260864258} 11/07/2021 12:25:42 - INFO - __main__ - Step 107927: {'lr': 9.325072823549256e-05, 'samples': 20721984, 'steps': 107926, 'loss/train': 1.2941838502883911} 11/07/2021 12:25:43 - INFO - __main__ - Step 107928: {'lr': 9.324659420233655e-05, 'samples': 20722176, 'steps': 107927, 'loss/train': 1.4900838136672974} 11/07/2021 12:25:44 - INFO - __main__ - Step 107929: {'lr': 9.324246023981123e-05, 'samples': 20722368, 'steps': 107928, 'loss/train': 1.2108575105667114} 11/07/2021 12:25:44 - INFO - __main__ - Step 107930: {'lr': 9.323832634791846e-05, 'samples': 20722560, 'steps': 107929, 'loss/train': 0.9690811038017273} 11/07/2021 12:25:44 - INFO - __main__ - Step 107931: {'lr': 9.323419252666004e-05, 'samples': 20722752, 'steps': 107930, 'loss/train': 0.6833871603012085} 11/07/2021 12:25:45 - INFO - __main__ - Step 107932: {'lr': 9.323005877603791e-05, 'samples': 20722944, 'steps': 107931, 'loss/train': 1.5374304056167603} 11/07/2021 12:25:45 - INFO - __main__ - Step 107933: {'lr': 9.322592509605388e-05, 'samples': 20723136, 'steps': 107932, 'loss/train': 1.7136651277542114} 11/07/2021 12:25:46 - INFO - __main__ - Step 107934: {'lr': 9.322179148670981e-05, 'samples': 20723328, 'steps': 107933, 'loss/train': 1.1808593273162842} 11/07/2021 12:25:46 - INFO - __main__ - Step 107935: {'lr': 9.32176579480076e-05, 'samples': 20723520, 'steps': 107934, 'loss/train': 1.2367923259735107} 11/07/2021 12:25:47 - INFO - __main__ - Step 107936: {'lr': 9.32135244799491e-05, 'samples': 20723712, 'steps': 107935, 'loss/train': 1.780774712562561} 11/07/2021 12:25:47 - INFO - __main__ - Step 107937: {'lr': 9.320939108253618e-05, 'samples': 20723904, 'steps': 107936, 'loss/train': 1.9837968349456787} 11/07/2021 12:25:48 - INFO - __main__ - Step 107938: {'lr': 9.320525775577065e-05, 'samples': 20724096, 'steps': 107937, 'loss/train': 1.4850119352340698} 11/07/2021 12:25:49 - INFO - __main__ - Step 107939: {'lr': 9.320112449965446e-05, 'samples': 20724288, 'steps': 107938, 'loss/train': 1.33302640914917} 11/07/2021 12:25:49 - INFO - __main__ - Step 107940: {'lr': 9.319699131418946e-05, 'samples': 20724480, 'steps': 107939, 'loss/train': 1.4399338960647583} 11/07/2021 12:25:49 - INFO - __main__ - Step 107941: {'lr': 9.319285819937742e-05, 'samples': 20724672, 'steps': 107940, 'loss/train': 1.3635996580123901} 11/07/2021 12:25:50 - INFO - __main__ - Step 107942: {'lr': 9.318872515522026e-05, 'samples': 20724864, 'steps': 107941, 'loss/train': 1.1046584844589233} 11/07/2021 12:25:50 - INFO - __main__ - Step 107943: {'lr': 9.318459218171982e-05, 'samples': 20725056, 'steps': 107942, 'loss/train': 1.9914077520370483} 11/07/2021 12:25:51 - INFO - __main__ - Step 107944: {'lr': 9.3180459278878e-05, 'samples': 20725248, 'steps': 107943, 'loss/train': 1.3793425559997559} 11/07/2021 12:25:51 - INFO - __main__ - Step 107945: {'lr': 9.317632644669662e-05, 'samples': 20725440, 'steps': 107944, 'loss/train': 0.9407713413238525} 11/07/2021 12:25:52 - INFO - __main__ - Step 107946: {'lr': 9.317219368517759e-05, 'samples': 20725632, 'steps': 107945, 'loss/train': 1.2713770866394043} 11/07/2021 12:25:52 - INFO - __main__ - Step 107947: {'lr': 9.316806099432276e-05, 'samples': 20725824, 'steps': 107946, 'loss/train': 1.192600131034851} 11/07/2021 12:25:52 - INFO - __main__ - Step 107948: {'lr': 9.316392837413396e-05, 'samples': 20726016, 'steps': 107947, 'loss/train': 1.0337837934494019} 11/07/2021 12:25:53 - INFO - __main__ - Step 107949: {'lr': 9.31597958246131e-05, 'samples': 20726208, 'steps': 107948, 'loss/train': 0.7187107801437378} 11/07/2021 12:25:54 - INFO - __main__ - Step 107950: {'lr': 9.315566334576197e-05, 'samples': 20726400, 'steps': 107949, 'loss/train': 0.8932701945304871} 11/07/2021 12:25:54 - INFO - __main__ - Step 107951: {'lr': 9.315153093758249e-05, 'samples': 20726592, 'steps': 107950, 'loss/train': 1.1819391250610352} 11/07/2021 12:25:55 - INFO - __main__ - Step 107952: {'lr': 9.314739860007654e-05, 'samples': 20726784, 'steps': 107951, 'loss/train': 1.518740177154541} 11/07/2021 12:25:55 - INFO - __main__ - Step 107953: {'lr': 9.314326633324602e-05, 'samples': 20726976, 'steps': 107952, 'loss/train': 1.3048781156539917} 11/07/2021 12:25:56 - INFO - __main__ - Step 107954: {'lr': 9.313913413709266e-05, 'samples': 20727168, 'steps': 107953, 'loss/train': 0.6794820427894592} 11/07/2021 12:25:56 - INFO - __main__ - Step 107955: {'lr': 9.313500201161834e-05, 'samples': 20727360, 'steps': 107954, 'loss/train': 0.20461505651474} 11/07/2021 12:25:57 - INFO - __main__ - Step 107956: {'lr': 9.313086995682501e-05, 'samples': 20727552, 'steps': 107955, 'loss/train': 1.3463687896728516} 11/07/2021 12:25:57 - INFO - __main__ - Step 107957: {'lr': 9.312673797271447e-05, 'samples': 20727744, 'steps': 107956, 'loss/train': 1.4938501119613647} 11/07/2021 12:25:57 - INFO - __main__ - Step 107958: {'lr': 9.31226060592886e-05, 'samples': 20727936, 'steps': 107957, 'loss/train': 0.9650664329528809} 11/07/2021 12:25:58 - INFO - __main__ - Step 107959: {'lr': 9.311847421654926e-05, 'samples': 20728128, 'steps': 107958, 'loss/train': 1.3073344230651855} 11/07/2021 12:25:59 - INFO - __main__ - Step 107960: {'lr': 9.311434244449831e-05, 'samples': 20728320, 'steps': 107959, 'loss/train': 0.9898633360862732} 11/07/2021 12:25:59 - INFO - __main__ - Step 107961: {'lr': 9.311021074313763e-05, 'samples': 20728512, 'steps': 107960, 'loss/train': 1.417095422744751} 11/07/2021 12:25:59 - INFO - __main__ - Step 107962: {'lr': 9.310607911246907e-05, 'samples': 20728704, 'steps': 107961, 'loss/train': 1.1931499242782593} 11/07/2021 12:26:00 - INFO - __main__ - Step 107963: {'lr': 9.310194755249449e-05, 'samples': 20728896, 'steps': 107962, 'loss/train': 1.2115873098373413} 11/07/2021 12:26:00 - INFO - __main__ - Step 107964: {'lr': 9.309781606321576e-05, 'samples': 20729088, 'steps': 107963, 'loss/train': 0.8399997353553772} 11/07/2021 12:26:01 - INFO - __main__ - Step 107965: {'lr': 9.309368464463473e-05, 'samples': 20729280, 'steps': 107964, 'loss/train': 1.4692449569702148} 11/07/2021 12:26:02 - INFO - __main__ - Step 107966: {'lr': 9.308955329675333e-05, 'samples': 20729472, 'steps': 107965, 'loss/train': 1.5471900701522827} 11/07/2021 12:26:02 - INFO - __main__ - Step 107967: {'lr': 9.30854220195733e-05, 'samples': 20729664, 'steps': 107966, 'loss/train': 1.2584402561187744} 11/07/2021 12:26:02 - INFO - __main__ - Step 107968: {'lr': 9.308129081309652e-05, 'samples': 20729856, 'steps': 107967, 'loss/train': 1.544921875} 11/07/2021 12:26:03 - INFO - __main__ - Step 107969: {'lr': 9.307715967732491e-05, 'samples': 20730048, 'steps': 107968, 'loss/train': 1.0096837282180786} 11/07/2021 12:26:04 - INFO - __main__ - Step 107970: {'lr': 9.30730286122603e-05, 'samples': 20730240, 'steps': 107969, 'loss/train': 1.1565172672271729} 11/07/2021 12:26:04 - INFO - __main__ - Step 107971: {'lr': 9.306889761790458e-05, 'samples': 20730432, 'steps': 107970, 'loss/train': 1.1328755617141724} 11/07/2021 12:26:04 - INFO - __main__ - Step 107972: {'lr': 9.306476669425957e-05, 'samples': 20730624, 'steps': 107971, 'loss/train': 1.1492810249328613} 11/07/2021 12:26:05 - INFO - __main__ - Step 107973: {'lr': 9.306063584132717e-05, 'samples': 20730816, 'steps': 107972, 'loss/train': 1.1026725769042969} 11/07/2021 12:26:05 - INFO - __main__ - Step 107974: {'lr': 9.305650505910922e-05, 'samples': 20731008, 'steps': 107973, 'loss/train': 1.2260016202926636} 11/07/2021 12:26:06 - INFO - __main__ - Step 107975: {'lr': 9.305237434760758e-05, 'samples': 20731200, 'steps': 107974, 'loss/train': 1.2583353519439697} 11/07/2021 12:26:07 - INFO - __main__ - Step 107976: {'lr': 9.304824370682414e-05, 'samples': 20731392, 'steps': 107975, 'loss/train': 1.740592360496521} 11/07/2021 12:26:07 - INFO - __main__ - Step 107977: {'lr': 9.304411313676073e-05, 'samples': 20731584, 'steps': 107976, 'loss/train': 1.2652274370193481} 11/07/2021 12:26:07 - INFO - __main__ - Step 107978: {'lr': 9.303998263741923e-05, 'samples': 20731776, 'steps': 107977, 'loss/train': 1.5417091846466064} 11/07/2021 12:26:08 - INFO - __main__ - Step 107979: {'lr': 9.303585220880146e-05, 'samples': 20731968, 'steps': 107978, 'loss/train': 1.210779070854187} 11/07/2021 12:26:09 - INFO - __main__ - Step 107980: {'lr': 9.303172185090941e-05, 'samples': 20732160, 'steps': 107979, 'loss/train': 1.1589632034301758} 11/07/2021 12:26:09 - INFO - __main__ - Step 107981: {'lr': 9.302759156374477e-05, 'samples': 20732352, 'steps': 107980, 'loss/train': 1.3709659576416016} 11/07/2021 12:26:09 - INFO - __main__ - Step 107982: {'lr': 9.30234613473095e-05, 'samples': 20732544, 'steps': 107981, 'loss/train': 1.7597010135650635} 11/07/2021 12:26:10 - INFO - __main__ - Step 107983: {'lr': 9.301933120160538e-05, 'samples': 20732736, 'steps': 107982, 'loss/train': 3.662371873855591} 11/07/2021 12:26:10 - INFO - __main__ - Step 107984: {'lr': 9.301520112663437e-05, 'samples': 20732928, 'steps': 107983, 'loss/train': 1.539462685585022} 11/07/2021 12:26:11 - INFO - __main__ - Step 107985: {'lr': 9.301107112239826e-05, 'samples': 20733120, 'steps': 107984, 'loss/train': 1.1627376079559326} 11/07/2021 12:26:11 - INFO - __main__ - Step 107986: {'lr': 9.300694118889896e-05, 'samples': 20733312, 'steps': 107985, 'loss/train': 1.607412338256836} 11/07/2021 12:26:12 - INFO - __main__ - Step 107987: {'lr': 9.30028113261383e-05, 'samples': 20733504, 'steps': 107986, 'loss/train': 1.275806188583374} 11/07/2021 12:26:12 - INFO - __main__ - Step 107988: {'lr': 9.299868153411814e-05, 'samples': 20733696, 'steps': 107987, 'loss/train': 1.2375043630599976} 11/07/2021 12:26:12 - INFO - __main__ - Step 107989: {'lr': 9.299455181284036e-05, 'samples': 20733888, 'steps': 107988, 'loss/train': 1.634045958518982} 11/07/2021 12:26:13 - INFO - __main__ - Step 107990: {'lr': 9.299042216230682e-05, 'samples': 20734080, 'steps': 107989, 'loss/train': 1.404764175415039} 11/07/2021 12:26:14 - INFO - __main__ - Step 107991: {'lr': 9.298629258251936e-05, 'samples': 20734272, 'steps': 107990, 'loss/train': 1.2627549171447754} 11/07/2021 12:26:15 - INFO - __main__ - Step 107992: {'lr': 9.298216307347988e-05, 'samples': 20734464, 'steps': 107991, 'loss/train': 1.2513351440429688} 11/07/2021 12:26:15 - INFO - __main__ - Step 107993: {'lr': 9.297803363519029e-05, 'samples': 20734656, 'steps': 107992, 'loss/train': 0.9514933228492737} 11/07/2021 12:26:16 - INFO - __main__ - Step 107994: {'lr': 9.297390426765226e-05, 'samples': 20734848, 'steps': 107993, 'loss/train': 1.7470157146453857} 11/07/2021 12:26:16 - INFO - __main__ - Step 107995: {'lr': 9.29697749708678e-05, 'samples': 20735040, 'steps': 107994, 'loss/train': 1.7504838705062866} 11/07/2021 12:26:16 - INFO - __main__ - Step 107996: {'lr': 9.296564574483873e-05, 'samples': 20735232, 'steps': 107995, 'loss/train': 1.4117892980575562} 11/07/2021 12:26:17 - INFO - __main__ - Step 107997: {'lr': 9.296151658956689e-05, 'samples': 20735424, 'steps': 107996, 'loss/train': 1.8281060457229614} 11/07/2021 12:26:18 - INFO - __main__ - Step 107998: {'lr': 9.295738750505419e-05, 'samples': 20735616, 'steps': 107997, 'loss/train': 1.3647775650024414} 11/07/2021 12:26:18 - INFO - __main__ - Step 107999: {'lr': 9.295325849130249e-05, 'samples': 20735808, 'steps': 107998, 'loss/train': 1.1171550750732422} 11/07/2021 12:26:18 - INFO - __main__ - Step 108000: {'lr': 9.294912954831359e-05, 'samples': 20736000, 'steps': 107999, 'loss/train': 1.5057704448699951} 11/07/2021 12:26:19 - INFO - __main__ - Step 108001: {'lr': 9.29450006760894e-05, 'samples': 20736192, 'steps': 108000, 'loss/train': 1.4059938192367554} 11/07/2021 12:26:20 - INFO - __main__ - Step 108002: {'lr': 9.294087187463176e-05, 'samples': 20736384, 'steps': 108001, 'loss/train': 1.1058043241500854} 11/07/2021 12:26:20 - INFO - __main__ - Step 108003: {'lr': 9.293674314394258e-05, 'samples': 20736576, 'steps': 108002, 'loss/train': 1.5379176139831543} 11/07/2021 12:26:20 - INFO - __main__ - Step 108004: {'lr': 9.293261448402363e-05, 'samples': 20736768, 'steps': 108003, 'loss/train': 0.43849459290504456} 11/07/2021 12:26:21 - INFO - __main__ - Step 108005: {'lr': 9.292848589487684e-05, 'samples': 20736960, 'steps': 108004, 'loss/train': 2.601500988006592} 11/07/2021 12:26:21 - INFO - __main__ - Step 108006: {'lr': 9.292435737650406e-05, 'samples': 20737152, 'steps': 108005, 'loss/train': 0.9081299901008606} 11/07/2021 12:26:22 - INFO - __main__ - Step 108007: {'lr': 9.292022892890723e-05, 'samples': 20737344, 'steps': 108006, 'loss/train': 1.0446826219558716} 11/07/2021 12:26:23 - INFO - __main__ - Step 108008: {'lr': 9.291610055208802e-05, 'samples': 20737536, 'steps': 108007, 'loss/train': 1.488263487815857} 11/07/2021 12:26:23 - INFO - __main__ - Step 108009: {'lr': 9.291197224604839e-05, 'samples': 20737728, 'steps': 108008, 'loss/train': 1.0438975095748901} 11/07/2021 12:26:23 - INFO - __main__ - Step 108010: {'lr': 9.29078440107902e-05, 'samples': 20737920, 'steps': 108009, 'loss/train': 1.1672606468200684} 11/07/2021 12:26:24 - INFO - __main__ - Step 108011: {'lr': 9.290371584631532e-05, 'samples': 20738112, 'steps': 108010, 'loss/train': 1.4436194896697998} 11/07/2021 12:26:24 - INFO - __main__ - Step 108012: {'lr': 9.289958775262561e-05, 'samples': 20738304, 'steps': 108011, 'loss/train': 1.1434088945388794} 11/07/2021 12:26:25 - INFO - __main__ - Step 108013: {'lr': 9.28954597297229e-05, 'samples': 20738496, 'steps': 108012, 'loss/train': 0.15607145428657532} 11/07/2021 12:26:25 - INFO - __main__ - Step 108014: {'lr': 9.289133177760908e-05, 'samples': 20738688, 'steps': 108013, 'loss/train': 1.3154873847961426} 11/07/2021 12:26:26 - INFO - __main__ - Step 108015: {'lr': 9.2887203896286e-05, 'samples': 20738880, 'steps': 108014, 'loss/train': 1.9798985719680786} 11/07/2021 12:26:26 - INFO - __main__ - Step 108016: {'lr': 9.288307608575552e-05, 'samples': 20739072, 'steps': 108015, 'loss/train': 1.5847796201705933} 11/07/2021 12:26:26 - INFO - __main__ - Step 108017: {'lr': 9.287894834601951e-05, 'samples': 20739264, 'steps': 108016, 'loss/train': 1.3706971406936646} 11/07/2021 12:26:27 - INFO - __main__ - Step 108018: {'lr': 9.287482067707983e-05, 'samples': 20739456, 'steps': 108017, 'loss/train': 1.46781587600708} 11/07/2021 12:26:28 - INFO - __main__ - Step 108019: {'lr': 9.28706930789384e-05, 'samples': 20739648, 'steps': 108018, 'loss/train': 1.5221140384674072} 11/07/2021 12:26:28 - INFO - __main__ - Step 108020: {'lr': 9.286656555159692e-05, 'samples': 20739840, 'steps': 108019, 'loss/train': 1.9185564517974854} 11/07/2021 12:26:28 - INFO - __main__ - Step 108021: {'lr': 9.286243809505738e-05, 'samples': 20740032, 'steps': 108020, 'loss/train': 1.4136637449264526} 11/07/2021 12:26:29 - INFO - __main__ - Step 108022: {'lr': 9.285831070932155e-05, 'samples': 20740224, 'steps': 108021, 'loss/train': 1.3169881105422974} 11/07/2021 12:26:30 - INFO - __main__ - Step 108023: {'lr': 9.285418339439136e-05, 'samples': 20740416, 'steps': 108022, 'loss/train': 1.3443727493286133} 11/07/2021 12:26:30 - INFO - __main__ - Step 108024: {'lr': 9.285005615026865e-05, 'samples': 20740608, 'steps': 108023, 'loss/train': 1.2094887495040894} 11/07/2021 12:26:31 - INFO - __main__ - Step 108025: {'lr': 9.28459289769553e-05, 'samples': 20740800, 'steps': 108024, 'loss/train': 1.1993845701217651} 11/07/2021 12:26:31 - INFO - __main__ - Step 108026: {'lr': 9.284180187445312e-05, 'samples': 20740992, 'steps': 108025, 'loss/train': 1.251917839050293} 11/07/2021 12:26:31 - INFO - __main__ - Step 108027: {'lr': 9.2837674842764e-05, 'samples': 20741184, 'steps': 108026, 'loss/train': 1.672936201095581} 11/07/2021 12:26:32 - INFO - __main__ - Step 108028: {'lr': 9.283354788188982e-05, 'samples': 20741376, 'steps': 108027, 'loss/train': 1.2721898555755615} 11/07/2021 12:26:33 - INFO - __main__ - Step 108029: {'lr': 9.282942099183242e-05, 'samples': 20741568, 'steps': 108028, 'loss/train': 1.532462239265442} 11/07/2021 12:26:33 - INFO - __main__ - Step 108030: {'lr': 9.282529417259372e-05, 'samples': 20741760, 'steps': 108029, 'loss/train': 1.2444334030151367} 11/07/2021 12:26:34 - INFO - __main__ - Step 108031: {'lr': 9.282116742417543e-05, 'samples': 20741952, 'steps': 108030, 'loss/train': 1.3670628070831299} 11/07/2021 12:26:34 - INFO - __main__ - Step 108032: {'lr': 9.281704074657951e-05, 'samples': 20742144, 'steps': 108031, 'loss/train': 1.013521432876587} 11/07/2021 12:26:35 - INFO - __main__ - Step 108033: {'lr': 9.28129141398078e-05, 'samples': 20742336, 'steps': 108032, 'loss/train': 0.5855145454406738} 11/07/2021 12:26:35 - INFO - __main__ - Step 108034: {'lr': 9.280878760386218e-05, 'samples': 20742528, 'steps': 108033, 'loss/train': 1.23637056350708} 11/07/2021 12:26:36 - INFO - __main__ - Step 108035: {'lr': 9.280466113874447e-05, 'samples': 20742720, 'steps': 108034, 'loss/train': 0.5475242137908936} 11/07/2021 12:26:36 - INFO - __main__ - Step 108036: {'lr': 9.280053474445657e-05, 'samples': 20742912, 'steps': 108035, 'loss/train': 1.2796614170074463} 11/07/2021 12:26:36 - INFO - __main__ - Step 108037: {'lr': 9.279640842100035e-05, 'samples': 20743104, 'steps': 108036, 'loss/train': 1.056008219718933} 11/07/2021 12:26:37 - INFO - __main__ - Step 108038: {'lr': 9.27922821683776e-05, 'samples': 20743296, 'steps': 108037, 'loss/train': 0.7505257725715637} 11/07/2021 12:26:38 - INFO - __main__ - Step 108039: {'lr': 9.278815598659024e-05, 'samples': 20743488, 'steps': 108038, 'loss/train': 1.0553101301193237} 11/07/2021 12:26:38 - INFO - __main__ - Step 108040: {'lr': 9.278402987564011e-05, 'samples': 20743680, 'steps': 108039, 'loss/train': 1.2291499376296997} 11/07/2021 12:26:39 - INFO - __main__ - Step 108041: {'lr': 9.277990383552914e-05, 'samples': 20743872, 'steps': 108040, 'loss/train': 1.471685528755188} 11/07/2021 12:26:39 - INFO - __main__ - Step 108042: {'lr': 9.277577786625904e-05, 'samples': 20744064, 'steps': 108041, 'loss/train': 1.6929372549057007} 11/07/2021 12:26:39 - INFO - __main__ - Step 108043: {'lr': 9.277165196783177e-05, 'samples': 20744256, 'steps': 108042, 'loss/train': 1.65022611618042} 11/07/2021 12:26:40 - INFO - __main__ - Step 108044: {'lr': 9.276752614024914e-05, 'samples': 20744448, 'steps': 108043, 'loss/train': 1.1157127618789673} 11/07/2021 12:26:41 - INFO - __main__ - Step 108045: {'lr': 9.276340038351305e-05, 'samples': 20744640, 'steps': 108044, 'loss/train': 1.6000571250915527} 11/07/2021 12:26:41 - INFO - __main__ - Step 108046: {'lr': 9.275927469762535e-05, 'samples': 20744832, 'steps': 108045, 'loss/train': 1.0169578790664673} 11/07/2021 12:26:41 - INFO - __main__ - Step 108047: {'lr': 9.27551490825879e-05, 'samples': 20745024, 'steps': 108046, 'loss/train': 1.7386929988861084} 11/07/2021 12:26:42 - INFO - __main__ - Step 108048: {'lr': 9.275102353840253e-05, 'samples': 20745216, 'steps': 108047, 'loss/train': 1.0621000528335571} 11/07/2021 12:26:43 - INFO - __main__ - Step 108049: {'lr': 9.274689806507114e-05, 'samples': 20745408, 'steps': 108048, 'loss/train': 1.3542945384979248} 11/07/2021 12:26:43 - INFO - __main__ - Step 108050: {'lr': 9.274277266259557e-05, 'samples': 20745600, 'steps': 108049, 'loss/train': 1.6817667484283447} 11/07/2021 12:26:43 - INFO - __main__ - Step 108051: {'lr': 9.273864733097775e-05, 'samples': 20745792, 'steps': 108050, 'loss/train': 1.1464811563491821} 11/07/2021 12:26:44 - INFO - __main__ - Step 108052: {'lr': 9.27345220702194e-05, 'samples': 20745984, 'steps': 108051, 'loss/train': 1.581311583518982} 11/07/2021 12:26:44 - INFO - __main__ - Step 108053: {'lr': 9.273039688032244e-05, 'samples': 20746176, 'steps': 108052, 'loss/train': 0.4640374183654785} 11/07/2021 12:26:45 - INFO - __main__ - Step 108054: {'lr': 9.272627176128873e-05, 'samples': 20746368, 'steps': 108053, 'loss/train': 1.5240975618362427} 11/07/2021 12:26:46 - INFO - __main__ - Step 108055: {'lr': 9.272214671312015e-05, 'samples': 20746560, 'steps': 108054, 'loss/train': 1.6478278636932373} 11/07/2021 12:26:46 - INFO - __main__ - Step 108056: {'lr': 9.271802173581854e-05, 'samples': 20746752, 'steps': 108055, 'loss/train': 1.5108721256256104} 11/07/2021 12:26:46 - INFO - __main__ - Step 108057: {'lr': 9.271389682938574e-05, 'samples': 20746944, 'steps': 108056, 'loss/train': 0.6500720977783203} 11/07/2021 12:26:47 - INFO - __main__ - Step 108058: {'lr': 9.270977199382365e-05, 'samples': 20747136, 'steps': 108057, 'loss/train': 0.5100274085998535} 11/07/2021 12:26:48 - INFO - __main__ - Step 108059: {'lr': 9.270564722913413e-05, 'samples': 20747328, 'steps': 108058, 'loss/train': 1.358678936958313} 11/07/2021 12:26:48 - INFO - __main__ - Step 108060: {'lr': 9.270152253531899e-05, 'samples': 20747520, 'steps': 108059, 'loss/train': 1.572750449180603} 11/07/2021 12:26:48 - INFO - __main__ - Step 108061: {'lr': 9.269739791238013e-05, 'samples': 20747712, 'steps': 108060, 'loss/train': 1.3745838403701782} 11/07/2021 12:26:49 - INFO - __main__ - Step 108062: {'lr': 9.269327336031946e-05, 'samples': 20747904, 'steps': 108061, 'loss/train': 0.7543210387229919} 11/07/2021 12:26:49 - INFO - __main__ - Step 108063: {'lr': 9.26891488791387e-05, 'samples': 20748096, 'steps': 108062, 'loss/train': 1.1760227680206299} 11/07/2021 12:26:49 - INFO - __main__ - Step 108064: {'lr': 9.268502446883981e-05, 'samples': 20748288, 'steps': 108063, 'loss/train': 0.7775659561157227} 11/07/2021 12:26:50 - INFO - __main__ - Step 108065: {'lr': 9.26809001294246e-05, 'samples': 20748480, 'steps': 108064, 'loss/train': 1.3220247030258179} 11/07/2021 12:26:51 - INFO - __main__ - Step 108066: {'lr': 9.267677586089493e-05, 'samples': 20748672, 'steps': 108065, 'loss/train': 1.7122740745544434} 11/07/2021 12:26:51 - INFO - __main__ - Step 108067: {'lr': 9.26726516632527e-05, 'samples': 20748864, 'steps': 108066, 'loss/train': 2.0092175006866455} 11/07/2021 12:26:51 - INFO - __main__ - Step 108068: {'lr': 9.266852753649974e-05, 'samples': 20749056, 'steps': 108067, 'loss/train': 1.4847526550292969} 11/07/2021 12:26:52 - INFO - __main__ - Step 108069: {'lr': 9.26644034806379e-05, 'samples': 20749248, 'steps': 108068, 'loss/train': 1.0867317914962769} 11/07/2021 12:26:53 - INFO - __main__ - Step 108070: {'lr': 9.266027949566908e-05, 'samples': 20749440, 'steps': 108069, 'loss/train': 1.1066964864730835} 11/07/2021 12:26:53 - INFO - __main__ - Step 108071: {'lr': 9.265615558159507e-05, 'samples': 20749632, 'steps': 108070, 'loss/train': 1.679037094116211} 11/07/2021 12:26:54 - INFO - __main__ - Step 108072: {'lr': 9.26520317384178e-05, 'samples': 20749824, 'steps': 108071, 'loss/train': 1.4728683233261108} 11/07/2021 12:26:54 - INFO - __main__ - Step 108073: {'lr': 9.264790796613909e-05, 'samples': 20750016, 'steps': 108072, 'loss/train': 1.7476578950881958} 11/07/2021 12:26:54 - INFO - __main__ - Step 108074: {'lr': 9.26437842647609e-05, 'samples': 20750208, 'steps': 108073, 'loss/train': 1.5022791624069214} 11/07/2021 12:26:55 - INFO - __main__ - Step 108075: {'lr': 9.263966063428489e-05, 'samples': 20750400, 'steps': 108074, 'loss/train': 1.3286144733428955} 11/07/2021 12:26:56 - INFO - __main__ - Step 108076: {'lr': 9.2635537074713e-05, 'samples': 20750592, 'steps': 108075, 'loss/train': 1.3260152339935303} 11/07/2021 12:26:56 - INFO - __main__ - Step 108077: {'lr': 9.263141358604715e-05, 'samples': 20750784, 'steps': 108076, 'loss/train': 0.8114716410636902} 11/07/2021 12:26:56 - INFO - __main__ - Step 108078: {'lr': 9.262729016828914e-05, 'samples': 20750976, 'steps': 108077, 'loss/train': 1.2519468069076538} 11/07/2021 12:26:57 - INFO - __main__ - Step 108079: {'lr': 9.262316682144084e-05, 'samples': 20751168, 'steps': 108078, 'loss/train': 1.9518687725067139} 11/07/2021 12:26:58 - INFO - __main__ - Step 108080: {'lr': 9.261904354550413e-05, 'samples': 20751360, 'steps': 108079, 'loss/train': 1.2052185535430908} 11/07/2021 12:26:58 - INFO - __main__ - Step 108081: {'lr': 9.261492034048083e-05, 'samples': 20751552, 'steps': 108080, 'loss/train': 1.5925614833831787} 11/07/2021 12:26:58 - INFO - __main__ - Step 108082: {'lr': 9.261079720637284e-05, 'samples': 20751744, 'steps': 108081, 'loss/train': 1.4345948696136475} 11/07/2021 12:26:59 - INFO - __main__ - Step 108083: {'lr': 9.260667414318197e-05, 'samples': 20751936, 'steps': 108082, 'loss/train': 0.7581444382667542} 11/07/2021 12:26:59 - INFO - __main__ - Step 108084: {'lr': 9.260255115091013e-05, 'samples': 20752128, 'steps': 108083, 'loss/train': 1.1312731504440308} 11/07/2021 12:27:00 - INFO - __main__ - Step 108085: {'lr': 9.259842822955914e-05, 'samples': 20752320, 'steps': 108084, 'loss/train': 1.4354628324508667} 11/07/2021 12:27:01 - INFO - __main__ - Step 108086: {'lr': 9.259430537913085e-05, 'samples': 20752512, 'steps': 108085, 'loss/train': 1.7830637693405151} 11/07/2021 12:27:01 - INFO - __main__ - Step 108087: {'lr': 9.259018259962727e-05, 'samples': 20752704, 'steps': 108086, 'loss/train': 1.514787197113037} 11/07/2021 12:27:01 - INFO - __main__ - Step 108088: {'lr': 9.258605989104999e-05, 'samples': 20752896, 'steps': 108087, 'loss/train': 0.8288999795913696} 11/07/2021 12:27:02 - INFO - __main__ - Step 108089: {'lr': 9.258193725340103e-05, 'samples': 20753088, 'steps': 108088, 'loss/train': 1.1109094619750977} 11/07/2021 12:27:03 - INFO - __main__ - Step 108090: {'lr': 9.257781468668222e-05, 'samples': 20753280, 'steps': 108089, 'loss/train': 1.580065369606018} 11/07/2021 12:27:03 - INFO - __main__ - Step 108091: {'lr': 9.25736921908954e-05, 'samples': 20753472, 'steps': 108090, 'loss/train': 0.36525866389274597} 11/07/2021 12:27:03 - INFO - __main__ - Step 108092: {'lr': 9.256956976604244e-05, 'samples': 20753664, 'steps': 108091, 'loss/train': 0.9478047490119934} 11/07/2021 12:27:04 - INFO - __main__ - Step 108093: {'lr': 9.256544741212524e-05, 'samples': 20753856, 'steps': 108092, 'loss/train': 0.9595232605934143} 11/07/2021 12:27:04 - INFO - __main__ - Step 108094: {'lr': 9.256132512914558e-05, 'samples': 20754048, 'steps': 108093, 'loss/train': 1.4425517320632935} 11/07/2021 12:27:05 - INFO - __main__ - Step 108095: {'lr': 9.25572029171054e-05, 'samples': 20754240, 'steps': 108094, 'loss/train': 1.2411726713180542} 11/07/2021 12:27:05 - INFO - __main__ - Step 108096: {'lr': 9.255308077600647e-05, 'samples': 20754432, 'steps': 108095, 'loss/train': 1.1667835712432861} 11/07/2021 12:27:06 - INFO - __main__ - Step 108097: {'lr': 9.254895870585073e-05, 'samples': 20754624, 'steps': 108096, 'loss/train': 1.319838285446167} 11/07/2021 12:27:06 - INFO - __main__ - Step 108098: {'lr': 9.254483670663997e-05, 'samples': 20754816, 'steps': 108097, 'loss/train': 1.1319910287857056} 11/07/2021 12:27:07 - INFO - __main__ - Step 108099: {'lr': 9.254071477837609e-05, 'samples': 20755008, 'steps': 108098, 'loss/train': 1.2684288024902344} 11/07/2021 12:27:08 - INFO - __main__ - Step 108100: {'lr': 9.253659292106092e-05, 'samples': 20755200, 'steps': 108099, 'loss/train': 0.9539857506752014} 11/07/2021 12:27:08 - INFO - __main__ - Step 108101: {'lr': 9.253247113469646e-05, 'samples': 20755392, 'steps': 108100, 'loss/train': 1.166452169418335} 11/07/2021 12:27:08 - INFO - __main__ - Step 108102: {'lr': 9.252834941928431e-05, 'samples': 20755584, 'steps': 108101, 'loss/train': 1.3040653467178345} 11/07/2021 12:27:09 - INFO - __main__ - Step 108103: {'lr': 9.252422777482646e-05, 'samples': 20755776, 'steps': 108102, 'loss/train': 1.6282631158828735} 11/07/2021 12:27:09 - INFO - __main__ - Step 108104: {'lr': 9.252010620132478e-05, 'samples': 20755968, 'steps': 108103, 'loss/train': 1.3870511054992676} 11/07/2021 12:27:09 - INFO - __main__ - Step 108105: {'lr': 9.251598469878111e-05, 'samples': 20756160, 'steps': 108104, 'loss/train': 1.2491650581359863} 11/07/2021 12:27:10 - INFO - __main__ - Step 108106: {'lr': 9.251186326719729e-05, 'samples': 20756352, 'steps': 108105, 'loss/train': 1.5483461618423462} 11/07/2021 12:27:11 - INFO - __main__ - Step 108107: {'lr': 9.250774190657521e-05, 'samples': 20756544, 'steps': 108106, 'loss/train': 1.3878716230392456} 11/07/2021 12:27:11 - INFO - __main__ - Step 108108: {'lr': 9.25036206169167e-05, 'samples': 20756736, 'steps': 108107, 'loss/train': 1.3144384622573853} 11/07/2021 12:27:12 - INFO - __main__ - Step 108109: {'lr': 9.249949939822363e-05, 'samples': 20756928, 'steps': 108108, 'loss/train': 1.1027672290802002} 11/07/2021 12:27:12 - INFO - __main__ - Step 108110: {'lr': 9.249537825049786e-05, 'samples': 20757120, 'steps': 108109, 'loss/train': 1.335938811302185} 11/07/2021 12:27:13 - INFO - __main__ - Step 108111: {'lr': 9.249125717374124e-05, 'samples': 20757312, 'steps': 108110, 'loss/train': 0.6259243488311768} 11/07/2021 12:27:13 - INFO - __main__ - Step 108112: {'lr': 9.248713616795562e-05, 'samples': 20757504, 'steps': 108111, 'loss/train': 1.637254238128662} 11/07/2021 12:27:14 - INFO - __main__ - Step 108113: {'lr': 9.24830152331429e-05, 'samples': 20757696, 'steps': 108112, 'loss/train': 1.1720716953277588} 11/07/2021 12:27:14 - INFO - __main__ - Step 108114: {'lr': 9.247889436930495e-05, 'samples': 20757888, 'steps': 108113, 'loss/train': 0.3444804847240448} 11/07/2021 12:27:14 - INFO - __main__ - Step 108115: {'lr': 9.247477357644351e-05, 'samples': 20758080, 'steps': 108114, 'loss/train': 0.5285440683364868} 11/07/2021 12:27:15 - INFO - __main__ - Step 108116: {'lr': 9.247065285456049e-05, 'samples': 20758272, 'steps': 108115, 'loss/train': 1.1440179347991943} 11/07/2021 12:27:16 - INFO - __main__ - Step 108117: {'lr': 9.246653220365778e-05, 'samples': 20758464, 'steps': 108116, 'loss/train': 1.2350672483444214} 11/07/2021 12:27:16 - INFO - __main__ - Step 108118: {'lr': 9.246241162373722e-05, 'samples': 20758656, 'steps': 108117, 'loss/train': 1.1084558963775635} 11/07/2021 12:27:16 - INFO - __main__ - Step 108119: {'lr': 9.245829111480067e-05, 'samples': 20758848, 'steps': 108118, 'loss/train': 0.7631455659866333} 11/07/2021 12:27:17 - INFO - __main__ - Step 108120: {'lr': 9.245417067684997e-05, 'samples': 20759040, 'steps': 108119, 'loss/train': 1.133901834487915} 11/07/2021 12:27:18 - INFO - __main__ - Step 108121: {'lr': 9.245005030988699e-05, 'samples': 20759232, 'steps': 108120, 'loss/train': 1.8790230751037598} 11/07/2021 12:27:18 - INFO - __main__ - Step 108122: {'lr': 9.24459300139136e-05, 'samples': 20759424, 'steps': 108121, 'loss/train': 0.7520204782485962} 11/07/2021 12:27:19 - INFO - __main__ - Step 108123: {'lr': 9.244180978893163e-05, 'samples': 20759616, 'steps': 108122, 'loss/train': 1.375352144241333} 11/07/2021 12:27:19 - INFO - __main__ - Step 108124: {'lr': 9.243768963494295e-05, 'samples': 20759808, 'steps': 108123, 'loss/train': 1.2183716297149658} 11/07/2021 12:27:19 - INFO - __main__ - Step 108125: {'lr': 9.243356955194943e-05, 'samples': 20760000, 'steps': 108124, 'loss/train': 1.1601483821868896} 11/07/2021 12:27:20 - INFO - __main__ - Step 108126: {'lr': 9.242944953995289e-05, 'samples': 20760192, 'steps': 108125, 'loss/train': 1.0575077533721924} 11/07/2021 12:27:21 - INFO - __main__ - Step 108127: {'lr': 9.242532959895522e-05, 'samples': 20760384, 'steps': 108126, 'loss/train': 1.2330104112625122} 11/07/2021 12:27:21 - INFO - __main__ - Step 108128: {'lr': 9.242120972895835e-05, 'samples': 20760576, 'steps': 108127, 'loss/train': 1.1713624000549316} 11/07/2021 12:27:22 - INFO - __main__ - Step 108129: {'lr': 9.241708992996398e-05, 'samples': 20760768, 'steps': 108128, 'loss/train': 1.1293607950210571} 11/07/2021 12:27:22 - INFO - __main__ - Step 108130: {'lr': 9.241297020197401e-05, 'samples': 20760960, 'steps': 108129, 'loss/train': 1.3534626960754395} 11/07/2021 12:27:22 - INFO - __main__ - Step 108131: {'lr': 9.240885054499034e-05, 'samples': 20761152, 'steps': 108130, 'loss/train': 1.17191743850708} 11/07/2021 12:27:23 - INFO - __main__ - Step 108132: {'lr': 9.240473095901481e-05, 'samples': 20761344, 'steps': 108131, 'loss/train': 1.2475032806396484} 11/07/2021 12:27:24 - INFO - __main__ - Step 108133: {'lr': 9.240061144404926e-05, 'samples': 20761536, 'steps': 108132, 'loss/train': 1.3278170824050903} 11/07/2021 12:27:24 - INFO - __main__ - Step 108134: {'lr': 9.239649200009559e-05, 'samples': 20761728, 'steps': 108133, 'loss/train': 1.468125343322754} 11/07/2021 12:27:24 - INFO - __main__ - Step 108135: {'lr': 9.23923726271556e-05, 'samples': 20761920, 'steps': 108134, 'loss/train': 1.188758134841919} 11/07/2021 12:27:25 - INFO - __main__ - Step 108136: {'lr': 9.23882533252312e-05, 'samples': 20762112, 'steps': 108135, 'loss/train': 1.4092549085617065} 11/07/2021 12:27:26 - INFO - __main__ - Step 108137: {'lr': 9.238413409432423e-05, 'samples': 20762304, 'steps': 108136, 'loss/train': 1.600480318069458} 11/07/2021 12:27:26 - INFO - __main__ - Step 108138: {'lr': 9.23800149344365e-05, 'samples': 20762496, 'steps': 108137, 'loss/train': 0.15017962455749512} 11/07/2021 12:27:26 - INFO - __main__ - Step 108139: {'lr': 9.237589584556994e-05, 'samples': 20762688, 'steps': 108138, 'loss/train': 1.1226599216461182} 11/07/2021 12:27:27 - INFO - __main__ - Step 108140: {'lr': 9.237177682772635e-05, 'samples': 20762880, 'steps': 108139, 'loss/train': 1.2020398378372192} 11/07/2021 12:27:27 - INFO - __main__ - Step 108141: {'lr': 9.236765788090767e-05, 'samples': 20763072, 'steps': 108140, 'loss/train': 1.3195178508758545} 11/07/2021 12:27:28 - INFO - __main__ - Step 108142: {'lr': 9.236353900511563e-05, 'samples': 20763264, 'steps': 108141, 'loss/train': 1.4382102489471436} 11/07/2021 12:27:29 - INFO - __main__ - Step 108143: {'lr': 9.235942020035215e-05, 'samples': 20763456, 'steps': 108142, 'loss/train': 1.1492013931274414} 11/07/2021 12:27:29 - INFO - __main__ - Step 108144: {'lr': 9.235530146661908e-05, 'samples': 20763648, 'steps': 108143, 'loss/train': 1.3103694915771484} 11/07/2021 12:27:29 - INFO - __main__ - Step 108145: {'lr': 9.235118280391827e-05, 'samples': 20763840, 'steps': 108144, 'loss/train': 1.3631457090377808} 11/07/2021 12:27:30 - INFO - __main__ - Step 108146: {'lr': 9.234706421225158e-05, 'samples': 20764032, 'steps': 108145, 'loss/train': 1.2029409408569336} 11/07/2021 12:27:31 - INFO - __main__ - Step 108147: {'lr': 9.234294569162088e-05, 'samples': 20764224, 'steps': 108146, 'loss/train': 1.1005275249481201} 11/07/2021 12:27:31 - INFO - __main__ - Step 108148: {'lr': 9.233882724202802e-05, 'samples': 20764416, 'steps': 108147, 'loss/train': 1.3339382410049438} 11/07/2021 12:27:31 - INFO - __main__ - Step 108149: {'lr': 9.233470886347484e-05, 'samples': 20764608, 'steps': 108148, 'loss/train': 1.000917673110962} 11/07/2021 12:27:32 - INFO - __main__ - Step 108150: {'lr': 9.233059055596321e-05, 'samples': 20764800, 'steps': 108149, 'loss/train': 1.4677060842514038} 11/07/2021 12:27:32 - INFO - __main__ - Step 108151: {'lr': 9.232647231949501e-05, 'samples': 20764992, 'steps': 108150, 'loss/train': 1.4960663318634033} 11/07/2021 12:27:33 - INFO - __main__ - Step 108152: {'lr': 9.232235415407204e-05, 'samples': 20765184, 'steps': 108151, 'loss/train': 1.6580429077148438} 11/07/2021 12:27:33 - INFO - __main__ - Step 108153: {'lr': 9.23182360596962e-05, 'samples': 20765376, 'steps': 108152, 'loss/train': 1.1087392568588257} 11/07/2021 12:27:34 - INFO - __main__ - Step 108154: {'lr': 9.23141180363693e-05, 'samples': 20765568, 'steps': 108153, 'loss/train': 1.3501964807510376} 11/07/2021 12:27:34 - INFO - __main__ - Step 108155: {'lr': 9.231000008409332e-05, 'samples': 20765760, 'steps': 108154, 'loss/train': 1.4599676132202148} 11/07/2021 12:27:34 - INFO - __main__ - Step 108156: {'lr': 9.230588220286995e-05, 'samples': 20765952, 'steps': 108155, 'loss/train': 1.5301685333251953} 11/07/2021 12:27:36 - INFO - __main__ - Step 108157: {'lr': 9.230176439270111e-05, 'samples': 20766144, 'steps': 108156, 'loss/train': 1.1592967510223389} 11/07/2021 12:27:36 - INFO - __main__ - Step 108158: {'lr': 9.229764665358867e-05, 'samples': 20766336, 'steps': 108157, 'loss/train': 1.192828893661499} 11/07/2021 12:27:36 - INFO - __main__ - Step 108159: {'lr': 9.229352898553447e-05, 'samples': 20766528, 'steps': 108158, 'loss/train': 1.4041707515716553} 11/07/2021 12:27:37 - INFO - __main__ - Step 108160: {'lr': 9.228941138854039e-05, 'samples': 20766720, 'steps': 108159, 'loss/train': 1.4462827444076538} 11/07/2021 12:27:37 - INFO - __main__ - Step 108161: {'lr': 9.228529386260823e-05, 'samples': 20766912, 'steps': 108160, 'loss/train': 1.3806239366531372} 11/07/2021 12:27:37 - INFO - __main__ - Step 108162: {'lr': 9.228117640773989e-05, 'samples': 20767104, 'steps': 108161, 'loss/train': 1.3138242959976196} 11/07/2021 12:27:39 - INFO - __main__ - Step 108163: {'lr': 9.227705902393724e-05, 'samples': 20767296, 'steps': 108162, 'loss/train': 1.3350061178207397} 11/07/2021 12:27:39 - INFO - __main__ - Step 108164: {'lr': 9.22729417112021e-05, 'samples': 20767488, 'steps': 108163, 'loss/train': 1.149536371231079} 11/07/2021 12:27:39 - INFO - __main__ - Step 108165: {'lr': 9.226882446953636e-05, 'samples': 20767680, 'steps': 108164, 'loss/train': 1.078829050064087} 11/07/2021 12:27:40 - INFO - __main__ - Step 108166: {'lr': 9.226470729894182e-05, 'samples': 20767872, 'steps': 108165, 'loss/train': 1.1648063659667969} 11/07/2021 12:27:40 - INFO - __main__ - Step 108167: {'lr': 9.22605901994204e-05, 'samples': 20768064, 'steps': 108166, 'loss/train': 1.3509937524795532} 11/07/2021 12:27:40 - INFO - __main__ - Step 108168: {'lr': 9.225647317097399e-05, 'samples': 20768256, 'steps': 108167, 'loss/train': 1.6781216859817505} 11/07/2021 12:27:41 - INFO - __main__ - Step 108169: {'lr': 9.225235621360428e-05, 'samples': 20768448, 'steps': 108168, 'loss/train': 1.6904393434524536} 11/07/2021 12:27:42 - INFO - __main__ - Step 108170: {'lr': 9.224823932731325e-05, 'samples': 20768640, 'steps': 108169, 'loss/train': 1.2953546047210693} 11/07/2021 12:27:42 - INFO - __main__ - Step 108171: {'lr': 9.224412251210274e-05, 'samples': 20768832, 'steps': 108170, 'loss/train': 1.4906086921691895} 11/07/2021 12:27:43 - INFO - __main__ - Step 108172: {'lr': 9.224000576797456e-05, 'samples': 20769024, 'steps': 108171, 'loss/train': 1.326008915901184} 11/07/2021 12:27:43 - INFO - __main__ - Step 108173: {'lr': 9.223588909493061e-05, 'samples': 20769216, 'steps': 108172, 'loss/train': 1.490883708000183} 11/07/2021 12:27:44 - INFO - __main__ - Step 108174: {'lr': 9.223177249297274e-05, 'samples': 20769408, 'steps': 108173, 'loss/train': 1.0796812772750854} 11/07/2021 12:27:44 - INFO - __main__ - Step 108175: {'lr': 9.22276559621028e-05, 'samples': 20769600, 'steps': 108174, 'loss/train': 0.5751110315322876} 11/07/2021 12:27:45 - INFO - __main__ - Step 108176: {'lr': 9.222353950232265e-05, 'samples': 20769792, 'steps': 108175, 'loss/train': 0.9961237907409668} 11/07/2021 12:27:45 - INFO - __main__ - Step 108177: {'lr': 9.221942311363413e-05, 'samples': 20769984, 'steps': 108176, 'loss/train': 0.3156287968158722} 11/07/2021 12:27:45 - INFO - __main__ - Step 108178: {'lr': 9.221530679603909e-05, 'samples': 20770176, 'steps': 108177, 'loss/train': 1.1251113414764404} 11/07/2021 12:27:46 - INFO - __main__ - Step 108179: {'lr': 9.221119054953942e-05, 'samples': 20770368, 'steps': 108178, 'loss/train': 1.2976629734039307} 11/07/2021 12:27:47 - INFO - __main__ - Step 108180: {'lr': 9.220707437413694e-05, 'samples': 20770560, 'steps': 108179, 'loss/train': 1.2066279649734497} 11/07/2021 12:27:47 - INFO - __main__ - Step 108181: {'lr': 9.220295826983352e-05, 'samples': 20770752, 'steps': 108180, 'loss/train': 1.289471983909607} 11/07/2021 12:27:47 - INFO - __main__ - Step 108182: {'lr': 9.219884223663108e-05, 'samples': 20770944, 'steps': 108181, 'loss/train': 0.602823793888092} 11/07/2021 12:27:48 - INFO - __main__ - Step 108183: {'lr': 9.219472627453135e-05, 'samples': 20771136, 'steps': 108182, 'loss/train': 1.419625163078308} 11/07/2021 12:27:49 - INFO - __main__ - Step 108184: {'lr': 9.219061038353623e-05, 'samples': 20771328, 'steps': 108183, 'loss/train': 1.580323576927185} 11/07/2021 12:27:49 - INFO - __main__ - Step 108185: {'lr': 9.218649456364758e-05, 'samples': 20771520, 'steps': 108184, 'loss/train': 1.2726247310638428} 11/07/2021 12:27:50 - INFO - __main__ - Step 108186: {'lr': 9.218237881486727e-05, 'samples': 20771712, 'steps': 108185, 'loss/train': 1.2947279214859009} 11/07/2021 12:27:50 - INFO - __main__ - Step 108187: {'lr': 9.217826313719716e-05, 'samples': 20771904, 'steps': 108186, 'loss/train': 1.51015305519104} 11/07/2021 12:27:50 - INFO - __main__ - Step 108188: {'lr': 9.217414753063905e-05, 'samples': 20772096, 'steps': 108187, 'loss/train': 1.0017069578170776} 11/07/2021 12:27:51 - INFO - __main__ - Step 108189: {'lr': 9.217003199519486e-05, 'samples': 20772288, 'steps': 108188, 'loss/train': 1.2461737394332886} 11/07/2021 12:27:52 - INFO - __main__ - Step 108190: {'lr': 9.216591653086643e-05, 'samples': 20772480, 'steps': 108189, 'loss/train': 0.7475268244743347} 11/07/2021 12:27:52 - INFO - __main__ - Step 108191: {'lr': 9.216180113765556e-05, 'samples': 20772672, 'steps': 108190, 'loss/train': 1.2182409763336182} 11/07/2021 12:27:52 - INFO - __main__ - Step 108192: {'lr': 9.215768581556419e-05, 'samples': 20772864, 'steps': 108191, 'loss/train': 1.1940863132476807} 11/07/2021 12:27:53 - INFO - __main__ - Step 108193: {'lr': 9.215357056459412e-05, 'samples': 20773056, 'steps': 108192, 'loss/train': 1.23123300075531} 11/07/2021 12:27:54 - INFO - __main__ - Step 108194: {'lr': 9.21494553847472e-05, 'samples': 20773248, 'steps': 108193, 'loss/train': 1.6880675554275513} 11/07/2021 12:27:54 - INFO - __main__ - Step 108195: {'lr': 9.21453402760254e-05, 'samples': 20773440, 'steps': 108194, 'loss/train': 1.4223906993865967} 11/07/2021 12:27:55 - INFO - __main__ - Step 108196: {'lr': 9.214122523843035e-05, 'samples': 20773632, 'steps': 108195, 'loss/train': 0.865196168422699} 11/07/2021 12:27:55 - INFO - __main__ - Step 108197: {'lr': 9.213711027196409e-05, 'samples': 20773824, 'steps': 108196, 'loss/train': 1.2991890907287598} 11/07/2021 12:27:55 - INFO - __main__ - Step 108198: {'lr': 9.213299537662836e-05, 'samples': 20774016, 'steps': 108197, 'loss/train': 1.4886983633041382} 11/07/2021 12:27:56 - INFO - __main__ - Step 108199: {'lr': 9.21288805524251e-05, 'samples': 20774208, 'steps': 108198, 'loss/train': 1.2434762716293335} 11/07/2021 12:27:57 - INFO - __main__ - Step 108200: {'lr': 9.212476579935611e-05, 'samples': 20774400, 'steps': 108199, 'loss/train': 1.1173391342163086} 11/07/2021 12:27:57 - INFO - __main__ - Step 108201: {'lr': 9.212065111742326e-05, 'samples': 20774592, 'steps': 108200, 'loss/train': 1.4044480323791504} 11/07/2021 12:27:57 - INFO - __main__ - Step 108202: {'lr': 9.211653650662844e-05, 'samples': 20774784, 'steps': 108201, 'loss/train': 1.302696943283081} 11/07/2021 12:27:58 - INFO - __main__ - Step 108203: {'lr': 9.211242196697345e-05, 'samples': 20774976, 'steps': 108202, 'loss/train': 1.383952260017395} 11/07/2021 12:27:59 - INFO - __main__ - Step 108204: {'lr': 9.210830749846016e-05, 'samples': 20775168, 'steps': 108203, 'loss/train': 0.7863637208938599} 11/07/2021 12:27:59 - INFO - __main__ - Step 108205: {'lr': 9.210419310109044e-05, 'samples': 20775360, 'steps': 108204, 'loss/train': 1.4416477680206299} 11/07/2021 12:27:59 - INFO - __main__ - Step 108206: {'lr': 9.210007877486615e-05, 'samples': 20775552, 'steps': 108205, 'loss/train': 1.3098700046539307} 11/07/2021 12:28:00 - INFO - __main__ - Step 108207: {'lr': 9.20959645197891e-05, 'samples': 20775744, 'steps': 108206, 'loss/train': 1.2506073713302612} 11/07/2021 12:28:00 - INFO - __main__ - Step 108208: {'lr': 9.209185033586129e-05, 'samples': 20775936, 'steps': 108207, 'loss/train': 1.0473817586898804} 11/07/2021 12:28:01 - INFO - __main__ - Step 108209: {'lr': 9.208773622308433e-05, 'samples': 20776128, 'steps': 108208, 'loss/train': 1.4180842638015747} 11/07/2021 12:28:02 - INFO - __main__ - Step 108210: {'lr': 9.208362218146021e-05, 'samples': 20776320, 'steps': 108209, 'loss/train': 0.5198246240615845} 11/07/2021 12:28:02 - INFO - __main__ - Step 108211: {'lr': 9.207950821099078e-05, 'samples': 20776512, 'steps': 108210, 'loss/train': 1.350550889968872} 11/07/2021 12:28:02 - INFO - __main__ - Step 108212: {'lr': 9.207539431167792e-05, 'samples': 20776704, 'steps': 108211, 'loss/train': 0.7183363437652588} 11/07/2021 12:28:03 - INFO - __main__ - Step 108213: {'lr': 9.207128048352339e-05, 'samples': 20776896, 'steps': 108212, 'loss/train': 1.5179787874221802} 11/07/2021 12:28:03 - INFO - __main__ - Step 108214: {'lr': 9.206716672652915e-05, 'samples': 20777088, 'steps': 108213, 'loss/train': 1.823535442352295} 11/07/2021 12:28:04 - INFO - __main__ - Step 108215: {'lr': 9.206305304069699e-05, 'samples': 20777280, 'steps': 108214, 'loss/train': 2.0658836364746094} 11/07/2021 12:28:04 - INFO - __main__ - Step 108216: {'lr': 9.205893942602878e-05, 'samples': 20777472, 'steps': 108215, 'loss/train': 1.048104166984558} 11/07/2021 12:28:05 - INFO - __main__ - Step 108217: {'lr': 9.205482588252637e-05, 'samples': 20777664, 'steps': 108216, 'loss/train': 1.286679744720459} 11/07/2021 12:28:05 - INFO - __main__ - Step 108218: {'lr': 9.205071241019164e-05, 'samples': 20777856, 'steps': 108217, 'loss/train': 1.1051385402679443} 11/07/2021 12:28:05 - INFO - __main__ - Step 108219: {'lr': 9.20465990090264e-05, 'samples': 20778048, 'steps': 108218, 'loss/train': 1.1795696020126343} 11/07/2021 12:28:07 - INFO - __main__ - Step 108220: {'lr': 9.204248567903254e-05, 'samples': 20778240, 'steps': 108219, 'loss/train': 0.4670080840587616} 11/07/2021 12:28:07 - INFO - __main__ - Step 108221: {'lr': 9.203837242021187e-05, 'samples': 20778432, 'steps': 108220, 'loss/train': 1.002703070640564} 11/07/2021 12:28:07 - INFO - __main__ - Step 108222: {'lr': 9.20342592325664e-05, 'samples': 20778624, 'steps': 108221, 'loss/train': 1.229762315750122} 11/07/2021 12:28:08 - INFO - __main__ - Step 108223: {'lr': 9.203014611609772e-05, 'samples': 20778816, 'steps': 108222, 'loss/train': 1.125083088874817} 11/07/2021 12:28:08 - INFO - __main__ - Step 108224: {'lr': 9.202603307080787e-05, 'samples': 20779008, 'steps': 108223, 'loss/train': 0.9533366560935974} 11/07/2021 12:28:08 - INFO - __main__ - Step 108225: {'lr': 9.202192009669863e-05, 'samples': 20779200, 'steps': 108224, 'loss/train': 1.3994982242584229} 11/07/2021 12:28:09 - INFO - __main__ - Step 108226: {'lr': 9.201780719377186e-05, 'samples': 20779392, 'steps': 108225, 'loss/train': 1.5506278276443481} 11/07/2021 12:28:10 - INFO - __main__ - Step 108227: {'lr': 9.201369436202944e-05, 'samples': 20779584, 'steps': 108226, 'loss/train': 0.71644127368927} 11/07/2021 12:28:10 - INFO - __main__ - Step 108228: {'lr': 9.200958160147322e-05, 'samples': 20779776, 'steps': 108227, 'loss/train': 0.969578742980957} 11/07/2021 12:28:10 - INFO - __main__ - Step 108229: {'lr': 9.200546891210504e-05, 'samples': 20779968, 'steps': 108228, 'loss/train': 1.389793038368225} 11/07/2021 12:28:11 - INFO - __main__ - Step 108230: {'lr': 9.200135629392675e-05, 'samples': 20780160, 'steps': 108229, 'loss/train': 1.4429543018341064} 11/07/2021 12:28:12 - INFO - __main__ - Step 108231: {'lr': 9.199724374694021e-05, 'samples': 20780352, 'steps': 108230, 'loss/train': 1.4395835399627686} 11/07/2021 12:28:12 - INFO - __main__ - Step 108232: {'lr': 9.199313127114728e-05, 'samples': 20780544, 'steps': 108231, 'loss/train': 0.7575939893722534} 11/07/2021 12:28:12 - INFO - __main__ - Step 108233: {'lr': 9.198901886654982e-05, 'samples': 20780736, 'steps': 108232, 'loss/train': 1.2405858039855957} 11/07/2021 12:28:13 - INFO - __main__ - Step 108234: {'lr': 9.198490653314965e-05, 'samples': 20780928, 'steps': 108233, 'loss/train': 1.1837193965911865} 11/07/2021 12:28:13 - INFO - __main__ - Step 108235: {'lr': 9.198079427094872e-05, 'samples': 20781120, 'steps': 108234, 'loss/train': 1.4065747261047363} 11/07/2021 12:28:14 - INFO - __main__ - Step 108236: {'lr': 9.197668207994874e-05, 'samples': 20781312, 'steps': 108235, 'loss/train': 0.8539830446243286} 11/07/2021 12:28:14 - INFO - __main__ - Step 108237: {'lr': 9.197256996015163e-05, 'samples': 20781504, 'steps': 108236, 'loss/train': 1.404282569885254} 11/07/2021 12:28:15 - INFO - __main__ - Step 108238: {'lr': 9.196845791155923e-05, 'samples': 20781696, 'steps': 108237, 'loss/train': 0.6572585105895996} 11/07/2021 12:28:15 - INFO - __main__ - Step 108239: {'lr': 9.196434593417341e-05, 'samples': 20781888, 'steps': 108238, 'loss/train': 1.4397639036178589} 11/07/2021 12:28:16 - INFO - __main__ - Step 108240: {'lr': 9.196023402799603e-05, 'samples': 20782080, 'steps': 108239, 'loss/train': 1.6950546503067017} 11/07/2021 12:28:17 - INFO - __main__ - Step 108241: {'lr': 9.19561221930289e-05, 'samples': 20782272, 'steps': 108240, 'loss/train': 1.5264878273010254} 11/07/2021 12:28:17 - INFO - __main__ - Step 108242: {'lr': 9.195201042927393e-05, 'samples': 20782464, 'steps': 108241, 'loss/train': 1.1830729246139526} 11/07/2021 12:28:17 - INFO - __main__ - Step 108243: {'lr': 9.194789873673292e-05, 'samples': 20782656, 'steps': 108242, 'loss/train': 1.5641874074935913} 11/07/2021 12:28:18 - INFO - __main__ - Step 108244: {'lr': 9.194378711540776e-05, 'samples': 20782848, 'steps': 108243, 'loss/train': 1.0258326530456543} 11/07/2021 12:28:18 - INFO - __main__ - Step 108245: {'lr': 9.19396755653003e-05, 'samples': 20783040, 'steps': 108244, 'loss/train': 1.4572802782058716} 11/07/2021 12:28:19 - INFO - __main__ - Step 108246: {'lr': 9.193556408641238e-05, 'samples': 20783232, 'steps': 108245, 'loss/train': 1.6662629842758179} 11/07/2021 12:28:20 - INFO - __main__ - Step 108247: {'lr': 9.193145267874583e-05, 'samples': 20783424, 'steps': 108246, 'loss/train': 1.2552452087402344} 11/07/2021 12:28:20 - INFO - __main__ - Step 108248: {'lr': 9.192734134230257e-05, 'samples': 20783616, 'steps': 108247, 'loss/train': 1.147140622138977} 11/07/2021 12:28:20 - INFO - __main__ - Step 108249: {'lr': 9.192323007708448e-05, 'samples': 20783808, 'steps': 108248, 'loss/train': 1.069672703742981} 11/07/2021 12:28:21 - INFO - __main__ - Step 108250: {'lr': 9.191911888309323e-05, 'samples': 20784000, 'steps': 108249, 'loss/train': 1.6666079759597778} 11/07/2021 12:28:22 - INFO - __main__ - Step 108251: {'lr': 9.191500776033082e-05, 'samples': 20784192, 'steps': 108250, 'loss/train': 1.1067203283309937} 11/07/2021 12:28:22 - INFO - __main__ - Step 108252: {'lr': 9.191089670879907e-05, 'samples': 20784384, 'steps': 108251, 'loss/train': 1.5075114965438843} 11/07/2021 12:28:22 - INFO - __main__ - Step 108253: {'lr': 9.190678572849981e-05, 'samples': 20784576, 'steps': 108252, 'loss/train': 1.5900630950927734} 11/07/2021 12:28:23 - INFO - __main__ - Step 108254: {'lr': 9.19026748194349e-05, 'samples': 20784768, 'steps': 108253, 'loss/train': 0.4520360231399536} 11/07/2021 12:28:23 - INFO - __main__ - Step 108255: {'lr': 9.189856398160623e-05, 'samples': 20784960, 'steps': 108254, 'loss/train': 1.0440129041671753} 11/07/2021 12:28:23 - INFO - __main__ - Step 108256: {'lr': 9.189445321501563e-05, 'samples': 20785152, 'steps': 108255, 'loss/train': 1.6583421230316162} 11/07/2021 12:28:24 - INFO - __main__ - Step 108257: {'lr': 9.189034251966494e-05, 'samples': 20785344, 'steps': 108256, 'loss/train': 1.8096842765808105} 11/07/2021 12:28:25 - INFO - __main__ - Step 108258: {'lr': 9.188623189555603e-05, 'samples': 20785536, 'steps': 108257, 'loss/train': 1.4172455072402954} 11/07/2021 12:28:25 - INFO - __main__ - Step 108259: {'lr': 9.188212134269075e-05, 'samples': 20785728, 'steps': 108258, 'loss/train': 1.9334089756011963} 11/07/2021 12:28:25 - INFO - __main__ - Step 108260: {'lr': 9.187801086107092e-05, 'samples': 20785920, 'steps': 108259, 'loss/train': 1.0233792066574097} 11/07/2021 12:28:26 - INFO - __main__ - Step 108261: {'lr': 9.187390045069844e-05, 'samples': 20786112, 'steps': 108260, 'loss/train': 1.4027591943740845} 11/07/2021 12:28:27 - INFO - __main__ - Step 108262: {'lr': 9.18697901115752e-05, 'samples': 20786304, 'steps': 108261, 'loss/train': 1.2497514486312866} 11/07/2021 12:28:27 - INFO - __main__ - Step 108263: {'lr': 9.186567984370294e-05, 'samples': 20786496, 'steps': 108262, 'loss/train': 1.430876612663269} 11/07/2021 12:28:28 - INFO - __main__ - Step 108264: {'lr': 9.186156964708357e-05, 'samples': 20786688, 'steps': 108263, 'loss/train': 1.4166185855865479} 11/07/2021 12:28:28 - INFO - __main__ - Step 108265: {'lr': 9.185745952171889e-05, 'samples': 20786880, 'steps': 108264, 'loss/train': 1.096161127090454} 11/07/2021 12:28:28 - INFO - __main__ - Step 108266: {'lr': 9.185334946761084e-05, 'samples': 20787072, 'steps': 108265, 'loss/train': 1.2077487707138062} 11/07/2021 12:28:29 - INFO - __main__ - Step 108267: {'lr': 9.18492394847612e-05, 'samples': 20787264, 'steps': 108266, 'loss/train': 0.6936427354812622} 11/07/2021 12:28:30 - INFO - __main__ - Step 108268: {'lr': 9.184512957317187e-05, 'samples': 20787456, 'steps': 108267, 'loss/train': 1.1788220405578613} 11/07/2021 12:28:30 - INFO - __main__ - Step 108269: {'lr': 9.18410197328447e-05, 'samples': 20787648, 'steps': 108268, 'loss/train': 1.1262788772583008} 11/07/2021 12:28:30 - INFO - __main__ - Step 108270: {'lr': 9.18369099637815e-05, 'samples': 20787840, 'steps': 108269, 'loss/train': 1.4834996461868286} 11/07/2021 12:28:31 - INFO - __main__ - Step 108271: {'lr': 9.183280026598415e-05, 'samples': 20788032, 'steps': 108270, 'loss/train': 1.5831142663955688} 11/07/2021 12:28:32 - INFO - __main__ - Step 108272: {'lr': 9.182869063945451e-05, 'samples': 20788224, 'steps': 108271, 'loss/train': 4.396462917327881} 11/07/2021 12:28:32 - INFO - __main__ - Step 108273: {'lr': 9.182458108419441e-05, 'samples': 20788416, 'steps': 108272, 'loss/train': 0.9845770001411438} 11/07/2021 12:28:33 - INFO - __main__ - Step 108274: {'lr': 9.182047160020573e-05, 'samples': 20788608, 'steps': 108273, 'loss/train': 1.2657216787338257} 11/07/2021 12:28:33 - INFO - __main__ - Step 108275: {'lr': 9.181636218749029e-05, 'samples': 20788800, 'steps': 108274, 'loss/train': 1.267283320426941} 11/07/2021 12:28:33 - INFO - __main__ - Step 108276: {'lr': 9.181225284605005e-05, 'samples': 20788992, 'steps': 108275, 'loss/train': 1.2058240175247192} 11/07/2021 12:28:34 - INFO - __main__ - Step 108277: {'lr': 9.180814357588668e-05, 'samples': 20789184, 'steps': 108276, 'loss/train': 1.3626583814620972} 11/07/2021 12:28:35 - INFO - __main__ - Step 108278: {'lr': 9.18040343770021e-05, 'samples': 20789376, 'steps': 108277, 'loss/train': 1.133974552154541} 11/07/2021 12:28:35 - INFO - __main__ - Step 108279: {'lr': 9.179992524939821e-05, 'samples': 20789568, 'steps': 108278, 'loss/train': 1.6728696823120117} 11/07/2021 12:28:35 - INFO - __main__ - Step 108280: {'lr': 9.179581619307684e-05, 'samples': 20789760, 'steps': 108279, 'loss/train': 1.5909463167190552} 11/07/2021 12:28:36 - INFO - __main__ - Step 108281: {'lr': 9.179170720803981e-05, 'samples': 20789952, 'steps': 108280, 'loss/train': 1.4170501232147217} 11/07/2021 12:28:36 - INFO - __main__ - Step 108282: {'lr': 9.178759829428898e-05, 'samples': 20790144, 'steps': 108281, 'loss/train': 0.6014296412467957} 11/07/2021 12:28:37 - INFO - __main__ - Step 108283: {'lr': 9.178348945182624e-05, 'samples': 20790336, 'steps': 108282, 'loss/train': 0.8887532353401184} 11/07/2021 12:28:37 - INFO - __main__ - Step 108284: {'lr': 9.177938068065341e-05, 'samples': 20790528, 'steps': 108283, 'loss/train': 1.4203953742980957} 11/07/2021 12:28:38 - INFO - __main__ - Step 108285: {'lr': 9.177527198077237e-05, 'samples': 20790720, 'steps': 108284, 'loss/train': 0.8137930035591125} 11/07/2021 12:28:38 - INFO - __main__ - Step 108286: {'lr': 9.177116335218494e-05, 'samples': 20790912, 'steps': 108285, 'loss/train': 1.3270009756088257} 11/07/2021 12:28:39 - INFO - __main__ - Step 108287: {'lr': 9.176705479489298e-05, 'samples': 20791104, 'steps': 108286, 'loss/train': 1.2596324682235718} 11/07/2021 12:28:40 - INFO - __main__ - Step 108288: {'lr': 9.176294630889842e-05, 'samples': 20791296, 'steps': 108287, 'loss/train': 1.3468936681747437} 11/07/2021 12:28:40 - INFO - __main__ - Step 108289: {'lr': 9.175883789420294e-05, 'samples': 20791488, 'steps': 108288, 'loss/train': 1.1714110374450684} 11/07/2021 12:28:40 - INFO - __main__ - Step 108290: {'lr': 9.175472955080852e-05, 'samples': 20791680, 'steps': 108289, 'loss/train': 1.2252931594848633} 11/07/2021 12:28:41 - INFO - __main__ - Step 108291: {'lr': 9.175062127871697e-05, 'samples': 20791872, 'steps': 108290, 'loss/train': 1.4589464664459229} 11/07/2021 12:28:41 - INFO - __main__ - Step 108292: {'lr': 9.174651307793014e-05, 'samples': 20792064, 'steps': 108291, 'loss/train': 0.7285060882568359} 11/07/2021 12:28:41 - INFO - __main__ - Step 108293: {'lr': 9.174240494844987e-05, 'samples': 20792256, 'steps': 108292, 'loss/train': 1.5683298110961914} 11/07/2021 12:28:42 - INFO - __main__ - Step 108294: {'lr': 9.173829689027805e-05, 'samples': 20792448, 'steps': 108293, 'loss/train': 0.9385831356048584} 11/07/2021 12:28:43 - INFO - __main__ - Step 108295: {'lr': 9.173418890341651e-05, 'samples': 20792640, 'steps': 108294, 'loss/train': 1.349226713180542} 11/07/2021 12:28:43 - INFO - __main__ - Step 108296: {'lr': 9.173008098786712e-05, 'samples': 20792832, 'steps': 108295, 'loss/train': 1.3215293884277344} 11/07/2021 12:28:43 - INFO - __main__ - Step 108297: {'lr': 9.172597314363168e-05, 'samples': 20793024, 'steps': 108296, 'loss/train': 0.9868611693382263} 11/07/2021 12:28:44 - INFO - __main__ - Step 108298: {'lr': 9.172186537071217e-05, 'samples': 20793216, 'steps': 108297, 'loss/train': 1.312996745109558} 11/07/2021 12:28:45 - INFO - __main__ - Step 108299: {'lr': 9.171775766911025e-05, 'samples': 20793408, 'steps': 108298, 'loss/train': 0.91105055809021} 11/07/2021 12:28:45 - INFO - __main__ - Step 108300: {'lr': 9.17136500388279e-05, 'samples': 20793600, 'steps': 108299, 'loss/train': 1.1441842317581177} 11/07/2021 12:28:46 - INFO - __main__ - Step 108301: {'lr': 9.170954247986691e-05, 'samples': 20793792, 'steps': 108300, 'loss/train': 1.135537028312683} 11/07/2021 12:28:46 - INFO - __main__ - Step 108302: {'lr': 9.170543499222917e-05, 'samples': 20793984, 'steps': 108301, 'loss/train': 1.2464579343795776} 11/07/2021 12:28:46 - INFO - __main__ - Step 108303: {'lr': 9.170132757591651e-05, 'samples': 20794176, 'steps': 108302, 'loss/train': 1.1710585355758667} 11/07/2021 12:28:47 - INFO - __main__ - Step 108304: {'lr': 9.169722023093077e-05, 'samples': 20794368, 'steps': 108303, 'loss/train': 1.4751845598220825} 11/07/2021 12:28:48 - INFO - __main__ - Step 108305: {'lr': 9.169311295727387e-05, 'samples': 20794560, 'steps': 108304, 'loss/train': 1.1692986488342285} 11/07/2021 12:28:48 - INFO - __main__ - Step 108306: {'lr': 9.168900575494757e-05, 'samples': 20794752, 'steps': 108305, 'loss/train': 1.3882427215576172} 11/07/2021 12:28:48 - INFO - __main__ - Step 108307: {'lr': 9.168489862395377e-05, 'samples': 20794944, 'steps': 108306, 'loss/train': 1.5743979215621948} 11/07/2021 12:28:49 - INFO - __main__ - Step 108308: {'lr': 9.168079156429433e-05, 'samples': 20795136, 'steps': 108307, 'loss/train': 1.4061176776885986} 11/07/2021 12:28:50 - INFO - __main__ - Step 108309: {'lr': 9.167668457597114e-05, 'samples': 20795328, 'steps': 108308, 'loss/train': 1.4122390747070312} 11/07/2021 12:28:50 - INFO - __main__ - Step 108310: {'lr': 9.16725776589859e-05, 'samples': 20795520, 'steps': 108309, 'loss/train': 1.3235479593276978} 11/07/2021 12:28:51 - INFO - __main__ - Step 108311: {'lr': 9.166847081334059e-05, 'samples': 20795712, 'steps': 108310, 'loss/train': 1.2790066003799438} 11/07/2021 12:28:51 - INFO - __main__ - Step 108312: {'lr': 9.1664364039037e-05, 'samples': 20795904, 'steps': 108311, 'loss/train': 1.4066447019577026} 11/07/2021 12:28:51 - INFO - __main__ - Step 108313: {'lr': 9.166025733607702e-05, 'samples': 20796096, 'steps': 108312, 'loss/train': 1.1875866651535034} 11/07/2021 12:28:52 - INFO - __main__ - Step 108314: {'lr': 9.165615070446248e-05, 'samples': 20796288, 'steps': 108313, 'loss/train': 0.9770153164863586} 11/07/2021 12:28:53 - INFO - __main__ - Step 108315: {'lr': 9.165204414419523e-05, 'samples': 20796480, 'steps': 108314, 'loss/train': 1.0813055038452148} 11/07/2021 12:28:53 - INFO - __main__ - Step 108316: {'lr': 9.164793765527712e-05, 'samples': 20796672, 'steps': 108315, 'loss/train': 1.3271530866622925} 11/07/2021 12:28:53 - INFO - __main__ - Step 108317: {'lr': 9.164383123771e-05, 'samples': 20796864, 'steps': 108316, 'loss/train': 0.23279428482055664} 11/07/2021 12:28:54 - INFO - __main__ - Step 108318: {'lr': 9.163972489149574e-05, 'samples': 20797056, 'steps': 108317, 'loss/train': 1.3340579271316528} 11/07/2021 12:28:54 - INFO - __main__ - Step 108319: {'lr': 9.163561861663619e-05, 'samples': 20797248, 'steps': 108318, 'loss/train': 1.2574541568756104} 11/07/2021 12:28:55 - INFO - __main__ - Step 108320: {'lr': 9.163151241313325e-05, 'samples': 20797440, 'steps': 108319, 'loss/train': 1.2988494634628296} 11/07/2021 12:28:55 - INFO - __main__ - Step 108321: {'lr': 9.162740628098862e-05, 'samples': 20797632, 'steps': 108320, 'loss/train': 1.565801978111267} 11/07/2021 12:28:56 - INFO - __main__ - Step 108322: {'lr': 9.162330022020423e-05, 'samples': 20797824, 'steps': 108321, 'loss/train': 2.034095287322998} 11/07/2021 12:28:56 - INFO - __main__ - Step 108323: {'lr': 9.161919423078196e-05, 'samples': 20798016, 'steps': 108322, 'loss/train': 1.5095840692520142} 11/07/2021 12:28:57 - INFO - __main__ - Step 108324: {'lr': 9.161508831272364e-05, 'samples': 20798208, 'steps': 108323, 'loss/train': 1.3548686504364014} 11/07/2021 12:28:58 - INFO - __main__ - Step 108325: {'lr': 9.161098246603111e-05, 'samples': 20798400, 'steps': 108324, 'loss/train': 1.2075676918029785} 11/07/2021 12:28:58 - INFO - __main__ - Step 108326: {'lr': 9.160687669070623e-05, 'samples': 20798592, 'steps': 108325, 'loss/train': 1.3582026958465576} 11/07/2021 12:28:58 - INFO - __main__ - Step 108327: {'lr': 9.160277098675082e-05, 'samples': 20798784, 'steps': 108326, 'loss/train': 1.7143102884292603} 11/07/2021 12:28:59 - INFO - __main__ - Step 108328: {'lr': 9.159866535416678e-05, 'samples': 20798976, 'steps': 108327, 'loss/train': 1.61554753780365} 11/07/2021 12:28:59 - INFO - __main__ - Step 108329: {'lr': 9.159455979295594e-05, 'samples': 20799168, 'steps': 108328, 'loss/train': 1.5237021446228027} 11/07/2021 12:29:00 - INFO - __main__ - Step 108330: {'lr': 9.159045430312013e-05, 'samples': 20799360, 'steps': 108329, 'loss/train': 1.1910183429718018} 11/07/2021 12:29:00 - INFO - __main__ - Step 108331: {'lr': 9.158634888466133e-05, 'samples': 20799552, 'steps': 108330, 'loss/train': 1.209729790687561} 11/07/2021 12:29:01 - INFO - __main__ - Step 108332: {'lr': 9.158224353758115e-05, 'samples': 20799744, 'steps': 108331, 'loss/train': 1.3211573362350464} 11/07/2021 12:29:01 - INFO - __main__ - Step 108333: {'lr': 9.157813826188161e-05, 'samples': 20799936, 'steps': 108332, 'loss/train': 1.3706340789794922} 11/07/2021 12:29:01 - INFO - __main__ - Step 108334: {'lr': 9.157403305756451e-05, 'samples': 20800128, 'steps': 108333, 'loss/train': 2.104912757873535} 11/07/2021 12:29:02 - INFO - __main__ - Step 108335: {'lr': 9.156992792463167e-05, 'samples': 20800320, 'steps': 108334, 'loss/train': 1.3150355815887451} 11/07/2021 12:29:03 - INFO - __main__ - Step 108336: {'lr': 9.156582286308501e-05, 'samples': 20800512, 'steps': 108335, 'loss/train': 1.5423146486282349} 11/07/2021 12:29:03 - INFO - __main__ - Step 108337: {'lr': 9.156171787292633e-05, 'samples': 20800704, 'steps': 108336, 'loss/train': 0.9274486303329468} 11/07/2021 12:29:03 - INFO - __main__ - Step 108338: {'lr': 9.155761295415751e-05, 'samples': 20800896, 'steps': 108337, 'loss/train': 1.101447343826294} 11/07/2021 12:29:04 - INFO - __main__ - Step 108339: {'lr': 9.155350810678037e-05, 'samples': 20801088, 'steps': 108338, 'loss/train': 0.9602704644203186} 11/07/2021 12:29:05 - INFO - __main__ - Step 108340: {'lr': 9.154940333079678e-05, 'samples': 20801280, 'steps': 108339, 'loss/train': 1.3501964807510376} 11/07/2021 12:29:05 - INFO - __main__ - Step 108341: {'lr': 9.154529862620858e-05, 'samples': 20801472, 'steps': 108340, 'loss/train': 1.5760631561279297} 11/07/2021 12:29:06 - INFO - __main__ - Step 108342: {'lr': 9.154119399301764e-05, 'samples': 20801664, 'steps': 108341, 'loss/train': 1.6005423069000244} 11/07/2021 12:29:06 - INFO - __main__ - Step 108343: {'lr': 9.153708943122585e-05, 'samples': 20801856, 'steps': 108342, 'loss/train': 1.3057913780212402} 11/07/2021 12:29:06 - INFO - __main__ - Step 108344: {'lr': 9.153298494083492e-05, 'samples': 20802048, 'steps': 108343, 'loss/train': 1.5352308750152588} 11/07/2021 12:29:07 - INFO - __main__ - Step 108345: {'lr': 9.15288805218468e-05, 'samples': 20802240, 'steps': 108344, 'loss/train': 1.6994794607162476} 11/07/2021 12:29:08 - INFO - __main__ - Step 108346: {'lr': 9.152477617426333e-05, 'samples': 20802432, 'steps': 108345, 'loss/train': 1.2781322002410889} 11/07/2021 12:29:08 - INFO - __main__ - Step 108347: {'lr': 9.152067189808633e-05, 'samples': 20802624, 'steps': 108346, 'loss/train': 1.4183173179626465} 11/07/2021 12:29:08 - INFO - __main__ - Step 108348: {'lr': 9.151656769331767e-05, 'samples': 20802816, 'steps': 108347, 'loss/train': 1.358349323272705} 11/07/2021 12:29:09 - INFO - __main__ - Step 108349: {'lr': 9.15124635599592e-05, 'samples': 20803008, 'steps': 108348, 'loss/train': 1.2535569667816162} 11/07/2021 12:29:09 - INFO - __main__ - Step 108350: {'lr': 9.150835949801278e-05, 'samples': 20803200, 'steps': 108349, 'loss/train': 1.4126604795455933} 11/07/2021 12:29:10 - INFO - __main__ - Step 108351: {'lr': 9.150425550748023e-05, 'samples': 20803392, 'steps': 108350, 'loss/train': 1.4990029335021973} 11/07/2021 12:29:10 - INFO - __main__ - Step 108352: {'lr': 9.150015158836345e-05, 'samples': 20803584, 'steps': 108351, 'loss/train': 1.450385332107544} 11/07/2021 12:29:11 - INFO - __main__ - Step 108353: {'lr': 9.149604774066422e-05, 'samples': 20803776, 'steps': 108352, 'loss/train': 1.276322364807129} 11/07/2021 12:29:11 - INFO - __main__ - Step 108354: {'lr': 9.149194396438442e-05, 'samples': 20803968, 'steps': 108353, 'loss/train': 0.9898984432220459} 11/07/2021 12:29:12 - INFO - __main__ - Step 108355: {'lr': 9.148784025952594e-05, 'samples': 20804160, 'steps': 108354, 'loss/train': 1.5353307723999023} 11/07/2021 12:29:13 - INFO - __main__ - Step 108356: {'lr': 9.148373662609067e-05, 'samples': 20804352, 'steps': 108355, 'loss/train': 1.656529188156128} 11/07/2021 12:29:13 - INFO - __main__ - Step 108357: {'lr': 9.147963306408028e-05, 'samples': 20804544, 'steps': 108356, 'loss/train': 1.024702548980713} 11/07/2021 12:29:13 - INFO - __main__ - Step 108358: {'lr': 9.147552957349672e-05, 'samples': 20804736, 'steps': 108357, 'loss/train': 1.3574888706207275} 11/07/2021 12:29:14 - INFO - __main__ - Step 108359: {'lr': 9.147142615434184e-05, 'samples': 20804928, 'steps': 108358, 'loss/train': 1.4982231855392456} 11/07/2021 12:29:14 - INFO - __main__ - Step 108360: {'lr': 9.146732280661749e-05, 'samples': 20805120, 'steps': 108359, 'loss/train': 0.48312026262283325} 11/07/2021 12:29:15 - INFO - __main__ - Step 108361: {'lr': 9.146321953032555e-05, 'samples': 20805312, 'steps': 108360, 'loss/train': 1.4300718307495117} 11/07/2021 12:29:15 - INFO - __main__ - Step 108362: {'lr': 9.14591163254678e-05, 'samples': 20805504, 'steps': 108361, 'loss/train': 1.318000316619873} 11/07/2021 12:29:16 - INFO - __main__ - Step 108363: {'lr': 9.145501319204613e-05, 'samples': 20805696, 'steps': 108362, 'loss/train': 1.2846620082855225} 11/07/2021 12:29:16 - INFO - __main__ - Step 108364: {'lr': 9.14509101300624e-05, 'samples': 20805888, 'steps': 108363, 'loss/train': 1.1181578636169434} 11/07/2021 12:29:16 - INFO - __main__ - Step 108365: {'lr': 9.144680713951845e-05, 'samples': 20806080, 'steps': 108364, 'loss/train': 1.588564157485962} 11/07/2021 12:29:17 - INFO - __main__ - Step 108366: {'lr': 9.14427042204161e-05, 'samples': 20806272, 'steps': 108365, 'loss/train': 1.1155333518981934} 11/07/2021 12:29:18 - INFO - __main__ - Step 108367: {'lr': 9.143860137275723e-05, 'samples': 20806464, 'steps': 108366, 'loss/train': 1.4449472427368164} 11/07/2021 12:29:18 - INFO - __main__ - Step 108368: {'lr': 9.143449859654366e-05, 'samples': 20806656, 'steps': 108367, 'loss/train': 1.4873734712600708} 11/07/2021 12:29:18 - INFO - __main__ - Step 108369: {'lr': 9.143039589177728e-05, 'samples': 20806848, 'steps': 108368, 'loss/train': 1.280625343322754} 11/07/2021 12:29:19 - INFO - __main__ - Step 108370: {'lr': 9.142629325846e-05, 'samples': 20807040, 'steps': 108369, 'loss/train': 0.5271053314208984} 11/07/2021 12:29:20 - INFO - __main__ - Step 108371: {'lr': 9.142219069659349e-05, 'samples': 20807232, 'steps': 108370, 'loss/train': 1.6530168056488037} 11/07/2021 12:29:20 - INFO - __main__ - Step 108372: {'lr': 9.141808820617972e-05, 'samples': 20807424, 'steps': 108371, 'loss/train': 1.4701478481292725} 11/07/2021 12:29:21 - INFO - __main__ - Step 108373: {'lr': 9.141398578722049e-05, 'samples': 20807616, 'steps': 108372, 'loss/train': 1.7735778093338013} 11/07/2021 12:29:21 - INFO - __main__ - Step 108374: {'lr': 9.140988343971768e-05, 'samples': 20807808, 'steps': 108373, 'loss/train': 1.3145076036453247} 11/07/2021 12:29:21 - INFO - __main__ - Step 108375: {'lr': 9.140578116367312e-05, 'samples': 20808000, 'steps': 108374, 'loss/train': 0.9450241327285767} 11/07/2021 12:29:22 - INFO - __main__ - Step 108376: {'lr': 9.140167895908866e-05, 'samples': 20808192, 'steps': 108375, 'loss/train': 1.4190229177474976} 11/07/2021 12:29:23 - INFO - __main__ - Step 108377: {'lr': 9.139757682596616e-05, 'samples': 20808384, 'steps': 108376, 'loss/train': 1.1661028861999512} 11/07/2021 12:29:23 - INFO - __main__ - Step 108378: {'lr': 9.139347476430748e-05, 'samples': 20808576, 'steps': 108377, 'loss/train': 0.6937206387519836} 11/07/2021 12:29:23 - INFO - __main__ - Step 108379: {'lr': 9.138937277411446e-05, 'samples': 20808768, 'steps': 108378, 'loss/train': 1.4907102584838867} 11/07/2021 12:29:24 - INFO - __main__ - Step 108380: {'lr': 9.138527085538892e-05, 'samples': 20808960, 'steps': 108379, 'loss/train': 1.2109081745147705} 11/07/2021 12:29:24 - INFO - __main__ - Step 108381: {'lr': 9.138116900813274e-05, 'samples': 20809152, 'steps': 108380, 'loss/train': 0.5287148952484131} 11/07/2021 12:29:25 - INFO - __main__ - Step 108382: {'lr': 9.137706723234776e-05, 'samples': 20809344, 'steps': 108381, 'loss/train': 2.0409629344940186} 11/07/2021 12:29:26 - INFO - __main__ - Step 108383: {'lr': 9.137296552803589e-05, 'samples': 20809536, 'steps': 108382, 'loss/train': 0.10278447717428207} 11/07/2021 12:29:26 - INFO - __main__ - Step 108384: {'lr': 9.136886389519885e-05, 'samples': 20809728, 'steps': 108383, 'loss/train': 1.511156439781189} 11/07/2021 12:29:27 - INFO - __main__ - Step 108385: {'lr': 9.136476233383853e-05, 'samples': 20809920, 'steps': 108384, 'loss/train': 1.5282493829727173} 11/07/2021 12:29:27 - INFO - __main__ - Step 108386: {'lr': 9.136066084395683e-05, 'samples': 20810112, 'steps': 108385, 'loss/train': 1.4406393766403198} 11/07/2021 12:29:28 - INFO - __main__ - Step 108387: {'lr': 9.135655942555555e-05, 'samples': 20810304, 'steps': 108386, 'loss/train': 0.6125751733779907} 11/07/2021 12:29:28 - INFO - __main__ - Step 108388: {'lr': 9.135245807863658e-05, 'samples': 20810496, 'steps': 108387, 'loss/train': 1.333690881729126} 11/07/2021 12:29:29 - INFO - __main__ - Step 108389: {'lr': 9.134835680320172e-05, 'samples': 20810688, 'steps': 108388, 'loss/train': 1.3626940250396729} 11/07/2021 12:29:29 - INFO - __main__ - Step 108390: {'lr': 9.134425559925283e-05, 'samples': 20810880, 'steps': 108389, 'loss/train': 1.3988438844680786} 11/07/2021 12:29:29 - INFO - __main__ - Step 108391: {'lr': 9.134015446679178e-05, 'samples': 20811072, 'steps': 108390, 'loss/train': 1.0055797100067139} 11/07/2021 12:29:30 - INFO - __main__ - Step 108392: {'lr': 9.133605340582044e-05, 'samples': 20811264, 'steps': 108391, 'loss/train': 1.3760981559753418} 11/07/2021 12:29:31 - INFO - __main__ - Step 108393: {'lr': 9.13319524163406e-05, 'samples': 20811456, 'steps': 108392, 'loss/train': 1.404174566268921} 11/07/2021 12:29:31 - INFO - __main__ - Step 108394: {'lr': 9.132785149835413e-05, 'samples': 20811648, 'steps': 108393, 'loss/train': 1.0102992057800293} 11/07/2021 12:29:31 - INFO - __main__ - Step 108395: {'lr': 9.132375065186289e-05, 'samples': 20811840, 'steps': 108394, 'loss/train': 1.52694571018219} 11/07/2021 12:29:32 - INFO - __main__ - Step 108396: {'lr': 9.131964987686872e-05, 'samples': 20812032, 'steps': 108395, 'loss/train': 1.3893141746520996} 11/07/2021 12:29:32 - INFO - __main__ - Step 108397: {'lr': 9.131554917337354e-05, 'samples': 20812224, 'steps': 108396, 'loss/train': 0.6831670999526978} 11/07/2021 12:29:33 - INFO - __main__ - Step 108398: {'lr': 9.131144854137904e-05, 'samples': 20812416, 'steps': 108397, 'loss/train': 1.3357853889465332} 11/07/2021 12:29:34 - INFO - __main__ - Step 108399: {'lr': 9.130734798088716e-05, 'samples': 20812608, 'steps': 108398, 'loss/train': 1.1836215257644653} 11/07/2021 12:29:34 - INFO - __main__ - Step 108400: {'lr': 9.130324749189975e-05, 'samples': 20812800, 'steps': 108399, 'loss/train': 1.0880963802337646} 11/07/2021 12:29:34 - INFO - __main__ - Step 108401: {'lr': 9.129914707441864e-05, 'samples': 20812992, 'steps': 108400, 'loss/train': 1.3941961526870728} 11/07/2021 12:29:35 - INFO - __main__ - Step 108402: {'lr': 9.129504672844568e-05, 'samples': 20813184, 'steps': 108401, 'loss/train': 1.3487697839736938} 11/07/2021 12:29:36 - INFO - __main__ - Step 108403: {'lr': 9.129094645398272e-05, 'samples': 20813376, 'steps': 108402, 'loss/train': 1.2502557039260864} 11/07/2021 12:29:36 - INFO - __main__ - Step 108404: {'lr': 9.128684625103162e-05, 'samples': 20813568, 'steps': 108403, 'loss/train': 1.4534025192260742} 11/07/2021 12:29:36 - INFO - __main__ - Step 108405: {'lr': 9.128274611959422e-05, 'samples': 20813760, 'steps': 108404, 'loss/train': 1.8849431276321411} 11/07/2021 12:29:37 - INFO - __main__ - Step 108406: {'lr': 9.127864605967237e-05, 'samples': 20813952, 'steps': 108405, 'loss/train': 0.8015621900558472} 11/07/2021 12:29:37 - INFO - __main__ - Step 108407: {'lr': 9.12745460712679e-05, 'samples': 20814144, 'steps': 108406, 'loss/train': 1.5493980646133423} 11/07/2021 12:29:38 - INFO - __main__ - Step 108408: {'lr': 9.127044615438268e-05, 'samples': 20814336, 'steps': 108407, 'loss/train': 0.6901655793190002} 11/07/2021 12:29:39 - INFO - __main__ - Step 108409: {'lr': 9.126634630901853e-05, 'samples': 20814528, 'steps': 108408, 'loss/train': 1.3400989770889282} 11/07/2021 12:29:39 - INFO - __main__ - Step 108410: {'lr': 9.126224653517743e-05, 'samples': 20814720, 'steps': 108409, 'loss/train': 1.6122784614562988} 11/07/2021 12:29:39 - INFO - __main__ - Step 108411: {'lr': 9.125814683286099e-05, 'samples': 20814912, 'steps': 108410, 'loss/train': 1.8079371452331543} 11/07/2021 12:29:40 - INFO - __main__ - Step 108412: {'lr': 9.12540472020712e-05, 'samples': 20815104, 'steps': 108411, 'loss/train': 1.410862922668457} 11/07/2021 12:29:41 - INFO - __main__ - Step 108413: {'lr': 9.124994764280989e-05, 'samples': 20815296, 'steps': 108412, 'loss/train': 1.2164784669876099} 11/07/2021 12:29:41 - INFO - __main__ - Step 108414: {'lr': 9.124584815507888e-05, 'samples': 20815488, 'steps': 108413, 'loss/train': 1.5544235706329346} 11/07/2021 12:29:41 - INFO - __main__ - Step 108415: {'lr': 9.124174873888008e-05, 'samples': 20815680, 'steps': 108414, 'loss/train': 1.6110172271728516} 11/07/2021 12:29:42 - INFO - __main__ - Step 108416: {'lr': 9.123764939421528e-05, 'samples': 20815872, 'steps': 108415, 'loss/train': 1.1199620962142944} 11/07/2021 12:29:42 - INFO - __main__ - Step 108417: {'lr': 9.123355012108634e-05, 'samples': 20816064, 'steps': 108416, 'loss/train': 1.2006244659423828} 11/07/2021 12:29:43 - INFO - __main__ - Step 108418: {'lr': 9.12294509194951e-05, 'samples': 20816256, 'steps': 108417, 'loss/train': 1.3171305656433105} 11/07/2021 12:29:43 - INFO - __main__ - Step 108419: {'lr': 9.122535178944346e-05, 'samples': 20816448, 'steps': 108418, 'loss/train': 1.4085135459899902} 11/07/2021 12:29:44 - INFO - __main__ - Step 108420: {'lr': 9.122125273093321e-05, 'samples': 20816640, 'steps': 108419, 'loss/train': 1.6152747869491577} 11/07/2021 12:29:44 - INFO - __main__ - Step 108421: {'lr': 9.12171537439662e-05, 'samples': 20816832, 'steps': 108420, 'loss/train': 1.5073963403701782} 11/07/2021 12:29:44 - INFO - __main__ - Step 108422: {'lr': 9.121305482854427e-05, 'samples': 20817024, 'steps': 108421, 'loss/train': 0.9011577367782593} 11/07/2021 12:29:45 - INFO - __main__ - Step 108423: {'lr': 9.120895598466933e-05, 'samples': 20817216, 'steps': 108422, 'loss/train': 2.061945915222168} 11/07/2021 12:29:47 - INFO - __main__ - Step 108424: {'lr': 9.120485721234325e-05, 'samples': 20817408, 'steps': 108423, 'loss/train': 1.6316641569137573} 11/07/2021 12:29:47 - INFO - __main__ - Step 108425: {'lr': 9.120075851156773e-05, 'samples': 20817600, 'steps': 108424, 'loss/train': 1.7026100158691406} 11/07/2021 12:29:48 - INFO - __main__ - Step 108426: {'lr': 9.119665988234472e-05, 'samples': 20817792, 'steps': 108425, 'loss/train': 1.7395981550216675} 11/07/2021 12:29:48 - INFO - __main__ - Step 108427: {'lr': 9.119256132467602e-05, 'samples': 20817984, 'steps': 108426, 'loss/train': 1.7395579814910889} 11/07/2021 12:29:48 - INFO - __main__ - Step 108428: {'lr': 9.118846283856349e-05, 'samples': 20818176, 'steps': 108427, 'loss/train': 1.7436033487319946} 11/07/2021 12:29:49 - INFO - __main__ - Step 108429: {'lr': 9.118436442400898e-05, 'samples': 20818368, 'steps': 108428, 'loss/train': 1.6552473306655884} 11/07/2021 12:29:49 - INFO - __main__ - Step 108430: {'lr': 9.118026608101438e-05, 'samples': 20818560, 'steps': 108429, 'loss/train': 1.371156930923462} 11/07/2021 12:29:50 - INFO - __main__ - Step 108431: {'lr': 9.11761678095815e-05, 'samples': 20818752, 'steps': 108430, 'loss/train': 1.5423991680145264} 11/07/2021 12:29:50 - INFO - __main__ - Step 108432: {'lr': 9.117206960971216e-05, 'samples': 20818944, 'steps': 108431, 'loss/train': 1.1362591981887817} 11/07/2021 12:29:51 - INFO - __main__ - Step 108433: {'lr': 9.116797148140823e-05, 'samples': 20819136, 'steps': 108432, 'loss/train': 0.8809433579444885} 11/07/2021 12:29:51 - INFO - __main__ - Step 108434: {'lr': 9.116387342467161e-05, 'samples': 20819328, 'steps': 108433, 'loss/train': 0.7885024547576904} 11/07/2021 12:29:52 - INFO - __main__ - Step 108435: {'lr': 9.115977543950404e-05, 'samples': 20819520, 'steps': 108434, 'loss/train': 1.0931965112686157} 11/07/2021 12:29:53 - INFO - __main__ - Step 108436: {'lr': 9.115567752590748e-05, 'samples': 20819712, 'steps': 108435, 'loss/train': 1.4529598951339722} 11/07/2021 12:29:53 - INFO - __main__ - Step 108437: {'lr': 9.115157968388376e-05, 'samples': 20819904, 'steps': 108436, 'loss/train': 1.1362709999084473} 11/07/2021 12:29:53 - INFO - __main__ - Step 108438: {'lr': 9.114748191343464e-05, 'samples': 20820096, 'steps': 108437, 'loss/train': 1.3266056776046753} 11/07/2021 12:29:54 - INFO - __main__ - Step 108439: {'lr': 9.114338421456197e-05, 'samples': 20820288, 'steps': 108438, 'loss/train': 0.9184315800666809} 11/07/2021 12:29:54 - INFO - __main__ - Step 108440: {'lr': 9.113928658726767e-05, 'samples': 20820480, 'steps': 108439, 'loss/train': 0.6202395558357239} 11/07/2021 12:29:55 - INFO - __main__ - Step 108441: {'lr': 9.113518903155354e-05, 'samples': 20820672, 'steps': 108440, 'loss/train': 1.1781601905822754} 11/07/2021 12:29:55 - INFO - __main__ - Step 108442: {'lr': 9.113109154742146e-05, 'samples': 20820864, 'steps': 108441, 'loss/train': 1.0990698337554932} 11/07/2021 12:29:56 - INFO - __main__ - Step 108443: {'lr': 9.112699413487324e-05, 'samples': 20821056, 'steps': 108442, 'loss/train': 1.1961710453033447} 11/07/2021 12:29:56 - INFO - __main__ - Step 108444: {'lr': 9.112289679391075e-05, 'samples': 20821248, 'steps': 108443, 'loss/train': 1.4983036518096924} 11/07/2021 12:29:57 - INFO - __main__ - Step 108445: {'lr': 9.111879952453586e-05, 'samples': 20821440, 'steps': 108444, 'loss/train': 1.0862258672714233} 11/07/2021 12:29:57 - INFO - __main__ - Step 108446: {'lr': 9.111470232675034e-05, 'samples': 20821632, 'steps': 108445, 'loss/train': 1.457174301147461} 11/07/2021 12:29:58 - INFO - __main__ - Step 108447: {'lr': 9.11106052005561e-05, 'samples': 20821824, 'steps': 108446, 'loss/train': 1.094888687133789} 11/07/2021 12:29:59 - INFO - __main__ - Step 108448: {'lr': 9.1106508145955e-05, 'samples': 20822016, 'steps': 108447, 'loss/train': 1.184564232826233} 11/07/2021 12:29:59 - INFO - __main__ - Step 108449: {'lr': 9.110241116294882e-05, 'samples': 20822208, 'steps': 108448, 'loss/train': 1.576259732246399} 11/07/2021 12:29:59 - INFO - __main__ - Step 108450: {'lr': 9.109831425153956e-05, 'samples': 20822400, 'steps': 108449, 'loss/train': 1.1714954376220703} 11/07/2021 12:30:00 - INFO - __main__ - Step 108451: {'lr': 9.109421741172883e-05, 'samples': 20822592, 'steps': 108450, 'loss/train': 1.4846144914627075} 11/07/2021 12:30:01 - INFO - __main__ - Step 108452: {'lr': 9.10901206435186e-05, 'samples': 20822784, 'steps': 108451, 'loss/train': 1.6100677251815796} 11/07/2021 12:30:01 - INFO - __main__ - Step 108453: {'lr': 9.108602394691071e-05, 'samples': 20822976, 'steps': 108452, 'loss/train': 1.3232301473617554} 11/07/2021 12:30:01 - INFO - __main__ - Step 108454: {'lr': 9.108192732190701e-05, 'samples': 20823168, 'steps': 108453, 'loss/train': 0.9447699785232544} 11/07/2021 12:30:02 - INFO - __main__ - Step 108455: {'lr': 9.107783076850933e-05, 'samples': 20823360, 'steps': 108454, 'loss/train': 1.2425744533538818} 11/07/2021 12:30:02 - INFO - __main__ - Step 108456: {'lr': 9.107373428671955e-05, 'samples': 20823552, 'steps': 108455, 'loss/train': 0.60201495885849} 11/07/2021 12:30:03 - INFO - __main__ - Step 108457: {'lr': 9.106963787653949e-05, 'samples': 20823744, 'steps': 108456, 'loss/train': 3.6316380500793457} 11/07/2021 12:30:03 - INFO - __main__ - Step 108458: {'lr': 9.106554153797097e-05, 'samples': 20823936, 'steps': 108457, 'loss/train': 1.1945868730545044} 11/07/2021 12:30:04 - INFO - __main__ - Step 108459: {'lr': 9.10614452710159e-05, 'samples': 20824128, 'steps': 108458, 'loss/train': 1.2270675897598267} 11/07/2021 12:30:04 - INFO - __main__ - Step 108460: {'lr': 9.105734907567606e-05, 'samples': 20824320, 'steps': 108459, 'loss/train': 1.368660807609558} 11/07/2021 12:30:05 - INFO - __main__ - Step 108461: {'lr': 9.105325295195335e-05, 'samples': 20824512, 'steps': 108460, 'loss/train': 1.0007708072662354} 11/07/2021 12:30:06 - INFO - __main__ - Step 108462: {'lr': 9.104915689984957e-05, 'samples': 20824704, 'steps': 108461, 'loss/train': 1.5655536651611328} 11/07/2021 12:30:06 - INFO - __main__ - Step 108463: {'lr': 9.104506091936659e-05, 'samples': 20824896, 'steps': 108462, 'loss/train': 1.373734712600708} 11/07/2021 12:30:06 - INFO - __main__ - Step 108464: {'lr': 9.104096501050635e-05, 'samples': 20825088, 'steps': 108463, 'loss/train': 1.6434587240219116} 11/07/2021 12:30:07 - INFO - __main__ - Step 108465: {'lr': 9.103686917327053e-05, 'samples': 20825280, 'steps': 108464, 'loss/train': 1.462850570678711} 11/07/2021 12:30:07 - INFO - __main__ - Step 108466: {'lr': 9.1032773407661e-05, 'samples': 20825472, 'steps': 108465, 'loss/train': 1.402922511100769} 11/07/2021 12:30:08 - INFO - __main__ - Step 108467: {'lr': 9.102867771367967e-05, 'samples': 20825664, 'steps': 108466, 'loss/train': 1.6148717403411865} 11/07/2021 12:30:09 - INFO - __main__ - Step 108468: {'lr': 9.102458209132839e-05, 'samples': 20825856, 'steps': 108467, 'loss/train': 1.0300025939941406} 11/07/2021 12:30:09 - INFO - __main__ - Step 108469: {'lr': 9.102048654060896e-05, 'samples': 20826048, 'steps': 108468, 'loss/train': 1.141360878944397} 11/07/2021 12:30:09 - INFO - __main__ - Step 108470: {'lr': 9.101639106152324e-05, 'samples': 20826240, 'steps': 108469, 'loss/train': 1.435723900794983} 11/07/2021 12:30:10 - INFO - __main__ - Step 108471: {'lr': 9.101229565407307e-05, 'samples': 20826432, 'steps': 108470, 'loss/train': 0.41231876611709595} 11/07/2021 12:30:11 - INFO - __main__ - Step 108472: {'lr': 9.100820031826032e-05, 'samples': 20826624, 'steps': 108471, 'loss/train': 1.5507807731628418} 11/07/2021 12:30:11 - INFO - __main__ - Step 108473: {'lr': 9.100410505408682e-05, 'samples': 20826816, 'steps': 108472, 'loss/train': 1.4729011058807373} 11/07/2021 12:30:11 - INFO - __main__ - Step 108474: {'lr': 9.100000986155443e-05, 'samples': 20827008, 'steps': 108473, 'loss/train': 1.6278151273727417} 11/07/2021 12:30:12 - INFO - __main__ - Step 108475: {'lr': 9.099591474066496e-05, 'samples': 20827200, 'steps': 108474, 'loss/train': 1.4056133031845093} 11/07/2021 12:30:12 - INFO - __main__ - Step 108476: {'lr': 9.099181969142029e-05, 'samples': 20827392, 'steps': 108475, 'loss/train': 1.8852910995483398} 11/07/2021 12:30:12 - INFO - __main__ - Step 108477: {'lr': 9.098772471382233e-05, 'samples': 20827584, 'steps': 108476, 'loss/train': 1.1713811159133911} 11/07/2021 12:30:14 - INFO - __main__ - Step 108478: {'lr': 9.098362980787278e-05, 'samples': 20827776, 'steps': 108477, 'loss/train': 1.0451890230178833} 11/07/2021 12:30:14 - INFO - __main__ - Step 108479: {'lr': 9.097953497357354e-05, 'samples': 20827968, 'steps': 108478, 'loss/train': 1.484763503074646} 11/07/2021 12:30:14 - INFO - __main__ - Step 108480: {'lr': 9.097544021092647e-05, 'samples': 20828160, 'steps': 108479, 'loss/train': 1.2766910791397095} 11/07/2021 12:30:15 - INFO - __main__ - Step 108481: {'lr': 9.097134551993342e-05, 'samples': 20828352, 'steps': 108480, 'loss/train': 1.2549391984939575} 11/07/2021 12:30:15 - INFO - __main__ - Step 108482: {'lr': 9.096725090059621e-05, 'samples': 20828544, 'steps': 108481, 'loss/train': 1.2083231210708618} 11/07/2021 12:30:16 - INFO - __main__ - Step 108483: {'lr': 9.096315635291671e-05, 'samples': 20828736, 'steps': 108482, 'loss/train': 1.294958233833313} 11/07/2021 12:30:16 - INFO - __main__ - Step 108484: {'lr': 9.095906187689676e-05, 'samples': 20828928, 'steps': 108483, 'loss/train': 0.7526826858520508} 11/07/2021 12:30:17 - INFO - __main__ - Step 108485: {'lr': 9.09549674725382e-05, 'samples': 20829120, 'steps': 108484, 'loss/train': 0.07668790221214294} 11/07/2021 12:30:17 - INFO - __main__ - Step 108486: {'lr': 9.095087313984287e-05, 'samples': 20829312, 'steps': 108485, 'loss/train': 1.4110751152038574} 11/07/2021 12:30:17 - INFO - __main__ - Step 108487: {'lr': 9.094677887881264e-05, 'samples': 20829504, 'steps': 108486, 'loss/train': 1.3831911087036133} 11/07/2021 12:30:19 - INFO - __main__ - Step 108488: {'lr': 9.094268468944933e-05, 'samples': 20829696, 'steps': 108487, 'loss/train': 1.6398119926452637} 11/07/2021 12:30:19 - INFO - __main__ - Step 108489: {'lr': 9.093859057175479e-05, 'samples': 20829888, 'steps': 108488, 'loss/train': 1.8949426412582397} 11/07/2021 12:30:19 - INFO - __main__ - Step 108490: {'lr': 9.093449652573086e-05, 'samples': 20830080, 'steps': 108489, 'loss/train': 1.3999663591384888} 11/07/2021 12:30:20 - INFO - __main__ - Step 108491: {'lr': 9.093040255137949e-05, 'samples': 20830272, 'steps': 108490, 'loss/train': 1.2470399141311646} 11/07/2021 12:30:20 - INFO - __main__ - Step 108492: {'lr': 9.092630864870233e-05, 'samples': 20830464, 'steps': 108491, 'loss/train': 0.8958637714385986} 11/07/2021 12:30:21 - INFO - __main__ - Step 108493: {'lr': 9.092221481770133e-05, 'samples': 20830656, 'steps': 108492, 'loss/train': 1.485874891281128} 11/07/2021 12:30:21 - INFO - __main__ - Step 108494: {'lr': 9.091812105837833e-05, 'samples': 20830848, 'steps': 108493, 'loss/train': 1.4909846782684326} 11/07/2021 12:30:22 - INFO - __main__ - Step 108495: {'lr': 9.091402737073514e-05, 'samples': 20831040, 'steps': 108494, 'loss/train': 0.8528505563735962} 11/07/2021 12:30:22 - INFO - __main__ - Step 108496: {'lr': 9.090993375477366e-05, 'samples': 20831232, 'steps': 108495, 'loss/train': 2.142784357070923} 11/07/2021 12:30:22 - INFO - __main__ - Step 108497: {'lr': 9.09058402104957e-05, 'samples': 20831424, 'steps': 108496, 'loss/train': 1.7424252033233643} 11/07/2021 12:30:24 - INFO - __main__ - Step 108498: {'lr': 9.090174673790311e-05, 'samples': 20831616, 'steps': 108497, 'loss/train': 1.2323111295700073} 11/07/2021 12:30:24 - INFO - __main__ - Step 108499: {'lr': 9.089765333699776e-05, 'samples': 20831808, 'steps': 108498, 'loss/train': 0.8273144364356995} 11/07/2021 12:30:24 - INFO - __main__ - Step 108500: {'lr': 9.089356000778145e-05, 'samples': 20832000, 'steps': 108499, 'loss/train': 1.476417064666748} 11/07/2021 12:30:25 - INFO - __main__ - Step 108501: {'lr': 9.088946675025606e-05, 'samples': 20832192, 'steps': 108500, 'loss/train': 1.599969744682312} 11/07/2021 12:30:25 - INFO - __main__ - Step 108502: {'lr': 9.088537356442342e-05, 'samples': 20832384, 'steps': 108501, 'loss/train': 1.6538479328155518} 11/07/2021 12:30:25 - INFO - __main__ - Step 108503: {'lr': 9.088128045028535e-05, 'samples': 20832576, 'steps': 108502, 'loss/train': 1.2732282876968384} 11/07/2021 12:30:26 - INFO - __main__ - Step 108504: {'lr': 9.087718740784385e-05, 'samples': 20832768, 'steps': 108503, 'loss/train': 1.0700361728668213} 11/07/2021 12:30:27 - INFO - __main__ - Step 108505: {'lr': 9.087309443710051e-05, 'samples': 20832960, 'steps': 108504, 'loss/train': 1.6029165983200073} 11/07/2021 12:30:27 - INFO - __main__ - Step 108506: {'lr': 9.08690015380573e-05, 'samples': 20833152, 'steps': 108505, 'loss/train': 1.4051315784454346} 11/07/2021 12:30:27 - INFO - __main__ - Step 108507: {'lr': 9.086490871071609e-05, 'samples': 20833344, 'steps': 108506, 'loss/train': 1.4060612916946411} 11/07/2021 12:30:28 - INFO - __main__ - Step 108508: {'lr': 9.086081595507867e-05, 'samples': 20833536, 'steps': 108507, 'loss/train': 1.230907917022705} 11/07/2021 12:30:29 - INFO - __main__ - Step 108509: {'lr': 9.085672327114691e-05, 'samples': 20833728, 'steps': 108508, 'loss/train': 1.08930504322052} 11/07/2021 12:30:29 - INFO - __main__ - Step 108510: {'lr': 9.085263065892268e-05, 'samples': 20833920, 'steps': 108509, 'loss/train': 1.8134279251098633} 11/07/2021 12:30:29 - INFO - __main__ - Step 108511: {'lr': 9.084853811840779e-05, 'samples': 20834112, 'steps': 108510, 'loss/train': 1.1821459531784058} 11/07/2021 12:30:30 - INFO - __main__ - Step 108512: {'lr': 9.084444564960409e-05, 'samples': 20834304, 'steps': 108511, 'loss/train': 1.185217261314392} 11/07/2021 12:30:30 - INFO - __main__ - Step 108513: {'lr': 9.084035325251341e-05, 'samples': 20834496, 'steps': 108512, 'loss/train': 1.2795937061309814} 11/07/2021 12:30:31 - INFO - __main__ - Step 108514: {'lr': 9.083626092713765e-05, 'samples': 20834688, 'steps': 108513, 'loss/train': 1.6709253787994385} 11/07/2021 12:30:32 - INFO - __main__ - Step 108515: {'lr': 9.083216867347857e-05, 'samples': 20834880, 'steps': 108514, 'loss/train': 1.4068623781204224} 11/07/2021 12:30:32 - INFO - __main__ - Step 108516: {'lr': 9.08280764915381e-05, 'samples': 20835072, 'steps': 108515, 'loss/train': 1.135498285293579} 11/07/2021 12:30:32 - INFO - __main__ - Step 108517: {'lr': 9.0823984381318e-05, 'samples': 20835264, 'steps': 108516, 'loss/train': 0.6800389289855957} 11/07/2021 12:30:33 - INFO - __main__ - Step 108518: {'lr': 9.081989234282025e-05, 'samples': 20835456, 'steps': 108517, 'loss/train': 1.8415236473083496} 11/07/2021 12:30:34 - INFO - __main__ - Step 108519: {'lr': 9.081580037604656e-05, 'samples': 20835648, 'steps': 108518, 'loss/train': 0.7956506609916687} 11/07/2021 12:30:34 - INFO - __main__ - Step 108520: {'lr': 9.081170848099876e-05, 'samples': 20835840, 'steps': 108519, 'loss/train': 1.6329959630966187} 11/07/2021 12:30:34 - INFO - __main__ - Step 108521: {'lr': 9.080761665767878e-05, 'samples': 20836032, 'steps': 108520, 'loss/train': 1.4056396484375} 11/07/2021 12:30:35 - INFO - __main__ - Step 108522: {'lr': 9.080352490608842e-05, 'samples': 20836224, 'steps': 108521, 'loss/train': 1.6917119026184082} 11/07/2021 12:30:35 - INFO - __main__ - Step 108523: {'lr': 9.079943322622954e-05, 'samples': 20836416, 'steps': 108522, 'loss/train': 1.3858532905578613} 11/07/2021 12:30:36 - INFO - __main__ - Step 108524: {'lr': 9.079534161810396e-05, 'samples': 20836608, 'steps': 108523, 'loss/train': 1.3557343482971191} 11/07/2021 12:30:37 - INFO - __main__ - Step 108525: {'lr': 9.079125008171358e-05, 'samples': 20836800, 'steps': 108524, 'loss/train': 1.7004305124282837} 11/07/2021 12:30:37 - INFO - __main__ - Step 108526: {'lr': 9.078715861706016e-05, 'samples': 20836992, 'steps': 108525, 'loss/train': 2.5539278984069824} 11/07/2021 12:30:37 - INFO - __main__ - Step 108527: {'lr': 9.078306722414562e-05, 'samples': 20837184, 'steps': 108526, 'loss/train': 2.020127296447754} 11/07/2021 12:30:38 - INFO - __main__ - Step 108528: {'lr': 9.077897590297177e-05, 'samples': 20837376, 'steps': 108527, 'loss/train': 1.5275720357894897} 11/07/2021 12:30:39 - INFO - __main__ - Step 108529: {'lr': 9.077488465354044e-05, 'samples': 20837568, 'steps': 108528, 'loss/train': 1.5315030813217163} 11/07/2021 12:30:39 - INFO - __main__ - Step 108530: {'lr': 9.077079347585352e-05, 'samples': 20837760, 'steps': 108529, 'loss/train': 0.9443750977516174} 11/07/2021 12:30:39 - INFO - __main__ - Step 108531: {'lr': 9.076670236991289e-05, 'samples': 20837952, 'steps': 108530, 'loss/train': 1.5171027183532715} 11/07/2021 12:30:40 - INFO - __main__ - Step 108532: {'lr': 9.076261133572026e-05, 'samples': 20838144, 'steps': 108531, 'loss/train': 1.2690203189849854} 11/07/2021 12:30:40 - INFO - __main__ - Step 108533: {'lr': 9.075852037327751e-05, 'samples': 20838336, 'steps': 108532, 'loss/train': 1.299867033958435} 11/07/2021 12:30:41 - INFO - __main__ - Step 108534: {'lr': 9.075442948258653e-05, 'samples': 20838528, 'steps': 108533, 'loss/train': 1.4003134965896606} 11/07/2021 12:30:41 - INFO - __main__ - Step 108535: {'lr': 9.075033866364912e-05, 'samples': 20838720, 'steps': 108534, 'loss/train': 1.2776999473571777} 11/07/2021 12:30:42 - INFO - __main__ - Step 108536: {'lr': 9.074624791646718e-05, 'samples': 20838912, 'steps': 108535, 'loss/train': 1.3417744636535645} 11/07/2021 12:30:42 - INFO - __main__ - Step 108537: {'lr': 9.074215724104254e-05, 'samples': 20839104, 'steps': 108536, 'loss/train': 1.4402861595153809} 11/07/2021 12:30:42 - INFO - __main__ - Step 108538: {'lr': 9.0738066637377e-05, 'samples': 20839296, 'steps': 108537, 'loss/train': 1.1722807884216309} 11/07/2021 12:30:43 - INFO - __main__ - Step 108539: {'lr': 9.073397610547244e-05, 'samples': 20839488, 'steps': 108538, 'loss/train': 1.9188899993896484} 11/07/2021 12:30:44 - INFO - __main__ - Step 108540: {'lr': 9.072988564533066e-05, 'samples': 20839680, 'steps': 108539, 'loss/train': 1.0996464490890503} 11/07/2021 12:30:44 - INFO - __main__ - Step 108541: {'lr': 9.072579525695357e-05, 'samples': 20839872, 'steps': 108540, 'loss/train': 1.2439165115356445} 11/07/2021 12:30:44 - INFO - __main__ - Step 108542: {'lr': 9.072170494034296e-05, 'samples': 20840064, 'steps': 108541, 'loss/train': 1.4768271446228027} 11/07/2021 12:30:45 - INFO - __main__ - Step 108543: {'lr': 9.071761469550071e-05, 'samples': 20840256, 'steps': 108542, 'loss/train': 1.1761400699615479} 11/07/2021 12:30:45 - INFO - __main__ - Step 108544: {'lr': 9.071352452242865e-05, 'samples': 20840448, 'steps': 108543, 'loss/train': 0.9603530168533325} 11/07/2021 12:30:46 - INFO - __main__ - Step 108545: {'lr': 9.070943442112867e-05, 'samples': 20840640, 'steps': 108544, 'loss/train': 1.293430209159851} 11/07/2021 12:30:47 - INFO - __main__ - Step 108546: {'lr': 9.07053443916025e-05, 'samples': 20840832, 'steps': 108545, 'loss/train': 1.6520060300827026} 11/07/2021 12:30:47 - INFO - __main__ - Step 108547: {'lr': 9.070125443385205e-05, 'samples': 20841024, 'steps': 108546, 'loss/train': 1.400207281112671} 11/07/2021 12:30:47 - INFO - __main__ - Step 108548: {'lr': 9.069716454787914e-05, 'samples': 20841216, 'steps': 108547, 'loss/train': 1.1454346179962158} 11/07/2021 12:30:48 - INFO - __main__ - Step 108549: {'lr': 9.069307473368562e-05, 'samples': 20841408, 'steps': 108548, 'loss/train': 1.4884828329086304} 11/07/2021 12:30:49 - INFO - __main__ - Step 108550: {'lr': 9.068898499127337e-05, 'samples': 20841600, 'steps': 108549, 'loss/train': 1.551822304725647} 11/07/2021 12:30:49 - INFO - __main__ - Step 108551: {'lr': 9.068489532064419e-05, 'samples': 20841792, 'steps': 108550, 'loss/train': 1.4580305814743042} 11/07/2021 12:30:50 - INFO - __main__ - Step 108552: {'lr': 9.068080572179995e-05, 'samples': 20841984, 'steps': 108551, 'loss/train': 1.2470487356185913} 11/07/2021 12:30:50 - INFO - __main__ - Step 108553: {'lr': 9.067671619474247e-05, 'samples': 20842176, 'steps': 108552, 'loss/train': 0.564156174659729} 11/07/2021 12:30:50 - INFO - __main__ - Step 108554: {'lr': 9.067262673947361e-05, 'samples': 20842368, 'steps': 108553, 'loss/train': 1.305868148803711} 11/07/2021 12:30:51 - INFO - __main__ - Step 108555: {'lr': 9.06685373559952e-05, 'samples': 20842560, 'steps': 108554, 'loss/train': 1.389886498451233} 11/07/2021 12:30:52 - INFO - __main__ - Step 108556: {'lr': 9.066444804430917e-05, 'samples': 20842752, 'steps': 108555, 'loss/train': 0.38732895255088806} 11/07/2021 12:30:52 - INFO - __main__ - Step 108557: {'lr': 9.06603588044172e-05, 'samples': 20842944, 'steps': 108556, 'loss/train': 1.0991681814193726} 11/07/2021 12:30:52 - INFO - __main__ - Step 108558: {'lr': 9.06562696363212e-05, 'samples': 20843136, 'steps': 108557, 'loss/train': 1.213672161102295} 11/07/2021 12:30:53 - INFO - __main__ - Step 108559: {'lr': 9.065218054002306e-05, 'samples': 20843328, 'steps': 108558, 'loss/train': 1.29802668094635} 11/07/2021 12:30:54 - INFO - __main__ - Step 108560: {'lr': 9.064809151552456e-05, 'samples': 20843520, 'steps': 108559, 'loss/train': 1.5531446933746338} 11/07/2021 12:30:55 - INFO - __main__ - Step 108561: {'lr': 9.064400256282756e-05, 'samples': 20843712, 'steps': 108560, 'loss/train': 1.263887882232666} 11/07/2021 12:30:55 - INFO - __main__ - Step 108562: {'lr': 9.063991368193394e-05, 'samples': 20843904, 'steps': 108561, 'loss/train': 1.6762139797210693} 11/07/2021 12:30:55 - INFO - __main__ - Step 108563: {'lr': 9.063582487284553e-05, 'samples': 20844096, 'steps': 108562, 'loss/train': 1.7404037714004517} 11/07/2021 12:30:56 - INFO - __main__ - Step 108564: {'lr': 9.06317361355641e-05, 'samples': 20844288, 'steps': 108563, 'loss/train': 1.240892767906189} 11/07/2021 12:30:56 - INFO - __main__ - Step 108565: {'lr': 9.062764747009162e-05, 'samples': 20844480, 'steps': 108564, 'loss/train': 1.3843432664871216} 11/07/2021 12:30:57 - INFO - __main__ - Step 108566: {'lr': 9.062355887642981e-05, 'samples': 20844672, 'steps': 108565, 'loss/train': 1.2268239259719849} 11/07/2021 12:30:57 - INFO - __main__ - Step 108567: {'lr': 9.061947035458068e-05, 'samples': 20844864, 'steps': 108566, 'loss/train': 1.7981739044189453} 11/07/2021 12:30:58 - INFO - __main__ - Step 108568: {'lr': 9.061538190454586e-05, 'samples': 20845056, 'steps': 108567, 'loss/train': 1.513143539428711} 11/07/2021 12:30:58 - INFO - __main__ - Step 108569: {'lr': 9.06112935263273e-05, 'samples': 20845248, 'steps': 108568, 'loss/train': 1.3547836542129517} 11/07/2021 12:30:58 - INFO - __main__ - Step 108570: {'lr': 9.060720521992682e-05, 'samples': 20845440, 'steps': 108569, 'loss/train': 1.5327178239822388} 11/07/2021 12:31:01 - INFO - __main__ - Step 108571: {'lr': 9.060311698534627e-05, 'samples': 20845632, 'steps': 108570, 'loss/train': 1.4580130577087402} 11/07/2021 12:31:01 - INFO - __main__ - Step 108572: {'lr': 9.05990288225875e-05, 'samples': 20845824, 'steps': 108571, 'loss/train': 1.1052953004837036} 11/07/2021 12:31:01 - INFO - __main__ - Step 108573: {'lr': 9.059494073165236e-05, 'samples': 20846016, 'steps': 108572, 'loss/train': 1.3078556060791016} 11/07/2021 12:31:02 - INFO - __main__ - Step 108574: {'lr': 9.059085271254266e-05, 'samples': 20846208, 'steps': 108573, 'loss/train': 1.785513162612915} 11/07/2021 12:31:02 - INFO - __main__ - Step 108575: {'lr': 9.058676476526029e-05, 'samples': 20846400, 'steps': 108574, 'loss/train': 1.7650436162948608} 11/07/2021 12:31:02 - INFO - __main__ - Step 108576: {'lr': 9.058267688980703e-05, 'samples': 20846592, 'steps': 108575, 'loss/train': 1.755894660949707} 11/07/2021 12:31:03 - INFO - __main__ - Step 108577: {'lr': 9.057858908618477e-05, 'samples': 20846784, 'steps': 108576, 'loss/train': 1.7661875486373901} 11/07/2021 12:31:03 - INFO - __main__ - Step 108578: {'lr': 9.057450135439544e-05, 'samples': 20846976, 'steps': 108577, 'loss/train': 1.8874479532241821} 11/07/2021 12:31:04 - INFO - __main__ - Step 108579: {'lr': 9.057041369444066e-05, 'samples': 20847168, 'steps': 108578, 'loss/train': 1.6809687614440918} 11/07/2021 12:31:05 - INFO - __main__ - Step 108580: {'lr': 9.056632610632243e-05, 'samples': 20847360, 'steps': 108579, 'loss/train': 1.464518427848816} 11/07/2021 12:31:05 - INFO - __main__ - Step 108581: {'lr': 9.056223859004253e-05, 'samples': 20847552, 'steps': 108580, 'loss/train': 1.0404167175292969} 11/07/2021 12:31:05 - INFO - __main__ - Step 108582: {'lr': 9.055815114560281e-05, 'samples': 20847744, 'steps': 108581, 'loss/train': 1.524406909942627} 11/07/2021 12:31:06 - INFO - __main__ - Step 108583: {'lr': 9.055406377300515e-05, 'samples': 20847936, 'steps': 108582, 'loss/train': 1.2610586881637573} 11/07/2021 12:31:07 - INFO - __main__ - Step 108584: {'lr': 9.054997647225138e-05, 'samples': 20848128, 'steps': 108583, 'loss/train': 1.0086967945098877} 11/07/2021 12:31:07 - INFO - __main__ - Step 108585: {'lr': 9.054588924334332e-05, 'samples': 20848320, 'steps': 108584, 'loss/train': 1.3168208599090576} 11/07/2021 12:31:07 - INFO - __main__ - Step 108586: {'lr': 9.05418020862828e-05, 'samples': 20848512, 'steps': 108585, 'loss/train': 0.8266898989677429} 11/07/2021 12:31:08 - INFO - __main__ - Step 108587: {'lr': 9.053771500107169e-05, 'samples': 20848704, 'steps': 108586, 'loss/train': 0.9786770343780518} 11/07/2021 12:31:08 - INFO - __main__ - Step 108588: {'lr': 9.053362798771184e-05, 'samples': 20848896, 'steps': 108587, 'loss/train': 1.3559666872024536} 11/07/2021 12:31:09 - INFO - __main__ - Step 108589: {'lr': 9.052954104620517e-05, 'samples': 20849088, 'steps': 108588, 'loss/train': 1.6188124418258667} 11/07/2021 12:31:09 - INFO - __main__ - Step 108590: {'lr': 9.052545417655333e-05, 'samples': 20849280, 'steps': 108589, 'loss/train': 1.6182780265808105} 11/07/2021 12:31:10 - INFO - __main__ - Step 108591: {'lr': 9.052136737875824e-05, 'samples': 20849472, 'steps': 108590, 'loss/train': 1.4034143686294556} 11/07/2021 12:31:10 - INFO - __main__ - Step 108592: {'lr': 9.05172806528218e-05, 'samples': 20849664, 'steps': 108591, 'loss/train': 1.1655490398406982} 11/07/2021 12:31:11 - INFO - __main__ - Step 108593: {'lr': 9.051319399874577e-05, 'samples': 20849856, 'steps': 108592, 'loss/train': 1.4858406782150269} 11/07/2021 12:31:12 - INFO - __main__ - Step 108594: {'lr': 9.050910741653206e-05, 'samples': 20850048, 'steps': 108593, 'loss/train': 1.4829339981079102} 11/07/2021 12:31:12 - INFO - __main__ - Step 108595: {'lr': 9.050502090618248e-05, 'samples': 20850240, 'steps': 108594, 'loss/train': 1.348148226737976} 11/07/2021 12:31:12 - INFO - __main__ - Step 108596: {'lr': 9.050093446769889e-05, 'samples': 20850432, 'steps': 108595, 'loss/train': 1.3573572635650635} 11/07/2021 12:31:13 - INFO - __main__ - Step 108597: {'lr': 9.04968481010831e-05, 'samples': 20850624, 'steps': 108596, 'loss/train': 1.7028462886810303} 11/07/2021 12:31:13 - INFO - __main__ - Step 108598: {'lr': 9.049276180633698e-05, 'samples': 20850816, 'steps': 108597, 'loss/train': 0.8140706419944763} 11/07/2021 12:31:14 - INFO - __main__ - Step 108599: {'lr': 9.048867558346236e-05, 'samples': 20851008, 'steps': 108598, 'loss/train': 1.2304824590682983} 11/07/2021 12:31:14 - INFO - __main__ - Step 108600: {'lr': 9.048458943246116e-05, 'samples': 20851200, 'steps': 108599, 'loss/train': 1.5007249116897583} 11/07/2021 12:31:15 - INFO - __main__ - Step 108601: {'lr': 9.048050335333505e-05, 'samples': 20851392, 'steps': 108600, 'loss/train': 1.8924510478973389} 11/07/2021 12:31:15 - INFO - __main__ - Step 108602: {'lr': 9.047641734608597e-05, 'samples': 20851584, 'steps': 108601, 'loss/train': 0.9339269995689392} 11/07/2021 12:31:15 - INFO - __main__ - Step 108603: {'lr': 9.047233141071576e-05, 'samples': 20851776, 'steps': 108602, 'loss/train': 1.7685647010803223} 11/07/2021 12:31:16 - INFO - __main__ - Step 108604: {'lr': 9.046824554722624e-05, 'samples': 20851968, 'steps': 108603, 'loss/train': 1.7785810232162476} 11/07/2021 12:31:17 - INFO - __main__ - Step 108605: {'lr': 9.046415975561928e-05, 'samples': 20852160, 'steps': 108604, 'loss/train': 1.194557547569275} 11/07/2021 12:31:17 - INFO - __main__ - Step 108606: {'lr': 9.04600740358967e-05, 'samples': 20852352, 'steps': 108605, 'loss/train': 1.516126275062561} 11/07/2021 12:31:18 - INFO - __main__ - Step 108607: {'lr': 9.045598838806038e-05, 'samples': 20852544, 'steps': 108606, 'loss/train': 1.173627495765686} 11/07/2021 12:31:18 - INFO - __main__ - Step 108608: {'lr': 9.04519028121121e-05, 'samples': 20852736, 'steps': 108607, 'loss/train': 1.4749773740768433} 11/07/2021 12:31:18 - INFO - __main__ - Step 108609: {'lr': 9.044781730805374e-05, 'samples': 20852928, 'steps': 108608, 'loss/train': 1.338724970817566} 11/07/2021 12:31:19 - INFO - __main__ - Step 108610: {'lr': 9.044373187588711e-05, 'samples': 20853120, 'steps': 108609, 'loss/train': 1.0385074615478516} 11/07/2021 12:31:20 - INFO - __main__ - Step 108611: {'lr': 9.04396465156141e-05, 'samples': 20853312, 'steps': 108610, 'loss/train': 0.7479522824287415} 11/07/2021 12:31:20 - INFO - __main__ - Step 108612: {'lr': 9.043556122723658e-05, 'samples': 20853504, 'steps': 108611, 'loss/train': 1.0138912200927734} 11/07/2021 12:31:20 - INFO - __main__ - Step 108613: {'lr': 9.04314760107563e-05, 'samples': 20853696, 'steps': 108612, 'loss/train': 0.34371325373649597} 11/07/2021 12:31:21 - INFO - __main__ - Step 108614: {'lr': 9.042739086617507e-05, 'samples': 20853888, 'steps': 108613, 'loss/train': 1.2398467063903809} 11/07/2021 12:31:22 - INFO - __main__ - Step 108615: {'lr': 9.042330579349484e-05, 'samples': 20854080, 'steps': 108614, 'loss/train': 1.6546324491500854} 11/07/2021 12:31:22 - INFO - __main__ - Step 108616: {'lr': 9.041922079271739e-05, 'samples': 20854272, 'steps': 108615, 'loss/train': 1.2347108125686646} 11/07/2021 12:31:23 - INFO - __main__ - Step 108617: {'lr': 9.041513586384458e-05, 'samples': 20854464, 'steps': 108616, 'loss/train': 0.4739481508731842} 11/07/2021 12:31:23 - INFO - __main__ - Step 108618: {'lr': 9.041105100687824e-05, 'samples': 20854656, 'steps': 108617, 'loss/train': 1.5542114973068237} 11/07/2021 12:31:23 - INFO - __main__ - Step 108619: {'lr': 9.040696622182023e-05, 'samples': 20854848, 'steps': 108618, 'loss/train': 1.3450920581817627} 11/07/2021 12:31:24 - INFO - __main__ - Step 108620: {'lr': 9.040288150867238e-05, 'samples': 20855040, 'steps': 108619, 'loss/train': 1.2923359870910645} 11/07/2021 12:31:25 - INFO - __main__ - Step 108621: {'lr': 9.039879686743652e-05, 'samples': 20855232, 'steps': 108620, 'loss/train': 1.2659757137298584} 11/07/2021 12:31:25 - INFO - __main__ - Step 108622: {'lr': 9.03947122981145e-05, 'samples': 20855424, 'steps': 108621, 'loss/train': 1.3408869504928589} 11/07/2021 12:31:25 - INFO - __main__ - Step 108623: {'lr': 9.039062780070817e-05, 'samples': 20855616, 'steps': 108622, 'loss/train': 0.9770870804786682} 11/07/2021 12:31:26 - INFO - __main__ - Step 108624: {'lr': 9.038654337521934e-05, 'samples': 20855808, 'steps': 108623, 'loss/train': 1.4477357864379883} 11/07/2021 12:31:26 - INFO - __main__ - Step 108625: {'lr': 9.038245902164996e-05, 'samples': 20856000, 'steps': 108624, 'loss/train': 1.3831937313079834} 11/07/2021 12:31:27 - INFO - __main__ - Step 108626: {'lr': 9.03783747400017e-05, 'samples': 20856192, 'steps': 108625, 'loss/train': 0.7624614238739014} 11/07/2021 12:31:27 - INFO - __main__ - Step 108627: {'lr': 9.037429053027648e-05, 'samples': 20856384, 'steps': 108626, 'loss/train': 1.1508337259292603} 11/07/2021 12:31:28 - INFO - __main__ - Step 108628: {'lr': 9.037020639247614e-05, 'samples': 20856576, 'steps': 108627, 'loss/train': 1.095122218132019} 11/07/2021 12:31:28 - INFO - __main__ - Step 108629: {'lr': 9.036612232660255e-05, 'samples': 20856768, 'steps': 108628, 'loss/train': 1.0088934898376465} 11/07/2021 12:31:28 - INFO - __main__ - Step 108630: {'lr': 9.03620383326575e-05, 'samples': 20856960, 'steps': 108629, 'loss/train': 1.389054536819458} 11/07/2021 12:31:30 - INFO - __main__ - Step 108631: {'lr': 9.035795441064285e-05, 'samples': 20857152, 'steps': 108630, 'loss/train': 1.224412202835083} 11/07/2021 12:31:30 - INFO - __main__ - Step 108632: {'lr': 9.035387056056044e-05, 'samples': 20857344, 'steps': 108631, 'loss/train': 1.0502846240997314} 11/07/2021 12:31:30 - INFO - __main__ - Step 108633: {'lr': 9.034978678241213e-05, 'samples': 20857536, 'steps': 108632, 'loss/train': 1.5873576402664185} 11/07/2021 12:31:31 - INFO - __main__ - Step 108634: {'lr': 9.034570307619972e-05, 'samples': 20857728, 'steps': 108633, 'loss/train': 1.408555507659912} 11/07/2021 12:31:31 - INFO - __main__ - Step 108635: {'lr': 9.034161944192506e-05, 'samples': 20857920, 'steps': 108634, 'loss/train': 1.3338189125061035} 11/07/2021 12:31:32 - INFO - __main__ - Step 108636: {'lr': 9.033753587959004e-05, 'samples': 20858112, 'steps': 108635, 'loss/train': 1.1799015998840332} 11/07/2021 12:31:33 - INFO - __main__ - Step 108637: {'lr': 9.033345238919643e-05, 'samples': 20858304, 'steps': 108636, 'loss/train': 0.5409135818481445} 11/07/2021 12:31:33 - INFO - __main__ - Step 108638: {'lr': 9.032936897074615e-05, 'samples': 20858496, 'steps': 108637, 'loss/train': 0.36155807971954346} 11/07/2021 12:31:34 - INFO - __main__ - Step 108639: {'lr': 9.032528562424103e-05, 'samples': 20858688, 'steps': 108638, 'loss/train': 1.4797818660736084} 11/07/2021 12:31:34 - INFO - __main__ - Step 108640: {'lr': 9.03212023496828e-05, 'samples': 20858880, 'steps': 108639, 'loss/train': 1.1721444129943848} 11/07/2021 12:31:35 - INFO - __main__ - Step 108641: {'lr': 9.031711914707339e-05, 'samples': 20859072, 'steps': 108640, 'loss/train': 1.0416165590286255} 11/07/2021 12:31:35 - INFO - __main__ - Step 108642: {'lr': 9.031303601641461e-05, 'samples': 20859264, 'steps': 108641, 'loss/train': 1.3671051263809204} 11/07/2021 12:31:36 - INFO - __main__ - Step 108643: {'lr': 9.03089529577083e-05, 'samples': 20859456, 'steps': 108642, 'loss/train': 1.0515458583831787} 11/07/2021 12:31:36 - INFO - __main__ - Step 108644: {'lr': 9.030486997095633e-05, 'samples': 20859648, 'steps': 108643, 'loss/train': 1.522704839706421} 11/07/2021 12:31:36 - INFO - __main__ - Step 108645: {'lr': 9.030078705616049e-05, 'samples': 20859840, 'steps': 108644, 'loss/train': 0.6943202018737793} 11/07/2021 12:31:37 - INFO - __main__ - Step 108646: {'lr': 9.029670421332267e-05, 'samples': 20860032, 'steps': 108645, 'loss/train': 1.477813959121704} 11/07/2021 12:31:38 - INFO - __main__ - Step 108647: {'lr': 9.029262144244471e-05, 'samples': 20860224, 'steps': 108646, 'loss/train': 1.1460131406784058} 11/07/2021 12:31:38 - INFO - __main__ - Step 108648: {'lr': 9.028853874352841e-05, 'samples': 20860416, 'steps': 108647, 'loss/train': 1.3642067909240723} 11/07/2021 12:31:38 - INFO - __main__ - Step 108649: {'lr': 9.028445611657563e-05, 'samples': 20860608, 'steps': 108648, 'loss/train': 1.564026117324829} 11/07/2021 12:31:39 - INFO - __main__ - Step 108650: {'lr': 9.028037356158822e-05, 'samples': 20860800, 'steps': 108649, 'loss/train': 1.6379125118255615} 11/07/2021 12:31:40 - INFO - __main__ - Step 108651: {'lr': 9.0276291078568e-05, 'samples': 20860992, 'steps': 108650, 'loss/train': 1.4201008081436157} 11/07/2021 12:31:40 - INFO - __main__ - Step 108652: {'lr': 9.027220866751689e-05, 'samples': 20861184, 'steps': 108651, 'loss/train': 1.2100850343704224} 11/07/2021 12:31:40 - INFO - __main__ - Step 108653: {'lr': 9.026812632843661e-05, 'samples': 20861376, 'steps': 108652, 'loss/train': 1.0999430418014526} 11/07/2021 12:31:41 - INFO - __main__ - Step 108654: {'lr': 9.026404406132901e-05, 'samples': 20861568, 'steps': 108653, 'loss/train': 1.4126389026641846} 11/07/2021 12:31:41 - INFO - __main__ - Step 108655: {'lr': 9.0259961866196e-05, 'samples': 20861760, 'steps': 108654, 'loss/train': 1.4220423698425293} 11/07/2021 12:31:42 - INFO - __main__ - Step 108656: {'lr': 9.025587974303937e-05, 'samples': 20861952, 'steps': 108655, 'loss/train': 1.506300449371338} 11/07/2021 12:31:43 - INFO - __main__ - Step 108657: {'lr': 9.025179769186098e-05, 'samples': 20862144, 'steps': 108656, 'loss/train': 1.3159770965576172} 11/07/2021 12:31:43 - INFO - __main__ - Step 108658: {'lr': 9.024771571266266e-05, 'samples': 20862336, 'steps': 108657, 'loss/train': 1.1041697263717651} 11/07/2021 12:31:43 - INFO - __main__ - Step 108659: {'lr': 9.024363380544626e-05, 'samples': 20862528, 'steps': 108658, 'loss/train': 1.6202201843261719} 11/07/2021 12:31:44 - INFO - __main__ - Step 108660: {'lr': 9.023955197021361e-05, 'samples': 20862720, 'steps': 108659, 'loss/train': 1.1387547254562378} 11/07/2021 12:31:44 - INFO - __main__ - Step 108661: {'lr': 9.023547020696654e-05, 'samples': 20862912, 'steps': 108660, 'loss/train': 0.9852837324142456} 11/07/2021 12:31:45 - INFO - __main__ - Step 108662: {'lr': 9.023138851570692e-05, 'samples': 20863104, 'steps': 108661, 'loss/train': 1.193202018737793} 11/07/2021 12:31:46 - INFO - __main__ - Step 108663: {'lr': 9.022730689643654e-05, 'samples': 20863296, 'steps': 108662, 'loss/train': 1.34829580783844} 11/07/2021 12:31:46 - INFO - __main__ - Step 108664: {'lr': 9.022322534915731e-05, 'samples': 20863488, 'steps': 108663, 'loss/train': 1.471274971961975} 11/07/2021 12:31:46 - INFO - __main__ - Step 108665: {'lr': 9.021914387387101e-05, 'samples': 20863680, 'steps': 108664, 'loss/train': 1.1453601121902466} 11/07/2021 12:31:47 - INFO - __main__ - Step 108666: {'lr': 9.021506247057959e-05, 'samples': 20863872, 'steps': 108665, 'loss/train': 0.3422747850418091} 11/07/2021 12:31:48 - INFO - __main__ - Step 108667: {'lr': 9.021098113928472e-05, 'samples': 20864064, 'steps': 108666, 'loss/train': 1.2801514863967896} 11/07/2021 12:31:48 - INFO - __main__ - Step 108668: {'lr': 9.020689987998828e-05, 'samples': 20864256, 'steps': 108667, 'loss/train': 1.3941118717193604} 11/07/2021 12:31:48 - INFO - __main__ - Step 108669: {'lr': 9.020281869269217e-05, 'samples': 20864448, 'steps': 108668, 'loss/train': 2.1980795860290527} 11/07/2021 12:31:49 - INFO - __main__ - Step 108670: {'lr': 9.019873757739821e-05, 'samples': 20864640, 'steps': 108669, 'loss/train': 1.584359049797058} 11/07/2021 12:31:49 - INFO - __main__ - Step 108671: {'lr': 9.019465653410824e-05, 'samples': 20864832, 'steps': 108670, 'loss/train': 0.7065557837486267} 11/07/2021 12:31:50 - INFO - __main__ - Step 108672: {'lr': 9.019057556282406e-05, 'samples': 20865024, 'steps': 108671, 'loss/train': 0.9755332469940186} 11/07/2021 12:31:51 - INFO - __main__ - Step 108673: {'lr': 9.018649466354758e-05, 'samples': 20865216, 'steps': 108672, 'loss/train': 1.269700527191162} 11/07/2021 12:31:51 - INFO - __main__ - Step 108674: {'lr': 9.018241383628056e-05, 'samples': 20865408, 'steps': 108673, 'loss/train': 1.4095174074172974} 11/07/2021 12:31:51 - INFO - __main__ - Step 108675: {'lr': 9.017833308102491e-05, 'samples': 20865600, 'steps': 108674, 'loss/train': 0.7926313877105713} 11/07/2021 12:31:52 - INFO - __main__ - Step 108676: {'lr': 9.017425239778242e-05, 'samples': 20865792, 'steps': 108675, 'loss/train': 1.251627802848816} 11/07/2021 12:31:52 - INFO - __main__ - Step 108677: {'lr': 9.017017178655495e-05, 'samples': 20865984, 'steps': 108676, 'loss/train': 1.1836925745010376} 11/07/2021 12:31:53 - INFO - __main__ - Step 108678: {'lr': 9.016609124734435e-05, 'samples': 20866176, 'steps': 108677, 'loss/train': 1.360695481300354} 11/07/2021 12:31:53 - INFO - __main__ - Step 108679: {'lr': 9.016201078015248e-05, 'samples': 20866368, 'steps': 108678, 'loss/train': 2.8324878215789795} 11/07/2021 12:31:54 - INFO - __main__ - Step 108680: {'lr': 9.01579303849811e-05, 'samples': 20866560, 'steps': 108679, 'loss/train': 1.500550627708435} 11/07/2021 12:31:54 - INFO - __main__ - Step 108681: {'lr': 9.015385006183207e-05, 'samples': 20866752, 'steps': 108680, 'loss/train': 1.223642110824585} 11/07/2021 12:31:54 - INFO - __main__ - Step 108682: {'lr': 9.014976981070727e-05, 'samples': 20866944, 'steps': 108681, 'loss/train': 1.0815016031265259} 11/07/2021 12:31:55 - INFO - __main__ - Step 108683: {'lr': 9.014568963160849e-05, 'samples': 20867136, 'steps': 108682, 'loss/train': 1.8999485969543457} 11/07/2021 12:31:56 - INFO - __main__ - Step 108684: {'lr': 9.014160952453762e-05, 'samples': 20867328, 'steps': 108683, 'loss/train': 1.7725788354873657} 11/07/2021 12:31:56 - INFO - __main__ - Step 108685: {'lr': 9.013752948949647e-05, 'samples': 20867520, 'steps': 108684, 'loss/train': 0.7718872427940369} 11/07/2021 12:31:56 - INFO - __main__ - Step 108686: {'lr': 9.013344952648686e-05, 'samples': 20867712, 'steps': 108685, 'loss/train': 1.6192295551300049} 11/07/2021 12:31:57 - INFO - __main__ - Step 108687: {'lr': 9.01293696355107e-05, 'samples': 20867904, 'steps': 108686, 'loss/train': 1.5155298709869385} 11/07/2021 12:31:58 - INFO - __main__ - Step 108688: {'lr': 9.012528981656973e-05, 'samples': 20868096, 'steps': 108687, 'loss/train': 1.7543604373931885} 11/07/2021 12:31:58 - INFO - __main__ - Step 108689: {'lr': 9.012121006966583e-05, 'samples': 20868288, 'steps': 108688, 'loss/train': 1.8267220258712769} 11/07/2021 12:31:59 - INFO - __main__ - Step 108690: {'lr': 9.011713039480088e-05, 'samples': 20868480, 'steps': 108689, 'loss/train': 1.64411199092865} 11/07/2021 12:31:59 - INFO - __main__ - Step 108691: {'lr': 9.011305079197669e-05, 'samples': 20868672, 'steps': 108690, 'loss/train': 1.4482578039169312} 11/07/2021 12:31:59 - INFO - __main__ - Step 108692: {'lr': 9.010897126119517e-05, 'samples': 20868864, 'steps': 108691, 'loss/train': 1.1076008081436157} 11/07/2021 12:32:00 - INFO - __main__ - Step 108693: {'lr': 9.010489180245796e-05, 'samples': 20869056, 'steps': 108692, 'loss/train': 1.0517082214355469} 11/07/2021 12:32:01 - INFO - __main__ - Step 108694: {'lr': 9.010081241576703e-05, 'samples': 20869248, 'steps': 108693, 'loss/train': 1.3984988927841187} 11/07/2021 12:32:01 - INFO - __main__ - Step 108695: {'lr': 9.009673310112424e-05, 'samples': 20869440, 'steps': 108694, 'loss/train': 1.2634516954421997} 11/07/2021 12:32:02 - INFO - __main__ - Step 108696: {'lr': 9.009265385853138e-05, 'samples': 20869632, 'steps': 108695, 'loss/train': 1.073246717453003} 11/07/2021 12:32:02 - INFO - __main__ - Step 108697: {'lr': 9.008857468799028e-05, 'samples': 20869824, 'steps': 108696, 'loss/train': 1.7476357221603394} 11/07/2021 12:32:03 - INFO - __main__ - Step 108698: {'lr': 9.008449558950283e-05, 'samples': 20870016, 'steps': 108697, 'loss/train': 1.3321176767349243} 11/07/2021 12:32:04 - INFO - __main__ - Step 108699: {'lr': 9.008041656307081e-05, 'samples': 20870208, 'steps': 108698, 'loss/train': 0.9402371644973755} 11/07/2021 12:32:04 - INFO - __main__ - Step 108700: {'lr': 9.007633760869614e-05, 'samples': 20870400, 'steps': 108699, 'loss/train': 1.6950352191925049} 11/07/2021 12:32:04 - INFO - __main__ - Step 108701: {'lr': 9.007225872638053e-05, 'samples': 20870592, 'steps': 108700, 'loss/train': 1.0116167068481445} 11/07/2021 12:32:05 - INFO - __main__ - Step 108702: {'lr': 9.006817991612595e-05, 'samples': 20870784, 'steps': 108701, 'loss/train': 1.4512673616409302} 11/07/2021 12:32:05 - INFO - __main__ - Step 108703: {'lr': 9.006410117793415e-05, 'samples': 20870976, 'steps': 108702, 'loss/train': 1.4324603080749512} 11/07/2021 12:32:06 - INFO - __main__ - Step 108704: {'lr': 9.006002251180701e-05, 'samples': 20871168, 'steps': 108703, 'loss/train': 1.8607447147369385} 11/07/2021 12:32:06 - INFO - __main__ - Step 108705: {'lr': 9.005594391774635e-05, 'samples': 20871360, 'steps': 108704, 'loss/train': 1.135546326637268} 11/07/2021 12:32:07 - INFO - __main__ - Step 108706: {'lr': 9.00518653957541e-05, 'samples': 20871552, 'steps': 108705, 'loss/train': 1.3290084600448608} 11/07/2021 12:32:07 - INFO - __main__ - Step 108707: {'lr': 9.004778694583193e-05, 'samples': 20871744, 'steps': 108706, 'loss/train': 1.1078619956970215} 11/07/2021 12:32:07 - INFO - __main__ - Step 108708: {'lr': 9.004370856798177e-05, 'samples': 20871936, 'steps': 108707, 'loss/train': 1.5468567609786987} 11/07/2021 12:32:09 - INFO - __main__ - Step 108709: {'lr': 9.003963026220543e-05, 'samples': 20872128, 'steps': 108708, 'loss/train': 1.552451491355896} 11/07/2021 12:32:09 - INFO - __main__ - Step 108710: {'lr': 9.003555202850478e-05, 'samples': 20872320, 'steps': 108709, 'loss/train': 1.3788211345672607} 11/07/2021 12:32:09 - INFO - __main__ - Step 108711: {'lr': 9.003147386688163e-05, 'samples': 20872512, 'steps': 108710, 'loss/train': 1.2703044414520264} 11/07/2021 12:32:10 - INFO - __main__ - Step 108712: {'lr': 9.002739577733782e-05, 'samples': 20872704, 'steps': 108711, 'loss/train': 1.8312374353408813} 11/07/2021 12:32:10 - INFO - __main__ - Step 108713: {'lr': 9.002331775987522e-05, 'samples': 20872896, 'steps': 108712, 'loss/train': 1.2688816785812378} 11/07/2021 12:32:11 - INFO - __main__ - Step 108714: {'lr': 9.00192398144956e-05, 'samples': 20873088, 'steps': 108713, 'loss/train': 1.3340353965759277} 11/07/2021 12:32:11 - INFO - __main__ - Step 108715: {'lr': 9.001516194120088e-05, 'samples': 20873280, 'steps': 108714, 'loss/train': 1.1483741998672485} 11/07/2021 12:32:12 - INFO - __main__ - Step 108716: {'lr': 9.001108413999287e-05, 'samples': 20873472, 'steps': 108715, 'loss/train': 0.9533692002296448} 11/07/2021 12:32:12 - INFO - __main__ - Step 108717: {'lr': 9.000700641087336e-05, 'samples': 20873664, 'steps': 108716, 'loss/train': 1.4536328315734863} 11/07/2021 12:32:12 - INFO - __main__ - Step 108718: {'lr': 9.000292875384425e-05, 'samples': 20873856, 'steps': 108717, 'loss/train': 1.0209496021270752} 11/07/2021 12:32:13 - INFO - __main__ - Step 108719: {'lr': 8.999885116890744e-05, 'samples': 20874048, 'steps': 108718, 'loss/train': 1.4960006475448608} 11/07/2021 12:32:14 - INFO - __main__ - Step 108720: {'lr': 8.999477365606457e-05, 'samples': 20874240, 'steps': 108719, 'loss/train': 0.9767176508903503} 11/07/2021 12:32:14 - INFO - __main__ - Step 108721: {'lr': 8.999069621531761e-05, 'samples': 20874432, 'steps': 108720, 'loss/train': 1.064178228378296} 11/07/2021 12:32:14 - INFO - __main__ - Step 108722: {'lr': 8.998661884666837e-05, 'samples': 20874624, 'steps': 108721, 'loss/train': 1.3570002317428589} 11/07/2021 12:32:15 - INFO - __main__ - Step 108723: {'lr': 8.998254155011868e-05, 'samples': 20874816, 'steps': 108722, 'loss/train': 1.3016881942749023} 11/07/2021 12:32:15 - INFO - __main__ - Step 108724: {'lr': 8.997846432567039e-05, 'samples': 20875008, 'steps': 108723, 'loss/train': 4.966608047485352} 11/07/2021 12:32:16 - INFO - __main__ - Step 108725: {'lr': 8.997438717332532e-05, 'samples': 20875200, 'steps': 108724, 'loss/train': 1.4937459230422974} 11/07/2021 12:32:17 - INFO - __main__ - Step 108726: {'lr': 8.997031009308535e-05, 'samples': 20875392, 'steps': 108725, 'loss/train': 1.311858057975769} 11/07/2021 12:32:17 - INFO - __main__ - Step 108727: {'lr': 8.996623308495227e-05, 'samples': 20875584, 'steps': 108726, 'loss/train': 0.8737362623214722} 11/07/2021 12:32:17 - INFO - __main__ - Step 108728: {'lr': 8.996215614892794e-05, 'samples': 20875776, 'steps': 108727, 'loss/train': 0.948207437992096} 11/07/2021 12:32:18 - INFO - __main__ - Step 108729: {'lr': 8.99580792850142e-05, 'samples': 20875968, 'steps': 108728, 'loss/train': 1.4748897552490234} 11/07/2021 12:32:18 - INFO - __main__ - Step 108730: {'lr': 8.995400249321287e-05, 'samples': 20876160, 'steps': 108729, 'loss/train': 1.6605799198150635} 11/07/2021 12:32:19 - INFO - __main__ - Step 108731: {'lr': 8.994992577352582e-05, 'samples': 20876352, 'steps': 108730, 'loss/train': 0.8911291360855103} 11/07/2021 12:32:19 - INFO - __main__ - Step 108732: {'lr': 8.994584912595483e-05, 'samples': 20876544, 'steps': 108731, 'loss/train': 1.3168466091156006} 11/07/2021 12:32:20 - INFO - __main__ - Step 108733: {'lr': 8.994177255050187e-05, 'samples': 20876736, 'steps': 108732, 'loss/train': 1.6324440240859985} 11/07/2021 12:32:20 - INFO - __main__ - Step 108734: {'lr': 8.993769604716858e-05, 'samples': 20876928, 'steps': 108733, 'loss/train': 1.416268229484558} 11/07/2021 12:32:20 - INFO - __main__ - Step 108735: {'lr': 8.993361961595691e-05, 'samples': 20877120, 'steps': 108734, 'loss/train': 1.3303204774856567} 11/07/2021 12:32:22 - INFO - __main__ - Step 108736: {'lr': 8.99295432568687e-05, 'samples': 20877312, 'steps': 108735, 'loss/train': 1.240810751914978} 11/07/2021 12:32:22 - INFO - __main__ - Step 108737: {'lr': 8.992546696990575e-05, 'samples': 20877504, 'steps': 108736, 'loss/train': 0.9461471438407898} 11/07/2021 12:32:22 - INFO - __main__ - Step 108738: {'lr': 8.992139075506988e-05, 'samples': 20877696, 'steps': 108737, 'loss/train': 1.4326937198638916} 11/07/2021 12:32:23 - INFO - __main__ - Step 108739: {'lr': 8.991731461236302e-05, 'samples': 20877888, 'steps': 108738, 'loss/train': 1.5388985872268677} 11/07/2021 12:32:23 - INFO - __main__ - Step 108740: {'lr': 8.99132385417869e-05, 'samples': 20878080, 'steps': 108739, 'loss/train': 1.2448748350143433} 11/07/2021 12:32:24 - INFO - __main__ - Step 108741: {'lr': 8.990916254334345e-05, 'samples': 20878272, 'steps': 108740, 'loss/train': 1.1719425916671753} 11/07/2021 12:32:24 - INFO - __main__ - Step 108742: {'lr': 8.990508661703441e-05, 'samples': 20878464, 'steps': 108741, 'loss/train': 0.6448035836219788} 11/07/2021 12:32:25 - INFO - __main__ - Step 108743: {'lr': 8.990101076286169e-05, 'samples': 20878656, 'steps': 108742, 'loss/train': 1.240567684173584} 11/07/2021 12:32:25 - INFO - __main__ - Step 108744: {'lr': 8.989693498082713e-05, 'samples': 20878848, 'steps': 108743, 'loss/train': 1.3640040159225464} 11/07/2021 12:32:25 - INFO - __main__ - Step 108745: {'lr': 8.98928592709325e-05, 'samples': 20879040, 'steps': 108744, 'loss/train': 1.1583319902420044} 11/07/2021 12:32:26 - INFO - __main__ - Step 108746: {'lr': 8.98887836331798e-05, 'samples': 20879232, 'steps': 108745, 'loss/train': 1.2746026515960693} 11/07/2021 12:32:27 - INFO - __main__ - Step 108747: {'lr': 8.988470806757062e-05, 'samples': 20879424, 'steps': 108746, 'loss/train': 1.4458836317062378} 11/07/2021 12:32:27 - INFO - __main__ - Step 108748: {'lr': 8.988063257410695e-05, 'samples': 20879616, 'steps': 108747, 'loss/train': 1.6974791288375854} 11/07/2021 12:32:28 - INFO - __main__ - Step 108749: {'lr': 8.987655715279058e-05, 'samples': 20879808, 'steps': 108748, 'loss/train': 1.1554020643234253} 11/07/2021 12:32:28 - INFO - __main__ - Step 108750: {'lr': 8.987248180362337e-05, 'samples': 20880000, 'steps': 108749, 'loss/train': 1.0746513605117798} 11/07/2021 12:32:29 - INFO - __main__ - Step 108751: {'lr': 8.986840652660714e-05, 'samples': 20880192, 'steps': 108750, 'loss/train': 0.8845254778862} 11/07/2021 12:32:29 - INFO - __main__ - Step 108752: {'lr': 8.986433132174374e-05, 'samples': 20880384, 'steps': 108751, 'loss/train': 1.1511764526367188} 11/07/2021 12:32:30 - INFO - __main__ - Step 108753: {'lr': 8.986025618903499e-05, 'samples': 20880576, 'steps': 108752, 'loss/train': 1.3499183654785156} 11/07/2021 12:32:30 - INFO - __main__ - Step 108754: {'lr': 8.985618112848277e-05, 'samples': 20880768, 'steps': 108753, 'loss/train': 1.0195846557617188} 11/07/2021 12:32:30 - INFO - __main__ - Step 108755: {'lr': 8.985210614008884e-05, 'samples': 20880960, 'steps': 108754, 'loss/train': 1.41270112991333} 11/07/2021 12:32:31 - INFO - __main__ - Step 108756: {'lr': 8.98480312238551e-05, 'samples': 20881152, 'steps': 108755, 'loss/train': 1.3448880910873413} 11/07/2021 12:32:32 - INFO - __main__ - Step 108757: {'lr': 8.984395637978338e-05, 'samples': 20881344, 'steps': 108756, 'loss/train': 1.7164772748947144} 11/07/2021 12:32:32 - INFO - __main__ - Step 108758: {'lr': 8.983988160787548e-05, 'samples': 20881536, 'steps': 108757, 'loss/train': 1.2469079494476318} 11/07/2021 12:32:32 - INFO - __main__ - Step 108759: {'lr': 8.983580690813328e-05, 'samples': 20881728, 'steps': 108758, 'loss/train': 0.5312018990516663} 11/07/2021 12:32:33 - INFO - __main__ - Step 108760: {'lr': 8.983173228055866e-05, 'samples': 20881920, 'steps': 108759, 'loss/train': 1.4435032606124878} 11/07/2021 12:32:33 - INFO - __main__ - Step 108761: {'lr': 8.98276577251533e-05, 'samples': 20882112, 'steps': 108760, 'loss/train': 1.222772240638733} 11/07/2021 12:32:34 - INFO - __main__ - Step 108762: {'lr': 8.982358324191917e-05, 'samples': 20882304, 'steps': 108761, 'loss/train': 1.1395162343978882} 11/07/2021 12:32:35 - INFO - __main__ - Step 108763: {'lr': 8.981950883085801e-05, 'samples': 20882496, 'steps': 108762, 'loss/train': 1.0238348245620728} 11/07/2021 12:32:35 - INFO - __main__ - Step 108764: {'lr': 8.981543449197172e-05, 'samples': 20882688, 'steps': 108763, 'loss/train': 1.3085795640945435} 11/07/2021 12:32:35 - INFO - __main__ - Step 108765: {'lr': 8.981136022526215e-05, 'samples': 20882880, 'steps': 108764, 'loss/train': 1.7534111738204956} 11/07/2021 12:32:36 - INFO - __main__ - Step 108766: {'lr': 8.980728603073107e-05, 'samples': 20883072, 'steps': 108765, 'loss/train': 1.0223568677902222} 11/07/2021 12:32:36 - INFO - __main__ - Step 108767: {'lr': 8.980321190838039e-05, 'samples': 20883264, 'steps': 108766, 'loss/train': 1.7779386043548584} 11/07/2021 12:32:37 - INFO - __main__ - Step 108768: {'lr': 8.979913785821189e-05, 'samples': 20883456, 'steps': 108767, 'loss/train': 1.3030169010162354} 11/07/2021 12:32:38 - INFO - __main__ - Step 108769: {'lr': 8.979506388022743e-05, 'samples': 20883648, 'steps': 108768, 'loss/train': 1.3654170036315918} 11/07/2021 12:32:38 - INFO - __main__ - Step 108770: {'lr': 8.979098997442883e-05, 'samples': 20883840, 'steps': 108769, 'loss/train': 1.4453333616256714} 11/07/2021 12:32:38 - INFO - __main__ - Step 108771: {'lr': 8.978691614081796e-05, 'samples': 20884032, 'steps': 108770, 'loss/train': 1.2013474702835083} 11/07/2021 12:32:39 - INFO - __main__ - Step 108772: {'lr': 8.978284237939663e-05, 'samples': 20884224, 'steps': 108771, 'loss/train': 1.166913628578186} 11/07/2021 12:32:40 - INFO - __main__ - Step 108773: {'lr': 8.977876869016677e-05, 'samples': 20884416, 'steps': 108772, 'loss/train': 0.5917485356330872} 11/07/2021 12:32:40 - INFO - __main__ - Step 108774: {'lr': 8.977469507313002e-05, 'samples': 20884608, 'steps': 108773, 'loss/train': 1.0242139101028442} 11/07/2021 12:32:40 - INFO - __main__ - Step 108775: {'lr': 8.977062152828832e-05, 'samples': 20884800, 'steps': 108774, 'loss/train': 1.6884987354278564} 11/07/2021 12:32:41 - INFO - __main__ - Step 108776: {'lr': 8.976654805564352e-05, 'samples': 20884992, 'steps': 108775, 'loss/train': 1.265694499015808} 11/07/2021 12:32:41 - INFO - __main__ - Step 108777: {'lr': 8.976247465519743e-05, 'samples': 20885184, 'steps': 108776, 'loss/train': 0.715874969959259} 11/07/2021 12:32:42 - INFO - __main__ - Step 108778: {'lr': 8.97584013269519e-05, 'samples': 20885376, 'steps': 108777, 'loss/train': 1.3864920139312744} 11/07/2021 12:32:43 - INFO - __main__ - Step 108779: {'lr': 8.975432807090877e-05, 'samples': 20885568, 'steps': 108778, 'loss/train': 1.4766509532928467} 11/07/2021 12:32:43 - INFO - __main__ - Step 108780: {'lr': 8.975025488706986e-05, 'samples': 20885760, 'steps': 108779, 'loss/train': 1.6468812227249146} 11/07/2021 12:32:43 - INFO - __main__ - Step 108781: {'lr': 8.9746181775437e-05, 'samples': 20885952, 'steps': 108780, 'loss/train': 1.7753329277038574} 11/07/2021 12:32:44 - INFO - __main__ - Step 108782: {'lr': 8.974210873601205e-05, 'samples': 20886144, 'steps': 108781, 'loss/train': 1.3782110214233398} 11/07/2021 12:32:45 - INFO - __main__ - Step 108783: {'lr': 8.973803576879683e-05, 'samples': 20886336, 'steps': 108782, 'loss/train': 1.4836374521255493} 11/07/2021 12:32:45 - INFO - __main__ - Step 108784: {'lr': 8.973396287379318e-05, 'samples': 20886528, 'steps': 108783, 'loss/train': 0.6165769100189209} 11/07/2021 12:32:45 - INFO - __main__ - Step 108785: {'lr': 8.972989005100293e-05, 'samples': 20886720, 'steps': 108784, 'loss/train': 1.4369585514068604} 11/07/2021 12:32:46 - INFO - __main__ - Step 108786: {'lr': 8.972581730042792e-05, 'samples': 20886912, 'steps': 108785, 'loss/train': 1.2934714555740356} 11/07/2021 12:32:46 - INFO - __main__ - Step 108787: {'lr': 8.972174462207009e-05, 'samples': 20887104, 'steps': 108786, 'loss/train': 1.3284767866134644} 11/07/2021 12:32:46 - INFO - __main__ - Step 108788: {'lr': 8.971767201593106e-05, 'samples': 20887296, 'steps': 108787, 'loss/train': 0.5547190308570862} 11/07/2021 12:32:47 - INFO - __main__ - Step 108789: {'lr': 8.971359948201276e-05, 'samples': 20887488, 'steps': 108788, 'loss/train': 1.5562490224838257} 11/07/2021 12:32:48 - INFO - __main__ - Step 108790: {'lr': 8.970952702031707e-05, 'samples': 20887680, 'steps': 108789, 'loss/train': 1.2066868543624878} 11/07/2021 12:32:48 - INFO - __main__ - Step 108791: {'lr': 8.970545463084578e-05, 'samples': 20887872, 'steps': 108790, 'loss/train': 0.15270966291427612} 11/07/2021 12:32:48 - INFO - __main__ - Step 108792: {'lr': 8.970138231360075e-05, 'samples': 20888064, 'steps': 108791, 'loss/train': 1.0251471996307373} 11/07/2021 12:32:49 - INFO - __main__ - Step 108793: {'lr': 8.969731006858378e-05, 'samples': 20888256, 'steps': 108792, 'loss/train': 1.53164541721344} 11/07/2021 12:32:50 - INFO - __main__ - Step 108794: {'lr': 8.969323789579675e-05, 'samples': 20888448, 'steps': 108793, 'loss/train': 0.9951635003089905} 11/07/2021 12:32:50 - INFO - __main__ - Step 108795: {'lr': 8.968916579524147e-05, 'samples': 20888640, 'steps': 108794, 'loss/train': 1.6505744457244873} 11/07/2021 12:32:51 - INFO - __main__ - Step 108796: {'lr': 8.968509376691977e-05, 'samples': 20888832, 'steps': 108795, 'loss/train': 1.449545979499817} 11/07/2021 12:32:51 - INFO - __main__ - Step 108797: {'lr': 8.968102181083349e-05, 'samples': 20889024, 'steps': 108796, 'loss/train': 1.2805382013320923} 11/07/2021 12:32:51 - INFO - __main__ - Step 108798: {'lr': 8.967694992698447e-05, 'samples': 20889216, 'steps': 108797, 'loss/train': 1.364295482635498} 11/07/2021 12:32:52 - INFO - __main__ - Step 108799: {'lr': 8.967287811537455e-05, 'samples': 20889408, 'steps': 108798, 'loss/train': 0.8187587857246399} 11/07/2021 12:32:53 - INFO - __main__ - Step 108800: {'lr': 8.966880637600564e-05, 'samples': 20889600, 'steps': 108799, 'loss/train': 1.0179036855697632} 11/07/2021 12:32:53 - INFO - __main__ - Step 108801: {'lr': 8.96647347088794e-05, 'samples': 20889792, 'steps': 108800, 'loss/train': 1.3411073684692383} 11/07/2021 12:32:53 - INFO - __main__ - Step 108802: {'lr': 8.966066311399776e-05, 'samples': 20889984, 'steps': 108801, 'loss/train': 1.3675544261932373} 11/07/2021 12:32:54 - INFO - __main__ - Step 108803: {'lr': 8.965659159136255e-05, 'samples': 20890176, 'steps': 108802, 'loss/train': 1.6433792114257812} 11/07/2021 12:32:54 - INFO - __main__ - Step 108804: {'lr': 8.965252014097561e-05, 'samples': 20890368, 'steps': 108803, 'loss/train': 1.3117694854736328} 11/07/2021 12:32:56 - INFO - __main__ - Step 108805: {'lr': 8.964844876283876e-05, 'samples': 20890560, 'steps': 108804, 'loss/train': 1.2280621528625488} 11/07/2021 12:32:56 - INFO - __main__ - Step 108806: {'lr': 8.964437745695386e-05, 'samples': 20890752, 'steps': 108805, 'loss/train': 1.2575349807739258} 11/07/2021 12:32:56 - INFO - __main__ - Step 108807: {'lr': 8.964030622332273e-05, 'samples': 20890944, 'steps': 108806, 'loss/train': 1.3028138875961304} 11/07/2021 12:32:57 - INFO - __main__ - Step 108808: {'lr': 8.963623506194718e-05, 'samples': 20891136, 'steps': 108807, 'loss/train': 1.5043867826461792} 11/07/2021 12:32:57 - INFO - __main__ - Step 108809: {'lr': 8.963216397282909e-05, 'samples': 20891328, 'steps': 108808, 'loss/train': 0.6471688151359558} 11/07/2021 12:32:58 - INFO - __main__ - Step 108810: {'lr': 8.962809295597028e-05, 'samples': 20891520, 'steps': 108809, 'loss/train': 0.7812075018882751} 11/07/2021 12:32:58 - INFO - __main__ - Step 108811: {'lr': 8.962402201137254e-05, 'samples': 20891712, 'steps': 108810, 'loss/train': 1.3874835968017578} 11/07/2021 12:32:59 - INFO - __main__ - Step 108812: {'lr': 8.961995113903775e-05, 'samples': 20891904, 'steps': 108811, 'loss/train': 1.6512616872787476} 11/07/2021 12:32:59 - INFO - __main__ - Step 108813: {'lr': 8.961588033896784e-05, 'samples': 20892096, 'steps': 108812, 'loss/train': 1.4083657264709473} 11/07/2021 12:32:59 - INFO - __main__ - Step 108814: {'lr': 8.961180961116447e-05, 'samples': 20892288, 'steps': 108813, 'loss/train': 0.7298242449760437} 11/07/2021 12:33:00 - INFO - __main__ - Step 108815: {'lr': 8.960773895562951e-05, 'samples': 20892480, 'steps': 108814, 'loss/train': 1.3194847106933594} 11/07/2021 12:33:01 - INFO - __main__ - Step 108816: {'lr': 8.960366837236483e-05, 'samples': 20892672, 'steps': 108815, 'loss/train': 1.7502434253692627} 11/07/2021 12:33:01 - INFO - __main__ - Step 108817: {'lr': 8.959959786137229e-05, 'samples': 20892864, 'steps': 108816, 'loss/train': 1.6353001594543457} 11/07/2021 12:33:01 - INFO - __main__ - Step 108818: {'lr': 8.959552742265367e-05, 'samples': 20893056, 'steps': 108817, 'loss/train': 1.4027066230773926} 11/07/2021 12:33:02 - INFO - __main__ - Step 108819: {'lr': 8.959145705621083e-05, 'samples': 20893248, 'steps': 108818, 'loss/train': 1.2222315073013306} 11/07/2021 12:33:03 - INFO - __main__ - Step 108820: {'lr': 8.958738676204562e-05, 'samples': 20893440, 'steps': 108819, 'loss/train': 0.843715488910675} 11/07/2021 12:33:03 - INFO - __main__ - Step 108821: {'lr': 8.958331654015983e-05, 'samples': 20893632, 'steps': 108820, 'loss/train': 1.1815199851989746} 11/07/2021 12:33:04 - INFO - __main__ - Step 108822: {'lr': 8.957924639055534e-05, 'samples': 20893824, 'steps': 108821, 'loss/train': 1.3726756572723389} 11/07/2021 12:33:04 - INFO - __main__ - Step 108823: {'lr': 8.957517631323397e-05, 'samples': 20894016, 'steps': 108822, 'loss/train': 1.2697243690490723} 11/07/2021 12:33:04 - INFO - __main__ - Step 108824: {'lr': 8.957110630819757e-05, 'samples': 20894208, 'steps': 108823, 'loss/train': 1.3719861507415771} 11/07/2021 12:33:05 - INFO - __main__ - Step 108825: {'lr': 8.9567036375448e-05, 'samples': 20894400, 'steps': 108824, 'loss/train': 1.3298048973083496} 11/07/2021 12:33:06 - INFO - __main__ - Step 108826: {'lr': 8.956296651498699e-05, 'samples': 20894592, 'steps': 108825, 'loss/train': 1.3959507942199707} 11/07/2021 12:33:06 - INFO - __main__ - Step 108827: {'lr': 8.955889672681642e-05, 'samples': 20894784, 'steps': 108826, 'loss/train': 1.7618767023086548} 11/07/2021 12:33:06 - INFO - __main__ - Step 108828: {'lr': 8.955482701093811e-05, 'samples': 20894976, 'steps': 108827, 'loss/train': 1.1446741819381714} 11/07/2021 12:33:07 - INFO - __main__ - Step 108829: {'lr': 8.955075736735397e-05, 'samples': 20895168, 'steps': 108828, 'loss/train': 1.8734077215194702} 11/07/2021 12:33:08 - INFO - __main__ - Step 108830: {'lr': 8.954668779606576e-05, 'samples': 20895360, 'steps': 108829, 'loss/train': 1.6865078210830688} 11/07/2021 12:33:08 - INFO - __main__ - Step 108831: {'lr': 8.954261829707533e-05, 'samples': 20895552, 'steps': 108830, 'loss/train': 1.2920795679092407} 11/07/2021 12:33:08 - INFO - __main__ - Step 108832: {'lr': 8.953854887038451e-05, 'samples': 20895744, 'steps': 108831, 'loss/train': 1.1880912780761719} 11/07/2021 12:33:09 - INFO - __main__ - Step 108833: {'lr': 8.953447951599516e-05, 'samples': 20895936, 'steps': 108832, 'loss/train': 1.550989031791687} 11/07/2021 12:33:09 - INFO - __main__ - Step 108834: {'lr': 8.953041023390912e-05, 'samples': 20896128, 'steps': 108833, 'loss/train': 1.2559236288070679} 11/07/2021 12:33:10 - INFO - __main__ - Step 108835: {'lr': 8.952634102412815e-05, 'samples': 20896320, 'steps': 108834, 'loss/train': 1.1624895334243774} 11/07/2021 12:33:11 - INFO - __main__ - Step 108836: {'lr': 8.952227188665426e-05, 'samples': 20896512, 'steps': 108835, 'loss/train': 1.3946839570999146} 11/07/2021 12:33:11 - INFO - __main__ - Step 108837: {'lr': 8.951820282148906e-05, 'samples': 20896704, 'steps': 108836, 'loss/train': 1.1851813793182373} 11/07/2021 12:33:11 - INFO - __main__ - Step 108838: {'lr': 8.951413382863449e-05, 'samples': 20896896, 'steps': 108837, 'loss/train': 0.8518271446228027} 11/07/2021 12:33:12 - INFO - __main__ - Step 108839: {'lr': 8.951006490809236e-05, 'samples': 20897088, 'steps': 108838, 'loss/train': 1.3350194692611694} 11/07/2021 12:33:12 - INFO - __main__ - Step 108840: {'lr': 8.95059960598645e-05, 'samples': 20897280, 'steps': 108839, 'loss/train': 1.080790638923645} 11/07/2021 12:33:13 - INFO - __main__ - Step 108841: {'lr': 8.950192728395281e-05, 'samples': 20897472, 'steps': 108840, 'loss/train': 1.402274489402771} 11/07/2021 12:33:13 - INFO - __main__ - Step 108842: {'lr': 8.949785858035906e-05, 'samples': 20897664, 'steps': 108841, 'loss/train': 1.080141544342041} 11/07/2021 12:33:14 - INFO - __main__ - Step 108843: {'lr': 8.949378994908509e-05, 'samples': 20897856, 'steps': 108842, 'loss/train': 1.2970845699310303} 11/07/2021 12:33:14 - INFO - __main__ - Step 108844: {'lr': 8.948972139013273e-05, 'samples': 20898048, 'steps': 108843, 'loss/train': 1.1216533184051514} 11/07/2021 12:33:14 - INFO - __main__ - Step 108845: {'lr': 8.948565290350383e-05, 'samples': 20898240, 'steps': 108844, 'loss/train': 0.9293527603149414} 11/07/2021 12:33:16 - INFO - __main__ - Step 108846: {'lr': 8.948158448920021e-05, 'samples': 20898432, 'steps': 108845, 'loss/train': 1.1717374324798584} 11/07/2021 12:33:16 - INFO - __main__ - Step 108847: {'lr': 8.947751614722382e-05, 'samples': 20898624, 'steps': 108846, 'loss/train': 1.4918714761734009} 11/07/2021 12:33:16 - INFO - __main__ - Step 108848: {'lr': 8.94734478775763e-05, 'samples': 20898816, 'steps': 108847, 'loss/train': 0.9477084875106812} 11/07/2021 12:33:17 - INFO - __main__ - Step 108849: {'lr': 8.946937968025956e-05, 'samples': 20899008, 'steps': 108848, 'loss/train': 1.0707756280899048} 11/07/2021 12:33:17 - INFO - __main__ - Step 108850: {'lr': 8.946531155527543e-05, 'samples': 20899200, 'steps': 108849, 'loss/train': 1.271299123764038} 11/07/2021 12:33:18 - INFO - __main__ - Step 108851: {'lr': 8.946124350262574e-05, 'samples': 20899392, 'steps': 108850, 'loss/train': 1.156221628189087} 11/07/2021 12:33:18 - INFO - __main__ - Step 108852: {'lr': 8.945717552231236e-05, 'samples': 20899584, 'steps': 108851, 'loss/train': 1.2264293432235718} 11/07/2021 12:33:19 - INFO - __main__ - Step 108853: {'lr': 8.945310761433712e-05, 'samples': 20899776, 'steps': 108852, 'loss/train': 1.588697075843811} 11/07/2021 12:33:19 - INFO - __main__ - Step 108854: {'lr': 8.944903977870178e-05, 'samples': 20899968, 'steps': 108853, 'loss/train': 1.1584833860397339} 11/07/2021 12:33:19 - INFO - __main__ - Step 108855: {'lr': 8.944497201540827e-05, 'samples': 20900160, 'steps': 108854, 'loss/train': 1.1359058618545532} 11/07/2021 12:33:20 - INFO - __main__ - Step 108856: {'lr': 8.944090432445837e-05, 'samples': 20900352, 'steps': 108855, 'loss/train': 0.7882573008537292} 11/07/2021 12:33:21 - INFO - __main__ - Step 108857: {'lr': 8.94368367058539e-05, 'samples': 20900544, 'steps': 108856, 'loss/train': 1.7593028545379639} 11/07/2021 12:33:21 - INFO - __main__ - Step 108858: {'lr': 8.94327691595968e-05, 'samples': 20900736, 'steps': 108857, 'loss/train': 1.5679035186767578} 11/07/2021 12:33:21 - INFO - __main__ - Step 108859: {'lr': 8.942870168568876e-05, 'samples': 20900928, 'steps': 108858, 'loss/train': 1.223501443862915} 11/07/2021 12:33:22 - INFO - __main__ - Step 108860: {'lr': 8.942463428413164e-05, 'samples': 20901120, 'steps': 108859, 'loss/train': 1.3726317882537842} 11/07/2021 12:33:23 - INFO - __main__ - Step 108861: {'lr': 8.942056695492731e-05, 'samples': 20901312, 'steps': 108860, 'loss/train': 1.5308239459991455} 11/07/2021 12:33:23 - INFO - __main__ - Step 108862: {'lr': 8.94164996980776e-05, 'samples': 20901504, 'steps': 108861, 'loss/train': 0.7976707220077515} 11/07/2021 12:33:23 - INFO - __main__ - Step 108863: {'lr': 8.941243251358433e-05, 'samples': 20901696, 'steps': 108862, 'loss/train': 1.3160842657089233} 11/07/2021 12:33:24 - INFO - __main__ - Step 108864: {'lr': 8.940836540144937e-05, 'samples': 20901888, 'steps': 108863, 'loss/train': 1.1886085271835327} 11/07/2021 12:33:24 - INFO - __main__ - Step 108865: {'lr': 8.940429836167449e-05, 'samples': 20902080, 'steps': 108864, 'loss/train': 1.5136955976486206} 11/07/2021 12:33:25 - INFO - __main__ - Step 108866: {'lr': 8.940023139426157e-05, 'samples': 20902272, 'steps': 108865, 'loss/train': 1.3926095962524414} 11/07/2021 12:33:26 - INFO - __main__ - Step 108867: {'lr': 8.93961644992124e-05, 'samples': 20902464, 'steps': 108866, 'loss/train': 0.766731321811676} 11/07/2021 12:33:26 - INFO - __main__ - Step 108868: {'lr': 8.939209767652887e-05, 'samples': 20902656, 'steps': 108867, 'loss/train': 1.1244800090789795} 11/07/2021 12:33:26 - INFO - __main__ - Step 108869: {'lr': 8.938803092621284e-05, 'samples': 20902848, 'steps': 108868, 'loss/train': 1.7706718444824219} 11/07/2021 12:33:27 - INFO - __main__ - Step 108870: {'lr': 8.938396424826603e-05, 'samples': 20903040, 'steps': 108869, 'loss/train': 1.6539726257324219} 11/07/2021 12:33:27 - INFO - __main__ - Step 108871: {'lr': 8.937989764269031e-05, 'samples': 20903232, 'steps': 108870, 'loss/train': 1.2512472867965698} 11/07/2021 12:33:28 - INFO - __main__ - Step 108872: {'lr': 8.937583110948755e-05, 'samples': 20903424, 'steps': 108871, 'loss/train': 0.4847485423088074} 11/07/2021 12:33:29 - INFO - __main__ - Step 108873: {'lr': 8.937176464865953e-05, 'samples': 20903616, 'steps': 108872, 'loss/train': 1.116176962852478} 11/07/2021 12:33:29 - INFO - __main__ - Step 108874: {'lr': 8.936769826020813e-05, 'samples': 20903808, 'steps': 108873, 'loss/train': 0.8134695887565613} 11/07/2021 12:33:29 - INFO - __main__ - Step 108875: {'lr': 8.936363194413516e-05, 'samples': 20904000, 'steps': 108874, 'loss/train': 1.4435776472091675} 11/07/2021 12:33:30 - INFO - __main__ - Step 108876: {'lr': 8.935956570044249e-05, 'samples': 20904192, 'steps': 108875, 'loss/train': 1.2842907905578613} 11/07/2021 12:33:31 - INFO - __main__ - Step 108877: {'lr': 8.935549952913189e-05, 'samples': 20904384, 'steps': 108876, 'loss/train': 1.499633550643921} 11/07/2021 12:33:31 - INFO - __main__ - Step 108878: {'lr': 8.935143343020521e-05, 'samples': 20904576, 'steps': 108877, 'loss/train': 1.3603509664535522} 11/07/2021 12:33:31 - INFO - __main__ - Step 108879: {'lr': 8.934736740366433e-05, 'samples': 20904768, 'steps': 108878, 'loss/train': 1.124873399734497} 11/07/2021 12:33:32 - INFO - __main__ - Step 108880: {'lr': 8.934330144951103e-05, 'samples': 20904960, 'steps': 108879, 'loss/train': 1.2510411739349365} 11/07/2021 12:33:32 - INFO - __main__ - Step 108881: {'lr': 8.933923556774725e-05, 'samples': 20905152, 'steps': 108880, 'loss/train': 1.6437987089157104} 11/07/2021 12:33:33 - INFO - __main__ - Step 108882: {'lr': 8.933516975837463e-05, 'samples': 20905344, 'steps': 108881, 'loss/train': 0.9297564625740051} 11/07/2021 12:33:33 - INFO - __main__ - Step 108883: {'lr': 8.933110402139514e-05, 'samples': 20905536, 'steps': 108882, 'loss/train': 1.1940096616744995} 11/07/2021 12:33:34 - INFO - __main__ - Step 108884: {'lr': 8.932703835681052e-05, 'samples': 20905728, 'steps': 108883, 'loss/train': 1.1922600269317627} 11/07/2021 12:33:34 - INFO - __main__ - Step 108885: {'lr': 8.93229727646227e-05, 'samples': 20905920, 'steps': 108884, 'loss/train': 1.3201568126678467} 11/07/2021 12:33:34 - INFO - __main__ - Step 108886: {'lr': 8.931890724483346e-05, 'samples': 20906112, 'steps': 108885, 'loss/train': 0.7878607511520386} 11/07/2021 12:33:36 - INFO - __main__ - Step 108887: {'lr': 8.931484179744465e-05, 'samples': 20906304, 'steps': 108886, 'loss/train': 1.0301522016525269} 11/07/2021 12:33:36 - INFO - __main__ - Step 108888: {'lr': 8.931077642245808e-05, 'samples': 20906496, 'steps': 108887, 'loss/train': 1.698082447052002} 11/07/2021 12:33:37 - INFO - __main__ - Step 108889: {'lr': 8.930671111987559e-05, 'samples': 20906688, 'steps': 108888, 'loss/train': 1.2709590196609497} 11/07/2021 12:33:37 - INFO - __main__ - Step 108890: {'lr': 8.930264588969903e-05, 'samples': 20906880, 'steps': 108889, 'loss/train': 1.208675742149353} 11/07/2021 12:33:37 - INFO - __main__ - Step 108891: {'lr': 8.929858073193021e-05, 'samples': 20907072, 'steps': 108890, 'loss/train': 0.09368780255317688} 11/07/2021 12:33:39 - INFO - __main__ - Step 108892: {'lr': 8.929451564657095e-05, 'samples': 20907264, 'steps': 108891, 'loss/train': 1.5180275440216064} 11/07/2021 12:33:39 - INFO - __main__ - Step 108893: {'lr': 8.929045063362312e-05, 'samples': 20907456, 'steps': 108892, 'loss/train': 1.4741771221160889} 11/07/2021 12:33:39 - INFO - __main__ - Step 108894: {'lr': 8.928638569308862e-05, 'samples': 20907648, 'steps': 108893, 'loss/train': 1.3321623802185059} 11/07/2021 12:33:40 - INFO - __main__ - Step 108895: {'lr': 8.928232082496912e-05, 'samples': 20907840, 'steps': 108894, 'loss/train': 1.7946687936782837} 11/07/2021 12:33:40 - INFO - __main__ - Step 108896: {'lr': 8.927825602926651e-05, 'samples': 20908032, 'steps': 108895, 'loss/train': 1.357767939567566} 11/07/2021 12:33:41 - INFO - __main__ - Step 108897: {'lr': 8.927419130598264e-05, 'samples': 20908224, 'steps': 108896, 'loss/train': 1.260075330734253} 11/07/2021 12:33:41 - INFO - __main__ - Step 108898: {'lr': 8.927012665511933e-05, 'samples': 20908416, 'steps': 108897, 'loss/train': 1.5195716619491577} 11/07/2021 12:33:42 - INFO - __main__ - Step 108899: {'lr': 8.926606207667846e-05, 'samples': 20908608, 'steps': 108898, 'loss/train': 1.7424571514129639} 11/07/2021 12:33:42 - INFO - __main__ - Step 108900: {'lr': 8.926199757066178e-05, 'samples': 20908800, 'steps': 108899, 'loss/train': 1.5377579927444458} 11/07/2021 12:33:42 - INFO - __main__ - Step 108901: {'lr': 8.925793313707117e-05, 'samples': 20908992, 'steps': 108900, 'loss/train': 1.5439618825912476} 11/07/2021 12:33:44 - INFO - __main__ - Step 108902: {'lr': 8.925386877590847e-05, 'samples': 20909184, 'steps': 108901, 'loss/train': 0.9095390439033508} 11/07/2021 12:33:44 - INFO - __main__ - Step 108903: {'lr': 8.924980448717548e-05, 'samples': 20909376, 'steps': 108902, 'loss/train': 1.349973201751709} 11/07/2021 12:33:44 - INFO - __main__ - Step 108904: {'lr': 8.924574027087406e-05, 'samples': 20909568, 'steps': 108903, 'loss/train': 1.3283687829971313} 11/07/2021 12:33:45 - INFO - __main__ - Step 108905: {'lr': 8.924167612700604e-05, 'samples': 20909760, 'steps': 108904, 'loss/train': 1.3927726745605469} 11/07/2021 12:33:45 - INFO - __main__ - Step 108906: {'lr': 8.92376120555732e-05, 'samples': 20909952, 'steps': 108905, 'loss/train': 1.0580302476882935} 11/07/2021 12:33:45 - INFO - __main__ - Step 108907: {'lr': 8.923354805657746e-05, 'samples': 20910144, 'steps': 108906, 'loss/train': 1.278235912322998} 11/07/2021 12:33:46 - INFO - __main__ - Step 108908: {'lr': 8.922948413002065e-05, 'samples': 20910336, 'steps': 108907, 'loss/train': 0.8244919180870056} 11/07/2021 12:33:47 - INFO - __main__ - Step 108909: {'lr': 8.922542027590449e-05, 'samples': 20910528, 'steps': 108908, 'loss/train': 1.090295672416687} 11/07/2021 12:33:47 - INFO - __main__ - Step 108910: {'lr': 8.922135649423088e-05, 'samples': 20910720, 'steps': 108909, 'loss/train': 1.2368175983428955} 11/07/2021 12:33:47 - INFO - __main__ - Step 108911: {'lr': 8.921729278500163e-05, 'samples': 20910912, 'steps': 108910, 'loss/train': 1.485788345336914} 11/07/2021 12:33:48 - INFO - __main__ - Step 108912: {'lr': 8.921322914821859e-05, 'samples': 20911104, 'steps': 108911, 'loss/train': 1.3357298374176025} 11/07/2021 12:33:49 - INFO - __main__ - Step 108913: {'lr': 8.920916558388359e-05, 'samples': 20911296, 'steps': 108912, 'loss/train': 1.4206582307815552} 11/07/2021 12:33:49 - INFO - __main__ - Step 108914: {'lr': 8.920510209199844e-05, 'samples': 20911488, 'steps': 108913, 'loss/train': 1.3126007318496704} 11/07/2021 12:33:50 - INFO - __main__ - Step 108915: {'lr': 8.920103867256502e-05, 'samples': 20911680, 'steps': 108914, 'loss/train': 1.1621489524841309} 11/07/2021 12:33:50 - INFO - __main__ - Step 108916: {'lr': 8.919697532558512e-05, 'samples': 20911872, 'steps': 108915, 'loss/train': 1.376942753791809} 11/07/2021 12:33:50 - INFO - __main__ - Step 108917: {'lr': 8.91929120510606e-05, 'samples': 20912064, 'steps': 108916, 'loss/train': 1.3394838571548462} 11/07/2021 12:33:51 - INFO - __main__ - Step 108918: {'lr': 8.918884884899323e-05, 'samples': 20912256, 'steps': 108917, 'loss/train': 0.9198578596115112} 11/07/2021 12:33:52 - INFO - __main__ - Step 108919: {'lr': 8.91847857193849e-05, 'samples': 20912448, 'steps': 108918, 'loss/train': 1.1014622449874878} 11/07/2021 12:33:52 - INFO - __main__ - Step 108920: {'lr': 8.918072266223742e-05, 'samples': 20912640, 'steps': 108919, 'loss/train': 1.0487732887268066} 11/07/2021 12:33:52 - INFO - __main__ - Step 108921: {'lr': 8.917665967755272e-05, 'samples': 20912832, 'steps': 108920, 'loss/train': 1.3527615070343018} 11/07/2021 12:33:53 - INFO - __main__ - Step 108922: {'lr': 8.917259676533246e-05, 'samples': 20913024, 'steps': 108921, 'loss/train': 1.1441925764083862} 11/07/2021 12:33:54 - INFO - __main__ - Step 108923: {'lr': 8.916853392557852e-05, 'samples': 20913216, 'steps': 108922, 'loss/train': 1.810709834098816} 11/07/2021 12:33:54 - INFO - __main__ - Step 108924: {'lr': 8.916447115829279e-05, 'samples': 20913408, 'steps': 108923, 'loss/train': 1.5867706537246704} 11/07/2021 12:33:55 - INFO - __main__ - Step 108925: {'lr': 8.916040846347707e-05, 'samples': 20913600, 'steps': 108924, 'loss/train': 1.7467528581619263} 11/07/2021 12:33:55 - INFO - __main__ - Step 108926: {'lr': 8.915634584113314e-05, 'samples': 20913792, 'steps': 108925, 'loss/train': 1.3215138912200928} 11/07/2021 12:33:56 - INFO - __main__ - Step 108927: {'lr': 8.91522832912629e-05, 'samples': 20913984, 'steps': 108926, 'loss/train': 0.06597314029932022} 11/07/2021 12:33:56 - INFO - __main__ - Step 108928: {'lr': 8.914822081386817e-05, 'samples': 20914176, 'steps': 108927, 'loss/train': 1.2535985708236694} 11/07/2021 12:33:57 - INFO - __main__ - Step 108929: {'lr': 8.914415840895076e-05, 'samples': 20914368, 'steps': 108928, 'loss/train': 0.5296422243118286} 11/07/2021 12:33:57 - INFO - __main__ - Step 108930: {'lr': 8.914009607651253e-05, 'samples': 20914560, 'steps': 108929, 'loss/train': 0.8044378757476807} 11/07/2021 12:33:58 - INFO - __main__ - Step 108931: {'lr': 8.913603381655528e-05, 'samples': 20914752, 'steps': 108930, 'loss/train': 1.2945656776428223} 11/07/2021 12:33:58 - INFO - __main__ - Step 108932: {'lr': 8.913197162908085e-05, 'samples': 20914944, 'steps': 108931, 'loss/train': 0.6085963249206543} 11/07/2021 12:33:58 - INFO - __main__ - Step 108933: {'lr': 8.912790951409105e-05, 'samples': 20915136, 'steps': 108932, 'loss/train': 1.339223861694336} 11/07/2021 12:33:59 - INFO - __main__ - Step 108934: {'lr': 8.912384747158783e-05, 'samples': 20915328, 'steps': 108933, 'loss/train': 1.215396761894226} 11/07/2021 12:34:00 - INFO - __main__ - Step 108935: {'lr': 8.911978550157284e-05, 'samples': 20915520, 'steps': 108934, 'loss/train': 1.4098758697509766} 11/07/2021 12:34:00 - INFO - __main__ - Step 108936: {'lr': 8.911572360404802e-05, 'samples': 20915712, 'steps': 108935, 'loss/train': 1.2687394618988037} 11/07/2021 12:34:00 - INFO - __main__ - Step 108937: {'lr': 8.911166177901514e-05, 'samples': 20915904, 'steps': 108936, 'loss/train': 1.0392382144927979} 11/07/2021 12:34:01 - INFO - __main__ - Step 108938: {'lr': 8.910760002647605e-05, 'samples': 20916096, 'steps': 108937, 'loss/train': 0.7503860592842102} 11/07/2021 12:34:02 - INFO - __main__ - Step 108939: {'lr': 8.910353834643262e-05, 'samples': 20916288, 'steps': 108938, 'loss/train': 1.0457273721694946} 11/07/2021 12:34:02 - INFO - __main__ - Step 108940: {'lr': 8.909947673888666e-05, 'samples': 20916480, 'steps': 108939, 'loss/train': 1.2280158996582031} 11/07/2021 12:34:03 - INFO - __main__ - Step 108941: {'lr': 8.909541520383996e-05, 'samples': 20916672, 'steps': 108940, 'loss/train': 1.319599986076355} 11/07/2021 12:34:03 - INFO - __main__ - Step 108942: {'lr': 8.90913537412944e-05, 'samples': 20916864, 'steps': 108941, 'loss/train': 1.116646647453308} 11/07/2021 12:34:03 - INFO - __main__ - Step 108943: {'lr': 8.908729235125179e-05, 'samples': 20917056, 'steps': 108942, 'loss/train': 1.4462125301361084} 11/07/2021 12:34:04 - INFO - __main__ - Step 108944: {'lr': 8.908323103371396e-05, 'samples': 20917248, 'steps': 108943, 'loss/train': 0.5628167986869812} 11/07/2021 12:34:05 - INFO - __main__ - Step 108945: {'lr': 8.907916978868278e-05, 'samples': 20917440, 'steps': 108944, 'loss/train': 1.3994531631469727} 11/07/2021 12:34:05 - INFO - __main__ - Step 108946: {'lr': 8.907510861616e-05, 'samples': 20917632, 'steps': 108945, 'loss/train': 1.4934107065200806} 11/07/2021 12:34:05 - INFO - __main__ - Step 108947: {'lr': 8.90710475161475e-05, 'samples': 20917824, 'steps': 108946, 'loss/train': 1.369509220123291} 11/07/2021 12:34:06 - INFO - __main__ - Step 108948: {'lr': 8.906698648864719e-05, 'samples': 20918016, 'steps': 108947, 'loss/train': 1.779238224029541} 11/07/2021 12:34:06 - INFO - __main__ - Step 108949: {'lr': 8.906292553366072e-05, 'samples': 20918208, 'steps': 108948, 'loss/train': 1.5366965532302856} 11/07/2021 12:34:07 - INFO - __main__ - Step 108950: {'lr': 8.905886465119003e-05, 'samples': 20918400, 'steps': 108949, 'loss/train': 1.4858497381210327} 11/07/2021 12:34:07 - INFO - __main__ - Step 108951: {'lr': 8.905480384123693e-05, 'samples': 20918592, 'steps': 108950, 'loss/train': 1.4009153842926025} 11/07/2021 12:34:08 - INFO - __main__ - Step 108952: {'lr': 8.905074310380323e-05, 'samples': 20918784, 'steps': 108951, 'loss/train': 1.5737446546554565} 11/07/2021 12:34:08 - INFO - __main__ - Step 108953: {'lr': 8.90466824388908e-05, 'samples': 20918976, 'steps': 108952, 'loss/train': 1.397402286529541} 11/07/2021 12:34:09 - INFO - __main__ - Step 108954: {'lr': 8.904262184650148e-05, 'samples': 20919168, 'steps': 108953, 'loss/train': 1.021007776260376} 11/07/2021 12:34:10 - INFO - __main__ - Step 108955: {'lr': 8.903856132663701e-05, 'samples': 20919360, 'steps': 108954, 'loss/train': 1.635277509689331} 11/07/2021 12:34:10 - INFO - __main__ - Step 108956: {'lr': 8.903450087929931e-05, 'samples': 20919552, 'steps': 108955, 'loss/train': 1.5964727401733398} 11/07/2021 12:34:10 - INFO - __main__ - Step 108957: {'lr': 8.903044050449019e-05, 'samples': 20919744, 'steps': 108956, 'loss/train': 1.7074366807937622} 11/07/2021 12:34:11 - INFO - __main__ - Step 108958: {'lr': 8.902638020221145e-05, 'samples': 20919936, 'steps': 108957, 'loss/train': 0.45838671922683716} 11/07/2021 12:34:11 - INFO - __main__ - Step 108959: {'lr': 8.902231997246496e-05, 'samples': 20920128, 'steps': 108958, 'loss/train': 1.4009721279144287} 11/07/2021 12:34:12 - INFO - __main__ - Step 108960: {'lr': 8.901825981525252e-05, 'samples': 20920320, 'steps': 108959, 'loss/train': 1.3315675258636475} 11/07/2021 12:34:12 - INFO - __main__ - Step 108961: {'lr': 8.901419973057603e-05, 'samples': 20920512, 'steps': 108960, 'loss/train': 1.436379313468933} 11/07/2021 12:34:13 - INFO - __main__ - Step 108962: {'lr': 8.901013971843722e-05, 'samples': 20920704, 'steps': 108961, 'loss/train': 0.7518317103385925} 11/07/2021 12:34:13 - INFO - __main__ - Step 108963: {'lr': 8.900607977883792e-05, 'samples': 20920896, 'steps': 108962, 'loss/train': 1.5758570432662964} 11/07/2021 12:34:13 - INFO - __main__ - Step 108964: {'lr': 8.900201991178e-05, 'samples': 20921088, 'steps': 108963, 'loss/train': 1.084384799003601} 11/07/2021 12:34:14 - INFO - __main__ - Step 108965: {'lr': 8.89979601172653e-05, 'samples': 20921280, 'steps': 108964, 'loss/train': 1.340240716934204} 11/07/2021 12:34:15 - INFO - __main__ - Step 108966: {'lr': 8.899390039529561e-05, 'samples': 20921472, 'steps': 108965, 'loss/train': 1.5659546852111816} 11/07/2021 12:34:15 - INFO - __main__ - Step 108967: {'lr': 8.898984074587282e-05, 'samples': 20921664, 'steps': 108966, 'loss/train': 1.2060978412628174} 11/07/2021 12:34:15 - INFO - __main__ - Step 108968: {'lr': 8.89857811689987e-05, 'samples': 20921856, 'steps': 108967, 'loss/train': 1.7023544311523438} 11/07/2021 12:34:16 - INFO - __main__ - Step 108969: {'lr': 8.89817216646751e-05, 'samples': 20922048, 'steps': 108968, 'loss/train': 1.2975654602050781} 11/07/2021 12:34:16 - INFO - __main__ - Step 108970: {'lr': 8.897766223290385e-05, 'samples': 20922240, 'steps': 108969, 'loss/train': 1.0438990592956543} 11/07/2021 12:34:17 - INFO - __main__ - Step 108971: {'lr': 8.897360287368681e-05, 'samples': 20922432, 'steps': 108970, 'loss/train': 1.327979564666748} 11/07/2021 12:34:18 - INFO - __main__ - Step 108972: {'lr': 8.896954358702574e-05, 'samples': 20922624, 'steps': 108971, 'loss/train': 1.2979719638824463} 11/07/2021 12:34:18 - INFO - __main__ - Step 108973: {'lr': 8.896548437292254e-05, 'samples': 20922816, 'steps': 108972, 'loss/train': 1.7346094846725464} 11/07/2021 12:34:18 - INFO - __main__ - Step 108974: {'lr': 8.896142523137899e-05, 'samples': 20923008, 'steps': 108973, 'loss/train': 1.2772762775421143} 11/07/2021 12:34:19 - INFO - __main__ - Step 108975: {'lr': 8.895736616239703e-05, 'samples': 20923200, 'steps': 108974, 'loss/train': 0.2913212478160858} 11/07/2021 12:34:20 - INFO - __main__ - Step 108976: {'lr': 8.89533071659783e-05, 'samples': 20923392, 'steps': 108975, 'loss/train': 1.3885557651519775} 11/07/2021 12:34:20 - INFO - __main__ - Step 108977: {'lr': 8.894924824212475e-05, 'samples': 20923584, 'steps': 108976, 'loss/train': 1.3028150796890259} 11/07/2021 12:34:20 - INFO - __main__ - Step 108978: {'lr': 8.894518939083815e-05, 'samples': 20923776, 'steps': 108977, 'loss/train': 1.8384151458740234} 11/07/2021 12:34:21 - INFO - __main__ - Step 108979: {'lr': 8.894113061212039e-05, 'samples': 20923968, 'steps': 108978, 'loss/train': 1.3396117687225342} 11/07/2021 12:34:21 - INFO - __main__ - Step 108980: {'lr': 8.893707190597325e-05, 'samples': 20924160, 'steps': 108979, 'loss/train': 1.2347469329833984} 11/07/2021 12:34:22 - INFO - __main__ - Step 108981: {'lr': 8.893301327239859e-05, 'samples': 20924352, 'steps': 108980, 'loss/train': 1.1292065382003784} 11/07/2021 12:34:23 - INFO - __main__ - Step 108982: {'lr': 8.892895471139822e-05, 'samples': 20924544, 'steps': 108981, 'loss/train': 1.5755643844604492} 11/07/2021 12:34:23 - INFO - __main__ - Step 108983: {'lr': 8.892489622297397e-05, 'samples': 20924736, 'steps': 108982, 'loss/train': 0.7726569771766663} 11/07/2021 12:34:23 - INFO - __main__ - Step 108984: {'lr': 8.89208378071277e-05, 'samples': 20924928, 'steps': 108983, 'loss/train': 1.3984978199005127} 11/07/2021 12:34:24 - INFO - __main__ - Step 108985: {'lr': 8.891677946386123e-05, 'samples': 20925120, 'steps': 108984, 'loss/train': 0.7621168494224548} 11/07/2021 12:34:25 - INFO - __main__ - Step 108986: {'lr': 8.891272119317634e-05, 'samples': 20925312, 'steps': 108985, 'loss/train': 1.1262935400009155} 11/07/2021 12:34:25 - INFO - __main__ - Step 108987: {'lr': 8.89086629950749e-05, 'samples': 20925504, 'steps': 108986, 'loss/train': 1.598976969718933} 11/07/2021 12:34:25 - INFO - __main__ - Step 108988: {'lr': 8.890460486955882e-05, 'samples': 20925696, 'steps': 108987, 'loss/train': 1.0263316631317139} 11/07/2021 12:34:26 - INFO - __main__ - Step 108989: {'lr': 8.890054681662976e-05, 'samples': 20925888, 'steps': 108988, 'loss/train': 1.522050142288208} 11/07/2021 12:34:26 - INFO - __main__ - Step 108990: {'lr': 8.889648883628961e-05, 'samples': 20926080, 'steps': 108989, 'loss/train': 0.4134552478790283} 11/07/2021 12:34:26 - INFO - __main__ - Step 108991: {'lr': 8.889243092854021e-05, 'samples': 20926272, 'steps': 108990, 'loss/train': 1.650740146636963} 11/07/2021 12:34:28 - INFO - __main__ - Step 108992: {'lr': 8.888837309338344e-05, 'samples': 20926464, 'steps': 108991, 'loss/train': 1.5543733835220337} 11/07/2021 12:34:28 - INFO - __main__ - Step 108993: {'lr': 8.888431533082104e-05, 'samples': 20926656, 'steps': 108992, 'loss/train': 1.230610728263855} 11/07/2021 12:34:28 - INFO - __main__ - Step 108994: {'lr': 8.888025764085489e-05, 'samples': 20926848, 'steps': 108993, 'loss/train': 1.3523287773132324} 11/07/2021 12:34:29 - INFO - __main__ - Step 108995: {'lr': 8.887620002348681e-05, 'samples': 20927040, 'steps': 108994, 'loss/train': 1.241328239440918} 11/07/2021 12:34:29 - INFO - __main__ - Step 108996: {'lr': 8.887214247871864e-05, 'samples': 20927232, 'steps': 108995, 'loss/train': 1.4309265613555908} 11/07/2021 12:34:30 - INFO - __main__ - Step 108997: {'lr': 8.886808500655219e-05, 'samples': 20927424, 'steps': 108996, 'loss/train': 1.1186732053756714} 11/07/2021 12:34:30 - INFO - __main__ - Step 108998: {'lr': 8.886402760698931e-05, 'samples': 20927616, 'steps': 108997, 'loss/train': 0.5308315753936768} 11/07/2021 12:34:31 - INFO - __main__ - Step 108999: {'lr': 8.885997028003179e-05, 'samples': 20927808, 'steps': 108998, 'loss/train': 1.253472924232483} 11/07/2021 12:34:31 - INFO - __main__ - Step 109000: {'lr': 8.885591302568147e-05, 'samples': 20928000, 'steps': 108999, 'loss/train': 0.6856914162635803} 11/07/2021 12:34:32 - INFO - __main__ - Step 109001: {'lr': 8.88518558439402e-05, 'samples': 20928192, 'steps': 109000, 'loss/train': 1.00327730178833} 11/07/2021 12:34:32 - INFO - __main__ - Step 109002: {'lr': 8.884779873480991e-05, 'samples': 20928384, 'steps': 109001, 'loss/train': 0.5959376692771912} 11/07/2021 12:34:33 - INFO - __main__ - Step 109003: {'lr': 8.884374169829221e-05, 'samples': 20928576, 'steps': 109002, 'loss/train': 1.5929988622665405} 11/07/2021 12:34:33 - INFO - __main__ - Step 109004: {'lr': 8.883968473438903e-05, 'samples': 20928768, 'steps': 109003, 'loss/train': 1.1154863834381104} 11/07/2021 12:34:34 - INFO - __main__ - Step 109005: {'lr': 8.88356278431022e-05, 'samples': 20928960, 'steps': 109004, 'loss/train': 1.1458436250686646} 11/07/2021 12:34:34 - INFO - __main__ - Step 109006: {'lr': 8.883157102443354e-05, 'samples': 20929152, 'steps': 109005, 'loss/train': 0.8906897306442261} 11/07/2021 12:34:35 - INFO - __main__ - Step 109007: {'lr': 8.88275142783849e-05, 'samples': 20929344, 'steps': 109006, 'loss/train': 1.3005648851394653} 11/07/2021 12:34:35 - INFO - __main__ - Step 109008: {'lr': 8.88234576049581e-05, 'samples': 20929536, 'steps': 109007, 'loss/train': 1.6879740953445435} 11/07/2021 12:34:36 - INFO - __main__ - Step 109009: {'lr': 8.881940100415495e-05, 'samples': 20929728, 'steps': 109008, 'loss/train': 1.4182039499282837} 11/07/2021 12:34:36 - INFO - __main__ - Step 109010: {'lr': 8.881534447597731e-05, 'samples': 20929920, 'steps': 109009, 'loss/train': 1.2469661235809326} 11/07/2021 12:34:36 - INFO - __main__ - Step 109011: {'lr': 8.881128802042695e-05, 'samples': 20930112, 'steps': 109010, 'loss/train': 1.4794032573699951} 11/07/2021 12:34:37 - INFO - __main__ - Step 109012: {'lr': 8.880723163750579e-05, 'samples': 20930304, 'steps': 109011, 'loss/train': 1.648417353630066} 11/07/2021 12:34:38 - INFO - __main__ - Step 109013: {'lr': 8.880317532721558e-05, 'samples': 20930496, 'steps': 109012, 'loss/train': 0.9362017512321472} 11/07/2021 12:34:38 - INFO - __main__ - Step 109014: {'lr': 8.879911908955815e-05, 'samples': 20930688, 'steps': 109013, 'loss/train': 1.2833313941955566} 11/07/2021 12:34:39 - INFO - __main__ - Step 109015: {'lr': 8.879506292453546e-05, 'samples': 20930880, 'steps': 109014, 'loss/train': 1.4528635740280151} 11/07/2021 12:34:39 - INFO - __main__ - Step 109016: {'lr': 8.879100683214913e-05, 'samples': 20931072, 'steps': 109015, 'loss/train': 5.697320938110352} 11/07/2021 12:34:39 - INFO - __main__ - Step 109017: {'lr': 8.87869508124011e-05, 'samples': 20931264, 'steps': 109016, 'loss/train': 1.3786171674728394} 11/07/2021 12:34:40 - INFO - __main__ - Step 109018: {'lr': 8.878289486529317e-05, 'samples': 20931456, 'steps': 109017, 'loss/train': 0.8116582632064819} 11/07/2021 12:34:41 - INFO - __main__ - Step 109019: {'lr': 8.877883899082717e-05, 'samples': 20931648, 'steps': 109018, 'loss/train': 1.4642919301986694} 11/07/2021 12:34:41 - INFO - __main__ - Step 109020: {'lr': 8.877478318900494e-05, 'samples': 20931840, 'steps': 109019, 'loss/train': 0.6593176126480103} 11/07/2021 12:34:41 - INFO - __main__ - Step 109021: {'lr': 8.877072745982834e-05, 'samples': 20932032, 'steps': 109020, 'loss/train': 1.5425138473510742} 11/07/2021 12:34:42 - INFO - __main__ - Step 109022: {'lr': 8.876667180329912e-05, 'samples': 20932224, 'steps': 109021, 'loss/train': 1.2724894285202026} 11/07/2021 12:34:43 - INFO - __main__ - Step 109023: {'lr': 8.876261621941916e-05, 'samples': 20932416, 'steps': 109022, 'loss/train': 1.8750232458114624} 11/07/2021 12:34:44 - INFO - __main__ - Step 109024: {'lr': 8.87585607081903e-05, 'samples': 20932608, 'steps': 109023, 'loss/train': 1.514605164527893} 11/07/2021 12:34:44 - INFO - __main__ - Step 109025: {'lr': 8.875450526961431e-05, 'samples': 20932800, 'steps': 109024, 'loss/train': 1.1323587894439697} 11/07/2021 12:34:44 - INFO - __main__ - Step 109026: {'lr': 8.875044990369308e-05, 'samples': 20932992, 'steps': 109025, 'loss/train': 1.2571605443954468} 11/07/2021 12:34:45 - INFO - __main__ - Step 109027: {'lr': 8.874639461042838e-05, 'samples': 20933184, 'steps': 109026, 'loss/train': 1.387742280960083} 11/07/2021 12:34:45 - INFO - __main__ - Step 109028: {'lr': 8.87423393898221e-05, 'samples': 20933376, 'steps': 109027, 'loss/train': 1.1725927591323853} 11/07/2021 12:34:46 - INFO - __main__ - Step 109029: {'lr': 8.87382842418761e-05, 'samples': 20933568, 'steps': 109028, 'loss/train': 1.0664314031600952} 11/07/2021 12:34:46 - INFO - __main__ - Step 109030: {'lr': 8.873422916659207e-05, 'samples': 20933760, 'steps': 109029, 'loss/train': 0.7524254322052002} 11/07/2021 12:34:47 - INFO - __main__ - Step 109031: {'lr': 8.873017416397189e-05, 'samples': 20933952, 'steps': 109030, 'loss/train': 1.9693238735198975} 11/07/2021 12:34:47 - INFO - __main__ - Step 109032: {'lr': 8.872611923401741e-05, 'samples': 20934144, 'steps': 109031, 'loss/train': 1.6071912050247192} 11/07/2021 12:34:47 - INFO - __main__ - Step 109033: {'lr': 8.872206437673045e-05, 'samples': 20934336, 'steps': 109032, 'loss/train': 1.6027554273605347} 11/07/2021 12:34:49 - INFO - __main__ - Step 109034: {'lr': 8.871800959211284e-05, 'samples': 20934528, 'steps': 109033, 'loss/train': 1.506380558013916} 11/07/2021 12:34:49 - INFO - __main__ - Step 109035: {'lr': 8.871395488016642e-05, 'samples': 20934720, 'steps': 109034, 'loss/train': 1.561060905456543} 11/07/2021 12:34:50 - INFO - __main__ - Step 109036: {'lr': 8.870990024089299e-05, 'samples': 20934912, 'steps': 109035, 'loss/train': 1.5213497877120972} 11/07/2021 12:34:50 - INFO - __main__ - Step 109037: {'lr': 8.870584567429437e-05, 'samples': 20935104, 'steps': 109036, 'loss/train': 0.8885570168495178} 11/07/2021 12:34:50 - INFO - __main__ - Step 109038: {'lr': 8.870179118037244e-05, 'samples': 20935296, 'steps': 109037, 'loss/train': 1.4968106746673584} 11/07/2021 12:34:51 - INFO - __main__ - Step 109039: {'lr': 8.869773675912899e-05, 'samples': 20935488, 'steps': 109038, 'loss/train': 1.2853206396102905} 11/07/2021 12:34:52 - INFO - __main__ - Step 109040: {'lr': 8.869368241056583e-05, 'samples': 20935680, 'steps': 109039, 'loss/train': 1.3491955995559692} 11/07/2021 12:34:52 - INFO - __main__ - Step 109041: {'lr': 8.868962813468484e-05, 'samples': 20935872, 'steps': 109040, 'loss/train': 1.2523040771484375} 11/07/2021 12:34:52 - INFO - __main__ - Step 109042: {'lr': 8.868557393148787e-05, 'samples': 20936064, 'steps': 109041, 'loss/train': 1.4687790870666504} 11/07/2021 12:34:53 - INFO - __main__ - Step 109043: {'lr': 8.868151980097661e-05, 'samples': 20936256, 'steps': 109042, 'loss/train': 0.8430145382881165} 11/07/2021 12:34:53 - INFO - __main__ - Step 109044: {'lr': 8.867746574315299e-05, 'samples': 20936448, 'steps': 109043, 'loss/train': 1.4764007329940796} 11/07/2021 12:34:54 - INFO - __main__ - Step 109045: {'lr': 8.867341175801879e-05, 'samples': 20936640, 'steps': 109044, 'loss/train': 0.08607266843318939} 11/07/2021 12:34:54 - INFO - __main__ - Step 109046: {'lr': 8.866935784557587e-05, 'samples': 20936832, 'steps': 109045, 'loss/train': 1.0162007808685303} 11/07/2021 12:34:55 - INFO - __main__ - Step 109047: {'lr': 8.866530400582607e-05, 'samples': 20937024, 'steps': 109046, 'loss/train': 1.5654041767120361} 11/07/2021 12:34:55 - INFO - __main__ - Step 109048: {'lr': 8.866125023877116e-05, 'samples': 20937216, 'steps': 109047, 'loss/train': 0.7249902486801147} 11/07/2021 12:34:55 - INFO - __main__ - Step 109049: {'lr': 8.865719654441302e-05, 'samples': 20937408, 'steps': 109048, 'loss/train': 1.3984194993972778} 11/07/2021 12:34:57 - INFO - __main__ - Step 109050: {'lr': 8.865314292275345e-05, 'samples': 20937600, 'steps': 109049, 'loss/train': 0.6833963990211487} 11/07/2021 12:34:57 - INFO - __main__ - Step 109051: {'lr': 8.864908937379429e-05, 'samples': 20937792, 'steps': 109050, 'loss/train': 1.0963479280471802} 11/07/2021 12:34:57 - INFO - __main__ - Step 109052: {'lr': 8.864503589753733e-05, 'samples': 20937984, 'steps': 109051, 'loss/train': 1.4726511240005493} 11/07/2021 12:34:58 - INFO - __main__ - Step 109053: {'lr': 8.864098249398448e-05, 'samples': 20938176, 'steps': 109052, 'loss/train': 1.3018699884414673} 11/07/2021 12:34:58 - INFO - __main__ - Step 109054: {'lr': 8.863692916313749e-05, 'samples': 20938368, 'steps': 109053, 'loss/train': 1.2697343826293945} 11/07/2021 12:34:59 - INFO - __main__ - Step 109055: {'lr': 8.863287590499827e-05, 'samples': 20938560, 'steps': 109054, 'loss/train': 1.0353786945343018} 11/07/2021 12:34:59 - INFO - __main__ - Step 109056: {'lr': 8.862882271956852e-05, 'samples': 20938752, 'steps': 109055, 'loss/train': 1.1086640357971191} 11/07/2021 12:35:00 - INFO - __main__ - Step 109057: {'lr': 8.862476960685014e-05, 'samples': 20938944, 'steps': 109056, 'loss/train': 1.0763828754425049} 11/07/2021 12:35:00 - INFO - __main__ - Step 109058: {'lr': 8.862071656684495e-05, 'samples': 20939136, 'steps': 109057, 'loss/train': 1.7271944284439087} 11/07/2021 12:35:00 - INFO - __main__ - Step 109059: {'lr': 8.861666359955475e-05, 'samples': 20939328, 'steps': 109058, 'loss/train': 1.506090521812439} 11/07/2021 12:35:01 - INFO - __main__ - Step 109060: {'lr': 8.861261070498142e-05, 'samples': 20939520, 'steps': 109059, 'loss/train': 1.3997786045074463} 11/07/2021 12:35:02 - INFO - __main__ - Step 109061: {'lr': 8.860855788312674e-05, 'samples': 20939712, 'steps': 109060, 'loss/train': 1.375430703163147} 11/07/2021 12:35:02 - INFO - __main__ - Step 109062: {'lr': 8.860450513399257e-05, 'samples': 20939904, 'steps': 109061, 'loss/train': 1.2762032747268677} 11/07/2021 12:35:03 - INFO - __main__ - Step 109063: {'lr': 8.860045245758069e-05, 'samples': 20940096, 'steps': 109062, 'loss/train': 1.5821852684020996} 11/07/2021 12:35:03 - INFO - __main__ - Step 109064: {'lr': 8.859639985389297e-05, 'samples': 20940288, 'steps': 109063, 'loss/train': 0.772814929485321} 11/07/2021 12:35:03 - INFO - __main__ - Step 109065: {'lr': 8.859234732293126e-05, 'samples': 20940480, 'steps': 109064, 'loss/train': 1.316403865814209} 11/07/2021 12:35:04 - INFO - __main__ - Step 109066: {'lr': 8.85882948646973e-05, 'samples': 20940672, 'steps': 109065, 'loss/train': 1.5122889280319214} 11/07/2021 12:35:05 - INFO - __main__ - Step 109067: {'lr': 8.858424247919298e-05, 'samples': 20940864, 'steps': 109066, 'loss/train': 1.5326507091522217} 11/07/2021 12:35:05 - INFO - __main__ - Step 109068: {'lr': 8.858019016642011e-05, 'samples': 20941056, 'steps': 109067, 'loss/train': 1.4858791828155518} 11/07/2021 12:35:06 - INFO - __main__ - Step 109069: {'lr': 8.857613792638059e-05, 'samples': 20941248, 'steps': 109068, 'loss/train': 1.3564434051513672} 11/07/2021 12:35:06 - INFO - __main__ - Step 109070: {'lr': 8.85720857590761e-05, 'samples': 20941440, 'steps': 109069, 'loss/train': 1.2205287218093872} 11/07/2021 12:35:07 - INFO - __main__ - Step 109071: {'lr': 8.856803366450853e-05, 'samples': 20941632, 'steps': 109070, 'loss/train': 1.3702542781829834} 11/07/2021 12:35:07 - INFO - __main__ - Step 109072: {'lr': 8.85639816426797e-05, 'samples': 20941824, 'steps': 109071, 'loss/train': 1.4318351745605469} 11/07/2021 12:35:08 - INFO - __main__ - Step 109073: {'lr': 8.855992969359147e-05, 'samples': 20942016, 'steps': 109072, 'loss/train': 1.158159613609314} 11/07/2021 12:35:08 - INFO - __main__ - Step 109074: {'lr': 8.855587781724565e-05, 'samples': 20942208, 'steps': 109073, 'loss/train': 1.1141866445541382} 11/07/2021 12:35:08 - INFO - __main__ - Step 109075: {'lr': 8.855182601364403e-05, 'samples': 20942400, 'steps': 109074, 'loss/train': 1.3256064653396606} 11/07/2021 12:35:09 - INFO - __main__ - Step 109076: {'lr': 8.854777428278848e-05, 'samples': 20942592, 'steps': 109075, 'loss/train': 1.4922430515289307} 11/07/2021 12:35:10 - INFO - __main__ - Step 109077: {'lr': 8.854372262468083e-05, 'samples': 20942784, 'steps': 109076, 'loss/train': 1.2999153137207031} 11/07/2021 12:35:10 - INFO - __main__ - Step 109078: {'lr': 8.853967103932286e-05, 'samples': 20942976, 'steps': 109077, 'loss/train': 0.6517590880393982} 11/07/2021 12:35:10 - INFO - __main__ - Step 109079: {'lr': 8.853561952671646e-05, 'samples': 20943168, 'steps': 109078, 'loss/train': 1.0466907024383545} 11/07/2021 12:35:11 - INFO - __main__ - Step 109080: {'lr': 8.853156808686338e-05, 'samples': 20943360, 'steps': 109079, 'loss/train': 1.4588009119033813} 11/07/2021 12:35:12 - INFO - __main__ - Step 109081: {'lr': 8.85275167197655e-05, 'samples': 20943552, 'steps': 109080, 'loss/train': 0.9887416958808899} 11/07/2021 12:35:12 - INFO - __main__ - Step 109082: {'lr': 8.852346542542471e-05, 'samples': 20943744, 'steps': 109081, 'loss/train': 1.3908003568649292} 11/07/2021 12:35:12 - INFO - __main__ - Step 109083: {'lr': 8.851941420384266e-05, 'samples': 20943936, 'steps': 109082, 'loss/train': 1.6136592626571655} 11/07/2021 12:35:13 - INFO - __main__ - Step 109084: {'lr': 8.851536305502128e-05, 'samples': 20944128, 'steps': 109083, 'loss/train': 1.2231547832489014} 11/07/2021 12:35:13 - INFO - __main__ - Step 109085: {'lr': 8.851131197896239e-05, 'samples': 20944320, 'steps': 109084, 'loss/train': 1.3053908348083496} 11/07/2021 12:35:14 - INFO - __main__ - Step 109086: {'lr': 8.85072609756678e-05, 'samples': 20944512, 'steps': 109085, 'loss/train': 1.419897198677063} 11/07/2021 12:35:14 - INFO - __main__ - Step 109087: {'lr': 8.850321004513937e-05, 'samples': 20944704, 'steps': 109086, 'loss/train': 1.3525389432907104} 11/07/2021 12:35:15 - INFO - __main__ - Step 109088: {'lr': 8.849915918737889e-05, 'samples': 20944896, 'steps': 109087, 'loss/train': 1.1566404104232788} 11/07/2021 12:35:15 - INFO - __main__ - Step 109089: {'lr': 8.84951084023882e-05, 'samples': 20945088, 'steps': 109088, 'loss/train': 1.1265101432800293} 11/07/2021 12:35:16 - INFO - __main__ - Step 109090: {'lr': 8.84910576901691e-05, 'samples': 20945280, 'steps': 109089, 'loss/train': 1.071544885635376} 11/07/2021 12:35:16 - INFO - __main__ - Step 109091: {'lr': 8.848700705072346e-05, 'samples': 20945472, 'steps': 109090, 'loss/train': 1.0909335613250732} 11/07/2021 12:35:17 - INFO - __main__ - Step 109092: {'lr': 8.848295648405308e-05, 'samples': 20945664, 'steps': 109091, 'loss/train': 1.562625527381897} 11/07/2021 12:35:17 - INFO - __main__ - Step 109093: {'lr': 8.847890599015975e-05, 'samples': 20945856, 'steps': 109092, 'loss/train': 1.655277132987976} 11/07/2021 12:35:18 - INFO - __main__ - Step 109094: {'lr': 8.847485556904547e-05, 'samples': 20946048, 'steps': 109093, 'loss/train': 1.4641497135162354} 11/07/2021 12:35:18 - INFO - __main__ - Step 109095: {'lr': 8.847080522071183e-05, 'samples': 20946240, 'steps': 109094, 'loss/train': 0.8856701254844666} 11/07/2021 12:35:18 - INFO - __main__ - Step 109096: {'lr': 8.846675494516074e-05, 'samples': 20946432, 'steps': 109095, 'loss/train': 0.5593999028205872} 11/07/2021 12:35:19 - INFO - __main__ - Step 109097: {'lr': 8.846270474239403e-05, 'samples': 20946624, 'steps': 109096, 'loss/train': 1.506491780281067} 11/07/2021 12:35:20 - INFO - __main__ - Step 109098: {'lr': 8.845865461241354e-05, 'samples': 20946816, 'steps': 109097, 'loss/train': 1.2522605657577515} 11/07/2021 12:35:20 - INFO - __main__ - Step 109099: {'lr': 8.845460455522111e-05, 'samples': 20947008, 'steps': 109098, 'loss/train': 1.560623288154602} 11/07/2021 12:35:21 - INFO - __main__ - Step 109100: {'lr': 8.845055457081852e-05, 'samples': 20947200, 'steps': 109099, 'loss/train': 1.0906881093978882} 11/07/2021 12:35:21 - INFO - __main__ - Step 109101: {'lr': 8.844650465920761e-05, 'samples': 20947392, 'steps': 109100, 'loss/train': 1.3355073928833008} 11/07/2021 12:35:22 - INFO - __main__ - Step 109102: {'lr': 8.844245482039023e-05, 'samples': 20947584, 'steps': 109101, 'loss/train': 1.03346848487854} 11/07/2021 12:35:22 - INFO - __main__ - Step 109103: {'lr': 8.843840505436818e-05, 'samples': 20947776, 'steps': 109102, 'loss/train': 1.7778671979904175} 11/07/2021 12:35:23 - INFO - __main__ - Step 109104: {'lr': 8.84343553611433e-05, 'samples': 20947968, 'steps': 109103, 'loss/train': 1.4352880716323853} 11/07/2021 12:35:23 - INFO - __main__ - Step 109105: {'lr': 8.843030574071747e-05, 'samples': 20948160, 'steps': 109104, 'loss/train': 1.6565160751342773} 11/07/2021 12:35:23 - INFO - __main__ - Step 109106: {'lr': 8.842625619309239e-05, 'samples': 20948352, 'steps': 109105, 'loss/train': 1.447616457939148} 11/07/2021 12:35:24 - INFO - __main__ - Step 109107: {'lr': 8.842220671826992e-05, 'samples': 20948544, 'steps': 109106, 'loss/train': 1.3845102787017822} 11/07/2021 12:35:25 - INFO - __main__ - Step 109108: {'lr': 8.841815731625194e-05, 'samples': 20948736, 'steps': 109107, 'loss/train': 1.272392749786377} 11/07/2021 12:35:25 - INFO - __main__ - Step 109109: {'lr': 8.841410798704022e-05, 'samples': 20948928, 'steps': 109108, 'loss/train': 1.4763257503509521} 11/07/2021 12:35:25 - INFO - __main__ - Step 109110: {'lr': 8.841005873063662e-05, 'samples': 20949120, 'steps': 109109, 'loss/train': 0.7532762885093689} 11/07/2021 12:35:26 - INFO - __main__ - Step 109111: {'lr': 8.840600954704295e-05, 'samples': 20949312, 'steps': 109110, 'loss/train': 1.222991943359375} 11/07/2021 12:35:27 - INFO - __main__ - Step 109112: {'lr': 8.840196043626106e-05, 'samples': 20949504, 'steps': 109111, 'loss/train': 1.1675026416778564} 11/07/2021 12:35:27 - INFO - __main__ - Step 109113: {'lr': 8.839791139829273e-05, 'samples': 20949696, 'steps': 109112, 'loss/train': 1.476405143737793} 11/07/2021 12:35:27 - INFO - __main__ - Step 109114: {'lr': 8.839386243313982e-05, 'samples': 20949888, 'steps': 109113, 'loss/train': 1.1754504442214966} 11/07/2021 12:35:28 - INFO - __main__ - Step 109115: {'lr': 8.838981354080411e-05, 'samples': 20950080, 'steps': 109114, 'loss/train': 1.39669930934906} 11/07/2021 12:35:28 - INFO - __main__ - Step 109116: {'lr': 8.838576472128757e-05, 'samples': 20950272, 'steps': 109115, 'loss/train': 1.428232192993164} 11/07/2021 12:35:29 - INFO - __main__ - Step 109117: {'lr': 8.838171597459182e-05, 'samples': 20950464, 'steps': 109116, 'loss/train': 0.6812964677810669} 11/07/2021 12:35:30 - INFO - __main__ - Step 109118: {'lr': 8.837766730071878e-05, 'samples': 20950656, 'steps': 109117, 'loss/train': 1.3189828395843506} 11/07/2021 12:35:30 - INFO - __main__ - Step 109119: {'lr': 8.837361869967029e-05, 'samples': 20950848, 'steps': 109118, 'loss/train': 1.2193962335586548} 11/07/2021 12:35:30 - INFO - __main__ - Step 109120: {'lr': 8.836957017144812e-05, 'samples': 20951040, 'steps': 109119, 'loss/train': 1.2207037210464478} 11/07/2021 12:35:31 - INFO - __main__ - Step 109121: {'lr': 8.836552171605413e-05, 'samples': 20951232, 'steps': 109120, 'loss/train': 1.1261166334152222} 11/07/2021 12:35:32 - INFO - __main__ - Step 109122: {'lr': 8.836147333349015e-05, 'samples': 20951424, 'steps': 109121, 'loss/train': 1.0931670665740967} 11/07/2021 12:35:32 - INFO - __main__ - Step 109123: {'lr': 8.835742502375799e-05, 'samples': 20951616, 'steps': 109122, 'loss/train': 1.533278226852417} 11/07/2021 12:35:32 - INFO - __main__ - Step 109124: {'lr': 8.835337678685948e-05, 'samples': 20951808, 'steps': 109123, 'loss/train': 1.2807804346084595} 11/07/2021 12:35:33 - INFO - __main__ - Step 109125: {'lr': 8.834932862279646e-05, 'samples': 20952000, 'steps': 109124, 'loss/train': 0.9886094927787781} 11/07/2021 12:35:33 - INFO - __main__ - Step 109126: {'lr': 8.834528053157081e-05, 'samples': 20952192, 'steps': 109125, 'loss/train': 1.474685549736023} 11/07/2021 12:35:34 - INFO - __main__ - Step 109127: {'lr': 8.834123251318419e-05, 'samples': 20952384, 'steps': 109126, 'loss/train': 1.4336583614349365} 11/07/2021 12:35:35 - INFO - __main__ - Step 109128: {'lr': 8.833718456763853e-05, 'samples': 20952576, 'steps': 109127, 'loss/train': 1.3152920007705688} 11/07/2021 12:35:35 - INFO - __main__ - Step 109129: {'lr': 8.833313669493561e-05, 'samples': 20952768, 'steps': 109128, 'loss/train': 1.3200548887252808} 11/07/2021 12:35:35 - INFO - __main__ - Step 109130: {'lr': 8.832908889507732e-05, 'samples': 20952960, 'steps': 109129, 'loss/train': 1.3482632637023926} 11/07/2021 12:35:36 - INFO - __main__ - Step 109131: {'lr': 8.832504116806545e-05, 'samples': 20953152, 'steps': 109130, 'loss/train': 1.0964149236679077} 11/07/2021 12:35:37 - INFO - __main__ - Step 109132: {'lr': 8.83209935139018e-05, 'samples': 20953344, 'steps': 109131, 'loss/train': 1.2242995500564575} 11/07/2021 12:35:37 - INFO - __main__ - Step 109133: {'lr': 8.831694593258824e-05, 'samples': 20953536, 'steps': 109132, 'loss/train': 1.4045040607452393} 11/07/2021 12:35:37 - INFO - __main__ - Step 109134: {'lr': 8.831289842412655e-05, 'samples': 20953728, 'steps': 109133, 'loss/train': 1.1874973773956299} 11/07/2021 12:35:38 - INFO - __main__ - Step 109135: {'lr': 8.830885098851857e-05, 'samples': 20953920, 'steps': 109134, 'loss/train': 2.0143918991088867} 11/07/2021 12:35:38 - INFO - __main__ - Step 109136: {'lr': 8.830480362576613e-05, 'samples': 20954112, 'steps': 109135, 'loss/train': 1.4845690727233887} 11/07/2021 12:35:38 - INFO - __main__ - Step 109137: {'lr': 8.830075633587115e-05, 'samples': 20954304, 'steps': 109136, 'loss/train': 1.1346689462661743} 11/07/2021 12:35:39 - INFO - __main__ - Step 109138: {'lr': 8.829670911883525e-05, 'samples': 20954496, 'steps': 109137, 'loss/train': 1.3920680284500122} 11/07/2021 12:35:40 - INFO - __main__ - Step 109139: {'lr': 8.829266197466035e-05, 'samples': 20954688, 'steps': 109138, 'loss/train': 1.4247034788131714} 11/07/2021 12:35:40 - INFO - __main__ - Step 109140: {'lr': 8.82886149033483e-05, 'samples': 20954880, 'steps': 109139, 'loss/train': 1.2527798414230347} 11/07/2021 12:35:40 - INFO - __main__ - Step 109141: {'lr': 8.828456790490092e-05, 'samples': 20955072, 'steps': 109140, 'loss/train': 1.0177206993103027} 11/07/2021 12:35:41 - INFO - __main__ - Step 109142: {'lr': 8.828052097931999e-05, 'samples': 20955264, 'steps': 109141, 'loss/train': 1.4441728591918945} 11/07/2021 12:35:42 - INFO - __main__ - Step 109143: {'lr': 8.827647412660735e-05, 'samples': 20955456, 'steps': 109142, 'loss/train': 1.373703956604004} 11/07/2021 12:35:42 - INFO - __main__ - Step 109144: {'lr': 8.827242734676485e-05, 'samples': 20955648, 'steps': 109143, 'loss/train': 1.577939748764038} 11/07/2021 12:35:42 - INFO - __main__ - Step 109145: {'lr': 8.82683806397943e-05, 'samples': 20955840, 'steps': 109144, 'loss/train': 1.706140398979187} 11/07/2021 12:35:43 - INFO - __main__ - Step 109146: {'lr': 8.826433400569755e-05, 'samples': 20956032, 'steps': 109145, 'loss/train': 1.5856621265411377} 11/07/2021 12:35:43 - INFO - __main__ - Step 109147: {'lr': 8.826028744447637e-05, 'samples': 20956224, 'steps': 109146, 'loss/train': 1.4574315547943115} 11/07/2021 12:35:44 - INFO - __main__ - Step 109148: {'lr': 8.82562409561326e-05, 'samples': 20956416, 'steps': 109147, 'loss/train': 1.4800925254821777} 11/07/2021 12:35:44 - INFO - __main__ - Step 109149: {'lr': 8.82521945406681e-05, 'samples': 20956608, 'steps': 109148, 'loss/train': 1.5470229387283325} 11/07/2021 12:35:45 - INFO - __main__ - Step 109150: {'lr': 8.824814819808472e-05, 'samples': 20956800, 'steps': 109149, 'loss/train': 1.0312368869781494} 11/07/2021 12:35:45 - INFO - __main__ - Step 109151: {'lr': 8.824410192838417e-05, 'samples': 20956992, 'steps': 109150, 'loss/train': 1.6574862003326416} 11/07/2021 12:35:46 - INFO - __main__ - Step 109152: {'lr': 8.82400557315683e-05, 'samples': 20957184, 'steps': 109151, 'loss/train': 1.441376805305481} 11/07/2021 12:35:47 - INFO - __main__ - Step 109153: {'lr': 8.8236009607639e-05, 'samples': 20957376, 'steps': 109152, 'loss/train': 1.4622169733047485} 11/07/2021 12:35:47 - INFO - __main__ - Step 109154: {'lr': 8.823196355659801e-05, 'samples': 20957568, 'steps': 109153, 'loss/train': 1.1756083965301514} 11/07/2021 12:35:47 - INFO - __main__ - Step 109155: {'lr': 8.822791757844726e-05, 'samples': 20957760, 'steps': 109154, 'loss/train': 1.473479151725769} 11/07/2021 12:35:48 - INFO - __main__ - Step 109156: {'lr': 8.822387167318846e-05, 'samples': 20957952, 'steps': 109155, 'loss/train': 0.9430810213088989} 11/07/2021 12:35:48 - INFO - __main__ - Step 109157: {'lr': 8.821982584082353e-05, 'samples': 20958144, 'steps': 109156, 'loss/train': 1.692064881324768} 11/07/2021 12:35:49 - INFO - __main__ - Step 109158: {'lr': 8.821578008135423e-05, 'samples': 20958336, 'steps': 109157, 'loss/train': 1.3648854494094849} 11/07/2021 12:35:49 - INFO - __main__ - Step 109159: {'lr': 8.821173439478241e-05, 'samples': 20958528, 'steps': 109158, 'loss/train': 1.2383731603622437} 11/07/2021 12:35:50 - INFO - __main__ - Step 109160: {'lr': 8.820768878110988e-05, 'samples': 20958720, 'steps': 109159, 'loss/train': 1.3463225364685059} 11/07/2021 12:35:50 - INFO - __main__ - Step 109161: {'lr': 8.820364324033847e-05, 'samples': 20958912, 'steps': 109160, 'loss/train': 1.603014349937439} 11/07/2021 12:35:50 - INFO - __main__ - Step 109162: {'lr': 8.819959777246999e-05, 'samples': 20959104, 'steps': 109161, 'loss/train': 0.5857007503509521} 11/07/2021 12:35:51 - INFO - __main__ - Step 109163: {'lr': 8.819555237750637e-05, 'samples': 20959296, 'steps': 109162, 'loss/train': 1.2961680889129639} 11/07/2021 12:35:52 - INFO - __main__ - Step 109164: {'lr': 8.819150705544926e-05, 'samples': 20959488, 'steps': 109163, 'loss/train': 0.5698115825653076} 11/07/2021 12:35:52 - INFO - __main__ - Step 109165: {'lr': 8.818746180630054e-05, 'samples': 20959680, 'steps': 109164, 'loss/train': 1.4009487628936768} 11/07/2021 12:35:53 - INFO - __main__ - Step 109166: {'lr': 8.818341663006208e-05, 'samples': 20959872, 'steps': 109165, 'loss/train': 1.9323904514312744} 11/07/2021 12:35:53 - INFO - __main__ - Step 109167: {'lr': 8.817937152673566e-05, 'samples': 20960064, 'steps': 109166, 'loss/train': 1.2850196361541748} 11/07/2021 12:35:54 - INFO - __main__ - Step 109168: {'lr': 8.817532649632312e-05, 'samples': 20960256, 'steps': 109167, 'loss/train': 1.3628590106964111} 11/07/2021 12:35:54 - INFO - __main__ - Step 109169: {'lr': 8.817128153882628e-05, 'samples': 20960448, 'steps': 109168, 'loss/train': 0.699771523475647} 11/07/2021 12:35:55 - INFO - __main__ - Step 109170: {'lr': 8.816723665424698e-05, 'samples': 20960640, 'steps': 109169, 'loss/train': 1.5156224966049194} 11/07/2021 12:35:55 - INFO - __main__ - Step 109171: {'lr': 8.8163191842587e-05, 'samples': 20960832, 'steps': 109170, 'loss/train': 2.450629711151123} 11/07/2021 12:35:55 - INFO - __main__ - Step 109172: {'lr': 8.815914710384821e-05, 'samples': 20961024, 'steps': 109171, 'loss/train': 1.5006414651870728} 11/07/2021 12:35:56 - INFO - __main__ - Step 109173: {'lr': 8.815510243803238e-05, 'samples': 20961216, 'steps': 109172, 'loss/train': 1.4148963689804077} 11/07/2021 12:35:57 - INFO - __main__ - Step 109174: {'lr': 8.815105784514139e-05, 'samples': 20961408, 'steps': 109173, 'loss/train': 1.3523893356323242} 11/07/2021 12:35:57 - INFO - __main__ - Step 109175: {'lr': 8.814701332517702e-05, 'samples': 20961600, 'steps': 109174, 'loss/train': 1.259416103363037} 11/07/2021 12:35:57 - INFO - __main__ - Step 109176: {'lr': 8.814296887814122e-05, 'samples': 20961792, 'steps': 109175, 'loss/train': 1.4700299501419067} 11/07/2021 12:35:58 - INFO - __main__ - Step 109177: {'lr': 8.81389245040356e-05, 'samples': 20961984, 'steps': 109176, 'loss/train': 0.4254362881183624} 11/07/2021 12:35:59 - INFO - __main__ - Step 109178: {'lr': 8.813488020286206e-05, 'samples': 20962176, 'steps': 109177, 'loss/train': 1.0840657949447632} 11/07/2021 12:35:59 - INFO - __main__ - Step 109179: {'lr': 8.813083597462249e-05, 'samples': 20962368, 'steps': 109178, 'loss/train': 1.45475435256958} 11/07/2021 12:35:59 - INFO - __main__ - Step 109180: {'lr': 8.812679181931863e-05, 'samples': 20962560, 'steps': 109179, 'loss/train': 1.4821555614471436} 11/07/2021 12:36:00 - INFO - __main__ - Step 109181: {'lr': 8.812274773695236e-05, 'samples': 20962752, 'steps': 109180, 'loss/train': 1.110040307044983} 11/07/2021 12:36:00 - INFO - __main__ - Step 109182: {'lr': 8.811870372752548e-05, 'samples': 20962944, 'steps': 109181, 'loss/train': 1.2753137350082397} 11/07/2021 12:36:00 - INFO - __main__ - Step 109183: {'lr': 8.81146597910398e-05, 'samples': 20963136, 'steps': 109182, 'loss/train': 1.1207315921783447} 11/07/2021 12:36:02 - INFO - __main__ - Step 109184: {'lr': 8.811061592749716e-05, 'samples': 20963328, 'steps': 109183, 'loss/train': 1.3326085805892944} 11/07/2021 12:36:02 - INFO - __main__ - Step 109185: {'lr': 8.810657213689938e-05, 'samples': 20963520, 'steps': 109184, 'loss/train': 1.4117960929870605} 11/07/2021 12:36:02 - INFO - __main__ - Step 109186: {'lr': 8.810252841924829e-05, 'samples': 20963712, 'steps': 109185, 'loss/train': 1.5294767618179321} 11/07/2021 12:36:03 - INFO - __main__ - Step 109187: {'lr': 8.809848477454568e-05, 'samples': 20963904, 'steps': 109186, 'loss/train': 1.543724775314331} 11/07/2021 12:36:03 - INFO - __main__ - Step 109188: {'lr': 8.809444120279342e-05, 'samples': 20964096, 'steps': 109187, 'loss/train': 1.335805892944336} 11/07/2021 12:36:04 - INFO - __main__ - Step 109189: {'lr': 8.809039770399329e-05, 'samples': 20964288, 'steps': 109188, 'loss/train': 1.1938774585723877} 11/07/2021 12:36:04 - INFO - __main__ - Step 109190: {'lr': 8.808635427814723e-05, 'samples': 20964480, 'steps': 109189, 'loss/train': 1.37705659866333} 11/07/2021 12:36:05 - INFO - __main__ - Step 109191: {'lr': 8.808231092525687e-05, 'samples': 20964672, 'steps': 109190, 'loss/train': 1.6230615377426147} 11/07/2021 12:36:05 - INFO - __main__ - Step 109192: {'lr': 8.807826764532412e-05, 'samples': 20964864, 'steps': 109191, 'loss/train': 0.4264315366744995} 11/07/2021 12:36:05 - INFO - __main__ - Step 109193: {'lr': 8.807422443835081e-05, 'samples': 20965056, 'steps': 109192, 'loss/train': 1.2493951320648193} 11/07/2021 12:36:07 - INFO - __main__ - Step 109194: {'lr': 8.807018130433875e-05, 'samples': 20965248, 'steps': 109193, 'loss/train': 1.4304004907608032} 11/07/2021 12:36:07 - INFO - __main__ - Step 109195: {'lr': 8.806613824328976e-05, 'samples': 20965440, 'steps': 109194, 'loss/train': 1.329346776008606} 11/07/2021 12:36:07 - INFO - __main__ - Step 109196: {'lr': 8.806209525520567e-05, 'samples': 20965632, 'steps': 109195, 'loss/train': 1.4774609804153442} 11/07/2021 12:36:08 - INFO - __main__ - Step 109197: {'lr': 8.805805234008832e-05, 'samples': 20965824, 'steps': 109196, 'loss/train': 1.2884776592254639} 11/07/2021 12:36:08 - INFO - __main__ - Step 109198: {'lr': 8.805400949793948e-05, 'samples': 20966016, 'steps': 109197, 'loss/train': 0.8668197393417358} 11/07/2021 12:36:09 - INFO - __main__ - Step 109199: {'lr': 8.804996672876103e-05, 'samples': 20966208, 'steps': 109198, 'loss/train': 1.286667823791504} 11/07/2021 12:36:09 - INFO - __main__ - Step 109200: {'lr': 8.804592403255477e-05, 'samples': 20966400, 'steps': 109199, 'loss/train': 1.3266550302505493} 11/07/2021 12:36:10 - INFO - __main__ - Step 109201: {'lr': 8.804188140932252e-05, 'samples': 20966592, 'steps': 109200, 'loss/train': 1.2024890184402466} 11/07/2021 12:36:10 - INFO - __main__ - Step 109202: {'lr': 8.803783885906608e-05, 'samples': 20966784, 'steps': 109201, 'loss/train': 1.1307777166366577} 11/07/2021 12:36:10 - INFO - __main__ - Step 109203: {'lr': 8.803379638178735e-05, 'samples': 20966976, 'steps': 109202, 'loss/train': 0.5964301228523254} 11/07/2021 12:36:11 - INFO - __main__ - Step 109204: {'lr': 8.802975397748805e-05, 'samples': 20967168, 'steps': 109203, 'loss/train': 1.795189380645752} 11/07/2021 12:36:12 - INFO - __main__ - Step 109205: {'lr': 8.802571164617004e-05, 'samples': 20967360, 'steps': 109204, 'loss/train': 1.5588104724884033} 11/07/2021 12:36:12 - INFO - __main__ - Step 109206: {'lr': 8.802166938783512e-05, 'samples': 20967552, 'steps': 109205, 'loss/train': 2.1491165161132812} 11/07/2021 12:36:13 - INFO - __main__ - Step 109207: {'lr': 8.801762720248516e-05, 'samples': 20967744, 'steps': 109206, 'loss/train': 0.894626796245575} 11/07/2021 12:36:13 - INFO - __main__ - Step 109208: {'lr': 8.801358509012194e-05, 'samples': 20967936, 'steps': 109207, 'loss/train': 1.492684006690979} 11/07/2021 12:36:13 - INFO - __main__ - Step 109209: {'lr': 8.800954305074732e-05, 'samples': 20968128, 'steps': 109208, 'loss/train': 0.12886196374893188} 11/07/2021 12:36:14 - INFO - __main__ - Step 109210: {'lr': 8.800550108436308e-05, 'samples': 20968320, 'steps': 109209, 'loss/train': 1.5170422792434692} 11/07/2021 12:36:15 - INFO - __main__ - Step 109211: {'lr': 8.800145919097108e-05, 'samples': 20968512, 'steps': 109210, 'loss/train': 1.3908954858779907} 11/07/2021 12:36:15 - INFO - __main__ - Step 109212: {'lr': 8.799741737057313e-05, 'samples': 20968704, 'steps': 109211, 'loss/train': 1.3393627405166626} 11/07/2021 12:36:16 - INFO - __main__ - Step 109213: {'lr': 8.799337562317103e-05, 'samples': 20968896, 'steps': 109212, 'loss/train': 1.2760534286499023} 11/07/2021 12:36:16 - INFO - __main__ - Step 109214: {'lr': 8.79893339487666e-05, 'samples': 20969088, 'steps': 109213, 'loss/train': 1.4061487913131714} 11/07/2021 12:36:17 - INFO - __main__ - Step 109215: {'lr': 8.798529234736168e-05, 'samples': 20969280, 'steps': 109214, 'loss/train': 1.1332602500915527} 11/07/2021 12:36:17 - INFO - __main__ - Step 109216: {'lr': 8.79812508189581e-05, 'samples': 20969472, 'steps': 109215, 'loss/train': 1.0226420164108276} 11/07/2021 12:36:18 - INFO - __main__ - Step 109217: {'lr': 8.797720936355777e-05, 'samples': 20969664, 'steps': 109216, 'loss/train': 0.42321696877479553} 11/07/2021 12:36:18 - INFO - __main__ - Step 109218: {'lr': 8.797316798116228e-05, 'samples': 20969856, 'steps': 109217, 'loss/train': 1.6680492162704468} 11/07/2021 12:36:18 - INFO - __main__ - Step 109219: {'lr': 8.796912667177361e-05, 'samples': 20970048, 'steps': 109218, 'loss/train': 1.5968403816223145} 11/07/2021 12:36:19 - INFO - __main__ - Step 109220: {'lr': 8.796508543539356e-05, 'samples': 20970240, 'steps': 109219, 'loss/train': 1.311609148979187} 11/07/2021 12:36:20 - INFO - __main__ - Step 109221: {'lr': 8.796104427202392e-05, 'samples': 20970432, 'steps': 109220, 'loss/train': 1.6429259777069092} 11/07/2021 12:36:20 - INFO - __main__ - Step 109222: {'lr': 8.795700318166654e-05, 'samples': 20970624, 'steps': 109221, 'loss/train': 1.5453715324401855} 11/07/2021 12:36:20 - INFO - __main__ - Step 109223: {'lr': 8.795296216432325e-05, 'samples': 20970816, 'steps': 109222, 'loss/train': 0.9338439106941223} 11/07/2021 12:36:21 - INFO - __main__ - Step 109224: {'lr': 8.794892121999581e-05, 'samples': 20971008, 'steps': 109223, 'loss/train': 1.2101905345916748} 11/07/2021 12:36:21 - INFO - __main__ - Step 109225: {'lr': 8.794488034868614e-05, 'samples': 20971200, 'steps': 109224, 'loss/train': 1.4048653841018677} 11/07/2021 12:36:22 - INFO - __main__ - Step 109226: {'lr': 8.794083955039597e-05, 'samples': 20971392, 'steps': 109225, 'loss/train': 1.2967745065689087} 11/07/2021 12:36:23 - INFO - __main__ - Step 109227: {'lr': 8.793679882512717e-05, 'samples': 20971584, 'steps': 109226, 'loss/train': 1.052297830581665} 11/07/2021 12:36:23 - INFO - __main__ - Step 109228: {'lr': 8.793275817288154e-05, 'samples': 20971776, 'steps': 109227, 'loss/train': 1.0834318399429321} 11/07/2021 12:36:23 - INFO - __main__ - Step 109229: {'lr': 8.792871759366091e-05, 'samples': 20971968, 'steps': 109228, 'loss/train': 1.1783485412597656} 11/07/2021 12:36:24 - INFO - __main__ - Step 109230: {'lr': 8.792467708746721e-05, 'samples': 20972160, 'steps': 109229, 'loss/train': 1.4147319793701172} 11/07/2021 12:36:25 - INFO - __main__ - Step 109231: {'lr': 8.792063665430203e-05, 'samples': 20972352, 'steps': 109230, 'loss/train': 1.119420051574707} 11/07/2021 12:36:25 - INFO - __main__ - Step 109232: {'lr': 8.791659629416731e-05, 'samples': 20972544, 'steps': 109231, 'loss/train': 1.3370862007141113} 11/07/2021 12:36:25 - INFO - __main__ - Step 109233: {'lr': 8.791255600706489e-05, 'samples': 20972736, 'steps': 109232, 'loss/train': 1.1186611652374268} 11/07/2021 12:36:26 - INFO - __main__ - Step 109234: {'lr': 8.790851579299658e-05, 'samples': 20972928, 'steps': 109233, 'loss/train': 1.3022997379302979} 11/07/2021 12:36:26 - INFO - __main__ - Step 109235: {'lr': 8.790447565196416e-05, 'samples': 20973120, 'steps': 109234, 'loss/train': 1.1969963312149048} 11/07/2021 12:36:27 - INFO - __main__ - Step 109236: {'lr': 8.790043558396951e-05, 'samples': 20973312, 'steps': 109235, 'loss/train': 1.2976419925689697} 11/07/2021 12:36:28 - INFO - __main__ - Step 109237: {'lr': 8.789639558901441e-05, 'samples': 20973504, 'steps': 109236, 'loss/train': 1.0064427852630615} 11/07/2021 12:36:28 - INFO - __main__ - Step 109238: {'lr': 8.789235566710069e-05, 'samples': 20973696, 'steps': 109237, 'loss/train': 1.5629326105117798} 11/07/2021 12:36:28 - INFO - __main__ - Step 109239: {'lr': 8.788831581823018e-05, 'samples': 20973888, 'steps': 109238, 'loss/train': 1.1719326972961426} 11/07/2021 12:36:29 - INFO - __main__ - Step 109240: {'lr': 8.788427604240467e-05, 'samples': 20974080, 'steps': 109239, 'loss/train': 0.8773406147956848} 11/07/2021 12:36:29 - INFO - __main__ - Step 109241: {'lr': 8.788023633962603e-05, 'samples': 20974272, 'steps': 109240, 'loss/train': 1.0200634002685547} 11/07/2021 12:36:30 - INFO - __main__ - Step 109242: {'lr': 8.787619670989605e-05, 'samples': 20974464, 'steps': 109241, 'loss/train': 1.7172555923461914} 11/07/2021 12:36:30 - INFO - __main__ - Step 109243: {'lr': 8.787215715321656e-05, 'samples': 20974656, 'steps': 109242, 'loss/train': 1.2568172216415405} 11/07/2021 12:36:31 - INFO - __main__ - Step 109244: {'lr': 8.786811766958944e-05, 'samples': 20974848, 'steps': 109243, 'loss/train': 1.177657127380371} 11/07/2021 12:36:31 - INFO - __main__ - Step 109245: {'lr': 8.786407825901638e-05, 'samples': 20975040, 'steps': 109244, 'loss/train': 1.2474064826965332} 11/07/2021 12:36:31 - INFO - __main__ - Step 109246: {'lr': 8.786003892149925e-05, 'samples': 20975232, 'steps': 109245, 'loss/train': 1.6186323165893555} 11/07/2021 12:36:32 - INFO - __main__ - Step 109247: {'lr': 8.785599965703989e-05, 'samples': 20975424, 'steps': 109246, 'loss/train': 1.3989784717559814} 11/07/2021 12:36:33 - INFO - __main__ - Step 109248: {'lr': 8.785196046564012e-05, 'samples': 20975616, 'steps': 109247, 'loss/train': 1.2874155044555664} 11/07/2021 12:36:33 - INFO - __main__ - Step 109249: {'lr': 8.784792134730174e-05, 'samples': 20975808, 'steps': 109248, 'loss/train': 1.0466768741607666} 11/07/2021 12:36:33 - INFO - __main__ - Step 109250: {'lr': 8.784388230202658e-05, 'samples': 20976000, 'steps': 109249, 'loss/train': 1.1021721363067627} 11/07/2021 12:36:34 - INFO - __main__ - Step 109251: {'lr': 8.783984332981649e-05, 'samples': 20976192, 'steps': 109250, 'loss/train': 1.6432783603668213} 11/07/2021 12:36:35 - INFO - __main__ - Step 109252: {'lr': 8.783580443067324e-05, 'samples': 20976384, 'steps': 109251, 'loss/train': 0.9189971089363098} 11/07/2021 12:36:35 - INFO - __main__ - Step 109253: {'lr': 8.783176560459869e-05, 'samples': 20976576, 'steps': 109252, 'loss/train': 1.0097557306289673} 11/07/2021 12:36:35 - INFO - __main__ - Step 109254: {'lr': 8.782772685159463e-05, 'samples': 20976768, 'steps': 109253, 'loss/train': 0.994839608669281} 11/07/2021 12:36:36 - INFO - __main__ - Step 109255: {'lr': 8.782368817166291e-05, 'samples': 20976960, 'steps': 109254, 'loss/train': 1.480065941810608} 11/07/2021 12:36:36 - INFO - __main__ - Step 109256: {'lr': 8.781964956480531e-05, 'samples': 20977152, 'steps': 109255, 'loss/train': 1.4149857759475708} 11/07/2021 12:36:37 - INFO - __main__ - Step 109257: {'lr': 8.781561103102378e-05, 'samples': 20977344, 'steps': 109256, 'loss/train': 1.2777003049850464} 11/07/2021 12:36:38 - INFO - __main__ - Step 109258: {'lr': 8.781157257031996e-05, 'samples': 20977536, 'steps': 109257, 'loss/train': 1.0246638059616089} 11/07/2021 12:36:38 - INFO - __main__ - Step 109259: {'lr': 8.780753418269571e-05, 'samples': 20977728, 'steps': 109258, 'loss/train': 1.0088857412338257} 11/07/2021 12:36:38 - INFO - __main__ - Step 109260: {'lr': 8.78034958681529e-05, 'samples': 20977920, 'steps': 109259, 'loss/train': 1.575204849243164} 11/07/2021 12:36:39 - INFO - __main__ - Step 109261: {'lr': 8.779945762669334e-05, 'samples': 20978112, 'steps': 109260, 'loss/train': 1.0068496465682983} 11/07/2021 12:36:40 - INFO - __main__ - Step 109262: {'lr': 8.779541945831881e-05, 'samples': 20978304, 'steps': 109261, 'loss/train': 1.1373591423034668} 11/07/2021 12:36:40 - INFO - __main__ - Step 109263: {'lr': 8.779138136303119e-05, 'samples': 20978496, 'steps': 109262, 'loss/train': 1.0737124681472778} 11/07/2021 12:36:40 - INFO - __main__ - Step 109264: {'lr': 8.778734334083226e-05, 'samples': 20978688, 'steps': 109263, 'loss/train': 1.0456408262252808} 11/07/2021 12:36:41 - INFO - __main__ - Step 109265: {'lr': 8.778330539172385e-05, 'samples': 20978880, 'steps': 109264, 'loss/train': 1.5521552562713623} 11/07/2021 12:36:41 - INFO - __main__ - Step 109266: {'lr': 8.77792675157078e-05, 'samples': 20979072, 'steps': 109265, 'loss/train': 0.9325183033943176} 11/07/2021 12:36:42 - INFO - __main__ - Step 109267: {'lr': 8.777522971278587e-05, 'samples': 20979264, 'steps': 109266, 'loss/train': 1.149326205253601} 11/07/2021 12:36:42 - INFO - __main__ - Step 109268: {'lr': 8.777119198295996e-05, 'samples': 20979456, 'steps': 109267, 'loss/train': 1.3742014169692993} 11/07/2021 12:36:43 - INFO - __main__ - Step 109269: {'lr': 8.776715432623181e-05, 'samples': 20979648, 'steps': 109268, 'loss/train': 1.50819730758667} 11/07/2021 12:36:43 - INFO - __main__ - Step 109270: {'lr': 8.776311674260329e-05, 'samples': 20979840, 'steps': 109269, 'loss/train': 1.4054707288742065} 11/07/2021 12:36:43 - INFO - __main__ - Step 109271: {'lr': 8.775907923207629e-05, 'samples': 20980032, 'steps': 109270, 'loss/train': 1.1825008392333984} 11/07/2021 12:36:45 - INFO - __main__ - Step 109272: {'lr': 8.775504179465249e-05, 'samples': 20980224, 'steps': 109271, 'loss/train': 1.3960927724838257} 11/07/2021 12:36:45 - INFO - __main__ - Step 109273: {'lr': 8.775100443033374e-05, 'samples': 20980416, 'steps': 109272, 'loss/train': 1.2688570022583008} 11/07/2021 12:36:45 - INFO - __main__ - Step 109274: {'lr': 8.774696713912186e-05, 'samples': 20980608, 'steps': 109273, 'loss/train': 1.2078659534454346} 11/07/2021 12:36:46 - INFO - __main__ - Step 109275: {'lr': 8.774292992101873e-05, 'samples': 20980800, 'steps': 109274, 'loss/train': 1.0976594686508179} 11/07/2021 12:36:46 - INFO - __main__ - Step 109276: {'lr': 8.773889277602611e-05, 'samples': 20980992, 'steps': 109275, 'loss/train': 0.6759865880012512} 11/07/2021 12:36:47 - INFO - __main__ - Step 109277: {'lr': 8.773485570414586e-05, 'samples': 20981184, 'steps': 109276, 'loss/train': 1.1144182682037354} 11/07/2021 12:36:47 - INFO - __main__ - Step 109278: {'lr': 8.773081870537978e-05, 'samples': 20981376, 'steps': 109277, 'loss/train': 1.1654938459396362} 11/07/2021 12:36:48 - INFO - __main__ - Step 109279: {'lr': 8.772678177972968e-05, 'samples': 20981568, 'steps': 109278, 'loss/train': 1.200194239616394} 11/07/2021 12:36:48 - INFO - __main__ - Step 109280: {'lr': 8.772274492719737e-05, 'samples': 20981760, 'steps': 109279, 'loss/train': 1.3017443418502808} 11/07/2021 12:36:48 - INFO - __main__ - Step 109281: {'lr': 8.77187081477847e-05, 'samples': 20981952, 'steps': 109280, 'loss/train': 0.25941798090934753} 11/07/2021 12:36:49 - INFO - __main__ - Step 109282: {'lr': 8.771467144149348e-05, 'samples': 20982144, 'steps': 109281, 'loss/train': 1.2889400720596313} 11/07/2021 12:36:50 - INFO - __main__ - Step 109283: {'lr': 8.771063480832553e-05, 'samples': 20982336, 'steps': 109282, 'loss/train': 1.6476829051971436} 11/07/2021 12:36:50 - INFO - __main__ - Step 109284: {'lr': 8.770659824828276e-05, 'samples': 20982528, 'steps': 109283, 'loss/train': 1.3712842464447021} 11/07/2021 12:36:50 - INFO - __main__ - Step 109285: {'lr': 8.770256176136676e-05, 'samples': 20982720, 'steps': 109284, 'loss/train': 1.5889657735824585} 11/07/2021 12:36:51 - INFO - __main__ - Step 109286: {'lr': 8.769852534757953e-05, 'samples': 20982912, 'steps': 109285, 'loss/train': 0.8874640464782715} 11/07/2021 12:36:52 - INFO - __main__ - Step 109287: {'lr': 8.769448900692281e-05, 'samples': 20983104, 'steps': 109286, 'loss/train': 1.0997709035873413} 11/07/2021 12:36:52 - INFO - __main__ - Step 109288: {'lr': 8.769045273939846e-05, 'samples': 20983296, 'steps': 109287, 'loss/train': 1.5511282682418823} 11/07/2021 12:36:53 - INFO - __main__ - Step 109289: {'lr': 8.768641654500828e-05, 'samples': 20983488, 'steps': 109288, 'loss/train': 1.5116844177246094} 11/07/2021 12:36:53 - INFO - __main__ - Step 109290: {'lr': 8.768238042375412e-05, 'samples': 20983680, 'steps': 109289, 'loss/train': 1.4706076383590698} 11/07/2021 12:36:53 - INFO - __main__ - Step 109291: {'lr': 8.767834437563776e-05, 'samples': 20983872, 'steps': 109290, 'loss/train': 1.1738113164901733} 11/07/2021 12:36:54 - INFO - __main__ - Step 109292: {'lr': 8.767430840066103e-05, 'samples': 20984064, 'steps': 109291, 'loss/train': 0.9548802971839905} 11/07/2021 12:36:55 - INFO - __main__ - Step 109293: {'lr': 8.767027249882576e-05, 'samples': 20984256, 'steps': 109292, 'loss/train': 1.2139523029327393} 11/07/2021 12:36:55 - INFO - __main__ - Step 109294: {'lr': 8.766623667013374e-05, 'samples': 20984448, 'steps': 109293, 'loss/train': 1.4300402402877808} 11/07/2021 12:36:56 - INFO - __main__ - Step 109295: {'lr': 8.766220091458682e-05, 'samples': 20984640, 'steps': 109294, 'loss/train': 1.4884344339370728} 11/07/2021 12:36:56 - INFO - __main__ - Step 109296: {'lr': 8.765816523218681e-05, 'samples': 20984832, 'steps': 109295, 'loss/train': 1.6067272424697876} 11/07/2021 12:36:56 - INFO - __main__ - Step 109297: {'lr': 8.765412962293562e-05, 'samples': 20985024, 'steps': 109296, 'loss/train': 1.4936376810073853} 11/07/2021 12:36:57 - INFO - __main__ - Step 109298: {'lr': 8.765009408683488e-05, 'samples': 20985216, 'steps': 109297, 'loss/train': 0.45345208048820496} 11/07/2021 12:36:58 - INFO - __main__ - Step 109299: {'lr': 8.76460586238865e-05, 'samples': 20985408, 'steps': 109298, 'loss/train': 1.121160864830017} 11/07/2021 12:36:58 - INFO - __main__ - Step 109300: {'lr': 8.764202323409232e-05, 'samples': 20985600, 'steps': 109299, 'loss/train': 1.5431116819381714} 11/07/2021 12:36:59 - INFO - __main__ - Step 109301: {'lr': 8.763798791745412e-05, 'samples': 20985792, 'steps': 109300, 'loss/train': 0.15662230551242828} 11/07/2021 12:36:59 - INFO - __main__ - Step 109302: {'lr': 8.763395267397373e-05, 'samples': 20985984, 'steps': 109301, 'loss/train': 1.4700989723205566} 11/07/2021 12:36:59 - INFO - __main__ - Step 109303: {'lr': 8.762991750365298e-05, 'samples': 20986176, 'steps': 109302, 'loss/train': 1.5843350887298584} 11/07/2021 12:37:00 - INFO - __main__ - Step 109304: {'lr': 8.762588240649369e-05, 'samples': 20986368, 'steps': 109303, 'loss/train': 1.4736545085906982} 11/07/2021 12:37:01 - INFO - __main__ - Step 109305: {'lr': 8.762184738249767e-05, 'samples': 20986560, 'steps': 109304, 'loss/train': 1.2483137845993042} 11/07/2021 12:37:01 - INFO - __main__ - Step 109306: {'lr': 8.761781243166675e-05, 'samples': 20986752, 'steps': 109305, 'loss/train': 1.1354968547821045} 11/07/2021 12:37:01 - INFO - __main__ - Step 109307: {'lr': 8.761377755400271e-05, 'samples': 20986944, 'steps': 109306, 'loss/train': 1.7483121156692505} 11/07/2021 12:37:02 - INFO - __main__ - Step 109308: {'lr': 8.760974274950741e-05, 'samples': 20987136, 'steps': 109307, 'loss/train': 1.390002965927124} 11/07/2021 12:37:03 - INFO - __main__ - Step 109309: {'lr': 8.760570801818266e-05, 'samples': 20987328, 'steps': 109308, 'loss/train': 0.6433727741241455} 11/07/2021 12:37:03 - INFO - __main__ - Step 109310: {'lr': 8.760167336003028e-05, 'samples': 20987520, 'steps': 109309, 'loss/train': 1.5728964805603027} 11/07/2021 12:37:04 - INFO - __main__ - Step 109311: {'lr': 8.759763877505214e-05, 'samples': 20987712, 'steps': 109310, 'loss/train': 1.5908392667770386} 11/07/2021 12:37:04 - INFO - __main__ - Step 109312: {'lr': 8.759360426324994e-05, 'samples': 20987904, 'steps': 109311, 'loss/train': 1.2297539710998535} 11/07/2021 12:37:04 - INFO - __main__ - Step 109313: {'lr': 8.758956982462555e-05, 'samples': 20988096, 'steps': 109312, 'loss/train': 1.4686354398727417} 11/07/2021 12:37:05 - INFO - __main__ - Step 109314: {'lr': 8.758553545918077e-05, 'samples': 20988288, 'steps': 109313, 'loss/train': 1.0323902368545532} 11/07/2021 12:37:06 - INFO - __main__ - Step 109315: {'lr': 8.758150116691746e-05, 'samples': 20988480, 'steps': 109314, 'loss/train': 1.0805624723434448} 11/07/2021 12:37:06 - INFO - __main__ - Step 109316: {'lr': 8.757746694783744e-05, 'samples': 20988672, 'steps': 109315, 'loss/train': 1.2884920835494995} 11/07/2021 12:37:06 - INFO - __main__ - Step 109317: {'lr': 8.757343280194246e-05, 'samples': 20988864, 'steps': 109316, 'loss/train': 1.4229761362075806} 11/07/2021 12:37:07 - INFO - __main__ - Step 109318: {'lr': 8.756939872923442e-05, 'samples': 20989056, 'steps': 109317, 'loss/train': 1.4362218379974365} 11/07/2021 12:37:07 - INFO - __main__ - Step 109319: {'lr': 8.75653647297151e-05, 'samples': 20989248, 'steps': 109318, 'loss/train': 1.2483711242675781} 11/07/2021 12:37:08 - INFO - __main__ - Step 109320: {'lr': 8.75613308033863e-05, 'samples': 20989440, 'steps': 109319, 'loss/train': 1.3251339197158813} 11/07/2021 12:37:09 - INFO - __main__ - Step 109321: {'lr': 8.755729695024989e-05, 'samples': 20989632, 'steps': 109320, 'loss/train': 1.0727185010910034} 11/07/2021 12:37:09 - INFO - __main__ - Step 109322: {'lr': 8.755326317030763e-05, 'samples': 20989824, 'steps': 109321, 'loss/train': 2.253906011581421} 11/07/2021 12:37:09 - INFO - __main__ - Step 109323: {'lr': 8.754922946356136e-05, 'samples': 20990016, 'steps': 109322, 'loss/train': 0.8005405068397522} 11/07/2021 12:37:10 - INFO - __main__ - Step 109324: {'lr': 8.7545195830013e-05, 'samples': 20990208, 'steps': 109323, 'loss/train': 1.0746431350708008} 11/07/2021 12:37:11 - INFO - __main__ - Step 109325: {'lr': 8.754116226966418e-05, 'samples': 20990400, 'steps': 109324, 'loss/train': 1.1986804008483887} 11/07/2021 12:37:11 - INFO - __main__ - Step 109326: {'lr': 8.75371287825168e-05, 'samples': 20990592, 'steps': 109325, 'loss/train': 1.0010579824447632} 11/07/2021 12:37:12 - INFO - __main__ - Step 109327: {'lr': 8.753309536857268e-05, 'samples': 20990784, 'steps': 109326, 'loss/train': 0.7305944561958313} 11/07/2021 12:37:12 - INFO - __main__ - Step 109328: {'lr': 8.752906202783364e-05, 'samples': 20990976, 'steps': 109327, 'loss/train': 1.4052610397338867} 11/07/2021 12:37:12 - INFO - __main__ - Step 109329: {'lr': 8.752502876030153e-05, 'samples': 20991168, 'steps': 109328, 'loss/train': 1.350021243095398} 11/07/2021 12:37:13 - INFO - __main__ - Step 109330: {'lr': 8.752099556597809e-05, 'samples': 20991360, 'steps': 109329, 'loss/train': 1.3236488103866577} 11/07/2021 12:37:14 - INFO - __main__ - Step 109331: {'lr': 8.751696244486521e-05, 'samples': 20991552, 'steps': 109330, 'loss/train': 1.1577725410461426} 11/07/2021 12:37:14 - INFO - __main__ - Step 109332: {'lr': 8.751292939696467e-05, 'samples': 20991744, 'steps': 109331, 'loss/train': 1.1865663528442383} 11/07/2021 12:37:15 - INFO - __main__ - Step 109333: {'lr': 8.75088964222783e-05, 'samples': 20991936, 'steps': 109332, 'loss/train': 1.616316556930542} 11/07/2021 12:37:15 - INFO - __main__ - Step 109334: {'lr': 8.750486352080789e-05, 'samples': 20992128, 'steps': 109333, 'loss/train': 1.1767194271087646} 11/07/2021 12:37:15 - INFO - __main__ - Step 109335: {'lr': 8.750083069255532e-05, 'samples': 20992320, 'steps': 109334, 'loss/train': 1.3634769916534424} 11/07/2021 12:37:16 - INFO - __main__ - Step 109336: {'lr': 8.749679793752232e-05, 'samples': 20992512, 'steps': 109335, 'loss/train': 0.8126884698867798} 11/07/2021 12:37:17 - INFO - __main__ - Step 109337: {'lr': 8.749276525571082e-05, 'samples': 20992704, 'steps': 109336, 'loss/train': 1.1952756643295288} 11/07/2021 12:37:17 - INFO - __main__ - Step 109338: {'lr': 8.748873264712259e-05, 'samples': 20992896, 'steps': 109337, 'loss/train': 1.4786357879638672} 11/07/2021 12:37:17 - INFO - __main__ - Step 109339: {'lr': 8.748470011175938e-05, 'samples': 20993088, 'steps': 109338, 'loss/train': 0.9314643144607544} 11/07/2021 12:37:18 - INFO - __main__ - Step 109340: {'lr': 8.748066764962307e-05, 'samples': 20993280, 'steps': 109339, 'loss/train': 1.2987823486328125} 11/07/2021 12:37:19 - INFO - __main__ - Step 109341: {'lr': 8.747663526071545e-05, 'samples': 20993472, 'steps': 109340, 'loss/train': 1.2261989116668701} 11/07/2021 12:37:19 - INFO - __main__ - Step 109342: {'lr': 8.747260294503834e-05, 'samples': 20993664, 'steps': 109341, 'loss/train': 1.3386640548706055} 11/07/2021 12:37:19 - INFO - __main__ - Step 109343: {'lr': 8.746857070259356e-05, 'samples': 20993856, 'steps': 109342, 'loss/train': 1.7368181943893433} 11/07/2021 12:37:20 - INFO - __main__ - Step 109344: {'lr': 8.746453853338296e-05, 'samples': 20994048, 'steps': 109343, 'loss/train': 1.7280049324035645} 11/07/2021 12:37:21 - INFO - __main__ - Step 109345: {'lr': 8.746050643740833e-05, 'samples': 20994240, 'steps': 109344, 'loss/train': 0.9193772077560425} 11/07/2021 12:37:21 - INFO - __main__ - Step 109346: {'lr': 8.745647441467147e-05, 'samples': 20994432, 'steps': 109345, 'loss/train': 1.2906972169876099} 11/07/2021 12:37:22 - INFO - __main__ - Step 109347: {'lr': 8.745244246517423e-05, 'samples': 20994624, 'steps': 109346, 'loss/train': 1.364662528038025} 11/07/2021 12:37:22 - INFO - __main__ - Step 109348: {'lr': 8.74484105889184e-05, 'samples': 20994816, 'steps': 109347, 'loss/train': 0.9430907964706421} 11/07/2021 12:37:22 - INFO - __main__ - Step 109349: {'lr': 8.74443787859058e-05, 'samples': 20995008, 'steps': 109348, 'loss/train': 1.4787893295288086} 11/07/2021 12:37:23 - INFO - __main__ - Step 109350: {'lr': 8.744034705613827e-05, 'samples': 20995200, 'steps': 109349, 'loss/train': 0.9043475389480591} 11/07/2021 12:37:24 - INFO - __main__ - Step 109351: {'lr': 8.743631539961769e-05, 'samples': 20995392, 'steps': 109350, 'loss/train': 1.408238410949707} 11/07/2021 12:37:24 - INFO - __main__ - Step 109352: {'lr': 8.743228381634571e-05, 'samples': 20995584, 'steps': 109351, 'loss/train': 1.2446829080581665} 11/07/2021 12:37:24 - INFO - __main__ - Step 109353: {'lr': 8.742825230632423e-05, 'samples': 20995776, 'steps': 109352, 'loss/train': 1.2870748043060303} 11/07/2021 12:37:25 - INFO - __main__ - Step 109354: {'lr': 8.742422086955509e-05, 'samples': 20995968, 'steps': 109353, 'loss/train': 0.7755106091499329} 11/07/2021 12:37:25 - INFO - __main__ - Step 109355: {'lr': 8.742018950604005e-05, 'samples': 20996160, 'steps': 109354, 'loss/train': 1.3575044870376587} 11/07/2021 12:37:26 - INFO - __main__ - Step 109356: {'lr': 8.7416158215781e-05, 'samples': 20996352, 'steps': 109355, 'loss/train': 1.6010016202926636} 11/07/2021 12:37:26 - INFO - __main__ - Step 109357: {'lr': 8.74121269987797e-05, 'samples': 20996544, 'steps': 109356, 'loss/train': 1.3070656061172485} 11/07/2021 12:37:27 - INFO - __main__ - Step 109358: {'lr': 8.7408095855038e-05, 'samples': 20996736, 'steps': 109357, 'loss/train': 1.1114267110824585} 11/07/2021 12:37:27 - INFO - __main__ - Step 109359: {'lr': 8.74040647845577e-05, 'samples': 20996928, 'steps': 109358, 'loss/train': 1.357845425605774} 11/07/2021 12:37:27 - INFO - __main__ - Step 109360: {'lr': 8.740003378734062e-05, 'samples': 20997120, 'steps': 109359, 'loss/train': 0.9762235283851624} 11/07/2021 12:37:29 - INFO - __main__ - Step 109361: {'lr': 8.739600286338859e-05, 'samples': 20997312, 'steps': 109360, 'loss/train': 1.5888283252716064} 11/07/2021 12:37:29 - INFO - __main__ - Step 109362: {'lr': 8.739197201270338e-05, 'samples': 20997504, 'steps': 109361, 'loss/train': 1.6336231231689453} 11/07/2021 12:37:29 - INFO - __main__ - Step 109363: {'lr': 8.738794123528696e-05, 'samples': 20997696, 'steps': 109362, 'loss/train': 0.09196428954601288} 11/07/2021 12:37:30 - INFO - __main__ - Step 109364: {'lr': 8.738391053114092e-05, 'samples': 20997888, 'steps': 109363, 'loss/train': 1.7606796026229858} 11/07/2021 12:37:30 - INFO - __main__ - Step 109365: {'lr': 8.737987990026717e-05, 'samples': 20998080, 'steps': 109364, 'loss/train': 1.443760633468628} 11/07/2021 12:37:31 - INFO - __main__ - Step 109366: {'lr': 8.737584934266754e-05, 'samples': 20998272, 'steps': 109365, 'loss/train': 1.3465032577514648} 11/07/2021 12:37:32 - INFO - __main__ - Step 109367: {'lr': 8.737181885834386e-05, 'samples': 20998464, 'steps': 109366, 'loss/train': 0.8474177718162537} 11/07/2021 12:37:32 - INFO - __main__ - Step 109368: {'lr': 8.736778844729792e-05, 'samples': 20998656, 'steps': 109367, 'loss/train': 1.3912473917007446} 11/07/2021 12:37:32 - INFO - __main__ - Step 109369: {'lr': 8.736375810953154e-05, 'samples': 20998848, 'steps': 109368, 'loss/train': 1.1375811100006104} 11/07/2021 12:37:33 - INFO - __main__ - Step 109370: {'lr': 8.735972784504654e-05, 'samples': 20999040, 'steps': 109369, 'loss/train': 1.0753841400146484} 11/07/2021 12:37:34 - INFO - __main__ - Step 109371: {'lr': 8.735569765384474e-05, 'samples': 20999232, 'steps': 109370, 'loss/train': 1.5681102275848389} 11/07/2021 12:37:34 - INFO - __main__ - Step 109372: {'lr': 8.735166753592797e-05, 'samples': 20999424, 'steps': 109371, 'loss/train': 1.0124962329864502} 11/07/2021 12:37:34 - INFO - __main__ - Step 109373: {'lr': 8.734763749129809e-05, 'samples': 20999616, 'steps': 109372, 'loss/train': 1.371174931526184} 11/07/2021 12:37:35 - INFO - __main__ - Step 109374: {'lr': 8.734360751995677e-05, 'samples': 20999808, 'steps': 109373, 'loss/train': 1.0197515487670898} 11/07/2021 12:37:35 - INFO - __main__ - Step 109375: {'lr': 8.733957762190593e-05, 'samples': 21000000, 'steps': 109374, 'loss/train': 1.5040127038955688} 11/07/2021 12:37:36 - INFO - __main__ - Step 109376: {'lr': 8.733554779714734e-05, 'samples': 21000192, 'steps': 109375, 'loss/train': 0.8635242581367493} 11/07/2021 12:37:37 - INFO - __main__ - Step 109377: {'lr': 8.733151804568287e-05, 'samples': 21000384, 'steps': 109376, 'loss/train': 1.3385415077209473} 11/07/2021 12:37:37 - INFO - __main__ - Step 109378: {'lr': 8.732748836751427e-05, 'samples': 21000576, 'steps': 109377, 'loss/train': 1.2189593315124512} 11/07/2021 12:37:37 - INFO - __main__ - Step 109379: {'lr': 8.732345876264344e-05, 'samples': 21000768, 'steps': 109378, 'loss/train': 1.604904055595398} 11/07/2021 12:37:38 - INFO - __main__ - Step 109380: {'lr': 8.731942923107211e-05, 'samples': 21000960, 'steps': 109379, 'loss/train': 0.9626331329345703} 11/07/2021 12:37:39 - INFO - __main__ - Step 109381: {'lr': 8.731539977280217e-05, 'samples': 21001152, 'steps': 109380, 'loss/train': 1.1262909173965454} 11/07/2021 12:37:39 - INFO - __main__ - Step 109382: {'lr': 8.731137038783537e-05, 'samples': 21001344, 'steps': 109381, 'loss/train': 1.594778060913086} 11/07/2021 12:37:39 - INFO - __main__ - Step 109383: {'lr': 8.730734107617358e-05, 'samples': 21001536, 'steps': 109382, 'loss/train': 1.2193217277526855} 11/07/2021 12:37:40 - INFO - __main__ - Step 109384: {'lr': 8.730331183781867e-05, 'samples': 21001728, 'steps': 109383, 'loss/train': 1.4077467918395996} 11/07/2021 12:37:40 - INFO - __main__ - Step 109385: {'lr': 8.72992826727723e-05, 'samples': 21001920, 'steps': 109384, 'loss/train': 1.2433514595031738} 11/07/2021 12:37:40 - INFO - __main__ - Step 109386: {'lr': 8.729525358103632e-05, 'samples': 21002112, 'steps': 109385, 'loss/train': 1.4349806308746338} 11/07/2021 12:37:42 - INFO - __main__ - Step 109387: {'lr': 8.729122456261262e-05, 'samples': 21002304, 'steps': 109386, 'loss/train': 1.5014914274215698} 11/07/2021 12:37:42 - INFO - __main__ - Step 109388: {'lr': 8.728719561750298e-05, 'samples': 21002496, 'steps': 109387, 'loss/train': 1.446302056312561} 11/07/2021 12:37:42 - INFO - __main__ - Step 109389: {'lr': 8.728316674570924e-05, 'samples': 21002688, 'steps': 109388, 'loss/train': 1.0825453996658325} 11/07/2021 12:37:43 - INFO - __main__ - Step 109390: {'lr': 8.727913794723316e-05, 'samples': 21002880, 'steps': 109389, 'loss/train': 1.1317572593688965} 11/07/2021 12:37:43 - INFO - __main__ - Step 109391: {'lr': 8.72751092220766e-05, 'samples': 21003072, 'steps': 109390, 'loss/train': 1.169187307357788} 11/07/2021 12:37:44 - INFO - __main__ - Step 109392: {'lr': 8.727108057024138e-05, 'samples': 21003264, 'steps': 109391, 'loss/train': 0.7597986459732056} 11/07/2021 12:37:44 - INFO - __main__ - Step 109393: {'lr': 8.72670519917293e-05, 'samples': 21003456, 'steps': 109392, 'loss/train': 0.8280845880508423} 11/07/2021 12:37:45 - INFO - __main__ - Step 109394: {'lr': 8.726302348654216e-05, 'samples': 21003648, 'steps': 109393, 'loss/train': 1.213230013847351} 11/07/2021 12:37:45 - INFO - __main__ - Step 109395: {'lr': 8.725899505468188e-05, 'samples': 21003840, 'steps': 109394, 'loss/train': 1.8559595346450806} 11/07/2021 12:37:45 - INFO - __main__ - Step 109396: {'lr': 8.72549666961501e-05, 'samples': 21004032, 'steps': 109395, 'loss/train': 1.5475939512252808} 11/07/2021 12:37:46 - INFO - __main__ - Step 109397: {'lr': 8.725093841094873e-05, 'samples': 21004224, 'steps': 109396, 'loss/train': 0.9312816262245178} 11/07/2021 12:37:47 - INFO - __main__ - Step 109398: {'lr': 8.724691019907954e-05, 'samples': 21004416, 'steps': 109397, 'loss/train': 1.341935157775879} 11/07/2021 12:37:47 - INFO - __main__ - Step 109399: {'lr': 8.724288206054443e-05, 'samples': 21004608, 'steps': 109398, 'loss/train': 1.4160226583480835} 11/07/2021 12:37:48 - INFO - __main__ - Step 109400: {'lr': 8.723885399534512e-05, 'samples': 21004800, 'steps': 109399, 'loss/train': 1.2560279369354248} 11/07/2021 12:37:48 - INFO - __main__ - Step 109401: {'lr': 8.72348260034835e-05, 'samples': 21004992, 'steps': 109400, 'loss/train': 0.6388311982154846} 11/07/2021 12:37:49 - INFO - __main__ - Step 109402: {'lr': 8.723079808496135e-05, 'samples': 21005184, 'steps': 109401, 'loss/train': 1.2577670812606812} 11/07/2021 12:37:49 - INFO - __main__ - Step 109403: {'lr': 8.722677023978048e-05, 'samples': 21005376, 'steps': 109402, 'loss/train': 0.4002048671245575} 11/07/2021 12:37:50 - INFO - __main__ - Step 109404: {'lr': 8.722274246794273e-05, 'samples': 21005568, 'steps': 109403, 'loss/train': 1.464889407157898} 11/07/2021 12:37:50 - INFO - __main__ - Step 109405: {'lr': 8.72187147694499e-05, 'samples': 21005760, 'steps': 109404, 'loss/train': 1.3557003736495972} 11/07/2021 12:37:50 - INFO - __main__ - Step 109406: {'lr': 8.72146871443039e-05, 'samples': 21005952, 'steps': 109405, 'loss/train': 1.3011969327926636} 11/07/2021 12:37:51 - INFO - __main__ - Step 109407: {'lr': 8.721065959250635e-05, 'samples': 21006144, 'steps': 109406, 'loss/train': 1.237425446510315} 11/07/2021 12:37:52 - INFO - __main__ - Step 109408: {'lr': 8.720663211405915e-05, 'samples': 21006336, 'steps': 109407, 'loss/train': 1.5795445442199707} 11/07/2021 12:37:52 - INFO - __main__ - Step 109409: {'lr': 8.720260470896416e-05, 'samples': 21006528, 'steps': 109408, 'loss/train': 1.0668994188308716} 11/07/2021 12:37:52 - INFO - __main__ - Step 109410: {'lr': 8.719857737722314e-05, 'samples': 21006720, 'steps': 109409, 'loss/train': 0.8126243948936462} 11/07/2021 12:37:53 - INFO - __main__ - Step 109411: {'lr': 8.719455011883795e-05, 'samples': 21006912, 'steps': 109410, 'loss/train': 1.00425386428833} 11/07/2021 12:37:53 - INFO - __main__ - Step 109412: {'lr': 8.719052293381036e-05, 'samples': 21007104, 'steps': 109411, 'loss/train': 1.9856243133544922} 11/07/2021 12:37:54 - INFO - __main__ - Step 109413: {'lr': 8.718649582214222e-05, 'samples': 21007296, 'steps': 109412, 'loss/train': 1.4896087646484375} 11/07/2021 12:37:55 - INFO - __main__ - Step 109414: {'lr': 8.718246878383535e-05, 'samples': 21007488, 'steps': 109413, 'loss/train': 1.2555949687957764} 11/07/2021 12:37:55 - INFO - __main__ - Step 109415: {'lr': 8.717844181889153e-05, 'samples': 21007680, 'steps': 109414, 'loss/train': 1.3563488721847534} 11/07/2021 12:37:55 - INFO - __main__ - Step 109416: {'lr': 8.717441492731259e-05, 'samples': 21007872, 'steps': 109415, 'loss/train': 0.9712457656860352} 11/07/2021 12:37:56 - INFO - __main__ - Step 109417: {'lr': 8.717038810910035e-05, 'samples': 21008064, 'steps': 109416, 'loss/train': 0.9784360527992249} 11/07/2021 12:37:57 - INFO - __main__ - Step 109418: {'lr': 8.716636136425671e-05, 'samples': 21008256, 'steps': 109417, 'loss/train': 1.2123337984085083} 11/07/2021 12:37:57 - INFO - __main__ - Step 109419: {'lr': 8.716233469278328e-05, 'samples': 21008448, 'steps': 109418, 'loss/train': 1.6838785409927368} 11/07/2021 12:37:58 - INFO - __main__ - Step 109420: {'lr': 8.715830809468203e-05, 'samples': 21008640, 'steps': 109419, 'loss/train': 1.576236605644226} 11/07/2021 12:37:58 - INFO - __main__ - Step 109421: {'lr': 8.715428156995473e-05, 'samples': 21008832, 'steps': 109420, 'loss/train': 1.0567519664764404} 11/07/2021 12:37:58 - INFO - __main__ - Step 109422: {'lr': 8.715025511860317e-05, 'samples': 21009024, 'steps': 109421, 'loss/train': 1.263759732246399} 11/07/2021 12:37:59 - INFO - __main__ - Step 109423: {'lr': 8.714622874062919e-05, 'samples': 21009216, 'steps': 109422, 'loss/train': 1.0159578323364258} 11/07/2021 12:38:00 - INFO - __main__ - Step 109424: {'lr': 8.714220243603462e-05, 'samples': 21009408, 'steps': 109423, 'loss/train': 0.9658799767494202} 11/07/2021 12:38:00 - INFO - __main__ - Step 109425: {'lr': 8.713817620482129e-05, 'samples': 21009600, 'steps': 109424, 'loss/train': 1.2756850719451904} 11/07/2021 12:38:00 - INFO - __main__ - Step 109426: {'lr': 8.713415004699093e-05, 'samples': 21009792, 'steps': 109425, 'loss/train': 0.8828177452087402} 11/07/2021 12:38:01 - INFO - __main__ - Step 109427: {'lr': 8.713012396254547e-05, 'samples': 21009984, 'steps': 109426, 'loss/train': 1.2622092962265015} 11/07/2021 12:38:02 - INFO - __main__ - Step 109428: {'lr': 8.712609795148662e-05, 'samples': 21010176, 'steps': 109427, 'loss/train': 1.4421626329421997} 11/07/2021 12:38:02 - INFO - __main__ - Step 109429: {'lr': 8.712207201381625e-05, 'samples': 21010368, 'steps': 109428, 'loss/train': 1.674971103668213} 11/07/2021 12:38:02 - INFO - __main__ - Step 109430: {'lr': 8.711804614953614e-05, 'samples': 21010560, 'steps': 109429, 'loss/train': 1.388034462928772} 11/07/2021 12:38:03 - INFO - __main__ - Step 109431: {'lr': 8.711402035864815e-05, 'samples': 21010752, 'steps': 109430, 'loss/train': 1.5153969526290894} 11/07/2021 12:38:03 - INFO - __main__ - Step 109432: {'lr': 8.710999464115416e-05, 'samples': 21010944, 'steps': 109431, 'loss/train': 1.4581719636917114} 11/07/2021 12:38:04 - INFO - __main__ - Step 109433: {'lr': 8.710596899705579e-05, 'samples': 21011136, 'steps': 109432, 'loss/train': 1.1307947635650635} 11/07/2021 12:38:05 - INFO - __main__ - Step 109434: {'lr': 8.710194342635495e-05, 'samples': 21011328, 'steps': 109433, 'loss/train': 1.1279562711715698} 11/07/2021 12:38:05 - INFO - __main__ - Step 109435: {'lr': 8.709791792905347e-05, 'samples': 21011520, 'steps': 109434, 'loss/train': 1.440211296081543} 11/07/2021 12:38:05 - INFO - __main__ - Step 109436: {'lr': 8.709389250515314e-05, 'samples': 21011712, 'steps': 109435, 'loss/train': 0.9357926845550537} 11/07/2021 12:38:06 - INFO - __main__ - Step 109437: {'lr': 8.708986715465583e-05, 'samples': 21011904, 'steps': 109436, 'loss/train': 1.3597466945648193} 11/07/2021 12:38:07 - INFO - __main__ - Step 109438: {'lr': 8.708584187756326e-05, 'samples': 21012096, 'steps': 109437, 'loss/train': 1.3745222091674805} 11/07/2021 12:38:07 - INFO - __main__ - Step 109439: {'lr': 8.708181667387736e-05, 'samples': 21012288, 'steps': 109438, 'loss/train': 0.8557381629943848} 11/07/2021 12:38:07 - INFO - __main__ - Step 109440: {'lr': 8.707779154359982e-05, 'samples': 21012480, 'steps': 109439, 'loss/train': 0.9314526319503784} 11/07/2021 12:38:08 - INFO - __main__ - Step 109441: {'lr': 8.707376648673255e-05, 'samples': 21012672, 'steps': 109440, 'loss/train': 1.4401708841323853} 11/07/2021 12:38:08 - INFO - __main__ - Step 109442: {'lr': 8.706974150327732e-05, 'samples': 21012864, 'steps': 109441, 'loss/train': 1.0353684425354004} 11/07/2021 12:38:09 - INFO - __main__ - Step 109443: {'lr': 8.706571659323592e-05, 'samples': 21013056, 'steps': 109442, 'loss/train': 1.2268218994140625} 11/07/2021 12:38:09 - INFO - __main__ - Step 109444: {'lr': 8.706169175661022e-05, 'samples': 21013248, 'steps': 109443, 'loss/train': 1.5545986890792847} 11/07/2021 12:38:10 - INFO - __main__ - Step 109445: {'lr': 8.70576669934021e-05, 'samples': 21013440, 'steps': 109444, 'loss/train': 1.197177767753601} 11/07/2021 12:38:10 - INFO - __main__ - Step 109446: {'lr': 8.705364230361318e-05, 'samples': 21013632, 'steps': 109445, 'loss/train': 1.1665219068527222} 11/07/2021 12:38:10 - INFO - __main__ - Step 109447: {'lr': 8.704961768724537e-05, 'samples': 21013824, 'steps': 109446, 'loss/train': 1.3260515928268433} 11/07/2021 12:38:11 - INFO - __main__ - Step 109448: {'lr': 8.70455931443005e-05, 'samples': 21014016, 'steps': 109447, 'loss/train': 1.3737566471099854} 11/07/2021 12:38:12 - INFO - __main__ - Step 109449: {'lr': 8.704156867478036e-05, 'samples': 21014208, 'steps': 109448, 'loss/train': 1.288751482963562} 11/07/2021 12:38:12 - INFO - __main__ - Step 109450: {'lr': 8.703754427868679e-05, 'samples': 21014400, 'steps': 109449, 'loss/train': 1.4935063123703003} 11/07/2021 12:38:12 - INFO - __main__ - Step 109451: {'lr': 8.703351995602158e-05, 'samples': 21014592, 'steps': 109450, 'loss/train': 0.7415152192115784} 11/07/2021 12:38:13 - INFO - __main__ - Step 109452: {'lr': 8.702949570678656e-05, 'samples': 21014784, 'steps': 109451, 'loss/train': 1.2456947565078735} 11/07/2021 12:38:14 - INFO - __main__ - Step 109453: {'lr': 8.70254715309835e-05, 'samples': 21014976, 'steps': 109452, 'loss/train': 1.3228585720062256} 11/07/2021 12:38:14 - INFO - __main__ - Step 109454: {'lr': 8.702144742861429e-05, 'samples': 21015168, 'steps': 109453, 'loss/train': 1.4092463254928589} 11/07/2021 12:38:15 - INFO - __main__ - Step 109455: {'lr': 8.70174233996807e-05, 'samples': 21015360, 'steps': 109454, 'loss/train': 1.5054681301116943} 11/07/2021 12:38:15 - INFO - __main__ - Step 109456: {'lr': 8.701339944418452e-05, 'samples': 21015552, 'steps': 109455, 'loss/train': 1.2326840162277222} 11/07/2021 12:38:15 - INFO - __main__ - Step 109457: {'lr': 8.70093755621276e-05, 'samples': 21015744, 'steps': 109456, 'loss/train': 1.1854254007339478} 11/07/2021 12:38:16 - INFO - __main__ - Step 109458: {'lr': 8.70053517535117e-05, 'samples': 21015936, 'steps': 109457, 'loss/train': 1.1591154336929321} 11/07/2021 12:38:17 - INFO - __main__ - Step 109459: {'lr': 8.700132801833883e-05, 'samples': 21016128, 'steps': 109458, 'loss/train': 1.780238151550293} 11/07/2021 12:38:17 - INFO - __main__ - Step 109460: {'lr': 8.699730435661052e-05, 'samples': 21016320, 'steps': 109459, 'loss/train': 0.34958088397979736} 11/07/2021 12:38:17 - INFO - __main__ - Step 109461: {'lr': 8.699328076832871e-05, 'samples': 21016512, 'steps': 109460, 'loss/train': 1.5292853116989136} 11/07/2021 12:38:18 - INFO - __main__ - Step 109462: {'lr': 8.698925725349522e-05, 'samples': 21016704, 'steps': 109461, 'loss/train': 0.9809553027153015} 11/07/2021 12:38:18 - INFO - __main__ - Step 109463: {'lr': 8.698523381211185e-05, 'samples': 21016896, 'steps': 109462, 'loss/train': 1.0867112874984741} 11/07/2021 12:38:19 - INFO - __main__ - Step 109464: {'lr': 8.698121044418042e-05, 'samples': 21017088, 'steps': 109463, 'loss/train': 1.1893726587295532} 11/07/2021 12:38:19 - INFO - __main__ - Step 109465: {'lr': 8.697718714970274e-05, 'samples': 21017280, 'steps': 109464, 'loss/train': 1.6146787405014038} 11/07/2021 12:38:20 - INFO - __main__ - Step 109466: {'lr': 8.697316392868063e-05, 'samples': 21017472, 'steps': 109465, 'loss/train': 1.3089274168014526} 11/07/2021 12:38:20 - INFO - __main__ - Step 109467: {'lr': 8.696914078111586e-05, 'samples': 21017664, 'steps': 109466, 'loss/train': 1.446690320968628} 11/07/2021 12:38:21 - INFO - __main__ - Step 109468: {'lr': 8.696511770701032e-05, 'samples': 21017856, 'steps': 109467, 'loss/train': 0.935953676700592} 11/07/2021 12:38:22 - INFO - __main__ - Step 109469: {'lr': 8.696109470636579e-05, 'samples': 21018048, 'steps': 109468, 'loss/train': 1.1791553497314453} 11/07/2021 12:38:22 - INFO - __main__ - Step 109470: {'lr': 8.695707177918405e-05, 'samples': 21018240, 'steps': 109469, 'loss/train': 1.4599695205688477} 11/07/2021 12:38:22 - INFO - __main__ - Step 109471: {'lr': 8.695304892546696e-05, 'samples': 21018432, 'steps': 109470, 'loss/train': 0.9472946524620056} 11/07/2021 12:38:23 - INFO - __main__ - Step 109472: {'lr': 8.694902614521639e-05, 'samples': 21018624, 'steps': 109471, 'loss/train': 0.8974823355674744} 11/07/2021 12:38:23 - INFO - __main__ - Step 109473: {'lr': 8.694500343843395e-05, 'samples': 21018816, 'steps': 109472, 'loss/train': 1.0659451484680176} 11/07/2021 12:38:24 - INFO - __main__ - Step 109474: {'lr': 8.694098080512161e-05, 'samples': 21019008, 'steps': 109473, 'loss/train': 1.4551575183868408} 11/07/2021 12:38:25 - INFO - __main__ - Step 109475: {'lr': 8.693695824528113e-05, 'samples': 21019200, 'steps': 109474, 'loss/train': 0.9056461453437805} 11/07/2021 12:38:25 - INFO - __main__ - Step 109476: {'lr': 8.693293575891437e-05, 'samples': 21019392, 'steps': 109475, 'loss/train': 0.9089435935020447} 11/07/2021 12:38:25 - INFO - __main__ - Step 109477: {'lr': 8.69289133460231e-05, 'samples': 21019584, 'steps': 109476, 'loss/train': 0.834360659122467} 11/07/2021 12:38:26 - INFO - __main__ - Step 109478: {'lr': 8.692489100660913e-05, 'samples': 21019776, 'steps': 109477, 'loss/train': 1.5667290687561035} 11/07/2021 12:38:27 - INFO - __main__ - Step 109479: {'lr': 8.692086874067432e-05, 'samples': 21019968, 'steps': 109478, 'loss/train': 1.0930355787277222} 11/07/2021 12:38:27 - INFO - __main__ - Step 109480: {'lr': 8.69168465482204e-05, 'samples': 21020160, 'steps': 109479, 'loss/train': 1.5243175029754639} 11/07/2021 12:38:27 - INFO - __main__ - Step 109481: {'lr': 8.691282442924927e-05, 'samples': 21020352, 'steps': 109480, 'loss/train': 1.1496057510375977} 11/07/2021 12:38:28 - INFO - __main__ - Step 109482: {'lr': 8.690880238376269e-05, 'samples': 21020544, 'steps': 109481, 'loss/train': 1.136419653892517} 11/07/2021 12:38:28 - INFO - __main__ - Step 109483: {'lr': 8.69047804117625e-05, 'samples': 21020736, 'steps': 109482, 'loss/train': 1.501999855041504} 11/07/2021 12:38:28 - INFO - __main__ - Step 109484: {'lr': 8.69007585132505e-05, 'samples': 21020928, 'steps': 109483, 'loss/train': 1.5174683332443237} 11/07/2021 12:38:29 - INFO - __main__ - Step 109485: {'lr': 8.689673668822848e-05, 'samples': 21021120, 'steps': 109484, 'loss/train': 1.535753846168518} 11/07/2021 12:38:30 - INFO - __main__ - Step 109486: {'lr': 8.689271493669836e-05, 'samples': 21021312, 'steps': 109485, 'loss/train': 1.3664700984954834} 11/07/2021 12:38:30 - INFO - __main__ - Step 109487: {'lr': 8.68886932586618e-05, 'samples': 21021504, 'steps': 109486, 'loss/train': 1.3481159210205078} 11/07/2021 12:38:30 - INFO - __main__ - Step 109488: {'lr': 8.688467165412067e-05, 'samples': 21021696, 'steps': 109487, 'loss/train': 0.9343228936195374} 11/07/2021 12:38:31 - INFO - __main__ - Step 109489: {'lr': 8.68806501230768e-05, 'samples': 21021888, 'steps': 109488, 'loss/train': 1.994948387145996} 11/07/2021 12:38:32 - INFO - __main__ - Step 109490: {'lr': 8.687662866553197e-05, 'samples': 21022080, 'steps': 109489, 'loss/train': 1.566896915435791} 11/07/2021 12:38:32 - INFO - __main__ - Step 109491: {'lr': 8.687260728148805e-05, 'samples': 21022272, 'steps': 109490, 'loss/train': 0.7631884217262268} 11/07/2021 12:38:32 - INFO - __main__ - Step 109492: {'lr': 8.68685859709468e-05, 'samples': 21022464, 'steps': 109491, 'loss/train': 0.9795402884483337} 11/07/2021 12:38:33 - INFO - __main__ - Step 109493: {'lr': 8.686456473391003e-05, 'samples': 21022656, 'steps': 109492, 'loss/train': 1.2105776071548462} 11/07/2021 12:38:33 - INFO - __main__ - Step 109494: {'lr': 8.686054357037959e-05, 'samples': 21022848, 'steps': 109493, 'loss/train': 1.4699618816375732} 11/07/2021 12:38:34 - INFO - __main__ - Step 109495: {'lr': 8.685652248035725e-05, 'samples': 21023040, 'steps': 109494, 'loss/train': 1.4371767044067383} 11/07/2021 12:38:34 - INFO - __main__ - Step 109496: {'lr': 8.685250146384486e-05, 'samples': 21023232, 'steps': 109495, 'loss/train': 1.5259662866592407} 11/07/2021 12:38:35 - INFO - __main__ - Step 109497: {'lr': 8.68484805208442e-05, 'samples': 21023424, 'steps': 109496, 'loss/train': 1.6694622039794922} 11/07/2021 12:38:35 - INFO - __main__ - Step 109498: {'lr': 8.684445965135712e-05, 'samples': 21023616, 'steps': 109497, 'loss/train': 1.438754677772522} 11/07/2021 12:38:36 - INFO - __main__ - Step 109499: {'lr': 8.684043885538548e-05, 'samples': 21023808, 'steps': 109498, 'loss/train': 1.0772814750671387} 11/07/2021 12:38:37 - INFO - __main__ - Step 109500: {'lr': 8.683641813293094e-05, 'samples': 21024000, 'steps': 109499, 'loss/train': 0.7416524291038513} 11/07/2021 12:38:37 - INFO - __main__ - Step 109501: {'lr': 8.683239748399538e-05, 'samples': 21024192, 'steps': 109500, 'loss/train': 1.3961999416351318} 11/07/2021 12:38:37 - INFO - __main__ - Step 109502: {'lr': 8.682837690858064e-05, 'samples': 21024384, 'steps': 109501, 'loss/train': 1.1799495220184326} 11/07/2021 12:38:38 - INFO - __main__ - Step 109503: {'lr': 8.682435640668851e-05, 'samples': 21024576, 'steps': 109502, 'loss/train': 0.9172258973121643} 11/07/2021 12:38:38 - INFO - __main__ - Step 109504: {'lr': 8.682033597832078e-05, 'samples': 21024768, 'steps': 109503, 'loss/train': 0.806364119052887} 11/07/2021 12:38:39 - INFO - __main__ - Step 109505: {'lr': 8.681631562347933e-05, 'samples': 21024960, 'steps': 109504, 'loss/train': 1.5718929767608643} 11/07/2021 12:38:39 - INFO - __main__ - Step 109506: {'lr': 8.681229534216592e-05, 'samples': 21025152, 'steps': 109505, 'loss/train': 1.0273730754852295} 11/07/2021 12:38:40 - INFO - __main__ - Step 109507: {'lr': 8.680827513438236e-05, 'samples': 21025344, 'steps': 109506, 'loss/train': 1.3026758432388306} 11/07/2021 12:38:40 - INFO - __main__ - Step 109508: {'lr': 8.680425500013047e-05, 'samples': 21025536, 'steps': 109507, 'loss/train': 1.2996834516525269} 11/07/2021 12:38:40 - INFO - __main__ - Step 109509: {'lr': 8.680023493941208e-05, 'samples': 21025728, 'steps': 109508, 'loss/train': 1.4531313180923462} 11/07/2021 12:38:42 - INFO - __main__ - Step 109510: {'lr': 8.679621495222898e-05, 'samples': 21025920, 'steps': 109509, 'loss/train': 0.37421122193336487} 11/07/2021 12:38:42 - INFO - __main__ - Step 109511: {'lr': 8.679219503858299e-05, 'samples': 21026112, 'steps': 109510, 'loss/train': 1.496970534324646} 11/07/2021 12:38:42 - INFO - __main__ - Step 109512: {'lr': 8.678817519847592e-05, 'samples': 21026304, 'steps': 109511, 'loss/train': 1.3726422786712646} 11/07/2021 12:38:43 - INFO - __main__ - Step 109513: {'lr': 8.678415543190965e-05, 'samples': 21026496, 'steps': 109512, 'loss/train': 1.4127227067947388} 11/07/2021 12:38:43 - INFO - __main__ - Step 109514: {'lr': 8.678013573888585e-05, 'samples': 21026688, 'steps': 109513, 'loss/train': 1.2257020473480225} 11/07/2021 12:38:44 - INFO - __main__ - Step 109515: {'lr': 8.677611611940639e-05, 'samples': 21026880, 'steps': 109514, 'loss/train': 0.505042314529419} 11/07/2021 12:38:44 - INFO - __main__ - Step 109516: {'lr': 8.677209657347312e-05, 'samples': 21027072, 'steps': 109515, 'loss/train': 1.2882862091064453} 11/07/2021 12:38:45 - INFO - __main__ - Step 109517: {'lr': 8.676807710108781e-05, 'samples': 21027264, 'steps': 109516, 'loss/train': 1.5297279357910156} 11/07/2021 12:38:45 - INFO - __main__ - Step 109518: {'lr': 8.676405770225226e-05, 'samples': 21027456, 'steps': 109517, 'loss/train': 1.3191920518875122} 11/07/2021 12:38:45 - INFO - __main__ - Step 109519: {'lr': 8.676003837696833e-05, 'samples': 21027648, 'steps': 109518, 'loss/train': 1.4119523763656616} 11/07/2021 12:38:46 - INFO - __main__ - Step 109520: {'lr': 8.67560191252378e-05, 'samples': 21027840, 'steps': 109519, 'loss/train': 2.690629482269287} 11/07/2021 12:38:47 - INFO - __main__ - Step 109521: {'lr': 8.675199994706251e-05, 'samples': 21028032, 'steps': 109520, 'loss/train': 1.5424535274505615} 11/07/2021 12:38:47 - INFO - __main__ - Step 109522: {'lr': 8.674798084244423e-05, 'samples': 21028224, 'steps': 109521, 'loss/train': 1.2447773218154907} 11/07/2021 12:38:47 - INFO - __main__ - Step 109523: {'lr': 8.67439618113848e-05, 'samples': 21028416, 'steps': 109522, 'loss/train': 1.3851417303085327} 11/07/2021 12:38:48 - INFO - __main__ - Step 109524: {'lr': 8.6739942853886e-05, 'samples': 21028608, 'steps': 109523, 'loss/train': 1.420189619064331} 11/07/2021 12:38:49 - INFO - __main__ - Step 109525: {'lr': 8.67359239699497e-05, 'samples': 21028800, 'steps': 109524, 'loss/train': 1.5016146898269653} 11/07/2021 12:38:49 - INFO - __main__ - Step 109526: {'lr': 8.673190515957774e-05, 'samples': 21028992, 'steps': 109525, 'loss/train': 1.4130280017852783} 11/07/2021 12:38:50 - INFO - __main__ - Step 109527: {'lr': 8.672788642277177e-05, 'samples': 21029184, 'steps': 109526, 'loss/train': 1.0138171911239624} 11/07/2021 12:38:50 - INFO - __main__ - Step 109528: {'lr': 8.672386775953369e-05, 'samples': 21029376, 'steps': 109527, 'loss/train': 1.0642585754394531} 11/07/2021 12:38:50 - INFO - __main__ - Step 109529: {'lr': 8.671984916986533e-05, 'samples': 21029568, 'steps': 109528, 'loss/train': 1.7626925706863403} 11/07/2021 12:38:51 - INFO - __main__ - Step 109530: {'lr': 8.671583065376848e-05, 'samples': 21029760, 'steps': 109529, 'loss/train': 1.6172269582748413} 11/07/2021 12:38:52 - INFO - __main__ - Step 109531: {'lr': 8.671181221124497e-05, 'samples': 21029952, 'steps': 109530, 'loss/train': 0.6689000725746155} 11/07/2021 12:38:52 - INFO - __main__ - Step 109532: {'lr': 8.67077938422966e-05, 'samples': 21030144, 'steps': 109531, 'loss/train': 1.4081746339797974} 11/07/2021 12:38:52 - INFO - __main__ - Step 109533: {'lr': 8.670377554692516e-05, 'samples': 21030336, 'steps': 109532, 'loss/train': 1.510782241821289} 11/07/2021 12:38:53 - INFO - __main__ - Step 109534: {'lr': 8.66997573251325e-05, 'samples': 21030528, 'steps': 109533, 'loss/train': 0.9921889901161194} 11/07/2021 12:38:54 - INFO - __main__ - Step 109535: {'lr': 8.669573917692039e-05, 'samples': 21030720, 'steps': 109534, 'loss/train': 0.791553795337677} 11/07/2021 12:38:54 - INFO - __main__ - Step 109536: {'lr': 8.669172110229065e-05, 'samples': 21030912, 'steps': 109535, 'loss/train': 1.3206175565719604} 11/07/2021 12:38:54 - INFO - __main__ - Step 109537: {'lr': 8.668770310124513e-05, 'samples': 21031104, 'steps': 109536, 'loss/train': 0.9963027834892273} 11/07/2021 12:38:55 - INFO - __main__ - Step 109538: {'lr': 8.668368517378558e-05, 'samples': 21031296, 'steps': 109537, 'loss/train': 1.2704379558563232} 11/07/2021 12:38:55 - INFO - __main__ - Step 109539: {'lr': 8.667966731991394e-05, 'samples': 21031488, 'steps': 109538, 'loss/train': 1.5674598217010498} 11/07/2021 12:38:56 - INFO - __main__ - Step 109540: {'lr': 8.667564953963183e-05, 'samples': 21031680, 'steps': 109539, 'loss/train': 1.24200439453125} 11/07/2021 12:38:56 - INFO - __main__ - Step 109541: {'lr': 8.667163183294119e-05, 'samples': 21031872, 'steps': 109540, 'loss/train': 1.2364170551300049} 11/07/2021 12:38:57 - INFO - __main__ - Step 109542: {'lr': 8.666761419984376e-05, 'samples': 21032064, 'steps': 109541, 'loss/train': 1.849450707435608} 11/07/2021 12:38:57 - INFO - __main__ - Step 109543: {'lr': 8.666359664034137e-05, 'samples': 21032256, 'steps': 109542, 'loss/train': 1.014613151550293} 11/07/2021 12:38:58 - INFO - __main__ - Step 109544: {'lr': 8.665957915443587e-05, 'samples': 21032448, 'steps': 109543, 'loss/train': 1.3349952697753906} 11/07/2021 12:38:58 - INFO - __main__ - Step 109545: {'lr': 8.665556174212905e-05, 'samples': 21032640, 'steps': 109544, 'loss/train': 1.7255232334136963} 11/07/2021 12:38:59 - INFO - __main__ - Step 109546: {'lr': 8.665154440342269e-05, 'samples': 21032832, 'steps': 109545, 'loss/train': 0.35729849338531494} 11/07/2021 12:38:59 - INFO - __main__ - Step 109547: {'lr': 8.66475271383186e-05, 'samples': 21033024, 'steps': 109546, 'loss/train': 1.6423699855804443} 11/07/2021 12:39:00 - INFO - __main__ - Step 109548: {'lr': 8.664350994681866e-05, 'samples': 21033216, 'steps': 109547, 'loss/train': 0.8189883232116699} 11/07/2021 12:39:00 - INFO - __main__ - Step 109549: {'lr': 8.663949282892461e-05, 'samples': 21033408, 'steps': 109548, 'loss/train': 1.3251681327819824} 11/07/2021 12:39:00 - INFO - __main__ - Step 109550: {'lr': 8.663547578463829e-05, 'samples': 21033600, 'steps': 109549, 'loss/train': 1.228116750717163} 11/07/2021 12:39:01 - INFO - __main__ - Step 109551: {'lr': 8.66314588139615e-05, 'samples': 21033792, 'steps': 109550, 'loss/train': 1.3280316591262817} 11/07/2021 12:39:02 - INFO - __main__ - Step 109552: {'lr': 8.662744191689606e-05, 'samples': 21033984, 'steps': 109551, 'loss/train': 0.975111186504364} 11/07/2021 12:39:02 - INFO - __main__ - Step 109553: {'lr': 8.662342509344387e-05, 'samples': 21034176, 'steps': 109552, 'loss/train': 1.038526177406311} 11/07/2021 12:39:02 - INFO - __main__ - Step 109554: {'lr': 8.661940834360655e-05, 'samples': 21034368, 'steps': 109553, 'loss/train': 1.2423104047775269} 11/07/2021 12:39:03 - INFO - __main__ - Step 109555: {'lr': 8.6615391667386e-05, 'samples': 21034560, 'steps': 109554, 'loss/train': 1.3647711277008057} 11/07/2021 12:39:04 - INFO - __main__ - Step 109556: {'lr': 8.661137506478403e-05, 'samples': 21034752, 'steps': 109555, 'loss/train': 1.7168614864349365} 11/07/2021 12:39:04 - INFO - __main__ - Step 109557: {'lr': 8.660735853580245e-05, 'samples': 21034944, 'steps': 109556, 'loss/train': 1.2889509201049805} 11/07/2021 12:39:05 - INFO - __main__ - Step 109558: {'lr': 8.660334208044307e-05, 'samples': 21035136, 'steps': 109557, 'loss/train': 0.6953356266021729} 11/07/2021 12:39:05 - INFO - __main__ - Step 109559: {'lr': 8.659932569870771e-05, 'samples': 21035328, 'steps': 109558, 'loss/train': 1.5105459690093994} 11/07/2021 12:39:05 - INFO - __main__ - Step 109560: {'lr': 8.659530939059818e-05, 'samples': 21035520, 'steps': 109559, 'loss/train': 1.2317816019058228} 11/07/2021 12:39:06 - INFO - __main__ - Step 109561: {'lr': 8.659129315611627e-05, 'samples': 21035712, 'steps': 109560, 'loss/train': 1.6819026470184326} 11/07/2021 12:39:07 - INFO - __main__ - Step 109562: {'lr': 8.65872769952638e-05, 'samples': 21035904, 'steps': 109561, 'loss/train': 1.4165223836898804} 11/07/2021 12:39:07 - INFO - __main__ - Step 109563: {'lr': 8.658326090804262e-05, 'samples': 21036096, 'steps': 109562, 'loss/train': 1.245913028717041} 11/07/2021 12:39:07 - INFO - __main__ - Step 109564: {'lr': 8.657924489445446e-05, 'samples': 21036288, 'steps': 109563, 'loss/train': 0.8825846910476685} 11/07/2021 12:39:08 - INFO - __main__ - Step 109565: {'lr': 8.657522895450118e-05, 'samples': 21036480, 'steps': 109564, 'loss/train': 1.4850566387176514} 11/07/2021 12:39:09 - INFO - __main__ - Step 109566: {'lr': 8.657121308818467e-05, 'samples': 21036672, 'steps': 109565, 'loss/train': 1.0421466827392578} 11/07/2021 12:39:09 - INFO - __main__ - Step 109567: {'lr': 8.656719729550655e-05, 'samples': 21036864, 'steps': 109566, 'loss/train': 0.05787850543856621} 11/07/2021 12:39:09 - INFO - __main__ - Step 109568: {'lr': 8.656318157646875e-05, 'samples': 21037056, 'steps': 109567, 'loss/train': 1.0019452571868896} 11/07/2021 12:39:10 - INFO - __main__ - Step 109569: {'lr': 8.655916593107305e-05, 'samples': 21037248, 'steps': 109568, 'loss/train': 1.5683108568191528} 11/07/2021 12:39:10 - INFO - __main__ - Step 109570: {'lr': 8.655515035932127e-05, 'samples': 21037440, 'steps': 109569, 'loss/train': 1.4468467235565186} 11/07/2021 12:39:11 - INFO - __main__ - Step 109571: {'lr': 8.655113486121519e-05, 'samples': 21037632, 'steps': 109570, 'loss/train': 1.379692554473877} 11/07/2021 12:39:12 - INFO - __main__ - Step 109572: {'lr': 8.654711943675666e-05, 'samples': 21037824, 'steps': 109571, 'loss/train': 1.30409836769104} 11/07/2021 12:39:12 - INFO - __main__ - Step 109573: {'lr': 8.65431040859475e-05, 'samples': 21038016, 'steps': 109572, 'loss/train': 0.08743428438901901} 11/07/2021 12:39:12 - INFO - __main__ - Step 109574: {'lr': 8.653908880878947e-05, 'samples': 21038208, 'steps': 109573, 'loss/train': 1.6106488704681396} 11/07/2021 12:39:13 - INFO - __main__ - Step 109575: {'lr': 8.653507360528442e-05, 'samples': 21038400, 'steps': 109574, 'loss/train': 1.2165815830230713} 11/07/2021 12:39:14 - INFO - __main__ - Step 109576: {'lr': 8.653105847543413e-05, 'samples': 21038592, 'steps': 109575, 'loss/train': 1.4932581186294556} 11/07/2021 12:39:14 - INFO - __main__ - Step 109577: {'lr': 8.652704341924045e-05, 'samples': 21038784, 'steps': 109576, 'loss/train': 1.5549110174179077} 11/07/2021 12:39:14 - INFO - __main__ - Step 109578: {'lr': 8.652302843670512e-05, 'samples': 21038976, 'steps': 109577, 'loss/train': 1.3182114362716675} 11/07/2021 12:39:15 - INFO - __main__ - Step 109579: {'lr': 8.651901352783001e-05, 'samples': 21039168, 'steps': 109578, 'loss/train': 1.1491705179214478} 11/07/2021 12:39:15 - INFO - __main__ - Step 109580: {'lr': 8.6514998692617e-05, 'samples': 21039360, 'steps': 109579, 'loss/train': 1.191624402999878} 11/07/2021 12:39:16 - INFO - __main__ - Step 109581: {'lr': 8.651098393106774e-05, 'samples': 21039552, 'steps': 109580, 'loss/train': 1.094993233680725} 11/07/2021 12:39:17 - INFO - __main__ - Step 109582: {'lr': 8.650696924318407e-05, 'samples': 21039744, 'steps': 109581, 'loss/train': 1.241729497909546} 11/07/2021 12:39:17 - INFO - __main__ - Step 109583: {'lr': 8.650295462896787e-05, 'samples': 21039936, 'steps': 109582, 'loss/train': 1.2008239030838013} 11/07/2021 12:39:17 - INFO - __main__ - Step 109584: {'lr': 8.649894008842088e-05, 'samples': 21040128, 'steps': 109583, 'loss/train': 1.504479169845581} 11/07/2021 12:39:18 - INFO - __main__ - Step 109585: {'lr': 8.649492562154499e-05, 'samples': 21040320, 'steps': 109584, 'loss/train': 1.3264412879943848} 11/07/2021 12:39:19 - INFO - __main__ - Step 109586: {'lr': 8.649091122834193e-05, 'samples': 21040512, 'steps': 109585, 'loss/train': 1.2631933689117432} 11/07/2021 12:39:19 - INFO - __main__ - Step 109587: {'lr': 8.648689690881356e-05, 'samples': 21040704, 'steps': 109586, 'loss/train': 1.4454103708267212} 11/07/2021 12:39:19 - INFO - __main__ - Step 109588: {'lr': 8.648288266296164e-05, 'samples': 21040896, 'steps': 109587, 'loss/train': 1.6364792585372925} 11/07/2021 12:39:20 - INFO - __main__ - Step 109589: {'lr': 8.647886849078804e-05, 'samples': 21041088, 'steps': 109588, 'loss/train': 1.5456748008728027} 11/07/2021 12:39:20 - INFO - __main__ - Step 109590: {'lr': 8.647485439229452e-05, 'samples': 21041280, 'steps': 109589, 'loss/train': 1.4427076578140259} 11/07/2021 12:39:21 - INFO - __main__ - Step 109591: {'lr': 8.647084036748292e-05, 'samples': 21041472, 'steps': 109590, 'loss/train': 1.1253374814987183} 11/07/2021 12:39:21 - INFO - __main__ - Step 109592: {'lr': 8.646682641635506e-05, 'samples': 21041664, 'steps': 109591, 'loss/train': 1.9547975063323975} 11/07/2021 12:39:22 - INFO - __main__ - Step 109593: {'lr': 8.646281253891278e-05, 'samples': 21041856, 'steps': 109592, 'loss/train': 2.195685863494873} 11/07/2021 12:39:22 - INFO - __main__ - Step 109594: {'lr': 8.645879873515774e-05, 'samples': 21042048, 'steps': 109593, 'loss/train': 1.2299861907958984} 11/07/2021 12:39:22 - INFO - __main__ - Step 109595: {'lr': 8.645478500509185e-05, 'samples': 21042240, 'steps': 109594, 'loss/train': 0.9699141383171082} 11/07/2021 12:39:23 - INFO - __main__ - Step 109596: {'lr': 8.645077134871693e-05, 'samples': 21042432, 'steps': 109595, 'loss/train': 1.7861255407333374} 11/07/2021 12:39:24 - INFO - __main__ - Step 109597: {'lr': 8.644675776603475e-05, 'samples': 21042624, 'steps': 109596, 'loss/train': 1.1880378723144531} 11/07/2021 12:39:24 - INFO - __main__ - Step 109598: {'lr': 8.644274425704713e-05, 'samples': 21042816, 'steps': 109597, 'loss/train': 1.3093923330307007} 11/07/2021 12:39:25 - INFO - __main__ - Step 109599: {'lr': 8.643873082175591e-05, 'samples': 21043008, 'steps': 109598, 'loss/train': 1.2067174911499023} 11/07/2021 12:39:25 - INFO - __main__ - Step 109600: {'lr': 8.643471746016285e-05, 'samples': 21043200, 'steps': 109599, 'loss/train': 1.0492392778396606} 11/07/2021 12:39:25 - INFO - __main__ - Step 109601: {'lr': 8.643070417226978e-05, 'samples': 21043392, 'steps': 109600, 'loss/train': 0.5554840564727783} 11/07/2021 12:39:26 - INFO - __main__ - Step 109602: {'lr': 8.642669095807853e-05, 'samples': 21043584, 'steps': 109601, 'loss/train': 1.0005848407745361} 11/07/2021 12:39:27 - INFO - __main__ - Step 109603: {'lr': 8.64226778175909e-05, 'samples': 21043776, 'steps': 109602, 'loss/train': 1.3444514274597168} 11/07/2021 12:39:27 - INFO - __main__ - Step 109604: {'lr': 8.641866475080865e-05, 'samples': 21043968, 'steps': 109603, 'loss/train': 0.8685914278030396} 11/07/2021 12:39:27 - INFO - __main__ - Step 109605: {'lr': 8.641465175773364e-05, 'samples': 21044160, 'steps': 109604, 'loss/train': 1.3931766748428345} 11/07/2021 12:39:28 - INFO - __main__ - Step 109606: {'lr': 8.641063883836767e-05, 'samples': 21044352, 'steps': 109605, 'loss/train': 1.2424511909484863} 11/07/2021 12:39:29 - INFO - __main__ - Step 109607: {'lr': 8.640662599271262e-05, 'samples': 21044544, 'steps': 109606, 'loss/train': 1.4402018785476685} 11/07/2021 12:39:29 - INFO - __main__ - Step 109608: {'lr': 8.640261322077015e-05, 'samples': 21044736, 'steps': 109607, 'loss/train': 1.1321368217468262} 11/07/2021 12:39:30 - INFO - __main__ - Step 109609: {'lr': 8.639860052254212e-05, 'samples': 21044928, 'steps': 109608, 'loss/train': 1.2471896409988403} 11/07/2021 12:39:30 - INFO - __main__ - Step 109610: {'lr': 8.639458789803037e-05, 'samples': 21045120, 'steps': 109609, 'loss/train': 1.435999870300293} 11/07/2021 12:39:30 - INFO - __main__ - Step 109611: {'lr': 8.639057534723668e-05, 'samples': 21045312, 'steps': 109610, 'loss/train': 1.0229389667510986} 11/07/2021 12:39:31 - INFO - __main__ - Step 109612: {'lr': 8.638656287016288e-05, 'samples': 21045504, 'steps': 109611, 'loss/train': 1.2778798341751099} 11/07/2021 12:39:32 - INFO - __main__ - Step 109613: {'lr': 8.638255046681077e-05, 'samples': 21045696, 'steps': 109612, 'loss/train': 1.3900896310806274} 11/07/2021 12:39:32 - INFO - __main__ - Step 109614: {'lr': 8.637853813718216e-05, 'samples': 21045888, 'steps': 109613, 'loss/train': 0.7831546068191528} 11/07/2021 12:39:32 - INFO - __main__ - Step 109615: {'lr': 8.637452588127887e-05, 'samples': 21046080, 'steps': 109614, 'loss/train': 1.4069499969482422} 11/07/2021 12:39:33 - INFO - __main__ - Step 109616: {'lr': 8.637051369910265e-05, 'samples': 21046272, 'steps': 109615, 'loss/train': 1.422537922859192} 11/07/2021 12:39:34 - INFO - __main__ - Step 109617: {'lr': 8.63665015906554e-05, 'samples': 21046464, 'steps': 109616, 'loss/train': 1.236759901046753} 11/07/2021 12:39:34 - INFO - __main__ - Step 109618: {'lr': 8.636248955593883e-05, 'samples': 21046656, 'steps': 109617, 'loss/train': 1.3936355113983154} 11/07/2021 12:39:34 - INFO - __main__ - Step 109619: {'lr': 8.635847759495485e-05, 'samples': 21046848, 'steps': 109618, 'loss/train': 1.3187679052352905} 11/07/2021 12:39:35 - INFO - __main__ - Step 109620: {'lr': 8.635446570770528e-05, 'samples': 21047040, 'steps': 109619, 'loss/train': 1.084200143814087} 11/07/2021 12:39:35 - INFO - __main__ - Step 109621: {'lr': 8.635045389419178e-05, 'samples': 21047232, 'steps': 109620, 'loss/train': 1.097173810005188} 11/07/2021 12:39:36 - INFO - __main__ - Step 109622: {'lr': 8.63464421544162e-05, 'samples': 21047424, 'steps': 109621, 'loss/train': 1.0716532468795776} 11/07/2021 12:39:37 - INFO - __main__ - Step 109623: {'lr': 8.634243048838045e-05, 'samples': 21047616, 'steps': 109622, 'loss/train': 1.4578293561935425} 11/07/2021 12:39:37 - INFO - __main__ - Step 109624: {'lr': 8.633841889608623e-05, 'samples': 21047808, 'steps': 109623, 'loss/train': 0.8986608982086182} 11/07/2021 12:39:37 - INFO - __main__ - Step 109625: {'lr': 8.63344073775354e-05, 'samples': 21048000, 'steps': 109624, 'loss/train': 1.6427667140960693} 11/07/2021 12:39:38 - INFO - __main__ - Step 109626: {'lr': 8.63303959327298e-05, 'samples': 21048192, 'steps': 109625, 'loss/train': 1.6031625270843506} 11/07/2021 12:39:38 - INFO - __main__ - Step 109627: {'lr': 8.632638456167114e-05, 'samples': 21048384, 'steps': 109626, 'loss/train': 1.5136082172393799} 11/07/2021 12:39:39 - INFO - __main__ - Step 109628: {'lr': 8.632237326436132e-05, 'samples': 21048576, 'steps': 109627, 'loss/train': 1.2487561702728271} 11/07/2021 12:39:39 - INFO - __main__ - Step 109629: {'lr': 8.631836204080209e-05, 'samples': 21048768, 'steps': 109628, 'loss/train': 1.4844799041748047} 11/07/2021 12:39:40 - INFO - __main__ - Step 109630: {'lr': 8.631435089099532e-05, 'samples': 21048960, 'steps': 109629, 'loss/train': 0.6845731735229492} 11/07/2021 12:39:40 - INFO - __main__ - Step 109631: {'lr': 8.631033981494282e-05, 'samples': 21049152, 'steps': 109630, 'loss/train': 1.102375864982605} 11/07/2021 12:39:40 - INFO - __main__ - Step 109632: {'lr': 8.63063288126463e-05, 'samples': 21049344, 'steps': 109631, 'loss/train': 1.3250319957733154} 11/07/2021 12:39:41 - INFO - __main__ - Step 109633: {'lr': 8.630231788410762e-05, 'samples': 21049536, 'steps': 109632, 'loss/train': 1.3415547609329224} 11/07/2021 12:39:42 - INFO - __main__ - Step 109634: {'lr': 8.629830702932856e-05, 'samples': 21049728, 'steps': 109633, 'loss/train': 1.65328049659729} 11/07/2021 12:39:42 - INFO - __main__ - Step 109635: {'lr': 8.629429624831098e-05, 'samples': 21049920, 'steps': 109634, 'loss/train': 1.2411943674087524} 11/07/2021 12:39:43 - INFO - __main__ - Step 109636: {'lr': 8.629028554105666e-05, 'samples': 21050112, 'steps': 109635, 'loss/train': 1.3777978420257568} 11/07/2021 12:39:43 - INFO - __main__ - Step 109637: {'lr': 8.628627490756743e-05, 'samples': 21050304, 'steps': 109636, 'loss/train': 1.3278683423995972} 11/07/2021 12:39:43 - INFO - __main__ - Step 109638: {'lr': 8.628226434784506e-05, 'samples': 21050496, 'steps': 109637, 'loss/train': 1.0151748657226562} 11/07/2021 12:39:44 - INFO - __main__ - Step 109639: {'lr': 8.627825386189136e-05, 'samples': 21050688, 'steps': 109638, 'loss/train': 1.2436001300811768} 11/07/2021 12:39:45 - INFO - __main__ - Step 109640: {'lr': 8.62742434497082e-05, 'samples': 21050880, 'steps': 109639, 'loss/train': 1.400095820426941} 11/07/2021 12:39:45 - INFO - __main__ - Step 109641: {'lr': 8.627023311129729e-05, 'samples': 21051072, 'steps': 109640, 'loss/train': 1.4314934015274048} 11/07/2021 12:39:45 - INFO - __main__ - Step 109642: {'lr': 8.626622284666058e-05, 'samples': 21051264, 'steps': 109641, 'loss/train': 0.9365708231925964} 11/07/2021 12:39:46 - INFO - __main__ - Step 109643: {'lr': 8.626221265579973e-05, 'samples': 21051456, 'steps': 109642, 'loss/train': 1.2865192890167236} 11/07/2021 12:39:47 - INFO - __main__ - Step 109644: {'lr': 8.625820253871655e-05, 'samples': 21051648, 'steps': 109643, 'loss/train': 1.161800503730774} 11/07/2021 12:39:48 - INFO - __main__ - Step 109645: {'lr': 8.625419249541294e-05, 'samples': 21051840, 'steps': 109644, 'loss/train': 1.6685791015625} 11/07/2021 12:39:48 - INFO - __main__ - Step 109646: {'lr': 8.625018252589065e-05, 'samples': 21052032, 'steps': 109645, 'loss/train': 1.423176884651184} 11/07/2021 12:39:48 - INFO - __main__ - Step 109647: {'lr': 8.62461726301515e-05, 'samples': 21052224, 'steps': 109646, 'loss/train': 2.445878505706787} 11/07/2021 12:39:49 - INFO - __main__ - Step 109648: {'lr': 8.62421628081973e-05, 'samples': 21052416, 'steps': 109647, 'loss/train': 1.476043701171875} 11/07/2021 12:39:50 - INFO - __main__ - Step 109649: {'lr': 8.623815306002986e-05, 'samples': 21052608, 'steps': 109648, 'loss/train': 1.1735496520996094} 11/07/2021 12:39:50 - INFO - __main__ - Step 109650: {'lr': 8.623414338565097e-05, 'samples': 21052800, 'steps': 109649, 'loss/train': 1.2576918601989746} 11/07/2021 12:39:50 - INFO - __main__ - Step 109651: {'lr': 8.623013378506245e-05, 'samples': 21052992, 'steps': 109650, 'loss/train': 1.4181746244430542} 11/07/2021 12:39:51 - INFO - __main__ - Step 109652: {'lr': 8.622612425826612e-05, 'samples': 21053184, 'steps': 109651, 'loss/train': 1.613542914390564} 11/07/2021 12:39:51 - INFO - __main__ - Step 109653: {'lr': 8.622211480526382e-05, 'samples': 21053376, 'steps': 109652, 'loss/train': 1.113917589187622} 11/07/2021 12:39:52 - INFO - __main__ - Step 109654: {'lr': 8.621810542605727e-05, 'samples': 21053568, 'steps': 109653, 'loss/train': 1.126990556716919} 11/07/2021 12:39:52 - INFO - __main__ - Step 109655: {'lr': 8.621409612064829e-05, 'samples': 21053760, 'steps': 109654, 'loss/train': 1.0562613010406494} 11/07/2021 12:39:53 - INFO - __main__ - Step 109656: {'lr': 8.621008688903869e-05, 'samples': 21053952, 'steps': 109655, 'loss/train': 1.5755577087402344} 11/07/2021 12:39:53 - INFO - __main__ - Step 109657: {'lr': 8.620607773123031e-05, 'samples': 21054144, 'steps': 109656, 'loss/train': 1.3870666027069092} 11/07/2021 12:39:54 - INFO - __main__ - Step 109658: {'lr': 8.620206864722496e-05, 'samples': 21054336, 'steps': 109657, 'loss/train': 0.9834876656532288} 11/07/2021 12:39:54 - INFO - __main__ - Step 109659: {'lr': 8.619805963702443e-05, 'samples': 21054528, 'steps': 109658, 'loss/train': 2.018601417541504} 11/07/2021 12:39:55 - INFO - __main__ - Step 109660: {'lr': 8.619405070063052e-05, 'samples': 21054720, 'steps': 109659, 'loss/train': 1.3390679359436035} 11/07/2021 12:39:55 - INFO - __main__ - Step 109661: {'lr': 8.619004183804505e-05, 'samples': 21054912, 'steps': 109660, 'loss/train': 1.4461740255355835} 11/07/2021 12:39:56 - INFO - __main__ - Step 109662: {'lr': 8.618603304926981e-05, 'samples': 21055104, 'steps': 109661, 'loss/train': 1.2067677974700928} 11/07/2021 12:39:56 - INFO - __main__ - Step 109663: {'lr': 8.61820243343066e-05, 'samples': 21055296, 'steps': 109662, 'loss/train': 0.9609567523002625} 11/07/2021 12:39:56 - INFO - __main__ - Step 109664: {'lr': 8.617801569315736e-05, 'samples': 21055488, 'steps': 109663, 'loss/train': 1.0229648351669312} 11/07/2021 12:39:57 - INFO - __main__ - Step 109665: {'lr': 8.617400712582369e-05, 'samples': 21055680, 'steps': 109664, 'loss/train': 1.2370128631591797} 11/07/2021 12:39:58 - INFO - __main__ - Step 109666: {'lr': 8.616999863230746e-05, 'samples': 21055872, 'steps': 109665, 'loss/train': 1.4735091924667358} 11/07/2021 12:39:58 - INFO - __main__ - Step 109667: {'lr': 8.616599021261052e-05, 'samples': 21056064, 'steps': 109666, 'loss/train': 1.2132291793823242} 11/07/2021 12:39:58 - INFO - __main__ - Step 109668: {'lr': 8.616198186673462e-05, 'samples': 21056256, 'steps': 109667, 'loss/train': 1.4267163276672363} 11/07/2021 12:39:59 - INFO - __main__ - Step 109669: {'lr': 8.615797359468166e-05, 'samples': 21056448, 'steps': 109668, 'loss/train': 1.0316007137298584} 11/07/2021 12:40:00 - INFO - __main__ - Step 109670: {'lr': 8.615396539645334e-05, 'samples': 21056640, 'steps': 109669, 'loss/train': 1.209389328956604} 11/07/2021 12:40:00 - INFO - __main__ - Step 109671: {'lr': 8.614995727205155e-05, 'samples': 21056832, 'steps': 109670, 'loss/train': 1.3964790105819702} 11/07/2021 12:40:01 - INFO - __main__ - Step 109672: {'lr': 8.614594922147805e-05, 'samples': 21057024, 'steps': 109671, 'loss/train': 1.3384597301483154} 11/07/2021 12:40:01 - INFO - __main__ - Step 109673: {'lr': 8.614194124473465e-05, 'samples': 21057216, 'steps': 109672, 'loss/train': 1.0422289371490479} 11/07/2021 12:40:01 - INFO - __main__ - Step 109674: {'lr': 8.613793334182316e-05, 'samples': 21057408, 'steps': 109673, 'loss/train': 1.2188456058502197} 11/07/2021 12:40:02 - INFO - __main__ - Step 109675: {'lr': 8.613392551274549e-05, 'samples': 21057600, 'steps': 109674, 'loss/train': 1.4283243417739868} 11/07/2021 12:40:03 - INFO - __main__ - Step 109676: {'lr': 8.612991775750326e-05, 'samples': 21057792, 'steps': 109675, 'loss/train': 1.8350350856781006} 11/07/2021 12:40:03 - INFO - __main__ - Step 109677: {'lr': 8.612591007609832e-05, 'samples': 21057984, 'steps': 109676, 'loss/train': 1.1169956922531128} 11/07/2021 12:40:04 - INFO - __main__ - Step 109678: {'lr': 8.612190246853257e-05, 'samples': 21058176, 'steps': 109677, 'loss/train': 1.338417887687683} 11/07/2021 12:40:04 - INFO - __main__ - Step 109679: {'lr': 8.611789493480773e-05, 'samples': 21058368, 'steps': 109678, 'loss/train': 0.9270911812782288} 11/07/2021 12:40:04 - INFO - __main__ - Step 109680: {'lr': 8.611388747492565e-05, 'samples': 21058560, 'steps': 109679, 'loss/train': 1.717898964881897} 11/07/2021 12:40:05 - INFO - __main__ - Step 109681: {'lr': 8.610988008888812e-05, 'samples': 21058752, 'steps': 109680, 'loss/train': 2.0552871227264404} 11/07/2021 12:40:06 - INFO - __main__ - Step 109682: {'lr': 8.610587277669696e-05, 'samples': 21058944, 'steps': 109681, 'loss/train': 1.713053584098816} 11/07/2021 12:40:06 - INFO - __main__ - Step 109683: {'lr': 8.610186553835394e-05, 'samples': 21059136, 'steps': 109682, 'loss/train': 1.3602261543273926} 11/07/2021 12:40:06 - INFO - __main__ - Step 109684: {'lr': 8.609785837386092e-05, 'samples': 21059328, 'steps': 109683, 'loss/train': 1.186760663986206} 11/07/2021 12:40:07 - INFO - __main__ - Step 109685: {'lr': 8.609385128321965e-05, 'samples': 21059520, 'steps': 109684, 'loss/train': 1.1155489683151245} 11/07/2021 12:40:08 - INFO - __main__ - Step 109686: {'lr': 8.608984426643196e-05, 'samples': 21059712, 'steps': 109685, 'loss/train': 1.1115977764129639} 11/07/2021 12:40:08 - INFO - __main__ - Step 109687: {'lr': 8.608583732349976e-05, 'samples': 21059904, 'steps': 109686, 'loss/train': 1.1407934427261353} 11/07/2021 12:40:09 - INFO - __main__ - Step 109688: {'lr': 8.608183045442466e-05, 'samples': 21060096, 'steps': 109687, 'loss/train': 1.1973202228546143} 11/07/2021 12:40:09 - INFO - __main__ - Step 109689: {'lr': 8.607782365920854e-05, 'samples': 21060288, 'steps': 109688, 'loss/train': 0.7578951716423035} 11/07/2021 12:40:09 - INFO - __main__ - Step 109690: {'lr': 8.607381693785326e-05, 'samples': 21060480, 'steps': 109689, 'loss/train': 1.4691694974899292} 11/07/2021 12:40:10 - INFO - __main__ - Step 109691: {'lr': 8.606981029036057e-05, 'samples': 21060672, 'steps': 109690, 'loss/train': 0.881562352180481} 11/07/2021 12:40:11 - INFO - __main__ - Step 109692: {'lr': 8.606580371673228e-05, 'samples': 21060864, 'steps': 109691, 'loss/train': 1.692188024520874} 11/07/2021 12:40:11 - INFO - __main__ - Step 109693: {'lr': 8.606179721697022e-05, 'samples': 21061056, 'steps': 109692, 'loss/train': 0.9462144374847412} 11/07/2021 12:40:11 - INFO - __main__ - Step 109694: {'lr': 8.60577907910762e-05, 'samples': 21061248, 'steps': 109693, 'loss/train': 1.9374030828475952} 11/07/2021 12:40:12 - INFO - __main__ - Step 109695: {'lr': 8.605378443905199e-05, 'samples': 21061440, 'steps': 109694, 'loss/train': 1.2537202835083008} 11/07/2021 12:40:13 - INFO - __main__ - Step 109696: {'lr': 8.604977816089942e-05, 'samples': 21061632, 'steps': 109695, 'loss/train': 0.28288009762763977} 11/07/2021 12:40:13 - INFO - __main__ - Step 109697: {'lr': 8.604577195662031e-05, 'samples': 21061824, 'steps': 109696, 'loss/train': 1.3210670948028564} 11/07/2021 12:40:13 - INFO - __main__ - Step 109698: {'lr': 8.604176582621642e-05, 'samples': 21062016, 'steps': 109697, 'loss/train': 0.6972091197967529} 11/07/2021 12:40:14 - INFO - __main__ - Step 109699: {'lr': 8.603775976968959e-05, 'samples': 21062208, 'steps': 109698, 'loss/train': 1.2407387495040894} 11/07/2021 12:40:14 - INFO - __main__ - Step 109700: {'lr': 8.60337537870416e-05, 'samples': 21062400, 'steps': 109699, 'loss/train': 0.9771680235862732} 11/07/2021 12:40:15 - INFO - __main__ - Step 109701: {'lr': 8.602974787827436e-05, 'samples': 21062592, 'steps': 109700, 'loss/train': 2.0365052223205566} 11/07/2021 12:40:16 - INFO - __main__ - Step 109702: {'lr': 8.602574204338953e-05, 'samples': 21062784, 'steps': 109701, 'loss/train': 1.0570974349975586} 11/07/2021 12:40:16 - INFO - __main__ - Step 109703: {'lr': 8.602173628238893e-05, 'samples': 21062976, 'steps': 109702, 'loss/train': 0.9338483810424805} 11/07/2021 12:40:16 - INFO - __main__ - Step 109704: {'lr': 8.601773059527442e-05, 'samples': 21063168, 'steps': 109703, 'loss/train': 1.799865961074829} 11/07/2021 12:40:17 - INFO - __main__ - Step 109705: {'lr': 8.601372498204779e-05, 'samples': 21063360, 'steps': 109704, 'loss/train': 1.144895315170288} 11/07/2021 12:40:17 - INFO - __main__ - Step 109706: {'lr': 8.600971944271086e-05, 'samples': 21063552, 'steps': 109705, 'loss/train': 1.5557206869125366} 11/07/2021 12:40:18 - INFO - __main__ - Step 109707: {'lr': 8.600571397726543e-05, 'samples': 21063744, 'steps': 109706, 'loss/train': 0.6608743071556091} 11/07/2021 12:40:18 - INFO - __main__ - Step 109708: {'lr': 8.600170858571326e-05, 'samples': 21063936, 'steps': 109707, 'loss/train': 1.5831602811813354} 11/07/2021 12:40:19 - INFO - __main__ - Step 109709: {'lr': 8.59977032680562e-05, 'samples': 21064128, 'steps': 109708, 'loss/train': 1.1905407905578613} 11/07/2021 12:40:19 - INFO - __main__ - Step 109710: {'lr': 8.599369802429604e-05, 'samples': 21064320, 'steps': 109709, 'loss/train': 1.1174310445785522} 11/07/2021 12:40:19 - INFO - __main__ - Step 109711: {'lr': 8.598969285443461e-05, 'samples': 21064512, 'steps': 109710, 'loss/train': 1.4736831188201904} 11/07/2021 12:40:21 - INFO - __main__ - Step 109712: {'lr': 8.598568775847368e-05, 'samples': 21064704, 'steps': 109711, 'loss/train': 1.4839786291122437} 11/07/2021 12:40:21 - INFO - __main__ - Step 109713: {'lr': 8.598168273641507e-05, 'samples': 21064896, 'steps': 109712, 'loss/train': 1.3295202255249023} 11/07/2021 12:40:21 - INFO - __main__ - Step 109714: {'lr': 8.597767778826065e-05, 'samples': 21065088, 'steps': 109713, 'loss/train': 0.7080580592155457} 11/07/2021 12:40:22 - INFO - __main__ - Step 109715: {'lr': 8.59736729140121e-05, 'samples': 21065280, 'steps': 109714, 'loss/train': 0.20581898093223572} 11/07/2021 12:40:22 - INFO - __main__ - Step 109716: {'lr': 8.596966811367127e-05, 'samples': 21065472, 'steps': 109715, 'loss/train': 1.2813599109649658} 11/07/2021 12:40:23 - INFO - __main__ - Step 109717: {'lr': 8.596566338723996e-05, 'samples': 21065664, 'steps': 109716, 'loss/train': 1.3696314096450806} 11/07/2021 12:40:23 - INFO - __main__ - Step 109718: {'lr': 8.596165873472004e-05, 'samples': 21065856, 'steps': 109717, 'loss/train': 0.7773477435112} 11/07/2021 12:40:24 - INFO - __main__ - Step 109719: {'lr': 8.59576541561132e-05, 'samples': 21066048, 'steps': 109718, 'loss/train': 1.2018145322799683} 11/07/2021 12:40:24 - INFO - __main__ - Step 109720: {'lr': 8.595364965142136e-05, 'samples': 21066240, 'steps': 109719, 'loss/train': 1.3415937423706055} 11/07/2021 12:40:24 - INFO - __main__ - Step 109721: {'lr': 8.594964522064624e-05, 'samples': 21066432, 'steps': 109720, 'loss/train': 1.5936576128005981} 11/07/2021 12:40:25 - INFO - __main__ - Step 109722: {'lr': 8.59456408637897e-05, 'samples': 21066624, 'steps': 109721, 'loss/train': 1.5184584856033325} 11/07/2021 12:40:26 - INFO - __main__ - Step 109723: {'lr': 8.594163658085352e-05, 'samples': 21066816, 'steps': 109722, 'loss/train': 1.124348521232605} 11/07/2021 12:40:26 - INFO - __main__ - Step 109724: {'lr': 8.593763237183952e-05, 'samples': 21067008, 'steps': 109723, 'loss/train': 1.0686177015304565} 11/07/2021 12:40:26 - INFO - __main__ - Step 109725: {'lr': 8.593362823674947e-05, 'samples': 21067200, 'steps': 109724, 'loss/train': 1.0405559539794922} 11/07/2021 12:40:27 - INFO - __main__ - Step 109726: {'lr': 8.59296241755852e-05, 'samples': 21067392, 'steps': 109725, 'loss/train': 0.07274783402681351} 11/07/2021 12:40:28 - INFO - __main__ - Step 109727: {'lr': 8.592562018834851e-05, 'samples': 21067584, 'steps': 109726, 'loss/train': 1.3916609287261963} 11/07/2021 12:40:28 - INFO - __main__ - Step 109728: {'lr': 8.592161627504127e-05, 'samples': 21067776, 'steps': 109727, 'loss/train': 1.7376413345336914} 11/07/2021 12:40:29 - INFO - __main__ - Step 109729: {'lr': 8.591761243566518e-05, 'samples': 21067968, 'steps': 109728, 'loss/train': 1.1080464124679565} 11/07/2021 12:40:29 - INFO - __main__ - Step 109730: {'lr': 8.591360867022206e-05, 'samples': 21068160, 'steps': 109729, 'loss/train': 1.4181666374206543} 11/07/2021 12:40:29 - INFO - __main__ - Step 109731: {'lr': 8.590960497871373e-05, 'samples': 21068352, 'steps': 109730, 'loss/train': 0.9487775564193726} 11/07/2021 12:40:30 - INFO - __main__ - Step 109732: {'lr': 8.590560136114198e-05, 'samples': 21068544, 'steps': 109731, 'loss/train': 1.1262195110321045} 11/07/2021 12:40:31 - INFO - __main__ - Step 109733: {'lr': 8.590159781750867e-05, 'samples': 21068736, 'steps': 109732, 'loss/train': 1.397900104522705} 11/07/2021 12:40:31 - INFO - __main__ - Step 109734: {'lr': 8.589759434781555e-05, 'samples': 21068928, 'steps': 109733, 'loss/train': 1.4252519607543945} 11/07/2021 12:40:31 - INFO - __main__ - Step 109735: {'lr': 8.589359095206445e-05, 'samples': 21069120, 'steps': 109734, 'loss/train': 1.3139609098434448} 11/07/2021 12:40:32 - INFO - __main__ - Step 109736: {'lr': 8.588958763025715e-05, 'samples': 21069312, 'steps': 109735, 'loss/train': 1.2397074699401855} 11/07/2021 12:40:32 - INFO - __main__ - Step 109737: {'lr': 8.588558438239547e-05, 'samples': 21069504, 'steps': 109736, 'loss/train': 1.403495192527771} 11/07/2021 12:40:33 - INFO - __main__ - Step 109738: {'lr': 8.588158120848122e-05, 'samples': 21069696, 'steps': 109737, 'loss/train': 1.385003685951233} 11/07/2021 12:40:33 - INFO - __main__ - Step 109739: {'lr': 8.587757810851621e-05, 'samples': 21069888, 'steps': 109738, 'loss/train': 1.5131313800811768} 11/07/2021 12:40:34 - INFO - __main__ - Step 109740: {'lr': 8.58735750825022e-05, 'samples': 21070080, 'steps': 109739, 'loss/train': 0.8652055263519287} 11/07/2021 12:40:34 - INFO - __main__ - Step 109741: {'lr': 8.586957213044114e-05, 'samples': 21070272, 'steps': 109740, 'loss/train': 0.70872962474823} 11/07/2021 12:40:35 - INFO - __main__ - Step 109742: {'lr': 8.586556925233463e-05, 'samples': 21070464, 'steps': 109741, 'loss/train': 1.5569120645523071} 11/07/2021 12:40:37 - INFO - __main__ - Step 109743: {'lr': 8.586156644818455e-05, 'samples': 21070656, 'steps': 109742, 'loss/train': 2.0604026317596436} 11/07/2021 12:40:37 - INFO - __main__ - Step 109744: {'lr': 8.585756371799272e-05, 'samples': 21070848, 'steps': 109743, 'loss/train': 1.1936031579971313} 11/07/2021 12:40:37 - INFO - __main__ - Step 109745: {'lr': 8.585356106176093e-05, 'samples': 21071040, 'steps': 109744, 'loss/train': 0.9806881546974182} 11/07/2021 12:40:38 - INFO - __main__ - Step 109746: {'lr': 8.5849558479491e-05, 'samples': 21071232, 'steps': 109745, 'loss/train': 0.9436607956886292} 11/07/2021 12:40:38 - INFO - __main__ - Step 109747: {'lr': 8.584555597118474e-05, 'samples': 21071424, 'steps': 109746, 'loss/train': 1.6214749813079834} 11/07/2021 12:40:38 - INFO - __main__ - Step 109748: {'lr': 8.584155353684392e-05, 'samples': 21071616, 'steps': 109747, 'loss/train': 1.3572351932525635} 11/07/2021 12:40:39 - INFO - __main__ - Step 109749: {'lr': 8.583755117647038e-05, 'samples': 21071808, 'steps': 109748, 'loss/train': 1.7375586032867432} 11/07/2021 12:40:39 - INFO - __main__ - Step 109750: {'lr': 8.583354889006589e-05, 'samples': 21072000, 'steps': 109749, 'loss/train': 1.6872859001159668} 11/07/2021 12:40:40 - INFO - __main__ - Step 109751: {'lr': 8.582954667763226e-05, 'samples': 21072192, 'steps': 109750, 'loss/train': 1.7194541692733765} 11/07/2021 12:40:40 - INFO - __main__ - Step 109752: {'lr': 8.582554453917132e-05, 'samples': 21072384, 'steps': 109751, 'loss/train': 1.1367214918136597} 11/07/2021 12:40:41 - INFO - __main__ - Step 109753: {'lr': 8.582154247468485e-05, 'samples': 21072576, 'steps': 109752, 'loss/train': 1.3248826265335083} 11/07/2021 12:40:41 - INFO - __main__ - Step 109754: {'lr': 8.581754048417468e-05, 'samples': 21072768, 'steps': 109753, 'loss/train': 1.2728869915008545} 11/07/2021 12:40:41 - INFO - __main__ - Step 109755: {'lr': 8.581353856764266e-05, 'samples': 21072960, 'steps': 109754, 'loss/train': 1.0274631977081299} 11/07/2021 12:40:42 - INFO - __main__ - Step 109756: {'lr': 8.580953672509043e-05, 'samples': 21073152, 'steps': 109755, 'loss/train': 1.5212085247039795} 11/07/2021 12:40:43 - INFO - __main__ - Step 109757: {'lr': 8.580553495651991e-05, 'samples': 21073344, 'steps': 109756, 'loss/train': 1.1744725704193115} 11/07/2021 12:40:43 - INFO - __main__ - Step 109758: {'lr': 8.580153326193288e-05, 'samples': 21073536, 'steps': 109757, 'loss/train': 1.3811489343643188} 11/07/2021 12:40:44 - INFO - __main__ - Step 109759: {'lr': 8.579753164133114e-05, 'samples': 21073728, 'steps': 109758, 'loss/train': 1.5415197610855103} 11/07/2021 12:40:44 - INFO - __main__ - Step 109760: {'lr': 8.579353009471649e-05, 'samples': 21073920, 'steps': 109759, 'loss/train': 1.3651715517044067} 11/07/2021 12:40:45 - INFO - __main__ - Step 109761: {'lr': 8.578952862209075e-05, 'samples': 21074112, 'steps': 109760, 'loss/train': 1.0937339067459106} 11/07/2021 12:40:45 - INFO - __main__ - Step 109762: {'lr': 8.57855272234557e-05, 'samples': 21074304, 'steps': 109761, 'loss/train': 1.2353973388671875} 11/07/2021 12:40:46 - INFO - __main__ - Step 109763: {'lr': 8.578152589881318e-05, 'samples': 21074496, 'steps': 109762, 'loss/train': 1.1214522123336792} 11/07/2021 12:40:46 - INFO - __main__ - Step 109764: {'lr': 8.577752464816496e-05, 'samples': 21074688, 'steps': 109763, 'loss/train': 1.5371402502059937} 11/07/2021 12:40:46 - INFO - __main__ - Step 109765: {'lr': 8.577352347151285e-05, 'samples': 21074880, 'steps': 109764, 'loss/train': 1.428184151649475} 11/07/2021 12:40:48 - INFO - __main__ - Step 109766: {'lr': 8.576952236885868e-05, 'samples': 21075072, 'steps': 109765, 'loss/train': 1.3437470197677612} 11/07/2021 12:40:48 - INFO - __main__ - Step 109767: {'lr': 8.57655213402042e-05, 'samples': 21075264, 'steps': 109766, 'loss/train': 1.1108698844909668} 11/07/2021 12:40:48 - INFO - __main__ - Step 109768: {'lr': 8.576152038555132e-05, 'samples': 21075456, 'steps': 109767, 'loss/train': 1.60726797580719} 11/07/2021 12:40:49 - INFO - __main__ - Step 109769: {'lr': 8.575751950490172e-05, 'samples': 21075648, 'steps': 109768, 'loss/train': 1.253921389579773} 11/07/2021 12:40:49 - INFO - __main__ - Step 109770: {'lr': 8.57535186982572e-05, 'samples': 21075840, 'steps': 109769, 'loss/train': 1.4163028001785278} 11/07/2021 12:40:49 - INFO - __main__ - Step 109771: {'lr': 8.574951796561964e-05, 'samples': 21076032, 'steps': 109770, 'loss/train': 1.4390407800674438} 11/07/2021 12:40:50 - INFO - __main__ - Step 109772: {'lr': 8.574551730699082e-05, 'samples': 21076224, 'steps': 109771, 'loss/train': 1.549027919769287} 11/07/2021 12:40:51 - INFO - __main__ - Step 109773: {'lr': 8.574151672237251e-05, 'samples': 21076416, 'steps': 109772, 'loss/train': 1.18636953830719} 11/07/2021 12:40:51 - INFO - __main__ - Step 109774: {'lr': 8.573751621176657e-05, 'samples': 21076608, 'steps': 109773, 'loss/train': 1.051295518875122} 11/07/2021 12:40:51 - INFO - __main__ - Step 109775: {'lr': 8.573351577517474e-05, 'samples': 21076800, 'steps': 109774, 'loss/train': 1.5103155374526978} 11/07/2021 12:40:52 - INFO - __main__ - Step 109776: {'lr': 8.572951541259885e-05, 'samples': 21076992, 'steps': 109775, 'loss/train': 1.4963186979293823} 11/07/2021 12:40:53 - INFO - __main__ - Step 109777: {'lr': 8.572551512404072e-05, 'samples': 21077184, 'steps': 109776, 'loss/train': 1.304079294204712} 11/07/2021 12:40:53 - INFO - __main__ - Step 109778: {'lr': 8.572151490950214e-05, 'samples': 21077376, 'steps': 109777, 'loss/train': 0.7513630986213684} 11/07/2021 12:40:53 - INFO - __main__ - Step 109779: {'lr': 8.57175147689849e-05, 'samples': 21077568, 'steps': 109778, 'loss/train': 1.228804349899292} 11/07/2021 12:40:54 - INFO - __main__ - Step 109780: {'lr': 8.571351470249083e-05, 'samples': 21077760, 'steps': 109779, 'loss/train': 1.2246630191802979} 11/07/2021 12:40:54 - INFO - __main__ - Step 109781: {'lr': 8.570951471002178e-05, 'samples': 21077952, 'steps': 109780, 'loss/train': 0.5473723411560059} 11/07/2021 12:40:55 - INFO - __main__ - Step 109782: {'lr': 8.570551479157942e-05, 'samples': 21078144, 'steps': 109781, 'loss/train': 0.9489910006523132} 11/07/2021 12:40:56 - INFO - __main__ - Step 109783: {'lr': 8.570151494716561e-05, 'samples': 21078336, 'steps': 109782, 'loss/train': 1.7326632738113403} 11/07/2021 12:40:56 - INFO - __main__ - Step 109784: {'lr': 8.569751517678218e-05, 'samples': 21078528, 'steps': 109783, 'loss/train': 1.5036579370498657} 11/07/2021 12:40:56 - INFO - __main__ - Step 109785: {'lr': 8.569351548043089e-05, 'samples': 21078720, 'steps': 109784, 'loss/train': 1.14159095287323} 11/07/2021 12:40:57 - INFO - __main__ - Step 109786: {'lr': 8.568951585811358e-05, 'samples': 21078912, 'steps': 109785, 'loss/train': 0.6580864787101746} 11/07/2021 12:40:58 - INFO - __main__ - Step 109787: {'lr': 8.568551630983201e-05, 'samples': 21079104, 'steps': 109786, 'loss/train': 1.2831569910049438} 11/07/2021 12:40:58 - INFO - __main__ - Step 109788: {'lr': 8.568151683558806e-05, 'samples': 21079296, 'steps': 109787, 'loss/train': 0.9254705905914307} 11/07/2021 12:40:58 - INFO - __main__ - Step 109789: {'lr': 8.567751743538344e-05, 'samples': 21079488, 'steps': 109788, 'loss/train': 1.304762601852417} 11/07/2021 12:40:59 - INFO - __main__ - Step 109790: {'lr': 8.567351810922003e-05, 'samples': 21079680, 'steps': 109789, 'loss/train': 1.3918192386627197} 11/07/2021 12:40:59 - INFO - __main__ - Step 109791: {'lr': 8.566951885709956e-05, 'samples': 21079872, 'steps': 109790, 'loss/train': 1.351511001586914} 11/07/2021 12:41:00 - INFO - __main__ - Step 109792: {'lr': 8.56655196790239e-05, 'samples': 21080064, 'steps': 109791, 'loss/train': 1.5037249326705933} 11/07/2021 12:41:01 - INFO - __main__ - Step 109793: {'lr': 8.566152057499479e-05, 'samples': 21080256, 'steps': 109792, 'loss/train': 1.2113022804260254} 11/07/2021 12:41:01 - INFO - __main__ - Step 109794: {'lr': 8.565752154501411e-05, 'samples': 21080448, 'steps': 109793, 'loss/train': 1.6312719583511353} 11/07/2021 12:41:01 - INFO - __main__ - Step 109795: {'lr': 8.565352258908365e-05, 'samples': 21080640, 'steps': 109794, 'loss/train': 1.3370171785354614} 11/07/2021 12:41:02 - INFO - __main__ - Step 109796: {'lr': 8.564952370720513e-05, 'samples': 21080832, 'steps': 109795, 'loss/train': 1.4649436473846436} 11/07/2021 12:41:03 - INFO - __main__ - Step 109797: {'lr': 8.564552489938037e-05, 'samples': 21081024, 'steps': 109796, 'loss/train': 1.2439969778060913} 11/07/2021 12:41:03 - INFO - __main__ - Step 109798: {'lr': 8.564152616561122e-05, 'samples': 21081216, 'steps': 109797, 'loss/train': 1.5530428886413574} 11/07/2021 12:41:03 - INFO - __main__ - Step 109799: {'lr': 8.563752750589946e-05, 'samples': 21081408, 'steps': 109798, 'loss/train': 1.2392524480819702} 11/07/2021 12:41:04 - INFO - __main__ - Step 109800: {'lr': 8.56335289202469e-05, 'samples': 21081600, 'steps': 109799, 'loss/train': 1.4712456464767456} 11/07/2021 12:41:04 - INFO - __main__ - Step 109801: {'lr': 8.562953040865531e-05, 'samples': 21081792, 'steps': 109800, 'loss/train': 1.4132636785507202} 11/07/2021 12:41:05 - INFO - __main__ - Step 109802: {'lr': 8.562553197112651e-05, 'samples': 21081984, 'steps': 109801, 'loss/train': 1.289312720298767} 11/07/2021 12:41:05 - INFO - __main__ - Step 109803: {'lr': 8.562153360766234e-05, 'samples': 21082176, 'steps': 109802, 'loss/train': 1.3230804204940796} 11/07/2021 12:41:06 - INFO - __main__ - Step 109804: {'lr': 8.561753531826457e-05, 'samples': 21082368, 'steps': 109803, 'loss/train': 1.1987558603286743} 11/07/2021 12:41:06 - INFO - __main__ - Step 109805: {'lr': 8.561353710293499e-05, 'samples': 21082560, 'steps': 109804, 'loss/train': 0.8248642086982727} 11/07/2021 12:41:06 - INFO - __main__ - Step 109806: {'lr': 8.560953896167542e-05, 'samples': 21082752, 'steps': 109805, 'loss/train': 1.134406566619873} 11/07/2021 12:41:08 - INFO - __main__ - Step 109807: {'lr': 8.560554089448766e-05, 'samples': 21082944, 'steps': 109806, 'loss/train': 1.249934196472168} 11/07/2021 12:41:08 - INFO - __main__ - Step 109808: {'lr': 8.560154290137357e-05, 'samples': 21083136, 'steps': 109807, 'loss/train': 1.1257516145706177} 11/07/2021 12:41:09 - INFO - __main__ - Step 109809: {'lr': 8.559754498233483e-05, 'samples': 21083328, 'steps': 109808, 'loss/train': 1.7266919612884521} 11/07/2021 12:41:09 - INFO - __main__ - Step 109810: {'lr': 8.559354713737327e-05, 'samples': 21083520, 'steps': 109809, 'loss/train': 0.360226035118103} 11/07/2021 12:41:09 - INFO - __main__ - Step 109811: {'lr': 8.558954936649074e-05, 'samples': 21083712, 'steps': 109810, 'loss/train': 1.2126262187957764} 11/07/2021 12:41:10 - INFO - __main__ - Step 109812: {'lr': 8.5585551669689e-05, 'samples': 21083904, 'steps': 109811, 'loss/train': 0.945391833782196} 11/07/2021 12:41:11 - INFO - __main__ - Step 109813: {'lr': 8.55815540469699e-05, 'samples': 21084096, 'steps': 109812, 'loss/train': 0.42331987619400024} 11/07/2021 12:41:11 - INFO - __main__ - Step 109814: {'lr': 8.55775564983352e-05, 'samples': 21084288, 'steps': 109813, 'loss/train': 1.3650816679000854} 11/07/2021 12:41:11 - INFO - __main__ - Step 109815: {'lr': 8.557355902378675e-05, 'samples': 21084480, 'steps': 109814, 'loss/train': 1.479638934135437} 11/07/2021 12:41:12 - INFO - __main__ - Step 109816: {'lr': 8.556956162332628e-05, 'samples': 21084672, 'steps': 109815, 'loss/train': 1.296069622039795} 11/07/2021 12:41:12 - INFO - __main__ - Step 109817: {'lr': 8.556556429695564e-05, 'samples': 21084864, 'steps': 109816, 'loss/train': 1.1117889881134033} 11/07/2021 12:41:13 - INFO - __main__ - Step 109818: {'lr': 8.55615670446766e-05, 'samples': 21085056, 'steps': 109817, 'loss/train': 0.9257173538208008} 11/07/2021 12:41:14 - INFO - __main__ - Step 109819: {'lr': 8.555756986649099e-05, 'samples': 21085248, 'steps': 109818, 'loss/train': 1.4570564031600952} 11/07/2021 12:41:14 - INFO - __main__ - Step 109820: {'lr': 8.555357276240062e-05, 'samples': 21085440, 'steps': 109819, 'loss/train': 1.0408133268356323} 11/07/2021 12:41:14 - INFO - __main__ - Step 109821: {'lr': 8.554957573240727e-05, 'samples': 21085632, 'steps': 109820, 'loss/train': 1.2081727981567383} 11/07/2021 12:41:15 - INFO - __main__ - Step 109822: {'lr': 8.554557877651281e-05, 'samples': 21085824, 'steps': 109821, 'loss/train': 1.2054158449172974} 11/07/2021 12:41:15 - INFO - __main__ - Step 109823: {'lr': 8.554158189471889e-05, 'samples': 21086016, 'steps': 109822, 'loss/train': 1.3676501512527466} 11/07/2021 12:41:16 - INFO - __main__ - Step 109824: {'lr': 8.553758508702742e-05, 'samples': 21086208, 'steps': 109823, 'loss/train': 1.509377121925354} 11/07/2021 12:41:16 - INFO - __main__ - Step 109825: {'lr': 8.553358835344015e-05, 'samples': 21086400, 'steps': 109824, 'loss/train': 0.8208490014076233} 11/07/2021 12:41:17 - INFO - __main__ - Step 109826: {'lr': 8.552959169395894e-05, 'samples': 21086592, 'steps': 109825, 'loss/train': 0.9652143120765686} 11/07/2021 12:41:17 - INFO - __main__ - Step 109827: {'lr': 8.552559510858552e-05, 'samples': 21086784, 'steps': 109826, 'loss/train': 1.8170335292816162} 11/07/2021 12:41:17 - INFO - __main__ - Step 109828: {'lr': 8.552159859732176e-05, 'samples': 21086976, 'steps': 109827, 'loss/train': 1.3133357763290405} 11/07/2021 12:41:19 - INFO - __main__ - Step 109829: {'lr': 8.551760216016941e-05, 'samples': 21087168, 'steps': 109828, 'loss/train': 1.2675493955612183} 11/07/2021 12:41:19 - INFO - __main__ - Step 109830: {'lr': 8.551360579713027e-05, 'samples': 21087360, 'steps': 109829, 'loss/train': 1.2750533819198608} 11/07/2021 12:41:19 - INFO - __main__ - Step 109831: {'lr': 8.550960950820619e-05, 'samples': 21087552, 'steps': 109830, 'loss/train': 1.2946105003356934} 11/07/2021 12:41:20 - INFO - __main__ - Step 109832: {'lr': 8.550561329339895e-05, 'samples': 21087744, 'steps': 109831, 'loss/train': 1.5500996112823486} 11/07/2021 12:41:20 - INFO - __main__ - Step 109833: {'lr': 8.550161715271032e-05, 'samples': 21087936, 'steps': 109832, 'loss/train': 1.4916739463806152} 11/07/2021 12:41:20 - INFO - __main__ - Step 109834: {'lr': 8.549762108614215e-05, 'samples': 21088128, 'steps': 109833, 'loss/train': 1.3018308877944946} 11/07/2021 12:41:22 - INFO - __main__ - Step 109835: {'lr': 8.549362509369626e-05, 'samples': 21088320, 'steps': 109834, 'loss/train': 0.05773922801017761} 11/07/2021 12:41:22 - INFO - __main__ - Step 109836: {'lr': 8.548962917537434e-05, 'samples': 21088512, 'steps': 109835, 'loss/train': 1.2336252927780151} 11/07/2021 12:41:22 - INFO - __main__ - Step 109837: {'lr': 8.548563333117826e-05, 'samples': 21088704, 'steps': 109836, 'loss/train': 1.4828004837036133} 11/07/2021 12:41:23 - INFO - __main__ - Step 109838: {'lr': 8.548163756110983e-05, 'samples': 21088896, 'steps': 109837, 'loss/train': 1.5375205278396606} 11/07/2021 12:41:23 - INFO - __main__ - Step 109839: {'lr': 8.547764186517079e-05, 'samples': 21089088, 'steps': 109838, 'loss/train': 1.2931528091430664} 11/07/2021 12:41:24 - INFO - __main__ - Step 109840: {'lr': 8.547364624336301e-05, 'samples': 21089280, 'steps': 109839, 'loss/train': 0.8751877546310425} 11/07/2021 12:41:24 - INFO - __main__ - Step 109841: {'lr': 8.546965069568827e-05, 'samples': 21089472, 'steps': 109840, 'loss/train': 1.4517009258270264} 11/07/2021 12:41:25 - INFO - __main__ - Step 109842: {'lr': 8.546565522214838e-05, 'samples': 21089664, 'steps': 109841, 'loss/train': 1.3138976097106934} 11/07/2021 12:41:25 - INFO - __main__ - Step 109843: {'lr': 8.54616598227451e-05, 'samples': 21089856, 'steps': 109842, 'loss/train': 1.2666525840759277} 11/07/2021 12:41:25 - INFO - __main__ - Step 109844: {'lr': 8.545766449748027e-05, 'samples': 21090048, 'steps': 109843, 'loss/train': 1.8796626329421997} 11/07/2021 12:41:27 - INFO - __main__ - Step 109845: {'lr': 8.545366924635566e-05, 'samples': 21090240, 'steps': 109844, 'loss/train': 1.2019484043121338} 11/07/2021 12:41:27 - INFO - __main__ - Step 109846: {'lr': 8.544967406937313e-05, 'samples': 21090432, 'steps': 109845, 'loss/train': 1.625535011291504} 11/07/2021 12:41:27 - INFO - __main__ - Step 109847: {'lr': 8.544567896653441e-05, 'samples': 21090624, 'steps': 109846, 'loss/train': 1.1499677896499634} 11/07/2021 12:41:28 - INFO - __main__ - Step 109848: {'lr': 8.544168393784132e-05, 'samples': 21090816, 'steps': 109847, 'loss/train': 1.8360041379928589} 11/07/2021 12:41:28 - INFO - __main__ - Step 109849: {'lr': 8.543768898329577e-05, 'samples': 21091008, 'steps': 109848, 'loss/train': 0.8494935631752014} 11/07/2021 12:41:29 - INFO - __main__ - Step 109850: {'lr': 8.543369410289936e-05, 'samples': 21091200, 'steps': 109849, 'loss/train': 0.9077058434486389} 11/07/2021 12:41:29 - INFO - __main__ - Step 109851: {'lr': 8.542969929665397e-05, 'samples': 21091392, 'steps': 109850, 'loss/train': 1.432254433631897} 11/07/2021 12:41:30 - INFO - __main__ - Step 109852: {'lr': 8.542570456456144e-05, 'samples': 21091584, 'steps': 109851, 'loss/train': 1.3194810152053833} 11/07/2021 12:41:30 - INFO - __main__ - Step 109853: {'lr': 8.542170990662357e-05, 'samples': 21091776, 'steps': 109852, 'loss/train': 1.582522988319397} 11/07/2021 12:41:30 - INFO - __main__ - Step 109854: {'lr': 8.541771532284212e-05, 'samples': 21091968, 'steps': 109853, 'loss/train': 1.7777750492095947} 11/07/2021 12:41:32 - INFO - __main__ - Step 109855: {'lr': 8.541372081321888e-05, 'samples': 21092160, 'steps': 109854, 'loss/train': 1.4334421157836914} 11/07/2021 12:41:32 - INFO - __main__ - Step 109856: {'lr': 8.540972637775571e-05, 'samples': 21092352, 'steps': 109855, 'loss/train': 1.2299014329910278} 11/07/2021 12:41:32 - INFO - __main__ - Step 109857: {'lr': 8.540573201645438e-05, 'samples': 21092544, 'steps': 109856, 'loss/train': 1.3030369281768799} 11/07/2021 12:41:33 - INFO - __main__ - Step 109858: {'lr': 8.540173772931667e-05, 'samples': 21092736, 'steps': 109857, 'loss/train': 0.5839546322822571} 11/07/2021 12:41:33 - INFO - __main__ - Step 109859: {'lr': 8.539774351634441e-05, 'samples': 21092928, 'steps': 109858, 'loss/train': 1.4411892890930176} 11/07/2021 12:41:34 - INFO - __main__ - Step 109860: {'lr': 8.539374937753938e-05, 'samples': 21093120, 'steps': 109859, 'loss/train': 2.404362440109253} 11/07/2021 12:41:34 - INFO - __main__ - Step 109861: {'lr': 8.538975531290339e-05, 'samples': 21093312, 'steps': 109860, 'loss/train': 0.5230094790458679} 11/07/2021 12:41:35 - INFO - __main__ - Step 109862: {'lr': 8.53857613224383e-05, 'samples': 21093504, 'steps': 109861, 'loss/train': 1.4962208271026611} 11/07/2021 12:41:35 - INFO - __main__ - Step 109863: {'lr': 8.538176740614578e-05, 'samples': 21093696, 'steps': 109862, 'loss/train': 1.4530322551727295} 11/07/2021 12:41:35 - INFO - __main__ - Step 109864: {'lr': 8.53777735640277e-05, 'samples': 21093888, 'steps': 109863, 'loss/train': 1.185937762260437} 11/07/2021 12:41:36 - INFO - __main__ - Step 109865: {'lr': 8.537377979608586e-05, 'samples': 21094080, 'steps': 109864, 'loss/train': 1.7144160270690918} 11/07/2021 12:41:37 - INFO - __main__ - Step 109866: {'lr': 8.536978610232205e-05, 'samples': 21094272, 'steps': 109865, 'loss/train': 1.3422094583511353} 11/07/2021 12:41:37 - INFO - __main__ - Step 109867: {'lr': 8.536579248273804e-05, 'samples': 21094464, 'steps': 109866, 'loss/train': 1.285503625869751} 11/07/2021 12:41:37 - INFO - __main__ - Step 109868: {'lr': 8.53617989373357e-05, 'samples': 21094656, 'steps': 109867, 'loss/train': 1.4507122039794922} 11/07/2021 12:41:38 - INFO - __main__ - Step 109869: {'lr': 8.53578054661168e-05, 'samples': 21094848, 'steps': 109868, 'loss/train': 1.2696659564971924} 11/07/2021 12:41:39 - INFO - __main__ - Step 109870: {'lr': 8.53538120690831e-05, 'samples': 21095040, 'steps': 109869, 'loss/train': 1.045226812362671} 11/07/2021 12:41:39 - INFO - __main__ - Step 109871: {'lr': 8.534981874623646e-05, 'samples': 21095232, 'steps': 109870, 'loss/train': 1.362764835357666} 11/07/2021 12:41:39 - INFO - __main__ - Step 109872: {'lr': 8.534582549757864e-05, 'samples': 21095424, 'steps': 109871, 'loss/train': 1.3895514011383057} 11/07/2021 12:41:40 - INFO - __main__ - Step 109873: {'lr': 8.534183232311143e-05, 'samples': 21095616, 'steps': 109872, 'loss/train': 1.0102277994155884} 11/07/2021 12:41:40 - INFO - __main__ - Step 109874: {'lr': 8.533783922283669e-05, 'samples': 21095808, 'steps': 109873, 'loss/train': 1.4873721599578857} 11/07/2021 12:41:41 - INFO - __main__ - Step 109875: {'lr': 8.533384619675616e-05, 'samples': 21096000, 'steps': 109874, 'loss/train': 0.5045526027679443} 11/07/2021 12:41:42 - INFO - __main__ - Step 109876: {'lr': 8.532985324487173e-05, 'samples': 21096192, 'steps': 109875, 'loss/train': 1.1611766815185547} 11/07/2021 12:41:42 - INFO - __main__ - Step 109877: {'lr': 8.532586036718503e-05, 'samples': 21096384, 'steps': 109876, 'loss/train': 1.4966243505477905} 11/07/2021 12:41:43 - INFO - __main__ - Step 109878: {'lr': 8.5321867563698e-05, 'samples': 21096576, 'steps': 109877, 'loss/train': 0.8758857250213623} 11/07/2021 12:41:43 - INFO - __main__ - Step 109879: {'lr': 8.531787483441236e-05, 'samples': 21096768, 'steps': 109878, 'loss/train': 1.1517088413238525} 11/07/2021 12:41:43 - INFO - __main__ - Step 109880: {'lr': 8.531388217932998e-05, 'samples': 21096960, 'steps': 109879, 'loss/train': 0.5903496742248535} 11/07/2021 12:41:44 - INFO - __main__ - Step 109881: {'lr': 8.53098895984526e-05, 'samples': 21097152, 'steps': 109880, 'loss/train': 1.5686513185501099} 11/07/2021 12:41:45 - INFO - __main__ - Step 109882: {'lr': 8.530589709178204e-05, 'samples': 21097344, 'steps': 109881, 'loss/train': 0.11142381280660629} 11/07/2021 12:41:45 - INFO - __main__ - Step 109883: {'lr': 8.530190465932011e-05, 'samples': 21097536, 'steps': 109882, 'loss/train': 0.4852885901927948} 11/07/2021 12:41:45 - INFO - __main__ - Step 109884: {'lr': 8.529791230106859e-05, 'samples': 21097728, 'steps': 109883, 'loss/train': 1.4601788520812988} 11/07/2021 12:41:46 - INFO - __main__ - Step 109885: {'lr': 8.529392001702929e-05, 'samples': 21097920, 'steps': 109884, 'loss/train': 1.30715012550354} 11/07/2021 12:41:47 - INFO - __main__ - Step 109886: {'lr': 8.528992780720402e-05, 'samples': 21098112, 'steps': 109885, 'loss/train': 1.218582272529602} 11/07/2021 12:41:48 - INFO - __main__ - Step 109887: {'lr': 8.528593567159456e-05, 'samples': 21098304, 'steps': 109886, 'loss/train': 1.0218369960784912} 11/07/2021 12:41:48 - INFO - __main__ - Step 109888: {'lr': 8.528194361020272e-05, 'samples': 21098496, 'steps': 109887, 'loss/train': 1.1335440874099731} 11/07/2021 12:41:48 - INFO - __main__ - Step 109889: {'lr': 8.527795162303037e-05, 'samples': 21098688, 'steps': 109888, 'loss/train': 0.5908734202384949} 11/07/2021 12:41:49 - INFO - __main__ - Step 109890: {'lr': 8.527395971007914e-05, 'samples': 21098880, 'steps': 109889, 'loss/train': 0.9868991374969482} 11/07/2021 12:41:50 - INFO - __main__ - Step 109891: {'lr': 8.526996787135094e-05, 'samples': 21099072, 'steps': 109890, 'loss/train': 1.1868191957473755} 11/07/2021 12:41:50 - INFO - __main__ - Step 109892: {'lr': 8.526597610684755e-05, 'samples': 21099264, 'steps': 109891, 'loss/train': 1.3996704816818237} 11/07/2021 12:41:50 - INFO - __main__ - Step 109893: {'lr': 8.526198441657077e-05, 'samples': 21099456, 'steps': 109892, 'loss/train': 1.2035269737243652} 11/07/2021 12:41:51 - INFO - __main__ - Step 109894: {'lr': 8.52579928005224e-05, 'samples': 21099648, 'steps': 109893, 'loss/train': 1.3418415784835815} 11/07/2021 12:41:51 - INFO - __main__ - Step 109895: {'lr': 8.525400125870422e-05, 'samples': 21099840, 'steps': 109894, 'loss/train': 1.3452590703964233} 11/07/2021 12:41:51 - INFO - __main__ - Step 109896: {'lr': 8.525000979111806e-05, 'samples': 21100032, 'steps': 109895, 'loss/train': 1.4678651094436646} 11/07/2021 12:41:52 - INFO - __main__ - Step 109897: {'lr': 8.52460183977657e-05, 'samples': 21100224, 'steps': 109896, 'loss/train': 0.799149751663208} 11/07/2021 12:41:53 - INFO - __main__ - Step 109898: {'lr': 8.524202707864892e-05, 'samples': 21100416, 'steps': 109897, 'loss/train': 1.28591787815094} 11/07/2021 12:41:53 - INFO - __main__ - Step 109899: {'lr': 8.523803583376957e-05, 'samples': 21100608, 'steps': 109898, 'loss/train': 1.0232174396514893} 11/07/2021 12:41:54 - INFO - __main__ - Step 109900: {'lr': 8.523404466312951e-05, 'samples': 21100800, 'steps': 109899, 'loss/train': 1.386069416999817} 11/07/2021 12:41:54 - INFO - __main__ - Step 109901: {'lr': 8.523005356673032e-05, 'samples': 21100992, 'steps': 109900, 'loss/train': 1.4492307901382446} 11/07/2021 12:41:55 - INFO - __main__ - Step 109902: {'lr': 8.522606254457396e-05, 'samples': 21101184, 'steps': 109901, 'loss/train': 1.3476585149765015} 11/07/2021 12:41:55 - INFO - __main__ - Step 109903: {'lr': 8.522207159666217e-05, 'samples': 21101376, 'steps': 109902, 'loss/train': 1.7593908309936523} 11/07/2021 12:41:56 - INFO - __main__ - Step 109904: {'lr': 8.52180807229968e-05, 'samples': 21101568, 'steps': 109903, 'loss/train': 1.1418062448501587} 11/07/2021 12:41:56 - INFO - __main__ - Step 109905: {'lr': 8.52140899235796e-05, 'samples': 21101760, 'steps': 109904, 'loss/train': 1.393298864364624} 11/07/2021 12:41:56 - INFO - __main__ - Step 109906: {'lr': 8.521009919841242e-05, 'samples': 21101952, 'steps': 109905, 'loss/train': 0.06595554947853088} 11/07/2021 12:41:58 - INFO - __main__ - Step 109907: {'lr': 8.520610854749697e-05, 'samples': 21102144, 'steps': 109906, 'loss/train': 1.4954631328582764} 11/07/2021 12:41:58 - INFO - __main__ - Step 109908: {'lr': 8.520211797083516e-05, 'samples': 21102336, 'steps': 109907, 'loss/train': 1.5076755285263062} 11/07/2021 12:41:58 - INFO - __main__ - Step 109909: {'lr': 8.51981274684287e-05, 'samples': 21102528, 'steps': 109908, 'loss/train': 1.291897177696228} 11/07/2021 12:41:59 - INFO - __main__ - Step 109910: {'lr': 8.519413704027942e-05, 'samples': 21102720, 'steps': 109909, 'loss/train': 1.282742977142334} 11/07/2021 12:41:59 - INFO - __main__ - Step 109911: {'lr': 8.519014668638922e-05, 'samples': 21102912, 'steps': 109910, 'loss/train': 1.7082209587097168} 11/07/2021 12:42:00 - INFO - __main__ - Step 109912: {'lr': 8.51861564067597e-05, 'samples': 21103104, 'steps': 109911, 'loss/train': 1.1625052690505981} 11/07/2021 12:42:01 - INFO - __main__ - Step 109913: {'lr': 8.518216620139273e-05, 'samples': 21103296, 'steps': 109912, 'loss/train': 1.4524483680725098} 11/07/2021 12:42:01 - INFO - __main__ - Step 109914: {'lr': 8.517817607029019e-05, 'samples': 21103488, 'steps': 109913, 'loss/train': 1.438080906867981} 11/07/2021 12:42:01 - INFO - __main__ - Step 109915: {'lr': 8.517418601345378e-05, 'samples': 21103680, 'steps': 109914, 'loss/train': 1.2599987983703613} 11/07/2021 12:42:02 - INFO - __main__ - Step 109916: {'lr': 8.517019603088536e-05, 'samples': 21103872, 'steps': 109915, 'loss/train': 1.1746587753295898} 11/07/2021 12:42:03 - INFO - __main__ - Step 109917: {'lr': 8.516620612258668e-05, 'samples': 21104064, 'steps': 109916, 'loss/train': 1.1908900737762451} 11/07/2021 12:42:03 - INFO - __main__ - Step 109918: {'lr': 8.516221628855961e-05, 'samples': 21104256, 'steps': 109917, 'loss/train': 1.2126396894454956} 11/07/2021 12:42:03 - INFO - __main__ - Step 109919: {'lr': 8.515822652880586e-05, 'samples': 21104448, 'steps': 109918, 'loss/train': 1.2736448049545288} 11/07/2021 12:42:04 - INFO - __main__ - Step 109920: {'lr': 8.515423684332726e-05, 'samples': 21104640, 'steps': 109919, 'loss/train': 1.2176604270935059} 11/07/2021 12:42:04 - INFO - __main__ - Step 109921: {'lr': 8.515024723212566e-05, 'samples': 21104832, 'steps': 109920, 'loss/train': 2.0584614276885986} 11/07/2021 12:42:04 - INFO - __main__ - Step 109922: {'lr': 8.514625769520288e-05, 'samples': 21105024, 'steps': 109921, 'loss/train': 1.5469715595245361} 11/07/2021 12:42:06 - INFO - __main__ - Step 109923: {'lr': 8.514226823256054e-05, 'samples': 21105216, 'steps': 109922, 'loss/train': 1.0165220499038696} 11/07/2021 12:42:06 - INFO - __main__ - Step 109924: {'lr': 8.513827884420059e-05, 'samples': 21105408, 'steps': 109923, 'loss/train': 1.3360859155654907} 11/07/2021 12:42:06 - INFO - __main__ - Step 109925: {'lr': 8.513428953012478e-05, 'samples': 21105600, 'steps': 109924, 'loss/train': 1.0168551206588745} 11/07/2021 12:42:07 - INFO - __main__ - Step 109926: {'lr': 8.513030029033492e-05, 'samples': 21105792, 'steps': 109925, 'loss/train': 1.4960423707962036} 11/07/2021 12:42:07 - INFO - __main__ - Step 109927: {'lr': 8.512631112483276e-05, 'samples': 21105984, 'steps': 109926, 'loss/train': 0.5322497487068176} 11/07/2021 12:42:08 - INFO - __main__ - Step 109928: {'lr': 8.512232203362019e-05, 'samples': 21106176, 'steps': 109927, 'loss/train': 1.2730845212936401} 11/07/2021 12:42:09 - INFO - __main__ - Step 109929: {'lr': 8.511833301669894e-05, 'samples': 21106368, 'steps': 109928, 'loss/train': 1.3002873659133911} 11/07/2021 12:42:09 - INFO - __main__ - Step 109930: {'lr': 8.511434407407082e-05, 'samples': 21106560, 'steps': 109929, 'loss/train': 1.0652353763580322} 11/07/2021 12:42:09 - INFO - __main__ - Step 109931: {'lr': 8.511035520573764e-05, 'samples': 21106752, 'steps': 109930, 'loss/train': 1.2545450925827026} 11/07/2021 12:42:10 - INFO - __main__ - Step 109932: {'lr': 8.510636641170119e-05, 'samples': 21106944, 'steps': 109931, 'loss/train': 0.053558483719825745} 11/07/2021 12:42:11 - INFO - __main__ - Step 109933: {'lr': 8.510237769196333e-05, 'samples': 21107136, 'steps': 109932, 'loss/train': 1.2743200063705444} 11/07/2021 12:42:11 - INFO - __main__ - Step 109934: {'lr': 8.509838904652573e-05, 'samples': 21107328, 'steps': 109933, 'loss/train': 1.2598135471343994} 11/07/2021 12:42:11 - INFO - __main__ - Step 109935: {'lr': 8.509440047539024e-05, 'samples': 21107520, 'steps': 109934, 'loss/train': 1.496118426322937} 11/07/2021 12:42:12 - INFO - __main__ - Step 109936: {'lr': 8.509041197855869e-05, 'samples': 21107712, 'steps': 109935, 'loss/train': 0.5063716769218445} 11/07/2021 12:42:12 - INFO - __main__ - Step 109937: {'lr': 8.508642355603285e-05, 'samples': 21107904, 'steps': 109936, 'loss/train': 1.2654772996902466} 11/07/2021 12:42:13 - INFO - __main__ - Step 109938: {'lr': 8.50824352078145e-05, 'samples': 21108096, 'steps': 109937, 'loss/train': 1.1044973134994507} 11/07/2021 12:42:13 - INFO - __main__ - Step 109939: {'lr': 8.507844693390548e-05, 'samples': 21108288, 'steps': 109938, 'loss/train': 1.4128179550170898} 11/07/2021 12:42:14 - INFO - __main__ - Step 109940: {'lr': 8.507445873430756e-05, 'samples': 21108480, 'steps': 109939, 'loss/train': 1.1562596559524536} 11/07/2021 12:42:14 - INFO - __main__ - Step 109941: {'lr': 8.507047060902253e-05, 'samples': 21108672, 'steps': 109940, 'loss/train': 1.388988971710205} 11/07/2021 12:42:15 - INFO - __main__ - Step 109942: {'lr': 8.506648255805225e-05, 'samples': 21108864, 'steps': 109941, 'loss/train': 1.3916295766830444} 11/07/2021 12:42:16 - INFO - __main__ - Step 109943: {'lr': 8.506249458139843e-05, 'samples': 21109056, 'steps': 109942, 'loss/train': 1.1766966581344604} 11/07/2021 12:42:16 - INFO - __main__ - Step 109944: {'lr': 8.505850667906298e-05, 'samples': 21109248, 'steps': 109943, 'loss/train': 0.06015947833657265} 11/07/2021 12:42:16 - INFO - __main__ - Step 109945: {'lr': 8.505451885104756e-05, 'samples': 21109440, 'steps': 109944, 'loss/train': 1.5676755905151367} 11/07/2021 12:42:17 - INFO - __main__ - Step 109946: {'lr': 8.5050531097354e-05, 'samples': 21109632, 'steps': 109945, 'loss/train': 1.5035978555679321} 11/07/2021 12:42:17 - INFO - __main__ - Step 109947: {'lr': 8.504654341798415e-05, 'samples': 21109824, 'steps': 109946, 'loss/train': 1.3091926574707031} 11/07/2021 12:42:18 - INFO - __main__ - Step 109948: {'lr': 8.50425558129398e-05, 'samples': 21110016, 'steps': 109947, 'loss/train': 1.6581209897994995} 11/07/2021 12:42:19 - INFO - __main__ - Step 109949: {'lr': 8.50385682822227e-05, 'samples': 21110208, 'steps': 109948, 'loss/train': 1.616438627243042} 11/07/2021 12:42:19 - INFO - __main__ - Step 109950: {'lr': 8.503458082583468e-05, 'samples': 21110400, 'steps': 109949, 'loss/train': 1.619005560874939} 11/07/2021 12:42:19 - INFO - __main__ - Step 109951: {'lr': 8.503059344377754e-05, 'samples': 21110592, 'steps': 109950, 'loss/train': 1.3247618675231934} 11/07/2021 12:42:20 - INFO - __main__ - Step 109952: {'lr': 8.502660613605303e-05, 'samples': 21110784, 'steps': 109951, 'loss/train': 1.0169963836669922} 11/07/2021 12:42:21 - INFO - __main__ - Step 109953: {'lr': 8.502261890266302e-05, 'samples': 21110976, 'steps': 109952, 'loss/train': 1.3617208003997803} 11/07/2021 12:42:21 - INFO - __main__ - Step 109954: {'lr': 8.501863174360927e-05, 'samples': 21111168, 'steps': 109953, 'loss/train': 1.4423296451568604} 11/07/2021 12:42:21 - INFO - __main__ - Step 109955: {'lr': 8.501464465889358e-05, 'samples': 21111360, 'steps': 109954, 'loss/train': 1.3869887590408325} 11/07/2021 12:42:22 - INFO - __main__ - Step 109956: {'lr': 8.501065764851784e-05, 'samples': 21111552, 'steps': 109955, 'loss/train': 1.4052060842514038} 11/07/2021 12:42:22 - INFO - __main__ - Step 109957: {'lr': 8.500667071248366e-05, 'samples': 21111744, 'steps': 109956, 'loss/train': 1.293920636177063} 11/07/2021 12:42:23 - INFO - __main__ - Step 109958: {'lr': 8.500268385079294e-05, 'samples': 21111936, 'steps': 109957, 'loss/train': 1.4787518978118896} 11/07/2021 12:42:23 - INFO - __main__ - Step 109959: {'lr': 8.499869706344742e-05, 'samples': 21112128, 'steps': 109958, 'loss/train': 1.1078468561172485} 11/07/2021 12:42:24 - INFO - __main__ - Step 109960: {'lr': 8.499471035044897e-05, 'samples': 21112320, 'steps': 109959, 'loss/train': 1.2702616453170776} 11/07/2021 12:42:24 - INFO - __main__ - Step 109961: {'lr': 8.499072371179936e-05, 'samples': 21112512, 'steps': 109960, 'loss/train': 1.132218837738037} 11/07/2021 12:42:24 - INFO - __main__ - Step 109962: {'lr': 8.498673714750038e-05, 'samples': 21112704, 'steps': 109961, 'loss/train': 1.3670017719268799} 11/07/2021 12:42:26 - INFO - __main__ - Step 109963: {'lr': 8.498275065755384e-05, 'samples': 21112896, 'steps': 109962, 'loss/train': 0.6962500810623169} 11/07/2021 12:42:26 - INFO - __main__ - Step 109964: {'lr': 8.497876424196152e-05, 'samples': 21113088, 'steps': 109963, 'loss/train': 0.9703274965286255} 11/07/2021 12:42:26 - INFO - __main__ - Step 109965: {'lr': 8.497477790072522e-05, 'samples': 21113280, 'steps': 109964, 'loss/train': 5.669217586517334} 11/07/2021 12:42:27 - INFO - __main__ - Step 109966: {'lr': 8.497079163384674e-05, 'samples': 21113472, 'steps': 109965, 'loss/train': 1.1774767637252808} 11/07/2021 12:42:27 - INFO - __main__ - Step 109967: {'lr': 8.496680544132788e-05, 'samples': 21113664, 'steps': 109966, 'loss/train': 1.161854863166809} 11/07/2021 12:42:27 - INFO - __main__ - Step 109968: {'lr': 8.49628193231704e-05, 'samples': 21113856, 'steps': 109967, 'loss/train': 1.1333671808242798} 11/07/2021 12:42:28 - INFO - __main__ - Step 109969: {'lr': 8.495883327937615e-05, 'samples': 21114048, 'steps': 109968, 'loss/train': 1.4454785585403442} 11/07/2021 12:42:29 - INFO - __main__ - Step 109970: {'lr': 8.495484730994699e-05, 'samples': 21114240, 'steps': 109969, 'loss/train': 0.9199925065040588} 11/07/2021 12:42:29 - INFO - __main__ - Step 109971: {'lr': 8.495086141488454e-05, 'samples': 21114432, 'steps': 109970, 'loss/train': 1.3393391370773315} 11/07/2021 12:42:29 - INFO - __main__ - Step 109972: {'lr': 8.494687559419071e-05, 'samples': 21114624, 'steps': 109971, 'loss/train': 1.7542757987976074} 11/07/2021 12:42:30 - INFO - __main__ - Step 109973: {'lr': 8.494288984786724e-05, 'samples': 21114816, 'steps': 109972, 'loss/train': 1.3624187707901} 11/07/2021 12:42:31 - INFO - __main__ - Step 109974: {'lr': 8.493890417591596e-05, 'samples': 21115008, 'steps': 109973, 'loss/train': 1.3246873617172241} 11/07/2021 12:42:31 - INFO - __main__ - Step 109975: {'lr': 8.493491857833865e-05, 'samples': 21115200, 'steps': 109974, 'loss/train': 1.0060242414474487} 11/07/2021 12:42:32 - INFO - __main__ - Step 109976: {'lr': 8.493093305513716e-05, 'samples': 21115392, 'steps': 109975, 'loss/train': 1.3315554857254028} 11/07/2021 12:42:32 - INFO - __main__ - Step 109977: {'lr': 8.49269476063132e-05, 'samples': 21115584, 'steps': 109976, 'loss/train': 0.9383650422096252} 11/07/2021 12:42:32 - INFO - __main__ - Step 109978: {'lr': 8.492296223186866e-05, 'samples': 21115776, 'steps': 109977, 'loss/train': 1.2113957405090332} 11/07/2021 12:42:33 - INFO - __main__ - Step 109979: {'lr': 8.491897693180523e-05, 'samples': 21115968, 'steps': 109978, 'loss/train': 1.1805874109268188} 11/07/2021 12:42:34 - INFO - __main__ - Step 109980: {'lr': 8.491499170612479e-05, 'samples': 21116160, 'steps': 109979, 'loss/train': 1.5884130001068115} 11/07/2021 12:42:34 - INFO - __main__ - Step 109981: {'lr': 8.491100655482911e-05, 'samples': 21116352, 'steps': 109980, 'loss/train': 1.0463392734527588} 11/07/2021 12:42:34 - INFO - __main__ - Step 109982: {'lr': 8.490702147791998e-05, 'samples': 21116544, 'steps': 109981, 'loss/train': 1.2874287366867065} 11/07/2021 12:42:35 - INFO - __main__ - Step 109983: {'lr': 8.490303647539929e-05, 'samples': 21116736, 'steps': 109982, 'loss/train': 1.269610047340393} 11/07/2021 12:42:35 - INFO - __main__ - Step 109984: {'lr': 8.489905154726865e-05, 'samples': 21116928, 'steps': 109983, 'loss/train': 1.348105549812317} 11/07/2021 12:42:36 - INFO - __main__ - Step 109985: {'lr': 8.489506669352995e-05, 'samples': 21117120, 'steps': 109984, 'loss/train': 1.0644090175628662} 11/07/2021 12:42:36 - INFO - __main__ - Step 109986: {'lr': 8.489108191418499e-05, 'samples': 21117312, 'steps': 109985, 'loss/train': 1.2548094987869263} 11/07/2021 12:42:37 - INFO - __main__ - Step 109987: {'lr': 8.488709720923554e-05, 'samples': 21117504, 'steps': 109986, 'loss/train': 1.0693262815475464} 11/07/2021 12:42:37 - INFO - __main__ - Step 109988: {'lr': 8.488311257868344e-05, 'samples': 21117696, 'steps': 109987, 'loss/train': 1.449662446975708} 11/07/2021 12:42:38 - INFO - __main__ - Step 109989: {'lr': 8.487912802253045e-05, 'samples': 21117888, 'steps': 109988, 'loss/train': 0.24267618358135223} 11/07/2021 12:42:38 - INFO - __main__ - Step 109990: {'lr': 8.487514354077839e-05, 'samples': 21118080, 'steps': 109989, 'loss/train': 1.454827904701233} 11/07/2021 12:42:39 - INFO - __main__ - Step 109991: {'lr': 8.487115913342902e-05, 'samples': 21118272, 'steps': 109990, 'loss/train': 1.5184937715530396} 11/07/2021 12:42:39 - INFO - __main__ - Step 109992: {'lr': 8.48671748004842e-05, 'samples': 21118464, 'steps': 109991, 'loss/train': 1.3481346368789673} 11/07/2021 12:42:40 - INFO - __main__ - Step 109993: {'lr': 8.486319054194563e-05, 'samples': 21118656, 'steps': 109992, 'loss/train': 1.5117989778518677} 11/07/2021 12:42:40 - INFO - __main__ - Step 109994: {'lr': 8.485920635781519e-05, 'samples': 21118848, 'steps': 109993, 'loss/train': 1.4475911855697632} 11/07/2021 12:42:41 - INFO - __main__ - Step 109995: {'lr': 8.485522224809464e-05, 'samples': 21119040, 'steps': 109994, 'loss/train': 1.5957809686660767} 11/07/2021 12:42:41 - INFO - __main__ - Step 109996: {'lr': 8.485123821278579e-05, 'samples': 21119232, 'steps': 109995, 'loss/train': 1.3953895568847656} 11/07/2021 12:42:42 - INFO - __main__ - Step 109997: {'lr': 8.484725425189047e-05, 'samples': 21119424, 'steps': 109996, 'loss/train': 1.4087443351745605} 11/07/2021 12:42:42 - INFO - __main__ - Step 109998: {'lr': 8.484327036541037e-05, 'samples': 21119616, 'steps': 109997, 'loss/train': 1.2319884300231934} 11/07/2021 12:42:43 - INFO - __main__ - Step 109999: {'lr': 8.483928655334733e-05, 'samples': 21119808, 'steps': 109998, 'loss/train': 1.0583456754684448} 11/07/2021 12:42:44 - INFO - __main__ - Step 110000: {'lr': 8.483530281570317e-05, 'samples': 21120000, 'steps': 109999, 'loss/train': 1.4805086851119995} 11/07/2021 12:42:44 - INFO - __main__ - Step 110001: {'lr': 8.483131915247969e-05, 'samples': 21120192, 'steps': 110000, 'loss/train': 1.6772103309631348} 11/07/2021 12:42:44 - INFO - __main__ - Step 110002: {'lr': 8.482733556367864e-05, 'samples': 21120384, 'steps': 110001, 'loss/train': 1.3454021215438843} 11/07/2021 12:42:45 - INFO - __main__ - Step 110003: {'lr': 8.482335204930186e-05, 'samples': 21120576, 'steps': 110002, 'loss/train': 1.5655955076217651} 11/07/2021 12:42:45 - INFO - __main__ - Step 110004: {'lr': 8.481936860935113e-05, 'samples': 21120768, 'steps': 110003, 'loss/train': 1.2387104034423828} 11/07/2021 12:42:45 - INFO - __main__ - Step 110005: {'lr': 8.481538524382823e-05, 'samples': 21120960, 'steps': 110004, 'loss/train': 1.2738308906555176} 11/07/2021 12:42:46 - INFO - __main__ - Step 110006: {'lr': 8.481140195273498e-05, 'samples': 21121152, 'steps': 110005, 'loss/train': 0.6470704674720764} 11/07/2021 12:42:47 - INFO - __main__ - Step 110007: {'lr': 8.480741873607318e-05, 'samples': 21121344, 'steps': 110006, 'loss/train': 1.395994782447815} 11/07/2021 12:42:47 - INFO - __main__ - Step 110008: {'lr': 8.480343559384457e-05, 'samples': 21121536, 'steps': 110007, 'loss/train': 1.023787498474121} 11/07/2021 12:42:47 - INFO - __main__ - Step 110009: {'lr': 8.479945252605101e-05, 'samples': 21121728, 'steps': 110008, 'loss/train': 1.3334224224090576} 11/07/2021 12:42:48 - INFO - __main__ - Step 110010: {'lr': 8.479546953269434e-05, 'samples': 21121920, 'steps': 110009, 'loss/train': 1.2924777269363403} 11/07/2021 12:42:50 - INFO - __main__ - Step 110011: {'lr': 8.479148661377619e-05, 'samples': 21122112, 'steps': 110010, 'loss/train': 1.0071734189987183} 11/07/2021 12:42:50 - INFO - __main__ - Step 110012: {'lr': 8.478750376929848e-05, 'samples': 21122304, 'steps': 110011, 'loss/train': 1.8097130060195923} 11/07/2021 12:42:50 - INFO - __main__ - Step 110013: {'lr': 8.478352099926293e-05, 'samples': 21122496, 'steps': 110012, 'loss/train': 1.4649488925933838} 11/07/2021 12:42:51 - INFO - __main__ - Step 110014: {'lr': 8.477953830367141e-05, 'samples': 21122688, 'steps': 110013, 'loss/train': 1.4703574180603027} 11/07/2021 12:42:51 - INFO - __main__ - Step 110015: {'lr': 8.477555568252566e-05, 'samples': 21122880, 'steps': 110014, 'loss/train': 1.5166524648666382} 11/07/2021 12:42:51 - INFO - __main__ - Step 110016: {'lr': 8.477157313582751e-05, 'samples': 21123072, 'steps': 110015, 'loss/train': 1.259495735168457} 11/07/2021 12:42:52 - INFO - __main__ - Step 110017: {'lr': 8.476759066357873e-05, 'samples': 21123264, 'steps': 110016, 'loss/train': 1.036316990852356} 11/07/2021 12:42:53 - INFO - __main__ - Step 110018: {'lr': 8.476360826578112e-05, 'samples': 21123456, 'steps': 110017, 'loss/train': 1.3096281290054321} 11/07/2021 12:42:53 - INFO - __main__ - Step 110019: {'lr': 8.475962594243647e-05, 'samples': 21123648, 'steps': 110018, 'loss/train': 1.5705046653747559} 11/07/2021 12:42:54 - INFO - __main__ - Step 110020: {'lr': 8.47556436935466e-05, 'samples': 21123840, 'steps': 110019, 'loss/train': 1.1508634090423584} 11/07/2021 12:42:54 - INFO - __main__ - Step 110021: {'lr': 8.47516615191133e-05, 'samples': 21124032, 'steps': 110020, 'loss/train': 1.163988709449768} 11/07/2021 12:42:54 - INFO - __main__ - Step 110022: {'lr': 8.474767941913831e-05, 'samples': 21124224, 'steps': 110021, 'loss/train': 1.1636452674865723} 11/07/2021 12:42:55 - INFO - __main__ - Step 110023: {'lr': 8.474369739362359e-05, 'samples': 21124416, 'steps': 110022, 'loss/train': 1.3961578607559204} 11/07/2021 12:42:56 - INFO - __main__ - Step 110024: {'lr': 8.47397154425707e-05, 'samples': 21124608, 'steps': 110023, 'loss/train': 1.3495502471923828} 11/07/2021 12:42:56 - INFO - __main__ - Step 110025: {'lr': 8.473573356598155e-05, 'samples': 21124800, 'steps': 110024, 'loss/train': 1.039428472518921} 11/07/2021 12:42:56 - INFO - __main__ - Step 110026: {'lr': 8.473175176385795e-05, 'samples': 21124992, 'steps': 110025, 'loss/train': 1.656488060951233} 11/07/2021 12:42:57 - INFO - __main__ - Step 110027: {'lr': 8.472777003620164e-05, 'samples': 21125184, 'steps': 110026, 'loss/train': 1.0460890531539917} 11/07/2021 12:42:58 - INFO - __main__ - Step 110028: {'lr': 8.472378838301445e-05, 'samples': 21125376, 'steps': 110027, 'loss/train': 1.5365498065948486} 11/07/2021 12:42:58 - INFO - __main__ - Step 110029: {'lr': 8.471980680429819e-05, 'samples': 21125568, 'steps': 110028, 'loss/train': 1.4254387617111206} 11/07/2021 12:42:58 - INFO - __main__ - Step 110030: {'lr': 8.471582530005462e-05, 'samples': 21125760, 'steps': 110029, 'loss/train': 0.688463568687439} 11/07/2021 12:42:59 - INFO - __main__ - Step 110031: {'lr': 8.471184387028555e-05, 'samples': 21125952, 'steps': 110030, 'loss/train': 1.4769057035446167} 11/07/2021 12:42:59 - INFO - __main__ - Step 110032: {'lr': 8.470786251499279e-05, 'samples': 21126144, 'steps': 110031, 'loss/train': 1.1981526613235474} 11/07/2021 12:43:00 - INFO - __main__ - Step 110033: {'lr': 8.470388123417811e-05, 'samples': 21126336, 'steps': 110032, 'loss/train': 1.1784805059432983} 11/07/2021 12:43:01 - INFO - __main__ - Step 110034: {'lr': 8.469990002784328e-05, 'samples': 21126528, 'steps': 110033, 'loss/train': 1.0691090822219849} 11/07/2021 12:43:01 - INFO - __main__ - Step 110035: {'lr': 8.469591889599016e-05, 'samples': 21126720, 'steps': 110034, 'loss/train': 1.262725830078125} 11/07/2021 12:43:01 - INFO - __main__ - Step 110036: {'lr': 8.46919378386205e-05, 'samples': 21126912, 'steps': 110035, 'loss/train': 1.2245509624481201} 11/07/2021 12:43:02 - INFO - __main__ - Step 110037: {'lr': 8.468795685573619e-05, 'samples': 21127104, 'steps': 110036, 'loss/train': 1.0520365238189697} 11/07/2021 12:43:03 - INFO - __main__ - Step 110038: {'lr': 8.468397594733884e-05, 'samples': 21127296, 'steps': 110037, 'loss/train': 1.352707028388977} 11/07/2021 12:43:03 - INFO - __main__ - Step 110039: {'lr': 8.467999511343033e-05, 'samples': 21127488, 'steps': 110038, 'loss/train': 1.225661277770996} 11/07/2021 12:43:03 - INFO - __main__ - Step 110040: {'lr': 8.467601435401249e-05, 'samples': 21127680, 'steps': 110039, 'loss/train': 0.8535853028297424} 11/07/2021 12:43:04 - INFO - __main__ - Step 110041: {'lr': 8.467203366908707e-05, 'samples': 21127872, 'steps': 110040, 'loss/train': 1.5119603872299194} 11/07/2021 12:43:04 - INFO - __main__ - Step 110042: {'lr': 8.466805305865588e-05, 'samples': 21128064, 'steps': 110041, 'loss/train': 0.860254168510437} 11/07/2021 12:43:04 - INFO - __main__ - Step 110043: {'lr': 8.466407252272071e-05, 'samples': 21128256, 'steps': 110042, 'loss/train': 1.4334907531738281} 11/07/2021 12:43:05 - INFO - __main__ - Step 110044: {'lr': 8.466009206128337e-05, 'samples': 21128448, 'steps': 110043, 'loss/train': 0.9766755700111389} 11/07/2021 12:43:06 - INFO - __main__ - Step 110045: {'lr': 8.465611167434564e-05, 'samples': 21128640, 'steps': 110044, 'loss/train': 0.5460708737373352} 11/07/2021 12:43:06 - INFO - __main__ - Step 110046: {'lr': 8.465213136190931e-05, 'samples': 21128832, 'steps': 110045, 'loss/train': 1.3510174751281738} 11/07/2021 12:43:07 - INFO - __main__ - Step 110047: {'lr': 8.464815112397617e-05, 'samples': 21129024, 'steps': 110046, 'loss/train': 1.6820693016052246} 11/07/2021 12:43:07 - INFO - __main__ - Step 110048: {'lr': 8.464417096054804e-05, 'samples': 21129216, 'steps': 110047, 'loss/train': 1.2617851495742798} 11/07/2021 12:43:08 - INFO - __main__ - Step 110049: {'lr': 8.46401908716267e-05, 'samples': 21129408, 'steps': 110048, 'loss/train': 1.330229640007019} 11/07/2021 12:43:08 - INFO - __main__ - Step 110050: {'lr': 8.463621085721398e-05, 'samples': 21129600, 'steps': 110049, 'loss/train': 0.9833459854125977} 11/07/2021 12:43:09 - INFO - __main__ - Step 110051: {'lr': 8.46322309173116e-05, 'samples': 21129792, 'steps': 110050, 'loss/train': 1.0564686059951782} 11/07/2021 12:43:09 - INFO - __main__ - Step 110052: {'lr': 8.462825105192135e-05, 'samples': 21129984, 'steps': 110051, 'loss/train': 1.4330623149871826} 11/07/2021 12:43:09 - INFO - __main__ - Step 110053: {'lr': 8.462427126104507e-05, 'samples': 21130176, 'steps': 110052, 'loss/train': 1.5885183811187744} 11/07/2021 12:43:10 - INFO - __main__ - Step 110054: {'lr': 8.462029154468454e-05, 'samples': 21130368, 'steps': 110053, 'loss/train': 1.3562893867492676} 11/07/2021 12:43:11 - INFO - __main__ - Step 110055: {'lr': 8.461631190284156e-05, 'samples': 21130560, 'steps': 110054, 'loss/train': 1.4071067571640015} 11/07/2021 12:43:11 - INFO - __main__ - Step 110056: {'lr': 8.46123323355179e-05, 'samples': 21130752, 'steps': 110055, 'loss/train': 1.1911897659301758} 11/07/2021 12:43:11 - INFO - __main__ - Step 110057: {'lr': 8.46083528427154e-05, 'samples': 21130944, 'steps': 110056, 'loss/train': 0.8914830088615417} 11/07/2021 12:43:12 - INFO - __main__ - Step 110058: {'lr': 8.46043734244358e-05, 'samples': 21131136, 'steps': 110057, 'loss/train': 1.251608967781067} 11/07/2021 12:43:12 - INFO - __main__ - Step 110059: {'lr': 8.460039408068093e-05, 'samples': 21131328, 'steps': 110058, 'loss/train': 0.8062607049942017} 11/07/2021 12:43:13 - INFO - __main__ - Step 110060: {'lr': 8.459641481145255e-05, 'samples': 21131520, 'steps': 110059, 'loss/train': 1.150093674659729} 11/07/2021 12:43:14 - INFO - __main__ - Step 110061: {'lr': 8.45924356167525e-05, 'samples': 21131712, 'steps': 110060, 'loss/train': 0.8107708692550659} 11/07/2021 12:43:14 - INFO - __main__ - Step 110062: {'lr': 8.458845649658253e-05, 'samples': 21131904, 'steps': 110061, 'loss/train': 1.0894643068313599} 11/07/2021 12:43:14 - INFO - __main__ - Step 110063: {'lr': 8.458447745094446e-05, 'samples': 21132096, 'steps': 110062, 'loss/train': 1.7837131023406982} 11/07/2021 12:43:15 - INFO - __main__ - Step 110064: {'lr': 8.458049847984015e-05, 'samples': 21132288, 'steps': 110063, 'loss/train': 1.8193334341049194} 11/07/2021 12:43:16 - INFO - __main__ - Step 110065: {'lr': 8.457651958327122e-05, 'samples': 21132480, 'steps': 110064, 'loss/train': 1.265326976776123} 11/07/2021 12:43:16 - INFO - __main__ - Step 110066: {'lr': 8.457254076123957e-05, 'samples': 21132672, 'steps': 110065, 'loss/train': 1.4597727060317993} 11/07/2021 12:43:16 - INFO - __main__ - Step 110067: {'lr': 8.456856201374699e-05, 'samples': 21132864, 'steps': 110066, 'loss/train': 1.4691100120544434} 11/07/2021 12:43:17 - INFO - __main__ - Step 110068: {'lr': 8.456458334079525e-05, 'samples': 21133056, 'steps': 110067, 'loss/train': 1.2630470991134644} 11/07/2021 12:43:17 - INFO - __main__ - Step 110069: {'lr': 8.456060474238616e-05, 'samples': 21133248, 'steps': 110068, 'loss/train': 1.0478858947753906} 11/07/2021 12:43:18 - INFO - __main__ - Step 110070: {'lr': 8.45566262185215e-05, 'samples': 21133440, 'steps': 110069, 'loss/train': 1.038630723953247} 11/07/2021 12:43:18 - INFO - __main__ - Step 110071: {'lr': 8.45526477692031e-05, 'samples': 21133632, 'steps': 110070, 'loss/train': 1.329419732093811} 11/07/2021 12:43:19 - INFO - __main__ - Step 110072: {'lr': 8.45486693944327e-05, 'samples': 21133824, 'steps': 110071, 'loss/train': 1.4621930122375488} 11/07/2021 12:43:19 - INFO - __main__ - Step 110073: {'lr': 8.454469109421211e-05, 'samples': 21134016, 'steps': 110072, 'loss/train': 1.281948208808899} 11/07/2021 12:43:19 - INFO - __main__ - Step 110074: {'lr': 8.454071286854314e-05, 'samples': 21134208, 'steps': 110073, 'loss/train': 1.151221752166748} 11/07/2021 12:43:21 - INFO - __main__ - Step 110075: {'lr': 8.453673471742757e-05, 'samples': 21134400, 'steps': 110074, 'loss/train': 1.3049874305725098} 11/07/2021 12:43:21 - INFO - __main__ - Step 110076: {'lr': 8.453275664086719e-05, 'samples': 21134592, 'steps': 110075, 'loss/train': 1.4682831764221191} 11/07/2021 12:43:21 - INFO - __main__ - Step 110077: {'lr': 8.452877863886388e-05, 'samples': 21134784, 'steps': 110076, 'loss/train': 1.5305914878845215} 11/07/2021 12:43:22 - INFO - __main__ - Step 110078: {'lr': 8.452480071141927e-05, 'samples': 21134976, 'steps': 110077, 'loss/train': 0.8035637736320496} 11/07/2021 12:43:22 - INFO - __main__ - Step 110079: {'lr': 8.452082285853524e-05, 'samples': 21135168, 'steps': 110078, 'loss/train': 1.0796576738357544} 11/07/2021 12:43:22 - INFO - __main__ - Step 110080: {'lr': 8.451684508021355e-05, 'samples': 21135360, 'steps': 110079, 'loss/train': 0.9980447292327881} 11/07/2021 12:43:23 - INFO - __main__ - Step 110081: {'lr': 8.451286737645603e-05, 'samples': 21135552, 'steps': 110080, 'loss/train': 1.3970369100570679} 11/07/2021 12:43:24 - INFO - __main__ - Step 110082: {'lr': 8.450888974726446e-05, 'samples': 21135744, 'steps': 110081, 'loss/train': 0.07692388445138931} 11/07/2021 12:43:24 - INFO - __main__ - Step 110083: {'lr': 8.450491219264061e-05, 'samples': 21135936, 'steps': 110082, 'loss/train': 1.217166781425476} 11/07/2021 12:43:24 - INFO - __main__ - Step 110084: {'lr': 8.450093471258632e-05, 'samples': 21136128, 'steps': 110083, 'loss/train': 1.5018614530563354} 11/07/2021 12:43:25 - INFO - __main__ - Step 110085: {'lr': 8.449695730710335e-05, 'samples': 21136320, 'steps': 110084, 'loss/train': 1.1248328685760498} 11/07/2021 12:43:26 - INFO - __main__ - Step 110086: {'lr': 8.449297997619351e-05, 'samples': 21136512, 'steps': 110085, 'loss/train': 0.8431835174560547} 11/07/2021 12:43:26 - INFO - __main__ - Step 110087: {'lr': 8.448900271985854e-05, 'samples': 21136704, 'steps': 110086, 'loss/train': 1.037136197090149} 11/07/2021 12:43:27 - INFO - __main__ - Step 110088: {'lr': 8.448502553810031e-05, 'samples': 21136896, 'steps': 110087, 'loss/train': 1.472050428390503} 11/07/2021 12:43:27 - INFO - __main__ - Step 110089: {'lr': 8.448104843092055e-05, 'samples': 21137088, 'steps': 110088, 'loss/train': 1.4148074388504028} 11/07/2021 12:43:27 - INFO - __main__ - Step 110090: {'lr': 8.447707139832109e-05, 'samples': 21137280, 'steps': 110089, 'loss/train': 1.2272720336914062} 11/07/2021 12:43:29 - INFO - __main__ - Step 110091: {'lr': 8.447309444030379e-05, 'samples': 21137472, 'steps': 110090, 'loss/train': 0.06235019117593765} 11/07/2021 12:43:29 - INFO - __main__ - Step 110092: {'lr': 8.446911755687025e-05, 'samples': 21137664, 'steps': 110091, 'loss/train': 1.3667229413986206} 11/07/2021 12:43:29 - INFO - __main__ - Step 110093: {'lr': 8.44651407480224e-05, 'samples': 21137856, 'steps': 110092, 'loss/train': 1.397566556930542} 11/07/2021 12:43:30 - INFO - __main__ - Step 110094: {'lr': 8.4461164013762e-05, 'samples': 21138048, 'steps': 110093, 'loss/train': 0.6949518322944641} 11/07/2021 12:43:30 - INFO - __main__ - Step 110095: {'lr': 8.445718735409083e-05, 'samples': 21138240, 'steps': 110094, 'loss/train': 1.145603895187378} 11/07/2021 12:43:31 - INFO - __main__ - Step 110096: {'lr': 8.445321076901072e-05, 'samples': 21138432, 'steps': 110095, 'loss/train': 0.76636803150177} 11/07/2021 12:43:32 - INFO - __main__ - Step 110097: {'lr': 8.44492342585234e-05, 'samples': 21138624, 'steps': 110096, 'loss/train': 1.5889803171157837} 11/07/2021 12:43:32 - INFO - __main__ - Step 110098: {'lr': 8.444525782263074e-05, 'samples': 21138816, 'steps': 110097, 'loss/train': 1.3643181324005127} 11/07/2021 12:43:32 - INFO - __main__ - Step 110099: {'lr': 8.444128146133448e-05, 'samples': 21139008, 'steps': 110098, 'loss/train': 0.555284321308136} 11/07/2021 12:43:33 - INFO - __main__ - Step 110100: {'lr': 8.443730517463643e-05, 'samples': 21139200, 'steps': 110099, 'loss/train': 0.9763466715812683} 11/07/2021 12:43:33 - INFO - __main__ - Step 110101: {'lr': 8.443332896253836e-05, 'samples': 21139392, 'steps': 110100, 'loss/train': 1.0577747821807861} 11/07/2021 12:43:34 - INFO - __main__ - Step 110102: {'lr': 8.442935282504208e-05, 'samples': 21139584, 'steps': 110101, 'loss/train': 0.08680881559848785} 11/07/2021 12:43:35 - INFO - __main__ - Step 110103: {'lr': 8.44253767621494e-05, 'samples': 21139776, 'steps': 110102, 'loss/train': 1.0594629049301147} 11/07/2021 12:43:35 - INFO - __main__ - Step 110104: {'lr': 8.442140077386215e-05, 'samples': 21139968, 'steps': 110103, 'loss/train': 1.4357731342315674} 11/07/2021 12:43:35 - INFO - __main__ - Step 110105: {'lr': 8.441742486018198e-05, 'samples': 21140160, 'steps': 110104, 'loss/train': 0.8286007046699524} 11/07/2021 12:43:36 - INFO - __main__ - Step 110106: {'lr': 8.441344902111076e-05, 'samples': 21140352, 'steps': 110105, 'loss/train': 1.1790695190429688} 11/07/2021 12:43:37 - INFO - __main__ - Step 110107: {'lr': 8.44094732566503e-05, 'samples': 21140544, 'steps': 110106, 'loss/train': 1.238074779510498} 11/07/2021 12:43:37 - INFO - __main__ - Step 110108: {'lr': 8.440549756680238e-05, 'samples': 21140736, 'steps': 110107, 'loss/train': 0.5873978734016418} 11/07/2021 12:43:38 - INFO - __main__ - Step 110109: {'lr': 8.440152195156878e-05, 'samples': 21140928, 'steps': 110108, 'loss/train': 0.8818144798278809} 11/07/2021 12:43:38 - INFO - __main__ - Step 110110: {'lr': 8.439754641095129e-05, 'samples': 21141120, 'steps': 110109, 'loss/train': 1.328269600868225} 11/07/2021 12:43:38 - INFO - __main__ - Step 110111: {'lr': 8.43935709449517e-05, 'samples': 21141312, 'steps': 110110, 'loss/train': 0.052880048751831055} 11/07/2021 12:43:39 - INFO - __main__ - Step 110112: {'lr': 8.438959555357184e-05, 'samples': 21141504, 'steps': 110111, 'loss/train': 1.5509066581726074} 11/07/2021 12:43:40 - INFO - __main__ - Step 110113: {'lr': 8.438562023681346e-05, 'samples': 21141696, 'steps': 110112, 'loss/train': 1.0512665510177612} 11/07/2021 12:43:40 - INFO - __main__ - Step 110114: {'lr': 8.438164499467834e-05, 'samples': 21141888, 'steps': 110113, 'loss/train': 1.2870677709579468} 11/07/2021 12:43:41 - INFO - __main__ - Step 110115: {'lr': 8.437766982716835e-05, 'samples': 21142080, 'steps': 110114, 'loss/train': 0.6209398508071899} 11/07/2021 12:43:41 - INFO - __main__ - Step 110116: {'lr': 8.437369473428518e-05, 'samples': 21142272, 'steps': 110115, 'loss/train': 1.6452468633651733} 11/07/2021 12:43:41 - INFO - __main__ - Step 110117: {'lr': 8.436971971603069e-05, 'samples': 21142464, 'steps': 110116, 'loss/train': 1.2241551876068115} 11/07/2021 12:43:42 - INFO - __main__ - Step 110118: {'lr': 8.436574477240672e-05, 'samples': 21142656, 'steps': 110117, 'loss/train': 0.0579431913793087} 11/07/2021 12:43:43 - INFO - __main__ - Step 110119: {'lr': 8.436176990341491e-05, 'samples': 21142848, 'steps': 110118, 'loss/train': 1.287915825843811} 11/07/2021 12:43:43 - INFO - __main__ - Step 110120: {'lr': 8.435779510905715e-05, 'samples': 21143040, 'steps': 110119, 'loss/train': 1.253906488418579} 11/07/2021 12:43:44 - INFO - __main__ - Step 110121: {'lr': 8.435382038933517e-05, 'samples': 21143232, 'steps': 110120, 'loss/train': 1.2142372131347656} 11/07/2021 12:43:44 - INFO - __main__ - Step 110122: {'lr': 8.434984574425084e-05, 'samples': 21143424, 'steps': 110121, 'loss/train': 1.9711551666259766} 11/07/2021 12:43:45 - INFO - __main__ - Step 110123: {'lr': 8.434587117380587e-05, 'samples': 21143616, 'steps': 110122, 'loss/train': 0.046566832810640335} 11/07/2021 12:43:45 - INFO - __main__ - Step 110124: {'lr': 8.434189667800212e-05, 'samples': 21143808, 'steps': 110123, 'loss/train': 1.2057970762252808} 11/07/2021 12:43:46 - INFO - __main__ - Step 110125: {'lr': 8.433792225684139e-05, 'samples': 21144000, 'steps': 110124, 'loss/train': 1.4562889337539673} 11/07/2021 12:43:46 - INFO - __main__ - Step 110126: {'lr': 8.43339479103254e-05, 'samples': 21144192, 'steps': 110125, 'loss/train': 1.1787505149841309} 11/07/2021 12:43:46 - INFO - __main__ - Step 110127: {'lr': 8.432997363845599e-05, 'samples': 21144384, 'steps': 110126, 'loss/train': 1.1192547082901} 11/07/2021 12:43:48 - INFO - __main__ - Step 110128: {'lr': 8.432599944123492e-05, 'samples': 21144576, 'steps': 110127, 'loss/train': 1.7346445322036743} 11/07/2021 12:43:48 - INFO - __main__ - Step 110129: {'lr': 8.4322025318664e-05, 'samples': 21144768, 'steps': 110128, 'loss/train': 1.1289008855819702} 11/07/2021 12:43:48 - INFO - __main__ - Step 110130: {'lr': 8.431805127074502e-05, 'samples': 21144960, 'steps': 110129, 'loss/train': 1.6688023805618286} 11/07/2021 12:43:49 - INFO - __main__ - Step 110131: {'lr': 8.431407729747987e-05, 'samples': 21145152, 'steps': 110130, 'loss/train': 0.84247225522995} 11/07/2021 12:43:49 - INFO - __main__ - Step 110132: {'lr': 8.431010339887012e-05, 'samples': 21145344, 'steps': 110131, 'loss/train': 1.2437734603881836} 11/07/2021 12:43:50 - INFO - __main__ - Step 110133: {'lr': 8.43061295749177e-05, 'samples': 21145536, 'steps': 110132, 'loss/train': 1.2802209854125977} 11/07/2021 12:43:50 - INFO - __main__ - Step 110134: {'lr': 8.43021558256244e-05, 'samples': 21145728, 'steps': 110133, 'loss/train': 1.2890620231628418} 11/07/2021 12:43:51 - INFO - __main__ - Step 110135: {'lr': 8.429818215099197e-05, 'samples': 21145920, 'steps': 110134, 'loss/train': 1.1460368633270264} 11/07/2021 12:43:51 - INFO - __main__ - Step 110136: {'lr': 8.429420855102224e-05, 'samples': 21146112, 'steps': 110135, 'loss/train': 1.5149060487747192} 11/07/2021 12:43:51 - INFO - __main__ - Step 110137: {'lr': 8.429023502571698e-05, 'samples': 21146304, 'steps': 110136, 'loss/train': 1.2192778587341309} 11/07/2021 12:43:52 - INFO - __main__ - Step 110138: {'lr': 8.428626157507796e-05, 'samples': 21146496, 'steps': 110137, 'loss/train': 1.1405901908874512} 11/07/2021 12:43:53 - INFO - __main__ - Step 110139: {'lr': 8.428228819910703e-05, 'samples': 21146688, 'steps': 110138, 'loss/train': 1.4637434482574463} 11/07/2021 12:43:53 - INFO - __main__ - Step 110140: {'lr': 8.42783148978059e-05, 'samples': 21146880, 'steps': 110139, 'loss/train': 1.4432337284088135} 11/07/2021 12:43:53 - INFO - __main__ - Step 110141: {'lr': 8.427434167117646e-05, 'samples': 21147072, 'steps': 110140, 'loss/train': 1.3234835863113403} 11/07/2021 12:43:54 - INFO - __main__ - Step 110142: {'lr': 8.42703685192204e-05, 'samples': 21147264, 'steps': 110141, 'loss/train': 1.6523808240890503} 11/07/2021 12:43:55 - INFO - __main__ - Step 110143: {'lr': 8.426639544193957e-05, 'samples': 21147456, 'steps': 110142, 'loss/train': 1.509628176689148} 11/07/2021 12:43:55 - INFO - __main__ - Step 110144: {'lr': 8.426242243933582e-05, 'samples': 21147648, 'steps': 110143, 'loss/train': 1.1680924892425537} 11/07/2021 12:43:56 - INFO - __main__ - Step 110145: {'lr': 8.425844951141079e-05, 'samples': 21147840, 'steps': 110144, 'loss/train': 1.1663566827774048} 11/07/2021 12:43:56 - INFO - __main__ - Step 110146: {'lr': 8.425447665816633e-05, 'samples': 21148032, 'steps': 110145, 'loss/train': 0.9582146406173706} 11/07/2021 12:43:56 - INFO - __main__ - Step 110147: {'lr': 8.425050387960426e-05, 'samples': 21148224, 'steps': 110146, 'loss/train': 1.2396758794784546} 11/07/2021 12:43:57 - INFO - __main__ - Step 110148: {'lr': 8.424653117572633e-05, 'samples': 21148416, 'steps': 110147, 'loss/train': 1.2688225507736206} 11/07/2021 12:43:58 - INFO - __main__ - Step 110149: {'lr': 8.424255854653439e-05, 'samples': 21148608, 'steps': 110148, 'loss/train': 1.0508899688720703} 11/07/2021 12:43:58 - INFO - __main__ - Step 110150: {'lr': 8.423858599203018e-05, 'samples': 21148800, 'steps': 110149, 'loss/train': 0.5833397507667542} 11/07/2021 12:43:58 - INFO - __main__ - Step 110151: {'lr': 8.423461351221551e-05, 'samples': 21148992, 'steps': 110150, 'loss/train': 1.4928463697433472} 11/07/2021 12:43:59 - INFO - __main__ - Step 110152: {'lr': 8.423064110709216e-05, 'samples': 21149184, 'steps': 110151, 'loss/train': 0.5198938846588135} 11/07/2021 12:43:59 - INFO - __main__ - Step 110153: {'lr': 8.422666877666194e-05, 'samples': 21149376, 'steps': 110152, 'loss/train': 1.329728364944458} 11/07/2021 12:44:00 - INFO - __main__ - Step 110154: {'lr': 8.42226965209266e-05, 'samples': 21149568, 'steps': 110153, 'loss/train': 0.49565622210502625} 11/07/2021 12:44:01 - INFO - __main__ - Step 110155: {'lr': 8.421872433988798e-05, 'samples': 21149760, 'steps': 110154, 'loss/train': 1.8083323240280151} 11/07/2021 12:44:01 - INFO - __main__ - Step 110156: {'lr': 8.421475223354782e-05, 'samples': 21149952, 'steps': 110155, 'loss/train': 1.2320866584777832} 11/07/2021 12:44:01 - INFO - __main__ - Step 110157: {'lr': 8.421078020190794e-05, 'samples': 21150144, 'steps': 110156, 'loss/train': 1.2658547163009644} 11/07/2021 12:44:02 - INFO - __main__ - Step 110158: {'lr': 8.420680824497023e-05, 'samples': 21150336, 'steps': 110157, 'loss/train': 1.055605411529541} 11/07/2021 12:44:03 - INFO - __main__ - Step 110159: {'lr': 8.420283636273626e-05, 'samples': 21150528, 'steps': 110158, 'loss/train': 0.9815608859062195} 11/07/2021 12:44:03 - INFO - __main__ - Step 110160: {'lr': 8.419886455520795e-05, 'samples': 21150720, 'steps': 110159, 'loss/train': 1.1947245597839355} 11/07/2021 12:44:04 - INFO - __main__ - Step 110161: {'lr': 8.419489282238707e-05, 'samples': 21150912, 'steps': 110160, 'loss/train': 1.0520042181015015} 11/07/2021 12:44:04 - INFO - __main__ - Step 110162: {'lr': 8.419092116427542e-05, 'samples': 21151104, 'steps': 110161, 'loss/train': 1.849714994430542} 11/07/2021 12:44:04 - INFO - __main__ - Step 110163: {'lr': 8.418694958087477e-05, 'samples': 21151296, 'steps': 110162, 'loss/train': 1.3254117965698242} 11/07/2021 12:44:05 - INFO - __main__ - Step 110164: {'lr': 8.418297807218695e-05, 'samples': 21151488, 'steps': 110163, 'loss/train': 1.1684409379959106} 11/07/2021 12:44:06 - INFO - __main__ - Step 110165: {'lr': 8.417900663821368e-05, 'samples': 21151680, 'steps': 110164, 'loss/train': 0.9383488297462463} 11/07/2021 12:44:06 - INFO - __main__ - Step 110166: {'lr': 8.417503527895681e-05, 'samples': 21151872, 'steps': 110165, 'loss/train': 1.0981500148773193} 11/07/2021 12:44:06 - INFO - __main__ - Step 110167: {'lr': 8.417106399441813e-05, 'samples': 21152064, 'steps': 110166, 'loss/train': 1.5380655527114868} 11/07/2021 12:44:07 - INFO - __main__ - Step 110168: {'lr': 8.416709278459939e-05, 'samples': 21152256, 'steps': 110167, 'loss/train': 1.8112796545028687} 11/07/2021 12:44:08 - INFO - __main__ - Step 110169: {'lr': 8.416312164950246e-05, 'samples': 21152448, 'steps': 110168, 'loss/train': 1.3841685056686401} 11/07/2021 12:44:08 - INFO - __main__ - Step 110170: {'lr': 8.415915058912901e-05, 'samples': 21152640, 'steps': 110169, 'loss/train': 1.3221290111541748} 11/07/2021 12:44:08 - INFO - __main__ - Step 110171: {'lr': 8.41551796034809e-05, 'samples': 21152832, 'steps': 110170, 'loss/train': 0.7169828414916992} 11/07/2021 12:44:09 - INFO - __main__ - Step 110172: {'lr': 8.415120869255987e-05, 'samples': 21153024, 'steps': 110171, 'loss/train': 0.9850237965583801} 11/07/2021 12:44:09 - INFO - __main__ - Step 110173: {'lr': 8.41472378563678e-05, 'samples': 21153216, 'steps': 110172, 'loss/train': 1.481419324874878} 11/07/2021 12:44:10 - INFO - __main__ - Step 110174: {'lr': 8.414326709490638e-05, 'samples': 21153408, 'steps': 110173, 'loss/train': 1.4445512294769287} 11/07/2021 12:44:11 - INFO - __main__ - Step 110175: {'lr': 8.413929640817746e-05, 'samples': 21153600, 'steps': 110174, 'loss/train': 1.4286235570907593} 11/07/2021 12:44:11 - INFO - __main__ - Step 110176: {'lr': 8.41353257961828e-05, 'samples': 21153792, 'steps': 110175, 'loss/train': 1.969601035118103} 11/07/2021 12:44:11 - INFO - __main__ - Step 110177: {'lr': 8.413135525892423e-05, 'samples': 21153984, 'steps': 110176, 'loss/train': 0.9277823567390442} 11/07/2021 12:44:12 - INFO - __main__ - Step 110178: {'lr': 8.41273847964035e-05, 'samples': 21154176, 'steps': 110177, 'loss/train': 0.9763568043708801} 11/07/2021 12:44:12 - INFO - __main__ - Step 110179: {'lr': 8.412341440862239e-05, 'samples': 21154368, 'steps': 110178, 'loss/train': 1.2201944589614868} 11/07/2021 12:44:13 - INFO - __main__ - Step 110180: {'lr': 8.41194440955828e-05, 'samples': 21154560, 'steps': 110179, 'loss/train': 1.3771417140960693} 11/07/2021 12:44:13 - INFO - __main__ - Step 110181: {'lr': 8.411547385728638e-05, 'samples': 21154752, 'steps': 110180, 'loss/train': 1.1708898544311523} 11/07/2021 12:44:14 - INFO - __main__ - Step 110182: {'lr': 8.411150369373494e-05, 'samples': 21154944, 'steps': 110181, 'loss/train': 1.1850582361221313} 11/07/2021 12:44:14 - INFO - __main__ - Step 110183: {'lr': 8.410753360493032e-05, 'samples': 21155136, 'steps': 110182, 'loss/train': 1.3710962533950806} 11/07/2021 12:44:14 - INFO - __main__ - Step 110184: {'lr': 8.410356359087424e-05, 'samples': 21155328, 'steps': 110183, 'loss/train': 1.5157437324523926} 11/07/2021 12:44:15 - INFO - __main__ - Step 110185: {'lr': 8.409959365156857e-05, 'samples': 21155520, 'steps': 110184, 'loss/train': 1.1341543197631836} 11/07/2021 12:44:16 - INFO - __main__ - Step 110186: {'lr': 8.409562378701504e-05, 'samples': 21155712, 'steps': 110185, 'loss/train': 1.0916818380355835} 11/07/2021 12:44:16 - INFO - __main__ - Step 110187: {'lr': 8.409165399721549e-05, 'samples': 21155904, 'steps': 110186, 'loss/train': 0.03853055089712143} 11/07/2021 12:44:17 - INFO - __main__ - Step 110188: {'lr': 8.408768428217167e-05, 'samples': 21156096, 'steps': 110187, 'loss/train': 1.145095944404602} 11/07/2021 12:44:17 - INFO - __main__ - Step 110189: {'lr': 8.408371464188536e-05, 'samples': 21156288, 'steps': 110188, 'loss/train': 1.1853455305099487} 11/07/2021 12:44:18 - INFO - __main__ - Step 110190: {'lr': 8.407974507635838e-05, 'samples': 21156480, 'steps': 110189, 'loss/train': 1.4074989557266235} 11/07/2021 12:44:18 - INFO - __main__ - Step 110191: {'lr': 8.40757755855926e-05, 'samples': 21156672, 'steps': 110190, 'loss/train': 0.9447341561317444} 11/07/2021 12:44:19 - INFO - __main__ - Step 110192: {'lr': 8.407180616958962e-05, 'samples': 21156864, 'steps': 110191, 'loss/train': 1.4810479879379272} 11/07/2021 12:44:19 - INFO - __main__ - Step 110193: {'lr': 8.406783682835134e-05, 'samples': 21157056, 'steps': 110192, 'loss/train': 1.1091222763061523} 11/07/2021 12:44:19 - INFO - __main__ - Step 110194: {'lr': 8.40638675618795e-05, 'samples': 21157248, 'steps': 110193, 'loss/train': 1.1153877973556519} 11/07/2021 12:44:20 - INFO - __main__ - Step 110195: {'lr': 8.405989837017597e-05, 'samples': 21157440, 'steps': 110194, 'loss/train': 1.362699270248413} 11/07/2021 12:44:21 - INFO - __main__ - Step 110196: {'lr': 8.405592925324246e-05, 'samples': 21157632, 'steps': 110195, 'loss/train': 1.655337929725647} 11/07/2021 12:44:21 - INFO - __main__ - Step 110197: {'lr': 8.405196021108077e-05, 'samples': 21157824, 'steps': 110196, 'loss/train': 1.6280068159103394} 11/07/2021 12:44:21 - INFO - __main__ - Step 110198: {'lr': 8.404799124369272e-05, 'samples': 21158016, 'steps': 110197, 'loss/train': 1.386885643005371} 11/07/2021 12:44:22 - INFO - __main__ - Step 110199: {'lr': 8.40440223510801e-05, 'samples': 21158208, 'steps': 110198, 'loss/train': 1.0860679149627686} 11/07/2021 12:44:23 - INFO - __main__ - Step 110200: {'lr': 8.404005353324468e-05, 'samples': 21158400, 'steps': 110199, 'loss/train': 1.702516794204712} 11/07/2021 12:44:23 - INFO - __main__ - Step 110201: {'lr': 8.403608479018832e-05, 'samples': 21158592, 'steps': 110200, 'loss/train': 1.4964879751205444} 11/07/2021 12:44:23 - INFO - __main__ - Step 110202: {'lr': 8.403211612191266e-05, 'samples': 21158784, 'steps': 110201, 'loss/train': 1.2576981782913208} 11/07/2021 12:44:24 - INFO - __main__ - Step 110203: {'lr': 8.402814752841956e-05, 'samples': 21158976, 'steps': 110202, 'loss/train': 1.8419394493103027} 11/07/2021 12:44:24 - INFO - __main__ - Step 110204: {'lr': 8.402417900971082e-05, 'samples': 21159168, 'steps': 110203, 'loss/train': 1.4961349964141846} 11/07/2021 12:44:25 - INFO - __main__ - Step 110205: {'lr': 8.402021056578823e-05, 'samples': 21159360, 'steps': 110204, 'loss/train': 1.198204755783081} 11/07/2021 12:44:26 - INFO - __main__ - Step 110206: {'lr': 8.401624219665358e-05, 'samples': 21159552, 'steps': 110205, 'loss/train': 1.2226669788360596} 11/07/2021 12:44:26 - INFO - __main__ - Step 110207: {'lr': 8.401227390230864e-05, 'samples': 21159744, 'steps': 110206, 'loss/train': 1.0375310182571411} 11/07/2021 12:44:26 - INFO - __main__ - Step 110208: {'lr': 8.400830568275519e-05, 'samples': 21159936, 'steps': 110207, 'loss/train': 1.265438199043274} 11/07/2021 12:44:27 - INFO - __main__ - Step 110209: {'lr': 8.400433753799508e-05, 'samples': 21160128, 'steps': 110208, 'loss/train': 1.620546817779541} 11/07/2021 12:44:27 - INFO - __main__ - Step 110210: {'lr': 8.400036946803002e-05, 'samples': 21160320, 'steps': 110209, 'loss/train': 1.2713998556137085} 11/07/2021 12:44:28 - INFO - __main__ - Step 110211: {'lr': 8.399640147286183e-05, 'samples': 21160512, 'steps': 110210, 'loss/train': 1.4336072206497192} 11/07/2021 12:44:29 - INFO - __main__ - Step 110212: {'lr': 8.399243355249239e-05, 'samples': 21160704, 'steps': 110211, 'loss/train': 0.876369833946228} 11/07/2021 12:44:29 - INFO - __main__ - Step 110213: {'lr': 8.398846570692334e-05, 'samples': 21160896, 'steps': 110212, 'loss/train': 1.5038539171218872} 11/07/2021 12:44:29 - INFO - __main__ - Step 110214: {'lr': 8.39844979361565e-05, 'samples': 21161088, 'steps': 110213, 'loss/train': 1.2196367979049683} 11/07/2021 12:44:30 - INFO - __main__ - Step 110215: {'lr': 8.398053024019366e-05, 'samples': 21161280, 'steps': 110214, 'loss/train': 1.4575557708740234} 11/07/2021 12:44:31 - INFO - __main__ - Step 110216: {'lr': 8.397656261903668e-05, 'samples': 21161472, 'steps': 110215, 'loss/train': 1.0137664079666138} 11/07/2021 12:44:31 - INFO - __main__ - Step 110217: {'lr': 8.397259507268728e-05, 'samples': 21161664, 'steps': 110216, 'loss/train': 0.7111397385597229} 11/07/2021 12:44:31 - INFO - __main__ - Step 110218: {'lr': 8.396862760114726e-05, 'samples': 21161856, 'steps': 110217, 'loss/train': 1.4258450269699097} 11/07/2021 12:44:32 - INFO - __main__ - Step 110219: {'lr': 8.396466020441842e-05, 'samples': 21162048, 'steps': 110218, 'loss/train': 1.4167696237564087} 11/07/2021 12:44:32 - INFO - __main__ - Step 110220: {'lr': 8.396069288250254e-05, 'samples': 21162240, 'steps': 110219, 'loss/train': 0.5898609161376953} 11/07/2021 12:44:33 - INFO - __main__ - Step 110221: {'lr': 8.395672563540141e-05, 'samples': 21162432, 'steps': 110220, 'loss/train': 1.4392139911651611} 11/07/2021 12:44:33 - INFO - __main__ - Step 110222: {'lr': 8.395275846311681e-05, 'samples': 21162624, 'steps': 110221, 'loss/train': 1.783064603805542} 11/07/2021 12:44:34 - INFO - __main__ - Step 110223: {'lr': 8.394879136565053e-05, 'samples': 21162816, 'steps': 110222, 'loss/train': 1.145729660987854} 11/07/2021 12:44:34 - INFO - __main__ - Step 110224: {'lr': 8.39448243430044e-05, 'samples': 21163008, 'steps': 110223, 'loss/train': 0.9924625158309937} 11/07/2021 12:44:34 - INFO - __main__ - Step 110225: {'lr': 8.394085739518023e-05, 'samples': 21163200, 'steps': 110224, 'loss/train': 1.1109038591384888} 11/07/2021 12:44:36 - INFO - __main__ - Step 110226: {'lr': 8.393689052217964e-05, 'samples': 21163392, 'steps': 110225, 'loss/train': 1.315563440322876} 11/07/2021 12:44:36 - INFO - __main__ - Step 110227: {'lr': 8.393292372400455e-05, 'samples': 21163584, 'steps': 110226, 'loss/train': 1.5094650983810425} 11/07/2021 12:44:36 - INFO - __main__ - Step 110228: {'lr': 8.392895700065673e-05, 'samples': 21163776, 'steps': 110227, 'loss/train': 1.4984805583953857} 11/07/2021 12:44:37 - INFO - __main__ - Step 110229: {'lr': 8.392499035213794e-05, 'samples': 21163968, 'steps': 110228, 'loss/train': 1.0160024166107178} 11/07/2021 12:44:37 - INFO - __main__ - Step 110230: {'lr': 8.392102377844998e-05, 'samples': 21164160, 'steps': 110229, 'loss/train': 1.117851734161377} 11/07/2021 12:44:38 - INFO - __main__ - Step 110231: {'lr': 8.391705727959467e-05, 'samples': 21164352, 'steps': 110230, 'loss/train': 0.9977421164512634} 11/07/2021 12:44:38 - INFO - __main__ - Step 110232: {'lr': 8.391309085557375e-05, 'samples': 21164544, 'steps': 110231, 'loss/train': 1.2839192152023315} 11/07/2021 12:44:39 - INFO - __main__ - Step 110233: {'lr': 8.390912450638904e-05, 'samples': 21164736, 'steps': 110232, 'loss/train': 1.290453553199768} 11/07/2021 12:44:39 - INFO - __main__ - Step 110234: {'lr': 8.390515823204231e-05, 'samples': 21164928, 'steps': 110233, 'loss/train': 1.2003415822982788} 11/07/2021 12:44:39 - INFO - __main__ - Step 110235: {'lr': 8.390119203253538e-05, 'samples': 21165120, 'steps': 110234, 'loss/train': 1.2008657455444336} 11/07/2021 12:44:40 - INFO - __main__ - Step 110236: {'lr': 8.389722590786997e-05, 'samples': 21165312, 'steps': 110235, 'loss/train': 1.2749770879745483} 11/07/2021 12:44:41 - INFO - __main__ - Step 110237: {'lr': 8.389325985804794e-05, 'samples': 21165504, 'steps': 110236, 'loss/train': 1.0364019870758057} 11/07/2021 12:44:41 - INFO - __main__ - Step 110238: {'lr': 8.3889293883071e-05, 'samples': 21165696, 'steps': 110237, 'loss/train': 0.967374861240387} 11/07/2021 12:44:42 - INFO - __main__ - Step 110239: {'lr': 8.38853279829411e-05, 'samples': 21165888, 'steps': 110238, 'loss/train': 0.8201249837875366} 11/07/2021 12:44:42 - INFO - __main__ - Step 110240: {'lr': 8.388136215765982e-05, 'samples': 21166080, 'steps': 110239, 'loss/train': 1.1772730350494385} 11/07/2021 12:44:42 - INFO - __main__ - Step 110241: {'lr': 8.387739640722902e-05, 'samples': 21166272, 'steps': 110240, 'loss/train': 1.5844632387161255} 11/07/2021 12:44:43 - INFO - __main__ - Step 110242: {'lr': 8.387343073165052e-05, 'samples': 21166464, 'steps': 110241, 'loss/train': 1.10725998878479} 11/07/2021 12:44:44 - INFO - __main__ - Step 110243: {'lr': 8.386946513092608e-05, 'samples': 21166656, 'steps': 110242, 'loss/train': 1.0491951704025269} 11/07/2021 12:44:44 - INFO - __main__ - Step 110244: {'lr': 8.386549960505749e-05, 'samples': 21166848, 'steps': 110243, 'loss/train': 1.3798755407333374} 11/07/2021 12:44:44 - INFO - __main__ - Step 110245: {'lr': 8.386153415404654e-05, 'samples': 21167040, 'steps': 110244, 'loss/train': 0.7077001929283142} 11/07/2021 12:44:45 - INFO - __main__ - Step 110246: {'lr': 8.385756877789504e-05, 'samples': 21167232, 'steps': 110245, 'loss/train': 1.5979068279266357} 11/07/2021 12:44:46 - INFO - __main__ - Step 110247: {'lr': 8.385360347660476e-05, 'samples': 21167424, 'steps': 110246, 'loss/train': 1.4689818620681763} 11/07/2021 12:44:46 - INFO - __main__ - Step 110248: {'lr': 8.384963825017744e-05, 'samples': 21167616, 'steps': 110247, 'loss/train': 0.7930727005004883} 11/07/2021 12:44:46 - INFO - __main__ - Step 110249: {'lr': 8.384567309861494e-05, 'samples': 21167808, 'steps': 110248, 'loss/train': 1.2660194635391235} 11/07/2021 12:44:47 - INFO - __main__ - Step 110250: {'lr': 8.3841708021919e-05, 'samples': 21168000, 'steps': 110249, 'loss/train': 1.532701015472412} 11/07/2021 12:44:47 - INFO - __main__ - Step 110251: {'lr': 8.383774302009145e-05, 'samples': 21168192, 'steps': 110250, 'loss/train': 1.1230562925338745} 11/07/2021 12:44:48 - INFO - __main__ - Step 110252: {'lr': 8.383377809313411e-05, 'samples': 21168384, 'steps': 110251, 'loss/train': 1.5070250034332275} 11/07/2021 12:44:49 - INFO - __main__ - Step 110253: {'lr': 8.382981324104863e-05, 'samples': 21168576, 'steps': 110252, 'loss/train': 1.1055632829666138} 11/07/2021 12:44:49 - INFO - __main__ - Step 110254: {'lr': 8.382584846383687e-05, 'samples': 21168768, 'steps': 110253, 'loss/train': 1.345012903213501} 11/07/2021 12:44:49 - INFO - __main__ - Step 110255: {'lr': 8.38218837615006e-05, 'samples': 21168960, 'steps': 110254, 'loss/train': 1.0613932609558105} 11/07/2021 12:44:50 - INFO - __main__ - Step 110256: {'lr': 8.381791913404166e-05, 'samples': 21169152, 'steps': 110255, 'loss/train': 1.2191119194030762} 11/07/2021 12:44:51 - INFO - __main__ - Step 110257: {'lr': 8.381395458146179e-05, 'samples': 21169344, 'steps': 110256, 'loss/train': 1.3739354610443115} 11/07/2021 12:44:51 - INFO - __main__ - Step 110258: {'lr': 8.380999010376278e-05, 'samples': 21169536, 'steps': 110257, 'loss/train': 1.463364601135254} 11/07/2021 12:44:51 - INFO - __main__ - Step 110259: {'lr': 8.380602570094642e-05, 'samples': 21169728, 'steps': 110258, 'loss/train': 1.152427077293396} 11/07/2021 12:44:52 - INFO - __main__ - Step 110260: {'lr': 8.38020613730145e-05, 'samples': 21169920, 'steps': 110259, 'loss/train': 1.3384760618209839} 11/07/2021 12:44:52 - INFO - __main__ - Step 110261: {'lr': 8.37980971199688e-05, 'samples': 21170112, 'steps': 110260, 'loss/train': 1.1207562685012817} 11/07/2021 12:44:53 - INFO - __main__ - Step 110262: {'lr': 8.379413294181116e-05, 'samples': 21170304, 'steps': 110261, 'loss/train': 1.6224541664123535} 11/07/2021 12:44:54 - INFO - __main__ - Step 110263: {'lr': 8.379016883854327e-05, 'samples': 21170496, 'steps': 110262, 'loss/train': 1.2777564525604248} 11/07/2021 12:44:54 - INFO - __main__ - Step 110264: {'lr': 8.378620481016697e-05, 'samples': 21170688, 'steps': 110263, 'loss/train': 1.308493971824646} 11/07/2021 12:44:54 - INFO - __main__ - Step 110265: {'lr': 8.378224085668412e-05, 'samples': 21170880, 'steps': 110264, 'loss/train': 1.4337259531021118} 11/07/2021 12:44:55 - INFO - __main__ - Step 110266: {'lr': 8.377827697809637e-05, 'samples': 21171072, 'steps': 110265, 'loss/train': 1.0598442554473877} 11/07/2021 12:44:55 - INFO - __main__ - Step 110267: {'lr': 8.377431317440556e-05, 'samples': 21171264, 'steps': 110266, 'loss/train': 5.6817545890808105} 11/07/2021 12:44:56 - INFO - __main__ - Step 110268: {'lr': 8.377034944561346e-05, 'samples': 21171456, 'steps': 110267, 'loss/train': 1.0972579717636108} 11/07/2021 12:44:56 - INFO - __main__ - Step 110269: {'lr': 8.376638579172189e-05, 'samples': 21171648, 'steps': 110268, 'loss/train': 1.3498581647872925} 11/07/2021 12:44:57 - INFO - __main__ - Step 110270: {'lr': 8.376242221273262e-05, 'samples': 21171840, 'steps': 110269, 'loss/train': 1.7048555612564087} 11/07/2021 12:44:57 - INFO - __main__ - Step 110271: {'lr': 8.375845870864743e-05, 'samples': 21172032, 'steps': 110270, 'loss/train': 1.2869364023208618} 11/07/2021 12:44:58 - INFO - __main__ - Step 110272: {'lr': 8.375449527946812e-05, 'samples': 21172224, 'steps': 110271, 'loss/train': 1.493484377861023} 11/07/2021 12:44:58 - INFO - __main__ - Step 110273: {'lr': 8.375053192519647e-05, 'samples': 21172416, 'steps': 110272, 'loss/train': 1.905726432800293} 11/07/2021 12:44:59 - INFO - __main__ - Step 110274: {'lr': 8.374656864583427e-05, 'samples': 21172608, 'steps': 110273, 'loss/train': 1.6574686765670776} 11/07/2021 12:44:59 - INFO - __main__ - Step 110275: {'lr': 8.374260544138329e-05, 'samples': 21172800, 'steps': 110274, 'loss/train': 1.3897427320480347} 11/07/2021 12:45:00 - INFO - __main__ - Step 110276: {'lr': 8.373864231184533e-05, 'samples': 21172992, 'steps': 110275, 'loss/train': 1.5171540975570679} 11/07/2021 12:45:00 - INFO - __main__ - Step 110277: {'lr': 8.373467925722217e-05, 'samples': 21173184, 'steps': 110276, 'loss/train': 1.7052695751190186} 11/07/2021 12:45:01 - INFO - __main__ - Step 110278: {'lr': 8.373071627751561e-05, 'samples': 21173376, 'steps': 110277, 'loss/train': 1.311681866645813} 11/07/2021 12:45:01 - INFO - __main__ - Step 110279: {'lr': 8.372675337272747e-05, 'samples': 21173568, 'steps': 110278, 'loss/train': 1.3174935579299927} 11/07/2021 12:45:02 - INFO - __main__ - Step 110280: {'lr': 8.372279054285944e-05, 'samples': 21173760, 'steps': 110279, 'loss/train': 1.1181085109710693} 11/07/2021 12:45:02 - INFO - __main__ - Step 110281: {'lr': 8.371882778791334e-05, 'samples': 21173952, 'steps': 110280, 'loss/train': 1.3194255828857422} 11/07/2021 12:45:02 - INFO - __main__ - Step 110282: {'lr': 8.371486510789097e-05, 'samples': 21174144, 'steps': 110281, 'loss/train': 0.8396094441413879} 11/07/2021 12:45:03 - INFO - __main__ - Step 110283: {'lr': 8.371090250279415e-05, 'samples': 21174336, 'steps': 110282, 'loss/train': 1.3182169198989868} 11/07/2021 12:45:04 - INFO - __main__ - Step 110284: {'lr': 8.37069399726246e-05, 'samples': 21174528, 'steps': 110283, 'loss/train': 1.682065725326538} 11/07/2021 12:45:04 - INFO - __main__ - Step 110285: {'lr': 8.370297751738416e-05, 'samples': 21174720, 'steps': 110284, 'loss/train': 1.1217864751815796} 11/07/2021 12:45:04 - INFO - __main__ - Step 110286: {'lr': 8.369901513707456e-05, 'samples': 21174912, 'steps': 110285, 'loss/train': 1.3667373657226562} 11/07/2021 12:45:05 - INFO - __main__ - Step 110287: {'lr': 8.369505283169763e-05, 'samples': 21175104, 'steps': 110286, 'loss/train': 1.2912418842315674} 11/07/2021 12:45:05 - INFO - __main__ - Step 110288: {'lr': 8.369109060125513e-05, 'samples': 21175296, 'steps': 110287, 'loss/train': 1.4457956552505493} 11/07/2021 12:45:06 - INFO - __main__ - Step 110289: {'lr': 8.368712844574888e-05, 'samples': 21175488, 'steps': 110288, 'loss/train': 1.5273181200027466} 11/07/2021 12:45:07 - INFO - __main__ - Step 110290: {'lr': 8.368316636518064e-05, 'samples': 21175680, 'steps': 110289, 'loss/train': 1.582413911819458} 11/07/2021 12:45:07 - INFO - __main__ - Step 110291: {'lr': 8.367920435955217e-05, 'samples': 21175872, 'steps': 110290, 'loss/train': 0.6687083840370178} 11/07/2021 12:45:07 - INFO - __main__ - Step 110292: {'lr': 8.36752424288654e-05, 'samples': 21176064, 'steps': 110291, 'loss/train': 0.8781284093856812} 11/07/2021 12:45:08 - INFO - __main__ - Step 110293: {'lr': 8.367128057312192e-05, 'samples': 21176256, 'steps': 110292, 'loss/train': 1.4556028842926025} 11/07/2021 12:45:09 - INFO - __main__ - Step 110294: {'lr': 8.366731879232359e-05, 'samples': 21176448, 'steps': 110293, 'loss/train': 1.2103627920150757} 11/07/2021 12:45:09 - INFO - __main__ - Step 110295: {'lr': 8.366335708647218e-05, 'samples': 21176640, 'steps': 110294, 'loss/train': 1.713140845298767} 11/07/2021 12:45:09 - INFO - __main__ - Step 110296: {'lr': 8.36593954555695e-05, 'samples': 21176832, 'steps': 110295, 'loss/train': 0.13709713518619537} 11/07/2021 12:45:10 - INFO - __main__ - Step 110297: {'lr': 8.365543389961735e-05, 'samples': 21177024, 'steps': 110296, 'loss/train': 1.649994134902954} 11/07/2021 12:45:10 - INFO - __main__ - Step 110298: {'lr': 8.365147241861748e-05, 'samples': 21177216, 'steps': 110297, 'loss/train': 1.3158749341964722} 11/07/2021 12:45:11 - INFO - __main__ - Step 110299: {'lr': 8.364751101257167e-05, 'samples': 21177408, 'steps': 110298, 'loss/train': 1.3479268550872803} 11/07/2021 12:45:12 - INFO - __main__ - Step 110300: {'lr': 8.364354968148177e-05, 'samples': 21177600, 'steps': 110299, 'loss/train': 1.081790804862976} 11/07/2021 12:45:12 - INFO - __main__ - Step 110301: {'lr': 8.363958842534949e-05, 'samples': 21177792, 'steps': 110300, 'loss/train': 1.5916328430175781} 11/07/2021 12:45:12 - INFO - __main__ - Step 110302: {'lr': 8.363562724417664e-05, 'samples': 21177984, 'steps': 110301, 'loss/train': 1.6322970390319824} 11/07/2021 12:45:13 - INFO - __main__ - Step 110303: {'lr': 8.363166613796502e-05, 'samples': 21178176, 'steps': 110302, 'loss/train': 1.310332179069519} 11/07/2021 12:45:14 - INFO - __main__ - Step 110304: {'lr': 8.362770510671641e-05, 'samples': 21178368, 'steps': 110303, 'loss/train': 1.6396560668945312} 11/07/2021 12:45:14 - INFO - __main__ - Step 110305: {'lr': 8.362374415043258e-05, 'samples': 21178560, 'steps': 110304, 'loss/train': 1.0700336694717407} 11/07/2021 12:45:14 - INFO - __main__ - Step 110306: {'lr': 8.361978326911541e-05, 'samples': 21178752, 'steps': 110305, 'loss/train': 1.5104929208755493} 11/07/2021 12:45:15 - INFO - __main__ - Step 110307: {'lr': 8.36158224627665e-05, 'samples': 21178944, 'steps': 110306, 'loss/train': 1.195690393447876} 11/07/2021 12:45:15 - INFO - __main__ - Step 110308: {'lr': 8.361186173138774e-05, 'samples': 21179136, 'steps': 110307, 'loss/train': 1.8324611186981201} 11/07/2021 12:45:16 - INFO - __main__ - Step 110309: {'lr': 8.360790107498092e-05, 'samples': 21179328, 'steps': 110308, 'loss/train': 1.3535950183868408} 11/07/2021 12:45:17 - INFO - __main__ - Step 110310: {'lr': 8.36039404935478e-05, 'samples': 21179520, 'steps': 110309, 'loss/train': 1.6856472492218018} 11/07/2021 12:45:17 - INFO - __main__ - Step 110311: {'lr': 8.359997998709019e-05, 'samples': 21179712, 'steps': 110310, 'loss/train': 0.8365227580070496} 11/07/2021 12:45:17 - INFO - __main__ - Step 110312: {'lr': 8.359601955560986e-05, 'samples': 21179904, 'steps': 110311, 'loss/train': 1.0794011354446411} 11/07/2021 12:45:18 - INFO - __main__ - Step 110313: {'lr': 8.359205919910858e-05, 'samples': 21180096, 'steps': 110312, 'loss/train': 1.6463251113891602} 11/07/2021 12:45:18 - INFO - __main__ - Step 110314: {'lr': 8.358809891758815e-05, 'samples': 21180288, 'steps': 110313, 'loss/train': 1.2709426879882812} 11/07/2021 12:45:19 - INFO - __main__ - Step 110315: {'lr': 8.358413871105037e-05, 'samples': 21180480, 'steps': 110314, 'loss/train': 1.3945839405059814} 11/07/2021 12:45:19 - INFO - __main__ - Step 110316: {'lr': 8.358017857949701e-05, 'samples': 21180672, 'steps': 110315, 'loss/train': 1.507439136505127} 11/07/2021 12:45:20 - INFO - __main__ - Step 110317: {'lr': 8.357621852292984e-05, 'samples': 21180864, 'steps': 110316, 'loss/train': 0.8934213519096375} 11/07/2021 12:45:20 - INFO - __main__ - Step 110318: {'lr': 8.357225854135067e-05, 'samples': 21181056, 'steps': 110317, 'loss/train': 1.7538604736328125} 11/07/2021 12:45:20 - INFO - __main__ - Step 110319: {'lr': 8.356829863476134e-05, 'samples': 21181248, 'steps': 110318, 'loss/train': 1.4599498510360718} 11/07/2021 12:45:22 - INFO - __main__ - Step 110320: {'lr': 8.35643388031635e-05, 'samples': 21181440, 'steps': 110319, 'loss/train': 1.7608333826065063} 11/07/2021 12:45:22 - INFO - __main__ - Step 110321: {'lr': 8.3560379046559e-05, 'samples': 21181632, 'steps': 110320, 'loss/train': 0.9670586585998535} 11/07/2021 12:45:22 - INFO - __main__ - Step 110322: {'lr': 8.355641936494962e-05, 'samples': 21181824, 'steps': 110321, 'loss/train': 1.3353321552276611} 11/07/2021 12:45:23 - INFO - __main__ - Step 110323: {'lr': 8.355245975833714e-05, 'samples': 21182016, 'steps': 110322, 'loss/train': 0.9376024007797241} 11/07/2021 12:45:23 - INFO - __main__ - Step 110324: {'lr': 8.354850022672336e-05, 'samples': 21182208, 'steps': 110323, 'loss/train': 1.1628438234329224} 11/07/2021 12:45:24 - INFO - __main__ - Step 110325: {'lr': 8.354454077011006e-05, 'samples': 21182400, 'steps': 110324, 'loss/train': 0.7060209512710571} 11/07/2021 12:45:24 - INFO - __main__ - Step 110326: {'lr': 8.354058138849902e-05, 'samples': 21182592, 'steps': 110325, 'loss/train': 1.5237045288085938} 11/07/2021 12:45:25 - INFO - __main__ - Step 110327: {'lr': 8.353662208189203e-05, 'samples': 21182784, 'steps': 110326, 'loss/train': 1.059633493423462} 11/07/2021 12:45:25 - INFO - __main__ - Step 110328: {'lr': 8.353266285029085e-05, 'samples': 21182976, 'steps': 110327, 'loss/train': 1.3928890228271484} 11/07/2021 12:45:25 - INFO - __main__ - Step 110329: {'lr': 8.352870369369731e-05, 'samples': 21183168, 'steps': 110328, 'loss/train': 1.4638038873672485} 11/07/2021 12:45:26 - INFO - __main__ - Step 110330: {'lr': 8.352474461211315e-05, 'samples': 21183360, 'steps': 110329, 'loss/train': 1.2380766868591309} 11/07/2021 12:45:27 - INFO - __main__ - Step 110331: {'lr': 8.352078560554019e-05, 'samples': 21183552, 'steps': 110330, 'loss/train': 1.6711512804031372} 11/07/2021 12:45:27 - INFO - __main__ - Step 110332: {'lr': 8.351682667398017e-05, 'samples': 21183744, 'steps': 110331, 'loss/train': 1.3326361179351807} 11/07/2021 12:45:28 - INFO - __main__ - Step 110333: {'lr': 8.3512867817435e-05, 'samples': 21183936, 'steps': 110332, 'loss/train': 1.4001576900482178} 11/07/2021 12:45:28 - INFO - __main__ - Step 110334: {'lr': 8.350890903590627e-05, 'samples': 21184128, 'steps': 110333, 'loss/train': 2.6775736808776855} 11/07/2021 12:45:28 - INFO - __main__ - Step 110335: {'lr': 8.350495032939587e-05, 'samples': 21184320, 'steps': 110334, 'loss/train': 0.8642292618751526} 11/07/2021 12:45:29 - INFO - __main__ - Step 110336: {'lr': 8.350099169790554e-05, 'samples': 21184512, 'steps': 110335, 'loss/train': 1.2359791994094849} 11/07/2021 12:45:30 - INFO - __main__ - Step 110337: {'lr': 8.349703314143711e-05, 'samples': 21184704, 'steps': 110336, 'loss/train': 1.5550228357315063} 11/07/2021 12:45:30 - INFO - __main__ - Step 110338: {'lr': 8.349307465999236e-05, 'samples': 21184896, 'steps': 110337, 'loss/train': 0.635684609413147} 11/07/2021 12:45:30 - INFO - __main__ - Step 110339: {'lr': 8.348911625357305e-05, 'samples': 21185088, 'steps': 110338, 'loss/train': 1.136888861656189} 11/07/2021 12:45:31 - INFO - __main__ - Step 110340: {'lr': 8.348515792218098e-05, 'samples': 21185280, 'steps': 110339, 'loss/train': 1.2116453647613525} 11/07/2021 12:45:32 - INFO - __main__ - Step 110341: {'lr': 8.348119966581793e-05, 'samples': 21185472, 'steps': 110340, 'loss/train': 1.3906054496765137} 11/07/2021 12:45:32 - INFO - __main__ - Step 110342: {'lr': 8.347724148448569e-05, 'samples': 21185664, 'steps': 110341, 'loss/train': 1.0987275838851929} 11/07/2021 12:45:32 - INFO - __main__ - Step 110343: {'lr': 8.3473283378186e-05, 'samples': 21185856, 'steps': 110342, 'loss/train': 1.1250864267349243} 11/07/2021 12:45:33 - INFO - __main__ - Step 110344: {'lr': 8.346932534692069e-05, 'samples': 21186048, 'steps': 110343, 'loss/train': 0.11655717343091965} 11/07/2021 12:45:33 - INFO - __main__ - Step 110345: {'lr': 8.346536739069155e-05, 'samples': 21186240, 'steps': 110344, 'loss/train': 1.5852957963943481} 11/07/2021 12:45:34 - INFO - __main__ - Step 110346: {'lr': 8.346140950950043e-05, 'samples': 21186432, 'steps': 110345, 'loss/train': 0.7196362018585205} 11/07/2021 12:45:35 - INFO - __main__ - Step 110347: {'lr': 8.345745170334896e-05, 'samples': 21186624, 'steps': 110346, 'loss/train': 1.5627856254577637} 11/07/2021 12:45:35 - INFO - __main__ - Step 110348: {'lr': 8.345349397223894e-05, 'samples': 21186816, 'steps': 110347, 'loss/train': 1.6240429878234863} 11/07/2021 12:45:35 - INFO - __main__ - Step 110349: {'lr': 8.344953631617225e-05, 'samples': 21187008, 'steps': 110348, 'loss/train': 1.2778735160827637} 11/07/2021 12:45:36 - INFO - __main__ - Step 110350: {'lr': 8.344557873515063e-05, 'samples': 21187200, 'steps': 110349, 'loss/train': 1.4543812274932861} 11/07/2021 12:45:37 - INFO - __main__ - Step 110351: {'lr': 8.344162122917584e-05, 'samples': 21187392, 'steps': 110350, 'loss/train': 1.371073603630066} 11/07/2021 12:45:37 - INFO - __main__ - Step 110352: {'lr': 8.343766379824969e-05, 'samples': 21187584, 'steps': 110351, 'loss/train': 1.6810907125473022} 11/07/2021 12:45:38 - INFO - __main__ - Step 110353: {'lr': 8.343370644237397e-05, 'samples': 21187776, 'steps': 110352, 'loss/train': 1.3902004957199097} 11/07/2021 12:45:38 - INFO - __main__ - Step 110354: {'lr': 8.342974916155044e-05, 'samples': 21187968, 'steps': 110353, 'loss/train': 0.9895471334457397} 11/07/2021 12:45:38 - INFO - __main__ - Step 110355: {'lr': 8.34257919557809e-05, 'samples': 21188160, 'steps': 110354, 'loss/train': 1.3460921049118042} 11/07/2021 12:45:39 - INFO - __main__ - Step 110356: {'lr': 8.342183482506713e-05, 'samples': 21188352, 'steps': 110355, 'loss/train': 0.8014458417892456} 11/07/2021 12:45:40 - INFO - __main__ - Step 110357: {'lr': 8.34178777694109e-05, 'samples': 21188544, 'steps': 110356, 'loss/train': 1.4877111911773682} 11/07/2021 12:45:40 - INFO - __main__ - Step 110358: {'lr': 8.3413920788814e-05, 'samples': 21188736, 'steps': 110357, 'loss/train': 1.353651523590088} 11/07/2021 12:45:40 - INFO - __main__ - Step 110359: {'lr': 8.340996388327823e-05, 'samples': 21188928, 'steps': 110358, 'loss/train': 1.0683854818344116} 11/07/2021 12:45:41 - INFO - __main__ - Step 110360: {'lr': 8.340600705280544e-05, 'samples': 21189120, 'steps': 110359, 'loss/train': 1.2656480073928833} 11/07/2021 12:45:42 - INFO - __main__ - Step 110361: {'lr': 8.340205029739725e-05, 'samples': 21189312, 'steps': 110360, 'loss/train': 1.2734559774398804} 11/07/2021 12:45:42 - INFO - __main__ - Step 110362: {'lr': 8.339809361705553e-05, 'samples': 21189504, 'steps': 110361, 'loss/train': 1.2481772899627686} 11/07/2021 12:45:42 - INFO - __main__ - Step 110363: {'lr': 8.339413701178206e-05, 'samples': 21189696, 'steps': 110362, 'loss/train': 1.188209056854248} 11/07/2021 12:45:43 - INFO - __main__ - Step 110364: {'lr': 8.339018048157859e-05, 'samples': 21189888, 'steps': 110363, 'loss/train': 1.3392086029052734} 11/07/2021 12:45:43 - INFO - __main__ - Step 110365: {'lr': 8.338622402644696e-05, 'samples': 21190080, 'steps': 110364, 'loss/train': 1.025466799736023} 11/07/2021 12:45:45 - INFO - __main__ - Step 110366: {'lr': 8.338226764638892e-05, 'samples': 21190272, 'steps': 110365, 'loss/train': 0.912063479423523} 11/07/2021 12:45:45 - INFO - __main__ - Step 110367: {'lr': 8.337831134140628e-05, 'samples': 21190464, 'steps': 110366, 'loss/train': 1.523315191268921} 11/07/2021 12:45:45 - INFO - __main__ - Step 110368: {'lr': 8.337435511150077e-05, 'samples': 21190656, 'steps': 110367, 'loss/train': 0.770602822303772} 11/07/2021 12:45:46 - INFO - __main__ - Step 110369: {'lr': 8.337039895667423e-05, 'samples': 21190848, 'steps': 110368, 'loss/train': 1.0783518552780151} 11/07/2021 12:45:46 - INFO - __main__ - Step 110370: {'lr': 8.336644287692841e-05, 'samples': 21191040, 'steps': 110369, 'loss/train': 1.5102839469909668} 11/07/2021 12:45:46 - INFO - __main__ - Step 110371: {'lr': 8.336248687226508e-05, 'samples': 21191232, 'steps': 110370, 'loss/train': 1.2271690368652344} 11/07/2021 12:45:48 - INFO - __main__ - Step 110372: {'lr': 8.335853094268605e-05, 'samples': 21191424, 'steps': 110371, 'loss/train': 0.34982770681381226} 11/07/2021 12:45:48 - INFO - __main__ - Step 110373: {'lr': 8.335457508819319e-05, 'samples': 21191616, 'steps': 110372, 'loss/train': 1.7188764810562134} 11/07/2021 12:45:48 - INFO - __main__ - Step 110374: {'lr': 8.33506193087881e-05, 'samples': 21191808, 'steps': 110373, 'loss/train': 1.3137942552566528} 11/07/2021 12:45:49 - INFO - __main__ - Step 110375: {'lr': 8.334666360447265e-05, 'samples': 21192000, 'steps': 110374, 'loss/train': 0.67342609167099} 11/07/2021 12:45:49 - INFO - __main__ - Step 110376: {'lr': 8.334270797524862e-05, 'samples': 21192192, 'steps': 110375, 'loss/train': 1.3623697757720947} 11/07/2021 12:45:50 - INFO - __main__ - Step 110377: {'lr': 8.33387524211178e-05, 'samples': 21192384, 'steps': 110376, 'loss/train': 1.2157307863235474} 11/07/2021 12:45:50 - INFO - __main__ - Step 110378: {'lr': 8.333479694208196e-05, 'samples': 21192576, 'steps': 110377, 'loss/train': 1.6820051670074463} 11/07/2021 12:45:51 - INFO - __main__ - Step 110379: {'lr': 8.333084153814288e-05, 'samples': 21192768, 'steps': 110378, 'loss/train': 1.209717869758606} 11/07/2021 12:45:51 - INFO - __main__ - Step 110380: {'lr': 8.332688620930237e-05, 'samples': 21192960, 'steps': 110379, 'loss/train': 1.3500772714614868} 11/07/2021 12:45:51 - INFO - __main__ - Step 110381: {'lr': 8.332293095556218e-05, 'samples': 21193152, 'steps': 110380, 'loss/train': 1.1480263471603394} 11/07/2021 12:45:52 - INFO - __main__ - Step 110382: {'lr': 8.33189757769241e-05, 'samples': 21193344, 'steps': 110381, 'loss/train': 1.8098210096359253} 11/07/2021 12:45:53 - INFO - __main__ - Step 110383: {'lr': 8.331502067338995e-05, 'samples': 21193536, 'steps': 110382, 'loss/train': 0.886736273765564} 11/07/2021 12:45:53 - INFO - __main__ - Step 110384: {'lr': 8.331106564496143e-05, 'samples': 21193728, 'steps': 110383, 'loss/train': 0.42525482177734375} 11/07/2021 12:45:53 - INFO - __main__ - Step 110385: {'lr': 8.33071106916404e-05, 'samples': 21193920, 'steps': 110384, 'loss/train': 1.3818817138671875} 11/07/2021 12:45:54 - INFO - __main__ - Step 110386: {'lr': 8.33031558134287e-05, 'samples': 21194112, 'steps': 110385, 'loss/train': 1.4447896480560303} 11/07/2021 12:45:55 - INFO - __main__ - Step 110387: {'lr': 8.329920101032795e-05, 'samples': 21194304, 'steps': 110386, 'loss/train': 0.9426982402801514} 11/07/2021 12:45:55 - INFO - __main__ - Step 110388: {'lr': 8.329524628233998e-05, 'samples': 21194496, 'steps': 110387, 'loss/train': 1.7062453031539917} 11/07/2021 12:45:56 - INFO - __main__ - Step 110389: {'lr': 8.329129162946661e-05, 'samples': 21194688, 'steps': 110388, 'loss/train': 1.5237003564834595} 11/07/2021 12:45:56 - INFO - __main__ - Step 110390: {'lr': 8.328733705170963e-05, 'samples': 21194880, 'steps': 110389, 'loss/train': 1.3955179452896118} 11/07/2021 12:45:56 - INFO - __main__ - Step 110391: {'lr': 8.328338254907079e-05, 'samples': 21195072, 'steps': 110390, 'loss/train': 1.7853823900222778} 11/07/2021 12:45:57 - INFO - __main__ - Step 110392: {'lr': 8.327942812155187e-05, 'samples': 21195264, 'steps': 110391, 'loss/train': 1.5028547048568726} 11/07/2021 12:45:58 - INFO - __main__ - Step 110393: {'lr': 8.32754737691547e-05, 'samples': 21195456, 'steps': 110392, 'loss/train': 1.055674433708191} 11/07/2021 12:45:58 - INFO - __main__ - Step 110394: {'lr': 8.3271519491881e-05, 'samples': 21195648, 'steps': 110393, 'loss/train': 1.7425087690353394} 11/07/2021 12:45:59 - INFO - __main__ - Step 110395: {'lr': 8.326756528973259e-05, 'samples': 21195840, 'steps': 110394, 'loss/train': 1.2210216522216797} 11/07/2021 12:45:59 - INFO - __main__ - Step 110396: {'lr': 8.326361116271125e-05, 'samples': 21196032, 'steps': 110395, 'loss/train': 1.1482770442962646} 11/07/2021 12:45:59 - INFO - __main__ - Step 110397: {'lr': 8.325965711081873e-05, 'samples': 21196224, 'steps': 110396, 'loss/train': 1.3031599521636963} 11/07/2021 12:46:00 - INFO - __main__ - Step 110398: {'lr': 8.325570313405687e-05, 'samples': 21196416, 'steps': 110397, 'loss/train': 1.470884919166565} 11/07/2021 12:46:01 - INFO - __main__ - Step 110399: {'lr': 8.32517492324274e-05, 'samples': 21196608, 'steps': 110398, 'loss/train': 1.2857211828231812} 11/07/2021 12:46:01 - INFO - __main__ - Step 110400: {'lr': 8.32477954059322e-05, 'samples': 21196800, 'steps': 110399, 'loss/train': 1.3149957656860352} 11/07/2021 12:46:01 - INFO - __main__ - Step 110401: {'lr': 8.324384165457289e-05, 'samples': 21196992, 'steps': 110400, 'loss/train': 1.2007925510406494} 11/07/2021 12:46:02 - INFO - __main__ - Step 110402: {'lr': 8.323988797835132e-05, 'samples': 21197184, 'steps': 110401, 'loss/train': 1.4637548923492432} 11/07/2021 12:46:03 - INFO - __main__ - Step 110403: {'lr': 8.323593437726928e-05, 'samples': 21197376, 'steps': 110402, 'loss/train': 1.4455631971359253} 11/07/2021 12:46:03 - INFO - __main__ - Step 110404: {'lr': 8.323198085132857e-05, 'samples': 21197568, 'steps': 110403, 'loss/train': 1.6974941492080688} 11/07/2021 12:46:04 - INFO - __main__ - Step 110405: {'lr': 8.322802740053098e-05, 'samples': 21197760, 'steps': 110404, 'loss/train': 1.1542168855667114} 11/07/2021 12:46:04 - INFO - __main__ - Step 110406: {'lr': 8.322407402487822e-05, 'samples': 21197952, 'steps': 110405, 'loss/train': 2.177446126937866} 11/07/2021 12:46:04 - INFO - __main__ - Step 110407: {'lr': 8.322012072437216e-05, 'samples': 21198144, 'steps': 110406, 'loss/train': 1.1534007787704468} 11/07/2021 12:46:05 - INFO - __main__ - Step 110408: {'lr': 8.32161674990145e-05, 'samples': 21198336, 'steps': 110407, 'loss/train': 1.0487678050994873} 11/07/2021 12:46:06 - INFO - __main__ - Step 110409: {'lr': 8.32122143488071e-05, 'samples': 21198528, 'steps': 110408, 'loss/train': 1.154464602470398} 11/07/2021 12:46:06 - INFO - __main__ - Step 110410: {'lr': 8.320826127375166e-05, 'samples': 21198720, 'steps': 110409, 'loss/train': 1.0932073593139648} 11/07/2021 12:46:06 - INFO - __main__ - Step 110411: {'lr': 8.320430827385004e-05, 'samples': 21198912, 'steps': 110410, 'loss/train': 0.940899670124054} 11/07/2021 12:46:07 - INFO - __main__ - Step 110412: {'lr': 8.320035534910397e-05, 'samples': 21199104, 'steps': 110411, 'loss/train': 1.368837594985962} 11/07/2021 12:46:07 - INFO - __main__ - Step 110413: {'lr': 8.319640249951535e-05, 'samples': 21199296, 'steps': 110412, 'loss/train': 0.6257731318473816} 11/07/2021 12:46:08 - INFO - __main__ - Step 110414: {'lr': 8.319244972508574e-05, 'samples': 21199488, 'steps': 110413, 'loss/train': 0.9692069888114929} 11/07/2021 12:46:09 - INFO - __main__ - Step 110415: {'lr': 8.318849702581707e-05, 'samples': 21199680, 'steps': 110414, 'loss/train': 1.6633418798446655} 11/07/2021 12:46:09 - INFO - __main__ - Step 110416: {'lr': 8.318454440171106e-05, 'samples': 21199872, 'steps': 110415, 'loss/train': 1.5284385681152344} 11/07/2021 12:46:09 - INFO - __main__ - Step 110417: {'lr': 8.318059185276955e-05, 'samples': 21200064, 'steps': 110416, 'loss/train': 1.0809868574142456} 11/07/2021 12:46:10 - INFO - __main__ - Step 110418: {'lr': 8.317663937899425e-05, 'samples': 21200256, 'steps': 110417, 'loss/train': 1.267250418663025} 11/07/2021 12:46:11 - INFO - __main__ - Step 110419: {'lr': 8.317268698038701e-05, 'samples': 21200448, 'steps': 110418, 'loss/train': 1.4475077390670776} 11/07/2021 12:46:11 - INFO - __main__ - Step 110420: {'lr': 8.316873465694957e-05, 'samples': 21200640, 'steps': 110419, 'loss/train': 1.3970305919647217} 11/07/2021 12:46:11 - INFO - __main__ - Step 110421: {'lr': 8.316478240868375e-05, 'samples': 21200832, 'steps': 110420, 'loss/train': 1.150930404663086} 11/07/2021 12:46:12 - INFO - __main__ - Step 110422: {'lr': 8.316083023559129e-05, 'samples': 21201024, 'steps': 110421, 'loss/train': 1.5786257982254028} 11/07/2021 12:46:12 - INFO - __main__ - Step 110423: {'lr': 8.315687813767397e-05, 'samples': 21201216, 'steps': 110422, 'loss/train': 1.3478463888168335} 11/07/2021 12:46:13 - INFO - __main__ - Step 110424: {'lr': 8.315292611493361e-05, 'samples': 21201408, 'steps': 110423, 'loss/train': 0.8417947888374329} 11/07/2021 12:46:13 - INFO - __main__ - Step 110425: {'lr': 8.314897416737197e-05, 'samples': 21201600, 'steps': 110424, 'loss/train': 1.1550267934799194} 11/07/2021 12:46:14 - INFO - __main__ - Step 110426: {'lr': 8.31450222949908e-05, 'samples': 21201792, 'steps': 110425, 'loss/train': 1.3808892965316772} 11/07/2021 12:46:14 - INFO - __main__ - Step 110427: {'lr': 8.314107049779202e-05, 'samples': 21201984, 'steps': 110426, 'loss/train': 1.2833929061889648} 11/07/2021 12:46:14 - INFO - __main__ - Step 110428: {'lr': 8.313711877577718e-05, 'samples': 21202176, 'steps': 110427, 'loss/train': 1.2585479021072388} 11/07/2021 12:46:15 - INFO - __main__ - Step 110429: {'lr': 8.313316712894819e-05, 'samples': 21202368, 'steps': 110428, 'loss/train': 0.5658190250396729} 11/07/2021 12:46:16 - INFO - __main__ - Step 110430: {'lr': 8.312921555730685e-05, 'samples': 21202560, 'steps': 110429, 'loss/train': 1.3181043863296509} 11/07/2021 12:46:16 - INFO - __main__ - Step 110431: {'lr': 8.312526406085489e-05, 'samples': 21202752, 'steps': 110430, 'loss/train': 1.1242406368255615} 11/07/2021 12:46:17 - INFO - __main__ - Step 110432: {'lr': 8.312131263959411e-05, 'samples': 21202944, 'steps': 110431, 'loss/train': 1.2696782350540161} 11/07/2021 12:46:17 - INFO - __main__ - Step 110433: {'lr': 8.311736129352629e-05, 'samples': 21203136, 'steps': 110432, 'loss/train': 1.34514582157135} 11/07/2021 12:46:18 - INFO - __main__ - Step 110434: {'lr': 8.311341002265322e-05, 'samples': 21203328, 'steps': 110433, 'loss/train': 1.2202765941619873} 11/07/2021 12:46:18 - INFO - __main__ - Step 110435: {'lr': 8.310945882697665e-05, 'samples': 21203520, 'steps': 110434, 'loss/train': 1.3278783559799194} 11/07/2021 12:46:19 - INFO - __main__ - Step 110436: {'lr': 8.310550770649841e-05, 'samples': 21203712, 'steps': 110435, 'loss/train': 1.6589758396148682} 11/07/2021 12:46:19 - INFO - __main__ - Step 110437: {'lr': 8.310155666122032e-05, 'samples': 21203904, 'steps': 110436, 'loss/train': 0.6208242774009705} 11/07/2021 12:46:19 - INFO - __main__ - Step 110438: {'lr': 8.309760569114403e-05, 'samples': 21204096, 'steps': 110437, 'loss/train': 1.4278956651687622} 11/07/2021 12:46:20 - INFO - __main__ - Step 110439: {'lr': 8.309365479627138e-05, 'samples': 21204288, 'steps': 110438, 'loss/train': 1.3697333335876465} 11/07/2021 12:46:21 - INFO - __main__ - Step 110440: {'lr': 8.308970397660412e-05, 'samples': 21204480, 'steps': 110439, 'loss/train': 1.264281988143921} 11/07/2021 12:46:21 - INFO - __main__ - Step 110441: {'lr': 8.308575323214409e-05, 'samples': 21204672, 'steps': 110440, 'loss/train': 1.5267568826675415} 11/07/2021 12:46:22 - INFO - __main__ - Step 110442: {'lr': 8.308180256289306e-05, 'samples': 21204864, 'steps': 110441, 'loss/train': 0.9824158549308777} 11/07/2021 12:46:22 - INFO - __main__ - Step 110443: {'lr': 8.307785196885276e-05, 'samples': 21205056, 'steps': 110442, 'loss/train': 1.3495583534240723} 11/07/2021 12:46:22 - INFO - __main__ - Step 110444: {'lr': 8.307390145002503e-05, 'samples': 21205248, 'steps': 110443, 'loss/train': 5.701181888580322} 11/07/2021 12:46:23 - INFO - __main__ - Step 110445: {'lr': 8.30699510064116e-05, 'samples': 21205440, 'steps': 110444, 'loss/train': 1.0914796590805054} 11/07/2021 12:46:24 - INFO - __main__ - Step 110446: {'lr': 8.306600063801428e-05, 'samples': 21205632, 'steps': 110445, 'loss/train': 0.8250386714935303} 11/07/2021 12:46:24 - INFO - __main__ - Step 110447: {'lr': 8.306205034483485e-05, 'samples': 21205824, 'steps': 110446, 'loss/train': 1.3023056983947754} 11/07/2021 12:46:24 - INFO - __main__ - Step 110448: {'lr': 8.305810012687518e-05, 'samples': 21206016, 'steps': 110447, 'loss/train': 1.3856228590011597} 11/07/2021 12:46:25 - INFO - __main__ - Step 110449: {'lr': 8.305414998413685e-05, 'samples': 21206208, 'steps': 110448, 'loss/train': 1.4995532035827637} 11/07/2021 12:46:26 - INFO - __main__ - Step 110450: {'lr': 8.305019991662178e-05, 'samples': 21206400, 'steps': 110449, 'loss/train': 1.1835401058197021} 11/07/2021 12:46:26 - INFO - __main__ - Step 110451: {'lr': 8.304624992433168e-05, 'samples': 21206592, 'steps': 110450, 'loss/train': 0.9365787506103516} 11/07/2021 12:46:26 - INFO - __main__ - Step 110452: {'lr': 8.304230000726837e-05, 'samples': 21206784, 'steps': 110451, 'loss/train': 1.0441653728485107} 11/07/2021 12:46:27 - INFO - __main__ - Step 110453: {'lr': 8.303835016543362e-05, 'samples': 21206976, 'steps': 110452, 'loss/train': 1.4440258741378784} 11/07/2021 12:46:27 - INFO - __main__ - Step 110454: {'lr': 8.30344003988292e-05, 'samples': 21207168, 'steps': 110453, 'loss/train': 1.3772850036621094} 11/07/2021 12:46:28 - INFO - __main__ - Step 110455: {'lr': 8.303045070745693e-05, 'samples': 21207360, 'steps': 110454, 'loss/train': 1.4636521339416504} 11/07/2021 12:46:29 - INFO - __main__ - Step 110456: {'lr': 8.302650109131857e-05, 'samples': 21207552, 'steps': 110455, 'loss/train': 0.679559051990509} 11/07/2021 12:46:29 - INFO - __main__ - Step 110457: {'lr': 8.302255155041586e-05, 'samples': 21207744, 'steps': 110456, 'loss/train': 1.4529777765274048} 11/07/2021 12:46:29 - INFO - __main__ - Step 110458: {'lr': 8.301860208475062e-05, 'samples': 21207936, 'steps': 110457, 'loss/train': 1.6362090110778809} 11/07/2021 12:46:30 - INFO - __main__ - Step 110459: {'lr': 8.301465269432471e-05, 'samples': 21208128, 'steps': 110458, 'loss/train': 1.544660210609436} 11/07/2021 12:46:31 - INFO - __main__ - Step 110460: {'lr': 8.301070337913974e-05, 'samples': 21208320, 'steps': 110459, 'loss/train': 1.3701536655426025} 11/07/2021 12:46:31 - INFO - __main__ - Step 110461: {'lr': 8.300675413919757e-05, 'samples': 21208512, 'steps': 110460, 'loss/train': 1.4873390197753906} 11/07/2021 12:46:31 - INFO - __main__ - Step 110462: {'lr': 8.300280497449997e-05, 'samples': 21208704, 'steps': 110461, 'loss/train': 1.1064295768737793} 11/07/2021 12:46:32 - INFO - __main__ - Step 110463: {'lr': 8.299885588504874e-05, 'samples': 21208896, 'steps': 110462, 'loss/train': 1.4859626293182373} 11/07/2021 12:46:32 - INFO - __main__ - Step 110464: {'lr': 8.299490687084566e-05, 'samples': 21209088, 'steps': 110463, 'loss/train': 1.2691829204559326} 11/07/2021 12:46:32 - INFO - __main__ - Step 110465: {'lr': 8.299095793189249e-05, 'samples': 21209280, 'steps': 110464, 'loss/train': 1.2860043048858643} 11/07/2021 12:46:33 - INFO - __main__ - Step 110466: {'lr': 8.2987009068191e-05, 'samples': 21209472, 'steps': 110465, 'loss/train': 1.4081830978393555} 11/07/2021 12:46:34 - INFO - __main__ - Step 110467: {'lr': 8.2983060279743e-05, 'samples': 21209664, 'steps': 110466, 'loss/train': 1.2165685892105103} 11/07/2021 12:46:34 - INFO - __main__ - Step 110468: {'lr': 8.297911156655025e-05, 'samples': 21209856, 'steps': 110467, 'loss/train': 1.397833228111267} 11/07/2021 12:46:34 - INFO - __main__ - Step 110469: {'lr': 8.297516292861454e-05, 'samples': 21210048, 'steps': 110468, 'loss/train': 0.9338855147361755} 11/07/2021 12:46:35 - INFO - __main__ - Step 110470: {'lr': 8.297121436593771e-05, 'samples': 21210240, 'steps': 110469, 'loss/train': 1.1496014595031738} 11/07/2021 12:46:36 - INFO - __main__ - Step 110471: {'lr': 8.296726587852141e-05, 'samples': 21210432, 'steps': 110470, 'loss/train': 1.2901500463485718} 11/07/2021 12:46:36 - INFO - __main__ - Step 110472: {'lr': 8.296331746636748e-05, 'samples': 21210624, 'steps': 110471, 'loss/train': 1.6021864414215088} 11/07/2021 12:46:37 - INFO - __main__ - Step 110473: {'lr': 8.295936912947772e-05, 'samples': 21210816, 'steps': 110472, 'loss/train': 5.611033916473389} 11/07/2021 12:46:37 - INFO - __main__ - Step 110474: {'lr': 8.295542086785385e-05, 'samples': 21211008, 'steps': 110473, 'loss/train': 1.3335459232330322} 11/07/2021 12:46:37 - INFO - __main__ - Step 110475: {'lr': 8.295147268149772e-05, 'samples': 21211200, 'steps': 110474, 'loss/train': 1.228450059890747} 11/07/2021 12:46:38 - INFO - __main__ - Step 110476: {'lr': 8.294752457041108e-05, 'samples': 21211392, 'steps': 110475, 'loss/train': 1.3997611999511719} 11/07/2021 12:46:39 - INFO - __main__ - Step 110477: {'lr': 8.29435765345957e-05, 'samples': 21211584, 'steps': 110476, 'loss/train': 0.6496056318283081} 11/07/2021 12:46:39 - INFO - __main__ - Step 110478: {'lr': 8.293962857405335e-05, 'samples': 21211776, 'steps': 110477, 'loss/train': 0.9945899844169617} 11/07/2021 12:46:39 - INFO - __main__ - Step 110479: {'lr': 8.293568068878585e-05, 'samples': 21211968, 'steps': 110478, 'loss/train': 1.8478527069091797} 11/07/2021 12:46:40 - INFO - __main__ - Step 110480: {'lr': 8.293173287879493e-05, 'samples': 21212160, 'steps': 110479, 'loss/train': 1.002988338470459} 11/07/2021 12:46:41 - INFO - __main__ - Step 110481: {'lr': 8.292778514408251e-05, 'samples': 21212352, 'steps': 110480, 'loss/train': 1.6699870824813843} 11/07/2021 12:46:41 - INFO - __main__ - Step 110482: {'lr': 8.292383748465013e-05, 'samples': 21212544, 'steps': 110481, 'loss/train': 1.1735416650772095} 11/07/2021 12:46:42 - INFO - __main__ - Step 110483: {'lr': 8.29198899004997e-05, 'samples': 21212736, 'steps': 110482, 'loss/train': 1.357321858406067} 11/07/2021 12:46:42 - INFO - __main__ - Step 110484: {'lr': 8.291594239163299e-05, 'samples': 21212928, 'steps': 110483, 'loss/train': 1.2521206140518188} 11/07/2021 12:46:42 - INFO - __main__ - Step 110485: {'lr': 8.29119949580518e-05, 'samples': 21213120, 'steps': 110484, 'loss/train': 0.33420878648757935} 11/07/2021 12:46:43 - INFO - __main__ - Step 110486: {'lr': 8.290804759975788e-05, 'samples': 21213312, 'steps': 110485, 'loss/train': 1.1692287921905518} 11/07/2021 12:46:44 - INFO - __main__ - Step 110487: {'lr': 8.2904100316753e-05, 'samples': 21213504, 'steps': 110486, 'loss/train': 1.1390957832336426} 11/07/2021 12:46:44 - INFO - __main__ - Step 110488: {'lr': 8.290015310903895e-05, 'samples': 21213696, 'steps': 110487, 'loss/train': 1.6491811275482178} 11/07/2021 12:46:44 - INFO - __main__ - Step 110489: {'lr': 8.289620597661754e-05, 'samples': 21213888, 'steps': 110488, 'loss/train': 1.1585098505020142} 11/07/2021 12:46:45 - INFO - __main__ - Step 110490: {'lr': 8.289225891949051e-05, 'samples': 21214080, 'steps': 110489, 'loss/train': 1.204472541809082} 11/07/2021 12:46:46 - INFO - __main__ - Step 110491: {'lr': 8.288831193765963e-05, 'samples': 21214272, 'steps': 110490, 'loss/train': 1.2234348058700562} 11/07/2021 12:46:46 - INFO - __main__ - Step 110492: {'lr': 8.288436503112673e-05, 'samples': 21214464, 'steps': 110491, 'loss/train': 1.517896056175232} 11/07/2021 12:46:47 - INFO - __main__ - Step 110493: {'lr': 8.288041819989353e-05, 'samples': 21214656, 'steps': 110492, 'loss/train': 1.804652452468872} 11/07/2021 12:46:47 - INFO - __main__ - Step 110494: {'lr': 8.287647144396194e-05, 'samples': 21214848, 'steps': 110493, 'loss/train': 1.4070231914520264} 11/07/2021 12:46:47 - INFO - __main__ - Step 110495: {'lr': 8.287252476333354e-05, 'samples': 21215040, 'steps': 110494, 'loss/train': 1.5877455472946167} 11/07/2021 12:46:48 - INFO - __main__ - Step 110496: {'lr': 8.286857815801019e-05, 'samples': 21215232, 'steps': 110495, 'loss/train': 1.7447035312652588} 11/07/2021 12:46:49 - INFO - __main__ - Step 110497: {'lr': 8.286463162799368e-05, 'samples': 21215424, 'steps': 110496, 'loss/train': 1.578831672668457} 11/07/2021 12:46:49 - INFO - __main__ - Step 110498: {'lr': 8.286068517328579e-05, 'samples': 21215616, 'steps': 110497, 'loss/train': 0.9298644065856934} 11/07/2021 12:46:50 - INFO - __main__ - Step 110499: {'lr': 8.28567387938883e-05, 'samples': 21215808, 'steps': 110498, 'loss/train': 0.8298739790916443} 11/07/2021 12:46:50 - INFO - __main__ - Step 110500: {'lr': 8.285279248980301e-05, 'samples': 21216000, 'steps': 110499, 'loss/train': 1.1181418895721436} 11/07/2021 12:46:50 - INFO - __main__ - Step 110501: {'lr': 8.284884626103165e-05, 'samples': 21216192, 'steps': 110500, 'loss/train': 0.9960819482803345} 11/07/2021 12:46:51 - INFO - __main__ - Step 110502: {'lr': 8.284490010757601e-05, 'samples': 21216384, 'steps': 110501, 'loss/train': 1.1312596797943115} 11/07/2021 12:46:52 - INFO - __main__ - Step 110503: {'lr': 8.284095402943789e-05, 'samples': 21216576, 'steps': 110502, 'loss/train': 1.1917301416397095} 11/07/2021 12:46:52 - INFO - __main__ - Step 110504: {'lr': 8.283700802661906e-05, 'samples': 21216768, 'steps': 110503, 'loss/train': 1.1905730962753296} 11/07/2021 12:46:52 - INFO - __main__ - Step 110505: {'lr': 8.283306209912128e-05, 'samples': 21216960, 'steps': 110504, 'loss/train': 1.5381839275360107} 11/07/2021 12:46:53 - INFO - __main__ - Step 110506: {'lr': 8.282911624694636e-05, 'samples': 21217152, 'steps': 110505, 'loss/train': 1.4196255207061768} 11/07/2021 12:46:53 - INFO - __main__ - Step 110507: {'lr': 8.282517047009614e-05, 'samples': 21217344, 'steps': 110506, 'loss/train': 1.4042335748672485} 11/07/2021 12:46:54 - INFO - __main__ - Step 110508: {'lr': 8.282122476857223e-05, 'samples': 21217536, 'steps': 110507, 'loss/train': 1.007710576057434} 11/07/2021 12:46:55 - INFO - __main__ - Step 110509: {'lr': 8.28172791423765e-05, 'samples': 21217728, 'steps': 110508, 'loss/train': 1.308085322380066} 11/07/2021 12:46:55 - INFO - __main__ - Step 110510: {'lr': 8.281333359151072e-05, 'samples': 21217920, 'steps': 110509, 'loss/train': 1.402611494064331} 11/07/2021 12:46:55 - INFO - __main__ - Step 110511: {'lr': 8.280938811597668e-05, 'samples': 21218112, 'steps': 110510, 'loss/train': 1.0125880241394043} 11/07/2021 12:46:56 - INFO - __main__ - Step 110512: {'lr': 8.280544271577614e-05, 'samples': 21218304, 'steps': 110511, 'loss/train': 1.5269501209259033} 11/07/2021 12:46:57 - INFO - __main__ - Step 110513: {'lr': 8.280149739091088e-05, 'samples': 21218496, 'steps': 110512, 'loss/train': 1.5370351076126099} 11/07/2021 12:46:57 - INFO - __main__ - Step 110514: {'lr': 8.27975521413827e-05, 'samples': 21218688, 'steps': 110513, 'loss/train': 1.629805326461792} 11/07/2021 12:46:57 - INFO - __main__ - Step 110515: {'lr': 8.279360696719338e-05, 'samples': 21218880, 'steps': 110514, 'loss/train': 1.484848976135254} 11/07/2021 12:46:58 - INFO - __main__ - Step 110516: {'lr': 8.278966186834463e-05, 'samples': 21219072, 'steps': 110515, 'loss/train': 0.8565672039985657} 11/07/2021 12:46:58 - INFO - __main__ - Step 110517: {'lr': 8.278571684483832e-05, 'samples': 21219264, 'steps': 110516, 'loss/train': 1.6692839860916138} 11/07/2021 12:46:59 - INFO - __main__ - Step 110518: {'lr': 8.278177189667618e-05, 'samples': 21219456, 'steps': 110517, 'loss/train': 1.5734609365463257} 11/07/2021 12:46:59 - INFO - __main__ - Step 110519: {'lr': 8.277782702386e-05, 'samples': 21219648, 'steps': 110518, 'loss/train': 1.4152357578277588} 11/07/2021 12:47:00 - INFO - __main__ - Step 110520: {'lr': 8.277388222639154e-05, 'samples': 21219840, 'steps': 110519, 'loss/train': 1.0154287815093994} 11/07/2021 12:47:00 - INFO - __main__ - Step 110521: {'lr': 8.276993750427267e-05, 'samples': 21220032, 'steps': 110520, 'loss/train': 1.4152077436447144} 11/07/2021 12:47:00 - INFO - __main__ - Step 110522: {'lr': 8.276599285750499e-05, 'samples': 21220224, 'steps': 110521, 'loss/train': 0.32099437713623047} 11/07/2021 12:47:01 - INFO - __main__ - Step 110523: {'lr': 8.276204828609038e-05, 'samples': 21220416, 'steps': 110522, 'loss/train': 1.1635143756866455} 11/07/2021 12:47:02 - INFO - __main__ - Step 110524: {'lr': 8.275810379003063e-05, 'samples': 21220608, 'steps': 110523, 'loss/train': 0.8826422095298767} 11/07/2021 12:47:02 - INFO - __main__ - Step 110525: {'lr': 8.275415936932748e-05, 'samples': 21220800, 'steps': 110524, 'loss/train': 1.0981906652450562} 11/07/2021 12:47:02 - INFO - __main__ - Step 110526: {'lr': 8.275021502398273e-05, 'samples': 21220992, 'steps': 110525, 'loss/train': 2.084597110748291} 11/07/2021 12:47:03 - INFO - __main__ - Step 110527: {'lr': 8.274627075399816e-05, 'samples': 21221184, 'steps': 110526, 'loss/train': 1.7795987129211426} 11/07/2021 12:47:03 - INFO - __main__ - Step 110528: {'lr': 8.274232655937553e-05, 'samples': 21221376, 'steps': 110527, 'loss/train': 0.9932836890220642} 11/07/2021 12:47:04 - INFO - __main__ - Step 110529: {'lr': 8.273838244011661e-05, 'samples': 21221568, 'steps': 110528, 'loss/train': 1.4554811716079712} 11/07/2021 12:47:05 - INFO - __main__ - Step 110530: {'lr': 8.273443839622321e-05, 'samples': 21221760, 'steps': 110529, 'loss/train': 1.299773097038269} 11/07/2021 12:47:05 - INFO - __main__ - Step 110531: {'lr': 8.273049442769708e-05, 'samples': 21221952, 'steps': 110530, 'loss/train': 1.2525007724761963} 11/07/2021 12:47:05 - INFO - __main__ - Step 110532: {'lr': 8.272655053454004e-05, 'samples': 21222144, 'steps': 110531, 'loss/train': 1.0782182216644287} 11/07/2021 12:47:06 - INFO - __main__ - Step 110533: {'lr': 8.272260671675381e-05, 'samples': 21222336, 'steps': 110532, 'loss/train': 1.4414266347885132} 11/07/2021 12:47:07 - INFO - __main__ - Step 110534: {'lr': 8.271866297434028e-05, 'samples': 21222528, 'steps': 110533, 'loss/train': 1.0048859119415283} 11/07/2021 12:47:07 - INFO - __main__ - Step 110535: {'lr': 8.271471930730107e-05, 'samples': 21222720, 'steps': 110534, 'loss/train': 0.6490646004676819} 11/07/2021 12:47:07 - INFO - __main__ - Step 110536: {'lr': 8.2710775715638e-05, 'samples': 21222912, 'steps': 110535, 'loss/train': 1.3694384098052979} 11/07/2021 12:47:08 - INFO - __main__ - Step 110537: {'lr': 8.270683219935288e-05, 'samples': 21223104, 'steps': 110536, 'loss/train': 0.6895626783370972} 11/07/2021 12:47:08 - INFO - __main__ - Step 110538: {'lr': 8.27028887584475e-05, 'samples': 21223296, 'steps': 110537, 'loss/train': 0.8739347457885742} 11/07/2021 12:47:09 - INFO - __main__ - Step 110539: {'lr': 8.269894539292361e-05, 'samples': 21223488, 'steps': 110538, 'loss/train': 1.2020868062973022} 11/07/2021 12:47:10 - INFO - __main__ - Step 110540: {'lr': 8.269500210278296e-05, 'samples': 21223680, 'steps': 110539, 'loss/train': 1.751380205154419} 11/07/2021 12:47:10 - INFO - __main__ - Step 110541: {'lr': 8.269105888802739e-05, 'samples': 21223872, 'steps': 110540, 'loss/train': 1.002161979675293} 11/07/2021 12:47:10 - INFO - __main__ - Step 110542: {'lr': 8.268711574865864e-05, 'samples': 21224064, 'steps': 110541, 'loss/train': 1.0979886054992676} 11/07/2021 12:47:11 - INFO - __main__ - Step 110543: {'lr': 8.268317268467851e-05, 'samples': 21224256, 'steps': 110542, 'loss/train': 1.1616792678833008} 11/07/2021 12:47:12 - INFO - __main__ - Step 110544: {'lr': 8.267922969608874e-05, 'samples': 21224448, 'steps': 110543, 'loss/train': 1.2397816181182861} 11/07/2021 12:47:12 - INFO - __main__ - Step 110545: {'lr': 8.267528678289113e-05, 'samples': 21224640, 'steps': 110544, 'loss/train': 1.1109213829040527} 11/07/2021 12:47:12 - INFO - __main__ - Step 110546: {'lr': 8.267134394508747e-05, 'samples': 21224832, 'steps': 110545, 'loss/train': 1.0369112491607666} 11/07/2021 12:47:13 - INFO - __main__ - Step 110547: {'lr': 8.266740118267951e-05, 'samples': 21225024, 'steps': 110546, 'loss/train': 1.3372615575790405} 11/07/2021 12:47:13 - INFO - __main__ - Step 110548: {'lr': 8.266345849566912e-05, 'samples': 21225216, 'steps': 110547, 'loss/train': 1.3365970849990845} 11/07/2021 12:47:14 - INFO - __main__ - Step 110549: {'lr': 8.265951588405791e-05, 'samples': 21225408, 'steps': 110548, 'loss/train': 1.4704630374908447} 11/07/2021 12:47:15 - INFO - __main__ - Step 110550: {'lr': 8.265557334784773e-05, 'samples': 21225600, 'steps': 110549, 'loss/train': 1.3873852491378784} 11/07/2021 12:47:15 - INFO - __main__ - Step 110551: {'lr': 8.26516308870404e-05, 'samples': 21225792, 'steps': 110550, 'loss/train': 1.5571285486221313} 11/07/2021 12:47:15 - INFO - __main__ - Step 110552: {'lr': 8.264768850163762e-05, 'samples': 21225984, 'steps': 110551, 'loss/train': 1.3655115365982056} 11/07/2021 12:47:16 - INFO - __main__ - Step 110553: {'lr': 8.264374619164122e-05, 'samples': 21226176, 'steps': 110552, 'loss/train': 1.3682466745376587} 11/07/2021 12:47:16 - INFO - __main__ - Step 110554: {'lr': 8.263980395705298e-05, 'samples': 21226368, 'steps': 110553, 'loss/train': 1.2352575063705444} 11/07/2021 12:47:17 - INFO - __main__ - Step 110555: {'lr': 8.263586179787466e-05, 'samples': 21226560, 'steps': 110554, 'loss/train': 1.2347394227981567} 11/07/2021 12:47:17 - INFO - __main__ - Step 110556: {'lr': 8.263191971410803e-05, 'samples': 21226752, 'steps': 110555, 'loss/train': 1.3757779598236084} 11/07/2021 12:47:18 - INFO - __main__ - Step 110557: {'lr': 8.262797770575489e-05, 'samples': 21226944, 'steps': 110556, 'loss/train': 1.0479975938796997} 11/07/2021 12:47:18 - INFO - __main__ - Step 110558: {'lr': 8.262403577281696e-05, 'samples': 21227136, 'steps': 110557, 'loss/train': 0.7487270832061768} 11/07/2021 12:47:18 - INFO - __main__ - Step 110559: {'lr': 8.262009391529609e-05, 'samples': 21227328, 'steps': 110558, 'loss/train': 0.5311243534088135} 11/07/2021 12:47:20 - INFO - __main__ - Step 110560: {'lr': 8.261615213319403e-05, 'samples': 21227520, 'steps': 110559, 'loss/train': 1.4934788942337036} 11/07/2021 12:47:20 - INFO - __main__ - Step 110561: {'lr': 8.261221042651263e-05, 'samples': 21227712, 'steps': 110560, 'loss/train': 1.2467124462127686} 11/07/2021 12:47:20 - INFO - __main__ - Step 110562: {'lr': 8.260826879525349e-05, 'samples': 21227904, 'steps': 110561, 'loss/train': 0.5401248931884766} 11/07/2021 12:47:21 - INFO - __main__ - Step 110563: {'lr': 8.260432723941846e-05, 'samples': 21228096, 'steps': 110562, 'loss/train': 0.8960866332054138} 11/07/2021 12:47:21 - INFO - __main__ - Step 110564: {'lr': 8.260038575900939e-05, 'samples': 21228288, 'steps': 110563, 'loss/train': 1.1891435384750366} 11/07/2021 12:47:22 - INFO - __main__ - Step 110565: {'lr': 8.259644435402797e-05, 'samples': 21228480, 'steps': 110564, 'loss/train': 1.3707956075668335} 11/07/2021 12:47:22 - INFO - __main__ - Step 110566: {'lr': 8.2592503024476e-05, 'samples': 21228672, 'steps': 110565, 'loss/train': 1.1208312511444092} 11/07/2021 12:47:23 - INFO - __main__ - Step 110567: {'lr': 8.258856177035528e-05, 'samples': 21228864, 'steps': 110566, 'loss/train': 1.2963709831237793} 11/07/2021 12:47:23 - INFO - __main__ - Step 110568: {'lr': 8.258462059166758e-05, 'samples': 21229056, 'steps': 110567, 'loss/train': 1.3252464532852173} 11/07/2021 12:47:23 - INFO - __main__ - Step 110569: {'lr': 8.258067948841463e-05, 'samples': 21229248, 'steps': 110568, 'loss/train': 2.203983783721924} 11/07/2021 12:47:24 - INFO - __main__ - Step 110570: {'lr': 8.257673846059827e-05, 'samples': 21229440, 'steps': 110569, 'loss/train': 1.1362547874450684} 11/07/2021 12:47:25 - INFO - __main__ - Step 110571: {'lr': 8.257279750822025e-05, 'samples': 21229632, 'steps': 110570, 'loss/train': 0.5876405835151672} 11/07/2021 12:47:25 - INFO - __main__ - Step 110572: {'lr': 8.256885663128233e-05, 'samples': 21229824, 'steps': 110571, 'loss/train': 1.7395622730255127} 11/07/2021 12:47:26 - INFO - __main__ - Step 110573: {'lr': 8.25649158297863e-05, 'samples': 21230016, 'steps': 110572, 'loss/train': 1.3243162631988525} 11/07/2021 12:47:26 - INFO - __main__ - Step 110574: {'lr': 8.256097510373395e-05, 'samples': 21230208, 'steps': 110573, 'loss/train': 2.021122694015503} 11/07/2021 12:47:27 - INFO - __main__ - Step 110575: {'lr': 8.255703445312712e-05, 'samples': 21230400, 'steps': 110574, 'loss/train': 1.6894702911376953} 11/07/2021 12:47:27 - INFO - __main__ - Step 110576: {'lr': 8.255309387796742e-05, 'samples': 21230592, 'steps': 110575, 'loss/train': 1.4730147123336792} 11/07/2021 12:47:28 - INFO - __main__ - Step 110577: {'lr': 8.254915337825672e-05, 'samples': 21230784, 'steps': 110576, 'loss/train': 1.1270502805709839} 11/07/2021 12:47:28 - INFO - __main__ - Step 110578: {'lr': 8.254521295399678e-05, 'samples': 21230976, 'steps': 110577, 'loss/train': 1.593934416770935} 11/07/2021 12:47:28 - INFO - __main__ - Step 110579: {'lr': 8.254127260518937e-05, 'samples': 21231168, 'steps': 110578, 'loss/train': 0.6147167682647705} 11/07/2021 12:47:29 - INFO - __main__ - Step 110580: {'lr': 8.25373323318363e-05, 'samples': 21231360, 'steps': 110579, 'loss/train': 1.3140307664871216} 11/07/2021 12:47:30 - INFO - __main__ - Step 110581: {'lr': 8.253339213393931e-05, 'samples': 21231552, 'steps': 110580, 'loss/train': 0.6198866963386536} 11/07/2021 12:47:30 - INFO - __main__ - Step 110582: {'lr': 8.252945201150019e-05, 'samples': 21231744, 'steps': 110581, 'loss/train': 1.2335416078567505} 11/07/2021 12:47:30 - INFO - __main__ - Step 110583: {'lr': 8.252551196452075e-05, 'samples': 21231936, 'steps': 110582, 'loss/train': 1.3451288938522339} 11/07/2021 12:47:31 - INFO - __main__ - Step 110584: {'lr': 8.252157199300266e-05, 'samples': 21232128, 'steps': 110583, 'loss/train': 1.3605070114135742} 11/07/2021 12:47:31 - INFO - __main__ - Step 110585: {'lr': 8.251763209694782e-05, 'samples': 21232320, 'steps': 110584, 'loss/train': 1.8755675554275513} 11/07/2021 12:47:32 - INFO - __main__ - Step 110586: {'lr': 8.251369227635794e-05, 'samples': 21232512, 'steps': 110585, 'loss/train': 0.4436931014060974} 11/07/2021 12:47:33 - INFO - __main__ - Step 110587: {'lr': 8.25097525312348e-05, 'samples': 21232704, 'steps': 110586, 'loss/train': 1.1748918294906616} 11/07/2021 12:47:33 - INFO - __main__ - Step 110588: {'lr': 8.250581286158026e-05, 'samples': 21232896, 'steps': 110587, 'loss/train': 1.3210314512252808} 11/07/2021 12:47:33 - INFO - __main__ - Step 110589: {'lr': 8.250187326739594e-05, 'samples': 21233088, 'steps': 110588, 'loss/train': 1.375685214996338} 11/07/2021 12:47:34 - INFO - __main__ - Step 110590: {'lr': 8.24979337486837e-05, 'samples': 21233280, 'steps': 110589, 'loss/train': 1.2182400226593018} 11/07/2021 12:47:35 - INFO - __main__ - Step 110591: {'lr': 8.24939943054453e-05, 'samples': 21233472, 'steps': 110590, 'loss/train': 1.2992846965789795} 11/07/2021 12:47:35 - INFO - __main__ - Step 110592: {'lr': 8.249005493768253e-05, 'samples': 21233664, 'steps': 110591, 'loss/train': 1.8763258457183838} 11/07/2021 12:47:36 - INFO - __main__ - Step 110593: {'lr': 8.248611564539713e-05, 'samples': 21233856, 'steps': 110592, 'loss/train': 1.0362553596496582} 11/07/2021 12:47:36 - INFO - __main__ - Step 110594: {'lr': 8.248217642859091e-05, 'samples': 21234048, 'steps': 110593, 'loss/train': 1.3349857330322266} 11/07/2021 12:47:36 - INFO - __main__ - Step 110595: {'lr': 8.247823728726563e-05, 'samples': 21234240, 'steps': 110594, 'loss/train': 1.4114094972610474} 11/07/2021 12:47:37 - INFO - __main__ - Step 110596: {'lr': 8.247429822142311e-05, 'samples': 21234432, 'steps': 110595, 'loss/train': 1.549581527709961} 11/07/2021 12:47:38 - INFO - __main__ - Step 110597: {'lr': 8.247035923106505e-05, 'samples': 21234624, 'steps': 110596, 'loss/train': 1.206872820854187} 11/07/2021 12:47:38 - INFO - __main__ - Step 110598: {'lr': 8.246642031619327e-05, 'samples': 21234816, 'steps': 110597, 'loss/train': 0.7379025220870972} 11/07/2021 12:47:38 - INFO - __main__ - Step 110599: {'lr': 8.246248147680954e-05, 'samples': 21235008, 'steps': 110598, 'loss/train': 0.553264319896698} 11/07/2021 12:47:39 - INFO - __main__ - Step 110600: {'lr': 8.245854271291561e-05, 'samples': 21235200, 'steps': 110599, 'loss/train': 1.200520634651184} 11/07/2021 12:47:40 - INFO - __main__ - Step 110601: {'lr': 8.24546040245133e-05, 'samples': 21235392, 'steps': 110600, 'loss/train': 1.0796058177947998} 11/07/2021 12:47:40 - INFO - __main__ - Step 110602: {'lr': 8.245066541160442e-05, 'samples': 21235584, 'steps': 110601, 'loss/train': 1.1699714660644531} 11/07/2021 12:47:41 - INFO - __main__ - Step 110603: {'lr': 8.244672687419064e-05, 'samples': 21235776, 'steps': 110602, 'loss/train': 1.5377284288406372} 11/07/2021 12:47:41 - INFO - __main__ - Step 110604: {'lr': 8.244278841227376e-05, 'samples': 21235968, 'steps': 110603, 'loss/train': 1.3654758930206299} 11/07/2021 12:47:41 - INFO - __main__ - Step 110605: {'lr': 8.243885002585556e-05, 'samples': 21236160, 'steps': 110604, 'loss/train': 0.4833988845348358} 11/07/2021 12:47:42 - INFO - __main__ - Step 110606: {'lr': 8.243491171493783e-05, 'samples': 21236352, 'steps': 110605, 'loss/train': 1.2508193254470825} 11/07/2021 12:47:43 - INFO - __main__ - Step 110607: {'lr': 8.243097347952236e-05, 'samples': 21236544, 'steps': 110606, 'loss/train': 1.4158129692077637} 11/07/2021 12:47:43 - INFO - __main__ - Step 110608: {'lr': 8.24270353196109e-05, 'samples': 21236736, 'steps': 110607, 'loss/train': 1.6220699548721313} 11/07/2021 12:47:43 - INFO - __main__ - Step 110609: {'lr': 8.242309723520522e-05, 'samples': 21236928, 'steps': 110608, 'loss/train': 1.4717552661895752} 11/07/2021 12:47:44 - INFO - __main__ - Step 110610: {'lr': 8.241915922630713e-05, 'samples': 21237120, 'steps': 110609, 'loss/train': 1.1814935207366943} 11/07/2021 12:47:44 - INFO - __main__ - Step 110611: {'lr': 8.241522129291837e-05, 'samples': 21237312, 'steps': 110610, 'loss/train': 1.1762770414352417} 11/07/2021 12:47:45 - INFO - __main__ - Step 110612: {'lr': 8.241128343504073e-05, 'samples': 21237504, 'steps': 110611, 'loss/train': 1.2336806058883667} 11/07/2021 12:47:45 - INFO - __main__ - Step 110613: {'lr': 8.240734565267597e-05, 'samples': 21237696, 'steps': 110612, 'loss/train': 1.2485852241516113} 11/07/2021 12:47:46 - INFO - __main__ - Step 110614: {'lr': 8.240340794582587e-05, 'samples': 21237888, 'steps': 110613, 'loss/train': 1.5219755172729492} 11/07/2021 12:47:46 - INFO - __main__ - Step 110615: {'lr': 8.23994703144923e-05, 'samples': 21238080, 'steps': 110614, 'loss/train': 1.4918179512023926} 11/07/2021 12:47:47 - INFO - __main__ - Step 110616: {'lr': 8.239553275867687e-05, 'samples': 21238272, 'steps': 110615, 'loss/train': 1.353966474533081} 11/07/2021 12:47:48 - INFO - __main__ - Step 110617: {'lr': 8.239159527838142e-05, 'samples': 21238464, 'steps': 110616, 'loss/train': 1.4161224365234375} 11/07/2021 12:47:48 - INFO - __main__ - Step 110618: {'lr': 8.238765787360772e-05, 'samples': 21238656, 'steps': 110617, 'loss/train': 0.9747772216796875} 11/07/2021 12:47:48 - INFO - __main__ - Step 110619: {'lr': 8.238372054435755e-05, 'samples': 21238848, 'steps': 110618, 'loss/train': 1.5910298824310303} 11/07/2021 12:47:49 - INFO - __main__ - Step 110620: {'lr': 8.237978329063269e-05, 'samples': 21239040, 'steps': 110619, 'loss/train': 1.134179711341858} 11/07/2021 12:47:49 - INFO - __main__ - Step 110621: {'lr': 8.237584611243493e-05, 'samples': 21239232, 'steps': 110620, 'loss/train': 1.098825454711914} 11/07/2021 12:47:50 - INFO - __main__ - Step 110622: {'lr': 8.237190900976602e-05, 'samples': 21239424, 'steps': 110621, 'loss/train': 1.3955832719802856} 11/07/2021 12:47:50 - INFO - __main__ - Step 110623: {'lr': 8.236797198262775e-05, 'samples': 21239616, 'steps': 110622, 'loss/train': 1.0751957893371582} 11/07/2021 12:47:51 - INFO - __main__ - Step 110624: {'lr': 8.236403503102185e-05, 'samples': 21239808, 'steps': 110623, 'loss/train': 1.617588758468628} 11/07/2021 12:47:51 - INFO - __main__ - Step 110625: {'lr': 8.236009815495018e-05, 'samples': 21240000, 'steps': 110624, 'loss/train': 1.5924136638641357} 11/07/2021 12:47:51 - INFO - __main__ - Step 110626: {'lr': 8.235616135441443e-05, 'samples': 21240192, 'steps': 110625, 'loss/train': 1.568914771080017} 11/07/2021 12:47:52 - INFO - __main__ - Step 110627: {'lr': 8.235222462941641e-05, 'samples': 21240384, 'steps': 110626, 'loss/train': 1.028577446937561} 11/07/2021 12:47:53 - INFO - __main__ - Step 110628: {'lr': 8.2348287979958e-05, 'samples': 21240576, 'steps': 110627, 'loss/train': 1.270805835723877} 11/07/2021 12:47:53 - INFO - __main__ - Step 110629: {'lr': 8.234435140604074e-05, 'samples': 21240768, 'steps': 110628, 'loss/train': 1.214325189590454} 11/07/2021 12:47:53 - INFO - __main__ - Step 110630: {'lr': 8.234041490766656e-05, 'samples': 21240960, 'steps': 110629, 'loss/train': 1.0459619760513306} 11/07/2021 12:47:54 - INFO - __main__ - Step 110631: {'lr': 8.23364784848372e-05, 'samples': 21241152, 'steps': 110630, 'loss/train': 1.2849316596984863} 11/07/2021 12:47:55 - INFO - __main__ - Step 110632: {'lr': 8.233254213755442e-05, 'samples': 21241344, 'steps': 110631, 'loss/train': 1.4365472793579102} 11/07/2021 12:47:55 - INFO - __main__ - Step 110633: {'lr': 8.232860586582e-05, 'samples': 21241536, 'steps': 110632, 'loss/train': 1.6418211460113525} 11/07/2021 12:47:56 - INFO - __main__ - Step 110634: {'lr': 8.232466966963575e-05, 'samples': 21241728, 'steps': 110633, 'loss/train': 1.1039024591445923} 11/07/2021 12:47:56 - INFO - __main__ - Step 110635: {'lr': 8.232073354900341e-05, 'samples': 21241920, 'steps': 110634, 'loss/train': 1.8946419954299927} 11/07/2021 12:47:56 - INFO - __main__ - Step 110636: {'lr': 8.231679750392473e-05, 'samples': 21242112, 'steps': 110635, 'loss/train': 0.9889124631881714} 11/07/2021 12:47:57 - INFO - __main__ - Step 110637: {'lr': 8.231286153440154e-05, 'samples': 21242304, 'steps': 110636, 'loss/train': 1.4296389818191528} 11/07/2021 12:47:58 - INFO - __main__ - Step 110638: {'lr': 8.23089256404356e-05, 'samples': 21242496, 'steps': 110637, 'loss/train': 0.8668188452720642} 11/07/2021 12:47:58 - INFO - __main__ - Step 110639: {'lr': 8.230498982202864e-05, 'samples': 21242688, 'steps': 110638, 'loss/train': 1.246320128440857} 11/07/2021 12:47:58 - INFO - __main__ - Step 110640: {'lr': 8.230105407918248e-05, 'samples': 21242880, 'steps': 110639, 'loss/train': 2.1222879886627197} 11/07/2021 12:47:59 - INFO - __main__ - Step 110641: {'lr': 8.229711841189889e-05, 'samples': 21243072, 'steps': 110640, 'loss/train': 1.346043586730957} 11/07/2021 12:47:59 - INFO - __main__ - Step 110642: {'lr': 8.22931828201797e-05, 'samples': 21243264, 'steps': 110641, 'loss/train': 1.0852361917495728} 11/07/2021 12:48:00 - INFO - __main__ - Step 110643: {'lr': 8.228924730402654e-05, 'samples': 21243456, 'steps': 110642, 'loss/train': 1.1534372568130493} 11/07/2021 12:48:00 - INFO - __main__ - Step 110644: {'lr': 8.228531186344124e-05, 'samples': 21243648, 'steps': 110643, 'loss/train': 1.3870668411254883} 11/07/2021 12:48:01 - INFO - __main__ - Step 110645: {'lr': 8.22813764984256e-05, 'samples': 21243840, 'steps': 110644, 'loss/train': 1.6123826503753662} 11/07/2021 12:48:01 - INFO - __main__ - Step 110646: {'lr': 8.227744120898136e-05, 'samples': 21244032, 'steps': 110645, 'loss/train': 1.3731058835983276} 11/07/2021 12:48:01 - INFO - __main__ - Step 110647: {'lr': 8.227350599511036e-05, 'samples': 21244224, 'steps': 110646, 'loss/train': 1.0561603307724} 11/07/2021 12:48:03 - INFO - __main__ - Step 110648: {'lr': 8.22695708568143e-05, 'samples': 21244416, 'steps': 110647, 'loss/train': 1.3964895009994507} 11/07/2021 12:48:03 - INFO - __main__ - Step 110649: {'lr': 8.226563579409498e-05, 'samples': 21244608, 'steps': 110648, 'loss/train': 1.6014235019683838} 11/07/2021 12:48:03 - INFO - __main__ - Step 110650: {'lr': 8.226170080695419e-05, 'samples': 21244800, 'steps': 110649, 'loss/train': 1.7008286714553833} 11/07/2021 12:48:04 - INFO - __main__ - Step 110651: {'lr': 8.225776589539372e-05, 'samples': 21244992, 'steps': 110650, 'loss/train': 1.0748753547668457} 11/07/2021 12:48:04 - INFO - __main__ - Step 110652: {'lr': 8.225383105941525e-05, 'samples': 21245184, 'steps': 110651, 'loss/train': 1.645250916481018} 11/07/2021 12:48:04 - INFO - __main__ - Step 110653: {'lr': 8.224989629902066e-05, 'samples': 21245376, 'steps': 110652, 'loss/train': 1.4471291303634644} 11/07/2021 12:48:05 - INFO - __main__ - Step 110654: {'lr': 8.224596161421166e-05, 'samples': 21245568, 'steps': 110653, 'loss/train': 1.5861237049102783} 11/07/2021 12:48:06 - INFO - __main__ - Step 110655: {'lr': 8.224202700499011e-05, 'samples': 21245760, 'steps': 110654, 'loss/train': 1.5163633823394775} 11/07/2021 12:48:06 - INFO - __main__ - Step 110656: {'lr': 8.223809247135766e-05, 'samples': 21245952, 'steps': 110655, 'loss/train': 1.0536514520645142} 11/07/2021 12:48:06 - INFO - __main__ - Step 110657: {'lr': 8.223415801331614e-05, 'samples': 21246144, 'steps': 110656, 'loss/train': 1.0560173988342285} 11/07/2021 12:48:07 - INFO - __main__ - Step 110658: {'lr': 8.22302236308673e-05, 'samples': 21246336, 'steps': 110657, 'loss/train': 0.7287185788154602} 11/07/2021 12:48:08 - INFO - __main__ - Step 110659: {'lr': 8.222628932401293e-05, 'samples': 21246528, 'steps': 110658, 'loss/train': 1.3590891361236572} 11/07/2021 12:48:08 - INFO - __main__ - Step 110660: {'lr': 8.222235509275483e-05, 'samples': 21246720, 'steps': 110659, 'loss/train': 1.11533784866333} 11/07/2021 12:48:09 - INFO - __main__ - Step 110661: {'lr': 8.221842093709473e-05, 'samples': 21246912, 'steps': 110660, 'loss/train': 1.1552317142486572} 11/07/2021 12:48:09 - INFO - __main__ - Step 110662: {'lr': 8.221448685703442e-05, 'samples': 21247104, 'steps': 110661, 'loss/train': 1.4168277978897095} 11/07/2021 12:48:09 - INFO - __main__ - Step 110663: {'lr': 8.221055285257568e-05, 'samples': 21247296, 'steps': 110662, 'loss/train': 1.6361604928970337} 11/07/2021 12:48:10 - INFO - __main__ - Step 110664: {'lr': 8.220661892372025e-05, 'samples': 21247488, 'steps': 110663, 'loss/train': 1.0479000806808472} 11/07/2021 12:48:11 - INFO - __main__ - Step 110665: {'lr': 8.220268507046997e-05, 'samples': 21247680, 'steps': 110664, 'loss/train': 1.4414684772491455} 11/07/2021 12:48:11 - INFO - __main__ - Step 110666: {'lr': 8.219875129282652e-05, 'samples': 21247872, 'steps': 110665, 'loss/train': 1.4551947116851807} 11/07/2021 12:48:11 - INFO - __main__ - Step 110667: {'lr': 8.219481759079176e-05, 'samples': 21248064, 'steps': 110666, 'loss/train': 1.0030345916748047} 11/07/2021 12:48:12 - INFO - __main__ - Step 110668: {'lr': 8.219088396436741e-05, 'samples': 21248256, 'steps': 110667, 'loss/train': 1.6022595167160034} 11/07/2021 12:48:13 - INFO - __main__ - Step 110669: {'lr': 8.218695041355537e-05, 'samples': 21248448, 'steps': 110668, 'loss/train': 1.1307629346847534} 11/07/2021 12:48:13 - INFO - __main__ - Step 110670: {'lr': 8.218301693835719e-05, 'samples': 21248640, 'steps': 110669, 'loss/train': 1.5571632385253906} 11/07/2021 12:48:13 - INFO - __main__ - Step 110671: {'lr': 8.217908353877476e-05, 'samples': 21248832, 'steps': 110670, 'loss/train': 1.4117851257324219} 11/07/2021 12:48:14 - INFO - __main__ - Step 110672: {'lr': 8.217515021480983e-05, 'samples': 21249024, 'steps': 110671, 'loss/train': 1.3462209701538086} 11/07/2021 12:48:14 - INFO - __main__ - Step 110673: {'lr': 8.217121696646421e-05, 'samples': 21249216, 'steps': 110672, 'loss/train': 1.2532153129577637} 11/07/2021 12:48:15 - INFO - __main__ - Step 110674: {'lr': 8.216728379373964e-05, 'samples': 21249408, 'steps': 110673, 'loss/train': 1.51374351978302} 11/07/2021 12:48:16 - INFO - __main__ - Step 110675: {'lr': 8.216335069663791e-05, 'samples': 21249600, 'steps': 110674, 'loss/train': 1.297188639640808} 11/07/2021 12:48:16 - INFO - __main__ - Step 110676: {'lr': 8.215941767516077e-05, 'samples': 21249792, 'steps': 110675, 'loss/train': 1.987684965133667} 11/07/2021 12:48:16 - INFO - __main__ - Step 110677: {'lr': 8.215548472931004e-05, 'samples': 21249984, 'steps': 110676, 'loss/train': 1.4090352058410645} 11/07/2021 12:48:17 - INFO - __main__ - Step 110678: {'lr': 8.215155185908743e-05, 'samples': 21250176, 'steps': 110677, 'loss/train': 2.168642282485962} 11/07/2021 12:48:18 - INFO - __main__ - Step 110679: {'lr': 8.214761906449475e-05, 'samples': 21250368, 'steps': 110678, 'loss/train': 1.55726158618927} 11/07/2021 12:48:18 - INFO - __main__ - Step 110680: {'lr': 8.214368634553374e-05, 'samples': 21250560, 'steps': 110679, 'loss/train': 1.149858832359314} 11/07/2021 12:48:18 - INFO - __main__ - Step 110681: {'lr': 8.213975370220622e-05, 'samples': 21250752, 'steps': 110680, 'loss/train': 1.0687888860702515} 11/07/2021 12:48:19 - INFO - __main__ - Step 110682: {'lr': 8.213582113451401e-05, 'samples': 21250944, 'steps': 110681, 'loss/train': 1.5312366485595703} 11/07/2021 12:48:19 - INFO - __main__ - Step 110683: {'lr': 8.213188864245873e-05, 'samples': 21251136, 'steps': 110682, 'loss/train': 0.850086510181427} 11/07/2021 12:48:19 - INFO - __main__ - Step 110684: {'lr': 8.212795622604222e-05, 'samples': 21251328, 'steps': 110683, 'loss/train': 1.1422226428985596} 11/07/2021 12:48:21 - INFO - __main__ - Step 110685: {'lr': 8.212402388526627e-05, 'samples': 21251520, 'steps': 110684, 'loss/train': 1.0919674634933472} 11/07/2021 12:48:21 - INFO - __main__ - Step 110686: {'lr': 8.212009162013264e-05, 'samples': 21251712, 'steps': 110685, 'loss/train': 1.3335826396942139} 11/07/2021 12:48:21 - INFO - __main__ - Step 110687: {'lr': 8.211615943064312e-05, 'samples': 21251904, 'steps': 110686, 'loss/train': 0.6186112761497498} 11/07/2021 12:48:22 - INFO - __main__ - Step 110688: {'lr': 8.211222731679946e-05, 'samples': 21252096, 'steps': 110687, 'loss/train': 1.366358757019043} 11/07/2021 12:48:22 - INFO - __main__ - Step 110689: {'lr': 8.210829527860344e-05, 'samples': 21252288, 'steps': 110688, 'loss/train': 1.3547495603561401} 11/07/2021 12:48:23 - INFO - __main__ - Step 110690: {'lr': 8.210436331605683e-05, 'samples': 21252480, 'steps': 110689, 'loss/train': 1.7464109659194946} 11/07/2021 12:48:23 - INFO - __main__ - Step 110691: {'lr': 8.21004314291614e-05, 'samples': 21252672, 'steps': 110690, 'loss/train': 1.4842932224273682} 11/07/2021 12:48:24 - INFO - __main__ - Step 110692: {'lr': 8.209649961791893e-05, 'samples': 21252864, 'steps': 110691, 'loss/train': 1.1335171461105347} 11/07/2021 12:48:24 - INFO - __main__ - Step 110693: {'lr': 8.209256788233119e-05, 'samples': 21253056, 'steps': 110692, 'loss/train': 1.2780518531799316} 11/07/2021 12:48:24 - INFO - __main__ - Step 110694: {'lr': 8.208863622239995e-05, 'samples': 21253248, 'steps': 110693, 'loss/train': 0.8067029714584351} 11/07/2021 12:48:25 - INFO - __main__ - Step 110695: {'lr': 8.208470463812706e-05, 'samples': 21253440, 'steps': 110694, 'loss/train': 1.427488088607788} 11/07/2021 12:48:26 - INFO - __main__ - Step 110696: {'lr': 8.208077312951412e-05, 'samples': 21253632, 'steps': 110695, 'loss/train': 1.3478630781173706} 11/07/2021 12:48:26 - INFO - __main__ - Step 110697: {'lr': 8.207684169656298e-05, 'samples': 21253824, 'steps': 110696, 'loss/train': 1.5847318172454834} 11/07/2021 12:48:26 - INFO - __main__ - Step 110698: {'lr': 8.207291033927545e-05, 'samples': 21254016, 'steps': 110697, 'loss/train': 1.539792776107788} 11/07/2021 12:48:27 - INFO - __main__ - Step 110699: {'lr': 8.206897905765326e-05, 'samples': 21254208, 'steps': 110698, 'loss/train': 1.3739140033721924} 11/07/2021 12:48:28 - INFO - __main__ - Step 110700: {'lr': 8.206504785169821e-05, 'samples': 21254400, 'steps': 110699, 'loss/train': 1.3555974960327148} 11/07/2021 12:48:28 - INFO - __main__ - Step 110701: {'lr': 8.206111672141204e-05, 'samples': 21254592, 'steps': 110700, 'loss/train': 1.5597007274627686} 11/07/2021 12:48:29 - INFO - __main__ - Step 110702: {'lr': 8.205718566679654e-05, 'samples': 21254784, 'steps': 110701, 'loss/train': 1.3383417129516602} 11/07/2021 12:48:29 - INFO - __main__ - Step 110703: {'lr': 8.205325468785348e-05, 'samples': 21254976, 'steps': 110702, 'loss/train': 1.0597721338272095} 11/07/2021 12:48:29 - INFO - __main__ - Step 110704: {'lr': 8.204932378458466e-05, 'samples': 21255168, 'steps': 110703, 'loss/train': 0.5761095881462097} 11/07/2021 12:48:30 - INFO - __main__ - Step 110705: {'lr': 8.204539295699182e-05, 'samples': 21255360, 'steps': 110704, 'loss/train': 1.7123957872390747} 11/07/2021 12:48:31 - INFO - __main__ - Step 110706: {'lr': 8.20414622050768e-05, 'samples': 21255552, 'steps': 110705, 'loss/train': 1.3238344192504883} 11/07/2021 12:48:31 - INFO - __main__ - Step 110707: {'lr': 8.203753152884122e-05, 'samples': 21255744, 'steps': 110706, 'loss/train': 1.0540164709091187} 11/07/2021 12:48:31 - INFO - __main__ - Step 110708: {'lr': 8.203360092828693e-05, 'samples': 21255936, 'steps': 110707, 'loss/train': 1.1530207395553589} 11/07/2021 12:48:32 - INFO - __main__ - Step 110709: {'lr': 8.202967040341572e-05, 'samples': 21256128, 'steps': 110708, 'loss/train': 1.409467339515686} 11/07/2021 12:48:32 - INFO - __main__ - Step 110710: {'lr': 8.202573995422935e-05, 'samples': 21256320, 'steps': 110709, 'loss/train': 1.5471082925796509} 11/07/2021 12:48:33 - INFO - __main__ - Step 110711: {'lr': 8.202180958072958e-05, 'samples': 21256512, 'steps': 110710, 'loss/train': 0.94731205701828} 11/07/2021 12:48:33 - INFO - __main__ - Step 110712: {'lr': 8.20178792829182e-05, 'samples': 21256704, 'steps': 110711, 'loss/train': 1.6338773965835571} 11/07/2021 12:48:34 - INFO - __main__ - Step 110713: {'lr': 8.201394906079698e-05, 'samples': 21256896, 'steps': 110712, 'loss/train': 1.233245849609375} 11/07/2021 12:48:34 - INFO - __main__ - Step 110714: {'lr': 8.201001891436765e-05, 'samples': 21257088, 'steps': 110713, 'loss/train': 1.2065162658691406} 11/07/2021 12:48:35 - INFO - __main__ - Step 110715: {'lr': 8.200608884363204e-05, 'samples': 21257280, 'steps': 110714, 'loss/train': 1.5564137697219849} 11/07/2021 12:48:36 - INFO - __main__ - Step 110716: {'lr': 8.200215884859188e-05, 'samples': 21257472, 'steps': 110715, 'loss/train': 1.610504150390625} 11/07/2021 12:48:36 - INFO - __main__ - Step 110717: {'lr': 8.199822892924905e-05, 'samples': 21257664, 'steps': 110716, 'loss/train': 1.2108469009399414} 11/07/2021 12:48:36 - INFO - __main__ - Step 110718: {'lr': 8.199429908560516e-05, 'samples': 21257856, 'steps': 110717, 'loss/train': 1.1237478256225586} 11/07/2021 12:48:37 - INFO - __main__ - Step 110719: {'lr': 8.199036931766202e-05, 'samples': 21258048, 'steps': 110718, 'loss/train': 1.176120638847351} 11/07/2021 12:48:37 - INFO - __main__ - Step 110720: {'lr': 8.198643962542143e-05, 'samples': 21258240, 'steps': 110719, 'loss/train': 1.3786499500274658} 11/07/2021 12:48:39 - INFO - __main__ - Step 110721: {'lr': 8.198251000888516e-05, 'samples': 21258432, 'steps': 110720, 'loss/train': 0.9710098505020142} 11/07/2021 12:48:39 - INFO - __main__ - Step 110722: {'lr': 8.197858046805498e-05, 'samples': 21258624, 'steps': 110721, 'loss/train': 1.1385937929153442} 11/07/2021 12:48:39 - INFO - __main__ - Step 110723: {'lr': 8.197465100293264e-05, 'samples': 21258816, 'steps': 110722, 'loss/train': 1.5600953102111816} 11/07/2021 12:48:40 - INFO - __main__ - Step 110724: {'lr': 8.197072161351996e-05, 'samples': 21259008, 'steps': 110723, 'loss/train': 2.0252888202667236} 11/07/2021 12:48:40 - INFO - __main__ - Step 110725: {'lr': 8.196679229981866e-05, 'samples': 21259200, 'steps': 110724, 'loss/train': 2.309241771697998} 11/07/2021 12:48:40 - INFO - __main__ - Step 110726: {'lr': 8.196286306183054e-05, 'samples': 21259392, 'steps': 110725, 'loss/train': 1.5494650602340698} 11/07/2021 12:48:41 - INFO - __main__ - Step 110727: {'lr': 8.195893389955735e-05, 'samples': 21259584, 'steps': 110726, 'loss/train': 1.373620629310608} 11/07/2021 12:48:42 - INFO - __main__ - Step 110728: {'lr': 8.195500481300097e-05, 'samples': 21259776, 'steps': 110727, 'loss/train': 1.3436307907104492} 11/07/2021 12:48:42 - INFO - __main__ - Step 110729: {'lr': 8.195107580216298e-05, 'samples': 21259968, 'steps': 110728, 'loss/train': 1.1869405508041382} 11/07/2021 12:48:42 - INFO - __main__ - Step 110730: {'lr': 8.194714686704524e-05, 'samples': 21260160, 'steps': 110729, 'loss/train': 1.2966328859329224} 11/07/2021 12:48:43 - INFO - __main__ - Step 110731: {'lr': 8.194321800764953e-05, 'samples': 21260352, 'steps': 110730, 'loss/train': 1.7454824447631836} 11/07/2021 12:48:44 - INFO - __main__ - Step 110732: {'lr': 8.193928922397762e-05, 'samples': 21260544, 'steps': 110731, 'loss/train': 1.4439668655395508} 11/07/2021 12:48:44 - INFO - __main__ - Step 110733: {'lr': 8.193536051603123e-05, 'samples': 21260736, 'steps': 110732, 'loss/train': 1.5072152614593506} 11/07/2021 12:48:45 - INFO - __main__ - Step 110734: {'lr': 8.193143188381221e-05, 'samples': 21260928, 'steps': 110733, 'loss/train': 1.7511011362075806} 11/07/2021 12:48:45 - INFO - __main__ - Step 110735: {'lr': 8.192750332732229e-05, 'samples': 21261120, 'steps': 110734, 'loss/train': 0.8850861191749573} 11/07/2021 12:48:45 - INFO - __main__ - Step 110736: {'lr': 8.192357484656324e-05, 'samples': 21261312, 'steps': 110735, 'loss/train': 1.288577675819397} 11/07/2021 12:48:46 - INFO - __main__ - Step 110737: {'lr': 8.191964644153686e-05, 'samples': 21261504, 'steps': 110736, 'loss/train': 1.440421223640442} 11/07/2021 12:48:47 - INFO - __main__ - Step 110738: {'lr': 8.191571811224486e-05, 'samples': 21261696, 'steps': 110737, 'loss/train': 0.6176782250404358} 11/07/2021 12:48:47 - INFO - __main__ - Step 110739: {'lr': 8.191178985868914e-05, 'samples': 21261888, 'steps': 110738, 'loss/train': 1.3988465070724487} 11/07/2021 12:48:47 - INFO - __main__ - Step 110740: {'lr': 8.190786168087128e-05, 'samples': 21262080, 'steps': 110739, 'loss/train': 0.9429416656494141} 11/07/2021 12:48:48 - INFO - __main__ - Step 110741: {'lr': 8.190393357879313e-05, 'samples': 21262272, 'steps': 110740, 'loss/train': 1.203632116317749} 11/07/2021 12:48:48 - INFO - __main__ - Step 110742: {'lr': 8.19000055524565e-05, 'samples': 21262464, 'steps': 110741, 'loss/train': 1.4019746780395508} 11/07/2021 12:48:49 - INFO - __main__ - Step 110743: {'lr': 8.189607760186313e-05, 'samples': 21262656, 'steps': 110742, 'loss/train': 1.3236186504364014} 11/07/2021 12:48:49 - INFO - __main__ - Step 110744: {'lr': 8.189214972701478e-05, 'samples': 21262848, 'steps': 110743, 'loss/train': 1.390566110610962} 11/07/2021 12:48:50 - INFO - __main__ - Step 110745: {'lr': 8.188822192791326e-05, 'samples': 21263040, 'steps': 110744, 'loss/train': 1.6164491176605225} 11/07/2021 12:48:50 - INFO - __main__ - Step 110746: {'lr': 8.188429420456028e-05, 'samples': 21263232, 'steps': 110745, 'loss/train': 1.0560944080352783} 11/07/2021 12:48:50 - INFO - __main__ - Step 110747: {'lr': 8.188036655695766e-05, 'samples': 21263424, 'steps': 110746, 'loss/train': 0.74655681848526} 11/07/2021 12:48:52 - INFO - __main__ - Step 110748: {'lr': 8.187643898510716e-05, 'samples': 21263616, 'steps': 110747, 'loss/train': 1.0808771848678589} 11/07/2021 12:48:52 - INFO - __main__ - Step 110749: {'lr': 8.187251148901053e-05, 'samples': 21263808, 'steps': 110748, 'loss/train': 1.1667860746383667} 11/07/2021 12:48:52 - INFO - __main__ - Step 110750: {'lr': 8.186858406866965e-05, 'samples': 21264000, 'steps': 110749, 'loss/train': 1.5590254068374634} 11/07/2021 12:48:53 - INFO - __main__ - Step 110751: {'lr': 8.186465672408608e-05, 'samples': 21264192, 'steps': 110750, 'loss/train': 1.3226673603057861} 11/07/2021 12:48:53 - INFO - __main__ - Step 110752: {'lr': 8.186072945526174e-05, 'samples': 21264384, 'steps': 110751, 'loss/train': 0.10842128098011017} 11/07/2021 12:48:54 - INFO - __main__ - Step 110753: {'lr': 8.185680226219832e-05, 'samples': 21264576, 'steps': 110752, 'loss/train': 1.2636268138885498} 11/07/2021 12:48:54 - INFO - __main__ - Step 110754: {'lr': 8.185287514489767e-05, 'samples': 21264768, 'steps': 110753, 'loss/train': 1.3390061855316162} 11/07/2021 12:48:55 - INFO - __main__ - Step 110755: {'lr': 8.184894810336149e-05, 'samples': 21264960, 'steps': 110754, 'loss/train': 1.455955147743225} 11/07/2021 12:48:55 - INFO - __main__ - Step 110756: {'lr': 8.18450211375916e-05, 'samples': 21265152, 'steps': 110755, 'loss/train': 1.3939706087112427} 11/07/2021 12:48:55 - INFO - __main__ - Step 110757: {'lr': 8.184109424758973e-05, 'samples': 21265344, 'steps': 110756, 'loss/train': 1.5078188180923462} 11/07/2021 12:48:56 - INFO - __main__ - Step 110758: {'lr': 8.183716743335767e-05, 'samples': 21265536, 'steps': 110757, 'loss/train': 0.20046356320381165} 11/07/2021 12:48:57 - INFO - __main__ - Step 110759: {'lr': 8.18332406948972e-05, 'samples': 21265728, 'steps': 110758, 'loss/train': 0.9585129022598267} 11/07/2021 12:48:57 - INFO - __main__ - Step 110760: {'lr': 8.182931403221006e-05, 'samples': 21265920, 'steps': 110759, 'loss/train': 1.4737564325332642} 11/07/2021 12:48:58 - INFO - __main__ - Step 110761: {'lr': 8.182538744529805e-05, 'samples': 21266112, 'steps': 110760, 'loss/train': 1.5325512886047363} 11/07/2021 12:48:58 - INFO - __main__ - Step 110762: {'lr': 8.182146093416292e-05, 'samples': 21266304, 'steps': 110761, 'loss/train': 0.8542195558547974} 11/07/2021 12:48:58 - INFO - __main__ - Step 110763: {'lr': 8.181753449880652e-05, 'samples': 21266496, 'steps': 110762, 'loss/train': 1.0631719827651978} 11/07/2021 12:48:59 - INFO - __main__ - Step 110764: {'lr': 8.181360813923047e-05, 'samples': 21266688, 'steps': 110763, 'loss/train': 1.1335355043411255} 11/07/2021 12:49:00 - INFO - __main__ - Step 110765: {'lr': 8.18096818554366e-05, 'samples': 21266880, 'steps': 110764, 'loss/train': 1.121647834777832} 11/07/2021 12:49:00 - INFO - __main__ - Step 110766: {'lr': 8.180575564742673e-05, 'samples': 21267072, 'steps': 110765, 'loss/train': 1.3370468616485596} 11/07/2021 12:49:00 - INFO - __main__ - Step 110767: {'lr': 8.180182951520257e-05, 'samples': 21267264, 'steps': 110766, 'loss/train': 1.4781783819198608} 11/07/2021 12:49:01 - INFO - __main__ - Step 110768: {'lr': 8.179790345876589e-05, 'samples': 21267456, 'steps': 110767, 'loss/train': 1.410469651222229} 11/07/2021 12:49:02 - INFO - __main__ - Step 110769: {'lr': 8.179397747811851e-05, 'samples': 21267648, 'steps': 110768, 'loss/train': 1.389719843864441} 11/07/2021 12:49:02 - INFO - __main__ - Step 110770: {'lr': 8.179005157326214e-05, 'samples': 21267840, 'steps': 110769, 'loss/train': 1.2153953313827515} 11/07/2021 12:49:03 - INFO - __main__ - Step 110771: {'lr': 8.17861257441986e-05, 'samples': 21268032, 'steps': 110770, 'loss/train': 0.35816988348960876} 11/07/2021 12:49:03 - INFO - __main__ - Step 110772: {'lr': 8.178219999092962e-05, 'samples': 21268224, 'steps': 110771, 'loss/train': 1.4301100969314575} 11/07/2021 12:49:03 - INFO - __main__ - Step 110773: {'lr': 8.1778274313457e-05, 'samples': 21268416, 'steps': 110772, 'loss/train': 1.9223368167877197} 11/07/2021 12:49:04 - INFO - __main__ - Step 110774: {'lr': 8.177434871178247e-05, 'samples': 21268608, 'steps': 110773, 'loss/train': 1.8488402366638184} 11/07/2021 12:49:05 - INFO - __main__ - Step 110775: {'lr': 8.177042318590785e-05, 'samples': 21268800, 'steps': 110774, 'loss/train': 1.4539192914962769} 11/07/2021 12:49:05 - INFO - __main__ - Step 110776: {'lr': 8.176649773583495e-05, 'samples': 21268992, 'steps': 110775, 'loss/train': 2.009864330291748} 11/07/2021 12:49:05 - INFO - __main__ - Step 110777: {'lr': 8.176257236156539e-05, 'samples': 21269184, 'steps': 110776, 'loss/train': 1.4098371267318726} 11/07/2021 12:49:06 - INFO - __main__ - Step 110778: {'lr': 8.175864706310102e-05, 'samples': 21269376, 'steps': 110777, 'loss/train': 2.1723744869232178} 11/07/2021 12:49:07 - INFO - __main__ - Step 110779: {'lr': 8.175472184044361e-05, 'samples': 21269568, 'steps': 110778, 'loss/train': 1.4956415891647339} 11/07/2021 12:49:08 - INFO - __main__ - Step 110780: {'lr': 8.175079669359492e-05, 'samples': 21269760, 'steps': 110779, 'loss/train': 0.8331304788589478} 11/07/2021 12:49:08 - INFO - __main__ - Step 110781: {'lr': 8.174687162255672e-05, 'samples': 21269952, 'steps': 110780, 'loss/train': 1.3459707498550415} 11/07/2021 12:49:08 - INFO - __main__ - Step 110782: {'lr': 8.174294662733078e-05, 'samples': 21270144, 'steps': 110781, 'loss/train': 1.6465091705322266} 11/07/2021 12:49:09 - INFO - __main__ - Step 110783: {'lr': 8.173902170791888e-05, 'samples': 21270336, 'steps': 110782, 'loss/train': 1.8692642450332642} 11/07/2021 12:49:09 - INFO - __main__ - Step 110784: {'lr': 8.173509686432279e-05, 'samples': 21270528, 'steps': 110783, 'loss/train': 2.0070583820343018} 11/07/2021 12:49:09 - INFO - __main__ - Step 110785: {'lr': 8.173117209654427e-05, 'samples': 21270720, 'steps': 110784, 'loss/train': 1.5341084003448486} 11/07/2021 12:49:10 - INFO - __main__ - Step 110786: {'lr': 8.172724740458506e-05, 'samples': 21270912, 'steps': 110785, 'loss/train': 1.89753258228302} 11/07/2021 12:49:11 - INFO - __main__ - Step 110787: {'lr': 8.172332278844699e-05, 'samples': 21271104, 'steps': 110786, 'loss/train': 1.3705977201461792} 11/07/2021 12:49:11 - INFO - __main__ - Step 110788: {'lr': 8.171939824813176e-05, 'samples': 21271296, 'steps': 110787, 'loss/train': 1.4612141847610474} 11/07/2021 12:49:12 - INFO - __main__ - Step 110789: {'lr': 8.17154737836412e-05, 'samples': 21271488, 'steps': 110788, 'loss/train': 1.091131567955017} 11/07/2021 12:49:12 - INFO - __main__ - Step 110790: {'lr': 8.171154939497713e-05, 'samples': 21271680, 'steps': 110789, 'loss/train': 1.3668166399002075} 11/07/2021 12:49:13 - INFO - __main__ - Step 110791: {'lr': 8.170762508214114e-05, 'samples': 21271872, 'steps': 110790, 'loss/train': 1.8429981470108032} 11/07/2021 12:49:13 - INFO - __main__ - Step 110792: {'lr': 8.170370084513511e-05, 'samples': 21272064, 'steps': 110791, 'loss/train': 1.4374723434448242} 11/07/2021 12:49:14 - INFO - __main__ - Step 110793: {'lr': 8.16997766839608e-05, 'samples': 21272256, 'steps': 110792, 'loss/train': 1.1470707654953003} 11/07/2021 12:49:14 - INFO - __main__ - Step 110794: {'lr': 8.169585259861997e-05, 'samples': 21272448, 'steps': 110793, 'loss/train': 1.7712818384170532} 11/07/2021 12:49:14 - INFO - __main__ - Step 110795: {'lr': 8.169192858911436e-05, 'samples': 21272640, 'steps': 110794, 'loss/train': 1.270838975906372} 11/07/2021 12:49:15 - INFO - __main__ - Step 110796: {'lr': 8.168800465544582e-05, 'samples': 21272832, 'steps': 110795, 'loss/train': 1.1294279098510742} 11/07/2021 12:49:16 - INFO - __main__ - Step 110797: {'lr': 8.168408079761605e-05, 'samples': 21273024, 'steps': 110796, 'loss/train': 1.422444462776184} 11/07/2021 12:49:16 - INFO - __main__ - Step 110798: {'lr': 8.168015701562684e-05, 'samples': 21273216, 'steps': 110797, 'loss/train': 1.2843780517578125} 11/07/2021 12:49:16 - INFO - __main__ - Step 110799: {'lr': 8.167623330947993e-05, 'samples': 21273408, 'steps': 110798, 'loss/train': 1.4651267528533936} 11/07/2021 12:49:17 - INFO - __main__ - Step 110800: {'lr': 8.167230967917713e-05, 'samples': 21273600, 'steps': 110799, 'loss/train': 1.5689659118652344} 11/07/2021 12:49:18 - INFO - __main__ - Step 110801: {'lr': 8.166838612472019e-05, 'samples': 21273792, 'steps': 110800, 'loss/train': 1.5162473917007446} 11/07/2021 12:49:18 - INFO - __main__ - Step 110802: {'lr': 8.166446264611088e-05, 'samples': 21273984, 'steps': 110801, 'loss/train': 1.2190803289413452} 11/07/2021 12:49:19 - INFO - __main__ - Step 110803: {'lr': 8.166053924335104e-05, 'samples': 21274176, 'steps': 110802, 'loss/train': 1.0923455953598022} 11/07/2021 12:49:19 - INFO - __main__ - Step 110804: {'lr': 8.165661591644227e-05, 'samples': 21274368, 'steps': 110803, 'loss/train': 0.9283493161201477} 11/07/2021 12:49:19 - INFO - __main__ - Step 110805: {'lr': 8.165269266538644e-05, 'samples': 21274560, 'steps': 110804, 'loss/train': 1.1295156478881836} 11/07/2021 12:49:20 - INFO - __main__ - Step 110806: {'lr': 8.164876949018532e-05, 'samples': 21274752, 'steps': 110805, 'loss/train': 1.2567299604415894} 11/07/2021 12:49:21 - INFO - __main__ - Step 110807: {'lr': 8.164484639084065e-05, 'samples': 21274944, 'steps': 110806, 'loss/train': 1.2733113765716553} 11/07/2021 12:49:21 - INFO - __main__ - Step 110808: {'lr': 8.164092336735424e-05, 'samples': 21275136, 'steps': 110807, 'loss/train': 1.496654748916626} 11/07/2021 12:49:21 - INFO - __main__ - Step 110809: {'lr': 8.163700041972783e-05, 'samples': 21275328, 'steps': 110808, 'loss/train': 1.101141095161438} 11/07/2021 12:49:22 - INFO - __main__ - Step 110810: {'lr': 8.163307754796318e-05, 'samples': 21275520, 'steps': 110809, 'loss/train': 1.1141598224639893} 11/07/2021 12:49:22 - INFO - __main__ - Step 110811: {'lr': 8.162915475206206e-05, 'samples': 21275712, 'steps': 110810, 'loss/train': 0.9029640555381775} 11/07/2021 12:49:23 - INFO - __main__ - Step 110812: {'lr': 8.162523203202623e-05, 'samples': 21275904, 'steps': 110811, 'loss/train': 1.334350824356079} 11/07/2021 12:49:24 - INFO - __main__ - Step 110813: {'lr': 8.16213093878575e-05, 'samples': 21276096, 'steps': 110812, 'loss/train': 1.9672266244888306} 11/07/2021 12:49:24 - INFO - __main__ - Step 110814: {'lr': 8.16173868195576e-05, 'samples': 21276288, 'steps': 110813, 'loss/train': 1.1391140222549438} 11/07/2021 12:49:24 - INFO - __main__ - Step 110815: {'lr': 8.16134643271283e-05, 'samples': 21276480, 'steps': 110814, 'loss/train': 1.1215722560882568} 11/07/2021 12:49:25 - INFO - __main__ - Step 110816: {'lr': 8.160954191057137e-05, 'samples': 21276672, 'steps': 110815, 'loss/train': 1.133870005607605} 11/07/2021 12:49:26 - INFO - __main__ - Step 110817: {'lr': 8.160561956988868e-05, 'samples': 21276864, 'steps': 110816, 'loss/train': 0.9702134728431702} 11/07/2021 12:49:26 - INFO - __main__ - Step 110818: {'lr': 8.160169730508182e-05, 'samples': 21277056, 'steps': 110817, 'loss/train': 1.5285617113113403} 11/07/2021 12:49:26 - INFO - __main__ - Step 110819: {'lr': 8.159777511615263e-05, 'samples': 21277248, 'steps': 110818, 'loss/train': 1.1507803201675415} 11/07/2021 12:49:27 - INFO - __main__ - Step 110820: {'lr': 8.15938530031029e-05, 'samples': 21277440, 'steps': 110819, 'loss/train': 1.5310238599777222} 11/07/2021 12:49:27 - INFO - __main__ - Step 110821: {'lr': 8.158993096593437e-05, 'samples': 21277632, 'steps': 110820, 'loss/train': 1.2636816501617432} 11/07/2021 12:49:28 - INFO - __main__ - Step 110822: {'lr': 8.15860090046488e-05, 'samples': 21277824, 'steps': 110821, 'loss/train': 1.5227453708648682} 11/07/2021 12:49:28 - INFO - __main__ - Step 110823: {'lr': 8.1582087119248e-05, 'samples': 21278016, 'steps': 110822, 'loss/train': 1.335868239402771} 11/07/2021 12:49:29 - INFO - __main__ - Step 110824: {'lr': 8.15781653097337e-05, 'samples': 21278208, 'steps': 110823, 'loss/train': 1.1854832172393799} 11/07/2021 12:49:29 - INFO - __main__ - Step 110825: {'lr': 8.157424357610768e-05, 'samples': 21278400, 'steps': 110824, 'loss/train': 1.2137876749038696} 11/07/2021 12:49:30 - INFO - __main__ - Step 110826: {'lr': 8.157032191837171e-05, 'samples': 21278592, 'steps': 110825, 'loss/train': 1.6205250024795532} 11/07/2021 12:49:31 - INFO - __main__ - Step 110827: {'lr': 8.156640033652754e-05, 'samples': 21278784, 'steps': 110826, 'loss/train': 1.4535924196243286} 11/07/2021 12:49:31 - INFO - __main__ - Step 110828: {'lr': 8.156247883057696e-05, 'samples': 21278976, 'steps': 110827, 'loss/train': 1.2376035451889038} 11/07/2021 12:49:31 - INFO - __main__ - Step 110829: {'lr': 8.155855740052173e-05, 'samples': 21279168, 'steps': 110828, 'loss/train': 2.772515296936035} 11/07/2021 12:49:32 - INFO - __main__ - Step 110830: {'lr': 8.15546360463637e-05, 'samples': 21279360, 'steps': 110829, 'loss/train': 1.3019856214523315} 11/07/2021 12:49:32 - INFO - __main__ - Step 110831: {'lr': 8.155071476810446e-05, 'samples': 21279552, 'steps': 110830, 'loss/train': 1.3743095397949219} 11/07/2021 12:49:33 - INFO - __main__ - Step 110832: {'lr': 8.15467935657459e-05, 'samples': 21279744, 'steps': 110831, 'loss/train': 1.2826733589172363} 11/07/2021 12:49:33 - INFO - __main__ - Step 110833: {'lr': 8.154287243928973e-05, 'samples': 21279936, 'steps': 110832, 'loss/train': 1.5792527198791504} 11/07/2021 12:49:34 - INFO - __main__ - Step 110834: {'lr': 8.153895138873773e-05, 'samples': 21280128, 'steps': 110833, 'loss/train': 1.4602439403533936} 11/07/2021 12:49:34 - INFO - __main__ - Step 110835: {'lr': 8.153503041409172e-05, 'samples': 21280320, 'steps': 110834, 'loss/train': 1.775022029876709} 11/07/2021 12:49:34 - INFO - __main__ - Step 110836: {'lr': 8.153110951535339e-05, 'samples': 21280512, 'steps': 110835, 'loss/train': 1.5425642728805542} 11/07/2021 12:49:35 - INFO - __main__ - Step 110837: {'lr': 8.152718869252454e-05, 'samples': 21280704, 'steps': 110836, 'loss/train': 1.3077425956726074} 11/07/2021 12:49:36 - INFO - __main__ - Step 110838: {'lr': 8.152326794560697e-05, 'samples': 21280896, 'steps': 110837, 'loss/train': 0.5632463693618774} 11/07/2021 12:49:36 - INFO - __main__ - Step 110839: {'lr': 8.151934727460239e-05, 'samples': 21281088, 'steps': 110838, 'loss/train': 1.3444397449493408} 11/07/2021 12:49:37 - INFO - __main__ - Step 110840: {'lr': 8.151542667951258e-05, 'samples': 21281280, 'steps': 110839, 'loss/train': 0.9100433588027954} 11/07/2021 12:49:37 - INFO - __main__ - Step 110841: {'lr': 8.151150616033934e-05, 'samples': 21281472, 'steps': 110840, 'loss/train': 1.2637569904327393} 11/07/2021 12:49:37 - INFO - __main__ - Step 110842: {'lr': 8.150758571708442e-05, 'samples': 21281664, 'steps': 110841, 'loss/train': 1.3413604497909546} 11/07/2021 12:49:39 - INFO - __main__ - Step 110843: {'lr': 8.150366534974956e-05, 'samples': 21281856, 'steps': 110842, 'loss/train': 1.5384808778762817} 11/07/2021 12:49:39 - INFO - __main__ - Step 110844: {'lr': 8.149974505833665e-05, 'samples': 21282048, 'steps': 110843, 'loss/train': 1.2755688428878784} 11/07/2021 12:49:40 - INFO - __main__ - Step 110845: {'lr': 8.149582484284728e-05, 'samples': 21282240, 'steps': 110844, 'loss/train': 1.209907054901123} 11/07/2021 12:49:40 - INFO - __main__ - Step 110846: {'lr': 8.149190470328327e-05, 'samples': 21282432, 'steps': 110845, 'loss/train': 0.9587525129318237} 11/07/2021 12:49:40 - INFO - __main__ - Step 110847: {'lr': 8.148798463964643e-05, 'samples': 21282624, 'steps': 110846, 'loss/train': 1.7196909189224243} 11/07/2021 12:49:41 - INFO - __main__ - Step 110848: {'lr': 8.14840646519385e-05, 'samples': 21282816, 'steps': 110847, 'loss/train': 1.7195281982421875} 11/07/2021 12:49:41 - INFO - __main__ - Step 110849: {'lr': 8.148014474016122e-05, 'samples': 21283008, 'steps': 110848, 'loss/train': 1.047878623008728} 11/07/2021 12:49:42 - INFO - __main__ - Step 110850: {'lr': 8.147622490431642e-05, 'samples': 21283200, 'steps': 110849, 'loss/train': 1.4128254652023315} 11/07/2021 12:49:43 - INFO - __main__ - Step 110851: {'lr': 8.147230514440582e-05, 'samples': 21283392, 'steps': 110850, 'loss/train': 1.4766085147857666} 11/07/2021 12:49:43 - INFO - __main__ - Step 110852: {'lr': 8.146838546043119e-05, 'samples': 21283584, 'steps': 110851, 'loss/train': 1.7057946920394897} 11/07/2021 12:49:43 - INFO - __main__ - Step 110853: {'lr': 8.14644658523943e-05, 'samples': 21283776, 'steps': 110852, 'loss/train': 1.1840074062347412} 11/07/2021 12:49:44 - INFO - __main__ - Step 110854: {'lr': 8.146054632029695e-05, 'samples': 21283968, 'steps': 110853, 'loss/train': 0.45377954840660095} 11/07/2021 12:49:45 - INFO - __main__ - Step 110855: {'lr': 8.145662686414085e-05, 'samples': 21284160, 'steps': 110854, 'loss/train': 0.6882617473602295} 11/07/2021 12:49:45 - INFO - __main__ - Step 110856: {'lr': 8.14527074839278e-05, 'samples': 21284352, 'steps': 110855, 'loss/train': 1.4025958776474} 11/07/2021 12:49:46 - INFO - __main__ - Step 110857: {'lr': 8.144878817965968e-05, 'samples': 21284544, 'steps': 110856, 'loss/train': 0.6807875037193298} 11/07/2021 12:49:46 - INFO - __main__ - Step 110858: {'lr': 8.144486895133798e-05, 'samples': 21284736, 'steps': 110857, 'loss/train': 0.8918752670288086} 11/07/2021 12:49:46 - INFO - __main__ - Step 110859: {'lr': 8.144094979896469e-05, 'samples': 21284928, 'steps': 110858, 'loss/train': 1.3761358261108398} 11/07/2021 12:49:47 - INFO - __main__ - Step 110860: {'lr': 8.143703072254147e-05, 'samples': 21285120, 'steps': 110859, 'loss/train': 0.7309658527374268} 11/07/2021 12:49:48 - INFO - __main__ - Step 110861: {'lr': 8.143311172207013e-05, 'samples': 21285312, 'steps': 110860, 'loss/train': 1.4547866582870483} 11/07/2021 12:49:48 - INFO - __main__ - Step 110862: {'lr': 8.142919279755243e-05, 'samples': 21285504, 'steps': 110861, 'loss/train': 0.8610925674438477} 11/07/2021 12:49:48 - INFO - __main__ - Step 110863: {'lr': 8.142527394899013e-05, 'samples': 21285696, 'steps': 110862, 'loss/train': 1.546951174736023} 11/07/2021 12:49:49 - INFO - __main__ - Step 110864: {'lr': 8.1421355176385e-05, 'samples': 21285888, 'steps': 110863, 'loss/train': 0.7182362675666809} 11/07/2021 12:49:50 - INFO - __main__ - Step 110865: {'lr': 8.141743647973881e-05, 'samples': 21286080, 'steps': 110864, 'loss/train': 1.6204395294189453} 11/07/2021 12:49:50 - INFO - __main__ - Step 110866: {'lr': 8.141351785905332e-05, 'samples': 21286272, 'steps': 110865, 'loss/train': 1.0831358432769775} 11/07/2021 12:49:50 - INFO - __main__ - Step 110867: {'lr': 8.140959931433028e-05, 'samples': 21286464, 'steps': 110866, 'loss/train': 1.430601954460144} 11/07/2021 12:49:51 - INFO - __main__ - Step 110868: {'lr': 8.140568084557151e-05, 'samples': 21286656, 'steps': 110867, 'loss/train': 1.3352700471878052} 11/07/2021 12:49:51 - INFO - __main__ - Step 110869: {'lr': 8.140176245277872e-05, 'samples': 21286848, 'steps': 110868, 'loss/train': 1.3482691049575806} 11/07/2021 12:49:52 - INFO - __main__ - Step 110870: {'lr': 8.139784413595369e-05, 'samples': 21287040, 'steps': 110869, 'loss/train': 1.257608413696289} 11/07/2021 12:49:53 - INFO - __main__ - Step 110871: {'lr': 8.139392589509827e-05, 'samples': 21287232, 'steps': 110870, 'loss/train': 1.501379370689392} 11/07/2021 12:49:53 - INFO - __main__ - Step 110872: {'lr': 8.139000773021407e-05, 'samples': 21287424, 'steps': 110871, 'loss/train': 1.580474615097046} 11/07/2021 12:49:53 - INFO - __main__ - Step 110873: {'lr': 8.138608964130292e-05, 'samples': 21287616, 'steps': 110872, 'loss/train': 1.3046756982803345} 11/07/2021 12:49:54 - INFO - __main__ - Step 110874: {'lr': 8.138217162836662e-05, 'samples': 21287808, 'steps': 110873, 'loss/train': 1.0827423334121704} 11/07/2021 12:49:54 - INFO - __main__ - Step 110875: {'lr': 8.137825369140689e-05, 'samples': 21288000, 'steps': 110874, 'loss/train': 1.2176250219345093} 11/07/2021 12:49:55 - INFO - __main__ - Step 110876: {'lr': 8.137433583042553e-05, 'samples': 21288192, 'steps': 110875, 'loss/train': 1.0617823600769043} 11/07/2021 12:49:55 - INFO - __main__ - Step 110877: {'lr': 8.13704180454243e-05, 'samples': 21288384, 'steps': 110876, 'loss/train': 0.9845383763313293} 11/07/2021 12:49:56 - INFO - __main__ - Step 110878: {'lr': 8.136650033640494e-05, 'samples': 21288576, 'steps': 110877, 'loss/train': 1.7631244659423828} 11/07/2021 12:49:56 - INFO - __main__ - Step 110879: {'lr': 8.136258270336924e-05, 'samples': 21288768, 'steps': 110878, 'loss/train': 1.3860493898391724} 11/07/2021 12:49:57 - INFO - __main__ - Step 110880: {'lr': 8.135866514631895e-05, 'samples': 21288960, 'steps': 110879, 'loss/train': 1.1018716096878052} 11/07/2021 12:49:58 - INFO - __main__ - Step 110881: {'lr': 8.135474766525586e-05, 'samples': 21289152, 'steps': 110880, 'loss/train': 1.6834535598754883} 11/07/2021 12:49:58 - INFO - __main__ - Step 110882: {'lr': 8.135083026018169e-05, 'samples': 21289344, 'steps': 110881, 'loss/train': 1.322518229484558} 11/07/2021 12:49:58 - INFO - __main__ - Step 110883: {'lr': 8.134691293109825e-05, 'samples': 21289536, 'steps': 110882, 'loss/train': 1.656830072402954} 11/07/2021 12:49:59 - INFO - __main__ - Step 110884: {'lr': 8.134299567800738e-05, 'samples': 21289728, 'steps': 110883, 'loss/train': 1.2264069318771362} 11/07/2021 12:49:59 - INFO - __main__ - Step 110885: {'lr': 8.133907850091066e-05, 'samples': 21289920, 'steps': 110884, 'loss/train': 1.5478435754776} 11/07/2021 12:50:00 - INFO - __main__ - Step 110886: {'lr': 8.133516139980996e-05, 'samples': 21290112, 'steps': 110885, 'loss/train': 0.9246073365211487} 11/07/2021 12:50:00 - INFO - __main__ - Step 110887: {'lr': 8.133124437470702e-05, 'samples': 21290304, 'steps': 110886, 'loss/train': 1.3060518503189087} 11/07/2021 12:50:01 - INFO - __main__ - Step 110888: {'lr': 8.132732742560361e-05, 'samples': 21290496, 'steps': 110887, 'loss/train': 0.9745162129402161} 11/07/2021 12:50:01 - INFO - __main__ - Step 110889: {'lr': 8.132341055250153e-05, 'samples': 21290688, 'steps': 110888, 'loss/train': 1.5609861612319946} 11/07/2021 12:50:01 - INFO - __main__ - Step 110890: {'lr': 8.131949375540249e-05, 'samples': 21290880, 'steps': 110889, 'loss/train': 1.2901002168655396} 11/07/2021 12:50:03 - INFO - __main__ - Step 110891: {'lr': 8.13155770343083e-05, 'samples': 21291072, 'steps': 110890, 'loss/train': 1.7419425249099731} 11/07/2021 12:50:03 - INFO - __main__ - Step 110892: {'lr': 8.131166038922072e-05, 'samples': 21291264, 'steps': 110891, 'loss/train': 1.3407580852508545} 11/07/2021 12:50:03 - INFO - __main__ - Step 110893: {'lr': 8.130774382014147e-05, 'samples': 21291456, 'steps': 110892, 'loss/train': 1.6949310302734375} 11/07/2021 12:50:04 - INFO - __main__ - Step 110894: {'lr': 8.130382732707236e-05, 'samples': 21291648, 'steps': 110893, 'loss/train': 1.0373462438583374} 11/07/2021 12:50:04 - INFO - __main__ - Step 110895: {'lr': 8.129991091001515e-05, 'samples': 21291840, 'steps': 110894, 'loss/train': 1.5554659366607666} 11/07/2021 12:50:04 - INFO - __main__ - Step 110896: {'lr': 8.12959945689716e-05, 'samples': 21292032, 'steps': 110895, 'loss/train': 1.3626822233200073} 11/07/2021 12:50:05 - INFO - __main__ - Step 110897: {'lr': 8.129207830394355e-05, 'samples': 21292224, 'steps': 110896, 'loss/train': 1.704513669013977} 11/07/2021 12:50:06 - INFO - __main__ - Step 110898: {'lr': 8.128816211493261e-05, 'samples': 21292416, 'steps': 110897, 'loss/train': 1.5965791940689087} 11/07/2021 12:50:06 - INFO - __main__ - Step 110899: {'lr': 8.128424600194062e-05, 'samples': 21292608, 'steps': 110898, 'loss/train': 0.898344099521637} 11/07/2021 12:50:06 - INFO - __main__ - Step 110900: {'lr': 8.128032996496934e-05, 'samples': 21292800, 'steps': 110899, 'loss/train': 0.9828882813453674} 11/07/2021 12:50:07 - INFO - __main__ - Step 110901: {'lr': 8.127641400402053e-05, 'samples': 21292992, 'steps': 110900, 'loss/train': 1.7934998273849487} 11/07/2021 12:50:08 - INFO - __main__ - Step 110902: {'lr': 8.127249811909598e-05, 'samples': 21293184, 'steps': 110901, 'loss/train': 3.0099103450775146} 11/07/2021 12:50:08 - INFO - __main__ - Step 110903: {'lr': 8.126858231019742e-05, 'samples': 21293376, 'steps': 110902, 'loss/train': 1.3849751949310303} 11/07/2021 12:50:09 - INFO - __main__ - Step 110904: {'lr': 8.126466657732665e-05, 'samples': 21293568, 'steps': 110903, 'loss/train': 1.1593751907348633} 11/07/2021 12:50:09 - INFO - __main__ - Step 110905: {'lr': 8.126075092048541e-05, 'samples': 21293760, 'steps': 110904, 'loss/train': 1.4588154554367065} 11/07/2021 12:50:09 - INFO - __main__ - Step 110906: {'lr': 8.125683533967548e-05, 'samples': 21293952, 'steps': 110905, 'loss/train': 1.5388537645339966} 11/07/2021 12:50:10 - INFO - __main__ - Step 110907: {'lr': 8.12529198348986e-05, 'samples': 21294144, 'steps': 110906, 'loss/train': 1.2644188404083252} 11/07/2021 12:50:11 - INFO - __main__ - Step 110908: {'lr': 8.124900440615657e-05, 'samples': 21294336, 'steps': 110907, 'loss/train': 1.5934194326400757} 11/07/2021 12:50:11 - INFO - __main__ - Step 110909: {'lr': 8.124508905345112e-05, 'samples': 21294528, 'steps': 110908, 'loss/train': 1.6780472993850708} 11/07/2021 12:50:11 - INFO - __main__ - Step 110910: {'lr': 8.124117377678405e-05, 'samples': 21294720, 'steps': 110909, 'loss/train': 1.3359873294830322} 11/07/2021 12:50:12 - INFO - __main__ - Step 110911: {'lr': 8.123725857615716e-05, 'samples': 21294912, 'steps': 110910, 'loss/train': 1.3226256370544434} 11/07/2021 12:50:12 - INFO - __main__ - Step 110912: {'lr': 8.12333434515721e-05, 'samples': 21295104, 'steps': 110911, 'loss/train': 0.683992326259613} 11/07/2021 12:50:13 - INFO - __main__ - Step 110913: {'lr': 8.122942840303067e-05, 'samples': 21295296, 'steps': 110912, 'loss/train': 0.577841579914093} 11/07/2021 12:50:14 - INFO - __main__ - Step 110914: {'lr': 8.122551343053467e-05, 'samples': 21295488, 'steps': 110913, 'loss/train': 1.142346739768982} 11/07/2021 12:50:14 - INFO - __main__ - Step 110915: {'lr': 8.122159853408583e-05, 'samples': 21295680, 'steps': 110914, 'loss/train': 1.355435848236084} 11/07/2021 12:50:14 - INFO - __main__ - Step 110916: {'lr': 8.121768371368593e-05, 'samples': 21295872, 'steps': 110915, 'loss/train': 1.4151089191436768} 11/07/2021 12:50:15 - INFO - __main__ - Step 110917: {'lr': 8.121376896933677e-05, 'samples': 21296064, 'steps': 110916, 'loss/train': 1.458957314491272} 11/07/2021 12:50:16 - INFO - __main__ - Step 110918: {'lr': 8.120985430104005e-05, 'samples': 21296256, 'steps': 110917, 'loss/train': 2.1727962493896484} 11/07/2021 12:50:17 - INFO - __main__ - Step 110919: {'lr': 8.120593970879758e-05, 'samples': 21296448, 'steps': 110918, 'loss/train': 1.4213310480117798} 11/07/2021 12:50:17 - INFO - __main__ - Step 110920: {'lr': 8.120202519261111e-05, 'samples': 21296640, 'steps': 110919, 'loss/train': 0.15861324965953827} 11/07/2021 12:50:17 - INFO - __main__ - Step 110921: {'lr': 8.11981107524824e-05, 'samples': 21296832, 'steps': 110920, 'loss/train': 1.3706010580062866} 11/07/2021 12:50:18 - INFO - __main__ - Step 110922: {'lr': 8.119419638841322e-05, 'samples': 21297024, 'steps': 110921, 'loss/train': 1.5108870267868042} 11/07/2021 12:50:18 - INFO - __main__ - Step 110923: {'lr': 8.119028210040533e-05, 'samples': 21297216, 'steps': 110922, 'loss/train': 1.2280018329620361} 11/07/2021 12:50:19 - INFO - __main__ - Step 110924: {'lr': 8.118636788846057e-05, 'samples': 21297408, 'steps': 110923, 'loss/train': 0.5917441844940186} 11/07/2021 12:50:20 - INFO - __main__ - Step 110925: {'lr': 8.118245375258055e-05, 'samples': 21297600, 'steps': 110924, 'loss/train': 1.1698813438415527} 11/07/2021 12:50:20 - INFO - __main__ - Step 110926: {'lr': 8.11785396927671e-05, 'samples': 21297792, 'steps': 110925, 'loss/train': 1.537506341934204} 11/07/2021 12:50:20 - INFO - __main__ - Step 110927: {'lr': 8.117462570902201e-05, 'samples': 21297984, 'steps': 110926, 'loss/train': 0.9638217091560364} 11/07/2021 12:50:21 - INFO - __main__ - Step 110928: {'lr': 8.117071180134703e-05, 'samples': 21298176, 'steps': 110927, 'loss/train': 1.0796406269073486} 11/07/2021 12:50:21 - INFO - __main__ - Step 110929: {'lr': 8.116679796974389e-05, 'samples': 21298368, 'steps': 110928, 'loss/train': 1.796680212020874} 11/07/2021 12:50:22 - INFO - __main__ - Step 110930: {'lr': 8.116288421421441e-05, 'samples': 21298560, 'steps': 110929, 'loss/train': 0.5355945229530334} 11/07/2021 12:50:23 - INFO - __main__ - Step 110931: {'lr': 8.115897053476034e-05, 'samples': 21298752, 'steps': 110930, 'loss/train': 0.9235758781433105} 11/07/2021 12:50:23 - INFO - __main__ - Step 110932: {'lr': 8.115505693138341e-05, 'samples': 21298944, 'steps': 110931, 'loss/train': 1.2739791870117188} 11/07/2021 12:50:23 - INFO - __main__ - Step 110933: {'lr': 8.115114340408541e-05, 'samples': 21299136, 'steps': 110932, 'loss/train': 1.0382658243179321} 11/07/2021 12:50:24 - INFO - __main__ - Step 110934: {'lr': 8.114722995286811e-05, 'samples': 21299328, 'steps': 110933, 'loss/train': 1.3495756387710571} 11/07/2021 12:50:25 - INFO - __main__ - Step 110935: {'lr': 8.114331657773327e-05, 'samples': 21299520, 'steps': 110934, 'loss/train': 1.5063982009887695} 11/07/2021 12:50:25 - INFO - __main__ - Step 110936: {'lr': 8.113940327868263e-05, 'samples': 21299712, 'steps': 110935, 'loss/train': 1.2438709735870361} 11/07/2021 12:50:26 - INFO - __main__ - Step 110937: {'lr': 8.1135490055718e-05, 'samples': 21299904, 'steps': 110936, 'loss/train': 1.277685284614563} 11/07/2021 12:50:26 - INFO - __main__ - Step 110938: {'lr': 8.113157690884115e-05, 'samples': 21300096, 'steps': 110937, 'loss/train': 0.9588587880134583} 11/07/2021 12:50:26 - INFO - __main__ - Step 110939: {'lr': 8.112766383805373e-05, 'samples': 21300288, 'steps': 110938, 'loss/train': 1.5492407083511353} 11/07/2021 12:50:27 - INFO - __main__ - Step 110940: {'lr': 8.11237508433576e-05, 'samples': 21300480, 'steps': 110939, 'loss/train': 1.2334016561508179} 11/07/2021 12:50:28 - INFO - __main__ - Step 110941: {'lr': 8.111983792475449e-05, 'samples': 21300672, 'steps': 110940, 'loss/train': 1.612709879875183} 11/07/2021 12:50:28 - INFO - __main__ - Step 110942: {'lr': 8.111592508224618e-05, 'samples': 21300864, 'steps': 110941, 'loss/train': 1.4424140453338623} 11/07/2021 12:50:28 - INFO - __main__ - Step 110943: {'lr': 8.111201231583443e-05, 'samples': 21301056, 'steps': 110942, 'loss/train': 1.140516757965088} 11/07/2021 12:50:29 - INFO - __main__ - Step 110944: {'lr': 8.110809962552099e-05, 'samples': 21301248, 'steps': 110943, 'loss/train': 1.3859680891036987} 11/07/2021 12:50:30 - INFO - __main__ - Step 110945: {'lr': 8.110418701130765e-05, 'samples': 21301440, 'steps': 110944, 'loss/train': 1.395917296409607} 11/07/2021 12:50:30 - INFO - __main__ - Step 110946: {'lr': 8.110027447319614e-05, 'samples': 21301632, 'steps': 110945, 'loss/train': 1.4579615592956543} 11/07/2021 12:50:31 - INFO - __main__ - Step 110947: {'lr': 8.109636201118825e-05, 'samples': 21301824, 'steps': 110946, 'loss/train': 1.0973877906799316} 11/07/2021 12:50:31 - INFO - __main__ - Step 110948: {'lr': 8.109244962528575e-05, 'samples': 21302016, 'steps': 110947, 'loss/train': 1.4419341087341309} 11/07/2021 12:50:31 - INFO - __main__ - Step 110949: {'lr': 8.108853731549035e-05, 'samples': 21302208, 'steps': 110948, 'loss/train': 1.6764649152755737} 11/07/2021 12:50:32 - INFO - __main__ - Step 110950: {'lr': 8.108462508180386e-05, 'samples': 21302400, 'steps': 110949, 'loss/train': 1.3726950883865356} 11/07/2021 12:50:33 - INFO - __main__ - Step 110951: {'lr': 8.108071292422815e-05, 'samples': 21302592, 'steps': 110950, 'loss/train': 1.3758715391159058} 11/07/2021 12:50:33 - INFO - __main__ - Step 110952: {'lr': 8.107680084276472e-05, 'samples': 21302784, 'steps': 110951, 'loss/train': 3.3622450828552246} 11/07/2021 12:50:33 - INFO - __main__ - Step 110953: {'lr': 8.10728888374155e-05, 'samples': 21302976, 'steps': 110952, 'loss/train': 1.5194624662399292} 11/07/2021 12:50:34 - INFO - __main__ - Step 110954: {'lr': 8.106897690818227e-05, 'samples': 21303168, 'steps': 110953, 'loss/train': 1.3464140892028809} 11/07/2021 12:50:34 - INFO - __main__ - Step 110955: {'lr': 8.106506505506672e-05, 'samples': 21303360, 'steps': 110954, 'loss/train': 1.7576048374176025} 11/07/2021 12:50:35 - INFO - __main__ - Step 110956: {'lr': 8.106115327807064e-05, 'samples': 21303552, 'steps': 110955, 'loss/train': 1.2295750379562378} 11/07/2021 12:50:35 - INFO - __main__ - Step 110957: {'lr': 8.10572415771958e-05, 'samples': 21303744, 'steps': 110956, 'loss/train': 1.0951952934265137} 11/07/2021 12:50:36 - INFO - __main__ - Step 110958: {'lr': 8.105332995244396e-05, 'samples': 21303936, 'steps': 110957, 'loss/train': 1.1275765895843506} 11/07/2021 12:50:36 - INFO - __main__ - Step 110959: {'lr': 8.104941840381689e-05, 'samples': 21304128, 'steps': 110958, 'loss/train': 1.6977903842926025} 11/07/2021 12:50:36 - INFO - __main__ - Step 110960: {'lr': 8.104550693131635e-05, 'samples': 21304320, 'steps': 110959, 'loss/train': 1.1481331586837769} 11/07/2021 12:50:38 - INFO - __main__ - Step 110961: {'lr': 8.104159553494408e-05, 'samples': 21304512, 'steps': 110960, 'loss/train': 1.3239096403121948} 11/07/2021 12:50:38 - INFO - __main__ - Step 110962: {'lr': 8.103768421470187e-05, 'samples': 21304704, 'steps': 110961, 'loss/train': 2.6404597759246826} 11/07/2021 12:50:38 - INFO - __main__ - Step 110963: {'lr': 8.103377297059147e-05, 'samples': 21304896, 'steps': 110962, 'loss/train': 1.5804122686386108} 11/07/2021 12:50:39 - INFO - __main__ - Step 110964: {'lr': 8.102986180261473e-05, 'samples': 21305088, 'steps': 110963, 'loss/train': 1.52670156955719} 11/07/2021 12:50:39 - INFO - __main__ - Step 110965: {'lr': 8.102595071077323e-05, 'samples': 21305280, 'steps': 110964, 'loss/train': 1.3874105215072632} 11/07/2021 12:50:40 - INFO - __main__ - Step 110966: {'lr': 8.102203969506886e-05, 'samples': 21305472, 'steps': 110965, 'loss/train': 1.6212595701217651} 11/07/2021 12:50:40 - INFO - __main__ - Step 110967: {'lr': 8.101812875550332e-05, 'samples': 21305664, 'steps': 110966, 'loss/train': 1.3159419298171997} 11/07/2021 12:50:41 - INFO - __main__ - Step 110968: {'lr': 8.101421789207841e-05, 'samples': 21305856, 'steps': 110967, 'loss/train': 1.1159569025039673} 11/07/2021 12:50:41 - INFO - __main__ - Step 110969: {'lr': 8.10103071047959e-05, 'samples': 21306048, 'steps': 110968, 'loss/train': 1.6480275392532349} 11/07/2021 12:50:41 - INFO - __main__ - Step 110970: {'lr': 8.100639639365754e-05, 'samples': 21306240, 'steps': 110969, 'loss/train': 1.5446703433990479} 11/07/2021 12:50:42 - INFO - __main__ - Step 110971: {'lr': 8.100248575866506e-05, 'samples': 21306432, 'steps': 110970, 'loss/train': 0.8860666155815125} 11/07/2021 12:50:43 - INFO - __main__ - Step 110972: {'lr': 8.099857519982027e-05, 'samples': 21306624, 'steps': 110971, 'loss/train': 1.5820342302322388} 11/07/2021 12:50:43 - INFO - __main__ - Step 110973: {'lr': 8.099466471712491e-05, 'samples': 21306816, 'steps': 110972, 'loss/train': 1.2150548696517944} 11/07/2021 12:50:44 - INFO - __main__ - Step 110974: {'lr': 8.099075431058075e-05, 'samples': 21307008, 'steps': 110973, 'loss/train': 0.4305111765861511} 11/07/2021 12:50:44 - INFO - __main__ - Step 110975: {'lr': 8.098684398018965e-05, 'samples': 21307200, 'steps': 110974, 'loss/train': 1.6216679811477661} 11/07/2021 12:50:44 - INFO - __main__ - Step 110976: {'lr': 8.098293372595317e-05, 'samples': 21307392, 'steps': 110975, 'loss/train': 1.2145148515701294} 11/07/2021 12:50:45 - INFO - __main__ - Step 110977: {'lr': 8.097902354787318e-05, 'samples': 21307584, 'steps': 110976, 'loss/train': 1.3889455795288086} 11/07/2021 12:50:46 - INFO - __main__ - Step 110978: {'lr': 8.097511344595141e-05, 'samples': 21307776, 'steps': 110977, 'loss/train': 1.1696065664291382} 11/07/2021 12:50:46 - INFO - __main__ - Step 110979: {'lr': 8.097120342018965e-05, 'samples': 21307968, 'steps': 110978, 'loss/train': 1.7739200592041016} 11/07/2021 12:50:46 - INFO - __main__ - Step 110980: {'lr': 8.096729347058968e-05, 'samples': 21308160, 'steps': 110979, 'loss/train': 1.1030828952789307} 11/07/2021 12:50:47 - INFO - __main__ - Step 110981: {'lr': 8.096338359715322e-05, 'samples': 21308352, 'steps': 110980, 'loss/train': 1.2593814134597778} 11/07/2021 12:50:48 - INFO - __main__ - Step 110982: {'lr': 8.095947379988208e-05, 'samples': 21308544, 'steps': 110981, 'loss/train': 1.68683922290802} 11/07/2021 12:50:48 - INFO - __main__ - Step 110983: {'lr': 8.095556407877796e-05, 'samples': 21308736, 'steps': 110982, 'loss/train': 1.572864055633545} 11/07/2021 12:50:48 - INFO - __main__ - Step 110984: {'lr': 8.095165443384267e-05, 'samples': 21308928, 'steps': 110983, 'loss/train': 1.3173456192016602} 11/07/2021 12:50:49 - INFO - __main__ - Step 110985: {'lr': 8.094774486507794e-05, 'samples': 21309120, 'steps': 110984, 'loss/train': 1.3166110515594482} 11/07/2021 12:50:49 - INFO - __main__ - Step 110986: {'lr': 8.094383537248565e-05, 'samples': 21309312, 'steps': 110985, 'loss/train': 1.2020694017410278} 11/07/2021 12:50:50 - INFO - __main__ - Step 110987: {'lr': 8.093992595606736e-05, 'samples': 21309504, 'steps': 110986, 'loss/train': 1.3478615283966064} 11/07/2021 12:50:50 - INFO - __main__ - Step 110988: {'lr': 8.093601661582495e-05, 'samples': 21309696, 'steps': 110987, 'loss/train': 1.0895745754241943} 11/07/2021 12:50:51 - INFO - __main__ - Step 110989: {'lr': 8.093210735176015e-05, 'samples': 21309888, 'steps': 110988, 'loss/train': 1.0765161514282227} 11/07/2021 12:50:51 - INFO - __main__ - Step 110990: {'lr': 8.092819816387472e-05, 'samples': 21310080, 'steps': 110989, 'loss/train': 1.4890363216400146} 11/07/2021 12:50:52 - INFO - __main__ - Step 110991: {'lr': 8.092428905217048e-05, 'samples': 21310272, 'steps': 110990, 'loss/train': 1.4653630256652832} 11/07/2021 12:50:53 - INFO - __main__ - Step 110992: {'lr': 8.092038001664912e-05, 'samples': 21310464, 'steps': 110991, 'loss/train': 0.9919858574867249} 11/07/2021 12:50:53 - INFO - __main__ - Step 110993: {'lr': 8.09164710573124e-05, 'samples': 21310656, 'steps': 110992, 'loss/train': 1.167583703994751} 11/07/2021 12:50:53 - INFO - __main__ - Step 110994: {'lr': 8.091256217416215e-05, 'samples': 21310848, 'steps': 110993, 'loss/train': 0.36280032992362976} 11/07/2021 12:50:54 - INFO - __main__ - Step 110995: {'lr': 8.090865336720007e-05, 'samples': 21311040, 'steps': 110994, 'loss/train': 1.070132851600647} 11/07/2021 12:50:54 - INFO - __main__ - Step 110996: {'lr': 8.090474463642794e-05, 'samples': 21311232, 'steps': 110995, 'loss/train': 1.3867714405059814} 11/07/2021 12:50:55 - INFO - __main__ - Step 110997: {'lr': 8.09008359818476e-05, 'samples': 21311424, 'steps': 110996, 'loss/train': 1.1835918426513672} 11/07/2021 12:50:55 - INFO - __main__ - Step 110998: {'lr': 8.089692740346066e-05, 'samples': 21311616, 'steps': 110997, 'loss/train': 1.1961264610290527} 11/07/2021 12:50:56 - INFO - __main__ - Step 110999: {'lr': 8.089301890126896e-05, 'samples': 21311808, 'steps': 110998, 'loss/train': 1.4102692604064941} 11/07/2021 12:50:56 - INFO - __main__ - Step 111000: {'lr': 8.088911047527425e-05, 'samples': 21312000, 'steps': 110999, 'loss/train': 0.9315155744552612} 11/07/2021 12:50:56 - INFO - __main__ - Step 111001: {'lr': 8.088520212547831e-05, 'samples': 21312192, 'steps': 111000, 'loss/train': 1.4106382131576538} 11/07/2021 12:50:57 - INFO - __main__ - Step 111002: {'lr': 8.088129385188289e-05, 'samples': 21312384, 'steps': 111001, 'loss/train': 1.3119646310806274} 11/07/2021 12:50:58 - INFO - __main__ - Step 111003: {'lr': 8.087738565448974e-05, 'samples': 21312576, 'steps': 111002, 'loss/train': 1.395928978919983} 11/07/2021 12:50:58 - INFO - __main__ - Step 111004: {'lr': 8.087347753330063e-05, 'samples': 21312768, 'steps': 111003, 'loss/train': 1.657660961151123} 11/07/2021 12:50:59 - INFO - __main__ - Step 111005: {'lr': 8.08695694883173e-05, 'samples': 21312960, 'steps': 111004, 'loss/train': 0.6089848875999451} 11/07/2021 12:50:59 - INFO - __main__ - Step 111006: {'lr': 8.086566151954156e-05, 'samples': 21313152, 'steps': 111005, 'loss/train': 1.2631795406341553} 11/07/2021 12:50:59 - INFO - __main__ - Step 111007: {'lr': 8.086175362697513e-05, 'samples': 21313344, 'steps': 111006, 'loss/train': 1.2402974367141724} 11/07/2021 12:51:00 - INFO - __main__ - Step 111008: {'lr': 8.085784581061987e-05, 'samples': 21313536, 'steps': 111007, 'loss/train': 1.282853126525879} 11/07/2021 12:51:01 - INFO - __main__ - Step 111009: {'lr': 8.085393807047737e-05, 'samples': 21313728, 'steps': 111008, 'loss/train': 1.0939797163009644} 11/07/2021 12:51:01 - INFO - __main__ - Step 111010: {'lr': 8.085003040654948e-05, 'samples': 21313920, 'steps': 111009, 'loss/train': 1.435384750366211} 11/07/2021 12:51:01 - INFO - __main__ - Step 111011: {'lr': 8.084612281883796e-05, 'samples': 21314112, 'steps': 111010, 'loss/train': 1.2304857969284058} 11/07/2021 12:51:02 - INFO - __main__ - Step 111012: {'lr': 8.084221530734457e-05, 'samples': 21314304, 'steps': 111011, 'loss/train': 1.1946815252304077} 11/07/2021 12:51:03 - INFO - __main__ - Step 111013: {'lr': 8.083830787207106e-05, 'samples': 21314496, 'steps': 111012, 'loss/train': 1.0566662549972534} 11/07/2021 12:51:03 - INFO - __main__ - Step 111014: {'lr': 8.083440051301919e-05, 'samples': 21314688, 'steps': 111013, 'loss/train': 1.429646611213684} 11/07/2021 12:51:03 - INFO - __main__ - Step 111015: {'lr': 8.083049323019074e-05, 'samples': 21314880, 'steps': 111014, 'loss/train': 1.0755043029785156} 11/07/2021 12:51:04 - INFO - __main__ - Step 111016: {'lr': 8.082658602358745e-05, 'samples': 21315072, 'steps': 111015, 'loss/train': 1.2924847602844238} 11/07/2021 12:51:04 - INFO - __main__ - Step 111017: {'lr': 8.08226788932111e-05, 'samples': 21315264, 'steps': 111016, 'loss/train': 1.079609751701355} 11/07/2021 12:51:05 - INFO - __main__ - Step 111018: {'lr': 8.081877183906342e-05, 'samples': 21315456, 'steps': 111017, 'loss/train': 1.2294102907180786} 11/07/2021 12:51:06 - INFO - __main__ - Step 111019: {'lr': 8.081486486114631e-05, 'samples': 21315648, 'steps': 111018, 'loss/train': 1.1227023601531982} 11/07/2021 12:51:06 - INFO - __main__ - Step 111020: {'lr': 8.08109579594613e-05, 'samples': 21315840, 'steps': 111019, 'loss/train': 1.4605190753936768} 11/07/2021 12:51:06 - INFO - __main__ - Step 111021: {'lr': 8.080705113401026e-05, 'samples': 21316032, 'steps': 111020, 'loss/train': 0.8219882845878601} 11/07/2021 12:51:07 - INFO - __main__ - Step 111022: {'lr': 8.080314438479496e-05, 'samples': 21316224, 'steps': 111021, 'loss/train': 1.2730602025985718} 11/07/2021 12:51:08 - INFO - __main__ - Step 111023: {'lr': 8.079923771181716e-05, 'samples': 21316416, 'steps': 111022, 'loss/train': 1.2585300207138062} 11/07/2021 12:51:08 - INFO - __main__ - Step 111024: {'lr': 8.079533111507861e-05, 'samples': 21316608, 'steps': 111023, 'loss/train': 1.264156460762024} 11/07/2021 12:51:08 - INFO - __main__ - Step 111025: {'lr': 8.079142459458106e-05, 'samples': 21316800, 'steps': 111024, 'loss/train': 0.8711103796958923} 11/07/2021 12:51:09 - INFO - __main__ - Step 111026: {'lr': 8.078751815032629e-05, 'samples': 21316992, 'steps': 111025, 'loss/train': 1.5565332174301147} 11/07/2021 12:51:09 - INFO - __main__ - Step 111027: {'lr': 8.078361178231605e-05, 'samples': 21317184, 'steps': 111026, 'loss/train': 0.9458386301994324} 11/07/2021 12:51:10 - INFO - __main__ - Step 111028: {'lr': 8.07797054905521e-05, 'samples': 21317376, 'steps': 111027, 'loss/train': 1.1349154710769653} 11/07/2021 12:51:10 - INFO - __main__ - Step 111029: {'lr': 8.077579927503622e-05, 'samples': 21317568, 'steps': 111028, 'loss/train': 1.547726035118103} 11/07/2021 12:51:11 - INFO - __main__ - Step 111030: {'lr': 8.077189313577016e-05, 'samples': 21317760, 'steps': 111029, 'loss/train': 1.4653407335281372} 11/07/2021 12:51:11 - INFO - __main__ - Step 111031: {'lr': 8.076798707275565e-05, 'samples': 21317952, 'steps': 111030, 'loss/train': 1.157876968383789} 11/07/2021 12:51:11 - INFO - __main__ - Step 111032: {'lr': 8.076408108599456e-05, 'samples': 21318144, 'steps': 111031, 'loss/train': 1.4445749521255493} 11/07/2021 12:51:12 - INFO - __main__ - Step 111033: {'lr': 8.07601751754885e-05, 'samples': 21318336, 'steps': 111032, 'loss/train': 1.8768936395645142} 11/07/2021 12:51:13 - INFO - __main__ - Step 111034: {'lr': 8.075626934123928e-05, 'samples': 21318528, 'steps': 111033, 'loss/train': 0.621311604976654} 11/07/2021 12:51:13 - INFO - __main__ - Step 111035: {'lr': 8.075236358324866e-05, 'samples': 21318720, 'steps': 111034, 'loss/train': 1.5963706970214844} 11/07/2021 12:51:14 - INFO - __main__ - Step 111036: {'lr': 8.074845790151844e-05, 'samples': 21318912, 'steps': 111035, 'loss/train': 0.8327434659004211} 11/07/2021 12:51:14 - INFO - __main__ - Step 111037: {'lr': 8.074455229605032e-05, 'samples': 21319104, 'steps': 111036, 'loss/train': 1.047060489654541} 11/07/2021 12:51:14 - INFO - __main__ - Step 111038: {'lr': 8.074064676684611e-05, 'samples': 21319296, 'steps': 111037, 'loss/train': 1.5055397748947144} 11/07/2021 12:51:15 - INFO - __main__ - Step 111039: {'lr': 8.073674131390757e-05, 'samples': 21319488, 'steps': 111038, 'loss/train': 0.9274377226829529} 11/07/2021 12:51:16 - INFO - __main__ - Step 111040: {'lr': 8.073283593723644e-05, 'samples': 21319680, 'steps': 111039, 'loss/train': 2.2902960777282715} 11/07/2021 12:51:16 - INFO - __main__ - Step 111041: {'lr': 8.072893063683446e-05, 'samples': 21319872, 'steps': 111040, 'loss/train': 1.3881915807724} 11/07/2021 12:51:17 - INFO - __main__ - Step 111042: {'lr': 8.07250254127034e-05, 'samples': 21320064, 'steps': 111041, 'loss/train': 0.5956548452377319} 11/07/2021 12:51:17 - INFO - __main__ - Step 111043: {'lr': 8.072112026484507e-05, 'samples': 21320256, 'steps': 111042, 'loss/train': 0.8849467039108276} 11/07/2021 12:51:17 - INFO - __main__ - Step 111044: {'lr': 8.071721519326117e-05, 'samples': 21320448, 'steps': 111043, 'loss/train': 1.2154555320739746} 11/07/2021 12:51:18 - INFO - __main__ - Step 111045: {'lr': 8.071331019795358e-05, 'samples': 21320640, 'steps': 111044, 'loss/train': 1.5947798490524292} 11/07/2021 12:51:19 - INFO - __main__ - Step 111046: {'lr': 8.070940527892387e-05, 'samples': 21320832, 'steps': 111045, 'loss/train': 2.1670637130737305} 11/07/2021 12:51:19 - INFO - __main__ - Step 111047: {'lr': 8.070550043617386e-05, 'samples': 21321024, 'steps': 111046, 'loss/train': 1.560269832611084} 11/07/2021 12:51:19 - INFO - __main__ - Step 111048: {'lr': 8.070159566970539e-05, 'samples': 21321216, 'steps': 111047, 'loss/train': 1.5029510259628296} 11/07/2021 12:51:20 - INFO - __main__ - Step 111049: {'lr': 8.069769097952012e-05, 'samples': 21321408, 'steps': 111048, 'loss/train': 1.0334217548370361} 11/07/2021 12:51:21 - INFO - __main__ - Step 111050: {'lr': 8.069378636561989e-05, 'samples': 21321600, 'steps': 111049, 'loss/train': 1.4724920988082886} 11/07/2021 12:51:21 - INFO - __main__ - Step 111051: {'lr': 8.068988182800641e-05, 'samples': 21321792, 'steps': 111050, 'loss/train': 1.6003271341323853} 11/07/2021 12:51:21 - INFO - __main__ - Step 111052: {'lr': 8.068597736668149e-05, 'samples': 21321984, 'steps': 111051, 'loss/train': 1.2131717205047607} 11/07/2021 12:51:22 - INFO - __main__ - Step 111053: {'lr': 8.068207298164682e-05, 'samples': 21322176, 'steps': 111052, 'loss/train': 2.0060791969299316} 11/07/2021 12:51:22 - INFO - __main__ - Step 111054: {'lr': 8.06781686729042e-05, 'samples': 21322368, 'steps': 111053, 'loss/train': 1.3570799827575684} 11/07/2021 12:51:23 - INFO - __main__ - Step 111055: {'lr': 8.06742644404554e-05, 'samples': 21322560, 'steps': 111054, 'loss/train': 1.4784481525421143} 11/07/2021 12:51:24 - INFO - __main__ - Step 111056: {'lr': 8.067036028430213e-05, 'samples': 21322752, 'steps': 111055, 'loss/train': 1.0487003326416016} 11/07/2021 12:51:24 - INFO - __main__ - Step 111057: {'lr': 8.066645620444621e-05, 'samples': 21322944, 'steps': 111056, 'loss/train': 0.3110053539276123} 11/07/2021 12:51:24 - INFO - __main__ - Step 111058: {'lr': 8.066255220088939e-05, 'samples': 21323136, 'steps': 111057, 'loss/train': 1.3858226537704468} 11/07/2021 12:51:25 - INFO - __main__ - Step 111059: {'lr': 8.065864827363345e-05, 'samples': 21323328, 'steps': 111058, 'loss/train': 1.4215298891067505} 11/07/2021 12:51:26 - INFO - __main__ - Step 111060: {'lr': 8.065474442268006e-05, 'samples': 21323520, 'steps': 111059, 'loss/train': 1.2718360424041748} 11/07/2021 12:51:26 - INFO - __main__ - Step 111061: {'lr': 8.065084064803103e-05, 'samples': 21323712, 'steps': 111060, 'loss/train': 1.5050787925720215} 11/07/2021 12:51:26 - INFO - __main__ - Step 111062: {'lr': 8.064693694968808e-05, 'samples': 21323904, 'steps': 111061, 'loss/train': 1.0374767780303955} 11/07/2021 12:51:27 - INFO - __main__ - Step 111063: {'lr': 8.064303332765305e-05, 'samples': 21324096, 'steps': 111062, 'loss/train': 1.5157793760299683} 11/07/2021 12:51:27 - INFO - __main__ - Step 111064: {'lr': 8.063912978192763e-05, 'samples': 21324288, 'steps': 111063, 'loss/train': 1.3840687274932861} 11/07/2021 12:51:28 - INFO - __main__ - Step 111065: {'lr': 8.06352263125136e-05, 'samples': 21324480, 'steps': 111064, 'loss/train': 0.9933359622955322} 11/07/2021 12:51:28 - INFO - __main__ - Step 111066: {'lr': 8.063132291941275e-05, 'samples': 21324672, 'steps': 111065, 'loss/train': 0.5805865526199341} 11/07/2021 12:51:29 - INFO - __main__ - Step 111067: {'lr': 8.062741960262681e-05, 'samples': 21324864, 'steps': 111066, 'loss/train': 1.8048088550567627} 11/07/2021 12:51:29 - INFO - __main__ - Step 111068: {'lr': 8.062351636215753e-05, 'samples': 21325056, 'steps': 111067, 'loss/train': 1.1330629587173462} 11/07/2021 12:51:30 - INFO - __main__ - Step 111069: {'lr': 8.061961319800668e-05, 'samples': 21325248, 'steps': 111068, 'loss/train': 0.8294498920440674} 11/07/2021 12:51:31 - INFO - __main__ - Step 111070: {'lr': 8.061571011017601e-05, 'samples': 21325440, 'steps': 111069, 'loss/train': 1.3918254375457764} 11/07/2021 12:51:31 - INFO - __main__ - Step 111071: {'lr': 8.061180709866731e-05, 'samples': 21325632, 'steps': 111070, 'loss/train': 1.2048615217208862} 11/07/2021 12:51:31 - INFO - __main__ - Step 111072: {'lr': 8.060790416348238e-05, 'samples': 21325824, 'steps': 111071, 'loss/train': 1.4364691972732544} 11/07/2021 12:51:32 - INFO - __main__ - Step 111073: {'lr': 8.060400130462284e-05, 'samples': 21326016, 'steps': 111072, 'loss/train': 0.8556206822395325} 11/07/2021 12:51:32 - INFO - __main__ - Step 111074: {'lr': 8.060009852209052e-05, 'samples': 21326208, 'steps': 111073, 'loss/train': 1.4441920518875122} 11/07/2021 12:51:32 - INFO - __main__ - Step 111075: {'lr': 8.059619581588717e-05, 'samples': 21326400, 'steps': 111074, 'loss/train': 1.3586163520812988} 11/07/2021 12:51:33 - INFO - __main__ - Step 111076: {'lr': 8.059229318601457e-05, 'samples': 21326592, 'steps': 111075, 'loss/train': 1.5440046787261963} 11/07/2021 12:51:34 - INFO - __main__ - Step 111077: {'lr': 8.058839063247447e-05, 'samples': 21326784, 'steps': 111076, 'loss/train': 1.2939339876174927} 11/07/2021 12:51:34 - INFO - __main__ - Step 111078: {'lr': 8.058448815526865e-05, 'samples': 21326976, 'steps': 111077, 'loss/train': 1.5508348941802979} 11/07/2021 12:51:34 - INFO - __main__ - Step 111079: {'lr': 8.05805857543988e-05, 'samples': 21327168, 'steps': 111078, 'loss/train': 1.4089548587799072} 11/07/2021 12:51:35 - INFO - __main__ - Step 111080: {'lr': 8.057668342986673e-05, 'samples': 21327360, 'steps': 111079, 'loss/train': 1.092051386833191} 11/07/2021 12:51:36 - INFO - __main__ - Step 111081: {'lr': 8.057278118167421e-05, 'samples': 21327552, 'steps': 111080, 'loss/train': 5.707455158233643} 11/07/2021 12:51:36 - INFO - __main__ - Step 111082: {'lr': 8.056887900982298e-05, 'samples': 21327744, 'steps': 111081, 'loss/train': 0.820378303527832} 11/07/2021 12:51:37 - INFO - __main__ - Step 111083: {'lr': 8.05649769143148e-05, 'samples': 21327936, 'steps': 111082, 'loss/train': 1.5048640966415405} 11/07/2021 12:51:37 - INFO - __main__ - Step 111084: {'lr': 8.056107489515143e-05, 'samples': 21328128, 'steps': 111083, 'loss/train': 0.226120263338089} 11/07/2021 12:51:37 - INFO - __main__ - Step 111085: {'lr': 8.055717295233465e-05, 'samples': 21328320, 'steps': 111084, 'loss/train': 1.726914405822754} 11/07/2021 12:51:38 - INFO - __main__ - Step 111086: {'lr': 8.055327108586621e-05, 'samples': 21328512, 'steps': 111085, 'loss/train': 0.9880268573760986} 11/07/2021 12:51:39 - INFO - __main__ - Step 111087: {'lr': 8.054936929574782e-05, 'samples': 21328704, 'steps': 111086, 'loss/train': 1.221173644065857} 11/07/2021 12:51:39 - INFO - __main__ - Step 111088: {'lr': 8.054546758198125e-05, 'samples': 21328896, 'steps': 111087, 'loss/train': 1.5438820123672485} 11/07/2021 12:51:39 - INFO - __main__ - Step 111089: {'lr': 8.054156594456827e-05, 'samples': 21329088, 'steps': 111088, 'loss/train': 1.0463920831680298} 11/07/2021 12:51:40 - INFO - __main__ - Step 111090: {'lr': 8.053766438351068e-05, 'samples': 21329280, 'steps': 111089, 'loss/train': 1.3396165370941162} 11/07/2021 12:51:40 - INFO - __main__ - Step 111091: {'lr': 8.053376289881017e-05, 'samples': 21329472, 'steps': 111090, 'loss/train': 1.2139325141906738} 11/07/2021 12:51:41 - INFO - __main__ - Step 111092: {'lr': 8.052986149046854e-05, 'samples': 21329664, 'steps': 111091, 'loss/train': 1.5402233600616455} 11/07/2021 12:51:42 - INFO - __main__ - Step 111093: {'lr': 8.052596015848754e-05, 'samples': 21329856, 'steps': 111092, 'loss/train': 0.5381118655204773} 11/07/2021 12:51:42 - INFO - __main__ - Step 111094: {'lr': 8.052205890286892e-05, 'samples': 21330048, 'steps': 111093, 'loss/train': 1.8212244510650635} 11/07/2021 12:51:42 - INFO - __main__ - Step 111095: {'lr': 8.051815772361446e-05, 'samples': 21330240, 'steps': 111094, 'loss/train': 1.4155381917953491} 11/07/2021 12:51:43 - INFO - __main__ - Step 111096: {'lr': 8.05142566207259e-05, 'samples': 21330432, 'steps': 111095, 'loss/train': 1.329818606376648} 11/07/2021 12:51:44 - INFO - __main__ - Step 111097: {'lr': 8.0510355594205e-05, 'samples': 21330624, 'steps': 111096, 'loss/train': 0.7683250904083252} 11/07/2021 12:51:44 - INFO - __main__ - Step 111098: {'lr': 8.050645464405352e-05, 'samples': 21330816, 'steps': 111097, 'loss/train': 1.6211442947387695} 11/07/2021 12:51:45 - INFO - __main__ - Step 111099: {'lr': 8.050255377027327e-05, 'samples': 21331008, 'steps': 111098, 'loss/train': 1.5767403841018677} 11/07/2021 12:51:45 - INFO - __main__ - Step 111100: {'lr': 8.049865297286591e-05, 'samples': 21331200, 'steps': 111099, 'loss/train': 1.4846503734588623} 11/07/2021 12:51:45 - INFO - __main__ - Step 111101: {'lr': 8.049475225183323e-05, 'samples': 21331392, 'steps': 111100, 'loss/train': 1.1770650148391724} 11/07/2021 12:51:46 - INFO - __main__ - Step 111102: {'lr': 8.049085160717699e-05, 'samples': 21331584, 'steps': 111101, 'loss/train': 1.696389079093933} 11/07/2021 12:51:47 - INFO - __main__ - Step 111103: {'lr': 8.048695103889895e-05, 'samples': 21331776, 'steps': 111102, 'loss/train': 1.4038751125335693} 11/07/2021 12:51:47 - INFO - __main__ - Step 111104: {'lr': 8.048305054700089e-05, 'samples': 21331968, 'steps': 111103, 'loss/train': 0.32827287912368774} 11/07/2021 12:51:47 - INFO - __main__ - Step 111105: {'lr': 8.047915013148454e-05, 'samples': 21332160, 'steps': 111104, 'loss/train': 1.4318021535873413} 11/07/2021 12:51:48 - INFO - __main__ - Step 111106: {'lr': 8.047524979235168e-05, 'samples': 21332352, 'steps': 111105, 'loss/train': 0.8151130080223083} 11/07/2021 12:51:49 - INFO - __main__ - Step 111107: {'lr': 8.047134952960405e-05, 'samples': 21332544, 'steps': 111106, 'loss/train': 0.7908045053482056} 11/07/2021 12:51:50 - INFO - __main__ - Step 111108: {'lr': 8.046744934324343e-05, 'samples': 21332736, 'steps': 111107, 'loss/train': 1.0015619993209839} 11/07/2021 12:51:50 - INFO - __main__ - Step 111109: {'lr': 8.046354923327154e-05, 'samples': 21332928, 'steps': 111108, 'loss/train': 0.2788054943084717} 11/07/2021 12:51:50 - INFO - __main__ - Step 111110: {'lr': 8.045964919969018e-05, 'samples': 21333120, 'steps': 111109, 'loss/train': 0.16637833416461945} 11/07/2021 12:51:51 - INFO - __main__ - Step 111111: {'lr': 8.045574924250106e-05, 'samples': 21333312, 'steps': 111110, 'loss/train': 1.3438912630081177} 11/07/2021 12:51:52 - INFO - __main__ - Step 111112: {'lr': 8.045184936170596e-05, 'samples': 21333504, 'steps': 111111, 'loss/train': 1.5179730653762817} 11/07/2021 12:51:52 - INFO - __main__ - Step 111113: {'lr': 8.044794955730675e-05, 'samples': 21333696, 'steps': 111112, 'loss/train': 1.1917095184326172} 11/07/2021 12:51:52 - INFO - __main__ - Step 111114: {'lr': 8.044404982930498e-05, 'samples': 21333888, 'steps': 111113, 'loss/train': 1.488929271697998} 11/07/2021 12:51:53 - INFO - __main__ - Step 111115: {'lr': 8.04401501777025e-05, 'samples': 21334080, 'steps': 111114, 'loss/train': 1.1846626996994019} 11/07/2021 12:51:53 - INFO - __main__ - Step 111116: {'lr': 8.04362506025011e-05, 'samples': 21334272, 'steps': 111115, 'loss/train': 1.2010349035263062} 11/07/2021 12:51:54 - INFO - __main__ - Step 111117: {'lr': 8.043235110370247e-05, 'samples': 21334464, 'steps': 111116, 'loss/train': 1.5117970705032349} 11/07/2021 12:51:54 - INFO - __main__ - Step 111118: {'lr': 8.042845168130844e-05, 'samples': 21334656, 'steps': 111117, 'loss/train': 1.215187430381775} 11/07/2021 12:51:55 - INFO - __main__ - Step 111119: {'lr': 8.042455233532072e-05, 'samples': 21334848, 'steps': 111118, 'loss/train': 1.4232581853866577} 11/07/2021 12:51:55 - INFO - __main__ - Step 111120: {'lr': 8.042065306574106e-05, 'samples': 21335040, 'steps': 111119, 'loss/train': 1.212514877319336} 11/07/2021 12:51:56 - INFO - __main__ - Step 111121: {'lr': 8.041675387257127e-05, 'samples': 21335232, 'steps': 111120, 'loss/train': 2.358269214630127} 11/07/2021 12:51:56 - INFO - __main__ - Step 111122: {'lr': 8.041285475581306e-05, 'samples': 21335424, 'steps': 111121, 'loss/train': 1.2159689664840698} 11/07/2021 12:51:57 - INFO - __main__ - Step 111123: {'lr': 8.040895571546818e-05, 'samples': 21335616, 'steps': 111122, 'loss/train': 1.3345519304275513} 11/07/2021 12:51:57 - INFO - __main__ - Step 111124: {'lr': 8.040505675153845e-05, 'samples': 21335808, 'steps': 111123, 'loss/train': 1.1600792407989502} 11/07/2021 12:51:58 - INFO - __main__ - Step 111125: {'lr': 8.040115786402555e-05, 'samples': 21336000, 'steps': 111124, 'loss/train': 1.3555570840835571} 11/07/2021 12:51:58 - INFO - __main__ - Step 111126: {'lr': 8.039725905293138e-05, 'samples': 21336192, 'steps': 111125, 'loss/train': 1.4659183025360107} 11/07/2021 12:51:58 - INFO - __main__ - Step 111127: {'lr': 8.039336031825748e-05, 'samples': 21336384, 'steps': 111126, 'loss/train': 1.1615123748779297} 11/07/2021 12:52:01 - INFO - __main__ - Step 111128: {'lr': 8.038946166000575e-05, 'samples': 21336576, 'steps': 111127, 'loss/train': 1.2056297063827515} 11/07/2021 12:52:01 - INFO - __main__ - Step 111129: {'lr': 8.038556307817787e-05, 'samples': 21336768, 'steps': 111128, 'loss/train': 1.1732053756713867} 11/07/2021 12:52:02 - INFO - __main__ - Step 111130: {'lr': 8.038166457277565e-05, 'samples': 21336960, 'steps': 111129, 'loss/train': 1.7416839599609375} 11/07/2021 12:52:02 - INFO - __main__ - Step 111131: {'lr': 8.037776614380085e-05, 'samples': 21337152, 'steps': 111130, 'loss/train': 1.753572940826416} 11/07/2021 12:52:02 - INFO - __main__ - Step 111132: {'lr': 8.03738677912552e-05, 'samples': 21337344, 'steps': 111131, 'loss/train': 1.7480080127716064} 11/07/2021 12:52:03 - INFO - __main__ - Step 111133: {'lr': 8.036996951514047e-05, 'samples': 21337536, 'steps': 111132, 'loss/train': 1.706466555595398} 11/07/2021 12:52:03 - INFO - __main__ - Step 111134: {'lr': 8.036607131545839e-05, 'samples': 21337728, 'steps': 111133, 'loss/train': 1.2738986015319824} 11/07/2021 12:52:03 - INFO - __main__ - Step 111135: {'lr': 8.036217319221079e-05, 'samples': 21337920, 'steps': 111134, 'loss/train': 1.0853976011276245} 11/07/2021 12:52:04 - INFO - __main__ - Step 111136: {'lr': 8.035827514539934e-05, 'samples': 21338112, 'steps': 111135, 'loss/train': 1.0763407945632935} 11/07/2021 12:52:05 - INFO - __main__ - Step 111137: {'lr': 8.035437717502583e-05, 'samples': 21338304, 'steps': 111136, 'loss/train': 1.116508960723877} 11/07/2021 12:52:05 - INFO - __main__ - Step 111138: {'lr': 8.035047928109204e-05, 'samples': 21338496, 'steps': 111137, 'loss/train': 1.3657084703445435} 11/07/2021 12:52:05 - INFO - __main__ - Step 111139: {'lr': 8.034658146359977e-05, 'samples': 21338688, 'steps': 111138, 'loss/train': 1.6440058946609497} 11/07/2021 12:52:06 - INFO - __main__ - Step 111140: {'lr': 8.034268372255066e-05, 'samples': 21338880, 'steps': 111139, 'loss/train': 1.7365344762802124} 11/07/2021 12:52:07 - INFO - __main__ - Step 111141: {'lr': 8.03387860579465e-05, 'samples': 21339072, 'steps': 111140, 'loss/train': 1.3819223642349243} 11/07/2021 12:52:07 - INFO - __main__ - Step 111142: {'lr': 8.033488846978907e-05, 'samples': 21339264, 'steps': 111141, 'loss/train': 1.5127794742584229} 11/07/2021 12:52:08 - INFO - __main__ - Step 111143: {'lr': 8.03309909580801e-05, 'samples': 21339456, 'steps': 111142, 'loss/train': 1.29436194896698} 11/07/2021 12:52:08 - INFO - __main__ - Step 111144: {'lr': 8.032709352282138e-05, 'samples': 21339648, 'steps': 111143, 'loss/train': 1.6433627605438232} 11/07/2021 12:52:08 - INFO - __main__ - Step 111145: {'lr': 8.032319616401468e-05, 'samples': 21339840, 'steps': 111144, 'loss/train': 1.7846217155456543} 11/07/2021 12:52:09 - INFO - __main__ - Step 111146: {'lr': 8.03192988816617e-05, 'samples': 21340032, 'steps': 111145, 'loss/train': 1.799882173538208} 11/07/2021 12:52:10 - INFO - __main__ - Step 111147: {'lr': 8.031540167576423e-05, 'samples': 21340224, 'steps': 111146, 'loss/train': 1.5354814529418945} 11/07/2021 12:52:10 - INFO - __main__ - Step 111148: {'lr': 8.031150454632402e-05, 'samples': 21340416, 'steps': 111147, 'loss/train': 0.9997944235801697} 11/07/2021 12:52:10 - INFO - __main__ - Step 111149: {'lr': 8.030760749334285e-05, 'samples': 21340608, 'steps': 111148, 'loss/train': 1.3737599849700928} 11/07/2021 12:52:11 - INFO - __main__ - Step 111150: {'lr': 8.030371051682241e-05, 'samples': 21340800, 'steps': 111149, 'loss/train': 1.4054672718048096} 11/07/2021 12:52:12 - INFO - __main__ - Step 111151: {'lr': 8.029981361676455e-05, 'samples': 21340992, 'steps': 111150, 'loss/train': 1.532629370689392} 11/07/2021 12:52:12 - INFO - __main__ - Step 111152: {'lr': 8.029591679317094e-05, 'samples': 21341184, 'steps': 111151, 'loss/train': 1.2205544710159302} 11/07/2021 12:52:13 - INFO - __main__ - Step 111153: {'lr': 8.029202004604346e-05, 'samples': 21341376, 'steps': 111152, 'loss/train': 0.756764829158783} 11/07/2021 12:52:13 - INFO - __main__ - Step 111154: {'lr': 8.028812337538371e-05, 'samples': 21341568, 'steps': 111153, 'loss/train': 1.6272075176239014} 11/07/2021 12:52:13 - INFO - __main__ - Step 111155: {'lr': 8.028422678119348e-05, 'samples': 21341760, 'steps': 111154, 'loss/train': 1.36777925491333} 11/07/2021 12:52:14 - INFO - __main__ - Step 111156: {'lr': 8.028033026347459e-05, 'samples': 21341952, 'steps': 111155, 'loss/train': 1.1925925016403198} 11/07/2021 12:52:15 - INFO - __main__ - Step 111157: {'lr': 8.027643382222877e-05, 'samples': 21342144, 'steps': 111156, 'loss/train': 1.2820253372192383} 11/07/2021 12:52:15 - INFO - __main__ - Step 111158: {'lr': 8.027253745745775e-05, 'samples': 21342336, 'steps': 111157, 'loss/train': 1.4095544815063477} 11/07/2021 12:52:16 - INFO - __main__ - Step 111159: {'lr': 8.026864116916329e-05, 'samples': 21342528, 'steps': 111158, 'loss/train': 1.131885051727295} 11/07/2021 12:52:16 - INFO - __main__ - Step 111160: {'lr': 8.02647449573472e-05, 'samples': 21342720, 'steps': 111159, 'loss/train': 1.0158344507217407} 11/07/2021 12:52:17 - INFO - __main__ - Step 111161: {'lr': 8.026084882201118e-05, 'samples': 21342912, 'steps': 111160, 'loss/train': 1.180544376373291} 11/07/2021 12:52:17 - INFO - __main__ - Step 111162: {'lr': 8.025695276315701e-05, 'samples': 21343104, 'steps': 111161, 'loss/train': 1.4581660032272339} 11/07/2021 12:52:18 - INFO - __main__ - Step 111163: {'lr': 8.02530567807864e-05, 'samples': 21343296, 'steps': 111162, 'loss/train': 1.2480677366256714} 11/07/2021 12:52:18 - INFO - __main__ - Step 111164: {'lr': 8.024916087490119e-05, 'samples': 21343488, 'steps': 111163, 'loss/train': 1.2271920442581177} 11/07/2021 12:52:18 - INFO - __main__ - Step 111165: {'lr': 8.024526504550306e-05, 'samples': 21343680, 'steps': 111164, 'loss/train': 1.437137246131897} 11/07/2021 12:52:19 - INFO - __main__ - Step 111166: {'lr': 8.024136929259391e-05, 'samples': 21343872, 'steps': 111165, 'loss/train': 0.9988192915916443} 11/07/2021 12:52:20 - INFO - __main__ - Step 111167: {'lr': 8.023747361617526e-05, 'samples': 21344064, 'steps': 111166, 'loss/train': 0.9329038858413696} 11/07/2021 12:52:20 - INFO - __main__ - Step 111168: {'lr': 8.0233578016249e-05, 'samples': 21344256, 'steps': 111167, 'loss/train': 0.8724433183670044} 11/07/2021 12:52:21 - INFO - __main__ - Step 111169: {'lr': 8.022968249281687e-05, 'samples': 21344448, 'steps': 111168, 'loss/train': 1.4977355003356934} 11/07/2021 12:52:21 - INFO - __main__ - Step 111170: {'lr': 8.02257870458806e-05, 'samples': 21344640, 'steps': 111169, 'loss/train': 1.1357836723327637} 11/07/2021 12:52:21 - INFO - __main__ - Step 111171: {'lr': 8.0221891675442e-05, 'samples': 21344832, 'steps': 111170, 'loss/train': 1.3382654190063477} 11/07/2021 12:52:22 - INFO - __main__ - Step 111172: {'lr': 8.021799638150278e-05, 'samples': 21345024, 'steps': 111171, 'loss/train': 0.9873957633972168} 11/07/2021 12:52:23 - INFO - __main__ - Step 111173: {'lr': 8.021410116406474e-05, 'samples': 21345216, 'steps': 111172, 'loss/train': 1.4212526082992554} 11/07/2021 12:52:23 - INFO - __main__ - Step 111174: {'lr': 8.021020602312959e-05, 'samples': 21345408, 'steps': 111173, 'loss/train': 1.3685848712921143} 11/07/2021 12:52:23 - INFO - __main__ - Step 111175: {'lr': 8.02063109586991e-05, 'samples': 21345600, 'steps': 111174, 'loss/train': 1.505331039428711} 11/07/2021 12:52:24 - INFO - __main__ - Step 111176: {'lr': 8.020241597077501e-05, 'samples': 21345792, 'steps': 111175, 'loss/train': 1.4030920267105103} 11/07/2021 12:52:25 - INFO - __main__ - Step 111177: {'lr': 8.019852105935912e-05, 'samples': 21345984, 'steps': 111176, 'loss/train': 2.168952703475952} 11/07/2021 12:52:25 - INFO - __main__ - Step 111178: {'lr': 8.019462622445314e-05, 'samples': 21346176, 'steps': 111177, 'loss/train': 1.038351058959961} 11/07/2021 12:52:26 - INFO - __main__ - Step 111179: {'lr': 8.019073146605884e-05, 'samples': 21346368, 'steps': 111178, 'loss/train': 1.3126269578933716} 11/07/2021 12:52:26 - INFO - __main__ - Step 111180: {'lr': 8.018683678417806e-05, 'samples': 21346560, 'steps': 111179, 'loss/train': 0.1556587964296341} 11/07/2021 12:52:26 - INFO - __main__ - Step 111181: {'lr': 8.018294217881238e-05, 'samples': 21346752, 'steps': 111180, 'loss/train': 1.8321038484573364} 11/07/2021 12:52:27 - INFO - __main__ - Step 111182: {'lr': 8.017904764996367e-05, 'samples': 21346944, 'steps': 111181, 'loss/train': 1.304168462753296} 11/07/2021 12:52:28 - INFO - __main__ - Step 111183: {'lr': 8.017515319763363e-05, 'samples': 21347136, 'steps': 111182, 'loss/train': 1.5634918212890625} 11/07/2021 12:52:28 - INFO - __main__ - Step 111184: {'lr': 8.017125882182408e-05, 'samples': 21347328, 'steps': 111183, 'loss/train': 1.7324897050857544} 11/07/2021 12:52:28 - INFO - __main__ - Step 111185: {'lr': 8.01673645225367e-05, 'samples': 21347520, 'steps': 111184, 'loss/train': 0.8292839527130127} 11/07/2021 12:52:29 - INFO - __main__ - Step 111186: {'lr': 8.016347029977334e-05, 'samples': 21347712, 'steps': 111185, 'loss/train': 1.428707480430603} 11/07/2021 12:52:30 - INFO - __main__ - Step 111187: {'lr': 8.015957615353564e-05, 'samples': 21347904, 'steps': 111186, 'loss/train': 1.2424720525741577} 11/07/2021 12:52:30 - INFO - __main__ - Step 111188: {'lr': 8.015568208382545e-05, 'samples': 21348096, 'steps': 111187, 'loss/train': 1.4164167642593384} 11/07/2021 12:52:30 - INFO - __main__ - Step 111189: {'lr': 8.015178809064447e-05, 'samples': 21348288, 'steps': 111188, 'loss/train': 1.1602271795272827} 11/07/2021 12:52:31 - INFO - __main__ - Step 111190: {'lr': 8.014789417399448e-05, 'samples': 21348480, 'steps': 111189, 'loss/train': 1.3691517114639282} 11/07/2021 12:52:31 - INFO - __main__ - Step 111191: {'lr': 8.014400033387725e-05, 'samples': 21348672, 'steps': 111190, 'loss/train': 1.2248941659927368} 11/07/2021 12:52:32 - INFO - __main__ - Step 111192: {'lr': 8.01401065702945e-05, 'samples': 21348864, 'steps': 111191, 'loss/train': 1.3213863372802734} 11/07/2021 12:52:33 - INFO - __main__ - Step 111193: {'lr': 8.013621288324805e-05, 'samples': 21349056, 'steps': 111192, 'loss/train': 1.6075561046600342} 11/07/2021 12:52:33 - INFO - __main__ - Step 111194: {'lr': 8.013231927273954e-05, 'samples': 21349248, 'steps': 111193, 'loss/train': 1.6069155931472778} 11/07/2021 12:52:33 - INFO - __main__ - Step 111195: {'lr': 8.01284257387708e-05, 'samples': 21349440, 'steps': 111194, 'loss/train': 5.712595462799072} 11/07/2021 12:52:34 - INFO - __main__ - Step 111196: {'lr': 8.012453228134356e-05, 'samples': 21349632, 'steps': 111195, 'loss/train': 1.385443925857544} 11/07/2021 12:52:34 - INFO - __main__ - Step 111197: {'lr': 8.012063890045957e-05, 'samples': 21349824, 'steps': 111196, 'loss/train': 1.5625038146972656} 11/07/2021 12:52:35 - INFO - __main__ - Step 111198: {'lr': 8.011674559612061e-05, 'samples': 21350016, 'steps': 111197, 'loss/train': 1.4253301620483398} 11/07/2021 12:52:35 - INFO - __main__ - Step 111199: {'lr': 8.011285236832843e-05, 'samples': 21350208, 'steps': 111198, 'loss/train': 1.4239346981048584} 11/07/2021 12:52:36 - INFO - __main__ - Step 111200: {'lr': 8.010895921708478e-05, 'samples': 21350400, 'steps': 111199, 'loss/train': 1.3140829801559448} 11/07/2021 12:52:36 - INFO - __main__ - Step 111201: {'lr': 8.010506614239138e-05, 'samples': 21350592, 'steps': 111200, 'loss/train': 1.1885403394699097} 11/07/2021 12:52:36 - INFO - __main__ - Step 111202: {'lr': 8.010117314425006e-05, 'samples': 21350784, 'steps': 111201, 'loss/train': 1.5869858264923096} 11/07/2021 12:52:37 - INFO - __main__ - Step 111203: {'lr': 8.00972802226625e-05, 'samples': 21350976, 'steps': 111202, 'loss/train': 1.2563282251358032} 11/07/2021 12:52:38 - INFO - __main__ - Step 111204: {'lr': 8.009338737763047e-05, 'samples': 21351168, 'steps': 111203, 'loss/train': 1.0987976789474487} 11/07/2021 12:52:38 - INFO - __main__ - Step 111205: {'lr': 8.008949460915577e-05, 'samples': 21351360, 'steps': 111204, 'loss/train': 1.1961357593536377} 11/07/2021 12:52:38 - INFO - __main__ - Step 111206: {'lr': 8.008560191724012e-05, 'samples': 21351552, 'steps': 111205, 'loss/train': 1.0439471006393433} 11/07/2021 12:52:39 - INFO - __main__ - Step 111207: {'lr': 8.008170930188536e-05, 'samples': 21351744, 'steps': 111206, 'loss/train': 1.2888320684432983} 11/07/2021 12:52:39 - INFO - __main__ - Step 111208: {'lr': 8.007781676309306e-05, 'samples': 21351936, 'steps': 111207, 'loss/train': 1.3456848859786987} 11/07/2021 12:52:41 - INFO - __main__ - Step 111209: {'lr': 8.007392430086507e-05, 'samples': 21352128, 'steps': 111208, 'loss/train': 1.7668726444244385} 11/07/2021 12:52:41 - INFO - __main__ - Step 111210: {'lr': 8.007003191520316e-05, 'samples': 21352320, 'steps': 111209, 'loss/train': 1.5278609991073608} 11/07/2021 12:52:41 - INFO - __main__ - Step 111211: {'lr': 8.006613960610906e-05, 'samples': 21352512, 'steps': 111210, 'loss/train': 1.4724661111831665} 11/07/2021 12:52:42 - INFO - __main__ - Step 111212: {'lr': 8.006224737358455e-05, 'samples': 21352704, 'steps': 111211, 'loss/train': 1.1714750528335571} 11/07/2021 12:52:42 - INFO - __main__ - Step 111213: {'lr': 8.005835521763136e-05, 'samples': 21352896, 'steps': 111212, 'loss/train': 0.42286452651023865} 11/07/2021 12:52:43 - INFO - __main__ - Step 111214: {'lr': 8.005446313825126e-05, 'samples': 21353088, 'steps': 111213, 'loss/train': 0.7685848474502563} 11/07/2021 12:52:43 - INFO - __main__ - Step 111215: {'lr': 8.005057113544598e-05, 'samples': 21353280, 'steps': 111214, 'loss/train': 0.9284830093383789} 11/07/2021 12:52:44 - INFO - __main__ - Step 111216: {'lr': 8.004667920921732e-05, 'samples': 21353472, 'steps': 111215, 'loss/train': 1.059516429901123} 11/07/2021 12:52:44 - INFO - __main__ - Step 111217: {'lr': 8.004278735956697e-05, 'samples': 21353664, 'steps': 111216, 'loss/train': 1.0307165384292603} 11/07/2021 12:52:45 - INFO - __main__ - Step 111218: {'lr': 8.003889558649674e-05, 'samples': 21353856, 'steps': 111217, 'loss/train': 0.9536795616149902} 11/07/2021 12:52:46 - INFO - __main__ - Step 111219: {'lr': 8.003500389000837e-05, 'samples': 21354048, 'steps': 111218, 'loss/train': 1.5708441734313965} 11/07/2021 12:52:46 - INFO - __main__ - Step 111220: {'lr': 8.003111227010365e-05, 'samples': 21354240, 'steps': 111219, 'loss/train': 1.1090513467788696} 11/07/2021 12:52:46 - INFO - __main__ - Step 111221: {'lr': 8.002722072678423e-05, 'samples': 21354432, 'steps': 111220, 'loss/train': 1.5062235593795776} 11/07/2021 12:52:47 - INFO - __main__ - Step 111222: {'lr': 8.002332926005192e-05, 'samples': 21354624, 'steps': 111221, 'loss/train': 1.1638263463974} 11/07/2021 12:52:47 - INFO - __main__ - Step 111223: {'lr': 8.001943786990848e-05, 'samples': 21354816, 'steps': 111222, 'loss/train': 1.2093605995178223} 11/07/2021 12:52:48 - INFO - __main__ - Step 111224: {'lr': 8.001554655635565e-05, 'samples': 21355008, 'steps': 111223, 'loss/train': 1.0497746467590332} 11/07/2021 12:52:48 - INFO - __main__ - Step 111225: {'lr': 8.001165531939519e-05, 'samples': 21355200, 'steps': 111224, 'loss/train': 1.2424830198287964} 11/07/2021 12:52:49 - INFO - __main__ - Step 111226: {'lr': 8.000776415902886e-05, 'samples': 21355392, 'steps': 111225, 'loss/train': 1.1875802278518677} 11/07/2021 12:52:49 - INFO - __main__ - Step 111227: {'lr': 8.000387307525841e-05, 'samples': 21355584, 'steps': 111226, 'loss/train': 1.3812183141708374} 11/07/2021 12:52:49 - INFO - __main__ - Step 111228: {'lr': 7.999998206808559e-05, 'samples': 21355776, 'steps': 111227, 'loss/train': 1.513831615447998} 11/07/2021 12:52:51 - INFO - __main__ - Step 111229: {'lr': 7.999609113751217e-05, 'samples': 21355968, 'steps': 111228, 'loss/train': 1.6567716598510742} 11/07/2021 12:52:51 - INFO - __main__ - Step 111230: {'lr': 7.999220028353985e-05, 'samples': 21356160, 'steps': 111229, 'loss/train': 0.9311666488647461} 11/07/2021 12:52:51 - INFO - __main__ - Step 111231: {'lr': 7.998830950617044e-05, 'samples': 21356352, 'steps': 111230, 'loss/train': 1.0671560764312744} 11/07/2021 12:52:52 - INFO - __main__ - Step 111232: {'lr': 7.998441880540569e-05, 'samples': 21356544, 'steps': 111231, 'loss/train': 1.3786717653274536} 11/07/2021 12:52:52 - INFO - __main__ - Step 111233: {'lr': 7.99805281812474e-05, 'samples': 21356736, 'steps': 111232, 'loss/train': 1.760961890220642} 11/07/2021 12:52:52 - INFO - __main__ - Step 111234: {'lr': 7.99766376336972e-05, 'samples': 21356928, 'steps': 111233, 'loss/train': 1.1611666679382324} 11/07/2021 12:52:53 - INFO - __main__ - Step 111235: {'lr': 7.997274716275688e-05, 'samples': 21357120, 'steps': 111234, 'loss/train': 0.9695661067962646} 11/07/2021 12:52:54 - INFO - __main__ - Step 111236: {'lr': 7.996885676842822e-05, 'samples': 21357312, 'steps': 111235, 'loss/train': 1.8618582487106323} 11/07/2021 12:52:54 - INFO - __main__ - Step 111237: {'lr': 7.996496645071296e-05, 'samples': 21357504, 'steps': 111236, 'loss/train': 1.7905884981155396} 11/07/2021 12:52:54 - INFO - __main__ - Step 111238: {'lr': 7.996107620961291e-05, 'samples': 21357696, 'steps': 111237, 'loss/train': 0.4085133671760559} 11/07/2021 12:52:55 - INFO - __main__ - Step 111239: {'lr': 7.995718604512972e-05, 'samples': 21357888, 'steps': 111238, 'loss/train': 1.165042519569397} 11/07/2021 12:52:56 - INFO - __main__ - Step 111240: {'lr': 7.995329595726522e-05, 'samples': 21358080, 'steps': 111239, 'loss/train': 1.4503669738769531} 11/07/2021 12:52:56 - INFO - __main__ - Step 111241: {'lr': 7.994940594602116e-05, 'samples': 21358272, 'steps': 111240, 'loss/train': 1.3834364414215088} 11/07/2021 12:52:56 - INFO - __main__ - Step 111242: {'lr': 7.994551601139924e-05, 'samples': 21358464, 'steps': 111241, 'loss/train': 1.3334261178970337} 11/07/2021 12:52:57 - INFO - __main__ - Step 111243: {'lr': 7.994162615340125e-05, 'samples': 21358656, 'steps': 111242, 'loss/train': 1.3401070833206177} 11/07/2021 12:52:57 - INFO - __main__ - Step 111244: {'lr': 7.993773637202903e-05, 'samples': 21358848, 'steps': 111243, 'loss/train': 1.5750256776809692} 11/07/2021 12:52:58 - INFO - __main__ - Step 111245: {'lr': 7.993384666728418e-05, 'samples': 21359040, 'steps': 111244, 'loss/train': 1.7958818674087524} 11/07/2021 12:52:59 - INFO - __main__ - Step 111246: {'lr': 7.99299570391685e-05, 'samples': 21359232, 'steps': 111245, 'loss/train': 1.7491052150726318} 11/07/2021 12:52:59 - INFO - __main__ - Step 111247: {'lr': 7.992606748768374e-05, 'samples': 21359424, 'steps': 111246, 'loss/train': 1.406026840209961} 11/07/2021 12:52:59 - INFO - __main__ - Step 111248: {'lr': 7.992217801283169e-05, 'samples': 21359616, 'steps': 111247, 'loss/train': 1.7763200998306274} 11/07/2021 12:53:00 - INFO - __main__ - Step 111249: {'lr': 7.991828861461406e-05, 'samples': 21359808, 'steps': 111248, 'loss/train': 1.085996389389038} 11/07/2021 12:53:02 - INFO - __main__ - Step 111250: {'lr': 7.991439929303265e-05, 'samples': 21360000, 'steps': 111249, 'loss/train': 1.2446320056915283} 11/07/2021 12:53:02 - INFO - __main__ - Step 111251: {'lr': 7.991051004808916e-05, 'samples': 21360192, 'steps': 111250, 'loss/train': 1.06394624710083} 11/07/2021 12:53:03 - INFO - __main__ - Step 111252: {'lr': 7.99066208797854e-05, 'samples': 21360384, 'steps': 111251, 'loss/train': 1.739075779914856} 11/07/2021 12:53:03 - INFO - __main__ - Step 111253: {'lr': 7.990273178812307e-05, 'samples': 21360576, 'steps': 111252, 'loss/train': 1.7379380464553833} 11/07/2021 12:53:03 - INFO - __main__ - Step 111254: {'lr': 7.989884277310398e-05, 'samples': 21360768, 'steps': 111253, 'loss/train': 1.4491729736328125} 11/07/2021 12:53:04 - INFO - __main__ - Step 111255: {'lr': 7.989495383472989e-05, 'samples': 21360960, 'steps': 111254, 'loss/train': 1.221103310585022} 11/07/2021 12:53:04 - INFO - __main__ - Step 111256: {'lr': 7.989106497300241e-05, 'samples': 21361152, 'steps': 111255, 'loss/train': 1.0499237775802612} 11/07/2021 12:53:04 - INFO - __main__ - Step 111257: {'lr': 7.988717618792344e-05, 'samples': 21361344, 'steps': 111256, 'loss/train': 1.4176360368728638} 11/07/2021 12:53:05 - INFO - __main__ - Step 111258: {'lr': 7.988328747949467e-05, 'samples': 21361536, 'steps': 111257, 'loss/train': 1.0869039297103882} 11/07/2021 12:53:06 - INFO - __main__ - Step 111259: {'lr': 7.987939884771786e-05, 'samples': 21361728, 'steps': 111258, 'loss/train': 1.1852790117263794} 11/07/2021 12:53:06 - INFO - __main__ - Step 111260: {'lr': 7.987551029259474e-05, 'samples': 21361920, 'steps': 111259, 'loss/train': 1.5337212085723877} 11/07/2021 12:53:07 - INFO - __main__ - Step 111261: {'lr': 7.987162181412713e-05, 'samples': 21362112, 'steps': 111260, 'loss/train': 1.5620872974395752} 11/07/2021 12:53:07 - INFO - __main__ - Step 111262: {'lr': 7.986773341231673e-05, 'samples': 21362304, 'steps': 111261, 'loss/train': 1.283207893371582} 11/07/2021 12:53:08 - INFO - __main__ - Step 111263: {'lr': 7.986384508716529e-05, 'samples': 21362496, 'steps': 111262, 'loss/train': 1.2176116704940796} 11/07/2021 12:53:08 - INFO - __main__ - Step 111264: {'lr': 7.985995683867459e-05, 'samples': 21362688, 'steps': 111263, 'loss/train': 1.4654161930084229} 11/07/2021 12:53:09 - INFO - __main__ - Step 111265: {'lr': 7.985606866684635e-05, 'samples': 21362880, 'steps': 111264, 'loss/train': 1.3316258192062378} 11/07/2021 12:53:09 - INFO - __main__ - Step 111266: {'lr': 7.985218057168245e-05, 'samples': 21363072, 'steps': 111265, 'loss/train': 1.1441298723220825} 11/07/2021 12:53:09 - INFO - __main__ - Step 111267: {'lr': 7.984829255318444e-05, 'samples': 21363264, 'steps': 111266, 'loss/train': 1.5092111825942993} 11/07/2021 12:53:10 - INFO - __main__ - Step 111268: {'lr': 7.984440461135418e-05, 'samples': 21363456, 'steps': 111267, 'loss/train': 0.9244523048400879} 11/07/2021 12:53:11 - INFO - __main__ - Step 111269: {'lr': 7.984051674619338e-05, 'samples': 21363648, 'steps': 111268, 'loss/train': 0.9003018736839294} 11/07/2021 12:53:11 - INFO - __main__ - Step 111270: {'lr': 7.983662895770383e-05, 'samples': 21363840, 'steps': 111269, 'loss/train': 1.1033756732940674} 11/07/2021 12:53:11 - INFO - __main__ - Step 111271: {'lr': 7.983274124588724e-05, 'samples': 21364032, 'steps': 111270, 'loss/train': 0.8955477476119995} 11/07/2021 12:53:12 - INFO - __main__ - Step 111272: {'lr': 7.982885361074544e-05, 'samples': 21364224, 'steps': 111271, 'loss/train': 1.412158727645874} 11/07/2021 12:53:13 - INFO - __main__ - Step 111273: {'lr': 7.98249660522801e-05, 'samples': 21364416, 'steps': 111272, 'loss/train': 1.1566866636276245} 11/07/2021 12:53:13 - INFO - __main__ - Step 111274: {'lr': 7.9821078570493e-05, 'samples': 21364608, 'steps': 111273, 'loss/train': 1.4557453393936157} 11/07/2021 12:53:13 - INFO - __main__ - Step 111275: {'lr': 7.981719116538591e-05, 'samples': 21364800, 'steps': 111274, 'loss/train': 1.4992923736572266} 11/07/2021 12:53:14 - INFO - __main__ - Step 111276: {'lr': 7.981330383696064e-05, 'samples': 21364992, 'steps': 111275, 'loss/train': 1.4074862003326416} 11/07/2021 12:53:14 - INFO - __main__ - Step 111277: {'lr': 7.980941658521882e-05, 'samples': 21365184, 'steps': 111276, 'loss/train': 0.90383380651474} 11/07/2021 12:53:15 - INFO - __main__ - Step 111278: {'lr': 7.980552941016219e-05, 'samples': 21365376, 'steps': 111277, 'loss/train': 1.1357040405273438} 11/07/2021 12:53:16 - INFO - __main__ - Step 111279: {'lr': 7.980164231179262e-05, 'samples': 21365568, 'steps': 111278, 'loss/train': 1.26871657371521} 11/07/2021 12:53:16 - INFO - __main__ - Step 111280: {'lr': 7.979775529011177e-05, 'samples': 21365760, 'steps': 111279, 'loss/train': 1.5171265602111816} 11/07/2021 12:53:17 - INFO - __main__ - Step 111281: {'lr': 7.979386834512145e-05, 'samples': 21365952, 'steps': 111280, 'loss/train': 0.39322924613952637} 11/07/2021 12:53:17 - INFO - __main__ - Step 111282: {'lr': 7.978998147682338e-05, 'samples': 21366144, 'steps': 111281, 'loss/train': 1.4523799419403076} 11/07/2021 12:53:18 - INFO - __main__ - Step 111283: {'lr': 7.97860946852193e-05, 'samples': 21366336, 'steps': 111282, 'loss/train': 1.9698550701141357} 11/07/2021 12:53:18 - INFO - __main__ - Step 111284: {'lr': 7.978220797031099e-05, 'samples': 21366528, 'steps': 111283, 'loss/train': 1.4463369846343994} 11/07/2021 12:53:19 - INFO - __main__ - Step 111285: {'lr': 7.977832133210019e-05, 'samples': 21366720, 'steps': 111284, 'loss/train': 1.3958704471588135} 11/07/2021 12:53:19 - INFO - __main__ - Step 111286: {'lr': 7.977443477058866e-05, 'samples': 21366912, 'steps': 111285, 'loss/train': 1.3704861402511597} 11/07/2021 12:53:19 - INFO - __main__ - Step 111287: {'lr': 7.97705482857782e-05, 'samples': 21367104, 'steps': 111286, 'loss/train': 1.1312836408615112} 11/07/2021 12:53:20 - INFO - __main__ - Step 111288: {'lr': 7.976666187767042e-05, 'samples': 21367296, 'steps': 111287, 'loss/train': 0.906754195690155} 11/07/2021 12:53:22 - INFO - __main__ - Step 111289: {'lr': 7.976277554626718e-05, 'samples': 21367488, 'steps': 111288, 'loss/train': 1.43138587474823} 11/07/2021 12:53:22 - INFO - __main__ - Step 111290: {'lr': 7.975888929157021e-05, 'samples': 21367680, 'steps': 111289, 'loss/train': 1.707297921180725} 11/07/2021 12:53:22 - INFO - __main__ - Step 111291: {'lr': 7.975500311358125e-05, 'samples': 21367872, 'steps': 111290, 'loss/train': 1.0553135871887207} 11/07/2021 12:53:23 - INFO - __main__ - Step 111292: {'lr': 7.975111701230206e-05, 'samples': 21368064, 'steps': 111291, 'loss/train': 1.7338653802871704} 11/07/2021 12:53:23 - INFO - __main__ - Step 111293: {'lr': 7.974723098773437e-05, 'samples': 21368256, 'steps': 111292, 'loss/train': 1.728636384010315} 11/07/2021 12:53:23 - INFO - __main__ - Step 111294: {'lr': 7.974334503987998e-05, 'samples': 21368448, 'steps': 111293, 'loss/train': 1.5385342836380005} 11/07/2021 12:53:25 - INFO - __main__ - Step 111295: {'lr': 7.973945916874059e-05, 'samples': 21368640, 'steps': 111294, 'loss/train': 1.225166916847229} 11/07/2021 12:53:25 - INFO - __main__ - Step 111296: {'lr': 7.9735573374318e-05, 'samples': 21368832, 'steps': 111295, 'loss/train': 1.6828476190567017} 11/07/2021 12:53:25 - INFO - __main__ - Step 111297: {'lr': 7.97316876566139e-05, 'samples': 21369024, 'steps': 111296, 'loss/train': 1.2039108276367188} 11/07/2021 12:53:26 - INFO - __main__ - Step 111298: {'lr': 7.97278020156301e-05, 'samples': 21369216, 'steps': 111297, 'loss/train': 1.115358591079712} 11/07/2021 12:53:26 - INFO - __main__ - Step 111299: {'lr': 7.972391645136831e-05, 'samples': 21369408, 'steps': 111298, 'loss/train': 1.3033488988876343} 11/07/2021 12:53:27 - INFO - __main__ - Step 111300: {'lr': 7.97200309638303e-05, 'samples': 21369600, 'steps': 111299, 'loss/train': 0.7353556752204895} 11/07/2021 12:53:28 - INFO - __main__ - Step 111301: {'lr': 7.97161455530179e-05, 'samples': 21369792, 'steps': 111300, 'loss/train': 0.3103999197483063} 11/07/2021 12:53:28 - INFO - __main__ - Step 111302: {'lr': 7.971226021893269e-05, 'samples': 21369984, 'steps': 111301, 'loss/train': 1.3830089569091797} 11/07/2021 12:53:28 - INFO - __main__ - Step 111303: {'lr': 7.970837496157651e-05, 'samples': 21370176, 'steps': 111302, 'loss/train': 1.7292802333831787} 11/07/2021 12:53:29 - INFO - __main__ - Step 111304: {'lr': 7.97044897809511e-05, 'samples': 21370368, 'steps': 111303, 'loss/train': 1.531328558921814} 11/07/2021 12:53:29 - INFO - __main__ - Step 111305: {'lr': 7.970060467705822e-05, 'samples': 21370560, 'steps': 111304, 'loss/train': 1.7513974905014038} 11/07/2021 12:53:30 - INFO - __main__ - Step 111306: {'lr': 7.969671964989964e-05, 'samples': 21370752, 'steps': 111305, 'loss/train': 1.2329604625701904} 11/07/2021 12:53:30 - INFO - __main__ - Step 111307: {'lr': 7.969283469947708e-05, 'samples': 21370944, 'steps': 111306, 'loss/train': 1.5928385257720947} 11/07/2021 12:53:31 - INFO - __main__ - Step 111308: {'lr': 7.968894982579228e-05, 'samples': 21371136, 'steps': 111307, 'loss/train': 0.8198584914207458} 11/07/2021 12:53:31 - INFO - __main__ - Step 111309: {'lr': 7.968506502884703e-05, 'samples': 21371328, 'steps': 111308, 'loss/train': 1.0636990070343018} 11/07/2021 12:53:31 - INFO - __main__ - Step 111310: {'lr': 7.968118030864307e-05, 'samples': 21371520, 'steps': 111309, 'loss/train': 1.6910488605499268} 11/07/2021 12:53:32 - INFO - __main__ - Step 111311: {'lr': 7.967729566518215e-05, 'samples': 21371712, 'steps': 111310, 'loss/train': 1.7570013999938965} 11/07/2021 12:53:33 - INFO - __main__ - Step 111312: {'lr': 7.967341109846599e-05, 'samples': 21371904, 'steps': 111311, 'loss/train': 0.8402312994003296} 11/07/2021 12:53:33 - INFO - __main__ - Step 111313: {'lr': 7.966952660849635e-05, 'samples': 21372096, 'steps': 111312, 'loss/train': 1.5094927549362183} 11/07/2021 12:53:33 - INFO - __main__ - Step 111314: {'lr': 7.966564219527511e-05, 'samples': 21372288, 'steps': 111313, 'loss/train': 1.3533068895339966} 11/07/2021 12:53:34 - INFO - __main__ - Step 111315: {'lr': 7.966175785880378e-05, 'samples': 21372480, 'steps': 111314, 'loss/train': 0.46009668707847595} 11/07/2021 12:53:35 - INFO - __main__ - Step 111316: {'lr': 7.965787359908428e-05, 'samples': 21372672, 'steps': 111315, 'loss/train': 1.2719618082046509} 11/07/2021 12:53:35 - INFO - __main__ - Step 111317: {'lr': 7.96539894161183e-05, 'samples': 21372864, 'steps': 111316, 'loss/train': 1.1634241342544556} 11/07/2021 12:53:36 - INFO - __main__ - Step 111318: {'lr': 7.965010530990758e-05, 'samples': 21373056, 'steps': 111317, 'loss/train': 1.568893313407898} 11/07/2021 12:53:36 - INFO - __main__ - Step 111319: {'lr': 7.96462212804539e-05, 'samples': 21373248, 'steps': 111318, 'loss/train': 0.8577872514724731} 11/07/2021 12:53:37 - INFO - __main__ - Step 111320: {'lr': 7.964233732775902e-05, 'samples': 21373440, 'steps': 111319, 'loss/train': 1.6549041271209717} 11/07/2021 12:53:37 - INFO - __main__ - Step 111321: {'lr': 7.963845345182466e-05, 'samples': 21373632, 'steps': 111320, 'loss/train': 1.6081050634384155} 11/07/2021 12:53:38 - INFO - __main__ - Step 111322: {'lr': 7.96345696526526e-05, 'samples': 21373824, 'steps': 111321, 'loss/train': 1.5667610168457031} 11/07/2021 12:53:38 - INFO - __main__ - Step 111323: {'lr': 7.963068593024455e-05, 'samples': 21374016, 'steps': 111322, 'loss/train': 1.380725622177124} 11/07/2021 12:53:39 - INFO - __main__ - Step 111324: {'lr': 7.962680228460232e-05, 'samples': 21374208, 'steps': 111323, 'loss/train': 1.4612869024276733} 11/07/2021 12:53:39 - INFO - __main__ - Step 111325: {'lr': 7.96229187157276e-05, 'samples': 21374400, 'steps': 111324, 'loss/train': 1.475783348083496} 11/07/2021 12:53:39 - INFO - __main__ - Step 111326: {'lr': 7.961903522362215e-05, 'samples': 21374592, 'steps': 111325, 'loss/train': 1.0940351486206055} 11/07/2021 12:53:40 - INFO - __main__ - Step 111327: {'lr': 7.961515180828777e-05, 'samples': 21374784, 'steps': 111326, 'loss/train': 1.1805541515350342} 11/07/2021 12:53:41 - INFO - __main__ - Step 111328: {'lr': 7.961126846972622e-05, 'samples': 21374976, 'steps': 111327, 'loss/train': 1.7808208465576172} 11/07/2021 12:53:41 - INFO - __main__ - Step 111329: {'lr': 7.960738520793914e-05, 'samples': 21375168, 'steps': 111328, 'loss/train': 0.8536880016326904} 11/07/2021 12:53:41 - INFO - __main__ - Step 111330: {'lr': 7.960350202292834e-05, 'samples': 21375360, 'steps': 111329, 'loss/train': 1.318652629852295} 11/07/2021 12:53:42 - INFO - __main__ - Step 111331: {'lr': 7.959961891469558e-05, 'samples': 21375552, 'steps': 111330, 'loss/train': 1.2909042835235596} 11/07/2021 12:53:42 - INFO - __main__ - Step 111332: {'lr': 7.959573588324259e-05, 'samples': 21375744, 'steps': 111331, 'loss/train': 1.3286782503128052} 11/07/2021 12:53:43 - INFO - __main__ - Step 111333: {'lr': 7.959185292857113e-05, 'samples': 21375936, 'steps': 111332, 'loss/train': 0.5558999180793762} 11/07/2021 12:53:43 - INFO - __main__ - Step 111334: {'lr': 7.958797005068296e-05, 'samples': 21376128, 'steps': 111333, 'loss/train': 1.463145136833191} 11/07/2021 12:53:44 - INFO - __main__ - Step 111335: {'lr': 7.958408724957982e-05, 'samples': 21376320, 'steps': 111334, 'loss/train': 1.084385871887207} 11/07/2021 12:53:44 - INFO - __main__ - Step 111336: {'lr': 7.958020452526346e-05, 'samples': 21376512, 'steps': 111335, 'loss/train': 1.0432474613189697} 11/07/2021 12:53:45 - INFO - __main__ - Step 111337: {'lr': 7.957632187773565e-05, 'samples': 21376704, 'steps': 111336, 'loss/train': 1.3835034370422363} 11/07/2021 12:53:45 - INFO - __main__ - Step 111338: {'lr': 7.957243930699809e-05, 'samples': 21376896, 'steps': 111337, 'loss/train': 1.1952344179153442} 11/07/2021 12:53:46 - INFO - __main__ - Step 111339: {'lr': 7.956855681305256e-05, 'samples': 21377088, 'steps': 111338, 'loss/train': 1.3621962070465088} 11/07/2021 12:53:46 - INFO - __main__ - Step 111340: {'lr': 7.956467439590082e-05, 'samples': 21377280, 'steps': 111339, 'loss/train': 0.7035302519798279} 11/07/2021 12:53:47 - INFO - __main__ - Step 111341: {'lr': 7.956079205554468e-05, 'samples': 21377472, 'steps': 111340, 'loss/train': 1.156550407409668} 11/07/2021 12:53:47 - INFO - __main__ - Step 111342: {'lr': 7.955690979198573e-05, 'samples': 21377664, 'steps': 111341, 'loss/train': 1.3400559425354004} 11/07/2021 12:53:48 - INFO - __main__ - Step 111343: {'lr': 7.955302760522582e-05, 'samples': 21377856, 'steps': 111342, 'loss/train': 1.46306574344635} 11/07/2021 12:53:48 - INFO - __main__ - Step 111344: {'lr': 7.954914549526668e-05, 'samples': 21378048, 'steps': 111343, 'loss/train': 1.38862144947052} 11/07/2021 12:53:49 - INFO - __main__ - Step 111345: {'lr': 7.954526346211008e-05, 'samples': 21378240, 'steps': 111344, 'loss/train': 1.346417784690857} 11/07/2021 12:53:49 - INFO - __main__ - Step 111346: {'lr': 7.954138150575773e-05, 'samples': 21378432, 'steps': 111345, 'loss/train': 1.6196346282958984} 11/07/2021 12:53:49 - INFO - __main__ - Step 111347: {'lr': 7.953749962621142e-05, 'samples': 21378624, 'steps': 111346, 'loss/train': 1.060076355934143} 11/07/2021 12:53:50 - INFO - __main__ - Step 111348: {'lr': 7.953361782347288e-05, 'samples': 21378816, 'steps': 111347, 'loss/train': 1.171303629875183} 11/07/2021 12:53:51 - INFO - __main__ - Step 111349: {'lr': 7.952973609754386e-05, 'samples': 21379008, 'steps': 111348, 'loss/train': 1.62783944606781} 11/07/2021 12:53:51 - INFO - __main__ - Step 111350: {'lr': 7.952585444842611e-05, 'samples': 21379200, 'steps': 111349, 'loss/train': 1.12759268283844} 11/07/2021 12:53:51 - INFO - __main__ - Step 111351: {'lr': 7.952197287612137e-05, 'samples': 21379392, 'steps': 111350, 'loss/train': 1.6989980936050415} 11/07/2021 12:53:52 - INFO - __main__ - Step 111352: {'lr': 7.951809138063141e-05, 'samples': 21379584, 'steps': 111351, 'loss/train': 1.5228618383407593} 11/07/2021 12:53:53 - INFO - __main__ - Step 111353: {'lr': 7.951420996195796e-05, 'samples': 21379776, 'steps': 111352, 'loss/train': 1.1721317768096924} 11/07/2021 12:53:53 - INFO - __main__ - Step 111354: {'lr': 7.95103286201028e-05, 'samples': 21379968, 'steps': 111353, 'loss/train': 1.6261321306228638} 11/07/2021 12:53:54 - INFO - __main__ - Step 111355: {'lr': 7.950644735506771e-05, 'samples': 21380160, 'steps': 111354, 'loss/train': 1.684741497039795} 11/07/2021 12:53:54 - INFO - __main__ - Step 111356: {'lr': 7.950256616685431e-05, 'samples': 21380352, 'steps': 111355, 'loss/train': 1.0386360883712769} 11/07/2021 12:53:54 - INFO - __main__ - Step 111357: {'lr': 7.949868505546443e-05, 'samples': 21380544, 'steps': 111356, 'loss/train': 1.2003159523010254} 11/07/2021 12:53:55 - INFO - __main__ - Step 111358: {'lr': 7.949480402089978e-05, 'samples': 21380736, 'steps': 111357, 'loss/train': 1.34766685962677} 11/07/2021 12:53:56 - INFO - __main__ - Step 111359: {'lr': 7.949092306316219e-05, 'samples': 21380928, 'steps': 111358, 'loss/train': 1.376224398612976} 11/07/2021 12:53:56 - INFO - __main__ - Step 111360: {'lr': 7.948704218225333e-05, 'samples': 21381120, 'steps': 111359, 'loss/train': 1.4233745336532593} 11/07/2021 12:53:56 - INFO - __main__ - Step 111361: {'lr': 7.948316137817497e-05, 'samples': 21381312, 'steps': 111360, 'loss/train': 0.582184910774231} 11/07/2021 12:53:57 - INFO - __main__ - Step 111362: {'lr': 7.947928065092888e-05, 'samples': 21381504, 'steps': 111361, 'loss/train': 1.1358052492141724} 11/07/2021 12:53:57 - INFO - __main__ - Step 111363: {'lr': 7.947540000051678e-05, 'samples': 21381696, 'steps': 111362, 'loss/train': 1.097632646560669} 11/07/2021 12:53:58 - INFO - __main__ - Step 111364: {'lr': 7.947151942694045e-05, 'samples': 21381888, 'steps': 111363, 'loss/train': 1.3074140548706055} 11/07/2021 12:53:59 - INFO - __main__ - Step 111365: {'lr': 7.946763893020162e-05, 'samples': 21382080, 'steps': 111364, 'loss/train': 1.7917635440826416} 11/07/2021 12:53:59 - INFO - __main__ - Step 111366: {'lr': 7.946375851030205e-05, 'samples': 21382272, 'steps': 111365, 'loss/train': 0.7590343952178955} 11/07/2021 12:53:59 - INFO - __main__ - Step 111367: {'lr': 7.945987816724346e-05, 'samples': 21382464, 'steps': 111366, 'loss/train': 1.5100475549697876} 11/07/2021 12:54:00 - INFO - __main__ - Step 111368: {'lr': 7.94559979010277e-05, 'samples': 21382656, 'steps': 111367, 'loss/train': 1.6120779514312744} 11/07/2021 12:54:01 - INFO - __main__ - Step 111369: {'lr': 7.945211771165636e-05, 'samples': 21382848, 'steps': 111368, 'loss/train': 1.5701053142547607} 11/07/2021 12:54:01 - INFO - __main__ - Step 111370: {'lr': 7.944823759913128e-05, 'samples': 21383040, 'steps': 111369, 'loss/train': 1.064441204071045} 11/07/2021 12:54:01 - INFO - __main__ - Step 111371: {'lr': 7.944435756345417e-05, 'samples': 21383232, 'steps': 111370, 'loss/train': 1.441196084022522} 11/07/2021 12:54:02 - INFO - __main__ - Step 111372: {'lr': 7.94404776046268e-05, 'samples': 21383424, 'steps': 111371, 'loss/train': 0.9348549246788025} 11/07/2021 12:54:02 - INFO - __main__ - Step 111373: {'lr': 7.94365977226509e-05, 'samples': 21383616, 'steps': 111372, 'loss/train': 1.3359711170196533} 11/07/2021 12:54:03 - INFO - __main__ - Step 111374: {'lr': 7.943271791752829e-05, 'samples': 21383808, 'steps': 111373, 'loss/train': 1.4457623958587646} 11/07/2021 12:54:03 - INFO - __main__ - Step 111375: {'lr': 7.942883818926063e-05, 'samples': 21384000, 'steps': 111374, 'loss/train': 1.331249475479126} 11/07/2021 12:54:04 - INFO - __main__ - Step 111376: {'lr': 7.942495853784972e-05, 'samples': 21384192, 'steps': 111375, 'loss/train': 1.4278342723846436} 11/07/2021 12:54:04 - INFO - __main__ - Step 111377: {'lr': 7.942107896329728e-05, 'samples': 21384384, 'steps': 111376, 'loss/train': 1.1502984762191772} 11/07/2021 12:54:05 - INFO - __main__ - Step 111378: {'lr': 7.941719946560507e-05, 'samples': 21384576, 'steps': 111377, 'loss/train': 2.040442705154419} 11/07/2021 12:54:06 - INFO - __main__ - Step 111379: {'lr': 7.941332004477483e-05, 'samples': 21384768, 'steps': 111378, 'loss/train': 1.6311739683151245} 11/07/2021 12:54:06 - INFO - __main__ - Step 111380: {'lr': 7.940944070080832e-05, 'samples': 21384960, 'steps': 111379, 'loss/train': 0.8009911179542542} 11/07/2021 12:54:06 - INFO - __main__ - Step 111381: {'lr': 7.940556143370739e-05, 'samples': 21385152, 'steps': 111380, 'loss/train': 1.1633620262145996} 11/07/2021 12:54:07 - INFO - __main__ - Step 111382: {'lr': 7.940168224347358e-05, 'samples': 21385344, 'steps': 111381, 'loss/train': 1.1268551349639893} 11/07/2021 12:54:07 - INFO - __main__ - Step 111383: {'lr': 7.939780313010875e-05, 'samples': 21385536, 'steps': 111382, 'loss/train': 1.7094184160232544} 11/07/2021 12:54:08 - INFO - __main__ - Step 111384: {'lr': 7.939392409361462e-05, 'samples': 21385728, 'steps': 111383, 'loss/train': 1.6845343112945557} 11/07/2021 12:54:08 - INFO - __main__ - Step 111385: {'lr': 7.939004513399295e-05, 'samples': 21385920, 'steps': 111384, 'loss/train': 1.0552754402160645} 11/07/2021 12:54:09 - INFO - __main__ - Step 111386: {'lr': 7.93861662512455e-05, 'samples': 21386112, 'steps': 111385, 'loss/train': 1.0440216064453125} 11/07/2021 12:54:09 - INFO - __main__ - Step 111387: {'lr': 7.938228744537404e-05, 'samples': 21386304, 'steps': 111386, 'loss/train': 1.406472086906433} 11/07/2021 12:54:09 - INFO - __main__ - Step 111388: {'lr': 7.937840871638025e-05, 'samples': 21386496, 'steps': 111387, 'loss/train': 1.6508318185806274} 11/07/2021 12:54:10 - INFO - __main__ - Step 111389: {'lr': 7.937453006426592e-05, 'samples': 21386688, 'steps': 111388, 'loss/train': 0.7792830467224121} 11/07/2021 12:54:11 - INFO - __main__ - Step 111390: {'lr': 7.937065148903283e-05, 'samples': 21386880, 'steps': 111389, 'loss/train': 1.2337404489517212} 11/07/2021 12:54:11 - INFO - __main__ - Step 111391: {'lr': 7.936677299068265e-05, 'samples': 21387072, 'steps': 111390, 'loss/train': 1.4819567203521729} 11/07/2021 12:54:11 - INFO - __main__ - Step 111392: {'lr': 7.93628945692172e-05, 'samples': 21387264, 'steps': 111391, 'loss/train': 1.1772080659866333} 11/07/2021 12:54:12 - INFO - __main__ - Step 111393: {'lr': 7.935901622463818e-05, 'samples': 21387456, 'steps': 111392, 'loss/train': 1.5115214586257935} 11/07/2021 12:54:13 - INFO - __main__ - Step 111394: {'lr': 7.935513795694735e-05, 'samples': 21387648, 'steps': 111393, 'loss/train': 1.092956781387329} 11/07/2021 12:54:13 - INFO - __main__ - Step 111395: {'lr': 7.935125976614655e-05, 'samples': 21387840, 'steps': 111394, 'loss/train': 1.309692144393921} 11/07/2021 12:54:14 - INFO - __main__ - Step 111396: {'lr': 7.934738165223737e-05, 'samples': 21388032, 'steps': 111395, 'loss/train': 1.2500531673431396} 11/07/2021 12:54:14 - INFO - __main__ - Step 111397: {'lr': 7.93435036152216e-05, 'samples': 21388224, 'steps': 111396, 'loss/train': 0.6333301663398743} 11/07/2021 12:54:14 - INFO - __main__ - Step 111398: {'lr': 7.933962565510103e-05, 'samples': 21388416, 'steps': 111397, 'loss/train': 1.1761759519577026} 11/07/2021 12:54:15 - INFO - __main__ - Step 111399: {'lr': 7.933574777187738e-05, 'samples': 21388608, 'steps': 111398, 'loss/train': 1.1777900457382202} 11/07/2021 12:54:16 - INFO - __main__ - Step 111400: {'lr': 7.933186996555244e-05, 'samples': 21388800, 'steps': 111399, 'loss/train': 1.153892159461975} 11/07/2021 12:54:16 - INFO - __main__ - Step 111401: {'lr': 7.932799223612788e-05, 'samples': 21388992, 'steps': 111400, 'loss/train': 1.162835955619812} 11/07/2021 12:54:16 - INFO - __main__ - Step 111402: {'lr': 7.932411458360553e-05, 'samples': 21389184, 'steps': 111401, 'loss/train': 1.4945003986358643} 11/07/2021 12:54:17 - INFO - __main__ - Step 111403: {'lr': 7.93202370079871e-05, 'samples': 21389376, 'steps': 111402, 'loss/train': 0.9872706532478333} 11/07/2021 12:54:18 - INFO - __main__ - Step 111404: {'lr': 7.931635950927432e-05, 'samples': 21389568, 'steps': 111403, 'loss/train': 0.9951861500740051} 11/07/2021 12:54:18 - INFO - __main__ - Step 111405: {'lr': 7.931248208746895e-05, 'samples': 21389760, 'steps': 111404, 'loss/train': 1.7006845474243164} 11/07/2021 12:54:18 - INFO - __main__ - Step 111406: {'lr': 7.930860474257276e-05, 'samples': 21389952, 'steps': 111405, 'loss/train': 1.6981260776519775} 11/07/2021 12:54:19 - INFO - __main__ - Step 111407: {'lr': 7.930472747458747e-05, 'samples': 21390144, 'steps': 111406, 'loss/train': 1.053165078163147} 11/07/2021 12:54:19 - INFO - __main__ - Step 111408: {'lr': 7.930085028351492e-05, 'samples': 21390336, 'steps': 111407, 'loss/train': 1.251224398612976} 11/07/2021 12:54:19 - INFO - __main__ - Step 111409: {'lr': 7.929697316935666e-05, 'samples': 21390528, 'steps': 111408, 'loss/train': 1.6910111904144287} 11/07/2021 12:54:21 - INFO - __main__ - Step 111410: {'lr': 7.929309613211457e-05, 'samples': 21390720, 'steps': 111409, 'loss/train': 1.0600420236587524} 11/07/2021 12:54:21 - INFO - __main__ - Step 111411: {'lr': 7.928921917179041e-05, 'samples': 21390912, 'steps': 111410, 'loss/train': 0.998562216758728} 11/07/2021 12:54:21 - INFO - __main__ - Step 111412: {'lr': 7.928534228838585e-05, 'samples': 21391104, 'steps': 111411, 'loss/train': 1.604280948638916} 11/07/2021 12:54:22 - INFO - __main__ - Step 111413: {'lr': 7.928146548190271e-05, 'samples': 21391296, 'steps': 111412, 'loss/train': 1.524640440940857} 11/07/2021 12:54:22 - INFO - __main__ - Step 111414: {'lr': 7.92775887523427e-05, 'samples': 21391488, 'steps': 111413, 'loss/train': 1.6773749589920044} 11/07/2021 12:54:23 - INFO - __main__ - Step 111415: {'lr': 7.927371209970757e-05, 'samples': 21391680, 'steps': 111414, 'loss/train': 1.2910747528076172} 11/07/2021 12:54:24 - INFO - __main__ - Step 111416: {'lr': 7.926983552399908e-05, 'samples': 21391872, 'steps': 111415, 'loss/train': 1.356550693511963} 11/07/2021 12:54:24 - INFO - __main__ - Step 111417: {'lr': 7.926595902521893e-05, 'samples': 21392064, 'steps': 111416, 'loss/train': 1.2144854068756104} 11/07/2021 12:54:24 - INFO - __main__ - Step 111418: {'lr': 7.926208260336896e-05, 'samples': 21392256, 'steps': 111417, 'loss/train': 1.2039490938186646} 11/07/2021 12:54:25 - INFO - __main__ - Step 111419: {'lr': 7.925820625845082e-05, 'samples': 21392448, 'steps': 111418, 'loss/train': 1.357364296913147} 11/07/2021 12:54:26 - INFO - __main__ - Step 111420: {'lr': 7.92543299904663e-05, 'samples': 21392640, 'steps': 111419, 'loss/train': 1.2083094120025635} 11/07/2021 12:54:26 - INFO - __main__ - Step 111421: {'lr': 7.925045379941717e-05, 'samples': 21392832, 'steps': 111420, 'loss/train': 1.4397046566009521} 11/07/2021 12:54:27 - INFO - __main__ - Step 111422: {'lr': 7.924657768530521e-05, 'samples': 21393024, 'steps': 111421, 'loss/train': 1.00277578830719} 11/07/2021 12:54:27 - INFO - __main__ - Step 111423: {'lr': 7.924270164813205e-05, 'samples': 21393216, 'steps': 111422, 'loss/train': 1.2290360927581787} 11/07/2021 12:54:27 - INFO - __main__ - Step 111424: {'lr': 7.923882568789947e-05, 'samples': 21393408, 'steps': 111423, 'loss/train': 1.3742018938064575} 11/07/2021 12:54:28 - INFO - __main__ - Step 111425: {'lr': 7.923494980460924e-05, 'samples': 21393600, 'steps': 111424, 'loss/train': 1.134477138519287} 11/07/2021 12:54:29 - INFO - __main__ - Step 111426: {'lr': 7.923107399826313e-05, 'samples': 21393792, 'steps': 111425, 'loss/train': 1.6324793100357056} 11/07/2021 12:54:29 - INFO - __main__ - Step 111427: {'lr': 7.922719826886283e-05, 'samples': 21393984, 'steps': 111426, 'loss/train': 1.3679492473602295} 11/07/2021 12:54:29 - INFO - __main__ - Step 111428: {'lr': 7.922332261641013e-05, 'samples': 21394176, 'steps': 111427, 'loss/train': 1.63848078250885} 11/07/2021 12:54:30 - INFO - __main__ - Step 111429: {'lr': 7.921944704090678e-05, 'samples': 21394368, 'steps': 111428, 'loss/train': 1.309370517730713} 11/07/2021 12:54:30 - INFO - __main__ - Step 111430: {'lr': 7.92155715423545e-05, 'samples': 21394560, 'steps': 111429, 'loss/train': 1.1540770530700684} 11/07/2021 12:54:31 - INFO - __main__ - Step 111431: {'lr': 7.921169612075504e-05, 'samples': 21394752, 'steps': 111430, 'loss/train': 1.2383952140808105} 11/07/2021 12:54:31 - INFO - __main__ - Step 111432: {'lr': 7.920782077611019e-05, 'samples': 21394944, 'steps': 111431, 'loss/train': 1.6615070104599} 11/07/2021 12:54:32 - INFO - __main__ - Step 111433: {'lr': 7.920394550842163e-05, 'samples': 21395136, 'steps': 111432, 'loss/train': 1.581546664237976} 11/07/2021 12:54:32 - INFO - __main__ - Step 111434: {'lr': 7.920007031769114e-05, 'samples': 21395328, 'steps': 111433, 'loss/train': 0.8972306251525879} 11/07/2021 12:54:32 - INFO - __main__ - Step 111435: {'lr': 7.919619520392055e-05, 'samples': 21395520, 'steps': 111434, 'loss/train': 1.8227872848510742} 11/07/2021 12:54:34 - INFO - __main__ - Step 111436: {'lr': 7.919232016711142e-05, 'samples': 21395712, 'steps': 111435, 'loss/train': 1.2835665941238403} 11/07/2021 12:54:34 - INFO - __main__ - Step 111437: {'lr': 7.918844520726561e-05, 'samples': 21395904, 'steps': 111436, 'loss/train': 0.9554334878921509} 11/07/2021 12:54:34 - INFO - __main__ - Step 111438: {'lr': 7.918457032438487e-05, 'samples': 21396096, 'steps': 111437, 'loss/train': 1.6963697671890259} 11/07/2021 12:54:35 - INFO - __main__ - Step 111439: {'lr': 7.91806955184709e-05, 'samples': 21396288, 'steps': 111438, 'loss/train': 0.9714764356613159} 11/07/2021 12:54:35 - INFO - __main__ - Step 111440: {'lr': 7.917682078952549e-05, 'samples': 21396480, 'steps': 111439, 'loss/train': 0.8957384824752808} 11/07/2021 12:54:36 - INFO - __main__ - Step 111441: {'lr': 7.917294613755033e-05, 'samples': 21396672, 'steps': 111440, 'loss/train': 1.647456169128418} 11/07/2021 12:54:36 - INFO - __main__ - Step 111442: {'lr': 7.916907156254724e-05, 'samples': 21396864, 'steps': 111441, 'loss/train': 1.3692318201065063} 11/07/2021 12:54:37 - INFO - __main__ - Step 111443: {'lr': 7.916519706451791e-05, 'samples': 21397056, 'steps': 111442, 'loss/train': 1.405704140663147} 11/07/2021 12:54:37 - INFO - __main__ - Step 111444: {'lr': 7.916132264346412e-05, 'samples': 21397248, 'steps': 111443, 'loss/train': 1.3634865283966064} 11/07/2021 12:54:38 - INFO - __main__ - Step 111445: {'lr': 7.915744829938762e-05, 'samples': 21397440, 'steps': 111444, 'loss/train': 1.6284173727035522} 11/07/2021 12:54:38 - INFO - __main__ - Step 111446: {'lr': 7.915357403229012e-05, 'samples': 21397632, 'steps': 111445, 'loss/train': 1.2590198516845703} 11/07/2021 12:54:39 - INFO - __main__ - Step 111447: {'lr': 7.914969984217337e-05, 'samples': 21397824, 'steps': 111446, 'loss/train': 1.1121912002563477} 11/07/2021 12:54:39 - INFO - __main__ - Step 111448: {'lr': 7.914582572903914e-05, 'samples': 21398016, 'steps': 111447, 'loss/train': 1.0723018646240234} 11/07/2021 12:54:40 - INFO - __main__ - Step 111449: {'lr': 7.914195169288924e-05, 'samples': 21398208, 'steps': 111448, 'loss/train': 1.3868025541305542} 11/07/2021 12:54:40 - INFO - __main__ - Step 111450: {'lr': 7.913807773372527e-05, 'samples': 21398400, 'steps': 111449, 'loss/train': 1.8258190155029297} 11/07/2021 12:54:40 - INFO - __main__ - Step 111451: {'lr': 7.913420385154904e-05, 'samples': 21398592, 'steps': 111450, 'loss/train': 1.1372183561325073} 11/07/2021 12:54:41 - INFO - __main__ - Step 111452: {'lr': 7.91303300463623e-05, 'samples': 21398784, 'steps': 111451, 'loss/train': 0.9747481942176819} 11/07/2021 12:54:42 - INFO - __main__ - Step 111453: {'lr': 7.91264563181668e-05, 'samples': 21398976, 'steps': 111452, 'loss/train': 1.0707300901412964} 11/07/2021 12:54:42 - INFO - __main__ - Step 111454: {'lr': 7.912258266696428e-05, 'samples': 21399168, 'steps': 111453, 'loss/train': 1.0816468000411987} 11/07/2021 12:54:42 - INFO - __main__ - Step 111455: {'lr': 7.911870909275647e-05, 'samples': 21399360, 'steps': 111454, 'loss/train': 1.6872721910476685} 11/07/2021 12:54:43 - INFO - __main__ - Step 111456: {'lr': 7.911483559554516e-05, 'samples': 21399552, 'steps': 111455, 'loss/train': 1.2851612567901611} 11/07/2021 12:54:44 - INFO - __main__ - Step 111457: {'lr': 7.911096217533206e-05, 'samples': 21399744, 'steps': 111456, 'loss/train': 1.0040764808654785} 11/07/2021 12:54:44 - INFO - __main__ - Step 111458: {'lr': 7.910708883211892e-05, 'samples': 21399936, 'steps': 111457, 'loss/train': 1.4490339756011963} 11/07/2021 12:54:45 - INFO - __main__ - Step 111459: {'lr': 7.91032155659075e-05, 'samples': 21400128, 'steps': 111458, 'loss/train': 1.291704773902893} 11/07/2021 12:54:45 - INFO - __main__ - Step 111460: {'lr': 7.909934237669952e-05, 'samples': 21400320, 'steps': 111459, 'loss/train': 1.1978046894073486} 11/07/2021 12:54:45 - INFO - __main__ - Step 111461: {'lr': 7.909546926449675e-05, 'samples': 21400512, 'steps': 111460, 'loss/train': 0.4976511597633362} 11/07/2021 12:54:47 - INFO - __main__ - Step 111462: {'lr': 7.909159622930102e-05, 'samples': 21400704, 'steps': 111461, 'loss/train': 0.8511744737625122} 11/07/2021 12:54:47 - INFO - __main__ - Step 111463: {'lr': 7.908772327111386e-05, 'samples': 21400896, 'steps': 111462, 'loss/train': 1.3388879299163818} 11/07/2021 12:54:47 - INFO - __main__ - Step 111464: {'lr': 7.908385038993715e-05, 'samples': 21401088, 'steps': 111463, 'loss/train': 1.2551676034927368} 11/07/2021 12:54:48 - INFO - __main__ - Step 111465: {'lr': 7.907997758577262e-05, 'samples': 21401280, 'steps': 111464, 'loss/train': 1.4821702241897583} 11/07/2021 12:54:48 - INFO - __main__ - Step 111466: {'lr': 7.907610485862202e-05, 'samples': 21401472, 'steps': 111465, 'loss/train': 1.167733073234558} 11/07/2021 12:54:49 - INFO - __main__ - Step 111467: {'lr': 7.907223220848708e-05, 'samples': 21401664, 'steps': 111466, 'loss/train': 1.2506113052368164} 11/07/2021 12:54:49 - INFO - __main__ - Step 111468: {'lr': 7.906835963536956e-05, 'samples': 21401856, 'steps': 111467, 'loss/train': 0.9575666785240173} 11/07/2021 12:54:50 - INFO - __main__ - Step 111469: {'lr': 7.90644871392712e-05, 'samples': 21402048, 'steps': 111468, 'loss/train': 1.384626865386963} 11/07/2021 12:54:50 - INFO - __main__ - Step 111470: {'lr': 7.906061472019374e-05, 'samples': 21402240, 'steps': 111469, 'loss/train': 1.427290916442871} 11/07/2021 12:54:51 - INFO - __main__ - Step 111471: {'lr': 7.905674237813895e-05, 'samples': 21402432, 'steps': 111470, 'loss/train': 1.6036102771759033} 11/07/2021 12:54:51 - INFO - __main__ - Step 111472: {'lr': 7.905287011310852e-05, 'samples': 21402624, 'steps': 111471, 'loss/train': 1.5481064319610596} 11/07/2021 12:54:52 - INFO - __main__ - Step 111473: {'lr': 7.904899792510425e-05, 'samples': 21402816, 'steps': 111472, 'loss/train': 1.2246304750442505} 11/07/2021 12:54:52 - INFO - __main__ - Step 111474: {'lr': 7.904512581412787e-05, 'samples': 21403008, 'steps': 111473, 'loss/train': 1.5073808431625366} 11/07/2021 12:54:53 - INFO - __main__ - Step 111475: {'lr': 7.904125378018109e-05, 'samples': 21403200, 'steps': 111474, 'loss/train': 1.2063539028167725} 11/07/2021 12:54:53 - INFO - __main__ - Step 111476: {'lr': 7.90373818232658e-05, 'samples': 21403392, 'steps': 111475, 'loss/train': 1.4500936269760132} 11/07/2021 12:54:54 - INFO - __main__ - Step 111477: {'lr': 7.903350994338351e-05, 'samples': 21403584, 'steps': 111476, 'loss/train': 1.5963678359985352} 11/07/2021 12:54:54 - INFO - __main__ - Step 111478: {'lr': 7.902963814053612e-05, 'samples': 21403776, 'steps': 111477, 'loss/train': 1.2277846336364746} 11/07/2021 12:54:55 - INFO - __main__ - Step 111479: {'lr': 7.902576641472531e-05, 'samples': 21403968, 'steps': 111478, 'loss/train': 1.5625921487808228} 11/07/2021 12:54:55 - INFO - __main__ - Step 111480: {'lr': 7.902189476595287e-05, 'samples': 21404160, 'steps': 111479, 'loss/train': 1.4048751592636108} 11/07/2021 12:54:56 - INFO - __main__ - Step 111481: {'lr': 7.901802319422052e-05, 'samples': 21404352, 'steps': 111480, 'loss/train': 1.2264031171798706} 11/07/2021 12:54:56 - INFO - __main__ - Step 111482: {'lr': 7.901415169953e-05, 'samples': 21404544, 'steps': 111481, 'loss/train': 1.5200762748718262} 11/07/2021 12:54:57 - INFO - __main__ - Step 111483: {'lr': 7.901028028188306e-05, 'samples': 21404736, 'steps': 111482, 'loss/train': 1.6650946140289307} 11/07/2021 12:54:57 - INFO - __main__ - Step 111484: {'lr': 7.900640894128147e-05, 'samples': 21404928, 'steps': 111483, 'loss/train': 1.259650707244873} 11/07/2021 12:54:58 - INFO - __main__ - Step 111485: {'lr': 7.900253767772694e-05, 'samples': 21405120, 'steps': 111484, 'loss/train': 1.1902893781661987} 11/07/2021 12:54:58 - INFO - __main__ - Step 111486: {'lr': 7.899866649122123e-05, 'samples': 21405312, 'steps': 111485, 'loss/train': 1.0880457162857056} 11/07/2021 12:54:58 - INFO - __main__ - Step 111487: {'lr': 7.899479538176607e-05, 'samples': 21405504, 'steps': 111486, 'loss/train': 1.3587099313735962} 11/07/2021 12:55:00 - INFO - __main__ - Step 111488: {'lr': 7.899092434936325e-05, 'samples': 21405696, 'steps': 111487, 'loss/train': 0.5653348565101624} 11/07/2021 12:55:00 - INFO - __main__ - Step 111489: {'lr': 7.898705339401455e-05, 'samples': 21405888, 'steps': 111488, 'loss/train': 1.3326871395111084} 11/07/2021 12:55:00 - INFO - __main__ - Step 111490: {'lr': 7.898318251572153e-05, 'samples': 21406080, 'steps': 111489, 'loss/train': 1.6917225122451782} 11/07/2021 12:55:01 - INFO - __main__ - Step 111491: {'lr': 7.897931171448608e-05, 'samples': 21406272, 'steps': 111490, 'loss/train': 0.9075872302055359} 11/07/2021 12:55:01 - INFO - __main__ - Step 111492: {'lr': 7.89754409903099e-05, 'samples': 21406464, 'steps': 111491, 'loss/train': 1.5004959106445312} 11/07/2021 12:55:02 - INFO - __main__ - Step 111493: {'lr': 7.897157034319476e-05, 'samples': 21406656, 'steps': 111492, 'loss/train': 1.6277607679367065} 11/07/2021 12:55:03 - INFO - __main__ - Step 111494: {'lr': 7.896769977314239e-05, 'samples': 21406848, 'steps': 111493, 'loss/train': 1.1908867359161377} 11/07/2021 12:55:03 - INFO - __main__ - Step 111495: {'lr': 7.896382928015452e-05, 'samples': 21407040, 'steps': 111494, 'loss/train': 1.9826288223266602} 11/07/2021 12:55:03 - INFO - __main__ - Step 111496: {'lr': 7.895995886423293e-05, 'samples': 21407232, 'steps': 111495, 'loss/train': 1.667885422706604} 11/07/2021 12:55:04 - INFO - __main__ - Step 111497: {'lr': 7.895608852537934e-05, 'samples': 21407424, 'steps': 111496, 'loss/train': 1.1552022695541382} 11/07/2021 12:55:05 - INFO - __main__ - Step 111498: {'lr': 7.89522182635955e-05, 'samples': 21407616, 'steps': 111497, 'loss/train': 1.0230977535247803} 11/07/2021 12:55:05 - INFO - __main__ - Step 111499: {'lr': 7.894834807888313e-05, 'samples': 21407808, 'steps': 111498, 'loss/train': 1.4769163131713867} 11/07/2021 12:55:05 - INFO - __main__ - Step 111500: {'lr': 7.894447797124401e-05, 'samples': 21408000, 'steps': 111499, 'loss/train': 1.5558581352233887} 11/07/2021 12:55:06 - INFO - __main__ - Step 111501: {'lr': 7.894060794067987e-05, 'samples': 21408192, 'steps': 111500, 'loss/train': 1.6044660806655884} 11/07/2021 12:55:06 - INFO - __main__ - Step 111502: {'lr': 7.893673798719253e-05, 'samples': 21408384, 'steps': 111501, 'loss/train': 1.3721364736557007} 11/07/2021 12:55:07 - INFO - __main__ - Step 111503: {'lr': 7.893286811078357e-05, 'samples': 21408576, 'steps': 111502, 'loss/train': 0.728068470954895} 11/07/2021 12:55:08 - INFO - __main__ - Step 111504: {'lr': 7.892899831145481e-05, 'samples': 21408768, 'steps': 111503, 'loss/train': 2.165837049484253} 11/07/2021 12:55:08 - INFO - __main__ - Step 111505: {'lr': 7.892512858920803e-05, 'samples': 21408960, 'steps': 111504, 'loss/train': 0.6574528217315674} 11/07/2021 12:55:08 - INFO - __main__ - Step 111506: {'lr': 7.892125894404492e-05, 'samples': 21409152, 'steps': 111505, 'loss/train': 1.2771601676940918} 11/07/2021 12:55:09 - INFO - __main__ - Step 111507: {'lr': 7.891738937596729e-05, 'samples': 21409344, 'steps': 111506, 'loss/train': 0.5444215536117554} 11/07/2021 12:55:09 - INFO - __main__ - Step 111508: {'lr': 7.89135198849768e-05, 'samples': 21409536, 'steps': 111507, 'loss/train': 1.0278124809265137} 11/07/2021 12:55:10 - INFO - __main__ - Step 111509: {'lr': 7.890965047107526e-05, 'samples': 21409728, 'steps': 111508, 'loss/train': 1.2261775732040405} 11/07/2021 12:55:10 - INFO - __main__ - Step 111510: {'lr': 7.890578113426439e-05, 'samples': 21409920, 'steps': 111509, 'loss/train': 1.1831538677215576} 11/07/2021 12:55:11 - INFO - __main__ - Step 111511: {'lr': 7.890191187454593e-05, 'samples': 21410112, 'steps': 111510, 'loss/train': 1.6497234106063843} 11/07/2021 12:55:11 - INFO - __main__ - Step 111512: {'lr': 7.889804269192172e-05, 'samples': 21410304, 'steps': 111511, 'loss/train': 1.5257701873779297} 11/07/2021 12:55:11 - INFO - __main__ - Step 111513: {'lr': 7.889417358639334e-05, 'samples': 21410496, 'steps': 111512, 'loss/train': 1.1688923835754395} 11/07/2021 12:55:12 - INFO - __main__ - Step 111514: {'lr': 7.889030455796259e-05, 'samples': 21410688, 'steps': 111513, 'loss/train': 2.0790388584136963} 11/07/2021 12:55:13 - INFO - __main__ - Step 111515: {'lr': 7.888643560663123e-05, 'samples': 21410880, 'steps': 111514, 'loss/train': 1.4559998512268066} 11/07/2021 12:55:13 - INFO - __main__ - Step 111516: {'lr': 7.888256673240099e-05, 'samples': 21411072, 'steps': 111515, 'loss/train': 1.5685334205627441} 11/07/2021 12:55:14 - INFO - __main__ - Step 111517: {'lr': 7.887869793527363e-05, 'samples': 21411264, 'steps': 111516, 'loss/train': 0.5954333543777466} 11/07/2021 12:55:14 - INFO - __main__ - Step 111518: {'lr': 7.887482921525088e-05, 'samples': 21411456, 'steps': 111517, 'loss/train': 1.4848206043243408} 11/07/2021 12:55:15 - INFO - __main__ - Step 111519: {'lr': 7.88709605723345e-05, 'samples': 21411648, 'steps': 111518, 'loss/train': 1.1759223937988281} 11/07/2021 12:55:15 - INFO - __main__ - Step 111520: {'lr': 7.886709200652625e-05, 'samples': 21411840, 'steps': 111519, 'loss/train': 1.203844666481018} 11/07/2021 12:55:16 - INFO - __main__ - Step 111521: {'lr': 7.886322351782782e-05, 'samples': 21412032, 'steps': 111520, 'loss/train': 1.1364309787750244} 11/07/2021 12:55:16 - INFO - __main__ - Step 111522: {'lr': 7.885935510624099e-05, 'samples': 21412224, 'steps': 111521, 'loss/train': 1.0184208154678345} 11/07/2021 12:55:16 - INFO - __main__ - Step 111523: {'lr': 7.885548677176756e-05, 'samples': 21412416, 'steps': 111522, 'loss/train': 1.441230297088623} 11/07/2021 12:55:17 - INFO - __main__ - Step 111524: {'lr': 7.885161851440914e-05, 'samples': 21412608, 'steps': 111523, 'loss/train': 0.46842852234840393} 11/07/2021 12:55:18 - INFO - __main__ - Step 111525: {'lr': 7.884775033416755e-05, 'samples': 21412800, 'steps': 111524, 'loss/train': 1.5562163591384888} 11/07/2021 12:55:18 - INFO - __main__ - Step 111526: {'lr': 7.884388223104449e-05, 'samples': 21412992, 'steps': 111525, 'loss/train': 1.3621174097061157} 11/07/2021 12:55:18 - INFO - __main__ - Step 111527: {'lr': 7.884001420504175e-05, 'samples': 21413184, 'steps': 111526, 'loss/train': 1.3090075254440308} 11/07/2021 12:55:19 - INFO - __main__ - Step 111528: {'lr': 7.883614625616109e-05, 'samples': 21413376, 'steps': 111527, 'loss/train': 1.2102771997451782} 11/07/2021 12:55:19 - INFO - __main__ - Step 111529: {'lr': 7.883227838440419e-05, 'samples': 21413568, 'steps': 111528, 'loss/train': 1.7052369117736816} 11/07/2021 12:55:20 - INFO - __main__ - Step 111530: {'lr': 7.882841058977283e-05, 'samples': 21413760, 'steps': 111529, 'loss/train': 1.2326432466506958} 11/07/2021 12:55:20 - INFO - __main__ - Step 111531: {'lr': 7.882454287226873e-05, 'samples': 21413952, 'steps': 111530, 'loss/train': 1.1779673099517822} 11/07/2021 12:55:21 - INFO - __main__ - Step 111532: {'lr': 7.882067523189368e-05, 'samples': 21414144, 'steps': 111531, 'loss/train': 2.457674264907837} 11/07/2021 12:55:21 - INFO - __main__ - Step 111533: {'lr': 7.881680766864938e-05, 'samples': 21414336, 'steps': 111532, 'loss/train': 1.8391149044036865} 11/07/2021 12:55:22 - INFO - __main__ - Step 111534: {'lr': 7.881294018253765e-05, 'samples': 21414528, 'steps': 111533, 'loss/train': 1.347501277923584} 11/07/2021 12:55:23 - INFO - __main__ - Step 111535: {'lr': 7.880907277356008e-05, 'samples': 21414720, 'steps': 111534, 'loss/train': 1.4975287914276123} 11/07/2021 12:55:23 - INFO - __main__ - Step 111536: {'lr': 7.880520544171854e-05, 'samples': 21414912, 'steps': 111535, 'loss/train': 1.3305262327194214} 11/07/2021 12:55:23 - INFO - __main__ - Step 111537: {'lr': 7.880133818701471e-05, 'samples': 21415104, 'steps': 111536, 'loss/train': 1.5823136568069458} 11/07/2021 12:55:24 - INFO - __main__ - Step 111538: {'lr': 7.879747100945037e-05, 'samples': 21415296, 'steps': 111537, 'loss/train': 0.8818501830101013} 11/07/2021 12:55:24 - INFO - __main__ - Step 111539: {'lr': 7.879360390902724e-05, 'samples': 21415488, 'steps': 111538, 'loss/train': 1.0626778602600098} 11/07/2021 12:55:25 - INFO - __main__ - Step 111540: {'lr': 7.878973688574706e-05, 'samples': 21415680, 'steps': 111539, 'loss/train': 1.2169352769851685} 11/07/2021 12:55:25 - INFO - __main__ - Step 111541: {'lr': 7.87858699396116e-05, 'samples': 21415872, 'steps': 111540, 'loss/train': 1.6864700317382812} 11/07/2021 12:55:26 - INFO - __main__ - Step 111542: {'lr': 7.878200307062255e-05, 'samples': 21416064, 'steps': 111541, 'loss/train': 1.3711163997650146} 11/07/2021 12:55:26 - INFO - __main__ - Step 111543: {'lr': 7.877813627878173e-05, 'samples': 21416256, 'steps': 111542, 'loss/train': 1.3710976839065552} 11/07/2021 12:55:27 - INFO - __main__ - Step 111544: {'lr': 7.877426956409082e-05, 'samples': 21416448, 'steps': 111543, 'loss/train': 2.0352401733398438} 11/07/2021 12:55:28 - INFO - __main__ - Step 111545: {'lr': 7.877040292655165e-05, 'samples': 21416640, 'steps': 111544, 'loss/train': 1.2414149045944214} 11/07/2021 12:55:28 - INFO - __main__ - Step 111546: {'lr': 7.876653636616582e-05, 'samples': 21416832, 'steps': 111545, 'loss/train': 1.2958179712295532} 11/07/2021 12:55:28 - INFO - __main__ - Step 111547: {'lr': 7.876266988293515e-05, 'samples': 21417024, 'steps': 111546, 'loss/train': 1.53633451461792} 11/07/2021 12:55:29 - INFO - __main__ - Step 111548: {'lr': 7.87588034768614e-05, 'samples': 21417216, 'steps': 111547, 'loss/train': 1.4383805990219116} 11/07/2021 12:55:29 - INFO - __main__ - Step 111549: {'lr': 7.875493714794627e-05, 'samples': 21417408, 'steps': 111548, 'loss/train': 0.9807944893836975} 11/07/2021 12:55:30 - INFO - __main__ - Step 111550: {'lr': 7.875107089619152e-05, 'samples': 21417600, 'steps': 111549, 'loss/train': 1.1765401363372803} 11/07/2021 12:55:30 - INFO - __main__ - Step 111551: {'lr': 7.874720472159891e-05, 'samples': 21417792, 'steps': 111550, 'loss/train': 0.8905967473983765} 11/07/2021 12:55:31 - INFO - __main__ - Step 111552: {'lr': 7.874333862417016e-05, 'samples': 21417984, 'steps': 111551, 'loss/train': 1.517830491065979} 11/07/2021 12:55:31 - INFO - __main__ - Step 111553: {'lr': 7.873947260390701e-05, 'samples': 21418176, 'steps': 111552, 'loss/train': 1.0231744050979614} 11/07/2021 12:55:31 - INFO - __main__ - Step 111554: {'lr': 7.87356066608112e-05, 'samples': 21418368, 'steps': 111553, 'loss/train': 0.9071043729782104} 11/07/2021 12:55:32 - INFO - __main__ - Step 111555: {'lr': 7.873174079488452e-05, 'samples': 21418560, 'steps': 111554, 'loss/train': 1.1376545429229736} 11/07/2021 12:55:33 - INFO - __main__ - Step 111556: {'lr': 7.872787500612874e-05, 'samples': 21418752, 'steps': 111555, 'loss/train': 1.3863136768341064} 11/07/2021 12:55:33 - INFO - __main__ - Step 111557: {'lr': 7.872400929454543e-05, 'samples': 21418944, 'steps': 111556, 'loss/train': 0.8104294538497925} 11/07/2021 12:55:34 - INFO - __main__ - Step 111558: {'lr': 7.872014366013647e-05, 'samples': 21419136, 'steps': 111557, 'loss/train': 0.6543984413146973} 11/07/2021 12:55:34 - INFO - __main__ - Step 111559: {'lr': 7.871627810290355e-05, 'samples': 21419328, 'steps': 111558, 'loss/train': 1.4592348337173462} 11/07/2021 12:55:34 - INFO - __main__ - Step 111560: {'lr': 7.871241262284845e-05, 'samples': 21419520, 'steps': 111559, 'loss/train': 1.5856999158859253} 11/07/2021 12:55:35 - INFO - __main__ - Step 111561: {'lr': 7.870854721997289e-05, 'samples': 21419712, 'steps': 111560, 'loss/train': 1.5113155841827393} 11/07/2021 12:55:36 - INFO - __main__ - Step 111562: {'lr': 7.87046818942786e-05, 'samples': 21419904, 'steps': 111561, 'loss/train': 1.3182727098464966} 11/07/2021 12:55:36 - INFO - __main__ - Step 111563: {'lr': 7.870081664576737e-05, 'samples': 21420096, 'steps': 111562, 'loss/train': 1.2898951768875122} 11/07/2021 12:55:36 - INFO - __main__ - Step 111564: {'lr': 7.869695147444087e-05, 'samples': 21420288, 'steps': 111563, 'loss/train': 1.1778262853622437} 11/07/2021 12:55:37 - INFO - __main__ - Step 111565: {'lr': 7.86930863803009e-05, 'samples': 21420480, 'steps': 111564, 'loss/train': 1.5216056108474731} 11/07/2021 12:55:38 - INFO - __main__ - Step 111566: {'lr': 7.868922136334919e-05, 'samples': 21420672, 'steps': 111565, 'loss/train': 1.3518555164337158} 11/07/2021 12:55:38 - INFO - __main__ - Step 111567: {'lr': 7.868535642358746e-05, 'samples': 21420864, 'steps': 111566, 'loss/train': 2.2207257747650146} 11/07/2021 12:55:39 - INFO - __main__ - Step 111568: {'lr': 7.868149156101748e-05, 'samples': 21421056, 'steps': 111567, 'loss/train': 1.0504307746887207} 11/07/2021 12:55:39 - INFO - __main__ - Step 111569: {'lr': 7.867762677564094e-05, 'samples': 21421248, 'steps': 111568, 'loss/train': 1.4736754894256592} 11/07/2021 12:55:39 - INFO - __main__ - Step 111570: {'lr': 7.867376206745974e-05, 'samples': 21421440, 'steps': 111569, 'loss/train': 1.1957839727401733} 11/07/2021 12:55:40 - INFO - __main__ - Step 111571: {'lr': 7.86698974364754e-05, 'samples': 21421632, 'steps': 111570, 'loss/train': 1.5100995302200317} 11/07/2021 12:55:41 - INFO - __main__ - Step 111572: {'lr': 7.866603288268976e-05, 'samples': 21421824, 'steps': 111571, 'loss/train': 1.1900579929351807} 11/07/2021 12:55:41 - INFO - __main__ - Step 111573: {'lr': 7.866216840610455e-05, 'samples': 21422016, 'steps': 111572, 'loss/train': 0.9917522668838501} 11/07/2021 12:55:41 - INFO - __main__ - Step 111574: {'lr': 7.865830400672152e-05, 'samples': 21422208, 'steps': 111573, 'loss/train': 1.7892756462097168} 11/07/2021 12:55:42 - INFO - __main__ - Step 111575: {'lr': 7.865443968454245e-05, 'samples': 21422400, 'steps': 111574, 'loss/train': 1.7736190557479858} 11/07/2021 12:55:42 - INFO - __main__ - Step 111576: {'lr': 7.865057543956902e-05, 'samples': 21422592, 'steps': 111575, 'loss/train': 1.4199156761169434} 11/07/2021 12:55:43 - INFO - __main__ - Step 111577: {'lr': 7.8646711271803e-05, 'samples': 21422784, 'steps': 111576, 'loss/train': 1.4993500709533691} 11/07/2021 12:55:44 - INFO - __main__ - Step 111578: {'lr': 7.864284718124615e-05, 'samples': 21422976, 'steps': 111577, 'loss/train': 2.0082292556762695} 11/07/2021 12:55:44 - INFO - __main__ - Step 111579: {'lr': 7.863898316790016e-05, 'samples': 21423168, 'steps': 111578, 'loss/train': 1.4016281366348267} 11/07/2021 12:55:44 - INFO - __main__ - Step 111580: {'lr': 7.86351192317668e-05, 'samples': 21423360, 'steps': 111579, 'loss/train': 1.330757975578308} 11/07/2021 12:55:45 - INFO - __main__ - Step 111581: {'lr': 7.863125537284783e-05, 'samples': 21423552, 'steps': 111580, 'loss/train': 1.5664821863174438} 11/07/2021 12:55:46 - INFO - __main__ - Step 111582: {'lr': 7.862739159114496e-05, 'samples': 21423744, 'steps': 111581, 'loss/train': 1.3749938011169434} 11/07/2021 12:55:46 - INFO - __main__ - Step 111583: {'lr': 7.862352788666003e-05, 'samples': 21423936, 'steps': 111582, 'loss/train': 1.5131710767745972} 11/07/2021 12:55:46 - INFO - __main__ - Step 111584: {'lr': 7.86196642593946e-05, 'samples': 21424128, 'steps': 111583, 'loss/train': 1.2271016836166382} 11/07/2021 12:55:47 - INFO - __main__ - Step 111585: {'lr': 7.861580070935051e-05, 'samples': 21424320, 'steps': 111584, 'loss/train': 1.3538684844970703} 11/07/2021 12:55:47 - INFO - __main__ - Step 111586: {'lr': 7.861193723652951e-05, 'samples': 21424512, 'steps': 111585, 'loss/train': 0.5177410244941711} 11/07/2021 12:55:48 - INFO - __main__ - Step 111587: {'lr': 7.86080738409333e-05, 'samples': 21424704, 'steps': 111586, 'loss/train': 0.9630221724510193} 11/07/2021 12:55:49 - INFO - __main__ - Step 111588: {'lr': 7.860421052256366e-05, 'samples': 21424896, 'steps': 111587, 'loss/train': 1.5410974025726318} 11/07/2021 12:55:49 - INFO - __main__ - Step 111589: {'lr': 7.860034728142231e-05, 'samples': 21425088, 'steps': 111588, 'loss/train': 1.4691542387008667} 11/07/2021 12:55:49 - INFO - __main__ - Step 111590: {'lr': 7.859648411751103e-05, 'samples': 21425280, 'steps': 111589, 'loss/train': 1.3779407739639282} 11/07/2021 12:55:50 - INFO - __main__ - Step 111591: {'lr': 7.85926210308315e-05, 'samples': 21425472, 'steps': 111590, 'loss/train': 0.6796651482582092} 11/07/2021 12:55:51 - INFO - __main__ - Step 111592: {'lr': 7.858875802138552e-05, 'samples': 21425664, 'steps': 111591, 'loss/train': 1.3583241701126099} 11/07/2021 12:55:51 - INFO - __main__ - Step 111593: {'lr': 7.858489508917477e-05, 'samples': 21425856, 'steps': 111592, 'loss/train': 0.3918403685092926} 11/07/2021 12:55:51 - INFO - __main__ - Step 111594: {'lr': 7.858103223420101e-05, 'samples': 21426048, 'steps': 111593, 'loss/train': 1.4101828336715698} 11/07/2021 12:55:52 - INFO - __main__ - Step 111595: {'lr': 7.857716945646603e-05, 'samples': 21426240, 'steps': 111594, 'loss/train': 1.273740291595459} 11/07/2021 12:55:52 - INFO - __main__ - Step 111596: {'lr': 7.857330675597152e-05, 'samples': 21426432, 'steps': 111595, 'loss/train': 1.2833701372146606} 11/07/2021 12:55:52 - INFO - __main__ - Step 111597: {'lr': 7.85694441327193e-05, 'samples': 21426624, 'steps': 111596, 'loss/train': 1.2783843278884888} 11/07/2021 12:55:54 - INFO - __main__ - Step 111598: {'lr': 7.856558158671095e-05, 'samples': 21426816, 'steps': 111597, 'loss/train': 1.2787775993347168} 11/07/2021 12:55:54 - INFO - __main__ - Step 111599: {'lr': 7.856171911794834e-05, 'samples': 21427008, 'steps': 111598, 'loss/train': 1.7505501508712769} 11/07/2021 12:55:55 - INFO - __main__ - Step 111600: {'lr': 7.855785672643315e-05, 'samples': 21427200, 'steps': 111599, 'loss/train': 1.7493486404418945} 11/07/2021 12:55:55 - INFO - __main__ - Step 111601: {'lr': 7.855399441216716e-05, 'samples': 21427392, 'steps': 111600, 'loss/train': 1.181374192237854} 11/07/2021 12:55:55 - INFO - __main__ - Step 111602: {'lr': 7.855013217515209e-05, 'samples': 21427584, 'steps': 111601, 'loss/train': 1.4636636972427368} 11/07/2021 12:55:56 - INFO - __main__ - Step 111603: {'lr': 7.854627001538966e-05, 'samples': 21427776, 'steps': 111602, 'loss/train': 0.2325226068496704} 11/07/2021 12:55:56 - INFO - __main__ - Step 111604: {'lr': 7.854240793288167e-05, 'samples': 21427968, 'steps': 111603, 'loss/train': 0.6950873136520386} 11/07/2021 12:55:57 - INFO - __main__ - Step 111605: {'lr': 7.853854592762983e-05, 'samples': 21428160, 'steps': 111604, 'loss/train': 1.7627614736557007} 11/07/2021 12:55:57 - INFO - __main__ - Step 111606: {'lr': 7.853468399963584e-05, 'samples': 21428352, 'steps': 111605, 'loss/train': 1.5435394048690796} 11/07/2021 12:55:58 - INFO - __main__ - Step 111607: {'lr': 7.85308221489015e-05, 'samples': 21428544, 'steps': 111606, 'loss/train': 0.7238286137580872} 11/07/2021 12:55:58 - INFO - __main__ - Step 111608: {'lr': 7.85269603754285e-05, 'samples': 21428736, 'steps': 111607, 'loss/train': 0.8588588237762451} 11/07/2021 12:55:59 - INFO - __main__ - Step 111609: {'lr': 7.852309867921864e-05, 'samples': 21428928, 'steps': 111608, 'loss/train': 1.0944101810455322} 11/07/2021 12:55:59 - INFO - __main__ - Step 111610: {'lr': 7.85192370602737e-05, 'samples': 21429120, 'steps': 111609, 'loss/train': 1.4173856973648071} 11/07/2021 12:56:00 - INFO - __main__ - Step 111611: {'lr': 7.851537551859525e-05, 'samples': 21429312, 'steps': 111610, 'loss/train': 1.1087745428085327} 11/07/2021 12:56:00 - INFO - __main__ - Step 111612: {'lr': 7.85115140541851e-05, 'samples': 21429504, 'steps': 111611, 'loss/train': 1.6792948246002197} 11/07/2021 12:56:01 - INFO - __main__ - Step 111613: {'lr': 7.850765266704507e-05, 'samples': 21429696, 'steps': 111612, 'loss/train': 1.7707465887069702} 11/07/2021 12:56:01 - INFO - __main__ - Step 111614: {'lr': 7.850379135717681e-05, 'samples': 21429888, 'steps': 111613, 'loss/train': 1.4340392351150513} 11/07/2021 12:56:02 - INFO - __main__ - Step 111615: {'lr': 7.849993012458211e-05, 'samples': 21430080, 'steps': 111614, 'loss/train': 1.2329140901565552} 11/07/2021 12:56:02 - INFO - __main__ - Step 111616: {'lr': 7.84960689692627e-05, 'samples': 21430272, 'steps': 111615, 'loss/train': 1.2692209482192993} 11/07/2021 12:56:03 - INFO - __main__ - Step 111617: {'lr': 7.84922078912203e-05, 'samples': 21430464, 'steps': 111616, 'loss/train': 1.4191745519638062} 11/07/2021 12:56:03 - INFO - __main__ - Step 111618: {'lr': 7.848834689045667e-05, 'samples': 21430656, 'steps': 111617, 'loss/train': 1.409928560256958} 11/07/2021 12:56:03 - INFO - __main__ - Step 111619: {'lr': 7.848448596697355e-05, 'samples': 21430848, 'steps': 111618, 'loss/train': 1.5289506912231445} 11/07/2021 12:56:04 - INFO - __main__ - Step 111620: {'lr': 7.848062512077267e-05, 'samples': 21431040, 'steps': 111619, 'loss/train': 0.5750308632850647} 11/07/2021 12:56:05 - INFO - __main__ - Step 111621: {'lr': 7.847676435185577e-05, 'samples': 21431232, 'steps': 111620, 'loss/train': 1.7739609479904175} 11/07/2021 12:56:05 - INFO - __main__ - Step 111622: {'lr': 7.847290366022459e-05, 'samples': 21431424, 'steps': 111621, 'loss/train': 1.4093602895736694} 11/07/2021 12:56:05 - INFO - __main__ - Step 111623: {'lr': 7.846904304588096e-05, 'samples': 21431616, 'steps': 111622, 'loss/train': 0.9250414967536926} 11/07/2021 12:56:06 - INFO - __main__ - Step 111624: {'lr': 7.846518250882645e-05, 'samples': 21431808, 'steps': 111623, 'loss/train': 1.9021302461624146} 11/07/2021 12:56:07 - INFO - __main__ - Step 111625: {'lr': 7.84613220490629e-05, 'samples': 21432000, 'steps': 111624, 'loss/train': 1.4193310737609863} 11/07/2021 12:56:07 - INFO - __main__ - Step 111626: {'lr': 7.845746166659201e-05, 'samples': 21432192, 'steps': 111625, 'loss/train': 1.363284945487976} 11/07/2021 12:56:08 - INFO - __main__ - Step 111627: {'lr': 7.845360136141556e-05, 'samples': 21432384, 'steps': 111626, 'loss/train': 1.7274754047393799} 11/07/2021 12:56:08 - INFO - __main__ - Step 111628: {'lr': 7.844974113353523e-05, 'samples': 21432576, 'steps': 111627, 'loss/train': 1.3186804056167603} 11/07/2021 12:56:08 - INFO - __main__ - Step 111629: {'lr': 7.844588098295283e-05, 'samples': 21432768, 'steps': 111628, 'loss/train': 1.804366111755371} 11/07/2021 12:56:09 - INFO - __main__ - Step 111630: {'lr': 7.844202090967006e-05, 'samples': 21432960, 'steps': 111629, 'loss/train': 1.4902026653289795} 11/07/2021 12:56:10 - INFO - __main__ - Step 111631: {'lr': 7.843816091368866e-05, 'samples': 21433152, 'steps': 111630, 'loss/train': 1.1606078147888184} 11/07/2021 12:56:10 - INFO - __main__ - Step 111632: {'lr': 7.84343009950104e-05, 'samples': 21433344, 'steps': 111631, 'loss/train': 1.1313714981079102} 11/07/2021 12:56:10 - INFO - __main__ - Step 111633: {'lr': 7.843044115363698e-05, 'samples': 21433536, 'steps': 111632, 'loss/train': 1.5796160697937012} 11/07/2021 12:56:11 - INFO - __main__ - Step 111634: {'lr': 7.842658138957018e-05, 'samples': 21433728, 'steps': 111633, 'loss/train': 1.1919018030166626} 11/07/2021 12:56:11 - INFO - __main__ - Step 111635: {'lr': 7.842272170281168e-05, 'samples': 21433920, 'steps': 111634, 'loss/train': 1.7157771587371826} 11/07/2021 12:56:12 - INFO - __main__ - Step 111636: {'lr': 7.841886209336327e-05, 'samples': 21434112, 'steps': 111635, 'loss/train': 1.083540678024292} 11/07/2021 12:56:13 - INFO - __main__ - Step 111637: {'lr': 7.841500256122674e-05, 'samples': 21434304, 'steps': 111636, 'loss/train': 1.7357600927352905} 11/07/2021 12:56:13 - INFO - __main__ - Step 111638: {'lr': 7.841114310640371e-05, 'samples': 21434496, 'steps': 111637, 'loss/train': 1.1942869424819946} 11/07/2021 12:56:13 - INFO - __main__ - Step 111639: {'lr': 7.840728372889597e-05, 'samples': 21434688, 'steps': 111638, 'loss/train': 1.3674196004867554} 11/07/2021 12:56:14 - INFO - __main__ - Step 111640: {'lr': 7.840342442870524e-05, 'samples': 21434880, 'steps': 111639, 'loss/train': 1.1847878694534302} 11/07/2021 12:56:15 - INFO - __main__ - Step 111641: {'lr': 7.839956520583327e-05, 'samples': 21435072, 'steps': 111640, 'loss/train': 2.1029160022735596} 11/07/2021 12:56:15 - INFO - __main__ - Step 111642: {'lr': 7.839570606028185e-05, 'samples': 21435264, 'steps': 111641, 'loss/train': 1.4268354177474976} 11/07/2021 12:56:15 - INFO - __main__ - Step 111643: {'lr': 7.839184699205263e-05, 'samples': 21435456, 'steps': 111642, 'loss/train': 1.4569356441497803} 11/07/2021 12:56:16 - INFO - __main__ - Step 111644: {'lr': 7.838798800114741e-05, 'samples': 21435648, 'steps': 111643, 'loss/train': 1.576712965965271} 11/07/2021 12:56:16 - INFO - __main__ - Step 111645: {'lr': 7.838412908756792e-05, 'samples': 21435840, 'steps': 111644, 'loss/train': 1.529646635055542} 11/07/2021 12:56:17 - INFO - __main__ - Step 111646: {'lr': 7.838027025131592e-05, 'samples': 21436032, 'steps': 111645, 'loss/train': 1.008934497833252} 11/07/2021 12:56:17 - INFO - __main__ - Step 111647: {'lr': 7.837641149239308e-05, 'samples': 21436224, 'steps': 111646, 'loss/train': 1.1704376935958862} 11/07/2021 12:56:18 - INFO - __main__ - Step 111648: {'lr': 7.837255281080119e-05, 'samples': 21436416, 'steps': 111647, 'loss/train': 1.046692967414856} 11/07/2021 12:56:18 - INFO - __main__ - Step 111649: {'lr': 7.8368694206542e-05, 'samples': 21436608, 'steps': 111648, 'loss/train': 1.0347952842712402} 11/07/2021 12:56:18 - INFO - __main__ - Step 111650: {'lr': 7.836483567961727e-05, 'samples': 21436800, 'steps': 111649, 'loss/train': 1.2889777421951294} 11/07/2021 12:56:20 - INFO - __main__ - Step 111651: {'lr': 7.836097723002866e-05, 'samples': 21436992, 'steps': 111650, 'loss/train': 1.133056402206421} 11/07/2021 12:56:20 - INFO - __main__ - Step 111652: {'lr': 7.83571188577779e-05, 'samples': 21437184, 'steps': 111651, 'loss/train': 0.7303914427757263} 11/07/2021 12:56:21 - INFO - __main__ - Step 111653: {'lr': 7.835326056286682e-05, 'samples': 21437376, 'steps': 111652, 'loss/train': 1.5743736028671265} 11/07/2021 12:56:21 - INFO - __main__ - Step 111654: {'lr': 7.834940234529709e-05, 'samples': 21437568, 'steps': 111653, 'loss/train': 1.4373537302017212} 11/07/2021 12:56:21 - INFO - __main__ - Step 111655: {'lr': 7.834554420507048e-05, 'samples': 21437760, 'steps': 111654, 'loss/train': 1.211174488067627} 11/07/2021 12:56:22 - INFO - __main__ - Step 111656: {'lr': 7.83416861421887e-05, 'samples': 21437952, 'steps': 111655, 'loss/train': 1.2655179500579834} 11/07/2021 12:56:23 - INFO - __main__ - Step 111657: {'lr': 7.833782815665353e-05, 'samples': 21438144, 'steps': 111656, 'loss/train': 0.6213098168373108} 11/07/2021 12:56:23 - INFO - __main__ - Step 111658: {'lr': 7.833397024846666e-05, 'samples': 21438336, 'steps': 111657, 'loss/train': 1.4208332300186157} 11/07/2021 12:56:23 - INFO - __main__ - Step 111659: {'lr': 7.833011241762988e-05, 'samples': 21438528, 'steps': 111658, 'loss/train': 0.870498538017273} 11/07/2021 12:56:24 - INFO - __main__ - Step 111660: {'lr': 7.83262546641449e-05, 'samples': 21438720, 'steps': 111659, 'loss/train': 1.6702181100845337} 11/07/2021 12:56:24 - INFO - __main__ - Step 111661: {'lr': 7.832239698801344e-05, 'samples': 21438912, 'steps': 111660, 'loss/train': 0.9546363949775696} 11/07/2021 12:56:25 - INFO - __main__ - Step 111662: {'lr': 7.831853938923727e-05, 'samples': 21439104, 'steps': 111661, 'loss/train': 1.5966825485229492} 11/07/2021 12:56:25 - INFO - __main__ - Step 111663: {'lr': 7.831468186781812e-05, 'samples': 21439296, 'steps': 111662, 'loss/train': 1.4127938747406006} 11/07/2021 12:56:26 - INFO - __main__ - Step 111664: {'lr': 7.831082442375778e-05, 'samples': 21439488, 'steps': 111663, 'loss/train': 1.741213321685791} 11/07/2021 12:56:26 - INFO - __main__ - Step 111665: {'lr': 7.830696705705789e-05, 'samples': 21439680, 'steps': 111664, 'loss/train': 1.696073055267334} 11/07/2021 12:56:26 - INFO - __main__ - Step 111666: {'lr': 7.830310976772021e-05, 'samples': 21439872, 'steps': 111665, 'loss/train': 1.29892897605896} 11/07/2021 12:56:28 - INFO - __main__ - Step 111667: {'lr': 7.829925255574652e-05, 'samples': 21440064, 'steps': 111666, 'loss/train': 1.5196973085403442} 11/07/2021 12:56:28 - INFO - __main__ - Step 111668: {'lr': 7.829539542113851e-05, 'samples': 21440256, 'steps': 111667, 'loss/train': 1.3678888082504272} 11/07/2021 12:56:29 - INFO - __main__ - Step 111669: {'lr': 7.829153836389796e-05, 'samples': 21440448, 'steps': 111668, 'loss/train': 1.3665248155593872} 11/07/2021 12:56:29 - INFO - __main__ - Step 111670: {'lr': 7.828768138402659e-05, 'samples': 21440640, 'steps': 111669, 'loss/train': 0.07374546676874161} 11/07/2021 12:56:29 - INFO - __main__ - Step 111671: {'lr': 7.828382448152615e-05, 'samples': 21440832, 'steps': 111670, 'loss/train': 1.1316888332366943} 11/07/2021 12:56:30 - INFO - __main__ - Step 111672: {'lr': 7.827996765639836e-05, 'samples': 21441024, 'steps': 111671, 'loss/train': 0.966317355632782} 11/07/2021 12:56:31 - INFO - __main__ - Step 111673: {'lr': 7.827611090864495e-05, 'samples': 21441216, 'steps': 111672, 'loss/train': 1.3689167499542236} 11/07/2021 12:56:31 - INFO - __main__ - Step 111674: {'lr': 7.82722542382677e-05, 'samples': 21441408, 'steps': 111673, 'loss/train': 0.7478421330451965} 11/07/2021 12:56:31 - INFO - __main__ - Step 111675: {'lr': 7.826839764526833e-05, 'samples': 21441600, 'steps': 111674, 'loss/train': 1.191166639328003} 11/07/2021 12:56:32 - INFO - __main__ - Step 111676: {'lr': 7.826454112964853e-05, 'samples': 21441792, 'steps': 111675, 'loss/train': 1.267878770828247} 11/07/2021 12:56:32 - INFO - __main__ - Step 111677: {'lr': 7.82606846914102e-05, 'samples': 21441984, 'steps': 111676, 'loss/train': 1.3144519329071045} 11/07/2021 12:56:33 - INFO - __main__ - Step 111678: {'lr': 7.825682833055487e-05, 'samples': 21442176, 'steps': 111677, 'loss/train': 2.8408126831054688} 11/07/2021 12:56:34 - INFO - __main__ - Step 111679: {'lr': 7.825297204708434e-05, 'samples': 21442368, 'steps': 111678, 'loss/train': 0.5709503293037415} 11/07/2021 12:56:34 - INFO - __main__ - Step 111680: {'lr': 7.824911584100037e-05, 'samples': 21442560, 'steps': 111679, 'loss/train': 1.412814974784851} 11/07/2021 12:56:34 - INFO - __main__ - Step 111681: {'lr': 7.824525971230473e-05, 'samples': 21442752, 'steps': 111680, 'loss/train': 1.3399730920791626} 11/07/2021 12:56:35 - INFO - __main__ - Step 111682: {'lr': 7.824140366099907e-05, 'samples': 21442944, 'steps': 111681, 'loss/train': 1.3440779447555542} 11/07/2021 12:56:36 - INFO - __main__ - Step 111683: {'lr': 7.823754768708525e-05, 'samples': 21443136, 'steps': 111682, 'loss/train': 1.314469337463379} 11/07/2021 12:56:36 - INFO - __main__ - Step 111684: {'lr': 7.823369179056489e-05, 'samples': 21443328, 'steps': 111683, 'loss/train': 1.1225805282592773} 11/07/2021 12:56:36 - INFO - __main__ - Step 111685: {'lr': 7.822983597143982e-05, 'samples': 21443520, 'steps': 111684, 'loss/train': 1.16379714012146} 11/07/2021 12:56:37 - INFO - __main__ - Step 111686: {'lr': 7.82259802297117e-05, 'samples': 21443712, 'steps': 111685, 'loss/train': 1.4704253673553467} 11/07/2021 12:56:37 - INFO - __main__ - Step 111687: {'lr': 7.82221245653823e-05, 'samples': 21443904, 'steps': 111686, 'loss/train': 1.1859383583068848} 11/07/2021 12:56:38 - INFO - __main__ - Step 111688: {'lr': 7.821826897845338e-05, 'samples': 21444096, 'steps': 111687, 'loss/train': 1.2390570640563965} 11/07/2021 12:56:38 - INFO - __main__ - Step 111689: {'lr': 7.821441346892667e-05, 'samples': 21444288, 'steps': 111688, 'loss/train': 1.6568520069122314} 11/07/2021 12:56:39 - INFO - __main__ - Step 111690: {'lr': 7.821055803680386e-05, 'samples': 21444480, 'steps': 111689, 'loss/train': 1.3645565509796143} 11/07/2021 12:56:39 - INFO - __main__ - Step 111691: {'lr': 7.820670268208682e-05, 'samples': 21444672, 'steps': 111690, 'loss/train': 1.3243085145950317} 11/07/2021 12:56:40 - INFO - __main__ - Step 111692: {'lr': 7.820284740477712e-05, 'samples': 21444864, 'steps': 111691, 'loss/train': 1.4985789060592651} 11/07/2021 12:56:41 - INFO - __main__ - Step 111693: {'lr': 7.819899220487655e-05, 'samples': 21445056, 'steps': 111692, 'loss/train': 1.7182806730270386} 11/07/2021 12:56:41 - INFO - __main__ - Step 111694: {'lr': 7.819513708238684e-05, 'samples': 21445248, 'steps': 111693, 'loss/train': 1.3166755437850952} 11/07/2021 12:56:41 - INFO - __main__ - Step 111695: {'lr': 7.819128203730979e-05, 'samples': 21445440, 'steps': 111694, 'loss/train': 1.4078928232192993} 11/07/2021 12:56:42 - INFO - __main__ - Step 111696: {'lr': 7.81874270696471e-05, 'samples': 21445632, 'steps': 111695, 'loss/train': 1.6719350814819336} 11/07/2021 12:56:42 - INFO - __main__ - Step 111697: {'lr': 7.818357217940048e-05, 'samples': 21445824, 'steps': 111696, 'loss/train': 1.0790187120437622} 11/07/2021 12:56:43 - INFO - __main__ - Step 111698: {'lr': 7.81797173665717e-05, 'samples': 21446016, 'steps': 111697, 'loss/train': 1.3560293912887573} 11/07/2021 12:56:44 - INFO - __main__ - Step 111699: {'lr': 7.817586263116247e-05, 'samples': 21446208, 'steps': 111698, 'loss/train': 1.0301951169967651} 11/07/2021 12:56:44 - INFO - __main__ - Step 111700: {'lr': 7.817200797317458e-05, 'samples': 21446400, 'steps': 111699, 'loss/train': 1.16623055934906} 11/07/2021 12:56:44 - INFO - __main__ - Step 111701: {'lr': 7.816815339260972e-05, 'samples': 21446592, 'steps': 111700, 'loss/train': 0.5120809078216553} 11/07/2021 12:56:45 - INFO - __main__ - Step 111702: {'lr': 7.81642988894696e-05, 'samples': 21446784, 'steps': 111701, 'loss/train': 1.4682151079177856} 11/07/2021 12:56:45 - INFO - __main__ - Step 111703: {'lr': 7.816044446375603e-05, 'samples': 21446976, 'steps': 111702, 'loss/train': 1.197375774383545} 11/07/2021 12:56:46 - INFO - __main__ - Step 111704: {'lr': 7.81565901154708e-05, 'samples': 21447168, 'steps': 111703, 'loss/train': 1.6413425207138062} 11/07/2021 12:56:46 - INFO - __main__ - Step 111705: {'lr': 7.815273584461546e-05, 'samples': 21447360, 'steps': 111704, 'loss/train': 5.663436412811279} 11/07/2021 12:56:47 - INFO - __main__ - Step 111706: {'lr': 7.814888165119186e-05, 'samples': 21447552, 'steps': 111705, 'loss/train': 1.6440176963806152} 11/07/2021 12:56:47 - INFO - __main__ - Step 111707: {'lr': 7.814502753520173e-05, 'samples': 21447744, 'steps': 111706, 'loss/train': 1.417360544204712} 11/07/2021 12:56:48 - INFO - __main__ - Step 111708: {'lr': 7.814117349664676e-05, 'samples': 21447936, 'steps': 111707, 'loss/train': 1.57285475730896} 11/07/2021 12:56:48 - INFO - __main__ - Step 111709: {'lr': 7.813731953552877e-05, 'samples': 21448128, 'steps': 111708, 'loss/train': 1.1806565523147583} 11/07/2021 12:56:49 - INFO - __main__ - Step 111710: {'lr': 7.813346565184943e-05, 'samples': 21448320, 'steps': 111709, 'loss/train': 1.7122398614883423} 11/07/2021 12:56:49 - INFO - __main__ - Step 111711: {'lr': 7.812961184561048e-05, 'samples': 21448512, 'steps': 111710, 'loss/train': 0.9957064986228943} 11/07/2021 12:56:50 - INFO - __main__ - Step 111712: {'lr': 7.812575811681371e-05, 'samples': 21448704, 'steps': 111711, 'loss/train': 1.1467375755310059} 11/07/2021 12:56:50 - INFO - __main__ - Step 111713: {'lr': 7.81219044654608e-05, 'samples': 21448896, 'steps': 111712, 'loss/train': 1.2708966732025146} 11/07/2021 12:56:50 - INFO - __main__ - Step 111714: {'lr': 7.811805089155352e-05, 'samples': 21449088, 'steps': 111713, 'loss/train': 1.081525444984436} 11/07/2021 12:56:51 - INFO - __main__ - Step 111715: {'lr': 7.811419739509359e-05, 'samples': 21449280, 'steps': 111714, 'loss/train': 1.1811727285385132} 11/07/2021 12:56:52 - INFO - __main__ - Step 111716: {'lr': 7.811034397608275e-05, 'samples': 21449472, 'steps': 111715, 'loss/train': 0.8907897472381592} 11/07/2021 12:56:52 - INFO - __main__ - Step 111717: {'lr': 7.810649063452272e-05, 'samples': 21449664, 'steps': 111716, 'loss/train': 0.4596668779850006} 11/07/2021 12:56:53 - INFO - __main__ - Step 111718: {'lr': 7.810263737041534e-05, 'samples': 21449856, 'steps': 111717, 'loss/train': 1.4414584636688232} 11/07/2021 12:56:53 - INFO - __main__ - Step 111719: {'lr': 7.809878418376221e-05, 'samples': 21450048, 'steps': 111718, 'loss/train': 1.3662018775939941} 11/07/2021 12:56:54 - INFO - __main__ - Step 111720: {'lr': 7.809493107456508e-05, 'samples': 21450240, 'steps': 111719, 'loss/train': 1.5244230031967163} 11/07/2021 12:56:54 - INFO - __main__ - Step 111721: {'lr': 7.809107804282572e-05, 'samples': 21450432, 'steps': 111720, 'loss/train': 0.8929114937782288} 11/07/2021 12:56:55 - INFO - __main__ - Step 111722: {'lr': 7.80872250885459e-05, 'samples': 21450624, 'steps': 111721, 'loss/train': 0.9241324663162231} 11/07/2021 12:56:55 - INFO - __main__ - Step 111723: {'lr': 7.808337221172729e-05, 'samples': 21450816, 'steps': 111722, 'loss/train': 0.7126185894012451} 11/07/2021 12:56:55 - INFO - __main__ - Step 111724: {'lr': 7.807951941237168e-05, 'samples': 21451008, 'steps': 111723, 'loss/train': 1.1160398721694946} 11/07/2021 12:56:57 - INFO - __main__ - Step 111725: {'lr': 7.807566669048078e-05, 'samples': 21451200, 'steps': 111724, 'loss/train': 2.271494150161743} 11/07/2021 12:56:57 - INFO - __main__ - Step 111726: {'lr': 7.807181404605634e-05, 'samples': 21451392, 'steps': 111725, 'loss/train': 1.5750131607055664} 11/07/2021 12:56:57 - INFO - __main__ - Step 111727: {'lr': 7.806796147910005e-05, 'samples': 21451584, 'steps': 111726, 'loss/train': 1.3952785730361938} 11/07/2021 12:56:58 - INFO - __main__ - Step 111728: {'lr': 7.806410898961372e-05, 'samples': 21451776, 'steps': 111727, 'loss/train': 1.2315189838409424} 11/07/2021 12:56:58 - INFO - __main__ - Step 111729: {'lr': 7.806025657759905e-05, 'samples': 21451968, 'steps': 111728, 'loss/train': 0.9528037905693054} 11/07/2021 12:56:59 - INFO - __main__ - Step 111730: {'lr': 7.805640424305777e-05, 'samples': 21452160, 'steps': 111729, 'loss/train': 1.1150472164154053} 11/07/2021 12:56:59 - INFO - __main__ - Step 111731: {'lr': 7.805255198599171e-05, 'samples': 21452352, 'steps': 111730, 'loss/train': 1.3884233236312866} 11/07/2021 12:57:00 - INFO - __main__ - Step 111732: {'lr': 7.804869980640242e-05, 'samples': 21452544, 'steps': 111731, 'loss/train': 1.2668228149414062} 11/07/2021 12:57:00 - INFO - __main__ - Step 111733: {'lr': 7.804484770429174e-05, 'samples': 21452736, 'steps': 111732, 'loss/train': 1.40778648853302} 11/07/2021 12:57:00 - INFO - __main__ - Step 111734: {'lr': 7.804099567966139e-05, 'samples': 21452928, 'steps': 111733, 'loss/train': 1.6712208986282349} 11/07/2021 12:57:02 - INFO - __main__ - Step 111735: {'lr': 7.803714373251311e-05, 'samples': 21453120, 'steps': 111734, 'loss/train': 1.3948007822036743} 11/07/2021 12:57:02 - INFO - __main__ - Step 111736: {'lr': 7.803329186284866e-05, 'samples': 21453312, 'steps': 111735, 'loss/train': 1.4777973890304565} 11/07/2021 12:57:02 - INFO - __main__ - Step 111737: {'lr': 7.802944007066973e-05, 'samples': 21453504, 'steps': 111736, 'loss/train': 1.292580485343933} 11/07/2021 12:57:03 - INFO - __main__ - Step 111738: {'lr': 7.802558835597809e-05, 'samples': 21453696, 'steps': 111737, 'loss/train': 1.2444162368774414} 11/07/2021 12:57:03 - INFO - __main__ - Step 111739: {'lr': 7.802173671877547e-05, 'samples': 21453888, 'steps': 111738, 'loss/train': 1.1537936925888062} 11/07/2021 12:57:03 - INFO - __main__ - Step 111740: {'lr': 7.801788515906361e-05, 'samples': 21454080, 'steps': 111739, 'loss/train': 1.3681954145431519} 11/07/2021 12:57:06 - INFO - __main__ - Step 111741: {'lr': 7.801403367684423e-05, 'samples': 21454272, 'steps': 111740, 'loss/train': 2.2117695808410645} 11/07/2021 12:57:06 - INFO - __main__ - Step 111742: {'lr': 7.801018227211906e-05, 'samples': 21454464, 'steps': 111741, 'loss/train': 1.9900307655334473} 11/07/2021 12:57:06 - INFO - __main__ - Step 111743: {'lr': 7.800633094488987e-05, 'samples': 21454656, 'steps': 111742, 'loss/train': 1.6839544773101807} 11/07/2021 12:57:07 - INFO - __main__ - Step 111744: {'lr': 7.800247969515845e-05, 'samples': 21454848, 'steps': 111743, 'loss/train': 1.144332766532898} 11/07/2021 12:57:07 - INFO - __main__ - Step 111745: {'lr': 7.799862852292635e-05, 'samples': 21455040, 'steps': 111744, 'loss/train': 0.8948817253112793} 11/07/2021 12:57:07 - INFO - __main__ - Step 111746: {'lr': 7.799477742819544e-05, 'samples': 21455232, 'steps': 111745, 'loss/train': 1.756209135055542} 11/07/2021 12:57:08 - INFO - __main__ - Step 111747: {'lr': 7.799092641096742e-05, 'samples': 21455424, 'steps': 111746, 'loss/train': 1.7596395015716553} 11/07/2021 12:57:08 - INFO - __main__ - Step 111748: {'lr': 7.798707547124404e-05, 'samples': 21455616, 'steps': 111747, 'loss/train': 1.7708452939987183} 11/07/2021 12:57:09 - INFO - __main__ - Step 111749: {'lr': 7.798322460902704e-05, 'samples': 21455808, 'steps': 111748, 'loss/train': 1.7718048095703125} 11/07/2021 12:57:09 - INFO - __main__ - Step 111750: {'lr': 7.797937382431813e-05, 'samples': 21456000, 'steps': 111749, 'loss/train': 1.2369927167892456} 11/07/2021 12:57:10 - INFO - __main__ - Step 111751: {'lr': 7.797552311711905e-05, 'samples': 21456192, 'steps': 111750, 'loss/train': 1.6740872859954834} 11/07/2021 12:57:10 - INFO - __main__ - Step 111752: {'lr': 7.797167248743156e-05, 'samples': 21456384, 'steps': 111751, 'loss/train': 1.256893277168274} 11/07/2021 12:57:11 - INFO - __main__ - Step 111753: {'lr': 7.79678219352574e-05, 'samples': 21456576, 'steps': 111752, 'loss/train': 1.1192430257797241} 11/07/2021 12:57:12 - INFO - __main__ - Step 111754: {'lr': 7.796397146059824e-05, 'samples': 21456768, 'steps': 111753, 'loss/train': 1.2062393426895142} 11/07/2021 12:57:12 - INFO - __main__ - Step 111755: {'lr': 7.796012106345587e-05, 'samples': 21456960, 'steps': 111754, 'loss/train': 0.6274598240852356} 11/07/2021 12:57:13 - INFO - __main__ - Step 111756: {'lr': 7.795627074383204e-05, 'samples': 21457152, 'steps': 111755, 'loss/train': 1.3766850233078003} 11/07/2021 12:57:13 - INFO - __main__ - Step 111757: {'lr': 7.795242050172844e-05, 'samples': 21457344, 'steps': 111756, 'loss/train': 1.117336630821228} 11/07/2021 12:57:13 - INFO - __main__ - Step 111758: {'lr': 7.794857033714691e-05, 'samples': 21457536, 'steps': 111757, 'loss/train': 1.3405141830444336} 11/07/2021 12:57:14 - INFO - __main__ - Step 111759: {'lr': 7.794472025008903e-05, 'samples': 21457728, 'steps': 111758, 'loss/train': 0.5606338381767273} 11/07/2021 12:57:15 - INFO - __main__ - Step 111760: {'lr': 7.79408702405566e-05, 'samples': 21457920, 'steps': 111759, 'loss/train': 1.2004704475402832} 11/07/2021 12:57:15 - INFO - __main__ - Step 111761: {'lr': 7.793702030855135e-05, 'samples': 21458112, 'steps': 111760, 'loss/train': 0.9795536994934082} 11/07/2021 12:57:15 - INFO - __main__ - Step 111762: {'lr': 7.793317045407502e-05, 'samples': 21458304, 'steps': 111761, 'loss/train': 0.9452747702598572} 11/07/2021 12:57:16 - INFO - __main__ - Step 111763: {'lr': 7.792932067712935e-05, 'samples': 21458496, 'steps': 111762, 'loss/train': 1.2647093534469604} 11/07/2021 12:57:17 - INFO - __main__ - Step 111764: {'lr': 7.792547097771608e-05, 'samples': 21458688, 'steps': 111763, 'loss/train': 1.6624807119369507} 11/07/2021 12:57:17 - INFO - __main__ - Step 111765: {'lr': 7.792162135583694e-05, 'samples': 21458880, 'steps': 111764, 'loss/train': 1.3776464462280273} 11/07/2021 12:57:18 - INFO - __main__ - Step 111766: {'lr': 7.791777181149364e-05, 'samples': 21459072, 'steps': 111765, 'loss/train': 0.9792373776435852} 11/07/2021 12:57:18 - INFO - __main__ - Step 111767: {'lr': 7.791392234468797e-05, 'samples': 21459264, 'steps': 111766, 'loss/train': 1.3969794511795044} 11/07/2021 12:57:18 - INFO - __main__ - Step 111768: {'lr': 7.79100729554216e-05, 'samples': 21459456, 'steps': 111767, 'loss/train': 1.6324914693832397} 11/07/2021 12:57:19 - INFO - __main__ - Step 111769: {'lr': 7.790622364369632e-05, 'samples': 21459648, 'steps': 111768, 'loss/train': 1.228520154953003} 11/07/2021 12:57:20 - INFO - __main__ - Step 111770: {'lr': 7.790237440951389e-05, 'samples': 21459840, 'steps': 111769, 'loss/train': 1.214124321937561} 11/07/2021 12:57:20 - INFO - __main__ - Step 111771: {'lr': 7.789852525287593e-05, 'samples': 21460032, 'steps': 111770, 'loss/train': 1.3783955574035645} 11/07/2021 12:57:20 - INFO - __main__ - Step 111772: {'lr': 7.789467617378426e-05, 'samples': 21460224, 'steps': 111771, 'loss/train': 1.4314377307891846} 11/07/2021 12:57:21 - INFO - __main__ - Step 111773: {'lr': 7.789082717224058e-05, 'samples': 21460416, 'steps': 111772, 'loss/train': 1.3790065050125122} 11/07/2021 12:57:22 - INFO - __main__ - Step 111774: {'lr': 7.788697824824664e-05, 'samples': 21460608, 'steps': 111773, 'loss/train': 0.8801288604736328} 11/07/2021 12:57:22 - INFO - __main__ - Step 111775: {'lr': 7.788312940180417e-05, 'samples': 21460800, 'steps': 111774, 'loss/train': 1.2187458276748657} 11/07/2021 12:57:22 - INFO - __main__ - Step 111776: {'lr': 7.787928063291489e-05, 'samples': 21460992, 'steps': 111775, 'loss/train': 1.593963861465454} 11/07/2021 12:57:23 - INFO - __main__ - Step 111777: {'lr': 7.787543194158056e-05, 'samples': 21461184, 'steps': 111776, 'loss/train': 1.8039659261703491} 11/07/2021 12:57:23 - INFO - __main__ - Step 111778: {'lr': 7.787158332780292e-05, 'samples': 21461376, 'steps': 111777, 'loss/train': 0.8765349984169006} 11/07/2021 12:57:23 - INFO - __main__ - Step 111779: {'lr': 7.786773479158365e-05, 'samples': 21461568, 'steps': 111778, 'loss/train': 1.3857204914093018} 11/07/2021 12:57:25 - INFO - __main__ - Step 111780: {'lr': 7.786388633292457e-05, 'samples': 21461760, 'steps': 111779, 'loss/train': 1.4329047203063965} 11/07/2021 12:57:25 - INFO - __main__ - Step 111781: {'lr': 7.786003795182742e-05, 'samples': 21461952, 'steps': 111780, 'loss/train': 1.174928903579712} 11/07/2021 12:57:25 - INFO - __main__ - Step 111782: {'lr': 7.78561896482938e-05, 'samples': 21462144, 'steps': 111781, 'loss/train': 1.5310118198394775} 11/07/2021 12:57:26 - INFO - __main__ - Step 111783: {'lr': 7.785234142232552e-05, 'samples': 21462336, 'steps': 111782, 'loss/train': 1.2604173421859741} 11/07/2021 12:57:26 - INFO - __main__ - Step 111784: {'lr': 7.784849327392432e-05, 'samples': 21462528, 'steps': 111783, 'loss/train': 1.7722889184951782} 11/07/2021 12:57:27 - INFO - __main__ - Step 111785: {'lr': 7.784464520309196e-05, 'samples': 21462720, 'steps': 111784, 'loss/train': 1.5119659900665283} 11/07/2021 12:57:27 - INFO - __main__ - Step 111786: {'lr': 7.784079720983012e-05, 'samples': 21462912, 'steps': 111785, 'loss/train': 1.5213954448699951} 11/07/2021 12:57:28 - INFO - __main__ - Step 111787: {'lr': 7.783694929414056e-05, 'samples': 21463104, 'steps': 111786, 'loss/train': 1.6822019815444946} 11/07/2021 12:57:28 - INFO - __main__ - Step 111788: {'lr': 7.783310145602502e-05, 'samples': 21463296, 'steps': 111787, 'loss/train': 1.8514176607131958} 11/07/2021 12:57:28 - INFO - __main__ - Step 111789: {'lr': 7.782925369548524e-05, 'samples': 21463488, 'steps': 111788, 'loss/train': 1.0011868476867676} 11/07/2021 12:57:29 - INFO - __main__ - Step 111790: {'lr': 7.782540601252291e-05, 'samples': 21463680, 'steps': 111789, 'loss/train': 1.6932353973388672} 11/07/2021 12:57:30 - INFO - __main__ - Step 111791: {'lr': 7.782155840713984e-05, 'samples': 21463872, 'steps': 111790, 'loss/train': 1.0971145629882812} 11/07/2021 12:57:30 - INFO - __main__ - Step 111792: {'lr': 7.781771087933775e-05, 'samples': 21464064, 'steps': 111791, 'loss/train': 0.6731851100921631} 11/07/2021 12:57:30 - INFO - __main__ - Step 111793: {'lr': 7.78138634291183e-05, 'samples': 21464256, 'steps': 111792, 'loss/train': 1.0890966653823853} 11/07/2021 12:57:31 - INFO - __main__ - Step 111794: {'lr': 7.781001605648324e-05, 'samples': 21464448, 'steps': 111793, 'loss/train': 1.7543699741363525} 11/07/2021 12:57:32 - INFO - __main__ - Step 111795: {'lr': 7.780616876143435e-05, 'samples': 21464640, 'steps': 111794, 'loss/train': 1.5104742050170898} 11/07/2021 12:57:32 - INFO - __main__ - Step 111796: {'lr': 7.780232154397334e-05, 'samples': 21464832, 'steps': 111795, 'loss/train': 5.131962776184082} 11/07/2021 12:57:33 - INFO - __main__ - Step 111797: {'lr': 7.779847440410196e-05, 'samples': 21465024, 'steps': 111796, 'loss/train': 1.4677274227142334} 11/07/2021 12:57:33 - INFO - __main__ - Step 111798: {'lr': 7.779462734182188e-05, 'samples': 21465216, 'steps': 111797, 'loss/train': 1.3800536394119263} 11/07/2021 12:57:33 - INFO - __main__ - Step 111799: {'lr': 7.779078035713493e-05, 'samples': 21465408, 'steps': 111798, 'loss/train': 1.4846047163009644} 11/07/2021 12:57:34 - INFO - __main__ - Step 111800: {'lr': 7.778693345004278e-05, 'samples': 21465600, 'steps': 111799, 'loss/train': 1.1548014879226685} 11/07/2021 12:57:35 - INFO - __main__ - Step 111801: {'lr': 7.77830866205472e-05, 'samples': 21465792, 'steps': 111800, 'loss/train': 1.7221636772155762} 11/07/2021 12:57:35 - INFO - __main__ - Step 111802: {'lr': 7.777923986864987e-05, 'samples': 21465984, 'steps': 111801, 'loss/train': 1.9111183881759644} 11/07/2021 12:57:35 - INFO - __main__ - Step 111803: {'lr': 7.777539319435267e-05, 'samples': 21466176, 'steps': 111802, 'loss/train': 1.0423917770385742} 11/07/2021 12:57:36 - INFO - __main__ - Step 111804: {'lr': 7.777154659765712e-05, 'samples': 21466368, 'steps': 111803, 'loss/train': 1.5090429782867432} 11/07/2021 12:57:36 - INFO - __main__ - Step 111805: {'lr': 7.776770007856504e-05, 'samples': 21466560, 'steps': 111804, 'loss/train': 1.4581419229507446} 11/07/2021 12:57:37 - INFO - __main__ - Step 111806: {'lr': 7.776385363707821e-05, 'samples': 21466752, 'steps': 111805, 'loss/train': 1.413017988204956} 11/07/2021 12:57:37 - INFO - __main__ - Step 111807: {'lr': 7.776000727319832e-05, 'samples': 21466944, 'steps': 111806, 'loss/train': 1.7792869806289673} 11/07/2021 12:57:38 - INFO - __main__ - Step 111808: {'lr': 7.775616098692708e-05, 'samples': 21467136, 'steps': 111807, 'loss/train': 1.5508581399917603} 11/07/2021 12:57:38 - INFO - __main__ - Step 111809: {'lr': 7.77523147782663e-05, 'samples': 21467328, 'steps': 111808, 'loss/train': 1.406025767326355} 11/07/2021 12:57:39 - INFO - __main__ - Step 111810: {'lr': 7.774846864721766e-05, 'samples': 21467520, 'steps': 111809, 'loss/train': 1.4128044843673706} 11/07/2021 12:57:40 - INFO - __main__ - Step 111811: {'lr': 7.77446225937829e-05, 'samples': 21467712, 'steps': 111810, 'loss/train': 1.6591758728027344} 11/07/2021 12:57:40 - INFO - __main__ - Step 111812: {'lr': 7.774077661796374e-05, 'samples': 21467904, 'steps': 111811, 'loss/train': 1.2663415670394897} 11/07/2021 12:57:40 - INFO - __main__ - Step 111813: {'lr': 7.773693071976192e-05, 'samples': 21468096, 'steps': 111812, 'loss/train': 1.4329520463943481} 11/07/2021 12:57:41 - INFO - __main__ - Step 111814: {'lr': 7.773308489917929e-05, 'samples': 21468288, 'steps': 111813, 'loss/train': 1.1956660747528076} 11/07/2021 12:57:41 - INFO - __main__ - Step 111815: {'lr': 7.772923915621737e-05, 'samples': 21468480, 'steps': 111814, 'loss/train': 1.3632034063339233} 11/07/2021 12:57:42 - INFO - __main__ - Step 111816: {'lr': 7.772539349087802e-05, 'samples': 21468672, 'steps': 111815, 'loss/train': 1.6541788578033447} 11/07/2021 12:57:42 - INFO - __main__ - Step 111817: {'lr': 7.772154790316294e-05, 'samples': 21468864, 'steps': 111816, 'loss/train': 1.3805171251296997} 11/07/2021 12:57:43 - INFO - __main__ - Step 111818: {'lr': 7.771770239307388e-05, 'samples': 21469056, 'steps': 111817, 'loss/train': 0.8551280498504639} 11/07/2021 12:57:43 - INFO - __main__ - Step 111819: {'lr': 7.771385696061253e-05, 'samples': 21469248, 'steps': 111818, 'loss/train': 1.4664509296417236} 11/07/2021 12:57:43 - INFO - __main__ - Step 111820: {'lr': 7.77100116057807e-05, 'samples': 21469440, 'steps': 111819, 'loss/train': 1.1359549760818481} 11/07/2021 12:57:44 - INFO - __main__ - Step 111821: {'lr': 7.770616632858005e-05, 'samples': 21469632, 'steps': 111820, 'loss/train': 0.9625791907310486} 11/07/2021 12:57:45 - INFO - __main__ - Step 111822: {'lr': 7.770232112901235e-05, 'samples': 21469824, 'steps': 111821, 'loss/train': 1.278997540473938} 11/07/2021 12:57:45 - INFO - __main__ - Step 111823: {'lr': 7.769847600707936e-05, 'samples': 21470016, 'steps': 111822, 'loss/train': 1.2919247150421143} 11/07/2021 12:57:45 - INFO - __main__ - Step 111824: {'lr': 7.769463096278273e-05, 'samples': 21470208, 'steps': 111823, 'loss/train': 1.3902603387832642} 11/07/2021 12:57:46 - INFO - __main__ - Step 111825: {'lr': 7.769078599612433e-05, 'samples': 21470400, 'steps': 111824, 'loss/train': 1.2521896362304688} 11/07/2021 12:57:47 - INFO - __main__ - Step 111826: {'lr': 7.768694110710575e-05, 'samples': 21470592, 'steps': 111825, 'loss/train': 1.2212682962417603} 11/07/2021 12:57:47 - INFO - __main__ - Step 111827: {'lr': 7.768309629572875e-05, 'samples': 21470784, 'steps': 111826, 'loss/train': 1.353686809539795} 11/07/2021 12:57:48 - INFO - __main__ - Step 111828: {'lr': 7.76792515619951e-05, 'samples': 21470976, 'steps': 111827, 'loss/train': 1.090521216392517} 11/07/2021 12:57:48 - INFO - __main__ - Step 111829: {'lr': 7.767540690590652e-05, 'samples': 21471168, 'steps': 111828, 'loss/train': 1.0437899827957153} 11/07/2021 12:57:48 - INFO - __main__ - Step 111830: {'lr': 7.767156232746473e-05, 'samples': 21471360, 'steps': 111829, 'loss/train': 1.3483721017837524} 11/07/2021 12:57:49 - INFO - __main__ - Step 111831: {'lr': 7.766771782667148e-05, 'samples': 21471552, 'steps': 111830, 'loss/train': 1.446899175643921} 11/07/2021 12:57:50 - INFO - __main__ - Step 111832: {'lr': 7.766387340352852e-05, 'samples': 21471744, 'steps': 111831, 'loss/train': 1.1093010902404785} 11/07/2021 12:57:50 - INFO - __main__ - Step 111833: {'lr': 7.766002905803751e-05, 'samples': 21471936, 'steps': 111832, 'loss/train': 2.023871421813965} 11/07/2021 12:57:50 - INFO - __main__ - Step 111834: {'lr': 7.765618479020026e-05, 'samples': 21472128, 'steps': 111833, 'loss/train': 1.095612645149231} 11/07/2021 12:57:51 - INFO - __main__ - Step 111835: {'lr': 7.76523406000185e-05, 'samples': 21472320, 'steps': 111834, 'loss/train': 1.7092164754867554} 11/07/2021 12:57:51 - INFO - __main__ - Step 111836: {'lr': 7.76484964874939e-05, 'samples': 21472512, 'steps': 111835, 'loss/train': 0.35544246435165405} 11/07/2021 12:57:52 - INFO - __main__ - Step 111837: {'lr': 7.764465245262822e-05, 'samples': 21472704, 'steps': 111836, 'loss/train': 1.602924108505249} 11/07/2021 12:57:53 - INFO - __main__ - Step 111838: {'lr': 7.764080849542323e-05, 'samples': 21472896, 'steps': 111837, 'loss/train': 0.8676981925964355} 11/07/2021 12:57:53 - INFO - __main__ - Step 111839: {'lr': 7.763696461588069e-05, 'samples': 21473088, 'steps': 111838, 'loss/train': 1.3409610986709595} 11/07/2021 12:57:53 - INFO - __main__ - Step 111840: {'lr': 7.76331208140022e-05, 'samples': 21473280, 'steps': 111839, 'loss/train': 1.2619154453277588} 11/07/2021 12:57:54 - INFO - __main__ - Step 111841: {'lr': 7.762927708978959e-05, 'samples': 21473472, 'steps': 111840, 'loss/train': 1.4533601999282837} 11/07/2021 12:57:54 - INFO - __main__ - Step 111842: {'lr': 7.762543344324454e-05, 'samples': 21473664, 'steps': 111841, 'loss/train': 1.3970245122909546} 11/07/2021 12:57:55 - INFO - __main__ - Step 111843: {'lr': 7.762158987436881e-05, 'samples': 21473856, 'steps': 111842, 'loss/train': 1.3116662502288818} 11/07/2021 12:57:55 - INFO - __main__ - Step 111844: {'lr': 7.761774638316416e-05, 'samples': 21474048, 'steps': 111843, 'loss/train': 1.6014965772628784} 11/07/2021 12:57:56 - INFO - __main__ - Step 111845: {'lr': 7.761390296963224e-05, 'samples': 21474240, 'steps': 111844, 'loss/train': 0.8887640833854675} 11/07/2021 12:57:56 - INFO - __main__ - Step 111846: {'lr': 7.761005963377487e-05, 'samples': 21474432, 'steps': 111845, 'loss/train': 1.1604461669921875} 11/07/2021 12:57:57 - INFO - __main__ - Step 111847: {'lr': 7.760621637559375e-05, 'samples': 21474624, 'steps': 111846, 'loss/train': 1.1432253122329712} 11/07/2021 12:57:58 - INFO - __main__ - Step 111848: {'lr': 7.760237319509061e-05, 'samples': 21474816, 'steps': 111847, 'loss/train': 1.5711863040924072} 11/07/2021 12:57:58 - INFO - __main__ - Step 111849: {'lr': 7.759853009226717e-05, 'samples': 21475008, 'steps': 111848, 'loss/train': 1.7942105531692505} 11/07/2021 12:57:58 - INFO - __main__ - Step 111850: {'lr': 7.759468706712519e-05, 'samples': 21475200, 'steps': 111849, 'loss/train': 1.4401092529296875} 11/07/2021 12:57:59 - INFO - __main__ - Step 111851: {'lr': 7.759084411966636e-05, 'samples': 21475392, 'steps': 111850, 'loss/train': 1.3509677648544312} 11/07/2021 12:57:59 - INFO - __main__ - Step 111852: {'lr': 7.758700124989254e-05, 'samples': 21475584, 'steps': 111851, 'loss/train': 1.25223970413208} 11/07/2021 12:58:01 - INFO - __main__ - Step 111853: {'lr': 7.758315845780526e-05, 'samples': 21475776, 'steps': 111852, 'loss/train': 1.2944791316986084} 11/07/2021 12:58:01 - INFO - __main__ - Step 111854: {'lr': 7.757931574340635e-05, 'samples': 21475968, 'steps': 111853, 'loss/train': 1.4400699138641357} 11/07/2021 12:58:01 - INFO - __main__ - Step 111855: {'lr': 7.757547310669752e-05, 'samples': 21476160, 'steps': 111854, 'loss/train': 1.4672657251358032} 11/07/2021 12:58:02 - INFO - __main__ - Step 111856: {'lr': 7.757163054768055e-05, 'samples': 21476352, 'steps': 111855, 'loss/train': 1.451083779335022} 11/07/2021 12:58:02 - INFO - __main__ - Step 111857: {'lr': 7.756778806635714e-05, 'samples': 21476544, 'steps': 111856, 'loss/train': 0.47618693113327026} 11/07/2021 12:58:02 - INFO - __main__ - Step 111858: {'lr': 7.756394566272901e-05, 'samples': 21476736, 'steps': 111857, 'loss/train': 0.5769348740577698} 11/07/2021 12:58:04 - INFO - __main__ - Step 111859: {'lr': 7.756010333679791e-05, 'samples': 21476928, 'steps': 111858, 'loss/train': 1.4617875814437866} 11/07/2021 12:58:04 - INFO - __main__ - Step 111860: {'lr': 7.755626108856556e-05, 'samples': 21477120, 'steps': 111859, 'loss/train': 0.7941327095031738} 11/07/2021 12:58:04 - INFO - __main__ - Step 111861: {'lr': 7.755241891803372e-05, 'samples': 21477312, 'steps': 111860, 'loss/train': 0.9716963171958923} 11/07/2021 12:58:05 - INFO - __main__ - Step 111862: {'lr': 7.754857682520408e-05, 'samples': 21477504, 'steps': 111861, 'loss/train': 1.430889368057251} 11/07/2021 12:58:05 - INFO - __main__ - Step 111863: {'lr': 7.75447348100784e-05, 'samples': 21477696, 'steps': 111862, 'loss/train': 1.5899375677108765} 11/07/2021 12:58:06 - INFO - __main__ - Step 111864: {'lr': 7.75408928726584e-05, 'samples': 21477888, 'steps': 111863, 'loss/train': 1.188367486000061} 11/07/2021 12:58:06 - INFO - __main__ - Step 111865: {'lr': 7.753705101294589e-05, 'samples': 21478080, 'steps': 111864, 'loss/train': 1.5679185390472412} 11/07/2021 12:58:07 - INFO - __main__ - Step 111866: {'lr': 7.753320923094246e-05, 'samples': 21478272, 'steps': 111865, 'loss/train': 0.3390650153160095} 11/07/2021 12:58:07 - INFO - __main__ - Step 111867: {'lr': 7.752936752664988e-05, 'samples': 21478464, 'steps': 111866, 'loss/train': 1.028367519378662} 11/07/2021 12:58:07 - INFO - __main__ - Step 111868: {'lr': 7.75255259000699e-05, 'samples': 21478656, 'steps': 111867, 'loss/train': 1.866943120956421} 11/07/2021 12:58:08 - INFO - __main__ - Step 111869: {'lr': 7.752168435120426e-05, 'samples': 21478848, 'steps': 111868, 'loss/train': 1.5468679666519165} 11/07/2021 12:58:09 - INFO - __main__ - Step 111870: {'lr': 7.75178428800547e-05, 'samples': 21479040, 'steps': 111869, 'loss/train': 1.4131840467453003} 11/07/2021 12:58:09 - INFO - __main__ - Step 111871: {'lr': 7.751400148662293e-05, 'samples': 21479232, 'steps': 111870, 'loss/train': 1.4831018447875977} 11/07/2021 12:58:09 - INFO - __main__ - Step 111872: {'lr': 7.75101601709107e-05, 'samples': 21479424, 'steps': 111871, 'loss/train': 1.3139100074768066} 11/07/2021 12:58:10 - INFO - __main__ - Step 111873: {'lr': 7.750631893291973e-05, 'samples': 21479616, 'steps': 111872, 'loss/train': 1.702991008758545} 11/07/2021 12:58:11 - INFO - __main__ - Step 111874: {'lr': 7.750247777265177e-05, 'samples': 21479808, 'steps': 111873, 'loss/train': 1.5337474346160889} 11/07/2021 12:58:11 - INFO - __main__ - Step 111875: {'lr': 7.749863669010848e-05, 'samples': 21480000, 'steps': 111874, 'loss/train': 1.3724688291549683} 11/07/2021 12:58:12 - INFO - __main__ - Step 111876: {'lr': 7.749479568529169e-05, 'samples': 21480192, 'steps': 111875, 'loss/train': 1.5081160068511963} 11/07/2021 12:58:12 - INFO - __main__ - Step 111877: {'lr': 7.749095475820306e-05, 'samples': 21480384, 'steps': 111876, 'loss/train': 1.4439945220947266} 11/07/2021 12:58:12 - INFO - __main__ - Step 111878: {'lr': 7.748711390884434e-05, 'samples': 21480576, 'steps': 111877, 'loss/train': 1.587721347808838} 11/07/2021 12:58:13 - INFO - __main__ - Step 111879: {'lr': 7.748327313721737e-05, 'samples': 21480768, 'steps': 111878, 'loss/train': 1.3366717100143433} 11/07/2021 12:58:14 - INFO - __main__ - Step 111880: {'lr': 7.747943244332367e-05, 'samples': 21480960, 'steps': 111879, 'loss/train': 1.0746608972549438} 11/07/2021 12:58:14 - INFO - __main__ - Step 111881: {'lr': 7.747559182716507e-05, 'samples': 21481152, 'steps': 111880, 'loss/train': 1.2321951389312744} 11/07/2021 12:58:14 - INFO - __main__ - Step 111882: {'lr': 7.747175128874331e-05, 'samples': 21481344, 'steps': 111881, 'loss/train': 1.3358898162841797} 11/07/2021 12:58:15 - INFO - __main__ - Step 111883: {'lr': 7.746791082806015e-05, 'samples': 21481536, 'steps': 111882, 'loss/train': 1.477003812789917} 11/07/2021 12:58:16 - INFO - __main__ - Step 111884: {'lr': 7.746407044511724e-05, 'samples': 21481728, 'steps': 111883, 'loss/train': 1.5177289247512817} 11/07/2021 12:58:16 - INFO - __main__ - Step 111885: {'lr': 7.746023013991641e-05, 'samples': 21481920, 'steps': 111884, 'loss/train': 0.06877534836530685} 11/07/2021 12:58:16 - INFO - __main__ - Step 111886: {'lr': 7.745638991245929e-05, 'samples': 21482112, 'steps': 111885, 'loss/train': 1.4346158504486084} 11/07/2021 12:58:17 - INFO - __main__ - Step 111887: {'lr': 7.745254976274769e-05, 'samples': 21482304, 'steps': 111886, 'loss/train': 0.6984245181083679} 11/07/2021 12:58:17 - INFO - __main__ - Step 111888: {'lr': 7.744870969078327e-05, 'samples': 21482496, 'steps': 111887, 'loss/train': 1.2921258211135864} 11/07/2021 12:58:18 - INFO - __main__ - Step 111889: {'lr': 7.744486969656783e-05, 'samples': 21482688, 'steps': 111888, 'loss/train': 1.0885882377624512} 11/07/2021 12:58:19 - INFO - __main__ - Step 111890: {'lr': 7.744102978010306e-05, 'samples': 21482880, 'steps': 111889, 'loss/train': 1.476280689239502} 11/07/2021 12:58:19 - INFO - __main__ - Step 111891: {'lr': 7.743718994139071e-05, 'samples': 21483072, 'steps': 111890, 'loss/train': 1.4417784214019775} 11/07/2021 12:58:19 - INFO - __main__ - Step 111892: {'lr': 7.743335018043257e-05, 'samples': 21483264, 'steps': 111891, 'loss/train': 1.3684279918670654} 11/07/2021 12:58:20 - INFO - __main__ - Step 111893: {'lr': 7.742951049723022e-05, 'samples': 21483456, 'steps': 111892, 'loss/train': 1.1372594833374023} 11/07/2021 12:58:20 - INFO - __main__ - Step 111894: {'lr': 7.742567089178546e-05, 'samples': 21483648, 'steps': 111893, 'loss/train': 0.6876744627952576} 11/07/2021 12:58:21 - INFO - __main__ - Step 111895: {'lr': 7.742183136410006e-05, 'samples': 21483840, 'steps': 111894, 'loss/train': 0.5392594337463379} 11/07/2021 12:58:22 - INFO - __main__ - Step 111896: {'lr': 7.741799191417567e-05, 'samples': 21484032, 'steps': 111895, 'loss/train': 1.3663113117218018} 11/07/2021 12:58:22 - INFO - __main__ - Step 111897: {'lr': 7.741415254201411e-05, 'samples': 21484224, 'steps': 111896, 'loss/train': 1.0334409475326538} 11/07/2021 12:58:22 - INFO - __main__ - Step 111898: {'lr': 7.741031324761707e-05, 'samples': 21484416, 'steps': 111897, 'loss/train': 1.46759831905365} 11/07/2021 12:58:23 - INFO - __main__ - Step 111899: {'lr': 7.740647403098627e-05, 'samples': 21484608, 'steps': 111898, 'loss/train': 1.5106923580169678} 11/07/2021 12:58:24 - INFO - __main__ - Step 111900: {'lr': 7.740263489212343e-05, 'samples': 21484800, 'steps': 111899, 'loss/train': 1.282575011253357} 11/07/2021 12:58:24 - INFO - __main__ - Step 111901: {'lr': 7.739879583103033e-05, 'samples': 21484992, 'steps': 111900, 'loss/train': 1.4186428785324097} 11/07/2021 12:58:25 - INFO - __main__ - Step 111902: {'lr': 7.739495684770864e-05, 'samples': 21485184, 'steps': 111901, 'loss/train': 1.8386684656143188} 11/07/2021 12:58:25 - INFO - __main__ - Step 111903: {'lr': 7.739111794216014e-05, 'samples': 21485376, 'steps': 111902, 'loss/train': 1.3467397689819336} 11/07/2021 12:58:25 - INFO - __main__ - Step 111904: {'lr': 7.738727911438653e-05, 'samples': 21485568, 'steps': 111903, 'loss/train': 1.2867701053619385} 11/07/2021 12:58:26 - INFO - __main__ - Step 111905: {'lr': 7.738344036438957e-05, 'samples': 21485760, 'steps': 111904, 'loss/train': 1.4802416563034058} 11/07/2021 12:58:27 - INFO - __main__ - Step 111906: {'lr': 7.737960169217104e-05, 'samples': 21485952, 'steps': 111905, 'loss/train': 1.7731285095214844} 11/07/2021 12:58:27 - INFO - __main__ - Step 111907: {'lr': 7.73757630977325e-05, 'samples': 21486144, 'steps': 111906, 'loss/train': 1.4249913692474365} 11/07/2021 12:58:27 - INFO - __main__ - Step 111908: {'lr': 7.737192458107578e-05, 'samples': 21486336, 'steps': 111907, 'loss/train': 1.2750136852264404} 11/07/2021 12:58:28 - INFO - __main__ - Step 111909: {'lr': 7.736808614220262e-05, 'samples': 21486528, 'steps': 111908, 'loss/train': 1.3589940071105957} 11/07/2021 12:58:28 - INFO - __main__ - Step 111910: {'lr': 7.736424778111473e-05, 'samples': 21486720, 'steps': 111909, 'loss/train': 1.2591277360916138} 11/07/2021 12:58:29 - INFO - __main__ - Step 111911: {'lr': 7.736040949781384e-05, 'samples': 21486912, 'steps': 111910, 'loss/train': 1.6480035781860352} 11/07/2021 12:58:29 - INFO - __main__ - Step 111912: {'lr': 7.73565712923017e-05, 'samples': 21487104, 'steps': 111911, 'loss/train': 1.4969086647033691} 11/07/2021 12:58:30 - INFO - __main__ - Step 111913: {'lr': 7.735273316457999e-05, 'samples': 21487296, 'steps': 111912, 'loss/train': 1.5463359355926514} 11/07/2021 12:58:30 - INFO - __main__ - Step 111914: {'lr': 7.734889511465051e-05, 'samples': 21487488, 'steps': 111913, 'loss/train': 1.2250893115997314} 11/07/2021 12:58:30 - INFO - __main__ - Step 111915: {'lr': 7.734505714251493e-05, 'samples': 21487680, 'steps': 111914, 'loss/train': 0.9206143617630005} 11/07/2021 12:58:32 - INFO - __main__ - Step 111916: {'lr': 7.734121924817505e-05, 'samples': 21487872, 'steps': 111915, 'loss/train': 1.3635613918304443} 11/07/2021 12:58:32 - INFO - __main__ - Step 111917: {'lr': 7.733738143163252e-05, 'samples': 21488064, 'steps': 111916, 'loss/train': 0.35674381256103516} 11/07/2021 12:58:32 - INFO - __main__ - Step 111918: {'lr': 7.733354369288909e-05, 'samples': 21488256, 'steps': 111917, 'loss/train': 1.4585832357406616} 11/07/2021 12:58:33 - INFO - __main__ - Step 111919: {'lr': 7.732970603194659e-05, 'samples': 21488448, 'steps': 111918, 'loss/train': 0.9523752927780151} 11/07/2021 12:58:33 - INFO - __main__ - Step 111920: {'lr': 7.73258684488066e-05, 'samples': 21488640, 'steps': 111919, 'loss/train': 0.6939034461975098} 11/07/2021 12:58:34 - INFO - __main__ - Step 111921: {'lr': 7.732203094347088e-05, 'samples': 21488832, 'steps': 111920, 'loss/train': 1.7382380962371826} 11/07/2021 12:58:34 - INFO - __main__ - Step 111922: {'lr': 7.73181935159412e-05, 'samples': 21489024, 'steps': 111921, 'loss/train': 1.1929413080215454} 11/07/2021 12:58:35 - INFO - __main__ - Step 111923: {'lr': 7.731435616621926e-05, 'samples': 21489216, 'steps': 111922, 'loss/train': 1.1786600351333618} 11/07/2021 12:58:35 - INFO - __main__ - Step 111924: {'lr': 7.731051889430685e-05, 'samples': 21489408, 'steps': 111923, 'loss/train': 0.8438966870307922} 11/07/2021 12:58:35 - INFO - __main__ - Step 111925: {'lr': 7.730668170020561e-05, 'samples': 21489600, 'steps': 111924, 'loss/train': 1.4476145505905151} 11/07/2021 12:58:36 - INFO - __main__ - Step 111926: {'lr': 7.730284458391734e-05, 'samples': 21489792, 'steps': 111925, 'loss/train': 1.5181376934051514} 11/07/2021 12:58:37 - INFO - __main__ - Step 111927: {'lr': 7.729900754544373e-05, 'samples': 21489984, 'steps': 111926, 'loss/train': 0.9939373135566711} 11/07/2021 12:58:37 - INFO - __main__ - Step 111928: {'lr': 7.729517058478653e-05, 'samples': 21490176, 'steps': 111927, 'loss/train': 1.272496223449707} 11/07/2021 12:58:37 - INFO - __main__ - Step 111929: {'lr': 7.729133370194747e-05, 'samples': 21490368, 'steps': 111928, 'loss/train': 1.1214419603347778} 11/07/2021 12:58:38 - INFO - __main__ - Step 111930: {'lr': 7.728749689692823e-05, 'samples': 21490560, 'steps': 111929, 'loss/train': 1.2085622549057007} 11/07/2021 12:58:38 - INFO - __main__ - Step 111931: {'lr': 7.728366016973062e-05, 'samples': 21490752, 'steps': 111930, 'loss/train': 1.0847054719924927} 11/07/2021 12:58:39 - INFO - __main__ - Step 111932: {'lr': 7.727982352035631e-05, 'samples': 21490944, 'steps': 111931, 'loss/train': 1.499227523803711} 11/07/2021 12:58:40 - INFO - __main__ - Step 111933: {'lr': 7.727598694880714e-05, 'samples': 21491136, 'steps': 111932, 'loss/train': 0.9280983209609985} 11/07/2021 12:58:40 - INFO - __main__ - Step 111934: {'lr': 7.727215045508464e-05, 'samples': 21491328, 'steps': 111933, 'loss/train': 1.742488980293274} 11/07/2021 12:58:40 - INFO - __main__ - Step 111935: {'lr': 7.726831403919068e-05, 'samples': 21491520, 'steps': 111934, 'loss/train': 1.1608470678329468} 11/07/2021 12:58:41 - INFO - __main__ - Step 111936: {'lr': 7.726447770112693e-05, 'samples': 21491712, 'steps': 111935, 'loss/train': 0.8200398683547974} 11/07/2021 12:58:42 - INFO - __main__ - Step 111937: {'lr': 7.726064144089515e-05, 'samples': 21491904, 'steps': 111936, 'loss/train': 0.8498646020889282} 11/07/2021 12:58:42 - INFO - __main__ - Step 111938: {'lr': 7.725680525849704e-05, 'samples': 21492096, 'steps': 111937, 'loss/train': 1.271918535232544} 11/07/2021 12:58:42 - INFO - __main__ - Step 111939: {'lr': 7.725296915393438e-05, 'samples': 21492288, 'steps': 111938, 'loss/train': 1.2880982160568237} 11/07/2021 12:58:43 - INFO - __main__ - Step 111940: {'lr': 7.724913312720886e-05, 'samples': 21492480, 'steps': 111939, 'loss/train': 1.2919639348983765} 11/07/2021 12:58:43 - INFO - __main__ - Step 111941: {'lr': 7.724529717832218e-05, 'samples': 21492672, 'steps': 111940, 'loss/train': 1.2629400491714478} 11/07/2021 12:58:44 - INFO - __main__ - Step 111942: {'lr': 7.724146130727614e-05, 'samples': 21492864, 'steps': 111941, 'loss/train': 1.5952576398849487} 11/07/2021 12:58:44 - INFO - __main__ - Step 111943: {'lr': 7.72376255140724e-05, 'samples': 21493056, 'steps': 111942, 'loss/train': 1.4114402532577515} 11/07/2021 12:58:45 - INFO - __main__ - Step 111944: {'lr': 7.723378979871276e-05, 'samples': 21493248, 'steps': 111943, 'loss/train': 1.8827037811279297} 11/07/2021 12:58:45 - INFO - __main__ - Step 111945: {'lr': 7.722995416119888e-05, 'samples': 21493440, 'steps': 111944, 'loss/train': 1.718947172164917} 11/07/2021 12:58:45 - INFO - __main__ - Step 111946: {'lr': 7.722611860153259e-05, 'samples': 21493632, 'steps': 111945, 'loss/train': 1.3600164651870728} 11/07/2021 12:58:46 - INFO - __main__ - Step 111947: {'lr': 7.72222831197155e-05, 'samples': 21493824, 'steps': 111946, 'loss/train': 0.8856589794158936} 11/07/2021 12:58:47 - INFO - __main__ - Step 111948: {'lr': 7.721844771574934e-05, 'samples': 21494016, 'steps': 111947, 'loss/train': 1.4444315433502197} 11/07/2021 12:58:47 - INFO - __main__ - Step 111949: {'lr': 7.721461238963589e-05, 'samples': 21494208, 'steps': 111948, 'loss/train': 1.1455411911010742} 11/07/2021 12:58:47 - INFO - __main__ - Step 111950: {'lr': 7.721077714137689e-05, 'samples': 21494400, 'steps': 111949, 'loss/train': 1.479238748550415} 11/07/2021 12:58:48 - INFO - __main__ - Step 111951: {'lr': 7.720694197097406e-05, 'samples': 21494592, 'steps': 111950, 'loss/train': 1.9757455587387085} 11/07/2021 12:58:49 - INFO - __main__ - Step 111952: {'lr': 7.720310687842908e-05, 'samples': 21494784, 'steps': 111951, 'loss/train': 0.4575786888599396} 11/07/2021 12:58:49 - INFO - __main__ - Step 111953: {'lr': 7.719927186374373e-05, 'samples': 21494976, 'steps': 111952, 'loss/train': 1.3430932760238647} 11/07/2021 12:58:50 - INFO - __main__ - Step 111954: {'lr': 7.719543692691972e-05, 'samples': 21495168, 'steps': 111953, 'loss/train': 1.2686119079589844} 11/07/2021 12:58:50 - INFO - __main__ - Step 111955: {'lr': 7.719160206795877e-05, 'samples': 21495360, 'steps': 111954, 'loss/train': 0.6635996103286743} 11/07/2021 12:58:50 - INFO - __main__ - Step 111956: {'lr': 7.718776728686263e-05, 'samples': 21495552, 'steps': 111955, 'loss/train': 1.4355710744857788} 11/07/2021 12:58:51 - INFO - __main__ - Step 111957: {'lr': 7.718393258363302e-05, 'samples': 21495744, 'steps': 111956, 'loss/train': 1.2863820791244507} 11/07/2021 12:58:52 - INFO - __main__ - Step 111958: {'lr': 7.718009795827166e-05, 'samples': 21495936, 'steps': 111957, 'loss/train': 1.5231561660766602} 11/07/2021 12:58:52 - INFO - __main__ - Step 111959: {'lr': 7.717626341078027e-05, 'samples': 21496128, 'steps': 111958, 'loss/train': 1.1395231485366821} 11/07/2021 12:58:53 - INFO - __main__ - Step 111960: {'lr': 7.717242894116067e-05, 'samples': 21496320, 'steps': 111959, 'loss/train': 1.4511805772781372} 11/07/2021 12:58:53 - INFO - __main__ - Step 111961: {'lr': 7.716859454941444e-05, 'samples': 21496512, 'steps': 111960, 'loss/train': 1.3950237035751343} 11/07/2021 12:58:53 - INFO - __main__ - Step 111962: {'lr': 7.716476023554336e-05, 'samples': 21496704, 'steps': 111961, 'loss/train': 1.461380958557129} 11/07/2021 12:58:55 - INFO - __main__ - Step 111963: {'lr': 7.716092599954919e-05, 'samples': 21496896, 'steps': 111962, 'loss/train': 1.612472653388977} 11/07/2021 12:58:55 - INFO - __main__ - Step 111964: {'lr': 7.715709184143363e-05, 'samples': 21497088, 'steps': 111963, 'loss/train': 0.9638867378234863} 11/07/2021 12:58:55 - INFO - __main__ - Step 111965: {'lr': 7.715325776119841e-05, 'samples': 21497280, 'steps': 111964, 'loss/train': 1.5907723903656006} 11/07/2021 12:58:56 - INFO - __main__ - Step 111966: {'lr': 7.714942375884526e-05, 'samples': 21497472, 'steps': 111965, 'loss/train': 1.4284688234329224} 11/07/2021 12:58:56 - INFO - __main__ - Step 111967: {'lr': 7.714558983437594e-05, 'samples': 21497664, 'steps': 111966, 'loss/train': 1.3589117527008057} 11/07/2021 12:58:57 - INFO - __main__ - Step 111968: {'lr': 7.714175598779213e-05, 'samples': 21497856, 'steps': 111967, 'loss/train': 1.3623263835906982} 11/07/2021 12:58:57 - INFO - __main__ - Step 111969: {'lr': 7.713792221909558e-05, 'samples': 21498048, 'steps': 111968, 'loss/train': 1.3776628971099854} 11/07/2021 12:58:58 - INFO - __main__ - Step 111970: {'lr': 7.713408852828801e-05, 'samples': 21498240, 'steps': 111969, 'loss/train': 1.2726901769638062} 11/07/2021 12:58:58 - INFO - __main__ - Step 111971: {'lr': 7.713025491537115e-05, 'samples': 21498432, 'steps': 111970, 'loss/train': 1.0327043533325195} 11/07/2021 12:58:58 - INFO - __main__ - Step 111972: {'lr': 7.712642138034676e-05, 'samples': 21498624, 'steps': 111971, 'loss/train': 1.213622808456421} 11/07/2021 12:59:00 - INFO - __main__ - Step 111973: {'lr': 7.712258792321658e-05, 'samples': 21498816, 'steps': 111972, 'loss/train': 1.0919638872146606} 11/07/2021 12:59:01 - INFO - __main__ - Step 111974: {'lr': 7.711875454398223e-05, 'samples': 21499008, 'steps': 111973, 'loss/train': 1.0733580589294434} 11/07/2021 12:59:01 - INFO - __main__ - Step 111975: {'lr': 7.711492124264549e-05, 'samples': 21499200, 'steps': 111974, 'loss/train': 1.5447198152542114} 11/07/2021 12:59:01 - INFO - __main__ - Step 111976: {'lr': 7.71110880192081e-05, 'samples': 21499392, 'steps': 111975, 'loss/train': 0.6111003160476685} 11/07/2021 12:59:02 - INFO - __main__ - Step 111977: {'lr': 7.710725487367182e-05, 'samples': 21499584, 'steps': 111976, 'loss/train': 2.5371036529541016} 11/07/2021 12:59:02 - INFO - __main__ - Step 111978: {'lr': 7.71034218060383e-05, 'samples': 21499776, 'steps': 111977, 'loss/train': 0.5029197335243225} 11/07/2021 12:59:03 - INFO - __main__ - Step 111979: {'lr': 7.709958881630932e-05, 'samples': 21499968, 'steps': 111978, 'loss/train': 0.4117504954338074} 11/07/2021 12:59:03 - INFO - __main__ - Step 111980: {'lr': 7.709575590448661e-05, 'samples': 21500160, 'steps': 111979, 'loss/train': 1.297309160232544} 11/07/2021 12:59:04 - INFO - __main__ - Step 111981: {'lr': 7.70919230705719e-05, 'samples': 21500352, 'steps': 111980, 'loss/train': 1.5514963865280151} 11/07/2021 12:59:04 - INFO - __main__ - Step 111982: {'lr': 7.708809031456688e-05, 'samples': 21500544, 'steps': 111981, 'loss/train': 1.4019392728805542} 11/07/2021 12:59:04 - INFO - __main__ - Step 111983: {'lr': 7.708425763647328e-05, 'samples': 21500736, 'steps': 111982, 'loss/train': 0.7593502402305603} 11/07/2021 12:59:05 - INFO - __main__ - Step 111984: {'lr': 7.708042503629286e-05, 'samples': 21500928, 'steps': 111983, 'loss/train': 1.4973891973495483} 11/07/2021 12:59:06 - INFO - __main__ - Step 111985: {'lr': 7.707659251402735e-05, 'samples': 21501120, 'steps': 111984, 'loss/train': 1.1786630153656006} 11/07/2021 12:59:06 - INFO - __main__ - Step 111986: {'lr': 7.707276006967854e-05, 'samples': 21501312, 'steps': 111985, 'loss/train': 1.6891155242919922} 11/07/2021 12:59:06 - INFO - __main__ - Step 111987: {'lr': 7.706892770324798e-05, 'samples': 21501504, 'steps': 111986, 'loss/train': 1.2074121236801147} 11/07/2021 12:59:07 - INFO - __main__ - Step 111988: {'lr': 7.706509541473747e-05, 'samples': 21501696, 'steps': 111987, 'loss/train': 1.114173173904419} 11/07/2021 12:59:07 - INFO - __main__ - Step 111989: {'lr': 7.706126320414877e-05, 'samples': 21501888, 'steps': 111988, 'loss/train': 1.5866061449050903} 11/07/2021 12:59:08 - INFO - __main__ - Step 111990: {'lr': 7.705743107148363e-05, 'samples': 21502080, 'steps': 111989, 'loss/train': 1.4017903804779053} 11/07/2021 12:59:09 - INFO - __main__ - Step 111991: {'lr': 7.705359901674371e-05, 'samples': 21502272, 'steps': 111990, 'loss/train': 1.4148832559585571} 11/07/2021 12:59:09 - INFO - __main__ - Step 111992: {'lr': 7.70497670399308e-05, 'samples': 21502464, 'steps': 111991, 'loss/train': 0.6564176082611084} 11/07/2021 12:59:09 - INFO - __main__ - Step 111993: {'lr': 7.704593514104658e-05, 'samples': 21502656, 'steps': 111992, 'loss/train': 1.7869033813476562} 11/07/2021 12:59:10 - INFO - __main__ - Step 111994: {'lr': 7.704210332009278e-05, 'samples': 21502848, 'steps': 111993, 'loss/train': 1.4801753759384155} 11/07/2021 12:59:11 - INFO - __main__ - Step 111995: {'lr': 7.703827157707116e-05, 'samples': 21503040, 'steps': 111994, 'loss/train': 1.6122039556503296} 11/07/2021 12:59:11 - INFO - __main__ - Step 111996: {'lr': 7.703443991198342e-05, 'samples': 21503232, 'steps': 111995, 'loss/train': 1.8586242198944092} 11/07/2021 12:59:11 - INFO - __main__ - Step 111997: {'lr': 7.703060832483128e-05, 'samples': 21503424, 'steps': 111996, 'loss/train': 1.051661729812622} 11/07/2021 12:59:12 - INFO - __main__ - Step 111998: {'lr': 7.702677681561649e-05, 'samples': 21503616, 'steps': 111997, 'loss/train': 1.550277829170227} 11/07/2021 12:59:12 - INFO - __main__ - Step 111999: {'lr': 7.702294538434077e-05, 'samples': 21503808, 'steps': 111998, 'loss/train': 1.4339511394500732} 11/07/2021 12:59:13 - INFO - __main__ - Step 112000: {'lr': 7.70191140310059e-05, 'samples': 21504000, 'steps': 111999, 'loss/train': 1.31199049949646} 11/07/2021 12:59:14 - INFO - __main__ - Step 112001: {'lr': 7.701528275561349e-05, 'samples': 21504192, 'steps': 112000, 'loss/train': 2.2497076988220215} 11/07/2021 12:59:14 - INFO - __main__ - Step 112002: {'lr': 7.70114515581653e-05, 'samples': 21504384, 'steps': 112001, 'loss/train': 1.164486289024353} 11/07/2021 12:59:14 - INFO - __main__ - Step 112003: {'lr': 7.700762043866311e-05, 'samples': 21504576, 'steps': 112002, 'loss/train': 1.201538324356079} 11/07/2021 12:59:15 - INFO - __main__ - Step 112004: {'lr': 7.700378939710859e-05, 'samples': 21504768, 'steps': 112003, 'loss/train': 0.9604354500770569} 11/07/2021 12:59:15 - INFO - __main__ - Step 112005: {'lr': 7.699995843350351e-05, 'samples': 21504960, 'steps': 112004, 'loss/train': 1.504807710647583} 11/07/2021 12:59:16 - INFO - __main__ - Step 112006: {'lr': 7.699612754784957e-05, 'samples': 21505152, 'steps': 112005, 'loss/train': 1.0690909624099731} 11/07/2021 12:59:16 - INFO - __main__ - Step 112007: {'lr': 7.699229674014851e-05, 'samples': 21505344, 'steps': 112006, 'loss/train': 1.08782160282135} 11/07/2021 12:59:17 - INFO - __main__ - Step 112008: {'lr': 7.698846601040205e-05, 'samples': 21505536, 'steps': 112007, 'loss/train': 1.0594587326049805} 11/07/2021 12:59:17 - INFO - __main__ - Step 112009: {'lr': 7.698463535861192e-05, 'samples': 21505728, 'steps': 112008, 'loss/train': 1.3347798585891724} 11/07/2021 12:59:17 - INFO - __main__ - Step 112010: {'lr': 7.698080478477984e-05, 'samples': 21505920, 'steps': 112009, 'loss/train': 1.29973566532135} 11/07/2021 12:59:18 - INFO - __main__ - Step 112011: {'lr': 7.697697428890754e-05, 'samples': 21506112, 'steps': 112010, 'loss/train': 1.3268458843231201} 11/07/2021 12:59:19 - INFO - __main__ - Step 112012: {'lr': 7.697314387099676e-05, 'samples': 21506304, 'steps': 112011, 'loss/train': 1.3129148483276367} 11/07/2021 12:59:19 - INFO - __main__ - Step 112013: {'lr': 7.696931353104927e-05, 'samples': 21506496, 'steps': 112012, 'loss/train': 1.5400784015655518} 11/07/2021 12:59:20 - INFO - __main__ - Step 112014: {'lr': 7.696548326906668e-05, 'samples': 21506688, 'steps': 112013, 'loss/train': 1.1876370906829834} 11/07/2021 12:59:20 - INFO - __main__ - Step 112015: {'lr': 7.696165308505076e-05, 'samples': 21506880, 'steps': 112014, 'loss/train': 1.2869545221328735} 11/07/2021 12:59:21 - INFO - __main__ - Step 112016: {'lr': 7.695782297900325e-05, 'samples': 21507072, 'steps': 112015, 'loss/train': 1.2004621028900146} 11/07/2021 12:59:21 - INFO - __main__ - Step 112017: {'lr': 7.695399295092586e-05, 'samples': 21507264, 'steps': 112016, 'loss/train': 1.138474941253662} 11/07/2021 12:59:22 - INFO - __main__ - Step 112018: {'lr': 7.695016300082036e-05, 'samples': 21507456, 'steps': 112017, 'loss/train': 0.8619021773338318} 11/07/2021 12:59:22 - INFO - __main__ - Step 112019: {'lr': 7.694633312868843e-05, 'samples': 21507648, 'steps': 112018, 'loss/train': 1.6847918033599854} 11/07/2021 12:59:22 - INFO - __main__ - Step 112020: {'lr': 7.694250333453182e-05, 'samples': 21507840, 'steps': 112019, 'loss/train': 1.5053503513336182} 11/07/2021 12:59:23 - INFO - __main__ - Step 112021: {'lr': 7.693867361835222e-05, 'samples': 21508032, 'steps': 112020, 'loss/train': 1.6679519414901733} 11/07/2021 12:59:24 - INFO - __main__ - Step 112022: {'lr': 7.693484398015141e-05, 'samples': 21508224, 'steps': 112021, 'loss/train': 1.4683597087860107} 11/07/2021 12:59:24 - INFO - __main__ - Step 112023: {'lr': 7.693101441993108e-05, 'samples': 21508416, 'steps': 112022, 'loss/train': 1.509779453277588} 11/07/2021 12:59:25 - INFO - __main__ - Step 112024: {'lr': 7.692718493769296e-05, 'samples': 21508608, 'steps': 112023, 'loss/train': 1.6372652053833008} 11/07/2021 12:59:25 - INFO - __main__ - Step 112025: {'lr': 7.69233555334388e-05, 'samples': 21508800, 'steps': 112024, 'loss/train': 0.5618067383766174} 11/07/2021 12:59:25 - INFO - __main__ - Step 112026: {'lr': 7.69195262071703e-05, 'samples': 21508992, 'steps': 112025, 'loss/train': 0.8436044454574585} 11/07/2021 12:59:26 - INFO - __main__ - Step 112027: {'lr': 7.691569695888925e-05, 'samples': 21509184, 'steps': 112026, 'loss/train': 1.5174850225448608} 11/07/2021 12:59:27 - INFO - __main__ - Step 112028: {'lr': 7.691186778859724e-05, 'samples': 21509376, 'steps': 112027, 'loss/train': 0.9212328791618347} 11/07/2021 12:59:27 - INFO - __main__ - Step 112029: {'lr': 7.690803869629609e-05, 'samples': 21509568, 'steps': 112028, 'loss/train': 0.7129878401756287} 11/07/2021 12:59:27 - INFO - __main__ - Step 112030: {'lr': 7.690420968198749e-05, 'samples': 21509760, 'steps': 112029, 'loss/train': 0.09558170288801193} 11/07/2021 12:59:28 - INFO - __main__ - Step 112031: {'lr': 7.690038074567319e-05, 'samples': 21509952, 'steps': 112030, 'loss/train': 1.6905524730682373} 11/07/2021 12:59:29 - INFO - __main__ - Step 112032: {'lr': 7.689655188735493e-05, 'samples': 21510144, 'steps': 112031, 'loss/train': 1.1544759273529053} 11/07/2021 12:59:29 - INFO - __main__ - Step 112033: {'lr': 7.689272310703438e-05, 'samples': 21510336, 'steps': 112032, 'loss/train': 1.3880879878997803} 11/07/2021 12:59:30 - INFO - __main__ - Step 112034: {'lr': 7.688889440471331e-05, 'samples': 21510528, 'steps': 112033, 'loss/train': 1.5252270698547363} 11/07/2021 12:59:30 - INFO - __main__ - Step 112035: {'lr': 7.688506578039341e-05, 'samples': 21510720, 'steps': 112034, 'loss/train': 1.7230488061904907} 11/07/2021 12:59:30 - INFO - __main__ - Step 112036: {'lr': 7.688123723407644e-05, 'samples': 21510912, 'steps': 112035, 'loss/train': 1.5162838697433472} 11/07/2021 12:59:31 - INFO - __main__ - Step 112037: {'lr': 7.687740876576413e-05, 'samples': 21511104, 'steps': 112036, 'loss/train': 0.8899295926094055} 11/07/2021 12:59:32 - INFO - __main__ - Step 112038: {'lr': 7.687358037545819e-05, 'samples': 21511296, 'steps': 112037, 'loss/train': 0.8749821186065674} 11/07/2021 12:59:32 - INFO - __main__ - Step 112039: {'lr': 7.68697520631604e-05, 'samples': 21511488, 'steps': 112038, 'loss/train': 1.1905531883239746} 11/07/2021 12:59:32 - INFO - __main__ - Step 112040: {'lr': 7.686592382887236e-05, 'samples': 21511680, 'steps': 112039, 'loss/train': 1.6808552742004395} 11/07/2021 12:59:33 - INFO - __main__ - Step 112041: {'lr': 7.686209567259586e-05, 'samples': 21511872, 'steps': 112040, 'loss/train': 0.5486907362937927} 11/07/2021 12:59:34 - INFO - __main__ - Step 112042: {'lr': 7.685826759433263e-05, 'samples': 21512064, 'steps': 112041, 'loss/train': 1.4237700700759888} 11/07/2021 12:59:34 - INFO - __main__ - Step 112043: {'lr': 7.68544395940844e-05, 'samples': 21512256, 'steps': 112042, 'loss/train': 1.3201812505722046} 11/07/2021 12:59:34 - INFO - __main__ - Step 112044: {'lr': 7.685061167185287e-05, 'samples': 21512448, 'steps': 112043, 'loss/train': 1.424920678138733} 11/07/2021 12:59:35 - INFO - __main__ - Step 112045: {'lr': 7.684678382763979e-05, 'samples': 21512640, 'steps': 112044, 'loss/train': 1.2603673934936523} 11/07/2021 12:59:35 - INFO - __main__ - Step 112046: {'lr': 7.68429560614469e-05, 'samples': 21512832, 'steps': 112045, 'loss/train': 0.8867942690849304} 11/07/2021 12:59:36 - INFO - __main__ - Step 112047: {'lr': 7.683912837327589e-05, 'samples': 21513024, 'steps': 112046, 'loss/train': 0.89472895860672} 11/07/2021 12:59:36 - INFO - __main__ - Step 112048: {'lr': 7.683530076312848e-05, 'samples': 21513216, 'steps': 112047, 'loss/train': 1.3625224828720093} 11/07/2021 12:59:37 - INFO - __main__ - Step 112049: {'lr': 7.683147323100642e-05, 'samples': 21513408, 'steps': 112048, 'loss/train': 1.3979668617248535} 11/07/2021 12:59:37 - INFO - __main__ - Step 112050: {'lr': 7.682764577691151e-05, 'samples': 21513600, 'steps': 112049, 'loss/train': 0.9854925274848938} 11/07/2021 12:59:38 - INFO - __main__ - Step 112051: {'lr': 7.68238184008453e-05, 'samples': 21513792, 'steps': 112050, 'loss/train': 0.736030101776123} 11/07/2021 12:59:38 - INFO - __main__ - Step 112052: {'lr': 7.681999110280963e-05, 'samples': 21513984, 'steps': 112051, 'loss/train': 1.5329842567443848} 11/07/2021 12:59:39 - INFO - __main__ - Step 112053: {'lr': 7.681616388280619e-05, 'samples': 21514176, 'steps': 112052, 'loss/train': 1.2023653984069824} 11/07/2021 12:59:39 - INFO - __main__ - Step 112054: {'lr': 7.681233674083668e-05, 'samples': 21514368, 'steps': 112053, 'loss/train': 1.4042216539382935} 11/07/2021 12:59:40 - INFO - __main__ - Step 112055: {'lr': 7.68085096769029e-05, 'samples': 21514560, 'steps': 112054, 'loss/train': 1.1354979276657104} 11/07/2021 12:59:40 - INFO - __main__ - Step 112056: {'lr': 7.680468269100651e-05, 'samples': 21514752, 'steps': 112055, 'loss/train': 1.4515730142593384} 11/07/2021 12:59:40 - INFO - __main__ - Step 112057: {'lr': 7.680085578314927e-05, 'samples': 21514944, 'steps': 112056, 'loss/train': 0.7137318849563599} 11/07/2021 12:59:41 - INFO - __main__ - Step 112058: {'lr': 7.679702895333287e-05, 'samples': 21515136, 'steps': 112057, 'loss/train': 1.4259604215621948} 11/07/2021 12:59:42 - INFO - __main__ - Step 112059: {'lr': 7.679320220155908e-05, 'samples': 21515328, 'steps': 112058, 'loss/train': 1.4109363555908203} 11/07/2021 12:59:42 - INFO - __main__ - Step 112060: {'lr': 7.67893755278296e-05, 'samples': 21515520, 'steps': 112059, 'loss/train': 1.4211223125457764} 11/07/2021 12:59:42 - INFO - __main__ - Step 112061: {'lr': 7.678554893214623e-05, 'samples': 21515712, 'steps': 112060, 'loss/train': 1.3604174852371216} 11/07/2021 12:59:43 - INFO - __main__ - Step 112062: {'lr': 7.678172241451053e-05, 'samples': 21515904, 'steps': 112061, 'loss/train': 1.566056728363037} 11/07/2021 12:59:44 - INFO - __main__ - Step 112063: {'lr': 7.677789597492433e-05, 'samples': 21516096, 'steps': 112062, 'loss/train': 0.15530893206596375} 11/07/2021 12:59:44 - INFO - __main__ - Step 112064: {'lr': 7.677406961338935e-05, 'samples': 21516288, 'steps': 112063, 'loss/train': 1.401062250137329} 11/07/2021 12:59:45 - INFO - __main__ - Step 112065: {'lr': 7.677024332990726e-05, 'samples': 21516480, 'steps': 112064, 'loss/train': 1.4535675048828125} 11/07/2021 12:59:45 - INFO - __main__ - Step 112066: {'lr': 7.676641712447984e-05, 'samples': 21516672, 'steps': 112065, 'loss/train': 1.1467894315719604} 11/07/2021 12:59:45 - INFO - __main__ - Step 112067: {'lr': 7.67625909971088e-05, 'samples': 21516864, 'steps': 112066, 'loss/train': 1.315542459487915} 11/07/2021 12:59:46 - INFO - __main__ - Step 112068: {'lr': 7.675876494779587e-05, 'samples': 21517056, 'steps': 112067, 'loss/train': 5.644721508026123} 11/07/2021 12:59:47 - INFO - __main__ - Step 112069: {'lr': 7.675493897654276e-05, 'samples': 21517248, 'steps': 112068, 'loss/train': 1.1659849882125854} 11/07/2021 12:59:47 - INFO - __main__ - Step 112070: {'lr': 7.675111308335119e-05, 'samples': 21517440, 'steps': 112069, 'loss/train': 1.6495320796966553} 11/07/2021 12:59:47 - INFO - __main__ - Step 112071: {'lr': 7.674728726822294e-05, 'samples': 21517632, 'steps': 112070, 'loss/train': 0.9086334705352783} 11/07/2021 12:59:48 - INFO - __main__ - Step 112072: {'lr': 7.674346153115975e-05, 'samples': 21517824, 'steps': 112071, 'loss/train': 1.53413724899292} 11/07/2021 12:59:49 - INFO - __main__ - Step 112073: {'lr': 7.673963587216318e-05, 'samples': 21518016, 'steps': 112072, 'loss/train': 1.6542872190475464} 11/07/2021 12:59:49 - INFO - __main__ - Step 112074: {'lr': 7.673581029123506e-05, 'samples': 21518208, 'steps': 112073, 'loss/train': 0.7787349224090576} 11/07/2021 12:59:49 - INFO - __main__ - Step 112075: {'lr': 7.673198478837711e-05, 'samples': 21518400, 'steps': 112074, 'loss/train': 1.7090023756027222} 11/07/2021 12:59:50 - INFO - __main__ - Step 112076: {'lr': 7.672815936359106e-05, 'samples': 21518592, 'steps': 112075, 'loss/train': 1.3618158102035522} 11/07/2021 12:59:50 - INFO - __main__ - Step 112077: {'lr': 7.672433401687864e-05, 'samples': 21518784, 'steps': 112076, 'loss/train': 1.901769757270813} 11/07/2021 12:59:50 - INFO - __main__ - Step 112078: {'lr': 7.672050874824154e-05, 'samples': 21518976, 'steps': 112077, 'loss/train': 1.2632242441177368} 11/07/2021 12:59:52 - INFO - __main__ - Step 112079: {'lr': 7.671668355768152e-05, 'samples': 21519168, 'steps': 112078, 'loss/train': 1.2516181468963623} 11/07/2021 12:59:52 - INFO - __main__ - Step 112080: {'lr': 7.67128584452003e-05, 'samples': 21519360, 'steps': 112079, 'loss/train': 0.9957550168037415} 11/07/2021 12:59:52 - INFO - __main__ - Step 112081: {'lr': 7.670903341079957e-05, 'samples': 21519552, 'steps': 112080, 'loss/train': 0.8621919751167297} 11/07/2021 12:59:53 - INFO - __main__ - Step 112082: {'lr': 7.670520845448109e-05, 'samples': 21519744, 'steps': 112081, 'loss/train': 0.9189344644546509} 11/07/2021 12:59:53 - INFO - __main__ - Step 112083: {'lr': 7.670138357624665e-05, 'samples': 21519936, 'steps': 112082, 'loss/train': 0.6305346488952637} 11/07/2021 12:59:54 - INFO - __main__ - Step 112084: {'lr': 7.66975587760978e-05, 'samples': 21520128, 'steps': 112083, 'loss/train': 1.3863288164138794} 11/07/2021 12:59:54 - INFO - __main__ - Step 112085: {'lr': 7.669373405403635e-05, 'samples': 21520320, 'steps': 112084, 'loss/train': 0.607761025428772} 11/07/2021 12:59:55 - INFO - __main__ - Step 112086: {'lr': 7.668990941006404e-05, 'samples': 21520512, 'steps': 112085, 'loss/train': 1.3819903135299683} 11/07/2021 12:59:55 - INFO - __main__ - Step 112087: {'lr': 7.668608484418257e-05, 'samples': 21520704, 'steps': 112086, 'loss/train': 1.112589716911316} 11/07/2021 12:59:55 - INFO - __main__ - Step 112088: {'lr': 7.66822603563937e-05, 'samples': 21520896, 'steps': 112087, 'loss/train': 1.3657315969467163} 11/07/2021 12:59:57 - INFO - __main__ - Step 112089: {'lr': 7.66784359466991e-05, 'samples': 21521088, 'steps': 112088, 'loss/train': 1.11689293384552} 11/07/2021 12:59:57 - INFO - __main__ - Step 112090: {'lr': 7.667461161510056e-05, 'samples': 21521280, 'steps': 112089, 'loss/train': 1.2995103597640991} 11/07/2021 12:59:57 - INFO - __main__ - Step 112091: {'lr': 7.667078736159974e-05, 'samples': 21521472, 'steps': 112090, 'loss/train': 1.0503573417663574} 11/07/2021 12:59:58 - INFO - __main__ - Step 112092: {'lr': 7.66669631861984e-05, 'samples': 21521664, 'steps': 112091, 'loss/train': 1.5416061878204346} 11/07/2021 12:59:58 - INFO - __main__ - Step 112093: {'lr': 7.666313908889822e-05, 'samples': 21521856, 'steps': 112092, 'loss/train': 1.3149042129516602} 11/07/2021 12:59:59 - INFO - __main__ - Step 112094: {'lr': 7.665931506970105e-05, 'samples': 21522048, 'steps': 112093, 'loss/train': 1.0934946537017822} 11/07/2021 13:00:00 - INFO - __main__ - Step 112095: {'lr': 7.665549112860845e-05, 'samples': 21522240, 'steps': 112094, 'loss/train': 0.6486417651176453} 11/07/2021 13:00:00 - INFO - __main__ - Step 112096: {'lr': 7.665166726562223e-05, 'samples': 21522432, 'steps': 112095, 'loss/train': 1.4250329732894897} 11/07/2021 13:00:00 - INFO - __main__ - Step 112097: {'lr': 7.664784348074404e-05, 'samples': 21522624, 'steps': 112096, 'loss/train': 1.3461544513702393} 11/07/2021 13:00:01 - INFO - __main__ - Step 112098: {'lr': 7.66440197739757e-05, 'samples': 21522816, 'steps': 112097, 'loss/train': 1.0384997129440308} 11/07/2021 13:00:02 - INFO - __main__ - Step 112099: {'lr': 7.66401961453189e-05, 'samples': 21523008, 'steps': 112098, 'loss/train': 1.481756567955017} 11/07/2021 13:00:02 - INFO - __main__ - Step 112100: {'lr': 7.663637259477532e-05, 'samples': 21523200, 'steps': 112099, 'loss/train': 1.2812795639038086} 11/07/2021 13:00:02 - INFO - __main__ - Step 112101: {'lr': 7.663254912234671e-05, 'samples': 21523392, 'steps': 112100, 'loss/train': 1.1827889680862427} 11/07/2021 13:00:03 - INFO - __main__ - Step 112102: {'lr': 7.662872572803484e-05, 'samples': 21523584, 'steps': 112101, 'loss/train': 1.311134696006775} 11/07/2021 13:00:03 - INFO - __main__ - Step 112103: {'lr': 7.662490241184134e-05, 'samples': 21523776, 'steps': 112102, 'loss/train': 1.0187097787857056} 11/07/2021 13:00:04 - INFO - __main__ - Step 112104: {'lr': 7.662107917376802e-05, 'samples': 21523968, 'steps': 112103, 'loss/train': 0.7894670367240906} 11/07/2021 13:00:04 - INFO - __main__ - Step 112105: {'lr': 7.661725601381655e-05, 'samples': 21524160, 'steps': 112104, 'loss/train': 0.8853874802589417} 11/07/2021 13:00:05 - INFO - __main__ - Step 112106: {'lr': 7.661343293198866e-05, 'samples': 21524352, 'steps': 112105, 'loss/train': 1.870097041130066} 11/07/2021 13:00:05 - INFO - __main__ - Step 112107: {'lr': 7.660960992828619e-05, 'samples': 21524544, 'steps': 112106, 'loss/train': 1.2551590204238892} 11/07/2021 13:00:06 - INFO - __main__ - Step 112108: {'lr': 7.660578700271064e-05, 'samples': 21524736, 'steps': 112107, 'loss/train': 1.490772008895874} 11/07/2021 13:00:06 - INFO - __main__ - Step 112109: {'lr': 7.660196415526388e-05, 'samples': 21524928, 'steps': 112108, 'loss/train': 1.5718952417373657} 11/07/2021 13:00:07 - INFO - __main__ - Step 112110: {'lr': 7.659814138594759e-05, 'samples': 21525120, 'steps': 112109, 'loss/train': 1.301326036453247} 11/07/2021 13:00:07 - INFO - __main__ - Step 112111: {'lr': 7.65943186947635e-05, 'samples': 21525312, 'steps': 112110, 'loss/train': 1.44242262840271} 11/07/2021 13:00:08 - INFO - __main__ - Step 112112: {'lr': 7.659049608171334e-05, 'samples': 21525504, 'steps': 112111, 'loss/train': 1.330289363861084} 11/07/2021 13:00:08 - INFO - __main__ - Step 112113: {'lr': 7.65866735467988e-05, 'samples': 21525696, 'steps': 112112, 'loss/train': 1.7746074199676514} 11/07/2021 13:00:08 - INFO - __main__ - Step 112114: {'lr': 7.658285109002164e-05, 'samples': 21525888, 'steps': 112113, 'loss/train': 0.11676155030727386} 11/07/2021 13:00:09 - INFO - __main__ - Step 112115: {'lr': 7.657902871138359e-05, 'samples': 21526080, 'steps': 112114, 'loss/train': 1.4798321723937988} 11/07/2021 13:00:10 - INFO - __main__ - Step 112116: {'lr': 7.657520641088634e-05, 'samples': 21526272, 'steps': 112115, 'loss/train': 1.5907222032546997} 11/07/2021 13:00:10 - INFO - __main__ - Step 112117: {'lr': 7.657138418853162e-05, 'samples': 21526464, 'steps': 112116, 'loss/train': 1.5306538343429565} 11/07/2021 13:00:10 - INFO - __main__ - Step 112118: {'lr': 7.656756204432116e-05, 'samples': 21526656, 'steps': 112117, 'loss/train': 1.0859637260437012} 11/07/2021 13:00:11 - INFO - __main__ - Step 112119: {'lr': 7.65637399782567e-05, 'samples': 21526848, 'steps': 112118, 'loss/train': 0.9258933663368225} 11/07/2021 13:00:12 - INFO - __main__ - Step 112120: {'lr': 7.655991799033992e-05, 'samples': 21527040, 'steps': 112119, 'loss/train': 1.1451455354690552} 11/07/2021 13:00:12 - INFO - __main__ - Step 112121: {'lr': 7.655609608057265e-05, 'samples': 21527232, 'steps': 112120, 'loss/train': 1.31475830078125} 11/07/2021 13:00:13 - INFO - __main__ - Step 112122: {'lr': 7.655227424895647e-05, 'samples': 21527424, 'steps': 112121, 'loss/train': 1.1537562608718872} 11/07/2021 13:00:13 - INFO - __main__ - Step 112123: {'lr': 7.654845249549314e-05, 'samples': 21527616, 'steps': 112122, 'loss/train': 1.405279517173767} 11/07/2021 13:00:13 - INFO - __main__ - Step 112124: {'lr': 7.65446308201844e-05, 'samples': 21527808, 'steps': 112123, 'loss/train': 1.2482131719589233} 11/07/2021 13:00:14 - INFO - __main__ - Step 112125: {'lr': 7.654080922303198e-05, 'samples': 21528000, 'steps': 112124, 'loss/train': 1.1702395677566528} 11/07/2021 13:00:15 - INFO - __main__ - Step 112126: {'lr': 7.653698770403755e-05, 'samples': 21528192, 'steps': 112125, 'loss/train': 5.167581081390381} 11/07/2021 13:00:15 - INFO - __main__ - Step 112127: {'lr': 7.653316626320292e-05, 'samples': 21528384, 'steps': 112126, 'loss/train': 1.311768889427185} 11/07/2021 13:00:15 - INFO - __main__ - Step 112128: {'lr': 7.652934490052977e-05, 'samples': 21528576, 'steps': 112127, 'loss/train': 1.6497163772583008} 11/07/2021 13:00:16 - INFO - __main__ - Step 112129: {'lr': 7.652552361601981e-05, 'samples': 21528768, 'steps': 112128, 'loss/train': 1.4339975118637085} 11/07/2021 13:00:16 - INFO - __main__ - Step 112130: {'lr': 7.652170240967477e-05, 'samples': 21528960, 'steps': 112129, 'loss/train': 1.1123689413070679} 11/07/2021 13:00:17 - INFO - __main__ - Step 112131: {'lr': 7.651788128149639e-05, 'samples': 21529152, 'steps': 112130, 'loss/train': 1.3910243511199951} 11/07/2021 13:00:18 - INFO - __main__ - Step 112132: {'lr': 7.651406023148635e-05, 'samples': 21529344, 'steps': 112131, 'loss/train': 1.1842131614685059} 11/07/2021 13:00:18 - INFO - __main__ - Step 112133: {'lr': 7.651023925964642e-05, 'samples': 21529536, 'steps': 112132, 'loss/train': 1.30616295337677} 11/07/2021 13:00:19 - INFO - __main__ - Step 112134: {'lr': 7.650641836597838e-05, 'samples': 21529728, 'steps': 112133, 'loss/train': 1.033524513244629} 11/07/2021 13:00:19 - INFO - __main__ - Step 112135: {'lr': 7.650259755048378e-05, 'samples': 21529920, 'steps': 112134, 'loss/train': 1.2932647466659546} 11/07/2021 13:00:20 - INFO - __main__ - Step 112136: {'lr': 7.649877681316441e-05, 'samples': 21530112, 'steps': 112135, 'loss/train': 1.3944209814071655} 11/07/2021 13:00:20 - INFO - __main__ - Step 112137: {'lr': 7.649495615402205e-05, 'samples': 21530304, 'steps': 112136, 'loss/train': 1.2395381927490234} 11/07/2021 13:00:21 - INFO - __main__ - Step 112138: {'lr': 7.649113557305836e-05, 'samples': 21530496, 'steps': 112137, 'loss/train': 1.4160059690475464} 11/07/2021 13:00:21 - INFO - __main__ - Step 112139: {'lr': 7.648731507027511e-05, 'samples': 21530688, 'steps': 112138, 'loss/train': 1.4115657806396484} 11/07/2021 13:00:21 - INFO - __main__ - Step 112140: {'lr': 7.6483494645674e-05, 'samples': 21530880, 'steps': 112139, 'loss/train': 0.943827211856842} 11/07/2021 13:00:22 - INFO - __main__ - Step 112141: {'lr': 7.647967429925673e-05, 'samples': 21531072, 'steps': 112140, 'loss/train': 1.4737179279327393} 11/07/2021 13:00:23 - INFO - __main__ - Step 112142: {'lr': 7.647585403102506e-05, 'samples': 21531264, 'steps': 112141, 'loss/train': 1.567901372909546} 11/07/2021 13:00:23 - INFO - __main__ - Step 112143: {'lr': 7.647203384098067e-05, 'samples': 21531456, 'steps': 112142, 'loss/train': 1.2708749771118164} 11/07/2021 13:00:24 - INFO - __main__ - Step 112144: {'lr': 7.646821372912533e-05, 'samples': 21531648, 'steps': 112143, 'loss/train': 0.8923711180686951} 11/07/2021 13:00:24 - INFO - __main__ - Step 112145: {'lr': 7.64643936954607e-05, 'samples': 21531840, 'steps': 112144, 'loss/train': 0.9809072613716125} 11/07/2021 13:00:24 - INFO - __main__ - Step 112146: {'lr': 7.646057373998858e-05, 'samples': 21532032, 'steps': 112145, 'loss/train': 0.745877742767334} 11/07/2021 13:00:25 - INFO - __main__ - Step 112147: {'lr': 7.645675386271062e-05, 'samples': 21532224, 'steps': 112146, 'loss/train': 0.4957447350025177} 11/07/2021 13:00:26 - INFO - __main__ - Step 112148: {'lr': 7.645293406362863e-05, 'samples': 21532416, 'steps': 112147, 'loss/train': 1.2138521671295166} 11/07/2021 13:00:26 - INFO - __main__ - Step 112149: {'lr': 7.644911434274423e-05, 'samples': 21532608, 'steps': 112148, 'loss/train': 1.3227163553237915} 11/07/2021 13:00:26 - INFO - __main__ - Step 112150: {'lr': 7.644529470005917e-05, 'samples': 21532800, 'steps': 112149, 'loss/train': 1.4674978256225586} 11/07/2021 13:00:27 - INFO - __main__ - Step 112151: {'lr': 7.644147513557517e-05, 'samples': 21532992, 'steps': 112150, 'loss/train': 1.4843758344650269} 11/07/2021 13:00:28 - INFO - __main__ - Step 112152: {'lr': 7.643765564929397e-05, 'samples': 21533184, 'steps': 112151, 'loss/train': 1.136168122291565} 11/07/2021 13:00:28 - INFO - __main__ - Step 112153: {'lr': 7.643383624121727e-05, 'samples': 21533376, 'steps': 112152, 'loss/train': 1.1132609844207764} 11/07/2021 13:00:28 - INFO - __main__ - Step 112154: {'lr': 7.643001691134682e-05, 'samples': 21533568, 'steps': 112153, 'loss/train': 1.1187185049057007} 11/07/2021 13:00:29 - INFO - __main__ - Step 112155: {'lr': 7.642619765968433e-05, 'samples': 21533760, 'steps': 112154, 'loss/train': 0.8824843764305115} 11/07/2021 13:00:29 - INFO - __main__ - Step 112156: {'lr': 7.642237848623151e-05, 'samples': 21533952, 'steps': 112155, 'loss/train': 1.5424838066101074} 11/07/2021 13:00:30 - INFO - __main__ - Step 112157: {'lr': 7.64185593909901e-05, 'samples': 21534144, 'steps': 112156, 'loss/train': 1.0103825330734253} 11/07/2021 13:00:31 - INFO - __main__ - Step 112158: {'lr': 7.64147403739618e-05, 'samples': 21534336, 'steps': 112157, 'loss/train': 1.5725305080413818} 11/07/2021 13:00:31 - INFO - __main__ - Step 112159: {'lr': 7.641092143514832e-05, 'samples': 21534528, 'steps': 112158, 'loss/train': 1.3117870092391968} 11/07/2021 13:00:31 - INFO - __main__ - Step 112160: {'lr': 7.640710257455143e-05, 'samples': 21534720, 'steps': 112159, 'loss/train': 1.0233627557754517} 11/07/2021 13:00:32 - INFO - __main__ - Step 112161: {'lr': 7.64032837921729e-05, 'samples': 21534912, 'steps': 112160, 'loss/train': 1.2543469667434692} 11/07/2021 13:00:32 - INFO - __main__ - Step 112162: {'lr': 7.639946508801427e-05, 'samples': 21535104, 'steps': 112161, 'loss/train': 1.6166515350341797} 11/07/2021 13:00:33 - INFO - __main__ - Step 112163: {'lr': 7.639564646207737e-05, 'samples': 21535296, 'steps': 112162, 'loss/train': 1.530747413635254} 11/07/2021 13:00:33 - INFO - __main__ - Step 112164: {'lr': 7.639182791436392e-05, 'samples': 21535488, 'steps': 112163, 'loss/train': 1.521694540977478} 11/07/2021 13:00:34 - INFO - __main__ - Step 112165: {'lr': 7.638800944487561e-05, 'samples': 21535680, 'steps': 112164, 'loss/train': 4.0558953285217285} 11/07/2021 13:00:34 - INFO - __main__ - Step 112166: {'lr': 7.638419105361422e-05, 'samples': 21535872, 'steps': 112165, 'loss/train': 1.4169856309890747} 11/07/2021 13:00:35 - INFO - __main__ - Step 112167: {'lr': 7.63803727405814e-05, 'samples': 21536064, 'steps': 112166, 'loss/train': 2.1689064502716064} 11/07/2021 13:00:36 - INFO - __main__ - Step 112168: {'lr': 7.637655450577893e-05, 'samples': 21536256, 'steps': 112167, 'loss/train': 1.4688830375671387} 11/07/2021 13:00:36 - INFO - __main__ - Step 112169: {'lr': 7.637273634920849e-05, 'samples': 21536448, 'steps': 112168, 'loss/train': 1.9911984205245972} 11/07/2021 13:00:36 - INFO - __main__ - Step 112170: {'lr': 7.636891827087183e-05, 'samples': 21536640, 'steps': 112169, 'loss/train': 1.3738913536071777} 11/07/2021 13:00:37 - INFO - __main__ - Step 112171: {'lr': 7.636510027077065e-05, 'samples': 21536832, 'steps': 112170, 'loss/train': 1.663400650024414} 11/07/2021 13:00:37 - INFO - __main__ - Step 112172: {'lr': 7.636128234890669e-05, 'samples': 21537024, 'steps': 112171, 'loss/train': 1.3288639783859253} 11/07/2021 13:00:38 - INFO - __main__ - Step 112173: {'lr': 7.635746450528164e-05, 'samples': 21537216, 'steps': 112172, 'loss/train': 1.4289202690124512} 11/07/2021 13:00:39 - INFO - __main__ - Step 112174: {'lr': 7.635364673989722e-05, 'samples': 21537408, 'steps': 112173, 'loss/train': 1.1996992826461792} 11/07/2021 13:00:39 - INFO - __main__ - Step 112175: {'lr': 7.634982905275528e-05, 'samples': 21537600, 'steps': 112174, 'loss/train': 1.165361762046814} 11/07/2021 13:00:39 - INFO - __main__ - Step 112176: {'lr': 7.634601144385733e-05, 'samples': 21537792, 'steps': 112175, 'loss/train': 1.6218098402023315} 11/07/2021 13:00:40 - INFO - __main__ - Step 112177: {'lr': 7.63421939132052e-05, 'samples': 21537984, 'steps': 112176, 'loss/train': 1.2190122604370117} 11/07/2021 13:00:40 - INFO - __main__ - Step 112178: {'lr': 7.633837646080058e-05, 'samples': 21538176, 'steps': 112177, 'loss/train': 1.7676737308502197} 11/07/2021 13:00:41 - INFO - __main__ - Step 112179: {'lr': 7.633455908664522e-05, 'samples': 21538368, 'steps': 112178, 'loss/train': 1.183923602104187} 11/07/2021 13:00:42 - INFO - __main__ - Step 112180: {'lr': 7.633074179074085e-05, 'samples': 21538560, 'steps': 112179, 'loss/train': 1.5108171701431274} 11/07/2021 13:00:42 - INFO - __main__ - Step 112181: {'lr': 7.632692457308912e-05, 'samples': 21538752, 'steps': 112180, 'loss/train': 0.5280315279960632} 11/07/2021 13:00:42 - INFO - __main__ - Step 112182: {'lr': 7.632310743369183e-05, 'samples': 21538944, 'steps': 112181, 'loss/train': 1.7576148509979248} 11/07/2021 13:00:43 - INFO - __main__ - Step 112183: {'lr': 7.631929037255064e-05, 'samples': 21539136, 'steps': 112182, 'loss/train': 1.3975356817245483} 11/07/2021 13:00:44 - INFO - __main__ - Step 112184: {'lr': 7.631547338966733e-05, 'samples': 21539328, 'steps': 112183, 'loss/train': 0.9471161365509033} 11/07/2021 13:00:45 - INFO - __main__ - Step 112185: {'lr': 7.631165648504357e-05, 'samples': 21539520, 'steps': 112184, 'loss/train': 1.434158444404602} 11/07/2021 13:00:45 - INFO - __main__ - Step 112186: {'lr': 7.63078396586811e-05, 'samples': 21539712, 'steps': 112185, 'loss/train': 1.8434127569198608} 11/07/2021 13:00:45 - INFO - __main__ - Step 112187: {'lr': 7.630402291058164e-05, 'samples': 21539904, 'steps': 112186, 'loss/train': 1.0558141469955444} 11/07/2021 13:00:46 - INFO - __main__ - Step 112188: {'lr': 7.630020624074699e-05, 'samples': 21540096, 'steps': 112187, 'loss/train': 0.2080109715461731} 11/07/2021 13:00:46 - INFO - __main__ - Step 112189: {'lr': 7.62963896491787e-05, 'samples': 21540288, 'steps': 112188, 'loss/train': 1.5838426351547241} 11/07/2021 13:00:47 - INFO - __main__ - Step 112190: {'lr': 7.62925731358786e-05, 'samples': 21540480, 'steps': 112189, 'loss/train': 1.5651220083236694} 11/07/2021 13:00:47 - INFO - __main__ - Step 112191: {'lr': 7.628875670084834e-05, 'samples': 21540672, 'steps': 112190, 'loss/train': 1.6294255256652832} 11/07/2021 13:00:48 - INFO - __main__ - Step 112192: {'lr': 7.628494034408972e-05, 'samples': 21540864, 'steps': 112191, 'loss/train': 1.4615793228149414} 11/07/2021 13:00:48 - INFO - __main__ - Step 112193: {'lr': 7.628112406560441e-05, 'samples': 21541056, 'steps': 112192, 'loss/train': 1.4553042650222778} 11/07/2021 13:00:48 - INFO - __main__ - Step 112194: {'lr': 7.627730786539416e-05, 'samples': 21541248, 'steps': 112193, 'loss/train': 1.2155038118362427} 11/07/2021 13:00:49 - INFO - __main__ - Step 112195: {'lr': 7.627349174346065e-05, 'samples': 21541440, 'steps': 112194, 'loss/train': 1.2396782636642456} 11/07/2021 13:00:50 - INFO - __main__ - Step 112196: {'lr': 7.626967569980564e-05, 'samples': 21541632, 'steps': 112195, 'loss/train': 1.4115478992462158} 11/07/2021 13:00:50 - INFO - __main__ - Step 112197: {'lr': 7.626585973443084e-05, 'samples': 21541824, 'steps': 112196, 'loss/train': 1.2372398376464844} 11/07/2021 13:00:50 - INFO - __main__ - Step 112198: {'lr': 7.626204384733795e-05, 'samples': 21542016, 'steps': 112197, 'loss/train': 1.3281800746917725} 11/07/2021 13:00:51 - INFO - __main__ - Step 112199: {'lr': 7.625822803852872e-05, 'samples': 21542208, 'steps': 112198, 'loss/train': 1.4774900674819946} 11/07/2021 13:00:52 - INFO - __main__ - Step 112200: {'lr': 7.625441230800484e-05, 'samples': 21542400, 'steps': 112199, 'loss/train': 1.3354982137680054} 11/07/2021 13:00:52 - INFO - __main__ - Step 112201: {'lr': 7.625059665576803e-05, 'samples': 21542592, 'steps': 112200, 'loss/train': 0.6541229486465454} 11/07/2021 13:00:53 - INFO - __main__ - Step 112202: {'lr': 7.624678108182009e-05, 'samples': 21542784, 'steps': 112201, 'loss/train': 1.3686182498931885} 11/07/2021 13:00:53 - INFO - __main__ - Step 112203: {'lr': 7.624296558616261e-05, 'samples': 21542976, 'steps': 112202, 'loss/train': 1.329926609992981} 11/07/2021 13:00:53 - INFO - __main__ - Step 112204: {'lr': 7.623915016879737e-05, 'samples': 21543168, 'steps': 112203, 'loss/train': 1.04850435256958} 11/07/2021 13:00:54 - INFO - __main__ - Step 112205: {'lr': 7.623533482972608e-05, 'samples': 21543360, 'steps': 112204, 'loss/train': 1.0595216751098633} 11/07/2021 13:00:55 - INFO - __main__ - Step 112206: {'lr': 7.62315195689505e-05, 'samples': 21543552, 'steps': 112205, 'loss/train': 1.5458332300186157} 11/07/2021 13:00:55 - INFO - __main__ - Step 112207: {'lr': 7.622770438647227e-05, 'samples': 21543744, 'steps': 112206, 'loss/train': 0.7497797012329102} 11/07/2021 13:00:55 - INFO - __main__ - Step 112208: {'lr': 7.622388928229318e-05, 'samples': 21543936, 'steps': 112207, 'loss/train': 1.351815938949585} 11/07/2021 13:00:56 - INFO - __main__ - Step 112209: {'lr': 7.622007425641491e-05, 'samples': 21544128, 'steps': 112208, 'loss/train': 1.5177513360977173} 11/07/2021 13:00:57 - INFO - __main__ - Step 112210: {'lr': 7.621625930883922e-05, 'samples': 21544320, 'steps': 112209, 'loss/train': 1.4494973421096802} 11/07/2021 13:00:57 - INFO - __main__ - Step 112211: {'lr': 7.621244443956776e-05, 'samples': 21544512, 'steps': 112210, 'loss/train': 1.341911792755127} 11/07/2021 13:00:57 - INFO - __main__ - Step 112212: {'lr': 7.620862964860231e-05, 'samples': 21544704, 'steps': 112211, 'loss/train': 1.238002896308899} 11/07/2021 13:00:58 - INFO - __main__ - Step 112213: {'lr': 7.620481493594458e-05, 'samples': 21544896, 'steps': 112212, 'loss/train': 1.6044398546218872} 11/07/2021 13:00:58 - INFO - __main__ - Step 112214: {'lr': 7.620100030159627e-05, 'samples': 21545088, 'steps': 112213, 'loss/train': 1.2015674114227295} 11/07/2021 13:00:58 - INFO - __main__ - Step 112215: {'lr': 7.619718574555917e-05, 'samples': 21545280, 'steps': 112214, 'loss/train': 1.4767459630966187} 11/07/2021 13:01:00 - INFO - __main__ - Step 112216: {'lr': 7.619337126783488e-05, 'samples': 21545472, 'steps': 112215, 'loss/train': 1.253984808921814} 11/07/2021 13:01:00 - INFO - __main__ - Step 112217: {'lr': 7.618955686842519e-05, 'samples': 21545664, 'steps': 112216, 'loss/train': 0.38396984338760376} 11/07/2021 13:01:01 - INFO - __main__ - Step 112218: {'lr': 7.618574254733177e-05, 'samples': 21545856, 'steps': 112217, 'loss/train': 1.5897979736328125} 11/07/2021 13:01:01 - INFO - __main__ - Step 112219: {'lr': 7.618192830455639e-05, 'samples': 21546048, 'steps': 112218, 'loss/train': 1.3177766799926758} 11/07/2021 13:01:01 - INFO - __main__ - Step 112220: {'lr': 7.617811414010073e-05, 'samples': 21546240, 'steps': 112219, 'loss/train': 0.9654430747032166} 11/07/2021 13:01:02 - INFO - __main__ - Step 112221: {'lr': 7.617430005396656e-05, 'samples': 21546432, 'steps': 112220, 'loss/train': 0.49197179079055786} 11/07/2021 13:01:03 - INFO - __main__ - Step 112222: {'lr': 7.617048604615554e-05, 'samples': 21546624, 'steps': 112221, 'loss/train': 0.6851052045822144} 11/07/2021 13:01:03 - INFO - __main__ - Step 112223: {'lr': 7.616667211666944e-05, 'samples': 21546816, 'steps': 112222, 'loss/train': 1.3484649658203125} 11/07/2021 13:01:04 - INFO - __main__ - Step 112224: {'lr': 7.616285826550995e-05, 'samples': 21547008, 'steps': 112223, 'loss/train': 1.5598723888397217} 11/07/2021 13:01:04 - INFO - __main__ - Step 112225: {'lr': 7.615904449267877e-05, 'samples': 21547200, 'steps': 112224, 'loss/train': 0.35347461700439453} 11/07/2021 13:01:05 - INFO - __main__ - Step 112226: {'lr': 7.615523079817765e-05, 'samples': 21547392, 'steps': 112225, 'loss/train': 0.09111226350069046} 11/07/2021 13:01:05 - INFO - __main__ - Step 112227: {'lr': 7.615141718200832e-05, 'samples': 21547584, 'steps': 112226, 'loss/train': 1.4563264846801758} 11/07/2021 13:01:06 - INFO - __main__ - Step 112228: {'lr': 7.614760364417256e-05, 'samples': 21547776, 'steps': 112227, 'loss/train': 1.150647759437561} 11/07/2021 13:01:06 - INFO - __main__ - Step 112229: {'lr': 7.61437901846719e-05, 'samples': 21547968, 'steps': 112228, 'loss/train': 1.116710901260376} 11/07/2021 13:01:06 - INFO - __main__ - Step 112230: {'lr': 7.613997680350821e-05, 'samples': 21548160, 'steps': 112229, 'loss/train': 1.290346384048462} 11/07/2021 13:01:08 - INFO - __main__ - Step 112231: {'lr': 7.613616350068312e-05, 'samples': 21548352, 'steps': 112230, 'loss/train': 1.4587156772613525} 11/07/2021 13:01:09 - INFO - __main__ - Step 112232: {'lr': 7.613235027619841e-05, 'samples': 21548544, 'steps': 112231, 'loss/train': 1.1292139291763306} 11/07/2021 13:01:09 - INFO - __main__ - Step 112233: {'lr': 7.612853713005577e-05, 'samples': 21548736, 'steps': 112232, 'loss/train': 1.517744541168213} 11/07/2021 13:01:09 - INFO - __main__ - Step 112234: {'lr': 7.612472406225696e-05, 'samples': 21548928, 'steps': 112233, 'loss/train': 1.3170539140701294} 11/07/2021 13:01:10 - INFO - __main__ - Step 112235: {'lr': 7.612091107280364e-05, 'samples': 21549120, 'steps': 112234, 'loss/train': 1.2012014389038086} 11/07/2021 13:01:10 - INFO - __main__ - Step 112236: {'lr': 7.611709816169754e-05, 'samples': 21549312, 'steps': 112235, 'loss/train': 0.7529308199882507} 11/07/2021 13:01:10 - INFO - __main__ - Step 112237: {'lr': 7.61132853289404e-05, 'samples': 21549504, 'steps': 112236, 'loss/train': 0.6828066110610962} 11/07/2021 13:01:11 - INFO - __main__ - Step 112238: {'lr': 7.610947257453396e-05, 'samples': 21549696, 'steps': 112237, 'loss/train': 0.646206796169281} 11/07/2021 13:01:12 - INFO - __main__ - Step 112239: {'lr': 7.610565989847987e-05, 'samples': 21549888, 'steps': 112238, 'loss/train': 0.939633846282959} 11/07/2021 13:01:12 - INFO - __main__ - Step 112240: {'lr': 7.610184730077991e-05, 'samples': 21550080, 'steps': 112239, 'loss/train': 1.6367344856262207} 11/07/2021 13:01:13 - INFO - __main__ - Step 112241: {'lr': 7.609803478143576e-05, 'samples': 21550272, 'steps': 112240, 'loss/train': 1.1985498666763306} 11/07/2021 13:01:13 - INFO - __main__ - Step 112242: {'lr': 7.609422234044924e-05, 'samples': 21550464, 'steps': 112241, 'loss/train': 0.6826992034912109} 11/07/2021 13:01:13 - INFO - __main__ - Step 112243: {'lr': 7.609040997782191e-05, 'samples': 21550656, 'steps': 112242, 'loss/train': 1.4705324172973633} 11/07/2021 13:01:14 - INFO - __main__ - Step 112244: {'lr': 7.608659769355555e-05, 'samples': 21550848, 'steps': 112243, 'loss/train': 1.234243392944336} 11/07/2021 13:01:15 - INFO - __main__ - Step 112245: {'lr': 7.608278548765187e-05, 'samples': 21551040, 'steps': 112244, 'loss/train': 1.5987801551818848} 11/07/2021 13:01:15 - INFO - __main__ - Step 112246: {'lr': 7.607897336011263e-05, 'samples': 21551232, 'steps': 112245, 'loss/train': 1.170837163925171} 11/07/2021 13:01:15 - INFO - __main__ - Step 112247: {'lr': 7.60751613109395e-05, 'samples': 21551424, 'steps': 112246, 'loss/train': 1.5739694833755493} 11/07/2021 13:01:16 - INFO - __main__ - Step 112248: {'lr': 7.607134934013424e-05, 'samples': 21551616, 'steps': 112247, 'loss/train': 1.295827031135559} 11/07/2021 13:01:16 - INFO - __main__ - Step 112249: {'lr': 7.60675374476985e-05, 'samples': 21551808, 'steps': 112248, 'loss/train': 1.3899164199829102} 11/07/2021 13:01:17 - INFO - __main__ - Step 112250: {'lr': 7.606372563363409e-05, 'samples': 21552000, 'steps': 112249, 'loss/train': 0.8059987425804138} 11/07/2021 13:01:17 - INFO - __main__ - Step 112251: {'lr': 7.605991389794267e-05, 'samples': 21552192, 'steps': 112250, 'loss/train': 1.61483633518219} 11/07/2021 13:01:18 - INFO - __main__ - Step 112252: {'lr': 7.605610224062598e-05, 'samples': 21552384, 'steps': 112251, 'loss/train': 1.1339294910430908} 11/07/2021 13:01:18 - INFO - __main__ - Step 112253: {'lr': 7.60522906616857e-05, 'samples': 21552576, 'steps': 112252, 'loss/train': 1.608298659324646} 11/07/2021 13:01:19 - INFO - __main__ - Step 112254: {'lr': 7.60484791611236e-05, 'samples': 21552768, 'steps': 112253, 'loss/train': 1.197544813156128} 11/07/2021 13:01:20 - INFO - __main__ - Step 112255: {'lr': 7.604466773894142e-05, 'samples': 21552960, 'steps': 112254, 'loss/train': 1.1851199865341187} 11/07/2021 13:01:20 - INFO - __main__ - Step 112256: {'lr': 7.604085639514077e-05, 'samples': 21553152, 'steps': 112255, 'loss/train': 1.1502803564071655} 11/07/2021 13:01:21 - INFO - __main__ - Step 112257: {'lr': 7.603704512972342e-05, 'samples': 21553344, 'steps': 112256, 'loss/train': 0.8956708312034607} 11/07/2021 13:01:21 - INFO - __main__ - Step 112258: {'lr': 7.60332339426911e-05, 'samples': 21553536, 'steps': 112257, 'loss/train': 1.4785178899765015} 11/07/2021 13:01:22 - INFO - __main__ - Step 112259: {'lr': 7.602942283404551e-05, 'samples': 21553728, 'steps': 112258, 'loss/train': 0.27256229519844055} 11/07/2021 13:01:23 - INFO - __main__ - Step 112260: {'lr': 7.602561180378837e-05, 'samples': 21553920, 'steps': 112259, 'loss/train': 1.4366716146469116} 11/07/2021 13:01:23 - INFO - __main__ - Step 112261: {'lr': 7.602180085192142e-05, 'samples': 21554112, 'steps': 112260, 'loss/train': 1.2316333055496216} 11/07/2021 13:01:23 - INFO - __main__ - Step 112262: {'lr': 7.601798997844636e-05, 'samples': 21554304, 'steps': 112261, 'loss/train': 1.5635720491409302} 11/07/2021 13:01:24 - INFO - __main__ - Step 112263: {'lr': 7.601417918336489e-05, 'samples': 21554496, 'steps': 112262, 'loss/train': 1.097198247909546} 11/07/2021 13:01:24 - INFO - __main__ - Step 112264: {'lr': 7.601036846667877e-05, 'samples': 21554688, 'steps': 112263, 'loss/train': 1.3098894357681274} 11/07/2021 13:01:25 - INFO - __main__ - Step 112265: {'lr': 7.60065578283897e-05, 'samples': 21554880, 'steps': 112264, 'loss/train': 1.8371518850326538} 11/07/2021 13:01:25 - INFO - __main__ - Step 112266: {'lr': 7.600274726849937e-05, 'samples': 21555072, 'steps': 112265, 'loss/train': 1.1783335208892822} 11/07/2021 13:01:26 - INFO - __main__ - Step 112267: {'lr': 7.599893678700954e-05, 'samples': 21555264, 'steps': 112266, 'loss/train': 0.44982314109802246} 11/07/2021 13:01:26 - INFO - __main__ - Step 112268: {'lr': 7.599512638392186e-05, 'samples': 21555456, 'steps': 112267, 'loss/train': 1.5969862937927246} 11/07/2021 13:01:26 - INFO - __main__ - Step 112269: {'lr': 7.59913160592382e-05, 'samples': 21555648, 'steps': 112268, 'loss/train': 1.3779913187026978} 11/07/2021 13:01:27 - INFO - __main__ - Step 112270: {'lr': 7.59875058129601e-05, 'samples': 21555840, 'steps': 112269, 'loss/train': 1.5750141143798828} 11/07/2021 13:01:28 - INFO - __main__ - Step 112271: {'lr': 7.598369564508934e-05, 'samples': 21556032, 'steps': 112270, 'loss/train': 1.408857822418213} 11/07/2021 13:01:28 - INFO - __main__ - Step 112272: {'lr': 7.597988555562762e-05, 'samples': 21556224, 'steps': 112271, 'loss/train': 1.083513617515564} 11/07/2021 13:01:28 - INFO - __main__ - Step 112273: {'lr': 7.597607554457669e-05, 'samples': 21556416, 'steps': 112272, 'loss/train': 1.2054040431976318} 11/07/2021 13:01:29 - INFO - __main__ - Step 112274: {'lr': 7.597226561193826e-05, 'samples': 21556608, 'steps': 112273, 'loss/train': 1.226529836654663} 11/07/2021 13:01:30 - INFO - __main__ - Step 112275: {'lr': 7.596845575771403e-05, 'samples': 21556800, 'steps': 112274, 'loss/train': 0.9178663492202759} 11/07/2021 13:01:30 - INFO - __main__ - Step 112276: {'lr': 7.596464598190575e-05, 'samples': 21556992, 'steps': 112275, 'loss/train': 1.1992205381393433} 11/07/2021 13:01:31 - INFO - __main__ - Step 112277: {'lr': 7.596083628451508e-05, 'samples': 21557184, 'steps': 112276, 'loss/train': 0.9411499500274658} 11/07/2021 13:01:31 - INFO - __main__ - Step 112278: {'lr': 7.59570266655438e-05, 'samples': 21557376, 'steps': 112277, 'loss/train': 1.2202599048614502} 11/07/2021 13:01:31 - INFO - __main__ - Step 112279: {'lr': 7.595321712499359e-05, 'samples': 21557568, 'steps': 112278, 'loss/train': 1.2490780353546143} 11/07/2021 13:01:32 - INFO - __main__ - Step 112280: {'lr': 7.594940766286618e-05, 'samples': 21557760, 'steps': 112279, 'loss/train': 1.0547058582305908} 11/07/2021 13:01:33 - INFO - __main__ - Step 112281: {'lr': 7.594559827916328e-05, 'samples': 21557952, 'steps': 112280, 'loss/train': 1.3100910186767578} 11/07/2021 13:01:33 - INFO - __main__ - Step 112282: {'lr': 7.594178897388668e-05, 'samples': 21558144, 'steps': 112281, 'loss/train': 1.5539476871490479} 11/07/2021 13:01:33 - INFO - __main__ - Step 112283: {'lr': 7.593797974703795e-05, 'samples': 21558336, 'steps': 112282, 'loss/train': 1.365785002708435} 11/07/2021 13:01:34 - INFO - __main__ - Step 112284: {'lr': 7.593417059861887e-05, 'samples': 21558528, 'steps': 112283, 'loss/train': 1.1356781721115112} 11/07/2021 13:01:34 - INFO - __main__ - Step 112285: {'lr': 7.593036152863117e-05, 'samples': 21558720, 'steps': 112284, 'loss/train': 1.3240710496902466} 11/07/2021 13:01:35 - INFO - __main__ - Step 112286: {'lr': 7.592655253707659e-05, 'samples': 21558912, 'steps': 112285, 'loss/train': 1.4353861808776855} 11/07/2021 13:01:35 - INFO - __main__ - Step 112287: {'lr': 7.592274362395679e-05, 'samples': 21559104, 'steps': 112286, 'loss/train': 1.0440839529037476} 11/07/2021 13:01:36 - INFO - __main__ - Step 112288: {'lr': 7.591893478927354e-05, 'samples': 21559296, 'steps': 112287, 'loss/train': 1.4990510940551758} 11/07/2021 13:01:36 - INFO - __main__ - Step 112289: {'lr': 7.59151260330285e-05, 'samples': 21559488, 'steps': 112288, 'loss/train': 1.2229905128479004} 11/07/2021 13:01:37 - INFO - __main__ - Step 112290: {'lr': 7.591131735522344e-05, 'samples': 21559680, 'steps': 112289, 'loss/train': 1.30697500705719} 11/07/2021 13:01:38 - INFO - __main__ - Step 112291: {'lr': 7.590750875586002e-05, 'samples': 21559872, 'steps': 112290, 'loss/train': 0.9283201694488525} 11/07/2021 13:01:38 - INFO - __main__ - Step 112292: {'lr': 7.590370023494003e-05, 'samples': 21560064, 'steps': 112291, 'loss/train': 1.7048964500427246} 11/07/2021 13:01:38 - INFO - __main__ - Step 112293: {'lr': 7.589989179246515e-05, 'samples': 21560256, 'steps': 112292, 'loss/train': 1.1745469570159912} 11/07/2021 13:01:39 - INFO - __main__ - Step 112294: {'lr': 7.589608342843707e-05, 'samples': 21560448, 'steps': 112293, 'loss/train': 1.4864490032196045} 11/07/2021 13:01:39 - INFO - __main__ - Step 112295: {'lr': 7.589227514285751e-05, 'samples': 21560640, 'steps': 112294, 'loss/train': 1.4168834686279297} 11/07/2021 13:01:40 - INFO - __main__ - Step 112296: {'lr': 7.588846693572831e-05, 'samples': 21560832, 'steps': 112295, 'loss/train': 1.688236951828003} 11/07/2021 13:01:40 - INFO - __main__ - Step 112297: {'lr': 7.588465880705101e-05, 'samples': 21561024, 'steps': 112296, 'loss/train': 1.8114441633224487} 11/07/2021 13:01:41 - INFO - __main__ - Step 112298: {'lr': 7.588085075682738e-05, 'samples': 21561216, 'steps': 112297, 'loss/train': 1.191612720489502} 11/07/2021 13:01:41 - INFO - __main__ - Step 112299: {'lr': 7.587704278505917e-05, 'samples': 21561408, 'steps': 112298, 'loss/train': 0.6627227067947388} 11/07/2021 13:01:42 - INFO - __main__ - Step 112300: {'lr': 7.587323489174804e-05, 'samples': 21561600, 'steps': 112299, 'loss/train': 1.5409809350967407} 11/07/2021 13:01:43 - INFO - __main__ - Step 112301: {'lr': 7.586942707689578e-05, 'samples': 21561792, 'steps': 112300, 'loss/train': 1.3614699840545654} 11/07/2021 13:01:43 - INFO - __main__ - Step 112302: {'lr': 7.586561934050407e-05, 'samples': 21561984, 'steps': 112301, 'loss/train': 0.5992012619972229} 11/07/2021 13:01:43 - INFO - __main__ - Step 112303: {'lr': 7.586181168257461e-05, 'samples': 21562176, 'steps': 112302, 'loss/train': 1.61826491355896} 11/07/2021 13:01:44 - INFO - __main__ - Step 112304: {'lr': 7.585800410310912e-05, 'samples': 21562368, 'steps': 112303, 'loss/train': 1.3236480951309204} 11/07/2021 13:01:44 - INFO - __main__ - Step 112305: {'lr': 7.585419660210934e-05, 'samples': 21562560, 'steps': 112304, 'loss/train': 1.3251171112060547} 11/07/2021 13:01:45 - INFO - __main__ - Step 112306: {'lr': 7.585038917957695e-05, 'samples': 21562752, 'steps': 112305, 'loss/train': 1.2569456100463867} 11/07/2021 13:01:45 - INFO - __main__ - Step 112307: {'lr': 7.584658183551371e-05, 'samples': 21562944, 'steps': 112306, 'loss/train': 1.4157028198242188} 11/07/2021 13:01:46 - INFO - __main__ - Step 112308: {'lr': 7.58427745699214e-05, 'samples': 21563136, 'steps': 112307, 'loss/train': 1.075366735458374} 11/07/2021 13:01:46 - INFO - __main__ - Step 112309: {'lr': 7.583896738280155e-05, 'samples': 21563328, 'steps': 112308, 'loss/train': 1.4459996223449707} 11/07/2021 13:01:46 - INFO - __main__ - Step 112310: {'lr': 7.583516027415599e-05, 'samples': 21563520, 'steps': 112309, 'loss/train': 1.3507765531539917} 11/07/2021 13:01:48 - INFO - __main__ - Step 112311: {'lr': 7.58313532439864e-05, 'samples': 21563712, 'steps': 112310, 'loss/train': 1.4200810194015503} 11/07/2021 13:01:48 - INFO - __main__ - Step 112312: {'lr': 7.582754629229454e-05, 'samples': 21563904, 'steps': 112311, 'loss/train': 0.8371207118034363} 11/07/2021 13:01:48 - INFO - __main__ - Step 112313: {'lr': 7.582373941908208e-05, 'samples': 21564096, 'steps': 112312, 'loss/train': 1.3859508037567139} 11/07/2021 13:01:49 - INFO - __main__ - Step 112314: {'lr': 7.581993262435078e-05, 'samples': 21564288, 'steps': 112313, 'loss/train': 1.5363421440124512} 11/07/2021 13:01:49 - INFO - __main__ - Step 112315: {'lr': 7.58161259081023e-05, 'samples': 21564480, 'steps': 112314, 'loss/train': 1.056653380393982} 11/07/2021 13:01:49 - INFO - __main__ - Step 112316: {'lr': 7.581231927033838e-05, 'samples': 21564672, 'steps': 112315, 'loss/train': 1.4178158044815063} 11/07/2021 13:01:50 - INFO - __main__ - Step 112317: {'lr': 7.580851271106076e-05, 'samples': 21564864, 'steps': 112316, 'loss/train': 0.9468859434127808} 11/07/2021 13:01:51 - INFO - __main__ - Step 112318: {'lr': 7.580470623027113e-05, 'samples': 21565056, 'steps': 112317, 'loss/train': 1.007071614265442} 11/07/2021 13:01:51 - INFO - __main__ - Step 112319: {'lr': 7.58008998279713e-05, 'samples': 21565248, 'steps': 112318, 'loss/train': 1.5589076280593872} 11/07/2021 13:01:52 - INFO - __main__ - Step 112320: {'lr': 7.579709350416283e-05, 'samples': 21565440, 'steps': 112319, 'loss/train': 0.5907018184661865} 11/07/2021 13:01:52 - INFO - __main__ - Step 112321: {'lr': 7.579328725884748e-05, 'samples': 21565632, 'steps': 112320, 'loss/train': 1.415836215019226} 11/07/2021 13:01:52 - INFO - __main__ - Step 112322: {'lr': 7.578948109202699e-05, 'samples': 21565824, 'steps': 112321, 'loss/train': 0.9065616726875305} 11/07/2021 13:01:54 - INFO - __main__ - Step 112323: {'lr': 7.578567500370306e-05, 'samples': 21566016, 'steps': 112322, 'loss/train': 1.6657742261886597} 11/07/2021 13:01:54 - INFO - __main__ - Step 112324: {'lr': 7.578186899387742e-05, 'samples': 21566208, 'steps': 112323, 'loss/train': 1.6644940376281738} 11/07/2021 13:01:54 - INFO - __main__ - Step 112325: {'lr': 7.577806306255181e-05, 'samples': 21566400, 'steps': 112324, 'loss/train': 0.16230152547359467} 11/07/2021 13:01:55 - INFO - __main__ - Step 112326: {'lr': 7.577425720972789e-05, 'samples': 21566592, 'steps': 112325, 'loss/train': 1.4455958604812622} 11/07/2021 13:01:55 - INFO - __main__ - Step 112327: {'lr': 7.577045143540742e-05, 'samples': 21566784, 'steps': 112326, 'loss/train': 1.7261412143707275} 11/07/2021 13:01:55 - INFO - __main__ - Step 112328: {'lr': 7.576664573959208e-05, 'samples': 21566976, 'steps': 112327, 'loss/train': 0.8725466728210449} 11/07/2021 13:01:56 - INFO - __main__ - Step 112329: {'lr': 7.57628401222836e-05, 'samples': 21567168, 'steps': 112328, 'loss/train': 1.391726016998291} 11/07/2021 13:01:57 - INFO - __main__ - Step 112330: {'lr': 7.57590345834838e-05, 'samples': 21567360, 'steps': 112329, 'loss/train': 1.5331939458847046} 11/07/2021 13:01:57 - INFO - __main__ - Step 112331: {'lr': 7.575522912319418e-05, 'samples': 21567552, 'steps': 112330, 'loss/train': 1.3907572031021118} 11/07/2021 13:01:57 - INFO - __main__ - Step 112332: {'lr': 7.575142374141658e-05, 'samples': 21567744, 'steps': 112331, 'loss/train': 0.9057857394218445} 11/07/2021 13:01:58 - INFO - __main__ - Step 112333: {'lr': 7.57476184381527e-05, 'samples': 21567936, 'steps': 112332, 'loss/train': 1.3024872541427612} 11/07/2021 13:01:59 - INFO - __main__ - Step 112334: {'lr': 7.574381321340423e-05, 'samples': 21568128, 'steps': 112333, 'loss/train': 1.0959618091583252} 11/07/2021 13:01:59 - INFO - __main__ - Step 112335: {'lr': 7.574000806717294e-05, 'samples': 21568320, 'steps': 112334, 'loss/train': 1.4695781469345093} 11/07/2021 13:02:00 - INFO - __main__ - Step 112336: {'lr': 7.573620299946048e-05, 'samples': 21568512, 'steps': 112335, 'loss/train': 1.4639008045196533} 11/07/2021 13:02:00 - INFO - __main__ - Step 112337: {'lr': 7.573239801026862e-05, 'samples': 21568704, 'steps': 112336, 'loss/train': 1.665595293045044} 11/07/2021 13:02:01 - INFO - __main__ - Step 112338: {'lr': 7.572859309959906e-05, 'samples': 21568896, 'steps': 112337, 'loss/train': 1.2052745819091797} 11/07/2021 13:02:01 - INFO - __main__ - Step 112339: {'lr': 7.572478826745349e-05, 'samples': 21569088, 'steps': 112338, 'loss/train': 0.629953145980835} 11/07/2021 13:02:02 - INFO - __main__ - Step 112340: {'lr': 7.572098351383366e-05, 'samples': 21569280, 'steps': 112339, 'loss/train': 1.3785958290100098} 11/07/2021 13:02:02 - INFO - __main__ - Step 112341: {'lr': 7.571717883874135e-05, 'samples': 21569472, 'steps': 112340, 'loss/train': 1.4204626083374023} 11/07/2021 13:02:03 - INFO - __main__ - Step 112342: {'lr': 7.571337424217808e-05, 'samples': 21569664, 'steps': 112341, 'loss/train': 1.5268869400024414} 11/07/2021 13:02:03 - INFO - __main__ - Step 112343: {'lr': 7.57095697241457e-05, 'samples': 21569856, 'steps': 112342, 'loss/train': 1.3206734657287598} 11/07/2021 13:02:04 - INFO - __main__ - Step 112344: {'lr': 7.570576528464587e-05, 'samples': 21570048, 'steps': 112343, 'loss/train': 0.7402544021606445} 11/07/2021 13:02:04 - INFO - __main__ - Step 112345: {'lr': 7.570196092368037e-05, 'samples': 21570240, 'steps': 112344, 'loss/train': 0.994042694568634} 11/07/2021 13:02:05 - INFO - __main__ - Step 112346: {'lr': 7.569815664125085e-05, 'samples': 21570432, 'steps': 112345, 'loss/train': 0.6551625728607178} 11/07/2021 13:02:05 - INFO - __main__ - Step 112347: {'lr': 7.569435243735907e-05, 'samples': 21570624, 'steps': 112346, 'loss/train': 1.3372833728790283} 11/07/2021 13:02:05 - INFO - __main__ - Step 112348: {'lr': 7.56905483120067e-05, 'samples': 21570816, 'steps': 112347, 'loss/train': 1.0929574966430664} 11/07/2021 13:02:06 - INFO - __main__ - Step 112349: {'lr': 7.56867442651955e-05, 'samples': 21571008, 'steps': 112348, 'loss/train': 1.0766303539276123} 11/07/2021 13:02:07 - INFO - __main__ - Step 112350: {'lr': 7.568294029692715e-05, 'samples': 21571200, 'steps': 112349, 'loss/train': 1.4437053203582764} 11/07/2021 13:02:07 - INFO - __main__ - Step 112351: {'lr': 7.567913640720348e-05, 'samples': 21571392, 'steps': 112350, 'loss/train': 1.738366961479187} 11/07/2021 13:02:07 - INFO - __main__ - Step 112352: {'lr': 7.5675332596026e-05, 'samples': 21571584, 'steps': 112351, 'loss/train': 1.6550862789154053} 11/07/2021 13:02:08 - INFO - __main__ - Step 112353: {'lr': 7.56715288633965e-05, 'samples': 21571776, 'steps': 112352, 'loss/train': 1.2475042343139648} 11/07/2021 13:02:09 - INFO - __main__ - Step 112354: {'lr': 7.566772520931679e-05, 'samples': 21571968, 'steps': 112353, 'loss/train': 1.5996184349060059} 11/07/2021 13:02:09 - INFO - __main__ - Step 112355: {'lr': 7.566392163378846e-05, 'samples': 21572160, 'steps': 112354, 'loss/train': 1.5995761156082153} 11/07/2021 13:02:10 - INFO - __main__ - Step 112356: {'lr': 7.566011813681328e-05, 'samples': 21572352, 'steps': 112355, 'loss/train': 0.756195068359375} 11/07/2021 13:02:10 - INFO - __main__ - Step 112357: {'lr': 7.565631471839296e-05, 'samples': 21572544, 'steps': 112356, 'loss/train': 1.439205527305603} 11/07/2021 13:02:10 - INFO - __main__ - Step 112358: {'lr': 7.565251137852924e-05, 'samples': 21572736, 'steps': 112357, 'loss/train': 1.6773654222488403} 11/07/2021 13:02:11 - INFO - __main__ - Step 112359: {'lr': 7.564870811722377e-05, 'samples': 21572928, 'steps': 112358, 'loss/train': 0.9193297028541565} 11/07/2021 13:02:12 - INFO - __main__ - Step 112360: {'lr': 7.564490493447834e-05, 'samples': 21573120, 'steps': 112359, 'loss/train': 1.2863471508026123} 11/07/2021 13:02:12 - INFO - __main__ - Step 112361: {'lr': 7.56411018302946e-05, 'samples': 21573312, 'steps': 112360, 'loss/train': 0.6999697685241699} 11/07/2021 13:02:12 - INFO - __main__ - Step 112362: {'lr': 7.563729880467429e-05, 'samples': 21573504, 'steps': 112361, 'loss/train': 1.5666557550430298} 11/07/2021 13:02:13 - INFO - __main__ - Step 112363: {'lr': 7.563349585761922e-05, 'samples': 21573696, 'steps': 112362, 'loss/train': 1.7336517572402954} 11/07/2021 13:02:14 - INFO - __main__ - Step 112364: {'lr': 7.562969298913091e-05, 'samples': 21573888, 'steps': 112363, 'loss/train': 1.3049689531326294} 11/07/2021 13:02:14 - INFO - __main__ - Step 112365: {'lr': 7.562589019921117e-05, 'samples': 21574080, 'steps': 112364, 'loss/train': 1.2214853763580322} 11/07/2021 13:02:14 - INFO - __main__ - Step 112366: {'lr': 7.562208748786172e-05, 'samples': 21574272, 'steps': 112365, 'loss/train': 1.3663307428359985} 11/07/2021 13:02:15 - INFO - __main__ - Step 112367: {'lr': 7.561828485508426e-05, 'samples': 21574464, 'steps': 112366, 'loss/train': 1.1382416486740112} 11/07/2021 13:02:15 - INFO - __main__ - Step 112368: {'lr': 7.561448230088053e-05, 'samples': 21574656, 'steps': 112367, 'loss/train': 0.9238501787185669} 11/07/2021 13:02:16 - INFO - __main__ - Step 112369: {'lr': 7.561067982525222e-05, 'samples': 21574848, 'steps': 112368, 'loss/train': 0.46951061487197876} 11/07/2021 13:02:16 - INFO - __main__ - Step 112370: {'lr': 7.560687742820103e-05, 'samples': 21575040, 'steps': 112369, 'loss/train': 1.0010457038879395} 11/07/2021 13:02:17 - INFO - __main__ - Step 112371: {'lr': 7.560307510972869e-05, 'samples': 21575232, 'steps': 112370, 'loss/train': 1.151564359664917} 11/07/2021 13:02:17 - INFO - __main__ - Step 112372: {'lr': 7.559927286983692e-05, 'samples': 21575424, 'steps': 112371, 'loss/train': 1.2428334951400757} 11/07/2021 13:02:17 - INFO - __main__ - Step 112373: {'lr': 7.559547070852744e-05, 'samples': 21575616, 'steps': 112372, 'loss/train': 1.75639009475708} 11/07/2021 13:02:18 - INFO - __main__ - Step 112374: {'lr': 7.559166862580192e-05, 'samples': 21575808, 'steps': 112373, 'loss/train': 1.3311941623687744} 11/07/2021 13:02:19 - INFO - __main__ - Step 112375: {'lr': 7.558786662166212e-05, 'samples': 21576000, 'steps': 112374, 'loss/train': 1.3855875730514526} 11/07/2021 13:02:19 - INFO - __main__ - Step 112376: {'lr': 7.558406469610981e-05, 'samples': 21576192, 'steps': 112375, 'loss/train': 1.3374974727630615} 11/07/2021 13:02:20 - INFO - __main__ - Step 112377: {'lr': 7.558026284914655e-05, 'samples': 21576384, 'steps': 112376, 'loss/train': 1.478851318359375} 11/07/2021 13:02:20 - INFO - __main__ - Step 112378: {'lr': 7.557646108077412e-05, 'samples': 21576576, 'steps': 112377, 'loss/train': 1.5699886083602905} 11/07/2021 13:02:21 - INFO - __main__ - Step 112379: {'lr': 7.557265939099428e-05, 'samples': 21576768, 'steps': 112378, 'loss/train': 3.16621732711792} 11/07/2021 13:02:21 - INFO - __main__ - Step 112380: {'lr': 7.556885777980868e-05, 'samples': 21576960, 'steps': 112379, 'loss/train': 1.4837298393249512} 11/07/2021 13:02:22 - INFO - __main__ - Step 112381: {'lr': 7.556505624721907e-05, 'samples': 21577152, 'steps': 112380, 'loss/train': 1.5230765342712402} 11/07/2021 13:02:22 - INFO - __main__ - Step 112382: {'lr': 7.556125479322714e-05, 'samples': 21577344, 'steps': 112381, 'loss/train': 1.1245993375778198} 11/07/2021 13:02:23 - INFO - __main__ - Step 112383: {'lr': 7.555745341783463e-05, 'samples': 21577536, 'steps': 112382, 'loss/train': 1.6293907165527344} 11/07/2021 13:02:23 - INFO - __main__ - Step 112384: {'lr': 7.555365212104325e-05, 'samples': 21577728, 'steps': 112383, 'loss/train': 0.8526588082313538} 11/07/2021 13:02:24 - INFO - __main__ - Step 112385: {'lr': 7.554985090285468e-05, 'samples': 21577920, 'steps': 112384, 'loss/train': 1.5094671249389648} 11/07/2021 13:02:24 - INFO - __main__ - Step 112386: {'lr': 7.554604976327068e-05, 'samples': 21578112, 'steps': 112385, 'loss/train': 1.4245675802230835} 11/07/2021 13:02:25 - INFO - __main__ - Step 112387: {'lr': 7.554224870229292e-05, 'samples': 21578304, 'steps': 112386, 'loss/train': 1.4947872161865234} 11/07/2021 13:02:25 - INFO - __main__ - Step 112388: {'lr': 7.553844771992313e-05, 'samples': 21578496, 'steps': 112387, 'loss/train': 1.213882327079773} 11/07/2021 13:02:25 - INFO - __main__ - Step 112389: {'lr': 7.553464681616303e-05, 'samples': 21578688, 'steps': 112388, 'loss/train': 1.6606659889221191} 11/07/2021 13:02:26 - INFO - __main__ - Step 112390: {'lr': 7.553084599101443e-05, 'samples': 21578880, 'steps': 112389, 'loss/train': 1.2037297487258911} 11/07/2021 13:02:27 - INFO - __main__ - Step 112391: {'lr': 7.552704524447881e-05, 'samples': 21579072, 'steps': 112390, 'loss/train': 1.2029893398284912} 11/07/2021 13:02:27 - INFO - __main__ - Step 112392: {'lr': 7.552324457655804e-05, 'samples': 21579264, 'steps': 112391, 'loss/train': 1.5791206359863281} 11/07/2021 13:02:27 - INFO - __main__ - Step 112393: {'lr': 7.55194439872538e-05, 'samples': 21579456, 'steps': 112392, 'loss/train': 0.2147262543439865} 11/07/2021 13:02:28 - INFO - __main__ - Step 112394: {'lr': 7.55156434765678e-05, 'samples': 21579648, 'steps': 112393, 'loss/train': 0.7609724998474121} 11/07/2021 13:02:29 - INFO - __main__ - Step 112395: {'lr': 7.551184304450176e-05, 'samples': 21579840, 'steps': 112394, 'loss/train': 1.0845760107040405} 11/07/2021 13:02:29 - INFO - __main__ - Step 112396: {'lr': 7.55080426910574e-05, 'samples': 21580032, 'steps': 112395, 'loss/train': 1.2637132406234741} 11/07/2021 13:02:29 - INFO - __main__ - Step 112397: {'lr': 7.55042424162364e-05, 'samples': 21580224, 'steps': 112396, 'loss/train': 1.6334898471832275} 11/07/2021 13:02:30 - INFO - __main__ - Step 112398: {'lr': 7.550044222004051e-05, 'samples': 21580416, 'steps': 112397, 'loss/train': 0.9123774766921997} 11/07/2021 13:02:30 - INFO - __main__ - Step 112399: {'lr': 7.549664210247145e-05, 'samples': 21580608, 'steps': 112398, 'loss/train': 0.5197703838348389} 11/07/2021 13:02:31 - INFO - __main__ - Step 112400: {'lr': 7.549284206353088e-05, 'samples': 21580800, 'steps': 112399, 'loss/train': 0.8213845491409302} 11/07/2021 13:02:32 - INFO - __main__ - Step 112401: {'lr': 7.548904210322058e-05, 'samples': 21580992, 'steps': 112400, 'loss/train': 0.05157356336712837} 11/07/2021 13:02:32 - INFO - __main__ - Step 112402: {'lr': 7.548524222154218e-05, 'samples': 21581184, 'steps': 112401, 'loss/train': 5.70773458480835} 11/07/2021 13:02:32 - INFO - __main__ - Step 112403: {'lr': 7.548144241849755e-05, 'samples': 21581376, 'steps': 112402, 'loss/train': 1.3677023649215698} 11/07/2021 13:02:33 - INFO - __main__ - Step 112404: {'lr': 7.547764269408818e-05, 'samples': 21581568, 'steps': 112403, 'loss/train': 0.2766214907169342} 11/07/2021 13:02:33 - INFO - __main__ - Step 112405: {'lr': 7.547384304831592e-05, 'samples': 21581760, 'steps': 112404, 'loss/train': 1.2171155214309692} 11/07/2021 13:02:34 - INFO - __main__ - Step 112406: {'lr': 7.547004348118245e-05, 'samples': 21581952, 'steps': 112405, 'loss/train': 1.4652748107910156} 11/07/2021 13:02:35 - INFO - __main__ - Step 112407: {'lr': 7.546624399268945e-05, 'samples': 21582144, 'steps': 112406, 'loss/train': 1.4069023132324219} 11/07/2021 13:02:35 - INFO - __main__ - Step 112408: {'lr': 7.546244458283869e-05, 'samples': 21582336, 'steps': 112407, 'loss/train': 1.3420745134353638} 11/07/2021 13:02:35 - INFO - __main__ - Step 112409: {'lr': 7.545864525163188e-05, 'samples': 21582528, 'steps': 112408, 'loss/train': 0.8262253403663635} 11/07/2021 13:02:36 - INFO - __main__ - Step 112410: {'lr': 7.545484599907068e-05, 'samples': 21582720, 'steps': 112409, 'loss/train': 1.4645713567733765} 11/07/2021 13:02:37 - INFO - __main__ - Step 112411: {'lr': 7.545104682515685e-05, 'samples': 21582912, 'steps': 112410, 'loss/train': 1.2686206102371216} 11/07/2021 13:02:37 - INFO - __main__ - Step 112412: {'lr': 7.544724772989209e-05, 'samples': 21583104, 'steps': 112411, 'loss/train': 1.481626033782959} 11/07/2021 13:02:37 - INFO - __main__ - Step 112413: {'lr': 7.544344871327807e-05, 'samples': 21583296, 'steps': 112412, 'loss/train': 1.4539421796798706} 11/07/2021 13:02:38 - INFO - __main__ - Step 112414: {'lr': 7.543964977531658e-05, 'samples': 21583488, 'steps': 112413, 'loss/train': 1.0589091777801514} 11/07/2021 13:02:38 - INFO - __main__ - Step 112415: {'lr': 7.543585091600927e-05, 'samples': 21583680, 'steps': 112414, 'loss/train': 1.3982540369033813} 11/07/2021 13:02:39 - INFO - __main__ - Step 112416: {'lr': 7.543205213535786e-05, 'samples': 21583872, 'steps': 112415, 'loss/train': 1.550460934638977} 11/07/2021 13:02:40 - INFO - __main__ - Step 112417: {'lr': 7.542825343336418e-05, 'samples': 21584064, 'steps': 112416, 'loss/train': 1.39277982711792} 11/07/2021 13:02:40 - INFO - __main__ - Step 112418: {'lr': 7.542445481002974e-05, 'samples': 21584256, 'steps': 112417, 'loss/train': 1.3474066257476807} 11/07/2021 13:02:40 - INFO - __main__ - Step 112419: {'lr': 7.542065626535638e-05, 'samples': 21584448, 'steps': 112418, 'loss/train': 1.311171293258667} 11/07/2021 13:02:41 - INFO - __main__ - Step 112420: {'lr': 7.541685779934574e-05, 'samples': 21584640, 'steps': 112419, 'loss/train': 1.283276915550232} 11/07/2021 13:02:42 - INFO - __main__ - Step 112421: {'lr': 7.541305941199958e-05, 'samples': 21584832, 'steps': 112420, 'loss/train': 1.1507853269577026} 11/07/2021 13:02:42 - INFO - __main__ - Step 112422: {'lr': 7.540926110331961e-05, 'samples': 21585024, 'steps': 112421, 'loss/train': 1.509724736213684} 11/07/2021 13:02:42 - INFO - __main__ - Step 112423: {'lr': 7.540546287330751e-05, 'samples': 21585216, 'steps': 112422, 'loss/train': 1.0798676013946533} 11/07/2021 13:02:43 - INFO - __main__ - Step 112424: {'lr': 7.540166472196503e-05, 'samples': 21585408, 'steps': 112423, 'loss/train': 1.5060315132141113} 11/07/2021 13:02:43 - INFO - __main__ - Step 112425: {'lr': 7.539786664929388e-05, 'samples': 21585600, 'steps': 112424, 'loss/train': 1.4495652914047241} 11/07/2021 13:02:44 - INFO - __main__ - Step 112426: {'lr': 7.539406865529574e-05, 'samples': 21585792, 'steps': 112425, 'loss/train': 1.302678108215332} 11/07/2021 13:02:44 - INFO - __main__ - Step 112427: {'lr': 7.539027073997235e-05, 'samples': 21585984, 'steps': 112426, 'loss/train': 1.5007898807525635} 11/07/2021 13:02:45 - INFO - __main__ - Step 112428: {'lr': 7.538647290332537e-05, 'samples': 21586176, 'steps': 112427, 'loss/train': 1.3473596572875977} 11/07/2021 13:02:45 - INFO - __main__ - Step 112429: {'lr': 7.53826751453566e-05, 'samples': 21586368, 'steps': 112428, 'loss/train': 1.4285075664520264} 11/07/2021 13:02:46 - INFO - __main__ - Step 112430: {'lr': 7.537887746606775e-05, 'samples': 21586560, 'steps': 112429, 'loss/train': 1.1770113706588745} 11/07/2021 13:02:46 - INFO - __main__ - Step 112431: {'lr': 7.537507986546041e-05, 'samples': 21586752, 'steps': 112430, 'loss/train': 1.4552415609359741} 11/07/2021 13:02:47 - INFO - __main__ - Step 112432: {'lr': 7.537128234353638e-05, 'samples': 21586944, 'steps': 112431, 'loss/train': 1.2411723136901855} 11/07/2021 13:02:47 - INFO - __main__ - Step 112433: {'lr': 7.536748490029735e-05, 'samples': 21587136, 'steps': 112432, 'loss/train': 1.7508677244186401} 11/07/2021 13:02:48 - INFO - __main__ - Step 112434: {'lr': 7.536368753574501e-05, 'samples': 21587328, 'steps': 112433, 'loss/train': 1.18521249294281} 11/07/2021 13:02:48 - INFO - __main__ - Step 112435: {'lr': 7.535989024988113e-05, 'samples': 21587520, 'steps': 112434, 'loss/train': 0.5362591743469238} 11/07/2021 13:02:49 - INFO - __main__ - Step 112436: {'lr': 7.535609304270738e-05, 'samples': 21587712, 'steps': 112435, 'loss/train': 1.3576176166534424} 11/07/2021 13:02:49 - INFO - __main__ - Step 112437: {'lr': 7.535229591422549e-05, 'samples': 21587904, 'steps': 112436, 'loss/train': 1.3624939918518066} 11/07/2021 13:02:51 - INFO - __main__ - Step 112438: {'lr': 7.534849886443714e-05, 'samples': 21588096, 'steps': 112437, 'loss/train': 1.1364420652389526} 11/07/2021 13:02:51 - INFO - __main__ - Step 112439: {'lr': 7.534470189334408e-05, 'samples': 21588288, 'steps': 112438, 'loss/train': 0.5942984819412231} 11/07/2021 13:02:52 - INFO - __main__ - Step 112440: {'lr': 7.534090500094798e-05, 'samples': 21588480, 'steps': 112439, 'loss/train': 0.9299629330635071} 11/07/2021 13:02:52 - INFO - __main__ - Step 112441: {'lr': 7.53371081872506e-05, 'samples': 21588672, 'steps': 112440, 'loss/train': 1.7763656377792358} 11/07/2021 13:02:53 - INFO - __main__ - Step 112442: {'lr': 7.533331145225361e-05, 'samples': 21588864, 'steps': 112441, 'loss/train': 1.7583551406860352} 11/07/2021 13:02:53 - INFO - __main__ - Step 112443: {'lr': 7.532951479595873e-05, 'samples': 21589056, 'steps': 112442, 'loss/train': 1.7471373081207275} 11/07/2021 13:02:53 - INFO - __main__ - Step 112444: {'lr': 7.532571821836776e-05, 'samples': 21589248, 'steps': 112443, 'loss/train': 1.5316846370697021} 11/07/2021 13:02:54 - INFO - __main__ - Step 112445: {'lr': 7.532192171948224e-05, 'samples': 21589440, 'steps': 112444, 'loss/train': 1.652511715888977} 11/07/2021 13:02:55 - INFO - __main__ - Step 112446: {'lr': 7.531812529930399e-05, 'samples': 21589632, 'steps': 112445, 'loss/train': 1.0198229551315308} 11/07/2021 13:02:55 - INFO - __main__ - Step 112447: {'lr': 7.531432895783466e-05, 'samples': 21589824, 'steps': 112446, 'loss/train': 1.3539848327636719} 11/07/2021 13:02:55 - INFO - __main__ - Step 112448: {'lr': 7.5310532695076e-05, 'samples': 21590016, 'steps': 112447, 'loss/train': 1.2341794967651367} 11/07/2021 13:02:56 - INFO - __main__ - Step 112449: {'lr': 7.530673651102976e-05, 'samples': 21590208, 'steps': 112448, 'loss/train': 1.1418018341064453} 11/07/2021 13:02:57 - INFO - __main__ - Step 112450: {'lr': 7.530294040569757e-05, 'samples': 21590400, 'steps': 112449, 'loss/train': 1.2362844944000244} 11/07/2021 13:02:57 - INFO - __main__ - Step 112451: {'lr': 7.52991443790812e-05, 'samples': 21590592, 'steps': 112450, 'loss/train': 0.7854363322257996} 11/07/2021 13:02:57 - INFO - __main__ - Step 112452: {'lr': 7.529534843118232e-05, 'samples': 21590784, 'steps': 112451, 'loss/train': 1.3092604875564575} 11/07/2021 13:02:58 - INFO - __main__ - Step 112453: {'lr': 7.529155256200269e-05, 'samples': 21590976, 'steps': 112452, 'loss/train': 1.2230160236358643} 11/07/2021 13:02:58 - INFO - __main__ - Step 112454: {'lr': 7.528775677154398e-05, 'samples': 21591168, 'steps': 112453, 'loss/train': 1.339980125427246} 11/07/2021 13:02:59 - INFO - __main__ - Step 112455: {'lr': 7.52839610598079e-05, 'samples': 21591360, 'steps': 112454, 'loss/train': 1.2608880996704102} 11/07/2021 13:03:00 - INFO - __main__ - Step 112456: {'lr': 7.528016542679616e-05, 'samples': 21591552, 'steps': 112455, 'loss/train': 1.2581675052642822} 11/07/2021 13:03:00 - INFO - __main__ - Step 112457: {'lr': 7.527636987251058e-05, 'samples': 21591744, 'steps': 112456, 'loss/train': 1.524531602859497} 11/07/2021 13:03:00 - INFO - __main__ - Step 112458: {'lr': 7.52725743969527e-05, 'samples': 21591936, 'steps': 112457, 'loss/train': 1.6018452644348145} 11/07/2021 13:03:01 - INFO - __main__ - Step 112459: {'lr': 7.526877900012429e-05, 'samples': 21592128, 'steps': 112458, 'loss/train': 1.2013427019119263} 11/07/2021 13:03:02 - INFO - __main__ - Step 112460: {'lr': 7.526498368202709e-05, 'samples': 21592320, 'steps': 112459, 'loss/train': 1.295382022857666} 11/07/2021 13:03:02 - INFO - __main__ - Step 112461: {'lr': 7.526118844266274e-05, 'samples': 21592512, 'steps': 112460, 'loss/train': 1.2200692892074585} 11/07/2021 13:03:02 - INFO - __main__ - Step 112462: {'lr': 7.525739328203304e-05, 'samples': 21592704, 'steps': 112461, 'loss/train': 0.8682898879051208} 11/07/2021 13:03:03 - INFO - __main__ - Step 112463: {'lr': 7.525359820013966e-05, 'samples': 21592896, 'steps': 112462, 'loss/train': 1.0666122436523438} 11/07/2021 13:03:03 - INFO - __main__ - Step 112464: {'lr': 7.524980319698433e-05, 'samples': 21593088, 'steps': 112463, 'loss/train': 1.6608175039291382} 11/07/2021 13:03:03 - INFO - __main__ - Step 112465: {'lr': 7.524600827256872e-05, 'samples': 21593280, 'steps': 112464, 'loss/train': 1.3277318477630615} 11/07/2021 13:03:05 - INFO - __main__ - Step 112466: {'lr': 7.524221342689455e-05, 'samples': 21593472, 'steps': 112465, 'loss/train': 1.2159677743911743} 11/07/2021 13:03:05 - INFO - __main__ - Step 112467: {'lr': 7.523841865996356e-05, 'samples': 21593664, 'steps': 112466, 'loss/train': 1.5634113550186157} 11/07/2021 13:03:05 - INFO - __main__ - Step 112468: {'lr': 7.523462397177744e-05, 'samples': 21593856, 'steps': 112467, 'loss/train': 1.9804884195327759} 11/07/2021 13:03:06 - INFO - __main__ - Step 112469: {'lr': 7.52308293623379e-05, 'samples': 21594048, 'steps': 112468, 'loss/train': 1.1615703105926514} 11/07/2021 13:03:06 - INFO - __main__ - Step 112470: {'lr': 7.522703483164673e-05, 'samples': 21594240, 'steps': 112469, 'loss/train': 1.4041029214859009} 11/07/2021 13:03:07 - INFO - __main__ - Step 112471: {'lr': 7.522324037970549e-05, 'samples': 21594432, 'steps': 112470, 'loss/train': 1.3776859045028687} 11/07/2021 13:03:08 - INFO - __main__ - Step 112472: {'lr': 7.521944600651595e-05, 'samples': 21594624, 'steps': 112471, 'loss/train': 0.6528915166854858} 11/07/2021 13:03:08 - INFO - __main__ - Step 112473: {'lr': 7.521565171207984e-05, 'samples': 21594816, 'steps': 112472, 'loss/train': 1.4290837049484253} 11/07/2021 13:03:08 - INFO - __main__ - Step 112474: {'lr': 7.521185749639886e-05, 'samples': 21595008, 'steps': 112473, 'loss/train': 0.2537479102611542} 11/07/2021 13:03:09 - INFO - __main__ - Step 112475: {'lr': 7.520806335947469e-05, 'samples': 21595200, 'steps': 112474, 'loss/train': 1.3928656578063965} 11/07/2021 13:03:09 - INFO - __main__ - Step 112476: {'lr': 7.520426930130911e-05, 'samples': 21595392, 'steps': 112475, 'loss/train': 1.4616749286651611} 11/07/2021 13:03:10 - INFO - __main__ - Step 112477: {'lr': 7.520047532190377e-05, 'samples': 21595584, 'steps': 112476, 'loss/train': 1.3680907487869263} 11/07/2021 13:03:10 - INFO - __main__ - Step 112478: {'lr': 7.51966814212604e-05, 'samples': 21595776, 'steps': 112477, 'loss/train': 1.6362210512161255} 11/07/2021 13:03:11 - INFO - __main__ - Step 112479: {'lr': 7.51928875993807e-05, 'samples': 21595968, 'steps': 112478, 'loss/train': 1.2496050596237183} 11/07/2021 13:03:11 - INFO - __main__ - Step 112480: {'lr': 7.51890938562664e-05, 'samples': 21596160, 'steps': 112479, 'loss/train': 1.3233848810195923} 11/07/2021 13:03:11 - INFO - __main__ - Step 112481: {'lr': 7.518530019191922e-05, 'samples': 21596352, 'steps': 112480, 'loss/train': 1.2442361116409302} 11/07/2021 13:03:12 - INFO - __main__ - Step 112482: {'lr': 7.518150660634079e-05, 'samples': 21596544, 'steps': 112481, 'loss/train': 1.410184621810913} 11/07/2021 13:03:13 - INFO - __main__ - Step 112483: {'lr': 7.517771309953292e-05, 'samples': 21596736, 'steps': 112482, 'loss/train': 1.3797616958618164} 11/07/2021 13:03:13 - INFO - __main__ - Step 112484: {'lr': 7.517391967149734e-05, 'samples': 21596928, 'steps': 112483, 'loss/train': 1.3477922677993774} 11/07/2021 13:03:13 - INFO - __main__ - Step 112485: {'lr': 7.51701263222356e-05, 'samples': 21597120, 'steps': 112484, 'loss/train': 1.4174423217773438} 11/07/2021 13:03:14 - INFO - __main__ - Step 112486: {'lr': 7.516633305174953e-05, 'samples': 21597312, 'steps': 112485, 'loss/train': 0.9439211487770081} 11/07/2021 13:03:15 - INFO - __main__ - Step 112487: {'lr': 7.51625398600408e-05, 'samples': 21597504, 'steps': 112486, 'loss/train': 1.3344563245773315} 11/07/2021 13:03:15 - INFO - __main__ - Step 112488: {'lr': 7.515874674711113e-05, 'samples': 21597696, 'steps': 112487, 'loss/train': 1.5138486623764038} 11/07/2021 13:03:16 - INFO - __main__ - Step 112489: {'lr': 7.515495371296225e-05, 'samples': 21597888, 'steps': 112488, 'loss/train': 1.4948538541793823} 11/07/2021 13:03:16 - INFO - __main__ - Step 112490: {'lr': 7.515116075759582e-05, 'samples': 21598080, 'steps': 112489, 'loss/train': 1.4986313581466675} 11/07/2021 13:03:16 - INFO - __main__ - Step 112491: {'lr': 7.514736788101359e-05, 'samples': 21598272, 'steps': 112490, 'loss/train': 1.799429178237915} 11/07/2021 13:03:17 - INFO - __main__ - Step 112492: {'lr': 7.514357508321726e-05, 'samples': 21598464, 'steps': 112491, 'loss/train': 0.5887948274612427} 11/07/2021 13:03:18 - INFO - __main__ - Step 112493: {'lr': 7.513978236420855e-05, 'samples': 21598656, 'steps': 112492, 'loss/train': 1.4741109609603882} 11/07/2021 13:03:18 - INFO - __main__ - Step 112494: {'lr': 7.513598972398913e-05, 'samples': 21598848, 'steps': 112493, 'loss/train': 0.8428110480308533} 11/07/2021 13:03:19 - INFO - __main__ - Step 112495: {'lr': 7.513219716256073e-05, 'samples': 21599040, 'steps': 112494, 'loss/train': 1.3868130445480347} 11/07/2021 13:03:19 - INFO - __main__ - Step 112496: {'lr': 7.51284046799251e-05, 'samples': 21599232, 'steps': 112495, 'loss/train': 1.5171499252319336} 11/07/2021 13:03:20 - INFO - __main__ - Step 112497: {'lr': 7.512461227608397e-05, 'samples': 21599424, 'steps': 112496, 'loss/train': 1.3855563402175903} 11/07/2021 13:03:20 - INFO - __main__ - Step 112498: {'lr': 7.51208199510389e-05, 'samples': 21599616, 'steps': 112497, 'loss/train': 1.8196070194244385} 11/07/2021 13:03:21 - INFO - __main__ - Step 112499: {'lr': 7.51170277047917e-05, 'samples': 21599808, 'steps': 112498, 'loss/train': 0.4630320370197296} 11/07/2021 13:03:21 - INFO - __main__ - Step 112500: {'lr': 7.511323553734409e-05, 'samples': 21600000, 'steps': 112499, 'loss/train': 1.3380557298660278} 11/07/2021 13:03:21 - INFO - __main__ - Step 112501: {'lr': 7.510944344869774e-05, 'samples': 21600192, 'steps': 112500, 'loss/train': 1.5857707262039185} 11/07/2021 13:03:23 - INFO - __main__ - Step 112502: {'lr': 7.510565143885436e-05, 'samples': 21600384, 'steps': 112501, 'loss/train': 1.2975338697433472} 11/07/2021 13:03:23 - INFO - __main__ - Step 112503: {'lr': 7.51018595078157e-05, 'samples': 21600576, 'steps': 112502, 'loss/train': 1.3317784070968628} 11/07/2021 13:03:23 - INFO - __main__ - Step 112504: {'lr': 7.509806765558344e-05, 'samples': 21600768, 'steps': 112503, 'loss/train': 1.4121787548065186} 11/07/2021 13:03:24 - INFO - __main__ - Step 112505: {'lr': 7.509427588215928e-05, 'samples': 21600960, 'steps': 112504, 'loss/train': 1.3042542934417725} 11/07/2021 13:03:24 - INFO - __main__ - Step 112506: {'lr': 7.509048418754494e-05, 'samples': 21601152, 'steps': 112505, 'loss/train': 1.309411883354187} 11/07/2021 13:03:24 - INFO - __main__ - Step 112507: {'lr': 7.508669257174214e-05, 'samples': 21601344, 'steps': 112506, 'loss/train': 0.6715258359909058} 11/07/2021 13:03:25 - INFO - __main__ - Step 112508: {'lr': 7.508290103475257e-05, 'samples': 21601536, 'steps': 112507, 'loss/train': 1.399143934249878} 11/07/2021 13:03:26 - INFO - __main__ - Step 112509: {'lr': 7.507910957657796e-05, 'samples': 21601728, 'steps': 112508, 'loss/train': 1.5773085355758667} 11/07/2021 13:03:26 - INFO - __main__ - Step 112510: {'lr': 7.507531819721997e-05, 'samples': 21601920, 'steps': 112509, 'loss/train': 1.073347568511963} 11/07/2021 13:03:26 - INFO - __main__ - Step 112511: {'lr': 7.507152689668045e-05, 'samples': 21602112, 'steps': 112510, 'loss/train': 1.2321726083755493} 11/07/2021 13:03:27 - INFO - __main__ - Step 112512: {'lr': 7.506773567496092e-05, 'samples': 21602304, 'steps': 112511, 'loss/train': 1.852706789970398} 11/07/2021 13:03:28 - INFO - __main__ - Step 112513: {'lr': 7.506394453206317e-05, 'samples': 21602496, 'steps': 112512, 'loss/train': 1.2236636877059937} 11/07/2021 13:03:28 - INFO - __main__ - Step 112514: {'lr': 7.506015346798889e-05, 'samples': 21602688, 'steps': 112513, 'loss/train': 1.3074408769607544} 11/07/2021 13:03:28 - INFO - __main__ - Step 112515: {'lr': 7.505636248273981e-05, 'samples': 21602880, 'steps': 112514, 'loss/train': 0.6204796433448792} 11/07/2021 13:03:29 - INFO - __main__ - Step 112516: {'lr': 7.505257157631765e-05, 'samples': 21603072, 'steps': 112515, 'loss/train': 1.238903522491455} 11/07/2021 13:03:29 - INFO - __main__ - Step 112517: {'lr': 7.50487807487241e-05, 'samples': 21603264, 'steps': 112516, 'loss/train': 1.3383328914642334} 11/07/2021 13:03:30 - INFO - __main__ - Step 112518: {'lr': 7.504498999996084e-05, 'samples': 21603456, 'steps': 112517, 'loss/train': 1.0929057598114014} 11/07/2021 13:03:31 - INFO - __main__ - Step 112519: {'lr': 7.504119933002964e-05, 'samples': 21603648, 'steps': 112518, 'loss/train': 1.2920622825622559} 11/07/2021 13:03:31 - INFO - __main__ - Step 112520: {'lr': 7.503740873893217e-05, 'samples': 21603840, 'steps': 112519, 'loss/train': 1.6968001127243042} 11/07/2021 13:03:31 - INFO - __main__ - Step 112521: {'lr': 7.503361822667012e-05, 'samples': 21604032, 'steps': 112520, 'loss/train': 1.5287686586380005} 11/07/2021 13:03:32 - INFO - __main__ - Step 112522: {'lr': 7.502982779324525e-05, 'samples': 21604224, 'steps': 112521, 'loss/train': 1.303566575050354} 11/07/2021 13:03:33 - INFO - __main__ - Step 112523: {'lr': 7.502603743865924e-05, 'samples': 21604416, 'steps': 112522, 'loss/train': 1.2288485765457153} 11/07/2021 13:03:33 - INFO - __main__ - Step 112524: {'lr': 7.502224716291386e-05, 'samples': 21604608, 'steps': 112523, 'loss/train': 1.3946870565414429} 11/07/2021 13:03:33 - INFO - __main__ - Step 112525: {'lr': 7.501845696601068e-05, 'samples': 21604800, 'steps': 112524, 'loss/train': 1.412288784980774} 11/07/2021 13:03:34 - INFO - __main__ - Step 112526: {'lr': 7.50146668479515e-05, 'samples': 21604992, 'steps': 112525, 'loss/train': 0.6849725246429443} 11/07/2021 13:03:34 - INFO - __main__ - Step 112527: {'lr': 7.501087680873798e-05, 'samples': 21605184, 'steps': 112526, 'loss/train': 1.4011391401290894} 11/07/2021 13:03:35 - INFO - __main__ - Step 112528: {'lr': 7.500708684837187e-05, 'samples': 21605376, 'steps': 112527, 'loss/train': 0.7103037238121033} 11/07/2021 13:03:36 - INFO - __main__ - Step 112529: {'lr': 7.500329696685487e-05, 'samples': 21605568, 'steps': 112528, 'loss/train': 1.0625295639038086} 11/07/2021 13:03:37 - INFO - __main__ - Step 112530: {'lr': 7.499950716418869e-05, 'samples': 21605760, 'steps': 112529, 'loss/train': 1.5584039688110352} 11/07/2021 13:03:37 - INFO - __main__ - Step 112531: {'lr': 7.499571744037504e-05, 'samples': 21605952, 'steps': 112530, 'loss/train': 1.1693147420883179} 11/07/2021 13:03:37 - INFO - __main__ - Step 112532: {'lr': 7.499192779541561e-05, 'samples': 21606144, 'steps': 112531, 'loss/train': 0.5005372762680054} 11/07/2021 13:03:38 - INFO - __main__ - Step 112533: {'lr': 7.49881382293121e-05, 'samples': 21606336, 'steps': 112532, 'loss/train': 0.48615676164627075} 11/07/2021 13:03:38 - INFO - __main__ - Step 112534: {'lr': 7.498434874206624e-05, 'samples': 21606528, 'steps': 112533, 'loss/train': 1.6395068168640137} 11/07/2021 13:03:39 - INFO - __main__ - Step 112535: {'lr': 7.498055933367976e-05, 'samples': 21606720, 'steps': 112534, 'loss/train': 0.8068966269493103} 11/07/2021 13:03:40 - INFO - __main__ - Step 112536: {'lr': 7.497677000415432e-05, 'samples': 21606912, 'steps': 112535, 'loss/train': 1.142640471458435} 11/07/2021 13:03:40 - INFO - __main__ - Step 112537: {'lr': 7.497298075349163e-05, 'samples': 21607104, 'steps': 112536, 'loss/train': 1.9279648065567017} 11/07/2021 13:03:40 - INFO - __main__ - Step 112538: {'lr': 7.496919158169352e-05, 'samples': 21607296, 'steps': 112537, 'loss/train': 1.3883267641067505} 11/07/2021 13:03:41 - INFO - __main__ - Step 112539: {'lr': 7.496540248876149e-05, 'samples': 21607488, 'steps': 112538, 'loss/train': 1.727932095527649} 11/07/2021 13:03:42 - INFO - __main__ - Step 112540: {'lr': 7.496161347469738e-05, 'samples': 21607680, 'steps': 112539, 'loss/train': 2.0320160388946533} 11/07/2021 13:03:42 - INFO - __main__ - Step 112541: {'lr': 7.495782453950284e-05, 'samples': 21607872, 'steps': 112540, 'loss/train': 1.123210072517395} 11/07/2021 13:03:42 - INFO - __main__ - Step 112542: {'lr': 7.49540356831796e-05, 'samples': 21608064, 'steps': 112541, 'loss/train': 1.24380362033844} 11/07/2021 13:03:43 - INFO - __main__ - Step 112543: {'lr': 7.495024690572937e-05, 'samples': 21608256, 'steps': 112542, 'loss/train': 1.587599515914917} 11/07/2021 13:03:43 - INFO - __main__ - Step 112544: {'lr': 7.494645820715387e-05, 'samples': 21608448, 'steps': 112543, 'loss/train': 1.0880200862884521} 11/07/2021 13:03:45 - INFO - __main__ - Step 112545: {'lr': 7.494266958745479e-05, 'samples': 21608640, 'steps': 112544, 'loss/train': 1.9630604982376099} 11/07/2021 13:03:45 - INFO - __main__ - Step 112546: {'lr': 7.493888104663385e-05, 'samples': 21608832, 'steps': 112545, 'loss/train': 1.7203178405761719} 11/07/2021 13:03:46 - INFO - __main__ - Step 112547: {'lr': 7.493509258469275e-05, 'samples': 21609024, 'steps': 112546, 'loss/train': 1.6019495725631714} 11/07/2021 13:03:46 - INFO - __main__ - Step 112548: {'lr': 7.49313042016332e-05, 'samples': 21609216, 'steps': 112547, 'loss/train': 1.378307819366455} 11/07/2021 13:03:47 - INFO - __main__ - Step 112549: {'lr': 7.492751589745686e-05, 'samples': 21609408, 'steps': 112548, 'loss/train': 1.0757733583450317} 11/07/2021 13:03:47 - INFO - __main__ - Step 112550: {'lr': 7.492372767216551e-05, 'samples': 21609600, 'steps': 112549, 'loss/train': 0.8141987919807434} 11/07/2021 13:03:47 - INFO - __main__ - Step 112551: {'lr': 7.491993952576093e-05, 'samples': 21609792, 'steps': 112550, 'loss/train': 0.6696363091468811} 11/07/2021 13:03:48 - INFO - __main__ - Step 112552: {'lr': 7.49161514582446e-05, 'samples': 21609984, 'steps': 112551, 'loss/train': 0.7089333534240723} 11/07/2021 13:03:49 - INFO - __main__ - Step 112553: {'lr': 7.491236346961838e-05, 'samples': 21610176, 'steps': 112552, 'loss/train': 0.7073181867599487} 11/07/2021 13:03:49 - INFO - __main__ - Step 112554: {'lr': 7.490857555988395e-05, 'samples': 21610368, 'steps': 112553, 'loss/train': 1.4483274221420288} 11/07/2021 13:03:49 - INFO - __main__ - Step 112555: {'lr': 7.490478772904299e-05, 'samples': 21610560, 'steps': 112554, 'loss/train': 1.3591790199279785} 11/07/2021 13:03:50 - INFO - __main__ - Step 112556: {'lr': 7.490099997709724e-05, 'samples': 21610752, 'steps': 112555, 'loss/train': 1.4547991752624512} 11/07/2021 13:03:50 - INFO - __main__ - Step 112557: {'lr': 7.489721230404842e-05, 'samples': 21610944, 'steps': 112556, 'loss/train': 1.3288685083389282} 11/07/2021 13:03:51 - INFO - __main__ - Step 112558: {'lr': 7.489342470989818e-05, 'samples': 21611136, 'steps': 112557, 'loss/train': 1.3566126823425293} 11/07/2021 13:03:51 - INFO - __main__ - Step 112559: {'lr': 7.488963719464828e-05, 'samples': 21611328, 'steps': 112558, 'loss/train': 1.5429236888885498} 11/07/2021 13:03:52 - INFO - __main__ - Step 112560: {'lr': 7.48858497583004e-05, 'samples': 21611520, 'steps': 112559, 'loss/train': 1.4147263765335083} 11/07/2021 13:03:52 - INFO - __main__ - Step 112561: {'lr': 7.488206240085627e-05, 'samples': 21611712, 'steps': 112560, 'loss/train': 1.2937166690826416} 11/07/2021 13:03:52 - INFO - __main__ - Step 112562: {'lr': 7.487827512231754e-05, 'samples': 21611904, 'steps': 112561, 'loss/train': 0.7733573317527771} 11/07/2021 13:03:53 - INFO - __main__ - Step 112563: {'lr': 7.487448792268601e-05, 'samples': 21612096, 'steps': 112562, 'loss/train': 1.5921804904937744} 11/07/2021 13:03:54 - INFO - __main__ - Step 112564: {'lr': 7.487070080196329e-05, 'samples': 21612288, 'steps': 112563, 'loss/train': 1.071090579032898} 11/07/2021 13:03:54 - INFO - __main__ - Step 112565: {'lr': 7.486691376015123e-05, 'samples': 21612480, 'steps': 112564, 'loss/train': 1.1798183917999268} 11/07/2021 13:03:55 - INFO - __main__ - Step 112566: {'lr': 7.486312679725135e-05, 'samples': 21612672, 'steps': 112565, 'loss/train': 0.9442809820175171} 11/07/2021 13:03:55 - INFO - __main__ - Step 112567: {'lr': 7.485933991326546e-05, 'samples': 21612864, 'steps': 112566, 'loss/train': 1.3008705377578735} 11/07/2021 13:03:55 - INFO - __main__ - Step 112568: {'lr': 7.485555310819522e-05, 'samples': 21613056, 'steps': 112567, 'loss/train': 1.1567214727401733} 11/07/2021 13:03:56 - INFO - __main__ - Step 112569: {'lr': 7.485176638204239e-05, 'samples': 21613248, 'steps': 112568, 'loss/train': 1.4867197275161743} 11/07/2021 13:03:57 - INFO - __main__ - Step 112570: {'lr': 7.484797973480865e-05, 'samples': 21613440, 'steps': 112569, 'loss/train': 1.5848220586776733} 11/07/2021 13:03:57 - INFO - __main__ - Step 112571: {'lr': 7.484419316649569e-05, 'samples': 21613632, 'steps': 112570, 'loss/train': 1.1291519403457642} 11/07/2021 13:03:57 - INFO - __main__ - Step 112572: {'lr': 7.484040667710523e-05, 'samples': 21613824, 'steps': 112571, 'loss/train': 1.4560766220092773} 11/07/2021 13:03:58 - INFO - __main__ - Step 112573: {'lr': 7.483662026663901e-05, 'samples': 21614016, 'steps': 112572, 'loss/train': 1.4098068475723267} 11/07/2021 13:03:59 - INFO - __main__ - Step 112574: {'lr': 7.483283393509869e-05, 'samples': 21614208, 'steps': 112573, 'loss/train': 1.0109683275222778} 11/07/2021 13:03:59 - INFO - __main__ - Step 112575: {'lr': 7.4829047682486e-05, 'samples': 21614400, 'steps': 112574, 'loss/train': 1.404454231262207} 11/07/2021 13:04:00 - INFO - __main__ - Step 112576: {'lr': 7.482526150880261e-05, 'samples': 21614592, 'steps': 112575, 'loss/train': 0.05605725198984146} 11/07/2021 13:04:00 - INFO - __main__ - Step 112577: {'lr': 7.482147541405035e-05, 'samples': 21614784, 'steps': 112576, 'loss/train': 1.556825041770935} 11/07/2021 13:04:00 - INFO - __main__ - Step 112578: {'lr': 7.481768939823075e-05, 'samples': 21614976, 'steps': 112577, 'loss/train': 1.1409128904342651} 11/07/2021 13:04:01 - INFO - __main__ - Step 112579: {'lr': 7.481390346134562e-05, 'samples': 21615168, 'steps': 112578, 'loss/train': 1.7998912334442139} 11/07/2021 13:04:02 - INFO - __main__ - Step 112580: {'lr': 7.481011760339662e-05, 'samples': 21615360, 'steps': 112579, 'loss/train': 1.1821060180664062} 11/07/2021 13:04:02 - INFO - __main__ - Step 112581: {'lr': 7.480633182438548e-05, 'samples': 21615552, 'steps': 112580, 'loss/train': 1.1696940660476685} 11/07/2021 13:04:02 - INFO - __main__ - Step 112582: {'lr': 7.48025461243139e-05, 'samples': 21615744, 'steps': 112581, 'loss/train': 1.1292757987976074} 11/07/2021 13:04:03 - INFO - __main__ - Step 112583: {'lr': 7.479876050318358e-05, 'samples': 21615936, 'steps': 112582, 'loss/train': 1.35703444480896} 11/07/2021 13:04:03 - INFO - __main__ - Step 112584: {'lr': 7.479497496099624e-05, 'samples': 21616128, 'steps': 112583, 'loss/train': 1.1809418201446533} 11/07/2021 13:04:04 - INFO - __main__ - Step 112585: {'lr': 7.47911894977536e-05, 'samples': 21616320, 'steps': 112584, 'loss/train': 1.3356409072875977} 11/07/2021 13:04:05 - INFO - __main__ - Step 112586: {'lr': 7.478740411345732e-05, 'samples': 21616512, 'steps': 112585, 'loss/train': 1.6558454036712646} 11/07/2021 13:04:05 - INFO - __main__ - Step 112587: {'lr': 7.478361880810924e-05, 'samples': 21616704, 'steps': 112586, 'loss/train': 1.9245041608810425} 11/07/2021 13:04:05 - INFO - __main__ - Step 112588: {'lr': 7.477983358171087e-05, 'samples': 21616896, 'steps': 112587, 'loss/train': 0.8448824882507324} 11/07/2021 13:04:06 - INFO - __main__ - Step 112589: {'lr': 7.477604843426397e-05, 'samples': 21617088, 'steps': 112588, 'loss/train': 1.244848370552063} 11/07/2021 13:04:07 - INFO - __main__ - Step 112590: {'lr': 7.477226336577031e-05, 'samples': 21617280, 'steps': 112589, 'loss/train': 1.5384559631347656} 11/07/2021 13:04:07 - INFO - __main__ - Step 112591: {'lr': 7.476847837623157e-05, 'samples': 21617472, 'steps': 112590, 'loss/train': 1.3950332403182983} 11/07/2021 13:04:07 - INFO - __main__ - Step 112592: {'lr': 7.476469346564942e-05, 'samples': 21617664, 'steps': 112591, 'loss/train': 1.6321502923965454} 11/07/2021 13:04:08 - INFO - __main__ - Step 112593: {'lr': 7.476090863402563e-05, 'samples': 21617856, 'steps': 112592, 'loss/train': 1.533238172531128} 11/07/2021 13:04:08 - INFO - __main__ - Step 112594: {'lr': 7.475712388136185e-05, 'samples': 21618048, 'steps': 112593, 'loss/train': 1.121731162071228} 11/07/2021 13:04:09 - INFO - __main__ - Step 112595: {'lr': 7.475333920765981e-05, 'samples': 21618240, 'steps': 112594, 'loss/train': 1.6694523096084595} 11/07/2021 13:04:10 - INFO - __main__ - Step 112596: {'lr': 7.474955461292121e-05, 'samples': 21618432, 'steps': 112595, 'loss/train': 0.8882414102554321} 11/07/2021 13:04:10 - INFO - __main__ - Step 112597: {'lr': 7.474577009714776e-05, 'samples': 21618624, 'steps': 112596, 'loss/train': 1.5640771389007568} 11/07/2021 13:04:10 - INFO - __main__ - Step 112598: {'lr': 7.474198566034124e-05, 'samples': 21618816, 'steps': 112597, 'loss/train': 1.4062700271606445} 11/07/2021 13:04:11 - INFO - __main__ - Step 112599: {'lr': 7.473820130250319e-05, 'samples': 21619008, 'steps': 112598, 'loss/train': 1.093153476715088} 11/07/2021 13:04:12 - INFO - __main__ - Step 112600: {'lr': 7.473441702363542e-05, 'samples': 21619200, 'steps': 112599, 'loss/train': 0.06221112236380577} 11/07/2021 13:04:12 - INFO - __main__ - Step 112601: {'lr': 7.47306328237396e-05, 'samples': 21619392, 'steps': 112600, 'loss/train': 1.6170927286148071} 11/07/2021 13:04:12 - INFO - __main__ - Step 112602: {'lr': 7.472684870281746e-05, 'samples': 21619584, 'steps': 112601, 'loss/train': 1.6827950477600098} 11/07/2021 13:04:13 - INFO - __main__ - Step 112603: {'lr': 7.47230646608707e-05, 'samples': 21619776, 'steps': 112602, 'loss/train': 1.280103325843811} 11/07/2021 13:04:13 - INFO - __main__ - Step 112604: {'lr': 7.471928069790101e-05, 'samples': 21619968, 'steps': 112603, 'loss/train': 1.3400275707244873} 11/07/2021 13:04:14 - INFO - __main__ - Step 112605: {'lr': 7.47154968139101e-05, 'samples': 21620160, 'steps': 112604, 'loss/train': 1.268478512763977} 11/07/2021 13:04:15 - INFO - __main__ - Step 112606: {'lr': 7.47117130088997e-05, 'samples': 21620352, 'steps': 112605, 'loss/train': 1.5162140130996704} 11/07/2021 13:04:15 - INFO - __main__ - Step 112607: {'lr': 7.470792928287151e-05, 'samples': 21620544, 'steps': 112606, 'loss/train': 1.0958888530731201} 11/07/2021 13:04:15 - INFO - __main__ - Step 112608: {'lr': 7.470414563582719e-05, 'samples': 21620736, 'steps': 112607, 'loss/train': 1.5468602180480957} 11/07/2021 13:04:16 - INFO - __main__ - Step 112609: {'lr': 7.470036206776859e-05, 'samples': 21620928, 'steps': 112608, 'loss/train': 2.0067832469940186} 11/07/2021 13:04:17 - INFO - __main__ - Step 112610: {'lr': 7.469657857869719e-05, 'samples': 21621120, 'steps': 112609, 'loss/train': 1.950521469116211} 11/07/2021 13:04:17 - INFO - __main__ - Step 112611: {'lr': 7.469279516861483e-05, 'samples': 21621312, 'steps': 112610, 'loss/train': 1.5772937536239624} 11/07/2021 13:04:17 - INFO - __main__ - Step 112612: {'lr': 7.468901183752319e-05, 'samples': 21621504, 'steps': 112611, 'loss/train': 1.5206633806228638} 11/07/2021 13:04:18 - INFO - __main__ - Step 112613: {'lr': 7.468522858542395e-05, 'samples': 21621696, 'steps': 112612, 'loss/train': 1.2678637504577637} 11/07/2021 13:04:18 - INFO - __main__ - Step 112614: {'lr': 7.468144541231886e-05, 'samples': 21621888, 'steps': 112613, 'loss/train': 1.3025637865066528} 11/07/2021 13:04:19 - INFO - __main__ - Step 112615: {'lr': 7.46776623182096e-05, 'samples': 21622080, 'steps': 112614, 'loss/train': 1.509914755821228} 11/07/2021 13:04:19 - INFO - __main__ - Step 112616: {'lr': 7.467387930309791e-05, 'samples': 21622272, 'steps': 112615, 'loss/train': 0.593270480632782} 11/07/2021 13:04:20 - INFO - __main__ - Step 112617: {'lr': 7.467009636698544e-05, 'samples': 21622464, 'steps': 112616, 'loss/train': 1.23935866355896} 11/07/2021 13:04:20 - INFO - __main__ - Step 112618: {'lr': 7.466631350987391e-05, 'samples': 21622656, 'steps': 112617, 'loss/train': 1.027669072151184} 11/07/2021 13:04:20 - INFO - __main__ - Step 112619: {'lr': 7.466253073176504e-05, 'samples': 21622848, 'steps': 112618, 'loss/train': 1.309256672859192} 11/07/2021 13:04:21 - INFO - __main__ - Step 112620: {'lr': 7.465874803266062e-05, 'samples': 21623040, 'steps': 112619, 'loss/train': 1.3302842378616333} 11/07/2021 13:04:22 - INFO - __main__ - Step 112621: {'lr': 7.465496541256217e-05, 'samples': 21623232, 'steps': 112620, 'loss/train': 1.2921578884124756} 11/07/2021 13:04:22 - INFO - __main__ - Step 112622: {'lr': 7.465118287147149e-05, 'samples': 21623424, 'steps': 112621, 'loss/train': 1.35545015335083} 11/07/2021 13:04:22 - INFO - __main__ - Step 112623: {'lr': 7.464740040939027e-05, 'samples': 21623616, 'steps': 112622, 'loss/train': 1.7080552577972412} 11/07/2021 13:04:23 - INFO - __main__ - Step 112624: {'lr': 7.464361802632025e-05, 'samples': 21623808, 'steps': 112623, 'loss/train': 1.0024434328079224} 11/07/2021 13:04:23 - INFO - __main__ - Step 112625: {'lr': 7.46398357222631e-05, 'samples': 21624000, 'steps': 112624, 'loss/train': 2.9577178955078125} 11/07/2021 13:04:24 - INFO - __main__ - Step 112626: {'lr': 7.463605349722052e-05, 'samples': 21624192, 'steps': 112625, 'loss/train': 1.4501639604568481} 11/07/2021 13:04:25 - INFO - __main__ - Step 112627: {'lr': 7.463227135119424e-05, 'samples': 21624384, 'steps': 112626, 'loss/train': 1.4747072458267212} 11/07/2021 13:04:25 - INFO - __main__ - Step 112628: {'lr': 7.462848928418595e-05, 'samples': 21624576, 'steps': 112627, 'loss/train': 1.5420721769332886} 11/07/2021 13:04:26 - INFO - __main__ - Step 112629: {'lr': 7.462470729619736e-05, 'samples': 21624768, 'steps': 112628, 'loss/train': 0.8242138028144836} 11/07/2021 13:04:26 - INFO - __main__ - Step 112630: {'lr': 7.462092538723017e-05, 'samples': 21624960, 'steps': 112629, 'loss/train': 1.354301929473877} 11/07/2021 13:04:27 - INFO - __main__ - Step 112631: {'lr': 7.461714355728607e-05, 'samples': 21625152, 'steps': 112630, 'loss/train': 1.0782743692398071} 11/07/2021 13:04:27 - INFO - __main__ - Step 112632: {'lr': 7.461336180636687e-05, 'samples': 21625344, 'steps': 112631, 'loss/train': 1.4369988441467285} 11/07/2021 13:04:28 - INFO - __main__ - Step 112633: {'lr': 7.460958013447411e-05, 'samples': 21625536, 'steps': 112632, 'loss/train': 1.0600279569625854} 11/07/2021 13:04:28 - INFO - __main__ - Step 112634: {'lr': 7.460579854160957e-05, 'samples': 21625728, 'steps': 112633, 'loss/train': 1.0679993629455566} 11/07/2021 13:04:28 - INFO - __main__ - Step 112635: {'lr': 7.460201702777494e-05, 'samples': 21625920, 'steps': 112634, 'loss/train': 1.2476778030395508} 11/07/2021 13:04:29 - INFO - __main__ - Step 112636: {'lr': 7.459823559297194e-05, 'samples': 21626112, 'steps': 112635, 'loss/train': 1.2792264223098755} 11/07/2021 13:04:30 - INFO - __main__ - Step 112637: {'lr': 7.459445423720227e-05, 'samples': 21626304, 'steps': 112636, 'loss/train': 1.3445597887039185} 11/07/2021 13:04:30 - INFO - __main__ - Step 112638: {'lr': 7.459067296046761e-05, 'samples': 21626496, 'steps': 112637, 'loss/train': 1.990456461906433} 11/07/2021 13:04:30 - INFO - __main__ - Step 112639: {'lr': 7.45868917627697e-05, 'samples': 21626688, 'steps': 112638, 'loss/train': 0.4476604461669922} 11/07/2021 13:04:31 - INFO - __main__ - Step 112640: {'lr': 7.458311064411025e-05, 'samples': 21626880, 'steps': 112639, 'loss/train': 1.0676841735839844} 11/07/2021 13:04:31 - INFO - __main__ - Step 112641: {'lr': 7.457932960449094e-05, 'samples': 21627072, 'steps': 112640, 'loss/train': 1.5739487409591675} 11/07/2021 13:04:32 - INFO - __main__ - Step 112642: {'lr': 7.457554864391345e-05, 'samples': 21627264, 'steps': 112641, 'loss/train': 1.2567356824874878} 11/07/2021 13:04:32 - INFO - __main__ - Step 112643: {'lr': 7.457176776237951e-05, 'samples': 21627456, 'steps': 112642, 'loss/train': 1.496107578277588} 11/07/2021 13:04:33 - INFO - __main__ - Step 112644: {'lr': 7.456798695989084e-05, 'samples': 21627648, 'steps': 112643, 'loss/train': 1.3249878883361816} 11/07/2021 13:04:33 - INFO - __main__ - Step 112645: {'lr': 7.456420623644922e-05, 'samples': 21627840, 'steps': 112644, 'loss/train': 1.2365455627441406} 11/07/2021 13:04:33 - INFO - __main__ - Step 112646: {'lr': 7.456042559205616e-05, 'samples': 21628032, 'steps': 112645, 'loss/train': 0.9657087922096252} 11/07/2021 13:04:35 - INFO - __main__ - Step 112647: {'lr': 7.455664502671347e-05, 'samples': 21628224, 'steps': 112646, 'loss/train': 1.9881932735443115} 11/07/2021 13:04:35 - INFO - __main__ - Step 112648: {'lr': 7.455286454042284e-05, 'samples': 21628416, 'steps': 112647, 'loss/train': 0.880395770072937} 11/07/2021 13:04:36 - INFO - __main__ - Step 112649: {'lr': 7.454908413318601e-05, 'samples': 21628608, 'steps': 112648, 'loss/train': 1.31515371799469} 11/07/2021 13:04:36 - INFO - __main__ - Step 112650: {'lr': 7.454530380500463e-05, 'samples': 21628800, 'steps': 112649, 'loss/train': 1.6827387809753418} 11/07/2021 13:04:36 - INFO - __main__ - Step 112651: {'lr': 7.454152355588046e-05, 'samples': 21628992, 'steps': 112650, 'loss/train': 1.48855459690094} 11/07/2021 13:04:37 - INFO - __main__ - Step 112652: {'lr': 7.453774338581515e-05, 'samples': 21629184, 'steps': 112651, 'loss/train': 0.8951484560966492} 11/07/2021 13:04:38 - INFO - __main__ - Step 112653: {'lr': 7.453396329481041e-05, 'samples': 21629376, 'steps': 112652, 'loss/train': 0.3999517858028412} 11/07/2021 13:04:38 - INFO - __main__ - Step 112654: {'lr': 7.453018328286797e-05, 'samples': 21629568, 'steps': 112653, 'loss/train': 1.5755727291107178} 11/07/2021 13:04:39 - INFO - __main__ - Step 112655: {'lr': 7.452640334998953e-05, 'samples': 21629760, 'steps': 112654, 'loss/train': 1.2999988794326782} 11/07/2021 13:04:39 - INFO - __main__ - Step 112656: {'lr': 7.452262349617678e-05, 'samples': 21629952, 'steps': 112655, 'loss/train': 1.286699891090393} 11/07/2021 13:04:39 - INFO - __main__ - Step 112657: {'lr': 7.451884372143145e-05, 'samples': 21630144, 'steps': 112656, 'loss/train': 1.5663174390792847} 11/07/2021 13:04:40 - INFO - __main__ - Step 112658: {'lr': 7.451506402575517e-05, 'samples': 21630336, 'steps': 112657, 'loss/train': 1.1167796850204468} 11/07/2021 13:04:41 - INFO - __main__ - Step 112659: {'lr': 7.451128440914981e-05, 'samples': 21630528, 'steps': 112658, 'loss/train': 1.1550710201263428} 11/07/2021 13:04:41 - INFO - __main__ - Step 112660: {'lr': 7.450750487161687e-05, 'samples': 21630720, 'steps': 112659, 'loss/train': 1.721755862236023} 11/07/2021 13:04:41 - INFO - __main__ - Step 112661: {'lr': 7.450372541315815e-05, 'samples': 21630912, 'steps': 112660, 'loss/train': 1.650891661643982} 11/07/2021 13:04:42 - INFO - __main__ - Step 112662: {'lr': 7.449994603377533e-05, 'samples': 21631104, 'steps': 112661, 'loss/train': 0.08021118491888046} 11/07/2021 13:04:43 - INFO - __main__ - Step 112663: {'lr': 7.449616673347012e-05, 'samples': 21631296, 'steps': 112662, 'loss/train': 1.1759743690490723} 11/07/2021 13:04:43 - INFO - __main__ - Step 112664: {'lr': 7.449238751224425e-05, 'samples': 21631488, 'steps': 112663, 'loss/train': 1.0915172100067139} 11/07/2021 13:04:44 - INFO - __main__ - Step 112665: {'lr': 7.448860837009941e-05, 'samples': 21631680, 'steps': 112664, 'loss/train': 1.0561294555664062} 11/07/2021 13:04:44 - INFO - __main__ - Step 112666: {'lr': 7.448482930703725e-05, 'samples': 21631872, 'steps': 112665, 'loss/train': 1.3484246730804443} 11/07/2021 13:04:44 - INFO - __main__ - Step 112667: {'lr': 7.448105032305954e-05, 'samples': 21632064, 'steps': 112666, 'loss/train': 1.3939826488494873} 11/07/2021 13:04:45 - INFO - __main__ - Step 112668: {'lr': 7.447727141816798e-05, 'samples': 21632256, 'steps': 112667, 'loss/train': 1.7830358743667603} 11/07/2021 13:04:46 - INFO - __main__ - Step 112669: {'lr': 7.447349259236424e-05, 'samples': 21632448, 'steps': 112668, 'loss/train': 1.5779070854187012} 11/07/2021 13:04:46 - INFO - __main__ - Step 112670: {'lr': 7.446971384565004e-05, 'samples': 21632640, 'steps': 112669, 'loss/train': 1.4997093677520752} 11/07/2021 13:04:46 - INFO - __main__ - Step 112671: {'lr': 7.446593517802707e-05, 'samples': 21632832, 'steps': 112670, 'loss/train': 1.0497604608535767} 11/07/2021 13:04:47 - INFO - __main__ - Step 112672: {'lr': 7.446215658949713e-05, 'samples': 21633024, 'steps': 112671, 'loss/train': 1.8591352701187134} 11/07/2021 13:04:48 - INFO - __main__ - Step 112673: {'lr': 7.445837808006172e-05, 'samples': 21633216, 'steps': 112672, 'loss/train': 1.3836615085601807} 11/07/2021 13:04:48 - INFO - __main__ - Step 112674: {'lr': 7.44545996497227e-05, 'samples': 21633408, 'steps': 112673, 'loss/train': 1.6956043243408203} 11/07/2021 13:04:48 - INFO - __main__ - Step 112675: {'lr': 7.445082129848172e-05, 'samples': 21633600, 'steps': 112674, 'loss/train': 1.1969977617263794} 11/07/2021 13:04:49 - INFO - __main__ - Step 112676: {'lr': 7.444704302634048e-05, 'samples': 21633792, 'steps': 112675, 'loss/train': 1.4928377866744995} 11/07/2021 13:04:49 - INFO - __main__ - Step 112677: {'lr': 7.44432648333007e-05, 'samples': 21633984, 'steps': 112676, 'loss/train': 1.8090393543243408} 11/07/2021 13:04:49 - INFO - __main__ - Step 112678: {'lr': 7.443948671936404e-05, 'samples': 21634176, 'steps': 112677, 'loss/train': 1.7629847526550293} 11/07/2021 13:04:50 - INFO - __main__ - Step 112679: {'lr': 7.443570868453228e-05, 'samples': 21634368, 'steps': 112678, 'loss/train': 0.9133136868476868} 11/07/2021 13:04:51 - INFO - __main__ - Step 112680: {'lr': 7.443193072880707e-05, 'samples': 21634560, 'steps': 112679, 'loss/train': 0.8553277850151062} 11/07/2021 13:04:51 - INFO - __main__ - Step 112681: {'lr': 7.442815285219012e-05, 'samples': 21634752, 'steps': 112680, 'loss/train': 1.0689361095428467} 11/07/2021 13:04:52 - INFO - __main__ - Step 112682: {'lr': 7.442437505468313e-05, 'samples': 21634944, 'steps': 112681, 'loss/train': 1.0242384672164917} 11/07/2021 13:04:52 - INFO - __main__ - Step 112683: {'lr': 7.442059733628784e-05, 'samples': 21635136, 'steps': 112682, 'loss/train': 1.4006940126419067} 11/07/2021 13:04:53 - INFO - __main__ - Step 112684: {'lr': 7.44168196970059e-05, 'samples': 21635328, 'steps': 112683, 'loss/train': 0.2608295679092407} 11/07/2021 13:04:53 - INFO - __main__ - Step 112685: {'lr': 7.4413042136839e-05, 'samples': 21635520, 'steps': 112684, 'loss/train': 1.4760569334030151} 11/07/2021 13:04:54 - INFO - __main__ - Step 112686: {'lr': 7.440926465578898e-05, 'samples': 21635712, 'steps': 112685, 'loss/train': 1.4408072233200073} 11/07/2021 13:04:54 - INFO - __main__ - Step 112687: {'lr': 7.440548725385737e-05, 'samples': 21635904, 'steps': 112686, 'loss/train': 1.3829299211502075} 11/07/2021 13:04:54 - INFO - __main__ - Step 112688: {'lr': 7.440170993104592e-05, 'samples': 21636096, 'steps': 112687, 'loss/train': 0.856231153011322} 11/07/2021 13:04:55 - INFO - __main__ - Step 112689: {'lr': 7.439793268735634e-05, 'samples': 21636288, 'steps': 112688, 'loss/train': 2.1451923847198486} 11/07/2021 13:04:56 - INFO - __main__ - Step 112690: {'lr': 7.439415552279036e-05, 'samples': 21636480, 'steps': 112689, 'loss/train': 0.4993569552898407} 11/07/2021 13:04:56 - INFO - __main__ - Step 112691: {'lr': 7.439037843734967e-05, 'samples': 21636672, 'steps': 112690, 'loss/train': 1.2063831090927124} 11/07/2021 13:04:56 - INFO - __main__ - Step 112692: {'lr': 7.438660143103596e-05, 'samples': 21636864, 'steps': 112691, 'loss/train': 1.2221943140029907} 11/07/2021 13:04:57 - INFO - __main__ - Step 112693: {'lr': 7.438282450385092e-05, 'samples': 21637056, 'steps': 112692, 'loss/train': 1.2730032205581665} 11/07/2021 13:04:58 - INFO - __main__ - Step 112694: {'lr': 7.437904765579629e-05, 'samples': 21637248, 'steps': 112693, 'loss/train': 1.125010371208191} 11/07/2021 13:04:58 - INFO - __main__ - Step 112695: {'lr': 7.437527088687374e-05, 'samples': 21637440, 'steps': 112694, 'loss/train': 1.2572071552276611} 11/07/2021 13:04:59 - INFO - __main__ - Step 112696: {'lr': 7.437149419708497e-05, 'samples': 21637632, 'steps': 112695, 'loss/train': 1.2733025550842285} 11/07/2021 13:04:59 - INFO - __main__ - Step 112697: {'lr': 7.436771758643174e-05, 'samples': 21637824, 'steps': 112696, 'loss/train': 1.3241569995880127} 11/07/2021 13:04:59 - INFO - __main__ - Step 112698: {'lr': 7.436394105491567e-05, 'samples': 21638016, 'steps': 112697, 'loss/train': 1.2871415615081787} 11/07/2021 13:05:00 - INFO - __main__ - Step 112699: {'lr': 7.436016460253858e-05, 'samples': 21638208, 'steps': 112698, 'loss/train': 1.473225474357605} 11/07/2021 13:05:01 - INFO - __main__ - Step 112700: {'lr': 7.435638822930202e-05, 'samples': 21638400, 'steps': 112699, 'loss/train': 1.1090283393859863} 11/07/2021 13:05:01 - INFO - __main__ - Step 112701: {'lr': 7.435261193520773e-05, 'samples': 21638592, 'steps': 112700, 'loss/train': 0.9839694499969482} 11/07/2021 13:05:01 - INFO - __main__ - Step 112702: {'lr': 7.43488357202575e-05, 'samples': 21638784, 'steps': 112701, 'loss/train': 0.7883259654045105} 11/07/2021 13:05:02 - INFO - __main__ - Step 112703: {'lr': 7.434505958445293e-05, 'samples': 21638976, 'steps': 112702, 'loss/train': 1.1265355348587036} 11/07/2021 13:05:02 - INFO - __main__ - Step 112704: {'lr': 7.434128352779576e-05, 'samples': 21639168, 'steps': 112703, 'loss/train': 1.365365743637085} 11/07/2021 13:05:03 - INFO - __main__ - Step 112705: {'lr': 7.433750755028773e-05, 'samples': 21639360, 'steps': 112704, 'loss/train': 1.8469605445861816} 11/07/2021 13:05:04 - INFO - __main__ - Step 112706: {'lr': 7.43337316519305e-05, 'samples': 21639552, 'steps': 112705, 'loss/train': 0.562397301197052} 11/07/2021 13:05:04 - INFO - __main__ - Step 112707: {'lr': 7.432995583272575e-05, 'samples': 21639744, 'steps': 112706, 'loss/train': 1.506473183631897} 11/07/2021 13:05:04 - INFO - __main__ - Step 112708: {'lr': 7.432618009267525e-05, 'samples': 21639936, 'steps': 112707, 'loss/train': 1.4645724296569824} 11/07/2021 13:05:05 - INFO - __main__ - Step 112709: {'lr': 7.432240443178065e-05, 'samples': 21640128, 'steps': 112708, 'loss/train': 1.7391585111618042} 11/07/2021 13:05:06 - INFO - __main__ - Step 112710: {'lr': 7.431862885004364e-05, 'samples': 21640320, 'steps': 112709, 'loss/train': 1.0352853536605835} 11/07/2021 13:05:06 - INFO - __main__ - Step 112711: {'lr': 7.4314853347466e-05, 'samples': 21640512, 'steps': 112710, 'loss/train': 1.3070226907730103} 11/07/2021 13:05:06 - INFO - __main__ - Step 112712: {'lr': 7.43110779240494e-05, 'samples': 21640704, 'steps': 112711, 'loss/train': 1.4085884094238281} 11/07/2021 13:05:07 - INFO - __main__ - Step 112713: {'lr': 7.430730257979545e-05, 'samples': 21640896, 'steps': 112712, 'loss/train': 1.5501197576522827} 11/07/2021 13:05:07 - INFO - __main__ - Step 112714: {'lr': 7.430352731470593e-05, 'samples': 21641088, 'steps': 112713, 'loss/train': 1.1519101858139038} 11/07/2021 13:05:08 - INFO - __main__ - Step 112715: {'lr': 7.429975212878254e-05, 'samples': 21641280, 'steps': 112714, 'loss/train': 1.2884931564331055} 11/07/2021 13:05:08 - INFO - __main__ - Step 112716: {'lr': 7.429597702202695e-05, 'samples': 21641472, 'steps': 112715, 'loss/train': 1.4208118915557861} 11/07/2021 13:05:09 - INFO - __main__ - Step 112717: {'lr': 7.42922019944409e-05, 'samples': 21641664, 'steps': 112716, 'loss/train': 1.3653854131698608} 11/07/2021 13:05:09 - INFO - __main__ - Step 112718: {'lr': 7.428842704602604e-05, 'samples': 21641856, 'steps': 112717, 'loss/train': 1.4682785272598267} 11/07/2021 13:05:09 - INFO - __main__ - Step 112719: {'lr': 7.428465217678412e-05, 'samples': 21642048, 'steps': 112718, 'loss/train': 1.463945746421814} 11/07/2021 13:05:10 - INFO - __main__ - Step 112720: {'lr': 7.428087738671686e-05, 'samples': 21642240, 'steps': 112719, 'loss/train': 0.3828143775463104} 11/07/2021 13:05:11 - INFO - __main__ - Step 112721: {'lr': 7.427710267582588e-05, 'samples': 21642432, 'steps': 112720, 'loss/train': 1.4755299091339111} 11/07/2021 13:05:11 - INFO - __main__ - Step 112722: {'lr': 7.427332804411294e-05, 'samples': 21642624, 'steps': 112721, 'loss/train': 0.7178643345832825} 11/07/2021 13:05:12 - INFO - __main__ - Step 112723: {'lr': 7.426955349157971e-05, 'samples': 21642816, 'steps': 112722, 'loss/train': 1.177880048751831} 11/07/2021 13:05:12 - INFO - __main__ - Step 112724: {'lr': 7.426577901822793e-05, 'samples': 21643008, 'steps': 112723, 'loss/train': 0.7449324727058411} 11/07/2021 13:05:12 - INFO - __main__ - Step 112725: {'lr': 7.426200462405928e-05, 'samples': 21643200, 'steps': 112724, 'loss/train': 1.4953837394714355} 11/07/2021 13:05:13 - INFO - __main__ - Step 112726: {'lr': 7.425823030907553e-05, 'samples': 21643392, 'steps': 112725, 'loss/train': 1.204140067100525} 11/07/2021 13:05:14 - INFO - __main__ - Step 112727: {'lr': 7.425445607327822e-05, 'samples': 21643584, 'steps': 112726, 'loss/train': 0.16685384511947632} 11/07/2021 13:05:14 - INFO - __main__ - Step 112728: {'lr': 7.425068191666914e-05, 'samples': 21643776, 'steps': 112727, 'loss/train': 0.814339816570282} 11/07/2021 13:05:14 - INFO - __main__ - Step 112729: {'lr': 7.424690783925e-05, 'samples': 21643968, 'steps': 112728, 'loss/train': 1.7567975521087646} 11/07/2021 13:05:15 - INFO - __main__ - Step 112730: {'lr': 7.424313384102252e-05, 'samples': 21644160, 'steps': 112729, 'loss/train': 1.5944644212722778} 11/07/2021 13:05:16 - INFO - __main__ - Step 112731: {'lr': 7.423935992198832e-05, 'samples': 21644352, 'steps': 112730, 'loss/train': 1.0680077075958252} 11/07/2021 13:05:16 - INFO - __main__ - Step 112732: {'lr': 7.42355860821492e-05, 'samples': 21644544, 'steps': 112731, 'loss/train': 0.8792305588722229} 11/07/2021 13:05:17 - INFO - __main__ - Step 112733: {'lr': 7.423181232150677e-05, 'samples': 21644736, 'steps': 112732, 'loss/train': 1.3336834907531738} 11/07/2021 13:05:17 - INFO - __main__ - Step 112734: {'lr': 7.422803864006281e-05, 'samples': 21644928, 'steps': 112733, 'loss/train': 1.21372652053833} 11/07/2021 13:05:17 - INFO - __main__ - Step 112735: {'lr': 7.422426503781896e-05, 'samples': 21645120, 'steps': 112734, 'loss/train': 1.4467254877090454} 11/07/2021 13:05:18 - INFO - __main__ - Step 112736: {'lr': 7.422049151477695e-05, 'samples': 21645312, 'steps': 112735, 'loss/train': 1.5798968076705933} 11/07/2021 13:05:19 - INFO - __main__ - Step 112737: {'lr': 7.421671807093847e-05, 'samples': 21645504, 'steps': 112736, 'loss/train': 1.2297697067260742} 11/07/2021 13:05:19 - INFO - __main__ - Step 112738: {'lr': 7.421294470630524e-05, 'samples': 21645696, 'steps': 112737, 'loss/train': 0.8994830250740051} 11/07/2021 13:05:19 - INFO - __main__ - Step 112739: {'lr': 7.420917142087899e-05, 'samples': 21645888, 'steps': 112738, 'loss/train': 1.4897170066833496} 11/07/2021 13:05:20 - INFO - __main__ - Step 112740: {'lr': 7.420539821466132e-05, 'samples': 21646080, 'steps': 112739, 'loss/train': 1.6217707395553589} 11/07/2021 13:05:21 - INFO - __main__ - Step 112741: {'lr': 7.420162508765399e-05, 'samples': 21646272, 'steps': 112740, 'loss/train': 0.8641698956489563} 11/07/2021 13:05:21 - INFO - __main__ - Step 112742: {'lr': 7.419785203985868e-05, 'samples': 21646464, 'steps': 112741, 'loss/train': 1.505284070968628} 11/07/2021 13:05:21 - INFO - __main__ - Step 112743: {'lr': 7.419407907127712e-05, 'samples': 21646656, 'steps': 112742, 'loss/train': 1.516717553138733} 11/07/2021 13:05:22 - INFO - __main__ - Step 112744: {'lr': 7.4190306181911e-05, 'samples': 21646848, 'steps': 112743, 'loss/train': 0.8206745982170105} 11/07/2021 13:05:22 - INFO - __main__ - Step 112745: {'lr': 7.418653337176198e-05, 'samples': 21647040, 'steps': 112744, 'loss/train': 1.0997118949890137} 11/07/2021 13:05:22 - INFO - __main__ - Step 112746: {'lr': 7.418276064083182e-05, 'samples': 21647232, 'steps': 112745, 'loss/train': 1.3133673667907715} 11/07/2021 13:05:24 - INFO - __main__ - Step 112747: {'lr': 7.41789879891222e-05, 'samples': 21647424, 'steps': 112746, 'loss/train': 1.5643683671951294} 11/07/2021 13:05:24 - INFO - __main__ - Step 112748: {'lr': 7.41752154166348e-05, 'samples': 21647616, 'steps': 112747, 'loss/train': 1.4579298496246338} 11/07/2021 13:05:24 - INFO - __main__ - Step 112749: {'lr': 7.417144292337135e-05, 'samples': 21647808, 'steps': 112748, 'loss/train': 1.1080869436264038} 11/07/2021 13:05:25 - INFO - __main__ - Step 112750: {'lr': 7.416767050933354e-05, 'samples': 21648000, 'steps': 112749, 'loss/train': 1.6253105401992798} 11/07/2021 13:05:25 - INFO - __main__ - Step 112751: {'lr': 7.416389817452304e-05, 'samples': 21648192, 'steps': 112750, 'loss/train': 1.3158831596374512} 11/07/2021 13:05:26 - INFO - __main__ - Step 112752: {'lr': 7.416012591894158e-05, 'samples': 21648384, 'steps': 112751, 'loss/train': 1.4324051141738892} 11/07/2021 13:05:26 - INFO - __main__ - Step 112753: {'lr': 7.415635374259094e-05, 'samples': 21648576, 'steps': 112752, 'loss/train': 0.8898590803146362} 11/07/2021 13:05:27 - INFO - __main__ - Step 112754: {'lr': 7.415258164547268e-05, 'samples': 21648768, 'steps': 112753, 'loss/train': 1.006111979484558} 11/07/2021 13:05:27 - INFO - __main__ - Step 112755: {'lr': 7.414880962758849e-05, 'samples': 21648960, 'steps': 112754, 'loss/train': 0.8473302721977234} 11/07/2021 13:05:27 - INFO - __main__ - Step 112756: {'lr': 7.414503768894019e-05, 'samples': 21649152, 'steps': 112755, 'loss/train': 1.2059766054153442} 11/07/2021 13:05:29 - INFO - __main__ - Step 112757: {'lr': 7.41412658295294e-05, 'samples': 21649344, 'steps': 112756, 'loss/train': 1.580086350440979} 11/07/2021 13:05:29 - INFO - __main__ - Step 112758: {'lr': 7.413749404935785e-05, 'samples': 21649536, 'steps': 112757, 'loss/train': 1.654593586921692} 11/07/2021 13:05:29 - INFO - __main__ - Step 112759: {'lr': 7.413372234842722e-05, 'samples': 21649728, 'steps': 112758, 'loss/train': 0.9242705702781677} 11/07/2021 13:05:30 - INFO - __main__ - Step 112760: {'lr': 7.412995072673923e-05, 'samples': 21649920, 'steps': 112759, 'loss/train': 2.1378297805786133} 11/07/2021 13:05:30 - INFO - __main__ - Step 112761: {'lr': 7.412617918429556e-05, 'samples': 21650112, 'steps': 112760, 'loss/train': 1.2582671642303467} 11/07/2021 13:05:31 - INFO - __main__ - Step 112762: {'lr': 7.412240772109794e-05, 'samples': 21650304, 'steps': 112761, 'loss/train': 1.278376579284668} 11/07/2021 13:05:31 - INFO - __main__ - Step 112763: {'lr': 7.411863633714802e-05, 'samples': 21650496, 'steps': 112762, 'loss/train': 1.0443944931030273} 11/07/2021 13:05:32 - INFO - __main__ - Step 112764: {'lr': 7.411486503244754e-05, 'samples': 21650688, 'steps': 112763, 'loss/train': 1.0253757238388062} 11/07/2021 13:05:32 - INFO - __main__ - Step 112765: {'lr': 7.411109380699818e-05, 'samples': 21650880, 'steps': 112764, 'loss/train': 1.3767664432525635} 11/07/2021 13:05:32 - INFO - __main__ - Step 112766: {'lr': 7.410732266080175e-05, 'samples': 21651072, 'steps': 112765, 'loss/train': 1.367293357849121} 11/07/2021 13:05:33 - INFO - __main__ - Step 112767: {'lr': 7.410355159385976e-05, 'samples': 21651264, 'steps': 112766, 'loss/train': 1.2371490001678467} 11/07/2021 13:05:34 - INFO - __main__ - Step 112768: {'lr': 7.409978060617398e-05, 'samples': 21651456, 'steps': 112767, 'loss/train': 1.0745227336883545} 11/07/2021 13:05:34 - INFO - __main__ - Step 112769: {'lr': 7.409600969774614e-05, 'samples': 21651648, 'steps': 112768, 'loss/train': 0.9065642356872559} 11/07/2021 13:05:34 - INFO - __main__ - Step 112770: {'lr': 7.409223886857791e-05, 'samples': 21651840, 'steps': 112769, 'loss/train': 1.2155903577804565} 11/07/2021 13:05:35 - INFO - __main__ - Step 112771: {'lr': 7.408846811867101e-05, 'samples': 21652032, 'steps': 112770, 'loss/train': 1.4022296667099} 11/07/2021 13:05:36 - INFO - __main__ - Step 112772: {'lr': 7.408469744802715e-05, 'samples': 21652224, 'steps': 112771, 'loss/train': 1.2617210149765015} 11/07/2021 13:05:36 - INFO - __main__ - Step 112773: {'lr': 7.4080926856648e-05, 'samples': 21652416, 'steps': 112772, 'loss/train': 1.4715031385421753} 11/07/2021 13:05:37 - INFO - __main__ - Step 112774: {'lr': 7.407715634453523e-05, 'samples': 21652608, 'steps': 112773, 'loss/train': 1.9968140125274658} 11/07/2021 13:05:37 - INFO - __main__ - Step 112775: {'lr': 7.407338591169063e-05, 'samples': 21652800, 'steps': 112774, 'loss/train': 1.5091607570648193} 11/07/2021 13:05:37 - INFO - __main__ - Step 112776: {'lr': 7.406961555811584e-05, 'samples': 21652992, 'steps': 112775, 'loss/train': 0.8475468158721924} 11/07/2021 13:05:38 - INFO - __main__ - Step 112777: {'lr': 7.406584528381255e-05, 'samples': 21653184, 'steps': 112776, 'loss/train': 1.53568696975708} 11/07/2021 13:05:39 - INFO - __main__ - Step 112778: {'lr': 7.406207508878249e-05, 'samples': 21653376, 'steps': 112777, 'loss/train': 1.4894399642944336} 11/07/2021 13:05:39 - INFO - __main__ - Step 112779: {'lr': 7.405830497302732e-05, 'samples': 21653568, 'steps': 112778, 'loss/train': 1.3160313367843628} 11/07/2021 13:05:39 - INFO - __main__ - Step 112780: {'lr': 7.405453493654887e-05, 'samples': 21653760, 'steps': 112779, 'loss/train': 1.8395220041275024} 11/07/2021 13:05:40 - INFO - __main__ - Step 112781: {'lr': 7.405076497934862e-05, 'samples': 21653952, 'steps': 112780, 'loss/train': 1.263141393661499} 11/07/2021 13:05:40 - INFO - __main__ - Step 112782: {'lr': 7.40469951014284e-05, 'samples': 21654144, 'steps': 112781, 'loss/train': 1.1966904401779175} 11/07/2021 13:05:41 - INFO - __main__ - Step 112783: {'lr': 7.40432253027899e-05, 'samples': 21654336, 'steps': 112782, 'loss/train': 1.3865816593170166} 11/07/2021 13:05:42 - INFO - __main__ - Step 112784: {'lr': 7.40394555834348e-05, 'samples': 21654528, 'steps': 112783, 'loss/train': 1.6079691648483276} 11/07/2021 13:05:42 - INFO - __main__ - Step 112785: {'lr': 7.403568594336479e-05, 'samples': 21654720, 'steps': 112784, 'loss/train': 1.2864924669265747} 11/07/2021 13:05:42 - INFO - __main__ - Step 112786: {'lr': 7.403191638258162e-05, 'samples': 21654912, 'steps': 112785, 'loss/train': 1.1866847276687622} 11/07/2021 13:05:43 - INFO - __main__ - Step 112787: {'lr': 7.402814690108692e-05, 'samples': 21655104, 'steps': 112786, 'loss/train': 1.3498661518096924} 11/07/2021 13:05:44 - INFO - __main__ - Step 112788: {'lr': 7.402437749888244e-05, 'samples': 21655296, 'steps': 112787, 'loss/train': 1.4431121349334717} 11/07/2021 13:05:44 - INFO - __main__ - Step 112789: {'lr': 7.402060817596984e-05, 'samples': 21655488, 'steps': 112788, 'loss/train': 1.7981139421463013} 11/07/2021 13:05:44 - INFO - __main__ - Step 112790: {'lr': 7.401683893235084e-05, 'samples': 21655680, 'steps': 112789, 'loss/train': 1.3308857679367065} 11/07/2021 13:05:45 - INFO - __main__ - Step 112791: {'lr': 7.401306976802716e-05, 'samples': 21655872, 'steps': 112790, 'loss/train': 1.3468976020812988} 11/07/2021 13:05:45 - INFO - __main__ - Step 112792: {'lr': 7.400930068300046e-05, 'samples': 21656064, 'steps': 112791, 'loss/train': 1.1771470308303833} 11/07/2021 13:05:46 - INFO - __main__ - Step 112793: {'lr': 7.400553167727253e-05, 'samples': 21656256, 'steps': 112792, 'loss/train': 1.3160004615783691} 11/07/2021 13:05:46 - INFO - __main__ - Step 112794: {'lr': 7.400176275084492e-05, 'samples': 21656448, 'steps': 112793, 'loss/train': 1.526835322380066} 11/07/2021 13:05:47 - INFO - __main__ - Step 112795: {'lr': 7.39979939037194e-05, 'samples': 21656640, 'steps': 112794, 'loss/train': 0.6058102250099182} 11/07/2021 13:05:47 - INFO - __main__ - Step 112796: {'lr': 7.399422513589765e-05, 'samples': 21656832, 'steps': 112795, 'loss/train': 1.0425167083740234} 11/07/2021 13:05:48 - INFO - __main__ - Step 112797: {'lr': 7.399045644738143e-05, 'samples': 21657024, 'steps': 112796, 'loss/train': 0.8398134708404541} 11/07/2021 13:05:49 - INFO - __main__ - Step 112798: {'lr': 7.398668783817236e-05, 'samples': 21657216, 'steps': 112797, 'loss/train': 0.9880686402320862} 11/07/2021 13:05:49 - INFO - __main__ - Step 112799: {'lr': 7.398291930827216e-05, 'samples': 21657408, 'steps': 112798, 'loss/train': 1.24125075340271} 11/07/2021 13:05:49 - INFO - __main__ - Step 112800: {'lr': 7.397915085768257e-05, 'samples': 21657600, 'steps': 112799, 'loss/train': 0.9197309613227844} 11/07/2021 13:05:50 - INFO - __main__ - Step 112801: {'lr': 7.397538248640526e-05, 'samples': 21657792, 'steps': 112800, 'loss/train': 1.274638056755066} 11/07/2021 13:05:50 - INFO - __main__ - Step 112802: {'lr': 7.39716141944419e-05, 'samples': 21657984, 'steps': 112801, 'loss/train': 1.1697803735733032} 11/07/2021 13:05:51 - INFO - __main__ - Step 112803: {'lr': 7.396784598179424e-05, 'samples': 21658176, 'steps': 112802, 'loss/train': 1.543640375137329} 11/07/2021 13:05:51 - INFO - __main__ - Step 112804: {'lr': 7.396407784846393e-05, 'samples': 21658368, 'steps': 112803, 'loss/train': 0.9017319083213806} 11/07/2021 13:05:52 - INFO - __main__ - Step 112805: {'lr': 7.396030979445271e-05, 'samples': 21658560, 'steps': 112804, 'loss/train': 1.7980453968048096} 11/07/2021 13:05:52 - INFO - __main__ - Step 112806: {'lr': 7.395654181976224e-05, 'samples': 21658752, 'steps': 112805, 'loss/train': 1.3760807514190674} 11/07/2021 13:05:52 - INFO - __main__ - Step 112807: {'lr': 7.395277392439431e-05, 'samples': 21658944, 'steps': 112806, 'loss/train': 1.3201267719268799} 11/07/2021 13:05:55 - INFO - __main__ - Step 112808: {'lr': 7.394900610835049e-05, 'samples': 21659136, 'steps': 112807, 'loss/train': 1.4339522123336792} 11/07/2021 13:05:55 - INFO - __main__ - Step 112809: {'lr': 7.394523837163253e-05, 'samples': 21659328, 'steps': 112808, 'loss/train': 1.3049412965774536} 11/07/2021 13:05:56 - INFO - __main__ - Step 112810: {'lr': 7.39414707142421e-05, 'samples': 21659520, 'steps': 112809, 'loss/train': 0.8225497007369995} 11/07/2021 13:05:56 - INFO - __main__ - Step 112811: {'lr': 7.393770313618095e-05, 'samples': 21659712, 'steps': 112810, 'loss/train': 1.2839546203613281} 11/07/2021 13:05:57 - INFO - __main__ - Step 112812: {'lr': 7.393393563745073e-05, 'samples': 21659904, 'steps': 112811, 'loss/train': 2.053307294845581} 11/07/2021 13:05:57 - INFO - __main__ - Step 112813: {'lr': 7.393016821805321e-05, 'samples': 21660096, 'steps': 112812, 'loss/train': 1.6317167282104492} 11/07/2021 13:05:57 - INFO - __main__ - Step 112814: {'lr': 7.392640087798999e-05, 'samples': 21660288, 'steps': 112813, 'loss/train': 1.746673822402954} 11/07/2021 13:05:58 - INFO - __main__ - Step 112815: {'lr': 7.392263361726284e-05, 'samples': 21660480, 'steps': 112814, 'loss/train': 1.7392584085464478} 11/07/2021 13:05:59 - INFO - __main__ - Step 112816: {'lr': 7.391886643587343e-05, 'samples': 21660672, 'steps': 112815, 'loss/train': 1.742626428604126} 11/07/2021 13:05:59 - INFO - __main__ - Step 112817: {'lr': 7.391509933382346e-05, 'samples': 21660864, 'steps': 112816, 'loss/train': 2.144057512283325} 11/07/2021 13:05:59 - INFO - __main__ - Step 112818: {'lr': 7.391133231111464e-05, 'samples': 21661056, 'steps': 112817, 'loss/train': 0.7036740779876709} 11/07/2021 13:06:00 - INFO - __main__ - Step 112819: {'lr': 7.390756536774865e-05, 'samples': 21661248, 'steps': 112818, 'loss/train': 1.4082850217819214} 11/07/2021 13:06:00 - INFO - __main__ - Step 112820: {'lr': 7.390379850372728e-05, 'samples': 21661440, 'steps': 112819, 'loss/train': 1.5312690734863281} 11/07/2021 13:06:00 - INFO - __main__ - Step 112821: {'lr': 7.390003171905205e-05, 'samples': 21661632, 'steps': 112820, 'loss/train': 0.9538128972053528} 11/07/2021 13:06:01 - INFO - __main__ - Step 112822: {'lr': 7.389626501372474e-05, 'samples': 21661824, 'steps': 112821, 'loss/train': 1.6919230222702026} 11/07/2021 13:06:02 - INFO - __main__ - Step 112823: {'lr': 7.389249838774706e-05, 'samples': 21662016, 'steps': 112822, 'loss/train': 1.2064491510391235} 11/07/2021 13:06:02 - INFO - __main__ - Step 112824: {'lr': 7.388873184112071e-05, 'samples': 21662208, 'steps': 112823, 'loss/train': 1.1645480394363403} 11/07/2021 13:06:02 - INFO - __main__ - Step 112825: {'lr': 7.388496537384736e-05, 'samples': 21662400, 'steps': 112824, 'loss/train': 0.9300564527511597} 11/07/2021 13:06:03 - INFO - __main__ - Step 112826: {'lr': 7.388119898592876e-05, 'samples': 21662592, 'steps': 112825, 'loss/train': 1.0152932405471802} 11/07/2021 13:06:04 - INFO - __main__ - Step 112827: {'lr': 7.387743267736658e-05, 'samples': 21662784, 'steps': 112826, 'loss/train': 0.9718204736709595} 11/07/2021 13:06:04 - INFO - __main__ - Step 112828: {'lr': 7.387366644816249e-05, 'samples': 21662976, 'steps': 112827, 'loss/train': 0.869454026222229} 11/07/2021 13:06:05 - INFO - __main__ - Step 112829: {'lr': 7.386990029831819e-05, 'samples': 21663168, 'steps': 112828, 'loss/train': 1.417578935623169} 11/07/2021 13:06:05 - INFO - __main__ - Step 112830: {'lr': 7.386613422783542e-05, 'samples': 21663360, 'steps': 112829, 'loss/train': 1.296955943107605} 11/07/2021 13:06:05 - INFO - __main__ - Step 112831: {'lr': 7.386236823671585e-05, 'samples': 21663552, 'steps': 112830, 'loss/train': 1.2647981643676758} 11/07/2021 13:06:06 - INFO - __main__ - Step 112832: {'lr': 7.385860232496117e-05, 'samples': 21663744, 'steps': 112831, 'loss/train': 1.5597282648086548} 11/07/2021 13:06:07 - INFO - __main__ - Step 112833: {'lr': 7.385483649257318e-05, 'samples': 21663936, 'steps': 112832, 'loss/train': 1.307091474533081} 11/07/2021 13:06:07 - INFO - __main__ - Step 112834: {'lr': 7.385107073955338e-05, 'samples': 21664128, 'steps': 112833, 'loss/train': 1.6456571817398071} 11/07/2021 13:06:07 - INFO - __main__ - Step 112835: {'lr': 7.38473050659036e-05, 'samples': 21664320, 'steps': 112834, 'loss/train': 1.239084005355835} 11/07/2021 13:06:08 - INFO - __main__ - Step 112836: {'lr': 7.384353947162547e-05, 'samples': 21664512, 'steps': 112835, 'loss/train': 1.3547327518463135} 11/07/2021 13:06:09 - INFO - __main__ - Step 112837: {'lr': 7.383977395672076e-05, 'samples': 21664704, 'steps': 112836, 'loss/train': 1.466848611831665} 11/07/2021 13:06:09 - INFO - __main__ - Step 112838: {'lr': 7.38360085211911e-05, 'samples': 21664896, 'steps': 112837, 'loss/train': 1.1537925004959106} 11/07/2021 13:06:09 - INFO - __main__ - Step 112839: {'lr': 7.383224316503823e-05, 'samples': 21665088, 'steps': 112838, 'loss/train': 1.6767750978469849} 11/07/2021 13:06:10 - INFO - __main__ - Step 112840: {'lr': 7.382847788826383e-05, 'samples': 21665280, 'steps': 112839, 'loss/train': 1.2572293281555176} 11/07/2021 13:06:10 - INFO - __main__ - Step 112841: {'lr': 7.38247126908696e-05, 'samples': 21665472, 'steps': 112840, 'loss/train': 1.4555803537368774} 11/07/2021 13:06:11 - INFO - __main__ - Step 112842: {'lr': 7.382094757285724e-05, 'samples': 21665664, 'steps': 112841, 'loss/train': 1.3583440780639648} 11/07/2021 13:06:12 - INFO - __main__ - Step 112843: {'lr': 7.381718253422842e-05, 'samples': 21665856, 'steps': 112842, 'loss/train': 1.4776555299758911} 11/07/2021 13:06:12 - INFO - __main__ - Step 112844: {'lr': 7.381341757498489e-05, 'samples': 21666048, 'steps': 112843, 'loss/train': 1.5515328645706177} 11/07/2021 13:06:12 - INFO - __main__ - Step 112845: {'lr': 7.380965269512837e-05, 'samples': 21666240, 'steps': 112844, 'loss/train': 1.555945634841919} 11/07/2021 13:06:13 - INFO - __main__ - Step 112846: {'lr': 7.380588789466044e-05, 'samples': 21666432, 'steps': 112845, 'loss/train': 1.369847297668457} 11/07/2021 13:06:14 - INFO - __main__ - Step 112847: {'lr': 7.380212317358287e-05, 'samples': 21666624, 'steps': 112846, 'loss/train': 1.0670890808105469} 11/07/2021 13:06:14 - INFO - __main__ - Step 112848: {'lr': 7.379835853189731e-05, 'samples': 21666816, 'steps': 112847, 'loss/train': 1.457047462463379} 11/07/2021 13:06:15 - INFO - __main__ - Step 112849: {'lr': 7.379459396960551e-05, 'samples': 21667008, 'steps': 112848, 'loss/train': 1.2142919301986694} 11/07/2021 13:06:15 - INFO - __main__ - Step 112850: {'lr': 7.379082948670915e-05, 'samples': 21667200, 'steps': 112849, 'loss/train': 1.0458519458770752} 11/07/2021 13:06:15 - INFO - __main__ - Step 112851: {'lr': 7.378706508320993e-05, 'samples': 21667392, 'steps': 112850, 'loss/train': 1.6500781774520874} 11/07/2021 13:06:16 - INFO - __main__ - Step 112852: {'lr': 7.378330075910949e-05, 'samples': 21667584, 'steps': 112851, 'loss/train': 1.300378680229187} 11/07/2021 13:06:17 - INFO - __main__ - Step 112853: {'lr': 7.377953651440964e-05, 'samples': 21667776, 'steps': 112852, 'loss/train': 1.6998051404953003} 11/07/2021 13:06:17 - INFO - __main__ - Step 112854: {'lr': 7.377577234911198e-05, 'samples': 21667968, 'steps': 112853, 'loss/train': 1.7653849124908447} 11/07/2021 13:06:17 - INFO - __main__ - Step 112855: {'lr': 7.377200826321823e-05, 'samples': 21668160, 'steps': 112854, 'loss/train': 1.3348454236984253} 11/07/2021 13:06:18 - INFO - __main__ - Step 112856: {'lr': 7.376824425673017e-05, 'samples': 21668352, 'steps': 112855, 'loss/train': 1.6142079830169678} 11/07/2021 13:06:18 - INFO - __main__ - Step 112857: {'lr': 7.376448032964938e-05, 'samples': 21668544, 'steps': 112856, 'loss/train': 1.12921941280365} 11/07/2021 13:06:19 - INFO - __main__ - Step 112858: {'lr': 7.376071648197757e-05, 'samples': 21668736, 'steps': 112857, 'loss/train': 1.6477546691894531} 11/07/2021 13:06:20 - INFO - __main__ - Step 112859: {'lr': 7.375695271371646e-05, 'samples': 21668928, 'steps': 112858, 'loss/train': 1.1950263977050781} 11/07/2021 13:06:20 - INFO - __main__ - Step 112860: {'lr': 7.375318902486775e-05, 'samples': 21669120, 'steps': 112859, 'loss/train': 1.1922776699066162} 11/07/2021 13:06:20 - INFO - __main__ - Step 112861: {'lr': 7.374942541543315e-05, 'samples': 21669312, 'steps': 112860, 'loss/train': 1.038171410560608} 11/07/2021 13:06:21 - INFO - __main__ - Step 112862: {'lr': 7.37456618854143e-05, 'samples': 21669504, 'steps': 112861, 'loss/train': 0.7247509956359863} 11/07/2021 13:06:22 - INFO - __main__ - Step 112863: {'lr': 7.374189843481297e-05, 'samples': 21669696, 'steps': 112862, 'loss/train': 1.2627389430999756} 11/07/2021 13:06:22 - INFO - __main__ - Step 112864: {'lr': 7.37381350636308e-05, 'samples': 21669888, 'steps': 112863, 'loss/train': 1.220573902130127} 11/07/2021 13:06:22 - INFO - __main__ - Step 112865: {'lr': 7.373437177186951e-05, 'samples': 21670080, 'steps': 112864, 'loss/train': 1.8122478723526} 11/07/2021 13:06:23 - INFO - __main__ - Step 112866: {'lr': 7.373060855953079e-05, 'samples': 21670272, 'steps': 112865, 'loss/train': 0.0609080009162426} 11/07/2021 13:06:23 - INFO - __main__ - Step 112867: {'lr': 7.372684542661643e-05, 'samples': 21670464, 'steps': 112866, 'loss/train': 1.006561517715454} 11/07/2021 13:06:24 - INFO - __main__ - Step 112868: {'lr': 7.372308237312794e-05, 'samples': 21670656, 'steps': 112867, 'loss/train': 1.283977746963501} 11/07/2021 13:06:25 - INFO - __main__ - Step 112869: {'lr': 7.371931939906712e-05, 'samples': 21670848, 'steps': 112868, 'loss/train': 1.5391650199890137} 11/07/2021 13:06:25 - INFO - __main__ - Step 112870: {'lr': 7.371555650443565e-05, 'samples': 21671040, 'steps': 112869, 'loss/train': 1.140981912612915} 11/07/2021 13:06:25 - INFO - __main__ - Step 112871: {'lr': 7.371179368923522e-05, 'samples': 21671232, 'steps': 112870, 'loss/train': 1.0621100664138794} 11/07/2021 13:06:26 - INFO - __main__ - Step 112872: {'lr': 7.370803095346757e-05, 'samples': 21671424, 'steps': 112871, 'loss/train': 1.8889076709747314} 11/07/2021 13:06:26 - INFO - __main__ - Step 112873: {'lr': 7.370426829713433e-05, 'samples': 21671616, 'steps': 112872, 'loss/train': 1.5327870845794678} 11/07/2021 13:06:27 - INFO - __main__ - Step 112874: {'lr': 7.370050572023723e-05, 'samples': 21671808, 'steps': 112873, 'loss/train': 1.4290043115615845} 11/07/2021 13:06:28 - INFO - __main__ - Step 112875: {'lr': 7.369674322277798e-05, 'samples': 21672000, 'steps': 112874, 'loss/train': 1.6480038166046143} 11/07/2021 13:06:28 - INFO - __main__ - Step 112876: {'lr': 7.369298080475822e-05, 'samples': 21672192, 'steps': 112875, 'loss/train': 1.31851065158844} 11/07/2021 13:06:28 - INFO - __main__ - Step 112877: {'lr': 7.368921846617971e-05, 'samples': 21672384, 'steps': 112876, 'loss/train': 1.3038090467453003} 11/07/2021 13:06:29 - INFO - __main__ - Step 112878: {'lr': 7.36854562070442e-05, 'samples': 21672576, 'steps': 112877, 'loss/train': 5.704061031341553} 11/07/2021 13:06:30 - INFO - __main__ - Step 112879: {'lr': 7.368169402735322e-05, 'samples': 21672768, 'steps': 112878, 'loss/train': 1.4321855306625366} 11/07/2021 13:06:30 - INFO - __main__ - Step 112880: {'lr': 7.367793192710853e-05, 'samples': 21672960, 'steps': 112879, 'loss/train': 1.484136939048767} 11/07/2021 13:06:30 - INFO - __main__ - Step 112881: {'lr': 7.367416990631187e-05, 'samples': 21673152, 'steps': 112880, 'loss/train': 1.3543294668197632} 11/07/2021 13:06:31 - INFO - __main__ - Step 112882: {'lr': 7.367040796496487e-05, 'samples': 21673344, 'steps': 112881, 'loss/train': 1.3934866189956665} 11/07/2021 13:06:31 - INFO - __main__ - Step 112883: {'lr': 7.36666461030693e-05, 'samples': 21673536, 'steps': 112882, 'loss/train': 1.078586220741272} 11/07/2021 13:06:32 - INFO - __main__ - Step 112884: {'lr': 7.366288432062682e-05, 'samples': 21673728, 'steps': 112883, 'loss/train': 1.386454463005066} 11/07/2021 13:06:32 - INFO - __main__ - Step 112885: {'lr': 7.36591226176391e-05, 'samples': 21673920, 'steps': 112884, 'loss/train': 1.1986184120178223} 11/07/2021 13:06:33 - INFO - __main__ - Step 112886: {'lr': 7.365536099410786e-05, 'samples': 21674112, 'steps': 112885, 'loss/train': 1.820876955986023} 11/07/2021 13:06:33 - INFO - __main__ - Step 112887: {'lr': 7.365159945003481e-05, 'samples': 21674304, 'steps': 112886, 'loss/train': 1.6473407745361328} 11/07/2021 13:06:33 - INFO - __main__ - Step 112888: {'lr': 7.36478379854216e-05, 'samples': 21674496, 'steps': 112887, 'loss/train': 1.675660490989685} 11/07/2021 13:06:35 - INFO - __main__ - Step 112889: {'lr': 7.364407660027006e-05, 'samples': 21674688, 'steps': 112888, 'loss/train': 1.0861318111419678} 11/07/2021 13:06:35 - INFO - __main__ - Step 112890: {'lr': 7.364031529458171e-05, 'samples': 21674880, 'steps': 112889, 'loss/train': 1.4367738962173462} 11/07/2021 13:06:35 - INFO - __main__ - Step 112891: {'lr': 7.363655406835829e-05, 'samples': 21675072, 'steps': 112890, 'loss/train': 1.5228043794631958} 11/07/2021 13:06:36 - INFO - __main__ - Step 112892: {'lr': 7.36327929216015e-05, 'samples': 21675264, 'steps': 112891, 'loss/train': 1.3226014375686646} 11/07/2021 13:06:36 - INFO - __main__ - Step 112893: {'lr': 7.362903185431307e-05, 'samples': 21675456, 'steps': 112892, 'loss/train': 1.0587350130081177} 11/07/2021 13:06:37 - INFO - __main__ - Step 112894: {'lr': 7.362527086649468e-05, 'samples': 21675648, 'steps': 112893, 'loss/train': 1.4013400077819824} 11/07/2021 13:06:37 - INFO - __main__ - Step 112895: {'lr': 7.362150995814801e-05, 'samples': 21675840, 'steps': 112894, 'loss/train': 1.5456122159957886} 11/07/2021 13:06:38 - INFO - __main__ - Step 112896: {'lr': 7.361774912927479e-05, 'samples': 21676032, 'steps': 112895, 'loss/train': 1.285007357597351} 11/07/2021 13:06:38 - INFO - __main__ - Step 112897: {'lr': 7.361398837987668e-05, 'samples': 21676224, 'steps': 112896, 'loss/train': 0.9920331835746765} 11/07/2021 13:06:38 - INFO - __main__ - Step 112898: {'lr': 7.361022770995538e-05, 'samples': 21676416, 'steps': 112897, 'loss/train': 1.4790605306625366} 11/07/2021 13:06:39 - INFO - __main__ - Step 112899: {'lr': 7.360646711951257e-05, 'samples': 21676608, 'steps': 112898, 'loss/train': 1.285859227180481} 11/07/2021 13:06:40 - INFO - __main__ - Step 112900: {'lr': 7.360270660855001e-05, 'samples': 21676800, 'steps': 112899, 'loss/train': 1.9940794706344604} 11/07/2021 13:06:40 - INFO - __main__ - Step 112901: {'lr': 7.359894617706939e-05, 'samples': 21676992, 'steps': 112900, 'loss/train': 1.2156764268875122} 11/07/2021 13:06:41 - INFO - __main__ - Step 112902: {'lr': 7.359518582507229e-05, 'samples': 21677184, 'steps': 112901, 'loss/train': 1.0581191778182983} 11/07/2021 13:06:41 - INFO - __main__ - Step 112903: {'lr': 7.359142555256048e-05, 'samples': 21677376, 'steps': 112902, 'loss/train': 0.7456344962120056} 11/07/2021 13:06:41 - INFO - __main__ - Step 112904: {'lr': 7.358766535953565e-05, 'samples': 21677568, 'steps': 112903, 'loss/train': 1.193034052848816} 11/07/2021 13:06:42 - INFO - __main__ - Step 112905: {'lr': 7.35839052459995e-05, 'samples': 21677760, 'steps': 112904, 'loss/train': 0.7119191288948059} 11/07/2021 13:06:43 - INFO - __main__ - Step 112906: {'lr': 7.358014521195372e-05, 'samples': 21677952, 'steps': 112905, 'loss/train': 1.5754902362823486} 11/07/2021 13:06:43 - INFO - __main__ - Step 112907: {'lr': 7.357638525740001e-05, 'samples': 21678144, 'steps': 112906, 'loss/train': 1.312813639640808} 11/07/2021 13:06:44 - INFO - __main__ - Step 112908: {'lr': 7.357262538234005e-05, 'samples': 21678336, 'steps': 112907, 'loss/train': 1.27462899684906} 11/07/2021 13:06:44 - INFO - __main__ - Step 112909: {'lr': 7.356886558677555e-05, 'samples': 21678528, 'steps': 112908, 'loss/train': 1.3388495445251465} 11/07/2021 13:06:45 - INFO - __main__ - Step 112910: {'lr': 7.356510587070819e-05, 'samples': 21678720, 'steps': 112909, 'loss/train': 1.435093879699707} 11/07/2021 13:06:45 - INFO - __main__ - Step 112911: {'lr': 7.356134623413968e-05, 'samples': 21678912, 'steps': 112910, 'loss/train': 1.985727071762085} 11/07/2021 13:06:46 - INFO - __main__ - Step 112912: {'lr': 7.355758667707168e-05, 'samples': 21679104, 'steps': 112911, 'loss/train': 1.3823142051696777} 11/07/2021 13:06:46 - INFO - __main__ - Step 112913: {'lr': 7.355382719950593e-05, 'samples': 21679296, 'steps': 112912, 'loss/train': 1.2005504369735718} 11/07/2021 13:06:46 - INFO - __main__ - Step 112914: {'lr': 7.355006780144419e-05, 'samples': 21679488, 'steps': 112913, 'loss/train': 1.1096370220184326} 11/07/2021 13:06:47 - INFO - __main__ - Step 112915: {'lr': 7.354630848288796e-05, 'samples': 21679680, 'steps': 112914, 'loss/train': 0.6819249391555786} 11/07/2021 13:06:48 - INFO - __main__ - Step 112916: {'lr': 7.354254924383907e-05, 'samples': 21679872, 'steps': 112915, 'loss/train': 1.4539254903793335} 11/07/2021 13:06:48 - INFO - __main__ - Step 112917: {'lr': 7.353879008429917e-05, 'samples': 21680064, 'steps': 112916, 'loss/train': 1.5298418998718262} 11/07/2021 13:06:48 - INFO - __main__ - Step 112918: {'lr': 7.353503100426995e-05, 'samples': 21680256, 'steps': 112917, 'loss/train': 1.3806614875793457} 11/07/2021 13:06:49 - INFO - __main__ - Step 112919: {'lr': 7.353127200375315e-05, 'samples': 21680448, 'steps': 112918, 'loss/train': 1.2559387683868408} 11/07/2021 13:06:50 - INFO - __main__ - Step 112920: {'lr': 7.352751308275043e-05, 'samples': 21680640, 'steps': 112919, 'loss/train': 1.0402805805206299} 11/07/2021 13:06:50 - INFO - __main__ - Step 112921: {'lr': 7.352375424126347e-05, 'samples': 21680832, 'steps': 112920, 'loss/train': 1.4245799779891968} 11/07/2021 13:06:50 - INFO - __main__ - Step 112922: {'lr': 7.3519995479294e-05, 'samples': 21681024, 'steps': 112921, 'loss/train': 0.929055392742157} 11/07/2021 13:06:51 - INFO - __main__ - Step 112923: {'lr': 7.351623679684372e-05, 'samples': 21681216, 'steps': 112922, 'loss/train': 1.5169340372085571} 11/07/2021 13:06:51 - INFO - __main__ - Step 112924: {'lr': 7.351247819391427e-05, 'samples': 21681408, 'steps': 112923, 'loss/train': 1.4598678350448608} 11/07/2021 13:06:52 - INFO - __main__ - Step 112925: {'lr': 7.350871967050738e-05, 'samples': 21681600, 'steps': 112924, 'loss/train': 1.275792121887207} 11/07/2021 13:06:53 - INFO - __main__ - Step 112926: {'lr': 7.350496122662472e-05, 'samples': 21681792, 'steps': 112925, 'loss/train': 1.5503073930740356} 11/07/2021 13:06:53 - INFO - __main__ - Step 112927: {'lr': 7.350120286226803e-05, 'samples': 21681984, 'steps': 112926, 'loss/train': 1.4939043521881104} 11/07/2021 13:06:53 - INFO - __main__ - Step 112928: {'lr': 7.349744457743904e-05, 'samples': 21682176, 'steps': 112927, 'loss/train': 1.5688592195510864} 11/07/2021 13:06:54 - INFO - __main__ - Step 112929: {'lr': 7.34936863721393e-05, 'samples': 21682368, 'steps': 112928, 'loss/train': 1.3659569025039673} 11/07/2021 13:06:54 - INFO - __main__ - Step 112930: {'lr': 7.348992824637057e-05, 'samples': 21682560, 'steps': 112929, 'loss/train': 1.345741629600525} 11/07/2021 13:06:55 - INFO - __main__ - Step 112931: {'lr': 7.348617020013457e-05, 'samples': 21682752, 'steps': 112930, 'loss/train': 1.6973868608474731} 11/07/2021 13:06:55 - INFO - __main__ - Step 112932: {'lr': 7.348241223343299e-05, 'samples': 21682944, 'steps': 112931, 'loss/train': 1.5301198959350586} 11/07/2021 13:06:56 - INFO - __main__ - Step 112933: {'lr': 7.34786543462675e-05, 'samples': 21683136, 'steps': 112932, 'loss/train': 1.1131656169891357} 11/07/2021 13:06:56 - INFO - __main__ - Step 112934: {'lr': 7.347489653863979e-05, 'samples': 21683328, 'steps': 112933, 'loss/train': 1.5431323051452637} 11/07/2021 13:06:56 - INFO - __main__ - Step 112935: {'lr': 7.34711388105516e-05, 'samples': 21683520, 'steps': 112934, 'loss/train': 1.4149177074432373} 11/07/2021 13:06:58 - INFO - __main__ - Step 112936: {'lr': 7.346738116200455e-05, 'samples': 21683712, 'steps': 112935, 'loss/train': 1.4227325916290283} 11/07/2021 13:06:58 - INFO - __main__ - Step 112937: {'lr': 7.346362359300038e-05, 'samples': 21683904, 'steps': 112936, 'loss/train': 1.5281894207000732} 11/07/2021 13:06:58 - INFO - __main__ - Step 112938: {'lr': 7.34598661035408e-05, 'samples': 21684096, 'steps': 112937, 'loss/train': 1.615063190460205} 11/07/2021 13:06:59 - INFO - __main__ - Step 112939: {'lr': 7.345610869362746e-05, 'samples': 21684288, 'steps': 112938, 'loss/train': 1.2670005559921265} 11/07/2021 13:06:59 - INFO - __main__ - Step 112940: {'lr': 7.345235136326208e-05, 'samples': 21684480, 'steps': 112939, 'loss/train': 1.7682714462280273} 11/07/2021 13:07:00 - INFO - __main__ - Step 112941: {'lr': 7.344859411244645e-05, 'samples': 21684672, 'steps': 112940, 'loss/train': 1.4609136581420898} 11/07/2021 13:07:00 - INFO - __main__ - Step 112942: {'lr': 7.344483694118203e-05, 'samples': 21684864, 'steps': 112941, 'loss/train': 1.0085331201553345} 11/07/2021 13:07:01 - INFO - __main__ - Step 112943: {'lr': 7.344107984947068e-05, 'samples': 21685056, 'steps': 112942, 'loss/train': 1.3264195919036865} 11/07/2021 13:07:01 - INFO - __main__ - Step 112944: {'lr': 7.343732283731405e-05, 'samples': 21685248, 'steps': 112943, 'loss/train': 1.3076804876327515} 11/07/2021 13:07:01 - INFO - __main__ - Step 112945: {'lr': 7.343356590471384e-05, 'samples': 21685440, 'steps': 112944, 'loss/train': 1.1987158060073853} 11/07/2021 13:07:03 - INFO - __main__ - Step 112946: {'lr': 7.342980905167173e-05, 'samples': 21685632, 'steps': 112945, 'loss/train': 1.4295140504837036} 11/07/2021 13:07:03 - INFO - __main__ - Step 112947: {'lr': 7.342605227818944e-05, 'samples': 21685824, 'steps': 112946, 'loss/train': 1.5495986938476562} 11/07/2021 13:07:03 - INFO - __main__ - Step 112948: {'lr': 7.342229558426864e-05, 'samples': 21686016, 'steps': 112947, 'loss/train': 1.8170409202575684} 11/07/2021 13:07:04 - INFO - __main__ - Step 112949: {'lr': 7.341853896991099e-05, 'samples': 21686208, 'steps': 112948, 'loss/train': 1.1049748659133911} 11/07/2021 13:07:04 - INFO - __main__ - Step 112950: {'lr': 7.341478243511825e-05, 'samples': 21686400, 'steps': 112949, 'loss/train': 1.6262774467468262} 11/07/2021 13:07:05 - INFO - __main__ - Step 112951: {'lr': 7.34110259798921e-05, 'samples': 21686592, 'steps': 112950, 'loss/train': 1.4626511335372925} 11/07/2021 13:07:05 - INFO - __main__ - Step 112952: {'lr': 7.340726960423421e-05, 'samples': 21686784, 'steps': 112951, 'loss/train': 1.7481218576431274} 11/07/2021 13:07:06 - INFO - __main__ - Step 112953: {'lr': 7.340351330814626e-05, 'samples': 21686976, 'steps': 112952, 'loss/train': 1.505455732345581} 11/07/2021 13:07:06 - INFO - __main__ - Step 112954: {'lr': 7.339975709163008e-05, 'samples': 21687168, 'steps': 112953, 'loss/train': 1.695163607597351} 11/07/2021 13:07:06 - INFO - __main__ - Step 112955: {'lr': 7.339600095468712e-05, 'samples': 21687360, 'steps': 112954, 'loss/train': 0.9636628031730652} 11/07/2021 13:07:07 - INFO - __main__ - Step 112956: {'lr': 7.339224489731921e-05, 'samples': 21687552, 'steps': 112955, 'loss/train': 1.1911985874176025} 11/07/2021 13:07:08 - INFO - __main__ - Step 112957: {'lr': 7.338848891952804e-05, 'samples': 21687744, 'steps': 112956, 'loss/train': 1.0706552267074585} 11/07/2021 13:07:08 - INFO - __main__ - Step 112958: {'lr': 7.338473302131529e-05, 'samples': 21687936, 'steps': 112957, 'loss/train': 1.1894819736480713} 11/07/2021 13:07:08 - INFO - __main__ - Step 112959: {'lr': 7.338097720268267e-05, 'samples': 21688128, 'steps': 112958, 'loss/train': 1.6361526250839233} 11/07/2021 13:07:09 - INFO - __main__ - Step 112960: {'lr': 7.337722146363182e-05, 'samples': 21688320, 'steps': 112959, 'loss/train': 0.4030868411064148} 11/07/2021 13:07:09 - INFO - __main__ - Step 112961: {'lr': 7.337346580416449e-05, 'samples': 21688512, 'steps': 112960, 'loss/train': 0.7461603879928589} 11/07/2021 13:07:11 - INFO - __main__ - Step 112962: {'lr': 7.336971022428235e-05, 'samples': 21688704, 'steps': 112961, 'loss/train': 1.3480337858200073} 11/07/2021 13:07:11 - INFO - __main__ - Step 112963: {'lr': 7.336595472398711e-05, 'samples': 21688896, 'steps': 112962, 'loss/train': 1.2355949878692627} 11/07/2021 13:07:11 - INFO - __main__ - Step 112964: {'lr': 7.336219930328042e-05, 'samples': 21689088, 'steps': 112963, 'loss/train': 0.44545409083366394} 11/07/2021 13:07:12 - INFO - __main__ - Step 112965: {'lr': 7.335844396216399e-05, 'samples': 21689280, 'steps': 112964, 'loss/train': 0.6956256628036499} 11/07/2021 13:07:12 - INFO - __main__ - Step 112966: {'lr': 7.335468870063952e-05, 'samples': 21689472, 'steps': 112965, 'loss/train': 0.7998329401016235} 11/07/2021 13:07:13 - INFO - __main__ - Step 112967: {'lr': 7.335093351870873e-05, 'samples': 21689664, 'steps': 112966, 'loss/train': 1.3985320329666138} 11/07/2021 13:07:14 - INFO - __main__ - Step 112968: {'lr': 7.334717841637334e-05, 'samples': 21689856, 'steps': 112967, 'loss/train': 1.580386757850647} 11/07/2021 13:07:14 - INFO - __main__ - Step 112969: {'lr': 7.334342339363492e-05, 'samples': 21690048, 'steps': 112968, 'loss/train': 0.6345682740211487} 11/07/2021 13:07:14 - INFO - __main__ - Step 112970: {'lr': 7.333966845049522e-05, 'samples': 21690240, 'steps': 112969, 'loss/train': 1.1423654556274414} 11/07/2021 13:07:15 - INFO - __main__ - Step 112971: {'lr': 7.333591358695594e-05, 'samples': 21690432, 'steps': 112970, 'loss/train': 1.1744898557662964} 11/07/2021 13:07:16 - INFO - __main__ - Step 112972: {'lr': 7.333215880301877e-05, 'samples': 21690624, 'steps': 112971, 'loss/train': 0.68037348985672} 11/07/2021 13:07:16 - INFO - __main__ - Step 112973: {'lr': 7.332840409868541e-05, 'samples': 21690816, 'steps': 112972, 'loss/train': 2.3763222694396973} 11/07/2021 13:07:17 - INFO - __main__ - Step 112974: {'lr': 7.332464947395753e-05, 'samples': 21691008, 'steps': 112973, 'loss/train': 1.127955436706543} 11/07/2021 13:07:17 - INFO - __main__ - Step 112975: {'lr': 7.332089492883684e-05, 'samples': 21691200, 'steps': 112974, 'loss/train': 1.8125134706497192} 11/07/2021 13:07:17 - INFO - __main__ - Step 112976: {'lr': 7.331714046332504e-05, 'samples': 21691392, 'steps': 112975, 'loss/train': 1.4716120958328247} 11/07/2021 13:07:18 - INFO - __main__ - Step 112977: {'lr': 7.33133860774238e-05, 'samples': 21691584, 'steps': 112976, 'loss/train': 1.4086589813232422} 11/07/2021 13:07:19 - INFO - __main__ - Step 112978: {'lr': 7.330963177113484e-05, 'samples': 21691776, 'steps': 112977, 'loss/train': 1.6325459480285645} 11/07/2021 13:07:19 - INFO - __main__ - Step 112979: {'lr': 7.33058775444598e-05, 'samples': 21691968, 'steps': 112978, 'loss/train': 1.3913064002990723} 11/07/2021 13:07:19 - INFO - __main__ - Step 112980: {'lr': 7.330212339740045e-05, 'samples': 21692160, 'steps': 112979, 'loss/train': 1.4357784986495972} 11/07/2021 13:07:20 - INFO - __main__ - Step 112981: {'lr': 7.329836932995848e-05, 'samples': 21692352, 'steps': 112980, 'loss/train': 1.4566987752914429} 11/07/2021 13:07:21 - INFO - __main__ - Step 112982: {'lr': 7.329461534213546e-05, 'samples': 21692544, 'steps': 112981, 'loss/train': 0.9548531770706177} 11/07/2021 13:07:21 - INFO - __main__ - Step 112983: {'lr': 7.329086143393318e-05, 'samples': 21692736, 'steps': 112982, 'loss/train': 0.8534504771232605} 11/07/2021 13:07:21 - INFO - __main__ - Step 112984: {'lr': 7.328710760535329e-05, 'samples': 21692928, 'steps': 112983, 'loss/train': 1.5242244005203247} 11/07/2021 13:07:22 - INFO - __main__ - Step 112985: {'lr': 7.328335385639751e-05, 'samples': 21693120, 'steps': 112984, 'loss/train': 1.2800407409667969} 11/07/2021 13:07:22 - INFO - __main__ - Step 112986: {'lr': 7.327960018706753e-05, 'samples': 21693312, 'steps': 112985, 'loss/train': 1.1498373746871948} 11/07/2021 13:07:23 - INFO - __main__ - Step 112987: {'lr': 7.327584659736503e-05, 'samples': 21693504, 'steps': 112986, 'loss/train': 1.7879998683929443} 11/07/2021 13:07:24 - INFO - __main__ - Step 112988: {'lr': 7.327209308729171e-05, 'samples': 21693696, 'steps': 112987, 'loss/train': 1.4364192485809326} 11/07/2021 13:07:24 - INFO - __main__ - Step 112989: {'lr': 7.326833965684925e-05, 'samples': 21693888, 'steps': 112988, 'loss/train': 1.4071568250656128} 11/07/2021 13:07:24 - INFO - __main__ - Step 112990: {'lr': 7.326458630603936e-05, 'samples': 21694080, 'steps': 112989, 'loss/train': 1.1539154052734375} 11/07/2021 13:07:25 - INFO - __main__ - Step 112991: {'lr': 7.326083303486372e-05, 'samples': 21694272, 'steps': 112990, 'loss/train': 1.362653136253357} 11/07/2021 13:07:25 - INFO - __main__ - Step 112992: {'lr': 7.3257079843324e-05, 'samples': 21694464, 'steps': 112991, 'loss/train': 1.4513171911239624} 11/07/2021 13:07:26 - INFO - __main__ - Step 112993: {'lr': 7.325332673142193e-05, 'samples': 21694656, 'steps': 112992, 'loss/train': 1.6739143133163452} 11/07/2021 13:07:26 - INFO - __main__ - Step 112994: {'lr': 7.324957369915919e-05, 'samples': 21694848, 'steps': 112993, 'loss/train': 1.477474570274353} 11/07/2021 13:07:27 - INFO - __main__ - Step 112995: {'lr': 7.324582074653754e-05, 'samples': 21695040, 'steps': 112994, 'loss/train': 1.3854910135269165} 11/07/2021 13:07:27 - INFO - __main__ - Step 112996: {'lr': 7.32420678735585e-05, 'samples': 21695232, 'steps': 112995, 'loss/train': 1.2257897853851318} 11/07/2021 13:07:27 - INFO - __main__ - Step 112997: {'lr': 7.323831508022388e-05, 'samples': 21695424, 'steps': 112996, 'loss/train': 1.5206879377365112} 11/07/2021 13:07:28 - INFO - __main__ - Step 112998: {'lr': 7.323456236653534e-05, 'samples': 21695616, 'steps': 112997, 'loss/train': 0.6834633946418762} 11/07/2021 13:07:29 - INFO - __main__ - Step 112999: {'lr': 7.323080973249457e-05, 'samples': 21695808, 'steps': 112998, 'loss/train': 1.3342560529708862} 11/07/2021 13:07:29 - INFO - __main__ - Step 113000: {'lr': 7.322705717810327e-05, 'samples': 21696000, 'steps': 112999, 'loss/train': 1.3565133810043335} 11/07/2021 13:07:30 - INFO - __main__ - Step 113001: {'lr': 7.322330470336314e-05, 'samples': 21696192, 'steps': 113000, 'loss/train': 1.194752812385559} 11/07/2021 13:07:30 - INFO - __main__ - Step 113002: {'lr': 7.321955230827585e-05, 'samples': 21696384, 'steps': 113001, 'loss/train': 1.4199937582015991} 11/07/2021 13:07:31 - INFO - __main__ - Step 113003: {'lr': 7.321579999284311e-05, 'samples': 21696576, 'steps': 113002, 'loss/train': 1.0952413082122803} 11/07/2021 13:07:31 - INFO - __main__ - Step 113004: {'lr': 7.32120477570666e-05, 'samples': 21696768, 'steps': 113003, 'loss/train': 1.2995818853378296} 11/07/2021 13:07:32 - INFO - __main__ - Step 113005: {'lr': 7.320829560094802e-05, 'samples': 21696960, 'steps': 113004, 'loss/train': 1.2329127788543701} 11/07/2021 13:07:32 - INFO - __main__ - Step 113006: {'lr': 7.320454352448905e-05, 'samples': 21697152, 'steps': 113005, 'loss/train': 1.4079055786132812} 11/07/2021 13:07:32 - INFO - __main__ - Step 113007: {'lr': 7.320079152769138e-05, 'samples': 21697344, 'steps': 113006, 'loss/train': 1.1770099401474} 11/07/2021 13:07:33 - INFO - __main__ - Step 113008: {'lr': 7.319703961055679e-05, 'samples': 21697536, 'steps': 113007, 'loss/train': 1.0271306037902832} 11/07/2021 13:07:34 - INFO - __main__ - Step 113009: {'lr': 7.319328777308679e-05, 'samples': 21697728, 'steps': 113008, 'loss/train': 1.2015150785446167} 11/07/2021 13:07:34 - INFO - __main__ - Step 113010: {'lr': 7.318953601528319e-05, 'samples': 21697920, 'steps': 113009, 'loss/train': 1.144504189491272} 11/07/2021 13:07:34 - INFO - __main__ - Step 113011: {'lr': 7.318578433714765e-05, 'samples': 21698112, 'steps': 113010, 'loss/train': 1.5710526704788208} 11/07/2021 13:07:35 - INFO - __main__ - Step 113012: {'lr': 7.318203273868185e-05, 'samples': 21698304, 'steps': 113011, 'loss/train': 0.30280861258506775} 11/07/2021 13:07:36 - INFO - __main__ - Step 113013: {'lr': 7.317828121988752e-05, 'samples': 21698496, 'steps': 113012, 'loss/train': 1.675839900970459} 11/07/2021 13:07:36 - INFO - __main__ - Step 113014: {'lr': 7.317452978076631e-05, 'samples': 21698688, 'steps': 113013, 'loss/train': 1.1489084959030151} 11/07/2021 13:07:36 - INFO - __main__ - Step 113015: {'lr': 7.317077842131995e-05, 'samples': 21698880, 'steps': 113014, 'loss/train': 1.7032514810562134} 11/07/2021 13:07:37 - INFO - __main__ - Step 113016: {'lr': 7.316702714155007e-05, 'samples': 21699072, 'steps': 113015, 'loss/train': 0.872230589389801} 11/07/2021 13:07:37 - INFO - __main__ - Step 113017: {'lr': 7.316327594145843e-05, 'samples': 21699264, 'steps': 113016, 'loss/train': 1.08505117893219} 11/07/2021 13:07:38 - INFO - __main__ - Step 113018: {'lr': 7.315952482104668e-05, 'samples': 21699456, 'steps': 113017, 'loss/train': 1.2706142663955688} 11/07/2021 13:07:39 - INFO - __main__ - Step 113019: {'lr': 7.315577378031654e-05, 'samples': 21699648, 'steps': 113018, 'loss/train': 0.8919275403022766} 11/07/2021 13:07:39 - INFO - __main__ - Step 113020: {'lr': 7.315202281926966e-05, 'samples': 21699840, 'steps': 113019, 'loss/train': 0.7888439893722534} 11/07/2021 13:07:39 - INFO - __main__ - Step 113021: {'lr': 7.314827193790774e-05, 'samples': 21700032, 'steps': 113020, 'loss/train': 1.5987905263900757} 11/07/2021 13:07:40 - INFO - __main__ - Step 113022: {'lr': 7.314452113623257e-05, 'samples': 21700224, 'steps': 113021, 'loss/train': 1.47966468334198} 11/07/2021 13:07:40 - INFO - __main__ - Step 113023: {'lr': 7.314077041424569e-05, 'samples': 21700416, 'steps': 113022, 'loss/train': 1.2819148302078247} 11/07/2021 13:07:41 - INFO - __main__ - Step 113024: {'lr': 7.313701977194884e-05, 'samples': 21700608, 'steps': 113023, 'loss/train': 1.1744979619979858} 11/07/2021 13:07:41 - INFO - __main__ - Step 113025: {'lr': 7.313326920934368e-05, 'samples': 21700800, 'steps': 113024, 'loss/train': 1.5120583772659302} 11/07/2021 13:07:42 - INFO - __main__ - Step 113026: {'lr': 7.312951872643198e-05, 'samples': 21700992, 'steps': 113025, 'loss/train': 0.7475871443748474} 11/07/2021 13:07:42 - INFO - __main__ - Step 113027: {'lr': 7.312576832321538e-05, 'samples': 21701184, 'steps': 113026, 'loss/train': 1.1263835430145264} 11/07/2021 13:07:42 - INFO - __main__ - Step 113028: {'lr': 7.312201799969559e-05, 'samples': 21701376, 'steps': 113027, 'loss/train': 1.0130690336227417} 11/07/2021 13:07:44 - INFO - __main__ - Step 113029: {'lr': 7.311826775587426e-05, 'samples': 21701568, 'steps': 113028, 'loss/train': 1.0001686811447144} 11/07/2021 13:07:44 - INFO - __main__ - Step 113030: {'lr': 7.311451759175314e-05, 'samples': 21701760, 'steps': 113029, 'loss/train': 0.6947020292282104} 11/07/2021 13:07:44 - INFO - __main__ - Step 113031: {'lr': 7.311076750733389e-05, 'samples': 21701952, 'steps': 113030, 'loss/train': 0.9089015126228333} 11/07/2021 13:07:45 - INFO - __main__ - Step 113032: {'lr': 7.310701750261817e-05, 'samples': 21702144, 'steps': 113031, 'loss/train': 1.603838562965393} 11/07/2021 13:07:45 - INFO - __main__ - Step 113033: {'lr': 7.310326757760772e-05, 'samples': 21702336, 'steps': 113032, 'loss/train': 1.353121042251587} 11/07/2021 13:07:46 - INFO - __main__ - Step 113034: {'lr': 7.30995177323042e-05, 'samples': 21702528, 'steps': 113033, 'loss/train': 1.5777865648269653} 11/07/2021 13:07:46 - INFO - __main__ - Step 113035: {'lr': 7.309576796670938e-05, 'samples': 21702720, 'steps': 113034, 'loss/train': 1.2601759433746338} 11/07/2021 13:07:47 - INFO - __main__ - Step 113036: {'lr': 7.309201828082482e-05, 'samples': 21702912, 'steps': 113035, 'loss/train': 1.4641375541687012} 11/07/2021 13:07:47 - INFO - __main__ - Step 113037: {'lr': 7.308826867465223e-05, 'samples': 21703104, 'steps': 113036, 'loss/train': 1.3657807111740112} 11/07/2021 13:07:48 - INFO - __main__ - Step 113038: {'lr': 7.30845191481934e-05, 'samples': 21703296, 'steps': 113037, 'loss/train': 0.18651755154132843} 11/07/2021 13:07:49 - INFO - __main__ - Step 113039: {'lr': 7.308076970144989e-05, 'samples': 21703488, 'steps': 113038, 'loss/train': 1.5630106925964355} 11/07/2021 13:07:49 - INFO - __main__ - Step 113040: {'lr': 7.307702033442348e-05, 'samples': 21703680, 'steps': 113039, 'loss/train': 1.4717917442321777} 11/07/2021 13:07:49 - INFO - __main__ - Step 113041: {'lr': 7.307327104711583e-05, 'samples': 21703872, 'steps': 113040, 'loss/train': 1.453553318977356} 11/07/2021 13:07:50 - INFO - __main__ - Step 113042: {'lr': 7.306952183952863e-05, 'samples': 21704064, 'steps': 113041, 'loss/train': 1.3639992475509644} 11/07/2021 13:07:50 - INFO - __main__ - Step 113043: {'lr': 7.30657727116636e-05, 'samples': 21704256, 'steps': 113042, 'loss/train': 1.3984498977661133} 11/07/2021 13:07:51 - INFO - __main__ - Step 113044: {'lr': 7.306202366352238e-05, 'samples': 21704448, 'steps': 113043, 'loss/train': 1.1933602094650269} 11/07/2021 13:07:51 - INFO - __main__ - Step 113045: {'lr': 7.30582746951067e-05, 'samples': 21704640, 'steps': 113044, 'loss/train': 1.0223677158355713} 11/07/2021 13:07:52 - INFO - __main__ - Step 113046: {'lr': 7.305452580641822e-05, 'samples': 21704832, 'steps': 113045, 'loss/train': 1.2481240034103394} 11/07/2021 13:07:52 - INFO - __main__ - Step 113047: {'lr': 7.305077699745863e-05, 'samples': 21705024, 'steps': 113046, 'loss/train': 1.4280247688293457} 11/07/2021 13:07:52 - INFO - __main__ - Step 113048: {'lr': 7.304702826822962e-05, 'samples': 21705216, 'steps': 113047, 'loss/train': 1.9711318016052246} 11/07/2021 13:07:53 - INFO - __main__ - Step 113049: {'lr': 7.3043279618733e-05, 'samples': 21705408, 'steps': 113048, 'loss/train': 1.0254220962524414} 11/07/2021 13:07:54 - INFO - __main__ - Step 113050: {'lr': 7.303953104897024e-05, 'samples': 21705600, 'steps': 113049, 'loss/train': 1.6731981039047241} 11/07/2021 13:07:54 - INFO - __main__ - Step 113051: {'lr': 7.303578255894316e-05, 'samples': 21705792, 'steps': 113050, 'loss/train': 1.1585453748703003} 11/07/2021 13:07:55 - INFO - __main__ - Step 113052: {'lr': 7.303203414865342e-05, 'samples': 21705984, 'steps': 113051, 'loss/train': 1.193825364112854} 11/07/2021 13:07:55 - INFO - __main__ - Step 113053: {'lr': 7.30282858181027e-05, 'samples': 21706176, 'steps': 113052, 'loss/train': 1.3463538885116577} 11/07/2021 13:07:55 - INFO - __main__ - Step 113054: {'lr': 7.302453756729272e-05, 'samples': 21706368, 'steps': 113053, 'loss/train': 1.4403339624404907} 11/07/2021 13:07:56 - INFO - __main__ - Step 113055: {'lr': 7.302078939622513e-05, 'samples': 21706560, 'steps': 113054, 'loss/train': 1.3466089963912964} 11/07/2021 13:07:57 - INFO - __main__ - Step 113056: {'lr': 7.301704130490166e-05, 'samples': 21706752, 'steps': 113055, 'loss/train': 1.0737788677215576} 11/07/2021 13:07:57 - INFO - __main__ - Step 113057: {'lr': 7.301329329332398e-05, 'samples': 21706944, 'steps': 113056, 'loss/train': 0.9973214268684387} 11/07/2021 13:07:57 - INFO - __main__ - Step 113058: {'lr': 7.300954536149379e-05, 'samples': 21707136, 'steps': 113057, 'loss/train': 1.4938818216323853} 11/07/2021 13:07:58 - INFO - __main__ - Step 113059: {'lr': 7.300579750941275e-05, 'samples': 21707328, 'steps': 113058, 'loss/train': 1.3567442893981934} 11/07/2021 13:07:59 - INFO - __main__ - Step 113060: {'lr': 7.300204973708258e-05, 'samples': 21707520, 'steps': 113059, 'loss/train': 2.0369722843170166} 11/07/2021 13:07:59 - INFO - __main__ - Step 113061: {'lr': 7.299830204450495e-05, 'samples': 21707712, 'steps': 113060, 'loss/train': 1.7369539737701416} 11/07/2021 13:08:00 - INFO - __main__ - Step 113062: {'lr': 7.299455443168162e-05, 'samples': 21707904, 'steps': 113061, 'loss/train': 1.2272897958755493} 11/07/2021 13:08:00 - INFO - __main__ - Step 113063: {'lr': 7.299080689861415e-05, 'samples': 21708096, 'steps': 113062, 'loss/train': 1.027104377746582} 11/07/2021 13:08:00 - INFO - __main__ - Step 113064: {'lr': 7.298705944530425e-05, 'samples': 21708288, 'steps': 113063, 'loss/train': 1.4109816551208496} 11/07/2021 13:08:01 - INFO - __main__ - Step 113065: {'lr': 7.298331207175371e-05, 'samples': 21708480, 'steps': 113064, 'loss/train': 1.6644296646118164} 11/07/2021 13:08:02 - INFO - __main__ - Step 113066: {'lr': 7.297956477796414e-05, 'samples': 21708672, 'steps': 113065, 'loss/train': 1.433578610420227} 11/07/2021 13:08:02 - INFO - __main__ - Step 113067: {'lr': 7.297581756393723e-05, 'samples': 21708864, 'steps': 113066, 'loss/train': 1.7941982746124268} 11/07/2021 13:08:02 - INFO - __main__ - Step 113068: {'lr': 7.297207042967468e-05, 'samples': 21709056, 'steps': 113067, 'loss/train': 1.4276906251907349} 11/07/2021 13:08:03 - INFO - __main__ - Step 113069: {'lr': 7.29683233751782e-05, 'samples': 21709248, 'steps': 113068, 'loss/train': 0.8522816896438599} 11/07/2021 13:08:03 - INFO - __main__ - Step 113070: {'lr': 7.296457640044945e-05, 'samples': 21709440, 'steps': 113069, 'loss/train': 1.2660263776779175} 11/07/2021 13:08:04 - INFO - __main__ - Step 113071: {'lr': 7.296082950549015e-05, 'samples': 21709632, 'steps': 113070, 'loss/train': 1.4339839220046997} 11/07/2021 13:08:05 - INFO - __main__ - Step 113072: {'lr': 7.295708269030194e-05, 'samples': 21709824, 'steps': 113071, 'loss/train': 0.9341626763343811} 11/07/2021 13:08:05 - INFO - __main__ - Step 113073: {'lr': 7.295333595488657e-05, 'samples': 21710016, 'steps': 113072, 'loss/train': 1.3323558568954468} 11/07/2021 13:08:05 - INFO - __main__ - Step 113074: {'lr': 7.294958929924567e-05, 'samples': 21710208, 'steps': 113073, 'loss/train': 0.7209179401397705} 11/07/2021 13:08:06 - INFO - __main__ - Step 113075: {'lr': 7.294584272338103e-05, 'samples': 21710400, 'steps': 113074, 'loss/train': 1.5147937536239624} 11/07/2021 13:08:07 - INFO - __main__ - Step 113076: {'lr': 7.294209622729419e-05, 'samples': 21710592, 'steps': 113075, 'loss/train': 1.1608575582504272} 11/07/2021 13:08:07 - INFO - __main__ - Step 113077: {'lr': 7.293834981098692e-05, 'samples': 21710784, 'steps': 113076, 'loss/train': 1.5451158285140991} 11/07/2021 13:08:07 - INFO - __main__ - Step 113078: {'lr': 7.293460347446088e-05, 'samples': 21710976, 'steps': 113077, 'loss/train': 1.39764404296875} 11/07/2021 13:08:08 - INFO - __main__ - Step 113079: {'lr': 7.293085721771778e-05, 'samples': 21711168, 'steps': 113078, 'loss/train': 0.9588995575904846} 11/07/2021 13:08:08 - INFO - __main__ - Step 113080: {'lr': 7.29271110407593e-05, 'samples': 21711360, 'steps': 113079, 'loss/train': 1.5160846710205078} 11/07/2021 13:08:09 - INFO - __main__ - Step 113081: {'lr': 7.292336494358714e-05, 'samples': 21711552, 'steps': 113080, 'loss/train': 1.8008288145065308} 11/07/2021 13:08:09 - INFO - __main__ - Step 113082: {'lr': 7.2919618926203e-05, 'samples': 21711744, 'steps': 113081, 'loss/train': 1.4842015504837036} 11/07/2021 13:08:10 - INFO - __main__ - Step 113083: {'lr': 7.291587298860853e-05, 'samples': 21711936, 'steps': 113082, 'loss/train': 1.9127579927444458} 11/07/2021 13:08:10 - INFO - __main__ - Step 113084: {'lr': 7.291212713080542e-05, 'samples': 21712128, 'steps': 113083, 'loss/train': 1.4543359279632568} 11/07/2021 13:08:11 - INFO - __main__ - Step 113085: {'lr': 7.290838135279537e-05, 'samples': 21712320, 'steps': 113084, 'loss/train': 1.3080083131790161} 11/07/2021 13:08:11 - INFO - __main__ - Step 113086: {'lr': 7.290463565458008e-05, 'samples': 21712512, 'steps': 113085, 'loss/train': 1.1198515892028809} 11/07/2021 13:08:12 - INFO - __main__ - Step 113087: {'lr': 7.290089003616124e-05, 'samples': 21712704, 'steps': 113086, 'loss/train': 2.1270313262939453} 11/07/2021 13:08:12 - INFO - __main__ - Step 113088: {'lr': 7.289714449754051e-05, 'samples': 21712896, 'steps': 113087, 'loss/train': 1.3601465225219727} 11/07/2021 13:08:13 - INFO - __main__ - Step 113089: {'lr': 7.289339903871969e-05, 'samples': 21713088, 'steps': 113088, 'loss/train': 1.2852421998977661} 11/07/2021 13:08:13 - INFO - __main__ - Step 113090: {'lr': 7.28896536597003e-05, 'samples': 21713280, 'steps': 113089, 'loss/train': 1.1085444688796997} 11/07/2021 13:08:13 - INFO - __main__ - Step 113091: {'lr': 7.288590836048406e-05, 'samples': 21713472, 'steps': 113090, 'loss/train': 1.2103462219238281} 11/07/2021 13:08:14 - INFO - __main__ - Step 113092: {'lr': 7.288216314107271e-05, 'samples': 21713664, 'steps': 113091, 'loss/train': 1.3606783151626587} 11/07/2021 13:08:15 - INFO - __main__ - Step 113093: {'lr': 7.287841800146794e-05, 'samples': 21713856, 'steps': 113092, 'loss/train': 1.249220609664917} 11/07/2021 13:08:15 - INFO - __main__ - Step 113094: {'lr': 7.287467294167142e-05, 'samples': 21714048, 'steps': 113093, 'loss/train': 1.3074989318847656} 11/07/2021 13:08:15 - INFO - __main__ - Step 113095: {'lr': 7.287092796168484e-05, 'samples': 21714240, 'steps': 113094, 'loss/train': 0.9867348074913025} 11/07/2021 13:08:16 - INFO - __main__ - Step 113096: {'lr': 7.286718306150989e-05, 'samples': 21714432, 'steps': 113095, 'loss/train': 1.1288058757781982} 11/07/2021 13:08:17 - INFO - __main__ - Step 113097: {'lr': 7.286343824114821e-05, 'samples': 21714624, 'steps': 113096, 'loss/train': 1.4700638055801392} 11/07/2021 13:08:17 - INFO - __main__ - Step 113098: {'lr': 7.285969350060159e-05, 'samples': 21714816, 'steps': 113097, 'loss/train': 1.3235517740249634} 11/07/2021 13:08:18 - INFO - __main__ - Step 113099: {'lr': 7.285594883987162e-05, 'samples': 21715008, 'steps': 113098, 'loss/train': 1.4808837175369263} 11/07/2021 13:08:18 - INFO - __main__ - Step 113100: {'lr': 7.285220425896005e-05, 'samples': 21715200, 'steps': 113099, 'loss/train': 1.4070448875427246} 11/07/2021 13:08:19 - INFO - __main__ - Step 113101: {'lr': 7.284845975786853e-05, 'samples': 21715392, 'steps': 113100, 'loss/train': 1.0031098127365112} 11/07/2021 13:08:19 - INFO - __main__ - Step 113102: {'lr': 7.284471533659884e-05, 'samples': 21715584, 'steps': 113101, 'loss/train': 2.258112907409668} 11/07/2021 13:08:20 - INFO - __main__ - Step 113103: {'lr': 7.284097099515253e-05, 'samples': 21715776, 'steps': 113102, 'loss/train': 0.9113914370536804} 11/07/2021 13:08:20 - INFO - __main__ - Step 113104: {'lr': 7.283722673353133e-05, 'samples': 21715968, 'steps': 113103, 'loss/train': 1.4700936079025269} 11/07/2021 13:08:21 - INFO - __main__ - Step 113105: {'lr': 7.283348255173692e-05, 'samples': 21716160, 'steps': 113104, 'loss/train': 1.3413199186325073} 11/07/2021 13:08:21 - INFO - __main__ - Step 113106: {'lr': 7.282973844977103e-05, 'samples': 21716352, 'steps': 113105, 'loss/train': 1.6833932399749756} 11/07/2021 13:08:21 - INFO - __main__ - Step 113107: {'lr': 7.282599442763532e-05, 'samples': 21716544, 'steps': 113106, 'loss/train': 1.213270902633667} 11/07/2021 13:08:22 - INFO - __main__ - Step 113108: {'lr': 7.282225048533148e-05, 'samples': 21716736, 'steps': 113107, 'loss/train': 1.4439811706542969} 11/07/2021 13:08:23 - INFO - __main__ - Step 113109: {'lr': 7.281850662286121e-05, 'samples': 21716928, 'steps': 113108, 'loss/train': 1.4705177545547485} 11/07/2021 13:08:23 - INFO - __main__ - Step 113110: {'lr': 7.281476284022618e-05, 'samples': 21717120, 'steps': 113109, 'loss/train': 1.2914519309997559} 11/07/2021 13:08:24 - INFO - __main__ - Step 113111: {'lr': 7.28110191374281e-05, 'samples': 21717312, 'steps': 113110, 'loss/train': 1.4140428304672241} 11/07/2021 13:08:24 - INFO - __main__ - Step 113112: {'lr': 7.280727551446862e-05, 'samples': 21717504, 'steps': 113111, 'loss/train': 1.3564093112945557} 11/07/2021 13:08:25 - INFO - __main__ - Step 113113: {'lr': 7.280353197134945e-05, 'samples': 21717696, 'steps': 113112, 'loss/train': 1.0310124158859253} 11/07/2021 13:08:25 - INFO - __main__ - Step 113114: {'lr': 7.279978850807237e-05, 'samples': 21717888, 'steps': 113113, 'loss/train': 1.3620327711105347} 11/07/2021 13:08:26 - INFO - __main__ - Step 113115: {'lr': 7.279604512463886e-05, 'samples': 21718080, 'steps': 113114, 'loss/train': 1.131003975868225} 11/07/2021 13:08:26 - INFO - __main__ - Step 113116: {'lr': 7.279230182105075e-05, 'samples': 21718272, 'steps': 113115, 'loss/train': 1.3256922960281372} 11/07/2021 13:08:26 - INFO - __main__ - Step 113117: {'lr': 7.278855859730968e-05, 'samples': 21718464, 'steps': 113116, 'loss/train': 1.3307698965072632} 11/07/2021 13:08:27 - INFO - __main__ - Step 113118: {'lr': 7.278481545341737e-05, 'samples': 21718656, 'steps': 113117, 'loss/train': 1.5271741151809692} 11/07/2021 13:08:28 - INFO - __main__ - Step 113119: {'lr': 7.278107238937545e-05, 'samples': 21718848, 'steps': 113118, 'loss/train': 1.5954688787460327} 11/07/2021 13:08:28 - INFO - __main__ - Step 113120: {'lr': 7.277732940518566e-05, 'samples': 21719040, 'steps': 113119, 'loss/train': 1.1821526288986206} 11/07/2021 13:08:28 - INFO - __main__ - Step 113121: {'lr': 7.277358650084967e-05, 'samples': 21719232, 'steps': 113120, 'loss/train': 1.2905813455581665} 11/07/2021 13:08:29 - INFO - __main__ - Step 113122: {'lr': 7.276984367636918e-05, 'samples': 21719424, 'steps': 113121, 'loss/train': 0.18237389624118805} 11/07/2021 13:08:30 - INFO - __main__ - Step 113123: {'lr': 7.276610093174585e-05, 'samples': 21719616, 'steps': 113122, 'loss/train': 1.410173773765564} 11/07/2021 13:08:30 - INFO - __main__ - Step 113124: {'lr': 7.276235826698138e-05, 'samples': 21719808, 'steps': 113123, 'loss/train': 0.7657790780067444} 11/07/2021 13:08:31 - INFO - __main__ - Step 113125: {'lr': 7.275861568207756e-05, 'samples': 21720000, 'steps': 113124, 'loss/train': 1.6314029693603516} 11/07/2021 13:08:31 - INFO - __main__ - Step 113126: {'lr': 7.275487317703586e-05, 'samples': 21720192, 'steps': 113125, 'loss/train': 1.242364525794983} 11/07/2021 13:08:31 - INFO - __main__ - Step 113127: {'lr': 7.275113075185811e-05, 'samples': 21720384, 'steps': 113126, 'loss/train': 1.4068622589111328} 11/07/2021 13:08:32 - INFO - __main__ - Step 113128: {'lr': 7.274738840654594e-05, 'samples': 21720576, 'steps': 113127, 'loss/train': 1.0895036458969116} 11/07/2021 13:08:33 - INFO - __main__ - Step 113129: {'lr': 7.274364614110108e-05, 'samples': 21720768, 'steps': 113128, 'loss/train': 1.1677837371826172} 11/07/2021 13:08:33 - INFO - __main__ - Step 113130: {'lr': 7.273990395552519e-05, 'samples': 21720960, 'steps': 113129, 'loss/train': 1.240020751953125} 11/07/2021 13:08:33 - INFO - __main__ - Step 113131: {'lr': 7.273616184981995e-05, 'samples': 21721152, 'steps': 113130, 'loss/train': 0.730035662651062} 11/07/2021 13:08:34 - INFO - __main__ - Step 113132: {'lr': 7.273241982398706e-05, 'samples': 21721344, 'steps': 113131, 'loss/train': 1.325060486793518} 11/07/2021 13:08:34 - INFO - __main__ - Step 113133: {'lr': 7.272867787802823e-05, 'samples': 21721536, 'steps': 113132, 'loss/train': 1.1585930585861206} 11/07/2021 13:08:35 - INFO - __main__ - Step 113134: {'lr': 7.272493601194513e-05, 'samples': 21721728, 'steps': 113133, 'loss/train': 1.1953293085098267} 11/07/2021 13:08:36 - INFO - __main__ - Step 113135: {'lr': 7.272119422573941e-05, 'samples': 21721920, 'steps': 113134, 'loss/train': 1.3309534788131714} 11/07/2021 13:08:36 - INFO - __main__ - Step 113136: {'lr': 7.271745251941287e-05, 'samples': 21722112, 'steps': 113135, 'loss/train': 1.4108526706695557} 11/07/2021 13:08:36 - INFO - __main__ - Step 113137: {'lr': 7.271371089296702e-05, 'samples': 21722304, 'steps': 113136, 'loss/train': 1.5113017559051514} 11/07/2021 13:08:37 - INFO - __main__ - Step 113138: {'lr': 7.270996934640367e-05, 'samples': 21722496, 'steps': 113137, 'loss/train': 1.2714171409606934} 11/07/2021 13:08:38 - INFO - __main__ - Step 113139: {'lr': 7.270622787972444e-05, 'samples': 21722688, 'steps': 113138, 'loss/train': 1.2830454111099243} 11/07/2021 13:08:38 - INFO - __main__ - Step 113140: {'lr': 7.270248649293107e-05, 'samples': 21722880, 'steps': 113139, 'loss/train': 0.6704883575439453} 11/07/2021 13:08:38 - INFO - __main__ - Step 113141: {'lr': 7.26987451860252e-05, 'samples': 21723072, 'steps': 113140, 'loss/train': 1.531638503074646} 11/07/2021 13:08:39 - INFO - __main__ - Step 113142: {'lr': 7.269500395900857e-05, 'samples': 21723264, 'steps': 113141, 'loss/train': 0.9649618864059448} 11/07/2021 13:08:39 - INFO - __main__ - Step 113143: {'lr': 7.269126281188282e-05, 'samples': 21723456, 'steps': 113142, 'loss/train': 1.057966947555542} 11/07/2021 13:08:40 - INFO - __main__ - Step 113144: {'lr': 7.268752174464966e-05, 'samples': 21723648, 'steps': 113143, 'loss/train': 1.406427264213562} 11/07/2021 13:08:40 - INFO - __main__ - Step 113145: {'lr': 7.268378075731074e-05, 'samples': 21723840, 'steps': 113144, 'loss/train': 1.384684443473816} 11/07/2021 13:08:41 - INFO - __main__ - Step 113146: {'lr': 7.268003984986779e-05, 'samples': 21724032, 'steps': 113145, 'loss/train': 0.968220055103302} 11/07/2021 13:08:41 - INFO - __main__ - Step 113147: {'lr': 7.267629902232256e-05, 'samples': 21724224, 'steps': 113146, 'loss/train': 1.4248331785202026} 11/07/2021 13:08:42 - INFO - __main__ - Step 113148: {'lr': 7.267255827467655e-05, 'samples': 21724416, 'steps': 113147, 'loss/train': 1.4606642723083496} 11/07/2021 13:08:43 - INFO - __main__ - Step 113149: {'lr': 7.266881760693158e-05, 'samples': 21724608, 'steps': 113148, 'loss/train': 1.3866147994995117} 11/07/2021 13:08:43 - INFO - __main__ - Step 113150: {'lr': 7.266507701908928e-05, 'samples': 21724800, 'steps': 113149, 'loss/train': 0.4953925907611847} 11/07/2021 13:08:43 - INFO - __main__ - Step 113151: {'lr': 7.26613365111514e-05, 'samples': 21724992, 'steps': 113150, 'loss/train': 1.4878671169281006} 11/07/2021 13:08:44 - INFO - __main__ - Step 113152: {'lr': 7.265759608311956e-05, 'samples': 21725184, 'steps': 113151, 'loss/train': 1.6413989067077637} 11/07/2021 13:08:44 - INFO - __main__ - Step 113153: {'lr': 7.265385573499545e-05, 'samples': 21725376, 'steps': 113152, 'loss/train': 1.0162391662597656} 11/07/2021 13:08:45 - INFO - __main__ - Step 113154: {'lr': 7.26501154667808e-05, 'samples': 21725568, 'steps': 113153, 'loss/train': 1.289401650428772} 11/07/2021 13:08:45 - INFO - __main__ - Step 113155: {'lr': 7.264637527847726e-05, 'samples': 21725760, 'steps': 113154, 'loss/train': 1.1743181943893433} 11/07/2021 13:08:46 - INFO - __main__ - Step 113156: {'lr': 7.264263517008654e-05, 'samples': 21725952, 'steps': 113155, 'loss/train': 1.3813748359680176} 11/07/2021 13:08:46 - INFO - __main__ - Step 113157: {'lr': 7.26388951416103e-05, 'samples': 21726144, 'steps': 113156, 'loss/train': 1.223056435585022} 11/07/2021 13:08:46 - INFO - __main__ - Step 113158: {'lr': 7.263515519305033e-05, 'samples': 21726336, 'steps': 113157, 'loss/train': 1.3266164064407349} 11/07/2021 13:08:47 - INFO - __main__ - Step 113159: {'lr': 7.263141532440811e-05, 'samples': 21726528, 'steps': 113158, 'loss/train': 1.0366671085357666} 11/07/2021 13:08:48 - INFO - __main__ - Step 113160: {'lr': 7.262767553568548e-05, 'samples': 21726720, 'steps': 113159, 'loss/train': 1.4764702320098877} 11/07/2021 13:08:48 - INFO - __main__ - Step 113161: {'lr': 7.262393582688407e-05, 'samples': 21726912, 'steps': 113160, 'loss/train': 0.8992858529090881} 11/07/2021 13:08:48 - INFO - __main__ - Step 113162: {'lr': 7.262019619800556e-05, 'samples': 21727104, 'steps': 113161, 'loss/train': 1.5072972774505615} 11/07/2021 13:08:49 - INFO - __main__ - Step 113163: {'lr': 7.261645664905167e-05, 'samples': 21727296, 'steps': 113162, 'loss/train': 0.6501049399375916} 11/07/2021 13:08:50 - INFO - __main__ - Step 113164: {'lr': 7.261271718002404e-05, 'samples': 21727488, 'steps': 113163, 'loss/train': 1.718833088874817} 11/07/2021 13:08:50 - INFO - __main__ - Step 113165: {'lr': 7.260897779092443e-05, 'samples': 21727680, 'steps': 113164, 'loss/train': 0.9476105570793152} 11/07/2021 13:08:51 - INFO - __main__ - Step 113166: {'lr': 7.260523848175443e-05, 'samples': 21727872, 'steps': 113165, 'loss/train': 1.5761698484420776} 11/07/2021 13:08:51 - INFO - __main__ - Step 113167: {'lr': 7.26014992525158e-05, 'samples': 21728064, 'steps': 113166, 'loss/train': 1.63089120388031} 11/07/2021 13:08:51 - INFO - __main__ - Step 113168: {'lr': 7.25977601032102e-05, 'samples': 21728256, 'steps': 113167, 'loss/train': 1.5950157642364502} 11/07/2021 13:08:52 - INFO - __main__ - Step 113169: {'lr': 7.25940210338393e-05, 'samples': 21728448, 'steps': 113168, 'loss/train': 1.6652666330337524} 11/07/2021 13:08:53 - INFO - __main__ - Step 113170: {'lr': 7.259028204440487e-05, 'samples': 21728640, 'steps': 113169, 'loss/train': 1.2318798303604126} 11/07/2021 13:08:53 - INFO - __main__ - Step 113171: {'lr': 7.258654313490845e-05, 'samples': 21728832, 'steps': 113170, 'loss/train': 1.7274514436721802} 11/07/2021 13:08:53 - INFO - __main__ - Step 113172: {'lr': 7.25828043053518e-05, 'samples': 21729024, 'steps': 113171, 'loss/train': 1.3673875331878662} 11/07/2021 13:08:54 - INFO - __main__ - Step 113173: {'lr': 7.257906555573659e-05, 'samples': 21729216, 'steps': 113172, 'loss/train': 1.3811856508255005} 11/07/2021 13:08:54 - INFO - __main__ - Step 113174: {'lr': 7.257532688606452e-05, 'samples': 21729408, 'steps': 113173, 'loss/train': 1.0385037660598755} 11/07/2021 13:08:55 - INFO - __main__ - Step 113175: {'lr': 7.257158829633728e-05, 'samples': 21729600, 'steps': 113174, 'loss/train': 1.3631073236465454} 11/07/2021 13:08:56 - INFO - __main__ - Step 113176: {'lr': 7.256784978655654e-05, 'samples': 21729792, 'steps': 113175, 'loss/train': 1.288598656654358} 11/07/2021 13:08:56 - INFO - __main__ - Step 113177: {'lr': 7.256411135672398e-05, 'samples': 21729984, 'steps': 113176, 'loss/train': 1.8288277387619019} 11/07/2021 13:08:56 - INFO - __main__ - Step 113178: {'lr': 7.256037300684129e-05, 'samples': 21730176, 'steps': 113177, 'loss/train': 1.3218995332717896} 11/07/2021 13:08:57 - INFO - __main__ - Step 113179: {'lr': 7.255663473691016e-05, 'samples': 21730368, 'steps': 113178, 'loss/train': 1.4409352540969849} 11/07/2021 13:08:58 - INFO - __main__ - Step 113180: {'lr': 7.255289654693228e-05, 'samples': 21730560, 'steps': 113179, 'loss/train': 1.3191338777542114} 11/07/2021 13:08:58 - INFO - __main__ - Step 113181: {'lr': 7.254915843690932e-05, 'samples': 21730752, 'steps': 113180, 'loss/train': 1.2864124774932861} 11/07/2021 13:08:58 - INFO - __main__ - Step 113182: {'lr': 7.254542040684298e-05, 'samples': 21730944, 'steps': 113181, 'loss/train': 1.5736591815948486} 11/07/2021 13:08:59 - INFO - __main__ - Step 113183: {'lr': 7.254168245673501e-05, 'samples': 21731136, 'steps': 113182, 'loss/train': 0.9236506223678589} 11/07/2021 13:08:59 - INFO - __main__ - Step 113184: {'lr': 7.253794458658694e-05, 'samples': 21731328, 'steps': 113183, 'loss/train': 1.3595584630966187} 11/07/2021 13:09:00 - INFO - __main__ - Step 113185: {'lr': 7.253420679640055e-05, 'samples': 21731520, 'steps': 113184, 'loss/train': 1.338339924812317} 11/07/2021 13:09:00 - INFO - __main__ - Step 113186: {'lr': 7.253046908617747e-05, 'samples': 21731712, 'steps': 113185, 'loss/train': 1.4131364822387695} 11/07/2021 13:09:01 - INFO - __main__ - Step 113187: {'lr': 7.252673145591945e-05, 'samples': 21731904, 'steps': 113186, 'loss/train': 1.1706401109695435} 11/07/2021 13:09:01 - INFO - __main__ - Step 113188: {'lr': 7.252299390562814e-05, 'samples': 21732096, 'steps': 113187, 'loss/train': 1.0367465019226074} 11/07/2021 13:09:02 - INFO - __main__ - Step 113189: {'lr': 7.251925643530524e-05, 'samples': 21732288, 'steps': 113188, 'loss/train': 1.43130362033844} 11/07/2021 13:09:03 - INFO - __main__ - Step 113190: {'lr': 7.251551904495241e-05, 'samples': 21732480, 'steps': 113189, 'loss/train': 1.2004963159561157} 11/07/2021 13:09:03 - INFO - __main__ - Step 113191: {'lr': 7.251178173457135e-05, 'samples': 21732672, 'steps': 113190, 'loss/train': 1.328484058380127} 11/07/2021 13:09:03 - INFO - __main__ - Step 113192: {'lr': 7.250804450416376e-05, 'samples': 21732864, 'steps': 113191, 'loss/train': 1.549126148223877} 11/07/2021 13:09:04 - INFO - __main__ - Step 113193: {'lr': 7.250430735373129e-05, 'samples': 21733056, 'steps': 113192, 'loss/train': 1.048905849456787} 11/07/2021 13:09:04 - INFO - __main__ - Step 113194: {'lr': 7.250057028327564e-05, 'samples': 21733248, 'steps': 113193, 'loss/train': 1.545036792755127} 11/07/2021 13:09:05 - INFO - __main__ - Step 113195: {'lr': 7.24968332927985e-05, 'samples': 21733440, 'steps': 113194, 'loss/train': 1.1020406484603882} 11/07/2021 13:09:05 - INFO - __main__ - Step 113196: {'lr': 7.249309638230162e-05, 'samples': 21733632, 'steps': 113195, 'loss/train': 1.1649162769317627} 11/07/2021 13:09:06 - INFO - __main__ - Step 113197: {'lr': 7.248935955178654e-05, 'samples': 21733824, 'steps': 113196, 'loss/train': 1.3178731203079224} 11/07/2021 13:09:06 - INFO - __main__ - Step 113198: {'lr': 7.248562280125501e-05, 'samples': 21734016, 'steps': 113197, 'loss/train': 0.985451877117157} 11/07/2021 13:09:06 - INFO - __main__ - Step 113199: {'lr': 7.248188613070871e-05, 'samples': 21734208, 'steps': 113198, 'loss/train': 0.8909398317337036} 11/07/2021 13:09:07 - INFO - __main__ - Step 113200: {'lr': 7.247814954014934e-05, 'samples': 21734400, 'steps': 113199, 'loss/train': 1.402791142463684} 11/07/2021 13:09:08 - INFO - __main__ - Step 113201: {'lr': 7.247441302957858e-05, 'samples': 21734592, 'steps': 113200, 'loss/train': 1.4118406772613525} 11/07/2021 13:09:08 - INFO - __main__ - Step 113202: {'lr': 7.247067659899812e-05, 'samples': 21734784, 'steps': 113201, 'loss/train': 1.5192025899887085} 11/07/2021 13:09:09 - INFO - __main__ - Step 113203: {'lr': 7.24669402484096e-05, 'samples': 21734976, 'steps': 113202, 'loss/train': 1.4619964361190796} 11/07/2021 13:09:09 - INFO - __main__ - Step 113204: {'lr': 7.246320397781478e-05, 'samples': 21735168, 'steps': 113203, 'loss/train': 1.2377525568008423} 11/07/2021 13:09:09 - INFO - __main__ - Step 113205: {'lr': 7.245946778721526e-05, 'samples': 21735360, 'steps': 113204, 'loss/train': 1.1977750062942505} 11/07/2021 13:09:10 - INFO - __main__ - Step 113206: {'lr': 7.245573167661282e-05, 'samples': 21735552, 'steps': 113205, 'loss/train': 0.8562372326850891} 11/07/2021 13:09:11 - INFO - __main__ - Step 113207: {'lr': 7.245199564600904e-05, 'samples': 21735744, 'steps': 113206, 'loss/train': 1.458470106124878} 11/07/2021 13:09:11 - INFO - __main__ - Step 113208: {'lr': 7.244825969540566e-05, 'samples': 21735936, 'steps': 113207, 'loss/train': 1.9036232233047485} 11/07/2021 13:09:11 - INFO - __main__ - Step 113209: {'lr': 7.244452382480435e-05, 'samples': 21736128, 'steps': 113208, 'loss/train': 1.3101918697357178} 11/07/2021 13:09:12 - INFO - __main__ - Step 113210: {'lr': 7.244078803420689e-05, 'samples': 21736320, 'steps': 113209, 'loss/train': 0.7623542547225952} 11/07/2021 13:09:13 - INFO - __main__ - Step 113211: {'lr': 7.24370523236148e-05, 'samples': 21736512, 'steps': 113210, 'loss/train': 1.0044667720794678} 11/07/2021 13:09:13 - INFO - __main__ - Step 113212: {'lr': 7.243331669302982e-05, 'samples': 21736704, 'steps': 113211, 'loss/train': 1.0964834690093994} 11/07/2021 13:09:14 - INFO - __main__ - Step 113213: {'lr': 7.242958114245365e-05, 'samples': 21736896, 'steps': 113212, 'loss/train': 0.685562252998352} 11/07/2021 13:09:14 - INFO - __main__ - Step 113214: {'lr': 7.242584567188796e-05, 'samples': 21737088, 'steps': 113213, 'loss/train': 1.4096251726150513} 11/07/2021 13:09:14 - INFO - __main__ - Step 113215: {'lr': 7.242211028133447e-05, 'samples': 21737280, 'steps': 113214, 'loss/train': 1.346423864364624} 11/07/2021 13:09:15 - INFO - __main__ - Step 113216: {'lr': 7.241837497079481e-05, 'samples': 21737472, 'steps': 113215, 'loss/train': 1.1753278970718384} 11/07/2021 13:09:16 - INFO - __main__ - Step 113217: {'lr': 7.241463974027072e-05, 'samples': 21737664, 'steps': 113216, 'loss/train': 1.4026446342468262} 11/07/2021 13:09:16 - INFO - __main__ - Step 113218: {'lr': 7.241090458976382e-05, 'samples': 21737856, 'steps': 113217, 'loss/train': 1.1105942726135254} 11/07/2021 13:09:16 - INFO - __main__ - Step 113219: {'lr': 7.240716951927584e-05, 'samples': 21738048, 'steps': 113218, 'loss/train': 1.0380284786224365} 11/07/2021 13:09:17 - INFO - __main__ - Step 113220: {'lr': 7.240343452880843e-05, 'samples': 21738240, 'steps': 113219, 'loss/train': 1.9267064332962036} 11/07/2021 13:09:17 - INFO - __main__ - Step 113221: {'lr': 7.239969961836332e-05, 'samples': 21738432, 'steps': 113220, 'loss/train': 1.5319364070892334} 11/07/2021 13:09:18 - INFO - __main__ - Step 113222: {'lr': 7.239596478794216e-05, 'samples': 21738624, 'steps': 113221, 'loss/train': 1.4949023723602295} 11/07/2021 13:09:18 - INFO - __main__ - Step 113223: {'lr': 7.239223003754672e-05, 'samples': 21738816, 'steps': 113222, 'loss/train': 1.285146713256836} 11/07/2021 13:09:19 - INFO - __main__ - Step 113224: {'lr': 7.238849536717851e-05, 'samples': 21739008, 'steps': 113223, 'loss/train': 0.8023540377616882} 11/07/2021 13:09:19 - INFO - __main__ - Step 113225: {'lr': 7.23847607768393e-05, 'samples': 21739200, 'steps': 113224, 'loss/train': 1.496015191078186} 11/07/2021 13:09:20 - INFO - __main__ - Step 113226: {'lr': 7.238102626653079e-05, 'samples': 21739392, 'steps': 113225, 'loss/train': 1.4048516750335693} 11/07/2021 13:09:21 - INFO - __main__ - Step 113227: {'lr': 7.237729183625463e-05, 'samples': 21739584, 'steps': 113226, 'loss/train': 1.2958579063415527} 11/07/2021 13:09:21 - INFO - __main__ - Step 113228: {'lr': 7.237355748601255e-05, 'samples': 21739776, 'steps': 113227, 'loss/train': 0.9573037624359131} 11/07/2021 13:09:21 - INFO - __main__ - Step 113229: {'lr': 7.236982321580618e-05, 'samples': 21739968, 'steps': 113228, 'loss/train': 1.2675679922103882} 11/07/2021 13:09:22 - INFO - __main__ - Step 113230: {'lr': 7.236608902563724e-05, 'samples': 21740160, 'steps': 113229, 'loss/train': 1.7948429584503174} 11/07/2021 13:09:22 - INFO - __main__ - Step 113231: {'lr': 7.236235491550738e-05, 'samples': 21740352, 'steps': 113230, 'loss/train': 1.1876945495605469} 11/07/2021 13:09:23 - INFO - __main__ - Step 113232: {'lr': 7.23586208854183e-05, 'samples': 21740544, 'steps': 113231, 'loss/train': 1.2470171451568604} 11/07/2021 13:09:23 - INFO - __main__ - Step 113233: {'lr': 7.235488693537171e-05, 'samples': 21740736, 'steps': 113232, 'loss/train': 0.3975304663181305} 11/07/2021 13:09:24 - INFO - __main__ - Step 113234: {'lr': 7.235115306536927e-05, 'samples': 21740928, 'steps': 113233, 'loss/train': 1.3782535791397095} 11/07/2021 13:09:24 - INFO - __main__ - Step 113235: {'lr': 7.234741927541264e-05, 'samples': 21741120, 'steps': 113234, 'loss/train': 1.0885363817214966} 11/07/2021 13:09:24 - INFO - __main__ - Step 113236: {'lr': 7.234368556550353e-05, 'samples': 21741312, 'steps': 113235, 'loss/train': 1.4460922479629517} 11/07/2021 13:09:25 - INFO - __main__ - Step 113237: {'lr': 7.233995193564369e-05, 'samples': 21741504, 'steps': 113236, 'loss/train': 1.4761534929275513} 11/07/2021 13:09:26 - INFO - __main__ - Step 113238: {'lr': 7.233621838583462e-05, 'samples': 21741696, 'steps': 113237, 'loss/train': 1.2621572017669678} 11/07/2021 13:09:26 - INFO - __main__ - Step 113239: {'lr': 7.233248491607816e-05, 'samples': 21741888, 'steps': 113238, 'loss/train': 0.9016534090042114} 11/07/2021 13:09:26 - INFO - __main__ - Step 113240: {'lr': 7.232875152637591e-05, 'samples': 21742080, 'steps': 113239, 'loss/train': 0.6885189414024353} 11/07/2021 13:09:27 - INFO - __main__ - Step 113241: {'lr': 7.232501821672957e-05, 'samples': 21742272, 'steps': 113240, 'loss/train': 1.422356128692627} 11/07/2021 13:09:28 - INFO - __main__ - Step 113242: {'lr': 7.232128498714086e-05, 'samples': 21742464, 'steps': 113241, 'loss/train': 0.9051923751831055} 11/07/2021 13:09:28 - INFO - __main__ - Step 113243: {'lr': 7.231755183761143e-05, 'samples': 21742656, 'steps': 113242, 'loss/train': 1.3854622840881348} 11/07/2021 13:09:29 - INFO - __main__ - Step 113244: {'lr': 7.231381876814296e-05, 'samples': 21742848, 'steps': 113243, 'loss/train': 1.1743298768997192} 11/07/2021 13:09:29 - INFO - __main__ - Step 113245: {'lr': 7.231008577873719e-05, 'samples': 21743040, 'steps': 113244, 'loss/train': 0.7676616311073303} 11/07/2021 13:09:29 - INFO - __main__ - Step 113246: {'lr': 7.230635286939569e-05, 'samples': 21743232, 'steps': 113245, 'loss/train': 1.6908817291259766} 11/07/2021 13:09:30 - INFO - __main__ - Step 113247: {'lr': 7.230262004012023e-05, 'samples': 21743424, 'steps': 113246, 'loss/train': 1.4355859756469727} 11/07/2021 13:09:31 - INFO - __main__ - Step 113248: {'lr': 7.229888729091247e-05, 'samples': 21743616, 'steps': 113247, 'loss/train': 0.9367222189903259} 11/07/2021 13:09:31 - INFO - __main__ - Step 113249: {'lr': 7.229515462177408e-05, 'samples': 21743808, 'steps': 113248, 'loss/train': 1.4296684265136719} 11/07/2021 13:09:31 - INFO - __main__ - Step 113250: {'lr': 7.229142203270687e-05, 'samples': 21744000, 'steps': 113249, 'loss/train': 0.5570635795593262} 11/07/2021 13:09:32 - INFO - __main__ - Step 113251: {'lr': 7.228768952371226e-05, 'samples': 21744192, 'steps': 113250, 'loss/train': 1.3791354894638062} 11/07/2021 13:09:33 - INFO - __main__ - Step 113252: {'lr': 7.22839570947921e-05, 'samples': 21744384, 'steps': 113251, 'loss/train': 0.7298678755760193} 11/07/2021 13:09:33 - INFO - __main__ - Step 113253: {'lr': 7.228022474594805e-05, 'samples': 21744576, 'steps': 113252, 'loss/train': 1.9882323741912842} 11/07/2021 13:09:33 - INFO - __main__ - Step 113254: {'lr': 7.227649247718182e-05, 'samples': 21744768, 'steps': 113253, 'loss/train': 0.8374021649360657} 11/07/2021 13:09:34 - INFO - __main__ - Step 113255: {'lr': 7.227276028849503e-05, 'samples': 21744960, 'steps': 113254, 'loss/train': 1.5999765396118164} 11/07/2021 13:09:34 - INFO - __main__ - Step 113256: {'lr': 7.22690281798894e-05, 'samples': 21745152, 'steps': 113255, 'loss/train': 0.7008528709411621} 11/07/2021 13:09:35 - INFO - __main__ - Step 113257: {'lr': 7.226529615136657e-05, 'samples': 21745344, 'steps': 113256, 'loss/train': 1.0382851362228394} 11/07/2021 13:09:36 - INFO - __main__ - Step 113258: {'lr': 7.226156420292829e-05, 'samples': 21745536, 'steps': 113257, 'loss/train': 1.1874679327011108} 11/07/2021 13:09:36 - INFO - __main__ - Step 113259: {'lr': 7.225783233457619e-05, 'samples': 21745728, 'steps': 113258, 'loss/train': 1.630632758140564} 11/07/2021 13:09:36 - INFO - __main__ - Step 113260: {'lr': 7.225410054631199e-05, 'samples': 21745920, 'steps': 113259, 'loss/train': 0.34677550196647644} 11/07/2021 13:09:37 - INFO - __main__ - Step 113261: {'lr': 7.225036883813733e-05, 'samples': 21746112, 'steps': 113260, 'loss/train': 1.7995792627334595} 11/07/2021 13:09:38 - INFO - __main__ - Step 113262: {'lr': 7.224663721005393e-05, 'samples': 21746304, 'steps': 113261, 'loss/train': 1.4381237030029297} 11/07/2021 13:09:38 - INFO - __main__ - Step 113263: {'lr': 7.224290566206343e-05, 'samples': 21746496, 'steps': 113262, 'loss/train': 1.8385999202728271} 11/07/2021 13:09:39 - INFO - __main__ - Step 113264: {'lr': 7.223917419416762e-05, 'samples': 21746688, 'steps': 113263, 'loss/train': 1.364318609237671} 11/07/2021 13:09:39 - INFO - __main__ - Step 113265: {'lr': 7.223544280636802e-05, 'samples': 21746880, 'steps': 113264, 'loss/train': 0.832789421081543} 11/07/2021 13:09:39 - INFO - __main__ - Step 113266: {'lr': 7.223171149866636e-05, 'samples': 21747072, 'steps': 113265, 'loss/train': 1.3132165670394897} 11/07/2021 13:09:40 - INFO - __main__ - Step 113267: {'lr': 7.222798027106439e-05, 'samples': 21747264, 'steps': 113266, 'loss/train': 1.2769145965576172} 11/07/2021 13:09:41 - INFO - __main__ - Step 113268: {'lr': 7.222424912356373e-05, 'samples': 21747456, 'steps': 113267, 'loss/train': 1.3719732761383057} 11/07/2021 13:09:41 - INFO - __main__ - Step 113269: {'lr': 7.222051805616609e-05, 'samples': 21747648, 'steps': 113268, 'loss/train': 1.031886100769043} 11/07/2021 13:09:41 - INFO - __main__ - Step 113270: {'lr': 7.22167870688731e-05, 'samples': 21747840, 'steps': 113269, 'loss/train': 0.6328811049461365} 11/07/2021 13:09:42 - INFO - __main__ - Step 113271: {'lr': 7.221305616168653e-05, 'samples': 21748032, 'steps': 113270, 'loss/train': 1.2218718528747559} 11/07/2021 13:09:42 - INFO - __main__ - Step 113272: {'lr': 7.2209325334608e-05, 'samples': 21748224, 'steps': 113271, 'loss/train': 1.4228469133377075} 11/07/2021 13:09:43 - INFO - __main__ - Step 113273: {'lr': 7.22055945876392e-05, 'samples': 21748416, 'steps': 113272, 'loss/train': 1.2945533990859985} 11/07/2021 13:09:43 - INFO - __main__ - Step 113274: {'lr': 7.220186392078182e-05, 'samples': 21748608, 'steps': 113273, 'loss/train': 0.9063930511474609} 11/07/2021 13:09:44 - INFO - __main__ - Step 113275: {'lr': 7.219813333403755e-05, 'samples': 21748800, 'steps': 113274, 'loss/train': 1.271399974822998} 11/07/2021 13:09:44 - INFO - __main__ - Step 113276: {'lr': 7.219440282740802e-05, 'samples': 21748992, 'steps': 113275, 'loss/train': 1.0740282535552979} 11/07/2021 13:09:44 - INFO - __main__ - Step 113277: {'lr': 7.219067240089505e-05, 'samples': 21749184, 'steps': 113276, 'loss/train': 1.1334148645401} 11/07/2021 13:09:46 - INFO - __main__ - Step 113278: {'lr': 7.218694205450013e-05, 'samples': 21749376, 'steps': 113277, 'loss/train': 1.3723796606063843} 11/07/2021 13:09:46 - INFO - __main__ - Step 113279: {'lr': 7.218321178822507e-05, 'samples': 21749568, 'steps': 113278, 'loss/train': 1.2368555068969727} 11/07/2021 13:09:46 - INFO - __main__ - Step 113280: {'lr': 7.217948160207147e-05, 'samples': 21749760, 'steps': 113279, 'loss/train': 0.8239074945449829} 11/07/2021 13:09:47 - INFO - __main__ - Step 113281: {'lr': 7.217575149604105e-05, 'samples': 21749952, 'steps': 113280, 'loss/train': 1.4348247051239014} 11/07/2021 13:09:47 - INFO - __main__ - Step 113282: {'lr': 7.217202147013552e-05, 'samples': 21750144, 'steps': 113281, 'loss/train': 1.2769863605499268} 11/07/2021 13:09:48 - INFO - __main__ - Step 113283: {'lr': 7.21682915243565e-05, 'samples': 21750336, 'steps': 113282, 'loss/train': 1.1503633260726929} 11/07/2021 13:09:48 - INFO - __main__ - Step 113284: {'lr': 7.216456165870572e-05, 'samples': 21750528, 'steps': 113283, 'loss/train': 1.7205989360809326} 11/07/2021 13:09:49 - INFO - __main__ - Step 113285: {'lr': 7.216083187318487e-05, 'samples': 21750720, 'steps': 113284, 'loss/train': 1.3416146039962769} 11/07/2021 13:09:49 - INFO - __main__ - Step 113286: {'lr': 7.215710216779555e-05, 'samples': 21750912, 'steps': 113285, 'loss/train': 1.112105369567871} 11/07/2021 13:09:49 - INFO - __main__ - Step 113287: {'lr': 7.215337254253954e-05, 'samples': 21751104, 'steps': 113286, 'loss/train': 1.3512040376663208} 11/07/2021 13:09:50 - INFO - __main__ - Step 113288: {'lr': 7.214964299741847e-05, 'samples': 21751296, 'steps': 113287, 'loss/train': 1.4860105514526367} 11/07/2021 13:09:51 - INFO - __main__ - Step 113289: {'lr': 7.214591353243402e-05, 'samples': 21751488, 'steps': 113288, 'loss/train': 1.0151742696762085} 11/07/2021 13:09:51 - INFO - __main__ - Step 113290: {'lr': 7.214218414758786e-05, 'samples': 21751680, 'steps': 113289, 'loss/train': 0.9518873691558838} 11/07/2021 13:09:51 - INFO - __main__ - Step 113291: {'lr': 7.213845484288179e-05, 'samples': 21751872, 'steps': 113290, 'loss/train': 1.3880332708358765} 11/07/2021 13:09:52 - INFO - __main__ - Step 113292: {'lr': 7.21347256183173e-05, 'samples': 21752064, 'steps': 113291, 'loss/train': 0.9977116584777832} 11/07/2021 13:09:53 - INFO - __main__ - Step 113293: {'lr': 7.213099647389614e-05, 'samples': 21752256, 'steps': 113292, 'loss/train': 1.0162934064865112} 11/07/2021 13:09:53 - INFO - __main__ - Step 113294: {'lr': 7.212726740962002e-05, 'samples': 21752448, 'steps': 113293, 'loss/train': 1.1544078588485718} 11/07/2021 13:09:54 - INFO - __main__ - Step 113295: {'lr': 7.212353842549064e-05, 'samples': 21752640, 'steps': 113294, 'loss/train': 1.2484921216964722} 11/07/2021 13:09:54 - INFO - __main__ - Step 113296: {'lr': 7.211980952150962e-05, 'samples': 21752832, 'steps': 113295, 'loss/train': 1.0846471786499023} 11/07/2021 13:09:54 - INFO - __main__ - Step 113297: {'lr': 7.211608069767867e-05, 'samples': 21753024, 'steps': 113296, 'loss/train': 1.1502536535263062} 11/07/2021 13:09:55 - INFO - __main__ - Step 113298: {'lr': 7.211235195399948e-05, 'samples': 21753216, 'steps': 113297, 'loss/train': 1.1086056232452393} 11/07/2021 13:09:56 - INFO - __main__ - Step 113299: {'lr': 7.210862329047371e-05, 'samples': 21753408, 'steps': 113298, 'loss/train': 1.2269871234893799} 11/07/2021 13:09:56 - INFO - __main__ - Step 113300: {'lr': 7.210489470710304e-05, 'samples': 21753600, 'steps': 113299, 'loss/train': 1.0832401514053345} 11/07/2021 13:09:56 - INFO - __main__ - Step 113301: {'lr': 7.210116620388917e-05, 'samples': 21753792, 'steps': 113300, 'loss/train': 1.2162667512893677} 11/07/2021 13:09:57 - INFO - __main__ - Step 113302: {'lr': 7.209743778083377e-05, 'samples': 21753984, 'steps': 113301, 'loss/train': 0.9803428053855896} 11/07/2021 13:09:57 - INFO - __main__ - Step 113303: {'lr': 7.209370943793853e-05, 'samples': 21754176, 'steps': 113302, 'loss/train': 1.2865145206451416} 11/07/2021 13:09:58 - INFO - __main__ - Step 113304: {'lr': 7.208998117520518e-05, 'samples': 21754368, 'steps': 113303, 'loss/train': 1.6687556505203247} 11/07/2021 13:09:59 - INFO - __main__ - Step 113305: {'lr': 7.208625299263526e-05, 'samples': 21754560, 'steps': 113304, 'loss/train': 1.647821307182312} 11/07/2021 13:09:59 - INFO - __main__ - Step 113306: {'lr': 7.208252489023054e-05, 'samples': 21754752, 'steps': 113305, 'loss/train': 1.808219075202942} 11/07/2021 13:09:59 - INFO - __main__ - Step 113307: {'lr': 7.207879686799268e-05, 'samples': 21754944, 'steps': 113306, 'loss/train': 1.2460720539093018} 11/07/2021 13:10:00 - INFO - __main__ - Step 113308: {'lr': 7.207506892592339e-05, 'samples': 21755136, 'steps': 113307, 'loss/train': 1.4099167585372925} 11/07/2021 13:10:00 - INFO - __main__ - Step 113309: {'lr': 7.20713410640243e-05, 'samples': 21755328, 'steps': 113308, 'loss/train': 5.698655128479004} 11/07/2021 13:10:01 - INFO - __main__ - Step 113310: {'lr': 7.206761328229714e-05, 'samples': 21755520, 'steps': 113309, 'loss/train': 1.3664941787719727} 11/07/2021 13:10:01 - INFO - __main__ - Step 113311: {'lr': 7.206388558074356e-05, 'samples': 21755712, 'steps': 113310, 'loss/train': 1.0784127712249756} 11/07/2021 13:10:02 - INFO - __main__ - Step 113312: {'lr': 7.206015795936524e-05, 'samples': 21755904, 'steps': 113311, 'loss/train': 1.8052246570587158} 11/07/2021 13:10:02 - INFO - __main__ - Step 113313: {'lr': 7.205643041816387e-05, 'samples': 21756096, 'steps': 113312, 'loss/train': 1.3956338167190552} 11/07/2021 13:10:03 - INFO - __main__ - Step 113314: {'lr': 7.205270295714111e-05, 'samples': 21756288, 'steps': 113313, 'loss/train': 1.3785414695739746} 11/07/2021 13:10:04 - INFO - __main__ - Step 113315: {'lr': 7.204897557629869e-05, 'samples': 21756480, 'steps': 113314, 'loss/train': 1.4246258735656738} 11/07/2021 13:10:04 - INFO - __main__ - Step 113316: {'lr': 7.204524827563824e-05, 'samples': 21756672, 'steps': 113315, 'loss/train': 1.651593565940857} 11/07/2021 13:10:04 - INFO - __main__ - Step 113317: {'lr': 7.204152105516154e-05, 'samples': 21756864, 'steps': 113316, 'loss/train': 5.694239616394043} 11/07/2021 13:10:05 - INFO - __main__ - Step 113318: {'lr': 7.203779391487009e-05, 'samples': 21757056, 'steps': 113317, 'loss/train': 1.2370119094848633} 11/07/2021 13:10:05 - INFO - __main__ - Step 113319: {'lr': 7.203406685476568e-05, 'samples': 21757248, 'steps': 113318, 'loss/train': 1.6050856113433838} 11/07/2021 13:10:05 - INFO - __main__ - Step 113320: {'lr': 7.203033987484997e-05, 'samples': 21757440, 'steps': 113319, 'loss/train': 1.2064235210418701} 11/07/2021 13:10:06 - INFO - __main__ - Step 113321: {'lr': 7.202661297512464e-05, 'samples': 21757632, 'steps': 113320, 'loss/train': 1.333357572555542} 11/07/2021 13:10:07 - INFO - __main__ - Step 113322: {'lr': 7.202288615559138e-05, 'samples': 21757824, 'steps': 113321, 'loss/train': 1.1317862272262573} 11/07/2021 13:10:07 - INFO - __main__ - Step 113323: {'lr': 7.201915941625184e-05, 'samples': 21758016, 'steps': 113322, 'loss/train': 0.8126118183135986} 11/07/2021 13:10:07 - INFO - __main__ - Step 113324: {'lr': 7.201543275710773e-05, 'samples': 21758208, 'steps': 113323, 'loss/train': 1.3817442655563354} 11/07/2021 13:10:08 - INFO - __main__ - Step 113325: {'lr': 7.201170617816072e-05, 'samples': 21758400, 'steps': 113324, 'loss/train': 1.5957435369491577} 11/07/2021 13:10:09 - INFO - __main__ - Step 113326: {'lr': 7.20079796794125e-05, 'samples': 21758592, 'steps': 113325, 'loss/train': 1.3789457082748413} 11/07/2021 13:10:09 - INFO - __main__ - Step 113327: {'lr': 7.200425326086474e-05, 'samples': 21758784, 'steps': 113326, 'loss/train': 1.3476667404174805} 11/07/2021 13:10:10 - INFO - __main__ - Step 113328: {'lr': 7.20005269225191e-05, 'samples': 21758976, 'steps': 113327, 'loss/train': 1.2012470960617065} 11/07/2021 13:10:10 - INFO - __main__ - Step 113329: {'lr': 7.199680066437728e-05, 'samples': 21759168, 'steps': 113328, 'loss/train': 1.1474535465240479} 11/07/2021 13:10:10 - INFO - __main__ - Step 113330: {'lr': 7.199307448644097e-05, 'samples': 21759360, 'steps': 113329, 'loss/train': 0.23088566958904266} 11/07/2021 13:10:11 - INFO - __main__ - Step 113331: {'lr': 7.198934838871187e-05, 'samples': 21759552, 'steps': 113330, 'loss/train': 1.139679193496704} 11/07/2021 13:10:12 - INFO - __main__ - Step 113332: {'lr': 7.198562237119158e-05, 'samples': 21759744, 'steps': 113331, 'loss/train': 1.440394401550293} 11/07/2021 13:10:12 - INFO - __main__ - Step 113333: {'lr': 7.198189643388184e-05, 'samples': 21759936, 'steps': 113332, 'loss/train': 1.491108775138855} 11/07/2021 13:10:12 - INFO - __main__ - Step 113334: {'lr': 7.197817057678427e-05, 'samples': 21760128, 'steps': 113333, 'loss/train': 1.0671041011810303} 11/07/2021 13:10:13 - INFO - __main__ - Step 113335: {'lr': 7.197444479990062e-05, 'samples': 21760320, 'steps': 113334, 'loss/train': 1.5980654954910278} 11/07/2021 13:10:14 - INFO - __main__ - Step 113336: {'lr': 7.197071910323252e-05, 'samples': 21760512, 'steps': 113335, 'loss/train': 0.8731480836868286} 11/07/2021 13:10:14 - INFO - __main__ - Step 113337: {'lr': 7.196699348678165e-05, 'samples': 21760704, 'steps': 113336, 'loss/train': 1.1520205736160278} 11/07/2021 13:10:15 - INFO - __main__ - Step 113338: {'lr': 7.196326795054974e-05, 'samples': 21760896, 'steps': 113337, 'loss/train': 1.344746708869934} 11/07/2021 13:10:15 - INFO - __main__ - Step 113339: {'lr': 7.195954249453842e-05, 'samples': 21761088, 'steps': 113338, 'loss/train': 1.4105923175811768} 11/07/2021 13:10:15 - INFO - __main__ - Step 113340: {'lr': 7.195581711874938e-05, 'samples': 21761280, 'steps': 113339, 'loss/train': 1.5679229497909546} 11/07/2021 13:10:17 - INFO - __main__ - Step 113341: {'lr': 7.19520918231843e-05, 'samples': 21761472, 'steps': 113340, 'loss/train': 1.2832417488098145} 11/07/2021 13:10:17 - INFO - __main__ - Step 113342: {'lr': 7.194836660784487e-05, 'samples': 21761664, 'steps': 113341, 'loss/train': 1.4732091426849365} 11/07/2021 13:10:18 - INFO - __main__ - Step 113343: {'lr': 7.194464147273275e-05, 'samples': 21761856, 'steps': 113342, 'loss/train': 1.3410704135894775} 11/07/2021 13:10:18 - INFO - __main__ - Step 113344: {'lr': 7.194091641784969e-05, 'samples': 21762048, 'steps': 113343, 'loss/train': 1.759730577468872} 11/07/2021 13:10:18 - INFO - __main__ - Step 113345: {'lr': 7.193719144319727e-05, 'samples': 21762240, 'steps': 113344, 'loss/train': 1.7565206289291382} 11/07/2021 13:10:19 - INFO - __main__ - Step 113346: {'lr': 7.193346654877717e-05, 'samples': 21762432, 'steps': 113345, 'loss/train': 1.3054425716400146} 11/07/2021 13:10:19 - INFO - __main__ - Step 113347: {'lr': 7.19297417345911e-05, 'samples': 21762624, 'steps': 113346, 'loss/train': 1.1844408512115479} 11/07/2021 13:10:20 - INFO - __main__ - Step 113348: {'lr': 7.192601700064078e-05, 'samples': 21762816, 'steps': 113347, 'loss/train': 1.4256138801574707} 11/07/2021 13:10:21 - INFO - __main__ - Step 113349: {'lr': 7.192229234692779e-05, 'samples': 21763008, 'steps': 113348, 'loss/train': 1.4270435571670532} 11/07/2021 13:10:21 - INFO - __main__ - Step 113350: {'lr': 7.19185677734539e-05, 'samples': 21763200, 'steps': 113349, 'loss/train': 1.0384602546691895} 11/07/2021 13:10:21 - INFO - __main__ - Step 113351: {'lr': 7.191484328022077e-05, 'samples': 21763392, 'steps': 113350, 'loss/train': 1.5050925016403198} 11/07/2021 13:10:22 - INFO - __main__ - Step 113352: {'lr': 7.191111886723003e-05, 'samples': 21763584, 'steps': 113351, 'loss/train': 1.2740418910980225} 11/07/2021 13:10:22 - INFO - __main__ - Step 113353: {'lr': 7.190739453448341e-05, 'samples': 21763776, 'steps': 113352, 'loss/train': 0.7247132062911987} 11/07/2021 13:10:23 - INFO - __main__ - Step 113354: {'lr': 7.190367028198258e-05, 'samples': 21763968, 'steps': 113353, 'loss/train': 1.3463900089263916} 11/07/2021 13:10:23 - INFO - __main__ - Step 113355: {'lr': 7.189994610972919e-05, 'samples': 21764160, 'steps': 113354, 'loss/train': 1.4101221561431885} 11/07/2021 13:10:24 - INFO - __main__ - Step 113356: {'lr': 7.189622201772494e-05, 'samples': 21764352, 'steps': 113355, 'loss/train': 1.2186434268951416} 11/07/2021 13:10:24 - INFO - __main__ - Step 113357: {'lr': 7.18924980059715e-05, 'samples': 21764544, 'steps': 113356, 'loss/train': 1.584224820137024} 11/07/2021 13:10:24 - INFO - __main__ - Step 113358: {'lr': 7.188877407447065e-05, 'samples': 21764736, 'steps': 113357, 'loss/train': 1.3883707523345947} 11/07/2021 13:10:25 - INFO - __main__ - Step 113359: {'lr': 7.188505022322386e-05, 'samples': 21764928, 'steps': 113358, 'loss/train': 1.2634400129318237} 11/07/2021 13:10:26 - INFO - __main__ - Step 113360: {'lr': 7.188132645223295e-05, 'samples': 21765120, 'steps': 113359, 'loss/train': 1.2769066095352173} 11/07/2021 13:10:26 - INFO - __main__ - Step 113361: {'lr': 7.187760276149954e-05, 'samples': 21765312, 'steps': 113360, 'loss/train': 1.6014275550842285} 11/07/2021 13:10:26 - INFO - __main__ - Step 113362: {'lr': 7.187387915102536e-05, 'samples': 21765504, 'steps': 113361, 'loss/train': 1.9170836210250854} 11/07/2021 13:10:27 - INFO - __main__ - Step 113363: {'lr': 7.187015562081203e-05, 'samples': 21765696, 'steps': 113362, 'loss/train': 1.6430972814559937} 11/07/2021 13:10:28 - INFO - __main__ - Step 113364: {'lr': 7.186643217086128e-05, 'samples': 21765888, 'steps': 113363, 'loss/train': 1.6409122943878174} 11/07/2021 13:10:28 - INFO - __main__ - Step 113365: {'lr': 7.186270880117476e-05, 'samples': 21766080, 'steps': 113364, 'loss/train': 1.0804773569107056} 11/07/2021 13:10:29 - INFO - __main__ - Step 113366: {'lr': 7.185898551175418e-05, 'samples': 21766272, 'steps': 113365, 'loss/train': 1.1403357982635498} 11/07/2021 13:10:29 - INFO - __main__ - Step 113367: {'lr': 7.185526230260117e-05, 'samples': 21766464, 'steps': 113366, 'loss/train': 1.6789206266403198} 11/07/2021 13:10:29 - INFO - __main__ - Step 113368: {'lr': 7.185153917371742e-05, 'samples': 21766656, 'steps': 113367, 'loss/train': 1.1281230449676514} 11/07/2021 13:10:30 - INFO - __main__ - Step 113369: {'lr': 7.184781612510464e-05, 'samples': 21766848, 'steps': 113368, 'loss/train': 1.1769461631774902} 11/07/2021 13:10:31 - INFO - __main__ - Step 113370: {'lr': 7.184409315676446e-05, 'samples': 21767040, 'steps': 113369, 'loss/train': 0.8704937100410461} 11/07/2021 13:10:31 - INFO - __main__ - Step 113371: {'lr': 7.184037026869867e-05, 'samples': 21767232, 'steps': 113370, 'loss/train': 1.4400289058685303} 11/07/2021 13:10:31 - INFO - __main__ - Step 113372: {'lr': 7.183664746090879e-05, 'samples': 21767424, 'steps': 113371, 'loss/train': 0.8085153102874756} 11/07/2021 13:10:32 - INFO - __main__ - Step 113373: {'lr': 7.183292473339656e-05, 'samples': 21767616, 'steps': 113372, 'loss/train': 0.7476247549057007} 11/07/2021 13:10:33 - INFO - __main__ - Step 113374: {'lr': 7.182920208616367e-05, 'samples': 21767808, 'steps': 113373, 'loss/train': 1.2249163389205933} 11/07/2021 13:10:33 - INFO - __main__ - Step 113375: {'lr': 7.182547951921178e-05, 'samples': 21768000, 'steps': 113374, 'loss/train': 1.569082498550415} 11/07/2021 13:10:33 - INFO - __main__ - Step 113376: {'lr': 7.18217570325426e-05, 'samples': 21768192, 'steps': 113375, 'loss/train': 1.3697453737258911} 11/07/2021 13:10:34 - INFO - __main__ - Step 113377: {'lr': 7.181803462615777e-05, 'samples': 21768384, 'steps': 113376, 'loss/train': 2.990234613418579} 11/07/2021 13:10:34 - INFO - __main__ - Step 113378: {'lr': 7.1814312300059e-05, 'samples': 21768576, 'steps': 113377, 'loss/train': 0.9613967537879944} 11/07/2021 13:10:35 - INFO - __main__ - Step 113379: {'lr': 7.181059005424793e-05, 'samples': 21768768, 'steps': 113378, 'loss/train': 1.5225067138671875} 11/07/2021 13:10:36 - INFO - __main__ - Step 113380: {'lr': 7.180686788872628e-05, 'samples': 21768960, 'steps': 113379, 'loss/train': 1.1024353504180908} 11/07/2021 13:10:36 - INFO - __main__ - Step 113381: {'lr': 7.180314580349568e-05, 'samples': 21769152, 'steps': 113380, 'loss/train': 1.38719642162323} 11/07/2021 13:10:36 - INFO - __main__ - Step 113382: {'lr': 7.179942379855786e-05, 'samples': 21769344, 'steps': 113381, 'loss/train': 0.9932367205619812} 11/07/2021 13:10:37 - INFO - __main__ - Step 113383: {'lr': 7.179570187391454e-05, 'samples': 21769536, 'steps': 113382, 'loss/train': 0.7482710480690002} 11/07/2021 13:10:37 - INFO - __main__ - Step 113384: {'lr': 7.179198002956724e-05, 'samples': 21769728, 'steps': 113383, 'loss/train': 1.2129404544830322} 11/07/2021 13:10:38 - INFO - __main__ - Step 113385: {'lr': 7.178825826551772e-05, 'samples': 21769920, 'steps': 113384, 'loss/train': 1.3733798265457153} 11/07/2021 13:10:38 - INFO - __main__ - Step 113386: {'lr': 7.178453658176767e-05, 'samples': 21770112, 'steps': 113385, 'loss/train': 1.1316741704940796} 11/07/2021 13:10:39 - INFO - __main__ - Step 113387: {'lr': 7.178081497831877e-05, 'samples': 21770304, 'steps': 113386, 'loss/train': 1.7185609340667725} 11/07/2021 13:10:39 - INFO - __main__ - Step 113388: {'lr': 7.177709345517266e-05, 'samples': 21770496, 'steps': 113387, 'loss/train': 1.4591573476791382} 11/07/2021 13:10:40 - INFO - __main__ - Step 113389: {'lr': 7.177337201233106e-05, 'samples': 21770688, 'steps': 113388, 'loss/train': 1.0743606090545654} 11/07/2021 13:10:41 - INFO - __main__ - Step 113390: {'lr': 7.176965064979562e-05, 'samples': 21770880, 'steps': 113389, 'loss/train': 1.8877265453338623} 11/07/2021 13:10:41 - INFO - __main__ - Step 113391: {'lr': 7.176592936756801e-05, 'samples': 21771072, 'steps': 113390, 'loss/train': 1.5673490762710571} 11/07/2021 13:10:41 - INFO - __main__ - Step 113392: {'lr': 7.176220816564995e-05, 'samples': 21771264, 'steps': 113391, 'loss/train': 1.4734251499176025} 11/07/2021 13:10:42 - INFO - __main__ - Step 113393: {'lr': 7.175848704404309e-05, 'samples': 21771456, 'steps': 113392, 'loss/train': 1.2357897758483887} 11/07/2021 13:10:42 - INFO - __main__ - Step 113394: {'lr': 7.175476600274916e-05, 'samples': 21771648, 'steps': 113393, 'loss/train': 1.0646332502365112} 11/07/2021 13:10:43 - INFO - __main__ - Step 113395: {'lr': 7.175104504176971e-05, 'samples': 21771840, 'steps': 113394, 'loss/train': 1.3079262971878052} 11/07/2021 13:10:43 - INFO - __main__ - Step 113396: {'lr': 7.174732416110649e-05, 'samples': 21772032, 'steps': 113395, 'loss/train': 1.2816027402877808} 11/07/2021 13:10:44 - INFO - __main__ - Step 113397: {'lr': 7.17436033607612e-05, 'samples': 21772224, 'steps': 113396, 'loss/train': 1.6958558559417725} 11/07/2021 13:10:44 - INFO - __main__ - Step 113398: {'lr': 7.173988264073544e-05, 'samples': 21772416, 'steps': 113397, 'loss/train': 1.9031352996826172} 11/07/2021 13:10:44 - INFO - __main__ - Step 113399: {'lr': 7.173616200103098e-05, 'samples': 21772608, 'steps': 113398, 'loss/train': 1.4354298114776611} 11/07/2021 13:10:45 - INFO - __main__ - Step 113400: {'lr': 7.173244144164945e-05, 'samples': 21772800, 'steps': 113399, 'loss/train': 1.408145546913147} 11/07/2021 13:10:46 - INFO - __main__ - Step 113401: {'lr': 7.172872096259253e-05, 'samples': 21772992, 'steps': 113400, 'loss/train': 1.1243200302124023} 11/07/2021 13:10:46 - INFO - __main__ - Step 113402: {'lr': 7.172500056386189e-05, 'samples': 21773184, 'steps': 113401, 'loss/train': 3.331940174102783} 11/07/2021 13:10:46 - INFO - __main__ - Step 113403: {'lr': 7.17212802454592e-05, 'samples': 21773376, 'steps': 113402, 'loss/train': 1.3912484645843506} 11/07/2021 13:10:47 - INFO - __main__ - Step 113404: {'lr': 7.171756000738616e-05, 'samples': 21773568, 'steps': 113403, 'loss/train': 1.3031271696090698} 11/07/2021 13:10:48 - INFO - __main__ - Step 113405: {'lr': 7.171383984964452e-05, 'samples': 21773760, 'steps': 113404, 'loss/train': 1.8784970045089722} 11/07/2021 13:10:48 - INFO - __main__ - Step 113406: {'lr': 7.171011977223579e-05, 'samples': 21773952, 'steps': 113405, 'loss/train': 1.9317266941070557} 11/07/2021 13:10:49 - INFO - __main__ - Step 113407: {'lr': 7.170639977516174e-05, 'samples': 21774144, 'steps': 113406, 'loss/train': 1.4371483325958252} 11/07/2021 13:10:49 - INFO - __main__ - Step 113408: {'lr': 7.170267985842405e-05, 'samples': 21774336, 'steps': 113407, 'loss/train': 1.250447154045105} 11/07/2021 13:10:49 - INFO - __main__ - Step 113409: {'lr': 7.169896002202434e-05, 'samples': 21774528, 'steps': 113408, 'loss/train': 1.480455994606018} 11/07/2021 13:10:50 - INFO - __main__ - Step 113410: {'lr': 7.169524026596436e-05, 'samples': 21774720, 'steps': 113409, 'loss/train': 1.4283596277236938} 11/07/2021 13:10:51 - INFO - __main__ - Step 113411: {'lr': 7.169152059024572e-05, 'samples': 21774912, 'steps': 113410, 'loss/train': 1.489270567893982} 11/07/2021 13:10:51 - INFO - __main__ - Step 113412: {'lr': 7.168780099487016e-05, 'samples': 21775104, 'steps': 113411, 'loss/train': 1.45760178565979} 11/07/2021 13:10:51 - INFO - __main__ - Step 113413: {'lr': 7.168408147983931e-05, 'samples': 21775296, 'steps': 113412, 'loss/train': 0.9192724823951721} 11/07/2021 13:10:52 - INFO - __main__ - Step 113414: {'lr': 7.168036204515487e-05, 'samples': 21775488, 'steps': 113413, 'loss/train': 0.9352954030036926} 11/07/2021 13:10:53 - INFO - __main__ - Step 113415: {'lr': 7.16766426908185e-05, 'samples': 21775680, 'steps': 113414, 'loss/train': 0.6168649196624756} 11/07/2021 13:10:53 - INFO - __main__ - Step 113416: {'lr': 7.167292341683196e-05, 'samples': 21775872, 'steps': 113415, 'loss/train': 1.5000689029693604} 11/07/2021 13:10:54 - INFO - __main__ - Step 113417: {'lr': 7.166920422319678e-05, 'samples': 21776064, 'steps': 113416, 'loss/train': 1.372514009475708} 11/07/2021 13:10:54 - INFO - __main__ - Step 113418: {'lr': 7.16654851099147e-05, 'samples': 21776256, 'steps': 113417, 'loss/train': 1.1568111181259155} 11/07/2021 13:10:54 - INFO - __main__ - Step 113419: {'lr': 7.16617660769874e-05, 'samples': 21776448, 'steps': 113418, 'loss/train': 1.5847079753875732} 11/07/2021 13:10:55 - INFO - __main__ - Step 113420: {'lr': 7.165804712441656e-05, 'samples': 21776640, 'steps': 113419, 'loss/train': 1.4138790369033813} 11/07/2021 13:10:56 - INFO - __main__ - Step 113421: {'lr': 7.165432825220384e-05, 'samples': 21776832, 'steps': 113420, 'loss/train': 1.4246245622634888} 11/07/2021 13:10:56 - INFO - __main__ - Step 113422: {'lr': 7.165060946035093e-05, 'samples': 21777024, 'steps': 113421, 'loss/train': 1.3581950664520264} 11/07/2021 13:10:56 - INFO - __main__ - Step 113423: {'lr': 7.164689074885952e-05, 'samples': 21777216, 'steps': 113422, 'loss/train': 0.9711194634437561} 11/07/2021 13:10:57 - INFO - __main__ - Step 113424: {'lr': 7.164317211773125e-05, 'samples': 21777408, 'steps': 113423, 'loss/train': 2.769901752471924} 11/07/2021 13:10:57 - INFO - __main__ - Step 113425: {'lr': 7.163945356696782e-05, 'samples': 21777600, 'steps': 113424, 'loss/train': 1.4236235618591309} 11/07/2021 13:10:58 - INFO - __main__ - Step 113426: {'lr': 7.163573509657098e-05, 'samples': 21777792, 'steps': 113425, 'loss/train': 1.2142890691757202} 11/07/2021 13:10:58 - INFO - __main__ - Step 113427: {'lr': 7.163201670654226e-05, 'samples': 21777984, 'steps': 113426, 'loss/train': 1.360680103302002} 11/07/2021 13:10:59 - INFO - __main__ - Step 113428: {'lr': 7.16282983968834e-05, 'samples': 21778176, 'steps': 113427, 'loss/train': 1.0866023302078247} 11/07/2021 13:10:59 - INFO - __main__ - Step 113429: {'lr': 7.162458016759604e-05, 'samples': 21778368, 'steps': 113428, 'loss/train': 0.21762888133525848} 11/07/2021 13:10:59 - INFO - __main__ - Step 113430: {'lr': 7.162086201868192e-05, 'samples': 21778560, 'steps': 113429, 'loss/train': 1.4939485788345337} 11/07/2021 13:11:01 - INFO - __main__ - Step 113431: {'lr': 7.161714395014272e-05, 'samples': 21778752, 'steps': 113430, 'loss/train': 0.9362300634384155} 11/07/2021 13:11:01 - INFO - __main__ - Step 113432: {'lr': 7.161342596198004e-05, 'samples': 21778944, 'steps': 113431, 'loss/train': 1.574115514755249} 11/07/2021 13:11:01 - INFO - __main__ - Step 113433: {'lr': 7.160970805419558e-05, 'samples': 21779136, 'steps': 113432, 'loss/train': 1.4549049139022827} 11/07/2021 13:11:02 - INFO - __main__ - Step 113434: {'lr': 7.160599022679107e-05, 'samples': 21779328, 'steps': 113433, 'loss/train': 1.3762117624282837} 11/07/2021 13:11:02 - INFO - __main__ - Step 113435: {'lr': 7.160227247976814e-05, 'samples': 21779520, 'steps': 113434, 'loss/train': 1.5057451725006104} 11/07/2021 13:11:03 - INFO - __main__ - Step 113436: {'lr': 7.159855481312846e-05, 'samples': 21779712, 'steps': 113435, 'loss/train': 0.6938132643699646} 11/07/2021 13:11:03 - INFO - __main__ - Step 113437: {'lr': 7.15948372268737e-05, 'samples': 21779904, 'steps': 113436, 'loss/train': 1.3396098613739014} 11/07/2021 13:11:04 - INFO - __main__ - Step 113438: {'lr': 7.159111972100568e-05, 'samples': 21780096, 'steps': 113437, 'loss/train': 1.8087801933288574} 11/07/2021 13:11:04 - INFO - __main__ - Step 113439: {'lr': 7.158740229552585e-05, 'samples': 21780288, 'steps': 113438, 'loss/train': 1.0796488523483276} 11/07/2021 13:11:05 - INFO - __main__ - Step 113440: {'lr': 7.158368495043599e-05, 'samples': 21780480, 'steps': 113439, 'loss/train': 1.5128257274627686} 11/07/2021 13:11:06 - INFO - __main__ - Step 113441: {'lr': 7.157996768573773e-05, 'samples': 21780672, 'steps': 113440, 'loss/train': 1.2307392358779907} 11/07/2021 13:11:06 - INFO - __main__ - Step 113442: {'lr': 7.157625050143282e-05, 'samples': 21780864, 'steps': 113441, 'loss/train': 1.5126163959503174} 11/07/2021 13:11:06 - INFO - __main__ - Step 113443: {'lr': 7.15725333975229e-05, 'samples': 21781056, 'steps': 113442, 'loss/train': 1.432816743850708} 11/07/2021 13:11:07 - INFO - __main__ - Step 113444: {'lr': 7.156881637400964e-05, 'samples': 21781248, 'steps': 113443, 'loss/train': 1.0073740482330322} 11/07/2021 13:11:07 - INFO - __main__ - Step 113445: {'lr': 7.156509943089471e-05, 'samples': 21781440, 'steps': 113444, 'loss/train': 1.5376167297363281} 11/07/2021 13:11:07 - INFO - __main__ - Step 113446: {'lr': 7.156138256817979e-05, 'samples': 21781632, 'steps': 113445, 'loss/train': 1.4778356552124023} 11/07/2021 13:11:08 - INFO - __main__ - Step 113447: {'lr': 7.155766578586656e-05, 'samples': 21781824, 'steps': 113446, 'loss/train': 1.7298648357391357} 11/07/2021 13:11:09 - INFO - __main__ - Step 113448: {'lr': 7.155394908395671e-05, 'samples': 21782016, 'steps': 113447, 'loss/train': 1.269055962562561} 11/07/2021 13:11:09 - INFO - __main__ - Step 113449: {'lr': 7.155023246245188e-05, 'samples': 21782208, 'steps': 113448, 'loss/train': 1.3981236219406128} 11/07/2021 13:11:09 - INFO - __main__ - Step 113450: {'lr': 7.154651592135374e-05, 'samples': 21782400, 'steps': 113449, 'loss/train': 1.5131454467773438} 11/07/2021 13:11:10 - INFO - __main__ - Step 113451: {'lr': 7.154279946066402e-05, 'samples': 21782592, 'steps': 113450, 'loss/train': 0.9303115606307983} 11/07/2021 13:11:11 - INFO - __main__ - Step 113452: {'lr': 7.153908308038446e-05, 'samples': 21782784, 'steps': 113451, 'loss/train': 0.8645359873771667} 11/07/2021 13:11:11 - INFO - __main__ - Step 113453: {'lr': 7.153536678051651e-05, 'samples': 21782976, 'steps': 113452, 'loss/train': 1.2679165601730347} 11/07/2021 13:11:11 - INFO - __main__ - Step 113454: {'lr': 7.153165056106197e-05, 'samples': 21783168, 'steps': 113453, 'loss/train': 1.2874224185943604} 11/07/2021 13:11:12 - INFO - __main__ - Step 113455: {'lr': 7.152793442202254e-05, 'samples': 21783360, 'steps': 113454, 'loss/train': 1.2310590744018555} 11/07/2021 13:11:12 - INFO - __main__ - Step 113456: {'lr': 7.152421836339987e-05, 'samples': 21783552, 'steps': 113455, 'loss/train': 1.5324852466583252} 11/07/2021 13:11:14 - INFO - __main__ - Step 113457: {'lr': 7.152050238519561e-05, 'samples': 21783744, 'steps': 113456, 'loss/train': 0.902463972568512} 11/07/2021 13:11:14 - INFO - __main__ - Step 113458: {'lr': 7.151678648741148e-05, 'samples': 21783936, 'steps': 113457, 'loss/train': 1.0641182661056519} 11/07/2021 13:11:14 - INFO - __main__ - Step 113459: {'lr': 7.151307067004911e-05, 'samples': 21784128, 'steps': 113458, 'loss/train': 1.0483167171478271} 11/07/2021 13:11:15 - INFO - __main__ - Step 113460: {'lr': 7.150935493311023e-05, 'samples': 21784320, 'steps': 113459, 'loss/train': 1.5994681119918823} 11/07/2021 13:11:15 - INFO - __main__ - Step 113461: {'lr': 7.150563927659645e-05, 'samples': 21784512, 'steps': 113460, 'loss/train': 1.4154837131500244} 11/07/2021 13:11:16 - INFO - __main__ - Step 113462: {'lr': 7.150192370050948e-05, 'samples': 21784704, 'steps': 113461, 'loss/train': 0.25775083899497986} 11/07/2021 13:11:16 - INFO - __main__ - Step 113463: {'lr': 7.149820820485098e-05, 'samples': 21784896, 'steps': 113462, 'loss/train': 0.6217975616455078} 11/07/2021 13:11:17 - INFO - __main__ - Step 113464: {'lr': 7.149449278962267e-05, 'samples': 21785088, 'steps': 113463, 'loss/train': 1.6069978475570679} 11/07/2021 13:11:17 - INFO - __main__ - Step 113465: {'lr': 7.14907774548262e-05, 'samples': 21785280, 'steps': 113464, 'loss/train': 1.3730055093765259} 11/07/2021 13:11:18 - INFO - __main__ - Step 113466: {'lr': 7.148706220046322e-05, 'samples': 21785472, 'steps': 113465, 'loss/train': 0.9740806818008423} 11/07/2021 13:11:18 - INFO - __main__ - Step 113467: {'lr': 7.148334702653539e-05, 'samples': 21785664, 'steps': 113466, 'loss/train': 1.2389650344848633} 11/07/2021 13:11:19 - INFO - __main__ - Step 113468: {'lr': 7.147963193304441e-05, 'samples': 21785856, 'steps': 113467, 'loss/train': 1.253801703453064} 11/07/2021 13:11:19 - INFO - __main__ - Step 113469: {'lr': 7.147591691999195e-05, 'samples': 21786048, 'steps': 113468, 'loss/train': 1.7389373779296875} 11/07/2021 13:11:20 - INFO - __main__ - Step 113470: {'lr': 7.147220198737966e-05, 'samples': 21786240, 'steps': 113469, 'loss/train': 1.0351312160491943} 11/07/2021 13:11:20 - INFO - __main__ - Step 113471: {'lr': 7.146848713520929e-05, 'samples': 21786432, 'steps': 113470, 'loss/train': 1.9765064716339111} 11/07/2021 13:11:20 - INFO - __main__ - Step 113472: {'lr': 7.146477236348245e-05, 'samples': 21786624, 'steps': 113471, 'loss/train': 1.5881426334381104} 11/07/2021 13:11:21 - INFO - __main__ - Step 113473: {'lr': 7.146105767220082e-05, 'samples': 21786816, 'steps': 113472, 'loss/train': 1.2901843786239624} 11/07/2021 13:11:22 - INFO - __main__ - Step 113474: {'lr': 7.145734306136609e-05, 'samples': 21787008, 'steps': 113473, 'loss/train': 0.9200223684310913} 11/07/2021 13:11:22 - INFO - __main__ - Step 113475: {'lr': 7.145362853097989e-05, 'samples': 21787200, 'steps': 113474, 'loss/train': 0.9723110198974609} 11/07/2021 13:11:22 - INFO - __main__ - Step 113476: {'lr': 7.144991408104398e-05, 'samples': 21787392, 'steps': 113475, 'loss/train': 0.7309736609458923} 11/07/2021 13:11:23 - INFO - __main__ - Step 113477: {'lr': 7.144619971155997e-05, 'samples': 21787584, 'steps': 113476, 'loss/train': 0.6855825781822205} 11/07/2021 13:11:24 - INFO - __main__ - Step 113478: {'lr': 7.144248542252954e-05, 'samples': 21787776, 'steps': 113477, 'loss/train': 1.2079310417175293} 11/07/2021 13:11:24 - INFO - __main__ - Step 113479: {'lr': 7.143877121395445e-05, 'samples': 21787968, 'steps': 113478, 'loss/train': 1.098430871963501} 11/07/2021 13:11:24 - INFO - __main__ - Step 113480: {'lr': 7.14350570858362e-05, 'samples': 21788160, 'steps': 113479, 'loss/train': 1.9103196859359741} 11/07/2021 13:11:25 - INFO - __main__ - Step 113481: {'lr': 7.143134303817659e-05, 'samples': 21788352, 'steps': 113480, 'loss/train': 1.377873182296753} 11/07/2021 13:11:25 - INFO - __main__ - Step 113482: {'lr': 7.142762907097721e-05, 'samples': 21788544, 'steps': 113481, 'loss/train': 1.484807014465332} 11/07/2021 13:11:26 - INFO - __main__ - Step 113483: {'lr': 7.142391518423986e-05, 'samples': 21788736, 'steps': 113482, 'loss/train': 1.694084882736206} 11/07/2021 13:11:27 - INFO - __main__ - Step 113484: {'lr': 7.142020137796606e-05, 'samples': 21788928, 'steps': 113483, 'loss/train': 1.368160367012024} 11/07/2021 13:11:27 - INFO - __main__ - Step 113485: {'lr': 7.141648765215761e-05, 'samples': 21789120, 'steps': 113484, 'loss/train': 1.4357409477233887} 11/07/2021 13:11:27 - INFO - __main__ - Step 113486: {'lr': 7.141277400681615e-05, 'samples': 21789312, 'steps': 113485, 'loss/train': 1.418857455253601} 11/07/2021 13:11:28 - INFO - __main__ - Step 113487: {'lr': 7.140906044194329e-05, 'samples': 21789504, 'steps': 113486, 'loss/train': 1.2169880867004395} 11/07/2021 13:11:29 - INFO - __main__ - Step 113488: {'lr': 7.140534695754078e-05, 'samples': 21789696, 'steps': 113487, 'loss/train': 1.0626591444015503} 11/07/2021 13:11:29 - INFO - __main__ - Step 113489: {'lr': 7.140163355361027e-05, 'samples': 21789888, 'steps': 113488, 'loss/train': 1.465347170829773} 11/07/2021 13:11:29 - INFO - __main__ - Step 113490: {'lr': 7.13979202301534e-05, 'samples': 21790080, 'steps': 113489, 'loss/train': 1.5717421770095825} 11/07/2021 13:11:30 - INFO - __main__ - Step 113491: {'lr': 7.139420698717188e-05, 'samples': 21790272, 'steps': 113490, 'loss/train': 1.3363163471221924} 11/07/2021 13:11:30 - INFO - __main__ - Step 113492: {'lr': 7.139049382466747e-05, 'samples': 21790464, 'steps': 113491, 'loss/train': 1.473175048828125} 11/07/2021 13:11:31 - INFO - __main__ - Step 113493: {'lr': 7.138678074264165e-05, 'samples': 21790656, 'steps': 113492, 'loss/train': 1.5691558122634888} 11/07/2021 13:11:32 - INFO - __main__ - Step 113494: {'lr': 7.138306774109621e-05, 'samples': 21790848, 'steps': 113493, 'loss/train': 1.2840766906738281} 11/07/2021 13:11:32 - INFO - __main__ - Step 113495: {'lr': 7.13793548200328e-05, 'samples': 21791040, 'steps': 113494, 'loss/train': 1.4763556718826294} 11/07/2021 13:11:32 - INFO - __main__ - Step 113496: {'lr': 7.137564197945309e-05, 'samples': 21791232, 'steps': 113495, 'loss/train': 1.450034737586975} 11/07/2021 13:11:33 - INFO - __main__ - Step 113497: {'lr': 7.137192921935876e-05, 'samples': 21791424, 'steps': 113496, 'loss/train': 1.29623281955719} 11/07/2021 13:11:33 - INFO - __main__ - Step 113498: {'lr': 7.136821653975147e-05, 'samples': 21791616, 'steps': 113497, 'loss/train': 1.3612935543060303} 11/07/2021 13:11:34 - INFO - __main__ - Step 113499: {'lr': 7.136450394063293e-05, 'samples': 21791808, 'steps': 113498, 'loss/train': 1.3232369422912598} 11/07/2021 13:11:34 - INFO - __main__ - Step 113500: {'lr': 7.136079142200478e-05, 'samples': 21792000, 'steps': 113499, 'loss/train': 1.3599092960357666} 11/07/2021 13:11:35 - INFO - __main__ - Step 113501: {'lr': 7.13570789838687e-05, 'samples': 21792192, 'steps': 113500, 'loss/train': 1.2706416845321655} 11/07/2021 13:11:35 - INFO - __main__ - Step 113502: {'lr': 7.135336662622635e-05, 'samples': 21792384, 'steps': 113501, 'loss/train': 1.6821284294128418} 11/07/2021 13:11:35 - INFO - __main__ - Step 113503: {'lr': 7.134965434907942e-05, 'samples': 21792576, 'steps': 113502, 'loss/train': 1.5237464904785156} 11/07/2021 13:11:36 - INFO - __main__ - Step 113504: {'lr': 7.134594215242959e-05, 'samples': 21792768, 'steps': 113503, 'loss/train': 1.438501238822937} 11/07/2021 13:11:37 - INFO - __main__ - Step 113505: {'lr': 7.134223003627851e-05, 'samples': 21792960, 'steps': 113504, 'loss/train': 1.3698172569274902} 11/07/2021 13:11:37 - INFO - __main__ - Step 113506: {'lr': 7.133851800062796e-05, 'samples': 21793152, 'steps': 113505, 'loss/train': 1.1368027925491333} 11/07/2021 13:11:37 - INFO - __main__ - Step 113507: {'lr': 7.133480604547943e-05, 'samples': 21793344, 'steps': 113506, 'loss/train': 1.7225738763809204} 11/07/2021 13:11:38 - INFO - __main__ - Step 113508: {'lr': 7.133109417083466e-05, 'samples': 21793536, 'steps': 113507, 'loss/train': 1.252265453338623} 11/07/2021 13:11:39 - INFO - __main__ - Step 113509: {'lr': 7.132738237669536e-05, 'samples': 21793728, 'steps': 113508, 'loss/train': 1.228623628616333} 11/07/2021 13:11:40 - INFO - __main__ - Step 113510: {'lr': 7.132367066306319e-05, 'samples': 21793920, 'steps': 113509, 'loss/train': 1.4856452941894531} 11/07/2021 13:11:40 - INFO - __main__ - Step 113511: {'lr': 7.131995902993981e-05, 'samples': 21794112, 'steps': 113510, 'loss/train': 0.2797142565250397} 11/07/2021 13:11:40 - INFO - __main__ - Step 113512: {'lr': 7.13162474773269e-05, 'samples': 21794304, 'steps': 113511, 'loss/train': 1.6111572980880737} 11/07/2021 13:11:41 - INFO - __main__ - Step 113513: {'lr': 7.131253600522614e-05, 'samples': 21794496, 'steps': 113512, 'loss/train': 0.24415339529514313} 11/07/2021 13:11:42 - INFO - __main__ - Step 113514: {'lr': 7.130882461363916e-05, 'samples': 21794688, 'steps': 113513, 'loss/train': 1.3983774185180664} 11/07/2021 13:11:42 - INFO - __main__ - Step 113515: {'lr': 7.13051133025677e-05, 'samples': 21794880, 'steps': 113514, 'loss/train': 1.2781882286071777} 11/07/2021 13:11:42 - INFO - __main__ - Step 113516: {'lr': 7.130140207201338e-05, 'samples': 21795072, 'steps': 113515, 'loss/train': 1.3321969509124756} 11/07/2021 13:11:43 - INFO - __main__ - Step 113517: {'lr': 7.129769092197791e-05, 'samples': 21795264, 'steps': 113516, 'loss/train': 0.9868422746658325} 11/07/2021 13:11:43 - INFO - __main__ - Step 113518: {'lr': 7.129397985246294e-05, 'samples': 21795456, 'steps': 113517, 'loss/train': 1.1083072423934937} 11/07/2021 13:11:44 - INFO - __main__ - Step 113519: {'lr': 7.12902688634702e-05, 'samples': 21795648, 'steps': 113518, 'loss/train': 1.6566238403320312} 11/07/2021 13:11:44 - INFO - __main__ - Step 113520: {'lr': 7.128655795500127e-05, 'samples': 21795840, 'steps': 113519, 'loss/train': 1.3249461650848389} 11/07/2021 13:11:45 - INFO - __main__ - Step 113521: {'lr': 7.128284712705782e-05, 'samples': 21796032, 'steps': 113520, 'loss/train': 1.0583815574645996} 11/07/2021 13:11:45 - INFO - __main__ - Step 113522: {'lr': 7.12791363796416e-05, 'samples': 21796224, 'steps': 113521, 'loss/train': 1.5055028200149536} 11/07/2021 13:11:45 - INFO - __main__ - Step 113523: {'lr': 7.127542571275419e-05, 'samples': 21796416, 'steps': 113522, 'loss/train': 1.35244882106781} 11/07/2021 13:11:47 - INFO - __main__ - Step 113524: {'lr': 7.127171512639735e-05, 'samples': 21796608, 'steps': 113523, 'loss/train': 1.3540138006210327} 11/07/2021 13:11:47 - INFO - __main__ - Step 113525: {'lr': 7.126800462057273e-05, 'samples': 21796800, 'steps': 113524, 'loss/train': 1.531159520149231} 11/07/2021 13:11:47 - INFO - __main__ - Step 113526: {'lr': 7.126429419528196e-05, 'samples': 21796992, 'steps': 113525, 'loss/train': 1.3466697931289673} 11/07/2021 13:11:48 - INFO - __main__ - Step 113527: {'lr': 7.126058385052676e-05, 'samples': 21797184, 'steps': 113526, 'loss/train': 1.629143476486206} 11/07/2021 13:11:48 - INFO - __main__ - Step 113528: {'lr': 7.125687358630878e-05, 'samples': 21797376, 'steps': 113527, 'loss/train': 1.081350326538086} 11/07/2021 13:11:48 - INFO - __main__ - Step 113529: {'lr': 7.125316340262969e-05, 'samples': 21797568, 'steps': 113528, 'loss/train': 1.0363633632659912} 11/07/2021 13:11:49 - INFO - __main__ - Step 113530: {'lr': 7.124945329949115e-05, 'samples': 21797760, 'steps': 113529, 'loss/train': 1.0836153030395508} 11/07/2021 13:11:50 - INFO - __main__ - Step 113531: {'lr': 7.124574327689487e-05, 'samples': 21797952, 'steps': 113530, 'loss/train': 1.4157780408859253} 11/07/2021 13:11:50 - INFO - __main__ - Step 113532: {'lr': 7.12420333348425e-05, 'samples': 21798144, 'steps': 113531, 'loss/train': 0.9082149863243103} 11/07/2021 13:11:50 - INFO - __main__ - Step 113533: {'lr': 7.123832347333578e-05, 'samples': 21798336, 'steps': 113532, 'loss/train': 1.4086518287658691} 11/07/2021 13:11:51 - INFO - __main__ - Step 113534: {'lr': 7.123461369237624e-05, 'samples': 21798528, 'steps': 113533, 'loss/train': 1.3917756080627441} 11/07/2021 13:11:52 - INFO - __main__ - Step 113535: {'lr': 7.12309039919656e-05, 'samples': 21798720, 'steps': 113534, 'loss/train': 1.2074710130691528} 11/07/2021 13:11:52 - INFO - __main__ - Step 113536: {'lr': 7.12271943721056e-05, 'samples': 21798912, 'steps': 113535, 'loss/train': 0.6500455141067505} 11/07/2021 13:11:53 - INFO - __main__ - Step 113537: {'lr': 7.122348483279783e-05, 'samples': 21799104, 'steps': 113536, 'loss/train': 1.5310863256454468} 11/07/2021 13:11:53 - INFO - __main__ - Step 113538: {'lr': 7.121977537404403e-05, 'samples': 21799296, 'steps': 113537, 'loss/train': 1.5365774631500244} 11/07/2021 13:11:53 - INFO - __main__ - Step 113539: {'lr': 7.121606599584582e-05, 'samples': 21799488, 'steps': 113538, 'loss/train': 1.2654114961624146} 11/07/2021 13:11:54 - INFO - __main__ - Step 113540: {'lr': 7.121235669820489e-05, 'samples': 21799680, 'steps': 113539, 'loss/train': 1.3910691738128662} 11/07/2021 13:11:55 - INFO - __main__ - Step 113541: {'lr': 7.120864748112293e-05, 'samples': 21799872, 'steps': 113540, 'loss/train': 1.1123335361480713} 11/07/2021 13:11:55 - INFO - __main__ - Step 113542: {'lr': 7.12049383446016e-05, 'samples': 21800064, 'steps': 113541, 'loss/train': 1.9581146240234375} 11/07/2021 13:11:55 - INFO - __main__ - Step 113543: {'lr': 7.120122928864253e-05, 'samples': 21800256, 'steps': 113542, 'loss/train': 1.2840540409088135} 11/07/2021 13:11:56 - INFO - __main__ - Step 113544: {'lr': 7.119752031324745e-05, 'samples': 21800448, 'steps': 113543, 'loss/train': 1.8524359464645386} 11/07/2021 13:11:57 - INFO - __main__ - Step 113545: {'lr': 7.119381141841802e-05, 'samples': 21800640, 'steps': 113544, 'loss/train': 1.3938751220703125} 11/07/2021 13:11:57 - INFO - __main__ - Step 113546: {'lr': 7.119010260415595e-05, 'samples': 21800832, 'steps': 113545, 'loss/train': 1.5557857751846313} 11/07/2021 13:11:57 - INFO - __main__ - Step 113547: {'lr': 7.118639387046281e-05, 'samples': 21801024, 'steps': 113546, 'loss/train': 1.3288620710372925} 11/07/2021 13:11:58 - INFO - __main__ - Step 113548: {'lr': 7.11826852173403e-05, 'samples': 21801216, 'steps': 113547, 'loss/train': 1.3275294303894043} 11/07/2021 13:11:58 - INFO - __main__ - Step 113549: {'lr': 7.117897664479012e-05, 'samples': 21801408, 'steps': 113548, 'loss/train': 1.174923300743103} 11/07/2021 13:11:59 - INFO - __main__ - Step 113550: {'lr': 7.117526815281394e-05, 'samples': 21801600, 'steps': 113549, 'loss/train': 1.277293086051941} 11/07/2021 13:12:00 - INFO - __main__ - Step 113551: {'lr': 7.11715597414134e-05, 'samples': 21801792, 'steps': 113550, 'loss/train': 0.9683926701545715} 11/07/2021 13:12:00 - INFO - __main__ - Step 113552: {'lr': 7.116785141059023e-05, 'samples': 21801984, 'steps': 113551, 'loss/train': 1.439343810081482} 11/07/2021 13:12:00 - INFO - __main__ - Step 113553: {'lr': 7.116414316034606e-05, 'samples': 21802176, 'steps': 113552, 'loss/train': 1.0105516910552979} 11/07/2021 13:12:01 - INFO - __main__ - Step 113554: {'lr': 7.116043499068256e-05, 'samples': 21802368, 'steps': 113553, 'loss/train': 0.792324960231781} 11/07/2021 13:12:02 - INFO - __main__ - Step 113555: {'lr': 7.11567269016014e-05, 'samples': 21802560, 'steps': 113554, 'loss/train': 0.7765027284622192} 11/07/2021 13:12:02 - INFO - __main__ - Step 113556: {'lr': 7.115301889310427e-05, 'samples': 21802752, 'steps': 113555, 'loss/train': 1.4385900497436523} 11/07/2021 13:12:03 - INFO - __main__ - Step 113557: {'lr': 7.114931096519281e-05, 'samples': 21802944, 'steps': 113556, 'loss/train': 1.6212834119796753} 11/07/2021 13:12:03 - INFO - __main__ - Step 113558: {'lr': 7.114560311786874e-05, 'samples': 21803136, 'steps': 113557, 'loss/train': 1.7149707078933716} 11/07/2021 13:12:04 - INFO - __main__ - Step 113559: {'lr': 7.114189535113377e-05, 'samples': 21803328, 'steps': 113558, 'loss/train': 1.6212108135223389} 11/07/2021 13:12:04 - INFO - __main__ - Step 113560: {'lr': 7.113818766498942e-05, 'samples': 21803520, 'steps': 113559, 'loss/train': 1.59329354763031} 11/07/2021 13:12:05 - INFO - __main__ - Step 113561: {'lr': 7.113448005943743e-05, 'samples': 21803712, 'steps': 113560, 'loss/train': 1.675001621246338} 11/07/2021 13:12:05 - INFO - __main__ - Step 113562: {'lr': 7.11307725344795e-05, 'samples': 21803904, 'steps': 113561, 'loss/train': 1.366613507270813} 11/07/2021 13:12:06 - INFO - __main__ - Step 113563: {'lr': 7.11270650901173e-05, 'samples': 21804096, 'steps': 113562, 'loss/train': 1.4552959203720093} 11/07/2021 13:12:06 - INFO - __main__ - Step 113564: {'lr': 7.112335772635245e-05, 'samples': 21804288, 'steps': 113563, 'loss/train': 1.9106974601745605} 11/07/2021 13:12:06 - INFO - __main__ - Step 113565: {'lr': 7.111965044318667e-05, 'samples': 21804480, 'steps': 113564, 'loss/train': 1.3379405736923218} 11/07/2021 13:12:08 - INFO - __main__ - Step 113566: {'lr': 7.111594324062162e-05, 'samples': 21804672, 'steps': 113565, 'loss/train': 1.3861033916473389} 11/07/2021 13:12:08 - INFO - __main__ - Step 113567: {'lr': 7.111223611865895e-05, 'samples': 21804864, 'steps': 113566, 'loss/train': 1.4362521171569824} 11/07/2021 13:12:08 - INFO - __main__ - Step 113568: {'lr': 7.110852907730036e-05, 'samples': 21805056, 'steps': 113567, 'loss/train': 1.0492359399795532} 11/07/2021 13:12:09 - INFO - __main__ - Step 113569: {'lr': 7.11048221165475e-05, 'samples': 21805248, 'steps': 113568, 'loss/train': 0.7808606028556824} 11/07/2021 13:12:09 - INFO - __main__ - Step 113570: {'lr': 7.110111523640205e-05, 'samples': 21805440, 'steps': 113569, 'loss/train': 1.3490021228790283} 11/07/2021 13:12:09 - INFO - __main__ - Step 113571: {'lr': 7.109740843686568e-05, 'samples': 21805632, 'steps': 113570, 'loss/train': 1.5077658891677856} 11/07/2021 13:12:10 - INFO - __main__ - Step 113572: {'lr': 7.109370171794005e-05, 'samples': 21805824, 'steps': 113571, 'loss/train': 2.425794839859009} 11/07/2021 13:12:11 - INFO - __main__ - Step 113573: {'lr': 7.10899950796269e-05, 'samples': 21806016, 'steps': 113572, 'loss/train': 1.7305243015289307} 11/07/2021 13:12:11 - INFO - __main__ - Step 113574: {'lr': 7.10862885219278e-05, 'samples': 21806208, 'steps': 113573, 'loss/train': 1.2553575038909912} 11/07/2021 13:12:11 - INFO - __main__ - Step 113575: {'lr': 7.108258204484445e-05, 'samples': 21806400, 'steps': 113574, 'loss/train': 1.5762559175491333} 11/07/2021 13:12:12 - INFO - __main__ - Step 113576: {'lr': 7.10788756483785e-05, 'samples': 21806592, 'steps': 113575, 'loss/train': 1.5951958894729614} 11/07/2021 13:12:12 - INFO - __main__ - Step 113577: {'lr': 7.107516933253166e-05, 'samples': 21806784, 'steps': 113576, 'loss/train': 1.7054716348648071} 11/07/2021 13:12:13 - INFO - __main__ - Step 113578: {'lr': 7.107146309730558e-05, 'samples': 21806976, 'steps': 113577, 'loss/train': 1.5844247341156006} 11/07/2021 13:12:14 - INFO - __main__ - Step 113579: {'lr': 7.106775694270196e-05, 'samples': 21807168, 'steps': 113578, 'loss/train': 1.8241102695465088} 11/07/2021 13:12:14 - INFO - __main__ - Step 113580: {'lr': 7.106405086872242e-05, 'samples': 21807360, 'steps': 113579, 'loss/train': 1.2846183776855469} 11/07/2021 13:12:14 - INFO - __main__ - Step 113581: {'lr': 7.106034487536866e-05, 'samples': 21807552, 'steps': 113580, 'loss/train': 1.5052683353424072} 11/07/2021 13:12:15 - INFO - __main__ - Step 113582: {'lr': 7.105663896264236e-05, 'samples': 21807744, 'steps': 113581, 'loss/train': 1.4947772026062012} 11/07/2021 13:12:16 - INFO - __main__ - Step 113583: {'lr': 7.10529331305452e-05, 'samples': 21807936, 'steps': 113582, 'loss/train': 0.7966917753219604} 11/07/2021 13:12:16 - INFO - __main__ - Step 113584: {'lr': 7.104922737907879e-05, 'samples': 21808128, 'steps': 113583, 'loss/train': 2.0344128608703613} 11/07/2021 13:12:16 - INFO - __main__ - Step 113585: {'lr': 7.104552170824485e-05, 'samples': 21808320, 'steps': 113584, 'loss/train': 1.4551502466201782} 11/07/2021 13:12:17 - INFO - __main__ - Step 113586: {'lr': 7.10418161180451e-05, 'samples': 21808512, 'steps': 113585, 'loss/train': 1.5962592363357544} 11/07/2021 13:12:17 - INFO - __main__ - Step 113587: {'lr': 7.10381106084811e-05, 'samples': 21808704, 'steps': 113586, 'loss/train': 1.5092025995254517} 11/07/2021 13:12:18 - INFO - __main__ - Step 113588: {'lr': 7.103440517955454e-05, 'samples': 21808896, 'steps': 113587, 'loss/train': 1.775223731994629} 11/07/2021 13:12:18 - INFO - __main__ - Step 113589: {'lr': 7.103069983126714e-05, 'samples': 21809088, 'steps': 113588, 'loss/train': 1.2280138731002808} 11/07/2021 13:12:19 - INFO - __main__ - Step 113590: {'lr': 7.102699456362053e-05, 'samples': 21809280, 'steps': 113589, 'loss/train': 1.3310476541519165} 11/07/2021 13:12:19 - INFO - __main__ - Step 113591: {'lr': 7.102328937661637e-05, 'samples': 21809472, 'steps': 113590, 'loss/train': 1.4628525972366333} 11/07/2021 13:12:19 - INFO - __main__ - Step 113592: {'lr': 7.10195842702564e-05, 'samples': 21809664, 'steps': 113591, 'loss/train': 1.6299718618392944} 11/07/2021 13:12:21 - INFO - __main__ - Step 113593: {'lr': 7.10158792445422e-05, 'samples': 21809856, 'steps': 113592, 'loss/train': 1.5007851123809814} 11/07/2021 13:12:21 - INFO - __main__ - Step 113594: {'lr': 7.101217429947552e-05, 'samples': 21810048, 'steps': 113593, 'loss/train': 1.333997130393982} 11/07/2021 13:12:21 - INFO - __main__ - Step 113595: {'lr': 7.100846943505799e-05, 'samples': 21810240, 'steps': 113594, 'loss/train': 0.9277707934379578} 11/07/2021 13:12:22 - INFO - __main__ - Step 113596: {'lr': 7.100476465129125e-05, 'samples': 21810432, 'steps': 113595, 'loss/train': 1.4883168935775757} 11/07/2021 13:12:22 - INFO - __main__ - Step 113597: {'lr': 7.100105994817702e-05, 'samples': 21810624, 'steps': 113596, 'loss/train': 1.492628812789917} 11/07/2021 13:12:23 - INFO - __main__ - Step 113598: {'lr': 7.099735532571694e-05, 'samples': 21810816, 'steps': 113597, 'loss/train': 1.513278603553772} 11/07/2021 13:12:23 - INFO - __main__ - Step 113599: {'lr': 7.099365078391271e-05, 'samples': 21811008, 'steps': 113598, 'loss/train': 1.3831206560134888} 11/07/2021 13:12:24 - INFO - __main__ - Step 113600: {'lr': 7.098994632276603e-05, 'samples': 21811200, 'steps': 113599, 'loss/train': 1.3215900659561157} 11/07/2021 13:12:24 - INFO - __main__ - Step 113601: {'lr': 7.098624194227845e-05, 'samples': 21811392, 'steps': 113600, 'loss/train': 1.2414021492004395} 11/07/2021 13:12:24 - INFO - __main__ - Step 113602: {'lr': 7.098253764245171e-05, 'samples': 21811584, 'steps': 113601, 'loss/train': 1.081100344657898} 11/07/2021 13:12:25 - INFO - __main__ - Step 113603: {'lr': 7.097883342328748e-05, 'samples': 21811776, 'steps': 113602, 'loss/train': 1.784088373184204} 11/07/2021 13:12:26 - INFO - __main__ - Step 113604: {'lr': 7.097512928478744e-05, 'samples': 21811968, 'steps': 113603, 'loss/train': 1.3592946529388428} 11/07/2021 13:12:26 - INFO - __main__ - Step 113605: {'lr': 7.097142522695321e-05, 'samples': 21812160, 'steps': 113604, 'loss/train': 1.269849419593811} 11/07/2021 13:12:27 - INFO - __main__ - Step 113606: {'lr': 7.096772124978651e-05, 'samples': 21812352, 'steps': 113605, 'loss/train': 1.437166690826416} 11/07/2021 13:12:27 - INFO - __main__ - Step 113607: {'lr': 7.0964017353289e-05, 'samples': 21812544, 'steps': 113606, 'loss/train': 1.313603162765503} 11/07/2021 13:12:27 - INFO - __main__ - Step 113608: {'lr': 7.096031353746235e-05, 'samples': 21812736, 'steps': 113607, 'loss/train': 1.1589796543121338} 11/07/2021 13:12:28 - INFO - __main__ - Step 113609: {'lr': 7.09566098023082e-05, 'samples': 21812928, 'steps': 113608, 'loss/train': 1.3726726770401} 11/07/2021 13:12:29 - INFO - __main__ - Step 113610: {'lr': 7.095290614782823e-05, 'samples': 21813120, 'steps': 113609, 'loss/train': 1.0751780271530151} 11/07/2021 13:12:29 - INFO - __main__ - Step 113611: {'lr': 7.094920257402413e-05, 'samples': 21813312, 'steps': 113610, 'loss/train': 1.1954325437545776} 11/07/2021 13:12:29 - INFO - __main__ - Step 113612: {'lr': 7.094549908089756e-05, 'samples': 21813504, 'steps': 113611, 'loss/train': 1.5596405267715454} 11/07/2021 13:12:30 - INFO - __main__ - Step 113613: {'lr': 7.094179566845027e-05, 'samples': 21813696, 'steps': 113612, 'loss/train': 1.2888294458389282} 11/07/2021 13:12:31 - INFO - __main__ - Step 113614: {'lr': 7.093809233668374e-05, 'samples': 21813888, 'steps': 113613, 'loss/train': 1.1570848226547241} 11/07/2021 13:12:32 - INFO - __main__ - Step 113615: {'lr': 7.093438908559977e-05, 'samples': 21814080, 'steps': 113614, 'loss/train': 0.5242037177085876} 11/07/2021 13:12:32 - INFO - __main__ - Step 113616: {'lr': 7.093068591519999e-05, 'samples': 21814272, 'steps': 113615, 'loss/train': 0.7866669297218323} 11/07/2021 13:12:32 - INFO - __main__ - Step 113617: {'lr': 7.092698282548607e-05, 'samples': 21814464, 'steps': 113616, 'loss/train': 0.9510837197303772} 11/07/2021 13:12:33 - INFO - __main__ - Step 113618: {'lr': 7.092327981645971e-05, 'samples': 21814656, 'steps': 113617, 'loss/train': 1.299705147743225} 11/07/2021 13:12:33 - INFO - __main__ - Step 113619: {'lr': 7.091957688812253e-05, 'samples': 21814848, 'steps': 113618, 'loss/train': 1.084004282951355} 11/07/2021 13:12:33 - INFO - __main__ - Step 113620: {'lr': 7.091587404047625e-05, 'samples': 21815040, 'steps': 113619, 'loss/train': 1.0760787725448608} 11/07/2021 13:12:34 - INFO - __main__ - Step 113621: {'lr': 7.09121712735225e-05, 'samples': 21815232, 'steps': 113620, 'loss/train': 1.3976778984069824} 11/07/2021 13:12:35 - INFO - __main__ - Step 113622: {'lr': 7.090846858726296e-05, 'samples': 21815424, 'steps': 113621, 'loss/train': 1.5120772123336792} 11/07/2021 13:12:35 - INFO - __main__ - Step 113623: {'lr': 7.090476598169932e-05, 'samples': 21815616, 'steps': 113622, 'loss/train': 1.2966816425323486} 11/07/2021 13:12:36 - INFO - __main__ - Step 113624: {'lr': 7.09010634568332e-05, 'samples': 21815808, 'steps': 113623, 'loss/train': 1.5847946405410767} 11/07/2021 13:12:36 - INFO - __main__ - Step 113625: {'lr': 7.08973610126663e-05, 'samples': 21816000, 'steps': 113624, 'loss/train': 1.0231491327285767} 11/07/2021 13:12:37 - INFO - __main__ - Step 113626: {'lr': 7.08936586492003e-05, 'samples': 21816192, 'steps': 113625, 'loss/train': 1.5807379484176636} 11/07/2021 13:12:37 - INFO - __main__ - Step 113627: {'lr': 7.088995636643694e-05, 'samples': 21816384, 'steps': 113626, 'loss/train': 1.23664128780365} 11/07/2021 13:12:38 - INFO - __main__ - Step 113628: {'lr': 7.088625416437772e-05, 'samples': 21816576, 'steps': 113627, 'loss/train': 0.9211175441741943} 11/07/2021 13:12:38 - INFO - __main__ - Step 113629: {'lr': 7.088255204302437e-05, 'samples': 21816768, 'steps': 113628, 'loss/train': 0.3608643412590027} 11/07/2021 13:12:38 - INFO - __main__ - Step 113630: {'lr': 7.087885000237859e-05, 'samples': 21816960, 'steps': 113629, 'loss/train': 1.5480574369430542} 11/07/2021 13:12:39 - INFO - __main__ - Step 113631: {'lr': 7.087514804244205e-05, 'samples': 21817152, 'steps': 113630, 'loss/train': 1.5039446353912354} 11/07/2021 13:12:40 - INFO - __main__ - Step 113632: {'lr': 7.087144616321639e-05, 'samples': 21817344, 'steps': 113631, 'loss/train': 1.2338083982467651} 11/07/2021 13:12:40 - INFO - __main__ - Step 113633: {'lr': 7.086774436470328e-05, 'samples': 21817536, 'steps': 113632, 'loss/train': 1.5054142475128174} 11/07/2021 13:12:40 - INFO - __main__ - Step 113634: {'lr': 7.086404264690443e-05, 'samples': 21817728, 'steps': 113633, 'loss/train': 1.0881307125091553} 11/07/2021 13:12:41 - INFO - __main__ - Step 113635: {'lr': 7.086034100982145e-05, 'samples': 21817920, 'steps': 113634, 'loss/train': 1.6458600759506226} 11/07/2021 13:12:42 - INFO - __main__ - Step 113636: {'lr': 7.085663945345605e-05, 'samples': 21818112, 'steps': 113635, 'loss/train': 1.2671446800231934} 11/07/2021 13:12:42 - INFO - __main__ - Step 113637: {'lr': 7.085293797780989e-05, 'samples': 21818304, 'steps': 113636, 'loss/train': 1.4831931591033936} 11/07/2021 13:12:42 - INFO - __main__ - Step 113638: {'lr': 7.084923658288462e-05, 'samples': 21818496, 'steps': 113637, 'loss/train': 1.1222879886627197} 11/07/2021 13:12:43 - INFO - __main__ - Step 113639: {'lr': 7.084553526868192e-05, 'samples': 21818688, 'steps': 113638, 'loss/train': 1.4725441932678223} 11/07/2021 13:12:43 - INFO - __main__ - Step 113640: {'lr': 7.084183403520353e-05, 'samples': 21818880, 'steps': 113639, 'loss/train': 1.2958241701126099} 11/07/2021 13:12:44 - INFO - __main__ - Step 113641: {'lr': 7.083813288245098e-05, 'samples': 21819072, 'steps': 113640, 'loss/train': 1.4206016063690186} 11/07/2021 13:12:45 - INFO - __main__ - Step 113642: {'lr': 7.0834431810426e-05, 'samples': 21819264, 'steps': 113641, 'loss/train': 1.2754385471343994} 11/07/2021 13:12:45 - INFO - __main__ - Step 113643: {'lr': 7.083073081913027e-05, 'samples': 21819456, 'steps': 113642, 'loss/train': 0.9062396287918091} 11/07/2021 13:12:45 - INFO - __main__ - Step 113644: {'lr': 7.082702990856543e-05, 'samples': 21819648, 'steps': 113643, 'loss/train': 1.4558407068252563} 11/07/2021 13:12:46 - INFO - __main__ - Step 113645: {'lr': 7.082332907873317e-05, 'samples': 21819840, 'steps': 113644, 'loss/train': 1.5723286867141724} 11/07/2021 13:12:46 - INFO - __main__ - Step 113646: {'lr': 7.081962832963515e-05, 'samples': 21820032, 'steps': 113645, 'loss/train': 1.3266162872314453} 11/07/2021 13:12:47 - INFO - __main__ - Step 113647: {'lr': 7.081592766127304e-05, 'samples': 21820224, 'steps': 113646, 'loss/train': 1.372610092163086} 11/07/2021 13:12:47 - INFO - __main__ - Step 113648: {'lr': 7.081222707364851e-05, 'samples': 21820416, 'steps': 113647, 'loss/train': 1.0738364458084106} 11/07/2021 13:12:48 - INFO - __main__ - Step 113649: {'lr': 7.080852656676323e-05, 'samples': 21820608, 'steps': 113648, 'loss/train': 1.2355842590332031} 11/07/2021 13:12:48 - INFO - __main__ - Step 113650: {'lr': 7.080482614061887e-05, 'samples': 21820800, 'steps': 113649, 'loss/train': 1.4560965299606323} 11/07/2021 13:12:48 - INFO - __main__ - Step 113651: {'lr': 7.080112579521709e-05, 'samples': 21820992, 'steps': 113650, 'loss/train': 1.2442587614059448} 11/07/2021 13:12:49 - INFO - __main__ - Step 113652: {'lr': 7.079742553055962e-05, 'samples': 21821184, 'steps': 113651, 'loss/train': 1.4322298765182495} 11/07/2021 13:12:50 - INFO - __main__ - Step 113653: {'lr': 7.079372534664799e-05, 'samples': 21821376, 'steps': 113652, 'loss/train': 0.9517223834991455} 11/07/2021 13:12:50 - INFO - __main__ - Step 113654: {'lr': 7.079002524348396e-05, 'samples': 21821568, 'steps': 113653, 'loss/train': 1.121625542640686} 11/07/2021 13:12:50 - INFO - __main__ - Step 113655: {'lr': 7.078632522106915e-05, 'samples': 21821760, 'steps': 113654, 'loss/train': 1.9016519784927368} 11/07/2021 13:12:51 - INFO - __main__ - Step 113656: {'lr': 7.07826252794053e-05, 'samples': 21821952, 'steps': 113655, 'loss/train': 1.4947270154953003} 11/07/2021 13:12:52 - INFO - __main__ - Step 113657: {'lr': 7.077892541849398e-05, 'samples': 21822144, 'steps': 113656, 'loss/train': 1.446871042251587} 11/07/2021 13:12:52 - INFO - __main__ - Step 113658: {'lr': 7.077522563833694e-05, 'samples': 21822336, 'steps': 113657, 'loss/train': 1.1107327938079834} 11/07/2021 13:12:53 - INFO - __main__ - Step 113659: {'lr': 7.077152593893583e-05, 'samples': 21822528, 'steps': 113658, 'loss/train': 1.3723671436309814} 11/07/2021 13:12:53 - INFO - __main__ - Step 113660: {'lr': 7.076782632029227e-05, 'samples': 21822720, 'steps': 113659, 'loss/train': 1.1308685541152954} 11/07/2021 13:12:53 - INFO - __main__ - Step 113661: {'lr': 7.076412678240798e-05, 'samples': 21822912, 'steps': 113660, 'loss/train': 1.181972622871399} 11/07/2021 13:12:54 - INFO - __main__ - Step 113662: {'lr': 7.07604273252847e-05, 'samples': 21823104, 'steps': 113661, 'loss/train': 1.948379635810852} 11/07/2021 13:12:55 - INFO - __main__ - Step 113663: {'lr': 7.07567279489239e-05, 'samples': 21823296, 'steps': 113662, 'loss/train': 1.64454984664917} 11/07/2021 13:12:55 - INFO - __main__ - Step 113664: {'lr': 7.075302865332736e-05, 'samples': 21823488, 'steps': 113663, 'loss/train': 0.9332961440086365} 11/07/2021 13:12:55 - INFO - __main__ - Step 113665: {'lr': 7.074932943849677e-05, 'samples': 21823680, 'steps': 113664, 'loss/train': 1.2675812244415283} 11/07/2021 13:12:56 - INFO - __main__ - Step 113666: {'lr': 7.074563030443373e-05, 'samples': 21823872, 'steps': 113665, 'loss/train': 0.6212610602378845} 11/07/2021 13:12:57 - INFO - __main__ - Step 113667: {'lr': 7.074193125113996e-05, 'samples': 21824064, 'steps': 113666, 'loss/train': 1.155055046081543} 11/07/2021 13:12:57 - INFO - __main__ - Step 113668: {'lr': 7.073823227861712e-05, 'samples': 21824256, 'steps': 113667, 'loss/train': 1.4757227897644043} 11/07/2021 13:12:58 - INFO - __main__ - Step 113669: {'lr': 7.073453338686684e-05, 'samples': 21824448, 'steps': 113668, 'loss/train': 0.9213953018188477} 11/07/2021 13:12:58 - INFO - __main__ - Step 113670: {'lr': 7.073083457589083e-05, 'samples': 21824640, 'steps': 113669, 'loss/train': 1.598501443862915} 11/07/2021 13:12:58 - INFO - __main__ - Step 113671: {'lr': 7.072713584569071e-05, 'samples': 21824832, 'steps': 113670, 'loss/train': 0.675862729549408} 11/07/2021 13:12:59 - INFO - __main__ - Step 113672: {'lr': 7.072343719626822e-05, 'samples': 21825024, 'steps': 113671, 'loss/train': 1.4417997598648071} 11/07/2021 13:13:00 - INFO - __main__ - Step 113673: {'lr': 7.071973862762504e-05, 'samples': 21825216, 'steps': 113672, 'loss/train': 5.714617729187012} 11/07/2021 13:13:00 - INFO - __main__ - Step 113674: {'lr': 7.071604013976268e-05, 'samples': 21825408, 'steps': 113673, 'loss/train': 1.2484283447265625} 11/07/2021 13:13:00 - INFO - __main__ - Step 113675: {'lr': 7.071234173268296e-05, 'samples': 21825600, 'steps': 113674, 'loss/train': 0.8904268145561218} 11/07/2021 13:13:01 - INFO - __main__ - Step 113676: {'lr': 7.070864340638744e-05, 'samples': 21825792, 'steps': 113675, 'loss/train': 1.4045313596725464} 11/07/2021 13:13:01 - INFO - __main__ - Step 113677: {'lr': 7.070494516087786e-05, 'samples': 21825984, 'steps': 113676, 'loss/train': 0.9134686589241028} 11/07/2021 13:13:02 - INFO - __main__ - Step 113678: {'lr': 7.070124699615588e-05, 'samples': 21826176, 'steps': 113677, 'loss/train': 1.4524012804031372} 11/07/2021 13:13:03 - INFO - __main__ - Step 113679: {'lr': 7.069754891222313e-05, 'samples': 21826368, 'steps': 113678, 'loss/train': 1.3818011283874512} 11/07/2021 13:13:03 - INFO - __main__ - Step 113680: {'lr': 7.069385090908128e-05, 'samples': 21826560, 'steps': 113679, 'loss/train': 0.8719752430915833} 11/07/2021 13:13:03 - INFO - __main__ - Step 113681: {'lr': 7.069015298673206e-05, 'samples': 21826752, 'steps': 113680, 'loss/train': 1.542598009109497} 11/07/2021 13:13:04 - INFO - __main__ - Step 113682: {'lr': 7.068645514517707e-05, 'samples': 21826944, 'steps': 113681, 'loss/train': 1.0678131580352783} 11/07/2021 13:13:04 - INFO - __main__ - Step 113683: {'lr': 7.068275738441798e-05, 'samples': 21827136, 'steps': 113682, 'loss/train': 0.0929284319281578} 11/07/2021 13:13:05 - INFO - __main__ - Step 113684: {'lr': 7.067905970445657e-05, 'samples': 21827328, 'steps': 113683, 'loss/train': 1.3791160583496094} 11/07/2021 13:13:05 - INFO - __main__ - Step 113685: {'lr': 7.067536210529433e-05, 'samples': 21827520, 'steps': 113684, 'loss/train': 0.8437812924385071} 11/07/2021 13:13:06 - INFO - __main__ - Step 113686: {'lr': 7.0671664586933e-05, 'samples': 21827712, 'steps': 113685, 'loss/train': 1.5468168258666992} 11/07/2021 13:13:06 - INFO - __main__ - Step 113687: {'lr': 7.066796714937424e-05, 'samples': 21827904, 'steps': 113686, 'loss/train': 1.5132994651794434} 11/07/2021 13:13:06 - INFO - __main__ - Step 113688: {'lr': 7.066426979261975e-05, 'samples': 21828096, 'steps': 113687, 'loss/train': 0.7571097612380981} 11/07/2021 13:13:08 - INFO - __main__ - Step 113689: {'lr': 7.066057251667116e-05, 'samples': 21828288, 'steps': 113688, 'loss/train': 1.0638960599899292} 11/07/2021 13:13:08 - INFO - __main__ - Step 113690: {'lr': 7.065687532153015e-05, 'samples': 21828480, 'steps': 113689, 'loss/train': 1.280683159828186} 11/07/2021 13:13:08 - INFO - __main__ - Step 113691: {'lr': 7.065317820719838e-05, 'samples': 21828672, 'steps': 113690, 'loss/train': 1.2152329683303833} 11/07/2021 13:13:09 - INFO - __main__ - Step 113692: {'lr': 7.06494811736775e-05, 'samples': 21828864, 'steps': 113691, 'loss/train': 1.0361942052841187} 11/07/2021 13:13:09 - INFO - __main__ - Step 113693: {'lr': 7.064578422096924e-05, 'samples': 21829056, 'steps': 113692, 'loss/train': 1.199116826057434} 11/07/2021 13:13:10 - INFO - __main__ - Step 113694: {'lr': 7.06420873490752e-05, 'samples': 21829248, 'steps': 113693, 'loss/train': 1.8086899518966675} 11/07/2021 13:13:11 - INFO - __main__ - Step 113695: {'lr': 7.063839055799714e-05, 'samples': 21829440, 'steps': 113694, 'loss/train': 1.8684568405151367} 11/07/2021 13:13:11 - INFO - __main__ - Step 113696: {'lr': 7.063469384773658e-05, 'samples': 21829632, 'steps': 113695, 'loss/train': 1.5954350233078003} 11/07/2021 13:13:11 - INFO - __main__ - Step 113697: {'lr': 7.063099721829528e-05, 'samples': 21829824, 'steps': 113696, 'loss/train': 1.3643499612808228} 11/07/2021 13:13:12 - INFO - __main__ - Step 113698: {'lr': 7.062730066967485e-05, 'samples': 21830016, 'steps': 113697, 'loss/train': 0.9932175874710083} 11/07/2021 13:13:13 - INFO - __main__ - Step 113699: {'lr': 7.062360420187703e-05, 'samples': 21830208, 'steps': 113698, 'loss/train': 1.3718628883361816} 11/07/2021 13:13:13 - INFO - __main__ - Step 113700: {'lr': 7.06199078149034e-05, 'samples': 21830400, 'steps': 113699, 'loss/train': 1.3363821506500244} 11/07/2021 13:13:13 - INFO - __main__ - Step 113701: {'lr': 7.061621150875569e-05, 'samples': 21830592, 'steps': 113700, 'loss/train': 0.9237774610519409} 11/07/2021 13:13:14 - INFO - __main__ - Step 113702: {'lr': 7.061251528343557e-05, 'samples': 21830784, 'steps': 113701, 'loss/train': 1.4923937320709229} 11/07/2021 13:13:14 - INFO - __main__ - Step 113703: {'lr': 7.060881913894465e-05, 'samples': 21830976, 'steps': 113702, 'loss/train': 1.135838270187378} 11/07/2021 13:13:15 - INFO - __main__ - Step 113704: {'lr': 7.060512307528466e-05, 'samples': 21831168, 'steps': 113703, 'loss/train': 1.3053076267242432} 11/07/2021 13:13:15 - INFO - __main__ - Step 113705: {'lr': 7.060142709245721e-05, 'samples': 21831360, 'steps': 113704, 'loss/train': 1.0941754579544067} 11/07/2021 13:13:16 - INFO - __main__ - Step 113706: {'lr': 7.059773119046397e-05, 'samples': 21831552, 'steps': 113705, 'loss/train': 1.1480584144592285} 11/07/2021 13:13:16 - INFO - __main__ - Step 113707: {'lr': 7.059403536930675e-05, 'samples': 21831744, 'steps': 113706, 'loss/train': 1.5117346048355103} 11/07/2021 13:13:16 - INFO - __main__ - Step 113708: {'lr': 7.059033962898698e-05, 'samples': 21831936, 'steps': 113707, 'loss/train': 1.1367188692092896} 11/07/2021 13:13:18 - INFO - __main__ - Step 113709: {'lr': 7.058664396950645e-05, 'samples': 21832128, 'steps': 113708, 'loss/train': 1.6050692796707153} 11/07/2021 13:13:18 - INFO - __main__ - Step 113710: {'lr': 7.058294839086679e-05, 'samples': 21832320, 'steps': 113709, 'loss/train': 1.3372830152511597} 11/07/2021 13:13:18 - INFO - __main__ - Step 113711: {'lr': 7.057925289306972e-05, 'samples': 21832512, 'steps': 113710, 'loss/train': 1.6677043437957764} 11/07/2021 13:13:19 - INFO - __main__ - Step 113712: {'lr': 7.057555747611683e-05, 'samples': 21832704, 'steps': 113711, 'loss/train': 1.0269278287887573} 11/07/2021 13:13:19 - INFO - __main__ - Step 113713: {'lr': 7.057186214000985e-05, 'samples': 21832896, 'steps': 113712, 'loss/train': 1.1312813758850098} 11/07/2021 13:13:19 - INFO - __main__ - Step 113714: {'lr': 7.05681668847504e-05, 'samples': 21833088, 'steps': 113713, 'loss/train': 1.6240352392196655} 11/07/2021 13:13:21 - INFO - __main__ - Step 113715: {'lr': 7.056447171034017e-05, 'samples': 21833280, 'steps': 113714, 'loss/train': 0.8681838512420654} 11/07/2021 13:13:21 - INFO - __main__ - Step 113716: {'lr': 7.056077661678084e-05, 'samples': 21833472, 'steps': 113715, 'loss/train': 1.5890192985534668} 11/07/2021 13:13:22 - INFO - __main__ - Step 113717: {'lr': 7.055708160407404e-05, 'samples': 21833664, 'steps': 113716, 'loss/train': 1.4998035430908203} 11/07/2021 13:13:22 - INFO - __main__ - Step 113718: {'lr': 7.055338667222144e-05, 'samples': 21833856, 'steps': 113717, 'loss/train': 1.173094630241394} 11/07/2021 13:13:22 - INFO - __main__ - Step 113719: {'lr': 7.054969182122473e-05, 'samples': 21834048, 'steps': 113718, 'loss/train': 1.6208416223526} 11/07/2021 13:13:23 - INFO - __main__ - Step 113720: {'lr': 7.054599705108556e-05, 'samples': 21834240, 'steps': 113719, 'loss/train': 1.3380155563354492} 11/07/2021 13:13:24 - INFO - __main__ - Step 113721: {'lr': 7.054230236180567e-05, 'samples': 21834432, 'steps': 113720, 'loss/train': 0.8778579831123352} 11/07/2021 13:13:24 - INFO - __main__ - Step 113722: {'lr': 7.053860775338658e-05, 'samples': 21834624, 'steps': 113721, 'loss/train': 1.873878002166748} 11/07/2021 13:13:24 - INFO - __main__ - Step 113723: {'lr': 7.053491322583e-05, 'samples': 21834816, 'steps': 113722, 'loss/train': 1.6086971759796143} 11/07/2021 13:13:25 - INFO - __main__ - Step 113724: {'lr': 7.053121877913765e-05, 'samples': 21835008, 'steps': 113723, 'loss/train': 0.6449277997016907} 11/07/2021 13:13:25 - INFO - __main__ - Step 113725: {'lr': 7.052752441331114e-05, 'samples': 21835200, 'steps': 113724, 'loss/train': 1.5431180000305176} 11/07/2021 13:13:26 - INFO - __main__ - Step 113726: {'lr': 7.052383012835217e-05, 'samples': 21835392, 'steps': 113725, 'loss/train': 1.3353830575942993} 11/07/2021 13:13:26 - INFO - __main__ - Step 113727: {'lr': 7.052013592426237e-05, 'samples': 21835584, 'steps': 113726, 'loss/train': 2.811612844467163} 11/07/2021 13:13:27 - INFO - __main__ - Step 113728: {'lr': 7.051644180104346e-05, 'samples': 21835776, 'steps': 113727, 'loss/train': 1.2782293558120728} 11/07/2021 13:13:27 - INFO - __main__ - Step 113729: {'lr': 7.051274775869706e-05, 'samples': 21835968, 'steps': 113728, 'loss/train': 0.864604115486145} 11/07/2021 13:13:28 - INFO - __main__ - Step 113730: {'lr': 7.050905379722483e-05, 'samples': 21836160, 'steps': 113729, 'loss/train': 1.4494132995605469} 11/07/2021 13:13:28 - INFO - __main__ - Step 113731: {'lr': 7.050535991662849e-05, 'samples': 21836352, 'steps': 113730, 'loss/train': 1.2621577978134155} 11/07/2021 13:13:29 - INFO - __main__ - Step 113732: {'lr': 7.050166611690962e-05, 'samples': 21836544, 'steps': 113731, 'loss/train': 1.5192198753356934} 11/07/2021 13:13:29 - INFO - __main__ - Step 113733: {'lr': 7.049797239806996e-05, 'samples': 21836736, 'steps': 113732, 'loss/train': 1.0398447513580322} 11/07/2021 13:13:30 - INFO - __main__ - Step 113734: {'lr': 7.049427876011119e-05, 'samples': 21836928, 'steps': 113733, 'loss/train': 5.665144443511963} 11/07/2021 13:13:30 - INFO - __main__ - Step 113735: {'lr': 7.049058520303489e-05, 'samples': 21837120, 'steps': 113734, 'loss/train': 1.5120396614074707} 11/07/2021 13:13:30 - INFO - __main__ - Step 113736: {'lr': 7.048689172684272e-05, 'samples': 21837312, 'steps': 113735, 'loss/train': 0.9865121245384216} 11/07/2021 13:13:31 - INFO - __main__ - Step 113737: {'lr': 7.048319833153641e-05, 'samples': 21837504, 'steps': 113736, 'loss/train': 1.5343670845031738} 11/07/2021 13:13:32 - INFO - __main__ - Step 113738: {'lr': 7.047950501711762e-05, 'samples': 21837696, 'steps': 113737, 'loss/train': 1.364935278892517} 11/07/2021 13:13:32 - INFO - __main__ - Step 113739: {'lr': 7.047581178358797e-05, 'samples': 21837888, 'steps': 113738, 'loss/train': 1.2627149820327759} 11/07/2021 13:13:33 - INFO - __main__ - Step 113740: {'lr': 7.047211863094915e-05, 'samples': 21838080, 'steps': 113739, 'loss/train': 1.1461849212646484} 11/07/2021 13:13:33 - INFO - __main__ - Step 113741: {'lr': 7.046842555920283e-05, 'samples': 21838272, 'steps': 113740, 'loss/train': 1.0798490047454834} 11/07/2021 13:13:33 - INFO - __main__ - Step 113742: {'lr': 7.046473256835065e-05, 'samples': 21838464, 'steps': 113741, 'loss/train': 1.3935081958770752} 11/07/2021 13:13:34 - INFO - __main__ - Step 113743: {'lr': 7.046103965839431e-05, 'samples': 21838656, 'steps': 113742, 'loss/train': 1.1790639162063599} 11/07/2021 13:13:35 - INFO - __main__ - Step 113744: {'lr': 7.045734682933547e-05, 'samples': 21838848, 'steps': 113743, 'loss/train': 1.0600634813308716} 11/07/2021 13:13:35 - INFO - __main__ - Step 113745: {'lr': 7.045365408117573e-05, 'samples': 21839040, 'steps': 113744, 'loss/train': 1.5753546953201294} 11/07/2021 13:13:36 - INFO - __main__ - Step 113746: {'lr': 7.044996141391685e-05, 'samples': 21839232, 'steps': 113745, 'loss/train': 1.549675703048706} 11/07/2021 13:13:36 - INFO - __main__ - Step 113747: {'lr': 7.044626882756041e-05, 'samples': 21839424, 'steps': 113746, 'loss/train': 1.445500135421753} 11/07/2021 13:13:37 - INFO - __main__ - Step 113748: {'lr': 7.044257632210822e-05, 'samples': 21839616, 'steps': 113747, 'loss/train': 1.4030653238296509} 11/07/2021 13:13:38 - INFO - __main__ - Step 113749: {'lr': 7.043888389756176e-05, 'samples': 21839808, 'steps': 113748, 'loss/train': 1.2465745210647583} 11/07/2021 13:13:38 - INFO - __main__ - Step 113750: {'lr': 7.043519155392272e-05, 'samples': 21840000, 'steps': 113749, 'loss/train': 1.6646201610565186} 11/07/2021 13:13:38 - INFO - __main__ - Step 113751: {'lr': 7.043149929119285e-05, 'samples': 21840192, 'steps': 113750, 'loss/train': 1.4405267238616943} 11/07/2021 13:13:39 - INFO - __main__ - Step 113752: {'lr': 7.042780710937377e-05, 'samples': 21840384, 'steps': 113751, 'loss/train': 1.0002772808074951} 11/07/2021 13:13:40 - INFO - __main__ - Step 113753: {'lr': 7.042411500846715e-05, 'samples': 21840576, 'steps': 113752, 'loss/train': 0.4016653299331665} 11/07/2021 13:13:40 - INFO - __main__ - Step 113754: {'lr': 7.042042298847464e-05, 'samples': 21840768, 'steps': 113753, 'loss/train': 1.201458215713501} 11/07/2021 13:13:41 - INFO - __main__ - Step 113755: {'lr': 7.041673104939794e-05, 'samples': 21840960, 'steps': 113754, 'loss/train': 1.428313970565796} 11/07/2021 13:13:41 - INFO - __main__ - Step 113756: {'lr': 7.041303919123868e-05, 'samples': 21841152, 'steps': 113755, 'loss/train': 1.6826757192611694} 11/07/2021 13:13:41 - INFO - __main__ - Step 113757: {'lr': 7.040934741399854e-05, 'samples': 21841344, 'steps': 113756, 'loss/train': 1.412794589996338} 11/07/2021 13:13:43 - INFO - __main__ - Step 113758: {'lr': 7.040565571767916e-05, 'samples': 21841536, 'steps': 113757, 'loss/train': 1.3862833976745605} 11/07/2021 13:13:43 - INFO - __main__ - Step 113759: {'lr': 7.040196410228223e-05, 'samples': 21841728, 'steps': 113758, 'loss/train': 1.5359870195388794} 11/07/2021 13:13:43 - INFO - __main__ - Step 113760: {'lr': 7.03982725678094e-05, 'samples': 21841920, 'steps': 113759, 'loss/train': 0.7387335896492004} 11/07/2021 13:13:44 - INFO - __main__ - Step 113761: {'lr': 7.039458111426241e-05, 'samples': 21842112, 'steps': 113760, 'loss/train': 1.5261894464492798} 11/07/2021 13:13:44 - INFO - __main__ - Step 113762: {'lr': 7.039088974164278e-05, 'samples': 21842304, 'steps': 113761, 'loss/train': 0.7910393476486206} 11/07/2021 13:13:45 - INFO - __main__ - Step 113763: {'lr': 7.038719844995226e-05, 'samples': 21842496, 'steps': 113762, 'loss/train': 1.6670186519622803} 11/07/2021 13:13:45 - INFO - __main__ - Step 113764: {'lr': 7.038350723919246e-05, 'samples': 21842688, 'steps': 113763, 'loss/train': 0.9178605079650879} 11/07/2021 13:13:46 - INFO - __main__ - Step 113765: {'lr': 7.037981610936509e-05, 'samples': 21842880, 'steps': 113764, 'loss/train': 1.247951865196228} 11/07/2021 13:13:46 - INFO - __main__ - Step 113766: {'lr': 7.037612506047183e-05, 'samples': 21843072, 'steps': 113765, 'loss/train': 1.2684062719345093} 11/07/2021 13:13:46 - INFO - __main__ - Step 113767: {'lr': 7.037243409251429e-05, 'samples': 21843264, 'steps': 113766, 'loss/train': 1.2757117748260498} 11/07/2021 13:13:47 - INFO - __main__ - Step 113768: {'lr': 7.036874320549416e-05, 'samples': 21843456, 'steps': 113767, 'loss/train': 1.4357136487960815} 11/07/2021 13:13:48 - INFO - __main__ - Step 113769: {'lr': 7.036505239941313e-05, 'samples': 21843648, 'steps': 113768, 'loss/train': 1.2155417203903198} 11/07/2021 13:13:48 - INFO - __main__ - Step 113770: {'lr': 7.036136167427279e-05, 'samples': 21843840, 'steps': 113769, 'loss/train': 1.0957551002502441} 11/07/2021 13:13:48 - INFO - __main__ - Step 113771: {'lr': 7.03576710300749e-05, 'samples': 21844032, 'steps': 113770, 'loss/train': 0.7781845927238464} 11/07/2021 13:13:49 - INFO - __main__ - Step 113772: {'lr': 7.035398046682104e-05, 'samples': 21844224, 'steps': 113771, 'loss/train': 0.9517421126365662} 11/07/2021 13:13:49 - INFO - __main__ - Step 113773: {'lr': 7.03502899845129e-05, 'samples': 21844416, 'steps': 113772, 'loss/train': 1.4735101461410522} 11/07/2021 13:13:50 - INFO - __main__ - Step 113774: {'lr': 7.034659958315215e-05, 'samples': 21844608, 'steps': 113773, 'loss/train': 1.0905177593231201} 11/07/2021 13:13:51 - INFO - __main__ - Step 113775: {'lr': 7.034290926274054e-05, 'samples': 21844800, 'steps': 113774, 'loss/train': 0.5971431732177734} 11/07/2021 13:13:51 - INFO - __main__ - Step 113776: {'lr': 7.033921902327955e-05, 'samples': 21844992, 'steps': 113775, 'loss/train': 1.1208239793777466} 11/07/2021 13:13:51 - INFO - __main__ - Step 113777: {'lr': 7.033552886477096e-05, 'samples': 21845184, 'steps': 113776, 'loss/train': 0.9171279072761536} 11/07/2021 13:13:52 - INFO - __main__ - Step 113778: {'lr': 7.033183878721639e-05, 'samples': 21845376, 'steps': 113777, 'loss/train': 1.642511248588562} 11/07/2021 13:13:53 - INFO - __main__ - Step 113779: {'lr': 7.032814879061753e-05, 'samples': 21845568, 'steps': 113778, 'loss/train': 1.16130793094635} 11/07/2021 13:13:53 - INFO - __main__ - Step 113780: {'lr': 7.032445887497602e-05, 'samples': 21845760, 'steps': 113779, 'loss/train': 1.3705711364746094} 11/07/2021 13:13:54 - INFO - __main__ - Step 113781: {'lr': 7.032076904029356e-05, 'samples': 21845952, 'steps': 113780, 'loss/train': 0.6398329734802246} 11/07/2021 13:13:54 - INFO - __main__ - Step 113782: {'lr': 7.031707928657174e-05, 'samples': 21846144, 'steps': 113781, 'loss/train': 1.232438087463379} 11/07/2021 13:13:54 - INFO - __main__ - Step 113783: {'lr': 7.031338961381231e-05, 'samples': 21846336, 'steps': 113782, 'loss/train': 1.0066370964050293} 11/07/2021 13:13:55 - INFO - __main__ - Step 113784: {'lr': 7.03097000220169e-05, 'samples': 21846528, 'steps': 113783, 'loss/train': 1.7351970672607422} 11/07/2021 13:13:56 - INFO - __main__ - Step 113785: {'lr': 7.030601051118715e-05, 'samples': 21846720, 'steps': 113784, 'loss/train': 1.4022475481033325} 11/07/2021 13:13:56 - INFO - __main__ - Step 113786: {'lr': 7.030232108132475e-05, 'samples': 21846912, 'steps': 113785, 'loss/train': 1.2004284858703613} 11/07/2021 13:13:56 - INFO - __main__ - Step 113787: {'lr': 7.029863173243134e-05, 'samples': 21847104, 'steps': 113786, 'loss/train': 1.6122201681137085} 11/07/2021 13:13:57 - INFO - __main__ - Step 113788: {'lr': 7.029494246450869e-05, 'samples': 21847296, 'steps': 113787, 'loss/train': 1.5058261156082153} 11/07/2021 13:13:58 - INFO - __main__ - Step 113789: {'lr': 7.029125327755825e-05, 'samples': 21847488, 'steps': 113788, 'loss/train': 0.8316236138343811} 11/07/2021 13:13:58 - INFO - __main__ - Step 113790: {'lr': 7.028756417158183e-05, 'samples': 21847680, 'steps': 113789, 'loss/train': 1.207962989807129} 11/07/2021 13:13:59 - INFO - __main__ - Step 113791: {'lr': 7.028387514658105e-05, 'samples': 21847872, 'steps': 113790, 'loss/train': 1.2524081468582153} 11/07/2021 13:13:59 - INFO - __main__ - Step 113792: {'lr': 7.02801862025576e-05, 'samples': 21848064, 'steps': 113791, 'loss/train': 1.2666670083999634} 11/07/2021 13:13:59 - INFO - __main__ - Step 113793: {'lr': 7.027649733951311e-05, 'samples': 21848256, 'steps': 113792, 'loss/train': 1.7471003532409668} 11/07/2021 13:14:00 - INFO - __main__ - Step 113794: {'lr': 7.027280855744925e-05, 'samples': 21848448, 'steps': 113793, 'loss/train': 1.2287733554840088} 11/07/2021 13:14:01 - INFO - __main__ - Step 113795: {'lr': 7.02691198563677e-05, 'samples': 21848640, 'steps': 113794, 'loss/train': 1.4470957517623901} 11/07/2021 13:14:01 - INFO - __main__ - Step 113796: {'lr': 7.02654312362701e-05, 'samples': 21848832, 'steps': 113795, 'loss/train': 1.4002124071121216} 11/07/2021 13:14:02 - INFO - __main__ - Step 113797: {'lr': 7.026174269715812e-05, 'samples': 21849024, 'steps': 113796, 'loss/train': 1.4801911115646362} 11/07/2021 13:14:02 - INFO - __main__ - Step 113798: {'lr': 7.025805423903345e-05, 'samples': 21849216, 'steps': 113797, 'loss/train': 1.5340070724487305} 11/07/2021 13:14:02 - INFO - __main__ - Step 113799: {'lr': 7.025436586189771e-05, 'samples': 21849408, 'steps': 113798, 'loss/train': 1.5307730436325073} 11/07/2021 13:14:03 - INFO - __main__ - Step 113800: {'lr': 7.02506775657526e-05, 'samples': 21849600, 'steps': 113799, 'loss/train': 1.6332234144210815} 11/07/2021 13:14:04 - INFO - __main__ - Step 113801: {'lr': 7.024698935059981e-05, 'samples': 21849792, 'steps': 113800, 'loss/train': 1.7756874561309814} 11/07/2021 13:14:04 - INFO - __main__ - Step 113802: {'lr': 7.024330121644088e-05, 'samples': 21849984, 'steps': 113801, 'loss/train': 1.0190521478652954} 11/07/2021 13:14:04 - INFO - __main__ - Step 113803: {'lr': 7.023961316327756e-05, 'samples': 21850176, 'steps': 113802, 'loss/train': 1.3608232736587524} 11/07/2021 13:14:05 - INFO - __main__ - Step 113804: {'lr': 7.023592519111149e-05, 'samples': 21850368, 'steps': 113803, 'loss/train': 1.3582789897918701} 11/07/2021 13:14:06 - INFO - __main__ - Step 113805: {'lr': 7.023223729994432e-05, 'samples': 21850560, 'steps': 113804, 'loss/train': 1.3534014225006104} 11/07/2021 13:14:06 - INFO - __main__ - Step 113806: {'lr': 7.022854948977775e-05, 'samples': 21850752, 'steps': 113805, 'loss/train': 1.2957723140716553} 11/07/2021 13:14:07 - INFO - __main__ - Step 113807: {'lr': 7.022486176061344e-05, 'samples': 21850944, 'steps': 113806, 'loss/train': 1.184610366821289} 11/07/2021 13:14:07 - INFO - __main__ - Step 113808: {'lr': 7.022117411245299e-05, 'samples': 21851136, 'steps': 113807, 'loss/train': 1.999616265296936} 11/07/2021 13:14:07 - INFO - __main__ - Step 113809: {'lr': 7.021748654529813e-05, 'samples': 21851328, 'steps': 113808, 'loss/train': 1.3758302927017212} 11/07/2021 13:14:08 - INFO - __main__ - Step 113810: {'lr': 7.021379905915048e-05, 'samples': 21851520, 'steps': 113809, 'loss/train': 1.1900273561477661} 11/07/2021 13:14:09 - INFO - __main__ - Step 113811: {'lr': 7.021011165401173e-05, 'samples': 21851712, 'steps': 113810, 'loss/train': 0.12625452876091003} 11/07/2021 13:14:09 - INFO - __main__ - Step 113812: {'lr': 7.020642432988353e-05, 'samples': 21851904, 'steps': 113811, 'loss/train': 1.5572125911712646} 11/07/2021 13:14:10 - INFO - __main__ - Step 113813: {'lr': 7.020273708676756e-05, 'samples': 21852096, 'steps': 113812, 'loss/train': 0.38079625368118286} 11/07/2021 13:14:10 - INFO - __main__ - Step 113814: {'lr': 7.019904992466542e-05, 'samples': 21852288, 'steps': 113813, 'loss/train': 0.7269201874732971} 11/07/2021 13:14:11 - INFO - __main__ - Step 113815: {'lr': 7.019536284357891e-05, 'samples': 21852480, 'steps': 113814, 'loss/train': 0.6041656136512756} 11/07/2021 13:14:12 - INFO - __main__ - Step 113816: {'lr': 7.019167584350953e-05, 'samples': 21852672, 'steps': 113815, 'loss/train': 1.4825494289398193} 11/07/2021 13:14:12 - INFO - __main__ - Step 113817: {'lr': 7.018798892445899e-05, 'samples': 21852864, 'steps': 113816, 'loss/train': 1.319240689277649} 11/07/2021 13:14:12 - INFO - __main__ - Step 113818: {'lr': 7.018430208642898e-05, 'samples': 21853056, 'steps': 113817, 'loss/train': 0.8016405701637268} 11/07/2021 13:14:13 - INFO - __main__ - Step 113819: {'lr': 7.018061532942113e-05, 'samples': 21853248, 'steps': 113818, 'loss/train': 1.4368515014648438} 11/07/2021 13:14:13 - INFO - __main__ - Step 113820: {'lr': 7.017692865343714e-05, 'samples': 21853440, 'steps': 113819, 'loss/train': 1.0818849802017212} 11/07/2021 13:14:14 - INFO - __main__ - Step 113821: {'lr': 7.017324205847864e-05, 'samples': 21853632, 'steps': 113820, 'loss/train': 1.6345868110656738} 11/07/2021 13:14:14 - INFO - __main__ - Step 113822: {'lr': 7.016955554454732e-05, 'samples': 21853824, 'steps': 113821, 'loss/train': 1.592926025390625} 11/07/2021 13:14:15 - INFO - __main__ - Step 113823: {'lr': 7.01658691116448e-05, 'samples': 21854016, 'steps': 113822, 'loss/train': 1.021407127380371} 11/07/2021 13:14:15 - INFO - __main__ - Step 113824: {'lr': 7.016218275977277e-05, 'samples': 21854208, 'steps': 113823, 'loss/train': 0.9476186037063599} 11/07/2021 13:14:15 - INFO - __main__ - Step 113825: {'lr': 7.015849648893288e-05, 'samples': 21854400, 'steps': 113824, 'loss/train': 1.0149840116500854} 11/07/2021 13:14:16 - INFO - __main__ - Step 113826: {'lr': 7.015481029912682e-05, 'samples': 21854592, 'steps': 113825, 'loss/train': 1.2739371061325073} 11/07/2021 13:14:17 - INFO - __main__ - Step 113827: {'lr': 7.015112419035621e-05, 'samples': 21854784, 'steps': 113826, 'loss/train': 1.1488406658172607} 11/07/2021 13:14:17 - INFO - __main__ - Step 113828: {'lr': 7.014743816262281e-05, 'samples': 21854976, 'steps': 113827, 'loss/train': 1.3082882165908813} 11/07/2021 13:14:18 - INFO - __main__ - Step 113829: {'lr': 7.014375221592812e-05, 'samples': 21855168, 'steps': 113828, 'loss/train': 1.2503015995025635} 11/07/2021 13:14:18 - INFO - __main__ - Step 113830: {'lr': 7.014006635027387e-05, 'samples': 21855360, 'steps': 113829, 'loss/train': 1.238014817237854} 11/07/2021 13:14:18 - INFO - __main__ - Step 113831: {'lr': 7.013638056566174e-05, 'samples': 21855552, 'steps': 113830, 'loss/train': 1.109755516052246} 11/07/2021 13:14:19 - INFO - __main__ - Step 113832: {'lr': 7.01326948620934e-05, 'samples': 21855744, 'steps': 113831, 'loss/train': 1.1983745098114014} 11/07/2021 13:14:20 - INFO - __main__ - Step 113833: {'lr': 7.012900923957047e-05, 'samples': 21855936, 'steps': 113832, 'loss/train': 1.6558263301849365} 11/07/2021 13:14:20 - INFO - __main__ - Step 113834: {'lr': 7.012532369809462e-05, 'samples': 21856128, 'steps': 113833, 'loss/train': 1.6177722215652466} 11/07/2021 13:14:20 - INFO - __main__ - Step 113835: {'lr': 7.012163823766757e-05, 'samples': 21856320, 'steps': 113834, 'loss/train': 0.6748082041740417} 11/07/2021 13:14:21 - INFO - __main__ - Step 113836: {'lr': 7.011795285829089e-05, 'samples': 21856512, 'steps': 113835, 'loss/train': 0.8923110961914062} 11/07/2021 13:14:22 - INFO - __main__ - Step 113837: {'lr': 7.01142675599663e-05, 'samples': 21856704, 'steps': 113836, 'loss/train': 0.7233128547668457} 11/07/2021 13:14:22 - INFO - __main__ - Step 113838: {'lr': 7.011058234269543e-05, 'samples': 21856896, 'steps': 113837, 'loss/train': 0.9672796726226807} 11/07/2021 13:14:22 - INFO - __main__ - Step 113839: {'lr': 7.010689720647998e-05, 'samples': 21857088, 'steps': 113838, 'loss/train': 1.3769094944000244} 11/07/2021 13:14:23 - INFO - __main__ - Step 113840: {'lr': 7.010321215132159e-05, 'samples': 21857280, 'steps': 113839, 'loss/train': 1.2642841339111328} 11/07/2021 13:14:23 - INFO - __main__ - Step 113841: {'lr': 7.00995271772219e-05, 'samples': 21857472, 'steps': 113840, 'loss/train': 1.3199081420898438} 11/07/2021 13:14:24 - INFO - __main__ - Step 113842: {'lr': 7.009584228418267e-05, 'samples': 21857664, 'steps': 113841, 'loss/train': 1.36673104763031} 11/07/2021 13:14:25 - INFO - __main__ - Step 113843: {'lr': 7.009215747220538e-05, 'samples': 21857856, 'steps': 113842, 'loss/train': 1.4715745449066162} 11/07/2021 13:14:25 - INFO - __main__ - Step 113844: {'lr': 7.008847274129182e-05, 'samples': 21858048, 'steps': 113843, 'loss/train': 1.1143661737442017} 11/07/2021 13:14:25 - INFO - __main__ - Step 113845: {'lr': 7.008478809144359e-05, 'samples': 21858240, 'steps': 113844, 'loss/train': 1.3237955570220947} 11/07/2021 13:14:26 - INFO - __main__ - Step 113846: {'lr': 7.008110352266239e-05, 'samples': 21858432, 'steps': 113845, 'loss/train': 1.4261963367462158} 11/07/2021 13:14:27 - INFO - __main__ - Step 113847: {'lr': 7.007741903494987e-05, 'samples': 21858624, 'steps': 113846, 'loss/train': 0.9318341612815857} 11/07/2021 13:14:27 - INFO - __main__ - Step 113848: {'lr': 7.007373462830768e-05, 'samples': 21858816, 'steps': 113847, 'loss/train': 1.2008357048034668} 11/07/2021 13:14:28 - INFO - __main__ - Step 113849: {'lr': 7.007005030273753e-05, 'samples': 21859008, 'steps': 113848, 'loss/train': 0.6149412393569946} 11/07/2021 13:14:28 - INFO - __main__ - Step 113850: {'lr': 7.006636605824099e-05, 'samples': 21859200, 'steps': 113849, 'loss/train': 1.0357494354248047} 11/07/2021 13:14:28 - INFO - __main__ - Step 113851: {'lr': 7.006268189481979e-05, 'samples': 21859392, 'steps': 113850, 'loss/train': 1.308577537536621} 11/07/2021 13:14:29 - INFO - __main__ - Step 113852: {'lr': 7.005899781247557e-05, 'samples': 21859584, 'steps': 113851, 'loss/train': 1.1376867294311523} 11/07/2021 13:14:30 - INFO - __main__ - Step 113853: {'lr': 7.005531381120997e-05, 'samples': 21859776, 'steps': 113852, 'loss/train': 0.8340480923652649} 11/07/2021 13:14:30 - INFO - __main__ - Step 113854: {'lr': 7.005162989102467e-05, 'samples': 21859968, 'steps': 113853, 'loss/train': 1.1174639463424683} 11/07/2021 13:14:31 - INFO - __main__ - Step 113855: {'lr': 7.004794605192144e-05, 'samples': 21860160, 'steps': 113854, 'loss/train': 1.0002249479293823} 11/07/2021 13:14:31 - INFO - __main__ - Step 113856: {'lr': 7.004426229390174e-05, 'samples': 21860352, 'steps': 113855, 'loss/train': 1.4886127710342407} 11/07/2021 13:14:31 - INFO - __main__ - Step 113857: {'lr': 7.004057861696727e-05, 'samples': 21860544, 'steps': 113856, 'loss/train': 1.010086178779602} 11/07/2021 13:14:32 - INFO - __main__ - Step 113858: {'lr': 7.003689502111979e-05, 'samples': 21860736, 'steps': 113857, 'loss/train': 1.2876957654953003} 11/07/2021 13:14:33 - INFO - __main__ - Step 113859: {'lr': 7.003321150636091e-05, 'samples': 21860928, 'steps': 113858, 'loss/train': 1.1230204105377197} 11/07/2021 13:14:33 - INFO - __main__ - Step 113860: {'lr': 7.002952807269225e-05, 'samples': 21861120, 'steps': 113859, 'loss/train': 1.557190179824829} 11/07/2021 13:14:33 - INFO - __main__ - Step 113861: {'lr': 7.002584472011553e-05, 'samples': 21861312, 'steps': 113860, 'loss/train': 1.2328248023986816} 11/07/2021 13:14:34 - INFO - __main__ - Step 113862: {'lr': 7.00221614486324e-05, 'samples': 21861504, 'steps': 113861, 'loss/train': 1.20138680934906} 11/07/2021 13:14:35 - INFO - __main__ - Step 113863: {'lr': 7.001847825824448e-05, 'samples': 21861696, 'steps': 113862, 'loss/train': 1.3299864530563354} 11/07/2021 13:14:35 - INFO - __main__ - Step 113864: {'lr': 7.001479514895349e-05, 'samples': 21861888, 'steps': 113863, 'loss/train': 1.4345242977142334} 11/07/2021 13:14:36 - INFO - __main__ - Step 113865: {'lr': 7.001111212076103e-05, 'samples': 21862080, 'steps': 113864, 'loss/train': 1.234286904335022} 11/07/2021 13:14:36 - INFO - __main__ - Step 113866: {'lr': 7.000742917366878e-05, 'samples': 21862272, 'steps': 113865, 'loss/train': 1.5053704977035522} 11/07/2021 13:14:36 - INFO - __main__ - Step 113867: {'lr': 7.000374630767842e-05, 'samples': 21862464, 'steps': 113866, 'loss/train': 1.4371421337127686} 11/07/2021 13:14:37 - INFO - __main__ - Step 113868: {'lr': 7.00000635227916e-05, 'samples': 21862656, 'steps': 113867, 'loss/train': 1.4111260175704956} 11/07/2021 13:14:38 - INFO - __main__ - Step 113869: {'lr': 6.999638081901002e-05, 'samples': 21862848, 'steps': 113868, 'loss/train': 1.1894906759262085} 11/07/2021 13:14:38 - INFO - __main__ - Step 113870: {'lr': 6.999269819633525e-05, 'samples': 21863040, 'steps': 113869, 'loss/train': 1.7012755870819092} 11/07/2021 13:14:38 - INFO - __main__ - Step 113871: {'lr': 6.998901565476898e-05, 'samples': 21863232, 'steps': 113870, 'loss/train': 1.5557763576507568} 11/07/2021 13:14:39 - INFO - __main__ - Step 113872: {'lr': 6.998533319431288e-05, 'samples': 21863424, 'steps': 113871, 'loss/train': 1.309370756149292} 11/07/2021 13:14:39 - INFO - __main__ - Step 113873: {'lr': 6.998165081496863e-05, 'samples': 21863616, 'steps': 113872, 'loss/train': 1.4434969425201416} 11/07/2021 13:14:40 - INFO - __main__ - Step 113874: {'lr': 6.997796851673785e-05, 'samples': 21863808, 'steps': 113873, 'loss/train': 1.395436406135559} 11/07/2021 13:14:40 - INFO - __main__ - Step 113875: {'lr': 6.997428629962221e-05, 'samples': 21864000, 'steps': 113874, 'loss/train': 1.350899338722229} 11/07/2021 13:14:41 - INFO - __main__ - Step 113876: {'lr': 6.997060416362338e-05, 'samples': 21864192, 'steps': 113875, 'loss/train': 1.435974359512329} 11/07/2021 13:14:41 - INFO - __main__ - Step 113877: {'lr': 6.996692210874305e-05, 'samples': 21864384, 'steps': 113876, 'loss/train': 1.132426381111145} 11/07/2021 13:14:41 - INFO - __main__ - Step 113878: {'lr': 6.996324013498282e-05, 'samples': 21864576, 'steps': 113877, 'loss/train': 1.66049325466156} 11/07/2021 13:14:43 - INFO - __main__ - Step 113879: {'lr': 6.995955824234437e-05, 'samples': 21864768, 'steps': 113878, 'loss/train': 1.2456457614898682} 11/07/2021 13:14:43 - INFO - __main__ - Step 113880: {'lr': 6.99558764308294e-05, 'samples': 21864960, 'steps': 113879, 'loss/train': 1.2486915588378906} 11/07/2021 13:14:43 - INFO - __main__ - Step 113881: {'lr': 6.995219470043951e-05, 'samples': 21865152, 'steps': 113880, 'loss/train': 0.7746672034263611} 11/07/2021 13:14:44 - INFO - __main__ - Step 113882: {'lr': 6.994851305117644e-05, 'samples': 21865344, 'steps': 113881, 'loss/train': 1.5496516227722168} 11/07/2021 13:14:44 - INFO - __main__ - Step 113883: {'lr': 6.994483148304175e-05, 'samples': 21865536, 'steps': 113882, 'loss/train': 1.3364979028701782} 11/07/2021 13:14:45 - INFO - __main__ - Step 113884: {'lr': 6.994114999603713e-05, 'samples': 21865728, 'steps': 113883, 'loss/train': 0.9281055331230164} 11/07/2021 13:14:46 - INFO - __main__ - Step 113885: {'lr': 6.993746859016422e-05, 'samples': 21865920, 'steps': 113884, 'loss/train': 1.6113029718399048} 11/07/2021 13:14:46 - INFO - __main__ - Step 113886: {'lr': 6.993378726542476e-05, 'samples': 21866112, 'steps': 113885, 'loss/train': 0.36309367418289185} 11/07/2021 13:14:47 - INFO - __main__ - Step 113887: {'lr': 6.993010602182031e-05, 'samples': 21866304, 'steps': 113886, 'loss/train': 1.4138970375061035} 11/07/2021 13:14:47 - INFO - __main__ - Step 113888: {'lr': 6.992642485935261e-05, 'samples': 21866496, 'steps': 113887, 'loss/train': 1.5060310363769531} 11/07/2021 13:14:48 - INFO - __main__ - Step 113889: {'lr': 6.992274377802327e-05, 'samples': 21866688, 'steps': 113888, 'loss/train': 1.5016283988952637} 11/07/2021 13:14:48 - INFO - __main__ - Step 113890: {'lr': 6.991906277783396e-05, 'samples': 21866880, 'steps': 113889, 'loss/train': 1.606451153755188} 11/07/2021 13:14:49 - INFO - __main__ - Step 113891: {'lr': 6.991538185878634e-05, 'samples': 21867072, 'steps': 113890, 'loss/train': 1.414528489112854} 11/07/2021 13:14:49 - INFO - __main__ - Step 113892: {'lr': 6.991170102088207e-05, 'samples': 21867264, 'steps': 113891, 'loss/train': 1.1304913759231567} 11/07/2021 13:14:49 - INFO - __main__ - Step 113893: {'lr': 6.990802026412283e-05, 'samples': 21867456, 'steps': 113892, 'loss/train': 1.6834338903427124} 11/07/2021 13:14:51 - INFO - __main__ - Step 113894: {'lr': 6.990433958851023e-05, 'samples': 21867648, 'steps': 113893, 'loss/train': 1.4779751300811768} 11/07/2021 13:14:51 - INFO - __main__ - Step 113895: {'lr': 6.990065899404597e-05, 'samples': 21867840, 'steps': 113894, 'loss/train': 1.2500971555709839} 11/07/2021 13:14:51 - INFO - __main__ - Step 113896: {'lr': 6.989697848073177e-05, 'samples': 21868032, 'steps': 113895, 'loss/train': 1.1935776472091675} 11/07/2021 13:14:52 - INFO - __main__ - Step 113897: {'lr': 6.989329804856912e-05, 'samples': 21868224, 'steps': 113896, 'loss/train': 1.2594969272613525} 11/07/2021 13:14:52 - INFO - __main__ - Step 113898: {'lr': 6.988961769755978e-05, 'samples': 21868416, 'steps': 113897, 'loss/train': 1.7135833501815796} 11/07/2021 13:14:52 - INFO - __main__ - Step 113899: {'lr': 6.98859374277054e-05, 'samples': 21868608, 'steps': 113898, 'loss/train': 1.2164254188537598} 11/07/2021 13:14:53 - INFO - __main__ - Step 113900: {'lr': 6.988225723900765e-05, 'samples': 21868800, 'steps': 113899, 'loss/train': 1.8582491874694824} 11/07/2021 13:14:54 - INFO - __main__ - Step 113901: {'lr': 6.987857713146817e-05, 'samples': 21868992, 'steps': 113900, 'loss/train': 1.2578827142715454} 11/07/2021 13:14:54 - INFO - __main__ - Step 113902: {'lr': 6.98748971050886e-05, 'samples': 21869184, 'steps': 113901, 'loss/train': 1.3682135343551636} 11/07/2021 13:14:54 - INFO - __main__ - Step 113903: {'lr': 6.987121715987066e-05, 'samples': 21869376, 'steps': 113902, 'loss/train': 1.2496006488800049} 11/07/2021 13:14:55 - INFO - __main__ - Step 113904: {'lr': 6.986753729581594e-05, 'samples': 21869568, 'steps': 113903, 'loss/train': 1.0619688034057617} 11/07/2021 13:14:56 - INFO - __main__ - Step 113905: {'lr': 6.986385751292615e-05, 'samples': 21869760, 'steps': 113904, 'loss/train': 0.2686358690261841} 11/07/2021 13:14:56 - INFO - __main__ - Step 113906: {'lr': 6.986017781120291e-05, 'samples': 21869952, 'steps': 113905, 'loss/train': 1.7109932899475098} 11/07/2021 13:14:57 - INFO - __main__ - Step 113907: {'lr': 6.985649819064788e-05, 'samples': 21870144, 'steps': 113906, 'loss/train': 1.1769260168075562} 11/07/2021 13:14:57 - INFO - __main__ - Step 113908: {'lr': 6.985281865126275e-05, 'samples': 21870336, 'steps': 113907, 'loss/train': 1.3322479724884033} 11/07/2021 13:14:57 - INFO - __main__ - Step 113909: {'lr': 6.984913919304925e-05, 'samples': 21870528, 'steps': 113908, 'loss/train': 1.3689743280410767} 11/07/2021 13:14:58 - INFO - __main__ - Step 113910: {'lr': 6.984545981600884e-05, 'samples': 21870720, 'steps': 113909, 'loss/train': 0.7624500393867493} 11/07/2021 13:14:59 - INFO - __main__ - Step 113911: {'lr': 6.984178052014331e-05, 'samples': 21870912, 'steps': 113910, 'loss/train': 1.0369311571121216} 11/07/2021 13:14:59 - INFO - __main__ - Step 113912: {'lr': 6.983810130545429e-05, 'samples': 21871104, 'steps': 113911, 'loss/train': 1.6976830959320068} 11/07/2021 13:14:59 - INFO - __main__ - Step 113913: {'lr': 6.983442217194344e-05, 'samples': 21871296, 'steps': 113912, 'loss/train': 1.5253510475158691} 11/07/2021 13:15:00 - INFO - __main__ - Step 113914: {'lr': 6.983074311961244e-05, 'samples': 21871488, 'steps': 113913, 'loss/train': 1.3777883052825928} 11/07/2021 13:15:01 - INFO - __main__ - Step 113915: {'lr': 6.982706414846288e-05, 'samples': 21871680, 'steps': 113914, 'loss/train': 1.8355497121810913} 11/07/2021 13:15:01 - INFO - __main__ - Step 113916: {'lr': 6.982338525849649e-05, 'samples': 21871872, 'steps': 113915, 'loss/train': 1.551209568977356} 11/07/2021 13:15:01 - INFO - __main__ - Step 113917: {'lr': 6.981970644971492e-05, 'samples': 21872064, 'steps': 113916, 'loss/train': 1.4069017171859741} 11/07/2021 13:15:02 - INFO - __main__ - Step 113918: {'lr': 6.981602772211979e-05, 'samples': 21872256, 'steps': 113917, 'loss/train': 1.5166810750961304} 11/07/2021 13:15:02 - INFO - __main__ - Step 113919: {'lr': 6.981234907571277e-05, 'samples': 21872448, 'steps': 113918, 'loss/train': 1.608609676361084} 11/07/2021 13:15:03 - INFO - __main__ - Step 113920: {'lr': 6.98086705104956e-05, 'samples': 21872640, 'steps': 113919, 'loss/train': 0.8816307187080383} 11/07/2021 13:15:04 - INFO - __main__ - Step 113921: {'lr': 6.980499202646981e-05, 'samples': 21872832, 'steps': 113920, 'loss/train': 1.3421504497528076} 11/07/2021 13:15:04 - INFO - __main__ - Step 113922: {'lr': 6.980131362363709e-05, 'samples': 21873024, 'steps': 113921, 'loss/train': 0.7430932521820068} 11/07/2021 13:15:04 - INFO - __main__ - Step 113923: {'lr': 6.979763530199914e-05, 'samples': 21873216, 'steps': 113922, 'loss/train': 1.3061810731887817} 11/07/2021 13:15:05 - INFO - __main__ - Step 113924: {'lr': 6.979395706155758e-05, 'samples': 21873408, 'steps': 113923, 'loss/train': 1.2412447929382324} 11/07/2021 13:15:06 - INFO - __main__ - Step 113925: {'lr': 6.979027890231407e-05, 'samples': 21873600, 'steps': 113924, 'loss/train': 1.5681251287460327} 11/07/2021 13:15:06 - INFO - __main__ - Step 113926: {'lr': 6.97866008242703e-05, 'samples': 21873792, 'steps': 113925, 'loss/train': 1.3025026321411133} 11/07/2021 13:15:06 - INFO - __main__ - Step 113927: {'lr': 6.978292282742791e-05, 'samples': 21873984, 'steps': 113926, 'loss/train': 1.58919095993042} 11/07/2021 13:15:07 - INFO - __main__ - Step 113928: {'lr': 6.977924491178852e-05, 'samples': 21874176, 'steps': 113927, 'loss/train': 0.8198537230491638} 11/07/2021 13:15:07 - INFO - __main__ - Step 113929: {'lr': 6.977556707735385e-05, 'samples': 21874368, 'steps': 113928, 'loss/train': 0.7591789960861206} 11/07/2021 13:15:08 - INFO - __main__ - Step 113930: {'lr': 6.977188932412554e-05, 'samples': 21874560, 'steps': 113929, 'loss/train': 1.4028092622756958} 11/07/2021 13:15:08 - INFO - __main__ - Step 113931: {'lr': 6.976821165210528e-05, 'samples': 21874752, 'steps': 113930, 'loss/train': 1.3287237882614136} 11/07/2021 13:15:09 - INFO - __main__ - Step 113932: {'lr': 6.976453406129462e-05, 'samples': 21874944, 'steps': 113931, 'loss/train': 1.7452256679534912} 11/07/2021 13:15:09 - INFO - __main__ - Step 113933: {'lr': 6.976085655169529e-05, 'samples': 21875136, 'steps': 113932, 'loss/train': 1.1766839027404785} 11/07/2021 13:15:10 - INFO - __main__ - Step 113934: {'lr': 6.975717912330892e-05, 'samples': 21875328, 'steps': 113933, 'loss/train': 0.7455056309700012} 11/07/2021 13:15:10 - INFO - __main__ - Step 113935: {'lr': 6.975350177613718e-05, 'samples': 21875520, 'steps': 113934, 'loss/train': 1.3434779644012451} 11/07/2021 13:15:11 - INFO - __main__ - Step 113936: {'lr': 6.974982451018175e-05, 'samples': 21875712, 'steps': 113935, 'loss/train': 1.646866798400879} 11/07/2021 13:15:11 - INFO - __main__ - Step 113937: {'lr': 6.974614732544426e-05, 'samples': 21875904, 'steps': 113936, 'loss/train': 1.0135326385498047} 11/07/2021 13:15:12 - INFO - __main__ - Step 113938: {'lr': 6.974247022192636e-05, 'samples': 21876096, 'steps': 113937, 'loss/train': 1.5205442905426025} 11/07/2021 13:15:12 - INFO - __main__ - Step 113939: {'lr': 6.973879319962975e-05, 'samples': 21876288, 'steps': 113938, 'loss/train': 1.7071882486343384} 11/07/2021 13:15:12 - INFO - __main__ - Step 113940: {'lr': 6.973511625855605e-05, 'samples': 21876480, 'steps': 113939, 'loss/train': 1.202074408531189} 11/07/2021 13:15:14 - INFO - __main__ - Step 113941: {'lr': 6.973143939870691e-05, 'samples': 21876672, 'steps': 113940, 'loss/train': 1.3697891235351562} 11/07/2021 13:15:14 - INFO - __main__ - Step 113942: {'lr': 6.97277626200841e-05, 'samples': 21876864, 'steps': 113941, 'loss/train': 1.3291751146316528} 11/07/2021 13:15:14 - INFO - __main__ - Step 113943: {'lr': 6.972408592268909e-05, 'samples': 21877056, 'steps': 113942, 'loss/train': 0.8324254751205444} 11/07/2021 13:15:15 - INFO - __main__ - Step 113944: {'lr': 6.97204093065236e-05, 'samples': 21877248, 'steps': 113943, 'loss/train': 1.390617847442627} 11/07/2021 13:15:15 - INFO - __main__ - Step 113945: {'lr': 6.971673277158936e-05, 'samples': 21877440, 'steps': 113944, 'loss/train': 1.5344557762145996} 11/07/2021 13:15:15 - INFO - __main__ - Step 113946: {'lr': 6.971305631788794e-05, 'samples': 21877632, 'steps': 113945, 'loss/train': 1.7462583780288696} 11/07/2021 13:15:16 - INFO - __main__ - Step 113947: {'lr': 6.970937994542104e-05, 'samples': 21877824, 'steps': 113946, 'loss/train': 1.6579697132110596} 11/07/2021 13:15:17 - INFO - __main__ - Step 113948: {'lr': 6.970570365419032e-05, 'samples': 21878016, 'steps': 113947, 'loss/train': 2.038961887359619} 11/07/2021 13:15:17 - INFO - __main__ - Step 113949: {'lr': 6.970202744419743e-05, 'samples': 21878208, 'steps': 113948, 'loss/train': 1.2397816181182861} 11/07/2021 13:15:18 - INFO - __main__ - Step 113950: {'lr': 6.969835131544403e-05, 'samples': 21878400, 'steps': 113949, 'loss/train': 0.7079851627349854} 11/07/2021 13:15:18 - INFO - __main__ - Step 113951: {'lr': 6.969467526793174e-05, 'samples': 21878592, 'steps': 113950, 'loss/train': 1.1017800569534302} 11/07/2021 13:15:19 - INFO - __main__ - Step 113952: {'lr': 6.969099930166228e-05, 'samples': 21878784, 'steps': 113951, 'loss/train': 1.3843470811843872} 11/07/2021 13:15:19 - INFO - __main__ - Step 113953: {'lr': 6.968732341663733e-05, 'samples': 21878976, 'steps': 113952, 'loss/train': 1.405146837234497} 11/07/2021 13:15:20 - INFO - __main__ - Step 113954: {'lr': 6.968364761285842e-05, 'samples': 21879168, 'steps': 113953, 'loss/train': 1.1345176696777344} 11/07/2021 13:15:20 - INFO - __main__ - Step 113955: {'lr': 6.96799718903273e-05, 'samples': 21879360, 'steps': 113954, 'loss/train': 1.2423579692840576} 11/07/2021 13:15:20 - INFO - __main__ - Step 113956: {'lr': 6.967629624904556e-05, 'samples': 21879552, 'steps': 113955, 'loss/train': 1.7068381309509277} 11/07/2021 13:15:21 - INFO - __main__ - Step 113957: {'lr': 6.967262068901492e-05, 'samples': 21879744, 'steps': 113956, 'loss/train': 0.7507059574127197} 11/07/2021 13:15:22 - INFO - __main__ - Step 113958: {'lr': 6.966894521023704e-05, 'samples': 21879936, 'steps': 113957, 'loss/train': 1.218630075454712} 11/07/2021 13:15:22 - INFO - __main__ - Step 113959: {'lr': 6.966526981271352e-05, 'samples': 21880128, 'steps': 113958, 'loss/train': 1.2154055833816528} 11/07/2021 13:15:22 - INFO - __main__ - Step 113960: {'lr': 6.966159449644605e-05, 'samples': 21880320, 'steps': 113959, 'loss/train': 1.3143056631088257} 11/07/2021 13:15:23 - INFO - __main__ - Step 113961: {'lr': 6.965791926143627e-05, 'samples': 21880512, 'steps': 113960, 'loss/train': 1.438015341758728} 11/07/2021 13:15:24 - INFO - __main__ - Step 113962: {'lr': 6.965424410768587e-05, 'samples': 21880704, 'steps': 113961, 'loss/train': 1.512374758720398} 11/07/2021 13:15:24 - INFO - __main__ - Step 113963: {'lr': 6.965056903519648e-05, 'samples': 21880896, 'steps': 113962, 'loss/train': 1.7346247434616089} 11/07/2021 13:15:24 - INFO - __main__ - Step 113964: {'lr': 6.964689404396981e-05, 'samples': 21881088, 'steps': 113963, 'loss/train': 1.4077738523483276} 11/07/2021 13:15:25 - INFO - __main__ - Step 113965: {'lr': 6.964321913400742e-05, 'samples': 21881280, 'steps': 113964, 'loss/train': 1.2901335954666138} 11/07/2021 13:15:25 - INFO - __main__ - Step 113966: {'lr': 6.963954430531103e-05, 'samples': 21881472, 'steps': 113965, 'loss/train': 1.2894080877304077} 11/07/2021 13:15:26 - INFO - __main__ - Step 113967: {'lr': 6.963586955788224e-05, 'samples': 21881664, 'steps': 113966, 'loss/train': 1.5476675033569336} 11/07/2021 13:15:27 - INFO - __main__ - Step 113968: {'lr': 6.963219489172276e-05, 'samples': 21881856, 'steps': 113967, 'loss/train': 1.4174528121948242} 11/07/2021 13:15:27 - INFO - __main__ - Step 113969: {'lr': 6.962852030683423e-05, 'samples': 21882048, 'steps': 113968, 'loss/train': 1.0813630819320679} 11/07/2021 13:15:27 - INFO - __main__ - Step 113970: {'lr': 6.962484580321829e-05, 'samples': 21882240, 'steps': 113969, 'loss/train': 1.0974828004837036} 11/07/2021 13:15:28 - INFO - __main__ - Step 113971: {'lr': 6.962117138087662e-05, 'samples': 21882432, 'steps': 113970, 'loss/train': 1.0983787775039673} 11/07/2021 13:15:28 - INFO - __main__ - Step 113972: {'lr': 6.961749703981087e-05, 'samples': 21882624, 'steps': 113971, 'loss/train': 1.6683170795440674} 11/07/2021 13:15:29 - INFO - __main__ - Step 113973: {'lr': 6.96138227800227e-05, 'samples': 21882816, 'steps': 113972, 'loss/train': 1.3637537956237793} 11/07/2021 13:15:29 - INFO - __main__ - Step 113974: {'lr': 6.961014860151376e-05, 'samples': 21883008, 'steps': 113973, 'loss/train': 1.3947583436965942} 11/07/2021 13:15:30 - INFO - __main__ - Step 113975: {'lr': 6.96064745042857e-05, 'samples': 21883200, 'steps': 113974, 'loss/train': 0.8978440761566162} 11/07/2021 13:15:30 - INFO - __main__ - Step 113976: {'lr': 6.960280048834025e-05, 'samples': 21883392, 'steps': 113975, 'loss/train': 1.4549392461776733} 11/07/2021 13:15:31 - INFO - __main__ - Step 113977: {'lr': 6.959912655367892e-05, 'samples': 21883584, 'steps': 113976, 'loss/train': 0.5486623048782349} 11/07/2021 13:15:32 - INFO - __main__ - Step 113978: {'lr': 6.959545270030343e-05, 'samples': 21883776, 'steps': 113977, 'loss/train': 1.7167960405349731} 11/07/2021 13:15:32 - INFO - __main__ - Step 113979: {'lr': 6.959177892821544e-05, 'samples': 21883968, 'steps': 113978, 'loss/train': 1.3181908130645752} 11/07/2021 13:15:32 - INFO - __main__ - Step 113980: {'lr': 6.958810523741663e-05, 'samples': 21884160, 'steps': 113979, 'loss/train': 1.2694332599639893} 11/07/2021 13:15:33 - INFO - __main__ - Step 113981: {'lr': 6.958443162790864e-05, 'samples': 21884352, 'steps': 113980, 'loss/train': 0.8062087297439575} 11/07/2021 13:15:33 - INFO - __main__ - Step 113982: {'lr': 6.958075809969311e-05, 'samples': 21884544, 'steps': 113981, 'loss/train': 1.408117413520813} 11/07/2021 13:15:34 - INFO - __main__ - Step 113983: {'lr': 6.95770846527717e-05, 'samples': 21884736, 'steps': 113982, 'loss/train': 1.2928086519241333} 11/07/2021 13:15:35 - INFO - __main__ - Step 113984: {'lr': 6.957341128714608e-05, 'samples': 21884928, 'steps': 113983, 'loss/train': 1.4444695711135864} 11/07/2021 13:15:35 - INFO - __main__ - Step 113985: {'lr': 6.956973800281791e-05, 'samples': 21885120, 'steps': 113984, 'loss/train': 0.627788782119751} 11/07/2021 13:15:35 - INFO - __main__ - Step 113986: {'lr': 6.95660647997888e-05, 'samples': 21885312, 'steps': 113985, 'loss/train': 1.400361180305481} 11/07/2021 13:15:36 - INFO - __main__ - Step 113987: {'lr': 6.956239167806048e-05, 'samples': 21885504, 'steps': 113986, 'loss/train': 1.0480142831802368} 11/07/2021 13:15:37 - INFO - __main__ - Step 113988: {'lr': 6.955871863763452e-05, 'samples': 21885696, 'steps': 113987, 'loss/train': 1.5604304075241089} 11/07/2021 13:15:37 - INFO - __main__ - Step 113989: {'lr': 6.955504567851264e-05, 'samples': 21885888, 'steps': 113988, 'loss/train': 1.3667006492614746} 11/07/2021 13:15:37 - INFO - __main__ - Step 113990: {'lr': 6.955137280069653e-05, 'samples': 21886080, 'steps': 113989, 'loss/train': 1.6208546161651611} 11/07/2021 13:15:38 - INFO - __main__ - Step 113991: {'lr': 6.954770000418773e-05, 'samples': 21886272, 'steps': 113990, 'loss/train': 1.4368860721588135} 11/07/2021 13:15:38 - INFO - __main__ - Step 113992: {'lr': 6.954402728898796e-05, 'samples': 21886464, 'steps': 113991, 'loss/train': 0.9772302508354187} 11/07/2021 13:15:38 - INFO - __main__ - Step 113993: {'lr': 6.954035465509884e-05, 'samples': 21886656, 'steps': 113992, 'loss/train': 1.2296479940414429} 11/07/2021 13:15:39 - INFO - __main__ - Step 113994: {'lr': 6.953668210252207e-05, 'samples': 21886848, 'steps': 113993, 'loss/train': 1.5099835395812988} 11/07/2021 13:15:40 - INFO - __main__ - Step 113995: {'lr': 6.953300963125928e-05, 'samples': 21887040, 'steps': 113994, 'loss/train': 1.514382243156433} 11/07/2021 13:15:40 - INFO - __main__ - Step 113996: {'lr': 6.952933724131211e-05, 'samples': 21887232, 'steps': 113995, 'loss/train': 1.5323938131332397} 11/07/2021 13:15:40 - INFO - __main__ - Step 113997: {'lr': 6.952566493268225e-05, 'samples': 21887424, 'steps': 113996, 'loss/train': 1.4376972913742065} 11/07/2021 13:15:41 - INFO - __main__ - Step 113998: {'lr': 6.952199270537136e-05, 'samples': 21887616, 'steps': 113997, 'loss/train': 1.2878566980361938} 11/07/2021 13:15:42 - INFO - __main__ - Step 113999: {'lr': 6.951832055938106e-05, 'samples': 21887808, 'steps': 113998, 'loss/train': 1.3506247997283936} 11/07/2021 13:15:42 - INFO - __main__ - Step 114000: {'lr': 6.9514648494713e-05, 'samples': 21888000, 'steps': 113999, 'loss/train': 1.587280511856079} 11/07/2021 13:15:43 - INFO - __main__ - Step 114001: {'lr': 6.95109765113689e-05, 'samples': 21888192, 'steps': 114000, 'loss/train': 0.9825098514556885} 11/07/2021 13:15:43 - INFO - __main__ - Step 114002: {'lr': 6.950730460935034e-05, 'samples': 21888384, 'steps': 114001, 'loss/train': 0.8945605158805847} 11/07/2021 13:15:43 - INFO - __main__ - Step 114003: {'lr': 6.950363278865909e-05, 'samples': 21888576, 'steps': 114002, 'loss/train': 1.700819730758667} 11/07/2021 13:15:44 - INFO - __main__ - Step 114004: {'lr': 6.949996104929663e-05, 'samples': 21888768, 'steps': 114003, 'loss/train': 1.2189844846725464} 11/07/2021 13:15:45 - INFO - __main__ - Step 114005: {'lr': 6.949628939126471e-05, 'samples': 21888960, 'steps': 114004, 'loss/train': 0.8596415519714355} 11/07/2021 13:15:45 - INFO - __main__ - Step 114006: {'lr': 6.949261781456497e-05, 'samples': 21889152, 'steps': 114005, 'loss/train': 2.5953385829925537} 11/07/2021 13:15:45 - INFO - __main__ - Step 114007: {'lr': 6.948894631919908e-05, 'samples': 21889344, 'steps': 114006, 'loss/train': 1.1926482915878296} 11/07/2021 13:15:46 - INFO - __main__ - Step 114008: {'lr': 6.948527490516867e-05, 'samples': 21889536, 'steps': 114007, 'loss/train': 1.7131853103637695} 11/07/2021 13:15:47 - INFO - __main__ - Step 114009: {'lr': 6.948160357247543e-05, 'samples': 21889728, 'steps': 114008, 'loss/train': 1.5584473609924316} 11/07/2021 13:15:47 - INFO - __main__ - Step 114010: {'lr': 6.947793232112098e-05, 'samples': 21889920, 'steps': 114009, 'loss/train': 1.3720465898513794} 11/07/2021 13:15:47 - INFO - __main__ - Step 114011: {'lr': 6.9474261151107e-05, 'samples': 21890112, 'steps': 114010, 'loss/train': 1.2762490510940552} 11/07/2021 13:15:48 - INFO - __main__ - Step 114012: {'lr': 6.947059006243511e-05, 'samples': 21890304, 'steps': 114011, 'loss/train': 1.0678989887237549} 11/07/2021 13:15:48 - INFO - __main__ - Step 114013: {'lr': 6.946691905510702e-05, 'samples': 21890496, 'steps': 114012, 'loss/train': 1.2235913276672363} 11/07/2021 13:15:49 - INFO - __main__ - Step 114014: {'lr': 6.946324812912433e-05, 'samples': 21890688, 'steps': 114013, 'loss/train': 1.2290784120559692} 11/07/2021 13:15:50 - INFO - __main__ - Step 114015: {'lr': 6.945957728448871e-05, 'samples': 21890880, 'steps': 114014, 'loss/train': 1.0904992818832397} 11/07/2021 13:15:50 - INFO - __main__ - Step 114016: {'lr': 6.945590652120182e-05, 'samples': 21891072, 'steps': 114015, 'loss/train': 0.8923799395561218} 11/07/2021 13:15:50 - INFO - __main__ - Step 114017: {'lr': 6.945223583926538e-05, 'samples': 21891264, 'steps': 114016, 'loss/train': 1.2681546211242676} 11/07/2021 13:15:51 - INFO - __main__ - Step 114018: {'lr': 6.944856523868092e-05, 'samples': 21891456, 'steps': 114017, 'loss/train': 1.380521297454834} 11/07/2021 13:15:51 - INFO - __main__ - Step 114019: {'lr': 6.944489471945015e-05, 'samples': 21891648, 'steps': 114018, 'loss/train': 1.264206886291504} 11/07/2021 13:15:52 - INFO - __main__ - Step 114020: {'lr': 6.944122428157473e-05, 'samples': 21891840, 'steps': 114019, 'loss/train': 1.1947999000549316} 11/07/2021 13:15:53 - INFO - __main__ - Step 114021: {'lr': 6.94375539250563e-05, 'samples': 21892032, 'steps': 114020, 'loss/train': 1.437972903251648} 11/07/2021 13:15:53 - INFO - __main__ - Step 114022: {'lr': 6.94338836498965e-05, 'samples': 21892224, 'steps': 114021, 'loss/train': 1.2531437873840332} 11/07/2021 13:15:53 - INFO - __main__ - Step 114023: {'lr': 6.943021345609704e-05, 'samples': 21892416, 'steps': 114022, 'loss/train': 0.3852730095386505} 11/07/2021 13:15:54 - INFO - __main__ - Step 114024: {'lr': 6.94265433436595e-05, 'samples': 21892608, 'steps': 114023, 'loss/train': 0.758378803730011} 11/07/2021 13:15:55 - INFO - __main__ - Step 114025: {'lr': 6.942287331258562e-05, 'samples': 21892800, 'steps': 114024, 'loss/train': 1.1645917892456055} 11/07/2021 13:15:55 - INFO - __main__ - Step 114026: {'lr': 6.941920336287696e-05, 'samples': 21892992, 'steps': 114025, 'loss/train': 1.2857370376586914} 11/07/2021 13:15:56 - INFO - __main__ - Step 114027: {'lr': 6.941553349453525e-05, 'samples': 21893184, 'steps': 114026, 'loss/train': 1.4859120845794678} 11/07/2021 13:15:56 - INFO - __main__ - Step 114028: {'lr': 6.941186370756211e-05, 'samples': 21893376, 'steps': 114027, 'loss/train': 1.5789124965667725} 11/07/2021 13:15:56 - INFO - __main__ - Step 114029: {'lr': 6.940819400195919e-05, 'samples': 21893568, 'steps': 114028, 'loss/train': 1.7919678688049316} 11/07/2021 13:15:57 - INFO - __main__ - Step 114030: {'lr': 6.940452437772824e-05, 'samples': 21893760, 'steps': 114029, 'loss/train': 0.8108546733856201} 11/07/2021 13:15:58 - INFO - __main__ - Step 114031: {'lr': 6.940085483487074e-05, 'samples': 21893952, 'steps': 114030, 'loss/train': 1.6766357421875} 11/07/2021 13:15:58 - INFO - __main__ - Step 114032: {'lr': 6.939718537338843e-05, 'samples': 21894144, 'steps': 114031, 'loss/train': 1.1721701622009277} 11/07/2021 13:15:58 - INFO - __main__ - Step 114033: {'lr': 6.939351599328298e-05, 'samples': 21894336, 'steps': 114032, 'loss/train': 1.3842936754226685} 11/07/2021 13:15:59 - INFO - __main__ - Step 114034: {'lr': 6.9389846694556e-05, 'samples': 21894528, 'steps': 114033, 'loss/train': 1.0456808805465698} 11/07/2021 13:16:00 - INFO - __main__ - Step 114035: {'lr': 6.938617747720916e-05, 'samples': 21894720, 'steps': 114034, 'loss/train': 1.1043201684951782} 11/07/2021 13:16:00 - INFO - __main__ - Step 114036: {'lr': 6.938250834124413e-05, 'samples': 21894912, 'steps': 114035, 'loss/train': 1.733001947402954} 11/07/2021 13:16:00 - INFO - __main__ - Step 114037: {'lr': 6.937883928666256e-05, 'samples': 21895104, 'steps': 114036, 'loss/train': 1.572912573814392} 11/07/2021 13:16:01 - INFO - __main__ - Step 114038: {'lr': 6.937517031346611e-05, 'samples': 21895296, 'steps': 114037, 'loss/train': 1.5119916200637817} 11/07/2021 13:16:01 - INFO - __main__ - Step 114039: {'lr': 6.93715014216564e-05, 'samples': 21895488, 'steps': 114038, 'loss/train': 0.16592279076576233} 11/07/2021 13:16:02 - INFO - __main__ - Step 114040: {'lr': 6.936783261123511e-05, 'samples': 21895680, 'steps': 114039, 'loss/train': 1.2500503063201904} 11/07/2021 13:16:03 - INFO - __main__ - Step 114041: {'lr': 6.93641638822039e-05, 'samples': 21895872, 'steps': 114040, 'loss/train': 1.404416561126709} 11/07/2021 13:16:03 - INFO - __main__ - Step 114042: {'lr': 6.936049523456439e-05, 'samples': 21896064, 'steps': 114041, 'loss/train': 1.1731359958648682} 11/07/2021 13:16:03 - INFO - __main__ - Step 114043: {'lr': 6.935682666831836e-05, 'samples': 21896256, 'steps': 114042, 'loss/train': 1.1424353122711182} 11/07/2021 13:16:04 - INFO - __main__ - Step 114044: {'lr': 6.935315818346725e-05, 'samples': 21896448, 'steps': 114043, 'loss/train': 1.1629620790481567} 11/07/2021 13:16:04 - INFO - __main__ - Step 114045: {'lr': 6.934948978001281e-05, 'samples': 21896640, 'steps': 114044, 'loss/train': 1.0727425813674927} 11/07/2021 13:16:05 - INFO - __main__ - Step 114046: {'lr': 6.934582145795673e-05, 'samples': 21896832, 'steps': 114045, 'loss/train': 1.0496026277542114} 11/07/2021 13:16:05 - INFO - __main__ - Step 114047: {'lr': 6.934215321730064e-05, 'samples': 21897024, 'steps': 114046, 'loss/train': 1.5608587265014648} 11/07/2021 13:16:06 - INFO - __main__ - Step 114048: {'lr': 6.933848505804616e-05, 'samples': 21897216, 'steps': 114047, 'loss/train': 1.762201189994812} 11/07/2021 13:16:06 - INFO - __main__ - Step 114049: {'lr': 6.9334816980195e-05, 'samples': 21897408, 'steps': 114048, 'loss/train': 1.3935997486114502} 11/07/2021 13:16:06 - INFO - __main__ - Step 114050: {'lr': 6.933114898374876e-05, 'samples': 21897600, 'steps': 114049, 'loss/train': 1.3764725923538208} 11/07/2021 13:16:07 - INFO - __main__ - Step 114051: {'lr': 6.932748106870912e-05, 'samples': 21897792, 'steps': 114050, 'loss/train': 1.1502143144607544} 11/07/2021 13:16:08 - INFO - __main__ - Step 114052: {'lr': 6.932381323507775e-05, 'samples': 21897984, 'steps': 114051, 'loss/train': 0.932966947555542} 11/07/2021 13:16:08 - INFO - __main__ - Step 114053: {'lr': 6.932014548285625e-05, 'samples': 21898176, 'steps': 114052, 'loss/train': 1.3280091285705566} 11/07/2021 13:16:09 - INFO - __main__ - Step 114054: {'lr': 6.931647781204633e-05, 'samples': 21898368, 'steps': 114053, 'loss/train': 1.3866246938705444} 11/07/2021 13:16:09 - INFO - __main__ - Step 114055: {'lr': 6.93128102226496e-05, 'samples': 21898560, 'steps': 114054, 'loss/train': 1.3089373111724854} 11/07/2021 13:16:10 - INFO - __main__ - Step 114056: {'lr': 6.930914271466776e-05, 'samples': 21898752, 'steps': 114055, 'loss/train': 0.5778146982192993} 11/07/2021 13:16:10 - INFO - __main__ - Step 114057: {'lr': 6.930547528810247e-05, 'samples': 21898944, 'steps': 114056, 'loss/train': 1.0432484149932861} 11/07/2021 13:16:11 - INFO - __main__ - Step 114058: {'lr': 6.930180794295529e-05, 'samples': 21899136, 'steps': 114057, 'loss/train': 1.07554030418396} 11/07/2021 13:16:11 - INFO - __main__ - Step 114059: {'lr': 6.929814067922794e-05, 'samples': 21899328, 'steps': 114058, 'loss/train': 0.87868332862854} 11/07/2021 13:16:11 - INFO - __main__ - Step 114060: {'lr': 6.929447349692203e-05, 'samples': 21899520, 'steps': 114059, 'loss/train': 1.2259763479232788} 11/07/2021 13:16:12 - INFO - __main__ - Step 114061: {'lr': 6.929080639603924e-05, 'samples': 21899712, 'steps': 114060, 'loss/train': 1.6063600778579712} 11/07/2021 13:16:13 - INFO - __main__ - Step 114062: {'lr': 6.928713937658124e-05, 'samples': 21899904, 'steps': 114061, 'loss/train': 1.2978150844573975} 11/07/2021 13:16:13 - INFO - __main__ - Step 114063: {'lr': 6.928347243854966e-05, 'samples': 21900096, 'steps': 114062, 'loss/train': 1.4677410125732422} 11/07/2021 13:16:13 - INFO - __main__ - Step 114064: {'lr': 6.927980558194616e-05, 'samples': 21900288, 'steps': 114063, 'loss/train': 0.14004464447498322} 11/07/2021 13:16:14 - INFO - __main__ - Step 114065: {'lr': 6.927613880677238e-05, 'samples': 21900480, 'steps': 114064, 'loss/train': 1.7615039348602295} 11/07/2021 13:16:15 - INFO - __main__ - Step 114066: {'lr': 6.927247211303001e-05, 'samples': 21900672, 'steps': 114065, 'loss/train': 1.1601307392120361} 11/07/2021 13:16:15 - INFO - __main__ - Step 114067: {'lr': 6.926880550072065e-05, 'samples': 21900864, 'steps': 114066, 'loss/train': 1.2063465118408203} 11/07/2021 13:16:15 - INFO - __main__ - Step 114068: {'lr': 6.926513896984602e-05, 'samples': 21901056, 'steps': 114067, 'loss/train': 1.38154935836792} 11/07/2021 13:16:16 - INFO - __main__ - Step 114069: {'lr': 6.926147252040768e-05, 'samples': 21901248, 'steps': 114068, 'loss/train': 1.3503249883651733} 11/07/2021 13:16:16 - INFO - __main__ - Step 114070: {'lr': 6.925780615240742e-05, 'samples': 21901440, 'steps': 114069, 'loss/train': 1.0787084102630615} 11/07/2021 13:16:17 - INFO - __main__ - Step 114071: {'lr': 6.925413986584675e-05, 'samples': 21901632, 'steps': 114070, 'loss/train': 1.082156777381897} 11/07/2021 13:16:17 - INFO - __main__ - Step 114072: {'lr': 6.925047366072734e-05, 'samples': 21901824, 'steps': 114071, 'loss/train': 0.5948973894119263} 11/07/2021 13:16:18 - INFO - __main__ - Step 114073: {'lr': 6.92468075370509e-05, 'samples': 21902016, 'steps': 114072, 'loss/train': 0.3876270055770874} 11/07/2021 13:16:18 - INFO - __main__ - Step 114074: {'lr': 6.924314149481905e-05, 'samples': 21902208, 'steps': 114073, 'loss/train': 1.2417924404144287} 11/07/2021 13:16:19 - INFO - __main__ - Step 114075: {'lr': 6.923947553403345e-05, 'samples': 21902400, 'steps': 114074, 'loss/train': 1.6055206060409546} 11/07/2021 13:16:20 - INFO - __main__ - Step 114076: {'lr': 6.923580965469578e-05, 'samples': 21902592, 'steps': 114075, 'loss/train': 1.4018094539642334} 11/07/2021 13:16:20 - INFO - __main__ - Step 114077: {'lr': 6.923214385680765e-05, 'samples': 21902784, 'steps': 114076, 'loss/train': 1.2456036806106567} 11/07/2021 13:16:20 - INFO - __main__ - Step 114078: {'lr': 6.92284781403707e-05, 'samples': 21902976, 'steps': 114077, 'loss/train': 1.1787583827972412} 11/07/2021 13:16:21 - INFO - __main__ - Step 114079: {'lr': 6.922481250538665e-05, 'samples': 21903168, 'steps': 114078, 'loss/train': 1.1905434131622314} 11/07/2021 13:16:21 - INFO - __main__ - Step 114080: {'lr': 6.922114695185708e-05, 'samples': 21903360, 'steps': 114079, 'loss/train': 1.0991235971450806} 11/07/2021 13:16:22 - INFO - __main__ - Step 114081: {'lr': 6.921748147978368e-05, 'samples': 21903552, 'steps': 114080, 'loss/train': 1.9049072265625} 11/07/2021 13:16:22 - INFO - __main__ - Step 114082: {'lr': 6.92138160891681e-05, 'samples': 21903744, 'steps': 114081, 'loss/train': 0.25391680002212524} 11/07/2021 13:16:23 - INFO - __main__ - Step 114083: {'lr': 6.921015078001197e-05, 'samples': 21903936, 'steps': 114082, 'loss/train': 0.9210821986198425} 11/07/2021 13:16:23 - INFO - __main__ - Step 114084: {'lr': 6.920648555231704e-05, 'samples': 21904128, 'steps': 114083, 'loss/train': 1.3422420024871826} 11/07/2021 13:16:23 - INFO - __main__ - Step 114085: {'lr': 6.92028204060848e-05, 'samples': 21904320, 'steps': 114084, 'loss/train': 1.326611042022705} 11/07/2021 13:16:24 - INFO - __main__ - Step 114086: {'lr': 6.919915534131698e-05, 'samples': 21904512, 'steps': 114085, 'loss/train': 1.302201747894287} 11/07/2021 13:16:25 - INFO - __main__ - Step 114087: {'lr': 6.919549035801522e-05, 'samples': 21904704, 'steps': 114086, 'loss/train': 1.4721338748931885} 11/07/2021 13:16:25 - INFO - __main__ - Step 114088: {'lr': 6.919182545618121e-05, 'samples': 21904896, 'steps': 114087, 'loss/train': 1.2280991077423096} 11/07/2021 13:16:26 - INFO - __main__ - Step 114089: {'lr': 6.918816063581657e-05, 'samples': 21905088, 'steps': 114088, 'loss/train': 1.270735263824463} 11/07/2021 13:16:26 - INFO - __main__ - Step 114090: {'lr': 6.918449589692294e-05, 'samples': 21905280, 'steps': 114089, 'loss/train': 1.354485034942627} 11/07/2021 13:16:26 - INFO - __main__ - Step 114091: {'lr': 6.918083123950198e-05, 'samples': 21905472, 'steps': 114090, 'loss/train': 0.9975744485855103} 11/07/2021 13:16:27 - INFO - __main__ - Step 114092: {'lr': 6.917716666355536e-05, 'samples': 21905664, 'steps': 114091, 'loss/train': 0.723375678062439} 11/07/2021 13:16:28 - INFO - __main__ - Step 114093: {'lr': 6.917350216908471e-05, 'samples': 21905856, 'steps': 114092, 'loss/train': 1.7579412460327148} 11/07/2021 13:16:28 - INFO - __main__ - Step 114094: {'lr': 6.91698377560917e-05, 'samples': 21906048, 'steps': 114093, 'loss/train': 0.9658165574073792} 11/07/2021 13:16:28 - INFO - __main__ - Step 114095: {'lr': 6.916617342457796e-05, 'samples': 21906240, 'steps': 114094, 'loss/train': 1.1166120767593384} 11/07/2021 13:16:29 - INFO - __main__ - Step 114096: {'lr': 6.916250917454516e-05, 'samples': 21906432, 'steps': 114095, 'loss/train': 1.145221471786499} 11/07/2021 13:16:30 - INFO - __main__ - Step 114097: {'lr': 6.9158845005995e-05, 'samples': 21906624, 'steps': 114096, 'loss/train': 1.2187892198562622} 11/07/2021 13:16:30 - INFO - __main__ - Step 114098: {'lr': 6.915518091892903e-05, 'samples': 21906816, 'steps': 114097, 'loss/train': 1.093135952949524} 11/07/2021 13:16:30 - INFO - __main__ - Step 114099: {'lr': 6.915151691334892e-05, 'samples': 21907008, 'steps': 114098, 'loss/train': 1.7930405139923096} 11/07/2021 13:16:31 - INFO - __main__ - Step 114100: {'lr': 6.914785298925636e-05, 'samples': 21907200, 'steps': 114099, 'loss/train': 1.5388518571853638} 11/07/2021 13:16:31 - INFO - __main__ - Step 114101: {'lr': 6.914418914665299e-05, 'samples': 21907392, 'steps': 114100, 'loss/train': 1.2197548151016235} 11/07/2021 13:16:32 - INFO - __main__ - Step 114102: {'lr': 6.914052538554044e-05, 'samples': 21907584, 'steps': 114101, 'loss/train': 1.5836763381958008} 11/07/2021 13:16:33 - INFO - __main__ - Step 114103: {'lr': 6.913686170592037e-05, 'samples': 21907776, 'steps': 114102, 'loss/train': 1.172329306602478} 11/07/2021 13:16:33 - INFO - __main__ - Step 114104: {'lr': 6.913319810779448e-05, 'samples': 21907968, 'steps': 114103, 'loss/train': 1.3993885517120361} 11/07/2021 13:16:33 - INFO - __main__ - Step 114105: {'lr': 6.912953459116433e-05, 'samples': 21908160, 'steps': 114104, 'loss/train': 0.9833842515945435} 11/07/2021 13:16:34 - INFO - __main__ - Step 114106: {'lr': 6.912587115603167e-05, 'samples': 21908352, 'steps': 114105, 'loss/train': 1.3030822277069092} 11/07/2021 13:16:35 - INFO - __main__ - Step 114107: {'lr': 6.912220780239806e-05, 'samples': 21908544, 'steps': 114106, 'loss/train': 0.7581377625465393} 11/07/2021 13:16:35 - INFO - __main__ - Step 114108: {'lr': 6.911854453026522e-05, 'samples': 21908736, 'steps': 114107, 'loss/train': 1.2935971021652222} 11/07/2021 13:16:36 - INFO - __main__ - Step 114109: {'lr': 6.911488133963475e-05, 'samples': 21908928, 'steps': 114108, 'loss/train': 0.8796278238296509} 11/07/2021 13:16:36 - INFO - __main__ - Step 114110: {'lr': 6.911121823050834e-05, 'samples': 21909120, 'steps': 114109, 'loss/train': 1.0611026287078857} 11/07/2021 13:16:36 - INFO - __main__ - Step 114111: {'lr': 6.91075552028877e-05, 'samples': 21909312, 'steps': 114110, 'loss/train': 1.192522406578064} 11/07/2021 13:16:37 - INFO - __main__ - Step 114112: {'lr': 6.910389225677433e-05, 'samples': 21909504, 'steps': 114111, 'loss/train': 1.3271160125732422} 11/07/2021 13:16:38 - INFO - __main__ - Step 114113: {'lr': 6.910022939216994e-05, 'samples': 21909696, 'steps': 114112, 'loss/train': 1.4163955450057983} 11/07/2021 13:16:38 - INFO - __main__ - Step 114114: {'lr': 6.90965666090762e-05, 'samples': 21909888, 'steps': 114113, 'loss/train': 1.8373994827270508} 11/07/2021 13:16:39 - INFO - __main__ - Step 114115: {'lr': 6.909290390749479e-05, 'samples': 21910080, 'steps': 114114, 'loss/train': 0.9219862222671509} 11/07/2021 13:16:39 - INFO - __main__ - Step 114116: {'lr': 6.908924128742727e-05, 'samples': 21910272, 'steps': 114115, 'loss/train': 1.5515995025634766} 11/07/2021 13:16:40 - INFO - __main__ - Step 114117: {'lr': 6.908557874887538e-05, 'samples': 21910464, 'steps': 114116, 'loss/train': 1.3725230693817139} 11/07/2021 13:16:40 - INFO - __main__ - Step 114118: {'lr': 6.908191629184074e-05, 'samples': 21910656, 'steps': 114117, 'loss/train': 1.413751482963562} 11/07/2021 13:16:41 - INFO - __main__ - Step 114119: {'lr': 6.907825391632497e-05, 'samples': 21910848, 'steps': 114118, 'loss/train': 1.3290903568267822} 11/07/2021 13:16:41 - INFO - __main__ - Step 114120: {'lr': 6.907459162232976e-05, 'samples': 21911040, 'steps': 114119, 'loss/train': 1.7717336416244507} 11/07/2021 13:16:41 - INFO - __main__ - Step 114121: {'lr': 6.907092940985676e-05, 'samples': 21911232, 'steps': 114120, 'loss/train': 1.4826996326446533} 11/07/2021 13:16:42 - INFO - __main__ - Step 114122: {'lr': 6.906726727890758e-05, 'samples': 21911424, 'steps': 114121, 'loss/train': 1.3241138458251953} 11/07/2021 13:16:43 - INFO - __main__ - Step 114123: {'lr': 6.906360522948393e-05, 'samples': 21911616, 'steps': 114122, 'loss/train': 1.5880552530288696} 11/07/2021 13:16:44 - INFO - __main__ - Step 114124: {'lr': 6.905994326158748e-05, 'samples': 21911808, 'steps': 114123, 'loss/train': 1.104339838027954} 11/07/2021 13:16:44 - INFO - __main__ - Step 114125: {'lr': 6.905628137521977e-05, 'samples': 21912000, 'steps': 114124, 'loss/train': 0.8546680808067322} 11/07/2021 13:16:44 - INFO - __main__ - Step 114126: {'lr': 6.905261957038251e-05, 'samples': 21912192, 'steps': 114125, 'loss/train': 2.022071599960327} 11/07/2021 13:16:45 - INFO - __main__ - Step 114127: {'lr': 6.904895784707732e-05, 'samples': 21912384, 'steps': 114126, 'loss/train': 2.0247414112091064} 11/07/2021 13:16:45 - INFO - __main__ - Step 114128: {'lr': 6.904529620530589e-05, 'samples': 21912576, 'steps': 114127, 'loss/train': 1.6101555824279785} 11/07/2021 13:16:46 - INFO - __main__ - Step 114129: {'lr': 6.904163464506985e-05, 'samples': 21912768, 'steps': 114128, 'loss/train': 1.552188515663147} 11/07/2021 13:16:46 - INFO - __main__ - Step 114130: {'lr': 6.903797316637086e-05, 'samples': 21912960, 'steps': 114129, 'loss/train': 1.4589005708694458} 11/07/2021 13:16:47 - INFO - __main__ - Step 114131: {'lr': 6.903431176921058e-05, 'samples': 21913152, 'steps': 114130, 'loss/train': 1.5300484895706177} 11/07/2021 13:16:47 - INFO - __main__ - Step 114132: {'lr': 6.903065045359064e-05, 'samples': 21913344, 'steps': 114131, 'loss/train': 1.4074177742004395} 11/07/2021 13:16:47 - INFO - __main__ - Step 114133: {'lr': 6.90269892195127e-05, 'samples': 21913536, 'steps': 114132, 'loss/train': 0.8641519546508789} 11/07/2021 13:16:49 - INFO - __main__ - Step 114134: {'lr': 6.902332806697839e-05, 'samples': 21913728, 'steps': 114133, 'loss/train': 1.4533878564834595} 11/07/2021 13:16:49 - INFO - __main__ - Step 114135: {'lr': 6.901966699598939e-05, 'samples': 21913920, 'steps': 114134, 'loss/train': 1.7288694381713867} 11/07/2021 13:16:49 - INFO - __main__ - Step 114136: {'lr': 6.901600600654734e-05, 'samples': 21914112, 'steps': 114135, 'loss/train': 0.9085667729377747} 11/07/2021 13:16:50 - INFO - __main__ - Step 114137: {'lr': 6.901234509865387e-05, 'samples': 21914304, 'steps': 114136, 'loss/train': 1.2443528175354004} 11/07/2021 13:16:50 - INFO - __main__ - Step 114138: {'lr': 6.900868427231074e-05, 'samples': 21914496, 'steps': 114137, 'loss/train': 1.385299563407898} 11/07/2021 13:16:51 - INFO - __main__ - Step 114139: {'lr': 6.900502352751942e-05, 'samples': 21914688, 'steps': 114138, 'loss/train': 0.6465228199958801} 11/07/2021 13:16:51 - INFO - __main__ - Step 114140: {'lr': 6.900136286428163e-05, 'samples': 21914880, 'steps': 114139, 'loss/train': 1.7785946130752563} 11/07/2021 13:16:52 - INFO - __main__ - Step 114141: {'lr': 6.899770228259905e-05, 'samples': 21915072, 'steps': 114140, 'loss/train': 1.3683005571365356} 11/07/2021 13:16:52 - INFO - __main__ - Step 114142: {'lr': 6.899404178247328e-05, 'samples': 21915264, 'steps': 114141, 'loss/train': 1.5432379245758057} 11/07/2021 13:16:52 - INFO - __main__ - Step 114143: {'lr': 6.899038136390603e-05, 'samples': 21915456, 'steps': 114142, 'loss/train': 1.3073134422302246} 11/07/2021 13:16:54 - INFO - __main__ - Step 114144: {'lr': 6.898672102689893e-05, 'samples': 21915648, 'steps': 114143, 'loss/train': 1.1691914796829224} 11/07/2021 13:16:54 - INFO - __main__ - Step 114145: {'lr': 6.898306077145361e-05, 'samples': 21915840, 'steps': 114144, 'loss/train': 1.181565284729004} 11/07/2021 13:16:54 - INFO - __main__ - Step 114146: {'lr': 6.897940059757171e-05, 'samples': 21916032, 'steps': 114145, 'loss/train': 1.1470112800598145} 11/07/2021 13:16:55 - INFO - __main__ - Step 114147: {'lr': 6.897574050525493e-05, 'samples': 21916224, 'steps': 114146, 'loss/train': 1.5303783416748047} 11/07/2021 13:16:55 - INFO - __main__ - Step 114148: {'lr': 6.897208049450488e-05, 'samples': 21916416, 'steps': 114147, 'loss/train': 1.2907873392105103} 11/07/2021 13:16:55 - INFO - __main__ - Step 114149: {'lr': 6.89684205653232e-05, 'samples': 21916608, 'steps': 114148, 'loss/train': 0.6985037326812744} 11/07/2021 13:16:56 - INFO - __main__ - Step 114150: {'lr': 6.896476071771157e-05, 'samples': 21916800, 'steps': 114149, 'loss/train': 1.5254114866256714} 11/07/2021 13:16:57 - INFO - __main__ - Step 114151: {'lr': 6.896110095167171e-05, 'samples': 21916992, 'steps': 114150, 'loss/train': 1.0660682916641235} 11/07/2021 13:16:57 - INFO - __main__ - Step 114152: {'lr': 6.89574412672051e-05, 'samples': 21917184, 'steps': 114151, 'loss/train': 1.4128557443618774} 11/07/2021 13:16:58 - INFO - __main__ - Step 114153: {'lr': 6.895378166431346e-05, 'samples': 21917376, 'steps': 114152, 'loss/train': 1.0734738111495972} 11/07/2021 13:16:58 - INFO - __main__ - Step 114154: {'lr': 6.895012214299846e-05, 'samples': 21917568, 'steps': 114153, 'loss/train': 0.7431625723838806} 11/07/2021 13:16:59 - INFO - __main__ - Step 114155: {'lr': 6.894646270326175e-05, 'samples': 21917760, 'steps': 114154, 'loss/train': 1.5998313426971436} 11/07/2021 13:16:59 - INFO - __main__ - Step 114156: {'lr': 6.894280334510498e-05, 'samples': 21917952, 'steps': 114155, 'loss/train': 1.5445375442504883} 11/07/2021 13:17:00 - INFO - __main__ - Step 114157: {'lr': 6.893914406852974e-05, 'samples': 21918144, 'steps': 114156, 'loss/train': 0.432672381401062} 11/07/2021 13:17:00 - INFO - __main__ - Step 114158: {'lr': 6.893548487353777e-05, 'samples': 21918336, 'steps': 114157, 'loss/train': 0.39563632011413574} 11/07/2021 13:17:00 - INFO - __main__ - Step 114159: {'lr': 6.893182576013065e-05, 'samples': 21918528, 'steps': 114158, 'loss/train': 1.741895079612732} 11/07/2021 13:17:01 - INFO - __main__ - Step 114160: {'lr': 6.892816672831009e-05, 'samples': 21918720, 'steps': 114159, 'loss/train': 0.9785300493240356} 11/07/2021 13:17:02 - INFO - __main__ - Step 114161: {'lr': 6.892450777807769e-05, 'samples': 21918912, 'steps': 114160, 'loss/train': 1.7077012062072754} 11/07/2021 13:17:02 - INFO - __main__ - Step 114162: {'lr': 6.89208489094351e-05, 'samples': 21919104, 'steps': 114161, 'loss/train': 1.4768040180206299} 11/07/2021 13:17:02 - INFO - __main__ - Step 114163: {'lr': 6.891719012238399e-05, 'samples': 21919296, 'steps': 114162, 'loss/train': 1.5504934787750244} 11/07/2021 13:17:03 - INFO - __main__ - Step 114164: {'lr': 6.891353141692608e-05, 'samples': 21919488, 'steps': 114163, 'loss/train': 1.2469731569290161} 11/07/2021 13:17:04 - INFO - __main__ - Step 114165: {'lr': 6.890987279306285e-05, 'samples': 21919680, 'steps': 114164, 'loss/train': 1.3511844873428345} 11/07/2021 13:17:04 - INFO - __main__ - Step 114166: {'lr': 6.890621425079604e-05, 'samples': 21919872, 'steps': 114165, 'loss/train': 1.5107629299163818} 11/07/2021 13:17:04 - INFO - __main__ - Step 114167: {'lr': 6.89025557901273e-05, 'samples': 21920064, 'steps': 114166, 'loss/train': 1.0422207117080688} 11/07/2021 13:17:05 - INFO - __main__ - Step 114168: {'lr': 6.889889741105828e-05, 'samples': 21920256, 'steps': 114167, 'loss/train': 1.4646316766738892} 11/07/2021 13:17:05 - INFO - __main__ - Step 114169: {'lr': 6.889523911359063e-05, 'samples': 21920448, 'steps': 114168, 'loss/train': 1.211122989654541} 11/07/2021 13:17:06 - INFO - __main__ - Step 114170: {'lr': 6.889158089772599e-05, 'samples': 21920640, 'steps': 114169, 'loss/train': 1.3616633415222168} 11/07/2021 13:17:07 - INFO - __main__ - Step 114171: {'lr': 6.888792276346597e-05, 'samples': 21920832, 'steps': 114170, 'loss/train': 0.8764964938163757} 11/07/2021 13:17:07 - INFO - __main__ - Step 114172: {'lr': 6.88842647108123e-05, 'samples': 21921024, 'steps': 114171, 'loss/train': 1.3099350929260254} 11/07/2021 13:17:07 - INFO - __main__ - Step 114173: {'lr': 6.888060673976656e-05, 'samples': 21921216, 'steps': 114172, 'loss/train': 1.0244501829147339} 11/07/2021 13:17:08 - INFO - __main__ - Step 114174: {'lr': 6.887694885033044e-05, 'samples': 21921408, 'steps': 114173, 'loss/train': 1.7701644897460938} 11/07/2021 13:17:08 - INFO - __main__ - Step 114175: {'lr': 6.887329104250556e-05, 'samples': 21921600, 'steps': 114174, 'loss/train': 1.3335858583450317} 11/07/2021 13:17:09 - INFO - __main__ - Step 114176: {'lr': 6.88696333162936e-05, 'samples': 21921792, 'steps': 114175, 'loss/train': 1.192387342453003} 11/07/2021 13:17:09 - INFO - __main__ - Step 114177: {'lr': 6.886597567169617e-05, 'samples': 21921984, 'steps': 114176, 'loss/train': 1.0543709993362427} 11/07/2021 13:17:10 - INFO - __main__ - Step 114178: {'lr': 6.886231810871502e-05, 'samples': 21922176, 'steps': 114177, 'loss/train': 1.3932738304138184} 11/07/2021 13:17:10 - INFO - __main__ - Step 114179: {'lr': 6.885866062735163e-05, 'samples': 21922368, 'steps': 114178, 'loss/train': 1.2877964973449707} 11/07/2021 13:17:10 - INFO - __main__ - Step 114180: {'lr': 6.885500322760773e-05, 'samples': 21922560, 'steps': 114179, 'loss/train': 1.3771122694015503} 11/07/2021 13:17:11 - INFO - __main__ - Step 114181: {'lr': 6.885134590948497e-05, 'samples': 21922752, 'steps': 114180, 'loss/train': 1.5172648429870605} 11/07/2021 13:17:12 - INFO - __main__ - Step 114182: {'lr': 6.8847688672985e-05, 'samples': 21922944, 'steps': 114181, 'loss/train': 1.2558916807174683} 11/07/2021 13:17:12 - INFO - __main__ - Step 114183: {'lr': 6.884403151810947e-05, 'samples': 21923136, 'steps': 114182, 'loss/train': 1.3003156185150146} 11/07/2021 13:17:12 - INFO - __main__ - Step 114184: {'lr': 6.884037444486002e-05, 'samples': 21923328, 'steps': 114183, 'loss/train': 0.8521623015403748} 11/07/2021 13:17:13 - INFO - __main__ - Step 114185: {'lr': 6.883671745323833e-05, 'samples': 21923520, 'steps': 114184, 'loss/train': 1.5453758239746094} 11/07/2021 13:17:14 - INFO - __main__ - Step 114186: {'lr': 6.883306054324598e-05, 'samples': 21923712, 'steps': 114185, 'loss/train': 1.5053253173828125} 11/07/2021 13:17:14 - INFO - __main__ - Step 114187: {'lr': 6.88294037148847e-05, 'samples': 21923904, 'steps': 114186, 'loss/train': 1.3597612380981445} 11/07/2021 13:17:15 - INFO - __main__ - Step 114188: {'lr': 6.882574696815605e-05, 'samples': 21924096, 'steps': 114187, 'loss/train': 0.7412821054458618} 11/07/2021 13:17:15 - INFO - __main__ - Step 114189: {'lr': 6.882209030306181e-05, 'samples': 21924288, 'steps': 114188, 'loss/train': 1.2809401750564575} 11/07/2021 13:17:15 - INFO - __main__ - Step 114190: {'lr': 6.881843371960348e-05, 'samples': 21924480, 'steps': 114189, 'loss/train': 1.2393678426742554} 11/07/2021 13:17:16 - INFO - __main__ - Step 114191: {'lr': 6.881477721778276e-05, 'samples': 21924672, 'steps': 114190, 'loss/train': 1.5517127513885498} 11/07/2021 13:17:17 - INFO - __main__ - Step 114192: {'lr': 6.88111207976013e-05, 'samples': 21924864, 'steps': 114191, 'loss/train': 1.369262933731079} 11/07/2021 13:17:17 - INFO - __main__ - Step 114193: {'lr': 6.880746445906075e-05, 'samples': 21925056, 'steps': 114192, 'loss/train': 1.468173623085022} 11/07/2021 13:17:17 - INFO - __main__ - Step 114194: {'lr': 6.880380820216279e-05, 'samples': 21925248, 'steps': 114193, 'loss/train': 0.9554060101509094} 11/07/2021 13:17:18 - INFO - __main__ - Step 114195: {'lr': 6.880015202690901e-05, 'samples': 21925440, 'steps': 114194, 'loss/train': 0.9654168486595154} 11/07/2021 13:17:18 - INFO - __main__ - Step 114196: {'lr': 6.87964959333011e-05, 'samples': 21925632, 'steps': 114195, 'loss/train': 1.2653425931930542} 11/07/2021 13:17:19 - INFO - __main__ - Step 114197: {'lr': 6.879283992134066e-05, 'samples': 21925824, 'steps': 114196, 'loss/train': 1.3910163640975952} 11/07/2021 13:17:20 - INFO - __main__ - Step 114198: {'lr': 6.87891839910294e-05, 'samples': 21926016, 'steps': 114197, 'loss/train': 1.4142537117004395} 11/07/2021 13:17:20 - INFO - __main__ - Step 114199: {'lr': 6.878552814236894e-05, 'samples': 21926208, 'steps': 114198, 'loss/train': 1.131905436515808} 11/07/2021 13:17:20 - INFO - __main__ - Step 114200: {'lr': 6.878187237536099e-05, 'samples': 21926400, 'steps': 114199, 'loss/train': 1.8295997381210327} 11/07/2021 13:17:21 - INFO - __main__ - Step 114201: {'lr': 6.877821669000705e-05, 'samples': 21926592, 'steps': 114200, 'loss/train': 1.2714042663574219} 11/07/2021 13:17:22 - INFO - __main__ - Step 114202: {'lr': 6.877456108630886e-05, 'samples': 21926784, 'steps': 114201, 'loss/train': 1.778363585472107} 11/07/2021 13:17:22 - INFO - __main__ - Step 114203: {'lr': 6.877090556426807e-05, 'samples': 21926976, 'steps': 114202, 'loss/train': 1.368945598602295} 11/07/2021 13:17:22 - INFO - __main__ - Step 114204: {'lr': 6.87672501238863e-05, 'samples': 21927168, 'steps': 114203, 'loss/train': 1.504682183265686} 11/07/2021 13:17:23 - INFO - __main__ - Step 114205: {'lr': 6.87635947651652e-05, 'samples': 21927360, 'steps': 114204, 'loss/train': 1.5115586519241333} 11/07/2021 13:17:23 - INFO - __main__ - Step 114206: {'lr': 6.875993948810643e-05, 'samples': 21927552, 'steps': 114205, 'loss/train': 0.7577998638153076} 11/07/2021 13:17:24 - INFO - __main__ - Step 114207: {'lr': 6.875628429271166e-05, 'samples': 21927744, 'steps': 114206, 'loss/train': 1.2196718454360962} 11/07/2021 13:17:25 - INFO - __main__ - Step 114208: {'lr': 6.875262917898248e-05, 'samples': 21927936, 'steps': 114207, 'loss/train': 1.0236351490020752} 11/07/2021 13:17:25 - INFO - __main__ - Step 114209: {'lr': 6.874897414692057e-05, 'samples': 21928128, 'steps': 114208, 'loss/train': 5.678946018218994} 11/07/2021 13:17:25 - INFO - __main__ - Step 114210: {'lr': 6.87453191965276e-05, 'samples': 21928320, 'steps': 114209, 'loss/train': 1.3900995254516602} 11/07/2021 13:17:26 - INFO - __main__ - Step 114211: {'lr': 6.874166432780526e-05, 'samples': 21928512, 'steps': 114210, 'loss/train': 1.6983553171157837} 11/07/2021 13:17:26 - INFO - __main__ - Step 114212: {'lr': 6.873800954075505e-05, 'samples': 21928704, 'steps': 114211, 'loss/train': 1.3907999992370605} 11/07/2021 13:17:27 - INFO - __main__ - Step 114213: {'lr': 6.873435483537869e-05, 'samples': 21928896, 'steps': 114212, 'loss/train': 5.683573246002197} 11/07/2021 13:17:27 - INFO - __main__ - Step 114214: {'lr': 6.873070021167783e-05, 'samples': 21929088, 'steps': 114213, 'loss/train': 1.4572783708572388} 11/07/2021 13:17:28 - INFO - __main__ - Step 114215: {'lr': 6.872704566965413e-05, 'samples': 21929280, 'steps': 114214, 'loss/train': 1.4234461784362793} 11/07/2021 13:17:28 - INFO - __main__ - Step 114216: {'lr': 6.872339120930921e-05, 'samples': 21929472, 'steps': 114215, 'loss/train': 0.8245030045509338} 11/07/2021 13:17:28 - INFO - __main__ - Step 114217: {'lr': 6.871973683064475e-05, 'samples': 21929664, 'steps': 114216, 'loss/train': 0.9524115920066833} 11/07/2021 13:17:30 - INFO - __main__ - Step 114218: {'lr': 6.871608253366238e-05, 'samples': 21929856, 'steps': 114217, 'loss/train': 1.3630073070526123} 11/07/2021 13:17:30 - INFO - __main__ - Step 114219: {'lr': 6.871242831836374e-05, 'samples': 21930048, 'steps': 114218, 'loss/train': 0.9898295998573303} 11/07/2021 13:17:30 - INFO - __main__ - Step 114220: {'lr': 6.870877418475047e-05, 'samples': 21930240, 'steps': 114219, 'loss/train': 0.9422495365142822} 11/07/2021 13:17:31 - INFO - __main__ - Step 114221: {'lr': 6.870512013282423e-05, 'samples': 21930432, 'steps': 114220, 'loss/train': 1.361147403717041} 11/07/2021 13:17:31 - INFO - __main__ - Step 114222: {'lr': 6.870146616258677e-05, 'samples': 21930624, 'steps': 114221, 'loss/train': 1.826316237449646} 11/07/2021 13:17:32 - INFO - __main__ - Step 114223: {'lr': 6.869781227403954e-05, 'samples': 21930816, 'steps': 114222, 'loss/train': 0.8649567365646362} 11/07/2021 13:17:32 - INFO - __main__ - Step 114224: {'lr': 6.869415846718427e-05, 'samples': 21931008, 'steps': 114223, 'loss/train': 1.3034740686416626} 11/07/2021 13:17:33 - INFO - __main__ - Step 114225: {'lr': 6.869050474202263e-05, 'samples': 21931200, 'steps': 114224, 'loss/train': 1.580039620399475} 11/07/2021 13:17:33 - INFO - __main__ - Step 114226: {'lr': 6.868685109855624e-05, 'samples': 21931392, 'steps': 114225, 'loss/train': 1.4148088693618774} 11/07/2021 13:17:33 - INFO - __main__ - Step 114227: {'lr': 6.868319753678675e-05, 'samples': 21931584, 'steps': 114226, 'loss/train': 1.0844249725341797} 11/07/2021 13:17:35 - INFO - __main__ - Step 114228: {'lr': 6.867954405671581e-05, 'samples': 21931776, 'steps': 114227, 'loss/train': 1.2943682670593262} 11/07/2021 13:17:35 - INFO - __main__ - Step 114229: {'lr': 6.867589065834509e-05, 'samples': 21931968, 'steps': 114228, 'loss/train': 1.3226358890533447} 11/07/2021 13:17:35 - INFO - __main__ - Step 114230: {'lr': 6.867223734167622e-05, 'samples': 21932160, 'steps': 114229, 'loss/train': 1.3310171365737915} 11/07/2021 13:17:36 - INFO - __main__ - Step 114231: {'lr': 6.866858410671081e-05, 'samples': 21932352, 'steps': 114230, 'loss/train': 1.2432538270950317} 11/07/2021 13:17:36 - INFO - __main__ - Step 114232: {'lr': 6.866493095345055e-05, 'samples': 21932544, 'steps': 114231, 'loss/train': 1.5134090185165405} 11/07/2021 13:17:37 - INFO - __main__ - Step 114233: {'lr': 6.866127788189717e-05, 'samples': 21932736, 'steps': 114232, 'loss/train': 0.7592271566390991} 11/07/2021 13:17:37 - INFO - __main__ - Step 114234: {'lr': 6.865762489205213e-05, 'samples': 21932928, 'steps': 114233, 'loss/train': 1.1843218803405762} 11/07/2021 13:17:38 - INFO - __main__ - Step 114235: {'lr': 6.865397198391715e-05, 'samples': 21933120, 'steps': 114234, 'loss/train': 1.1117373704910278} 11/07/2021 13:17:38 - INFO - __main__ - Step 114236: {'lr': 6.865031915749392e-05, 'samples': 21933312, 'steps': 114235, 'loss/train': 1.1852295398712158} 11/07/2021 13:17:38 - INFO - __main__ - Step 114237: {'lr': 6.864666641278405e-05, 'samples': 21933504, 'steps': 114236, 'loss/train': 1.9001727104187012} 11/07/2021 13:17:39 - INFO - __main__ - Step 114238: {'lr': 6.864301374978918e-05, 'samples': 21933696, 'steps': 114237, 'loss/train': 1.3893001079559326} 11/07/2021 13:17:40 - INFO - __main__ - Step 114239: {'lr': 6.8639361168511e-05, 'samples': 21933888, 'steps': 114238, 'loss/train': 1.2468714714050293} 11/07/2021 13:17:40 - INFO - __main__ - Step 114240: {'lr': 6.863570866895109e-05, 'samples': 21934080, 'steps': 114239, 'loss/train': 1.0603777170181274} 11/07/2021 13:17:40 - INFO - __main__ - Step 114241: {'lr': 6.863205625111113e-05, 'samples': 21934272, 'steps': 114240, 'loss/train': 0.5469822883605957} 11/07/2021 13:17:41 - INFO - __main__ - Step 114242: {'lr': 6.862840391499278e-05, 'samples': 21934464, 'steps': 114241, 'loss/train': 0.6212637424468994} 11/07/2021 13:17:42 - INFO - __main__ - Step 114243: {'lr': 6.862475166059767e-05, 'samples': 21934656, 'steps': 114242, 'loss/train': 1.2338271141052246} 11/07/2021 13:17:42 - INFO - __main__ - Step 114244: {'lr': 6.862109948792746e-05, 'samples': 21934848, 'steps': 114243, 'loss/train': 1.4066907167434692} 11/07/2021 13:17:43 - INFO - __main__ - Step 114245: {'lr': 6.861744739698386e-05, 'samples': 21935040, 'steps': 114244, 'loss/train': 1.3538289070129395} 11/07/2021 13:17:43 - INFO - __main__ - Step 114246: {'lr': 6.861379538776835e-05, 'samples': 21935232, 'steps': 114245, 'loss/train': 1.0628702640533447} 11/07/2021 13:17:43 - INFO - __main__ - Step 114247: {'lr': 6.861014346028268e-05, 'samples': 21935424, 'steps': 114246, 'loss/train': 1.5921316146850586} 11/07/2021 13:17:44 - INFO - __main__ - Step 114248: {'lr': 6.860649161452848e-05, 'samples': 21935616, 'steps': 114247, 'loss/train': 1.7214696407318115} 11/07/2021 13:17:45 - INFO - __main__ - Step 114249: {'lr': 6.860283985050738e-05, 'samples': 21935808, 'steps': 114248, 'loss/train': 1.263340711593628} 11/07/2021 13:17:45 - INFO - __main__ - Step 114250: {'lr': 6.859918816822103e-05, 'samples': 21936000, 'steps': 114249, 'loss/train': 0.35923486948013306} 11/07/2021 13:17:45 - INFO - __main__ - Step 114251: {'lr': 6.859553656767112e-05, 'samples': 21936192, 'steps': 114250, 'loss/train': 1.4908674955368042} 11/07/2021 13:17:46 - INFO - __main__ - Step 114252: {'lr': 6.859188504885924e-05, 'samples': 21936384, 'steps': 114251, 'loss/train': 1.1648850440979004} 11/07/2021 13:17:46 - INFO - __main__ - Step 114253: {'lr': 6.858823361178706e-05, 'samples': 21936576, 'steps': 114252, 'loss/train': 1.200985312461853} 11/07/2021 13:17:47 - INFO - __main__ - Step 114254: {'lr': 6.858458225645622e-05, 'samples': 21936768, 'steps': 114253, 'loss/train': 1.2374762296676636} 11/07/2021 13:17:48 - INFO - __main__ - Step 114255: {'lr': 6.858093098286839e-05, 'samples': 21936960, 'steps': 114254, 'loss/train': 0.0552549734711647} 11/07/2021 13:17:48 - INFO - __main__ - Step 114256: {'lr': 6.857727979102518e-05, 'samples': 21937152, 'steps': 114255, 'loss/train': 1.101680040359497} 11/07/2021 13:17:48 - INFO - __main__ - Step 114257: {'lr': 6.857362868092823e-05, 'samples': 21937344, 'steps': 114256, 'loss/train': 1.6141445636749268} 11/07/2021 13:17:49 - INFO - __main__ - Step 114258: {'lr': 6.856997765257921e-05, 'samples': 21937536, 'steps': 114257, 'loss/train': 1.622375249862671} 11/07/2021 13:17:50 - INFO - __main__ - Step 114259: {'lr': 6.856632670597988e-05, 'samples': 21937728, 'steps': 114258, 'loss/train': 1.4436312913894653} 11/07/2021 13:17:50 - INFO - __main__ - Step 114260: {'lr': 6.856267584113163e-05, 'samples': 21937920, 'steps': 114259, 'loss/train': 1.4199132919311523} 11/07/2021 13:17:50 - INFO - __main__ - Step 114261: {'lr': 6.855902505803627e-05, 'samples': 21938112, 'steps': 114260, 'loss/train': 1.4152381420135498} 11/07/2021 13:17:51 - INFO - __main__ - Step 114262: {'lr': 6.855537435669539e-05, 'samples': 21938304, 'steps': 114261, 'loss/train': 1.5222817659378052} 11/07/2021 13:17:51 - INFO - __main__ - Step 114263: {'lr': 6.855172373711066e-05, 'samples': 21938496, 'steps': 114262, 'loss/train': 1.1811712980270386} 11/07/2021 13:17:52 - INFO - __main__ - Step 114264: {'lr': 6.854807319928375e-05, 'samples': 21938688, 'steps': 114263, 'loss/train': 1.4880093336105347} 11/07/2021 13:17:52 - INFO - __main__ - Step 114265: {'lr': 6.854442274321626e-05, 'samples': 21938880, 'steps': 114264, 'loss/train': 0.5060787200927734} 11/07/2021 13:17:53 - INFO - __main__ - Step 114266: {'lr': 6.854077236890985e-05, 'samples': 21939072, 'steps': 114265, 'loss/train': 1.2582902908325195} 11/07/2021 13:17:53 - INFO - __main__ - Step 114267: {'lr': 6.853712207636617e-05, 'samples': 21939264, 'steps': 114266, 'loss/train': 1.4131591320037842} 11/07/2021 13:17:54 - INFO - __main__ - Step 114268: {'lr': 6.853347186558686e-05, 'samples': 21939456, 'steps': 114267, 'loss/train': 1.3412176370620728} 11/07/2021 13:17:55 - INFO - __main__ - Step 114269: {'lr': 6.852982173657357e-05, 'samples': 21939648, 'steps': 114268, 'loss/train': 1.6670193672180176} 11/07/2021 13:17:55 - INFO - __main__ - Step 114270: {'lr': 6.852617168932796e-05, 'samples': 21939840, 'steps': 114269, 'loss/train': 1.147147297859192} 11/07/2021 13:17:55 - INFO - __main__ - Step 114271: {'lr': 6.852252172385165e-05, 'samples': 21940032, 'steps': 114270, 'loss/train': 1.4661883115768433} 11/07/2021 13:17:56 - INFO - __main__ - Step 114272: {'lr': 6.851887184014635e-05, 'samples': 21940224, 'steps': 114271, 'loss/train': 1.658922791481018} 11/07/2021 13:17:56 - INFO - __main__ - Step 114273: {'lr': 6.851522203821358e-05, 'samples': 21940416, 'steps': 114272, 'loss/train': 1.4802988767623901} 11/07/2021 13:17:57 - INFO - __main__ - Step 114274: {'lr': 6.851157231805504e-05, 'samples': 21940608, 'steps': 114273, 'loss/train': 1.4029467105865479} 11/07/2021 13:17:57 - INFO - __main__ - Step 114275: {'lr': 6.85079226796724e-05, 'samples': 21940800, 'steps': 114274, 'loss/train': 0.752137303352356} 11/07/2021 13:17:58 - INFO - __main__ - Step 114276: {'lr': 6.850427312306729e-05, 'samples': 21940992, 'steps': 114275, 'loss/train': 1.5266746282577515} 11/07/2021 13:17:58 - INFO - __main__ - Step 114277: {'lr': 6.850062364824136e-05, 'samples': 21941184, 'steps': 114276, 'loss/train': 1.4531797170639038} 11/07/2021 13:17:58 - INFO - __main__ - Step 114278: {'lr': 6.849697425519621e-05, 'samples': 21941376, 'steps': 114277, 'loss/train': 1.1556391716003418} 11/07/2021 13:18:00 - INFO - __main__ - Step 114279: {'lr': 6.849332494393356e-05, 'samples': 21941568, 'steps': 114278, 'loss/train': 1.0481609106063843} 11/07/2021 13:18:00 - INFO - __main__ - Step 114280: {'lr': 6.848967571445503e-05, 'samples': 21941760, 'steps': 114279, 'loss/train': 1.2040717601776123} 11/07/2021 13:18:00 - INFO - __main__ - Step 114281: {'lr': 6.84860265667622e-05, 'samples': 21941952, 'steps': 114280, 'loss/train': 1.4186296463012695} 11/07/2021 13:18:01 - INFO - __main__ - Step 114282: {'lr': 6.848237750085681e-05, 'samples': 21942144, 'steps': 114281, 'loss/train': 1.4192125797271729} 11/07/2021 13:18:01 - INFO - __main__ - Step 114283: {'lr': 6.847872851674044e-05, 'samples': 21942336, 'steps': 114282, 'loss/train': 1.2045776844024658} 11/07/2021 13:18:01 - INFO - __main__ - Step 114284: {'lr': 6.847507961441474e-05, 'samples': 21942528, 'steps': 114283, 'loss/train': 1.2342997789382935} 11/07/2021 13:18:03 - INFO - __main__ - Step 114285: {'lr': 6.847143079388146e-05, 'samples': 21942720, 'steps': 114284, 'loss/train': 1.2433454990386963} 11/07/2021 13:18:03 - INFO - __main__ - Step 114286: {'lr': 6.846778205514209e-05, 'samples': 21942912, 'steps': 114285, 'loss/train': 1.1107006072998047} 11/07/2021 13:18:03 - INFO - __main__ - Step 114287: {'lr': 6.846413339819832e-05, 'samples': 21943104, 'steps': 114286, 'loss/train': 1.0433590412139893} 11/07/2021 13:18:04 - INFO - __main__ - Step 114288: {'lr': 6.846048482305181e-05, 'samples': 21943296, 'steps': 114287, 'loss/train': 1.2758628129959106} 11/07/2021 13:18:04 - INFO - __main__ - Step 114289: {'lr': 6.84568363297042e-05, 'samples': 21943488, 'steps': 114288, 'loss/train': 1.4983724355697632} 11/07/2021 13:18:05 - INFO - __main__ - Step 114290: {'lr': 6.845318791815717e-05, 'samples': 21943680, 'steps': 114289, 'loss/train': 1.5594812631607056} 11/07/2021 13:18:05 - INFO - __main__ - Step 114291: {'lr': 6.844953958841229e-05, 'samples': 21943872, 'steps': 114290, 'loss/train': 1.5859993696212769} 11/07/2021 13:18:06 - INFO - __main__ - Step 114292: {'lr': 6.844589134047127e-05, 'samples': 21944064, 'steps': 114291, 'loss/train': 1.5294854640960693} 11/07/2021 13:18:06 - INFO - __main__ - Step 114293: {'lr': 6.844224317433572e-05, 'samples': 21944256, 'steps': 114292, 'loss/train': 1.1700546741485596} 11/07/2021 13:18:06 - INFO - __main__ - Step 114294: {'lr': 6.84385950900073e-05, 'samples': 21944448, 'steps': 114293, 'loss/train': 1.3284037113189697} 11/07/2021 13:18:07 - INFO - __main__ - Step 114295: {'lr': 6.843494708748765e-05, 'samples': 21944640, 'steps': 114294, 'loss/train': 1.4365830421447754} 11/07/2021 13:18:08 - INFO - __main__ - Step 114296: {'lr': 6.84312991667784e-05, 'samples': 21944832, 'steps': 114295, 'loss/train': 1.725490927696228} 11/07/2021 13:18:08 - INFO - __main__ - Step 114297: {'lr': 6.84276513278812e-05, 'samples': 21945024, 'steps': 114296, 'loss/train': 1.3782657384872437} 11/07/2021 13:18:08 - INFO - __main__ - Step 114298: {'lr': 6.842400357079773e-05, 'samples': 21945216, 'steps': 114297, 'loss/train': 1.2088912725448608} 11/07/2021 13:18:09 - INFO - __main__ - Step 114299: {'lr': 6.842035589552964e-05, 'samples': 21945408, 'steps': 114298, 'loss/train': 1.6131662130355835} 11/07/2021 13:18:10 - INFO - __main__ - Step 114300: {'lr': 6.841670830207846e-05, 'samples': 21945600, 'steps': 114299, 'loss/train': 1.506654143333435} 11/07/2021 13:18:10 - INFO - __main__ - Step 114301: {'lr': 6.841306079044596e-05, 'samples': 21945792, 'steps': 114300, 'loss/train': 2.782235860824585} 11/07/2021 13:18:11 - INFO - __main__ - Step 114302: {'lr': 6.840941336063369e-05, 'samples': 21945984, 'steps': 114301, 'loss/train': 1.774582028388977} 11/07/2021 13:18:11 - INFO - __main__ - Step 114303: {'lr': 6.840576601264334e-05, 'samples': 21946176, 'steps': 114302, 'loss/train': 1.3056713342666626} 11/07/2021 13:18:11 - INFO - __main__ - Step 114304: {'lr': 6.840211874647656e-05, 'samples': 21946368, 'steps': 114303, 'loss/train': 1.5994278192520142} 11/07/2021 13:18:12 - INFO - __main__ - Step 114305: {'lr': 6.839847156213497e-05, 'samples': 21946560, 'steps': 114304, 'loss/train': 1.8483617305755615} 11/07/2021 13:18:13 - INFO - __main__ - Step 114306: {'lr': 6.839482445962023e-05, 'samples': 21946752, 'steps': 114305, 'loss/train': 1.6676695346832275} 11/07/2021 13:18:13 - INFO - __main__ - Step 114307: {'lr': 6.8391177438934e-05, 'samples': 21946944, 'steps': 114306, 'loss/train': 1.3482030630111694} 11/07/2021 13:18:14 - INFO - __main__ - Step 114308: {'lr': 6.838753050007788e-05, 'samples': 21947136, 'steps': 114307, 'loss/train': 1.7361984252929688} 11/07/2021 13:18:14 - INFO - __main__ - Step 114309: {'lr': 6.838388364305353e-05, 'samples': 21947328, 'steps': 114308, 'loss/train': 1.27479088306427} 11/07/2021 13:18:15 - INFO - __main__ - Step 114310: {'lr': 6.838023686786262e-05, 'samples': 21947520, 'steps': 114309, 'loss/train': 1.2560386657714844} 11/07/2021 13:18:15 - INFO - __main__ - Step 114311: {'lr': 6.837659017450676e-05, 'samples': 21947712, 'steps': 114310, 'loss/train': 1.5884077548980713} 11/07/2021 13:18:16 - INFO - __main__ - Step 114312: {'lr': 6.83729435629877e-05, 'samples': 21947904, 'steps': 114311, 'loss/train': 1.1583898067474365} 11/07/2021 13:18:16 - INFO - __main__ - Step 114313: {'lr': 6.836929703330688e-05, 'samples': 21948096, 'steps': 114312, 'loss/train': 1.262795090675354} 11/07/2021 13:18:16 - INFO - __main__ - Step 114314: {'lr': 6.836565058546609e-05, 'samples': 21948288, 'steps': 114313, 'loss/train': 1.3329397439956665} 11/07/2021 13:18:17 - INFO - __main__ - Step 114315: {'lr': 6.836200421946692e-05, 'samples': 21948480, 'steps': 114314, 'loss/train': 0.13700997829437256} 11/07/2021 13:18:18 - INFO - __main__ - Step 114316: {'lr': 6.8358357935311e-05, 'samples': 21948672, 'steps': 114315, 'loss/train': 1.4135738611221313} 11/07/2021 13:18:18 - INFO - __main__ - Step 114317: {'lr': 6.835471173300004e-05, 'samples': 21948864, 'steps': 114316, 'loss/train': 1.016982078552246} 11/07/2021 13:18:18 - INFO - __main__ - Step 114318: {'lr': 6.835106561253562e-05, 'samples': 21949056, 'steps': 114317, 'loss/train': 1.4628876447677612} 11/07/2021 13:18:19 - INFO - __main__ - Step 114319: {'lr': 6.834741957391943e-05, 'samples': 21949248, 'steps': 114318, 'loss/train': 1.4406851530075073} 11/07/2021 13:18:19 - INFO - __main__ - Step 114320: {'lr': 6.834377361715308e-05, 'samples': 21949440, 'steps': 114319, 'loss/train': 1.5211528539657593} 11/07/2021 13:18:20 - INFO - __main__ - Step 114321: {'lr': 6.834012774223821e-05, 'samples': 21949632, 'steps': 114320, 'loss/train': 1.2887600660324097} 11/07/2021 13:18:21 - INFO - __main__ - Step 114322: {'lr': 6.83364819491765e-05, 'samples': 21949824, 'steps': 114321, 'loss/train': 1.5070414543151855} 11/07/2021 13:18:21 - INFO - __main__ - Step 114323: {'lr': 6.833283623796955e-05, 'samples': 21950016, 'steps': 114322, 'loss/train': 1.7162787914276123} 11/07/2021 13:18:21 - INFO - __main__ - Step 114324: {'lr': 6.8329190608619e-05, 'samples': 21950208, 'steps': 114323, 'loss/train': 1.3365037441253662} 11/07/2021 13:18:22 - INFO - __main__ - Step 114325: {'lr': 6.832554506112657e-05, 'samples': 21950400, 'steps': 114324, 'loss/train': 1.0766057968139648} 11/07/2021 13:18:23 - INFO - __main__ - Step 114326: {'lr': 6.832189959549387e-05, 'samples': 21950592, 'steps': 114325, 'loss/train': 1.299869179725647} 11/07/2021 13:18:23 - INFO - __main__ - Step 114327: {'lr': 6.831825421172247e-05, 'samples': 21950784, 'steps': 114326, 'loss/train': 1.4472960233688354} 11/07/2021 13:18:23 - INFO - __main__ - Step 114328: {'lr': 6.831460890981403e-05, 'samples': 21950976, 'steps': 114327, 'loss/train': 1.1244810819625854} 11/07/2021 13:18:24 - INFO - __main__ - Step 114329: {'lr': 6.831096368977028e-05, 'samples': 21951168, 'steps': 114328, 'loss/train': 1.5724685192108154} 11/07/2021 13:18:24 - INFO - __main__ - Step 114330: {'lr': 6.830731855159275e-05, 'samples': 21951360, 'steps': 114329, 'loss/train': 1.3318967819213867} 11/07/2021 13:18:25 - INFO - __main__ - Step 114331: {'lr': 6.830367349528316e-05, 'samples': 21951552, 'steps': 114330, 'loss/train': 1.2998137474060059} 11/07/2021 13:18:26 - INFO - __main__ - Step 114332: {'lr': 6.830002852084314e-05, 'samples': 21951744, 'steps': 114331, 'loss/train': 1.2118009328842163} 11/07/2021 13:18:26 - INFO - __main__ - Step 114333: {'lr': 6.829638362827431e-05, 'samples': 21951936, 'steps': 114332, 'loss/train': 1.1311674118041992} 11/07/2021 13:18:26 - INFO - __main__ - Step 114334: {'lr': 6.829273881757833e-05, 'samples': 21952128, 'steps': 114333, 'loss/train': 1.421194314956665} 11/07/2021 13:18:27 - INFO - __main__ - Step 114335: {'lr': 6.828909408875683e-05, 'samples': 21952320, 'steps': 114334, 'loss/train': 0.8529707789421082} 11/07/2021 13:18:28 - INFO - __main__ - Step 114336: {'lr': 6.828544944181147e-05, 'samples': 21952512, 'steps': 114335, 'loss/train': 1.1149095296859741} 11/07/2021 13:18:28 - INFO - __main__ - Step 114337: {'lr': 6.828180487674387e-05, 'samples': 21952704, 'steps': 114336, 'loss/train': 1.5906078815460205} 11/07/2021 13:18:28 - INFO - __main__ - Step 114338: {'lr': 6.827816039355572e-05, 'samples': 21952896, 'steps': 114337, 'loss/train': 1.1590453386306763} 11/07/2021 13:18:29 - INFO - __main__ - Step 114339: {'lr': 6.827451599224867e-05, 'samples': 21953088, 'steps': 114338, 'loss/train': 1.7741888761520386} 11/07/2021 13:18:29 - INFO - __main__ - Step 114340: {'lr': 6.827087167282425e-05, 'samples': 21953280, 'steps': 114339, 'loss/train': 1.2724133729934692} 11/07/2021 13:18:30 - INFO - __main__ - Step 114341: {'lr': 6.826722743528419e-05, 'samples': 21953472, 'steps': 114340, 'loss/train': 0.9490765333175659} 11/07/2021 13:18:30 - INFO - __main__ - Step 114342: {'lr': 6.826358327963009e-05, 'samples': 21953664, 'steps': 114341, 'loss/train': 1.2096726894378662} 11/07/2021 13:18:31 - INFO - __main__ - Step 114343: {'lr': 6.825993920586359e-05, 'samples': 21953856, 'steps': 114342, 'loss/train': 1.5275582075119019} 11/07/2021 13:18:31 - INFO - __main__ - Step 114344: {'lr': 6.825629521398641e-05, 'samples': 21954048, 'steps': 114343, 'loss/train': 1.3071445226669312} 11/07/2021 13:18:31 - INFO - __main__ - Step 114345: {'lr': 6.825265130400011e-05, 'samples': 21954240, 'steps': 114344, 'loss/train': 1.519370675086975} 11/07/2021 13:18:32 - INFO - __main__ - Step 114346: {'lr': 6.824900747590637e-05, 'samples': 21954432, 'steps': 114345, 'loss/train': 1.5122320652008057} 11/07/2021 13:18:33 - INFO - __main__ - Step 114347: {'lr': 6.824536372970683e-05, 'samples': 21954624, 'steps': 114346, 'loss/train': 1.5764981508255005} 11/07/2021 13:18:33 - INFO - __main__ - Step 114348: {'lr': 6.824172006540311e-05, 'samples': 21954816, 'steps': 114347, 'loss/train': 1.2375221252441406} 11/07/2021 13:18:34 - INFO - __main__ - Step 114349: {'lr': 6.823807648299688e-05, 'samples': 21955008, 'steps': 114348, 'loss/train': 1.4940940141677856} 11/07/2021 13:18:34 - INFO - __main__ - Step 114350: {'lr': 6.823443298248974e-05, 'samples': 21955200, 'steps': 114349, 'loss/train': 1.0946468114852905} 11/07/2021 13:18:34 - INFO - __main__ - Step 114351: {'lr': 6.823078956388337e-05, 'samples': 21955392, 'steps': 114350, 'loss/train': 1.3887393474578857} 11/07/2021 13:18:35 - INFO - __main__ - Step 114352: {'lr': 6.822714622717944e-05, 'samples': 21955584, 'steps': 114351, 'loss/train': 1.0601869821548462} 11/07/2021 13:18:36 - INFO - __main__ - Step 114353: {'lr': 6.822350297237958e-05, 'samples': 21955776, 'steps': 114352, 'loss/train': 0.9294612407684326} 11/07/2021 13:18:36 - INFO - __main__ - Step 114354: {'lr': 6.821985979948533e-05, 'samples': 21955968, 'steps': 114353, 'loss/train': 1.4280040264129639} 11/07/2021 13:18:36 - INFO - __main__ - Step 114355: {'lr': 6.821621670849845e-05, 'samples': 21956160, 'steps': 114354, 'loss/train': 1.413906455039978} 11/07/2021 13:18:37 - INFO - __main__ - Step 114356: {'lr': 6.821257369942049e-05, 'samples': 21956352, 'steps': 114355, 'loss/train': 1.3006792068481445} 11/07/2021 13:18:37 - INFO - __main__ - Step 114357: {'lr': 6.820893077225318e-05, 'samples': 21956544, 'steps': 114356, 'loss/train': 1.6970094442367554} 11/07/2021 13:18:38 - INFO - __main__ - Step 114358: {'lr': 6.82052879269981e-05, 'samples': 21956736, 'steps': 114357, 'loss/train': 1.3542165756225586} 11/07/2021 13:18:39 - INFO - __main__ - Step 114359: {'lr': 6.820164516365691e-05, 'samples': 21956928, 'steps': 114358, 'loss/train': 1.660957932472229} 11/07/2021 13:18:39 - INFO - __main__ - Step 114360: {'lr': 6.819800248223123e-05, 'samples': 21957120, 'steps': 114359, 'loss/train': 1.2673046588897705} 11/07/2021 13:18:39 - INFO - __main__ - Step 114361: {'lr': 6.819435988272276e-05, 'samples': 21957312, 'steps': 114360, 'loss/train': 1.46151602268219} 11/07/2021 13:18:40 - INFO - __main__ - Step 114362: {'lr': 6.81907173651331e-05, 'samples': 21957504, 'steps': 114361, 'loss/train': 1.4962540864944458} 11/07/2021 13:18:41 - INFO - __main__ - Step 114363: {'lr': 6.818707492946391e-05, 'samples': 21957696, 'steps': 114362, 'loss/train': 1.22461998462677} 11/07/2021 13:18:41 - INFO - __main__ - Step 114364: {'lr': 6.818343257571679e-05, 'samples': 21957888, 'steps': 114363, 'loss/train': 0.7789157629013062} 11/07/2021 13:18:42 - INFO - __main__ - Step 114365: {'lr': 6.817979030389343e-05, 'samples': 21958080, 'steps': 114364, 'loss/train': 1.6390050649642944} 11/07/2021 13:18:42 - INFO - __main__ - Step 114366: {'lr': 6.817614811399551e-05, 'samples': 21958272, 'steps': 114365, 'loss/train': 0.6972365379333496} 11/07/2021 13:18:42 - INFO - __main__ - Step 114367: {'lr': 6.817250600602454e-05, 'samples': 21958464, 'steps': 114366, 'loss/train': 1.336212396621704} 11/07/2021 13:18:43 - INFO - __main__ - Step 114368: {'lr': 6.816886397998226e-05, 'samples': 21958656, 'steps': 114367, 'loss/train': 1.5304487943649292} 11/07/2021 13:18:44 - INFO - __main__ - Step 114369: {'lr': 6.816522203587025e-05, 'samples': 21958848, 'steps': 114368, 'loss/train': 1.2050291299819946} 11/07/2021 13:18:44 - INFO - __main__ - Step 114370: {'lr': 6.81615801736902e-05, 'samples': 21959040, 'steps': 114369, 'loss/train': 1.6603294610977173} 11/07/2021 13:18:44 - INFO - __main__ - Step 114371: {'lr': 6.815793839344372e-05, 'samples': 21959232, 'steps': 114370, 'loss/train': 0.9243922829627991} 11/07/2021 13:18:45 - INFO - __main__ - Step 114372: {'lr': 6.815429669513249e-05, 'samples': 21959424, 'steps': 114371, 'loss/train': 1.492747187614441} 11/07/2021 13:18:46 - INFO - __main__ - Step 114373: {'lr': 6.815065507875811e-05, 'samples': 21959616, 'steps': 114372, 'loss/train': 1.4145796298980713} 11/07/2021 13:18:46 - INFO - __main__ - Step 114374: {'lr': 6.814701354432226e-05, 'samples': 21959808, 'steps': 114373, 'loss/train': 1.1970008611679077} 11/07/2021 13:18:46 - INFO - __main__ - Step 114375: {'lr': 6.814337209182652e-05, 'samples': 21960000, 'steps': 114374, 'loss/train': 1.1572014093399048} 11/07/2021 13:18:47 - INFO - __main__ - Step 114376: {'lr': 6.813973072127261e-05, 'samples': 21960192, 'steps': 114375, 'loss/train': 0.41802579164505005} 11/07/2021 13:18:47 - INFO - __main__ - Step 114377: {'lr': 6.813608943266211e-05, 'samples': 21960384, 'steps': 114376, 'loss/train': 1.5486775636672974} 11/07/2021 13:18:48 - INFO - __main__ - Step 114378: {'lr': 6.81324482259967e-05, 'samples': 21960576, 'steps': 114377, 'loss/train': 1.3655040264129639} 11/07/2021 13:18:48 - INFO - __main__ - Step 114379: {'lr': 6.8128807101278e-05, 'samples': 21960768, 'steps': 114378, 'loss/train': 1.278262734413147} 11/07/2021 13:18:49 - INFO - __main__ - Step 114380: {'lr': 6.812516605850771e-05, 'samples': 21960960, 'steps': 114379, 'loss/train': 0.9971288442611694} 11/07/2021 13:18:49 - INFO - __main__ - Step 114381: {'lr': 6.812152509768734e-05, 'samples': 21961152, 'steps': 114380, 'loss/train': 1.3430873155593872} 11/07/2021 13:18:50 - INFO - __main__ - Step 114382: {'lr': 6.811788421881862e-05, 'samples': 21961344, 'steps': 114381, 'loss/train': 1.3543171882629395} 11/07/2021 13:18:50 - INFO - __main__ - Step 114383: {'lr': 6.811424342190318e-05, 'samples': 21961536, 'steps': 114382, 'loss/train': 1.0562546253204346} 11/07/2021 13:18:51 - INFO - __main__ - Step 114384: {'lr': 6.811060270694263e-05, 'samples': 21961728, 'steps': 114383, 'loss/train': 1.1844531297683716} 11/07/2021 13:18:51 - INFO - __main__ - Step 114385: {'lr': 6.810696207393865e-05, 'samples': 21961920, 'steps': 114384, 'loss/train': 1.059882640838623} 11/07/2021 13:18:52 - INFO - __main__ - Step 114386: {'lr': 6.810332152289286e-05, 'samples': 21962112, 'steps': 114385, 'loss/train': 0.9537425637245178} 11/07/2021 13:18:52 - INFO - __main__ - Step 114387: {'lr': 6.809968105380692e-05, 'samples': 21962304, 'steps': 114386, 'loss/train': 1.088280200958252} 11/07/2021 13:18:52 - INFO - __main__ - Step 114388: {'lr': 6.809604066668246e-05, 'samples': 21962496, 'steps': 114387, 'loss/train': 1.5121413469314575} 11/07/2021 13:18:53 - INFO - __main__ - Step 114389: {'lr': 6.809240036152109e-05, 'samples': 21962688, 'steps': 114388, 'loss/train': 1.4785007238388062} 11/07/2021 13:18:54 - INFO - __main__ - Step 114390: {'lr': 6.80887601383245e-05, 'samples': 21962880, 'steps': 114389, 'loss/train': 1.8351949453353882} 11/07/2021 13:18:54 - INFO - __main__ - Step 114391: {'lr': 6.80851199970943e-05, 'samples': 21963072, 'steps': 114390, 'loss/train': 1.4136059284210205} 11/07/2021 13:18:54 - INFO - __main__ - Step 114392: {'lr': 6.808147993783215e-05, 'samples': 21963264, 'steps': 114391, 'loss/train': 1.3921617269515991} 11/07/2021 13:18:55 - INFO - __main__ - Step 114393: {'lr': 6.807783996053974e-05, 'samples': 21963456, 'steps': 114392, 'loss/train': 1.504216194152832} 11/07/2021 13:18:56 - INFO - __main__ - Step 114394: {'lr': 6.807420006521855e-05, 'samples': 21963648, 'steps': 114393, 'loss/train': 1.3896944522857666} 11/07/2021 13:18:56 - INFO - __main__ - Step 114395: {'lr': 6.807056025187036e-05, 'samples': 21963840, 'steps': 114394, 'loss/train': 0.9815194010734558} 11/07/2021 13:18:56 - INFO - __main__ - Step 114396: {'lr': 6.806692052049674e-05, 'samples': 21964032, 'steps': 114395, 'loss/train': 1.4144947528839111} 11/07/2021 13:18:57 - INFO - __main__ - Step 114397: {'lr': 6.806328087109937e-05, 'samples': 21964224, 'steps': 114396, 'loss/train': 1.6249943971633911} 11/07/2021 13:18:57 - INFO - __main__ - Step 114398: {'lr': 6.805964130367986e-05, 'samples': 21964416, 'steps': 114397, 'loss/train': 1.2952631711959839} 11/07/2021 13:18:58 - INFO - __main__ - Step 114399: {'lr': 6.80560018182399e-05, 'samples': 21964608, 'steps': 114398, 'loss/train': 1.361703872680664} 11/07/2021 13:18:59 - INFO - __main__ - Step 114400: {'lr': 6.805236241478108e-05, 'samples': 21964800, 'steps': 114399, 'loss/train': 0.36793288588523865} 11/07/2021 13:18:59 - INFO - __main__ - Step 114401: {'lr': 6.804872309330506e-05, 'samples': 21964992, 'steps': 114400, 'loss/train': 1.848799228668213} 11/07/2021 13:18:59 - INFO - __main__ - Step 114402: {'lr': 6.804508385381348e-05, 'samples': 21965184, 'steps': 114401, 'loss/train': 0.9464281797409058} 11/07/2021 13:19:00 - INFO - __main__ - Step 114403: {'lr': 6.8041444696308e-05, 'samples': 21965376, 'steps': 114402, 'loss/train': 1.253160834312439} 11/07/2021 13:19:01 - INFO - __main__ - Step 114404: {'lr': 6.803780562079021e-05, 'samples': 21965568, 'steps': 114403, 'loss/train': 1.4383240938186646} 11/07/2021 13:19:01 - INFO - __main__ - Step 114405: {'lr': 6.803416662726175e-05, 'samples': 21965760, 'steps': 114404, 'loss/train': 1.260521650314331} 11/07/2021 13:19:01 - INFO - __main__ - Step 114406: {'lr': 6.80305277157244e-05, 'samples': 21965952, 'steps': 114405, 'loss/train': 0.9484583139419556} 11/07/2021 13:19:02 - INFO - __main__ - Step 114407: {'lr': 6.802688888617962e-05, 'samples': 21966144, 'steps': 114406, 'loss/train': 1.1552664041519165} 11/07/2021 13:19:02 - INFO - __main__ - Step 114408: {'lr': 6.802325013862908e-05, 'samples': 21966336, 'steps': 114407, 'loss/train': 1.347771406173706} 11/07/2021 13:19:03 - INFO - __main__ - Step 114409: {'lr': 6.801961147307447e-05, 'samples': 21966528, 'steps': 114408, 'loss/train': 1.2632569074630737} 11/07/2021 13:19:03 - INFO - __main__ - Step 114410: {'lr': 6.801597288951745e-05, 'samples': 21966720, 'steps': 114409, 'loss/train': 1.0887693166732788} 11/07/2021 13:19:04 - INFO - __main__ - Step 114411: {'lr': 6.801233438795957e-05, 'samples': 21966912, 'steps': 114410, 'loss/train': 1.6217764616012573} 11/07/2021 13:19:04 - INFO - __main__ - Step 114412: {'lr': 6.800869596840257e-05, 'samples': 21967104, 'steps': 114411, 'loss/train': 1.0590217113494873} 11/07/2021 13:19:05 - INFO - __main__ - Step 114413: {'lr': 6.8005057630848e-05, 'samples': 21967296, 'steps': 114412, 'loss/train': 1.7211040258407593} 11/07/2021 13:19:06 - INFO - __main__ - Step 114414: {'lr': 6.800141937529755e-05, 'samples': 21967488, 'steps': 114413, 'loss/train': 0.5096229314804077} 11/07/2021 13:19:06 - INFO - __main__ - Step 114415: {'lr': 6.799778120175287e-05, 'samples': 21967680, 'steps': 114414, 'loss/train': 0.994030773639679} 11/07/2021 13:19:06 - INFO - __main__ - Step 114416: {'lr': 6.79941431102156e-05, 'samples': 21967872, 'steps': 114415, 'loss/train': 1.2395955324172974} 11/07/2021 13:19:07 - INFO - __main__ - Step 114417: {'lr': 6.799050510068733e-05, 'samples': 21968064, 'steps': 114416, 'loss/train': 1.5699938535690308} 11/07/2021 13:19:07 - INFO - __main__ - Step 114418: {'lr': 6.798686717316973e-05, 'samples': 21968256, 'steps': 114417, 'loss/train': 1.4535181522369385} 11/07/2021 13:19:08 - INFO - __main__ - Step 114419: {'lr': 6.798322932766446e-05, 'samples': 21968448, 'steps': 114418, 'loss/train': 1.2595101594924927} 11/07/2021 13:19:08 - INFO - __main__ - Step 114420: {'lr': 6.797959156417318e-05, 'samples': 21968640, 'steps': 114419, 'loss/train': 1.4898625612258911} 11/07/2021 13:19:09 - INFO - __main__ - Step 114421: {'lr': 6.797595388269745e-05, 'samples': 21968832, 'steps': 114420, 'loss/train': 1.2637734413146973} 11/07/2021 13:19:09 - INFO - __main__ - Step 114422: {'lr': 6.797231628323892e-05, 'samples': 21969024, 'steps': 114421, 'loss/train': 1.2508459091186523} 11/07/2021 13:19:10 - INFO - __main__ - Step 114423: {'lr': 6.796867876579926e-05, 'samples': 21969216, 'steps': 114422, 'loss/train': 1.0617058277130127} 11/07/2021 13:19:10 - INFO - __main__ - Step 114424: {'lr': 6.796504133038012e-05, 'samples': 21969408, 'steps': 114423, 'loss/train': 1.6005059480667114} 11/07/2021 13:19:11 - INFO - __main__ - Step 114425: {'lr': 6.796140397698311e-05, 'samples': 21969600, 'steps': 114424, 'loss/train': 1.2146368026733398} 11/07/2021 13:19:11 - INFO - __main__ - Step 114426: {'lr': 6.795776670560988e-05, 'samples': 21969792, 'steps': 114425, 'loss/train': 1.4087008237838745} 11/07/2021 13:19:12 - INFO - __main__ - Step 114427: {'lr': 6.795412951626206e-05, 'samples': 21969984, 'steps': 114426, 'loss/train': 1.3280378580093384} 11/07/2021 13:19:12 - INFO - __main__ - Step 114428: {'lr': 6.795049240894132e-05, 'samples': 21970176, 'steps': 114427, 'loss/train': 1.5395996570587158} 11/07/2021 13:19:12 - INFO - __main__ - Step 114429: {'lr': 6.794685538364928e-05, 'samples': 21970368, 'steps': 114428, 'loss/train': 1.6297246217727661} 11/07/2021 13:19:13 - INFO - __main__ - Step 114430: {'lr': 6.794321844038756e-05, 'samples': 21970560, 'steps': 114429, 'loss/train': 1.2783969640731812} 11/07/2021 13:19:14 - INFO - __main__ - Step 114431: {'lr': 6.793958157915784e-05, 'samples': 21970752, 'steps': 114430, 'loss/train': 1.3064095973968506} 11/07/2021 13:19:14 - INFO - __main__ - Step 114432: {'lr': 6.79359447999617e-05, 'samples': 21970944, 'steps': 114431, 'loss/train': 1.2595939636230469} 11/07/2021 13:19:14 - INFO - __main__ - Step 114433: {'lr': 6.793230810280093e-05, 'samples': 21971136, 'steps': 114432, 'loss/train': 1.5574284791946411} 11/07/2021 13:19:15 - INFO - __main__ - Step 114434: {'lr': 6.792867148767695e-05, 'samples': 21971328, 'steps': 114433, 'loss/train': 1.3378154039382935} 11/07/2021 13:19:16 - INFO - __main__ - Step 114435: {'lr': 6.792503495459152e-05, 'samples': 21971520, 'steps': 114434, 'loss/train': 1.1563547849655151} 11/07/2021 13:19:16 - INFO - __main__ - Step 114436: {'lr': 6.792139850354626e-05, 'samples': 21971712, 'steps': 114435, 'loss/train': 0.389203280210495} 11/07/2021 13:19:17 - INFO - __main__ - Step 114437: {'lr': 6.791776213454279e-05, 'samples': 21971904, 'steps': 114436, 'loss/train': 1.304323434829712} 11/07/2021 13:19:17 - INFO - __main__ - Step 114438: {'lr': 6.791412584758278e-05, 'samples': 21972096, 'steps': 114437, 'loss/train': 1.3076865673065186} 11/07/2021 13:19:17 - INFO - __main__ - Step 114439: {'lr': 6.791048964266786e-05, 'samples': 21972288, 'steps': 114438, 'loss/train': 1.4342527389526367} 11/07/2021 13:19:18 - INFO - __main__ - Step 114440: {'lr': 6.790685351979963e-05, 'samples': 21972480, 'steps': 114439, 'loss/train': 0.9982027411460876} 11/07/2021 13:19:19 - INFO - __main__ - Step 114441: {'lr': 6.790321747897979e-05, 'samples': 21972672, 'steps': 114440, 'loss/train': 1.0323045253753662} 11/07/2021 13:19:19 - INFO - __main__ - Step 114442: {'lr': 6.789958152020995e-05, 'samples': 21972864, 'steps': 114441, 'loss/train': 0.7706254124641418} 11/07/2021 13:19:20 - INFO - __main__ - Step 114443: {'lr': 6.789594564349175e-05, 'samples': 21973056, 'steps': 114442, 'loss/train': 1.198805332183838} 11/07/2021 13:19:20 - INFO - __main__ - Step 114444: {'lr': 6.789230984882683e-05, 'samples': 21973248, 'steps': 114443, 'loss/train': 1.1316590309143066} 11/07/2021 13:19:21 - INFO - __main__ - Step 114445: {'lr': 6.78886741362168e-05, 'samples': 21973440, 'steps': 114444, 'loss/train': 1.2516274452209473} 11/07/2021 13:19:21 - INFO - __main__ - Step 114446: {'lr': 6.788503850566336e-05, 'samples': 21973632, 'steps': 114445, 'loss/train': 1.2959258556365967} 11/07/2021 13:19:22 - INFO - __main__ - Step 114447: {'lr': 6.788140295716816e-05, 'samples': 21973824, 'steps': 114446, 'loss/train': 1.5695878267288208} 11/07/2021 13:19:22 - INFO - __main__ - Step 114448: {'lr': 6.787776749073271e-05, 'samples': 21974016, 'steps': 114447, 'loss/train': 1.391709804534912} 11/07/2021 13:19:22 - INFO - __main__ - Step 114449: {'lr': 6.787413210635874e-05, 'samples': 21974208, 'steps': 114448, 'loss/train': 1.4384334087371826} 11/07/2021 13:19:23 - INFO - __main__ - Step 114450: {'lr': 6.787049680404789e-05, 'samples': 21974400, 'steps': 114449, 'loss/train': 1.2577900886535645} 11/07/2021 13:19:24 - INFO - __main__ - Step 114451: {'lr': 6.786686158380176e-05, 'samples': 21974592, 'steps': 114450, 'loss/train': 1.2712599039077759} 11/07/2021 13:19:24 - INFO - __main__ - Step 114452: {'lr': 6.786322644562202e-05, 'samples': 21974784, 'steps': 114451, 'loss/train': 1.1123363971710205} 11/07/2021 13:19:24 - INFO - __main__ - Step 114453: {'lr': 6.785959138951028e-05, 'samples': 21974976, 'steps': 114452, 'loss/train': 1.345187783241272} 11/07/2021 13:19:25 - INFO - __main__ - Step 114454: {'lr': 6.785595641546825e-05, 'samples': 21975168, 'steps': 114453, 'loss/train': 1.106482744216919} 11/07/2021 13:19:26 - INFO - __main__ - Step 114455: {'lr': 6.785232152349746e-05, 'samples': 21975360, 'steps': 114454, 'loss/train': 0.36716029047966003} 11/07/2021 13:19:26 - INFO - __main__ - Step 114456: {'lr': 6.784868671359962e-05, 'samples': 21975552, 'steps': 114455, 'loss/train': 1.1762763261795044} 11/07/2021 13:19:27 - INFO - __main__ - Step 114457: {'lr': 6.784505198577637e-05, 'samples': 21975744, 'steps': 114456, 'loss/train': 1.018115520477295} 11/07/2021 13:19:27 - INFO - __main__ - Step 114458: {'lr': 6.784141734002939e-05, 'samples': 21975936, 'steps': 114457, 'loss/train': 1.2693490982055664} 11/07/2021 13:19:27 - INFO - __main__ - Step 114459: {'lr': 6.783778277636019e-05, 'samples': 21976128, 'steps': 114458, 'loss/train': 1.5010524988174438} 11/07/2021 13:19:28 - INFO - __main__ - Step 114460: {'lr': 6.783414829477044e-05, 'samples': 21976320, 'steps': 114459, 'loss/train': 1.2078351974487305} 11/07/2021 13:19:29 - INFO - __main__ - Step 114461: {'lr': 6.783051389526184e-05, 'samples': 21976512, 'steps': 114460, 'loss/train': 1.5879956483840942} 11/07/2021 13:19:29 - INFO - __main__ - Step 114462: {'lr': 6.7826879577836e-05, 'samples': 21976704, 'steps': 114461, 'loss/train': 1.2455061674118042} 11/07/2021 13:19:29 - INFO - __main__ - Step 114463: {'lr': 6.782324534249456e-05, 'samples': 21976896, 'steps': 114462, 'loss/train': 1.3918012380599976} 11/07/2021 13:19:30 - INFO - __main__ - Step 114464: {'lr': 6.781961118923916e-05, 'samples': 21977088, 'steps': 114463, 'loss/train': 1.5430216789245605} 11/07/2021 13:19:30 - INFO - __main__ - Step 114465: {'lr': 6.781597711807142e-05, 'samples': 21977280, 'steps': 114464, 'loss/train': 1.4374876022338867} 11/07/2021 13:19:31 - INFO - __main__ - Step 114466: {'lr': 6.781234312899299e-05, 'samples': 21977472, 'steps': 114465, 'loss/train': 1.6920148134231567} 11/07/2021 13:19:32 - INFO - __main__ - Step 114467: {'lr': 6.780870922200549e-05, 'samples': 21977664, 'steps': 114466, 'loss/train': 1.5680721998214722} 11/07/2021 13:19:32 - INFO - __main__ - Step 114468: {'lr': 6.780507539711058e-05, 'samples': 21977856, 'steps': 114467, 'loss/train': 1.747613787651062} 11/07/2021 13:19:32 - INFO - __main__ - Step 114469: {'lr': 6.780144165430999e-05, 'samples': 21978048, 'steps': 114468, 'loss/train': 1.5015138387680054} 11/07/2021 13:19:33 - INFO - __main__ - Step 114470: {'lr': 6.779780799360518e-05, 'samples': 21978240, 'steps': 114469, 'loss/train': 0.6079055070877075} 11/07/2021 13:19:34 - INFO - __main__ - Step 114471: {'lr': 6.779417441499786e-05, 'samples': 21978432, 'steps': 114470, 'loss/train': 1.294543981552124} 11/07/2021 13:19:34 - INFO - __main__ - Step 114472: {'lr': 6.779054091848966e-05, 'samples': 21978624, 'steps': 114471, 'loss/train': 1.2269760370254517} 11/07/2021 13:19:34 - INFO - __main__ - Step 114473: {'lr': 6.778690750408226e-05, 'samples': 21978816, 'steps': 114472, 'loss/train': 1.514197587966919} 11/07/2021 13:19:35 - INFO - __main__ - Step 114474: {'lr': 6.778327417177724e-05, 'samples': 21979008, 'steps': 114473, 'loss/train': 1.640023946762085} 11/07/2021 13:19:35 - INFO - __main__ - Step 114475: {'lr': 6.777964092157626e-05, 'samples': 21979200, 'steps': 114474, 'loss/train': 1.4843302965164185} 11/07/2021 13:19:36 - INFO - __main__ - Step 114476: {'lr': 6.777600775348097e-05, 'samples': 21979392, 'steps': 114475, 'loss/train': 1.4577823877334595} 11/07/2021 13:19:36 - INFO - __main__ - Step 114477: {'lr': 6.777237466749304e-05, 'samples': 21979584, 'steps': 114476, 'loss/train': 1.293402075767517} 11/07/2021 13:19:37 - INFO - __main__ - Step 114478: {'lr': 6.776874166361402e-05, 'samples': 21979776, 'steps': 114477, 'loss/train': 1.078452467918396} 11/07/2021 13:19:37 - INFO - __main__ - Step 114479: {'lr': 6.77651087418456e-05, 'samples': 21979968, 'steps': 114478, 'loss/train': 1.2752870321273804} 11/07/2021 13:19:37 - INFO - __main__ - Step 114480: {'lr': 6.776147590218947e-05, 'samples': 21980160, 'steps': 114479, 'loss/train': 0.45789235830307007} 11/07/2021 13:19:39 - INFO - __main__ - Step 114481: {'lr': 6.775784314464717e-05, 'samples': 21980352, 'steps': 114480, 'loss/train': 1.2785108089447021} 11/07/2021 13:19:39 - INFO - __main__ - Step 114482: {'lr': 6.775421046922034e-05, 'samples': 21980544, 'steps': 114481, 'loss/train': 1.3770172595977783} 11/07/2021 13:19:39 - INFO - __main__ - Step 114483: {'lr': 6.775057787591069e-05, 'samples': 21980736, 'steps': 114482, 'loss/train': 1.528590202331543} 11/07/2021 13:19:40 - INFO - __main__ - Step 114484: {'lr': 6.774694536471979e-05, 'samples': 21980928, 'steps': 114483, 'loss/train': 1.5549256801605225} 11/07/2021 13:19:40 - INFO - __main__ - Step 114485: {'lr': 6.774331293564931e-05, 'samples': 21981120, 'steps': 114484, 'loss/train': 0.7486491799354553} 11/07/2021 13:19:41 - INFO - __main__ - Step 114486: {'lr': 6.773968058870086e-05, 'samples': 21981312, 'steps': 114485, 'loss/train': 1.647523283958435} 11/07/2021 13:19:41 - INFO - __main__ - Step 114487: {'lr': 6.773604832387611e-05, 'samples': 21981504, 'steps': 114486, 'loss/train': 0.9689453840255737} 11/07/2021 13:19:42 - INFO - __main__ - Step 114488: {'lr': 6.77324161411767e-05, 'samples': 21981696, 'steps': 114487, 'loss/train': 1.2822915315628052} 11/07/2021 13:19:42 - INFO - __main__ - Step 114489: {'lr': 6.772878404060424e-05, 'samples': 21981888, 'steps': 114488, 'loss/train': 1.1263502836227417} 11/07/2021 13:19:42 - INFO - __main__ - Step 114490: {'lr': 6.772515202216037e-05, 'samples': 21982080, 'steps': 114489, 'loss/train': 1.3756532669067383} 11/07/2021 13:19:44 - INFO - __main__ - Step 114491: {'lr': 6.772152008584681e-05, 'samples': 21982272, 'steps': 114490, 'loss/train': 1.033660650253296} 11/07/2021 13:19:44 - INFO - __main__ - Step 114492: {'lr': 6.771788823166505e-05, 'samples': 21982464, 'steps': 114491, 'loss/train': 1.3663212060928345} 11/07/2021 13:19:44 - INFO - __main__ - Step 114493: {'lr': 6.771425645961682e-05, 'samples': 21982656, 'steps': 114492, 'loss/train': 1.4104164838790894} 11/07/2021 13:19:45 - INFO - __main__ - Step 114494: {'lr': 6.771062476970372e-05, 'samples': 21982848, 'steps': 114493, 'loss/train': 1.2529217004776} 11/07/2021 13:19:45 - INFO - __main__ - Step 114495: {'lr': 6.770699316192738e-05, 'samples': 21983040, 'steps': 114494, 'loss/train': 1.5320043563842773} 11/07/2021 13:19:46 - INFO - __main__ - Step 114496: {'lr': 6.770336163628946e-05, 'samples': 21983232, 'steps': 114495, 'loss/train': 1.1530539989471436} 11/07/2021 13:19:46 - INFO - __main__ - Step 114497: {'lr': 6.76997301927916e-05, 'samples': 21983424, 'steps': 114496, 'loss/train': 1.5224522352218628} 11/07/2021 13:19:47 - INFO - __main__ - Step 114498: {'lr': 6.769609883143544e-05, 'samples': 21983616, 'steps': 114497, 'loss/train': 1.0200389623641968} 11/07/2021 13:19:47 - INFO - __main__ - Step 114499: {'lr': 6.769246755222258e-05, 'samples': 21983808, 'steps': 114498, 'loss/train': 1.4187146425247192} 11/07/2021 13:19:47 - INFO - __main__ - Step 114500: {'lr': 6.768883635515468e-05, 'samples': 21984000, 'steps': 114499, 'loss/train': 1.3368037939071655} 11/07/2021 13:19:48 - INFO - __main__ - Step 114501: {'lr': 6.768520524023347e-05, 'samples': 21984192, 'steps': 114500, 'loss/train': 1.3774508237838745} 11/07/2021 13:19:49 - INFO - __main__ - Step 114502: {'lr': 6.768157420746043e-05, 'samples': 21984384, 'steps': 114501, 'loss/train': 1.6675195693969727} 11/07/2021 13:19:49 - INFO - __main__ - Step 114503: {'lr': 6.767794325683724e-05, 'samples': 21984576, 'steps': 114502, 'loss/train': 1.2835235595703125} 11/07/2021 13:19:49 - INFO - __main__ - Step 114504: {'lr': 6.767431238836554e-05, 'samples': 21984768, 'steps': 114503, 'loss/train': 1.4874298572540283} 11/07/2021 13:19:50 - INFO - __main__ - Step 114505: {'lr': 6.7670681602047e-05, 'samples': 21984960, 'steps': 114504, 'loss/train': 1.7558375597000122} 11/07/2021 13:19:51 - INFO - __main__ - Step 114506: {'lr': 6.766705089788325e-05, 'samples': 21985152, 'steps': 114505, 'loss/train': 1.2478816509246826} 11/07/2021 13:19:52 - INFO - __main__ - Step 114507: {'lr': 6.766342027587592e-05, 'samples': 21985344, 'steps': 114506, 'loss/train': 1.388999581336975} 11/07/2021 13:19:52 - INFO - __main__ - Step 114508: {'lr': 6.76597897360266e-05, 'samples': 21985536, 'steps': 114507, 'loss/train': 1.3800699710845947} 11/07/2021 13:19:52 - INFO - __main__ - Step 114509: {'lr': 6.765615927833698e-05, 'samples': 21985728, 'steps': 114508, 'loss/train': 0.9162271618843079} 11/07/2021 13:19:53 - INFO - __main__ - Step 114510: {'lr': 6.765252890280868e-05, 'samples': 21985920, 'steps': 114509, 'loss/train': 1.4591412544250488} 11/07/2021 13:19:53 - INFO - __main__ - Step 114511: {'lr': 6.764889860944334e-05, 'samples': 21986112, 'steps': 114510, 'loss/train': 0.6901004910469055} 11/07/2021 13:19:54 - INFO - __main__ - Step 114512: {'lr': 6.764526839824262e-05, 'samples': 21986304, 'steps': 114511, 'loss/train': 1.503389596939087} 11/07/2021 13:19:55 - INFO - __main__ - Step 114513: {'lr': 6.764163826920807e-05, 'samples': 21986496, 'steps': 114512, 'loss/train': 1.3358705043792725} 11/07/2021 13:19:55 - INFO - __main__ - Step 114514: {'lr': 6.76380082223415e-05, 'samples': 21986688, 'steps': 114513, 'loss/train': 0.493132084608078} 11/07/2021 13:19:55 - INFO - __main__ - Step 114515: {'lr': 6.763437825764435e-05, 'samples': 21986880, 'steps': 114514, 'loss/train': 1.5190021991729736} 11/07/2021 13:19:56 - INFO - __main__ - Step 114516: {'lr': 6.763074837511835e-05, 'samples': 21987072, 'steps': 114515, 'loss/train': 0.8139083385467529} 11/07/2021 13:19:56 - INFO - __main__ - Step 114517: {'lr': 6.762711857476509e-05, 'samples': 21987264, 'steps': 114516, 'loss/train': 1.2792004346847534} 11/07/2021 13:19:57 - INFO - __main__ - Step 114518: {'lr': 6.762348885658626e-05, 'samples': 21987456, 'steps': 114517, 'loss/train': 1.8717042207717896} 11/07/2021 13:19:57 - INFO - __main__ - Step 114519: {'lr': 6.761985922058344e-05, 'samples': 21987648, 'steps': 114518, 'loss/train': 1.6791871786117554} 11/07/2021 13:19:58 - INFO - __main__ - Step 114520: {'lr': 6.761622966675832e-05, 'samples': 21987840, 'steps': 114519, 'loss/train': 1.531240463256836} 11/07/2021 13:19:58 - INFO - __main__ - Step 114521: {'lr': 6.761260019511251e-05, 'samples': 21988032, 'steps': 114520, 'loss/train': 1.75594162940979} 11/07/2021 13:19:59 - INFO - __main__ - Step 114522: {'lr': 6.760897080564766e-05, 'samples': 21988224, 'steps': 114521, 'loss/train': 1.341163992881775} 11/07/2021 13:20:00 - INFO - __main__ - Step 114523: {'lr': 6.760534149836537e-05, 'samples': 21988416, 'steps': 114522, 'loss/train': 1.232574701309204} 11/07/2021 13:20:00 - INFO - __main__ - Step 114524: {'lr': 6.760171227326731e-05, 'samples': 21988608, 'steps': 114523, 'loss/train': 0.7449004054069519} 11/07/2021 13:20:00 - INFO - __main__ - Step 114525: {'lr': 6.75980831303551e-05, 'samples': 21988800, 'steps': 114524, 'loss/train': 1.3261102437973022} 11/07/2021 13:20:01 - INFO - __main__ - Step 114526: {'lr': 6.759445406963038e-05, 'samples': 21988992, 'steps': 114525, 'loss/train': 1.3358646631240845} 11/07/2021 13:20:02 - INFO - __main__ - Step 114527: {'lr': 6.759082509109488e-05, 'samples': 21989184, 'steps': 114526, 'loss/train': 1.1436160802841187} 11/07/2021 13:20:02 - INFO - __main__ - Step 114528: {'lr': 6.758719619475004e-05, 'samples': 21989376, 'steps': 114527, 'loss/train': 1.3103315830230713} 11/07/2021 13:20:03 - INFO - __main__ - Step 114529: {'lr': 6.758356738059759e-05, 'samples': 21989568, 'steps': 114528, 'loss/train': 1.0798652172088623} 11/07/2021 13:20:03 - INFO - __main__ - Step 114530: {'lr': 6.757993864863917e-05, 'samples': 21989760, 'steps': 114529, 'loss/train': 1.556368350982666} 11/07/2021 13:20:03 - INFO - __main__ - Step 114531: {'lr': 6.757630999887643e-05, 'samples': 21989952, 'steps': 114530, 'loss/train': 1.5016250610351562} 11/07/2021 13:20:04 - INFO - __main__ - Step 114532: {'lr': 6.757268143131098e-05, 'samples': 21990144, 'steps': 114531, 'loss/train': 2.0862300395965576} 11/07/2021 13:20:05 - INFO - __main__ - Step 114533: {'lr': 6.756905294594448e-05, 'samples': 21990336, 'steps': 114532, 'loss/train': 0.49318888783454895} 11/07/2021 13:20:05 - INFO - __main__ - Step 114534: {'lr': 6.756542454277853e-05, 'samples': 21990528, 'steps': 114533, 'loss/train': 1.1033742427825928} 11/07/2021 13:20:06 - INFO - __main__ - Step 114535: {'lr': 6.75617962218148e-05, 'samples': 21990720, 'steps': 114534, 'loss/train': 1.5626096725463867} 11/07/2021 13:20:06 - INFO - __main__ - Step 114536: {'lr': 6.755816798305492e-05, 'samples': 21990912, 'steps': 114535, 'loss/train': 3.608077049255371} 11/07/2021 13:20:06 - INFO - __main__ - Step 114537: {'lr': 6.755453982650047e-05, 'samples': 21991104, 'steps': 114536, 'loss/train': 1.5040740966796875} 11/07/2021 13:20:07 - INFO - __main__ - Step 114538: {'lr': 6.755091175215316e-05, 'samples': 21991296, 'steps': 114537, 'loss/train': 0.702882707118988} 11/07/2021 13:20:08 - INFO - __main__ - Step 114539: {'lr': 6.75472837600146e-05, 'samples': 21991488, 'steps': 114538, 'loss/train': 1.650857925415039} 11/07/2021 13:20:08 - INFO - __main__ - Step 114540: {'lr': 6.75436558500864e-05, 'samples': 21991680, 'steps': 114539, 'loss/train': 1.6739729642868042} 11/07/2021 13:20:09 - INFO - __main__ - Step 114541: {'lr': 6.75400280223703e-05, 'samples': 21991872, 'steps': 114540, 'loss/train': 0.9482312798500061} 11/07/2021 13:20:09 - INFO - __main__ - Step 114542: {'lr': 6.753640027686778e-05, 'samples': 21992064, 'steps': 114541, 'loss/train': 1.865362286567688} 11/07/2021 13:20:09 - INFO - __main__ - Step 114543: {'lr': 6.753277261358054e-05, 'samples': 21992256, 'steps': 114542, 'loss/train': 1.2126516103744507} 11/07/2021 13:20:10 - INFO - __main__ - Step 114544: {'lr': 6.752914503251021e-05, 'samples': 21992448, 'steps': 114543, 'loss/train': 0.7975401878356934} 11/07/2021 13:20:11 - INFO - __main__ - Step 114545: {'lr': 6.752551753365843e-05, 'samples': 21992640, 'steps': 114544, 'loss/train': 1.5032191276550293} 11/07/2021 13:20:11 - INFO - __main__ - Step 114546: {'lr': 6.752189011702683e-05, 'samples': 21992832, 'steps': 114545, 'loss/train': 0.9514973163604736} 11/07/2021 13:20:11 - INFO - __main__ - Step 114547: {'lr': 6.751826278261705e-05, 'samples': 21993024, 'steps': 114546, 'loss/train': 1.7028056383132935} 11/07/2021 13:20:12 - INFO - __main__ - Step 114548: {'lr': 6.751463553043075e-05, 'samples': 21993216, 'steps': 114547, 'loss/train': 1.2782909870147705} 11/07/2021 13:20:13 - INFO - __main__ - Step 114549: {'lr': 6.75110083604695e-05, 'samples': 21993408, 'steps': 114548, 'loss/train': 1.8722180128097534} 11/07/2021 13:20:14 - INFO - __main__ - Step 114550: {'lr': 6.750738127273501e-05, 'samples': 21993600, 'steps': 114549, 'loss/train': 1.260642170906067} 11/07/2021 13:20:14 - INFO - __main__ - Step 114551: {'lr': 6.750375426722886e-05, 'samples': 21993792, 'steps': 114550, 'loss/train': 0.43982043862342834} 11/07/2021 13:20:14 - INFO - __main__ - Step 114552: {'lr': 6.75001273439527e-05, 'samples': 21993984, 'steps': 114551, 'loss/train': 0.8228330016136169} 11/07/2021 13:20:15 - INFO - __main__ - Step 114553: {'lr': 6.749650050290818e-05, 'samples': 21994176, 'steps': 114552, 'loss/train': 1.3460779190063477} 11/07/2021 13:20:15 - INFO - __main__ - Step 114554: {'lr': 6.749287374409698e-05, 'samples': 21994368, 'steps': 114553, 'loss/train': 1.4073880910873413} 11/07/2021 13:20:15 - INFO - __main__ - Step 114555: {'lr': 6.748924706752061e-05, 'samples': 21994560, 'steps': 114554, 'loss/train': 1.7429628372192383} 11/07/2021 13:20:17 - INFO - __main__ - Step 114556: {'lr': 6.748562047318076e-05, 'samples': 21994752, 'steps': 114555, 'loss/train': 1.704080581665039} 11/07/2021 13:20:17 - INFO - __main__ - Step 114557: {'lr': 6.748199396107909e-05, 'samples': 21994944, 'steps': 114556, 'loss/train': 1.4017585515975952} 11/07/2021 13:20:17 - INFO - __main__ - Step 114558: {'lr': 6.747836753121719e-05, 'samples': 21995136, 'steps': 114557, 'loss/train': 1.410822868347168} 11/07/2021 13:20:18 - INFO - __main__ - Step 114559: {'lr': 6.747474118359675e-05, 'samples': 21995328, 'steps': 114558, 'loss/train': 1.349172830581665} 11/07/2021 13:20:18 - INFO - __main__ - Step 114560: {'lr': 6.747111491821937e-05, 'samples': 21995520, 'steps': 114559, 'loss/train': 1.5823982954025269} 11/07/2021 13:20:19 - INFO - __main__ - Step 114561: {'lr': 6.746748873508669e-05, 'samples': 21995712, 'steps': 114560, 'loss/train': 1.5106909275054932} 11/07/2021 13:20:19 - INFO - __main__ - Step 114562: {'lr': 6.746386263420032e-05, 'samples': 21995904, 'steps': 114561, 'loss/train': 1.2026026248931885} 11/07/2021 13:20:20 - INFO - __main__ - Step 114563: {'lr': 6.746023661556194e-05, 'samples': 21996096, 'steps': 114562, 'loss/train': 1.5607811212539673} 11/07/2021 13:20:20 - INFO - __main__ - Step 114564: {'lr': 6.745661067917314e-05, 'samples': 21996288, 'steps': 114563, 'loss/train': 1.186659336090088} 11/07/2021 13:20:20 - INFO - __main__ - Step 114565: {'lr': 6.745298482503559e-05, 'samples': 21996480, 'steps': 114564, 'loss/train': 1.2158758640289307} 11/07/2021 13:20:21 - INFO - __main__ - Step 114566: {'lr': 6.744935905315091e-05, 'samples': 21996672, 'steps': 114565, 'loss/train': 1.3266949653625488} 11/07/2021 13:20:22 - INFO - __main__ - Step 114567: {'lr': 6.744573336352072e-05, 'samples': 21996864, 'steps': 114566, 'loss/train': 1.4020475149154663} 11/07/2021 13:20:22 - INFO - __main__ - Step 114568: {'lr': 6.744210775614676e-05, 'samples': 21997056, 'steps': 114567, 'loss/train': 1.3756814002990723} 11/07/2021 13:20:22 - INFO - __main__ - Step 114569: {'lr': 6.743848223103047e-05, 'samples': 21997248, 'steps': 114568, 'loss/train': 1.2069790363311768} 11/07/2021 13:20:23 - INFO - __main__ - Step 114570: {'lr': 6.74348567881736e-05, 'samples': 21997440, 'steps': 114569, 'loss/train': 1.0407356023788452} 11/07/2021 13:20:23 - INFO - __main__ - Step 114571: {'lr': 6.743123142757776e-05, 'samples': 21997632, 'steps': 114570, 'loss/train': 1.1469274759292603} 11/07/2021 13:20:24 - INFO - __main__ - Step 114572: {'lr': 6.74276061492446e-05, 'samples': 21997824, 'steps': 114571, 'loss/train': 1.3162389993667603} 11/07/2021 13:20:24 - INFO - __main__ - Step 114573: {'lr': 6.742398095317573e-05, 'samples': 21998016, 'steps': 114572, 'loss/train': 1.2092236280441284} 11/07/2021 13:20:25 - INFO - __main__ - Step 114574: {'lr': 6.742035583937278e-05, 'samples': 21998208, 'steps': 114573, 'loss/train': 1.4419928789138794} 11/07/2021 13:20:25 - INFO - __main__ - Step 114575: {'lr': 6.741673080783742e-05, 'samples': 21998400, 'steps': 114574, 'loss/train': 2.090238571166992} 11/07/2021 13:20:26 - INFO - __main__ - Step 114576: {'lr': 6.741310585857127e-05, 'samples': 21998592, 'steps': 114575, 'loss/train': 2.5609328746795654} 11/07/2021 13:20:27 - INFO - __main__ - Step 114577: {'lr': 6.740948099157596e-05, 'samples': 21998784, 'steps': 114576, 'loss/train': 1.9717737436294556} 11/07/2021 13:20:27 - INFO - __main__ - Step 114578: {'lr': 6.740585620685311e-05, 'samples': 21998976, 'steps': 114577, 'loss/train': 0.8772927522659302} 11/07/2021 13:20:27 - INFO - __main__ - Step 114579: {'lr': 6.740223150440435e-05, 'samples': 21999168, 'steps': 114578, 'loss/train': 1.179053783416748} 11/07/2021 13:20:28 - INFO - __main__ - Step 114580: {'lr': 6.739860688423135e-05, 'samples': 21999360, 'steps': 114579, 'loss/train': 1.369741439819336} 11/07/2021 13:20:28 - INFO - __main__ - Step 114581: {'lr': 6.739498234633579e-05, 'samples': 21999552, 'steps': 114580, 'loss/train': 1.444332242012024} 11/07/2021 13:20:29 - INFO - __main__ - Step 114582: {'lr': 6.739135789071915e-05, 'samples': 21999744, 'steps': 114581, 'loss/train': 1.2789620161056519} 11/07/2021 13:20:29 - INFO - __main__ - Step 114583: {'lr': 6.738773351738317e-05, 'samples': 21999936, 'steps': 114582, 'loss/train': 1.3952462673187256} 11/07/2021 13:20:30 - INFO - __main__ - Step 114584: {'lr': 6.738410922632943e-05, 'samples': 22000128, 'steps': 114583, 'loss/train': 1.2695502042770386} 11/07/2021 13:20:30 - INFO - __main__ - Step 114585: {'lr': 6.738048501755961e-05, 'samples': 22000320, 'steps': 114584, 'loss/train': 1.7382241487503052} 11/07/2021 13:20:30 - INFO - __main__ - Step 114586: {'lr': 6.73768608910753e-05, 'samples': 22000512, 'steps': 114585, 'loss/train': 1.1086727380752563} 11/07/2021 13:20:31 - INFO - __main__ - Step 114587: {'lr': 6.737323684687818e-05, 'samples': 22000704, 'steps': 114586, 'loss/train': 1.2913247346878052} 11/07/2021 13:20:32 - INFO - __main__ - Step 114588: {'lr': 6.736961288496988e-05, 'samples': 22000896, 'steps': 114587, 'loss/train': 0.7948440909385681} 11/07/2021 13:20:32 - INFO - __main__ - Step 114589: {'lr': 6.736598900535198e-05, 'samples': 22001088, 'steps': 114588, 'loss/train': 1.2270281314849854} 11/07/2021 13:20:32 - INFO - __main__ - Step 114590: {'lr': 6.736236520802616e-05, 'samples': 22001280, 'steps': 114589, 'loss/train': 1.661998987197876} 11/07/2021 13:20:33 - INFO - __main__ - Step 114591: {'lr': 6.735874149299404e-05, 'samples': 22001472, 'steps': 114590, 'loss/train': 1.2482160329818726} 11/07/2021 13:20:33 - INFO - __main__ - Step 114592: {'lr': 6.735511786025725e-05, 'samples': 22001664, 'steps': 114591, 'loss/train': 1.5063284635543823} 11/07/2021 13:20:34 - INFO - __main__ - Step 114593: {'lr': 6.735149430981743e-05, 'samples': 22001856, 'steps': 114592, 'loss/train': 1.6711466312408447} 11/07/2021 13:20:35 - INFO - __main__ - Step 114594: {'lr': 6.73478708416762e-05, 'samples': 22002048, 'steps': 114593, 'loss/train': 0.9312669634819031} 11/07/2021 13:20:35 - INFO - __main__ - Step 114595: {'lr': 6.734424745583528e-05, 'samples': 22002240, 'steps': 114594, 'loss/train': 0.8157418966293335} 11/07/2021 13:20:35 - INFO - __main__ - Step 114596: {'lr': 6.734062415229616e-05, 'samples': 22002432, 'steps': 114595, 'loss/train': 1.1271498203277588} 11/07/2021 13:20:36 - INFO - __main__ - Step 114597: {'lr': 6.733700093106055e-05, 'samples': 22002624, 'steps': 114596, 'loss/train': 1.3777351379394531} 11/07/2021 13:20:37 - INFO - __main__ - Step 114598: {'lr': 6.733337779213003e-05, 'samples': 22002816, 'steps': 114597, 'loss/train': 1.875518560409546} 11/07/2021 13:20:37 - INFO - __main__ - Step 114599: {'lr': 6.732975473550629e-05, 'samples': 22003008, 'steps': 114598, 'loss/train': 1.3200161457061768} 11/07/2021 13:20:37 - INFO - __main__ - Step 114600: {'lr': 6.732613176119093e-05, 'samples': 22003200, 'steps': 114599, 'loss/train': 1.5245753526687622} 11/07/2021 13:20:38 - INFO - __main__ - Step 114601: {'lr': 6.732250886918562e-05, 'samples': 22003392, 'steps': 114600, 'loss/train': 1.317897915840149} 11/07/2021 13:20:38 - INFO - __main__ - Step 114602: {'lr': 6.731888605949197e-05, 'samples': 22003584, 'steps': 114601, 'loss/train': 1.6130075454711914} 11/07/2021 13:20:39 - INFO - __main__ - Step 114603: {'lr': 6.731526333211157e-05, 'samples': 22003776, 'steps': 114602, 'loss/train': 1.299340009689331} 11/07/2021 13:20:40 - INFO - __main__ - Step 114604: {'lr': 6.731164068704612e-05, 'samples': 22003968, 'steps': 114603, 'loss/train': 1.177851915359497} 11/07/2021 13:20:40 - INFO - __main__ - Step 114605: {'lr': 6.730801812429724e-05, 'samples': 22004160, 'steps': 114604, 'loss/train': 1.1340137720108032} 11/07/2021 13:20:40 - INFO - __main__ - Step 114606: {'lr': 6.730439564386654e-05, 'samples': 22004352, 'steps': 114605, 'loss/train': 1.67519211769104} 11/07/2021 13:20:41 - INFO - __main__ - Step 114607: {'lr': 6.730077324575564e-05, 'samples': 22004544, 'steps': 114606, 'loss/train': 1.3728468418121338} 11/07/2021 13:20:42 - INFO - __main__ - Step 114608: {'lr': 6.72971509299663e-05, 'samples': 22004736, 'steps': 114607, 'loss/train': 1.1583874225616455} 11/07/2021 13:20:42 - INFO - __main__ - Step 114609: {'lr': 6.729352869649993e-05, 'samples': 22004928, 'steps': 114608, 'loss/train': 1.522826910018921} 11/07/2021 13:20:42 - INFO - __main__ - Step 114610: {'lr': 6.72899065453583e-05, 'samples': 22005120, 'steps': 114609, 'loss/train': 1.2469854354858398} 11/07/2021 13:20:43 - INFO - __main__ - Step 114611: {'lr': 6.728628447654304e-05, 'samples': 22005312, 'steps': 114610, 'loss/train': 1.1278743743896484} 11/07/2021 13:20:43 - INFO - __main__ - Step 114612: {'lr': 6.728266249005572e-05, 'samples': 22005504, 'steps': 114611, 'loss/train': 1.7470577955245972} 11/07/2021 13:20:44 - INFO - __main__ - Step 114613: {'lr': 6.727904058589804e-05, 'samples': 22005696, 'steps': 114612, 'loss/train': 1.5908610820770264} 11/07/2021 13:20:45 - INFO - __main__ - Step 114614: {'lr': 6.727541876407159e-05, 'samples': 22005888, 'steps': 114613, 'loss/train': 1.526936411857605} 11/07/2021 13:20:45 - INFO - __main__ - Step 114615: {'lr': 6.7271797024578e-05, 'samples': 22006080, 'steps': 114614, 'loss/train': 1.211879849433899} 11/07/2021 13:20:45 - INFO - __main__ - Step 114616: {'lr': 6.726817536741894e-05, 'samples': 22006272, 'steps': 114615, 'loss/train': 1.1182650327682495} 11/07/2021 13:20:46 - INFO - __main__ - Step 114617: {'lr': 6.726455379259602e-05, 'samples': 22006464, 'steps': 114616, 'loss/train': 1.5291298627853394} 11/07/2021 13:20:47 - INFO - __main__ - Step 114618: {'lr': 6.726093230011088e-05, 'samples': 22006656, 'steps': 114617, 'loss/train': 1.5135530233383179} 11/07/2021 13:20:47 - INFO - __main__ - Step 114619: {'lr': 6.725731088996515e-05, 'samples': 22006848, 'steps': 114618, 'loss/train': 1.027336597442627} 11/07/2021 13:20:47 - INFO - __main__ - Step 114620: {'lr': 6.725368956216044e-05, 'samples': 22007040, 'steps': 114619, 'loss/train': 1.0406299829483032} 11/07/2021 13:20:48 - INFO - __main__ - Step 114621: {'lr': 6.725006831669839e-05, 'samples': 22007232, 'steps': 114620, 'loss/train': 0.712893545627594} 11/07/2021 13:20:48 - INFO - __main__ - Step 114622: {'lr': 6.724644715358072e-05, 'samples': 22007424, 'steps': 114621, 'loss/train': 1.3909587860107422} 11/07/2021 13:20:48 - INFO - __main__ - Step 114623: {'lr': 6.724282607280893e-05, 'samples': 22007616, 'steps': 114622, 'loss/train': 1.1667665243148804} 11/07/2021 13:20:50 - INFO - __main__ - Step 114624: {'lr': 6.723920507438466e-05, 'samples': 22007808, 'steps': 114623, 'loss/train': 1.3206132650375366} 11/07/2021 13:20:50 - INFO - __main__ - Step 114625: {'lr': 6.723558415830963e-05, 'samples': 22008000, 'steps': 114624, 'loss/train': 1.279835820198059} 11/07/2021 13:20:50 - INFO - __main__ - Step 114626: {'lr': 6.723196332458539e-05, 'samples': 22008192, 'steps': 114625, 'loss/train': 1.689650535583496} 11/07/2021 13:20:51 - INFO - __main__ - Step 114627: {'lr': 6.722834257321361e-05, 'samples': 22008384, 'steps': 114626, 'loss/train': 1.6249513626098633} 11/07/2021 13:20:51 - INFO - __main__ - Step 114628: {'lr': 6.722472190419593e-05, 'samples': 22008576, 'steps': 114627, 'loss/train': 0.5695286989212036} 11/07/2021 13:20:52 - INFO - __main__ - Step 114629: {'lr': 6.722110131753398e-05, 'samples': 22008768, 'steps': 114628, 'loss/train': 1.8023359775543213} 11/07/2021 13:20:52 - INFO - __main__ - Step 114630: {'lr': 6.721748081322938e-05, 'samples': 22008960, 'steps': 114629, 'loss/train': 1.2760142087936401} 11/07/2021 13:20:53 - INFO - __main__ - Step 114631: {'lr': 6.721386039128374e-05, 'samples': 22009152, 'steps': 114630, 'loss/train': 1.2977832555770874} 11/07/2021 13:20:53 - INFO - __main__ - Step 114632: {'lr': 6.721024005169874e-05, 'samples': 22009344, 'steps': 114631, 'loss/train': 0.7937024235725403} 11/07/2021 13:20:53 - INFO - __main__ - Step 114633: {'lr': 6.720661979447595e-05, 'samples': 22009536, 'steps': 114632, 'loss/train': 0.9877071976661682} 11/07/2021 13:20:54 - INFO - __main__ - Step 114634: {'lr': 6.720299961961707e-05, 'samples': 22009728, 'steps': 114633, 'loss/train': 1.531469702720642} 11/07/2021 13:20:55 - INFO - __main__ - Step 114635: {'lr': 6.719937952712376e-05, 'samples': 22009920, 'steps': 114634, 'loss/train': 1.6633917093276978} 11/07/2021 13:20:55 - INFO - __main__ - Step 114636: {'lr': 6.71957595169975e-05, 'samples': 22010112, 'steps': 114635, 'loss/train': 0.793513298034668} 11/07/2021 13:20:56 - INFO - __main__ - Step 114637: {'lr': 6.719213958924003e-05, 'samples': 22010304, 'steps': 114636, 'loss/train': 1.07709538936615} 11/07/2021 13:20:56 - INFO - __main__ - Step 114638: {'lr': 6.718851974385296e-05, 'samples': 22010496, 'steps': 114637, 'loss/train': 1.297031044960022} 11/07/2021 13:20:57 - INFO - __main__ - Step 114639: {'lr': 6.71848999808379e-05, 'samples': 22010688, 'steps': 114638, 'loss/train': 1.2523696422576904} 11/07/2021 13:20:58 - INFO - __main__ - Step 114640: {'lr': 6.718128030019651e-05, 'samples': 22010880, 'steps': 114639, 'loss/train': 0.8614270687103271} 11/07/2021 13:20:58 - INFO - __main__ - Step 114641: {'lr': 6.717766070193043e-05, 'samples': 22011072, 'steps': 114640, 'loss/train': 1.3725706338882446} 11/07/2021 13:20:58 - INFO - __main__ - Step 114642: {'lr': 6.717404118604129e-05, 'samples': 22011264, 'steps': 114641, 'loss/train': 0.46247342228889465} 11/07/2021 13:20:59 - INFO - __main__ - Step 114643: {'lr': 6.717042175253068e-05, 'samples': 22011456, 'steps': 114642, 'loss/train': 1.3354482650756836} 11/07/2021 13:21:00 - INFO - __main__ - Step 114644: {'lr': 6.716680240140025e-05, 'samples': 22011648, 'steps': 114643, 'loss/train': 1.4394594430923462} 11/07/2021 13:21:00 - INFO - __main__ - Step 114645: {'lr': 6.716318313265166e-05, 'samples': 22011840, 'steps': 114644, 'loss/train': 1.2137805223464966} 11/07/2021 13:21:01 - INFO - __main__ - Step 114646: {'lr': 6.71595639462865e-05, 'samples': 22012032, 'steps': 114645, 'loss/train': 1.5043808221817017} 11/07/2021 13:21:01 - INFO - __main__ - Step 114647: {'lr': 6.715594484230645e-05, 'samples': 22012224, 'steps': 114646, 'loss/train': 1.7514243125915527} 11/07/2021 13:21:01 - INFO - __main__ - Step 114648: {'lr': 6.715232582071315e-05, 'samples': 22012416, 'steps': 114647, 'loss/train': 1.3413200378417969} 11/07/2021 13:21:02 - INFO - __main__ - Step 114649: {'lr': 6.714870688150812e-05, 'samples': 22012608, 'steps': 114648, 'loss/train': 0.8300752639770508} 11/07/2021 13:21:02 - INFO - __main__ - Step 114650: {'lr': 6.714508802469308e-05, 'samples': 22012800, 'steps': 114649, 'loss/train': 1.2605456113815308} 11/07/2021 13:21:04 - INFO - __main__ - Step 114651: {'lr': 6.714146925026962e-05, 'samples': 22012992, 'steps': 114650, 'loss/train': 1.2051100730895996} 11/07/2021 13:21:05 - INFO - __main__ - Step 114652: {'lr': 6.71378505582394e-05, 'samples': 22013184, 'steps': 114651, 'loss/train': 1.4003971815109253} 11/07/2021 13:21:05 - INFO - __main__ - Step 114653: {'lr': 6.713423194860405e-05, 'samples': 22013376, 'steps': 114652, 'loss/train': 1.4949617385864258} 11/07/2021 13:21:06 - INFO - __main__ - Step 114654: {'lr': 6.713061342136517e-05, 'samples': 22013568, 'steps': 114653, 'loss/train': 1.383083462715149} 11/07/2021 13:21:06 - INFO - __main__ - Step 114655: {'lr': 6.712699497652444e-05, 'samples': 22013760, 'steps': 114654, 'loss/train': 1.4042772054672241} 11/07/2021 13:21:06 - INFO - __main__ - Step 114656: {'lr': 6.712337661408347e-05, 'samples': 22013952, 'steps': 114655, 'loss/train': 1.7251803874969482} 11/07/2021 13:21:07 - INFO - __main__ - Step 114657: {'lr': 6.711975833404385e-05, 'samples': 22014144, 'steps': 114656, 'loss/train': 1.5350788831710815} 11/07/2021 13:21:07 - INFO - __main__ - Step 114658: {'lr': 6.711614013640727e-05, 'samples': 22014336, 'steps': 114657, 'loss/train': 1.5342323780059814} 11/07/2021 13:21:08 - INFO - __main__ - Step 114659: {'lr': 6.711252202117533e-05, 'samples': 22014528, 'steps': 114658, 'loss/train': 1.7466291189193726} 11/07/2021 13:21:08 - INFO - __main__ - Step 114660: {'lr': 6.710890398834968e-05, 'samples': 22014720, 'steps': 114659, 'loss/train': 1.632486343383789} 11/07/2021 13:21:09 - INFO - __main__ - Step 114661: {'lr': 6.71052860379319e-05, 'samples': 22014912, 'steps': 114660, 'loss/train': 1.3376306295394897} 11/07/2021 13:21:09 - INFO - __main__ - Step 114662: {'lr': 6.710166816992377e-05, 'samples': 22015104, 'steps': 114661, 'loss/train': 1.29634428024292} 11/07/2021 13:21:09 - INFO - __main__ - Step 114663: {'lr': 6.70980503843267e-05, 'samples': 22015296, 'steps': 114662, 'loss/train': 1.2961465120315552} 11/07/2021 13:21:10 - INFO - __main__ - Step 114664: {'lr': 6.709443268114243e-05, 'samples': 22015488, 'steps': 114663, 'loss/train': 1.2620785236358643} 11/07/2021 13:21:11 - INFO - __main__ - Step 114665: {'lr': 6.709081506037262e-05, 'samples': 22015680, 'steps': 114664, 'loss/train': 1.1056513786315918} 11/07/2021 13:21:11 - INFO - __main__ - Step 114666: {'lr': 6.708719752201883e-05, 'samples': 22015872, 'steps': 114665, 'loss/train': 1.3788681030273438} 11/07/2021 13:21:11 - INFO - __main__ - Step 114667: {'lr': 6.708358006608273e-05, 'samples': 22016064, 'steps': 114666, 'loss/train': 0.8516267538070679} 11/07/2021 13:21:12 - INFO - __main__ - Step 114668: {'lr': 6.707996269256598e-05, 'samples': 22016256, 'steps': 114667, 'loss/train': 1.0965020656585693} 11/07/2021 13:21:13 - INFO - __main__ - Step 114669: {'lr': 6.707634540147014e-05, 'samples': 22016448, 'steps': 114668, 'loss/train': 1.4898784160614014} 11/07/2021 13:21:13 - INFO - __main__ - Step 114670: {'lr': 6.707272819279688e-05, 'samples': 22016640, 'steps': 114669, 'loss/train': 1.3324463367462158} 11/07/2021 13:21:13 - INFO - __main__ - Step 114671: {'lr': 6.706911106654784e-05, 'samples': 22016832, 'steps': 114670, 'loss/train': 0.1023205891251564} 11/07/2021 13:21:14 - INFO - __main__ - Step 114672: {'lr': 6.706549402272463e-05, 'samples': 22017024, 'steps': 114671, 'loss/train': 0.8669499754905701} 11/07/2021 13:21:14 - INFO - __main__ - Step 114673: {'lr': 6.706187706132888e-05, 'samples': 22017216, 'steps': 114672, 'loss/train': 1.4609944820404053} 11/07/2021 13:21:15 - INFO - __main__ - Step 114674: {'lr': 6.705826018236222e-05, 'samples': 22017408, 'steps': 114673, 'loss/train': 0.7021875977516174} 11/07/2021 13:21:16 - INFO - __main__ - Step 114675: {'lr': 6.705464338582637e-05, 'samples': 22017600, 'steps': 114674, 'loss/train': 1.3422023057937622} 11/07/2021 13:21:16 - INFO - __main__ - Step 114676: {'lr': 6.70510266717228e-05, 'samples': 22017792, 'steps': 114675, 'loss/train': 1.4331873655319214} 11/07/2021 13:21:16 - INFO - __main__ - Step 114677: {'lr': 6.704741004005322e-05, 'samples': 22017984, 'steps': 114676, 'loss/train': 0.9654125571250916} 11/07/2021 13:21:17 - INFO - __main__ - Step 114678: {'lr': 6.704379349081924e-05, 'samples': 22018176, 'steps': 114677, 'loss/train': 0.9227594137191772} 11/07/2021 13:21:17 - INFO - __main__ - Step 114679: {'lr': 6.704017702402251e-05, 'samples': 22018368, 'steps': 114678, 'loss/train': 0.8681671023368835} 11/07/2021 13:21:18 - INFO - __main__ - Step 114680: {'lr': 6.703656063966466e-05, 'samples': 22018560, 'steps': 114679, 'loss/train': 1.0198702812194824} 11/07/2021 13:21:18 - INFO - __main__ - Step 114681: {'lr': 6.703294433774731e-05, 'samples': 22018752, 'steps': 114680, 'loss/train': 1.2437397241592407} 11/07/2021 13:21:19 - INFO - __main__ - Step 114682: {'lr': 6.70293281182721e-05, 'samples': 22018944, 'steps': 114681, 'loss/train': 1.2561566829681396} 11/07/2021 13:21:19 - INFO - __main__ - Step 114683: {'lr': 6.702571198124064e-05, 'samples': 22019136, 'steps': 114682, 'loss/train': 1.658673644065857} 11/07/2021 13:21:19 - INFO - __main__ - Step 114684: {'lr': 6.702209592665457e-05, 'samples': 22019328, 'steps': 114683, 'loss/train': 1.4372892379760742} 11/07/2021 13:21:21 - INFO - __main__ - Step 114685: {'lr': 6.701847995451552e-05, 'samples': 22019520, 'steps': 114684, 'loss/train': 1.2522335052490234} 11/07/2021 13:21:21 - INFO - __main__ - Step 114686: {'lr': 6.70148640648251e-05, 'samples': 22019712, 'steps': 114685, 'loss/train': 1.7766480445861816} 11/07/2021 13:21:21 - INFO - __main__ - Step 114687: {'lr': 6.701124825758498e-05, 'samples': 22019904, 'steps': 114686, 'loss/train': 1.479533076286316} 11/07/2021 13:21:22 - INFO - __main__ - Step 114688: {'lr': 6.700763253279676e-05, 'samples': 22020096, 'steps': 114687, 'loss/train': 1.0584380626678467} 11/07/2021 13:21:22 - INFO - __main__ - Step 114689: {'lr': 6.700401689046217e-05, 'samples': 22020288, 'steps': 114688, 'loss/train': 1.3103477954864502} 11/07/2021 13:21:23 - INFO - __main__ - Step 114690: {'lr': 6.700040133058266e-05, 'samples': 22020480, 'steps': 114689, 'loss/train': 1.2039164304733276} 11/07/2021 13:21:23 - INFO - __main__ - Step 114691: {'lr': 6.699678585315994e-05, 'samples': 22020672, 'steps': 114690, 'loss/train': 1.2706997394561768} 11/07/2021 13:21:24 - INFO - __main__ - Step 114692: {'lr': 6.699317045819564e-05, 'samples': 22020864, 'steps': 114691, 'loss/train': 1.015685796737671} 11/07/2021 13:21:24 - INFO - __main__ - Step 114693: {'lr': 6.698955514569141e-05, 'samples': 22021056, 'steps': 114692, 'loss/train': 1.3606117963790894} 11/07/2021 13:21:24 - INFO - __main__ - Step 114694: {'lr': 6.698593991564886e-05, 'samples': 22021248, 'steps': 114693, 'loss/train': 1.2970292568206787} 11/07/2021 13:21:25 - INFO - __main__ - Step 114695: {'lr': 6.698232476806962e-05, 'samples': 22021440, 'steps': 114694, 'loss/train': 1.300830364227295} 11/07/2021 13:21:26 - INFO - __main__ - Step 114696: {'lr': 6.697870970295531e-05, 'samples': 22021632, 'steps': 114695, 'loss/train': 1.2432773113250732} 11/07/2021 13:21:26 - INFO - __main__ - Step 114697: {'lr': 6.697509472030758e-05, 'samples': 22021824, 'steps': 114696, 'loss/train': 1.4436683654785156} 11/07/2021 13:21:26 - INFO - __main__ - Step 114698: {'lr': 6.697147982012803e-05, 'samples': 22022016, 'steps': 114697, 'loss/train': 1.6213808059692383} 11/07/2021 13:21:27 - INFO - __main__ - Step 114699: {'lr': 6.696786500241834e-05, 'samples': 22022208, 'steps': 114698, 'loss/train': 1.5162935256958008} 11/07/2021 13:21:28 - INFO - __main__ - Step 114700: {'lr': 6.696425026718006e-05, 'samples': 22022400, 'steps': 114699, 'loss/train': 1.3185560703277588} 11/07/2021 13:21:28 - INFO - __main__ - Step 114701: {'lr': 6.69606356144149e-05, 'samples': 22022592, 'steps': 114700, 'loss/train': 1.2734100818634033} 11/07/2021 13:21:29 - INFO - __main__ - Step 114702: {'lr': 6.695702104412452e-05, 'samples': 22022784, 'steps': 114701, 'loss/train': 1.5824896097183228} 11/07/2021 13:21:29 - INFO - __main__ - Step 114703: {'lr': 6.695340655631041e-05, 'samples': 22022976, 'steps': 114702, 'loss/train': 1.5791230201721191} 11/07/2021 13:21:29 - INFO - __main__ - Step 114704: {'lr': 6.694979215097426e-05, 'samples': 22023168, 'steps': 114703, 'loss/train': 1.1075499057769775} 11/07/2021 13:21:30 - INFO - __main__ - Step 114705: {'lr': 6.694617782811772e-05, 'samples': 22023360, 'steps': 114704, 'loss/train': 1.0866575241088867} 11/07/2021 13:21:31 - INFO - __main__ - Step 114706: {'lr': 6.694256358774239e-05, 'samples': 22023552, 'steps': 114705, 'loss/train': 1.0420726537704468} 11/07/2021 13:21:31 - INFO - __main__ - Step 114707: {'lr': 6.693894942984993e-05, 'samples': 22023744, 'steps': 114706, 'loss/train': 1.0369676351547241} 11/07/2021 13:21:31 - INFO - __main__ - Step 114708: {'lr': 6.693533535444197e-05, 'samples': 22023936, 'steps': 114707, 'loss/train': 1.6342779397964478} 11/07/2021 13:21:32 - INFO - __main__ - Step 114709: {'lr': 6.693172136152009e-05, 'samples': 22024128, 'steps': 114708, 'loss/train': 1.5559792518615723} 11/07/2021 13:21:32 - INFO - __main__ - Step 114710: {'lr': 6.692810745108599e-05, 'samples': 22024320, 'steps': 114709, 'loss/train': 1.191683053970337} 11/07/2021 13:21:33 - INFO - __main__ - Step 114711: {'lr': 6.692449362314123e-05, 'samples': 22024512, 'steps': 114710, 'loss/train': 1.665897250175476} 11/07/2021 13:21:34 - INFO - __main__ - Step 114712: {'lr': 6.692087987768746e-05, 'samples': 22024704, 'steps': 114711, 'loss/train': 1.3621634244918823} 11/07/2021 13:21:34 - INFO - __main__ - Step 114713: {'lr': 6.691726621472635e-05, 'samples': 22024896, 'steps': 114712, 'loss/train': 1.531437635421753} 11/07/2021 13:21:34 - INFO - __main__ - Step 114714: {'lr': 6.691365263425948e-05, 'samples': 22025088, 'steps': 114713, 'loss/train': 1.318253755569458} 11/07/2021 13:21:35 - INFO - __main__ - Step 114715: {'lr': 6.691003913628848e-05, 'samples': 22025280, 'steps': 114714, 'loss/train': 1.199650526046753} 11/07/2021 13:21:36 - INFO - __main__ - Step 114716: {'lr': 6.690642572081507e-05, 'samples': 22025472, 'steps': 114715, 'loss/train': 1.610133171081543} 11/07/2021 13:21:36 - INFO - __main__ - Step 114717: {'lr': 6.690281238784075e-05, 'samples': 22025664, 'steps': 114716, 'loss/train': 1.1838165521621704} 11/07/2021 13:21:36 - INFO - __main__ - Step 114718: {'lr': 6.689919913736717e-05, 'samples': 22025856, 'steps': 114717, 'loss/train': 1.1301839351654053} 11/07/2021 13:21:37 - INFO - __main__ - Step 114719: {'lr': 6.689558596939599e-05, 'samples': 22026048, 'steps': 114718, 'loss/train': 1.0649011135101318} 11/07/2021 13:21:37 - INFO - __main__ - Step 114720: {'lr': 6.689197288392885e-05, 'samples': 22026240, 'steps': 114719, 'loss/train': 1.4496957063674927} 11/07/2021 13:21:38 - INFO - __main__ - Step 114721: {'lr': 6.688835988096734e-05, 'samples': 22026432, 'steps': 114720, 'loss/train': 1.207480788230896} 11/07/2021 13:21:39 - INFO - __main__ - Step 114722: {'lr': 6.688474696051312e-05, 'samples': 22026624, 'steps': 114721, 'loss/train': 1.1702244281768799} 11/07/2021 13:21:39 - INFO - __main__ - Step 114723: {'lr': 6.68811341225678e-05, 'samples': 22026816, 'steps': 114722, 'loss/train': 1.161081314086914} 11/07/2021 13:21:39 - INFO - __main__ - Step 114724: {'lr': 6.687752136713301e-05, 'samples': 22027008, 'steps': 114723, 'loss/train': 1.4575061798095703} 11/07/2021 13:21:40 - INFO - __main__ - Step 114725: {'lr': 6.68739086942104e-05, 'samples': 22027200, 'steps': 114724, 'loss/train': 1.1955608129501343} 11/07/2021 13:21:40 - INFO - __main__ - Step 114726: {'lr': 6.687029610380158e-05, 'samples': 22027392, 'steps': 114725, 'loss/train': 1.6012910604476929} 11/07/2021 13:21:41 - INFO - __main__ - Step 114727: {'lr': 6.686668359590825e-05, 'samples': 22027584, 'steps': 114726, 'loss/train': 1.0366421937942505} 11/07/2021 13:21:41 - INFO - __main__ - Step 114728: {'lr': 6.68630711705319e-05, 'samples': 22027776, 'steps': 114727, 'loss/train': 1.0815582275390625} 11/07/2021 13:21:42 - INFO - __main__ - Step 114729: {'lr': 6.68594588276742e-05, 'samples': 22027968, 'steps': 114728, 'loss/train': 1.3976120948791504} 11/07/2021 13:21:42 - INFO - __main__ - Step 114730: {'lr': 6.685584656733682e-05, 'samples': 22028160, 'steps': 114729, 'loss/train': 0.8788458108901978} 11/07/2021 13:21:42 - INFO - __main__ - Step 114731: {'lr': 6.685223438952134e-05, 'samples': 22028352, 'steps': 114730, 'loss/train': 1.6282744407653809} 11/07/2021 13:21:43 - INFO - __main__ - Step 114732: {'lr': 6.684862229422945e-05, 'samples': 22028544, 'steps': 114731, 'loss/train': 1.4365142583847046} 11/07/2021 13:21:44 - INFO - __main__ - Step 114733: {'lr': 6.684501028146272e-05, 'samples': 22028736, 'steps': 114732, 'loss/train': 1.6703778505325317} 11/07/2021 13:21:44 - INFO - __main__ - Step 114734: {'lr': 6.684139835122282e-05, 'samples': 22028928, 'steps': 114733, 'loss/train': 1.221098780632019} 11/07/2021 13:21:45 - INFO - __main__ - Step 114735: {'lr': 6.683778650351138e-05, 'samples': 22029120, 'steps': 114734, 'loss/train': 1.45979642868042} 11/07/2021 13:21:45 - INFO - __main__ - Step 114736: {'lr': 6.683417473832998e-05, 'samples': 22029312, 'steps': 114735, 'loss/train': 1.8075214624404907} 11/07/2021 13:21:46 - INFO - __main__ - Step 114737: {'lr': 6.683056305568036e-05, 'samples': 22029504, 'steps': 114736, 'loss/train': 1.62300705909729} 11/07/2021 13:21:46 - INFO - __main__ - Step 114738: {'lr': 6.682695145556397e-05, 'samples': 22029696, 'steps': 114737, 'loss/train': 1.5791009664535522} 11/07/2021 13:21:47 - INFO - __main__ - Step 114739: {'lr': 6.682333993798254e-05, 'samples': 22029888, 'steps': 114738, 'loss/train': 1.376049280166626} 11/07/2021 13:21:47 - INFO - __main__ - Step 114740: {'lr': 6.681972850293769e-05, 'samples': 22030080, 'steps': 114739, 'loss/train': 1.2040044069290161} 11/07/2021 13:21:47 - INFO - __main__ - Step 114741: {'lr': 6.681611715043104e-05, 'samples': 22030272, 'steps': 114740, 'loss/train': 1.4539380073547363} 11/07/2021 13:21:48 - INFO - __main__ - Step 114742: {'lr': 6.681250588046422e-05, 'samples': 22030464, 'steps': 114741, 'loss/train': 1.2555097341537476} 11/07/2021 13:21:49 - INFO - __main__ - Step 114743: {'lr': 6.680889469303885e-05, 'samples': 22030656, 'steps': 114742, 'loss/train': 1.390221357345581} 11/07/2021 13:21:49 - INFO - __main__ - Step 114744: {'lr': 6.68052835881566e-05, 'samples': 22030848, 'steps': 114743, 'loss/train': 1.5637791156768799} 11/07/2021 13:21:49 - INFO - __main__ - Step 114745: {'lr': 6.680167256581904e-05, 'samples': 22031040, 'steps': 114744, 'loss/train': 1.6687785387039185} 11/07/2021 13:21:50 - INFO - __main__ - Step 114746: {'lr': 6.67980616260278e-05, 'samples': 22031232, 'steps': 114745, 'loss/train': 1.3971413373947144} 11/07/2021 13:21:50 - INFO - __main__ - Step 114747: {'lr': 6.679445076878455e-05, 'samples': 22031424, 'steps': 114746, 'loss/train': 1.1675119400024414} 11/07/2021 13:21:51 - INFO - __main__ - Step 114748: {'lr': 6.679083999409097e-05, 'samples': 22031616, 'steps': 114747, 'loss/train': 1.3905311822891235} 11/07/2021 13:21:51 - INFO - __main__ - Step 114749: {'lr': 6.678722930194853e-05, 'samples': 22031808, 'steps': 114748, 'loss/train': 1.5082402229309082} 11/07/2021 13:21:52 - INFO - __main__ - Step 114750: {'lr': 6.678361869235891e-05, 'samples': 22032000, 'steps': 114749, 'loss/train': 1.6554416418075562} 11/07/2021 13:21:52 - INFO - __main__ - Step 114751: {'lr': 6.678000816532381e-05, 'samples': 22032192, 'steps': 114750, 'loss/train': 1.3256231546401978} 11/07/2021 13:21:53 - INFO - __main__ - Step 114752: {'lr': 6.67763977208448e-05, 'samples': 22032384, 'steps': 114751, 'loss/train': 1.2012393474578857} 11/07/2021 13:21:54 - INFO - __main__ - Step 114753: {'lr': 6.677278735892348e-05, 'samples': 22032576, 'steps': 114752, 'loss/train': 0.8633297085762024} 11/07/2021 13:21:54 - INFO - __main__ - Step 114754: {'lr': 6.676917707956154e-05, 'samples': 22032768, 'steps': 114753, 'loss/train': 1.9256798028945923} 11/07/2021 13:21:54 - INFO - __main__ - Step 114755: {'lr': 6.676556688276058e-05, 'samples': 22032960, 'steps': 114754, 'loss/train': 1.4437305927276611} 11/07/2021 13:21:55 - INFO - __main__ - Step 114756: {'lr': 6.676195676852223e-05, 'samples': 22033152, 'steps': 114755, 'loss/train': 1.3595736026763916} 11/07/2021 13:21:55 - INFO - __main__ - Step 114757: {'lr': 6.675834673684814e-05, 'samples': 22033344, 'steps': 114756, 'loss/train': 1.1930781602859497} 11/07/2021 13:21:56 - INFO - __main__ - Step 114758: {'lr': 6.675473678773989e-05, 'samples': 22033536, 'steps': 114757, 'loss/train': 1.8304752111434937} 11/07/2021 13:21:56 - INFO - __main__ - Step 114759: {'lr': 6.675112692119919e-05, 'samples': 22033728, 'steps': 114758, 'loss/train': 1.4239023923873901} 11/07/2021 13:21:57 - INFO - __main__ - Step 114760: {'lr': 6.674751713722755e-05, 'samples': 22033920, 'steps': 114759, 'loss/train': 1.7297899723052979} 11/07/2021 13:21:57 - INFO - __main__ - Step 114761: {'lr': 6.674390743582662e-05, 'samples': 22034112, 'steps': 114760, 'loss/train': 1.1338752508163452} 11/07/2021 13:21:57 - INFO - __main__ - Step 114762: {'lr': 6.674029781699809e-05, 'samples': 22034304, 'steps': 114761, 'loss/train': 1.3207234144210815} 11/07/2021 13:21:59 - INFO - __main__ - Step 114763: {'lr': 6.673668828074354e-05, 'samples': 22034496, 'steps': 114762, 'loss/train': 1.3813087940216064} 11/07/2021 13:21:59 - INFO - __main__ - Step 114764: {'lr': 6.673307882706461e-05, 'samples': 22034688, 'steps': 114763, 'loss/train': 1.0893253087997437} 11/07/2021 13:21:59 - INFO - __main__ - Step 114765: {'lr': 6.672946945596292e-05, 'samples': 22034880, 'steps': 114764, 'loss/train': 1.2002487182617188} 11/07/2021 13:22:00 - INFO - __main__ - Step 114766: {'lr': 6.672586016744012e-05, 'samples': 22035072, 'steps': 114765, 'loss/train': 1.6133534908294678} 11/07/2021 13:22:00 - INFO - __main__ - Step 114767: {'lr': 6.672225096149781e-05, 'samples': 22035264, 'steps': 114766, 'loss/train': 0.7368627786636353} 11/07/2021 13:22:01 - INFO - __main__ - Step 114768: {'lr': 6.671864183813765e-05, 'samples': 22035456, 'steps': 114767, 'loss/train': 1.1487127542495728} 11/07/2021 13:22:01 - INFO - __main__ - Step 114769: {'lr': 6.671503279736121e-05, 'samples': 22035648, 'steps': 114768, 'loss/train': 1.443128228187561} 11/07/2021 13:22:02 - INFO - __main__ - Step 114770: {'lr': 6.671142383917023e-05, 'samples': 22035840, 'steps': 114769, 'loss/train': 1.46552312374115} 11/07/2021 13:22:02 - INFO - __main__ - Step 114771: {'lr': 6.670781496356618e-05, 'samples': 22036032, 'steps': 114770, 'loss/train': 1.4918252229690552} 11/07/2021 13:22:02 - INFO - __main__ - Step 114772: {'lr': 6.670420617055076e-05, 'samples': 22036224, 'steps': 114771, 'loss/train': 1.1976782083511353} 11/07/2021 13:22:03 - INFO - __main__ - Step 114773: {'lr': 6.670059746012561e-05, 'samples': 22036416, 'steps': 114772, 'loss/train': 1.563338279724121} 11/07/2021 13:22:04 - INFO - __main__ - Step 114774: {'lr': 6.669698883229233e-05, 'samples': 22036608, 'steps': 114773, 'loss/train': 1.4221161603927612} 11/07/2021 13:22:04 - INFO - __main__ - Step 114775: {'lr': 6.669338028705255e-05, 'samples': 22036800, 'steps': 114774, 'loss/train': 1.3559117317199707} 11/07/2021 13:22:05 - INFO - __main__ - Step 114776: {'lr': 6.668977182440792e-05, 'samples': 22036992, 'steps': 114775, 'loss/train': 1.3211315870285034} 11/07/2021 13:22:05 - INFO - __main__ - Step 114777: {'lr': 6.668616344436005e-05, 'samples': 22037184, 'steps': 114776, 'loss/train': 1.234968900680542} 11/07/2021 13:22:05 - INFO - __main__ - Step 114778: {'lr': 6.668255514691055e-05, 'samples': 22037376, 'steps': 114777, 'loss/train': 1.1890212297439575} 11/07/2021 13:22:07 - INFO - __main__ - Step 114779: {'lr': 6.667894693206106e-05, 'samples': 22037568, 'steps': 114778, 'loss/train': 1.5693068504333496} 11/07/2021 13:22:07 - INFO - __main__ - Step 114780: {'lr': 6.667533879981322e-05, 'samples': 22037760, 'steps': 114779, 'loss/train': 1.2216466665267944} 11/07/2021 13:22:07 - INFO - __main__ - Step 114781: {'lr': 6.667173075016864e-05, 'samples': 22037952, 'steps': 114780, 'loss/train': 1.3947453498840332} 11/07/2021 13:22:08 - INFO - __main__ - Step 114782: {'lr': 6.666812278312895e-05, 'samples': 22038144, 'steps': 114781, 'loss/train': 1.5785547494888306} 11/07/2021 13:22:08 - INFO - __main__ - Step 114783: {'lr': 6.666451489869585e-05, 'samples': 22038336, 'steps': 114782, 'loss/train': 1.0076255798339844} 11/07/2021 13:22:08 - INFO - __main__ - Step 114784: {'lr': 6.66609070968708e-05, 'samples': 22038528, 'steps': 114783, 'loss/train': 1.6632964611053467} 11/07/2021 13:22:10 - INFO - __main__ - Step 114785: {'lr': 6.665729937765556e-05, 'samples': 22038720, 'steps': 114784, 'loss/train': 1.7661070823669434} 11/07/2021 13:22:10 - INFO - __main__ - Step 114786: {'lr': 6.665369174105169e-05, 'samples': 22038912, 'steps': 114785, 'loss/train': 1.3752776384353638} 11/07/2021 13:22:10 - INFO - __main__ - Step 114787: {'lr': 6.665008418706081e-05, 'samples': 22039104, 'steps': 114786, 'loss/train': 1.1222505569458008} 11/07/2021 13:22:11 - INFO - __main__ - Step 114788: {'lr': 6.664647671568461e-05, 'samples': 22039296, 'steps': 114787, 'loss/train': 1.0150684118270874} 11/07/2021 13:22:11 - INFO - __main__ - Step 114789: {'lr': 6.664286932692464e-05, 'samples': 22039488, 'steps': 114788, 'loss/train': 1.1945457458496094} 11/07/2021 13:22:12 - INFO - __main__ - Step 114790: {'lr': 6.66392620207826e-05, 'samples': 22039680, 'steps': 114789, 'loss/train': 1.4641642570495605} 11/07/2021 13:22:13 - INFO - __main__ - Step 114791: {'lr': 6.663565479726008e-05, 'samples': 22039872, 'steps': 114790, 'loss/train': 1.7454560995101929} 11/07/2021 13:22:13 - INFO - __main__ - Step 114792: {'lr': 6.663204765635869e-05, 'samples': 22040064, 'steps': 114791, 'loss/train': 1.4735448360443115} 11/07/2021 13:22:13 - INFO - __main__ - Step 114793: {'lr': 6.662844059808007e-05, 'samples': 22040256, 'steps': 114792, 'loss/train': 0.7678276896476746} 11/07/2021 13:22:14 - INFO - __main__ - Step 114794: {'lr': 6.662483362242583e-05, 'samples': 22040448, 'steps': 114793, 'loss/train': 1.3340595960617065} 11/07/2021 13:22:15 - INFO - __main__ - Step 114795: {'lr': 6.662122672939764e-05, 'samples': 22040640, 'steps': 114794, 'loss/train': 1.31819486618042} 11/07/2021 13:22:15 - INFO - __main__ - Step 114796: {'lr': 6.661761991899715e-05, 'samples': 22040832, 'steps': 114795, 'loss/train': 1.2661967277526855} 11/07/2021 13:22:15 - INFO - __main__ - Step 114797: {'lr': 6.661401319122587e-05, 'samples': 22041024, 'steps': 114796, 'loss/train': 1.5157724618911743} 11/07/2021 13:22:16 - INFO - __main__ - Step 114798: {'lr': 6.661040654608547e-05, 'samples': 22041216, 'steps': 114797, 'loss/train': 1.2782162427902222} 11/07/2021 13:22:16 - INFO - __main__ - Step 114799: {'lr': 6.660679998357761e-05, 'samples': 22041408, 'steps': 114798, 'loss/train': 0.9694265723228455} 11/07/2021 13:22:18 - INFO - __main__ - Step 114800: {'lr': 6.660319350370386e-05, 'samples': 22041600, 'steps': 114799, 'loss/train': 1.193953275680542} 11/07/2021 13:22:18 - INFO - __main__ - Step 114801: {'lr': 6.65995871064659e-05, 'samples': 22041792, 'steps': 114800, 'loss/train': 0.8617885112762451} 11/07/2021 13:22:19 - INFO - __main__ - Step 114802: {'lr': 6.659598079186535e-05, 'samples': 22041984, 'steps': 114801, 'loss/train': 1.7707642316818237} 11/07/2021 13:22:19 - INFO - __main__ - Step 114803: {'lr': 6.659237455990383e-05, 'samples': 22042176, 'steps': 114802, 'loss/train': 1.7623850107192993} 11/07/2021 13:22:19 - INFO - __main__ - Step 114804: {'lr': 6.658876841058292e-05, 'samples': 22042368, 'steps': 114803, 'loss/train': 0.5717687010765076} 11/07/2021 13:22:20 - INFO - __main__ - Step 114805: {'lr': 6.65851623439043e-05, 'samples': 22042560, 'steps': 114804, 'loss/train': 0.8410599231719971} 11/07/2021 13:22:20 - INFO - __main__ - Step 114806: {'lr': 6.65815563598696e-05, 'samples': 22042752, 'steps': 114805, 'loss/train': 1.3925211429595947} 11/07/2021 13:22:21 - INFO - __main__ - Step 114807: {'lr': 6.657795045848039e-05, 'samples': 22042944, 'steps': 114806, 'loss/train': 1.5026382207870483} 11/07/2021 13:22:22 - INFO - __main__ - Step 114808: {'lr': 6.657434463973833e-05, 'samples': 22043136, 'steps': 114807, 'loss/train': 1.417210340499878} 11/07/2021 13:22:22 - INFO - __main__ - Step 114809: {'lr': 6.657073890364504e-05, 'samples': 22043328, 'steps': 114808, 'loss/train': 1.0671446323394775} 11/07/2021 13:22:22 - INFO - __main__ - Step 114810: {'lr': 6.656713325020219e-05, 'samples': 22043520, 'steps': 114809, 'loss/train': 1.3308769464492798} 11/07/2021 13:22:23 - INFO - __main__ - Step 114811: {'lr': 6.656352767941132e-05, 'samples': 22043712, 'steps': 114810, 'loss/train': 1.198529601097107} 11/07/2021 13:22:24 - INFO - __main__ - Step 114812: {'lr': 6.65599221912741e-05, 'samples': 22043904, 'steps': 114811, 'loss/train': 1.4095059633255005} 11/07/2021 13:22:24 - INFO - __main__ - Step 114813: {'lr': 6.655631678579213e-05, 'samples': 22044096, 'steps': 114812, 'loss/train': 1.247361183166504} 11/07/2021 13:22:24 - INFO - __main__ - Step 114814: {'lr': 6.655271146296707e-05, 'samples': 22044288, 'steps': 114813, 'loss/train': 1.8852871656417847} 11/07/2021 13:22:25 - INFO - __main__ - Step 114815: {'lr': 6.65491062228005e-05, 'samples': 22044480, 'steps': 114814, 'loss/train': 1.5073001384735107} 11/07/2021 13:22:25 - INFO - __main__ - Step 114816: {'lr': 6.654550106529411e-05, 'samples': 22044672, 'steps': 114815, 'loss/train': 0.9286700487136841} 11/07/2021 13:22:26 - INFO - __main__ - Step 114817: {'lr': 6.654189599044946e-05, 'samples': 22044864, 'steps': 114816, 'loss/train': 1.0897164344787598} 11/07/2021 13:22:26 - INFO - __main__ - Step 114818: {'lr': 6.653829099826819e-05, 'samples': 22045056, 'steps': 114817, 'loss/train': 1.083791971206665} 11/07/2021 13:22:27 - INFO - __main__ - Step 114819: {'lr': 6.653468608875196e-05, 'samples': 22045248, 'steps': 114818, 'loss/train': 1.3279541730880737} 11/07/2021 13:22:27 - INFO - __main__ - Step 114820: {'lr': 6.653108126190235e-05, 'samples': 22045440, 'steps': 114819, 'loss/train': 1.2473535537719727} 11/07/2021 13:22:27 - INFO - __main__ - Step 114821: {'lr': 6.652747651772104e-05, 'samples': 22045632, 'steps': 114820, 'loss/train': 1.4698636531829834} 11/07/2021 13:22:28 - INFO - __main__ - Step 114822: {'lr': 6.652387185620956e-05, 'samples': 22045824, 'steps': 114821, 'loss/train': 1.5833317041397095} 11/07/2021 13:22:29 - INFO - __main__ - Step 114823: {'lr': 6.65202672773697e-05, 'samples': 22046016, 'steps': 114822, 'loss/train': 0.9870120882987976} 11/07/2021 13:22:29 - INFO - __main__ - Step 114824: {'lr': 6.651666278120291e-05, 'samples': 22046208, 'steps': 114823, 'loss/train': 1.50173020362854} 11/07/2021 13:22:30 - INFO - __main__ - Step 114825: {'lr': 6.651305836771087e-05, 'samples': 22046400, 'steps': 114824, 'loss/train': 1.7851269245147705} 11/07/2021 13:22:30 - INFO - __main__ - Step 114826: {'lr': 6.650945403689521e-05, 'samples': 22046592, 'steps': 114825, 'loss/train': 1.6306172609329224} 11/07/2021 13:22:30 - INFO - __main__ - Step 114827: {'lr': 6.650584978875757e-05, 'samples': 22046784, 'steps': 114826, 'loss/train': 1.0681270360946655} 11/07/2021 13:22:31 - INFO - __main__ - Step 114828: {'lr': 6.650224562329957e-05, 'samples': 22046976, 'steps': 114827, 'loss/train': 1.0722379684448242} 11/07/2021 13:22:32 - INFO - __main__ - Step 114829: {'lr': 6.649864154052279e-05, 'samples': 22047168, 'steps': 114828, 'loss/train': 1.2742522954940796} 11/07/2021 13:22:32 - INFO - __main__ - Step 114830: {'lr': 6.649503754042893e-05, 'samples': 22047360, 'steps': 114829, 'loss/train': 0.9898256659507751} 11/07/2021 13:22:32 - INFO - __main__ - Step 114831: {'lr': 6.649143362301954e-05, 'samples': 22047552, 'steps': 114830, 'loss/train': 1.4366790056228638} 11/07/2021 13:22:33 - INFO - __main__ - Step 114832: {'lr': 6.648782978829632e-05, 'samples': 22047744, 'steps': 114831, 'loss/train': 1.3117576837539673} 11/07/2021 13:22:34 - INFO - __main__ - Step 114833: {'lr': 6.64842260362608e-05, 'samples': 22047936, 'steps': 114832, 'loss/train': 1.578776240348816} 11/07/2021 13:22:34 - INFO - __main__ - Step 114834: {'lr': 6.64806223669147e-05, 'samples': 22048128, 'steps': 114833, 'loss/train': 1.553816556930542} 11/07/2021 13:22:35 - INFO - __main__ - Step 114835: {'lr': 6.647701878025958e-05, 'samples': 22048320, 'steps': 114834, 'loss/train': 1.7312567234039307} 11/07/2021 13:22:35 - INFO - __main__ - Step 114836: {'lr': 6.647341527629707e-05, 'samples': 22048512, 'steps': 114835, 'loss/train': 1.3067734241485596} 11/07/2021 13:22:35 - INFO - __main__ - Step 114837: {'lr': 6.646981185502893e-05, 'samples': 22048704, 'steps': 114836, 'loss/train': 1.1822330951690674} 11/07/2021 13:22:36 - INFO - __main__ - Step 114838: {'lr': 6.646620851645654e-05, 'samples': 22048896, 'steps': 114837, 'loss/train': 1.1108145713806152} 11/07/2021 13:22:37 - INFO - __main__ - Step 114839: {'lr': 6.646260526058167e-05, 'samples': 22049088, 'steps': 114838, 'loss/train': 1.7093067169189453} 11/07/2021 13:22:37 - INFO - __main__ - Step 114840: {'lr': 6.645900208740591e-05, 'samples': 22049280, 'steps': 114839, 'loss/train': 1.411329984664917} 11/07/2021 13:22:37 - INFO - __main__ - Step 114841: {'lr': 6.645539899693087e-05, 'samples': 22049472, 'steps': 114840, 'loss/train': 1.5281621217727661} 11/07/2021 13:22:38 - INFO - __main__ - Step 114842: {'lr': 6.64517959891582e-05, 'samples': 22049664, 'steps': 114841, 'loss/train': 0.9605057835578918} 11/07/2021 13:22:38 - INFO - __main__ - Step 114843: {'lr': 6.644819306408956e-05, 'samples': 22049856, 'steps': 114842, 'loss/train': 1.1774041652679443} 11/07/2021 13:22:40 - INFO - __main__ - Step 114844: {'lr': 6.64445902217265e-05, 'samples': 22050048, 'steps': 114843, 'loss/train': 1.1560490131378174} 11/07/2021 13:22:40 - INFO - __main__ - Step 114845: {'lr': 6.644098746207067e-05, 'samples': 22050240, 'steps': 114844, 'loss/train': 1.303429126739502} 11/07/2021 13:22:40 - INFO - __main__ - Step 114846: {'lr': 6.64373847851237e-05, 'samples': 22050432, 'steps': 114845, 'loss/train': 0.8204655647277832} 11/07/2021 13:22:41 - INFO - __main__ - Step 114847: {'lr': 6.643378219088722e-05, 'samples': 22050624, 'steps': 114846, 'loss/train': 0.4776018261909485} 11/07/2021 13:22:41 - INFO - __main__ - Step 114848: {'lr': 6.643017967936285e-05, 'samples': 22050816, 'steps': 114847, 'loss/train': 1.6526738405227661} 11/07/2021 13:22:42 - INFO - __main__ - Step 114849: {'lr': 6.642657725055221e-05, 'samples': 22051008, 'steps': 114848, 'loss/train': 1.2660949230194092} 11/07/2021 13:22:42 - INFO - __main__ - Step 114850: {'lr': 6.642297490445698e-05, 'samples': 22051200, 'steps': 114849, 'loss/train': 3.1790683269500732} 11/07/2021 13:22:43 - INFO - __main__ - Step 114851: {'lr': 6.641937264107867e-05, 'samples': 22051392, 'steps': 114850, 'loss/train': 1.3242837190628052} 11/07/2021 13:22:43 - INFO - __main__ - Step 114852: {'lr': 6.641577046041894e-05, 'samples': 22051584, 'steps': 114851, 'loss/train': 0.87513667345047} 11/07/2021 13:22:43 - INFO - __main__ - Step 114853: {'lr': 6.641216836247946e-05, 'samples': 22051776, 'steps': 114852, 'loss/train': 1.4195349216461182} 11/07/2021 13:22:44 - INFO - __main__ - Step 114854: {'lr': 6.640856634726178e-05, 'samples': 22051968, 'steps': 114853, 'loss/train': 1.1299833059310913} 11/07/2021 13:22:45 - INFO - __main__ - Step 114855: {'lr': 6.640496441476759e-05, 'samples': 22052160, 'steps': 114854, 'loss/train': 1.0184639692306519} 11/07/2021 13:22:45 - INFO - __main__ - Step 114856: {'lr': 6.640136256499848e-05, 'samples': 22052352, 'steps': 114855, 'loss/train': 1.51383638381958} 11/07/2021 13:22:45 - INFO - __main__ - Step 114857: {'lr': 6.639776079795612e-05, 'samples': 22052544, 'steps': 114856, 'loss/train': 1.1917762756347656} 11/07/2021 13:22:46 - INFO - __main__ - Step 114858: {'lr': 6.639415911364205e-05, 'samples': 22052736, 'steps': 114857, 'loss/train': 1.5640921592712402} 11/07/2021 13:22:47 - INFO - __main__ - Step 114859: {'lr': 6.639055751205797e-05, 'samples': 22052928, 'steps': 114858, 'loss/train': 1.051909327507019} 11/07/2021 13:22:48 - INFO - __main__ - Step 114860: {'lr': 6.638695599320547e-05, 'samples': 22053120, 'steps': 114859, 'loss/train': 1.0312107801437378} 11/07/2021 13:22:48 - INFO - __main__ - Step 114861: {'lr': 6.638335455708613e-05, 'samples': 22053312, 'steps': 114860, 'loss/train': 1.865790605545044} 11/07/2021 13:22:48 - INFO - __main__ - Step 114862: {'lr': 6.637975320370165e-05, 'samples': 22053504, 'steps': 114861, 'loss/train': 1.6520503759384155} 11/07/2021 13:22:49 - INFO - __main__ - Step 114863: {'lr': 6.637615193305362e-05, 'samples': 22053696, 'steps': 114862, 'loss/train': 1.5826336145401} 11/07/2021 13:22:49 - INFO - __main__ - Step 114864: {'lr': 6.637255074514375e-05, 'samples': 22053888, 'steps': 114863, 'loss/train': 1.443145990371704} 11/07/2021 13:22:50 - INFO - __main__ - Step 114865: {'lr': 6.636894963997348e-05, 'samples': 22054080, 'steps': 114864, 'loss/train': 1.9893535375595093} 11/07/2021 13:22:51 - INFO - __main__ - Step 114866: {'lr': 6.636534861754453e-05, 'samples': 22054272, 'steps': 114865, 'loss/train': 1.233039379119873} 11/07/2021 13:22:51 - INFO - __main__ - Step 114867: {'lr': 6.636174767785855e-05, 'samples': 22054464, 'steps': 114866, 'loss/train': 1.475785255432129} 11/07/2021 13:22:51 - INFO - __main__ - Step 114868: {'lr': 6.63581468209171e-05, 'samples': 22054656, 'steps': 114867, 'loss/train': 1.5175714492797852} 11/07/2021 13:22:52 - INFO - __main__ - Step 114869: {'lr': 6.635454604672183e-05, 'samples': 22054848, 'steps': 114868, 'loss/train': 1.1895418167114258} 11/07/2021 13:22:53 - INFO - __main__ - Step 114870: {'lr': 6.63509453552744e-05, 'samples': 22055040, 'steps': 114869, 'loss/train': 1.4155182838439941} 11/07/2021 13:22:53 - INFO - __main__ - Step 114871: {'lr': 6.634734474657636e-05, 'samples': 22055232, 'steps': 114870, 'loss/train': 0.8586273789405823} 11/07/2021 13:22:53 - INFO - __main__ - Step 114872: {'lr': 6.634374422062939e-05, 'samples': 22055424, 'steps': 114871, 'loss/train': 1.231122374534607} 11/07/2021 13:22:54 - INFO - __main__ - Step 114873: {'lr': 6.634014377743511e-05, 'samples': 22055616, 'steps': 114872, 'loss/train': 1.4869457483291626} 11/07/2021 13:22:54 - INFO - __main__ - Step 114874: {'lr': 6.63365434169951e-05, 'samples': 22055808, 'steps': 114873, 'loss/train': 1.3176106214523315} 11/07/2021 13:22:55 - INFO - __main__ - Step 114875: {'lr': 6.633294313931104e-05, 'samples': 22056000, 'steps': 114874, 'loss/train': 0.9134776592254639} 11/07/2021 13:22:55 - INFO - __main__ - Step 114876: {'lr': 6.63293429443845e-05, 'samples': 22056192, 'steps': 114875, 'loss/train': 1.388453483581543} 11/07/2021 13:22:56 - INFO - __main__ - Step 114877: {'lr': 6.63257428322172e-05, 'samples': 22056384, 'steps': 114876, 'loss/train': 1.790836215019226} 11/07/2021 13:22:56 - INFO - __main__ - Step 114878: {'lr': 6.632214280281063e-05, 'samples': 22056576, 'steps': 114877, 'loss/train': 1.210416555404663} 11/07/2021 13:22:56 - INFO - __main__ - Step 114879: {'lr': 6.631854285616646e-05, 'samples': 22056768, 'steps': 114878, 'loss/train': 1.4874072074890137} 11/07/2021 13:22:57 - INFO - __main__ - Step 114880: {'lr': 6.63149429922863e-05, 'samples': 22056960, 'steps': 114879, 'loss/train': 1.4346206188201904} 11/07/2021 13:22:58 - INFO - __main__ - Step 114881: {'lr': 6.63113432111718e-05, 'samples': 22057152, 'steps': 114880, 'loss/train': 1.456918478012085} 11/07/2021 13:22:58 - INFO - __main__ - Step 114882: {'lr': 6.630774351282459e-05, 'samples': 22057344, 'steps': 114881, 'loss/train': 1.3533231019973755} 11/07/2021 13:22:58 - INFO - __main__ - Step 114883: {'lr': 6.630414389724626e-05, 'samples': 22057536, 'steps': 114882, 'loss/train': 1.3843586444854736} 11/07/2021 13:22:59 - INFO - __main__ - Step 114884: {'lr': 6.630054436443847e-05, 'samples': 22057728, 'steps': 114883, 'loss/train': 1.3341939449310303} 11/07/2021 13:22:59 - INFO - __main__ - Step 114885: {'lr': 6.62969449144028e-05, 'samples': 22057920, 'steps': 114884, 'loss/train': 1.066430687904358} 11/07/2021 13:23:00 - INFO - __main__ - Step 114886: {'lr': 6.629334554714089e-05, 'samples': 22058112, 'steps': 114885, 'loss/train': 1.263878583908081} 11/07/2021 13:23:01 - INFO - __main__ - Step 114887: {'lr': 6.628974626265439e-05, 'samples': 22058304, 'steps': 114886, 'loss/train': 1.064562439918518} 11/07/2021 13:23:01 - INFO - __main__ - Step 114888: {'lr': 6.628614706094488e-05, 'samples': 22058496, 'steps': 114887, 'loss/train': 1.1197209358215332} 11/07/2021 13:23:01 - INFO - __main__ - Step 114889: {'lr': 6.628254794201399e-05, 'samples': 22058688, 'steps': 114888, 'loss/train': 1.026862382888794} 11/07/2021 13:23:02 - INFO - __main__ - Step 114890: {'lr': 6.627894890586342e-05, 'samples': 22058880, 'steps': 114889, 'loss/train': 0.9731253385543823} 11/07/2021 13:23:03 - INFO - __main__ - Step 114891: {'lr': 6.627534995249465e-05, 'samples': 22059072, 'steps': 114890, 'loss/train': 1.39302659034729} 11/07/2021 13:23:03 - INFO - __main__ - Step 114892: {'lr': 6.627175108190938e-05, 'samples': 22059264, 'steps': 114891, 'loss/train': 1.72722327709198} 11/07/2021 13:23:03 - INFO - __main__ - Step 114893: {'lr': 6.62681522941092e-05, 'samples': 22059456, 'steps': 114892, 'loss/train': 1.3436450958251953} 11/07/2021 13:23:04 - INFO - __main__ - Step 114894: {'lr': 6.62645535890958e-05, 'samples': 22059648, 'steps': 114893, 'loss/train': 1.1068198680877686} 11/07/2021 13:23:04 - INFO - __main__ - Step 114895: {'lr': 6.626095496687074e-05, 'samples': 22059840, 'steps': 114894, 'loss/train': 1.6681709289550781} 11/07/2021 13:23:05 - INFO - __main__ - Step 114896: {'lr': 6.625735642743563e-05, 'samples': 22060032, 'steps': 114895, 'loss/train': 1.1421105861663818} 11/07/2021 13:23:06 - INFO - __main__ - Step 114897: {'lr': 6.625375797079213e-05, 'samples': 22060224, 'steps': 114896, 'loss/train': 0.831339418888092} 11/07/2021 13:23:06 - INFO - __main__ - Step 114898: {'lr': 6.625015959694189e-05, 'samples': 22060416, 'steps': 114897, 'loss/train': 1.9636424779891968} 11/07/2021 13:23:06 - INFO - __main__ - Step 114899: {'lr': 6.624656130588644e-05, 'samples': 22060608, 'steps': 114898, 'loss/train': 0.928109347820282} 11/07/2021 13:23:07 - INFO - __main__ - Step 114900: {'lr': 6.624296309762748e-05, 'samples': 22060800, 'steps': 114899, 'loss/train': 1.3381025791168213} 11/07/2021 13:23:08 - INFO - __main__ - Step 114901: {'lr': 6.623936497216663e-05, 'samples': 22060992, 'steps': 114900, 'loss/train': 1.477587103843689} 11/07/2021 13:23:08 - INFO - __main__ - Step 114902: {'lr': 6.623576692950545e-05, 'samples': 22061184, 'steps': 114901, 'loss/train': 1.4751514196395874} 11/07/2021 13:23:08 - INFO - __main__ - Step 114903: {'lr': 6.623216896964559e-05, 'samples': 22061376, 'steps': 114902, 'loss/train': 1.6539461612701416} 11/07/2021 13:23:09 - INFO - __main__ - Step 114904: {'lr': 6.622857109258879e-05, 'samples': 22061568, 'steps': 114903, 'loss/train': 1.7093538045883179} 11/07/2021 13:23:09 - INFO - __main__ - Step 114905: {'lr': 6.622497329833647e-05, 'samples': 22061760, 'steps': 114904, 'loss/train': 1.5171358585357666} 11/07/2021 13:23:10 - INFO - __main__ - Step 114906: {'lr': 6.622137558689031e-05, 'samples': 22061952, 'steps': 114905, 'loss/train': 0.39936596155166626} 11/07/2021 13:23:10 - INFO - __main__ - Step 114907: {'lr': 6.621777795825201e-05, 'samples': 22062144, 'steps': 114906, 'loss/train': 1.3701832294464111} 11/07/2021 13:23:11 - INFO - __main__ - Step 114908: {'lr': 6.62141804124231e-05, 'samples': 22062336, 'steps': 114907, 'loss/train': 1.5517598390579224} 11/07/2021 13:23:11 - INFO - __main__ - Step 114909: {'lr': 6.621058294940529e-05, 'samples': 22062528, 'steps': 114908, 'loss/train': 1.6514108180999756} 11/07/2021 13:23:12 - INFO - __main__ - Step 114910: {'lr': 6.620698556920013e-05, 'samples': 22062720, 'steps': 114909, 'loss/train': 1.4564099311828613} 11/07/2021 13:23:12 - INFO - __main__ - Step 114911: {'lr': 6.620338827180928e-05, 'samples': 22062912, 'steps': 114910, 'loss/train': 1.8818858861923218} 11/07/2021 13:23:13 - INFO - __main__ - Step 114912: {'lr': 6.619979105723433e-05, 'samples': 22063104, 'steps': 114911, 'loss/train': 1.4793438911437988} 11/07/2021 13:23:13 - INFO - __main__ - Step 114913: {'lr': 6.619619392547693e-05, 'samples': 22063296, 'steps': 114912, 'loss/train': 1.0224428176879883} 11/07/2021 13:23:14 - INFO - __main__ - Step 114914: {'lr': 6.619259687653867e-05, 'samples': 22063488, 'steps': 114913, 'loss/train': 0.571816623210907} 11/07/2021 13:23:14 - INFO - __main__ - Step 114915: {'lr': 6.618899991042121e-05, 'samples': 22063680, 'steps': 114914, 'loss/train': 0.6343730688095093} 11/07/2021 13:23:14 - INFO - __main__ - Step 114916: {'lr': 6.618540302712614e-05, 'samples': 22063872, 'steps': 114915, 'loss/train': 1.0675814151763916} 11/07/2021 13:23:15 - INFO - __main__ - Step 114917: {'lr': 6.618180622665517e-05, 'samples': 22064064, 'steps': 114916, 'loss/train': 1.5799099206924438} 11/07/2021 13:23:16 - INFO - __main__ - Step 114918: {'lr': 6.617820950900977e-05, 'samples': 22064256, 'steps': 114917, 'loss/train': 1.478054404258728} 11/07/2021 13:23:16 - INFO - __main__ - Step 114919: {'lr': 6.617461287419163e-05, 'samples': 22064448, 'steps': 114918, 'loss/train': 1.3475629091262817} 11/07/2021 13:23:16 - INFO - __main__ - Step 114920: {'lr': 6.617101632220238e-05, 'samples': 22064640, 'steps': 114919, 'loss/train': 0.9940940141677856} 11/07/2021 13:23:17 - INFO - __main__ - Step 114921: {'lr': 6.616741985304361e-05, 'samples': 22064832, 'steps': 114920, 'loss/train': 1.3531864881515503} 11/07/2021 13:23:18 - INFO - __main__ - Step 114922: {'lr': 6.616382346671698e-05, 'samples': 22065024, 'steps': 114921, 'loss/train': 0.9858458042144775} 11/07/2021 13:23:18 - INFO - __main__ - Step 114923: {'lr': 6.61602271632241e-05, 'samples': 22065216, 'steps': 114922, 'loss/train': 1.3603808879852295} 11/07/2021 13:23:18 - INFO - __main__ - Step 114924: {'lr': 6.615663094256658e-05, 'samples': 22065408, 'steps': 114923, 'loss/train': 1.5117217302322388} 11/07/2021 13:23:19 - INFO - __main__ - Step 114925: {'lr': 6.615303480474601e-05, 'samples': 22065600, 'steps': 114924, 'loss/train': 1.2700464725494385} 11/07/2021 13:23:19 - INFO - __main__ - Step 114926: {'lr': 6.614943874976409e-05, 'samples': 22065792, 'steps': 114925, 'loss/train': 1.1690702438354492} 11/07/2021 13:23:20 - INFO - __main__ - Step 114927: {'lr': 6.61458427776224e-05, 'samples': 22065984, 'steps': 114926, 'loss/train': 1.4858403205871582} 11/07/2021 13:23:21 - INFO - __main__ - Step 114928: {'lr': 6.614224688832255e-05, 'samples': 22066176, 'steps': 114927, 'loss/train': 1.5743948221206665} 11/07/2021 13:23:21 - INFO - __main__ - Step 114929: {'lr': 6.613865108186615e-05, 'samples': 22066368, 'steps': 114928, 'loss/train': 1.442663311958313} 11/07/2021 13:23:22 - INFO - __main__ - Step 114930: {'lr': 6.613505535825485e-05, 'samples': 22066560, 'steps': 114929, 'loss/train': 1.1448825597763062} 11/07/2021 13:23:22 - INFO - __main__ - Step 114931: {'lr': 6.613145971749029e-05, 'samples': 22066752, 'steps': 114930, 'loss/train': 1.2100489139556885} 11/07/2021 13:23:22 - INFO - __main__ - Step 114932: {'lr': 6.612786415957403e-05, 'samples': 22066944, 'steps': 114931, 'loss/train': 1.7608733177185059} 11/07/2021 13:23:23 - INFO - __main__ - Step 114933: {'lr': 6.612426868450771e-05, 'samples': 22067136, 'steps': 114932, 'loss/train': 1.388185977935791} 11/07/2021 13:23:24 - INFO - __main__ - Step 114934: {'lr': 6.612067329229296e-05, 'samples': 22067328, 'steps': 114933, 'loss/train': 1.2679643630981445} 11/07/2021 13:23:24 - INFO - __main__ - Step 114935: {'lr': 6.611707798293137e-05, 'samples': 22067520, 'steps': 114934, 'loss/train': 1.1631906032562256} 11/07/2021 13:23:24 - INFO - __main__ - Step 114936: {'lr': 6.611348275642462e-05, 'samples': 22067712, 'steps': 114935, 'loss/train': 1.2402024269104004} 11/07/2021 13:23:25 - INFO - __main__ - Step 114937: {'lr': 6.610988761277428e-05, 'samples': 22067904, 'steps': 114936, 'loss/train': 1.3340585231781006} 11/07/2021 13:23:26 - INFO - __main__ - Step 114938: {'lr': 6.610629255198197e-05, 'samples': 22068096, 'steps': 114937, 'loss/train': 1.1749733686447144} 11/07/2021 13:23:26 - INFO - __main__ - Step 114939: {'lr': 6.610269757404936e-05, 'samples': 22068288, 'steps': 114938, 'loss/train': 0.6226837038993835} 11/07/2021 13:23:26 - INFO - __main__ - Step 114940: {'lr': 6.609910267897804e-05, 'samples': 22068480, 'steps': 114939, 'loss/train': 1.2634104490280151} 11/07/2021 13:23:27 - INFO - __main__ - Step 114941: {'lr': 6.60955078667696e-05, 'samples': 22068672, 'steps': 114940, 'loss/train': 1.3719313144683838} 11/07/2021 13:23:27 - INFO - __main__ - Step 114942: {'lr': 6.609191313742569e-05, 'samples': 22068864, 'steps': 114941, 'loss/train': 1.5968323945999146} 11/07/2021 13:23:28 - INFO - __main__ - Step 114943: {'lr': 6.608831849094792e-05, 'samples': 22069056, 'steps': 114942, 'loss/train': 1.3209874629974365} 11/07/2021 13:23:28 - INFO - __main__ - Step 114944: {'lr': 6.608472392733802e-05, 'samples': 22069248, 'steps': 114943, 'loss/train': 0.77107834815979} 11/07/2021 13:23:29 - INFO - __main__ - Step 114945: {'lr': 6.608112944659741e-05, 'samples': 22069440, 'steps': 114944, 'loss/train': 1.3097102642059326} 11/07/2021 13:23:29 - INFO - __main__ - Step 114946: {'lr': 6.607753504872783e-05, 'samples': 22069632, 'steps': 114945, 'loss/train': 1.4034576416015625} 11/07/2021 13:23:30 - INFO - __main__ - Step 114947: {'lr': 6.607394073373083e-05, 'samples': 22069824, 'steps': 114946, 'loss/train': 1.185167670249939} 11/07/2021 13:23:31 - INFO - __main__ - Step 114948: {'lr': 6.60703465016081e-05, 'samples': 22070016, 'steps': 114947, 'loss/train': 1.434088110923767} 11/07/2021 13:23:31 - INFO - __main__ - Step 114949: {'lr': 6.606675235236122e-05, 'samples': 22070208, 'steps': 114948, 'loss/train': 1.2200368642807007} 11/07/2021 13:23:31 - INFO - __main__ - Step 114950: {'lr': 6.606315828599185e-05, 'samples': 22070400, 'steps': 114949, 'loss/train': 0.3927927017211914} 11/07/2021 13:23:32 - INFO - __main__ - Step 114951: {'lr': 6.605956430250156e-05, 'samples': 22070592, 'steps': 114950, 'loss/train': 1.2136317491531372} 11/07/2021 13:23:32 - INFO - __main__ - Step 114952: {'lr': 6.605597040189201e-05, 'samples': 22070784, 'steps': 114951, 'loss/train': 1.2445539236068726} 11/07/2021 13:23:33 - INFO - __main__ - Step 114953: {'lr': 6.60523765841648e-05, 'samples': 22070976, 'steps': 114952, 'loss/train': 1.2464540004730225} 11/07/2021 13:23:33 - INFO - __main__ - Step 114954: {'lr': 6.604878284932153e-05, 'samples': 22071168, 'steps': 114953, 'loss/train': 1.3177034854888916} 11/07/2021 13:23:34 - INFO - __main__ - Step 114955: {'lr': 6.604518919736385e-05, 'samples': 22071360, 'steps': 114954, 'loss/train': 1.0693724155426025} 11/07/2021 13:23:34 - INFO - __main__ - Step 114956: {'lr': 6.604159562829338e-05, 'samples': 22071552, 'steps': 114955, 'loss/train': 1.0096791982650757} 11/07/2021 13:23:35 - INFO - __main__ - Step 114957: {'lr': 6.60380021421117e-05, 'samples': 22071744, 'steps': 114956, 'loss/train': 1.952582597732544} 11/07/2021 13:23:35 - INFO - __main__ - Step 114958: {'lr': 6.603440873882055e-05, 'samples': 22071936, 'steps': 114957, 'loss/train': 1.28401780128479} 11/07/2021 13:23:36 - INFO - __main__ - Step 114959: {'lr': 6.603081541842137e-05, 'samples': 22072128, 'steps': 114958, 'loss/train': 1.0316916704177856} 11/07/2021 13:23:36 - INFO - __main__ - Step 114960: {'lr': 6.602722218091589e-05, 'samples': 22072320, 'steps': 114959, 'loss/train': 1.3639075756072998} 11/07/2021 13:23:37 - INFO - __main__ - Step 114961: {'lr': 6.602362902630571e-05, 'samples': 22072512, 'steps': 114960, 'loss/train': 1.358481764793396} 11/07/2021 13:23:37 - INFO - __main__ - Step 114962: {'lr': 6.60200359545924e-05, 'samples': 22072704, 'steps': 114961, 'loss/train': 1.2567566633224487} 11/07/2021 13:23:38 - INFO - __main__ - Step 114963: {'lr': 6.601644296577766e-05, 'samples': 22072896, 'steps': 114962, 'loss/train': 1.5946986675262451} 11/07/2021 13:23:38 - INFO - __main__ - Step 114964: {'lr': 6.601285005986307e-05, 'samples': 22073088, 'steps': 114963, 'loss/train': 1.3498371839523315} 11/07/2021 13:23:39 - INFO - __main__ - Step 114965: {'lr': 6.600925723685025e-05, 'samples': 22073280, 'steps': 114964, 'loss/train': 1.4508639574050903} 11/07/2021 13:23:39 - INFO - __main__ - Step 114966: {'lr': 6.600566449674081e-05, 'samples': 22073472, 'steps': 114965, 'loss/train': 0.92336106300354} 11/07/2021 13:23:39 - INFO - __main__ - Step 114967: {'lr': 6.600207183953638e-05, 'samples': 22073664, 'steps': 114966, 'loss/train': 1.6147397756576538} 11/07/2021 13:23:40 - INFO - __main__ - Step 114968: {'lr': 6.599847926523855e-05, 'samples': 22073856, 'steps': 114967, 'loss/train': 1.5309033393859863} 11/07/2021 13:23:41 - INFO - __main__ - Step 114969: {'lr': 6.599488677384902e-05, 'samples': 22074048, 'steps': 114968, 'loss/train': 1.2006433010101318} 11/07/2021 13:23:41 - INFO - __main__ - Step 114970: {'lr': 6.599129436536933e-05, 'samples': 22074240, 'steps': 114969, 'loss/train': 0.6393088102340698} 11/07/2021 13:23:41 - INFO - __main__ - Step 114971: {'lr': 6.598770203980117e-05, 'samples': 22074432, 'steps': 114970, 'loss/train': 1.8070377111434937} 11/07/2021 13:23:42 - INFO - __main__ - Step 114972: {'lr': 6.598410979714609e-05, 'samples': 22074624, 'steps': 114971, 'loss/train': 0.3972395956516266} 11/07/2021 13:23:42 - INFO - __main__ - Step 114973: {'lr': 6.59805176374057e-05, 'samples': 22074816, 'steps': 114972, 'loss/train': 0.8383896350860596} 11/07/2021 13:23:43 - INFO - __main__ - Step 114974: {'lr': 6.597692556058163e-05, 'samples': 22075008, 'steps': 114973, 'loss/train': 1.4545464515686035} 11/07/2021 13:23:44 - INFO - __main__ - Step 114975: {'lr': 6.597333356667557e-05, 'samples': 22075200, 'steps': 114974, 'loss/train': 1.3173660039901733} 11/07/2021 13:23:44 - INFO - __main__ - Step 114976: {'lr': 6.596974165568903e-05, 'samples': 22075392, 'steps': 114975, 'loss/train': 1.3176283836364746} 11/07/2021 13:23:44 - INFO - __main__ - Step 114977: {'lr': 6.596614982762372e-05, 'samples': 22075584, 'steps': 114976, 'loss/train': 1.1452738046646118} 11/07/2021 13:23:45 - INFO - __main__ - Step 114978: {'lr': 6.596255808248122e-05, 'samples': 22075776, 'steps': 114977, 'loss/train': 0.6278060078620911} 11/07/2021 13:23:46 - INFO - __main__ - Step 114979: {'lr': 6.595896642026315e-05, 'samples': 22075968, 'steps': 114978, 'loss/train': 1.2892941236495972} 11/07/2021 13:23:46 - INFO - __main__ - Step 114980: {'lr': 6.595537484097112e-05, 'samples': 22076160, 'steps': 114979, 'loss/train': 0.7802904844284058} 11/07/2021 13:23:46 - INFO - __main__ - Step 114981: {'lr': 6.595178334460674e-05, 'samples': 22076352, 'steps': 114980, 'loss/train': 1.5227206945419312} 11/07/2021 13:23:47 - INFO - __main__ - Step 114982: {'lr': 6.594819193117168e-05, 'samples': 22076544, 'steps': 114981, 'loss/train': 1.1612069606781006} 11/07/2021 13:23:47 - INFO - __main__ - Step 114983: {'lr': 6.594460060066754e-05, 'samples': 22076736, 'steps': 114982, 'loss/train': 1.8162955045700073} 11/07/2021 13:23:48 - INFO - __main__ - Step 114984: {'lr': 6.594100935309596e-05, 'samples': 22076928, 'steps': 114983, 'loss/train': 1.3240196704864502} 11/07/2021 13:23:48 - INFO - __main__ - Step 114985: {'lr': 6.593741818845845e-05, 'samples': 22077120, 'steps': 114984, 'loss/train': 1.8366795778274536} 11/07/2021 13:23:49 - INFO - __main__ - Step 114986: {'lr': 6.593382710675672e-05, 'samples': 22077312, 'steps': 114985, 'loss/train': 1.021881341934204} 11/07/2021 13:23:49 - INFO - __main__ - Step 114987: {'lr': 6.593023610799234e-05, 'samples': 22077504, 'steps': 114986, 'loss/train': 1.2161986827850342} 11/07/2021 13:23:49 - INFO - __main__ - Step 114988: {'lr': 6.592664519216698e-05, 'samples': 22077696, 'steps': 114987, 'loss/train': 1.380768060684204} 11/07/2021 13:23:51 - INFO - __main__ - Step 114989: {'lr': 6.592305435928222e-05, 'samples': 22077888, 'steps': 114988, 'loss/train': 2.080043315887451} 11/07/2021 13:23:51 - INFO - __main__ - Step 114990: {'lr': 6.59194636093397e-05, 'samples': 22078080, 'steps': 114989, 'loss/train': 1.2132465839385986} 11/07/2021 13:23:51 - INFO - __main__ - Step 114991: {'lr': 6.591587294234102e-05, 'samples': 22078272, 'steps': 114990, 'loss/train': 1.466543436050415} 11/07/2021 13:23:52 - INFO - __main__ - Step 114992: {'lr': 6.591228235828781e-05, 'samples': 22078464, 'steps': 114991, 'loss/train': 1.6245265007019043} 11/07/2021 13:23:52 - INFO - __main__ - Step 114993: {'lr': 6.590869185718169e-05, 'samples': 22078656, 'steps': 114992, 'loss/train': 1.3550353050231934} 11/07/2021 13:23:53 - INFO - __main__ - Step 114994: {'lr': 6.590510143902425e-05, 'samples': 22078848, 'steps': 114993, 'loss/train': 1.2789344787597656} 11/07/2021 13:23:54 - INFO - __main__ - Step 114995: {'lr': 6.590151110381723e-05, 'samples': 22079040, 'steps': 114994, 'loss/train': 1.018890380859375} 11/07/2021 13:23:54 - INFO - __main__ - Step 114996: {'lr': 6.589792085156207e-05, 'samples': 22079232, 'steps': 114995, 'loss/train': 1.5638974905014038} 11/07/2021 13:23:54 - INFO - __main__ - Step 114997: {'lr': 6.589433068226047e-05, 'samples': 22079424, 'steps': 114996, 'loss/train': 1.6851218938827515} 11/07/2021 13:23:55 - INFO - __main__ - Step 114998: {'lr': 6.589074059591404e-05, 'samples': 22079616, 'steps': 114997, 'loss/train': 0.8386140465736389} 11/07/2021 13:23:55 - INFO - __main__ - Step 114999: {'lr': 6.58871505925244e-05, 'samples': 22079808, 'steps': 114998, 'loss/train': 1.3524149656295776} 11/07/2021 13:23:56 - INFO - __main__ - Step 115000: {'lr': 6.588356067209316e-05, 'samples': 22080000, 'steps': 114999, 'loss/train': 1.3512872457504272} 11/07/2021 13:23:56 - INFO - __main__ - Step 115001: {'lr': 6.587997083462196e-05, 'samples': 22080192, 'steps': 115000, 'loss/train': 1.5819560289382935} 11/07/2021 13:23:57 - INFO - __main__ - Step 115002: {'lr': 6.58763810801124e-05, 'samples': 22080384, 'steps': 115001, 'loss/train': 1.4905917644500732} 11/07/2021 13:23:57 - INFO - __main__ - Step 115003: {'lr': 6.587279140856609e-05, 'samples': 22080576, 'steps': 115002, 'loss/train': 1.3235459327697754} 11/07/2021 13:23:58 - INFO - __main__ - Step 115004: {'lr': 6.586920181998468e-05, 'samples': 22080768, 'steps': 115003, 'loss/train': 1.4494658708572388} 11/07/2021 13:23:59 - INFO - __main__ - Step 115005: {'lr': 6.586561231436975e-05, 'samples': 22080960, 'steps': 115004, 'loss/train': 1.5982922315597534} 11/07/2021 13:23:59 - INFO - __main__ - Step 115006: {'lr': 6.5862022891723e-05, 'samples': 22081152, 'steps': 115005, 'loss/train': 1.2986860275268555} 11/07/2021 13:24:00 - INFO - __main__ - Step 115007: {'lr': 6.585843355204593e-05, 'samples': 22081344, 'steps': 115006, 'loss/train': 1.6683686971664429} 11/07/2021 13:24:00 - INFO - __main__ - Step 115008: {'lr': 6.58548442953402e-05, 'samples': 22081536, 'steps': 115007, 'loss/train': 1.6090137958526611} 11/07/2021 13:24:00 - INFO - __main__ - Step 115009: {'lr': 6.585125512160742e-05, 'samples': 22081728, 'steps': 115008, 'loss/train': 0.5511904358863831} 11/07/2021 13:24:01 - INFO - __main__ - Step 115010: {'lr': 6.584766603084924e-05, 'samples': 22081920, 'steps': 115009, 'loss/train': 0.7090771198272705} 11/07/2021 13:24:01 - INFO - __main__ - Step 115011: {'lr': 6.584407702306727e-05, 'samples': 22082112, 'steps': 115010, 'loss/train': 0.9470782279968262} 11/07/2021 13:24:02 - INFO - __main__ - Step 115012: {'lr': 6.58404880982631e-05, 'samples': 22082304, 'steps': 115011, 'loss/train': 1.5655674934387207} 11/07/2021 13:24:02 - INFO - __main__ - Step 115013: {'lr': 6.583689925643835e-05, 'samples': 22082496, 'steps': 115012, 'loss/train': 1.6223071813583374} 11/07/2021 13:24:03 - INFO - __main__ - Step 115014: {'lr': 6.583331049759467e-05, 'samples': 22082688, 'steps': 115013, 'loss/train': 0.9163028001785278} 11/07/2021 13:24:03 - INFO - __main__ - Step 115015: {'lr': 6.582972182173366e-05, 'samples': 22082880, 'steps': 115014, 'loss/train': 1.6446908712387085} 11/07/2021 13:24:03 - INFO - __main__ - Step 115016: {'lr': 6.582613322885695e-05, 'samples': 22083072, 'steps': 115015, 'loss/train': 1.424792766571045} 11/07/2021 13:24:05 - INFO - __main__ - Step 115017: {'lr': 6.582254471896618e-05, 'samples': 22083264, 'steps': 115016, 'loss/train': 1.1136754751205444} 11/07/2021 13:24:05 - INFO - __main__ - Step 115018: {'lr': 6.581895629206288e-05, 'samples': 22083456, 'steps': 115017, 'loss/train': 1.0421730279922485} 11/07/2021 13:24:05 - INFO - __main__ - Step 115019: {'lr': 6.581536794814871e-05, 'samples': 22083648, 'steps': 115018, 'loss/train': 1.1311036348342896} 11/07/2021 13:24:06 - INFO - __main__ - Step 115020: {'lr': 6.581177968722529e-05, 'samples': 22083840, 'steps': 115019, 'loss/train': 1.1512330770492554} 11/07/2021 13:24:06 - INFO - __main__ - Step 115021: {'lr': 6.580819150929427e-05, 'samples': 22084032, 'steps': 115020, 'loss/train': 1.3408863544464111} 11/07/2021 13:24:07 - INFO - __main__ - Step 115022: {'lr': 6.58046034143572e-05, 'samples': 22084224, 'steps': 115021, 'loss/train': 1.310315728187561} 11/07/2021 13:24:07 - INFO - __main__ - Step 115023: {'lr': 6.580101540241573e-05, 'samples': 22084416, 'steps': 115022, 'loss/train': 1.5151896476745605} 11/07/2021 13:24:08 - INFO - __main__ - Step 115024: {'lr': 6.57974274734715e-05, 'samples': 22084608, 'steps': 115023, 'loss/train': 1.1996909379959106} 11/07/2021 13:24:08 - INFO - __main__ - Step 115025: {'lr': 6.579383962752611e-05, 'samples': 22084800, 'steps': 115024, 'loss/train': 1.2609057426452637} 11/07/2021 13:24:08 - INFO - __main__ - Step 115026: {'lr': 6.579025186458116e-05, 'samples': 22084992, 'steps': 115025, 'loss/train': 0.9864089488983154} 11/07/2021 13:24:09 - INFO - __main__ - Step 115027: {'lr': 6.578666418463827e-05, 'samples': 22085184, 'steps': 115026, 'loss/train': 1.3741209506988525} 11/07/2021 13:24:10 - INFO - __main__ - Step 115028: {'lr': 6.578307658769916e-05, 'samples': 22085376, 'steps': 115027, 'loss/train': 0.6363844871520996} 11/07/2021 13:24:10 - INFO - __main__ - Step 115029: {'lr': 6.577948907376527e-05, 'samples': 22085568, 'steps': 115028, 'loss/train': 0.8633424043655396} 11/07/2021 13:24:10 - INFO - __main__ - Step 115030: {'lr': 6.577590164283831e-05, 'samples': 22085760, 'steps': 115029, 'loss/train': 1.668300986289978} 11/07/2021 13:24:11 - INFO - __main__ - Step 115031: {'lr': 6.577231429491986e-05, 'samples': 22085952, 'steps': 115030, 'loss/train': 1.4866822957992554} 11/07/2021 13:24:12 - INFO - __main__ - Step 115032: {'lr': 6.57687270300116e-05, 'samples': 22086144, 'steps': 115031, 'loss/train': 1.0765608549118042} 11/07/2021 13:24:12 - INFO - __main__ - Step 115033: {'lr': 6.576513984811508e-05, 'samples': 22086336, 'steps': 115032, 'loss/train': 1.0781025886535645} 11/07/2021 13:24:13 - INFO - __main__ - Step 115034: {'lr': 6.576155274923196e-05, 'samples': 22086528, 'steps': 115033, 'loss/train': 1.3166016340255737} 11/07/2021 13:24:13 - INFO - __main__ - Step 115035: {'lr': 6.575796573336384e-05, 'samples': 22086720, 'steps': 115034, 'loss/train': 1.0335427522659302} 11/07/2021 13:24:13 - INFO - __main__ - Step 115036: {'lr': 6.575437880051233e-05, 'samples': 22086912, 'steps': 115035, 'loss/train': 0.5460379719734192} 11/07/2021 13:24:14 - INFO - __main__ - Step 115037: {'lr': 6.575079195067907e-05, 'samples': 22087104, 'steps': 115036, 'loss/train': 1.3080050945281982} 11/07/2021 13:24:15 - INFO - __main__ - Step 115038: {'lr': 6.574720518386565e-05, 'samples': 22087296, 'steps': 115037, 'loss/train': 1.6045305728912354} 11/07/2021 13:24:15 - INFO - __main__ - Step 115039: {'lr': 6.574361850007376e-05, 'samples': 22087488, 'steps': 115038, 'loss/train': 1.471306324005127} 11/07/2021 13:24:15 - INFO - __main__ - Step 115040: {'lr': 6.574003189930488e-05, 'samples': 22087680, 'steps': 115039, 'loss/train': 1.7909144163131714} 11/07/2021 13:24:16 - INFO - __main__ - Step 115041: {'lr': 6.57364453815607e-05, 'samples': 22087872, 'steps': 115040, 'loss/train': 1.5803649425506592} 11/07/2021 13:24:16 - INFO - __main__ - Step 115042: {'lr': 6.573285894684287e-05, 'samples': 22088064, 'steps': 115041, 'loss/train': 1.2519028186798096} 11/07/2021 13:24:17 - INFO - __main__ - Step 115043: {'lr': 6.572927259515293e-05, 'samples': 22088256, 'steps': 115042, 'loss/train': 1.207200527191162} 11/07/2021 13:24:17 - INFO - __main__ - Step 115044: {'lr': 6.572568632649253e-05, 'samples': 22088448, 'steps': 115043, 'loss/train': 1.8945735692977905} 11/07/2021 13:24:18 - INFO - __main__ - Step 115045: {'lr': 6.572210014086333e-05, 'samples': 22088640, 'steps': 115044, 'loss/train': 1.1898632049560547} 11/07/2021 13:24:18 - INFO - __main__ - Step 115046: {'lr': 6.571851403826686e-05, 'samples': 22088832, 'steps': 115045, 'loss/train': 1.1795860528945923} 11/07/2021 13:24:18 - INFO - __main__ - Step 115047: {'lr': 6.571492801870483e-05, 'samples': 22089024, 'steps': 115046, 'loss/train': 1.4837384223937988} 11/07/2021 13:24:20 - INFO - __main__ - Step 115048: {'lr': 6.571134208217877e-05, 'samples': 22089216, 'steps': 115047, 'loss/train': 1.0090211629867554} 11/07/2021 13:24:21 - INFO - __main__ - Step 115049: {'lr': 6.570775622869039e-05, 'samples': 22089408, 'steps': 115048, 'loss/train': 1.2115797996520996} 11/07/2021 13:24:21 - INFO - __main__ - Step 115050: {'lr': 6.57041704582412e-05, 'samples': 22089600, 'steps': 115049, 'loss/train': 1.1211411952972412} 11/07/2021 13:24:21 - INFO - __main__ - Step 115051: {'lr': 6.570058477083288e-05, 'samples': 22089792, 'steps': 115050, 'loss/train': 1.6788368225097656} 11/07/2021 13:24:22 - INFO - __main__ - Step 115052: {'lr': 6.56969991664671e-05, 'samples': 22089984, 'steps': 115051, 'loss/train': 1.9025706052780151} 11/07/2021 13:24:22 - INFO - __main__ - Step 115053: {'lr': 6.569341364514537e-05, 'samples': 22090176, 'steps': 115052, 'loss/train': 1.3947290182113647} 11/07/2021 13:24:23 - INFO - __main__ - Step 115054: {'lr': 6.568982820686931e-05, 'samples': 22090368, 'steps': 115053, 'loss/train': 1.065372347831726} 11/07/2021 13:24:23 - INFO - __main__ - Step 115055: {'lr': 6.568624285164057e-05, 'samples': 22090560, 'steps': 115054, 'loss/train': 1.5297727584838867} 11/07/2021 13:24:24 - INFO - __main__ - Step 115056: {'lr': 6.568265757946076e-05, 'samples': 22090752, 'steps': 115055, 'loss/train': 1.2802915573120117} 11/07/2021 13:24:24 - INFO - __main__ - Step 115057: {'lr': 6.567907239033153e-05, 'samples': 22090944, 'steps': 115056, 'loss/train': 0.4609087407588959} 11/07/2021 13:24:24 - INFO - __main__ - Step 115058: {'lr': 6.567548728425443e-05, 'samples': 22091136, 'steps': 115057, 'loss/train': 1.5775425434112549} 11/07/2021 13:24:25 - INFO - __main__ - Step 115059: {'lr': 6.567190226123113e-05, 'samples': 22091328, 'steps': 115058, 'loss/train': 1.2633126974105835} 11/07/2021 13:24:26 - INFO - __main__ - Step 115060: {'lr': 6.566831732126324e-05, 'samples': 22091520, 'steps': 115059, 'loss/train': 1.298119306564331} 11/07/2021 13:24:26 - INFO - __main__ - Step 115061: {'lr': 6.566473246435234e-05, 'samples': 22091712, 'steps': 115060, 'loss/train': 1.2604063749313354} 11/07/2021 13:24:27 - INFO - __main__ - Step 115062: {'lr': 6.566114769050008e-05, 'samples': 22091904, 'steps': 115061, 'loss/train': 1.3029637336730957} 11/07/2021 13:24:27 - INFO - __main__ - Step 115063: {'lr': 6.565756299970804e-05, 'samples': 22092096, 'steps': 115062, 'loss/train': 1.0210778713226318} 11/07/2021 13:24:27 - INFO - __main__ - Step 115064: {'lr': 6.56539783919779e-05, 'samples': 22092288, 'steps': 115063, 'loss/train': 1.4212077856063843} 11/07/2021 13:24:28 - INFO - __main__ - Step 115065: {'lr': 6.565039386731128e-05, 'samples': 22092480, 'steps': 115064, 'loss/train': 1.570522665977478} 11/07/2021 13:24:29 - INFO - __main__ - Step 115066: {'lr': 6.564680942570966e-05, 'samples': 22092672, 'steps': 115065, 'loss/train': 1.3645367622375488} 11/07/2021 13:24:29 - INFO - __main__ - Step 115067: {'lr': 6.564322506717477e-05, 'samples': 22092864, 'steps': 115066, 'loss/train': 1.3102209568023682} 11/07/2021 13:24:29 - INFO - __main__ - Step 115068: {'lr': 6.563964079170817e-05, 'samples': 22093056, 'steps': 115067, 'loss/train': 1.4231486320495605} 11/07/2021 13:24:30 - INFO - __main__ - Step 115069: {'lr': 6.563605659931152e-05, 'samples': 22093248, 'steps': 115068, 'loss/train': 1.3305063247680664} 11/07/2021 13:24:31 - INFO - __main__ - Step 115070: {'lr': 6.563247248998644e-05, 'samples': 22093440, 'steps': 115069, 'loss/train': 1.382962703704834} 11/07/2021 13:24:31 - INFO - __main__ - Step 115071: {'lr': 6.56288884637345e-05, 'samples': 22093632, 'steps': 115070, 'loss/train': 1.217934489250183} 11/07/2021 13:24:32 - INFO - __main__ - Step 115072: {'lr': 6.562530452055731e-05, 'samples': 22093824, 'steps': 115071, 'loss/train': 1.0069695711135864} 11/07/2021 13:24:32 - INFO - __main__ - Step 115073: {'lr': 6.562172066045655e-05, 'samples': 22094016, 'steps': 115072, 'loss/train': 0.9528453946113586} 11/07/2021 13:24:32 - INFO - __main__ - Step 115074: {'lr': 6.56181368834338e-05, 'samples': 22094208, 'steps': 115073, 'loss/train': 1.2835344076156616} 11/07/2021 13:24:33 - INFO - __main__ - Step 115075: {'lr': 6.561455318949063e-05, 'samples': 22094400, 'steps': 115074, 'loss/train': 0.9954005479812622} 11/07/2021 13:24:34 - INFO - __main__ - Step 115076: {'lr': 6.561096957862875e-05, 'samples': 22094592, 'steps': 115075, 'loss/train': 1.6083441972732544} 11/07/2021 13:24:34 - INFO - __main__ - Step 115077: {'lr': 6.56073860508497e-05, 'samples': 22094784, 'steps': 115076, 'loss/train': 0.04727107658982277} 11/07/2021 13:24:34 - INFO - __main__ - Step 115078: {'lr': 6.560380260615512e-05, 'samples': 22094976, 'steps': 115077, 'loss/train': 0.9320102334022522} 11/07/2021 13:24:35 - INFO - __main__ - Step 115079: {'lr': 6.560021924454668e-05, 'samples': 22095168, 'steps': 115078, 'loss/train': 1.7452421188354492} 11/07/2021 13:24:36 - INFO - __main__ - Step 115080: {'lr': 6.559663596602588e-05, 'samples': 22095360, 'steps': 115079, 'loss/train': 1.3894516229629517} 11/07/2021 13:24:36 - INFO - __main__ - Step 115081: {'lr': 6.559305277059438e-05, 'samples': 22095552, 'steps': 115080, 'loss/train': 1.3089408874511719} 11/07/2021 13:24:36 - INFO - __main__ - Step 115082: {'lr': 6.55894696582538e-05, 'samples': 22095744, 'steps': 115081, 'loss/train': 1.6780396699905396} 11/07/2021 13:24:37 - INFO - __main__ - Step 115083: {'lr': 6.558588662900577e-05, 'samples': 22095936, 'steps': 115082, 'loss/train': 1.2064541578292847} 11/07/2021 13:24:37 - INFO - __main__ - Step 115084: {'lr': 6.558230368285189e-05, 'samples': 22096128, 'steps': 115083, 'loss/train': 0.92364501953125} 11/07/2021 13:24:38 - INFO - __main__ - Step 115085: {'lr': 6.55787208197938e-05, 'samples': 22096320, 'steps': 115084, 'loss/train': 1.312469244003296} 11/07/2021 13:24:39 - INFO - __main__ - Step 115086: {'lr': 6.557513803983306e-05, 'samples': 22096512, 'steps': 115085, 'loss/train': 1.5647066831588745} 11/07/2021 13:24:39 - INFO - __main__ - Step 115087: {'lr': 6.557155534297133e-05, 'samples': 22096704, 'steps': 115086, 'loss/train': 1.218130111694336} 11/07/2021 13:24:39 - INFO - __main__ - Step 115088: {'lr': 6.55679727292102e-05, 'samples': 22096896, 'steps': 115087, 'loss/train': 1.2064937353134155} 11/07/2021 13:24:40 - INFO - __main__ - Step 115089: {'lr': 6.556439019855131e-05, 'samples': 22097088, 'steps': 115088, 'loss/train': 1.4365125894546509} 11/07/2021 13:24:40 - INFO - __main__ - Step 115090: {'lr': 6.556080775099626e-05, 'samples': 22097280, 'steps': 115089, 'loss/train': 1.525551438331604} 11/07/2021 13:24:42 - INFO - __main__ - Step 115091: {'lr': 6.555722538654665e-05, 'samples': 22097472, 'steps': 115090, 'loss/train': 1.3865209817886353} 11/07/2021 13:24:42 - INFO - __main__ - Step 115092: {'lr': 6.555364310520421e-05, 'samples': 22097664, 'steps': 115091, 'loss/train': 0.3358631134033203} 11/07/2021 13:24:42 - INFO - __main__ - Step 115093: {'lr': 6.555006090697035e-05, 'samples': 22097856, 'steps': 115092, 'loss/train': 1.475423812866211} 11/07/2021 13:24:43 - INFO - __main__ - Step 115094: {'lr': 6.55464787918468e-05, 'samples': 22098048, 'steps': 115093, 'loss/train': 1.5787688493728638} 11/07/2021 13:24:43 - INFO - __main__ - Step 115095: {'lr': 6.554289675983516e-05, 'samples': 22098240, 'steps': 115094, 'loss/train': 0.9556688070297241} 11/07/2021 13:24:43 - INFO - __main__ - Step 115096: {'lr': 6.553931481093703e-05, 'samples': 22098432, 'steps': 115095, 'loss/train': 1.2837797403335571} 11/07/2021 13:24:44 - INFO - __main__ - Step 115097: {'lr': 6.553573294515405e-05, 'samples': 22098624, 'steps': 115096, 'loss/train': 5.489304065704346} 11/07/2021 13:24:45 - INFO - __main__ - Step 115098: {'lr': 6.553215116248781e-05, 'samples': 22098816, 'steps': 115097, 'loss/train': 4.063844680786133} 11/07/2021 13:24:45 - INFO - __main__ - Step 115099: {'lr': 6.552856946293998e-05, 'samples': 22099008, 'steps': 115098, 'loss/train': 1.6348930597305298} 11/07/2021 13:24:45 - INFO - __main__ - Step 115100: {'lr': 6.552498784651209e-05, 'samples': 22099200, 'steps': 115099, 'loss/train': 1.46405827999115} 11/07/2021 13:24:46 - INFO - __main__ - Step 115101: {'lr': 6.55214063132058e-05, 'samples': 22099392, 'steps': 115100, 'loss/train': 1.2641700506210327} 11/07/2021 13:24:46 - INFO - __main__ - Step 115102: {'lr': 6.551782486302271e-05, 'samples': 22099584, 'steps': 115101, 'loss/train': 1.5481162071228027} 11/07/2021 13:24:47 - INFO - __main__ - Step 115103: {'lr': 6.551424349596444e-05, 'samples': 22099776, 'steps': 115102, 'loss/train': 1.3509297370910645} 11/07/2021 13:24:48 - INFO - __main__ - Step 115104: {'lr': 6.55106622120326e-05, 'samples': 22099968, 'steps': 115103, 'loss/train': 1.0808037519454956} 11/07/2021 13:24:48 - INFO - __main__ - Step 115105: {'lr': 6.550708101122885e-05, 'samples': 22100160, 'steps': 115104, 'loss/train': 1.3089083433151245} 11/07/2021 13:24:48 - INFO - __main__ - Step 115106: {'lr': 6.550349989355481e-05, 'samples': 22100352, 'steps': 115105, 'loss/train': 1.5020167827606201} 11/07/2021 13:24:49 - INFO - __main__ - Step 115107: {'lr': 6.549991885901197e-05, 'samples': 22100544, 'steps': 115106, 'loss/train': 0.8090583086013794} 11/07/2021 13:24:50 - INFO - __main__ - Step 115108: {'lr': 6.549633790760204e-05, 'samples': 22100736, 'steps': 115107, 'loss/train': 1.2231601476669312} 11/07/2021 13:24:50 - INFO - __main__ - Step 115109: {'lr': 6.549275703932659e-05, 'samples': 22100928, 'steps': 115108, 'loss/train': 1.1208573579788208} 11/07/2021 13:24:50 - INFO - __main__ - Step 115110: {'lr': 6.548917625418727e-05, 'samples': 22101120, 'steps': 115109, 'loss/train': 1.46486234664917} 11/07/2021 13:24:51 - INFO - __main__ - Step 115111: {'lr': 6.548559555218567e-05, 'samples': 22101312, 'steps': 115110, 'loss/train': 1.3871527910232544} 11/07/2021 13:24:51 - INFO - __main__ - Step 115112: {'lr': 6.54820149333234e-05, 'samples': 22101504, 'steps': 115111, 'loss/train': 0.967499315738678} 11/07/2021 13:24:52 - INFO - __main__ - Step 115113: {'lr': 6.547843439760209e-05, 'samples': 22101696, 'steps': 115112, 'loss/train': 1.5134556293487549} 11/07/2021 13:24:52 - INFO - __main__ - Step 115114: {'lr': 6.547485394502337e-05, 'samples': 22101888, 'steps': 115113, 'loss/train': 0.9744186997413635} 11/07/2021 13:24:53 - INFO - __main__ - Step 115115: {'lr': 6.547127357558883e-05, 'samples': 22102080, 'steps': 115114, 'loss/train': 1.1842093467712402} 11/07/2021 13:24:53 - INFO - __main__ - Step 115116: {'lr': 6.546769328930008e-05, 'samples': 22102272, 'steps': 115115, 'loss/train': 1.9014456272125244} 11/07/2021 13:24:54 - INFO - __main__ - Step 115117: {'lr': 6.546411308615873e-05, 'samples': 22102464, 'steps': 115116, 'loss/train': 1.6897779703140259} 11/07/2021 13:24:55 - INFO - __main__ - Step 115118: {'lr': 6.546053296616644e-05, 'samples': 22102656, 'steps': 115117, 'loss/train': 1.2072621583938599} 11/07/2021 13:24:55 - INFO - __main__ - Step 115119: {'lr': 6.545695292932482e-05, 'samples': 22102848, 'steps': 115118, 'loss/train': 0.9353774189949036} 11/07/2021 13:24:56 - INFO - __main__ - Step 115120: {'lr': 6.545337297563539e-05, 'samples': 22103040, 'steps': 115119, 'loss/train': 1.3823274374008179} 11/07/2021 13:24:56 - INFO - __main__ - Step 115121: {'lr': 6.544979310509983e-05, 'samples': 22103232, 'steps': 115120, 'loss/train': 1.3066924810409546} 11/07/2021 13:24:56 - INFO - __main__ - Step 115122: {'lr': 6.544621331771974e-05, 'samples': 22103424, 'steps': 115121, 'loss/train': 0.9733366370201111} 11/07/2021 13:24:57 - INFO - __main__ - Step 115123: {'lr': 6.544263361349673e-05, 'samples': 22103616, 'steps': 115122, 'loss/train': 1.1766777038574219} 11/07/2021 13:24:58 - INFO - __main__ - Step 115124: {'lr': 6.543905399243244e-05, 'samples': 22103808, 'steps': 115123, 'loss/train': 0.9566808938980103} 11/07/2021 13:24:58 - INFO - __main__ - Step 115125: {'lr': 6.543547445452844e-05, 'samples': 22104000, 'steps': 115124, 'loss/train': 1.2372472286224365} 11/07/2021 13:24:58 - INFO - __main__ - Step 115126: {'lr': 6.543189499978639e-05, 'samples': 22104192, 'steps': 115125, 'loss/train': 1.1391185522079468} 11/07/2021 13:24:59 - INFO - __main__ - Step 115127: {'lr': 6.542831562820787e-05, 'samples': 22104384, 'steps': 115126, 'loss/train': 1.0601953268051147} 11/07/2021 13:24:59 - INFO - __main__ - Step 115128: {'lr': 6.54247363397945e-05, 'samples': 22104576, 'steps': 115127, 'loss/train': 1.1911826133728027} 11/07/2021 13:25:00 - INFO - __main__ - Step 115129: {'lr': 6.542115713454791e-05, 'samples': 22104768, 'steps': 115128, 'loss/train': 0.35191968083381653} 11/07/2021 13:25:01 - INFO - __main__ - Step 115130: {'lr': 6.541757801246968e-05, 'samples': 22104960, 'steps': 115129, 'loss/train': 1.4060463905334473} 11/07/2021 13:25:01 - INFO - __main__ - Step 115131: {'lr': 6.541399897356143e-05, 'samples': 22105152, 'steps': 115130, 'loss/train': 1.1877061128616333} 11/07/2021 13:25:01 - INFO - __main__ - Step 115132: {'lr': 6.541042001782488e-05, 'samples': 22105344, 'steps': 115131, 'loss/train': 1.1639753580093384} 11/07/2021 13:25:02 - INFO - __main__ - Step 115133: {'lr': 6.540684114526147e-05, 'samples': 22105536, 'steps': 115132, 'loss/train': 1.3861688375473022} 11/07/2021 13:25:03 - INFO - __main__ - Step 115134: {'lr': 6.54032623558729e-05, 'samples': 22105728, 'steps': 115133, 'loss/train': 1.1881771087646484} 11/07/2021 13:25:03 - INFO - __main__ - Step 115135: {'lr': 6.539968364966076e-05, 'samples': 22105920, 'steps': 115134, 'loss/train': 1.2631865739822388} 11/07/2021 13:25:03 - INFO - __main__ - Step 115136: {'lr': 6.539610502662666e-05, 'samples': 22106112, 'steps': 115135, 'loss/train': 1.7726457118988037} 11/07/2021 13:25:04 - INFO - __main__ - Step 115137: {'lr': 6.539252648677224e-05, 'samples': 22106304, 'steps': 115136, 'loss/train': 0.9207097291946411} 11/07/2021 13:25:04 - INFO - __main__ - Step 115138: {'lr': 6.538894803009909e-05, 'samples': 22106496, 'steps': 115137, 'loss/train': 1.480177402496338} 11/07/2021 13:25:05 - INFO - __main__ - Step 115139: {'lr': 6.538536965660886e-05, 'samples': 22106688, 'steps': 115138, 'loss/train': 1.4203317165374756} 11/07/2021 13:25:05 - INFO - __main__ - Step 115140: {'lr': 6.53817913663031e-05, 'samples': 22106880, 'steps': 115139, 'loss/train': 1.5342119932174683} 11/07/2021 13:25:06 - INFO - __main__ - Step 115141: {'lr': 6.537821315918347e-05, 'samples': 22107072, 'steps': 115140, 'loss/train': 2.1844687461853027} 11/07/2021 13:25:06 - INFO - __main__ - Step 115142: {'lr': 6.537463503525157e-05, 'samples': 22107264, 'steps': 115141, 'loss/train': 1.2893714904785156} 11/07/2021 13:25:07 - INFO - __main__ - Step 115143: {'lr': 6.537105699450901e-05, 'samples': 22107456, 'steps': 115142, 'loss/train': 1.3250261545181274} 11/07/2021 13:25:08 - INFO - __main__ - Step 115144: {'lr': 6.536747903695739e-05, 'samples': 22107648, 'steps': 115143, 'loss/train': 1.2649295330047607} 11/07/2021 13:25:08 - INFO - __main__ - Step 115145: {'lr': 6.536390116259835e-05, 'samples': 22107840, 'steps': 115144, 'loss/train': 1.2754877805709839} 11/07/2021 13:25:08 - INFO - __main__ - Step 115146: {'lr': 6.536032337143355e-05, 'samples': 22108032, 'steps': 115145, 'loss/train': 0.8130956888198853} 11/07/2021 13:25:09 - INFO - __main__ - Step 115147: {'lr': 6.535674566346448e-05, 'samples': 22108224, 'steps': 115146, 'loss/train': 1.4559834003448486} 11/07/2021 13:25:09 - INFO - __main__ - Step 115148: {'lr': 6.535316803869279e-05, 'samples': 22108416, 'steps': 115147, 'loss/train': 1.3513847589492798} 11/07/2021 13:25:10 - INFO - __main__ - Step 115149: {'lr': 6.534959049712014e-05, 'samples': 22108608, 'steps': 115148, 'loss/train': 1.4770135879516602} 11/07/2021 13:25:10 - INFO - __main__ - Step 115150: {'lr': 6.53460130387481e-05, 'samples': 22108800, 'steps': 115149, 'loss/train': 1.6394482851028442} 11/07/2021 13:25:11 - INFO - __main__ - Step 115151: {'lr': 6.53424356635783e-05, 'samples': 22108992, 'steps': 115150, 'loss/train': 0.7824711799621582} 11/07/2021 13:25:11 - INFO - __main__ - Step 115152: {'lr': 6.533885837161236e-05, 'samples': 22109184, 'steps': 115151, 'loss/train': 1.1036616563796997} 11/07/2021 13:25:11 - INFO - __main__ - Step 115153: {'lr': 6.533528116285184e-05, 'samples': 22109376, 'steps': 115152, 'loss/train': 1.5268487930297852} 11/07/2021 13:25:12 - INFO - __main__ - Step 115154: {'lr': 6.533170403729843e-05, 'samples': 22109568, 'steps': 115153, 'loss/train': 1.3364548683166504} 11/07/2021 13:25:13 - INFO - __main__ - Step 115155: {'lr': 6.532812699495369e-05, 'samples': 22109760, 'steps': 115154, 'loss/train': 1.1984174251556396} 11/07/2021 13:25:13 - INFO - __main__ - Step 115156: {'lr': 6.532455003581925e-05, 'samples': 22109952, 'steps': 115155, 'loss/train': 0.790915846824646} 11/07/2021 13:25:14 - INFO - __main__ - Step 115157: {'lr': 6.532097315989675e-05, 'samples': 22110144, 'steps': 115156, 'loss/train': 1.4740246534347534} 11/07/2021 13:25:14 - INFO - __main__ - Step 115158: {'lr': 6.531739636718773e-05, 'samples': 22110336, 'steps': 115157, 'loss/train': 1.4106749296188354} 11/07/2021 13:25:14 - INFO - __main__ - Step 115159: {'lr': 6.531381965769392e-05, 'samples': 22110528, 'steps': 115158, 'loss/train': 1.202832579612732} 11/07/2021 13:25:15 - INFO - __main__ - Step 115160: {'lr': 6.531024303141678e-05, 'samples': 22110720, 'steps': 115159, 'loss/train': 1.007632851600647} 11/07/2021 13:25:16 - INFO - __main__ - Step 115161: {'lr': 6.530666648835801e-05, 'samples': 22110912, 'steps': 115160, 'loss/train': 1.5354713201522827} 11/07/2021 13:25:16 - INFO - __main__ - Step 115162: {'lr': 6.530309002851917e-05, 'samples': 22111104, 'steps': 115161, 'loss/train': 1.1866369247436523} 11/07/2021 13:25:16 - INFO - __main__ - Step 115163: {'lr': 6.529951365190195e-05, 'samples': 22111296, 'steps': 115162, 'loss/train': 1.5480420589447021} 11/07/2021 13:25:17 - INFO - __main__ - Step 115164: {'lr': 6.529593735850789e-05, 'samples': 22111488, 'steps': 115163, 'loss/train': 1.5796316862106323} 11/07/2021 13:25:18 - INFO - __main__ - Step 115165: {'lr': 6.529236114833864e-05, 'samples': 22111680, 'steps': 115164, 'loss/train': 1.4617528915405273} 11/07/2021 13:25:18 - INFO - __main__ - Step 115166: {'lr': 6.528878502139582e-05, 'samples': 22111872, 'steps': 115165, 'loss/train': 1.3571668863296509} 11/07/2021 13:25:18 - INFO - __main__ - Step 115167: {'lr': 6.528520897768101e-05, 'samples': 22112064, 'steps': 115166, 'loss/train': 1.1977590322494507} 11/07/2021 13:25:19 - INFO - __main__ - Step 115168: {'lr': 6.52816330171958e-05, 'samples': 22112256, 'steps': 115167, 'loss/train': 1.8403844833374023} 11/07/2021 13:25:19 - INFO - __main__ - Step 115169: {'lr': 6.527805713994189e-05, 'samples': 22112448, 'steps': 115168, 'loss/train': 1.3652700185775757} 11/07/2021 13:25:20 - INFO - __main__ - Step 115170: {'lr': 6.527448134592082e-05, 'samples': 22112640, 'steps': 115169, 'loss/train': 1.4332776069641113} 11/07/2021 13:25:21 - INFO - __main__ - Step 115171: {'lr': 6.527090563513419e-05, 'samples': 22112832, 'steps': 115170, 'loss/train': 0.9522882103919983} 11/07/2021 13:25:21 - INFO - __main__ - Step 115172: {'lr': 6.526733000758368e-05, 'samples': 22113024, 'steps': 115171, 'loss/train': 0.7535358667373657} 11/07/2021 13:25:21 - INFO - __main__ - Step 115173: {'lr': 6.52637544632709e-05, 'samples': 22113216, 'steps': 115172, 'loss/train': 1.3198566436767578} 11/07/2021 13:25:22 - INFO - __main__ - Step 115174: {'lr': 6.526017900219738e-05, 'samples': 22113408, 'steps': 115173, 'loss/train': 0.8013850450515747} 11/07/2021 13:25:23 - INFO - __main__ - Step 115175: {'lr': 6.525660362436475e-05, 'samples': 22113600, 'steps': 115174, 'loss/train': 1.7144705057144165} 11/07/2021 13:25:23 - INFO - __main__ - Step 115176: {'lr': 6.525302832977465e-05, 'samples': 22113792, 'steps': 115175, 'loss/train': 1.5082019567489624} 11/07/2021 13:25:23 - INFO - __main__ - Step 115177: {'lr': 6.524945311842867e-05, 'samples': 22113984, 'steps': 115176, 'loss/train': 1.3027002811431885} 11/07/2021 13:25:24 - INFO - __main__ - Step 115178: {'lr': 6.524587799032846e-05, 'samples': 22114176, 'steps': 115177, 'loss/train': 1.2895119190216064} 11/07/2021 13:25:24 - INFO - __main__ - Step 115179: {'lr': 6.524230294547559e-05, 'samples': 22114368, 'steps': 115178, 'loss/train': 1.4055477380752563} 11/07/2021 13:25:25 - INFO - __main__ - Step 115180: {'lr': 6.52387279838717e-05, 'samples': 22114560, 'steps': 115179, 'loss/train': 1.2481276988983154} 11/07/2021 13:25:25 - INFO - __main__ - Step 115181: {'lr': 6.52351531055184e-05, 'samples': 22114752, 'steps': 115180, 'loss/train': 1.4509257078170776} 11/07/2021 13:25:26 - INFO - __main__ - Step 115182: {'lr': 6.523157831041727e-05, 'samples': 22114944, 'steps': 115181, 'loss/train': 1.612001657485962} 11/07/2021 13:25:26 - INFO - __main__ - Step 115183: {'lr': 6.522800359856992e-05, 'samples': 22115136, 'steps': 115182, 'loss/train': 1.0597373247146606} 11/07/2021 13:25:26 - INFO - __main__ - Step 115184: {'lr': 6.522442896997801e-05, 'samples': 22115328, 'steps': 115183, 'loss/train': 1.6124860048294067} 11/07/2021 13:25:28 - INFO - __main__ - Step 115185: {'lr': 6.52208544246431e-05, 'samples': 22115520, 'steps': 115184, 'loss/train': 1.175419569015503} 11/07/2021 13:25:28 - INFO - __main__ - Step 115186: {'lr': 6.52172799625669e-05, 'samples': 22115712, 'steps': 115185, 'loss/train': 1.0650733709335327} 11/07/2021 13:25:28 - INFO - __main__ - Step 115187: {'lr': 6.521370558375089e-05, 'samples': 22115904, 'steps': 115186, 'loss/train': 1.1464147567749023} 11/07/2021 13:25:29 - INFO - __main__ - Step 115188: {'lr': 6.521013128819673e-05, 'samples': 22116096, 'steps': 115187, 'loss/train': 0.6910331845283508} 11/07/2021 13:25:29 - INFO - __main__ - Step 115189: {'lr': 6.5206557075906e-05, 'samples': 22116288, 'steps': 115188, 'loss/train': 1.4899377822875977} 11/07/2021 13:25:30 - INFO - __main__ - Step 115190: {'lr': 6.520298294688037e-05, 'samples': 22116480, 'steps': 115189, 'loss/train': 1.080623745918274} 11/07/2021 13:25:31 - INFO - __main__ - Step 115191: {'lr': 6.519940890112141e-05, 'samples': 22116672, 'steps': 115190, 'loss/train': 1.222895860671997} 11/07/2021 13:25:31 - INFO - __main__ - Step 115192: {'lr': 6.519583493863077e-05, 'samples': 22116864, 'steps': 115191, 'loss/train': 1.6092385053634644} 11/07/2021 13:25:31 - INFO - __main__ - Step 115193: {'lr': 6.519226105941003e-05, 'samples': 22117056, 'steps': 115192, 'loss/train': 1.3794691562652588} 11/07/2021 13:25:32 - INFO - __main__ - Step 115194: {'lr': 6.518868726346078e-05, 'samples': 22117248, 'steps': 115193, 'loss/train': 1.4615271091461182} 11/07/2021 13:25:32 - INFO - __main__ - Step 115195: {'lr': 6.518511355078468e-05, 'samples': 22117440, 'steps': 115194, 'loss/train': 1.3637205362319946} 11/07/2021 13:25:33 - INFO - __main__ - Step 115196: {'lr': 6.518153992138332e-05, 'samples': 22117632, 'steps': 115195, 'loss/train': 1.466386079788208} 11/07/2021 13:25:33 - INFO - __main__ - Step 115197: {'lr': 6.517796637525827e-05, 'samples': 22117824, 'steps': 115196, 'loss/train': 1.2103607654571533} 11/07/2021 13:25:34 - INFO - __main__ - Step 115198: {'lr': 6.517439291241121e-05, 'samples': 22118016, 'steps': 115197, 'loss/train': 1.2387839555740356} 11/07/2021 13:25:34 - INFO - __main__ - Step 115199: {'lr': 6.51708195328437e-05, 'samples': 22118208, 'steps': 115198, 'loss/train': 1.6446795463562012} 11/07/2021 13:25:34 - INFO - __main__ - Step 115200: {'lr': 6.516724623655745e-05, 'samples': 22118400, 'steps': 115199, 'loss/train': 1.1665271520614624} 11/07/2021 13:25:36 - INFO - __main__ - Step 115201: {'lr': 6.516367302355392e-05, 'samples': 22118592, 'steps': 115200, 'loss/train': 1.6758277416229248} 11/07/2021 13:25:36 - INFO - __main__ - Step 115202: {'lr': 6.516009989383476e-05, 'samples': 22118784, 'steps': 115201, 'loss/train': 2.5804972648620605} 11/07/2021 13:25:36 - INFO - __main__ - Step 115203: {'lr': 6.515652684740164e-05, 'samples': 22118976, 'steps': 115202, 'loss/train': 1.583508849143982} 11/07/2021 13:25:37 - INFO - __main__ - Step 115204: {'lr': 6.51529538842561e-05, 'samples': 22119168, 'steps': 115203, 'loss/train': 1.08511483669281} 11/07/2021 13:25:37 - INFO - __main__ - Step 115205: {'lr': 6.514938100439982e-05, 'samples': 22119360, 'steps': 115204, 'loss/train': 1.8187943696975708} 11/07/2021 13:25:37 - INFO - __main__ - Step 115206: {'lr': 6.514580820783436e-05, 'samples': 22119552, 'steps': 115205, 'loss/train': 1.066774606704712} 11/07/2021 13:25:38 - INFO - __main__ - Step 115207: {'lr': 6.514223549456136e-05, 'samples': 22119744, 'steps': 115206, 'loss/train': 0.8354218602180481} 11/07/2021 13:25:39 - INFO - __main__ - Step 115208: {'lr': 6.51386628645824e-05, 'samples': 22119936, 'steps': 115207, 'loss/train': 1.3138774633407593} 11/07/2021 13:25:39 - INFO - __main__ - Step 115209: {'lr': 6.513509031789911e-05, 'samples': 22120128, 'steps': 115208, 'loss/train': 1.0249247550964355} 11/07/2021 13:25:39 - INFO - __main__ - Step 115210: {'lr': 6.51315178545131e-05, 'samples': 22120320, 'steps': 115209, 'loss/train': 1.237687349319458} 11/07/2021 13:25:40 - INFO - __main__ - Step 115211: {'lr': 6.512794547442597e-05, 'samples': 22120512, 'steps': 115210, 'loss/train': 1.3412195444107056} 11/07/2021 13:25:41 - INFO - __main__ - Step 115212: {'lr': 6.512437317763934e-05, 'samples': 22120704, 'steps': 115211, 'loss/train': 1.0266022682189941} 11/07/2021 13:25:41 - INFO - __main__ - Step 115213: {'lr': 6.512080096415488e-05, 'samples': 22120896, 'steps': 115212, 'loss/train': 1.8688000440597534} 11/07/2021 13:25:42 - INFO - __main__ - Step 115214: {'lr': 6.511722883397406e-05, 'samples': 22121088, 'steps': 115213, 'loss/train': 0.9909967184066772} 11/07/2021 13:25:42 - INFO - __main__ - Step 115215: {'lr': 6.511365678709857e-05, 'samples': 22121280, 'steps': 115214, 'loss/train': 1.3093359470367432} 11/07/2021 13:25:42 - INFO - __main__ - Step 115216: {'lr': 6.511008482353001e-05, 'samples': 22121472, 'steps': 115215, 'loss/train': 1.0157517194747925} 11/07/2021 13:25:43 - INFO - __main__ - Step 115217: {'lr': 6.510651294327e-05, 'samples': 22121664, 'steps': 115216, 'loss/train': 1.7526835203170776} 11/07/2021 13:25:44 - INFO - __main__ - Step 115218: {'lr': 6.510294114632015e-05, 'samples': 22121856, 'steps': 115217, 'loss/train': 1.5193853378295898} 11/07/2021 13:25:44 - INFO - __main__ - Step 115219: {'lr': 6.509936943268205e-05, 'samples': 22122048, 'steps': 115218, 'loss/train': 1.4247798919677734} 11/07/2021 13:25:44 - INFO - __main__ - Step 115220: {'lr': 6.50957978023573e-05, 'samples': 22122240, 'steps': 115219, 'loss/train': 1.5904840230941772} 11/07/2021 13:25:45 - INFO - __main__ - Step 115221: {'lr': 6.509222625534755e-05, 'samples': 22122432, 'steps': 115220, 'loss/train': 1.0299288034439087} 11/07/2021 13:25:45 - INFO - __main__ - Step 115222: {'lr': 6.508865479165441e-05, 'samples': 22122624, 'steps': 115221, 'loss/train': 0.9380496740341187} 11/07/2021 13:25:46 - INFO - __main__ - Step 115223: {'lr': 6.508508341127945e-05, 'samples': 22122816, 'steps': 115222, 'loss/train': 0.32298505306243896} 11/07/2021 13:25:47 - INFO - __main__ - Step 115224: {'lr': 6.508151211422427e-05, 'samples': 22123008, 'steps': 115223, 'loss/train': 0.4139179587364197} 11/07/2021 13:25:47 - INFO - __main__ - Step 115225: {'lr': 6.507794090049055e-05, 'samples': 22123200, 'steps': 115224, 'loss/train': 1.2207125425338745} 11/07/2021 13:25:47 - INFO - __main__ - Step 115226: {'lr': 6.507436977007985e-05, 'samples': 22123392, 'steps': 115225, 'loss/train': 1.2323179244995117} 11/07/2021 13:25:48 - INFO - __main__ - Step 115227: {'lr': 6.507079872299384e-05, 'samples': 22123584, 'steps': 115226, 'loss/train': 1.4905539751052856} 11/07/2021 13:25:49 - INFO - __main__ - Step 115228: {'lr': 6.506722775923402e-05, 'samples': 22123776, 'steps': 115227, 'loss/train': 1.502442717552185} 11/07/2021 13:25:49 - INFO - __main__ - Step 115229: {'lr': 6.506365687880203e-05, 'samples': 22123968, 'steps': 115228, 'loss/train': 1.6638619899749756} 11/07/2021 13:25:50 - INFO - __main__ - Step 115230: {'lr': 6.506008608169953e-05, 'samples': 22124160, 'steps': 115229, 'loss/train': 1.344157099723816} 11/07/2021 13:25:50 - INFO - __main__ - Step 115231: {'lr': 6.505651536792808e-05, 'samples': 22124352, 'steps': 115230, 'loss/train': 0.9447053670883179} 11/07/2021 13:25:50 - INFO - __main__ - Step 115232: {'lr': 6.505294473748932e-05, 'samples': 22124544, 'steps': 115231, 'loss/train': 0.9321936368942261} 11/07/2021 13:25:51 - INFO - __main__ - Step 115233: {'lr': 6.504937419038485e-05, 'samples': 22124736, 'steps': 115232, 'loss/train': 0.5493776798248291} 11/07/2021 13:25:52 - INFO - __main__ - Step 115234: {'lr': 6.504580372661628e-05, 'samples': 22124928, 'steps': 115233, 'loss/train': 1.7630141973495483} 11/07/2021 13:25:52 - INFO - __main__ - Step 115235: {'lr': 6.50422333461852e-05, 'samples': 22125120, 'steps': 115234, 'loss/train': 0.7167105078697205} 11/07/2021 13:25:52 - INFO - __main__ - Step 115236: {'lr': 6.503866304909326e-05, 'samples': 22125312, 'steps': 115235, 'loss/train': 2.1132795810699463} 11/07/2021 13:25:53 - INFO - __main__ - Step 115237: {'lr': 6.503509283534204e-05, 'samples': 22125504, 'steps': 115236, 'loss/train': 1.3210375308990479} 11/07/2021 13:25:54 - INFO - __main__ - Step 115238: {'lr': 6.503152270493312e-05, 'samples': 22125696, 'steps': 115237, 'loss/train': 1.171970248222351} 11/07/2021 13:25:54 - INFO - __main__ - Step 115239: {'lr': 6.502795265786817e-05, 'samples': 22125888, 'steps': 115238, 'loss/train': 1.4815462827682495} 11/07/2021 13:25:54 - INFO - __main__ - Step 115240: {'lr': 6.502438269414884e-05, 'samples': 22126080, 'steps': 115239, 'loss/train': 1.127162218093872} 11/07/2021 13:25:55 - INFO - __main__ - Step 115241: {'lr': 6.502081281377661e-05, 'samples': 22126272, 'steps': 115240, 'loss/train': 0.7578745484352112} 11/07/2021 13:25:55 - INFO - __main__ - Step 115242: {'lr': 6.501724301675313e-05, 'samples': 22126464, 'steps': 115241, 'loss/train': 1.4423843622207642} 11/07/2021 13:25:56 - INFO - __main__ - Step 115243: {'lr': 6.501367330308003e-05, 'samples': 22126656, 'steps': 115242, 'loss/train': 1.3149563074111938} 11/07/2021 13:25:57 - INFO - __main__ - Step 115244: {'lr': 6.501010367275892e-05, 'samples': 22126848, 'steps': 115243, 'loss/train': 0.9555005431175232} 11/07/2021 13:25:57 - INFO - __main__ - Step 115245: {'lr': 6.500653412579139e-05, 'samples': 22127040, 'steps': 115244, 'loss/train': 1.5837819576263428} 11/07/2021 13:25:57 - INFO - __main__ - Step 115246: {'lr': 6.500296466217906e-05, 'samples': 22127232, 'steps': 115245, 'loss/train': 1.5976896286010742} 11/07/2021 13:25:58 - INFO - __main__ - Step 115247: {'lr': 6.499939528192356e-05, 'samples': 22127424, 'steps': 115246, 'loss/train': 1.552871823310852} 11/07/2021 13:25:58 - INFO - __main__ - Step 115248: {'lr': 6.499582598502645e-05, 'samples': 22127616, 'steps': 115247, 'loss/train': 1.5842511653900146} 11/07/2021 13:25:59 - INFO - __main__ - Step 115249: {'lr': 6.49922567714894e-05, 'samples': 22127808, 'steps': 115248, 'loss/train': 1.3723493814468384} 11/07/2021 13:26:00 - INFO - __main__ - Step 115250: {'lr': 6.498868764131396e-05, 'samples': 22128000, 'steps': 115249, 'loss/train': 1.1105878353118896} 11/07/2021 13:26:00 - INFO - __main__ - Step 115251: {'lr': 6.498511859450176e-05, 'samples': 22128192, 'steps': 115250, 'loss/train': 1.0650054216384888} 11/07/2021 13:26:00 - INFO - __main__ - Step 115252: {'lr': 6.498154963105441e-05, 'samples': 22128384, 'steps': 115251, 'loss/train': 1.2914243936538696} 11/07/2021 13:26:01 - INFO - __main__ - Step 115253: {'lr': 6.497798075097361e-05, 'samples': 22128576, 'steps': 115252, 'loss/train': 1.5585130453109741} 11/07/2021 13:26:02 - INFO - __main__ - Step 115254: {'lr': 6.497441195426079e-05, 'samples': 22128768, 'steps': 115253, 'loss/train': 1.4885566234588623} 11/07/2021 13:26:02 - INFO - __main__ - Step 115255: {'lr': 6.497084324091765e-05, 'samples': 22128960, 'steps': 115254, 'loss/train': 1.3614017963409424} 11/07/2021 13:26:02 - INFO - __main__ - Step 115256: {'lr': 6.496727461094579e-05, 'samples': 22129152, 'steps': 115255, 'loss/train': 1.2531076669692993} 11/07/2021 13:26:03 - INFO - __main__ - Step 115257: {'lr': 6.496370606434682e-05, 'samples': 22129344, 'steps': 115256, 'loss/train': 1.383999228477478} 11/07/2021 13:26:03 - INFO - __main__ - Step 115258: {'lr': 6.496013760112235e-05, 'samples': 22129536, 'steps': 115257, 'loss/train': 1.8189029693603516} 11/07/2021 13:26:04 - INFO - __main__ - Step 115259: {'lr': 6.495656922127399e-05, 'samples': 22129728, 'steps': 115258, 'loss/train': 1.2461787462234497} 11/07/2021 13:26:05 - INFO - __main__ - Step 115260: {'lr': 6.495300092480332e-05, 'samples': 22129920, 'steps': 115259, 'loss/train': 0.9930471181869507} 11/07/2021 13:26:05 - INFO - __main__ - Step 115261: {'lr': 6.494943271171202e-05, 'samples': 22130112, 'steps': 115260, 'loss/train': 1.6124465465545654} 11/07/2021 13:26:05 - INFO - __main__ - Step 115262: {'lr': 6.494586458200161e-05, 'samples': 22130304, 'steps': 115261, 'loss/train': 1.1063206195831299} 11/07/2021 13:26:06 - INFO - __main__ - Step 115263: {'lr': 6.494229653567377e-05, 'samples': 22130496, 'steps': 115262, 'loss/train': 1.12002694606781} 11/07/2021 13:26:06 - INFO - __main__ - Step 115264: {'lr': 6.493872857273012e-05, 'samples': 22130688, 'steps': 115263, 'loss/train': 1.168875813484192} 11/07/2021 13:26:08 - INFO - __main__ - Step 115265: {'lr': 6.493516069317218e-05, 'samples': 22130880, 'steps': 115264, 'loss/train': 1.0616744756698608} 11/07/2021 13:26:08 - INFO - __main__ - Step 115266: {'lr': 6.493159289700157e-05, 'samples': 22131072, 'steps': 115265, 'loss/train': 1.5745192766189575} 11/07/2021 13:26:09 - INFO - __main__ - Step 115267: {'lr': 6.492802518421994e-05, 'samples': 22131264, 'steps': 115266, 'loss/train': 1.1868036985397339} 11/07/2021 13:26:09 - INFO - __main__ - Step 115268: {'lr': 6.49244575548289e-05, 'samples': 22131456, 'steps': 115267, 'loss/train': 1.5100284814834595} 11/07/2021 13:26:09 - INFO - __main__ - Step 115269: {'lr': 6.492089000883e-05, 'samples': 22131648, 'steps': 115268, 'loss/train': 1.4126020669937134} 11/07/2021 13:26:10 - INFO - __main__ - Step 115270: {'lr': 6.491732254622493e-05, 'samples': 22131840, 'steps': 115269, 'loss/train': 1.2656550407409668} 11/07/2021 13:26:10 - INFO - __main__ - Step 115271: {'lr': 6.491375516701526e-05, 'samples': 22132032, 'steps': 115270, 'loss/train': 1.5450888872146606} 11/07/2021 13:26:11 - INFO - __main__ - Step 115272: {'lr': 6.491018787120259e-05, 'samples': 22132224, 'steps': 115271, 'loss/train': 1.742723822593689} 11/07/2021 13:26:11 - INFO - __main__ - Step 115273: {'lr': 6.490662065878853e-05, 'samples': 22132416, 'steps': 115272, 'loss/train': 1.4013843536376953} 11/07/2021 13:26:12 - INFO - __main__ - Step 115274: {'lr': 6.490305352977469e-05, 'samples': 22132608, 'steps': 115273, 'loss/train': 1.7367421388626099} 11/07/2021 13:26:12 - INFO - __main__ - Step 115275: {'lr': 6.489948648416274e-05, 'samples': 22132800, 'steps': 115274, 'loss/train': 1.5794994831085205} 11/07/2021 13:26:12 - INFO - __main__ - Step 115276: {'lr': 6.489591952195417e-05, 'samples': 22132992, 'steps': 115275, 'loss/train': 1.530930519104004} 11/07/2021 13:26:13 - INFO - __main__ - Step 115277: {'lr': 6.489235264315063e-05, 'samples': 22133184, 'steps': 115276, 'loss/train': 2.243126153945923} 11/07/2021 13:26:14 - INFO - __main__ - Step 115278: {'lr': 6.488878584775374e-05, 'samples': 22133376, 'steps': 115277, 'loss/train': 1.5090402364730835} 11/07/2021 13:26:14 - INFO - __main__ - Step 115279: {'lr': 6.488521913576512e-05, 'samples': 22133568, 'steps': 115278, 'loss/train': 1.3082749843597412} 11/07/2021 13:26:14 - INFO - __main__ - Step 115280: {'lr': 6.488165250718634e-05, 'samples': 22133760, 'steps': 115279, 'loss/train': 0.9504750967025757} 11/07/2021 13:26:15 - INFO - __main__ - Step 115281: {'lr': 6.487808596201905e-05, 'samples': 22133952, 'steps': 115280, 'loss/train': 1.3766906261444092} 11/07/2021 13:26:15 - INFO - __main__ - Step 115282: {'lr': 6.487451950026482e-05, 'samples': 22134144, 'steps': 115281, 'loss/train': 1.427403450012207} 11/07/2021 13:26:16 - INFO - __main__ - Step 115283: {'lr': 6.487095312192529e-05, 'samples': 22134336, 'steps': 115282, 'loss/train': 1.388016700744629} 11/07/2021 13:26:17 - INFO - __main__ - Step 115284: {'lr': 6.486738682700204e-05, 'samples': 22134528, 'steps': 115283, 'loss/train': 1.1327201128005981} 11/07/2021 13:26:17 - INFO - __main__ - Step 115285: {'lr': 6.486382061549673e-05, 'samples': 22134720, 'steps': 115284, 'loss/train': 1.13971745967865} 11/07/2021 13:26:17 - INFO - __main__ - Step 115286: {'lr': 6.486025448741095e-05, 'samples': 22134912, 'steps': 115285, 'loss/train': 2.24934458732605} 11/07/2021 13:26:18 - INFO - __main__ - Step 115287: {'lr': 6.485668844274623e-05, 'samples': 22135104, 'steps': 115286, 'loss/train': 1.4816782474517822} 11/07/2021 13:26:19 - INFO - __main__ - Step 115288: {'lr': 6.485312248150421e-05, 'samples': 22135296, 'steps': 115287, 'loss/train': 0.6405913233757019} 11/07/2021 13:26:20 - INFO - __main__ - Step 115289: {'lr': 6.484955660368655e-05, 'samples': 22135488, 'steps': 115288, 'loss/train': 1.2868441343307495} 11/07/2021 13:26:20 - INFO - __main__ - Step 115290: {'lr': 6.484599080929479e-05, 'samples': 22135680, 'steps': 115289, 'loss/train': 2.1577820777893066} 11/07/2021 13:26:20 - INFO - __main__ - Step 115291: {'lr': 6.48424250983306e-05, 'samples': 22135872, 'steps': 115290, 'loss/train': 1.3399287462234497} 11/07/2021 13:26:21 - INFO - __main__ - Step 115292: {'lr': 6.483885947079554e-05, 'samples': 22136064, 'steps': 115291, 'loss/train': 1.423654556274414} 11/07/2021 13:26:21 - INFO - __main__ - Step 115293: {'lr': 6.483529392669121e-05, 'samples': 22136256, 'steps': 115292, 'loss/train': 1.0811035633087158} 11/07/2021 13:26:21 - INFO - __main__ - Step 115294: {'lr': 6.483172846601928e-05, 'samples': 22136448, 'steps': 115293, 'loss/train': 0.6253234148025513} 11/07/2021 13:26:23 - INFO - __main__ - Step 115295: {'lr': 6.482816308878129e-05, 'samples': 22136640, 'steps': 115294, 'loss/train': 0.812944233417511} 11/07/2021 13:26:23 - INFO - __main__ - Step 115296: {'lr': 6.482459779497887e-05, 'samples': 22136832, 'steps': 115295, 'loss/train': 1.6132252216339111} 11/07/2021 13:26:24 - INFO - __main__ - Step 115297: {'lr': 6.482103258461373e-05, 'samples': 22137024, 'steps': 115296, 'loss/train': 1.3961167335510254} 11/07/2021 13:26:24 - INFO - __main__ - Step 115298: {'lr': 6.481746745768729e-05, 'samples': 22137216, 'steps': 115297, 'loss/train': 1.4045931100845337} 11/07/2021 13:26:24 - INFO - __main__ - Step 115299: {'lr': 6.481390241420123e-05, 'samples': 22137408, 'steps': 115298, 'loss/train': 0.6718412041664124} 11/07/2021 13:26:25 - INFO - __main__ - Step 115300: {'lr': 6.481033745415719e-05, 'samples': 22137600, 'steps': 115299, 'loss/train': 1.2920106649398804} 11/07/2021 13:26:26 - INFO - __main__ - Step 115301: {'lr': 6.480677257755671e-05, 'samples': 22137792, 'steps': 115300, 'loss/train': 0.9045752286911011} 11/07/2021 13:26:26 - INFO - __main__ - Step 115302: {'lr': 6.48032077844015e-05, 'samples': 22137984, 'steps': 115301, 'loss/train': 1.2126301527023315} 11/07/2021 13:26:26 - INFO - __main__ - Step 115303: {'lr': 6.479964307469305e-05, 'samples': 22138176, 'steps': 115302, 'loss/train': 1.5079180002212524} 11/07/2021 13:26:27 - INFO - __main__ - Step 115304: {'lr': 6.479607844843305e-05, 'samples': 22138368, 'steps': 115303, 'loss/train': 1.60152268409729} 11/07/2021 13:26:27 - INFO - __main__ - Step 115305: {'lr': 6.479251390562308e-05, 'samples': 22138560, 'steps': 115304, 'loss/train': 1.6837043762207031} 11/07/2021 13:26:28 - INFO - __main__ - Step 115306: {'lr': 6.478894944626474e-05, 'samples': 22138752, 'steps': 115305, 'loss/train': 1.3043419122695923} 11/07/2021 13:26:28 - INFO - __main__ - Step 115307: {'lr': 6.478538507035964e-05, 'samples': 22138944, 'steps': 115306, 'loss/train': 1.4380789995193481} 11/07/2021 13:26:29 - INFO - __main__ - Step 115308: {'lr': 6.478182077790948e-05, 'samples': 22139136, 'steps': 115307, 'loss/train': 1.0806877613067627} 11/07/2021 13:26:29 - INFO - __main__ - Step 115309: {'lr': 6.477825656891567e-05, 'samples': 22139328, 'steps': 115308, 'loss/train': 1.0089865922927856} 11/07/2021 13:26:30 - INFO - __main__ - Step 115310: {'lr': 6.477469244337994e-05, 'samples': 22139520, 'steps': 115309, 'loss/train': 1.0550967454910278} 11/07/2021 13:26:30 - INFO - __main__ - Step 115311: {'lr': 6.477112840130387e-05, 'samples': 22139712, 'steps': 115310, 'loss/train': 1.2156028747558594} 11/07/2021 13:26:32 - INFO - __main__ - Step 115312: {'lr': 6.476756444268908e-05, 'samples': 22139904, 'steps': 115311, 'loss/train': 1.322425127029419} 11/07/2021 13:26:32 - INFO - __main__ - Step 115313: {'lr': 6.476400056753715e-05, 'samples': 22140096, 'steps': 115312, 'loss/train': 2.0881927013397217} 11/07/2021 13:26:33 - INFO - __main__ - Step 115314: {'lr': 6.476043677584972e-05, 'samples': 22140288, 'steps': 115313, 'loss/train': 1.6276476383209229} 11/07/2021 13:26:33 - INFO - __main__ - Step 115315: {'lr': 6.475687306762837e-05, 'samples': 22140480, 'steps': 115314, 'loss/train': 1.6458135843276978} 11/07/2021 13:26:33 - INFO - __main__ - Step 115316: {'lr': 6.475330944287472e-05, 'samples': 22140672, 'steps': 115315, 'loss/train': 1.6089963912963867} 11/07/2021 13:26:34 - INFO - __main__ - Step 115317: {'lr': 6.474974590159036e-05, 'samples': 22140864, 'steps': 115316, 'loss/train': 1.647676706314087} 11/07/2021 13:26:34 - INFO - __main__ - Step 115318: {'lr': 6.474618244377689e-05, 'samples': 22141056, 'steps': 115317, 'loss/train': 1.495574712753296} 11/07/2021 13:26:34 - INFO - __main__ - Step 115319: {'lr': 6.474261906943596e-05, 'samples': 22141248, 'steps': 115318, 'loss/train': 0.8746519684791565} 11/07/2021 13:26:35 - INFO - __main__ - Step 115320: {'lr': 6.473905577856915e-05, 'samples': 22141440, 'steps': 115319, 'loss/train': 0.8671051859855652} 11/07/2021 13:26:36 - INFO - __main__ - Step 115321: {'lr': 6.473549257117811e-05, 'samples': 22141632, 'steps': 115320, 'loss/train': 1.4496562480926514} 11/07/2021 13:26:36 - INFO - __main__ - Step 115322: {'lr': 6.473192944726437e-05, 'samples': 22141824, 'steps': 115321, 'loss/train': 1.1548407077789307} 11/07/2021 13:26:36 - INFO - __main__ - Step 115323: {'lr': 6.472836640682953e-05, 'samples': 22142016, 'steps': 115322, 'loss/train': 1.328809142112732} 11/07/2021 13:26:37 - INFO - __main__ - Step 115324: {'lr': 6.472480344987522e-05, 'samples': 22142208, 'steps': 115323, 'loss/train': 1.5221647024154663} 11/07/2021 13:26:38 - INFO - __main__ - Step 115325: {'lr': 6.472124057640308e-05, 'samples': 22142400, 'steps': 115324, 'loss/train': 1.3393259048461914} 11/07/2021 13:26:38 - INFO - __main__ - Step 115326: {'lr': 6.471767778641466e-05, 'samples': 22142592, 'steps': 115325, 'loss/train': 0.9129831790924072} 11/07/2021 13:26:39 - INFO - __main__ - Step 115327: {'lr': 6.471411507991163e-05, 'samples': 22142784, 'steps': 115326, 'loss/train': 1.408302903175354} 11/07/2021 13:26:39 - INFO - __main__ - Step 115328: {'lr': 6.471055245689553e-05, 'samples': 22142976, 'steps': 115327, 'loss/train': 0.9337659478187561} 11/07/2021 13:26:39 - INFO - __main__ - Step 115329: {'lr': 6.470698991736801e-05, 'samples': 22143168, 'steps': 115328, 'loss/train': 1.1517329216003418} 11/07/2021 13:26:40 - INFO - __main__ - Step 115330: {'lr': 6.470342746133068e-05, 'samples': 22143360, 'steps': 115329, 'loss/train': 1.4573042392730713} 11/07/2021 13:26:41 - INFO - __main__ - Step 115331: {'lr': 6.469986508878508e-05, 'samples': 22143552, 'steps': 115330, 'loss/train': 1.4381804466247559} 11/07/2021 13:26:41 - INFO - __main__ - Step 115332: {'lr': 6.46963027997329e-05, 'samples': 22143744, 'steps': 115331, 'loss/train': 1.6564464569091797} 11/07/2021 13:26:41 - INFO - __main__ - Step 115333: {'lr': 6.46927405941757e-05, 'samples': 22143936, 'steps': 115332, 'loss/train': 0.4969753623008728} 11/07/2021 13:26:42 - INFO - __main__ - Step 115334: {'lr': 6.468917847211517e-05, 'samples': 22144128, 'steps': 115333, 'loss/train': 1.3046460151672363} 11/07/2021 13:26:42 - INFO - __main__ - Step 115335: {'lr': 6.468561643355276e-05, 'samples': 22144320, 'steps': 115334, 'loss/train': 1.593543291091919} 11/07/2021 13:26:43 - INFO - __main__ - Step 115336: {'lr': 6.468205447849012e-05, 'samples': 22144512, 'steps': 115335, 'loss/train': 1.8688095808029175} 11/07/2021 13:26:44 - INFO - __main__ - Step 115337: {'lr': 6.467849260692893e-05, 'samples': 22144704, 'steps': 115336, 'loss/train': 1.5832350254058838} 11/07/2021 13:26:44 - INFO - __main__ - Step 115338: {'lr': 6.467493081887071e-05, 'samples': 22144896, 'steps': 115337, 'loss/train': 1.606640338897705} 11/07/2021 13:26:44 - INFO - __main__ - Step 115339: {'lr': 6.467136911431715e-05, 'samples': 22145088, 'steps': 115338, 'loss/train': 0.4933831989765167} 11/07/2021 13:26:45 - INFO - __main__ - Step 115340: {'lr': 6.466780749326978e-05, 'samples': 22145280, 'steps': 115339, 'loss/train': 0.7067137360572815} 11/07/2021 13:26:46 - INFO - __main__ - Step 115341: {'lr': 6.466424595573026e-05, 'samples': 22145472, 'steps': 115340, 'loss/train': 1.4675828218460083} 11/07/2021 13:26:46 - INFO - __main__ - Step 115342: {'lr': 6.466068450170015e-05, 'samples': 22145664, 'steps': 115341, 'loss/train': 1.0220065116882324} 11/07/2021 13:26:47 - INFO - __main__ - Step 115343: {'lr': 6.465712313118107e-05, 'samples': 22145856, 'steps': 115342, 'loss/train': 1.3556127548217773} 11/07/2021 13:26:47 - INFO - __main__ - Step 115344: {'lr': 6.465356184417465e-05, 'samples': 22146048, 'steps': 115343, 'loss/train': 1.200849175453186} 11/07/2021 13:26:47 - INFO - __main__ - Step 115345: {'lr': 6.465000064068247e-05, 'samples': 22146240, 'steps': 115344, 'loss/train': 1.363584041595459} 11/07/2021 13:26:48 - INFO - __main__ - Step 115346: {'lr': 6.464643952070614e-05, 'samples': 22146432, 'steps': 115345, 'loss/train': 1.518925428390503} 11/07/2021 13:26:49 - INFO - __main__ - Step 115347: {'lr': 6.464287848424727e-05, 'samples': 22146624, 'steps': 115346, 'loss/train': 1.7005023956298828} 11/07/2021 13:26:49 - INFO - __main__ - Step 115348: {'lr': 6.463931753130752e-05, 'samples': 22146816, 'steps': 115347, 'loss/train': 1.264565110206604} 11/07/2021 13:26:49 - INFO - __main__ - Step 115349: {'lr': 6.463575666188837e-05, 'samples': 22147008, 'steps': 115348, 'loss/train': 1.7483104467391968} 11/07/2021 13:26:50 - INFO - __main__ - Step 115350: {'lr': 6.463219587599148e-05, 'samples': 22147200, 'steps': 115349, 'loss/train': 1.4601327180862427} 11/07/2021 13:26:50 - INFO - __main__ - Step 115351: {'lr': 6.462863517361847e-05, 'samples': 22147392, 'steps': 115350, 'loss/train': 1.1782923936843872} 11/07/2021 13:26:51 - INFO - __main__ - Step 115352: {'lr': 6.462507455477092e-05, 'samples': 22147584, 'steps': 115351, 'loss/train': 1.2074912786483765} 11/07/2021 13:26:52 - INFO - __main__ - Step 115353: {'lr': 6.462151401945046e-05, 'samples': 22147776, 'steps': 115352, 'loss/train': 0.6665347814559937} 11/07/2021 13:26:52 - INFO - __main__ - Step 115354: {'lr': 6.461795356765868e-05, 'samples': 22147968, 'steps': 115353, 'loss/train': 1.7785483598709106} 11/07/2021 13:26:52 - INFO - __main__ - Step 115355: {'lr': 6.461439319939721e-05, 'samples': 22148160, 'steps': 115354, 'loss/train': 1.6292994022369385} 11/07/2021 13:26:53 - INFO - __main__ - Step 115356: {'lr': 6.461083291466762e-05, 'samples': 22148352, 'steps': 115355, 'loss/train': 1.253823161125183} 11/07/2021 13:26:53 - INFO - __main__ - Step 115357: {'lr': 6.460727271347153e-05, 'samples': 22148544, 'steps': 115356, 'loss/train': 0.6042273044586182} 11/07/2021 13:26:54 - INFO - __main__ - Step 115358: {'lr': 6.460371259581052e-05, 'samples': 22148736, 'steps': 115357, 'loss/train': 1.3668190240859985} 11/07/2021 13:26:54 - INFO - __main__ - Step 115359: {'lr': 6.460015256168625e-05, 'samples': 22148928, 'steps': 115358, 'loss/train': 1.3930456638336182} 11/07/2021 13:26:55 - INFO - __main__ - Step 115360: {'lr': 6.459659261110029e-05, 'samples': 22149120, 'steps': 115359, 'loss/train': 1.2975620031356812} 11/07/2021 13:26:55 - INFO - __main__ - Step 115361: {'lr': 6.459303274405429e-05, 'samples': 22149312, 'steps': 115360, 'loss/train': 0.9016289710998535} 11/07/2021 13:26:55 - INFO - __main__ - Step 115362: {'lr': 6.458947296054977e-05, 'samples': 22149504, 'steps': 115361, 'loss/train': 1.2544177770614624} 11/07/2021 13:26:56 - INFO - __main__ - Step 115363: {'lr': 6.458591326058832e-05, 'samples': 22149696, 'steps': 115362, 'loss/train': 0.7089571952819824} 11/07/2021 13:26:57 - INFO - __main__ - Step 115364: {'lr': 6.458235364417164e-05, 'samples': 22149888, 'steps': 115363, 'loss/train': 1.4235013723373413} 11/07/2021 13:26:57 - INFO - __main__ - Step 115365: {'lr': 6.457879411130127e-05, 'samples': 22150080, 'steps': 115364, 'loss/train': 0.43066099286079407} 11/07/2021 13:26:57 - INFO - __main__ - Step 115366: {'lr': 6.457523466197884e-05, 'samples': 22150272, 'steps': 115365, 'loss/train': 1.4399933815002441} 11/07/2021 13:26:58 - INFO - __main__ - Step 115367: {'lr': 6.457167529620597e-05, 'samples': 22150464, 'steps': 115366, 'loss/train': 1.597130298614502} 11/07/2021 13:26:59 - INFO - __main__ - Step 115368: {'lr': 6.456811601398421e-05, 'samples': 22150656, 'steps': 115367, 'loss/train': 1.2124040126800537} 11/07/2021 13:26:59 - INFO - __main__ - Step 115369: {'lr': 6.456455681531522e-05, 'samples': 22150848, 'steps': 115368, 'loss/train': 1.312838077545166} 11/07/2021 13:27:00 - INFO - __main__ - Step 115370: {'lr': 6.456099770020058e-05, 'samples': 22151040, 'steps': 115369, 'loss/train': 0.3636056184768677} 11/07/2021 13:27:00 - INFO - __main__ - Step 115371: {'lr': 6.455743866864186e-05, 'samples': 22151232, 'steps': 115370, 'loss/train': 1.4314926862716675} 11/07/2021 13:27:00 - INFO - __main__ - Step 115372: {'lr': 6.455387972064073e-05, 'samples': 22151424, 'steps': 115371, 'loss/train': 2.018904447555542} 11/07/2021 13:27:01 - INFO - __main__ - Step 115373: {'lr': 6.455032085619874e-05, 'samples': 22151616, 'steps': 115372, 'loss/train': 1.4519809484481812} 11/07/2021 13:27:02 - INFO - __main__ - Step 115374: {'lr': 6.454676207531751e-05, 'samples': 22151808, 'steps': 115373, 'loss/train': 0.8942020535469055} 11/07/2021 13:27:02 - INFO - __main__ - Step 115375: {'lr': 6.454320337799874e-05, 'samples': 22152000, 'steps': 115374, 'loss/train': 1.1496187448501587} 11/07/2021 13:27:03 - INFO - __main__ - Step 115376: {'lr': 6.453964476424387e-05, 'samples': 22152192, 'steps': 115375, 'loss/train': 1.359665870666504} 11/07/2021 13:27:03 - INFO - __main__ - Step 115377: {'lr': 6.453608623405454e-05, 'samples': 22152384, 'steps': 115376, 'loss/train': 1.4604796171188354} 11/07/2021 13:27:03 - INFO - __main__ - Step 115378: {'lr': 6.453252778743244e-05, 'samples': 22152576, 'steps': 115377, 'loss/train': 0.7008925080299377} 11/07/2021 13:27:04 - INFO - __main__ - Step 115379: {'lr': 6.452896942437909e-05, 'samples': 22152768, 'steps': 115378, 'loss/train': 1.2685580253601074} 11/07/2021 13:27:05 - INFO - __main__ - Step 115380: {'lr': 6.452541114489613e-05, 'samples': 22152960, 'steps': 115379, 'loss/train': 1.5449483394622803} 11/07/2021 13:27:05 - INFO - __main__ - Step 115381: {'lr': 6.452185294898514e-05, 'samples': 22153152, 'steps': 115380, 'loss/train': 0.9138295650482178} 11/07/2021 13:27:05 - INFO - __main__ - Step 115382: {'lr': 6.451829483664775e-05, 'samples': 22153344, 'steps': 115381, 'loss/train': 1.4112763404846191} 11/07/2021 13:27:06 - INFO - __main__ - Step 115383: {'lr': 6.451473680788555e-05, 'samples': 22153536, 'steps': 115382, 'loss/train': 1.2875444889068604} 11/07/2021 13:27:07 - INFO - __main__ - Step 115384: {'lr': 6.451117886270017e-05, 'samples': 22153728, 'steps': 115383, 'loss/train': 1.192428469657898} 11/07/2021 13:27:07 - INFO - __main__ - Step 115385: {'lr': 6.450762100109317e-05, 'samples': 22153920, 'steps': 115384, 'loss/train': 1.3781713247299194} 11/07/2021 13:27:07 - INFO - __main__ - Step 115386: {'lr': 6.450406322306618e-05, 'samples': 22154112, 'steps': 115385, 'loss/train': 0.8419159650802612} 11/07/2021 13:27:08 - INFO - __main__ - Step 115387: {'lr': 6.45005055286208e-05, 'samples': 22154304, 'steps': 115386, 'loss/train': 0.9842338562011719} 11/07/2021 13:27:08 - INFO - __main__ - Step 115388: {'lr': 6.44969479177587e-05, 'samples': 22154496, 'steps': 115387, 'loss/train': 1.1639103889465332} 11/07/2021 13:27:09 - INFO - __main__ - Step 115389: {'lr': 6.449339039048136e-05, 'samples': 22154688, 'steps': 115388, 'loss/train': 1.5799731016159058} 11/07/2021 13:27:10 - INFO - __main__ - Step 115390: {'lr': 6.44898329467904e-05, 'samples': 22154880, 'steps': 115389, 'loss/train': 1.180612564086914} 11/07/2021 13:27:10 - INFO - __main__ - Step 115391: {'lr': 6.448627558668748e-05, 'samples': 22155072, 'steps': 115390, 'loss/train': 1.0363692045211792} 11/07/2021 13:27:10 - INFO - __main__ - Step 115392: {'lr': 6.448271831017418e-05, 'samples': 22155264, 'steps': 115391, 'loss/train': 1.61077082157135} 11/07/2021 13:27:11 - INFO - __main__ - Step 115393: {'lr': 6.44791611172521e-05, 'samples': 22155456, 'steps': 115392, 'loss/train': 1.6394809484481812} 11/07/2021 13:27:11 - INFO - __main__ - Step 115394: {'lr': 6.447560400792286e-05, 'samples': 22155648, 'steps': 115393, 'loss/train': 1.437584638595581} 11/07/2021 13:27:12 - INFO - __main__ - Step 115395: {'lr': 6.447204698218803e-05, 'samples': 22155840, 'steps': 115394, 'loss/train': 1.288232445716858} 11/07/2021 13:27:13 - INFO - __main__ - Step 115396: {'lr': 6.446849004004924e-05, 'samples': 22156032, 'steps': 115395, 'loss/train': 1.145321249961853} 11/07/2021 13:27:13 - INFO - __main__ - Step 115397: {'lr': 6.446493318150809e-05, 'samples': 22156224, 'steps': 115396, 'loss/train': 1.1770888566970825} 11/07/2021 13:27:13 - INFO - __main__ - Step 115398: {'lr': 6.446137640656616e-05, 'samples': 22156416, 'steps': 115397, 'loss/train': 1.309821605682373} 11/07/2021 13:27:14 - INFO - __main__ - Step 115399: {'lr': 6.445781971522507e-05, 'samples': 22156608, 'steps': 115398, 'loss/train': 1.5093975067138672} 11/07/2021 13:27:15 - INFO - __main__ - Step 115400: {'lr': 6.445426310748644e-05, 'samples': 22156800, 'steps': 115399, 'loss/train': 1.300968885421753} 11/07/2021 13:27:15 - INFO - __main__ - Step 115401: {'lr': 6.445070658335195e-05, 'samples': 22156992, 'steps': 115400, 'loss/train': 1.3113961219787598} 11/07/2021 13:27:15 - INFO - __main__ - Step 115402: {'lr': 6.444715014282301e-05, 'samples': 22157184, 'steps': 115401, 'loss/train': 2.1187961101531982} 11/07/2021 13:27:16 - INFO - __main__ - Step 115403: {'lr': 6.444359378590131e-05, 'samples': 22157376, 'steps': 115402, 'loss/train': 1.512392520904541} 11/07/2021 13:27:16 - INFO - __main__ - Step 115404: {'lr': 6.444003751258848e-05, 'samples': 22157568, 'steps': 115403, 'loss/train': 1.4600412845611572} 11/07/2021 13:27:17 - INFO - __main__ - Step 115405: {'lr': 6.44364813228861e-05, 'samples': 22157760, 'steps': 115404, 'loss/train': 1.6099812984466553} 11/07/2021 13:27:18 - INFO - __main__ - Step 115406: {'lr': 6.443292521679578e-05, 'samples': 22157952, 'steps': 115405, 'loss/train': 1.3300875425338745} 11/07/2021 13:27:18 - INFO - __main__ - Step 115407: {'lr': 6.442936919431913e-05, 'samples': 22158144, 'steps': 115406, 'loss/train': 0.9736981391906738} 11/07/2021 13:27:18 - INFO - __main__ - Step 115408: {'lr': 6.442581325545774e-05, 'samples': 22158336, 'steps': 115407, 'loss/train': 0.9444044232368469} 11/07/2021 13:27:19 - INFO - __main__ - Step 115409: {'lr': 6.44222574002132e-05, 'samples': 22158528, 'steps': 115408, 'loss/train': 1.6490297317504883} 11/07/2021 13:27:20 - INFO - __main__ - Step 115410: {'lr': 6.441870162858714e-05, 'samples': 22158720, 'steps': 115409, 'loss/train': 1.323272466659546} 11/07/2021 13:27:20 - INFO - __main__ - Step 115411: {'lr': 6.441514594058115e-05, 'samples': 22158912, 'steps': 115410, 'loss/train': 1.136725664138794} 11/07/2021 13:27:20 - INFO - __main__ - Step 115412: {'lr': 6.441159033619681e-05, 'samples': 22159104, 'steps': 115411, 'loss/train': 1.1770190000534058} 11/07/2021 13:27:21 - INFO - __main__ - Step 115413: {'lr': 6.440803481543578e-05, 'samples': 22159296, 'steps': 115412, 'loss/train': 0.7459855079650879} 11/07/2021 13:27:21 - INFO - __main__ - Step 115414: {'lr': 6.44044793782996e-05, 'samples': 22159488, 'steps': 115413, 'loss/train': 1.3393197059631348} 11/07/2021 13:27:22 - INFO - __main__ - Step 115415: {'lr': 6.440092402478997e-05, 'samples': 22159680, 'steps': 115414, 'loss/train': 1.5064364671707153} 11/07/2021 13:27:23 - INFO - __main__ - Step 115416: {'lr': 6.439736875490836e-05, 'samples': 22159872, 'steps': 115415, 'loss/train': 0.885315477848053} 11/07/2021 13:27:23 - INFO - __main__ - Step 115417: {'lr': 6.439381356865642e-05, 'samples': 22160064, 'steps': 115416, 'loss/train': 1.104349970817566} 11/07/2021 13:27:23 - INFO - __main__ - Step 115418: {'lr': 6.439025846603578e-05, 'samples': 22160256, 'steps': 115417, 'loss/train': 1.1309584379196167} 11/07/2021 13:27:24 - INFO - __main__ - Step 115419: {'lr': 6.4386703447048e-05, 'samples': 22160448, 'steps': 115418, 'loss/train': 1.2056130170822144} 11/07/2021 13:27:25 - INFO - __main__ - Step 115420: {'lr': 6.438314851169472e-05, 'samples': 22160640, 'steps': 115419, 'loss/train': 1.737018346786499} 11/07/2021 13:27:25 - INFO - __main__ - Step 115421: {'lr': 6.437959365997753e-05, 'samples': 22160832, 'steps': 115420, 'loss/train': 1.1479135751724243} 11/07/2021 13:27:25 - INFO - __main__ - Step 115422: {'lr': 6.437603889189805e-05, 'samples': 22161024, 'steps': 115421, 'loss/train': 0.9764405488967896} 11/07/2021 13:27:26 - INFO - __main__ - Step 115423: {'lr': 6.437248420745783e-05, 'samples': 22161216, 'steps': 115422, 'loss/train': 1.1941226720809937} 11/07/2021 13:27:26 - INFO - __main__ - Step 115424: {'lr': 6.436892960665853e-05, 'samples': 22161408, 'steps': 115423, 'loss/train': 1.3653191328048706} 11/07/2021 13:27:26 - INFO - __main__ - Step 115425: {'lr': 6.436537508950172e-05, 'samples': 22161600, 'steps': 115424, 'loss/train': 1.6326091289520264} 11/07/2021 13:27:27 - INFO - __main__ - Step 115426: {'lr': 6.4361820655989e-05, 'samples': 22161792, 'steps': 115425, 'loss/train': 1.1605764627456665} 11/07/2021 13:27:28 - INFO - __main__ - Step 115427: {'lr': 6.435826630612197e-05, 'samples': 22161984, 'steps': 115426, 'loss/train': 1.3495023250579834} 11/07/2021 13:27:28 - INFO - __main__ - Step 115428: {'lr': 6.435471203990231e-05, 'samples': 22162176, 'steps': 115427, 'loss/train': 1.5259453058242798} 11/07/2021 13:27:28 - INFO - __main__ - Step 115429: {'lr': 6.43511578573315e-05, 'samples': 22162368, 'steps': 115428, 'loss/train': 1.2415024042129517} 11/07/2021 13:27:29 - INFO - __main__ - Step 115430: {'lr': 6.43476037584112e-05, 'samples': 22162560, 'steps': 115429, 'loss/train': 1.2604680061340332} 11/07/2021 13:27:30 - INFO - __main__ - Step 115431: {'lr': 6.434404974314297e-05, 'samples': 22162752, 'steps': 115430, 'loss/train': 1.2298133373260498} 11/07/2021 13:27:30 - INFO - __main__ - Step 115432: {'lr': 6.434049581152848e-05, 'samples': 22162944, 'steps': 115431, 'loss/train': 1.230634093284607} 11/07/2021 13:27:31 - INFO - __main__ - Step 115433: {'lr': 6.43369419635693e-05, 'samples': 22163136, 'steps': 115432, 'loss/train': 1.4521269798278809} 11/07/2021 13:27:31 - INFO - __main__ - Step 115434: {'lr': 6.433338819926701e-05, 'samples': 22163328, 'steps': 115433, 'loss/train': 1.3372623920440674} 11/07/2021 13:27:31 - INFO - __main__ - Step 115435: {'lr': 6.432983451862323e-05, 'samples': 22163520, 'steps': 115434, 'loss/train': 1.368395447731018} 11/07/2021 13:27:32 - INFO - __main__ - Step 115436: {'lr': 6.432628092163955e-05, 'samples': 22163712, 'steps': 115435, 'loss/train': 1.230516791343689} 11/07/2021 13:27:33 - INFO - __main__ - Step 115437: {'lr': 6.432272740831759e-05, 'samples': 22163904, 'steps': 115436, 'loss/train': 1.4643386602401733} 11/07/2021 13:27:33 - INFO - __main__ - Step 115438: {'lr': 6.431917397865897e-05, 'samples': 22164096, 'steps': 115437, 'loss/train': 1.3689619302749634} 11/07/2021 13:27:33 - INFO - __main__ - Step 115439: {'lr': 6.431562063266524e-05, 'samples': 22164288, 'steps': 115438, 'loss/train': 1.627417802810669} 11/07/2021 13:27:34 - INFO - __main__ - Step 115440: {'lr': 6.431206737033804e-05, 'samples': 22164480, 'steps': 115439, 'loss/train': 1.1347821950912476} 11/07/2021 13:27:35 - INFO - __main__ - Step 115441: {'lr': 6.430851419167896e-05, 'samples': 22164672, 'steps': 115440, 'loss/train': 1.2113574743270874} 11/07/2021 13:27:35 - INFO - __main__ - Step 115442: {'lr': 6.430496109668965e-05, 'samples': 22164864, 'steps': 115441, 'loss/train': 1.3729785680770874} 11/07/2021 13:27:35 - INFO - __main__ - Step 115443: {'lr': 6.43014080853716e-05, 'samples': 22165056, 'steps': 115442, 'loss/train': 0.9456220269203186} 11/07/2021 13:27:36 - INFO - __main__ - Step 115444: {'lr': 6.429785515772646e-05, 'samples': 22165248, 'steps': 115443, 'loss/train': 1.1630204916000366} 11/07/2021 13:27:36 - INFO - __main__ - Step 115445: {'lr': 6.429430231375585e-05, 'samples': 22165440, 'steps': 115444, 'loss/train': 0.8758394718170166} 11/07/2021 13:27:37 - INFO - __main__ - Step 115446: {'lr': 6.429074955346137e-05, 'samples': 22165632, 'steps': 115445, 'loss/train': 1.1971466541290283} 11/07/2021 13:27:38 - INFO - __main__ - Step 115447: {'lr': 6.428719687684462e-05, 'samples': 22165824, 'steps': 115446, 'loss/train': 1.3095009326934814} 11/07/2021 13:27:38 - INFO - __main__ - Step 115448: {'lr': 6.428364428390714e-05, 'samples': 22166016, 'steps': 115447, 'loss/train': 0.5561941266059875} 11/07/2021 13:27:38 - INFO - __main__ - Step 115449: {'lr': 6.428009177465064e-05, 'samples': 22166208, 'steps': 115448, 'loss/train': 1.1558588743209839} 11/07/2021 13:27:39 - INFO - __main__ - Step 115450: {'lr': 6.427653934907665e-05, 'samples': 22166400, 'steps': 115449, 'loss/train': 0.947493851184845} 11/07/2021 13:27:40 - INFO - __main__ - Step 115451: {'lr': 6.427298700718678e-05, 'samples': 22166592, 'steps': 115450, 'loss/train': 1.5658506155014038} 11/07/2021 13:27:40 - INFO - __main__ - Step 115452: {'lr': 6.426943474898264e-05, 'samples': 22166784, 'steps': 115451, 'loss/train': 1.0943251848220825} 11/07/2021 13:27:40 - INFO - __main__ - Step 115453: {'lr': 6.42658825744658e-05, 'samples': 22166976, 'steps': 115452, 'loss/train': 1.176370620727539} 11/07/2021 13:27:41 - INFO - __main__ - Step 115454: {'lr': 6.426233048363795e-05, 'samples': 22167168, 'steps': 115453, 'loss/train': 1.8221055269241333} 11/07/2021 13:27:41 - INFO - __main__ - Step 115455: {'lr': 6.425877847650064e-05, 'samples': 22167360, 'steps': 115454, 'loss/train': 0.7642796635627747} 11/07/2021 13:27:42 - INFO - __main__ - Step 115456: {'lr': 6.425522655305541e-05, 'samples': 22167552, 'steps': 115455, 'loss/train': 1.5144083499908447} 11/07/2021 13:27:42 - INFO - __main__ - Step 115457: {'lr': 6.42516747133039e-05, 'samples': 22167744, 'steps': 115456, 'loss/train': 1.1367435455322266} 11/07/2021 13:27:43 - INFO - __main__ - Step 115458: {'lr': 6.424812295724775e-05, 'samples': 22167936, 'steps': 115457, 'loss/train': 1.3801681995391846} 11/07/2021 13:27:43 - INFO - __main__ - Step 115459: {'lr': 6.424457128488847e-05, 'samples': 22168128, 'steps': 115458, 'loss/train': 0.4700964391231537} 11/07/2021 13:27:44 - INFO - __main__ - Step 115460: {'lr': 6.424101969622779e-05, 'samples': 22168320, 'steps': 115459, 'loss/train': 1.543501615524292} 11/07/2021 13:27:44 - INFO - __main__ - Step 115461: {'lr': 6.423746819126718e-05, 'samples': 22168512, 'steps': 115460, 'loss/train': 1.3835827112197876} 11/07/2021 13:27:45 - INFO - __main__ - Step 115462: {'lr': 6.423391677000834e-05, 'samples': 22168704, 'steps': 115461, 'loss/train': 1.2870984077453613} 11/07/2021 13:27:45 - INFO - __main__ - Step 115463: {'lr': 6.423036543245281e-05, 'samples': 22168896, 'steps': 115462, 'loss/train': 1.2253906726837158} 11/07/2021 13:27:46 - INFO - __main__ - Step 115464: {'lr': 6.422681417860221e-05, 'samples': 22169088, 'steps': 115463, 'loss/train': 1.1872189044952393} 11/07/2021 13:27:46 - INFO - __main__ - Step 115465: {'lr': 6.422326300845815e-05, 'samples': 22169280, 'steps': 115464, 'loss/train': 1.5605517625808716} 11/07/2021 13:27:46 - INFO - __main__ - Step 115466: {'lr': 6.421971192202222e-05, 'samples': 22169472, 'steps': 115465, 'loss/train': 1.5464098453521729} 11/07/2021 13:27:48 - INFO - __main__ - Step 115467: {'lr': 6.421616091929602e-05, 'samples': 22169664, 'steps': 115466, 'loss/train': 1.3770867586135864} 11/07/2021 13:27:48 - INFO - __main__ - Step 115468: {'lr': 6.421261000028114e-05, 'samples': 22169856, 'steps': 115467, 'loss/train': 0.47882315516471863} 11/07/2021 13:27:48 - INFO - __main__ - Step 115469: {'lr': 6.420905916497927e-05, 'samples': 22170048, 'steps': 115468, 'loss/train': 1.2282922267913818} 11/07/2021 13:27:49 - INFO - __main__ - Step 115470: {'lr': 6.420550841339187e-05, 'samples': 22170240, 'steps': 115469, 'loss/train': 1.6085478067398071} 11/07/2021 13:27:49 - INFO - __main__ - Step 115471: {'lr': 6.420195774552059e-05, 'samples': 22170432, 'steps': 115470, 'loss/train': 1.3906277418136597} 11/07/2021 13:27:49 - INFO - __main__ - Step 115472: {'lr': 6.419840716136705e-05, 'samples': 22170624, 'steps': 115471, 'loss/train': 1.0764869451522827} 11/07/2021 13:27:50 - INFO - __main__ - Step 115473: {'lr': 6.419485666093283e-05, 'samples': 22170816, 'steps': 115472, 'loss/train': 1.6763055324554443} 11/07/2021 13:27:51 - INFO - __main__ - Step 115474: {'lr': 6.419130624421954e-05, 'samples': 22171008, 'steps': 115473, 'loss/train': 1.3404704332351685} 11/07/2021 13:27:51 - INFO - __main__ - Step 115475: {'lr': 6.418775591122881e-05, 'samples': 22171200, 'steps': 115474, 'loss/train': 1.4326015710830688} 11/07/2021 13:27:51 - INFO - __main__ - Step 115476: {'lr': 6.418420566196217e-05, 'samples': 22171392, 'steps': 115475, 'loss/train': 1.2674041986465454} 11/07/2021 13:27:52 - INFO - __main__ - Step 115477: {'lr': 6.418065549642127e-05, 'samples': 22171584, 'steps': 115476, 'loss/train': 1.311934471130371} 11/07/2021 13:27:53 - INFO - __main__ - Step 115478: {'lr': 6.41771054146077e-05, 'samples': 22171776, 'steps': 115477, 'loss/train': 1.4794095754623413} 11/07/2021 13:27:53 - INFO - __main__ - Step 115479: {'lr': 6.417355541652306e-05, 'samples': 22171968, 'steps': 115478, 'loss/train': 1.701339602470398} 11/07/2021 13:27:54 - INFO - __main__ - Step 115480: {'lr': 6.417000550216896e-05, 'samples': 22172160, 'steps': 115479, 'loss/train': 1.7164206504821777} 11/07/2021 13:27:54 - INFO - __main__ - Step 115481: {'lr': 6.416645567154697e-05, 'samples': 22172352, 'steps': 115480, 'loss/train': 0.8545663356781006} 11/07/2021 13:27:54 - INFO - __main__ - Step 115482: {'lr': 6.416290592465879e-05, 'samples': 22172544, 'steps': 115481, 'loss/train': 1.7383325099945068} 11/07/2021 13:27:55 - INFO - __main__ - Step 115483: {'lr': 6.415935626150587e-05, 'samples': 22172736, 'steps': 115482, 'loss/train': 1.39093017578125} 11/07/2021 13:27:56 - INFO - __main__ - Step 115484: {'lr': 6.415580668208987e-05, 'samples': 22172928, 'steps': 115483, 'loss/train': 1.2636901140213013} 11/07/2021 13:27:56 - INFO - __main__ - Step 115485: {'lr': 6.41522571864124e-05, 'samples': 22173120, 'steps': 115484, 'loss/train': 1.3340333700180054} 11/07/2021 13:27:56 - INFO - __main__ - Step 115486: {'lr': 6.414870777447504e-05, 'samples': 22173312, 'steps': 115485, 'loss/train': 1.3510632514953613} 11/07/2021 13:27:57 - INFO - __main__ - Step 115487: {'lr': 6.414515844627942e-05, 'samples': 22173504, 'steps': 115486, 'loss/train': 1.3367990255355835} 11/07/2021 13:27:57 - INFO - __main__ - Step 115488: {'lr': 6.414160920182712e-05, 'samples': 22173696, 'steps': 115487, 'loss/train': 0.9997902512550354} 11/07/2021 13:27:58 - INFO - __main__ - Step 115489: {'lr': 6.413806004111974e-05, 'samples': 22173888, 'steps': 115488, 'loss/train': 1.3131370544433594} 11/07/2021 13:27:59 - INFO - __main__ - Step 115490: {'lr': 6.41345109641589e-05, 'samples': 22174080, 'steps': 115489, 'loss/train': 1.5965713262557983} 11/07/2021 13:27:59 - INFO - __main__ - Step 115491: {'lr': 6.413096197094615e-05, 'samples': 22174272, 'steps': 115490, 'loss/train': 1.4419763088226318} 11/07/2021 13:27:59 - INFO - __main__ - Step 115492: {'lr': 6.412741306148315e-05, 'samples': 22174464, 'steps': 115491, 'loss/train': 1.2609145641326904} 11/07/2021 13:28:00 - INFO - __main__ - Step 115493: {'lr': 6.412386423577143e-05, 'samples': 22174656, 'steps': 115492, 'loss/train': 1.348496913909912} 11/07/2021 13:28:01 - INFO - __main__ - Step 115494: {'lr': 6.412031549381266e-05, 'samples': 22174848, 'steps': 115493, 'loss/train': 0.9955356121063232} 11/07/2021 13:28:01 - INFO - __main__ - Step 115495: {'lr': 6.411676683560841e-05, 'samples': 22175040, 'steps': 115494, 'loss/train': 1.3470244407653809} 11/07/2021 13:28:01 - INFO - __main__ - Step 115496: {'lr': 6.411321826116034e-05, 'samples': 22175232, 'steps': 115495, 'loss/train': 1.098673939704895} 11/07/2021 13:28:02 - INFO - __main__ - Step 115497: {'lr': 6.410966977046993e-05, 'samples': 22175424, 'steps': 115496, 'loss/train': 1.0016555786132812} 11/07/2021 13:28:02 - INFO - __main__ - Step 115498: {'lr': 6.41061213635388e-05, 'samples': 22175616, 'steps': 115497, 'loss/train': 1.2388678789138794} 11/07/2021 13:28:03 - INFO - __main__ - Step 115499: {'lr': 6.410257304036859e-05, 'samples': 22175808, 'steps': 115498, 'loss/train': 1.2268061637878418} 11/07/2021 13:28:04 - INFO - __main__ - Step 115500: {'lr': 6.409902480096091e-05, 'samples': 22176000, 'steps': 115499, 'loss/train': 0.8866768479347229} 11/07/2021 13:28:04 - INFO - __main__ - Step 115501: {'lr': 6.409547664531735e-05, 'samples': 22176192, 'steps': 115500, 'loss/train': 1.7744395732879639} 11/07/2021 13:28:04 - INFO - __main__ - Step 115502: {'lr': 6.409192857343946e-05, 'samples': 22176384, 'steps': 115501, 'loss/train': 2.245530366897583} 11/07/2021 13:28:05 - INFO - __main__ - Step 115503: {'lr': 6.40883805853289e-05, 'samples': 22176576, 'steps': 115502, 'loss/train': 1.2818092107772827} 11/07/2021 13:28:06 - INFO - __main__ - Step 115504: {'lr': 6.408483268098725e-05, 'samples': 22176768, 'steps': 115503, 'loss/train': 1.6354774236679077} 11/07/2021 13:28:06 - INFO - __main__ - Step 115505: {'lr': 6.408128486041611e-05, 'samples': 22176960, 'steps': 115504, 'loss/train': 1.058597207069397} 11/07/2021 13:28:06 - INFO - __main__ - Step 115506: {'lr': 6.407773712361706e-05, 'samples': 22177152, 'steps': 115505, 'loss/train': 1.2945133447647095} 11/07/2021 13:28:07 - INFO - __main__ - Step 115507: {'lr': 6.407418947059173e-05, 'samples': 22177344, 'steps': 115506, 'loss/train': 1.4904725551605225} 11/07/2021 13:28:07 - INFO - __main__ - Step 115508: {'lr': 6.407064190134168e-05, 'samples': 22177536, 'steps': 115507, 'loss/train': 1.3861547708511353} 11/07/2021 13:28:08 - INFO - __main__ - Step 115509: {'lr': 6.406709441586863e-05, 'samples': 22177728, 'steps': 115508, 'loss/train': 1.37725031375885} 11/07/2021 13:28:09 - INFO - __main__ - Step 115510: {'lr': 6.4063547014174e-05, 'samples': 22177920, 'steps': 115509, 'loss/train': 0.7128508687019348} 11/07/2021 13:28:09 - INFO - __main__ - Step 115511: {'lr': 6.405999969625944e-05, 'samples': 22178112, 'steps': 115510, 'loss/train': 1.4449968338012695} 11/07/2021 13:28:09 - INFO - __main__ - Step 115512: {'lr': 6.40564524621266e-05, 'samples': 22178304, 'steps': 115511, 'loss/train': 1.0130177736282349} 11/07/2021 13:28:10 - INFO - __main__ - Step 115513: {'lr': 6.405290531177704e-05, 'samples': 22178496, 'steps': 115512, 'loss/train': 1.5514131784439087} 11/07/2021 13:28:10 - INFO - __main__ - Step 115514: {'lr': 6.404935824521238e-05, 'samples': 22178688, 'steps': 115513, 'loss/train': 1.2761110067367554} 11/07/2021 13:28:11 - INFO - __main__ - Step 115515: {'lr': 6.40458112624342e-05, 'samples': 22178880, 'steps': 115514, 'loss/train': 1.434399962425232} 11/07/2021 13:28:11 - INFO - __main__ - Step 115516: {'lr': 6.404226436344412e-05, 'samples': 22179072, 'steps': 115515, 'loss/train': 2.078749418258667} 11/07/2021 13:28:12 - INFO - __main__ - Step 115517: {'lr': 6.403871754824372e-05, 'samples': 22179264, 'steps': 115516, 'loss/train': 1.3314720392227173} 11/07/2021 13:28:12 - INFO - __main__ - Step 115518: {'lr': 6.40351708168346e-05, 'samples': 22179456, 'steps': 115517, 'loss/train': 1.6598249673843384} 11/07/2021 13:28:12 - INFO - __main__ - Step 115519: {'lr': 6.403162416921837e-05, 'samples': 22179648, 'steps': 115518, 'loss/train': 1.0876004695892334} 11/07/2021 13:28:14 - INFO - __main__ - Step 115520: {'lr': 6.402807760539661e-05, 'samples': 22179840, 'steps': 115519, 'loss/train': 1.4775232076644897} 11/07/2021 13:28:14 - INFO - __main__ - Step 115521: {'lr': 6.402453112537093e-05, 'samples': 22180032, 'steps': 115520, 'loss/train': 0.9791207909584045} 11/07/2021 13:28:14 - INFO - __main__ - Step 115522: {'lr': 6.4020984729143e-05, 'samples': 22180224, 'steps': 115521, 'loss/train': 0.6207470893859863} 11/07/2021 13:28:15 - INFO - __main__ - Step 115523: {'lr': 6.401743841671429e-05, 'samples': 22180416, 'steps': 115522, 'loss/train': 1.0589673519134521} 11/07/2021 13:28:15 - INFO - __main__ - Step 115524: {'lr': 6.401389218808643e-05, 'samples': 22180608, 'steps': 115523, 'loss/train': 1.2155473232269287} 11/07/2021 13:28:16 - INFO - __main__ - Step 115525: {'lr': 6.401034604326105e-05, 'samples': 22180800, 'steps': 115524, 'loss/train': 1.4802409410476685} 11/07/2021 13:28:16 - INFO - __main__ - Step 115526: {'lr': 6.400679998223974e-05, 'samples': 22180992, 'steps': 115525, 'loss/train': 0.8562851548194885} 11/07/2021 13:28:17 - INFO - __main__ - Step 115527: {'lr': 6.40032540050241e-05, 'samples': 22181184, 'steps': 115526, 'loss/train': 1.4196661710739136} 11/07/2021 13:28:17 - INFO - __main__ - Step 115528: {'lr': 6.399970811161571e-05, 'samples': 22181376, 'steps': 115527, 'loss/train': 1.1357133388519287} 11/07/2021 13:28:17 - INFO - __main__ - Step 115529: {'lr': 6.399616230201619e-05, 'samples': 22181568, 'steps': 115528, 'loss/train': 1.3654210567474365} 11/07/2021 13:28:18 - INFO - __main__ - Step 115530: {'lr': 6.399261657622712e-05, 'samples': 22181760, 'steps': 115529, 'loss/train': 1.051012396812439} 11/07/2021 13:28:19 - INFO - __main__ - Step 115531: {'lr': 6.398907093425013e-05, 'samples': 22181952, 'steps': 115530, 'loss/train': 1.3579734563827515} 11/07/2021 13:28:19 - INFO - __main__ - Step 115532: {'lr': 6.398552537608676e-05, 'samples': 22182144, 'steps': 115531, 'loss/train': 1.5175261497497559} 11/07/2021 13:28:20 - INFO - __main__ - Step 115533: {'lr': 6.398197990173874e-05, 'samples': 22182336, 'steps': 115532, 'loss/train': 1.5827537775039673} 11/07/2021 13:28:20 - INFO - __main__ - Step 115534: {'lr': 6.39784345112075e-05, 'samples': 22182528, 'steps': 115533, 'loss/train': 1.5896532535552979} 11/07/2021 13:28:21 - INFO - __main__ - Step 115535: {'lr': 6.397488920449468e-05, 'samples': 22182720, 'steps': 115534, 'loss/train': 1.0397437810897827} 11/07/2021 13:28:21 - INFO - __main__ - Step 115536: {'lr': 6.397134398160192e-05, 'samples': 22182912, 'steps': 115535, 'loss/train': 1.1332740783691406} 11/07/2021 13:28:22 - INFO - __main__ - Step 115537: {'lr': 6.396779884253081e-05, 'samples': 22183104, 'steps': 115536, 'loss/train': 1.425927758216858} 11/07/2021 13:28:22 - INFO - __main__ - Step 115538: {'lr': 6.396425378728294e-05, 'samples': 22183296, 'steps': 115537, 'loss/train': 1.2522084712982178} 11/07/2021 13:28:22 - INFO - __main__ - Step 115539: {'lr': 6.396070881585988e-05, 'samples': 22183488, 'steps': 115538, 'loss/train': 1.7062772512435913} 11/07/2021 13:28:23 - INFO - __main__ - Step 115540: {'lr': 6.395716392826328e-05, 'samples': 22183680, 'steps': 115539, 'loss/train': 0.7012368440628052} 11/07/2021 13:28:24 - INFO - __main__ - Step 115541: {'lr': 6.395361912449472e-05, 'samples': 22183872, 'steps': 115540, 'loss/train': 1.3245525360107422} 11/07/2021 13:28:24 - INFO - __main__ - Step 115542: {'lr': 6.395007440455577e-05, 'samples': 22184064, 'steps': 115541, 'loss/train': 1.520730972290039} 11/07/2021 13:28:24 - INFO - __main__ - Step 115543: {'lr': 6.394652976844804e-05, 'samples': 22184256, 'steps': 115542, 'loss/train': 1.2028300762176514} 11/07/2021 13:28:25 - INFO - __main__ - Step 115544: {'lr': 6.39429852161732e-05, 'samples': 22184448, 'steps': 115543, 'loss/train': 0.8582241535186768} 11/07/2021 13:28:26 - INFO - __main__ - Step 115545: {'lr': 6.393944074773273e-05, 'samples': 22184640, 'steps': 115544, 'loss/train': 1.438714861869812} 11/07/2021 13:28:26 - INFO - __main__ - Step 115546: {'lr': 6.393589636312827e-05, 'samples': 22184832, 'steps': 115545, 'loss/train': 0.9927906394004822} 11/07/2021 13:28:26 - INFO - __main__ - Step 115547: {'lr': 6.393235206236143e-05, 'samples': 22185024, 'steps': 115546, 'loss/train': 1.5260404348373413} 11/07/2021 13:28:27 - INFO - __main__ - Step 115548: {'lr': 6.392880784543378e-05, 'samples': 22185216, 'steps': 115547, 'loss/train': 1.1573808193206787} 11/07/2021 13:28:27 - INFO - __main__ - Step 115549: {'lr': 6.392526371234694e-05, 'samples': 22185408, 'steps': 115548, 'loss/train': 0.8869931697845459} 11/07/2021 13:28:28 - INFO - __main__ - Step 115550: {'lr': 6.392171966310253e-05, 'samples': 22185600, 'steps': 115549, 'loss/train': 1.1223231554031372} 11/07/2021 13:28:29 - INFO - __main__ - Step 115551: {'lr': 6.391817569770211e-05, 'samples': 22185792, 'steps': 115550, 'loss/train': 1.2928682565689087} 11/07/2021 13:28:29 - INFO - __main__ - Step 115552: {'lr': 6.391463181614726e-05, 'samples': 22185984, 'steps': 115551, 'loss/train': 1.241314172744751} 11/07/2021 13:28:29 - INFO - __main__ - Step 115553: {'lr': 6.391108801843964e-05, 'samples': 22186176, 'steps': 115552, 'loss/train': 1.0999610424041748} 11/07/2021 13:28:30 - INFO - __main__ - Step 115554: {'lr': 6.390754430458081e-05, 'samples': 22186368, 'steps': 115553, 'loss/train': 1.195088267326355} 11/07/2021 13:28:30 - INFO - __main__ - Step 115555: {'lr': 6.390400067457245e-05, 'samples': 22186560, 'steps': 115554, 'loss/train': 0.046811021864414215} 11/07/2021 13:28:31 - INFO - __main__ - Step 115556: {'lr': 6.390045712841597e-05, 'samples': 22186752, 'steps': 115555, 'loss/train': 1.7036327123641968} 11/07/2021 13:28:31 - INFO - __main__ - Step 115557: {'lr': 6.389691366611308e-05, 'samples': 22186944, 'steps': 115556, 'loss/train': 1.263990044593811} 11/07/2021 13:28:32 - INFO - __main__ - Step 115558: {'lr': 6.389337028766539e-05, 'samples': 22187136, 'steps': 115557, 'loss/train': 0.764149010181427} 11/07/2021 13:28:32 - INFO - __main__ - Step 115559: {'lr': 6.388982699307447e-05, 'samples': 22187328, 'steps': 115558, 'loss/train': 1.2150506973266602} 11/07/2021 13:28:32 - INFO - __main__ - Step 115560: {'lr': 6.388628378234191e-05, 'samples': 22187520, 'steps': 115559, 'loss/train': 1.4437958002090454} 11/07/2021 13:28:33 - INFO - __main__ - Step 115561: {'lr': 6.388274065546931e-05, 'samples': 22187712, 'steps': 115560, 'loss/train': 1.2612757682800293} 11/07/2021 13:28:34 - INFO - __main__ - Step 115562: {'lr': 6.38791976124583e-05, 'samples': 22187904, 'steps': 115561, 'loss/train': 0.7988664507865906} 11/07/2021 13:28:34 - INFO - __main__ - Step 115563: {'lr': 6.387565465331044e-05, 'samples': 22188096, 'steps': 115562, 'loss/train': 1.3434914350509644} 11/07/2021 13:28:34 - INFO - __main__ - Step 115564: {'lr': 6.387211177802735e-05, 'samples': 22188288, 'steps': 115563, 'loss/train': 1.638801097869873} 11/07/2021 13:28:35 - INFO - __main__ - Step 115565: {'lr': 6.386856898661059e-05, 'samples': 22188480, 'steps': 115564, 'loss/train': 1.025709867477417} 11/07/2021 13:28:35 - INFO - __main__ - Step 115566: {'lr': 6.386502627906188e-05, 'samples': 22188672, 'steps': 115565, 'loss/train': 1.8310247659683228} 11/07/2021 13:28:36 - INFO - __main__ - Step 115567: {'lr': 6.386148365538263e-05, 'samples': 22188864, 'steps': 115566, 'loss/train': 1.4981237649917603} 11/07/2021 13:28:37 - INFO - __main__ - Step 115568: {'lr': 6.385794111557453e-05, 'samples': 22189056, 'steps': 115567, 'loss/train': 1.4472367763519287} 11/07/2021 13:28:37 - INFO - __main__ - Step 115569: {'lr': 6.385439865963916e-05, 'samples': 22189248, 'steps': 115568, 'loss/train': 1.360410451889038} 11/07/2021 13:28:37 - INFO - __main__ - Step 115570: {'lr': 6.385085628757811e-05, 'samples': 22189440, 'steps': 115569, 'loss/train': 0.9578264355659485} 11/07/2021 13:28:38 - INFO - __main__ - Step 115571: {'lr': 6.384731399939303e-05, 'samples': 22189632, 'steps': 115570, 'loss/train': 1.2631149291992188} 11/07/2021 13:28:39 - INFO - __main__ - Step 115572: {'lr': 6.384377179508543e-05, 'samples': 22189824, 'steps': 115571, 'loss/train': 1.563348650932312} 11/07/2021 13:28:39 - INFO - __main__ - Step 115573: {'lr': 6.384022967465699e-05, 'samples': 22190016, 'steps': 115572, 'loss/train': 0.9562385082244873} 11/07/2021 13:28:39 - INFO - __main__ - Step 115574: {'lr': 6.383668763810926e-05, 'samples': 22190208, 'steps': 115573, 'loss/train': 1.4906837940216064} 11/07/2021 13:28:40 - INFO - __main__ - Step 115575: {'lr': 6.383314568544385e-05, 'samples': 22190400, 'steps': 115574, 'loss/train': 0.485300213098526} 11/07/2021 13:28:40 - INFO - __main__ - Step 115576: {'lr': 6.382960381666244e-05, 'samples': 22190592, 'steps': 115575, 'loss/train': 0.9719594717025757} 11/07/2021 13:28:41 - INFO - __main__ - Step 115577: {'lr': 6.382606203176644e-05, 'samples': 22190784, 'steps': 115576, 'loss/train': 1.3396449089050293} 11/07/2021 13:28:41 - INFO - __main__ - Step 115578: {'lr': 6.382252033075755e-05, 'samples': 22190976, 'steps': 115577, 'loss/train': 1.2586907148361206} 11/07/2021 13:28:42 - INFO - __main__ - Step 115579: {'lr': 6.381897871363737e-05, 'samples': 22191168, 'steps': 115578, 'loss/train': 1.4841073751449585} 11/07/2021 13:28:42 - INFO - __main__ - Step 115580: {'lr': 6.381543718040747e-05, 'samples': 22191360, 'steps': 115579, 'loss/train': 1.0802079439163208} 11/07/2021 13:28:42 - INFO - __main__ - Step 115581: {'lr': 6.381189573106947e-05, 'samples': 22191552, 'steps': 115580, 'loss/train': 1.1479393243789673} 11/07/2021 13:28:44 - INFO - __main__ - Step 115582: {'lr': 6.380835436562496e-05, 'samples': 22191744, 'steps': 115581, 'loss/train': 1.4548636674880981} 11/07/2021 13:28:45 - INFO - __main__ - Step 115583: {'lr': 6.380481308407552e-05, 'samples': 22191936, 'steps': 115582, 'loss/train': 0.29666760563850403} 11/07/2021 13:28:45 - INFO - __main__ - Step 115584: {'lr': 6.380127188642277e-05, 'samples': 22192128, 'steps': 115583, 'loss/train': 1.6929192543029785} 11/07/2021 13:28:45 - INFO - __main__ - Step 115585: {'lr': 6.379773077266829e-05, 'samples': 22192320, 'steps': 115584, 'loss/train': 0.7033878564834595} 11/07/2021 13:28:46 - INFO - __main__ - Step 115586: {'lr': 6.379418974281367e-05, 'samples': 22192512, 'steps': 115585, 'loss/train': 0.8906382918357849} 11/07/2021 13:28:46 - INFO - __main__ - Step 115587: {'lr': 6.379064879686053e-05, 'samples': 22192704, 'steps': 115586, 'loss/train': 1.9600107669830322} 11/07/2021 13:28:46 - INFO - __main__ - Step 115588: {'lr': 6.378710793481044e-05, 'samples': 22192896, 'steps': 115587, 'loss/train': 1.577537178993225} 11/07/2021 13:28:47 - INFO - __main__ - Step 115589: {'lr': 6.3783567156665e-05, 'samples': 22193088, 'steps': 115588, 'loss/train': 1.4135551452636719} 11/07/2021 13:28:48 - INFO - __main__ - Step 115590: {'lr': 6.37800264624259e-05, 'samples': 22193280, 'steps': 115589, 'loss/train': 1.4306297302246094} 11/07/2021 13:28:48 - INFO - __main__ - Step 115591: {'lr': 6.377648585209455e-05, 'samples': 22193472, 'steps': 115590, 'loss/train': 1.2105052471160889} 11/07/2021 13:28:49 - INFO - __main__ - Step 115592: {'lr': 6.377294532567265e-05, 'samples': 22193664, 'steps': 115591, 'loss/train': 1.4034428596496582} 11/07/2021 13:28:49 - INFO - __main__ - Step 115593: {'lr': 6.376940488316179e-05, 'samples': 22193856, 'steps': 115592, 'loss/train': 1.04597008228302} 11/07/2021 13:28:50 - INFO - __main__ - Step 115594: {'lr': 6.376586452456359e-05, 'samples': 22194048, 'steps': 115593, 'loss/train': 1.3058533668518066} 11/07/2021 13:28:50 - INFO - __main__ - Step 115595: {'lr': 6.37623242498796e-05, 'samples': 22194240, 'steps': 115594, 'loss/train': 1.6054383516311646} 11/07/2021 13:28:51 - INFO - __main__ - Step 115596: {'lr': 6.375878405911143e-05, 'samples': 22194432, 'steps': 115595, 'loss/train': 0.9724910855293274} 11/07/2021 13:28:51 - INFO - __main__ - Step 115597: {'lr': 6.375524395226064e-05, 'samples': 22194624, 'steps': 115596, 'loss/train': 1.0680402517318726} 11/07/2021 13:28:51 - INFO - __main__ - Step 115598: {'lr': 6.375170392932891e-05, 'samples': 22194816, 'steps': 115597, 'loss/train': 1.6121606826782227} 11/07/2021 13:28:53 - INFO - __main__ - Step 115599: {'lr': 6.37481639903178e-05, 'samples': 22195008, 'steps': 115598, 'loss/train': 0.9649008512496948} 11/07/2021 13:28:53 - INFO - __main__ - Step 115600: {'lr': 6.374462413522886e-05, 'samples': 22195200, 'steps': 115599, 'loss/train': 1.5983386039733887} 11/07/2021 13:28:53 - INFO - __main__ - Step 115601: {'lr': 6.374108436406373e-05, 'samples': 22195392, 'steps': 115600, 'loss/train': 1.768314242362976} 11/07/2021 13:28:54 - INFO - __main__ - Step 115602: {'lr': 6.373754467682399e-05, 'samples': 22195584, 'steps': 115601, 'loss/train': 1.426060438156128} 11/07/2021 13:28:54 - INFO - __main__ - Step 115603: {'lr': 6.373400507351132e-05, 'samples': 22195776, 'steps': 115602, 'loss/train': 1.1130141019821167} 11/07/2021 13:28:54 - INFO - __main__ - Step 115604: {'lr': 6.373046555412715e-05, 'samples': 22195968, 'steps': 115603, 'loss/train': 1.3087481260299683} 11/07/2021 13:28:55 - INFO - __main__ - Step 115605: {'lr': 6.372692611867314e-05, 'samples': 22196160, 'steps': 115604, 'loss/train': 1.1753593683242798} 11/07/2021 13:28:56 - INFO - __main__ - Step 115606: {'lr': 6.372338676715095e-05, 'samples': 22196352, 'steps': 115605, 'loss/train': 1.6452417373657227} 11/07/2021 13:28:56 - INFO - __main__ - Step 115607: {'lr': 6.371984749956208e-05, 'samples': 22196544, 'steps': 115606, 'loss/train': 1.6203725337982178} 11/07/2021 13:28:56 - INFO - __main__ - Step 115608: {'lr': 6.371630831590822e-05, 'samples': 22196736, 'steps': 115607, 'loss/train': 1.1567003726959229} 11/07/2021 13:28:57 - INFO - __main__ - Step 115609: {'lr': 6.371276921619087e-05, 'samples': 22196928, 'steps': 115608, 'loss/train': 1.5112494230270386} 11/07/2021 13:28:58 - INFO - __main__ - Step 115610: {'lr': 6.37092302004117e-05, 'samples': 22197120, 'steps': 115609, 'loss/train': 1.1674394607543945} 11/07/2021 13:28:58 - INFO - __main__ - Step 115611: {'lr': 6.370569126857225e-05, 'samples': 22197312, 'steps': 115610, 'loss/train': 0.9106939435005188} 11/07/2021 13:28:59 - INFO - __main__ - Step 115612: {'lr': 6.370215242067418e-05, 'samples': 22197504, 'steps': 115611, 'loss/train': 1.0689219236373901} 11/07/2021 13:28:59 - INFO - __main__ - Step 115613: {'lr': 6.369861365671903e-05, 'samples': 22197696, 'steps': 115612, 'loss/train': 1.0907635688781738} 11/07/2021 13:28:59 - INFO - __main__ - Step 115614: {'lr': 6.36950749767084e-05, 'samples': 22197888, 'steps': 115613, 'loss/train': 1.1966521739959717} 11/07/2021 13:29:00 - INFO - __main__ - Step 115615: {'lr': 6.36915363806439e-05, 'samples': 22198080, 'steps': 115614, 'loss/train': 1.508121132850647} 11/07/2021 13:29:01 - INFO - __main__ - Step 115616: {'lr': 6.368799786852711e-05, 'samples': 22198272, 'steps': 115615, 'loss/train': 2.729327440261841} 11/07/2021 13:29:01 - INFO - __main__ - Step 115617: {'lr': 6.368445944035972e-05, 'samples': 22198464, 'steps': 115616, 'loss/train': 1.5361138582229614} 11/07/2021 13:29:01 - INFO - __main__ - Step 115618: {'lr': 6.368092109614315e-05, 'samples': 22198656, 'steps': 115617, 'loss/train': 1.3773585557937622} 11/07/2021 13:29:02 - INFO - __main__ - Step 115619: {'lr': 6.36773828358791e-05, 'samples': 22198848, 'steps': 115618, 'loss/train': 1.4723501205444336} 11/07/2021 13:29:03 - INFO - __main__ - Step 115620: {'lr': 6.367384465956913e-05, 'samples': 22199040, 'steps': 115619, 'loss/train': 1.2538026571273804} 11/07/2021 13:29:03 - INFO - __main__ - Step 115621: {'lr': 6.367030656721484e-05, 'samples': 22199232, 'steps': 115620, 'loss/train': 1.3974838256835938} 11/07/2021 13:29:04 - INFO - __main__ - Step 115622: {'lr': 6.366676855881786e-05, 'samples': 22199424, 'steps': 115621, 'loss/train': 1.6391587257385254} 11/07/2021 13:29:04 - INFO - __main__ - Step 115623: {'lr': 6.366323063437976e-05, 'samples': 22199616, 'steps': 115622, 'loss/train': 1.070708155632019} 11/07/2021 13:29:04 - INFO - __main__ - Step 115624: {'lr': 6.365969279390213e-05, 'samples': 22199808, 'steps': 115623, 'loss/train': 1.4489772319793701} 11/07/2021 13:29:05 - INFO - __main__ - Step 115625: {'lr': 6.365615503738656e-05, 'samples': 22200000, 'steps': 115624, 'loss/train': 1.5656237602233887} 11/07/2021 13:29:05 - INFO - __main__ - Step 115626: {'lr': 6.365261736483464e-05, 'samples': 22200192, 'steps': 115625, 'loss/train': 0.24574509263038635} 11/07/2021 13:29:06 - INFO - __main__ - Step 115627: {'lr': 6.364907977624799e-05, 'samples': 22200384, 'steps': 115626, 'loss/train': 0.6453309059143066} 11/07/2021 13:29:07 - INFO - __main__ - Step 115628: {'lr': 6.364554227162819e-05, 'samples': 22200576, 'steps': 115627, 'loss/train': 1.3520936965942383} 11/07/2021 13:29:07 - INFO - __main__ - Step 115629: {'lr': 6.364200485097682e-05, 'samples': 22200768, 'steps': 115628, 'loss/train': 1.3464434146881104} 11/07/2021 13:29:07 - INFO - __main__ - Step 115630: {'lr': 6.363846751429555e-05, 'samples': 22200960, 'steps': 115629, 'loss/train': 1.285538673400879} 11/07/2021 13:29:08 - INFO - __main__ - Step 115631: {'lr': 6.363493026158587e-05, 'samples': 22201152, 'steps': 115630, 'loss/train': 1.4438446760177612} 11/07/2021 13:29:09 - INFO - __main__ - Step 115632: {'lr': 6.363139309284941e-05, 'samples': 22201344, 'steps': 115631, 'loss/train': 1.7631756067276} 11/07/2021 13:29:09 - INFO - __main__ - Step 115633: {'lr': 6.362785600808777e-05, 'samples': 22201536, 'steps': 115632, 'loss/train': 1.409590482711792} 11/07/2021 13:29:09 - INFO - __main__ - Step 115634: {'lr': 6.362431900730251e-05, 'samples': 22201728, 'steps': 115633, 'loss/train': 1.2355918884277344} 11/07/2021 13:29:10 - INFO - __main__ - Step 115635: {'lr': 6.362078209049526e-05, 'samples': 22201920, 'steps': 115634, 'loss/train': 1.1294198036193848} 11/07/2021 13:29:10 - INFO - __main__ - Step 115636: {'lr': 6.361724525766766e-05, 'samples': 22202112, 'steps': 115635, 'loss/train': 1.1878840923309326} 11/07/2021 13:29:11 - INFO - __main__ - Step 115637: {'lr': 6.36137085088212e-05, 'samples': 22202304, 'steps': 115636, 'loss/train': 2.621582269668579} 11/07/2021 13:29:11 - INFO - __main__ - Step 115638: {'lr': 6.361017184395757e-05, 'samples': 22202496, 'steps': 115637, 'loss/train': 1.8231043815612793} 11/07/2021 13:29:12 - INFO - __main__ - Step 115639: {'lr': 6.360663526307828e-05, 'samples': 22202688, 'steps': 115638, 'loss/train': 5.68012809753418} 11/07/2021 13:29:12 - INFO - __main__ - Step 115640: {'lr': 6.360309876618498e-05, 'samples': 22202880, 'steps': 115639, 'loss/train': 0.9602149128913879} 11/07/2021 13:29:12 - INFO - __main__ - Step 115641: {'lr': 6.359956235327924e-05, 'samples': 22203072, 'steps': 115640, 'loss/train': 1.230296015739441} 11/07/2021 13:29:13 - INFO - __main__ - Step 115642: {'lr': 6.359602602436268e-05, 'samples': 22203264, 'steps': 115641, 'loss/train': 1.225727915763855} 11/07/2021 13:29:14 - INFO - __main__ - Step 115643: {'lr': 6.359248977943693e-05, 'samples': 22203456, 'steps': 115642, 'loss/train': 1.482310175895691} 11/07/2021 13:29:14 - INFO - __main__ - Step 115644: {'lr': 6.358895361850347e-05, 'samples': 22203648, 'steps': 115643, 'loss/train': 1.5378307104110718} 11/07/2021 13:29:15 - INFO - __main__ - Step 115645: {'lr': 6.358541754156394e-05, 'samples': 22203840, 'steps': 115644, 'loss/train': 0.45263540744781494} 11/07/2021 13:29:15 - INFO - __main__ - Step 115646: {'lr': 6.358188154861994e-05, 'samples': 22204032, 'steps': 115645, 'loss/train': 1.2722276449203491} 11/07/2021 13:29:15 - INFO - __main__ - Step 115647: {'lr': 6.357834563967307e-05, 'samples': 22204224, 'steps': 115646, 'loss/train': 1.4239293336868286} 11/07/2021 13:29:16 - INFO - __main__ - Step 115648: {'lr': 6.357480981472493e-05, 'samples': 22204416, 'steps': 115647, 'loss/train': 1.2072712182998657} 11/07/2021 13:29:17 - INFO - __main__ - Step 115649: {'lr': 6.35712740737771e-05, 'samples': 22204608, 'steps': 115648, 'loss/train': 1.3331279754638672} 11/07/2021 13:29:17 - INFO - __main__ - Step 115650: {'lr': 6.356773841683116e-05, 'samples': 22204800, 'steps': 115649, 'loss/train': 1.2671220302581787} 11/07/2021 13:29:18 - INFO - __main__ - Step 115651: {'lr': 6.356420284388876e-05, 'samples': 22204992, 'steps': 115650, 'loss/train': 1.4687023162841797} 11/07/2021 13:29:18 - INFO - __main__ - Step 115652: {'lr': 6.356066735495142e-05, 'samples': 22205184, 'steps': 115651, 'loss/train': 1.2918050289154053} 11/07/2021 13:29:19 - INFO - __main__ - Step 115653: {'lr': 6.355713195002078e-05, 'samples': 22205376, 'steps': 115652, 'loss/train': 1.5569539070129395} 11/07/2021 13:29:19 - INFO - __main__ - Step 115654: {'lr': 6.35535966290984e-05, 'samples': 22205568, 'steps': 115653, 'loss/train': 1.5138931274414062} 11/07/2021 13:29:20 - INFO - __main__ - Step 115655: {'lr': 6.355006139218592e-05, 'samples': 22205760, 'steps': 115654, 'loss/train': 1.5131237506866455} 11/07/2021 13:29:20 - INFO - __main__ - Step 115656: {'lr': 6.354652623928489e-05, 'samples': 22205952, 'steps': 115655, 'loss/train': 1.89573073387146} 11/07/2021 13:29:20 - INFO - __main__ - Step 115657: {'lr': 6.3542991170397e-05, 'samples': 22206144, 'steps': 115656, 'loss/train': 1.9273245334625244} 11/07/2021 13:29:21 - INFO - __main__ - Step 115658: {'lr': 6.353945618552367e-05, 'samples': 22206336, 'steps': 115657, 'loss/train': 1.5973713397979736} 11/07/2021 13:29:22 - INFO - __main__ - Step 115659: {'lr': 6.353592128466662e-05, 'samples': 22206528, 'steps': 115658, 'loss/train': 0.9815806150436401} 11/07/2021 13:29:22 - INFO - __main__ - Step 115660: {'lr': 6.353238646782739e-05, 'samples': 22206720, 'steps': 115659, 'loss/train': 1.1249302625656128} 11/07/2021 13:29:23 - INFO - __main__ - Step 115661: {'lr': 6.352885173500755e-05, 'samples': 22206912, 'steps': 115660, 'loss/train': 1.2182360887527466} 11/07/2021 13:29:23 - INFO - __main__ - Step 115662: {'lr': 6.352531708620878e-05, 'samples': 22207104, 'steps': 115661, 'loss/train': 0.49081891775131226} 11/07/2021 13:29:23 - INFO - __main__ - Step 115663: {'lr': 6.352178252143262e-05, 'samples': 22207296, 'steps': 115662, 'loss/train': 1.0892750024795532} 11/07/2021 13:29:24 - INFO - __main__ - Step 115664: {'lr': 6.351824804068066e-05, 'samples': 22207488, 'steps': 115663, 'loss/train': 1.3534746170043945} 11/07/2021 13:29:25 - INFO - __main__ - Step 115665: {'lr': 6.351471364395448e-05, 'samples': 22207680, 'steps': 115664, 'loss/train': 1.4836965799331665} 11/07/2021 13:29:25 - INFO - __main__ - Step 115666: {'lr': 6.351117933125569e-05, 'samples': 22207872, 'steps': 115665, 'loss/train': 1.8300634622573853} 11/07/2021 13:29:25 - INFO - __main__ - Step 115667: {'lr': 6.350764510258592e-05, 'samples': 22208064, 'steps': 115666, 'loss/train': 0.4944573640823364} 11/07/2021 13:29:26 - INFO - __main__ - Step 115668: {'lr': 6.35041109579467e-05, 'samples': 22208256, 'steps': 115667, 'loss/train': 1.4520463943481445} 11/07/2021 13:29:28 - INFO - __main__ - Step 115669: {'lr': 6.350057689733968e-05, 'samples': 22208448, 'steps': 115668, 'loss/train': 1.4796867370605469} 11/07/2021 13:29:28 - INFO - __main__ - Step 115670: {'lr': 6.349704292076647e-05, 'samples': 22208640, 'steps': 115669, 'loss/train': 1.2394297122955322} 11/07/2021 13:29:28 - INFO - __main__ - Step 115671: {'lr': 6.349350902822854e-05, 'samples': 22208832, 'steps': 115670, 'loss/train': 1.4198825359344482} 11/07/2021 13:29:29 - INFO - __main__ - Step 115672: {'lr': 6.348997521972758e-05, 'samples': 22209024, 'steps': 115671, 'loss/train': 1.3919938802719116} 11/07/2021 13:29:29 - INFO - __main__ - Step 115673: {'lr': 6.348644149526512e-05, 'samples': 22209216, 'steps': 115672, 'loss/train': 0.6422358155250549} 11/07/2021 13:29:30 - INFO - __main__ - Step 115674: {'lr': 6.348290785484282e-05, 'samples': 22209408, 'steps': 115673, 'loss/train': 0.8352566957473755} 11/07/2021 13:29:30 - INFO - __main__ - Step 115675: {'lr': 6.347937429846224e-05, 'samples': 22209600, 'steps': 115674, 'loss/train': 1.5315223932266235} 11/07/2021 13:29:30 - INFO - __main__ - Step 115676: {'lr': 6.347584082612498e-05, 'samples': 22209792, 'steps': 115675, 'loss/train': 1.5494153499603271} 11/07/2021 13:29:31 - INFO - __main__ - Step 115677: {'lr': 6.347230743783262e-05, 'samples': 22209984, 'steps': 115676, 'loss/train': 1.6884856224060059} 11/07/2021 13:29:32 - INFO - __main__ - Step 115678: {'lr': 6.346877413358677e-05, 'samples': 22210176, 'steps': 115677, 'loss/train': 1.159229040145874} 11/07/2021 13:29:32 - INFO - __main__ - Step 115679: {'lr': 6.346524091338899e-05, 'samples': 22210368, 'steps': 115678, 'loss/train': 1.2165875434875488} 11/07/2021 13:29:32 - INFO - __main__ - Step 115680: {'lr': 6.346170777724089e-05, 'samples': 22210560, 'steps': 115679, 'loss/train': 1.164906620979309} 11/07/2021 13:29:33 - INFO - __main__ - Step 115681: {'lr': 6.345817472514409e-05, 'samples': 22210752, 'steps': 115680, 'loss/train': 1.2413204908370972} 11/07/2021 13:29:34 - INFO - __main__ - Step 115682: {'lr': 6.345464175710017e-05, 'samples': 22210944, 'steps': 115681, 'loss/train': 1.4015393257141113} 11/07/2021 13:29:34 - INFO - __main__ - Step 115683: {'lr': 6.345110887311068e-05, 'samples': 22211136, 'steps': 115682, 'loss/train': 1.2648563385009766} 11/07/2021 13:29:34 - INFO - __main__ - Step 115684: {'lr': 6.344757607317734e-05, 'samples': 22211328, 'steps': 115683, 'loss/train': 1.3574352264404297} 11/07/2021 13:29:35 - INFO - __main__ - Step 115685: {'lr': 6.344404335730152e-05, 'samples': 22211520, 'steps': 115684, 'loss/train': 1.032344937324524} 11/07/2021 13:29:35 - INFO - __main__ - Step 115686: {'lr': 6.344051072548499e-05, 'samples': 22211712, 'steps': 115685, 'loss/train': 1.987704873085022} 11/07/2021 13:29:36 - INFO - __main__ - Step 115687: {'lr': 6.343697817772928e-05, 'samples': 22211904, 'steps': 115686, 'loss/train': 1.018426537513733} 11/07/2021 13:29:37 - INFO - __main__ - Step 115688: {'lr': 6.343344571403598e-05, 'samples': 22212096, 'steps': 115687, 'loss/train': 1.3100826740264893} 11/07/2021 13:29:37 - INFO - __main__ - Step 115689: {'lr': 6.342991333440667e-05, 'samples': 22212288, 'steps': 115688, 'loss/train': 1.330299973487854} 11/07/2021 13:29:37 - INFO - __main__ - Step 115690: {'lr': 6.342638103884299e-05, 'samples': 22212480, 'steps': 115689, 'loss/train': 1.1949743032455444} 11/07/2021 13:29:38 - INFO - __main__ - Step 115691: {'lr': 6.34228488273465e-05, 'samples': 22212672, 'steps': 115690, 'loss/train': 1.22719144821167} 11/07/2021 13:29:39 - INFO - __main__ - Step 115692: {'lr': 6.341931669991877e-05, 'samples': 22212864, 'steps': 115691, 'loss/train': 0.9346979856491089} 11/07/2021 13:29:39 - INFO - __main__ - Step 115693: {'lr': 6.341578465656145e-05, 'samples': 22213056, 'steps': 115692, 'loss/train': 1.70442533493042} 11/07/2021 13:29:39 - INFO - __main__ - Step 115694: {'lr': 6.341225269727608e-05, 'samples': 22213248, 'steps': 115693, 'loss/train': 1.97982656955719} 11/07/2021 13:29:40 - INFO - __main__ - Step 115695: {'lr': 6.340872082206428e-05, 'samples': 22213440, 'steps': 115694, 'loss/train': 1.5284780263900757} 11/07/2021 13:29:40 - INFO - __main__ - Step 115696: {'lr': 6.340518903092762e-05, 'samples': 22213632, 'steps': 115695, 'loss/train': 1.315250277519226} 11/07/2021 13:29:40 - INFO - __main__ - Step 115697: {'lr': 6.340165732386777e-05, 'samples': 22213824, 'steps': 115696, 'loss/train': 1.503278136253357} 11/07/2021 13:29:42 - INFO - __main__ - Step 115698: {'lr': 6.339812570088622e-05, 'samples': 22214016, 'steps': 115697, 'loss/train': 1.3183646202087402} 11/07/2021 13:29:42 - INFO - __main__ - Step 115699: {'lr': 6.339459416198454e-05, 'samples': 22214208, 'steps': 115698, 'loss/train': 1.2702587842941284} 11/07/2021 13:29:42 - INFO - __main__ - Step 115700: {'lr': 6.339106270716442e-05, 'samples': 22214400, 'steps': 115699, 'loss/train': 0.4598589837551117} 11/07/2021 13:29:43 - INFO - __main__ - Step 115701: {'lr': 6.338753133642738e-05, 'samples': 22214592, 'steps': 115700, 'loss/train': 1.1402747631072998} 11/07/2021 13:29:43 - INFO - __main__ - Step 115702: {'lr': 6.338400004977505e-05, 'samples': 22214784, 'steps': 115701, 'loss/train': 0.6682000756263733} 11/07/2021 13:29:44 - INFO - __main__ - Step 115703: {'lr': 6.338046884720899e-05, 'samples': 22214976, 'steps': 115702, 'loss/train': 1.8724578619003296} 11/07/2021 13:29:44 - INFO - __main__ - Step 115704: {'lr': 6.337693772873084e-05, 'samples': 22215168, 'steps': 115703, 'loss/train': 1.6037096977233887} 11/07/2021 13:29:45 - INFO - __main__ - Step 115705: {'lr': 6.337340669434216e-05, 'samples': 22215360, 'steps': 115704, 'loss/train': 1.3090226650238037} 11/07/2021 13:29:45 - INFO - __main__ - Step 115706: {'lr': 6.336987574404454e-05, 'samples': 22215552, 'steps': 115705, 'loss/train': 1.1546176671981812} 11/07/2021 13:29:45 - INFO - __main__ - Step 115707: {'lr': 6.336634487783957e-05, 'samples': 22215744, 'steps': 115706, 'loss/train': 1.33553147315979} 11/07/2021 13:29:46 - INFO - __main__ - Step 115708: {'lr': 6.336281409572884e-05, 'samples': 22215936, 'steps': 115707, 'loss/train': 0.8557405471801758} 11/07/2021 13:29:47 - INFO - __main__ - Step 115709: {'lr': 6.335928339771393e-05, 'samples': 22216128, 'steps': 115708, 'loss/train': 1.3885622024536133} 11/07/2021 13:29:47 - INFO - __main__ - Step 115710: {'lr': 6.335575278379649e-05, 'samples': 22216320, 'steps': 115709, 'loss/train': 1.29410982131958} 11/07/2021 13:29:48 - INFO - __main__ - Step 115711: {'lr': 6.33522222539781e-05, 'samples': 22216512, 'steps': 115710, 'loss/train': 1.1489309072494507} 11/07/2021 13:29:48 - INFO - __main__ - Step 115712: {'lr': 6.334869180826027e-05, 'samples': 22216704, 'steps': 115711, 'loss/train': 1.4643007516860962} 11/07/2021 13:29:49 - INFO - __main__ - Step 115713: {'lr': 6.334516144664465e-05, 'samples': 22216896, 'steps': 115712, 'loss/train': 0.759775698184967} 11/07/2021 13:29:49 - INFO - __main__ - Step 115714: {'lr': 6.33416311691328e-05, 'samples': 22217088, 'steps': 115713, 'loss/train': 1.3833690881729126} 11/07/2021 13:29:50 - INFO - __main__ - Step 115715: {'lr': 6.333810097572631e-05, 'samples': 22217280, 'steps': 115714, 'loss/train': 1.2429442405700684} 11/07/2021 13:29:50 - INFO - __main__ - Step 115716: {'lr': 6.333457086642683e-05, 'samples': 22217472, 'steps': 115715, 'loss/train': 1.3671892881393433} 11/07/2021 13:29:50 - INFO - __main__ - Step 115717: {'lr': 6.333104084123589e-05, 'samples': 22217664, 'steps': 115716, 'loss/train': 1.5082436800003052} 11/07/2021 13:29:51 - INFO - __main__ - Step 115718: {'lr': 6.332751090015512e-05, 'samples': 22217856, 'steps': 115717, 'loss/train': 1.249049425125122} 11/07/2021 13:29:52 - INFO - __main__ - Step 115719: {'lr': 6.332398104318606e-05, 'samples': 22218048, 'steps': 115718, 'loss/train': 1.0762085914611816} 11/07/2021 13:29:52 - INFO - __main__ - Step 115720: {'lr': 6.332045127033037e-05, 'samples': 22218240, 'steps': 115719, 'loss/train': 1.7684677839279175} 11/07/2021 13:29:52 - INFO - __main__ - Step 115721: {'lr': 6.331692158158958e-05, 'samples': 22218432, 'steps': 115720, 'loss/train': 1.759324073791504} 11/07/2021 13:29:53 - INFO - __main__ - Step 115722: {'lr': 6.331339197696531e-05, 'samples': 22218624, 'steps': 115721, 'loss/train': 1.2728418111801147} 11/07/2021 13:29:53 - INFO - __main__ - Step 115723: {'lr': 6.330986245645917e-05, 'samples': 22218816, 'steps': 115722, 'loss/train': 1.5275545120239258} 11/07/2021 13:29:54 - INFO - __main__ - Step 115724: {'lr': 6.330633302007277e-05, 'samples': 22219008, 'steps': 115723, 'loss/train': 1.2255308628082275} 11/07/2021 13:29:55 - INFO - __main__ - Step 115725: {'lr': 6.330280366780758e-05, 'samples': 22219200, 'steps': 115724, 'loss/train': 1.513486623764038} 11/07/2021 13:29:55 - INFO - __main__ - Step 115726: {'lr': 6.32992743996653e-05, 'samples': 22219392, 'steps': 115725, 'loss/train': 1.4643491506576538} 11/07/2021 13:29:55 - INFO - __main__ - Step 115727: {'lr': 6.329574521564746e-05, 'samples': 22219584, 'steps': 115726, 'loss/train': 1.336957335472107} 11/07/2021 13:29:56 - INFO - __main__ - Step 115728: {'lr': 6.329221611575567e-05, 'samples': 22219776, 'steps': 115727, 'loss/train': 1.2483093738555908} 11/07/2021 13:29:57 - INFO - __main__ - Step 115729: {'lr': 6.328868709999152e-05, 'samples': 22219968, 'steps': 115728, 'loss/train': 1.529489517211914} 11/07/2021 13:29:57 - INFO - __main__ - Step 115730: {'lr': 6.328515816835664e-05, 'samples': 22220160, 'steps': 115729, 'loss/train': 1.301279902458191} 11/07/2021 13:29:57 - INFO - __main__ - Step 115731: {'lr': 6.328162932085254e-05, 'samples': 22220352, 'steps': 115730, 'loss/train': 1.7628235816955566} 11/07/2021 13:29:58 - INFO - __main__ - Step 115732: {'lr': 6.32781005574809e-05, 'samples': 22220544, 'steps': 115731, 'loss/train': 1.1755499839782715} 11/07/2021 13:29:58 - INFO - __main__ - Step 115733: {'lr': 6.327457187824326e-05, 'samples': 22220736, 'steps': 115732, 'loss/train': 1.480010747909546} 11/07/2021 13:29:59 - INFO - __main__ - Step 115734: {'lr': 6.32710432831412e-05, 'samples': 22220928, 'steps': 115733, 'loss/train': 0.2814549505710602} 11/07/2021 13:29:59 - INFO - __main__ - Step 115735: {'lr': 6.326751477217632e-05, 'samples': 22221120, 'steps': 115734, 'loss/train': 1.5624792575836182} 11/07/2021 13:30:00 - INFO - __main__ - Step 115736: {'lr': 6.326398634535024e-05, 'samples': 22221312, 'steps': 115735, 'loss/train': 1.449449896812439} 11/07/2021 13:30:00 - INFO - __main__ - Step 115737: {'lr': 6.326045800266452e-05, 'samples': 22221504, 'steps': 115736, 'loss/train': 1.2131667137145996} 11/07/2021 13:30:01 - INFO - __main__ - Step 115738: {'lr': 6.325692974412081e-05, 'samples': 22221696, 'steps': 115737, 'loss/train': 1.2749559879302979} 11/07/2021 13:30:02 - INFO - __main__ - Step 115739: {'lr': 6.325340156972059e-05, 'samples': 22221888, 'steps': 115738, 'loss/train': 1.16648268699646} 11/07/2021 13:30:02 - INFO - __main__ - Step 115740: {'lr': 6.32498734794655e-05, 'samples': 22222080, 'steps': 115739, 'loss/train': 1.0469512939453125} 11/07/2021 13:30:02 - INFO - __main__ - Step 115741: {'lr': 6.324634547335714e-05, 'samples': 22222272, 'steps': 115740, 'loss/train': 1.124932885169983} 11/07/2021 13:30:03 - INFO - __main__ - Step 115742: {'lr': 6.324281755139711e-05, 'samples': 22222464, 'steps': 115741, 'loss/train': 1.2574700117111206} 11/07/2021 13:30:03 - INFO - __main__ - Step 115743: {'lr': 6.323928971358698e-05, 'samples': 22222656, 'steps': 115742, 'loss/train': 1.1404799222946167} 11/07/2021 13:30:04 - INFO - __main__ - Step 115744: {'lr': 6.323576195992831e-05, 'samples': 22222848, 'steps': 115743, 'loss/train': 1.2774678468704224} 11/07/2021 13:30:05 - INFO - __main__ - Step 115745: {'lr': 6.323223429042274e-05, 'samples': 22223040, 'steps': 115744, 'loss/train': 1.1902287006378174} 11/07/2021 13:30:05 - INFO - __main__ - Step 115746: {'lr': 6.322870670507186e-05, 'samples': 22223232, 'steps': 115745, 'loss/train': 1.0589264631271362} 11/07/2021 13:30:05 - INFO - __main__ - Step 115747: {'lr': 6.322517920387725e-05, 'samples': 22223424, 'steps': 115746, 'loss/train': 1.5237300395965576} 11/07/2021 13:30:06 - INFO - __main__ - Step 115748: {'lr': 6.322165178684044e-05, 'samples': 22223616, 'steps': 115747, 'loss/train': 1.2695789337158203} 11/07/2021 13:30:06 - INFO - __main__ - Step 115749: {'lr': 6.321812445396313e-05, 'samples': 22223808, 'steps': 115748, 'loss/train': 1.243692398071289} 11/07/2021 13:30:07 - INFO - __main__ - Step 115750: {'lr': 6.32145972052468e-05, 'samples': 22224000, 'steps': 115749, 'loss/train': 1.6367367506027222} 11/07/2021 13:30:07 - INFO - __main__ - Step 115751: {'lr': 6.32110700406932e-05, 'samples': 22224192, 'steps': 115750, 'loss/train': 1.627453327178955} 11/07/2021 13:30:08 - INFO - __main__ - Step 115752: {'lr': 6.320754296030373e-05, 'samples': 22224384, 'steps': 115751, 'loss/train': 1.154607892036438} 11/07/2021 13:30:08 - INFO - __main__ - Step 115753: {'lr': 6.320401596408007e-05, 'samples': 22224576, 'steps': 115752, 'loss/train': 1.0361042022705078} 11/07/2021 13:30:08 - INFO - __main__ - Step 115754: {'lr': 6.320048905202378e-05, 'samples': 22224768, 'steps': 115753, 'loss/train': 1.4945399761199951} 11/07/2021 13:30:09 - INFO - __main__ - Step 115755: {'lr': 6.319696222413645e-05, 'samples': 22224960, 'steps': 115754, 'loss/train': 1.114384651184082} 11/07/2021 13:30:10 - INFO - __main__ - Step 115756: {'lr': 6.319343548041973e-05, 'samples': 22225152, 'steps': 115755, 'loss/train': 1.5072201490402222} 11/07/2021 13:30:10 - INFO - __main__ - Step 115757: {'lr': 6.318990882087513e-05, 'samples': 22225344, 'steps': 115756, 'loss/train': 0.806045651435852} 11/07/2021 13:30:10 - INFO - __main__ - Step 115758: {'lr': 6.318638224550429e-05, 'samples': 22225536, 'steps': 115757, 'loss/train': 1.4784860610961914} 11/07/2021 13:30:11 - INFO - __main__ - Step 115759: {'lr': 6.318285575430877e-05, 'samples': 22225728, 'steps': 115758, 'loss/train': 1.4056456089019775} 11/07/2021 13:30:12 - INFO - __main__ - Step 115760: {'lr': 6.317932934729018e-05, 'samples': 22225920, 'steps': 115759, 'loss/train': 1.4596675634384155} 11/07/2021 13:30:12 - INFO - __main__ - Step 115761: {'lr': 6.317580302445011e-05, 'samples': 22226112, 'steps': 115760, 'loss/train': 1.354278802871704} 11/07/2021 13:30:12 - INFO - __main__ - Step 115762: {'lr': 6.317227678579013e-05, 'samples': 22226304, 'steps': 115761, 'loss/train': 1.6131277084350586} 11/07/2021 13:30:13 - INFO - __main__ - Step 115763: {'lr': 6.316875063131186e-05, 'samples': 22226496, 'steps': 115762, 'loss/train': 1.2830517292022705} 11/07/2021 13:30:13 - INFO - __main__ - Step 115764: {'lr': 6.316522456101693e-05, 'samples': 22226688, 'steps': 115763, 'loss/train': 1.5164965391159058} 11/07/2021 13:30:14 - INFO - __main__ - Step 115765: {'lr': 6.316169857490678e-05, 'samples': 22226880, 'steps': 115764, 'loss/train': 1.2797549962997437} 11/07/2021 13:30:15 - INFO - __main__ - Step 115766: {'lr': 6.315817267298307e-05, 'samples': 22227072, 'steps': 115765, 'loss/train': 1.3897075653076172} 11/07/2021 13:30:15 - INFO - __main__ - Step 115767: {'lr': 6.315464685524744e-05, 'samples': 22227264, 'steps': 115766, 'loss/train': 1.6430052518844604} 11/07/2021 13:30:15 - INFO - __main__ - Step 115768: {'lr': 6.315112112170143e-05, 'samples': 22227456, 'steps': 115767, 'loss/train': 0.9618814587593079} 11/07/2021 13:30:16 - INFO - __main__ - Step 115769: {'lr': 6.314759547234664e-05, 'samples': 22227648, 'steps': 115768, 'loss/train': 1.2622894048690796} 11/07/2021 13:30:16 - INFO - __main__ - Step 115770: {'lr': 6.314406990718466e-05, 'samples': 22227840, 'steps': 115769, 'loss/train': 1.3455204963684082} 11/07/2021 13:30:17 - INFO - __main__ - Step 115771: {'lr': 6.314054442621709e-05, 'samples': 22228032, 'steps': 115770, 'loss/train': 2.6062259674072266} 11/07/2021 13:30:17 - INFO - __main__ - Step 115772: {'lr': 6.313701902944549e-05, 'samples': 22228224, 'steps': 115771, 'loss/train': 0.6866845488548279} 11/07/2021 13:30:18 - INFO - __main__ - Step 115773: {'lr': 6.313349371687147e-05, 'samples': 22228416, 'steps': 115772, 'loss/train': 1.5108726024627686} 11/07/2021 13:30:18 - INFO - __main__ - Step 115774: {'lr': 6.312996848849662e-05, 'samples': 22228608, 'steps': 115773, 'loss/train': 1.1766321659088135} 11/07/2021 13:30:19 - INFO - __main__ - Step 115775: {'lr': 6.312644334432252e-05, 'samples': 22228800, 'steps': 115774, 'loss/train': 0.9183626770973206} 11/07/2021 13:30:20 - INFO - __main__ - Step 115776: {'lr': 6.312291828435076e-05, 'samples': 22228992, 'steps': 115775, 'loss/train': 0.7691401839256287} 11/07/2021 13:30:20 - INFO - __main__ - Step 115777: {'lr': 6.311939330858293e-05, 'samples': 22229184, 'steps': 115776, 'loss/train': 1.6097959280014038} 11/07/2021 13:30:20 - INFO - __main__ - Step 115778: {'lr': 6.311586841702069e-05, 'samples': 22229376, 'steps': 115777, 'loss/train': 1.2755056619644165} 11/07/2021 13:30:21 - INFO - __main__ - Step 115779: {'lr': 6.31123436096655e-05, 'samples': 22229568, 'steps': 115778, 'loss/train': 0.981674313545227} 11/07/2021 13:30:21 - INFO - __main__ - Step 115780: {'lr': 6.310881888651898e-05, 'samples': 22229760, 'steps': 115779, 'loss/train': 1.3463863134384155} 11/07/2021 13:30:22 - INFO - __main__ - Step 115781: {'lr': 6.310529424758276e-05, 'samples': 22229952, 'steps': 115780, 'loss/train': 1.2028521299362183} 11/07/2021 13:30:22 - INFO - __main__ - Step 115782: {'lr': 6.310176969285839e-05, 'samples': 22230144, 'steps': 115781, 'loss/train': 1.3336505889892578} 11/07/2021 13:30:23 - INFO - __main__ - Step 115783: {'lr': 6.30982452223475e-05, 'samples': 22230336, 'steps': 115782, 'loss/train': 0.7593415975570679} 11/07/2021 13:30:23 - INFO - __main__ - Step 115784: {'lr': 6.309472083605165e-05, 'samples': 22230528, 'steps': 115783, 'loss/train': 1.3988027572631836} 11/07/2021 13:30:24 - INFO - __main__ - Step 115785: {'lr': 6.309119653397241e-05, 'samples': 22230720, 'steps': 115784, 'loss/train': 1.4677443504333496} 11/07/2021 13:30:25 - INFO - __main__ - Step 115786: {'lr': 6.308767231611142e-05, 'samples': 22230912, 'steps': 115785, 'loss/train': 1.5196582078933716} 11/07/2021 13:30:25 - INFO - __main__ - Step 115787: {'lr': 6.308414818247024e-05, 'samples': 22231104, 'steps': 115786, 'loss/train': 1.2481884956359863} 11/07/2021 13:30:25 - INFO - __main__ - Step 115788: {'lr': 6.308062413305046e-05, 'samples': 22231296, 'steps': 115787, 'loss/train': 0.9102097749710083} 11/07/2021 13:30:26 - INFO - __main__ - Step 115789: {'lr': 6.307710016785365e-05, 'samples': 22231488, 'steps': 115788, 'loss/train': 1.6287089586257935} 11/07/2021 13:30:26 - INFO - __main__ - Step 115790: {'lr': 6.307357628688143e-05, 'samples': 22231680, 'steps': 115789, 'loss/train': 1.6089298725128174} 11/07/2021 13:30:27 - INFO - __main__ - Step 115791: {'lr': 6.307005249013545e-05, 'samples': 22231872, 'steps': 115790, 'loss/train': 1.394788384437561} 11/07/2021 13:30:27 - INFO - __main__ - Step 115792: {'lr': 6.306652877761712e-05, 'samples': 22232064, 'steps': 115791, 'loss/train': 1.270636796951294} 11/07/2021 13:30:28 - INFO - __main__ - Step 115793: {'lr': 6.306300514932814e-05, 'samples': 22232256, 'steps': 115792, 'loss/train': 1.2645933628082275} 11/07/2021 13:30:28 - INFO - __main__ - Step 115794: {'lr': 6.305948160527009e-05, 'samples': 22232448, 'steps': 115793, 'loss/train': 1.3718864917755127} 11/07/2021 13:30:28 - INFO - __main__ - Step 115795: {'lr': 6.305595814544458e-05, 'samples': 22232640, 'steps': 115794, 'loss/train': 1.192461609840393} 11/07/2021 13:30:29 - INFO - __main__ - Step 115796: {'lr': 6.305243476985311e-05, 'samples': 22232832, 'steps': 115795, 'loss/train': 1.6559984683990479} 11/07/2021 13:30:30 - INFO - __main__ - Step 115797: {'lr': 6.304891147849737e-05, 'samples': 22233024, 'steps': 115796, 'loss/train': 1.1830228567123413} 11/07/2021 13:30:30 - INFO - __main__ - Step 115798: {'lr': 6.30453882713789e-05, 'samples': 22233216, 'steps': 115797, 'loss/train': 1.3825318813323975} 11/07/2021 13:30:30 - INFO - __main__ - Step 115799: {'lr': 6.304186514849928e-05, 'samples': 22233408, 'steps': 115798, 'loss/train': 0.04802282899618149} 11/07/2021 13:30:31 - INFO - __main__ - Step 115800: {'lr': 6.303834210986012e-05, 'samples': 22233600, 'steps': 115799, 'loss/train': 1.1007945537567139} 11/07/2021 13:30:31 - INFO - __main__ - Step 115801: {'lr': 6.303481915546299e-05, 'samples': 22233792, 'steps': 115800, 'loss/train': 1.5669426918029785} 11/07/2021 13:30:32 - INFO - __main__ - Step 115802: {'lr': 6.303129628530957e-05, 'samples': 22233984, 'steps': 115801, 'loss/train': 1.1240108013153076} 11/07/2021 13:30:33 - INFO - __main__ - Step 115803: {'lr': 6.302777349940128e-05, 'samples': 22234176, 'steps': 115802, 'loss/train': 1.3482338190078735} 11/07/2021 13:30:33 - INFO - __main__ - Step 115804: {'lr': 6.302425079773979e-05, 'samples': 22234368, 'steps': 115803, 'loss/train': 1.1147558689117432} 11/07/2021 13:30:33 - INFO - __main__ - Step 115805: {'lr': 6.302072818032672e-05, 'samples': 22234560, 'steps': 115804, 'loss/train': 1.0261918306350708} 11/07/2021 13:30:34 - INFO - __main__ - Step 115806: {'lr': 6.30172056471636e-05, 'samples': 22234752, 'steps': 115805, 'loss/train': 1.0780720710754395} 11/07/2021 13:30:35 - INFO - __main__ - Step 115807: {'lr': 6.301368319825204e-05, 'samples': 22234944, 'steps': 115806, 'loss/train': 1.2767361402511597} 11/07/2021 13:30:35 - INFO - __main__ - Step 115808: {'lr': 6.301016083359362e-05, 'samples': 22235136, 'steps': 115807, 'loss/train': 3.640977144241333} 11/07/2021 13:30:35 - INFO - __main__ - Step 115809: {'lr': 6.300663855318994e-05, 'samples': 22235328, 'steps': 115808, 'loss/train': 1.0541926622390747} 11/07/2021 13:30:36 - INFO - __main__ - Step 115810: {'lr': 6.300311635704259e-05, 'samples': 22235520, 'steps': 115809, 'loss/train': 1.3467758893966675} 11/07/2021 13:30:36 - INFO - __main__ - Step 115811: {'lr': 6.299959424515314e-05, 'samples': 22235712, 'steps': 115810, 'loss/train': 1.004647135734558} 11/07/2021 13:30:37 - INFO - __main__ - Step 115812: {'lr': 6.299607221752327e-05, 'samples': 22235904, 'steps': 115811, 'loss/train': 1.5670747756958008} 11/07/2021 13:30:37 - INFO - __main__ - Step 115813: {'lr': 6.299255027415443e-05, 'samples': 22236096, 'steps': 115812, 'loss/train': 1.5078030824661255} 11/07/2021 13:30:38 - INFO - __main__ - Step 115814: {'lr': 6.298902841504822e-05, 'samples': 22236288, 'steps': 115813, 'loss/train': 1.2493822574615479} 11/07/2021 13:30:38 - INFO - __main__ - Step 115815: {'lr': 6.29855066402063e-05, 'samples': 22236480, 'steps': 115814, 'loss/train': 1.1178672313690186} 11/07/2021 13:30:38 - INFO - __main__ - Step 115816: {'lr': 6.29819849496302e-05, 'samples': 22236672, 'steps': 115815, 'loss/train': 0.9093924164772034} 11/07/2021 13:30:40 - INFO - __main__ - Step 115817: {'lr': 6.297846334332155e-05, 'samples': 22236864, 'steps': 115816, 'loss/train': 1.369698405265808} 11/07/2021 13:30:40 - INFO - __main__ - Step 115818: {'lr': 6.297494182128192e-05, 'samples': 22237056, 'steps': 115817, 'loss/train': 0.9926871061325073} 11/07/2021 13:30:40 - INFO - __main__ - Step 115819: {'lr': 6.297142038351289e-05, 'samples': 22237248, 'steps': 115818, 'loss/train': 1.887786626815796} 11/07/2021 13:30:41 - INFO - __main__ - Step 115820: {'lr': 6.296789903001604e-05, 'samples': 22237440, 'steps': 115819, 'loss/train': 1.4687551259994507} 11/07/2021 13:30:41 - INFO - __main__ - Step 115821: {'lr': 6.2964377760793e-05, 'samples': 22237632, 'steps': 115820, 'loss/train': 0.9832595586776733} 11/07/2021 13:30:42 - INFO - __main__ - Step 115822: {'lr': 6.29608565758453e-05, 'samples': 22237824, 'steps': 115821, 'loss/train': 1.738234281539917} 11/07/2021 13:30:42 - INFO - __main__ - Step 115823: {'lr': 6.295733547517463e-05, 'samples': 22238016, 'steps': 115822, 'loss/train': 1.5836777687072754} 11/07/2021 13:30:43 - INFO - __main__ - Step 115824: {'lr': 6.295381445878243e-05, 'samples': 22238208, 'steps': 115823, 'loss/train': 0.56464684009552} 11/07/2021 13:30:43 - INFO - __main__ - Step 115825: {'lr': 6.295029352667033e-05, 'samples': 22238400, 'steps': 115824, 'loss/train': 1.0400965213775635} 11/07/2021 13:30:43 - INFO - __main__ - Step 115826: {'lr': 6.294677267883997e-05, 'samples': 22238592, 'steps': 115825, 'loss/train': 1.6723731756210327} 11/07/2021 13:30:44 - INFO - __main__ - Step 115827: {'lr': 6.29432519152929e-05, 'samples': 22238784, 'steps': 115826, 'loss/train': 1.2424120903015137} 11/07/2021 13:30:45 - INFO - __main__ - Step 115828: {'lr': 6.293973123603073e-05, 'samples': 22238976, 'steps': 115827, 'loss/train': 1.5989980697631836} 11/07/2021 13:30:45 - INFO - __main__ - Step 115829: {'lr': 6.293621064105501e-05, 'samples': 22239168, 'steps': 115828, 'loss/train': 1.4981971979141235} 11/07/2021 13:30:45 - INFO - __main__ - Step 115830: {'lr': 6.293269013036734e-05, 'samples': 22239360, 'steps': 115829, 'loss/train': 1.4543663263320923} 11/07/2021 13:30:46 - INFO - __main__ - Step 115831: {'lr': 6.292916970396934e-05, 'samples': 22239552, 'steps': 115830, 'loss/train': 1.6549468040466309} 11/07/2021 13:30:46 - INFO - __main__ - Step 115832: {'lr': 6.292564936186254e-05, 'samples': 22239744, 'steps': 115831, 'loss/train': 1.6316282749176025} 11/07/2021 13:30:47 - INFO - __main__ - Step 115833: {'lr': 6.292212910404857e-05, 'samples': 22239936, 'steps': 115832, 'loss/train': 1.251482367515564} 11/07/2021 13:30:48 - INFO - __main__ - Step 115834: {'lr': 6.291860893052908e-05, 'samples': 22240128, 'steps': 115833, 'loss/train': 1.4081352949142456} 11/07/2021 13:30:48 - INFO - __main__ - Step 115835: {'lr': 6.291508884130548e-05, 'samples': 22240320, 'steps': 115834, 'loss/train': 0.810236930847168} 11/07/2021 13:30:48 - INFO - __main__ - Step 115836: {'lr': 6.29115688363795e-05, 'samples': 22240512, 'steps': 115835, 'loss/train': 1.0337283611297607} 11/07/2021 13:30:49 - INFO - __main__ - Step 115837: {'lr': 6.290804891575263e-05, 'samples': 22240704, 'steps': 115836, 'loss/train': 0.7696272730827332} 11/07/2021 13:30:50 - INFO - __main__ - Step 115838: {'lr': 6.290452907942653e-05, 'samples': 22240896, 'steps': 115837, 'loss/train': 1.4344285726547241} 11/07/2021 13:30:50 - INFO - __main__ - Step 115839: {'lr': 6.290100932740278e-05, 'samples': 22241088, 'steps': 115838, 'loss/train': 0.5621609687805176} 11/07/2021 13:30:50 - INFO - __main__ - Step 115840: {'lr': 6.289748965968292e-05, 'samples': 22241280, 'steps': 115839, 'loss/train': 1.0876197814941406} 11/07/2021 13:30:51 - INFO - __main__ - Step 115841: {'lr': 6.289397007626856e-05, 'samples': 22241472, 'steps': 115840, 'loss/train': 1.4028100967407227} 11/07/2021 13:30:51 - INFO - __main__ - Step 115842: {'lr': 6.28904505771613e-05, 'samples': 22241664, 'steps': 115841, 'loss/train': 1.233718991279602} 11/07/2021 13:30:52 - INFO - __main__ - Step 115843: {'lr': 6.288693116236275e-05, 'samples': 22241856, 'steps': 115842, 'loss/train': 1.1995761394500732} 11/07/2021 13:30:53 - INFO - __main__ - Step 115844: {'lr': 6.28834118318744e-05, 'samples': 22242048, 'steps': 115843, 'loss/train': 1.6098724603652954} 11/07/2021 13:30:53 - INFO - __main__ - Step 115845: {'lr': 6.287989258569801e-05, 'samples': 22242240, 'steps': 115844, 'loss/train': 2.0489602088928223} 11/07/2021 13:30:53 - INFO - __main__ - Step 115846: {'lr': 6.287637342383498e-05, 'samples': 22242432, 'steps': 115845, 'loss/train': 1.5536202192306519} 11/07/2021 13:30:54 - INFO - __main__ - Step 115847: {'lr': 6.287285434628696e-05, 'samples': 22242624, 'steps': 115846, 'loss/train': 1.6124464273452759} 11/07/2021 13:30:55 - INFO - __main__ - Step 115848: {'lr': 6.286933535305556e-05, 'samples': 22242816, 'steps': 115847, 'loss/train': 1.1672512292861938} 11/07/2021 13:30:55 - INFO - __main__ - Step 115849: {'lr': 6.286581644414233e-05, 'samples': 22243008, 'steps': 115848, 'loss/train': 1.4026622772216797} 11/07/2021 13:30:55 - INFO - __main__ - Step 115850: {'lr': 6.286229761954887e-05, 'samples': 22243200, 'steps': 115849, 'loss/train': 1.3415471315383911} 11/07/2021 13:30:56 - INFO - __main__ - Step 115851: {'lr': 6.285877887927676e-05, 'samples': 22243392, 'steps': 115850, 'loss/train': 0.6561129689216614} 11/07/2021 13:30:56 - INFO - __main__ - Step 115852: {'lr': 6.285526022332763e-05, 'samples': 22243584, 'steps': 115851, 'loss/train': 1.0498775243759155} 11/07/2021 13:30:57 - INFO - __main__ - Step 115853: {'lr': 6.285174165170302e-05, 'samples': 22243776, 'steps': 115852, 'loss/train': 1.5972059965133667} 11/07/2021 13:30:57 - INFO - __main__ - Step 115854: {'lr': 6.284822316440452e-05, 'samples': 22243968, 'steps': 115853, 'loss/train': 1.1468936204910278} 11/07/2021 13:30:58 - INFO - __main__ - Step 115855: {'lr': 6.284470476143372e-05, 'samples': 22244160, 'steps': 115854, 'loss/train': 1.6557905673980713} 11/07/2021 13:30:58 - INFO - __main__ - Step 115856: {'lr': 6.284118644279224e-05, 'samples': 22244352, 'steps': 115855, 'loss/train': 0.5637621283531189} 11/07/2021 13:30:58 - INFO - __main__ - Step 115857: {'lr': 6.283766820848161e-05, 'samples': 22244544, 'steps': 115856, 'loss/train': 1.575150728225708} 11/07/2021 13:31:00 - INFO - __main__ - Step 115858: {'lr': 6.283415005850343e-05, 'samples': 22244736, 'steps': 115857, 'loss/train': 1.0173906087875366} 11/07/2021 13:31:00 - INFO - __main__ - Step 115859: {'lr': 6.283063199285938e-05, 'samples': 22244928, 'steps': 115858, 'loss/train': 0.8503777384757996} 11/07/2021 13:31:00 - INFO - __main__ - Step 115860: {'lr': 6.282711401155089e-05, 'samples': 22245120, 'steps': 115859, 'loss/train': 1.1268442869186401} 11/07/2021 13:31:01 - INFO - __main__ - Step 115861: {'lr': 6.28235961145796e-05, 'samples': 22245312, 'steps': 115860, 'loss/train': 1.5261855125427246} 11/07/2021 13:31:01 - INFO - __main__ - Step 115862: {'lr': 6.28200783019471e-05, 'samples': 22245504, 'steps': 115861, 'loss/train': 1.3729194402694702} 11/07/2021 13:31:01 - INFO - __main__ - Step 115863: {'lr': 6.2816560573655e-05, 'samples': 22245696, 'steps': 115862, 'loss/train': 1.4959882497787476} 11/07/2021 13:31:02 - INFO - __main__ - Step 115864: {'lr': 6.281304292970489e-05, 'samples': 22245888, 'steps': 115863, 'loss/train': 1.6059846878051758} 11/07/2021 13:31:03 - INFO - __main__ - Step 115865: {'lr': 6.28095253700983e-05, 'samples': 22246080, 'steps': 115864, 'loss/train': 1.640284538269043} 11/07/2021 13:31:03 - INFO - __main__ - Step 115866: {'lr': 6.280600789483686e-05, 'samples': 22246272, 'steps': 115865, 'loss/train': 1.3214348554611206} 11/07/2021 13:31:03 - INFO - __main__ - Step 115867: {'lr': 6.280249050392215e-05, 'samples': 22246464, 'steps': 115866, 'loss/train': 1.1372976303100586} 11/07/2021 13:31:04 - INFO - __main__ - Step 115868: {'lr': 6.279897319735576e-05, 'samples': 22246656, 'steps': 115867, 'loss/train': 0.8470140099525452} 11/07/2021 13:31:05 - INFO - __main__ - Step 115869: {'lr': 6.279545597513925e-05, 'samples': 22246848, 'steps': 115868, 'loss/train': 1.3761392831802368} 11/07/2021 13:31:05 - INFO - __main__ - Step 115870: {'lr': 6.279193883727421e-05, 'samples': 22247040, 'steps': 115869, 'loss/train': 1.4368672370910645} 11/07/2021 13:31:06 - INFO - __main__ - Step 115871: {'lr': 6.278842178376224e-05, 'samples': 22247232, 'steps': 115870, 'loss/train': 1.3563722372055054} 11/07/2021 13:31:06 - INFO - __main__ - Step 115872: {'lr': 6.2784904814605e-05, 'samples': 22247424, 'steps': 115871, 'loss/train': 1.4498220682144165} 11/07/2021 13:31:06 - INFO - __main__ - Step 115873: {'lr': 6.27813879298039e-05, 'samples': 22247616, 'steps': 115872, 'loss/train': 1.1625605821609497} 11/07/2021 13:31:07 - INFO - __main__ - Step 115874: {'lr': 6.277787112936065e-05, 'samples': 22247808, 'steps': 115873, 'loss/train': 1.2579989433288574} 11/07/2021 13:31:08 - INFO - __main__ - Step 115875: {'lr': 6.277435441327678e-05, 'samples': 22248000, 'steps': 115874, 'loss/train': 0.43649590015411377} 11/07/2021 13:31:08 - INFO - __main__ - Step 115876: {'lr': 6.27708377815539e-05, 'samples': 22248192, 'steps': 115875, 'loss/train': 1.4135327339172363} 11/07/2021 13:31:08 - INFO - __main__ - Step 115877: {'lr': 6.27673212341936e-05, 'samples': 22248384, 'steps': 115876, 'loss/train': 1.409812092781067} 11/07/2021 13:31:09 - INFO - __main__ - Step 115878: {'lr': 6.276380477119742e-05, 'samples': 22248576, 'steps': 115877, 'loss/train': 1.8611207008361816} 11/07/2021 13:31:10 - INFO - __main__ - Step 115879: {'lr': 6.276028839256703e-05, 'samples': 22248768, 'steps': 115878, 'loss/train': 1.6029975414276123} 11/07/2021 13:31:10 - INFO - __main__ - Step 115880: {'lr': 6.275677209830393e-05, 'samples': 22248960, 'steps': 115879, 'loss/train': 1.5427080392837524} 11/07/2021 13:31:10 - INFO - __main__ - Step 115881: {'lr': 6.275325588840975e-05, 'samples': 22249152, 'steps': 115880, 'loss/train': 1.4358173608779907} 11/07/2021 13:31:11 - INFO - __main__ - Step 115882: {'lr': 6.274973976288606e-05, 'samples': 22249344, 'steps': 115881, 'loss/train': 1.4506100416183472} 11/07/2021 13:31:11 - INFO - __main__ - Step 115883: {'lr': 6.274622372173447e-05, 'samples': 22249536, 'steps': 115882, 'loss/train': 0.9600144624710083} 11/07/2021 13:31:12 - INFO - __main__ - Step 115884: {'lr': 6.274270776495652e-05, 'samples': 22249728, 'steps': 115883, 'loss/train': 1.6726652383804321} 11/07/2021 13:31:12 - INFO - __main__ - Step 115885: {'lr': 6.273919189255389e-05, 'samples': 22249920, 'steps': 115884, 'loss/train': 1.1418819427490234} 11/07/2021 13:31:13 - INFO - __main__ - Step 115886: {'lr': 6.273567610452801e-05, 'samples': 22250112, 'steps': 115885, 'loss/train': 1.393389344215393} 11/07/2021 13:31:13 - INFO - __main__ - Step 115887: {'lr': 6.273216040088056e-05, 'samples': 22250304, 'steps': 115886, 'loss/train': 0.985611081123352} 11/07/2021 13:31:13 - INFO - __main__ - Step 115888: {'lr': 6.272864478161311e-05, 'samples': 22250496, 'steps': 115887, 'loss/train': 1.3990792036056519} 11/07/2021 13:31:14 - INFO - __main__ - Step 115889: {'lr': 6.272512924672725e-05, 'samples': 22250688, 'steps': 115888, 'loss/train': 1.444237470626831} 11/07/2021 13:31:15 - INFO - __main__ - Step 115890: {'lr': 6.272161379622454e-05, 'samples': 22250880, 'steps': 115889, 'loss/train': 1.2998297214508057} 11/07/2021 13:31:15 - INFO - __main__ - Step 115891: {'lr': 6.271809843010659e-05, 'samples': 22251072, 'steps': 115890, 'loss/train': 1.7408510446548462} 11/07/2021 13:31:16 - INFO - __main__ - Step 115892: {'lr': 6.271458314837498e-05, 'samples': 22251264, 'steps': 115891, 'loss/train': 0.9643539786338806} 11/07/2021 13:31:16 - INFO - __main__ - Step 115893: {'lr': 6.271106795103127e-05, 'samples': 22251456, 'steps': 115892, 'loss/train': 0.554351270198822} 11/07/2021 13:31:17 - INFO - __main__ - Step 115894: {'lr': 6.270755283807708e-05, 'samples': 22251648, 'steps': 115893, 'loss/train': 1.5333240032196045} 11/07/2021 13:31:17 - INFO - __main__ - Step 115895: {'lr': 6.270403780951394e-05, 'samples': 22251840, 'steps': 115894, 'loss/train': 1.3104543685913086} 11/07/2021 13:31:18 - INFO - __main__ - Step 115896: {'lr': 6.270052286534353e-05, 'samples': 22252032, 'steps': 115895, 'loss/train': 1.2423362731933594} 11/07/2021 13:31:18 - INFO - __main__ - Step 115897: {'lr': 6.269700800556732e-05, 'samples': 22252224, 'steps': 115896, 'loss/train': 1.340396523475647} 11/07/2021 13:31:18 - INFO - __main__ - Step 115898: {'lr': 6.2693493230187e-05, 'samples': 22252416, 'steps': 115897, 'loss/train': 1.4588254690170288} 11/07/2021 13:31:19 - INFO - __main__ - Step 115899: {'lr': 6.268997853920413e-05, 'samples': 22252608, 'steps': 115898, 'loss/train': 0.20702891051769257} 11/07/2021 13:31:20 - INFO - __main__ - Step 115900: {'lr': 6.26864639326202e-05, 'samples': 22252800, 'steps': 115899, 'loss/train': 1.383514165878296} 11/07/2021 13:31:20 - INFO - __main__ - Step 115901: {'lr': 6.268294941043687e-05, 'samples': 22252992, 'steps': 115900, 'loss/train': 0.5539613366127014} 11/07/2021 13:31:21 - INFO - __main__ - Step 115902: {'lr': 6.267943497265571e-05, 'samples': 22253184, 'steps': 115901, 'loss/train': 1.402698278427124} 11/07/2021 13:31:21 - INFO - __main__ - Step 115903: {'lr': 6.267592061927833e-05, 'samples': 22253376, 'steps': 115902, 'loss/train': 1.2877668142318726} 11/07/2021 13:31:21 - INFO - __main__ - Step 115904: {'lr': 6.267240635030624e-05, 'samples': 22253568, 'steps': 115903, 'loss/train': 0.7637879252433777} 11/07/2021 13:31:22 - INFO - __main__ - Step 115905: {'lr': 6.266889216574112e-05, 'samples': 22253760, 'steps': 115904, 'loss/train': 1.3150761127471924} 11/07/2021 13:31:23 - INFO - __main__ - Step 115906: {'lr': 6.266537806558448e-05, 'samples': 22253952, 'steps': 115905, 'loss/train': 1.338979959487915} 11/07/2021 13:31:23 - INFO - __main__ - Step 115907: {'lr': 6.266186404983792e-05, 'samples': 22254144, 'steps': 115906, 'loss/train': 1.5587244033813477} 11/07/2021 13:31:23 - INFO - __main__ - Step 115908: {'lr': 6.265835011850307e-05, 'samples': 22254336, 'steps': 115907, 'loss/train': 1.4392871856689453} 11/07/2021 13:31:24 - INFO - __main__ - Step 115909: {'lr': 6.265483627158144e-05, 'samples': 22254528, 'steps': 115908, 'loss/train': 1.2337597608566284} 11/07/2021 13:31:26 - INFO - __main__ - Step 115910: {'lr': 6.265132250907468e-05, 'samples': 22254720, 'steps': 115909, 'loss/train': 1.4789201021194458} 11/07/2021 13:31:26 - INFO - __main__ - Step 115911: {'lr': 6.264780883098431e-05, 'samples': 22254912, 'steps': 115910, 'loss/train': 1.3399816751480103} 11/07/2021 13:31:26 - INFO - __main__ - Step 115912: {'lr': 6.264429523731205e-05, 'samples': 22255104, 'steps': 115911, 'loss/train': 1.182593822479248} 11/07/2021 13:31:27 - INFO - __main__ - Step 115913: {'lr': 6.264078172805929e-05, 'samples': 22255296, 'steps': 115912, 'loss/train': 1.7447929382324219} 11/07/2021 13:31:27 - INFO - __main__ - Step 115914: {'lr': 6.263726830322772e-05, 'samples': 22255488, 'steps': 115913, 'loss/train': 1.7519863843917847} 11/07/2021 13:31:27 - INFO - __main__ - Step 115915: {'lr': 6.26337549628189e-05, 'samples': 22255680, 'steps': 115914, 'loss/train': 1.009513020515442} 11/07/2021 13:31:28 - INFO - __main__ - Step 115916: {'lr': 6.26302417068344e-05, 'samples': 22255872, 'steps': 115915, 'loss/train': 1.3428471088409424} 11/07/2021 13:31:29 - INFO - __main__ - Step 115917: {'lr': 6.262672853527581e-05, 'samples': 22256064, 'steps': 115916, 'loss/train': 1.5518817901611328} 11/07/2021 13:31:29 - INFO - __main__ - Step 115918: {'lr': 6.262321544814476e-05, 'samples': 22256256, 'steps': 115917, 'loss/train': 0.7824559211730957} 11/07/2021 13:31:29 - INFO - __main__ - Step 115919: {'lr': 6.26197024454428e-05, 'samples': 22256448, 'steps': 115918, 'loss/train': 1.6101793050765991} 11/07/2021 13:31:30 - INFO - __main__ - Step 115920: {'lr': 6.261618952717149e-05, 'samples': 22256640, 'steps': 115919, 'loss/train': 1.5109494924545288} 11/07/2021 13:31:30 - INFO - __main__ - Step 115921: {'lr': 6.261267669333242e-05, 'samples': 22256832, 'steps': 115920, 'loss/train': 1.4497830867767334} 11/07/2021 13:31:31 - INFO - __main__ - Step 115922: {'lr': 6.260916394392721e-05, 'samples': 22257024, 'steps': 115921, 'loss/train': 1.0890744924545288} 11/07/2021 13:31:31 - INFO - __main__ - Step 115923: {'lr': 6.260565127895743e-05, 'samples': 22257216, 'steps': 115922, 'loss/train': 1.4480574131011963} 11/07/2021 13:31:32 - INFO - __main__ - Step 115924: {'lr': 6.260213869842462e-05, 'samples': 22257408, 'steps': 115923, 'loss/train': 1.5392054319381714} 11/07/2021 13:31:32 - INFO - __main__ - Step 115925: {'lr': 6.259862620233043e-05, 'samples': 22257600, 'steps': 115924, 'loss/train': 1.4185467958450317} 11/07/2021 13:31:33 - INFO - __main__ - Step 115926: {'lr': 6.259511379067645e-05, 'samples': 22257792, 'steps': 115925, 'loss/train': 1.8481420278549194} 11/07/2021 13:31:34 - INFO - __main__ - Step 115927: {'lr': 6.259160146346416e-05, 'samples': 22257984, 'steps': 115926, 'loss/train': 1.134596347808838} 11/07/2021 13:31:34 - INFO - __main__ - Step 115928: {'lr': 6.25880892206952e-05, 'samples': 22258176, 'steps': 115927, 'loss/train': 1.1184523105621338} 11/07/2021 13:31:34 - INFO - __main__ - Step 115929: {'lr': 6.258457706237116e-05, 'samples': 22258368, 'steps': 115928, 'loss/train': 1.385659098625183} 11/07/2021 13:31:35 - INFO - __main__ - Step 115930: {'lr': 6.258106498849361e-05, 'samples': 22258560, 'steps': 115929, 'loss/train': 1.2312973737716675} 11/07/2021 13:31:35 - INFO - __main__ - Step 115931: {'lr': 6.257755299906415e-05, 'samples': 22258752, 'steps': 115930, 'loss/train': 1.6338481903076172} 11/07/2021 13:31:36 - INFO - __main__ - Step 115932: {'lr': 6.257404109408435e-05, 'samples': 22258944, 'steps': 115931, 'loss/train': 1.6640712022781372} 11/07/2021 13:31:36 - INFO - __main__ - Step 115933: {'lr': 6.257052927355577e-05, 'samples': 22259136, 'steps': 115932, 'loss/train': 1.0067121982574463} 11/07/2021 13:31:37 - INFO - __main__ - Step 115934: {'lr': 6.256701753748007e-05, 'samples': 22259328, 'steps': 115933, 'loss/train': 1.4874571561813354} 11/07/2021 13:31:37 - INFO - __main__ - Step 115935: {'lr': 6.256350588585873e-05, 'samples': 22259520, 'steps': 115934, 'loss/train': 1.0196914672851562} 11/07/2021 13:31:37 - INFO - __main__ - Step 115936: {'lr': 6.255999431869342e-05, 'samples': 22259712, 'steps': 115935, 'loss/train': 1.3454203605651855} 11/07/2021 13:31:39 - INFO - __main__ - Step 115937: {'lr': 6.255648283598565e-05, 'samples': 22259904, 'steps': 115936, 'loss/train': 1.3092074394226074} 11/07/2021 13:31:39 - INFO - __main__ - Step 115938: {'lr': 6.255297143773705e-05, 'samples': 22260096, 'steps': 115937, 'loss/train': 1.479955792427063} 11/07/2021 13:31:39 - INFO - __main__ - Step 115939: {'lr': 6.254946012394926e-05, 'samples': 22260288, 'steps': 115938, 'loss/train': 1.258827805519104} 11/07/2021 13:31:40 - INFO - __main__ - Step 115940: {'lr': 6.254594889462373e-05, 'samples': 22260480, 'steps': 115939, 'loss/train': 1.4366220235824585} 11/07/2021 13:31:40 - INFO - __main__ - Step 115941: {'lr': 6.25424377497621e-05, 'samples': 22260672, 'steps': 115940, 'loss/train': 1.7014758586883545} 11/07/2021 13:31:40 - INFO - __main__ - Step 115942: {'lr': 6.253892668936593e-05, 'samples': 22260864, 'steps': 115941, 'loss/train': 1.2770692110061646} 11/07/2021 13:31:41 - INFO - __main__ - Step 115943: {'lr': 6.253541571343686e-05, 'samples': 22261056, 'steps': 115942, 'loss/train': 1.2695136070251465} 11/07/2021 13:31:42 - INFO - __main__ - Step 115944: {'lr': 6.25319048219764e-05, 'samples': 22261248, 'steps': 115943, 'loss/train': 1.1584328413009644} 11/07/2021 13:31:42 - INFO - __main__ - Step 115945: {'lr': 6.25283940149862e-05, 'samples': 22261440, 'steps': 115944, 'loss/train': 1.1324917078018188} 11/07/2021 13:31:42 - INFO - __main__ - Step 115946: {'lr': 6.25248832924678e-05, 'samples': 22261632, 'steps': 115945, 'loss/train': 1.602713704109192} 11/07/2021 13:31:43 - INFO - __main__ - Step 115947: {'lr': 6.252137265442282e-05, 'samples': 22261824, 'steps': 115946, 'loss/train': 1.878832459449768} 11/07/2021 13:31:44 - INFO - __main__ - Step 115948: {'lr': 6.251786210085281e-05, 'samples': 22262016, 'steps': 115947, 'loss/train': 1.448452353477478} 11/07/2021 13:31:44 - INFO - __main__ - Step 115949: {'lr': 6.251435163175933e-05, 'samples': 22262208, 'steps': 115948, 'loss/train': 1.0870848894119263} 11/07/2021 13:31:44 - INFO - __main__ - Step 115950: {'lr': 6.251084124714402e-05, 'samples': 22262400, 'steps': 115949, 'loss/train': 0.8140231370925903} 11/07/2021 13:31:45 - INFO - __main__ - Step 115951: {'lr': 6.250733094700842e-05, 'samples': 22262592, 'steps': 115950, 'loss/train': 1.6386315822601318} 11/07/2021 13:31:45 - INFO - __main__ - Step 115952: {'lr': 6.25038207313541e-05, 'samples': 22262784, 'steps': 115951, 'loss/train': 1.178579568862915} 11/07/2021 13:31:46 - INFO - __main__ - Step 115953: {'lr': 6.250031060018277e-05, 'samples': 22262976, 'steps': 115952, 'loss/train': 1.2830454111099243} 11/07/2021 13:31:47 - INFO - __main__ - Step 115954: {'lr': 6.249680055349583e-05, 'samples': 22263168, 'steps': 115953, 'loss/train': 1.280548095703125} 11/07/2021 13:31:47 - INFO - __main__ - Step 115955: {'lr': 6.249329059129494e-05, 'samples': 22263360, 'steps': 115954, 'loss/train': 1.40640389919281} 11/07/2021 13:31:47 - INFO - __main__ - Step 115956: {'lr': 6.248978071358166e-05, 'samples': 22263552, 'steps': 115955, 'loss/train': 1.2430661916732788} 11/07/2021 13:31:48 - INFO - __main__ - Step 115957: {'lr': 6.248627092035761e-05, 'samples': 22263744, 'steps': 115956, 'loss/train': 1.2996336221694946} 11/07/2021 13:31:49 - INFO - __main__ - Step 115958: {'lr': 6.248276121162432e-05, 'samples': 22263936, 'steps': 115957, 'loss/train': 0.9229205250740051} 11/07/2021 13:31:49 - INFO - __main__ - Step 115959: {'lr': 6.247925158738344e-05, 'samples': 22264128, 'steps': 115958, 'loss/train': 1.4994779825210571} 11/07/2021 13:31:49 - INFO - __main__ - Step 115960: {'lr': 6.247574204763651e-05, 'samples': 22264320, 'steps': 115959, 'loss/train': 1.0243794918060303} 11/07/2021 13:31:50 - INFO - __main__ - Step 115961: {'lr': 6.24722325923851e-05, 'samples': 22264512, 'steps': 115960, 'loss/train': 1.5021507740020752} 11/07/2021 13:31:50 - INFO - __main__ - Step 115962: {'lr': 6.246872322163083e-05, 'samples': 22264704, 'steps': 115961, 'loss/train': 1.5680806636810303} 11/07/2021 13:31:51 - INFO - __main__ - Step 115963: {'lr': 6.246521393537527e-05, 'samples': 22264896, 'steps': 115962, 'loss/train': 1.192112684249878} 11/07/2021 13:31:51 - INFO - __main__ - Step 115964: {'lr': 6.246170473361995e-05, 'samples': 22265088, 'steps': 115963, 'loss/train': 0.420183926820755} 11/07/2021 13:31:52 - INFO - __main__ - Step 115965: {'lr': 6.245819561636653e-05, 'samples': 22265280, 'steps': 115964, 'loss/train': 1.27749764919281} 11/07/2021 13:31:52 - INFO - __main__ - Step 115966: {'lr': 6.245468658361662e-05, 'samples': 22265472, 'steps': 115965, 'loss/train': 0.7860783338546753} 11/07/2021 13:31:52 - INFO - __main__ - Step 115967: {'lr': 6.245117763537164e-05, 'samples': 22265664, 'steps': 115966, 'loss/train': 1.3842439651489258} 11/07/2021 13:31:53 - INFO - __main__ - Step 115968: {'lr': 6.244766877163327e-05, 'samples': 22265856, 'steps': 115967, 'loss/train': 1.637595295906067} 11/07/2021 13:31:54 - INFO - __main__ - Step 115969: {'lr': 6.24441599924031e-05, 'samples': 22266048, 'steps': 115968, 'loss/train': 1.0250110626220703} 11/07/2021 13:31:54 - INFO - __main__ - Step 115970: {'lr': 6.244065129768267e-05, 'samples': 22266240, 'steps': 115969, 'loss/train': 1.2015146017074585} 11/07/2021 13:31:54 - INFO - __main__ - Step 115971: {'lr': 6.243714268747364e-05, 'samples': 22266432, 'steps': 115970, 'loss/train': 0.976936399936676} 11/07/2021 13:31:55 - INFO - __main__ - Step 115972: {'lr': 6.24336341617775e-05, 'samples': 22266624, 'steps': 115971, 'loss/train': 1.4531753063201904} 11/07/2021 13:31:55 - INFO - __main__ - Step 115973: {'lr': 6.243012572059586e-05, 'samples': 22266816, 'steps': 115972, 'loss/train': 1.2848137617111206} 11/07/2021 13:31:56 - INFO - __main__ - Step 115974: {'lr': 6.242661736393035e-05, 'samples': 22267008, 'steps': 115973, 'loss/train': 0.1363682597875595} 11/07/2021 13:31:57 - INFO - __main__ - Step 115975: {'lr': 6.242310909178248e-05, 'samples': 22267200, 'steps': 115974, 'loss/train': 1.3163057565689087} 11/07/2021 13:31:57 - INFO - __main__ - Step 115976: {'lr': 6.241960090415388e-05, 'samples': 22267392, 'steps': 115975, 'loss/train': 0.5008898973464966} 11/07/2021 13:31:57 - INFO - __main__ - Step 115977: {'lr': 6.24160928010461e-05, 'samples': 22267584, 'steps': 115976, 'loss/train': 1.1818844079971313} 11/07/2021 13:31:58 - INFO - __main__ - Step 115978: {'lr': 6.241258478246073e-05, 'samples': 22267776, 'steps': 115977, 'loss/train': 1.5752557516098022} 11/07/2021 13:31:59 - INFO - __main__ - Step 115979: {'lr': 6.240907684839935e-05, 'samples': 22267968, 'steps': 115978, 'loss/train': 1.0558134317398071} 11/07/2021 13:31:59 - INFO - __main__ - Step 115980: {'lr': 6.240556899886366e-05, 'samples': 22268160, 'steps': 115979, 'loss/train': 1.3053311109542847} 11/07/2021 13:32:00 - INFO - __main__ - Step 115981: {'lr': 6.240206123385503e-05, 'samples': 22268352, 'steps': 115980, 'loss/train': 1.074504017829895} 11/07/2021 13:32:00 - INFO - __main__ - Step 115982: {'lr': 6.239855355337512e-05, 'samples': 22268544, 'steps': 115981, 'loss/train': 1.5738179683685303} 11/07/2021 13:32:00 - INFO - __main__ - Step 115983: {'lr': 6.239504595742554e-05, 'samples': 22268736, 'steps': 115982, 'loss/train': 1.3814812898635864} 11/07/2021 13:32:01 - INFO - __main__ - Step 115984: {'lr': 6.239153844600787e-05, 'samples': 22268928, 'steps': 115983, 'loss/train': 1.2257964611053467} 11/07/2021 13:32:02 - INFO - __main__ - Step 115985: {'lr': 6.238803101912366e-05, 'samples': 22269120, 'steps': 115984, 'loss/train': 1.160934567451477} 11/07/2021 13:32:02 - INFO - __main__ - Step 115986: {'lr': 6.23845236767745e-05, 'samples': 22269312, 'steps': 115985, 'loss/train': 1.295892357826233} 11/07/2021 13:32:02 - INFO - __main__ - Step 115987: {'lr': 6.238101641896199e-05, 'samples': 22269504, 'steps': 115986, 'loss/train': 1.2781723737716675} 11/07/2021 13:32:03 - INFO - __main__ - Step 115988: {'lr': 6.237750924568772e-05, 'samples': 22269696, 'steps': 115987, 'loss/train': 2.0546886920928955} 11/07/2021 13:32:04 - INFO - __main__ - Step 115989: {'lr': 6.237400215695321e-05, 'samples': 22269888, 'steps': 115988, 'loss/train': 1.3847126960754395} 11/07/2021 13:32:04 - INFO - __main__ - Step 115990: {'lr': 6.23704951527601e-05, 'samples': 22270080, 'steps': 115989, 'loss/train': 1.0735890865325928} 11/07/2021 13:32:04 - INFO - __main__ - Step 115991: {'lr': 6.236698823310996e-05, 'samples': 22270272, 'steps': 115990, 'loss/train': 0.8093170523643494} 11/07/2021 13:32:05 - INFO - __main__ - Step 115992: {'lr': 6.236348139800436e-05, 'samples': 22270464, 'steps': 115991, 'loss/train': 1.5612494945526123} 11/07/2021 13:32:05 - INFO - __main__ - Step 115993: {'lr': 6.235997464744492e-05, 'samples': 22270656, 'steps': 115992, 'loss/train': 0.9681026935577393} 11/07/2021 13:32:06 - INFO - __main__ - Step 115994: {'lr': 6.235646798143313e-05, 'samples': 22270848, 'steps': 115993, 'loss/train': 1.5283169746398926} 11/07/2021 13:32:07 - INFO - __main__ - Step 115995: {'lr': 6.235296139997062e-05, 'samples': 22271040, 'steps': 115994, 'loss/train': 1.4127708673477173} 11/07/2021 13:32:07 - INFO - __main__ - Step 115996: {'lr': 6.234945490305896e-05, 'samples': 22271232, 'steps': 115995, 'loss/train': 1.373048186302185} 11/07/2021 13:32:07 - INFO - __main__ - Step 115997: {'lr': 6.234594849069975e-05, 'samples': 22271424, 'steps': 115996, 'loss/train': 1.2491912841796875} 11/07/2021 13:32:08 - INFO - __main__ - Step 115998: {'lr': 6.234244216289456e-05, 'samples': 22271616, 'steps': 115997, 'loss/train': 0.34569063782691956} 11/07/2021 13:32:08 - INFO - __main__ - Step 115999: {'lr': 6.233893591964495e-05, 'samples': 22271808, 'steps': 115998, 'loss/train': 1.0592790842056274} 11/07/2021 13:32:09 - INFO - __main__ - Step 116000: {'lr': 6.233542976095255e-05, 'samples': 22272000, 'steps': 115999, 'loss/train': 1.0353846549987793} 11/07/2021 13:32:09 - INFO - __main__ - Step 116001: {'lr': 6.23319236868189e-05, 'samples': 22272192, 'steps': 116000, 'loss/train': 1.4424717426300049} 11/07/2021 13:32:10 - INFO - __main__ - Step 116002: {'lr': 6.23284176972456e-05, 'samples': 22272384, 'steps': 116001, 'loss/train': 1.55483877658844} 11/07/2021 13:32:10 - INFO - __main__ - Step 116003: {'lr': 6.232491179223421e-05, 'samples': 22272576, 'steps': 116002, 'loss/train': 0.5293502807617188} 11/07/2021 13:32:10 - INFO - __main__ - Step 116004: {'lr': 6.232140597178629e-05, 'samples': 22272768, 'steps': 116003, 'loss/train': 1.2984873056411743} 11/07/2021 13:32:12 - INFO - __main__ - Step 116005: {'lr': 6.231790023590348e-05, 'samples': 22272960, 'steps': 116004, 'loss/train': 1.2537689208984375} 11/07/2021 13:32:12 - INFO - __main__ - Step 116006: {'lr': 6.23143945845874e-05, 'samples': 22273152, 'steps': 116005, 'loss/train': 1.8357155323028564} 11/07/2021 13:32:12 - INFO - __main__ - Step 116007: {'lr': 6.231088901783947e-05, 'samples': 22273344, 'steps': 116006, 'loss/train': 1.287537932395935} 11/07/2021 13:32:13 - INFO - __main__ - Step 116008: {'lr': 6.230738353566137e-05, 'samples': 22273536, 'steps': 116007, 'loss/train': 0.652751088142395} 11/07/2021 13:32:13 - INFO - __main__ - Step 116009: {'lr': 6.230387813805467e-05, 'samples': 22273728, 'steps': 116008, 'loss/train': 1.5597590208053589} 11/07/2021 13:32:14 - INFO - __main__ - Step 116010: {'lr': 6.230037282502093e-05, 'samples': 22273920, 'steps': 116009, 'loss/train': 1.0132783651351929} 11/07/2021 13:32:14 - INFO - __main__ - Step 116011: {'lr': 6.229686759656175e-05, 'samples': 22274112, 'steps': 116010, 'loss/train': 1.357300043106079} 11/07/2021 13:32:15 - INFO - __main__ - Step 116012: {'lr': 6.229336245267872e-05, 'samples': 22274304, 'steps': 116011, 'loss/train': 1.0272822380065918} 11/07/2021 13:32:15 - INFO - __main__ - Step 116013: {'lr': 6.22898573933734e-05, 'samples': 22274496, 'steps': 116012, 'loss/train': 1.044644832611084} 11/07/2021 13:32:15 - INFO - __main__ - Step 116014: {'lr': 6.228635241864736e-05, 'samples': 22274688, 'steps': 116013, 'loss/train': 0.6629257798194885} 11/07/2021 13:32:17 - INFO - __main__ - Step 116015: {'lr': 6.228284752850218e-05, 'samples': 22274880, 'steps': 116014, 'loss/train': 1.6819350719451904} 11/07/2021 13:32:17 - INFO - __main__ - Step 116016: {'lr': 6.227934272293947e-05, 'samples': 22275072, 'steps': 116015, 'loss/train': 1.5920199155807495} 11/07/2021 13:32:17 - INFO - __main__ - Step 116017: {'lr': 6.227583800196079e-05, 'samples': 22275264, 'steps': 116016, 'loss/train': 1.3052177429199219} 11/07/2021 13:32:18 - INFO - __main__ - Step 116018: {'lr': 6.227233336556772e-05, 'samples': 22275456, 'steps': 116017, 'loss/train': 1.2437798976898193} 11/07/2021 13:32:18 - INFO - __main__ - Step 116019: {'lr': 6.226882881376186e-05, 'samples': 22275648, 'steps': 116018, 'loss/train': 0.9031924605369568} 11/07/2021 13:32:19 - INFO - __main__ - Step 116020: {'lr': 6.226532434654484e-05, 'samples': 22275840, 'steps': 116019, 'loss/train': 1.6794986724853516} 11/07/2021 13:32:19 - INFO - __main__ - Step 116021: {'lr': 6.226181996391809e-05, 'samples': 22276032, 'steps': 116020, 'loss/train': 1.300244927406311} 11/07/2021 13:32:20 - INFO - __main__ - Step 116022: {'lr': 6.225831566588324e-05, 'samples': 22276224, 'steps': 116021, 'loss/train': 1.0802966356277466} 11/07/2021 13:32:20 - INFO - __main__ - Step 116023: {'lr': 6.22548114524419e-05, 'samples': 22276416, 'steps': 116022, 'loss/train': 1.418538212776184} 11/07/2021 13:32:20 - INFO - __main__ - Step 116024: {'lr': 6.225130732359566e-05, 'samples': 22276608, 'steps': 116023, 'loss/train': 1.2494127750396729} 11/07/2021 13:32:22 - INFO - __main__ - Step 116025: {'lr': 6.224780327934609e-05, 'samples': 22276800, 'steps': 116024, 'loss/train': 1.3129884004592896} 11/07/2021 13:32:22 - INFO - __main__ - Step 116026: {'lr': 6.224429931969474e-05, 'samples': 22276992, 'steps': 116025, 'loss/train': 1.5000474452972412} 11/07/2021 13:32:22 - INFO - __main__ - Step 116027: {'lr': 6.224079544464326e-05, 'samples': 22277184, 'steps': 116026, 'loss/train': 1.2619426250457764} 11/07/2021 13:32:23 - INFO - __main__ - Step 116028: {'lr': 6.223729165419311e-05, 'samples': 22277376, 'steps': 116027, 'loss/train': 1.5113962888717651} 11/07/2021 13:32:23 - INFO - __main__ - Step 116029: {'lr': 6.2233787948346e-05, 'samples': 22277568, 'steps': 116028, 'loss/train': 0.8028006553649902} 11/07/2021 13:32:23 - INFO - __main__ - Step 116030: {'lr': 6.223028432710343e-05, 'samples': 22277760, 'steps': 116029, 'loss/train': 1.2467466592788696} 11/07/2021 13:32:24 - INFO - __main__ - Step 116031: {'lr': 6.222678079046699e-05, 'samples': 22277952, 'steps': 116030, 'loss/train': 0.7815026640892029} 11/07/2021 13:32:25 - INFO - __main__ - Step 116032: {'lr': 6.222327733843824e-05, 'samples': 22278144, 'steps': 116031, 'loss/train': 1.6167787313461304} 11/07/2021 13:32:25 - INFO - __main__ - Step 116033: {'lr': 6.221977397101889e-05, 'samples': 22278336, 'steps': 116032, 'loss/train': 1.259174108505249} 11/07/2021 13:32:26 - INFO - __main__ - Step 116034: {'lr': 6.221627068821035e-05, 'samples': 22278528, 'steps': 116033, 'loss/train': 1.7882634401321411} 11/07/2021 13:32:26 - INFO - __main__ - Step 116035: {'lr': 6.221276749001423e-05, 'samples': 22278720, 'steps': 116034, 'loss/train': 1.2665663957595825} 11/07/2021 13:32:27 - INFO - __main__ - Step 116036: {'lr': 6.220926437643215e-05, 'samples': 22278912, 'steps': 116035, 'loss/train': 1.5184258222579956} 11/07/2021 13:32:28 - INFO - __main__ - Step 116037: {'lr': 6.220576134746567e-05, 'samples': 22279104, 'steps': 116036, 'loss/train': 1.171805500984192} 11/07/2021 13:32:28 - INFO - __main__ - Step 116038: {'lr': 6.220225840311638e-05, 'samples': 22279296, 'steps': 116037, 'loss/train': 1.1345728635787964} 11/07/2021 13:32:28 - INFO - __main__ - Step 116039: {'lr': 6.219875554338586e-05, 'samples': 22279488, 'steps': 116038, 'loss/train': 1.2595006227493286} 11/07/2021 13:32:29 - INFO - __main__ - Step 116040: {'lr': 6.21952527682757e-05, 'samples': 22279680, 'steps': 116039, 'loss/train': 1.3322585821151733} 11/07/2021 13:32:30 - INFO - __main__ - Step 116041: {'lr': 6.219175007778744e-05, 'samples': 22279872, 'steps': 116040, 'loss/train': 0.21658053994178772} 11/07/2021 13:32:30 - INFO - __main__ - Step 116042: {'lr': 6.218824747192267e-05, 'samples': 22280064, 'steps': 116041, 'loss/train': 1.3141295909881592} 11/07/2021 13:32:30 - INFO - __main__ - Step 116043: {'lr': 6.2184744950683e-05, 'samples': 22280256, 'steps': 116042, 'loss/train': 1.05606210231781} 11/07/2021 13:32:31 - INFO - __main__ - Step 116044: {'lr': 6.218124251406998e-05, 'samples': 22280448, 'steps': 116043, 'loss/train': 1.383421540260315} 11/07/2021 13:32:31 - INFO - __main__ - Step 116045: {'lr': 6.217774016208518e-05, 'samples': 22280640, 'steps': 116044, 'loss/train': 1.5227872133255005} 11/07/2021 13:32:32 - INFO - __main__ - Step 116046: {'lr': 6.21742378947302e-05, 'samples': 22280832, 'steps': 116045, 'loss/train': 1.2699717283248901} 11/07/2021 13:32:33 - INFO - __main__ - Step 116047: {'lr': 6.217073571200668e-05, 'samples': 22281024, 'steps': 116046, 'loss/train': 1.3047667741775513} 11/07/2021 13:32:33 - INFO - __main__ - Step 116048: {'lr': 6.216723361391607e-05, 'samples': 22281216, 'steps': 116047, 'loss/train': 1.1993942260742188} 11/07/2021 13:32:33 - INFO - __main__ - Step 116049: {'lr': 6.216373160045999e-05, 'samples': 22281408, 'steps': 116048, 'loss/train': 0.5935969352722168} 11/07/2021 13:32:34 - INFO - __main__ - Step 116050: {'lr': 6.216022967164004e-05, 'samples': 22281600, 'steps': 116049, 'loss/train': 1.7326775789260864} 11/07/2021 13:32:34 - INFO - __main__ - Step 116051: {'lr': 6.215672782745779e-05, 'samples': 22281792, 'steps': 116050, 'loss/train': 1.0372726917266846} 11/07/2021 13:32:35 - INFO - __main__ - Step 116052: {'lr': 6.215322606791482e-05, 'samples': 22281984, 'steps': 116051, 'loss/train': 0.9713031053543091} 11/07/2021 13:32:35 - INFO - __main__ - Step 116053: {'lr': 6.214972439301273e-05, 'samples': 22282176, 'steps': 116052, 'loss/train': 0.6728294491767883} 11/07/2021 13:32:36 - INFO - __main__ - Step 116054: {'lr': 6.214622280275304e-05, 'samples': 22282368, 'steps': 116053, 'loss/train': 1.2355514764785767} 11/07/2021 13:32:36 - INFO - __main__ - Step 116055: {'lr': 6.214272129713738e-05, 'samples': 22282560, 'steps': 116054, 'loss/train': 1.8033809661865234} 11/07/2021 13:32:36 - INFO - __main__ - Step 116056: {'lr': 6.21392198761673e-05, 'samples': 22282752, 'steps': 116055, 'loss/train': 1.3363189697265625} 11/07/2021 13:32:38 - INFO - __main__ - Step 116057: {'lr': 6.21357185398444e-05, 'samples': 22282944, 'steps': 116056, 'loss/train': 1.5166265964508057} 11/07/2021 13:32:38 - INFO - __main__ - Step 116058: {'lr': 6.213221728817025e-05, 'samples': 22283136, 'steps': 116057, 'loss/train': 1.0030168294906616} 11/07/2021 13:32:38 - INFO - __main__ - Step 116059: {'lr': 6.212871612114648e-05, 'samples': 22283328, 'steps': 116058, 'loss/train': 1.630981206893921} 11/07/2021 13:32:39 - INFO - __main__ - Step 116060: {'lr': 6.212521503877455e-05, 'samples': 22283520, 'steps': 116059, 'loss/train': 1.5278366804122925} 11/07/2021 13:32:39 - INFO - __main__ - Step 116061: {'lr': 6.21217140410561e-05, 'samples': 22283712, 'steps': 116060, 'loss/train': 0.8296043872833252} 11/07/2021 13:32:40 - INFO - __main__ - Step 116062: {'lr': 6.21182131279927e-05, 'samples': 22283904, 'steps': 116061, 'loss/train': 1.1646097898483276} 11/07/2021 13:32:40 - INFO - __main__ - Step 116063: {'lr': 6.211471229958595e-05, 'samples': 22284096, 'steps': 116062, 'loss/train': 1.1908636093139648} 11/07/2021 13:32:41 - INFO - __main__ - Step 116064: {'lr': 6.21112115558374e-05, 'samples': 22284288, 'steps': 116063, 'loss/train': 0.999692440032959} 11/07/2021 13:32:41 - INFO - __main__ - Step 116065: {'lr': 6.210771089674863e-05, 'samples': 22284480, 'steps': 116064, 'loss/train': 1.0185538530349731} 11/07/2021 13:32:41 - INFO - __main__ - Step 116066: {'lr': 6.210421032232125e-05, 'samples': 22284672, 'steps': 116065, 'loss/train': 1.6722469329833984} 11/07/2021 13:32:43 - INFO - __main__ - Step 116067: {'lr': 6.21007098325568e-05, 'samples': 22284864, 'steps': 116066, 'loss/train': 1.2367088794708252} 11/07/2021 13:32:43 - INFO - __main__ - Step 116068: {'lr': 6.209720942745686e-05, 'samples': 22285056, 'steps': 116067, 'loss/train': 1.5163894891738892} 11/07/2021 13:32:43 - INFO - __main__ - Step 116069: {'lr': 6.209370910702302e-05, 'samples': 22285248, 'steps': 116068, 'loss/train': 1.4886353015899658} 11/07/2021 13:32:44 - INFO - __main__ - Step 116070: {'lr': 6.209020887125694e-05, 'samples': 22285440, 'steps': 116069, 'loss/train': 0.8982657790184021} 11/07/2021 13:32:44 - INFO - __main__ - Step 116071: {'lr': 6.208670872016003e-05, 'samples': 22285632, 'steps': 116070, 'loss/train': 1.9799188375473022} 11/07/2021 13:32:44 - INFO - __main__ - Step 116072: {'lr': 6.208320865373396e-05, 'samples': 22285824, 'steps': 116071, 'loss/train': 1.515866994857788} 11/07/2021 13:32:45 - INFO - __main__ - Step 116073: {'lr': 6.207970867198028e-05, 'samples': 22286016, 'steps': 116072, 'loss/train': 1.0731889009475708} 11/07/2021 13:32:46 - INFO - __main__ - Step 116074: {'lr': 6.207620877490061e-05, 'samples': 22286208, 'steps': 116073, 'loss/train': 1.7325530052185059} 11/07/2021 13:32:46 - INFO - __main__ - Step 116075: {'lr': 6.207270896249648e-05, 'samples': 22286400, 'steps': 116074, 'loss/train': 1.7170631885528564} 11/07/2021 13:32:46 - INFO - __main__ - Step 116076: {'lr': 6.20692092347695e-05, 'samples': 22286592, 'steps': 116075, 'loss/train': 1.2981969118118286} 11/07/2021 13:32:47 - INFO - __main__ - Step 116077: {'lr': 6.206570959172122e-05, 'samples': 22286784, 'steps': 116076, 'loss/train': 1.388187289237976} 11/07/2021 13:32:48 - INFO - __main__ - Step 116078: {'lr': 6.206221003335325e-05, 'samples': 22286976, 'steps': 116077, 'loss/train': 1.1987248659133911} 11/07/2021 13:32:48 - INFO - __main__ - Step 116079: {'lr': 6.205871055966713e-05, 'samples': 22287168, 'steps': 116078, 'loss/train': 1.1935995817184448} 11/07/2021 13:32:49 - INFO - __main__ - Step 116080: {'lr': 6.205521117066445e-05, 'samples': 22287360, 'steps': 116079, 'loss/train': 0.8756219148635864} 11/07/2021 13:32:49 - INFO - __main__ - Step 116081: {'lr': 6.20517118663469e-05, 'samples': 22287552, 'steps': 116080, 'loss/train': 1.5070769786834717} 11/07/2021 13:32:49 - INFO - __main__ - Step 116082: {'lr': 6.204821264671584e-05, 'samples': 22287744, 'steps': 116081, 'loss/train': 1.2157795429229736} 11/07/2021 13:32:50 - INFO - __main__ - Step 116083: {'lr': 6.204471351177296e-05, 'samples': 22287936, 'steps': 116082, 'loss/train': 1.4839062690734863} 11/07/2021 13:32:51 - INFO - __main__ - Step 116084: {'lr': 6.204121446151983e-05, 'samples': 22288128, 'steps': 116083, 'loss/train': 1.2524253129959106} 11/07/2021 13:32:51 - INFO - __main__ - Step 116085: {'lr': 6.203771549595804e-05, 'samples': 22288320, 'steps': 116084, 'loss/train': 1.0288245677947998} 11/07/2021 13:32:51 - INFO - __main__ - Step 116086: {'lr': 6.203421661508916e-05, 'samples': 22288512, 'steps': 116085, 'loss/train': 0.9900434613227844} 11/07/2021 13:32:52 - INFO - __main__ - Step 116087: {'lr': 6.203071781891476e-05, 'samples': 22288704, 'steps': 116086, 'loss/train': 1.275526762008667} 11/07/2021 13:32:53 - INFO - __main__ - Step 116088: {'lr': 6.202721910743639e-05, 'samples': 22288896, 'steps': 116087, 'loss/train': 1.077700138092041} 11/07/2021 13:32:53 - INFO - __main__ - Step 116089: {'lr': 6.202372048065569e-05, 'samples': 22289088, 'steps': 116088, 'loss/train': 1.2213327884674072} 11/07/2021 13:32:53 - INFO - __main__ - Step 116090: {'lr': 6.202022193857417e-05, 'samples': 22289280, 'steps': 116089, 'loss/train': 1.4312868118286133} 11/07/2021 13:32:54 - INFO - __main__ - Step 116091: {'lr': 6.201672348119348e-05, 'samples': 22289472, 'steps': 116090, 'loss/train': 1.1884684562683105} 11/07/2021 13:32:54 - INFO - __main__ - Step 116092: {'lr': 6.201322510851518e-05, 'samples': 22289664, 'steps': 116091, 'loss/train': 1.6234991550445557} 11/07/2021 13:32:55 - INFO - __main__ - Step 116093: {'lr': 6.200972682054076e-05, 'samples': 22289856, 'steps': 116092, 'loss/train': 1.4422383308410645} 11/07/2021 13:32:56 - INFO - __main__ - Step 116094: {'lr': 6.200622861727187e-05, 'samples': 22290048, 'steps': 116093, 'loss/train': 1.3999450206756592} 11/07/2021 13:32:56 - INFO - __main__ - Step 116095: {'lr': 6.200273049871006e-05, 'samples': 22290240, 'steps': 116094, 'loss/train': 1.5522029399871826} 11/07/2021 13:32:56 - INFO - __main__ - Step 116096: {'lr': 6.199923246485692e-05, 'samples': 22290432, 'steps': 116095, 'loss/train': 1.4779167175292969} 11/07/2021 13:32:57 - INFO - __main__ - Step 116097: {'lr': 6.199573451571403e-05, 'samples': 22290624, 'steps': 116096, 'loss/train': 1.3458590507507324} 11/07/2021 13:32:58 - INFO - __main__ - Step 116098: {'lr': 6.199223665128297e-05, 'samples': 22290816, 'steps': 116097, 'loss/train': 1.0822234153747559} 11/07/2021 13:32:58 - INFO - __main__ - Step 116099: {'lr': 6.198873887156528e-05, 'samples': 22291008, 'steps': 116098, 'loss/train': 0.7030258178710938} 11/07/2021 13:32:59 - INFO - __main__ - Step 116100: {'lr': 6.198524117656259e-05, 'samples': 22291200, 'steps': 116099, 'loss/train': 1.579332947731018} 11/07/2021 13:32:59 - INFO - __main__ - Step 116101: {'lr': 6.198174356627645e-05, 'samples': 22291392, 'steps': 116100, 'loss/train': 1.6628334522247314} 11/07/2021 13:33:00 - INFO - __main__ - Step 116102: {'lr': 6.197824604070842e-05, 'samples': 22291584, 'steps': 116101, 'loss/train': 1.6388434171676636} 11/07/2021 13:33:00 - INFO - __main__ - Step 116103: {'lr': 6.197474859986016e-05, 'samples': 22291776, 'steps': 116102, 'loss/train': 1.9654600620269775} 11/07/2021 13:33:00 - INFO - __main__ - Step 116104: {'lr': 6.197125124373313e-05, 'samples': 22291968, 'steps': 116103, 'loss/train': 0.8191379904747009} 11/07/2021 13:33:01 - INFO - __main__ - Step 116105: {'lr': 6.196775397232893e-05, 'samples': 22292160, 'steps': 116104, 'loss/train': 1.3336553573608398} 11/07/2021 13:33:02 - INFO - __main__ - Step 116106: {'lr': 6.196425678564916e-05, 'samples': 22292352, 'steps': 116105, 'loss/train': 1.1797858476638794} 11/07/2021 13:33:02 - INFO - __main__ - Step 116107: {'lr': 6.196075968369538e-05, 'samples': 22292544, 'steps': 116106, 'loss/train': 0.8839355111122131} 11/07/2021 13:33:02 - INFO - __main__ - Step 116108: {'lr': 6.19572626664692e-05, 'samples': 22292736, 'steps': 116107, 'loss/train': 1.3979787826538086} 11/07/2021 13:33:03 - INFO - __main__ - Step 116109: {'lr': 6.195376573397218e-05, 'samples': 22292928, 'steps': 116108, 'loss/train': 1.33836829662323} 11/07/2021 13:33:04 - INFO - __main__ - Step 116110: {'lr': 6.195026888620589e-05, 'samples': 22293120, 'steps': 116109, 'loss/train': 1.180429458618164} 11/07/2021 13:33:04 - INFO - __main__ - Step 116111: {'lr': 6.19467721231719e-05, 'samples': 22293312, 'steps': 116110, 'loss/train': 1.2211769819259644} 11/07/2021 13:33:04 - INFO - __main__ - Step 116112: {'lr': 6.19432754448718e-05, 'samples': 22293504, 'steps': 116111, 'loss/train': 1.6209282875061035} 11/07/2021 13:33:05 - INFO - __main__ - Step 116113: {'lr': 6.193977885130714e-05, 'samples': 22293696, 'steps': 116112, 'loss/train': 1.2709957361221313} 11/07/2021 13:33:05 - INFO - __main__ - Step 116114: {'lr': 6.193628234247961e-05, 'samples': 22293888, 'steps': 116113, 'loss/train': 1.3271920680999756} 11/07/2021 13:33:06 - INFO - __main__ - Step 116115: {'lr': 6.19327859183906e-05, 'samples': 22294080, 'steps': 116114, 'loss/train': 1.5399682521820068} 11/07/2021 13:33:07 - INFO - __main__ - Step 116116: {'lr': 6.192928957904179e-05, 'samples': 22294272, 'steps': 116115, 'loss/train': 1.2226290702819824} 11/07/2021 13:33:07 - INFO - __main__ - Step 116117: {'lr': 6.192579332443471e-05, 'samples': 22294464, 'steps': 116116, 'loss/train': 1.2991918325424194} 11/07/2021 13:33:07 - INFO - __main__ - Step 116118: {'lr': 6.192229715457098e-05, 'samples': 22294656, 'steps': 116117, 'loss/train': 1.5647327899932861} 11/07/2021 13:33:08 - INFO - __main__ - Step 116119: {'lr': 6.191880106945219e-05, 'samples': 22294848, 'steps': 116118, 'loss/train': 1.7822421789169312} 11/07/2021 13:33:08 - INFO - __main__ - Step 116120: {'lr': 6.191530506907985e-05, 'samples': 22295040, 'steps': 116119, 'loss/train': 1.7312053442001343} 11/07/2021 13:33:09 - INFO - __main__ - Step 116121: {'lr': 6.19118091534556e-05, 'samples': 22295232, 'steps': 116120, 'loss/train': 0.8728405237197876} 11/07/2021 13:33:09 - INFO - __main__ - Step 116122: {'lr': 6.190831332258095e-05, 'samples': 22295424, 'steps': 116121, 'loss/train': 1.46366548538208} 11/07/2021 13:33:10 - INFO - __main__ - Step 116123: {'lr': 6.190481757645753e-05, 'samples': 22295616, 'steps': 116122, 'loss/train': 1.2752788066864014} 11/07/2021 13:33:10 - INFO - __main__ - Step 116124: {'lr': 6.19013219150869e-05, 'samples': 22295808, 'steps': 116123, 'loss/train': 1.0205022096633911} 11/07/2021 13:33:10 - INFO - __main__ - Step 116125: {'lr': 6.189782633847063e-05, 'samples': 22296000, 'steps': 116124, 'loss/train': 1.3545478582382202} 11/07/2021 13:33:11 - INFO - __main__ - Step 116126: {'lr': 6.189433084661031e-05, 'samples': 22296192, 'steps': 116125, 'loss/train': 1.4850186109542847} 11/07/2021 13:33:12 - INFO - __main__ - Step 116127: {'lr': 6.189083543950755e-05, 'samples': 22296384, 'steps': 116126, 'loss/train': 1.5508840084075928} 11/07/2021 13:33:12 - INFO - __main__ - Step 116128: {'lr': 6.188734011716382e-05, 'samples': 22296576, 'steps': 116127, 'loss/train': 1.4376230239868164} 11/07/2021 13:33:12 - INFO - __main__ - Step 116129: {'lr': 6.188384487958074e-05, 'samples': 22296768, 'steps': 116128, 'loss/train': 1.4317314624786377} 11/07/2021 13:33:13 - INFO - __main__ - Step 116130: {'lr': 6.18803497267599e-05, 'samples': 22296960, 'steps': 116129, 'loss/train': 1.4097481966018677} 11/07/2021 13:33:14 - INFO - __main__ - Step 116131: {'lr': 6.187685465870287e-05, 'samples': 22297152, 'steps': 116130, 'loss/train': 1.430259346961975} 11/07/2021 13:33:14 - INFO - __main__ - Step 116132: {'lr': 6.187335967541125e-05, 'samples': 22297344, 'steps': 116131, 'loss/train': 1.589196801185608} 11/07/2021 13:33:15 - INFO - __main__ - Step 116133: {'lr': 6.186986477688657e-05, 'samples': 22297536, 'steps': 116132, 'loss/train': 1.7785532474517822} 11/07/2021 13:33:15 - INFO - __main__ - Step 116134: {'lr': 6.186636996313041e-05, 'samples': 22297728, 'steps': 116133, 'loss/train': 1.0356541872024536} 11/07/2021 13:33:15 - INFO - __main__ - Step 116135: {'lr': 6.186287523414438e-05, 'samples': 22297920, 'steps': 116134, 'loss/train': 1.0572388172149658} 11/07/2021 13:33:16 - INFO - __main__ - Step 116136: {'lr': 6.185938058993005e-05, 'samples': 22298112, 'steps': 116135, 'loss/train': 1.5050990581512451} 11/07/2021 13:33:17 - INFO - __main__ - Step 116137: {'lr': 6.185588603048898e-05, 'samples': 22298304, 'steps': 116136, 'loss/train': 1.0326021909713745} 11/07/2021 13:33:17 - INFO - __main__ - Step 116138: {'lr': 6.185239155582274e-05, 'samples': 22298496, 'steps': 116137, 'loss/train': 1.5447708368301392} 11/07/2021 13:33:18 - INFO - __main__ - Step 116139: {'lr': 6.184889716593286e-05, 'samples': 22298688, 'steps': 116138, 'loss/train': 0.9595929384231567} 11/07/2021 13:33:18 - INFO - __main__ - Step 116140: {'lr': 6.184540286082103e-05, 'samples': 22298880, 'steps': 116139, 'loss/train': 1.0613738298416138} 11/07/2021 13:33:18 - INFO - __main__ - Step 116141: {'lr': 6.184190864048877e-05, 'samples': 22299072, 'steps': 116140, 'loss/train': 1.4478650093078613} 11/07/2021 13:33:19 - INFO - __main__ - Step 116142: {'lr': 6.183841450493763e-05, 'samples': 22299264, 'steps': 116141, 'loss/train': 1.4335566759109497} 11/07/2021 13:33:20 - INFO - __main__ - Step 116143: {'lr': 6.183492045416916e-05, 'samples': 22299456, 'steps': 116142, 'loss/train': 0.9218486547470093} 11/07/2021 13:33:20 - INFO - __main__ - Step 116144: {'lr': 6.183142648818499e-05, 'samples': 22299648, 'steps': 116143, 'loss/train': 1.5619533061981201} 11/07/2021 13:33:20 - INFO - __main__ - Step 116145: {'lr': 6.182793260698666e-05, 'samples': 22299840, 'steps': 116144, 'loss/train': 1.2718372344970703} 11/07/2021 13:33:21 - INFO - __main__ - Step 116146: {'lr': 6.182443881057576e-05, 'samples': 22300032, 'steps': 116145, 'loss/train': 1.2563955783843994} 11/07/2021 13:33:22 - INFO - __main__ - Step 116147: {'lr': 6.182094509895386e-05, 'samples': 22300224, 'steps': 116146, 'loss/train': 1.3399882316589355} 11/07/2021 13:33:22 - INFO - __main__ - Step 116148: {'lr': 6.181745147212257e-05, 'samples': 22300416, 'steps': 116147, 'loss/train': 1.553357720375061} 11/07/2021 13:33:22 - INFO - __main__ - Step 116149: {'lr': 6.18139579300834e-05, 'samples': 22300608, 'steps': 116148, 'loss/train': 1.378612756729126} 11/07/2021 13:33:23 - INFO - __main__ - Step 116150: {'lr': 6.181046447283798e-05, 'samples': 22300800, 'steps': 116149, 'loss/train': 1.2320363521575928} 11/07/2021 13:33:23 - INFO - __main__ - Step 116151: {'lr': 6.180697110038783e-05, 'samples': 22300992, 'steps': 116150, 'loss/train': 1.4650583267211914} 11/07/2021 13:33:24 - INFO - __main__ - Step 116152: {'lr': 6.180347781273457e-05, 'samples': 22301184, 'steps': 116151, 'loss/train': 1.4350571632385254} 11/07/2021 13:33:25 - INFO - __main__ - Step 116153: {'lr': 6.179998460987976e-05, 'samples': 22301376, 'steps': 116152, 'loss/train': 1.2569040060043335} 11/07/2021 13:33:25 - INFO - __main__ - Step 116154: {'lr': 6.179649149182507e-05, 'samples': 22301568, 'steps': 116153, 'loss/train': 1.6546359062194824} 11/07/2021 13:33:26 - INFO - __main__ - Step 116155: {'lr': 6.179299845857186e-05, 'samples': 22301760, 'steps': 116154, 'loss/train': 1.5531543493270874} 11/07/2021 13:33:26 - INFO - __main__ - Step 116156: {'lr': 6.178950551012185e-05, 'samples': 22301952, 'steps': 116155, 'loss/train': 1.140254259109497} 11/07/2021 13:33:27 - INFO - __main__ - Step 116157: {'lr': 6.178601264647659e-05, 'samples': 22302144, 'steps': 116156, 'loss/train': 1.4030158519744873} 11/07/2021 13:33:27 - INFO - __main__ - Step 116158: {'lr': 6.178251986763764e-05, 'samples': 22302336, 'steps': 116157, 'loss/train': 1.2561557292938232} 11/07/2021 13:33:28 - INFO - __main__ - Step 116159: {'lr': 6.177902717360656e-05, 'samples': 22302528, 'steps': 116158, 'loss/train': 0.8550292253494263} 11/07/2021 13:33:28 - INFO - __main__ - Step 116160: {'lr': 6.177553456438498e-05, 'samples': 22302720, 'steps': 116159, 'loss/train': 1.4741263389587402} 11/07/2021 13:33:28 - INFO - __main__ - Step 116161: {'lr': 6.177204203997441e-05, 'samples': 22302912, 'steps': 116160, 'loss/train': 1.3883905410766602} 11/07/2021 13:33:29 - INFO - __main__ - Step 116162: {'lr': 6.176854960037648e-05, 'samples': 22303104, 'steps': 116161, 'loss/train': 1.441322684288025} 11/07/2021 13:33:30 - INFO - __main__ - Step 116163: {'lr': 6.176505724559272e-05, 'samples': 22303296, 'steps': 116162, 'loss/train': 1.0854685306549072} 11/07/2021 13:33:30 - INFO - __main__ - Step 116164: {'lr': 6.176156497562471e-05, 'samples': 22303488, 'steps': 116163, 'loss/train': 1.1257152557373047} 11/07/2021 13:33:30 - INFO - __main__ - Step 116165: {'lr': 6.175807279047405e-05, 'samples': 22303680, 'steps': 116164, 'loss/train': 1.266660213470459} 11/07/2021 13:33:31 - INFO - __main__ - Step 116166: {'lr': 6.175458069014231e-05, 'samples': 22303872, 'steps': 116165, 'loss/train': 1.0612409114837646} 11/07/2021 13:33:31 - INFO - __main__ - Step 116167: {'lr': 6.175108867463103e-05, 'samples': 22304064, 'steps': 116166, 'loss/train': 1.7184323072433472} 11/07/2021 13:33:33 - INFO - __main__ - Step 116168: {'lr': 6.17475967439419e-05, 'samples': 22304256, 'steps': 116167, 'loss/train': 1.2818398475646973} 11/07/2021 13:33:33 - INFO - __main__ - Step 116169: {'lr': 6.17441048980763e-05, 'samples': 22304448, 'steps': 116168, 'loss/train': 1.5280905961990356} 11/07/2021 13:33:34 - INFO - __main__ - Step 116170: {'lr': 6.174061313703591e-05, 'samples': 22304640, 'steps': 116169, 'loss/train': 1.6203677654266357} 11/07/2021 13:33:34 - INFO - __main__ - Step 116171: {'lr': 6.17371214608223e-05, 'samples': 22304832, 'steps': 116170, 'loss/train': 0.7056220769882202} 11/07/2021 13:33:34 - INFO - __main__ - Step 116172: {'lr': 6.173362986943703e-05, 'samples': 22305024, 'steps': 116171, 'loss/train': 1.3460060358047485} 11/07/2021 13:33:35 - INFO - __main__ - Step 116173: {'lr': 6.173013836288169e-05, 'samples': 22305216, 'steps': 116172, 'loss/train': 1.002805233001709} 11/07/2021 13:33:35 - INFO - __main__ - Step 116174: {'lr': 6.172664694115782e-05, 'samples': 22305408, 'steps': 116173, 'loss/train': 1.7060680389404297} 11/07/2021 13:33:36 - INFO - __main__ - Step 116175: {'lr': 6.172315560426705e-05, 'samples': 22305600, 'steps': 116174, 'loss/train': 1.6900397539138794} 11/07/2021 13:33:37 - INFO - __main__ - Step 116176: {'lr': 6.171966435221091e-05, 'samples': 22305792, 'steps': 116175, 'loss/train': 1.7371302843093872} 11/07/2021 13:33:37 - INFO - __main__ - Step 116177: {'lr': 6.171617318499098e-05, 'samples': 22305984, 'steps': 116176, 'loss/train': 1.1871429681777954} 11/07/2021 13:33:37 - INFO - __main__ - Step 116178: {'lr': 6.171268210260883e-05, 'samples': 22306176, 'steps': 116177, 'loss/train': 1.322939157485962} 11/07/2021 13:33:38 - INFO - __main__ - Step 116179: {'lr': 6.170919110506606e-05, 'samples': 22306368, 'steps': 116178, 'loss/train': 1.43672513961792} 11/07/2021 13:33:38 - INFO - __main__ - Step 116180: {'lr': 6.17057001923642e-05, 'samples': 22306560, 'steps': 116179, 'loss/train': 1.4329791069030762} 11/07/2021 13:33:39 - INFO - __main__ - Step 116181: {'lr': 6.170220936450494e-05, 'samples': 22306752, 'steps': 116180, 'loss/train': 1.584855318069458} 11/07/2021 13:33:40 - INFO - __main__ - Step 116182: {'lr': 6.169871862148968e-05, 'samples': 22306944, 'steps': 116181, 'loss/train': 1.3129783868789673} 11/07/2021 13:33:40 - INFO - __main__ - Step 116183: {'lr': 6.169522796332005e-05, 'samples': 22307136, 'steps': 116182, 'loss/train': 1.4279922246932983} 11/07/2021 13:33:40 - INFO - __main__ - Step 116184: {'lr': 6.169173738999767e-05, 'samples': 22307328, 'steps': 116183, 'loss/train': 1.3962652683258057} 11/07/2021 13:33:41 - INFO - __main__ - Step 116185: {'lr': 6.168824690152408e-05, 'samples': 22307520, 'steps': 116184, 'loss/train': 1.3913145065307617} 11/07/2021 13:33:42 - INFO - __main__ - Step 116186: {'lr': 6.168475649790086e-05, 'samples': 22307712, 'steps': 116185, 'loss/train': 0.9412005543708801} 11/07/2021 13:33:42 - INFO - __main__ - Step 116187: {'lr': 6.168126617912958e-05, 'samples': 22307904, 'steps': 116186, 'loss/train': 1.574444055557251} 11/07/2021 13:33:43 - INFO - __main__ - Step 116188: {'lr': 6.167777594521181e-05, 'samples': 22308096, 'steps': 116187, 'loss/train': 1.1667438745498657} 11/07/2021 13:33:43 - INFO - __main__ - Step 116189: {'lr': 6.167428579614915e-05, 'samples': 22308288, 'steps': 116188, 'loss/train': 1.1204506158828735} 11/07/2021 13:33:43 - INFO - __main__ - Step 116190: {'lr': 6.167079573194314e-05, 'samples': 22308480, 'steps': 116189, 'loss/train': 1.1096961498260498} 11/07/2021 13:33:44 - INFO - __main__ - Step 116191: {'lr': 6.166730575259535e-05, 'samples': 22308672, 'steps': 116190, 'loss/train': 1.3848350048065186} 11/07/2021 13:33:45 - INFO - __main__ - Step 116192: {'lr': 6.166381585810737e-05, 'samples': 22308864, 'steps': 116191, 'loss/train': 1.7170275449752808} 11/07/2021 13:33:45 - INFO - __main__ - Step 116193: {'lr': 6.166032604848079e-05, 'samples': 22309056, 'steps': 116192, 'loss/train': 1.0254509449005127} 11/07/2021 13:33:45 - INFO - __main__ - Step 116194: {'lr': 6.165683632371716e-05, 'samples': 22309248, 'steps': 116193, 'loss/train': 1.5865055322647095} 11/07/2021 13:33:46 - INFO - __main__ - Step 116195: {'lr': 6.165334668381812e-05, 'samples': 22309440, 'steps': 116194, 'loss/train': 1.028334617614746} 11/07/2021 13:33:47 - INFO - __main__ - Step 116196: {'lr': 6.164985712878507e-05, 'samples': 22309632, 'steps': 116195, 'loss/train': 2.186422109603882} 11/07/2021 13:33:47 - INFO - __main__ - Step 116197: {'lr': 6.164636765861972e-05, 'samples': 22309824, 'steps': 116196, 'loss/train': 1.3665804862976074} 11/07/2021 13:33:48 - INFO - __main__ - Step 116198: {'lr': 6.16428782733236e-05, 'samples': 22310016, 'steps': 116197, 'loss/train': 1.5157644748687744} 11/07/2021 13:33:48 - INFO - __main__ - Step 116199: {'lr': 6.163938897289831e-05, 'samples': 22310208, 'steps': 116198, 'loss/train': 1.475342869758606} 11/07/2021 13:33:48 - INFO - __main__ - Step 116200: {'lr': 6.163589975734537e-05, 'samples': 22310400, 'steps': 116199, 'loss/train': 1.4067387580871582} 11/07/2021 13:33:49 - INFO - __main__ - Step 116201: {'lr': 6.163241062666641e-05, 'samples': 22310592, 'steps': 116200, 'loss/train': 1.6020632982254028} 11/07/2021 13:33:50 - INFO - __main__ - Step 116202: {'lr': 6.162892158086297e-05, 'samples': 22310784, 'steps': 116201, 'loss/train': 2.5736727714538574} 11/07/2021 13:33:50 - INFO - __main__ - Step 116203: {'lr': 6.162543261993664e-05, 'samples': 22310976, 'steps': 116202, 'loss/train': 1.3483585119247437} 11/07/2021 13:33:51 - INFO - __main__ - Step 116204: {'lr': 6.1621943743889e-05, 'samples': 22311168, 'steps': 116203, 'loss/train': 1.2737034559249878} 11/07/2021 13:33:51 - INFO - __main__ - Step 116205: {'lr': 6.161845495272159e-05, 'samples': 22311360, 'steps': 116204, 'loss/train': 1.3783087730407715} 11/07/2021 13:33:51 - INFO - __main__ - Step 116206: {'lr': 6.161496624643598e-05, 'samples': 22311552, 'steps': 116205, 'loss/train': 1.7586121559143066} 11/07/2021 13:33:52 - INFO - __main__ - Step 116207: {'lr': 6.161147762503378e-05, 'samples': 22311744, 'steps': 116206, 'loss/train': 3.6151983737945557} 11/07/2021 13:33:53 - INFO - __main__ - Step 116208: {'lr': 6.160798908851658e-05, 'samples': 22311936, 'steps': 116207, 'loss/train': 1.8370929956436157} 11/07/2021 13:33:53 - INFO - __main__ - Step 116209: {'lr': 6.160450063688589e-05, 'samples': 22312128, 'steps': 116208, 'loss/train': 1.3071238994598389} 11/07/2021 13:33:53 - INFO - __main__ - Step 116210: {'lr': 6.160101227014328e-05, 'samples': 22312320, 'steps': 116209, 'loss/train': 1.1064424514770508} 11/07/2021 13:33:54 - INFO - __main__ - Step 116211: {'lr': 6.159752398829035e-05, 'samples': 22312512, 'steps': 116210, 'loss/train': 1.5002444982528687} 11/07/2021 13:33:54 - INFO - __main__ - Step 116212: {'lr': 6.159403579132866e-05, 'samples': 22312704, 'steps': 116211, 'loss/train': 1.1947917938232422} 11/07/2021 13:33:55 - INFO - __main__ - Step 116213: {'lr': 6.159054767925978e-05, 'samples': 22312896, 'steps': 116212, 'loss/train': 1.2136000394821167} 11/07/2021 13:33:55 - INFO - __main__ - Step 116214: {'lr': 6.158705965208533e-05, 'samples': 22313088, 'steps': 116213, 'loss/train': 1.520963430404663} 11/07/2021 13:33:56 - INFO - __main__ - Step 116215: {'lr': 6.158357170980683e-05, 'samples': 22313280, 'steps': 116214, 'loss/train': 1.0956796407699585} 11/07/2021 13:33:56 - INFO - __main__ - Step 116216: {'lr': 6.158008385242583e-05, 'samples': 22313472, 'steps': 116215, 'loss/train': 1.5228227376937866} 11/07/2021 13:33:56 - INFO - __main__ - Step 116217: {'lr': 6.157659607994398e-05, 'samples': 22313664, 'steps': 116216, 'loss/train': 1.268465280532837} 11/07/2021 13:33:57 - INFO - __main__ - Step 116218: {'lr': 6.15731083923628e-05, 'samples': 22313856, 'steps': 116217, 'loss/train': 1.5613646507263184} 11/07/2021 13:33:58 - INFO - __main__ - Step 116219: {'lr': 6.156962078968387e-05, 'samples': 22314048, 'steps': 116218, 'loss/train': 1.337719440460205} 11/07/2021 13:33:58 - INFO - __main__ - Step 116220: {'lr': 6.156613327190874e-05, 'samples': 22314240, 'steps': 116219, 'loss/train': 1.4101492166519165} 11/07/2021 13:33:59 - INFO - __main__ - Step 116221: {'lr': 6.1562645839039e-05, 'samples': 22314432, 'steps': 116220, 'loss/train': 1.2885761260986328} 11/07/2021 13:33:59 - INFO - __main__ - Step 116222: {'lr': 6.155915849107633e-05, 'samples': 22314624, 'steps': 116221, 'loss/train': 1.6675399541854858} 11/07/2021 13:34:00 - INFO - __main__ - Step 116223: {'lr': 6.155567122802211e-05, 'samples': 22314816, 'steps': 116222, 'loss/train': 1.414674997329712} 11/07/2021 13:34:00 - INFO - __main__ - Step 116224: {'lr': 6.155218404987797e-05, 'samples': 22315008, 'steps': 116223, 'loss/train': 1.0498988628387451} 11/07/2021 13:34:01 - INFO - __main__ - Step 116225: {'lr': 6.154869695664555e-05, 'samples': 22315200, 'steps': 116224, 'loss/train': 0.9038160443305969} 11/07/2021 13:34:01 - INFO - __main__ - Step 116226: {'lr': 6.154520994832635e-05, 'samples': 22315392, 'steps': 116225, 'loss/train': 1.3143134117126465} 11/07/2021 13:34:01 - INFO - __main__ - Step 116227: {'lr': 6.154172302492197e-05, 'samples': 22315584, 'steps': 116226, 'loss/train': 1.5381247997283936} 11/07/2021 13:34:02 - INFO - __main__ - Step 116228: {'lr': 6.153823618643401e-05, 'samples': 22315776, 'steps': 116227, 'loss/train': 1.5515509843826294} 11/07/2021 13:34:03 - INFO - __main__ - Step 116229: {'lr': 6.153474943286399e-05, 'samples': 22315968, 'steps': 116228, 'loss/train': 1.3107035160064697} 11/07/2021 13:34:03 - INFO - __main__ - Step 116230: {'lr': 6.153126276421351e-05, 'samples': 22316160, 'steps': 116229, 'loss/train': 1.1173853874206543} 11/07/2021 13:34:03 - INFO - __main__ - Step 116231: {'lr': 6.152777618048414e-05, 'samples': 22316352, 'steps': 116230, 'loss/train': 0.694162905216217} 11/07/2021 13:34:04 - INFO - __main__ - Step 116232: {'lr': 6.152428968167742e-05, 'samples': 22316544, 'steps': 116231, 'loss/train': 1.7677085399627686} 11/07/2021 13:34:04 - INFO - __main__ - Step 116233: {'lr': 6.152080326779497e-05, 'samples': 22316736, 'steps': 116232, 'loss/train': 1.106573224067688} 11/07/2021 13:34:05 - INFO - __main__ - Step 116234: {'lr': 6.151731693883833e-05, 'samples': 22316928, 'steps': 116233, 'loss/train': 0.9479115605354309} 11/07/2021 13:34:06 - INFO - __main__ - Step 116235: {'lr': 6.151383069480914e-05, 'samples': 22317120, 'steps': 116234, 'loss/train': 1.281650185585022} 11/07/2021 13:34:06 - INFO - __main__ - Step 116236: {'lr': 6.151034453570887e-05, 'samples': 22317312, 'steps': 116235, 'loss/train': 1.2615834474563599} 11/07/2021 13:34:06 - INFO - __main__ - Step 116237: {'lr': 6.150685846153911e-05, 'samples': 22317504, 'steps': 116236, 'loss/train': 0.7610580325126648} 11/07/2021 13:34:07 - INFO - __main__ - Step 116238: {'lr': 6.150337247230145e-05, 'samples': 22317696, 'steps': 116237, 'loss/train': 0.69276362657547} 11/07/2021 13:34:08 - INFO - __main__ - Step 116239: {'lr': 6.149988656799746e-05, 'samples': 22317888, 'steps': 116238, 'loss/train': 1.2096737623214722} 11/07/2021 13:34:08 - INFO - __main__ - Step 116240: {'lr': 6.149640074862872e-05, 'samples': 22318080, 'steps': 116239, 'loss/train': 1.265875220298767} 11/07/2021 13:34:08 - INFO - __main__ - Step 116241: {'lr': 6.149291501419679e-05, 'samples': 22318272, 'steps': 116240, 'loss/train': 1.1024837493896484} 11/07/2021 13:34:09 - INFO - __main__ - Step 116242: {'lr': 6.148942936470325e-05, 'samples': 22318464, 'steps': 116241, 'loss/train': 1.389771580696106} 11/07/2021 13:34:09 - INFO - __main__ - Step 116243: {'lr': 6.148594380014966e-05, 'samples': 22318656, 'steps': 116242, 'loss/train': 0.6889646649360657} 11/07/2021 13:34:10 - INFO - __main__ - Step 116244: {'lr': 6.148245832053759e-05, 'samples': 22318848, 'steps': 116243, 'loss/train': 1.3249131441116333} 11/07/2021 13:34:11 - INFO - __main__ - Step 116245: {'lr': 6.14789729258686e-05, 'samples': 22319040, 'steps': 116244, 'loss/train': 0.5961148142814636} 11/07/2021 13:34:11 - INFO - __main__ - Step 116246: {'lr': 6.14754876161443e-05, 'samples': 22319232, 'steps': 116245, 'loss/train': 0.9757425785064697} 11/07/2021 13:34:11 - INFO - __main__ - Step 116247: {'lr': 6.147200239136622e-05, 'samples': 22319424, 'steps': 116246, 'loss/train': 1.1958194971084595} 11/07/2021 13:34:12 - INFO - __main__ - Step 116248: {'lr': 6.146851725153604e-05, 'samples': 22319616, 'steps': 116247, 'loss/train': 1.0943574905395508} 11/07/2021 13:34:13 - INFO - __main__ - Step 116249: {'lr': 6.146503219665514e-05, 'samples': 22319808, 'steps': 116248, 'loss/train': 1.5219793319702148} 11/07/2021 13:34:13 - INFO - __main__ - Step 116250: {'lr': 6.14615472267252e-05, 'samples': 22320000, 'steps': 116249, 'loss/train': 1.3130896091461182} 11/07/2021 13:34:13 - INFO - __main__ - Step 116251: {'lr': 6.145806234174778e-05, 'samples': 22320192, 'steps': 116250, 'loss/train': 1.455933928489685} 11/07/2021 13:34:14 - INFO - __main__ - Step 116252: {'lr': 6.145457754172446e-05, 'samples': 22320384, 'steps': 116251, 'loss/train': 1.2109206914901733} 11/07/2021 13:34:14 - INFO - __main__ - Step 116253: {'lr': 6.145109282665675e-05, 'samples': 22320576, 'steps': 116252, 'loss/train': 0.9332900047302246} 11/07/2021 13:34:15 - INFO - __main__ - Step 116254: {'lr': 6.144760819654632e-05, 'samples': 22320768, 'steps': 116253, 'loss/train': 1.3252640962600708} 11/07/2021 13:34:16 - INFO - __main__ - Step 116255: {'lr': 6.144412365139468e-05, 'samples': 22320960, 'steps': 116254, 'loss/train': 1.3999906778335571} 11/07/2021 13:34:16 - INFO - __main__ - Step 116256: {'lr': 6.144063919120338e-05, 'samples': 22321152, 'steps': 116255, 'loss/train': 1.5621167421340942} 11/07/2021 13:34:16 - INFO - __main__ - Step 116257: {'lr': 6.143715481597403e-05, 'samples': 22321344, 'steps': 116256, 'loss/train': 1.3161746263504028} 11/07/2021 13:34:17 - INFO - __main__ - Step 116258: {'lr': 6.143367052570819e-05, 'samples': 22321536, 'steps': 116257, 'loss/train': 0.897550642490387} 11/07/2021 13:34:17 - INFO - __main__ - Step 116259: {'lr': 6.143018632040745e-05, 'samples': 22321728, 'steps': 116258, 'loss/train': 1.5968509912490845} 11/07/2021 13:34:18 - INFO - __main__ - Step 116260: {'lr': 6.142670220007335e-05, 'samples': 22321920, 'steps': 116259, 'loss/train': 1.259556770324707} 11/07/2021 13:34:18 - INFO - __main__ - Step 116261: {'lr': 6.142321816470745e-05, 'samples': 22322112, 'steps': 116260, 'loss/train': 1.0456056594848633} 11/07/2021 13:34:19 - INFO - __main__ - Step 116262: {'lr': 6.141973421431141e-05, 'samples': 22322304, 'steps': 116261, 'loss/train': 1.4476770162582397} 11/07/2021 13:34:19 - INFO - __main__ - Step 116263: {'lr': 6.141625034888668e-05, 'samples': 22322496, 'steps': 116262, 'loss/train': 1.5107582807540894} 11/07/2021 13:34:19 - INFO - __main__ - Step 116264: {'lr': 6.141276656843484e-05, 'samples': 22322688, 'steps': 116263, 'loss/train': 1.6596723794937134} 11/07/2021 13:34:21 - INFO - __main__ - Step 116265: {'lr': 6.140928287295753e-05, 'samples': 22322880, 'steps': 116264, 'loss/train': 0.7324263453483582} 11/07/2021 13:34:21 - INFO - __main__ - Step 116266: {'lr': 6.140579926245627e-05, 'samples': 22323072, 'steps': 116265, 'loss/train': 1.3863587379455566} 11/07/2021 13:34:21 - INFO - __main__ - Step 116267: {'lr': 6.140231573693267e-05, 'samples': 22323264, 'steps': 116266, 'loss/train': 1.7445299625396729} 11/07/2021 13:34:22 - INFO - __main__ - Step 116268: {'lr': 6.139883229638823e-05, 'samples': 22323456, 'steps': 116267, 'loss/train': 1.7824034690856934} 11/07/2021 13:34:22 - INFO - __main__ - Step 116269: {'lr': 6.13953489408246e-05, 'samples': 22323648, 'steps': 116268, 'loss/train': 1.428354263305664} 11/07/2021 13:34:23 - INFO - __main__ - Step 116270: {'lr': 6.139186567024333e-05, 'samples': 22323840, 'steps': 116269, 'loss/train': 1.6031575202941895} 11/07/2021 13:34:23 - INFO - __main__ - Step 116271: {'lr': 6.138838248464595e-05, 'samples': 22324032, 'steps': 116270, 'loss/train': 1.0459717512130737} 11/07/2021 13:34:24 - INFO - __main__ - Step 116272: {'lr': 6.138489938403405e-05, 'samples': 22324224, 'steps': 116271, 'loss/train': 1.6529027223587036} 11/07/2021 13:34:24 - INFO - __main__ - Step 116273: {'lr': 6.138141636840922e-05, 'samples': 22324416, 'steps': 116272, 'loss/train': 1.614423394203186} 11/07/2021 13:34:24 - INFO - __main__ - Step 116274: {'lr': 6.1377933437773e-05, 'samples': 22324608, 'steps': 116273, 'loss/train': 1.5543456077575684} 11/07/2021 13:34:25 - INFO - __main__ - Step 116275: {'lr': 6.137445059212707e-05, 'samples': 22324800, 'steps': 116274, 'loss/train': 1.4923985004425049} 11/07/2021 13:34:26 - INFO - __main__ - Step 116276: {'lr': 6.13709678314728e-05, 'samples': 22324992, 'steps': 116275, 'loss/train': 1.041009783744812} 11/07/2021 13:34:26 - INFO - __main__ - Step 116277: {'lr': 6.136748515581187e-05, 'samples': 22325184, 'steps': 116276, 'loss/train': 0.5473176836967468} 11/07/2021 13:34:26 - INFO - __main__ - Step 116278: {'lr': 6.136400256514585e-05, 'samples': 22325376, 'steps': 116277, 'loss/train': 1.6600421667099} 11/07/2021 13:34:27 - INFO - __main__ - Step 116279: {'lr': 6.136052005947629e-05, 'samples': 22325568, 'steps': 116278, 'loss/train': 1.2428736686706543} 11/07/2021 13:34:28 - INFO - __main__ - Step 116280: {'lr': 6.135703763880477e-05, 'samples': 22325760, 'steps': 116279, 'loss/train': 1.2053946256637573} 11/07/2021 13:34:28 - INFO - __main__ - Step 116281: {'lr': 6.135355530313286e-05, 'samples': 22325952, 'steps': 116280, 'loss/train': 1.0578253269195557} 11/07/2021 13:34:28 - INFO - __main__ - Step 116282: {'lr': 6.135007305246212e-05, 'samples': 22326144, 'steps': 116281, 'loss/train': 1.6884076595306396} 11/07/2021 13:34:29 - INFO - __main__ - Step 116283: {'lr': 6.134659088679412e-05, 'samples': 22326336, 'steps': 116282, 'loss/train': 1.4525302648544312} 11/07/2021 13:34:29 - INFO - __main__ - Step 116284: {'lr': 6.134310880613045e-05, 'samples': 22326528, 'steps': 116283, 'loss/train': 1.2230982780456543} 11/07/2021 13:34:30 - INFO - __main__ - Step 116285: {'lr': 6.133962681047267e-05, 'samples': 22326720, 'steps': 116284, 'loss/train': 1.138339638710022} 11/07/2021 13:34:31 - INFO - __main__ - Step 116286: {'lr': 6.133614489982234e-05, 'samples': 22326912, 'steps': 116285, 'loss/train': 1.1509579420089722} 11/07/2021 13:34:31 - INFO - __main__ - Step 116287: {'lr': 6.1332663074181e-05, 'samples': 22327104, 'steps': 116286, 'loss/train': 1.3730411529541016} 11/07/2021 13:34:31 - INFO - __main__ - Step 116288: {'lr': 6.132918133355029e-05, 'samples': 22327296, 'steps': 116287, 'loss/train': 1.0136042833328247} 11/07/2021 13:34:32 - INFO - __main__ - Step 116289: {'lr': 6.13256996779318e-05, 'samples': 22327488, 'steps': 116288, 'loss/train': 0.8279873132705688} 11/07/2021 13:34:32 - INFO - __main__ - Step 116290: {'lr': 6.132221810732697e-05, 'samples': 22327680, 'steps': 116289, 'loss/train': 1.433046579360962} 11/07/2021 13:34:33 - INFO - __main__ - Step 116291: {'lr': 6.131873662173743e-05, 'samples': 22327872, 'steps': 116290, 'loss/train': 1.225347876548767} 11/07/2021 13:34:33 - INFO - __main__ - Step 116292: {'lr': 6.131525522116476e-05, 'samples': 22328064, 'steps': 116291, 'loss/train': 1.2763973474502563} 11/07/2021 13:34:34 - INFO - __main__ - Step 116293: {'lr': 6.131177390561052e-05, 'samples': 22328256, 'steps': 116292, 'loss/train': 1.1100599765777588} 11/07/2021 13:34:34 - INFO - __main__ - Step 116294: {'lr': 6.130829267507629e-05, 'samples': 22328448, 'steps': 116293, 'loss/train': 1.2575430870056152} 11/07/2021 13:34:35 - INFO - __main__ - Step 116295: {'lr': 6.130481152956364e-05, 'samples': 22328640, 'steps': 116294, 'loss/train': 0.8160523176193237} 11/07/2021 13:34:35 - INFO - __main__ - Step 116296: {'lr': 6.13013304690741e-05, 'samples': 22328832, 'steps': 116295, 'loss/train': 1.3615154027938843} 11/07/2021 13:34:36 - INFO - __main__ - Step 116297: {'lr': 6.129784949360928e-05, 'samples': 22329024, 'steps': 116296, 'loss/train': 1.1053000688552856} 11/07/2021 13:34:36 - INFO - __main__ - Step 116298: {'lr': 6.129436860317076e-05, 'samples': 22329216, 'steps': 116297, 'loss/train': 0.9711077213287354} 11/07/2021 13:34:37 - INFO - __main__ - Step 116299: {'lr': 6.129088779776005e-05, 'samples': 22329408, 'steps': 116298, 'loss/train': 1.5431302785873413} 11/07/2021 13:34:37 - INFO - __main__ - Step 116300: {'lr': 6.128740707737876e-05, 'samples': 22329600, 'steps': 116299, 'loss/train': 1.1483087539672852} 11/07/2021 13:34:38 - INFO - __main__ - Step 116301: {'lr': 6.128392644202848e-05, 'samples': 22329792, 'steps': 116300, 'loss/train': 1.4223085641860962} 11/07/2021 13:34:38 - INFO - __main__ - Step 116302: {'lr': 6.128044589171081e-05, 'samples': 22329984, 'steps': 116301, 'loss/train': 1.5203360319137573} 11/07/2021 13:34:39 - INFO - __main__ - Step 116303: {'lr': 6.127696542642718e-05, 'samples': 22330176, 'steps': 116302, 'loss/train': 1.176263689994812} 11/07/2021 13:34:39 - INFO - __main__ - Step 116304: {'lr': 6.127348504617924e-05, 'samples': 22330368, 'steps': 116303, 'loss/train': 1.201292634010315} 11/07/2021 13:34:39 - INFO - __main__ - Step 116305: {'lr': 6.127000475096855e-05, 'samples': 22330560, 'steps': 116304, 'loss/train': 1.143800139427185} 11/07/2021 13:34:40 - INFO - __main__ - Step 116306: {'lr': 6.126652454079671e-05, 'samples': 22330752, 'steps': 116305, 'loss/train': 0.4310374855995178} 11/07/2021 13:34:41 - INFO - __main__ - Step 116307: {'lr': 6.126304441566521e-05, 'samples': 22330944, 'steps': 116306, 'loss/train': 0.9014872312545776} 11/07/2021 13:34:41 - INFO - __main__ - Step 116308: {'lr': 6.125956437557572e-05, 'samples': 22331136, 'steps': 116307, 'loss/train': 1.4595972299575806} 11/07/2021 13:34:41 - INFO - __main__ - Step 116309: {'lr': 6.125608442052974e-05, 'samples': 22331328, 'steps': 116308, 'loss/train': 1.2275232076644897} 11/07/2021 13:34:42 - INFO - __main__ - Step 116310: {'lr': 6.125260455052886e-05, 'samples': 22331520, 'steps': 116309, 'loss/train': 1.2600550651550293} 11/07/2021 13:34:42 - INFO - __main__ - Step 116311: {'lr': 6.124912476557462e-05, 'samples': 22331712, 'steps': 116310, 'loss/train': 0.985718846321106} 11/07/2021 13:34:43 - INFO - __main__ - Step 116312: {'lr': 6.124564506566866e-05, 'samples': 22331904, 'steps': 116311, 'loss/train': 0.34847697615623474} 11/07/2021 13:34:44 - INFO - __main__ - Step 116313: {'lr': 6.124216545081245e-05, 'samples': 22332096, 'steps': 116312, 'loss/train': 1.3154823780059814} 11/07/2021 13:34:44 - INFO - __main__ - Step 116314: {'lr': 6.123868592100761e-05, 'samples': 22332288, 'steps': 116313, 'loss/train': 0.8181939721107483} 11/07/2021 13:34:44 - INFO - __main__ - Step 116315: {'lr': 6.123520647625575e-05, 'samples': 22332480, 'steps': 116314, 'loss/train': 1.8967044353485107} 11/07/2021 13:34:45 - INFO - __main__ - Step 116316: {'lr': 6.123172711655845e-05, 'samples': 22332672, 'steps': 116315, 'loss/train': 0.9681918025016785} 11/07/2021 13:34:46 - INFO - __main__ - Step 116317: {'lr': 6.122824784191713e-05, 'samples': 22332864, 'steps': 116316, 'loss/train': 1.2578964233398438} 11/07/2021 13:34:46 - INFO - __main__ - Step 116318: {'lr': 6.122476865233346e-05, 'samples': 22333056, 'steps': 116317, 'loss/train': 1.3182944059371948} 11/07/2021 13:34:47 - INFO - __main__ - Step 116319: {'lr': 6.122128954780898e-05, 'samples': 22333248, 'steps': 116318, 'loss/train': 0.8415798544883728} 11/07/2021 13:34:47 - INFO - __main__ - Step 116320: {'lr': 6.12178105283453e-05, 'samples': 22333440, 'steps': 116319, 'loss/train': 1.434717059135437} 11/07/2021 13:34:47 - INFO - __main__ - Step 116321: {'lr': 6.121433159394394e-05, 'samples': 22333632, 'steps': 116320, 'loss/train': 1.8431694507598877} 11/07/2021 13:34:48 - INFO - __main__ - Step 116322: {'lr': 6.12108527446065e-05, 'samples': 22333824, 'steps': 116321, 'loss/train': 1.3449784517288208} 11/07/2021 13:34:49 - INFO - __main__ - Step 116323: {'lr': 6.120737398033452e-05, 'samples': 22334016, 'steps': 116322, 'loss/train': 1.2560715675354004} 11/07/2021 13:34:49 - INFO - __main__ - Step 116324: {'lr': 6.120389530112961e-05, 'samples': 22334208, 'steps': 116323, 'loss/train': 1.2576295137405396} 11/07/2021 13:34:49 - INFO - __main__ - Step 116325: {'lr': 6.120041670699328e-05, 'samples': 22334400, 'steps': 116324, 'loss/train': 1.577949047088623} 11/07/2021 13:34:50 - INFO - __main__ - Step 116326: {'lr': 6.119693819792716e-05, 'samples': 22334592, 'steps': 116325, 'loss/train': 1.0111188888549805} 11/07/2021 13:34:50 - INFO - __main__ - Step 116327: {'lr': 6.119345977393276e-05, 'samples': 22334784, 'steps': 116326, 'loss/train': 0.9957031011581421} 11/07/2021 13:34:51 - INFO - __main__ - Step 116328: {'lr': 6.118998143501178e-05, 'samples': 22334976, 'steps': 116327, 'loss/train': 1.150267481803894} 11/07/2021 13:34:52 - INFO - __main__ - Step 116329: {'lr': 6.118650318116559e-05, 'samples': 22335168, 'steps': 116328, 'loss/train': 1.4463913440704346} 11/07/2021 13:34:52 - INFO - __main__ - Step 116330: {'lr': 6.118302501239584e-05, 'samples': 22335360, 'steps': 116329, 'loss/train': 1.5716418027877808} 11/07/2021 13:34:52 - INFO - __main__ - Step 116331: {'lr': 6.117954692870411e-05, 'samples': 22335552, 'steps': 116330, 'loss/train': 1.03744375705719} 11/07/2021 13:34:53 - INFO - __main__ - Step 116332: {'lr': 6.117606893009195e-05, 'samples': 22335744, 'steps': 116331, 'loss/train': 2.677684783935547} 11/07/2021 13:34:54 - INFO - __main__ - Step 116333: {'lr': 6.117259101656097e-05, 'samples': 22335936, 'steps': 116332, 'loss/train': 1.1720134019851685} 11/07/2021 13:34:54 - INFO - __main__ - Step 116334: {'lr': 6.116911318811269e-05, 'samples': 22336128, 'steps': 116333, 'loss/train': 1.2372366189956665} 11/07/2021 13:34:54 - INFO - __main__ - Step 116335: {'lr': 6.116563544474867e-05, 'samples': 22336320, 'steps': 116334, 'loss/train': 0.900973916053772} 11/07/2021 13:34:55 - INFO - __main__ - Step 116336: {'lr': 6.116215778647056e-05, 'samples': 22336512, 'steps': 116335, 'loss/train': 1.0621756315231323} 11/07/2021 13:34:55 - INFO - __main__ - Step 116337: {'lr': 6.11586802132798e-05, 'samples': 22336704, 'steps': 116336, 'loss/train': 1.402687907218933} 11/07/2021 13:34:56 - INFO - __main__ - Step 116338: {'lr': 6.115520272517808e-05, 'samples': 22336896, 'steps': 116337, 'loss/train': 1.1312507390975952} 11/07/2021 13:34:56 - INFO - __main__ - Step 116339: {'lr': 6.115172532216695e-05, 'samples': 22337088, 'steps': 116338, 'loss/train': 1.3719135522842407} 11/07/2021 13:34:57 - INFO - __main__ - Step 116340: {'lr': 6.11482480042479e-05, 'samples': 22337280, 'steps': 116339, 'loss/train': 1.391871690750122} 11/07/2021 13:34:57 - INFO - __main__ - Step 116341: {'lr': 6.11447707714225e-05, 'samples': 22337472, 'steps': 116340, 'loss/train': 1.4698843955993652} 11/07/2021 13:34:57 - INFO - __main__ - Step 116342: {'lr': 6.114129362369237e-05, 'samples': 22337664, 'steps': 116341, 'loss/train': 1.3556084632873535} 11/07/2021 13:34:59 - INFO - __main__ - Step 116343: {'lr': 6.113781656105904e-05, 'samples': 22337856, 'steps': 116342, 'loss/train': 1.7479740381240845} 11/07/2021 13:34:59 - INFO - __main__ - Step 116344: {'lr': 6.113433958352413e-05, 'samples': 22338048, 'steps': 116343, 'loss/train': 1.0799360275268555} 11/07/2021 13:34:59 - INFO - __main__ - Step 116345: {'lr': 6.113086269108914e-05, 'samples': 22338240, 'steps': 116344, 'loss/train': 1.2744773626327515} 11/07/2021 13:35:00 - INFO - __main__ - Step 116346: {'lr': 6.11273858837557e-05, 'samples': 22338432, 'steps': 116345, 'loss/train': 1.177225947380066} 11/07/2021 13:35:00 - INFO - __main__ - Step 116347: {'lr': 6.11239091615253e-05, 'samples': 22338624, 'steps': 116346, 'loss/train': 1.3975071907043457} 11/07/2021 13:35:00 - INFO - __main__ - Step 116348: {'lr': 6.112043252439958e-05, 'samples': 22338816, 'steps': 116347, 'loss/train': 1.2157191038131714} 11/07/2021 13:35:02 - INFO - __main__ - Step 116349: {'lr': 6.111695597238006e-05, 'samples': 22339008, 'steps': 116348, 'loss/train': 1.431906819343567} 11/07/2021 13:35:02 - INFO - __main__ - Step 116350: {'lr': 6.111347950546845e-05, 'samples': 22339200, 'steps': 116349, 'loss/train': 1.454278826713562} 11/07/2021 13:35:02 - INFO - __main__ - Step 116351: {'lr': 6.111000312366607e-05, 'samples': 22339392, 'steps': 116350, 'loss/train': 1.4509209394454956} 11/07/2021 13:35:03 - INFO - __main__ - Step 116352: {'lr': 6.110652682697462e-05, 'samples': 22339584, 'steps': 116351, 'loss/train': 0.972993016242981} 11/07/2021 13:35:03 - INFO - __main__ - Step 116353: {'lr': 6.110305061539565e-05, 'samples': 22339776, 'steps': 116352, 'loss/train': 0.8950340747833252} 11/07/2021 13:35:04 - INFO - __main__ - Step 116354: {'lr': 6.109957448893074e-05, 'samples': 22339968, 'steps': 116353, 'loss/train': 1.1721638441085815} 11/07/2021 13:35:05 - INFO - __main__ - Step 116355: {'lr': 6.109609844758144e-05, 'samples': 22340160, 'steps': 116354, 'loss/train': 1.2087116241455078} 11/07/2021 13:35:05 - INFO - __main__ - Step 116356: {'lr': 6.109262249134931e-05, 'samples': 22340352, 'steps': 116355, 'loss/train': 1.5749679803848267} 11/07/2021 13:35:05 - INFO - __main__ - Step 116357: {'lr': 6.108914662023596e-05, 'samples': 22340544, 'steps': 116356, 'loss/train': 1.556368350982666} 11/07/2021 13:35:06 - INFO - __main__ - Step 116358: {'lr': 6.108567083424291e-05, 'samples': 22340736, 'steps': 116357, 'loss/train': 1.3329087495803833} 11/07/2021 13:35:07 - INFO - __main__ - Step 116359: {'lr': 6.108219513337174e-05, 'samples': 22340928, 'steps': 116358, 'loss/train': 1.6060351133346558} 11/07/2021 13:35:07 - INFO - __main__ - Step 116360: {'lr': 6.1078719517624e-05, 'samples': 22341120, 'steps': 116359, 'loss/train': 1.293129324913025} 11/07/2021 13:35:08 - INFO - __main__ - Step 116361: {'lr': 6.107524398700137e-05, 'samples': 22341312, 'steps': 116360, 'loss/train': 0.9448784589767456} 11/07/2021 13:35:08 - INFO - __main__ - Step 116362: {'lr': 6.107176854150526e-05, 'samples': 22341504, 'steps': 116361, 'loss/train': 1.1641967296600342} 11/07/2021 13:35:08 - INFO - __main__ - Step 116363: {'lr': 6.106829318113726e-05, 'samples': 22341696, 'steps': 116362, 'loss/train': 1.4358189105987549} 11/07/2021 13:35:10 - INFO - __main__ - Step 116364: {'lr': 6.106481790589901e-05, 'samples': 22341888, 'steps': 116363, 'loss/train': 1.2879470586776733} 11/07/2021 13:35:10 - INFO - __main__ - Step 116365: {'lr': 6.1061342715792e-05, 'samples': 22342080, 'steps': 116364, 'loss/train': 1.3697967529296875} 11/07/2021 13:35:10 - INFO - __main__ - Step 116366: {'lr': 6.105786761081786e-05, 'samples': 22342272, 'steps': 116365, 'loss/train': 0.9494787454605103} 11/07/2021 13:35:11 - INFO - __main__ - Step 116367: {'lr': 6.105439259097812e-05, 'samples': 22342464, 'steps': 116366, 'loss/train': 5.660369396209717} 11/07/2021 13:35:11 - INFO - __main__ - Step 116368: {'lr': 6.105091765627435e-05, 'samples': 22342656, 'steps': 116367, 'loss/train': 1.4715609550476074} 11/07/2021 13:35:11 - INFO - __main__ - Step 116369: {'lr': 6.104744280670813e-05, 'samples': 22342848, 'steps': 116368, 'loss/train': 3.156322479248047} 11/07/2021 13:35:12 - INFO - __main__ - Step 116370: {'lr': 6.104396804228101e-05, 'samples': 22343040, 'steps': 116369, 'loss/train': 3.4606709480285645} 11/07/2021 13:35:13 - INFO - __main__ - Step 116371: {'lr': 6.104049336299458e-05, 'samples': 22343232, 'steps': 116370, 'loss/train': 1.3726527690887451} 11/07/2021 13:35:13 - INFO - __main__ - Step 116372: {'lr': 6.103701876885043e-05, 'samples': 22343424, 'steps': 116371, 'loss/train': 1.1654012203216553} 11/07/2021 13:35:13 - INFO - __main__ - Step 116373: {'lr': 6.103354425985003e-05, 'samples': 22343616, 'steps': 116372, 'loss/train': 1.6768755912780762} 11/07/2021 13:35:14 - INFO - __main__ - Step 116374: {'lr': 6.1030069835995016e-05, 'samples': 22343808, 'steps': 116373, 'loss/train': 1.3636531829833984} 11/07/2021 13:35:14 - INFO - __main__ - Step 116375: {'lr': 6.10265954972869e-05, 'samples': 22344000, 'steps': 116374, 'loss/train': 1.478110432624817} 11/07/2021 13:35:15 - INFO - __main__ - Step 116376: {'lr': 6.1023121243727306e-05, 'samples': 22344192, 'steps': 116375, 'loss/train': 0.39889222383499146} 11/07/2021 13:35:16 - INFO - __main__ - Step 116377: {'lr': 6.1019647075317766e-05, 'samples': 22344384, 'steps': 116376, 'loss/train': 0.8334119915962219} 11/07/2021 13:35:16 - INFO - __main__ - Step 116378: {'lr': 6.101617299205986e-05, 'samples': 22344576, 'steps': 116377, 'loss/train': 1.3180876970291138} 11/07/2021 13:35:16 - INFO - __main__ - Step 116379: {'lr': 6.101269899395514e-05, 'samples': 22344768, 'steps': 116378, 'loss/train': 1.0081177949905396} 11/07/2021 13:35:17 - INFO - __main__ - Step 116380: {'lr': 6.1009225081005203e-05, 'samples': 22344960, 'steps': 116379, 'loss/train': 1.4420994520187378} 11/07/2021 13:35:18 - INFO - __main__ - Step 116381: {'lr': 6.1005751253211586e-05, 'samples': 22345152, 'steps': 116380, 'loss/train': 1.4326032400131226} 11/07/2021 13:35:18 - INFO - __main__ - Step 116382: {'lr': 6.100227751057588e-05, 'samples': 22345344, 'steps': 116381, 'loss/train': 1.4442064762115479} 11/07/2021 13:35:18 - INFO - __main__ - Step 116383: {'lr': 6.0998803853099666e-05, 'samples': 22345536, 'steps': 116382, 'loss/train': 1.2400343418121338} 11/07/2021 13:35:19 - INFO - __main__ - Step 116384: {'lr': 6.099533028078444e-05, 'samples': 22345728, 'steps': 116383, 'loss/train': 1.5136590003967285} 11/07/2021 13:35:19 - INFO - __main__ - Step 116385: {'lr': 6.0991856793631756e-05, 'samples': 22345920, 'steps': 116384, 'loss/train': 1.3484408855438232} 11/07/2021 13:35:20 - INFO - __main__ - Step 116386: {'lr': 6.0988383391643255e-05, 'samples': 22346112, 'steps': 116385, 'loss/train': 1.410912275314331} 11/07/2021 13:35:20 - INFO - __main__ - Step 116387: {'lr': 6.098491007482046e-05, 'samples': 22346304, 'steps': 116386, 'loss/train': 1.5549936294555664} 11/07/2021 13:35:21 - INFO - __main__ - Step 116388: {'lr': 6.098143684316496e-05, 'samples': 22346496, 'steps': 116387, 'loss/train': 1.4020518064498901} 11/07/2021 13:35:21 - INFO - __main__ - Step 116389: {'lr': 6.0977963696678294e-05, 'samples': 22346688, 'steps': 116388, 'loss/train': 1.4478974342346191} 11/07/2021 13:35:21 - INFO - __main__ - Step 116390: {'lr': 6.097449063536206e-05, 'samples': 22346880, 'steps': 116389, 'loss/train': 1.4030089378356934} 11/07/2021 13:35:22 - INFO - __main__ - Step 116391: {'lr': 6.097101765921781e-05, 'samples': 22347072, 'steps': 116390, 'loss/train': 1.3998687267303467} 11/07/2021 13:35:23 - INFO - __main__ - Step 116392: {'lr': 6.096754476824706e-05, 'samples': 22347264, 'steps': 116391, 'loss/train': 1.31907057762146} 11/07/2021 13:35:23 - INFO - __main__ - Step 116393: {'lr': 6.096407196245146e-05, 'samples': 22347456, 'steps': 116392, 'loss/train': 1.3861734867095947} 11/07/2021 13:35:23 - INFO - __main__ - Step 116394: {'lr': 6.0960599241832505e-05, 'samples': 22347648, 'steps': 116393, 'loss/train': 1.3243619203567505} 11/07/2021 13:35:24 - INFO - __main__ - Step 116395: {'lr': 6.09571266063918e-05, 'samples': 22347840, 'steps': 116394, 'loss/train': 0.6741749048233032} 11/07/2021 13:35:25 - INFO - __main__ - Step 116396: {'lr': 6.0953654056130955e-05, 'samples': 22348032, 'steps': 116395, 'loss/train': 1.1392418146133423} 11/07/2021 13:35:25 - INFO - __main__ - Step 116397: {'lr': 6.0950181591051425e-05, 'samples': 22348224, 'steps': 116396, 'loss/train': 1.0609939098358154} 11/07/2021 13:35:26 - INFO - __main__ - Step 116398: {'lr': 6.0946709211154806e-05, 'samples': 22348416, 'steps': 116397, 'loss/train': 1.6761256456375122} 11/07/2021 13:35:26 - INFO - __main__ - Step 116399: {'lr': 6.094323691644271e-05, 'samples': 22348608, 'steps': 116398, 'loss/train': 1.2895344495773315} 11/07/2021 13:35:26 - INFO - __main__ - Step 116400: {'lr': 6.0939764706916646e-05, 'samples': 22348800, 'steps': 116399, 'loss/train': 1.7105225324630737} 11/07/2021 13:35:27 - INFO - __main__ - Step 116401: {'lr': 6.0936292582578215e-05, 'samples': 22348992, 'steps': 116400, 'loss/train': 1.9343291521072388} 11/07/2021 13:35:28 - INFO - __main__ - Step 116402: {'lr': 6.093282054342897e-05, 'samples': 22349184, 'steps': 116401, 'loss/train': 0.9525164365768433} 11/07/2021 13:35:28 - INFO - __main__ - Step 116403: {'lr': 6.092934858947049e-05, 'samples': 22349376, 'steps': 116402, 'loss/train': 1.6020407676696777} 11/07/2021 13:35:28 - INFO - __main__ - Step 116404: {'lr': 6.092587672070432e-05, 'samples': 22349568, 'steps': 116403, 'loss/train': 1.5890274047851562} 11/07/2021 13:35:29 - INFO - __main__ - Step 116405: {'lr': 6.0922404937132054e-05, 'samples': 22349760, 'steps': 116404, 'loss/train': 1.6942285299301147} 11/07/2021 13:35:30 - INFO - __main__ - Step 116406: {'lr': 6.091893323875519e-05, 'samples': 22349952, 'steps': 116405, 'loss/train': 1.512725591659546} 11/07/2021 13:35:30 - INFO - __main__ - Step 116407: {'lr': 6.091546162557537e-05, 'samples': 22350144, 'steps': 116406, 'loss/train': 1.3254164457321167} 11/07/2021 13:35:31 - INFO - __main__ - Step 116408: {'lr': 6.091199009759413e-05, 'samples': 22350336, 'steps': 116407, 'loss/train': 0.7960078120231628} 11/07/2021 13:35:31 - INFO - __main__ - Step 116409: {'lr': 6.0908518654813007e-05, 'samples': 22350528, 'steps': 116408, 'loss/train': 1.5313470363616943} 11/07/2021 13:35:31 - INFO - __main__ - Step 116410: {'lr': 6.0905047297233676e-05, 'samples': 22350720, 'steps': 116409, 'loss/train': 1.4039616584777832} 11/07/2021 13:35:32 - INFO - __main__ - Step 116411: {'lr': 6.090157602485752e-05, 'samples': 22350912, 'steps': 116410, 'loss/train': 0.6514115333557129} 11/07/2021 13:35:33 - INFO - __main__ - Step 116412: {'lr': 6.0898104837686206e-05, 'samples': 22351104, 'steps': 116411, 'loss/train': 0.8785869479179382} 11/07/2021 13:35:33 - INFO - __main__ - Step 116413: {'lr': 6.089463373572129e-05, 'samples': 22351296, 'steps': 116412, 'loss/train': 1.104901671409607} 11/07/2021 13:35:33 - INFO - __main__ - Step 116414: {'lr': 6.089116271896436e-05, 'samples': 22351488, 'steps': 116413, 'loss/train': 1.3425242900848389} 11/07/2021 13:35:34 - INFO - __main__ - Step 116415: {'lr': 6.0887691787416903e-05, 'samples': 22351680, 'steps': 116414, 'loss/train': 2.156522274017334} 11/07/2021 13:35:34 - INFO - __main__ - Step 116416: {'lr': 6.0884220941080564e-05, 'samples': 22351872, 'steps': 116415, 'loss/train': 2.0010876655578613} 11/07/2021 13:35:35 - INFO - __main__ - Step 116417: {'lr': 6.08807501799569e-05, 'samples': 22352064, 'steps': 116416, 'loss/train': 1.5437067747116089} 11/07/2021 13:35:35 - INFO - __main__ - Step 116418: {'lr': 6.0877279504047396e-05, 'samples': 22352256, 'steps': 116417, 'loss/train': 1.4269671440124512} 11/07/2021 13:35:36 - INFO - __main__ - Step 116419: {'lr': 6.0873808913353704e-05, 'samples': 22352448, 'steps': 116418, 'loss/train': 1.373354434967041} 11/07/2021 13:35:36 - INFO - __main__ - Step 116420: {'lr': 6.087033840787737e-05, 'samples': 22352640, 'steps': 116419, 'loss/train': 1.4471533298492432} 11/07/2021 13:35:36 - INFO - __main__ - Step 116421: {'lr': 6.086686798761992e-05, 'samples': 22352832, 'steps': 116420, 'loss/train': 1.3916473388671875} 11/07/2021 13:35:37 - INFO - __main__ - Step 116422: {'lr': 6.0863397652582944e-05, 'samples': 22353024, 'steps': 116421, 'loss/train': 1.2678494453430176} 11/07/2021 13:35:38 - INFO - __main__ - Step 116423: {'lr': 6.085992740276808e-05, 'samples': 22353216, 'steps': 116422, 'loss/train': 1.1694531440734863} 11/07/2021 13:35:38 - INFO - __main__ - Step 116424: {'lr': 6.085645723817673e-05, 'samples': 22353408, 'steps': 116423, 'loss/train': 1.3190723657608032} 11/07/2021 13:35:38 - INFO - __main__ - Step 116425: {'lr': 6.0852987158810545e-05, 'samples': 22353600, 'steps': 116424, 'loss/train': 1.660751461982727} 11/07/2021 13:35:39 - INFO - __main__ - Step 116426: {'lr': 6.08495171646711e-05, 'samples': 22353792, 'steps': 116425, 'loss/train': 1.0839776992797852} 11/07/2021 13:35:40 - INFO - __main__ - Step 116427: {'lr': 6.0846047255759926e-05, 'samples': 22353984, 'steps': 116426, 'loss/train': 1.5764334201812744} 11/07/2021 13:35:40 - INFO - __main__ - Step 116428: {'lr': 6.0842577432078604e-05, 'samples': 22354176, 'steps': 116427, 'loss/train': 0.9992587566375732} 11/07/2021 13:35:41 - INFO - __main__ - Step 116429: {'lr': 6.083910769362871e-05, 'samples': 22354368, 'steps': 116428, 'loss/train': 1.2527967691421509} 11/07/2021 13:35:41 - INFO - __main__ - Step 116430: {'lr': 6.0835638040411815e-05, 'samples': 22354560, 'steps': 116429, 'loss/train': 1.4747414588928223} 11/07/2021 13:35:41 - INFO - __main__ - Step 116431: {'lr': 6.0832168472429424e-05, 'samples': 22354752, 'steps': 116430, 'loss/train': 1.278976321220398} 11/07/2021 13:35:42 - INFO - __main__ - Step 116432: {'lr': 6.082869898968316e-05, 'samples': 22354944, 'steps': 116431, 'loss/train': 1.6084431409835815} 11/07/2021 13:35:43 - INFO - __main__ - Step 116433: {'lr': 6.0825229592174544e-05, 'samples': 22355136, 'steps': 116432, 'loss/train': 1.538872241973877} 11/07/2021 13:35:43 - INFO - __main__ - Step 116434: {'lr': 6.0821760279905187e-05, 'samples': 22355328, 'steps': 116433, 'loss/train': 0.5034523606300354} 11/07/2021 13:35:43 - INFO - __main__ - Step 116435: {'lr': 6.081829105287662e-05, 'samples': 22355520, 'steps': 116434, 'loss/train': 1.7032618522644043} 11/07/2021 13:35:44 - INFO - __main__ - Step 116436: {'lr': 6.081482191109039e-05, 'samples': 22355712, 'steps': 116435, 'loss/train': 1.512654423713684} 11/07/2021 13:35:45 - INFO - __main__ - Step 116437: {'lr': 6.081135285454817e-05, 'samples': 22355904, 'steps': 116436, 'loss/train': 1.2253789901733398} 11/07/2021 13:35:45 - INFO - __main__ - Step 116438: {'lr': 6.080788388325137e-05, 'samples': 22356096, 'steps': 116437, 'loss/train': 0.9544686079025269} 11/07/2021 13:35:45 - INFO - __main__ - Step 116439: {'lr': 6.0804414997201604e-05, 'samples': 22356288, 'steps': 116438, 'loss/train': 1.4536045789718628} 11/07/2021 13:35:46 - INFO - __main__ - Step 116440: {'lr': 6.080094619640045e-05, 'samples': 22356480, 'steps': 116439, 'loss/train': 1.4053696393966675} 11/07/2021 13:35:46 - INFO - __main__ - Step 116441: {'lr': 6.079747748084949e-05, 'samples': 22356672, 'steps': 116440, 'loss/train': 1.2773774862289429} 11/07/2021 13:35:47 - INFO - __main__ - Step 116442: {'lr': 6.079400885055025e-05, 'samples': 22356864, 'steps': 116441, 'loss/train': 0.7227569818496704} 11/07/2021 13:35:48 - INFO - __main__ - Step 116443: {'lr': 6.079054030550432e-05, 'samples': 22357056, 'steps': 116442, 'loss/train': 1.5568922758102417} 11/07/2021 13:35:48 - INFO - __main__ - Step 116444: {'lr': 6.078707184571325e-05, 'samples': 22357248, 'steps': 116443, 'loss/train': 1.241224765777588} 11/07/2021 13:35:48 - INFO - __main__ - Step 116445: {'lr': 6.078360347117859e-05, 'samples': 22357440, 'steps': 116444, 'loss/train': 1.0998940467834473} 11/07/2021 13:35:49 - INFO - __main__ - Step 116446: {'lr': 6.0780135181901925e-05, 'samples': 22357632, 'steps': 116445, 'loss/train': 1.9443764686584473} 11/07/2021 13:35:49 - INFO - __main__ - Step 116447: {'lr': 6.077666697788481e-05, 'samples': 22357824, 'steps': 116446, 'loss/train': 1.676353096961975} 11/07/2021 13:35:50 - INFO - __main__ - Step 116448: {'lr': 6.0773198859128826e-05, 'samples': 22358016, 'steps': 116447, 'loss/train': 1.3804606199264526} 11/07/2021 13:35:50 - INFO - __main__ - Step 116449: {'lr': 6.0769730825635496e-05, 'samples': 22358208, 'steps': 116448, 'loss/train': 0.8394715189933777} 11/07/2021 13:35:51 - INFO - __main__ - Step 116450: {'lr': 6.0766262877406496e-05, 'samples': 22358400, 'steps': 116449, 'loss/train': 0.9642571210861206} 11/07/2021 13:35:51 - INFO - __main__ - Step 116451: {'lr': 6.076279501444323e-05, 'samples': 22358592, 'steps': 116450, 'loss/train': 1.1868696212768555} 11/07/2021 13:35:51 - INFO - __main__ - Step 116452: {'lr': 6.075932723674732e-05, 'samples': 22358784, 'steps': 116451, 'loss/train': 1.7757723331451416} 11/07/2021 13:35:53 - INFO - __main__ - Step 116453: {'lr': 6.075585954432033e-05, 'samples': 22358976, 'steps': 116452, 'loss/train': 1.406821608543396} 11/07/2021 13:35:53 - INFO - __main__ - Step 116454: {'lr': 6.075239193716384e-05, 'samples': 22359168, 'steps': 116453, 'loss/train': 1.2127841711044312} 11/07/2021 13:35:53 - INFO - __main__ - Step 116455: {'lr': 6.074892441527938e-05, 'samples': 22359360, 'steps': 116454, 'loss/train': 1.1517070531845093} 11/07/2021 13:35:54 - INFO - __main__ - Step 116456: {'lr': 6.074545697866854e-05, 'samples': 22359552, 'steps': 116455, 'loss/train': 1.2263898849487305} 11/07/2021 13:35:54 - INFO - __main__ - Step 116457: {'lr': 6.074198962733291e-05, 'samples': 22359744, 'steps': 116456, 'loss/train': 1.1439415216445923} 11/07/2021 13:35:55 - INFO - __main__ - Step 116458: {'lr': 6.073852236127397e-05, 'samples': 22359936, 'steps': 116457, 'loss/train': 1.7336537837982178} 11/07/2021 13:35:55 - INFO - __main__ - Step 116459: {'lr': 6.073505518049338e-05, 'samples': 22360128, 'steps': 116458, 'loss/train': 1.061104655265808} 11/07/2021 13:35:56 - INFO - __main__ - Step 116460: {'lr': 6.073158808499263e-05, 'samples': 22360320, 'steps': 116459, 'loss/train': 1.3343287706375122} 11/07/2021 13:35:56 - INFO - __main__ - Step 116461: {'lr': 6.072812107477329e-05, 'samples': 22360512, 'steps': 116460, 'loss/train': 1.4965178966522217} 11/07/2021 13:35:56 - INFO - __main__ - Step 116462: {'lr': 6.072465414983697e-05, 'samples': 22360704, 'steps': 116461, 'loss/train': 1.4102668762207031} 11/07/2021 13:35:57 - INFO - __main__ - Step 116463: {'lr': 6.072118731018517e-05, 'samples': 22360896, 'steps': 116462, 'loss/train': 1.2338086366653442} 11/07/2021 13:35:58 - INFO - __main__ - Step 116464: {'lr': 6.071772055581959e-05, 'samples': 22361088, 'steps': 116463, 'loss/train': 1.4827946424484253} 11/07/2021 13:35:58 - INFO - __main__ - Step 116465: {'lr': 6.0714253886741595e-05, 'samples': 22361280, 'steps': 116464, 'loss/train': 1.3274084329605103} 11/07/2021 13:35:58 - INFO - __main__ - Step 116466: {'lr': 6.071078730295282e-05, 'samples': 22361472, 'steps': 116465, 'loss/train': 1.555983066558838} 11/07/2021 13:35:59 - INFO - __main__ - Step 116467: {'lr': 6.0707320804454846e-05, 'samples': 22361664, 'steps': 116466, 'loss/train': 1.4198259115219116} 11/07/2021 13:35:59 - INFO - __main__ - Step 116468: {'lr': 6.0703854391249256e-05, 'samples': 22361856, 'steps': 116467, 'loss/train': 1.2571550607681274} 11/07/2021 13:36:01 - INFO - __main__ - Step 116469: {'lr': 6.070038806333758e-05, 'samples': 22362048, 'steps': 116468, 'loss/train': 1.157383918762207} 11/07/2021 13:36:01 - INFO - __main__ - Step 116470: {'lr': 6.069692182072137e-05, 'samples': 22362240, 'steps': 116469, 'loss/train': 1.6261106729507446} 11/07/2021 13:36:01 - INFO - __main__ - Step 116471: {'lr': 6.069345566340223e-05, 'samples': 22362432, 'steps': 116470, 'loss/train': 1.353670597076416} 11/07/2021 13:36:02 - INFO - __main__ - Step 116472: {'lr': 6.0689989591381666e-05, 'samples': 22362624, 'steps': 116471, 'loss/train': 1.2513160705566406} 11/07/2021 13:36:02 - INFO - __main__ - Step 116473: {'lr': 6.068652360466131e-05, 'samples': 22362816, 'steps': 116472, 'loss/train': 1.4427868127822876} 11/07/2021 13:36:02 - INFO - __main__ - Step 116474: {'lr': 6.068305770324267e-05, 'samples': 22363008, 'steps': 116473, 'loss/train': 1.4315388202667236} 11/07/2021 13:36:04 - INFO - __main__ - Step 116475: {'lr': 6.067959188712732e-05, 'samples': 22363200, 'steps': 116474, 'loss/train': 0.5183081030845642} 11/07/2021 13:36:04 - INFO - __main__ - Step 116476: {'lr': 6.067612615631682e-05, 'samples': 22363392, 'steps': 116475, 'loss/train': 0.12809863686561584} 11/07/2021 13:36:04 - INFO - __main__ - Step 116477: {'lr': 6.06726605108128e-05, 'samples': 22363584, 'steps': 116476, 'loss/train': 1.1810060739517212} 11/07/2021 13:36:05 - INFO - __main__ - Step 116478: {'lr': 6.066919495061671e-05, 'samples': 22363776, 'steps': 116477, 'loss/train': 0.9833187460899353} 11/07/2021 13:36:05 - INFO - __main__ - Step 116479: {'lr': 6.066572947573015e-05, 'samples': 22363968, 'steps': 116478, 'loss/train': 1.3805279731750488} 11/07/2021 13:36:06 - INFO - __main__ - Step 116480: {'lr': 6.066226408615469e-05, 'samples': 22364160, 'steps': 116479, 'loss/train': 1.630420446395874} 11/07/2021 13:36:06 - INFO - __main__ - Step 116481: {'lr': 6.065879878189187e-05, 'samples': 22364352, 'steps': 116480, 'loss/train': 1.4636191129684448} 11/07/2021 13:36:07 - INFO - __main__ - Step 116482: {'lr': 6.065533356294331e-05, 'samples': 22364544, 'steps': 116481, 'loss/train': 1.5517730712890625} 11/07/2021 13:36:07 - INFO - __main__ - Step 116483: {'lr': 6.0651868429310505e-05, 'samples': 22364736, 'steps': 116482, 'loss/train': 1.3162035942077637} 11/07/2021 13:36:08 - INFO - __main__ - Step 116484: {'lr': 6.064840338099506e-05, 'samples': 22364928, 'steps': 116483, 'loss/train': 1.3230845928192139} 11/07/2021 13:36:09 - INFO - __main__ - Step 116485: {'lr': 6.064493841799854e-05, 'samples': 22365120, 'steps': 116484, 'loss/train': 0.9579149484634399} 11/07/2021 13:36:09 - INFO - __main__ - Step 116486: {'lr': 6.064147354032246e-05, 'samples': 22365312, 'steps': 116485, 'loss/train': 1.5394517183303833} 11/07/2021 13:36:09 - INFO - __main__ - Step 116487: {'lr': 6.063800874796843e-05, 'samples': 22365504, 'steps': 116486, 'loss/train': 1.8959087133407593} 11/07/2021 13:36:10 - INFO - __main__ - Step 116488: {'lr': 6.063454404093796e-05, 'samples': 22365696, 'steps': 116487, 'loss/train': 1.2315319776535034} 11/07/2021 13:36:10 - INFO - __main__ - Step 116489: {'lr': 6.063107941923268e-05, 'samples': 22365888, 'steps': 116488, 'loss/train': 1.7571184635162354} 11/07/2021 13:36:11 - INFO - __main__ - Step 116490: {'lr': 6.0627614882854174e-05, 'samples': 22366080, 'steps': 116489, 'loss/train': 0.7027588486671448} 11/07/2021 13:36:11 - INFO - __main__ - Step 116491: {'lr': 6.062415043180386e-05, 'samples': 22366272, 'steps': 116490, 'loss/train': 1.3175297975540161} 11/07/2021 13:36:12 - INFO - __main__ - Step 116492: {'lr': 6.06206860660834e-05, 'samples': 22366464, 'steps': 116491, 'loss/train': 1.1457825899124146} 11/07/2021 13:36:12 - INFO - __main__ - Step 116493: {'lr': 6.0617221785694315e-05, 'samples': 22366656, 'steps': 116492, 'loss/train': 1.4938971996307373} 11/07/2021 13:36:12 - INFO - __main__ - Step 116494: {'lr': 6.0613757590638196e-05, 'samples': 22366848, 'steps': 116493, 'loss/train': 1.360615611076355} 11/07/2021 13:36:13 - INFO - __main__ - Step 116495: {'lr': 6.0610293480916595e-05, 'samples': 22367040, 'steps': 116494, 'loss/train': 1.016119122505188} 11/07/2021 13:36:14 - INFO - __main__ - Step 116496: {'lr': 6.060682945653106e-05, 'samples': 22367232, 'steps': 116495, 'loss/train': 1.475079894065857} 11/07/2021 13:36:14 - INFO - __main__ - Step 116497: {'lr': 6.0603365517483186e-05, 'samples': 22367424, 'steps': 116496, 'loss/train': 1.9045474529266357} 11/07/2021 13:36:15 - INFO - __main__ - Step 116498: {'lr': 6.059990166377452e-05, 'samples': 22367616, 'steps': 116497, 'loss/train': 1.154112458229065} 11/07/2021 13:36:15 - INFO - __main__ - Step 116499: {'lr': 6.0596437895406615e-05, 'samples': 22367808, 'steps': 116498, 'loss/train': 1.4901313781738281} 11/07/2021 13:36:15 - INFO - __main__ - Step 116500: {'lr': 6.0592974212381e-05, 'samples': 22368000, 'steps': 116499, 'loss/train': 1.1283774375915527} 11/07/2021 13:36:16 - INFO - __main__ - Step 116501: {'lr': 6.0589510614699305e-05, 'samples': 22368192, 'steps': 116500, 'loss/train': 1.1330300569534302} 11/07/2021 13:36:17 - INFO - __main__ - Step 116502: {'lr': 6.058604710236304e-05, 'samples': 22368384, 'steps': 116501, 'loss/train': 1.5602339506149292} 11/07/2021 13:36:17 - INFO - __main__ - Step 116503: {'lr': 6.058258367537376e-05, 'samples': 22368576, 'steps': 116502, 'loss/train': 1.6601676940917969} 11/07/2021 13:36:17 - INFO - __main__ - Step 116504: {'lr': 6.057912033373314e-05, 'samples': 22368768, 'steps': 116503, 'loss/train': 1.3707396984100342} 11/07/2021 13:36:18 - INFO - __main__ - Step 116505: {'lr': 6.057565707744256e-05, 'samples': 22368960, 'steps': 116504, 'loss/train': 1.6533197164535522} 11/07/2021 13:36:19 - INFO - __main__ - Step 116506: {'lr': 6.0572193906503676e-05, 'samples': 22369152, 'steps': 116505, 'loss/train': 1.7793828248977661} 11/07/2021 13:36:19 - INFO - __main__ - Step 116507: {'lr': 6.0568730820918044e-05, 'samples': 22369344, 'steps': 116506, 'loss/train': 2.157790184020996} 11/07/2021 13:36:20 - INFO - __main__ - Step 116508: {'lr': 6.056526782068719e-05, 'samples': 22369536, 'steps': 116507, 'loss/train': 1.5929471254348755} 11/07/2021 13:36:20 - INFO - __main__ - Step 116509: {'lr': 6.056180490581273e-05, 'samples': 22369728, 'steps': 116508, 'loss/train': 1.0125572681427002} 11/07/2021 13:36:20 - INFO - __main__ - Step 116510: {'lr': 6.055834207629621e-05, 'samples': 22369920, 'steps': 116509, 'loss/train': 1.3165316581726074} 11/07/2021 13:36:21 - INFO - __main__ - Step 116511: {'lr': 6.055487933213916e-05, 'samples': 22370112, 'steps': 116510, 'loss/train': 1.6842255592346191} 11/07/2021 13:36:22 - INFO - __main__ - Step 116512: {'lr': 6.055141667334313e-05, 'samples': 22370304, 'steps': 116511, 'loss/train': 1.258949875831604} 11/07/2021 13:36:22 - INFO - __main__ - Step 116513: {'lr': 6.0547954099909736e-05, 'samples': 22370496, 'steps': 116512, 'loss/train': 1.6809492111206055} 11/07/2021 13:36:22 - INFO - __main__ - Step 116514: {'lr': 6.054449161184053e-05, 'samples': 22370688, 'steps': 116513, 'loss/train': 2.176130533218384} 11/07/2021 13:36:23 - INFO - __main__ - Step 116515: {'lr': 6.054102920913701e-05, 'samples': 22370880, 'steps': 116514, 'loss/train': 1.5172865390777588} 11/07/2021 13:36:24 - INFO - __main__ - Step 116516: {'lr': 6.0537566891800815e-05, 'samples': 22371072, 'steps': 116515, 'loss/train': 1.4647811651229858} 11/07/2021 13:36:24 - INFO - __main__ - Step 116517: {'lr': 6.053410465983353e-05, 'samples': 22371264, 'steps': 116516, 'loss/train': 1.1861892938613892} 11/07/2021 13:36:24 - INFO - __main__ - Step 116518: {'lr': 6.053064251323656e-05, 'samples': 22371456, 'steps': 116517, 'loss/train': 1.5234730243682861} 11/07/2021 13:36:25 - INFO - __main__ - Step 116519: {'lr': 6.052718045201158e-05, 'samples': 22371648, 'steps': 116518, 'loss/train': 1.643728256225586} 11/07/2021 13:36:25 - INFO - __main__ - Step 116520: {'lr': 6.052371847616015e-05, 'samples': 22371840, 'steps': 116519, 'loss/train': 1.3251994848251343} 11/07/2021 13:36:26 - INFO - __main__ - Step 116521: {'lr': 6.0520256585683774e-05, 'samples': 22372032, 'steps': 116520, 'loss/train': 1.4006612300872803} 11/07/2021 13:36:26 - INFO - __main__ - Step 116522: {'lr': 6.051679478058405e-05, 'samples': 22372224, 'steps': 116521, 'loss/train': 2.011695623397827} 11/07/2021 13:36:27 - INFO - __main__ - Step 116523: {'lr': 6.051333306086254e-05, 'samples': 22372416, 'steps': 116522, 'loss/train': 1.3115612268447876} 11/07/2021 13:36:27 - INFO - __main__ - Step 116524: {'lr': 6.0509871426520814e-05, 'samples': 22372608, 'steps': 116523, 'loss/train': 0.7295114994049072} 11/07/2021 13:36:28 - INFO - __main__ - Step 116525: {'lr': 6.05064098775604e-05, 'samples': 22372800, 'steps': 116524, 'loss/train': 1.468630075454712} 11/07/2021 13:36:28 - INFO - __main__ - Step 116526: {'lr': 6.050294841398285e-05, 'samples': 22372992, 'steps': 116525, 'loss/train': 0.9894965291023254} 11/07/2021 13:36:29 - INFO - __main__ - Step 116527: {'lr': 6.049948703578978e-05, 'samples': 22373184, 'steps': 116526, 'loss/train': 0.862932562828064} 11/07/2021 13:36:29 - INFO - __main__ - Step 116528: {'lr': 6.0496025742982715e-05, 'samples': 22373376, 'steps': 116527, 'loss/train': 1.2781537771224976} 11/07/2021 13:36:30 - INFO - __main__ - Step 116529: {'lr': 6.0492564535563204e-05, 'samples': 22373568, 'steps': 116528, 'loss/train': 1.5087898969650269} 11/07/2021 13:36:30 - INFO - __main__ - Step 116530: {'lr': 6.048910341353284e-05, 'samples': 22373760, 'steps': 116529, 'loss/train': 1.295822024345398} 11/07/2021 13:36:30 - INFO - __main__ - Step 116531: {'lr': 6.0485642376893216e-05, 'samples': 22373952, 'steps': 116530, 'loss/train': 1.5246609449386597} 11/07/2021 13:36:32 - INFO - __main__ - Step 116532: {'lr': 6.048218142564577e-05, 'samples': 22374144, 'steps': 116531, 'loss/train': 1.4063018560409546} 11/07/2021 13:36:32 - INFO - __main__ - Step 116533: {'lr': 6.047872055979212e-05, 'samples': 22374336, 'steps': 116532, 'loss/train': 2.0617754459381104} 11/07/2021 13:36:33 - INFO - __main__ - Step 116534: {'lr': 6.0475259779333854e-05, 'samples': 22374528, 'steps': 116533, 'loss/train': 1.1455450057983398} 11/07/2021 13:36:33 - INFO - __main__ - Step 116535: {'lr': 6.04717990842725e-05, 'samples': 22374720, 'steps': 116534, 'loss/train': 1.6930010318756104} 11/07/2021 13:36:33 - INFO - __main__ - Step 116536: {'lr': 6.046833847460961e-05, 'samples': 22374912, 'steps': 116535, 'loss/train': 1.3122202157974243} 11/07/2021 13:36:34 - INFO - __main__ - Step 116537: {'lr': 6.046487795034678e-05, 'samples': 22375104, 'steps': 116536, 'loss/train': 0.9179819822311401} 11/07/2021 13:36:35 - INFO - __main__ - Step 116538: {'lr': 6.0461417511485565e-05, 'samples': 22375296, 'steps': 116537, 'loss/train': 0.4049886167049408} 11/07/2021 13:36:36 - INFO - __main__ - Step 116539: {'lr': 6.0457957158027486e-05, 'samples': 22375488, 'steps': 116538, 'loss/train': 0.7802691459655762} 11/07/2021 13:36:36 - INFO - __main__ - Step 116540: {'lr': 6.045449688997415e-05, 'samples': 22375680, 'steps': 116539, 'loss/train': 1.3417367935180664} 11/07/2021 13:36:36 - INFO - __main__ - Step 116541: {'lr': 6.045103670732707e-05, 'samples': 22375872, 'steps': 116540, 'loss/train': 1.5721001625061035} 11/07/2021 13:36:37 - INFO - __main__ - Step 116542: {'lr': 6.044757661008785e-05, 'samples': 22376064, 'steps': 116541, 'loss/train': 1.1698328256607056} 11/07/2021 13:36:37 - INFO - __main__ - Step 116543: {'lr': 6.044411659825799e-05, 'samples': 22376256, 'steps': 116542, 'loss/train': 0.8788069486618042} 11/07/2021 13:36:38 - INFO - __main__ - Step 116544: {'lr': 6.0440656671839204e-05, 'samples': 22376448, 'steps': 116543, 'loss/train': 3.7619850635528564} 11/07/2021 13:36:38 - INFO - __main__ - Step 116545: {'lr': 6.043719683083282e-05, 'samples': 22376640, 'steps': 116544, 'loss/train': 0.2567160427570343} 11/07/2021 13:36:39 - INFO - __main__ - Step 116546: {'lr': 6.043373707524055e-05, 'samples': 22376832, 'steps': 116545, 'loss/train': 1.326436996459961} 11/07/2021 13:36:39 - INFO - __main__ - Step 116547: {'lr': 6.0430277405063875e-05, 'samples': 22377024, 'steps': 116546, 'loss/train': 0.9184982180595398} 11/07/2021 13:36:39 - INFO - __main__ - Step 116548: {'lr': 6.042681782030443e-05, 'samples': 22377216, 'steps': 116547, 'loss/train': 1.3100110292434692} 11/07/2021 13:36:40 - INFO - __main__ - Step 116549: {'lr': 6.0423358320963715e-05, 'samples': 22377408, 'steps': 116548, 'loss/train': 1.0779850482940674} 11/07/2021 13:36:41 - INFO - __main__ - Step 116550: {'lr': 6.04198989070433e-05, 'samples': 22377600, 'steps': 116549, 'loss/train': 1.205915927886963} 11/07/2021 13:36:41 - INFO - __main__ - Step 116551: {'lr': 6.041643957854476e-05, 'samples': 22377792, 'steps': 116550, 'loss/train': 1.46928071975708} 11/07/2021 13:36:41 - INFO - __main__ - Step 116552: {'lr': 6.041298033546966e-05, 'samples': 22377984, 'steps': 116551, 'loss/train': 1.7746033668518066} 11/07/2021 13:36:42 - INFO - __main__ - Step 116553: {'lr': 6.040952117781953e-05, 'samples': 22378176, 'steps': 116552, 'loss/train': 1.6570372581481934} 11/07/2021 13:36:43 - INFO - __main__ - Step 116554: {'lr': 6.040606210559593e-05, 'samples': 22378368, 'steps': 116553, 'loss/train': 1.103076696395874} 11/07/2021 13:36:43 - INFO - __main__ - Step 116555: {'lr': 6.040260311880047e-05, 'samples': 22378560, 'steps': 116554, 'loss/train': 1.5813924074172974} 11/07/2021 13:36:44 - INFO - __main__ - Step 116556: {'lr': 6.039914421743464e-05, 'samples': 22378752, 'steps': 116555, 'loss/train': 1.1517115831375122} 11/07/2021 13:36:44 - INFO - __main__ - Step 116557: {'lr': 6.039568540150006e-05, 'samples': 22378944, 'steps': 116556, 'loss/train': 1.314858317375183} 11/07/2021 13:36:44 - INFO - __main__ - Step 116558: {'lr': 6.039222667099831e-05, 'samples': 22379136, 'steps': 116557, 'loss/train': 1.5546441078186035} 11/07/2021 13:36:45 - INFO - __main__ - Step 116559: {'lr': 6.038876802593082e-05, 'samples': 22379328, 'steps': 116558, 'loss/train': 1.205175757408142} 11/07/2021 13:36:46 - INFO - __main__ - Step 116560: {'lr': 6.038530946629925e-05, 'samples': 22379520, 'steps': 116559, 'loss/train': 0.8402830362319946} 11/07/2021 13:36:46 - INFO - __main__ - Step 116561: {'lr': 6.038185099210511e-05, 'samples': 22379712, 'steps': 116560, 'loss/train': 1.4952958822250366} 11/07/2021 13:36:46 - INFO - __main__ - Step 116562: {'lr': 6.037839260334999e-05, 'samples': 22379904, 'steps': 116561, 'loss/train': 1.5708682537078857} 11/07/2021 13:36:47 - INFO - __main__ - Step 116563: {'lr': 6.037493430003543e-05, 'samples': 22380096, 'steps': 116562, 'loss/train': 1.302416443824768} 11/07/2021 13:36:47 - INFO - __main__ - Step 116564: {'lr': 6.0371476082163006e-05, 'samples': 22380288, 'steps': 116563, 'loss/train': 1.401304841041565} 11/07/2021 13:36:48 - INFO - __main__ - Step 116565: {'lr': 6.036801794973429e-05, 'samples': 22380480, 'steps': 116564, 'loss/train': 1.4297974109649658} 11/07/2021 13:36:48 - INFO - __main__ - Step 116566: {'lr': 6.036455990275078e-05, 'samples': 22380672, 'steps': 116565, 'loss/train': 1.1643409729003906} 11/07/2021 13:36:49 - INFO - __main__ - Step 116567: {'lr': 6.036110194121411e-05, 'samples': 22380864, 'steps': 116566, 'loss/train': 1.4368818998336792} 11/07/2021 13:36:49 - INFO - __main__ - Step 116568: {'lr': 6.035764406512578e-05, 'samples': 22381056, 'steps': 116567, 'loss/train': 1.5050373077392578} 11/07/2021 13:36:50 - INFO - __main__ - Step 116569: {'lr': 6.0354186274487356e-05, 'samples': 22381248, 'steps': 116568, 'loss/train': 1.358869194984436} 11/07/2021 13:36:51 - INFO - __main__ - Step 116570: {'lr': 6.0350728569300434e-05, 'samples': 22381440, 'steps': 116569, 'loss/train': 1.1755764484405518} 11/07/2021 13:36:51 - INFO - __main__ - Step 116571: {'lr': 6.0347270949566626e-05, 'samples': 22381632, 'steps': 116570, 'loss/train': 0.8963015675544739} 11/07/2021 13:36:51 - INFO - __main__ - Step 116572: {'lr': 6.03438134152873e-05, 'samples': 22381824, 'steps': 116571, 'loss/train': 1.2981470823287964} 11/07/2021 13:36:52 - INFO - __main__ - Step 116573: {'lr': 6.034035596646417e-05, 'samples': 22382016, 'steps': 116572, 'loss/train': 1.2010583877563477} 11/07/2021 13:36:52 - INFO - __main__ - Step 116574: {'lr': 6.033689860309871e-05, 'samples': 22382208, 'steps': 116573, 'loss/train': 1.2418192625045776} 11/07/2021 13:36:53 - INFO - __main__ - Step 116575: {'lr': 6.0333441325192557e-05, 'samples': 22382400, 'steps': 116574, 'loss/train': 1.638004183769226} 11/07/2021 13:36:54 - INFO - __main__ - Step 116576: {'lr': 6.032998413274721e-05, 'samples': 22382592, 'steps': 116575, 'loss/train': 1.3522907495498657} 11/07/2021 13:36:54 - INFO - __main__ - Step 116577: {'lr': 6.032652702576424e-05, 'samples': 22382784, 'steps': 116576, 'loss/train': 0.7118661999702454} 11/07/2021 13:36:54 - INFO - __main__ - Step 116578: {'lr': 6.03230700042452e-05, 'samples': 22382976, 'steps': 116577, 'loss/train': 1.3256767988204956} 11/07/2021 13:36:55 - INFO - __main__ - Step 116579: {'lr': 6.031961306819167e-05, 'samples': 22383168, 'steps': 116578, 'loss/train': 0.3344864547252655} 11/07/2021 13:36:56 - INFO - __main__ - Step 116580: {'lr': 6.031615621760519e-05, 'samples': 22383360, 'steps': 116579, 'loss/train': 1.2524049282073975} 11/07/2021 13:36:56 - INFO - __main__ - Step 116581: {'lr': 6.031269945248735e-05, 'samples': 22383552, 'steps': 116580, 'loss/train': 1.1369739770889282} 11/07/2021 13:36:57 - INFO - __main__ - Step 116582: {'lr': 6.030924277283964e-05, 'samples': 22383744, 'steps': 116581, 'loss/train': 1.4021354913711548} 11/07/2021 13:36:57 - INFO - __main__ - Step 116583: {'lr': 6.0305786178663693e-05, 'samples': 22383936, 'steps': 116582, 'loss/train': 0.5148460865020752} 11/07/2021 13:36:57 - INFO - __main__ - Step 116584: {'lr': 6.030232966996102e-05, 'samples': 22384128, 'steps': 116583, 'loss/train': 1.5417089462280273} 11/07/2021 13:36:58 - INFO - __main__ - Step 116585: {'lr': 6.029887324673325e-05, 'samples': 22384320, 'steps': 116584, 'loss/train': 1.2885174751281738} 11/07/2021 13:36:59 - INFO - __main__ - Step 116586: {'lr': 6.029541690898183e-05, 'samples': 22384512, 'steps': 116585, 'loss/train': 1.343186616897583} 11/07/2021 13:36:59 - INFO - __main__ - Step 116587: {'lr': 6.029196065670833e-05, 'samples': 22384704, 'steps': 116586, 'loss/train': 1.3703750371932983} 11/07/2021 13:37:00 - INFO - __main__ - Step 116588: {'lr': 6.028850448991438e-05, 'samples': 22384896, 'steps': 116587, 'loss/train': 1.463145136833191} 11/07/2021 13:37:00 - INFO - __main__ - Step 116589: {'lr': 6.02850484086015e-05, 'samples': 22385088, 'steps': 116588, 'loss/train': 1.1425294876098633} 11/07/2021 13:37:01 - INFO - __main__ - Step 116590: {'lr': 6.028159241277123e-05, 'samples': 22385280, 'steps': 116589, 'loss/train': 1.3154205083847046} 11/07/2021 13:37:02 - INFO - __main__ - Step 116591: {'lr': 6.027813650242517e-05, 'samples': 22385472, 'steps': 116590, 'loss/train': 1.7379779815673828} 11/07/2021 13:37:02 - INFO - __main__ - Step 116592: {'lr': 6.0274680677564835e-05, 'samples': 22385664, 'steps': 116591, 'loss/train': 1.0818729400634766} 11/07/2021 13:37:02 - INFO - __main__ - Step 116593: {'lr': 6.027122493819182e-05, 'samples': 22385856, 'steps': 116592, 'loss/train': 1.3952034711837769} 11/07/2021 13:37:03 - INFO - __main__ - Step 116594: {'lr': 6.026776928430763e-05, 'samples': 22386048, 'steps': 116593, 'loss/train': 1.397139072418213} 11/07/2021 13:37:03 - INFO - __main__ - Step 116595: {'lr': 6.02643137159139e-05, 'samples': 22386240, 'steps': 116594, 'loss/train': 1.533809781074524} 11/07/2021 13:37:04 - INFO - __main__ - Step 116596: {'lr': 6.026085823301211e-05, 'samples': 22386432, 'steps': 116595, 'loss/train': 1.2682111263275146} 11/07/2021 13:37:04 - INFO - __main__ - Step 116597: {'lr': 6.0257402835603934e-05, 'samples': 22386624, 'steps': 116596, 'loss/train': 1.6272225379943848} 11/07/2021 13:37:05 - INFO - __main__ - Step 116598: {'lr': 6.025394752369076e-05, 'samples': 22386816, 'steps': 116597, 'loss/train': 1.506327748298645} 11/07/2021 13:37:05 - INFO - __main__ - Step 116599: {'lr': 6.025049229727425e-05, 'samples': 22387008, 'steps': 116598, 'loss/train': 1.2156614065170288} 11/07/2021 13:37:05 - INFO - __main__ - Step 116600: {'lr': 6.0247037156355935e-05, 'samples': 22387200, 'steps': 116599, 'loss/train': 0.9313337802886963} 11/07/2021 13:37:06 - INFO - __main__ - Step 116601: {'lr': 6.024358210093736e-05, 'samples': 22387392, 'steps': 116600, 'loss/train': 1.5723378658294678} 11/07/2021 13:37:07 - INFO - __main__ - Step 116602: {'lr': 6.024012713102012e-05, 'samples': 22387584, 'steps': 116601, 'loss/train': 1.142594814300537} 11/07/2021 13:37:07 - INFO - __main__ - Step 116603: {'lr': 6.023667224660573e-05, 'samples': 22387776, 'steps': 116602, 'loss/train': 1.31128990650177} 11/07/2021 13:37:08 - INFO - __main__ - Step 116604: {'lr': 6.023321744769578e-05, 'samples': 22387968, 'steps': 116603, 'loss/train': 0.5294831991195679} 11/07/2021 13:37:08 - INFO - __main__ - Step 116605: {'lr': 6.022976273429182e-05, 'samples': 22388160, 'steps': 116604, 'loss/train': 1.47433602809906} 11/07/2021 13:37:09 - INFO - __main__ - Step 116606: {'lr': 6.02263081063954e-05, 'samples': 22388352, 'steps': 116605, 'loss/train': 1.3560872077941895} 11/07/2021 13:37:10 - INFO - __main__ - Step 116607: {'lr': 6.0222853564008056e-05, 'samples': 22388544, 'steps': 116606, 'loss/train': 1.091264247894287} 11/07/2021 13:37:10 - INFO - __main__ - Step 116608: {'lr': 6.021939910713145e-05, 'samples': 22388736, 'steps': 116607, 'loss/train': 2.3888304233551025} 11/07/2021 13:37:10 - INFO - __main__ - Step 116609: {'lr': 6.0215944735767e-05, 'samples': 22388928, 'steps': 116608, 'loss/train': 1.290395975112915} 11/07/2021 13:37:11 - INFO - __main__ - Step 116610: {'lr': 6.021249044991628e-05, 'samples': 22389120, 'steps': 116609, 'loss/train': 1.344673752784729} 11/07/2021 13:37:13 - INFO - __main__ - Step 116611: {'lr': 6.0209036249580905e-05, 'samples': 22389312, 'steps': 116610, 'loss/train': 1.4711428880691528} 11/07/2021 13:37:13 - INFO - __main__ - Step 116612: {'lr': 6.020558213476243e-05, 'samples': 22389504, 'steps': 116611, 'loss/train': 1.4761073589324951} 11/07/2021 13:37:14 - INFO - __main__ - Step 116613: {'lr': 6.020212810546236e-05, 'samples': 22389696, 'steps': 116612, 'loss/train': 1.7400124073028564} 11/07/2021 13:37:14 - INFO - __main__ - Step 116614: {'lr': 6.0198674161682285e-05, 'samples': 22389888, 'steps': 116613, 'loss/train': 1.750777244567871} 11/07/2021 13:37:14 - INFO - __main__ - Step 116615: {'lr': 6.019522030342378e-05, 'samples': 22390080, 'steps': 116614, 'loss/train': 0.6729578971862793} 11/07/2021 13:37:15 - INFO - __main__ - Step 116616: {'lr': 6.019176653068836e-05, 'samples': 22390272, 'steps': 116615, 'loss/train': 1.56698739528656} 11/07/2021 13:37:15 - INFO - __main__ - Step 116617: {'lr': 6.018831284347762e-05, 'samples': 22390464, 'steps': 116616, 'loss/train': 0.9853206276893616} 11/07/2021 13:37:15 - INFO - __main__ - Step 116618: {'lr': 6.0184859241793064e-05, 'samples': 22390656, 'steps': 116617, 'loss/train': 1.668282151222229} 11/07/2021 13:37:16 - INFO - __main__ - Step 116619: {'lr': 6.018140572563638e-05, 'samples': 22390848, 'steps': 116618, 'loss/train': 1.3910447359085083} 11/07/2021 13:37:17 - INFO - __main__ - Step 116620: {'lr': 6.017795229500894e-05, 'samples': 22391040, 'steps': 116619, 'loss/train': 1.6862566471099854} 11/07/2021 13:37:17 - INFO - __main__ - Step 116621: {'lr': 6.0174498949912396e-05, 'samples': 22391232, 'steps': 116620, 'loss/train': 0.9245049953460693} 11/07/2021 13:37:17 - INFO - __main__ - Step 116622: {'lr': 6.0171045690348285e-05, 'samples': 22391424, 'steps': 116621, 'loss/train': 1.3005543947219849} 11/07/2021 13:37:18 - INFO - __main__ - Step 116623: {'lr': 6.016759251631818e-05, 'samples': 22391616, 'steps': 116622, 'loss/train': 1.2396634817123413} 11/07/2021 13:37:19 - INFO - __main__ - Step 116624: {'lr': 6.016413942782362e-05, 'samples': 22391808, 'steps': 116623, 'loss/train': 1.5328902006149292} 11/07/2021 13:37:19 - INFO - __main__ - Step 116625: {'lr': 6.0160686424866193e-05, 'samples': 22392000, 'steps': 116624, 'loss/train': 1.681913137435913} 11/07/2021 13:37:20 - INFO - __main__ - Step 116626: {'lr': 6.0157233507447394e-05, 'samples': 22392192, 'steps': 116625, 'loss/train': 1.3760477304458618} 11/07/2021 13:37:20 - INFO - __main__ - Step 116627: {'lr': 6.015378067556884e-05, 'samples': 22392384, 'steps': 116626, 'loss/train': 1.343566656112671} 11/07/2021 13:37:20 - INFO - __main__ - Step 116628: {'lr': 6.015032792923206e-05, 'samples': 22392576, 'steps': 116627, 'loss/train': 0.8601738214492798} 11/07/2021 13:37:21 - INFO - __main__ - Step 116629: {'lr': 6.0146875268438595e-05, 'samples': 22392768, 'steps': 116628, 'loss/train': 1.160861849784851} 11/07/2021 13:37:22 - INFO - __main__ - Step 116630: {'lr': 6.014342269319012e-05, 'samples': 22392960, 'steps': 116629, 'loss/train': 1.0489484071731567} 11/07/2021 13:37:22 - INFO - __main__ - Step 116631: {'lr': 6.013997020348799e-05, 'samples': 22393152, 'steps': 116630, 'loss/train': 1.1672903299331665} 11/07/2021 13:37:22 - INFO - __main__ - Step 116632: {'lr': 6.013651779933388e-05, 'samples': 22393344, 'steps': 116631, 'loss/train': 1.2466458082199097} 11/07/2021 13:37:23 - INFO - __main__ - Step 116633: {'lr': 6.0133065480729334e-05, 'samples': 22393536, 'steps': 116632, 'loss/train': 1.597133755683899} 11/07/2021 13:37:24 - INFO - __main__ - Step 116634: {'lr': 6.012961324767588e-05, 'samples': 22393728, 'steps': 116633, 'loss/train': 1.3890283107757568} 11/07/2021 13:37:24 - INFO - __main__ - Step 116635: {'lr': 6.012616110017508e-05, 'samples': 22393920, 'steps': 116634, 'loss/train': 1.0723567008972168} 11/07/2021 13:37:24 - INFO - __main__ - Step 116636: {'lr': 6.012270903822853e-05, 'samples': 22394112, 'steps': 116635, 'loss/train': 0.8833932280540466} 11/07/2021 13:37:25 - INFO - __main__ - Step 116637: {'lr': 6.011925706183774e-05, 'samples': 22394304, 'steps': 116636, 'loss/train': 1.5132534503936768} 11/07/2021 13:37:25 - INFO - __main__ - Step 116638: {'lr': 6.011580517100429e-05, 'samples': 22394496, 'steps': 116637, 'loss/train': 1.5639909505844116} 11/07/2021 13:37:25 - INFO - __main__ - Step 116639: {'lr': 6.0112353365729706e-05, 'samples': 22394688, 'steps': 116638, 'loss/train': 1.4862170219421387} 11/07/2021 13:37:26 - INFO - __main__ - Step 116640: {'lr': 6.010890164601568e-05, 'samples': 22394880, 'steps': 116639, 'loss/train': 1.1201269626617432} 11/07/2021 13:37:27 - INFO - __main__ - Step 116641: {'lr': 6.010545001186354e-05, 'samples': 22395072, 'steps': 116640, 'loss/train': 1.26686692237854} 11/07/2021 13:37:27 - INFO - __main__ - Step 116642: {'lr': 6.010199846327496e-05, 'samples': 22395264, 'steps': 116641, 'loss/train': 1.2601600885391235} 11/07/2021 13:37:27 - INFO - __main__ - Step 116643: {'lr': 6.0098547000251524e-05, 'samples': 22395456, 'steps': 116642, 'loss/train': 1.1195874214172363} 11/07/2021 13:37:28 - INFO - __main__ - Step 116644: {'lr': 6.009509562279472e-05, 'samples': 22395648, 'steps': 116643, 'loss/train': 1.6728448867797852} 11/07/2021 13:37:29 - INFO - __main__ - Step 116645: {'lr': 6.009164433090614e-05, 'samples': 22395840, 'steps': 116644, 'loss/train': 1.2412792444229126} 11/07/2021 13:37:29 - INFO - __main__ - Step 116646: {'lr': 6.008819312458735e-05, 'samples': 22396032, 'steps': 116645, 'loss/train': 1.3823356628417969} 11/07/2021 13:37:30 - INFO - __main__ - Step 116647: {'lr': 6.008474200383987e-05, 'samples': 22396224, 'steps': 116646, 'loss/train': 1.4664669036865234} 11/07/2021 13:37:30 - INFO - __main__ - Step 116648: {'lr': 6.0081290968665296e-05, 'samples': 22396416, 'steps': 116647, 'loss/train': 1.0472404956817627} 11/07/2021 13:37:30 - INFO - __main__ - Step 116649: {'lr': 6.007784001906513e-05, 'samples': 22396608, 'steps': 116648, 'loss/train': 1.3248580694198608} 11/07/2021 13:37:31 - INFO - __main__ - Step 116650: {'lr': 6.0074389155040984e-05, 'samples': 22396800, 'steps': 116649, 'loss/train': 1.3836246728897095} 11/07/2021 13:37:32 - INFO - __main__ - Step 116651: {'lr': 6.0070938376594384e-05, 'samples': 22396992, 'steps': 116650, 'loss/train': 1.1628996133804321} 11/07/2021 13:37:32 - INFO - __main__ - Step 116652: {'lr': 6.0067487683726965e-05, 'samples': 22397184, 'steps': 116651, 'loss/train': 1.1869505643844604} 11/07/2021 13:37:32 - INFO - __main__ - Step 116653: {'lr': 6.006403707644012e-05, 'samples': 22397376, 'steps': 116652, 'loss/train': 1.122796654701233} 11/07/2021 13:37:33 - INFO - __main__ - Step 116654: {'lr': 6.006058655473548e-05, 'samples': 22397568, 'steps': 116653, 'loss/train': 1.64042067527771} 11/07/2021 13:37:34 - INFO - __main__ - Step 116655: {'lr': 6.005713611861463e-05, 'samples': 22397760, 'steps': 116654, 'loss/train': 1.405087947845459} 11/07/2021 13:37:34 - INFO - __main__ - Step 116656: {'lr': 6.00536857680791e-05, 'samples': 22397952, 'steps': 116655, 'loss/train': 1.252675175666809} 11/07/2021 13:37:35 - INFO - __main__ - Step 116657: {'lr': 6.0050235503130434e-05, 'samples': 22398144, 'steps': 116656, 'loss/train': 1.252802848815918} 11/07/2021 13:37:35 - INFO - __main__ - Step 116658: {'lr': 6.004678532377023e-05, 'samples': 22398336, 'steps': 116657, 'loss/train': 0.9154936671257019} 11/07/2021 13:37:35 - INFO - __main__ - Step 116659: {'lr': 6.004333523e-05, 'samples': 22398528, 'steps': 116658, 'loss/train': 1.0528228282928467} 11/07/2021 13:37:36 - INFO - __main__ - Step 116660: {'lr': 6.00398852218213e-05, 'samples': 22398720, 'steps': 116659, 'loss/train': 0.9136022925376892} 11/07/2021 13:37:37 - INFO - __main__ - Step 116661: {'lr': 6.003643529923569e-05, 'samples': 22398912, 'steps': 116660, 'loss/train': 1.4138087034225464} 11/07/2021 13:37:37 - INFO - __main__ - Step 116662: {'lr': 6.0032985462244756e-05, 'samples': 22399104, 'steps': 116661, 'loss/train': 1.1534205675125122} 11/07/2021 13:37:37 - INFO - __main__ - Step 116663: {'lr': 6.002953571085001e-05, 'samples': 22399296, 'steps': 116662, 'loss/train': 0.917469322681427} 11/07/2021 13:37:38 - INFO - __main__ - Step 116664: {'lr': 6.0026086045053025e-05, 'samples': 22399488, 'steps': 116663, 'loss/train': 1.189683198928833} 11/07/2021 13:37:38 - INFO - __main__ - Step 116665: {'lr': 6.002263646485545e-05, 'samples': 22399680, 'steps': 116664, 'loss/train': 1.6760830879211426} 11/07/2021 13:37:39 - INFO - __main__ - Step 116666: {'lr': 6.0019186970258655e-05, 'samples': 22399872, 'steps': 116665, 'loss/train': 1.2422678470611572} 11/07/2021 13:37:39 - INFO - __main__ - Step 116667: {'lr': 6.0015737561264275e-05, 'samples': 22400064, 'steps': 116666, 'loss/train': 1.4368116855621338} 11/07/2021 13:37:40 - INFO - __main__ - Step 116668: {'lr': 6.001228823787386e-05, 'samples': 22400256, 'steps': 116667, 'loss/train': 1.2392876148223877} 11/07/2021 13:37:40 - INFO - __main__ - Step 116669: {'lr': 6.000883900008899e-05, 'samples': 22400448, 'steps': 116668, 'loss/train': 1.426276683807373} 11/07/2021 13:37:40 - INFO - __main__ - Step 116670: {'lr': 6.000538984791121e-05, 'samples': 22400640, 'steps': 116669, 'loss/train': 1.016495943069458} 11/07/2021 13:37:42 - INFO - __main__ - Step 116671: {'lr': 6.000194078134208e-05, 'samples': 22400832, 'steps': 116670, 'loss/train': 1.2652157545089722} 11/07/2021 13:37:42 - INFO - __main__ - Step 116672: {'lr': 5.9998491800383137e-05, 'samples': 22401024, 'steps': 116671, 'loss/train': 1.3643908500671387} 11/07/2021 13:37:42 - INFO - __main__ - Step 116673: {'lr': 5.999504290503593e-05, 'samples': 22401216, 'steps': 116672, 'loss/train': 1.1797491312026978} 11/07/2021 13:37:43 - INFO - __main__ - Step 116674: {'lr': 5.999159409530203e-05, 'samples': 22401408, 'steps': 116673, 'loss/train': 1.3199290037155151} 11/07/2021 13:37:43 - INFO - __main__ - Step 116675: {'lr': 5.9988145371182996e-05, 'samples': 22401600, 'steps': 116674, 'loss/train': 1.5893601179122925} 11/07/2021 13:37:44 - INFO - __main__ - Step 116676: {'lr': 5.9984696732680366e-05, 'samples': 22401792, 'steps': 116675, 'loss/train': 1.2698678970336914} 11/07/2021 13:37:44 - INFO - __main__ - Step 116677: {'lr': 5.998124817979569e-05, 'samples': 22401984, 'steps': 116676, 'loss/train': 1.8644427061080933} 11/07/2021 13:37:45 - INFO - __main__ - Step 116678: {'lr': 5.997779971253054e-05, 'samples': 22402176, 'steps': 116677, 'loss/train': 1.6116832494735718} 11/07/2021 13:37:45 - INFO - __main__ - Step 116679: {'lr': 5.997435133088652e-05, 'samples': 22402368, 'steps': 116678, 'loss/train': 1.191458821296692} 11/07/2021 13:37:46 - INFO - __main__ - Step 116680: {'lr': 5.9970903034865076e-05, 'samples': 22402560, 'steps': 116679, 'loss/train': 1.4094101190567017} 11/07/2021 13:37:47 - INFO - __main__ - Step 116681: {'lr': 5.996745482446778e-05, 'samples': 22402752, 'steps': 116680, 'loss/train': 1.0855708122253418} 11/07/2021 13:37:47 - INFO - __main__ - Step 116682: {'lr': 5.9964006699696235e-05, 'samples': 22402944, 'steps': 116681, 'loss/train': 1.684455394744873} 11/07/2021 13:37:47 - INFO - __main__ - Step 116683: {'lr': 5.996055866055197e-05, 'samples': 22403136, 'steps': 116682, 'loss/train': 1.2580845355987549} 11/07/2021 13:37:48 - INFO - __main__ - Step 116684: {'lr': 5.995711070703658e-05, 'samples': 22403328, 'steps': 116683, 'loss/train': 1.1295098066329956} 11/07/2021 13:37:48 - INFO - __main__ - Step 116685: {'lr': 5.995366283915155e-05, 'samples': 22403520, 'steps': 116684, 'loss/train': 1.494374394416809} 11/07/2021 13:37:49 - INFO - __main__ - Step 116686: {'lr': 5.995021505689846e-05, 'samples': 22403712, 'steps': 116685, 'loss/train': 1.431885838508606} 11/07/2021 13:37:49 - INFO - __main__ - Step 116687: {'lr': 5.9946767360278874e-05, 'samples': 22403904, 'steps': 116686, 'loss/train': 0.814640998840332} 11/07/2021 13:37:50 - INFO - __main__ - Step 116688: {'lr': 5.994331974929434e-05, 'samples': 22404096, 'steps': 116687, 'loss/train': 1.5098339319229126} 11/07/2021 13:37:50 - INFO - __main__ - Step 116689: {'lr': 5.993987222394645e-05, 'samples': 22404288, 'steps': 116688, 'loss/train': 1.3214417695999146} 11/07/2021 13:37:50 - INFO - __main__ - Step 116690: {'lr': 5.993642478423669e-05, 'samples': 22404480, 'steps': 116689, 'loss/train': 0.8475300669670105} 11/07/2021 13:37:51 - INFO - __main__ - Step 116691: {'lr': 5.993297743016665e-05, 'samples': 22404672, 'steps': 116690, 'loss/train': 1.5755337476730347} 11/07/2021 13:37:52 - INFO - __main__ - Step 116692: {'lr': 5.992953016173794e-05, 'samples': 22404864, 'steps': 116691, 'loss/train': 1.8724544048309326} 11/07/2021 13:37:52 - INFO - __main__ - Step 116693: {'lr': 5.992608297895199e-05, 'samples': 22405056, 'steps': 116692, 'loss/train': 1.145453929901123} 11/07/2021 13:37:53 - INFO - __main__ - Step 116694: {'lr': 5.99226358818104e-05, 'samples': 22405248, 'steps': 116693, 'loss/train': 1.3387540578842163} 11/07/2021 13:37:53 - INFO - __main__ - Step 116695: {'lr': 5.9919188870314753e-05, 'samples': 22405440, 'steps': 116694, 'loss/train': 1.7498581409454346} 11/07/2021 13:37:53 - INFO - __main__ - Step 116696: {'lr': 5.991574194446658e-05, 'samples': 22405632, 'steps': 116695, 'loss/train': 1.053209900856018} 11/07/2021 13:37:54 - INFO - __main__ - Step 116697: {'lr': 5.991229510426744e-05, 'samples': 22405824, 'steps': 116696, 'loss/train': 1.156272292137146} 11/07/2021 13:37:55 - INFO - __main__ - Step 116698: {'lr': 5.990884834971888e-05, 'samples': 22406016, 'steps': 116697, 'loss/train': 1.1956918239593506} 11/07/2021 13:37:55 - INFO - __main__ - Step 116699: {'lr': 5.990540168082248e-05, 'samples': 22406208, 'steps': 116698, 'loss/train': 1.3823693990707397} 11/07/2021 13:37:55 - INFO - __main__ - Step 116700: {'lr': 5.990195509757976e-05, 'samples': 22406400, 'steps': 116699, 'loss/train': 1.2660558223724365} 11/07/2021 13:37:56 - INFO - __main__ - Step 116701: {'lr': 5.989850859999227e-05, 'samples': 22406592, 'steps': 116700, 'loss/train': 1.2822380065917969} 11/07/2021 13:37:57 - INFO - __main__ - Step 116702: {'lr': 5.9895062188061594e-05, 'samples': 22406784, 'steps': 116701, 'loss/train': 0.9021138548851013} 11/07/2021 13:37:57 - INFO - __main__ - Step 116703: {'lr': 5.9891615861789286e-05, 'samples': 22406976, 'steps': 116702, 'loss/train': 0.9281202554702759} 11/07/2021 13:37:57 - INFO - __main__ - Step 116704: {'lr': 5.988816962117685e-05, 'samples': 22407168, 'steps': 116703, 'loss/train': 1.445176124572754} 11/07/2021 13:37:58 - INFO - __main__ - Step 116705: {'lr': 5.9884723466225897e-05, 'samples': 22407360, 'steps': 116704, 'loss/train': 1.9373561143875122} 11/07/2021 13:37:58 - INFO - __main__ - Step 116706: {'lr': 5.988127739693802e-05, 'samples': 22407552, 'steps': 116705, 'loss/train': 1.2643765211105347} 11/07/2021 13:37:59 - INFO - __main__ - Step 116707: {'lr': 5.987783141331463e-05, 'samples': 22407744, 'steps': 116706, 'loss/train': 1.2757359743118286} 11/07/2021 13:37:59 - INFO - __main__ - Step 116708: {'lr': 5.9874385515357345e-05, 'samples': 22407936, 'steps': 116707, 'loss/train': 1.3494162559509277} 11/07/2021 13:38:00 - INFO - __main__ - Step 116709: {'lr': 5.9870939703067754e-05, 'samples': 22408128, 'steps': 116708, 'loss/train': 1.5739787817001343} 11/07/2021 13:38:00 - INFO - __main__ - Step 116710: {'lr': 5.986749397644736e-05, 'samples': 22408320, 'steps': 116709, 'loss/train': 1.3440428972244263} 11/07/2021 13:38:01 - INFO - __main__ - Step 116711: {'lr': 5.986404833549774e-05, 'samples': 22408512, 'steps': 116710, 'loss/train': 1.3858757019042969} 11/07/2021 13:38:02 - INFO - __main__ - Step 116712: {'lr': 5.986060278022046e-05, 'samples': 22408704, 'steps': 116711, 'loss/train': 1.1881946325302124} 11/07/2021 13:38:02 - INFO - __main__ - Step 116713: {'lr': 5.9857157310617054e-05, 'samples': 22408896, 'steps': 116712, 'loss/train': 1.2442477941513062} 11/07/2021 13:38:03 - INFO - __main__ - Step 116714: {'lr': 5.985371192668907e-05, 'samples': 22409088, 'steps': 116713, 'loss/train': 1.4273639917373657} 11/07/2021 13:38:03 - INFO - __main__ - Step 116715: {'lr': 5.9850266628438096e-05, 'samples': 22409280, 'steps': 116714, 'loss/train': 1.115591049194336} 11/07/2021 13:38:03 - INFO - __main__ - Step 116716: {'lr': 5.984682141586561e-05, 'samples': 22409472, 'steps': 116715, 'loss/train': 1.378233551979065} 11/07/2021 13:38:04 - INFO - __main__ - Step 116717: {'lr': 5.9843376288973236e-05, 'samples': 22409664, 'steps': 116716, 'loss/train': 1.1766180992126465} 11/07/2021 13:38:05 - INFO - __main__ - Step 116718: {'lr': 5.983993124776252e-05, 'samples': 22409856, 'steps': 116717, 'loss/train': 0.9057744741439819} 11/07/2021 13:38:05 - INFO - __main__ - Step 116719: {'lr': 5.9836486292235065e-05, 'samples': 22410048, 'steps': 116718, 'loss/train': 0.6997603178024292} 11/07/2021 13:38:05 - INFO - __main__ - Step 116720: {'lr': 5.9833041422392264e-05, 'samples': 22410240, 'steps': 116719, 'loss/train': 1.2931599617004395} 11/07/2021 13:38:06 - INFO - __main__ - Step 116721: {'lr': 5.982959663823576e-05, 'samples': 22410432, 'steps': 116720, 'loss/train': 1.0887788534164429} 11/07/2021 13:38:06 - INFO - __main__ - Step 116722: {'lr': 5.982615193976712e-05, 'samples': 22410624, 'steps': 116721, 'loss/train': 0.990706741809845} 11/07/2021 13:38:07 - INFO - __main__ - Step 116723: {'lr': 5.9822707326987886e-05, 'samples': 22410816, 'steps': 116722, 'loss/train': 1.7304997444152832} 11/07/2021 13:38:08 - INFO - __main__ - Step 116724: {'lr': 5.9819262799899576e-05, 'samples': 22411008, 'steps': 116723, 'loss/train': 1.278717279434204} 11/07/2021 13:38:08 - INFO - __main__ - Step 116725: {'lr': 5.98158183585038e-05, 'samples': 22411200, 'steps': 116724, 'loss/train': 1.468743920326233} 11/07/2021 13:38:08 - INFO - __main__ - Step 116726: {'lr': 5.9812374002802065e-05, 'samples': 22411392, 'steps': 116725, 'loss/train': 0.9963787198066711} 11/07/2021 13:38:09 - INFO - __main__ - Step 116727: {'lr': 5.980892973279595e-05, 'samples': 22411584, 'steps': 116726, 'loss/train': 1.2682814598083496} 11/07/2021 13:38:09 - INFO - __main__ - Step 116728: {'lr': 5.9805485548487e-05, 'samples': 22411776, 'steps': 116727, 'loss/train': 0.8130848407745361} 11/07/2021 13:38:10 - INFO - __main__ - Step 116729: {'lr': 5.980204144987675e-05, 'samples': 22411968, 'steps': 116728, 'loss/train': 1.259871244430542} 11/07/2021 13:38:11 - INFO - __main__ - Step 116730: {'lr': 5.979859743696678e-05, 'samples': 22412160, 'steps': 116729, 'loss/train': 1.4541774988174438} 11/07/2021 13:38:11 - INFO - __main__ - Step 116731: {'lr': 5.9795153509758615e-05, 'samples': 22412352, 'steps': 116730, 'loss/train': 1.6669732332229614} 11/07/2021 13:38:11 - INFO - __main__ - Step 116732: {'lr': 5.9791709668253894e-05, 'samples': 22412544, 'steps': 116731, 'loss/train': 1.0474529266357422} 11/07/2021 13:38:12 - INFO - __main__ - Step 116733: {'lr': 5.978826591245401e-05, 'samples': 22412736, 'steps': 116732, 'loss/train': 1.0963674783706665} 11/07/2021 13:38:13 - INFO - __main__ - Step 116734: {'lr': 5.978482224236062e-05, 'samples': 22412928, 'steps': 116733, 'loss/train': 1.652567982673645} 11/07/2021 13:38:13 - INFO - __main__ - Step 116735: {'lr': 5.978137865797523e-05, 'samples': 22413120, 'steps': 116734, 'loss/train': 0.3407343924045563} 11/07/2021 13:38:13 - INFO - __main__ - Step 116736: {'lr': 5.977793515929944e-05, 'samples': 22413312, 'steps': 116735, 'loss/train': 2.141791343688965} 11/07/2021 13:38:14 - INFO - __main__ - Step 116737: {'lr': 5.977449174633476e-05, 'samples': 22413504, 'steps': 116736, 'loss/train': 1.0440773963928223} 11/07/2021 13:38:14 - INFO - __main__ - Step 116738: {'lr': 5.977104841908276e-05, 'samples': 22413696, 'steps': 116737, 'loss/train': 1.436188817024231} 11/07/2021 13:38:15 - INFO - __main__ - Step 116739: {'lr': 5.976760517754501e-05, 'samples': 22413888, 'steps': 116738, 'loss/train': 1.3841485977172852} 11/07/2021 13:38:16 - INFO - __main__ - Step 116740: {'lr': 5.976416202172302e-05, 'samples': 22414080, 'steps': 116739, 'loss/train': 0.9253873229026794} 11/07/2021 13:38:16 - INFO - __main__ - Step 116741: {'lr': 5.9760718951618356e-05, 'samples': 22414272, 'steps': 116740, 'loss/train': 1.6720669269561768} 11/07/2021 13:38:16 - INFO - __main__ - Step 116742: {'lr': 5.97572759672326e-05, 'samples': 22414464, 'steps': 116741, 'loss/train': 1.2981923818588257} 11/07/2021 13:38:17 - INFO - __main__ - Step 116743: {'lr': 5.975383306856727e-05, 'samples': 22414656, 'steps': 116742, 'loss/train': 1.5277860164642334} 11/07/2021 13:38:18 - INFO - __main__ - Step 116744: {'lr': 5.975039025562393e-05, 'samples': 22414848, 'steps': 116743, 'loss/train': 1.0506770610809326} 11/07/2021 13:38:18 - INFO - __main__ - Step 116745: {'lr': 5.974694752840412e-05, 'samples': 22415040, 'steps': 116744, 'loss/train': 2.352510690689087} 11/07/2021 13:38:18 - INFO - __main__ - Step 116746: {'lr': 5.974350488690947e-05, 'samples': 22415232, 'steps': 116745, 'loss/train': 1.3092511892318726} 11/07/2021 13:38:19 - INFO - __main__ - Step 116747: {'lr': 5.97400623311414e-05, 'samples': 22415424, 'steps': 116746, 'loss/train': 1.3490482568740845} 11/07/2021 13:38:19 - INFO - __main__ - Step 116748: {'lr': 5.973661986110151e-05, 'samples': 22415616, 'steps': 116747, 'loss/train': 1.052933931350708} 11/07/2021 13:38:20 - INFO - __main__ - Step 116749: {'lr': 5.9733177476791386e-05, 'samples': 22415808, 'steps': 116748, 'loss/train': 1.99933660030365} 11/07/2021 13:38:20 - INFO - __main__ - Step 116750: {'lr': 5.9729735178212535e-05, 'samples': 22416000, 'steps': 116749, 'loss/train': 1.39078950881958} 11/07/2021 13:38:21 - INFO - __main__ - Step 116751: {'lr': 5.972629296536655e-05, 'samples': 22416192, 'steps': 116750, 'loss/train': 1.0497586727142334} 11/07/2021 13:38:21 - INFO - __main__ - Step 116752: {'lr': 5.9722850838254935e-05, 'samples': 22416384, 'steps': 116751, 'loss/train': 1.4858787059783936} 11/07/2021 13:38:21 - INFO - __main__ - Step 116753: {'lr': 5.971940879687929e-05, 'samples': 22416576, 'steps': 116752, 'loss/train': 1.4408199787139893} 11/07/2021 13:38:22 - INFO - __main__ - Step 116754: {'lr': 5.9715966841241115e-05, 'samples': 22416768, 'steps': 116753, 'loss/train': 1.4610265493392944} 11/07/2021 13:38:23 - INFO - __main__ - Step 116755: {'lr': 5.9712524971342026e-05, 'samples': 22416960, 'steps': 116754, 'loss/train': 1.2351781129837036} 11/07/2021 13:38:23 - INFO - __main__ - Step 116756: {'lr': 5.9709083187183515e-05, 'samples': 22417152, 'steps': 116755, 'loss/train': 1.4229363203048706} 11/07/2021 13:38:23 - INFO - __main__ - Step 116757: {'lr': 5.970564148876714e-05, 'samples': 22417344, 'steps': 116756, 'loss/train': 1.1473965644836426} 11/07/2021 13:38:24 - INFO - __main__ - Step 116758: {'lr': 5.970219987609449e-05, 'samples': 22417536, 'steps': 116757, 'loss/train': 1.4551717042922974} 11/07/2021 13:38:24 - INFO - __main__ - Step 116759: {'lr': 5.969875834916716e-05, 'samples': 22417728, 'steps': 116758, 'loss/train': 1.8710750341415405} 11/07/2021 13:38:25 - INFO - __main__ - Step 116760: {'lr': 5.9695316907986544e-05, 'samples': 22417920, 'steps': 116759, 'loss/train': 1.3565317392349243} 11/07/2021 13:38:26 - INFO - __main__ - Step 116761: {'lr': 5.9691875552554315e-05, 'samples': 22418112, 'steps': 116760, 'loss/train': 1.413757562637329} 11/07/2021 13:38:26 - INFO - __main__ - Step 116762: {'lr': 5.968843428287196e-05, 'samples': 22418304, 'steps': 116761, 'loss/train': 1.8395074605941772} 11/07/2021 13:38:26 - INFO - __main__ - Step 116763: {'lr': 5.968499309894107e-05, 'samples': 22418496, 'steps': 116762, 'loss/train': 1.4563276767730713} 11/07/2021 13:38:27 - INFO - __main__ - Step 116764: {'lr': 5.9681552000763194e-05, 'samples': 22418688, 'steps': 116763, 'loss/train': 1.2822551727294922} 11/07/2021 13:38:28 - INFO - __main__ - Step 116765: {'lr': 5.9678110988339864e-05, 'samples': 22418880, 'steps': 116764, 'loss/train': 1.4821374416351318} 11/07/2021 13:38:28 - INFO - __main__ - Step 116766: {'lr': 5.9674670061672656e-05, 'samples': 22419072, 'steps': 116765, 'loss/train': 1.621544361114502} 11/07/2021 13:38:28 - INFO - __main__ - Step 116767: {'lr': 5.967122922076307e-05, 'samples': 22419264, 'steps': 116766, 'loss/train': 1.1141895055770874} 11/07/2021 13:38:29 - INFO - __main__ - Step 116768: {'lr': 5.9667788465612716e-05, 'samples': 22419456, 'steps': 116767, 'loss/train': 1.5722562074661255} 11/07/2021 13:38:29 - INFO - __main__ - Step 116769: {'lr': 5.9664347796223126e-05, 'samples': 22419648, 'steps': 116768, 'loss/train': 1.3990607261657715} 11/07/2021 13:38:30 - INFO - __main__ - Step 116770: {'lr': 5.9660907212595846e-05, 'samples': 22419840, 'steps': 116769, 'loss/train': 1.4724912643432617} 11/07/2021 13:38:31 - INFO - __main__ - Step 116771: {'lr': 5.965746671473241e-05, 'samples': 22420032, 'steps': 116770, 'loss/train': 1.0149190425872803} 11/07/2021 13:38:31 - INFO - __main__ - Step 116772: {'lr': 5.965402630263436e-05, 'samples': 22420224, 'steps': 116771, 'loss/train': 0.6564897894859314} 11/07/2021 13:38:31 - INFO - __main__ - Step 116773: {'lr': 5.965058597630338e-05, 'samples': 22420416, 'steps': 116772, 'loss/train': 1.2738174200057983} 11/07/2021 13:38:32 - INFO - __main__ - Step 116774: {'lr': 5.964714573574082e-05, 'samples': 22420608, 'steps': 116773, 'loss/train': 1.7703003883361816} 11/07/2021 13:38:33 - INFO - __main__ - Step 116775: {'lr': 5.964370558094831e-05, 'samples': 22420800, 'steps': 116774, 'loss/train': 1.5777440071105957} 11/07/2021 13:38:33 - INFO - __main__ - Step 116776: {'lr': 5.9640265511927445e-05, 'samples': 22420992, 'steps': 116775, 'loss/train': 1.4156686067581177} 11/07/2021 13:38:34 - INFO - __main__ - Step 116777: {'lr': 5.9636825528679686e-05, 'samples': 22421184, 'steps': 116776, 'loss/train': 1.7907088994979858} 11/07/2021 13:38:34 - INFO - __main__ - Step 116778: {'lr': 5.9633385631206685e-05, 'samples': 22421376, 'steps': 116777, 'loss/train': 1.194462776184082} 11/07/2021 13:38:34 - INFO - __main__ - Step 116779: {'lr': 5.9629945819509926e-05, 'samples': 22421568, 'steps': 116778, 'loss/train': 1.1448042392730713} 11/07/2021 13:38:35 - INFO - __main__ - Step 116780: {'lr': 5.9626506093590966e-05, 'samples': 22421760, 'steps': 116779, 'loss/train': 2.86967134475708} 11/07/2021 13:38:36 - INFO - __main__ - Step 116781: {'lr': 5.9623066453451365e-05, 'samples': 22421952, 'steps': 116780, 'loss/train': 0.9932544231414795} 11/07/2021 13:38:36 - INFO - __main__ - Step 116782: {'lr': 5.961962689909267e-05, 'samples': 22422144, 'steps': 116781, 'loss/train': 1.575187087059021} 11/07/2021 13:38:36 - INFO - __main__ - Step 116783: {'lr': 5.961618743051645e-05, 'samples': 22422336, 'steps': 116782, 'loss/train': 1.6247214078903198} 11/07/2021 13:38:37 - INFO - __main__ - Step 116784: {'lr': 5.961274804772423e-05, 'samples': 22422528, 'steps': 116783, 'loss/train': 1.1673212051391602} 11/07/2021 13:38:37 - INFO - __main__ - Step 116785: {'lr': 5.960930875071757e-05, 'samples': 22422720, 'steps': 116784, 'loss/train': 1.512290120124817} 11/07/2021 13:38:38 - INFO - __main__ - Step 116786: {'lr': 5.960586953949809e-05, 'samples': 22422912, 'steps': 116785, 'loss/train': 1.0729560852050781} 11/07/2021 13:38:39 - INFO - __main__ - Step 116787: {'lr': 5.960243041406718e-05, 'samples': 22423104, 'steps': 116786, 'loss/train': 1.4019328355789185} 11/07/2021 13:38:39 - INFO - __main__ - Step 116788: {'lr': 5.959899137442648e-05, 'samples': 22423296, 'steps': 116787, 'loss/train': 0.9320028424263} 11/07/2021 13:38:39 - INFO - __main__ - Step 116789: {'lr': 5.9595552420577545e-05, 'samples': 22423488, 'steps': 116788, 'loss/train': 1.4408038854599} 11/07/2021 13:38:40 - INFO - __main__ - Step 116790: {'lr': 5.959211355252192e-05, 'samples': 22423680, 'steps': 116789, 'loss/train': 1.4254889488220215} 11/07/2021 13:38:41 - INFO - __main__ - Step 116791: {'lr': 5.9588674770261143e-05, 'samples': 22423872, 'steps': 116790, 'loss/train': 1.4709008932113647} 11/07/2021 13:38:41 - INFO - __main__ - Step 116792: {'lr': 5.958523607379679e-05, 'samples': 22424064, 'steps': 116791, 'loss/train': 1.096118688583374} 11/07/2021 13:38:41 - INFO - __main__ - Step 116793: {'lr': 5.958179746313036e-05, 'samples': 22424256, 'steps': 116792, 'loss/train': 0.8072879314422607} 11/07/2021 13:38:42 - INFO - __main__ - Step 116794: {'lr': 5.957835893826347e-05, 'samples': 22424448, 'steps': 116793, 'loss/train': 1.2635115385055542} 11/07/2021 13:38:42 - INFO - __main__ - Step 116795: {'lr': 5.9574920499197606e-05, 'samples': 22424640, 'steps': 116794, 'loss/train': 1.4564729928970337} 11/07/2021 13:38:43 - INFO - __main__ - Step 116796: {'lr': 5.957148214593436e-05, 'samples': 22424832, 'steps': 116795, 'loss/train': 1.3120005130767822} 11/07/2021 13:38:44 - INFO - __main__ - Step 116797: {'lr': 5.956804387847525e-05, 'samples': 22425024, 'steps': 116796, 'loss/train': 1.7325270175933838} 11/07/2021 13:38:44 - INFO - __main__ - Step 116798: {'lr': 5.956460569682184e-05, 'samples': 22425216, 'steps': 116797, 'loss/train': 1.7695075273513794} 11/07/2021 13:38:45 - INFO - __main__ - Step 116799: {'lr': 5.956116760097569e-05, 'samples': 22425408, 'steps': 116798, 'loss/train': 1.646373987197876} 11/07/2021 13:38:45 - INFO - __main__ - Step 116800: {'lr': 5.955772959093842e-05, 'samples': 22425600, 'steps': 116799, 'loss/train': 1.5329030752182007} 11/07/2021 13:38:45 - INFO - __main__ - Step 116801: {'lr': 5.955429166671139e-05, 'samples': 22425792, 'steps': 116800, 'loss/train': 0.6172065138816833} 11/07/2021 13:38:46 - INFO - __main__ - Step 116802: {'lr': 5.955085382829631e-05, 'samples': 22425984, 'steps': 116801, 'loss/train': 1.4416660070419312} 11/07/2021 13:38:47 - INFO - __main__ - Step 116803: {'lr': 5.954741607569464e-05, 'samples': 22426176, 'steps': 116802, 'loss/train': 1.5324610471725464} 11/07/2021 13:38:47 - INFO - __main__ - Step 116804: {'lr': 5.9543978408907965e-05, 'samples': 22426368, 'steps': 116803, 'loss/train': 1.2646276950836182} 11/07/2021 13:38:47 - INFO - __main__ - Step 116805: {'lr': 5.9540540827937836e-05, 'samples': 22426560, 'steps': 116804, 'loss/train': 1.5121564865112305} 11/07/2021 13:38:48 - INFO - __main__ - Step 116806: {'lr': 5.9537103332785805e-05, 'samples': 22426752, 'steps': 116805, 'loss/train': 1.4942171573638916} 11/07/2021 13:38:49 - INFO - __main__ - Step 116807: {'lr': 5.953366592345344e-05, 'samples': 22426944, 'steps': 116806, 'loss/train': 1.3105621337890625} 11/07/2021 13:38:49 - INFO - __main__ - Step 116808: {'lr': 5.9530228599942227e-05, 'samples': 22427136, 'steps': 116807, 'loss/train': 0.7734687924385071} 11/07/2021 13:38:49 - INFO - __main__ - Step 116809: {'lr': 5.952679136225378e-05, 'samples': 22427328, 'steps': 116808, 'loss/train': 1.0284769535064697} 11/07/2021 13:38:50 - INFO - __main__ - Step 116810: {'lr': 5.952335421038962e-05, 'samples': 22427520, 'steps': 116809, 'loss/train': 1.6227571964263916} 11/07/2021 13:38:50 - INFO - __main__ - Step 116811: {'lr': 5.9519917144351286e-05, 'samples': 22427712, 'steps': 116810, 'loss/train': 1.490281343460083} 11/07/2021 13:38:51 - INFO - __main__ - Step 116812: {'lr': 5.951648016414035e-05, 'samples': 22427904, 'steps': 116811, 'loss/train': 1.1357473134994507} 11/07/2021 13:38:52 - INFO - __main__ - Step 116813: {'lr': 5.95130432697584e-05, 'samples': 22428096, 'steps': 116812, 'loss/train': 1.5304744243621826} 11/07/2021 13:38:52 - INFO - __main__ - Step 116814: {'lr': 5.95096064612069e-05, 'samples': 22428288, 'steps': 116813, 'loss/train': 1.0417311191558838} 11/07/2021 13:38:52 - INFO - __main__ - Step 116815: {'lr': 5.950616973848738e-05, 'samples': 22428480, 'steps': 116814, 'loss/train': 1.0505366325378418} 11/07/2021 13:38:53 - INFO - __main__ - Step 116816: {'lr': 5.950273310160148e-05, 'samples': 22428672, 'steps': 116815, 'loss/train': 1.3269561529159546} 11/07/2021 13:38:53 - INFO - __main__ - Step 116817: {'lr': 5.94992965505507e-05, 'samples': 22428864, 'steps': 116816, 'loss/train': 1.1395021677017212} 11/07/2021 13:38:54 - INFO - __main__ - Step 116818: {'lr': 5.949586008533658e-05, 'samples': 22429056, 'steps': 116817, 'loss/train': 0.8451865315437317} 11/07/2021 13:38:54 - INFO - __main__ - Step 116819: {'lr': 5.9492423705960724e-05, 'samples': 22429248, 'steps': 116818, 'loss/train': 1.3880770206451416} 11/07/2021 13:38:55 - INFO - __main__ - Step 116820: {'lr': 5.9488987412424615e-05, 'samples': 22429440, 'steps': 116819, 'loss/train': 1.0350655317306519} 11/07/2021 13:38:55 - INFO - __main__ - Step 116821: {'lr': 5.948555120472981e-05, 'samples': 22429632, 'steps': 116820, 'loss/train': 1.4389923810958862} 11/07/2021 13:38:55 - INFO - __main__ - Step 116822: {'lr': 5.9482115082877903e-05, 'samples': 22429824, 'steps': 116821, 'loss/train': 1.8188037872314453} 11/07/2021 13:38:56 - INFO - __main__ - Step 116823: {'lr': 5.9478679046870405e-05, 'samples': 22430016, 'steps': 116822, 'loss/train': 1.190621256828308} 11/07/2021 13:38:57 - INFO - __main__ - Step 116824: {'lr': 5.9475243096708876e-05, 'samples': 22430208, 'steps': 116823, 'loss/train': 1.0090652704238892} 11/07/2021 13:38:57 - INFO - __main__ - Step 116825: {'lr': 5.947180723239487e-05, 'samples': 22430400, 'steps': 116824, 'loss/train': 1.165360450744629} 11/07/2021 13:38:57 - INFO - __main__ - Step 116826: {'lr': 5.9468371453929946e-05, 'samples': 22430592, 'steps': 116825, 'loss/train': 0.9129101037979126} 11/07/2021 13:38:58 - INFO - __main__ - Step 116827: {'lr': 5.9464935761315676e-05, 'samples': 22430784, 'steps': 116826, 'loss/train': 0.7013183236122131} 11/07/2021 13:38:59 - INFO - __main__ - Step 116828: {'lr': 5.946150015455348e-05, 'samples': 22430976, 'steps': 116827, 'loss/train': 0.6376263499259949} 11/07/2021 13:38:59 - INFO - __main__ - Step 116829: {'lr': 5.945806463364503e-05, 'samples': 22431168, 'steps': 116828, 'loss/train': 1.1669162511825562} 11/07/2021 13:39:00 - INFO - __main__ - Step 116830: {'lr': 5.945462919859182e-05, 'samples': 22431360, 'steps': 116829, 'loss/train': 1.2509427070617676} 11/07/2021 13:39:00 - INFO - __main__ - Step 116831: {'lr': 5.94511938493954e-05, 'samples': 22431552, 'steps': 116830, 'loss/train': 1.259737491607666} 11/07/2021 13:39:00 - INFO - __main__ - Step 116832: {'lr': 5.944775858605736e-05, 'samples': 22431744, 'steps': 116831, 'loss/train': 1.4016131162643433} 11/07/2021 13:39:01 - INFO - __main__ - Step 116833: {'lr': 5.9444323408579196e-05, 'samples': 22431936, 'steps': 116832, 'loss/train': 0.5159794092178345} 11/07/2021 13:39:02 - INFO - __main__ - Step 116834: {'lr': 5.944088831696248e-05, 'samples': 22432128, 'steps': 116833, 'loss/train': 1.5677441358566284} 11/07/2021 13:39:02 - INFO - __main__ - Step 116835: {'lr': 5.9437453311208754e-05, 'samples': 22432320, 'steps': 116834, 'loss/train': 1.04087495803833} 11/07/2021 13:39:02 - INFO - __main__ - Step 116836: {'lr': 5.9434018391319596e-05, 'samples': 22432512, 'steps': 116835, 'loss/train': 1.2499589920043945} 11/07/2021 13:39:03 - INFO - __main__ - Step 116837: {'lr': 5.9430583557296525e-05, 'samples': 22432704, 'steps': 116836, 'loss/train': 0.9035962224006653} 11/07/2021 13:39:04 - INFO - __main__ - Step 116838: {'lr': 5.9427148809141074e-05, 'samples': 22432896, 'steps': 116837, 'loss/train': 2.050753355026245} 11/07/2021 13:39:04 - INFO - __main__ - Step 116839: {'lr': 5.942371414685482e-05, 'samples': 22433088, 'steps': 116838, 'loss/train': 1.6259560585021973} 11/07/2021 13:39:05 - INFO - __main__ - Step 116840: {'lr': 5.942027957043935e-05, 'samples': 22433280, 'steps': 116839, 'loss/train': 1.2048635482788086} 11/07/2021 13:39:05 - INFO - __main__ - Step 116841: {'lr': 5.941684507989611e-05, 'samples': 22433472, 'steps': 116840, 'loss/train': 1.028533697128296} 11/07/2021 13:39:05 - INFO - __main__ - Step 116842: {'lr': 5.941341067522671e-05, 'samples': 22433664, 'steps': 116841, 'loss/train': 0.9564761519432068} 11/07/2021 13:39:06 - INFO - __main__ - Step 116843: {'lr': 5.940997635643267e-05, 'samples': 22433856, 'steps': 116842, 'loss/train': 1.2999277114868164} 11/07/2021 13:39:07 - INFO - __main__ - Step 116844: {'lr': 5.940654212351557e-05, 'samples': 22434048, 'steps': 116843, 'loss/train': 1.3635642528533936} 11/07/2021 13:39:07 - INFO - __main__ - Step 116845: {'lr': 5.940310797647691e-05, 'samples': 22434240, 'steps': 116844, 'loss/train': 1.604210615158081} 11/07/2021 13:39:08 - INFO - __main__ - Step 116846: {'lr': 5.939967391531831e-05, 'samples': 22434432, 'steps': 116845, 'loss/train': 1.2626765966415405} 11/07/2021 13:39:08 - INFO - __main__ - Step 116847: {'lr': 5.939623994004123e-05, 'samples': 22434624, 'steps': 116846, 'loss/train': 1.4560718536376953} 11/07/2021 13:39:08 - INFO - __main__ - Step 116848: {'lr': 5.9392806050647286e-05, 'samples': 22434816, 'steps': 116847, 'loss/train': 0.9800010919570923} 11/07/2021 13:39:10 - INFO - __main__ - Step 116849: {'lr': 5.9389372247138004e-05, 'samples': 22435008, 'steps': 116848, 'loss/train': 0.34154075384140015} 11/07/2021 13:39:10 - INFO - __main__ - Step 116850: {'lr': 5.938593852951493e-05, 'samples': 22435200, 'steps': 116849, 'loss/train': 1.4393444061279297} 11/07/2021 13:39:10 - INFO - __main__ - Step 116851: {'lr': 5.9382504897779606e-05, 'samples': 22435392, 'steps': 116850, 'loss/train': 1.121929407119751} 11/07/2021 13:39:11 - INFO - __main__ - Step 116852: {'lr': 5.93790713519336e-05, 'samples': 22435584, 'steps': 116851, 'loss/train': 1.340136170387268} 11/07/2021 13:39:11 - INFO - __main__ - Step 116853: {'lr': 5.93756378919785e-05, 'samples': 22435776, 'steps': 116852, 'loss/train': 1.1308289766311646} 11/07/2021 13:39:12 - INFO - __main__ - Step 116854: {'lr': 5.9372204517915725e-05, 'samples': 22435968, 'steps': 116853, 'loss/train': 1.1020387411117554} 11/07/2021 13:39:12 - INFO - __main__ - Step 116855: {'lr': 5.936877122974688e-05, 'samples': 22436160, 'steps': 116854, 'loss/train': 1.1626801490783691} 11/07/2021 13:39:13 - INFO - __main__ - Step 116856: {'lr': 5.9365338027473544e-05, 'samples': 22436352, 'steps': 116855, 'loss/train': 1.2233648300170898} 11/07/2021 13:39:13 - INFO - __main__ - Step 116857: {'lr': 5.936190491109725e-05, 'samples': 22436544, 'steps': 116856, 'loss/train': 1.2484411001205444} 11/07/2021 13:39:13 - INFO - __main__ - Step 116858: {'lr': 5.9358471880619516e-05, 'samples': 22436736, 'steps': 116857, 'loss/train': 1.1138628721237183} 11/07/2021 13:39:15 - INFO - __main__ - Step 116859: {'lr': 5.935503893604194e-05, 'samples': 22436928, 'steps': 116858, 'loss/train': 1.5049413442611694} 11/07/2021 13:39:15 - INFO - __main__ - Step 116860: {'lr': 5.9351606077366e-05, 'samples': 22437120, 'steps': 116859, 'loss/train': 1.103046178817749} 11/07/2021 13:39:15 - INFO - __main__ - Step 116861: {'lr': 5.934817330459333e-05, 'samples': 22437312, 'steps': 116860, 'loss/train': 1.3972760438919067} 11/07/2021 13:39:16 - INFO - __main__ - Step 116862: {'lr': 5.9344740617725405e-05, 'samples': 22437504, 'steps': 116861, 'loss/train': 1.1240829229354858} 11/07/2021 13:39:16 - INFO - __main__ - Step 116863: {'lr': 5.934130801676382e-05, 'samples': 22437696, 'steps': 116862, 'loss/train': 1.164346694946289} 11/07/2021 13:39:17 - INFO - __main__ - Step 116864: {'lr': 5.9337875501710074e-05, 'samples': 22437888, 'steps': 116863, 'loss/train': 1.2066315412521362} 11/07/2021 13:39:17 - INFO - __main__ - Step 116865: {'lr': 5.933444307256575e-05, 'samples': 22438080, 'steps': 116864, 'loss/train': 1.2103071212768555} 11/07/2021 13:39:18 - INFO - __main__ - Step 116866: {'lr': 5.933101072933247e-05, 'samples': 22438272, 'steps': 116865, 'loss/train': 1.4130299091339111} 11/07/2021 13:39:18 - INFO - __main__ - Step 116867: {'lr': 5.932757847201159e-05, 'samples': 22438464, 'steps': 116866, 'loss/train': 1.3372911214828491} 11/07/2021 13:39:18 - INFO - __main__ - Step 116868: {'lr': 5.9324146300604786e-05, 'samples': 22438656, 'steps': 116867, 'loss/train': 1.325485110282898} 11/07/2021 13:39:19 - INFO - __main__ - Step 116869: {'lr': 5.932071421511359e-05, 'samples': 22438848, 'steps': 116868, 'loss/train': 0.8360185027122498} 11/07/2021 13:39:20 - INFO - __main__ - Step 116870: {'lr': 5.93172822155395e-05, 'samples': 22439040, 'steps': 116869, 'loss/train': 1.0260149240493774} 11/07/2021 13:39:20 - INFO - __main__ - Step 116871: {'lr': 5.931385030188413e-05, 'samples': 22439232, 'steps': 116870, 'loss/train': 1.2448921203613281} 11/07/2021 13:39:20 - INFO - __main__ - Step 116872: {'lr': 5.9310418474149e-05, 'samples': 22439424, 'steps': 116871, 'loss/train': 1.189630389213562} 11/07/2021 13:39:21 - INFO - __main__ - Step 116873: {'lr': 5.930698673233564e-05, 'samples': 22439616, 'steps': 116872, 'loss/train': 1.3632107973098755} 11/07/2021 13:39:21 - INFO - __main__ - Step 116874: {'lr': 5.930355507644561e-05, 'samples': 22439808, 'steps': 116873, 'loss/train': 1.246156930923462} 11/07/2021 13:39:22 - INFO - __main__ - Step 116875: {'lr': 5.930012350648045e-05, 'samples': 22440000, 'steps': 116874, 'loss/train': 1.1605193614959717} 11/07/2021 13:39:23 - INFO - __main__ - Step 116876: {'lr': 5.92966920224417e-05, 'samples': 22440192, 'steps': 116875, 'loss/train': 1.8459546566009521} 11/07/2021 13:39:23 - INFO - __main__ - Step 116877: {'lr': 5.929326062433102e-05, 'samples': 22440384, 'steps': 116876, 'loss/train': 1.3228240013122559} 11/07/2021 13:39:23 - INFO - __main__ - Step 116878: {'lr': 5.928982931214977e-05, 'samples': 22440576, 'steps': 116877, 'loss/train': 1.5060409307479858} 11/07/2021 13:39:24 - INFO - __main__ - Step 116879: {'lr': 5.9286398085899586e-05, 'samples': 22440768, 'steps': 116878, 'loss/train': 1.3018653392791748} 11/07/2021 13:39:25 - INFO - __main__ - Step 116880: {'lr': 5.928296694558199e-05, 'samples': 22440960, 'steps': 116879, 'loss/train': 1.4334553480148315} 11/07/2021 13:39:25 - INFO - __main__ - Step 116881: {'lr': 5.9279535891198556e-05, 'samples': 22441152, 'steps': 116880, 'loss/train': 1.522463321685791} 11/07/2021 13:39:25 - INFO - __main__ - Step 116882: {'lr': 5.927610492275082e-05, 'samples': 22441344, 'steps': 116881, 'loss/train': 1.8018457889556885} 11/07/2021 13:39:26 - INFO - __main__ - Step 116883: {'lr': 5.9272674040240334e-05, 'samples': 22441536, 'steps': 116882, 'loss/train': 1.5630605220794678} 11/07/2021 13:39:26 - INFO - __main__ - Step 116884: {'lr': 5.926924324366864e-05, 'samples': 22441728, 'steps': 116883, 'loss/train': 1.2809420824050903} 11/07/2021 13:39:27 - INFO - __main__ - Step 116885: {'lr': 5.926581253303728e-05, 'samples': 22441920, 'steps': 116884, 'loss/train': 1.7116782665252686} 11/07/2021 13:39:28 - INFO - __main__ - Step 116886: {'lr': 5.926238190834779e-05, 'samples': 22442112, 'steps': 116885, 'loss/train': 1.215799331665039} 11/07/2021 13:39:28 - INFO - __main__ - Step 116887: {'lr': 5.92589513696018e-05, 'samples': 22442304, 'steps': 116886, 'loss/train': 1.1140109300613403} 11/07/2021 13:39:28 - INFO - __main__ - Step 116888: {'lr': 5.9255520916800724e-05, 'samples': 22442496, 'steps': 116887, 'loss/train': 0.7568415999412537} 11/07/2021 13:39:29 - INFO - __main__ - Step 116889: {'lr': 5.925209054994615e-05, 'samples': 22442688, 'steps': 116888, 'loss/train': 1.547627568244934} 11/07/2021 13:39:30 - INFO - __main__ - Step 116890: {'lr': 5.9248660269039665e-05, 'samples': 22442880, 'steps': 116889, 'loss/train': 1.3108793497085571} 11/07/2021 13:39:30 - INFO - __main__ - Step 116891: {'lr': 5.924523007408278e-05, 'samples': 22443072, 'steps': 116890, 'loss/train': 1.7544938325881958} 11/07/2021 13:39:30 - INFO - __main__ - Step 116892: {'lr': 5.9241799965077034e-05, 'samples': 22443264, 'steps': 116891, 'loss/train': 1.4184551239013672} 11/07/2021 13:39:31 - INFO - __main__ - Step 116893: {'lr': 5.9238369942024e-05, 'samples': 22443456, 'steps': 116892, 'loss/train': 1.2930599451065063} 11/07/2021 13:39:31 - INFO - __main__ - Step 116894: {'lr': 5.923494000492521e-05, 'samples': 22443648, 'steps': 116893, 'loss/train': 0.5854922533035278} 11/07/2021 13:39:32 - INFO - __main__ - Step 116895: {'lr': 5.9231510153782224e-05, 'samples': 22443840, 'steps': 116894, 'loss/train': 1.2835251092910767} 11/07/2021 13:39:32 - INFO - __main__ - Step 116896: {'lr': 5.9228080388596587e-05, 'samples': 22444032, 'steps': 116895, 'loss/train': 1.3380745649337769} 11/07/2021 13:39:33 - INFO - __main__ - Step 116897: {'lr': 5.9224650709369804e-05, 'samples': 22444224, 'steps': 116896, 'loss/train': 0.631722092628479} 11/07/2021 13:39:33 - INFO - __main__ - Step 116898: {'lr': 5.922122111610354e-05, 'samples': 22444416, 'steps': 116897, 'loss/train': 1.3897143602371216} 11/07/2021 13:39:33 - INFO - __main__ - Step 116899: {'lr': 5.9217791608799174e-05, 'samples': 22444608, 'steps': 116898, 'loss/train': 1.2395355701446533} 11/07/2021 13:39:35 - INFO - __main__ - Step 116900: {'lr': 5.921436218745832e-05, 'samples': 22444800, 'steps': 116899, 'loss/train': 1.160714030265808} 11/07/2021 13:39:35 - INFO - __main__ - Step 116901: {'lr': 5.921093285208254e-05, 'samples': 22444992, 'steps': 116900, 'loss/train': 0.9005883932113647} 11/07/2021 13:39:35 - INFO - __main__ - Step 116902: {'lr': 5.9207503602673354e-05, 'samples': 22445184, 'steps': 116901, 'loss/train': 1.435362458229065} 11/07/2021 13:39:36 - INFO - __main__ - Step 116903: {'lr': 5.920407443923234e-05, 'samples': 22445376, 'steps': 116902, 'loss/train': 1.3444674015045166} 11/07/2021 13:39:36 - INFO - __main__ - Step 116904: {'lr': 5.920064536176101e-05, 'samples': 22445568, 'steps': 116903, 'loss/train': 1.0166014432907104} 11/07/2021 13:39:36 - INFO - __main__ - Step 116905: {'lr': 5.919721637026093e-05, 'samples': 22445760, 'steps': 116904, 'loss/train': 1.0996369123458862} 11/07/2021 13:39:38 - INFO - __main__ - Step 116906: {'lr': 5.919378746473364e-05, 'samples': 22445952, 'steps': 116905, 'loss/train': 1.174825668334961} 11/07/2021 13:39:38 - INFO - __main__ - Step 116907: {'lr': 5.9190358645180713e-05, 'samples': 22446144, 'steps': 116906, 'loss/train': 1.2228349447250366} 11/07/2021 13:39:38 - INFO - __main__ - Step 116908: {'lr': 5.9186929911603624e-05, 'samples': 22446336, 'steps': 116907, 'loss/train': 1.4582984447479248} 11/07/2021 13:39:39 - INFO - __main__ - Step 116909: {'lr': 5.918350126400407e-05, 'samples': 22446528, 'steps': 116908, 'loss/train': 1.669284701347351} 11/07/2021 13:39:39 - INFO - __main__ - Step 116910: {'lr': 5.918007270238337e-05, 'samples': 22446720, 'steps': 116909, 'loss/train': 1.4486973285675049} 11/07/2021 13:39:40 - INFO - __main__ - Step 116911: {'lr': 5.917664422674321e-05, 'samples': 22446912, 'steps': 116910, 'loss/train': 0.07019086927175522} 11/07/2021 13:39:41 - INFO - __main__ - Step 116912: {'lr': 5.917321583708513e-05, 'samples': 22447104, 'steps': 116911, 'loss/train': 1.3024142980575562} 11/07/2021 13:39:41 - INFO - __main__ - Step 116913: {'lr': 5.9169787533410625e-05, 'samples': 22447296, 'steps': 116912, 'loss/train': 1.3303683996200562} 11/07/2021 13:39:41 - INFO - __main__ - Step 116914: {'lr': 5.9166359315721256e-05, 'samples': 22447488, 'steps': 116913, 'loss/train': 1.1962223052978516} 11/07/2021 13:39:42 - INFO - __main__ - Step 116915: {'lr': 5.9162931184018606e-05, 'samples': 22447680, 'steps': 116914, 'loss/train': 1.4400520324707031} 11/07/2021 13:39:42 - INFO - __main__ - Step 116916: {'lr': 5.915950313830421e-05, 'samples': 22447872, 'steps': 116915, 'loss/train': 1.392845869064331} 11/07/2021 13:39:43 - INFO - __main__ - Step 116917: {'lr': 5.915607517857957e-05, 'samples': 22448064, 'steps': 116916, 'loss/train': 0.42270979285240173} 11/07/2021 13:39:43 - INFO - __main__ - Step 116918: {'lr': 5.9152647304846265e-05, 'samples': 22448256, 'steps': 116917, 'loss/train': 1.3963953256607056} 11/07/2021 13:39:44 - INFO - __main__ - Step 116919: {'lr': 5.914921951710583e-05, 'samples': 22448448, 'steps': 116918, 'loss/train': 0.994508683681488} 11/07/2021 13:39:44 - INFO - __main__ - Step 116920: {'lr': 5.914579181535981e-05, 'samples': 22448640, 'steps': 116919, 'loss/train': 1.3079020977020264} 11/07/2021 13:39:44 - INFO - __main__ - Step 116921: {'lr': 5.914236419960983e-05, 'samples': 22448832, 'steps': 116920, 'loss/train': 1.838836908340454} 11/07/2021 13:39:45 - INFO - __main__ - Step 116922: {'lr': 5.9138936669857286e-05, 'samples': 22449024, 'steps': 116921, 'loss/train': 1.3381879329681396} 11/07/2021 13:39:46 - INFO - __main__ - Step 116923: {'lr': 5.913550922610378e-05, 'samples': 22449216, 'steps': 116922, 'loss/train': 1.1087677478790283} 11/07/2021 13:39:46 - INFO - __main__ - Step 116924: {'lr': 5.9132081868350866e-05, 'samples': 22449408, 'steps': 116923, 'loss/train': 1.408492088317871} 11/07/2021 13:39:47 - INFO - __main__ - Step 116925: {'lr': 5.9128654596600104e-05, 'samples': 22449600, 'steps': 116924, 'loss/train': 1.0572586059570312} 11/07/2021 13:39:47 - INFO - __main__ - Step 116926: {'lr': 5.912522741085302e-05, 'samples': 22449792, 'steps': 116925, 'loss/train': 0.3249611258506775} 11/07/2021 13:39:48 - INFO - __main__ - Step 116927: {'lr': 5.912180031111117e-05, 'samples': 22449984, 'steps': 116926, 'loss/train': 0.5995003581047058} 11/07/2021 13:39:48 - INFO - __main__ - Step 116928: {'lr': 5.911837329737607e-05, 'samples': 22450176, 'steps': 116927, 'loss/train': 1.4550445079803467} 11/07/2021 13:39:49 - INFO - __main__ - Step 116929: {'lr': 5.9114946369649316e-05, 'samples': 22450368, 'steps': 116928, 'loss/train': 1.2643970251083374} 11/07/2021 13:39:49 - INFO - __main__ - Step 116930: {'lr': 5.91115195279324e-05, 'samples': 22450560, 'steps': 116929, 'loss/train': 0.4143041670322418} 11/07/2021 13:39:49 - INFO - __main__ - Step 116931: {'lr': 5.9108092772226896e-05, 'samples': 22450752, 'steps': 116930, 'loss/train': 1.1510329246520996} 11/07/2021 13:39:50 - INFO - __main__ - Step 116932: {'lr': 5.9104666102534345e-05, 'samples': 22450944, 'steps': 116931, 'loss/train': 1.1331833600997925} 11/07/2021 13:39:51 - INFO - __main__ - Step 116933: {'lr': 5.910123951885626e-05, 'samples': 22451136, 'steps': 116932, 'loss/train': 0.7514689564704895} 11/07/2021 13:39:51 - INFO - __main__ - Step 116934: {'lr': 5.90978130211943e-05, 'samples': 22451328, 'steps': 116933, 'loss/train': 1.8405482769012451} 11/07/2021 13:39:51 - INFO - __main__ - Step 116935: {'lr': 5.9094386609549855e-05, 'samples': 22451520, 'steps': 116934, 'loss/train': 1.4110653400421143} 11/07/2021 13:39:52 - INFO - __main__ - Step 116936: {'lr': 5.909096028392452e-05, 'samples': 22451712, 'steps': 116935, 'loss/train': 0.9587780833244324} 11/07/2021 13:39:52 - INFO - __main__ - Step 116937: {'lr': 5.908753404431985e-05, 'samples': 22451904, 'steps': 116936, 'loss/train': 1.3564400672912598} 11/07/2021 13:39:54 - INFO - __main__ - Step 116938: {'lr': 5.90841078907374e-05, 'samples': 22452096, 'steps': 116937, 'loss/train': 1.2535990476608276} 11/07/2021 13:39:54 - INFO - __main__ - Step 116939: {'lr': 5.908068182317872e-05, 'samples': 22452288, 'steps': 116938, 'loss/train': 1.3023837804794312} 11/07/2021 13:39:54 - INFO - __main__ - Step 116940: {'lr': 5.907725584164533e-05, 'samples': 22452480, 'steps': 116939, 'loss/train': 1.2768974304199219} 11/07/2021 13:39:55 - INFO - __main__ - Step 116941: {'lr': 5.907382994613877e-05, 'samples': 22452672, 'steps': 116940, 'loss/train': 0.4907156229019165} 11/07/2021 13:39:55 - INFO - __main__ - Step 116942: {'lr': 5.9070404136660594e-05, 'samples': 22452864, 'steps': 116941, 'loss/train': 1.083008885383606} 11/07/2021 13:39:56 - INFO - __main__ - Step 116943: {'lr': 5.9066978413212374e-05, 'samples': 22453056, 'steps': 116942, 'loss/train': 1.4204976558685303} 11/07/2021 13:39:56 - INFO - __main__ - Step 116944: {'lr': 5.906355277579559e-05, 'samples': 22453248, 'steps': 116943, 'loss/train': 0.9644710421562195} 11/07/2021 13:39:57 - INFO - __main__ - Step 116945: {'lr': 5.906012722441184e-05, 'samples': 22453440, 'steps': 116944, 'loss/train': 1.0488373041152954} 11/07/2021 13:39:57 - INFO - __main__ - Step 116946: {'lr': 5.905670175906266e-05, 'samples': 22453632, 'steps': 116945, 'loss/train': 5.67734956741333} 11/07/2021 13:39:57 - INFO - __main__ - Step 116947: {'lr': 5.9053276379749584e-05, 'samples': 22453824, 'steps': 116946, 'loss/train': 0.7490483522415161} 11/07/2021 13:39:58 - INFO - __main__ - Step 116948: {'lr': 5.90498510864742e-05, 'samples': 22454016, 'steps': 116947, 'loss/train': 0.7221468091011047} 11/07/2021 13:39:59 - INFO - __main__ - Step 116949: {'lr': 5.904642587923797e-05, 'samples': 22454208, 'steps': 116948, 'loss/train': 1.7788200378417969} 11/07/2021 13:39:59 - INFO - __main__ - Step 116950: {'lr': 5.904300075804245e-05, 'samples': 22454400, 'steps': 116949, 'loss/train': 1.0774835348129272} 11/07/2021 13:40:00 - INFO - __main__ - Step 116951: {'lr': 5.903957572288923e-05, 'samples': 22454592, 'steps': 116950, 'loss/train': 1.0600249767303467} 11/07/2021 13:40:00 - INFO - __main__ - Step 116952: {'lr': 5.90361507737798e-05, 'samples': 22454784, 'steps': 116951, 'loss/train': 1.3870340585708618} 11/07/2021 13:40:00 - INFO - __main__ - Step 116953: {'lr': 5.903272591071576e-05, 'samples': 22454976, 'steps': 116952, 'loss/train': 1.489662528038025} 11/07/2021 13:40:02 - INFO - __main__ - Step 116954: {'lr': 5.902930113369862e-05, 'samples': 22455168, 'steps': 116953, 'loss/train': 1.3688609600067139} 11/07/2021 13:40:02 - INFO - __main__ - Step 116955: {'lr': 5.902587644272991e-05, 'samples': 22455360, 'steps': 116954, 'loss/train': 0.7610368132591248} 11/07/2021 13:40:02 - INFO - __main__ - Step 116956: {'lr': 5.902245183781122e-05, 'samples': 22455552, 'steps': 116955, 'loss/train': 1.672531247138977} 11/07/2021 13:40:03 - INFO - __main__ - Step 116957: {'lr': 5.901902731894404e-05, 'samples': 22455744, 'steps': 116956, 'loss/train': 1.070824146270752} 11/07/2021 13:40:03 - INFO - __main__ - Step 116958: {'lr': 5.901560288612998e-05, 'samples': 22455936, 'steps': 116957, 'loss/train': 1.3610291481018066} 11/07/2021 13:40:03 - INFO - __main__ - Step 116959: {'lr': 5.901217853937049e-05, 'samples': 22456128, 'steps': 116958, 'loss/train': 1.143405795097351} 11/07/2021 13:40:04 - INFO - __main__ - Step 116960: {'lr': 5.9008754278667196e-05, 'samples': 22456320, 'steps': 116959, 'loss/train': 1.4816007614135742} 11/07/2021 13:40:05 - INFO - __main__ - Step 116961: {'lr': 5.9005330104021675e-05, 'samples': 22456512, 'steps': 116960, 'loss/train': 1.5139358043670654} 11/07/2021 13:40:05 - INFO - __main__ - Step 116962: {'lr': 5.9001906015435344e-05, 'samples': 22456704, 'steps': 116961, 'loss/train': 1.1924829483032227} 11/07/2021 13:40:05 - INFO - __main__ - Step 116963: {'lr': 5.8998482012909804e-05, 'samples': 22456896, 'steps': 116962, 'loss/train': 1.58516526222229} 11/07/2021 13:40:06 - INFO - __main__ - Step 116964: {'lr': 5.899505809644659e-05, 'samples': 22457088, 'steps': 116963, 'loss/train': 1.370505928993225} 11/07/2021 13:40:07 - INFO - __main__ - Step 116965: {'lr': 5.8991634266047254e-05, 'samples': 22457280, 'steps': 116964, 'loss/train': 1.538162112236023} 11/07/2021 13:40:07 - INFO - __main__ - Step 116966: {'lr': 5.8988210521713355e-05, 'samples': 22457472, 'steps': 116965, 'loss/train': 1.0671063661575317} 11/07/2021 13:40:08 - INFO - __main__ - Step 116967: {'lr': 5.898478686344641e-05, 'samples': 22457664, 'steps': 116966, 'loss/train': 1.4562230110168457} 11/07/2021 13:40:08 - INFO - __main__ - Step 116968: {'lr': 5.898136329124798e-05, 'samples': 22457856, 'steps': 116967, 'loss/train': 1.7682489156723022} 11/07/2021 13:40:08 - INFO - __main__ - Step 116969: {'lr': 5.897793980511959e-05, 'samples': 22458048, 'steps': 116968, 'loss/train': 1.2363203763961792} 11/07/2021 13:40:09 - INFO - __main__ - Step 116970: {'lr': 5.8974516405062824e-05, 'samples': 22458240, 'steps': 116969, 'loss/train': 1.3633012771606445} 11/07/2021 13:40:10 - INFO - __main__ - Step 116971: {'lr': 5.8971093091079145e-05, 'samples': 22458432, 'steps': 116970, 'loss/train': 1.009773850440979} 11/07/2021 13:40:10 - INFO - __main__ - Step 116972: {'lr': 5.8967669863170175e-05, 'samples': 22458624, 'steps': 116971, 'loss/train': 0.9999613165855408} 11/07/2021 13:40:10 - INFO - __main__ - Step 116973: {'lr': 5.8964246721337404e-05, 'samples': 22458816, 'steps': 116972, 'loss/train': 1.12766695022583} 11/07/2021 13:40:11 - INFO - __main__ - Step 116974: {'lr': 5.89608236655825e-05, 'samples': 22459008, 'steps': 116973, 'loss/train': 1.2066317796707153} 11/07/2021 13:40:12 - INFO - __main__ - Step 116975: {'lr': 5.89574006959068e-05, 'samples': 22459200, 'steps': 116974, 'loss/train': 1.7622545957565308} 11/07/2021 13:40:12 - INFO - __main__ - Step 116976: {'lr': 5.895397781231196e-05, 'samples': 22459392, 'steps': 116975, 'loss/train': 1.7268986701965332} 11/07/2021 13:40:12 - INFO - __main__ - Step 116977: {'lr': 5.895055501479951e-05, 'samples': 22459584, 'steps': 116976, 'loss/train': 1.3105254173278809} 11/07/2021 13:40:13 - INFO - __main__ - Step 116978: {'lr': 5.894713230337098e-05, 'samples': 22459776, 'steps': 116977, 'loss/train': 1.324151873588562} 11/07/2021 13:40:13 - INFO - __main__ - Step 116979: {'lr': 5.894370967802793e-05, 'samples': 22459968, 'steps': 116978, 'loss/train': 1.4609959125518799} 11/07/2021 13:40:14 - INFO - __main__ - Step 116980: {'lr': 5.8940287138771895e-05, 'samples': 22460160, 'steps': 116979, 'loss/train': 1.1544703245162964} 11/07/2021 13:40:15 - INFO - __main__ - Step 116981: {'lr': 5.893686468560444e-05, 'samples': 22460352, 'steps': 116980, 'loss/train': 0.5540047883987427} 11/07/2021 13:40:15 - INFO - __main__ - Step 116982: {'lr': 5.893344231852707e-05, 'samples': 22460544, 'steps': 116981, 'loss/train': 1.1688127517700195} 11/07/2021 13:40:15 - INFO - __main__ - Step 116983: {'lr': 5.8930020037541335e-05, 'samples': 22460736, 'steps': 116982, 'loss/train': 1.525012731552124} 11/07/2021 13:40:16 - INFO - __main__ - Step 116984: {'lr': 5.8926597842648784e-05, 'samples': 22460928, 'steps': 116983, 'loss/train': 1.2732293605804443} 11/07/2021 13:40:17 - INFO - __main__ - Step 116985: {'lr': 5.8923175733850945e-05, 'samples': 22461120, 'steps': 116984, 'loss/train': 1.4886003732681274} 11/07/2021 13:40:17 - INFO - __main__ - Step 116986: {'lr': 5.891975371114941e-05, 'samples': 22461312, 'steps': 116985, 'loss/train': 1.463805079460144} 11/07/2021 13:40:17 - INFO - __main__ - Step 116987: {'lr': 5.8916331774545665e-05, 'samples': 22461504, 'steps': 116986, 'loss/train': 0.8691604137420654} 11/07/2021 13:40:18 - INFO - __main__ - Step 116988: {'lr': 5.8912909924041355e-05, 'samples': 22461696, 'steps': 116987, 'loss/train': 1.3898780345916748} 11/07/2021 13:40:18 - INFO - __main__ - Step 116989: {'lr': 5.890948815963787e-05, 'samples': 22461888, 'steps': 116988, 'loss/train': 1.5481314659118652} 11/07/2021 13:40:19 - INFO - __main__ - Step 116990: {'lr': 5.8906066481336815e-05, 'samples': 22462080, 'steps': 116989, 'loss/train': 1.0038045644760132} 11/07/2021 13:40:19 - INFO - __main__ - Step 116991: {'lr': 5.890264488913971e-05, 'samples': 22462272, 'steps': 116990, 'loss/train': 1.4793020486831665} 11/07/2021 13:40:20 - INFO - __main__ - Step 116992: {'lr': 5.889922338304815e-05, 'samples': 22462464, 'steps': 116991, 'loss/train': 0.971707284450531} 11/07/2021 13:40:20 - INFO - __main__ - Step 116993: {'lr': 5.8895801963063656e-05, 'samples': 22462656, 'steps': 116992, 'loss/train': 1.3261526823043823} 11/07/2021 13:40:20 - INFO - __main__ - Step 116994: {'lr': 5.889238062918775e-05, 'samples': 22462848, 'steps': 116993, 'loss/train': 1.3773616552352905} 11/07/2021 13:40:21 - INFO - __main__ - Step 116995: {'lr': 5.8888959381422025e-05, 'samples': 22463040, 'steps': 116994, 'loss/train': 1.3851304054260254} 11/07/2021 13:40:22 - INFO - __main__ - Step 116996: {'lr': 5.8885538219767944e-05, 'samples': 22463232, 'steps': 116995, 'loss/train': 1.2798888683319092} 11/07/2021 13:40:22 - INFO - __main__ - Step 116997: {'lr': 5.8882117144227115e-05, 'samples': 22463424, 'steps': 116996, 'loss/train': 1.1109365224838257} 11/07/2021 13:40:23 - INFO - __main__ - Step 116998: {'lr': 5.887869615480104e-05, 'samples': 22463616, 'steps': 116997, 'loss/train': 1.932264804840088} 11/07/2021 13:40:23 - INFO - __main__ - Step 116999: {'lr': 5.887527525149128e-05, 'samples': 22463808, 'steps': 116998, 'loss/train': 1.408097743988037} 11/07/2021 13:40:23 - INFO - __main__ - Step 117000: {'lr': 5.8871854434299375e-05, 'samples': 22464000, 'steps': 116999, 'loss/train': 0.967668890953064} 11/07/2021 13:40:24 - INFO - __main__ - Step 117001: {'lr': 5.886843370322692e-05, 'samples': 22464192, 'steps': 117000, 'loss/train': 1.1651898622512817} 11/07/2021 13:40:25 - INFO - __main__ - Step 117002: {'lr': 5.8865013058275354e-05, 'samples': 22464384, 'steps': 117001, 'loss/train': 0.8170110583305359} 11/07/2021 13:40:25 - INFO - __main__ - Step 117003: {'lr': 5.8861592499446225e-05, 'samples': 22464576, 'steps': 117002, 'loss/train': 1.1112165451049805} 11/07/2021 13:40:25 - INFO - __main__ - Step 117004: {'lr': 5.885817202674115e-05, 'samples': 22464768, 'steps': 117003, 'loss/train': 1.4065886735916138} 11/07/2021 13:40:26 - INFO - __main__ - Step 117005: {'lr': 5.8854751640161633e-05, 'samples': 22464960, 'steps': 117004, 'loss/train': 1.44010591506958} 11/07/2021 13:40:27 - INFO - __main__ - Step 117006: {'lr': 5.8851331339709186e-05, 'samples': 22465152, 'steps': 117005, 'loss/train': 1.4609875679016113} 11/07/2021 13:40:27 - INFO - __main__ - Step 117007: {'lr': 5.88479111253854e-05, 'samples': 22465344, 'steps': 117006, 'loss/train': 1.4756412506103516} 11/07/2021 13:40:28 - INFO - __main__ - Step 117008: {'lr': 5.88444909971918e-05, 'samples': 22465536, 'steps': 117007, 'loss/train': 0.6649128794670105} 11/07/2021 13:40:28 - INFO - __main__ - Step 117009: {'lr': 5.8841070955129916e-05, 'samples': 22465728, 'steps': 117008, 'loss/train': 1.1394966840744019} 11/07/2021 13:40:28 - INFO - __main__ - Step 117010: {'lr': 5.883765099920127e-05, 'samples': 22465920, 'steps': 117009, 'loss/train': 1.3889902830123901} 11/07/2021 13:40:29 - INFO - __main__ - Step 117011: {'lr': 5.8834231129407476e-05, 'samples': 22466112, 'steps': 117010, 'loss/train': 1.6371984481811523} 11/07/2021 13:40:30 - INFO - __main__ - Step 117012: {'lr': 5.8830811345749995e-05, 'samples': 22466304, 'steps': 117011, 'loss/train': 1.3199527263641357} 11/07/2021 13:40:30 - INFO - __main__ - Step 117013: {'lr': 5.882739164823039e-05, 'samples': 22466496, 'steps': 117012, 'loss/train': 1.0578457117080688} 11/07/2021 13:40:30 - INFO - __main__ - Step 117014: {'lr': 5.882397203685025e-05, 'samples': 22466688, 'steps': 117013, 'loss/train': 0.6982430815696716} 11/07/2021 13:40:31 - INFO - __main__ - Step 117015: {'lr': 5.882055251161114e-05, 'samples': 22466880, 'steps': 117014, 'loss/train': 1.1814836263656616} 11/07/2021 13:40:31 - INFO - __main__ - Step 117016: {'lr': 5.8817133072514463e-05, 'samples': 22467072, 'steps': 117015, 'loss/train': 0.9767279028892517} 11/07/2021 13:40:32 - INFO - __main__ - Step 117017: {'lr': 5.881371371956182e-05, 'samples': 22467264, 'steps': 117016, 'loss/train': 1.6181637048721313} 11/07/2021 13:40:32 - INFO - __main__ - Step 117018: {'lr': 5.881029445275476e-05, 'samples': 22467456, 'steps': 117017, 'loss/train': 1.2085151672363281} 11/07/2021 13:40:33 - INFO - __main__ - Step 117019: {'lr': 5.880687527209486e-05, 'samples': 22467648, 'steps': 117018, 'loss/train': 1.4032313823699951} 11/07/2021 13:40:33 - INFO - __main__ - Step 117020: {'lr': 5.880345617758362e-05, 'samples': 22467840, 'steps': 117019, 'loss/train': 1.4889724254608154} 11/07/2021 13:40:34 - INFO - __main__ - Step 117021: {'lr': 5.880003716922261e-05, 'samples': 22468032, 'steps': 117020, 'loss/train': 1.3282092809677124} 11/07/2021 13:40:35 - INFO - __main__ - Step 117022: {'lr': 5.879661824701332e-05, 'samples': 22468224, 'steps': 117021, 'loss/train': 1.3285770416259766} 11/07/2021 13:40:35 - INFO - __main__ - Step 117023: {'lr': 5.879319941095734e-05, 'samples': 22468416, 'steps': 117022, 'loss/train': 1.46878182888031} 11/07/2021 13:40:35 - INFO - __main__ - Step 117024: {'lr': 5.8789780661056194e-05, 'samples': 22468608, 'steps': 117023, 'loss/train': 1.3253042697906494} 11/07/2021 13:40:36 - INFO - __main__ - Step 117025: {'lr': 5.8786361997311436e-05, 'samples': 22468800, 'steps': 117024, 'loss/train': 0.9580855965614319} 11/07/2021 13:40:36 - INFO - __main__ - Step 117026: {'lr': 5.8782943419724565e-05, 'samples': 22468992, 'steps': 117025, 'loss/train': 0.9338411688804626} 11/07/2021 13:40:37 - INFO - __main__ - Step 117027: {'lr': 5.877952492829716e-05, 'samples': 22469184, 'steps': 117026, 'loss/train': 1.4360641241073608} 11/07/2021 13:40:37 - INFO - __main__ - Step 117028: {'lr': 5.8776106523030835e-05, 'samples': 22469376, 'steps': 117027, 'loss/train': 1.5278421640396118} 11/07/2021 13:40:38 - INFO - __main__ - Step 117029: {'lr': 5.877268820392698e-05, 'samples': 22469568, 'steps': 117028, 'loss/train': 1.1860475540161133} 11/07/2021 13:40:38 - INFO - __main__ - Step 117030: {'lr': 5.876926997098717e-05, 'samples': 22469760, 'steps': 117029, 'loss/train': 1.2874386310577393} 11/07/2021 13:40:38 - INFO - __main__ - Step 117031: {'lr': 5.8765851824212984e-05, 'samples': 22469952, 'steps': 117030, 'loss/train': 1.0756815671920776} 11/07/2021 13:40:40 - INFO - __main__ - Step 117032: {'lr': 5.876243376360596e-05, 'samples': 22470144, 'steps': 117031, 'loss/train': 1.6257925033569336} 11/07/2021 13:40:40 - INFO - __main__ - Step 117033: {'lr': 5.8759015789167646e-05, 'samples': 22470336, 'steps': 117032, 'loss/train': 1.2987945079803467} 11/07/2021 13:40:40 - INFO - __main__ - Step 117034: {'lr': 5.875559790089957e-05, 'samples': 22470528, 'steps': 117033, 'loss/train': 1.3793561458587646} 11/07/2021 13:40:41 - INFO - __main__ - Step 117035: {'lr': 5.875218009880323e-05, 'samples': 22470720, 'steps': 117034, 'loss/train': 1.3088585138320923} 11/07/2021 13:40:41 - INFO - __main__ - Step 117036: {'lr': 5.8748762382880236e-05, 'samples': 22470912, 'steps': 117035, 'loss/train': 0.8346037268638611} 11/07/2021 13:40:43 - INFO - __main__ - Step 117037: {'lr': 5.874534475313212e-05, 'samples': 22471104, 'steps': 117036, 'loss/train': 1.5169118642807007} 11/07/2021 13:40:44 - INFO - __main__ - Step 117038: {'lr': 5.874192720956037e-05, 'samples': 22471296, 'steps': 117037, 'loss/train': 0.7475417852401733} 11/07/2021 13:40:44 - INFO - __main__ - Step 117039: {'lr': 5.873850975216655e-05, 'samples': 22471488, 'steps': 117038, 'loss/train': 1.7445718050003052} 11/07/2021 13:40:44 - INFO - __main__ - Step 117040: {'lr': 5.873509238095223e-05, 'samples': 22471680, 'steps': 117039, 'loss/train': 0.35532644391059875} 11/07/2021 13:40:45 - INFO - __main__ - Step 117041: {'lr': 5.873167509591892e-05, 'samples': 22471872, 'steps': 117040, 'loss/train': 1.734529972076416} 11/07/2021 13:40:45 - INFO - __main__ - Step 117042: {'lr': 5.8728257897068234e-05, 'samples': 22472064, 'steps': 117041, 'loss/train': 0.9720117449760437} 11/07/2021 13:40:45 - INFO - __main__ - Step 117043: {'lr': 5.872484078440154e-05, 'samples': 22472256, 'steps': 117042, 'loss/train': 2.0847415924072266} 11/07/2021 13:40:46 - INFO - __main__ - Step 117044: {'lr': 5.872142375792053e-05, 'samples': 22472448, 'steps': 117043, 'loss/train': 0.9903038740158081} 11/07/2021 13:40:47 - INFO - __main__ - Step 117045: {'lr': 5.871800681762668e-05, 'samples': 22472640, 'steps': 117044, 'loss/train': 1.4904435873031616} 11/07/2021 13:40:47 - INFO - __main__ - Step 117046: {'lr': 5.8714589963521524e-05, 'samples': 22472832, 'steps': 117045, 'loss/train': 1.2893191576004028} 11/07/2021 13:40:47 - INFO - __main__ - Step 117047: {'lr': 5.871117319560665e-05, 'samples': 22473024, 'steps': 117046, 'loss/train': 1.56229567527771} 11/07/2021 13:40:48 - INFO - __main__ - Step 117048: {'lr': 5.8707756513883546e-05, 'samples': 22473216, 'steps': 117047, 'loss/train': 0.8746272325515747} 11/07/2021 13:40:48 - INFO - __main__ - Step 117049: {'lr': 5.8704339918353806e-05, 'samples': 22473408, 'steps': 117048, 'loss/train': 1.4965845346450806} 11/07/2021 13:40:49 - INFO - __main__ - Step 117050: {'lr': 5.870092340901892e-05, 'samples': 22473600, 'steps': 117049, 'loss/train': 0.5678204298019409} 11/07/2021 13:40:50 - INFO - __main__ - Step 117051: {'lr': 5.8697506985880446e-05, 'samples': 22473792, 'steps': 117050, 'loss/train': 2.0697524547576904} 11/07/2021 13:40:50 - INFO - __main__ - Step 117052: {'lr': 5.869409064893991e-05, 'samples': 22473984, 'steps': 117051, 'loss/train': 1.4456045627593994} 11/07/2021 13:40:50 - INFO - __main__ - Step 117053: {'lr': 5.869067439819889e-05, 'samples': 22474176, 'steps': 117052, 'loss/train': 1.134583592414856} 11/07/2021 13:40:51 - INFO - __main__ - Step 117054: {'lr': 5.868725823365889e-05, 'samples': 22474368, 'steps': 117053, 'loss/train': 1.4777271747589111} 11/07/2021 13:40:52 - INFO - __main__ - Step 117055: {'lr': 5.868384215532152e-05, 'samples': 22474560, 'steps': 117054, 'loss/train': 1.3096705675125122} 11/07/2021 13:40:52 - INFO - __main__ - Step 117056: {'lr': 5.8680426163188195e-05, 'samples': 22474752, 'steps': 117055, 'loss/train': 1.387418508529663} 11/07/2021 13:40:52 - INFO - __main__ - Step 117057: {'lr': 5.867701025726052e-05, 'samples': 22474944, 'steps': 117056, 'loss/train': 0.9410250186920166} 11/07/2021 13:40:53 - INFO - __main__ - Step 117058: {'lr': 5.867359443754003e-05, 'samples': 22475136, 'steps': 117057, 'loss/train': 1.0883870124816895} 11/07/2021 13:40:53 - INFO - __main__ - Step 117059: {'lr': 5.867017870402827e-05, 'samples': 22475328, 'steps': 117058, 'loss/train': 1.3903597593307495} 11/07/2021 13:40:54 - INFO - __main__ - Step 117060: {'lr': 5.8666763056726777e-05, 'samples': 22475520, 'steps': 117059, 'loss/train': 1.4413424730300903} 11/07/2021 13:40:55 - INFO - __main__ - Step 117061: {'lr': 5.866334749563707e-05, 'samples': 22475712, 'steps': 117060, 'loss/train': 0.8736732602119446} 11/07/2021 13:40:55 - INFO - __main__ - Step 117062: {'lr': 5.865993202076073e-05, 'samples': 22475904, 'steps': 117061, 'loss/train': 1.0551437139511108} 11/07/2021 13:40:55 - INFO - __main__ - Step 117063: {'lr': 5.865651663209925e-05, 'samples': 22476096, 'steps': 117062, 'loss/train': 1.2057267427444458} 11/07/2021 13:40:56 - INFO - __main__ - Step 117064: {'lr': 5.865310132965421e-05, 'samples': 22476288, 'steps': 117063, 'loss/train': 1.4696568250656128} 11/07/2021 13:40:56 - INFO - __main__ - Step 117065: {'lr': 5.864968611342711e-05, 'samples': 22476480, 'steps': 117064, 'loss/train': 1.2347036600112915} 11/07/2021 13:40:57 - INFO - __main__ - Step 117066: {'lr': 5.8646270983419515e-05, 'samples': 22476672, 'steps': 117065, 'loss/train': 0.8601624965667725} 11/07/2021 13:40:57 - INFO - __main__ - Step 117067: {'lr': 5.864285593963298e-05, 'samples': 22476864, 'steps': 117066, 'loss/train': 1.1581350564956665} 11/07/2021 13:40:58 - INFO - __main__ - Step 117068: {'lr': 5.8639440982068995e-05, 'samples': 22477056, 'steps': 117067, 'loss/train': 1.4388623237609863} 11/07/2021 13:40:58 - INFO - __main__ - Step 117069: {'lr': 5.863602611072921e-05, 'samples': 22477248, 'steps': 117068, 'loss/train': 1.3223108053207397} 11/07/2021 13:40:58 - INFO - __main__ - Step 117070: {'lr': 5.863261132561501e-05, 'samples': 22477440, 'steps': 117069, 'loss/train': 1.594852089881897} 11/07/2021 13:41:00 - INFO - __main__ - Step 117071: {'lr': 5.8629196626728e-05, 'samples': 22477632, 'steps': 117070, 'loss/train': 1.1130616664886475} 11/07/2021 13:41:00 - INFO - __main__ - Step 117072: {'lr': 5.8625782014069706e-05, 'samples': 22477824, 'steps': 117071, 'loss/train': 0.7379843592643738} 11/07/2021 13:41:00 - INFO - __main__ - Step 117073: {'lr': 5.8622367487641685e-05, 'samples': 22478016, 'steps': 117072, 'loss/train': 1.5208454132080078} 11/07/2021 13:41:01 - INFO - __main__ - Step 117074: {'lr': 5.8618953047445494e-05, 'samples': 22478208, 'steps': 117073, 'loss/train': 1.3095027208328247} 11/07/2021 13:41:01 - INFO - __main__ - Step 117075: {'lr': 5.861553869348263e-05, 'samples': 22478400, 'steps': 117074, 'loss/train': 0.9283031821250916} 11/07/2021 13:41:02 - INFO - __main__ - Step 117076: {'lr': 5.861212442575464e-05, 'samples': 22478592, 'steps': 117075, 'loss/train': 0.6932740211486816} 11/07/2021 13:41:02 - INFO - __main__ - Step 117077: {'lr': 5.860871024426309e-05, 'samples': 22478784, 'steps': 117076, 'loss/train': 1.2897392511367798} 11/07/2021 13:41:03 - INFO - __main__ - Step 117078: {'lr': 5.86052961490095e-05, 'samples': 22478976, 'steps': 117077, 'loss/train': 1.2164095640182495} 11/07/2021 13:41:03 - INFO - __main__ - Step 117079: {'lr': 5.8601882139995426e-05, 'samples': 22479168, 'steps': 117078, 'loss/train': 1.297766089439392} 11/07/2021 13:41:03 - INFO - __main__ - Step 117080: {'lr': 5.859846821722237e-05, 'samples': 22479360, 'steps': 117079, 'loss/train': 1.2262517213821411} 11/07/2021 13:41:05 - INFO - __main__ - Step 117081: {'lr': 5.85950543806919e-05, 'samples': 22479552, 'steps': 117080, 'loss/train': 1.3047382831573486} 11/07/2021 13:41:05 - INFO - __main__ - Step 117082: {'lr': 5.859164063040562e-05, 'samples': 22479744, 'steps': 117081, 'loss/train': 1.1451447010040283} 11/07/2021 13:41:05 - INFO - __main__ - Step 117083: {'lr': 5.85882269663649e-05, 'samples': 22479936, 'steps': 117082, 'loss/train': 2.257002592086792} 11/07/2021 13:41:06 - INFO - __main__ - Step 117084: {'lr': 5.858481338857138e-05, 'samples': 22480128, 'steps': 117083, 'loss/train': 1.2183047533035278} 11/07/2021 13:41:06 - INFO - __main__ - Step 117085: {'lr': 5.85813998970266e-05, 'samples': 22480320, 'steps': 117084, 'loss/train': 1.4305025339126587} 11/07/2021 13:41:06 - INFO - __main__ - Step 117086: {'lr': 5.857798649173207e-05, 'samples': 22480512, 'steps': 117085, 'loss/train': 1.5568188428878784} 11/07/2021 13:41:07 - INFO - __main__ - Step 117087: {'lr': 5.8574573172689356e-05, 'samples': 22480704, 'steps': 117086, 'loss/train': 1.2141940593719482} 11/07/2021 13:41:08 - INFO - __main__ - Step 117088: {'lr': 5.857115993990001e-05, 'samples': 22480896, 'steps': 117087, 'loss/train': 1.423673152923584} 11/07/2021 13:41:08 - INFO - __main__ - Step 117089: {'lr': 5.856774679336552e-05, 'samples': 22481088, 'steps': 117088, 'loss/train': 1.0374914407730103} 11/07/2021 13:41:08 - INFO - __main__ - Step 117090: {'lr': 5.856433373308745e-05, 'samples': 22481280, 'steps': 117089, 'loss/train': 1.1964470148086548} 11/07/2021 13:41:09 - INFO - __main__ - Step 117091: {'lr': 5.856092075906733e-05, 'samples': 22481472, 'steps': 117090, 'loss/train': 1.1768798828125} 11/07/2021 13:41:10 - INFO - __main__ - Step 117092: {'lr': 5.8557507871306705e-05, 'samples': 22481664, 'steps': 117091, 'loss/train': 0.9978674650192261} 11/07/2021 13:41:10 - INFO - __main__ - Step 117093: {'lr': 5.855409506980713e-05, 'samples': 22481856, 'steps': 117092, 'loss/train': 1.7329561710357666} 11/07/2021 13:41:11 - INFO - __main__ - Step 117094: {'lr': 5.855068235457012e-05, 'samples': 22482048, 'steps': 117093, 'loss/train': 1.190156102180481} 11/07/2021 13:41:11 - INFO - __main__ - Step 117095: {'lr': 5.854726972559729e-05, 'samples': 22482240, 'steps': 117094, 'loss/train': 1.3651973009109497} 11/07/2021 13:41:11 - INFO - __main__ - Step 117096: {'lr': 5.854385718289004e-05, 'samples': 22482432, 'steps': 117095, 'loss/train': 1.4931409358978271} 11/07/2021 13:41:13 - INFO - __main__ - Step 117097: {'lr': 5.854044472644995e-05, 'samples': 22482624, 'steps': 117096, 'loss/train': 1.4084750413894653} 11/07/2021 13:41:13 - INFO - __main__ - Step 117098: {'lr': 5.8537032356278605e-05, 'samples': 22482816, 'steps': 117097, 'loss/train': 0.7876242399215698} 11/07/2021 13:41:13 - INFO - __main__ - Step 117099: {'lr': 5.85336200723775e-05, 'samples': 22483008, 'steps': 117098, 'loss/train': 1.6948518753051758} 11/07/2021 13:41:14 - INFO - __main__ - Step 117100: {'lr': 5.853020787474822e-05, 'samples': 22483200, 'steps': 117099, 'loss/train': 0.8712669610977173} 11/07/2021 13:41:14 - INFO - __main__ - Step 117101: {'lr': 5.8526795763392235e-05, 'samples': 22483392, 'steps': 117100, 'loss/train': 0.9197744727134705} 11/07/2021 13:41:14 - INFO - __main__ - Step 117102: {'lr': 5.852338373831115e-05, 'samples': 22483584, 'steps': 117101, 'loss/train': 0.6221929788589478} 11/07/2021 13:41:16 - INFO - __main__ - Step 117103: {'lr': 5.851997179950647e-05, 'samples': 22483776, 'steps': 117102, 'loss/train': 1.2154440879821777} 11/07/2021 13:41:16 - INFO - __main__ - Step 117104: {'lr': 5.8516559946979714e-05, 'samples': 22483968, 'steps': 117103, 'loss/train': 0.8480746746063232} 11/07/2021 13:41:17 - INFO - __main__ - Step 117105: {'lr': 5.8513148180732476e-05, 'samples': 22484160, 'steps': 117104, 'loss/train': 1.481086254119873} 11/07/2021 13:41:17 - INFO - __main__ - Step 117106: {'lr': 5.850973650076624e-05, 'samples': 22484352, 'steps': 117105, 'loss/train': 1.2972086668014526} 11/07/2021 13:41:17 - INFO - __main__ - Step 117107: {'lr': 5.850632490708255e-05, 'samples': 22484544, 'steps': 117106, 'loss/train': 0.14370258152484894} 11/07/2021 13:41:18 - INFO - __main__ - Step 117108: {'lr': 5.850291339968297e-05, 'samples': 22484736, 'steps': 117107, 'loss/train': 1.0322624444961548} 11/07/2021 13:41:19 - INFO - __main__ - Step 117109: {'lr': 5.8499501978569094e-05, 'samples': 22484928, 'steps': 117108, 'loss/train': 1.312805414199829} 11/07/2021 13:41:19 - INFO - __main__ - Step 117110: {'lr': 5.849609064374231e-05, 'samples': 22485120, 'steps': 117109, 'loss/train': 1.3904106616973877} 11/07/2021 13:41:19 - INFO - __main__ - Step 117111: {'lr': 5.8492679395204254e-05, 'samples': 22485312, 'steps': 117110, 'loss/train': 1.3386403322219849} 11/07/2021 13:41:20 - INFO - __main__ - Step 117112: {'lr': 5.848926823295642e-05, 'samples': 22485504, 'steps': 117111, 'loss/train': 1.7421865463256836} 11/07/2021 13:41:20 - INFO - __main__ - Step 117113: {'lr': 5.848585715700036e-05, 'samples': 22485696, 'steps': 117112, 'loss/train': 1.4526581764221191} 11/07/2021 13:41:21 - INFO - __main__ - Step 117114: {'lr': 5.848244616733764e-05, 'samples': 22485888, 'steps': 117113, 'loss/train': 1.2274279594421387} 11/07/2021 13:41:21 - INFO - __main__ - Step 117115: {'lr': 5.847903526396975e-05, 'samples': 22486080, 'steps': 117114, 'loss/train': 1.0706980228424072} 11/07/2021 13:41:22 - INFO - __main__ - Step 117116: {'lr': 5.8475624446898275e-05, 'samples': 22486272, 'steps': 117115, 'loss/train': 1.4483288526535034} 11/07/2021 13:41:22 - INFO - __main__ - Step 117117: {'lr': 5.847221371612471e-05, 'samples': 22486464, 'steps': 117116, 'loss/train': 1.2516123056411743} 11/07/2021 13:41:23 - INFO - __main__ - Step 117118: {'lr': 5.846880307165062e-05, 'samples': 22486656, 'steps': 117117, 'loss/train': 1.4266480207443237} 11/07/2021 13:41:24 - INFO - __main__ - Step 117119: {'lr': 5.8465392513477514e-05, 'samples': 22486848, 'steps': 117118, 'loss/train': 1.427091121673584} 11/07/2021 13:41:24 - INFO - __main__ - Step 117120: {'lr': 5.8461982041606964e-05, 'samples': 22487040, 'steps': 117119, 'loss/train': 1.4845592975616455} 11/07/2021 13:41:24 - INFO - __main__ - Step 117121: {'lr': 5.8458571656040486e-05, 'samples': 22487232, 'steps': 117120, 'loss/train': 1.5104540586471558} 11/07/2021 13:41:25 - INFO - __main__ - Step 117122: {'lr': 5.84551613567797e-05, 'samples': 22487424, 'steps': 117121, 'loss/train': 0.8728494048118591} 11/07/2021 13:41:25 - INFO - __main__ - Step 117123: {'lr': 5.845175114382598e-05, 'samples': 22487616, 'steps': 117122, 'loss/train': 1.4714537858963013} 11/07/2021 13:41:26 - INFO - __main__ - Step 117124: {'lr': 5.844834101718094e-05, 'samples': 22487808, 'steps': 117123, 'loss/train': 1.3339295387268066} 11/07/2021 13:41:26 - INFO - __main__ - Step 117125: {'lr': 5.8444930976846136e-05, 'samples': 22488000, 'steps': 117124, 'loss/train': 1.1935431957244873} 11/07/2021 13:41:27 - INFO - __main__ - Step 117126: {'lr': 5.8441521022823076e-05, 'samples': 22488192, 'steps': 117125, 'loss/train': 1.431269884109497} 11/07/2021 13:41:27 - INFO - __main__ - Step 117127: {'lr': 5.8438111155113324e-05, 'samples': 22488384, 'steps': 117126, 'loss/train': 1.496519684791565} 11/07/2021 13:41:27 - INFO - __main__ - Step 117128: {'lr': 5.84347013737184e-05, 'samples': 22488576, 'steps': 117127, 'loss/train': 1.393254041671753} 11/07/2021 13:41:28 - INFO - __main__ - Step 117129: {'lr': 5.8431291678639836e-05, 'samples': 22488768, 'steps': 117128, 'loss/train': 1.2878978252410889} 11/07/2021 13:41:29 - INFO - __main__ - Step 117130: {'lr': 5.84278820698792e-05, 'samples': 22488960, 'steps': 117129, 'loss/train': 1.7346165180206299} 11/07/2021 13:41:29 - INFO - __main__ - Step 117131: {'lr': 5.842447254743796e-05, 'samples': 22489152, 'steps': 117130, 'loss/train': 1.3418464660644531} 11/07/2021 13:41:29 - INFO - __main__ - Step 117132: {'lr': 5.842106311131773e-05, 'samples': 22489344, 'steps': 117131, 'loss/train': 1.3911869525909424} 11/07/2021 13:41:30 - INFO - __main__ - Step 117133: {'lr': 5.8417653761520004e-05, 'samples': 22489536, 'steps': 117132, 'loss/train': 0.9716026782989502} 11/07/2021 13:41:31 - INFO - __main__ - Step 117134: {'lr': 5.8414244498046416e-05, 'samples': 22489728, 'steps': 117133, 'loss/train': 1.3059165477752686} 11/07/2021 13:41:31 - INFO - __main__ - Step 117135: {'lr': 5.8410835320898306e-05, 'samples': 22489920, 'steps': 117134, 'loss/train': 0.9078803062438965} 11/07/2021 13:41:32 - INFO - __main__ - Step 117136: {'lr': 5.8407426230077334e-05, 'samples': 22490112, 'steps': 117135, 'loss/train': 1.2804408073425293} 11/07/2021 13:41:32 - INFO - __main__ - Step 117137: {'lr': 5.8404017225585025e-05, 'samples': 22490304, 'steps': 117136, 'loss/train': 1.216346263885498} 11/07/2021 13:41:32 - INFO - __main__ - Step 117138: {'lr': 5.840060830742292e-05, 'samples': 22490496, 'steps': 117137, 'loss/train': 0.5975107550621033} 11/07/2021 13:41:33 - INFO - __main__ - Step 117139: {'lr': 5.839719947559252e-05, 'samples': 22490688, 'steps': 117138, 'loss/train': 1.1539658308029175} 11/07/2021 13:41:34 - INFO - __main__ - Step 117140: {'lr': 5.83937907300954e-05, 'samples': 22490880, 'steps': 117139, 'loss/train': 1.2997139692306519} 11/07/2021 13:41:34 - INFO - __main__ - Step 117141: {'lr': 5.839038207093309e-05, 'samples': 22491072, 'steps': 117140, 'loss/train': 0.387315571308136} 11/07/2021 13:41:34 - INFO - __main__ - Step 117142: {'lr': 5.838697349810709e-05, 'samples': 22491264, 'steps': 117141, 'loss/train': 1.1680470705032349} 11/07/2021 13:41:35 - INFO - __main__ - Step 117143: {'lr': 5.838356501161898e-05, 'samples': 22491456, 'steps': 117142, 'loss/train': 1.4994920492172241} 11/07/2021 13:41:36 - INFO - __main__ - Step 117144: {'lr': 5.838015661147028e-05, 'samples': 22491648, 'steps': 117143, 'loss/train': 1.5517079830169678} 11/07/2021 13:41:37 - INFO - __main__ - Step 117145: {'lr': 5.837674829766257e-05, 'samples': 22491840, 'steps': 117144, 'loss/train': 1.2230983972549438} 11/07/2021 13:41:37 - INFO - __main__ - Step 117146: {'lr': 5.837334007019729e-05, 'samples': 22492032, 'steps': 117145, 'loss/train': 1.3324776887893677} 11/07/2021 13:41:37 - INFO - __main__ - Step 117147: {'lr': 5.8369931929076026e-05, 'samples': 22492224, 'steps': 117146, 'loss/train': 0.0767856314778328} 11/07/2021 13:41:38 - INFO - __main__ - Step 117148: {'lr': 5.8366523874300334e-05, 'samples': 22492416, 'steps': 117147, 'loss/train': 1.3275532722473145} 11/07/2021 13:41:39 - INFO - __main__ - Step 117149: {'lr': 5.83631159058717e-05, 'samples': 22492608, 'steps': 117148, 'loss/train': 0.6582574844360352} 11/07/2021 13:41:39 - INFO - __main__ - Step 117150: {'lr': 5.8359708023791704e-05, 'samples': 22492800, 'steps': 117149, 'loss/train': 1.0378507375717163} 11/07/2021 13:41:39 - INFO - __main__ - Step 117151: {'lr': 5.835630022806185e-05, 'samples': 22492992, 'steps': 117150, 'loss/train': 1.3685001134872437} 11/07/2021 13:41:40 - INFO - __main__ - Step 117152: {'lr': 5.8352892518683695e-05, 'samples': 22493184, 'steps': 117151, 'loss/train': 1.0940966606140137} 11/07/2021 13:41:40 - INFO - __main__ - Step 117153: {'lr': 5.8349484895658775e-05, 'samples': 22493376, 'steps': 117152, 'loss/train': 1.2348086833953857} 11/07/2021 13:41:41 - INFO - __main__ - Step 117154: {'lr': 5.834607735898861e-05, 'samples': 22493568, 'steps': 117153, 'loss/train': 1.4066046476364136} 11/07/2021 13:41:42 - INFO - __main__ - Step 117155: {'lr': 5.834266990867476e-05, 'samples': 22493760, 'steps': 117154, 'loss/train': 0.6594268679618835} 11/07/2021 13:41:42 - INFO - __main__ - Step 117156: {'lr': 5.8339262544718826e-05, 'samples': 22493952, 'steps': 117155, 'loss/train': 1.529061198234558} 11/07/2021 13:41:42 - INFO - __main__ - Step 117157: {'lr': 5.8335855267122176e-05, 'samples': 22494144, 'steps': 117156, 'loss/train': 1.2580944299697876} 11/07/2021 13:41:43 - INFO - __main__ - Step 117158: {'lr': 5.8332448075886416e-05, 'samples': 22494336, 'steps': 117157, 'loss/train': 1.1315178871154785} 11/07/2021 13:41:43 - INFO - __main__ - Step 117159: {'lr': 5.832904097101313e-05, 'samples': 22494528, 'steps': 117158, 'loss/train': 1.3172234296798706} 11/07/2021 13:41:44 - INFO - __main__ - Step 117160: {'lr': 5.832563395250379e-05, 'samples': 22494720, 'steps': 117159, 'loss/train': 1.5783637762069702} 11/07/2021 13:41:44 - INFO - __main__ - Step 117161: {'lr': 5.832222702036e-05, 'samples': 22494912, 'steps': 117160, 'loss/train': 1.2883747816085815} 11/07/2021 13:41:45 - INFO - __main__ - Step 117162: {'lr': 5.831882017458323e-05, 'samples': 22495104, 'steps': 117161, 'loss/train': 1.4969596862792969} 11/07/2021 13:41:45 - INFO - __main__ - Step 117163: {'lr': 5.8315413415175045e-05, 'samples': 22495296, 'steps': 117162, 'loss/train': 1.6011375188827515} 11/07/2021 13:41:45 - INFO - __main__ - Step 117164: {'lr': 5.8312006742136966e-05, 'samples': 22495488, 'steps': 117163, 'loss/train': 2.019878625869751} 11/07/2021 13:41:47 - INFO - __main__ - Step 117165: {'lr': 5.8308600155470545e-05, 'samples': 22495680, 'steps': 117164, 'loss/train': 1.6993565559387207} 11/07/2021 13:41:47 - INFO - __main__ - Step 117166: {'lr': 5.830519365517731e-05, 'samples': 22495872, 'steps': 117165, 'loss/train': 2.0188519954681396} 11/07/2021 13:41:47 - INFO - __main__ - Step 117167: {'lr': 5.830178724125887e-05, 'samples': 22496064, 'steps': 117166, 'loss/train': 1.4140044450759888} 11/07/2021 13:41:48 - INFO - __main__ - Step 117168: {'lr': 5.829838091371664e-05, 'samples': 22496256, 'steps': 117167, 'loss/train': 1.149349331855774} 11/07/2021 13:41:48 - INFO - __main__ - Step 117169: {'lr': 5.829497467255218e-05, 'samples': 22496448, 'steps': 117168, 'loss/train': 1.3826826810836792} 11/07/2021 13:41:49 - INFO - __main__ - Step 117170: {'lr': 5.829156851776704e-05, 'samples': 22496640, 'steps': 117169, 'loss/train': 0.7325538992881775} 11/07/2021 13:41:49 - INFO - __main__ - Step 117171: {'lr': 5.8288162449362774e-05, 'samples': 22496832, 'steps': 117170, 'loss/train': 1.2681225538253784} 11/07/2021 13:41:50 - INFO - __main__ - Step 117172: {'lr': 5.828475646734088e-05, 'samples': 22497024, 'steps': 117171, 'loss/train': 1.2977147102355957} 11/07/2021 13:41:50 - INFO - __main__ - Step 117173: {'lr': 5.828135057170295e-05, 'samples': 22497216, 'steps': 117172, 'loss/train': 1.1082404851913452} 11/07/2021 13:41:50 - INFO - __main__ - Step 117174: {'lr': 5.827794476245046e-05, 'samples': 22497408, 'steps': 117173, 'loss/train': 1.299723744392395} 11/07/2021 13:41:51 - INFO - __main__ - Step 117175: {'lr': 5.827453903958496e-05, 'samples': 22497600, 'steps': 117174, 'loss/train': 0.7589747309684753} 11/07/2021 13:41:52 - INFO - __main__ - Step 117176: {'lr': 5.827113340310802e-05, 'samples': 22497792, 'steps': 117175, 'loss/train': 1.3268452882766724} 11/07/2021 13:41:52 - INFO - __main__ - Step 117177: {'lr': 5.826772785302114e-05, 'samples': 22497984, 'steps': 117176, 'loss/train': 1.5187389850616455} 11/07/2021 13:41:53 - INFO - __main__ - Step 117178: {'lr': 5.826432238932594e-05, 'samples': 22498176, 'steps': 117177, 'loss/train': 1.0719338655471802} 11/07/2021 13:41:53 - INFO - __main__ - Step 117179: {'lr': 5.82609170120238e-05, 'samples': 22498368, 'steps': 117178, 'loss/train': 1.502893090248108} 11/07/2021 13:41:54 - INFO - __main__ - Step 117180: {'lr': 5.825751172111635e-05, 'samples': 22498560, 'steps': 117179, 'loss/train': 0.6454712748527527} 11/07/2021 13:41:54 - INFO - __main__ - Step 117181: {'lr': 5.825410651660507e-05, 'samples': 22498752, 'steps': 117180, 'loss/train': 1.6573785543441772} 11/07/2021 13:41:55 - INFO - __main__ - Step 117182: {'lr': 5.825070139849156e-05, 'samples': 22498944, 'steps': 117181, 'loss/train': 1.3084787130355835} 11/07/2021 13:41:55 - INFO - __main__ - Step 117183: {'lr': 5.824729636677731e-05, 'samples': 22499136, 'steps': 117182, 'loss/train': 0.9857640862464905} 11/07/2021 13:41:55 - INFO - __main__ - Step 117184: {'lr': 5.8243891421463884e-05, 'samples': 22499328, 'steps': 117183, 'loss/train': 1.4249324798583984} 11/07/2021 13:41:56 - INFO - __main__ - Step 117185: {'lr': 5.824048656255279e-05, 'samples': 22499520, 'steps': 117184, 'loss/train': 1.5140706300735474} 11/07/2021 13:41:57 - INFO - __main__ - Step 117186: {'lr': 5.823708179004558e-05, 'samples': 22499712, 'steps': 117185, 'loss/train': 1.3823692798614502} 11/07/2021 13:41:57 - INFO - __main__ - Step 117187: {'lr': 5.8233677103943785e-05, 'samples': 22499904, 'steps': 117186, 'loss/train': 1.4706878662109375} 11/07/2021 13:41:57 - INFO - __main__ - Step 117188: {'lr': 5.823027250424892e-05, 'samples': 22500096, 'steps': 117187, 'loss/train': 1.4512436389923096} 11/07/2021 13:41:58 - INFO - __main__ - Step 117189: {'lr': 5.8226867990962554e-05, 'samples': 22500288, 'steps': 117188, 'loss/train': 0.547772228717804} 11/07/2021 13:41:58 - INFO - __main__ - Step 117190: {'lr': 5.8223463564086255e-05, 'samples': 22500480, 'steps': 117189, 'loss/train': 1.1067458391189575} 11/07/2021 13:41:59 - INFO - __main__ - Step 117191: {'lr': 5.8220059223621444e-05, 'samples': 22500672, 'steps': 117190, 'loss/train': 1.0431678295135498} 11/07/2021 13:42:00 - INFO - __main__ - Step 117192: {'lr': 5.821665496956971e-05, 'samples': 22500864, 'steps': 117191, 'loss/train': 0.9956937432289124} 11/07/2021 13:42:00 - INFO - __main__ - Step 117193: {'lr': 5.82132508019326e-05, 'samples': 22501056, 'steps': 117192, 'loss/train': 0.9082359075546265} 11/07/2021 13:42:00 - INFO - __main__ - Step 117194: {'lr': 5.820984672071161e-05, 'samples': 22501248, 'steps': 117193, 'loss/train': 2.0013880729675293} 11/07/2021 13:42:01 - INFO - __main__ - Step 117195: {'lr': 5.8206442725908334e-05, 'samples': 22501440, 'steps': 117194, 'loss/train': 0.5322666168212891} 11/07/2021 13:42:02 - INFO - __main__ - Step 117196: {'lr': 5.820303881752429e-05, 'samples': 22501632, 'steps': 117195, 'loss/train': 1.284137487411499} 11/07/2021 13:42:02 - INFO - __main__ - Step 117197: {'lr': 5.819963499556097e-05, 'samples': 22501824, 'steps': 117196, 'loss/train': 1.6511754989624023} 11/07/2021 13:42:03 - INFO - __main__ - Step 117198: {'lr': 5.819623126001994e-05, 'samples': 22502016, 'steps': 117197, 'loss/train': 1.025836706161499} 11/07/2021 13:42:03 - INFO - __main__ - Step 117199: {'lr': 5.819282761090272e-05, 'samples': 22502208, 'steps': 117198, 'loss/train': 1.2918912172317505} 11/07/2021 13:42:03 - INFO - __main__ - Step 117200: {'lr': 5.8189424048210874e-05, 'samples': 22502400, 'steps': 117199, 'loss/train': 1.2436699867248535} 11/07/2021 13:42:04 - INFO - __main__ - Step 117201: {'lr': 5.818602057194589e-05, 'samples': 22502592, 'steps': 117200, 'loss/train': 1.5253636837005615} 11/07/2021 13:42:05 - INFO - __main__ - Step 117202: {'lr': 5.818261718210935e-05, 'samples': 22502784, 'steps': 117201, 'loss/train': 1.0821279287338257} 11/07/2021 13:42:05 - INFO - __main__ - Step 117203: {'lr': 5.8179213878702814e-05, 'samples': 22502976, 'steps': 117202, 'loss/train': 0.6177984476089478} 11/07/2021 13:42:05 - INFO - __main__ - Step 117204: {'lr': 5.81758106617277e-05, 'samples': 22503168, 'steps': 117203, 'loss/train': 0.5528703331947327} 11/07/2021 13:42:06 - INFO - __main__ - Step 117205: {'lr': 5.817240753118561e-05, 'samples': 22503360, 'steps': 117204, 'loss/train': 1.3129453659057617} 11/07/2021 13:42:07 - INFO - __main__ - Step 117206: {'lr': 5.816900448707807e-05, 'samples': 22503552, 'steps': 117205, 'loss/train': 1.0349857807159424} 11/07/2021 13:42:07 - INFO - __main__ - Step 117207: {'lr': 5.816560152940662e-05, 'samples': 22503744, 'steps': 117206, 'loss/train': 1.3797667026519775} 11/07/2021 13:42:07 - INFO - __main__ - Step 117208: {'lr': 5.816219865817277e-05, 'samples': 22503936, 'steps': 117207, 'loss/train': 0.9183844923973083} 11/07/2021 13:42:08 - INFO - __main__ - Step 117209: {'lr': 5.815879587337808e-05, 'samples': 22504128, 'steps': 117208, 'loss/train': 1.2849280834197998} 11/07/2021 13:42:08 - INFO - __main__ - Step 117210: {'lr': 5.815539317502408e-05, 'samples': 22504320, 'steps': 117209, 'loss/train': 1.4732941389083862} 11/07/2021 13:42:10 - INFO - __main__ - Step 117211: {'lr': 5.815199056311232e-05, 'samples': 22504512, 'steps': 117210, 'loss/train': 1.5282280445098877} 11/07/2021 13:42:10 - INFO - __main__ - Step 117212: {'lr': 5.814858803764428e-05, 'samples': 22504704, 'steps': 117211, 'loss/train': 1.2470624446868896} 11/07/2021 13:42:10 - INFO - __main__ - Step 117213: {'lr': 5.814518559862156e-05, 'samples': 22504896, 'steps': 117212, 'loss/train': 0.32209092378616333} 11/07/2021 13:42:11 - INFO - __main__ - Step 117214: {'lr': 5.814178324604563e-05, 'samples': 22505088, 'steps': 117213, 'loss/train': 1.5837119817733765} 11/07/2021 13:42:11 - INFO - __main__ - Step 117215: {'lr': 5.8138380979918055e-05, 'samples': 22505280, 'steps': 117214, 'loss/train': 1.5925626754760742} 11/07/2021 13:42:11 - INFO - __main__ - Step 117216: {'lr': 5.813497880024046e-05, 'samples': 22505472, 'steps': 117215, 'loss/train': 1.3692570924758911} 11/07/2021 13:42:12 - INFO - __main__ - Step 117217: {'lr': 5.813157670701419e-05, 'samples': 22505664, 'steps': 117216, 'loss/train': 0.7432905435562134} 11/07/2021 13:42:13 - INFO - __main__ - Step 117218: {'lr': 5.812817470024087e-05, 'samples': 22505856, 'steps': 117217, 'loss/train': 1.3546732664108276} 11/07/2021 13:42:13 - INFO - __main__ - Step 117219: {'lr': 5.812477277992204e-05, 'samples': 22506048, 'steps': 117218, 'loss/train': 0.7620497345924377} 11/07/2021 13:42:13 - INFO - __main__ - Step 117220: {'lr': 5.812137094605924e-05, 'samples': 22506240, 'steps': 117219, 'loss/train': 1.344664216041565} 11/07/2021 13:42:14 - INFO - __main__ - Step 117221: {'lr': 5.8117969198653977e-05, 'samples': 22506432, 'steps': 117220, 'loss/train': 0.9210915565490723} 11/07/2021 13:42:15 - INFO - __main__ - Step 117222: {'lr': 5.8114567537707774e-05, 'samples': 22506624, 'steps': 117221, 'loss/train': 1.2035903930664062} 11/07/2021 13:42:15 - INFO - __main__ - Step 117223: {'lr': 5.8111165963222216e-05, 'samples': 22506816, 'steps': 117222, 'loss/train': 1.058464765548706} 11/07/2021 13:42:15 - INFO - __main__ - Step 117224: {'lr': 5.810776447519881e-05, 'samples': 22507008, 'steps': 117223, 'loss/train': 0.828029990196228} 11/07/2021 13:42:16 - INFO - __main__ - Step 117225: {'lr': 5.8104363073639063e-05, 'samples': 22507200, 'steps': 117224, 'loss/train': 1.4406144618988037} 11/07/2021 13:42:16 - INFO - __main__ - Step 117226: {'lr': 5.810096175854454e-05, 'samples': 22507392, 'steps': 117225, 'loss/train': 1.1716716289520264} 11/07/2021 13:42:17 - INFO - __main__ - Step 117227: {'lr': 5.809756052991674e-05, 'samples': 22507584, 'steps': 117226, 'loss/train': 1.3927550315856934} 11/07/2021 13:42:18 - INFO - __main__ - Step 117228: {'lr': 5.809415938775725e-05, 'samples': 22507776, 'steps': 117227, 'loss/train': 1.372605562210083} 11/07/2021 13:42:18 - INFO - __main__ - Step 117229: {'lr': 5.809075833206756e-05, 'samples': 22507968, 'steps': 117228, 'loss/train': 1.5280259847640991} 11/07/2021 13:42:18 - INFO - __main__ - Step 117230: {'lr': 5.808735736284929e-05, 'samples': 22508160, 'steps': 117229, 'loss/train': 1.3050005435943604} 11/07/2021 13:42:19 - INFO - __main__ - Step 117231: {'lr': 5.808395648010381e-05, 'samples': 22508352, 'steps': 117230, 'loss/train': 1.3981447219848633} 11/07/2021 13:42:20 - INFO - __main__ - Step 117232: {'lr': 5.808055568383275e-05, 'samples': 22508544, 'steps': 117231, 'loss/train': 0.9786062240600586} 11/07/2021 13:42:20 - INFO - __main__ - Step 117233: {'lr': 5.8077154974037624e-05, 'samples': 22508736, 'steps': 117232, 'loss/train': 1.583125352859497} 11/07/2021 13:42:20 - INFO - __main__ - Step 117234: {'lr': 5.807375435071996e-05, 'samples': 22508928, 'steps': 117233, 'loss/train': 1.5713499784469604} 11/07/2021 13:42:21 - INFO - __main__ - Step 117235: {'lr': 5.8070353813881315e-05, 'samples': 22509120, 'steps': 117234, 'loss/train': 0.3704763650894165} 11/07/2021 13:42:21 - INFO - __main__ - Step 117236: {'lr': 5.8066953363523186e-05, 'samples': 22509312, 'steps': 117235, 'loss/train': 1.3842664957046509} 11/07/2021 13:42:22 - INFO - __main__ - Step 117237: {'lr': 5.806355299964716e-05, 'samples': 22509504, 'steps': 117236, 'loss/train': 1.7499178647994995} 11/07/2021 13:42:23 - INFO - __main__ - Step 117238: {'lr': 5.806015272225471e-05, 'samples': 22509696, 'steps': 117237, 'loss/train': 1.623698115348816} 11/07/2021 13:42:23 - INFO - __main__ - Step 117239: {'lr': 5.80567525313474e-05, 'samples': 22509888, 'steps': 117238, 'loss/train': 1.2860561609268188} 11/07/2021 13:42:23 - INFO - __main__ - Step 117240: {'lr': 5.805335242692675e-05, 'samples': 22510080, 'steps': 117239, 'loss/train': 1.6248279809951782} 11/07/2021 13:42:24 - INFO - __main__ - Step 117241: {'lr': 5.80499524089943e-05, 'samples': 22510272, 'steps': 117240, 'loss/train': 1.3543052673339844} 11/07/2021 13:42:24 - INFO - __main__ - Step 117242: {'lr': 5.804655247755158e-05, 'samples': 22510464, 'steps': 117241, 'loss/train': 0.9831050038337708} 11/07/2021 13:42:25 - INFO - __main__ - Step 117243: {'lr': 5.804315263260021e-05, 'samples': 22510656, 'steps': 117242, 'loss/train': 1.4594014883041382} 11/07/2021 13:42:26 - INFO - __main__ - Step 117244: {'lr': 5.803975287414154e-05, 'samples': 22510848, 'steps': 117243, 'loss/train': 0.9312843084335327} 11/07/2021 13:42:26 - INFO - __main__ - Step 117245: {'lr': 5.803635320217721e-05, 'samples': 22511040, 'steps': 117244, 'loss/train': 1.52717924118042} 11/07/2021 13:42:26 - INFO - __main__ - Step 117246: {'lr': 5.803295361670874e-05, 'samples': 22511232, 'steps': 117245, 'loss/train': 1.4199930429458618} 11/07/2021 13:42:27 - INFO - __main__ - Step 117247: {'lr': 5.802955411773764e-05, 'samples': 22511424, 'steps': 117246, 'loss/train': 1.5569767951965332} 11/07/2021 13:42:28 - INFO - __main__ - Step 117248: {'lr': 5.802615470526548e-05, 'samples': 22511616, 'steps': 117247, 'loss/train': 1.3352360725402832} 11/07/2021 13:42:28 - INFO - __main__ - Step 117249: {'lr': 5.8022755379293744e-05, 'samples': 22511808, 'steps': 117248, 'loss/train': 1.2237616777420044} 11/07/2021 13:42:28 - INFO - __main__ - Step 117250: {'lr': 5.801935613982403e-05, 'samples': 22512000, 'steps': 117249, 'loss/train': 0.7396135330200195} 11/07/2021 13:42:29 - INFO - __main__ - Step 117251: {'lr': 5.80159569868578e-05, 'samples': 22512192, 'steps': 117250, 'loss/train': 0.862813413143158} 11/07/2021 13:42:29 - INFO - __main__ - Step 117252: {'lr': 5.801255792039664e-05, 'samples': 22512384, 'steps': 117251, 'loss/train': 1.5499317646026611} 11/07/2021 13:42:29 - INFO - __main__ - Step 117253: {'lr': 5.800915894044204e-05, 'samples': 22512576, 'steps': 117252, 'loss/train': 1.2585655450820923} 11/07/2021 13:42:30 - INFO - __main__ - Step 117254: {'lr': 5.8005760046995567e-05, 'samples': 22512768, 'steps': 117253, 'loss/train': 1.376974105834961} 11/07/2021 13:42:31 - INFO - __main__ - Step 117255: {'lr': 5.8002361240058724e-05, 'samples': 22512960, 'steps': 117254, 'loss/train': 0.9036844372749329} 11/07/2021 13:42:31 - INFO - __main__ - Step 117256: {'lr': 5.799896251963305e-05, 'samples': 22513152, 'steps': 117255, 'loss/train': 1.0558983087539673} 11/07/2021 13:42:31 - INFO - __main__ - Step 117257: {'lr': 5.7995563885720164e-05, 'samples': 22513344, 'steps': 117256, 'loss/train': 1.5348354578018188} 11/07/2021 13:42:32 - INFO - __main__ - Step 117258: {'lr': 5.799216533832144e-05, 'samples': 22513536, 'steps': 117257, 'loss/train': 0.7269257307052612} 11/07/2021 13:42:33 - INFO - __main__ - Step 117259: {'lr': 5.798876687743848e-05, 'samples': 22513728, 'steps': 117258, 'loss/train': 1.247428297996521} 11/07/2021 13:42:33 - INFO - __main__ - Step 117260: {'lr': 5.798536850307282e-05, 'samples': 22513920, 'steps': 117259, 'loss/train': 1.221897840499878} 11/07/2021 13:42:34 - INFO - __main__ - Step 117261: {'lr': 5.7981970215225996e-05, 'samples': 22514112, 'steps': 117260, 'loss/train': 1.3653217554092407} 11/07/2021 13:42:34 - INFO - __main__ - Step 117262: {'lr': 5.797857201389953e-05, 'samples': 22514304, 'steps': 117261, 'loss/train': 1.3936638832092285} 11/07/2021 13:42:34 - INFO - __main__ - Step 117263: {'lr': 5.797517389909496e-05, 'samples': 22514496, 'steps': 117262, 'loss/train': 1.269303321838379} 11/07/2021 13:42:36 - INFO - __main__ - Step 117264: {'lr': 5.7971775870813815e-05, 'samples': 22514688, 'steps': 117263, 'loss/train': 1.0532394647598267} 11/07/2021 13:42:36 - INFO - __main__ - Step 117265: {'lr': 5.7968377929057594e-05, 'samples': 22514880, 'steps': 117264, 'loss/train': 1.1858482360839844} 11/07/2021 13:42:37 - INFO - __main__ - Step 117266: {'lr': 5.796498007382789e-05, 'samples': 22515072, 'steps': 117265, 'loss/train': 1.5215179920196533} 11/07/2021 13:42:37 - INFO - __main__ - Step 117267: {'lr': 5.796158230512621e-05, 'samples': 22515264, 'steps': 117266, 'loss/train': 1.5005570650100708} 11/07/2021 13:42:37 - INFO - __main__ - Step 117268: {'lr': 5.795818462295405e-05, 'samples': 22515456, 'steps': 117267, 'loss/train': 1.5537015199661255} 11/07/2021 13:42:38 - INFO - __main__ - Step 117269: {'lr': 5.795478702731299e-05, 'samples': 22515648, 'steps': 117268, 'loss/train': 1.540580153465271} 11/07/2021 13:42:39 - INFO - __main__ - Step 117270: {'lr': 5.795138951820461e-05, 'samples': 22515840, 'steps': 117269, 'loss/train': 1.265372395515442} 11/07/2021 13:42:39 - INFO - __main__ - Step 117271: {'lr': 5.7947992095630284e-05, 'samples': 22516032, 'steps': 117270, 'loss/train': 1.2619684934616089} 11/07/2021 13:42:39 - INFO - __main__ - Step 117272: {'lr': 5.7944594759591626e-05, 'samples': 22516224, 'steps': 117271, 'loss/train': 1.4209345579147339} 11/07/2021 13:42:40 - INFO - __main__ - Step 117273: {'lr': 5.794119751009019e-05, 'samples': 22516416, 'steps': 117272, 'loss/train': 0.23174722492694855} 11/07/2021 13:42:40 - INFO - __main__ - Step 117274: {'lr': 5.793780034712748e-05, 'samples': 22516608, 'steps': 117273, 'loss/train': 1.3056368827819824} 11/07/2021 13:42:41 - INFO - __main__ - Step 117275: {'lr': 5.7934403270705035e-05, 'samples': 22516800, 'steps': 117274, 'loss/train': 1.5079513788223267} 11/07/2021 13:42:41 - INFO - __main__ - Step 117276: {'lr': 5.793100628082437e-05, 'samples': 22516992, 'steps': 117275, 'loss/train': 0.9084156155586243} 11/07/2021 13:42:42 - INFO - __main__ - Step 117277: {'lr': 5.792760937748703e-05, 'samples': 22517184, 'steps': 117276, 'loss/train': 1.2254854440689087} 11/07/2021 13:42:42 - INFO - __main__ - Step 117278: {'lr': 5.792421256069458e-05, 'samples': 22517376, 'steps': 117277, 'loss/train': 1.2773768901824951} 11/07/2021 13:42:42 - INFO - __main__ - Step 117279: {'lr': 5.792081583044847e-05, 'samples': 22517568, 'steps': 117278, 'loss/train': 1.347213625907898} 11/07/2021 13:42:44 - INFO - __main__ - Step 117280: {'lr': 5.791741918675031e-05, 'samples': 22517760, 'steps': 117279, 'loss/train': 1.508521556854248} 11/07/2021 13:42:44 - INFO - __main__ - Step 117281: {'lr': 5.791402262960158e-05, 'samples': 22517952, 'steps': 117280, 'loss/train': 1.5524168014526367} 11/07/2021 13:42:44 - INFO - __main__ - Step 117282: {'lr': 5.791062615900383e-05, 'samples': 22518144, 'steps': 117281, 'loss/train': 1.65780508518219} 11/07/2021 13:42:45 - INFO - __main__ - Step 117283: {'lr': 5.79072297749586e-05, 'samples': 22518336, 'steps': 117282, 'loss/train': 0.6654042601585388} 11/07/2021 13:42:45 - INFO - __main__ - Step 117284: {'lr': 5.7903833477467475e-05, 'samples': 22518528, 'steps': 117283, 'loss/train': 1.2769007682800293} 11/07/2021 13:42:45 - INFO - __main__ - Step 117285: {'lr': 5.7900437266531826e-05, 'samples': 22518720, 'steps': 117284, 'loss/train': 1.2457853555679321} 11/07/2021 13:42:46 - INFO - __main__ - Step 117286: {'lr': 5.78970411421533e-05, 'samples': 22518912, 'steps': 117285, 'loss/train': 1.640753984451294} 11/07/2021 13:42:47 - INFO - __main__ - Step 117287: {'lr': 5.78936451043334e-05, 'samples': 22519104, 'steps': 117286, 'loss/train': 1.2856988906860352} 11/07/2021 13:42:47 - INFO - __main__ - Step 117288: {'lr': 5.7890249153073644e-05, 'samples': 22519296, 'steps': 117287, 'loss/train': 1.065885305404663} 11/07/2021 13:42:47 - INFO - __main__ - Step 117289: {'lr': 5.7886853288375594e-05, 'samples': 22519488, 'steps': 117288, 'loss/train': 1.417258381843567} 11/07/2021 13:42:48 - INFO - __main__ - Step 117290: {'lr': 5.788345751024074e-05, 'samples': 22519680, 'steps': 117289, 'loss/train': 1.0712124109268188} 11/07/2021 13:42:49 - INFO - __main__ - Step 117291: {'lr': 5.788006181867064e-05, 'samples': 22519872, 'steps': 117290, 'loss/train': 1.3261222839355469} 11/07/2021 13:42:50 - INFO - __main__ - Step 117292: {'lr': 5.7876666213666854e-05, 'samples': 22520064, 'steps': 117291, 'loss/train': 1.3677945137023926} 11/07/2021 13:42:50 - INFO - __main__ - Step 117293: {'lr': 5.787327069523085e-05, 'samples': 22520256, 'steps': 117292, 'loss/train': 1.2864881753921509} 11/07/2021 13:42:50 - INFO - __main__ - Step 117294: {'lr': 5.786987526336418e-05, 'samples': 22520448, 'steps': 117293, 'loss/train': 1.3720675706863403} 11/07/2021 13:42:51 - INFO - __main__ - Step 117295: {'lr': 5.78664799180684e-05, 'samples': 22520640, 'steps': 117294, 'loss/train': 1.456824541091919} 11/07/2021 13:42:52 - INFO - __main__ - Step 117296: {'lr': 5.7863084659345e-05, 'samples': 22520832, 'steps': 117295, 'loss/train': 0.6477102637290955} 11/07/2021 13:42:52 - INFO - __main__ - Step 117297: {'lr': 5.785968948719561e-05, 'samples': 22521024, 'steps': 117296, 'loss/train': 1.540724515914917} 11/07/2021 13:42:52 - INFO - __main__ - Step 117298: {'lr': 5.78562944016216e-05, 'samples': 22521216, 'steps': 117297, 'loss/train': 1.1161761283874512} 11/07/2021 13:42:53 - INFO - __main__ - Step 117299: {'lr': 5.785289940262459e-05, 'samples': 22521408, 'steps': 117298, 'loss/train': 1.5828361511230469} 11/07/2021 13:42:53 - INFO - __main__ - Step 117300: {'lr': 5.7849504490206095e-05, 'samples': 22521600, 'steps': 117299, 'loss/train': 1.1416935920715332} 11/07/2021 13:42:54 - INFO - __main__ - Step 117301: {'lr': 5.784610966436765e-05, 'samples': 22521792, 'steps': 117300, 'loss/train': 0.9185302257537842} 11/07/2021 13:42:55 - INFO - __main__ - Step 117302: {'lr': 5.7842714925110783e-05, 'samples': 22521984, 'steps': 117301, 'loss/train': 1.2479064464569092} 11/07/2021 13:42:55 - INFO - __main__ - Step 117303: {'lr': 5.7839320272437016e-05, 'samples': 22522176, 'steps': 117302, 'loss/train': 1.385848045349121} 11/07/2021 13:42:55 - INFO - __main__ - Step 117304: {'lr': 5.783592570634788e-05, 'samples': 22522368, 'steps': 117303, 'loss/train': 1.0549968481063843} 11/07/2021 13:42:56 - INFO - __main__ - Step 117305: {'lr': 5.783253122684493e-05, 'samples': 22522560, 'steps': 117304, 'loss/train': 0.9741055965423584} 11/07/2021 13:42:56 - INFO - __main__ - Step 117306: {'lr': 5.7829136833929676e-05, 'samples': 22522752, 'steps': 117305, 'loss/train': 1.5552129745483398} 11/07/2021 13:42:57 - INFO - __main__ - Step 117307: {'lr': 5.782574252760364e-05, 'samples': 22522944, 'steps': 117306, 'loss/train': 1.9102319478988647} 11/07/2021 13:42:57 - INFO - __main__ - Step 117308: {'lr': 5.7822348307868336e-05, 'samples': 22523136, 'steps': 117307, 'loss/train': 1.1962746381759644} 11/07/2021 13:42:58 - INFO - __main__ - Step 117309: {'lr': 5.781895417472535e-05, 'samples': 22523328, 'steps': 117308, 'loss/train': 1.2631081342697144} 11/07/2021 13:42:58 - INFO - __main__ - Step 117310: {'lr': 5.781556012817618e-05, 'samples': 22523520, 'steps': 117309, 'loss/train': 0.9798382520675659} 11/07/2021 13:42:58 - INFO - __main__ - Step 117311: {'lr': 5.7812166168222406e-05, 'samples': 22523712, 'steps': 117310, 'loss/train': 1.1961009502410889} 11/07/2021 13:42:59 - INFO - __main__ - Step 117312: {'lr': 5.780877229486542e-05, 'samples': 22523904, 'steps': 117311, 'loss/train': 0.9985366463661194} 11/07/2021 13:43:00 - INFO - __main__ - Step 117313: {'lr': 5.7805378508106856e-05, 'samples': 22524096, 'steps': 117312, 'loss/train': 1.0980674028396606} 11/07/2021 13:43:00 - INFO - __main__ - Step 117314: {'lr': 5.780198480794824e-05, 'samples': 22524288, 'steps': 117313, 'loss/train': 1.094273567199707} 11/07/2021 13:43:00 - INFO - __main__ - Step 117315: {'lr': 5.779859119439104e-05, 'samples': 22524480, 'steps': 117314, 'loss/train': 1.3301219940185547} 11/07/2021 13:43:01 - INFO - __main__ - Step 117316: {'lr': 5.779519766743688e-05, 'samples': 22524672, 'steps': 117315, 'loss/train': 1.4927315711975098} 11/07/2021 13:43:02 - INFO - __main__ - Step 117317: {'lr': 5.7791804227087184e-05, 'samples': 22524864, 'steps': 117316, 'loss/train': 1.2473764419555664} 11/07/2021 13:43:02 - INFO - __main__ - Step 117318: {'lr': 5.778841087334358e-05, 'samples': 22525056, 'steps': 117317, 'loss/train': 1.1922686100006104} 11/07/2021 13:43:03 - INFO - __main__ - Step 117319: {'lr': 5.7785017606207524e-05, 'samples': 22525248, 'steps': 117318, 'loss/train': 1.521957278251648} 11/07/2021 13:43:03 - INFO - __main__ - Step 117320: {'lr': 5.7781624425680577e-05, 'samples': 22525440, 'steps': 117319, 'loss/train': 1.3700135946273804} 11/07/2021 13:43:03 - INFO - __main__ - Step 117321: {'lr': 5.777823133176427e-05, 'samples': 22525632, 'steps': 117320, 'loss/train': 1.4320770502090454} 11/07/2021 13:43:05 - INFO - __main__ - Step 117322: {'lr': 5.777483832446012e-05, 'samples': 22525824, 'steps': 117321, 'loss/train': 1.4175621271133423} 11/07/2021 13:43:05 - INFO - __main__ - Step 117323: {'lr': 5.777144540376969e-05, 'samples': 22526016, 'steps': 117322, 'loss/train': 0.9860479235649109} 11/07/2021 13:43:05 - INFO - __main__ - Step 117324: {'lr': 5.776805256969453e-05, 'samples': 22526208, 'steps': 117323, 'loss/train': 1.5316689014434814} 11/07/2021 13:43:06 - INFO - __main__ - Step 117325: {'lr': 5.7764659822236024e-05, 'samples': 22526400, 'steps': 117324, 'loss/train': 0.23761115968227386} 11/07/2021 13:43:06 - INFO - __main__ - Step 117326: {'lr': 5.776126716139582e-05, 'samples': 22526592, 'steps': 117325, 'loss/train': 1.1972826719284058} 11/07/2021 13:43:07 - INFO - __main__ - Step 117327: {'lr': 5.7757874587175437e-05, 'samples': 22526784, 'steps': 117326, 'loss/train': 0.7915644645690918} 11/07/2021 13:43:08 - INFO - __main__ - Step 117328: {'lr': 5.7754482099576375e-05, 'samples': 22526976, 'steps': 117327, 'loss/train': 1.8554383516311646} 11/07/2021 13:43:08 - INFO - __main__ - Step 117329: {'lr': 5.7751089698600154e-05, 'samples': 22527168, 'steps': 117328, 'loss/train': 1.7218605279922485} 11/07/2021 13:43:08 - INFO - __main__ - Step 117330: {'lr': 5.774769738424837e-05, 'samples': 22527360, 'steps': 117329, 'loss/train': 1.5888124704360962} 11/07/2021 13:43:09 - INFO - __main__ - Step 117331: {'lr': 5.774430515652246e-05, 'samples': 22527552, 'steps': 117330, 'loss/train': 1.5375386476516724} 11/07/2021 13:43:10 - INFO - __main__ - Step 117332: {'lr': 5.7740913015424026e-05, 'samples': 22527744, 'steps': 117331, 'loss/train': 0.3745051920413971} 11/07/2021 13:43:10 - INFO - __main__ - Step 117333: {'lr': 5.7737520960954585e-05, 'samples': 22527936, 'steps': 117332, 'loss/train': 1.288177728652954} 11/07/2021 13:43:10 - INFO - __main__ - Step 117334: {'lr': 5.7734128993115614e-05, 'samples': 22528128, 'steps': 117333, 'loss/train': 1.3156489133834839} 11/07/2021 13:43:11 - INFO - __main__ - Step 117335: {'lr': 5.773073711190871e-05, 'samples': 22528320, 'steps': 117334, 'loss/train': 1.0728943347930908} 11/07/2021 13:43:11 - INFO - __main__ - Step 117336: {'lr': 5.772734531733534e-05, 'samples': 22528512, 'steps': 117335, 'loss/train': 1.2569804191589355} 11/07/2021 13:43:12 - INFO - __main__ - Step 117337: {'lr': 5.7723953609397166e-05, 'samples': 22528704, 'steps': 117336, 'loss/train': 1.4902116060256958} 11/07/2021 13:43:12 - INFO - __main__ - Step 117338: {'lr': 5.77205619880955e-05, 'samples': 22528896, 'steps': 117337, 'loss/train': 1.188405156135559} 11/07/2021 13:43:13 - INFO - __main__ - Step 117339: {'lr': 5.7717170453432e-05, 'samples': 22529088, 'steps': 117338, 'loss/train': 1.3168011903762817} 11/07/2021 13:43:13 - INFO - __main__ - Step 117340: {'lr': 5.7713779005408196e-05, 'samples': 22529280, 'steps': 117339, 'loss/train': 1.4980255365371704} 11/07/2021 13:43:13 - INFO - __main__ - Step 117341: {'lr': 5.7710387644025584e-05, 'samples': 22529472, 'steps': 117340, 'loss/train': 1.5922586917877197} 11/07/2021 13:43:14 - INFO - __main__ - Step 117342: {'lr': 5.7706996369285695e-05, 'samples': 22529664, 'steps': 117341, 'loss/train': 0.897852897644043} 11/07/2021 13:43:15 - INFO - __main__ - Step 117343: {'lr': 5.770360518119005e-05, 'samples': 22529856, 'steps': 117342, 'loss/train': 0.875709056854248} 11/07/2021 13:43:15 - INFO - __main__ - Step 117344: {'lr': 5.7700214079740216e-05, 'samples': 22530048, 'steps': 117343, 'loss/train': 1.3454629182815552} 11/07/2021 13:43:16 - INFO - __main__ - Step 117345: {'lr': 5.76968230649377e-05, 'samples': 22530240, 'steps': 117344, 'loss/train': 1.3317577838897705} 11/07/2021 13:43:16 - INFO - __main__ - Step 117346: {'lr': 5.7693432136784017e-05, 'samples': 22530432, 'steps': 117345, 'loss/train': 1.1357197761535645} 11/07/2021 13:43:16 - INFO - __main__ - Step 117347: {'lr': 5.769004129528072e-05, 'samples': 22530624, 'steps': 117346, 'loss/train': 0.9569864869117737} 11/07/2021 13:43:17 - INFO - __main__ - Step 117348: {'lr': 5.768665054042932e-05, 'samples': 22530816, 'steps': 117347, 'loss/train': 1.3135641813278198} 11/07/2021 13:43:18 - INFO - __main__ - Step 117349: {'lr': 5.7683259872231356e-05, 'samples': 22531008, 'steps': 117348, 'loss/train': 1.3172322511672974} 11/07/2021 13:43:18 - INFO - __main__ - Step 117350: {'lr': 5.767986929068833e-05, 'samples': 22531200, 'steps': 117349, 'loss/train': 1.293400764465332} 11/07/2021 13:43:19 - INFO - __main__ - Step 117351: {'lr': 5.767647879580184e-05, 'samples': 22531392, 'steps': 117350, 'loss/train': 4.474649906158447} 11/07/2021 13:43:19 - INFO - __main__ - Step 117352: {'lr': 5.767308838757332e-05, 'samples': 22531584, 'steps': 117351, 'loss/train': 1.1647807359695435} 11/07/2021 13:43:20 - INFO - __main__ - Step 117353: {'lr': 5.766969806600433e-05, 'samples': 22531776, 'steps': 117352, 'loss/train': 1.5081647634506226} 11/07/2021 13:43:20 - INFO - __main__ - Step 117354: {'lr': 5.7666307831096415e-05, 'samples': 22531968, 'steps': 117353, 'loss/train': 1.5838338136672974} 11/07/2021 13:43:21 - INFO - __main__ - Step 117355: {'lr': 5.766291768285109e-05, 'samples': 22532160, 'steps': 117354, 'loss/train': 1.121017336845398} 11/07/2021 13:43:21 - INFO - __main__ - Step 117356: {'lr': 5.765952762126989e-05, 'samples': 22532352, 'steps': 117355, 'loss/train': 1.253415822982788} 11/07/2021 13:43:21 - INFO - __main__ - Step 117357: {'lr': 5.765613764635433e-05, 'samples': 22532544, 'steps': 117356, 'loss/train': 1.113394021987915} 11/07/2021 13:43:22 - INFO - __main__ - Step 117358: {'lr': 5.7652747758105946e-05, 'samples': 22532736, 'steps': 117357, 'loss/train': 1.1282973289489746} 11/07/2021 13:43:23 - INFO - __main__ - Step 117359: {'lr': 5.764935795652626e-05, 'samples': 22532928, 'steps': 117358, 'loss/train': 1.5796326398849487} 11/07/2021 13:43:23 - INFO - __main__ - Step 117360: {'lr': 5.764596824161683e-05, 'samples': 22533120, 'steps': 117359, 'loss/train': 1.1918309926986694} 11/07/2021 13:43:23 - INFO - __main__ - Step 117361: {'lr': 5.764257861337913e-05, 'samples': 22533312, 'steps': 117360, 'loss/train': 1.4284194707870483} 11/07/2021 13:43:24 - INFO - __main__ - Step 117362: {'lr': 5.763918907181473e-05, 'samples': 22533504, 'steps': 117361, 'loss/train': 1.1603689193725586} 11/07/2021 13:43:24 - INFO - __main__ - Step 117363: {'lr': 5.7635799616925136e-05, 'samples': 22533696, 'steps': 117362, 'loss/train': 1.5568993091583252} 11/07/2021 13:43:25 - INFO - __main__ - Step 117364: {'lr': 5.763241024871196e-05, 'samples': 22533888, 'steps': 117363, 'loss/train': 1.343454360961914} 11/07/2021 13:43:25 - INFO - __main__ - Step 117365: {'lr': 5.762902096717656e-05, 'samples': 22534080, 'steps': 117364, 'loss/train': 1.193359136581421} 11/07/2021 13:43:26 - INFO - __main__ - Step 117366: {'lr': 5.762563177232058e-05, 'samples': 22534272, 'steps': 117365, 'loss/train': 0.8888168931007385} 11/07/2021 13:43:26 - INFO - __main__ - Step 117367: {'lr': 5.762224266414554e-05, 'samples': 22534464, 'steps': 117366, 'loss/train': 1.1211416721343994} 11/07/2021 13:43:27 - INFO - __main__ - Step 117368: {'lr': 5.76188536426529e-05, 'samples': 22534656, 'steps': 117367, 'loss/train': 1.4122530221939087} 11/07/2021 13:43:28 - INFO - __main__ - Step 117369: {'lr': 5.7615464707844264e-05, 'samples': 22534848, 'steps': 117368, 'loss/train': 1.435349941253662} 11/07/2021 13:43:28 - INFO - __main__ - Step 117370: {'lr': 5.7612075859721144e-05, 'samples': 22535040, 'steps': 117369, 'loss/train': 1.2931402921676636} 11/07/2021 13:43:28 - INFO - __main__ - Step 117371: {'lr': 5.7608687098285015e-05, 'samples': 22535232, 'steps': 117370, 'loss/train': 1.782580852508545} 11/07/2021 13:43:29 - INFO - __main__ - Step 117372: {'lr': 5.7605298423537483e-05, 'samples': 22535424, 'steps': 117371, 'loss/train': 1.1337952613830566} 11/07/2021 13:43:29 - INFO - __main__ - Step 117373: {'lr': 5.760190983548e-05, 'samples': 22535616, 'steps': 117372, 'loss/train': 1.524487853050232} 11/07/2021 13:43:30 - INFO - __main__ - Step 117374: {'lr': 5.759852133411414e-05, 'samples': 22535808, 'steps': 117373, 'loss/train': 1.545153260231018} 11/07/2021 13:43:30 - INFO - __main__ - Step 117375: {'lr': 5.759513291944143e-05, 'samples': 22536000, 'steps': 117374, 'loss/train': 1.2313032150268555} 11/07/2021 13:43:31 - INFO - __main__ - Step 117376: {'lr': 5.759174459146338e-05, 'samples': 22536192, 'steps': 117375, 'loss/train': 1.1561743021011353} 11/07/2021 13:43:31 - INFO - __main__ - Step 117377: {'lr': 5.7588356350181505e-05, 'samples': 22536384, 'steps': 117376, 'loss/train': 1.7998143434524536} 11/07/2021 13:43:31 - INFO - __main__ - Step 117378: {'lr': 5.758496819559744e-05, 'samples': 22536576, 'steps': 117377, 'loss/train': 1.3023579120635986} 11/07/2021 13:43:32 - INFO - __main__ - Step 117379: {'lr': 5.758158012771253e-05, 'samples': 22536768, 'steps': 117378, 'loss/train': 0.6621732115745544} 11/07/2021 13:43:33 - INFO - __main__ - Step 117380: {'lr': 5.75781921465284e-05, 'samples': 22536960, 'steps': 117379, 'loss/train': 1.4753111600875854} 11/07/2021 13:43:33 - INFO - __main__ - Step 117381: {'lr': 5.757480425204656e-05, 'samples': 22537152, 'steps': 117380, 'loss/train': 1.0191365480422974} 11/07/2021 13:43:34 - INFO - __main__ - Step 117382: {'lr': 5.757141644426856e-05, 'samples': 22537344, 'steps': 117381, 'loss/train': 1.375459909439087} 11/07/2021 13:43:34 - INFO - __main__ - Step 117383: {'lr': 5.756802872319589e-05, 'samples': 22537536, 'steps': 117382, 'loss/train': 1.280717372894287} 11/07/2021 13:43:35 - INFO - __main__ - Step 117384: {'lr': 5.7564641088830114e-05, 'samples': 22537728, 'steps': 117383, 'loss/train': 1.2591296434402466} 11/07/2021 13:43:35 - INFO - __main__ - Step 117385: {'lr': 5.756125354117272e-05, 'samples': 22537920, 'steps': 117384, 'loss/train': 1.2384743690490723} 11/07/2021 13:43:36 - INFO - __main__ - Step 117386: {'lr': 5.755786608022528e-05, 'samples': 22538112, 'steps': 117385, 'loss/train': 1.287619948387146} 11/07/2021 13:43:36 - INFO - __main__ - Step 117387: {'lr': 5.755447870598929e-05, 'samples': 22538304, 'steps': 117386, 'loss/train': 1.1569328308105469} 11/07/2021 13:43:36 - INFO - __main__ - Step 117388: {'lr': 5.755109141846626e-05, 'samples': 22538496, 'steps': 117387, 'loss/train': 0.5343697667121887} 11/07/2021 13:43:37 - INFO - __main__ - Step 117389: {'lr': 5.754770421765776e-05, 'samples': 22538688, 'steps': 117388, 'loss/train': 1.5843220949172974} 11/07/2021 13:43:38 - INFO - __main__ - Step 117390: {'lr': 5.7544317103565306e-05, 'samples': 22538880, 'steps': 117389, 'loss/train': 1.4257612228393555} 11/07/2021 13:43:38 - INFO - __main__ - Step 117391: {'lr': 5.754093007619046e-05, 'samples': 22539072, 'steps': 117390, 'loss/train': 1.6533410549163818} 11/07/2021 13:43:38 - INFO - __main__ - Step 117392: {'lr': 5.7537543135534637e-05, 'samples': 22539264, 'steps': 117391, 'loss/train': 0.7164379954338074} 11/07/2021 13:43:39 - INFO - __main__ - Step 117393: {'lr': 5.753415628159944e-05, 'samples': 22539456, 'steps': 117392, 'loss/train': 1.5226359367370605} 11/07/2021 13:43:39 - INFO - __main__ - Step 117394: {'lr': 5.7530769514386375e-05, 'samples': 22539648, 'steps': 117393, 'loss/train': 1.359534740447998} 11/07/2021 13:43:40 - INFO - __main__ - Step 117395: {'lr': 5.752738283389697e-05, 'samples': 22539840, 'steps': 117394, 'loss/train': 1.069156527519226} 11/07/2021 13:43:41 - INFO - __main__ - Step 117396: {'lr': 5.7523996240132745e-05, 'samples': 22540032, 'steps': 117395, 'loss/train': 0.06277098506689072} 11/07/2021 13:43:41 - INFO - __main__ - Step 117397: {'lr': 5.752060973309525e-05, 'samples': 22540224, 'steps': 117396, 'loss/train': 1.0763002634048462} 11/07/2021 13:43:41 - INFO - __main__ - Step 117398: {'lr': 5.7517223312786e-05, 'samples': 22540416, 'steps': 117397, 'loss/train': 1.2451859712600708} 11/07/2021 13:43:42 - INFO - __main__ - Step 117399: {'lr': 5.751383697920653e-05, 'samples': 22540608, 'steps': 117398, 'loss/train': 1.3535337448120117} 11/07/2021 13:43:43 - INFO - __main__ - Step 117400: {'lr': 5.7510450732358335e-05, 'samples': 22540800, 'steps': 117399, 'loss/train': 1.5816906690597534} 11/07/2021 13:43:43 - INFO - __main__ - Step 117401: {'lr': 5.7507064572242976e-05, 'samples': 22540992, 'steps': 117400, 'loss/train': 1.054213285446167} 11/07/2021 13:43:44 - INFO - __main__ - Step 117402: {'lr': 5.7503678498861955e-05, 'samples': 22541184, 'steps': 117401, 'loss/train': 1.1782819032669067} 11/07/2021 13:43:44 - INFO - __main__ - Step 117403: {'lr': 5.750029251221686e-05, 'samples': 22541376, 'steps': 117402, 'loss/train': 1.2757763862609863} 11/07/2021 13:43:44 - INFO - __main__ - Step 117404: {'lr': 5.749690661230914e-05, 'samples': 22541568, 'steps': 117403, 'loss/train': 1.152740716934204} 11/07/2021 13:43:45 - INFO - __main__ - Step 117405: {'lr': 5.7493520799140304e-05, 'samples': 22541760, 'steps': 117404, 'loss/train': 0.9790191650390625} 11/07/2021 13:43:46 - INFO - __main__ - Step 117406: {'lr': 5.749013507271192e-05, 'samples': 22541952, 'steps': 117405, 'loss/train': 1.2159723043441772} 11/07/2021 13:43:46 - INFO - __main__ - Step 117407: {'lr': 5.748674943302551e-05, 'samples': 22542144, 'steps': 117406, 'loss/train': 1.631805181503296} 11/07/2021 13:43:46 - INFO - __main__ - Step 117408: {'lr': 5.74833638800826e-05, 'samples': 22542336, 'steps': 117407, 'loss/train': 1.0158147811889648} 11/07/2021 13:43:47 - INFO - __main__ - Step 117409: {'lr': 5.747997841388472e-05, 'samples': 22542528, 'steps': 117408, 'loss/train': 1.333762288093567} 11/07/2021 13:43:48 - INFO - __main__ - Step 117410: {'lr': 5.747659303443339e-05, 'samples': 22542720, 'steps': 117409, 'loss/train': 1.5769522190093994} 11/07/2021 13:43:49 - INFO - __main__ - Step 117411: {'lr': 5.7473207741730146e-05, 'samples': 22542912, 'steps': 117410, 'loss/train': 0.8527240753173828} 11/07/2021 13:43:49 - INFO - __main__ - Step 117412: {'lr': 5.746982253577651e-05, 'samples': 22543104, 'steps': 117411, 'loss/train': 1.0835906267166138} 11/07/2021 13:43:49 - INFO - __main__ - Step 117413: {'lr': 5.746643741657398e-05, 'samples': 22543296, 'steps': 117412, 'loss/train': 1.3971213102340698} 11/07/2021 13:43:50 - INFO - __main__ - Step 117414: {'lr': 5.7463052384124194e-05, 'samples': 22543488, 'steps': 117413, 'loss/train': 1.3005688190460205} 11/07/2021 13:43:50 - INFO - __main__ - Step 117415: {'lr': 5.745966743842848e-05, 'samples': 22543680, 'steps': 117414, 'loss/train': 0.8730303049087524} 11/07/2021 13:43:51 - INFO - __main__ - Step 117416: {'lr': 5.745628257948851e-05, 'samples': 22543872, 'steps': 117415, 'loss/train': 0.0675036832690239} 11/07/2021 13:43:51 - INFO - __main__ - Step 117417: {'lr': 5.7452897807305725e-05, 'samples': 22544064, 'steps': 117416, 'loss/train': 1.1097453832626343} 11/07/2021 13:43:52 - INFO - __main__ - Step 117418: {'lr': 5.7449513121881734e-05, 'samples': 22544256, 'steps': 117417, 'loss/train': 1.5349431037902832} 11/07/2021 13:43:52 - INFO - __main__ - Step 117419: {'lr': 5.7446128523218013e-05, 'samples': 22544448, 'steps': 117418, 'loss/train': 1.5454273223876953} 11/07/2021 13:43:52 - INFO - __main__ - Step 117420: {'lr': 5.744274401131608e-05, 'samples': 22544640, 'steps': 117419, 'loss/train': 1.899514079093933} 11/07/2021 13:43:53 - INFO - __main__ - Step 117421: {'lr': 5.743935958617746e-05, 'samples': 22544832, 'steps': 117420, 'loss/train': 1.1194953918457031} 11/07/2021 13:43:54 - INFO - __main__ - Step 117422: {'lr': 5.7435975247803726e-05, 'samples': 22545024, 'steps': 117421, 'loss/train': 1.8499258756637573} 11/07/2021 13:43:54 - INFO - __main__ - Step 117423: {'lr': 5.743259099619635e-05, 'samples': 22545216, 'steps': 117422, 'loss/train': 1.3212355375289917} 11/07/2021 13:43:54 - INFO - __main__ - Step 117424: {'lr': 5.742920683135689e-05, 'samples': 22545408, 'steps': 117423, 'loss/train': 1.6101716756820679} 11/07/2021 13:43:55 - INFO - __main__ - Step 117425: {'lr': 5.742582275328692e-05, 'samples': 22545600, 'steps': 117424, 'loss/train': 1.1956132650375366} 11/07/2021 13:43:55 - INFO - __main__ - Step 117426: {'lr': 5.7422438761987856e-05, 'samples': 22545792, 'steps': 117425, 'loss/train': 0.996773362159729} 11/07/2021 13:43:56 - INFO - __main__ - Step 117427: {'lr': 5.741905485746124e-05, 'samples': 22545984, 'steps': 117426, 'loss/train': 1.4015611410140991} 11/07/2021 13:43:57 - INFO - __main__ - Step 117428: {'lr': 5.741567103970863e-05, 'samples': 22546176, 'steps': 117427, 'loss/train': 0.9336305856704712} 11/07/2021 13:43:57 - INFO - __main__ - Step 117429: {'lr': 5.7412287308731546e-05, 'samples': 22546368, 'steps': 117428, 'loss/train': 0.9363494515419006} 11/07/2021 13:43:57 - INFO - __main__ - Step 117430: {'lr': 5.740890366453153e-05, 'samples': 22546560, 'steps': 117429, 'loss/train': 0.5311631560325623} 11/07/2021 13:43:58 - INFO - __main__ - Step 117431: {'lr': 5.740552010711009e-05, 'samples': 22546752, 'steps': 117430, 'loss/train': 1.2153939008712769} 11/07/2021 13:43:59 - INFO - __main__ - Step 117432: {'lr': 5.740213663646873e-05, 'samples': 22546944, 'steps': 117431, 'loss/train': 1.6235367059707642} 11/07/2021 13:43:59 - INFO - __main__ - Step 117433: {'lr': 5.739875325260902e-05, 'samples': 22547136, 'steps': 117432, 'loss/train': 1.3064548969268799} 11/07/2021 13:43:59 - INFO - __main__ - Step 117434: {'lr': 5.739536995553243e-05, 'samples': 22547328, 'steps': 117433, 'loss/train': 0.8921785950660706} 11/07/2021 13:44:00 - INFO - __main__ - Step 117435: {'lr': 5.7391986745240516e-05, 'samples': 22547520, 'steps': 117434, 'loss/train': 1.420248031616211} 11/07/2021 13:44:00 - INFO - __main__ - Step 117436: {'lr': 5.73886036217349e-05, 'samples': 22547712, 'steps': 117435, 'loss/train': 0.9633342027664185} 11/07/2021 13:44:01 - INFO - __main__ - Step 117437: {'lr': 5.7385220585016914e-05, 'samples': 22547904, 'steps': 117436, 'loss/train': 1.1666970252990723} 11/07/2021 13:44:02 - INFO - __main__ - Step 117438: {'lr': 5.7381837635088195e-05, 'samples': 22548096, 'steps': 117437, 'loss/train': 1.5799617767333984} 11/07/2021 13:44:02 - INFO - __main__ - Step 117439: {'lr': 5.737845477195022e-05, 'samples': 22548288, 'steps': 117438, 'loss/train': 1.1769119501113892} 11/07/2021 13:44:02 - INFO - __main__ - Step 117440: {'lr': 5.737507199560457e-05, 'samples': 22548480, 'steps': 117439, 'loss/train': 1.0908358097076416} 11/07/2021 13:44:03 - INFO - __main__ - Step 117441: {'lr': 5.737168930605272e-05, 'samples': 22548672, 'steps': 117440, 'loss/train': 1.0480577945709229} 11/07/2021 13:44:04 - INFO - __main__ - Step 117442: {'lr': 5.7368306703296234e-05, 'samples': 22548864, 'steps': 117441, 'loss/train': 1.5745733976364136} 11/07/2021 13:44:04 - INFO - __main__ - Step 117443: {'lr': 5.7364924187336606e-05, 'samples': 22549056, 'steps': 117442, 'loss/train': 0.5435487031936646} 11/07/2021 13:44:04 - INFO - __main__ - Step 117444: {'lr': 5.7361541758175344e-05, 'samples': 22549248, 'steps': 117443, 'loss/train': 1.249085783958435} 11/07/2021 13:44:05 - INFO - __main__ - Step 117445: {'lr': 5.735815941581404e-05, 'samples': 22549440, 'steps': 117444, 'loss/train': 1.1026153564453125} 11/07/2021 13:44:05 - INFO - __main__ - Step 117446: {'lr': 5.735477716025417e-05, 'samples': 22549632, 'steps': 117445, 'loss/train': 1.5947262048721313} 11/07/2021 13:44:06 - INFO - __main__ - Step 117447: {'lr': 5.73513949914973e-05, 'samples': 22549824, 'steps': 117446, 'loss/train': 1.7337474822998047} 11/07/2021 13:44:06 - INFO - __main__ - Step 117448: {'lr': 5.734801290954489e-05, 'samples': 22550016, 'steps': 117447, 'loss/train': 1.7758747339248657} 11/07/2021 13:44:07 - INFO - __main__ - Step 117449: {'lr': 5.7344630914398455e-05, 'samples': 22550208, 'steps': 117448, 'loss/train': 0.18250586092472076} 11/07/2021 13:44:07 - INFO - __main__ - Step 117450: {'lr': 5.734124900605958e-05, 'samples': 22550400, 'steps': 117449, 'loss/train': 1.5838338136672974} 11/07/2021 13:44:07 - INFO - __main__ - Step 117451: {'lr': 5.733786718452977e-05, 'samples': 22550592, 'steps': 117450, 'loss/train': 1.266605257987976} 11/07/2021 13:44:09 - INFO - __main__ - Step 117452: {'lr': 5.733448544981054e-05, 'samples': 22550784, 'steps': 117451, 'loss/train': 1.2338298559188843} 11/07/2021 13:44:09 - INFO - __main__ - Step 117453: {'lr': 5.7331103801903403e-05, 'samples': 22550976, 'steps': 117452, 'loss/train': 1.3364216089248657} 11/07/2021 13:44:09 - INFO - __main__ - Step 117454: {'lr': 5.7327722240809926e-05, 'samples': 22551168, 'steps': 117453, 'loss/train': 1.3059444427490234} 11/07/2021 13:44:10 - INFO - __main__ - Step 117455: {'lr': 5.732434076653159e-05, 'samples': 22551360, 'steps': 117454, 'loss/train': 1.1398711204528809} 11/07/2021 13:44:10 - INFO - __main__ - Step 117456: {'lr': 5.732095937906992e-05, 'samples': 22551552, 'steps': 117455, 'loss/train': 1.4897524118423462} 11/07/2021 13:44:11 - INFO - __main__ - Step 117457: {'lr': 5.731757807842647e-05, 'samples': 22551744, 'steps': 117456, 'loss/train': 0.7809873223304749} 11/07/2021 13:44:11 - INFO - __main__ - Step 117458: {'lr': 5.731419686460279e-05, 'samples': 22551936, 'steps': 117457, 'loss/train': 1.4015789031982422} 11/07/2021 13:44:12 - INFO - __main__ - Step 117459: {'lr': 5.731081573760033e-05, 'samples': 22552128, 'steps': 117458, 'loss/train': 1.2006325721740723} 11/07/2021 13:44:12 - INFO - __main__ - Step 117460: {'lr': 5.7307434697420616e-05, 'samples': 22552320, 'steps': 117459, 'loss/train': 1.4932461977005005} 11/07/2021 13:44:12 - INFO - __main__ - Step 117461: {'lr': 5.7304053744065193e-05, 'samples': 22552512, 'steps': 117460, 'loss/train': 1.0435717105865479} 11/07/2021 13:44:13 - INFO - __main__ - Step 117462: {'lr': 5.7300672877535595e-05, 'samples': 22552704, 'steps': 117461, 'loss/train': 1.3663729429244995} 11/07/2021 13:44:14 - INFO - __main__ - Step 117463: {'lr': 5.729729209783335e-05, 'samples': 22552896, 'steps': 117462, 'loss/train': 1.3687621355056763} 11/07/2021 13:44:14 - INFO - __main__ - Step 117464: {'lr': 5.729391140495999e-05, 'samples': 22553088, 'steps': 117463, 'loss/train': 1.5782363414764404} 11/07/2021 13:44:14 - INFO - __main__ - Step 117465: {'lr': 5.7290530798916995e-05, 'samples': 22553280, 'steps': 117464, 'loss/train': 1.2253555059432983} 11/07/2021 13:44:15 - INFO - __main__ - Step 117466: {'lr': 5.7287150279705904e-05, 'samples': 22553472, 'steps': 117465, 'loss/train': 0.5126550793647766} 11/07/2021 13:44:15 - INFO - __main__ - Step 117467: {'lr': 5.728376984732825e-05, 'samples': 22553664, 'steps': 117466, 'loss/train': 1.3675644397735596} 11/07/2021 13:44:16 - INFO - __main__ - Step 117468: {'lr': 5.7280389501785575e-05, 'samples': 22553856, 'steps': 117467, 'loss/train': 0.868456244468689} 11/07/2021 13:44:17 - INFO - __main__ - Step 117469: {'lr': 5.727700924307938e-05, 'samples': 22554048, 'steps': 117468, 'loss/train': 1.2350696325302124} 11/07/2021 13:44:17 - INFO - __main__ - Step 117470: {'lr': 5.727362907121117e-05, 'samples': 22554240, 'steps': 117469, 'loss/train': 1.3286561965942383} 11/07/2021 13:44:17 - INFO - __main__ - Step 117471: {'lr': 5.727024898618252e-05, 'samples': 22554432, 'steps': 117470, 'loss/train': 1.4250952005386353} 11/07/2021 13:44:18 - INFO - __main__ - Step 117472: {'lr': 5.726686898799496e-05, 'samples': 22554624, 'steps': 117471, 'loss/train': 1.2126760482788086} 11/07/2021 13:44:19 - INFO - __main__ - Step 117473: {'lr': 5.726348907664994e-05, 'samples': 22554816, 'steps': 117472, 'loss/train': 1.3393715620040894} 11/07/2021 13:44:19 - INFO - __main__ - Step 117474: {'lr': 5.7260109252149e-05, 'samples': 22555008, 'steps': 117473, 'loss/train': 1.3640556335449219} 11/07/2021 13:44:19 - INFO - __main__ - Step 117475: {'lr': 5.7256729514493677e-05, 'samples': 22555200, 'steps': 117474, 'loss/train': 1.415959358215332} 11/07/2021 13:44:20 - INFO - __main__ - Step 117476: {'lr': 5.72533498636855e-05, 'samples': 22555392, 'steps': 117475, 'loss/train': 1.6485673189163208} 11/07/2021 13:44:20 - INFO - __main__ - Step 117477: {'lr': 5.7249970299725977e-05, 'samples': 22555584, 'steps': 117476, 'loss/train': 1.2475403547286987} 11/07/2021 13:44:21 - INFO - __main__ - Step 117478: {'lr': 5.7246590822616654e-05, 'samples': 22555776, 'steps': 117477, 'loss/train': 1.1006752252578735} 11/07/2021 13:44:22 - INFO - __main__ - Step 117479: {'lr': 5.7243211432359055e-05, 'samples': 22555968, 'steps': 117478, 'loss/train': 2.048168420791626} 11/07/2021 13:44:22 - INFO - __main__ - Step 117480: {'lr': 5.723983212895467e-05, 'samples': 22556160, 'steps': 117479, 'loss/train': 1.0950523614883423} 11/07/2021 13:44:22 - INFO - __main__ - Step 117481: {'lr': 5.7236452912405034e-05, 'samples': 22556352, 'steps': 117480, 'loss/train': 1.9280272722244263} 11/07/2021 13:44:23 - INFO - __main__ - Step 117482: {'lr': 5.72330737827117e-05, 'samples': 22556544, 'steps': 117481, 'loss/train': 1.7427324056625366} 11/07/2021 13:44:24 - INFO - __main__ - Step 117483: {'lr': 5.722969473987616e-05, 'samples': 22556736, 'steps': 117482, 'loss/train': 1.8786656856536865} 11/07/2021 13:44:24 - INFO - __main__ - Step 117484: {'lr': 5.722631578389995e-05, 'samples': 22556928, 'steps': 117483, 'loss/train': 0.8718059659004211} 11/07/2021 13:44:24 - INFO - __main__ - Step 117485: {'lr': 5.722293691478467e-05, 'samples': 22557120, 'steps': 117484, 'loss/train': 1.3870651721954346} 11/07/2021 13:44:25 - INFO - __main__ - Step 117486: {'lr': 5.7219558132531656e-05, 'samples': 22557312, 'steps': 117485, 'loss/train': 1.066039800643921} 11/07/2021 13:44:25 - INFO - __main__ - Step 117487: {'lr': 5.7216179437142576e-05, 'samples': 22557504, 'steps': 117486, 'loss/train': 1.1238877773284912} 11/07/2021 13:44:26 - INFO - __main__ - Step 117488: {'lr': 5.721280082861888e-05, 'samples': 22557696, 'steps': 117487, 'loss/train': 1.337391972541809} 11/07/2021 13:44:27 - INFO - __main__ - Step 117489: {'lr': 5.720942230696213e-05, 'samples': 22557888, 'steps': 117488, 'loss/train': 0.9766577482223511} 11/07/2021 13:44:27 - INFO - __main__ - Step 117490: {'lr': 5.7206043872173846e-05, 'samples': 22558080, 'steps': 117489, 'loss/train': 1.5007355213165283} 11/07/2021 13:44:27 - INFO - __main__ - Step 117491: {'lr': 5.7202665524255516e-05, 'samples': 22558272, 'steps': 117490, 'loss/train': 1.0401362180709839} 11/07/2021 13:44:28 - INFO - __main__ - Step 117492: {'lr': 5.719928726320872e-05, 'samples': 22558464, 'steps': 117491, 'loss/train': 1.211668848991394} 11/07/2021 13:44:29 - INFO - __main__ - Step 117493: {'lr': 5.719590908903494e-05, 'samples': 22558656, 'steps': 117492, 'loss/train': 0.8191694617271423} 11/07/2021 13:44:29 - INFO - __main__ - Step 117494: {'lr': 5.7192531001735716e-05, 'samples': 22558848, 'steps': 117493, 'loss/train': 1.1925883293151855} 11/07/2021 13:44:29 - INFO - __main__ - Step 117495: {'lr': 5.718915300131256e-05, 'samples': 22559040, 'steps': 117494, 'loss/train': 1.5213723182678223} 11/07/2021 13:44:30 - INFO - __main__ - Step 117496: {'lr': 5.718577508776698e-05, 'samples': 22559232, 'steps': 117495, 'loss/train': 1.3243447542190552} 11/07/2021 13:44:30 - INFO - __main__ - Step 117497: {'lr': 5.718239726110053e-05, 'samples': 22559424, 'steps': 117496, 'loss/train': 1.7361583709716797} 11/07/2021 13:44:31 - INFO - __main__ - Step 117498: {'lr': 5.717901952131471e-05, 'samples': 22559616, 'steps': 117497, 'loss/train': 1.397274136543274} 11/07/2021 13:44:31 - INFO - __main__ - Step 117499: {'lr': 5.7175641868411124e-05, 'samples': 22559808, 'steps': 117498, 'loss/train': 0.5414063930511475} 11/07/2021 13:44:32 - INFO - __main__ - Step 117500: {'lr': 5.7172264302391145e-05, 'samples': 22560000, 'steps': 117499, 'loss/train': 0.9901521801948547} 11/07/2021 13:44:32 - INFO - __main__ - Step 117501: {'lr': 5.7168886823256355e-05, 'samples': 22560192, 'steps': 117500, 'loss/train': 1.058496117591858} 11/07/2021 13:44:32 - INFO - __main__ - Step 117502: {'lr': 5.7165509431008315e-05, 'samples': 22560384, 'steps': 117501, 'loss/train': 1.317308783531189} 11/07/2021 13:44:34 - INFO - __main__ - Step 117503: {'lr': 5.7162132125648495e-05, 'samples': 22560576, 'steps': 117502, 'loss/train': 1.4708045721054077} 11/07/2021 13:44:34 - INFO - __main__ - Step 117504: {'lr': 5.715875490717845e-05, 'samples': 22560768, 'steps': 117503, 'loss/train': 1.4222301244735718} 11/07/2021 13:44:34 - INFO - __main__ - Step 117505: {'lr': 5.7155377775599706e-05, 'samples': 22560960, 'steps': 117504, 'loss/train': 0.981773316860199} 11/07/2021 13:44:35 - INFO - __main__ - Step 117506: {'lr': 5.715200073091378e-05, 'samples': 22561152, 'steps': 117505, 'loss/train': 1.7359849214553833} 11/07/2021 13:44:35 - INFO - __main__ - Step 117507: {'lr': 5.714862377312216e-05, 'samples': 22561344, 'steps': 117506, 'loss/train': 1.2356584072113037} 11/07/2021 13:44:35 - INFO - __main__ - Step 117508: {'lr': 5.7145246902226416e-05, 'samples': 22561536, 'steps': 117507, 'loss/train': 1.3807988166809082} 11/07/2021 13:44:36 - INFO - __main__ - Step 117509: {'lr': 5.7141870118228026e-05, 'samples': 22561728, 'steps': 117508, 'loss/train': 1.0735732316970825} 11/07/2021 13:44:37 - INFO - __main__ - Step 117510: {'lr': 5.713849342112856e-05, 'samples': 22561920, 'steps': 117509, 'loss/train': 1.14808988571167} 11/07/2021 13:44:37 - INFO - __main__ - Step 117511: {'lr': 5.713511681092951e-05, 'samples': 22562112, 'steps': 117510, 'loss/train': 1.234729528427124} 11/07/2021 13:44:37 - INFO - __main__ - Step 117512: {'lr': 5.713174028763246e-05, 'samples': 22562304, 'steps': 117511, 'loss/train': 1.2274434566497803} 11/07/2021 13:44:38 - INFO - __main__ - Step 117513: {'lr': 5.712836385123879e-05, 'samples': 22562496, 'steps': 117512, 'loss/train': 0.7414796352386475} 11/07/2021 13:44:39 - INFO - __main__ - Step 117514: {'lr': 5.71249875017501e-05, 'samples': 22562688, 'steps': 117513, 'loss/train': 1.385809302330017} 11/07/2021 13:44:40 - INFO - __main__ - Step 117515: {'lr': 5.7121611239167954e-05, 'samples': 22562880, 'steps': 117514, 'loss/train': 0.6535149216651917} 11/07/2021 13:44:40 - INFO - __main__ - Step 117516: {'lr': 5.711823506349379e-05, 'samples': 22563072, 'steps': 117515, 'loss/train': 1.4835697412490845} 11/07/2021 13:44:40 - INFO - __main__ - Step 117517: {'lr': 5.7114858974729204e-05, 'samples': 22563264, 'steps': 117516, 'loss/train': 1.2552851438522339} 11/07/2021 13:44:41 - INFO - __main__ - Step 117518: {'lr': 5.7111482972875664e-05, 'samples': 22563456, 'steps': 117517, 'loss/train': 1.453314185142517} 11/07/2021 13:44:42 - INFO - __main__ - Step 117519: {'lr': 5.710810705793473e-05, 'samples': 22563648, 'steps': 117518, 'loss/train': 0.3778119385242462} 11/07/2021 13:44:42 - INFO - __main__ - Step 117520: {'lr': 5.710473122990789e-05, 'samples': 22563840, 'steps': 117519, 'loss/train': 1.1587661504745483} 11/07/2021 13:44:42 - INFO - __main__ - Step 117521: {'lr': 5.710135548879669e-05, 'samples': 22564032, 'steps': 117520, 'loss/train': 1.3765850067138672} 11/07/2021 13:44:43 - INFO - __main__ - Step 117522: {'lr': 5.709797983460266e-05, 'samples': 22564224, 'steps': 117521, 'loss/train': 0.8203315734863281} 11/07/2021 13:44:43 - INFO - __main__ - Step 117523: {'lr': 5.709460426732727e-05, 'samples': 22564416, 'steps': 117522, 'loss/train': 1.3823504447937012} 11/07/2021 13:44:43 - INFO - __main__ - Step 117524: {'lr': 5.7091228786972094e-05, 'samples': 22564608, 'steps': 117523, 'loss/train': 1.3991835117340088} 11/07/2021 13:44:45 - INFO - __main__ - Step 117525: {'lr': 5.708785339353864e-05, 'samples': 22564800, 'steps': 117524, 'loss/train': 1.0330028533935547} 11/07/2021 13:44:45 - INFO - __main__ - Step 117526: {'lr': 5.7084478087028494e-05, 'samples': 22564992, 'steps': 117525, 'loss/train': 1.512765645980835} 11/07/2021 13:44:46 - INFO - __main__ - Step 117527: {'lr': 5.7081102867443e-05, 'samples': 22565184, 'steps': 117526, 'loss/train': 1.3492798805236816} 11/07/2021 13:44:46 - INFO - __main__ - Step 117528: {'lr': 5.7077727734783814e-05, 'samples': 22565376, 'steps': 117527, 'loss/train': 1.7537869215011597} 11/07/2021 13:44:46 - INFO - __main__ - Step 117529: {'lr': 5.7074352689052425e-05, 'samples': 22565568, 'steps': 117528, 'loss/train': 1.7403355836868286} 11/07/2021 13:44:47 - INFO - __main__ - Step 117530: {'lr': 5.707097773025035e-05, 'samples': 22565760, 'steps': 117529, 'loss/train': 1.4791247844696045} 11/07/2021 13:44:48 - INFO - __main__ - Step 117531: {'lr': 5.7067602858379144e-05, 'samples': 22565952, 'steps': 117530, 'loss/train': 1.1720657348632812} 11/07/2021 13:44:48 - INFO - __main__ - Step 117532: {'lr': 5.706422807344025e-05, 'samples': 22566144, 'steps': 117531, 'loss/train': 0.962854266166687} 11/07/2021 13:44:48 - INFO - __main__ - Step 117533: {'lr': 5.7060853375435264e-05, 'samples': 22566336, 'steps': 117532, 'loss/train': 1.447394609451294} 11/07/2021 13:44:49 - INFO - __main__ - Step 117534: {'lr': 5.7057478764365676e-05, 'samples': 22566528, 'steps': 117533, 'loss/train': 0.8455798029899597} 11/07/2021 13:44:49 - INFO - __main__ - Step 117535: {'lr': 5.705410424023302e-05, 'samples': 22566720, 'steps': 117534, 'loss/train': 0.9215230941772461} 11/07/2021 13:44:50 - INFO - __main__ - Step 117536: {'lr': 5.70507298030388e-05, 'samples': 22566912, 'steps': 117535, 'loss/train': 1.8707101345062256} 11/07/2021 13:44:51 - INFO - __main__ - Step 117537: {'lr': 5.704735545278453e-05, 'samples': 22567104, 'steps': 117536, 'loss/train': 1.247555136680603} 11/07/2021 13:44:51 - INFO - __main__ - Step 117538: {'lr': 5.704398118947177e-05, 'samples': 22567296, 'steps': 117537, 'loss/train': 1.2039079666137695} 11/07/2021 13:44:51 - INFO - __main__ - Step 117539: {'lr': 5.704060701310207e-05, 'samples': 22567488, 'steps': 117538, 'loss/train': 0.904130220413208} 11/07/2021 13:44:52 - INFO - __main__ - Step 117540: {'lr': 5.703723292367682e-05, 'samples': 22567680, 'steps': 117539, 'loss/train': 1.1861698627471924} 11/07/2021 13:44:52 - INFO - __main__ - Step 117541: {'lr': 5.703385892119764e-05, 'samples': 22567872, 'steps': 117540, 'loss/train': 1.3601967096328735} 11/07/2021 13:44:53 - INFO - __main__ - Step 117542: {'lr': 5.703048500566599e-05, 'samples': 22568064, 'steps': 117541, 'loss/train': 0.833301842212677} 11/07/2021 13:44:54 - INFO - __main__ - Step 117543: {'lr': 5.702711117708345e-05, 'samples': 22568256, 'steps': 117542, 'loss/train': 1.3800103664398193} 11/07/2021 13:44:54 - INFO - __main__ - Step 117544: {'lr': 5.70237374354515e-05, 'samples': 22568448, 'steps': 117543, 'loss/train': 1.6703296899795532} 11/07/2021 13:44:54 - INFO - __main__ - Step 117545: {'lr': 5.702036378077169e-05, 'samples': 22568640, 'steps': 117544, 'loss/train': 1.113674283027649} 11/07/2021 13:44:55 - INFO - __main__ - Step 117546: {'lr': 5.7016990213045516e-05, 'samples': 22568832, 'steps': 117545, 'loss/train': 0.971510112285614} 11/07/2021 13:44:56 - INFO - __main__ - Step 117547: {'lr': 5.7013616732274534e-05, 'samples': 22569024, 'steps': 117546, 'loss/train': 0.8476462364196777} 11/07/2021 13:44:56 - INFO - __main__ - Step 117548: {'lr': 5.701024333846019e-05, 'samples': 22569216, 'steps': 117547, 'loss/train': 1.3881988525390625} 11/07/2021 13:44:56 - INFO - __main__ - Step 117549: {'lr': 5.7006870031604096e-05, 'samples': 22569408, 'steps': 117548, 'loss/train': 1.7252914905548096} 11/07/2021 13:44:57 - INFO - __main__ - Step 117550: {'lr': 5.7003496811707716e-05, 'samples': 22569600, 'steps': 117549, 'loss/train': 1.6202712059020996} 11/07/2021 13:44:57 - INFO - __main__ - Step 117551: {'lr': 5.700012367877258e-05, 'samples': 22569792, 'steps': 117550, 'loss/train': 1.1049997806549072} 11/07/2021 13:44:57 - INFO - __main__ - Step 117552: {'lr': 5.6996750632800215e-05, 'samples': 22569984, 'steps': 117551, 'loss/train': 1.450827956199646} 11/07/2021 13:44:58 - INFO - __main__ - Step 117553: {'lr': 5.6993377673792205e-05, 'samples': 22570176, 'steps': 117552, 'loss/train': 1.6507575511932373} 11/07/2021 13:44:59 - INFO - __main__ - Step 117554: {'lr': 5.69900048017499e-05, 'samples': 22570368, 'steps': 117553, 'loss/train': 1.288050889968872} 11/07/2021 13:44:59 - INFO - __main__ - Step 117555: {'lr': 5.698663201667495e-05, 'samples': 22570560, 'steps': 117554, 'loss/train': 1.0069130659103394} 11/07/2021 13:44:59 - INFO - __main__ - Step 117556: {'lr': 5.698325931856885e-05, 'samples': 22570752, 'steps': 117555, 'loss/train': 1.1526705026626587} 11/07/2021 13:45:00 - INFO - __main__ - Step 117557: {'lr': 5.6979886707433123e-05, 'samples': 22570944, 'steps': 117556, 'loss/train': 1.327100396156311} 11/07/2021 13:45:01 - INFO - __main__ - Step 117558: {'lr': 5.697651418326924e-05, 'samples': 22571136, 'steps': 117557, 'loss/train': 1.0956896543502808} 11/07/2021 13:45:01 - INFO - __main__ - Step 117559: {'lr': 5.697314174607879e-05, 'samples': 22571328, 'steps': 117558, 'loss/train': 1.1451126337051392} 11/07/2021 13:45:02 - INFO - __main__ - Step 117560: {'lr': 5.696976939586327e-05, 'samples': 22571520, 'steps': 117559, 'loss/train': 0.9464536309242249} 11/07/2021 13:45:02 - INFO - __main__ - Step 117561: {'lr': 5.6966397132624166e-05, 'samples': 22571712, 'steps': 117560, 'loss/train': 1.1819303035736084} 11/07/2021 13:45:02 - INFO - __main__ - Step 117562: {'lr': 5.696302495636305e-05, 'samples': 22571904, 'steps': 117561, 'loss/train': 1.2904917001724243} 11/07/2021 13:45:03 - INFO - __main__ - Step 117563: {'lr': 5.695965286708141e-05, 'samples': 22572096, 'steps': 117562, 'loss/train': 1.2831695079803467} 11/07/2021 13:45:04 - INFO - __main__ - Step 117564: {'lr': 5.6956280864780775e-05, 'samples': 22572288, 'steps': 117563, 'loss/train': 1.315781831741333} 11/07/2021 13:45:04 - INFO - __main__ - Step 117565: {'lr': 5.6952908949462646e-05, 'samples': 22572480, 'steps': 117564, 'loss/train': 1.134021520614624} 11/07/2021 13:45:04 - INFO - __main__ - Step 117566: {'lr': 5.694953712112863e-05, 'samples': 22572672, 'steps': 117565, 'loss/train': 1.0623160600662231} 11/07/2021 13:45:05 - INFO - __main__ - Step 117567: {'lr': 5.6946165379780115e-05, 'samples': 22572864, 'steps': 117566, 'loss/train': 1.2146424055099487} 11/07/2021 13:45:06 - INFO - __main__ - Step 117568: {'lr': 5.694279372541866e-05, 'samples': 22573056, 'steps': 117567, 'loss/train': 1.4164693355560303} 11/07/2021 13:45:06 - INFO - __main__ - Step 117569: {'lr': 5.6939422158045844e-05, 'samples': 22573248, 'steps': 117568, 'loss/train': 1.4815367460250854} 11/07/2021 13:45:07 - INFO - __main__ - Step 117570: {'lr': 5.693605067766311e-05, 'samples': 22573440, 'steps': 117569, 'loss/train': 1.3653864860534668} 11/07/2021 13:45:07 - INFO - __main__ - Step 117571: {'lr': 5.693267928427201e-05, 'samples': 22573632, 'steps': 117570, 'loss/train': 0.9948859214782715} 11/07/2021 13:45:07 - INFO - __main__ - Step 117572: {'lr': 5.6929307977874076e-05, 'samples': 22573824, 'steps': 117571, 'loss/train': 1.617419958114624} 11/07/2021 13:45:08 - INFO - __main__ - Step 117573: {'lr': 5.692593675847082e-05, 'samples': 22574016, 'steps': 117572, 'loss/train': 0.8091336488723755} 11/07/2021 13:45:09 - INFO - __main__ - Step 117574: {'lr': 5.692256562606376e-05, 'samples': 22574208, 'steps': 117573, 'loss/train': 1.410736322402954} 11/07/2021 13:45:09 - INFO - __main__ - Step 117575: {'lr': 5.691919458065439e-05, 'samples': 22574400, 'steps': 117574, 'loss/train': 1.6389225721359253} 11/07/2021 13:45:09 - INFO - __main__ - Step 117576: {'lr': 5.691582362224429e-05, 'samples': 22574592, 'steps': 117575, 'loss/train': 1.1280983686447144} 11/07/2021 13:45:10 - INFO - __main__ - Step 117577: {'lr': 5.69124527508349e-05, 'samples': 22574784, 'steps': 117576, 'loss/train': 1.3027817010879517} 11/07/2021 13:45:10 - INFO - __main__ - Step 117578: {'lr': 5.69090819664278e-05, 'samples': 22574976, 'steps': 117577, 'loss/train': 1.6675665378570557} 11/07/2021 13:45:11 - INFO - __main__ - Step 117579: {'lr': 5.690571126902458e-05, 'samples': 22575168, 'steps': 117578, 'loss/train': 0.9385937452316284} 11/07/2021 13:45:11 - INFO - __main__ - Step 117580: {'lr': 5.6902340658626564e-05, 'samples': 22575360, 'steps': 117579, 'loss/train': 1.4852052927017212} 11/07/2021 13:45:12 - INFO - __main__ - Step 117581: {'lr': 5.689897013523537e-05, 'samples': 22575552, 'steps': 117580, 'loss/train': 1.2644062042236328} 11/07/2021 13:45:12 - INFO - __main__ - Step 117582: {'lr': 5.689559969885255e-05, 'samples': 22575744, 'steps': 117581, 'loss/train': 1.0453585386276245} 11/07/2021 13:45:12 - INFO - __main__ - Step 117583: {'lr': 5.689222934947958e-05, 'samples': 22575936, 'steps': 117582, 'loss/train': 1.3299312591552734} 11/07/2021 13:45:14 - INFO - __main__ - Step 117584: {'lr': 5.688885908711797e-05, 'samples': 22576128, 'steps': 117583, 'loss/train': 1.2792409658432007} 11/07/2021 13:45:14 - INFO - __main__ - Step 117585: {'lr': 5.688548891176929e-05, 'samples': 22576320, 'steps': 117584, 'loss/train': 1.1616463661193848} 11/07/2021 13:45:15 - INFO - __main__ - Step 117586: {'lr': 5.6882118823435e-05, 'samples': 22576512, 'steps': 117585, 'loss/train': 0.5115747451782227} 11/07/2021 13:45:15 - INFO - __main__ - Step 117587: {'lr': 5.687874882211666e-05, 'samples': 22576704, 'steps': 117586, 'loss/train': 0.9701082706451416} 11/07/2021 13:45:16 - INFO - __main__ - Step 117588: {'lr': 5.68753789078158e-05, 'samples': 22576896, 'steps': 117587, 'loss/train': 0.9033852219581604} 11/07/2021 13:45:17 - INFO - __main__ - Step 117589: {'lr': 5.6872009080533885e-05, 'samples': 22577088, 'steps': 117588, 'loss/train': 1.064430832862854} 11/07/2021 13:45:17 - INFO - __main__ - Step 117590: {'lr': 5.6868639340272474e-05, 'samples': 22577280, 'steps': 117589, 'loss/train': 1.23395836353302} 11/07/2021 13:45:17 - INFO - __main__ - Step 117591: {'lr': 5.6865269687033066e-05, 'samples': 22577472, 'steps': 117590, 'loss/train': 1.4708921909332275} 11/07/2021 13:45:18 - INFO - __main__ - Step 117592: {'lr': 5.686190012081719e-05, 'samples': 22577664, 'steps': 117591, 'loss/train': 0.9647570252418518} 11/07/2021 13:45:18 - INFO - __main__ - Step 117593: {'lr': 5.685853064162644e-05, 'samples': 22577856, 'steps': 117592, 'loss/train': 0.48343825340270996} 11/07/2021 13:45:18 - INFO - __main__ - Step 117594: {'lr': 5.685516124946219e-05, 'samples': 22578048, 'steps': 117593, 'loss/train': 2.338508129119873} 11/07/2021 13:45:19 - INFO - __main__ - Step 117595: {'lr': 5.685179194432599e-05, 'samples': 22578240, 'steps': 117594, 'loss/train': 2.260207176208496} 11/07/2021 13:45:20 - INFO - __main__ - Step 117596: {'lr': 5.684842272621943e-05, 'samples': 22578432, 'steps': 117595, 'loss/train': 1.1508238315582275} 11/07/2021 13:45:20 - INFO - __main__ - Step 117597: {'lr': 5.684505359514397e-05, 'samples': 22578624, 'steps': 117596, 'loss/train': 1.4793508052825928} 11/07/2021 13:45:21 - INFO - __main__ - Step 117598: {'lr': 5.684168455110117e-05, 'samples': 22578816, 'steps': 117597, 'loss/train': 1.4289804697036743} 11/07/2021 13:45:21 - INFO - __main__ - Step 117599: {'lr': 5.68383155940925e-05, 'samples': 22579008, 'steps': 117598, 'loss/train': 0.8329165577888489} 11/07/2021 13:45:22 - INFO - __main__ - Step 117600: {'lr': 5.6834946724119515e-05, 'samples': 22579200, 'steps': 117599, 'loss/train': 0.8784219622612} 11/07/2021 13:45:22 - INFO - __main__ - Step 117601: {'lr': 5.683157794118371e-05, 'samples': 22579392, 'steps': 117600, 'loss/train': 1.2057762145996094} 11/07/2021 13:45:23 - INFO - __main__ - Step 117602: {'lr': 5.6828209245286644e-05, 'samples': 22579584, 'steps': 117601, 'loss/train': 1.0206139087677002} 11/07/2021 13:45:23 - INFO - __main__ - Step 117603: {'lr': 5.682484063642979e-05, 'samples': 22579776, 'steps': 117602, 'loss/train': 1.4360988140106201} 11/07/2021 13:45:23 - INFO - __main__ - Step 117604: {'lr': 5.6821472114614666e-05, 'samples': 22579968, 'steps': 117603, 'loss/train': 1.3820061683654785} 11/07/2021 13:45:24 - INFO - __main__ - Step 117605: {'lr': 5.681810367984283e-05, 'samples': 22580160, 'steps': 117604, 'loss/train': 1.1244457960128784} 11/07/2021 13:45:25 - INFO - __main__ - Step 117606: {'lr': 5.6814735332115844e-05, 'samples': 22580352, 'steps': 117605, 'loss/train': 1.6339874267578125} 11/07/2021 13:45:25 - INFO - __main__ - Step 117607: {'lr': 5.681136707143506e-05, 'samples': 22580544, 'steps': 117606, 'loss/train': 1.4854265451431274} 11/07/2021 13:45:25 - INFO - __main__ - Step 117608: {'lr': 5.680799889780211e-05, 'samples': 22580736, 'steps': 117607, 'loss/train': 1.2418453693389893} 11/07/2021 13:45:26 - INFO - __main__ - Step 117609: {'lr': 5.680463081121851e-05, 'samples': 22580928, 'steps': 117608, 'loss/train': 1.4217063188552856} 11/07/2021 13:45:26 - INFO - __main__ - Step 117610: {'lr': 5.680126281168574e-05, 'samples': 22581120, 'steps': 117609, 'loss/train': 1.1872005462646484} 11/07/2021 13:45:28 - INFO - __main__ - Step 117611: {'lr': 5.679789489920536e-05, 'samples': 22581312, 'steps': 117610, 'loss/train': 1.5312334299087524} 11/07/2021 13:45:28 - INFO - __main__ - Step 117612: {'lr': 5.6794527073778854e-05, 'samples': 22581504, 'steps': 117611, 'loss/train': 0.6843357086181641} 11/07/2021 13:45:28 - INFO - __main__ - Step 117613: {'lr': 5.679115933540777e-05, 'samples': 22581696, 'steps': 117612, 'loss/train': 1.4919344186782837} 11/07/2021 13:45:29 - INFO - __main__ - Step 117614: {'lr': 5.6787791684093563e-05, 'samples': 22581888, 'steps': 117613, 'loss/train': 1.3772603273391724} 11/07/2021 13:45:29 - INFO - __main__ - Step 117615: {'lr': 5.678442411983783e-05, 'samples': 22582080, 'steps': 117614, 'loss/train': 0.08289730548858643} 11/07/2021 13:45:30 - INFO - __main__ - Step 117616: {'lr': 5.678105664264205e-05, 'samples': 22582272, 'steps': 117615, 'loss/train': 1.5609713792800903} 11/07/2021 13:45:30 - INFO - __main__ - Step 117617: {'lr': 5.677768925250776e-05, 'samples': 22582464, 'steps': 117616, 'loss/train': 1.57851243019104} 11/07/2021 13:45:31 - INFO - __main__ - Step 117618: {'lr': 5.677432194943644e-05, 'samples': 22582656, 'steps': 117617, 'loss/train': 1.062320590019226} 11/07/2021 13:45:31 - INFO - __main__ - Step 117619: {'lr': 5.677095473342964e-05, 'samples': 22582848, 'steps': 117618, 'loss/train': 1.2084071636199951} 11/07/2021 13:45:31 - INFO - __main__ - Step 117620: {'lr': 5.676758760448891e-05, 'samples': 22583040, 'steps': 117619, 'loss/train': 1.2170820236206055} 11/07/2021 13:45:33 - INFO - __main__ - Step 117621: {'lr': 5.6764220562615685e-05, 'samples': 22583232, 'steps': 117620, 'loss/train': 0.35588404536247253} 11/07/2021 13:45:33 - INFO - __main__ - Step 117622: {'lr': 5.676085360781152e-05, 'samples': 22583424, 'steps': 117621, 'loss/train': 1.7161911725997925} 11/07/2021 13:45:33 - INFO - __main__ - Step 117623: {'lr': 5.6757486740077916e-05, 'samples': 22583616, 'steps': 117622, 'loss/train': 1.7533888816833496} 11/07/2021 13:45:34 - INFO - __main__ - Step 117624: {'lr': 5.67541199594164e-05, 'samples': 22583808, 'steps': 117623, 'loss/train': 1.2867119312286377} 11/07/2021 13:45:34 - INFO - __main__ - Step 117625: {'lr': 5.6750753265828514e-05, 'samples': 22584000, 'steps': 117624, 'loss/train': 1.4487080574035645} 11/07/2021 13:45:36 - INFO - __main__ - Step 117626: {'lr': 5.6747386659315755e-05, 'samples': 22584192, 'steps': 117625, 'loss/train': 0.9148855805397034} 11/07/2021 13:45:36 - INFO - __main__ - Step 117627: {'lr': 5.674402013987964e-05, 'samples': 22584384, 'steps': 117626, 'loss/train': 1.8299412727355957} 11/07/2021 13:45:36 - INFO - __main__ - Step 117628: {'lr': 5.674065370752168e-05, 'samples': 22584576, 'steps': 117627, 'loss/train': 1.1705652475357056} 11/07/2021 13:45:37 - INFO - __main__ - Step 117629: {'lr': 5.67372873622434e-05, 'samples': 22584768, 'steps': 117628, 'loss/train': 1.142176628112793} 11/07/2021 13:45:37 - INFO - __main__ - Step 117630: {'lr': 5.673392110404632e-05, 'samples': 22584960, 'steps': 117629, 'loss/train': 1.7170982360839844} 11/07/2021 13:45:37 - INFO - __main__ - Step 117631: {'lr': 5.673055493293197e-05, 'samples': 22585152, 'steps': 117630, 'loss/train': 0.988183319568634} 11/07/2021 13:45:38 - INFO - __main__ - Step 117632: {'lr': 5.672718884890182e-05, 'samples': 22585344, 'steps': 117631, 'loss/train': 1.1978484392166138} 11/07/2021 13:45:39 - INFO - __main__ - Step 117633: {'lr': 5.672382285195751e-05, 'samples': 22585536, 'steps': 117632, 'loss/train': 1.4443470239639282} 11/07/2021 13:45:39 - INFO - __main__ - Step 117634: {'lr': 5.6720456942100374e-05, 'samples': 22585728, 'steps': 117633, 'loss/train': 1.4260761737823486} 11/07/2021 13:45:39 - INFO - __main__ - Step 117635: {'lr': 5.6717091119332016e-05, 'samples': 22585920, 'steps': 117634, 'loss/train': 1.511960744857788} 11/07/2021 13:45:40 - INFO - __main__ - Step 117636: {'lr': 5.6713725383653965e-05, 'samples': 22586112, 'steps': 117635, 'loss/train': 1.205812692642212} 11/07/2021 13:45:40 - INFO - __main__ - Step 117637: {'lr': 5.671035973506775e-05, 'samples': 22586304, 'steps': 117636, 'loss/train': 0.9158077836036682} 11/07/2021 13:45:41 - INFO - __main__ - Step 117638: {'lr': 5.670699417357483e-05, 'samples': 22586496, 'steps': 117637, 'loss/train': 1.4932072162628174} 11/07/2021 13:45:42 - INFO - __main__ - Step 117639: {'lr': 5.670362869917675e-05, 'samples': 22586688, 'steps': 117638, 'loss/train': 1.1679792404174805} 11/07/2021 13:45:42 - INFO - __main__ - Step 117640: {'lr': 5.670026331187505e-05, 'samples': 22586880, 'steps': 117639, 'loss/train': 1.2673931121826172} 11/07/2021 13:45:42 - INFO - __main__ - Step 117641: {'lr': 5.6696898011671244e-05, 'samples': 22587072, 'steps': 117640, 'loss/train': 0.05332955718040466} 11/07/2021 13:45:43 - INFO - __main__ - Step 117642: {'lr': 5.6693532798566816e-05, 'samples': 22587264, 'steps': 117641, 'loss/train': 1.14006769657135} 11/07/2021 13:45:44 - INFO - __main__ - Step 117643: {'lr': 5.66901676725633e-05, 'samples': 22587456, 'steps': 117642, 'loss/train': 0.8758665323257446} 11/07/2021 13:45:44 - INFO - __main__ - Step 117644: {'lr': 5.668680263366219e-05, 'samples': 22587648, 'steps': 117643, 'loss/train': 1.4188977479934692} 11/07/2021 13:45:45 - INFO - __main__ - Step 117645: {'lr': 5.668343768186504e-05, 'samples': 22587840, 'steps': 117644, 'loss/train': 0.053156014531850815} 11/07/2021 13:45:45 - INFO - __main__ - Step 117646: {'lr': 5.668007281717336e-05, 'samples': 22588032, 'steps': 117645, 'loss/train': 1.249374270439148} 11/07/2021 13:45:45 - INFO - __main__ - Step 117647: {'lr': 5.6676708039588715e-05, 'samples': 22588224, 'steps': 117646, 'loss/train': 1.0452877283096313} 11/07/2021 13:45:46 - INFO - __main__ - Step 117648: {'lr': 5.6673343349112506e-05, 'samples': 22588416, 'steps': 117647, 'loss/train': 1.3117533922195435} 11/07/2021 13:45:47 - INFO - __main__ - Step 117649: {'lr': 5.666997874574628e-05, 'samples': 22588608, 'steps': 117648, 'loss/train': 1.0726748704910278} 11/07/2021 13:45:47 - INFO - __main__ - Step 117650: {'lr': 5.66666142294916e-05, 'samples': 22588800, 'steps': 117649, 'loss/train': 1.342643141746521} 11/07/2021 13:45:48 - INFO - __main__ - Step 117651: {'lr': 5.666324980034995e-05, 'samples': 22588992, 'steps': 117650, 'loss/train': 1.551982045173645} 11/07/2021 13:45:48 - INFO - __main__ - Step 117652: {'lr': 5.6659885458322875e-05, 'samples': 22589184, 'steps': 117651, 'loss/train': 1.0756114721298218} 11/07/2021 13:45:49 - INFO - __main__ - Step 117653: {'lr': 5.665652120341186e-05, 'samples': 22589376, 'steps': 117652, 'loss/train': 0.8245126008987427} 11/07/2021 13:45:49 - INFO - __main__ - Step 117654: {'lr': 5.665315703561844e-05, 'samples': 22589568, 'steps': 117653, 'loss/train': 1.0015997886657715} 11/07/2021 13:45:50 - INFO - __main__ - Step 117655: {'lr': 5.6649792954944104e-05, 'samples': 22589760, 'steps': 117654, 'loss/train': 1.5895686149597168} 11/07/2021 13:45:50 - INFO - __main__ - Step 117656: {'lr': 5.664642896139041e-05, 'samples': 22589952, 'steps': 117655, 'loss/train': 1.1816083192825317} 11/07/2021 13:45:50 - INFO - __main__ - Step 117657: {'lr': 5.6643065054958836e-05, 'samples': 22590144, 'steps': 117656, 'loss/train': 1.332321286201477} 11/07/2021 13:45:52 - INFO - __main__ - Step 117658: {'lr': 5.6639701235650904e-05, 'samples': 22590336, 'steps': 117657, 'loss/train': 1.2152622938156128} 11/07/2021 13:45:52 - INFO - __main__ - Step 117659: {'lr': 5.6636337503468164e-05, 'samples': 22590528, 'steps': 117658, 'loss/train': 1.428560733795166} 11/07/2021 13:45:52 - INFO - __main__ - Step 117660: {'lr': 5.663297385841215e-05, 'samples': 22590720, 'steps': 117659, 'loss/train': 0.890975832939148} 11/07/2021 13:45:53 - INFO - __main__ - Step 117661: {'lr': 5.662961030048427e-05, 'samples': 22590912, 'steps': 117660, 'loss/train': 0.609849214553833} 11/07/2021 13:45:53 - INFO - __main__ - Step 117662: {'lr': 5.6626246829686115e-05, 'samples': 22591104, 'steps': 117661, 'loss/train': 1.2917389869689941} 11/07/2021 13:45:54 - INFO - __main__ - Step 117663: {'lr': 5.662288344601921e-05, 'samples': 22591296, 'steps': 117662, 'loss/train': 0.8907296061515808} 11/07/2021 13:45:55 - INFO - __main__ - Step 117664: {'lr': 5.6619520149485015e-05, 'samples': 22591488, 'steps': 117663, 'loss/train': 1.601893424987793} 11/07/2021 13:45:55 - INFO - __main__ - Step 117665: {'lr': 5.6616156940085095e-05, 'samples': 22591680, 'steps': 117664, 'loss/train': 1.7863658666610718} 11/07/2021 13:45:55 - INFO - __main__ - Step 117666: {'lr': 5.6612793817820945e-05, 'samples': 22591872, 'steps': 117665, 'loss/train': 1.3376305103302002} 11/07/2021 13:45:56 - INFO - __main__ - Step 117667: {'lr': 5.6609430782694096e-05, 'samples': 22592064, 'steps': 117666, 'loss/train': 1.106544852256775} 11/07/2021 13:45:57 - INFO - __main__ - Step 117668: {'lr': 5.660606783470604e-05, 'samples': 22592256, 'steps': 117667, 'loss/train': 1.3371835947036743} 11/07/2021 13:45:57 - INFO - __main__ - Step 117669: {'lr': 5.6602704973858306e-05, 'samples': 22592448, 'steps': 117668, 'loss/train': 1.661702275276184} 11/07/2021 13:45:57 - INFO - __main__ - Step 117670: {'lr': 5.659934220015242e-05, 'samples': 22592640, 'steps': 117669, 'loss/train': 1.1875038146972656} 11/07/2021 13:45:58 - INFO - __main__ - Step 117671: {'lr': 5.6595979513589883e-05, 'samples': 22592832, 'steps': 117670, 'loss/train': 1.0302557945251465} 11/07/2021 13:45:58 - INFO - __main__ - Step 117672: {'lr': 5.6592616914172277e-05, 'samples': 22593024, 'steps': 117671, 'loss/train': 1.5834431648254395} 11/07/2021 13:45:58 - INFO - __main__ - Step 117673: {'lr': 5.658925440190099e-05, 'samples': 22593216, 'steps': 117672, 'loss/train': 1.9365628957748413} 11/07/2021 13:45:59 - INFO - __main__ - Step 117674: {'lr': 5.6585891976777604e-05, 'samples': 22593408, 'steps': 117673, 'loss/train': 1.001682996749878} 11/07/2021 13:46:00 - INFO - __main__ - Step 117675: {'lr': 5.6582529638803614e-05, 'samples': 22593600, 'steps': 117674, 'loss/train': 1.3119581937789917} 11/07/2021 13:46:00 - INFO - __main__ - Step 117676: {'lr': 5.657916738798055e-05, 'samples': 22593792, 'steps': 117675, 'loss/train': 1.287880539894104} 11/07/2021 13:46:00 - INFO - __main__ - Step 117677: {'lr': 5.657580522430994e-05, 'samples': 22593984, 'steps': 117676, 'loss/train': 1.3279439210891724} 11/07/2021 13:46:01 - INFO - __main__ - Step 117678: {'lr': 5.657244314779331e-05, 'samples': 22594176, 'steps': 117677, 'loss/train': 1.5704129934310913} 11/07/2021 13:46:03 - INFO - __main__ - Step 117679: {'lr': 5.656908115843212e-05, 'samples': 22594368, 'steps': 117678, 'loss/train': 1.5619384050369263} 11/07/2021 13:46:03 - INFO - __main__ - Step 117680: {'lr': 5.656571925622792e-05, 'samples': 22594560, 'steps': 117679, 'loss/train': 1.5514278411865234} 11/07/2021 13:46:03 - INFO - __main__ - Step 117681: {'lr': 5.656235744118224e-05, 'samples': 22594752, 'steps': 117680, 'loss/train': 0.854888916015625} 11/07/2021 13:46:04 - INFO - __main__ - Step 117682: {'lr': 5.655899571329656e-05, 'samples': 22594944, 'steps': 117681, 'loss/train': 1.36998450756073} 11/07/2021 13:46:04 - INFO - __main__ - Step 117683: {'lr': 5.65556340725725e-05, 'samples': 22595136, 'steps': 117682, 'loss/train': 0.7974054217338562} 11/07/2021 13:46:04 - INFO - __main__ - Step 117684: {'lr': 5.6552272519011375e-05, 'samples': 22595328, 'steps': 117683, 'loss/train': 0.773597776889801} 11/07/2021 13:46:06 - INFO - __main__ - Step 117685: {'lr': 5.654891105261487e-05, 'samples': 22595520, 'steps': 117684, 'loss/train': 0.8425158262252808} 11/07/2021 13:46:06 - INFO - __main__ - Step 117686: {'lr': 5.654554967338441e-05, 'samples': 22595712, 'steps': 117685, 'loss/train': 1.444057822227478} 11/07/2021 13:46:06 - INFO - __main__ - Step 117687: {'lr': 5.654218838132152e-05, 'samples': 22595904, 'steps': 117686, 'loss/train': 1.5502578020095825} 11/07/2021 13:46:07 - INFO - __main__ - Step 117688: {'lr': 5.653882717642775e-05, 'samples': 22596096, 'steps': 117687, 'loss/train': 0.8521437644958496} 11/07/2021 13:46:07 - INFO - __main__ - Step 117689: {'lr': 5.65354660587046e-05, 'samples': 22596288, 'steps': 117688, 'loss/train': 1.8494064807891846} 11/07/2021 13:46:08 - INFO - __main__ - Step 117690: {'lr': 5.6532105028153594e-05, 'samples': 22596480, 'steps': 117689, 'loss/train': 0.630007803440094} 11/07/2021 13:46:08 - INFO - __main__ - Step 117691: {'lr': 5.6528744084776234e-05, 'samples': 22596672, 'steps': 117690, 'loss/train': 0.9455832839012146} 11/07/2021 13:46:09 - INFO - __main__ - Step 117692: {'lr': 5.6525383228574036e-05, 'samples': 22596864, 'steps': 117691, 'loss/train': 0.8169618844985962} 11/07/2021 13:46:09 - INFO - __main__ - Step 117693: {'lr': 5.6522022459548515e-05, 'samples': 22597056, 'steps': 117692, 'loss/train': 1.40217125415802} 11/07/2021 13:46:09 - INFO - __main__ - Step 117694: {'lr': 5.651866177770124e-05, 'samples': 22597248, 'steps': 117693, 'loss/train': 1.2806265354156494} 11/07/2021 13:46:10 - INFO - __main__ - Step 117695: {'lr': 5.651530118303361e-05, 'samples': 22597440, 'steps': 117694, 'loss/train': 1.169434905052185} 11/07/2021 13:46:11 - INFO - __main__ - Step 117696: {'lr': 5.651194067554721e-05, 'samples': 22597632, 'steps': 117695, 'loss/train': 0.8458448648452759} 11/07/2021 13:46:11 - INFO - __main__ - Step 117697: {'lr': 5.650858025524353e-05, 'samples': 22597824, 'steps': 117696, 'loss/train': 0.6441358327865601} 11/07/2021 13:46:12 - INFO - __main__ - Step 117698: {'lr': 5.650521992212409e-05, 'samples': 22598016, 'steps': 117697, 'loss/train': 1.6840578317642212} 11/07/2021 13:46:12 - INFO - __main__ - Step 117699: {'lr': 5.650185967619045e-05, 'samples': 22598208, 'steps': 117698, 'loss/train': 0.8505609035491943} 11/07/2021 13:46:12 - INFO - __main__ - Step 117700: {'lr': 5.6498499517444044e-05, 'samples': 22598400, 'steps': 117699, 'loss/train': 1.5743461847305298} 11/07/2021 13:46:13 - INFO - __main__ - Step 117701: {'lr': 5.6495139445886465e-05, 'samples': 22598592, 'steps': 117700, 'loss/train': 1.0377888679504395} 11/07/2021 13:46:14 - INFO - __main__ - Step 117702: {'lr': 5.649177946151915e-05, 'samples': 22598784, 'steps': 117701, 'loss/train': 1.054595947265625} 11/07/2021 13:46:14 - INFO - __main__ - Step 117703: {'lr': 5.6488419564343695e-05, 'samples': 22598976, 'steps': 117702, 'loss/train': 0.9208033084869385} 11/07/2021 13:46:14 - INFO - __main__ - Step 117704: {'lr': 5.648505975436155e-05, 'samples': 22599168, 'steps': 117703, 'loss/train': 1.2344797849655151} 11/07/2021 13:46:15 - INFO - __main__ - Step 117705: {'lr': 5.6481700031574324e-05, 'samples': 22599360, 'steps': 117704, 'loss/train': 1.256798267364502} 11/07/2021 13:46:16 - INFO - __main__ - Step 117706: {'lr': 5.647834039598338e-05, 'samples': 22599552, 'steps': 117705, 'loss/train': 1.326643466949463} 11/07/2021 13:46:16 - INFO - __main__ - Step 117707: {'lr': 5.647498084759031e-05, 'samples': 22599744, 'steps': 117706, 'loss/train': 0.9927430748939514} 11/07/2021 13:46:16 - INFO - __main__ - Step 117708: {'lr': 5.647162138639664e-05, 'samples': 22599936, 'steps': 117707, 'loss/train': 1.0834859609603882} 11/07/2021 13:46:17 - INFO - __main__ - Step 117709: {'lr': 5.646826201240385e-05, 'samples': 22600128, 'steps': 117708, 'loss/train': 0.7161469459533691} 11/07/2021 13:46:17 - INFO - __main__ - Step 117710: {'lr': 5.64649027256135e-05, 'samples': 22600320, 'steps': 117709, 'loss/train': 1.471521019935608} 11/07/2021 13:46:19 - INFO - __main__ - Step 117711: {'lr': 5.6461543526027056e-05, 'samples': 22600512, 'steps': 117710, 'loss/train': 1.2806965112686157} 11/07/2021 13:46:19 - INFO - __main__ - Step 117712: {'lr': 5.645818441364606e-05, 'samples': 22600704, 'steps': 117711, 'loss/train': 1.7818728685379028} 11/07/2021 13:46:19 - INFO - __main__ - Step 117713: {'lr': 5.645482538847202e-05, 'samples': 22600896, 'steps': 117712, 'loss/train': 1.3317162990570068} 11/07/2021 13:46:20 - INFO - __main__ - Step 117714: {'lr': 5.6451466450506474e-05, 'samples': 22601088, 'steps': 117713, 'loss/train': 1.1474134922027588} 11/07/2021 13:46:20 - INFO - __main__ - Step 117715: {'lr': 5.644810759975094e-05, 'samples': 22601280, 'steps': 117714, 'loss/train': 0.891796886920929} 11/07/2021 13:46:21 - INFO - __main__ - Step 117716: {'lr': 5.644474883620687e-05, 'samples': 22601472, 'steps': 117715, 'loss/train': 0.13772180676460266} 11/07/2021 13:46:21 - INFO - __main__ - Step 117717: {'lr': 5.6441390159875786e-05, 'samples': 22601664, 'steps': 117716, 'loss/train': 1.1757757663726807} 11/07/2021 13:46:22 - INFO - __main__ - Step 117718: {'lr': 5.643803157075922e-05, 'samples': 22601856, 'steps': 117717, 'loss/train': 1.5144227743148804} 11/07/2021 13:46:22 - INFO - __main__ - Step 117719: {'lr': 5.643467306885871e-05, 'samples': 22602048, 'steps': 117718, 'loss/train': 1.5395516157150269} 11/07/2021 13:46:22 - INFO - __main__ - Step 117720: {'lr': 5.643131465417575e-05, 'samples': 22602240, 'steps': 117719, 'loss/train': 1.393128514289856} 11/07/2021 13:46:24 - INFO - __main__ - Step 117721: {'lr': 5.6427956326711825e-05, 'samples': 22602432, 'steps': 117720, 'loss/train': 1.158742904663086} 11/07/2021 13:46:24 - INFO - __main__ - Step 117722: {'lr': 5.6424598086468494e-05, 'samples': 22602624, 'steps': 117721, 'loss/train': 1.0522797107696533} 11/07/2021 13:46:24 - INFO - __main__ - Step 117723: {'lr': 5.642123993344725e-05, 'samples': 22602816, 'steps': 117722, 'loss/train': 1.322550892829895} 11/07/2021 13:46:25 - INFO - __main__ - Step 117724: {'lr': 5.641788186764962e-05, 'samples': 22603008, 'steps': 117723, 'loss/train': 1.695482850074768} 11/07/2021 13:46:25 - INFO - __main__ - Step 117725: {'lr': 5.6414523889077084e-05, 'samples': 22603200, 'steps': 117724, 'loss/train': 1.2116727828979492} 11/07/2021 13:46:26 - INFO - __main__ - Step 117726: {'lr': 5.641116599773119e-05, 'samples': 22603392, 'steps': 117725, 'loss/train': 0.9416276216506958} 11/07/2021 13:46:26 - INFO - __main__ - Step 117727: {'lr': 5.640780819361352e-05, 'samples': 22603584, 'steps': 117726, 'loss/train': 0.7187889218330383} 11/07/2021 13:46:27 - INFO - __main__ - Step 117728: {'lr': 5.6404450476725405e-05, 'samples': 22603776, 'steps': 117727, 'loss/train': 1.3192437887191772} 11/07/2021 13:46:27 - INFO - __main__ - Step 117729: {'lr': 5.6401092847068485e-05, 'samples': 22603968, 'steps': 117728, 'loss/train': 1.7881197929382324} 11/07/2021 13:46:27 - INFO - __main__ - Step 117730: {'lr': 5.6397735304644235e-05, 'samples': 22604160, 'steps': 117729, 'loss/train': 1.3346610069274902} 11/07/2021 13:46:29 - INFO - __main__ - Step 117731: {'lr': 5.639437784945417e-05, 'samples': 22604352, 'steps': 117730, 'loss/train': 1.1054776906967163} 11/07/2021 13:46:29 - INFO - __main__ - Step 117732: {'lr': 5.63910204814998e-05, 'samples': 22604544, 'steps': 117731, 'loss/train': 1.3897074460983276} 11/07/2021 13:46:30 - INFO - __main__ - Step 117733: {'lr': 5.6387663200782676e-05, 'samples': 22604736, 'steps': 117732, 'loss/train': 1.0313713550567627} 11/07/2021 13:46:30 - INFO - __main__ - Step 117734: {'lr': 5.638430600730427e-05, 'samples': 22604928, 'steps': 117733, 'loss/train': 1.5199300050735474} 11/07/2021 13:46:30 - INFO - __main__ - Step 117735: {'lr': 5.63809489010661e-05, 'samples': 22605120, 'steps': 117734, 'loss/train': 1.244797706604004} 11/07/2021 13:46:31 - INFO - __main__ - Step 117736: {'lr': 5.63775918820697e-05, 'samples': 22605312, 'steps': 117735, 'loss/train': 1.0741242170333862} 11/07/2021 13:46:32 - INFO - __main__ - Step 117737: {'lr': 5.637423495031657e-05, 'samples': 22605504, 'steps': 117736, 'loss/train': 0.29262852668762207} 11/07/2021 13:46:32 - INFO - __main__ - Step 117738: {'lr': 5.637087810580821e-05, 'samples': 22605696, 'steps': 117737, 'loss/train': 0.43368321657180786} 11/07/2021 13:46:33 - INFO - __main__ - Step 117739: {'lr': 5.636752134854614e-05, 'samples': 22605888, 'steps': 117738, 'loss/train': 1.6660449504852295} 11/07/2021 13:46:33 - INFO - __main__ - Step 117740: {'lr': 5.636416467853189e-05, 'samples': 22606080, 'steps': 117739, 'loss/train': 1.3687355518341064} 11/07/2021 13:46:33 - INFO - __main__ - Step 117741: {'lr': 5.636080809576705e-05, 'samples': 22606272, 'steps': 117740, 'loss/train': 2.1220145225524902} 11/07/2021 13:46:34 - INFO - __main__ - Step 117742: {'lr': 5.635745160025294e-05, 'samples': 22606464, 'steps': 117741, 'loss/train': 1.2525275945663452} 11/07/2021 13:46:35 - INFO - __main__ - Step 117743: {'lr': 5.6354095191991194e-05, 'samples': 22606656, 'steps': 117742, 'loss/train': 1.1822775602340698} 11/07/2021 13:46:35 - INFO - __main__ - Step 117744: {'lr': 5.63507388709833e-05, 'samples': 22606848, 'steps': 117743, 'loss/train': 5.035028457641602} 11/07/2021 13:46:35 - INFO - __main__ - Step 117745: {'lr': 5.6347382637230746e-05, 'samples': 22607040, 'steps': 117744, 'loss/train': 1.3666683435440063} 11/07/2021 13:46:36 - INFO - __main__ - Step 117746: {'lr': 5.634402649073511e-05, 'samples': 22607232, 'steps': 117745, 'loss/train': 1.5368136167526245} 11/07/2021 13:46:36 - INFO - __main__ - Step 117747: {'lr': 5.634067043149785e-05, 'samples': 22607424, 'steps': 117746, 'loss/train': 1.2803345918655396} 11/07/2021 13:46:37 - INFO - __main__ - Step 117748: {'lr': 5.633731445952051e-05, 'samples': 22607616, 'steps': 117747, 'loss/train': 1.3961730003356934} 11/07/2021 13:46:38 - INFO - __main__ - Step 117749: {'lr': 5.633395857480456e-05, 'samples': 22607808, 'steps': 117748, 'loss/train': 1.1696116924285889} 11/07/2021 13:46:38 - INFO - __main__ - Step 117750: {'lr': 5.6330602777351556e-05, 'samples': 22608000, 'steps': 117749, 'loss/train': 1.2635793685913086} 11/07/2021 13:46:38 - INFO - __main__ - Step 117751: {'lr': 5.6327247067163e-05, 'samples': 22608192, 'steps': 117750, 'loss/train': 1.1938610076904297} 11/07/2021 13:46:39 - INFO - __main__ - Step 117752: {'lr': 5.632389144424038e-05, 'samples': 22608384, 'steps': 117751, 'loss/train': 0.799423336982727} 11/07/2021 13:46:40 - INFO - __main__ - Step 117753: {'lr': 5.632053590858524e-05, 'samples': 22608576, 'steps': 117752, 'loss/train': 1.1857450008392334} 11/07/2021 13:46:40 - INFO - __main__ - Step 117754: {'lr': 5.6317180460199155e-05, 'samples': 22608768, 'steps': 117753, 'loss/train': 1.378414511680603} 11/07/2021 13:46:40 - INFO - __main__ - Step 117755: {'lr': 5.631382509908348e-05, 'samples': 22608960, 'steps': 117754, 'loss/train': 1.5222628116607666} 11/07/2021 13:46:41 - INFO - __main__ - Step 117756: {'lr': 5.6310469825239824e-05, 'samples': 22609152, 'steps': 117755, 'loss/train': 0.9746707677841187} 11/07/2021 13:46:41 - INFO - __main__ - Step 117757: {'lr': 5.6307114638669666e-05, 'samples': 22609344, 'steps': 117756, 'loss/train': 1.3282809257507324} 11/07/2021 13:46:43 - INFO - __main__ - Step 117758: {'lr': 5.630375953937453e-05, 'samples': 22609536, 'steps': 117757, 'loss/train': 1.2948240041732788} 11/07/2021 13:46:44 - INFO - __main__ - Step 117759: {'lr': 5.6300404527355935e-05, 'samples': 22609728, 'steps': 117758, 'loss/train': 1.458156943321228} 11/07/2021 13:46:44 - INFO - __main__ - Step 117760: {'lr': 5.629704960261539e-05, 'samples': 22609920, 'steps': 117759, 'loss/train': 1.1709487438201904} 11/07/2021 13:46:44 - INFO - __main__ - Step 117761: {'lr': 5.629369476515439e-05, 'samples': 22610112, 'steps': 117760, 'loss/train': 2.129842519760132} 11/07/2021 13:46:45 - INFO - __main__ - Step 117762: {'lr': 5.629034001497449e-05, 'samples': 22610304, 'steps': 117761, 'loss/train': 1.0327467918395996} 11/07/2021 13:46:45 - INFO - __main__ - Step 117763: {'lr': 5.6286985352077156e-05, 'samples': 22610496, 'steps': 117762, 'loss/train': 1.7283920049667358} 11/07/2021 13:46:45 - INFO - __main__ - Step 117764: {'lr': 5.6283630776463895e-05, 'samples': 22610688, 'steps': 117763, 'loss/train': 0.8071074485778809} 11/07/2021 13:46:46 - INFO - __main__ - Step 117765: {'lr': 5.628027628813629e-05, 'samples': 22610880, 'steps': 117764, 'loss/train': 0.8440547585487366} 11/07/2021 13:46:47 - INFO - __main__ - Step 117766: {'lr': 5.627692188709577e-05, 'samples': 22611072, 'steps': 117765, 'loss/train': 1.468694806098938} 11/07/2021 13:46:47 - INFO - __main__ - Step 117767: {'lr': 5.6273567573343896e-05, 'samples': 22611264, 'steps': 117766, 'loss/train': 1.3394334316253662} 11/07/2021 13:46:47 - INFO - __main__ - Step 117768: {'lr': 5.62702133468822e-05, 'samples': 22611456, 'steps': 117767, 'loss/train': 1.3562144041061401} 11/07/2021 13:46:48 - INFO - __main__ - Step 117769: {'lr': 5.6266859207712126e-05, 'samples': 22611648, 'steps': 117768, 'loss/train': 1.2554463148117065} 11/07/2021 13:46:48 - INFO - __main__ - Step 117770: {'lr': 5.62635051558352e-05, 'samples': 22611840, 'steps': 117769, 'loss/train': 1.127733826637268} 11/07/2021 13:46:49 - INFO - __main__ - Step 117771: {'lr': 5.6260151191252965e-05, 'samples': 22612032, 'steps': 117770, 'loss/train': 0.8982546925544739} 11/07/2021 13:46:50 - INFO - __main__ - Step 117772: {'lr': 5.625679731396691e-05, 'samples': 22612224, 'steps': 117771, 'loss/train': 1.0701355934143066} 11/07/2021 13:46:50 - INFO - __main__ - Step 117773: {'lr': 5.6253443523978515e-05, 'samples': 22612416, 'steps': 117772, 'loss/train': 1.569420576095581} 11/07/2021 13:46:50 - INFO - __main__ - Step 117774: {'lr': 5.6250089821289375e-05, 'samples': 22612608, 'steps': 117773, 'loss/train': 1.285535216331482} 11/07/2021 13:46:51 - INFO - __main__ - Step 117775: {'lr': 5.6246736205900926e-05, 'samples': 22612800, 'steps': 117774, 'loss/train': 1.637096643447876} 11/07/2021 13:46:51 - INFO - __main__ - Step 117776: {'lr': 5.624338267781473e-05, 'samples': 22612992, 'steps': 117775, 'loss/train': 1.5246013402938843} 11/07/2021 13:46:52 - INFO - __main__ - Step 117777: {'lr': 5.624002923703225e-05, 'samples': 22613184, 'steps': 117776, 'loss/train': 1.141854166984558} 11/07/2021 13:46:53 - INFO - __main__ - Step 117778: {'lr': 5.623667588355505e-05, 'samples': 22613376, 'steps': 117777, 'loss/train': 1.488526701927185} 11/07/2021 13:46:53 - INFO - __main__ - Step 117779: {'lr': 5.623332261738462e-05, 'samples': 22613568, 'steps': 117778, 'loss/train': 0.9964814782142639} 11/07/2021 13:46:54 - INFO - __main__ - Step 117780: {'lr': 5.622996943852243e-05, 'samples': 22613760, 'steps': 117779, 'loss/train': 1.3203989267349243} 11/07/2021 13:46:55 - INFO - __main__ - Step 117781: {'lr': 5.6226616346970125e-05, 'samples': 22613952, 'steps': 117780, 'loss/train': 0.8417327404022217} 11/07/2021 13:46:55 - INFO - __main__ - Step 117782: {'lr': 5.622326334272904e-05, 'samples': 22614144, 'steps': 117781, 'loss/train': 1.4613053798675537} 11/07/2021 13:46:55 - INFO - __main__ - Step 117783: {'lr': 5.621991042580074e-05, 'samples': 22614336, 'steps': 117782, 'loss/train': 1.2710254192352295} 11/07/2021 13:46:56 - INFO - __main__ - Step 117784: {'lr': 5.621655759618677e-05, 'samples': 22614528, 'steps': 117783, 'loss/train': 1.1745126247406006} 11/07/2021 13:46:56 - INFO - __main__ - Step 117785: {'lr': 5.6213204853888646e-05, 'samples': 22614720, 'steps': 117784, 'loss/train': 1.4636884927749634} 11/07/2021 13:46:56 - INFO - __main__ - Step 117786: {'lr': 5.620985219890784e-05, 'samples': 22614912, 'steps': 117785, 'loss/train': 1.5942169427871704} 11/07/2021 13:46:58 - INFO - __main__ - Step 117787: {'lr': 5.620649963124591e-05, 'samples': 22615104, 'steps': 117786, 'loss/train': 1.3719277381896973} 11/07/2021 13:46:58 - INFO - __main__ - Step 117788: {'lr': 5.6203147150904326e-05, 'samples': 22615296, 'steps': 117787, 'loss/train': 1.4408212900161743} 11/07/2021 13:46:59 - INFO - __main__ - Step 117789: {'lr': 5.6199794757884614e-05, 'samples': 22615488, 'steps': 117788, 'loss/train': 1.0618352890014648} 11/07/2021 13:46:59 - INFO - __main__ - Step 117790: {'lr': 5.619644245218827e-05, 'samples': 22615680, 'steps': 117789, 'loss/train': 1.4979121685028076} 11/07/2021 13:46:59 - INFO - __main__ - Step 117791: {'lr': 5.6193090233816826e-05, 'samples': 22615872, 'steps': 117790, 'loss/train': 1.2918733358383179} 11/07/2021 13:47:00 - INFO - __main__ - Step 117792: {'lr': 5.618973810277178e-05, 'samples': 22616064, 'steps': 117791, 'loss/train': 0.6656963229179382} 11/07/2021 13:47:01 - INFO - __main__ - Step 117793: {'lr': 5.6186386059054686e-05, 'samples': 22616256, 'steps': 117792, 'loss/train': 0.5494286417961121} 11/07/2021 13:47:01 - INFO - __main__ - Step 117794: {'lr': 5.618303410266698e-05, 'samples': 22616448, 'steps': 117793, 'loss/train': 1.2732704877853394} 11/07/2021 13:47:02 - INFO - __main__ - Step 117795: {'lr': 5.617968223361028e-05, 'samples': 22616640, 'steps': 117794, 'loss/train': 1.1048489809036255} 11/07/2021 13:47:02 - INFO - __main__ - Step 117796: {'lr': 5.6176330451885946e-05, 'samples': 22616832, 'steps': 117795, 'loss/train': 1.261149287223816} 11/07/2021 13:47:02 - INFO - __main__ - Step 117797: {'lr': 5.617297875749558e-05, 'samples': 22617024, 'steps': 117796, 'loss/train': 1.4566965103149414} 11/07/2021 13:47:03 - INFO - __main__ - Step 117798: {'lr': 5.616962715044069e-05, 'samples': 22617216, 'steps': 117797, 'loss/train': 1.4926881790161133} 11/07/2021 13:47:04 - INFO - __main__ - Step 117799: {'lr': 5.616627563072277e-05, 'samples': 22617408, 'steps': 117798, 'loss/train': 1.4460965394973755} 11/07/2021 13:47:04 - INFO - __main__ - Step 117800: {'lr': 5.616292419834332e-05, 'samples': 22617600, 'steps': 117799, 'loss/train': 1.5184977054595947} 11/07/2021 13:47:05 - INFO - __main__ - Step 117801: {'lr': 5.6159572853303864e-05, 'samples': 22617792, 'steps': 117800, 'loss/train': 1.4875009059906006} 11/07/2021 13:47:05 - INFO - __main__ - Step 117802: {'lr': 5.6156221595605935e-05, 'samples': 22617984, 'steps': 117801, 'loss/train': 1.031266212463379} 11/07/2021 13:47:06 - INFO - __main__ - Step 117803: {'lr': 5.6152870425250994e-05, 'samples': 22618176, 'steps': 117802, 'loss/train': 1.5887953042984009} 11/07/2021 13:47:06 - INFO - __main__ - Step 117804: {'lr': 5.6149519342240607e-05, 'samples': 22618368, 'steps': 117803, 'loss/train': 1.1885254383087158} 11/07/2021 13:47:07 - INFO - __main__ - Step 117805: {'lr': 5.6146168346576236e-05, 'samples': 22618560, 'steps': 117804, 'loss/train': 1.7753901481628418} 11/07/2021 13:47:07 - INFO - __main__ - Step 117806: {'lr': 5.6142817438259415e-05, 'samples': 22618752, 'steps': 117805, 'loss/train': 1.1883480548858643} 11/07/2021 13:47:07 - INFO - __main__ - Step 117807: {'lr': 5.613946661729166e-05, 'samples': 22618944, 'steps': 117806, 'loss/train': 1.2834038734436035} 11/07/2021 13:47:08 - INFO - __main__ - Step 117808: {'lr': 5.6136115883674536e-05, 'samples': 22619136, 'steps': 117807, 'loss/train': 1.3715565204620361} 11/07/2021 13:47:09 - INFO - __main__ - Step 117809: {'lr': 5.61327652374094e-05, 'samples': 22619328, 'steps': 117808, 'loss/train': 1.4202295541763306} 11/07/2021 13:47:09 - INFO - __main__ - Step 117810: {'lr': 5.6129414678497856e-05, 'samples': 22619520, 'steps': 117809, 'loss/train': 1.3811559677124023} 11/07/2021 13:47:09 - INFO - __main__ - Step 117811: {'lr': 5.612606420694141e-05, 'samples': 22619712, 'steps': 117810, 'loss/train': 0.7026497721672058} 11/07/2021 13:47:10 - INFO - __main__ - Step 117812: {'lr': 5.612271382274159e-05, 'samples': 22619904, 'steps': 117811, 'loss/train': 1.6168581247329712} 11/07/2021 13:47:11 - INFO - __main__ - Step 117813: {'lr': 5.6119363525899855e-05, 'samples': 22620096, 'steps': 117812, 'loss/train': 0.9761037826538086} 11/07/2021 13:47:11 - INFO - __main__ - Step 117814: {'lr': 5.611601331641775e-05, 'samples': 22620288, 'steps': 117813, 'loss/train': 1.124147653579712} 11/07/2021 13:47:12 - INFO - __main__ - Step 117815: {'lr': 5.611266319429678e-05, 'samples': 22620480, 'steps': 117814, 'loss/train': 1.175001621246338} 11/07/2021 13:47:12 - INFO - __main__ - Step 117816: {'lr': 5.6109313159538436e-05, 'samples': 22620672, 'steps': 117815, 'loss/train': 1.047357201576233} 11/07/2021 13:47:12 - INFO - __main__ - Step 117817: {'lr': 5.6105963212144253e-05, 'samples': 22620864, 'steps': 117816, 'loss/train': 1.0579348802566528} 11/07/2021 13:47:14 - INFO - __main__ - Step 117818: {'lr': 5.610261335211575e-05, 'samples': 22621056, 'steps': 117817, 'loss/train': 1.2043360471725464} 11/07/2021 13:47:14 - INFO - __main__ - Step 117819: {'lr': 5.6099263579454384e-05, 'samples': 22621248, 'steps': 117818, 'loss/train': 1.5062860250473022} 11/07/2021 13:47:14 - INFO - __main__ - Step 117820: {'lr': 5.609591389416171e-05, 'samples': 22621440, 'steps': 117819, 'loss/train': 1.2028205394744873} 11/07/2021 13:47:15 - INFO - __main__ - Step 117821: {'lr': 5.6092564296239325e-05, 'samples': 22621632, 'steps': 117820, 'loss/train': 1.0375802516937256} 11/07/2021 13:47:15 - INFO - __main__ - Step 117822: {'lr': 5.608921478568854e-05, 'samples': 22621824, 'steps': 117821, 'loss/train': 1.0811691284179688} 11/07/2021 13:47:15 - INFO - __main__ - Step 117823: {'lr': 5.6085865362510954e-05, 'samples': 22622016, 'steps': 117822, 'loss/train': 1.4612318277359009} 11/07/2021 13:47:16 - INFO - __main__ - Step 117824: {'lr': 5.608251602670811e-05, 'samples': 22622208, 'steps': 117823, 'loss/train': 0.990825355052948} 11/07/2021 13:47:17 - INFO - __main__ - Step 117825: {'lr': 5.607916677828148e-05, 'samples': 22622400, 'steps': 117824, 'loss/train': 1.2134194374084473} 11/07/2021 13:47:17 - INFO - __main__ - Step 117826: {'lr': 5.607581761723257e-05, 'samples': 22622592, 'steps': 117825, 'loss/train': 1.151599407196045} 11/07/2021 13:47:17 - INFO - __main__ - Step 117827: {'lr': 5.607246854356293e-05, 'samples': 22622784, 'steps': 117826, 'loss/train': 1.3630053997039795} 11/07/2021 13:47:18 - INFO - __main__ - Step 117828: {'lr': 5.6069119557274035e-05, 'samples': 22622976, 'steps': 117827, 'loss/train': 0.9962207674980164} 11/07/2021 13:47:19 - INFO - __main__ - Step 117829: {'lr': 5.60657706583674e-05, 'samples': 22623168, 'steps': 117828, 'loss/train': 0.9408150315284729} 11/07/2021 13:47:19 - INFO - __main__ - Step 117830: {'lr': 5.606242184684451e-05, 'samples': 22623360, 'steps': 117829, 'loss/train': 1.4416648149490356} 11/07/2021 13:47:20 - INFO - __main__ - Step 117831: {'lr': 5.6059073122706945e-05, 'samples': 22623552, 'steps': 117830, 'loss/train': 1.0016741752624512} 11/07/2021 13:47:20 - INFO - __main__ - Step 117832: {'lr': 5.6055724485956136e-05, 'samples': 22623744, 'steps': 117831, 'loss/train': 1.4035115242004395} 11/07/2021 13:47:20 - INFO - __main__ - Step 117833: {'lr': 5.6052375936593655e-05, 'samples': 22623936, 'steps': 117832, 'loss/train': 1.4136159420013428} 11/07/2021 13:47:21 - INFO - __main__ - Step 117834: {'lr': 5.604902747462096e-05, 'samples': 22624128, 'steps': 117833, 'loss/train': 1.828152060508728} 11/07/2021 13:47:22 - INFO - __main__ - Step 117835: {'lr': 5.604567910003966e-05, 'samples': 22624320, 'steps': 117834, 'loss/train': 0.7889447808265686} 11/07/2021 13:47:22 - INFO - __main__ - Step 117836: {'lr': 5.6042330812851094e-05, 'samples': 22624512, 'steps': 117835, 'loss/train': 1.342165470123291} 11/07/2021 13:47:22 - INFO - __main__ - Step 117837: {'lr': 5.6038982613056874e-05, 'samples': 22624704, 'steps': 117836, 'loss/train': 1.4890737533569336} 11/07/2021 13:47:23 - INFO - __main__ - Step 117838: {'lr': 5.603563450065849e-05, 'samples': 22624896, 'steps': 117837, 'loss/train': 1.148332118988037} 11/07/2021 13:47:23 - INFO - __main__ - Step 117839: {'lr': 5.6032286475657474e-05, 'samples': 22625088, 'steps': 117838, 'loss/train': 1.3876811265945435} 11/07/2021 13:47:24 - INFO - __main__ - Step 117840: {'lr': 5.6028938538055297e-05, 'samples': 22625280, 'steps': 117839, 'loss/train': 1.3323339223861694} 11/07/2021 13:47:24 - INFO - __main__ - Step 117841: {'lr': 5.602559068785351e-05, 'samples': 22625472, 'steps': 117840, 'loss/train': 1.339822769165039} 11/07/2021 13:47:25 - INFO - __main__ - Step 117842: {'lr': 5.602224292505356e-05, 'samples': 22625664, 'steps': 117841, 'loss/train': 0.7078068852424622} 11/07/2021 13:47:25 - INFO - __main__ - Step 117843: {'lr': 5.601889524965703e-05, 'samples': 22625856, 'steps': 117842, 'loss/train': 2.117016077041626} 11/07/2021 13:47:25 - INFO - __main__ - Step 117844: {'lr': 5.601554766166539e-05, 'samples': 22626048, 'steps': 117843, 'loss/train': 1.470697283744812} 11/07/2021 13:47:27 - INFO - __main__ - Step 117845: {'lr': 5.601220016108013e-05, 'samples': 22626240, 'steps': 117844, 'loss/train': 1.5005226135253906} 11/07/2021 13:47:27 - INFO - __main__ - Step 117846: {'lr': 5.600885274790279e-05, 'samples': 22626432, 'steps': 117845, 'loss/train': 0.7001411318778992} 11/07/2021 13:47:27 - INFO - __main__ - Step 117847: {'lr': 5.6005505422134866e-05, 'samples': 22626624, 'steps': 117846, 'loss/train': 1.7402738332748413} 11/07/2021 13:47:28 - INFO - __main__ - Step 117848: {'lr': 5.6002158183777936e-05, 'samples': 22626816, 'steps': 117847, 'loss/train': 1.3434401750564575} 11/07/2021 13:47:28 - INFO - __main__ - Step 117849: {'lr': 5.599881103283338e-05, 'samples': 22627008, 'steps': 117848, 'loss/train': 1.2536065578460693} 11/07/2021 13:47:28 - INFO - __main__ - Step 117850: {'lr': 5.599546396930277e-05, 'samples': 22627200, 'steps': 117849, 'loss/train': 1.4949406385421753} 11/07/2021 13:47:29 - INFO - __main__ - Step 117851: {'lr': 5.59921169931876e-05, 'samples': 22627392, 'steps': 117850, 'loss/train': 1.2882087230682373} 11/07/2021 13:47:30 - INFO - __main__ - Step 117852: {'lr': 5.598877010448938e-05, 'samples': 22627584, 'steps': 117851, 'loss/train': 1.8580453395843506} 11/07/2021 13:47:30 - INFO - __main__ - Step 117853: {'lr': 5.598542330320963e-05, 'samples': 22627776, 'steps': 117852, 'loss/train': 1.185723900794983} 11/07/2021 13:47:30 - INFO - __main__ - Step 117854: {'lr': 5.5982076589349866e-05, 'samples': 22627968, 'steps': 117853, 'loss/train': 1.1985468864440918} 11/07/2021 13:47:31 - INFO - __main__ - Step 117855: {'lr': 5.597872996291156e-05, 'samples': 22628160, 'steps': 117854, 'loss/train': 1.4611618518829346} 11/07/2021 13:47:32 - INFO - __main__ - Step 117856: {'lr': 5.597538342389627e-05, 'samples': 22628352, 'steps': 117855, 'loss/train': 1.2340998649597168} 11/07/2021 13:47:32 - INFO - __main__ - Step 117857: {'lr': 5.597203697230549e-05, 'samples': 22628544, 'steps': 117856, 'loss/train': 1.2916557788848877} 11/07/2021 13:47:33 - INFO - __main__ - Step 117858: {'lr': 5.596869060814069e-05, 'samples': 22628736, 'steps': 117857, 'loss/train': 1.5011531114578247} 11/07/2021 13:47:33 - INFO - __main__ - Step 117859: {'lr': 5.596534433140341e-05, 'samples': 22628928, 'steps': 117858, 'loss/train': 0.9405037760734558} 11/07/2021 13:47:33 - INFO - __main__ - Step 117860: {'lr': 5.5961998142095156e-05, 'samples': 22629120, 'steps': 117859, 'loss/train': 1.591567873954773} 11/07/2021 13:47:34 - INFO - __main__ - Step 117861: {'lr': 5.5958652040217414e-05, 'samples': 22629312, 'steps': 117860, 'loss/train': 1.2300161123275757} 11/07/2021 13:47:35 - INFO - __main__ - Step 117862: {'lr': 5.595530602577178e-05, 'samples': 22629504, 'steps': 117861, 'loss/train': 0.7869330048561096} 11/07/2021 13:47:35 - INFO - __main__ - Step 117863: {'lr': 5.5951960098759637e-05, 'samples': 22629696, 'steps': 117862, 'loss/train': 1.5645990371704102} 11/07/2021 13:47:35 - INFO - __main__ - Step 117864: {'lr': 5.594861425918255e-05, 'samples': 22629888, 'steps': 117863, 'loss/train': 1.311586618423462} 11/07/2021 13:47:36 - INFO - __main__ - Step 117865: {'lr': 5.5945268507042013e-05, 'samples': 22630080, 'steps': 117864, 'loss/train': 1.1125507354736328} 11/07/2021 13:47:37 - INFO - __main__ - Step 117866: {'lr': 5.5941922842339565e-05, 'samples': 22630272, 'steps': 117865, 'loss/train': 1.2271472215652466} 11/07/2021 13:47:37 - INFO - __main__ - Step 117867: {'lr': 5.5938577265076674e-05, 'samples': 22630464, 'steps': 117866, 'loss/train': 1.3863941431045532} 11/07/2021 13:47:38 - INFO - __main__ - Step 117868: {'lr': 5.5935231775254865e-05, 'samples': 22630656, 'steps': 117867, 'loss/train': 1.0746427774429321} 11/07/2021 13:47:38 - INFO - __main__ - Step 117869: {'lr': 5.5931886372875634e-05, 'samples': 22630848, 'steps': 117868, 'loss/train': 1.189023494720459} 11/07/2021 13:47:38 - INFO - __main__ - Step 117870: {'lr': 5.592854105794051e-05, 'samples': 22631040, 'steps': 117869, 'loss/train': 1.6809921264648438} 11/07/2021 13:47:39 - INFO - __main__ - Step 117871: {'lr': 5.5925195830451e-05, 'samples': 22631232, 'steps': 117870, 'loss/train': 1.5148073434829712} 11/07/2021 13:47:40 - INFO - __main__ - Step 117872: {'lr': 5.592185069040859e-05, 'samples': 22631424, 'steps': 117871, 'loss/train': 0.21988500654697418} 11/07/2021 13:47:40 - INFO - __main__ - Step 117873: {'lr': 5.591850563781481e-05, 'samples': 22631616, 'steps': 117872, 'loss/train': 1.369497299194336} 11/07/2021 13:47:41 - INFO - __main__ - Step 117874: {'lr': 5.5915160672671134e-05, 'samples': 22631808, 'steps': 117873, 'loss/train': 1.0856380462646484} 11/07/2021 13:47:41 - INFO - __main__ - Step 117875: {'lr': 5.59118157949792e-05, 'samples': 22632000, 'steps': 117874, 'loss/train': 1.4321973323822021} 11/07/2021 13:47:41 - INFO - __main__ - Step 117876: {'lr': 5.590847100474031e-05, 'samples': 22632192, 'steps': 117875, 'loss/train': 1.1646767854690552} 11/07/2021 13:47:42 - INFO - __main__ - Step 117877: {'lr': 5.590512630195607e-05, 'samples': 22632384, 'steps': 117876, 'loss/train': 0.8327118158340454} 11/07/2021 13:47:43 - INFO - __main__ - Step 117878: {'lr': 5.590178168662799e-05, 'samples': 22632576, 'steps': 117877, 'loss/train': 1.5064787864685059} 11/07/2021 13:47:43 - INFO - __main__ - Step 117879: {'lr': 5.589843715875756e-05, 'samples': 22632768, 'steps': 117878, 'loss/train': 1.2619283199310303} 11/07/2021 13:47:44 - INFO - __main__ - Step 117880: {'lr': 5.5895092718346306e-05, 'samples': 22632960, 'steps': 117879, 'loss/train': 0.40677040815353394} 11/07/2021 13:47:44 - INFO - __main__ - Step 117881: {'lr': 5.589174836539573e-05, 'samples': 22633152, 'steps': 117880, 'loss/train': 1.2983276844024658} 11/07/2021 13:47:45 - INFO - __main__ - Step 117882: {'lr': 5.5888404099907336e-05, 'samples': 22633344, 'steps': 117881, 'loss/train': 0.5051808953285217} 11/07/2021 13:47:45 - INFO - __main__ - Step 117883: {'lr': 5.588505992188264e-05, 'samples': 22633536, 'steps': 117882, 'loss/train': 1.2217732667922974} 11/07/2021 13:47:46 - INFO - __main__ - Step 117884: {'lr': 5.588171583132315e-05, 'samples': 22633728, 'steps': 117883, 'loss/train': 1.5343308448791504} 11/07/2021 13:47:46 - INFO - __main__ - Step 117885: {'lr': 5.587837182823033e-05, 'samples': 22633920, 'steps': 117884, 'loss/train': 1.1133078336715698} 11/07/2021 13:47:46 - INFO - __main__ - Step 117886: {'lr': 5.5875027912605735e-05, 'samples': 22634112, 'steps': 117885, 'loss/train': 1.2938156127929688} 11/07/2021 13:47:47 - INFO - __main__ - Step 117887: {'lr': 5.587168408445087e-05, 'samples': 22634304, 'steps': 117886, 'loss/train': 1.736081838607788} 11/07/2021 13:47:48 - INFO - __main__ - Step 117888: {'lr': 5.586834034376723e-05, 'samples': 22634496, 'steps': 117887, 'loss/train': 0.7520347237586975} 11/07/2021 13:47:48 - INFO - __main__ - Step 117889: {'lr': 5.586499669055636e-05, 'samples': 22634688, 'steps': 117888, 'loss/train': 1.267807960510254} 11/07/2021 13:47:48 - INFO - __main__ - Step 117890: {'lr': 5.5861653124819696e-05, 'samples': 22634880, 'steps': 117889, 'loss/train': 1.4351975917816162} 11/07/2021 13:47:49 - INFO - __main__ - Step 117891: {'lr': 5.5858309646558746e-05, 'samples': 22635072, 'steps': 117890, 'loss/train': 1.2282121181488037} 11/07/2021 13:47:49 - INFO - __main__ - Step 117892: {'lr': 5.585496625577505e-05, 'samples': 22635264, 'steps': 117891, 'loss/train': 1.1948301792144775} 11/07/2021 13:47:50 - INFO - __main__ - Step 117893: {'lr': 5.5851622952470124e-05, 'samples': 22635456, 'steps': 117892, 'loss/train': 1.3884179592132568} 11/07/2021 13:47:50 - INFO - __main__ - Step 117894: {'lr': 5.584827973664544e-05, 'samples': 22635648, 'steps': 117893, 'loss/train': 1.175776720046997} 11/07/2021 13:47:51 - INFO - __main__ - Step 117895: {'lr': 5.5844936608302535e-05, 'samples': 22635840, 'steps': 117894, 'loss/train': 1.4354321956634521} 11/07/2021 13:47:51 - INFO - __main__ - Step 117896: {'lr': 5.584159356744292e-05, 'samples': 22636032, 'steps': 117895, 'loss/train': 1.3395880460739136} 11/07/2021 13:47:52 - INFO - __main__ - Step 117897: {'lr': 5.5838250614068055e-05, 'samples': 22636224, 'steps': 117896, 'loss/train': 1.4092967510223389} 11/07/2021 13:47:53 - INFO - __main__ - Step 117898: {'lr': 5.583490774817951e-05, 'samples': 22636416, 'steps': 117897, 'loss/train': 0.5271498560905457} 11/07/2021 13:47:53 - INFO - __main__ - Step 117899: {'lr': 5.583156496977876e-05, 'samples': 22636608, 'steps': 117898, 'loss/train': 1.304589867591858} 11/07/2021 13:47:53 - INFO - __main__ - Step 117900: {'lr': 5.582822227886728e-05, 'samples': 22636800, 'steps': 117899, 'loss/train': 1.5643428564071655} 11/07/2021 13:47:54 - INFO - __main__ - Step 117901: {'lr': 5.582487967544664e-05, 'samples': 22636992, 'steps': 117900, 'loss/train': 1.4997642040252686} 11/07/2021 13:47:54 - INFO - __main__ - Step 117902: {'lr': 5.582153715951835e-05, 'samples': 22637184, 'steps': 117901, 'loss/train': 1.2047425508499146} 11/07/2021 13:47:55 - INFO - __main__ - Step 117903: {'lr': 5.5818194731083824e-05, 'samples': 22637376, 'steps': 117902, 'loss/train': 0.8211953639984131} 11/07/2021 13:47:55 - INFO - __main__ - Step 117904: {'lr': 5.581485239014464e-05, 'samples': 22637568, 'steps': 117903, 'loss/train': 0.8677889704704285} 11/07/2021 13:47:56 - INFO - __main__ - Step 117905: {'lr': 5.581151013670227e-05, 'samples': 22637760, 'steps': 117904, 'loss/train': 1.3819304704666138} 11/07/2021 13:47:56 - INFO - __main__ - Step 117906: {'lr': 5.5808167970758245e-05, 'samples': 22637952, 'steps': 117905, 'loss/train': 1.1120941638946533} 11/07/2021 13:47:56 - INFO - __main__ - Step 117907: {'lr': 5.580482589231406e-05, 'samples': 22638144, 'steps': 117906, 'loss/train': 0.9480305314064026} 11/07/2021 13:47:57 - INFO - __main__ - Step 117908: {'lr': 5.580148390137121e-05, 'samples': 22638336, 'steps': 117907, 'loss/train': 0.8216351866722107} 11/07/2021 13:47:58 - INFO - __main__ - Step 117909: {'lr': 5.579814199793123e-05, 'samples': 22638528, 'steps': 117908, 'loss/train': 1.2555235624313354} 11/07/2021 13:47:58 - INFO - __main__ - Step 117910: {'lr': 5.579480018199559e-05, 'samples': 22638720, 'steps': 117909, 'loss/train': 1.1430602073669434} 11/07/2021 13:47:58 - INFO - __main__ - Step 117911: {'lr': 5.579145845356584e-05, 'samples': 22638912, 'steps': 117910, 'loss/train': 1.2390788793563843} 11/07/2021 13:47:59 - INFO - __main__ - Step 117912: {'lr': 5.578811681264345e-05, 'samples': 22639104, 'steps': 117911, 'loss/train': 1.1514703035354614} 11/07/2021 13:48:00 - INFO - __main__ - Step 117913: {'lr': 5.5784775259229954e-05, 'samples': 22639296, 'steps': 117912, 'loss/train': 1.0764387845993042} 11/07/2021 13:48:00 - INFO - __main__ - Step 117914: {'lr': 5.578143379332684e-05, 'samples': 22639488, 'steps': 117913, 'loss/train': 1.0363023281097412} 11/07/2021 13:48:01 - INFO - __main__ - Step 117915: {'lr': 5.577809241493559e-05, 'samples': 22639680, 'steps': 117914, 'loss/train': 1.256902813911438} 11/07/2021 13:48:01 - INFO - __main__ - Step 117916: {'lr': 5.5774751124057834e-05, 'samples': 22639872, 'steps': 117915, 'loss/train': 1.3637508153915405} 11/07/2021 13:48:01 - INFO - __main__ - Step 117917: {'lr': 5.5771409920694873e-05, 'samples': 22640064, 'steps': 117916, 'loss/train': 1.4341834783554077} 11/07/2021 13:48:03 - INFO - __main__ - Step 117918: {'lr': 5.576806880484836e-05, 'samples': 22640256, 'steps': 117917, 'loss/train': 0.9755026698112488} 11/07/2021 13:48:03 - INFO - __main__ - Step 117919: {'lr': 5.576472777651972e-05, 'samples': 22640448, 'steps': 117918, 'loss/train': 1.3334448337554932} 11/07/2021 13:48:03 - INFO - __main__ - Step 117920: {'lr': 5.576138683571053e-05, 'samples': 22640640, 'steps': 117919, 'loss/train': 1.0967762470245361} 11/07/2021 13:48:04 - INFO - __main__ - Step 117921: {'lr': 5.575804598242223e-05, 'samples': 22640832, 'steps': 117920, 'loss/train': 1.4616458415985107} 11/07/2021 13:48:04 - INFO - __main__ - Step 117922: {'lr': 5.5754705216656375e-05, 'samples': 22641024, 'steps': 117921, 'loss/train': 1.380746841430664} 11/07/2021 13:48:05 - INFO - __main__ - Step 117923: {'lr': 5.575136453841445e-05, 'samples': 22641216, 'steps': 117922, 'loss/train': 0.9218029975891113} 11/07/2021 13:48:06 - INFO - __main__ - Step 117924: {'lr': 5.574802394769796e-05, 'samples': 22641408, 'steps': 117923, 'loss/train': 0.7310882210731506} 11/07/2021 13:48:06 - INFO - __main__ - Step 117925: {'lr': 5.574468344450842e-05, 'samples': 22641600, 'steps': 117924, 'loss/train': 1.2638286352157593} 11/07/2021 13:48:06 - INFO - __main__ - Step 117926: {'lr': 5.574134302884731e-05, 'samples': 22641792, 'steps': 117925, 'loss/train': 0.9800962209701538} 11/07/2021 13:48:07 - INFO - __main__ - Step 117927: {'lr': 5.5738002700716164e-05, 'samples': 22641984, 'steps': 117926, 'loss/train': 1.535714864730835} 11/07/2021 13:48:07 - INFO - __main__ - Step 117928: {'lr': 5.573466246011649e-05, 'samples': 22642176, 'steps': 117927, 'loss/train': 1.686217188835144} 11/07/2021 13:48:08 - INFO - __main__ - Step 117929: {'lr': 5.573132230704983e-05, 'samples': 22642368, 'steps': 117928, 'loss/train': 1.6561962366104126} 11/07/2021 13:48:09 - INFO - __main__ - Step 117930: {'lr': 5.57279822415176e-05, 'samples': 22642560, 'steps': 117929, 'loss/train': 1.3582768440246582} 11/07/2021 13:48:09 - INFO - __main__ - Step 117931: {'lr': 5.572464226352131e-05, 'samples': 22642752, 'steps': 117930, 'loss/train': 1.5254143476486206} 11/07/2021 13:48:09 - INFO - __main__ - Step 117932: {'lr': 5.57213023730625e-05, 'samples': 22642944, 'steps': 117931, 'loss/train': 1.2788763046264648} 11/07/2021 13:48:10 - INFO - __main__ - Step 117933: {'lr': 5.571796257014267e-05, 'samples': 22643136, 'steps': 117932, 'loss/train': 1.0962661504745483} 11/07/2021 13:48:11 - INFO - __main__ - Step 117934: {'lr': 5.571462285476333e-05, 'samples': 22643328, 'steps': 117933, 'loss/train': 1.3537371158599854} 11/07/2021 13:48:11 - INFO - __main__ - Step 117935: {'lr': 5.5711283226926005e-05, 'samples': 22643520, 'steps': 117934, 'loss/train': 1.659556269645691} 11/07/2021 13:48:11 - INFO - __main__ - Step 117936: {'lr': 5.570794368663218e-05, 'samples': 22643712, 'steps': 117935, 'loss/train': 1.1708859205245972} 11/07/2021 13:48:12 - INFO - __main__ - Step 117937: {'lr': 5.570460423388332e-05, 'samples': 22643904, 'steps': 117936, 'loss/train': 1.5310016870498657} 11/07/2021 13:48:12 - INFO - __main__ - Step 117938: {'lr': 5.570126486868099e-05, 'samples': 22644096, 'steps': 117937, 'loss/train': 1.2695896625518799} 11/07/2021 13:48:13 - INFO - __main__ - Step 117939: {'lr': 5.569792559102668e-05, 'samples': 22644288, 'steps': 117938, 'loss/train': 1.2870070934295654} 11/07/2021 13:48:13 - INFO - __main__ - Step 117940: {'lr': 5.569458640092187e-05, 'samples': 22644480, 'steps': 117939, 'loss/train': 1.7524077892303467} 11/07/2021 13:48:14 - INFO - __main__ - Step 117941: {'lr': 5.5691247298368164e-05, 'samples': 22644672, 'steps': 117940, 'loss/train': 1.3213872909545898} 11/07/2021 13:48:14 - INFO - __main__ - Step 117942: {'lr': 5.5687908283366925e-05, 'samples': 22644864, 'steps': 117941, 'loss/train': 1.2241630554199219} 11/07/2021 13:48:15 - INFO - __main__ - Step 117943: {'lr': 5.56845693559197e-05, 'samples': 22645056, 'steps': 117942, 'loss/train': 1.1958026885986328} 11/07/2021 13:48:16 - INFO - __main__ - Step 117944: {'lr': 5.5681230516027996e-05, 'samples': 22645248, 'steps': 117943, 'loss/train': 1.2221404314041138} 11/07/2021 13:48:16 - INFO - __main__ - Step 117945: {'lr': 5.567789176369334e-05, 'samples': 22645440, 'steps': 117944, 'loss/train': 1.2426892518997192} 11/07/2021 13:48:16 - INFO - __main__ - Step 117946: {'lr': 5.567455309891722e-05, 'samples': 22645632, 'steps': 117945, 'loss/train': 1.1381378173828125} 11/07/2021 13:48:17 - INFO - __main__ - Step 117947: {'lr': 5.567121452170118e-05, 'samples': 22645824, 'steps': 117946, 'loss/train': 1.100978970527649} 11/07/2021 13:48:17 - INFO - __main__ - Step 117948: {'lr': 5.5667876032046675e-05, 'samples': 22646016, 'steps': 117947, 'loss/train': 1.7793959379196167} 11/07/2021 13:48:17 - INFO - __main__ - Step 117949: {'lr': 5.566453762995521e-05, 'samples': 22646208, 'steps': 117948, 'loss/train': 0.9922183752059937} 11/07/2021 13:48:18 - INFO - __main__ - Step 117950: {'lr': 5.566119931542832e-05, 'samples': 22646400, 'steps': 117949, 'loss/train': 1.192304015159607} 11/07/2021 13:48:19 - INFO - __main__ - Step 117951: {'lr': 5.565786108846749e-05, 'samples': 22646592, 'steps': 117950, 'loss/train': 1.4172899723052979} 11/07/2021 13:48:19 - INFO - __main__ - Step 117952: {'lr': 5.56545229490743e-05, 'samples': 22646784, 'steps': 117951, 'loss/train': 0.6398098468780518} 11/07/2021 13:48:19 - INFO - __main__ - Step 117953: {'lr': 5.565118489725013e-05, 'samples': 22646976, 'steps': 117952, 'loss/train': 1.1229640245437622} 11/07/2021 13:48:20 - INFO - __main__ - Step 117954: {'lr': 5.564784693299652e-05, 'samples': 22647168, 'steps': 117953, 'loss/train': 1.1317485570907593} 11/07/2021 13:48:21 - INFO - __main__ - Step 117955: {'lr': 5.564450905631499e-05, 'samples': 22647360, 'steps': 117954, 'loss/train': 1.3452950716018677} 11/07/2021 13:48:21 - INFO - __main__ - Step 117956: {'lr': 5.564117126720705e-05, 'samples': 22647552, 'steps': 117955, 'loss/train': 1.950115442276001} 11/07/2021 13:48:21 - INFO - __main__ - Step 117957: {'lr': 5.56378335656742e-05, 'samples': 22647744, 'steps': 117956, 'loss/train': 1.3403390645980835} 11/07/2021 13:48:22 - INFO - __main__ - Step 117958: {'lr': 5.5634495951717936e-05, 'samples': 22647936, 'steps': 117957, 'loss/train': 1.5629382133483887} 11/07/2021 13:48:22 - INFO - __main__ - Step 117959: {'lr': 5.563115842533978e-05, 'samples': 22648128, 'steps': 117958, 'loss/train': 1.5943710803985596} 11/07/2021 13:48:23 - INFO - __main__ - Step 117960: {'lr': 5.56278209865412e-05, 'samples': 22648320, 'steps': 117959, 'loss/train': 0.7219426035881042} 11/07/2021 13:48:24 - INFO - __main__ - Step 117961: {'lr': 5.562448363532374e-05, 'samples': 22648512, 'steps': 117960, 'loss/train': 0.9972994923591614} 11/07/2021 13:48:24 - INFO - __main__ - Step 117962: {'lr': 5.562114637168897e-05, 'samples': 22648704, 'steps': 117961, 'loss/train': 0.6965422630310059} 11/07/2021 13:48:24 - INFO - __main__ - Step 117963: {'lr': 5.5617809195638244e-05, 'samples': 22648896, 'steps': 117962, 'loss/train': 0.3442496061325073} 11/07/2021 13:48:25 - INFO - __main__ - Step 117964: {'lr': 5.5614472107173105e-05, 'samples': 22649088, 'steps': 117963, 'loss/train': 1.6450825929641724} 11/07/2021 13:48:26 - INFO - __main__ - Step 117965: {'lr': 5.5611135106295116e-05, 'samples': 22649280, 'steps': 117964, 'loss/train': 1.1493713855743408} 11/07/2021 13:48:26 - INFO - __main__ - Step 117966: {'lr': 5.5607798193005735e-05, 'samples': 22649472, 'steps': 117965, 'loss/train': 1.4139292240142822} 11/07/2021 13:48:26 - INFO - __main__ - Step 117967: {'lr': 5.5604461367306486e-05, 'samples': 22649664, 'steps': 117966, 'loss/train': 2.5680487155914307} 11/07/2021 13:48:27 - INFO - __main__ - Step 117968: {'lr': 5.560112462919886e-05, 'samples': 22649856, 'steps': 117967, 'loss/train': 1.0831652879714966} 11/07/2021 13:48:27 - INFO - __main__ - Step 117969: {'lr': 5.559778797868437e-05, 'samples': 22650048, 'steps': 117968, 'loss/train': 1.3428258895874023} 11/07/2021 13:48:28 - INFO - __main__ - Step 117970: {'lr': 5.559445141576453e-05, 'samples': 22650240, 'steps': 117969, 'loss/train': 1.8572160005569458} 11/07/2021 13:48:29 - INFO - __main__ - Step 117971: {'lr': 5.559111494044081e-05, 'samples': 22650432, 'steps': 117970, 'loss/train': 1.0277810096740723} 11/07/2021 13:48:29 - INFO - __main__ - Step 117972: {'lr': 5.558777855271474e-05, 'samples': 22650624, 'steps': 117971, 'loss/train': 1.4362001419067383} 11/07/2021 13:48:29 - INFO - __main__ - Step 117973: {'lr': 5.558444225258788e-05, 'samples': 22650816, 'steps': 117972, 'loss/train': 1.4520834684371948} 11/07/2021 13:48:30 - INFO - __main__ - Step 117974: {'lr': 5.5581106040061614e-05, 'samples': 22651008, 'steps': 117973, 'loss/train': 1.3949428796768188} 11/07/2021 13:48:30 - INFO - __main__ - Step 117975: {'lr': 5.5577769915137497e-05, 'samples': 22651200, 'steps': 117974, 'loss/train': 1.710532784461975} 11/07/2021 13:48:31 - INFO - __main__ - Step 117976: {'lr': 5.557443387781702e-05, 'samples': 22651392, 'steps': 117975, 'loss/train': 1.4672309160232544} 11/07/2021 13:48:31 - INFO - __main__ - Step 117977: {'lr': 5.5571097928101724e-05, 'samples': 22651584, 'steps': 117976, 'loss/train': 1.5540199279785156} 11/07/2021 13:48:32 - INFO - __main__ - Step 117978: {'lr': 5.556776206599309e-05, 'samples': 22651776, 'steps': 117977, 'loss/train': 1.427085518836975} 11/07/2021 13:48:32 - INFO - __main__ - Step 117979: {'lr': 5.556442629149264e-05, 'samples': 22651968, 'steps': 117978, 'loss/train': 1.0111844539642334} 11/07/2021 13:48:32 - INFO - __main__ - Step 117980: {'lr': 5.556109060460182e-05, 'samples': 22652160, 'steps': 117979, 'loss/train': 1.8058490753173828} 11/07/2021 13:48:34 - INFO - __main__ - Step 117981: {'lr': 5.55577550053222e-05, 'samples': 22652352, 'steps': 117980, 'loss/train': 1.3992149829864502} 11/07/2021 13:48:34 - INFO - __main__ - Step 117982: {'lr': 5.555441949365522e-05, 'samples': 22652544, 'steps': 117981, 'loss/train': 1.08245050907135} 11/07/2021 13:48:34 - INFO - __main__ - Step 117983: {'lr': 5.5551084069602466e-05, 'samples': 22652736, 'steps': 117982, 'loss/train': 1.2866183519363403} 11/07/2021 13:48:35 - INFO - __main__ - Step 117984: {'lr': 5.554774873316543e-05, 'samples': 22652928, 'steps': 117983, 'loss/train': 0.8267714381217957} 11/07/2021 13:48:35 - INFO - __main__ - Step 117985: {'lr': 5.554441348434553e-05, 'samples': 22653120, 'steps': 117984, 'loss/train': 0.5879713892936707} 11/07/2021 13:48:36 - INFO - __main__ - Step 117986: {'lr': 5.5541078323144285e-05, 'samples': 22653312, 'steps': 117985, 'loss/train': 0.6248368620872498} 11/07/2021 13:48:36 - INFO - __main__ - Step 117987: {'lr': 5.553774324956326e-05, 'samples': 22653504, 'steps': 117986, 'loss/train': 1.481668472290039} 11/07/2021 13:48:37 - INFO - __main__ - Step 117988: {'lr': 5.553440826360393e-05, 'samples': 22653696, 'steps': 117987, 'loss/train': 1.6345881223678589} 11/07/2021 13:48:37 - INFO - __main__ - Step 117989: {'lr': 5.553107336526777e-05, 'samples': 22653888, 'steps': 117988, 'loss/train': 1.8049174547195435} 11/07/2021 13:48:37 - INFO - __main__ - Step 117990: {'lr': 5.552773855455631e-05, 'samples': 22654080, 'steps': 117989, 'loss/train': 1.362610936164856} 11/07/2021 13:48:39 - INFO - __main__ - Step 117991: {'lr': 5.552440383147106e-05, 'samples': 22654272, 'steps': 117990, 'loss/train': 1.3877142667770386} 11/07/2021 13:48:39 - INFO - __main__ - Step 117992: {'lr': 5.5521069196013515e-05, 'samples': 22654464, 'steps': 117991, 'loss/train': 1.4915337562561035} 11/07/2021 13:48:39 - INFO - __main__ - Step 117993: {'lr': 5.551773464818516e-05, 'samples': 22654656, 'steps': 117992, 'loss/train': 0.903304398059845} 11/07/2021 13:48:40 - INFO - __main__ - Step 117994: {'lr': 5.5514400187987536e-05, 'samples': 22654848, 'steps': 117993, 'loss/train': 0.7335205674171448} 11/07/2021 13:48:40 - INFO - __main__ - Step 117995: {'lr': 5.551106581542212e-05, 'samples': 22655040, 'steps': 117994, 'loss/train': 0.8477895855903625} 11/07/2021 13:48:40 - INFO - __main__ - Step 117996: {'lr': 5.550773153049046e-05, 'samples': 22655232, 'steps': 117995, 'loss/train': 1.4869904518127441} 11/07/2021 13:48:41 - INFO - __main__ - Step 117997: {'lr': 5.550439733319396e-05, 'samples': 22655424, 'steps': 117996, 'loss/train': 1.1031945943832397} 11/07/2021 13:48:42 - INFO - __main__ - Step 117998: {'lr': 5.550106322353418e-05, 'samples': 22655616, 'steps': 117997, 'loss/train': 0.5579570531845093} 11/07/2021 13:48:42 - INFO - __main__ - Step 117999: {'lr': 5.5497729201512636e-05, 'samples': 22655808, 'steps': 117998, 'loss/train': 1.535355567932129} 11/07/2021 13:48:43 - INFO - __main__ - Step 118000: {'lr': 5.5494395267130795e-05, 'samples': 22656000, 'steps': 117999, 'loss/train': 1.34171462059021} 11/07/2021 13:48:43 - INFO - __main__ - Step 118001: {'lr': 5.549106142039018e-05, 'samples': 22656192, 'steps': 118000, 'loss/train': 1.2888673543930054} 11/07/2021 13:48:44 - INFO - __main__ - Step 118002: {'lr': 5.5487727661292284e-05, 'samples': 22656384, 'steps': 118001, 'loss/train': 1.0685659646987915} 11/07/2021 13:48:44 - INFO - __main__ - Step 118003: {'lr': 5.5484393989838624e-05, 'samples': 22656576, 'steps': 118002, 'loss/train': 1.7271801233291626} 11/07/2021 13:48:45 - INFO - __main__ - Step 118004: {'lr': 5.548106040603068e-05, 'samples': 22656768, 'steps': 118003, 'loss/train': 1.3835797309875488} 11/07/2021 13:48:45 - INFO - __main__ - Step 118005: {'lr': 5.547772690986999e-05, 'samples': 22656960, 'steps': 118004, 'loss/train': 1.033996343612671} 11/07/2021 13:48:45 - INFO - __main__ - Step 118006: {'lr': 5.547439350135802e-05, 'samples': 22657152, 'steps': 118005, 'loss/train': 1.5705796480178833} 11/07/2021 13:48:46 - INFO - __main__ - Step 118007: {'lr': 5.54710601804963e-05, 'samples': 22657344, 'steps': 118006, 'loss/train': 1.3077733516693115} 11/07/2021 13:48:47 - INFO - __main__ - Step 118008: {'lr': 5.5467726947286326e-05, 'samples': 22657536, 'steps': 118007, 'loss/train': 1.5745995044708252} 11/07/2021 13:48:47 - INFO - __main__ - Step 118009: {'lr': 5.546439380172957e-05, 'samples': 22657728, 'steps': 118008, 'loss/train': 1.3886877298355103} 11/07/2021 13:48:47 - INFO - __main__ - Step 118010: {'lr': 5.546106074382765e-05, 'samples': 22657920, 'steps': 118009, 'loss/train': 1.4768805503845215} 11/07/2021 13:48:48 - INFO - __main__ - Step 118011: {'lr': 5.545772777358188e-05, 'samples': 22658112, 'steps': 118010, 'loss/train': 2.013489007949829} 11/07/2021 13:48:50 - INFO - __main__ - Step 118012: {'lr': 5.5454394890993855e-05, 'samples': 22658304, 'steps': 118011, 'loss/train': 0.985601007938385} 11/07/2021 13:48:50 - INFO - __main__ - Step 118013: {'lr': 5.5451062096065094e-05, 'samples': 22658496, 'steps': 118012, 'loss/train': 0.940851092338562} 11/07/2021 13:48:51 - INFO - __main__ - Step 118014: {'lr': 5.544772938879708e-05, 'samples': 22658688, 'steps': 118013, 'loss/train': 1.2188361883163452} 11/07/2021 13:48:51 - INFO - __main__ - Step 118015: {'lr': 5.54443967691913e-05, 'samples': 22658880, 'steps': 118014, 'loss/train': 1.745141863822937} 11/07/2021 13:48:51 - INFO - __main__ - Step 118016: {'lr': 5.544106423724929e-05, 'samples': 22659072, 'steps': 118015, 'loss/train': 1.7486767768859863} 11/07/2021 13:48:52 - INFO - __main__ - Step 118017: {'lr': 5.543773179297254e-05, 'samples': 22659264, 'steps': 118016, 'loss/train': 1.746155858039856} 11/07/2021 13:48:52 - INFO - __main__ - Step 118018: {'lr': 5.5434399436362524e-05, 'samples': 22659456, 'steps': 118017, 'loss/train': 1.741794466972351} 11/07/2021 13:48:53 - INFO - __main__ - Step 118019: {'lr': 5.543106716742077e-05, 'samples': 22659648, 'steps': 118018, 'loss/train': 1.2543059587478638} 11/07/2021 13:48:54 - INFO - __main__ - Step 118020: {'lr': 5.542773498614878e-05, 'samples': 22659840, 'steps': 118019, 'loss/train': 1.596511960029602} 11/07/2021 13:48:54 - INFO - __main__ - Step 118021: {'lr': 5.5424402892548076e-05, 'samples': 22660032, 'steps': 118020, 'loss/train': 1.1799440383911133} 11/07/2021 13:48:54 - INFO - __main__ - Step 118022: {'lr': 5.54210708866201e-05, 'samples': 22660224, 'steps': 118021, 'loss/train': 1.256933569908142} 11/07/2021 13:48:55 - INFO - __main__ - Step 118023: {'lr': 5.541773896836647e-05, 'samples': 22660416, 'steps': 118022, 'loss/train': 1.6459928750991821} 11/07/2021 13:48:55 - INFO - __main__ - Step 118024: {'lr': 5.541440713778853e-05, 'samples': 22660608, 'steps': 118023, 'loss/train': 1.3590329885482788} 11/07/2021 13:48:56 - INFO - __main__ - Step 118025: {'lr': 5.541107539488785e-05, 'samples': 22660800, 'steps': 118024, 'loss/train': 1.7401683330535889} 11/07/2021 13:48:57 - INFO - __main__ - Step 118026: {'lr': 5.540774373966595e-05, 'samples': 22660992, 'steps': 118025, 'loss/train': 1.4491207599639893} 11/07/2021 13:48:57 - INFO - __main__ - Step 118027: {'lr': 5.5404412172124305e-05, 'samples': 22661184, 'steps': 118026, 'loss/train': 0.054649509489536285} 11/07/2021 13:48:57 - INFO - __main__ - Step 118028: {'lr': 5.5401080692264435e-05, 'samples': 22661376, 'steps': 118027, 'loss/train': 1.426284909248352} 11/07/2021 13:48:58 - INFO - __main__ - Step 118029: {'lr': 5.539774930008784e-05, 'samples': 22661568, 'steps': 118028, 'loss/train': 1.0693241357803345} 11/07/2021 13:48:59 - INFO - __main__ - Step 118030: {'lr': 5.5394417995596e-05, 'samples': 22661760, 'steps': 118029, 'loss/train': 1.4345542192459106} 11/07/2021 13:48:59 - INFO - __main__ - Step 118031: {'lr': 5.539108677879046e-05, 'samples': 22661952, 'steps': 118030, 'loss/train': 1.0472759008407593} 11/07/2021 13:48:59 - INFO - __main__ - Step 118032: {'lr': 5.538775564967266e-05, 'samples': 22662144, 'steps': 118031, 'loss/train': 1.4820115566253662} 11/07/2021 13:49:00 - INFO - __main__ - Step 118033: {'lr': 5.5384424608244165e-05, 'samples': 22662336, 'steps': 118032, 'loss/train': 1.3496532440185547} 11/07/2021 13:49:00 - INFO - __main__ - Step 118034: {'lr': 5.5381093654506416e-05, 'samples': 22662528, 'steps': 118033, 'loss/train': 1.389877438545227} 11/07/2021 13:49:01 - INFO - __main__ - Step 118035: {'lr': 5.5377762788460964e-05, 'samples': 22662720, 'steps': 118034, 'loss/train': 1.620640516281128} 11/07/2021 13:49:02 - INFO - __main__ - Step 118036: {'lr': 5.5374432010109274e-05, 'samples': 22662912, 'steps': 118035, 'loss/train': 1.1094329357147217} 11/07/2021 13:49:02 - INFO - __main__ - Step 118037: {'lr': 5.5371101319452945e-05, 'samples': 22663104, 'steps': 118036, 'loss/train': 1.4788150787353516} 11/07/2021 13:49:02 - INFO - __main__ - Step 118038: {'lr': 5.536777071649332e-05, 'samples': 22663296, 'steps': 118037, 'loss/train': 1.4050755500793457} 11/07/2021 13:49:03 - INFO - __main__ - Step 118039: {'lr': 5.536444020123199e-05, 'samples': 22663488, 'steps': 118038, 'loss/train': 1.2803077697753906} 11/07/2021 13:49:03 - INFO - __main__ - Step 118040: {'lr': 5.5361109773670426e-05, 'samples': 22663680, 'steps': 118039, 'loss/train': 1.1473387479782104} 11/07/2021 13:49:04 - INFO - __main__ - Step 118041: {'lr': 5.535777943381012e-05, 'samples': 22663872, 'steps': 118040, 'loss/train': 1.3816180229187012} 11/07/2021 13:49:04 - INFO - __main__ - Step 118042: {'lr': 5.535444918165264e-05, 'samples': 22664064, 'steps': 118041, 'loss/train': 1.1388964653015137} 11/07/2021 13:49:05 - INFO - __main__ - Step 118043: {'lr': 5.5351119017199415e-05, 'samples': 22664256, 'steps': 118042, 'loss/train': 1.1036911010742188} 11/07/2021 13:49:05 - INFO - __main__ - Step 118044: {'lr': 5.534778894045197e-05, 'samples': 22664448, 'steps': 118043, 'loss/train': 2.0632827281951904} 11/07/2021 13:49:05 - INFO - __main__ - Step 118045: {'lr': 5.53444589514118e-05, 'samples': 22664640, 'steps': 118044, 'loss/train': 0.9656166434288025} 11/07/2021 13:49:07 - INFO - __main__ - Step 118046: {'lr': 5.534112905008043e-05, 'samples': 22664832, 'steps': 118045, 'loss/train': 1.0581635236740112} 11/07/2021 13:49:07 - INFO - __main__ - Step 118047: {'lr': 5.5337799236459345e-05, 'samples': 22665024, 'steps': 118046, 'loss/train': 1.620220422744751} 11/07/2021 13:49:07 - INFO - __main__ - Step 118048: {'lr': 5.533446951055004e-05, 'samples': 22665216, 'steps': 118047, 'loss/train': 1.5396414995193481} 11/07/2021 13:49:08 - INFO - __main__ - Step 118049: {'lr': 5.5331139872354e-05, 'samples': 22665408, 'steps': 118048, 'loss/train': 1.2598803043365479} 11/07/2021 13:49:08 - INFO - __main__ - Step 118050: {'lr': 5.532781032187284e-05, 'samples': 22665600, 'steps': 118049, 'loss/train': 1.682875394821167} 11/07/2021 13:49:09 - INFO - __main__ - Step 118051: {'lr': 5.532448085910788e-05, 'samples': 22665792, 'steps': 118050, 'loss/train': 0.4626710116863251} 11/07/2021 13:49:09 - INFO - __main__ - Step 118052: {'lr': 5.532115148406072e-05, 'samples': 22665984, 'steps': 118051, 'loss/train': 1.2635877132415771} 11/07/2021 13:49:10 - INFO - __main__ - Step 118053: {'lr': 5.531782219673284e-05, 'samples': 22666176, 'steps': 118052, 'loss/train': 1.3156625032424927} 11/07/2021 13:49:10 - INFO - __main__ - Step 118054: {'lr': 5.5314492997125734e-05, 'samples': 22666368, 'steps': 118053, 'loss/train': 1.0299761295318604} 11/07/2021 13:49:10 - INFO - __main__ - Step 118055: {'lr': 5.5311163885240935e-05, 'samples': 22666560, 'steps': 118054, 'loss/train': 1.148549199104309} 11/07/2021 13:49:12 - INFO - __main__ - Step 118056: {'lr': 5.5307834861079903e-05, 'samples': 22666752, 'steps': 118055, 'loss/train': 1.4113497734069824} 11/07/2021 13:49:12 - INFO - __main__ - Step 118057: {'lr': 5.530450592464414e-05, 'samples': 22666944, 'steps': 118056, 'loss/train': 1.4577195644378662} 11/07/2021 13:49:12 - INFO - __main__ - Step 118058: {'lr': 5.5301177075935184e-05, 'samples': 22667136, 'steps': 118057, 'loss/train': 1.3485972881317139} 11/07/2021 13:49:13 - INFO - __main__ - Step 118059: {'lr': 5.529784831495452e-05, 'samples': 22667328, 'steps': 118058, 'loss/train': 1.005892276763916} 11/07/2021 13:49:13 - INFO - __main__ - Step 118060: {'lr': 5.5294519641703625e-05, 'samples': 22667520, 'steps': 118059, 'loss/train': 1.289768099784851} 11/07/2021 13:49:13 - INFO - __main__ - Step 118061: {'lr': 5.529119105618402e-05, 'samples': 22667712, 'steps': 118060, 'loss/train': 1.3118330240249634} 11/07/2021 13:49:14 - INFO - __main__ - Step 118062: {'lr': 5.528786255839721e-05, 'samples': 22667904, 'steps': 118061, 'loss/train': 1.4926055669784546} 11/07/2021 13:49:15 - INFO - __main__ - Step 118063: {'lr': 5.528453414834475e-05, 'samples': 22668096, 'steps': 118062, 'loss/train': 1.2593518495559692} 11/07/2021 13:49:15 - INFO - __main__ - Step 118064: {'lr': 5.5281205826027996e-05, 'samples': 22668288, 'steps': 118063, 'loss/train': 1.020715355873108} 11/07/2021 13:49:15 - INFO - __main__ - Step 118065: {'lr': 5.527787759144853e-05, 'samples': 22668480, 'steps': 118064, 'loss/train': 1.6575630903244019} 11/07/2021 13:49:16 - INFO - __main__ - Step 118066: {'lr': 5.527454944460786e-05, 'samples': 22668672, 'steps': 118065, 'loss/train': 1.6844542026519775} 11/07/2021 13:49:17 - INFO - __main__ - Step 118067: {'lr': 5.527122138550747e-05, 'samples': 22668864, 'steps': 118066, 'loss/train': 1.3645343780517578} 11/07/2021 13:49:17 - INFO - __main__ - Step 118068: {'lr': 5.526789341414884e-05, 'samples': 22669056, 'steps': 118067, 'loss/train': 1.1108968257904053} 11/07/2021 13:49:18 - INFO - __main__ - Step 118069: {'lr': 5.526456553053352e-05, 'samples': 22669248, 'steps': 118068, 'loss/train': 1.266072154045105} 11/07/2021 13:49:18 - INFO - __main__ - Step 118070: {'lr': 5.526123773466296e-05, 'samples': 22669440, 'steps': 118069, 'loss/train': 1.5471292734146118} 11/07/2021 13:49:18 - INFO - __main__ - Step 118071: {'lr': 5.525791002653868e-05, 'samples': 22669632, 'steps': 118070, 'loss/train': 1.0184251070022583} 11/07/2021 13:49:19 - INFO - __main__ - Step 118072: {'lr': 5.5254582406162214e-05, 'samples': 22669824, 'steps': 118071, 'loss/train': 1.0812699794769287} 11/07/2021 13:49:20 - INFO - __main__ - Step 118073: {'lr': 5.5251254873534994e-05, 'samples': 22670016, 'steps': 118072, 'loss/train': 1.4744890928268433} 11/07/2021 13:49:20 - INFO - __main__ - Step 118074: {'lr': 5.524792742865856e-05, 'samples': 22670208, 'steps': 118073, 'loss/train': 1.3817671537399292} 11/07/2021 13:49:20 - INFO - __main__ - Step 118075: {'lr': 5.524460007153442e-05, 'samples': 22670400, 'steps': 118074, 'loss/train': 0.9726481437683105} 11/07/2021 13:49:21 - INFO - __main__ - Step 118076: {'lr': 5.524127280216404e-05, 'samples': 22670592, 'steps': 118075, 'loss/train': 1.4063864946365356} 11/07/2021 13:49:22 - INFO - __main__ - Step 118077: {'lr': 5.5237945620549014e-05, 'samples': 22670784, 'steps': 118076, 'loss/train': 1.293070912361145} 11/07/2021 13:49:22 - INFO - __main__ - Step 118078: {'lr': 5.523461852669071e-05, 'samples': 22670976, 'steps': 118077, 'loss/train': 1.1751079559326172} 11/07/2021 13:49:22 - INFO - __main__ - Step 118079: {'lr': 5.523129152059064e-05, 'samples': 22671168, 'steps': 118078, 'loss/train': 1.172418236732483} 11/07/2021 13:49:23 - INFO - __main__ - Step 118080: {'lr': 5.522796460225038e-05, 'samples': 22671360, 'steps': 118079, 'loss/train': 1.2014497518539429} 11/07/2021 13:49:23 - INFO - __main__ - Step 118081: {'lr': 5.522463777167139e-05, 'samples': 22671552, 'steps': 118080, 'loss/train': 1.538568377494812} 11/07/2021 13:49:23 - INFO - __main__ - Step 118082: {'lr': 5.522131102885516e-05, 'samples': 22671744, 'steps': 118081, 'loss/train': 1.5685652494430542} 11/07/2021 13:49:24 - INFO - __main__ - Step 118083: {'lr': 5.521798437380321e-05, 'samples': 22671936, 'steps': 118082, 'loss/train': 1.5043656826019287} 11/07/2021 13:49:25 - INFO - __main__ - Step 118084: {'lr': 5.5214657806517024e-05, 'samples': 22672128, 'steps': 118083, 'loss/train': 1.406683087348938} 11/07/2021 13:49:25 - INFO - __main__ - Step 118085: {'lr': 5.521133132699813e-05, 'samples': 22672320, 'steps': 118084, 'loss/train': 1.5915815830230713} 11/07/2021 13:49:25 - INFO - __main__ - Step 118086: {'lr': 5.520800493524797e-05, 'samples': 22672512, 'steps': 118085, 'loss/train': 1.3278141021728516} 11/07/2021 13:49:26 - INFO - __main__ - Step 118087: {'lr': 5.52046786312681e-05, 'samples': 22672704, 'steps': 118086, 'loss/train': 1.342378854751587} 11/07/2021 13:49:27 - INFO - __main__ - Step 118088: {'lr': 5.520135241505999e-05, 'samples': 22672896, 'steps': 118087, 'loss/train': 1.3768419027328491} 11/07/2021 13:49:27 - INFO - __main__ - Step 118089: {'lr': 5.5198026286625155e-05, 'samples': 22673088, 'steps': 118088, 'loss/train': 1.8663667440414429} 11/07/2021 13:49:28 - INFO - __main__ - Step 118090: {'lr': 5.519470024596512e-05, 'samples': 22673280, 'steps': 118089, 'loss/train': 1.0203142166137695} 11/07/2021 13:49:28 - INFO - __main__ - Step 118091: {'lr': 5.5191374293081325e-05, 'samples': 22673472, 'steps': 118090, 'loss/train': 1.0295982360839844} 11/07/2021 13:49:28 - INFO - __main__ - Step 118092: {'lr': 5.5188048427975254e-05, 'samples': 22673664, 'steps': 118091, 'loss/train': 1.3922585248947144} 11/07/2021 13:49:29 - INFO - __main__ - Step 118093: {'lr': 5.518472265064845e-05, 'samples': 22673856, 'steps': 118092, 'loss/train': 0.8252376914024353} 11/07/2021 13:49:30 - INFO - __main__ - Step 118094: {'lr': 5.5181396961102416e-05, 'samples': 22674048, 'steps': 118093, 'loss/train': 0.8124462962150574} 11/07/2021 13:49:30 - INFO - __main__ - Step 118095: {'lr': 5.517807135933864e-05, 'samples': 22674240, 'steps': 118094, 'loss/train': 1.4017866849899292} 11/07/2021 13:49:30 - INFO - __main__ - Step 118096: {'lr': 5.517474584535859e-05, 'samples': 22674432, 'steps': 118095, 'loss/train': 1.1053837537765503} 11/07/2021 13:49:31 - INFO - __main__ - Step 118097: {'lr': 5.5171420419163815e-05, 'samples': 22674624, 'steps': 118096, 'loss/train': 1.6987305879592896} 11/07/2021 13:49:32 - INFO - __main__ - Step 118098: {'lr': 5.5168095080755793e-05, 'samples': 22674816, 'steps': 118097, 'loss/train': 1.647225022315979} 11/07/2021 13:49:32 - INFO - __main__ - Step 118099: {'lr': 5.516476983013602e-05, 'samples': 22675008, 'steps': 118098, 'loss/train': 0.8069128394126892} 11/07/2021 13:49:33 - INFO - __main__ - Step 118100: {'lr': 5.516144466730599e-05, 'samples': 22675200, 'steps': 118099, 'loss/train': 1.0170636177062988} 11/07/2021 13:49:33 - INFO - __main__ - Step 118101: {'lr': 5.515811959226722e-05, 'samples': 22675392, 'steps': 118100, 'loss/train': 1.5463796854019165} 11/07/2021 13:49:33 - INFO - __main__ - Step 118102: {'lr': 5.515479460502118e-05, 'samples': 22675584, 'steps': 118101, 'loss/train': 1.1162134408950806} 11/07/2021 13:49:34 - INFO - __main__ - Step 118103: {'lr': 5.515146970556936e-05, 'samples': 22675776, 'steps': 118102, 'loss/train': 1.299835205078125} 11/07/2021 13:49:35 - INFO - __main__ - Step 118104: {'lr': 5.51481448939134e-05, 'samples': 22675968, 'steps': 118103, 'loss/train': 1.0453145503997803} 11/07/2021 13:49:35 - INFO - __main__ - Step 118105: {'lr': 5.5144820170054565e-05, 'samples': 22676160, 'steps': 118104, 'loss/train': 1.3007383346557617} 11/07/2021 13:49:35 - INFO - __main__ - Step 118106: {'lr': 5.51414955339945e-05, 'samples': 22676352, 'steps': 118105, 'loss/train': 0.9873498678207397} 11/07/2021 13:49:36 - INFO - __main__ - Step 118107: {'lr': 5.513817098573465e-05, 'samples': 22676544, 'steps': 118106, 'loss/train': 0.9503182768821716} 11/07/2021 13:49:37 - INFO - __main__ - Step 118108: {'lr': 5.513484652527656e-05, 'samples': 22676736, 'steps': 118107, 'loss/train': 1.1075206995010376} 11/07/2021 13:49:37 - INFO - __main__ - Step 118109: {'lr': 5.513152215262168e-05, 'samples': 22676928, 'steps': 118108, 'loss/train': 1.3134541511535645} 11/07/2021 13:49:37 - INFO - __main__ - Step 118110: {'lr': 5.512819786777151e-05, 'samples': 22677120, 'steps': 118109, 'loss/train': 1.126041293144226} 11/07/2021 13:49:38 - INFO - __main__ - Step 118111: {'lr': 5.5124873670727604e-05, 'samples': 22677312, 'steps': 118110, 'loss/train': 1.7179630994796753} 11/07/2021 13:49:38 - INFO - __main__ - Step 118112: {'lr': 5.51215495614914e-05, 'samples': 22677504, 'steps': 118111, 'loss/train': 1.2105211019515991} 11/07/2021 13:49:39 - INFO - __main__ - Step 118113: {'lr': 5.511822554006443e-05, 'samples': 22677696, 'steps': 118112, 'loss/train': 1.3374618291854858} 11/07/2021 13:49:40 - INFO - __main__ - Step 118114: {'lr': 5.511490160644819e-05, 'samples': 22677888, 'steps': 118113, 'loss/train': 1.4451093673706055} 11/07/2021 13:49:40 - INFO - __main__ - Step 118115: {'lr': 5.511157776064415e-05, 'samples': 22678080, 'steps': 118114, 'loss/train': 1.4928832054138184} 11/07/2021 13:49:40 - INFO - __main__ - Step 118116: {'lr': 5.510825400265382e-05, 'samples': 22678272, 'steps': 118115, 'loss/train': 1.3489227294921875} 11/07/2021 13:49:41 - INFO - __main__ - Step 118117: {'lr': 5.510493033247879e-05, 'samples': 22678464, 'steps': 118116, 'loss/train': 1.5366530418395996} 11/07/2021 13:49:42 - INFO - __main__ - Step 118118: {'lr': 5.510160675012041e-05, 'samples': 22678656, 'steps': 118117, 'loss/train': 1.2602254152297974} 11/07/2021 13:49:42 - INFO - __main__ - Step 118119: {'lr': 5.5098283255580226e-05, 'samples': 22678848, 'steps': 118118, 'loss/train': 0.9024894833564758} 11/07/2021 13:49:42 - INFO - __main__ - Step 118120: {'lr': 5.5094959848859734e-05, 'samples': 22679040, 'steps': 118119, 'loss/train': 1.260973334312439} 11/07/2021 13:49:43 - INFO - __main__ - Step 118121: {'lr': 5.5091636529960466e-05, 'samples': 22679232, 'steps': 118120, 'loss/train': 0.9695318937301636} 11/07/2021 13:49:43 - INFO - __main__ - Step 118122: {'lr': 5.508831329888389e-05, 'samples': 22679424, 'steps': 118121, 'loss/train': 0.9643194079399109} 11/07/2021 13:49:44 - INFO - __main__ - Step 118123: {'lr': 5.508499015563151e-05, 'samples': 22679616, 'steps': 118122, 'loss/train': 1.5328476428985596} 11/07/2021 13:49:44 - INFO - __main__ - Step 118124: {'lr': 5.508166710020482e-05, 'samples': 22679808, 'steps': 118123, 'loss/train': 0.7447001934051514} 11/07/2021 13:49:45 - INFO - __main__ - Step 118125: {'lr': 5.5078344132605316e-05, 'samples': 22680000, 'steps': 118124, 'loss/train': 1.4600595235824585} 11/07/2021 13:49:45 - INFO - __main__ - Step 118126: {'lr': 5.507502125283453e-05, 'samples': 22680192, 'steps': 118125, 'loss/train': 1.0992348194122314} 11/07/2021 13:49:46 - INFO - __main__ - Step 118127: {'lr': 5.507169846089391e-05, 'samples': 22680384, 'steps': 118126, 'loss/train': 1.2105947732925415} 11/07/2021 13:49:46 - INFO - __main__ - Step 118128: {'lr': 5.5068375756784996e-05, 'samples': 22680576, 'steps': 118127, 'loss/train': 1.3314061164855957} 11/07/2021 13:49:47 - INFO - __main__ - Step 118129: {'lr': 5.506505314050925e-05, 'samples': 22680768, 'steps': 118128, 'loss/train': 1.5139302015304565} 11/07/2021 13:49:47 - INFO - __main__ - Step 118130: {'lr': 5.506173061206818e-05, 'samples': 22680960, 'steps': 118129, 'loss/train': 1.3125298023223877} 11/07/2021 13:49:48 - INFO - __main__ - Step 118131: {'lr': 5.505840817146338e-05, 'samples': 22681152, 'steps': 118130, 'loss/train': 1.3819570541381836} 11/07/2021 13:49:48 - INFO - __main__ - Step 118132: {'lr': 5.5055085818696144e-05, 'samples': 22681344, 'steps': 118131, 'loss/train': 1.0984243154525757} 11/07/2021 13:49:48 - INFO - __main__ - Step 118133: {'lr': 5.505176355376812e-05, 'samples': 22681536, 'steps': 118132, 'loss/train': 1.60236656665802} 11/07/2021 13:49:49 - INFO - __main__ - Step 118134: {'lr': 5.504844137668072e-05, 'samples': 22681728, 'steps': 118133, 'loss/train': 1.3314977884292603} 11/07/2021 13:49:50 - INFO - __main__ - Step 118135: {'lr': 5.5045119287435526e-05, 'samples': 22681920, 'steps': 118134, 'loss/train': 1.0739431381225586} 11/07/2021 13:49:50 - INFO - __main__ - Step 118136: {'lr': 5.504179728603395e-05, 'samples': 22682112, 'steps': 118135, 'loss/train': 1.3668465614318848} 11/07/2021 13:49:50 - INFO - __main__ - Step 118137: {'lr': 5.503847537247755e-05, 'samples': 22682304, 'steps': 118136, 'loss/train': 0.9529404640197754} 11/07/2021 13:49:51 - INFO - __main__ - Step 118138: {'lr': 5.503515354676783e-05, 'samples': 22682496, 'steps': 118137, 'loss/train': 1.1836215257644653} 11/07/2021 13:49:52 - INFO - __main__ - Step 118139: {'lr': 5.503183180890622e-05, 'samples': 22682688, 'steps': 118138, 'loss/train': 1.2361726760864258} 11/07/2021 13:49:52 - INFO - __main__ - Step 118140: {'lr': 5.502851015889429e-05, 'samples': 22682880, 'steps': 118139, 'loss/train': 1.5605885982513428} 11/07/2021 13:49:53 - INFO - __main__ - Step 118141: {'lr': 5.50251885967335e-05, 'samples': 22683072, 'steps': 118140, 'loss/train': 1.2568551301956177} 11/07/2021 13:49:53 - INFO - __main__ - Step 118142: {'lr': 5.502186712242535e-05, 'samples': 22683264, 'steps': 118141, 'loss/train': 1.7798945903778076} 11/07/2021 13:49:53 - INFO - __main__ - Step 118143: {'lr': 5.5018545735971314e-05, 'samples': 22683456, 'steps': 118142, 'loss/train': 0.5422136783599854} 11/07/2021 13:49:54 - INFO - __main__ - Step 118144: {'lr': 5.5015224437373005e-05, 'samples': 22683648, 'steps': 118143, 'loss/train': 1.7673084735870361} 11/07/2021 13:49:55 - INFO - __main__ - Step 118145: {'lr': 5.5011903226631745e-05, 'samples': 22683840, 'steps': 118144, 'loss/train': 1.241908311843872} 11/07/2021 13:49:55 - INFO - __main__ - Step 118146: {'lr': 5.500858210374912e-05, 'samples': 22684032, 'steps': 118145, 'loss/train': 1.5688010454177856} 11/07/2021 13:49:55 - INFO - __main__ - Step 118147: {'lr': 5.5005261068726634e-05, 'samples': 22684224, 'steps': 118146, 'loss/train': 1.3151801824569702} 11/07/2021 13:49:56 - INFO - __main__ - Step 118148: {'lr': 5.500194012156576e-05, 'samples': 22684416, 'steps': 118147, 'loss/train': 1.3024052381515503} 11/07/2021 13:49:57 - INFO - __main__ - Step 118149: {'lr': 5.499861926226799e-05, 'samples': 22684608, 'steps': 118148, 'loss/train': 0.9892287850379944} 11/07/2021 13:49:57 - INFO - __main__ - Step 118150: {'lr': 5.4995298490834844e-05, 'samples': 22684800, 'steps': 118149, 'loss/train': 1.025028944015503} 11/07/2021 13:49:57 - INFO - __main__ - Step 118151: {'lr': 5.499197780726781e-05, 'samples': 22684992, 'steps': 118150, 'loss/train': 1.6866674423217773} 11/07/2021 13:49:58 - INFO - __main__ - Step 118152: {'lr': 5.498865721156837e-05, 'samples': 22685184, 'steps': 118151, 'loss/train': 1.242343544960022} 11/07/2021 13:49:58 - INFO - __main__ - Step 118153: {'lr': 5.4985336703738034e-05, 'samples': 22685376, 'steps': 118152, 'loss/train': 1.1567862033843994} 11/07/2021 13:49:59 - INFO - __main__ - Step 118154: {'lr': 5.4982016283778304e-05, 'samples': 22685568, 'steps': 118153, 'loss/train': 1.31291663646698} 11/07/2021 13:50:00 - INFO - __main__ - Step 118155: {'lr': 5.497869595169064e-05, 'samples': 22685760, 'steps': 118154, 'loss/train': 0.972201943397522} 11/07/2021 13:50:00 - INFO - __main__ - Step 118156: {'lr': 5.49753757074766e-05, 'samples': 22685952, 'steps': 118155, 'loss/train': 1.5798678398132324} 11/07/2021 13:50:00 - INFO - __main__ - Step 118157: {'lr': 5.4972055551137625e-05, 'samples': 22686144, 'steps': 118156, 'loss/train': 1.2470847368240356} 11/07/2021 13:50:01 - INFO - __main__ - Step 118158: {'lr': 5.496873548267531e-05, 'samples': 22686336, 'steps': 118157, 'loss/train': 1.1873506307601929} 11/07/2021 13:50:02 - INFO - __main__ - Step 118159: {'lr': 5.4965415502091026e-05, 'samples': 22686528, 'steps': 118158, 'loss/train': 1.26020085811615} 11/07/2021 13:50:02 - INFO - __main__ - Step 118160: {'lr': 5.496209560938628e-05, 'samples': 22686720, 'steps': 118159, 'loss/train': 1.342381238937378} 11/07/2021 13:50:02 - INFO - __main__ - Step 118161: {'lr': 5.4958775804562625e-05, 'samples': 22686912, 'steps': 118160, 'loss/train': 1.5954591035842896} 11/07/2021 13:50:03 - INFO - __main__ - Step 118162: {'lr': 5.495545608762154e-05, 'samples': 22687104, 'steps': 118161, 'loss/train': 0.576206624507904} 11/07/2021 13:50:03 - INFO - __main__ - Step 118163: {'lr': 5.4952136458564505e-05, 'samples': 22687296, 'steps': 118162, 'loss/train': 1.302451729774475} 11/07/2021 13:50:03 - INFO - __main__ - Step 118164: {'lr': 5.4948816917393035e-05, 'samples': 22687488, 'steps': 118163, 'loss/train': 1.137723684310913} 11/07/2021 13:50:04 - INFO - __main__ - Step 118165: {'lr': 5.494549746410859e-05, 'samples': 22687680, 'steps': 118164, 'loss/train': 0.9505645632743835} 11/07/2021 13:50:05 - INFO - __main__ - Step 118166: {'lr': 5.494217809871274e-05, 'samples': 22687872, 'steps': 118165, 'loss/train': 1.2202656269073486} 11/07/2021 13:50:05 - INFO - __main__ - Step 118167: {'lr': 5.493885882120689e-05, 'samples': 22688064, 'steps': 118166, 'loss/train': 0.7513657212257385} 11/07/2021 13:50:05 - INFO - __main__ - Step 118168: {'lr': 5.493553963159262e-05, 'samples': 22688256, 'steps': 118167, 'loss/train': 0.7735535502433777} 11/07/2021 13:50:06 - INFO - __main__ - Step 118169: {'lr': 5.493222052987137e-05, 'samples': 22688448, 'steps': 118168, 'loss/train': 1.4140340089797974} 11/07/2021 13:50:07 - INFO - __main__ - Step 118170: {'lr': 5.492890151604466e-05, 'samples': 22688640, 'steps': 118169, 'loss/train': 2.455735921859741} 11/07/2021 13:50:07 - INFO - __main__ - Step 118171: {'lr': 5.4925582590114016e-05, 'samples': 22688832, 'steps': 118170, 'loss/train': 1.289747953414917} 11/07/2021 13:50:08 - INFO - __main__ - Step 118172: {'lr': 5.4922263752080845e-05, 'samples': 22689024, 'steps': 118171, 'loss/train': 1.072389841079712} 11/07/2021 13:50:08 - INFO - __main__ - Step 118173: {'lr': 5.491894500194669e-05, 'samples': 22689216, 'steps': 118172, 'loss/train': 0.8243014812469482} 11/07/2021 13:50:08 - INFO - __main__ - Step 118174: {'lr': 5.491562633971306e-05, 'samples': 22689408, 'steps': 118173, 'loss/train': 1.5006215572357178} 11/07/2021 13:50:09 - INFO - __main__ - Step 118175: {'lr': 5.491230776538142e-05, 'samples': 22689600, 'steps': 118174, 'loss/train': 1.3896760940551758} 11/07/2021 13:50:10 - INFO - __main__ - Step 118176: {'lr': 5.4908989278953295e-05, 'samples': 22689792, 'steps': 118175, 'loss/train': 1.2942304611206055} 11/07/2021 13:50:10 - INFO - __main__ - Step 118177: {'lr': 5.4905670880430165e-05, 'samples': 22689984, 'steps': 118176, 'loss/train': 1.4918391704559326} 11/07/2021 13:50:10 - INFO - __main__ - Step 118178: {'lr': 5.490235256981352e-05, 'samples': 22690176, 'steps': 118177, 'loss/train': 0.2532706558704376} 11/07/2021 13:50:11 - INFO - __main__ - Step 118179: {'lr': 5.489903434710489e-05, 'samples': 22690368, 'steps': 118178, 'loss/train': 1.3022818565368652} 11/07/2021 13:50:12 - INFO - __main__ - Step 118180: {'lr': 5.489571621230571e-05, 'samples': 22690560, 'steps': 118179, 'loss/train': 0.877171516418457} 11/07/2021 13:50:12 - INFO - __main__ - Step 118181: {'lr': 5.489239816541755e-05, 'samples': 22690752, 'steps': 118180, 'loss/train': 1.4525136947631836} 11/07/2021 13:50:12 - INFO - __main__ - Step 118182: {'lr': 5.4889080206441846e-05, 'samples': 22690944, 'steps': 118181, 'loss/train': 1.3006949424743652} 11/07/2021 13:50:13 - INFO - __main__ - Step 118183: {'lr': 5.488576233538009e-05, 'samples': 22691136, 'steps': 118182, 'loss/train': 1.3514236211776733} 11/07/2021 13:50:13 - INFO - __main__ - Step 118184: {'lr': 5.48824445522339e-05, 'samples': 22691328, 'steps': 118183, 'loss/train': 1.254185438156128} 11/07/2021 13:50:14 - INFO - __main__ - Step 118185: {'lr': 5.487912685700458e-05, 'samples': 22691520, 'steps': 118184, 'loss/train': 1.7197027206420898} 11/07/2021 13:50:14 - INFO - __main__ - Step 118186: {'lr': 5.487580924969374e-05, 'samples': 22691712, 'steps': 118185, 'loss/train': 1.1574095487594604} 11/07/2021 13:50:15 - INFO - __main__ - Step 118187: {'lr': 5.487249173030282e-05, 'samples': 22691904, 'steps': 118186, 'loss/train': 1.293919563293457} 11/07/2021 13:50:15 - INFO - __main__ - Step 118188: {'lr': 5.486917429883334e-05, 'samples': 22692096, 'steps': 118187, 'loss/train': 1.1183593273162842} 11/07/2021 13:50:16 - INFO - __main__ - Step 118189: {'lr': 5.486585695528681e-05, 'samples': 22692288, 'steps': 118188, 'loss/train': 1.2887428998947144} 11/07/2021 13:50:17 - INFO - __main__ - Step 118190: {'lr': 5.486253969966473e-05, 'samples': 22692480, 'steps': 118189, 'loss/train': 1.3132158517837524} 11/07/2021 13:50:17 - INFO - __main__ - Step 118191: {'lr': 5.4859222531968565e-05, 'samples': 22692672, 'steps': 118190, 'loss/train': 1.5470190048217773} 11/07/2021 13:50:17 - INFO - __main__ - Step 118192: {'lr': 5.485590545219982e-05, 'samples': 22692864, 'steps': 118191, 'loss/train': 1.2974287271499634} 11/07/2021 13:50:18 - INFO - __main__ - Step 118193: {'lr': 5.4852588460360006e-05, 'samples': 22693056, 'steps': 118192, 'loss/train': 1.1368144750595093} 11/07/2021 13:50:18 - INFO - __main__ - Step 118194: {'lr': 5.484927155645059e-05, 'samples': 22693248, 'steps': 118193, 'loss/train': 1.5379174947738647} 11/07/2021 13:50:19 - INFO - __main__ - Step 118195: {'lr': 5.48459547404731e-05, 'samples': 22693440, 'steps': 118194, 'loss/train': 1.2992396354675293} 11/07/2021 13:50:19 - INFO - __main__ - Step 118196: {'lr': 5.4842638012429e-05, 'samples': 22693632, 'steps': 118195, 'loss/train': 0.9035112857818604} 11/07/2021 13:50:20 - INFO - __main__ - Step 118197: {'lr': 5.483932137231978e-05, 'samples': 22693824, 'steps': 118196, 'loss/train': 1.2832462787628174} 11/07/2021 13:50:20 - INFO - __main__ - Step 118198: {'lr': 5.483600482014703e-05, 'samples': 22694016, 'steps': 118197, 'loss/train': 1.329098105430603} 11/07/2021 13:50:21 - INFO - __main__ - Step 118199: {'lr': 5.48326883559121e-05, 'samples': 22694208, 'steps': 118198, 'loss/train': 1.1713842153549194} 11/07/2021 13:50:21 - INFO - __main__ - Step 118200: {'lr': 5.482937197961654e-05, 'samples': 22694400, 'steps': 118199, 'loss/train': 0.3843962252140045} 11/07/2021 13:50:22 - INFO - __main__ - Step 118201: {'lr': 5.4826055691261864e-05, 'samples': 22694592, 'steps': 118200, 'loss/train': 1.2149088382720947} 11/07/2021 13:50:22 - INFO - __main__ - Step 118202: {'lr': 5.482273949084957e-05, 'samples': 22694784, 'steps': 118201, 'loss/train': 1.364173173904419} 11/07/2021 13:50:23 - INFO - __main__ - Step 118203: {'lr': 5.481942337838111e-05, 'samples': 22694976, 'steps': 118202, 'loss/train': 0.9008709788322449} 11/07/2021 13:50:23 - INFO - __main__ - Step 118204: {'lr': 5.4816107353858033e-05, 'samples': 22695168, 'steps': 118203, 'loss/train': 1.381041169166565} 11/07/2021 13:50:23 - INFO - __main__ - Step 118205: {'lr': 5.4812791417281796e-05, 'samples': 22695360, 'steps': 118204, 'loss/train': 1.3723740577697754} 11/07/2021 13:50:24 - INFO - __main__ - Step 118206: {'lr': 5.480947556865387e-05, 'samples': 22695552, 'steps': 118205, 'loss/train': 0.9247915744781494} 11/07/2021 13:50:25 - INFO - __main__ - Step 118207: {'lr': 5.480615980797582e-05, 'samples': 22695744, 'steps': 118206, 'loss/train': 1.5592879056930542} 11/07/2021 13:50:25 - INFO - __main__ - Step 118208: {'lr': 5.480284413524908e-05, 'samples': 22695936, 'steps': 118207, 'loss/train': 1.4135963916778564} 11/07/2021 13:50:25 - INFO - __main__ - Step 118209: {'lr': 5.479952855047527e-05, 'samples': 22696128, 'steps': 118208, 'loss/train': 0.9167602062225342} 11/07/2021 13:50:26 - INFO - __main__ - Step 118210: {'lr': 5.479621305365567e-05, 'samples': 22696320, 'steps': 118209, 'loss/train': 1.0440089702606201} 11/07/2021 13:50:27 - INFO - __main__ - Step 118211: {'lr': 5.4792897644791925e-05, 'samples': 22696512, 'steps': 118210, 'loss/train': 1.8005743026733398} 11/07/2021 13:50:27 - INFO - __main__ - Step 118212: {'lr': 5.478958232388545e-05, 'samples': 22696704, 'steps': 118211, 'loss/train': 1.27495276927948} 11/07/2021 13:50:27 - INFO - __main__ - Step 118213: {'lr': 5.4786267090937787e-05, 'samples': 22696896, 'steps': 118212, 'loss/train': 1.1895060539245605} 11/07/2021 13:50:28 - INFO - __main__ - Step 118214: {'lr': 5.4782951945950424e-05, 'samples': 22697088, 'steps': 118213, 'loss/train': 1.514763355255127} 11/07/2021 13:50:28 - INFO - __main__ - Step 118215: {'lr': 5.477963688892487e-05, 'samples': 22697280, 'steps': 118214, 'loss/train': 0.6036431193351746} 11/07/2021 13:50:29 - INFO - __main__ - Step 118216: {'lr': 5.4776321919862565e-05, 'samples': 22697472, 'steps': 118215, 'loss/train': 1.2572158575057983} 11/07/2021 13:50:30 - INFO - __main__ - Step 118217: {'lr': 5.477300703876506e-05, 'samples': 22697664, 'steps': 118216, 'loss/train': 1.183315634727478} 11/07/2021 13:50:30 - INFO - __main__ - Step 118218: {'lr': 5.476969224563383e-05, 'samples': 22697856, 'steps': 118217, 'loss/train': 1.452109456062317} 11/07/2021 13:50:30 - INFO - __main__ - Step 118219: {'lr': 5.476637754047034e-05, 'samples': 22698048, 'steps': 118218, 'loss/train': 1.1350566148757935} 11/07/2021 13:50:31 - INFO - __main__ - Step 118220: {'lr': 5.476306292327618e-05, 'samples': 22698240, 'steps': 118219, 'loss/train': 1.0153310298919678} 11/07/2021 13:50:32 - INFO - __main__ - Step 118221: {'lr': 5.475974839405273e-05, 'samples': 22698432, 'steps': 118220, 'loss/train': 1.1324565410614014} 11/07/2021 13:50:32 - INFO - __main__ - Step 118222: {'lr': 5.47564339528015e-05, 'samples': 22698624, 'steps': 118221, 'loss/train': 1.4394844770431519} 11/07/2021 13:50:32 - INFO - __main__ - Step 118223: {'lr': 5.4753119599524006e-05, 'samples': 22698816, 'steps': 118222, 'loss/train': 1.3109767436981201} 11/07/2021 13:50:33 - INFO - __main__ - Step 118224: {'lr': 5.474980533422175e-05, 'samples': 22699008, 'steps': 118223, 'loss/train': 1.1299498081207275} 11/07/2021 13:50:33 - INFO - __main__ - Step 118225: {'lr': 5.474649115689623e-05, 'samples': 22699200, 'steps': 118224, 'loss/train': 1.165205478668213} 11/07/2021 13:50:34 - INFO - __main__ - Step 118226: {'lr': 5.474317706754892e-05, 'samples': 22699392, 'steps': 118225, 'loss/train': 0.7619900107383728} 11/07/2021 13:50:35 - INFO - __main__ - Step 118227: {'lr': 5.473986306618131e-05, 'samples': 22699584, 'steps': 118226, 'loss/train': 0.8696517944335938} 11/07/2021 13:50:35 - INFO - __main__ - Step 118228: {'lr': 5.473654915279491e-05, 'samples': 22699776, 'steps': 118227, 'loss/train': 0.5977702736854553} 11/07/2021 13:50:35 - INFO - __main__ - Step 118229: {'lr': 5.473323532739122e-05, 'samples': 22699968, 'steps': 118228, 'loss/train': 1.1948108673095703} 11/07/2021 13:50:36 - INFO - __main__ - Step 118230: {'lr': 5.4729921589971726e-05, 'samples': 22700160, 'steps': 118229, 'loss/train': 1.5744940042495728} 11/07/2021 13:50:36 - INFO - __main__ - Step 118231: {'lr': 5.472660794053796e-05, 'samples': 22700352, 'steps': 118230, 'loss/train': 1.2448211908340454} 11/07/2021 13:50:37 - INFO - __main__ - Step 118232: {'lr': 5.472329437909132e-05, 'samples': 22700544, 'steps': 118231, 'loss/train': 1.4191677570343018} 11/07/2021 13:50:37 - INFO - __main__ - Step 118233: {'lr': 5.4719980905633346e-05, 'samples': 22700736, 'steps': 118232, 'loss/train': 1.5137237310409546} 11/07/2021 13:50:38 - INFO - __main__ - Step 118234: {'lr': 5.471666752016552e-05, 'samples': 22700928, 'steps': 118233, 'loss/train': 1.0739768743515015} 11/07/2021 13:50:38 - INFO - __main__ - Step 118235: {'lr': 5.4713354222689385e-05, 'samples': 22701120, 'steps': 118234, 'loss/train': 1.271859049797058} 11/07/2021 13:50:38 - INFO - __main__ - Step 118236: {'lr': 5.471004101320637e-05, 'samples': 22701312, 'steps': 118235, 'loss/train': 1.0820908546447754} 11/07/2021 13:50:39 - INFO - __main__ - Step 118237: {'lr': 5.470672789171802e-05, 'samples': 22701504, 'steps': 118236, 'loss/train': 1.4535733461380005} 11/07/2021 13:50:40 - INFO - __main__ - Step 118238: {'lr': 5.4703414858225777e-05, 'samples': 22701696, 'steps': 118237, 'loss/train': 1.5726417303085327} 11/07/2021 13:50:40 - INFO - __main__ - Step 118239: {'lr': 5.470010191273117e-05, 'samples': 22701888, 'steps': 118238, 'loss/train': 0.9312129616737366} 11/07/2021 13:50:40 - INFO - __main__ - Step 118240: {'lr': 5.46967890552357e-05, 'samples': 22702080, 'steps': 118239, 'loss/train': 0.9384392499923706} 11/07/2021 13:50:41 - INFO - __main__ - Step 118241: {'lr': 5.4693476285740813e-05, 'samples': 22702272, 'steps': 118240, 'loss/train': 1.8645721673965454} 11/07/2021 13:50:42 - INFO - __main__ - Step 118242: {'lr': 5.4690163604248137e-05, 'samples': 22702464, 'steps': 118241, 'loss/train': 1.328977108001709} 11/07/2021 13:50:42 - INFO - __main__ - Step 118243: {'lr': 5.468685101075896e-05, 'samples': 22702656, 'steps': 118242, 'loss/train': 1.4028328657150269} 11/07/2021 13:50:43 - INFO - __main__ - Step 118244: {'lr': 5.468353850527488e-05, 'samples': 22702848, 'steps': 118243, 'loss/train': 1.1783217191696167} 11/07/2021 13:50:43 - INFO - __main__ - Step 118245: {'lr': 5.468022608779741e-05, 'samples': 22703040, 'steps': 118244, 'loss/train': 1.4780373573303223} 11/07/2021 13:50:43 - INFO - __main__ - Step 118246: {'lr': 5.467691375832798e-05, 'samples': 22703232, 'steps': 118245, 'loss/train': 0.4692501425743103} 11/07/2021 13:50:44 - INFO - __main__ - Step 118247: {'lr': 5.467360151686815e-05, 'samples': 22703424, 'steps': 118246, 'loss/train': 1.2687721252441406} 11/07/2021 13:50:45 - INFO - __main__ - Step 118248: {'lr': 5.4670289363419365e-05, 'samples': 22703616, 'steps': 118247, 'loss/train': 1.1956604719161987} 11/07/2021 13:50:45 - INFO - __main__ - Step 118249: {'lr': 5.466697729798314e-05, 'samples': 22703808, 'steps': 118248, 'loss/train': 1.7355228662490845} 11/07/2021 13:50:45 - INFO - __main__ - Step 118250: {'lr': 5.466366532056094e-05, 'samples': 22704000, 'steps': 118249, 'loss/train': 1.2877181768417358} 11/07/2021 13:50:46 - INFO - __main__ - Step 118251: {'lr': 5.4660353431154306e-05, 'samples': 22704192, 'steps': 118250, 'loss/train': 1.37416410446167} 11/07/2021 13:50:47 - INFO - __main__ - Step 118252: {'lr': 5.4657041629764674e-05, 'samples': 22704384, 'steps': 118251, 'loss/train': 1.4632220268249512} 11/07/2021 13:50:47 - INFO - __main__ - Step 118253: {'lr': 5.465372991639367e-05, 'samples': 22704576, 'steps': 118252, 'loss/train': 1.1408703327178955} 11/07/2021 13:50:48 - INFO - __main__ - Step 118254: {'lr': 5.4650418291042584e-05, 'samples': 22704768, 'steps': 118253, 'loss/train': 0.743631899356842} 11/07/2021 13:50:48 - INFO - __main__ - Step 118255: {'lr': 5.4647106753713014e-05, 'samples': 22704960, 'steps': 118254, 'loss/train': 1.1272379159927368} 11/07/2021 13:50:48 - INFO - __main__ - Step 118256: {'lr': 5.464379530440644e-05, 'samples': 22705152, 'steps': 118255, 'loss/train': 1.4979636669158936} 11/07/2021 13:50:49 - INFO - __main__ - Step 118257: {'lr': 5.4640483943124376e-05, 'samples': 22705344, 'steps': 118256, 'loss/train': 0.6927489042282104} 11/07/2021 13:50:50 - INFO - __main__ - Step 118258: {'lr': 5.463717266986826e-05, 'samples': 22705536, 'steps': 118257, 'loss/train': 1.3344007730484009} 11/07/2021 13:50:50 - INFO - __main__ - Step 118259: {'lr': 5.4633861484639644e-05, 'samples': 22705728, 'steps': 118258, 'loss/train': 1.1561849117279053} 11/07/2021 13:50:50 - INFO - __main__ - Step 118260: {'lr': 5.463055038744e-05, 'samples': 22705920, 'steps': 118259, 'loss/train': 0.0479864701628685} 11/07/2021 13:50:51 - INFO - __main__ - Step 118261: {'lr': 5.46272393782708e-05, 'samples': 22706112, 'steps': 118260, 'loss/train': 1.288417935371399} 11/07/2021 13:50:52 - INFO - __main__ - Step 118262: {'lr': 5.4623928457133545e-05, 'samples': 22706304, 'steps': 118261, 'loss/train': 1.5709353685379028} 11/07/2021 13:50:52 - INFO - __main__ - Step 118263: {'lr': 5.4620617624029755e-05, 'samples': 22706496, 'steps': 118262, 'loss/train': 1.4163079261779785} 11/07/2021 13:50:53 - INFO - __main__ - Step 118264: {'lr': 5.4617306878960885e-05, 'samples': 22706688, 'steps': 118263, 'loss/train': 1.2136821746826172} 11/07/2021 13:50:53 - INFO - __main__ - Step 118265: {'lr': 5.4613996221928505e-05, 'samples': 22706880, 'steps': 118264, 'loss/train': 0.8755255937576294} 11/07/2021 13:50:53 - INFO - __main__ - Step 118266: {'lr': 5.461068565293401e-05, 'samples': 22707072, 'steps': 118265, 'loss/train': 1.3393146991729736} 11/07/2021 13:50:54 - INFO - __main__ - Step 118267: {'lr': 5.460737517197889e-05, 'samples': 22707264, 'steps': 118266, 'loss/train': 0.914094090461731} 11/07/2021 13:50:55 - INFO - __main__ - Step 118268: {'lr': 5.460406477906468e-05, 'samples': 22707456, 'steps': 118267, 'loss/train': 1.2298773527145386} 11/07/2021 13:50:55 - INFO - __main__ - Step 118269: {'lr': 5.460075447419286e-05, 'samples': 22707648, 'steps': 118268, 'loss/train': 2.0714261531829834} 11/07/2021 13:50:55 - INFO - __main__ - Step 118270: {'lr': 5.459744425736493e-05, 'samples': 22707840, 'steps': 118269, 'loss/train': 1.1238880157470703} 11/07/2021 13:50:56 - INFO - __main__ - Step 118271: {'lr': 5.459413412858236e-05, 'samples': 22708032, 'steps': 118270, 'loss/train': 0.8689592480659485} 11/07/2021 13:50:57 - INFO - __main__ - Step 118272: {'lr': 5.4590824087846686e-05, 'samples': 22708224, 'steps': 118271, 'loss/train': 0.8310273885726929} 11/07/2021 13:50:57 - INFO - __main__ - Step 118273: {'lr': 5.4587514135159364e-05, 'samples': 22708416, 'steps': 118272, 'loss/train': 1.1097190380096436} 11/07/2021 13:50:57 - INFO - __main__ - Step 118274: {'lr': 5.458420427052188e-05, 'samples': 22708608, 'steps': 118273, 'loss/train': 1.3954607248306274} 11/07/2021 13:50:58 - INFO - __main__ - Step 118275: {'lr': 5.4580894493935745e-05, 'samples': 22708800, 'steps': 118274, 'loss/train': 1.0488954782485962} 11/07/2021 13:50:58 - INFO - __main__ - Step 118276: {'lr': 5.457758480540245e-05, 'samples': 22708992, 'steps': 118275, 'loss/train': 1.204209327697754} 11/07/2021 13:50:58 - INFO - __main__ - Step 118277: {'lr': 5.4574275204923476e-05, 'samples': 22709184, 'steps': 118276, 'loss/train': 0.8667522668838501} 11/07/2021 13:51:00 - INFO - __main__ - Step 118278: {'lr': 5.4570965692500305e-05, 'samples': 22709376, 'steps': 118277, 'loss/train': 1.108146071434021} 11/07/2021 13:51:00 - INFO - __main__ - Step 118279: {'lr': 5.456765626813451e-05, 'samples': 22709568, 'steps': 118278, 'loss/train': 1.6048730611801147} 11/07/2021 13:51:00 - INFO - __main__ - Step 118280: {'lr': 5.4564346931827465e-05, 'samples': 22709760, 'steps': 118279, 'loss/train': 1.1949639320373535} 11/07/2021 13:51:01 - INFO - __main__ - Step 118281: {'lr': 5.45610376835807e-05, 'samples': 22709952, 'steps': 118280, 'loss/train': 0.8461679220199585} 11/07/2021 13:51:01 - INFO - __main__ - Step 118282: {'lr': 5.4557728523395717e-05, 'samples': 22710144, 'steps': 118281, 'loss/train': 1.354655385017395} 11/07/2021 13:51:02 - INFO - __main__ - Step 118283: {'lr': 5.4554419451274014e-05, 'samples': 22710336, 'steps': 118282, 'loss/train': 1.7977696657180786} 11/07/2021 13:51:02 - INFO - __main__ - Step 118284: {'lr': 5.455111046721706e-05, 'samples': 22710528, 'steps': 118283, 'loss/train': 1.535706877708435} 11/07/2021 13:51:03 - INFO - __main__ - Step 118285: {'lr': 5.454780157122635e-05, 'samples': 22710720, 'steps': 118284, 'loss/train': 1.3643736839294434} 11/07/2021 13:51:03 - INFO - __main__ - Step 118286: {'lr': 5.454449276330339e-05, 'samples': 22710912, 'steps': 118285, 'loss/train': 1.1886459589004517} 11/07/2021 13:51:03 - INFO - __main__ - Step 118287: {'lr': 5.454118404344968e-05, 'samples': 22711104, 'steps': 118286, 'loss/train': 1.351986050605774} 11/07/2021 13:51:04 - INFO - __main__ - Step 118288: {'lr': 5.453787541166669e-05, 'samples': 22711296, 'steps': 118287, 'loss/train': 1.1333731412887573} 11/07/2021 13:51:05 - INFO - __main__ - Step 118289: {'lr': 5.4534566867955906e-05, 'samples': 22711488, 'steps': 118288, 'loss/train': 1.4761418104171753} 11/07/2021 13:51:05 - INFO - __main__ - Step 118290: {'lr': 5.453125841231884e-05, 'samples': 22711680, 'steps': 118289, 'loss/train': 1.0997027158737183} 11/07/2021 13:51:05 - INFO - __main__ - Step 118291: {'lr': 5.4527950044757e-05, 'samples': 22711872, 'steps': 118290, 'loss/train': 1.5374560356140137} 11/07/2021 13:51:06 - INFO - __main__ - Step 118292: {'lr': 5.452464176527189e-05, 'samples': 22712064, 'steps': 118291, 'loss/train': 1.3515087366104126} 11/07/2021 13:51:07 - INFO - __main__ - Step 118293: {'lr': 5.4521333573864875e-05, 'samples': 22712256, 'steps': 118292, 'loss/train': 0.7869061827659607} 11/07/2021 13:51:07 - INFO - __main__ - Step 118294: {'lr': 5.451802547053755e-05, 'samples': 22712448, 'steps': 118293, 'loss/train': 1.4251457452774048} 11/07/2021 13:51:08 - INFO - __main__ - Step 118295: {'lr': 5.451471745529138e-05, 'samples': 22712640, 'steps': 118294, 'loss/train': 1.3288050889968872} 11/07/2021 13:51:08 - INFO - __main__ - Step 118296: {'lr': 5.451140952812789e-05, 'samples': 22712832, 'steps': 118295, 'loss/train': 1.572685956954956} 11/07/2021 13:51:08 - INFO - __main__ - Step 118297: {'lr': 5.450810168904852e-05, 'samples': 22713024, 'steps': 118296, 'loss/train': 1.2723618745803833} 11/07/2021 13:51:09 - INFO - __main__ - Step 118298: {'lr': 5.450479393805477e-05, 'samples': 22713216, 'steps': 118297, 'loss/train': 1.210899829864502} 11/07/2021 13:51:10 - INFO - __main__ - Step 118299: {'lr': 5.4501486275148145e-05, 'samples': 22713408, 'steps': 118298, 'loss/train': 1.6015766859054565} 11/07/2021 13:51:10 - INFO - __main__ - Step 118300: {'lr': 5.449817870033013e-05, 'samples': 22713600, 'steps': 118299, 'loss/train': 0.8754652142524719} 11/07/2021 13:51:11 - INFO - __main__ - Step 118301: {'lr': 5.449487121360225e-05, 'samples': 22713792, 'steps': 118300, 'loss/train': 1.1890318393707275} 11/07/2021 13:51:11 - INFO - __main__ - Step 118302: {'lr': 5.449156381496595e-05, 'samples': 22713984, 'steps': 118301, 'loss/train': 1.4449735879898071} 11/07/2021 13:51:11 - INFO - __main__ - Step 118303: {'lr': 5.448825650442274e-05, 'samples': 22714176, 'steps': 118302, 'loss/train': 1.5070199966430664} 11/07/2021 13:51:12 - INFO - __main__ - Step 118304: {'lr': 5.448494928197409e-05, 'samples': 22714368, 'steps': 118303, 'loss/train': 1.3297128677368164} 11/07/2021 13:51:13 - INFO - __main__ - Step 118305: {'lr': 5.448164214762158e-05, 'samples': 22714560, 'steps': 118304, 'loss/train': 1.55649995803833} 11/07/2021 13:51:13 - INFO - __main__ - Step 118306: {'lr': 5.4478335101366546e-05, 'samples': 22714752, 'steps': 118305, 'loss/train': 1.000915765762329} 11/07/2021 13:51:13 - INFO - __main__ - Step 118307: {'lr': 5.4475028143210563e-05, 'samples': 22714944, 'steps': 118306, 'loss/train': 1.2024599313735962} 11/07/2021 13:51:14 - INFO - __main__ - Step 118308: {'lr': 5.447172127315514e-05, 'samples': 22715136, 'steps': 118307, 'loss/train': 0.8229005932807922} 11/07/2021 13:51:15 - INFO - __main__ - Step 118309: {'lr': 5.446841449120171e-05, 'samples': 22715328, 'steps': 118308, 'loss/train': 0.5700790882110596} 11/07/2021 13:51:15 - INFO - __main__ - Step 118310: {'lr': 5.446510779735181e-05, 'samples': 22715520, 'steps': 118309, 'loss/train': 1.0014495849609375} 11/07/2021 13:51:15 - INFO - __main__ - Step 118311: {'lr': 5.44618011916069e-05, 'samples': 22715712, 'steps': 118310, 'loss/train': 1.278857946395874} 11/07/2021 13:51:16 - INFO - __main__ - Step 118312: {'lr': 5.445849467396849e-05, 'samples': 22715904, 'steps': 118311, 'loss/train': 1.7093966007232666} 11/07/2021 13:51:16 - INFO - __main__ - Step 118313: {'lr': 5.445518824443807e-05, 'samples': 22716096, 'steps': 118312, 'loss/train': 0.539777398109436} 11/07/2021 13:51:17 - INFO - __main__ - Step 118314: {'lr': 5.4451881903017144e-05, 'samples': 22716288, 'steps': 118313, 'loss/train': 1.5389994382858276} 11/07/2021 13:51:18 - INFO - __main__ - Step 118315: {'lr': 5.4448575649707146e-05, 'samples': 22716480, 'steps': 118314, 'loss/train': 1.0267689228057861} 11/07/2021 13:51:18 - INFO - __main__ - Step 118316: {'lr': 5.444526948450965e-05, 'samples': 22716672, 'steps': 118315, 'loss/train': 1.4681837558746338} 11/07/2021 13:51:18 - INFO - __main__ - Step 118317: {'lr': 5.444196340742605e-05, 'samples': 22716864, 'steps': 118316, 'loss/train': 1.454344630241394} 11/07/2021 13:51:19 - INFO - __main__ - Step 118318: {'lr': 5.443865741845791e-05, 'samples': 22717056, 'steps': 118317, 'loss/train': 1.4721766710281372} 11/07/2021 13:51:20 - INFO - __main__ - Step 118319: {'lr': 5.443535151760676e-05, 'samples': 22717248, 'steps': 118318, 'loss/train': 1.3119258880615234} 11/07/2021 13:51:20 - INFO - __main__ - Step 118320: {'lr': 5.443204570487395e-05, 'samples': 22717440, 'steps': 118319, 'loss/train': 1.0689915418624878} 11/07/2021 13:51:20 - INFO - __main__ - Step 118321: {'lr': 5.4428739980261014e-05, 'samples': 22717632, 'steps': 118320, 'loss/train': 1.7348072528839111} 11/07/2021 13:51:21 - INFO - __main__ - Step 118322: {'lr': 5.442543434376951e-05, 'samples': 22717824, 'steps': 118321, 'loss/train': 1.5033522844314575} 11/07/2021 13:51:21 - INFO - __main__ - Step 118323: {'lr': 5.442212879540087e-05, 'samples': 22718016, 'steps': 118322, 'loss/train': 1.3716051578521729} 11/07/2021 13:51:22 - INFO - __main__ - Step 118324: {'lr': 5.44188233351566e-05, 'samples': 22718208, 'steps': 118323, 'loss/train': 1.4443790912628174} 11/07/2021 13:51:22 - INFO - __main__ - Step 118325: {'lr': 5.44155179630382e-05, 'samples': 22718400, 'steps': 118324, 'loss/train': 1.1069883108139038} 11/07/2021 13:51:23 - INFO - __main__ - Step 118326: {'lr': 5.4412212679047144e-05, 'samples': 22718592, 'steps': 118325, 'loss/train': 0.8924340605735779} 11/07/2021 13:51:23 - INFO - __main__ - Step 118327: {'lr': 5.440890748318494e-05, 'samples': 22718784, 'steps': 118326, 'loss/train': 1.1151080131530762} 11/07/2021 13:51:23 - INFO - __main__ - Step 118328: {'lr': 5.440560237545306e-05, 'samples': 22718976, 'steps': 118327, 'loss/train': 1.101677417755127} 11/07/2021 13:51:25 - INFO - __main__ - Step 118329: {'lr': 5.440229735585297e-05, 'samples': 22719168, 'steps': 118328, 'loss/train': 1.3844859600067139} 11/07/2021 13:51:25 - INFO - __main__ - Step 118330: {'lr': 5.4398992424386204e-05, 'samples': 22719360, 'steps': 118329, 'loss/train': 1.3378357887268066} 11/07/2021 13:51:25 - INFO - __main__ - Step 118331: {'lr': 5.439568758105423e-05, 'samples': 22719552, 'steps': 118330, 'loss/train': 0.9660530090332031} 11/07/2021 13:51:26 - INFO - __main__ - Step 118332: {'lr': 5.439238282585862e-05, 'samples': 22719744, 'steps': 118331, 'loss/train': 1.144402027130127} 11/07/2021 13:51:26 - INFO - __main__ - Step 118333: {'lr': 5.438907815880073e-05, 'samples': 22719936, 'steps': 118332, 'loss/train': 1.3077906370162964} 11/07/2021 13:51:27 - INFO - __main__ - Step 118334: {'lr': 5.438577357988206e-05, 'samples': 22720128, 'steps': 118333, 'loss/train': 1.3177616596221924} 11/07/2021 13:51:27 - INFO - __main__ - Step 118335: {'lr': 5.4382469089104185e-05, 'samples': 22720320, 'steps': 118334, 'loss/train': 0.8160459399223328} 11/07/2021 13:51:28 - INFO - __main__ - Step 118336: {'lr': 5.437916468646853e-05, 'samples': 22720512, 'steps': 118335, 'loss/train': 1.514315128326416} 11/07/2021 13:51:28 - INFO - __main__ - Step 118337: {'lr': 5.4375860371976585e-05, 'samples': 22720704, 'steps': 118336, 'loss/train': 1.5111095905303955} 11/07/2021 13:51:28 - INFO - __main__ - Step 118338: {'lr': 5.437255614562989e-05, 'samples': 22720896, 'steps': 118337, 'loss/train': 1.5356212854385376} 11/07/2021 13:51:29 - INFO - __main__ - Step 118339: {'lr': 5.43692520074299e-05, 'samples': 22721088, 'steps': 118338, 'loss/train': 1.4499835968017578} 11/07/2021 13:51:30 - INFO - __main__ - Step 118340: {'lr': 5.4365947957378094e-05, 'samples': 22721280, 'steps': 118339, 'loss/train': 1.1680387258529663} 11/07/2021 13:51:30 - INFO - __main__ - Step 118341: {'lr': 5.436264399547597e-05, 'samples': 22721472, 'steps': 118340, 'loss/train': 1.589959740638733} 11/07/2021 13:51:30 - INFO - __main__ - Step 118342: {'lr': 5.435934012172503e-05, 'samples': 22721664, 'steps': 118341, 'loss/train': 1.3771470785140991} 11/07/2021 13:51:31 - INFO - __main__ - Step 118343: {'lr': 5.435603633612676e-05, 'samples': 22721856, 'steps': 118342, 'loss/train': 1.7051541805267334} 11/07/2021 13:51:31 - INFO - __main__ - Step 118344: {'lr': 5.435273263868262e-05, 'samples': 22722048, 'steps': 118343, 'loss/train': 1.5344479084014893} 11/07/2021 13:51:32 - INFO - __main__ - Step 118345: {'lr': 5.4349429029394135e-05, 'samples': 22722240, 'steps': 118344, 'loss/train': 1.5337005853652954} 11/07/2021 13:51:33 - INFO - __main__ - Step 118346: {'lr': 5.4346125508262845e-05, 'samples': 22722432, 'steps': 118345, 'loss/train': 1.4485841989517212} 11/07/2021 13:51:33 - INFO - __main__ - Step 118347: {'lr': 5.434282207529009e-05, 'samples': 22722624, 'steps': 118346, 'loss/train': 1.4105098247528076} 11/07/2021 13:51:33 - INFO - __main__ - Step 118348: {'lr': 5.433951873047746e-05, 'samples': 22722816, 'steps': 118347, 'loss/train': 1.1879156827926636} 11/07/2021 13:51:34 - INFO - __main__ - Step 118349: {'lr': 5.433621547382642e-05, 'samples': 22723008, 'steps': 118348, 'loss/train': 1.2308988571166992} 11/07/2021 13:51:35 - INFO - __main__ - Step 118350: {'lr': 5.433291230533843e-05, 'samples': 22723200, 'steps': 118349, 'loss/train': 1.4557974338531494} 11/07/2021 13:51:35 - INFO - __main__ - Step 118351: {'lr': 5.4329609225015035e-05, 'samples': 22723392, 'steps': 118350, 'loss/train': 1.7667202949523926} 11/07/2021 13:51:35 - INFO - __main__ - Step 118352: {'lr': 5.4326306232857724e-05, 'samples': 22723584, 'steps': 118351, 'loss/train': 1.2008506059646606} 11/07/2021 13:51:36 - INFO - __main__ - Step 118353: {'lr': 5.432300332886792e-05, 'samples': 22723776, 'steps': 118352, 'loss/train': 1.962278127670288} 11/07/2021 13:51:36 - INFO - __main__ - Step 118354: {'lr': 5.431970051304716e-05, 'samples': 22723968, 'steps': 118353, 'loss/train': 1.476412296295166} 11/07/2021 13:51:37 - INFO - __main__ - Step 118355: {'lr': 5.4316397785396934e-05, 'samples': 22724160, 'steps': 118354, 'loss/train': 1.3332678079605103} 11/07/2021 13:51:37 - INFO - __main__ - Step 118356: {'lr': 5.431309514591873e-05, 'samples': 22724352, 'steps': 118355, 'loss/train': 1.1467362642288208} 11/07/2021 13:51:38 - INFO - __main__ - Step 118357: {'lr': 5.4309792594613996e-05, 'samples': 22724544, 'steps': 118356, 'loss/train': 0.944844663143158} 11/07/2021 13:51:38 - INFO - __main__ - Step 118358: {'lr': 5.430649013148428e-05, 'samples': 22724736, 'steps': 118357, 'loss/train': 1.3182796239852905} 11/07/2021 13:51:39 - INFO - __main__ - Step 118359: {'lr': 5.430318775653109e-05, 'samples': 22724928, 'steps': 118358, 'loss/train': 1.5349624156951904} 11/07/2021 13:51:39 - INFO - __main__ - Step 118360: {'lr': 5.429988546975581e-05, 'samples': 22725120, 'steps': 118359, 'loss/train': 1.452754259109497} 11/07/2021 13:51:40 - INFO - __main__ - Step 118361: {'lr': 5.4296583271159964e-05, 'samples': 22725312, 'steps': 118360, 'loss/train': 1.1732826232910156} 11/07/2021 13:51:40 - INFO - __main__ - Step 118362: {'lr': 5.429328116074505e-05, 'samples': 22725504, 'steps': 118361, 'loss/train': 0.9922409653663635} 11/07/2021 13:51:41 - INFO - __main__ - Step 118363: {'lr': 5.428997913851258e-05, 'samples': 22725696, 'steps': 118362, 'loss/train': 1.481993556022644} 11/07/2021 13:51:41 - INFO - __main__ - Step 118364: {'lr': 5.4286677204464037e-05, 'samples': 22725888, 'steps': 118363, 'loss/train': 1.7751857042312622} 11/07/2021 13:51:41 - INFO - __main__ - Step 118365: {'lr': 5.4283375358600866e-05, 'samples': 22726080, 'steps': 118364, 'loss/train': 1.591041088104248} 11/07/2021 13:51:42 - INFO - __main__ - Step 118366: {'lr': 5.4280073600924626e-05, 'samples': 22726272, 'steps': 118365, 'loss/train': 1.5968014001846313} 11/07/2021 13:51:43 - INFO - __main__ - Step 118367: {'lr': 5.4276771931436734e-05, 'samples': 22726464, 'steps': 118366, 'loss/train': 0.6783089637756348} 11/07/2021 13:51:43 - INFO - __main__ - Step 118368: {'lr': 5.4273470350138713e-05, 'samples': 22726656, 'steps': 118367, 'loss/train': 1.5192872285842896} 11/07/2021 13:51:43 - INFO - __main__ - Step 118369: {'lr': 5.427016885703206e-05, 'samples': 22726848, 'steps': 118368, 'loss/train': 1.5615862607955933} 11/07/2021 13:51:44 - INFO - __main__ - Step 118370: {'lr': 5.426686745211823e-05, 'samples': 22727040, 'steps': 118369, 'loss/train': 1.2060390710830688} 11/07/2021 13:51:45 - INFO - __main__ - Step 118371: {'lr': 5.426356613539873e-05, 'samples': 22727232, 'steps': 118370, 'loss/train': 1.3848905563354492} 11/07/2021 13:51:45 - INFO - __main__ - Step 118372: {'lr': 5.4260264906875054e-05, 'samples': 22727424, 'steps': 118371, 'loss/train': 1.3716342449188232} 11/07/2021 13:51:46 - INFO - __main__ - Step 118373: {'lr': 5.425696376654876e-05, 'samples': 22727616, 'steps': 118372, 'loss/train': 1.173021912574768} 11/07/2021 13:51:46 - INFO - __main__ - Step 118374: {'lr': 5.425366271442117e-05, 'samples': 22727808, 'steps': 118373, 'loss/train': 1.2162580490112305} 11/07/2021 13:51:46 - INFO - __main__ - Step 118375: {'lr': 5.425036175049389e-05, 'samples': 22728000, 'steps': 118374, 'loss/train': 1.2228457927703857} 11/07/2021 13:51:47 - INFO - __main__ - Step 118376: {'lr': 5.424706087476836e-05, 'samples': 22728192, 'steps': 118375, 'loss/train': 0.9370942711830139} 11/07/2021 13:51:48 - INFO - __main__ - Step 118377: {'lr': 5.4243760087246075e-05, 'samples': 22728384, 'steps': 118376, 'loss/train': 1.3792284727096558} 11/07/2021 13:51:48 - INFO - __main__ - Step 118378: {'lr': 5.424045938792852e-05, 'samples': 22728576, 'steps': 118377, 'loss/train': 1.0588535070419312} 11/07/2021 13:51:48 - INFO - __main__ - Step 118379: {'lr': 5.4237158776817204e-05, 'samples': 22728768, 'steps': 118378, 'loss/train': 1.2998497486114502} 11/07/2021 13:51:49 - INFO - __main__ - Step 118380: {'lr': 5.423385825391361e-05, 'samples': 22728960, 'steps': 118379, 'loss/train': 0.5560778379440308} 11/07/2021 13:51:50 - INFO - __main__ - Step 118381: {'lr': 5.4230557819219236e-05, 'samples': 22729152, 'steps': 118380, 'loss/train': 0.895350456237793} 11/07/2021 13:51:50 - INFO - __main__ - Step 118382: {'lr': 5.422725747273552e-05, 'samples': 22729344, 'steps': 118381, 'loss/train': 1.1791270971298218} 11/07/2021 13:51:50 - INFO - __main__ - Step 118383: {'lr': 5.4223957214463996e-05, 'samples': 22729536, 'steps': 118382, 'loss/train': 1.4932159185409546} 11/07/2021 13:51:51 - INFO - __main__ - Step 118384: {'lr': 5.4220657044406126e-05, 'samples': 22729728, 'steps': 118383, 'loss/train': 1.1775248050689697} 11/07/2021 13:51:51 - INFO - __main__ - Step 118385: {'lr': 5.4217356962563417e-05, 'samples': 22729920, 'steps': 118384, 'loss/train': 0.9797684550285339} 11/07/2021 13:51:52 - INFO - __main__ - Step 118386: {'lr': 5.4214056968937414e-05, 'samples': 22730112, 'steps': 118385, 'loss/train': 1.4721949100494385} 11/07/2021 13:51:53 - INFO - __main__ - Step 118387: {'lr': 5.421075706352946e-05, 'samples': 22730304, 'steps': 118386, 'loss/train': 0.8963158130645752} 11/07/2021 13:51:53 - INFO - __main__ - Step 118388: {'lr': 5.420745724634113e-05, 'samples': 22730496, 'steps': 118387, 'loss/train': 1.6220204830169678} 11/07/2021 13:51:53 - INFO - __main__ - Step 118389: {'lr': 5.42041575173739e-05, 'samples': 22730688, 'steps': 118388, 'loss/train': 1.433727502822876} 11/07/2021 13:51:54 - INFO - __main__ - Step 118390: {'lr': 5.4200857876629234e-05, 'samples': 22730880, 'steps': 118389, 'loss/train': 1.0794447660446167} 11/07/2021 13:51:55 - INFO - __main__ - Step 118391: {'lr': 5.4197558324108635e-05, 'samples': 22731072, 'steps': 118390, 'loss/train': 1.3758115768432617} 11/07/2021 13:51:55 - INFO - __main__ - Step 118392: {'lr': 5.419425885981363e-05, 'samples': 22731264, 'steps': 118391, 'loss/train': 1.444959282875061} 11/07/2021 13:51:55 - INFO - __main__ - Step 118393: {'lr': 5.419095948374564e-05, 'samples': 22731456, 'steps': 118392, 'loss/train': 0.9798300266265869} 11/07/2021 13:51:56 - INFO - __main__ - Step 118394: {'lr': 5.4187660195906205e-05, 'samples': 22731648, 'steps': 118393, 'loss/train': 0.9416541457176208} 11/07/2021 13:51:56 - INFO - __main__ - Step 118395: {'lr': 5.418436099629678e-05, 'samples': 22731840, 'steps': 118394, 'loss/train': 1.6661901473999023} 11/07/2021 13:51:57 - INFO - __main__ - Step 118396: {'lr': 5.418106188491886e-05, 'samples': 22732032, 'steps': 118395, 'loss/train': 1.277534008026123} 11/07/2021 13:51:58 - INFO - __main__ - Step 118397: {'lr': 5.417776286177392e-05, 'samples': 22732224, 'steps': 118396, 'loss/train': 1.029642939567566} 11/07/2021 13:51:58 - INFO - __main__ - Step 118398: {'lr': 5.4174463926863485e-05, 'samples': 22732416, 'steps': 118397, 'loss/train': 1.806950569152832} 11/07/2021 13:51:58 - INFO - __main__ - Step 118399: {'lr': 5.417116508018899e-05, 'samples': 22732608, 'steps': 118398, 'loss/train': 1.5864373445510864} 11/07/2021 13:51:59 - INFO - __main__ - Step 118400: {'lr': 5.4167866321752025e-05, 'samples': 22732800, 'steps': 118399, 'loss/train': 1.747491478919983} 11/07/2021 13:51:59 - INFO - __main__ - Step 118401: {'lr': 5.416456765155392e-05, 'samples': 22732992, 'steps': 118400, 'loss/train': 1.467047095298767} 11/07/2021 13:52:00 - INFO - __main__ - Step 118402: {'lr': 5.416126906959626e-05, 'samples': 22733184, 'steps': 118401, 'loss/train': 1.5038731098175049} 11/07/2021 13:52:01 - INFO - __main__ - Step 118403: {'lr': 5.4157970575880486e-05, 'samples': 22733376, 'steps': 118402, 'loss/train': 1.0742123126983643} 11/07/2021 13:52:01 - INFO - __main__ - Step 118404: {'lr': 5.415467217040812e-05, 'samples': 22733568, 'steps': 118403, 'loss/train': 1.4743112325668335} 11/07/2021 13:52:01 - INFO - __main__ - Step 118405: {'lr': 5.415137385318064e-05, 'samples': 22733760, 'steps': 118404, 'loss/train': 1.286588430404663} 11/07/2021 13:52:02 - INFO - __main__ - Step 118406: {'lr': 5.414807562419951e-05, 'samples': 22733952, 'steps': 118405, 'loss/train': 0.7688052654266357} 11/07/2021 13:52:02 - INFO - __main__ - Step 118407: {'lr': 5.414477748346625e-05, 'samples': 22734144, 'steps': 118406, 'loss/train': 1.1230096817016602} 11/07/2021 13:52:03 - INFO - __main__ - Step 118408: {'lr': 5.414147943098233e-05, 'samples': 22734336, 'steps': 118407, 'loss/train': 1.3989684581756592} 11/07/2021 13:52:03 - INFO - __main__ - Step 118409: {'lr': 5.4138181466749256e-05, 'samples': 22734528, 'steps': 118408, 'loss/train': 1.7244811058044434} 11/07/2021 13:52:04 - INFO - __main__ - Step 118410: {'lr': 5.413488359076846e-05, 'samples': 22734720, 'steps': 118409, 'loss/train': 0.973365306854248} 11/07/2021 13:52:04 - INFO - __main__ - Step 118411: {'lr': 5.413158580304148e-05, 'samples': 22734912, 'steps': 118410, 'loss/train': 1.6212079524993896} 11/07/2021 13:52:05 - INFO - __main__ - Step 118412: {'lr': 5.412828810356979e-05, 'samples': 22735104, 'steps': 118411, 'loss/train': 0.04428703337907791} 11/07/2021 13:52:06 - INFO - __main__ - Step 118413: {'lr': 5.412499049235495e-05, 'samples': 22735296, 'steps': 118412, 'loss/train': 0.7411783337593079} 11/07/2021 13:52:06 - INFO - __main__ - Step 118414: {'lr': 5.412169296939826e-05, 'samples': 22735488, 'steps': 118413, 'loss/train': 1.0569504499435425} 11/07/2021 13:52:06 - INFO - __main__ - Step 118415: {'lr': 5.411839553470135e-05, 'samples': 22735680, 'steps': 118414, 'loss/train': 1.5147650241851807} 11/07/2021 13:52:07 - INFO - __main__ - Step 118416: {'lr': 5.411509818826566e-05, 'samples': 22735872, 'steps': 118415, 'loss/train': 1.6027629375457764} 11/07/2021 13:52:07 - INFO - __main__ - Step 118417: {'lr': 5.411180093009266e-05, 'samples': 22736064, 'steps': 118416, 'loss/train': 1.1819854974746704} 11/07/2021 13:52:08 - INFO - __main__ - Step 118418: {'lr': 5.4108503760183895e-05, 'samples': 22736256, 'steps': 118417, 'loss/train': 0.9241877198219299} 11/07/2021 13:52:08 - INFO - __main__ - Step 118419: {'lr': 5.410520667854077e-05, 'samples': 22736448, 'steps': 118418, 'loss/train': 1.3611290454864502} 11/07/2021 13:52:09 - INFO - __main__ - Step 118420: {'lr': 5.4101909685164845e-05, 'samples': 22736640, 'steps': 118419, 'loss/train': 0.06372004002332687} 11/07/2021 13:52:09 - INFO - __main__ - Step 118421: {'lr': 5.4098612780057595e-05, 'samples': 22736832, 'steps': 118420, 'loss/train': 0.8494365811347961} 11/07/2021 13:52:09 - INFO - __main__ - Step 118422: {'lr': 5.4095315963220455e-05, 'samples': 22737024, 'steps': 118421, 'loss/train': 1.152573585510254} 11/07/2021 13:52:11 - INFO - __main__ - Step 118423: {'lr': 5.4092019234654955e-05, 'samples': 22737216, 'steps': 118422, 'loss/train': 1.7419835329055786} 11/07/2021 13:52:11 - INFO - __main__ - Step 118424: {'lr': 5.408872259436257e-05, 'samples': 22737408, 'steps': 118423, 'loss/train': 1.1788086891174316} 11/07/2021 13:52:11 - INFO - __main__ - Step 118425: {'lr': 5.408542604234479e-05, 'samples': 22737600, 'steps': 118424, 'loss/train': 1.1203930377960205} 11/07/2021 13:52:12 - INFO - __main__ - Step 118426: {'lr': 5.4082129578603146e-05, 'samples': 22737792, 'steps': 118425, 'loss/train': 1.355332374572754} 11/07/2021 13:52:12 - INFO - __main__ - Step 118427: {'lr': 5.4078833203139e-05, 'samples': 22737984, 'steps': 118426, 'loss/train': 1.2335342168807983} 11/07/2021 13:52:13 - INFO - __main__ - Step 118428: {'lr': 5.407553691595393e-05, 'samples': 22738176, 'steps': 118427, 'loss/train': 1.2525651454925537} 11/07/2021 13:52:13 - INFO - __main__ - Step 118429: {'lr': 5.407224071704939e-05, 'samples': 22738368, 'steps': 118428, 'loss/train': 2.394505739212036} 11/07/2021 13:52:14 - INFO - __main__ - Step 118430: {'lr': 5.406894460642686e-05, 'samples': 22738560, 'steps': 118429, 'loss/train': 1.0829614400863647} 11/07/2021 13:52:14 - INFO - __main__ - Step 118431: {'lr': 5.4065648584087856e-05, 'samples': 22738752, 'steps': 118430, 'loss/train': 1.3014827966690063} 11/07/2021 13:52:14 - INFO - __main__ - Step 118432: {'lr': 5.406235265003384e-05, 'samples': 22738944, 'steps': 118431, 'loss/train': 1.3220672607421875} 11/07/2021 13:52:15 - INFO - __main__ - Step 118433: {'lr': 5.405905680426631e-05, 'samples': 22739136, 'steps': 118432, 'loss/train': 1.5275861024856567} 11/07/2021 13:52:16 - INFO - __main__ - Step 118434: {'lr': 5.405576104678675e-05, 'samples': 22739328, 'steps': 118433, 'loss/train': 1.3401612043380737} 11/07/2021 13:52:16 - INFO - __main__ - Step 118435: {'lr': 5.4052465377596645e-05, 'samples': 22739520, 'steps': 118434, 'loss/train': 0.9200600385665894} 11/07/2021 13:52:17 - INFO - __main__ - Step 118436: {'lr': 5.4049169796697465e-05, 'samples': 22739712, 'steps': 118435, 'loss/train': 1.963563323020935} 11/07/2021 13:52:17 - INFO - __main__ - Step 118437: {'lr': 5.404587430409069e-05, 'samples': 22739904, 'steps': 118436, 'loss/train': 1.3981252908706665} 11/07/2021 13:52:17 - INFO - __main__ - Step 118438: {'lr': 5.404257889977785e-05, 'samples': 22740096, 'steps': 118437, 'loss/train': 1.377065896987915} 11/07/2021 13:52:18 - INFO - __main__ - Step 118439: {'lr': 5.40392835837604e-05, 'samples': 22740288, 'steps': 118438, 'loss/train': 1.2363300323486328} 11/07/2021 13:52:19 - INFO - __main__ - Step 118440: {'lr': 5.4035988356039874e-05, 'samples': 22740480, 'steps': 118439, 'loss/train': 1.237924575805664} 11/07/2021 13:52:19 - INFO - __main__ - Step 118441: {'lr': 5.4032693216617634e-05, 'samples': 22740672, 'steps': 118440, 'loss/train': 1.4401017427444458} 11/07/2021 13:52:19 - INFO - __main__ - Step 118442: {'lr': 5.4029398165495265e-05, 'samples': 22740864, 'steps': 118441, 'loss/train': 1.2479287385940552} 11/07/2021 13:52:20 - INFO - __main__ - Step 118443: {'lr': 5.4026103202674204e-05, 'samples': 22741056, 'steps': 118442, 'loss/train': 1.4544119834899902} 11/07/2021 13:52:21 - INFO - __main__ - Step 118444: {'lr': 5.4022808328155955e-05, 'samples': 22741248, 'steps': 118443, 'loss/train': 1.5507019758224487} 11/07/2021 13:52:21 - INFO - __main__ - Step 118445: {'lr': 5.401951354194201e-05, 'samples': 22741440, 'steps': 118444, 'loss/train': 1.2810395956039429} 11/07/2021 13:52:21 - INFO - __main__ - Step 118446: {'lr': 5.401621884403385e-05, 'samples': 22741632, 'steps': 118445, 'loss/train': 0.6177605986595154} 11/07/2021 13:52:22 - INFO - __main__ - Step 118447: {'lr': 5.401292423443296e-05, 'samples': 22741824, 'steps': 118446, 'loss/train': 1.0801582336425781} 11/07/2021 13:52:22 - INFO - __main__ - Step 118448: {'lr': 5.400962971314083e-05, 'samples': 22742016, 'steps': 118447, 'loss/train': 1.1659631729125977} 11/07/2021 13:52:23 - INFO - __main__ - Step 118449: {'lr': 5.400633528015891e-05, 'samples': 22742208, 'steps': 118448, 'loss/train': 0.9989265203475952} 11/07/2021 13:52:23 - INFO - __main__ - Step 118450: {'lr': 5.400304093548875e-05, 'samples': 22742400, 'steps': 118449, 'loss/train': 1.0886212587356567} 11/07/2021 13:52:24 - INFO - __main__ - Step 118451: {'lr': 5.399974667913177e-05, 'samples': 22742592, 'steps': 118450, 'loss/train': 1.0030275583267212} 11/07/2021 13:52:24 - INFO - __main__ - Step 118452: {'lr': 5.3996452511089474e-05, 'samples': 22742784, 'steps': 118451, 'loss/train': 1.725546956062317} 11/07/2021 13:52:24 - INFO - __main__ - Step 118453: {'lr': 5.399315843136343e-05, 'samples': 22742976, 'steps': 118452, 'loss/train': 1.6896960735321045} 11/07/2021 13:52:25 - INFO - __main__ - Step 118454: {'lr': 5.398986443995496e-05, 'samples': 22743168, 'steps': 118453, 'loss/train': 1.028084635734558} 11/07/2021 13:52:26 - INFO - __main__ - Step 118455: {'lr': 5.398657053686565e-05, 'samples': 22743360, 'steps': 118454, 'loss/train': 1.2777481079101562} 11/07/2021 13:52:26 - INFO - __main__ - Step 118456: {'lr': 5.3983276722096966e-05, 'samples': 22743552, 'steps': 118455, 'loss/train': 1.5739233493804932} 11/07/2021 13:52:27 - INFO - __main__ - Step 118457: {'lr': 5.3979982995650406e-05, 'samples': 22743744, 'steps': 118456, 'loss/train': 1.6154807806015015} 11/07/2021 13:52:27 - INFO - __main__ - Step 118458: {'lr': 5.397668935752742e-05, 'samples': 22743936, 'steps': 118457, 'loss/train': 1.0265583992004395} 11/07/2021 13:52:27 - INFO - __main__ - Step 118459: {'lr': 5.39733958077295e-05, 'samples': 22744128, 'steps': 118458, 'loss/train': 1.1461148262023926} 11/07/2021 13:52:28 - INFO - __main__ - Step 118460: {'lr': 5.3970102346258184e-05, 'samples': 22744320, 'steps': 118459, 'loss/train': 1.6413129568099976} 11/07/2021 13:52:29 - INFO - __main__ - Step 118461: {'lr': 5.3966808973114874e-05, 'samples': 22744512, 'steps': 118460, 'loss/train': 1.2759690284729004} 11/07/2021 13:52:29 - INFO - __main__ - Step 118462: {'lr': 5.396351568830113e-05, 'samples': 22744704, 'steps': 118461, 'loss/train': 1.7246898412704468} 11/07/2021 13:52:29 - INFO - __main__ - Step 118463: {'lr': 5.396022249181837e-05, 'samples': 22744896, 'steps': 118462, 'loss/train': 0.7640182971954346} 11/07/2021 13:52:30 - INFO - __main__ - Step 118464: {'lr': 5.395692938366814e-05, 'samples': 22745088, 'steps': 118463, 'loss/train': 1.6652897596359253} 11/07/2021 13:52:31 - INFO - __main__ - Step 118465: {'lr': 5.395363636385187e-05, 'samples': 22745280, 'steps': 118464, 'loss/train': 0.9540293216705322} 11/07/2021 13:52:31 - INFO - __main__ - Step 118466: {'lr': 5.3950343432371066e-05, 'samples': 22745472, 'steps': 118465, 'loss/train': 1.130691647529602} 11/07/2021 13:52:31 - INFO - __main__ - Step 118467: {'lr': 5.39470505892273e-05, 'samples': 22745664, 'steps': 118466, 'loss/train': 1.245428442955017} 11/07/2021 13:52:32 - INFO - __main__ - Step 118468: {'lr': 5.394375783442187e-05, 'samples': 22745856, 'steps': 118467, 'loss/train': 0.9720343351364136} 11/07/2021 13:52:32 - INFO - __main__ - Step 118469: {'lr': 5.394046516795637e-05, 'samples': 22746048, 'steps': 118468, 'loss/train': 1.1390990018844604} 11/07/2021 13:52:33 - INFO - __main__ - Step 118470: {'lr': 5.393717258983227e-05, 'samples': 22746240, 'steps': 118469, 'loss/train': 1.4675174951553345} 11/07/2021 13:52:34 - INFO - __main__ - Step 118471: {'lr': 5.393388010005107e-05, 'samples': 22746432, 'steps': 118470, 'loss/train': 1.4329845905303955} 11/07/2021 13:52:34 - INFO - __main__ - Step 118472: {'lr': 5.393058769861423e-05, 'samples': 22746624, 'steps': 118471, 'loss/train': 0.38495275378227234} 11/07/2021 13:52:34 - INFO - __main__ - Step 118473: {'lr': 5.392729538552324e-05, 'samples': 22746816, 'steps': 118472, 'loss/train': 1.6219604015350342} 11/07/2021 13:52:35 - INFO - __main__ - Step 118474: {'lr': 5.392400316077958e-05, 'samples': 22747008, 'steps': 118473, 'loss/train': 1.674100399017334} 11/07/2021 13:52:36 - INFO - __main__ - Step 118475: {'lr': 5.392071102438473e-05, 'samples': 22747200, 'steps': 118474, 'loss/train': 1.0753562450408936} 11/07/2021 13:52:36 - INFO - __main__ - Step 118476: {'lr': 5.39174189763402e-05, 'samples': 22747392, 'steps': 118475, 'loss/train': 0.11589603871107101} 11/07/2021 13:52:37 - INFO - __main__ - Step 118477: {'lr': 5.391412701664744e-05, 'samples': 22747584, 'steps': 118476, 'loss/train': 1.1947563886642456} 11/07/2021 13:52:37 - INFO - __main__ - Step 118478: {'lr': 5.3910835145308036e-05, 'samples': 22747776, 'steps': 118477, 'loss/train': 1.6778738498687744} 11/07/2021 13:52:37 - INFO - __main__ - Step 118479: {'lr': 5.390754336232331e-05, 'samples': 22747968, 'steps': 118478, 'loss/train': 1.141769289970398} 11/07/2021 13:52:38 - INFO - __main__ - Step 118480: {'lr': 5.3904251667694806e-05, 'samples': 22748160, 'steps': 118479, 'loss/train': 0.7412251830101013} 11/07/2021 13:52:39 - INFO - __main__ - Step 118481: {'lr': 5.390096006142403e-05, 'samples': 22748352, 'steps': 118480, 'loss/train': 0.6409322023391724} 11/07/2021 13:52:39 - INFO - __main__ - Step 118482: {'lr': 5.389766854351247e-05, 'samples': 22748544, 'steps': 118481, 'loss/train': 0.39856812357902527} 11/07/2021 13:52:39 - INFO - __main__ - Step 118483: {'lr': 5.3894377113961555e-05, 'samples': 22748736, 'steps': 118482, 'loss/train': 1.437424898147583} 11/07/2021 13:52:40 - INFO - __main__ - Step 118484: {'lr': 5.389108577277285e-05, 'samples': 22748928, 'steps': 118483, 'loss/train': 1.7969591617584229} 11/07/2021 13:52:41 - INFO - __main__ - Step 118485: {'lr': 5.388779451994777e-05, 'samples': 22749120, 'steps': 118484, 'loss/train': 1.3996859788894653} 11/07/2021 13:52:41 - INFO - __main__ - Step 118486: {'lr': 5.388450335548783e-05, 'samples': 22749312, 'steps': 118485, 'loss/train': 1.809501051902771} 11/07/2021 13:52:41 - INFO - __main__ - Step 118487: {'lr': 5.3881212279394524e-05, 'samples': 22749504, 'steps': 118486, 'loss/train': 1.1860910654067993} 11/07/2021 13:52:42 - INFO - __main__ - Step 118488: {'lr': 5.3877921291669296e-05, 'samples': 22749696, 'steps': 118487, 'loss/train': 1.427868127822876} 11/07/2021 13:52:42 - INFO - __main__ - Step 118489: {'lr': 5.3874630392313714e-05, 'samples': 22749888, 'steps': 118488, 'loss/train': 1.7930800914764404} 11/07/2021 13:52:42 - INFO - __main__ - Step 118490: {'lr': 5.387133958132914e-05, 'samples': 22750080, 'steps': 118489, 'loss/train': 1.0299136638641357} 11/07/2021 13:52:44 - INFO - __main__ - Step 118491: {'lr': 5.38680488587171e-05, 'samples': 22750272, 'steps': 118490, 'loss/train': 1.1938939094543457} 11/07/2021 13:52:45 - INFO - __main__ - Step 118492: {'lr': 5.386475822447912e-05, 'samples': 22750464, 'steps': 118491, 'loss/train': 1.0411564111709595} 11/07/2021 13:52:45 - INFO - __main__ - Step 118493: {'lr': 5.386146767861663e-05, 'samples': 22750656, 'steps': 118492, 'loss/train': 1.5596450567245483} 11/07/2021 13:52:45 - INFO - __main__ - Step 118494: {'lr': 5.385817722113115e-05, 'samples': 22750848, 'steps': 118493, 'loss/train': 1.491284728050232} 11/07/2021 13:52:46 - INFO - __main__ - Step 118495: {'lr': 5.385488685202414e-05, 'samples': 22751040, 'steps': 118494, 'loss/train': 1.9495739936828613} 11/07/2021 13:52:47 - INFO - __main__ - Step 118496: {'lr': 5.3851596571297065e-05, 'samples': 22751232, 'steps': 118495, 'loss/train': 0.9194532632827759} 11/07/2021 13:52:47 - INFO - __main__ - Step 118497: {'lr': 5.384830637895147e-05, 'samples': 22751424, 'steps': 118496, 'loss/train': 1.3480478525161743} 11/07/2021 13:52:47 - INFO - __main__ - Step 118498: {'lr': 5.384501627498881e-05, 'samples': 22751616, 'steps': 118497, 'loss/train': 1.222328782081604} 11/07/2021 13:52:48 - INFO - __main__ - Step 118499: {'lr': 5.384172625941053e-05, 'samples': 22751808, 'steps': 118498, 'loss/train': 1.4134197235107422} 11/07/2021 13:52:48 - INFO - __main__ - Step 118500: {'lr': 5.3838436332218215e-05, 'samples': 22752000, 'steps': 118499, 'loss/train': 1.1254819631576538} 11/07/2021 13:52:49 - INFO - __main__ - Step 118501: {'lr': 5.38351464934132e-05, 'samples': 22752192, 'steps': 118500, 'loss/train': 1.872963309288025} 11/07/2021 13:52:49 - INFO - __main__ - Step 118502: {'lr': 5.383185674299706e-05, 'samples': 22752384, 'steps': 118501, 'loss/train': 1.1847981214523315} 11/07/2021 13:52:50 - INFO - __main__ - Step 118503: {'lr': 5.382856708097125e-05, 'samples': 22752576, 'steps': 118502, 'loss/train': 0.6842215061187744} 11/07/2021 13:52:50 - INFO - __main__ - Step 118504: {'lr': 5.382527750733726e-05, 'samples': 22752768, 'steps': 118503, 'loss/train': 1.0084820985794067} 11/07/2021 13:52:51 - INFO - __main__ - Step 118505: {'lr': 5.382198802209659e-05, 'samples': 22752960, 'steps': 118504, 'loss/train': 0.7897499203681946} 11/07/2021 13:52:52 - INFO - __main__ - Step 118506: {'lr': 5.381869862525069e-05, 'samples': 22753152, 'steps': 118505, 'loss/train': 1.261861801147461} 11/07/2021 13:52:52 - INFO - __main__ - Step 118507: {'lr': 5.381540931680104e-05, 'samples': 22753344, 'steps': 118506, 'loss/train': 1.4398114681243896} 11/07/2021 13:52:52 - INFO - __main__ - Step 118508: {'lr': 5.3812120096749154e-05, 'samples': 22753536, 'steps': 118507, 'loss/train': 1.222664475440979} 11/07/2021 13:52:53 - INFO - __main__ - Step 118509: {'lr': 5.380883096509651e-05, 'samples': 22753728, 'steps': 118508, 'loss/train': 1.5318338871002197} 11/07/2021 13:52:53 - INFO - __main__ - Step 118510: {'lr': 5.380554192184456e-05, 'samples': 22753920, 'steps': 118509, 'loss/train': 1.3586945533752441} 11/07/2021 13:52:53 - INFO - __main__ - Step 118511: {'lr': 5.38022529669949e-05, 'samples': 22754112, 'steps': 118510, 'loss/train': 1.4659795761108398} 11/07/2021 13:52:54 - INFO - __main__ - Step 118512: {'lr': 5.379896410054883e-05, 'samples': 22754304, 'steps': 118511, 'loss/train': 1.1044607162475586} 11/07/2021 13:52:55 - INFO - __main__ - Step 118513: {'lr': 5.37956753225079e-05, 'samples': 22754496, 'steps': 118512, 'loss/train': 1.320846438407898} 11/07/2021 13:52:55 - INFO - __main__ - Step 118514: {'lr': 5.379238663287364e-05, 'samples': 22754688, 'steps': 118513, 'loss/train': 1.4599957466125488} 11/07/2021 13:52:56 - INFO - __main__ - Step 118515: {'lr': 5.37890980316475e-05, 'samples': 22754880, 'steps': 118514, 'loss/train': 1.321228265762329} 11/07/2021 13:52:56 - INFO - __main__ - Step 118516: {'lr': 5.378580951883097e-05, 'samples': 22755072, 'steps': 118515, 'loss/train': 1.3363597393035889} 11/07/2021 13:52:58 - INFO - __main__ - Step 118517: {'lr': 5.37825210944255e-05, 'samples': 22755264, 'steps': 118516, 'loss/train': 1.1223573684692383} 11/07/2021 13:52:58 - INFO - __main__ - Step 118518: {'lr': 5.3779232758432606e-05, 'samples': 22755456, 'steps': 118517, 'loss/train': 1.1923537254333496} 11/07/2021 13:52:59 - INFO - __main__ - Step 118519: {'lr': 5.377594451085377e-05, 'samples': 22755648, 'steps': 118518, 'loss/train': 0.9249511361122131} 11/07/2021 13:52:59 - INFO - __main__ - Step 118520: {'lr': 5.3772656351690486e-05, 'samples': 22755840, 'steps': 118519, 'loss/train': 1.3885462284088135} 11/07/2021 13:52:59 - INFO - __main__ - Step 118521: {'lr': 5.37693682809442e-05, 'samples': 22756032, 'steps': 118520, 'loss/train': 1.532001256942749} 11/07/2021 13:53:00 - INFO - __main__ - Step 118522: {'lr': 5.3766080298616457e-05, 'samples': 22756224, 'steps': 118521, 'loss/train': 1.4734444618225098} 11/07/2021 13:53:00 - INFO - __main__ - Step 118523: {'lr': 5.376279240470863e-05, 'samples': 22756416, 'steps': 118522, 'loss/train': 1.3543524742126465} 11/07/2021 13:53:00 - INFO - __main__ - Step 118524: {'lr': 5.375950459922227e-05, 'samples': 22756608, 'steps': 118523, 'loss/train': 1.5525118112564087} 11/07/2021 13:53:01 - INFO - __main__ - Step 118525: {'lr': 5.3756216882158844e-05, 'samples': 22756800, 'steps': 118524, 'loss/train': 1.6253228187561035} 11/07/2021 13:53:02 - INFO - __main__ - Step 118526: {'lr': 5.375292925351985e-05, 'samples': 22756992, 'steps': 118525, 'loss/train': 1.7484185695648193} 11/07/2021 13:53:02 - INFO - __main__ - Step 118527: {'lr': 5.3749641713306764e-05, 'samples': 22757184, 'steps': 118526, 'loss/train': 1.58125901222229} 11/07/2021 13:53:02 - INFO - __main__ - Step 118528: {'lr': 5.374635426152102e-05, 'samples': 22757376, 'steps': 118527, 'loss/train': 1.438641905784607} 11/07/2021 13:53:03 - INFO - __main__ - Step 118529: {'lr': 5.374306689816419e-05, 'samples': 22757568, 'steps': 118528, 'loss/train': 1.607872724533081} 11/07/2021 13:53:04 - INFO - __main__ - Step 118530: {'lr': 5.3739779623237674e-05, 'samples': 22757760, 'steps': 118529, 'loss/train': 1.1280145645141602} 11/07/2021 13:53:04 - INFO - __main__ - Step 118531: {'lr': 5.3736492436743e-05, 'samples': 22757952, 'steps': 118530, 'loss/train': 1.3325995206832886} 11/07/2021 13:53:04 - INFO - __main__ - Step 118532: {'lr': 5.373320533868162e-05, 'samples': 22758144, 'steps': 118531, 'loss/train': 1.0964521169662476} 11/07/2021 13:53:05 - INFO - __main__ - Step 118533: {'lr': 5.3729918329055025e-05, 'samples': 22758336, 'steps': 118532, 'loss/train': 1.3665523529052734} 11/07/2021 13:53:05 - INFO - __main__ - Step 118534: {'lr': 5.3726631407864795e-05, 'samples': 22758528, 'steps': 118533, 'loss/train': 1.008639931678772} 11/07/2021 13:53:06 - INFO - __main__ - Step 118535: {'lr': 5.372334457511222e-05, 'samples': 22758720, 'steps': 118534, 'loss/train': 1.3759164810180664} 11/07/2021 13:53:07 - INFO - __main__ - Step 118536: {'lr': 5.372005783079889e-05, 'samples': 22758912, 'steps': 118535, 'loss/train': 1.8697972297668457} 11/07/2021 13:53:07 - INFO - __main__ - Step 118537: {'lr': 5.3716771174926264e-05, 'samples': 22759104, 'steps': 118536, 'loss/train': 1.2369627952575684} 11/07/2021 13:53:07 - INFO - __main__ - Step 118538: {'lr': 5.371348460749584e-05, 'samples': 22759296, 'steps': 118537, 'loss/train': 0.3155345320701599} 11/07/2021 13:53:08 - INFO - __main__ - Step 118539: {'lr': 5.371019812850911e-05, 'samples': 22759488, 'steps': 118538, 'loss/train': 1.318402647972107} 11/07/2021 13:53:09 - INFO - __main__ - Step 118540: {'lr': 5.370691173796752e-05, 'samples': 22759680, 'steps': 118539, 'loss/train': 1.1288585662841797} 11/07/2021 13:53:09 - INFO - __main__ - Step 118541: {'lr': 5.3703625435872564e-05, 'samples': 22759872, 'steps': 118540, 'loss/train': 1.4030883312225342} 11/07/2021 13:53:09 - INFO - __main__ - Step 118542: {'lr': 5.3700339222225726e-05, 'samples': 22760064, 'steps': 118541, 'loss/train': 1.3722715377807617} 11/07/2021 13:53:10 - INFO - __main__ - Step 118543: {'lr': 5.369705309702847e-05, 'samples': 22760256, 'steps': 118542, 'loss/train': 1.210091233253479} 11/07/2021 13:53:10 - INFO - __main__ - Step 118544: {'lr': 5.369376706028231e-05, 'samples': 22760448, 'steps': 118543, 'loss/train': 0.5030874609947205} 11/07/2021 13:53:11 - INFO - __main__ - Step 118545: {'lr': 5.369048111198871e-05, 'samples': 22760640, 'steps': 118544, 'loss/train': 0.7749139666557312} 11/07/2021 13:53:12 - INFO - __main__ - Step 118546: {'lr': 5.3687195252149155e-05, 'samples': 22760832, 'steps': 118545, 'loss/train': 1.2964657545089722} 11/07/2021 13:53:12 - INFO - __main__ - Step 118547: {'lr': 5.368390948076518e-05, 'samples': 22761024, 'steps': 118546, 'loss/train': 1.1712610721588135} 11/07/2021 13:53:12 - INFO - __main__ - Step 118548: {'lr': 5.368062379783814e-05, 'samples': 22761216, 'steps': 118547, 'loss/train': 1.5875275135040283} 11/07/2021 13:53:13 - INFO - __main__ - Step 118549: {'lr': 5.367733820336956e-05, 'samples': 22761408, 'steps': 118548, 'loss/train': 1.5339281558990479} 11/07/2021 13:53:13 - INFO - __main__ - Step 118550: {'lr': 5.3674052697360976e-05, 'samples': 22761600, 'steps': 118549, 'loss/train': 1.567367434501648} 11/07/2021 13:53:14 - INFO - __main__ - Step 118551: {'lr': 5.367076727981382e-05, 'samples': 22761792, 'steps': 118550, 'loss/train': 1.2842235565185547} 11/07/2021 13:53:14 - INFO - __main__ - Step 118552: {'lr': 5.3667481950729596e-05, 'samples': 22761984, 'steps': 118551, 'loss/train': 1.2042499780654907} 11/07/2021 13:53:15 - INFO - __main__ - Step 118553: {'lr': 5.366419671010975e-05, 'samples': 22762176, 'steps': 118552, 'loss/train': 1.2395806312561035} 11/07/2021 13:53:15 - INFO - __main__ - Step 118554: {'lr': 5.36609115579558e-05, 'samples': 22762368, 'steps': 118553, 'loss/train': 1.4174529314041138} 11/07/2021 13:53:15 - INFO - __main__ - Step 118555: {'lr': 5.365762649426922e-05, 'samples': 22762560, 'steps': 118554, 'loss/train': 1.242850422859192} 11/07/2021 13:53:16 - INFO - __main__ - Step 118556: {'lr': 5.3654341519051495e-05, 'samples': 22762752, 'steps': 118555, 'loss/train': 1.293221116065979} 11/07/2021 13:53:17 - INFO - __main__ - Step 118557: {'lr': 5.3651056632304076e-05, 'samples': 22762944, 'steps': 118556, 'loss/train': 0.9464021325111389} 11/07/2021 13:53:17 - INFO - __main__ - Step 118558: {'lr': 5.364777183402847e-05, 'samples': 22763136, 'steps': 118557, 'loss/train': 1.6602510213851929} 11/07/2021 13:53:17 - INFO - __main__ - Step 118559: {'lr': 5.3644487124226146e-05, 'samples': 22763328, 'steps': 118558, 'loss/train': 1.425696611404419} 11/07/2021 13:53:18 - INFO - __main__ - Step 118560: {'lr': 5.364120250289858e-05, 'samples': 22763520, 'steps': 118559, 'loss/train': 1.4494099617004395} 11/07/2021 13:53:19 - INFO - __main__ - Step 118561: {'lr': 5.363791797004733e-05, 'samples': 22763712, 'steps': 118560, 'loss/train': 1.4041309356689453} 11/07/2021 13:53:19 - INFO - __main__ - Step 118562: {'lr': 5.363463352567374e-05, 'samples': 22763904, 'steps': 118561, 'loss/train': 0.9897298812866211} 11/07/2021 13:53:20 - INFO - __main__ - Step 118563: {'lr': 5.363134916977933e-05, 'samples': 22764096, 'steps': 118562, 'loss/train': 1.505433440208435} 11/07/2021 13:53:20 - INFO - __main__ - Step 118564: {'lr': 5.362806490236563e-05, 'samples': 22764288, 'steps': 118563, 'loss/train': 0.8191065788269043} 11/07/2021 13:53:20 - INFO - __main__ - Step 118565: {'lr': 5.3624780723434103e-05, 'samples': 22764480, 'steps': 118564, 'loss/train': 1.257466435432434} 11/07/2021 13:53:21 - INFO - __main__ - Step 118566: {'lr': 5.362149663298618e-05, 'samples': 22764672, 'steps': 118565, 'loss/train': 1.64839506149292} 11/07/2021 13:53:22 - INFO - __main__ - Step 118567: {'lr': 5.3618212631023426e-05, 'samples': 22764864, 'steps': 118566, 'loss/train': 0.8492699861526489} 11/07/2021 13:53:22 - INFO - __main__ - Step 118568: {'lr': 5.361492871754725e-05, 'samples': 22765056, 'steps': 118567, 'loss/train': 1.1270123720169067} 11/07/2021 13:53:22 - INFO - __main__ - Step 118569: {'lr': 5.361164489255915e-05, 'samples': 22765248, 'steps': 118568, 'loss/train': 1.1890162229537964} 11/07/2021 13:53:23 - INFO - __main__ - Step 118570: {'lr': 5.360836115606063e-05, 'samples': 22765440, 'steps': 118569, 'loss/train': 0.2216787338256836} 11/07/2021 13:53:24 - INFO - __main__ - Step 118571: {'lr': 5.360507750805313e-05, 'samples': 22765632, 'steps': 118570, 'loss/train': 1.0873830318450928} 11/07/2021 13:53:24 - INFO - __main__ - Step 118572: {'lr': 5.360179394853818e-05, 'samples': 22765824, 'steps': 118571, 'loss/train': 1.3744940757751465} 11/07/2021 13:53:25 - INFO - __main__ - Step 118573: {'lr': 5.359851047751721e-05, 'samples': 22766016, 'steps': 118572, 'loss/train': 1.436720848083496} 11/07/2021 13:53:25 - INFO - __main__ - Step 118574: {'lr': 5.359522709499179e-05, 'samples': 22766208, 'steps': 118573, 'loss/train': 1.5587849617004395} 11/07/2021 13:53:25 - INFO - __main__ - Step 118575: {'lr': 5.3591943800963274e-05, 'samples': 22766400, 'steps': 118574, 'loss/train': 1.1659296751022339} 11/07/2021 13:53:26 - INFO - __main__ - Step 118576: {'lr': 5.358866059543319e-05, 'samples': 22766592, 'steps': 118575, 'loss/train': 1.2028700113296509} 11/07/2021 13:53:27 - INFO - __main__ - Step 118577: {'lr': 5.358537747840303e-05, 'samples': 22766784, 'steps': 118576, 'loss/train': 1.5263704061508179} 11/07/2021 13:53:27 - INFO - __main__ - Step 118578: {'lr': 5.358209444987425e-05, 'samples': 22766976, 'steps': 118577, 'loss/train': 0.7268046736717224} 11/07/2021 13:53:27 - INFO - __main__ - Step 118579: {'lr': 5.357881150984836e-05, 'samples': 22767168, 'steps': 118578, 'loss/train': 1.316781997680664} 11/07/2021 13:53:28 - INFO - __main__ - Step 118580: {'lr': 5.3575528658326846e-05, 'samples': 22767360, 'steps': 118579, 'loss/train': 5.625912189483643} 11/07/2021 13:53:29 - INFO - __main__ - Step 118581: {'lr': 5.3572245895311154e-05, 'samples': 22767552, 'steps': 118580, 'loss/train': 1.094407320022583} 11/07/2021 13:53:29 - INFO - __main__ - Step 118582: {'lr': 5.356896322080276e-05, 'samples': 22767744, 'steps': 118581, 'loss/train': 1.5758867263793945} 11/07/2021 13:53:29 - INFO - __main__ - Step 118583: {'lr': 5.356568063480316e-05, 'samples': 22767936, 'steps': 118582, 'loss/train': 1.7815054655075073} 11/07/2021 13:53:30 - INFO - __main__ - Step 118584: {'lr': 5.356239813731387e-05, 'samples': 22768128, 'steps': 118583, 'loss/train': 1.4225329160690308} 11/07/2021 13:53:30 - INFO - __main__ - Step 118585: {'lr': 5.35591157283363e-05, 'samples': 22768320, 'steps': 118584, 'loss/train': 0.9943707585334778} 11/07/2021 13:53:31 - INFO - __main__ - Step 118586: {'lr': 5.3555833407871955e-05, 'samples': 22768512, 'steps': 118585, 'loss/train': 0.9942431449890137} 11/07/2021 13:53:32 - INFO - __main__ - Step 118587: {'lr': 5.355255117592234e-05, 'samples': 22768704, 'steps': 118586, 'loss/train': 1.1145143508911133} 11/07/2021 13:53:32 - INFO - __main__ - Step 118588: {'lr': 5.3549269032488966e-05, 'samples': 22768896, 'steps': 118587, 'loss/train': 1.2903114557266235} 11/07/2021 13:53:32 - INFO - __main__ - Step 118589: {'lr': 5.354598697757321e-05, 'samples': 22769088, 'steps': 118588, 'loss/train': 1.0839191675186157} 11/07/2021 13:53:33 - INFO - __main__ - Step 118590: {'lr': 5.3542705011176585e-05, 'samples': 22769280, 'steps': 118589, 'loss/train': 2.2100906372070312} 11/07/2021 13:53:33 - INFO - __main__ - Step 118591: {'lr': 5.35394231333006e-05, 'samples': 22769472, 'steps': 118590, 'loss/train': 1.0921612977981567} 11/07/2021 13:53:34 - INFO - __main__ - Step 118592: {'lr': 5.3536141343946716e-05, 'samples': 22769664, 'steps': 118591, 'loss/train': 1.275788426399231} 11/07/2021 13:53:35 - INFO - __main__ - Step 118593: {'lr': 5.353285964311641e-05, 'samples': 22769856, 'steps': 118592, 'loss/train': 1.5956507921218872} 11/07/2021 13:53:35 - INFO - __main__ - Step 118594: {'lr': 5.3529578030811186e-05, 'samples': 22770048, 'steps': 118593, 'loss/train': 1.2250956296920776} 11/07/2021 13:53:35 - INFO - __main__ - Step 118595: {'lr': 5.352629650703247e-05, 'samples': 22770240, 'steps': 118594, 'loss/train': 0.9097330570220947} 11/07/2021 13:53:36 - INFO - __main__ - Step 118596: {'lr': 5.3523015071781783e-05, 'samples': 22770432, 'steps': 118595, 'loss/train': 1.44463312625885} 11/07/2021 13:53:37 - INFO - __main__ - Step 118597: {'lr': 5.351973372506061e-05, 'samples': 22770624, 'steps': 118596, 'loss/train': 1.0964349508285522} 11/07/2021 13:53:37 - INFO - __main__ - Step 118598: {'lr': 5.3516452466870395e-05, 'samples': 22770816, 'steps': 118597, 'loss/train': 1.386092185974121} 11/07/2021 13:53:37 - INFO - __main__ - Step 118599: {'lr': 5.351317129721264e-05, 'samples': 22771008, 'steps': 118598, 'loss/train': 1.3241627216339111} 11/07/2021 13:53:38 - INFO - __main__ - Step 118600: {'lr': 5.350989021608882e-05, 'samples': 22771200, 'steps': 118599, 'loss/train': 1.309912919998169} 11/07/2021 13:53:38 - INFO - __main__ - Step 118601: {'lr': 5.3506609223500477e-05, 'samples': 22771392, 'steps': 118600, 'loss/train': 0.9646210074424744} 11/07/2021 13:53:39 - INFO - __main__ - Step 118602: {'lr': 5.350332831944896e-05, 'samples': 22771584, 'steps': 118601, 'loss/train': 0.8752322196960449} 11/07/2021 13:53:39 - INFO - __main__ - Step 118603: {'lr': 5.350004750393581e-05, 'samples': 22771776, 'steps': 118602, 'loss/train': 0.9572304487228394} 11/07/2021 13:53:40 - INFO - __main__ - Step 118604: {'lr': 5.34967667769625e-05, 'samples': 22771968, 'steps': 118603, 'loss/train': 2.504141330718994} 11/07/2021 13:53:40 - INFO - __main__ - Step 118605: {'lr': 5.34934861385305e-05, 'samples': 22772160, 'steps': 118604, 'loss/train': 1.2902511358261108} 11/07/2021 13:53:41 - INFO - __main__ - Step 118606: {'lr': 5.3490205588641344e-05, 'samples': 22772352, 'steps': 118605, 'loss/train': 1.5511131286621094} 11/07/2021 13:53:42 - INFO - __main__ - Step 118607: {'lr': 5.348692512729644e-05, 'samples': 22772544, 'steps': 118606, 'loss/train': 1.418961524963379} 11/07/2021 13:53:42 - INFO - __main__ - Step 118608: {'lr': 5.34836447544973e-05, 'samples': 22772736, 'steps': 118607, 'loss/train': 1.2389740943908691} 11/07/2021 13:53:42 - INFO - __main__ - Step 118609: {'lr': 5.34803644702454e-05, 'samples': 22772928, 'steps': 118608, 'loss/train': 0.9791439175605774} 11/07/2021 13:53:43 - INFO - __main__ - Step 118610: {'lr': 5.34770842745422e-05, 'samples': 22773120, 'steps': 118609, 'loss/train': 1.347902536392212} 11/07/2021 13:53:43 - INFO - __main__ - Step 118611: {'lr': 5.347380416738923e-05, 'samples': 22773312, 'steps': 118610, 'loss/train': 1.7777594327926636} 11/07/2021 13:53:43 - INFO - __main__ - Step 118612: {'lr': 5.347052414878789e-05, 'samples': 22773504, 'steps': 118611, 'loss/train': 0.8749678730964661} 11/07/2021 13:53:44 - INFO - __main__ - Step 118613: {'lr': 5.346724421873972e-05, 'samples': 22773696, 'steps': 118612, 'loss/train': 1.5306506156921387} 11/07/2021 13:53:45 - INFO - __main__ - Step 118614: {'lr': 5.3463964377246184e-05, 'samples': 22773888, 'steps': 118613, 'loss/train': 1.550437092781067} 11/07/2021 13:53:45 - INFO - __main__ - Step 118615: {'lr': 5.346068462430881e-05, 'samples': 22774080, 'steps': 118614, 'loss/train': 1.1826519966125488} 11/07/2021 13:53:46 - INFO - __main__ - Step 118616: {'lr': 5.345740495992896e-05, 'samples': 22774272, 'steps': 118615, 'loss/train': 0.7870422601699829} 11/07/2021 13:53:46 - INFO - __main__ - Step 118617: {'lr': 5.345412538410815e-05, 'samples': 22774464, 'steps': 118616, 'loss/train': 0.7388402223587036} 11/07/2021 13:53:47 - INFO - __main__ - Step 118618: {'lr': 5.34508458968479e-05, 'samples': 22774656, 'steps': 118617, 'loss/train': 1.2944014072418213} 11/07/2021 13:53:47 - INFO - __main__ - Step 118619: {'lr': 5.3447566498149665e-05, 'samples': 22774848, 'steps': 118618, 'loss/train': 1.329811692237854} 11/07/2021 13:53:48 - INFO - __main__ - Step 118620: {'lr': 5.344428718801489e-05, 'samples': 22775040, 'steps': 118619, 'loss/train': 1.024898648262024} 11/07/2021 13:53:48 - INFO - __main__ - Step 118621: {'lr': 5.344100796644513e-05, 'samples': 22775232, 'steps': 118620, 'loss/train': 1.0814591646194458} 11/07/2021 13:53:48 - INFO - __main__ - Step 118622: {'lr': 5.3437728833441804e-05, 'samples': 22775424, 'steps': 118621, 'loss/train': 1.1913487911224365} 11/07/2021 13:53:49 - INFO - __main__ - Step 118623: {'lr': 5.3434449789006385e-05, 'samples': 22775616, 'steps': 118622, 'loss/train': 1.1824065446853638} 11/07/2021 13:53:50 - INFO - __main__ - Step 118624: {'lr': 5.343117083314039e-05, 'samples': 22775808, 'steps': 118623, 'loss/train': 1.5258766412734985} 11/07/2021 13:53:50 - INFO - __main__ - Step 118625: {'lr': 5.342789196584527e-05, 'samples': 22776000, 'steps': 118624, 'loss/train': 1.1811490058898926} 11/07/2021 13:53:50 - INFO - __main__ - Step 118626: {'lr': 5.342461318712252e-05, 'samples': 22776192, 'steps': 118625, 'loss/train': 1.3103313446044922} 11/07/2021 13:53:51 - INFO - __main__ - Step 118627: {'lr': 5.342133449697359e-05, 'samples': 22776384, 'steps': 118626, 'loss/train': 2.1626665592193604} 11/07/2021 13:53:52 - INFO - __main__ - Step 118628: {'lr': 5.341805589540005e-05, 'samples': 22776576, 'steps': 118627, 'loss/train': 1.2413969039916992} 11/07/2021 13:53:52 - INFO - __main__ - Step 118629: {'lr': 5.3414777382403215e-05, 'samples': 22776768, 'steps': 118628, 'loss/train': 1.6043881177902222} 11/07/2021 13:53:52 - INFO - __main__ - Step 118630: {'lr': 5.3411498957984664e-05, 'samples': 22776960, 'steps': 118629, 'loss/train': 1.4373539686203003} 11/07/2021 13:53:53 - INFO - __main__ - Step 118631: {'lr': 5.340822062214587e-05, 'samples': 22777152, 'steps': 118630, 'loss/train': 1.0780096054077148} 11/07/2021 13:53:53 - INFO - __main__ - Step 118632: {'lr': 5.3404942374888274e-05, 'samples': 22777344, 'steps': 118631, 'loss/train': 1.1920068264007568} 11/07/2021 13:53:54 - INFO - __main__ - Step 118633: {'lr': 5.3401664216213395e-05, 'samples': 22777536, 'steps': 118632, 'loss/train': 0.9561355710029602} 11/07/2021 13:53:54 - INFO - __main__ - Step 118634: {'lr': 5.339838614612269e-05, 'samples': 22777728, 'steps': 118633, 'loss/train': 1.2046961784362793} 11/07/2021 13:53:55 - INFO - __main__ - Step 118635: {'lr': 5.339510816461762e-05, 'samples': 22777920, 'steps': 118634, 'loss/train': 1.1042312383651733} 11/07/2021 13:53:55 - INFO - __main__ - Step 118636: {'lr': 5.339183027169972e-05, 'samples': 22778112, 'steps': 118635, 'loss/train': 1.4340142011642456} 11/07/2021 13:53:56 - INFO - __main__ - Step 118637: {'lr': 5.33885524673704e-05, 'samples': 22778304, 'steps': 118636, 'loss/train': 1.1824651956558228} 11/07/2021 13:53:57 - INFO - __main__ - Step 118638: {'lr': 5.338527475163116e-05, 'samples': 22778496, 'steps': 118637, 'loss/train': 3.2352945804595947} 11/07/2021 13:53:57 - INFO - __main__ - Step 118639: {'lr': 5.3381997124483496e-05, 'samples': 22778688, 'steps': 118638, 'loss/train': 1.3574790954589844} 11/07/2021 13:53:57 - INFO - __main__ - Step 118640: {'lr': 5.337871958592885e-05, 'samples': 22778880, 'steps': 118639, 'loss/train': 1.311186671257019} 11/07/2021 13:53:58 - INFO - __main__ - Step 118641: {'lr': 5.337544213596873e-05, 'samples': 22779072, 'steps': 118640, 'loss/train': 1.2160797119140625} 11/07/2021 13:53:58 - INFO - __main__ - Step 118642: {'lr': 5.337216477460469e-05, 'samples': 22779264, 'steps': 118641, 'loss/train': 1.4092116355895996} 11/07/2021 13:53:59 - INFO - __main__ - Step 118643: {'lr': 5.336888750183802e-05, 'samples': 22779456, 'steps': 118642, 'loss/train': 1.063119649887085} 11/07/2021 13:53:59 - INFO - __main__ - Step 118644: {'lr': 5.3365610317670285e-05, 'samples': 22779648, 'steps': 118643, 'loss/train': 1.4197216033935547} 11/07/2021 13:54:00 - INFO - __main__ - Step 118645: {'lr': 5.3362333222103016e-05, 'samples': 22779840, 'steps': 118644, 'loss/train': 1.4391223192214966} 11/07/2021 13:54:00 - INFO - __main__ - Step 118646: {'lr': 5.335905621513762e-05, 'samples': 22780032, 'steps': 118645, 'loss/train': 1.6069891452789307} 11/07/2021 13:54:00 - INFO - __main__ - Step 118647: {'lr': 5.3355779296775595e-05, 'samples': 22780224, 'steps': 118646, 'loss/train': 1.2141485214233398} 11/07/2021 13:54:01 - INFO - __main__ - Step 118648: {'lr': 5.335250246701842e-05, 'samples': 22780416, 'steps': 118647, 'loss/train': 1.2837297916412354} 11/07/2021 13:54:02 - INFO - __main__ - Step 118649: {'lr': 5.334922572586759e-05, 'samples': 22780608, 'steps': 118648, 'loss/train': 1.4360618591308594} 11/07/2021 13:54:02 - INFO - __main__ - Step 118650: {'lr': 5.3345949073324547e-05, 'samples': 22780800, 'steps': 118649, 'loss/train': 0.9113653898239136} 11/07/2021 13:54:02 - INFO - __main__ - Step 118651: {'lr': 5.334267250939079e-05, 'samples': 22780992, 'steps': 118650, 'loss/train': 0.9639165997505188} 11/07/2021 13:54:03 - INFO - __main__ - Step 118652: {'lr': 5.333939603406779e-05, 'samples': 22781184, 'steps': 118651, 'loss/train': 1.7503584623336792} 11/07/2021 13:54:03 - INFO - __main__ - Step 118653: {'lr': 5.3336119647357044e-05, 'samples': 22781376, 'steps': 118652, 'loss/train': 1.4268397092819214} 11/07/2021 13:54:05 - INFO - __main__ - Step 118654: {'lr': 5.333284334925997e-05, 'samples': 22781568, 'steps': 118653, 'loss/train': 1.6238186359405518} 11/07/2021 13:54:05 - INFO - __main__ - Step 118655: {'lr': 5.3329567139778185e-05, 'samples': 22781760, 'steps': 118654, 'loss/train': 1.2858000993728638} 11/07/2021 13:54:05 - INFO - __main__ - Step 118656: {'lr': 5.332629101891298e-05, 'samples': 22781952, 'steps': 118655, 'loss/train': 1.3582240343093872} 11/07/2021 13:54:06 - INFO - __main__ - Step 118657: {'lr': 5.332301498666592e-05, 'samples': 22782144, 'steps': 118656, 'loss/train': 1.6441508531570435} 11/07/2021 13:54:06 - INFO - __main__ - Step 118658: {'lr': 5.3319739043038466e-05, 'samples': 22782336, 'steps': 118657, 'loss/train': 1.687744379043579} 11/07/2021 13:54:07 - INFO - __main__ - Step 118659: {'lr': 5.33164631880321e-05, 'samples': 22782528, 'steps': 118658, 'loss/train': 1.4130630493164062} 11/07/2021 13:54:08 - INFO - __main__ - Step 118660: {'lr': 5.331318742164831e-05, 'samples': 22782720, 'steps': 118659, 'loss/train': 2.7818777561187744} 11/07/2021 13:54:08 - INFO - __main__ - Step 118661: {'lr': 5.330991174388855e-05, 'samples': 22782912, 'steps': 118660, 'loss/train': 1.3544511795043945} 11/07/2021 13:54:08 - INFO - __main__ - Step 118662: {'lr': 5.330663615475431e-05, 'samples': 22783104, 'steps': 118661, 'loss/train': 1.2615793943405151} 11/07/2021 13:54:09 - INFO - __main__ - Step 118663: {'lr': 5.330336065424707e-05, 'samples': 22783296, 'steps': 118662, 'loss/train': 1.0600427389144897} 11/07/2021 13:54:09 - INFO - __main__ - Step 118664: {'lr': 5.330008524236832e-05, 'samples': 22783488, 'steps': 118663, 'loss/train': 1.4182077646255493} 11/07/2021 13:54:10 - INFO - __main__ - Step 118665: {'lr': 5.3296809919119475e-05, 'samples': 22783680, 'steps': 118664, 'loss/train': 1.072338581085205} 11/07/2021 13:54:10 - INFO - __main__ - Step 118666: {'lr': 5.329353468450207e-05, 'samples': 22783872, 'steps': 118665, 'loss/train': 1.3921520709991455} 11/07/2021 13:54:11 - INFO - __main__ - Step 118667: {'lr': 5.329025953851757e-05, 'samples': 22784064, 'steps': 118666, 'loss/train': 1.5929021835327148} 11/07/2021 13:54:11 - INFO - __main__ - Step 118668: {'lr': 5.328698448116753e-05, 'samples': 22784256, 'steps': 118667, 'loss/train': 0.0855720117688179} 11/07/2021 13:54:11 - INFO - __main__ - Step 118669: {'lr': 5.328370951245323e-05, 'samples': 22784448, 'steps': 118668, 'loss/train': 1.1419347524642944} 11/07/2021 13:54:13 - INFO - __main__ - Step 118670: {'lr': 5.328043463237628e-05, 'samples': 22784640, 'steps': 118669, 'loss/train': 2.0000996589660645} 11/07/2021 13:54:13 - INFO - __main__ - Step 118671: {'lr': 5.3277159840938145e-05, 'samples': 22784832, 'steps': 118670, 'loss/train': 1.3844765424728394} 11/07/2021 13:54:13 - INFO - __main__ - Step 118672: {'lr': 5.327388513814024e-05, 'samples': 22785024, 'steps': 118671, 'loss/train': 1.3691388368606567} 11/07/2021 13:54:14 - INFO - __main__ - Step 118673: {'lr': 5.3270610523984134e-05, 'samples': 22785216, 'steps': 118672, 'loss/train': 1.2554839849472046} 11/07/2021 13:54:14 - INFO - __main__ - Step 118674: {'lr': 5.326733599847122e-05, 'samples': 22785408, 'steps': 118673, 'loss/train': 0.8863407969474792} 11/07/2021 13:54:15 - INFO - __main__ - Step 118675: {'lr': 5.326406156160304e-05, 'samples': 22785600, 'steps': 118674, 'loss/train': 1.4284521341323853} 11/07/2021 13:54:16 - INFO - __main__ - Step 118676: {'lr': 5.326078721338101e-05, 'samples': 22785792, 'steps': 118675, 'loss/train': 1.583622932434082} 11/07/2021 13:54:16 - INFO - __main__ - Step 118677: {'lr': 5.325751295380665e-05, 'samples': 22785984, 'steps': 118676, 'loss/train': 1.0025142431259155} 11/07/2021 13:54:16 - INFO - __main__ - Step 118678: {'lr': 5.325423878288141e-05, 'samples': 22786176, 'steps': 118677, 'loss/train': 1.322593331336975} 11/07/2021 13:54:17 - INFO - __main__ - Step 118679: {'lr': 5.325096470060678e-05, 'samples': 22786368, 'steps': 118678, 'loss/train': 1.3677489757537842} 11/07/2021 13:54:17 - INFO - __main__ - Step 118680: {'lr': 5.3247690706984236e-05, 'samples': 22786560, 'steps': 118679, 'loss/train': 0.9632869362831116} 11/07/2021 13:54:18 - INFO - __main__ - Step 118681: {'lr': 5.3244416802015225e-05, 'samples': 22786752, 'steps': 118680, 'loss/train': 0.8574251532554626} 11/07/2021 13:54:19 - INFO - __main__ - Step 118682: {'lr': 5.3241142985701317e-05, 'samples': 22786944, 'steps': 118681, 'loss/train': 0.9894857406616211} 11/07/2021 13:54:19 - INFO - __main__ - Step 118683: {'lr': 5.323786925804386e-05, 'samples': 22787136, 'steps': 118682, 'loss/train': 1.2740529775619507} 11/07/2021 13:54:19 - INFO - __main__ - Step 118684: {'lr': 5.3234595619044366e-05, 'samples': 22787328, 'steps': 118683, 'loss/train': 1.6044996976852417} 11/07/2021 13:54:20 - INFO - __main__ - Step 118685: {'lr': 5.3231322068704347e-05, 'samples': 22787520, 'steps': 118684, 'loss/train': 0.7914737462997437} 11/07/2021 13:54:21 - INFO - __main__ - Step 118686: {'lr': 5.3228048607025264e-05, 'samples': 22787712, 'steps': 118685, 'loss/train': 0.9555718898773193} 11/07/2021 13:54:21 - INFO - __main__ - Step 118687: {'lr': 5.322477523400856e-05, 'samples': 22787904, 'steps': 118686, 'loss/train': 1.0615732669830322} 11/07/2021 13:54:21 - INFO - __main__ - Step 118688: {'lr': 5.322150194965575e-05, 'samples': 22788096, 'steps': 118687, 'loss/train': 1.0663928985595703} 11/07/2021 13:54:22 - INFO - __main__ - Step 118689: {'lr': 5.321822875396829e-05, 'samples': 22788288, 'steps': 118688, 'loss/train': 0.6871232986450195} 11/07/2021 13:54:22 - INFO - __main__ - Step 118690: {'lr': 5.3214955646947646e-05, 'samples': 22788480, 'steps': 118689, 'loss/train': 1.2597074508666992} 11/07/2021 13:54:23 - INFO - __main__ - Step 118691: {'lr': 5.321168262859533e-05, 'samples': 22788672, 'steps': 118690, 'loss/train': 0.3693498969078064} 11/07/2021 13:54:24 - INFO - __main__ - Step 118692: {'lr': 5.320840969891277e-05, 'samples': 22788864, 'steps': 118691, 'loss/train': 1.1164389848709106} 11/07/2021 13:54:24 - INFO - __main__ - Step 118693: {'lr': 5.3205136857901486e-05, 'samples': 22789056, 'steps': 118692, 'loss/train': 0.9959789514541626} 11/07/2021 13:54:24 - INFO - __main__ - Step 118694: {'lr': 5.3201864105562936e-05, 'samples': 22789248, 'steps': 118693, 'loss/train': 1.2786660194396973} 11/07/2021 13:54:25 - INFO - __main__ - Step 118695: {'lr': 5.319859144189862e-05, 'samples': 22789440, 'steps': 118694, 'loss/train': 1.5304754972457886} 11/07/2021 13:54:26 - INFO - __main__ - Step 118696: {'lr': 5.3195318866909956e-05, 'samples': 22789632, 'steps': 118695, 'loss/train': 1.325257658958435} 11/07/2021 13:54:26 - INFO - __main__ - Step 118697: {'lr': 5.3192046380598406e-05, 'samples': 22789824, 'steps': 118696, 'loss/train': 2.635140895843506} 11/07/2021 13:54:27 - INFO - __main__ - Step 118698: {'lr': 5.318877398296551e-05, 'samples': 22790016, 'steps': 118697, 'loss/train': 0.8509399890899658} 11/07/2021 13:54:27 - INFO - __main__ - Step 118699: {'lr': 5.31855016740127e-05, 'samples': 22790208, 'steps': 118698, 'loss/train': 1.2918442487716675} 11/07/2021 13:54:27 - INFO - __main__ - Step 118700: {'lr': 5.318222945374149e-05, 'samples': 22790400, 'steps': 118699, 'loss/train': 1.1387313604354858} 11/07/2021 13:54:28 - INFO - __main__ - Step 118701: {'lr': 5.3178957322153304e-05, 'samples': 22790592, 'steps': 118700, 'loss/train': 0.5045097470283508} 11/07/2021 13:54:28 - INFO - __main__ - Step 118702: {'lr': 5.317568527924965e-05, 'samples': 22790784, 'steps': 118701, 'loss/train': 1.2478392124176025} 11/07/2021 13:54:29 - INFO - __main__ - Step 118703: {'lr': 5.3172413325032e-05, 'samples': 22790976, 'steps': 118702, 'loss/train': 1.1457887887954712} 11/07/2021 13:54:29 - INFO - __main__ - Step 118704: {'lr': 5.316914145950183e-05, 'samples': 22791168, 'steps': 118703, 'loss/train': 0.9568139314651489} 11/07/2021 13:54:30 - INFO - __main__ - Step 118705: {'lr': 5.31658696826606e-05, 'samples': 22791360, 'steps': 118704, 'loss/train': 1.3805851936340332} 11/07/2021 13:54:30 - INFO - __main__ - Step 118706: {'lr': 5.316259799450979e-05, 'samples': 22791552, 'steps': 118705, 'loss/train': 0.6376063823699951} 11/07/2021 13:54:30 - INFO - __main__ - Step 118707: {'lr': 5.315932639505086e-05, 'samples': 22791744, 'steps': 118706, 'loss/train': 0.8803451061248779} 11/07/2021 13:54:32 - INFO - __main__ - Step 118708: {'lr': 5.315605488428532e-05, 'samples': 22791936, 'steps': 118707, 'loss/train': 1.383115291595459} 11/07/2021 13:54:32 - INFO - __main__ - Step 118709: {'lr': 5.3152783462214696e-05, 'samples': 22792128, 'steps': 118708, 'loss/train': 1.2516613006591797} 11/07/2021 13:54:32 - INFO - __main__ - Step 118710: {'lr': 5.314951212884031e-05, 'samples': 22792320, 'steps': 118709, 'loss/train': 1.1433411836624146} 11/07/2021 13:54:33 - INFO - __main__ - Step 118711: {'lr': 5.3146240884163726e-05, 'samples': 22792512, 'steps': 118710, 'loss/train': 0.8261168003082275} 11/07/2021 13:54:33 - INFO - __main__ - Step 118712: {'lr': 5.3142969728186416e-05, 'samples': 22792704, 'steps': 118711, 'loss/train': 1.4604130983352661} 11/07/2021 13:54:34 - INFO - __main__ - Step 118713: {'lr': 5.3139698660909814e-05, 'samples': 22792896, 'steps': 118712, 'loss/train': 1.2684849500656128} 11/07/2021 13:54:34 - INFO - __main__ - Step 118714: {'lr': 5.313642768233545e-05, 'samples': 22793088, 'steps': 118713, 'loss/train': 1.1278539896011353} 11/07/2021 13:54:35 - INFO - __main__ - Step 118715: {'lr': 5.3133156792464775e-05, 'samples': 22793280, 'steps': 118714, 'loss/train': 1.2654321193695068} 11/07/2021 13:54:35 - INFO - __main__ - Step 118716: {'lr': 5.312988599129925e-05, 'samples': 22793472, 'steps': 118715, 'loss/train': 1.2976428270339966} 11/07/2021 13:54:35 - INFO - __main__ - Step 118717: {'lr': 5.312661527884038e-05, 'samples': 22793664, 'steps': 118716, 'loss/train': 1.2888720035552979} 11/07/2021 13:54:36 - INFO - __main__ - Step 118718: {'lr': 5.312334465508961e-05, 'samples': 22793856, 'steps': 118717, 'loss/train': 1.413796305656433} 11/07/2021 13:54:37 - INFO - __main__ - Step 118719: {'lr': 5.31200741200484e-05, 'samples': 22794048, 'steps': 118718, 'loss/train': 1.6180908679962158} 11/07/2021 13:54:37 - INFO - __main__ - Step 118720: {'lr': 5.3116803673718264e-05, 'samples': 22794240, 'steps': 118719, 'loss/train': 1.0510200262069702} 11/07/2021 13:54:38 - INFO - __main__ - Step 118721: {'lr': 5.3113533316100664e-05, 'samples': 22794432, 'steps': 118720, 'loss/train': 1.1788212060928345} 11/07/2021 13:54:38 - INFO - __main__ - Step 118722: {'lr': 5.311026304719713e-05, 'samples': 22794624, 'steps': 118721, 'loss/train': 1.4157088994979858} 11/07/2021 13:54:38 - INFO - __main__ - Step 118723: {'lr': 5.3106992867009016e-05, 'samples': 22794816, 'steps': 118722, 'loss/train': 1.383631706237793} 11/07/2021 13:54:40 - INFO - __main__ - Step 118724: {'lr': 5.310372277553785e-05, 'samples': 22795008, 'steps': 118723, 'loss/train': 1.3224612474441528} 11/07/2021 13:54:40 - INFO - __main__ - Step 118725: {'lr': 5.310045277278511e-05, 'samples': 22795200, 'steps': 118724, 'loss/train': 1.0197712182998657} 11/07/2021 13:54:41 - INFO - __main__ - Step 118726: {'lr': 5.309718285875226e-05, 'samples': 22795392, 'steps': 118725, 'loss/train': 1.000282883644104} 11/07/2021 13:54:41 - INFO - __main__ - Step 118727: {'lr': 5.309391303344077e-05, 'samples': 22795584, 'steps': 118726, 'loss/train': 2.5951755046844482} 11/07/2021 13:54:41 - INFO - __main__ - Step 118728: {'lr': 5.309064329685212e-05, 'samples': 22795776, 'steps': 118727, 'loss/train': 2.5644371509552} 11/07/2021 13:54:42 - INFO - __main__ - Step 118729: {'lr': 5.30873736489878e-05, 'samples': 22795968, 'steps': 118728, 'loss/train': 1.5421971082687378} 11/07/2021 13:54:43 - INFO - __main__ - Step 118730: {'lr': 5.308410408984929e-05, 'samples': 22796160, 'steps': 118729, 'loss/train': 1.5800760984420776} 11/07/2021 13:54:43 - INFO - __main__ - Step 118731: {'lr': 5.308083461943802e-05, 'samples': 22796352, 'steps': 118730, 'loss/train': 1.498619794845581} 11/07/2021 13:54:43 - INFO - __main__ - Step 118732: {'lr': 5.307756523775551e-05, 'samples': 22796544, 'steps': 118731, 'loss/train': 0.6062990427017212} 11/07/2021 13:54:44 - INFO - __main__ - Step 118733: {'lr': 5.3074295944803176e-05, 'samples': 22796736, 'steps': 118732, 'loss/train': 1.2370636463165283} 11/07/2021 13:54:44 - INFO - __main__ - Step 118734: {'lr': 5.307102674058256e-05, 'samples': 22796928, 'steps': 118733, 'loss/train': 1.3383581638336182} 11/07/2021 13:54:45 - INFO - __main__ - Step 118735: {'lr': 5.306775762509508e-05, 'samples': 22797120, 'steps': 118734, 'loss/train': 1.8401293754577637} 11/07/2021 13:54:46 - INFO - __main__ - Step 118736: {'lr': 5.3064488598342285e-05, 'samples': 22797312, 'steps': 118735, 'loss/train': 1.1765056848526} 11/07/2021 13:54:46 - INFO - __main__ - Step 118737: {'lr': 5.3061219660325566e-05, 'samples': 22797504, 'steps': 118736, 'loss/train': 1.3325384855270386} 11/07/2021 13:54:46 - INFO - __main__ - Step 118738: {'lr': 5.305795081104639e-05, 'samples': 22797696, 'steps': 118737, 'loss/train': 1.084464192390442} 11/07/2021 13:54:47 - INFO - __main__ - Step 118739: {'lr': 5.305468205050626e-05, 'samples': 22797888, 'steps': 118738, 'loss/train': 1.4517168998718262} 11/07/2021 13:54:48 - INFO - __main__ - Step 118740: {'lr': 5.305141337870667e-05, 'samples': 22798080, 'steps': 118739, 'loss/train': 0.9543129205703735} 11/07/2021 13:54:48 - INFO - __main__ - Step 118741: {'lr': 5.304814479564907e-05, 'samples': 22798272, 'steps': 118740, 'loss/train': 1.060693383216858} 11/07/2021 13:54:49 - INFO - __main__ - Step 118742: {'lr': 5.3044876301334924e-05, 'samples': 22798464, 'steps': 118741, 'loss/train': 1.3681174516677856} 11/07/2021 13:54:49 - INFO - __main__ - Step 118743: {'lr': 5.304160789576573e-05, 'samples': 22798656, 'steps': 118742, 'loss/train': 1.742991328239441} 11/07/2021 13:54:50 - INFO - __main__ - Step 118744: {'lr': 5.303833957894294e-05, 'samples': 22798848, 'steps': 118743, 'loss/train': 1.5412837266921997} 11/07/2021 13:54:50 - INFO - __main__ - Step 118745: {'lr': 5.303507135086805e-05, 'samples': 22799040, 'steps': 118744, 'loss/train': 1.2974205017089844} 11/07/2021 13:54:50 - INFO - __main__ - Step 118746: {'lr': 5.30318032115425e-05, 'samples': 22799232, 'steps': 118745, 'loss/train': 5.526613712310791} 11/07/2021 13:54:51 - INFO - __main__ - Step 118747: {'lr': 5.302853516096787e-05, 'samples': 22799424, 'steps': 118746, 'loss/train': 5.457108974456787} 11/07/2021 13:54:52 - INFO - __main__ - Step 118748: {'lr': 5.302526719914544e-05, 'samples': 22799616, 'steps': 118747, 'loss/train': 1.2589956521987915} 11/07/2021 13:54:52 - INFO - __main__ - Step 118749: {'lr': 5.302199932607682e-05, 'samples': 22799808, 'steps': 118748, 'loss/train': 0.8424980044364929} 11/07/2021 13:54:52 - INFO - __main__ - Step 118750: {'lr': 5.3018731541763424e-05, 'samples': 22800000, 'steps': 118749, 'loss/train': 1.2028619050979614} 11/07/2021 13:54:53 - INFO - __main__ - Step 118751: {'lr': 5.301546384620676e-05, 'samples': 22800192, 'steps': 118750, 'loss/train': 1.5910770893096924} 11/07/2021 13:54:54 - INFO - __main__ - Step 118752: {'lr': 5.301219623940828e-05, 'samples': 22800384, 'steps': 118751, 'loss/train': 0.13232021033763885} 11/07/2021 13:54:54 - INFO - __main__ - Step 118753: {'lr': 5.3008928721369474e-05, 'samples': 22800576, 'steps': 118752, 'loss/train': 1.4260708093643188} 11/07/2021 13:54:55 - INFO - __main__ - Step 118754: {'lr': 5.3005661292091803e-05, 'samples': 22800768, 'steps': 118753, 'loss/train': 1.3405250310897827} 11/07/2021 13:54:55 - INFO - __main__ - Step 118755: {'lr': 5.300239395157674e-05, 'samples': 22800960, 'steps': 118754, 'loss/train': 1.4276211261749268} 11/07/2021 13:54:55 - INFO - __main__ - Step 118756: {'lr': 5.299912669982576e-05, 'samples': 22801152, 'steps': 118755, 'loss/train': 1.6466946601867676} 11/07/2021 13:54:56 - INFO - __main__ - Step 118757: {'lr': 5.299585953684033e-05, 'samples': 22801344, 'steps': 118756, 'loss/train': 1.3382418155670166} 11/07/2021 13:54:57 - INFO - __main__ - Step 118758: {'lr': 5.2992592462622e-05, 'samples': 22801536, 'steps': 118757, 'loss/train': 0.999945878982544} 11/07/2021 13:54:57 - INFO - __main__ - Step 118759: {'lr': 5.298932547717209e-05, 'samples': 22801728, 'steps': 118758, 'loss/train': 1.2082903385162354} 11/07/2021 13:54:57 - INFO - __main__ - Step 118760: {'lr': 5.298605858049216e-05, 'samples': 22801920, 'steps': 118759, 'loss/train': 1.29362952709198} 11/07/2021 13:54:58 - INFO - __main__ - Step 118761: {'lr': 5.298279177258366e-05, 'samples': 22802112, 'steps': 118760, 'loss/train': 1.8012889623641968} 11/07/2021 13:54:59 - INFO - __main__ - Step 118762: {'lr': 5.297952505344808e-05, 'samples': 22802304, 'steps': 118761, 'loss/train': 1.3442764282226562} 11/07/2021 13:55:00 - INFO - __main__ - Step 118763: {'lr': 5.2976258423086874e-05, 'samples': 22802496, 'steps': 118762, 'loss/train': 1.0595566034317017} 11/07/2021 13:55:00 - INFO - __main__ - Step 118764: {'lr': 5.2972991881501535e-05, 'samples': 22802688, 'steps': 118763, 'loss/train': 0.5122440457344055} 11/07/2021 13:55:00 - INFO - __main__ - Step 118765: {'lr': 5.296972542869355e-05, 'samples': 22802880, 'steps': 118764, 'loss/train': 1.2500108480453491} 11/07/2021 13:55:01 - INFO - __main__ - Step 118766: {'lr': 5.296645906466432e-05, 'samples': 22803072, 'steps': 118765, 'loss/train': 1.4387471675872803} 11/07/2021 13:55:01 - INFO - __main__ - Step 118767: {'lr': 5.296319278941539e-05, 'samples': 22803264, 'steps': 118766, 'loss/train': 1.7152786254882812} 11/07/2021 13:55:01 - INFO - __main__ - Step 118768: {'lr': 5.295992660294821e-05, 'samples': 22803456, 'steps': 118767, 'loss/train': 1.6846259832382202} 11/07/2021 13:55:02 - INFO - __main__ - Step 118769: {'lr': 5.295666050526432e-05, 'samples': 22803648, 'steps': 118768, 'loss/train': 1.2184303998947144} 11/07/2021 13:55:03 - INFO - __main__ - Step 118770: {'lr': 5.295339449636502e-05, 'samples': 22803840, 'steps': 118769, 'loss/train': 1.2172553539276123} 11/07/2021 13:55:03 - INFO - __main__ - Step 118771: {'lr': 5.295012857625189e-05, 'samples': 22804032, 'steps': 118770, 'loss/train': 0.8555411696434021} 11/07/2021 13:55:03 - INFO - __main__ - Step 118772: {'lr': 5.294686274492641e-05, 'samples': 22804224, 'steps': 118771, 'loss/train': 1.440453290939331} 11/07/2021 13:55:04 - INFO - __main__ - Step 118773: {'lr': 5.294359700239001e-05, 'samples': 22804416, 'steps': 118772, 'loss/train': 1.448622465133667} 11/07/2021 13:55:05 - INFO - __main__ - Step 118774: {'lr': 5.2940331348644206e-05, 'samples': 22804608, 'steps': 118773, 'loss/train': 1.310640811920166} 11/07/2021 13:55:05 - INFO - __main__ - Step 118775: {'lr': 5.2937065783690423e-05, 'samples': 22804800, 'steps': 118774, 'loss/train': 1.5353788137435913} 11/07/2021 13:55:06 - INFO - __main__ - Step 118776: {'lr': 5.293380030753017e-05, 'samples': 22804992, 'steps': 118775, 'loss/train': 0.8424102067947388} 11/07/2021 13:55:06 - INFO - __main__ - Step 118777: {'lr': 5.293053492016492e-05, 'samples': 22805184, 'steps': 118776, 'loss/train': 1.4161444902420044} 11/07/2021 13:55:06 - INFO - __main__ - Step 118778: {'lr': 5.292726962159611e-05, 'samples': 22805376, 'steps': 118777, 'loss/train': 0.912663996219635} 11/07/2021 13:55:07 - INFO - __main__ - Step 118779: {'lr': 5.292400441182524e-05, 'samples': 22805568, 'steps': 118778, 'loss/train': 1.3905658721923828} 11/07/2021 13:55:08 - INFO - __main__ - Step 118780: {'lr': 5.292073929085384e-05, 'samples': 22805760, 'steps': 118779, 'loss/train': 1.4283738136291504} 11/07/2021 13:55:08 - INFO - __main__ - Step 118781: {'lr': 5.2917474258683264e-05, 'samples': 22805952, 'steps': 118780, 'loss/train': 0.9695920348167419} 11/07/2021 13:55:08 - INFO - __main__ - Step 118782: {'lr': 5.2914209315315015e-05, 'samples': 22806144, 'steps': 118781, 'loss/train': 1.5097602605819702} 11/07/2021 13:55:09 - INFO - __main__ - Step 118783: {'lr': 5.291094446075057e-05, 'samples': 22806336, 'steps': 118782, 'loss/train': 0.6817830204963684} 11/07/2021 13:55:09 - INFO - __main__ - Step 118784: {'lr': 5.290767969499141e-05, 'samples': 22806528, 'steps': 118783, 'loss/train': 1.3237533569335938} 11/07/2021 13:55:10 - INFO - __main__ - Step 118785: {'lr': 5.2904415018039026e-05, 'samples': 22806720, 'steps': 118784, 'loss/train': 1.2284038066864014} 11/07/2021 13:55:10 - INFO - __main__ - Step 118786: {'lr': 5.290115042989488e-05, 'samples': 22806912, 'steps': 118785, 'loss/train': 1.4135870933532715} 11/07/2021 13:55:11 - INFO - __main__ - Step 118787: {'lr': 5.289788593056041e-05, 'samples': 22807104, 'steps': 118786, 'loss/train': 1.4988124370574951} 11/07/2021 13:55:11 - INFO - __main__ - Step 118788: {'lr': 5.289462152003713e-05, 'samples': 22807296, 'steps': 118787, 'loss/train': 1.5459924936294556} 11/07/2021 13:55:11 - INFO - __main__ - Step 118789: {'lr': 5.289135719832649e-05, 'samples': 22807488, 'steps': 118788, 'loss/train': 0.8169487118721008} 11/07/2021 13:55:13 - INFO - __main__ - Step 118790: {'lr': 5.288809296543001e-05, 'samples': 22807680, 'steps': 118789, 'loss/train': 0.8980430364608765} 11/07/2021 13:55:13 - INFO - __main__ - Step 118791: {'lr': 5.288482882134907e-05, 'samples': 22807872, 'steps': 118790, 'loss/train': 0.9950320720672607} 11/07/2021 13:55:13 - INFO - __main__ - Step 118792: {'lr': 5.288156476608516e-05, 'samples': 22808064, 'steps': 118791, 'loss/train': 0.7608320116996765} 11/07/2021 13:55:14 - INFO - __main__ - Step 118793: {'lr': 5.287830079963979e-05, 'samples': 22808256, 'steps': 118792, 'loss/train': 0.6939772367477417} 11/07/2021 13:55:14 - INFO - __main__ - Step 118794: {'lr': 5.287503692201443e-05, 'samples': 22808448, 'steps': 118793, 'loss/train': 1.9063655138015747} 11/07/2021 13:55:15 - INFO - __main__ - Step 118795: {'lr': 5.287177313321051e-05, 'samples': 22808640, 'steps': 118794, 'loss/train': 1.153857707977295} 11/07/2021 13:55:15 - INFO - __main__ - Step 118796: {'lr': 5.2868509433229544e-05, 'samples': 22808832, 'steps': 118795, 'loss/train': 1.151140809059143} 11/07/2021 13:55:16 - INFO - __main__ - Step 118797: {'lr': 5.2865245822073e-05, 'samples': 22809024, 'steps': 118796, 'loss/train': 1.1403247117996216} 11/07/2021 13:55:16 - INFO - __main__ - Step 118798: {'lr': 5.2861982299742314e-05, 'samples': 22809216, 'steps': 118797, 'loss/train': 1.3368589878082275} 11/07/2021 13:55:16 - INFO - __main__ - Step 118799: {'lr': 5.285871886623897e-05, 'samples': 22809408, 'steps': 118798, 'loss/train': 1.4713102579116821} 11/07/2021 13:55:17 - INFO - __main__ - Step 118800: {'lr': 5.285545552156445e-05, 'samples': 22809600, 'steps': 118799, 'loss/train': 1.6359028816223145} 11/07/2021 13:55:18 - INFO - __main__ - Step 118801: {'lr': 5.285219226572022e-05, 'samples': 22809792, 'steps': 118800, 'loss/train': 1.298877239227295} 11/07/2021 13:55:18 - INFO - __main__ - Step 118802: {'lr': 5.284892909870775e-05, 'samples': 22809984, 'steps': 118801, 'loss/train': 1.4066379070281982} 11/07/2021 13:55:18 - INFO - __main__ - Step 118803: {'lr': 5.284566602052859e-05, 'samples': 22810176, 'steps': 118802, 'loss/train': 1.5264908075332642} 11/07/2021 13:55:19 - INFO - __main__ - Step 118804: {'lr': 5.284240303118407e-05, 'samples': 22810368, 'steps': 118803, 'loss/train': 1.020400881767273} 11/07/2021 13:55:19 - INFO - __main__ - Step 118805: {'lr': 5.28391401306757e-05, 'samples': 22810560, 'steps': 118804, 'loss/train': 0.9147961735725403} 11/07/2021 13:55:20 - INFO - __main__ - Step 118806: {'lr': 5.2835877319004965e-05, 'samples': 22810752, 'steps': 118805, 'loss/train': 5.650150299072266} 11/07/2021 13:55:21 - INFO - __main__ - Step 118807: {'lr': 5.2832614596173364e-05, 'samples': 22810944, 'steps': 118806, 'loss/train': 1.2036457061767578} 11/07/2021 13:55:21 - INFO - __main__ - Step 118808: {'lr': 5.282935196218233e-05, 'samples': 22811136, 'steps': 118807, 'loss/train': 1.2309561967849731} 11/07/2021 13:55:21 - INFO - __main__ - Step 118809: {'lr': 5.282608941703335e-05, 'samples': 22811328, 'steps': 118808, 'loss/train': 1.7286714315414429} 11/07/2021 13:55:22 - INFO - __main__ - Step 118810: {'lr': 5.282282696072788e-05, 'samples': 22811520, 'steps': 118809, 'loss/train': 1.3032394647598267} 11/07/2021 13:55:23 - INFO - __main__ - Step 118811: {'lr': 5.281956459326742e-05, 'samples': 22811712, 'steps': 118810, 'loss/train': 0.8593766689300537} 11/07/2021 13:55:23 - INFO - __main__ - Step 118812: {'lr': 5.2816302314653424e-05, 'samples': 22811904, 'steps': 118811, 'loss/train': 1.1641845703125} 11/07/2021 13:55:23 - INFO - __main__ - Step 118813: {'lr': 5.281304012488733e-05, 'samples': 22812096, 'steps': 118812, 'loss/train': 1.4160622358322144} 11/07/2021 13:55:24 - INFO - __main__ - Step 118814: {'lr': 5.280977802397066e-05, 'samples': 22812288, 'steps': 118813, 'loss/train': 1.2703675031661987} 11/07/2021 13:55:24 - INFO - __main__ - Step 118815: {'lr': 5.2806516011904866e-05, 'samples': 22812480, 'steps': 118814, 'loss/train': 1.498687744140625} 11/07/2021 13:55:25 - INFO - __main__ - Step 118816: {'lr': 5.280325408869147e-05, 'samples': 22812672, 'steps': 118815, 'loss/train': 0.7005724906921387} 11/07/2021 13:55:26 - INFO - __main__ - Step 118817: {'lr': 5.279999225433182e-05, 'samples': 22812864, 'steps': 118816, 'loss/train': 1.2571734189987183} 11/07/2021 13:55:26 - INFO - __main__ - Step 118818: {'lr': 5.279673050882747e-05, 'samples': 22813056, 'steps': 118817, 'loss/train': 0.364201158285141} 11/07/2021 13:55:26 - INFO - __main__ - Step 118819: {'lr': 5.279346885217984e-05, 'samples': 22813248, 'steps': 118818, 'loss/train': 1.1018025875091553} 11/07/2021 13:55:27 - INFO - __main__ - Step 118820: {'lr': 5.279020728439043e-05, 'samples': 22813440, 'steps': 118819, 'loss/train': 1.6452962160110474} 11/07/2021 13:55:27 - INFO - __main__ - Step 118821: {'lr': 5.278694580546073e-05, 'samples': 22813632, 'steps': 118820, 'loss/train': 1.098785161972046} 11/07/2021 13:55:28 - INFO - __main__ - Step 118822: {'lr': 5.278368441539219e-05, 'samples': 22813824, 'steps': 118821, 'loss/train': 1.4633913040161133} 11/07/2021 13:55:28 - INFO - __main__ - Step 118823: {'lr': 5.2780423114186265e-05, 'samples': 22814016, 'steps': 118822, 'loss/train': 1.5700258016586304} 11/07/2021 13:55:29 - INFO - __main__ - Step 118824: {'lr': 5.277716190184442e-05, 'samples': 22814208, 'steps': 118823, 'loss/train': 1.3516628742218018} 11/07/2021 13:55:29 - INFO - __main__ - Step 118825: {'lr': 5.277390077836819e-05, 'samples': 22814400, 'steps': 118824, 'loss/train': 1.2815748453140259} 11/07/2021 13:55:29 - INFO - __main__ - Step 118826: {'lr': 5.2770639743758956e-05, 'samples': 22814592, 'steps': 118825, 'loss/train': 1.2019888162612915} 11/07/2021 13:55:31 - INFO - __main__ - Step 118827: {'lr': 5.276737879801824e-05, 'samples': 22814784, 'steps': 118826, 'loss/train': 1.371923804283142} 11/07/2021 13:55:31 - INFO - __main__ - Step 118828: {'lr': 5.276411794114752e-05, 'samples': 22814976, 'steps': 118827, 'loss/train': 1.339579701423645} 11/07/2021 13:55:31 - INFO - __main__ - Step 118829: {'lr': 5.276085717314821e-05, 'samples': 22815168, 'steps': 118828, 'loss/train': 0.3026787042617798} 11/07/2021 13:55:32 - INFO - __main__ - Step 118830: {'lr': 5.27575964940219e-05, 'samples': 22815360, 'steps': 118829, 'loss/train': 1.422371745109558} 11/07/2021 13:55:32 - INFO - __main__ - Step 118831: {'lr': 5.2754335903769904e-05, 'samples': 22815552, 'steps': 118830, 'loss/train': 1.3133126497268677} 11/07/2021 13:55:33 - INFO - __main__ - Step 118832: {'lr': 5.2751075402393764e-05, 'samples': 22815744, 'steps': 118831, 'loss/train': 1.5364203453063965} 11/07/2021 13:55:33 - INFO - __main__ - Step 118833: {'lr': 5.274781498989495e-05, 'samples': 22815936, 'steps': 118832, 'loss/train': 1.7667028903961182} 11/07/2021 13:55:34 - INFO - __main__ - Step 118834: {'lr': 5.274455466627492e-05, 'samples': 22816128, 'steps': 118833, 'loss/train': 1.5109479427337646} 11/07/2021 13:55:34 - INFO - __main__ - Step 118835: {'lr': 5.274129443153514e-05, 'samples': 22816320, 'steps': 118834, 'loss/train': 0.1550804078578949} 11/07/2021 13:55:34 - INFO - __main__ - Step 118836: {'lr': 5.273803428567711e-05, 'samples': 22816512, 'steps': 118835, 'loss/train': 1.091874122619629} 11/07/2021 13:55:35 - INFO - __main__ - Step 118837: {'lr': 5.273477422870227e-05, 'samples': 22816704, 'steps': 118836, 'loss/train': 1.1337915658950806} 11/07/2021 13:55:36 - INFO - __main__ - Step 118838: {'lr': 5.2731514260612095e-05, 'samples': 22816896, 'steps': 118837, 'loss/train': 0.6715529561042786} 11/07/2021 13:55:36 - INFO - __main__ - Step 118839: {'lr': 5.272825438140805e-05, 'samples': 22817088, 'steps': 118838, 'loss/train': 1.422790765762329} 11/07/2021 13:55:36 - INFO - __main__ - Step 118840: {'lr': 5.272499459109162e-05, 'samples': 22817280, 'steps': 118839, 'loss/train': 1.2347103357315063} 11/07/2021 13:55:37 - INFO - __main__ - Step 118841: {'lr': 5.272173488966425e-05, 'samples': 22817472, 'steps': 118840, 'loss/train': 1.7418798208236694} 11/07/2021 13:55:37 - INFO - __main__ - Step 118842: {'lr': 5.271847527712742e-05, 'samples': 22817664, 'steps': 118841, 'loss/train': 1.5383365154266357} 11/07/2021 13:55:38 - INFO - __main__ - Step 118843: {'lr': 5.271521575348268e-05, 'samples': 22817856, 'steps': 118842, 'loss/train': 1.5875961780548096} 11/07/2021 13:55:38 - INFO - __main__ - Step 118844: {'lr': 5.2711956318731355e-05, 'samples': 22818048, 'steps': 118843, 'loss/train': 1.2637704610824585} 11/07/2021 13:55:39 - INFO - __main__ - Step 118845: {'lr': 5.2708696972874973e-05, 'samples': 22818240, 'steps': 118844, 'loss/train': 1.4718447923660278} 11/07/2021 13:55:39 - INFO - __main__ - Step 118846: {'lr': 5.270543771591502e-05, 'samples': 22818432, 'steps': 118845, 'loss/train': 2.93221378326416} 11/07/2021 13:55:40 - INFO - __main__ - Step 118847: {'lr': 5.270217854785292e-05, 'samples': 22818624, 'steps': 118846, 'loss/train': 1.5436890125274658} 11/07/2021 13:55:41 - INFO - __main__ - Step 118848: {'lr': 5.269891946869021e-05, 'samples': 22818816, 'steps': 118847, 'loss/train': 1.202144980430603} 11/07/2021 13:55:41 - INFO - __main__ - Step 118849: {'lr': 5.2695660478428307e-05, 'samples': 22819008, 'steps': 118848, 'loss/train': 1.2369509935379028} 11/07/2021 13:55:41 - INFO - __main__ - Step 118850: {'lr': 5.269240157706867e-05, 'samples': 22819200, 'steps': 118849, 'loss/train': 1.2441462278366089} 11/07/2021 13:55:42 - INFO - __main__ - Step 118851: {'lr': 5.268914276461281e-05, 'samples': 22819392, 'steps': 118850, 'loss/train': 1.236479640007019} 11/07/2021 13:55:42 - INFO - __main__ - Step 118852: {'lr': 5.26858840410622e-05, 'samples': 22819584, 'steps': 118851, 'loss/train': 1.3587321043014526} 11/07/2021 13:55:43 - INFO - __main__ - Step 118853: {'lr': 5.268262540641827e-05, 'samples': 22819776, 'steps': 118852, 'loss/train': 1.0751376152038574} 11/07/2021 13:55:44 - INFO - __main__ - Step 118854: {'lr': 5.2679366860682507e-05, 'samples': 22819968, 'steps': 118853, 'loss/train': 1.2015043497085571} 11/07/2021 13:55:44 - INFO - __main__ - Step 118855: {'lr': 5.267610840385637e-05, 'samples': 22820160, 'steps': 118854, 'loss/train': 1.513192057609558} 11/07/2021 13:55:44 - INFO - __main__ - Step 118856: {'lr': 5.267285003594133e-05, 'samples': 22820352, 'steps': 118855, 'loss/train': 0.8007073998451233} 11/07/2021 13:55:45 - INFO - __main__ - Step 118857: {'lr': 5.266959175693894e-05, 'samples': 22820544, 'steps': 118856, 'loss/train': 0.4327601492404938} 11/07/2021 13:55:46 - INFO - __main__ - Step 118858: {'lr': 5.266633356685052e-05, 'samples': 22820736, 'steps': 118857, 'loss/train': 1.6376063823699951} 11/07/2021 13:55:46 - INFO - __main__ - Step 118859: {'lr': 5.2663075465677585e-05, 'samples': 22820928, 'steps': 118858, 'loss/train': 1.1409838199615479} 11/07/2021 13:55:46 - INFO - __main__ - Step 118860: {'lr': 5.265981745342163e-05, 'samples': 22821120, 'steps': 118859, 'loss/train': 1.0533758401870728} 11/07/2021 13:55:47 - INFO - __main__ - Step 118861: {'lr': 5.2656559530084136e-05, 'samples': 22821312, 'steps': 118860, 'loss/train': 1.1005520820617676} 11/07/2021 13:55:47 - INFO - __main__ - Step 118862: {'lr': 5.2653301695666536e-05, 'samples': 22821504, 'steps': 118861, 'loss/train': 0.7784494757652283} 11/07/2021 13:55:48 - INFO - __main__ - Step 118863: {'lr': 5.265004395017031e-05, 'samples': 22821696, 'steps': 118862, 'loss/train': 1.5010699033737183} 11/07/2021 13:55:49 - INFO - __main__ - Step 118864: {'lr': 5.264678629359693e-05, 'samples': 22821888, 'steps': 118863, 'loss/train': 0.6170837879180908} 11/07/2021 13:55:49 - INFO - __main__ - Step 118865: {'lr': 5.2643528725947854e-05, 'samples': 22822080, 'steps': 118864, 'loss/train': 1.6439459323883057} 11/07/2021 13:55:49 - INFO - __main__ - Step 118866: {'lr': 5.264027124722459e-05, 'samples': 22822272, 'steps': 118865, 'loss/train': 1.5500506162643433} 11/07/2021 13:55:50 - INFO - __main__ - Step 118867: {'lr': 5.2637013857428555e-05, 'samples': 22822464, 'steps': 118866, 'loss/train': 0.6981337070465088} 11/07/2021 13:55:51 - INFO - __main__ - Step 118868: {'lr': 5.2633756556561216e-05, 'samples': 22822656, 'steps': 118867, 'loss/train': 1.1045881509780884} 11/07/2021 13:55:51 - INFO - __main__ - Step 118869: {'lr': 5.2630499344624077e-05, 'samples': 22822848, 'steps': 118868, 'loss/train': 1.200899362564087} 11/07/2021 13:55:51 - INFO - __main__ - Step 118870: {'lr': 5.262724222161866e-05, 'samples': 22823040, 'steps': 118869, 'loss/train': 1.3282809257507324} 11/07/2021 13:55:52 - INFO - __main__ - Step 118871: {'lr': 5.26239851875463e-05, 'samples': 22823232, 'steps': 118870, 'loss/train': 1.5588595867156982} 11/07/2021 13:55:52 - INFO - __main__ - Step 118872: {'lr': 5.2620728242408516e-05, 'samples': 22823424, 'steps': 118871, 'loss/train': 1.5781288146972656} 11/07/2021 13:55:52 - INFO - __main__ - Step 118873: {'lr': 5.2617471386206817e-05, 'samples': 22823616, 'steps': 118872, 'loss/train': 1.7385737895965576} 11/07/2021 13:55:53 - INFO - __main__ - Step 118874: {'lr': 5.2614214618942614e-05, 'samples': 22823808, 'steps': 118873, 'loss/train': 2.5100009441375732} 11/07/2021 13:55:54 - INFO - __main__ - Step 118875: {'lr': 5.261095794061738e-05, 'samples': 22824000, 'steps': 118874, 'loss/train': 0.6206822991371155} 11/07/2021 13:55:54 - INFO - __main__ - Step 118876: {'lr': 5.260770135123264e-05, 'samples': 22824192, 'steps': 118875, 'loss/train': 1.4250504970550537} 11/07/2021 13:55:55 - INFO - __main__ - Step 118877: {'lr': 5.2604444850789804e-05, 'samples': 22824384, 'steps': 118876, 'loss/train': 1.513622522354126} 11/07/2021 13:55:55 - INFO - __main__ - Step 118878: {'lr': 5.260118843929035e-05, 'samples': 22824576, 'steps': 118877, 'loss/train': 1.371610403060913} 11/07/2021 13:55:55 - INFO - __main__ - Step 118879: {'lr': 5.259793211673578e-05, 'samples': 22824768, 'steps': 118878, 'loss/train': 1.3686341047286987} 11/07/2021 13:55:56 - INFO - __main__ - Step 118880: {'lr': 5.25946758831275e-05, 'samples': 22824960, 'steps': 118879, 'loss/train': 1.3072643280029297} 11/07/2021 13:55:57 - INFO - __main__ - Step 118881: {'lr': 5.259141973846704e-05, 'samples': 22825152, 'steps': 118880, 'loss/train': 1.3447585105895996} 11/07/2021 13:55:57 - INFO - __main__ - Step 118882: {'lr': 5.2588163682755845e-05, 'samples': 22825344, 'steps': 118881, 'loss/train': 1.0237103700637817} 11/07/2021 13:55:57 - INFO - __main__ - Step 118883: {'lr': 5.258490771599536e-05, 'samples': 22825536, 'steps': 118882, 'loss/train': 1.3041586875915527} 11/07/2021 13:55:58 - INFO - __main__ - Step 118884: {'lr': 5.2581651838187136e-05, 'samples': 22825728, 'steps': 118883, 'loss/train': 1.3938343524932861} 11/07/2021 13:55:59 - INFO - __main__ - Step 118885: {'lr': 5.2578396049332514e-05, 'samples': 22825920, 'steps': 118884, 'loss/train': 0.5208465456962585} 11/07/2021 13:55:59 - INFO - __main__ - Step 118886: {'lr': 5.2575140349433034e-05, 'samples': 22826112, 'steps': 118885, 'loss/train': 0.7012737989425659} 11/07/2021 13:56:00 - INFO - __main__ - Step 118887: {'lr': 5.2571884738490114e-05, 'samples': 22826304, 'steps': 118886, 'loss/train': 1.7301926612854004} 11/07/2021 13:56:00 - INFO - __main__ - Step 118888: {'lr': 5.256862921650529e-05, 'samples': 22826496, 'steps': 118887, 'loss/train': 1.2013325691223145} 11/07/2021 13:56:00 - INFO - __main__ - Step 118889: {'lr': 5.256537378347997e-05, 'samples': 22826688, 'steps': 118888, 'loss/train': 0.056424595415592194} 11/07/2021 13:56:01 - INFO - __main__ - Step 118890: {'lr': 5.256211843941566e-05, 'samples': 22826880, 'steps': 118889, 'loss/train': 1.771228551864624} 11/07/2021 13:56:02 - INFO - __main__ - Step 118891: {'lr': 5.255886318431383e-05, 'samples': 22827072, 'steps': 118890, 'loss/train': 1.4130321741104126} 11/07/2021 13:56:02 - INFO - __main__ - Step 118892: {'lr': 5.255560801817591e-05, 'samples': 22827264, 'steps': 118891, 'loss/train': 0.7265766859054565} 11/07/2021 13:56:03 - INFO - __main__ - Step 118893: {'lr': 5.255235294100338e-05, 'samples': 22827456, 'steps': 118892, 'loss/train': 1.3542814254760742} 11/07/2021 13:56:03 - INFO - __main__ - Step 118894: {'lr': 5.254909795279772e-05, 'samples': 22827648, 'steps': 118893, 'loss/train': 1.4722260236740112} 11/07/2021 13:56:03 - INFO - __main__ - Step 118895: {'lr': 5.2545843053560385e-05, 'samples': 22827840, 'steps': 118894, 'loss/train': 0.7860344052314758} 11/07/2021 13:56:04 - INFO - __main__ - Step 118896: {'lr': 5.254258824329286e-05, 'samples': 22828032, 'steps': 118895, 'loss/train': 0.8056517839431763} 11/07/2021 13:56:05 - INFO - __main__ - Step 118897: {'lr': 5.2539333521996634e-05, 'samples': 22828224, 'steps': 118896, 'loss/train': 1.1534268856048584} 11/07/2021 13:56:05 - INFO - __main__ - Step 118898: {'lr': 5.25360788896731e-05, 'samples': 22828416, 'steps': 118897, 'loss/train': 0.5302726626396179} 11/07/2021 13:56:05 - INFO - __main__ - Step 118899: {'lr': 5.253282434632376e-05, 'samples': 22828608, 'steps': 118898, 'loss/train': 1.3545925617218018} 11/07/2021 13:56:06 - INFO - __main__ - Step 118900: {'lr': 5.252956989195007e-05, 'samples': 22828800, 'steps': 118899, 'loss/train': 1.0902429819107056} 11/07/2021 13:56:07 - INFO - __main__ - Step 118901: {'lr': 5.252631552655351e-05, 'samples': 22828992, 'steps': 118900, 'loss/train': 1.7476297616958618} 11/07/2021 13:56:07 - INFO - __main__ - Step 118902: {'lr': 5.2523061250135564e-05, 'samples': 22829184, 'steps': 118901, 'loss/train': 1.251421332359314} 11/07/2021 13:56:08 - INFO - __main__ - Step 118903: {'lr': 5.2519807062697654e-05, 'samples': 22829376, 'steps': 118902, 'loss/train': 1.9631202220916748} 11/07/2021 13:56:08 - INFO - __main__ - Step 118904: {'lr': 5.2516552964241294e-05, 'samples': 22829568, 'steps': 118903, 'loss/train': 1.9404464960098267} 11/07/2021 13:56:08 - INFO - __main__ - Step 118905: {'lr': 5.251329895476792e-05, 'samples': 22829760, 'steps': 118904, 'loss/train': 1.907281756401062} 11/07/2021 13:56:09 - INFO - __main__ - Step 118906: {'lr': 5.2510045034279003e-05, 'samples': 22829952, 'steps': 118905, 'loss/train': 1.149886965751648} 11/07/2021 13:56:10 - INFO - __main__ - Step 118907: {'lr': 5.2506791202775987e-05, 'samples': 22830144, 'steps': 118906, 'loss/train': 1.0092958211898804} 11/07/2021 13:56:10 - INFO - __main__ - Step 118908: {'lr': 5.25035374602604e-05, 'samples': 22830336, 'steps': 118907, 'loss/train': 1.2041856050491333} 11/07/2021 13:56:11 - INFO - __main__ - Step 118909: {'lr': 5.2500283806733664e-05, 'samples': 22830528, 'steps': 118908, 'loss/train': 0.4795683026313782} 11/07/2021 13:56:11 - INFO - __main__ - Step 118910: {'lr': 5.249703024219732e-05, 'samples': 22830720, 'steps': 118909, 'loss/train': 1.07111394405365} 11/07/2021 13:56:12 - INFO - __main__ - Step 118911: {'lr': 5.249377676665268e-05, 'samples': 22830912, 'steps': 118910, 'loss/train': 1.4536291360855103} 11/07/2021 13:56:12 - INFO - __main__ - Step 118912: {'lr': 5.2490523380101323e-05, 'samples': 22831104, 'steps': 118911, 'loss/train': 2.2959134578704834} 11/07/2021 13:56:13 - INFO - __main__ - Step 118913: {'lr': 5.248727008254467e-05, 'samples': 22831296, 'steps': 118912, 'loss/train': 1.6288176774978638} 11/07/2021 13:56:13 - INFO - __main__ - Step 118914: {'lr': 5.248401687398421e-05, 'samples': 22831488, 'steps': 118913, 'loss/train': 1.2440472841262817} 11/07/2021 13:56:13 - INFO - __main__ - Step 118915: {'lr': 5.24807637544214e-05, 'samples': 22831680, 'steps': 118914, 'loss/train': 1.2911993265151978} 11/07/2021 13:56:14 - INFO - __main__ - Step 118916: {'lr': 5.247751072385773e-05, 'samples': 22831872, 'steps': 118915, 'loss/train': 1.4083110094070435} 11/07/2021 13:56:15 - INFO - __main__ - Step 118917: {'lr': 5.2474257782294615e-05, 'samples': 22832064, 'steps': 118916, 'loss/train': 0.9172548055648804} 11/07/2021 13:56:15 - INFO - __main__ - Step 118918: {'lr': 5.247100492973358e-05, 'samples': 22832256, 'steps': 118917, 'loss/train': 1.3415029048919678} 11/07/2021 13:56:16 - INFO - __main__ - Step 118919: {'lr': 5.2467752166176055e-05, 'samples': 22832448, 'steps': 118918, 'loss/train': 1.2347501516342163} 11/07/2021 13:56:16 - INFO - __main__ - Step 118920: {'lr': 5.246449949162349e-05, 'samples': 22832640, 'steps': 118919, 'loss/train': 1.3585525751113892} 11/07/2021 13:56:16 - INFO - __main__ - Step 118921: {'lr': 5.2461246906077396e-05, 'samples': 22832832, 'steps': 118920, 'loss/train': 1.2940499782562256} 11/07/2021 13:56:18 - INFO - __main__ - Step 118922: {'lr': 5.245799440953922e-05, 'samples': 22833024, 'steps': 118921, 'loss/train': 1.3349905014038086} 11/07/2021 13:56:18 - INFO - __main__ - Step 118923: {'lr': 5.245474200201042e-05, 'samples': 22833216, 'steps': 118922, 'loss/train': 1.1538724899291992} 11/07/2021 13:56:18 - INFO - __main__ - Step 118924: {'lr': 5.24514896834925e-05, 'samples': 22833408, 'steps': 118923, 'loss/train': 1.6695427894592285} 11/07/2021 13:56:19 - INFO - __main__ - Step 118925: {'lr': 5.2448237453986856e-05, 'samples': 22833600, 'steps': 118924, 'loss/train': 1.5849627256393433} 11/07/2021 13:56:19 - INFO - __main__ - Step 118926: {'lr': 5.244498531349498e-05, 'samples': 22833792, 'steps': 118925, 'loss/train': 0.1417188197374344} 11/07/2021 13:56:20 - INFO - __main__ - Step 118927: {'lr': 5.2441733262018346e-05, 'samples': 22833984, 'steps': 118926, 'loss/train': 1.8388291597366333} 11/07/2021 13:56:21 - INFO - __main__ - Step 118928: {'lr': 5.243848129955842e-05, 'samples': 22834176, 'steps': 118927, 'loss/train': 1.3170890808105469} 11/07/2021 13:56:21 - INFO - __main__ - Step 118929: {'lr': 5.243522942611667e-05, 'samples': 22834368, 'steps': 118928, 'loss/train': 1.3352159261703491} 11/07/2021 13:56:22 - INFO - __main__ - Step 118930: {'lr': 5.243197764169455e-05, 'samples': 22834560, 'steps': 118929, 'loss/train': 1.1042767763137817} 11/07/2021 13:56:22 - INFO - __main__ - Step 118931: {'lr': 5.242872594629352e-05, 'samples': 22834752, 'steps': 118930, 'loss/train': 0.9970715641975403} 11/07/2021 13:56:23 - INFO - __main__ - Step 118932: {'lr': 5.24254743399151e-05, 'samples': 22834944, 'steps': 118931, 'loss/train': 1.3740795850753784} 11/07/2021 13:56:24 - INFO - __main__ - Step 118933: {'lr': 5.2422222822560676e-05, 'samples': 22835136, 'steps': 118932, 'loss/train': 1.5690053701400757} 11/07/2021 13:56:24 - INFO - __main__ - Step 118934: {'lr': 5.241897139423177e-05, 'samples': 22835328, 'steps': 118933, 'loss/train': 1.3143982887268066} 11/07/2021 13:56:24 - INFO - __main__ - Step 118935: {'lr': 5.241572005492981e-05, 'samples': 22835520, 'steps': 118934, 'loss/train': 1.2167584896087646} 11/07/2021 13:56:25 - INFO - __main__ - Step 118936: {'lr': 5.2412468804656274e-05, 'samples': 22835712, 'steps': 118935, 'loss/train': 1.8887666463851929} 11/07/2021 13:56:25 - INFO - __main__ - Step 118937: {'lr': 5.240921764341269e-05, 'samples': 22835904, 'steps': 118936, 'loss/train': 1.0316619873046875} 11/07/2021 13:56:26 - INFO - __main__ - Step 118938: {'lr': 5.240596657120042e-05, 'samples': 22836096, 'steps': 118937, 'loss/train': 0.4537408947944641} 11/07/2021 13:56:26 - INFO - __main__ - Step 118939: {'lr': 5.2402715588020985e-05, 'samples': 22836288, 'steps': 118938, 'loss/train': 1.386381983757019} 11/07/2021 13:56:27 - INFO - __main__ - Step 118940: {'lr': 5.23994646938758e-05, 'samples': 22836480, 'steps': 118939, 'loss/train': 1.575568437576294} 11/07/2021 13:56:27 - INFO - __main__ - Step 118941: {'lr': 5.239621388876639e-05, 'samples': 22836672, 'steps': 118940, 'loss/train': 0.777921199798584} 11/07/2021 13:56:27 - INFO - __main__ - Step 118942: {'lr': 5.239296317269418e-05, 'samples': 22836864, 'steps': 118941, 'loss/train': 1.156182885169983} 11/07/2021 13:56:28 - INFO - __main__ - Step 118943: {'lr': 5.238971254566066e-05, 'samples': 22837056, 'steps': 118942, 'loss/train': 1.6540292501449585} 11/07/2021 13:56:29 - INFO - __main__ - Step 118944: {'lr': 5.238646200766731e-05, 'samples': 22837248, 'steps': 118943, 'loss/train': 0.9985964894294739} 11/07/2021 13:56:29 - INFO - __main__ - Step 118945: {'lr': 5.238321155871553e-05, 'samples': 22837440, 'steps': 118944, 'loss/train': 1.1958301067352295} 11/07/2021 13:56:29 - INFO - __main__ - Step 118946: {'lr': 5.237996119880686e-05, 'samples': 22837632, 'steps': 118945, 'loss/train': 1.6258814334869385} 11/07/2021 13:56:30 - INFO - __main__ - Step 118947: {'lr': 5.237671092794272e-05, 'samples': 22837824, 'steps': 118946, 'loss/train': 1.6663185358047485} 11/07/2021 13:56:31 - INFO - __main__ - Step 118948: {'lr': 5.2373460746124564e-05, 'samples': 22838016, 'steps': 118947, 'loss/train': 1.2333691120147705} 11/07/2021 13:56:31 - INFO - __main__ - Step 118949: {'lr': 5.23702106533539e-05, 'samples': 22838208, 'steps': 118948, 'loss/train': 1.2607890367507935} 11/07/2021 13:56:32 - INFO - __main__ - Step 118950: {'lr': 5.236696064963214e-05, 'samples': 22838400, 'steps': 118949, 'loss/train': 1.3217769861221313} 11/07/2021 13:56:32 - INFO - __main__ - Step 118951: {'lr': 5.236371073496088e-05, 'samples': 22838592, 'steps': 118950, 'loss/train': 1.0841225385665894} 11/07/2021 13:56:32 - INFO - __main__ - Step 118952: {'lr': 5.236046090934141e-05, 'samples': 22838784, 'steps': 118951, 'loss/train': 1.1971830129623413} 11/07/2021 13:56:33 - INFO - __main__ - Step 118953: {'lr': 5.235721117277526e-05, 'samples': 22838976, 'steps': 118952, 'loss/train': 1.5106885433197021} 11/07/2021 13:56:34 - INFO - __main__ - Step 118954: {'lr': 5.2353961525263895e-05, 'samples': 22839168, 'steps': 118953, 'loss/train': 1.2065629959106445} 11/07/2021 13:56:34 - INFO - __main__ - Step 118955: {'lr': 5.23507119668088e-05, 'samples': 22839360, 'steps': 118954, 'loss/train': 1.4213805198669434} 11/07/2021 13:56:35 - INFO - __main__ - Step 118956: {'lr': 5.23474624974114e-05, 'samples': 22839552, 'steps': 118955, 'loss/train': 0.9902083873748779} 11/07/2021 13:56:35 - INFO - __main__ - Step 118957: {'lr': 5.234421311707319e-05, 'samples': 22839744, 'steps': 118956, 'loss/train': 1.5169306993484497} 11/07/2021 13:56:36 - INFO - __main__ - Step 118958: {'lr': 5.234096382579565e-05, 'samples': 22839936, 'steps': 118957, 'loss/train': 1.273674726486206} 11/07/2021 13:56:36 - INFO - __main__ - Step 118959: {'lr': 5.23377146235802e-05, 'samples': 22840128, 'steps': 118958, 'loss/train': 1.1082403659820557} 11/07/2021 13:56:37 - INFO - __main__ - Step 118960: {'lr': 5.233446551042834e-05, 'samples': 22840320, 'steps': 118959, 'loss/train': 1.4001505374908447} 11/07/2021 13:56:37 - INFO - __main__ - Step 118961: {'lr': 5.233121648634151e-05, 'samples': 22840512, 'steps': 118960, 'loss/train': 1.364719033241272} 11/07/2021 13:56:37 - INFO - __main__ - Step 118962: {'lr': 5.232796755132119e-05, 'samples': 22840704, 'steps': 118961, 'loss/train': 1.2902483940124512} 11/07/2021 13:56:38 - INFO - __main__ - Step 118963: {'lr': 5.2324718705368814e-05, 'samples': 22840896, 'steps': 118962, 'loss/train': 1.1206730604171753} 11/07/2021 13:56:39 - INFO - __main__ - Step 118964: {'lr': 5.2321469948485965e-05, 'samples': 22841088, 'steps': 118963, 'loss/train': 1.9546095132827759} 11/07/2021 13:56:39 - INFO - __main__ - Step 118965: {'lr': 5.231822128067393e-05, 'samples': 22841280, 'steps': 118964, 'loss/train': 0.7782416939735413} 11/07/2021 13:56:39 - INFO - __main__ - Step 118966: {'lr': 5.231497270193425e-05, 'samples': 22841472, 'steps': 118965, 'loss/train': 1.6511812210083008} 11/07/2021 13:56:40 - INFO - __main__ - Step 118967: {'lr': 5.23117242122684e-05, 'samples': 22841664, 'steps': 118966, 'loss/train': 1.1503405570983887} 11/07/2021 13:56:40 - INFO - __main__ - Step 118968: {'lr': 5.230847581167786e-05, 'samples': 22841856, 'steps': 118967, 'loss/train': 1.3477879762649536} 11/07/2021 13:56:41 - INFO - __main__ - Step 118969: {'lr': 5.2305227500164033e-05, 'samples': 22842048, 'steps': 118968, 'loss/train': 1.298258900642395} 11/07/2021 13:56:42 - INFO - __main__ - Step 118970: {'lr': 5.2301979277728426e-05, 'samples': 22842240, 'steps': 118969, 'loss/train': 1.5716309547424316} 11/07/2021 13:56:42 - INFO - __main__ - Step 118971: {'lr': 5.22987311443725e-05, 'samples': 22842432, 'steps': 118970, 'loss/train': 1.5050655603408813} 11/07/2021 13:56:43 - INFO - __main__ - Step 118972: {'lr': 5.229548310009774e-05, 'samples': 22842624, 'steps': 118971, 'loss/train': 1.2931958436965942} 11/07/2021 13:56:43 - INFO - __main__ - Step 118973: {'lr': 5.2292235144905555e-05, 'samples': 22842816, 'steps': 118972, 'loss/train': 1.8793262243270874} 11/07/2021 13:56:44 - INFO - __main__ - Step 118974: {'lr': 5.228898727879744e-05, 'samples': 22843008, 'steps': 118973, 'loss/train': 0.13995276391506195} 11/07/2021 13:56:44 - INFO - __main__ - Step 118975: {'lr': 5.228573950177487e-05, 'samples': 22843200, 'steps': 118974, 'loss/train': 1.6992987394332886} 11/07/2021 13:56:45 - INFO - __main__ - Step 118976: {'lr': 5.2282491813839286e-05, 'samples': 22843392, 'steps': 118975, 'loss/train': 1.2372413873672485} 11/07/2021 13:56:45 - INFO - __main__ - Step 118977: {'lr': 5.227924421499217e-05, 'samples': 22843584, 'steps': 118976, 'loss/train': 1.0584524869918823} 11/07/2021 13:56:45 - INFO - __main__ - Step 118978: {'lr': 5.227599670523503e-05, 'samples': 22843776, 'steps': 118977, 'loss/train': 0.890330970287323} 11/07/2021 13:56:46 - INFO - __main__ - Step 118979: {'lr': 5.2272749284569236e-05, 'samples': 22843968, 'steps': 118978, 'loss/train': 0.9936787486076355} 11/07/2021 13:56:47 - INFO - __main__ - Step 118980: {'lr': 5.226950195299626e-05, 'samples': 22844160, 'steps': 118979, 'loss/train': 1.1324586868286133} 11/07/2021 13:56:47 - INFO - __main__ - Step 118981: {'lr': 5.22662547105176e-05, 'samples': 22844352, 'steps': 118980, 'loss/train': 1.3778032064437866} 11/07/2021 13:56:47 - INFO - __main__ - Step 118982: {'lr': 5.226300755713473e-05, 'samples': 22844544, 'steps': 118981, 'loss/train': 1.0701466798782349} 11/07/2021 13:56:48 - INFO - __main__ - Step 118983: {'lr': 5.225976049284908e-05, 'samples': 22844736, 'steps': 118982, 'loss/train': 1.2692606449127197} 11/07/2021 13:56:49 - INFO - __main__ - Step 118984: {'lr': 5.225651351766214e-05, 'samples': 22844928, 'steps': 118983, 'loss/train': 1.5746843814849854} 11/07/2021 13:56:49 - INFO - __main__ - Step 118985: {'lr': 5.225326663157534e-05, 'samples': 22845120, 'steps': 118984, 'loss/train': 1.0445255041122437} 11/07/2021 13:56:49 - INFO - __main__ - Step 118986: {'lr': 5.2250019834590184e-05, 'samples': 22845312, 'steps': 118985, 'loss/train': 1.683966875076294} 11/07/2021 13:56:50 - INFO - __main__ - Step 118987: {'lr': 5.224677312670814e-05, 'samples': 22845504, 'steps': 118986, 'loss/train': 1.3528480529785156} 11/07/2021 13:56:50 - INFO - __main__ - Step 118988: {'lr': 5.2243526507930625e-05, 'samples': 22845696, 'steps': 118987, 'loss/train': 1.4664326906204224} 11/07/2021 13:56:51 - INFO - __main__ - Step 118989: {'lr': 5.224027997825911e-05, 'samples': 22845888, 'steps': 118988, 'loss/train': 1.328171730041504} 11/07/2021 13:56:52 - INFO - __main__ - Step 118990: {'lr': 5.22370335376951e-05, 'samples': 22846080, 'steps': 118989, 'loss/train': 1.5719889402389526} 11/07/2021 13:56:52 - INFO - __main__ - Step 118991: {'lr': 5.223378718624008e-05, 'samples': 22846272, 'steps': 118990, 'loss/train': 1.493606686592102} 11/07/2021 13:56:52 - INFO - __main__ - Step 118992: {'lr': 5.2230540923895424e-05, 'samples': 22846464, 'steps': 118991, 'loss/train': 0.683285117149353} 11/07/2021 13:56:53 - INFO - __main__ - Step 118993: {'lr': 5.2227294750662625e-05, 'samples': 22846656, 'steps': 118992, 'loss/train': 1.5691072940826416} 11/07/2021 13:56:53 - INFO - __main__ - Step 118994: {'lr': 5.222404866654312e-05, 'samples': 22846848, 'steps': 118993, 'loss/train': 1.3921923637390137} 11/07/2021 13:56:54 - INFO - __main__ - Step 118995: {'lr': 5.2220802671538446e-05, 'samples': 22847040, 'steps': 118994, 'loss/train': 1.4082534313201904} 11/07/2021 13:56:55 - INFO - __main__ - Step 118996: {'lr': 5.2217556765650014e-05, 'samples': 22847232, 'steps': 118995, 'loss/train': 3.3705344200134277} 11/07/2021 13:56:55 - INFO - __main__ - Step 118997: {'lr': 5.2214310948879294e-05, 'samples': 22847424, 'steps': 118996, 'loss/train': 1.5902060270309448} 11/07/2021 13:56:55 - INFO - __main__ - Step 118998: {'lr': 5.2211065221227763e-05, 'samples': 22847616, 'steps': 118997, 'loss/train': 1.688537836074829} 11/07/2021 13:56:56 - INFO - __main__ - Step 118999: {'lr': 5.220781958269688e-05, 'samples': 22847808, 'steps': 118998, 'loss/train': 1.2281591892242432} 11/07/2021 13:56:56 - INFO - __main__ - Step 119000: {'lr': 5.22045740332881e-05, 'samples': 22848000, 'steps': 118999, 'loss/train': 1.7434080839157104} 11/07/2021 13:56:57 - INFO - __main__ - Step 119001: {'lr': 5.220132857300286e-05, 'samples': 22848192, 'steps': 119000, 'loss/train': 1.770835280418396} 11/07/2021 13:56:57 - INFO - __main__ - Step 119002: {'lr': 5.2198083201842664e-05, 'samples': 22848384, 'steps': 119001, 'loss/train': 1.4096039533615112} 11/07/2021 13:56:58 - INFO - __main__ - Step 119003: {'lr': 5.219483791980897e-05, 'samples': 22848576, 'steps': 119002, 'loss/train': 1.3957436084747314} 11/07/2021 13:56:58 - INFO - __main__ - Step 119004: {'lr': 5.219159272690321e-05, 'samples': 22848768, 'steps': 119003, 'loss/train': 0.9911213517189026} 11/07/2021 13:56:58 - INFO - __main__ - Step 119005: {'lr': 5.218834762312696e-05, 'samples': 22848960, 'steps': 119004, 'loss/train': 1.4844012260437012} 11/07/2021 13:57:00 - INFO - __main__ - Step 119006: {'lr': 5.21851026084815e-05, 'samples': 22849152, 'steps': 119005, 'loss/train': 1.219947338104248} 11/07/2021 13:57:00 - INFO - __main__ - Step 119007: {'lr': 5.21818576829684e-05, 'samples': 22849344, 'steps': 119006, 'loss/train': 1.5508843660354614} 11/07/2021 13:57:00 - INFO - __main__ - Step 119008: {'lr': 5.217861284658909e-05, 'samples': 22849536, 'steps': 119007, 'loss/train': 1.4907610416412354} 11/07/2021 13:57:01 - INFO - __main__ - Step 119009: {'lr': 5.217536809934503e-05, 'samples': 22849728, 'steps': 119008, 'loss/train': 0.6845035552978516} 11/07/2021 13:57:01 - INFO - __main__ - Step 119010: {'lr': 5.21721234412377e-05, 'samples': 22849920, 'steps': 119009, 'loss/train': 1.61370050907135} 11/07/2021 13:57:02 - INFO - __main__ - Step 119011: {'lr': 5.2168878872268594e-05, 'samples': 22850112, 'steps': 119010, 'loss/train': 1.4407105445861816} 11/07/2021 13:57:02 - INFO - __main__ - Step 119012: {'lr': 5.21656343924391e-05, 'samples': 22850304, 'steps': 119011, 'loss/train': 1.5031421184539795} 11/07/2021 13:57:03 - INFO - __main__ - Step 119013: {'lr': 5.216239000175074e-05, 'samples': 22850496, 'steps': 119012, 'loss/train': 1.8224343061447144} 11/07/2021 13:57:03 - INFO - __main__ - Step 119014: {'lr': 5.215914570020494e-05, 'samples': 22850688, 'steps': 119013, 'loss/train': 1.0157337188720703} 11/07/2021 13:57:03 - INFO - __main__ - Step 119015: {'lr': 5.215590148780317e-05, 'samples': 22850880, 'steps': 119014, 'loss/train': 0.8343490958213806} 11/07/2021 13:57:04 - INFO - __main__ - Step 119016: {'lr': 5.2152657364547e-05, 'samples': 22851072, 'steps': 119015, 'loss/train': 1.1309092044830322} 11/07/2021 13:57:05 - INFO - __main__ - Step 119017: {'lr': 5.2149413330437685e-05, 'samples': 22851264, 'steps': 119016, 'loss/train': 0.48513737320899963} 11/07/2021 13:57:05 - INFO - __main__ - Step 119018: {'lr': 5.214616938547681e-05, 'samples': 22851456, 'steps': 119017, 'loss/train': 1.0939011573791504} 11/07/2021 13:57:06 - INFO - __main__ - Step 119019: {'lr': 5.2142925529665816e-05, 'samples': 22851648, 'steps': 119018, 'loss/train': 1.4052449464797974} 11/07/2021 13:57:06 - INFO - __main__ - Step 119020: {'lr': 5.2139681763006154e-05, 'samples': 22851840, 'steps': 119019, 'loss/train': 1.6299422979354858} 11/07/2021 13:57:06 - INFO - __main__ - Step 119021: {'lr': 5.213643808549931e-05, 'samples': 22852032, 'steps': 119020, 'loss/train': 1.4466524124145508} 11/07/2021 13:57:08 - INFO - __main__ - Step 119022: {'lr': 5.213319449714674e-05, 'samples': 22852224, 'steps': 119021, 'loss/train': 1.614367961883545} 11/07/2021 13:57:08 - INFO - __main__ - Step 119023: {'lr': 5.212995099794987e-05, 'samples': 22852416, 'steps': 119022, 'loss/train': 1.765398621559143} 11/07/2021 13:57:08 - INFO - __main__ - Step 119024: {'lr': 5.2126707587910216e-05, 'samples': 22852608, 'steps': 119023, 'loss/train': 1.2483028173446655} 11/07/2021 13:57:09 - INFO - __main__ - Step 119025: {'lr': 5.2123464267029215e-05, 'samples': 22852800, 'steps': 119024, 'loss/train': 1.2682809829711914} 11/07/2021 13:57:09 - INFO - __main__ - Step 119026: {'lr': 5.212022103530834e-05, 'samples': 22852992, 'steps': 119025, 'loss/train': 1.0727108716964722} 11/07/2021 13:57:09 - INFO - __main__ - Step 119027: {'lr': 5.211697789274908e-05, 'samples': 22853184, 'steps': 119026, 'loss/train': 1.0061050653457642} 11/07/2021 13:57:10 - INFO - __main__ - Step 119028: {'lr': 5.2113734839352806e-05, 'samples': 22853376, 'steps': 119027, 'loss/train': 1.3954102993011475} 11/07/2021 13:57:11 - INFO - __main__ - Step 119029: {'lr': 5.211049187512101e-05, 'samples': 22853568, 'steps': 119028, 'loss/train': 1.286084771156311} 11/07/2021 13:57:11 - INFO - __main__ - Step 119030: {'lr': 5.21072490000552e-05, 'samples': 22853760, 'steps': 119029, 'loss/train': 0.9722974896430969} 11/07/2021 13:57:11 - INFO - __main__ - Step 119031: {'lr': 5.210400621415679e-05, 'samples': 22853952, 'steps': 119030, 'loss/train': 1.385973572731018} 11/07/2021 13:57:12 - INFO - __main__ - Step 119032: {'lr': 5.210076351742726e-05, 'samples': 22854144, 'steps': 119031, 'loss/train': 1.5423814058303833} 11/07/2021 13:57:13 - INFO - __main__ - Step 119033: {'lr': 5.2097520909868076e-05, 'samples': 22854336, 'steps': 119032, 'loss/train': 1.3153040409088135} 11/07/2021 13:57:13 - INFO - __main__ - Step 119034: {'lr': 5.2094278391480705e-05, 'samples': 22854528, 'steps': 119033, 'loss/train': 0.3604738414287567} 11/07/2021 13:57:14 - INFO - __main__ - Step 119035: {'lr': 5.209103596226658e-05, 'samples': 22854720, 'steps': 119034, 'loss/train': 1.3133454322814941} 11/07/2021 13:57:14 - INFO - __main__ - Step 119036: {'lr': 5.208779362222721e-05, 'samples': 22854912, 'steps': 119035, 'loss/train': 1.3323235511779785} 11/07/2021 13:57:14 - INFO - __main__ - Step 119037: {'lr': 5.208455137136406e-05, 'samples': 22855104, 'steps': 119036, 'loss/train': 1.4295263290405273} 11/07/2021 13:57:15 - INFO - __main__ - Step 119038: {'lr': 5.208130920967852e-05, 'samples': 22855296, 'steps': 119037, 'loss/train': 1.5544921159744263} 11/07/2021 13:57:16 - INFO - __main__ - Step 119039: {'lr': 5.207806713717206e-05, 'samples': 22855488, 'steps': 119038, 'loss/train': 1.6686829328536987} 11/07/2021 13:57:16 - INFO - __main__ - Step 119040: {'lr': 5.207482515384618e-05, 'samples': 22855680, 'steps': 119039, 'loss/train': 1.0341516733169556} 11/07/2021 13:57:16 - INFO - __main__ - Step 119041: {'lr': 5.2071583259702346e-05, 'samples': 22855872, 'steps': 119040, 'loss/train': 1.4364728927612305} 11/07/2021 13:57:17 - INFO - __main__ - Step 119042: {'lr': 5.206834145474199e-05, 'samples': 22856064, 'steps': 119041, 'loss/train': 0.8889238238334656} 11/07/2021 13:57:17 - INFO - __main__ - Step 119043: {'lr': 5.206509973896656e-05, 'samples': 22856256, 'steps': 119042, 'loss/train': 1.3295469284057617} 11/07/2021 13:57:18 - INFO - __main__ - Step 119044: {'lr': 5.206185811237757e-05, 'samples': 22856448, 'steps': 119043, 'loss/train': 1.1114131212234497} 11/07/2021 13:57:19 - INFO - __main__ - Step 119045: {'lr': 5.205861657497646e-05, 'samples': 22856640, 'steps': 119044, 'loss/train': 1.1943328380584717} 11/07/2021 13:57:19 - INFO - __main__ - Step 119046: {'lr': 5.205537512676467e-05, 'samples': 22856832, 'steps': 119045, 'loss/train': 0.7483692765235901} 11/07/2021 13:57:19 - INFO - __main__ - Step 119047: {'lr': 5.2052133767743677e-05, 'samples': 22857024, 'steps': 119046, 'loss/train': 1.345873475074768} 11/07/2021 13:57:20 - INFO - __main__ - Step 119048: {'lr': 5.2048892497915e-05, 'samples': 22857216, 'steps': 119047, 'loss/train': 1.0682415962219238} 11/07/2021 13:57:21 - INFO - __main__ - Step 119049: {'lr': 5.204565131727995e-05, 'samples': 22857408, 'steps': 119048, 'loss/train': 1.5843123197555542} 11/07/2021 13:57:21 - INFO - __main__ - Step 119050: {'lr': 5.204241022584011e-05, 'samples': 22857600, 'steps': 119049, 'loss/train': 2.1439592838287354} 11/07/2021 13:57:21 - INFO - __main__ - Step 119051: {'lr': 5.203916922359689e-05, 'samples': 22857792, 'steps': 119050, 'loss/train': 0.6513935327529907} 11/07/2021 13:57:22 - INFO - __main__ - Step 119052: {'lr': 5.203592831055176e-05, 'samples': 22857984, 'steps': 119051, 'loss/train': 0.820497989654541} 11/07/2021 13:57:22 - INFO - __main__ - Step 119053: {'lr': 5.20326874867062e-05, 'samples': 22858176, 'steps': 119052, 'loss/train': 1.2002460956573486} 11/07/2021 13:57:23 - INFO - __main__ - Step 119054: {'lr': 5.202944675206164e-05, 'samples': 22858368, 'steps': 119053, 'loss/train': 1.4808361530303955} 11/07/2021 13:57:24 - INFO - __main__ - Step 119055: {'lr': 5.2026206106619564e-05, 'samples': 22858560, 'steps': 119054, 'loss/train': 1.2392233610153198} 11/07/2021 13:57:24 - INFO - __main__ - Step 119056: {'lr': 5.202296555038144e-05, 'samples': 22858752, 'steps': 119055, 'loss/train': 1.2570949792861938} 11/07/2021 13:57:24 - INFO - __main__ - Step 119057: {'lr': 5.201972508334871e-05, 'samples': 22858944, 'steps': 119056, 'loss/train': 3.773599863052368} 11/07/2021 13:57:25 - INFO - __main__ - Step 119058: {'lr': 5.201648470552281e-05, 'samples': 22859136, 'steps': 119057, 'loss/train': 2.058136463165283} 11/07/2021 13:57:26 - INFO - __main__ - Step 119059: {'lr': 5.2013244416905306e-05, 'samples': 22859328, 'steps': 119058, 'loss/train': 0.7879811525344849} 11/07/2021 13:57:26 - INFO - __main__ - Step 119060: {'lr': 5.201000421749752e-05, 'samples': 22859520, 'steps': 119059, 'loss/train': 0.9084044694900513} 11/07/2021 13:57:26 - INFO - __main__ - Step 119061: {'lr': 5.200676410730096e-05, 'samples': 22859712, 'steps': 119060, 'loss/train': 1.236556887626648} 11/07/2021 13:57:27 - INFO - __main__ - Step 119062: {'lr': 5.200352408631712e-05, 'samples': 22859904, 'steps': 119061, 'loss/train': 0.9772292375564575} 11/07/2021 13:57:27 - INFO - __main__ - Step 119063: {'lr': 5.200028415454741e-05, 'samples': 22860096, 'steps': 119062, 'loss/train': 1.238384485244751} 11/07/2021 13:57:27 - INFO - __main__ - Step 119064: {'lr': 5.199704431199334e-05, 'samples': 22860288, 'steps': 119063, 'loss/train': 1.6029984951019287} 11/07/2021 13:57:29 - INFO - __main__ - Step 119065: {'lr': 5.199380455865632e-05, 'samples': 22860480, 'steps': 119064, 'loss/train': 0.9378836750984192} 11/07/2021 13:57:29 - INFO - __main__ - Step 119066: {'lr': 5.199056489453785e-05, 'samples': 22860672, 'steps': 119065, 'loss/train': 0.8115398287773132} 11/07/2021 13:57:29 - INFO - __main__ - Step 119067: {'lr': 5.198732531963937e-05, 'samples': 22860864, 'steps': 119066, 'loss/train': 1.1508049964904785} 11/07/2021 13:57:30 - INFO - __main__ - Step 119068: {'lr': 5.1984085833962356e-05, 'samples': 22861056, 'steps': 119067, 'loss/train': 1.5436419248580933} 11/07/2021 13:57:30 - INFO - __main__ - Step 119069: {'lr': 5.198084643750825e-05, 'samples': 22861248, 'steps': 119068, 'loss/train': 1.4220757484436035} 11/07/2021 13:57:31 - INFO - __main__ - Step 119070: {'lr': 5.197760713027852e-05, 'samples': 22861440, 'steps': 119069, 'loss/train': 1.272832989692688} 11/07/2021 13:57:31 - INFO - __main__ - Step 119071: {'lr': 5.197436791227464e-05, 'samples': 22861632, 'steps': 119070, 'loss/train': 1.2997705936431885} 11/07/2021 13:57:32 - INFO - __main__ - Step 119072: {'lr': 5.197112878349811e-05, 'samples': 22861824, 'steps': 119071, 'loss/train': 1.7270827293395996} 11/07/2021 13:57:32 - INFO - __main__ - Step 119073: {'lr': 5.1967889743950255e-05, 'samples': 22862016, 'steps': 119072, 'loss/train': 1.4221760034561157} 11/07/2021 13:57:32 - INFO - __main__ - Step 119074: {'lr': 5.196465079363263e-05, 'samples': 22862208, 'steps': 119073, 'loss/train': 1.4750633239746094} 11/07/2021 13:57:34 - INFO - __main__ - Step 119075: {'lr': 5.1961411932546667e-05, 'samples': 22862400, 'steps': 119074, 'loss/train': 0.8143059611320496} 11/07/2021 13:57:34 - INFO - __main__ - Step 119076: {'lr': 5.1958173160693845e-05, 'samples': 22862592, 'steps': 119075, 'loss/train': 1.3807971477508545} 11/07/2021 13:57:34 - INFO - __main__ - Step 119077: {'lr': 5.1954934478075586e-05, 'samples': 22862784, 'steps': 119076, 'loss/train': 1.1329188346862793} 11/07/2021 13:57:35 - INFO - __main__ - Step 119078: {'lr': 5.195169588469342e-05, 'samples': 22862976, 'steps': 119077, 'loss/train': 1.2049099206924438} 11/07/2021 13:57:35 - INFO - __main__ - Step 119079: {'lr': 5.194845738054874e-05, 'samples': 22863168, 'steps': 119078, 'loss/train': 0.5011285543441772} 11/07/2021 13:57:36 - INFO - __main__ - Step 119080: {'lr': 5.1945218965643026e-05, 'samples': 22863360, 'steps': 119079, 'loss/train': 1.3873405456542969} 11/07/2021 13:57:36 - INFO - __main__ - Step 119081: {'lr': 5.194198063997774e-05, 'samples': 22863552, 'steps': 119080, 'loss/train': 1.245157241821289} 11/07/2021 13:57:37 - INFO - __main__ - Step 119082: {'lr': 5.1938742403554366e-05, 'samples': 22863744, 'steps': 119081, 'loss/train': 0.6807941198348999} 11/07/2021 13:57:37 - INFO - __main__ - Step 119083: {'lr': 5.1935504256374303e-05, 'samples': 22863936, 'steps': 119082, 'loss/train': 1.1856443881988525} 11/07/2021 13:57:37 - INFO - __main__ - Step 119084: {'lr': 5.1932266198439075e-05, 'samples': 22864128, 'steps': 119083, 'loss/train': 1.7853606939315796} 11/07/2021 13:57:38 - INFO - __main__ - Step 119085: {'lr': 5.192902822975015e-05, 'samples': 22864320, 'steps': 119084, 'loss/train': 1.3337754011154175} 11/07/2021 13:57:39 - INFO - __main__ - Step 119086: {'lr': 5.192579035030892e-05, 'samples': 22864512, 'steps': 119085, 'loss/train': 1.6663446426391602} 11/07/2021 13:57:39 - INFO - __main__ - Step 119087: {'lr': 5.1922552560116825e-05, 'samples': 22864704, 'steps': 119086, 'loss/train': 2.4505715370178223} 11/07/2021 13:57:39 - INFO - __main__ - Step 119088: {'lr': 5.191931485917542e-05, 'samples': 22864896, 'steps': 119087, 'loss/train': 1.169126033782959} 11/07/2021 13:57:40 - INFO - __main__ - Step 119089: {'lr': 5.191607724748609e-05, 'samples': 22865088, 'steps': 119088, 'loss/train': 1.5700428485870361} 11/07/2021 13:57:40 - INFO - __main__ - Step 119090: {'lr': 5.191283972505031e-05, 'samples': 22865280, 'steps': 119089, 'loss/train': 1.509354829788208} 11/07/2021 13:57:41 - INFO - __main__ - Step 119091: {'lr': 5.1909602291869557e-05, 'samples': 22865472, 'steps': 119090, 'loss/train': 1.370011568069458} 11/07/2021 13:57:42 - INFO - __main__ - Step 119092: {'lr': 5.190636494794529e-05, 'samples': 22865664, 'steps': 119091, 'loss/train': 1.8360017538070679} 11/07/2021 13:57:42 - INFO - __main__ - Step 119093: {'lr': 5.190312769327896e-05, 'samples': 22865856, 'steps': 119092, 'loss/train': 1.3835841417312622} 11/07/2021 13:57:42 - INFO - __main__ - Step 119094: {'lr': 5.1899890527872004e-05, 'samples': 22866048, 'steps': 119093, 'loss/train': 1.6967259645462036} 11/07/2021 13:57:43 - INFO - __main__ - Step 119095: {'lr': 5.1896653451725895e-05, 'samples': 22866240, 'steps': 119094, 'loss/train': 1.253298044204712} 11/07/2021 13:57:44 - INFO - __main__ - Step 119096: {'lr': 5.189341646484211e-05, 'samples': 22866432, 'steps': 119095, 'loss/train': 1.526054859161377} 11/07/2021 13:57:44 - INFO - __main__ - Step 119097: {'lr': 5.1890179567222114e-05, 'samples': 22866624, 'steps': 119096, 'loss/train': 1.4388315677642822} 11/07/2021 13:57:44 - INFO - __main__ - Step 119098: {'lr': 5.188694275886732e-05, 'samples': 22866816, 'steps': 119097, 'loss/train': 1.1373462677001953} 11/07/2021 13:57:45 - INFO - __main__ - Step 119099: {'lr': 5.188370603977929e-05, 'samples': 22867008, 'steps': 119098, 'loss/train': 0.6965236067771912} 11/07/2021 13:57:45 - INFO - __main__ - Step 119100: {'lr': 5.188046940995933e-05, 'samples': 22867200, 'steps': 119099, 'loss/train': 1.2121151685714722} 11/07/2021 13:57:46 - INFO - __main__ - Step 119101: {'lr': 5.187723286940899e-05, 'samples': 22867392, 'steps': 119100, 'loss/train': 1.4215805530548096} 11/07/2021 13:57:46 - INFO - __main__ - Step 119102: {'lr': 5.1873996418129704e-05, 'samples': 22867584, 'steps': 119101, 'loss/train': 0.9646036028862} 11/07/2021 13:57:47 - INFO - __main__ - Step 119103: {'lr': 5.187076005612293e-05, 'samples': 22867776, 'steps': 119102, 'loss/train': 1.1463905572891235} 11/07/2021 13:57:47 - INFO - __main__ - Step 119104: {'lr': 5.186752378339013e-05, 'samples': 22867968, 'steps': 119103, 'loss/train': 1.1614874601364136} 11/07/2021 13:57:47 - INFO - __main__ - Step 119105: {'lr': 5.186428759993278e-05, 'samples': 22868160, 'steps': 119104, 'loss/train': 1.350565791130066} 11/07/2021 13:57:48 - INFO - __main__ - Step 119106: {'lr': 5.1861051505752324e-05, 'samples': 22868352, 'steps': 119105, 'loss/train': 1.28490149974823} 11/07/2021 13:57:49 - INFO - __main__ - Step 119107: {'lr': 5.185781550085023e-05, 'samples': 22868544, 'steps': 119106, 'loss/train': 1.2605185508728027} 11/07/2021 13:57:49 - INFO - __main__ - Step 119108: {'lr': 5.185457958522791e-05, 'samples': 22868736, 'steps': 119107, 'loss/train': 1.584457516670227} 11/07/2021 13:57:49 - INFO - __main__ - Step 119109: {'lr': 5.185134375888689e-05, 'samples': 22868928, 'steps': 119108, 'loss/train': 1.2053979635238647} 11/07/2021 13:57:50 - INFO - __main__ - Step 119110: {'lr': 5.18481080218286e-05, 'samples': 22869120, 'steps': 119109, 'loss/train': 1.623332142829895} 11/07/2021 13:57:51 - INFO - __main__ - Step 119111: {'lr': 5.184487237405447e-05, 'samples': 22869312, 'steps': 119110, 'loss/train': 1.6973587274551392} 11/07/2021 13:57:51 - INFO - __main__ - Step 119112: {'lr': 5.184163681556606e-05, 'samples': 22869504, 'steps': 119111, 'loss/train': 1.3470638990402222} 11/07/2021 13:57:51 - INFO - __main__ - Step 119113: {'lr': 5.183840134636469e-05, 'samples': 22869696, 'steps': 119112, 'loss/train': 1.1447144746780396} 11/07/2021 13:57:52 - INFO - __main__ - Step 119114: {'lr': 5.183516596645188e-05, 'samples': 22869888, 'steps': 119113, 'loss/train': 1.0964888334274292} 11/07/2021 13:57:52 - INFO - __main__ - Step 119115: {'lr': 5.1831930675829086e-05, 'samples': 22870080, 'steps': 119114, 'loss/train': 1.1989244222640991} 11/07/2021 13:57:53 - INFO - __main__ - Step 119116: {'lr': 5.1828695474497754e-05, 'samples': 22870272, 'steps': 119115, 'loss/train': 1.3887113332748413} 11/07/2021 13:57:54 - INFO - __main__ - Step 119117: {'lr': 5.182546036245936e-05, 'samples': 22870464, 'steps': 119116, 'loss/train': 1.1683152914047241} 11/07/2021 13:57:54 - INFO - __main__ - Step 119118: {'lr': 5.1822225339715366e-05, 'samples': 22870656, 'steps': 119117, 'loss/train': 1.2848970890045166} 11/07/2021 13:57:54 - INFO - __main__ - Step 119119: {'lr': 5.181899040626722e-05, 'samples': 22870848, 'steps': 119118, 'loss/train': 1.5225017070770264} 11/07/2021 13:57:55 - INFO - __main__ - Step 119120: {'lr': 5.1815755562116376e-05, 'samples': 22871040, 'steps': 119119, 'loss/train': 1.4805073738098145} 11/07/2021 13:57:55 - INFO - __main__ - Step 119121: {'lr': 5.181252080726428e-05, 'samples': 22871232, 'steps': 119120, 'loss/train': 0.2891910672187805} 11/07/2021 13:57:56 - INFO - __main__ - Step 119122: {'lr': 5.18092861417124e-05, 'samples': 22871424, 'steps': 119121, 'loss/train': 1.2205126285552979} 11/07/2021 13:57:56 - INFO - __main__ - Step 119123: {'lr': 5.1806051565462226e-05, 'samples': 22871616, 'steps': 119122, 'loss/train': 0.9563359618186951} 11/07/2021 13:57:57 - INFO - __main__ - Step 119124: {'lr': 5.180281707851517e-05, 'samples': 22871808, 'steps': 119123, 'loss/train': 1.0821537971496582} 11/07/2021 13:57:57 - INFO - __main__ - Step 119125: {'lr': 5.1799582680872705e-05, 'samples': 22872000, 'steps': 119124, 'loss/train': 1.3128539323806763} 11/07/2021 13:57:57 - INFO - __main__ - Step 119126: {'lr': 5.1796348372536354e-05, 'samples': 22872192, 'steps': 119125, 'loss/train': 1.2610918283462524} 11/07/2021 13:57:59 - INFO - __main__ - Step 119127: {'lr': 5.179311415350746e-05, 'samples': 22872384, 'steps': 119126, 'loss/train': 1.3935565948486328} 11/07/2021 13:57:59 - INFO - __main__ - Step 119128: {'lr': 5.178988002378751e-05, 'samples': 22872576, 'steps': 119127, 'loss/train': 1.0765151977539062} 11/07/2021 13:57:59 - INFO - __main__ - Step 119129: {'lr': 5.1786645983377984e-05, 'samples': 22872768, 'steps': 119128, 'loss/train': 0.8628991842269897} 11/07/2021 13:58:00 - INFO - __main__ - Step 119130: {'lr': 5.178341203228035e-05, 'samples': 22872960, 'steps': 119129, 'loss/train': 1.2330325841903687} 11/07/2021 13:58:00 - INFO - __main__ - Step 119131: {'lr': 5.1780178170496046e-05, 'samples': 22873152, 'steps': 119130, 'loss/train': 1.3807073831558228} 11/07/2021 13:58:01 - INFO - __main__ - Step 119132: {'lr': 5.1776944398026524e-05, 'samples': 22873344, 'steps': 119131, 'loss/train': 0.5038021802902222} 11/07/2021 13:58:01 - INFO - __main__ - Step 119133: {'lr': 5.177371071487327e-05, 'samples': 22873536, 'steps': 119132, 'loss/train': 1.194109559059143} 11/07/2021 13:58:02 - INFO - __main__ - Step 119134: {'lr': 5.1770477121037693e-05, 'samples': 22873728, 'steps': 119133, 'loss/train': 1.6009187698364258} 11/07/2021 13:58:02 - INFO - __main__ - Step 119135: {'lr': 5.1767243616521325e-05, 'samples': 22873920, 'steps': 119134, 'loss/train': 1.0589059591293335} 11/07/2021 13:58:03 - INFO - __main__ - Step 119136: {'lr': 5.176401020132554e-05, 'samples': 22874112, 'steps': 119135, 'loss/train': 1.104013442993164} 11/07/2021 13:58:04 - INFO - __main__ - Step 119137: {'lr': 5.176077687545186e-05, 'samples': 22874304, 'steps': 119136, 'loss/train': 1.604777455329895} 11/07/2021 13:58:05 - INFO - __main__ - Step 119138: {'lr': 5.17575436389017e-05, 'samples': 22874496, 'steps': 119137, 'loss/train': 1.2029865980148315} 11/07/2021 13:58:05 - INFO - __main__ - Step 119139: {'lr': 5.1754310491676586e-05, 'samples': 22874688, 'steps': 119138, 'loss/train': 0.7853790521621704} 11/07/2021 13:58:05 - INFO - __main__ - Step 119140: {'lr': 5.175107743377788e-05, 'samples': 22874880, 'steps': 119139, 'loss/train': 1.228301763534546} 11/07/2021 13:58:06 - INFO - __main__ - Step 119141: {'lr': 5.1747844465207056e-05, 'samples': 22875072, 'steps': 119140, 'loss/train': 1.1548489332199097} 11/07/2021 13:58:06 - INFO - __main__ - Step 119142: {'lr': 5.1744611585965605e-05, 'samples': 22875264, 'steps': 119141, 'loss/train': 1.317503809928894} 11/07/2021 13:58:06 - INFO - __main__ - Step 119143: {'lr': 5.174137879605498e-05, 'samples': 22875456, 'steps': 119142, 'loss/train': 0.05993594601750374} 11/07/2021 13:58:08 - INFO - __main__ - Step 119144: {'lr': 5.173814609547661e-05, 'samples': 22875648, 'steps': 119143, 'loss/train': 1.0653749704360962} 11/07/2021 13:58:08 - INFO - __main__ - Step 119145: {'lr': 5.173491348423201e-05, 'samples': 22875840, 'steps': 119144, 'loss/train': 1.4838248491287231} 11/07/2021 13:58:08 - INFO - __main__ - Step 119146: {'lr': 5.173168096232256e-05, 'samples': 22876032, 'steps': 119145, 'loss/train': 1.504299521446228} 11/07/2021 13:58:09 - INFO - __main__ - Step 119147: {'lr': 5.172844852974978e-05, 'samples': 22876224, 'steps': 119146, 'loss/train': 1.7049434185028076} 11/07/2021 13:58:09 - INFO - __main__ - Step 119148: {'lr': 5.17252161865151e-05, 'samples': 22876416, 'steps': 119147, 'loss/train': 1.4855730533599854} 11/07/2021 13:58:09 - INFO - __main__ - Step 119149: {'lr': 5.172198393261995e-05, 'samples': 22876608, 'steps': 119148, 'loss/train': 1.7362302541732788} 11/07/2021 13:58:10 - INFO - __main__ - Step 119150: {'lr': 5.1718751768065844e-05, 'samples': 22876800, 'steps': 119149, 'loss/train': 0.8938423991203308} 11/07/2021 13:58:11 - INFO - __main__ - Step 119151: {'lr': 5.171551969285421e-05, 'samples': 22876992, 'steps': 119150, 'loss/train': 1.4875503778457642} 11/07/2021 13:58:11 - INFO - __main__ - Step 119152: {'lr': 5.1712287706986547e-05, 'samples': 22877184, 'steps': 119151, 'loss/train': 1.017643928527832} 11/07/2021 13:58:11 - INFO - __main__ - Step 119153: {'lr': 5.1709055810464203e-05, 'samples': 22877376, 'steps': 119152, 'loss/train': 1.248706579208374} 11/07/2021 13:58:12 - INFO - __main__ - Step 119154: {'lr': 5.170582400328872e-05, 'samples': 22877568, 'steps': 119153, 'loss/train': 1.271234154701233} 11/07/2021 13:58:13 - INFO - __main__ - Step 119155: {'lr': 5.170259228546151e-05, 'samples': 22877760, 'steps': 119154, 'loss/train': 1.2534197568893433} 11/07/2021 13:58:13 - INFO - __main__ - Step 119156: {'lr': 5.1699360656984076e-05, 'samples': 22877952, 'steps': 119155, 'loss/train': 1.7253799438476562} 11/07/2021 13:58:14 - INFO - __main__ - Step 119157: {'lr': 5.169612911785782e-05, 'samples': 22878144, 'steps': 119156, 'loss/train': 1.1757683753967285} 11/07/2021 13:58:14 - INFO - __main__ - Step 119158: {'lr': 5.1692897668084247e-05, 'samples': 22878336, 'steps': 119157, 'loss/train': 1.1068044900894165} 11/07/2021 13:58:14 - INFO - __main__ - Step 119159: {'lr': 5.1689666307664804e-05, 'samples': 22878528, 'steps': 119158, 'loss/train': 0.8100593090057373} 11/07/2021 13:58:15 - INFO - __main__ - Step 119160: {'lr': 5.168643503660092e-05, 'samples': 22878720, 'steps': 119159, 'loss/train': 1.36505126953125} 11/07/2021 13:58:16 - INFO - __main__ - Step 119161: {'lr': 5.1683203854894086e-05, 'samples': 22878912, 'steps': 119160, 'loss/train': 1.1094635725021362} 11/07/2021 13:58:16 - INFO - __main__ - Step 119162: {'lr': 5.167997276254571e-05, 'samples': 22879104, 'steps': 119161, 'loss/train': 1.289198398590088} 11/07/2021 13:58:16 - INFO - __main__ - Step 119163: {'lr': 5.1676741759557305e-05, 'samples': 22879296, 'steps': 119162, 'loss/train': 1.2927777767181396} 11/07/2021 13:58:17 - INFO - __main__ - Step 119164: {'lr': 5.167351084593028e-05, 'samples': 22879488, 'steps': 119163, 'loss/train': 1.5350695848464966} 11/07/2021 13:58:18 - INFO - __main__ - Step 119165: {'lr': 5.1670280021666125e-05, 'samples': 22879680, 'steps': 119164, 'loss/train': 1.8595761060714722} 11/07/2021 13:58:18 - INFO - __main__ - Step 119166: {'lr': 5.166704928676636e-05, 'samples': 22879872, 'steps': 119165, 'loss/train': 1.483228087425232} 11/07/2021 13:58:18 - INFO - __main__ - Step 119167: {'lr': 5.166381864123226e-05, 'samples': 22880064, 'steps': 119166, 'loss/train': 1.2292312383651733} 11/07/2021 13:58:19 - INFO - __main__ - Step 119168: {'lr': 5.166058808506541e-05, 'samples': 22880256, 'steps': 119167, 'loss/train': 0.9421584010124207} 11/07/2021 13:58:19 - INFO - __main__ - Step 119169: {'lr': 5.165735761826723e-05, 'samples': 22880448, 'steps': 119168, 'loss/train': 1.3088756799697876} 11/07/2021 13:58:20 - INFO - __main__ - Step 119170: {'lr': 5.165412724083918e-05, 'samples': 22880640, 'steps': 119169, 'loss/train': 0.8511092066764832} 11/07/2021 13:58:21 - INFO - __main__ - Step 119171: {'lr': 5.165089695278272e-05, 'samples': 22880832, 'steps': 119170, 'loss/train': 1.083274245262146} 11/07/2021 13:58:21 - INFO - __main__ - Step 119172: {'lr': 5.164766675409932e-05, 'samples': 22881024, 'steps': 119171, 'loss/train': 1.1529531478881836} 11/07/2021 13:58:21 - INFO - __main__ - Step 119173: {'lr': 5.16444366447904e-05, 'samples': 22881216, 'steps': 119172, 'loss/train': 1.635331392288208} 11/07/2021 13:58:22 - INFO - __main__ - Step 119174: {'lr': 5.1641206624857465e-05, 'samples': 22881408, 'steps': 119173, 'loss/train': 1.6855441331863403} 11/07/2021 13:58:22 - INFO - __main__ - Step 119175: {'lr': 5.1637976694301926e-05, 'samples': 22881600, 'steps': 119174, 'loss/train': 1.1363465785980225} 11/07/2021 13:58:23 - INFO - __main__ - Step 119176: {'lr': 5.163474685312525e-05, 'samples': 22881792, 'steps': 119175, 'loss/train': 1.2580641508102417} 11/07/2021 13:58:23 - INFO - __main__ - Step 119177: {'lr': 5.163151710132888e-05, 'samples': 22881984, 'steps': 119176, 'loss/train': 1.441177248954773} 11/07/2021 13:58:24 - INFO - __main__ - Step 119178: {'lr': 5.162828743891432e-05, 'samples': 22882176, 'steps': 119177, 'loss/train': 1.3291798830032349} 11/07/2021 13:58:24 - INFO - __main__ - Step 119179: {'lr': 5.162505786588303e-05, 'samples': 22882368, 'steps': 119178, 'loss/train': 1.2934081554412842} 11/07/2021 13:58:24 - INFO - __main__ - Step 119180: {'lr': 5.162182838223639e-05, 'samples': 22882560, 'steps': 119179, 'loss/train': 1.0220894813537598} 11/07/2021 13:58:26 - INFO - __main__ - Step 119181: {'lr': 5.161859898797586e-05, 'samples': 22882752, 'steps': 119180, 'loss/train': 1.1140973567962646} 11/07/2021 13:58:26 - INFO - __main__ - Step 119182: {'lr': 5.161536968310296e-05, 'samples': 22882944, 'steps': 119181, 'loss/train': 1.2621052265167236} 11/07/2021 13:58:26 - INFO - __main__ - Step 119183: {'lr': 5.161214046761908e-05, 'samples': 22883136, 'steps': 119182, 'loss/train': 1.3054314851760864} 11/07/2021 13:58:27 - INFO - __main__ - Step 119184: {'lr': 5.1608911341525734e-05, 'samples': 22883328, 'steps': 119183, 'loss/train': 1.5169206857681274} 11/07/2021 13:58:27 - INFO - __main__ - Step 119185: {'lr': 5.1605682304824346e-05, 'samples': 22883520, 'steps': 119184, 'loss/train': 1.1503610610961914} 11/07/2021 13:58:28 - INFO - __main__ - Step 119186: {'lr': 5.160245335751637e-05, 'samples': 22883712, 'steps': 119185, 'loss/train': 0.20669709146022797} 11/07/2021 13:58:29 - INFO - __main__ - Step 119187: {'lr': 5.159922449960327e-05, 'samples': 22883904, 'steps': 119186, 'loss/train': 1.4541616439819336} 11/07/2021 13:58:29 - INFO - __main__ - Step 119188: {'lr': 5.15959957310865e-05, 'samples': 22884096, 'steps': 119187, 'loss/train': 1.7429324388504028} 11/07/2021 13:58:29 - INFO - __main__ - Step 119189: {'lr': 5.1592767051967526e-05, 'samples': 22884288, 'steps': 119188, 'loss/train': 1.5426383018493652} 11/07/2021 13:58:30 - INFO - __main__ - Step 119190: {'lr': 5.158953846224776e-05, 'samples': 22884480, 'steps': 119189, 'loss/train': 0.6167244911193848} 11/07/2021 13:58:31 - INFO - __main__ - Step 119191: {'lr': 5.15863099619287e-05, 'samples': 22884672, 'steps': 119190, 'loss/train': 0.7263466119766235} 11/07/2021 13:58:31 - INFO - __main__ - Step 119192: {'lr': 5.15830815510118e-05, 'samples': 22884864, 'steps': 119191, 'loss/train': 1.3512276411056519} 11/07/2021 13:58:31 - INFO - __main__ - Step 119193: {'lr': 5.157985322949857e-05, 'samples': 22885056, 'steps': 119192, 'loss/train': 1.355177402496338} 11/07/2021 13:58:32 - INFO - __main__ - Step 119194: {'lr': 5.15766249973903e-05, 'samples': 22885248, 'steps': 119193, 'loss/train': 1.2857922315597534} 11/07/2021 13:58:32 - INFO - __main__ - Step 119195: {'lr': 5.1573396854688566e-05, 'samples': 22885440, 'steps': 119194, 'loss/train': 1.5321282148361206} 11/07/2021 13:58:33 - INFO - __main__ - Step 119196: {'lr': 5.157016880139479e-05, 'samples': 22885632, 'steps': 119195, 'loss/train': 0.8633344173431396} 11/07/2021 13:58:34 - INFO - __main__ - Step 119197: {'lr': 5.156694083751043e-05, 'samples': 22885824, 'steps': 119196, 'loss/train': 1.091121792793274} 11/07/2021 13:58:34 - INFO - __main__ - Step 119198: {'lr': 5.1563712963036944e-05, 'samples': 22886016, 'steps': 119197, 'loss/train': 1.4326505661010742} 11/07/2021 13:58:34 - INFO - __main__ - Step 119199: {'lr': 5.156048517797579e-05, 'samples': 22886208, 'steps': 119198, 'loss/train': 1.46879243850708} 11/07/2021 13:58:35 - INFO - __main__ - Step 119200: {'lr': 5.155725748232842e-05, 'samples': 22886400, 'steps': 119199, 'loss/train': 0.9818353652954102} 11/07/2021 13:58:36 - INFO - __main__ - Step 119201: {'lr': 5.155402987609628e-05, 'samples': 22886592, 'steps': 119200, 'loss/train': 1.3552542924880981} 11/07/2021 13:58:36 - INFO - __main__ - Step 119202: {'lr': 5.155080235928086e-05, 'samples': 22886784, 'steps': 119201, 'loss/train': 1.2081055641174316} 11/07/2021 13:58:36 - INFO - __main__ - Step 119203: {'lr': 5.1547574931883554e-05, 'samples': 22886976, 'steps': 119202, 'loss/train': 1.3304381370544434} 11/07/2021 13:58:37 - INFO - __main__ - Step 119204: {'lr': 5.154434759390586e-05, 'samples': 22887168, 'steps': 119203, 'loss/train': 0.9270598888397217} 11/07/2021 13:58:37 - INFO - __main__ - Step 119205: {'lr': 5.154112034534922e-05, 'samples': 22887360, 'steps': 119204, 'loss/train': 1.174607753753662} 11/07/2021 13:58:37 - INFO - __main__ - Step 119206: {'lr': 5.153789318621516e-05, 'samples': 22887552, 'steps': 119205, 'loss/train': 1.1747015714645386} 11/07/2021 13:58:38 - INFO - __main__ - Step 119207: {'lr': 5.153466611650498e-05, 'samples': 22887744, 'steps': 119206, 'loss/train': 1.4114304780960083} 11/07/2021 13:58:39 - INFO - __main__ - Step 119208: {'lr': 5.1531439136220244e-05, 'samples': 22887936, 'steps': 119207, 'loss/train': 1.1687158346176147} 11/07/2021 13:58:39 - INFO - __main__ - Step 119209: {'lr': 5.1528212245362363e-05, 'samples': 22888128, 'steps': 119208, 'loss/train': 1.282996416091919} 11/07/2021 13:58:39 - INFO - __main__ - Step 119210: {'lr': 5.1524985443932805e-05, 'samples': 22888320, 'steps': 119209, 'loss/train': 0.8289745450019836} 11/07/2021 13:58:40 - INFO - __main__ - Step 119211: {'lr': 5.1521758731933045e-05, 'samples': 22888512, 'steps': 119210, 'loss/train': 1.359241008758545} 11/07/2021 13:58:41 - INFO - __main__ - Step 119212: {'lr': 5.1518532109364495e-05, 'samples': 22888704, 'steps': 119211, 'loss/train': 1.5649338960647583} 11/07/2021 13:58:41 - INFO - __main__ - Step 119213: {'lr': 5.151530557622863e-05, 'samples': 22888896, 'steps': 119212, 'loss/train': 0.6241868138313293} 11/07/2021 13:58:42 - INFO - __main__ - Step 119214: {'lr': 5.1512079132526924e-05, 'samples': 22889088, 'steps': 119213, 'loss/train': 1.057706594467163} 11/07/2021 13:58:42 - INFO - __main__ - Step 119215: {'lr': 5.150885277826078e-05, 'samples': 22889280, 'steps': 119214, 'loss/train': 1.1088569164276123} 11/07/2021 13:58:42 - INFO - __main__ - Step 119216: {'lr': 5.150562651343171e-05, 'samples': 22889472, 'steps': 119215, 'loss/train': 1.5546077489852905} 11/07/2021 13:58:43 - INFO - __main__ - Step 119217: {'lr': 5.1502400338041156e-05, 'samples': 22889664, 'steps': 119216, 'loss/train': 1.1434296369552612} 11/07/2021 13:58:44 - INFO - __main__ - Step 119218: {'lr': 5.149917425209052e-05, 'samples': 22889856, 'steps': 119217, 'loss/train': 0.7048628330230713} 11/07/2021 13:58:44 - INFO - __main__ - Step 119219: {'lr': 5.1495948255581323e-05, 'samples': 22890048, 'steps': 119218, 'loss/train': 1.1730252504348755} 11/07/2021 13:58:44 - INFO - __main__ - Step 119220: {'lr': 5.149272234851504e-05, 'samples': 22890240, 'steps': 119219, 'loss/train': 1.1770007610321045} 11/07/2021 13:58:45 - INFO - __main__ - Step 119221: {'lr': 5.148949653089302e-05, 'samples': 22890432, 'steps': 119220, 'loss/train': 1.661521315574646} 11/07/2021 13:58:46 - INFO - __main__ - Step 119222: {'lr': 5.148627080271675e-05, 'samples': 22890624, 'steps': 119221, 'loss/train': 1.2922298908233643} 11/07/2021 13:58:46 - INFO - __main__ - Step 119223: {'lr': 5.148304516398772e-05, 'samples': 22890816, 'steps': 119222, 'loss/train': 1.5000327825546265} 11/07/2021 13:58:46 - INFO - __main__ - Step 119224: {'lr': 5.1479819614707355e-05, 'samples': 22891008, 'steps': 119223, 'loss/train': 1.0663388967514038} 11/07/2021 13:58:47 - INFO - __main__ - Step 119225: {'lr': 5.1476594154877126e-05, 'samples': 22891200, 'steps': 119224, 'loss/train': 1.3380166292190552} 11/07/2021 13:58:47 - INFO - __main__ - Step 119226: {'lr': 5.1473368784498483e-05, 'samples': 22891392, 'steps': 119225, 'loss/train': 1.1790945529937744} 11/07/2021 13:58:48 - INFO - __main__ - Step 119227: {'lr': 5.1470143503572876e-05, 'samples': 22891584, 'steps': 119226, 'loss/train': 1.4109759330749512} 11/07/2021 13:58:49 - INFO - __main__ - Step 119228: {'lr': 5.146691831210176e-05, 'samples': 22891776, 'steps': 119227, 'loss/train': 1.2546368837356567} 11/07/2021 13:58:49 - INFO - __main__ - Step 119229: {'lr': 5.1463693210086594e-05, 'samples': 22891968, 'steps': 119228, 'loss/train': 1.5217044353485107} 11/07/2021 13:58:49 - INFO - __main__ - Step 119230: {'lr': 5.146046819752881e-05, 'samples': 22892160, 'steps': 119229, 'loss/train': 1.0846054553985596} 11/07/2021 13:58:50 - INFO - __main__ - Step 119231: {'lr': 5.145724327442988e-05, 'samples': 22892352, 'steps': 119230, 'loss/train': 1.6928101778030396} 11/07/2021 13:58:51 - INFO - __main__ - Step 119232: {'lr': 5.145401844079126e-05, 'samples': 22892544, 'steps': 119231, 'loss/train': 1.2218143939971924} 11/07/2021 13:58:51 - INFO - __main__ - Step 119233: {'lr': 5.145079369661443e-05, 'samples': 22892736, 'steps': 119232, 'loss/train': 1.431857943534851} 11/07/2021 13:58:51 - INFO - __main__ - Step 119234: {'lr': 5.144756904190076e-05, 'samples': 22892928, 'steps': 119233, 'loss/train': 1.1925159692764282} 11/07/2021 13:58:52 - INFO - __main__ - Step 119235: {'lr': 5.144434447665178e-05, 'samples': 22893120, 'steps': 119234, 'loss/train': 1.310896635055542} 11/07/2021 13:58:52 - INFO - __main__ - Step 119236: {'lr': 5.1441120000868865e-05, 'samples': 22893312, 'steps': 119235, 'loss/train': 1.2579351663589478} 11/07/2021 13:58:52 - INFO - __main__ - Step 119237: {'lr': 5.143789561455356e-05, 'samples': 22893504, 'steps': 119236, 'loss/train': 1.2930032014846802} 11/07/2021 13:58:53 - INFO - __main__ - Step 119238: {'lr': 5.1434671317707264e-05, 'samples': 22893696, 'steps': 119237, 'loss/train': 0.6673069000244141} 11/07/2021 13:58:54 - INFO - __main__ - Step 119239: {'lr': 5.1431447110331434e-05, 'samples': 22893888, 'steps': 119238, 'loss/train': 1.211626648902893} 11/07/2021 13:58:54 - INFO - __main__ - Step 119240: {'lr': 5.1428222992427525e-05, 'samples': 22894080, 'steps': 119239, 'loss/train': 1.3119665384292603} 11/07/2021 13:58:55 - INFO - __main__ - Step 119241: {'lr': 5.1424998963996994e-05, 'samples': 22894272, 'steps': 119240, 'loss/train': 1.2575743198394775} 11/07/2021 13:58:55 - INFO - __main__ - Step 119242: {'lr': 5.1421775025041304e-05, 'samples': 22894464, 'steps': 119241, 'loss/train': 1.9355964660644531} 11/07/2021 13:58:56 - INFO - __main__ - Step 119243: {'lr': 5.1418551175561905e-05, 'samples': 22894656, 'steps': 119242, 'loss/train': 1.4863629341125488} 11/07/2021 13:58:56 - INFO - __main__ - Step 119244: {'lr': 5.1415327415560235e-05, 'samples': 22894848, 'steps': 119243, 'loss/train': 1.6395723819732666} 11/07/2021 13:58:57 - INFO - __main__ - Step 119245: {'lr': 5.141210374503774e-05, 'samples': 22895040, 'steps': 119244, 'loss/train': 1.2486684322357178} 11/07/2021 13:58:57 - INFO - __main__ - Step 119246: {'lr': 5.140888016399592e-05, 'samples': 22895232, 'steps': 119245, 'loss/train': 0.38682177662849426} 11/07/2021 13:58:58 - INFO - __main__ - Step 119247: {'lr': 5.140565667243624e-05, 'samples': 22895424, 'steps': 119246, 'loss/train': 1.5333151817321777} 11/07/2021 13:58:59 - INFO - __main__ - Step 119248: {'lr': 5.140243327036004e-05, 'samples': 22895616, 'steps': 119247, 'loss/train': 1.470802664756775} 11/07/2021 13:58:59 - INFO - __main__ - Step 119249: {'lr': 5.1399209957768836e-05, 'samples': 22895808, 'steps': 119248, 'loss/train': 1.1151028871536255} 11/07/2021 13:58:59 - INFO - __main__ - Step 119250: {'lr': 5.139598673466409e-05, 'samples': 22896000, 'steps': 119249, 'loss/train': 1.3059922456741333} 11/07/2021 13:59:00 - INFO - __main__ - Step 119251: {'lr': 5.1392763601047247e-05, 'samples': 22896192, 'steps': 119250, 'loss/train': 1.3660321235656738} 11/07/2021 13:59:00 - INFO - __main__ - Step 119252: {'lr': 5.138954055691975e-05, 'samples': 22896384, 'steps': 119251, 'loss/train': 1.6728084087371826} 11/07/2021 13:59:02 - INFO - __main__ - Step 119253: {'lr': 5.1386317602283075e-05, 'samples': 22896576, 'steps': 119252, 'loss/train': 1.7018128633499146} 11/07/2021 13:59:02 - INFO - __main__ - Step 119254: {'lr': 5.1383094737138645e-05, 'samples': 22896768, 'steps': 119253, 'loss/train': 1.3665646314620972} 11/07/2021 13:59:02 - INFO - __main__ - Step 119255: {'lr': 5.137987196148794e-05, 'samples': 22896960, 'steps': 119254, 'loss/train': 1.7101017236709595} 11/07/2021 13:59:03 - INFO - __main__ - Step 119256: {'lr': 5.1376649275332396e-05, 'samples': 22897152, 'steps': 119255, 'loss/train': 1.2493149042129517} 11/07/2021 13:59:03 - INFO - __main__ - Step 119257: {'lr': 5.137342667867345e-05, 'samples': 22897344, 'steps': 119256, 'loss/train': 1.0657994747161865} 11/07/2021 13:59:03 - INFO - __main__ - Step 119258: {'lr': 5.1370204171512614e-05, 'samples': 22897536, 'steps': 119257, 'loss/train': 3.820547580718994} 11/07/2021 13:59:04 - INFO - __main__ - Step 119259: {'lr': 5.1366981753851265e-05, 'samples': 22897728, 'steps': 119258, 'loss/train': 0.5804303884506226} 11/07/2021 13:59:05 - INFO - __main__ - Step 119260: {'lr': 5.136375942569096e-05, 'samples': 22897920, 'steps': 119259, 'loss/train': 1.6091768741607666} 11/07/2021 13:59:06 - INFO - __main__ - Step 119261: {'lr': 5.136053718703304e-05, 'samples': 22898112, 'steps': 119260, 'loss/train': 1.145674705505371} 11/07/2021 13:59:06 - INFO - __main__ - Step 119262: {'lr': 5.1357315037878966e-05, 'samples': 22898304, 'steps': 119261, 'loss/train': 1.2601981163024902} 11/07/2021 13:59:06 - INFO - __main__ - Step 119263: {'lr': 5.135409297823024e-05, 'samples': 22898496, 'steps': 119262, 'loss/train': 1.551633358001709} 11/07/2021 13:59:07 - INFO - __main__ - Step 119264: {'lr': 5.1350871008088274e-05, 'samples': 22898688, 'steps': 119263, 'loss/train': 0.2306177020072937} 11/07/2021 13:59:08 - INFO - __main__ - Step 119265: {'lr': 5.134764912745457e-05, 'samples': 22898880, 'steps': 119264, 'loss/train': 1.1528037786483765} 11/07/2021 13:59:08 - INFO - __main__ - Step 119266: {'lr': 5.1344427336330515e-05, 'samples': 22899072, 'steps': 119265, 'loss/train': 1.2796388864517212} 11/07/2021 13:59:08 - INFO - __main__ - Step 119267: {'lr': 5.1341205634717616e-05, 'samples': 22899264, 'steps': 119266, 'loss/train': 1.138117790222168} 11/07/2021 13:59:09 - INFO - __main__ - Step 119268: {'lr': 5.13379840226173e-05, 'samples': 22899456, 'steps': 119267, 'loss/train': 0.9785695672035217} 11/07/2021 13:59:09 - INFO - __main__ - Step 119269: {'lr': 5.1334762500031024e-05, 'samples': 22899648, 'steps': 119268, 'loss/train': 1.5451672077178955} 11/07/2021 13:59:10 - INFO - __main__ - Step 119270: {'lr': 5.133154106696025e-05, 'samples': 22899840, 'steps': 119269, 'loss/train': 1.0603476762771606} 11/07/2021 13:59:10 - INFO - __main__ - Step 119271: {'lr': 5.13283197234064e-05, 'samples': 22900032, 'steps': 119270, 'loss/train': 1.7759335041046143} 11/07/2021 13:59:11 - INFO - __main__ - Step 119272: {'lr': 5.132509846937094e-05, 'samples': 22900224, 'steps': 119271, 'loss/train': 0.9419232606887817} 11/07/2021 13:59:11 - INFO - __main__ - Step 119273: {'lr': 5.132187730485541e-05, 'samples': 22900416, 'steps': 119272, 'loss/train': 1.2424757480621338} 11/07/2021 13:59:11 - INFO - __main__ - Step 119274: {'lr': 5.131865622986112e-05, 'samples': 22900608, 'steps': 119273, 'loss/train': 1.156270146369934} 11/07/2021 13:59:12 - INFO - __main__ - Step 119275: {'lr': 5.131543524438956e-05, 'samples': 22900800, 'steps': 119274, 'loss/train': 1.2816399335861206} 11/07/2021 13:59:13 - INFO - __main__ - Step 119276: {'lr': 5.1312214348442186e-05, 'samples': 22900992, 'steps': 119275, 'loss/train': 0.6475158333778381} 11/07/2021 13:59:13 - INFO - __main__ - Step 119277: {'lr': 5.130899354202048e-05, 'samples': 22901184, 'steps': 119276, 'loss/train': 1.281724452972412} 11/07/2021 13:59:14 - INFO - __main__ - Step 119278: {'lr': 5.130577282512586e-05, 'samples': 22901376, 'steps': 119277, 'loss/train': 1.7106523513793945} 11/07/2021 13:59:14 - INFO - __main__ - Step 119279: {'lr': 5.130255219775981e-05, 'samples': 22901568, 'steps': 119278, 'loss/train': 1.3298671245574951} 11/07/2021 13:59:15 - INFO - __main__ - Step 119280: {'lr': 5.129933165992376e-05, 'samples': 22901760, 'steps': 119279, 'loss/train': 1.3955718278884888} 11/07/2021 13:59:15 - INFO - __main__ - Step 119281: {'lr': 5.129611121161914e-05, 'samples': 22901952, 'steps': 119280, 'loss/train': 1.7022030353546143} 11/07/2021 13:59:16 - INFO - __main__ - Step 119282: {'lr': 5.129289085284747e-05, 'samples': 22902144, 'steps': 119281, 'loss/train': 1.3608412742614746} 11/07/2021 13:59:16 - INFO - __main__ - Step 119283: {'lr': 5.128967058361014e-05, 'samples': 22902336, 'steps': 119282, 'loss/train': 1.4743268489837646} 11/07/2021 13:59:16 - INFO - __main__ - Step 119284: {'lr': 5.128645040390867e-05, 'samples': 22902528, 'steps': 119283, 'loss/train': 1.0018502473831177} 11/07/2021 13:59:17 - INFO - __main__ - Step 119285: {'lr': 5.1283230313744405e-05, 'samples': 22902720, 'steps': 119284, 'loss/train': 0.9810152649879456} 11/07/2021 13:59:18 - INFO - __main__ - Step 119286: {'lr': 5.128001031311885e-05, 'samples': 22902912, 'steps': 119285, 'loss/train': 1.4661877155303955} 11/07/2021 13:59:18 - INFO - __main__ - Step 119287: {'lr': 5.127679040203345e-05, 'samples': 22903104, 'steps': 119286, 'loss/train': 1.3873769044876099} 11/07/2021 13:59:18 - INFO - __main__ - Step 119288: {'lr': 5.127357058048968e-05, 'samples': 22903296, 'steps': 119287, 'loss/train': 1.5514063835144043} 11/07/2021 13:59:19 - INFO - __main__ - Step 119289: {'lr': 5.127035084848894e-05, 'samples': 22903488, 'steps': 119288, 'loss/train': 0.5633100867271423} 11/07/2021 13:59:20 - INFO - __main__ - Step 119290: {'lr': 5.126713120603274e-05, 'samples': 22903680, 'steps': 119289, 'loss/train': 1.3968867063522339} 11/07/2021 13:59:20 - INFO - __main__ - Step 119291: {'lr': 5.12639116531225e-05, 'samples': 22903872, 'steps': 119290, 'loss/train': 1.3097189664840698} 11/07/2021 13:59:20 - INFO - __main__ - Step 119292: {'lr': 5.126069218975965e-05, 'samples': 22904064, 'steps': 119291, 'loss/train': 1.0415719747543335} 11/07/2021 13:59:21 - INFO - __main__ - Step 119293: {'lr': 5.12574728159457e-05, 'samples': 22904256, 'steps': 119292, 'loss/train': 1.5807565450668335} 11/07/2021 13:59:21 - INFO - __main__ - Step 119294: {'lr': 5.125425353168203e-05, 'samples': 22904448, 'steps': 119293, 'loss/train': 1.3957408666610718} 11/07/2021 13:59:23 - INFO - __main__ - Step 119295: {'lr': 5.125103433697023e-05, 'samples': 22904640, 'steps': 119294, 'loss/train': 1.3078229427337646} 11/07/2021 13:59:23 - INFO - __main__ - Step 119296: {'lr': 5.124781523181155e-05, 'samples': 22904832, 'steps': 119295, 'loss/train': 1.464239478111267} 11/07/2021 13:59:23 - INFO - __main__ - Step 119297: {'lr': 5.1244596216207555e-05, 'samples': 22905024, 'steps': 119296, 'loss/train': 0.5628297924995422} 11/07/2021 13:59:24 - INFO - __main__ - Step 119298: {'lr': 5.124137729015968e-05, 'samples': 22905216, 'steps': 119297, 'loss/train': 1.6345301866531372} 11/07/2021 13:59:24 - INFO - __main__ - Step 119299: {'lr': 5.123815845366936e-05, 'samples': 22905408, 'steps': 119298, 'loss/train': 0.5055443048477173} 11/07/2021 13:59:24 - INFO - __main__ - Step 119300: {'lr': 5.123493970673807e-05, 'samples': 22905600, 'steps': 119299, 'loss/train': 1.5219449996948242} 11/07/2021 13:59:25 - INFO - __main__ - Step 119301: {'lr': 5.123172104936724e-05, 'samples': 22905792, 'steps': 119300, 'loss/train': 1.275759220123291} 11/07/2021 13:59:26 - INFO - __main__ - Step 119302: {'lr': 5.122850248155836e-05, 'samples': 22905984, 'steps': 119301, 'loss/train': 1.3339005708694458} 11/07/2021 13:59:26 - INFO - __main__ - Step 119303: {'lr': 5.122528400331281e-05, 'samples': 22906176, 'steps': 119302, 'loss/train': 1.3087576627731323} 11/07/2021 13:59:26 - INFO - __main__ - Step 119304: {'lr': 5.122206561463211e-05, 'samples': 22906368, 'steps': 119303, 'loss/train': 0.6708119511604309} 11/07/2021 13:59:27 - INFO - __main__ - Step 119305: {'lr': 5.121884731551765e-05, 'samples': 22906560, 'steps': 119304, 'loss/train': 2.2261297702789307} 11/07/2021 13:59:28 - INFO - __main__ - Step 119306: {'lr': 5.1215629105970995e-05, 'samples': 22906752, 'steps': 119305, 'loss/train': 1.6246637105941772} 11/07/2021 13:59:28 - INFO - __main__ - Step 119307: {'lr': 5.121241098599344e-05, 'samples': 22906944, 'steps': 119306, 'loss/train': 1.516841173171997} 11/07/2021 13:59:29 - INFO - __main__ - Step 119308: {'lr': 5.120919295558652e-05, 'samples': 22907136, 'steps': 119307, 'loss/train': 0.8629969954490662} 11/07/2021 13:59:29 - INFO - __main__ - Step 119309: {'lr': 5.120597501475163e-05, 'samples': 22907328, 'steps': 119308, 'loss/train': 1.7030565738677979} 11/07/2021 13:59:29 - INFO - __main__ - Step 119310: {'lr': 5.1202757163490294e-05, 'samples': 22907520, 'steps': 119309, 'loss/train': 1.079053282737732} 11/07/2021 13:59:30 - INFO - __main__ - Step 119311: {'lr': 5.119953940180391e-05, 'samples': 22907712, 'steps': 119310, 'loss/train': 1.1222764253616333} 11/07/2021 13:59:31 - INFO - __main__ - Step 119312: {'lr': 5.119632172969396e-05, 'samples': 22907904, 'steps': 119311, 'loss/train': 1.009093999862671} 11/07/2021 13:59:31 - INFO - __main__ - Step 119313: {'lr': 5.1193104147161885e-05, 'samples': 22908096, 'steps': 119312, 'loss/train': 1.5653855800628662} 11/07/2021 13:59:31 - INFO - __main__ - Step 119314: {'lr': 5.118988665420912e-05, 'samples': 22908288, 'steps': 119313, 'loss/train': 1.098549723625183} 11/07/2021 13:59:32 - INFO - __main__ - Step 119315: {'lr': 5.118666925083712e-05, 'samples': 22908480, 'steps': 119314, 'loss/train': 1.2536330223083496} 11/07/2021 13:59:33 - INFO - __main__ - Step 119316: {'lr': 5.118345193704735e-05, 'samples': 22908672, 'steps': 119315, 'loss/train': 1.3143959045410156} 11/07/2021 13:59:33 - INFO - __main__ - Step 119317: {'lr': 5.118023471284131e-05, 'samples': 22908864, 'steps': 119316, 'loss/train': 0.9923622608184814} 11/07/2021 13:59:33 - INFO - __main__ - Step 119318: {'lr': 5.11770175782203e-05, 'samples': 22909056, 'steps': 119317, 'loss/train': 1.3257681131362915} 11/07/2021 13:59:34 - INFO - __main__ - Step 119319: {'lr': 5.1173800533185876e-05, 'samples': 22909248, 'steps': 119318, 'loss/train': 1.2005913257598877} 11/07/2021 13:59:34 - INFO - __main__ - Step 119320: {'lr': 5.117058357773946e-05, 'samples': 22909440, 'steps': 119319, 'loss/train': 0.8201268315315247} 11/07/2021 13:59:35 - INFO - __main__ - Step 119321: {'lr': 5.1167366711882546e-05, 'samples': 22909632, 'steps': 119320, 'loss/train': 0.6114447712898254} 11/07/2021 13:59:35 - INFO - __main__ - Step 119322: {'lr': 5.116414993561652e-05, 'samples': 22909824, 'steps': 119321, 'loss/train': 1.2876888513565063} 11/07/2021 13:59:36 - INFO - __main__ - Step 119323: {'lr': 5.116093324894286e-05, 'samples': 22910016, 'steps': 119322, 'loss/train': 0.7163128852844238} 11/07/2021 13:59:36 - INFO - __main__ - Step 119324: {'lr': 5.1157716651863e-05, 'samples': 22910208, 'steps': 119323, 'loss/train': 1.4340243339538574} 11/07/2021 13:59:36 - INFO - __main__ - Step 119325: {'lr': 5.1154500144378446e-05, 'samples': 22910400, 'steps': 119324, 'loss/train': 1.6098196506500244} 11/07/2021 13:59:38 - INFO - __main__ - Step 119326: {'lr': 5.1151283726490583e-05, 'samples': 22910592, 'steps': 119325, 'loss/train': 1.2800391912460327} 11/07/2021 13:59:38 - INFO - __main__ - Step 119327: {'lr': 5.1148067398200884e-05, 'samples': 22910784, 'steps': 119326, 'loss/train': 1.1109604835510254} 11/07/2021 13:59:39 - INFO - __main__ - Step 119328: {'lr': 5.1144851159510844e-05, 'samples': 22910976, 'steps': 119327, 'loss/train': 1.6318336725234985} 11/07/2021 13:59:39 - INFO - __main__ - Step 119329: {'lr': 5.114163501042182e-05, 'samples': 22911168, 'steps': 119328, 'loss/train': 1.295923113822937} 11/07/2021 13:59:39 - INFO - __main__ - Step 119330: {'lr': 5.113841895093532e-05, 'samples': 22911360, 'steps': 119329, 'loss/train': 1.1939395666122437} 11/07/2021 13:59:40 - INFO - __main__ - Step 119331: {'lr': 5.113520298105276e-05, 'samples': 22911552, 'steps': 119330, 'loss/train': 0.6011296510696411} 11/07/2021 13:59:40 - INFO - __main__ - Step 119332: {'lr': 5.11319871007756e-05, 'samples': 22911744, 'steps': 119331, 'loss/train': 1.6853300333023071} 11/07/2021 13:59:41 - INFO - __main__ - Step 119333: {'lr': 5.112877131010532e-05, 'samples': 22911936, 'steps': 119332, 'loss/train': 1.7479547262191772} 11/07/2021 13:59:41 - INFO - __main__ - Step 119334: {'lr': 5.1125555609043334e-05, 'samples': 22912128, 'steps': 119333, 'loss/train': 1.6398489475250244} 11/07/2021 13:59:42 - INFO - __main__ - Step 119335: {'lr': 5.112233999759111e-05, 'samples': 22912320, 'steps': 119334, 'loss/train': 0.8537807464599609} 11/07/2021 13:59:42 - INFO - __main__ - Step 119336: {'lr': 5.11191244757501e-05, 'samples': 22912512, 'steps': 119335, 'loss/train': 1.9114038944244385} 11/07/2021 13:59:42 - INFO - __main__ - Step 119337: {'lr': 5.111590904352173e-05, 'samples': 22912704, 'steps': 119336, 'loss/train': 1.5439670085906982} 11/07/2021 13:59:43 - INFO - __main__ - Step 119338: {'lr': 5.111269370090746e-05, 'samples': 22912896, 'steps': 119337, 'loss/train': 1.2697995901107788} 11/07/2021 13:59:44 - INFO - __main__ - Step 119339: {'lr': 5.1109478447908755e-05, 'samples': 22913088, 'steps': 119338, 'loss/train': 1.3794629573822021} 11/07/2021 13:59:44 - INFO - __main__ - Step 119340: {'lr': 5.110626328452703e-05, 'samples': 22913280, 'steps': 119339, 'loss/train': 1.6772993803024292} 11/07/2021 13:59:45 - INFO - __main__ - Step 119341: {'lr': 5.1103048210763833e-05, 'samples': 22913472, 'steps': 119340, 'loss/train': 1.4717376232147217} 11/07/2021 13:59:45 - INFO - __main__ - Step 119342: {'lr': 5.109983322662046e-05, 'samples': 22913664, 'steps': 119341, 'loss/train': 1.320074439048767} 11/07/2021 13:59:46 - INFO - __main__ - Step 119343: {'lr': 5.109661833209844e-05, 'samples': 22913856, 'steps': 119342, 'loss/train': 1.0994951725006104} 11/07/2021 13:59:46 - INFO - __main__ - Step 119344: {'lr': 5.109340352719921e-05, 'samples': 22914048, 'steps': 119343, 'loss/train': 1.374592900276184} 11/07/2021 13:59:47 - INFO - __main__ - Step 119345: {'lr': 5.109018881192423e-05, 'samples': 22914240, 'steps': 119344, 'loss/train': 1.3814802169799805} 11/07/2021 13:59:47 - INFO - __main__ - Step 119346: {'lr': 5.108697418627495e-05, 'samples': 22914432, 'steps': 119345, 'loss/train': 1.6241925954818726} 11/07/2021 13:59:47 - INFO - __main__ - Step 119347: {'lr': 5.1083759650252775e-05, 'samples': 22914624, 'steps': 119346, 'loss/train': 1.3642711639404297} 11/07/2021 13:59:48 - INFO - __main__ - Step 119348: {'lr': 5.108054520385921e-05, 'samples': 22914816, 'steps': 119347, 'loss/train': 0.21503891050815582} 11/07/2021 13:59:49 - INFO - __main__ - Step 119349: {'lr': 5.1077330847095704e-05, 'samples': 22915008, 'steps': 119348, 'loss/train': 1.027758240699768} 11/07/2021 13:59:49 - INFO - __main__ - Step 119350: {'lr': 5.1074116579963666e-05, 'samples': 22915200, 'steps': 119349, 'loss/train': 0.6461832523345947} 11/07/2021 13:59:49 - INFO - __main__ - Step 119351: {'lr': 5.107090240246454e-05, 'samples': 22915392, 'steps': 119350, 'loss/train': 1.5196709632873535} 11/07/2021 13:59:50 - INFO - __main__ - Step 119352: {'lr': 5.106768831459982e-05, 'samples': 22915584, 'steps': 119351, 'loss/train': 1.1310458183288574} 11/07/2021 13:59:50 - INFO - __main__ - Step 119353: {'lr': 5.106447431637093e-05, 'samples': 22915776, 'steps': 119352, 'loss/train': 1.5873042345046997} 11/07/2021 13:59:51 - INFO - __main__ - Step 119354: {'lr': 5.106126040777936e-05, 'samples': 22915968, 'steps': 119353, 'loss/train': 1.2897169589996338} 11/07/2021 13:59:51 - INFO - __main__ - Step 119355: {'lr': 5.1058046588826484e-05, 'samples': 22916160, 'steps': 119354, 'loss/train': 0.935621440410614} 11/07/2021 13:59:52 - INFO - __main__ - Step 119356: {'lr': 5.105483285951376e-05, 'samples': 22916352, 'steps': 119355, 'loss/train': 0.8295337557792664} 11/07/2021 13:59:52 - INFO - __main__ - Step 119357: {'lr': 5.105161921984267e-05, 'samples': 22916544, 'steps': 119356, 'loss/train': 1.1847901344299316} 11/07/2021 13:59:53 - INFO - __main__ - Step 119358: {'lr': 5.104840566981464e-05, 'samples': 22916736, 'steps': 119357, 'loss/train': 0.9017123579978943} 11/07/2021 13:59:54 - INFO - __main__ - Step 119359: {'lr': 5.104519220943113e-05, 'samples': 22916928, 'steps': 119358, 'loss/train': 1.309053659439087} 11/07/2021 13:59:54 - INFO - __main__ - Step 119360: {'lr': 5.104197883869357e-05, 'samples': 22917120, 'steps': 119359, 'loss/train': 1.2096983194351196} 11/07/2021 13:59:54 - INFO - __main__ - Step 119361: {'lr': 5.103876555760345e-05, 'samples': 22917312, 'steps': 119360, 'loss/train': 1.0718754529953003} 11/07/2021 13:59:55 - INFO - __main__ - Step 119362: {'lr': 5.10355523661622e-05, 'samples': 22917504, 'steps': 119361, 'loss/train': 1.3718892335891724} 11/07/2021 13:59:55 - INFO - __main__ - Step 119363: {'lr': 5.103233926437123e-05, 'samples': 22917696, 'steps': 119362, 'loss/train': 0.9864029288291931} 11/07/2021 13:59:56 - INFO - __main__ - Step 119364: {'lr': 5.102912625223205e-05, 'samples': 22917888, 'steps': 119363, 'loss/train': 1.5409424304962158} 11/07/2021 13:59:57 - INFO - __main__ - Step 119365: {'lr': 5.102591332974604e-05, 'samples': 22918080, 'steps': 119364, 'loss/train': 1.1333346366882324} 11/07/2021 13:59:57 - INFO - __main__ - Step 119366: {'lr': 5.1022700496914706e-05, 'samples': 22918272, 'steps': 119365, 'loss/train': 0.466231107711792} 11/07/2021 13:59:57 - INFO - __main__ - Step 119367: {'lr': 5.1019487753739464e-05, 'samples': 22918464, 'steps': 119366, 'loss/train': 1.8131381273269653} 11/07/2021 13:59:58 - INFO - __main__ - Step 119368: {'lr': 5.101627510022186e-05, 'samples': 22918656, 'steps': 119367, 'loss/train': 1.50985848903656} 11/07/2021 13:59:58 - INFO - __main__ - Step 119369: {'lr': 5.101306253636315e-05, 'samples': 22918848, 'steps': 119368, 'loss/train': 0.4842083752155304} 11/07/2021 13:59:59 - INFO - __main__ - Step 119370: {'lr': 5.100985006216491e-05, 'samples': 22919040, 'steps': 119369, 'loss/train': 0.4729100465774536} 11/07/2021 14:00:00 - INFO - __main__ - Step 119371: {'lr': 5.100663767762853e-05, 'samples': 22919232, 'steps': 119370, 'loss/train': 0.956460177898407} 11/07/2021 14:00:00 - INFO - __main__ - Step 119372: {'lr': 5.1003425382755516e-05, 'samples': 22919424, 'steps': 119371, 'loss/train': 2.02183198928833} 11/07/2021 14:00:00 - INFO - __main__ - Step 119373: {'lr': 5.10002131775473e-05, 'samples': 22919616, 'steps': 119372, 'loss/train': 1.9025624990463257} 11/07/2021 14:00:01 - INFO - __main__ - Step 119374: {'lr': 5.0997001062005274e-05, 'samples': 22919808, 'steps': 119373, 'loss/train': 1.2688056230545044} 11/07/2021 14:00:02 - INFO - __main__ - Step 119375: {'lr': 5.099378903613097e-05, 'samples': 22920000, 'steps': 119374, 'loss/train': 0.9817401766777039} 11/07/2021 14:00:02 - INFO - __main__ - Step 119376: {'lr': 5.0990577099925774e-05, 'samples': 22920192, 'steps': 119375, 'loss/train': 1.4807668924331665} 11/07/2021 14:00:02 - INFO - __main__ - Step 119377: {'lr': 5.098736525339115e-05, 'samples': 22920384, 'steps': 119376, 'loss/train': 1.1978785991668701} 11/07/2021 14:00:03 - INFO - __main__ - Step 119378: {'lr': 5.098415349652855e-05, 'samples': 22920576, 'steps': 119377, 'loss/train': 1.4487950801849365} 11/07/2021 14:00:03 - INFO - __main__ - Step 119379: {'lr': 5.0980941829339436e-05, 'samples': 22920768, 'steps': 119378, 'loss/train': 0.7860084772109985} 11/07/2021 14:00:04 - INFO - __main__ - Step 119380: {'lr': 5.0977730251825226e-05, 'samples': 22920960, 'steps': 119379, 'loss/train': 1.35975980758667} 11/07/2021 14:00:04 - INFO - __main__ - Step 119381: {'lr': 5.0974518763987425e-05, 'samples': 22921152, 'steps': 119380, 'loss/train': 1.0466817617416382} 11/07/2021 14:00:05 - INFO - __main__ - Step 119382: {'lr': 5.097130736582739e-05, 'samples': 22921344, 'steps': 119381, 'loss/train': 0.8298596739768982} 11/07/2021 14:00:05 - INFO - __main__ - Step 119383: {'lr': 5.096809605734662e-05, 'samples': 22921536, 'steps': 119382, 'loss/train': 1.5126172304153442} 11/07/2021 14:00:05 - INFO - __main__ - Step 119384: {'lr': 5.096488483854655e-05, 'samples': 22921728, 'steps': 119383, 'loss/train': 2.0778260231018066} 11/07/2021 14:00:07 - INFO - __main__ - Step 119385: {'lr': 5.096167370942864e-05, 'samples': 22921920, 'steps': 119384, 'loss/train': 1.4025100469589233} 11/07/2021 14:00:07 - INFO - __main__ - Step 119386: {'lr': 5.095846266999429e-05, 'samples': 22922112, 'steps': 119385, 'loss/train': 0.5519183874130249} 11/07/2021 14:00:07 - INFO - __main__ - Step 119387: {'lr': 5.095525172024504e-05, 'samples': 22922304, 'steps': 119386, 'loss/train': 1.061232089996338} 11/07/2021 14:00:08 - INFO - __main__ - Step 119388: {'lr': 5.095204086018224e-05, 'samples': 22922496, 'steps': 119387, 'loss/train': 1.1966089010238647} 11/07/2021 14:00:08 - INFO - __main__ - Step 119389: {'lr': 5.0948830089807384e-05, 'samples': 22922688, 'steps': 119388, 'loss/train': 1.516758918762207} 11/07/2021 14:00:09 - INFO - __main__ - Step 119390: {'lr': 5.094561940912193e-05, 'samples': 22922880, 'steps': 119389, 'loss/train': 1.0017918348312378} 11/07/2021 14:00:09 - INFO - __main__ - Step 119391: {'lr': 5.0942408818127315e-05, 'samples': 22923072, 'steps': 119390, 'loss/train': 1.5643742084503174} 11/07/2021 14:00:10 - INFO - __main__ - Step 119392: {'lr': 5.0939198316824945e-05, 'samples': 22923264, 'steps': 119391, 'loss/train': 1.8256844282150269} 11/07/2021 14:00:10 - INFO - __main__ - Step 119393: {'lr': 5.093598790521634e-05, 'samples': 22923456, 'steps': 119392, 'loss/train': 1.4736881256103516} 11/07/2021 14:00:10 - INFO - __main__ - Step 119394: {'lr': 5.093277758330295e-05, 'samples': 22923648, 'steps': 119393, 'loss/train': 1.817359209060669} 11/07/2021 14:00:11 - INFO - __main__ - Step 119395: {'lr': 5.0929567351086115e-05, 'samples': 22923840, 'steps': 119394, 'loss/train': 1.317192792892456} 11/07/2021 14:00:12 - INFO - __main__ - Step 119396: {'lr': 5.092635720856734e-05, 'samples': 22924032, 'steps': 119395, 'loss/train': 1.189365029335022} 11/07/2021 14:00:12 - INFO - __main__ - Step 119397: {'lr': 5.0923147155748084e-05, 'samples': 22924224, 'steps': 119396, 'loss/train': 0.8996958136558533} 11/07/2021 14:00:13 - INFO - __main__ - Step 119398: {'lr': 5.0919937192629774e-05, 'samples': 22924416, 'steps': 119397, 'loss/train': 1.3565386533737183} 11/07/2021 14:00:13 - INFO - __main__ - Step 119399: {'lr': 5.0916727319213874e-05, 'samples': 22924608, 'steps': 119398, 'loss/train': 1.591681718826294} 11/07/2021 14:00:13 - INFO - __main__ - Step 119400: {'lr': 5.091351753550183e-05, 'samples': 22924800, 'steps': 119399, 'loss/train': 1.2632977962493896} 11/07/2021 14:00:14 - INFO - __main__ - Step 119401: {'lr': 5.0910307841495085e-05, 'samples': 22924992, 'steps': 119400, 'loss/train': 1.13453209400177} 11/07/2021 14:00:15 - INFO - __main__ - Step 119402: {'lr': 5.0907098237195084e-05, 'samples': 22925184, 'steps': 119401, 'loss/train': 1.1595888137817383} 11/07/2021 14:00:15 - INFO - __main__ - Step 119403: {'lr': 5.0903888722603265e-05, 'samples': 22925376, 'steps': 119402, 'loss/train': 1.3836146593093872} 11/07/2021 14:00:15 - INFO - __main__ - Step 119404: {'lr': 5.0900679297721105e-05, 'samples': 22925568, 'steps': 119403, 'loss/train': 1.0164207220077515} 11/07/2021 14:00:16 - INFO - __main__ - Step 119405: {'lr': 5.0897469962549986e-05, 'samples': 22925760, 'steps': 119404, 'loss/train': 1.3146542310714722} 11/07/2021 14:00:17 - INFO - __main__ - Step 119406: {'lr': 5.089426071709144e-05, 'samples': 22925952, 'steps': 119405, 'loss/train': 0.9662491083145142} 11/07/2021 14:00:17 - INFO - __main__ - Step 119407: {'lr': 5.0891051561346824e-05, 'samples': 22926144, 'steps': 119406, 'loss/train': 1.5365108251571655} 11/07/2021 14:00:17 - INFO - __main__ - Step 119408: {'lr': 5.088784249531772e-05, 'samples': 22926336, 'steps': 119407, 'loss/train': 1.2972701787948608} 11/07/2021 14:00:18 - INFO - __main__ - Step 119409: {'lr': 5.0884633519005406e-05, 'samples': 22926528, 'steps': 119408, 'loss/train': 1.3148846626281738} 11/07/2021 14:00:18 - INFO - __main__ - Step 119410: {'lr': 5.088142463241141e-05, 'samples': 22926720, 'steps': 119409, 'loss/train': 1.924607515335083} 11/07/2021 14:00:19 - INFO - __main__ - Step 119411: {'lr': 5.087821583553717e-05, 'samples': 22926912, 'steps': 119410, 'loss/train': 1.6803869009017944} 11/07/2021 14:00:19 - INFO - __main__ - Step 119412: {'lr': 5.0875007128384136e-05, 'samples': 22927104, 'steps': 119411, 'loss/train': 1.3826056718826294} 11/07/2021 14:00:20 - INFO - __main__ - Step 119413: {'lr': 5.087179851095375e-05, 'samples': 22927296, 'steps': 119412, 'loss/train': 1.3362022638320923} 11/07/2021 14:00:20 - INFO - __main__ - Step 119414: {'lr': 5.0868589983247446e-05, 'samples': 22927488, 'steps': 119413, 'loss/train': 1.3229495286941528} 11/07/2021 14:00:21 - INFO - __main__ - Step 119415: {'lr': 5.086538154526668e-05, 'samples': 22927680, 'steps': 119414, 'loss/train': 1.0396279096603394} 11/07/2021 14:00:22 - INFO - __main__ - Step 119416: {'lr': 5.086217319701292e-05, 'samples': 22927872, 'steps': 119415, 'loss/train': 1.5038306713104248} 11/07/2021 14:00:22 - INFO - __main__ - Step 119417: {'lr': 5.085896493848757e-05, 'samples': 22928064, 'steps': 119416, 'loss/train': 1.7932530641555786} 11/07/2021 14:00:22 - INFO - __main__ - Step 119418: {'lr': 5.085575676969212e-05, 'samples': 22928256, 'steps': 119417, 'loss/train': 1.8112499713897705} 11/07/2021 14:00:23 - INFO - __main__ - Step 119419: {'lr': 5.0852548690627994e-05, 'samples': 22928448, 'steps': 119418, 'loss/train': 1.1031252145767212} 11/07/2021 14:00:23 - INFO - __main__ - Step 119420: {'lr': 5.084934070129662e-05, 'samples': 22928640, 'steps': 119419, 'loss/train': 1.5378280878067017} 11/07/2021 14:00:24 - INFO - __main__ - Step 119421: {'lr': 5.084613280169953e-05, 'samples': 22928832, 'steps': 119420, 'loss/train': 1.375640869140625} 11/07/2021 14:00:24 - INFO - __main__ - Step 119422: {'lr': 5.084292499183804e-05, 'samples': 22929024, 'steps': 119421, 'loss/train': 1.3288359642028809} 11/07/2021 14:00:25 - INFO - __main__ - Step 119423: {'lr': 5.083971727171366e-05, 'samples': 22929216, 'steps': 119422, 'loss/train': 1.329796314239502} 11/07/2021 14:00:25 - INFO - __main__ - Step 119424: {'lr': 5.083650964132783e-05, 'samples': 22929408, 'steps': 119423, 'loss/train': 1.311681866645813} 11/07/2021 14:00:25 - INFO - __main__ - Step 119425: {'lr': 5.083330210068196e-05, 'samples': 22929600, 'steps': 119424, 'loss/train': 1.3011788129806519} 11/07/2021 14:00:26 - INFO - __main__ - Step 119426: {'lr': 5.083009464977756e-05, 'samples': 22929792, 'steps': 119425, 'loss/train': 1.0438488721847534} 11/07/2021 14:00:27 - INFO - __main__ - Step 119427: {'lr': 5.0826887288616066e-05, 'samples': 22929984, 'steps': 119426, 'loss/train': 1.4062211513519287} 11/07/2021 14:00:27 - INFO - __main__ - Step 119428: {'lr': 5.0823680017198866e-05, 'samples': 22930176, 'steps': 119427, 'loss/train': 1.3056273460388184} 11/07/2021 14:00:28 - INFO - __main__ - Step 119429: {'lr': 5.082047283552746e-05, 'samples': 22930368, 'steps': 119428, 'loss/train': 1.1442489624023438} 11/07/2021 14:00:28 - INFO - __main__ - Step 119430: {'lr': 5.081726574360326e-05, 'samples': 22930560, 'steps': 119429, 'loss/train': 1.5477041006088257} 11/07/2021 14:00:28 - INFO - __main__ - Step 119431: {'lr': 5.081405874142775e-05, 'samples': 22930752, 'steps': 119430, 'loss/train': 1.7616046667099} 11/07/2021 14:00:29 - INFO - __main__ - Step 119432: {'lr': 5.081085182900233e-05, 'samples': 22930944, 'steps': 119431, 'loss/train': 1.1652077436447144} 11/07/2021 14:00:30 - INFO - __main__ - Step 119433: {'lr': 5.080764500632848e-05, 'samples': 22931136, 'steps': 119432, 'loss/train': 1.3652942180633545} 11/07/2021 14:00:30 - INFO - __main__ - Step 119434: {'lr': 5.080443827340764e-05, 'samples': 22931328, 'steps': 119433, 'loss/train': 1.5450305938720703} 11/07/2021 14:00:30 - INFO - __main__ - Step 119435: {'lr': 5.0801231630241304e-05, 'samples': 22931520, 'steps': 119434, 'loss/train': 1.4426239728927612} 11/07/2021 14:00:31 - INFO - __main__ - Step 119436: {'lr': 5.0798025076830786e-05, 'samples': 22931712, 'steps': 119435, 'loss/train': 1.9600801467895508} 11/07/2021 14:00:32 - INFO - __main__ - Step 119437: {'lr': 5.079481861317761e-05, 'samples': 22931904, 'steps': 119436, 'loss/train': 1.1721446514129639} 11/07/2021 14:00:32 - INFO - __main__ - Step 119438: {'lr': 5.079161223928322e-05, 'samples': 22932096, 'steps': 119437, 'loss/train': 1.217655897140503} 11/07/2021 14:00:33 - INFO - __main__ - Step 119439: {'lr': 5.078840595514902e-05, 'samples': 22932288, 'steps': 119438, 'loss/train': 0.765208899974823} 11/07/2021 14:00:33 - INFO - __main__ - Step 119440: {'lr': 5.0785199760776526e-05, 'samples': 22932480, 'steps': 119439, 'loss/train': 0.7994443774223328} 11/07/2021 14:00:33 - INFO - __main__ - Step 119441: {'lr': 5.078199365616715e-05, 'samples': 22932672, 'steps': 119440, 'loss/train': 0.9205998182296753} 11/07/2021 14:00:34 - INFO - __main__ - Step 119442: {'lr': 5.077878764132232e-05, 'samples': 22932864, 'steps': 119441, 'loss/train': 1.3396884202957153} 11/07/2021 14:00:35 - INFO - __main__ - Step 119443: {'lr': 5.0775581716243495e-05, 'samples': 22933056, 'steps': 119442, 'loss/train': 1.4100099802017212} 11/07/2021 14:00:35 - INFO - __main__ - Step 119444: {'lr': 5.0772375880932114e-05, 'samples': 22933248, 'steps': 119443, 'loss/train': 1.2430473566055298} 11/07/2021 14:00:35 - INFO - __main__ - Step 119445: {'lr': 5.0769170135389644e-05, 'samples': 22933440, 'steps': 119444, 'loss/train': 1.614083170890808} 11/07/2021 14:00:36 - INFO - __main__ - Step 119446: {'lr': 5.076596447961751e-05, 'samples': 22933632, 'steps': 119445, 'loss/train': 1.2724609375} 11/07/2021 14:00:37 - INFO - __main__ - Step 119447: {'lr': 5.076275891361714e-05, 'samples': 22933824, 'steps': 119446, 'loss/train': 1.335955262184143} 11/07/2021 14:00:37 - INFO - __main__ - Step 119448: {'lr': 5.075955343739005e-05, 'samples': 22934016, 'steps': 119447, 'loss/train': 0.46053025126457214} 11/07/2021 14:00:37 - INFO - __main__ - Step 119449: {'lr': 5.075634805093759e-05, 'samples': 22934208, 'steps': 119448, 'loss/train': 0.9697422385215759} 11/07/2021 14:00:38 - INFO - __main__ - Step 119450: {'lr': 5.0753142754261266e-05, 'samples': 22934400, 'steps': 119449, 'loss/train': 1.144894003868103} 11/07/2021 14:00:38 - INFO - __main__ - Step 119451: {'lr': 5.0749937547362456e-05, 'samples': 22934592, 'steps': 119450, 'loss/train': 1.2487787008285522} 11/07/2021 14:00:39 - INFO - __main__ - Step 119452: {'lr': 5.074673243024266e-05, 'samples': 22934784, 'steps': 119451, 'loss/train': 1.0932878255844116} 11/07/2021 14:00:40 - INFO - __main__ - Step 119453: {'lr': 5.074352740290333e-05, 'samples': 22934976, 'steps': 119452, 'loss/train': 0.7481849789619446} 11/07/2021 14:00:40 - INFO - __main__ - Step 119454: {'lr': 5.074032246534591e-05, 'samples': 22935168, 'steps': 119453, 'loss/train': 0.35924747586250305} 11/07/2021 14:00:40 - INFO - __main__ - Step 119455: {'lr': 5.073711761757177e-05, 'samples': 22935360, 'steps': 119454, 'loss/train': 1.7443655729293823} 11/07/2021 14:00:41 - INFO - __main__ - Step 119456: {'lr': 5.073391285958246e-05, 'samples': 22935552, 'steps': 119455, 'loss/train': 0.8660818338394165} 11/07/2021 14:00:42 - INFO - __main__ - Step 119457: {'lr': 5.073070819137934e-05, 'samples': 22935744, 'steps': 119456, 'loss/train': 1.014788031578064} 11/07/2021 14:00:42 - INFO - __main__ - Step 119458: {'lr': 5.072750361296391e-05, 'samples': 22935936, 'steps': 119457, 'loss/train': 1.024677038192749} 11/07/2021 14:00:42 - INFO - __main__ - Step 119459: {'lr': 5.072429912433757e-05, 'samples': 22936128, 'steps': 119458, 'loss/train': 1.5839216709136963} 11/07/2021 14:00:43 - INFO - __main__ - Step 119460: {'lr': 5.072109472550179e-05, 'samples': 22936320, 'steps': 119459, 'loss/train': 1.4515489339828491} 11/07/2021 14:00:43 - INFO - __main__ - Step 119461: {'lr': 5.071789041645802e-05, 'samples': 22936512, 'steps': 119460, 'loss/train': 1.3769440650939941} 11/07/2021 14:00:43 - INFO - __main__ - Step 119462: {'lr': 5.071468619720776e-05, 'samples': 22936704, 'steps': 119461, 'loss/train': 0.8410283327102661} 11/07/2021 14:00:44 - INFO - __main__ - Step 119463: {'lr': 5.071148206775234e-05, 'samples': 22936896, 'steps': 119462, 'loss/train': 1.4150437116622925} 11/07/2021 14:00:45 - INFO - __main__ - Step 119464: {'lr': 5.070827802809322e-05, 'samples': 22937088, 'steps': 119463, 'loss/train': 1.1643478870391846} 11/07/2021 14:00:45 - INFO - __main__ - Step 119465: {'lr': 5.070507407823188e-05, 'samples': 22937280, 'steps': 119464, 'loss/train': 1.3197102546691895} 11/07/2021 14:00:46 - INFO - __main__ - Step 119466: {'lr': 5.070187021816977e-05, 'samples': 22937472, 'steps': 119465, 'loss/train': 1.2530463933944702} 11/07/2021 14:00:46 - INFO - __main__ - Step 119467: {'lr': 5.069866644790833e-05, 'samples': 22937664, 'steps': 119466, 'loss/train': 1.3317618370056152} 11/07/2021 14:00:47 - INFO - __main__ - Step 119468: {'lr': 5.069546276744896e-05, 'samples': 22937856, 'steps': 119467, 'loss/train': 1.3918161392211914} 11/07/2021 14:00:47 - INFO - __main__ - Step 119469: {'lr': 5.0692259176793154e-05, 'samples': 22938048, 'steps': 119468, 'loss/train': 1.2186267375946045} 11/07/2021 14:00:48 - INFO - __main__ - Step 119470: {'lr': 5.068905567594237e-05, 'samples': 22938240, 'steps': 119469, 'loss/train': 1.257509469985962} 11/07/2021 14:00:48 - INFO - __main__ - Step 119471: {'lr': 5.0685852264898e-05, 'samples': 22938432, 'steps': 119470, 'loss/train': 1.1205635070800781} 11/07/2021 14:00:48 - INFO - __main__ - Step 119472: {'lr': 5.068264894366148e-05, 'samples': 22938624, 'steps': 119471, 'loss/train': 1.137746810913086} 11/07/2021 14:00:49 - INFO - __main__ - Step 119473: {'lr': 5.067944571223432e-05, 'samples': 22938816, 'steps': 119472, 'loss/train': 1.3961896896362305} 11/07/2021 14:00:50 - INFO - __main__ - Step 119474: {'lr': 5.0676242570617924e-05, 'samples': 22939008, 'steps': 119473, 'loss/train': 1.1827577352523804} 11/07/2021 14:00:50 - INFO - __main__ - Step 119475: {'lr': 5.06730395188138e-05, 'samples': 22939200, 'steps': 119474, 'loss/train': 1.1461752653121948} 11/07/2021 14:00:50 - INFO - __main__ - Step 119476: {'lr': 5.066983655682325e-05, 'samples': 22939392, 'steps': 119475, 'loss/train': 1.5722274780273438} 11/07/2021 14:00:51 - INFO - __main__ - Step 119477: {'lr': 5.06666336846478e-05, 'samples': 22939584, 'steps': 119476, 'loss/train': 1.2058886289596558} 11/07/2021 14:00:52 - INFO - __main__ - Step 119478: {'lr': 5.066343090228889e-05, 'samples': 22939776, 'steps': 119477, 'loss/train': 1.4793791770935059} 11/07/2021 14:00:52 - INFO - __main__ - Step 119479: {'lr': 5.066022820974797e-05, 'samples': 22939968, 'steps': 119478, 'loss/train': 1.207394003868103} 11/07/2021 14:00:53 - INFO - __main__ - Step 119480: {'lr': 5.065702560702648e-05, 'samples': 22940160, 'steps': 119479, 'loss/train': 0.9209238290786743} 11/07/2021 14:00:53 - INFO - __main__ - Step 119481: {'lr': 5.0653823094125834e-05, 'samples': 22940352, 'steps': 119480, 'loss/train': 1.8153373003005981} 11/07/2021 14:00:53 - INFO - __main__ - Step 119482: {'lr': 5.06506206710475e-05, 'samples': 22940544, 'steps': 119481, 'loss/train': 1.3621034622192383} 11/07/2021 14:00:54 - INFO - __main__ - Step 119483: {'lr': 5.064741833779296e-05, 'samples': 22940736, 'steps': 119482, 'loss/train': 1.0661251544952393} 11/07/2021 14:00:55 - INFO - __main__ - Step 119484: {'lr': 5.064421609436359e-05, 'samples': 22940928, 'steps': 119483, 'loss/train': 1.1146016120910645} 11/07/2021 14:00:55 - INFO - __main__ - Step 119485: {'lr': 5.0641013940760843e-05, 'samples': 22941120, 'steps': 119484, 'loss/train': 0.871116042137146} 11/07/2021 14:00:55 - INFO - __main__ - Step 119486: {'lr': 5.063781187698621e-05, 'samples': 22941312, 'steps': 119485, 'loss/train': 1.2594611644744873} 11/07/2021 14:00:56 - INFO - __main__ - Step 119487: {'lr': 5.0634609903041086e-05, 'samples': 22941504, 'steps': 119486, 'loss/train': 1.27139413356781} 11/07/2021 14:00:56 - INFO - __main__ - Step 119488: {'lr': 5.063140801892693e-05, 'samples': 22941696, 'steps': 119487, 'loss/train': 1.2719323635101318} 11/07/2021 14:00:57 - INFO - __main__ - Step 119489: {'lr': 5.0628206224645254e-05, 'samples': 22941888, 'steps': 119488, 'loss/train': 0.9683706760406494} 11/07/2021 14:00:57 - INFO - __main__ - Step 119490: {'lr': 5.062500452019736e-05, 'samples': 22942080, 'steps': 119489, 'loss/train': 1.5102105140686035} 11/07/2021 14:00:58 - INFO - __main__ - Step 119491: {'lr': 5.0621802905584766e-05, 'samples': 22942272, 'steps': 119490, 'loss/train': 1.4606679677963257} 11/07/2021 14:00:58 - INFO - __main__ - Step 119492: {'lr': 5.061860138080892e-05, 'samples': 22942464, 'steps': 119491, 'loss/train': 0.979155421257019} 11/07/2021 14:00:59 - INFO - __main__ - Step 119493: {'lr': 5.061539994587125e-05, 'samples': 22942656, 'steps': 119492, 'loss/train': 1.7063902616500854} 11/07/2021 14:01:00 - INFO - __main__ - Step 119494: {'lr': 5.0612198600773206e-05, 'samples': 22942848, 'steps': 119493, 'loss/train': 0.883765697479248} 11/07/2021 14:01:00 - INFO - __main__ - Step 119495: {'lr': 5.060899734551622e-05, 'samples': 22943040, 'steps': 119494, 'loss/train': 1.2009130716323853} 11/07/2021 14:01:00 - INFO - __main__ - Step 119496: {'lr': 5.060579618010175e-05, 'samples': 22943232, 'steps': 119495, 'loss/train': 1.3975114822387695} 11/07/2021 14:01:01 - INFO - __main__ - Step 119497: {'lr': 5.060259510453125e-05, 'samples': 22943424, 'steps': 119496, 'loss/train': 1.2049211263656616} 11/07/2021 14:01:01 - INFO - __main__ - Step 119498: {'lr': 5.059939411880613e-05, 'samples': 22943616, 'steps': 119497, 'loss/train': 2.613762378692627} 11/07/2021 14:01:02 - INFO - __main__ - Step 119499: {'lr': 5.0596193222927826e-05, 'samples': 22943808, 'steps': 119498, 'loss/train': 1.597848653793335} 11/07/2021 14:01:02 - INFO - __main__ - Step 119500: {'lr': 5.0592992416897826e-05, 'samples': 22944000, 'steps': 119499, 'loss/train': 1.3601642847061157} 11/07/2021 14:01:03 - INFO - __main__ - Step 119501: {'lr': 5.0589791700717537e-05, 'samples': 22944192, 'steps': 119500, 'loss/train': 0.9301084280014038} 11/07/2021 14:01:03 - INFO - __main__ - Step 119502: {'lr': 5.0586591074388456e-05, 'samples': 22944384, 'steps': 119501, 'loss/train': 1.0516141653060913} 11/07/2021 14:01:03 - INFO - __main__ - Step 119503: {'lr': 5.0583390537911946e-05, 'samples': 22944576, 'steps': 119502, 'loss/train': 1.2778090238571167} 11/07/2021 14:01:04 - INFO - __main__ - Step 119504: {'lr': 5.058019009128948e-05, 'samples': 22944768, 'steps': 119503, 'loss/train': 1.381414532661438} 11/07/2021 14:01:05 - INFO - __main__ - Step 119505: {'lr': 5.0576989734522486e-05, 'samples': 22944960, 'steps': 119504, 'loss/train': 1.3053168058395386} 11/07/2021 14:01:05 - INFO - __main__ - Step 119506: {'lr': 5.057378946761243e-05, 'samples': 22945152, 'steps': 119505, 'loss/train': 1.153106451034546} 11/07/2021 14:01:05 - INFO - __main__ - Step 119507: {'lr': 5.0570589290560744e-05, 'samples': 22945344, 'steps': 119506, 'loss/train': 1.540859580039978} 11/07/2021 14:01:06 - INFO - __main__ - Step 119508: {'lr': 5.0567389203368866e-05, 'samples': 22945536, 'steps': 119507, 'loss/train': 1.2350008487701416} 11/07/2021 14:01:07 - INFO - __main__ - Step 119509: {'lr': 5.056418920603825e-05, 'samples': 22945728, 'steps': 119508, 'loss/train': 1.2027509212493896} 11/07/2021 14:01:07 - INFO - __main__ - Step 119510: {'lr': 5.0560989298570334e-05, 'samples': 22945920, 'steps': 119509, 'loss/train': 1.1315892934799194} 11/07/2021 14:01:08 - INFO - __main__ - Step 119511: {'lr': 5.055778948096657e-05, 'samples': 22946112, 'steps': 119510, 'loss/train': 1.4680083990097046} 11/07/2021 14:01:08 - INFO - __main__ - Step 119512: {'lr': 5.055458975322838e-05, 'samples': 22946304, 'steps': 119511, 'loss/train': 1.1997250318527222} 11/07/2021 14:01:08 - INFO - __main__ - Step 119513: {'lr': 5.0551390115357225e-05, 'samples': 22946496, 'steps': 119512, 'loss/train': 1.696947455406189} 11/07/2021 14:01:09 - INFO - __main__ - Step 119514: {'lr': 5.054819056735452e-05, 'samples': 22946688, 'steps': 119513, 'loss/train': 2.273658037185669} 11/07/2021 14:01:10 - INFO - __main__ - Step 119515: {'lr': 5.054499110922181e-05, 'samples': 22946880, 'steps': 119514, 'loss/train': 1.295295238494873} 11/07/2021 14:01:10 - INFO - __main__ - Step 119516: {'lr': 5.054179174096035e-05, 'samples': 22947072, 'steps': 119515, 'loss/train': 1.1632707118988037} 11/07/2021 14:01:10 - INFO - __main__ - Step 119517: {'lr': 5.0538592462571695e-05, 'samples': 22947264, 'steps': 119516, 'loss/train': 1.4163298606872559} 11/07/2021 14:01:11 - INFO - __main__ - Step 119518: {'lr': 5.053539327405729e-05, 'samples': 22947456, 'steps': 119517, 'loss/train': 0.8396657109260559} 11/07/2021 14:01:11 - INFO - __main__ - Step 119519: {'lr': 5.0532194175418545e-05, 'samples': 22947648, 'steps': 119518, 'loss/train': 1.3722634315490723} 11/07/2021 14:01:12 - INFO - __main__ - Step 119520: {'lr': 5.052899516665691e-05, 'samples': 22947840, 'steps': 119519, 'loss/train': 0.7567718029022217} 11/07/2021 14:01:13 - INFO - __main__ - Step 119521: {'lr': 5.052579624777384e-05, 'samples': 22948032, 'steps': 119520, 'loss/train': 1.319940209388733} 11/07/2021 14:01:13 - INFO - __main__ - Step 119522: {'lr': 5.052259741877077e-05, 'samples': 22948224, 'steps': 119521, 'loss/train': 0.6802740097045898} 11/07/2021 14:01:13 - INFO - __main__ - Step 119523: {'lr': 5.051939867964914e-05, 'samples': 22948416, 'steps': 119522, 'loss/train': 1.909637689590454} 11/07/2021 14:01:14 - INFO - __main__ - Step 119524: {'lr': 5.051620003041038e-05, 'samples': 22948608, 'steps': 119523, 'loss/train': 1.0148414373397827} 11/07/2021 14:01:15 - INFO - __main__ - Step 119525: {'lr': 5.0513001471055945e-05, 'samples': 22948800, 'steps': 119524, 'loss/train': 1.971573829650879} 11/07/2021 14:01:15 - INFO - __main__ - Step 119526: {'lr': 5.0509803001587276e-05, 'samples': 22948992, 'steps': 119525, 'loss/train': 0.4669632613658905} 11/07/2021 14:01:15 - INFO - __main__ - Step 119527: {'lr': 5.050660462200582e-05, 'samples': 22949184, 'steps': 119526, 'loss/train': 1.2690832614898682} 11/07/2021 14:01:16 - INFO - __main__ - Step 119528: {'lr': 5.0503406332312985e-05, 'samples': 22949376, 'steps': 119527, 'loss/train': 1.45167076587677} 11/07/2021 14:01:16 - INFO - __main__ - Step 119529: {'lr': 5.0500208132510326e-05, 'samples': 22949568, 'steps': 119528, 'loss/train': 1.5171797275543213} 11/07/2021 14:01:16 - INFO - __main__ - Step 119530: {'lr': 5.0497010022599126e-05, 'samples': 22949760, 'steps': 119529, 'loss/train': 1.3465546369552612} 11/07/2021 14:01:18 - INFO - __main__ - Step 119531: {'lr': 5.049381200258088e-05, 'samples': 22949952, 'steps': 119530, 'loss/train': 0.8177201151847839} 11/07/2021 14:01:18 - INFO - __main__ - Step 119532: {'lr': 5.049061407245706e-05, 'samples': 22950144, 'steps': 119531, 'loss/train': 1.857217788696289} 11/07/2021 14:01:18 - INFO - __main__ - Step 119533: {'lr': 5.04874162322291e-05, 'samples': 22950336, 'steps': 119532, 'loss/train': 1.5057623386383057} 11/07/2021 14:01:19 - INFO - __main__ - Step 119534: {'lr': 5.0484218481898405e-05, 'samples': 22950528, 'steps': 119533, 'loss/train': 1.152743935585022} 11/07/2021 14:01:19 - INFO - __main__ - Step 119535: {'lr': 5.0481020821466465e-05, 'samples': 22950720, 'steps': 119534, 'loss/train': 1.303466558456421} 11/07/2021 14:01:20 - INFO - __main__ - Step 119536: {'lr': 5.0477823250934665e-05, 'samples': 22950912, 'steps': 119535, 'loss/train': 1.434794545173645} 11/07/2021 14:01:20 - INFO - __main__ - Step 119537: {'lr': 5.047462577030451e-05, 'samples': 22951104, 'steps': 119536, 'loss/train': 0.8914667963981628} 11/07/2021 14:01:21 - INFO - __main__ - Step 119538: {'lr': 5.047142837957741e-05, 'samples': 22951296, 'steps': 119537, 'loss/train': 1.4077779054641724} 11/07/2021 14:01:21 - INFO - __main__ - Step 119539: {'lr': 5.046823107875478e-05, 'samples': 22951488, 'steps': 119538, 'loss/train': 0.9702016115188599} 11/07/2021 14:01:22 - INFO - __main__ - Step 119540: {'lr': 5.0465033867838125e-05, 'samples': 22951680, 'steps': 119539, 'loss/train': 0.9676209688186646} 11/07/2021 14:01:23 - INFO - __main__ - Step 119541: {'lr': 5.0461836746828804e-05, 'samples': 22951872, 'steps': 119540, 'loss/train': 2.045780658721924} 11/07/2021 14:01:23 - INFO - __main__ - Step 119542: {'lr': 5.045863971572839e-05, 'samples': 22952064, 'steps': 119541, 'loss/train': 1.324744701385498} 11/07/2021 14:01:23 - INFO - __main__ - Step 119543: {'lr': 5.0455442774538175e-05, 'samples': 22952256, 'steps': 119542, 'loss/train': 1.5573184490203857} 11/07/2021 14:01:24 - INFO - __main__ - Step 119544: {'lr': 5.0452245923259645e-05, 'samples': 22952448, 'steps': 119543, 'loss/train': 1.6907894611358643} 11/07/2021 14:01:24 - INFO - __main__ - Step 119545: {'lr': 5.0449049161894246e-05, 'samples': 22952640, 'steps': 119544, 'loss/train': 0.5409862995147705} 11/07/2021 14:01:25 - INFO - __main__ - Step 119546: {'lr': 5.0445852490443453e-05, 'samples': 22952832, 'steps': 119545, 'loss/train': 1.4358150959014893} 11/07/2021 14:01:26 - INFO - __main__ - Step 119547: {'lr': 5.0442655908908644e-05, 'samples': 22953024, 'steps': 119546, 'loss/train': 1.354887843132019} 11/07/2021 14:01:26 - INFO - __main__ - Step 119548: {'lr': 5.0439459417291335e-05, 'samples': 22953216, 'steps': 119547, 'loss/train': 1.1834553480148315} 11/07/2021 14:01:26 - INFO - __main__ - Step 119549: {'lr': 5.0436263015592896e-05, 'samples': 22953408, 'steps': 119548, 'loss/train': 1.2097970247268677} 11/07/2021 14:01:27 - INFO - __main__ - Step 119550: {'lr': 5.043306670381481e-05, 'samples': 22953600, 'steps': 119549, 'loss/train': 1.5497909784317017} 11/07/2021 14:01:27 - INFO - __main__ - Step 119551: {'lr': 5.042987048195849e-05, 'samples': 22953792, 'steps': 119550, 'loss/train': 1.2765915393829346} 11/07/2021 14:01:28 - INFO - __main__ - Step 119552: {'lr': 5.0426674350025407e-05, 'samples': 22953984, 'steps': 119551, 'loss/train': 1.5426478385925293} 11/07/2021 14:01:28 - INFO - __main__ - Step 119553: {'lr': 5.042347830801705e-05, 'samples': 22954176, 'steps': 119552, 'loss/train': 1.3481720685958862} 11/07/2021 14:01:29 - INFO - __main__ - Step 119554: {'lr': 5.042028235593474e-05, 'samples': 22954368, 'steps': 119553, 'loss/train': 1.3681092262268066} 11/07/2021 14:01:29 - INFO - __main__ - Step 119555: {'lr': 5.0417086493779934e-05, 'samples': 22954560, 'steps': 119554, 'loss/train': 1.2208493947982788} 11/07/2021 14:01:29 - INFO - __main__ - Step 119556: {'lr': 5.041389072155414e-05, 'samples': 22954752, 'steps': 119555, 'loss/train': 1.0683788061141968} 11/07/2021 14:01:30 - INFO - __main__ - Step 119557: {'lr': 5.041069503925877e-05, 'samples': 22954944, 'steps': 119556, 'loss/train': 1.2534290552139282} 11/07/2021 14:01:31 - INFO - __main__ - Step 119558: {'lr': 5.040749944689524e-05, 'samples': 22955136, 'steps': 119557, 'loss/train': 1.757511854171753} 11/07/2021 14:01:31 - INFO - __main__ - Step 119559: {'lr': 5.0404303944465015e-05, 'samples': 22955328, 'steps': 119558, 'loss/train': 0.9975600838661194} 11/07/2021 14:01:31 - INFO - __main__ - Step 119560: {'lr': 5.0401108531969525e-05, 'samples': 22955520, 'steps': 119559, 'loss/train': 1.019278645515442} 11/07/2021 14:01:32 - INFO - __main__ - Step 119561: {'lr': 5.039791320941023e-05, 'samples': 22955712, 'steps': 119560, 'loss/train': 1.4666913747787476} 11/07/2021 14:01:33 - INFO - __main__ - Step 119562: {'lr': 5.039471797678854e-05, 'samples': 22955904, 'steps': 119561, 'loss/train': 1.4660651683807373} 11/07/2021 14:01:33 - INFO - __main__ - Step 119563: {'lr': 5.0391522834105944e-05, 'samples': 22956096, 'steps': 119562, 'loss/train': 1.4038574695587158} 11/07/2021 14:01:34 - INFO - __main__ - Step 119564: {'lr': 5.038832778136387e-05, 'samples': 22956288, 'steps': 119563, 'loss/train': 1.2216649055480957} 11/07/2021 14:01:34 - INFO - __main__ - Step 119565: {'lr': 5.038513281856369e-05, 'samples': 22956480, 'steps': 119564, 'loss/train': 0.9794641137123108} 11/07/2021 14:01:34 - INFO - __main__ - Step 119566: {'lr': 5.038193794570689e-05, 'samples': 22956672, 'steps': 119565, 'loss/train': 0.6476624011993408} 11/07/2021 14:01:35 - INFO - __main__ - Step 119567: {'lr': 5.037874316279492e-05, 'samples': 22956864, 'steps': 119566, 'loss/train': 1.5718244314193726} 11/07/2021 14:01:36 - INFO - __main__ - Step 119568: {'lr': 5.0375548469829196e-05, 'samples': 22957056, 'steps': 119567, 'loss/train': 1.001465916633606} 11/07/2021 14:01:36 - INFO - __main__ - Step 119569: {'lr': 5.037235386681116e-05, 'samples': 22957248, 'steps': 119568, 'loss/train': 1.7564067840576172} 11/07/2021 14:01:36 - INFO - __main__ - Step 119570: {'lr': 5.0369159353742284e-05, 'samples': 22957440, 'steps': 119569, 'loss/train': 1.4822050333023071} 11/07/2021 14:01:37 - INFO - __main__ - Step 119571: {'lr': 5.036596493062395e-05, 'samples': 22957632, 'steps': 119570, 'loss/train': 0.4786619246006012} 11/07/2021 14:01:37 - INFO - __main__ - Step 119572: {'lr': 5.036277059745767e-05, 'samples': 22957824, 'steps': 119571, 'loss/train': 1.314380168914795} 11/07/2021 14:01:38 - INFO - __main__ - Step 119573: {'lr': 5.035957635424482e-05, 'samples': 22958016, 'steps': 119572, 'loss/train': 1.7303156852722168} 11/07/2021 14:01:39 - INFO - __main__ - Step 119574: {'lr': 5.035638220098687e-05, 'samples': 22958208, 'steps': 119573, 'loss/train': 1.2076722383499146} 11/07/2021 14:01:39 - INFO - __main__ - Step 119575: {'lr': 5.035318813768533e-05, 'samples': 22958400, 'steps': 119574, 'loss/train': 1.4160585403442383} 11/07/2021 14:01:39 - INFO - __main__ - Step 119576: {'lr': 5.034999416434147e-05, 'samples': 22958592, 'steps': 119575, 'loss/train': 1.5304615497589111} 11/07/2021 14:01:40 - INFO - __main__ - Step 119577: {'lr': 5.034680028095684e-05, 'samples': 22958784, 'steps': 119576, 'loss/train': 1.5441464185714722} 11/07/2021 14:01:41 - INFO - __main__ - Step 119578: {'lr': 5.034360648753286e-05, 'samples': 22958976, 'steps': 119577, 'loss/train': 0.7300692200660706} 11/07/2021 14:01:41 - INFO - __main__ - Step 119579: {'lr': 5.034041278407098e-05, 'samples': 22959168, 'steps': 119578, 'loss/train': 1.7333110570907593} 11/07/2021 14:01:41 - INFO - __main__ - Step 119580: {'lr': 5.03372191705726e-05, 'samples': 22959360, 'steps': 119579, 'loss/train': 0.9265751838684082} 11/07/2021 14:01:42 - INFO - __main__ - Step 119581: {'lr': 5.033402564703923e-05, 'samples': 22959552, 'steps': 119580, 'loss/train': 1.4071813821792603} 11/07/2021 14:01:42 - INFO - __main__ - Step 119582: {'lr': 5.0330832213472235e-05, 'samples': 22959744, 'steps': 119581, 'loss/train': 1.0742491483688354} 11/07/2021 14:01:43 - INFO - __main__ - Step 119583: {'lr': 5.03276388698731e-05, 'samples': 22959936, 'steps': 119582, 'loss/train': 0.9654896259307861} 11/07/2021 14:01:44 - INFO - __main__ - Step 119584: {'lr': 5.032444561624325e-05, 'samples': 22960128, 'steps': 119583, 'loss/train': 1.2838733196258545} 11/07/2021 14:01:44 - INFO - __main__ - Step 119585: {'lr': 5.0321252452584094e-05, 'samples': 22960320, 'steps': 119584, 'loss/train': 1.2429131269454956} 11/07/2021 14:01:44 - INFO - __main__ - Step 119586: {'lr': 5.0318059378897193e-05, 'samples': 22960512, 'steps': 119585, 'loss/train': 1.2512941360473633} 11/07/2021 14:01:45 - INFO - __main__ - Step 119587: {'lr': 5.031486639518385e-05, 'samples': 22960704, 'steps': 119586, 'loss/train': 1.2218189239501953} 11/07/2021 14:01:46 - INFO - __main__ - Step 119588: {'lr': 5.03116735014455e-05, 'samples': 22960896, 'steps': 119587, 'loss/train': 0.9119781255722046} 11/07/2021 14:01:46 - INFO - __main__ - Step 119589: {'lr': 5.030848069768365e-05, 'samples': 22961088, 'steps': 119588, 'loss/train': 1.2515199184417725} 11/07/2021 14:01:46 - INFO - __main__ - Step 119590: {'lr': 5.0305287983899714e-05, 'samples': 22961280, 'steps': 119589, 'loss/train': 0.9437512159347534} 11/07/2021 14:01:47 - INFO - __main__ - Step 119591: {'lr': 5.030209536009514e-05, 'samples': 22961472, 'steps': 119590, 'loss/train': 1.2714918851852417} 11/07/2021 14:01:47 - INFO - __main__ - Step 119592: {'lr': 5.029890282627136e-05, 'samples': 22961664, 'steps': 119591, 'loss/train': 1.2577401399612427} 11/07/2021 14:01:47 - INFO - __main__ - Step 119593: {'lr': 5.0295710382429807e-05, 'samples': 22961856, 'steps': 119592, 'loss/train': 1.3386375904083252} 11/07/2021 14:01:48 - INFO - __main__ - Step 119594: {'lr': 5.0292518028571935e-05, 'samples': 22962048, 'steps': 119593, 'loss/train': 1.175888180732727} 11/07/2021 14:01:49 - INFO - __main__ - Step 119595: {'lr': 5.0289325764699164e-05, 'samples': 22962240, 'steps': 119594, 'loss/train': 1.0354747772216797} 11/07/2021 14:01:49 - INFO - __main__ - Step 119596: {'lr': 5.028613359081294e-05, 'samples': 22962432, 'steps': 119595, 'loss/train': 1.1813753843307495} 11/07/2021 14:01:50 - INFO - __main__ - Step 119597: {'lr': 5.028294150691479e-05, 'samples': 22962624, 'steps': 119596, 'loss/train': 0.7735356688499451} 11/07/2021 14:01:50 - INFO - __main__ - Step 119598: {'lr': 5.027974951300596e-05, 'samples': 22962816, 'steps': 119597, 'loss/train': 1.0137642621994019} 11/07/2021 14:01:51 - INFO - __main__ - Step 119599: {'lr': 5.027655760908803e-05, 'samples': 22963008, 'steps': 119598, 'loss/train': 1.5319708585739136} 11/07/2021 14:01:51 - INFO - __main__ - Step 119600: {'lr': 5.027336579516237e-05, 'samples': 22963200, 'steps': 119599, 'loss/train': 1.789392113685608} 11/07/2021 14:01:52 - INFO - __main__ - Step 119601: {'lr': 5.027017407123047e-05, 'samples': 22963392, 'steps': 119600, 'loss/train': 1.3324307203292847} 11/07/2021 14:01:52 - INFO - __main__ - Step 119602: {'lr': 5.0266982437293745e-05, 'samples': 22963584, 'steps': 119601, 'loss/train': 0.4039584994316101} 11/07/2021 14:01:52 - INFO - __main__ - Step 119603: {'lr': 5.026379089335362e-05, 'samples': 22963776, 'steps': 119602, 'loss/train': 1.2009694576263428} 11/07/2021 14:01:54 - INFO - __main__ - Step 119604: {'lr': 5.026059943941158e-05, 'samples': 22963968, 'steps': 119603, 'loss/train': 1.0268170833587646} 11/07/2021 14:01:54 - INFO - __main__ - Step 119605: {'lr': 5.0257408075468995e-05, 'samples': 22964160, 'steps': 119604, 'loss/train': 1.5558726787567139} 11/07/2021 14:01:54 - INFO - __main__ - Step 119606: {'lr': 5.0254216801527364e-05, 'samples': 22964352, 'steps': 119605, 'loss/train': 1.111538052558899} 11/07/2021 14:01:55 - INFO - __main__ - Step 119607: {'lr': 5.02510256175881e-05, 'samples': 22964544, 'steps': 119606, 'loss/train': 1.6477605104446411} 11/07/2021 14:01:55 - INFO - __main__ - Step 119608: {'lr': 5.0247834523652644e-05, 'samples': 22964736, 'steps': 119607, 'loss/train': 1.2279033660888672} 11/07/2021 14:01:56 - INFO - __main__ - Step 119609: {'lr': 5.024464351972241e-05, 'samples': 22964928, 'steps': 119608, 'loss/train': 0.518764078617096} 11/07/2021 14:01:56 - INFO - __main__ - Step 119610: {'lr': 5.024145260579893e-05, 'samples': 22965120, 'steps': 119609, 'loss/train': 1.113508701324463} 11/07/2021 14:01:57 - INFO - __main__ - Step 119611: {'lr': 5.023826178188351e-05, 'samples': 22965312, 'steps': 119610, 'loss/train': 1.143405795097351} 11/07/2021 14:01:57 - INFO - __main__ - Step 119612: {'lr': 5.0235071047977644e-05, 'samples': 22965504, 'steps': 119611, 'loss/train': 1.2675937414169312} 11/07/2021 14:01:57 - INFO - __main__ - Step 119613: {'lr': 5.0231880404082774e-05, 'samples': 22965696, 'steps': 119612, 'loss/train': 1.36361825466156} 11/07/2021 14:01:58 - INFO - __main__ - Step 119614: {'lr': 5.022868985020035e-05, 'samples': 22965888, 'steps': 119613, 'loss/train': 0.8413946628570557} 11/07/2021 14:01:59 - INFO - __main__ - Step 119615: {'lr': 5.022549938633178e-05, 'samples': 22966080, 'steps': 119614, 'loss/train': 1.933458685874939} 11/07/2021 14:01:59 - INFO - __main__ - Step 119616: {'lr': 5.022230901247851e-05, 'samples': 22966272, 'steps': 119615, 'loss/train': 1.6491317749023438} 11/07/2021 14:01:59 - INFO - __main__ - Step 119617: {'lr': 5.021911872864199e-05, 'samples': 22966464, 'steps': 119616, 'loss/train': 1.880689263343811} 11/07/2021 14:02:00 - INFO - __main__ - Step 119618: {'lr': 5.0215928534823655e-05, 'samples': 22966656, 'steps': 119617, 'loss/train': 1.367798089981079} 11/07/2021 14:02:01 - INFO - __main__ - Step 119619: {'lr': 5.0212738431024945e-05, 'samples': 22966848, 'steps': 119618, 'loss/train': 0.6851826906204224} 11/07/2021 14:02:01 - INFO - __main__ - Step 119620: {'lr': 5.0209548417247284e-05, 'samples': 22967040, 'steps': 119619, 'loss/train': 1.5537816286087036} 11/07/2021 14:02:02 - INFO - __main__ - Step 119621: {'lr': 5.020635849349214e-05, 'samples': 22967232, 'steps': 119620, 'loss/train': 2.061779260635376} 11/07/2021 14:02:02 - INFO - __main__ - Step 119622: {'lr': 5.020316865976091e-05, 'samples': 22967424, 'steps': 119621, 'loss/train': 1.3403300046920776} 11/07/2021 14:02:02 - INFO - __main__ - Step 119623: {'lr': 5.01999789160551e-05, 'samples': 22967616, 'steps': 119622, 'loss/train': 0.9991187453269958} 11/07/2021 14:02:03 - INFO - __main__ - Step 119624: {'lr': 5.0196789262376055e-05, 'samples': 22967808, 'steps': 119623, 'loss/train': 1.376434087753296} 11/07/2021 14:02:04 - INFO - __main__ - Step 119625: {'lr': 5.019359969872525e-05, 'samples': 22968000, 'steps': 119624, 'loss/train': 1.5208364725112915} 11/07/2021 14:02:04 - INFO - __main__ - Step 119626: {'lr': 5.019041022510412e-05, 'samples': 22968192, 'steps': 119625, 'loss/train': 1.373939871788025} 11/07/2021 14:02:04 - INFO - __main__ - Step 119627: {'lr': 5.018722084151409e-05, 'samples': 22968384, 'steps': 119626, 'loss/train': 0.9018421173095703} 11/07/2021 14:02:05 - INFO - __main__ - Step 119628: {'lr': 5.018403154795664e-05, 'samples': 22968576, 'steps': 119627, 'loss/train': 1.8023639917373657} 11/07/2021 14:02:05 - INFO - __main__ - Step 119629: {'lr': 5.018084234443318e-05, 'samples': 22968768, 'steps': 119628, 'loss/train': 1.4426040649414062} 11/07/2021 14:02:06 - INFO - __main__ - Step 119630: {'lr': 5.017765323094514e-05, 'samples': 22968960, 'steps': 119629, 'loss/train': 0.8697035312652588} 11/07/2021 14:02:06 - INFO - __main__ - Step 119631: {'lr': 5.017446420749397e-05, 'samples': 22969152, 'steps': 119630, 'loss/train': 0.9793904423713684} 11/07/2021 14:02:07 - INFO - __main__ - Step 119632: {'lr': 5.017127527408111e-05, 'samples': 22969344, 'steps': 119631, 'loss/train': 1.358563780784607} 11/07/2021 14:02:07 - INFO - __main__ - Step 119633: {'lr': 5.016808643070797e-05, 'samples': 22969536, 'steps': 119632, 'loss/train': 1.2885664701461792} 11/07/2021 14:02:08 - INFO - __main__ - Step 119634: {'lr': 5.016489767737603e-05, 'samples': 22969728, 'steps': 119633, 'loss/train': 0.42753610014915466} 11/07/2021 14:02:08 - INFO - __main__ - Step 119635: {'lr': 5.0161709014086675e-05, 'samples': 22969920, 'steps': 119634, 'loss/train': 1.6560084819793701} 11/07/2021 14:02:09 - INFO - __main__ - Step 119636: {'lr': 5.015852044084146e-05, 'samples': 22970112, 'steps': 119635, 'loss/train': 1.3170900344848633} 11/07/2021 14:02:09 - INFO - __main__ - Step 119637: {'lr': 5.0155331957641656e-05, 'samples': 22970304, 'steps': 119636, 'loss/train': 1.1581233739852905} 11/07/2021 14:02:10 - INFO - __main__ - Step 119638: {'lr': 5.015214356448877e-05, 'samples': 22970496, 'steps': 119637, 'loss/train': 1.268211007118225} 11/07/2021 14:02:10 - INFO - __main__ - Step 119639: {'lr': 5.014895526138427e-05, 'samples': 22970688, 'steps': 119638, 'loss/train': 1.0337096452713013} 11/07/2021 14:02:11 - INFO - __main__ - Step 119640: {'lr': 5.0145767048329545e-05, 'samples': 22970880, 'steps': 119639, 'loss/train': 1.2493293285369873} 11/07/2021 14:02:11 - INFO - __main__ - Step 119641: {'lr': 5.014257892532606e-05, 'samples': 22971072, 'steps': 119640, 'loss/train': 1.424103021621704} 11/07/2021 14:02:12 - INFO - __main__ - Step 119642: {'lr': 5.013939089237523e-05, 'samples': 22971264, 'steps': 119641, 'loss/train': 1.15111243724823} 11/07/2021 14:02:12 - INFO - __main__ - Step 119643: {'lr': 5.013620294947854e-05, 'samples': 22971456, 'steps': 119642, 'loss/train': 1.256295084953308} 11/07/2021 14:02:12 - INFO - __main__ - Step 119644: {'lr': 5.013301509663737e-05, 'samples': 22971648, 'steps': 119643, 'loss/train': 1.1492007970809937} 11/07/2021 14:02:13 - INFO - __main__ - Step 119645: {'lr': 5.012982733385319e-05, 'samples': 22971840, 'steps': 119644, 'loss/train': 1.1977399587631226} 11/07/2021 14:02:14 - INFO - __main__ - Step 119646: {'lr': 5.0126639661127405e-05, 'samples': 22972032, 'steps': 119645, 'loss/train': 1.1921637058258057} 11/07/2021 14:02:14 - INFO - __main__ - Step 119647: {'lr': 5.01234520784615e-05, 'samples': 22972224, 'steps': 119646, 'loss/train': 1.2278763055801392} 11/07/2021 14:02:14 - INFO - __main__ - Step 119648: {'lr': 5.012026458585686e-05, 'samples': 22972416, 'steps': 119647, 'loss/train': 1.1702775955200195} 11/07/2021 14:02:15 - INFO - __main__ - Step 119649: {'lr': 5.011707718331496e-05, 'samples': 22972608, 'steps': 119648, 'loss/train': 0.571133017539978} 11/07/2021 14:02:16 - INFO - __main__ - Step 119650: {'lr': 5.011388987083726e-05, 'samples': 22972800, 'steps': 119649, 'loss/train': 1.3807119131088257} 11/07/2021 14:02:16 - INFO - __main__ - Step 119651: {'lr': 5.011070264842513e-05, 'samples': 22972992, 'steps': 119650, 'loss/train': 1.819212794303894} 11/07/2021 14:02:16 - INFO - __main__ - Step 119652: {'lr': 5.0107515516080006e-05, 'samples': 22973184, 'steps': 119651, 'loss/train': 1.3668012619018555} 11/07/2021 14:02:17 - INFO - __main__ - Step 119653: {'lr': 5.010432847380336e-05, 'samples': 22973376, 'steps': 119652, 'loss/train': 1.0938321352005005} 11/07/2021 14:02:17 - INFO - __main__ - Step 119654: {'lr': 5.010114152159661e-05, 'samples': 22973568, 'steps': 119653, 'loss/train': 1.1451877355575562} 11/07/2021 14:02:18 - INFO - __main__ - Step 119655: {'lr': 5.009795465946121e-05, 'samples': 22973760, 'steps': 119654, 'loss/train': 1.523108959197998} 11/07/2021 14:02:19 - INFO - __main__ - Step 119656: {'lr': 5.0094767887398583e-05, 'samples': 22973952, 'steps': 119655, 'loss/train': 1.242987036705017} 11/07/2021 14:02:19 - INFO - __main__ - Step 119657: {'lr': 5.009158120541016e-05, 'samples': 22974144, 'steps': 119656, 'loss/train': 1.2747244834899902} 11/07/2021 14:02:19 - INFO - __main__ - Step 119658: {'lr': 5.00883946134974e-05, 'samples': 22974336, 'steps': 119657, 'loss/train': 0.1785053014755249} 11/07/2021 14:02:20 - INFO - __main__ - Step 119659: {'lr': 5.0085208111661726e-05, 'samples': 22974528, 'steps': 119658, 'loss/train': 1.2522615194320679} 11/07/2021 14:02:20 - INFO - __main__ - Step 119660: {'lr': 5.0082021699904554e-05, 'samples': 22974720, 'steps': 119659, 'loss/train': 1.188537836074829} 11/07/2021 14:02:21 - INFO - __main__ - Step 119661: {'lr': 5.007883537822736e-05, 'samples': 22974912, 'steps': 119660, 'loss/train': 1.1884814500808716} 11/07/2021 14:02:22 - INFO - __main__ - Step 119662: {'lr': 5.007564914663157e-05, 'samples': 22975104, 'steps': 119661, 'loss/train': 1.211238145828247} 11/07/2021 14:02:22 - INFO - __main__ - Step 119663: {'lr': 5.007246300511864e-05, 'samples': 22975296, 'steps': 119662, 'loss/train': 1.2367022037506104} 11/07/2021 14:02:22 - INFO - __main__ - Step 119664: {'lr': 5.006927695368993e-05, 'samples': 22975488, 'steps': 119663, 'loss/train': 1.2399159669876099} 11/07/2021 14:02:23 - INFO - __main__ - Step 119665: {'lr': 5.006609099234691e-05, 'samples': 22975680, 'steps': 119664, 'loss/train': 1.6358206272125244} 11/07/2021 14:02:24 - INFO - __main__ - Step 119666: {'lr': 5.0062905121091015e-05, 'samples': 22975872, 'steps': 119665, 'loss/train': 0.8472388982772827} 11/07/2021 14:02:24 - INFO - __main__ - Step 119667: {'lr': 5.00597193399237e-05, 'samples': 22976064, 'steps': 119666, 'loss/train': 1.4907972812652588} 11/07/2021 14:02:24 - INFO - __main__ - Step 119668: {'lr': 5.005653364884638e-05, 'samples': 22976256, 'steps': 119667, 'loss/train': 1.3101903200149536} 11/07/2021 14:02:25 - INFO - __main__ - Step 119669: {'lr': 5.005334804786052e-05, 'samples': 22976448, 'steps': 119668, 'loss/train': 1.5045233964920044} 11/07/2021 14:02:25 - INFO - __main__ - Step 119670: {'lr': 5.005016253696751e-05, 'samples': 22976640, 'steps': 119669, 'loss/train': 1.1341392993927002} 11/07/2021 14:02:26 - INFO - __main__ - Step 119671: {'lr': 5.004697711616882e-05, 'samples': 22976832, 'steps': 119670, 'loss/train': 0.7721496820449829} 11/07/2021 14:02:27 - INFO - __main__ - Step 119672: {'lr': 5.004379178546589e-05, 'samples': 22977024, 'steps': 119671, 'loss/train': 1.0470845699310303} 11/07/2021 14:02:27 - INFO - __main__ - Step 119673: {'lr': 5.004060654486014e-05, 'samples': 22977216, 'steps': 119672, 'loss/train': 1.245296835899353} 11/07/2021 14:02:27 - INFO - __main__ - Step 119674: {'lr': 5.0037421394352994e-05, 'samples': 22977408, 'steps': 119673, 'loss/train': 1.0770094394683838} 11/07/2021 14:02:28 - INFO - __main__ - Step 119675: {'lr': 5.003423633394591e-05, 'samples': 22977600, 'steps': 119674, 'loss/train': 1.3095299005508423} 11/07/2021 14:02:29 - INFO - __main__ - Step 119676: {'lr': 5.0031051363640306e-05, 'samples': 22977792, 'steps': 119675, 'loss/train': 1.3567866086959839} 11/07/2021 14:02:29 - INFO - __main__ - Step 119677: {'lr': 5.0027866483437715e-05, 'samples': 22977984, 'steps': 119676, 'loss/train': 1.349589228630066} 11/07/2021 14:02:29 - INFO - __main__ - Step 119678: {'lr': 5.002468169333937e-05, 'samples': 22978176, 'steps': 119677, 'loss/train': 1.3726855516433716} 11/07/2021 14:02:30 - INFO - __main__ - Step 119679: {'lr': 5.002149699334685e-05, 'samples': 22978368, 'steps': 119678, 'loss/train': 1.3536001443862915} 11/07/2021 14:02:30 - INFO - __main__ - Step 119680: {'lr': 5.001831238346155e-05, 'samples': 22978560, 'steps': 119679, 'loss/train': 0.3745940625667572} 11/07/2021 14:02:31 - INFO - __main__ - Step 119681: {'lr': 5.0015127863684926e-05, 'samples': 22978752, 'steps': 119680, 'loss/train': 1.0263752937316895} 11/07/2021 14:02:32 - INFO - __main__ - Step 119682: {'lr': 5.001194343401838e-05, 'samples': 22978944, 'steps': 119681, 'loss/train': 1.4943509101867676} 11/07/2021 14:02:32 - INFO - __main__ - Step 119683: {'lr': 5.0008759094463367e-05, 'samples': 22979136, 'steps': 119682, 'loss/train': 1.163798213005066} 11/07/2021 14:02:32 - INFO - __main__ - Step 119684: {'lr': 5.000557484502133e-05, 'samples': 22979328, 'steps': 119683, 'loss/train': 1.4461411237716675} 11/07/2021 14:02:33 - INFO - __main__ - Step 119685: {'lr': 5.00023906856937e-05, 'samples': 22979520, 'steps': 119684, 'loss/train': 1.2826765775680542} 11/07/2021 14:02:33 - INFO - __main__ - Step 119686: {'lr': 4.999920661648191e-05, 'samples': 22979712, 'steps': 119685, 'loss/train': 3.3612561225891113} 11/07/2021 14:02:34 - INFO - __main__ - Step 119687: {'lr': 4.999602263738737e-05, 'samples': 22979904, 'steps': 119686, 'loss/train': 1.6961705684661865} 11/07/2021 14:02:35 - INFO - __main__ - Step 119688: {'lr': 4.9992838748411537e-05, 'samples': 22980096, 'steps': 119687, 'loss/train': 1.4074000120162964} 11/07/2021 14:02:35 - INFO - __main__ - Step 119689: {'lr': 4.998965494955587e-05, 'samples': 22980288, 'steps': 119688, 'loss/train': 0.8173038959503174} 11/07/2021 14:02:35 - INFO - __main__ - Step 119690: {'lr': 4.9986471240821815e-05, 'samples': 22980480, 'steps': 119689, 'loss/train': 1.3528372049331665} 11/07/2021 14:02:36 - INFO - __main__ - Step 119691: {'lr': 4.9983287622210715e-05, 'samples': 22980672, 'steps': 119690, 'loss/train': 0.8015022873878479} 11/07/2021 14:02:37 - INFO - __main__ - Step 119692: {'lr': 4.9980104093724084e-05, 'samples': 22980864, 'steps': 119691, 'loss/train': 1.565985918045044} 11/07/2021 14:02:37 - INFO - __main__ - Step 119693: {'lr': 4.9976920655363304e-05, 'samples': 22981056, 'steps': 119692, 'loss/train': 1.3382891416549683} 11/07/2021 14:02:37 - INFO - __main__ - Step 119694: {'lr': 4.9973737307129844e-05, 'samples': 22981248, 'steps': 119693, 'loss/train': 1.1622576713562012} 11/07/2021 14:02:38 - INFO - __main__ - Step 119695: {'lr': 4.997055404902512e-05, 'samples': 22981440, 'steps': 119694, 'loss/train': 0.7161290645599365} 11/07/2021 14:02:38 - INFO - __main__ - Step 119696: {'lr': 4.996737088105058e-05, 'samples': 22981632, 'steps': 119695, 'loss/train': 1.0470515489578247} 11/07/2021 14:02:39 - INFO - __main__ - Step 119697: {'lr': 4.996418780320766e-05, 'samples': 22981824, 'steps': 119696, 'loss/train': 1.1821861267089844} 11/07/2021 14:02:39 - INFO - __main__ - Step 119698: {'lr': 4.996100481549781e-05, 'samples': 22982016, 'steps': 119697, 'loss/train': 1.4204801321029663} 11/07/2021 14:02:40 - INFO - __main__ - Step 119699: {'lr': 4.995782191792242e-05, 'samples': 22982208, 'steps': 119698, 'loss/train': 1.664227843284607} 11/07/2021 14:02:40 - INFO - __main__ - Step 119700: {'lr': 4.995463911048295e-05, 'samples': 22982400, 'steps': 119699, 'loss/train': 1.4204704761505127} 11/07/2021 14:02:41 - INFO - __main__ - Step 119701: {'lr': 4.995145639318085e-05, 'samples': 22982592, 'steps': 119700, 'loss/train': 1.117382526397705} 11/07/2021 14:02:41 - INFO - __main__ - Step 119702: {'lr': 4.9948273766017516e-05, 'samples': 22982784, 'steps': 119701, 'loss/train': 1.4805986881256104} 11/07/2021 14:02:42 - INFO - __main__ - Step 119703: {'lr': 4.994509122899441e-05, 'samples': 22982976, 'steps': 119702, 'loss/train': 1.3698054552078247} 11/07/2021 14:02:42 - INFO - __main__ - Step 119704: {'lr': 4.994190878211302e-05, 'samples': 22983168, 'steps': 119703, 'loss/train': 1.070998191833496} 11/07/2021 14:02:43 - INFO - __main__ - Step 119705: {'lr': 4.993872642537467e-05, 'samples': 22983360, 'steps': 119704, 'loss/train': 1.486283540725708} 11/07/2021 14:02:43 - INFO - __main__ - Step 119706: {'lr': 4.993554415878085e-05, 'samples': 22983552, 'steps': 119705, 'loss/train': 1.3253099918365479} 11/07/2021 14:02:43 - INFO - __main__ - Step 119707: {'lr': 4.9932361982332945e-05, 'samples': 22983744, 'steps': 119706, 'loss/train': 1.6515384912490845} 11/07/2021 14:02:44 - INFO - __main__ - Step 119708: {'lr': 4.992917989603246e-05, 'samples': 22983936, 'steps': 119707, 'loss/train': 0.7594415545463562} 11/07/2021 14:02:45 - INFO - __main__ - Step 119709: {'lr': 4.992599789988081e-05, 'samples': 22984128, 'steps': 119708, 'loss/train': 1.86416757106781} 11/07/2021 14:02:45 - INFO - __main__ - Step 119710: {'lr': 4.992281599387938e-05, 'samples': 22984320, 'steps': 119709, 'loss/train': 1.3814152479171753} 11/07/2021 14:02:46 - INFO - __main__ - Step 119711: {'lr': 4.9919634178029665e-05, 'samples': 22984512, 'steps': 119710, 'loss/train': 1.3830336332321167} 11/07/2021 14:02:46 - INFO - __main__ - Step 119712: {'lr': 4.991645245233309e-05, 'samples': 22984704, 'steps': 119711, 'loss/train': 1.3547782897949219} 11/07/2021 14:02:47 - INFO - __main__ - Step 119713: {'lr': 4.991327081679106e-05, 'samples': 22984896, 'steps': 119712, 'loss/train': 2.0505354404449463} 11/07/2021 14:02:47 - INFO - __main__ - Step 119714: {'lr': 4.9910089271405e-05, 'samples': 22985088, 'steps': 119713, 'loss/train': 1.108872413635254} 11/07/2021 14:02:48 - INFO - __main__ - Step 119715: {'lr': 4.9906907816176405e-05, 'samples': 22985280, 'steps': 119714, 'loss/train': 1.22661292552948} 11/07/2021 14:02:48 - INFO - __main__ - Step 119716: {'lr': 4.990372645110663e-05, 'samples': 22985472, 'steps': 119715, 'loss/train': 1.1231110095977783} 11/07/2021 14:02:48 - INFO - __main__ - Step 119717: {'lr': 4.990054517619724e-05, 'samples': 22985664, 'steps': 119716, 'loss/train': 1.3262786865234375} 11/07/2021 14:02:49 - INFO - __main__ - Step 119718: {'lr': 4.989736399144954e-05, 'samples': 22985856, 'steps': 119717, 'loss/train': 1.3885091543197632} 11/07/2021 14:02:50 - INFO - __main__ - Step 119719: {'lr': 4.989418289686495e-05, 'samples': 22986048, 'steps': 119718, 'loss/train': 1.2739009857177734} 11/07/2021 14:02:50 - INFO - __main__ - Step 119720: {'lr': 4.989100189244497e-05, 'samples': 22986240, 'steps': 119719, 'loss/train': 1.1242645978927612} 11/07/2021 14:02:50 - INFO - __main__ - Step 119721: {'lr': 4.988782097819103e-05, 'samples': 22986432, 'steps': 119720, 'loss/train': 0.7252048850059509} 11/07/2021 14:02:51 - INFO - __main__ - Step 119722: {'lr': 4.9884640154104515e-05, 'samples': 22986624, 'steps': 119721, 'loss/train': 2.093085289001465} 11/07/2021 14:02:51 - INFO - __main__ - Step 119723: {'lr': 4.988145942018693e-05, 'samples': 22986816, 'steps': 119722, 'loss/train': 1.5568578243255615} 11/07/2021 14:02:52 - INFO - __main__ - Step 119724: {'lr': 4.987827877643966e-05, 'samples': 22987008, 'steps': 119723, 'loss/train': 1.4374942779541016} 11/07/2021 14:02:52 - INFO - __main__ - Step 119725: {'lr': 4.987509822286413e-05, 'samples': 22987200, 'steps': 119724, 'loss/train': 1.3118865489959717} 11/07/2021 14:02:53 - INFO - __main__ - Step 119726: {'lr': 4.9871917759461815e-05, 'samples': 22987392, 'steps': 119725, 'loss/train': 1.3580305576324463} 11/07/2021 14:02:53 - INFO - __main__ - Step 119727: {'lr': 4.986873738623412e-05, 'samples': 22987584, 'steps': 119726, 'loss/train': 0.7182621359825134} 11/07/2021 14:02:53 - INFO - __main__ - Step 119728: {'lr': 4.9865557103182465e-05, 'samples': 22987776, 'steps': 119727, 'loss/train': 0.7991275787353516} 11/07/2021 14:02:54 - INFO - __main__ - Step 119729: {'lr': 4.986237691030834e-05, 'samples': 22987968, 'steps': 119728, 'loss/train': 1.0360465049743652} 11/07/2021 14:02:55 - INFO - __main__ - Step 119730: {'lr': 4.985919680761311e-05, 'samples': 22988160, 'steps': 119729, 'loss/train': 1.2126777172088623} 11/07/2021 14:02:55 - INFO - __main__ - Step 119731: {'lr': 4.9856016795098324e-05, 'samples': 22988352, 'steps': 119730, 'loss/train': 1.399524211883545} 11/07/2021 14:02:56 - INFO - __main__ - Step 119732: {'lr': 4.9852836872765236e-05, 'samples': 22988544, 'steps': 119731, 'loss/train': 1.3561477661132812} 11/07/2021 14:02:56 - INFO - __main__ - Step 119733: {'lr': 4.984965704061539e-05, 'samples': 22988736, 'steps': 119732, 'loss/train': 1.6670241355895996} 11/07/2021 14:02:57 - INFO - __main__ - Step 119734: {'lr': 4.984647729865019e-05, 'samples': 22988928, 'steps': 119733, 'loss/train': 1.8107177019119263} 11/07/2021 14:02:57 - INFO - __main__ - Step 119735: {'lr': 4.9843297646871096e-05, 'samples': 22989120, 'steps': 119734, 'loss/train': 1.2791430950164795} 11/07/2021 14:02:58 - INFO - __main__ - Step 119736: {'lr': 4.984011808527952e-05, 'samples': 22989312, 'steps': 119735, 'loss/train': 1.213609218597412} 11/07/2021 14:02:58 - INFO - __main__ - Step 119737: {'lr': 4.983693861387689e-05, 'samples': 22989504, 'steps': 119736, 'loss/train': 1.3927114009857178} 11/07/2021 14:02:58 - INFO - __main__ - Step 119738: {'lr': 4.983375923266464e-05, 'samples': 22989696, 'steps': 119737, 'loss/train': 1.228415846824646} 11/07/2021 14:02:59 - INFO - __main__ - Step 119739: {'lr': 4.983057994164422e-05, 'samples': 22989888, 'steps': 119738, 'loss/train': 0.6913363933563232} 11/07/2021 14:03:00 - INFO - __main__ - Step 119740: {'lr': 4.982740074081704e-05, 'samples': 22990080, 'steps': 119739, 'loss/train': 1.7113373279571533} 11/07/2021 14:03:00 - INFO - __main__ - Step 119741: {'lr': 4.9824221630184544e-05, 'samples': 22990272, 'steps': 119740, 'loss/train': 1.4122239351272583} 11/07/2021 14:03:00 - INFO - __main__ - Step 119742: {'lr': 4.982104260974818e-05, 'samples': 22990464, 'steps': 119741, 'loss/train': 1.7146230936050415} 11/07/2021 14:03:01 - INFO - __main__ - Step 119743: {'lr': 4.9817863679509354e-05, 'samples': 22990656, 'steps': 119742, 'loss/train': 1.3661390542984009} 11/07/2021 14:03:02 - INFO - __main__ - Step 119744: {'lr': 4.9814684839469606e-05, 'samples': 22990848, 'steps': 119743, 'loss/train': 1.5556284189224243} 11/07/2021 14:03:02 - INFO - __main__ - Step 119745: {'lr': 4.981150608963017e-05, 'samples': 22991040, 'steps': 119744, 'loss/train': 0.8527435064315796} 11/07/2021 14:03:02 - INFO - __main__ - Step 119746: {'lr': 4.9808327429992585e-05, 'samples': 22991232, 'steps': 119745, 'loss/train': 1.1650702953338623} 11/07/2021 14:03:03 - INFO - __main__ - Step 119747: {'lr': 4.980514886055829e-05, 'samples': 22991424, 'steps': 119746, 'loss/train': 1.250974178314209} 11/07/2021 14:03:03 - INFO - __main__ - Step 119748: {'lr': 4.980197038132869e-05, 'samples': 22991616, 'steps': 119747, 'loss/train': 1.2583180665969849} 11/07/2021 14:03:04 - INFO - __main__ - Step 119749: {'lr': 4.979879199230525e-05, 'samples': 22991808, 'steps': 119748, 'loss/train': 1.4304029941558838} 11/07/2021 14:03:05 - INFO - __main__ - Step 119750: {'lr': 4.979561369348939e-05, 'samples': 22992000, 'steps': 119749, 'loss/train': 1.9277511835098267} 11/07/2021 14:03:05 - INFO - __main__ - Step 119751: {'lr': 4.979243548488252e-05, 'samples': 22992192, 'steps': 119750, 'loss/train': 1.1273627281188965} 11/07/2021 14:03:05 - INFO - __main__ - Step 119752: {'lr': 4.9789257366486094e-05, 'samples': 22992384, 'steps': 119751, 'loss/train': 1.2055350542068481} 11/07/2021 14:03:06 - INFO - __main__ - Step 119753: {'lr': 4.978607933830154e-05, 'samples': 22992576, 'steps': 119752, 'loss/train': 1.6350023746490479} 11/07/2021 14:03:06 - INFO - __main__ - Step 119754: {'lr': 4.9782901400330285e-05, 'samples': 22992768, 'steps': 119753, 'loss/train': 1.3482905626296997} 11/07/2021 14:03:07 - INFO - __main__ - Step 119755: {'lr': 4.9779723552573764e-05, 'samples': 22992960, 'steps': 119754, 'loss/train': 1.33794105052948} 11/07/2021 14:03:07 - INFO - __main__ - Step 119756: {'lr': 4.9776545795033435e-05, 'samples': 22993152, 'steps': 119755, 'loss/train': 1.1600250005722046} 11/07/2021 14:03:08 - INFO - __main__ - Step 119757: {'lr': 4.977336812771074e-05, 'samples': 22993344, 'steps': 119756, 'loss/train': 1.548789620399475} 11/07/2021 14:03:08 - INFO - __main__ - Step 119758: {'lr': 4.9770190550607024e-05, 'samples': 22993536, 'steps': 119757, 'loss/train': 1.1311650276184082} 11/07/2021 14:03:08 - INFO - __main__ - Step 119759: {'lr': 4.9767013063723775e-05, 'samples': 22993728, 'steps': 119758, 'loss/train': 1.5349782705307007} 11/07/2021 14:03:10 - INFO - __main__ - Step 119760: {'lr': 4.9763835667062416e-05, 'samples': 22993920, 'steps': 119759, 'loss/train': 1.2352733612060547} 11/07/2021 14:03:10 - INFO - __main__ - Step 119761: {'lr': 4.976065836062435e-05, 'samples': 22994112, 'steps': 119760, 'loss/train': 1.3058619499206543} 11/07/2021 14:03:10 - INFO - __main__ - Step 119762: {'lr': 4.975748114441109e-05, 'samples': 22994304, 'steps': 119761, 'loss/train': 1.3143389225006104} 11/07/2021 14:03:11 - INFO - __main__ - Step 119763: {'lr': 4.9754304018423986e-05, 'samples': 22994496, 'steps': 119762, 'loss/train': 1.2050288915634155} 11/07/2021 14:03:11 - INFO - __main__ - Step 119764: {'lr': 4.9751126982664515e-05, 'samples': 22994688, 'steps': 119763, 'loss/train': 0.3529549837112427} 11/07/2021 14:03:12 - INFO - __main__ - Step 119765: {'lr': 4.9747950037134116e-05, 'samples': 22994880, 'steps': 119764, 'loss/train': 1.4505879878997803} 11/07/2021 14:03:12 - INFO - __main__ - Step 119766: {'lr': 4.974477318183418e-05, 'samples': 22995072, 'steps': 119765, 'loss/train': 1.362607479095459} 11/07/2021 14:03:13 - INFO - __main__ - Step 119767: {'lr': 4.974159641676615e-05, 'samples': 22995264, 'steps': 119766, 'loss/train': 1.3021200895309448} 11/07/2021 14:03:13 - INFO - __main__ - Step 119768: {'lr': 4.973841974193147e-05, 'samples': 22995456, 'steps': 119767, 'loss/train': 1.4949851036071777} 11/07/2021 14:03:13 - INFO - __main__ - Step 119769: {'lr': 4.9735243157331574e-05, 'samples': 22995648, 'steps': 119768, 'loss/train': 1.302217960357666} 11/07/2021 14:03:14 - INFO - __main__ - Step 119770: {'lr': 4.9732066662967895e-05, 'samples': 22995840, 'steps': 119769, 'loss/train': 1.60959792137146} 11/07/2021 14:03:15 - INFO - __main__ - Step 119771: {'lr': 4.972889025884192e-05, 'samples': 22996032, 'steps': 119770, 'loss/train': 1.6958171129226685} 11/07/2021 14:03:15 - INFO - __main__ - Step 119772: {'lr': 4.9725713944954956e-05, 'samples': 22996224, 'steps': 119771, 'loss/train': 1.3031189441680908} 11/07/2021 14:03:15 - INFO - __main__ - Step 119773: {'lr': 4.972253772130847e-05, 'samples': 22996416, 'steps': 119772, 'loss/train': 1.5729349851608276} 11/07/2021 14:03:16 - INFO - __main__ - Step 119774: {'lr': 4.9719361587903936e-05, 'samples': 22996608, 'steps': 119773, 'loss/train': 0.7342546582221985} 11/07/2021 14:03:16 - INFO - __main__ - Step 119775: {'lr': 4.9716185544742774e-05, 'samples': 22996800, 'steps': 119774, 'loss/train': 1.3893589973449707} 11/07/2021 14:03:17 - INFO - __main__ - Step 119776: {'lr': 4.9713009591826394e-05, 'samples': 22996992, 'steps': 119775, 'loss/train': 1.1453932523727417} 11/07/2021 14:03:18 - INFO - __main__ - Step 119777: {'lr': 4.9709833729156244e-05, 'samples': 22997184, 'steps': 119776, 'loss/train': 1.283674955368042} 11/07/2021 14:03:18 - INFO - __main__ - Step 119778: {'lr': 4.970665795673376e-05, 'samples': 22997376, 'steps': 119777, 'loss/train': 0.7218777537345886} 11/07/2021 14:03:18 - INFO - __main__ - Step 119779: {'lr': 4.970348227456034e-05, 'samples': 22997568, 'steps': 119778, 'loss/train': 1.583383560180664} 11/07/2021 14:03:19 - INFO - __main__ - Step 119780: {'lr': 4.970030668263748e-05, 'samples': 22997760, 'steps': 119779, 'loss/train': 1.3015855550765991} 11/07/2021 14:03:20 - INFO - __main__ - Step 119781: {'lr': 4.969713118096656e-05, 'samples': 22997952, 'steps': 119780, 'loss/train': 1.5833139419555664} 11/07/2021 14:03:20 - INFO - __main__ - Step 119782: {'lr': 4.9693955769548995e-05, 'samples': 22998144, 'steps': 119781, 'loss/train': 1.1668199300765991} 11/07/2021 14:03:20 - INFO - __main__ - Step 119783: {'lr': 4.969078044838626e-05, 'samples': 22998336, 'steps': 119782, 'loss/train': 1.0079495906829834} 11/07/2021 14:03:21 - INFO - __main__ - Step 119784: {'lr': 4.968760521747984e-05, 'samples': 22998528, 'steps': 119783, 'loss/train': 1.5419692993164062} 11/07/2021 14:03:21 - INFO - __main__ - Step 119785: {'lr': 4.9684430076831015e-05, 'samples': 22998720, 'steps': 119784, 'loss/train': 1.1521986722946167} 11/07/2021 14:03:22 - INFO - __main__ - Step 119786: {'lr': 4.9681255026441304e-05, 'samples': 22998912, 'steps': 119785, 'loss/train': 1.4737874269485474} 11/07/2021 14:03:22 - INFO - __main__ - Step 119787: {'lr': 4.9678080066312136e-05, 'samples': 22999104, 'steps': 119786, 'loss/train': 1.1663808822631836} 11/07/2021 14:03:23 - INFO - __main__ - Step 119788: {'lr': 4.9674905196444906e-05, 'samples': 22999296, 'steps': 119787, 'loss/train': 1.39835524559021} 11/07/2021 14:03:23 - INFO - __main__ - Step 119789: {'lr': 4.967173041684112e-05, 'samples': 22999488, 'steps': 119788, 'loss/train': 1.1072267293930054} 11/07/2021 14:03:23 - INFO - __main__ - Step 119790: {'lr': 4.9668555727502115e-05, 'samples': 22999680, 'steps': 119789, 'loss/train': 1.7011772394180298} 11/07/2021 14:03:25 - INFO - __main__ - Step 119791: {'lr': 4.966538112842939e-05, 'samples': 22999872, 'steps': 119790, 'loss/train': 1.3063300848007202} 11/07/2021 14:03:25 - INFO - __main__ - Step 119792: {'lr': 4.966220661962434e-05, 'samples': 23000064, 'steps': 119791, 'loss/train': 1.2483787536621094} 11/07/2021 14:03:25 - INFO - __main__ - Step 119793: {'lr': 4.9659032201088414e-05, 'samples': 23000256, 'steps': 119792, 'loss/train': 1.641167402267456} 11/07/2021 14:03:26 - INFO - __main__ - Step 119794: {'lr': 4.9655857872823036e-05, 'samples': 23000448, 'steps': 119793, 'loss/train': 1.114798903465271} 11/07/2021 14:03:26 - INFO - __main__ - Step 119795: {'lr': 4.965268363482964e-05, 'samples': 23000640, 'steps': 119794, 'loss/train': 0.6964682936668396} 11/07/2021 14:03:26 - INFO - __main__ - Step 119796: {'lr': 4.9649509487109665e-05, 'samples': 23000832, 'steps': 119795, 'loss/train': 1.3397116661071777} 11/07/2021 14:03:27 - INFO - __main__ - Step 119797: {'lr': 4.964633542966451e-05, 'samples': 23001024, 'steps': 119796, 'loss/train': 0.7082358598709106} 11/07/2021 14:03:28 - INFO - __main__ - Step 119798: {'lr': 4.96431614624957e-05, 'samples': 23001216, 'steps': 119797, 'loss/train': 1.2365386486053467} 11/07/2021 14:03:28 - INFO - __main__ - Step 119799: {'lr': 4.9639987585604505e-05, 'samples': 23001408, 'steps': 119798, 'loss/train': 1.5957286357879639} 11/07/2021 14:03:28 - INFO - __main__ - Step 119800: {'lr': 4.963681379899246e-05, 'samples': 23001600, 'steps': 119799, 'loss/train': 1.4295967817306519} 11/07/2021 14:03:29 - INFO - __main__ - Step 119801: {'lr': 4.963364010266097e-05, 'samples': 23001792, 'steps': 119800, 'loss/train': 1.9904948472976685} 11/07/2021 14:03:30 - INFO - __main__ - Step 119802: {'lr': 4.9630466496611486e-05, 'samples': 23001984, 'steps': 119801, 'loss/train': 1.360348105430603} 11/07/2021 14:03:30 - INFO - __main__ - Step 119803: {'lr': 4.962729298084539e-05, 'samples': 23002176, 'steps': 119802, 'loss/train': 1.8029669523239136} 11/07/2021 14:03:31 - INFO - __main__ - Step 119804: {'lr': 4.962411955536417e-05, 'samples': 23002368, 'steps': 119803, 'loss/train': 0.9965413808822632} 11/07/2021 14:03:31 - INFO - __main__ - Step 119805: {'lr': 4.962094622016922e-05, 'samples': 23002560, 'steps': 119804, 'loss/train': 1.4576205015182495} 11/07/2021 14:03:31 - INFO - __main__ - Step 119806: {'lr': 4.961777297526199e-05, 'samples': 23002752, 'steps': 119805, 'loss/train': 1.36794912815094} 11/07/2021 14:03:32 - INFO - __main__ - Step 119807: {'lr': 4.9614599820643895e-05, 'samples': 23002944, 'steps': 119806, 'loss/train': 1.3646166324615479} 11/07/2021 14:03:33 - INFO - __main__ - Step 119808: {'lr': 4.961142675631636e-05, 'samples': 23003136, 'steps': 119807, 'loss/train': 1.42556631565094} 11/07/2021 14:03:33 - INFO - __main__ - Step 119809: {'lr': 4.960825378228082e-05, 'samples': 23003328, 'steps': 119808, 'loss/train': 1.134358286857605} 11/07/2021 14:03:33 - INFO - __main__ - Step 119810: {'lr': 4.960508089853871e-05, 'samples': 23003520, 'steps': 119809, 'loss/train': 1.6175259351730347} 11/07/2021 14:03:34 - INFO - __main__ - Step 119811: {'lr': 4.9601908105091546e-05, 'samples': 23003712, 'steps': 119810, 'loss/train': 0.5522589683532715} 11/07/2021 14:03:34 - INFO - __main__ - Step 119812: {'lr': 4.959873540194057e-05, 'samples': 23003904, 'steps': 119811, 'loss/train': 0.8823108673095703} 11/07/2021 14:03:36 - INFO - __main__ - Step 119813: {'lr': 4.9595562789087335e-05, 'samples': 23004096, 'steps': 119812, 'loss/train': 1.7206398248672485} 11/07/2021 14:03:36 - INFO - __main__ - Step 119814: {'lr': 4.959239026653326e-05, 'samples': 23004288, 'steps': 119813, 'loss/train': 0.9414799213409424} 11/07/2021 14:03:36 - INFO - __main__ - Step 119815: {'lr': 4.9589217834279724e-05, 'samples': 23004480, 'steps': 119814, 'loss/train': 1.3698415756225586} 11/07/2021 14:03:37 - INFO - __main__ - Step 119816: {'lr': 4.958604549232823e-05, 'samples': 23004672, 'steps': 119815, 'loss/train': 1.3335282802581787} 11/07/2021 14:03:37 - INFO - __main__ - Step 119817: {'lr': 4.9582873240680145e-05, 'samples': 23004864, 'steps': 119816, 'loss/train': 1.4562780857086182} 11/07/2021 14:03:38 - INFO - __main__ - Step 119818: {'lr': 4.957970107933693e-05, 'samples': 23005056, 'steps': 119817, 'loss/train': 1.5116584300994873} 11/07/2021 14:03:38 - INFO - __main__ - Step 119819: {'lr': 4.957652900830001e-05, 'samples': 23005248, 'steps': 119818, 'loss/train': 1.5694596767425537} 11/07/2021 14:03:39 - INFO - __main__ - Step 119820: {'lr': 4.957335702757082e-05, 'samples': 23005440, 'steps': 119819, 'loss/train': 1.4753339290618896} 11/07/2021 14:03:39 - INFO - __main__ - Step 119821: {'lr': 4.957018513715078e-05, 'samples': 23005632, 'steps': 119820, 'loss/train': 1.2653628587722778} 11/07/2021 14:03:40 - INFO - __main__ - Step 119822: {'lr': 4.9567013337041386e-05, 'samples': 23005824, 'steps': 119821, 'loss/train': 1.25702965259552} 11/07/2021 14:03:42 - INFO - __main__ - Step 119823: {'lr': 4.956384162724395e-05, 'samples': 23006016, 'steps': 119822, 'loss/train': 1.116326928138733} 11/07/2021 14:03:42 - INFO - __main__ - Step 119824: {'lr': 4.956067000775993e-05, 'samples': 23006208, 'steps': 119823, 'loss/train': 1.485988736152649} 11/07/2021 14:03:43 - INFO - __main__ - Step 119825: {'lr': 4.955749847859078e-05, 'samples': 23006400, 'steps': 119824, 'loss/train': 1.1438206434249878} 11/07/2021 14:03:43 - INFO - __main__ - Step 119826: {'lr': 4.955432703973794e-05, 'samples': 23006592, 'steps': 119825, 'loss/train': 2.6958165168762207} 11/07/2021 14:03:43 - INFO - __main__ - Step 119827: {'lr': 4.955115569120283e-05, 'samples': 23006784, 'steps': 119826, 'loss/train': 2.6980247497558594} 11/07/2021 14:03:44 - INFO - __main__ - Step 119828: {'lr': 4.954798443298689e-05, 'samples': 23006976, 'steps': 119827, 'loss/train': 2.592339038848877} 11/07/2021 14:03:44 - INFO - __main__ - Step 119829: {'lr': 4.954481326509153e-05, 'samples': 23007168, 'steps': 119828, 'loss/train': 2.371384620666504} 11/07/2021 14:03:44 - INFO - __main__ - Step 119830: {'lr': 4.954164218751817e-05, 'samples': 23007360, 'steps': 119829, 'loss/train': 1.273677945137024} 11/07/2021 14:03:46 - INFO - __main__ - Step 119831: {'lr': 4.953847120026825e-05, 'samples': 23007552, 'steps': 119830, 'loss/train': 1.4132294654846191} 11/07/2021 14:03:46 - INFO - __main__ - Step 119832: {'lr': 4.9535300303343186e-05, 'samples': 23007744, 'steps': 119831, 'loss/train': 1.483150601387024} 11/07/2021 14:03:46 - INFO - __main__ - Step 119833: {'lr': 4.953212949674452e-05, 'samples': 23007936, 'steps': 119832, 'loss/train': 0.9880454540252686} 11/07/2021 14:03:47 - INFO - __main__ - Step 119834: {'lr': 4.95289587804735e-05, 'samples': 23008128, 'steps': 119833, 'loss/train': 0.4081268012523651} 11/07/2021 14:03:47 - INFO - __main__ - Step 119835: {'lr': 4.9525788154531654e-05, 'samples': 23008320, 'steps': 119834, 'loss/train': 1.2893879413604736} 11/07/2021 14:03:48 - INFO - __main__ - Step 119836: {'lr': 4.952261761892038e-05, 'samples': 23008512, 'steps': 119835, 'loss/train': 1.417372226715088} 11/07/2021 14:03:48 - INFO - __main__ - Step 119837: {'lr': 4.9519447173641125e-05, 'samples': 23008704, 'steps': 119836, 'loss/train': 1.4265681505203247} 11/07/2021 14:03:49 - INFO - __main__ - Step 119838: {'lr': 4.9516276818695304e-05, 'samples': 23008896, 'steps': 119837, 'loss/train': 1.9164036512374878} 11/07/2021 14:03:49 - INFO - __main__ - Step 119839: {'lr': 4.951310655408436e-05, 'samples': 23009088, 'steps': 119838, 'loss/train': 1.4985954761505127} 11/07/2021 14:03:49 - INFO - __main__ - Step 119840: {'lr': 4.950993637980972e-05, 'samples': 23009280, 'steps': 119839, 'loss/train': 1.9063161611557007} 11/07/2021 14:03:50 - INFO - __main__ - Step 119841: {'lr': 4.950676629587281e-05, 'samples': 23009472, 'steps': 119840, 'loss/train': 1.4661310911178589} 11/07/2021 14:03:51 - INFO - __main__ - Step 119842: {'lr': 4.950359630227505e-05, 'samples': 23009664, 'steps': 119841, 'loss/train': 1.5975191593170166} 11/07/2021 14:03:51 - INFO - __main__ - Step 119843: {'lr': 4.950042639901789e-05, 'samples': 23009856, 'steps': 119842, 'loss/train': 1.142392635345459} 11/07/2021 14:03:51 - INFO - __main__ - Step 119844: {'lr': 4.94972565861028e-05, 'samples': 23010048, 'steps': 119843, 'loss/train': 1.1586216688156128} 11/07/2021 14:03:52 - INFO - __main__ - Step 119845: {'lr': 4.94940868635311e-05, 'samples': 23010240, 'steps': 119844, 'loss/train': 0.951343297958374} 11/07/2021 14:03:53 - INFO - __main__ - Step 119846: {'lr': 4.949091723130425e-05, 'samples': 23010432, 'steps': 119845, 'loss/train': 0.8041165471076965} 11/07/2021 14:03:54 - INFO - __main__ - Step 119847: {'lr': 4.948774768942371e-05, 'samples': 23010624, 'steps': 119846, 'loss/train': 1.7547677755355835} 11/07/2021 14:03:54 - INFO - __main__ - Step 119848: {'lr': 4.94845782378909e-05, 'samples': 23010816, 'steps': 119847, 'loss/train': 1.3907002210617065} 11/07/2021 14:03:54 - INFO - __main__ - Step 119849: {'lr': 4.948140887670724e-05, 'samples': 23011008, 'steps': 119848, 'loss/train': 0.8047236800193787} 11/07/2021 14:03:55 - INFO - __main__ - Step 119850: {'lr': 4.9478239605874166e-05, 'samples': 23011200, 'steps': 119849, 'loss/train': 0.5578262209892273} 11/07/2021 14:03:56 - INFO - __main__ - Step 119851: {'lr': 4.947507042539309e-05, 'samples': 23011392, 'steps': 119850, 'loss/train': 1.376939296722412} 11/07/2021 14:03:56 - INFO - __main__ - Step 119852: {'lr': 4.9471901335265465e-05, 'samples': 23011584, 'steps': 119851, 'loss/train': 1.3519970178604126} 11/07/2021 14:03:56 - INFO - __main__ - Step 119853: {'lr': 4.9468732335492705e-05, 'samples': 23011776, 'steps': 119852, 'loss/train': 1.3990408182144165} 11/07/2021 14:03:57 - INFO - __main__ - Step 119854: {'lr': 4.946556342607622e-05, 'samples': 23011968, 'steps': 119853, 'loss/train': 1.4801278114318848} 11/07/2021 14:03:57 - INFO - __main__ - Step 119855: {'lr': 4.946239460701757e-05, 'samples': 23012160, 'steps': 119854, 'loss/train': 1.3224072456359863} 11/07/2021 14:03:58 - INFO - __main__ - Step 119856: {'lr': 4.945922587831797e-05, 'samples': 23012352, 'steps': 119855, 'loss/train': 1.511900782585144} 11/07/2021 14:03:58 - INFO - __main__ - Step 119857: {'lr': 4.9456057239978954e-05, 'samples': 23012544, 'steps': 119856, 'loss/train': 1.2995033264160156} 11/07/2021 14:03:59 - INFO - __main__ - Step 119858: {'lr': 4.945288869200193e-05, 'samples': 23012736, 'steps': 119857, 'loss/train': 1.5760149955749512} 11/07/2021 14:03:59 - INFO - __main__ - Step 119859: {'lr': 4.944972023438837e-05, 'samples': 23012928, 'steps': 119858, 'loss/train': 0.5743430256843567} 11/07/2021 14:03:59 - INFO - __main__ - Step 119860: {'lr': 4.944655186713965e-05, 'samples': 23013120, 'steps': 119859, 'loss/train': 1.2944531440734863} 11/07/2021 14:04:00 - INFO - __main__ - Step 119861: {'lr': 4.944338359025724e-05, 'samples': 23013312, 'steps': 119860, 'loss/train': 1.3093711137771606} 11/07/2021 14:04:01 - INFO - __main__ - Step 119862: {'lr': 4.944021540374252e-05, 'samples': 23013504, 'steps': 119861, 'loss/train': 1.5284205675125122} 11/07/2021 14:04:01 - INFO - __main__ - Step 119863: {'lr': 4.943704730759696e-05, 'samples': 23013696, 'steps': 119862, 'loss/train': 1.4064710140228271} 11/07/2021 14:04:02 - INFO - __main__ - Step 119864: {'lr': 4.943387930182197e-05, 'samples': 23013888, 'steps': 119863, 'loss/train': 1.364735722541809} 11/07/2021 14:04:02 - INFO - __main__ - Step 119865: {'lr': 4.9430711386419023e-05, 'samples': 23014080, 'steps': 119864, 'loss/train': 1.4672452211380005} 11/07/2021 14:04:03 - INFO - __main__ - Step 119866: {'lr': 4.942754356138948e-05, 'samples': 23014272, 'steps': 119865, 'loss/train': 1.3975669145584106} 11/07/2021 14:04:03 - INFO - __main__ - Step 119867: {'lr': 4.942437582673476e-05, 'samples': 23014464, 'steps': 119866, 'loss/train': 1.2964383363723755} 11/07/2021 14:04:04 - INFO - __main__ - Step 119868: {'lr': 4.942120818245632e-05, 'samples': 23014656, 'steps': 119867, 'loss/train': 1.4352976083755493} 11/07/2021 14:04:04 - INFO - __main__ - Step 119869: {'lr': 4.9418040628555594e-05, 'samples': 23014848, 'steps': 119868, 'loss/train': 1.0221186876296997} 11/07/2021 14:04:04 - INFO - __main__ - Step 119870: {'lr': 4.9414873165034015e-05, 'samples': 23015040, 'steps': 119869, 'loss/train': 1.0221447944641113} 11/07/2021 14:04:05 - INFO - __main__ - Step 119871: {'lr': 4.9411705791893e-05, 'samples': 23015232, 'steps': 119870, 'loss/train': 1.5580384731292725} 11/07/2021 14:04:06 - INFO - __main__ - Step 119872: {'lr': 4.940853850913396e-05, 'samples': 23015424, 'steps': 119871, 'loss/train': 1.580344319343567} 11/07/2021 14:04:06 - INFO - __main__ - Step 119873: {'lr': 4.9405371316758345e-05, 'samples': 23015616, 'steps': 119872, 'loss/train': 1.3346052169799805} 11/07/2021 14:04:07 - INFO - __main__ - Step 119874: {'lr': 4.940220421476757e-05, 'samples': 23015808, 'steps': 119873, 'loss/train': 1.4424526691436768} 11/07/2021 14:04:07 - INFO - __main__ - Step 119875: {'lr': 4.9399037203163075e-05, 'samples': 23016000, 'steps': 119874, 'loss/train': 1.0571653842926025} 11/07/2021 14:04:07 - INFO - __main__ - Step 119876: {'lr': 4.939587028194625e-05, 'samples': 23016192, 'steps': 119875, 'loss/train': 1.399951457977295} 11/07/2021 14:04:08 - INFO - __main__ - Step 119877: {'lr': 4.939270345111857e-05, 'samples': 23016384, 'steps': 119876, 'loss/train': 1.4421533346176147} 11/07/2021 14:04:09 - INFO - __main__ - Step 119878: {'lr': 4.9389536710681524e-05, 'samples': 23016576, 'steps': 119877, 'loss/train': 1.8451573848724365} 11/07/2021 14:04:09 - INFO - __main__ - Step 119879: {'lr': 4.938637006063637e-05, 'samples': 23016768, 'steps': 119878, 'loss/train': 1.663635015487671} 11/07/2021 14:04:09 - INFO - __main__ - Step 119880: {'lr': 4.938320350098463e-05, 'samples': 23016960, 'steps': 119879, 'loss/train': 1.0949009656906128} 11/07/2021 14:04:10 - INFO - __main__ - Step 119881: {'lr': 4.938003703172772e-05, 'samples': 23017152, 'steps': 119880, 'loss/train': 0.9271994829177856} 11/07/2021 14:04:11 - INFO - __main__ - Step 119882: {'lr': 4.9376870652867086e-05, 'samples': 23017344, 'steps': 119881, 'loss/train': 1.0064747333526611} 11/07/2021 14:04:11 - INFO - __main__ - Step 119883: {'lr': 4.9373704364404106e-05, 'samples': 23017536, 'steps': 119882, 'loss/train': 1.5280479192733765} 11/07/2021 14:04:11 - INFO - __main__ - Step 119884: {'lr': 4.937053816634027e-05, 'samples': 23017728, 'steps': 119883, 'loss/train': 1.586579442024231} 11/07/2021 14:04:12 - INFO - __main__ - Step 119885: {'lr': 4.9367372058676975e-05, 'samples': 23017920, 'steps': 119884, 'loss/train': 1.5736652612686157} 11/07/2021 14:04:12 - INFO - __main__ - Step 119886: {'lr': 4.9364206041415616e-05, 'samples': 23018112, 'steps': 119885, 'loss/train': 1.3944153785705566} 11/07/2021 14:04:13 - INFO - __main__ - Step 119887: {'lr': 4.936104011455766e-05, 'samples': 23018304, 'steps': 119886, 'loss/train': 1.7173203229904175} 11/07/2021 14:04:14 - INFO - __main__ - Step 119888: {'lr': 4.935787427810454e-05, 'samples': 23018496, 'steps': 119887, 'loss/train': 1.3415884971618652} 11/07/2021 14:04:14 - INFO - __main__ - Step 119889: {'lr': 4.9354708532057646e-05, 'samples': 23018688, 'steps': 119888, 'loss/train': 1.3537428379058838} 11/07/2021 14:04:14 - INFO - __main__ - Step 119890: {'lr': 4.9351542876418436e-05, 'samples': 23018880, 'steps': 119889, 'loss/train': 0.9983468651771545} 11/07/2021 14:04:15 - INFO - __main__ - Step 119891: {'lr': 4.9348377311188325e-05, 'samples': 23019072, 'steps': 119890, 'loss/train': 1.2381058931350708} 11/07/2021 14:04:15 - INFO - __main__ - Step 119892: {'lr': 4.934521183636881e-05, 'samples': 23019264, 'steps': 119891, 'loss/train': 1.2463423013687134} 11/07/2021 14:04:16 - INFO - __main__ - Step 119893: {'lr': 4.9342046451961163e-05, 'samples': 23019456, 'steps': 119892, 'loss/train': 1.5147652626037598} 11/07/2021 14:04:16 - INFO - __main__ - Step 119894: {'lr': 4.933888115796689e-05, 'samples': 23019648, 'steps': 119893, 'loss/train': 1.257423996925354} 11/07/2021 14:04:17 - INFO - __main__ - Step 119895: {'lr': 4.933571595438743e-05, 'samples': 23019840, 'steps': 119894, 'loss/train': 1.3079123497009277} 11/07/2021 14:04:17 - INFO - __main__ - Step 119896: {'lr': 4.9332550841224205e-05, 'samples': 23020032, 'steps': 119895, 'loss/train': 1.4107102155685425} 11/07/2021 14:04:17 - INFO - __main__ - Step 119897: {'lr': 4.932938581847865e-05, 'samples': 23020224, 'steps': 119896, 'loss/train': 1.0135314464569092} 11/07/2021 14:04:19 - INFO - __main__ - Step 119898: {'lr': 4.932622088615216e-05, 'samples': 23020416, 'steps': 119897, 'loss/train': 1.352624773979187} 11/07/2021 14:04:19 - INFO - __main__ - Step 119899: {'lr': 4.932305604424617e-05, 'samples': 23020608, 'steps': 119898, 'loss/train': 1.637068271636963} 11/07/2021 14:04:20 - INFO - __main__ - Step 119900: {'lr': 4.931989129276212e-05, 'samples': 23020800, 'steps': 119899, 'loss/train': 1.7067008018493652} 11/07/2021 14:04:20 - INFO - __main__ - Step 119901: {'lr': 4.931672663170145e-05, 'samples': 23020992, 'steps': 119900, 'loss/train': 1.459293246269226} 11/07/2021 14:04:20 - INFO - __main__ - Step 119902: {'lr': 4.931356206106555e-05, 'samples': 23021184, 'steps': 119901, 'loss/train': 0.9936407804489136} 11/07/2021 14:04:21 - INFO - __main__ - Step 119903: {'lr': 4.931039758085587e-05, 'samples': 23021376, 'steps': 119902, 'loss/train': 1.2058346271514893} 11/07/2021 14:04:22 - INFO - __main__ - Step 119904: {'lr': 4.9307233191073805e-05, 'samples': 23021568, 'steps': 119903, 'loss/train': 1.7669081687927246} 11/07/2021 14:04:22 - INFO - __main__ - Step 119905: {'lr': 4.93040688917209e-05, 'samples': 23021760, 'steps': 119904, 'loss/train': 1.5661453008651733} 11/07/2021 14:04:22 - INFO - __main__ - Step 119906: {'lr': 4.930090468279841e-05, 'samples': 23021952, 'steps': 119905, 'loss/train': 1.506068468093872} 11/07/2021 14:04:23 - INFO - __main__ - Step 119907: {'lr': 4.92977405643078e-05, 'samples': 23022144, 'steps': 119906, 'loss/train': 1.0614750385284424} 11/07/2021 14:04:24 - INFO - __main__ - Step 119908: {'lr': 4.929457653625058e-05, 'samples': 23022336, 'steps': 119907, 'loss/train': 1.388953685760498} 11/07/2021 14:04:24 - INFO - __main__ - Step 119909: {'lr': 4.92914125986281e-05, 'samples': 23022528, 'steps': 119908, 'loss/train': 1.074906587600708} 11/07/2021 14:04:24 - INFO - __main__ - Step 119910: {'lr': 4.9288248751441835e-05, 'samples': 23022720, 'steps': 119909, 'loss/train': 1.3404799699783325} 11/07/2021 14:04:25 - INFO - __main__ - Step 119911: {'lr': 4.928508499469317e-05, 'samples': 23022912, 'steps': 119910, 'loss/train': 1.2771432399749756} 11/07/2021 14:04:25 - INFO - __main__ - Step 119912: {'lr': 4.928192132838355e-05, 'samples': 23023104, 'steps': 119911, 'loss/train': 1.4612863063812256} 11/07/2021 14:04:26 - INFO - __main__ - Step 119913: {'lr': 4.927875775251439e-05, 'samples': 23023296, 'steps': 119912, 'loss/train': 1.3442983627319336} 11/07/2021 14:04:27 - INFO - __main__ - Step 119914: {'lr': 4.927559426708714e-05, 'samples': 23023488, 'steps': 119913, 'loss/train': 0.8140401840209961} 11/07/2021 14:04:27 - INFO - __main__ - Step 119915: {'lr': 4.92724308721032e-05, 'samples': 23023680, 'steps': 119914, 'loss/train': 0.5308074355125427} 11/07/2021 14:04:27 - INFO - __main__ - Step 119916: {'lr': 4.9269267567564004e-05, 'samples': 23023872, 'steps': 119915, 'loss/train': 1.1881754398345947} 11/07/2021 14:04:28 - INFO - __main__ - Step 119917: {'lr': 4.926610435347098e-05, 'samples': 23024064, 'steps': 119916, 'loss/train': 1.414641261100769} 11/07/2021 14:04:28 - INFO - __main__ - Step 119918: {'lr': 4.9262941229825556e-05, 'samples': 23024256, 'steps': 119917, 'loss/train': 1.3554744720458984} 11/07/2021 14:04:29 - INFO - __main__ - Step 119919: {'lr': 4.925977819662922e-05, 'samples': 23024448, 'steps': 119918, 'loss/train': 0.8829789757728577} 11/07/2021 14:04:29 - INFO - __main__ - Step 119920: {'lr': 4.925661525388328e-05, 'samples': 23024640, 'steps': 119919, 'loss/train': 1.2732207775115967} 11/07/2021 14:04:30 - INFO - __main__ - Step 119921: {'lr': 4.925345240158918e-05, 'samples': 23024832, 'steps': 119920, 'loss/train': 1.0830212831497192} 11/07/2021 14:04:30 - INFO - __main__ - Step 119922: {'lr': 4.9250289639748395e-05, 'samples': 23025024, 'steps': 119921, 'loss/train': 1.1160242557525635} 11/07/2021 14:04:30 - INFO - __main__ - Step 119923: {'lr': 4.9247126968362336e-05, 'samples': 23025216, 'steps': 119922, 'loss/train': 0.48013150691986084} 11/07/2021 14:04:32 - INFO - __main__ - Step 119924: {'lr': 4.924396438743242e-05, 'samples': 23025408, 'steps': 119923, 'loss/train': 0.48832976818084717} 11/07/2021 14:04:32 - INFO - __main__ - Step 119925: {'lr': 4.9240801896960065e-05, 'samples': 23025600, 'steps': 119924, 'loss/train': 1.4954125881195068} 11/07/2021 14:04:32 - INFO - __main__ - Step 119926: {'lr': 4.9237639496946706e-05, 'samples': 23025792, 'steps': 119925, 'loss/train': 1.4177426099777222} 11/07/2021 14:04:33 - INFO - __main__ - Step 119927: {'lr': 4.9234477187393796e-05, 'samples': 23025984, 'steps': 119926, 'loss/train': 1.1942856311798096} 11/07/2021 14:04:33 - INFO - __main__ - Step 119928: {'lr': 4.923131496830269e-05, 'samples': 23026176, 'steps': 119927, 'loss/train': 1.2622361183166504} 11/07/2021 14:04:34 - INFO - __main__ - Step 119929: {'lr': 4.922815283967489e-05, 'samples': 23026368, 'steps': 119928, 'loss/train': 1.3785158395767212} 11/07/2021 14:04:34 - INFO - __main__ - Step 119930: {'lr': 4.9224990801511774e-05, 'samples': 23026560, 'steps': 119929, 'loss/train': 1.575429081916809} 11/07/2021 14:04:35 - INFO - __main__ - Step 119931: {'lr': 4.9221828853814795e-05, 'samples': 23026752, 'steps': 119930, 'loss/train': 2.2240657806396484} 11/07/2021 14:04:35 - INFO - __main__ - Step 119932: {'lr': 4.921866699658539e-05, 'samples': 23026944, 'steps': 119931, 'loss/train': 1.1842639446258545} 11/07/2021 14:04:35 - INFO - __main__ - Step 119933: {'lr': 4.921550522982493e-05, 'samples': 23027136, 'steps': 119932, 'loss/train': 0.8317989110946655} 11/07/2021 14:04:36 - INFO - __main__ - Step 119934: {'lr': 4.921234355353482e-05, 'samples': 23027328, 'steps': 119933, 'loss/train': 1.4114670753479004} 11/07/2021 14:04:37 - INFO - __main__ - Step 119935: {'lr': 4.9209181967716566e-05, 'samples': 23027520, 'steps': 119934, 'loss/train': 0.9254234433174133} 11/07/2021 14:04:37 - INFO - __main__ - Step 119936: {'lr': 4.920602047237155e-05, 'samples': 23027712, 'steps': 119935, 'loss/train': 1.292459487915039} 11/07/2021 14:04:38 - INFO - __main__ - Step 119937: {'lr': 4.920285906750122e-05, 'samples': 23027904, 'steps': 119936, 'loss/train': 1.5442180633544922} 11/07/2021 14:04:38 - INFO - __main__ - Step 119938: {'lr': 4.9199697753106956e-05, 'samples': 23028096, 'steps': 119937, 'loss/train': 1.5812188386917114} 11/07/2021 14:04:39 - INFO - __main__ - Step 119939: {'lr': 4.9196536529190204e-05, 'samples': 23028288, 'steps': 119938, 'loss/train': 1.5602816343307495} 11/07/2021 14:04:40 - INFO - __main__ - Step 119940: {'lr': 4.9193375395752415e-05, 'samples': 23028480, 'steps': 119939, 'loss/train': 1.5262950658798218} 11/07/2021 14:04:40 - INFO - __main__ - Step 119941: {'lr': 4.919021435279497e-05, 'samples': 23028672, 'steps': 119940, 'loss/train': 1.1702313423156738} 11/07/2021 14:04:40 - INFO - __main__ - Step 119942: {'lr': 4.9187053400319345e-05, 'samples': 23028864, 'steps': 119941, 'loss/train': 0.3290627598762512} 11/07/2021 14:04:41 - INFO - __main__ - Step 119943: {'lr': 4.918389253832692e-05, 'samples': 23029056, 'steps': 119942, 'loss/train': 1.5829188823699951} 11/07/2021 14:04:41 - INFO - __main__ - Step 119944: {'lr': 4.9180731766819117e-05, 'samples': 23029248, 'steps': 119943, 'loss/train': 2.7144482135772705} 11/07/2021 14:04:42 - INFO - __main__ - Step 119945: {'lr': 4.91775710857974e-05, 'samples': 23029440, 'steps': 119944, 'loss/train': 1.4596788883209229} 11/07/2021 14:04:42 - INFO - __main__ - Step 119946: {'lr': 4.917441049526322e-05, 'samples': 23029632, 'steps': 119945, 'loss/train': 1.1498539447784424} 11/07/2021 14:04:43 - INFO - __main__ - Step 119947: {'lr': 4.917124999521791e-05, 'samples': 23029824, 'steps': 119946, 'loss/train': 1.799602746963501} 11/07/2021 14:04:43 - INFO - __main__ - Step 119948: {'lr': 4.916808958566293e-05, 'samples': 23030016, 'steps': 119947, 'loss/train': 1.6325814723968506} 11/07/2021 14:04:44 - INFO - __main__ - Step 119949: {'lr': 4.916492926659968e-05, 'samples': 23030208, 'steps': 119948, 'loss/train': 1.5958445072174072} 11/07/2021 14:04:44 - INFO - __main__ - Step 119950: {'lr': 4.916176903802966e-05, 'samples': 23030400, 'steps': 119949, 'loss/train': 0.8829160928726196} 11/07/2021 14:04:45 - INFO - __main__ - Step 119951: {'lr': 4.915860889995421e-05, 'samples': 23030592, 'steps': 119950, 'loss/train': 1.1549561023712158} 11/07/2021 14:04:45 - INFO - __main__ - Step 119952: {'lr': 4.91554488523748e-05, 'samples': 23030784, 'steps': 119951, 'loss/train': 1.2783335447311401} 11/07/2021 14:04:46 - INFO - __main__ - Step 119953: {'lr': 4.915228889529286e-05, 'samples': 23030976, 'steps': 119952, 'loss/train': 1.27994966506958} 11/07/2021 14:04:46 - INFO - __main__ - Step 119954: {'lr': 4.91491290287098e-05, 'samples': 23031168, 'steps': 119953, 'loss/train': 1.2136046886444092} 11/07/2021 14:04:46 - INFO - __main__ - Step 119955: {'lr': 4.9145969252627016e-05, 'samples': 23031360, 'steps': 119954, 'loss/train': 1.3886679410934448} 11/07/2021 14:04:47 - INFO - __main__ - Step 119956: {'lr': 4.9142809567045974e-05, 'samples': 23031552, 'steps': 119955, 'loss/train': 0.7069751620292664} 11/07/2021 14:04:48 - INFO - __main__ - Step 119957: {'lr': 4.91396499719681e-05, 'samples': 23031744, 'steps': 119956, 'loss/train': 1.2022769451141357} 11/07/2021 14:04:48 - INFO - __main__ - Step 119958: {'lr': 4.9136490467394765e-05, 'samples': 23031936, 'steps': 119957, 'loss/train': 1.5155476331710815} 11/07/2021 14:04:48 - INFO - __main__ - Step 119959: {'lr': 4.913333105332754e-05, 'samples': 23032128, 'steps': 119958, 'loss/train': 1.4525641202926636} 11/07/2021 14:04:49 - INFO - __main__ - Step 119960: {'lr': 4.913017172976764e-05, 'samples': 23032320, 'steps': 119959, 'loss/train': 1.4187198877334595} 11/07/2021 14:04:50 - INFO - __main__ - Step 119961: {'lr': 4.9127012496716585e-05, 'samples': 23032512, 'steps': 119960, 'loss/train': 1.5251394510269165} 11/07/2021 14:04:50 - INFO - __main__ - Step 119962: {'lr': 4.91238533541758e-05, 'samples': 23032704, 'steps': 119961, 'loss/train': 1.2831227779388428} 11/07/2021 14:04:51 - INFO - __main__ - Step 119963: {'lr': 4.912069430214672e-05, 'samples': 23032896, 'steps': 119962, 'loss/train': 0.9218352437019348} 11/07/2021 14:04:51 - INFO - __main__ - Step 119964: {'lr': 4.9117535340630736e-05, 'samples': 23033088, 'steps': 119963, 'loss/train': 1.3044729232788086} 11/07/2021 14:04:51 - INFO - __main__ - Step 119965: {'lr': 4.91143764696293e-05, 'samples': 23033280, 'steps': 119964, 'loss/train': 1.4355762004852295} 11/07/2021 14:04:52 - INFO - __main__ - Step 119966: {'lr': 4.911121768914381e-05, 'samples': 23033472, 'steps': 119965, 'loss/train': 1.7348923683166504} 11/07/2021 14:04:53 - INFO - __main__ - Step 119967: {'lr': 4.910805899917573e-05, 'samples': 23033664, 'steps': 119966, 'loss/train': 1.172661542892456} 11/07/2021 14:04:53 - INFO - __main__ - Step 119968: {'lr': 4.9104900399726454e-05, 'samples': 23033856, 'steps': 119967, 'loss/train': 1.1955256462097168} 11/07/2021 14:04:53 - INFO - __main__ - Step 119969: {'lr': 4.9101741890797415e-05, 'samples': 23034048, 'steps': 119968, 'loss/train': 1.5444697141647339} 11/07/2021 14:04:54 - INFO - __main__ - Step 119970: {'lr': 4.9098583472390015e-05, 'samples': 23034240, 'steps': 119969, 'loss/train': 1.228186011314392} 11/07/2021 14:04:55 - INFO - __main__ - Step 119971: {'lr': 4.909542514450571e-05, 'samples': 23034432, 'steps': 119970, 'loss/train': 1.027446985244751} 11/07/2021 14:04:55 - INFO - __main__ - Step 119972: {'lr': 4.9092266907145884e-05, 'samples': 23034624, 'steps': 119971, 'loss/train': 1.4008082151412964} 11/07/2021 14:04:56 - INFO - __main__ - Step 119973: {'lr': 4.908910876031206e-05, 'samples': 23034816, 'steps': 119972, 'loss/train': 1.7142552137374878} 11/07/2021 14:04:56 - INFO - __main__ - Step 119974: {'lr': 4.908595070400551e-05, 'samples': 23035008, 'steps': 119973, 'loss/train': 1.760067105293274} 11/07/2021 14:04:56 - INFO - __main__ - Step 119975: {'lr': 4.9082792738227745e-05, 'samples': 23035200, 'steps': 119974, 'loss/train': 1.2109333276748657} 11/07/2021 14:04:57 - INFO - __main__ - Step 119976: {'lr': 4.907963486298017e-05, 'samples': 23035392, 'steps': 119975, 'loss/train': 0.5238632559776306} 11/07/2021 14:04:58 - INFO - __main__ - Step 119977: {'lr': 4.907647707826421e-05, 'samples': 23035584, 'steps': 119976, 'loss/train': 1.0258597135543823} 11/07/2021 14:04:58 - INFO - __main__ - Step 119978: {'lr': 4.90733193840813e-05, 'samples': 23035776, 'steps': 119977, 'loss/train': 0.9779492020606995} 11/07/2021 14:04:58 - INFO - __main__ - Step 119979: {'lr': 4.907016178043283e-05, 'samples': 23035968, 'steps': 119978, 'loss/train': 1.3077306747436523} 11/07/2021 14:04:59 - INFO - __main__ - Step 119980: {'lr': 4.9067004267320245e-05, 'samples': 23036160, 'steps': 119979, 'loss/train': 1.1330616474151611} 11/07/2021 14:04:59 - INFO - __main__ - Step 119981: {'lr': 4.906384684474499e-05, 'samples': 23036352, 'steps': 119980, 'loss/train': 1.2067033052444458} 11/07/2021 14:05:00 - INFO - __main__ - Step 119982: {'lr': 4.906068951270845e-05, 'samples': 23036544, 'steps': 119981, 'loss/train': 1.2456773519515991} 11/07/2021 14:05:00 - INFO - __main__ - Step 119983: {'lr': 4.905753227121204e-05, 'samples': 23036736, 'steps': 119982, 'loss/train': 1.2133811712265015} 11/07/2021 14:05:01 - INFO - __main__ - Step 119984: {'lr': 4.905437512025723e-05, 'samples': 23036928, 'steps': 119983, 'loss/train': 1.3989567756652832} 11/07/2021 14:05:01 - INFO - __main__ - Step 119985: {'lr': 4.9051218059845444e-05, 'samples': 23037120, 'steps': 119984, 'loss/train': 1.3537942171096802} 11/07/2021 14:05:02 - INFO - __main__ - Step 119986: {'lr': 4.904806108997811e-05, 'samples': 23037312, 'steps': 119985, 'loss/train': 1.1671839952468872} 11/07/2021 14:05:03 - INFO - __main__ - Step 119987: {'lr': 4.904490421065655e-05, 'samples': 23037504, 'steps': 119986, 'loss/train': 1.2539210319519043} 11/07/2021 14:05:03 - INFO - __main__ - Step 119988: {'lr': 4.904174742188228e-05, 'samples': 23037696, 'steps': 119987, 'loss/train': 1.7269123792648315} 11/07/2021 14:05:03 - INFO - __main__ - Step 119989: {'lr': 4.903859072365666e-05, 'samples': 23037888, 'steps': 119988, 'loss/train': 1.3117371797561646} 11/07/2021 14:05:04 - INFO - __main__ - Step 119990: {'lr': 4.903543411598119e-05, 'samples': 23038080, 'steps': 119989, 'loss/train': 1.6516653299331665} 11/07/2021 14:05:04 - INFO - __main__ - Step 119991: {'lr': 4.9032277598857225e-05, 'samples': 23038272, 'steps': 119990, 'loss/train': 1.0754274129867554} 11/07/2021 14:05:04 - INFO - __main__ - Step 119992: {'lr': 4.902912117228622e-05, 'samples': 23038464, 'steps': 119991, 'loss/train': 1.6682729721069336} 11/07/2021 14:05:05 - INFO - __main__ - Step 119993: {'lr': 4.902596483626959e-05, 'samples': 23038656, 'steps': 119992, 'loss/train': 1.1758676767349243} 11/07/2021 14:05:06 - INFO - __main__ - Step 119994: {'lr': 4.902280859080876e-05, 'samples': 23038848, 'steps': 119993, 'loss/train': 1.7827175855636597} 11/07/2021 14:05:06 - INFO - __main__ - Step 119995: {'lr': 4.9019652435905174e-05, 'samples': 23039040, 'steps': 119994, 'loss/train': 1.3533591032028198} 11/07/2021 14:05:06 - INFO - __main__ - Step 119996: {'lr': 4.901649637156022e-05, 'samples': 23039232, 'steps': 119995, 'loss/train': 1.3292304277420044} 11/07/2021 14:05:07 - INFO - __main__ - Step 119997: {'lr': 4.90133403977753e-05, 'samples': 23039424, 'steps': 119996, 'loss/train': 0.9482015371322632} 11/07/2021 14:05:08 - INFO - __main__ - Step 119998: {'lr': 4.901018451455191e-05, 'samples': 23039616, 'steps': 119997, 'loss/train': 1.1665154695510864} 11/07/2021 14:05:08 - INFO - __main__ - Step 119999: {'lr': 4.9007028721891415e-05, 'samples': 23039808, 'steps': 119998, 'loss/train': 1.0045204162597656} 11/07/2021 14:05:09 - INFO - __main__ - Step 120000: {'lr': 4.900387301979531e-05, 'samples': 23040000, 'steps': 119999, 'loss/train': 1.1156840324401855} 11/07/2021 14:05:09 - INFO - __main__ - Evaluating and saving model checkpoint 11/07/2021 14:08:25 - INFO - __main__ - Step 120000: {'loss/eval': 1.2484334707260132, 'perplexity': 3.484879493713379} 11/07/2021 14:08:42 - WARNING - huggingface_hub.repository - Several commits (8) will be pushed upstream. 11/07/2021 14:08:42 - WARNING - huggingface_hub.repository - The progress bars may be unreliable. 11/07/2021 14:09:04 - WARNING - huggingface_hub.repository - To https://huggingface.co/lvwerra/codeparrot-small c93cc06..b58b427 proud-haze-135 -> proud-haze-135 11/07/2021 14:09:06 - INFO - __main__ - Step 120001: {'lr': 4.900071740826489e-05, 'samples': 23040192, 'steps': 120000, 'loss/train': 1.5684682130813599} 11/07/2021 14:09:06 - INFO - __main__ - Step 120002: {'lr': 4.8997561887301646e-05, 'samples': 23040384, 'steps': 120001, 'loss/train': 1.7844196557998657} 11/07/2021 14:09:07 - INFO - __main__ - Step 120003: {'lr': 4.8994406456907e-05, 'samples': 23040576, 'steps': 120002, 'loss/train': 1.011291742324829} 11/07/2021 14:09:08 - INFO - __main__ - Step 120004: {'lr': 4.899125111708239e-05, 'samples': 23040768, 'steps': 120003, 'loss/train': 1.3767328262329102} 11/07/2021 14:09:08 - INFO - __main__ - Step 120005: {'lr': 4.8988095867829204e-05, 'samples': 23040960, 'steps': 120004, 'loss/train': 1.4766099452972412} 11/07/2021 14:09:08 - INFO - __main__ - Step 120006: {'lr': 4.898494070914889e-05, 'samples': 23041152, 'steps': 120005, 'loss/train': 1.1431798934936523} 11/07/2021 14:09:09 - INFO - __main__ - Step 120007: {'lr': 4.898178564104286e-05, 'samples': 23041344, 'steps': 120006, 'loss/train': 1.230944037437439} 11/07/2021 14:09:10 - INFO - __main__ - Step 120008: {'lr': 4.897863066351252e-05, 'samples': 23041536, 'steps': 120007, 'loss/train': 1.7552257776260376} 11/07/2021 14:09:10 - INFO - __main__ - Step 120009: {'lr': 4.897547577655931e-05, 'samples': 23041728, 'steps': 120008, 'loss/train': 1.320626139640808} 11/07/2021 14:09:10 - INFO - __main__ - Step 120010: {'lr': 4.897232098018467e-05, 'samples': 23041920, 'steps': 120009, 'loss/train': 1.3448165655136108} 11/07/2021 14:09:11 - INFO - __main__ - Step 120011: {'lr': 4.896916627438999e-05, 'samples': 23042112, 'steps': 120010, 'loss/train': 1.1587456464767456} 11/07/2021 14:09:11 - INFO - __main__ - Step 120012: {'lr': 4.896601165917669e-05, 'samples': 23042304, 'steps': 120011, 'loss/train': 1.632805347442627} 11/07/2021 14:09:12 - INFO - __main__ - Step 120013: {'lr': 4.8962857134546265e-05, 'samples': 23042496, 'steps': 120012, 'loss/train': 1.3532031774520874} 11/07/2021 14:09:12 - INFO - __main__ - Step 120014: {'lr': 4.895970270050001e-05, 'samples': 23042688, 'steps': 120013, 'loss/train': 1.467564344406128} 11/07/2021 14:09:13 - INFO - __main__ - Step 120015: {'lr': 4.895654835703944e-05, 'samples': 23042880, 'steps': 120014, 'loss/train': 1.3109745979309082} 11/07/2021 14:09:13 - INFO - __main__ - Step 120016: {'lr': 4.895339410416591e-05, 'samples': 23043072, 'steps': 120015, 'loss/train': 1.2999281883239746} 11/07/2021 14:09:14 - INFO - __main__ - Step 120017: {'lr': 4.8950239941880916e-05, 'samples': 23043264, 'steps': 120016, 'loss/train': 1.3403267860412598} 11/07/2021 14:09:15 - INFO - __main__ - Step 120018: {'lr': 4.894708587018582e-05, 'samples': 23043456, 'steps': 120017, 'loss/train': 1.5707504749298096} 11/07/2021 14:09:15 - INFO - __main__ - Step 120019: {'lr': 4.894393188908206e-05, 'samples': 23043648, 'steps': 120018, 'loss/train': 1.1108729839324951} 11/07/2021 14:09:15 - INFO - __main__ - Step 120020: {'lr': 4.894077799857105e-05, 'samples': 23043840, 'steps': 120019, 'loss/train': 1.4605718851089478} 11/07/2021 14:09:16 - INFO - __main__ - Step 120021: {'lr': 4.893762419865425e-05, 'samples': 23044032, 'steps': 120020, 'loss/train': 1.293405294418335} 11/07/2021 14:09:16 - INFO - __main__ - Step 120022: {'lr': 4.8934470489333056e-05, 'samples': 23044224, 'steps': 120021, 'loss/train': 1.1273342370986938} 11/07/2021 14:09:16 - INFO - __main__ - Step 120023: {'lr': 4.893131687060886e-05, 'samples': 23044416, 'steps': 120022, 'loss/train': 1.264103651046753} 11/07/2021 14:09:18 - INFO - __main__ - Step 120024: {'lr': 4.8928163342483124e-05, 'samples': 23044608, 'steps': 120023, 'loss/train': 1.4428634643554688} 11/07/2021 14:09:18 - INFO - __main__ - Step 120025: {'lr': 4.892500990495727e-05, 'samples': 23044800, 'steps': 120024, 'loss/train': 0.518305242061615} 11/07/2021 14:09:18 - INFO - __main__ - Step 120026: {'lr': 4.892185655803275e-05, 'samples': 23044992, 'steps': 120025, 'loss/train': 0.4182068109512329} 11/07/2021 14:09:19 - INFO - __main__ - Step 120027: {'lr': 4.8918703301710884e-05, 'samples': 23045184, 'steps': 120026, 'loss/train': 1.2141886949539185} 11/07/2021 14:09:20 - INFO - __main__ - Step 120028: {'lr': 4.891555013599314e-05, 'samples': 23045376, 'steps': 120027, 'loss/train': 1.5009403228759766} 11/07/2021 14:09:20 - INFO - __main__ - Step 120029: {'lr': 4.8912397060880935e-05, 'samples': 23045568, 'steps': 120028, 'loss/train': 1.1414533853530884} 11/07/2021 14:09:21 - INFO - __main__ - Step 120030: {'lr': 4.890924407637573e-05, 'samples': 23045760, 'steps': 120029, 'loss/train': 0.9397246837615967} 11/07/2021 14:09:21 - INFO - __main__ - Step 120031: {'lr': 4.890609118247888e-05, 'samples': 23045952, 'steps': 120030, 'loss/train': 1.783538818359375} 11/07/2021 14:09:21 - INFO - __main__ - Step 120032: {'lr': 4.890293837919188e-05, 'samples': 23046144, 'steps': 120031, 'loss/train': 1.2741663455963135} 11/07/2021 14:09:23 - INFO - __main__ - Step 120033: {'lr': 4.889978566651609e-05, 'samples': 23046336, 'steps': 120032, 'loss/train': 1.3321585655212402} 11/07/2021 14:09:23 - INFO - __main__ - Step 120034: {'lr': 4.8896633044452966e-05, 'samples': 23046528, 'steps': 120033, 'loss/train': 1.4527745246887207} 11/07/2021 14:09:23 - INFO - __main__ - Step 120035: {'lr': 4.8893480513003906e-05, 'samples': 23046720, 'steps': 120034, 'loss/train': 1.083811640739441} 11/07/2021 14:09:24 - INFO - __main__ - Step 120036: {'lr': 4.8890328072170336e-05, 'samples': 23046912, 'steps': 120035, 'loss/train': 1.143491268157959} 11/07/2021 14:09:24 - INFO - __main__ - Step 120037: {'lr': 4.88871757219537e-05, 'samples': 23047104, 'steps': 120036, 'loss/train': 1.5296365022659302} 11/07/2021 14:09:24 - INFO - __main__ - Step 120038: {'lr': 4.888402346235538e-05, 'samples': 23047296, 'steps': 120037, 'loss/train': 1.2477445602416992} 11/07/2021 14:09:25 - INFO - __main__ - Step 120039: {'lr': 4.8880871293376816e-05, 'samples': 23047488, 'steps': 120038, 'loss/train': 1.337018609046936} 11/07/2021 14:09:26 - INFO - __main__ - Step 120040: {'lr': 4.887771921501949e-05, 'samples': 23047680, 'steps': 120039, 'loss/train': 1.965865969657898} 11/07/2021 14:09:26 - INFO - __main__ - Step 120041: {'lr': 4.887456722728473e-05, 'samples': 23047872, 'steps': 120040, 'loss/train': 0.8893584609031677} 11/07/2021 14:09:26 - INFO - __main__ - Step 120042: {'lr': 4.8871415330173946e-05, 'samples': 23048064, 'steps': 120041, 'loss/train': 1.3100159168243408} 11/07/2021 14:09:27 - INFO - __main__ - Step 120043: {'lr': 4.8868263523688616e-05, 'samples': 23048256, 'steps': 120042, 'loss/train': 1.2612327337265015} 11/07/2021 14:09:28 - INFO - __main__ - Step 120044: {'lr': 4.8865111807830125e-05, 'samples': 23048448, 'steps': 120043, 'loss/train': 1.5382835865020752} 11/07/2021 14:09:28 - INFO - __main__ - Step 120045: {'lr': 4.8861960182599945e-05, 'samples': 23048640, 'steps': 120044, 'loss/train': 1.809225082397461} 11/07/2021 14:09:28 - INFO - __main__ - Step 120046: {'lr': 4.885880864799944e-05, 'samples': 23048832, 'steps': 120045, 'loss/train': 1.398596167564392} 11/07/2021 14:09:29 - INFO - __main__ - Step 120047: {'lr': 4.885565720403007e-05, 'samples': 23049024, 'steps': 120046, 'loss/train': 1.6262061595916748} 11/07/2021 14:09:29 - INFO - __main__ - Step 120048: {'lr': 4.88525058506932e-05, 'samples': 23049216, 'steps': 120047, 'loss/train': 0.6329628229141235} 11/07/2021 14:09:30 - INFO - __main__ - Step 120049: {'lr': 4.884935458799031e-05, 'samples': 23049408, 'steps': 120048, 'loss/train': 1.0708551406860352} 11/07/2021 14:09:31 - INFO - __main__ - Step 120050: {'lr': 4.8846203415922804e-05, 'samples': 23049600, 'steps': 120049, 'loss/train': 1.32381272315979} 11/07/2021 14:09:31 - INFO - __main__ - Step 120051: {'lr': 4.88430523344921e-05, 'samples': 23049792, 'steps': 120050, 'loss/train': 1.7816917896270752} 11/07/2021 14:09:31 - INFO - __main__ - Step 120052: {'lr': 4.8839901343699586e-05, 'samples': 23049984, 'steps': 120051, 'loss/train': 0.9924923181533813} 11/07/2021 14:09:32 - INFO - __main__ - Step 120053: {'lr': 4.883675044354679e-05, 'samples': 23050176, 'steps': 120052, 'loss/train': 1.8889893293380737} 11/07/2021 14:09:33 - INFO - __main__ - Step 120054: {'lr': 4.883359963403497e-05, 'samples': 23050368, 'steps': 120053, 'loss/train': 1.23691725730896} 11/07/2021 14:09:33 - INFO - __main__ - Step 120055: {'lr': 4.883044891516564e-05, 'samples': 23050560, 'steps': 120054, 'loss/train': 1.405544400215149} 11/07/2021 14:09:33 - INFO - __main__ - Step 120056: {'lr': 4.882729828694021e-05, 'samples': 23050752, 'steps': 120055, 'loss/train': 1.7093794345855713} 11/07/2021 14:09:34 - INFO - __main__ - Step 120057: {'lr': 4.882414774936009e-05, 'samples': 23050944, 'steps': 120056, 'loss/train': 0.9893200397491455} 11/07/2021 14:09:34 - INFO - __main__ - Step 120058: {'lr': 4.88209973024267e-05, 'samples': 23051136, 'steps': 120057, 'loss/train': 1.3935505151748657} 11/07/2021 14:09:35 - INFO - __main__ - Step 120059: {'lr': 4.881784694614147e-05, 'samples': 23051328, 'steps': 120058, 'loss/train': 1.3452649116516113} 11/07/2021 14:09:36 - INFO - __main__ - Step 120060: {'lr': 4.881469668050581e-05, 'samples': 23051520, 'steps': 120059, 'loss/train': 1.654772400856018} 11/07/2021 14:09:36 - INFO - __main__ - Step 120061: {'lr': 4.881154650552114e-05, 'samples': 23051712, 'steps': 120060, 'loss/train': 0.7769205570220947} 11/07/2021 14:09:36 - INFO - __main__ - Step 120062: {'lr': 4.8808396421188896e-05, 'samples': 23051904, 'steps': 120061, 'loss/train': 1.4615012407302856} 11/07/2021 14:09:37 - INFO - __main__ - Step 120063: {'lr': 4.880524642751047e-05, 'samples': 23052096, 'steps': 120062, 'loss/train': 1.3639267683029175} 11/07/2021 14:09:37 - INFO - __main__ - Step 120064: {'lr': 4.880209652448731e-05, 'samples': 23052288, 'steps': 120063, 'loss/train': 1.4276766777038574} 11/07/2021 14:09:38 - INFO - __main__ - Step 120065: {'lr': 4.8798946712120816e-05, 'samples': 23052480, 'steps': 120064, 'loss/train': 1.552965760231018} 11/07/2021 14:09:38 - INFO - __main__ - Step 120066: {'lr': 4.8795796990412397e-05, 'samples': 23052672, 'steps': 120065, 'loss/train': 1.3906797170639038} 11/07/2021 14:09:39 - INFO - __main__ - Step 120067: {'lr': 4.879264735936356e-05, 'samples': 23052864, 'steps': 120066, 'loss/train': 0.8515996336936951} 11/07/2021 14:09:39 - INFO - __main__ - Step 120068: {'lr': 4.87894978189756e-05, 'samples': 23053056, 'steps': 120067, 'loss/train': 1.4538038969039917} 11/07/2021 14:09:39 - INFO - __main__ - Step 120069: {'lr': 4.8786348369249975e-05, 'samples': 23053248, 'steps': 120068, 'loss/train': 1.056023359298706} 11/07/2021 14:09:40 - INFO - __main__ - Step 120070: {'lr': 4.87831990101881e-05, 'samples': 23053440, 'steps': 120069, 'loss/train': 1.5582994222640991} 11/07/2021 14:09:41 - INFO - __main__ - Step 120071: {'lr': 4.8780049741791426e-05, 'samples': 23053632, 'steps': 120070, 'loss/train': 1.0406829118728638} 11/07/2021 14:09:41 - INFO - __main__ - Step 120072: {'lr': 4.877690056406137e-05, 'samples': 23053824, 'steps': 120071, 'loss/train': 1.047364592552185} 11/07/2021 14:09:41 - INFO - __main__ - Step 120073: {'lr': 4.8773751476999336e-05, 'samples': 23054016, 'steps': 120072, 'loss/train': 1.529931664466858} 11/07/2021 14:09:42 - INFO - __main__ - Step 120074: {'lr': 4.877060248060672e-05, 'samples': 23054208, 'steps': 120073, 'loss/train': 1.0046523809432983} 11/07/2021 14:09:43 - INFO - __main__ - Step 120075: {'lr': 4.876745357488499e-05, 'samples': 23054400, 'steps': 120074, 'loss/train': 0.3601728081703186} 11/07/2021 14:09:43 - INFO - __main__ - Step 120076: {'lr': 4.8764304759835534e-05, 'samples': 23054592, 'steps': 120075, 'loss/train': 1.3307033777236938} 11/07/2021 14:09:44 - INFO - __main__ - Step 120077: {'lr': 4.876115603545978e-05, 'samples': 23054784, 'steps': 120076, 'loss/train': 0.642892062664032} 11/07/2021 14:09:44 - INFO - __main__ - Step 120078: {'lr': 4.875800740175912e-05, 'samples': 23054976, 'steps': 120077, 'loss/train': 1.0805084705352783} 11/07/2021 14:09:44 - INFO - __main__ - Step 120079: {'lr': 4.8754858858735014e-05, 'samples': 23055168, 'steps': 120078, 'loss/train': 1.405299425125122} 11/07/2021 14:09:45 - INFO - __main__ - Step 120080: {'lr': 4.875171040638893e-05, 'samples': 23055360, 'steps': 120079, 'loss/train': 1.244773268699646} 11/07/2021 14:09:46 - INFO - __main__ - Step 120081: {'lr': 4.8748562044722166e-05, 'samples': 23055552, 'steps': 120080, 'loss/train': 1.17471444606781} 11/07/2021 14:09:46 - INFO - __main__ - Step 120082: {'lr': 4.8745413773736176e-05, 'samples': 23055744, 'steps': 120081, 'loss/train': 1.1522585153579712} 11/07/2021 14:09:46 - INFO - __main__ - Step 120083: {'lr': 4.8742265593432394e-05, 'samples': 23055936, 'steps': 120082, 'loss/train': 1.1359418630599976} 11/07/2021 14:09:47 - INFO - __main__ - Step 120084: {'lr': 4.8739117503812245e-05, 'samples': 23056128, 'steps': 120083, 'loss/train': 0.7787581086158752} 11/07/2021 14:09:48 - INFO - __main__ - Step 120085: {'lr': 4.873596950487716e-05, 'samples': 23056320, 'steps': 120084, 'loss/train': 1.4789268970489502} 11/07/2021 14:09:48 - INFO - __main__ - Step 120086: {'lr': 4.8732821596628534e-05, 'samples': 23056512, 'steps': 120085, 'loss/train': 1.449021816253662} 11/07/2021 14:09:48 - INFO - __main__ - Step 120087: {'lr': 4.8729673779067784e-05, 'samples': 23056704, 'steps': 120086, 'loss/train': 1.0221765041351318} 11/07/2021 14:09:49 - INFO - __main__ - Step 120088: {'lr': 4.8726526052196325e-05, 'samples': 23056896, 'steps': 120087, 'loss/train': 1.4528453350067139} 11/07/2021 14:09:49 - INFO - __main__ - Step 120089: {'lr': 4.8723378416015626e-05, 'samples': 23057088, 'steps': 120088, 'loss/train': 1.0968000888824463} 11/07/2021 14:09:50 - INFO - __main__ - Step 120090: {'lr': 4.872023087052704e-05, 'samples': 23057280, 'steps': 120089, 'loss/train': 0.9867737889289856} 11/07/2021 14:09:51 - INFO - __main__ - Step 120091: {'lr': 4.871708341573208e-05, 'samples': 23057472, 'steps': 120090, 'loss/train': 1.3832026720046997} 11/07/2021 14:09:51 - INFO - __main__ - Step 120092: {'lr': 4.871393605163202e-05, 'samples': 23057664, 'steps': 120091, 'loss/train': 1.383430004119873} 11/07/2021 14:09:51 - INFO - __main__ - Step 120093: {'lr': 4.8710788778228375e-05, 'samples': 23057856, 'steps': 120092, 'loss/train': 1.5785682201385498} 11/07/2021 14:09:52 - INFO - __main__ - Step 120094: {'lr': 4.8707641595522546e-05, 'samples': 23058048, 'steps': 120093, 'loss/train': 1.4014928340911865} 11/07/2021 14:09:52 - INFO - __main__ - Step 120095: {'lr': 4.870449450351594e-05, 'samples': 23058240, 'steps': 120094, 'loss/train': 1.4011480808258057} 11/07/2021 14:09:53 - INFO - __main__ - Step 120096: {'lr': 4.870134750220998e-05, 'samples': 23058432, 'steps': 120095, 'loss/train': 1.2953729629516602} 11/07/2021 14:09:54 - INFO - __main__ - Step 120097: {'lr': 4.869820059160607e-05, 'samples': 23058624, 'steps': 120096, 'loss/train': 1.4466310739517212} 11/07/2021 14:09:54 - INFO - __main__ - Step 120098: {'lr': 4.869505377170566e-05, 'samples': 23058816, 'steps': 120097, 'loss/train': 1.5048341751098633} 11/07/2021 14:09:54 - INFO - __main__ - Step 120099: {'lr': 4.869190704251017e-05, 'samples': 23059008, 'steps': 120098, 'loss/train': 0.4600214660167694} 11/07/2021 14:09:55 - INFO - __main__ - Step 120100: {'lr': 4.8688760404020985e-05, 'samples': 23059200, 'steps': 120099, 'loss/train': 1.4575965404510498} 11/07/2021 14:09:56 - INFO - __main__ - Step 120101: {'lr': 4.8685613856239513e-05, 'samples': 23059392, 'steps': 120100, 'loss/train': 1.3905311822891235} 11/07/2021 14:09:56 - INFO - __main__ - Step 120102: {'lr': 4.868246739916729e-05, 'samples': 23059584, 'steps': 120101, 'loss/train': 1.0519388914108276} 11/07/2021 14:09:56 - INFO - __main__ - Step 120103: {'lr': 4.867932103280559e-05, 'samples': 23059776, 'steps': 120102, 'loss/train': 0.20397356152534485} 11/07/2021 14:09:57 - INFO - __main__ - Step 120104: {'lr': 4.8676174757155855e-05, 'samples': 23059968, 'steps': 120103, 'loss/train': 1.3598629236221313} 11/07/2021 14:09:57 - INFO - __main__ - Step 120105: {'lr': 4.867302857221953e-05, 'samples': 23060160, 'steps': 120104, 'loss/train': 1.4264113903045654} 11/07/2021 14:09:58 - INFO - __main__ - Step 120106: {'lr': 4.866988247799806e-05, 'samples': 23060352, 'steps': 120105, 'loss/train': 1.1250240802764893} 11/07/2021 14:09:58 - INFO - __main__ - Step 120107: {'lr': 4.86667364744928e-05, 'samples': 23060544, 'steps': 120106, 'loss/train': 1.4273658990859985} 11/07/2021 14:09:59 - INFO - __main__ - Step 120108: {'lr': 4.8663590561705215e-05, 'samples': 23060736, 'steps': 120107, 'loss/train': 1.1588927507400513} 11/07/2021 14:09:59 - INFO - __main__ - Step 120109: {'lr': 4.866044473963671e-05, 'samples': 23060928, 'steps': 120108, 'loss/train': 1.7624019384384155} 11/07/2021 14:10:00 - INFO - __main__ - Step 120110: {'lr': 4.865729900828869e-05, 'samples': 23061120, 'steps': 120109, 'loss/train': 0.6818217039108276} 11/07/2021 14:10:01 - INFO - __main__ - Step 120111: {'lr': 4.86541533676626e-05, 'samples': 23061312, 'steps': 120110, 'loss/train': 1.5346742868423462} 11/07/2021 14:10:01 - INFO - __main__ - Step 120112: {'lr': 4.8651007817759914e-05, 'samples': 23061504, 'steps': 120111, 'loss/train': 1.352070927619934} 11/07/2021 14:10:01 - INFO - __main__ - Step 120113: {'lr': 4.864786235858187e-05, 'samples': 23061696, 'steps': 120112, 'loss/train': 1.7349194288253784} 11/07/2021 14:10:02 - INFO - __main__ - Step 120114: {'lr': 4.864471699013004e-05, 'samples': 23061888, 'steps': 120113, 'loss/train': 1.7683570384979248} 11/07/2021 14:10:02 - INFO - __main__ - Step 120115: {'lr': 4.864157171240577e-05, 'samples': 23062080, 'steps': 120114, 'loss/train': 1.3874843120574951} 11/07/2021 14:10:03 - INFO - __main__ - Step 120116: {'lr': 4.863842652541051e-05, 'samples': 23062272, 'steps': 120115, 'loss/train': 1.4431577920913696} 11/07/2021 14:10:03 - INFO - __main__ - Step 120117: {'lr': 4.8635281429145665e-05, 'samples': 23062464, 'steps': 120116, 'loss/train': 1.4077190160751343} 11/07/2021 14:10:04 - INFO - __main__ - Step 120118: {'lr': 4.863213642361264e-05, 'samples': 23062656, 'steps': 120117, 'loss/train': 1.0496821403503418} 11/07/2021 14:10:04 - INFO - __main__ - Step 120119: {'lr': 4.8628991508812895e-05, 'samples': 23062848, 'steps': 120118, 'loss/train': 1.3093593120574951} 11/07/2021 14:10:05 - INFO - __main__ - Step 120120: {'lr': 4.862584668474779e-05, 'samples': 23063040, 'steps': 120119, 'loss/train': 1.4479435682296753} 11/07/2021 14:10:06 - INFO - __main__ - Step 120121: {'lr': 4.8622701951418795e-05, 'samples': 23063232, 'steps': 120120, 'loss/train': 1.1383105516433716} 11/07/2021 14:10:06 - INFO - __main__ - Step 120122: {'lr': 4.861955730882728e-05, 'samples': 23063424, 'steps': 120121, 'loss/train': 0.7797020077705383} 11/07/2021 14:10:07 - INFO - __main__ - Step 120123: {'lr': 4.861641275697479e-05, 'samples': 23063616, 'steps': 120122, 'loss/train': 1.074372410774231} 11/07/2021 14:10:07 - INFO - __main__ - Step 120124: {'lr': 4.861326829586252e-05, 'samples': 23063808, 'steps': 120123, 'loss/train': 1.266793966293335} 11/07/2021 14:10:07 - INFO - __main__ - Step 120125: {'lr': 4.861012392549205e-05, 'samples': 23064000, 'steps': 120124, 'loss/train': 1.6476311683654785} 11/07/2021 14:10:08 - INFO - __main__ - Step 120126: {'lr': 4.8606979645864716e-05, 'samples': 23064192, 'steps': 120125, 'loss/train': 1.452030897140503} 11/07/2021 14:10:09 - INFO - __main__ - Step 120127: {'lr': 4.860383545698199e-05, 'samples': 23064384, 'steps': 120126, 'loss/train': 1.2830878496170044} 11/07/2021 14:10:10 - INFO - __main__ - Step 120128: {'lr': 4.860069135884526e-05, 'samples': 23064576, 'steps': 120127, 'loss/train': 2.6502320766448975} 11/07/2021 14:10:10 - INFO - __main__ - Step 120129: {'lr': 4.859754735145594e-05, 'samples': 23064768, 'steps': 120128, 'loss/train': 4.92717170715332} 11/07/2021 14:10:10 - INFO - __main__ - Step 120130: {'lr': 4.859440343481547e-05, 'samples': 23064960, 'steps': 120129, 'loss/train': 4.877042770385742} 11/07/2021 14:10:11 - INFO - __main__ - Step 120131: {'lr': 4.859125960892527e-05, 'samples': 23065152, 'steps': 120130, 'loss/train': 1.6287416219711304} 11/07/2021 14:10:11 - INFO - __main__ - Step 120132: {'lr': 4.85881158737867e-05, 'samples': 23065344, 'steps': 120131, 'loss/train': 1.4324090480804443} 11/07/2021 14:10:11 - INFO - __main__ - Step 120133: {'lr': 4.8584972229401255e-05, 'samples': 23065536, 'steps': 120132, 'loss/train': 1.3755590915679932} 11/07/2021 14:10:12 - INFO - __main__ - Step 120134: {'lr': 4.858182867577035e-05, 'samples': 23065728, 'steps': 120133, 'loss/train': 1.4683531522750854} 11/07/2021 14:10:13 - INFO - __main__ - Step 120135: {'lr': 4.857868521289532e-05, 'samples': 23065920, 'steps': 120134, 'loss/train': 0.6152967214584351} 11/07/2021 14:10:13 - INFO - __main__ - Step 120136: {'lr': 4.857554184077762e-05, 'samples': 23066112, 'steps': 120135, 'loss/train': 1.863839030265808} 11/07/2021 14:10:13 - INFO - __main__ - Step 120137: {'lr': 4.8572398559418665e-05, 'samples': 23066304, 'steps': 120136, 'loss/train': 1.2415772676467896} 11/07/2021 14:10:14 - INFO - __main__ - Step 120138: {'lr': 4.8569255368819897e-05, 'samples': 23066496, 'steps': 120137, 'loss/train': 1.29514479637146} 11/07/2021 14:10:15 - INFO - __main__ - Step 120139: {'lr': 4.8566112268982694e-05, 'samples': 23066688, 'steps': 120138, 'loss/train': 1.3347645998001099} 11/07/2021 14:10:15 - INFO - __main__ - Step 120140: {'lr': 4.856296925990853e-05, 'samples': 23066880, 'steps': 120139, 'loss/train': 1.134694218635559} 11/07/2021 14:10:15 - INFO - __main__ - Step 120141: {'lr': 4.855982634159875e-05, 'samples': 23067072, 'steps': 120140, 'loss/train': 1.4546056985855103} 11/07/2021 14:10:16 - INFO - __main__ - Step 120142: {'lr': 4.855668351405479e-05, 'samples': 23067264, 'steps': 120141, 'loss/train': 1.3468990325927734} 11/07/2021 14:10:16 - INFO - __main__ - Step 120143: {'lr': 4.855354077727811e-05, 'samples': 23067456, 'steps': 120142, 'loss/train': 0.747558057308197} 11/07/2021 14:10:17 - INFO - __main__ - Step 120144: {'lr': 4.855039813127007e-05, 'samples': 23067648, 'steps': 120143, 'loss/train': 1.3060170412063599} 11/07/2021 14:10:18 - INFO - __main__ - Step 120145: {'lr': 4.8547255576032154e-05, 'samples': 23067840, 'steps': 120144, 'loss/train': 1.132084608078003} 11/07/2021 14:10:18 - INFO - __main__ - Step 120146: {'lr': 4.85441131115657e-05, 'samples': 23068032, 'steps': 120145, 'loss/train': 1.480417013168335} 11/07/2021 14:10:18 - INFO - __main__ - Step 120147: {'lr': 4.8540970737872226e-05, 'samples': 23068224, 'steps': 120146, 'loss/train': 0.8495973348617554} 11/07/2021 14:10:19 - INFO - __main__ - Step 120148: {'lr': 4.853782845495303e-05, 'samples': 23068416, 'steps': 120147, 'loss/train': 0.790054440498352} 11/07/2021 14:10:20 - INFO - __main__ - Step 120149: {'lr': 4.853468626280957e-05, 'samples': 23068608, 'steps': 120148, 'loss/train': 1.265928864479065} 11/07/2021 14:10:20 - INFO - __main__ - Step 120150: {'lr': 4.853154416144329e-05, 'samples': 23068800, 'steps': 120149, 'loss/train': 1.4138014316558838} 11/07/2021 14:10:20 - INFO - __main__ - Step 120151: {'lr': 4.852840215085558e-05, 'samples': 23068992, 'steps': 120150, 'loss/train': 1.2658663988113403} 11/07/2021 14:10:21 - INFO - __main__ - Step 120152: {'lr': 4.852526023104784e-05, 'samples': 23069184, 'steps': 120151, 'loss/train': 0.9216386675834656} 11/07/2021 14:10:21 - INFO - __main__ - Step 120153: {'lr': 4.852211840202153e-05, 'samples': 23069376, 'steps': 120152, 'loss/train': 1.444380521774292} 11/07/2021 14:10:22 - INFO - __main__ - Step 120154: {'lr': 4.8518976663778055e-05, 'samples': 23069568, 'steps': 120153, 'loss/train': 1.1154906749725342} 11/07/2021 14:10:22 - INFO - __main__ - Step 120155: {'lr': 4.851583501631879e-05, 'samples': 23069760, 'steps': 120154, 'loss/train': 1.4922033548355103} 11/07/2021 14:10:23 - INFO - __main__ - Step 120156: {'lr': 4.851269345964521e-05, 'samples': 23069952, 'steps': 120155, 'loss/train': 1.741206169128418} 11/07/2021 14:10:23 - INFO - __main__ - Step 120157: {'lr': 4.850955199375867e-05, 'samples': 23070144, 'steps': 120156, 'loss/train': 1.3339388370513916} 11/07/2021 14:10:24 - INFO - __main__ - Step 120158: {'lr': 4.850641061866065e-05, 'samples': 23070336, 'steps': 120157, 'loss/train': 1.1386715173721313} 11/07/2021 14:10:24 - INFO - __main__ - Step 120159: {'lr': 4.850326933435251e-05, 'samples': 23070528, 'steps': 120158, 'loss/train': 1.808728814125061} 11/07/2021 14:10:25 - INFO - __main__ - Step 120160: {'lr': 4.8500128140835684e-05, 'samples': 23070720, 'steps': 120159, 'loss/train': 1.6958401203155518} 11/07/2021 14:10:25 - INFO - __main__ - Step 120161: {'lr': 4.8496987038111676e-05, 'samples': 23070912, 'steps': 120160, 'loss/train': 1.6192134618759155} 11/07/2021 14:10:26 - INFO - __main__ - Step 120162: {'lr': 4.849384602618173e-05, 'samples': 23071104, 'steps': 120161, 'loss/train': 1.2969380617141724} 11/07/2021 14:10:26 - INFO - __main__ - Step 120163: {'lr': 4.849070510504735e-05, 'samples': 23071296, 'steps': 120162, 'loss/train': 0.9544126987457275} 11/07/2021 14:10:26 - INFO - __main__ - Step 120164: {'lr': 4.848756427470999e-05, 'samples': 23071488, 'steps': 120163, 'loss/train': 1.553922176361084} 11/07/2021 14:10:27 - INFO - __main__ - Step 120165: {'lr': 4.848442353517099e-05, 'samples': 23071680, 'steps': 120164, 'loss/train': 1.0246092081069946} 11/07/2021 14:10:28 - INFO - __main__ - Step 120166: {'lr': 4.8481282886431774e-05, 'samples': 23071872, 'steps': 120165, 'loss/train': 1.5104868412017822} 11/07/2021 14:10:28 - INFO - __main__ - Step 120167: {'lr': 4.847814232849382e-05, 'samples': 23072064, 'steps': 120166, 'loss/train': 1.5143312215805054} 11/07/2021 14:10:28 - INFO - __main__ - Step 120168: {'lr': 4.8475001861358506e-05, 'samples': 23072256, 'steps': 120167, 'loss/train': 1.4079164266586304} 11/07/2021 14:10:29 - INFO - __main__ - Step 120169: {'lr': 4.847186148502722e-05, 'samples': 23072448, 'steps': 120168, 'loss/train': 1.2978566884994507} 11/07/2021 14:10:30 - INFO - __main__ - Step 120170: {'lr': 4.8468721199501436e-05, 'samples': 23072640, 'steps': 120169, 'loss/train': 1.258700966835022} 11/07/2021 14:10:30 - INFO - __main__ - Step 120171: {'lr': 4.846558100478252e-05, 'samples': 23072832, 'steps': 120170, 'loss/train': 1.8420261144638062} 11/07/2021 14:10:30 - INFO - __main__ - Step 120172: {'lr': 4.84624409008719e-05, 'samples': 23073024, 'steps': 120171, 'loss/train': 1.5211058855056763} 11/07/2021 14:10:31 - INFO - __main__ - Step 120173: {'lr': 4.845930088777098e-05, 'samples': 23073216, 'steps': 120172, 'loss/train': 1.3930732011795044} 11/07/2021 14:10:31 - INFO - __main__ - Step 120174: {'lr': 4.845616096548128e-05, 'samples': 23073408, 'steps': 120173, 'loss/train': 1.4530442953109741} 11/07/2021 14:10:32 - INFO - __main__ - Step 120175: {'lr': 4.8453021134004044e-05, 'samples': 23073600, 'steps': 120174, 'loss/train': 1.160083293914795} 11/07/2021 14:10:33 - INFO - __main__ - Step 120176: {'lr': 4.8449881393340776e-05, 'samples': 23073792, 'steps': 120175, 'loss/train': 1.4233694076538086} 11/07/2021 14:10:33 - INFO - __main__ - Step 120177: {'lr': 4.844674174349287e-05, 'samples': 23073984, 'steps': 120176, 'loss/train': 1.3718721866607666} 11/07/2021 14:10:33 - INFO - __main__ - Step 120178: {'lr': 4.844360218446176e-05, 'samples': 23074176, 'steps': 120177, 'loss/train': 1.2951374053955078} 11/07/2021 14:10:34 - INFO - __main__ - Step 120179: {'lr': 4.844046271624886e-05, 'samples': 23074368, 'steps': 120178, 'loss/train': 1.6959272623062134} 11/07/2021 14:10:36 - INFO - __main__ - Step 120180: {'lr': 4.843732333885556e-05, 'samples': 23074560, 'steps': 120179, 'loss/train': 1.5096938610076904} 11/07/2021 14:10:36 - INFO - __main__ - Step 120181: {'lr': 4.8434184052283306e-05, 'samples': 23074752, 'steps': 120180, 'loss/train': 1.1847314834594727} 11/07/2021 14:10:36 - INFO - __main__ - Step 120182: {'lr': 4.843104485653349e-05, 'samples': 23074944, 'steps': 120181, 'loss/train': 1.1949785947799683} 11/07/2021 14:10:37 - INFO - __main__ - Step 120183: {'lr': 4.842790575160755e-05, 'samples': 23075136, 'steps': 120182, 'loss/train': 1.4403669834136963} 11/07/2021 14:10:37 - INFO - __main__ - Step 120184: {'lr': 4.842476673750687e-05, 'samples': 23075328, 'steps': 120183, 'loss/train': 0.984769344329834} 11/07/2021 14:10:37 - INFO - __main__ - Step 120185: {'lr': 4.842162781423287e-05, 'samples': 23075520, 'steps': 120184, 'loss/train': 0.7499074935913086} 11/07/2021 14:10:38 - INFO - __main__ - Step 120186: {'lr': 4.8418488981786997e-05, 'samples': 23075712, 'steps': 120185, 'loss/train': 1.4305474758148193} 11/07/2021 14:10:39 - INFO - __main__ - Step 120187: {'lr': 4.841535024017063e-05, 'samples': 23075904, 'steps': 120186, 'loss/train': 1.4209634065628052} 11/07/2021 14:10:39 - INFO - __main__ - Step 120188: {'lr': 4.841221158938527e-05, 'samples': 23076096, 'steps': 120187, 'loss/train': 1.3921235799789429} 11/07/2021 14:10:40 - INFO - __main__ - Step 120189: {'lr': 4.8409073029432176e-05, 'samples': 23076288, 'steps': 120188, 'loss/train': 1.5792559385299683} 11/07/2021 14:10:40 - INFO - __main__ - Step 120190: {'lr': 4.840593456031287e-05, 'samples': 23076480, 'steps': 120189, 'loss/train': 1.3377041816711426} 11/07/2021 14:10:40 - INFO - __main__ - Step 120191: {'lr': 4.8402796182028694e-05, 'samples': 23076672, 'steps': 120190, 'loss/train': 0.1568266749382019} 11/07/2021 14:10:41 - INFO - __main__ - Step 120192: {'lr': 4.839965789458115e-05, 'samples': 23076864, 'steps': 120191, 'loss/train': 1.152855634689331} 11/07/2021 14:10:42 - INFO - __main__ - Step 120193: {'lr': 4.8396519697971596e-05, 'samples': 23077056, 'steps': 120192, 'loss/train': 1.5455821752548218} 11/07/2021 14:10:42 - INFO - __main__ - Step 120194: {'lr': 4.839338159220144e-05, 'samples': 23077248, 'steps': 120193, 'loss/train': 1.1069440841674805} 11/07/2021 14:10:43 - INFO - __main__ - Step 120195: {'lr': 4.8390243577272145e-05, 'samples': 23077440, 'steps': 120194, 'loss/train': 1.3871734142303467} 11/07/2021 14:10:43 - INFO - __main__ - Step 120196: {'lr': 4.8387105653185076e-05, 'samples': 23077632, 'steps': 120195, 'loss/train': 1.2297347784042358} 11/07/2021 14:10:43 - INFO - __main__ - Step 120197: {'lr': 4.838396781994167e-05, 'samples': 23077824, 'steps': 120196, 'loss/train': 1.1161812543869019} 11/07/2021 14:10:44 - INFO - __main__ - Step 120198: {'lr': 4.838083007754335e-05, 'samples': 23078016, 'steps': 120197, 'loss/train': 1.571709394454956} 11/07/2021 14:10:45 - INFO - __main__ - Step 120199: {'lr': 4.837769242599149e-05, 'samples': 23078208, 'steps': 120198, 'loss/train': 1.2753846645355225} 11/07/2021 14:10:45 - INFO - __main__ - Step 120200: {'lr': 4.8374554865287554e-05, 'samples': 23078400, 'steps': 120199, 'loss/train': 1.2438814640045166} 11/07/2021 14:10:45 - INFO - __main__ - Step 120201: {'lr': 4.837141739543299e-05, 'samples': 23078592, 'steps': 120200, 'loss/train': 1.9853259325027466} 11/07/2021 14:10:46 - INFO - __main__ - Step 120202: {'lr': 4.836828001642909e-05, 'samples': 23078784, 'steps': 120201, 'loss/train': 1.6033837795257568} 11/07/2021 14:10:47 - INFO - __main__ - Step 120203: {'lr': 4.8365142728277326e-05, 'samples': 23078976, 'steps': 120202, 'loss/train': 0.9391089081764221} 11/07/2021 14:10:47 - INFO - __main__ - Step 120204: {'lr': 4.8362005530979106e-05, 'samples': 23079168, 'steps': 120203, 'loss/train': 1.2558473348617554} 11/07/2021 14:10:48 - INFO - __main__ - Step 120205: {'lr': 4.8358868424535876e-05, 'samples': 23079360, 'steps': 120204, 'loss/train': 0.9800662994384766} 11/07/2021 14:10:48 - INFO - __main__ - Step 120206: {'lr': 4.8355731408949025e-05, 'samples': 23079552, 'steps': 120205, 'loss/train': 0.9856838583946228} 11/07/2021 14:10:48 - INFO - __main__ - Step 120207: {'lr': 4.835259448421997e-05, 'samples': 23079744, 'steps': 120206, 'loss/train': 1.2830605506896973} 11/07/2021 14:10:49 - INFO - __main__ - Step 120208: {'lr': 4.834945765035012e-05, 'samples': 23079936, 'steps': 120207, 'loss/train': 1.4949041604995728} 11/07/2021 14:10:50 - INFO - __main__ - Step 120209: {'lr': 4.8346320907340897e-05, 'samples': 23080128, 'steps': 120208, 'loss/train': 1.2986007928848267} 11/07/2021 14:10:50 - INFO - __main__ - Step 120210: {'lr': 4.834318425519371e-05, 'samples': 23080320, 'steps': 120209, 'loss/train': 1.27944815158844} 11/07/2021 14:10:50 - INFO - __main__ - Step 120211: {'lr': 4.834004769390998e-05, 'samples': 23080512, 'steps': 120210, 'loss/train': 1.437787652015686} 11/07/2021 14:10:51 - INFO - __main__ - Step 120212: {'lr': 4.83369112234911e-05, 'samples': 23080704, 'steps': 120211, 'loss/train': 1.20599365234375} 11/07/2021 14:10:51 - INFO - __main__ - Step 120213: {'lr': 4.83337748439385e-05, 'samples': 23080896, 'steps': 120212, 'loss/train': 1.3000494241714478} 11/07/2021 14:10:52 - INFO - __main__ - Step 120214: {'lr': 4.8330638555253575e-05, 'samples': 23081088, 'steps': 120213, 'loss/train': 0.5245353579521179} 11/07/2021 14:10:53 - INFO - __main__ - Step 120215: {'lr': 4.832750235743783e-05, 'samples': 23081280, 'steps': 120214, 'loss/train': 1.2397571802139282} 11/07/2021 14:10:53 - INFO - __main__ - Step 120216: {'lr': 4.8324366250492553e-05, 'samples': 23081472, 'steps': 120215, 'loss/train': 2.60931658744812} 11/07/2021 14:10:53 - INFO - __main__ - Step 120217: {'lr': 4.832123023441917e-05, 'samples': 23081664, 'steps': 120216, 'loss/train': 1.0061111450195312} 11/07/2021 14:10:54 - INFO - __main__ - Step 120218: {'lr': 4.831809430921916e-05, 'samples': 23081856, 'steps': 120217, 'loss/train': 1.731117844581604} 11/07/2021 14:10:55 - INFO - __main__ - Step 120219: {'lr': 4.831495847489389e-05, 'samples': 23082048, 'steps': 120218, 'loss/train': 0.8068562150001526} 11/07/2021 14:10:55 - INFO - __main__ - Step 120220: {'lr': 4.831182273144477e-05, 'samples': 23082240, 'steps': 120219, 'loss/train': 1.8784304857254028} 11/07/2021 14:10:55 - INFO - __main__ - Step 120221: {'lr': 4.830868707887326e-05, 'samples': 23082432, 'steps': 120220, 'loss/train': 1.4379502534866333} 11/07/2021 14:10:56 - INFO - __main__ - Step 120222: {'lr': 4.830555151718072e-05, 'samples': 23082624, 'steps': 120221, 'loss/train': 1.2839560508728027} 11/07/2021 14:10:56 - INFO - __main__ - Step 120223: {'lr': 4.830241604636862e-05, 'samples': 23082816, 'steps': 120222, 'loss/train': 1.843581199645996} 11/07/2021 14:10:57 - INFO - __main__ - Step 120224: {'lr': 4.829928066643829e-05, 'samples': 23083008, 'steps': 120223, 'loss/train': 1.1331653594970703} 11/07/2021 14:10:57 - INFO - __main__ - Step 120225: {'lr': 4.829614537739124e-05, 'samples': 23083200, 'steps': 120224, 'loss/train': 1.1948782205581665} 11/07/2021 14:10:58 - INFO - __main__ - Step 120226: {'lr': 4.82930101792288e-05, 'samples': 23083392, 'steps': 120225, 'loss/train': 1.3793047666549683} 11/07/2021 14:10:58 - INFO - __main__ - Step 120227: {'lr': 4.8289875071952425e-05, 'samples': 23083584, 'steps': 120226, 'loss/train': 1.5684820413589478} 11/07/2021 14:10:59 - INFO - __main__ - Step 120228: {'lr': 4.82867400555636e-05, 'samples': 23083776, 'steps': 120227, 'loss/train': 1.333714485168457} 11/07/2021 14:11:00 - INFO - __main__ - Step 120229: {'lr': 4.8283605130063576e-05, 'samples': 23083968, 'steps': 120228, 'loss/train': 1.627631664276123} 11/07/2021 14:11:00 - INFO - __main__ - Step 120230: {'lr': 4.828047029545385e-05, 'samples': 23084160, 'steps': 120229, 'loss/train': 1.3095715045928955} 11/07/2021 14:11:01 - INFO - __main__ - Step 120231: {'lr': 4.827733555173583e-05, 'samples': 23084352, 'steps': 120230, 'loss/train': 0.6232367753982544} 11/07/2021 14:11:01 - INFO - __main__ - Step 120232: {'lr': 4.8274200898910936e-05, 'samples': 23084544, 'steps': 120231, 'loss/train': 1.3381916284561157} 11/07/2021 14:11:01 - INFO - __main__ - Step 120233: {'lr': 4.8271066336980586e-05, 'samples': 23084736, 'steps': 120232, 'loss/train': 1.3959784507751465} 11/07/2021 14:11:02 - INFO - __main__ - Step 120234: {'lr': 4.826793186594616e-05, 'samples': 23084928, 'steps': 120233, 'loss/train': 1.087538242340088} 11/07/2021 14:11:03 - INFO - __main__ - Step 120235: {'lr': 4.826479748580908e-05, 'samples': 23085120, 'steps': 120234, 'loss/train': 0.570103108882904} 11/07/2021 14:11:03 - INFO - __main__ - Step 120236: {'lr': 4.82616631965708e-05, 'samples': 23085312, 'steps': 120235, 'loss/train': 1.3267256021499634} 11/07/2021 14:11:03 - INFO - __main__ - Step 120237: {'lr': 4.825852899823269e-05, 'samples': 23085504, 'steps': 120236, 'loss/train': 1.2788870334625244} 11/07/2021 14:11:04 - INFO - __main__ - Step 120238: {'lr': 4.825539489079617e-05, 'samples': 23085696, 'steps': 120237, 'loss/train': 1.2684237957000732} 11/07/2021 14:11:05 - INFO - __main__ - Step 120239: {'lr': 4.8252260874262654e-05, 'samples': 23085888, 'steps': 120238, 'loss/train': 1.478910207748413} 11/07/2021 14:11:05 - INFO - __main__ - Step 120240: {'lr': 4.824912694863356e-05, 'samples': 23086080, 'steps': 120239, 'loss/train': 1.3166375160217285} 11/07/2021 14:11:05 - INFO - __main__ - Step 120241: {'lr': 4.824599311391031e-05, 'samples': 23086272, 'steps': 120240, 'loss/train': 1.6382330656051636} 11/07/2021 14:11:06 - INFO - __main__ - Step 120242: {'lr': 4.824285937009434e-05, 'samples': 23086464, 'steps': 120241, 'loss/train': 1.2281049489974976} 11/07/2021 14:11:06 - INFO - __main__ - Step 120243: {'lr': 4.823972571718699e-05, 'samples': 23086656, 'steps': 120242, 'loss/train': 1.4280974864959717} 11/07/2021 14:11:07 - INFO - __main__ - Step 120244: {'lr': 4.8236592155189693e-05, 'samples': 23086848, 'steps': 120243, 'loss/train': 1.4750343561172485} 11/07/2021 14:11:07 - INFO - __main__ - Step 120245: {'lr': 4.8233458684103866e-05, 'samples': 23087040, 'steps': 120244, 'loss/train': 1.8819115161895752} 11/07/2021 14:11:08 - INFO - __main__ - Step 120246: {'lr': 4.8230325303930925e-05, 'samples': 23087232, 'steps': 120245, 'loss/train': 1.3276134729385376} 11/07/2021 14:11:08 - INFO - __main__ - Step 120247: {'lr': 4.8227192014672294e-05, 'samples': 23087424, 'steps': 120246, 'loss/train': 1.0113731622695923} 11/07/2021 14:11:09 - INFO - __main__ - Step 120248: {'lr': 4.822405881632938e-05, 'samples': 23087616, 'steps': 120247, 'loss/train': 1.1001477241516113} 11/07/2021 14:11:09 - INFO - __main__ - Step 120249: {'lr': 4.8220925708903604e-05, 'samples': 23087808, 'steps': 120248, 'loss/train': 1.2945815324783325} 11/07/2021 14:11:10 - INFO - __main__ - Step 120250: {'lr': 4.8217792692396345e-05, 'samples': 23088000, 'steps': 120249, 'loss/train': 1.3259196281433105} 11/07/2021 14:11:10 - INFO - __main__ - Step 120251: {'lr': 4.8214659766809026e-05, 'samples': 23088192, 'steps': 120250, 'loss/train': 1.3470172882080078} 11/07/2021 14:11:11 - INFO - __main__ - Step 120252: {'lr': 4.821152693214309e-05, 'samples': 23088384, 'steps': 120251, 'loss/train': 1.3906077146530151} 11/07/2021 14:11:11 - INFO - __main__ - Step 120253: {'lr': 4.820839418839992e-05, 'samples': 23088576, 'steps': 120252, 'loss/train': 1.1676290035247803} 11/07/2021 14:11:11 - INFO - __main__ - Step 120254: {'lr': 4.820526153558094e-05, 'samples': 23088768, 'steps': 120253, 'loss/train': 1.8071155548095703} 11/07/2021 14:11:13 - INFO - __main__ - Step 120255: {'lr': 4.8202128973687616e-05, 'samples': 23088960, 'steps': 120254, 'loss/train': 0.8431418538093567} 11/07/2021 14:11:13 - INFO - __main__ - Step 120256: {'lr': 4.819899650272122e-05, 'samples': 23089152, 'steps': 120255, 'loss/train': 1.6852113008499146} 11/07/2021 14:11:13 - INFO - __main__ - Step 120257: {'lr': 4.819586412268326e-05, 'samples': 23089344, 'steps': 120256, 'loss/train': 1.3122954368591309} 11/07/2021 14:11:14 - INFO - __main__ - Step 120258: {'lr': 4.819273183357511e-05, 'samples': 23089536, 'steps': 120257, 'loss/train': 1.0351951122283936} 11/07/2021 14:11:14 - INFO - __main__ - Step 120259: {'lr': 4.818959963539821e-05, 'samples': 23089728, 'steps': 120258, 'loss/train': 1.456261157989502} 11/07/2021 14:11:15 - INFO - __main__ - Step 120260: {'lr': 4.818646752815398e-05, 'samples': 23089920, 'steps': 120259, 'loss/train': 0.9198675155639648} 11/07/2021 14:11:15 - INFO - __main__ - Step 120261: {'lr': 4.818333551184379e-05, 'samples': 23090112, 'steps': 120260, 'loss/train': 0.7073165774345398} 11/07/2021 14:11:16 - INFO - __main__ - Step 120262: {'lr': 4.818020358646908e-05, 'samples': 23090304, 'steps': 120261, 'loss/train': 1.392409324645996} 11/07/2021 14:11:16 - INFO - __main__ - Step 120263: {'lr': 4.817707175203126e-05, 'samples': 23090496, 'steps': 120262, 'loss/train': 1.171875} 11/07/2021 14:11:16 - INFO - __main__ - Step 120264: {'lr': 4.817394000853173e-05, 'samples': 23090688, 'steps': 120263, 'loss/train': 1.4754289388656616} 11/07/2021 14:11:17 - INFO - __main__ - Step 120265: {'lr': 4.81708083559719e-05, 'samples': 23090880, 'steps': 120264, 'loss/train': 1.4154735803604126} 11/07/2021 14:11:18 - INFO - __main__ - Step 120266: {'lr': 4.8167676794353214e-05, 'samples': 23091072, 'steps': 120265, 'loss/train': 0.7744295001029968} 11/07/2021 14:11:18 - INFO - __main__ - Step 120267: {'lr': 4.816454532367706e-05, 'samples': 23091264, 'steps': 120266, 'loss/train': 1.4385242462158203} 11/07/2021 14:11:18 - INFO - __main__ - Step 120268: {'lr': 4.816141394394488e-05, 'samples': 23091456, 'steps': 120267, 'loss/train': 1.2410346269607544} 11/07/2021 14:11:19 - INFO - __main__ - Step 120269: {'lr': 4.815828265515801e-05, 'samples': 23091648, 'steps': 120268, 'loss/train': 1.3091601133346558} 11/07/2021 14:11:20 - INFO - __main__ - Step 120270: {'lr': 4.8155151457317915e-05, 'samples': 23091840, 'steps': 120269, 'loss/train': 1.1742660999298096} 11/07/2021 14:11:20 - INFO - __main__ - Step 120271: {'lr': 4.8152020350425984e-05, 'samples': 23092032, 'steps': 120270, 'loss/train': 0.4543708264827728} 11/07/2021 14:11:21 - INFO - __main__ - Step 120272: {'lr': 4.814888933448363e-05, 'samples': 23092224, 'steps': 120271, 'loss/train': 1.2521575689315796} 11/07/2021 14:11:21 - INFO - __main__ - Step 120273: {'lr': 4.8145758409492285e-05, 'samples': 23092416, 'steps': 120272, 'loss/train': 0.9859701991081238} 11/07/2021 14:11:21 - INFO - __main__ - Step 120274: {'lr': 4.8142627575453316e-05, 'samples': 23092608, 'steps': 120273, 'loss/train': 1.12774658203125} 11/07/2021 14:11:22 - INFO - __main__ - Step 120275: {'lr': 4.813949683236821e-05, 'samples': 23092800, 'steps': 120274, 'loss/train': 1.3740277290344238} 11/07/2021 14:11:23 - INFO - __main__ - Step 120276: {'lr': 4.813636618023829e-05, 'samples': 23092992, 'steps': 120275, 'loss/train': 1.4304721355438232} 11/07/2021 14:11:23 - INFO - __main__ - Step 120277: {'lr': 4.813323561906502e-05, 'samples': 23093184, 'steps': 120276, 'loss/train': 1.2361345291137695} 11/07/2021 14:11:23 - INFO - __main__ - Step 120278: {'lr': 4.81301051488498e-05, 'samples': 23093376, 'steps': 120277, 'loss/train': 1.2920117378234863} 11/07/2021 14:11:24 - INFO - __main__ - Step 120279: {'lr': 4.812697476959405e-05, 'samples': 23093568, 'steps': 120278, 'loss/train': 1.6282291412353516} 11/07/2021 14:11:25 - INFO - __main__ - Step 120280: {'lr': 4.8123844481299166e-05, 'samples': 23093760, 'steps': 120279, 'loss/train': 1.2256158590316772} 11/07/2021 14:11:25 - INFO - __main__ - Step 120281: {'lr': 4.8120714283966555e-05, 'samples': 23093952, 'steps': 120280, 'loss/train': 1.664832592010498} 11/07/2021 14:11:25 - INFO - __main__ - Step 120282: {'lr': 4.811758417759771e-05, 'samples': 23094144, 'steps': 120281, 'loss/train': 1.2684639692306519} 11/07/2021 14:11:26 - INFO - __main__ - Step 120283: {'lr': 4.8114454162193896e-05, 'samples': 23094336, 'steps': 120282, 'loss/train': 1.237318515777588} 11/07/2021 14:11:26 - INFO - __main__ - Step 120284: {'lr': 4.8111324237756606e-05, 'samples': 23094528, 'steps': 120283, 'loss/train': 1.9162009954452515} 11/07/2021 14:11:27 - INFO - __main__ - Step 120285: {'lr': 4.810819440428721e-05, 'samples': 23094720, 'steps': 120284, 'loss/train': 1.2940704822540283} 11/07/2021 14:11:28 - INFO - __main__ - Step 120286: {'lr': 4.810506466178718e-05, 'samples': 23094912, 'steps': 120285, 'loss/train': 1.342922329902649} 11/07/2021 14:11:28 - INFO - __main__ - Step 120287: {'lr': 4.8101935010257864e-05, 'samples': 23095104, 'steps': 120286, 'loss/train': 1.2832711935043335} 11/07/2021 14:11:28 - INFO - __main__ - Step 120288: {'lr': 4.8098805449700716e-05, 'samples': 23095296, 'steps': 120287, 'loss/train': 1.6956325769424438} 11/07/2021 14:11:29 - INFO - __main__ - Step 120289: {'lr': 4.809567598011713e-05, 'samples': 23095488, 'steps': 120288, 'loss/train': 1.2152292728424072} 11/07/2021 14:11:29 - INFO - __main__ - Step 120290: {'lr': 4.809254660150852e-05, 'samples': 23095680, 'steps': 120289, 'loss/train': 1.2614145278930664} 11/07/2021 14:11:30 - INFO - __main__ - Step 120291: {'lr': 4.8089417313876286e-05, 'samples': 23095872, 'steps': 120290, 'loss/train': 0.9592404365539551} 11/07/2021 14:11:30 - INFO - __main__ - Step 120292: {'lr': 4.8086288117221846e-05, 'samples': 23096064, 'steps': 120291, 'loss/train': 0.6430933475494385} 11/07/2021 14:11:31 - INFO - __main__ - Step 120293: {'lr': 4.808315901154661e-05, 'samples': 23096256, 'steps': 120292, 'loss/train': 0.33973878622055054} 11/07/2021 14:11:31 - INFO - __main__ - Step 120294: {'lr': 4.808002999685199e-05, 'samples': 23096448, 'steps': 120293, 'loss/train': 1.2882705926895142} 11/07/2021 14:11:31 - INFO - __main__ - Step 120295: {'lr': 4.807690107313945e-05, 'samples': 23096640, 'steps': 120294, 'loss/train': 1.454314947128296} 11/07/2021 14:11:33 - INFO - __main__ - Step 120296: {'lr': 4.807377224041026e-05, 'samples': 23096832, 'steps': 120295, 'loss/train': 1.2901966571807861} 11/07/2021 14:11:33 - INFO - __main__ - Step 120297: {'lr': 4.807064349866594e-05, 'samples': 23097024, 'steps': 120296, 'loss/train': 0.932083785533905} 11/07/2021 14:11:33 - INFO - __main__ - Step 120298: {'lr': 4.806751484790786e-05, 'samples': 23097216, 'steps': 120297, 'loss/train': 1.2828067541122437} 11/07/2021 14:11:34 - INFO - __main__ - Step 120299: {'lr': 4.806438628813745e-05, 'samples': 23097408, 'steps': 120298, 'loss/train': 1.1581521034240723} 11/07/2021 14:11:34 - INFO - __main__ - Step 120300: {'lr': 4.806125781935611e-05, 'samples': 23097600, 'steps': 120299, 'loss/train': 1.4659963846206665} 11/07/2021 14:11:35 - INFO - __main__ - Step 120301: {'lr': 4.805812944156526e-05, 'samples': 23097792, 'steps': 120300, 'loss/train': 1.4337530136108398} 11/07/2021 14:11:35 - INFO - __main__ - Step 120302: {'lr': 4.805500115476627e-05, 'samples': 23097984, 'steps': 120301, 'loss/train': 1.4733409881591797} 11/07/2021 14:11:36 - INFO - __main__ - Step 120303: {'lr': 4.80518729589606e-05, 'samples': 23098176, 'steps': 120302, 'loss/train': 1.8828039169311523} 11/07/2021 14:11:36 - INFO - __main__ - Step 120304: {'lr': 4.8048744854149643e-05, 'samples': 23098368, 'steps': 120303, 'loss/train': 1.397469401359558} 11/07/2021 14:11:36 - INFO - __main__ - Step 120305: {'lr': 4.804561684033482e-05, 'samples': 23098560, 'steps': 120304, 'loss/train': 1.1476843357086182} 11/07/2021 14:11:37 - INFO - __main__ - Step 120306: {'lr': 4.80424889175175e-05, 'samples': 23098752, 'steps': 120305, 'loss/train': 1.268188238143921} 11/07/2021 14:11:38 - INFO - __main__ - Step 120307: {'lr': 4.803936108569912e-05, 'samples': 23098944, 'steps': 120306, 'loss/train': 1.6402806043624878} 11/07/2021 14:11:38 - INFO - __main__ - Step 120308: {'lr': 4.803623334488111e-05, 'samples': 23099136, 'steps': 120307, 'loss/train': 2.0799522399902344} 11/07/2021 14:11:38 - INFO - __main__ - Step 120309: {'lr': 4.803310569506489e-05, 'samples': 23099328, 'steps': 120308, 'loss/train': 1.050684928894043} 11/07/2021 14:11:39 - INFO - __main__ - Step 120310: {'lr': 4.802997813625179e-05, 'samples': 23099520, 'steps': 120309, 'loss/train': 1.4872695207595825} 11/07/2021 14:11:40 - INFO - __main__ - Step 120311: {'lr': 4.8026850668443256e-05, 'samples': 23099712, 'steps': 120310, 'loss/train': 1.4029099941253662} 11/07/2021 14:11:40 - INFO - __main__ - Step 120312: {'lr': 4.80237232916407e-05, 'samples': 23099904, 'steps': 120311, 'loss/train': 2.024980068206787} 11/07/2021 14:11:41 - INFO - __main__ - Step 120313: {'lr': 4.802059600584557e-05, 'samples': 23100096, 'steps': 120312, 'loss/train': 0.9432738423347473} 11/07/2021 14:11:41 - INFO - __main__ - Step 120314: {'lr': 4.8017468811059226e-05, 'samples': 23100288, 'steps': 120313, 'loss/train': 0.6977756023406982} 11/07/2021 14:11:41 - INFO - __main__ - Step 120315: {'lr': 4.801434170728308e-05, 'samples': 23100480, 'steps': 120314, 'loss/train': 2.286691427230835} 11/07/2021 14:11:42 - INFO - __main__ - Step 120316: {'lr': 4.801121469451855e-05, 'samples': 23100672, 'steps': 120315, 'loss/train': 0.9418417811393738} 11/07/2021 14:11:43 - INFO - __main__ - Step 120317: {'lr': 4.800808777276708e-05, 'samples': 23100864, 'steps': 120316, 'loss/train': 1.5285557508468628} 11/07/2021 14:11:43 - INFO - __main__ - Step 120318: {'lr': 4.8004960942030026e-05, 'samples': 23101056, 'steps': 120317, 'loss/train': 1.0863614082336426} 11/07/2021 14:11:43 - INFO - __main__ - Step 120319: {'lr': 4.800183420230883e-05, 'samples': 23101248, 'steps': 120318, 'loss/train': 0.8687788248062134} 11/07/2021 14:11:44 - INFO - __main__ - Step 120320: {'lr': 4.799870755360489e-05, 'samples': 23101440, 'steps': 120319, 'loss/train': 1.4721003770828247} 11/07/2021 14:11:44 - INFO - __main__ - Step 120321: {'lr': 4.7995580995919605e-05, 'samples': 23101632, 'steps': 120320, 'loss/train': 1.1684966087341309} 11/07/2021 14:11:45 - INFO - __main__ - Step 120322: {'lr': 4.799245452925446e-05, 'samples': 23101824, 'steps': 120321, 'loss/train': 0.3721439838409424} 11/07/2021 14:11:46 - INFO - __main__ - Step 120323: {'lr': 4.798932815361076e-05, 'samples': 23102016, 'steps': 120322, 'loss/train': 1.692816972732544} 11/07/2021 14:11:46 - INFO - __main__ - Step 120324: {'lr': 4.798620186898991e-05, 'samples': 23102208, 'steps': 120323, 'loss/train': 1.191178560256958} 11/07/2021 14:11:46 - INFO - __main__ - Step 120325: {'lr': 4.798307567539339e-05, 'samples': 23102400, 'steps': 120324, 'loss/train': 0.7862756848335266} 11/07/2021 14:11:47 - INFO - __main__ - Step 120326: {'lr': 4.797994957282256e-05, 'samples': 23102592, 'steps': 120325, 'loss/train': 1.0134766101837158} 11/07/2021 14:11:48 - INFO - __main__ - Step 120327: {'lr': 4.797682356127886e-05, 'samples': 23102784, 'steps': 120326, 'loss/train': 1.1692076921463013} 11/07/2021 14:11:48 - INFO - __main__ - Step 120328: {'lr': 4.7973697640763704e-05, 'samples': 23102976, 'steps': 120327, 'loss/train': 1.1730672121047974} 11/07/2021 14:11:48 - INFO - __main__ - Step 120329: {'lr': 4.797057181127845e-05, 'samples': 23103168, 'steps': 120328, 'loss/train': 1.6768351793289185} 11/07/2021 14:11:49 - INFO - __main__ - Step 120330: {'lr': 4.796744607282455e-05, 'samples': 23103360, 'steps': 120329, 'loss/train': 1.5456349849700928} 11/07/2021 14:11:49 - INFO - __main__ - Step 120331: {'lr': 4.796432042540341e-05, 'samples': 23103552, 'steps': 120330, 'loss/train': 1.3238252401351929} 11/07/2021 14:11:50 - INFO - __main__ - Step 120332: {'lr': 4.796119486901643e-05, 'samples': 23103744, 'steps': 120331, 'loss/train': 1.3529889583587646} 11/07/2021 14:11:50 - INFO - __main__ - Step 120333: {'lr': 4.795806940366498e-05, 'samples': 23103936, 'steps': 120332, 'loss/train': 1.182174801826477} 11/07/2021 14:11:51 - INFO - __main__ - Step 120334: {'lr': 4.795494402935055e-05, 'samples': 23104128, 'steps': 120333, 'loss/train': 1.5143060684204102} 11/07/2021 14:11:51 - INFO - __main__ - Step 120335: {'lr': 4.795181874607449e-05, 'samples': 23104320, 'steps': 120334, 'loss/train': 1.442817211151123} 11/07/2021 14:11:51 - INFO - __main__ - Step 120336: {'lr': 4.79486935538383e-05, 'samples': 23104512, 'steps': 120335, 'loss/train': 0.9767188429832458} 11/07/2021 14:11:53 - INFO - __main__ - Step 120337: {'lr': 4.794556845264322e-05, 'samples': 23104704, 'steps': 120336, 'loss/train': 1.660335898399353} 11/07/2021 14:11:53 - INFO - __main__ - Step 120338: {'lr': 4.794244344249077e-05, 'samples': 23104896, 'steps': 120337, 'loss/train': 1.1376748085021973} 11/07/2021 14:11:53 - INFO - __main__ - Step 120339: {'lr': 4.793931852338232e-05, 'samples': 23105088, 'steps': 120338, 'loss/train': 2.4820117950439453} 11/07/2021 14:11:54 - INFO - __main__ - Step 120340: {'lr': 4.793619369531932e-05, 'samples': 23105280, 'steps': 120339, 'loss/train': 1.7427655458450317} 11/07/2021 14:11:54 - INFO - __main__ - Step 120341: {'lr': 4.793306895830313e-05, 'samples': 23105472, 'steps': 120340, 'loss/train': 1.5387805700302124} 11/07/2021 14:11:55 - INFO - __main__ - Step 120342: {'lr': 4.792994431233519e-05, 'samples': 23105664, 'steps': 120341, 'loss/train': 0.8595216274261475} 11/07/2021 14:11:55 - INFO - __main__ - Step 120343: {'lr': 4.792681975741689e-05, 'samples': 23105856, 'steps': 120342, 'loss/train': 0.8551942110061646} 11/07/2021 14:11:56 - INFO - __main__ - Step 120344: {'lr': 4.7923695293549645e-05, 'samples': 23106048, 'steps': 120343, 'loss/train': 1.1068717241287231} 11/07/2021 14:11:56 - INFO - __main__ - Step 120345: {'lr': 4.7920570920734905e-05, 'samples': 23106240, 'steps': 120344, 'loss/train': 1.457632303237915} 11/07/2021 14:11:56 - INFO - __main__ - Step 120346: {'lr': 4.791744663897399e-05, 'samples': 23106432, 'steps': 120345, 'loss/train': 1.1990242004394531} 11/07/2021 14:11:57 - INFO - __main__ - Step 120347: {'lr': 4.791432244826838e-05, 'samples': 23106624, 'steps': 120346, 'loss/train': 1.3533474206924438} 11/07/2021 14:11:58 - INFO - __main__ - Step 120348: {'lr': 4.7911198348619514e-05, 'samples': 23106816, 'steps': 120347, 'loss/train': 1.7677520513534546} 11/07/2021 14:11:58 - INFO - __main__ - Step 120349: {'lr': 4.790807434002867e-05, 'samples': 23107008, 'steps': 120348, 'loss/train': 1.5373070240020752} 11/07/2021 14:11:58 - INFO - __main__ - Step 120350: {'lr': 4.7904950422497343e-05, 'samples': 23107200, 'steps': 120349, 'loss/train': 1.4748201370239258} 11/07/2021 14:11:59 - INFO - __main__ - Step 120351: {'lr': 4.790182659602693e-05, 'samples': 23107392, 'steps': 120350, 'loss/train': 1.1900302171707153} 11/07/2021 14:11:59 - INFO - __main__ - Step 120352: {'lr': 4.7898702860618815e-05, 'samples': 23107584, 'steps': 120351, 'loss/train': 1.2825058698654175} 11/07/2021 14:12:00 - INFO - __main__ - Step 120353: {'lr': 4.789557921627444e-05, 'samples': 23107776, 'steps': 120352, 'loss/train': 1.7128106355667114} 11/07/2021 14:12:01 - INFO - __main__ - Step 120354: {'lr': 4.789245566299519e-05, 'samples': 23107968, 'steps': 120353, 'loss/train': 1.3823878765106201} 11/07/2021 14:12:01 - INFO - __main__ - Step 120355: {'lr': 4.788933220078251e-05, 'samples': 23108160, 'steps': 120354, 'loss/train': 1.6369564533233643} 11/07/2021 14:12:01 - INFO - __main__ - Step 120356: {'lr': 4.7886208829637734e-05, 'samples': 23108352, 'steps': 120355, 'loss/train': 0.10322218388319016} 11/07/2021 14:12:02 - INFO - __main__ - Step 120357: {'lr': 4.788308554956233e-05, 'samples': 23108544, 'steps': 120356, 'loss/train': 1.527501106262207} 11/07/2021 14:12:03 - INFO - __main__ - Step 120358: {'lr': 4.787996236055772e-05, 'samples': 23108736, 'steps': 120357, 'loss/train': 1.079404354095459} 11/07/2021 14:12:03 - INFO - __main__ - Step 120359: {'lr': 4.787683926262532e-05, 'samples': 23108928, 'steps': 120358, 'loss/train': 0.7230713367462158} 11/07/2021 14:12:03 - INFO - __main__ - Step 120360: {'lr': 4.787371625576642e-05, 'samples': 23109120, 'steps': 120359, 'loss/train': 1.3587778806686401} 11/07/2021 14:12:04 - INFO - __main__ - Step 120361: {'lr': 4.787059333998251e-05, 'samples': 23109312, 'steps': 120360, 'loss/train': 1.0805977582931519} 11/07/2021 14:12:04 - INFO - __main__ - Step 120362: {'lr': 4.786747051527501e-05, 'samples': 23109504, 'steps': 120361, 'loss/train': 1.5920013189315796} 11/07/2021 14:12:05 - INFO - __main__ - Step 120363: {'lr': 4.786434778164528e-05, 'samples': 23109696, 'steps': 120362, 'loss/train': 1.4050005674362183} 11/07/2021 14:12:06 - INFO - __main__ - Step 120364: {'lr': 4.7861225139094774e-05, 'samples': 23109888, 'steps': 120363, 'loss/train': 1.0521420240402222} 11/07/2021 14:12:06 - INFO - __main__ - Step 120365: {'lr': 4.785810258762488e-05, 'samples': 23110080, 'steps': 120364, 'loss/train': 1.314132571220398} 11/07/2021 14:12:06 - INFO - __main__ - Step 120366: {'lr': 4.785498012723702e-05, 'samples': 23110272, 'steps': 120365, 'loss/train': 1.1519006490707397} 11/07/2021 14:12:07 - INFO - __main__ - Step 120367: {'lr': 4.785185775793257e-05, 'samples': 23110464, 'steps': 120366, 'loss/train': 1.510465145111084} 11/07/2021 14:12:08 - INFO - __main__ - Step 120368: {'lr': 4.784873547971294e-05, 'samples': 23110656, 'steps': 120367, 'loss/train': 1.2129093408584595} 11/07/2021 14:12:08 - INFO - __main__ - Step 120369: {'lr': 4.784561329257958e-05, 'samples': 23110848, 'steps': 120368, 'loss/train': 1.4831136465072632} 11/07/2021 14:12:09 - INFO - __main__ - Step 120370: {'lr': 4.7842491196533914e-05, 'samples': 23111040, 'steps': 120369, 'loss/train': 1.632554292678833} 11/07/2021 14:12:09 - INFO - __main__ - Step 120371: {'lr': 4.7839369191577246e-05, 'samples': 23111232, 'steps': 120370, 'loss/train': 1.2769267559051514} 11/07/2021 14:12:09 - INFO - __main__ - Step 120372: {'lr': 4.783624727771102e-05, 'samples': 23111424, 'steps': 120371, 'loss/train': 1.1071115732192993} 11/07/2021 14:12:10 - INFO - __main__ - Step 120373: {'lr': 4.7833125454936676e-05, 'samples': 23111616, 'steps': 120372, 'loss/train': 1.2749931812286377} 11/07/2021 14:12:11 - INFO - __main__ - Step 120374: {'lr': 4.7830003723255605e-05, 'samples': 23111808, 'steps': 120373, 'loss/train': 0.2216758280992508} 11/07/2021 14:12:11 - INFO - __main__ - Step 120375: {'lr': 4.782688208266922e-05, 'samples': 23112000, 'steps': 120374, 'loss/train': 1.3222131729125977} 11/07/2021 14:12:12 - INFO - __main__ - Step 120376: {'lr': 4.7823760533178914e-05, 'samples': 23112192, 'steps': 120375, 'loss/train': 1.323294997215271} 11/07/2021 14:12:12 - INFO - __main__ - Step 120377: {'lr': 4.782063907478609e-05, 'samples': 23112384, 'steps': 120376, 'loss/train': 1.1837804317474365} 11/07/2021 14:12:12 - INFO - __main__ - Step 120378: {'lr': 4.781751770749221e-05, 'samples': 23112576, 'steps': 120377, 'loss/train': 1.5097836256027222} 11/07/2021 14:12:13 - INFO - __main__ - Step 120379: {'lr': 4.781439643129859e-05, 'samples': 23112768, 'steps': 120378, 'loss/train': 1.2944002151489258} 11/07/2021 14:12:14 - INFO - __main__ - Step 120380: {'lr': 4.781127524620671e-05, 'samples': 23112960, 'steps': 120379, 'loss/train': 1.460050344467163} 11/07/2021 14:12:14 - INFO - __main__ - Step 120381: {'lr': 4.780815415221801e-05, 'samples': 23113152, 'steps': 120380, 'loss/train': 1.273977279663086} 11/07/2021 14:12:14 - INFO - __main__ - Step 120382: {'lr': 4.780503314933376e-05, 'samples': 23113344, 'steps': 120381, 'loss/train': 1.5028209686279297} 11/07/2021 14:12:15 - INFO - __main__ - Step 120383: {'lr': 4.780191223755545e-05, 'samples': 23113536, 'steps': 120382, 'loss/train': 1.2870595455169678} 11/07/2021 14:12:16 - INFO - __main__ - Step 120384: {'lr': 4.779879141688448e-05, 'samples': 23113728, 'steps': 120383, 'loss/train': 1.1814292669296265} 11/07/2021 14:12:16 - INFO - __main__ - Step 120385: {'lr': 4.779567068732224e-05, 'samples': 23113920, 'steps': 120384, 'loss/train': 0.8663395643234253} 11/07/2021 14:12:16 - INFO - __main__ - Step 120386: {'lr': 4.7792550048870146e-05, 'samples': 23114112, 'steps': 120385, 'loss/train': 1.2035638093948364} 11/07/2021 14:12:17 - INFO - __main__ - Step 120387: {'lr': 4.7789429501529644e-05, 'samples': 23114304, 'steps': 120386, 'loss/train': 1.3001556396484375} 11/07/2021 14:12:17 - INFO - __main__ - Step 120388: {'lr': 4.7786309045302065e-05, 'samples': 23114496, 'steps': 120387, 'loss/train': 1.211769938468933} 11/07/2021 14:12:18 - INFO - __main__ - Step 120389: {'lr': 4.7783188680188884e-05, 'samples': 23114688, 'steps': 120388, 'loss/train': 0.6733616590499878} 11/07/2021 14:12:19 - INFO - __main__ - Step 120390: {'lr': 4.778006840619148e-05, 'samples': 23114880, 'steps': 120389, 'loss/train': 1.5268113613128662} 11/07/2021 14:12:19 - INFO - __main__ - Step 120391: {'lr': 4.7776948223311215e-05, 'samples': 23115072, 'steps': 120390, 'loss/train': 1.4162461757659912} 11/07/2021 14:12:19 - INFO - __main__ - Step 120392: {'lr': 4.7773828131549654e-05, 'samples': 23115264, 'steps': 120391, 'loss/train': 2.667860746383667} 11/07/2021 14:12:20 - INFO - __main__ - Step 120393: {'lr': 4.7770708130908e-05, 'samples': 23115456, 'steps': 120392, 'loss/train': 1.353355050086975} 11/07/2021 14:12:20 - INFO - __main__ - Step 120394: {'lr': 4.776758822138774e-05, 'samples': 23115648, 'steps': 120393, 'loss/train': 1.2838785648345947} 11/07/2021 14:12:21 - INFO - __main__ - Step 120395: {'lr': 4.776446840299028e-05, 'samples': 23115840, 'steps': 120394, 'loss/train': 1.4385602474212646} 11/07/2021 14:12:22 - INFO - __main__ - Step 120396: {'lr': 4.776134867571705e-05, 'samples': 23116032, 'steps': 120395, 'loss/train': 1.540807843208313} 11/07/2021 14:12:22 - INFO - __main__ - Step 120397: {'lr': 4.7758229039569414e-05, 'samples': 23116224, 'steps': 120396, 'loss/train': 1.097678542137146} 11/07/2021 14:12:22 - INFO - __main__ - Step 120398: {'lr': 4.775510949454881e-05, 'samples': 23116416, 'steps': 120397, 'loss/train': 1.5890893936157227} 11/07/2021 14:12:23 - INFO - __main__ - Step 120399: {'lr': 4.775199004065661e-05, 'samples': 23116608, 'steps': 120398, 'loss/train': 0.873512327671051} 11/07/2021 14:12:24 - INFO - __main__ - Step 120400: {'lr': 4.774887067789427e-05, 'samples': 23116800, 'steps': 120399, 'loss/train': 1.4339008331298828} 11/07/2021 14:12:24 - INFO - __main__ - Step 120401: {'lr': 4.7745751406263163e-05, 'samples': 23116992, 'steps': 120400, 'loss/train': 1.293571949005127} 11/07/2021 14:12:24 - INFO - __main__ - Step 120402: {'lr': 4.774263222576469e-05, 'samples': 23117184, 'steps': 120401, 'loss/train': 1.375727891921997} 11/07/2021 14:12:25 - INFO - __main__ - Step 120403: {'lr': 4.7739513136400346e-05, 'samples': 23117376, 'steps': 120402, 'loss/train': 1.1187796592712402} 11/07/2021 14:12:25 - INFO - __main__ - Step 120404: {'lr': 4.773639413817138e-05, 'samples': 23117568, 'steps': 120403, 'loss/train': 1.4095529317855835} 11/07/2021 14:12:26 - INFO - __main__ - Step 120405: {'lr': 4.773327523107926e-05, 'samples': 23117760, 'steps': 120404, 'loss/train': 1.4595754146575928} 11/07/2021 14:12:26 - INFO - __main__ - Step 120406: {'lr': 4.773015641512543e-05, 'samples': 23117952, 'steps': 120405, 'loss/train': 1.0071812868118286} 11/07/2021 14:12:27 - INFO - __main__ - Step 120407: {'lr': 4.772703769031125e-05, 'samples': 23118144, 'steps': 120406, 'loss/train': 1.5213820934295654} 11/07/2021 14:12:27 - INFO - __main__ - Step 120408: {'lr': 4.772391905663817e-05, 'samples': 23118336, 'steps': 120407, 'loss/train': 1.1671143770217896} 11/07/2021 14:12:28 - INFO - __main__ - Step 120409: {'lr': 4.772080051410757e-05, 'samples': 23118528, 'steps': 120408, 'loss/train': 0.6533594131469727} 11/07/2021 14:12:29 - INFO - __main__ - Step 120410: {'lr': 4.771768206272084e-05, 'samples': 23118720, 'steps': 120409, 'loss/train': 1.1946457624435425} 11/07/2021 14:12:29 - INFO - __main__ - Step 120411: {'lr': 4.77145637024794e-05, 'samples': 23118912, 'steps': 120410, 'loss/train': 1.3253358602523804} 11/07/2021 14:12:29 - INFO - __main__ - Step 120412: {'lr': 4.771144543338465e-05, 'samples': 23119104, 'steps': 120411, 'loss/train': 0.988019585609436} 11/07/2021 14:12:30 - INFO - __main__ - Step 120413: {'lr': 4.770832725543803e-05, 'samples': 23119296, 'steps': 120412, 'loss/train': 0.704111635684967} 11/07/2021 14:12:30 - INFO - __main__ - Step 120414: {'lr': 4.770520916864088e-05, 'samples': 23119488, 'steps': 120413, 'loss/train': 1.3998943567276} 11/07/2021 14:12:31 - INFO - __main__ - Step 120415: {'lr': 4.7702091172994676e-05, 'samples': 23119680, 'steps': 120414, 'loss/train': 1.168453335762024} 11/07/2021 14:12:31 - INFO - __main__ - Step 120416: {'lr': 4.7698973268500836e-05, 'samples': 23119872, 'steps': 120415, 'loss/train': 1.6341556310653687} 11/07/2021 14:12:32 - INFO - __main__ - Step 120417: {'lr': 4.7695855455160644e-05, 'samples': 23120064, 'steps': 120416, 'loss/train': 1.1686806678771973} 11/07/2021 14:12:32 - INFO - __main__ - Step 120418: {'lr': 4.769273773297558e-05, 'samples': 23120256, 'steps': 120417, 'loss/train': 1.0223095417022705} 11/07/2021 14:12:32 - INFO - __main__ - Step 120419: {'lr': 4.768962010194705e-05, 'samples': 23120448, 'steps': 120418, 'loss/train': 1.163737416267395} 11/07/2021 14:12:33 - INFO - __main__ - Step 120420: {'lr': 4.7686502562076465e-05, 'samples': 23120640, 'steps': 120419, 'loss/train': 1.3257008790969849} 11/07/2021 14:12:34 - INFO - __main__ - Step 120421: {'lr': 4.7683385113365227e-05, 'samples': 23120832, 'steps': 120420, 'loss/train': 1.8067171573638916} 11/07/2021 14:12:34 - INFO - __main__ - Step 120422: {'lr': 4.768026775581472e-05, 'samples': 23121024, 'steps': 120421, 'loss/train': 1.436998963356018} 11/07/2021 14:12:34 - INFO - __main__ - Step 120423: {'lr': 4.7677150489426365e-05, 'samples': 23121216, 'steps': 120422, 'loss/train': 1.2868560552597046} 11/07/2021 14:12:35 - INFO - __main__ - Step 120424: {'lr': 4.767403331420156e-05, 'samples': 23121408, 'steps': 120423, 'loss/train': 1.6955833435058594} 11/07/2021 14:12:35 - INFO - __main__ - Step 120425: {'lr': 4.767091623014169e-05, 'samples': 23121600, 'steps': 120424, 'loss/train': 1.2779031991958618} 11/07/2021 14:12:36 - INFO - __main__ - Step 120426: {'lr': 4.766779923724823e-05, 'samples': 23121792, 'steps': 120425, 'loss/train': 1.3792024850845337} 11/07/2021 14:12:36 - INFO - __main__ - Step 120427: {'lr': 4.766468233552251e-05, 'samples': 23121984, 'steps': 120426, 'loss/train': 1.3543767929077148} 11/07/2021 14:12:37 - INFO - __main__ - Step 120428: {'lr': 4.766156552496595e-05, 'samples': 23122176, 'steps': 120427, 'loss/train': 1.4243202209472656} 11/07/2021 14:12:37 - INFO - __main__ - Step 120429: {'lr': 4.7658448805579984e-05, 'samples': 23122368, 'steps': 120428, 'loss/train': 1.4247467517852783} 11/07/2021 14:12:38 - INFO - __main__ - Step 120430: {'lr': 4.765533217736609e-05, 'samples': 23122560, 'steps': 120429, 'loss/train': 1.2251930236816406} 11/07/2021 14:12:39 - INFO - __main__ - Step 120431: {'lr': 4.7652215640325485e-05, 'samples': 23122752, 'steps': 120430, 'loss/train': 1.4758554697036743} 11/07/2021 14:12:39 - INFO - __main__ - Step 120432: {'lr': 4.764909919445967e-05, 'samples': 23122944, 'steps': 120431, 'loss/train': 1.174038052558899} 11/07/2021 14:12:39 - INFO - __main__ - Step 120433: {'lr': 4.7645982839770034e-05, 'samples': 23123136, 'steps': 120432, 'loss/train': 0.6481460928916931} 11/07/2021 14:12:40 - INFO - __main__ - Step 120434: {'lr': 4.764286657625802e-05, 'samples': 23123328, 'steps': 120433, 'loss/train': 0.8095322251319885} 11/07/2021 14:12:40 - INFO - __main__ - Step 120435: {'lr': 4.7639750403925016e-05, 'samples': 23123520, 'steps': 120434, 'loss/train': 1.4036377668380737} 11/07/2021 14:12:41 - INFO - __main__ - Step 120436: {'lr': 4.7636634322772405e-05, 'samples': 23123712, 'steps': 120435, 'loss/train': 1.3286656141281128} 11/07/2021 14:12:41 - INFO - __main__ - Step 120437: {'lr': 4.7633518332801576e-05, 'samples': 23123904, 'steps': 120436, 'loss/train': 1.1105817556381226} 11/07/2021 14:12:42 - INFO - __main__ - Step 120438: {'lr': 4.7630402434014006e-05, 'samples': 23124096, 'steps': 120437, 'loss/train': 1.0492643117904663} 11/07/2021 14:12:42 - INFO - __main__ - Step 120439: {'lr': 4.7627286626411025e-05, 'samples': 23124288, 'steps': 120438, 'loss/train': 1.3097710609436035} 11/07/2021 14:12:43 - INFO - __main__ - Step 120440: {'lr': 4.7624170909994074e-05, 'samples': 23124480, 'steps': 120439, 'loss/train': 1.4505386352539062} 11/07/2021 14:12:44 - INFO - __main__ - Step 120441: {'lr': 4.762105528476457e-05, 'samples': 23124672, 'steps': 120440, 'loss/train': 0.7380849123001099} 11/07/2021 14:12:44 - INFO - __main__ - Step 120442: {'lr': 4.761793975072387e-05, 'samples': 23124864, 'steps': 120441, 'loss/train': 1.40951669216156} 11/07/2021 14:12:44 - INFO - __main__ - Step 120443: {'lr': 4.761482430787348e-05, 'samples': 23125056, 'steps': 120442, 'loss/train': 1.232000470161438} 11/07/2021 14:12:45 - INFO - __main__ - Step 120444: {'lr': 4.761170895621464e-05, 'samples': 23125248, 'steps': 120443, 'loss/train': 1.4045112133026123} 11/07/2021 14:12:45 - INFO - __main__ - Step 120445: {'lr': 4.7608593695748856e-05, 'samples': 23125440, 'steps': 120444, 'loss/train': 1.4533940553665161} 11/07/2021 14:12:46 - INFO - __main__ - Step 120446: {'lr': 4.760547852647753e-05, 'samples': 23125632, 'steps': 120445, 'loss/train': 1.3073805570602417} 11/07/2021 14:12:46 - INFO - __main__ - Step 120447: {'lr': 4.760236344840202e-05, 'samples': 23125824, 'steps': 120446, 'loss/train': 1.1837069988250732} 11/07/2021 14:12:47 - INFO - __main__ - Step 120448: {'lr': 4.7599248461523805e-05, 'samples': 23126016, 'steps': 120447, 'loss/train': 1.239042043685913} 11/07/2021 14:12:47 - INFO - __main__ - Step 120449: {'lr': 4.759613356584422e-05, 'samples': 23126208, 'steps': 120448, 'loss/train': 1.155797004699707} 11/07/2021 14:12:47 - INFO - __main__ - Step 120450: {'lr': 4.7593018761364685e-05, 'samples': 23126400, 'steps': 120449, 'loss/train': 1.2867704629898071} 11/07/2021 14:12:49 - INFO - __main__ - Step 120451: {'lr': 4.758990404808664e-05, 'samples': 23126592, 'steps': 120450, 'loss/train': 1.446480631828308} 11/07/2021 14:12:49 - INFO - __main__ - Step 120452: {'lr': 4.758678942601144e-05, 'samples': 23126784, 'steps': 120451, 'loss/train': 1.6417773962020874} 11/07/2021 14:12:49 - INFO - __main__ - Step 120453: {'lr': 4.758367489514051e-05, 'samples': 23126976, 'steps': 120452, 'loss/train': 1.0928924083709717} 11/07/2021 14:12:50 - INFO - __main__ - Step 120454: {'lr': 4.7580560455475264e-05, 'samples': 23127168, 'steps': 120453, 'loss/train': 0.47505369782447815} 11/07/2021 14:12:50 - INFO - __main__ - Step 120455: {'lr': 4.7577446107017086e-05, 'samples': 23127360, 'steps': 120454, 'loss/train': 1.3834367990493774} 11/07/2021 14:12:51 - INFO - __main__ - Step 120456: {'lr': 4.757433184976737e-05, 'samples': 23127552, 'steps': 120455, 'loss/train': 0.6384744644165039} 11/07/2021 14:12:52 - INFO - __main__ - Step 120457: {'lr': 4.757121768372763e-05, 'samples': 23127744, 'steps': 120456, 'loss/train': 0.32480108737945557} 11/07/2021 14:12:52 - INFO - __main__ - Step 120458: {'lr': 4.75681036088991e-05, 'samples': 23127936, 'steps': 120457, 'loss/train': 1.7155996561050415} 11/07/2021 14:12:52 - INFO - __main__ - Step 120459: {'lr': 4.7564989625283245e-05, 'samples': 23128128, 'steps': 120458, 'loss/train': 0.9421411156654358} 11/07/2021 14:12:53 - INFO - __main__ - Step 120460: {'lr': 4.756187573288151e-05, 'samples': 23128320, 'steps': 120459, 'loss/train': 1.0685536861419678} 11/07/2021 14:12:53 - INFO - __main__ - Step 120461: {'lr': 4.7558761931695255e-05, 'samples': 23128512, 'steps': 120460, 'loss/train': 0.23116599023342133} 11/07/2021 14:12:54 - INFO - __main__ - Step 120462: {'lr': 4.75556482217259e-05, 'samples': 23128704, 'steps': 120461, 'loss/train': 1.4154512882232666} 11/07/2021 14:12:54 - INFO - __main__ - Step 120463: {'lr': 4.7552534602974846e-05, 'samples': 23128896, 'steps': 120462, 'loss/train': 0.9064000248908997} 11/07/2021 14:12:55 - INFO - __main__ - Step 120464: {'lr': 4.7549421075443497e-05, 'samples': 23129088, 'steps': 120463, 'loss/train': 1.0633245706558228} 11/07/2021 14:12:55 - INFO - __main__ - Step 120465: {'lr': 4.754630763913323e-05, 'samples': 23129280, 'steps': 120464, 'loss/train': 1.3476102352142334} 11/07/2021 14:12:56 - INFO - __main__ - Step 120466: {'lr': 4.7543194294045495e-05, 'samples': 23129472, 'steps': 120465, 'loss/train': 1.8033486604690552} 11/07/2021 14:12:56 - INFO - __main__ - Step 120467: {'lr': 4.7540081040181675e-05, 'samples': 23129664, 'steps': 120466, 'loss/train': 1.2515058517456055} 11/07/2021 14:12:57 - INFO - __main__ - Step 120468: {'lr': 4.753696787754319e-05, 'samples': 23129856, 'steps': 120467, 'loss/train': 1.2245607376098633} 11/07/2021 14:12:57 - INFO - __main__ - Step 120469: {'lr': 4.7533854806131396e-05, 'samples': 23130048, 'steps': 120468, 'loss/train': 0.1935606598854065} 11/07/2021 14:12:58 - INFO - __main__ - Step 120470: {'lr': 4.75307418259478e-05, 'samples': 23130240, 'steps': 120469, 'loss/train': 1.3291820287704468} 11/07/2021 14:12:58 - INFO - __main__ - Step 120471: {'lr': 4.752762893699364e-05, 'samples': 23130432, 'steps': 120470, 'loss/train': 1.1788811683654785} 11/07/2021 14:12:58 - INFO - __main__ - Step 120472: {'lr': 4.752451613927042e-05, 'samples': 23130624, 'steps': 120471, 'loss/train': 1.1817394495010376} 11/07/2021 14:12:59 - INFO - __main__ - Step 120473: {'lr': 4.752140343277953e-05, 'samples': 23130816, 'steps': 120472, 'loss/train': 1.2232729196548462} 11/07/2021 14:13:00 - INFO - __main__ - Step 120474: {'lr': 4.7518290817522375e-05, 'samples': 23131008, 'steps': 120473, 'loss/train': 1.217445731163025} 11/07/2021 14:13:00 - INFO - __main__ - Step 120475: {'lr': 4.7515178293500354e-05, 'samples': 23131200, 'steps': 120474, 'loss/train': 1.095641016960144} 11/07/2021 14:13:00 - INFO - __main__ - Step 120476: {'lr': 4.751206586071485e-05, 'samples': 23131392, 'steps': 120475, 'loss/train': 1.620357871055603} 11/07/2021 14:13:01 - INFO - __main__ - Step 120477: {'lr': 4.750895351916732e-05, 'samples': 23131584, 'steps': 120476, 'loss/train': 1.776129126548767} 11/07/2021 14:13:02 - INFO - __main__ - Step 120478: {'lr': 4.750584126885909e-05, 'samples': 23131776, 'steps': 120477, 'loss/train': 0.9443041682243347} 11/07/2021 14:13:02 - INFO - __main__ - Step 120479: {'lr': 4.750272910979164e-05, 'samples': 23131968, 'steps': 120478, 'loss/train': 1.3843379020690918} 11/07/2021 14:13:02 - INFO - __main__ - Step 120480: {'lr': 4.749961704196632e-05, 'samples': 23132160, 'steps': 120479, 'loss/train': 1.222428321838379} 11/07/2021 14:13:03 - INFO - __main__ - Step 120481: {'lr': 4.749650506538453e-05, 'samples': 23132352, 'steps': 120480, 'loss/train': 1.1263563632965088} 11/07/2021 14:13:03 - INFO - __main__ - Step 120482: {'lr': 4.7493393180047725e-05, 'samples': 23132544, 'steps': 120481, 'loss/train': 1.1758739948272705} 11/07/2021 14:13:04 - INFO - __main__ - Step 120483: {'lr': 4.749028138595723e-05, 'samples': 23132736, 'steps': 120482, 'loss/train': 1.4212102890014648} 11/07/2021 14:13:05 - INFO - __main__ - Step 120484: {'lr': 4.748716968311459e-05, 'samples': 23132928, 'steps': 120483, 'loss/train': 1.2916244268417358} 11/07/2021 14:13:05 - INFO - __main__ - Step 120485: {'lr': 4.7484058071521036e-05, 'samples': 23133120, 'steps': 120484, 'loss/train': 1.3810099363327026} 11/07/2021 14:13:05 - INFO - __main__ - Step 120486: {'lr': 4.748094655117805e-05, 'samples': 23133312, 'steps': 120485, 'loss/train': 1.413669466972351} 11/07/2021 14:13:06 - INFO - __main__ - Step 120487: {'lr': 4.7477835122087e-05, 'samples': 23133504, 'steps': 120486, 'loss/train': 1.357480764389038} 11/07/2021 14:13:06 - INFO - __main__ - Step 120488: {'lr': 4.7474723784249304e-05, 'samples': 23133696, 'steps': 120487, 'loss/train': 1.289093017578125} 11/07/2021 14:13:07 - INFO - __main__ - Step 120489: {'lr': 4.747161253766641e-05, 'samples': 23133888, 'steps': 120488, 'loss/train': 1.3127951622009277} 11/07/2021 14:13:07 - INFO - __main__ - Step 120490: {'lr': 4.746850138233966e-05, 'samples': 23134080, 'steps': 120489, 'loss/train': 1.083760380744934} 11/07/2021 14:13:08 - INFO - __main__ - Step 120491: {'lr': 4.746539031827046e-05, 'samples': 23134272, 'steps': 120490, 'loss/train': 1.5352708101272583} 11/07/2021 14:13:08 - INFO - __main__ - Step 120492: {'lr': 4.746227934546027e-05, 'samples': 23134464, 'steps': 120491, 'loss/train': 1.2810816764831543} 11/07/2021 14:13:08 - INFO - __main__ - Step 120493: {'lr': 4.7459168463910405e-05, 'samples': 23134656, 'steps': 120492, 'loss/train': 1.2736588716506958} 11/07/2021 14:13:10 - INFO - __main__ - Step 120494: {'lr': 4.745605767362235e-05, 'samples': 23134848, 'steps': 120493, 'loss/train': 1.604522466659546} 11/07/2021 14:13:10 - INFO - __main__ - Step 120495: {'lr': 4.745294697459746e-05, 'samples': 23135040, 'steps': 120494, 'loss/train': 1.2122774124145508} 11/07/2021 14:13:11 - INFO - __main__ - Step 120496: {'lr': 4.744983636683714e-05, 'samples': 23135232, 'steps': 120495, 'loss/train': 0.7604621648788452} 11/07/2021 14:13:11 - INFO - __main__ - Step 120497: {'lr': 4.744672585034288e-05, 'samples': 23135424, 'steps': 120496, 'loss/train': 1.0575021505355835} 11/07/2021 14:13:11 - INFO - __main__ - Step 120498: {'lr': 4.744361542511591e-05, 'samples': 23135616, 'steps': 120497, 'loss/train': 2.264735698699951} 11/07/2021 14:13:12 - INFO - __main__ - Step 120499: {'lr': 4.744050509115774e-05, 'samples': 23135808, 'steps': 120498, 'loss/train': 1.3680695295333862} 11/07/2021 14:13:13 - INFO - __main__ - Step 120500: {'lr': 4.7437394848469764e-05, 'samples': 23136000, 'steps': 120499, 'loss/train': 1.0943636894226074} 11/07/2021 14:13:13 - INFO - __main__ - Step 120501: {'lr': 4.7434284697053354e-05, 'samples': 23136192, 'steps': 120500, 'loss/train': 0.7522642612457275} 11/07/2021 14:13:13 - INFO - __main__ - Step 120502: {'lr': 4.7431174636909934e-05, 'samples': 23136384, 'steps': 120501, 'loss/train': 1.2587753534317017} 11/07/2021 14:13:14 - INFO - __main__ - Step 120503: {'lr': 4.7428064668040895e-05, 'samples': 23136576, 'steps': 120502, 'loss/train': 1.575400710105896} 11/07/2021 14:13:16 - INFO - __main__ - Step 120504: {'lr': 4.7424954790447645e-05, 'samples': 23136768, 'steps': 120503, 'loss/train': 1.6467441320419312} 11/07/2021 14:13:16 - INFO - __main__ - Step 120505: {'lr': 4.742184500413157e-05, 'samples': 23136960, 'steps': 120504, 'loss/train': 1.6172895431518555} 11/07/2021 14:13:16 - INFO - __main__ - Step 120506: {'lr': 4.74187353090941e-05, 'samples': 23137152, 'steps': 120505, 'loss/train': 1.1415716409683228} 11/07/2021 14:13:17 - INFO - __main__ - Step 120507: {'lr': 4.741562570533664e-05, 'samples': 23137344, 'steps': 120506, 'loss/train': 1.7566325664520264} 11/07/2021 14:13:17 - INFO - __main__ - Step 120508: {'lr': 4.741251619286055e-05, 'samples': 23137536, 'steps': 120507, 'loss/train': 1.739776372909546} 11/07/2021 14:13:17 - INFO - __main__ - Step 120509: {'lr': 4.740940677166727e-05, 'samples': 23137728, 'steps': 120508, 'loss/train': 1.7386715412139893} 11/07/2021 14:13:18 - INFO - __main__ - Step 120510: {'lr': 4.740629744175823e-05, 'samples': 23137920, 'steps': 120509, 'loss/train': 1.6053528785705566} 11/07/2021 14:13:18 - INFO - __main__ - Step 120511: {'lr': 4.7403188203134744e-05, 'samples': 23138112, 'steps': 120510, 'loss/train': 1.5762602090835571} 11/07/2021 14:13:19 - INFO - __main__ - Step 120512: {'lr': 4.740007905579824e-05, 'samples': 23138304, 'steps': 120511, 'loss/train': 1.3207138776779175} 11/07/2021 14:13:20 - INFO - __main__ - Step 120513: {'lr': 4.739696999975013e-05, 'samples': 23138496, 'steps': 120512, 'loss/train': 0.8338986039161682} 11/07/2021 14:13:20 - INFO - __main__ - Step 120514: {'lr': 4.7393861034991826e-05, 'samples': 23138688, 'steps': 120513, 'loss/train': 0.6869608759880066} 11/07/2021 14:13:20 - INFO - __main__ - Step 120515: {'lr': 4.739075216152475e-05, 'samples': 23138880, 'steps': 120514, 'loss/train': 1.5373432636260986} 11/07/2021 14:13:21 - INFO - __main__ - Step 120516: {'lr': 4.738764337935023e-05, 'samples': 23139072, 'steps': 120515, 'loss/train': 0.9723455309867859} 11/07/2021 14:13:22 - INFO - __main__ - Step 120517: {'lr': 4.7384534688469735e-05, 'samples': 23139264, 'steps': 120516, 'loss/train': 0.7958728075027466} 11/07/2021 14:13:22 - INFO - __main__ - Step 120518: {'lr': 4.738142608888463e-05, 'samples': 23139456, 'steps': 120517, 'loss/train': 1.5668936967849731} 11/07/2021 14:13:22 - INFO - __main__ - Step 120519: {'lr': 4.737831758059633e-05, 'samples': 23139648, 'steps': 120518, 'loss/train': 1.480164885520935} 11/07/2021 14:13:23 - INFO - __main__ - Step 120520: {'lr': 4.737520916360624e-05, 'samples': 23139840, 'steps': 120519, 'loss/train': 1.0513490438461304} 11/07/2021 14:13:23 - INFO - __main__ - Step 120521: {'lr': 4.737210083791577e-05, 'samples': 23140032, 'steps': 120520, 'loss/train': 1.3151495456695557} 11/07/2021 14:13:24 - INFO - __main__ - Step 120522: {'lr': 4.736899260352629e-05, 'samples': 23140224, 'steps': 120521, 'loss/train': 1.5043394565582275} 11/07/2021 14:13:24 - INFO - __main__ - Step 120523: {'lr': 4.7365884460439185e-05, 'samples': 23140416, 'steps': 120522, 'loss/train': 1.5876723527908325} 11/07/2021 14:13:25 - INFO - __main__ - Step 120524: {'lr': 4.736277640865599e-05, 'samples': 23140608, 'steps': 120523, 'loss/train': 1.1829674243927002} 11/07/2021 14:13:25 - INFO - __main__ - Step 120525: {'lr': 4.7359668448177905e-05, 'samples': 23140800, 'steps': 120524, 'loss/train': 1.1187511682510376} 11/07/2021 14:13:26 - INFO - __main__ - Step 120526: {'lr': 4.7356560579006444e-05, 'samples': 23140992, 'steps': 120525, 'loss/train': 1.0869017839431763} 11/07/2021 14:13:26 - INFO - __main__ - Step 120527: {'lr': 4.7353452801143e-05, 'samples': 23141184, 'steps': 120526, 'loss/train': 1.4287865161895752} 11/07/2021 14:13:27 - INFO - __main__ - Step 120528: {'lr': 4.7350345114588964e-05, 'samples': 23141376, 'steps': 120527, 'loss/train': 1.3050047159194946} 11/07/2021 14:13:27 - INFO - __main__ - Step 120529: {'lr': 4.734723751934572e-05, 'samples': 23141568, 'steps': 120528, 'loss/train': 1.1876569986343384} 11/07/2021 14:13:28 - INFO - __main__ - Step 120530: {'lr': 4.734413001541468e-05, 'samples': 23141760, 'steps': 120529, 'loss/train': 1.1968815326690674} 11/07/2021 14:13:28 - INFO - __main__ - Step 120531: {'lr': 4.7341022602797265e-05, 'samples': 23141952, 'steps': 120530, 'loss/train': 1.329290747642517} 11/07/2021 14:13:29 - INFO - __main__ - Step 120532: {'lr': 4.733791528149484e-05, 'samples': 23142144, 'steps': 120531, 'loss/train': 1.3686832189559937} 11/07/2021 14:13:29 - INFO - __main__ - Step 120533: {'lr': 4.7334808051508834e-05, 'samples': 23142336, 'steps': 120532, 'loss/train': 0.7921923398971558} 11/07/2021 14:13:30 - INFO - __main__ - Step 120534: {'lr': 4.7331700912840644e-05, 'samples': 23142528, 'steps': 120533, 'loss/train': 0.8335058689117432} 11/07/2021 14:13:30 - INFO - __main__ - Step 120535: {'lr': 4.732859386549165e-05, 'samples': 23142720, 'steps': 120534, 'loss/train': 1.1117668151855469} 11/07/2021 14:13:30 - INFO - __main__ - Step 120536: {'lr': 4.7325486909463254e-05, 'samples': 23142912, 'steps': 120535, 'loss/train': 1.0302987098693848} 11/07/2021 14:13:31 - INFO - __main__ - Step 120537: {'lr': 4.732238004475695e-05, 'samples': 23143104, 'steps': 120536, 'loss/train': 1.6714386940002441} 11/07/2021 14:13:32 - INFO - __main__ - Step 120538: {'lr': 4.731927327137397e-05, 'samples': 23143296, 'steps': 120537, 'loss/train': 1.4771668910980225} 11/07/2021 14:13:32 - INFO - __main__ - Step 120539: {'lr': 4.731616658931584e-05, 'samples': 23143488, 'steps': 120538, 'loss/train': 1.3261289596557617} 11/07/2021 14:13:33 - INFO - __main__ - Step 120540: {'lr': 4.731305999858387e-05, 'samples': 23143680, 'steps': 120539, 'loss/train': 1.003543734550476} 11/07/2021 14:13:33 - INFO - __main__ - Step 120541: {'lr': 4.730995349917952e-05, 'samples': 23143872, 'steps': 120540, 'loss/train': 1.1709665060043335} 11/07/2021 14:13:33 - INFO - __main__ - Step 120542: {'lr': 4.7306847091104195e-05, 'samples': 23144064, 'steps': 120541, 'loss/train': 1.4230903387069702} 11/07/2021 14:13:35 - INFO - __main__ - Step 120543: {'lr': 4.730374077435926e-05, 'samples': 23144256, 'steps': 120542, 'loss/train': 1.1894339323043823} 11/07/2021 14:13:35 - INFO - __main__ - Step 120544: {'lr': 4.730063454894615e-05, 'samples': 23144448, 'steps': 120543, 'loss/train': 1.2305285930633545} 11/07/2021 14:13:36 - INFO - __main__ - Step 120545: {'lr': 4.729752841486623e-05, 'samples': 23144640, 'steps': 120544, 'loss/train': 0.5463544726371765} 11/07/2021 14:13:36 - INFO - __main__ - Step 120546: {'lr': 4.729442237212092e-05, 'samples': 23144832, 'steps': 120545, 'loss/train': 0.5390135645866394} 11/07/2021 14:13:36 - INFO - __main__ - Step 120547: {'lr': 4.7291316420711605e-05, 'samples': 23145024, 'steps': 120546, 'loss/train': 0.4434431493282318} 11/07/2021 14:13:37 - INFO - __main__ - Step 120548: {'lr': 4.7288210560639696e-05, 'samples': 23145216, 'steps': 120547, 'loss/train': 1.6993731260299683} 11/07/2021 14:13:37 - INFO - __main__ - Step 120549: {'lr': 4.7285104791906617e-05, 'samples': 23145408, 'steps': 120548, 'loss/train': 1.301778793334961} 11/07/2021 14:13:39 - INFO - __main__ - Step 120550: {'lr': 4.728199911451372e-05, 'samples': 23145600, 'steps': 120549, 'loss/train': 1.3307448625564575} 11/07/2021 14:13:39 - INFO - __main__ - Step 120551: {'lr': 4.727889352846249e-05, 'samples': 23145792, 'steps': 120550, 'loss/train': 0.9687880277633667} 11/07/2021 14:13:39 - INFO - __main__ - Step 120552: {'lr': 4.727578803375421e-05, 'samples': 23145984, 'steps': 120551, 'loss/train': 1.570669174194336} 11/07/2021 14:13:40 - INFO - __main__ - Step 120553: {'lr': 4.7272682630390335e-05, 'samples': 23146176, 'steps': 120552, 'loss/train': 0.6158198118209839} 11/07/2021 14:13:40 - INFO - __main__ - Step 120554: {'lr': 4.726957731837222e-05, 'samples': 23146368, 'steps': 120553, 'loss/train': 1.2040354013442993} 11/07/2021 14:13:41 - INFO - __main__ - Step 120555: {'lr': 4.726647209770135e-05, 'samples': 23146560, 'steps': 120554, 'loss/train': 1.2689203023910522} 11/07/2021 14:13:41 - INFO - __main__ - Step 120556: {'lr': 4.7263366968379076e-05, 'samples': 23146752, 'steps': 120555, 'loss/train': 1.4216411113739014} 11/07/2021 14:13:42 - INFO - __main__ - Step 120557: {'lr': 4.726026193040678e-05, 'samples': 23146944, 'steps': 120556, 'loss/train': 1.1883881092071533} 11/07/2021 14:13:42 - INFO - __main__ - Step 120558: {'lr': 4.725715698378588e-05, 'samples': 23147136, 'steps': 120557, 'loss/train': 0.7753278613090515} 11/07/2021 14:13:42 - INFO - __main__ - Step 120559: {'lr': 4.72540521285178e-05, 'samples': 23147328, 'steps': 120558, 'loss/train': 1.705966830253601} 11/07/2021 14:13:44 - INFO - __main__ - Step 120560: {'lr': 4.725094736460389e-05, 'samples': 23147520, 'steps': 120559, 'loss/train': 1.0583220720291138} 11/07/2021 14:13:44 - INFO - __main__ - Step 120561: {'lr': 4.72478426920456e-05, 'samples': 23147712, 'steps': 120560, 'loss/train': 1.6606303453445435} 11/07/2021 14:13:44 - INFO - __main__ - Step 120562: {'lr': 4.7244738110844286e-05, 'samples': 23147904, 'steps': 120561, 'loss/train': 1.3701108694076538} 11/07/2021 14:13:45 - INFO - __main__ - Step 120563: {'lr': 4.724163362100137e-05, 'samples': 23148096, 'steps': 120562, 'loss/train': 1.3181148767471313} 11/07/2021 14:13:45 - INFO - __main__ - Step 120564: {'lr': 4.723852922251831e-05, 'samples': 23148288, 'steps': 120563, 'loss/train': 1.2544353008270264} 11/07/2021 14:13:45 - INFO - __main__ - Step 120565: {'lr': 4.723542491539637e-05, 'samples': 23148480, 'steps': 120564, 'loss/train': 1.760043740272522} 11/07/2021 14:13:46 - INFO - __main__ - Step 120566: {'lr': 4.723232069963704e-05, 'samples': 23148672, 'steps': 120565, 'loss/train': 1.607071876525879} 11/07/2021 14:13:47 - INFO - __main__ - Step 120567: {'lr': 4.722921657524168e-05, 'samples': 23148864, 'steps': 120566, 'loss/train': 1.4800764322280884} 11/07/2021 14:13:47 - INFO - __main__ - Step 120568: {'lr': 4.7226112542211707e-05, 'samples': 23149056, 'steps': 120567, 'loss/train': 0.5049821138381958} 11/07/2021 14:13:47 - INFO - __main__ - Step 120569: {'lr': 4.7223008600548515e-05, 'samples': 23149248, 'steps': 120568, 'loss/train': 0.9804527759552002} 11/07/2021 14:13:48 - INFO - __main__ - Step 120570: {'lr': 4.7219904750253506e-05, 'samples': 23149440, 'steps': 120569, 'loss/train': 1.410687804222107} 11/07/2021 14:13:49 - INFO - __main__ - Step 120571: {'lr': 4.721680099132808e-05, 'samples': 23149632, 'steps': 120570, 'loss/train': 1.6643285751342773} 11/07/2021 14:13:49 - INFO - __main__ - Step 120572: {'lr': 4.721369732377362e-05, 'samples': 23149824, 'steps': 120571, 'loss/train': 1.4027031660079956} 11/07/2021 14:13:50 - INFO - __main__ - Step 120573: {'lr': 4.721059374759157e-05, 'samples': 23150016, 'steps': 120572, 'loss/train': 1.257595181465149} 11/07/2021 14:13:50 - INFO - __main__ - Step 120574: {'lr': 4.720749026278329e-05, 'samples': 23150208, 'steps': 120573, 'loss/train': 0.7741877436637878} 11/07/2021 14:13:50 - INFO - __main__ - Step 120575: {'lr': 4.7204386869350165e-05, 'samples': 23150400, 'steps': 120574, 'loss/train': 1.2189699411392212} 11/07/2021 14:13:51 - INFO - __main__ - Step 120576: {'lr': 4.720128356729364e-05, 'samples': 23150592, 'steps': 120575, 'loss/train': 1.564940094947815} 11/07/2021 14:13:52 - INFO - __main__ - Step 120577: {'lr': 4.719818035661508e-05, 'samples': 23150784, 'steps': 120576, 'loss/train': 1.5689265727996826} 11/07/2021 14:13:52 - INFO - __main__ - Step 120578: {'lr': 4.719507723731595e-05, 'samples': 23150976, 'steps': 120577, 'loss/train': 1.7295726537704468} 11/07/2021 14:13:52 - INFO - __main__ - Step 120579: {'lr': 4.7191974209397495e-05, 'samples': 23151168, 'steps': 120578, 'loss/train': 1.7765716314315796} 11/07/2021 14:13:53 - INFO - __main__ - Step 120580: {'lr': 4.7188871272861254e-05, 'samples': 23151360, 'steps': 120579, 'loss/train': 1.3733649253845215} 11/07/2021 14:13:53 - INFO - __main__ - Step 120581: {'lr': 4.718576842770855e-05, 'samples': 23151552, 'steps': 120580, 'loss/train': 1.4344815015792847} 11/07/2021 14:13:54 - INFO - __main__ - Step 120582: {'lr': 4.718266567394083e-05, 'samples': 23151744, 'steps': 120581, 'loss/train': 1.339082956314087} 11/07/2021 14:13:54 - INFO - __main__ - Step 120583: {'lr': 4.7179563011559455e-05, 'samples': 23151936, 'steps': 120582, 'loss/train': 1.2889851331710815} 11/07/2021 14:13:55 - INFO - __main__ - Step 120584: {'lr': 4.717646044056584e-05, 'samples': 23152128, 'steps': 120583, 'loss/train': 1.422472596168518} 11/07/2021 14:13:55 - INFO - __main__ - Step 120585: {'lr': 4.717335796096139e-05, 'samples': 23152320, 'steps': 120584, 'loss/train': 1.606481909751892} 11/07/2021 14:13:55 - INFO - __main__ - Step 120586: {'lr': 4.7170255572747485e-05, 'samples': 23152512, 'steps': 120585, 'loss/train': 1.286672592163086} 11/07/2021 14:13:57 - INFO - __main__ - Step 120587: {'lr': 4.716715327592555e-05, 'samples': 23152704, 'steps': 120586, 'loss/train': 1.320998191833496} 11/07/2021 14:13:57 - INFO - __main__ - Step 120588: {'lr': 4.716405107049696e-05, 'samples': 23152896, 'steps': 120587, 'loss/train': 1.1343138217926025} 11/07/2021 14:13:57 - INFO - __main__ - Step 120589: {'lr': 4.7160948956463115e-05, 'samples': 23153088, 'steps': 120588, 'loss/train': 1.3817580938339233} 11/07/2021 14:13:58 - INFO - __main__ - Step 120590: {'lr': 4.715784693382541e-05, 'samples': 23153280, 'steps': 120589, 'loss/train': 1.7434197664260864} 11/07/2021 14:13:58 - INFO - __main__ - Step 120591: {'lr': 4.715474500258532e-05, 'samples': 23153472, 'steps': 120590, 'loss/train': 1.132184386253357} 11/07/2021 14:13:59 - INFO - __main__ - Step 120592: {'lr': 4.715164316274409e-05, 'samples': 23153664, 'steps': 120591, 'loss/train': 1.1143717765808105} 11/07/2021 14:13:59 - INFO - __main__ - Step 120593: {'lr': 4.714854141430322e-05, 'samples': 23153856, 'steps': 120592, 'loss/train': 1.3631304502487183} 11/07/2021 14:14:00 - INFO - __main__ - Step 120594: {'lr': 4.7145439757264095e-05, 'samples': 23154048, 'steps': 120593, 'loss/train': 1.2550519704818726} 11/07/2021 14:14:00 - INFO - __main__ - Step 120595: {'lr': 4.714233819162808e-05, 'samples': 23154240, 'steps': 120594, 'loss/train': 1.0490913391113281} 11/07/2021 14:14:00 - INFO - __main__ - Step 120596: {'lr': 4.713923671739662e-05, 'samples': 23154432, 'steps': 120595, 'loss/train': 1.7727233171463013} 11/07/2021 14:14:02 - INFO - __main__ - Step 120597: {'lr': 4.713613533457106e-05, 'samples': 23154624, 'steps': 120596, 'loss/train': 0.8169413208961487} 11/07/2021 14:14:02 - INFO - __main__ - Step 120598: {'lr': 4.713303404315286e-05, 'samples': 23154816, 'steps': 120597, 'loss/train': 1.12064790725708} 11/07/2021 14:14:02 - INFO - __main__ - Step 120599: {'lr': 4.712993284314338e-05, 'samples': 23155008, 'steps': 120598, 'loss/train': 1.3788095712661743} 11/07/2021 14:14:03 - INFO - __main__ - Step 120600: {'lr': 4.712683173454399e-05, 'samples': 23155200, 'steps': 120599, 'loss/train': 1.1058027744293213} 11/07/2021 14:14:03 - INFO - __main__ - Step 120601: {'lr': 4.712373071735615e-05, 'samples': 23155392, 'steps': 120600, 'loss/train': 1.0624428987503052} 11/07/2021 14:14:03 - INFO - __main__ - Step 120602: {'lr': 4.7120629791581214e-05, 'samples': 23155584, 'steps': 120601, 'loss/train': 1.0381649732589722} 11/07/2021 14:14:05 - INFO - __main__ - Step 120603: {'lr': 4.7117528957220604e-05, 'samples': 23155776, 'steps': 120602, 'loss/train': 0.09752225875854492} 11/07/2021 14:14:05 - INFO - __main__ - Step 120604: {'lr': 4.7114428214275694e-05, 'samples': 23155968, 'steps': 120603, 'loss/train': 1.7859869003295898} 11/07/2021 14:14:05 - INFO - __main__ - Step 120605: {'lr': 4.711132756274794e-05, 'samples': 23156160, 'steps': 120604, 'loss/train': 1.4311038255691528} 11/07/2021 14:14:06 - INFO - __main__ - Step 120606: {'lr': 4.710822700263867e-05, 'samples': 23156352, 'steps': 120605, 'loss/train': 1.4295785427093506} 11/07/2021 14:14:06 - INFO - __main__ - Step 120607: {'lr': 4.710512653394927e-05, 'samples': 23156544, 'steps': 120606, 'loss/train': 0.8739529848098755} 11/07/2021 14:14:07 - INFO - __main__ - Step 120608: {'lr': 4.710202615668116e-05, 'samples': 23156736, 'steps': 120607, 'loss/train': 2.0793843269348145} 11/07/2021 14:14:07 - INFO - __main__ - Step 120609: {'lr': 4.709892587083578e-05, 'samples': 23156928, 'steps': 120608, 'loss/train': 1.3378857374191284} 11/07/2021 14:14:08 - INFO - __main__ - Step 120610: {'lr': 4.709582567641446e-05, 'samples': 23157120, 'steps': 120609, 'loss/train': 1.2407253980636597} 11/07/2021 14:14:08 - INFO - __main__ - Step 120611: {'lr': 4.709272557341865e-05, 'samples': 23157312, 'steps': 120610, 'loss/train': 1.114006519317627} 11/07/2021 14:14:08 - INFO - __main__ - Step 120612: {'lr': 4.708962556184973e-05, 'samples': 23157504, 'steps': 120611, 'loss/train': 1.3261526823043823} 11/07/2021 14:14:09 - INFO - __main__ - Step 120613: {'lr': 4.70865256417091e-05, 'samples': 23157696, 'steps': 120612, 'loss/train': 1.2486317157745361} 11/07/2021 14:14:10 - INFO - __main__ - Step 120614: {'lr': 4.7083425812998126e-05, 'samples': 23157888, 'steps': 120613, 'loss/train': 1.391269326210022} 11/07/2021 14:14:10 - INFO - __main__ - Step 120615: {'lr': 4.708032607571824e-05, 'samples': 23158080, 'steps': 120614, 'loss/train': 1.4821038246154785} 11/07/2021 14:14:11 - INFO - __main__ - Step 120616: {'lr': 4.707722642987081e-05, 'samples': 23158272, 'steps': 120615, 'loss/train': 1.8555381298065186} 11/07/2021 14:14:11 - INFO - __main__ - Step 120617: {'lr': 4.707412687545734e-05, 'samples': 23158464, 'steps': 120616, 'loss/train': 1.7938557863235474} 11/07/2021 14:14:12 - INFO - __main__ - Step 120618: {'lr': 4.707102741247907e-05, 'samples': 23158656, 'steps': 120617, 'loss/train': 1.14085853099823} 11/07/2021 14:14:12 - INFO - __main__ - Step 120619: {'lr': 4.7067928040937455e-05, 'samples': 23158848, 'steps': 120618, 'loss/train': 1.2789502143859863} 11/07/2021 14:14:13 - INFO - __main__ - Step 120620: {'lr': 4.70648287608339e-05, 'samples': 23159040, 'steps': 120619, 'loss/train': 1.3316519260406494} 11/07/2021 14:14:13 - INFO - __main__ - Step 120621: {'lr': 4.706172957216981e-05, 'samples': 23159232, 'steps': 120620, 'loss/train': 0.9088757038116455} 11/07/2021 14:14:13 - INFO - __main__ - Step 120622: {'lr': 4.705863047494657e-05, 'samples': 23159424, 'steps': 120621, 'loss/train': 1.6730382442474365} 11/07/2021 14:14:15 - INFO - __main__ - Step 120623: {'lr': 4.705553146916558e-05, 'samples': 23159616, 'steps': 120622, 'loss/train': 0.8899472951889038} 11/07/2021 14:14:15 - INFO - __main__ - Step 120624: {'lr': 4.7052432554828215e-05, 'samples': 23159808, 'steps': 120623, 'loss/train': 1.5569140911102295} 11/07/2021 14:14:15 - INFO - __main__ - Step 120625: {'lr': 4.704933373193593e-05, 'samples': 23160000, 'steps': 120624, 'loss/train': 1.206419587135315} 11/07/2021 14:14:16 - INFO - __main__ - Step 120626: {'lr': 4.7046235000490045e-05, 'samples': 23160192, 'steps': 120625, 'loss/train': 1.161026954650879} 11/07/2021 14:14:16 - INFO - __main__ - Step 120627: {'lr': 4.704313636049204e-05, 'samples': 23160384, 'steps': 120626, 'loss/train': 1.0549061298370361} 11/07/2021 14:14:16 - INFO - __main__ - Step 120628: {'lr': 4.70400378119433e-05, 'samples': 23160576, 'steps': 120627, 'loss/train': 1.4593522548675537} 11/07/2021 14:14:17 - INFO - __main__ - Step 120629: {'lr': 4.7036939354845124e-05, 'samples': 23160768, 'steps': 120628, 'loss/train': 1.0543330907821655} 11/07/2021 14:14:18 - INFO - __main__ - Step 120630: {'lr': 4.703384098919897e-05, 'samples': 23160960, 'steps': 120629, 'loss/train': 0.5440089702606201} 11/07/2021 14:14:18 - INFO - __main__ - Step 120631: {'lr': 4.703074271500624e-05, 'samples': 23161152, 'steps': 120630, 'loss/train': 1.2130770683288574} 11/07/2021 14:14:18 - INFO - __main__ - Step 120632: {'lr': 4.702764453226832e-05, 'samples': 23161344, 'steps': 120631, 'loss/train': 1.490671157836914} 11/07/2021 14:14:19 - INFO - __main__ - Step 120633: {'lr': 4.702454644098661e-05, 'samples': 23161536, 'steps': 120632, 'loss/train': 1.3703787326812744} 11/07/2021 14:14:20 - INFO - __main__ - Step 120634: {'lr': 4.7021448441162516e-05, 'samples': 23161728, 'steps': 120633, 'loss/train': 1.445204257965088} 11/07/2021 14:14:20 - INFO - __main__ - Step 120635: {'lr': 4.701835053279743e-05, 'samples': 23161920, 'steps': 120634, 'loss/train': 1.3854694366455078} 11/07/2021 14:14:21 - INFO - __main__ - Step 120636: {'lr': 4.7015252715892744e-05, 'samples': 23162112, 'steps': 120635, 'loss/train': 1.0064802169799805} 11/07/2021 14:14:21 - INFO - __main__ - Step 120637: {'lr': 4.701215499044983e-05, 'samples': 23162304, 'steps': 120636, 'loss/train': 1.7105613946914673} 11/07/2021 14:14:22 - INFO - __main__ - Step 120638: {'lr': 4.700905735647012e-05, 'samples': 23162496, 'steps': 120637, 'loss/train': 0.8317742943763733} 11/07/2021 14:14:23 - INFO - __main__ - Step 120639: {'lr': 4.700595981395508e-05, 'samples': 23162688, 'steps': 120638, 'loss/train': 2.1341311931610107} 11/07/2021 14:14:23 - INFO - __main__ - Step 120640: {'lr': 4.700286236290593e-05, 'samples': 23162880, 'steps': 120639, 'loss/train': 1.4299185276031494} 11/07/2021 14:14:23 - INFO - __main__ - Step 120641: {'lr': 4.699976500332417e-05, 'samples': 23163072, 'steps': 120640, 'loss/train': 1.4207857847213745} 11/07/2021 14:14:24 - INFO - __main__ - Step 120642: {'lr': 4.699666773521119e-05, 'samples': 23163264, 'steps': 120641, 'loss/train': 1.162501335144043} 11/07/2021 14:14:24 - INFO - __main__ - Step 120643: {'lr': 4.699357055856837e-05, 'samples': 23163456, 'steps': 120642, 'loss/train': 0.30247098207473755} 11/07/2021 14:14:25 - INFO - __main__ - Step 120644: {'lr': 4.699047347339711e-05, 'samples': 23163648, 'steps': 120643, 'loss/train': 1.2934181690216064} 11/07/2021 14:14:25 - INFO - __main__ - Step 120645: {'lr': 4.6987376479698805e-05, 'samples': 23163840, 'steps': 120644, 'loss/train': 1.4711353778839111} 11/07/2021 14:14:26 - INFO - __main__ - Step 120646: {'lr': 4.698427957747486e-05, 'samples': 23164032, 'steps': 120645, 'loss/train': 0.9442518353462219} 11/07/2021 14:14:26 - INFO - __main__ - Step 120647: {'lr': 4.6981182766726696e-05, 'samples': 23164224, 'steps': 120646, 'loss/train': 1.6361019611358643} 11/07/2021 14:14:26 - INFO - __main__ - Step 120648: {'lr': 4.697808604745563e-05, 'samples': 23164416, 'steps': 120647, 'loss/train': 0.20719033479690552} 11/07/2021 14:14:27 - INFO - __main__ - Step 120649: {'lr': 4.697498941966313e-05, 'samples': 23164608, 'steps': 120648, 'loss/train': 1.4098109006881714} 11/07/2021 14:14:28 - INFO - __main__ - Step 120650: {'lr': 4.697189288335063e-05, 'samples': 23164800, 'steps': 120649, 'loss/train': 1.0972329378128052} 11/07/2021 14:14:28 - INFO - __main__ - Step 120651: {'lr': 4.69687964385194e-05, 'samples': 23164992, 'steps': 120650, 'loss/train': 2.125213623046875} 11/07/2021 14:14:29 - INFO - __main__ - Step 120652: {'lr': 4.69657000851709e-05, 'samples': 23165184, 'steps': 120651, 'loss/train': 1.1508830785751343} 11/07/2021 14:14:29 - INFO - __main__ - Step 120653: {'lr': 4.696260382330653e-05, 'samples': 23165376, 'steps': 120652, 'loss/train': 1.3907057046890259} 11/07/2021 14:14:29 - INFO - __main__ - Step 120654: {'lr': 4.6959507652927666e-05, 'samples': 23165568, 'steps': 120653, 'loss/train': 1.3750056028366089} 11/07/2021 14:14:30 - INFO - __main__ - Step 120655: {'lr': 4.695641157403571e-05, 'samples': 23165760, 'steps': 120654, 'loss/train': 1.269963026046753} 11/07/2021 14:14:31 - INFO - __main__ - Step 120656: {'lr': 4.695331558663207e-05, 'samples': 23165952, 'steps': 120655, 'loss/train': 1.368247151374817} 11/07/2021 14:14:31 - INFO - __main__ - Step 120657: {'lr': 4.695021969071811e-05, 'samples': 23166144, 'steps': 120656, 'loss/train': 1.0667041540145874} 11/07/2021 14:14:31 - INFO - __main__ - Step 120658: {'lr': 4.694712388629527e-05, 'samples': 23166336, 'steps': 120657, 'loss/train': 1.3163522481918335} 11/07/2021 14:14:32 - INFO - __main__ - Step 120659: {'lr': 4.694402817336493e-05, 'samples': 23166528, 'steps': 120658, 'loss/train': 1.3802293539047241} 11/07/2021 14:14:33 - INFO - __main__ - Step 120660: {'lr': 4.694093255192847e-05, 'samples': 23166720, 'steps': 120659, 'loss/train': 1.4252593517303467} 11/07/2021 14:14:33 - INFO - __main__ - Step 120661: {'lr': 4.693783702198734e-05, 'samples': 23166912, 'steps': 120660, 'loss/train': 1.0959004163742065} 11/07/2021 14:14:34 - INFO - __main__ - Step 120662: {'lr': 4.6934741583542826e-05, 'samples': 23167104, 'steps': 120661, 'loss/train': 1.1992796659469604} 11/07/2021 14:14:34 - INFO - __main__ - Step 120663: {'lr': 4.69316462365964e-05, 'samples': 23167296, 'steps': 120662, 'loss/train': 1.057533860206604} 11/07/2021 14:14:34 - INFO - __main__ - Step 120664: {'lr': 4.6928550981149454e-05, 'samples': 23167488, 'steps': 120663, 'loss/train': 1.5345823764801025} 11/07/2021 14:14:35 - INFO - __main__ - Step 120665: {'lr': 4.692545581720334e-05, 'samples': 23167680, 'steps': 120664, 'loss/train': 1.525348424911499} 11/07/2021 14:14:36 - INFO - __main__ - Step 120666: {'lr': 4.69223607447595e-05, 'samples': 23167872, 'steps': 120665, 'loss/train': 1.226497769355774} 11/07/2021 14:14:36 - INFO - __main__ - Step 120667: {'lr': 4.69192657638193e-05, 'samples': 23168064, 'steps': 120666, 'loss/train': 1.391931414604187} 11/07/2021 14:14:36 - INFO - __main__ - Step 120668: {'lr': 4.691617087438416e-05, 'samples': 23168256, 'steps': 120667, 'loss/train': 1.3189518451690674} 11/07/2021 14:14:37 - INFO - __main__ - Step 120669: {'lr': 4.691307607645543e-05, 'samples': 23168448, 'steps': 120668, 'loss/train': 1.5378971099853516} 11/07/2021 14:14:37 - INFO - __main__ - Step 120670: {'lr': 4.690998137003455e-05, 'samples': 23168640, 'steps': 120669, 'loss/train': 2.002545118331909} 11/07/2021 14:14:38 - INFO - __main__ - Step 120671: {'lr': 4.690688675512292e-05, 'samples': 23168832, 'steps': 120670, 'loss/train': 1.2624073028564453} 11/07/2021 14:14:38 - INFO - __main__ - Step 120672: {'lr': 4.690379223172195e-05, 'samples': 23169024, 'steps': 120671, 'loss/train': 1.7810099124908447} 11/07/2021 14:14:39 - INFO - __main__ - Step 120673: {'lr': 4.690069779983294e-05, 'samples': 23169216, 'steps': 120672, 'loss/train': 1.3597520589828491} 11/07/2021 14:14:39 - INFO - __main__ - Step 120674: {'lr': 4.689760345945735e-05, 'samples': 23169408, 'steps': 120673, 'loss/train': 0.7513729333877563} 11/07/2021 14:14:39 - INFO - __main__ - Step 120675: {'lr': 4.689450921059654e-05, 'samples': 23169600, 'steps': 120674, 'loss/train': 0.7191171050071716} 11/07/2021 14:14:41 - INFO - __main__ - Step 120676: {'lr': 4.689141505325195e-05, 'samples': 23169792, 'steps': 120675, 'loss/train': 1.845999836921692} 11/07/2021 14:14:41 - INFO - __main__ - Step 120677: {'lr': 4.6888320987424956e-05, 'samples': 23169984, 'steps': 120676, 'loss/train': 1.4109625816345215} 11/07/2021 14:14:42 - INFO - __main__ - Step 120678: {'lr': 4.688522701311695e-05, 'samples': 23170176, 'steps': 120677, 'loss/train': 1.6094173192977905} 11/07/2021 14:14:42 - INFO - __main__ - Step 120679: {'lr': 4.688213313032933e-05, 'samples': 23170368, 'steps': 120678, 'loss/train': 1.2508858442306519} 11/07/2021 14:14:42 - INFO - __main__ - Step 120680: {'lr': 4.6879039339063456e-05, 'samples': 23170560, 'steps': 120679, 'loss/train': 1.2350637912750244} 11/07/2021 14:14:43 - INFO - __main__ - Step 120681: {'lr': 4.6875945639320796e-05, 'samples': 23170752, 'steps': 120680, 'loss/train': 1.197112798690796} 11/07/2021 14:14:44 - INFO - __main__ - Step 120682: {'lr': 4.687285203110267e-05, 'samples': 23170944, 'steps': 120681, 'loss/train': 1.1895580291748047} 11/07/2021 14:14:44 - INFO - __main__ - Step 120683: {'lr': 4.6869758514410524e-05, 'samples': 23171136, 'steps': 120682, 'loss/train': 1.2436182498931885} 11/07/2021 14:14:44 - INFO - __main__ - Step 120684: {'lr': 4.6866665089245696e-05, 'samples': 23171328, 'steps': 120683, 'loss/train': 0.9945613145828247} 11/07/2021 14:14:45 - INFO - __main__ - Step 120685: {'lr': 4.686357175560971e-05, 'samples': 23171520, 'steps': 120684, 'loss/train': 1.6134361028671265} 11/07/2021 14:14:46 - INFO - __main__ - Step 120686: {'lr': 4.686047851350381e-05, 'samples': 23171712, 'steps': 120685, 'loss/train': 1.5140254497528076} 11/07/2021 14:14:46 - INFO - __main__ - Step 120687: {'lr': 4.685738536292941e-05, 'samples': 23171904, 'steps': 120686, 'loss/train': 1.4146764278411865} 11/07/2021 14:14:47 - INFO - __main__ - Step 120688: {'lr': 4.6854292303887965e-05, 'samples': 23172096, 'steps': 120687, 'loss/train': 1.3171359300613403} 11/07/2021 14:14:47 - INFO - __main__ - Step 120689: {'lr': 4.6851199336380826e-05, 'samples': 23172288, 'steps': 120688, 'loss/train': 1.1103909015655518} 11/07/2021 14:14:47 - INFO - __main__ - Step 120690: {'lr': 4.6848106460409406e-05, 'samples': 23172480, 'steps': 120689, 'loss/train': 1.521064043045044} 11/07/2021 14:14:48 - INFO - __main__ - Step 120691: {'lr': 4.6845013675975076e-05, 'samples': 23172672, 'steps': 120690, 'loss/train': 0.9738125205039978} 11/07/2021 14:14:49 - INFO - __main__ - Step 120692: {'lr': 4.6841920983079264e-05, 'samples': 23172864, 'steps': 120691, 'loss/train': 0.7723230719566345} 11/07/2021 14:14:49 - INFO - __main__ - Step 120693: {'lr': 4.6838828381723346e-05, 'samples': 23173056, 'steps': 120692, 'loss/train': 1.1886898279190063} 11/07/2021 14:14:49 - INFO - __main__ - Step 120694: {'lr': 4.683573587190873e-05, 'samples': 23173248, 'steps': 120693, 'loss/train': 1.4531925916671753} 11/07/2021 14:14:50 - INFO - __main__ - Step 120695: {'lr': 4.6832643453636776e-05, 'samples': 23173440, 'steps': 120694, 'loss/train': 1.2679210901260376} 11/07/2021 14:14:50 - INFO - __main__ - Step 120696: {'lr': 4.682955112690892e-05, 'samples': 23173632, 'steps': 120695, 'loss/train': 1.3057531118392944} 11/07/2021 14:14:51 - INFO - __main__ - Step 120697: {'lr': 4.6826458891726513e-05, 'samples': 23173824, 'steps': 120696, 'loss/train': 0.9452956914901733} 11/07/2021 14:14:51 - INFO - __main__ - Step 120698: {'lr': 4.6823366748090985e-05, 'samples': 23174016, 'steps': 120697, 'loss/train': 1.8852578401565552} 11/07/2021 14:14:52 - INFO - __main__ - Step 120699: {'lr': 4.682027469600378e-05, 'samples': 23174208, 'steps': 120698, 'loss/train': 1.3444808721542358} 11/07/2021 14:14:52 - INFO - __main__ - Step 120700: {'lr': 4.6817182735466145e-05, 'samples': 23174400, 'steps': 120699, 'loss/train': 1.1848045587539673} 11/07/2021 14:14:53 - INFO - __main__ - Step 120701: {'lr': 4.681409086647956e-05, 'samples': 23174592, 'steps': 120700, 'loss/train': 1.3830245733261108} 11/07/2021 14:14:54 - INFO - __main__ - Step 120702: {'lr': 4.68109990890454e-05, 'samples': 23174784, 'steps': 120701, 'loss/train': 1.2747077941894531} 11/07/2021 14:14:54 - INFO - __main__ - Step 120703: {'lr': 4.6807907403165066e-05, 'samples': 23174976, 'steps': 120702, 'loss/train': 1.0719830989837646} 11/07/2021 14:14:54 - INFO - __main__ - Step 120704: {'lr': 4.6804815808839965e-05, 'samples': 23175168, 'steps': 120703, 'loss/train': 0.8480728268623352} 11/07/2021 14:14:55 - INFO - __main__ - Step 120705: {'lr': 4.680172430607146e-05, 'samples': 23175360, 'steps': 120704, 'loss/train': 1.2861013412475586} 11/07/2021 14:14:55 - INFO - __main__ - Step 120706: {'lr': 4.6798632894861e-05, 'samples': 23175552, 'steps': 120705, 'loss/train': 0.8313263654708862} 11/07/2021 14:14:56 - INFO - __main__ - Step 120707: {'lr': 4.6795541575209905e-05, 'samples': 23175744, 'steps': 120706, 'loss/train': 1.34210205078125} 11/07/2021 14:14:56 - INFO - __main__ - Step 120708: {'lr': 4.6792450347119624e-05, 'samples': 23175936, 'steps': 120707, 'loss/train': 1.3643748760223389} 11/07/2021 14:14:57 - INFO - __main__ - Step 120709: {'lr': 4.678935921059152e-05, 'samples': 23176128, 'steps': 120708, 'loss/train': 1.2482832670211792} 11/07/2021 14:14:57 - INFO - __main__ - Step 120710: {'lr': 4.6786268165626975e-05, 'samples': 23176320, 'steps': 120709, 'loss/train': 1.1983782052993774} 11/07/2021 14:14:57 - INFO - __main__ - Step 120711: {'lr': 4.678317721222744e-05, 'samples': 23176512, 'steps': 120710, 'loss/train': 1.102173089981079} 11/07/2021 14:14:58 - INFO - __main__ - Step 120712: {'lr': 4.6780086350394326e-05, 'samples': 23176704, 'steps': 120711, 'loss/train': 0.5687884092330933} 11/07/2021 14:14:59 - INFO - __main__ - Step 120713: {'lr': 4.677699558012888e-05, 'samples': 23176896, 'steps': 120712, 'loss/train': 1.2231018543243408} 11/07/2021 14:14:59 - INFO - __main__ - Step 120714: {'lr': 4.67739049014326e-05, 'samples': 23177088, 'steps': 120713, 'loss/train': 1.3872803449630737} 11/07/2021 14:15:00 - INFO - __main__ - Step 120715: {'lr': 4.677081431430685e-05, 'samples': 23177280, 'steps': 120714, 'loss/train': 1.7734640836715698} 11/07/2021 14:15:00 - INFO - __main__ - Step 120716: {'lr': 4.676772381875308e-05, 'samples': 23177472, 'steps': 120715, 'loss/train': 1.1992279291152954} 11/07/2021 14:15:01 - INFO - __main__ - Step 120717: {'lr': 4.676463341477258e-05, 'samples': 23177664, 'steps': 120716, 'loss/train': 1.633104920387268} 11/07/2021 14:15:01 - INFO - __main__ - Step 120718: {'lr': 4.6761543102366826e-05, 'samples': 23177856, 'steps': 120717, 'loss/train': 1.365250587463379} 11/07/2021 14:15:02 - INFO - __main__ - Step 120719: {'lr': 4.6758452881537185e-05, 'samples': 23178048, 'steps': 120718, 'loss/train': 5.656520366668701} 11/07/2021 14:15:02 - INFO - __main__ - Step 120720: {'lr': 4.675536275228506e-05, 'samples': 23178240, 'steps': 120719, 'loss/train': 1.3970366716384888} 11/07/2021 14:15:02 - INFO - __main__ - Step 120721: {'lr': 4.675227271461183e-05, 'samples': 23178432, 'steps': 120720, 'loss/train': 1.0424318313598633} 11/07/2021 14:15:03 - INFO - __main__ - Step 120722: {'lr': 4.674918276851886e-05, 'samples': 23178624, 'steps': 120721, 'loss/train': 1.5961509943008423} 11/07/2021 14:15:04 - INFO - __main__ - Step 120723: {'lr': 4.674609291400758e-05, 'samples': 23178816, 'steps': 120722, 'loss/train': 1.0099095106124878} 11/07/2021 14:15:04 - INFO - __main__ - Step 120724: {'lr': 4.67430031510794e-05, 'samples': 23179008, 'steps': 120723, 'loss/train': 1.24820077419281} 11/07/2021 14:15:05 - INFO - __main__ - Step 120725: {'lr': 4.673991347973566e-05, 'samples': 23179200, 'steps': 120724, 'loss/train': 1.0257534980773926} 11/07/2021 14:15:05 - INFO - __main__ - Step 120726: {'lr': 4.673682389997785e-05, 'samples': 23179392, 'steps': 120725, 'loss/train': 1.383181095123291} 11/07/2021 14:15:05 - INFO - __main__ - Step 120727: {'lr': 4.6733734411807255e-05, 'samples': 23179584, 'steps': 120726, 'loss/train': 1.2403969764709473} 11/07/2021 14:15:06 - INFO - __main__ - Step 120728: {'lr': 4.673064501522528e-05, 'samples': 23179776, 'steps': 120727, 'loss/train': 1.791261076927185} 11/07/2021 14:15:07 - INFO - __main__ - Step 120729: {'lr': 4.6727555710233325e-05, 'samples': 23179968, 'steps': 120728, 'loss/train': 1.3607181310653687} 11/07/2021 14:15:07 - INFO - __main__ - Step 120730: {'lr': 4.6724466496832816e-05, 'samples': 23180160, 'steps': 120729, 'loss/train': 1.1887117624282837} 11/07/2021 14:15:07 - INFO - __main__ - Step 120731: {'lr': 4.6721377375025105e-05, 'samples': 23180352, 'steps': 120730, 'loss/train': 0.6288072466850281} 11/07/2021 14:15:08 - INFO - __main__ - Step 120732: {'lr': 4.671828834481162e-05, 'samples': 23180544, 'steps': 120731, 'loss/train': 1.460876226425171} 11/07/2021 14:15:08 - INFO - __main__ - Step 120733: {'lr': 4.671519940619376e-05, 'samples': 23180736, 'steps': 120732, 'loss/train': 1.6075146198272705} 11/07/2021 14:15:09 - INFO - __main__ - Step 120734: {'lr': 4.671211055917285e-05, 'samples': 23180928, 'steps': 120733, 'loss/train': 1.2189654111862183} 11/07/2021 14:15:10 - INFO - __main__ - Step 120735: {'lr': 4.6709021803750364e-05, 'samples': 23181120, 'steps': 120734, 'loss/train': 1.4281913042068481} 11/07/2021 14:15:10 - INFO - __main__ - Step 120736: {'lr': 4.6705933139927634e-05, 'samples': 23181312, 'steps': 120735, 'loss/train': 1.558449387550354} 11/07/2021 14:15:10 - INFO - __main__ - Step 120737: {'lr': 4.670284456770607e-05, 'samples': 23181504, 'steps': 120736, 'loss/train': 1.3063374757766724} 11/07/2021 14:15:11 - INFO - __main__ - Step 120738: {'lr': 4.669975608708707e-05, 'samples': 23181696, 'steps': 120737, 'loss/train': 0.9650479555130005} 11/07/2021 14:15:12 - INFO - __main__ - Step 120739: {'lr': 4.66966676980721e-05, 'samples': 23181888, 'steps': 120738, 'loss/train': 1.4289600849151611} 11/07/2021 14:15:12 - INFO - __main__ - Step 120740: {'lr': 4.6693579400662405e-05, 'samples': 23182080, 'steps': 120739, 'loss/train': 0.8529819250106812} 11/07/2021 14:15:12 - INFO - __main__ - Step 120741: {'lr': 4.669049119485943e-05, 'samples': 23182272, 'steps': 120740, 'loss/train': 1.5796483755111694} 11/07/2021 14:15:13 - INFO - __main__ - Step 120742: {'lr': 4.6687403080664606e-05, 'samples': 23182464, 'steps': 120741, 'loss/train': 1.4720476865768433} 11/07/2021 14:15:13 - INFO - __main__ - Step 120743: {'lr': 4.66843150580793e-05, 'samples': 23182656, 'steps': 120742, 'loss/train': 1.2737540006637573} 11/07/2021 14:15:14 - INFO - __main__ - Step 120744: {'lr': 4.668122712710487e-05, 'samples': 23182848, 'steps': 120743, 'loss/train': 1.418700933456421} 11/07/2021 14:15:15 - INFO - __main__ - Step 120745: {'lr': 4.667813928774278e-05, 'samples': 23183040, 'steps': 120744, 'loss/train': 0.8346951603889465} 11/07/2021 14:15:15 - INFO - __main__ - Step 120746: {'lr': 4.6675051539994375e-05, 'samples': 23183232, 'steps': 120745, 'loss/train': 1.1554418802261353} 11/07/2021 14:15:15 - INFO - __main__ - Step 120747: {'lr': 4.6671963883861054e-05, 'samples': 23183424, 'steps': 120746, 'loss/train': 1.7294037342071533} 11/07/2021 14:15:16 - INFO - __main__ - Step 120748: {'lr': 4.666887631934419e-05, 'samples': 23183616, 'steps': 120747, 'loss/train': 1.1115823984146118} 11/07/2021 14:15:16 - INFO - __main__ - Step 120749: {'lr': 4.6665788846445205e-05, 'samples': 23183808, 'steps': 120748, 'loss/train': 1.6343858242034912} 11/07/2021 14:15:17 - INFO - __main__ - Step 120750: {'lr': 4.666270146516549e-05, 'samples': 23184000, 'steps': 120749, 'loss/train': 0.6604203581809998} 11/07/2021 14:15:17 - INFO - __main__ - Step 120751: {'lr': 4.665961417550641e-05, 'samples': 23184192, 'steps': 120750, 'loss/train': 1.099429726600647} 11/07/2021 14:15:18 - INFO - __main__ - Step 120752: {'lr': 4.665652697746944e-05, 'samples': 23184384, 'steps': 120751, 'loss/train': 1.3396673202514648} 11/07/2021 14:15:18 - INFO - __main__ - Step 120753: {'lr': 4.665343987105583e-05, 'samples': 23184576, 'steps': 120752, 'loss/train': 1.7764936685562134} 11/07/2021 14:15:18 - INFO - __main__ - Step 120754: {'lr': 4.6650352856267035e-05, 'samples': 23184768, 'steps': 120753, 'loss/train': 1.1365715265274048} 11/07/2021 14:15:19 - INFO - __main__ - Step 120755: {'lr': 4.664726593310448e-05, 'samples': 23184960, 'steps': 120754, 'loss/train': 0.9229661822319031} 11/07/2021 14:15:20 - INFO - __main__ - Step 120756: {'lr': 4.664417910156951e-05, 'samples': 23185152, 'steps': 120755, 'loss/train': 1.5797125101089478} 11/07/2021 14:15:20 - INFO - __main__ - Step 120757: {'lr': 4.664109236166353e-05, 'samples': 23185344, 'steps': 120756, 'loss/train': 1.2820336818695068} 11/07/2021 14:15:20 - INFO - __main__ - Step 120758: {'lr': 4.663800571338794e-05, 'samples': 23185536, 'steps': 120757, 'loss/train': 1.300958275794983} 11/07/2021 14:15:21 - INFO - __main__ - Step 120759: {'lr': 4.6634919156744147e-05, 'samples': 23185728, 'steps': 120758, 'loss/train': 1.4580695629119873} 11/07/2021 14:15:22 - INFO - __main__ - Step 120760: {'lr': 4.663183269173352e-05, 'samples': 23185920, 'steps': 120759, 'loss/train': 1.005216121673584} 11/07/2021 14:15:22 - INFO - __main__ - Step 120761: {'lr': 4.6628746318357423e-05, 'samples': 23186112, 'steps': 120760, 'loss/train': 1.4479228258132935} 11/07/2021 14:15:23 - INFO - __main__ - Step 120762: {'lr': 4.662566003661728e-05, 'samples': 23186304, 'steps': 120761, 'loss/train': 1.39852774143219} 11/07/2021 14:15:23 - INFO - __main__ - Step 120763: {'lr': 4.66225738465145e-05, 'samples': 23186496, 'steps': 120762, 'loss/train': 1.1500508785247803} 11/07/2021 14:15:23 - INFO - __main__ - Step 120764: {'lr': 4.661948774805041e-05, 'samples': 23186688, 'steps': 120763, 'loss/train': 1.3150869607925415} 11/07/2021 14:15:24 - INFO - __main__ - Step 120765: {'lr': 4.661640174122647e-05, 'samples': 23186880, 'steps': 120764, 'loss/train': 1.4828130006790161} 11/07/2021 14:15:25 - INFO - __main__ - Step 120766: {'lr': 4.661331582604411e-05, 'samples': 23187072, 'steps': 120765, 'loss/train': 1.3329323530197144} 11/07/2021 14:15:25 - INFO - __main__ - Step 120767: {'lr': 4.661023000250458e-05, 'samples': 23187264, 'steps': 120766, 'loss/train': 1.2661863565444946} 11/07/2021 14:15:25 - INFO - __main__ - Step 120768: {'lr': 4.660714427060933e-05, 'samples': 23187456, 'steps': 120767, 'loss/train': 1.348453164100647} 11/07/2021 14:15:26 - INFO - __main__ - Step 120769: {'lr': 4.6604058630359766e-05, 'samples': 23187648, 'steps': 120768, 'loss/train': 0.7287114858627319} 11/07/2021 14:15:27 - INFO - __main__ - Step 120770: {'lr': 4.660097308175728e-05, 'samples': 23187840, 'steps': 120769, 'loss/train': 1.1368368864059448} 11/07/2021 14:15:27 - INFO - __main__ - Step 120771: {'lr': 4.659788762480327e-05, 'samples': 23188032, 'steps': 120770, 'loss/train': 1.1650867462158203} 11/07/2021 14:15:28 - INFO - __main__ - Step 120772: {'lr': 4.659480225949911e-05, 'samples': 23188224, 'steps': 120771, 'loss/train': 0.9073972105979919} 11/07/2021 14:15:28 - INFO - __main__ - Step 120773: {'lr': 4.6591716985846164e-05, 'samples': 23188416, 'steps': 120772, 'loss/train': 1.2125500440597534} 11/07/2021 14:15:28 - INFO - __main__ - Step 120774: {'lr': 4.658863180384587e-05, 'samples': 23188608, 'steps': 120773, 'loss/train': 1.335434913635254} 11/07/2021 14:15:29 - INFO - __main__ - Step 120775: {'lr': 4.65855467134996e-05, 'samples': 23188800, 'steps': 120774, 'loss/train': 0.4099116325378418} 11/07/2021 14:15:30 - INFO - __main__ - Step 120776: {'lr': 4.658246171480876e-05, 'samples': 23188992, 'steps': 120775, 'loss/train': 1.5762107372283936} 11/07/2021 14:15:30 - INFO - __main__ - Step 120777: {'lr': 4.657937680777469e-05, 'samples': 23189184, 'steps': 120776, 'loss/train': 1.0958218574523926} 11/07/2021 14:15:30 - INFO - __main__ - Step 120778: {'lr': 4.657629199239885e-05, 'samples': 23189376, 'steps': 120777, 'loss/train': 1.3011620044708252} 11/07/2021 14:15:31 - INFO - __main__ - Step 120779: {'lr': 4.657320726868264e-05, 'samples': 23189568, 'steps': 120778, 'loss/train': 0.9635570049285889} 11/07/2021 14:15:32 - INFO - __main__ - Step 120780: {'lr': 4.6570122636627324e-05, 'samples': 23189760, 'steps': 120779, 'loss/train': 0.9168505668640137} 11/07/2021 14:15:32 - INFO - __main__ - Step 120781: {'lr': 4.656703809623439e-05, 'samples': 23189952, 'steps': 120780, 'loss/train': 1.4636136293411255} 11/07/2021 14:15:33 - INFO - __main__ - Step 120782: {'lr': 4.656395364750521e-05, 'samples': 23190144, 'steps': 120781, 'loss/train': 1.4033374786376953} 11/07/2021 14:15:33 - INFO - __main__ - Step 120783: {'lr': 4.656086929044118e-05, 'samples': 23190336, 'steps': 120782, 'loss/train': 1.52048659324646} 11/07/2021 14:15:33 - INFO - __main__ - Step 120784: {'lr': 4.6557785025043656e-05, 'samples': 23190528, 'steps': 120783, 'loss/train': 1.2484830617904663} 11/07/2021 14:15:34 - INFO - __main__ - Step 120785: {'lr': 4.655470085131408e-05, 'samples': 23190720, 'steps': 120784, 'loss/train': 1.3342543840408325} 11/07/2021 14:15:35 - INFO - __main__ - Step 120786: {'lr': 4.6551616769253815e-05, 'samples': 23190912, 'steps': 120785, 'loss/train': 1.1599438190460205} 11/07/2021 14:15:35 - INFO - __main__ - Step 120787: {'lr': 4.654853277886423e-05, 'samples': 23191104, 'steps': 120786, 'loss/train': 1.5903739929199219} 11/07/2021 14:15:35 - INFO - __main__ - Step 120788: {'lr': 4.654544888014675e-05, 'samples': 23191296, 'steps': 120787, 'loss/train': 1.2096627950668335} 11/07/2021 14:15:36 - INFO - __main__ - Step 120789: {'lr': 4.6542365073102746e-05, 'samples': 23191488, 'steps': 120788, 'loss/train': 1.2057037353515625} 11/07/2021 14:15:37 - INFO - __main__ - Step 120790: {'lr': 4.6539281357733637e-05, 'samples': 23191680, 'steps': 120789, 'loss/train': 1.2196327447891235} 11/07/2021 14:15:37 - INFO - __main__ - Step 120791: {'lr': 4.653619773404077e-05, 'samples': 23191872, 'steps': 120790, 'loss/train': 1.1613421440124512} 11/07/2021 14:15:37 - INFO - __main__ - Step 120792: {'lr': 4.653311420202555e-05, 'samples': 23192064, 'steps': 120791, 'loss/train': 1.5045913457870483} 11/07/2021 14:15:38 - INFO - __main__ - Step 120793: {'lr': 4.653003076168944e-05, 'samples': 23192256, 'steps': 120792, 'loss/train': 1.2460649013519287} 11/07/2021 14:15:38 - INFO - __main__ - Step 120794: {'lr': 4.652694741303371e-05, 'samples': 23192448, 'steps': 120793, 'loss/train': 1.271101713180542} 11/07/2021 14:15:39 - INFO - __main__ - Step 120795: {'lr': 4.652386415605975e-05, 'samples': 23192640, 'steps': 120794, 'loss/train': 1.2045433521270752} 11/07/2021 14:15:40 - INFO - __main__ - Step 120796: {'lr': 4.652078099076903e-05, 'samples': 23192832, 'steps': 120795, 'loss/train': 1.2139638662338257} 11/07/2021 14:15:40 - INFO - __main__ - Step 120797: {'lr': 4.65176979171629e-05, 'samples': 23193024, 'steps': 120796, 'loss/train': 1.6169501543045044} 11/07/2021 14:15:40 - INFO - __main__ - Step 120798: {'lr': 4.651461493524276e-05, 'samples': 23193216, 'steps': 120797, 'loss/train': 1.1923532485961914} 11/07/2021 14:15:41 - INFO - __main__ - Step 120799: {'lr': 4.6511532045009994e-05, 'samples': 23193408, 'steps': 120798, 'loss/train': 1.198868751525879} 11/07/2021 14:15:42 - INFO - __main__ - Step 120800: {'lr': 4.650844924646599e-05, 'samples': 23193600, 'steps': 120799, 'loss/train': 0.8892217874526978} 11/07/2021 14:15:42 - INFO - __main__ - Step 120801: {'lr': 4.6505366539612155e-05, 'samples': 23193792, 'steps': 120800, 'loss/train': 1.5375847816467285} 11/07/2021 14:15:42 - INFO - __main__ - Step 120802: {'lr': 4.650228392444983e-05, 'samples': 23193984, 'steps': 120801, 'loss/train': 1.2636758089065552} 11/07/2021 14:15:43 - INFO - __main__ - Step 120803: {'lr': 4.6499201400980464e-05, 'samples': 23194176, 'steps': 120802, 'loss/train': 1.0443272590637207} 11/07/2021 14:15:43 - INFO - __main__ - Step 120804: {'lr': 4.649611896920539e-05, 'samples': 23194368, 'steps': 120803, 'loss/train': 1.3323472738265991} 11/07/2021 14:15:43 - INFO - __main__ - Step 120805: {'lr': 4.6493036629126046e-05, 'samples': 23194560, 'steps': 120804, 'loss/train': 1.24997878074646} 11/07/2021 14:15:44 - INFO - __main__ - Step 120806: {'lr': 4.6489954380743856e-05, 'samples': 23194752, 'steps': 120805, 'loss/train': 1.289844274520874} 11/07/2021 14:15:45 - INFO - __main__ - Step 120807: {'lr': 4.648687222406009e-05, 'samples': 23194944, 'steps': 120806, 'loss/train': 1.6684293746948242} 11/07/2021 14:15:45 - INFO - __main__ - Step 120808: {'lr': 4.648379015907619e-05, 'samples': 23195136, 'steps': 120807, 'loss/train': 1.0805413722991943} 11/07/2021 14:15:45 - INFO - __main__ - Step 120809: {'lr': 4.648070818579356e-05, 'samples': 23195328, 'steps': 120808, 'loss/train': 1.21198570728302} 11/07/2021 14:15:46 - INFO - __main__ - Step 120810: {'lr': 4.647762630421359e-05, 'samples': 23195520, 'steps': 120809, 'loss/train': 1.134756088256836} 11/07/2021 14:15:47 - INFO - __main__ - Step 120811: {'lr': 4.647454451433766e-05, 'samples': 23195712, 'steps': 120810, 'loss/train': 1.480513572692871} 11/07/2021 14:15:47 - INFO - __main__ - Step 120812: {'lr': 4.6471462816167155e-05, 'samples': 23195904, 'steps': 120811, 'loss/train': 1.5450377464294434} 11/07/2021 14:15:48 - INFO - __main__ - Step 120813: {'lr': 4.6468381209703455e-05, 'samples': 23196096, 'steps': 120812, 'loss/train': 1.0960853099822998} 11/07/2021 14:15:48 - INFO - __main__ - Step 120814: {'lr': 4.646529969494798e-05, 'samples': 23196288, 'steps': 120813, 'loss/train': 1.1720092296600342} 11/07/2021 14:15:48 - INFO - __main__ - Step 120815: {'lr': 4.646221827190208e-05, 'samples': 23196480, 'steps': 120814, 'loss/train': 1.465715765953064} 11/07/2021 14:15:49 - INFO - __main__ - Step 120816: {'lr': 4.645913694056719e-05, 'samples': 23196672, 'steps': 120815, 'loss/train': 1.3581876754760742} 11/07/2021 14:15:50 - INFO - __main__ - Step 120817: {'lr': 4.645605570094466e-05, 'samples': 23196864, 'steps': 120816, 'loss/train': 1.5980565547943115} 11/07/2021 14:15:50 - INFO - __main__ - Step 120818: {'lr': 4.64529745530359e-05, 'samples': 23197056, 'steps': 120817, 'loss/train': 1.247403621673584} 11/07/2021 14:15:50 - INFO - __main__ - Step 120819: {'lr': 4.6449893496842284e-05, 'samples': 23197248, 'steps': 120818, 'loss/train': 1.4291387796401978} 11/07/2021 14:15:51 - INFO - __main__ - Step 120820: {'lr': 4.6446812532365266e-05, 'samples': 23197440, 'steps': 120819, 'loss/train': 1.1608978509902954} 11/07/2021 14:15:52 - INFO - __main__ - Step 120821: {'lr': 4.6443731659606107e-05, 'samples': 23197632, 'steps': 120820, 'loss/train': 1.2662822008132935} 11/07/2021 14:15:52 - INFO - __main__ - Step 120822: {'lr': 4.644065087856625e-05, 'samples': 23197824, 'steps': 120821, 'loss/train': 1.0573009252548218} 11/07/2021 14:15:53 - INFO - __main__ - Step 120823: {'lr': 4.6437570189247105e-05, 'samples': 23198016, 'steps': 120822, 'loss/train': 1.2570374011993408} 11/07/2021 14:15:53 - INFO - __main__ - Step 120824: {'lr': 4.643448959165006e-05, 'samples': 23198208, 'steps': 120823, 'loss/train': 0.7814019918441772} 11/07/2021 14:15:53 - INFO - __main__ - Step 120825: {'lr': 4.6431409085776474e-05, 'samples': 23198400, 'steps': 120824, 'loss/train': 1.1217213869094849} 11/07/2021 14:15:54 - INFO - __main__ - Step 120826: {'lr': 4.642832867162777e-05, 'samples': 23198592, 'steps': 120825, 'loss/train': 1.3886334896087646} 11/07/2021 14:15:55 - INFO - __main__ - Step 120827: {'lr': 4.6425248349205306e-05, 'samples': 23198784, 'steps': 120826, 'loss/train': 1.1923843622207642} 11/07/2021 14:15:55 - INFO - __main__ - Step 120828: {'lr': 4.64221681185105e-05, 'samples': 23198976, 'steps': 120827, 'loss/train': 0.6839224696159363} 11/07/2021 14:15:55 - INFO - __main__ - Step 120829: {'lr': 4.641908797954472e-05, 'samples': 23199168, 'steps': 120828, 'loss/train': 1.2724111080169678} 11/07/2021 14:15:56 - INFO - __main__ - Step 120830: {'lr': 4.641600793230935e-05, 'samples': 23199360, 'steps': 120829, 'loss/train': 2.127755641937256} 11/07/2021 14:15:56 - INFO - __main__ - Step 120831: {'lr': 4.64129279768058e-05, 'samples': 23199552, 'steps': 120830, 'loss/train': 1.3815398216247559} 11/07/2021 14:15:57 - INFO - __main__ - Step 120832: {'lr': 4.640984811303542e-05, 'samples': 23199744, 'steps': 120831, 'loss/train': 0.8536512851715088} 11/07/2021 14:15:57 - INFO - __main__ - Step 120833: {'lr': 4.6406768340999686e-05, 'samples': 23199936, 'steps': 120832, 'loss/train': 1.346085786819458} 11/07/2021 14:15:58 - INFO - __main__ - Step 120834: {'lr': 4.6403688660699886e-05, 'samples': 23200128, 'steps': 120833, 'loss/train': 1.1133157014846802} 11/07/2021 14:15:58 - INFO - __main__ - Step 120835: {'lr': 4.6400609072137414e-05, 'samples': 23200320, 'steps': 120834, 'loss/train': 1.270789384841919} 11/07/2021 14:15:58 - INFO - __main__ - Step 120836: {'lr': 4.639752957531368e-05, 'samples': 23200512, 'steps': 120835, 'loss/train': 1.304349422454834} 11/07/2021 14:15:59 - INFO - __main__ - Step 120837: {'lr': 4.639445017023011e-05, 'samples': 23200704, 'steps': 120836, 'loss/train': 0.9279773831367493} 11/07/2021 14:16:00 - INFO - __main__ - Step 120838: {'lr': 4.639137085688802e-05, 'samples': 23200896, 'steps': 120837, 'loss/train': 1.4488725662231445} 11/07/2021 14:16:00 - INFO - __main__ - Step 120839: {'lr': 4.638829163528888e-05, 'samples': 23201088, 'steps': 120838, 'loss/train': 1.6427397727966309} 11/07/2021 14:16:01 - INFO - __main__ - Step 120840: {'lr': 4.638521250543401e-05, 'samples': 23201280, 'steps': 120839, 'loss/train': 1.5564568042755127} 11/07/2021 14:16:01 - INFO - __main__ - Step 120841: {'lr': 4.638213346732481e-05, 'samples': 23201472, 'steps': 120840, 'loss/train': 1.4552243947982788} 11/07/2021 14:16:02 - INFO - __main__ - Step 120842: {'lr': 4.637905452096269e-05, 'samples': 23201664, 'steps': 120841, 'loss/train': 1.1960842609405518} 11/07/2021 14:16:02 - INFO - __main__ - Step 120843: {'lr': 4.6375975666349045e-05, 'samples': 23201856, 'steps': 120842, 'loss/train': 1.2524605989456177} 11/07/2021 14:16:03 - INFO - __main__ - Step 120844: {'lr': 4.63728969034852e-05, 'samples': 23202048, 'steps': 120843, 'loss/train': 1.5040161609649658} 11/07/2021 14:16:03 - INFO - __main__ - Step 120845: {'lr': 4.636981823237263e-05, 'samples': 23202240, 'steps': 120844, 'loss/train': 1.109509825706482} 11/07/2021 14:16:03 - INFO - __main__ - Step 120846: {'lr': 4.636673965301266e-05, 'samples': 23202432, 'steps': 120845, 'loss/train': 1.1762597560882568} 11/07/2021 14:16:04 - INFO - __main__ - Step 120847: {'lr': 4.636366116540674e-05, 'samples': 23202624, 'steps': 120846, 'loss/train': 1.1117894649505615} 11/07/2021 14:16:05 - INFO - __main__ - Step 120848: {'lr': 4.636058276955618e-05, 'samples': 23202816, 'steps': 120847, 'loss/train': 1.2489670515060425} 11/07/2021 14:16:05 - INFO - __main__ - Step 120849: {'lr': 4.635750446546239e-05, 'samples': 23203008, 'steps': 120848, 'loss/train': 1.2891596555709839} 11/07/2021 14:16:05 - INFO - __main__ - Step 120850: {'lr': 4.6354426253126746e-05, 'samples': 23203200, 'steps': 120849, 'loss/train': 1.412272334098816} 11/07/2021 14:16:06 - INFO - __main__ - Step 120851: {'lr': 4.635134813255068e-05, 'samples': 23203392, 'steps': 120850, 'loss/train': 1.275016188621521} 11/07/2021 14:16:06 - INFO - __main__ - Step 120852: {'lr': 4.6348270103735545e-05, 'samples': 23203584, 'steps': 120851, 'loss/train': 1.0790098905563354} 11/07/2021 14:16:07 - INFO - __main__ - Step 120853: {'lr': 4.634519216668273e-05, 'samples': 23203776, 'steps': 120852, 'loss/train': 1.5533047914505005} 11/07/2021 14:16:08 - INFO - __main__ - Step 120854: {'lr': 4.6342114321393624e-05, 'samples': 23203968, 'steps': 120853, 'loss/train': 1.359907865524292} 11/07/2021 14:16:08 - INFO - __main__ - Step 120855: {'lr': 4.633903656786964e-05, 'samples': 23204160, 'steps': 120854, 'loss/train': 1.5147303342819214} 11/07/2021 14:16:08 - INFO - __main__ - Step 120856: {'lr': 4.6335958906112143e-05, 'samples': 23204352, 'steps': 120855, 'loss/train': 1.3290281295776367} 11/07/2021 14:16:09 - INFO - __main__ - Step 120857: {'lr': 4.6332881336122514e-05, 'samples': 23204544, 'steps': 120856, 'loss/train': 0.7088494300842285} 11/07/2021 14:16:10 - INFO - __main__ - Step 120858: {'lr': 4.632980385790214e-05, 'samples': 23204736, 'steps': 120857, 'loss/train': 0.8125275373458862} 11/07/2021 14:16:10 - INFO - __main__ - Step 120859: {'lr': 4.632672647145239e-05, 'samples': 23204928, 'steps': 120858, 'loss/train': 0.9807575941085815} 11/07/2021 14:16:10 - INFO - __main__ - Step 120860: {'lr': 4.6323649176774786e-05, 'samples': 23205120, 'steps': 120859, 'loss/train': 1.4409178495407104} 11/07/2021 14:16:11 - INFO - __main__ - Step 120861: {'lr': 4.632057197387052e-05, 'samples': 23205312, 'steps': 120860, 'loss/train': 1.0825555324554443} 11/07/2021 14:16:11 - INFO - __main__ - Step 120862: {'lr': 4.631749486274103e-05, 'samples': 23205504, 'steps': 120861, 'loss/train': 0.5909513235092163} 11/07/2021 14:16:12 - INFO - __main__ - Step 120863: {'lr': 4.6314417843387776e-05, 'samples': 23205696, 'steps': 120862, 'loss/train': 1.0528875589370728} 11/07/2021 14:16:13 - INFO - __main__ - Step 120864: {'lr': 4.63113409158121e-05, 'samples': 23205888, 'steps': 120863, 'loss/train': 0.8258877992630005} 11/07/2021 14:16:13 - INFO - __main__ - Step 120865: {'lr': 4.6308264080015374e-05, 'samples': 23206080, 'steps': 120864, 'loss/train': 1.4741489887237549} 11/07/2021 14:16:13 - INFO - __main__ - Step 120866: {'lr': 4.6305187335999006e-05, 'samples': 23206272, 'steps': 120865, 'loss/train': 1.8002430200576782} 11/07/2021 14:16:14 - INFO - __main__ - Step 120867: {'lr': 4.630211068376436e-05, 'samples': 23206464, 'steps': 120866, 'loss/train': 0.3377382457256317} 11/07/2021 14:16:15 - INFO - __main__ - Step 120868: {'lr': 4.629903412331288e-05, 'samples': 23206656, 'steps': 120867, 'loss/train': 0.41799232363700867} 11/07/2021 14:16:15 - INFO - __main__ - Step 120869: {'lr': 4.629595765464587e-05, 'samples': 23206848, 'steps': 120868, 'loss/train': 1.0902514457702637} 11/07/2021 14:16:15 - INFO - __main__ - Step 120870: {'lr': 4.629288127776479e-05, 'samples': 23207040, 'steps': 120869, 'loss/train': 1.2467561960220337} 11/07/2021 14:16:16 - INFO - __main__ - Step 120871: {'lr': 4.628980499267096e-05, 'samples': 23207232, 'steps': 120870, 'loss/train': 1.2627686262130737} 11/07/2021 14:16:16 - INFO - __main__ - Step 120872: {'lr': 4.6286728799365824e-05, 'samples': 23207424, 'steps': 120871, 'loss/train': 0.8692756295204163} 11/07/2021 14:16:17 - INFO - __main__ - Step 120873: {'lr': 4.6283652697850815e-05, 'samples': 23207616, 'steps': 120872, 'loss/train': 1.5113141536712646} 11/07/2021 14:16:18 - INFO - __main__ - Step 120874: {'lr': 4.628057668812719e-05, 'samples': 23207808, 'steps': 120873, 'loss/train': 1.346913456916809} 11/07/2021 14:16:18 - INFO - __main__ - Step 120875: {'lr': 4.627750077019635e-05, 'samples': 23208000, 'steps': 120874, 'loss/train': 1.0729148387908936} 11/07/2021 14:16:18 - INFO - __main__ - Step 120876: {'lr': 4.6274424944059756e-05, 'samples': 23208192, 'steps': 120875, 'loss/train': 0.19651471078395844} 11/07/2021 14:16:19 - INFO - __main__ - Step 120877: {'lr': 4.6271349209718764e-05, 'samples': 23208384, 'steps': 120876, 'loss/train': 2.242318868637085} 11/07/2021 14:16:20 - INFO - __main__ - Step 120878: {'lr': 4.626827356717475e-05, 'samples': 23208576, 'steps': 120877, 'loss/train': 1.7094390392303467} 11/07/2021 14:16:20 - INFO - __main__ - Step 120879: {'lr': 4.626519801642912e-05, 'samples': 23208768, 'steps': 120878, 'loss/train': 1.5977424383163452} 11/07/2021 14:16:20 - INFO - __main__ - Step 120880: {'lr': 4.6262122557483244e-05, 'samples': 23208960, 'steps': 120879, 'loss/train': 1.4551489353179932} 11/07/2021 14:16:21 - INFO - __main__ - Step 120881: {'lr': 4.6259047190338495e-05, 'samples': 23209152, 'steps': 120880, 'loss/train': 1.183066725730896} 11/07/2021 14:16:21 - INFO - __main__ - Step 120882: {'lr': 4.625597191499631e-05, 'samples': 23209344, 'steps': 120881, 'loss/train': 1.3432750701904297} 11/07/2021 14:16:22 - INFO - __main__ - Step 120883: {'lr': 4.625289673145802e-05, 'samples': 23209536, 'steps': 120882, 'loss/train': 1.7099733352661133} 11/07/2021 14:16:23 - INFO - __main__ - Step 120884: {'lr': 4.624982163972502e-05, 'samples': 23209728, 'steps': 120883, 'loss/train': 1.1516814231872559} 11/07/2021 14:16:23 - INFO - __main__ - Step 120885: {'lr': 4.624674663979872e-05, 'samples': 23209920, 'steps': 120884, 'loss/train': 1.1594082117080688} 11/07/2021 14:16:23 - INFO - __main__ - Step 120886: {'lr': 4.624367173168054e-05, 'samples': 23210112, 'steps': 120885, 'loss/train': 1.7798874378204346} 11/07/2021 14:16:24 - INFO - __main__ - Step 120887: {'lr': 4.624059691537178e-05, 'samples': 23210304, 'steps': 120886, 'loss/train': 1.572027325630188} 11/07/2021 14:16:24 - INFO - __main__ - Step 120888: {'lr': 4.6237522190873846e-05, 'samples': 23210496, 'steps': 120887, 'loss/train': 1.739599347114563} 11/07/2021 14:16:25 - INFO - __main__ - Step 120889: {'lr': 4.623444755818812e-05, 'samples': 23210688, 'steps': 120888, 'loss/train': 1.1679195165634155} 11/07/2021 14:16:26 - INFO - __main__ - Step 120890: {'lr': 4.623137301731603e-05, 'samples': 23210880, 'steps': 120889, 'loss/train': 1.6217093467712402} 11/07/2021 14:16:26 - INFO - __main__ - Step 120891: {'lr': 4.622829856825894e-05, 'samples': 23211072, 'steps': 120890, 'loss/train': 0.9486067891120911} 11/07/2021 14:16:26 - INFO - __main__ - Step 120892: {'lr': 4.622522421101824e-05, 'samples': 23211264, 'steps': 120891, 'loss/train': 1.8044049739837646} 11/07/2021 14:16:27 - INFO - __main__ - Step 120893: {'lr': 4.6222149945595314e-05, 'samples': 23211456, 'steps': 120892, 'loss/train': 1.3554359674453735} 11/07/2021 14:16:28 - INFO - __main__ - Step 120894: {'lr': 4.6219075771991525e-05, 'samples': 23211648, 'steps': 120893, 'loss/train': 1.5156519412994385} 11/07/2021 14:16:28 - INFO - __main__ - Step 120895: {'lr': 4.621600169020829e-05, 'samples': 23211840, 'steps': 120894, 'loss/train': 1.3030076026916504} 11/07/2021 14:16:29 - INFO - __main__ - Step 120896: {'lr': 4.621292770024696e-05, 'samples': 23212032, 'steps': 120895, 'loss/train': 0.977223813533783} 11/07/2021 14:16:29 - INFO - __main__ - Step 120897: {'lr': 4.6209853802109014e-05, 'samples': 23212224, 'steps': 120896, 'loss/train': 0.17789708077907562} 11/07/2021 14:16:29 - INFO - __main__ - Step 120898: {'lr': 4.62067799957957e-05, 'samples': 23212416, 'steps': 120897, 'loss/train': 0.19035175442695618} 11/07/2021 14:16:31 - INFO - __main__ - Step 120899: {'lr': 4.620370628130846e-05, 'samples': 23212608, 'steps': 120898, 'loss/train': 0.9551235437393188} 11/07/2021 14:16:31 - INFO - __main__ - Step 120900: {'lr': 4.6200632658648714e-05, 'samples': 23212800, 'steps': 120899, 'loss/train': 1.3225127458572388} 11/07/2021 14:16:31 - INFO - __main__ - Step 120901: {'lr': 4.619755912781779e-05, 'samples': 23212992, 'steps': 120900, 'loss/train': 1.4883657693862915} 11/07/2021 14:16:32 - INFO - __main__ - Step 120902: {'lr': 4.61944856888171e-05, 'samples': 23213184, 'steps': 120901, 'loss/train': 1.0749235153198242} 11/07/2021 14:16:32 - INFO - __main__ - Step 120903: {'lr': 4.6191412341648035e-05, 'samples': 23213376, 'steps': 120902, 'loss/train': 2.230304479598999} 11/07/2021 14:16:32 - INFO - __main__ - Step 120904: {'lr': 4.618833908631198e-05, 'samples': 23213568, 'steps': 120903, 'loss/train': 1.3053555488586426} 11/07/2021 14:16:34 - INFO - __main__ - Step 120905: {'lr': 4.6185265922810335e-05, 'samples': 23213760, 'steps': 120904, 'loss/train': 1.2297507524490356} 11/07/2021 14:16:34 - INFO - __main__ - Step 120906: {'lr': 4.6182192851144416e-05, 'samples': 23213952, 'steps': 120905, 'loss/train': 1.4070210456848145} 11/07/2021 14:16:34 - INFO - __main__ - Step 120907: {'lr': 4.617911987131568e-05, 'samples': 23214144, 'steps': 120906, 'loss/train': 1.5615168809890747} 11/07/2021 14:16:35 - INFO - __main__ - Step 120908: {'lr': 4.617604698332556e-05, 'samples': 23214336, 'steps': 120907, 'loss/train': 1.2180405855178833} 11/07/2021 14:16:35 - INFO - __main__ - Step 120909: {'lr': 4.617297418717531e-05, 'samples': 23214528, 'steps': 120908, 'loss/train': 1.2616435289382935} 11/07/2021 14:16:36 - INFO - __main__ - Step 120910: {'lr': 4.616990148286635e-05, 'samples': 23214720, 'steps': 120909, 'loss/train': 0.8846762180328369} 11/07/2021 14:16:36 - INFO - __main__ - Step 120911: {'lr': 4.616682887040008e-05, 'samples': 23214912, 'steps': 120910, 'loss/train': 1.5694528818130493} 11/07/2021 14:16:37 - INFO - __main__ - Step 120912: {'lr': 4.616375634977793e-05, 'samples': 23215104, 'steps': 120911, 'loss/train': 1.3754040002822876} 11/07/2021 14:16:37 - INFO - __main__ - Step 120913: {'lr': 4.6160683921001204e-05, 'samples': 23215296, 'steps': 120912, 'loss/train': 1.2113831043243408} 11/07/2021 14:16:37 - INFO - __main__ - Step 120914: {'lr': 4.615761158407136e-05, 'samples': 23215488, 'steps': 120913, 'loss/train': 1.3380253314971924} 11/07/2021 14:16:39 - INFO - __main__ - Step 120915: {'lr': 4.615453933898975e-05, 'samples': 23215680, 'steps': 120914, 'loss/train': 1.579473853111267} 11/07/2021 14:16:39 - INFO - __main__ - Step 120916: {'lr': 4.6151467185757744e-05, 'samples': 23215872, 'steps': 120915, 'loss/train': 1.150447964668274} 11/07/2021 14:16:39 - INFO - __main__ - Step 120917: {'lr': 4.614839512437674e-05, 'samples': 23216064, 'steps': 120916, 'loss/train': 1.8615288734436035} 11/07/2021 14:16:40 - INFO - __main__ - Step 120918: {'lr': 4.614532315484812e-05, 'samples': 23216256, 'steps': 120917, 'loss/train': 1.2095612287521362} 11/07/2021 14:16:40 - INFO - __main__ - Step 120919: {'lr': 4.614225127717334e-05, 'samples': 23216448, 'steps': 120918, 'loss/train': 1.956375002861023} 11/07/2021 14:16:41 - INFO - __main__ - Step 120920: {'lr': 4.613917949135366e-05, 'samples': 23216640, 'steps': 120919, 'loss/train': 1.2530118227005005} 11/07/2021 14:16:42 - INFO - __main__ - Step 120921: {'lr': 4.613610779739053e-05, 'samples': 23216832, 'steps': 120920, 'loss/train': 1.0187934637069702} 11/07/2021 14:16:42 - INFO - __main__ - Step 120922: {'lr': 4.613303619528531e-05, 'samples': 23217024, 'steps': 120921, 'loss/train': 1.0574556589126587} 11/07/2021 14:16:42 - INFO - __main__ - Step 120923: {'lr': 4.612996468503938e-05, 'samples': 23217216, 'steps': 120922, 'loss/train': 0.5899240970611572} 11/07/2021 14:16:43 - INFO - __main__ - Step 120924: {'lr': 4.612689326665417e-05, 'samples': 23217408, 'steps': 120923, 'loss/train': 0.8576339483261108} 11/07/2021 14:16:44 - INFO - __main__ - Step 120925: {'lr': 4.6123821940131014e-05, 'samples': 23217600, 'steps': 120924, 'loss/train': 1.3923627138137817} 11/07/2021 14:16:44 - INFO - __main__ - Step 120926: {'lr': 4.6120750705471337e-05, 'samples': 23217792, 'steps': 120925, 'loss/train': 0.9816062450408936} 11/07/2021 14:16:44 - INFO - __main__ - Step 120927: {'lr': 4.611767956267648e-05, 'samples': 23217984, 'steps': 120926, 'loss/train': 0.539700448513031} 11/07/2021 14:16:45 - INFO - __main__ - Step 120928: {'lr': 4.6114608511747893e-05, 'samples': 23218176, 'steps': 120927, 'loss/train': 1.5659112930297852} 11/07/2021 14:16:45 - INFO - __main__ - Step 120929: {'lr': 4.611153755268688e-05, 'samples': 23218368, 'steps': 120928, 'loss/train': 1.1562389135360718} 11/07/2021 14:16:46 - INFO - __main__ - Step 120930: {'lr': 4.610846668549493e-05, 'samples': 23218560, 'steps': 120929, 'loss/train': 1.4727294445037842} 11/07/2021 14:16:47 - INFO - __main__ - Step 120931: {'lr': 4.610539591017332e-05, 'samples': 23218752, 'steps': 120930, 'loss/train': 1.7822693586349487} 11/07/2021 14:16:47 - INFO - __main__ - Step 120932: {'lr': 4.610232522672345e-05, 'samples': 23218944, 'steps': 120931, 'loss/train': 1.369978666305542} 11/07/2021 14:16:47 - INFO - __main__ - Step 120933: {'lr': 4.609925463514672e-05, 'samples': 23219136, 'steps': 120932, 'loss/train': 1.7457791566848755} 11/07/2021 14:16:48 - INFO - __main__ - Step 120934: {'lr': 4.609618413544453e-05, 'samples': 23219328, 'steps': 120933, 'loss/train': 1.6968153715133667} 11/07/2021 14:16:48 - INFO - __main__ - Step 120935: {'lr': 4.6093113727618266e-05, 'samples': 23219520, 'steps': 120934, 'loss/train': 1.4467226266860962} 11/07/2021 14:16:49 - INFO - __main__ - Step 120936: {'lr': 4.609004341166928e-05, 'samples': 23219712, 'steps': 120935, 'loss/train': 1.553177833557129} 11/07/2021 14:16:50 - INFO - __main__ - Step 120937: {'lr': 4.608697318759897e-05, 'samples': 23219904, 'steps': 120936, 'loss/train': 1.6949416399002075} 11/07/2021 14:16:50 - INFO - __main__ - Step 120938: {'lr': 4.6083903055408744e-05, 'samples': 23220096, 'steps': 120937, 'loss/train': 1.2285857200622559} 11/07/2021 14:16:50 - INFO - __main__ - Step 120939: {'lr': 4.608083301509994e-05, 'samples': 23220288, 'steps': 120938, 'loss/train': 1.5205689668655396} 11/07/2021 14:16:51 - INFO - __main__ - Step 120940: {'lr': 4.6077763066673995e-05, 'samples': 23220480, 'steps': 120939, 'loss/train': 1.6196496486663818} 11/07/2021 14:16:52 - INFO - __main__ - Step 120941: {'lr': 4.60746932101323e-05, 'samples': 23220672, 'steps': 120940, 'loss/train': 1.2338590621948242} 11/07/2021 14:16:52 - INFO - __main__ - Step 120942: {'lr': 4.6071623445476166e-05, 'samples': 23220864, 'steps': 120941, 'loss/train': 0.8811200857162476} 11/07/2021 14:16:53 - INFO - __main__ - Step 120943: {'lr': 4.6068553772706964e-05, 'samples': 23221056, 'steps': 120942, 'loss/train': 1.2453961372375488} 11/07/2021 14:16:53 - INFO - __main__ - Step 120944: {'lr': 4.606548419182619e-05, 'samples': 23221248, 'steps': 120943, 'loss/train': 1.4559496641159058} 11/07/2021 14:16:53 - INFO - __main__ - Step 120945: {'lr': 4.606241470283512e-05, 'samples': 23221440, 'steps': 120944, 'loss/train': 1.4913536310195923} 11/07/2021 14:16:54 - INFO - __main__ - Step 120946: {'lr': 4.6059345305735164e-05, 'samples': 23221632, 'steps': 120945, 'loss/train': 1.235758900642395} 11/07/2021 14:16:55 - INFO - __main__ - Step 120947: {'lr': 4.605627600052775e-05, 'samples': 23221824, 'steps': 120946, 'loss/train': 1.7404193878173828} 11/07/2021 14:16:55 - INFO - __main__ - Step 120948: {'lr': 4.6053206787214225e-05, 'samples': 23222016, 'steps': 120947, 'loss/train': 1.4800795316696167} 11/07/2021 14:16:55 - INFO - __main__ - Step 120949: {'lr': 4.6050137665795967e-05, 'samples': 23222208, 'steps': 120948, 'loss/train': 1.0381709337234497} 11/07/2021 14:16:56 - INFO - __main__ - Step 120950: {'lr': 4.604706863627439e-05, 'samples': 23222400, 'steps': 120949, 'loss/train': 1.3158327341079712} 11/07/2021 14:16:57 - INFO - __main__ - Step 120951: {'lr': 4.604399969865083e-05, 'samples': 23222592, 'steps': 120950, 'loss/train': 0.8735660314559937} 11/07/2021 14:16:57 - INFO - __main__ - Step 120952: {'lr': 4.6040930852926735e-05, 'samples': 23222784, 'steps': 120951, 'loss/train': 1.6709396839141846} 11/07/2021 14:16:58 - INFO - __main__ - Step 120953: {'lr': 4.603786209910341e-05, 'samples': 23222976, 'steps': 120952, 'loss/train': 1.1763687133789062} 11/07/2021 14:16:58 - INFO - __main__ - Step 120954: {'lr': 4.6034793437182364e-05, 'samples': 23223168, 'steps': 120953, 'loss/train': 1.195374608039856} 11/07/2021 14:16:58 - INFO - __main__ - Step 120955: {'lr': 4.6031724867164834e-05, 'samples': 23223360, 'steps': 120954, 'loss/train': 1.4215835332870483} 11/07/2021 14:17:00 - INFO - __main__ - Step 120956: {'lr': 4.6028656389052236e-05, 'samples': 23223552, 'steps': 120955, 'loss/train': 1.4365012645721436} 11/07/2021 14:17:00 - INFO - __main__ - Step 120957: {'lr': 4.602558800284598e-05, 'samples': 23223744, 'steps': 120956, 'loss/train': 0.6007328629493713} 11/07/2021 14:17:00 - INFO - __main__ - Step 120958: {'lr': 4.6022519708547486e-05, 'samples': 23223936, 'steps': 120957, 'loss/train': 1.4285812377929688} 11/07/2021 14:17:01 - INFO - __main__ - Step 120959: {'lr': 4.601945150615805e-05, 'samples': 23224128, 'steps': 120958, 'loss/train': 1.4972524642944336} 11/07/2021 14:17:01 - INFO - __main__ - Step 120960: {'lr': 4.6016383395679124e-05, 'samples': 23224320, 'steps': 120959, 'loss/train': 0.09303396195173264} 11/07/2021 14:17:02 - INFO - __main__ - Step 120961: {'lr': 4.601331537711206e-05, 'samples': 23224512, 'steps': 120960, 'loss/train': 1.3047420978546143} 11/07/2021 14:17:03 - INFO - __main__ - Step 120962: {'lr': 4.601024745045826e-05, 'samples': 23224704, 'steps': 120961, 'loss/train': 1.2424874305725098} 11/07/2021 14:17:03 - INFO - __main__ - Step 120963: {'lr': 4.6007179615719096e-05, 'samples': 23224896, 'steps': 120962, 'loss/train': 0.9329887628555298} 11/07/2021 14:17:04 - INFO - __main__ - Step 120964: {'lr': 4.600411187289594e-05, 'samples': 23225088, 'steps': 120963, 'loss/train': 0.12631484866142273} 11/07/2021 14:17:04 - INFO - __main__ - Step 120965: {'lr': 4.600104422199017e-05, 'samples': 23225280, 'steps': 120964, 'loss/train': 1.0145782232284546} 11/07/2021 14:17:04 - INFO - __main__ - Step 120966: {'lr': 4.5997976663003205e-05, 'samples': 23225472, 'steps': 120965, 'loss/train': 1.615734577178955} 11/07/2021 14:17:05 - INFO - __main__ - Step 120967: {'lr': 4.599490919593638e-05, 'samples': 23225664, 'steps': 120966, 'loss/train': 1.2200618982315063} 11/07/2021 14:17:06 - INFO - __main__ - Step 120968: {'lr': 4.599184182079119e-05, 'samples': 23225856, 'steps': 120967, 'loss/train': 1.411840796470642} 11/07/2021 14:17:06 - INFO - __main__ - Step 120969: {'lr': 4.598877453756886e-05, 'samples': 23226048, 'steps': 120968, 'loss/train': 0.7425568699836731} 11/07/2021 14:17:06 - INFO - __main__ - Step 120970: {'lr': 4.598570734627083e-05, 'samples': 23226240, 'steps': 120969, 'loss/train': 4.1002397537231445} 11/07/2021 14:17:07 - INFO - __main__ - Step 120971: {'lr': 4.5982640246898495e-05, 'samples': 23226432, 'steps': 120970, 'loss/train': 1.6142253875732422} 11/07/2021 14:17:08 - INFO - __main__ - Step 120972: {'lr': 4.5979573239453235e-05, 'samples': 23226624, 'steps': 120971, 'loss/train': 1.1452289819717407} 11/07/2021 14:17:08 - INFO - __main__ - Step 120973: {'lr': 4.5976506323936433e-05, 'samples': 23226816, 'steps': 120972, 'loss/train': 0.5797629356384277} 11/07/2021 14:17:08 - INFO - __main__ - Step 120974: {'lr': 4.597343950034946e-05, 'samples': 23227008, 'steps': 120973, 'loss/train': 1.087819218635559} 11/07/2021 14:17:09 - INFO - __main__ - Step 120975: {'lr': 4.59703727686937e-05, 'samples': 23227200, 'steps': 120974, 'loss/train': 1.481359839439392} 11/07/2021 14:17:09 - INFO - __main__ - Step 120976: {'lr': 4.596730612897057e-05, 'samples': 23227392, 'steps': 120975, 'loss/train': 1.2615091800689697} 11/07/2021 14:17:10 - INFO - __main__ - Step 120977: {'lr': 4.59642395811814e-05, 'samples': 23227584, 'steps': 120976, 'loss/train': 1.4466469287872314} 11/07/2021 14:17:10 - INFO - __main__ - Step 120978: {'lr': 4.596117312532761e-05, 'samples': 23227776, 'steps': 120977, 'loss/train': 1.5947859287261963} 11/07/2021 14:17:11 - INFO - __main__ - Step 120979: {'lr': 4.5958106761410574e-05, 'samples': 23227968, 'steps': 120978, 'loss/train': 0.8675800561904907} 11/07/2021 14:17:11 - INFO - __main__ - Step 120980: {'lr': 4.5955040489431635e-05, 'samples': 23228160, 'steps': 120979, 'loss/train': 1.6169987916946411} 11/07/2021 14:17:12 - INFO - __main__ - Step 120981: {'lr': 4.5951974309392295e-05, 'samples': 23228352, 'steps': 120980, 'loss/train': 1.1652867794036865} 11/07/2021 14:17:13 - INFO - __main__ - Step 120982: {'lr': 4.594890822129377e-05, 'samples': 23228544, 'steps': 120981, 'loss/train': 0.9758316874504089} 11/07/2021 14:17:13 - INFO - __main__ - Step 120983: {'lr': 4.5945842225137534e-05, 'samples': 23228736, 'steps': 120982, 'loss/train': 1.5738669633865356} 11/07/2021 14:17:13 - INFO - __main__ - Step 120984: {'lr': 4.5942776320924945e-05, 'samples': 23228928, 'steps': 120983, 'loss/train': 1.6187779903411865} 11/07/2021 14:17:14 - INFO - __main__ - Step 120985: {'lr': 4.593971050865739e-05, 'samples': 23229120, 'steps': 120984, 'loss/train': 1.4131814241409302} 11/07/2021 14:17:14 - INFO - __main__ - Step 120986: {'lr': 4.593664478833626e-05, 'samples': 23229312, 'steps': 120985, 'loss/train': 1.2441967725753784} 11/07/2021 14:17:15 - INFO - __main__ - Step 120987: {'lr': 4.593357915996291e-05, 'samples': 23229504, 'steps': 120986, 'loss/train': 1.5778188705444336} 11/07/2021 14:17:15 - INFO - __main__ - Step 120988: {'lr': 4.5930513623538756e-05, 'samples': 23229696, 'steps': 120987, 'loss/train': 1.5706435441970825} 11/07/2021 14:17:16 - INFO - __main__ - Step 120989: {'lr': 4.5927448179065165e-05, 'samples': 23229888, 'steps': 120988, 'loss/train': 1.2221271991729736} 11/07/2021 14:17:16 - INFO - __main__ - Step 120990: {'lr': 4.592438282654351e-05, 'samples': 23230080, 'steps': 120989, 'loss/train': 1.0222243070602417} 11/07/2021 14:17:16 - INFO - __main__ - Step 120991: {'lr': 4.5921317565975174e-05, 'samples': 23230272, 'steps': 120990, 'loss/train': 1.4411296844482422} 11/07/2021 14:17:17 - INFO - __main__ - Step 120992: {'lr': 4.591825239736155e-05, 'samples': 23230464, 'steps': 120991, 'loss/train': 1.2781416177749634} 11/07/2021 14:17:18 - INFO - __main__ - Step 120993: {'lr': 4.5915187320704016e-05, 'samples': 23230656, 'steps': 120992, 'loss/train': 0.8911145329475403} 11/07/2021 14:17:18 - INFO - __main__ - Step 120994: {'lr': 4.5912122336004e-05, 'samples': 23230848, 'steps': 120993, 'loss/train': 1.3466635942459106} 11/07/2021 14:17:19 - INFO - __main__ - Step 120995: {'lr': 4.590905744326279e-05, 'samples': 23231040, 'steps': 120994, 'loss/train': 0.9604590535163879} 11/07/2021 14:17:19 - INFO - __main__ - Step 120996: {'lr': 4.59059926424818e-05, 'samples': 23231232, 'steps': 120995, 'loss/train': 1.158227801322937} 11/07/2021 14:17:19 - INFO - __main__ - Step 120997: {'lr': 4.5902927933662406e-05, 'samples': 23231424, 'steps': 120996, 'loss/train': 1.48817777633667} 11/07/2021 14:17:21 - INFO - __main__ - Step 120998: {'lr': 4.589986331680601e-05, 'samples': 23231616, 'steps': 120997, 'loss/train': 1.1706898212432861} 11/07/2021 14:17:21 - INFO - __main__ - Step 120999: {'lr': 4.5896798791914e-05, 'samples': 23231808, 'steps': 120998, 'loss/train': 1.4724253416061401} 11/07/2021 14:17:21 - INFO - __main__ - Step 121000: {'lr': 4.589373435898772e-05, 'samples': 23232000, 'steps': 120999, 'loss/train': 0.348531574010849} 11/07/2021 14:17:22 - INFO - __main__ - Step 121001: {'lr': 4.5890670018028604e-05, 'samples': 23232192, 'steps': 121000, 'loss/train': 1.4035584926605225} 11/07/2021 14:17:22 - INFO - __main__ - Step 121002: {'lr': 4.588760576903797e-05, 'samples': 23232384, 'steps': 121001, 'loss/train': 1.521490216255188} 11/07/2021 14:17:22 - INFO - __main__ - Step 121003: {'lr': 4.588454161201727e-05, 'samples': 23232576, 'steps': 121002, 'loss/train': 1.7608875036239624} 11/07/2021 14:17:24 - INFO - __main__ - Step 121004: {'lr': 4.588147754696781e-05, 'samples': 23232768, 'steps': 121003, 'loss/train': 1.5488306283950806} 11/07/2021 14:17:24 - INFO - __main__ - Step 121005: {'lr': 4.5878413573891026e-05, 'samples': 23232960, 'steps': 121004, 'loss/train': 1.1432485580444336} 11/07/2021 14:17:24 - INFO - __main__ - Step 121006: {'lr': 4.587534969278828e-05, 'samples': 23233152, 'steps': 121005, 'loss/train': 1.3442754745483398} 11/07/2021 14:17:25 - INFO - __main__ - Step 121007: {'lr': 4.587228590366094e-05, 'samples': 23233344, 'steps': 121006, 'loss/train': 1.482246994972229} 11/07/2021 14:17:25 - INFO - __main__ - Step 121008: {'lr': 4.5869222206510466e-05, 'samples': 23233536, 'steps': 121007, 'loss/train': 1.4601222276687622} 11/07/2021 14:17:26 - INFO - __main__ - Step 121009: {'lr': 4.586615860133811e-05, 'samples': 23233728, 'steps': 121008, 'loss/train': 1.414419174194336} 11/07/2021 14:17:26 - INFO - __main__ - Step 121010: {'lr': 4.586309508814529e-05, 'samples': 23233920, 'steps': 121009, 'loss/train': 1.4407366514205933} 11/07/2021 14:17:27 - INFO - __main__ - Step 121011: {'lr': 4.586003166693345e-05, 'samples': 23234112, 'steps': 121010, 'loss/train': 1.2262680530548096} 11/07/2021 14:17:27 - INFO - __main__ - Step 121012: {'lr': 4.585696833770389e-05, 'samples': 23234304, 'steps': 121011, 'loss/train': 1.5235382318496704} 11/07/2021 14:17:27 - INFO - __main__ - Step 121013: {'lr': 4.585390510045806e-05, 'samples': 23234496, 'steps': 121012, 'loss/train': 0.7682686448097229} 11/07/2021 14:17:28 - INFO - __main__ - Step 121014: {'lr': 4.585084195519729e-05, 'samples': 23234688, 'steps': 121013, 'loss/train': 1.1852967739105225} 11/07/2021 14:17:29 - INFO - __main__ - Step 121015: {'lr': 4.5847778901922984e-05, 'samples': 23234880, 'steps': 121014, 'loss/train': 1.266527533531189} 11/07/2021 14:17:29 - INFO - __main__ - Step 121016: {'lr': 4.584471594063655e-05, 'samples': 23235072, 'steps': 121015, 'loss/train': 1.6753863096237183} 11/07/2021 14:17:30 - INFO - __main__ - Step 121017: {'lr': 4.584165307133931e-05, 'samples': 23235264, 'steps': 121016, 'loss/train': 1.9009724855422974} 11/07/2021 14:17:30 - INFO - __main__ - Step 121018: {'lr': 4.583859029403267e-05, 'samples': 23235456, 'steps': 121017, 'loss/train': 1.054194688796997} 11/07/2021 14:17:30 - INFO - __main__ - Step 121019: {'lr': 4.583552760871801e-05, 'samples': 23235648, 'steps': 121018, 'loss/train': 1.5268672704696655} 11/07/2021 14:17:31 - INFO - __main__ - Step 121020: {'lr': 4.5832465015396704e-05, 'samples': 23235840, 'steps': 121019, 'loss/train': 1.5179495811462402} 11/07/2021 14:17:32 - INFO - __main__ - Step 121021: {'lr': 4.582940251407022e-05, 'samples': 23236032, 'steps': 121020, 'loss/train': 1.9313944578170776} 11/07/2021 14:17:32 - INFO - __main__ - Step 121022: {'lr': 4.5826340104739766e-05, 'samples': 23236224, 'steps': 121021, 'loss/train': 0.4703315794467926} 11/07/2021 14:17:32 - INFO - __main__ - Step 121023: {'lr': 4.582327778740683e-05, 'samples': 23236416, 'steps': 121022, 'loss/train': 1.3657772541046143} 11/07/2021 14:17:33 - INFO - __main__ - Step 121024: {'lr': 4.5820215562072775e-05, 'samples': 23236608, 'steps': 121023, 'loss/train': 0.3617773950099945} 11/07/2021 14:17:34 - INFO - __main__ - Step 121025: {'lr': 4.581715342873899e-05, 'samples': 23236800, 'steps': 121024, 'loss/train': 1.270128607749939} 11/07/2021 14:17:34 - INFO - __main__ - Step 121026: {'lr': 4.581409138740683e-05, 'samples': 23236992, 'steps': 121025, 'loss/train': 1.0686830282211304} 11/07/2021 14:17:34 - INFO - __main__ - Step 121027: {'lr': 4.581102943807772e-05, 'samples': 23237184, 'steps': 121026, 'loss/train': 0.1296989917755127} 11/07/2021 14:17:35 - INFO - __main__ - Step 121028: {'lr': 4.580796758075298e-05, 'samples': 23237376, 'steps': 121027, 'loss/train': 1.4099335670471191} 11/07/2021 14:17:35 - INFO - __main__ - Step 121029: {'lr': 4.5804905815434e-05, 'samples': 23237568, 'steps': 121028, 'loss/train': 1.2689993381500244} 11/07/2021 14:17:36 - INFO - __main__ - Step 121030: {'lr': 4.5801844142122214e-05, 'samples': 23237760, 'steps': 121029, 'loss/train': 1.3087605237960815} 11/07/2021 14:17:36 - INFO - __main__ - Step 121031: {'lr': 4.579878256081896e-05, 'samples': 23237952, 'steps': 121030, 'loss/train': 1.5610010623931885} 11/07/2021 14:17:37 - INFO - __main__ - Step 121032: {'lr': 4.57957210715256e-05, 'samples': 23238144, 'steps': 121031, 'loss/train': 1.0527480840682983} 11/07/2021 14:17:37 - INFO - __main__ - Step 121033: {'lr': 4.5792659674243566e-05, 'samples': 23238336, 'steps': 121032, 'loss/train': 1.467382550239563} 11/07/2021 14:17:38 - INFO - __main__ - Step 121034: {'lr': 4.57895983689742e-05, 'samples': 23238528, 'steps': 121033, 'loss/train': 1.2775789499282837} 11/07/2021 14:17:39 - INFO - __main__ - Step 121035: {'lr': 4.578653715571896e-05, 'samples': 23238720, 'steps': 121034, 'loss/train': 1.592339277267456} 11/07/2021 14:17:39 - INFO - __main__ - Step 121036: {'lr': 4.578347603447908e-05, 'samples': 23238912, 'steps': 121035, 'loss/train': 0.9818099737167358} 11/07/2021 14:17:39 - INFO - __main__ - Step 121037: {'lr': 4.578041500525601e-05, 'samples': 23239104, 'steps': 121036, 'loss/train': 1.4409137964248657} 11/07/2021 14:17:40 - INFO - __main__ - Step 121038: {'lr': 4.577735406805114e-05, 'samples': 23239296, 'steps': 121037, 'loss/train': 1.388702630996704} 11/07/2021 14:17:40 - INFO - __main__ - Step 121039: {'lr': 4.5774293222865835e-05, 'samples': 23239488, 'steps': 121038, 'loss/train': 1.2627434730529785} 11/07/2021 14:17:41 - INFO - __main__ - Step 121040: {'lr': 4.5771232469701495e-05, 'samples': 23239680, 'steps': 121039, 'loss/train': 1.6351054906845093} 11/07/2021 14:17:41 - INFO - __main__ - Step 121041: {'lr': 4.576817180855949e-05, 'samples': 23239872, 'steps': 121040, 'loss/train': 1.2272371053695679} 11/07/2021 14:17:42 - INFO - __main__ - Step 121042: {'lr': 4.576511123944119e-05, 'samples': 23240064, 'steps': 121041, 'loss/train': 0.6492634415626526} 11/07/2021 14:17:42 - INFO - __main__ - Step 121043: {'lr': 4.5762050762347965e-05, 'samples': 23240256, 'steps': 121042, 'loss/train': 1.5126302242279053} 11/07/2021 14:17:43 - INFO - __main__ - Step 121044: {'lr': 4.5758990377281234e-05, 'samples': 23240448, 'steps': 121043, 'loss/train': 1.5106110572814941} 11/07/2021 14:17:43 - INFO - __main__ - Step 121045: {'lr': 4.5755930084242335e-05, 'samples': 23240640, 'steps': 121044, 'loss/train': 1.20461106300354} 11/07/2021 14:17:44 - INFO - __main__ - Step 121046: {'lr': 4.575286988323266e-05, 'samples': 23240832, 'steps': 121045, 'loss/train': 1.2443463802337646} 11/07/2021 14:17:44 - INFO - __main__ - Step 121047: {'lr': 4.5749809774253584e-05, 'samples': 23241024, 'steps': 121046, 'loss/train': 1.3285748958587646} 11/07/2021 14:17:45 - INFO - __main__ - Step 121048: {'lr': 4.574674975730655e-05, 'samples': 23241216, 'steps': 121047, 'loss/train': 1.4429413080215454} 11/07/2021 14:17:45 - INFO - __main__ - Step 121049: {'lr': 4.5743689832392854e-05, 'samples': 23241408, 'steps': 121048, 'loss/train': 1.2141876220703125} 11/07/2021 14:17:45 - INFO - __main__ - Step 121050: {'lr': 4.574062999951387e-05, 'samples': 23241600, 'steps': 121049, 'loss/train': 1.3910857439041138} 11/07/2021 14:17:46 - INFO - __main__ - Step 121051: {'lr': 4.5737570258671006e-05, 'samples': 23241792, 'steps': 121050, 'loss/train': 1.2221965789794922} 11/07/2021 14:17:47 - INFO - __main__ - Step 121052: {'lr': 4.5734510609865634e-05, 'samples': 23241984, 'steps': 121051, 'loss/train': 1.5602502822875977} 11/07/2021 14:17:47 - INFO - __main__ - Step 121053: {'lr': 4.5731451053099174e-05, 'samples': 23242176, 'steps': 121052, 'loss/train': 1.0964661836624146} 11/07/2021 14:17:47 - INFO - __main__ - Step 121054: {'lr': 4.572839158837294e-05, 'samples': 23242368, 'steps': 121053, 'loss/train': 1.6110981702804565} 11/07/2021 14:17:48 - INFO - __main__ - Step 121055: {'lr': 4.5725332215688336e-05, 'samples': 23242560, 'steps': 121054, 'loss/train': 1.5349233150482178} 11/07/2021 14:17:49 - INFO - __main__ - Step 121056: {'lr': 4.572227293504677e-05, 'samples': 23242752, 'steps': 121055, 'loss/train': 1.2524343729019165} 11/07/2021 14:17:49 - INFO - __main__ - Step 121057: {'lr': 4.571921374644958e-05, 'samples': 23242944, 'steps': 121056, 'loss/train': 1.3539959192276} 11/07/2021 14:17:49 - INFO - __main__ - Step 121058: {'lr': 4.5716154649898174e-05, 'samples': 23243136, 'steps': 121057, 'loss/train': 1.3156273365020752} 11/07/2021 14:17:50 - INFO - __main__ - Step 121059: {'lr': 4.571309564539389e-05, 'samples': 23243328, 'steps': 121058, 'loss/train': 1.1717126369476318} 11/07/2021 14:17:50 - INFO - __main__ - Step 121060: {'lr': 4.5710036732938164e-05, 'samples': 23243520, 'steps': 121059, 'loss/train': 1.2642173767089844} 11/07/2021 14:17:51 - INFO - __main__ - Step 121061: {'lr': 4.570697791253231e-05, 'samples': 23243712, 'steps': 121060, 'loss/train': 1.2304779291152954} 11/07/2021 14:17:52 - INFO - __main__ - Step 121062: {'lr': 4.570391918417782e-05, 'samples': 23243904, 'steps': 121061, 'loss/train': 1.2162635326385498} 11/07/2021 14:17:52 - INFO - __main__ - Step 121063: {'lr': 4.570086054787595e-05, 'samples': 23244096, 'steps': 121062, 'loss/train': 1.2142255306243896} 11/07/2021 14:17:52 - INFO - __main__ - Step 121064: {'lr': 4.569780200362808e-05, 'samples': 23244288, 'steps': 121063, 'loss/train': 1.4833494424819946} 11/07/2021 14:17:53 - INFO - __main__ - Step 121065: {'lr': 4.569474355143566e-05, 'samples': 23244480, 'steps': 121064, 'loss/train': 1.425329566001892} 11/07/2021 14:17:53 - INFO - __main__ - Step 121066: {'lr': 4.569168519130001e-05, 'samples': 23244672, 'steps': 121065, 'loss/train': 1.3320817947387695} 11/07/2021 14:17:54 - INFO - __main__ - Step 121067: {'lr': 4.5688626923222564e-05, 'samples': 23244864, 'steps': 121066, 'loss/train': 1.4587186574935913} 11/07/2021 14:17:54 - INFO - __main__ - Step 121068: {'lr': 4.568556874720464e-05, 'samples': 23245056, 'steps': 121067, 'loss/train': 0.8376361131668091} 11/07/2021 14:17:55 - INFO - __main__ - Step 121069: {'lr': 4.5682510663247664e-05, 'samples': 23245248, 'steps': 121068, 'loss/train': 1.2315361499786377} 11/07/2021 14:17:55 - INFO - __main__ - Step 121070: {'lr': 4.5679452671352985e-05, 'samples': 23245440, 'steps': 121069, 'loss/train': 1.2939999103546143} 11/07/2021 14:17:55 - INFO - __main__ - Step 121071: {'lr': 4.5676394771522e-05, 'samples': 23245632, 'steps': 121070, 'loss/train': 1.3733720779418945} 11/07/2021 14:17:56 - INFO - __main__ - Step 121072: {'lr': 4.567333696375609e-05, 'samples': 23245824, 'steps': 121071, 'loss/train': 1.2236191034317017} 11/07/2021 14:17:57 - INFO - __main__ - Step 121073: {'lr': 4.5670279248056585e-05, 'samples': 23246016, 'steps': 121072, 'loss/train': 1.9554427862167358} 11/07/2021 14:17:57 - INFO - __main__ - Step 121074: {'lr': 4.5667221624424936e-05, 'samples': 23246208, 'steps': 121073, 'loss/train': 1.2699074745178223} 11/07/2021 14:17:57 - INFO - __main__ - Step 121075: {'lr': 4.566416409286253e-05, 'samples': 23246400, 'steps': 121074, 'loss/train': 0.9984866380691528} 11/07/2021 14:17:58 - INFO - __main__ - Step 121076: {'lr': 4.5661106653370646e-05, 'samples': 23246592, 'steps': 121075, 'loss/train': 0.9351995587348938} 11/07/2021 14:17:59 - INFO - __main__ - Step 121077: {'lr': 4.565804930595069e-05, 'samples': 23246784, 'steps': 121076, 'loss/train': 1.5775641202926636} 11/07/2021 14:17:59 - INFO - __main__ - Step 121078: {'lr': 4.565499205060408e-05, 'samples': 23246976, 'steps': 121077, 'loss/train': 1.552022933959961} 11/07/2021 14:18:00 - INFO - __main__ - Step 121079: {'lr': 4.565193488733219e-05, 'samples': 23247168, 'steps': 121078, 'loss/train': 1.1354843378067017} 11/07/2021 14:18:00 - INFO - __main__ - Step 121080: {'lr': 4.5648877816136356e-05, 'samples': 23247360, 'steps': 121079, 'loss/train': 1.1624906063079834} 11/07/2021 14:18:00 - INFO - __main__ - Step 121081: {'lr': 4.5645820837017986e-05, 'samples': 23247552, 'steps': 121080, 'loss/train': 1.5236587524414062} 11/07/2021 14:18:01 - INFO - __main__ - Step 121082: {'lr': 4.564276394997849e-05, 'samples': 23247744, 'steps': 121081, 'loss/train': 1.223721981048584} 11/07/2021 14:18:02 - INFO - __main__ - Step 121083: {'lr': 4.5639707155019195e-05, 'samples': 23247936, 'steps': 121082, 'loss/train': 1.3241349458694458} 11/07/2021 14:18:02 - INFO - __main__ - Step 121084: {'lr': 4.563665045214149e-05, 'samples': 23248128, 'steps': 121083, 'loss/train': 1.1721843481063843} 11/07/2021 14:18:02 - INFO - __main__ - Step 121085: {'lr': 4.5633593841346744e-05, 'samples': 23248320, 'steps': 121084, 'loss/train': 1.1843184232711792} 11/07/2021 14:18:03 - INFO - __main__ - Step 121086: {'lr': 4.5630537322636365e-05, 'samples': 23248512, 'steps': 121085, 'loss/train': 1.4158961772918701} 11/07/2021 14:18:04 - INFO - __main__ - Step 121087: {'lr': 4.562748089601171e-05, 'samples': 23248704, 'steps': 121086, 'loss/train': 1.6135039329528809} 11/07/2021 14:18:04 - INFO - __main__ - Step 121088: {'lr': 4.562442456147414e-05, 'samples': 23248896, 'steps': 121087, 'loss/train': 1.6952975988388062} 11/07/2021 14:18:05 - INFO - __main__ - Step 121089: {'lr': 4.5621368319025136e-05, 'samples': 23249088, 'steps': 121088, 'loss/train': 1.5862597227096558} 11/07/2021 14:18:05 - INFO - __main__ - Step 121090: {'lr': 4.561831216866594e-05, 'samples': 23249280, 'steps': 121089, 'loss/train': 1.4154798984527588} 11/07/2021 14:18:05 - INFO - __main__ - Step 121091: {'lr': 4.5615256110397934e-05, 'samples': 23249472, 'steps': 121090, 'loss/train': 1.164923906326294} 11/07/2021 14:18:06 - INFO - __main__ - Step 121092: {'lr': 4.561220014422257e-05, 'samples': 23249664, 'steps': 121091, 'loss/train': 1.1825132369995117} 11/07/2021 14:18:07 - INFO - __main__ - Step 121093: {'lr': 4.5609144270141206e-05, 'samples': 23249856, 'steps': 121092, 'loss/train': 1.1196892261505127} 11/07/2021 14:18:07 - INFO - __main__ - Step 121094: {'lr': 4.5606088488155175e-05, 'samples': 23250048, 'steps': 121093, 'loss/train': 1.1918830871582031} 11/07/2021 14:18:07 - INFO - __main__ - Step 121095: {'lr': 4.560303279826592e-05, 'samples': 23250240, 'steps': 121094, 'loss/train': 1.4511723518371582} 11/07/2021 14:18:08 - INFO - __main__ - Step 121096: {'lr': 4.559997720047476e-05, 'samples': 23250432, 'steps': 121095, 'loss/train': 1.3882873058319092} 11/07/2021 14:18:09 - INFO - __main__ - Step 121097: {'lr': 4.5596921694783106e-05, 'samples': 23250624, 'steps': 121096, 'loss/train': 2.0567243099212646} 11/07/2021 14:18:09 - INFO - __main__ - Step 121098: {'lr': 4.55938662811923e-05, 'samples': 23250816, 'steps': 121097, 'loss/train': 1.2255820035934448} 11/07/2021 14:18:09 - INFO - __main__ - Step 121099: {'lr': 4.559081095970377e-05, 'samples': 23251008, 'steps': 121098, 'loss/train': 1.3080612421035767} 11/07/2021 14:18:10 - INFO - __main__ - Step 121100: {'lr': 4.558775573031887e-05, 'samples': 23251200, 'steps': 121099, 'loss/train': 1.5141783952713013} 11/07/2021 14:18:10 - INFO - __main__ - Step 121101: {'lr': 4.5584700593038955e-05, 'samples': 23251392, 'steps': 121100, 'loss/train': 1.362884283065796} 11/07/2021 14:18:10 - INFO - __main__ - Step 121102: {'lr': 4.5581645547865505e-05, 'samples': 23251584, 'steps': 121101, 'loss/train': 1.2542515993118286} 11/07/2021 14:18:12 - INFO - __main__ - Step 121103: {'lr': 4.557859059479974e-05, 'samples': 23251776, 'steps': 121102, 'loss/train': 1.0026143789291382} 11/07/2021 14:18:12 - INFO - __main__ - Step 121104: {'lr': 4.55755357338431e-05, 'samples': 23251968, 'steps': 121103, 'loss/train': 1.4723641872406006} 11/07/2021 14:18:12 - INFO - __main__ - Step 121105: {'lr': 4.557248096499697e-05, 'samples': 23252160, 'steps': 121104, 'loss/train': 0.9649338126182556} 11/07/2021 14:18:13 - INFO - __main__ - Step 121106: {'lr': 4.5569426288262716e-05, 'samples': 23252352, 'steps': 121105, 'loss/train': 1.3905658721923828} 11/07/2021 14:18:13 - INFO - __main__ - Step 121107: {'lr': 4.5566371703641754e-05, 'samples': 23252544, 'steps': 121106, 'loss/train': 1.4620985984802246} 11/07/2021 14:18:14 - INFO - __main__ - Step 121108: {'lr': 4.556331721113541e-05, 'samples': 23252736, 'steps': 121107, 'loss/train': 1.2182023525238037} 11/07/2021 14:18:14 - INFO - __main__ - Step 121109: {'lr': 4.5560262810745074e-05, 'samples': 23252928, 'steps': 121108, 'loss/train': 1.2166343927383423} 11/07/2021 14:18:15 - INFO - __main__ - Step 121110: {'lr': 4.5557208502472135e-05, 'samples': 23253120, 'steps': 121109, 'loss/train': 1.2713342905044556} 11/07/2021 14:18:15 - INFO - __main__ - Step 121111: {'lr': 4.5554154286317986e-05, 'samples': 23253312, 'steps': 121110, 'loss/train': 0.2495165318250656} 11/07/2021 14:18:15 - INFO - __main__ - Step 121112: {'lr': 4.555110016228395e-05, 'samples': 23253504, 'steps': 121111, 'loss/train': 1.5035873651504517} 11/07/2021 14:18:16 - INFO - __main__ - Step 121113: {'lr': 4.5548046130371446e-05, 'samples': 23253696, 'steps': 121112, 'loss/train': 1.544446587562561} 11/07/2021 14:18:17 - INFO - __main__ - Step 121114: {'lr': 4.5544992190581834e-05, 'samples': 23253888, 'steps': 121113, 'loss/train': 1.3101218938827515} 11/07/2021 14:18:17 - INFO - __main__ - Step 121115: {'lr': 4.554193834291656e-05, 'samples': 23254080, 'steps': 121114, 'loss/train': 1.2493454217910767} 11/07/2021 14:18:18 - INFO - __main__ - Step 121116: {'lr': 4.553888458737687e-05, 'samples': 23254272, 'steps': 121115, 'loss/train': 1.631117582321167} 11/07/2021 14:18:18 - INFO - __main__ - Step 121117: {'lr': 4.553583092396418e-05, 'samples': 23254464, 'steps': 121116, 'loss/train': 1.244954228401184} 11/07/2021 14:18:19 - INFO - __main__ - Step 121118: {'lr': 4.553277735267994e-05, 'samples': 23254656, 'steps': 121117, 'loss/train': 0.8141382932662964} 11/07/2021 14:18:19 - INFO - __main__ - Step 121119: {'lr': 4.552972387352544e-05, 'samples': 23254848, 'steps': 121118, 'loss/train': 1.1728278398513794} 11/07/2021 14:18:20 - INFO - __main__ - Step 121120: {'lr': 4.552667048650208e-05, 'samples': 23255040, 'steps': 121119, 'loss/train': 1.4438623189926147} 11/07/2021 14:18:20 - INFO - __main__ - Step 121121: {'lr': 4.552361719161127e-05, 'samples': 23255232, 'steps': 121120, 'loss/train': 1.7316160202026367} 11/07/2021 14:18:20 - INFO - __main__ - Step 121122: {'lr': 4.5520563988854375e-05, 'samples': 23255424, 'steps': 121121, 'loss/train': 1.4506531953811646} 11/07/2021 14:18:21 - INFO - __main__ - Step 121123: {'lr': 4.551751087823272e-05, 'samples': 23255616, 'steps': 121122, 'loss/train': 0.9897506833076477} 11/07/2021 14:18:22 - INFO - __main__ - Step 121124: {'lr': 4.5514457859747754e-05, 'samples': 23255808, 'steps': 121123, 'loss/train': 1.4881831407546997} 11/07/2021 14:18:22 - INFO - __main__ - Step 121125: {'lr': 4.551140493340081e-05, 'samples': 23256000, 'steps': 121124, 'loss/train': 1.2950671911239624} 11/07/2021 14:18:22 - INFO - __main__ - Step 121126: {'lr': 4.5508352099193265e-05, 'samples': 23256192, 'steps': 121125, 'loss/train': 1.3723019361495972} 11/07/2021 14:18:23 - INFO - __main__ - Step 121127: {'lr': 4.5505299357126496e-05, 'samples': 23256384, 'steps': 121126, 'loss/train': 1.5916472673416138} 11/07/2021 14:18:23 - INFO - __main__ - Step 121128: {'lr': 4.550224670720188e-05, 'samples': 23256576, 'steps': 121127, 'loss/train': 1.7112164497375488} 11/07/2021 14:18:24 - INFO - __main__ - Step 121129: {'lr': 4.5499194149420884e-05, 'samples': 23256768, 'steps': 121128, 'loss/train': 1.180955171585083} 11/07/2021 14:18:24 - INFO - __main__ - Step 121130: {'lr': 4.549614168378471e-05, 'samples': 23256960, 'steps': 121129, 'loss/train': 1.181370496749878} 11/07/2021 14:18:25 - INFO - __main__ - Step 121131: {'lr': 4.549308931029483e-05, 'samples': 23257152, 'steps': 121130, 'loss/train': 1.097164273262024} 11/07/2021 14:18:25 - INFO - __main__ - Step 121132: {'lr': 4.5490037028952604e-05, 'samples': 23257344, 'steps': 121131, 'loss/train': 1.454750657081604} 11/07/2021 14:18:25 - INFO - __main__ - Step 121133: {'lr': 4.548698483975941e-05, 'samples': 23257536, 'steps': 121132, 'loss/train': 1.6272404193878174} 11/07/2021 14:18:27 - INFO - __main__ - Step 121134: {'lr': 4.548393274271662e-05, 'samples': 23257728, 'steps': 121133, 'loss/train': 1.3658207654953003} 11/07/2021 14:18:27 - INFO - __main__ - Step 121135: {'lr': 4.548088073782564e-05, 'samples': 23257920, 'steps': 121134, 'loss/train': 1.3482666015625} 11/07/2021 14:18:27 - INFO - __main__ - Step 121136: {'lr': 4.5477828825087804e-05, 'samples': 23258112, 'steps': 121135, 'loss/train': 1.2953232526779175} 11/07/2021 14:18:28 - INFO - __main__ - Step 121137: {'lr': 4.547477700450448e-05, 'samples': 23258304, 'steps': 121136, 'loss/train': 1.3436133861541748} 11/07/2021 14:18:28 - INFO - __main__ - Step 121138: {'lr': 4.5471725276077096e-05, 'samples': 23258496, 'steps': 121137, 'loss/train': 1.2078269720077515} 11/07/2021 14:18:29 - INFO - __main__ - Step 121139: {'lr': 4.5468673639806974e-05, 'samples': 23258688, 'steps': 121138, 'loss/train': 0.8626706004142761} 11/07/2021 14:18:29 - INFO - __main__ - Step 121140: {'lr': 4.546562209569552e-05, 'samples': 23258880, 'steps': 121139, 'loss/train': 1.3882578611373901} 11/07/2021 14:18:30 - INFO - __main__ - Step 121141: {'lr': 4.54625706437441e-05, 'samples': 23259072, 'steps': 121140, 'loss/train': 1.384433388710022} 11/07/2021 14:18:30 - INFO - __main__ - Step 121142: {'lr': 4.545951928395417e-05, 'samples': 23259264, 'steps': 121141, 'loss/train': 1.3024927377700806} 11/07/2021 14:18:30 - INFO - __main__ - Step 121143: {'lr': 4.545646801632694e-05, 'samples': 23259456, 'steps': 121142, 'loss/train': 1.1649914979934692} 11/07/2021 14:18:31 - INFO - __main__ - Step 121144: {'lr': 4.5453416840863876e-05, 'samples': 23259648, 'steps': 121143, 'loss/train': 1.0538111925125122} 11/07/2021 14:18:32 - INFO - __main__ - Step 121145: {'lr': 4.545036575756634e-05, 'samples': 23259840, 'steps': 121144, 'loss/train': 1.1038062572479248} 11/07/2021 14:18:32 - INFO - __main__ - Step 121146: {'lr': 4.544731476643571e-05, 'samples': 23260032, 'steps': 121145, 'loss/train': 0.7342548966407776} 11/07/2021 14:18:33 - INFO - __main__ - Step 121147: {'lr': 4.5444263867473384e-05, 'samples': 23260224, 'steps': 121146, 'loss/train': 1.2355018854141235} 11/07/2021 14:18:33 - INFO - __main__ - Step 121148: {'lr': 4.544121306068069e-05, 'samples': 23260416, 'steps': 121147, 'loss/train': 1.6017823219299316} 11/07/2021 14:18:33 - INFO - __main__ - Step 121149: {'lr': 4.543816234605905e-05, 'samples': 23260608, 'steps': 121148, 'loss/train': 0.9361854195594788} 11/07/2021 14:18:34 - INFO - __main__ - Step 121150: {'lr': 4.543511172360981e-05, 'samples': 23260800, 'steps': 121149, 'loss/train': 1.199326753616333} 11/07/2021 14:18:35 - INFO - __main__ - Step 121151: {'lr': 4.5432061193334343e-05, 'samples': 23260992, 'steps': 121150, 'loss/train': 1.2593293190002441} 11/07/2021 14:18:35 - INFO - __main__ - Step 121152: {'lr': 4.542901075523406e-05, 'samples': 23261184, 'steps': 121151, 'loss/train': 1.764281988143921} 11/07/2021 14:18:35 - INFO - __main__ - Step 121153: {'lr': 4.5425960409310294e-05, 'samples': 23261376, 'steps': 121152, 'loss/train': 1.2456763982772827} 11/07/2021 14:18:36 - INFO - __main__ - Step 121154: {'lr': 4.5422910155564434e-05, 'samples': 23261568, 'steps': 121153, 'loss/train': 0.8937584161758423} 11/07/2021 14:18:37 - INFO - __main__ - Step 121155: {'lr': 4.541985999399789e-05, 'samples': 23261760, 'steps': 121154, 'loss/train': 1.1625542640686035} 11/07/2021 14:18:37 - INFO - __main__ - Step 121156: {'lr': 4.5416809924611976e-05, 'samples': 23261952, 'steps': 121155, 'loss/train': 1.2210248708724976} 11/07/2021 14:18:37 - INFO - __main__ - Step 121157: {'lr': 4.541375994740807e-05, 'samples': 23262144, 'steps': 121156, 'loss/train': 1.4778571128845215} 11/07/2021 14:18:38 - INFO - __main__ - Step 121158: {'lr': 4.5410710062387564e-05, 'samples': 23262336, 'steps': 121157, 'loss/train': 1.1347590684890747} 11/07/2021 14:18:38 - INFO - __main__ - Step 121159: {'lr': 4.540766026955184e-05, 'samples': 23262528, 'steps': 121158, 'loss/train': 1.3152663707733154} 11/07/2021 14:18:39 - INFO - __main__ - Step 121160: {'lr': 4.540461056890227e-05, 'samples': 23262720, 'steps': 121159, 'loss/train': 1.2241504192352295} 11/07/2021 14:18:40 - INFO - __main__ - Step 121161: {'lr': 4.540156096044024e-05, 'samples': 23262912, 'steps': 121160, 'loss/train': 1.0782514810562134} 11/07/2021 14:18:40 - INFO - __main__ - Step 121162: {'lr': 4.5398511444167095e-05, 'samples': 23263104, 'steps': 121161, 'loss/train': 0.6858470439910889} 11/07/2021 14:18:40 - INFO - __main__ - Step 121163: {'lr': 4.5395462020084214e-05, 'samples': 23263296, 'steps': 121162, 'loss/train': 1.85700261592865} 11/07/2021 14:18:41 - INFO - __main__ - Step 121164: {'lr': 4.5392412688192994e-05, 'samples': 23263488, 'steps': 121163, 'loss/train': 1.315364956855774} 11/07/2021 14:18:41 - INFO - __main__ - Step 121165: {'lr': 4.5389363448494786e-05, 'samples': 23263680, 'steps': 121164, 'loss/train': 1.4252862930297852} 11/07/2021 14:18:42 - INFO - __main__ - Step 121166: {'lr': 4.538631430099105e-05, 'samples': 23263872, 'steps': 121165, 'loss/train': 1.3295828104019165} 11/07/2021 14:18:43 - INFO - __main__ - Step 121167: {'lr': 4.538326524568301e-05, 'samples': 23264064, 'steps': 121166, 'loss/train': 1.086824893951416} 11/07/2021 14:18:43 - INFO - __main__ - Step 121168: {'lr': 4.538021628257211e-05, 'samples': 23264256, 'steps': 121167, 'loss/train': 1.6144676208496094} 11/07/2021 14:18:43 - INFO - __main__ - Step 121169: {'lr': 4.537716741165973e-05, 'samples': 23264448, 'steps': 121168, 'loss/train': 1.7435939311981201} 11/07/2021 14:18:44 - INFO - __main__ - Step 121170: {'lr': 4.537411863294724e-05, 'samples': 23264640, 'steps': 121169, 'loss/train': 4.004571437835693} 11/07/2021 14:18:44 - INFO - __main__ - Step 121171: {'lr': 4.537106994643603e-05, 'samples': 23264832, 'steps': 121170, 'loss/train': 1.6206235885620117} 11/07/2021 14:18:45 - INFO - __main__ - Step 121172: {'lr': 4.536802135212745e-05, 'samples': 23265024, 'steps': 121171, 'loss/train': 1.3205009698867798} 11/07/2021 14:18:45 - INFO - __main__ - Step 121173: {'lr': 4.536497285002286e-05, 'samples': 23265216, 'steps': 121172, 'loss/train': 1.2135387659072876} 11/07/2021 14:18:46 - INFO - __main__ - Step 121174: {'lr': 4.536192444012369e-05, 'samples': 23265408, 'steps': 121173, 'loss/train': 1.070778727531433} 11/07/2021 14:18:46 - INFO - __main__ - Step 121175: {'lr': 4.535887612243125e-05, 'samples': 23265600, 'steps': 121174, 'loss/train': 1.331309199333191} 11/07/2021 14:18:46 - INFO - __main__ - Step 121176: {'lr': 4.535582789694695e-05, 'samples': 23265792, 'steps': 121175, 'loss/train': 1.289969563484192} 11/07/2021 14:18:48 - INFO - __main__ - Step 121177: {'lr': 4.535277976367222e-05, 'samples': 23265984, 'steps': 121176, 'loss/train': 1.3116861581802368} 11/07/2021 14:18:48 - INFO - __main__ - Step 121178: {'lr': 4.534973172260831e-05, 'samples': 23266176, 'steps': 121177, 'loss/train': 1.0928391218185425} 11/07/2021 14:18:49 - INFO - __main__ - Step 121179: {'lr': 4.5346683773756667e-05, 'samples': 23266368, 'steps': 121178, 'loss/train': 0.849380373954773} 11/07/2021 14:18:49 - INFO - __main__ - Step 121180: {'lr': 4.534363591711863e-05, 'samples': 23266560, 'steps': 121179, 'loss/train': 1.1182692050933838} 11/07/2021 14:18:50 - INFO - __main__ - Step 121181: {'lr': 4.5340588152695596e-05, 'samples': 23266752, 'steps': 121180, 'loss/train': 1.378164529800415} 11/07/2021 14:18:50 - INFO - __main__ - Step 121182: {'lr': 4.533754048048894e-05, 'samples': 23266944, 'steps': 121181, 'loss/train': 1.683665156364441} 11/07/2021 14:18:50 - INFO - __main__ - Step 121183: {'lr': 4.533449290050004e-05, 'samples': 23267136, 'steps': 121182, 'loss/train': 1.3500076532363892} 11/07/2021 14:18:51 - INFO - __main__ - Step 121184: {'lr': 4.5331445412730235e-05, 'samples': 23267328, 'steps': 121183, 'loss/train': 1.2740668058395386} 11/07/2021 14:18:52 - INFO - __main__ - Step 121185: {'lr': 4.5328398017180944e-05, 'samples': 23267520, 'steps': 121184, 'loss/train': 0.7733657956123352} 11/07/2021 14:18:52 - INFO - __main__ - Step 121186: {'lr': 4.532535071385349e-05, 'samples': 23267712, 'steps': 121185, 'loss/train': 0.576204776763916} 11/07/2021 14:18:52 - INFO - __main__ - Step 121187: {'lr': 4.532230350274938e-05, 'samples': 23267904, 'steps': 121186, 'loss/train': 1.661158800125122} 11/07/2021 14:18:53 - INFO - __main__ - Step 121188: {'lr': 4.53192563838698e-05, 'samples': 23268096, 'steps': 121187, 'loss/train': 0.5703549981117249} 11/07/2021 14:18:54 - INFO - __main__ - Step 121189: {'lr': 4.53162093572162e-05, 'samples': 23268288, 'steps': 121188, 'loss/train': 1.1440083980560303} 11/07/2021 14:18:54 - INFO - __main__ - Step 121190: {'lr': 4.531316242278993e-05, 'samples': 23268480, 'steps': 121189, 'loss/train': 1.1284579038619995} 11/07/2021 14:18:55 - INFO - __main__ - Step 121191: {'lr': 4.5310115580592445e-05, 'samples': 23268672, 'steps': 121190, 'loss/train': 1.277269959449768} 11/07/2021 14:18:55 - INFO - __main__ - Step 121192: {'lr': 4.530706883062502e-05, 'samples': 23268864, 'steps': 121191, 'loss/train': 0.7552962899208069} 11/07/2021 14:18:55 - INFO - __main__ - Step 121193: {'lr': 4.530402217288909e-05, 'samples': 23269056, 'steps': 121192, 'loss/train': 1.3362996578216553} 11/07/2021 14:18:56 - INFO - __main__ - Step 121194: {'lr': 4.5300975607386026e-05, 'samples': 23269248, 'steps': 121193, 'loss/train': 0.6590954661369324} 11/07/2021 14:18:57 - INFO - __main__ - Step 121195: {'lr': 4.529792913411715e-05, 'samples': 23269440, 'steps': 121194, 'loss/train': 1.1494355201721191} 11/07/2021 14:18:57 - INFO - __main__ - Step 121196: {'lr': 4.5294882753083884e-05, 'samples': 23269632, 'steps': 121195, 'loss/train': 1.4549387693405151} 11/07/2021 14:18:58 - INFO - __main__ - Step 121197: {'lr': 4.529183646428758e-05, 'samples': 23269824, 'steps': 121196, 'loss/train': 1.1755355596542358} 11/07/2021 14:18:58 - INFO - __main__ - Step 121198: {'lr': 4.52887902677297e-05, 'samples': 23270016, 'steps': 121197, 'loss/train': 0.22819706797599792} 11/07/2021 14:18:59 - INFO - __main__ - Step 121199: {'lr': 4.528574416341144e-05, 'samples': 23270208, 'steps': 121198, 'loss/train': 1.4142056703567505} 11/07/2021 14:18:59 - INFO - __main__ - Step 121200: {'lr': 4.528269815133429e-05, 'samples': 23270400, 'steps': 121199, 'loss/train': 1.5587519407272339} 11/07/2021 14:19:00 - INFO - __main__ - Step 121201: {'lr': 4.5279652231499574e-05, 'samples': 23270592, 'steps': 121200, 'loss/train': 1.7574905157089233} 11/07/2021 14:19:00 - INFO - __main__ - Step 121202: {'lr': 4.527660640390868e-05, 'samples': 23270784, 'steps': 121201, 'loss/train': 1.1516963243484497} 11/07/2021 14:19:00 - INFO - __main__ - Step 121203: {'lr': 4.527356066856303e-05, 'samples': 23270976, 'steps': 121202, 'loss/train': 0.6912009119987488} 11/07/2021 14:19:01 - INFO - __main__ - Step 121204: {'lr': 4.527051502546392e-05, 'samples': 23271168, 'steps': 121203, 'loss/train': 1.1777018308639526} 11/07/2021 14:19:02 - INFO - __main__ - Step 121205: {'lr': 4.526746947461277e-05, 'samples': 23271360, 'steps': 121204, 'loss/train': 1.4894404411315918} 11/07/2021 14:19:02 - INFO - __main__ - Step 121206: {'lr': 4.526442401601094e-05, 'samples': 23271552, 'steps': 121205, 'loss/train': 1.968587875366211} 11/07/2021 14:19:02 - INFO - __main__ - Step 121207: {'lr': 4.526137864965979e-05, 'samples': 23271744, 'steps': 121206, 'loss/train': 1.2874019145965576} 11/07/2021 14:19:03 - INFO - __main__ - Step 121208: {'lr': 4.5258333375560704e-05, 'samples': 23271936, 'steps': 121207, 'loss/train': 0.9143485426902771} 11/07/2021 14:19:03 - INFO - __main__ - Step 121209: {'lr': 4.525528819371508e-05, 'samples': 23272128, 'steps': 121208, 'loss/train': 1.9250913858413696} 11/07/2021 14:19:04 - INFO - __main__ - Step 121210: {'lr': 4.5252243104124294e-05, 'samples': 23272320, 'steps': 121209, 'loss/train': 1.683364748954773} 11/07/2021 14:19:05 - INFO - __main__ - Step 121211: {'lr': 4.524919810678965e-05, 'samples': 23272512, 'steps': 121210, 'loss/train': 1.0125064849853516} 11/07/2021 14:19:05 - INFO - __main__ - Step 121212: {'lr': 4.524615320171255e-05, 'samples': 23272704, 'steps': 121211, 'loss/train': 1.3555972576141357} 11/07/2021 14:19:05 - INFO - __main__ - Step 121213: {'lr': 4.5243108388894364e-05, 'samples': 23272896, 'steps': 121212, 'loss/train': 1.2238072156906128} 11/07/2021 14:19:06 - INFO - __main__ - Step 121214: {'lr': 4.5240063668336466e-05, 'samples': 23273088, 'steps': 121213, 'loss/train': 1.0285331010818481} 11/07/2021 14:19:07 - INFO - __main__ - Step 121215: {'lr': 4.523701904004027e-05, 'samples': 23273280, 'steps': 121214, 'loss/train': 0.8797420859336853} 11/07/2021 14:19:07 - INFO - __main__ - Step 121216: {'lr': 4.523397450400707e-05, 'samples': 23273472, 'steps': 121215, 'loss/train': 1.163411021232605} 11/07/2021 14:19:08 - INFO - __main__ - Step 121217: {'lr': 4.5230930060238316e-05, 'samples': 23273664, 'steps': 121216, 'loss/train': 0.996482253074646} 11/07/2021 14:19:08 - INFO - __main__ - Step 121218: {'lr': 4.5227885708735317e-05, 'samples': 23273856, 'steps': 121217, 'loss/train': 0.809558629989624} 11/07/2021 14:19:08 - INFO - __main__ - Step 121219: {'lr': 4.522484144949951e-05, 'samples': 23274048, 'steps': 121218, 'loss/train': 1.8035683631896973} 11/07/2021 14:19:09 - INFO - __main__ - Step 121220: {'lr': 4.52217972825322e-05, 'samples': 23274240, 'steps': 121219, 'loss/train': 1.437811255455017} 11/07/2021 14:19:10 - INFO - __main__ - Step 121221: {'lr': 4.521875320783481e-05, 'samples': 23274432, 'steps': 121220, 'loss/train': 1.7696055173873901} 11/07/2021 14:19:10 - INFO - __main__ - Step 121222: {'lr': 4.521570922540866e-05, 'samples': 23274624, 'steps': 121221, 'loss/train': 1.0825624465942383} 11/07/2021 14:19:10 - INFO - __main__ - Step 121223: {'lr': 4.5212665335255226e-05, 'samples': 23274816, 'steps': 121222, 'loss/train': 1.3103758096694946} 11/07/2021 14:19:11 - INFO - __main__ - Step 121224: {'lr': 4.520962153737576e-05, 'samples': 23275008, 'steps': 121223, 'loss/train': 1.4394035339355469} 11/07/2021 14:19:12 - INFO - __main__ - Step 121225: {'lr': 4.520657783177165e-05, 'samples': 23275200, 'steps': 121224, 'loss/train': 1.1577273607254028} 11/07/2021 14:19:12 - INFO - __main__ - Step 121226: {'lr': 4.520353421844434e-05, 'samples': 23275392, 'steps': 121225, 'loss/train': 0.7695355415344238} 11/07/2021 14:19:12 - INFO - __main__ - Step 121227: {'lr': 4.5200490697395126e-05, 'samples': 23275584, 'steps': 121226, 'loss/train': 1.238966703414917} 11/07/2021 14:19:13 - INFO - __main__ - Step 121228: {'lr': 4.5197447268625404e-05, 'samples': 23275776, 'steps': 121227, 'loss/train': 1.3761515617370605} 11/07/2021 14:19:13 - INFO - __main__ - Step 121229: {'lr': 4.519440393213656e-05, 'samples': 23275968, 'steps': 121228, 'loss/train': 1.1497626304626465} 11/07/2021 14:19:13 - INFO - __main__ - Step 121230: {'lr': 4.519136068792995e-05, 'samples': 23276160, 'steps': 121229, 'loss/train': 1.4663677215576172} 11/07/2021 14:19:14 - INFO - __main__ - Step 121231: {'lr': 4.5188317536006964e-05, 'samples': 23276352, 'steps': 121230, 'loss/train': 1.1330381631851196} 11/07/2021 14:19:15 - INFO - __main__ - Step 121232: {'lr': 4.518527447636897e-05, 'samples': 23276544, 'steps': 121231, 'loss/train': 1.2299219369888306} 11/07/2021 14:19:15 - INFO - __main__ - Step 121233: {'lr': 4.518223150901732e-05, 'samples': 23276736, 'steps': 121232, 'loss/train': 0.7646993398666382} 11/07/2021 14:19:16 - INFO - __main__ - Step 121234: {'lr': 4.5179188633953397e-05, 'samples': 23276928, 'steps': 121233, 'loss/train': 1.51641845703125} 11/07/2021 14:19:16 - INFO - __main__ - Step 121235: {'lr': 4.517614585117855e-05, 'samples': 23277120, 'steps': 121234, 'loss/train': 1.0312371253967285} 11/07/2021 14:19:17 - INFO - __main__ - Step 121236: {'lr': 4.517310316069426e-05, 'samples': 23277312, 'steps': 121235, 'loss/train': 1.3749520778656006} 11/07/2021 14:19:17 - INFO - __main__ - Step 121237: {'lr': 4.517006056250175e-05, 'samples': 23277504, 'steps': 121236, 'loss/train': 1.4512202739715576} 11/07/2021 14:19:18 - INFO - __main__ - Step 121238: {'lr': 4.516701805660242e-05, 'samples': 23277696, 'steps': 121237, 'loss/train': 0.8822027444839478} 11/07/2021 14:19:18 - INFO - __main__ - Step 121239: {'lr': 4.516397564299771e-05, 'samples': 23277888, 'steps': 121238, 'loss/train': 1.102943778038025} 11/07/2021 14:19:18 - INFO - __main__ - Step 121240: {'lr': 4.516093332168891e-05, 'samples': 23278080, 'steps': 121239, 'loss/train': 1.249808430671692} 11/07/2021 14:19:19 - INFO - __main__ - Step 121241: {'lr': 4.515789109267746e-05, 'samples': 23278272, 'steps': 121240, 'loss/train': 1.2675747871398926} 11/07/2021 14:19:20 - INFO - __main__ - Step 121242: {'lr': 4.515484895596469e-05, 'samples': 23278464, 'steps': 121241, 'loss/train': 1.7948458194732666} 11/07/2021 14:19:20 - INFO - __main__ - Step 121243: {'lr': 4.5151806911552016e-05, 'samples': 23278656, 'steps': 121242, 'loss/train': 1.7827067375183105} 11/07/2021 14:19:20 - INFO - __main__ - Step 121244: {'lr': 4.514876495944076e-05, 'samples': 23278848, 'steps': 121243, 'loss/train': 1.3974775075912476} 11/07/2021 14:19:21 - INFO - __main__ - Step 121245: {'lr': 4.5145723099632305e-05, 'samples': 23279040, 'steps': 121244, 'loss/train': 0.7350812554359436} 11/07/2021 14:19:22 - INFO - __main__ - Step 121246: {'lr': 4.514268133212801e-05, 'samples': 23279232, 'steps': 121245, 'loss/train': 0.9378039836883545} 11/07/2021 14:19:22 - INFO - __main__ - Step 121247: {'lr': 4.5139639656929274e-05, 'samples': 23279424, 'steps': 121246, 'loss/train': 1.1125694513320923} 11/07/2021 14:19:23 - INFO - __main__ - Step 121248: {'lr': 4.513659807403744e-05, 'samples': 23279616, 'steps': 121247, 'loss/train': 0.9683958292007446} 11/07/2021 14:19:23 - INFO - __main__ - Step 121249: {'lr': 4.5133556583453916e-05, 'samples': 23279808, 'steps': 121248, 'loss/train': 1.3859385251998901} 11/07/2021 14:19:23 - INFO - __main__ - Step 121250: {'lr': 4.5130515185180103e-05, 'samples': 23280000, 'steps': 121249, 'loss/train': 1.1940255165100098} 11/07/2021 14:19:24 - INFO - __main__ - Step 121251: {'lr': 4.512747387921728e-05, 'samples': 23280192, 'steps': 121250, 'loss/train': 1.7719154357910156} 11/07/2021 14:19:25 - INFO - __main__ - Step 121252: {'lr': 4.5124432665566816e-05, 'samples': 23280384, 'steps': 121251, 'loss/train': 1.172684907913208} 11/07/2021 14:19:25 - INFO - __main__ - Step 121253: {'lr': 4.512139154423015e-05, 'samples': 23280576, 'steps': 121252, 'loss/train': 1.3825665712356567} 11/07/2021 14:19:25 - INFO - __main__ - Step 121254: {'lr': 4.5118350515208625e-05, 'samples': 23280768, 'steps': 121253, 'loss/train': 1.3609687089920044} 11/07/2021 14:19:26 - INFO - __main__ - Step 121255: {'lr': 4.51153095785036e-05, 'samples': 23280960, 'steps': 121254, 'loss/train': 1.3604384660720825} 11/07/2021 14:19:27 - INFO - __main__ - Step 121256: {'lr': 4.5112268734116446e-05, 'samples': 23281152, 'steps': 121255, 'loss/train': 1.097865104675293} 11/07/2021 14:19:27 - INFO - __main__ - Step 121257: {'lr': 4.5109227982048555e-05, 'samples': 23281344, 'steps': 121256, 'loss/train': 1.3646835088729858} 11/07/2021 14:19:28 - INFO - __main__ - Step 121258: {'lr': 4.510618732230129e-05, 'samples': 23281536, 'steps': 121257, 'loss/train': 1.2146596908569336} 11/07/2021 14:19:28 - INFO - __main__ - Step 121259: {'lr': 4.510314675487598e-05, 'samples': 23281728, 'steps': 121258, 'loss/train': 1.0309667587280273} 11/07/2021 14:19:28 - INFO - __main__ - Step 121260: {'lr': 4.510010627977407e-05, 'samples': 23281920, 'steps': 121259, 'loss/train': 0.91588294506073} 11/07/2021 14:19:29 - INFO - __main__ - Step 121261: {'lr': 4.5097065896996856e-05, 'samples': 23282112, 'steps': 121260, 'loss/train': 1.425702691078186} 11/07/2021 14:19:30 - INFO - __main__ - Step 121262: {'lr': 4.5094025606545766e-05, 'samples': 23282304, 'steps': 121261, 'loss/train': 1.422767996788025} 11/07/2021 14:19:30 - INFO - __main__ - Step 121263: {'lr': 4.5090985408422216e-05, 'samples': 23282496, 'steps': 121262, 'loss/train': 0.9066232442855835} 11/07/2021 14:19:30 - INFO - __main__ - Step 121264: {'lr': 4.508794530262744e-05, 'samples': 23282688, 'steps': 121263, 'loss/train': 1.4101557731628418} 11/07/2021 14:19:31 - INFO - __main__ - Step 121265: {'lr': 4.508490528916287e-05, 'samples': 23282880, 'steps': 121264, 'loss/train': 1.4057408571243286} 11/07/2021 14:19:31 - INFO - __main__ - Step 121266: {'lr': 4.5081865368029856e-05, 'samples': 23283072, 'steps': 121265, 'loss/train': 1.2347943782806396} 11/07/2021 14:19:32 - INFO - __main__ - Step 121267: {'lr': 4.5078825539229815e-05, 'samples': 23283264, 'steps': 121266, 'loss/train': 1.5028280019760132} 11/07/2021 14:19:33 - INFO - __main__ - Step 121268: {'lr': 4.5075785802764083e-05, 'samples': 23283456, 'steps': 121267, 'loss/train': 1.4493999481201172} 11/07/2021 14:19:33 - INFO - __main__ - Step 121269: {'lr': 4.507274615863405e-05, 'samples': 23283648, 'steps': 121268, 'loss/train': 0.9342155456542969} 11/07/2021 14:19:33 - INFO - __main__ - Step 121270: {'lr': 4.5069706606841064e-05, 'samples': 23283840, 'steps': 121269, 'loss/train': 1.4409056901931763} 11/07/2021 14:19:34 - INFO - __main__ - Step 121271: {'lr': 4.506666714738653e-05, 'samples': 23284032, 'steps': 121270, 'loss/train': 1.184112310409546} 11/07/2021 14:19:35 - INFO - __main__ - Step 121272: {'lr': 4.506362778027176e-05, 'samples': 23284224, 'steps': 121271, 'loss/train': 1.4172887802124023} 11/07/2021 14:19:35 - INFO - __main__ - Step 121273: {'lr': 4.5060588505498184e-05, 'samples': 23284416, 'steps': 121272, 'loss/train': 1.527787685394287} 11/07/2021 14:19:35 - INFO - __main__ - Step 121274: {'lr': 4.505754932306713e-05, 'samples': 23284608, 'steps': 121273, 'loss/train': 1.3884589672088623} 11/07/2021 14:19:36 - INFO - __main__ - Step 121275: {'lr': 4.505451023297999e-05, 'samples': 23284800, 'steps': 121274, 'loss/train': 1.1298128366470337} 11/07/2021 14:19:36 - INFO - __main__ - Step 121276: {'lr': 4.505147123523812e-05, 'samples': 23284992, 'steps': 121275, 'loss/train': 1.0321592092514038} 11/07/2021 14:19:37 - INFO - __main__ - Step 121277: {'lr': 4.504843232984296e-05, 'samples': 23285184, 'steps': 121276, 'loss/train': 1.519288420677185} 11/07/2021 14:19:37 - INFO - __main__ - Step 121278: {'lr': 4.5045393516795735e-05, 'samples': 23285376, 'steps': 121277, 'loss/train': 1.3275926113128662} 11/07/2021 14:19:38 - INFO - __main__ - Step 121279: {'lr': 4.504235479609792e-05, 'samples': 23285568, 'steps': 121278, 'loss/train': 1.5152426958084106} 11/07/2021 14:19:38 - INFO - __main__ - Step 121280: {'lr': 4.5039316167750835e-05, 'samples': 23285760, 'steps': 121279, 'loss/train': 1.3285325765609741} 11/07/2021 14:19:39 - INFO - __main__ - Step 121281: {'lr': 4.503627763175588e-05, 'samples': 23285952, 'steps': 121280, 'loss/train': 1.4736350774765015} 11/07/2021 14:19:40 - INFO - __main__ - Step 121282: {'lr': 4.503323918811442e-05, 'samples': 23286144, 'steps': 121281, 'loss/train': 1.635294795036316} 11/07/2021 14:19:40 - INFO - __main__ - Step 121283: {'lr': 4.503020083682782e-05, 'samples': 23286336, 'steps': 121282, 'loss/train': 1.9837408065795898} 11/07/2021 14:19:40 - INFO - __main__ - Step 121284: {'lr': 4.5027162577897435e-05, 'samples': 23286528, 'steps': 121283, 'loss/train': 1.3467381000518799} 11/07/2021 14:19:41 - INFO - __main__ - Step 121285: {'lr': 4.502412441132467e-05, 'samples': 23286720, 'steps': 121284, 'loss/train': 1.0798190832138062} 11/07/2021 14:19:41 - INFO - __main__ - Step 121286: {'lr': 4.5021086337110856e-05, 'samples': 23286912, 'steps': 121285, 'loss/train': 1.2680469751358032} 11/07/2021 14:19:42 - INFO - __main__ - Step 121287: {'lr': 4.501804835525739e-05, 'samples': 23287104, 'steps': 121286, 'loss/train': 1.3494583368301392} 11/07/2021 14:19:42 - INFO - __main__ - Step 121288: {'lr': 4.501501046576562e-05, 'samples': 23287296, 'steps': 121287, 'loss/train': 1.1022144556045532} 11/07/2021 14:19:43 - INFO - __main__ - Step 121289: {'lr': 4.501197266863691e-05, 'samples': 23287488, 'steps': 121288, 'loss/train': 1.1961946487426758} 11/07/2021 14:19:43 - INFO - __main__ - Step 121290: {'lr': 4.500893496387271e-05, 'samples': 23287680, 'steps': 121289, 'loss/train': 1.3916058540344238} 11/07/2021 14:19:43 - INFO - __main__ - Step 121291: {'lr': 4.500589735147428e-05, 'samples': 23287872, 'steps': 121290, 'loss/train': 1.2953436374664307} 11/07/2021 14:19:45 - INFO - __main__ - Step 121292: {'lr': 4.5002859831443005e-05, 'samples': 23288064, 'steps': 121291, 'loss/train': 0.9919707775115967} 11/07/2021 14:19:45 - INFO - __main__ - Step 121293: {'lr': 4.499982240378028e-05, 'samples': 23288256, 'steps': 121292, 'loss/train': 1.2771791219711304} 11/07/2021 14:19:45 - INFO - __main__ - Step 121294: {'lr': 4.499678506848748e-05, 'samples': 23288448, 'steps': 121293, 'loss/train': 0.6145309209823608} 11/07/2021 14:19:46 - INFO - __main__ - Step 121295: {'lr': 4.499374782556598e-05, 'samples': 23288640, 'steps': 121294, 'loss/train': 1.36825692653656} 11/07/2021 14:19:46 - INFO - __main__ - Step 121296: {'lr': 4.4990710675017114e-05, 'samples': 23288832, 'steps': 121295, 'loss/train': 1.383992075920105} 11/07/2021 14:19:46 - INFO - __main__ - Step 121297: {'lr': 4.498767361684228e-05, 'samples': 23289024, 'steps': 121296, 'loss/train': 1.61213219165802} 11/07/2021 14:19:47 - INFO - __main__ - Step 121298: {'lr': 4.4984636651042826e-05, 'samples': 23289216, 'steps': 121297, 'loss/train': 1.5337032079696655} 11/07/2021 14:19:48 - INFO - __main__ - Step 121299: {'lr': 4.498159977762012e-05, 'samples': 23289408, 'steps': 121298, 'loss/train': 1.2891477346420288} 11/07/2021 14:19:48 - INFO - __main__ - Step 121300: {'lr': 4.497856299657557e-05, 'samples': 23289600, 'steps': 121299, 'loss/train': 0.8990678787231445} 11/07/2021 14:19:48 - INFO - __main__ - Step 121301: {'lr': 4.497552630791049e-05, 'samples': 23289792, 'steps': 121300, 'loss/train': 0.9744924902915955} 11/07/2021 14:19:49 - INFO - __main__ - Step 121302: {'lr': 4.4972489711626294e-05, 'samples': 23289984, 'steps': 121301, 'loss/train': 1.2429901361465454} 11/07/2021 14:19:50 - INFO - __main__ - Step 121303: {'lr': 4.496945320772433e-05, 'samples': 23290176, 'steps': 121302, 'loss/train': 1.2454544305801392} 11/07/2021 14:19:50 - INFO - __main__ - Step 121304: {'lr': 4.4966416796206024e-05, 'samples': 23290368, 'steps': 121303, 'loss/train': 1.2024980783462524} 11/07/2021 14:19:51 - INFO - __main__ - Step 121305: {'lr': 4.49633804770726e-05, 'samples': 23290560, 'steps': 121304, 'loss/train': 0.8918709754943848} 11/07/2021 14:19:51 - INFO - __main__ - Step 121306: {'lr': 4.4960344250325555e-05, 'samples': 23290752, 'steps': 121305, 'loss/train': 1.4599865674972534} 11/07/2021 14:19:51 - INFO - __main__ - Step 121307: {'lr': 4.495730811596618e-05, 'samples': 23290944, 'steps': 121306, 'loss/train': 1.2619379758834839} 11/07/2021 14:19:53 - INFO - __main__ - Step 121308: {'lr': 4.495427207399591e-05, 'samples': 23291136, 'steps': 121307, 'loss/train': 1.0678951740264893} 11/07/2021 14:19:53 - INFO - __main__ - Step 121309: {'lr': 4.4951236124416064e-05, 'samples': 23291328, 'steps': 121308, 'loss/train': 1.581332802772522} 11/07/2021 14:19:53 - INFO - __main__ - Step 121310: {'lr': 4.494820026722801e-05, 'samples': 23291520, 'steps': 121309, 'loss/train': 0.9854121208190918} 11/07/2021 14:19:54 - INFO - __main__ - Step 121311: {'lr': 4.4945164502433164e-05, 'samples': 23291712, 'steps': 121310, 'loss/train': 0.2425001859664917} 11/07/2021 14:19:54 - INFO - __main__ - Step 121312: {'lr': 4.494212883003285e-05, 'samples': 23291904, 'steps': 121311, 'loss/train': 1.3760755062103271} 11/07/2021 14:19:54 - INFO - __main__ - Step 121313: {'lr': 4.493909325002846e-05, 'samples': 23292096, 'steps': 121312, 'loss/train': 1.3493064641952515} 11/07/2021 14:19:55 - INFO - __main__ - Step 121314: {'lr': 4.493605776242132e-05, 'samples': 23292288, 'steps': 121313, 'loss/train': 2.5893471240997314} 11/07/2021 14:19:56 - INFO - __main__ - Step 121315: {'lr': 4.4933022367212864e-05, 'samples': 23292480, 'steps': 121314, 'loss/train': 1.104270100593567} 11/07/2021 14:19:56 - INFO - __main__ - Step 121316: {'lr': 4.492998706440438e-05, 'samples': 23292672, 'steps': 121315, 'loss/train': 1.297422170639038} 11/07/2021 14:19:57 - INFO - __main__ - Step 121317: {'lr': 4.4926951853997365e-05, 'samples': 23292864, 'steps': 121316, 'loss/train': 1.3133312463760376} 11/07/2021 14:19:57 - INFO - __main__ - Step 121318: {'lr': 4.4923916735993056e-05, 'samples': 23293056, 'steps': 121317, 'loss/train': 1.3484909534454346} 11/07/2021 14:19:58 - INFO - __main__ - Step 121319: {'lr': 4.492088171039285e-05, 'samples': 23293248, 'steps': 121318, 'loss/train': 1.7964906692504883} 11/07/2021 14:19:58 - INFO - __main__ - Step 121320: {'lr': 4.491784677719812e-05, 'samples': 23293440, 'steps': 121319, 'loss/train': 0.5836780071258545} 11/07/2021 14:19:59 - INFO - __main__ - Step 121321: {'lr': 4.4914811936410254e-05, 'samples': 23293632, 'steps': 121320, 'loss/train': 1.3108023405075073} 11/07/2021 14:19:59 - INFO - __main__ - Step 121322: {'lr': 4.4911777188030604e-05, 'samples': 23293824, 'steps': 121321, 'loss/train': 1.3443214893341064} 11/07/2021 14:19:59 - INFO - __main__ - Step 121323: {'lr': 4.490874253206056e-05, 'samples': 23294016, 'steps': 121322, 'loss/train': 1.0718169212341309} 11/07/2021 14:20:00 - INFO - __main__ - Step 121324: {'lr': 4.490570796850146e-05, 'samples': 23294208, 'steps': 121323, 'loss/train': 1.0758864879608154} 11/07/2021 14:20:01 - INFO - __main__ - Step 121325: {'lr': 4.490267349735469e-05, 'samples': 23294400, 'steps': 121324, 'loss/train': 1.7481938600540161} 11/07/2021 14:20:01 - INFO - __main__ - Step 121326: {'lr': 4.4899639118621604e-05, 'samples': 23294592, 'steps': 121325, 'loss/train': 1.3657582998275757} 11/07/2021 14:20:02 - INFO - __main__ - Step 121327: {'lr': 4.489660483230357e-05, 'samples': 23294784, 'steps': 121326, 'loss/train': 1.4529753923416138} 11/07/2021 14:20:02 - INFO - __main__ - Step 121328: {'lr': 4.489357063840197e-05, 'samples': 23294976, 'steps': 121327, 'loss/train': 1.022154688835144} 11/07/2021 14:20:03 - INFO - __main__ - Step 121329: {'lr': 4.489053653691816e-05, 'samples': 23295168, 'steps': 121328, 'loss/train': 1.3048101663589478} 11/07/2021 14:20:03 - INFO - __main__ - Step 121330: {'lr': 4.488750252785351e-05, 'samples': 23295360, 'steps': 121329, 'loss/train': 1.3799527883529663} 11/07/2021 14:20:04 - INFO - __main__ - Step 121331: {'lr': 4.4884468611209426e-05, 'samples': 23295552, 'steps': 121330, 'loss/train': 1.2526036500930786} 11/07/2021 14:20:04 - INFO - __main__ - Step 121332: {'lr': 4.4881434786987195e-05, 'samples': 23295744, 'steps': 121331, 'loss/train': 1.4455132484436035} 11/07/2021 14:20:04 - INFO - __main__ - Step 121333: {'lr': 4.4878401055188225e-05, 'samples': 23295936, 'steps': 121332, 'loss/train': 1.3581992387771606} 11/07/2021 14:20:05 - INFO - __main__ - Step 121334: {'lr': 4.4875367415813885e-05, 'samples': 23296128, 'steps': 121333, 'loss/train': 1.160773754119873} 11/07/2021 14:20:06 - INFO - __main__ - Step 121335: {'lr': 4.4872333868865524e-05, 'samples': 23296320, 'steps': 121334, 'loss/train': 1.376145839691162} 11/07/2021 14:20:06 - INFO - __main__ - Step 121336: {'lr': 4.486930041434453e-05, 'samples': 23296512, 'steps': 121335, 'loss/train': 1.2859318256378174} 11/07/2021 14:20:06 - INFO - __main__ - Step 121337: {'lr': 4.4866267052252274e-05, 'samples': 23296704, 'steps': 121336, 'loss/train': 1.2703818082809448} 11/07/2021 14:20:07 - INFO - __main__ - Step 121338: {'lr': 4.486323378259011e-05, 'samples': 23296896, 'steps': 121337, 'loss/train': 1.3346965312957764} 11/07/2021 14:20:07 - INFO - __main__ - Step 121339: {'lr': 4.486020060535942e-05, 'samples': 23297088, 'steps': 121338, 'loss/train': 1.2121641635894775} 11/07/2021 14:20:09 - INFO - __main__ - Step 121340: {'lr': 4.4857167520561544e-05, 'samples': 23297280, 'steps': 121339, 'loss/train': 1.482961893081665} 11/07/2021 14:20:09 - INFO - __main__ - Step 121341: {'lr': 4.4854134528197834e-05, 'samples': 23297472, 'steps': 121340, 'loss/train': 1.305362343788147} 11/07/2021 14:20:09 - INFO - __main__ - Step 121342: {'lr': 4.485110162826972e-05, 'samples': 23297664, 'steps': 121341, 'loss/train': 1.4470115900039673} 11/07/2021 14:20:10 - INFO - __main__ - Step 121343: {'lr': 4.484806882077852e-05, 'samples': 23297856, 'steps': 121342, 'loss/train': 0.105463407933712} 11/07/2021 14:20:10 - INFO - __main__ - Step 121344: {'lr': 4.4845036105725684e-05, 'samples': 23298048, 'steps': 121343, 'loss/train': 0.937712550163269} 11/07/2021 14:20:11 - INFO - __main__ - Step 121345: {'lr': 4.484200348311246e-05, 'samples': 23298240, 'steps': 121344, 'loss/train': 1.754617691040039} 11/07/2021 14:20:11 - INFO - __main__ - Step 121346: {'lr': 4.4838970952940235e-05, 'samples': 23298432, 'steps': 121345, 'loss/train': 0.994785726070404} 11/07/2021 14:20:12 - INFO - __main__ - Step 121347: {'lr': 4.4835938515210426e-05, 'samples': 23298624, 'steps': 121346, 'loss/train': 1.6152199506759644} 11/07/2021 14:20:12 - INFO - __main__ - Step 121348: {'lr': 4.4832906169924356e-05, 'samples': 23298816, 'steps': 121347, 'loss/train': 1.3765047788619995} 11/07/2021 14:20:12 - INFO - __main__ - Step 121349: {'lr': 4.482987391708343e-05, 'samples': 23299008, 'steps': 121348, 'loss/train': 1.4906835556030273} 11/07/2021 14:20:14 - INFO - __main__ - Step 121350: {'lr': 4.4826841756689e-05, 'samples': 23299200, 'steps': 121349, 'loss/train': 1.026109218597412} 11/07/2021 14:20:14 - INFO - __main__ - Step 121351: {'lr': 4.482380968874242e-05, 'samples': 23299392, 'steps': 121350, 'loss/train': 1.256375789642334} 11/07/2021 14:20:14 - INFO - __main__ - Step 121352: {'lr': 4.482077771324508e-05, 'samples': 23299584, 'steps': 121351, 'loss/train': 1.6517333984375} 11/07/2021 14:20:15 - INFO - __main__ - Step 121353: {'lr': 4.481774583019832e-05, 'samples': 23299776, 'steps': 121352, 'loss/train': 1.2684955596923828} 11/07/2021 14:20:15 - INFO - __main__ - Step 121354: {'lr': 4.4814714039603494e-05, 'samples': 23299968, 'steps': 121353, 'loss/train': 1.1188693046569824} 11/07/2021 14:20:16 - INFO - __main__ - Step 121355: {'lr': 4.481168234146202e-05, 'samples': 23300160, 'steps': 121354, 'loss/train': 1.547057032585144} 11/07/2021 14:20:16 - INFO - __main__ - Step 121356: {'lr': 4.4808650735775224e-05, 'samples': 23300352, 'steps': 121355, 'loss/train': 1.1965700387954712} 11/07/2021 14:20:17 - INFO - __main__ - Step 121357: {'lr': 4.480561922254456e-05, 'samples': 23300544, 'steps': 121356, 'loss/train': 0.9310030341148376} 11/07/2021 14:20:17 - INFO - __main__ - Step 121358: {'lr': 4.480258780177124e-05, 'samples': 23300736, 'steps': 121357, 'loss/train': 1.229312539100647} 11/07/2021 14:20:17 - INFO - __main__ - Step 121359: {'lr': 4.4799556473456706e-05, 'samples': 23300928, 'steps': 121358, 'loss/train': 1.1735141277313232} 11/07/2021 14:20:18 - INFO - __main__ - Step 121360: {'lr': 4.4796525237602356e-05, 'samples': 23301120, 'steps': 121359, 'loss/train': 0.41301125288009644} 11/07/2021 14:20:19 - INFO - __main__ - Step 121361: {'lr': 4.479349409420949e-05, 'samples': 23301312, 'steps': 121360, 'loss/train': 1.026417851448059} 11/07/2021 14:20:19 - INFO - __main__ - Step 121362: {'lr': 4.4790463043279524e-05, 'samples': 23301504, 'steps': 121361, 'loss/train': 1.7790377140045166} 11/07/2021 14:20:19 - INFO - __main__ - Step 121363: {'lr': 4.4787432084813814e-05, 'samples': 23301696, 'steps': 121362, 'loss/train': 0.5257319211959839} 11/07/2021 14:20:20 - INFO - __main__ - Step 121364: {'lr': 4.478440121881372e-05, 'samples': 23301888, 'steps': 121363, 'loss/train': 1.5941678285598755} 11/07/2021 14:20:21 - INFO - __main__ - Step 121365: {'lr': 4.478137044528058e-05, 'samples': 23302080, 'steps': 121364, 'loss/train': 1.2999567985534668} 11/07/2021 14:20:21 - INFO - __main__ - Step 121366: {'lr': 4.477833976421583e-05, 'samples': 23302272, 'steps': 121365, 'loss/train': 1.3374627828598022} 11/07/2021 14:20:22 - INFO - __main__ - Step 121367: {'lr': 4.477530917562075e-05, 'samples': 23302464, 'steps': 121366, 'loss/train': 1.2767974138259888} 11/07/2021 14:20:22 - INFO - __main__ - Step 121368: {'lr': 4.4772278679496794e-05, 'samples': 23302656, 'steps': 121367, 'loss/train': 1.3537979125976562} 11/07/2021 14:20:22 - INFO - __main__ - Step 121369: {'lr': 4.476924827584525e-05, 'samples': 23302848, 'steps': 121368, 'loss/train': 1.293164610862732} 11/07/2021 14:20:23 - INFO - __main__ - Step 121370: {'lr': 4.476621796466751e-05, 'samples': 23303040, 'steps': 121369, 'loss/train': 1.4124174118041992} 11/07/2021 14:20:24 - INFO - __main__ - Step 121371: {'lr': 4.476318774596502e-05, 'samples': 23303232, 'steps': 121370, 'loss/train': 1.5950326919555664} 11/07/2021 14:20:24 - INFO - __main__ - Step 121372: {'lr': 4.4760157619739036e-05, 'samples': 23303424, 'steps': 121371, 'loss/train': 1.2507938146591187} 11/07/2021 14:20:24 - INFO - __main__ - Step 121373: {'lr': 4.475712758599093e-05, 'samples': 23303616, 'steps': 121372, 'loss/train': 1.417860507965088} 11/07/2021 14:20:25 - INFO - __main__ - Step 121374: {'lr': 4.47540976447221e-05, 'samples': 23303808, 'steps': 121373, 'loss/train': 1.227600336074829} 11/07/2021 14:20:25 - INFO - __main__ - Step 121375: {'lr': 4.475106779593391e-05, 'samples': 23304000, 'steps': 121374, 'loss/train': 1.51547110080719} 11/07/2021 14:20:26 - INFO - __main__ - Step 121376: {'lr': 4.474803803962771e-05, 'samples': 23304192, 'steps': 121375, 'loss/train': 1.620615839958191} 11/07/2021 14:20:27 - INFO - __main__ - Step 121377: {'lr': 4.4745008375804864e-05, 'samples': 23304384, 'steps': 121376, 'loss/train': 0.8149914145469666} 11/07/2021 14:20:27 - INFO - __main__ - Step 121378: {'lr': 4.474197880446679e-05, 'samples': 23304576, 'steps': 121377, 'loss/train': 1.1299961805343628} 11/07/2021 14:20:27 - INFO - __main__ - Step 121379: {'lr': 4.4738949325614786e-05, 'samples': 23304768, 'steps': 121378, 'loss/train': 1.172013521194458} 11/07/2021 14:20:28 - INFO - __main__ - Step 121380: {'lr': 4.473591993925025e-05, 'samples': 23304960, 'steps': 121379, 'loss/train': 1.496186375617981} 11/07/2021 14:20:29 - INFO - __main__ - Step 121381: {'lr': 4.473289064537453e-05, 'samples': 23305152, 'steps': 121380, 'loss/train': 1.2840757369995117} 11/07/2021 14:20:29 - INFO - __main__ - Step 121382: {'lr': 4.472986144398902e-05, 'samples': 23305344, 'steps': 121381, 'loss/train': 1.272365927696228} 11/07/2021 14:20:29 - INFO - __main__ - Step 121383: {'lr': 4.472683233509506e-05, 'samples': 23305536, 'steps': 121382, 'loss/train': 1.6027330160140991} 11/07/2021 14:20:30 - INFO - __main__ - Step 121384: {'lr': 4.472380331869408e-05, 'samples': 23305728, 'steps': 121383, 'loss/train': 0.9467859864234924} 11/07/2021 14:20:30 - INFO - __main__ - Step 121385: {'lr': 4.4720774394787335e-05, 'samples': 23305920, 'steps': 121384, 'loss/train': 0.9280626773834229} 11/07/2021 14:20:31 - INFO - __main__ - Step 121386: {'lr': 4.471774556337624e-05, 'samples': 23306112, 'steps': 121385, 'loss/train': 1.290555715560913} 11/07/2021 14:20:31 - INFO - __main__ - Step 121387: {'lr': 4.471471682446215e-05, 'samples': 23306304, 'steps': 121386, 'loss/train': 1.0811301469802856} 11/07/2021 14:20:32 - INFO - __main__ - Step 121388: {'lr': 4.471168817804644e-05, 'samples': 23306496, 'steps': 121387, 'loss/train': 1.5494903326034546} 11/07/2021 14:20:32 - INFO - __main__ - Step 121389: {'lr': 4.470865962413048e-05, 'samples': 23306688, 'steps': 121388, 'loss/train': 1.1366273164749146} 11/07/2021 14:20:33 - INFO - __main__ - Step 121390: {'lr': 4.4705631162715645e-05, 'samples': 23306880, 'steps': 121389, 'loss/train': 1.346877932548523} 11/07/2021 14:20:33 - INFO - __main__ - Step 121391: {'lr': 4.470260279380328e-05, 'samples': 23307072, 'steps': 121390, 'loss/train': 1.013796091079712} 11/07/2021 14:20:34 - INFO - __main__ - Step 121392: {'lr': 4.469957451739473e-05, 'samples': 23307264, 'steps': 121391, 'loss/train': 1.490409016609192} 11/07/2021 14:20:34 - INFO - __main__ - Step 121393: {'lr': 4.469654633349141e-05, 'samples': 23307456, 'steps': 121392, 'loss/train': 1.4531230926513672} 11/07/2021 14:20:35 - INFO - __main__ - Step 121394: {'lr': 4.469351824209464e-05, 'samples': 23307648, 'steps': 121393, 'loss/train': 1.5638606548309326} 11/07/2021 14:20:35 - INFO - __main__ - Step 121395: {'lr': 4.469049024320582e-05, 'samples': 23307840, 'steps': 121394, 'loss/train': 1.041141152381897} 11/07/2021 14:20:35 - INFO - __main__ - Step 121396: {'lr': 4.4687462336826275e-05, 'samples': 23308032, 'steps': 121395, 'loss/train': 1.3756217956542969} 11/07/2021 14:20:36 - INFO - __main__ - Step 121397: {'lr': 4.4684434522957425e-05, 'samples': 23308224, 'steps': 121396, 'loss/train': 1.2256247997283936} 11/07/2021 14:20:37 - INFO - __main__ - Step 121398: {'lr': 4.4681406801600626e-05, 'samples': 23308416, 'steps': 121397, 'loss/train': 1.2812498807907104} 11/07/2021 14:20:37 - INFO - __main__ - Step 121399: {'lr': 4.4678379172757186e-05, 'samples': 23308608, 'steps': 121398, 'loss/train': 1.3054367303848267} 11/07/2021 14:20:37 - INFO - __main__ - Step 121400: {'lr': 4.4675351636428466e-05, 'samples': 23308800, 'steps': 121399, 'loss/train': 0.29340660572052} 11/07/2021 14:20:38 - INFO - __main__ - Step 121401: {'lr': 4.467232419261591e-05, 'samples': 23308992, 'steps': 121400, 'loss/train': 0.8541764616966248} 11/07/2021 14:20:39 - INFO - __main__ - Step 121402: {'lr': 4.4669296841320786e-05, 'samples': 23309184, 'steps': 121401, 'loss/train': 1.600915789604187} 11/07/2021 14:20:39 - INFO - __main__ - Step 121403: {'lr': 4.466626958254455e-05, 'samples': 23309376, 'steps': 121402, 'loss/train': 0.9962278604507446} 11/07/2021 14:20:40 - INFO - __main__ - Step 121404: {'lr': 4.4663242416288495e-05, 'samples': 23309568, 'steps': 121403, 'loss/train': 1.4663922786712646} 11/07/2021 14:20:40 - INFO - __main__ - Step 121405: {'lr': 4.466021534255402e-05, 'samples': 23309760, 'steps': 121404, 'loss/train': 1.0030323266983032} 11/07/2021 14:20:40 - INFO - __main__ - Step 121406: {'lr': 4.46571883613425e-05, 'samples': 23309952, 'steps': 121405, 'loss/train': 1.59781014919281} 11/07/2021 14:20:42 - INFO - __main__ - Step 121407: {'lr': 4.465416147265528e-05, 'samples': 23310144, 'steps': 121406, 'loss/train': 1.0924344062805176} 11/07/2021 14:20:42 - INFO - __main__ - Step 121408: {'lr': 4.46511346764937e-05, 'samples': 23310336, 'steps': 121407, 'loss/train': 1.5064022541046143} 11/07/2021 14:20:42 - INFO - __main__ - Step 121409: {'lr': 4.464810797285917e-05, 'samples': 23310528, 'steps': 121408, 'loss/train': 1.008601427078247} 11/07/2021 14:20:43 - INFO - __main__ - Step 121410: {'lr': 4.4645081361753045e-05, 'samples': 23310720, 'steps': 121409, 'loss/train': 1.5546071529388428} 11/07/2021 14:20:43 - INFO - __main__ - Step 121411: {'lr': 4.464205484317671e-05, 'samples': 23310912, 'steps': 121410, 'loss/train': 1.3379555940628052} 11/07/2021 14:20:43 - INFO - __main__ - Step 121412: {'lr': 4.4639028417131465e-05, 'samples': 23311104, 'steps': 121411, 'loss/train': 1.3372522592544556} 11/07/2021 14:20:44 - INFO - __main__ - Step 121413: {'lr': 4.4636002083618675e-05, 'samples': 23311296, 'steps': 121412, 'loss/train': 1.7992346286773682} 11/07/2021 14:20:45 - INFO - __main__ - Step 121414: {'lr': 4.4632975842639756e-05, 'samples': 23311488, 'steps': 121413, 'loss/train': 2.4508230686187744} 11/07/2021 14:20:45 - INFO - __main__ - Step 121415: {'lr': 4.4629949694196035e-05, 'samples': 23311680, 'steps': 121414, 'loss/train': 1.171701431274414} 11/07/2021 14:20:45 - INFO - __main__ - Step 121416: {'lr': 4.462692363828891e-05, 'samples': 23311872, 'steps': 121415, 'loss/train': 1.373429536819458} 11/07/2021 14:20:46 - INFO - __main__ - Step 121417: {'lr': 4.46238976749197e-05, 'samples': 23312064, 'steps': 121416, 'loss/train': 0.4992087185382843} 11/07/2021 14:20:47 - INFO - __main__ - Step 121418: {'lr': 4.46208718040898e-05, 'samples': 23312256, 'steps': 121417, 'loss/train': 0.8213461637496948} 11/07/2021 14:20:47 - INFO - __main__ - Step 121419: {'lr': 4.4617846025800574e-05, 'samples': 23312448, 'steps': 121418, 'loss/train': 0.659742534160614} 11/07/2021 14:20:48 - INFO - __main__ - Step 121420: {'lr': 4.461482034005338e-05, 'samples': 23312640, 'steps': 121419, 'loss/train': 2.756298065185547} 11/07/2021 14:20:48 - INFO - __main__ - Step 121421: {'lr': 4.4611794746849565e-05, 'samples': 23312832, 'steps': 121420, 'loss/train': 1.1163487434387207} 11/07/2021 14:20:48 - INFO - __main__ - Step 121422: {'lr': 4.4608769246190506e-05, 'samples': 23313024, 'steps': 121421, 'loss/train': 0.29488644003868103} 11/07/2021 14:20:49 - INFO - __main__ - Step 121423: {'lr': 4.460574383807764e-05, 'samples': 23313216, 'steps': 121422, 'loss/train': 1.4111746549606323} 11/07/2021 14:20:50 - INFO - __main__ - Step 121424: {'lr': 4.4602718522512184e-05, 'samples': 23313408, 'steps': 121423, 'loss/train': 1.420727252960205} 11/07/2021 14:20:50 - INFO - __main__ - Step 121425: {'lr': 4.459969329949559e-05, 'samples': 23313600, 'steps': 121424, 'loss/train': 1.136582612991333} 11/07/2021 14:20:50 - INFO - __main__ - Step 121426: {'lr': 4.4596668169029184e-05, 'samples': 23313792, 'steps': 121425, 'loss/train': 1.0899230241775513} 11/07/2021 14:20:51 - INFO - __main__ - Step 121427: {'lr': 4.459364313111436e-05, 'samples': 23313984, 'steps': 121426, 'loss/train': 0.8259133100509644} 11/07/2021 14:20:51 - INFO - __main__ - Step 121428: {'lr': 4.459061818575247e-05, 'samples': 23314176, 'steps': 121427, 'loss/train': 1.226034164428711} 11/07/2021 14:20:52 - INFO - __main__ - Step 121429: {'lr': 4.4587593332944874e-05, 'samples': 23314368, 'steps': 121428, 'loss/train': 0.9543644189834595} 11/07/2021 14:20:53 - INFO - __main__ - Step 121430: {'lr': 4.458456857269294e-05, 'samples': 23314560, 'steps': 121429, 'loss/train': 1.3472540378570557} 11/07/2021 14:20:53 - INFO - __main__ - Step 121431: {'lr': 4.4581543904998025e-05, 'samples': 23314752, 'steps': 121430, 'loss/train': 1.0626686811447144} 11/07/2021 14:20:53 - INFO - __main__ - Step 121432: {'lr': 4.457851932986151e-05, 'samples': 23314944, 'steps': 121431, 'loss/train': 1.558348536491394} 11/07/2021 14:20:54 - INFO - __main__ - Step 121433: {'lr': 4.457549484728474e-05, 'samples': 23315136, 'steps': 121432, 'loss/train': 1.4339078664779663} 11/07/2021 14:20:55 - INFO - __main__ - Step 121434: {'lr': 4.457247045726914e-05, 'samples': 23315328, 'steps': 121433, 'loss/train': 0.9415296316146851} 11/07/2021 14:20:55 - INFO - __main__ - Step 121435: {'lr': 4.4569446159815954e-05, 'samples': 23315520, 'steps': 121434, 'loss/train': 0.6572050452232361} 11/07/2021 14:20:55 - INFO - __main__ - Step 121436: {'lr': 4.4566421954926606e-05, 'samples': 23315712, 'steps': 121435, 'loss/train': 1.1472766399383545} 11/07/2021 14:20:56 - INFO - __main__ - Step 121437: {'lr': 4.456339784260246e-05, 'samples': 23315904, 'steps': 121436, 'loss/train': 1.564831256866455} 11/07/2021 14:20:56 - INFO - __main__ - Step 121438: {'lr': 4.4560373822844865e-05, 'samples': 23316096, 'steps': 121437, 'loss/train': 1.2941226959228516} 11/07/2021 14:20:57 - INFO - __main__ - Step 121439: {'lr': 4.4557349895655215e-05, 'samples': 23316288, 'steps': 121438, 'loss/train': 1.332025408744812} 11/07/2021 14:20:57 - INFO - __main__ - Step 121440: {'lr': 4.455432606103485e-05, 'samples': 23316480, 'steps': 121439, 'loss/train': 1.4009876251220703} 11/07/2021 14:20:58 - INFO - __main__ - Step 121441: {'lr': 4.4551302318985134e-05, 'samples': 23316672, 'steps': 121440, 'loss/train': 1.525466799736023} 11/07/2021 14:20:58 - INFO - __main__ - Step 121442: {'lr': 4.454827866950742e-05, 'samples': 23316864, 'steps': 121441, 'loss/train': 1.2364633083343506} 11/07/2021 14:20:58 - INFO - __main__ - Step 121443: {'lr': 4.4545255112603075e-05, 'samples': 23317056, 'steps': 121442, 'loss/train': 1.0656994581222534} 11/07/2021 14:20:59 - INFO - __main__ - Step 121444: {'lr': 4.454223164827348e-05, 'samples': 23317248, 'steps': 121443, 'loss/train': 1.4181491136550903} 11/07/2021 14:21:00 - INFO - __main__ - Step 121445: {'lr': 4.4539208276520056e-05, 'samples': 23317440, 'steps': 121444, 'loss/train': 0.8790913820266724} 11/07/2021 14:21:00 - INFO - __main__ - Step 121446: {'lr': 4.453618499734402e-05, 'samples': 23317632, 'steps': 121445, 'loss/train': 1.4013344049453735} 11/07/2021 14:21:01 - INFO - __main__ - Step 121447: {'lr': 4.453316181074682e-05, 'samples': 23317824, 'steps': 121446, 'loss/train': 1.7379640340805054} 11/07/2021 14:21:01 - INFO - __main__ - Step 121448: {'lr': 4.453013871672978e-05, 'samples': 23318016, 'steps': 121447, 'loss/train': 0.5893751382827759} 11/07/2021 14:21:01 - INFO - __main__ - Step 121449: {'lr': 4.45271157152943e-05, 'samples': 23318208, 'steps': 121448, 'loss/train': 1.877015233039856} 11/07/2021 14:21:02 - INFO - __main__ - Step 121450: {'lr': 4.452409280644176e-05, 'samples': 23318400, 'steps': 121449, 'loss/train': 1.3553636074066162} 11/07/2021 14:21:03 - INFO - __main__ - Step 121451: {'lr': 4.452106999017347e-05, 'samples': 23318592, 'steps': 121450, 'loss/train': 1.2983826398849487} 11/07/2021 14:21:03 - INFO - __main__ - Step 121452: {'lr': 4.451804726649081e-05, 'samples': 23318784, 'steps': 121451, 'loss/train': 1.3252639770507812} 11/07/2021 14:21:03 - INFO - __main__ - Step 121453: {'lr': 4.451502463539517e-05, 'samples': 23318976, 'steps': 121452, 'loss/train': 1.924219012260437} 11/07/2021 14:21:04 - INFO - __main__ - Step 121454: {'lr': 4.451200209688785e-05, 'samples': 23319168, 'steps': 121453, 'loss/train': 1.437687635421753} 11/07/2021 14:21:05 - INFO - __main__ - Step 121455: {'lr': 4.4508979650970285e-05, 'samples': 23319360, 'steps': 121454, 'loss/train': 1.4809064865112305} 11/07/2021 14:21:05 - INFO - __main__ - Step 121456: {'lr': 4.450595729764384e-05, 'samples': 23319552, 'steps': 121455, 'loss/train': 1.171148419380188} 11/07/2021 14:21:06 - INFO - __main__ - Step 121457: {'lr': 4.4502935036909806e-05, 'samples': 23319744, 'steps': 121456, 'loss/train': 1.3732402324676514} 11/07/2021 14:21:06 - INFO - __main__ - Step 121458: {'lr': 4.449991286876956e-05, 'samples': 23319936, 'steps': 121457, 'loss/train': 0.9243550896644592} 11/07/2021 14:21:06 - INFO - __main__ - Step 121459: {'lr': 4.44968907932245e-05, 'samples': 23320128, 'steps': 121458, 'loss/train': 0.6186041235923767} 11/07/2021 14:21:08 - INFO - __main__ - Step 121460: {'lr': 4.449386881027595e-05, 'samples': 23320320, 'steps': 121459, 'loss/train': 1.1945610046386719} 11/07/2021 14:21:08 - INFO - __main__ - Step 121461: {'lr': 4.449084691992528e-05, 'samples': 23320512, 'steps': 121460, 'loss/train': 1.3089759349822998} 11/07/2021 14:21:08 - INFO - __main__ - Step 121462: {'lr': 4.448782512217389e-05, 'samples': 23320704, 'steps': 121461, 'loss/train': 1.38713538646698} 11/07/2021 14:21:09 - INFO - __main__ - Step 121463: {'lr': 4.44848034170231e-05, 'samples': 23320896, 'steps': 121462, 'loss/train': 1.5127629041671753} 11/07/2021 14:21:09 - INFO - __main__ - Step 121464: {'lr': 4.448178180447429e-05, 'samples': 23321088, 'steps': 121463, 'loss/train': 1.2474771738052368} 11/07/2021 14:21:09 - INFO - __main__ - Step 121465: {'lr': 4.4478760284528825e-05, 'samples': 23321280, 'steps': 121464, 'loss/train': 1.514622688293457} 11/07/2021 14:21:11 - INFO - __main__ - Step 121466: {'lr': 4.447573885718806e-05, 'samples': 23321472, 'steps': 121465, 'loss/train': 1.3490777015686035} 11/07/2021 14:21:11 - INFO - __main__ - Step 121467: {'lr': 4.447271752245341e-05, 'samples': 23321664, 'steps': 121466, 'loss/train': 0.6976979970932007} 11/07/2021 14:21:11 - INFO - __main__ - Step 121468: {'lr': 4.4469696280326124e-05, 'samples': 23321856, 'steps': 121467, 'loss/train': 1.0995080471038818} 11/07/2021 14:21:12 - INFO - __main__ - Step 121469: {'lr': 4.44666751308076e-05, 'samples': 23322048, 'steps': 121468, 'loss/train': 2.384829044342041} 11/07/2021 14:21:12 - INFO - __main__ - Step 121470: {'lr': 4.446365407389924e-05, 'samples': 23322240, 'steps': 121469, 'loss/train': 1.3585820198059082} 11/07/2021 14:21:13 - INFO - __main__ - Step 121471: {'lr': 4.446063310960238e-05, 'samples': 23322432, 'steps': 121470, 'loss/train': 1.7104464769363403} 11/07/2021 14:21:13 - INFO - __main__ - Step 121472: {'lr': 4.4457612237918415e-05, 'samples': 23322624, 'steps': 121471, 'loss/train': 1.4573655128479004} 11/07/2021 14:21:14 - INFO - __main__ - Step 121473: {'lr': 4.4454591458848645e-05, 'samples': 23322816, 'steps': 121472, 'loss/train': 1.1751879453659058} 11/07/2021 14:21:14 - INFO - __main__ - Step 121474: {'lr': 4.4451570772394475e-05, 'samples': 23323008, 'steps': 121473, 'loss/train': 1.3986537456512451} 11/07/2021 14:21:15 - INFO - __main__ - Step 121475: {'lr': 4.444855017855726e-05, 'samples': 23323200, 'steps': 121474, 'loss/train': 1.1772428750991821} 11/07/2021 14:21:16 - INFO - __main__ - Step 121476: {'lr': 4.444552967733834e-05, 'samples': 23323392, 'steps': 121475, 'loss/train': 1.3818222284317017} 11/07/2021 14:21:16 - INFO - __main__ - Step 121477: {'lr': 4.4442509268739103e-05, 'samples': 23323584, 'steps': 121476, 'loss/train': 0.868944525718689} 11/07/2021 14:21:17 - INFO - __main__ - Step 121478: {'lr': 4.443948895276098e-05, 'samples': 23323776, 'steps': 121477, 'loss/train': 1.450991153717041} 11/07/2021 14:21:17 - INFO - __main__ - Step 121479: {'lr': 4.4436468729405156e-05, 'samples': 23323968, 'steps': 121478, 'loss/train': 1.242989420890808} 11/07/2021 14:21:17 - INFO - __main__ - Step 121480: {'lr': 4.44334485986731e-05, 'samples': 23324160, 'steps': 121479, 'loss/train': 1.7116591930389404} 11/07/2021 14:21:18 - INFO - __main__ - Step 121481: {'lr': 4.443042856056617e-05, 'samples': 23324352, 'steps': 121480, 'loss/train': 0.6345921754837036} 11/07/2021 14:21:19 - INFO - __main__ - Step 121482: {'lr': 4.442740861508571e-05, 'samples': 23324544, 'steps': 121481, 'loss/train': 0.7317664623260498} 11/07/2021 14:21:20 - INFO - __main__ - Step 121483: {'lr': 4.442438876223309e-05, 'samples': 23324736, 'steps': 121482, 'loss/train': 1.0524276494979858} 11/07/2021 14:21:20 - INFO - __main__ - Step 121484: {'lr': 4.442136900200966e-05, 'samples': 23324928, 'steps': 121483, 'loss/train': 1.5103561878204346} 11/07/2021 14:21:20 - INFO - __main__ - Step 121485: {'lr': 4.4418349334416795e-05, 'samples': 23325120, 'steps': 121484, 'loss/train': 0.8650486469268799} 11/07/2021 14:21:21 - INFO - __main__ - Step 121486: {'lr': 4.441532975945583e-05, 'samples': 23325312, 'steps': 121485, 'loss/train': 0.9024832248687744} 11/07/2021 14:21:21 - INFO - __main__ - Step 121487: {'lr': 4.441231027712819e-05, 'samples': 23325504, 'steps': 121486, 'loss/train': 0.8061333894729614} 11/07/2021 14:21:22 - INFO - __main__ - Step 121488: {'lr': 4.4409290887435146e-05, 'samples': 23325696, 'steps': 121487, 'loss/train': 0.5882074236869812} 11/07/2021 14:21:23 - INFO - __main__ - Step 121489: {'lr': 4.440627159037813e-05, 'samples': 23325888, 'steps': 121488, 'loss/train': 1.1394704580307007} 11/07/2021 14:21:23 - INFO - __main__ - Step 121490: {'lr': 4.440325238595847e-05, 'samples': 23326080, 'steps': 121489, 'loss/train': 0.7999932765960693} 11/07/2021 14:21:23 - INFO - __main__ - Step 121491: {'lr': 4.4400233274177524e-05, 'samples': 23326272, 'steps': 121490, 'loss/train': 0.6676000952720642} 11/07/2021 14:21:24 - INFO - __main__ - Step 121492: {'lr': 4.4397214255036735e-05, 'samples': 23326464, 'steps': 121491, 'loss/train': 1.3872973918914795} 11/07/2021 14:21:25 - INFO - __main__ - Step 121493: {'lr': 4.439419532853731e-05, 'samples': 23326656, 'steps': 121492, 'loss/train': 1.3526864051818848} 11/07/2021 14:21:25 - INFO - __main__ - Step 121494: {'lr': 4.439117649468069e-05, 'samples': 23326848, 'steps': 121493, 'loss/train': 0.7442038655281067} 11/07/2021 14:21:25 - INFO - __main__ - Step 121495: {'lr': 4.438815775346824e-05, 'samples': 23327040, 'steps': 121494, 'loss/train': 1.5041402578353882} 11/07/2021 14:21:26 - INFO - __main__ - Step 121496: {'lr': 4.438513910490133e-05, 'samples': 23327232, 'steps': 121495, 'loss/train': 1.7654401063919067} 11/07/2021 14:21:26 - INFO - __main__ - Step 121497: {'lr': 4.4382120548981275e-05, 'samples': 23327424, 'steps': 121496, 'loss/train': 1.5574532747268677} 11/07/2021 14:21:27 - INFO - __main__ - Step 121498: {'lr': 4.437910208570947e-05, 'samples': 23327616, 'steps': 121497, 'loss/train': 1.587753176689148} 11/07/2021 14:21:28 - INFO - __main__ - Step 121499: {'lr': 4.437608371508728e-05, 'samples': 23327808, 'steps': 121498, 'loss/train': 1.5261963605880737} 11/07/2021 14:21:28 - INFO - __main__ - Step 121500: {'lr': 4.4373065437116057e-05, 'samples': 23328000, 'steps': 121499, 'loss/train': 1.8044655323028564} 11/07/2021 14:21:28 - INFO - __main__ - Step 121501: {'lr': 4.437004725179714e-05, 'samples': 23328192, 'steps': 121500, 'loss/train': 1.2403045892715454} 11/07/2021 14:21:29 - INFO - __main__ - Step 121502: {'lr': 4.436702915913191e-05, 'samples': 23328384, 'steps': 121501, 'loss/train': 1.457614541053772} 11/07/2021 14:21:30 - INFO - __main__ - Step 121503: {'lr': 4.4364011159121727e-05, 'samples': 23328576, 'steps': 121502, 'loss/train': 1.6365119218826294} 11/07/2021 14:21:30 - INFO - __main__ - Step 121504: {'lr': 4.436099325176796e-05, 'samples': 23328768, 'steps': 121503, 'loss/train': 1.1892799139022827} 11/07/2021 14:21:30 - INFO - __main__ - Step 121505: {'lr': 4.435797543707201e-05, 'samples': 23328960, 'steps': 121504, 'loss/train': 1.1104015111923218} 11/07/2021 14:21:31 - INFO - __main__ - Step 121506: {'lr': 4.4354957715035114e-05, 'samples': 23329152, 'steps': 121505, 'loss/train': 1.2384504079818726} 11/07/2021 14:21:31 - INFO - __main__ - Step 121507: {'lr': 4.435194008565871e-05, 'samples': 23329344, 'steps': 121506, 'loss/train': 1.5474363565444946} 11/07/2021 14:21:31 - INFO - __main__ - Step 121508: {'lr': 4.434892254894413e-05, 'samples': 23329536, 'steps': 121507, 'loss/train': 1.2137384414672852} 11/07/2021 14:21:32 - INFO - __main__ - Step 121509: {'lr': 4.434590510489278e-05, 'samples': 23329728, 'steps': 121508, 'loss/train': 1.335909128189087} 11/07/2021 14:21:33 - INFO - __main__ - Step 121510: {'lr': 4.434288775350598e-05, 'samples': 23329920, 'steps': 121509, 'loss/train': 1.2257269620895386} 11/07/2021 14:21:33 - INFO - __main__ - Step 121511: {'lr': 4.433987049478508e-05, 'samples': 23330112, 'steps': 121510, 'loss/train': 1.4577749967575073} 11/07/2021 14:21:34 - INFO - __main__ - Step 121512: {'lr': 4.433685332873147e-05, 'samples': 23330304, 'steps': 121511, 'loss/train': 1.268209457397461} 11/07/2021 14:21:34 - INFO - __main__ - Step 121513: {'lr': 4.433383625534651e-05, 'samples': 23330496, 'steps': 121512, 'loss/train': 1.3946453332901} 11/07/2021 14:21:35 - INFO - __main__ - Step 121514: {'lr': 4.433081927463156e-05, 'samples': 23330688, 'steps': 121513, 'loss/train': 0.13904540240764618} 11/07/2021 14:21:35 - INFO - __main__ - Step 121515: {'lr': 4.4327802386587956e-05, 'samples': 23330880, 'steps': 121514, 'loss/train': 1.1461020708084106} 11/07/2021 14:21:36 - INFO - __main__ - Step 121516: {'lr': 4.432478559121708e-05, 'samples': 23331072, 'steps': 121515, 'loss/train': 1.4146171808242798} 11/07/2021 14:21:36 - INFO - __main__ - Step 121517: {'lr': 4.432176888852027e-05, 'samples': 23331264, 'steps': 121516, 'loss/train': 1.921716570854187} 11/07/2021 14:21:36 - INFO - __main__ - Step 121518: {'lr': 4.4318752278498906e-05, 'samples': 23331456, 'steps': 121517, 'loss/train': 1.1319022178649902} 11/07/2021 14:21:38 - INFO - __main__ - Step 121519: {'lr': 4.4315735761154387e-05, 'samples': 23331648, 'steps': 121518, 'loss/train': 1.7516342401504517} 11/07/2021 14:21:38 - INFO - __main__ - Step 121520: {'lr': 4.431271933648798e-05, 'samples': 23331840, 'steps': 121519, 'loss/train': 1.242613673210144} 11/07/2021 14:21:38 - INFO - __main__ - Step 121521: {'lr': 4.4309703004501074e-05, 'samples': 23332032, 'steps': 121520, 'loss/train': 0.7633126378059387} 11/07/2021 14:21:39 - INFO - __main__ - Step 121522: {'lr': 4.430668676519506e-05, 'samples': 23332224, 'steps': 121521, 'loss/train': 0.6480634212493896} 11/07/2021 14:21:39 - INFO - __main__ - Step 121523: {'lr': 4.430367061857124e-05, 'samples': 23332416, 'steps': 121522, 'loss/train': 0.32609161734580994} 11/07/2021 14:21:40 - INFO - __main__ - Step 121524: {'lr': 4.430065456463106e-05, 'samples': 23332608, 'steps': 121523, 'loss/train': 1.2003610134124756} 11/07/2021 14:21:40 - INFO - __main__ - Step 121525: {'lr': 4.429763860337579e-05, 'samples': 23332800, 'steps': 121524, 'loss/train': 0.6232852339744568} 11/07/2021 14:21:41 - INFO - __main__ - Step 121526: {'lr': 4.429462273480686e-05, 'samples': 23332992, 'steps': 121525, 'loss/train': 1.8774008750915527} 11/07/2021 14:21:41 - INFO - __main__ - Step 121527: {'lr': 4.429160695892559e-05, 'samples': 23333184, 'steps': 121526, 'loss/train': 1.2160167694091797} 11/07/2021 14:21:41 - INFO - __main__ - Step 121528: {'lr': 4.4288591275733345e-05, 'samples': 23333376, 'steps': 121527, 'loss/train': 0.6629692912101746} 11/07/2021 14:21:42 - INFO - __main__ - Step 121529: {'lr': 4.428557568523148e-05, 'samples': 23333568, 'steps': 121528, 'loss/train': 1.0029364824295044} 11/07/2021 14:21:43 - INFO - __main__ - Step 121530: {'lr': 4.428256018742135e-05, 'samples': 23333760, 'steps': 121529, 'loss/train': 0.820447564125061} 11/07/2021 14:21:43 - INFO - __main__ - Step 121531: {'lr': 4.4279544782304365e-05, 'samples': 23333952, 'steps': 121530, 'loss/train': 1.2180835008621216} 11/07/2021 14:21:43 - INFO - __main__ - Step 121532: {'lr': 4.4276529469881866e-05, 'samples': 23334144, 'steps': 121531, 'loss/train': 0.8622405529022217} 11/07/2021 14:21:44 - INFO - __main__ - Step 121533: {'lr': 4.4273514250155135e-05, 'samples': 23334336, 'steps': 121532, 'loss/train': 1.1859071254730225} 11/07/2021 14:21:45 - INFO - __main__ - Step 121534: {'lr': 4.427049912312558e-05, 'samples': 23334528, 'steps': 121533, 'loss/train': 1.261507511138916} 11/07/2021 14:21:45 - INFO - __main__ - Step 121535: {'lr': 4.426748408879458e-05, 'samples': 23334720, 'steps': 121534, 'loss/train': 1.2195476293563843} 11/07/2021 14:21:46 - INFO - __main__ - Step 121536: {'lr': 4.4264469147163475e-05, 'samples': 23334912, 'steps': 121535, 'loss/train': 1.4270588159561157} 11/07/2021 14:21:46 - INFO - __main__ - Step 121537: {'lr': 4.426145429823361e-05, 'samples': 23335104, 'steps': 121536, 'loss/train': 0.15929223597049713} 11/07/2021 14:21:47 - INFO - __main__ - Step 121538: {'lr': 4.425843954200637e-05, 'samples': 23335296, 'steps': 121537, 'loss/train': 0.13451102375984192} 11/07/2021 14:21:47 - INFO - __main__ - Step 121539: {'lr': 4.4255424878483107e-05, 'samples': 23335488, 'steps': 121538, 'loss/train': 0.9524839520454407} 11/07/2021 14:21:48 - INFO - __main__ - Step 121540: {'lr': 4.4252410307665166e-05, 'samples': 23335680, 'steps': 121539, 'loss/train': 1.3863905668258667} 11/07/2021 14:21:48 - INFO - __main__ - Step 121541: {'lr': 4.4249395829553924e-05, 'samples': 23335872, 'steps': 121540, 'loss/train': 1.6936750411987305} 11/07/2021 14:21:49 - INFO - __main__ - Step 121542: {'lr': 4.4246381444150716e-05, 'samples': 23336064, 'steps': 121541, 'loss/train': 1.131351113319397} 11/07/2021 14:21:49 - INFO - __main__ - Step 121543: {'lr': 4.424336715145694e-05, 'samples': 23336256, 'steps': 121542, 'loss/train': 1.3273297548294067} 11/07/2021 14:21:49 - INFO - __main__ - Step 121544: {'lr': 4.4240352951473885e-05, 'samples': 23336448, 'steps': 121543, 'loss/train': 1.3454124927520752} 11/07/2021 14:21:50 - INFO - __main__ - Step 121545: {'lr': 4.423733884420297e-05, 'samples': 23336640, 'steps': 121544, 'loss/train': 1.2501846551895142} 11/07/2021 14:21:51 - INFO - __main__ - Step 121546: {'lr': 4.423432482964562e-05, 'samples': 23336832, 'steps': 121545, 'loss/train': 1.3539011478424072} 11/07/2021 14:21:51 - INFO - __main__ - Step 121547: {'lr': 4.4231310907803026e-05, 'samples': 23337024, 'steps': 121546, 'loss/train': 1.2536420822143555} 11/07/2021 14:21:51 - INFO - __main__ - Step 121548: {'lr': 4.4228297078676625e-05, 'samples': 23337216, 'steps': 121547, 'loss/train': 1.3506031036376953} 11/07/2021 14:21:52 - INFO - __main__ - Step 121549: {'lr': 4.422528334226778e-05, 'samples': 23337408, 'steps': 121548, 'loss/train': 1.424723505973816} 11/07/2021 14:21:53 - INFO - __main__ - Step 121550: {'lr': 4.422226969857785e-05, 'samples': 23337600, 'steps': 121549, 'loss/train': 1.404104471206665} 11/07/2021 14:21:53 - INFO - __main__ - Step 121551: {'lr': 4.421925614760819e-05, 'samples': 23337792, 'steps': 121550, 'loss/train': 1.4050219058990479} 11/07/2021 14:21:53 - INFO - __main__ - Step 121552: {'lr': 4.421624268936017e-05, 'samples': 23337984, 'steps': 121551, 'loss/train': 1.4982702732086182} 11/07/2021 14:21:54 - INFO - __main__ - Step 121553: {'lr': 4.421322932383512e-05, 'samples': 23338176, 'steps': 121552, 'loss/train': 1.2844793796539307} 11/07/2021 14:21:54 - INFO - __main__ - Step 121554: {'lr': 4.4210216051034395e-05, 'samples': 23338368, 'steps': 121553, 'loss/train': 1.5186676979064941} 11/07/2021 14:21:55 - INFO - __main__ - Step 121555: {'lr': 4.420720287095942e-05, 'samples': 23338560, 'steps': 121554, 'loss/train': 0.98911452293396} 11/07/2021 14:21:56 - INFO - __main__ - Step 121556: {'lr': 4.420418978361146e-05, 'samples': 23338752, 'steps': 121555, 'loss/train': 1.3185703754425049} 11/07/2021 14:21:56 - INFO - __main__ - Step 121557: {'lr': 4.420117678899194e-05, 'samples': 23338944, 'steps': 121556, 'loss/train': 1.6589387655258179} 11/07/2021 14:21:56 - INFO - __main__ - Step 121558: {'lr': 4.4198163887102185e-05, 'samples': 23339136, 'steps': 121557, 'loss/train': 1.1714848279953003} 11/07/2021 14:21:57 - INFO - __main__ - Step 121559: {'lr': 4.419515107794361e-05, 'samples': 23339328, 'steps': 121558, 'loss/train': 1.5406171083450317} 11/07/2021 14:21:57 - INFO - __main__ - Step 121560: {'lr': 4.41921383615175e-05, 'samples': 23339520, 'steps': 121559, 'loss/train': 1.334684133529663} 11/07/2021 14:21:58 - INFO - __main__ - Step 121561: {'lr': 4.4189125737825215e-05, 'samples': 23339712, 'steps': 121560, 'loss/train': 1.5785452127456665} 11/07/2021 14:21:58 - INFO - __main__ - Step 121562: {'lr': 4.4186113206868135e-05, 'samples': 23339904, 'steps': 121561, 'loss/train': 2.071460247039795} 11/07/2021 14:21:59 - INFO - __main__ - Step 121563: {'lr': 4.418310076864759e-05, 'samples': 23340096, 'steps': 121562, 'loss/train': 1.5610084533691406} 11/07/2021 14:21:59 - INFO - __main__ - Step 121564: {'lr': 4.418008842316501e-05, 'samples': 23340288, 'steps': 121563, 'loss/train': 1.2971441745758057} 11/07/2021 14:21:59 - INFO - __main__ - Step 121565: {'lr': 4.417707617042169e-05, 'samples': 23340480, 'steps': 121564, 'loss/train': 2.2952818870544434} 11/07/2021 14:22:01 - INFO - __main__ - Step 121566: {'lr': 4.417406401041898e-05, 'samples': 23340672, 'steps': 121565, 'loss/train': 1.2875850200653076} 11/07/2021 14:22:01 - INFO - __main__ - Step 121567: {'lr': 4.417105194315829e-05, 'samples': 23340864, 'steps': 121566, 'loss/train': 1.0309031009674072} 11/07/2021 14:22:01 - INFO - __main__ - Step 121568: {'lr': 4.416803996864094e-05, 'samples': 23341056, 'steps': 121567, 'loss/train': 1.3977304697036743} 11/07/2021 14:22:02 - INFO - __main__ - Step 121569: {'lr': 4.416502808686829e-05, 'samples': 23341248, 'steps': 121568, 'loss/train': 1.0791528224945068} 11/07/2021 14:22:02 - INFO - __main__ - Step 121570: {'lr': 4.416201629784169e-05, 'samples': 23341440, 'steps': 121569, 'loss/train': 2.282163619995117} 11/07/2021 14:22:02 - INFO - __main__ - Step 121571: {'lr': 4.415900460156253e-05, 'samples': 23341632, 'steps': 121570, 'loss/train': 1.583105206489563} 11/07/2021 14:22:03 - INFO - __main__ - Step 121572: {'lr': 4.4155992998032135e-05, 'samples': 23341824, 'steps': 121571, 'loss/train': 1.2536258697509766} 11/07/2021 14:22:04 - INFO - __main__ - Step 121573: {'lr': 4.415298148725194e-05, 'samples': 23342016, 'steps': 121572, 'loss/train': 1.2657511234283447} 11/07/2021 14:22:04 - INFO - __main__ - Step 121574: {'lr': 4.4149970069223165e-05, 'samples': 23342208, 'steps': 121573, 'loss/train': 1.3257688283920288} 11/07/2021 14:22:04 - INFO - __main__ - Step 121575: {'lr': 4.414695874394725e-05, 'samples': 23342400, 'steps': 121574, 'loss/train': 1.2065925598144531} 11/07/2021 14:22:05 - INFO - __main__ - Step 121576: {'lr': 4.414394751142553e-05, 'samples': 23342592, 'steps': 121575, 'loss/train': 1.078678011894226} 11/07/2021 14:22:06 - INFO - __main__ - Step 121577: {'lr': 4.414093637165939e-05, 'samples': 23342784, 'steps': 121576, 'loss/train': 1.3307507038116455} 11/07/2021 14:22:06 - INFO - __main__ - Step 121578: {'lr': 4.4137925324650136e-05, 'samples': 23342976, 'steps': 121577, 'loss/train': 1.692881464958191} 11/07/2021 14:22:07 - INFO - __main__ - Step 121579: {'lr': 4.413491437039918e-05, 'samples': 23343168, 'steps': 121578, 'loss/train': 0.5234664678573608} 11/07/2021 14:22:07 - INFO - __main__ - Step 121580: {'lr': 4.413190350890786e-05, 'samples': 23343360, 'steps': 121579, 'loss/train': 1.5776400566101074} 11/07/2021 14:22:07 - INFO - __main__ - Step 121581: {'lr': 4.4128892740177507e-05, 'samples': 23343552, 'steps': 121580, 'loss/train': 1.6126185655593872} 11/07/2021 14:22:08 - INFO - __main__ - Step 121582: {'lr': 4.41258820642095e-05, 'samples': 23343744, 'steps': 121581, 'loss/train': 1.3642843961715698} 11/07/2021 14:22:09 - INFO - __main__ - Step 121583: {'lr': 4.412287148100519e-05, 'samples': 23343936, 'steps': 121582, 'loss/train': 0.7432518601417542} 11/07/2021 14:22:09 - INFO - __main__ - Step 121584: {'lr': 4.411986099056595e-05, 'samples': 23344128, 'steps': 121583, 'loss/train': 1.6718345880508423} 11/07/2021 14:22:09 - INFO - __main__ - Step 121585: {'lr': 4.411685059289314e-05, 'samples': 23344320, 'steps': 121584, 'loss/train': 1.1402764320373535} 11/07/2021 14:22:10 - INFO - __main__ - Step 121586: {'lr': 4.411384028798812e-05, 'samples': 23344512, 'steps': 121585, 'loss/train': 1.4987961053848267} 11/07/2021 14:22:11 - INFO - __main__ - Step 121587: {'lr': 4.411083007585221e-05, 'samples': 23344704, 'steps': 121586, 'loss/train': 1.2514946460723877} 11/07/2021 14:22:11 - INFO - __main__ - Step 121588: {'lr': 4.4107819956486745e-05, 'samples': 23344896, 'steps': 121587, 'loss/train': 1.2319718599319458} 11/07/2021 14:22:11 - INFO - __main__ - Step 121589: {'lr': 4.410480992989313e-05, 'samples': 23345088, 'steps': 121588, 'loss/train': 1.1482179164886475} 11/07/2021 14:22:12 - INFO - __main__ - Step 121590: {'lr': 4.410179999607272e-05, 'samples': 23345280, 'steps': 121589, 'loss/train': 1.0513792037963867} 11/07/2021 14:22:12 - INFO - __main__ - Step 121591: {'lr': 4.4098790155026855e-05, 'samples': 23345472, 'steps': 121590, 'loss/train': 1.5027233362197876} 11/07/2021 14:22:13 - INFO - __main__ - Step 121592: {'lr': 4.409578040675691e-05, 'samples': 23345664, 'steps': 121591, 'loss/train': 1.4345390796661377} 11/07/2021 14:22:13 - INFO - __main__ - Step 121593: {'lr': 4.409277075126422e-05, 'samples': 23345856, 'steps': 121592, 'loss/train': 1.0906533002853394} 11/07/2021 14:22:14 - INFO - __main__ - Step 121594: {'lr': 4.408976118855012e-05, 'samples': 23346048, 'steps': 121593, 'loss/train': 1.2897871732711792} 11/07/2021 14:22:14 - INFO - __main__ - Step 121595: {'lr': 4.4086751718616036e-05, 'samples': 23346240, 'steps': 121594, 'loss/train': 0.9116547703742981} 11/07/2021 14:22:15 - INFO - __main__ - Step 121596: {'lr': 4.408374234146328e-05, 'samples': 23346432, 'steps': 121595, 'loss/train': 1.3942821025848389} 11/07/2021 14:22:16 - INFO - __main__ - Step 121597: {'lr': 4.40807330570932e-05, 'samples': 23346624, 'steps': 121596, 'loss/train': 1.4323889017105103} 11/07/2021 14:22:16 - INFO - __main__ - Step 121598: {'lr': 4.407772386550718e-05, 'samples': 23346816, 'steps': 121597, 'loss/train': 1.1365678310394287} 11/07/2021 14:22:16 - INFO - __main__ - Step 121599: {'lr': 4.40747147667066e-05, 'samples': 23347008, 'steps': 121598, 'loss/train': 1.3950115442276} 11/07/2021 14:22:17 - INFO - __main__ - Step 121600: {'lr': 4.407170576069272e-05, 'samples': 23347200, 'steps': 121599, 'loss/train': 1.2520296573638916} 11/07/2021 14:22:17 - INFO - __main__ - Step 121601: {'lr': 4.4068696847466977e-05, 'samples': 23347392, 'steps': 121600, 'loss/train': 1.2206109762191772} 11/07/2021 14:22:18 - INFO - __main__ - Step 121602: {'lr': 4.406568802703068e-05, 'samples': 23347584, 'steps': 121601, 'loss/train': 1.5285828113555908} 11/07/2021 14:22:18 - INFO - __main__ - Step 121603: {'lr': 4.406267929938521e-05, 'samples': 23347776, 'steps': 121602, 'loss/train': 1.2902450561523438} 11/07/2021 14:22:19 - INFO - __main__ - Step 121604: {'lr': 4.40596706645319e-05, 'samples': 23347968, 'steps': 121603, 'loss/train': 1.0865535736083984} 11/07/2021 14:22:19 - INFO - __main__ - Step 121605: {'lr': 4.405666212247214e-05, 'samples': 23348160, 'steps': 121604, 'loss/train': 0.9267745018005371} 11/07/2021 14:22:19 - INFO - __main__ - Step 121606: {'lr': 4.4053653673207264e-05, 'samples': 23348352, 'steps': 121605, 'loss/train': 1.2966734170913696} 11/07/2021 14:22:20 - INFO - __main__ - Step 121607: {'lr': 4.4050645316738665e-05, 'samples': 23348544, 'steps': 121606, 'loss/train': 1.2741879224777222} 11/07/2021 14:22:21 - INFO - __main__ - Step 121608: {'lr': 4.4047637053067633e-05, 'samples': 23348736, 'steps': 121607, 'loss/train': 1.4109301567077637} 11/07/2021 14:22:21 - INFO - __main__ - Step 121609: {'lr': 4.404462888219557e-05, 'samples': 23348928, 'steps': 121608, 'loss/train': 1.352618932723999} 11/07/2021 14:22:22 - INFO - __main__ - Step 121610: {'lr': 4.404162080412383e-05, 'samples': 23349120, 'steps': 121609, 'loss/train': 1.3647373914718628} 11/07/2021 14:22:22 - INFO - __main__ - Step 121611: {'lr': 4.403861281885374e-05, 'samples': 23349312, 'steps': 121610, 'loss/train': 1.1527321338653564} 11/07/2021 14:22:22 - INFO - __main__ - Step 121612: {'lr': 4.4035604926386666e-05, 'samples': 23349504, 'steps': 121611, 'loss/train': 1.3425921201705933} 11/07/2021 14:22:23 - INFO - __main__ - Step 121613: {'lr': 4.403259712672406e-05, 'samples': 23349696, 'steps': 121612, 'loss/train': 1.4071776866912842} 11/07/2021 14:22:24 - INFO - __main__ - Step 121614: {'lr': 4.4029589419867096e-05, 'samples': 23349888, 'steps': 121613, 'loss/train': 1.069487452507019} 11/07/2021 14:22:24 - INFO - __main__ - Step 121615: {'lr': 4.402658180581723e-05, 'samples': 23350080, 'steps': 121614, 'loss/train': 1.351895809173584} 11/07/2021 14:22:25 - INFO - __main__ - Step 121616: {'lr': 4.4023574284575815e-05, 'samples': 23350272, 'steps': 121615, 'loss/train': 1.2370884418487549} 11/07/2021 14:22:25 - INFO - __main__ - Step 121617: {'lr': 4.402056685614419e-05, 'samples': 23350464, 'steps': 121616, 'loss/train': 0.1092284694314003} 11/07/2021 14:22:26 - INFO - __main__ - Step 121618: {'lr': 4.401755952052372e-05, 'samples': 23350656, 'steps': 121617, 'loss/train': 1.018211841583252} 11/07/2021 14:22:26 - INFO - __main__ - Step 121619: {'lr': 4.4014552277715783e-05, 'samples': 23350848, 'steps': 121618, 'loss/train': 0.7302078008651733} 11/07/2021 14:22:27 - INFO - __main__ - Step 121620: {'lr': 4.401154512772168e-05, 'samples': 23351040, 'steps': 121619, 'loss/train': 0.8914958238601685} 11/07/2021 14:22:27 - INFO - __main__ - Step 121621: {'lr': 4.400853807054281e-05, 'samples': 23351232, 'steps': 121620, 'loss/train': 0.19197127223014832} 11/07/2021 14:22:27 - INFO - __main__ - Step 121622: {'lr': 4.4005531106180495e-05, 'samples': 23351424, 'steps': 121621, 'loss/train': 1.7470123767852783} 11/07/2021 14:22:28 - INFO - __main__ - Step 121623: {'lr': 4.400252423463613e-05, 'samples': 23351616, 'steps': 121622, 'loss/train': 1.3012107610702515} 11/07/2021 14:22:29 - INFO - __main__ - Step 121624: {'lr': 4.399951745591105e-05, 'samples': 23351808, 'steps': 121623, 'loss/train': 1.3843514919281006} 11/07/2021 14:22:29 - INFO - __main__ - Step 121625: {'lr': 4.39965107700066e-05, 'samples': 23352000, 'steps': 121624, 'loss/train': 1.491215705871582} 11/07/2021 14:22:29 - INFO - __main__ - Step 121626: {'lr': 4.399350417692421e-05, 'samples': 23352192, 'steps': 121625, 'loss/train': 1.1970487833023071} 11/07/2021 14:22:30 - INFO - __main__ - Step 121627: {'lr': 4.39904976766651e-05, 'samples': 23352384, 'steps': 121626, 'loss/train': 1.3191219568252563} 11/07/2021 14:22:31 - INFO - __main__ - Step 121628: {'lr': 4.398749126923071e-05, 'samples': 23352576, 'steps': 121627, 'loss/train': 1.7128218412399292} 11/07/2021 14:22:31 - INFO - __main__ - Step 121629: {'lr': 4.398448495462237e-05, 'samples': 23352768, 'steps': 121628, 'loss/train': 1.2049790620803833} 11/07/2021 14:22:32 - INFO - __main__ - Step 121630: {'lr': 4.398147873284142e-05, 'samples': 23352960, 'steps': 121629, 'loss/train': 1.3054336309432983} 11/07/2021 14:22:32 - INFO - __main__ - Step 121631: {'lr': 4.397847260388926e-05, 'samples': 23353152, 'steps': 121630, 'loss/train': 1.7396059036254883} 11/07/2021 14:22:32 - INFO - __main__ - Step 121632: {'lr': 4.3975466567767214e-05, 'samples': 23353344, 'steps': 121631, 'loss/train': 1.2353477478027344} 11/07/2021 14:22:33 - INFO - __main__ - Step 121633: {'lr': 4.397246062447666e-05, 'samples': 23353536, 'steps': 121632, 'loss/train': 1.278241515159607} 11/07/2021 14:22:34 - INFO - __main__ - Step 121634: {'lr': 4.39694547740189e-05, 'samples': 23353728, 'steps': 121633, 'loss/train': 0.5228225588798523} 11/07/2021 14:22:34 - INFO - __main__ - Step 121635: {'lr': 4.3966449016395346e-05, 'samples': 23353920, 'steps': 121634, 'loss/train': 1.380915641784668} 11/07/2021 14:22:34 - INFO - __main__ - Step 121636: {'lr': 4.3963443351607345e-05, 'samples': 23354112, 'steps': 121635, 'loss/train': 1.356627345085144} 11/07/2021 14:22:35 - INFO - __main__ - Step 121637: {'lr': 4.396043777965622e-05, 'samples': 23354304, 'steps': 121636, 'loss/train': 1.4036412239074707} 11/07/2021 14:22:35 - INFO - __main__ - Step 121638: {'lr': 4.395743230054333e-05, 'samples': 23354496, 'steps': 121637, 'loss/train': 0.5116982460021973} 11/07/2021 14:22:36 - INFO - __main__ - Step 121639: {'lr': 4.395442691427007e-05, 'samples': 23354688, 'steps': 121638, 'loss/train': 1.3580121994018555} 11/07/2021 14:22:36 - INFO - __main__ - Step 121640: {'lr': 4.395142162083782e-05, 'samples': 23354880, 'steps': 121639, 'loss/train': 1.5314862728118896} 11/07/2021 14:22:37 - INFO - __main__ - Step 121641: {'lr': 4.39484164202478e-05, 'samples': 23355072, 'steps': 121640, 'loss/train': 1.1235500574111938} 11/07/2021 14:22:37 - INFO - __main__ - Step 121642: {'lr': 4.394541131250146e-05, 'samples': 23355264, 'steps': 121641, 'loss/train': 1.3698997497558594} 11/07/2021 14:22:38 - INFO - __main__ - Step 121643: {'lr': 4.394240629760013e-05, 'samples': 23355456, 'steps': 121642, 'loss/train': 0.83978271484375} 11/07/2021 14:22:39 - INFO - __main__ - Step 121644: {'lr': 4.393940137554517e-05, 'samples': 23355648, 'steps': 121643, 'loss/train': 1.026244878768921} 11/07/2021 14:22:39 - INFO - __main__ - Step 121645: {'lr': 4.3936396546337936e-05, 'samples': 23355840, 'steps': 121644, 'loss/train': 0.5456420183181763} 11/07/2021 14:22:39 - INFO - __main__ - Step 121646: {'lr': 4.393339180997979e-05, 'samples': 23356032, 'steps': 121645, 'loss/train': 1.1943539381027222} 11/07/2021 14:22:40 - INFO - __main__ - Step 121647: {'lr': 4.3930387166472074e-05, 'samples': 23356224, 'steps': 121646, 'loss/train': 1.4564379453659058} 11/07/2021 14:22:40 - INFO - __main__ - Step 121648: {'lr': 4.3927382615816134e-05, 'samples': 23356416, 'steps': 121647, 'loss/train': 1.0642155408859253} 11/07/2021 14:22:41 - INFO - __main__ - Step 121649: {'lr': 4.392437815801337e-05, 'samples': 23356608, 'steps': 121648, 'loss/train': 1.0150543451309204} 11/07/2021 14:22:42 - INFO - __main__ - Step 121650: {'lr': 4.392137379306507e-05, 'samples': 23356800, 'steps': 121649, 'loss/train': 1.1253985166549683} 11/07/2021 14:22:42 - INFO - __main__ - Step 121651: {'lr': 4.391836952097264e-05, 'samples': 23356992, 'steps': 121650, 'loss/train': 2.35155987739563} 11/07/2021 14:22:43 - INFO - __main__ - Step 121652: {'lr': 4.39153653417374e-05, 'samples': 23357184, 'steps': 121651, 'loss/train': 1.3872417211532593} 11/07/2021 14:22:43 - INFO - __main__ - Step 121653: {'lr': 4.391236125536077e-05, 'samples': 23357376, 'steps': 121652, 'loss/train': 1.3257079124450684} 11/07/2021 14:22:44 - INFO - __main__ - Step 121654: {'lr': 4.3909357261844e-05, 'samples': 23357568, 'steps': 121653, 'loss/train': 1.4586901664733887} 11/07/2021 14:22:44 - INFO - __main__ - Step 121655: {'lr': 4.390635336118848e-05, 'samples': 23357760, 'steps': 121654, 'loss/train': 1.4363337755203247} 11/07/2021 14:22:45 - INFO - __main__ - Step 121656: {'lr': 4.390334955339559e-05, 'samples': 23357952, 'steps': 121655, 'loss/train': 1.1042581796646118} 11/07/2021 14:22:45 - INFO - __main__ - Step 121657: {'lr': 4.3900345838466666e-05, 'samples': 23358144, 'steps': 121656, 'loss/train': 1.4108127355575562} 11/07/2021 14:22:45 - INFO - __main__ - Step 121658: {'lr': 4.3897342216403066e-05, 'samples': 23358336, 'steps': 121657, 'loss/train': 1.702609658241272} 11/07/2021 14:22:46 - INFO - __main__ - Step 121659: {'lr': 4.389433868720616e-05, 'samples': 23358528, 'steps': 121658, 'loss/train': 0.9553170800209045} 11/07/2021 14:22:47 - INFO - __main__ - Step 121660: {'lr': 4.389133525087727e-05, 'samples': 23358720, 'steps': 121659, 'loss/train': 2.0049962997436523} 11/07/2021 14:22:47 - INFO - __main__ - Step 121661: {'lr': 4.388833190741775e-05, 'samples': 23358912, 'steps': 121660, 'loss/train': 1.3401228189468384} 11/07/2021 14:22:48 - INFO - __main__ - Step 121662: {'lr': 4.388532865682898e-05, 'samples': 23359104, 'steps': 121661, 'loss/train': 1.7813650369644165} 11/07/2021 14:22:48 - INFO - __main__ - Step 121663: {'lr': 4.388232549911231e-05, 'samples': 23359296, 'steps': 121662, 'loss/train': 1.2467948198318481} 11/07/2021 14:22:48 - INFO - __main__ - Step 121664: {'lr': 4.387932243426907e-05, 'samples': 23359488, 'steps': 121663, 'loss/train': 1.1404542922973633} 11/07/2021 14:22:49 - INFO - __main__ - Step 121665: {'lr': 4.387631946230064e-05, 'samples': 23359680, 'steps': 121664, 'loss/train': 1.3734002113342285} 11/07/2021 14:22:50 - INFO - __main__ - Step 121666: {'lr': 4.3873316583208336e-05, 'samples': 23359872, 'steps': 121665, 'loss/train': 1.6753846406936646} 11/07/2021 14:22:50 - INFO - __main__ - Step 121667: {'lr': 4.387031379699363e-05, 'samples': 23360064, 'steps': 121666, 'loss/train': 1.2047730684280396} 11/07/2021 14:22:50 - INFO - __main__ - Step 121668: {'lr': 4.3867311103657686e-05, 'samples': 23360256, 'steps': 121667, 'loss/train': 1.3497414588928223} 11/07/2021 14:22:51 - INFO - __main__ - Step 121669: {'lr': 4.3864308503201974e-05, 'samples': 23360448, 'steps': 121668, 'loss/train': 1.1240519285202026} 11/07/2021 14:22:52 - INFO - __main__ - Step 121670: {'lr': 4.386130599562782e-05, 'samples': 23360640, 'steps': 121669, 'loss/train': 1.2134474515914917} 11/07/2021 14:22:52 - INFO - __main__ - Step 121671: {'lr': 4.3858303580936566e-05, 'samples': 23360832, 'steps': 121670, 'loss/train': 1.1206899881362915} 11/07/2021 14:22:53 - INFO - __main__ - Step 121672: {'lr': 4.3855301259129594e-05, 'samples': 23361024, 'steps': 121671, 'loss/train': 0.9508215188980103} 11/07/2021 14:22:53 - INFO - __main__ - Step 121673: {'lr': 4.3852299030208235e-05, 'samples': 23361216, 'steps': 121672, 'loss/train': 1.849990725517273} 11/07/2021 14:22:53 - INFO - __main__ - Step 121674: {'lr': 4.384929689417386e-05, 'samples': 23361408, 'steps': 121673, 'loss/train': 1.653304100036621} 11/07/2021 14:22:54 - INFO - __main__ - Step 121675: {'lr': 4.384629485102778e-05, 'samples': 23361600, 'steps': 121674, 'loss/train': 1.2871910333633423} 11/07/2021 14:22:55 - INFO - __main__ - Step 121676: {'lr': 4.3843292900771407e-05, 'samples': 23361792, 'steps': 121675, 'loss/train': 1.431494116783142} 11/07/2021 14:22:55 - INFO - __main__ - Step 121677: {'lr': 4.3840291043406064e-05, 'samples': 23361984, 'steps': 121676, 'loss/train': 1.0889915227890015} 11/07/2021 14:22:55 - INFO - __main__ - Step 121678: {'lr': 4.3837289278933104e-05, 'samples': 23362176, 'steps': 121677, 'loss/train': 0.9983717799186707} 11/07/2021 14:22:56 - INFO - __main__ - Step 121679: {'lr': 4.383428760735386e-05, 'samples': 23362368, 'steps': 121678, 'loss/train': 1.4387832880020142} 11/07/2021 14:22:56 - INFO - __main__ - Step 121680: {'lr': 4.3831286028669785e-05, 'samples': 23362560, 'steps': 121679, 'loss/train': 1.0337690114974976} 11/07/2021 14:22:57 - INFO - __main__ - Step 121681: {'lr': 4.3828284542882096e-05, 'samples': 23362752, 'steps': 121680, 'loss/train': 1.0605239868164062} 11/07/2021 14:22:58 - INFO - __main__ - Step 121682: {'lr': 4.3825283149992176e-05, 'samples': 23362944, 'steps': 121681, 'loss/train': 0.9827118515968323} 11/07/2021 14:22:58 - INFO - __main__ - Step 121683: {'lr': 4.382228185000142e-05, 'samples': 23363136, 'steps': 121682, 'loss/train': 1.706985592842102} 11/07/2021 14:22:58 - INFO - __main__ - Step 121684: {'lr': 4.381928064291116e-05, 'samples': 23363328, 'steps': 121683, 'loss/train': 1.4101663827896118} 11/07/2021 14:22:59 - INFO - __main__ - Step 121685: {'lr': 4.381627952872275e-05, 'samples': 23363520, 'steps': 121684, 'loss/train': 1.4527018070220947} 11/07/2021 14:22:59 - INFO - __main__ - Step 121686: {'lr': 4.3813278507437546e-05, 'samples': 23363712, 'steps': 121685, 'loss/train': 1.1500499248504639} 11/07/2021 14:23:00 - INFO - __main__ - Step 121687: {'lr': 4.38102775790569e-05, 'samples': 23363904, 'steps': 121686, 'loss/train': 0.7142278552055359} 11/07/2021 14:23:00 - INFO - __main__ - Step 121688: {'lr': 4.380727674358215e-05, 'samples': 23364096, 'steps': 121687, 'loss/train': 1.3771623373031616} 11/07/2021 14:23:01 - INFO - __main__ - Step 121689: {'lr': 4.380427600101466e-05, 'samples': 23364288, 'steps': 121688, 'loss/train': 0.9865142703056335} 11/07/2021 14:23:01 - INFO - __main__ - Step 121690: {'lr': 4.380127535135578e-05, 'samples': 23364480, 'steps': 121689, 'loss/train': 1.471132755279541} 11/07/2021 14:23:01 - INFO - __main__ - Step 121691: {'lr': 4.379827479460688e-05, 'samples': 23364672, 'steps': 121690, 'loss/train': 1.115463376045227} 11/07/2021 14:23:03 - INFO - __main__ - Step 121692: {'lr': 4.379527433076933e-05, 'samples': 23364864, 'steps': 121691, 'loss/train': 1.4353793859481812} 11/07/2021 14:23:03 - INFO - __main__ - Step 121693: {'lr': 4.379227395984442e-05, 'samples': 23365056, 'steps': 121692, 'loss/train': 1.264952540397644} 11/07/2021 14:23:03 - INFO - __main__ - Step 121694: {'lr': 4.37892736818335e-05, 'samples': 23365248, 'steps': 121693, 'loss/train': 0.2969173491001129} 11/07/2021 14:23:04 - INFO - __main__ - Step 121695: {'lr': 4.378627349673797e-05, 'samples': 23365440, 'steps': 121694, 'loss/train': 1.2694823741912842} 11/07/2021 14:23:04 - INFO - __main__ - Step 121696: {'lr': 4.378327340455915e-05, 'samples': 23365632, 'steps': 121695, 'loss/train': 1.283090591430664} 11/07/2021 14:23:05 - INFO - __main__ - Step 121697: {'lr': 4.378027340529841e-05, 'samples': 23365824, 'steps': 121696, 'loss/train': 1.1774898767471313} 11/07/2021 14:23:05 - INFO - __main__ - Step 121698: {'lr': 4.37772734989571e-05, 'samples': 23366016, 'steps': 121697, 'loss/train': 0.7912939786911011} 11/07/2021 14:23:06 - INFO - __main__ - Step 121699: {'lr': 4.3774273685536546e-05, 'samples': 23366208, 'steps': 121698, 'loss/train': 0.5135875344276428} 11/07/2021 14:23:06 - INFO - __main__ - Step 121700: {'lr': 4.377127396503816e-05, 'samples': 23366400, 'steps': 121699, 'loss/train': 1.544105887413025} 11/07/2021 14:23:06 - INFO - __main__ - Step 121701: {'lr': 4.376827433746322e-05, 'samples': 23366592, 'steps': 121700, 'loss/train': 1.4816206693649292} 11/07/2021 14:23:08 - INFO - __main__ - Step 121702: {'lr': 4.3765274802813146e-05, 'samples': 23366784, 'steps': 121701, 'loss/train': 1.16494882106781} 11/07/2021 14:23:08 - INFO - __main__ - Step 121703: {'lr': 4.376227536108929e-05, 'samples': 23366976, 'steps': 121702, 'loss/train': 0.8072142004966736} 11/07/2021 14:23:08 - INFO - __main__ - Step 121704: {'lr': 4.375927601229293e-05, 'samples': 23367168, 'steps': 121703, 'loss/train': 1.2198388576507568} 11/07/2021 14:23:09 - INFO - __main__ - Step 121705: {'lr': 4.3756276756425434e-05, 'samples': 23367360, 'steps': 121704, 'loss/train': 1.2071528434753418} 11/07/2021 14:23:09 - INFO - __main__ - Step 121706: {'lr': 4.3753277593488214e-05, 'samples': 23367552, 'steps': 121705, 'loss/train': 1.2538738250732422} 11/07/2021 14:23:10 - INFO - __main__ - Step 121707: {'lr': 4.3750278523482536e-05, 'samples': 23367744, 'steps': 121706, 'loss/train': 1.4261584281921387} 11/07/2021 14:23:10 - INFO - __main__ - Step 121708: {'lr': 4.374727954640984e-05, 'samples': 23367936, 'steps': 121707, 'loss/train': 1.953545331954956} 11/07/2021 14:23:11 - INFO - __main__ - Step 121709: {'lr': 4.374428066227143e-05, 'samples': 23368128, 'steps': 121708, 'loss/train': 1.0455322265625} 11/07/2021 14:23:11 - INFO - __main__ - Step 121710: {'lr': 4.3741281871068655e-05, 'samples': 23368320, 'steps': 121709, 'loss/train': 1.324471116065979} 11/07/2021 14:23:12 - INFO - __main__ - Step 121711: {'lr': 4.3738283172802874e-05, 'samples': 23368512, 'steps': 121710, 'loss/train': 1.5262632369995117} 11/07/2021 14:23:12 - INFO - __main__ - Step 121712: {'lr': 4.373528456747544e-05, 'samples': 23368704, 'steps': 121711, 'loss/train': 0.5841691493988037} 11/07/2021 14:23:13 - INFO - __main__ - Step 121713: {'lr': 4.373228605508772e-05, 'samples': 23368896, 'steps': 121712, 'loss/train': 1.5554128885269165} 11/07/2021 14:23:13 - INFO - __main__ - Step 121714: {'lr': 4.37292876356411e-05, 'samples': 23369088, 'steps': 121713, 'loss/train': 1.3787603378295898} 11/07/2021 14:23:14 - INFO - __main__ - Step 121715: {'lr': 4.37262893091368e-05, 'samples': 23369280, 'steps': 121714, 'loss/train': 1.4814597368240356} 11/07/2021 14:23:14 - INFO - __main__ - Step 121716: {'lr': 4.372329107557627e-05, 'samples': 23369472, 'steps': 121715, 'loss/train': 1.2657703161239624} 11/07/2021 14:23:14 - INFO - __main__ - Step 121717: {'lr': 4.3720292934960856e-05, 'samples': 23369664, 'steps': 121716, 'loss/train': 1.4789764881134033} 11/07/2021 14:23:15 - INFO - __main__ - Step 121718: {'lr': 4.371729488729187e-05, 'samples': 23369856, 'steps': 121717, 'loss/train': 1.5885370969772339} 11/07/2021 14:23:16 - INFO - __main__ - Step 121719: {'lr': 4.37142969325707e-05, 'samples': 23370048, 'steps': 121718, 'loss/train': 1.1881603002548218} 11/07/2021 14:23:16 - INFO - __main__ - Step 121720: {'lr': 4.371129907079868e-05, 'samples': 23370240, 'steps': 121719, 'loss/train': 1.3417699337005615} 11/07/2021 14:23:16 - INFO - __main__ - Step 121721: {'lr': 4.370830130197717e-05, 'samples': 23370432, 'steps': 121720, 'loss/train': 1.0715965032577515} 11/07/2021 14:23:17 - INFO - __main__ - Step 121722: {'lr': 4.37053036261075e-05, 'samples': 23370624, 'steps': 121721, 'loss/train': 1.6691631078720093} 11/07/2021 14:23:18 - INFO - __main__ - Step 121723: {'lr': 4.370230604319106e-05, 'samples': 23370816, 'steps': 121722, 'loss/train': 1.4425525665283203} 11/07/2021 14:23:18 - INFO - __main__ - Step 121724: {'lr': 4.369930855322915e-05, 'samples': 23371008, 'steps': 121723, 'loss/train': 1.1516175270080566} 11/07/2021 14:23:19 - INFO - __main__ - Step 121725: {'lr': 4.369631115622325e-05, 'samples': 23371200, 'steps': 121724, 'loss/train': 1.228338599205017} 11/07/2021 14:23:19 - INFO - __main__ - Step 121726: {'lr': 4.369331385217451e-05, 'samples': 23371392, 'steps': 121725, 'loss/train': 1.2472560405731201} 11/07/2021 14:23:19 - INFO - __main__ - Step 121727: {'lr': 4.369031664108439e-05, 'samples': 23371584, 'steps': 121726, 'loss/train': 1.3990850448608398} 11/07/2021 14:23:21 - INFO - __main__ - Step 121728: {'lr': 4.368731952295424e-05, 'samples': 23371776, 'steps': 121727, 'loss/train': 1.6182986497879028} 11/07/2021 14:23:21 - INFO - __main__ - Step 121729: {'lr': 4.36843224977854e-05, 'samples': 23371968, 'steps': 121728, 'loss/train': 1.3579905033111572} 11/07/2021 14:23:21 - INFO - __main__ - Step 121730: {'lr': 4.368132556557921e-05, 'samples': 23372160, 'steps': 121729, 'loss/train': 1.4898446798324585} 11/07/2021 14:23:22 - INFO - __main__ - Step 121731: {'lr': 4.3678328726337035e-05, 'samples': 23372352, 'steps': 121730, 'loss/train': 1.198449730873108} 11/07/2021 14:23:22 - INFO - __main__ - Step 121732: {'lr': 4.3675331980060214e-05, 'samples': 23372544, 'steps': 121731, 'loss/train': 1.288648009300232} 11/07/2021 14:23:23 - INFO - __main__ - Step 121733: {'lr': 4.367233532675011e-05, 'samples': 23372736, 'steps': 121732, 'loss/train': 1.158282995223999} 11/07/2021 14:23:23 - INFO - __main__ - Step 121734: {'lr': 4.366933876640808e-05, 'samples': 23372928, 'steps': 121733, 'loss/train': 1.205754280090332} 11/07/2021 14:23:24 - INFO - __main__ - Step 121735: {'lr': 4.366634229903546e-05, 'samples': 23373120, 'steps': 121734, 'loss/train': 1.5436153411865234} 11/07/2021 14:23:24 - INFO - __main__ - Step 121736: {'lr': 4.366334592463364e-05, 'samples': 23373312, 'steps': 121735, 'loss/train': 1.3748927116394043} 11/07/2021 14:23:24 - INFO - __main__ - Step 121737: {'lr': 4.3660349643203894e-05, 'samples': 23373504, 'steps': 121736, 'loss/train': 1.6962937116622925} 11/07/2021 14:23:25 - INFO - __main__ - Step 121738: {'lr': 4.365735345474761e-05, 'samples': 23373696, 'steps': 121737, 'loss/train': 2.1563057899475098} 11/07/2021 14:23:26 - INFO - __main__ - Step 121739: {'lr': 4.365435735926612e-05, 'samples': 23373888, 'steps': 121738, 'loss/train': 0.46080172061920166} 11/07/2021 14:23:26 - INFO - __main__ - Step 121740: {'lr': 4.365136135676082e-05, 'samples': 23374080, 'steps': 121739, 'loss/train': 1.4274766445159912} 11/07/2021 14:23:27 - INFO - __main__ - Step 121741: {'lr': 4.3648365447232994e-05, 'samples': 23374272, 'steps': 121740, 'loss/train': 1.3265594244003296} 11/07/2021 14:23:27 - INFO - __main__ - Step 121742: {'lr': 4.364536963068405e-05, 'samples': 23374464, 'steps': 121741, 'loss/train': 1.4352521896362305} 11/07/2021 14:23:27 - INFO - __main__ - Step 121743: {'lr': 4.364237390711534e-05, 'samples': 23374656, 'steps': 121742, 'loss/train': 1.4189884662628174} 11/07/2021 14:23:28 - INFO - __main__ - Step 121744: {'lr': 4.363937827652817e-05, 'samples': 23374848, 'steps': 121743, 'loss/train': 1.16230046749115} 11/07/2021 14:23:29 - INFO - __main__ - Step 121745: {'lr': 4.363638273892392e-05, 'samples': 23375040, 'steps': 121744, 'loss/train': 1.5745311975479126} 11/07/2021 14:23:29 - INFO - __main__ - Step 121746: {'lr': 4.3633387294303914e-05, 'samples': 23375232, 'steps': 121745, 'loss/train': 1.4053152799606323} 11/07/2021 14:23:29 - INFO - __main__ - Step 121747: {'lr': 4.36303919426696e-05, 'samples': 23375424, 'steps': 121746, 'loss/train': 0.417719304561615} 11/07/2021 14:23:30 - INFO - __main__ - Step 121748: {'lr': 4.3627396684022186e-05, 'samples': 23375616, 'steps': 121747, 'loss/train': 1.5033732652664185} 11/07/2021 14:23:31 - INFO - __main__ - Step 121749: {'lr': 4.3624401518363054e-05, 'samples': 23375808, 'steps': 121748, 'loss/train': 1.4690979719161987} 11/07/2021 14:23:31 - INFO - __main__ - Step 121750: {'lr': 4.3621406445693624e-05, 'samples': 23376000, 'steps': 121749, 'loss/train': 1.3441493511199951} 11/07/2021 14:23:31 - INFO - __main__ - Step 121751: {'lr': 4.3618411466015165e-05, 'samples': 23376192, 'steps': 121750, 'loss/train': 1.8060214519500732} 11/07/2021 14:23:32 - INFO - __main__ - Step 121752: {'lr': 4.3615416579329105e-05, 'samples': 23376384, 'steps': 121751, 'loss/train': 1.188445806503296} 11/07/2021 14:23:32 - INFO - __main__ - Step 121753: {'lr': 4.3612421785636706e-05, 'samples': 23376576, 'steps': 121752, 'loss/train': 0.9483349323272705} 11/07/2021 14:23:33 - INFO - __main__ - Step 121754: {'lr': 4.36094270849394e-05, 'samples': 23376768, 'steps': 121753, 'loss/train': 1.1979397535324097} 11/07/2021 14:23:34 - INFO - __main__ - Step 121755: {'lr': 4.36064324772385e-05, 'samples': 23376960, 'steps': 121754, 'loss/train': 1.256221890449524} 11/07/2021 14:23:34 - INFO - __main__ - Step 121756: {'lr': 4.3603437962535354e-05, 'samples': 23377152, 'steps': 121755, 'loss/train': 1.4215339422225952} 11/07/2021 14:23:34 - INFO - __main__ - Step 121757: {'lr': 4.360044354083128e-05, 'samples': 23377344, 'steps': 121756, 'loss/train': 0.8522151112556458} 11/07/2021 14:23:35 - INFO - __main__ - Step 121758: {'lr': 4.359744921212772e-05, 'samples': 23377536, 'steps': 121757, 'loss/train': 1.4483425617218018} 11/07/2021 14:23:36 - INFO - __main__ - Step 121759: {'lr': 4.3594454976425915e-05, 'samples': 23377728, 'steps': 121758, 'loss/train': 1.2062255144119263} 11/07/2021 14:23:36 - INFO - __main__ - Step 121760: {'lr': 4.359146083372728e-05, 'samples': 23377920, 'steps': 121759, 'loss/train': 1.4043362140655518} 11/07/2021 14:23:36 - INFO - __main__ - Step 121761: {'lr': 4.358846678403322e-05, 'samples': 23378112, 'steps': 121760, 'loss/train': 0.940837562084198} 11/07/2021 14:23:37 - INFO - __main__ - Step 121762: {'lr': 4.358547282734493e-05, 'samples': 23378304, 'steps': 121761, 'loss/train': 1.400369644165039} 11/07/2021 14:23:37 - INFO - __main__ - Step 121763: {'lr': 4.358247896366385e-05, 'samples': 23378496, 'steps': 121762, 'loss/train': 1.7463710308074951} 11/07/2021 14:23:38 - INFO - __main__ - Step 121764: {'lr': 4.3579485192991345e-05, 'samples': 23378688, 'steps': 121763, 'loss/train': 0.9994555711746216} 11/07/2021 14:23:39 - INFO - __main__ - Step 121765: {'lr': 4.357649151532872e-05, 'samples': 23378880, 'steps': 121764, 'loss/train': 1.1289575099945068} 11/07/2021 14:23:39 - INFO - __main__ - Step 121766: {'lr': 4.357349793067733e-05, 'samples': 23379072, 'steps': 121765, 'loss/train': 1.4772039651870728} 11/07/2021 14:23:39 - INFO - __main__ - Step 121767: {'lr': 4.357050443903854e-05, 'samples': 23379264, 'steps': 121766, 'loss/train': 1.337325930595398} 11/07/2021 14:23:40 - INFO - __main__ - Step 121768: {'lr': 4.3567511040413706e-05, 'samples': 23379456, 'steps': 121767, 'loss/train': 1.7020951509475708} 11/07/2021 14:23:41 - INFO - __main__ - Step 121769: {'lr': 4.356451773480416e-05, 'samples': 23379648, 'steps': 121768, 'loss/train': 1.2463191747665405} 11/07/2021 14:23:41 - INFO - __main__ - Step 121770: {'lr': 4.356152452221127e-05, 'samples': 23379840, 'steps': 121769, 'loss/train': 1.4634425640106201} 11/07/2021 14:23:41 - INFO - __main__ - Step 121771: {'lr': 4.355853140263635e-05, 'samples': 23380032, 'steps': 121770, 'loss/train': 1.955907940864563} 11/07/2021 14:23:42 - INFO - __main__ - Step 121772: {'lr': 4.355553837608078e-05, 'samples': 23380224, 'steps': 121771, 'loss/train': 1.386272668838501} 11/07/2021 14:23:42 - INFO - __main__ - Step 121773: {'lr': 4.3552545442545885e-05, 'samples': 23380416, 'steps': 121772, 'loss/train': 1.3119155168533325} 11/07/2021 14:23:43 - INFO - __main__ - Step 121774: {'lr': 4.354955260203311e-05, 'samples': 23380608, 'steps': 121773, 'loss/train': 1.540178894996643} 11/07/2021 14:23:43 - INFO - __main__ - Step 121775: {'lr': 4.3546559854543645e-05, 'samples': 23380800, 'steps': 121774, 'loss/train': 1.456860899925232} 11/07/2021 14:23:44 - INFO - __main__ - Step 121776: {'lr': 4.354356720007893e-05, 'samples': 23380992, 'steps': 121775, 'loss/train': 1.5706076622009277} 11/07/2021 14:23:44 - INFO - __main__ - Step 121777: {'lr': 4.354057463864028e-05, 'samples': 23381184, 'steps': 121776, 'loss/train': 1.4817430973052979} 11/07/2021 14:23:45 - INFO - __main__ - Step 121778: {'lr': 4.353758217022907e-05, 'samples': 23381376, 'steps': 121777, 'loss/train': 1.3900600671768188} 11/07/2021 14:23:46 - INFO - __main__ - Step 121779: {'lr': 4.353458979484665e-05, 'samples': 23381568, 'steps': 121778, 'loss/train': 1.3552985191345215} 11/07/2021 14:23:46 - INFO - __main__ - Step 121780: {'lr': 4.353159751249433e-05, 'samples': 23381760, 'steps': 121779, 'loss/train': 1.3950541019439697} 11/07/2021 14:23:46 - INFO - __main__ - Step 121781: {'lr': 4.352860532317352e-05, 'samples': 23381952, 'steps': 121780, 'loss/train': 1.2551908493041992} 11/07/2021 14:23:47 - INFO - __main__ - Step 121782: {'lr': 4.3525613226885504e-05, 'samples': 23382144, 'steps': 121781, 'loss/train': 1.3452694416046143} 11/07/2021 14:23:47 - INFO - __main__ - Step 121783: {'lr': 4.352262122363168e-05, 'samples': 23382336, 'steps': 121782, 'loss/train': 1.1452566385269165} 11/07/2021 14:23:48 - INFO - __main__ - Step 121784: {'lr': 4.351962931341339e-05, 'samples': 23382528, 'steps': 121783, 'loss/train': 0.3061634302139282} 11/07/2021 14:23:48 - INFO - __main__ - Step 121785: {'lr': 4.3516637496231945e-05, 'samples': 23382720, 'steps': 121784, 'loss/train': 1.4045511484146118} 11/07/2021 14:23:49 - INFO - __main__ - Step 121786: {'lr': 4.351364577208872e-05, 'samples': 23382912, 'steps': 121785, 'loss/train': 1.140820026397705} 11/07/2021 14:23:49 - INFO - __main__ - Step 121787: {'lr': 4.351065414098504e-05, 'samples': 23383104, 'steps': 121786, 'loss/train': 1.5265990495681763} 11/07/2021 14:23:49 - INFO - __main__ - Step 121788: {'lr': 4.3507662602922354e-05, 'samples': 23383296, 'steps': 121787, 'loss/train': 1.716138243675232} 11/07/2021 14:23:50 - INFO - __main__ - Step 121789: {'lr': 4.350467115790188e-05, 'samples': 23383488, 'steps': 121788, 'loss/train': 1.476576805114746} 11/07/2021 14:23:51 - INFO - __main__ - Step 121790: {'lr': 4.350167980592501e-05, 'samples': 23383680, 'steps': 121789, 'loss/train': 0.7330666184425354} 11/07/2021 14:23:51 - INFO - __main__ - Step 121791: {'lr': 4.34986885469931e-05, 'samples': 23383872, 'steps': 121790, 'loss/train': 1.5014643669128418} 11/07/2021 14:23:52 - INFO - __main__ - Step 121792: {'lr': 4.349569738110748e-05, 'samples': 23384064, 'steps': 121791, 'loss/train': 1.1261329650878906} 11/07/2021 14:23:52 - INFO - __main__ - Step 121793: {'lr': 4.349270630826952e-05, 'samples': 23384256, 'steps': 121792, 'loss/train': 1.2657594680786133} 11/07/2021 14:23:52 - INFO - __main__ - Step 121794: {'lr': 4.3489715328480535e-05, 'samples': 23384448, 'steps': 121793, 'loss/train': 1.374606728553772} 11/07/2021 14:23:53 - INFO - __main__ - Step 121795: {'lr': 4.348672444174192e-05, 'samples': 23384640, 'steps': 121794, 'loss/train': 1.3904181718826294} 11/07/2021 14:23:54 - INFO - __main__ - Step 121796: {'lr': 4.3483733648055023e-05, 'samples': 23384832, 'steps': 121795, 'loss/train': 1.3334580659866333} 11/07/2021 14:23:54 - INFO - __main__ - Step 121797: {'lr': 4.348074294742113e-05, 'samples': 23385024, 'steps': 121796, 'loss/train': 1.5012863874435425} 11/07/2021 14:23:54 - INFO - __main__ - Step 121798: {'lr': 4.3477752339841634e-05, 'samples': 23385216, 'steps': 121797, 'loss/train': 0.7903628349304199} 11/07/2021 14:23:55 - INFO - __main__ - Step 121799: {'lr': 4.34747618253179e-05, 'samples': 23385408, 'steps': 121798, 'loss/train': 0.979907751083374} 11/07/2021 14:23:56 - INFO - __main__ - Step 121800: {'lr': 4.347177140385122e-05, 'samples': 23385600, 'steps': 121799, 'loss/train': 1.8648492097854614} 11/07/2021 14:23:56 - INFO - __main__ - Step 121801: {'lr': 4.3468781075443084e-05, 'samples': 23385792, 'steps': 121800, 'loss/train': 1.1616119146347046} 11/07/2021 14:23:57 - INFO - __main__ - Step 121802: {'lr': 4.346579084009461e-05, 'samples': 23385984, 'steps': 121801, 'loss/train': 1.4825681447982788} 11/07/2021 14:23:57 - INFO - __main__ - Step 121803: {'lr': 4.346280069780731e-05, 'samples': 23386176, 'steps': 121802, 'loss/train': 1.1031330823898315} 11/07/2021 14:23:57 - INFO - __main__ - Step 121804: {'lr': 4.345981064858245e-05, 'samples': 23386368, 'steps': 121803, 'loss/train': 1.5041390657424927} 11/07/2021 14:23:58 - INFO - __main__ - Step 121805: {'lr': 4.345682069242144e-05, 'samples': 23386560, 'steps': 121804, 'loss/train': 1.1306653022766113} 11/07/2021 14:23:59 - INFO - __main__ - Step 121806: {'lr': 4.345383082932558e-05, 'samples': 23386752, 'steps': 121805, 'loss/train': 1.1392951011657715} 11/07/2021 14:23:59 - INFO - __main__ - Step 121807: {'lr': 4.345084105929622e-05, 'samples': 23386944, 'steps': 121806, 'loss/train': 1.8952833414077759} 11/07/2021 14:23:59 - INFO - __main__ - Step 121808: {'lr': 4.3447851382334755e-05, 'samples': 23387136, 'steps': 121807, 'loss/train': 1.283473014831543} 11/07/2021 14:24:00 - INFO - __main__ - Step 121809: {'lr': 4.3444861798442483e-05, 'samples': 23387328, 'steps': 121808, 'loss/train': 0.9610525369644165} 11/07/2021 14:24:00 - INFO - __main__ - Step 121810: {'lr': 4.344187230762078e-05, 'samples': 23387520, 'steps': 121809, 'loss/train': 1.012974500656128} 11/07/2021 14:24:01 - INFO - __main__ - Step 121811: {'lr': 4.3438882909870967e-05, 'samples': 23387712, 'steps': 121810, 'loss/train': 0.9024028778076172} 11/07/2021 14:24:01 - INFO - __main__ - Step 121812: {'lr': 4.3435893605194425e-05, 'samples': 23387904, 'steps': 121811, 'loss/train': 1.5941979885101318} 11/07/2021 14:24:02 - INFO - __main__ - Step 121813: {'lr': 4.343290439359249e-05, 'samples': 23388096, 'steps': 121812, 'loss/train': 1.5989751815795898} 11/07/2021 14:24:02 - INFO - __main__ - Step 121814: {'lr': 4.3429915275066486e-05, 'samples': 23388288, 'steps': 121813, 'loss/train': 1.5281795263290405} 11/07/2021 14:24:03 - INFO - __main__ - Step 121815: {'lr': 4.342692624961783e-05, 'samples': 23388480, 'steps': 121814, 'loss/train': 0.7826485633850098} 11/07/2021 14:24:04 - INFO - __main__ - Step 121816: {'lr': 4.342393731724775e-05, 'samples': 23388672, 'steps': 121815, 'loss/train': 1.6211825609207153} 11/07/2021 14:24:04 - INFO - __main__ - Step 121817: {'lr': 4.342094847795766e-05, 'samples': 23388864, 'steps': 121816, 'loss/train': 1.1133852005004883} 11/07/2021 14:24:04 - INFO - __main__ - Step 121818: {'lr': 4.341795973174892e-05, 'samples': 23389056, 'steps': 121817, 'loss/train': 0.6271378993988037} 11/07/2021 14:24:05 - INFO - __main__ - Step 121819: {'lr': 4.341497107862283e-05, 'samples': 23389248, 'steps': 121818, 'loss/train': 1.0209450721740723} 11/07/2021 14:24:05 - INFO - __main__ - Step 121820: {'lr': 4.341198251858078e-05, 'samples': 23389440, 'steps': 121819, 'loss/train': 1.3075364828109741} 11/07/2021 14:24:07 - INFO - __main__ - Step 121821: {'lr': 4.340899405162413e-05, 'samples': 23389632, 'steps': 121820, 'loss/train': 1.3767529726028442} 11/07/2021 14:24:07 - INFO - __main__ - Step 121822: {'lr': 4.3406005677754156e-05, 'samples': 23389824, 'steps': 121821, 'loss/train': 1.3591856956481934} 11/07/2021 14:24:08 - INFO - __main__ - Step 121823: {'lr': 4.3403017396972275e-05, 'samples': 23390016, 'steps': 121822, 'loss/train': 0.679821789264679} 11/07/2021 14:24:08 - INFO - __main__ - Step 121824: {'lr': 4.3400029209279795e-05, 'samples': 23390208, 'steps': 121823, 'loss/train': 0.8435695171356201} 11/07/2021 14:24:08 - INFO - __main__ - Step 121825: {'lr': 4.339704111467807e-05, 'samples': 23390400, 'steps': 121824, 'loss/train': 1.0256322622299194} 11/07/2021 14:24:09 - INFO - __main__ - Step 121826: {'lr': 4.3394053113168464e-05, 'samples': 23390592, 'steps': 121825, 'loss/train': 2.5520732402801514} 11/07/2021 14:24:09 - INFO - __main__ - Step 121827: {'lr': 4.339106520475231e-05, 'samples': 23390784, 'steps': 121826, 'loss/train': 2.694129467010498} 11/07/2021 14:24:09 - INFO - __main__ - Step 121828: {'lr': 4.338807738943098e-05, 'samples': 23390976, 'steps': 121827, 'loss/train': 2.62225604057312} 11/07/2021 14:24:10 - INFO - __main__ - Step 121829: {'lr': 4.338508966720578e-05, 'samples': 23391168, 'steps': 121828, 'loss/train': 1.2588427066802979} 11/07/2021 14:24:11 - INFO - __main__ - Step 121830: {'lr': 4.3382102038078046e-05, 'samples': 23391360, 'steps': 121829, 'loss/train': 1.385772705078125} 11/07/2021 14:24:11 - INFO - __main__ - Step 121831: {'lr': 4.337911450204915e-05, 'samples': 23391552, 'steps': 121830, 'loss/train': 1.5239794254302979} 11/07/2021 14:24:11 - INFO - __main__ - Step 121832: {'lr': 4.337612705912045e-05, 'samples': 23391744, 'steps': 121831, 'loss/train': 1.8347094058990479} 11/07/2021 14:24:12 - INFO - __main__ - Step 121833: {'lr': 4.337313970929327e-05, 'samples': 23391936, 'steps': 121832, 'loss/train': 1.1829134225845337} 11/07/2021 14:24:13 - INFO - __main__ - Step 121834: {'lr': 4.3370152452568954e-05, 'samples': 23392128, 'steps': 121833, 'loss/train': 1.1389020681381226} 11/07/2021 14:24:13 - INFO - __main__ - Step 121835: {'lr': 4.3367165288948876e-05, 'samples': 23392320, 'steps': 121834, 'loss/train': 1.4099801778793335} 11/07/2021 14:24:14 - INFO - __main__ - Step 121836: {'lr': 4.336417821843436e-05, 'samples': 23392512, 'steps': 121835, 'loss/train': 1.0653660297393799} 11/07/2021 14:24:14 - INFO - __main__ - Step 121837: {'lr': 4.336119124102675e-05, 'samples': 23392704, 'steps': 121836, 'loss/train': 0.9880668520927429} 11/07/2021 14:24:14 - INFO - __main__ - Step 121838: {'lr': 4.335820435672738e-05, 'samples': 23392896, 'steps': 121837, 'loss/train': 1.4833060503005981} 11/07/2021 14:24:15 - INFO - __main__ - Step 121839: {'lr': 4.335521756553765e-05, 'samples': 23393088, 'steps': 121838, 'loss/train': 1.139548897743225} 11/07/2021 14:24:16 - INFO - __main__ - Step 121840: {'lr': 4.3352230867458845e-05, 'samples': 23393280, 'steps': 121839, 'loss/train': 0.6898266077041626} 11/07/2021 14:24:16 - INFO - __main__ - Step 121841: {'lr': 4.334924426249243e-05, 'samples': 23393472, 'steps': 121840, 'loss/train': 0.9433724880218506} 11/07/2021 14:24:16 - INFO - __main__ - Step 121842: {'lr': 4.334625775063958e-05, 'samples': 23393664, 'steps': 121841, 'loss/train': 1.161109447479248} 11/07/2021 14:24:17 - INFO - __main__ - Step 121843: {'lr': 4.334327133190169e-05, 'samples': 23393856, 'steps': 121842, 'loss/train': 0.94483482837677} 11/07/2021 14:24:18 - INFO - __main__ - Step 121844: {'lr': 4.334028500628015e-05, 'samples': 23394048, 'steps': 121843, 'loss/train': 1.4098546504974365} 11/07/2021 14:24:18 - INFO - __main__ - Step 121845: {'lr': 4.333729877377632e-05, 'samples': 23394240, 'steps': 121844, 'loss/train': 2.2233548164367676} 11/07/2021 14:24:18 - INFO - __main__ - Step 121846: {'lr': 4.333431263439147e-05, 'samples': 23394432, 'steps': 121845, 'loss/train': 1.315725564956665} 11/07/2021 14:24:19 - INFO - __main__ - Step 121847: {'lr': 4.333132658812702e-05, 'samples': 23394624, 'steps': 121846, 'loss/train': 0.746740996837616} 11/07/2021 14:24:19 - INFO - __main__ - Step 121848: {'lr': 4.332834063498425e-05, 'samples': 23394816, 'steps': 121847, 'loss/train': 1.1905310153961182} 11/07/2021 14:24:20 - INFO - __main__ - Step 121849: {'lr': 4.3325354774964576e-05, 'samples': 23395008, 'steps': 121848, 'loss/train': 1.2911611795425415} 11/07/2021 14:24:20 - INFO - __main__ - Step 121850: {'lr': 4.3322369008069296e-05, 'samples': 23395200, 'steps': 121849, 'loss/train': 1.1167961359024048} 11/07/2021 14:24:21 - INFO - __main__ - Step 121851: {'lr': 4.331938333429977e-05, 'samples': 23395392, 'steps': 121850, 'loss/train': 1.6891474723815918} 11/07/2021 14:24:21 - INFO - __main__ - Step 121852: {'lr': 4.331639775365734e-05, 'samples': 23395584, 'steps': 121851, 'loss/train': 1.359540581703186} 11/07/2021 14:24:21 - INFO - __main__ - Step 121853: {'lr': 4.331341226614335e-05, 'samples': 23395776, 'steps': 121852, 'loss/train': 1.2521320581436157} 11/07/2021 14:24:22 - INFO - __main__ - Step 121854: {'lr': 4.331042687175915e-05, 'samples': 23395968, 'steps': 121853, 'loss/train': 1.0905983448028564} 11/07/2021 14:24:23 - INFO - __main__ - Step 121855: {'lr': 4.3307441570506116e-05, 'samples': 23396160, 'steps': 121854, 'loss/train': 1.6358990669250488} 11/07/2021 14:24:23 - INFO - __main__ - Step 121856: {'lr': 4.330445636238553e-05, 'samples': 23396352, 'steps': 121855, 'loss/train': 0.6153882145881653} 11/07/2021 14:24:23 - INFO - __main__ - Step 121857: {'lr': 4.330147124739875e-05, 'samples': 23396544, 'steps': 121856, 'loss/train': 1.0879437923431396} 11/07/2021 14:24:24 - INFO - __main__ - Step 121858: {'lr': 4.329848622554716e-05, 'samples': 23396736, 'steps': 121857, 'loss/train': 1.4706145524978638} 11/07/2021 14:24:25 - INFO - __main__ - Step 121859: {'lr': 4.329550129683207e-05, 'samples': 23396928, 'steps': 121858, 'loss/train': 0.7175898551940918} 11/07/2021 14:24:25 - INFO - __main__ - Step 121860: {'lr': 4.329251646125482e-05, 'samples': 23397120, 'steps': 121859, 'loss/train': 1.0140619277954102} 11/07/2021 14:24:26 - INFO - __main__ - Step 121861: {'lr': 4.32895317188168e-05, 'samples': 23397312, 'steps': 121860, 'loss/train': 1.433599829673767} 11/07/2021 14:24:26 - INFO - __main__ - Step 121862: {'lr': 4.3286547069519316e-05, 'samples': 23397504, 'steps': 121861, 'loss/train': 1.4780806303024292} 11/07/2021 14:24:26 - INFO - __main__ - Step 121863: {'lr': 4.3283562513363714e-05, 'samples': 23397696, 'steps': 121862, 'loss/train': 1.5324935913085938} 11/07/2021 14:24:28 - INFO - __main__ - Step 121864: {'lr': 4.328057805035135e-05, 'samples': 23397888, 'steps': 121863, 'loss/train': 1.5176328420639038} 11/07/2021 14:24:28 - INFO - __main__ - Step 121865: {'lr': 4.327759368048359e-05, 'samples': 23398080, 'steps': 121864, 'loss/train': 0.786799430847168} 11/07/2021 14:24:28 - INFO - __main__ - Step 121866: {'lr': 4.327460940376174e-05, 'samples': 23398272, 'steps': 121865, 'loss/train': 1.3970890045166016} 11/07/2021 14:24:29 - INFO - __main__ - Step 121867: {'lr': 4.327162522018715e-05, 'samples': 23398464, 'steps': 121866, 'loss/train': 1.0967068672180176} 11/07/2021 14:24:29 - INFO - __main__ - Step 121868: {'lr': 4.326864112976125e-05, 'samples': 23398656, 'steps': 121867, 'loss/train': 1.881991982460022} 11/07/2021 14:24:29 - INFO - __main__ - Step 121869: {'lr': 4.326565713248526e-05, 'samples': 23398848, 'steps': 121868, 'loss/train': 1.6323078870773315} 11/07/2021 14:24:30 - INFO - __main__ - Step 121870: {'lr': 4.326267322836056e-05, 'samples': 23399040, 'steps': 121869, 'loss/train': 1.7788646221160889} 11/07/2021 14:24:31 - INFO - __main__ - Step 121871: {'lr': 4.3259689417388534e-05, 'samples': 23399232, 'steps': 121870, 'loss/train': 1.3894232511520386} 11/07/2021 14:24:31 - INFO - __main__ - Step 121872: {'lr': 4.325670569957046e-05, 'samples': 23399424, 'steps': 121871, 'loss/train': 1.1124317646026611} 11/07/2021 14:24:31 - INFO - __main__ - Step 121873: {'lr': 4.325372207490774e-05, 'samples': 23399616, 'steps': 121872, 'loss/train': 1.471025824546814} 11/07/2021 14:24:32 - INFO - __main__ - Step 121874: {'lr': 4.325073854340172e-05, 'samples': 23399808, 'steps': 121873, 'loss/train': 1.2938330173492432} 11/07/2021 14:24:32 - INFO - __main__ - Step 121875: {'lr': 4.3247755105053715e-05, 'samples': 23400000, 'steps': 121874, 'loss/train': 1.042295217514038} 11/07/2021 14:24:34 - INFO - __main__ - Step 121876: {'lr': 4.3244771759865075e-05, 'samples': 23400192, 'steps': 121875, 'loss/train': 1.6085659265518188} 11/07/2021 14:24:34 - INFO - __main__ - Step 121877: {'lr': 4.3241788507837164e-05, 'samples': 23400384, 'steps': 121876, 'loss/train': 1.3975071907043457} 11/07/2021 14:24:34 - INFO - __main__ - Step 121878: {'lr': 4.323880534897129e-05, 'samples': 23400576, 'steps': 121877, 'loss/train': 1.5059425830841064} 11/07/2021 14:24:35 - INFO - __main__ - Step 121879: {'lr': 4.3235822283268835e-05, 'samples': 23400768, 'steps': 121878, 'loss/train': 1.1622416973114014} 11/07/2021 14:24:35 - INFO - __main__ - Step 121880: {'lr': 4.3232839310731134e-05, 'samples': 23400960, 'steps': 121879, 'loss/train': 1.329095721244812} 11/07/2021 14:24:36 - INFO - __main__ - Step 121881: {'lr': 4.3229856431359515e-05, 'samples': 23401152, 'steps': 121880, 'loss/train': 0.7852922677993774} 11/07/2021 14:24:36 - INFO - __main__ - Step 121882: {'lr': 4.32268736451554e-05, 'samples': 23401344, 'steps': 121881, 'loss/train': 0.8122729659080505} 11/07/2021 14:24:37 - INFO - __main__ - Step 121883: {'lr': 4.322389095212001e-05, 'samples': 23401536, 'steps': 121882, 'loss/train': 1.32456636428833} 11/07/2021 14:24:37 - INFO - __main__ - Step 121884: {'lr': 4.322090835225473e-05, 'samples': 23401728, 'steps': 121883, 'loss/train': 1.2825875282287598} 11/07/2021 14:24:37 - INFO - __main__ - Step 121885: {'lr': 4.321792584556092e-05, 'samples': 23401920, 'steps': 121884, 'loss/train': 1.156847596168518} 11/07/2021 14:24:39 - INFO - __main__ - Step 121886: {'lr': 4.321494343203994e-05, 'samples': 23402112, 'steps': 121885, 'loss/train': 1.3900212049484253} 11/07/2021 14:24:39 - INFO - __main__ - Step 121887: {'lr': 4.321196111169309e-05, 'samples': 23402304, 'steps': 121886, 'loss/train': 1.4310417175292969} 11/07/2021 14:24:40 - INFO - __main__ - Step 121888: {'lr': 4.3208978884521744e-05, 'samples': 23402496, 'steps': 121887, 'loss/train': 1.1838229894638062} 11/07/2021 14:24:40 - INFO - __main__ - Step 121889: {'lr': 4.3205996750527246e-05, 'samples': 23402688, 'steps': 121888, 'loss/train': 1.256457805633545} 11/07/2021 14:24:40 - INFO - __main__ - Step 121890: {'lr': 4.320301470971094e-05, 'samples': 23402880, 'steps': 121889, 'loss/train': 0.6853662133216858} 11/07/2021 14:24:41 - INFO - __main__ - Step 121891: {'lr': 4.3200032762074154e-05, 'samples': 23403072, 'steps': 121890, 'loss/train': 0.6989704966545105} 11/07/2021 14:24:42 - INFO - __main__ - Step 121892: {'lr': 4.319705090761825e-05, 'samples': 23403264, 'steps': 121891, 'loss/train': 0.5712666511535645} 11/07/2021 14:24:42 - INFO - __main__ - Step 121893: {'lr': 4.319406914634455e-05, 'samples': 23403456, 'steps': 121892, 'loss/train': 1.4416064023971558} 11/07/2021 14:24:43 - INFO - __main__ - Step 121894: {'lr': 4.3191087478254424e-05, 'samples': 23403648, 'steps': 121893, 'loss/train': 0.9819695353507996} 11/07/2021 14:24:43 - INFO - __main__ - Step 121895: {'lr': 4.318810590334926e-05, 'samples': 23403840, 'steps': 121894, 'loss/train': 0.950090765953064} 11/07/2021 14:24:43 - INFO - __main__ - Step 121896: {'lr': 4.318512442163031e-05, 'samples': 23404032, 'steps': 121895, 'loss/train': 0.7361970543861389} 11/07/2021 14:24:44 - INFO - __main__ - Step 121897: {'lr': 4.318214303309892e-05, 'samples': 23404224, 'steps': 121896, 'loss/train': 1.1112396717071533} 11/07/2021 14:24:45 - INFO - __main__ - Step 121898: {'lr': 4.317916173775646e-05, 'samples': 23404416, 'steps': 121897, 'loss/train': 1.2207810878753662} 11/07/2021 14:24:45 - INFO - __main__ - Step 121899: {'lr': 4.317618053560429e-05, 'samples': 23404608, 'steps': 121898, 'loss/train': 1.3848156929016113} 11/07/2021 14:24:45 - INFO - __main__ - Step 121900: {'lr': 4.3173199426643746e-05, 'samples': 23404800, 'steps': 121899, 'loss/train': 1.390964388847351} 11/07/2021 14:24:46 - INFO - __main__ - Step 121901: {'lr': 4.317021841087615e-05, 'samples': 23404992, 'steps': 121900, 'loss/train': 1.725439429283142} 11/07/2021 14:24:47 - INFO - __main__ - Step 121902: {'lr': 4.31672374883029e-05, 'samples': 23405184, 'steps': 121901, 'loss/train': 1.6454004049301147} 11/07/2021 14:24:47 - INFO - __main__ - Step 121903: {'lr': 4.316425665892526e-05, 'samples': 23405376, 'steps': 121902, 'loss/train': 1.4883506298065186} 11/07/2021 14:24:47 - INFO - __main__ - Step 121904: {'lr': 4.316127592274463e-05, 'samples': 23405568, 'steps': 121903, 'loss/train': 0.7693279981613159} 11/07/2021 14:24:48 - INFO - __main__ - Step 121905: {'lr': 4.315829527976234e-05, 'samples': 23405760, 'steps': 121904, 'loss/train': 1.2442553043365479} 11/07/2021 14:24:48 - INFO - __main__ - Step 121906: {'lr': 4.315531472997972e-05, 'samples': 23405952, 'steps': 121905, 'loss/train': 1.0139400959014893} 11/07/2021 14:24:48 - INFO - __main__ - Step 121907: {'lr': 4.3152334273398156e-05, 'samples': 23406144, 'steps': 121906, 'loss/train': 1.854422688484192} 11/07/2021 14:24:50 - INFO - __main__ - Step 121908: {'lr': 4.314935391001892e-05, 'samples': 23406336, 'steps': 121907, 'loss/train': 1.0950894355773926} 11/07/2021 14:24:50 - INFO - __main__ - Step 121909: {'lr': 4.3146373639843474e-05, 'samples': 23406528, 'steps': 121908, 'loss/train': 1.4731754064559937} 11/07/2021 14:24:50 - INFO - __main__ - Step 121910: {'lr': 4.314339346287299e-05, 'samples': 23406720, 'steps': 121909, 'loss/train': 1.059898018836975} 11/07/2021 14:24:51 - INFO - __main__ - Step 121911: {'lr': 4.314041337910893e-05, 'samples': 23406912, 'steps': 121910, 'loss/train': 1.0726677179336548} 11/07/2021 14:24:51 - INFO - __main__ - Step 121912: {'lr': 4.313743338855261e-05, 'samples': 23407104, 'steps': 121911, 'loss/train': 1.477790117263794} 11/07/2021 14:24:52 - INFO - __main__ - Step 121913: {'lr': 4.313445349120537e-05, 'samples': 23407296, 'steps': 121912, 'loss/train': 0.5953063368797302} 11/07/2021 14:24:52 - INFO - __main__ - Step 121914: {'lr': 4.3131473687068544e-05, 'samples': 23407488, 'steps': 121913, 'loss/train': 1.513563632965088} 11/07/2021 14:24:53 - INFO - __main__ - Step 121915: {'lr': 4.312849397614349e-05, 'samples': 23407680, 'steps': 121914, 'loss/train': 0.9547673463821411} 11/07/2021 14:24:53 - INFO - __main__ - Step 121916: {'lr': 4.312551435843154e-05, 'samples': 23407872, 'steps': 121915, 'loss/train': 1.0545670986175537} 11/07/2021 14:24:53 - INFO - __main__ - Step 121917: {'lr': 4.312253483393402e-05, 'samples': 23408064, 'steps': 121916, 'loss/train': 0.5369089841842651} 11/07/2021 14:24:54 - INFO - __main__ - Step 121918: {'lr': 4.3119555402652334e-05, 'samples': 23408256, 'steps': 121917, 'loss/train': 1.0308446884155273} 11/07/2021 14:24:55 - INFO - __main__ - Step 121919: {'lr': 4.311657606458774e-05, 'samples': 23408448, 'steps': 121918, 'loss/train': 1.2435951232910156} 11/07/2021 14:24:55 - INFO - __main__ - Step 121920: {'lr': 4.311359681974167e-05, 'samples': 23408640, 'steps': 121919, 'loss/train': 0.7634084820747375} 11/07/2021 14:24:55 - INFO - __main__ - Step 121921: {'lr': 4.3110617668115384e-05, 'samples': 23408832, 'steps': 121920, 'loss/train': 1.4574825763702393} 11/07/2021 14:24:56 - INFO - __main__ - Step 121922: {'lr': 4.3107638609710346e-05, 'samples': 23409024, 'steps': 121921, 'loss/train': 1.314925193786621} 11/07/2021 14:24:57 - INFO - __main__ - Step 121923: {'lr': 4.310465964452773e-05, 'samples': 23409216, 'steps': 121922, 'loss/train': 1.4263066053390503} 11/07/2021 14:24:57 - INFO - __main__ - Step 121924: {'lr': 4.3101680772568986e-05, 'samples': 23409408, 'steps': 121923, 'loss/train': 1.2478629350662231} 11/07/2021 14:24:58 - INFO - __main__ - Step 121925: {'lr': 4.309870199383542e-05, 'samples': 23409600, 'steps': 121924, 'loss/train': 1.062361478805542} 11/07/2021 14:24:58 - INFO - __main__ - Step 121926: {'lr': 4.309572330832839e-05, 'samples': 23409792, 'steps': 121925, 'loss/train': 0.6961898803710938} 11/07/2021 14:24:58 - INFO - __main__ - Step 121927: {'lr': 4.309274471604924e-05, 'samples': 23409984, 'steps': 121926, 'loss/train': 1.251188039779663} 11/07/2021 14:24:59 - INFO - __main__ - Step 121928: {'lr': 4.308976621699928e-05, 'samples': 23410176, 'steps': 121927, 'loss/train': 1.0553840398788452} 11/07/2021 14:25:00 - INFO - __main__ - Step 121929: {'lr': 4.308678781117992e-05, 'samples': 23410368, 'steps': 121928, 'loss/train': 1.3436037302017212} 11/07/2021 14:25:00 - INFO - __main__ - Step 121930: {'lr': 4.308380949859242e-05, 'samples': 23410560, 'steps': 121929, 'loss/train': 1.4966109991073608} 11/07/2021 14:25:00 - INFO - __main__ - Step 121931: {'lr': 4.3080831279238174e-05, 'samples': 23410752, 'steps': 121930, 'loss/train': 1.5399678945541382} 11/07/2021 14:25:01 - INFO - __main__ - Step 121932: {'lr': 4.307785315311852e-05, 'samples': 23410944, 'steps': 121931, 'loss/train': 1.2326534986495972} 11/07/2021 14:25:02 - INFO - __main__ - Step 121933: {'lr': 4.307487512023481e-05, 'samples': 23411136, 'steps': 121932, 'loss/train': 1.1609933376312256} 11/07/2021 14:25:02 - INFO - __main__ - Step 121934: {'lr': 4.307189718058835e-05, 'samples': 23411328, 'steps': 121933, 'loss/train': 1.2048816680908203} 11/07/2021 14:25:02 - INFO - __main__ - Step 121935: {'lr': 4.306891933418047e-05, 'samples': 23411520, 'steps': 121934, 'loss/train': 1.1591862440109253} 11/07/2021 14:25:03 - INFO - __main__ - Step 121936: {'lr': 4.306594158101265e-05, 'samples': 23411712, 'steps': 121935, 'loss/train': 1.3058987855911255} 11/07/2021 14:25:03 - INFO - __main__ - Step 121937: {'lr': 4.306296392108605e-05, 'samples': 23411904, 'steps': 121936, 'loss/train': 1.0900665521621704} 11/07/2021 14:25:04 - INFO - __main__ - Step 121938: {'lr': 4.305998635440206e-05, 'samples': 23412096, 'steps': 121937, 'loss/train': 0.823680579662323} 11/07/2021 14:25:05 - INFO - __main__ - Step 121939: {'lr': 4.305700888096209e-05, 'samples': 23412288, 'steps': 121938, 'loss/train': 1.1294103860855103} 11/07/2021 14:25:05 - INFO - __main__ - Step 121940: {'lr': 4.305403150076739e-05, 'samples': 23412480, 'steps': 121939, 'loss/train': 1.2061035633087158} 11/07/2021 14:25:05 - INFO - __main__ - Step 121941: {'lr': 4.3051054213819386e-05, 'samples': 23412672, 'steps': 121940, 'loss/train': 0.9234007596969604} 11/07/2021 14:25:06 - INFO - __main__ - Step 121942: {'lr': 4.304807702011937e-05, 'samples': 23412864, 'steps': 121941, 'loss/train': 1.3801418542861938} 11/07/2021 14:25:06 - INFO - __main__ - Step 121943: {'lr': 4.304509991966871e-05, 'samples': 23413056, 'steps': 121942, 'loss/train': 0.8562299609184265} 11/07/2021 14:25:07 - INFO - __main__ - Step 121944: {'lr': 4.304212291246873e-05, 'samples': 23413248, 'steps': 121943, 'loss/train': 1.488005518913269} 11/07/2021 14:25:07 - INFO - __main__ - Step 121945: {'lr': 4.303914599852077e-05, 'samples': 23413440, 'steps': 121944, 'loss/train': 1.0810290575027466} 11/07/2021 14:25:08 - INFO - __main__ - Step 121946: {'lr': 4.303616917782616e-05, 'samples': 23413632, 'steps': 121945, 'loss/train': 1.2276670932769775} 11/07/2021 14:25:08 - INFO - __main__ - Step 121947: {'lr': 4.303319245038628e-05, 'samples': 23413824, 'steps': 121946, 'loss/train': 0.8898214101791382} 11/07/2021 14:25:08 - INFO - __main__ - Step 121948: {'lr': 4.3030215816202453e-05, 'samples': 23414016, 'steps': 121947, 'loss/train': 0.9171852469444275} 11/07/2021 14:25:09 - INFO - __main__ - Step 121949: {'lr': 4.302723927527607e-05, 'samples': 23414208, 'steps': 121948, 'loss/train': 1.121004581451416} 11/07/2021 14:25:10 - INFO - __main__ - Step 121950: {'lr': 4.302426282760835e-05, 'samples': 23414400, 'steps': 121949, 'loss/train': 1.522051215171814} 11/07/2021 14:25:10 - INFO - __main__ - Step 121951: {'lr': 4.302128647320072e-05, 'samples': 23414592, 'steps': 121950, 'loss/train': 1.492652177810669} 11/07/2021 14:25:11 - INFO - __main__ - Step 121952: {'lr': 4.301831021205449e-05, 'samples': 23414784, 'steps': 121951, 'loss/train': 1.1656463146209717} 11/07/2021 14:25:11 - INFO - __main__ - Step 121953: {'lr': 4.3015334044171014e-05, 'samples': 23414976, 'steps': 121952, 'loss/train': 1.5590488910675049} 11/07/2021 14:25:12 - INFO - __main__ - Step 121954: {'lr': 4.3012357969551666e-05, 'samples': 23415168, 'steps': 121953, 'loss/train': 1.2378637790679932} 11/07/2021 14:25:12 - INFO - __main__ - Step 121955: {'lr': 4.3009381988197707e-05, 'samples': 23415360, 'steps': 121954, 'loss/train': 1.4767868518829346} 11/07/2021 14:25:13 - INFO - __main__ - Step 121956: {'lr': 4.300640610011056e-05, 'samples': 23415552, 'steps': 121955, 'loss/train': 1.215514063835144} 11/07/2021 14:25:13 - INFO - __main__ - Step 121957: {'lr': 4.300343030529152e-05, 'samples': 23415744, 'steps': 121956, 'loss/train': 1.35750412940979} 11/07/2021 14:25:13 - INFO - __main__ - Step 121958: {'lr': 4.300045460374194e-05, 'samples': 23415936, 'steps': 121957, 'loss/train': 1.8368785381317139} 11/07/2021 14:25:14 - INFO - __main__ - Step 121959: {'lr': 4.299747899546316e-05, 'samples': 23416128, 'steps': 121958, 'loss/train': 1.8943604230880737} 11/07/2021 14:25:15 - INFO - __main__ - Step 121960: {'lr': 4.299450348045653e-05, 'samples': 23416320, 'steps': 121959, 'loss/train': 1.4133964776992798} 11/07/2021 14:25:15 - INFO - __main__ - Step 121961: {'lr': 4.2991528058723446e-05, 'samples': 23416512, 'steps': 121960, 'loss/train': 1.085167407989502} 11/07/2021 14:25:15 - INFO - __main__ - Step 121962: {'lr': 4.298855273026511e-05, 'samples': 23416704, 'steps': 121961, 'loss/train': 1.3976831436157227} 11/07/2021 14:25:16 - INFO - __main__ - Step 121963: {'lr': 4.2985577495082946e-05, 'samples': 23416896, 'steps': 121962, 'loss/train': 1.5121248960494995} 11/07/2021 14:25:17 - INFO - __main__ - Step 121964: {'lr': 4.2982602353178305e-05, 'samples': 23417088, 'steps': 121963, 'loss/train': 1.3687664270401} 11/07/2021 14:25:17 - INFO - __main__ - Step 121965: {'lr': 4.297962730455249e-05, 'samples': 23417280, 'steps': 121964, 'loss/train': 1.142478108406067} 11/07/2021 14:25:18 - INFO - __main__ - Step 121966: {'lr': 4.297665234920686e-05, 'samples': 23417472, 'steps': 121965, 'loss/train': 1.7178092002868652} 11/07/2021 14:25:18 - INFO - __main__ - Step 121967: {'lr': 4.297367748714276e-05, 'samples': 23417664, 'steps': 121966, 'loss/train': 1.6845628023147583} 11/07/2021 14:25:18 - INFO - __main__ - Step 121968: {'lr': 4.297070271836151e-05, 'samples': 23417856, 'steps': 121967, 'loss/train': 1.133064866065979} 11/07/2021 14:25:19 - INFO - __main__ - Step 121969: {'lr': 4.296772804286447e-05, 'samples': 23418048, 'steps': 121968, 'loss/train': 1.5233618021011353} 11/07/2021 14:25:19 - INFO - __main__ - Step 121970: {'lr': 4.296475346065301e-05, 'samples': 23418240, 'steps': 121969, 'loss/train': 0.6439050436019897} 11/07/2021 14:25:20 - INFO - __main__ - Step 121971: {'lr': 4.29617789717284e-05, 'samples': 23418432, 'steps': 121970, 'loss/train': 0.11523493379354477} 11/07/2021 14:25:21 - INFO - __main__ - Step 121972: {'lr': 4.295880457609211e-05, 'samples': 23418624, 'steps': 121971, 'loss/train': 1.4393399953842163} 11/07/2021 14:25:21 - INFO - __main__ - Step 121973: {'lr': 4.29558302737453e-05, 'samples': 23418816, 'steps': 121972, 'loss/train': 1.2885046005249023} 11/07/2021 14:25:21 - INFO - __main__ - Step 121974: {'lr': 4.2952856064689403e-05, 'samples': 23419008, 'steps': 121973, 'loss/train': 0.7959787249565125} 11/07/2021 14:25:22 - INFO - __main__ - Step 121975: {'lr': 4.294988194892577e-05, 'samples': 23419200, 'steps': 121974, 'loss/train': 0.7481681108474731} 11/07/2021 14:25:23 - INFO - __main__ - Step 121976: {'lr': 4.2946907926455696e-05, 'samples': 23419392, 'steps': 121975, 'loss/train': 0.6479575634002686} 11/07/2021 14:25:23 - INFO - __main__ - Step 121977: {'lr': 4.2943933997280584e-05, 'samples': 23419584, 'steps': 121976, 'loss/train': 1.2703588008880615} 11/07/2021 14:25:23 - INFO - __main__ - Step 121978: {'lr': 4.29409601614017e-05, 'samples': 23419776, 'steps': 121977, 'loss/train': 1.2919468879699707} 11/07/2021 14:25:24 - INFO - __main__ - Step 121979: {'lr': 4.2937986418820465e-05, 'samples': 23419968, 'steps': 121978, 'loss/train': 1.1926010847091675} 11/07/2021 14:25:24 - INFO - __main__ - Step 121980: {'lr': 4.293501276953815e-05, 'samples': 23420160, 'steps': 121979, 'loss/train': 1.52068293094635} 11/07/2021 14:25:25 - INFO - __main__ - Step 121981: {'lr': 4.293203921355615e-05, 'samples': 23420352, 'steps': 121980, 'loss/train': 1.2121312618255615} 11/07/2021 14:25:25 - INFO - __main__ - Step 121982: {'lr': 4.292906575087574e-05, 'samples': 23420544, 'steps': 121981, 'loss/train': 1.5822056531906128} 11/07/2021 14:25:26 - INFO - __main__ - Step 121983: {'lr': 4.292609238149839e-05, 'samples': 23420736, 'steps': 121982, 'loss/train': 1.282122015953064} 11/07/2021 14:25:26 - INFO - __main__ - Step 121984: {'lr': 4.292311910542529e-05, 'samples': 23420928, 'steps': 121983, 'loss/train': 1.4187136888504028} 11/07/2021 14:25:26 - INFO - __main__ - Step 121985: {'lr': 4.29201459226578e-05, 'samples': 23421120, 'steps': 121984, 'loss/train': 1.0504471063613892} 11/07/2021 14:25:28 - INFO - __main__ - Step 121986: {'lr': 4.291717283319735e-05, 'samples': 23421312, 'steps': 121985, 'loss/train': 1.3848135471343994} 11/07/2021 14:25:28 - INFO - __main__ - Step 121987: {'lr': 4.2914199837045196e-05, 'samples': 23421504, 'steps': 121986, 'loss/train': 1.4262328147888184} 11/07/2021 14:25:28 - INFO - __main__ - Step 121988: {'lr': 4.2911226934202715e-05, 'samples': 23421696, 'steps': 121987, 'loss/train': 1.1150885820388794} 11/07/2021 14:25:29 - INFO - __main__ - Step 121989: {'lr': 4.290825412467123e-05, 'samples': 23421888, 'steps': 121988, 'loss/train': 1.7542750835418701} 11/07/2021 14:25:29 - INFO - __main__ - Step 121990: {'lr': 4.290528140845209e-05, 'samples': 23422080, 'steps': 121989, 'loss/train': 1.3303489685058594} 11/07/2021 14:25:30 - INFO - __main__ - Step 121991: {'lr': 4.290230878554663e-05, 'samples': 23422272, 'steps': 121990, 'loss/train': 1.4904910326004028} 11/07/2021 14:25:30 - INFO - __main__ - Step 121992: {'lr': 4.289933625595621e-05, 'samples': 23422464, 'steps': 121991, 'loss/train': 1.4460417032241821} 11/07/2021 14:25:31 - INFO - __main__ - Step 121993: {'lr': 4.289636381968215e-05, 'samples': 23422656, 'steps': 121992, 'loss/train': 1.3777556419372559} 11/07/2021 14:25:31 - INFO - __main__ - Step 121994: {'lr': 4.289339147672586e-05, 'samples': 23422848, 'steps': 121993, 'loss/train': 0.9241055846214294} 11/07/2021 14:25:31 - INFO - __main__ - Step 121995: {'lr': 4.289041922708853e-05, 'samples': 23423040, 'steps': 121994, 'loss/train': 1.4809417724609375} 11/07/2021 14:25:32 - INFO - __main__ - Step 121996: {'lr': 4.288744707077158e-05, 'samples': 23423232, 'steps': 121995, 'loss/train': 1.435575008392334} 11/07/2021 14:25:33 - INFO - __main__ - Step 121997: {'lr': 4.288447500777637e-05, 'samples': 23423424, 'steps': 121996, 'loss/train': 1.2024832963943481} 11/07/2021 14:25:33 - INFO - __main__ - Step 121998: {'lr': 4.2881503038104205e-05, 'samples': 23423616, 'steps': 121997, 'loss/train': 1.4208388328552246} 11/07/2021 14:25:33 - INFO - __main__ - Step 121999: {'lr': 4.287853116175644e-05, 'samples': 23423808, 'steps': 121998, 'loss/train': 1.3420798778533936} 11/07/2021 14:25:34 - INFO - __main__ - Step 122000: {'lr': 4.287555937873444e-05, 'samples': 23424000, 'steps': 121999, 'loss/train': 1.5926361083984375} 11/07/2021 14:25:34 - INFO - __main__ - Step 122001: {'lr': 4.287258768903948e-05, 'samples': 23424192, 'steps': 122000, 'loss/train': 1.1990522146224976} 11/07/2021 14:25:35 - INFO - __main__ - Step 122002: {'lr': 4.286961609267295e-05, 'samples': 23424384, 'steps': 122001, 'loss/train': 1.0701866149902344} 11/07/2021 14:25:36 - INFO - __main__ - Step 122003: {'lr': 4.286664458963618e-05, 'samples': 23424576, 'steps': 122002, 'loss/train': 1.4566617012023926} 11/07/2021 14:25:36 - INFO - __main__ - Step 122004: {'lr': 4.286367317993051e-05, 'samples': 23424768, 'steps': 122003, 'loss/train': 1.6918363571166992} 11/07/2021 14:25:36 - INFO - __main__ - Step 122005: {'lr': 4.286070186355731e-05, 'samples': 23424960, 'steps': 122004, 'loss/train': 1.121177077293396} 11/07/2021 14:25:37 - INFO - __main__ - Step 122006: {'lr': 4.285773064051785e-05, 'samples': 23425152, 'steps': 122005, 'loss/train': 1.228018879890442} 11/07/2021 14:25:38 - INFO - __main__ - Step 122007: {'lr': 4.285475951081347e-05, 'samples': 23425344, 'steps': 122006, 'loss/train': 1.1583216190338135} 11/07/2021 14:25:38 - INFO - __main__ - Step 122008: {'lr': 4.285178847444557e-05, 'samples': 23425536, 'steps': 122007, 'loss/train': 1.0629979372024536} 11/07/2021 14:25:38 - INFO - __main__ - Step 122009: {'lr': 4.284881753141542e-05, 'samples': 23425728, 'steps': 122008, 'loss/train': 1.044390082359314} 11/07/2021 14:25:39 - INFO - __main__ - Step 122010: {'lr': 4.284584668172442e-05, 'samples': 23425920, 'steps': 122009, 'loss/train': 1.1413484811782837} 11/07/2021 14:25:39 - INFO - __main__ - Step 122011: {'lr': 4.2842875925373894e-05, 'samples': 23426112, 'steps': 122010, 'loss/train': 1.4755622148513794} 11/07/2021 14:25:40 - INFO - __main__ - Step 122012: {'lr': 4.2839905262365145e-05, 'samples': 23426304, 'steps': 122011, 'loss/train': 1.2945865392684937} 11/07/2021 14:25:41 - INFO - __main__ - Step 122013: {'lr': 4.2836934692699556e-05, 'samples': 23426496, 'steps': 122012, 'loss/train': 1.5092244148254395} 11/07/2021 14:25:41 - INFO - __main__ - Step 122014: {'lr': 4.283396421637845e-05, 'samples': 23426688, 'steps': 122013, 'loss/train': 1.3071359395980835} 11/07/2021 14:25:41 - INFO - __main__ - Step 122015: {'lr': 4.283099383340316e-05, 'samples': 23426880, 'steps': 122014, 'loss/train': 1.3614102602005005} 11/07/2021 14:25:42 - INFO - __main__ - Step 122016: {'lr': 4.2828023543775074e-05, 'samples': 23427072, 'steps': 122015, 'loss/train': 1.7199634313583374} 11/07/2021 14:25:44 - INFO - __main__ - Step 122017: {'lr': 4.2825053347495425e-05, 'samples': 23427264, 'steps': 122016, 'loss/train': 1.4147039651870728} 11/07/2021 14:25:44 - INFO - __main__ - Step 122018: {'lr': 4.282208324456563e-05, 'samples': 23427456, 'steps': 122017, 'loss/train': 1.516242265701294} 11/07/2021 14:25:44 - INFO - __main__ - Step 122019: {'lr': 4.2819113234987e-05, 'samples': 23427648, 'steps': 122018, 'loss/train': 0.8490079045295715} 11/07/2021 14:25:45 - INFO - __main__ - Step 122020: {'lr': 4.281614331876088e-05, 'samples': 23427840, 'steps': 122019, 'loss/train': 0.9339141249656677} 11/07/2021 14:25:45 - INFO - __main__ - Step 122021: {'lr': 4.281317349588859e-05, 'samples': 23428032, 'steps': 122020, 'loss/train': 0.44176867604255676} 11/07/2021 14:25:45 - INFO - __main__ - Step 122022: {'lr': 4.281020376637151e-05, 'samples': 23428224, 'steps': 122021, 'loss/train': 1.5169014930725098} 11/07/2021 14:25:46 - INFO - __main__ - Step 122023: {'lr': 4.280723413021095e-05, 'samples': 23428416, 'steps': 122022, 'loss/train': 1.7452865839004517} 11/07/2021 14:25:47 - INFO - __main__ - Step 122024: {'lr': 4.280426458740824e-05, 'samples': 23428608, 'steps': 122023, 'loss/train': 1.7524149417877197} 11/07/2021 14:25:47 - INFO - __main__ - Step 122025: {'lr': 4.280129513796474e-05, 'samples': 23428800, 'steps': 122024, 'loss/train': 0.9842167496681213} 11/07/2021 14:25:48 - INFO - __main__ - Step 122026: {'lr': 4.2798325781881783e-05, 'samples': 23428992, 'steps': 122025, 'loss/train': 1.2174241542816162} 11/07/2021 14:25:48 - INFO - __main__ - Step 122027: {'lr': 4.2795356519160696e-05, 'samples': 23429184, 'steps': 122026, 'loss/train': 1.1462798118591309} 11/07/2021 14:25:48 - INFO - __main__ - Step 122028: {'lr': 4.279238734980284e-05, 'samples': 23429376, 'steps': 122027, 'loss/train': 1.3927453756332397} 11/07/2021 14:25:49 - INFO - __main__ - Step 122029: {'lr': 4.278941827380953e-05, 'samples': 23429568, 'steps': 122028, 'loss/train': 1.4270744323730469} 11/07/2021 14:25:50 - INFO - __main__ - Step 122030: {'lr': 4.2786449291182166e-05, 'samples': 23429760, 'steps': 122029, 'loss/train': 1.6023569107055664} 11/07/2021 14:25:50 - INFO - __main__ - Step 122031: {'lr': 4.278348040192198e-05, 'samples': 23429952, 'steps': 122030, 'loss/train': 0.9968171119689941} 11/07/2021 14:25:50 - INFO - __main__ - Step 122032: {'lr': 4.2780511606030334e-05, 'samples': 23430144, 'steps': 122031, 'loss/train': 1.742719054222107} 11/07/2021 14:25:51 - INFO - __main__ - Step 122033: {'lr': 4.27775429035086e-05, 'samples': 23430336, 'steps': 122032, 'loss/train': 1.2757731676101685} 11/07/2021 14:25:51 - INFO - __main__ - Step 122034: {'lr': 4.277457429435813e-05, 'samples': 23430528, 'steps': 122033, 'loss/train': 1.2421352863311768} 11/07/2021 14:25:52 - INFO - __main__ - Step 122035: {'lr': 4.277160577858022e-05, 'samples': 23430720, 'steps': 122034, 'loss/train': 1.2490299940109253} 11/07/2021 14:25:53 - INFO - __main__ - Step 122036: {'lr': 4.2768637356176226e-05, 'samples': 23430912, 'steps': 122035, 'loss/train': 1.283376932144165} 11/07/2021 14:25:53 - INFO - __main__ - Step 122037: {'lr': 4.276566902714751e-05, 'samples': 23431104, 'steps': 122036, 'loss/train': 0.8135694265365601} 11/07/2021 14:25:53 - INFO - __main__ - Step 122038: {'lr': 4.276270079149536e-05, 'samples': 23431296, 'steps': 122037, 'loss/train': 1.2029013633728027} 11/07/2021 14:25:54 - INFO - __main__ - Step 122039: {'lr': 4.2759732649221146e-05, 'samples': 23431488, 'steps': 122038, 'loss/train': 1.4906635284423828} 11/07/2021 14:25:55 - INFO - __main__ - Step 122040: {'lr': 4.275676460032621e-05, 'samples': 23431680, 'steps': 122039, 'loss/train': 0.8594770431518555} 11/07/2021 14:25:55 - INFO - __main__ - Step 122041: {'lr': 4.2753796644811854e-05, 'samples': 23431872, 'steps': 122040, 'loss/train': 1.1096302270889282} 11/07/2021 14:25:55 - INFO - __main__ - Step 122042: {'lr': 4.275082878267947e-05, 'samples': 23432064, 'steps': 122041, 'loss/train': 1.2453452348709106} 11/07/2021 14:25:56 - INFO - __main__ - Step 122043: {'lr': 4.274786101393041e-05, 'samples': 23432256, 'steps': 122042, 'loss/train': 1.5449081659317017} 11/07/2021 14:25:56 - INFO - __main__ - Step 122044: {'lr': 4.2744893338565905e-05, 'samples': 23432448, 'steps': 122043, 'loss/train': 1.3054718971252441} 11/07/2021 14:25:57 - INFO - __main__ - Step 122045: {'lr': 4.274192575658734e-05, 'samples': 23432640, 'steps': 122044, 'loss/train': 1.1206512451171875} 11/07/2021 14:25:57 - INFO - __main__ - Step 122046: {'lr': 4.2738958267996065e-05, 'samples': 23432832, 'steps': 122045, 'loss/train': 0.554561197757721} 11/07/2021 14:25:58 - INFO - __main__ - Step 122047: {'lr': 4.2735990872793453e-05, 'samples': 23433024, 'steps': 122046, 'loss/train': 1.3409461975097656} 11/07/2021 14:25:58 - INFO - __main__ - Step 122048: {'lr': 4.273302357098077e-05, 'samples': 23433216, 'steps': 122047, 'loss/train': 1.3500663042068481} 11/07/2021 14:25:58 - INFO - __main__ - Step 122049: {'lr': 4.273005636255939e-05, 'samples': 23433408, 'steps': 122048, 'loss/train': 1.297472596168518} 11/07/2021 14:26:00 - INFO - __main__ - Step 122050: {'lr': 4.272708924753066e-05, 'samples': 23433600, 'steps': 122049, 'loss/train': 1.135649561882019} 11/07/2021 14:26:00 - INFO - __main__ - Step 122051: {'lr': 4.2724122225895915e-05, 'samples': 23433792, 'steps': 122050, 'loss/train': 0.9601747989654541} 11/07/2021 14:26:00 - INFO - __main__ - Step 122052: {'lr': 4.272115529765647e-05, 'samples': 23433984, 'steps': 122051, 'loss/train': 0.982645571231842} 11/07/2021 14:26:01 - INFO - __main__ - Step 122053: {'lr': 4.271818846281367e-05, 'samples': 23434176, 'steps': 122052, 'loss/train': 1.3274630308151245} 11/07/2021 14:26:01 - INFO - __main__ - Step 122054: {'lr': 4.271522172136885e-05, 'samples': 23434368, 'steps': 122053, 'loss/train': 0.8659157156944275} 11/07/2021 14:26:02 - INFO - __main__ - Step 122055: {'lr': 4.271225507332335e-05, 'samples': 23434560, 'steps': 122054, 'loss/train': 0.05503827705979347} 11/07/2021 14:26:02 - INFO - __main__ - Step 122056: {'lr': 4.2709288518678526e-05, 'samples': 23434752, 'steps': 122055, 'loss/train': 1.2191425561904907} 11/07/2021 14:26:03 - INFO - __main__ - Step 122057: {'lr': 4.270632205743577e-05, 'samples': 23434944, 'steps': 122056, 'loss/train': 1.0796915292739868} 11/07/2021 14:26:03 - INFO - __main__ - Step 122058: {'lr': 4.270335568959627e-05, 'samples': 23435136, 'steps': 122057, 'loss/train': 1.1441749334335327} 11/07/2021 14:26:03 - INFO - __main__ - Step 122059: {'lr': 4.270038941516144e-05, 'samples': 23435328, 'steps': 122058, 'loss/train': 1.4303394556045532} 11/07/2021 14:26:04 - INFO - __main__ - Step 122060: {'lr': 4.269742323413262e-05, 'samples': 23435520, 'steps': 122059, 'loss/train': 0.9645299315452576} 11/07/2021 14:26:05 - INFO - __main__ - Step 122061: {'lr': 4.269445714651113e-05, 'samples': 23435712, 'steps': 122060, 'loss/train': 1.5900087356567383} 11/07/2021 14:26:05 - INFO - __main__ - Step 122062: {'lr': 4.269149115229831e-05, 'samples': 23435904, 'steps': 122061, 'loss/train': 1.1544866561889648} 11/07/2021 14:26:06 - INFO - __main__ - Step 122063: {'lr': 4.2688525251495525e-05, 'samples': 23436096, 'steps': 122062, 'loss/train': 1.2514674663543701} 11/07/2021 14:26:06 - INFO - __main__ - Step 122064: {'lr': 4.26855594441041e-05, 'samples': 23436288, 'steps': 122063, 'loss/train': 1.2748740911483765} 11/07/2021 14:26:07 - INFO - __main__ - Step 122065: {'lr': 4.268259373012534e-05, 'samples': 23436480, 'steps': 122064, 'loss/train': 1.3626779317855835} 11/07/2021 14:26:07 - INFO - __main__ - Step 122066: {'lr': 4.267962810956061e-05, 'samples': 23436672, 'steps': 122065, 'loss/train': 1.2260931730270386} 11/07/2021 14:26:08 - INFO - __main__ - Step 122067: {'lr': 4.2676662582411235e-05, 'samples': 23436864, 'steps': 122066, 'loss/train': 1.635251522064209} 11/07/2021 14:26:08 - INFO - __main__ - Step 122068: {'lr': 4.267369714867858e-05, 'samples': 23437056, 'steps': 122067, 'loss/train': 0.9542341828346252} 11/07/2021 14:26:08 - INFO - __main__ - Step 122069: {'lr': 4.267073180836395e-05, 'samples': 23437248, 'steps': 122068, 'loss/train': 2.290158748626709} 11/07/2021 14:26:09 - INFO - __main__ - Step 122070: {'lr': 4.266776656146873e-05, 'samples': 23437440, 'steps': 122069, 'loss/train': 1.3042112588882446} 11/07/2021 14:26:10 - INFO - __main__ - Step 122071: {'lr': 4.266480140799417e-05, 'samples': 23437632, 'steps': 122070, 'loss/train': 1.4458503723144531} 11/07/2021 14:26:10 - INFO - __main__ - Step 122072: {'lr': 4.266183634794166e-05, 'samples': 23437824, 'steps': 122071, 'loss/train': 1.6870251893997192} 11/07/2021 14:26:10 - INFO - __main__ - Step 122073: {'lr': 4.2658871381312494e-05, 'samples': 23438016, 'steps': 122072, 'loss/train': 1.539299726486206} 11/07/2021 14:26:11 - INFO - __main__ - Step 122074: {'lr': 4.265590650810808e-05, 'samples': 23438208, 'steps': 122073, 'loss/train': 0.9033560156822205} 11/07/2021 14:26:11 - INFO - __main__ - Step 122075: {'lr': 4.26529417283297e-05, 'samples': 23438400, 'steps': 122074, 'loss/train': 1.1907447576522827} 11/07/2021 14:26:12 - INFO - __main__ - Step 122076: {'lr': 4.2649977041978706e-05, 'samples': 23438592, 'steps': 122075, 'loss/train': 1.3626539707183838} 11/07/2021 14:26:13 - INFO - __main__ - Step 122077: {'lr': 4.2647012449056414e-05, 'samples': 23438784, 'steps': 122076, 'loss/train': 1.1187019348144531} 11/07/2021 14:26:13 - INFO - __main__ - Step 122078: {'lr': 4.2644047949564194e-05, 'samples': 23438976, 'steps': 122077, 'loss/train': 1.1187098026275635} 11/07/2021 14:26:13 - INFO - __main__ - Step 122079: {'lr': 4.264108354350338e-05, 'samples': 23439168, 'steps': 122078, 'loss/train': 1.4148204326629639} 11/07/2021 14:26:14 - INFO - __main__ - Step 122080: {'lr': 4.263811923087527e-05, 'samples': 23439360, 'steps': 122079, 'loss/train': 1.2943286895751953} 11/07/2021 14:26:15 - INFO - __main__ - Step 122081: {'lr': 4.263515501168122e-05, 'samples': 23439552, 'steps': 122080, 'loss/train': 1.2535933256149292} 11/07/2021 14:26:15 - INFO - __main__ - Step 122082: {'lr': 4.26321908859226e-05, 'samples': 23439744, 'steps': 122081, 'loss/train': 0.8344196677207947} 11/07/2021 14:26:15 - INFO - __main__ - Step 122083: {'lr': 4.2629226853600766e-05, 'samples': 23439936, 'steps': 122082, 'loss/train': 0.878876268863678} 11/07/2021 14:26:16 - INFO - __main__ - Step 122084: {'lr': 4.2626262914716916e-05, 'samples': 23440128, 'steps': 122083, 'loss/train': 1.6838651895523071} 11/07/2021 14:26:16 - INFO - __main__ - Step 122085: {'lr': 4.262329906927251e-05, 'samples': 23440320, 'steps': 122084, 'loss/train': 1.5149555206298828} 11/07/2021 14:26:17 - INFO - __main__ - Step 122086: {'lr': 4.2620335317268806e-05, 'samples': 23440512, 'steps': 122085, 'loss/train': 1.1529160737991333} 11/07/2021 14:26:17 - INFO - __main__ - Step 122087: {'lr': 4.2617371658707217e-05, 'samples': 23440704, 'steps': 122086, 'loss/train': 1.3056888580322266} 11/07/2021 14:26:18 - INFO - __main__ - Step 122088: {'lr': 4.261440809358902e-05, 'samples': 23440896, 'steps': 122087, 'loss/train': 1.3817522525787354} 11/07/2021 14:26:18 - INFO - __main__ - Step 122089: {'lr': 4.2611444621915575e-05, 'samples': 23441088, 'steps': 122088, 'loss/train': 1.3632646799087524} 11/07/2021 14:26:19 - INFO - __main__ - Step 122090: {'lr': 4.260848124368821e-05, 'samples': 23441280, 'steps': 122089, 'loss/train': 1.2844383716583252} 11/07/2021 14:26:20 - INFO - __main__ - Step 122091: {'lr': 4.260551795890827e-05, 'samples': 23441472, 'steps': 122090, 'loss/train': 1.3737919330596924} 11/07/2021 14:26:20 - INFO - __main__ - Step 122092: {'lr': 4.2602554767577074e-05, 'samples': 23441664, 'steps': 122091, 'loss/train': 0.9888972640037537} 11/07/2021 14:26:21 - INFO - __main__ - Step 122093: {'lr': 4.259959166969596e-05, 'samples': 23441856, 'steps': 122092, 'loss/train': 1.6819329261779785} 11/07/2021 14:26:21 - INFO - __main__ - Step 122094: {'lr': 4.2596628665266285e-05, 'samples': 23442048, 'steps': 122093, 'loss/train': 0.9178557991981506} 11/07/2021 14:26:21 - INFO - __main__ - Step 122095: {'lr': 4.2593665754289354e-05, 'samples': 23442240, 'steps': 122094, 'loss/train': 1.6346819400787354} 11/07/2021 14:26:22 - INFO - __main__ - Step 122096: {'lr': 4.259070293676654e-05, 'samples': 23442432, 'steps': 122095, 'loss/train': 1.7311171293258667} 11/07/2021 14:26:23 - INFO - __main__ - Step 122097: {'lr': 4.258774021269918e-05, 'samples': 23442624, 'steps': 122096, 'loss/train': 1.779307246208191} 11/07/2021 14:26:23 - INFO - __main__ - Step 122098: {'lr': 4.2584777582088566e-05, 'samples': 23442816, 'steps': 122097, 'loss/train': 1.0357837677001953} 11/07/2021 14:26:23 - INFO - __main__ - Step 122099: {'lr': 4.258181504493602e-05, 'samples': 23443008, 'steps': 122098, 'loss/train': 1.3067184686660767} 11/07/2021 14:26:24 - INFO - __main__ - Step 122100: {'lr': 4.257885260124292e-05, 'samples': 23443200, 'steps': 122099, 'loss/train': 1.7478045225143433} 11/07/2021 14:26:24 - INFO - __main__ - Step 122101: {'lr': 4.2575890251010576e-05, 'samples': 23443392, 'steps': 122100, 'loss/train': 1.4334115982055664} 11/07/2021 14:26:25 - INFO - __main__ - Step 122102: {'lr': 4.257292799424034e-05, 'samples': 23443584, 'steps': 122101, 'loss/train': 1.307841181755066} 11/07/2021 14:26:25 - INFO - __main__ - Step 122103: {'lr': 4.256996583093356e-05, 'samples': 23443776, 'steps': 122102, 'loss/train': 1.0760996341705322} 11/07/2021 14:26:26 - INFO - __main__ - Step 122104: {'lr': 4.2567003761091516e-05, 'samples': 23443968, 'steps': 122103, 'loss/train': 1.4897576570510864} 11/07/2021 14:26:26 - INFO - __main__ - Step 122105: {'lr': 4.256404178471562e-05, 'samples': 23444160, 'steps': 122104, 'loss/train': 1.2534570693969727} 11/07/2021 14:26:26 - INFO - __main__ - Step 122106: {'lr': 4.256107990180713e-05, 'samples': 23444352, 'steps': 122105, 'loss/train': 1.2162526845932007} 11/07/2021 14:26:28 - INFO - __main__ - Step 122107: {'lr': 4.255811811236743e-05, 'samples': 23444544, 'steps': 122106, 'loss/train': 1.1101409196853638} 11/07/2021 14:26:28 - INFO - __main__ - Step 122108: {'lr': 4.255515641639784e-05, 'samples': 23444736, 'steps': 122107, 'loss/train': 1.2604337930679321} 11/07/2021 14:26:28 - INFO - __main__ - Step 122109: {'lr': 4.2552194813899714e-05, 'samples': 23444928, 'steps': 122108, 'loss/train': 1.4359869956970215} 11/07/2021 14:26:29 - INFO - __main__ - Step 122110: {'lr': 4.2549233304874425e-05, 'samples': 23445120, 'steps': 122109, 'loss/train': 0.599048376083374} 11/07/2021 14:26:29 - INFO - __main__ - Step 122111: {'lr': 4.254627188932317e-05, 'samples': 23445312, 'steps': 122110, 'loss/train': 0.9862338304519653} 11/07/2021 14:26:29 - INFO - __main__ - Step 122112: {'lr': 4.2543310567247364e-05, 'samples': 23445504, 'steps': 122111, 'loss/train': 1.215052604675293} 11/07/2021 14:26:31 - INFO - __main__ - Step 122113: {'lr': 4.2540349338648364e-05, 'samples': 23445696, 'steps': 122112, 'loss/train': 1.076054573059082} 11/07/2021 14:26:32 - INFO - __main__ - Step 122114: {'lr': 4.253738820352745e-05, 'samples': 23445888, 'steps': 122113, 'loss/train': 1.5542773008346558} 11/07/2021 14:26:32 - INFO - __main__ - Step 122115: {'lr': 4.253442716188602e-05, 'samples': 23446080, 'steps': 122114, 'loss/train': 1.1561683416366577} 11/07/2021 14:26:32 - INFO - __main__ - Step 122116: {'lr': 4.2531466213725364e-05, 'samples': 23446272, 'steps': 122115, 'loss/train': 1.1995571851730347} 11/07/2021 14:26:33 - INFO - __main__ - Step 122117: {'lr': 4.2528505359046815e-05, 'samples': 23446464, 'steps': 122116, 'loss/train': 1.2429866790771484} 11/07/2021 14:26:33 - INFO - __main__ - Step 122118: {'lr': 4.252554459785174e-05, 'samples': 23446656, 'steps': 122117, 'loss/train': 1.330118179321289} 11/07/2021 14:26:33 - INFO - __main__ - Step 122119: {'lr': 4.252258393014144e-05, 'samples': 23446848, 'steps': 122118, 'loss/train': 0.588861882686615} 11/07/2021 14:26:35 - INFO - __main__ - Step 122120: {'lr': 4.251962335591725e-05, 'samples': 23447040, 'steps': 122119, 'loss/train': 0.5397169589996338} 11/07/2021 14:26:35 - INFO - __main__ - Step 122121: {'lr': 4.251666287518055e-05, 'samples': 23447232, 'steps': 122120, 'loss/train': 1.022973656654358} 11/07/2021 14:26:35 - INFO - __main__ - Step 122122: {'lr': 4.2513702487932595e-05, 'samples': 23447424, 'steps': 122121, 'loss/train': 1.7168306112289429} 11/07/2021 14:26:36 - INFO - __main__ - Step 122123: {'lr': 4.251074219417481e-05, 'samples': 23447616, 'steps': 122122, 'loss/train': 1.4564706087112427} 11/07/2021 14:26:36 - INFO - __main__ - Step 122124: {'lr': 4.250778199390851e-05, 'samples': 23447808, 'steps': 122123, 'loss/train': 1.168310284614563} 11/07/2021 14:26:37 - INFO - __main__ - Step 122125: {'lr': 4.250482188713495e-05, 'samples': 23448000, 'steps': 122124, 'loss/train': 0.48455920815467834} 11/07/2021 14:26:37 - INFO - __main__ - Step 122126: {'lr': 4.250186187385552e-05, 'samples': 23448192, 'steps': 122125, 'loss/train': 0.7893132567405701} 11/07/2021 14:26:38 - INFO - __main__ - Step 122127: {'lr': 4.249890195407155e-05, 'samples': 23448384, 'steps': 122126, 'loss/train': 0.8536028265953064} 11/07/2021 14:26:38 - INFO - __main__ - Step 122128: {'lr': 4.2495942127784375e-05, 'samples': 23448576, 'steps': 122127, 'loss/train': 1.1238433122634888} 11/07/2021 14:26:38 - INFO - __main__ - Step 122129: {'lr': 4.249298239499533e-05, 'samples': 23448768, 'steps': 122128, 'loss/train': 1.3194549083709717} 11/07/2021 14:26:39 - INFO - __main__ - Step 122130: {'lr': 4.2490022755705735e-05, 'samples': 23448960, 'steps': 122129, 'loss/train': 1.2679216861724854} 11/07/2021 14:26:40 - INFO - __main__ - Step 122131: {'lr': 4.248706320991694e-05, 'samples': 23449152, 'steps': 122130, 'loss/train': 1.41371488571167} 11/07/2021 14:26:40 - INFO - __main__ - Step 122132: {'lr': 4.248410375763026e-05, 'samples': 23449344, 'steps': 122131, 'loss/train': 1.370923399925232} 11/07/2021 14:26:40 - INFO - __main__ - Step 122133: {'lr': 4.248114439884707e-05, 'samples': 23449536, 'steps': 122132, 'loss/train': 1.4908756017684937} 11/07/2021 14:26:41 - INFO - __main__ - Step 122134: {'lr': 4.2478185133568634e-05, 'samples': 23449728, 'steps': 122133, 'loss/train': 1.395738959312439} 11/07/2021 14:26:42 - INFO - __main__ - Step 122135: {'lr': 4.247522596179634e-05, 'samples': 23449920, 'steps': 122134, 'loss/train': 1.5346083641052246} 11/07/2021 14:26:42 - INFO - __main__ - Step 122136: {'lr': 4.2472266883531506e-05, 'samples': 23450112, 'steps': 122135, 'loss/train': 1.1850229501724243} 11/07/2021 14:26:43 - INFO - __main__ - Step 122137: {'lr': 4.2469307898775536e-05, 'samples': 23450304, 'steps': 122136, 'loss/train': 1.1259087324142456} 11/07/2021 14:26:43 - INFO - __main__ - Step 122138: {'lr': 4.246634900752963e-05, 'samples': 23450496, 'steps': 122137, 'loss/train': 0.07629115134477615} 11/07/2021 14:26:44 - INFO - __main__ - Step 122139: {'lr': 4.24633902097952e-05, 'samples': 23450688, 'steps': 122138, 'loss/train': 1.096441388130188} 11/07/2021 14:26:44 - INFO - __main__ - Step 122140: {'lr': 4.246043150557355e-05, 'samples': 23450880, 'steps': 122139, 'loss/train': 1.4094070196151733} 11/07/2021 14:26:45 - INFO - __main__ - Step 122141: {'lr': 4.245747289486601e-05, 'samples': 23451072, 'steps': 122140, 'loss/train': 1.1518361568450928} 11/07/2021 14:26:45 - INFO - __main__ - Step 122142: {'lr': 4.245451437767395e-05, 'samples': 23451264, 'steps': 122141, 'loss/train': 1.4429242610931396} 11/07/2021 14:26:46 - INFO - __main__ - Step 122143: {'lr': 4.245155595399869e-05, 'samples': 23451456, 'steps': 122142, 'loss/train': 1.102340817451477} 11/07/2021 14:26:46 - INFO - __main__ - Step 122144: {'lr': 4.244859762384154e-05, 'samples': 23451648, 'steps': 122143, 'loss/train': 1.945934772491455} 11/07/2021 14:26:46 - INFO - __main__ - Step 122145: {'lr': 4.244563938720386e-05, 'samples': 23451840, 'steps': 122144, 'loss/train': 1.4588266611099243} 11/07/2021 14:26:48 - INFO - __main__ - Step 122146: {'lr': 4.244268124408696e-05, 'samples': 23452032, 'steps': 122145, 'loss/train': 1.3646517992019653} 11/07/2021 14:26:48 - INFO - __main__ - Step 122147: {'lr': 4.2439723194492184e-05, 'samples': 23452224, 'steps': 122146, 'loss/train': 0.6489017009735107} 11/07/2021 14:26:48 - INFO - __main__ - Step 122148: {'lr': 4.243676523842088e-05, 'samples': 23452416, 'steps': 122147, 'loss/train': 0.30723702907562256} 11/07/2021 14:26:49 - INFO - __main__ - Step 122149: {'lr': 4.243380737587435e-05, 'samples': 23452608, 'steps': 122148, 'loss/train': 0.8823301792144775} 11/07/2021 14:26:49 - INFO - __main__ - Step 122150: {'lr': 4.243084960685395e-05, 'samples': 23452800, 'steps': 122149, 'loss/train': 1.4215692281723022} 11/07/2021 14:26:50 - INFO - __main__ - Step 122151: {'lr': 4.242789193136107e-05, 'samples': 23452992, 'steps': 122150, 'loss/train': 1.2280025482177734} 11/07/2021 14:26:50 - INFO - __main__ - Step 122152: {'lr': 4.2424934349396924e-05, 'samples': 23453184, 'steps': 122151, 'loss/train': 1.105258584022522} 11/07/2021 14:26:51 - INFO - __main__ - Step 122153: {'lr': 4.2421976860962915e-05, 'samples': 23453376, 'steps': 122152, 'loss/train': 1.4552973508834839} 11/07/2021 14:26:51 - INFO - __main__ - Step 122154: {'lr': 4.241901946606033e-05, 'samples': 23453568, 'steps': 122153, 'loss/train': 1.3828083276748657} 11/07/2021 14:26:51 - INFO - __main__ - Step 122155: {'lr': 4.2416062164690545e-05, 'samples': 23453760, 'steps': 122154, 'loss/train': 1.3783446550369263} 11/07/2021 14:26:53 - INFO - __main__ - Step 122156: {'lr': 4.2413104956854855e-05, 'samples': 23453952, 'steps': 122155, 'loss/train': 1.1877224445343018} 11/07/2021 14:26:53 - INFO - __main__ - Step 122157: {'lr': 4.241014784255465e-05, 'samples': 23454144, 'steps': 122156, 'loss/train': 1.138426661491394} 11/07/2021 14:26:53 - INFO - __main__ - Step 122158: {'lr': 4.2407190821791205e-05, 'samples': 23454336, 'steps': 122157, 'loss/train': 2.199768543243408} 11/07/2021 14:26:54 - INFO - __main__ - Step 122159: {'lr': 4.240423389456588e-05, 'samples': 23454528, 'steps': 122158, 'loss/train': 0.8913331627845764} 11/07/2021 14:26:54 - INFO - __main__ - Step 122160: {'lr': 4.240127706088001e-05, 'samples': 23454720, 'steps': 122159, 'loss/train': 1.3193249702453613} 11/07/2021 14:26:54 - INFO - __main__ - Step 122161: {'lr': 4.2398320320734926e-05, 'samples': 23454912, 'steps': 122160, 'loss/train': 1.1295080184936523} 11/07/2021 14:26:55 - INFO - __main__ - Step 122162: {'lr': 4.2395363674131964e-05, 'samples': 23455104, 'steps': 122161, 'loss/train': 1.7375974655151367} 11/07/2021 14:26:56 - INFO - __main__ - Step 122163: {'lr': 4.2392407121072426e-05, 'samples': 23455296, 'steps': 122162, 'loss/train': 1.3663487434387207} 11/07/2021 14:26:56 - INFO - __main__ - Step 122164: {'lr': 4.238945066155775e-05, 'samples': 23455488, 'steps': 122163, 'loss/train': 1.3077106475830078} 11/07/2021 14:26:56 - INFO - __main__ - Step 122165: {'lr': 4.238649429558911e-05, 'samples': 23455680, 'steps': 122164, 'loss/train': 1.672689437866211} 11/07/2021 14:26:57 - INFO - __main__ - Step 122166: {'lr': 4.238353802316791e-05, 'samples': 23455872, 'steps': 122165, 'loss/train': 1.4756718873977661} 11/07/2021 14:26:58 - INFO - __main__ - Step 122167: {'lr': 4.2380581844295466e-05, 'samples': 23456064, 'steps': 122166, 'loss/train': 1.114676594734192} 11/07/2021 14:26:58 - INFO - __main__ - Step 122168: {'lr': 4.2377625758973167e-05, 'samples': 23456256, 'steps': 122167, 'loss/train': 1.3961305618286133} 11/07/2021 14:26:59 - INFO - __main__ - Step 122169: {'lr': 4.2374669767202275e-05, 'samples': 23456448, 'steps': 122168, 'loss/train': 1.1818253993988037} 11/07/2021 14:26:59 - INFO - __main__ - Step 122170: {'lr': 4.237171386898417e-05, 'samples': 23456640, 'steps': 122169, 'loss/train': 1.2818447351455688} 11/07/2021 14:26:59 - INFO - __main__ - Step 122171: {'lr': 4.2368758064320136e-05, 'samples': 23456832, 'steps': 122170, 'loss/train': 1.454598307609558} 11/07/2021 14:27:00 - INFO - __main__ - Step 122172: {'lr': 4.236580235321158e-05, 'samples': 23457024, 'steps': 122171, 'loss/train': 1.3277711868286133} 11/07/2021 14:27:01 - INFO - __main__ - Step 122173: {'lr': 4.236284673565976e-05, 'samples': 23457216, 'steps': 122172, 'loss/train': 1.4964936971664429} 11/07/2021 14:27:01 - INFO - __main__ - Step 122174: {'lr': 4.2359891211666055e-05, 'samples': 23457408, 'steps': 122173, 'loss/train': 1.0262739658355713} 11/07/2021 14:27:01 - INFO - __main__ - Step 122175: {'lr': 4.235693578123176e-05, 'samples': 23457600, 'steps': 122174, 'loss/train': 1.208577275276184} 11/07/2021 14:27:02 - INFO - __main__ - Step 122176: {'lr': 4.2353980444358234e-05, 'samples': 23457792, 'steps': 122175, 'loss/train': 0.061084166169166565} 11/07/2021 14:27:03 - INFO - __main__ - Step 122177: {'lr': 4.2351025201046804e-05, 'samples': 23457984, 'steps': 122176, 'loss/train': 0.44294601678848267} 11/07/2021 14:27:03 - INFO - __main__ - Step 122178: {'lr': 4.234807005129887e-05, 'samples': 23458176, 'steps': 122177, 'loss/train': 1.1795804500579834} 11/07/2021 14:27:04 - INFO - __main__ - Step 122179: {'lr': 4.234511499511562e-05, 'samples': 23458368, 'steps': 122178, 'loss/train': 1.434585452079773} 11/07/2021 14:27:04 - INFO - __main__ - Step 122180: {'lr': 4.234216003249844e-05, 'samples': 23458560, 'steps': 122179, 'loss/train': 1.7255663871765137} 11/07/2021 14:27:04 - INFO - __main__ - Step 122181: {'lr': 4.233920516344869e-05, 'samples': 23458752, 'steps': 122180, 'loss/train': 1.6678341627120972} 11/07/2021 14:27:05 - INFO - __main__ - Step 122182: {'lr': 4.233625038796771e-05, 'samples': 23458944, 'steps': 122181, 'loss/train': 1.3784998655319214} 11/07/2021 14:27:06 - INFO - __main__ - Step 122183: {'lr': 4.233329570605679e-05, 'samples': 23459136, 'steps': 122182, 'loss/train': 1.3662456274032593} 11/07/2021 14:27:06 - INFO - __main__ - Step 122184: {'lr': 4.2330341117717274e-05, 'samples': 23459328, 'steps': 122183, 'loss/train': 1.2518644332885742} 11/07/2021 14:27:06 - INFO - __main__ - Step 122185: {'lr': 4.232738662295052e-05, 'samples': 23459520, 'steps': 122184, 'loss/train': 1.2941118478775024} 11/07/2021 14:27:07 - INFO - __main__ - Step 122186: {'lr': 4.232443222175783e-05, 'samples': 23459712, 'steps': 122185, 'loss/train': 1.4359328746795654} 11/07/2021 14:27:07 - INFO - __main__ - Step 122187: {'lr': 4.2321477914140564e-05, 'samples': 23459904, 'steps': 122186, 'loss/train': 1.3476121425628662} 11/07/2021 14:27:08 - INFO - __main__ - Step 122188: {'lr': 4.2318523700100006e-05, 'samples': 23460096, 'steps': 122187, 'loss/train': 1.2057852745056152} 11/07/2021 14:27:08 - INFO - __main__ - Step 122189: {'lr': 4.231556957963753e-05, 'samples': 23460288, 'steps': 122188, 'loss/train': 1.2230339050292969} 11/07/2021 14:27:09 - INFO - __main__ - Step 122190: {'lr': 4.231261555275448e-05, 'samples': 23460480, 'steps': 122189, 'loss/train': 1.4522172212600708} 11/07/2021 14:27:09 - INFO - __main__ - Step 122191: {'lr': 4.230966161945218e-05, 'samples': 23460672, 'steps': 122190, 'loss/train': 1.134679913520813} 11/07/2021 14:27:09 - INFO - __main__ - Step 122192: {'lr': 4.2306707779731916e-05, 'samples': 23460864, 'steps': 122191, 'loss/train': 1.18757963180542} 11/07/2021 14:27:10 - INFO - __main__ - Step 122193: {'lr': 4.2303754033595015e-05, 'samples': 23461056, 'steps': 122192, 'loss/train': 1.3878906965255737} 11/07/2021 14:27:11 - INFO - __main__ - Step 122194: {'lr': 4.230080038104287e-05, 'samples': 23461248, 'steps': 122193, 'loss/train': 1.5420833826065063} 11/07/2021 14:27:11 - INFO - __main__ - Step 122195: {'lr': 4.229784682207674e-05, 'samples': 23461440, 'steps': 122194, 'loss/train': 0.9818091988563538} 11/07/2021 14:27:12 - INFO - __main__ - Step 122196: {'lr': 4.2294893356698e-05, 'samples': 23461632, 'steps': 122195, 'loss/train': 1.4248294830322266} 11/07/2021 14:27:12 - INFO - __main__ - Step 122197: {'lr': 4.229193998490802e-05, 'samples': 23461824, 'steps': 122196, 'loss/train': 1.4238815307617188} 11/07/2021 14:27:12 - INFO - __main__ - Step 122198: {'lr': 4.228898670670806e-05, 'samples': 23462016, 'steps': 122197, 'loss/train': 1.6367796659469604} 11/07/2021 14:27:13 - INFO - __main__ - Step 122199: {'lr': 4.228603352209945e-05, 'samples': 23462208, 'steps': 122198, 'loss/train': 0.756469190120697} 11/07/2021 14:27:14 - INFO - __main__ - Step 122200: {'lr': 4.228308043108359e-05, 'samples': 23462400, 'steps': 122199, 'loss/train': 1.722169041633606} 11/07/2021 14:27:14 - INFO - __main__ - Step 122201: {'lr': 4.228012743366175e-05, 'samples': 23462592, 'steps': 122200, 'loss/train': 1.5886328220367432} 11/07/2021 14:27:14 - INFO - __main__ - Step 122202: {'lr': 4.227717452983526e-05, 'samples': 23462784, 'steps': 122201, 'loss/train': 1.1134998798370361} 11/07/2021 14:27:15 - INFO - __main__ - Step 122203: {'lr': 4.2274221719605514e-05, 'samples': 23462976, 'steps': 122202, 'loss/train': 1.0109596252441406} 11/07/2021 14:27:16 - INFO - __main__ - Step 122204: {'lr': 4.2271269002973816e-05, 'samples': 23463168, 'steps': 122203, 'loss/train': 1.5468720197677612} 11/07/2021 14:27:16 - INFO - __main__ - Step 122205: {'lr': 4.226831637994144e-05, 'samples': 23463360, 'steps': 122204, 'loss/train': 1.464060664176941} 11/07/2021 14:27:17 - INFO - __main__ - Step 122206: {'lr': 4.226536385050975e-05, 'samples': 23463552, 'steps': 122205, 'loss/train': 0.11482524871826172} 11/07/2021 14:27:17 - INFO - __main__ - Step 122207: {'lr': 4.22624114146801e-05, 'samples': 23463744, 'steps': 122206, 'loss/train': 1.4956753253936768} 11/07/2021 14:27:18 - INFO - __main__ - Step 122208: {'lr': 4.2259459072453794e-05, 'samples': 23463936, 'steps': 122207, 'loss/train': 1.0297493934631348} 11/07/2021 14:27:18 - INFO - __main__ - Step 122209: {'lr': 4.225650682383214e-05, 'samples': 23464128, 'steps': 122208, 'loss/train': 0.40336480736732483} 11/07/2021 14:27:19 - INFO - __main__ - Step 122210: {'lr': 4.225355466881653e-05, 'samples': 23464320, 'steps': 122209, 'loss/train': 1.8066387176513672} 11/07/2021 14:27:19 - INFO - __main__ - Step 122211: {'lr': 4.225060260740826e-05, 'samples': 23464512, 'steps': 122210, 'loss/train': 0.9286011457443237} 11/07/2021 14:27:20 - INFO - __main__ - Step 122212: {'lr': 4.224765063960864e-05, 'samples': 23464704, 'steps': 122211, 'loss/train': 1.1445302963256836} 11/07/2021 14:27:20 - INFO - __main__ - Step 122213: {'lr': 4.224469876541903e-05, 'samples': 23464896, 'steps': 122212, 'loss/train': 1.3685288429260254} 11/07/2021 14:27:21 - INFO - __main__ - Step 122214: {'lr': 4.224174698484079e-05, 'samples': 23465088, 'steps': 122213, 'loss/train': 1.0846810340881348} 11/07/2021 14:27:21 - INFO - __main__ - Step 122215: {'lr': 4.223879529787517e-05, 'samples': 23465280, 'steps': 122214, 'loss/train': 1.5257195234298706} 11/07/2021 14:27:22 - INFO - __main__ - Step 122216: {'lr': 4.223584370452355e-05, 'samples': 23465472, 'steps': 122215, 'loss/train': 1.4172532558441162} 11/07/2021 14:27:22 - INFO - __main__ - Step 122217: {'lr': 4.223289220478726e-05, 'samples': 23465664, 'steps': 122216, 'loss/train': 1.0649709701538086} 11/07/2021 14:27:22 - INFO - __main__ - Step 122218: {'lr': 4.222994079866768e-05, 'samples': 23465856, 'steps': 122217, 'loss/train': 1.463423252105713} 11/07/2021 14:27:24 - INFO - __main__ - Step 122219: {'lr': 4.222698948616604e-05, 'samples': 23466048, 'steps': 122218, 'loss/train': 1.146089792251587} 11/07/2021 14:27:24 - INFO - __main__ - Step 122220: {'lr': 4.222403826728369e-05, 'samples': 23466240, 'steps': 122219, 'loss/train': 1.1676470041275024} 11/07/2021 14:27:25 - INFO - __main__ - Step 122221: {'lr': 4.2221087142022e-05, 'samples': 23466432, 'steps': 122220, 'loss/train': 1.2890007495880127} 11/07/2021 14:27:25 - INFO - __main__ - Step 122222: {'lr': 4.221813611038228e-05, 'samples': 23466624, 'steps': 122221, 'loss/train': 0.49861642718315125} 11/07/2021 14:27:25 - INFO - __main__ - Step 122223: {'lr': 4.221518517236583e-05, 'samples': 23466816, 'steps': 122222, 'loss/train': 0.8962599039077759} 11/07/2021 14:27:26 - INFO - __main__ - Step 122224: {'lr': 4.221223432797405e-05, 'samples': 23467008, 'steps': 122223, 'loss/train': 1.0860416889190674} 11/07/2021 14:27:27 - INFO - __main__ - Step 122225: {'lr': 4.220928357720821e-05, 'samples': 23467200, 'steps': 122224, 'loss/train': 0.7494337558746338} 11/07/2021 14:27:27 - INFO - __main__ - Step 122226: {'lr': 4.2206332920069675e-05, 'samples': 23467392, 'steps': 122225, 'loss/train': 1.1978150606155396} 11/07/2021 14:27:27 - INFO - __main__ - Step 122227: {'lr': 4.220338235655974e-05, 'samples': 23467584, 'steps': 122226, 'loss/train': 1.333958625793457} 11/07/2021 14:27:28 - INFO - __main__ - Step 122228: {'lr': 4.220043188667977e-05, 'samples': 23467776, 'steps': 122227, 'loss/train': 1.1101049184799194} 11/07/2021 14:27:28 - INFO - __main__ - Step 122229: {'lr': 4.219748151043107e-05, 'samples': 23467968, 'steps': 122228, 'loss/train': 1.0993796586990356} 11/07/2021 14:27:29 - INFO - __main__ - Step 122230: {'lr': 4.219453122781505e-05, 'samples': 23468160, 'steps': 122229, 'loss/train': 1.2795629501342773} 11/07/2021 14:27:29 - INFO - __main__ - Step 122231: {'lr': 4.219158103883289e-05, 'samples': 23468352, 'steps': 122230, 'loss/train': 1.1146032810211182} 11/07/2021 14:27:30 - INFO - __main__ - Step 122232: {'lr': 4.218863094348602e-05, 'samples': 23468544, 'steps': 122231, 'loss/train': 1.2056777477264404} 11/07/2021 14:27:30 - INFO - __main__ - Step 122233: {'lr': 4.2185680941775714e-05, 'samples': 23468736, 'steps': 122232, 'loss/train': 1.7846381664276123} 11/07/2021 14:27:31 - INFO - __main__ - Step 122234: {'lr': 4.218273103370335e-05, 'samples': 23468928, 'steps': 122233, 'loss/train': 1.231961965560913} 11/07/2021 14:27:31 - INFO - __main__ - Step 122235: {'lr': 4.217978121927024e-05, 'samples': 23469120, 'steps': 122234, 'loss/train': 1.5635545253753662} 11/07/2021 14:27:32 - INFO - __main__ - Step 122236: {'lr': 4.217683149847773e-05, 'samples': 23469312, 'steps': 122235, 'loss/train': 1.2678083181381226} 11/07/2021 14:27:32 - INFO - __main__ - Step 122237: {'lr': 4.217388187132712e-05, 'samples': 23469504, 'steps': 122236, 'loss/train': 1.1412453651428223} 11/07/2021 14:27:33 - INFO - __main__ - Step 122238: {'lr': 4.217093233781974e-05, 'samples': 23469696, 'steps': 122237, 'loss/train': 1.1130636930465698} 11/07/2021 14:27:33 - INFO - __main__ - Step 122239: {'lr': 4.216798289795695e-05, 'samples': 23469888, 'steps': 122238, 'loss/train': 0.9453576803207397} 11/07/2021 14:27:34 - INFO - __main__ - Step 122240: {'lr': 4.216503355174006e-05, 'samples': 23470080, 'steps': 122239, 'loss/train': 1.062127709388733} 11/07/2021 14:27:34 - INFO - __main__ - Step 122241: {'lr': 4.2162084299170455e-05, 'samples': 23470272, 'steps': 122240, 'loss/train': 0.909765899181366} 11/07/2021 14:27:35 - INFO - __main__ - Step 122242: {'lr': 4.215913514024933e-05, 'samples': 23470464, 'steps': 122241, 'loss/train': 1.6478337049484253} 11/07/2021 14:27:35 - INFO - __main__ - Step 122243: {'lr': 4.215618607497812e-05, 'samples': 23470656, 'steps': 122242, 'loss/train': 1.5626169443130493} 11/07/2021 14:27:35 - INFO - __main__ - Step 122244: {'lr': 4.2153237103358114e-05, 'samples': 23470848, 'steps': 122243, 'loss/train': 1.7481571435928345} 11/07/2021 14:27:36 - INFO - __main__ - Step 122245: {'lr': 4.215028822539063e-05, 'samples': 23471040, 'steps': 122244, 'loss/train': 1.1677594184875488} 11/07/2021 14:27:37 - INFO - __main__ - Step 122246: {'lr': 4.214733944107704e-05, 'samples': 23471232, 'steps': 122245, 'loss/train': 0.05274851992726326} 11/07/2021 14:27:37 - INFO - __main__ - Step 122247: {'lr': 4.214439075041865e-05, 'samples': 23471424, 'steps': 122246, 'loss/train': 1.4368438720703125} 11/07/2021 14:27:38 - INFO - __main__ - Step 122248: {'lr': 4.214144215341678e-05, 'samples': 23471616, 'steps': 122247, 'loss/train': 1.0456138849258423} 11/07/2021 14:27:38 - INFO - __main__ - Step 122249: {'lr': 4.21384936500728e-05, 'samples': 23471808, 'steps': 122248, 'loss/train': 1.4070192575454712} 11/07/2021 14:27:38 - INFO - __main__ - Step 122250: {'lr': 4.2135545240387984e-05, 'samples': 23472000, 'steps': 122249, 'loss/train': 1.1721111536026} 11/07/2021 14:27:39 - INFO - __main__ - Step 122251: {'lr': 4.213259692436367e-05, 'samples': 23472192, 'steps': 122250, 'loss/train': 1.3874866962432861} 11/07/2021 14:27:40 - INFO - __main__ - Step 122252: {'lr': 4.2129648702001285e-05, 'samples': 23472384, 'steps': 122251, 'loss/train': 1.2776910066604614} 11/07/2021 14:27:40 - INFO - __main__ - Step 122253: {'lr': 4.212670057330201e-05, 'samples': 23472576, 'steps': 122252, 'loss/train': 1.722408652305603} 11/07/2021 14:27:40 - INFO - __main__ - Step 122254: {'lr': 4.2123752538267226e-05, 'samples': 23472768, 'steps': 122253, 'loss/train': 1.238885760307312} 11/07/2021 14:27:41 - INFO - __main__ - Step 122255: {'lr': 4.2120804596898266e-05, 'samples': 23472960, 'steps': 122254, 'loss/train': 1.0507615804672241} 11/07/2021 14:27:42 - INFO - __main__ - Step 122256: {'lr': 4.2117856749196466e-05, 'samples': 23473152, 'steps': 122255, 'loss/train': 1.3307609558105469} 11/07/2021 14:27:42 - INFO - __main__ - Step 122257: {'lr': 4.2114908995163154e-05, 'samples': 23473344, 'steps': 122256, 'loss/train': 1.3210777044296265} 11/07/2021 14:27:42 - INFO - __main__ - Step 122258: {'lr': 4.2111961334799665e-05, 'samples': 23473536, 'steps': 122257, 'loss/train': 1.0860923528671265} 11/07/2021 14:27:43 - INFO - __main__ - Step 122259: {'lr': 4.210901376810733e-05, 'samples': 23473728, 'steps': 122258, 'loss/train': 1.3687041997909546} 11/07/2021 14:27:43 - INFO - __main__ - Step 122260: {'lr': 4.210606629508743e-05, 'samples': 23473920, 'steps': 122259, 'loss/train': 1.2133368253707886} 11/07/2021 14:27:44 - INFO - __main__ - Step 122261: {'lr': 4.210311891574134e-05, 'samples': 23474112, 'steps': 122260, 'loss/train': 0.6907863020896912} 11/07/2021 14:27:44 - INFO - __main__ - Step 122262: {'lr': 4.210017163007046e-05, 'samples': 23474304, 'steps': 122261, 'loss/train': 1.4529989957809448} 11/07/2021 14:27:45 - INFO - __main__ - Step 122263: {'lr': 4.209722443807595e-05, 'samples': 23474496, 'steps': 122262, 'loss/train': 1.6392405033111572} 11/07/2021 14:27:45 - INFO - __main__ - Step 122264: {'lr': 4.209427733975923e-05, 'samples': 23474688, 'steps': 122263, 'loss/train': 1.2918550968170166} 11/07/2021 14:27:45 - INFO - __main__ - Step 122265: {'lr': 4.2091330335121637e-05, 'samples': 23474880, 'steps': 122264, 'loss/train': 1.3824646472930908} 11/07/2021 14:27:47 - INFO - __main__ - Step 122266: {'lr': 4.208838342416446e-05, 'samples': 23475072, 'steps': 122265, 'loss/train': 1.1491446495056152} 11/07/2021 14:27:47 - INFO - __main__ - Step 122267: {'lr': 4.2085436606889044e-05, 'samples': 23475264, 'steps': 122266, 'loss/train': 1.5657031536102295} 11/07/2021 14:27:48 - INFO - __main__ - Step 122268: {'lr': 4.2082489883296744e-05, 'samples': 23475456, 'steps': 122267, 'loss/train': 1.3412142992019653} 11/07/2021 14:27:48 - INFO - __main__ - Step 122269: {'lr': 4.207954325338886e-05, 'samples': 23475648, 'steps': 122268, 'loss/train': 1.5433646440505981} 11/07/2021 14:27:48 - INFO - __main__ - Step 122270: {'lr': 4.207659671716671e-05, 'samples': 23475840, 'steps': 122269, 'loss/train': 1.3159202337265015} 11/07/2021 14:27:49 - INFO - __main__ - Step 122271: {'lr': 4.207365027463164e-05, 'samples': 23476032, 'steps': 122270, 'loss/train': 0.8550270199775696} 11/07/2021 14:27:50 - INFO - __main__ - Step 122272: {'lr': 4.2070703925784994e-05, 'samples': 23476224, 'steps': 122271, 'loss/train': 0.8133010864257812} 11/07/2021 14:27:50 - INFO - __main__ - Step 122273: {'lr': 4.2067757670628124e-05, 'samples': 23476416, 'steps': 122272, 'loss/train': 0.9313228726387024} 11/07/2021 14:27:50 - INFO - __main__ - Step 122274: {'lr': 4.206481150916227e-05, 'samples': 23476608, 'steps': 122273, 'loss/train': 1.3612715005874634} 11/07/2021 14:27:51 - INFO - __main__ - Step 122275: {'lr': 4.206186544138879e-05, 'samples': 23476800, 'steps': 122274, 'loss/train': 0.7537388801574707} 11/07/2021 14:27:51 - INFO - __main__ - Step 122276: {'lr': 4.205891946730903e-05, 'samples': 23476992, 'steps': 122275, 'loss/train': 1.3870670795440674} 11/07/2021 14:27:52 - INFO - __main__ - Step 122277: {'lr': 4.205597358692431e-05, 'samples': 23477184, 'steps': 122276, 'loss/train': 1.1464788913726807} 11/07/2021 14:27:53 - INFO - __main__ - Step 122278: {'lr': 4.2053027800235955e-05, 'samples': 23477376, 'steps': 122277, 'loss/train': 0.06284916400909424} 11/07/2021 14:27:53 - INFO - __main__ - Step 122279: {'lr': 4.2050082107245284e-05, 'samples': 23477568, 'steps': 122278, 'loss/train': 1.3682715892791748} 11/07/2021 14:27:53 - INFO - __main__ - Step 122280: {'lr': 4.204713650795366e-05, 'samples': 23477760, 'steps': 122279, 'loss/train': 1.9219956398010254} 11/07/2021 14:27:54 - INFO - __main__ - Step 122281: {'lr': 4.20441910023624e-05, 'samples': 23477952, 'steps': 122280, 'loss/train': 1.2219789028167725} 11/07/2021 14:27:55 - INFO - __main__ - Step 122282: {'lr': 4.204124559047279e-05, 'samples': 23478144, 'steps': 122281, 'loss/train': 1.3853561878204346} 11/07/2021 14:27:55 - INFO - __main__ - Step 122283: {'lr': 4.2038300272286195e-05, 'samples': 23478336, 'steps': 122282, 'loss/train': 1.1754924058914185} 11/07/2021 14:27:56 - INFO - __main__ - Step 122284: {'lr': 4.203535504780392e-05, 'samples': 23478528, 'steps': 122283, 'loss/train': 1.5489411354064941} 11/07/2021 14:27:56 - INFO - __main__ - Step 122285: {'lr': 4.203240991702739e-05, 'samples': 23478720, 'steps': 122284, 'loss/train': 1.16068434715271} 11/07/2021 14:27:56 - INFO - __main__ - Step 122286: {'lr': 4.2029464879957765e-05, 'samples': 23478912, 'steps': 122285, 'loss/train': 1.4027206897735596} 11/07/2021 14:27:57 - INFO - __main__ - Step 122287: {'lr': 4.202651993659648e-05, 'samples': 23479104, 'steps': 122286, 'loss/train': 1.071995496749878} 11/07/2021 14:27:58 - INFO - __main__ - Step 122288: {'lr': 4.202357508694482e-05, 'samples': 23479296, 'steps': 122287, 'loss/train': 1.4034262895584106} 11/07/2021 14:27:58 - INFO - __main__ - Step 122289: {'lr': 4.2020630331004115e-05, 'samples': 23479488, 'steps': 122288, 'loss/train': 1.4814611673355103} 11/07/2021 14:27:58 - INFO - __main__ - Step 122290: {'lr': 4.201768566877573e-05, 'samples': 23479680, 'steps': 122289, 'loss/train': 1.0388461351394653} 11/07/2021 14:27:59 - INFO - __main__ - Step 122291: {'lr': 4.2014741100260933e-05, 'samples': 23479872, 'steps': 122290, 'loss/train': 1.0399322509765625} 11/07/2021 14:27:59 - INFO - __main__ - Step 122292: {'lr': 4.201179662546112e-05, 'samples': 23480064, 'steps': 122291, 'loss/train': 0.956809937953949} 11/07/2021 14:28:00 - INFO - __main__ - Step 122293: {'lr': 4.200885224437756e-05, 'samples': 23480256, 'steps': 122292, 'loss/train': 1.2457183599472046} 11/07/2021 14:28:00 - INFO - __main__ - Step 122294: {'lr': 4.200590795701162e-05, 'samples': 23480448, 'steps': 122293, 'loss/train': 1.2393627166748047} 11/07/2021 14:28:01 - INFO - __main__ - Step 122295: {'lr': 4.2002963763364574e-05, 'samples': 23480640, 'steps': 122294, 'loss/train': 1.020492434501648} 11/07/2021 14:28:01 - INFO - __main__ - Step 122296: {'lr': 4.200001966343781e-05, 'samples': 23480832, 'steps': 122295, 'loss/train': 1.032646656036377} 11/07/2021 14:28:01 - INFO - __main__ - Step 122297: {'lr': 4.1997075657232626e-05, 'samples': 23481024, 'steps': 122296, 'loss/train': 1.070901870727539} 11/07/2021 14:28:02 - INFO - __main__ - Step 122298: {'lr': 4.199413174475036e-05, 'samples': 23481216, 'steps': 122297, 'loss/train': 0.8481626510620117} 11/07/2021 14:28:03 - INFO - __main__ - Step 122299: {'lr': 4.199118792599238e-05, 'samples': 23481408, 'steps': 122298, 'loss/train': 0.930567741394043} 11/07/2021 14:28:03 - INFO - __main__ - Step 122300: {'lr': 4.1988244200959894e-05, 'samples': 23481600, 'steps': 122299, 'loss/train': 1.217759609222412} 11/07/2021 14:28:03 - INFO - __main__ - Step 122301: {'lr': 4.19853005696543e-05, 'samples': 23481792, 'steps': 122300, 'loss/train': 0.7090765833854675} 11/07/2021 14:28:04 - INFO - __main__ - Step 122302: {'lr': 4.1982357032076896e-05, 'samples': 23481984, 'steps': 122301, 'loss/train': 1.10731041431427} 11/07/2021 14:28:05 - INFO - __main__ - Step 122303: {'lr': 4.197941358822907e-05, 'samples': 23482176, 'steps': 122302, 'loss/train': 0.6541736721992493} 11/07/2021 14:28:05 - INFO - __main__ - Step 122304: {'lr': 4.197647023811207e-05, 'samples': 23482368, 'steps': 122303, 'loss/train': 1.4060417413711548} 11/07/2021 14:28:05 - INFO - __main__ - Step 122305: {'lr': 4.197352698172729e-05, 'samples': 23482560, 'steps': 122304, 'loss/train': 1.469959020614624} 11/07/2021 14:28:06 - INFO - __main__ - Step 122306: {'lr': 4.1970583819076036e-05, 'samples': 23482752, 'steps': 122305, 'loss/train': 1.0323125123977661} 11/07/2021 14:28:06 - INFO - __main__ - Step 122307: {'lr': 4.196764075015963e-05, 'samples': 23482944, 'steps': 122306, 'loss/train': 1.5087169408798218} 11/07/2021 14:28:07 - INFO - __main__ - Step 122308: {'lr': 4.1964697774979385e-05, 'samples': 23483136, 'steps': 122307, 'loss/train': 1.488206148147583} 11/07/2021 14:28:08 - INFO - __main__ - Step 122309: {'lr': 4.1961754893536624e-05, 'samples': 23483328, 'steps': 122308, 'loss/train': 1.3162822723388672} 11/07/2021 14:28:08 - INFO - __main__ - Step 122310: {'lr': 4.195881210583269e-05, 'samples': 23483520, 'steps': 122309, 'loss/train': 1.4561980962753296} 11/07/2021 14:28:08 - INFO - __main__ - Step 122311: {'lr': 4.195586941186891e-05, 'samples': 23483712, 'steps': 122310, 'loss/train': 1.3496493101119995} 11/07/2021 14:28:09 - INFO - __main__ - Step 122312: {'lr': 4.195292681164667e-05, 'samples': 23483904, 'steps': 122311, 'loss/train': 1.1789554357528687} 11/07/2021 14:28:10 - INFO - __main__ - Step 122313: {'lr': 4.1949984305167166e-05, 'samples': 23484096, 'steps': 122312, 'loss/train': 1.255628228187561} 11/07/2021 14:28:10 - INFO - __main__ - Step 122314: {'lr': 4.194704189243179e-05, 'samples': 23484288, 'steps': 122313, 'loss/train': 1.588565707206726} 11/07/2021 14:28:10 - INFO - __main__ - Step 122315: {'lr': 4.194409957344186e-05, 'samples': 23484480, 'steps': 122314, 'loss/train': 0.8562589883804321} 11/07/2021 14:28:11 - INFO - __main__ - Step 122316: {'lr': 4.19411573481987e-05, 'samples': 23484672, 'steps': 122315, 'loss/train': 1.1975769996643066} 11/07/2021 14:28:11 - INFO - __main__ - Step 122317: {'lr': 4.1938215216703654e-05, 'samples': 23484864, 'steps': 122316, 'loss/train': 1.4701920747756958} 11/07/2021 14:28:12 - INFO - __main__ - Step 122318: {'lr': 4.193527317895807e-05, 'samples': 23485056, 'steps': 122317, 'loss/train': 2.0638837814331055} 11/07/2021 14:28:13 - INFO - __main__ - Step 122319: {'lr': 4.193233123496321e-05, 'samples': 23485248, 'steps': 122318, 'loss/train': 1.3036847114562988} 11/07/2021 14:28:13 - INFO - __main__ - Step 122320: {'lr': 4.192938938472041e-05, 'samples': 23485440, 'steps': 122319, 'loss/train': 3.53737473487854} 11/07/2021 14:28:13 - INFO - __main__ - Step 122321: {'lr': 4.1926447628231056e-05, 'samples': 23485632, 'steps': 122320, 'loss/train': 4.22914457321167} 11/07/2021 14:28:14 - INFO - __main__ - Step 122322: {'lr': 4.192350596549641e-05, 'samples': 23485824, 'steps': 122321, 'loss/train': 1.3858681917190552} 11/07/2021 14:28:14 - INFO - __main__ - Step 122323: {'lr': 4.192056439651784e-05, 'samples': 23486016, 'steps': 122322, 'loss/train': 0.872980535030365} 11/07/2021 14:28:15 - INFO - __main__ - Step 122324: {'lr': 4.1917622921296636e-05, 'samples': 23486208, 'steps': 122323, 'loss/train': 1.4368906021118164} 11/07/2021 14:28:16 - INFO - __main__ - Step 122325: {'lr': 4.191468153983419e-05, 'samples': 23486400, 'steps': 122324, 'loss/train': 1.0037332773208618} 11/07/2021 14:28:16 - INFO - __main__ - Step 122326: {'lr': 4.1911740252131734e-05, 'samples': 23486592, 'steps': 122325, 'loss/train': 1.2788769006729126} 11/07/2021 14:28:16 - INFO - __main__ - Step 122327: {'lr': 4.190879905819065e-05, 'samples': 23486784, 'steps': 122326, 'loss/train': 1.7203655242919922} 11/07/2021 14:28:17 - INFO - __main__ - Step 122328: {'lr': 4.1905857958012245e-05, 'samples': 23486976, 'steps': 122327, 'loss/train': 1.0127986669540405} 11/07/2021 14:28:18 - INFO - __main__ - Step 122329: {'lr': 4.1902916951597815e-05, 'samples': 23487168, 'steps': 122328, 'loss/train': 1.5620297193527222} 11/07/2021 14:28:18 - INFO - __main__ - Step 122330: {'lr': 4.189997603894877e-05, 'samples': 23487360, 'steps': 122329, 'loss/train': 1.270112156867981} 11/07/2021 14:28:19 - INFO - __main__ - Step 122331: {'lr': 4.1897035220066354e-05, 'samples': 23487552, 'steps': 122330, 'loss/train': 0.9966835975646973} 11/07/2021 14:28:19 - INFO - __main__ - Step 122332: {'lr': 4.189409449495193e-05, 'samples': 23487744, 'steps': 122331, 'loss/train': 1.2243036031723022} 11/07/2021 14:28:19 - INFO - __main__ - Step 122333: {'lr': 4.1891153863606815e-05, 'samples': 23487936, 'steps': 122332, 'loss/train': 0.8679396510124207} 11/07/2021 14:28:20 - INFO - __main__ - Step 122334: {'lr': 4.188821332603232e-05, 'samples': 23488128, 'steps': 122333, 'loss/train': 1.3974841833114624} 11/07/2021 14:28:21 - INFO - __main__ - Step 122335: {'lr': 4.188527288222979e-05, 'samples': 23488320, 'steps': 122334, 'loss/train': 1.4047248363494873} 11/07/2021 14:28:21 - INFO - __main__ - Step 122336: {'lr': 4.1882332532200557e-05, 'samples': 23488512, 'steps': 122335, 'loss/train': 1.6163939237594604} 11/07/2021 14:28:21 - INFO - __main__ - Step 122337: {'lr': 4.187939227594595e-05, 'samples': 23488704, 'steps': 122336, 'loss/train': 1.2523722648620605} 11/07/2021 14:28:22 - INFO - __main__ - Step 122338: {'lr': 4.1876452113467246e-05, 'samples': 23488896, 'steps': 122337, 'loss/train': 1.722306489944458} 11/07/2021 14:28:22 - INFO - __main__ - Step 122339: {'lr': 4.187351204476586e-05, 'samples': 23489088, 'steps': 122338, 'loss/train': 1.1766988039016724} 11/07/2021 14:28:23 - INFO - __main__ - Step 122340: {'lr': 4.187057206984302e-05, 'samples': 23489280, 'steps': 122339, 'loss/train': 1.3796018362045288} 11/07/2021 14:28:23 - INFO - __main__ - Step 122341: {'lr': 4.1867632188700075e-05, 'samples': 23489472, 'steps': 122340, 'loss/train': 5.710327625274658} 11/07/2021 14:28:24 - INFO - __main__ - Step 122342: {'lr': 4.186469240133836e-05, 'samples': 23489664, 'steps': 122341, 'loss/train': 0.9076054096221924} 11/07/2021 14:28:24 - INFO - __main__ - Step 122343: {'lr': 4.186175270775922e-05, 'samples': 23489856, 'steps': 122342, 'loss/train': 1.4245532751083374} 11/07/2021 14:28:25 - INFO - __main__ - Step 122344: {'lr': 4.185881310796397e-05, 'samples': 23490048, 'steps': 122343, 'loss/train': 1.00174081325531} 11/07/2021 14:28:25 - INFO - __main__ - Step 122345: {'lr': 4.185587360195389e-05, 'samples': 23490240, 'steps': 122344, 'loss/train': 0.9332665801048279} 11/07/2021 14:28:26 - INFO - __main__ - Step 122346: {'lr': 4.185293418973035e-05, 'samples': 23490432, 'steps': 122345, 'loss/train': 1.1441830396652222} 11/07/2021 14:28:27 - INFO - __main__ - Step 122347: {'lr': 4.184999487129468e-05, 'samples': 23490624, 'steps': 122346, 'loss/train': 1.414169430732727} 11/07/2021 14:28:27 - INFO - __main__ - Step 122348: {'lr': 4.18470556466482e-05, 'samples': 23490816, 'steps': 122347, 'loss/train': 1.448915719985962} 11/07/2021 14:28:27 - INFO - __main__ - Step 122349: {'lr': 4.18441165157922e-05, 'samples': 23491008, 'steps': 122348, 'loss/train': 1.3612369298934937} 11/07/2021 14:28:28 - INFO - __main__ - Step 122350: {'lr': 4.184117747872804e-05, 'samples': 23491200, 'steps': 122349, 'loss/train': 1.01313054561615} 11/07/2021 14:28:28 - INFO - __main__ - Step 122351: {'lr': 4.183823853545704e-05, 'samples': 23491392, 'steps': 122350, 'loss/train': 0.9175783395767212} 11/07/2021 14:28:28 - INFO - __main__ - Step 122352: {'lr': 4.183529968598057e-05, 'samples': 23491584, 'steps': 122351, 'loss/train': 1.4239689111709595} 11/07/2021 14:28:30 - INFO - __main__ - Step 122353: {'lr': 4.183236093029985e-05, 'samples': 23491776, 'steps': 122352, 'loss/train': 0.8158003091812134} 11/07/2021 14:28:30 - INFO - __main__ - Step 122354: {'lr': 4.182942226841624e-05, 'samples': 23491968, 'steps': 122353, 'loss/train': 1.0481171607971191} 11/07/2021 14:28:30 - INFO - __main__ - Step 122355: {'lr': 4.18264837003311e-05, 'samples': 23492160, 'steps': 122354, 'loss/train': 1.2156778573989868} 11/07/2021 14:28:31 - INFO - __main__ - Step 122356: {'lr': 4.182354522604573e-05, 'samples': 23492352, 'steps': 122355, 'loss/train': 0.7185439467430115} 11/07/2021 14:28:31 - INFO - __main__ - Step 122357: {'lr': 4.1820606845561435e-05, 'samples': 23492544, 'steps': 122356, 'loss/train': 1.0316017866134644} 11/07/2021 14:28:32 - INFO - __main__ - Step 122358: {'lr': 4.1817668558879586e-05, 'samples': 23492736, 'steps': 122357, 'loss/train': 1.0362858772277832} 11/07/2021 14:28:32 - INFO - __main__ - Step 122359: {'lr': 4.181473036600147e-05, 'samples': 23492928, 'steps': 122358, 'loss/train': 1.7502285242080688} 11/07/2021 14:28:33 - INFO - __main__ - Step 122360: {'lr': 4.181179226692844e-05, 'samples': 23493120, 'steps': 122359, 'loss/train': 1.2709629535675049} 11/07/2021 14:28:33 - INFO - __main__ - Step 122361: {'lr': 4.1808854261661786e-05, 'samples': 23493312, 'steps': 122360, 'loss/train': 1.735646367073059} 11/07/2021 14:28:33 - INFO - __main__ - Step 122362: {'lr': 4.180591635020287e-05, 'samples': 23493504, 'steps': 122361, 'loss/train': 1.4914238452911377} 11/07/2021 14:28:35 - INFO - __main__ - Step 122363: {'lr': 4.1802978532552964e-05, 'samples': 23493696, 'steps': 122362, 'loss/train': 1.670962929725647} 11/07/2021 14:28:35 - INFO - __main__ - Step 122364: {'lr': 4.180004080871347e-05, 'samples': 23493888, 'steps': 122363, 'loss/train': 1.0754623413085938} 11/07/2021 14:28:35 - INFO - __main__ - Step 122365: {'lr': 4.179710317868563e-05, 'samples': 23494080, 'steps': 122364, 'loss/train': 1.5135055780410767} 11/07/2021 14:28:36 - INFO - __main__ - Step 122366: {'lr': 4.179416564247085e-05, 'samples': 23494272, 'steps': 122365, 'loss/train': 1.4676364660263062} 11/07/2021 14:28:36 - INFO - __main__ - Step 122367: {'lr': 4.179122820007039e-05, 'samples': 23494464, 'steps': 122366, 'loss/train': 1.144034743309021} 11/07/2021 14:28:37 - INFO - __main__ - Step 122368: {'lr': 4.1788290851485556e-05, 'samples': 23494656, 'steps': 122367, 'loss/train': 1.8793189525604248} 11/07/2021 14:28:37 - INFO - __main__ - Step 122369: {'lr': 4.17853535967177e-05, 'samples': 23494848, 'steps': 122368, 'loss/train': 1.1290454864501953} 11/07/2021 14:28:38 - INFO - __main__ - Step 122370: {'lr': 4.178241643576819e-05, 'samples': 23495040, 'steps': 122369, 'loss/train': 1.4586552381515503} 11/07/2021 14:28:38 - INFO - __main__ - Step 122371: {'lr': 4.177947936863827e-05, 'samples': 23495232, 'steps': 122370, 'loss/train': 1.8670395612716675} 11/07/2021 14:28:38 - INFO - __main__ - Step 122372: {'lr': 4.177654239532933e-05, 'samples': 23495424, 'steps': 122371, 'loss/train': 1.5908925533294678} 11/07/2021 14:28:39 - INFO - __main__ - Step 122373: {'lr': 4.177360551584264e-05, 'samples': 23495616, 'steps': 122372, 'loss/train': 0.5856866836547852} 11/07/2021 14:28:40 - INFO - __main__ - Step 122374: {'lr': 4.177066873017957e-05, 'samples': 23495808, 'steps': 122373, 'loss/train': 1.535530924797058} 11/07/2021 14:28:40 - INFO - __main__ - Step 122375: {'lr': 4.176773203834142e-05, 'samples': 23496000, 'steps': 122374, 'loss/train': 1.539415717124939} 11/07/2021 14:28:40 - INFO - __main__ - Step 122376: {'lr': 4.176479544032952e-05, 'samples': 23496192, 'steps': 122375, 'loss/train': 1.3038458824157715} 11/07/2021 14:28:41 - INFO - __main__ - Step 122377: {'lr': 4.176185893614517e-05, 'samples': 23496384, 'steps': 122376, 'loss/train': 0.9264252781867981} 11/07/2021 14:28:42 - INFO - __main__ - Step 122378: {'lr': 4.175892252578975e-05, 'samples': 23496576, 'steps': 122377, 'loss/train': 0.831622838973999} 11/07/2021 14:28:42 - INFO - __main__ - Step 122379: {'lr': 4.175598620926457e-05, 'samples': 23496768, 'steps': 122378, 'loss/train': 1.3792304992675781} 11/07/2021 14:28:43 - INFO - __main__ - Step 122380: {'lr': 4.17530499865709e-05, 'samples': 23496960, 'steps': 122379, 'loss/train': 1.1866600513458252} 11/07/2021 14:28:43 - INFO - __main__ - Step 122381: {'lr': 4.1750113857710076e-05, 'samples': 23497152, 'steps': 122380, 'loss/train': 1.2048907279968262} 11/07/2021 14:28:43 - INFO - __main__ - Step 122382: {'lr': 4.174717782268345e-05, 'samples': 23497344, 'steps': 122381, 'loss/train': 1.9899653196334839} 11/07/2021 14:28:44 - INFO - __main__ - Step 122383: {'lr': 4.174424188149231e-05, 'samples': 23497536, 'steps': 122382, 'loss/train': 1.6167937517166138} 11/07/2021 14:28:45 - INFO - __main__ - Step 122384: {'lr': 4.1741306034138006e-05, 'samples': 23497728, 'steps': 122383, 'loss/train': 1.011712670326233} 11/07/2021 14:28:45 - INFO - __main__ - Step 122385: {'lr': 4.173837028062186e-05, 'samples': 23497920, 'steps': 122384, 'loss/train': 1.4866034984588623} 11/07/2021 14:28:45 - INFO - __main__ - Step 122386: {'lr': 4.17354346209452e-05, 'samples': 23498112, 'steps': 122385, 'loss/train': 1.2444103956222534} 11/07/2021 14:28:46 - INFO - __main__ - Step 122387: {'lr': 4.1732499055109344e-05, 'samples': 23498304, 'steps': 122386, 'loss/train': 1.122247576713562} 11/07/2021 14:28:47 - INFO - __main__ - Step 122388: {'lr': 4.172956358311558e-05, 'samples': 23498496, 'steps': 122387, 'loss/train': 1.6299678087234497} 11/07/2021 14:28:47 - INFO - __main__ - Step 122389: {'lr': 4.172662820496528e-05, 'samples': 23498688, 'steps': 122388, 'loss/train': 1.187890887260437} 11/07/2021 14:28:48 - INFO - __main__ - Step 122390: {'lr': 4.172369292065975e-05, 'samples': 23498880, 'steps': 122389, 'loss/train': 1.2293885946273804} 11/07/2021 14:28:48 - INFO - __main__ - Step 122391: {'lr': 4.1720757730200315e-05, 'samples': 23499072, 'steps': 122390, 'loss/train': 2.30204439163208} 11/07/2021 14:28:48 - INFO - __main__ - Step 122392: {'lr': 4.1717822633588284e-05, 'samples': 23499264, 'steps': 122391, 'loss/train': 1.2441340684890747} 11/07/2021 14:28:49 - INFO - __main__ - Step 122393: {'lr': 4.171488763082504e-05, 'samples': 23499456, 'steps': 122392, 'loss/train': 1.1222572326660156} 11/07/2021 14:28:50 - INFO - __main__ - Step 122394: {'lr': 4.171195272191181e-05, 'samples': 23499648, 'steps': 122393, 'loss/train': 0.7687196135520935} 11/07/2021 14:28:50 - INFO - __main__ - Step 122395: {'lr': 4.170901790684994e-05, 'samples': 23499840, 'steps': 122394, 'loss/train': 1.3334747552871704} 11/07/2021 14:28:50 - INFO - __main__ - Step 122396: {'lr': 4.1706083185640786e-05, 'samples': 23500032, 'steps': 122395, 'loss/train': 1.3587801456451416} 11/07/2021 14:28:51 - INFO - __main__ - Step 122397: {'lr': 4.1703148558285665e-05, 'samples': 23500224, 'steps': 122396, 'loss/train': 1.5669243335723877} 11/07/2021 14:28:52 - INFO - __main__ - Step 122398: {'lr': 4.170021402478588e-05, 'samples': 23500416, 'steps': 122397, 'loss/train': 1.3640955686569214} 11/07/2021 14:28:52 - INFO - __main__ - Step 122399: {'lr': 4.169727958514275e-05, 'samples': 23500608, 'steps': 122398, 'loss/train': 1.182969570159912} 11/07/2021 14:28:52 - INFO - __main__ - Step 122400: {'lr': 4.169434523935764e-05, 'samples': 23500800, 'steps': 122399, 'loss/train': 1.1423702239990234} 11/07/2021 14:28:53 - INFO - __main__ - Step 122401: {'lr': 4.1691410987431815e-05, 'samples': 23500992, 'steps': 122400, 'loss/train': 1.5116298198699951} 11/07/2021 14:28:53 - INFO - __main__ - Step 122402: {'lr': 4.168847682936663e-05, 'samples': 23501184, 'steps': 122401, 'loss/train': 1.247750997543335} 11/07/2021 14:28:53 - INFO - __main__ - Step 122403: {'lr': 4.1685542765163426e-05, 'samples': 23501376, 'steps': 122402, 'loss/train': 1.3061493635177612} 11/07/2021 14:28:54 - INFO - __main__ - Step 122404: {'lr': 4.16826087948235e-05, 'samples': 23501568, 'steps': 122403, 'loss/train': 1.0535310506820679} 11/07/2021 14:28:55 - INFO - __main__ - Step 122405: {'lr': 4.167967491834815e-05, 'samples': 23501760, 'steps': 122404, 'loss/train': 1.3202953338623047} 11/07/2021 14:28:55 - INFO - __main__ - Step 122406: {'lr': 4.16767411357388e-05, 'samples': 23501952, 'steps': 122405, 'loss/train': 1.2998197078704834} 11/07/2021 14:28:56 - INFO - __main__ - Step 122407: {'lr': 4.167380744699664e-05, 'samples': 23502144, 'steps': 122406, 'loss/train': 1.1554911136627197} 11/07/2021 14:28:56 - INFO - __main__ - Step 122408: {'lr': 4.167087385212304e-05, 'samples': 23502336, 'steps': 122407, 'loss/train': 1.170273780822754} 11/07/2021 14:28:57 - INFO - __main__ - Step 122409: {'lr': 4.1667940351119345e-05, 'samples': 23502528, 'steps': 122408, 'loss/train': 1.479997992515564} 11/07/2021 14:28:57 - INFO - __main__ - Step 122410: {'lr': 4.166500694398684e-05, 'samples': 23502720, 'steps': 122409, 'loss/train': 1.3928261995315552} 11/07/2021 14:28:58 - INFO - __main__ - Step 122411: {'lr': 4.1662073630726884e-05, 'samples': 23502912, 'steps': 122410, 'loss/train': 1.2480429410934448} 11/07/2021 14:28:58 - INFO - __main__ - Step 122412: {'lr': 4.165914041134078e-05, 'samples': 23503104, 'steps': 122411, 'loss/train': 1.4788445234298706} 11/07/2021 14:28:58 - INFO - __main__ - Step 122413: {'lr': 4.165620728582983e-05, 'samples': 23503296, 'steps': 122412, 'loss/train': 1.1434025764465332} 11/07/2021 14:28:59 - INFO - __main__ - Step 122414: {'lr': 4.1653274254195406e-05, 'samples': 23503488, 'steps': 122413, 'loss/train': 1.520963191986084} 11/07/2021 14:29:00 - INFO - __main__ - Step 122415: {'lr': 4.16503413164388e-05, 'samples': 23503680, 'steps': 122414, 'loss/train': 1.0888302326202393} 11/07/2021 14:29:00 - INFO - __main__ - Step 122416: {'lr': 4.164740847256132e-05, 'samples': 23503872, 'steps': 122415, 'loss/train': 1.3931578397750854} 11/07/2021 14:29:00 - INFO - __main__ - Step 122417: {'lr': 4.1644475722564304e-05, 'samples': 23504064, 'steps': 122416, 'loss/train': 1.475477933883667} 11/07/2021 14:29:01 - INFO - __main__ - Step 122418: {'lr': 4.164154306644907e-05, 'samples': 23504256, 'steps': 122417, 'loss/train': 1.2554343938827515} 11/07/2021 14:29:02 - INFO - __main__ - Step 122419: {'lr': 4.163861050421697e-05, 'samples': 23504448, 'steps': 122418, 'loss/train': 1.5281720161437988} 11/07/2021 14:29:02 - INFO - __main__ - Step 122420: {'lr': 4.163567803586934e-05, 'samples': 23504640, 'steps': 122419, 'loss/train': 1.4551409482955933} 11/07/2021 14:29:02 - INFO - __main__ - Step 122421: {'lr': 4.163274566140737e-05, 'samples': 23504832, 'steps': 122420, 'loss/train': 0.872772753238678} 11/07/2021 14:29:03 - INFO - __main__ - Step 122422: {'lr': 4.162981338083252e-05, 'samples': 23505024, 'steps': 122421, 'loss/train': 1.5092427730560303} 11/07/2021 14:29:03 - INFO - __main__ - Step 122423: {'lr': 4.162688119414604e-05, 'samples': 23505216, 'steps': 122422, 'loss/train': 1.0108695030212402} 11/07/2021 14:29:04 - INFO - __main__ - Step 122424: {'lr': 4.1623949101349254e-05, 'samples': 23505408, 'steps': 122423, 'loss/train': 0.582080066204071} 11/07/2021 14:29:04 - INFO - __main__ - Step 122425: {'lr': 4.162101710244351e-05, 'samples': 23505600, 'steps': 122424, 'loss/train': 1.8312740325927734} 11/07/2021 14:29:05 - INFO - __main__ - Step 122426: {'lr': 4.1618085197430125e-05, 'samples': 23505792, 'steps': 122425, 'loss/train': 1.3993080854415894} 11/07/2021 14:29:05 - INFO - __main__ - Step 122427: {'lr': 4.1615153386310415e-05, 'samples': 23505984, 'steps': 122426, 'loss/train': 1.6843174695968628} 11/07/2021 14:29:05 - INFO - __main__ - Step 122428: {'lr': 4.161222166908571e-05, 'samples': 23506176, 'steps': 122427, 'loss/train': 0.9426590800285339} 11/07/2021 14:29:07 - INFO - __main__ - Step 122429: {'lr': 4.160929004575731e-05, 'samples': 23506368, 'steps': 122428, 'loss/train': 1.1529858112335205} 11/07/2021 14:29:07 - INFO - __main__ - Step 122430: {'lr': 4.1606358516326545e-05, 'samples': 23506560, 'steps': 122429, 'loss/train': 1.097747564315796} 11/07/2021 14:29:07 - INFO - __main__ - Step 122431: {'lr': 4.160342708079473e-05, 'samples': 23506752, 'steps': 122430, 'loss/train': 1.018364667892456} 11/07/2021 14:29:08 - INFO - __main__ - Step 122432: {'lr': 4.1600495739163216e-05, 'samples': 23506944, 'steps': 122431, 'loss/train': 1.1726762056350708} 11/07/2021 14:29:08 - INFO - __main__ - Step 122433: {'lr': 4.159756449143337e-05, 'samples': 23507136, 'steps': 122432, 'loss/train': 1.419111728668213} 11/07/2021 14:29:09 - INFO - __main__ - Step 122434: {'lr': 4.159463333760638e-05, 'samples': 23507328, 'steps': 122433, 'loss/train': 1.3440561294555664} 11/07/2021 14:29:09 - INFO - __main__ - Step 122435: {'lr': 4.1591702277683606e-05, 'samples': 23507520, 'steps': 122434, 'loss/train': 1.4916667938232422} 11/07/2021 14:29:10 - INFO - __main__ - Step 122436: {'lr': 4.1588771311666415e-05, 'samples': 23507712, 'steps': 122435, 'loss/train': 1.197536587715149} 11/07/2021 14:29:10 - INFO - __main__ - Step 122437: {'lr': 4.158584043955613e-05, 'samples': 23507904, 'steps': 122436, 'loss/train': 0.7284564971923828} 11/07/2021 14:29:10 - INFO - __main__ - Step 122438: {'lr': 4.1582909661354004e-05, 'samples': 23508096, 'steps': 122437, 'loss/train': 5.576035022735596} 11/07/2021 14:29:11 - INFO - __main__ - Step 122439: {'lr': 4.157997897706142e-05, 'samples': 23508288, 'steps': 122438, 'loss/train': 1.8076235055923462} 11/07/2021 14:29:12 - INFO - __main__ - Step 122440: {'lr': 4.157704838667969e-05, 'samples': 23508480, 'steps': 122439, 'loss/train': 1.6367404460906982} 11/07/2021 14:29:12 - INFO - __main__ - Step 122441: {'lr': 4.1574117890210125e-05, 'samples': 23508672, 'steps': 122440, 'loss/train': 1.242681860923767} 11/07/2021 14:29:13 - INFO - __main__ - Step 122442: {'lr': 4.1571187487654036e-05, 'samples': 23508864, 'steps': 122441, 'loss/train': 0.70124751329422} 11/07/2021 14:29:13 - INFO - __main__ - Step 122443: {'lr': 4.156825717901277e-05, 'samples': 23509056, 'steps': 122442, 'loss/train': 1.419248104095459} 11/07/2021 14:29:13 - INFO - __main__ - Step 122444: {'lr': 4.1565326964287635e-05, 'samples': 23509248, 'steps': 122443, 'loss/train': 1.3610219955444336} 11/07/2021 14:29:14 - INFO - __main__ - Step 122445: {'lr': 4.1562396843479925e-05, 'samples': 23509440, 'steps': 122444, 'loss/train': 0.744465172290802} 11/07/2021 14:29:15 - INFO - __main__ - Step 122446: {'lr': 4.1559466816591065e-05, 'samples': 23509632, 'steps': 122445, 'loss/train': 0.8119130730628967} 11/07/2021 14:29:15 - INFO - __main__ - Step 122447: {'lr': 4.155653688362221e-05, 'samples': 23509824, 'steps': 122446, 'loss/train': 1.6596214771270752} 11/07/2021 14:29:15 - INFO - __main__ - Step 122448: {'lr': 4.1553607044574784e-05, 'samples': 23510016, 'steps': 122447, 'loss/train': 0.9332661628723145} 11/07/2021 14:29:16 - INFO - __main__ - Step 122449: {'lr': 4.155067729945006e-05, 'samples': 23510208, 'steps': 122448, 'loss/train': 1.89745032787323} 11/07/2021 14:29:17 - INFO - __main__ - Step 122450: {'lr': 4.1547747648249396e-05, 'samples': 23510400, 'steps': 122449, 'loss/train': 1.4108837842941284} 11/07/2021 14:29:17 - INFO - __main__ - Step 122451: {'lr': 4.15448180909741e-05, 'samples': 23510592, 'steps': 122450, 'loss/train': 2.0578038692474365} 11/07/2021 14:29:18 - INFO - __main__ - Step 122452: {'lr': 4.1541888627625505e-05, 'samples': 23510784, 'steps': 122451, 'loss/train': 1.402672290802002} 11/07/2021 14:29:18 - INFO - __main__ - Step 122453: {'lr': 4.153895925820492e-05, 'samples': 23510976, 'steps': 122452, 'loss/train': 1.129328966140747} 11/07/2021 14:29:18 - INFO - __main__ - Step 122454: {'lr': 4.1536029982713635e-05, 'samples': 23511168, 'steps': 122453, 'loss/train': 1.2055974006652832} 11/07/2021 14:29:19 - INFO - __main__ - Step 122455: {'lr': 4.1533100801153025e-05, 'samples': 23511360, 'steps': 122454, 'loss/train': 1.2816970348358154} 11/07/2021 14:29:20 - INFO - __main__ - Step 122456: {'lr': 4.153017171352436e-05, 'samples': 23511552, 'steps': 122455, 'loss/train': 1.1873301267623901} 11/07/2021 14:29:20 - INFO - __main__ - Step 122457: {'lr': 4.1527242719828994e-05, 'samples': 23511744, 'steps': 122456, 'loss/train': 1.3113000392913818} 11/07/2021 14:29:20 - INFO - __main__ - Step 122458: {'lr': 4.152431382006824e-05, 'samples': 23511936, 'steps': 122457, 'loss/train': 1.4117004871368408} 11/07/2021 14:29:21 - INFO - __main__ - Step 122459: {'lr': 4.1521385014243405e-05, 'samples': 23512128, 'steps': 122458, 'loss/train': 0.08405007421970367} 11/07/2021 14:29:21 - INFO - __main__ - Step 122460: {'lr': 4.1518456302355904e-05, 'samples': 23512320, 'steps': 122459, 'loss/train': 1.409835696220398} 11/07/2021 14:29:22 - INFO - __main__ - Step 122461: {'lr': 4.151552768440689e-05, 'samples': 23512512, 'steps': 122460, 'loss/train': 1.3118257522583008} 11/07/2021 14:29:23 - INFO - __main__ - Step 122462: {'lr': 4.151259916039776e-05, 'samples': 23512704, 'steps': 122461, 'loss/train': 0.7378225326538086} 11/07/2021 14:29:23 - INFO - __main__ - Step 122463: {'lr': 4.150967073032982e-05, 'samples': 23512896, 'steps': 122462, 'loss/train': 1.1983904838562012} 11/07/2021 14:29:23 - INFO - __main__ - Step 122464: {'lr': 4.150674239420443e-05, 'samples': 23513088, 'steps': 122463, 'loss/train': 1.2225584983825684} 11/07/2021 14:29:24 - INFO - __main__ - Step 122465: {'lr': 4.150381415202287e-05, 'samples': 23513280, 'steps': 122464, 'loss/train': 1.3025500774383545} 11/07/2021 14:29:25 - INFO - __main__ - Step 122466: {'lr': 4.1500886003786484e-05, 'samples': 23513472, 'steps': 122465, 'loss/train': 0.9653082489967346} 11/07/2021 14:29:25 - INFO - __main__ - Step 122467: {'lr': 4.149795794949657e-05, 'samples': 23513664, 'steps': 122466, 'loss/train': 1.4936909675598145} 11/07/2021 14:29:25 - INFO - __main__ - Step 122468: {'lr': 4.149502998915447e-05, 'samples': 23513856, 'steps': 122467, 'loss/train': 1.2590528726577759} 11/07/2021 14:29:26 - INFO - __main__ - Step 122469: {'lr': 4.14921021227615e-05, 'samples': 23514048, 'steps': 122468, 'loss/train': 1.4753516912460327} 11/07/2021 14:29:26 - INFO - __main__ - Step 122470: {'lr': 4.148917435031896e-05, 'samples': 23514240, 'steps': 122469, 'loss/train': 0.9275017976760864} 11/07/2021 14:29:27 - INFO - __main__ - Step 122471: {'lr': 4.148624667182818e-05, 'samples': 23514432, 'steps': 122470, 'loss/train': 1.232185959815979} 11/07/2021 14:29:28 - INFO - __main__ - Step 122472: {'lr': 4.1483319087290474e-05, 'samples': 23514624, 'steps': 122471, 'loss/train': 1.3266609907150269} 11/07/2021 14:29:28 - INFO - __main__ - Step 122473: {'lr': 4.148039159670722e-05, 'samples': 23514816, 'steps': 122472, 'loss/train': 1.2452282905578613} 11/07/2021 14:29:28 - INFO - __main__ - Step 122474: {'lr': 4.147746420007964e-05, 'samples': 23515008, 'steps': 122473, 'loss/train': 1.0680533647537231} 11/07/2021 14:29:29 - INFO - __main__ - Step 122475: {'lr': 4.1474536897409096e-05, 'samples': 23515200, 'steps': 122474, 'loss/train': 1.2428135871887207} 11/07/2021 14:29:30 - INFO - __main__ - Step 122476: {'lr': 4.14716096886969e-05, 'samples': 23515392, 'steps': 122475, 'loss/train': 1.1559759378433228} 11/07/2021 14:29:30 - INFO - __main__ - Step 122477: {'lr': 4.14686825739444e-05, 'samples': 23515584, 'steps': 122476, 'loss/train': 1.7756133079528809} 11/07/2021 14:29:30 - INFO - __main__ - Step 122478: {'lr': 4.1465755553152876e-05, 'samples': 23515776, 'steps': 122477, 'loss/train': 0.6213221549987793} 11/07/2021 14:29:31 - INFO - __main__ - Step 122479: {'lr': 4.146282862632367e-05, 'samples': 23515968, 'steps': 122478, 'loss/train': 1.1170670986175537} 11/07/2021 14:29:31 - INFO - __main__ - Step 122480: {'lr': 4.1459901793458075e-05, 'samples': 23516160, 'steps': 122479, 'loss/train': 1.2309281826019287} 11/07/2021 14:29:32 - INFO - __main__ - Step 122481: {'lr': 4.145697505455745e-05, 'samples': 23516352, 'steps': 122480, 'loss/train': 1.2127180099487305} 11/07/2021 14:29:32 - INFO - __main__ - Step 122482: {'lr': 4.1454048409623105e-05, 'samples': 23516544, 'steps': 122481, 'loss/train': 1.4955352544784546} 11/07/2021 14:29:33 - INFO - __main__ - Step 122483: {'lr': 4.1451121858656324e-05, 'samples': 23516736, 'steps': 122482, 'loss/train': 1.3807789087295532} 11/07/2021 14:29:33 - INFO - __main__ - Step 122484: {'lr': 4.144819540165845e-05, 'samples': 23516928, 'steps': 122483, 'loss/train': 0.8600056171417236} 11/07/2021 14:29:34 - INFO - __main__ - Step 122485: {'lr': 4.144526903863083e-05, 'samples': 23517120, 'steps': 122484, 'loss/train': 1.2159132957458496} 11/07/2021 14:29:35 - INFO - __main__ - Step 122486: {'lr': 4.144234276957473e-05, 'samples': 23517312, 'steps': 122485, 'loss/train': 1.266977071762085} 11/07/2021 14:29:35 - INFO - __main__ - Step 122487: {'lr': 4.143941659449155e-05, 'samples': 23517504, 'steps': 122486, 'loss/train': 1.4135898351669312} 11/07/2021 14:29:35 - INFO - __main__ - Step 122488: {'lr': 4.1436490513382497e-05, 'samples': 23517696, 'steps': 122487, 'loss/train': 1.5819756984710693} 11/07/2021 14:29:36 - INFO - __main__ - Step 122489: {'lr': 4.143356452624894e-05, 'samples': 23517888, 'steps': 122488, 'loss/train': 1.1790930032730103} 11/07/2021 14:29:36 - INFO - __main__ - Step 122490: {'lr': 4.143063863309221e-05, 'samples': 23518080, 'steps': 122489, 'loss/train': 1.3236713409423828} 11/07/2021 14:29:36 - INFO - __main__ - Step 122491: {'lr': 4.142771283391361e-05, 'samples': 23518272, 'steps': 122490, 'loss/train': 2.0512986183166504} 11/07/2021 14:29:37 - INFO - __main__ - Step 122492: {'lr': 4.142478712871445e-05, 'samples': 23518464, 'steps': 122491, 'loss/train': 1.6958063840866089} 11/07/2021 14:29:38 - INFO - __main__ - Step 122493: {'lr': 4.142186151749608e-05, 'samples': 23518656, 'steps': 122492, 'loss/train': 1.025072693824768} 11/07/2021 14:29:38 - INFO - __main__ - Step 122494: {'lr': 4.141893600025981e-05, 'samples': 23518848, 'steps': 122493, 'loss/train': 1.1713424921035767} 11/07/2021 14:29:39 - INFO - __main__ - Step 122495: {'lr': 4.141601057700692e-05, 'samples': 23519040, 'steps': 122494, 'loss/train': 1.0840849876403809} 11/07/2021 14:29:39 - INFO - __main__ - Step 122496: {'lr': 4.1413085247738764e-05, 'samples': 23519232, 'steps': 122495, 'loss/train': 1.2641175985336304} 11/07/2021 14:29:40 - INFO - __main__ - Step 122497: {'lr': 4.141016001245668e-05, 'samples': 23519424, 'steps': 122496, 'loss/train': 1.2598532438278198} 11/07/2021 14:29:40 - INFO - __main__ - Step 122498: {'lr': 4.1407234871161994e-05, 'samples': 23519616, 'steps': 122497, 'loss/train': 1.3634952306747437} 11/07/2021 14:29:41 - INFO - __main__ - Step 122499: {'lr': 4.140430982385593e-05, 'samples': 23519808, 'steps': 122498, 'loss/train': 1.193695068359375} 11/07/2021 14:29:41 - INFO - __main__ - Step 122500: {'lr': 4.1401384870539876e-05, 'samples': 23520000, 'steps': 122499, 'loss/train': 1.05503249168396} 11/07/2021 14:29:41 - INFO - __main__ - Step 122501: {'lr': 4.139846001121514e-05, 'samples': 23520192, 'steps': 122500, 'loss/train': 1.524527668952942} 11/07/2021 14:29:43 - INFO - __main__ - Step 122502: {'lr': 4.139553524588302e-05, 'samples': 23520384, 'steps': 122501, 'loss/train': 1.5823235511779785} 11/07/2021 14:29:43 - INFO - __main__ - Step 122503: {'lr': 4.139261057454488e-05, 'samples': 23520576, 'steps': 122502, 'loss/train': 1.9818239212036133} 11/07/2021 14:29:44 - INFO - __main__ - Step 122504: {'lr': 4.1389685997201996e-05, 'samples': 23520768, 'steps': 122503, 'loss/train': 1.404206395149231} 11/07/2021 14:29:44 - INFO - __main__ - Step 122505: {'lr': 4.1386761513855704e-05, 'samples': 23520960, 'steps': 122504, 'loss/train': 1.7315977811813354} 11/07/2021 14:29:44 - INFO - __main__ - Step 122506: {'lr': 4.13838371245073e-05, 'samples': 23521152, 'steps': 122505, 'loss/train': 1.001367211341858} 11/07/2021 14:29:45 - INFO - __main__ - Step 122507: {'lr': 4.1380912829158155e-05, 'samples': 23521344, 'steps': 122506, 'loss/train': 0.9059791564941406} 11/07/2021 14:29:46 - INFO - __main__ - Step 122508: {'lr': 4.137798862780951e-05, 'samples': 23521536, 'steps': 122507, 'loss/train': 1.4628534317016602} 11/07/2021 14:29:46 - INFO - __main__ - Step 122509: {'lr': 4.13750645204628e-05, 'samples': 23521728, 'steps': 122508, 'loss/train': 1.564202904701233} 11/07/2021 14:29:46 - INFO - __main__ - Step 122510: {'lr': 4.137214050711921e-05, 'samples': 23521920, 'steps': 122509, 'loss/train': 1.3733587265014648} 11/07/2021 14:29:47 - INFO - __main__ - Step 122511: {'lr': 4.136921658778012e-05, 'samples': 23522112, 'steps': 122510, 'loss/train': 1.5807671546936035} 11/07/2021 14:29:48 - INFO - __main__ - Step 122512: {'lr': 4.136629276244686e-05, 'samples': 23522304, 'steps': 122511, 'loss/train': 1.56660795211792} 11/07/2021 14:29:48 - INFO - __main__ - Step 122513: {'lr': 4.13633690311207e-05, 'samples': 23522496, 'steps': 122512, 'loss/train': 1.3733469247817993} 11/07/2021 14:29:48 - INFO - __main__ - Step 122514: {'lr': 4.1360445393802986e-05, 'samples': 23522688, 'steps': 122513, 'loss/train': 2.4170584678649902} 11/07/2021 14:29:49 - INFO - __main__ - Step 122515: {'lr': 4.1357521850495044e-05, 'samples': 23522880, 'steps': 122514, 'loss/train': 0.9097854495048523} 11/07/2021 14:29:49 - INFO - __main__ - Step 122516: {'lr': 4.135459840119818e-05, 'samples': 23523072, 'steps': 122515, 'loss/train': 1.5632407665252686} 11/07/2021 14:29:50 - INFO - __main__ - Step 122517: {'lr': 4.135167504591372e-05, 'samples': 23523264, 'steps': 122516, 'loss/train': 1.0203790664672852} 11/07/2021 14:29:50 - INFO - __main__ - Step 122518: {'lr': 4.134875178464298e-05, 'samples': 23523456, 'steps': 122517, 'loss/train': 1.526273488998413} 11/07/2021 14:29:51 - INFO - __main__ - Step 122519: {'lr': 4.1345828617387255e-05, 'samples': 23523648, 'steps': 122518, 'loss/train': 1.4191888570785522} 11/07/2021 14:29:51 - INFO - __main__ - Step 122520: {'lr': 4.1342905544147966e-05, 'samples': 23523840, 'steps': 122519, 'loss/train': 1.031857967376709} 11/07/2021 14:29:52 - INFO - __main__ - Step 122521: {'lr': 4.1339982564926244e-05, 'samples': 23524032, 'steps': 122520, 'loss/train': 1.5277140140533447} 11/07/2021 14:29:52 - INFO - __main__ - Step 122522: {'lr': 4.133705967972354e-05, 'samples': 23524224, 'steps': 122521, 'loss/train': 1.2051963806152344} 11/07/2021 14:29:53 - INFO - __main__ - Step 122523: {'lr': 4.1334136888541126e-05, 'samples': 23524416, 'steps': 122522, 'loss/train': 1.341617465019226} 11/07/2021 14:29:53 - INFO - __main__ - Step 122524: {'lr': 4.133121419138033e-05, 'samples': 23524608, 'steps': 122523, 'loss/train': 1.1773484945297241} 11/07/2021 14:29:54 - INFO - __main__ - Step 122525: {'lr': 4.132829158824247e-05, 'samples': 23524800, 'steps': 122524, 'loss/train': 1.638425588607788} 11/07/2021 14:29:54 - INFO - __main__ - Step 122526: {'lr': 4.1325369079128874e-05, 'samples': 23524992, 'steps': 122525, 'loss/train': 1.255165934562683} 11/07/2021 14:29:54 - INFO - __main__ - Step 122527: {'lr': 4.1322446664040805e-05, 'samples': 23525184, 'steps': 122526, 'loss/train': 0.6792795062065125} 11/07/2021 14:29:56 - INFO - __main__ - Step 122528: {'lr': 4.131952434297967e-05, 'samples': 23525376, 'steps': 122527, 'loss/train': 1.3283573389053345} 11/07/2021 14:29:56 - INFO - __main__ - Step 122529: {'lr': 4.1316602115946704e-05, 'samples': 23525568, 'steps': 122528, 'loss/train': 1.6681233644485474} 11/07/2021 14:29:56 - INFO - __main__ - Step 122530: {'lr': 4.131367998294327e-05, 'samples': 23525760, 'steps': 122529, 'loss/train': 1.4055153131484985} 11/07/2021 14:29:57 - INFO - __main__ - Step 122531: {'lr': 4.131075794397074e-05, 'samples': 23525952, 'steps': 122530, 'loss/train': 1.2229282855987549} 11/07/2021 14:29:57 - INFO - __main__ - Step 122532: {'lr': 4.130783599903029e-05, 'samples': 23526144, 'steps': 122531, 'loss/train': 0.6294504404067993} 11/07/2021 14:29:58 - INFO - __main__ - Step 122533: {'lr': 4.130491414812332e-05, 'samples': 23526336, 'steps': 122532, 'loss/train': 1.2354748249053955} 11/07/2021 14:29:59 - INFO - __main__ - Step 122534: {'lr': 4.130199239125113e-05, 'samples': 23526528, 'steps': 122533, 'loss/train': 1.2598156929016113} 11/07/2021 14:29:59 - INFO - __main__ - Step 122535: {'lr': 4.1299070728415047e-05, 'samples': 23526720, 'steps': 122534, 'loss/train': 1.2403371334075928} 11/07/2021 14:29:59 - INFO - __main__ - Step 122536: {'lr': 4.129614915961638e-05, 'samples': 23526912, 'steps': 122535, 'loss/train': 1.3235255479812622} 11/07/2021 14:30:00 - INFO - __main__ - Step 122537: {'lr': 4.129322768485644e-05, 'samples': 23527104, 'steps': 122536, 'loss/train': 1.6247808933258057} 11/07/2021 14:30:01 - INFO - __main__ - Step 122538: {'lr': 4.129030630413655e-05, 'samples': 23527296, 'steps': 122537, 'loss/train': 1.2504892349243164} 11/07/2021 14:30:01 - INFO - __main__ - Step 122539: {'lr': 4.128738501745802e-05, 'samples': 23527488, 'steps': 122538, 'loss/train': 1.434687614440918} 11/07/2021 14:30:01 - INFO - __main__ - Step 122540: {'lr': 4.1284463824822205e-05, 'samples': 23527680, 'steps': 122539, 'loss/train': 1.0900763273239136} 11/07/2021 14:30:02 - INFO - __main__ - Step 122541: {'lr': 4.128154272623036e-05, 'samples': 23527872, 'steps': 122540, 'loss/train': 1.1723564863204956} 11/07/2021 14:30:02 - INFO - __main__ - Step 122542: {'lr': 4.1278621721683895e-05, 'samples': 23528064, 'steps': 122541, 'loss/train': 1.38835608959198} 11/07/2021 14:30:02 - INFO - __main__ - Step 122543: {'lr': 4.127570081118401e-05, 'samples': 23528256, 'steps': 122542, 'loss/train': 1.4857666492462158} 11/07/2021 14:30:03 - INFO - __main__ - Step 122544: {'lr': 4.1272779994732086e-05, 'samples': 23528448, 'steps': 122543, 'loss/train': 1.3322229385375977} 11/07/2021 14:30:04 - INFO - __main__ - Step 122545: {'lr': 4.1269859272329404e-05, 'samples': 23528640, 'steps': 122544, 'loss/train': 1.4963182210922241} 11/07/2021 14:30:04 - INFO - __main__ - Step 122546: {'lr': 4.126693864397732e-05, 'samples': 23528832, 'steps': 122545, 'loss/train': 1.268662929534912} 11/07/2021 14:30:04 - INFO - __main__ - Step 122547: {'lr': 4.126401810967711e-05, 'samples': 23529024, 'steps': 122546, 'loss/train': 0.8667211532592773} 11/07/2021 14:30:05 - INFO - __main__ - Step 122548: {'lr': 4.126109766943012e-05, 'samples': 23529216, 'steps': 122547, 'loss/train': 0.23723074793815613} 11/07/2021 14:30:06 - INFO - __main__ - Step 122549: {'lr': 4.125817732323767e-05, 'samples': 23529408, 'steps': 122548, 'loss/train': 1.5033094882965088} 11/07/2021 14:30:06 - INFO - __main__ - Step 122550: {'lr': 4.125525707110106e-05, 'samples': 23529600, 'steps': 122549, 'loss/train': 1.7920058965682983} 11/07/2021 14:30:07 - INFO - __main__ - Step 122551: {'lr': 4.12523369130216e-05, 'samples': 23529792, 'steps': 122550, 'loss/train': 1.2918686866760254} 11/07/2021 14:30:07 - INFO - __main__ - Step 122552: {'lr': 4.1249416849000634e-05, 'samples': 23529984, 'steps': 122551, 'loss/train': 1.1818827390670776} 11/07/2021 14:30:07 - INFO - __main__ - Step 122553: {'lr': 4.1246496879039444e-05, 'samples': 23530176, 'steps': 122552, 'loss/train': 1.537212610244751} 11/07/2021 14:30:08 - INFO - __main__ - Step 122554: {'lr': 4.124357700313944e-05, 'samples': 23530368, 'steps': 122553, 'loss/train': 1.6427879333496094} 11/07/2021 14:30:09 - INFO - __main__ - Step 122555: {'lr': 4.12406572213018e-05, 'samples': 23530560, 'steps': 122554, 'loss/train': 1.8832412958145142} 11/07/2021 14:30:09 - INFO - __main__ - Step 122556: {'lr': 4.123773753352786e-05, 'samples': 23530752, 'steps': 122555, 'loss/train': 1.602410078048706} 11/07/2021 14:30:09 - INFO - __main__ - Step 122557: {'lr': 4.123481793981901e-05, 'samples': 23530944, 'steps': 122556, 'loss/train': 1.2881629467010498} 11/07/2021 14:30:10 - INFO - __main__ - Step 122558: {'lr': 4.123189844017652e-05, 'samples': 23531136, 'steps': 122557, 'loss/train': 1.1207563877105713} 11/07/2021 14:30:11 - INFO - __main__ - Step 122559: {'lr': 4.122897903460171e-05, 'samples': 23531328, 'steps': 122558, 'loss/train': 1.0268242359161377} 11/07/2021 14:30:11 - INFO - __main__ - Step 122560: {'lr': 4.1226059723095896e-05, 'samples': 23531520, 'steps': 122559, 'loss/train': 1.1020513772964478} 11/07/2021 14:30:11 - INFO - __main__ - Step 122561: {'lr': 4.1223140505660426e-05, 'samples': 23531712, 'steps': 122560, 'loss/train': 1.1385985612869263} 11/07/2021 14:30:12 - INFO - __main__ - Step 122562: {'lr': 4.122022138229656e-05, 'samples': 23531904, 'steps': 122561, 'loss/train': 1.5202807188034058} 11/07/2021 14:30:12 - INFO - __main__ - Step 122563: {'lr': 4.121730235300564e-05, 'samples': 23532096, 'steps': 122562, 'loss/train': 1.2799943685531616} 11/07/2021 14:30:13 - INFO - __main__ - Step 122564: {'lr': 4.121438341778899e-05, 'samples': 23532288, 'steps': 122563, 'loss/train': 1.2223442792892456} 11/07/2021 14:30:14 - INFO - __main__ - Step 122565: {'lr': 4.1211464576647926e-05, 'samples': 23532480, 'steps': 122564, 'loss/train': 1.220811367034912} 11/07/2021 14:30:14 - INFO - __main__ - Step 122566: {'lr': 4.120854582958375e-05, 'samples': 23532672, 'steps': 122565, 'loss/train': 1.2439627647399902} 11/07/2021 14:30:14 - INFO - __main__ - Step 122567: {'lr': 4.1205627176597816e-05, 'samples': 23532864, 'steps': 122566, 'loss/train': 1.2373836040496826} 11/07/2021 14:30:15 - INFO - __main__ - Step 122568: {'lr': 4.120270861769138e-05, 'samples': 23533056, 'steps': 122567, 'loss/train': 1.5092267990112305} 11/07/2021 14:30:16 - INFO - __main__ - Step 122569: {'lr': 4.119979015286576e-05, 'samples': 23533248, 'steps': 122568, 'loss/train': 1.5013397932052612} 11/07/2021 14:30:16 - INFO - __main__ - Step 122570: {'lr': 4.119687178212231e-05, 'samples': 23533440, 'steps': 122569, 'loss/train': 1.2656351327896118} 11/07/2021 14:30:16 - INFO - __main__ - Step 122571: {'lr': 4.1193953505462314e-05, 'samples': 23533632, 'steps': 122570, 'loss/train': 1.6337083578109741} 11/07/2021 14:30:17 - INFO - __main__ - Step 122572: {'lr': 4.119103532288709e-05, 'samples': 23533824, 'steps': 122571, 'loss/train': 1.4538367986679077} 11/07/2021 14:30:17 - INFO - __main__ - Step 122573: {'lr': 4.118811723439797e-05, 'samples': 23534016, 'steps': 122572, 'loss/train': 1.2886172533035278} 11/07/2021 14:30:17 - INFO - __main__ - Step 122574: {'lr': 4.118519923999628e-05, 'samples': 23534208, 'steps': 122573, 'loss/train': 1.6825987100601196} 11/07/2021 14:30:18 - INFO - __main__ - Step 122575: {'lr': 4.11822813396833e-05, 'samples': 23534400, 'steps': 122574, 'loss/train': 1.1388015747070312} 11/07/2021 14:30:19 - INFO - __main__ - Step 122576: {'lr': 4.117936353346039e-05, 'samples': 23534592, 'steps': 122575, 'loss/train': 0.7890264391899109} 11/07/2021 14:30:19 - INFO - __main__ - Step 122577: {'lr': 4.117644582132879e-05, 'samples': 23534784, 'steps': 122576, 'loss/train': 1.1866711378097534} 11/07/2021 14:30:19 - INFO - __main__ - Step 122578: {'lr': 4.11735282032899e-05, 'samples': 23534976, 'steps': 122577, 'loss/train': 0.43116599321365356} 11/07/2021 14:30:20 - INFO - __main__ - Step 122579: {'lr': 4.117061067934496e-05, 'samples': 23535168, 'steps': 122578, 'loss/train': 1.3403512239456177} 11/07/2021 14:30:21 - INFO - __main__ - Step 122580: {'lr': 4.116769324949535e-05, 'samples': 23535360, 'steps': 122579, 'loss/train': 1.2837892770767212} 11/07/2021 14:30:21 - INFO - __main__ - Step 122581: {'lr': 4.1164775913742404e-05, 'samples': 23535552, 'steps': 122580, 'loss/train': 1.2058472633361816} 11/07/2021 14:30:21 - INFO - __main__ - Step 122582: {'lr': 4.1161858672087326e-05, 'samples': 23535744, 'steps': 122581, 'loss/train': 0.09683708846569061} 11/07/2021 14:30:22 - INFO - __main__ - Step 122583: {'lr': 4.1158941524531504e-05, 'samples': 23535936, 'steps': 122582, 'loss/train': 1.2605725526809692} 11/07/2021 14:30:22 - INFO - __main__ - Step 122584: {'lr': 4.1156024471076245e-05, 'samples': 23536128, 'steps': 122583, 'loss/train': 1.36945378780365} 11/07/2021 14:30:23 - INFO - __main__ - Step 122585: {'lr': 4.115310751172283e-05, 'samples': 23536320, 'steps': 122584, 'loss/train': 1.0831220149993896} 11/07/2021 14:30:24 - INFO - __main__ - Step 122586: {'lr': 4.115019064647263e-05, 'samples': 23536512, 'steps': 122585, 'loss/train': 1.3879191875457764} 11/07/2021 14:30:24 - INFO - __main__ - Step 122587: {'lr': 4.114727387532691e-05, 'samples': 23536704, 'steps': 122586, 'loss/train': 1.4675332307815552} 11/07/2021 14:30:24 - INFO - __main__ - Step 122588: {'lr': 4.114435719828702e-05, 'samples': 23536896, 'steps': 122587, 'loss/train': 1.4306678771972656} 11/07/2021 14:30:25 - INFO - __main__ - Step 122589: {'lr': 4.114144061535424e-05, 'samples': 23537088, 'steps': 122588, 'loss/train': 1.4631441831588745} 11/07/2021 14:30:26 - INFO - __main__ - Step 122590: {'lr': 4.1138524126529936e-05, 'samples': 23537280, 'steps': 122589, 'loss/train': 1.1533808708190918} 11/07/2021 14:30:26 - INFO - __main__ - Step 122591: {'lr': 4.113560773181538e-05, 'samples': 23537472, 'steps': 122590, 'loss/train': 0.8946738243103027} 11/07/2021 14:30:26 - INFO - __main__ - Step 122592: {'lr': 4.113269143121187e-05, 'samples': 23537664, 'steps': 122591, 'loss/train': 0.8377827405929565} 11/07/2021 14:30:27 - INFO - __main__ - Step 122593: {'lr': 4.112977522472078e-05, 'samples': 23537856, 'steps': 122592, 'loss/train': 1.0348490476608276} 11/07/2021 14:30:27 - INFO - __main__ - Step 122594: {'lr': 4.112685911234343e-05, 'samples': 23538048, 'steps': 122593, 'loss/train': 1.4799480438232422} 11/07/2021 14:30:28 - INFO - __main__ - Step 122595: {'lr': 4.1123943094081046e-05, 'samples': 23538240, 'steps': 122594, 'loss/train': 1.6754286289215088} 11/07/2021 14:30:29 - INFO - __main__ - Step 122596: {'lr': 4.112102716993499e-05, 'samples': 23538432, 'steps': 122595, 'loss/train': 1.0514079332351685} 11/07/2021 14:30:29 - INFO - __main__ - Step 122597: {'lr': 4.111811133990656e-05, 'samples': 23538624, 'steps': 122596, 'loss/train': 1.2940007448196411} 11/07/2021 14:30:29 - INFO - __main__ - Step 122598: {'lr': 4.11151956039971e-05, 'samples': 23538816, 'steps': 122597, 'loss/train': 1.2256429195404053} 11/07/2021 14:30:30 - INFO - __main__ - Step 122599: {'lr': 4.11122799622079e-05, 'samples': 23539008, 'steps': 122598, 'loss/train': 1.653497338294983} 11/07/2021 14:30:31 - INFO - __main__ - Step 122600: {'lr': 4.1109364414540274e-05, 'samples': 23539200, 'steps': 122599, 'loss/train': 0.766314685344696} 11/07/2021 14:30:31 - INFO - __main__ - Step 122601: {'lr': 4.110644896099558e-05, 'samples': 23539392, 'steps': 122600, 'loss/train': 1.1637725830078125} 11/07/2021 14:30:32 - INFO - __main__ - Step 122602: {'lr': 4.1103533601575065e-05, 'samples': 23539584, 'steps': 122601, 'loss/train': 1.345105528831482} 11/07/2021 14:30:32 - INFO - __main__ - Step 122603: {'lr': 4.1100618336280096e-05, 'samples': 23539776, 'steps': 122602, 'loss/train': 0.8075726628303528} 11/07/2021 14:30:32 - INFO - __main__ - Step 122604: {'lr': 4.1097703165111935e-05, 'samples': 23539968, 'steps': 122603, 'loss/train': 1.205857753753662} 11/07/2021 14:30:33 - INFO - __main__ - Step 122605: {'lr': 4.109478808807196e-05, 'samples': 23540160, 'steps': 122604, 'loss/train': 1.1755684614181519} 11/07/2021 14:30:34 - INFO - __main__ - Step 122606: {'lr': 4.1091873105161436e-05, 'samples': 23540352, 'steps': 122605, 'loss/train': 1.1474199295043945} 11/07/2021 14:30:34 - INFO - __main__ - Step 122607: {'lr': 4.108895821638167e-05, 'samples': 23540544, 'steps': 122606, 'loss/train': 0.6789926886558533} 11/07/2021 14:30:34 - INFO - __main__ - Step 122608: {'lr': 4.108604342173408e-05, 'samples': 23540736, 'steps': 122607, 'loss/train': 2.088667392730713} 11/07/2021 14:30:35 - INFO - __main__ - Step 122609: {'lr': 4.108312872121983e-05, 'samples': 23540928, 'steps': 122608, 'loss/train': 1.0719436407089233} 11/07/2021 14:30:36 - INFO - __main__ - Step 122610: {'lr': 4.1080214114840305e-05, 'samples': 23541120, 'steps': 122609, 'loss/train': 1.425538420677185} 11/07/2021 14:30:36 - INFO - __main__ - Step 122611: {'lr': 4.1077299602596794e-05, 'samples': 23541312, 'steps': 122610, 'loss/train': 1.8379729986190796} 11/07/2021 14:30:36 - INFO - __main__ - Step 122612: {'lr': 4.107438518449064e-05, 'samples': 23541504, 'steps': 122611, 'loss/train': 1.32472562789917} 11/07/2021 14:30:37 - INFO - __main__ - Step 122613: {'lr': 4.107147086052315e-05, 'samples': 23541696, 'steps': 122612, 'loss/train': 1.2270530462265015} 11/07/2021 14:30:37 - INFO - __main__ - Step 122614: {'lr': 4.106855663069561e-05, 'samples': 23541888, 'steps': 122613, 'loss/train': 1.0573387145996094} 11/07/2021 14:30:38 - INFO - __main__ - Step 122615: {'lr': 4.106564249500938e-05, 'samples': 23542080, 'steps': 122614, 'loss/train': 1.3937419652938843} 11/07/2021 14:30:39 - INFO - __main__ - Step 122616: {'lr': 4.106272845346573e-05, 'samples': 23542272, 'steps': 122615, 'loss/train': 1.4775019884109497} 11/07/2021 14:30:39 - INFO - __main__ - Step 122617: {'lr': 4.1059814506065995e-05, 'samples': 23542464, 'steps': 122616, 'loss/train': 1.4465277194976807} 11/07/2021 14:30:39 - INFO - __main__ - Step 122618: {'lr': 4.105690065281148e-05, 'samples': 23542656, 'steps': 122617, 'loss/train': 1.5066980123519897} 11/07/2021 14:30:40 - INFO - __main__ - Step 122619: {'lr': 4.105398689370351e-05, 'samples': 23542848, 'steps': 122618, 'loss/train': 1.632744550704956} 11/07/2021 14:30:40 - INFO - __main__ - Step 122620: {'lr': 4.105107322874338e-05, 'samples': 23543040, 'steps': 122619, 'loss/train': 1.122246265411377} 11/07/2021 14:30:41 - INFO - __main__ - Step 122621: {'lr': 4.104815965793249e-05, 'samples': 23543232, 'steps': 122620, 'loss/train': 1.1071653366088867} 11/07/2021 14:30:42 - INFO - __main__ - Step 122622: {'lr': 4.104524618127201e-05, 'samples': 23543424, 'steps': 122621, 'loss/train': 1.4425671100616455} 11/07/2021 14:30:42 - INFO - __main__ - Step 122623: {'lr': 4.104233279876329e-05, 'samples': 23543616, 'steps': 122622, 'loss/train': 0.8182289004325867} 11/07/2021 14:30:42 - INFO - __main__ - Step 122624: {'lr': 4.103941951040771e-05, 'samples': 23543808, 'steps': 122623, 'loss/train': 1.3781739473342896} 11/07/2021 14:30:43 - INFO - __main__ - Step 122625: {'lr': 4.103650631620651e-05, 'samples': 23544000, 'steps': 122624, 'loss/train': 1.4624141454696655} 11/07/2021 14:30:44 - INFO - __main__ - Step 122626: {'lr': 4.103359321616104e-05, 'samples': 23544192, 'steps': 122625, 'loss/train': 4.410301208496094} 11/07/2021 14:30:44 - INFO - __main__ - Step 122627: {'lr': 4.103068021027262e-05, 'samples': 23544384, 'steps': 122626, 'loss/train': 1.217873215675354} 11/07/2021 14:30:44 - INFO - __main__ - Step 122628: {'lr': 4.1027767298542546e-05, 'samples': 23544576, 'steps': 122627, 'loss/train': 1.5735746622085571} 11/07/2021 14:30:45 - INFO - __main__ - Step 122629: {'lr': 4.102485448097215e-05, 'samples': 23544768, 'steps': 122628, 'loss/train': 1.2699217796325684} 11/07/2021 14:30:45 - INFO - __main__ - Step 122630: {'lr': 4.1021941757562714e-05, 'samples': 23544960, 'steps': 122629, 'loss/train': 0.7942612767219543} 11/07/2021 14:30:46 - INFO - __main__ - Step 122631: {'lr': 4.1019029128315565e-05, 'samples': 23545152, 'steps': 122630, 'loss/train': 1.6524966955184937} 11/07/2021 14:30:47 - INFO - __main__ - Step 122632: {'lr': 4.101611659323204e-05, 'samples': 23545344, 'steps': 122631, 'loss/train': 1.7257764339447021} 11/07/2021 14:30:47 - INFO - __main__ - Step 122633: {'lr': 4.101320415231341e-05, 'samples': 23545536, 'steps': 122632, 'loss/train': 0.9262992739677429} 11/07/2021 14:30:47 - INFO - __main__ - Step 122634: {'lr': 4.1010291805560986e-05, 'samples': 23545728, 'steps': 122633, 'loss/train': 1.3757917881011963} 11/07/2021 14:30:48 - INFO - __main__ - Step 122635: {'lr': 4.1007379552976175e-05, 'samples': 23545920, 'steps': 122634, 'loss/train': 1.3879070281982422} 11/07/2021 14:30:48 - INFO - __main__ - Step 122636: {'lr': 4.100446739456018e-05, 'samples': 23546112, 'steps': 122635, 'loss/train': 1.6673318147659302} 11/07/2021 14:30:49 - INFO - __main__ - Step 122637: {'lr': 4.100155533031433e-05, 'samples': 23546304, 'steps': 122636, 'loss/train': 1.0741277933120728} 11/07/2021 14:30:50 - INFO - __main__ - Step 122638: {'lr': 4.099864336023992e-05, 'samples': 23546496, 'steps': 122637, 'loss/train': 1.3934990167617798} 11/07/2021 14:30:50 - INFO - __main__ - Step 122639: {'lr': 4.099573148433833e-05, 'samples': 23546688, 'steps': 122638, 'loss/train': 1.1647554636001587} 11/07/2021 14:30:50 - INFO - __main__ - Step 122640: {'lr': 4.0992819702610845e-05, 'samples': 23546880, 'steps': 122639, 'loss/train': 0.692507803440094} 11/07/2021 14:30:51 - INFO - __main__ - Step 122641: {'lr': 4.0989908015058756e-05, 'samples': 23547072, 'steps': 122640, 'loss/train': 1.2865906953811646} 11/07/2021 14:30:52 - INFO - __main__ - Step 122642: {'lr': 4.098699642168338e-05, 'samples': 23547264, 'steps': 122641, 'loss/train': 1.4289671182632446} 11/07/2021 14:30:52 - INFO - __main__ - Step 122643: {'lr': 4.098408492248606e-05, 'samples': 23547456, 'steps': 122642, 'loss/train': 1.595455288887024} 11/07/2021 14:30:52 - INFO - __main__ - Step 122644: {'lr': 4.098117351746808e-05, 'samples': 23547648, 'steps': 122643, 'loss/train': 1.4917588233947754} 11/07/2021 14:30:53 - INFO - __main__ - Step 122645: {'lr': 4.0978262206630756e-05, 'samples': 23547840, 'steps': 122644, 'loss/train': 0.9836898446083069} 11/07/2021 14:30:53 - INFO - __main__ - Step 122646: {'lr': 4.09753509899754e-05, 'samples': 23548032, 'steps': 122645, 'loss/train': 1.226759672164917} 11/07/2021 14:30:54 - INFO - __main__ - Step 122647: {'lr': 4.097243986750332e-05, 'samples': 23548224, 'steps': 122646, 'loss/train': 1.6396571397781372} 11/07/2021 14:30:55 - INFO - __main__ - Step 122648: {'lr': 4.0969528839215895e-05, 'samples': 23548416, 'steps': 122647, 'loss/train': 1.4996156692504883} 11/07/2021 14:30:55 - INFO - __main__ - Step 122649: {'lr': 4.096661790511433e-05, 'samples': 23548608, 'steps': 122648, 'loss/train': 1.5924897193908691} 11/07/2021 14:30:55 - INFO - __main__ - Step 122650: {'lr': 4.096370706519994e-05, 'samples': 23548800, 'steps': 122649, 'loss/train': 1.7350828647613525} 11/07/2021 14:30:56 - INFO - __main__ - Step 122651: {'lr': 4.0960796319474134e-05, 'samples': 23548992, 'steps': 122650, 'loss/train': 1.262075662612915} 11/07/2021 14:30:56 - INFO - __main__ - Step 122652: {'lr': 4.095788566793812e-05, 'samples': 23549184, 'steps': 122651, 'loss/train': 1.3260735273361206} 11/07/2021 14:30:57 - INFO - __main__ - Step 122653: {'lr': 4.095497511059329e-05, 'samples': 23549376, 'steps': 122652, 'loss/train': 1.430083990097046} 11/07/2021 14:30:57 - INFO - __main__ - Step 122654: {'lr': 4.0952064647440914e-05, 'samples': 23549568, 'steps': 122653, 'loss/train': 1.0837312936782837} 11/07/2021 14:30:58 - INFO - __main__ - Step 122655: {'lr': 4.094915427848231e-05, 'samples': 23549760, 'steps': 122654, 'loss/train': 1.6996179819107056} 11/07/2021 14:30:58 - INFO - __main__ - Step 122656: {'lr': 4.094624400371877e-05, 'samples': 23549952, 'steps': 122655, 'loss/train': 1.276906967163086} 11/07/2021 14:30:58 - INFO - __main__ - Step 122657: {'lr': 4.0943333823151655e-05, 'samples': 23550144, 'steps': 122656, 'loss/train': 0.5707422494888306} 11/07/2021 14:30:59 - INFO - __main__ - Step 122658: {'lr': 4.094042373678225e-05, 'samples': 23550336, 'steps': 122657, 'loss/train': 0.9634395241737366} 11/07/2021 14:31:00 - INFO - __main__ - Step 122659: {'lr': 4.093751374461185e-05, 'samples': 23550528, 'steps': 122658, 'loss/train': 1.7329387664794922} 11/07/2021 14:31:00 - INFO - __main__ - Step 122660: {'lr': 4.093460384664177e-05, 'samples': 23550720, 'steps': 122659, 'loss/train': 1.3465310335159302} 11/07/2021 14:31:01 - INFO - __main__ - Step 122661: {'lr': 4.093169404287336e-05, 'samples': 23550912, 'steps': 122660, 'loss/train': 1.4503711462020874} 11/07/2021 14:31:01 - INFO - __main__ - Step 122662: {'lr': 4.0928784333307935e-05, 'samples': 23551104, 'steps': 122661, 'loss/train': 1.2600971460342407} 11/07/2021 14:31:01 - INFO - __main__ - Step 122663: {'lr': 4.0925874717946735e-05, 'samples': 23551296, 'steps': 122662, 'loss/train': 0.9075090289115906} 11/07/2021 14:31:02 - INFO - __main__ - Step 122664: {'lr': 4.092296519679109e-05, 'samples': 23551488, 'steps': 122663, 'loss/train': 1.2567410469055176} 11/07/2021 14:31:03 - INFO - __main__ - Step 122665: {'lr': 4.092005576984234e-05, 'samples': 23551680, 'steps': 122664, 'loss/train': 1.224100947380066} 11/07/2021 14:31:03 - INFO - __main__ - Step 122666: {'lr': 4.0917146437101785e-05, 'samples': 23551872, 'steps': 122665, 'loss/train': 1.4692500829696655} 11/07/2021 14:31:03 - INFO - __main__ - Step 122667: {'lr': 4.091423719857074e-05, 'samples': 23552064, 'steps': 122666, 'loss/train': 1.5229036808013916} 11/07/2021 14:31:04 - INFO - __main__ - Step 122668: {'lr': 4.091132805425052e-05, 'samples': 23552256, 'steps': 122667, 'loss/train': 1.4905613660812378} 11/07/2021 14:31:05 - INFO - __main__ - Step 122669: {'lr': 4.090841900414241e-05, 'samples': 23552448, 'steps': 122668, 'loss/train': 1.3917979001998901} 11/07/2021 14:31:05 - INFO - __main__ - Step 122670: {'lr': 4.090551004824777e-05, 'samples': 23552640, 'steps': 122669, 'loss/train': 1.5256725549697876} 11/07/2021 14:31:05 - INFO - __main__ - Step 122671: {'lr': 4.0902601186567856e-05, 'samples': 23552832, 'steps': 122670, 'loss/train': 1.252243995666504} 11/07/2021 14:31:06 - INFO - __main__ - Step 122672: {'lr': 4.089969241910402e-05, 'samples': 23553024, 'steps': 122671, 'loss/train': 1.2213386297225952} 11/07/2021 14:31:06 - INFO - __main__ - Step 122673: {'lr': 4.0896783745857536e-05, 'samples': 23553216, 'steps': 122672, 'loss/train': 1.4262380599975586} 11/07/2021 14:31:07 - INFO - __main__ - Step 122674: {'lr': 4.089387516682974e-05, 'samples': 23553408, 'steps': 122673, 'loss/train': 1.1267311573028564} 11/07/2021 14:31:08 - INFO - __main__ - Step 122675: {'lr': 4.0890966682022e-05, 'samples': 23553600, 'steps': 122674, 'loss/train': 1.399621844291687} 11/07/2021 14:31:08 - INFO - __main__ - Step 122676: {'lr': 4.088805829143552e-05, 'samples': 23553792, 'steps': 122675, 'loss/train': 0.576954185962677} 11/07/2021 14:31:08 - INFO - __main__ - Step 122677: {'lr': 4.088514999507162e-05, 'samples': 23553984, 'steps': 122676, 'loss/train': 1.3609230518341064} 11/07/2021 14:31:09 - INFO - __main__ - Step 122678: {'lr': 4.088224179293168e-05, 'samples': 23554176, 'steps': 122677, 'loss/train': 1.2953758239746094} 11/07/2021 14:31:10 - INFO - __main__ - Step 122679: {'lr': 4.087933368501695e-05, 'samples': 23554368, 'steps': 122678, 'loss/train': 0.9204114079475403} 11/07/2021 14:31:10 - INFO - __main__ - Step 122680: {'lr': 4.0876425671328765e-05, 'samples': 23554560, 'steps': 122679, 'loss/train': 1.2934726476669312} 11/07/2021 14:31:10 - INFO - __main__ - Step 122681: {'lr': 4.087351775186846e-05, 'samples': 23554752, 'steps': 122680, 'loss/train': 1.5087989568710327} 11/07/2021 14:31:11 - INFO - __main__ - Step 122682: {'lr': 4.08706099266373e-05, 'samples': 23554944, 'steps': 122681, 'loss/train': 1.5043307542800903} 11/07/2021 14:31:11 - INFO - __main__ - Step 122683: {'lr': 4.0867702195636625e-05, 'samples': 23555136, 'steps': 122682, 'loss/train': 1.3160552978515625} 11/07/2021 14:31:12 - INFO - __main__ - Step 122684: {'lr': 4.086479455886774e-05, 'samples': 23555328, 'steps': 122683, 'loss/train': 1.3725467920303345} 11/07/2021 14:31:13 - INFO - __main__ - Step 122685: {'lr': 4.086188701633195e-05, 'samples': 23555520, 'steps': 122684, 'loss/train': 1.2336424589157104} 11/07/2021 14:31:13 - INFO - __main__ - Step 122686: {'lr': 4.085897956803056e-05, 'samples': 23555712, 'steps': 122685, 'loss/train': 1.1967905759811401} 11/07/2021 14:31:13 - INFO - __main__ - Step 122687: {'lr': 4.0856072213964866e-05, 'samples': 23555904, 'steps': 122686, 'loss/train': 0.757764995098114} 11/07/2021 14:31:14 - INFO - __main__ - Step 122688: {'lr': 4.08531649541363e-05, 'samples': 23556096, 'steps': 122687, 'loss/train': 1.1333798170089722} 11/07/2021 14:31:14 - INFO - __main__ - Step 122689: {'lr': 4.085025778854598e-05, 'samples': 23556288, 'steps': 122688, 'loss/train': 1.4739010334014893} 11/07/2021 14:31:15 - INFO - __main__ - Step 122690: {'lr': 4.0847350717195307e-05, 'samples': 23556480, 'steps': 122689, 'loss/train': 1.3493725061416626} 11/07/2021 14:31:16 - INFO - __main__ - Step 122691: {'lr': 4.0844443740085614e-05, 'samples': 23556672, 'steps': 122690, 'loss/train': 1.3512424230575562} 11/07/2021 14:31:16 - INFO - __main__ - Step 122692: {'lr': 4.084153685721817e-05, 'samples': 23556864, 'steps': 122691, 'loss/train': 1.111491322517395} 11/07/2021 14:31:16 - INFO - __main__ - Step 122693: {'lr': 4.0838630068594315e-05, 'samples': 23557056, 'steps': 122692, 'loss/train': 1.3250846862792969} 11/07/2021 14:31:17 - INFO - __main__ - Step 122694: {'lr': 4.083572337421534e-05, 'samples': 23557248, 'steps': 122693, 'loss/train': 1.09848153591156} 11/07/2021 14:31:18 - INFO - __main__ - Step 122695: {'lr': 4.083281677408254e-05, 'samples': 23557440, 'steps': 122694, 'loss/train': 1.4257527589797974} 11/07/2021 14:31:18 - INFO - __main__ - Step 122696: {'lr': 4.082991026819727e-05, 'samples': 23557632, 'steps': 122695, 'loss/train': 1.3171544075012207} 11/07/2021 14:31:18 - INFO - __main__ - Step 122697: {'lr': 4.0827003856560796e-05, 'samples': 23557824, 'steps': 122696, 'loss/train': 1.438834547996521} 11/07/2021 14:31:19 - INFO - __main__ - Step 122698: {'lr': 4.082409753917446e-05, 'samples': 23558016, 'steps': 122697, 'loss/train': 1.26791512966156} 11/07/2021 14:31:19 - INFO - __main__ - Step 122699: {'lr': 4.082119131603956e-05, 'samples': 23558208, 'steps': 122698, 'loss/train': 0.7252678871154785} 11/07/2021 14:31:20 - INFO - __main__ - Step 122700: {'lr': 4.081828518715741e-05, 'samples': 23558400, 'steps': 122699, 'loss/train': 1.6121180057525635} 11/07/2021 14:31:21 - INFO - __main__ - Step 122701: {'lr': 4.081537915252931e-05, 'samples': 23558592, 'steps': 122700, 'loss/train': 1.3826566934585571} 11/07/2021 14:31:21 - INFO - __main__ - Step 122702: {'lr': 4.0812473212156616e-05, 'samples': 23558784, 'steps': 122701, 'loss/train': 0.8679426908493042} 11/07/2021 14:31:21 - INFO - __main__ - Step 122703: {'lr': 4.0809567366040524e-05, 'samples': 23558976, 'steps': 122702, 'loss/train': 1.5414760112762451} 11/07/2021 14:31:22 - INFO - __main__ - Step 122704: {'lr': 4.080666161418245e-05, 'samples': 23559168, 'steps': 122703, 'loss/train': 1.5338621139526367} 11/07/2021 14:31:23 - INFO - __main__ - Step 122705: {'lr': 4.080375595658364e-05, 'samples': 23559360, 'steps': 122704, 'loss/train': 1.4032517671585083} 11/07/2021 14:31:23 - INFO - __main__ - Step 122706: {'lr': 4.0800850393245435e-05, 'samples': 23559552, 'steps': 122705, 'loss/train': 0.5634937286376953} 11/07/2021 14:31:24 - INFO - __main__ - Step 122707: {'lr': 4.079794492416913e-05, 'samples': 23559744, 'steps': 122706, 'loss/train': 0.041491348296403885} 11/07/2021 14:31:24 - INFO - __main__ - Step 122708: {'lr': 4.0795039549356064e-05, 'samples': 23559936, 'steps': 122707, 'loss/train': 1.2240052223205566} 11/07/2021 14:31:24 - INFO - __main__ - Step 122709: {'lr': 4.079213426880751e-05, 'samples': 23560128, 'steps': 122708, 'loss/train': 1.4123846292495728} 11/07/2021 14:31:25 - INFO - __main__ - Step 122710: {'lr': 4.0789229082524806e-05, 'samples': 23560320, 'steps': 122709, 'loss/train': 1.4746934175491333} 11/07/2021 14:31:26 - INFO - __main__ - Step 122711: {'lr': 4.078632399050922e-05, 'samples': 23560512, 'steps': 122710, 'loss/train': 1.1233129501342773} 11/07/2021 14:31:26 - INFO - __main__ - Step 122712: {'lr': 4.078341899276211e-05, 'samples': 23560704, 'steps': 122711, 'loss/train': 0.8817643523216248} 11/07/2021 14:31:26 - INFO - __main__ - Step 122713: {'lr': 4.0780514089284766e-05, 'samples': 23560896, 'steps': 122712, 'loss/train': 1.4394886493682861} 11/07/2021 14:31:27 - INFO - __main__ - Step 122714: {'lr': 4.077760928007848e-05, 'samples': 23561088, 'steps': 122713, 'loss/train': 1.1754403114318848} 11/07/2021 14:31:27 - INFO - __main__ - Step 122715: {'lr': 4.077470456514465e-05, 'samples': 23561280, 'steps': 122714, 'loss/train': 1.3730173110961914} 11/07/2021 14:31:28 - INFO - __main__ - Step 122716: {'lr': 4.077179994448443e-05, 'samples': 23561472, 'steps': 122715, 'loss/train': 1.3211344480514526} 11/07/2021 14:31:28 - INFO - __main__ - Step 122717: {'lr': 4.0768895418099225e-05, 'samples': 23561664, 'steps': 122716, 'loss/train': 1.5654443502426147} 11/07/2021 14:31:29 - INFO - __main__ - Step 122718: {'lr': 4.076599098599032e-05, 'samples': 23561856, 'steps': 122717, 'loss/train': 1.064673662185669} 11/07/2021 14:31:29 - INFO - __main__ - Step 122719: {'lr': 4.076308664815903e-05, 'samples': 23562048, 'steps': 122718, 'loss/train': 1.420242190361023} 11/07/2021 14:31:30 - INFO - __main__ - Step 122720: {'lr': 4.076018240460666e-05, 'samples': 23562240, 'steps': 122719, 'loss/train': 1.4071457386016846} 11/07/2021 14:31:30 - INFO - __main__ - Step 122721: {'lr': 4.0757278255334514e-05, 'samples': 23562432, 'steps': 122720, 'loss/train': 1.3863768577575684} 11/07/2021 14:31:31 - INFO - __main__ - Step 122722: {'lr': 4.0754374200343946e-05, 'samples': 23562624, 'steps': 122721, 'loss/train': 0.6484341025352478} 11/07/2021 14:31:31 - INFO - __main__ - Step 122723: {'lr': 4.075147023963619e-05, 'samples': 23562816, 'steps': 122722, 'loss/train': 1.0697200298309326} 11/07/2021 14:31:32 - INFO - __main__ - Step 122724: {'lr': 4.0748566373212615e-05, 'samples': 23563008, 'steps': 122723, 'loss/train': 1.4765372276306152} 11/07/2021 14:31:33 - INFO - __main__ - Step 122725: {'lr': 4.074566260107448e-05, 'samples': 23563200, 'steps': 122724, 'loss/train': 0.5957391262054443} 11/07/2021 14:31:33 - INFO - __main__ - Step 122726: {'lr': 4.074275892322316e-05, 'samples': 23563392, 'steps': 122725, 'loss/train': 1.4444876909255981} 11/07/2021 14:31:33 - INFO - __main__ - Step 122727: {'lr': 4.07398553396599e-05, 'samples': 23563584, 'steps': 122726, 'loss/train': 1.3131966590881348} 11/07/2021 14:31:34 - INFO - __main__ - Step 122728: {'lr': 4.073695185038603e-05, 'samples': 23563776, 'steps': 122727, 'loss/train': 0.9847102761268616} 11/07/2021 14:31:34 - INFO - __main__ - Step 122729: {'lr': 4.073404845540293e-05, 'samples': 23563968, 'steps': 122728, 'loss/train': 1.3233556747436523} 11/07/2021 14:31:35 - INFO - __main__ - Step 122730: {'lr': 4.07311451547118e-05, 'samples': 23564160, 'steps': 122729, 'loss/train': 1.507636547088623} 11/07/2021 14:31:35 - INFO - __main__ - Step 122731: {'lr': 4.072824194831396e-05, 'samples': 23564352, 'steps': 122730, 'loss/train': 1.2996450662612915} 11/07/2021 14:31:36 - INFO - __main__ - Step 122732: {'lr': 4.072533883621074e-05, 'samples': 23564544, 'steps': 122731, 'loss/train': 1.1807461977005005} 11/07/2021 14:31:36 - INFO - __main__ - Step 122733: {'lr': 4.072243581840346e-05, 'samples': 23564736, 'steps': 122732, 'loss/train': 1.0244598388671875} 11/07/2021 14:31:36 - INFO - __main__ - Step 122734: {'lr': 4.0719532894893415e-05, 'samples': 23564928, 'steps': 122733, 'loss/train': 1.2652578353881836} 11/07/2021 14:31:37 - INFO - __main__ - Step 122735: {'lr': 4.0716630065681934e-05, 'samples': 23565120, 'steps': 122734, 'loss/train': 1.641963005065918} 11/07/2021 14:31:38 - INFO - __main__ - Step 122736: {'lr': 4.0713727330770306e-05, 'samples': 23565312, 'steps': 122735, 'loss/train': 1.01317298412323} 11/07/2021 14:31:38 - INFO - __main__ - Step 122737: {'lr': 4.0710824690159825e-05, 'samples': 23565504, 'steps': 122736, 'loss/train': 1.2134778499603271} 11/07/2021 14:31:39 - INFO - __main__ - Step 122738: {'lr': 4.0707922143851826e-05, 'samples': 23565696, 'steps': 122737, 'loss/train': 1.6912497282028198} 11/07/2021 14:31:39 - INFO - __main__ - Step 122739: {'lr': 4.070501969184762e-05, 'samples': 23565888, 'steps': 122738, 'loss/train': 2.47052001953125} 11/07/2021 14:31:39 - INFO - __main__ - Step 122740: {'lr': 4.07021173341485e-05, 'samples': 23566080, 'steps': 122739, 'loss/train': 0.828179121017456} 11/07/2021 14:31:40 - INFO - __main__ - Step 122741: {'lr': 4.069921507075577e-05, 'samples': 23566272, 'steps': 122740, 'loss/train': 1.8005295991897583} 11/07/2021 14:31:41 - INFO - __main__ - Step 122742: {'lr': 4.0696312901670806e-05, 'samples': 23566464, 'steps': 122741, 'loss/train': 1.413032054901123} 11/07/2021 14:31:41 - INFO - __main__ - Step 122743: {'lr': 4.0693410826894786e-05, 'samples': 23566656, 'steps': 122742, 'loss/train': 1.4237620830535889} 11/07/2021 14:31:41 - INFO - __main__ - Step 122744: {'lr': 4.069050884642911e-05, 'samples': 23566848, 'steps': 122743, 'loss/train': 1.3188918828964233} 11/07/2021 14:31:42 - INFO - __main__ - Step 122745: {'lr': 4.068760696027504e-05, 'samples': 23567040, 'steps': 122744, 'loss/train': 1.3872089385986328} 11/07/2021 14:31:42 - INFO - __main__ - Step 122746: {'lr': 4.068470516843389e-05, 'samples': 23567232, 'steps': 122745, 'loss/train': 1.4024021625518799} 11/07/2021 14:31:43 - INFO - __main__ - Step 122747: {'lr': 4.0681803470907e-05, 'samples': 23567424, 'steps': 122746, 'loss/train': 1.5329777002334595} 11/07/2021 14:31:44 - INFO - __main__ - Step 122748: {'lr': 4.0678901867695656e-05, 'samples': 23567616, 'steps': 122747, 'loss/train': 1.995766043663025} 11/07/2021 14:31:44 - INFO - __main__ - Step 122749: {'lr': 4.067600035880117e-05, 'samples': 23567808, 'steps': 122748, 'loss/train': 1.8338065147399902} 11/07/2021 14:31:44 - INFO - __main__ - Step 122750: {'lr': 4.067309894422486e-05, 'samples': 23568000, 'steps': 122749, 'loss/train': 1.0698034763336182} 11/07/2021 14:31:45 - INFO - __main__ - Step 122751: {'lr': 4.067019762396801e-05, 'samples': 23568192, 'steps': 122750, 'loss/train': 1.3198769092559814} 11/07/2021 14:31:46 - INFO - __main__ - Step 122752: {'lr': 4.0667296398031934e-05, 'samples': 23568384, 'steps': 122751, 'loss/train': 1.6353436708450317} 11/07/2021 14:31:46 - INFO - __main__ - Step 122753: {'lr': 4.066439526641797e-05, 'samples': 23568576, 'steps': 122752, 'loss/train': 1.1623674631118774} 11/07/2021 14:31:46 - INFO - __main__ - Step 122754: {'lr': 4.066149422912738e-05, 'samples': 23568768, 'steps': 122753, 'loss/train': 1.232370138168335} 11/07/2021 14:31:47 - INFO - __main__ - Step 122755: {'lr': 4.065859328616148e-05, 'samples': 23568960, 'steps': 122754, 'loss/train': 1.5698646306991577} 11/07/2021 14:31:47 - INFO - __main__ - Step 122756: {'lr': 4.065569243752165e-05, 'samples': 23569152, 'steps': 122755, 'loss/train': 1.0153170824050903} 11/07/2021 14:31:48 - INFO - __main__ - Step 122757: {'lr': 4.06527916832091e-05, 'samples': 23569344, 'steps': 122756, 'loss/train': 0.8673912882804871} 11/07/2021 14:31:48 - INFO - __main__ - Step 122758: {'lr': 4.0649891023225136e-05, 'samples': 23569536, 'steps': 122757, 'loss/train': 1.1990084648132324} 11/07/2021 14:31:49 - INFO - __main__ - Step 122759: {'lr': 4.064699045757111e-05, 'samples': 23569728, 'steps': 122758, 'loss/train': 1.1462204456329346} 11/07/2021 14:31:49 - INFO - __main__ - Step 122760: {'lr': 4.064408998624833e-05, 'samples': 23569920, 'steps': 122759, 'loss/train': 1.719469666481018} 11/07/2021 14:31:50 - INFO - __main__ - Step 122761: {'lr': 4.064118960925811e-05, 'samples': 23570112, 'steps': 122760, 'loss/train': 1.2882317304611206} 11/07/2021 14:31:51 - INFO - __main__ - Step 122762: {'lr': 4.063828932660171e-05, 'samples': 23570304, 'steps': 122761, 'loss/train': 1.3150910139083862} 11/07/2021 14:31:51 - INFO - __main__ - Step 122763: {'lr': 4.0635389138280464e-05, 'samples': 23570496, 'steps': 122762, 'loss/train': 1.122635841369629} 11/07/2021 14:31:51 - INFO - __main__ - Step 122764: {'lr': 4.06324890442957e-05, 'samples': 23570688, 'steps': 122763, 'loss/train': 1.3195300102233887} 11/07/2021 14:31:52 - INFO - __main__ - Step 122765: {'lr': 4.06295890446487e-05, 'samples': 23570880, 'steps': 122764, 'loss/train': 1.2573167085647583} 11/07/2021 14:31:52 - INFO - __main__ - Step 122766: {'lr': 4.062668913934078e-05, 'samples': 23571072, 'steps': 122765, 'loss/train': 1.2225067615509033} 11/07/2021 14:31:53 - INFO - __main__ - Step 122767: {'lr': 4.062378932837329e-05, 'samples': 23571264, 'steps': 122766, 'loss/train': 1.454783320426941} 11/07/2021 14:31:54 - INFO - __main__ - Step 122768: {'lr': 4.0620889611747455e-05, 'samples': 23571456, 'steps': 122767, 'loss/train': 0.1492692083120346} 11/07/2021 14:31:54 - INFO - __main__ - Step 122769: {'lr': 4.0617989989464586e-05, 'samples': 23571648, 'steps': 122768, 'loss/train': 1.6482349634170532} 11/07/2021 14:31:54 - INFO - __main__ - Step 122770: {'lr': 4.0615090461526036e-05, 'samples': 23571840, 'steps': 122769, 'loss/train': 1.52577543258667} 11/07/2021 14:31:55 - INFO - __main__ - Step 122771: {'lr': 4.061219102793309e-05, 'samples': 23572032, 'steps': 122770, 'loss/train': 1.3225438594818115} 11/07/2021 14:31:56 - INFO - __main__ - Step 122772: {'lr': 4.060929168868707e-05, 'samples': 23572224, 'steps': 122771, 'loss/train': 1.3978902101516724} 11/07/2021 14:31:56 - INFO - __main__ - Step 122773: {'lr': 4.060639244378925e-05, 'samples': 23572416, 'steps': 122772, 'loss/train': 1.4339479207992554} 11/07/2021 14:31:56 - INFO - __main__ - Step 122774: {'lr': 4.0603493293240976e-05, 'samples': 23572608, 'steps': 122773, 'loss/train': 1.6578158140182495} 11/07/2021 14:31:57 - INFO - __main__ - Step 122775: {'lr': 4.060059423704354e-05, 'samples': 23572800, 'steps': 122774, 'loss/train': 1.323574185371399} 11/07/2021 14:31:57 - INFO - __main__ - Step 122776: {'lr': 4.0597695275198246e-05, 'samples': 23572992, 'steps': 122775, 'loss/train': 1.486228346824646} 11/07/2021 14:31:58 - INFO - __main__ - Step 122777: {'lr': 4.059479640770639e-05, 'samples': 23573184, 'steps': 122776, 'loss/train': 1.4948736429214478} 11/07/2021 14:31:59 - INFO - __main__ - Step 122778: {'lr': 4.059189763456933e-05, 'samples': 23573376, 'steps': 122777, 'loss/train': 1.1344590187072754} 11/07/2021 14:31:59 - INFO - __main__ - Step 122779: {'lr': 4.05889989557883e-05, 'samples': 23573568, 'steps': 122778, 'loss/train': 0.8431844711303711} 11/07/2021 14:31:59 - INFO - __main__ - Step 122780: {'lr': 4.058610037136462e-05, 'samples': 23573760, 'steps': 122779, 'loss/train': 1.4587384462356567} 11/07/2021 14:32:00 - INFO - __main__ - Step 122781: {'lr': 4.05832018812996e-05, 'samples': 23573952, 'steps': 122780, 'loss/train': 1.3128060102462769} 11/07/2021 14:32:01 - INFO - __main__ - Step 122782: {'lr': 4.058030348559458e-05, 'samples': 23574144, 'steps': 122781, 'loss/train': 1.2949265241622925} 11/07/2021 14:32:01 - INFO - __main__ - Step 122783: {'lr': 4.057740518425085e-05, 'samples': 23574336, 'steps': 122782, 'loss/train': 0.8759891986846924} 11/07/2021 14:32:01 - INFO - __main__ - Step 122784: {'lr': 4.057450697726969e-05, 'samples': 23574528, 'steps': 122783, 'loss/train': 0.9433574676513672} 11/07/2021 14:32:02 - INFO - __main__ - Step 122785: {'lr': 4.0571608864652444e-05, 'samples': 23574720, 'steps': 122784, 'loss/train': 1.4258853197097778} 11/07/2021 14:32:02 - INFO - __main__ - Step 122786: {'lr': 4.0568710846400374e-05, 'samples': 23574912, 'steps': 122785, 'loss/train': 1.2830003499984741} 11/07/2021 14:32:02 - INFO - __main__ - Step 122787: {'lr': 4.056581292251482e-05, 'samples': 23575104, 'steps': 122786, 'loss/train': 1.0574589967727661} 11/07/2021 14:32:03 - INFO - __main__ - Step 122788: {'lr': 4.056291509299709e-05, 'samples': 23575296, 'steps': 122787, 'loss/train': 0.7253304123878479} 11/07/2021 14:32:04 - INFO - __main__ - Step 122789: {'lr': 4.0560017357848535e-05, 'samples': 23575488, 'steps': 122788, 'loss/train': 0.8107234835624695} 11/07/2021 14:32:04 - INFO - __main__ - Step 122790: {'lr': 4.055711971707035e-05, 'samples': 23575680, 'steps': 122789, 'loss/train': 1.2596296072006226} 11/07/2021 14:32:04 - INFO - __main__ - Step 122791: {'lr': 4.055422217066388e-05, 'samples': 23575872, 'steps': 122790, 'loss/train': 1.3526668548583984} 11/07/2021 14:32:05 - INFO - __main__ - Step 122792: {'lr': 4.0551324718630435e-05, 'samples': 23576064, 'steps': 122791, 'loss/train': 0.916555643081665} 11/07/2021 14:32:06 - INFO - __main__ - Step 122793: {'lr': 4.0548427360971364e-05, 'samples': 23576256, 'steps': 122792, 'loss/train': 1.240802526473999} 11/07/2021 14:32:06 - INFO - __main__ - Step 122794: {'lr': 4.054553009768791e-05, 'samples': 23576448, 'steps': 122793, 'loss/train': 1.028031826019287} 11/07/2021 14:32:07 - INFO - __main__ - Step 122795: {'lr': 4.054263292878144e-05, 'samples': 23576640, 'steps': 122794, 'loss/train': 1.105987787246704} 11/07/2021 14:32:07 - INFO - __main__ - Step 122796: {'lr': 4.053973585425319e-05, 'samples': 23576832, 'steps': 122795, 'loss/train': 0.8700567483901978} 11/07/2021 14:32:07 - INFO - __main__ - Step 122797: {'lr': 4.053683887410453e-05, 'samples': 23577024, 'steps': 122796, 'loss/train': 1.2327110767364502} 11/07/2021 14:32:08 - INFO - __main__ - Step 122798: {'lr': 4.053394198833674e-05, 'samples': 23577216, 'steps': 122797, 'loss/train': 1.3593430519104004} 11/07/2021 14:32:09 - INFO - __main__ - Step 122799: {'lr': 4.053104519695111e-05, 'samples': 23577408, 'steps': 122798, 'loss/train': 1.1462671756744385} 11/07/2021 14:32:09 - INFO - __main__ - Step 122800: {'lr': 4.0528148499949014e-05, 'samples': 23577600, 'steps': 122799, 'loss/train': 1.4542186260223389} 11/07/2021 14:32:09 - INFO - __main__ - Step 122801: {'lr': 4.0525251897331663e-05, 'samples': 23577792, 'steps': 122800, 'loss/train': 1.4157127141952515} 11/07/2021 14:32:10 - INFO - __main__ - Step 122802: {'lr': 4.0522355389100373e-05, 'samples': 23577984, 'steps': 122801, 'loss/train': 1.3502506017684937} 11/07/2021 14:32:11 - INFO - __main__ - Step 122803: {'lr': 4.05194589752565e-05, 'samples': 23578176, 'steps': 122802, 'loss/train': 1.0506922006607056} 11/07/2021 14:32:11 - INFO - __main__ - Step 122804: {'lr': 4.051656265580131e-05, 'samples': 23578368, 'steps': 122803, 'loss/train': 1.5415836572647095} 11/07/2021 14:32:11 - INFO - __main__ - Step 122805: {'lr': 4.051366643073615e-05, 'samples': 23578560, 'steps': 122804, 'loss/train': 1.011582851409912} 11/07/2021 14:32:12 - INFO - __main__ - Step 122806: {'lr': 4.0510770300062285e-05, 'samples': 23578752, 'steps': 122805, 'loss/train': 1.615538239479065} 11/07/2021 14:32:12 - INFO - __main__ - Step 122807: {'lr': 4.050787426378102e-05, 'samples': 23578944, 'steps': 122806, 'loss/train': 1.434889793395996} 11/07/2021 14:32:13 - INFO - __main__ - Step 122808: {'lr': 4.0504978321893675e-05, 'samples': 23579136, 'steps': 122807, 'loss/train': 1.7448651790618896} 11/07/2021 14:32:14 - INFO - __main__ - Step 122809: {'lr': 4.050208247440157e-05, 'samples': 23579328, 'steps': 122808, 'loss/train': 1.3046488761901855} 11/07/2021 14:32:14 - INFO - __main__ - Step 122810: {'lr': 4.049918672130598e-05, 'samples': 23579520, 'steps': 122809, 'loss/train': 1.2472960948944092} 11/07/2021 14:32:14 - INFO - __main__ - Step 122811: {'lr': 4.04962910626083e-05, 'samples': 23579712, 'steps': 122810, 'loss/train': 1.4147417545318604} 11/07/2021 14:32:15 - INFO - __main__ - Step 122812: {'lr': 4.049339549830969e-05, 'samples': 23579904, 'steps': 122811, 'loss/train': 1.2500998973846436} 11/07/2021 14:32:15 - INFO - __main__ - Step 122813: {'lr': 4.049050002841154e-05, 'samples': 23580096, 'steps': 122812, 'loss/train': 1.4021540880203247} 11/07/2021 14:32:16 - INFO - __main__ - Step 122814: {'lr': 4.048760465291512e-05, 'samples': 23580288, 'steps': 122813, 'loss/train': 1.495575189590454} 11/07/2021 14:32:16 - INFO - __main__ - Step 122815: {'lr': 4.048470937182175e-05, 'samples': 23580480, 'steps': 122814, 'loss/train': 1.2783470153808594} 11/07/2021 14:32:17 - INFO - __main__ - Step 122816: {'lr': 4.0481814185132746e-05, 'samples': 23580672, 'steps': 122815, 'loss/train': 1.269828200340271} 11/07/2021 14:32:17 - INFO - __main__ - Step 122817: {'lr': 4.0478919092849395e-05, 'samples': 23580864, 'steps': 122816, 'loss/train': 1.299452543258667} 11/07/2021 14:32:17 - INFO - __main__ - Step 122818: {'lr': 4.0476024094973e-05, 'samples': 23581056, 'steps': 122817, 'loss/train': 1.2650859355926514} 11/07/2021 14:32:19 - INFO - __main__ - Step 122819: {'lr': 4.047312919150489e-05, 'samples': 23581248, 'steps': 122818, 'loss/train': 0.8822494149208069} 11/07/2021 14:32:19 - INFO - __main__ - Step 122820: {'lr': 4.047023438244638e-05, 'samples': 23581440, 'steps': 122819, 'loss/train': 1.17789888381958} 11/07/2021 14:32:19 - INFO - __main__ - Step 122821: {'lr': 4.04673396677987e-05, 'samples': 23581632, 'steps': 122820, 'loss/train': 1.1971246004104614} 11/07/2021 14:32:20 - INFO - __main__ - Step 122822: {'lr': 4.0464445047563246e-05, 'samples': 23581824, 'steps': 122821, 'loss/train': 2.2340729236602783} 11/07/2021 14:32:20 - INFO - __main__ - Step 122823: {'lr': 4.04615505217413e-05, 'samples': 23582016, 'steps': 122822, 'loss/train': 0.8265403509140015} 11/07/2021 14:32:21 - INFO - __main__ - Step 122824: {'lr': 4.045865609033411e-05, 'samples': 23582208, 'steps': 122823, 'loss/train': 1.5967570543289185} 11/07/2021 14:32:22 - INFO - __main__ - Step 122825: {'lr': 4.0455761753343005e-05, 'samples': 23582400, 'steps': 122824, 'loss/train': 1.338694453239441} 11/07/2021 14:32:22 - INFO - __main__ - Step 122826: {'lr': 4.045286751076932e-05, 'samples': 23582592, 'steps': 122825, 'loss/train': 1.0315101146697998} 11/07/2021 14:32:22 - INFO - __main__ - Step 122827: {'lr': 4.044997336261433e-05, 'samples': 23582784, 'steps': 122826, 'loss/train': 1.2293519973754883} 11/07/2021 14:32:23 - INFO - __main__ - Step 122828: {'lr': 4.0447079308879336e-05, 'samples': 23582976, 'steps': 122827, 'loss/train': 0.7919381856918335} 11/07/2021 14:32:23 - INFO - __main__ - Step 122829: {'lr': 4.044418534956567e-05, 'samples': 23583168, 'steps': 122828, 'loss/train': 1.029121994972229} 11/07/2021 14:32:24 - INFO - __main__ - Step 122830: {'lr': 4.044129148467463e-05, 'samples': 23583360, 'steps': 122829, 'loss/train': 1.7036877870559692} 11/07/2021 14:32:24 - INFO - __main__ - Step 122831: {'lr': 4.043839771420749e-05, 'samples': 23583552, 'steps': 122830, 'loss/train': 1.226304531097412} 11/07/2021 14:32:25 - INFO - __main__ - Step 122832: {'lr': 4.0435504038165563e-05, 'samples': 23583744, 'steps': 122831, 'loss/train': 1.5348819494247437} 11/07/2021 14:32:25 - INFO - __main__ - Step 122833: {'lr': 4.043261045655019e-05, 'samples': 23583936, 'steps': 122832, 'loss/train': 1.2942273616790771} 11/07/2021 14:32:25 - INFO - __main__ - Step 122834: {'lr': 4.0429716969362625e-05, 'samples': 23584128, 'steps': 122833, 'loss/train': 1.3457940816879272} 11/07/2021 14:32:27 - INFO - __main__ - Step 122835: {'lr': 4.042682357660421e-05, 'samples': 23584320, 'steps': 122834, 'loss/train': 1.3019195795059204} 11/07/2021 14:32:27 - INFO - __main__ - Step 122836: {'lr': 4.042393027827632e-05, 'samples': 23584512, 'steps': 122835, 'loss/train': 1.289849042892456} 11/07/2021 14:32:27 - INFO - __main__ - Step 122837: {'lr': 4.0421037074380077e-05, 'samples': 23584704, 'steps': 122836, 'loss/train': 0.8904672265052795} 11/07/2021 14:32:28 - INFO - __main__ - Step 122838: {'lr': 4.041814396491689e-05, 'samples': 23584896, 'steps': 122837, 'loss/train': 1.1189556121826172} 11/07/2021 14:32:28 - INFO - __main__ - Step 122839: {'lr': 4.0415250949888045e-05, 'samples': 23585088, 'steps': 122838, 'loss/train': 1.339619517326355} 11/07/2021 14:32:29 - INFO - __main__ - Step 122840: {'lr': 4.0412358029294856e-05, 'samples': 23585280, 'steps': 122839, 'loss/train': 0.7971931099891663} 11/07/2021 14:32:29 - INFO - __main__ - Step 122841: {'lr': 4.040946520313865e-05, 'samples': 23585472, 'steps': 122840, 'loss/train': 1.4254090785980225} 11/07/2021 14:32:30 - INFO - __main__ - Step 122842: {'lr': 4.040657247142068e-05, 'samples': 23585664, 'steps': 122841, 'loss/train': 1.3872545957565308} 11/07/2021 14:32:30 - INFO - __main__ - Step 122843: {'lr': 4.040367983414228e-05, 'samples': 23585856, 'steps': 122842, 'loss/train': 1.6392955780029297} 11/07/2021 14:32:30 - INFO - __main__ - Step 122844: {'lr': 4.040078729130475e-05, 'samples': 23586048, 'steps': 122843, 'loss/train': 1.321315050125122} 11/07/2021 14:32:31 - INFO - __main__ - Step 122845: {'lr': 4.039789484290937e-05, 'samples': 23586240, 'steps': 122844, 'loss/train': 1.428444743156433} 11/07/2021 14:32:32 - INFO - __main__ - Step 122846: {'lr': 4.03950024889575e-05, 'samples': 23586432, 'steps': 122845, 'loss/train': 0.7766793370246887} 11/07/2021 14:32:32 - INFO - __main__ - Step 122847: {'lr': 4.0392110229450387e-05, 'samples': 23586624, 'steps': 122846, 'loss/train': 1.5083305835723877} 11/07/2021 14:32:33 - INFO - __main__ - Step 122848: {'lr': 4.038921806438936e-05, 'samples': 23586816, 'steps': 122847, 'loss/train': 0.5117987394332886} 11/07/2021 14:32:33 - INFO - __main__ - Step 122849: {'lr': 4.0386325993775705e-05, 'samples': 23587008, 'steps': 122848, 'loss/train': 1.2286193370819092} 11/07/2021 14:32:34 - INFO - __main__ - Step 122850: {'lr': 4.038343401761083e-05, 'samples': 23587200, 'steps': 122849, 'loss/train': 1.6725739240646362} 11/07/2021 14:32:34 - INFO - __main__ - Step 122851: {'lr': 4.0380542135895844e-05, 'samples': 23587392, 'steps': 122850, 'loss/train': 1.317402720451355} 11/07/2021 14:32:35 - INFO - __main__ - Step 122852: {'lr': 4.037765034863217e-05, 'samples': 23587584, 'steps': 122851, 'loss/train': 1.1304845809936523} 11/07/2021 14:32:35 - INFO - __main__ - Step 122853: {'lr': 4.0374758655821104e-05, 'samples': 23587776, 'steps': 122852, 'loss/train': 1.4233531951904297} 11/07/2021 14:32:35 - INFO - __main__ - Step 122854: {'lr': 4.03718670574639e-05, 'samples': 23587968, 'steps': 122853, 'loss/train': 1.5434863567352295} 11/07/2021 14:32:37 - INFO - __main__ - Step 122855: {'lr': 4.036897555356195e-05, 'samples': 23588160, 'steps': 122854, 'loss/train': 1.598573923110962} 11/07/2021 14:32:37 - INFO - __main__ - Step 122856: {'lr': 4.0366084144116465e-05, 'samples': 23588352, 'steps': 122855, 'loss/train': 0.9092813730239868} 11/07/2021 14:32:37 - INFO - __main__ - Step 122857: {'lr': 4.036319282912878e-05, 'samples': 23588544, 'steps': 122856, 'loss/train': 1.171512484550476} 11/07/2021 14:32:38 - INFO - __main__ - Step 122858: {'lr': 4.0360301608600244e-05, 'samples': 23588736, 'steps': 122857, 'loss/train': 1.1921430826187134} 11/07/2021 14:32:38 - INFO - __main__ - Step 122859: {'lr': 4.035741048253211e-05, 'samples': 23588928, 'steps': 122858, 'loss/train': 1.0946036577224731} 11/07/2021 14:32:38 - INFO - __main__ - Step 122860: {'lr': 4.035451945092567e-05, 'samples': 23589120, 'steps': 122859, 'loss/train': 1.398002028465271} 11/07/2021 14:32:40 - INFO - __main__ - Step 122861: {'lr': 4.0351628513782266e-05, 'samples': 23589312, 'steps': 122860, 'loss/train': 1.8346339464187622} 11/07/2021 14:32:40 - INFO - __main__ - Step 122862: {'lr': 4.034873767110317e-05, 'samples': 23589504, 'steps': 122861, 'loss/train': 1.5976390838623047} 11/07/2021 14:32:40 - INFO - __main__ - Step 122863: {'lr': 4.0345846922889784e-05, 'samples': 23589696, 'steps': 122862, 'loss/train': 1.4568367004394531} 11/07/2021 14:32:41 - INFO - __main__ - Step 122864: {'lr': 4.034295626914325e-05, 'samples': 23589888, 'steps': 122863, 'loss/train': 1.331172227859497} 11/07/2021 14:32:41 - INFO - __main__ - Step 122865: {'lr': 4.034006570986493e-05, 'samples': 23590080, 'steps': 122864, 'loss/train': 0.24834176898002625} 11/07/2021 14:32:42 - INFO - __main__ - Step 122866: {'lr': 4.033717524505615e-05, 'samples': 23590272, 'steps': 122865, 'loss/train': 1.347160816192627} 11/07/2021 14:32:42 - INFO - __main__ - Step 122867: {'lr': 4.033428487471821e-05, 'samples': 23590464, 'steps': 122866, 'loss/train': 1.1616034507751465} 11/07/2021 14:32:43 - INFO - __main__ - Step 122868: {'lr': 4.0331394598852404e-05, 'samples': 23590656, 'steps': 122867, 'loss/train': 1.0711170434951782} 11/07/2021 14:32:43 - INFO - __main__ - Step 122869: {'lr': 4.032850441746003e-05, 'samples': 23590848, 'steps': 122868, 'loss/train': 1.455581545829773} 11/07/2021 14:32:44 - INFO - __main__ - Step 122870: {'lr': 4.032561433054241e-05, 'samples': 23591040, 'steps': 122869, 'loss/train': 1.4863700866699219} 11/07/2021 14:32:45 - INFO - __main__ - Step 122871: {'lr': 4.032272433810083e-05, 'samples': 23591232, 'steps': 122870, 'loss/train': 1.4943856000900269} 11/07/2021 14:32:45 - INFO - __main__ - Step 122872: {'lr': 4.0319834440136594e-05, 'samples': 23591424, 'steps': 122871, 'loss/train': 1.2936283349990845} 11/07/2021 14:32:45 - INFO - __main__ - Step 122873: {'lr': 4.0316944636651004e-05, 'samples': 23591616, 'steps': 122872, 'loss/train': 1.2368028163909912} 11/07/2021 14:32:46 - INFO - __main__ - Step 122874: {'lr': 4.031405492764534e-05, 'samples': 23591808, 'steps': 122873, 'loss/train': 1.1499754190444946} 11/07/2021 14:32:46 - INFO - __main__ - Step 122875: {'lr': 4.0311165313120954e-05, 'samples': 23592000, 'steps': 122874, 'loss/train': 1.0705862045288086} 11/07/2021 14:32:46 - INFO - __main__ - Step 122876: {'lr': 4.030827579307911e-05, 'samples': 23592192, 'steps': 122875, 'loss/train': 1.0298823118209839} 11/07/2021 14:32:48 - INFO - __main__ - Step 122877: {'lr': 4.030538636752118e-05, 'samples': 23592384, 'steps': 122876, 'loss/train': 0.7011131644248962} 11/07/2021 14:32:48 - INFO - __main__ - Step 122878: {'lr': 4.0302497036448364e-05, 'samples': 23592576, 'steps': 122877, 'loss/train': 0.08740489184856415} 11/07/2021 14:32:48 - INFO - __main__ - Step 122879: {'lr': 4.0299607799861996e-05, 'samples': 23592768, 'steps': 122878, 'loss/train': 1.1891818046569824} 11/07/2021 14:32:49 - INFO - __main__ - Step 122880: {'lr': 4.029671865776338e-05, 'samples': 23592960, 'steps': 122879, 'loss/train': 1.4828118085861206} 11/07/2021 14:32:49 - INFO - __main__ - Step 122881: {'lr': 4.029382961015385e-05, 'samples': 23593152, 'steps': 122880, 'loss/train': 1.6255732774734497} 11/07/2021 14:32:50 - INFO - __main__ - Step 122882: {'lr': 4.029094065703465e-05, 'samples': 23593344, 'steps': 122881, 'loss/train': 1.3784425258636475} 11/07/2021 14:32:51 - INFO - __main__ - Step 122883: {'lr': 4.028805179840714e-05, 'samples': 23593536, 'steps': 122882, 'loss/train': 0.6683790683746338} 11/07/2021 14:32:51 - INFO - __main__ - Step 122884: {'lr': 4.02851630342726e-05, 'samples': 23593728, 'steps': 122883, 'loss/train': 0.595981240272522} 11/07/2021 14:32:51 - INFO - __main__ - Step 122885: {'lr': 4.02822743646323e-05, 'samples': 23593920, 'steps': 122884, 'loss/train': 1.4519572257995605} 11/07/2021 14:32:52 - INFO - __main__ - Step 122886: {'lr': 4.0279385789487614e-05, 'samples': 23594112, 'steps': 122885, 'loss/train': 1.4910883903503418} 11/07/2021 14:32:53 - INFO - __main__ - Step 122887: {'lr': 4.0276497308839785e-05, 'samples': 23594304, 'steps': 122886, 'loss/train': 1.4070607423782349} 11/07/2021 14:32:53 - INFO - __main__ - Step 122888: {'lr': 4.027360892269011e-05, 'samples': 23594496, 'steps': 122887, 'loss/train': 1.3252785205841064} 11/07/2021 14:32:53 - INFO - __main__ - Step 122889: {'lr': 4.027072063103993e-05, 'samples': 23594688, 'steps': 122888, 'loss/train': 1.1050090789794922} 11/07/2021 14:32:54 - INFO - __main__ - Step 122890: {'lr': 4.0267832433890596e-05, 'samples': 23594880, 'steps': 122889, 'loss/train': 1.286324143409729} 11/07/2021 14:32:54 - INFO - __main__ - Step 122891: {'lr': 4.0264944331243254e-05, 'samples': 23595072, 'steps': 122890, 'loss/train': 1.4627870321273804} 11/07/2021 14:32:55 - INFO - __main__ - Step 122892: {'lr': 4.026205632309932e-05, 'samples': 23595264, 'steps': 122891, 'loss/train': 1.1170475482940674} 11/07/2021 14:32:55 - INFO - __main__ - Step 122893: {'lr': 4.025916840946003e-05, 'samples': 23595456, 'steps': 122892, 'loss/train': 1.604310154914856} 11/07/2021 14:32:56 - INFO - __main__ - Step 122894: {'lr': 4.0256280590326766e-05, 'samples': 23595648, 'steps': 122893, 'loss/train': 1.0528873205184937} 11/07/2021 14:32:56 - INFO - __main__ - Step 122895: {'lr': 4.025339286570076e-05, 'samples': 23595840, 'steps': 122894, 'loss/train': 1.3873742818832397} 11/07/2021 14:32:56 - INFO - __main__ - Step 122896: {'lr': 4.0250505235583326e-05, 'samples': 23596032, 'steps': 122895, 'loss/train': 1.211134672164917} 11/07/2021 14:32:57 - INFO - __main__ - Step 122897: {'lr': 4.024761769997579e-05, 'samples': 23596224, 'steps': 122896, 'loss/train': 0.5401784777641296} 11/07/2021 14:32:58 - INFO - __main__ - Step 122898: {'lr': 4.024473025887945e-05, 'samples': 23596416, 'steps': 122897, 'loss/train': 1.245910406112671} 11/07/2021 14:32:58 - INFO - __main__ - Step 122899: {'lr': 4.02418429122956e-05, 'samples': 23596608, 'steps': 122898, 'loss/train': 1.3769499063491821} 11/07/2021 14:32:59 - INFO - __main__ - Step 122900: {'lr': 4.023895566022553e-05, 'samples': 23596800, 'steps': 122899, 'loss/train': 1.2366172075271606} 11/07/2021 14:32:59 - INFO - __main__ - Step 122901: {'lr': 4.0236068502670556e-05, 'samples': 23596992, 'steps': 122900, 'loss/train': 1.1601299047470093} 11/07/2021 14:33:00 - INFO - __main__ - Step 122902: {'lr': 4.023318143963195e-05, 'samples': 23597184, 'steps': 122901, 'loss/train': 0.10149159282445908} 11/07/2021 14:33:00 - INFO - __main__ - Step 122903: {'lr': 4.023029447111107e-05, 'samples': 23597376, 'steps': 122902, 'loss/train': 1.2974371910095215} 11/07/2021 14:33:01 - INFO - __main__ - Step 122904: {'lr': 4.0227407597109215e-05, 'samples': 23597568, 'steps': 122903, 'loss/train': 1.8814902305603027} 11/07/2021 14:33:01 - INFO - __main__ - Step 122905: {'lr': 4.022452081762762e-05, 'samples': 23597760, 'steps': 122904, 'loss/train': 1.5865792036056519} 11/07/2021 14:33:01 - INFO - __main__ - Step 122906: {'lr': 4.022163413266758e-05, 'samples': 23597952, 'steps': 122905, 'loss/train': 0.822204053401947} 11/07/2021 14:33:02 - INFO - __main__ - Step 122907: {'lr': 4.0218747542230456e-05, 'samples': 23598144, 'steps': 122906, 'loss/train': 1.5909597873687744} 11/07/2021 14:33:03 - INFO - __main__ - Step 122908: {'lr': 4.0215861046317526e-05, 'samples': 23598336, 'steps': 122907, 'loss/train': 1.268939733505249} 11/07/2021 14:33:03 - INFO - __main__ - Step 122909: {'lr': 4.021297464493009e-05, 'samples': 23598528, 'steps': 122908, 'loss/train': 1.4922759532928467} 11/07/2021 14:33:03 - INFO - __main__ - Step 122910: {'lr': 4.021008833806947e-05, 'samples': 23598720, 'steps': 122909, 'loss/train': 1.4594993591308594} 11/07/2021 14:33:04 - INFO - __main__ - Step 122911: {'lr': 4.020720212573692e-05, 'samples': 23598912, 'steps': 122910, 'loss/train': 1.377908706665039} 11/07/2021 14:33:04 - INFO - __main__ - Step 122912: {'lr': 4.0204316007933786e-05, 'samples': 23599104, 'steps': 122911, 'loss/train': 1.2962687015533447} 11/07/2021 14:33:05 - INFO - __main__ - Step 122913: {'lr': 4.020142998466134e-05, 'samples': 23599296, 'steps': 122912, 'loss/train': 1.2020548582077026} 11/07/2021 14:33:06 - INFO - __main__ - Step 122914: {'lr': 4.019854405592091e-05, 'samples': 23599488, 'steps': 122913, 'loss/train': 0.764436662197113} 11/07/2021 14:33:06 - INFO - __main__ - Step 122915: {'lr': 4.019565822171376e-05, 'samples': 23599680, 'steps': 122914, 'loss/train': 1.3140369653701782} 11/07/2021 14:33:06 - INFO - __main__ - Step 122916: {'lr': 4.01927724820412e-05, 'samples': 23599872, 'steps': 122915, 'loss/train': 1.4427850246429443} 11/07/2021 14:33:07 - INFO - __main__ - Step 122917: {'lr': 4.018988683690461e-05, 'samples': 23600064, 'steps': 122916, 'loss/train': 1.877947449684143} 11/07/2021 14:33:08 - INFO - __main__ - Step 122918: {'lr': 4.0187001286305176e-05, 'samples': 23600256, 'steps': 122917, 'loss/train': 0.9832974076271057} 11/07/2021 14:33:08 - INFO - __main__ - Step 122919: {'lr': 4.018411583024423e-05, 'samples': 23600448, 'steps': 122918, 'loss/train': 1.7516169548034668} 11/07/2021 14:33:08 - INFO - __main__ - Step 122920: {'lr': 4.018123046872307e-05, 'samples': 23600640, 'steps': 122919, 'loss/train': 1.6826176643371582} 11/07/2021 14:33:09 - INFO - __main__ - Step 122921: {'lr': 4.017834520174302e-05, 'samples': 23600832, 'steps': 122920, 'loss/train': 0.9253250956535339} 11/07/2021 14:33:09 - INFO - __main__ - Step 122922: {'lr': 4.017546002930536e-05, 'samples': 23601024, 'steps': 122921, 'loss/train': 1.2651808261871338} 11/07/2021 14:33:10 - INFO - __main__ - Step 122923: {'lr': 4.017257495141141e-05, 'samples': 23601216, 'steps': 122922, 'loss/train': 1.3666127920150757} 11/07/2021 14:33:10 - INFO - __main__ - Step 122924: {'lr': 4.0169689968062476e-05, 'samples': 23601408, 'steps': 122923, 'loss/train': 1.3477375507354736} 11/07/2021 14:33:11 - INFO - __main__ - Step 122925: {'lr': 4.0166805079259824e-05, 'samples': 23601600, 'steps': 122924, 'loss/train': 1.5298004150390625} 11/07/2021 14:33:11 - INFO - __main__ - Step 122926: {'lr': 4.0163920285004765e-05, 'samples': 23601792, 'steps': 122925, 'loss/train': 1.3473882675170898} 11/07/2021 14:33:11 - INFO - __main__ - Step 122927: {'lr': 4.016103558529863e-05, 'samples': 23601984, 'steps': 122926, 'loss/train': 1.0467168092727661} 11/07/2021 14:33:13 - INFO - __main__ - Step 122928: {'lr': 4.0158150980142665e-05, 'samples': 23602176, 'steps': 122927, 'loss/train': 1.2263760566711426} 11/07/2021 14:33:13 - INFO - __main__ - Step 122929: {'lr': 4.015526646953821e-05, 'samples': 23602368, 'steps': 122928, 'loss/train': 1.0987958908081055} 11/07/2021 14:33:13 - INFO - __main__ - Step 122930: {'lr': 4.0152382053486617e-05, 'samples': 23602560, 'steps': 122929, 'loss/train': 1.2122747898101807} 11/07/2021 14:33:14 - INFO - __main__ - Step 122931: {'lr': 4.014949773198906e-05, 'samples': 23602752, 'steps': 122930, 'loss/train': 1.264204978942871} 11/07/2021 14:33:14 - INFO - __main__ - Step 122932: {'lr': 4.014661350504692e-05, 'samples': 23602944, 'steps': 122931, 'loss/train': 1.4030659198760986} 11/07/2021 14:33:15 - INFO - __main__ - Step 122933: {'lr': 4.014372937266145e-05, 'samples': 23603136, 'steps': 122932, 'loss/train': 0.8563552498817444} 11/07/2021 14:33:15 - INFO - __main__ - Step 122934: {'lr': 4.0140845334834005e-05, 'samples': 23603328, 'steps': 122933, 'loss/train': 1.2494388818740845} 11/07/2021 14:33:16 - INFO - __main__ - Step 122935: {'lr': 4.0137961391565836e-05, 'samples': 23603520, 'steps': 122934, 'loss/train': 1.9175410270690918} 11/07/2021 14:33:16 - INFO - __main__ - Step 122936: {'lr': 4.013507754285825e-05, 'samples': 23603712, 'steps': 122935, 'loss/train': 0.8010329604148865} 11/07/2021 14:33:16 - INFO - __main__ - Step 122937: {'lr': 4.013219378871258e-05, 'samples': 23603904, 'steps': 122936, 'loss/train': 1.1737312078475952} 11/07/2021 14:33:17 - INFO - __main__ - Step 122938: {'lr': 4.0129310129130094e-05, 'samples': 23604096, 'steps': 122937, 'loss/train': 1.2075673341751099} 11/07/2021 14:33:18 - INFO - __main__ - Step 122939: {'lr': 4.012642656411211e-05, 'samples': 23604288, 'steps': 122938, 'loss/train': 1.348296046257019} 11/07/2021 14:33:18 - INFO - __main__ - Step 122940: {'lr': 4.012354309365992e-05, 'samples': 23604480, 'steps': 122939, 'loss/train': 1.2512279748916626} 11/07/2021 14:33:19 - INFO - __main__ - Step 122941: {'lr': 4.0120659717774843e-05, 'samples': 23604672, 'steps': 122940, 'loss/train': 1.400955080986023} 11/07/2021 14:33:19 - INFO - __main__ - Step 122942: {'lr': 4.011777643645811e-05, 'samples': 23604864, 'steps': 122941, 'loss/train': 1.306995153427124} 11/07/2021 14:33:19 - INFO - __main__ - Step 122943: {'lr': 4.0114893249711125e-05, 'samples': 23605056, 'steps': 122942, 'loss/train': 0.08456646651029587} 11/07/2021 14:33:20 - INFO - __main__ - Step 122944: {'lr': 4.011201015753516e-05, 'samples': 23605248, 'steps': 122943, 'loss/train': 1.5197328329086304} 11/07/2021 14:33:21 - INFO - __main__ - Step 122945: {'lr': 4.010912715993143e-05, 'samples': 23605440, 'steps': 122944, 'loss/train': 1.1671273708343506} 11/07/2021 14:33:21 - INFO - __main__ - Step 122946: {'lr': 4.0106244256901264e-05, 'samples': 23605632, 'steps': 122945, 'loss/train': 0.8325340747833252} 11/07/2021 14:33:22 - INFO - __main__ - Step 122947: {'lr': 4.010336144844601e-05, 'samples': 23605824, 'steps': 122946, 'loss/train': 1.0286033153533936} 11/07/2021 14:33:22 - INFO - __main__ - Step 122948: {'lr': 4.010047873456693e-05, 'samples': 23606016, 'steps': 122947, 'loss/train': 0.5093704462051392} 11/07/2021 14:33:23 - INFO - __main__ - Step 122949: {'lr': 4.009759611526534e-05, 'samples': 23606208, 'steps': 122948, 'loss/train': 1.2172342538833618} 11/07/2021 14:33:23 - INFO - __main__ - Step 122950: {'lr': 4.009471359054254e-05, 'samples': 23606400, 'steps': 122949, 'loss/train': 1.0821983814239502} 11/07/2021 14:33:24 - INFO - __main__ - Step 122951: {'lr': 4.009183116039983e-05, 'samples': 23606592, 'steps': 122950, 'loss/train': 1.2041850090026855} 11/07/2021 14:33:24 - INFO - __main__ - Step 122952: {'lr': 4.008894882483849e-05, 'samples': 23606784, 'steps': 122951, 'loss/train': 1.3980246782302856} 11/07/2021 14:33:24 - INFO - __main__ - Step 122953: {'lr': 4.008606658385983e-05, 'samples': 23606976, 'steps': 122952, 'loss/train': 1.4022951126098633} 11/07/2021 14:33:25 - INFO - __main__ - Step 122954: {'lr': 4.008318443746517e-05, 'samples': 23607168, 'steps': 122953, 'loss/train': 1.2806695699691772} 11/07/2021 14:33:26 - INFO - __main__ - Step 122955: {'lr': 4.008030238565577e-05, 'samples': 23607360, 'steps': 122954, 'loss/train': 1.297724962234497} 11/07/2021 14:33:26 - INFO - __main__ - Step 122956: {'lr': 4.007742042843293e-05, 'samples': 23607552, 'steps': 122955, 'loss/train': 1.3087878227233887} 11/07/2021 14:33:26 - INFO - __main__ - Step 122957: {'lr': 4.007453856579804e-05, 'samples': 23607744, 'steps': 122956, 'loss/train': 1.5673737525939941} 11/07/2021 14:33:27 - INFO - __main__ - Step 122958: {'lr': 4.007165679775226e-05, 'samples': 23607936, 'steps': 122957, 'loss/train': 0.37939754128456116} 11/07/2021 14:33:27 - INFO - __main__ - Step 122959: {'lr': 4.0068775124296965e-05, 'samples': 23608128, 'steps': 122958, 'loss/train': 1.2697629928588867} 11/07/2021 14:33:28 - INFO - __main__ - Step 122960: {'lr': 4.006589354543344e-05, 'samples': 23608320, 'steps': 122959, 'loss/train': 1.6400622129440308} 11/07/2021 14:33:29 - INFO - __main__ - Step 122961: {'lr': 4.0063012061162974e-05, 'samples': 23608512, 'steps': 122960, 'loss/train': 1.1143039464950562} 11/07/2021 14:33:29 - INFO - __main__ - Step 122962: {'lr': 4.006013067148687e-05, 'samples': 23608704, 'steps': 122961, 'loss/train': 1.4065525531768799} 11/07/2021 14:33:29 - INFO - __main__ - Step 122963: {'lr': 4.005724937640645e-05, 'samples': 23608896, 'steps': 122962, 'loss/train': 1.25282883644104} 11/07/2021 14:33:30 - INFO - __main__ - Step 122964: {'lr': 4.0054368175922976e-05, 'samples': 23609088, 'steps': 122963, 'loss/train': 1.843550443649292} 11/07/2021 14:33:31 - INFO - __main__ - Step 122965: {'lr': 4.005148707003778e-05, 'samples': 23609280, 'steps': 122964, 'loss/train': 1.3697794675827026} 11/07/2021 14:33:31 - INFO - __main__ - Step 122966: {'lr': 4.004860605875213e-05, 'samples': 23609472, 'steps': 122965, 'loss/train': 1.375239610671997} 11/07/2021 14:33:32 - INFO - __main__ - Step 122967: {'lr': 4.004572514206734e-05, 'samples': 23609664, 'steps': 122966, 'loss/train': 0.14163078367710114} 11/07/2021 14:33:32 - INFO - __main__ - Step 122968: {'lr': 4.004284431998473e-05, 'samples': 23609856, 'steps': 122967, 'loss/train': 0.14436830580234528} 11/07/2021 14:33:32 - INFO - __main__ - Step 122969: {'lr': 4.003996359250553e-05, 'samples': 23610048, 'steps': 122968, 'loss/train': 1.7466508150100708} 11/07/2021 14:33:33 - INFO - __main__ - Step 122970: {'lr': 4.0037082959631125e-05, 'samples': 23610240, 'steps': 122969, 'loss/train': 0.8422331809997559} 11/07/2021 14:33:34 - INFO - __main__ - Step 122971: {'lr': 4.0034202421362797e-05, 'samples': 23610432, 'steps': 122970, 'loss/train': 1.1771742105484009} 11/07/2021 14:33:34 - INFO - __main__ - Step 122972: {'lr': 4.003132197770179e-05, 'samples': 23610624, 'steps': 122971, 'loss/train': 1.3703733682632446} 11/07/2021 14:33:34 - INFO - __main__ - Step 122973: {'lr': 4.002844162864941e-05, 'samples': 23610816, 'steps': 122972, 'loss/train': 1.2949419021606445} 11/07/2021 14:33:35 - INFO - __main__ - Step 122974: {'lr': 4.002556137420696e-05, 'samples': 23611008, 'steps': 122973, 'loss/train': 1.4502640962600708} 11/07/2021 14:33:36 - INFO - __main__ - Step 122975: {'lr': 4.002268121437577e-05, 'samples': 23611200, 'steps': 122974, 'loss/train': 1.05476975440979} 11/07/2021 14:33:36 - INFO - __main__ - Step 122976: {'lr': 4.0019801149157124e-05, 'samples': 23611392, 'steps': 122975, 'loss/train': 1.4756842851638794} 11/07/2021 14:33:37 - INFO - __main__ - Step 122977: {'lr': 4.0016921178552327e-05, 'samples': 23611584, 'steps': 122976, 'loss/train': 1.269564151763916} 11/07/2021 14:33:37 - INFO - __main__ - Step 122978: {'lr': 4.001404130256264e-05, 'samples': 23611776, 'steps': 122977, 'loss/train': 1.0197104215621948} 11/07/2021 14:33:37 - INFO - __main__ - Step 122979: {'lr': 4.001116152118939e-05, 'samples': 23611968, 'steps': 122978, 'loss/train': 1.3854601383209229} 11/07/2021 14:33:38 - INFO - __main__ - Step 122980: {'lr': 4.000828183443386e-05, 'samples': 23612160, 'steps': 122979, 'loss/train': 1.1663217544555664} 11/07/2021 14:33:39 - INFO - __main__ - Step 122981: {'lr': 4.000540224229737e-05, 'samples': 23612352, 'steps': 122980, 'loss/train': 1.4659737348556519} 11/07/2021 14:33:39 - INFO - __main__ - Step 122982: {'lr': 4.000252274478122e-05, 'samples': 23612544, 'steps': 122981, 'loss/train': 1.2311506271362305} 11/07/2021 14:33:39 - INFO - __main__ - Step 122983: {'lr': 3.999964334188669e-05, 'samples': 23612736, 'steps': 122982, 'loss/train': 0.5921839475631714} 11/07/2021 14:33:40 - INFO - __main__ - Step 122984: {'lr': 3.999676403361513e-05, 'samples': 23612928, 'steps': 122983, 'loss/train': 1.522803783416748} 11/07/2021 14:33:41 - INFO - __main__ - Step 122985: {'lr': 3.999388481996771e-05, 'samples': 23613120, 'steps': 122984, 'loss/train': 1.2880207300186157} 11/07/2021 14:33:41 - INFO - __main__ - Step 122986: {'lr': 3.999100570094582e-05, 'samples': 23613312, 'steps': 122985, 'loss/train': 0.9021084904670715} 11/07/2021 14:33:41 - INFO - __main__ - Step 122987: {'lr': 3.998812667655074e-05, 'samples': 23613504, 'steps': 122986, 'loss/train': 1.6487133502960205} 11/07/2021 14:33:42 - INFO - __main__ - Step 122988: {'lr': 3.9985247746783806e-05, 'samples': 23613696, 'steps': 122987, 'loss/train': 1.1878478527069092} 11/07/2021 14:33:42 - INFO - __main__ - Step 122989: {'lr': 3.9982368911646224e-05, 'samples': 23613888, 'steps': 122988, 'loss/train': 0.9049253463745117} 11/07/2021 14:33:43 - INFO - __main__ - Step 122990: {'lr': 3.997949017113939e-05, 'samples': 23614080, 'steps': 122989, 'loss/train': 0.9746875166893005} 11/07/2021 14:33:44 - INFO - __main__ - Step 122991: {'lr': 3.997661152526452e-05, 'samples': 23614272, 'steps': 122990, 'loss/train': 1.0071172714233398} 11/07/2021 14:33:44 - INFO - __main__ - Step 122992: {'lr': 3.997373297402296e-05, 'samples': 23614464, 'steps': 122991, 'loss/train': 1.081445336341858} 11/07/2021 14:33:44 - INFO - __main__ - Step 122993: {'lr': 3.9970854517416e-05, 'samples': 23614656, 'steps': 122992, 'loss/train': 1.4014503955841064} 11/07/2021 14:33:45 - INFO - __main__ - Step 122994: {'lr': 3.996797615544493e-05, 'samples': 23614848, 'steps': 122993, 'loss/train': 1.2938346862792969} 11/07/2021 14:33:46 - INFO - __main__ - Step 122995: {'lr': 3.996509788811106e-05, 'samples': 23615040, 'steps': 122994, 'loss/train': 0.6136462688446045} 11/07/2021 14:33:46 - INFO - __main__ - Step 122996: {'lr': 3.996221971541566e-05, 'samples': 23615232, 'steps': 122995, 'loss/train': 1.2765605449676514} 11/07/2021 14:33:46 - INFO - __main__ - Step 122997: {'lr': 3.995934163736004e-05, 'samples': 23615424, 'steps': 122996, 'loss/train': 0.7434252500534058} 11/07/2021 14:33:47 - INFO - __main__ - Step 122998: {'lr': 3.995646365394559e-05, 'samples': 23615616, 'steps': 122997, 'loss/train': 1.007278561592102} 11/07/2021 14:33:47 - INFO - __main__ - Step 122999: {'lr': 3.995358576517341e-05, 'samples': 23615808, 'steps': 122998, 'loss/train': 1.0903691053390503} 11/07/2021 14:33:48 - INFO - __main__ - Step 123000: {'lr': 3.995070797104494e-05, 'samples': 23616000, 'steps': 122999, 'loss/train': 1.138880729675293} 11/07/2021 14:33:49 - INFO - __main__ - Step 123001: {'lr': 3.994783027156143e-05, 'samples': 23616192, 'steps': 123000, 'loss/train': 1.3537299633026123} 11/07/2021 14:33:49 - INFO - __main__ - Step 123002: {'lr': 3.994495266672418e-05, 'samples': 23616384, 'steps': 123001, 'loss/train': 1.4399479627609253} 11/07/2021 14:33:49 - INFO - __main__ - Step 123003: {'lr': 3.9942075156534504e-05, 'samples': 23616576, 'steps': 123002, 'loss/train': 1.3189772367477417} 11/07/2021 14:33:50 - INFO - __main__ - Step 123004: {'lr': 3.993919774099367e-05, 'samples': 23616768, 'steps': 123003, 'loss/train': 0.15362942218780518} 11/07/2021 14:33:51 - INFO - __main__ - Step 123005: {'lr': 3.9936320420103006e-05, 'samples': 23616960, 'steps': 123004, 'loss/train': 0.05097329616546631} 11/07/2021 14:33:51 - INFO - __main__ - Step 123006: {'lr': 3.9933443193863775e-05, 'samples': 23617152, 'steps': 123005, 'loss/train': 1.6662551164627075} 11/07/2021 14:33:51 - INFO - __main__ - Step 123007: {'lr': 3.9930566062277325e-05, 'samples': 23617344, 'steps': 123006, 'loss/train': 1.1961950063705444} 11/07/2021 14:33:52 - INFO - __main__ - Step 123008: {'lr': 3.992768902534491e-05, 'samples': 23617536, 'steps': 123007, 'loss/train': 0.9485980868339539} 11/07/2021 14:33:52 - INFO - __main__ - Step 123009: {'lr': 3.992481208306781e-05, 'samples': 23617728, 'steps': 123008, 'loss/train': 0.30268537998199463} 11/07/2021 14:33:53 - INFO - __main__ - Step 123010: {'lr': 3.9921935235447374e-05, 'samples': 23617920, 'steps': 123009, 'loss/train': 0.9133272767066956} 11/07/2021 14:33:53 - INFO - __main__ - Step 123011: {'lr': 3.991905848248492e-05, 'samples': 23618112, 'steps': 123010, 'loss/train': 1.467418909072876} 11/07/2021 14:33:54 - INFO - __main__ - Step 123012: {'lr': 3.991618182418166e-05, 'samples': 23618304, 'steps': 123011, 'loss/train': 1.817793607711792} 11/07/2021 14:33:54 - INFO - __main__ - Step 123013: {'lr': 3.991330526053891e-05, 'samples': 23618496, 'steps': 123012, 'loss/train': 1.338951826095581} 11/07/2021 14:33:54 - INFO - __main__ - Step 123014: {'lr': 3.991042879155798e-05, 'samples': 23618688, 'steps': 123013, 'loss/train': 1.34908127784729} 11/07/2021 14:33:56 - INFO - __main__ - Step 123015: {'lr': 3.9907552417240176e-05, 'samples': 23618880, 'steps': 123014, 'loss/train': 1.1953933238983154} 11/07/2021 14:33:56 - INFO - __main__ - Step 123016: {'lr': 3.990467613758678e-05, 'samples': 23619072, 'steps': 123015, 'loss/train': 1.3457379341125488} 11/07/2021 14:33:56 - INFO - __main__ - Step 123017: {'lr': 3.99017999525991e-05, 'samples': 23619264, 'steps': 123016, 'loss/train': 1.2366234064102173} 11/07/2021 14:33:57 - INFO - __main__ - Step 123018: {'lr': 3.989892386227842e-05, 'samples': 23619456, 'steps': 123017, 'loss/train': 1.4565880298614502} 11/07/2021 14:33:57 - INFO - __main__ - Step 123019: {'lr': 3.989604786662604e-05, 'samples': 23619648, 'steps': 123018, 'loss/train': 1.3854470252990723} 11/07/2021 14:33:58 - INFO - __main__ - Step 123020: {'lr': 3.989317196564326e-05, 'samples': 23619840, 'steps': 123019, 'loss/train': 1.1797430515289307} 11/07/2021 14:33:58 - INFO - __main__ - Step 123021: {'lr': 3.989029615933137e-05, 'samples': 23620032, 'steps': 123020, 'loss/train': 1.1467738151550293} 11/07/2021 14:33:59 - INFO - __main__ - Step 123022: {'lr': 3.98874204476917e-05, 'samples': 23620224, 'steps': 123021, 'loss/train': 0.7931938171386719} 11/07/2021 14:33:59 - INFO - __main__ - Step 123023: {'lr': 3.9884544830725484e-05, 'samples': 23620416, 'steps': 123022, 'loss/train': 1.1930344104766846} 11/07/2021 14:33:59 - INFO - __main__ - Step 123024: {'lr': 3.988166930843407e-05, 'samples': 23620608, 'steps': 123023, 'loss/train': 1.198403239250183} 11/07/2021 14:34:00 - INFO - __main__ - Step 123025: {'lr': 3.9878793880818804e-05, 'samples': 23620800, 'steps': 123024, 'loss/train': 1.7297508716583252} 11/07/2021 14:34:01 - INFO - __main__ - Step 123026: {'lr': 3.987591854788081e-05, 'samples': 23620992, 'steps': 123025, 'loss/train': 1.5213098526000977} 11/07/2021 14:34:01 - INFO - __main__ - Step 123027: {'lr': 3.9873043309621524e-05, 'samples': 23621184, 'steps': 123026, 'loss/train': 0.9531729817390442} 11/07/2021 14:34:02 - INFO - __main__ - Step 123028: {'lr': 3.987016816604219e-05, 'samples': 23621376, 'steps': 123027, 'loss/train': 1.2642927169799805} 11/07/2021 14:34:02 - INFO - __main__ - Step 123029: {'lr': 3.98672931171441e-05, 'samples': 23621568, 'steps': 123028, 'loss/train': 1.2074172496795654} 11/07/2021 14:34:02 - INFO - __main__ - Step 123030: {'lr': 3.9864418162928576e-05, 'samples': 23621760, 'steps': 123029, 'loss/train': 1.326902151107788} 11/07/2021 14:34:03 - INFO - __main__ - Step 123031: {'lr': 3.986154330339692e-05, 'samples': 23621952, 'steps': 123030, 'loss/train': 1.1538357734680176} 11/07/2021 14:34:04 - INFO - __main__ - Step 123032: {'lr': 3.985866853855038e-05, 'samples': 23622144, 'steps': 123031, 'loss/train': 1.4617685079574585} 11/07/2021 14:34:04 - INFO - __main__ - Step 123033: {'lr': 3.985579386839031e-05, 'samples': 23622336, 'steps': 123032, 'loss/train': 1.1873277425765991} 11/07/2021 14:34:04 - INFO - __main__ - Step 123034: {'lr': 3.985291929291796e-05, 'samples': 23622528, 'steps': 123033, 'loss/train': 1.3227438926696777} 11/07/2021 14:34:05 - INFO - __main__ - Step 123035: {'lr': 3.985004481213464e-05, 'samples': 23622720, 'steps': 123034, 'loss/train': 0.8177157044410706} 11/07/2021 14:34:06 - INFO - __main__ - Step 123036: {'lr': 3.984717042604169e-05, 'samples': 23622912, 'steps': 123035, 'loss/train': 1.5837385654449463} 11/07/2021 14:34:06 - INFO - __main__ - Step 123037: {'lr': 3.984429613464033e-05, 'samples': 23623104, 'steps': 123036, 'loss/train': 1.1994433403015137} 11/07/2021 14:34:07 - INFO - __main__ - Step 123038: {'lr': 3.984142193793189e-05, 'samples': 23623296, 'steps': 123037, 'loss/train': 1.2687851190567017} 11/07/2021 14:34:07 - INFO - __main__ - Step 123039: {'lr': 3.983854783591764e-05, 'samples': 23623488, 'steps': 123038, 'loss/train': 1.4343249797821045} 11/07/2021 14:34:07 - INFO - __main__ - Step 123040: {'lr': 3.98356738285989e-05, 'samples': 23623680, 'steps': 123039, 'loss/train': 1.40854012966156} 11/07/2021 14:34:08 - INFO - __main__ - Step 123041: {'lr': 3.983279991597699e-05, 'samples': 23623872, 'steps': 123040, 'loss/train': 0.04010568931698799} 11/07/2021 14:34:09 - INFO - __main__ - Step 123042: {'lr': 3.9829926098053165e-05, 'samples': 23624064, 'steps': 123041, 'loss/train': 1.3425228595733643} 11/07/2021 14:34:09 - INFO - __main__ - Step 123043: {'lr': 3.982705237482873e-05, 'samples': 23624256, 'steps': 123042, 'loss/train': 1.391300916671753} 11/07/2021 14:34:09 - INFO - __main__ - Step 123044: {'lr': 3.982417874630498e-05, 'samples': 23624448, 'steps': 123043, 'loss/train': 1.272483229637146} 11/07/2021 14:34:10 - INFO - __main__ - Step 123045: {'lr': 3.98213052124832e-05, 'samples': 23624640, 'steps': 123044, 'loss/train': 0.8652746081352234} 11/07/2021 14:34:11 - INFO - __main__ - Step 123046: {'lr': 3.981843177336469e-05, 'samples': 23624832, 'steps': 123045, 'loss/train': 1.3772727251052856} 11/07/2021 14:34:11 - INFO - __main__ - Step 123047: {'lr': 3.981555842895085e-05, 'samples': 23625024, 'steps': 123046, 'loss/train': 0.44457224011421204} 11/07/2021 14:34:12 - INFO - __main__ - Step 123048: {'lr': 3.9812685179242775e-05, 'samples': 23625216, 'steps': 123047, 'loss/train': 1.359013319015503} 11/07/2021 14:34:12 - INFO - __main__ - Step 123049: {'lr': 3.9809812024241885e-05, 'samples': 23625408, 'steps': 123048, 'loss/train': 2.087406873703003} 11/07/2021 14:34:12 - INFO - __main__ - Step 123050: {'lr': 3.9806938963949465e-05, 'samples': 23625600, 'steps': 123049, 'loss/train': 0.9776168465614319} 11/07/2021 14:34:13 - INFO - __main__ - Step 123051: {'lr': 3.980406599836675e-05, 'samples': 23625792, 'steps': 123050, 'loss/train': 1.2521015405654907} 11/07/2021 14:34:14 - INFO - __main__ - Step 123052: {'lr': 3.980119312749511e-05, 'samples': 23625984, 'steps': 123051, 'loss/train': 1.3740267753601074} 11/07/2021 14:34:14 - INFO - __main__ - Step 123053: {'lr': 3.979832035133579e-05, 'samples': 23626176, 'steps': 123052, 'loss/train': 1.335605502128601} 11/07/2021 14:34:14 - INFO - __main__ - Step 123054: {'lr': 3.9795447669890125e-05, 'samples': 23626368, 'steps': 123053, 'loss/train': 1.2433531284332275} 11/07/2021 14:34:15 - INFO - __main__ - Step 123055: {'lr': 3.979257508315939e-05, 'samples': 23626560, 'steps': 123054, 'loss/train': 1.40195894241333} 11/07/2021 14:34:16 - INFO - __main__ - Step 123056: {'lr': 3.9789702591144865e-05, 'samples': 23626752, 'steps': 123055, 'loss/train': 1.3566601276397705} 11/07/2021 14:34:16 - INFO - __main__ - Step 123057: {'lr': 3.978683019384785e-05, 'samples': 23626944, 'steps': 123056, 'loss/train': 1.5348560810089111} 11/07/2021 14:34:17 - INFO - __main__ - Step 123058: {'lr': 3.97839578912697e-05, 'samples': 23627136, 'steps': 123057, 'loss/train': 1.2881898880004883} 11/07/2021 14:34:17 - INFO - __main__ - Step 123059: {'lr': 3.978108568341163e-05, 'samples': 23627328, 'steps': 123058, 'loss/train': 0.9837015271186829} 11/07/2021 14:34:17 - INFO - __main__ - Step 123060: {'lr': 3.977821357027492e-05, 'samples': 23627520, 'steps': 123059, 'loss/train': 1.2427479028701782} 11/07/2021 14:34:18 - INFO - __main__ - Step 123061: {'lr': 3.9775341551860936e-05, 'samples': 23627712, 'steps': 123060, 'loss/train': 1.493859052658081} 11/07/2021 14:34:19 - INFO - __main__ - Step 123062: {'lr': 3.977246962817091e-05, 'samples': 23627904, 'steps': 123061, 'loss/train': 1.3888177871704102} 11/07/2021 14:34:19 - INFO - __main__ - Step 123063: {'lr': 3.976959779920619e-05, 'samples': 23628096, 'steps': 123062, 'loss/train': 1.2427170276641846} 11/07/2021 14:34:19 - INFO - __main__ - Step 123064: {'lr': 3.9766726064968035e-05, 'samples': 23628288, 'steps': 123063, 'loss/train': 1.030409812927246} 11/07/2021 14:34:20 - INFO - __main__ - Step 123065: {'lr': 3.976385442545774e-05, 'samples': 23628480, 'steps': 123064, 'loss/train': 1.0683057308197021} 11/07/2021 14:34:20 - INFO - __main__ - Step 123066: {'lr': 3.976098288067661e-05, 'samples': 23628672, 'steps': 123065, 'loss/train': 1.48566734790802} 11/07/2021 14:34:21 - INFO - __main__ - Step 123067: {'lr': 3.975811143062594e-05, 'samples': 23628864, 'steps': 123066, 'loss/train': 1.4041377305984497} 11/07/2021 14:34:22 - INFO - __main__ - Step 123068: {'lr': 3.9755240075307034e-05, 'samples': 23629056, 'steps': 123067, 'loss/train': 1.2236202955245972} 11/07/2021 14:34:22 - INFO - __main__ - Step 123069: {'lr': 3.975236881472122e-05, 'samples': 23629248, 'steps': 123068, 'loss/train': 0.04792112112045288} 11/07/2021 14:34:22 - INFO - __main__ - Step 123070: {'lr': 3.974949764886968e-05, 'samples': 23629440, 'steps': 123069, 'loss/train': 1.5030945539474487} 11/07/2021 14:34:23 - INFO - __main__ - Step 123071: {'lr': 3.974662657775377e-05, 'samples': 23629632, 'steps': 123070, 'loss/train': 0.9058906435966492} 11/07/2021 14:34:24 - INFO - __main__ - Step 123072: {'lr': 3.9743755601374806e-05, 'samples': 23629824, 'steps': 123071, 'loss/train': 1.5878373384475708} 11/07/2021 14:34:24 - INFO - __main__ - Step 123073: {'lr': 3.9740884719734053e-05, 'samples': 23630016, 'steps': 123072, 'loss/train': 1.4250634908676147} 11/07/2021 14:34:25 - INFO - __main__ - Step 123074: {'lr': 3.97380139328328e-05, 'samples': 23630208, 'steps': 123073, 'loss/train': 0.07278912514448166} 11/07/2021 14:34:25 - INFO - __main__ - Step 123075: {'lr': 3.973514324067237e-05, 'samples': 23630400, 'steps': 123074, 'loss/train': 1.307030439376831} 11/07/2021 14:34:25 - INFO - __main__ - Step 123076: {'lr': 3.973227264325402e-05, 'samples': 23630592, 'steps': 123075, 'loss/train': 1.5396450757980347} 11/07/2021 14:34:26 - INFO - __main__ - Step 123077: {'lr': 3.9729402140579075e-05, 'samples': 23630784, 'steps': 123076, 'loss/train': 1.2736154794692993} 11/07/2021 14:34:27 - INFO - __main__ - Step 123078: {'lr': 3.972653173264881e-05, 'samples': 23630976, 'steps': 123077, 'loss/train': 0.8366063833236694} 11/07/2021 14:34:27 - INFO - __main__ - Step 123079: {'lr': 3.972366141946454e-05, 'samples': 23631168, 'steps': 123078, 'loss/train': 1.4793367385864258} 11/07/2021 14:34:27 - INFO - __main__ - Step 123080: {'lr': 3.972079120102759e-05, 'samples': 23631360, 'steps': 123079, 'loss/train': 0.779290497303009} 11/07/2021 14:34:28 - INFO - __main__ - Step 123081: {'lr': 3.9717921077339155e-05, 'samples': 23631552, 'steps': 123080, 'loss/train': 1.277098298072815} 11/07/2021 14:34:29 - INFO - __main__ - Step 123082: {'lr': 3.971505104840059e-05, 'samples': 23631744, 'steps': 123081, 'loss/train': 2.160698652267456} 11/07/2021 14:34:29 - INFO - __main__ - Step 123083: {'lr': 3.9712181114213154e-05, 'samples': 23631936, 'steps': 123082, 'loss/train': 1.0539882183074951} 11/07/2021 14:34:29 - INFO - __main__ - Step 123084: {'lr': 3.970931127477817e-05, 'samples': 23632128, 'steps': 123083, 'loss/train': 0.8644225597381592} 11/07/2021 14:34:30 - INFO - __main__ - Step 123085: {'lr': 3.970644153009695e-05, 'samples': 23632320, 'steps': 123084, 'loss/train': 1.1733285188674927} 11/07/2021 14:34:30 - INFO - __main__ - Step 123086: {'lr': 3.970357188017074e-05, 'samples': 23632512, 'steps': 123085, 'loss/train': 1.3338356018066406} 11/07/2021 14:34:31 - INFO - __main__ - Step 123087: {'lr': 3.970070232500084e-05, 'samples': 23632704, 'steps': 123086, 'loss/train': 1.1687889099121094} 11/07/2021 14:34:32 - INFO - __main__ - Step 123088: {'lr': 3.9697832864588585e-05, 'samples': 23632896, 'steps': 123087, 'loss/train': 1.309122920036316} 11/07/2021 14:34:32 - INFO - __main__ - Step 123089: {'lr': 3.969496349893523e-05, 'samples': 23633088, 'steps': 123088, 'loss/train': 0.5224377512931824} 11/07/2021 14:34:32 - INFO - __main__ - Step 123090: {'lr': 3.9692094228042095e-05, 'samples': 23633280, 'steps': 123089, 'loss/train': 1.4675021171569824} 11/07/2021 14:34:33 - INFO - __main__ - Step 123091: {'lr': 3.968922505191044e-05, 'samples': 23633472, 'steps': 123090, 'loss/train': 1.2300480604171753} 11/07/2021 14:34:34 - INFO - __main__ - Step 123092: {'lr': 3.9686355970541624e-05, 'samples': 23633664, 'steps': 123091, 'loss/train': 1.460445523262024} 11/07/2021 14:34:34 - INFO - __main__ - Step 123093: {'lr': 3.9683486983936865e-05, 'samples': 23633856, 'steps': 123092, 'loss/train': 1.0536839962005615} 11/07/2021 14:34:34 - INFO - __main__ - Step 123094: {'lr': 3.968061809209744e-05, 'samples': 23634048, 'steps': 123093, 'loss/train': 0.590620219707489} 11/07/2021 14:34:35 - INFO - __main__ - Step 123095: {'lr': 3.9677749295024715e-05, 'samples': 23634240, 'steps': 123094, 'loss/train': 1.4546926021575928} 11/07/2021 14:34:35 - INFO - __main__ - Step 123096: {'lr': 3.967488059271995e-05, 'samples': 23634432, 'steps': 123095, 'loss/train': 1.1712368726730347} 11/07/2021 14:34:36 - INFO - __main__ - Step 123097: {'lr': 3.967201198518441e-05, 'samples': 23634624, 'steps': 123096, 'loss/train': 1.206701636314392} 11/07/2021 14:34:37 - INFO - __main__ - Step 123098: {'lr': 3.9669143472419454e-05, 'samples': 23634816, 'steps': 123097, 'loss/train': 1.3721002340316772} 11/07/2021 14:34:37 - INFO - __main__ - Step 123099: {'lr': 3.966627505442633e-05, 'samples': 23635008, 'steps': 123098, 'loss/train': 1.5838531255722046} 11/07/2021 14:34:37 - INFO - __main__ - Step 123100: {'lr': 3.96634067312063e-05, 'samples': 23635200, 'steps': 123099, 'loss/train': 1.1834583282470703} 11/07/2021 14:34:38 - INFO - __main__ - Step 123101: {'lr': 3.966053850276072e-05, 'samples': 23635392, 'steps': 123100, 'loss/train': 1.5493749380111694} 11/07/2021 14:34:39 - INFO - __main__ - Step 123102: {'lr': 3.965767036909085e-05, 'samples': 23635584, 'steps': 123101, 'loss/train': 0.9800310730934143} 11/07/2021 14:34:39 - INFO - __main__ - Step 123103: {'lr': 3.9654802330198e-05, 'samples': 23635776, 'steps': 123102, 'loss/train': 1.1704058647155762} 11/07/2021 14:34:39 - INFO - __main__ - Step 123104: {'lr': 3.9651934386083444e-05, 'samples': 23635968, 'steps': 123103, 'loss/train': 0.7725042104721069} 11/07/2021 14:34:40 - INFO - __main__ - Step 123105: {'lr': 3.964906653674854e-05, 'samples': 23636160, 'steps': 123104, 'loss/train': 1.010382056236267} 11/07/2021 14:34:40 - INFO - __main__ - Step 123106: {'lr': 3.964619878219444e-05, 'samples': 23636352, 'steps': 123105, 'loss/train': 1.367695689201355} 11/07/2021 14:34:41 - INFO - __main__ - Step 123107: {'lr': 3.964333112242255e-05, 'samples': 23636544, 'steps': 123106, 'loss/train': 1.5030514001846313} 11/07/2021 14:34:41 - INFO - __main__ - Step 123108: {'lr': 3.964046355743412e-05, 'samples': 23636736, 'steps': 123107, 'loss/train': 1.5181899070739746} 11/07/2021 14:34:42 - INFO - __main__ - Step 123109: {'lr': 3.9637596087230445e-05, 'samples': 23636928, 'steps': 123108, 'loss/train': 0.7567373514175415} 11/07/2021 14:34:42 - INFO - __main__ - Step 123110: {'lr': 3.963472871181281e-05, 'samples': 23637120, 'steps': 123109, 'loss/train': 0.38999027013778687} 11/07/2021 14:34:42 - INFO - __main__ - Step 123111: {'lr': 3.963186143118255e-05, 'samples': 23637312, 'steps': 123110, 'loss/train': 0.44016751646995544} 11/07/2021 14:34:43 - INFO - __main__ - Step 123112: {'lr': 3.962899424534092e-05, 'samples': 23637504, 'steps': 123111, 'loss/train': 1.535292625427246} 11/07/2021 14:34:44 - INFO - __main__ - Step 123113: {'lr': 3.96261271542892e-05, 'samples': 23637696, 'steps': 123112, 'loss/train': 1.6558771133422852} 11/07/2021 14:34:44 - INFO - __main__ - Step 123114: {'lr': 3.9623260158028695e-05, 'samples': 23637888, 'steps': 123113, 'loss/train': 1.4292044639587402} 11/07/2021 14:34:45 - INFO - __main__ - Step 123115: {'lr': 3.962039325656072e-05, 'samples': 23638080, 'steps': 123114, 'loss/train': 1.3850046396255493} 11/07/2021 14:34:45 - INFO - __main__ - Step 123116: {'lr': 3.961752644988656e-05, 'samples': 23638272, 'steps': 123115, 'loss/train': 1.6096240282058716} 11/07/2021 14:34:46 - INFO - __main__ - Step 123117: {'lr': 3.961465973800749e-05, 'samples': 23638464, 'steps': 123116, 'loss/train': 1.1139219999313354} 11/07/2021 14:34:46 - INFO - __main__ - Step 123118: {'lr': 3.96117931209248e-05, 'samples': 23638656, 'steps': 123117, 'loss/train': 0.9657848477363586} 11/07/2021 14:34:47 - INFO - __main__ - Step 123119: {'lr': 3.960892659863985e-05, 'samples': 23638848, 'steps': 123118, 'loss/train': 1.5313297510147095} 11/07/2021 14:34:47 - INFO - __main__ - Step 123120: {'lr': 3.960606017115381e-05, 'samples': 23639040, 'steps': 123119, 'loss/train': 1.5604369640350342} 11/07/2021 14:34:47 - INFO - __main__ - Step 123121: {'lr': 3.960319383846803e-05, 'samples': 23639232, 'steps': 123120, 'loss/train': 0.6557202935218811} 11/07/2021 14:34:48 - INFO - __main__ - Step 123122: {'lr': 3.960032760058383e-05, 'samples': 23639424, 'steps': 123121, 'loss/train': 0.8602159023284912} 11/07/2021 14:34:49 - INFO - __main__ - Step 123123: {'lr': 3.9597461457502425e-05, 'samples': 23639616, 'steps': 123122, 'loss/train': 1.7208294868469238} 11/07/2021 14:34:49 - INFO - __main__ - Step 123124: {'lr': 3.959459540922519e-05, 'samples': 23639808, 'steps': 123123, 'loss/train': 1.0892302989959717} 11/07/2021 14:34:50 - INFO - __main__ - Step 123125: {'lr': 3.95917294557534e-05, 'samples': 23640000, 'steps': 123124, 'loss/train': 1.2032537460327148} 11/07/2021 14:34:50 - INFO - __main__ - Step 123126: {'lr': 3.95888635970883e-05, 'samples': 23640192, 'steps': 123125, 'loss/train': 0.08073261380195618} 11/07/2021 14:34:50 - INFO - __main__ - Step 123127: {'lr': 3.958599783323122e-05, 'samples': 23640384, 'steps': 123126, 'loss/train': 1.0613020658493042} 11/07/2021 14:34:51 - INFO - __main__ - Step 123128: {'lr': 3.958313216418344e-05, 'samples': 23640576, 'steps': 123127, 'loss/train': 0.8236377239227295} 11/07/2021 14:34:52 - INFO - __main__ - Step 123129: {'lr': 3.958026658994626e-05, 'samples': 23640768, 'steps': 123128, 'loss/train': 0.7147482633590698} 11/07/2021 14:34:52 - INFO - __main__ - Step 123130: {'lr': 3.957740111052097e-05, 'samples': 23640960, 'steps': 123129, 'loss/train': 1.7307968139648438} 11/07/2021 14:34:52 - INFO - __main__ - Step 123131: {'lr': 3.9574535725908856e-05, 'samples': 23641152, 'steps': 123130, 'loss/train': 1.2425507307052612} 11/07/2021 14:34:53 - INFO - __main__ - Step 123132: {'lr': 3.957167043611126e-05, 'samples': 23641344, 'steps': 123131, 'loss/train': 0.8987420201301575} 11/07/2021 14:34:54 - INFO - __main__ - Step 123133: {'lr': 3.9568805241129355e-05, 'samples': 23641536, 'steps': 123132, 'loss/train': 1.3607192039489746} 11/07/2021 14:34:54 - INFO - __main__ - Step 123134: {'lr': 3.9565940140964513e-05, 'samples': 23641728, 'steps': 123133, 'loss/train': 1.5468791723251343} 11/07/2021 14:34:54 - INFO - __main__ - Step 123135: {'lr': 3.9563075135618024e-05, 'samples': 23641920, 'steps': 123134, 'loss/train': 1.3612333536148071} 11/07/2021 14:34:55 - INFO - __main__ - Step 123136: {'lr': 3.956021022509113e-05, 'samples': 23642112, 'steps': 123135, 'loss/train': 1.4398586750030518} 11/07/2021 14:34:55 - INFO - __main__ - Step 123137: {'lr': 3.9557345409385184e-05, 'samples': 23642304, 'steps': 123136, 'loss/train': 1.2866499423980713} 11/07/2021 14:34:56 - INFO - __main__ - Step 123138: {'lr': 3.955448068850143e-05, 'samples': 23642496, 'steps': 123137, 'loss/train': 0.8893740773200989} 11/07/2021 14:34:57 - INFO - __main__ - Step 123139: {'lr': 3.95516160624412e-05, 'samples': 23642688, 'steps': 123138, 'loss/train': 1.8765321969985962} 11/07/2021 14:34:57 - INFO - __main__ - Step 123140: {'lr': 3.9548751531205765e-05, 'samples': 23642880, 'steps': 123139, 'loss/train': 1.0916329622268677} 11/07/2021 14:34:57 - INFO - __main__ - Step 123141: {'lr': 3.95458870947964e-05, 'samples': 23643072, 'steps': 123140, 'loss/train': 1.459279179573059} 11/07/2021 14:34:58 - INFO - __main__ - Step 123142: {'lr': 3.954302275321442e-05, 'samples': 23643264, 'steps': 123141, 'loss/train': 1.3565961122512817} 11/07/2021 14:34:59 - INFO - __main__ - Step 123143: {'lr': 3.9540158506461114e-05, 'samples': 23643456, 'steps': 123142, 'loss/train': 1.2756564617156982} 11/07/2021 14:34:59 - INFO - __main__ - Step 123144: {'lr': 3.9537294354537765e-05, 'samples': 23643648, 'steps': 123143, 'loss/train': 1.145723581314087} 11/07/2021 14:34:59 - INFO - __main__ - Step 123145: {'lr': 3.9534430297445636e-05, 'samples': 23643840, 'steps': 123144, 'loss/train': 1.297315001487732} 11/07/2021 14:35:00 - INFO - __main__ - Step 123146: {'lr': 3.953156633518612e-05, 'samples': 23644032, 'steps': 123145, 'loss/train': 1.7878377437591553} 11/07/2021 14:35:00 - INFO - __main__ - Step 123147: {'lr': 3.9528702467760384e-05, 'samples': 23644224, 'steps': 123146, 'loss/train': 1.064159870147705} 11/07/2021 14:35:01 - INFO - __main__ - Step 123148: {'lr': 3.952583869516976e-05, 'samples': 23644416, 'steps': 123147, 'loss/train': 1.3512705564498901} 11/07/2021 14:35:01 - INFO - __main__ - Step 123149: {'lr': 3.952297501741553e-05, 'samples': 23644608, 'steps': 123148, 'loss/train': 1.60977303981781} 11/07/2021 14:35:02 - INFO - __main__ - Step 123150: {'lr': 3.952011143449902e-05, 'samples': 23644800, 'steps': 123149, 'loss/train': 1.0976107120513916} 11/07/2021 14:35:02 - INFO - __main__ - Step 123151: {'lr': 3.95172479464215e-05, 'samples': 23644992, 'steps': 123150, 'loss/train': 1.2731047868728638} 11/07/2021 14:35:02 - INFO - __main__ - Step 123152: {'lr': 3.951438455318426e-05, 'samples': 23645184, 'steps': 123151, 'loss/train': 1.6230015754699707} 11/07/2021 14:35:03 - INFO - __main__ - Step 123153: {'lr': 3.9511521254788575e-05, 'samples': 23645376, 'steps': 123152, 'loss/train': 1.4550725221633911} 11/07/2021 14:35:04 - INFO - __main__ - Step 123154: {'lr': 3.950865805123577e-05, 'samples': 23645568, 'steps': 123153, 'loss/train': 1.1750035285949707} 11/07/2021 14:35:04 - INFO - __main__ - Step 123155: {'lr': 3.95057949425271e-05, 'samples': 23645760, 'steps': 123154, 'loss/train': 1.571876883506775} 11/07/2021 14:35:05 - INFO - __main__ - Step 123156: {'lr': 3.9502931928663886e-05, 'samples': 23645952, 'steps': 123155, 'loss/train': 1.231302261352539} 11/07/2021 14:35:05 - INFO - __main__ - Step 123157: {'lr': 3.9500069009647394e-05, 'samples': 23646144, 'steps': 123156, 'loss/train': 1.5457277297973633} 11/07/2021 14:35:05 - INFO - __main__ - Step 123158: {'lr': 3.949720618547892e-05, 'samples': 23646336, 'steps': 123157, 'loss/train': 1.347414255142212} 11/07/2021 14:35:06 - INFO - __main__ - Step 123159: {'lr': 3.949434345615982e-05, 'samples': 23646528, 'steps': 123158, 'loss/train': 1.2433648109436035} 11/07/2021 14:35:07 - INFO - __main__ - Step 123160: {'lr': 3.949148082169124e-05, 'samples': 23646720, 'steps': 123159, 'loss/train': 1.542725920677185} 11/07/2021 14:35:07 - INFO - __main__ - Step 123161: {'lr': 3.9488618282074564e-05, 'samples': 23646912, 'steps': 123160, 'loss/train': 1.1953761577606201} 11/07/2021 14:35:07 - INFO - __main__ - Step 123162: {'lr': 3.9485755837311094e-05, 'samples': 23647104, 'steps': 123161, 'loss/train': 1.5051023960113525} 11/07/2021 14:35:08 - INFO - __main__ - Step 123163: {'lr': 3.948289348740205e-05, 'samples': 23647296, 'steps': 123162, 'loss/train': 1.2575857639312744} 11/07/2021 14:35:09 - INFO - __main__ - Step 123164: {'lr': 3.948003123234881e-05, 'samples': 23647488, 'steps': 123163, 'loss/train': 1.1102471351623535} 11/07/2021 14:35:09 - INFO - __main__ - Step 123165: {'lr': 3.9477169072152595e-05, 'samples': 23647680, 'steps': 123164, 'loss/train': 0.6327109932899475} 11/07/2021 14:35:10 - INFO - __main__ - Step 123166: {'lr': 3.947430700681473e-05, 'samples': 23647872, 'steps': 123165, 'loss/train': 1.2645171880722046} 11/07/2021 14:35:10 - INFO - __main__ - Step 123167: {'lr': 3.9471445036336486e-05, 'samples': 23648064, 'steps': 123166, 'loss/train': 0.9996726512908936} 11/07/2021 14:35:10 - INFO - __main__ - Step 123168: {'lr': 3.946858316071913e-05, 'samples': 23648256, 'steps': 123167, 'loss/train': 1.1212241649627686} 11/07/2021 14:35:11 - INFO - __main__ - Step 123169: {'lr': 3.946572137996404e-05, 'samples': 23648448, 'steps': 123168, 'loss/train': 1.427019715309143} 11/07/2021 14:35:12 - INFO - __main__ - Step 123170: {'lr': 3.94628596940724e-05, 'samples': 23648640, 'steps': 123169, 'loss/train': 1.0350184440612793} 11/07/2021 14:35:12 - INFO - __main__ - Step 123171: {'lr': 3.945999810304557e-05, 'samples': 23648832, 'steps': 123170, 'loss/train': 1.0470176935195923} 11/07/2021 14:35:12 - INFO - __main__ - Step 123172: {'lr': 3.945713660688488e-05, 'samples': 23649024, 'steps': 123171, 'loss/train': 1.3498210906982422} 11/07/2021 14:35:13 - INFO - __main__ - Step 123173: {'lr': 3.9454275205591475e-05, 'samples': 23649216, 'steps': 123172, 'loss/train': 1.5570660829544067} 11/07/2021 14:35:14 - INFO - __main__ - Step 123174: {'lr': 3.945141389916676e-05, 'samples': 23649408, 'steps': 123173, 'loss/train': 1.005175232887268} 11/07/2021 14:35:14 - INFO - __main__ - Step 123175: {'lr': 3.9448552687611964e-05, 'samples': 23649600, 'steps': 123174, 'loss/train': 1.01223623752594} 11/07/2021 14:35:15 - INFO - __main__ - Step 123176: {'lr': 3.944569157092839e-05, 'samples': 23649792, 'steps': 123175, 'loss/train': 1.2669353485107422} 11/07/2021 14:35:15 - INFO - __main__ - Step 123177: {'lr': 3.944283054911735e-05, 'samples': 23649984, 'steps': 123176, 'loss/train': 1.3980014324188232} 11/07/2021 14:35:15 - INFO - __main__ - Step 123178: {'lr': 3.9439969622180134e-05, 'samples': 23650176, 'steps': 123177, 'loss/train': 1.5944269895553589} 11/07/2021 14:35:16 - INFO - __main__ - Step 123179: {'lr': 3.9437108790117996e-05, 'samples': 23650368, 'steps': 123178, 'loss/train': 1.1335842609405518} 11/07/2021 14:35:17 - INFO - __main__ - Step 123180: {'lr': 3.9434248052932275e-05, 'samples': 23650560, 'steps': 123179, 'loss/train': 1.105050802230835} 11/07/2021 14:35:17 - INFO - __main__ - Step 123181: {'lr': 3.943138741062421e-05, 'samples': 23650752, 'steps': 123180, 'loss/train': 1.3572887182235718} 11/07/2021 14:35:17 - INFO - __main__ - Step 123182: {'lr': 3.942852686319512e-05, 'samples': 23650944, 'steps': 123181, 'loss/train': 0.7152692675590515} 11/07/2021 14:35:18 - INFO - __main__ - Step 123183: {'lr': 3.942566641064627e-05, 'samples': 23651136, 'steps': 123182, 'loss/train': 1.3151684999465942} 11/07/2021 14:35:19 - INFO - __main__ - Step 123184: {'lr': 3.9422806052978985e-05, 'samples': 23651328, 'steps': 123183, 'loss/train': 0.5747361779212952} 11/07/2021 14:35:19 - INFO - __main__ - Step 123185: {'lr': 3.941994579019453e-05, 'samples': 23651520, 'steps': 123184, 'loss/train': 0.45456817746162415} 11/07/2021 14:35:19 - INFO - __main__ - Step 123186: {'lr': 3.941708562229426e-05, 'samples': 23651712, 'steps': 123185, 'loss/train': 1.1754674911499023} 11/07/2021 14:35:20 - INFO - __main__ - Step 123187: {'lr': 3.941422554927934e-05, 'samples': 23651904, 'steps': 123186, 'loss/train': 0.9452953338623047} 11/07/2021 14:35:20 - INFO - __main__ - Step 123188: {'lr': 3.941136557115113e-05, 'samples': 23652096, 'steps': 123187, 'loss/train': 2.118818998336792} 11/07/2021 14:35:22 - INFO - __main__ - Step 123189: {'lr': 3.940850568791088e-05, 'samples': 23652288, 'steps': 123188, 'loss/train': 1.1002262830734253} 11/07/2021 14:35:22 - INFO - __main__ - Step 123190: {'lr': 3.9405645899559945e-05, 'samples': 23652480, 'steps': 123189, 'loss/train': 1.1410901546478271} 11/07/2021 14:35:22 - INFO - __main__ - Step 123191: {'lr': 3.940278620609955e-05, 'samples': 23652672, 'steps': 123190, 'loss/train': 1.3907852172851562} 11/07/2021 14:35:23 - INFO - __main__ - Step 123192: {'lr': 3.939992660753103e-05, 'samples': 23652864, 'steps': 123191, 'loss/train': 0.37488776445388794} 11/07/2021 14:35:23 - INFO - __main__ - Step 123193: {'lr': 3.939706710385563e-05, 'samples': 23653056, 'steps': 123192, 'loss/train': 1.6764130592346191} 11/07/2021 14:35:24 - INFO - __main__ - Step 123194: {'lr': 3.939420769507468e-05, 'samples': 23653248, 'steps': 123193, 'loss/train': 1.0190699100494385} 11/07/2021 14:35:24 - INFO - __main__ - Step 123195: {'lr': 3.939134838118943e-05, 'samples': 23653440, 'steps': 123194, 'loss/train': 0.9693483114242554} 11/07/2021 14:35:25 - INFO - __main__ - Step 123196: {'lr': 3.9388489162201226e-05, 'samples': 23653632, 'steps': 123195, 'loss/train': 1.3792685270309448} 11/07/2021 14:35:25 - INFO - __main__ - Step 123197: {'lr': 3.938563003811127e-05, 'samples': 23653824, 'steps': 123196, 'loss/train': 1.3164088726043701} 11/07/2021 14:35:25 - INFO - __main__ - Step 123198: {'lr': 3.938277100892093e-05, 'samples': 23654016, 'steps': 123197, 'loss/train': 1.0633600950241089} 11/07/2021 14:35:27 - INFO - __main__ - Step 123199: {'lr': 3.9379912074631516e-05, 'samples': 23654208, 'steps': 123198, 'loss/train': 1.5771921873092651} 11/07/2021 14:35:27 - INFO - __main__ - Step 123200: {'lr': 3.937705323524421e-05, 'samples': 23654400, 'steps': 123199, 'loss/train': 0.9507240056991577} 11/07/2021 14:35:27 - INFO - __main__ - Step 123201: {'lr': 3.937419449076035e-05, 'samples': 23654592, 'steps': 123200, 'loss/train': 1.0742981433868408} 11/07/2021 14:35:28 - INFO - __main__ - Step 123202: {'lr': 3.937133584118122e-05, 'samples': 23654784, 'steps': 123201, 'loss/train': 0.9261986017227173} 11/07/2021 14:35:28 - INFO - __main__ - Step 123203: {'lr': 3.9368477286508133e-05, 'samples': 23654976, 'steps': 123202, 'loss/train': 1.7745919227600098} 11/07/2021 14:35:29 - INFO - __main__ - Step 123204: {'lr': 3.936561882674233e-05, 'samples': 23655168, 'steps': 123203, 'loss/train': 1.2530790567398071} 11/07/2021 14:35:29 - INFO - __main__ - Step 123205: {'lr': 3.936276046188517e-05, 'samples': 23655360, 'steps': 123204, 'loss/train': 1.2747594118118286} 11/07/2021 14:35:30 - INFO - __main__ - Step 123206: {'lr': 3.935990219193786e-05, 'samples': 23655552, 'steps': 123205, 'loss/train': 1.241982340812683} 11/07/2021 14:35:30 - INFO - __main__ - Step 123207: {'lr': 3.9357044016901766e-05, 'samples': 23655744, 'steps': 123206, 'loss/train': 0.8591379523277283} 11/07/2021 14:35:30 - INFO - __main__ - Step 123208: {'lr': 3.9354185936778117e-05, 'samples': 23655936, 'steps': 123207, 'loss/train': 1.2217991352081299} 11/07/2021 14:35:31 - INFO - __main__ - Step 123209: {'lr': 3.935132795156821e-05, 'samples': 23656128, 'steps': 123208, 'loss/train': 1.1890560388565063} 11/07/2021 14:35:32 - INFO - __main__ - Step 123210: {'lr': 3.934847006127334e-05, 'samples': 23656320, 'steps': 123209, 'loss/train': 0.9547662138938904} 11/07/2021 14:35:32 - INFO - __main__ - Step 123211: {'lr': 3.9345612265894834e-05, 'samples': 23656512, 'steps': 123210, 'loss/train': 1.3358451128005981} 11/07/2021 14:35:33 - INFO - __main__ - Step 123212: {'lr': 3.934275456543393e-05, 'samples': 23656704, 'steps': 123211, 'loss/train': 1.5399587154388428} 11/07/2021 14:35:33 - INFO - __main__ - Step 123213: {'lr': 3.9339896959891985e-05, 'samples': 23656896, 'steps': 123212, 'loss/train': 1.0236048698425293} 11/07/2021 14:35:33 - INFO - __main__ - Step 123214: {'lr': 3.9337039449270164e-05, 'samples': 23657088, 'steps': 123213, 'loss/train': 1.2024104595184326} 11/07/2021 14:35:34 - INFO - __main__ - Step 123215: {'lr': 3.933418203356984e-05, 'samples': 23657280, 'steps': 123214, 'loss/train': 1.1756162643432617} 11/07/2021 14:35:35 - INFO - __main__ - Step 123216: {'lr': 3.933132471279227e-05, 'samples': 23657472, 'steps': 123215, 'loss/train': 1.2707408666610718} 11/07/2021 14:35:35 - INFO - __main__ - Step 123217: {'lr': 3.932846748693875e-05, 'samples': 23657664, 'steps': 123216, 'loss/train': 1.136185884475708} 11/07/2021 14:35:35 - INFO - __main__ - Step 123218: {'lr': 3.9325610356010566e-05, 'samples': 23657856, 'steps': 123217, 'loss/train': 1.2925047874450684} 11/07/2021 14:35:36 - INFO - __main__ - Step 123219: {'lr': 3.9322753320009034e-05, 'samples': 23658048, 'steps': 123218, 'loss/train': 1.4023585319519043} 11/07/2021 14:35:37 - INFO - __main__ - Step 123220: {'lr': 3.931989637893541e-05, 'samples': 23658240, 'steps': 123219, 'loss/train': 1.2804001569747925} 11/07/2021 14:35:37 - INFO - __main__ - Step 123221: {'lr': 3.931703953279098e-05, 'samples': 23658432, 'steps': 123220, 'loss/train': 0.8054465651512146} 11/07/2021 14:35:38 - INFO - __main__ - Step 123222: {'lr': 3.9314182781577054e-05, 'samples': 23658624, 'steps': 123221, 'loss/train': 1.0800060033798218} 11/07/2021 14:35:38 - INFO - __main__ - Step 123223: {'lr': 3.931132612529489e-05, 'samples': 23658816, 'steps': 123222, 'loss/train': 1.140679121017456} 11/07/2021 14:35:38 - INFO - __main__ - Step 123224: {'lr': 3.930846956394582e-05, 'samples': 23659008, 'steps': 123223, 'loss/train': 1.1531068086624146} 11/07/2021 14:35:39 - INFO - __main__ - Step 123225: {'lr': 3.930561309753109e-05, 'samples': 23659200, 'steps': 123224, 'loss/train': 1.2927199602127075} 11/07/2021 14:35:40 - INFO - __main__ - Step 123226: {'lr': 3.930275672605205e-05, 'samples': 23659392, 'steps': 123225, 'loss/train': 1.2814000844955444} 11/07/2021 14:35:40 - INFO - __main__ - Step 123227: {'lr': 3.9299900449509875e-05, 'samples': 23659584, 'steps': 123226, 'loss/train': 1.2898861169815063} 11/07/2021 14:35:40 - INFO - __main__ - Step 123228: {'lr': 3.9297044267905924e-05, 'samples': 23659776, 'steps': 123227, 'loss/train': 0.7237874865531921} 11/07/2021 14:35:41 - INFO - __main__ - Step 123229: {'lr': 3.929418818124148e-05, 'samples': 23659968, 'steps': 123228, 'loss/train': 0.839734673500061} 11/07/2021 14:35:42 - INFO - __main__ - Step 123230: {'lr': 3.92913321895178e-05, 'samples': 23660160, 'steps': 123229, 'loss/train': 1.2606786489486694} 11/07/2021 14:35:42 - INFO - __main__ - Step 123231: {'lr': 3.9288476292736216e-05, 'samples': 23660352, 'steps': 123230, 'loss/train': 1.2651081085205078} 11/07/2021 14:35:43 - INFO - __main__ - Step 123232: {'lr': 3.928562049089798e-05, 'samples': 23660544, 'steps': 123231, 'loss/train': 2.191171169281006} 11/07/2021 14:35:43 - INFO - __main__ - Step 123233: {'lr': 3.928276478400439e-05, 'samples': 23660736, 'steps': 123232, 'loss/train': 0.7556306719779968} 11/07/2021 14:35:43 - INFO - __main__ - Step 123234: {'lr': 3.927990917205673e-05, 'samples': 23660928, 'steps': 123233, 'loss/train': 1.2707664966583252} 11/07/2021 14:35:44 - INFO - __main__ - Step 123235: {'lr': 3.927705365505632e-05, 'samples': 23661120, 'steps': 123234, 'loss/train': 0.8864647746086121} 11/07/2021 14:35:45 - INFO - __main__ - Step 123236: {'lr': 3.9274198233004376e-05, 'samples': 23661312, 'steps': 123235, 'loss/train': 1.4714430570602417} 11/07/2021 14:35:45 - INFO - __main__ - Step 123237: {'lr': 3.927134290590226e-05, 'samples': 23661504, 'steps': 123236, 'loss/train': 0.8500187397003174} 11/07/2021 14:35:45 - INFO - __main__ - Step 123238: {'lr': 3.926848767375121e-05, 'samples': 23661696, 'steps': 123237, 'loss/train': 0.05207175016403198} 11/07/2021 14:35:46 - INFO - __main__ - Step 123239: {'lr': 3.9265632536552546e-05, 'samples': 23661888, 'steps': 123238, 'loss/train': 0.978468120098114} 11/07/2021 14:35:47 - INFO - __main__ - Step 123240: {'lr': 3.926277749430757e-05, 'samples': 23662080, 'steps': 123239, 'loss/train': 1.064685583114624} 11/07/2021 14:35:47 - INFO - __main__ - Step 123241: {'lr': 3.925992254701749e-05, 'samples': 23662272, 'steps': 123240, 'loss/train': 1.2090106010437012} 11/07/2021 14:35:48 - INFO - __main__ - Step 123242: {'lr': 3.9257067694683654e-05, 'samples': 23662464, 'steps': 123241, 'loss/train': 1.1529284715652466} 11/07/2021 14:35:48 - INFO - __main__ - Step 123243: {'lr': 3.92542129373073e-05, 'samples': 23662656, 'steps': 123242, 'loss/train': 1.2590380907058716} 11/07/2021 14:35:48 - INFO - __main__ - Step 123244: {'lr': 3.9251358274889764e-05, 'samples': 23662848, 'steps': 123243, 'loss/train': 1.2733672857284546} 11/07/2021 14:35:49 - INFO - __main__ - Step 123245: {'lr': 3.92485037074323e-05, 'samples': 23663040, 'steps': 123244, 'loss/train': 0.9905918836593628} 11/07/2021 14:35:50 - INFO - __main__ - Step 123246: {'lr': 3.924564923493623e-05, 'samples': 23663232, 'steps': 123245, 'loss/train': 1.2382843494415283} 11/07/2021 14:35:50 - INFO - __main__ - Step 123247: {'lr': 3.9242794857402785e-05, 'samples': 23663424, 'steps': 123246, 'loss/train': 0.09547867625951767} 11/07/2021 14:35:50 - INFO - __main__ - Step 123248: {'lr': 3.923994057483332e-05, 'samples': 23663616, 'steps': 123247, 'loss/train': 1.470150351524353} 11/07/2021 14:35:51 - INFO - __main__ - Step 123249: {'lr': 3.923708638722906e-05, 'samples': 23663808, 'steps': 123248, 'loss/train': 0.984870195388794} 11/07/2021 14:35:52 - INFO - __main__ - Step 123250: {'lr': 3.923423229459133e-05, 'samples': 23664000, 'steps': 123249, 'loss/train': 1.095247745513916} 11/07/2021 14:35:52 - INFO - __main__ - Step 123251: {'lr': 3.923137829692139e-05, 'samples': 23664192, 'steps': 123250, 'loss/train': 1.3563340902328491} 11/07/2021 14:35:53 - INFO - __main__ - Step 123252: {'lr': 3.922852439422056e-05, 'samples': 23664384, 'steps': 123251, 'loss/train': 1.5193893909454346} 11/07/2021 14:35:53 - INFO - __main__ - Step 123253: {'lr': 3.922567058649015e-05, 'samples': 23664576, 'steps': 123252, 'loss/train': 1.4605228900909424} 11/07/2021 14:35:53 - INFO - __main__ - Step 123254: {'lr': 3.9222816873731335e-05, 'samples': 23664768, 'steps': 123253, 'loss/train': 0.5036138892173767} 11/07/2021 14:35:54 - INFO - __main__ - Step 123255: {'lr': 3.9219963255945485e-05, 'samples': 23664960, 'steps': 123254, 'loss/train': 1.5631450414657593} 11/07/2021 14:35:55 - INFO - __main__ - Step 123256: {'lr': 3.9217109733133835e-05, 'samples': 23665152, 'steps': 123255, 'loss/train': 1.1508784294128418} 11/07/2021 14:35:55 - INFO - __main__ - Step 123257: {'lr': 3.921425630529773e-05, 'samples': 23665344, 'steps': 123256, 'loss/train': 1.2754698991775513} 11/07/2021 14:35:55 - INFO - __main__ - Step 123258: {'lr': 3.921140297243841e-05, 'samples': 23665536, 'steps': 123257, 'loss/train': 1.2935853004455566} 11/07/2021 14:35:56 - INFO - __main__ - Step 123259: {'lr': 3.92085497345572e-05, 'samples': 23665728, 'steps': 123258, 'loss/train': 1.4414618015289307} 11/07/2021 14:35:57 - INFO - __main__ - Step 123260: {'lr': 3.920569659165535e-05, 'samples': 23665920, 'steps': 123259, 'loss/train': 0.4727078080177307} 11/07/2021 14:35:57 - INFO - __main__ - Step 123261: {'lr': 3.920284354373419e-05, 'samples': 23666112, 'steps': 123260, 'loss/train': 0.7235383987426758} 11/07/2021 14:35:57 - INFO - __main__ - Step 123262: {'lr': 3.9199990590794934e-05, 'samples': 23666304, 'steps': 123261, 'loss/train': 1.1428611278533936} 11/07/2021 14:35:58 - INFO - __main__ - Step 123263: {'lr': 3.9197137732838926e-05, 'samples': 23666496, 'steps': 123262, 'loss/train': 0.641939103603363} 11/07/2021 14:35:58 - INFO - __main__ - Step 123264: {'lr': 3.919428496986743e-05, 'samples': 23666688, 'steps': 123263, 'loss/train': 1.2369593381881714} 11/07/2021 14:35:58 - INFO - __main__ - Step 123265: {'lr': 3.919143230188174e-05, 'samples': 23666880, 'steps': 123264, 'loss/train': 0.3601485788822174} 11/07/2021 14:35:59 - INFO - __main__ - Step 123266: {'lr': 3.918857972888315e-05, 'samples': 23667072, 'steps': 123265, 'loss/train': 0.540390133857727} 11/07/2021 14:36:00 - INFO - __main__ - Step 123267: {'lr': 3.918572725087299e-05, 'samples': 23667264, 'steps': 123266, 'loss/train': 0.8938366770744324} 11/07/2021 14:36:00 - INFO - __main__ - Step 123268: {'lr': 3.918287486785241e-05, 'samples': 23667456, 'steps': 123267, 'loss/train': 1.7454049587249756} 11/07/2021 14:36:00 - INFO - __main__ - Step 123269: {'lr': 3.9180022579822785e-05, 'samples': 23667648, 'steps': 123268, 'loss/train': 1.294981837272644} 11/07/2021 14:36:01 - INFO - __main__ - Step 123270: {'lr': 3.917717038678539e-05, 'samples': 23667840, 'steps': 123269, 'loss/train': 1.0614712238311768} 11/07/2021 14:36:02 - INFO - __main__ - Step 123271: {'lr': 3.9174318288741515e-05, 'samples': 23668032, 'steps': 123270, 'loss/train': 0.4556422531604767} 11/07/2021 14:36:02 - INFO - __main__ - Step 123272: {'lr': 3.917146628569243e-05, 'samples': 23668224, 'steps': 123271, 'loss/train': 1.4746334552764893} 11/07/2021 14:36:03 - INFO - __main__ - Step 123273: {'lr': 3.916861437763941e-05, 'samples': 23668416, 'steps': 123272, 'loss/train': 1.288822889328003} 11/07/2021 14:36:03 - INFO - __main__ - Step 123274: {'lr': 3.916576256458379e-05, 'samples': 23668608, 'steps': 123273, 'loss/train': 1.3955880403518677} 11/07/2021 14:36:03 - INFO - __main__ - Step 123275: {'lr': 3.916291084652682e-05, 'samples': 23668800, 'steps': 123274, 'loss/train': 1.2951862812042236} 11/07/2021 14:36:05 - INFO - __main__ - Step 123276: {'lr': 3.916005922346977e-05, 'samples': 23668992, 'steps': 123275, 'loss/train': 0.9239875674247742} 11/07/2021 14:36:05 - INFO - __main__ - Step 123277: {'lr': 3.9157207695413946e-05, 'samples': 23669184, 'steps': 123276, 'loss/train': 1.3973978757858276} 11/07/2021 14:36:05 - INFO - __main__ - Step 123278: {'lr': 3.915435626236066e-05, 'samples': 23669376, 'steps': 123277, 'loss/train': 1.384757161140442} 11/07/2021 14:36:06 - INFO - __main__ - Step 123279: {'lr': 3.9151504924311135e-05, 'samples': 23669568, 'steps': 123278, 'loss/train': 1.4862321615219116} 11/07/2021 14:36:06 - INFO - __main__ - Step 123280: {'lr': 3.9148653681266775e-05, 'samples': 23669760, 'steps': 123279, 'loss/train': 1.1893573999404907} 11/07/2021 14:36:07 - INFO - __main__ - Step 123281: {'lr': 3.914580253322869e-05, 'samples': 23669952, 'steps': 123280, 'loss/train': 1.2029402256011963} 11/07/2021 14:36:08 - INFO - __main__ - Step 123282: {'lr': 3.914295148019828e-05, 'samples': 23670144, 'steps': 123281, 'loss/train': 1.190731406211853} 11/07/2021 14:36:08 - INFO - __main__ - Step 123283: {'lr': 3.9140100522176786e-05, 'samples': 23670336, 'steps': 123282, 'loss/train': 1.2763044834136963} 11/07/2021 14:36:08 - INFO - __main__ - Step 123284: {'lr': 3.9137249659165515e-05, 'samples': 23670528, 'steps': 123283, 'loss/train': 1.2838398218154907} 11/07/2021 14:36:09 - INFO - __main__ - Step 123285: {'lr': 3.913439889116574e-05, 'samples': 23670720, 'steps': 123284, 'loss/train': 0.9580749273300171} 11/07/2021 14:36:10 - INFO - __main__ - Step 123286: {'lr': 3.913154821817874e-05, 'samples': 23670912, 'steps': 123285, 'loss/train': 1.1704057455062866} 11/07/2021 14:36:10 - INFO - __main__ - Step 123287: {'lr': 3.912869764020583e-05, 'samples': 23671104, 'steps': 123286, 'loss/train': 1.3453469276428223} 11/07/2021 14:36:10 - INFO - __main__ - Step 123288: {'lr': 3.9125847157248264e-05, 'samples': 23671296, 'steps': 123287, 'loss/train': 1.1325299739837646} 11/07/2021 14:36:11 - INFO - __main__ - Step 123289: {'lr': 3.912299676930736e-05, 'samples': 23671488, 'steps': 123288, 'loss/train': 1.3203492164611816} 11/07/2021 14:36:11 - INFO - __main__ - Step 123290: {'lr': 3.9120146476384344e-05, 'samples': 23671680, 'steps': 123289, 'loss/train': 1.304607629776001} 11/07/2021 14:36:12 - INFO - __main__ - Step 123291: {'lr': 3.911729627848057e-05, 'samples': 23671872, 'steps': 123290, 'loss/train': 1.221840500831604} 11/07/2021 14:36:12 - INFO - __main__ - Step 123292: {'lr': 3.9114446175597254e-05, 'samples': 23672064, 'steps': 123291, 'loss/train': 0.8640409708023071} 11/07/2021 14:36:13 - INFO - __main__ - Step 123293: {'lr': 3.9111596167735796e-05, 'samples': 23672256, 'steps': 123292, 'loss/train': 1.1170241832733154} 11/07/2021 14:36:13 - INFO - __main__ - Step 123294: {'lr': 3.9108746254897353e-05, 'samples': 23672448, 'steps': 123293, 'loss/train': 1.3524452447891235} 11/07/2021 14:36:13 - INFO - __main__ - Step 123295: {'lr': 3.9105896437083234e-05, 'samples': 23672640, 'steps': 123294, 'loss/train': 1.0942637920379639} 11/07/2021 14:36:15 - INFO - __main__ - Step 123296: {'lr': 3.910304671429474e-05, 'samples': 23672832, 'steps': 123295, 'loss/train': 1.1893404722213745} 11/07/2021 14:36:15 - INFO - __main__ - Step 123297: {'lr': 3.910019708653317e-05, 'samples': 23673024, 'steps': 123296, 'loss/train': 1.4553306102752686} 11/07/2021 14:36:15 - INFO - __main__ - Step 123298: {'lr': 3.909734755379979e-05, 'samples': 23673216, 'steps': 123297, 'loss/train': 1.2715598344802856} 11/07/2021 14:36:16 - INFO - __main__ - Step 123299: {'lr': 3.909449811609589e-05, 'samples': 23673408, 'steps': 123298, 'loss/train': 1.5119370222091675} 11/07/2021 14:36:16 - INFO - __main__ - Step 123300: {'lr': 3.909164877342275e-05, 'samples': 23673600, 'steps': 123299, 'loss/train': 1.3030929565429688} 11/07/2021 14:36:17 - INFO - __main__ - Step 123301: {'lr': 3.908879952578168e-05, 'samples': 23673792, 'steps': 123300, 'loss/train': 1.2616418600082397} 11/07/2021 14:36:17 - INFO - __main__ - Step 123302: {'lr': 3.908595037317392e-05, 'samples': 23673984, 'steps': 123301, 'loss/train': 1.2529475688934326} 11/07/2021 14:36:18 - INFO - __main__ - Step 123303: {'lr': 3.908310131560078e-05, 'samples': 23674176, 'steps': 123302, 'loss/train': 1.258249044418335} 11/07/2021 14:36:18 - INFO - __main__ - Step 123304: {'lr': 3.908025235306356e-05, 'samples': 23674368, 'steps': 123303, 'loss/train': 1.4891451597213745} 11/07/2021 14:36:18 - INFO - __main__ - Step 123305: {'lr': 3.9077403485563576e-05, 'samples': 23674560, 'steps': 123304, 'loss/train': 1.62765634059906} 11/07/2021 14:36:20 - INFO - __main__ - Step 123306: {'lr': 3.9074554713101976e-05, 'samples': 23674752, 'steps': 123305, 'loss/train': 1.085892915725708} 11/07/2021 14:36:20 - INFO - __main__ - Step 123307: {'lr': 3.9071706035680135e-05, 'samples': 23674944, 'steps': 123306, 'loss/train': 1.0812863111495972} 11/07/2021 14:36:21 - INFO - __main__ - Step 123308: {'lr': 3.9068857453299355e-05, 'samples': 23675136, 'steps': 123307, 'loss/train': 4.011615753173828} 11/07/2021 14:36:21 - INFO - __main__ - Step 123309: {'lr': 3.9066008965960856e-05, 'samples': 23675328, 'steps': 123308, 'loss/train': 1.439925193786621} 11/07/2021 14:36:21 - INFO - __main__ - Step 123310: {'lr': 3.906316057366599e-05, 'samples': 23675520, 'steps': 123309, 'loss/train': 1.7103898525238037} 11/07/2021 14:36:22 - INFO - __main__ - Step 123311: {'lr': 3.906031227641599e-05, 'samples': 23675712, 'steps': 123310, 'loss/train': 1.3382619619369507} 11/07/2021 14:36:22 - INFO - __main__ - Step 123312: {'lr': 3.905746407421215e-05, 'samples': 23675904, 'steps': 123311, 'loss/train': 1.471448540687561} 11/07/2021 14:36:23 - INFO - __main__ - Step 123313: {'lr': 3.9054615967055786e-05, 'samples': 23676096, 'steps': 123312, 'loss/train': 1.458120346069336} 11/07/2021 14:36:24 - INFO - __main__ - Step 123314: {'lr': 3.905176795494814e-05, 'samples': 23676288, 'steps': 123313, 'loss/train': 0.8318496346473694} 11/07/2021 14:36:24 - INFO - __main__ - Step 123315: {'lr': 3.9048920037890514e-05, 'samples': 23676480, 'steps': 123314, 'loss/train': 1.492676854133606} 11/07/2021 14:36:24 - INFO - __main__ - Step 123316: {'lr': 3.904607221588427e-05, 'samples': 23676672, 'steps': 123315, 'loss/train': 0.6033188700675964} 11/07/2021 14:36:25 - INFO - __main__ - Step 123317: {'lr': 3.904322448893052e-05, 'samples': 23676864, 'steps': 123316, 'loss/train': 1.116654396057129} 11/07/2021 14:36:26 - INFO - __main__ - Step 123318: {'lr': 3.9040376857030654e-05, 'samples': 23677056, 'steps': 123317, 'loss/train': 1.577004075050354} 11/07/2021 14:36:26 - INFO - __main__ - Step 123319: {'lr': 3.903752932018595e-05, 'samples': 23677248, 'steps': 123318, 'loss/train': 1.2574613094329834} 11/07/2021 14:36:26 - INFO - __main__ - Step 123320: {'lr': 3.903468187839765e-05, 'samples': 23677440, 'steps': 123319, 'loss/train': 1.4836539030075073} 11/07/2021 14:36:27 - INFO - __main__ - Step 123321: {'lr': 3.9031834531667085e-05, 'samples': 23677632, 'steps': 123320, 'loss/train': 0.8430678248405457} 11/07/2021 14:36:27 - INFO - __main__ - Step 123322: {'lr': 3.9028987279995516e-05, 'samples': 23677824, 'steps': 123321, 'loss/train': 1.8367705345153809} 11/07/2021 14:36:28 - INFO - __main__ - Step 123323: {'lr': 3.902614012338423e-05, 'samples': 23678016, 'steps': 123322, 'loss/train': 0.7512456774711609} 11/07/2021 14:36:28 - INFO - __main__ - Step 123324: {'lr': 3.902329306183453e-05, 'samples': 23678208, 'steps': 123323, 'loss/train': 0.782010555267334} 11/07/2021 14:36:29 - INFO - __main__ - Step 123325: {'lr': 3.902044609534766e-05, 'samples': 23678400, 'steps': 123324, 'loss/train': 1.6471058130264282} 11/07/2021 14:36:29 - INFO - __main__ - Step 123326: {'lr': 3.901759922392498e-05, 'samples': 23678592, 'steps': 123325, 'loss/train': 1.5437328815460205} 11/07/2021 14:36:29 - INFO - __main__ - Step 123327: {'lr': 3.901475244756766e-05, 'samples': 23678784, 'steps': 123326, 'loss/train': 1.2873469591140747} 11/07/2021 14:36:30 - INFO - __main__ - Step 123328: {'lr': 3.901190576627703e-05, 'samples': 23678976, 'steps': 123327, 'loss/train': 1.1922305822372437} 11/07/2021 14:36:31 - INFO - __main__ - Step 123329: {'lr': 3.9009059180054376e-05, 'samples': 23679168, 'steps': 123328, 'loss/train': 0.7327008843421936} 11/07/2021 14:36:31 - INFO - __main__ - Step 123330: {'lr': 3.9006212688901e-05, 'samples': 23679360, 'steps': 123329, 'loss/train': 1.9174429178237915} 11/07/2021 14:36:31 - INFO - __main__ - Step 123331: {'lr': 3.9003366292818145e-05, 'samples': 23679552, 'steps': 123330, 'loss/train': 1.3955459594726562} 11/07/2021 14:36:32 - INFO - __main__ - Step 123332: {'lr': 3.900051999180715e-05, 'samples': 23679744, 'steps': 123331, 'loss/train': 1.1297358274459839} 11/07/2021 14:36:33 - INFO - __main__ - Step 123333: {'lr': 3.899767378586924e-05, 'samples': 23679936, 'steps': 123332, 'loss/train': 1.7764416933059692} 11/07/2021 14:36:33 - INFO - __main__ - Step 123334: {'lr': 3.899482767500573e-05, 'samples': 23680128, 'steps': 123333, 'loss/train': 0.8564994931221008} 11/07/2021 14:36:34 - INFO - __main__ - Step 123335: {'lr': 3.899198165921788e-05, 'samples': 23680320, 'steps': 123334, 'loss/train': 0.3297494947910309} 11/07/2021 14:36:34 - INFO - __main__ - Step 123336: {'lr': 3.898913573850701e-05, 'samples': 23680512, 'steps': 123335, 'loss/train': 1.3383100032806396} 11/07/2021 14:36:35 - INFO - __main__ - Step 123337: {'lr': 3.8986289912874424e-05, 'samples': 23680704, 'steps': 123336, 'loss/train': 0.9180617332458496} 11/07/2021 14:36:35 - INFO - __main__ - Step 123338: {'lr': 3.8983444182321305e-05, 'samples': 23680896, 'steps': 123337, 'loss/train': 1.3924988508224487} 11/07/2021 14:36:36 - INFO - __main__ - Step 123339: {'lr': 3.898059854684899e-05, 'samples': 23681088, 'steps': 123338, 'loss/train': 1.763038992881775} 11/07/2021 14:36:36 - INFO - __main__ - Step 123340: {'lr': 3.8977753006458785e-05, 'samples': 23681280, 'steps': 123339, 'loss/train': 1.0305802822113037} 11/07/2021 14:36:37 - INFO - __main__ - Step 123341: {'lr': 3.8974907561151905e-05, 'samples': 23681472, 'steps': 123340, 'loss/train': 0.07142999768257141} 11/07/2021 14:36:37 - INFO - __main__ - Step 123342: {'lr': 3.897206221092969e-05, 'samples': 23681664, 'steps': 123341, 'loss/train': 0.95762699842453} 11/07/2021 14:36:38 - INFO - __main__ - Step 123343: {'lr': 3.896921695579342e-05, 'samples': 23681856, 'steps': 123342, 'loss/train': 1.711588978767395} 11/07/2021 14:36:39 - INFO - __main__ - Step 123344: {'lr': 3.896637179574436e-05, 'samples': 23682048, 'steps': 123343, 'loss/train': 1.1961911916732788} 11/07/2021 14:36:39 - INFO - __main__ - Step 123345: {'lr': 3.896352673078379e-05, 'samples': 23682240, 'steps': 123344, 'loss/train': 1.6825110912322998} 11/07/2021 14:36:39 - INFO - __main__ - Step 123346: {'lr': 3.8960681760912996e-05, 'samples': 23682432, 'steps': 123345, 'loss/train': 1.2215040922164917} 11/07/2021 14:36:40 - INFO - __main__ - Step 123347: {'lr': 3.8957836886133276e-05, 'samples': 23682624, 'steps': 123346, 'loss/train': 1.2658309936523438} 11/07/2021 14:36:40 - INFO - __main__ - Step 123348: {'lr': 3.895499210644596e-05, 'samples': 23682816, 'steps': 123347, 'loss/train': 0.6029603481292725} 11/07/2021 14:36:41 - INFO - __main__ - Step 123349: {'lr': 3.895214742185219e-05, 'samples': 23683008, 'steps': 123348, 'loss/train': 1.2853487730026245} 11/07/2021 14:36:41 - INFO - __main__ - Step 123350: {'lr': 3.8949302832353347e-05, 'samples': 23683200, 'steps': 123349, 'loss/train': 1.6234627962112427} 11/07/2021 14:36:42 - INFO - __main__ - Step 123351: {'lr': 3.894645833795066e-05, 'samples': 23683392, 'steps': 123350, 'loss/train': 1.3272231817245483} 11/07/2021 14:36:42 - INFO - __main__ - Step 123352: {'lr': 3.8943613938645456e-05, 'samples': 23683584, 'steps': 123351, 'loss/train': 1.2903817892074585} 11/07/2021 14:36:42 - INFO - __main__ - Step 123353: {'lr': 3.8940769634439014e-05, 'samples': 23683776, 'steps': 123352, 'loss/train': 1.6075836420059204} 11/07/2021 14:36:44 - INFO - __main__ - Step 123354: {'lr': 3.893792542533259e-05, 'samples': 23683968, 'steps': 123353, 'loss/train': 0.8164322376251221} 11/07/2021 14:36:44 - INFO - __main__ - Step 123355: {'lr': 3.89350813113275e-05, 'samples': 23684160, 'steps': 123354, 'loss/train': 1.4844820499420166} 11/07/2021 14:36:44 - INFO - __main__ - Step 123356: {'lr': 3.893223729242498e-05, 'samples': 23684352, 'steps': 123355, 'loss/train': 1.0392104387283325} 11/07/2021 14:36:45 - INFO - __main__ - Step 123357: {'lr': 3.892939336862636e-05, 'samples': 23684544, 'steps': 123356, 'loss/train': 1.338776707649231} 11/07/2021 14:36:45 - INFO - __main__ - Step 123358: {'lr': 3.892654953993288e-05, 'samples': 23684736, 'steps': 123357, 'loss/train': 1.7287672758102417} 11/07/2021 14:36:46 - INFO - __main__ - Step 123359: {'lr': 3.892370580634586e-05, 'samples': 23684928, 'steps': 123358, 'loss/train': 1.4188892841339111} 11/07/2021 14:36:46 - INFO - __main__ - Step 123360: {'lr': 3.892086216786655e-05, 'samples': 23685120, 'steps': 123359, 'loss/train': 0.8283233046531677} 11/07/2021 14:36:47 - INFO - __main__ - Step 123361: {'lr': 3.891801862449629e-05, 'samples': 23685312, 'steps': 123360, 'loss/train': 0.04361845552921295} 11/07/2021 14:36:47 - INFO - __main__ - Step 123362: {'lr': 3.891517517623627e-05, 'samples': 23685504, 'steps': 123361, 'loss/train': 0.11386759579181671} 11/07/2021 14:36:47 - INFO - __main__ - Step 123363: {'lr': 3.8912331823087815e-05, 'samples': 23685696, 'steps': 123362, 'loss/train': 0.9890279769897461} 11/07/2021 14:36:49 - INFO - __main__ - Step 123364: {'lr': 3.8909488565052194e-05, 'samples': 23685888, 'steps': 123363, 'loss/train': 1.5053435564041138} 11/07/2021 14:36:49 - INFO - __main__ - Step 123365: {'lr': 3.890664540213071e-05, 'samples': 23686080, 'steps': 123364, 'loss/train': 1.1644493341445923} 11/07/2021 14:36:49 - INFO - __main__ - Step 123366: {'lr': 3.890380233432464e-05, 'samples': 23686272, 'steps': 123365, 'loss/train': 1.3422688245773315} 11/07/2021 14:36:50 - INFO - __main__ - Step 123367: {'lr': 3.8900959361635235e-05, 'samples': 23686464, 'steps': 123366, 'loss/train': 1.2554781436920166} 11/07/2021 14:36:50 - INFO - __main__ - Step 123368: {'lr': 3.889811648406383e-05, 'samples': 23686656, 'steps': 123367, 'loss/train': 1.275971531867981} 11/07/2021 14:36:51 - INFO - __main__ - Step 123369: {'lr': 3.889527370161164e-05, 'samples': 23686848, 'steps': 123368, 'loss/train': 1.654284119606018} 11/07/2021 14:36:51 - INFO - __main__ - Step 123370: {'lr': 3.889243101428e-05, 'samples': 23687040, 'steps': 123369, 'loss/train': 0.8631777763366699} 11/07/2021 14:36:52 - INFO - __main__ - Step 123371: {'lr': 3.888958842207019e-05, 'samples': 23687232, 'steps': 123370, 'loss/train': 1.0413821935653687} 11/07/2021 14:36:52 - INFO - __main__ - Step 123372: {'lr': 3.888674592498345e-05, 'samples': 23687424, 'steps': 123371, 'loss/train': 0.6541873216629028} 11/07/2021 14:36:52 - INFO - __main__ - Step 123373: {'lr': 3.88839035230211e-05, 'samples': 23687616, 'steps': 123372, 'loss/train': 1.356555461883545} 11/07/2021 14:36:53 - INFO - __main__ - Step 123374: {'lr': 3.8881061216184454e-05, 'samples': 23687808, 'steps': 123373, 'loss/train': 1.3376437425613403} 11/07/2021 14:36:54 - INFO - __main__ - Step 123375: {'lr': 3.8878219004474694e-05, 'samples': 23688000, 'steps': 123374, 'loss/train': 0.45765626430511475} 11/07/2021 14:36:54 - INFO - __main__ - Step 123376: {'lr': 3.8875376887893164e-05, 'samples': 23688192, 'steps': 123375, 'loss/train': 1.5846861600875854} 11/07/2021 14:36:54 - INFO - __main__ - Step 123377: {'lr': 3.88725348664411e-05, 'samples': 23688384, 'steps': 123376, 'loss/train': 1.5826504230499268} 11/07/2021 14:36:55 - INFO - __main__ - Step 123378: {'lr': 3.8869692940119825e-05, 'samples': 23688576, 'steps': 123377, 'loss/train': 0.6231110095977783} 11/07/2021 14:36:56 - INFO - __main__ - Step 123379: {'lr': 3.8866851108930624e-05, 'samples': 23688768, 'steps': 123378, 'loss/train': 1.016886830329895} 11/07/2021 14:36:56 - INFO - __main__ - Step 123380: {'lr': 3.8864009372874736e-05, 'samples': 23688960, 'steps': 123379, 'loss/train': 1.5590680837631226} 11/07/2021 14:36:57 - INFO - __main__ - Step 123381: {'lr': 3.88611677319535e-05, 'samples': 23689152, 'steps': 123380, 'loss/train': 1.182045817375183} 11/07/2021 14:36:57 - INFO - __main__ - Step 123382: {'lr': 3.8858326186168136e-05, 'samples': 23689344, 'steps': 123381, 'loss/train': 1.1867942810058594} 11/07/2021 14:36:57 - INFO - __main__ - Step 123383: {'lr': 3.8855484735519946e-05, 'samples': 23689536, 'steps': 123382, 'loss/train': 0.480349063873291} 11/07/2021 14:36:58 - INFO - __main__ - Step 123384: {'lr': 3.885264338001024e-05, 'samples': 23689728, 'steps': 123383, 'loss/train': 0.9517650008201599} 11/07/2021 14:36:59 - INFO - __main__ - Step 123385: {'lr': 3.884980211964026e-05, 'samples': 23689920, 'steps': 123384, 'loss/train': 0.9323834180831909} 11/07/2021 14:36:59 - INFO - __main__ - Step 123386: {'lr': 3.8846960954411314e-05, 'samples': 23690112, 'steps': 123385, 'loss/train': 1.3544888496398926} 11/07/2021 14:36:59 - INFO - __main__ - Step 123387: {'lr': 3.884411988432468e-05, 'samples': 23690304, 'steps': 123386, 'loss/train': 1.2001045942306519} 11/07/2021 14:37:00 - INFO - __main__ - Step 123388: {'lr': 3.8841278909381664e-05, 'samples': 23690496, 'steps': 123387, 'loss/train': 1.299440622329712} 11/07/2021 14:37:01 - INFO - __main__ - Step 123389: {'lr': 3.8838438029583454e-05, 'samples': 23690688, 'steps': 123388, 'loss/train': 1.5777146816253662} 11/07/2021 14:37:01 - INFO - __main__ - Step 123390: {'lr': 3.8835597244931384e-05, 'samples': 23690880, 'steps': 123389, 'loss/train': 1.5308030843734741} 11/07/2021 14:37:02 - INFO - __main__ - Step 123391: {'lr': 3.883275655542673e-05, 'samples': 23691072, 'steps': 123390, 'loss/train': 1.2533420324325562} 11/07/2021 14:37:02 - INFO - __main__ - Step 123392: {'lr': 3.88299159610708e-05, 'samples': 23691264, 'steps': 123391, 'loss/train': 1.2459598779678345} 11/07/2021 14:37:02 - INFO - __main__ - Step 123393: {'lr': 3.882707546186481e-05, 'samples': 23691456, 'steps': 123392, 'loss/train': 1.1727428436279297} 11/07/2021 14:37:03 - INFO - __main__ - Step 123394: {'lr': 3.882423505781013e-05, 'samples': 23691648, 'steps': 123393, 'loss/train': 0.4297689199447632} 11/07/2021 14:37:04 - INFO - __main__ - Step 123395: {'lr': 3.882139474890797e-05, 'samples': 23691840, 'steps': 123394, 'loss/train': 1.1579471826553345} 11/07/2021 14:37:04 - INFO - __main__ - Step 123396: {'lr': 3.881855453515962e-05, 'samples': 23692032, 'steps': 123395, 'loss/train': 1.1555497646331787} 11/07/2021 14:37:04 - INFO - __main__ - Step 123397: {'lr': 3.8815714416566365e-05, 'samples': 23692224, 'steps': 123396, 'loss/train': 1.1799697875976562} 11/07/2021 14:37:05 - INFO - __main__ - Step 123398: {'lr': 3.8812874393129524e-05, 'samples': 23692416, 'steps': 123397, 'loss/train': 0.7985051274299622} 11/07/2021 14:37:06 - INFO - __main__ - Step 123399: {'lr': 3.881003446485032e-05, 'samples': 23692608, 'steps': 123398, 'loss/train': 0.9119536876678467} 11/07/2021 14:37:06 - INFO - __main__ - Step 123400: {'lr': 3.880719463173005e-05, 'samples': 23692800, 'steps': 123399, 'loss/train': 0.7691817283630371} 11/07/2021 14:37:06 - INFO - __main__ - Step 123401: {'lr': 3.880435489377007e-05, 'samples': 23692992, 'steps': 123400, 'loss/train': 1.119096040725708} 11/07/2021 14:37:07 - INFO - __main__ - Step 123402: {'lr': 3.880151525097153e-05, 'samples': 23693184, 'steps': 123401, 'loss/train': 1.2648838758468628} 11/07/2021 14:37:07 - INFO - __main__ - Step 123403: {'lr': 3.879867570333578e-05, 'samples': 23693376, 'steps': 123402, 'loss/train': 1.2158443927764893} 11/07/2021 14:37:08 - INFO - __main__ - Step 123404: {'lr': 3.879583625086405e-05, 'samples': 23693568, 'steps': 123403, 'loss/train': 1.2896511554718018} 11/07/2021 14:37:09 - INFO - __main__ - Step 123405: {'lr': 3.87929968935577e-05, 'samples': 23693760, 'steps': 123404, 'loss/train': 1.433341145515442} 11/07/2021 14:37:09 - INFO - __main__ - Step 123406: {'lr': 3.879015763141794e-05, 'samples': 23693952, 'steps': 123405, 'loss/train': 1.1637303829193115} 11/07/2021 14:37:09 - INFO - __main__ - Step 123407: {'lr': 3.878731846444608e-05, 'samples': 23694144, 'steps': 123406, 'loss/train': 1.6017802953720093} 11/07/2021 14:37:10 - INFO - __main__ - Step 123408: {'lr': 3.87844793926434e-05, 'samples': 23694336, 'steps': 123407, 'loss/train': 0.7605571746826172} 11/07/2021 14:37:10 - INFO - __main__ - Step 123409: {'lr': 3.8781640416011176e-05, 'samples': 23694528, 'steps': 123408, 'loss/train': 1.2119379043579102} 11/07/2021 14:37:11 - INFO - __main__ - Step 123410: {'lr': 3.877880153455069e-05, 'samples': 23694720, 'steps': 123409, 'loss/train': 1.187935709953308} 11/07/2021 14:37:12 - INFO - __main__ - Step 123411: {'lr': 3.87759627482632e-05, 'samples': 23694912, 'steps': 123410, 'loss/train': 1.3368861675262451} 11/07/2021 14:37:12 - INFO - __main__ - Step 123412: {'lr': 3.877312405715003e-05, 'samples': 23695104, 'steps': 123411, 'loss/train': 1.140005350112915} 11/07/2021 14:37:13 - INFO - __main__ - Step 123413: {'lr': 3.877028546121239e-05, 'samples': 23695296, 'steps': 123412, 'loss/train': 0.9469035863876343} 11/07/2021 14:37:13 - INFO - __main__ - Step 123414: {'lr': 3.876744696045167e-05, 'samples': 23695488, 'steps': 123413, 'loss/train': 1.3981506824493408} 11/07/2021 14:37:14 - INFO - __main__ - Step 123415: {'lr': 3.8764608554869044e-05, 'samples': 23695680, 'steps': 123414, 'loss/train': 0.21357853710651398} 11/07/2021 14:37:14 - INFO - __main__ - Step 123416: {'lr': 3.876177024446581e-05, 'samples': 23695872, 'steps': 123415, 'loss/train': 1.2599939107894897} 11/07/2021 14:37:15 - INFO - __main__ - Step 123417: {'lr': 3.875893202924327e-05, 'samples': 23696064, 'steps': 123416, 'loss/train': 1.405588150024414} 11/07/2021 14:37:15 - INFO - __main__ - Step 123418: {'lr': 3.875609390920268e-05, 'samples': 23696256, 'steps': 123417, 'loss/train': 1.3605597019195557} 11/07/2021 14:37:15 - INFO - __main__ - Step 123419: {'lr': 3.875325588434536e-05, 'samples': 23696448, 'steps': 123418, 'loss/train': 1.1534483432769775} 11/07/2021 14:37:16 - INFO - __main__ - Step 123420: {'lr': 3.8750417954672546e-05, 'samples': 23696640, 'steps': 123419, 'loss/train': 1.341609239578247} 11/07/2021 14:37:17 - INFO - __main__ - Step 123421: {'lr': 3.874758012018553e-05, 'samples': 23696832, 'steps': 123420, 'loss/train': 1.3968453407287598} 11/07/2021 14:37:17 - INFO - __main__ - Step 123422: {'lr': 3.87447423808856e-05, 'samples': 23697024, 'steps': 123421, 'loss/train': 1.216444969177246} 11/07/2021 14:37:17 - INFO - __main__ - Step 123423: {'lr': 3.8741904736774025e-05, 'samples': 23697216, 'steps': 123422, 'loss/train': 0.8483815789222717} 11/07/2021 14:37:18 - INFO - __main__ - Step 123424: {'lr': 3.8739067187852114e-05, 'samples': 23697408, 'steps': 123423, 'loss/train': 1.475765347480774} 11/07/2021 14:37:19 - INFO - __main__ - Step 123425: {'lr': 3.873622973412108e-05, 'samples': 23697600, 'steps': 123424, 'loss/train': 1.0350607633590698} 11/07/2021 14:37:19 - INFO - __main__ - Step 123426: {'lr': 3.8733392375582265e-05, 'samples': 23697792, 'steps': 123425, 'loss/train': 1.237402081489563} 11/07/2021 14:37:19 - INFO - __main__ - Step 123427: {'lr': 3.8730555112236916e-05, 'samples': 23697984, 'steps': 123426, 'loss/train': 1.3061518669128418} 11/07/2021 14:37:20 - INFO - __main__ - Step 123428: {'lr': 3.872771794408639e-05, 'samples': 23698176, 'steps': 123427, 'loss/train': 1.2347995042800903} 11/07/2021 14:37:20 - INFO - __main__ - Step 123429: {'lr': 3.8724880871131825e-05, 'samples': 23698368, 'steps': 123428, 'loss/train': 1.3064160346984863} 11/07/2021 14:37:21 - INFO - __main__ - Step 123430: {'lr': 3.872204389337455e-05, 'samples': 23698560, 'steps': 123429, 'loss/train': 0.9675045013427734} 11/07/2021 14:37:22 - INFO - __main__ - Step 123431: {'lr': 3.8719207010815885e-05, 'samples': 23698752, 'steps': 123430, 'loss/train': 1.3211442232131958} 11/07/2021 14:37:22 - INFO - __main__ - Step 123432: {'lr': 3.871637022345709e-05, 'samples': 23698944, 'steps': 123431, 'loss/train': 1.4650901556015015} 11/07/2021 14:37:22 - INFO - __main__ - Step 123433: {'lr': 3.8713533531299415e-05, 'samples': 23699136, 'steps': 123432, 'loss/train': 1.262822151184082} 11/07/2021 14:37:23 - INFO - __main__ - Step 123434: {'lr': 3.871069693434417e-05, 'samples': 23699328, 'steps': 123433, 'loss/train': 0.4116703271865845} 11/07/2021 14:37:24 - INFO - __main__ - Step 123435: {'lr': 3.870786043259264e-05, 'samples': 23699520, 'steps': 123434, 'loss/train': 1.4373688697814941} 11/07/2021 14:37:24 - INFO - __main__ - Step 123436: {'lr': 3.870502402604606e-05, 'samples': 23699712, 'steps': 123435, 'loss/train': 0.9806073904037476} 11/07/2021 14:37:24 - INFO - __main__ - Step 123437: {'lr': 3.870218771470577e-05, 'samples': 23699904, 'steps': 123436, 'loss/train': 1.2165616750717163} 11/07/2021 14:37:25 - INFO - __main__ - Step 123438: {'lr': 3.869935149857298e-05, 'samples': 23700096, 'steps': 123437, 'loss/train': 1.186428189277649} 11/07/2021 14:37:25 - INFO - __main__ - Step 123439: {'lr': 3.869651537764901e-05, 'samples': 23700288, 'steps': 123438, 'loss/train': 1.226565957069397} 11/07/2021 14:37:26 - INFO - __main__ - Step 123440: {'lr': 3.869367935193516e-05, 'samples': 23700480, 'steps': 123439, 'loss/train': 0.7248185276985168} 11/07/2021 14:37:26 - INFO - __main__ - Step 123441: {'lr': 3.8690843421432695e-05, 'samples': 23700672, 'steps': 123440, 'loss/train': 2.3325555324554443} 11/07/2021 14:37:27 - INFO - __main__ - Step 123442: {'lr': 3.8688007586142854e-05, 'samples': 23700864, 'steps': 123441, 'loss/train': 1.2674952745437622} 11/07/2021 14:37:27 - INFO - __main__ - Step 123443: {'lr': 3.8685171846066904e-05, 'samples': 23701056, 'steps': 123442, 'loss/train': 1.3931143283843994} 11/07/2021 14:37:27 - INFO - __main__ - Step 123444: {'lr': 3.868233620120618e-05, 'samples': 23701248, 'steps': 123443, 'loss/train': 1.3270095586776733} 11/07/2021 14:37:29 - INFO - __main__ - Step 123445: {'lr': 3.867950065156192e-05, 'samples': 23701440, 'steps': 123444, 'loss/train': 1.059454083442688} 11/07/2021 14:37:29 - INFO - __main__ - Step 123446: {'lr': 3.867666519713542e-05, 'samples': 23701632, 'steps': 123445, 'loss/train': 1.6626003980636597} 11/07/2021 14:37:29 - INFO - __main__ - Step 123447: {'lr': 3.867382983792794e-05, 'samples': 23701824, 'steps': 123446, 'loss/train': 1.5283347368240356} 11/07/2021 14:37:30 - INFO - __main__ - Step 123448: {'lr': 3.867099457394077e-05, 'samples': 23702016, 'steps': 123447, 'loss/train': 1.1191126108169556} 11/07/2021 14:37:30 - INFO - __main__ - Step 123449: {'lr': 3.8668159405175206e-05, 'samples': 23702208, 'steps': 123448, 'loss/train': 1.6332029104232788} 11/07/2021 14:37:30 - INFO - __main__ - Step 123450: {'lr': 3.8665324331632504e-05, 'samples': 23702400, 'steps': 123449, 'loss/train': 1.206302523612976} 11/07/2021 14:37:31 - INFO - __main__ - Step 123451: {'lr': 3.866248935331396e-05, 'samples': 23702592, 'steps': 123450, 'loss/train': 0.6704519987106323} 11/07/2021 14:37:32 - INFO - __main__ - Step 123452: {'lr': 3.8659654470220826e-05, 'samples': 23702784, 'steps': 123451, 'loss/train': 1.2026082277297974} 11/07/2021 14:37:32 - INFO - __main__ - Step 123453: {'lr': 3.865681968235438e-05, 'samples': 23702976, 'steps': 123452, 'loss/train': 0.6282213926315308} 11/07/2021 14:37:32 - INFO - __main__ - Step 123454: {'lr': 3.865398498971592e-05, 'samples': 23703168, 'steps': 123453, 'loss/train': 1.1408250331878662} 11/07/2021 14:37:33 - INFO - __main__ - Step 123455: {'lr': 3.865115039230677e-05, 'samples': 23703360, 'steps': 123454, 'loss/train': 1.869328260421753} 11/07/2021 14:37:34 - INFO - __main__ - Step 123456: {'lr': 3.86483158901281e-05, 'samples': 23703552, 'steps': 123455, 'loss/train': 1.134080410003662} 11/07/2021 14:37:34 - INFO - __main__ - Step 123457: {'lr': 3.8645481483181226e-05, 'samples': 23703744, 'steps': 123456, 'loss/train': 1.3179223537445068} 11/07/2021 14:37:35 - INFO - __main__ - Step 123458: {'lr': 3.8642647171467424e-05, 'samples': 23703936, 'steps': 123457, 'loss/train': 1.1696048974990845} 11/07/2021 14:37:35 - INFO - __main__ - Step 123459: {'lr': 3.8639812954988e-05, 'samples': 23704128, 'steps': 123458, 'loss/train': 1.4618161916732788} 11/07/2021 14:37:35 - INFO - __main__ - Step 123460: {'lr': 3.863697883374423e-05, 'samples': 23704320, 'steps': 123459, 'loss/train': 1.5143927335739136} 11/07/2021 14:37:36 - INFO - __main__ - Step 123461: {'lr': 3.863414480773736e-05, 'samples': 23704512, 'steps': 123460, 'loss/train': 1.351546287536621} 11/07/2021 14:37:37 - INFO - __main__ - Step 123462: {'lr': 3.863131087696867e-05, 'samples': 23704704, 'steps': 123461, 'loss/train': 1.3581578731536865} 11/07/2021 14:37:37 - INFO - __main__ - Step 123463: {'lr': 3.8628477041439456e-05, 'samples': 23704896, 'steps': 123462, 'loss/train': 1.4222335815429688} 11/07/2021 14:37:37 - INFO - __main__ - Step 123464: {'lr': 3.862564330115101e-05, 'samples': 23705088, 'steps': 123463, 'loss/train': 0.6666765213012695} 11/07/2021 14:37:38 - INFO - __main__ - Step 123465: {'lr': 3.862280965610457e-05, 'samples': 23705280, 'steps': 123464, 'loss/train': 1.3255088329315186} 11/07/2021 14:37:39 - INFO - __main__ - Step 123466: {'lr': 3.8619976106301416e-05, 'samples': 23705472, 'steps': 123465, 'loss/train': 1.151963472366333} 11/07/2021 14:37:39 - INFO - __main__ - Step 123467: {'lr': 3.8617142651742876e-05, 'samples': 23705664, 'steps': 123466, 'loss/train': 1.1024022102355957} 11/07/2021 14:37:39 - INFO - __main__ - Step 123468: {'lr': 3.86143092924302e-05, 'samples': 23705856, 'steps': 123467, 'loss/train': 0.8699742555618286} 11/07/2021 14:37:40 - INFO - __main__ - Step 123469: {'lr': 3.861147602836465e-05, 'samples': 23706048, 'steps': 123468, 'loss/train': 1.1043909788131714} 11/07/2021 14:37:40 - INFO - __main__ - Step 123470: {'lr': 3.860864285954746e-05, 'samples': 23706240, 'steps': 123469, 'loss/train': 1.384331226348877} 11/07/2021 14:37:41 - INFO - __main__ - Step 123471: {'lr': 3.860580978597999e-05, 'samples': 23706432, 'steps': 123470, 'loss/train': 1.0265213251113892} 11/07/2021 14:37:42 - INFO - __main__ - Step 123472: {'lr': 3.860297680766345e-05, 'samples': 23706624, 'steps': 123471, 'loss/train': 1.3165448904037476} 11/07/2021 14:37:42 - INFO - __main__ - Step 123473: {'lr': 3.860014392459918e-05, 'samples': 23706816, 'steps': 123472, 'loss/train': 0.7567027807235718} 11/07/2021 14:37:42 - INFO - __main__ - Step 123474: {'lr': 3.859731113678838e-05, 'samples': 23707008, 'steps': 123473, 'loss/train': 1.174354076385498} 11/07/2021 14:37:43 - INFO - __main__ - Step 123475: {'lr': 3.859447844423242e-05, 'samples': 23707200, 'steps': 123474, 'loss/train': 1.145262598991394} 11/07/2021 14:37:44 - INFO - __main__ - Step 123476: {'lr': 3.859164584693248e-05, 'samples': 23707392, 'steps': 123475, 'loss/train': 1.1633310317993164} 11/07/2021 14:37:44 - INFO - __main__ - Step 123477: {'lr': 3.8588813344889896e-05, 'samples': 23707584, 'steps': 123476, 'loss/train': 1.2974313497543335} 11/07/2021 14:37:45 - INFO - __main__ - Step 123478: {'lr': 3.858598093810595e-05, 'samples': 23707776, 'steps': 123477, 'loss/train': 0.05166761577129364} 11/07/2021 14:37:45 - INFO - __main__ - Step 123479: {'lr': 3.858314862658188e-05, 'samples': 23707968, 'steps': 123478, 'loss/train': 1.1363332271575928} 11/07/2021 14:37:46 - INFO - __main__ - Step 123480: {'lr': 3.858031641031898e-05, 'samples': 23708160, 'steps': 123479, 'loss/train': 1.3324062824249268} 11/07/2021 14:37:47 - INFO - __main__ - Step 123481: {'lr': 3.857748428931854e-05, 'samples': 23708352, 'steps': 123480, 'loss/train': 1.435484528541565} 11/07/2021 14:37:47 - INFO - __main__ - Step 123482: {'lr': 3.8574652263581865e-05, 'samples': 23708544, 'steps': 123481, 'loss/train': 1.1772600412368774} 11/07/2021 14:37:48 - INFO - __main__ - Step 123483: {'lr': 3.857182033311013e-05, 'samples': 23708736, 'steps': 123482, 'loss/train': 1.3778588771820068} 11/07/2021 14:37:48 - INFO - __main__ - Step 123484: {'lr': 3.8568988497904686e-05, 'samples': 23708928, 'steps': 123483, 'loss/train': 1.8174958229064941} 11/07/2021 14:37:48 - INFO - __main__ - Step 123485: {'lr': 3.856615675796679e-05, 'samples': 23709120, 'steps': 123484, 'loss/train': 1.2129249572753906} 11/07/2021 14:37:49 - INFO - __main__ - Step 123486: {'lr': 3.8563325113297717e-05, 'samples': 23709312, 'steps': 123485, 'loss/train': 0.11507482826709747} 11/07/2021 14:37:50 - INFO - __main__ - Step 123487: {'lr': 3.8560493563898766e-05, 'samples': 23709504, 'steps': 123486, 'loss/train': 1.1255825757980347} 11/07/2021 14:37:50 - INFO - __main__ - Step 123488: {'lr': 3.8557662109771155e-05, 'samples': 23709696, 'steps': 123487, 'loss/train': 0.8823533058166504} 11/07/2021 14:37:50 - INFO - __main__ - Step 123489: {'lr': 3.855483075091623e-05, 'samples': 23709888, 'steps': 123488, 'loss/train': 0.9847286939620972} 11/07/2021 14:37:51 - INFO - __main__ - Step 123490: {'lr': 3.855199948733523e-05, 'samples': 23710080, 'steps': 123489, 'loss/train': 1.0634905099868774} 11/07/2021 14:37:51 - INFO - __main__ - Step 123491: {'lr': 3.854916831902941e-05, 'samples': 23710272, 'steps': 123490, 'loss/train': 1.6414556503295898} 11/07/2021 14:37:52 - INFO - __main__ - Step 123492: {'lr': 3.8546337246000094e-05, 'samples': 23710464, 'steps': 123491, 'loss/train': 1.1040692329406738} 11/07/2021 14:37:52 - INFO - __main__ - Step 123493: {'lr': 3.854350626824854e-05, 'samples': 23710656, 'steps': 123492, 'loss/train': 1.68900465965271} 11/07/2021 14:37:53 - INFO - __main__ - Step 123494: {'lr': 3.854067538577602e-05, 'samples': 23710848, 'steps': 123493, 'loss/train': 0.5995023846626282} 11/07/2021 14:37:53 - INFO - __main__ - Step 123495: {'lr': 3.853784459858387e-05, 'samples': 23711040, 'steps': 123494, 'loss/train': 1.1458771228790283} 11/07/2021 14:37:54 - INFO - __main__ - Step 123496: {'lr': 3.853501390667321e-05, 'samples': 23711232, 'steps': 123495, 'loss/train': 1.3325469493865967} 11/07/2021 14:37:55 - INFO - __main__ - Step 123497: {'lr': 3.853218331004546e-05, 'samples': 23711424, 'steps': 123496, 'loss/train': 1.112390160560608} 11/07/2021 14:37:55 - INFO - __main__ - Step 123498: {'lr': 3.852935280870182e-05, 'samples': 23711616, 'steps': 123497, 'loss/train': 0.9792702198028564} 11/07/2021 14:37:55 - INFO - __main__ - Step 123499: {'lr': 3.8526522402643594e-05, 'samples': 23711808, 'steps': 123498, 'loss/train': 1.162959337234497} 11/07/2021 14:37:56 - INFO - __main__ - Step 123500: {'lr': 3.8523692091872036e-05, 'samples': 23712000, 'steps': 123499, 'loss/train': 1.2142024040222168} 11/07/2021 14:37:56 - INFO - __main__ - Step 123501: {'lr': 3.852086187638845e-05, 'samples': 23712192, 'steps': 123500, 'loss/train': 1.393640160560608} 11/07/2021 14:37:57 - INFO - __main__ - Step 123502: {'lr': 3.8518031756194116e-05, 'samples': 23712384, 'steps': 123501, 'loss/train': 1.0028011798858643} 11/07/2021 14:37:57 - INFO - __main__ - Step 123503: {'lr': 3.8515201731290275e-05, 'samples': 23712576, 'steps': 123502, 'loss/train': 1.0438337326049805} 11/07/2021 14:37:58 - INFO - __main__ - Step 123504: {'lr': 3.8512371801678244e-05, 'samples': 23712768, 'steps': 123503, 'loss/train': 1.7922898530960083} 11/07/2021 14:37:58 - INFO - __main__ - Step 123505: {'lr': 3.8509541967359256e-05, 'samples': 23712960, 'steps': 123504, 'loss/train': 1.3640879392623901} 11/07/2021 14:37:58 - INFO - __main__ - Step 123506: {'lr': 3.85067122283346e-05, 'samples': 23713152, 'steps': 123505, 'loss/train': 1.1727333068847656} 11/07/2021 14:38:00 - INFO - __main__ - Step 123507: {'lr': 3.850388258460555e-05, 'samples': 23713344, 'steps': 123506, 'loss/train': 0.9149900078773499} 11/07/2021 14:38:00 - INFO - __main__ - Step 123508: {'lr': 3.8501053036173404e-05, 'samples': 23713536, 'steps': 123507, 'loss/train': 0.059999242424964905} 11/07/2021 14:38:00 - INFO - __main__ - Step 123509: {'lr': 3.8498223583039476e-05, 'samples': 23713728, 'steps': 123508, 'loss/train': 1.1739356517791748} 11/07/2021 14:38:01 - INFO - __main__ - Step 123510: {'lr': 3.8495394225204927e-05, 'samples': 23713920, 'steps': 123509, 'loss/train': 1.1614432334899902} 11/07/2021 14:38:01 - INFO - __main__ - Step 123511: {'lr': 3.849256496267109e-05, 'samples': 23714112, 'steps': 123510, 'loss/train': 1.2586791515350342} 11/07/2021 14:38:01 - INFO - __main__ - Step 123512: {'lr': 3.848973579543924e-05, 'samples': 23714304, 'steps': 123511, 'loss/train': 1.7411468029022217} 11/07/2021 14:38:03 - INFO - __main__ - Step 123513: {'lr': 3.8486906723510657e-05, 'samples': 23714496, 'steps': 123512, 'loss/train': 1.7281980514526367} 11/07/2021 14:38:03 - INFO - __main__ - Step 123514: {'lr': 3.8484077746886585e-05, 'samples': 23714688, 'steps': 123513, 'loss/train': 1.6203705072402954} 11/07/2021 14:38:03 - INFO - __main__ - Step 123515: {'lr': 3.848124886556834e-05, 'samples': 23714880, 'steps': 123514, 'loss/train': 1.214110255241394} 11/07/2021 14:38:04 - INFO - __main__ - Step 123516: {'lr': 3.847842007955718e-05, 'samples': 23715072, 'steps': 123515, 'loss/train': 0.8389989137649536} 11/07/2021 14:38:04 - INFO - __main__ - Step 123517: {'lr': 3.84755913888544e-05, 'samples': 23715264, 'steps': 123516, 'loss/train': 1.2750580310821533} 11/07/2021 14:38:05 - INFO - __main__ - Step 123518: {'lr': 3.8472762793461235e-05, 'samples': 23715456, 'steps': 123517, 'loss/train': 0.8586960434913635} 11/07/2021 14:38:05 - INFO - __main__ - Step 123519: {'lr': 3.846993429337897e-05, 'samples': 23715648, 'steps': 123518, 'loss/train': 1.314831256866455} 11/07/2021 14:38:06 - INFO - __main__ - Step 123520: {'lr': 3.8467105888608885e-05, 'samples': 23715840, 'steps': 123519, 'loss/train': 1.5482300519943237} 11/07/2021 14:38:06 - INFO - __main__ - Step 123521: {'lr': 3.846427757915227e-05, 'samples': 23716032, 'steps': 123520, 'loss/train': 0.9979482293128967} 11/07/2021 14:38:06 - INFO - __main__ - Step 123522: {'lr': 3.846144936501045e-05, 'samples': 23716224, 'steps': 123521, 'loss/train': 1.3769266605377197} 11/07/2021 14:38:08 - INFO - __main__ - Step 123523: {'lr': 3.845862124618457e-05, 'samples': 23716416, 'steps': 123522, 'loss/train': 1.3080437183380127} 11/07/2021 14:38:08 - INFO - __main__ - Step 123524: {'lr': 3.8455793222675976e-05, 'samples': 23716608, 'steps': 123523, 'loss/train': 1.1495531797409058} 11/07/2021 14:38:08 - INFO - __main__ - Step 123525: {'lr': 3.845296529448594e-05, 'samples': 23716800, 'steps': 123524, 'loss/train': 1.4109759330749512} 11/07/2021 14:38:09 - INFO - __main__ - Step 123526: {'lr': 3.845013746161574e-05, 'samples': 23716992, 'steps': 123525, 'loss/train': 0.764912486076355} 11/07/2021 14:38:09 - INFO - __main__ - Step 123527: {'lr': 3.8447309724066625e-05, 'samples': 23717184, 'steps': 123526, 'loss/train': 1.1311153173446655} 11/07/2021 14:38:10 - INFO - __main__ - Step 123528: {'lr': 3.84444820818399e-05, 'samples': 23717376, 'steps': 123527, 'loss/train': 1.2035858631134033} 11/07/2021 14:38:10 - INFO - __main__ - Step 123529: {'lr': 3.8441654534936837e-05, 'samples': 23717568, 'steps': 123528, 'loss/train': 1.594801664352417} 11/07/2021 14:38:11 - INFO - __main__ - Step 123530: {'lr': 3.843882708335866e-05, 'samples': 23717760, 'steps': 123529, 'loss/train': 1.2360845804214478} 11/07/2021 14:38:11 - INFO - __main__ - Step 123531: {'lr': 3.8435999727106736e-05, 'samples': 23717952, 'steps': 123530, 'loss/train': 1.8292666673660278} 11/07/2021 14:38:11 - INFO - __main__ - Step 123532: {'lr': 3.843317246618225e-05, 'samples': 23718144, 'steps': 123531, 'loss/train': 0.8490180969238281} 11/07/2021 14:38:12 - INFO - __main__ - Step 123533: {'lr': 3.843034530058651e-05, 'samples': 23718336, 'steps': 123532, 'loss/train': 1.9201847314834595} 11/07/2021 14:38:13 - INFO - __main__ - Step 123534: {'lr': 3.8427518230320815e-05, 'samples': 23718528, 'steps': 123533, 'loss/train': 2.1744697093963623} 11/07/2021 14:38:13 - INFO - __main__ - Step 123535: {'lr': 3.8424691255386444e-05, 'samples': 23718720, 'steps': 123534, 'loss/train': 0.8572391271591187} 11/07/2021 14:38:13 - INFO - __main__ - Step 123536: {'lr': 3.842186437578463e-05, 'samples': 23718912, 'steps': 123535, 'loss/train': 0.9835323095321655} 11/07/2021 14:38:14 - INFO - __main__ - Step 123537: {'lr': 3.84190375915166e-05, 'samples': 23719104, 'steps': 123536, 'loss/train': 1.3347266912460327} 11/07/2021 14:38:15 - INFO - __main__ - Step 123538: {'lr': 3.841621090258374e-05, 'samples': 23719296, 'steps': 123537, 'loss/train': 0.5897365212440491} 11/07/2021 14:38:15 - INFO - __main__ - Step 123539: {'lr': 3.841338430898725e-05, 'samples': 23719488, 'steps': 123538, 'loss/train': 1.1474080085754395} 11/07/2021 14:38:15 - INFO - __main__ - Step 123540: {'lr': 3.8410557810728414e-05, 'samples': 23719680, 'steps': 123539, 'loss/train': 1.2520661354064941} 11/07/2021 14:38:16 - INFO - __main__ - Step 123541: {'lr': 3.840773140780851e-05, 'samples': 23719872, 'steps': 123540, 'loss/train': 1.0444296598434448} 11/07/2021 14:38:16 - INFO - __main__ - Step 123542: {'lr': 3.840490510022884e-05, 'samples': 23720064, 'steps': 123541, 'loss/train': 1.5963395833969116} 11/07/2021 14:38:17 - INFO - __main__ - Step 123543: {'lr': 3.840207888799063e-05, 'samples': 23720256, 'steps': 123542, 'loss/train': 1.196637749671936} 11/07/2021 14:38:18 - INFO - __main__ - Step 123544: {'lr': 3.839925277109521e-05, 'samples': 23720448, 'steps': 123543, 'loss/train': 1.1927886009216309} 11/07/2021 14:38:18 - INFO - __main__ - Step 123545: {'lr': 3.83964267495438e-05, 'samples': 23720640, 'steps': 123544, 'loss/train': 1.3260791301727295} 11/07/2021 14:38:18 - INFO - __main__ - Step 123546: {'lr': 3.8393600823337707e-05, 'samples': 23720832, 'steps': 123545, 'loss/train': 1.3460863828659058} 11/07/2021 14:38:19 - INFO - __main__ - Step 123547: {'lr': 3.8390774992478175e-05, 'samples': 23721024, 'steps': 123546, 'loss/train': 1.4123528003692627} 11/07/2021 14:38:20 - INFO - __main__ - Step 123548: {'lr': 3.8387949256966506e-05, 'samples': 23721216, 'steps': 123547, 'loss/train': 1.2751580476760864} 11/07/2021 14:38:20 - INFO - __main__ - Step 123549: {'lr': 3.838512361680402e-05, 'samples': 23721408, 'steps': 123548, 'loss/train': 1.3464946746826172} 11/07/2021 14:38:20 - INFO - __main__ - Step 123550: {'lr': 3.8382298071991866e-05, 'samples': 23721600, 'steps': 123549, 'loss/train': 1.2497665882110596} 11/07/2021 14:38:21 - INFO - __main__ - Step 123551: {'lr': 3.837947262253139e-05, 'samples': 23721792, 'steps': 123550, 'loss/train': 2.3037145137786865} 11/07/2021 14:38:21 - INFO - __main__ - Step 123552: {'lr': 3.8376647268423856e-05, 'samples': 23721984, 'steps': 123551, 'loss/train': 0.6711753010749817} 11/07/2021 14:38:21 - INFO - __main__ - Step 123553: {'lr': 3.837382200967054e-05, 'samples': 23722176, 'steps': 123552, 'loss/train': 1.3749721050262451} 11/07/2021 14:38:23 - INFO - __main__ - Step 123554: {'lr': 3.837099684627271e-05, 'samples': 23722368, 'steps': 123553, 'loss/train': 1.38961660861969} 11/07/2021 14:38:23 - INFO - __main__ - Step 123555: {'lr': 3.8368171778231656e-05, 'samples': 23722560, 'steps': 123554, 'loss/train': 1.3909947872161865} 11/07/2021 14:38:23 - INFO - __main__ - Step 123556: {'lr': 3.8365346805548624e-05, 'samples': 23722752, 'steps': 123555, 'loss/train': 0.8610202074050903} 11/07/2021 14:38:24 - INFO - __main__ - Step 123557: {'lr': 3.8362521928224926e-05, 'samples': 23722944, 'steps': 123556, 'loss/train': 1.1958552598953247} 11/07/2021 14:38:24 - INFO - __main__ - Step 123558: {'lr': 3.8359697146261777e-05, 'samples': 23723136, 'steps': 123557, 'loss/train': 1.343741536140442} 11/07/2021 14:38:25 - INFO - __main__ - Step 123559: {'lr': 3.835687245966049e-05, 'samples': 23723328, 'steps': 123558, 'loss/train': 5.690911769866943} 11/07/2021 14:38:25 - INFO - __main__ - Step 123560: {'lr': 3.835404786842234e-05, 'samples': 23723520, 'steps': 123559, 'loss/train': 1.1389418840408325} 11/07/2021 14:38:26 - INFO - __main__ - Step 123561: {'lr': 3.835122337254859e-05, 'samples': 23723712, 'steps': 123560, 'loss/train': 1.0602692365646362} 11/07/2021 14:38:26 - INFO - __main__ - Step 123562: {'lr': 3.8348398972040565e-05, 'samples': 23723904, 'steps': 123561, 'loss/train': 1.8560638427734375} 11/07/2021 14:38:26 - INFO - __main__ - Step 123563: {'lr': 3.834557466689945e-05, 'samples': 23724096, 'steps': 123562, 'loss/train': 1.6607357263565063} 11/07/2021 14:38:27 - INFO - __main__ - Step 123564: {'lr': 3.8342750457126515e-05, 'samples': 23724288, 'steps': 123563, 'loss/train': 1.0295627117156982} 11/07/2021 14:38:28 - INFO - __main__ - Step 123565: {'lr': 3.8339926342723096e-05, 'samples': 23724480, 'steps': 123564, 'loss/train': 1.2056503295898438} 11/07/2021 14:38:28 - INFO - __main__ - Step 123566: {'lr': 3.8337102323690423e-05, 'samples': 23724672, 'steps': 123565, 'loss/train': 1.5742472410202026} 11/07/2021 14:38:29 - INFO - __main__ - Step 123567: {'lr': 3.833427840002982e-05, 'samples': 23724864, 'steps': 123566, 'loss/train': 1.3197537660598755} 11/07/2021 14:38:29 - INFO - __main__ - Step 123568: {'lr': 3.8331454571742474e-05, 'samples': 23725056, 'steps': 123567, 'loss/train': 1.5357941389083862} 11/07/2021 14:38:30 - INFO - __main__ - Step 123569: {'lr': 3.832863083882973e-05, 'samples': 23725248, 'steps': 123568, 'loss/train': 1.2162944078445435} 11/07/2021 14:38:30 - INFO - __main__ - Step 123570: {'lr': 3.8325807201292864e-05, 'samples': 23725440, 'steps': 123569, 'loss/train': 1.09015691280365} 11/07/2021 14:38:31 - INFO - __main__ - Step 123571: {'lr': 3.8322983659133086e-05, 'samples': 23725632, 'steps': 123570, 'loss/train': 1.2296867370605469} 11/07/2021 14:38:31 - INFO - __main__ - Step 123572: {'lr': 3.832016021235174e-05, 'samples': 23725824, 'steps': 123571, 'loss/train': 1.46399986743927} 11/07/2021 14:38:31 - INFO - __main__ - Step 123573: {'lr': 3.83173368609501e-05, 'samples': 23726016, 'steps': 123572, 'loss/train': 1.2047613859176636} 11/07/2021 14:38:33 - INFO - __main__ - Step 123574: {'lr': 3.831451360492935e-05, 'samples': 23726208, 'steps': 123573, 'loss/train': 0.05055724084377289} 11/07/2021 14:38:33 - INFO - __main__ - Step 123575: {'lr': 3.831169044429081e-05, 'samples': 23726400, 'steps': 123574, 'loss/train': 1.2948803901672363} 11/07/2021 14:38:33 - INFO - __main__ - Step 123576: {'lr': 3.830886737903574e-05, 'samples': 23726592, 'steps': 123575, 'loss/train': 1.2395731210708618} 11/07/2021 14:38:34 - INFO - __main__ - Step 123577: {'lr': 3.830604440916546e-05, 'samples': 23726784, 'steps': 123576, 'loss/train': 1.2803195714950562} 11/07/2021 14:38:34 - INFO - __main__ - Step 123578: {'lr': 3.8303221534681186e-05, 'samples': 23726976, 'steps': 123577, 'loss/train': 0.9954777956008911} 11/07/2021 14:38:35 - INFO - __main__ - Step 123579: {'lr': 3.830039875558422e-05, 'samples': 23727168, 'steps': 123578, 'loss/train': 1.3572158813476562} 11/07/2021 14:38:36 - INFO - __main__ - Step 123580: {'lr': 3.829757607187584e-05, 'samples': 23727360, 'steps': 123579, 'loss/train': 1.1624350547790527} 11/07/2021 14:38:36 - INFO - __main__ - Step 123581: {'lr': 3.8294753483557296e-05, 'samples': 23727552, 'steps': 123580, 'loss/train': 1.2668524980545044} 11/07/2021 14:38:36 - INFO - __main__ - Step 123582: {'lr': 3.829193099062986e-05, 'samples': 23727744, 'steps': 123581, 'loss/train': 0.7465823292732239} 11/07/2021 14:38:37 - INFO - __main__ - Step 123583: {'lr': 3.8289108593094815e-05, 'samples': 23727936, 'steps': 123582, 'loss/train': 1.2474790811538696} 11/07/2021 14:38:37 - INFO - __main__ - Step 123584: {'lr': 3.828628629095349e-05, 'samples': 23728128, 'steps': 123583, 'loss/train': 1.3117074966430664} 11/07/2021 14:38:38 - INFO - __main__ - Step 123585: {'lr': 3.828346408420705e-05, 'samples': 23728320, 'steps': 123584, 'loss/train': 0.9974750876426697} 11/07/2021 14:38:39 - INFO - __main__ - Step 123586: {'lr': 3.82806419728568e-05, 'samples': 23728512, 'steps': 123585, 'loss/train': 0.04202243313193321} 11/07/2021 14:38:39 - INFO - __main__ - Step 123587: {'lr': 3.827781995690405e-05, 'samples': 23728704, 'steps': 123586, 'loss/train': 1.2503687143325806} 11/07/2021 14:38:39 - INFO - __main__ - Step 123588: {'lr': 3.827499803635001e-05, 'samples': 23728896, 'steps': 123587, 'loss/train': 0.8718851804733276} 11/07/2021 14:38:40 - INFO - __main__ - Step 123589: {'lr': 3.827217621119603e-05, 'samples': 23729088, 'steps': 123588, 'loss/train': 1.4284353256225586} 11/07/2021 14:38:40 - INFO - __main__ - Step 123590: {'lr': 3.8269354481443306e-05, 'samples': 23729280, 'steps': 123589, 'loss/train': 1.31253182888031} 11/07/2021 14:38:41 - INFO - __main__ - Step 123591: {'lr': 3.8266532847093166e-05, 'samples': 23729472, 'steps': 123590, 'loss/train': 0.8938684463500977} 11/07/2021 14:38:41 - INFO - __main__ - Step 123592: {'lr': 3.826371130814685e-05, 'samples': 23729664, 'steps': 123591, 'loss/train': 1.369842767715454} 11/07/2021 14:38:42 - INFO - __main__ - Step 123593: {'lr': 3.826088986460563e-05, 'samples': 23729856, 'steps': 123592, 'loss/train': 1.0564662218093872} 11/07/2021 14:38:42 - INFO - __main__ - Step 123594: {'lr': 3.825806851647079e-05, 'samples': 23730048, 'steps': 123593, 'loss/train': 1.2006207704544067} 11/07/2021 14:38:42 - INFO - __main__ - Step 123595: {'lr': 3.825524726374366e-05, 'samples': 23730240, 'steps': 123594, 'loss/train': 1.4868650436401367} 11/07/2021 14:38:43 - INFO - __main__ - Step 123596: {'lr': 3.8252426106425405e-05, 'samples': 23730432, 'steps': 123595, 'loss/train': 1.078798770904541} 11/07/2021 14:38:44 - INFO - __main__ - Step 123597: {'lr': 3.82496050445173e-05, 'samples': 23730624, 'steps': 123596, 'loss/train': 1.255081057548523} 11/07/2021 14:38:44 - INFO - __main__ - Step 123598: {'lr': 3.824678407802068e-05, 'samples': 23730816, 'steps': 123597, 'loss/train': 0.38146865367889404} 11/07/2021 14:38:45 - INFO - __main__ - Step 123599: {'lr': 3.82439632069368e-05, 'samples': 23731008, 'steps': 123598, 'loss/train': 1.2696723937988281} 11/07/2021 14:38:45 - INFO - __main__ - Step 123600: {'lr': 3.824114243126692e-05, 'samples': 23731200, 'steps': 123599, 'loss/train': 1.3326215744018555} 11/07/2021 14:38:46 - INFO - __main__ - Step 123601: {'lr': 3.82383217510123e-05, 'samples': 23731392, 'steps': 123600, 'loss/train': 0.9871969223022461} 11/07/2021 14:38:46 - INFO - __main__ - Step 123602: {'lr': 3.823550116617425e-05, 'samples': 23731584, 'steps': 123601, 'loss/train': 1.081164836883545} 11/07/2021 14:38:47 - INFO - __main__ - Step 123603: {'lr': 3.823268067675398e-05, 'samples': 23731776, 'steps': 123602, 'loss/train': 1.1078886985778809} 11/07/2021 14:38:47 - INFO - __main__ - Step 123604: {'lr': 3.822986028275283e-05, 'samples': 23731968, 'steps': 123603, 'loss/train': 1.0026626586914062} 11/07/2021 14:38:47 - INFO - __main__ - Step 123605: {'lr': 3.822703998417201e-05, 'samples': 23732160, 'steps': 123604, 'loss/train': 1.6050324440002441} 11/07/2021 14:38:48 - INFO - __main__ - Step 123606: {'lr': 3.822421978101287e-05, 'samples': 23732352, 'steps': 123605, 'loss/train': 1.5490700006484985} 11/07/2021 14:38:49 - INFO - __main__ - Step 123607: {'lr': 3.822139967327659e-05, 'samples': 23732544, 'steps': 123606, 'loss/train': 1.3390049934387207} 11/07/2021 14:38:49 - INFO - __main__ - Step 123608: {'lr': 3.8218579660964484e-05, 'samples': 23732736, 'steps': 123607, 'loss/train': 1.5125722885131836} 11/07/2021 14:38:49 - INFO - __main__ - Step 123609: {'lr': 3.8215759744077816e-05, 'samples': 23732928, 'steps': 123608, 'loss/train': 0.8441988825798035} 11/07/2021 14:38:50 - INFO - __main__ - Step 123610: {'lr': 3.8212939922617846e-05, 'samples': 23733120, 'steps': 123609, 'loss/train': 1.3330392837524414} 11/07/2021 14:38:51 - INFO - __main__ - Step 123611: {'lr': 3.8210120196585874e-05, 'samples': 23733312, 'steps': 123610, 'loss/train': 1.5627354383468628} 11/07/2021 14:38:51 - INFO - __main__ - Step 123612: {'lr': 3.820730056598315e-05, 'samples': 23733504, 'steps': 123611, 'loss/train': 0.8598341941833496} 11/07/2021 14:38:51 - INFO - __main__ - Step 123613: {'lr': 3.820448103081092e-05, 'samples': 23733696, 'steps': 123612, 'loss/train': 1.2337754964828491} 11/07/2021 14:38:52 - INFO - __main__ - Step 123614: {'lr': 3.8201661591070525e-05, 'samples': 23733888, 'steps': 123613, 'loss/train': 1.5156710147857666} 11/07/2021 14:38:52 - INFO - __main__ - Step 123615: {'lr': 3.8198842246763146e-05, 'samples': 23734080, 'steps': 123614, 'loss/train': 1.3640729188919067} 11/07/2021 14:38:53 - INFO - __main__ - Step 123616: {'lr': 3.819602299789013e-05, 'samples': 23734272, 'steps': 123615, 'loss/train': 0.9656534790992737} 11/07/2021 14:38:54 - INFO - __main__ - Step 123617: {'lr': 3.81932038444528e-05, 'samples': 23734464, 'steps': 123616, 'loss/train': 0.1109231561422348} 11/07/2021 14:38:54 - INFO - __main__ - Step 123618: {'lr': 3.819038478645223e-05, 'samples': 23734656, 'steps': 123617, 'loss/train': 1.5084267854690552} 11/07/2021 14:38:54 - INFO - __main__ - Step 123619: {'lr': 3.818756582388985e-05, 'samples': 23734848, 'steps': 123618, 'loss/train': 1.2719343900680542} 11/07/2021 14:38:55 - INFO - __main__ - Step 123620: {'lr': 3.818474695676685e-05, 'samples': 23735040, 'steps': 123619, 'loss/train': 1.658925175666809} 11/07/2021 14:38:56 - INFO - __main__ - Step 123621: {'lr': 3.8181928185084564e-05, 'samples': 23735232, 'steps': 123620, 'loss/train': 0.04874742776155472} 11/07/2021 14:38:56 - INFO - __main__ - Step 123622: {'lr': 3.8179109508844235e-05, 'samples': 23735424, 'steps': 123621, 'loss/train': 1.2387688159942627} 11/07/2021 14:38:57 - INFO - __main__ - Step 123623: {'lr': 3.817629092804712e-05, 'samples': 23735616, 'steps': 123622, 'loss/train': 1.483825922012329} 11/07/2021 14:38:57 - INFO - __main__ - Step 123624: {'lr': 3.817347244269448e-05, 'samples': 23735808, 'steps': 123623, 'loss/train': 1.2879865169525146} 11/07/2021 14:38:57 - INFO - __main__ - Step 123625: {'lr': 3.817065405278763e-05, 'samples': 23736000, 'steps': 123624, 'loss/train': 1.107292890548706} 11/07/2021 14:38:58 - INFO - __main__ - Step 123626: {'lr': 3.81678357583278e-05, 'samples': 23736192, 'steps': 123625, 'loss/train': 0.8188843131065369} 11/07/2021 14:38:59 - INFO - __main__ - Step 123627: {'lr': 3.816501755931628e-05, 'samples': 23736384, 'steps': 123626, 'loss/train': 1.3588521480560303} 11/07/2021 14:38:59 - INFO - __main__ - Step 123628: {'lr': 3.816219945575433e-05, 'samples': 23736576, 'steps': 123627, 'loss/train': 1.6020469665527344} 11/07/2021 14:38:59 - INFO - __main__ - Step 123629: {'lr': 3.815938144764322e-05, 'samples': 23736768, 'steps': 123628, 'loss/train': 0.6683192253112793} 11/07/2021 14:39:00 - INFO - __main__ - Step 123630: {'lr': 3.815656353498429e-05, 'samples': 23736960, 'steps': 123629, 'loss/train': 0.8613177537918091} 11/07/2021 14:39:01 - INFO - __main__ - Step 123631: {'lr': 3.8153745717778684e-05, 'samples': 23737152, 'steps': 123630, 'loss/train': 1.454590916633606} 11/07/2021 14:39:01 - INFO - __main__ - Step 123632: {'lr': 3.815092799602773e-05, 'samples': 23737344, 'steps': 123631, 'loss/train': 1.4102400541305542} 11/07/2021 14:39:01 - INFO - __main__ - Step 123633: {'lr': 3.81481103697327e-05, 'samples': 23737536, 'steps': 123632, 'loss/train': 1.5472110509872437} 11/07/2021 14:39:02 - INFO - __main__ - Step 123634: {'lr': 3.8145292838894865e-05, 'samples': 23737728, 'steps': 123633, 'loss/train': 1.493431568145752} 11/07/2021 14:39:02 - INFO - __main__ - Step 123635: {'lr': 3.8142475403515506e-05, 'samples': 23737920, 'steps': 123634, 'loss/train': 1.0679391622543335} 11/07/2021 14:39:03 - INFO - __main__ - Step 123636: {'lr': 3.813965806359587e-05, 'samples': 23738112, 'steps': 123635, 'loss/train': 1.2902377843856812} 11/07/2021 14:39:03 - INFO - __main__ - Step 123637: {'lr': 3.813684081913721e-05, 'samples': 23738304, 'steps': 123636, 'loss/train': 1.168334722518921} 11/07/2021 14:39:04 - INFO - __main__ - Step 123638: {'lr': 3.813402367014085e-05, 'samples': 23738496, 'steps': 123637, 'loss/train': 1.4083662033081055} 11/07/2021 14:39:04 - INFO - __main__ - Step 123639: {'lr': 3.813120661660802e-05, 'samples': 23738688, 'steps': 123638, 'loss/train': 1.5709701776504517} 11/07/2021 14:39:05 - INFO - __main__ - Step 123640: {'lr': 3.8128389658539984e-05, 'samples': 23738880, 'steps': 123639, 'loss/train': 1.195874810218811} 11/07/2021 14:39:05 - INFO - __main__ - Step 123641: {'lr': 3.812557279593803e-05, 'samples': 23739072, 'steps': 123640, 'loss/train': 1.0159862041473389} 11/07/2021 14:39:06 - INFO - __main__ - Step 123642: {'lr': 3.8122756028803443e-05, 'samples': 23739264, 'steps': 123641, 'loss/train': 1.266025185585022} 11/07/2021 14:39:06 - INFO - __main__ - Step 123643: {'lr': 3.811993935713753e-05, 'samples': 23739456, 'steps': 123642, 'loss/train': 1.0087603330612183} 11/07/2021 14:39:07 - INFO - __main__ - Step 123644: {'lr': 3.811712278094143e-05, 'samples': 23739648, 'steps': 123643, 'loss/train': 1.1959948539733887} 11/07/2021 14:39:07 - INFO - __main__ - Step 123645: {'lr': 3.811430630021648e-05, 'samples': 23739840, 'steps': 123644, 'loss/train': 1.2845485210418701} 11/07/2021 14:39:07 - INFO - __main__ - Step 123646: {'lr': 3.8111489914963966e-05, 'samples': 23740032, 'steps': 123645, 'loss/train': 1.1343269348144531} 11/07/2021 14:39:08 - INFO - __main__ - Step 123647: {'lr': 3.8108673625185135e-05, 'samples': 23740224, 'steps': 123646, 'loss/train': 1.167353868484497} 11/07/2021 14:39:09 - INFO - __main__ - Step 123648: {'lr': 3.8105857430881296e-05, 'samples': 23740416, 'steps': 123647, 'loss/train': 1.4599579572677612} 11/07/2021 14:39:09 - INFO - __main__ - Step 123649: {'lr': 3.810304133205367e-05, 'samples': 23740608, 'steps': 123648, 'loss/train': 1.4943915605545044} 11/07/2021 14:39:09 - INFO - __main__ - Step 123650: {'lr': 3.810022532870352e-05, 'samples': 23740800, 'steps': 123649, 'loss/train': 1.4623425006866455} 11/07/2021 14:39:10 - INFO - __main__ - Step 123651: {'lr': 3.8097409420832176e-05, 'samples': 23740992, 'steps': 123650, 'loss/train': 1.0941967964172363} 11/07/2021 14:39:11 - INFO - __main__ - Step 123652: {'lr': 3.8094593608440836e-05, 'samples': 23741184, 'steps': 123651, 'loss/train': 0.6693841218948364} 11/07/2021 14:39:11 - INFO - __main__ - Step 123653: {'lr': 3.809177789153082e-05, 'samples': 23741376, 'steps': 123652, 'loss/train': 1.6143229007720947} 11/07/2021 14:39:11 - INFO - __main__ - Step 123654: {'lr': 3.808896227010339e-05, 'samples': 23741568, 'steps': 123653, 'loss/train': 0.8997108936309814} 11/07/2021 14:39:12 - INFO - __main__ - Step 123655: {'lr': 3.808614674415978e-05, 'samples': 23741760, 'steps': 123654, 'loss/train': 1.511238932609558} 11/07/2021 14:39:12 - INFO - __main__ - Step 123656: {'lr': 3.808333131370134e-05, 'samples': 23741952, 'steps': 123655, 'loss/train': 1.0421757698059082} 11/07/2021 14:39:14 - INFO - __main__ - Step 123657: {'lr': 3.808051597872925e-05, 'samples': 23742144, 'steps': 123656, 'loss/train': 0.9226675033569336} 11/07/2021 14:39:14 - INFO - __main__ - Step 123658: {'lr': 3.8077700739244796e-05, 'samples': 23742336, 'steps': 123657, 'loss/train': 1.4491806030273438} 11/07/2021 14:39:14 - INFO - __main__ - Step 123659: {'lr': 3.807488559524924e-05, 'samples': 23742528, 'steps': 123658, 'loss/train': 1.5329082012176514} 11/07/2021 14:39:15 - INFO - __main__ - Step 123660: {'lr': 3.8072070546743886e-05, 'samples': 23742720, 'steps': 123659, 'loss/train': 0.9434036016464233} 11/07/2021 14:39:15 - INFO - __main__ - Step 123661: {'lr': 3.806925559373001e-05, 'samples': 23742912, 'steps': 123660, 'loss/train': 5.696948528289795} 11/07/2021 14:39:16 - INFO - __main__ - Step 123662: {'lr': 3.8066440736208825e-05, 'samples': 23743104, 'steps': 123661, 'loss/train': 1.0879889726638794} 11/07/2021 14:39:17 - INFO - __main__ - Step 123663: {'lr': 3.806362597418164e-05, 'samples': 23743296, 'steps': 123662, 'loss/train': 0.6312156915664673} 11/07/2021 14:39:17 - INFO - __main__ - Step 123664: {'lr': 3.8060811307649715e-05, 'samples': 23743488, 'steps': 123663, 'loss/train': 1.3689481019973755} 11/07/2021 14:39:17 - INFO - __main__ - Step 123665: {'lr': 3.805799673661431e-05, 'samples': 23743680, 'steps': 123664, 'loss/train': 0.41247618198394775} 11/07/2021 14:39:18 - INFO - __main__ - Step 123666: {'lr': 3.805518226107671e-05, 'samples': 23743872, 'steps': 123665, 'loss/train': 1.156624436378479} 11/07/2021 14:39:18 - INFO - __main__ - Step 123667: {'lr': 3.8052367881038194e-05, 'samples': 23744064, 'steps': 123666, 'loss/train': 1.12260103225708} 11/07/2021 14:39:18 - INFO - __main__ - Step 123668: {'lr': 3.8049553596499975e-05, 'samples': 23744256, 'steps': 123667, 'loss/train': 1.4275197982788086} 11/07/2021 14:39:20 - INFO - __main__ - Step 123669: {'lr': 3.804673940746339e-05, 'samples': 23744448, 'steps': 123668, 'loss/train': 0.22954192757606506} 11/07/2021 14:39:20 - INFO - __main__ - Step 123670: {'lr': 3.8043925313929694e-05, 'samples': 23744640, 'steps': 123669, 'loss/train': 1.1116480827331543} 11/07/2021 14:39:20 - INFO - __main__ - Step 123671: {'lr': 3.80411113159001e-05, 'samples': 23744832, 'steps': 123670, 'loss/train': 1.3122652769088745} 11/07/2021 14:39:21 - INFO - __main__ - Step 123672: {'lr': 3.8038297413375914e-05, 'samples': 23745024, 'steps': 123671, 'loss/train': 1.2212785482406616} 11/07/2021 14:39:21 - INFO - __main__ - Step 123673: {'lr': 3.803548360635839e-05, 'samples': 23745216, 'steps': 123672, 'loss/train': 1.052146077156067} 11/07/2021 14:39:22 - INFO - __main__ - Step 123674: {'lr': 3.8032669894848826e-05, 'samples': 23745408, 'steps': 123673, 'loss/train': 1.3836643695831299} 11/07/2021 14:39:23 - INFO - __main__ - Step 123675: {'lr': 3.802985627884844e-05, 'samples': 23745600, 'steps': 123674, 'loss/train': 1.465770959854126} 11/07/2021 14:39:23 - INFO - __main__ - Step 123676: {'lr': 3.8027042758358554e-05, 'samples': 23745792, 'steps': 123675, 'loss/train': 0.6095297336578369} 11/07/2021 14:39:23 - INFO - __main__ - Step 123677: {'lr': 3.80242293333804e-05, 'samples': 23745984, 'steps': 123676, 'loss/train': 1.469650149345398} 11/07/2021 14:39:24 - INFO - __main__ - Step 123678: {'lr': 3.802141600391526e-05, 'samples': 23746176, 'steps': 123677, 'loss/train': 0.37643134593963623} 11/07/2021 14:39:25 - INFO - __main__ - Step 123679: {'lr': 3.801860276996438e-05, 'samples': 23746368, 'steps': 123678, 'loss/train': 1.1226555109024048} 11/07/2021 14:39:25 - INFO - __main__ - Step 123680: {'lr': 3.8015789631529076e-05, 'samples': 23746560, 'steps': 123679, 'loss/train': 1.382102608680725} 11/07/2021 14:39:25 - INFO - __main__ - Step 123681: {'lr': 3.801297658861058e-05, 'samples': 23746752, 'steps': 123680, 'loss/train': 1.2776364088058472} 11/07/2021 14:39:26 - INFO - __main__ - Step 123682: {'lr': 3.801016364121016e-05, 'samples': 23746944, 'steps': 123681, 'loss/train': 1.1181484460830688} 11/07/2021 14:39:26 - INFO - __main__ - Step 123683: {'lr': 3.8007350789329154e-05, 'samples': 23747136, 'steps': 123682, 'loss/train': 1.3482768535614014} 11/07/2021 14:39:27 - INFO - __main__ - Step 123684: {'lr': 3.8004538032968686e-05, 'samples': 23747328, 'steps': 123683, 'loss/train': 1.329625129699707} 11/07/2021 14:39:27 - INFO - __main__ - Step 123685: {'lr': 3.8001725372130116e-05, 'samples': 23747520, 'steps': 123684, 'loss/train': 1.104002594947815} 11/07/2021 14:39:28 - INFO - __main__ - Step 123686: {'lr': 3.79989128068147e-05, 'samples': 23747712, 'steps': 123685, 'loss/train': 1.5085570812225342} 11/07/2021 14:39:28 - INFO - __main__ - Step 123687: {'lr': 3.799610033702369e-05, 'samples': 23747904, 'steps': 123686, 'loss/train': 1.6680861711502075} 11/07/2021 14:39:29 - INFO - __main__ - Step 123688: {'lr': 3.799328796275839e-05, 'samples': 23748096, 'steps': 123687, 'loss/train': 1.4230059385299683} 11/07/2021 14:39:30 - INFO - __main__ - Step 123689: {'lr': 3.799047568402003e-05, 'samples': 23748288, 'steps': 123688, 'loss/train': 0.9517614245414734} 11/07/2021 14:39:30 - INFO - __main__ - Step 123690: {'lr': 3.798766350080987e-05, 'samples': 23748480, 'steps': 123689, 'loss/train': 0.40652695298194885} 11/07/2021 14:39:30 - INFO - __main__ - Step 123691: {'lr': 3.798485141312924e-05, 'samples': 23748672, 'steps': 123690, 'loss/train': 1.1540038585662842} 11/07/2021 14:39:31 - INFO - __main__ - Step 123692: {'lr': 3.798203942097933e-05, 'samples': 23748864, 'steps': 123691, 'loss/train': 1.1478813886642456} 11/07/2021 14:39:31 - INFO - __main__ - Step 123693: {'lr': 3.797922752436145e-05, 'samples': 23749056, 'steps': 123692, 'loss/train': 1.5288389921188354} 11/07/2021 14:39:32 - INFO - __main__ - Step 123694: {'lr': 3.797641572327687e-05, 'samples': 23749248, 'steps': 123693, 'loss/train': 1.3779757022857666} 11/07/2021 14:39:32 - INFO - __main__ - Step 123695: {'lr': 3.797360401772682e-05, 'samples': 23749440, 'steps': 123694, 'loss/train': 1.0849860906600952} 11/07/2021 14:39:33 - INFO - __main__ - Step 123696: {'lr': 3.797079240771262e-05, 'samples': 23749632, 'steps': 123695, 'loss/train': 1.0114340782165527} 11/07/2021 14:39:33 - INFO - __main__ - Step 123697: {'lr': 3.796798089323556e-05, 'samples': 23749824, 'steps': 123696, 'loss/train': 1.4808872938156128} 11/07/2021 14:39:33 - INFO - __main__ - Step 123698: {'lr': 3.796516947429679e-05, 'samples': 23750016, 'steps': 123697, 'loss/train': 1.0311126708984375} 11/07/2021 14:39:35 - INFO - __main__ - Step 123699: {'lr': 3.796235815089763e-05, 'samples': 23750208, 'steps': 123698, 'loss/train': 1.419861912727356} 11/07/2021 14:39:35 - INFO - __main__ - Step 123700: {'lr': 3.7959546923039376e-05, 'samples': 23750400, 'steps': 123699, 'loss/train': 1.1373920440673828} 11/07/2021 14:39:35 - INFO - __main__ - Step 123701: {'lr': 3.7956735790723286e-05, 'samples': 23750592, 'steps': 123700, 'loss/train': 1.34999418258667} 11/07/2021 14:39:36 - INFO - __main__ - Step 123702: {'lr': 3.7953924753950594e-05, 'samples': 23750784, 'steps': 123701, 'loss/train': 0.06340421736240387} 11/07/2021 14:39:36 - INFO - __main__ - Step 123703: {'lr': 3.795111381272262e-05, 'samples': 23750976, 'steps': 123702, 'loss/train': 0.7796374559402466} 11/07/2021 14:39:37 - INFO - __main__ - Step 123704: {'lr': 3.794830296704058e-05, 'samples': 23751168, 'steps': 123703, 'loss/train': 1.2795045375823975} 11/07/2021 14:39:37 - INFO - __main__ - Step 123705: {'lr': 3.794549221690577e-05, 'samples': 23751360, 'steps': 123704, 'loss/train': 1.203372836112976} 11/07/2021 14:39:38 - INFO - __main__ - Step 123706: {'lr': 3.794268156231945e-05, 'samples': 23751552, 'steps': 123705, 'loss/train': 1.2219563722610474} 11/07/2021 14:39:38 - INFO - __main__ - Step 123707: {'lr': 3.793987100328289e-05, 'samples': 23751744, 'steps': 123706, 'loss/train': 1.6081792116165161} 11/07/2021 14:39:39 - INFO - __main__ - Step 123708: {'lr': 3.793706053979734e-05, 'samples': 23751936, 'steps': 123707, 'loss/train': 0.7222334146499634} 11/07/2021 14:39:40 - INFO - __main__ - Step 123709: {'lr': 3.793425017186411e-05, 'samples': 23752128, 'steps': 123708, 'loss/train': 1.290756344795227} 11/07/2021 14:39:40 - INFO - __main__ - Step 123710: {'lr': 3.7931439899484474e-05, 'samples': 23752320, 'steps': 123709, 'loss/train': 0.9110608696937561} 11/07/2021 14:39:40 - INFO - __main__ - Step 123711: {'lr': 3.792862972265959e-05, 'samples': 23752512, 'steps': 123710, 'loss/train': 1.3923823833465576} 11/07/2021 14:39:41 - INFO - __main__ - Step 123712: {'lr': 3.792581964139078e-05, 'samples': 23752704, 'steps': 123711, 'loss/train': 1.941296935081482} 11/07/2021 14:39:41 - INFO - __main__ - Step 123713: {'lr': 3.7923009655679355e-05, 'samples': 23752896, 'steps': 123712, 'loss/train': 1.8464035987854004} 11/07/2021 14:39:41 - INFO - __main__ - Step 123714: {'lr': 3.792019976552652e-05, 'samples': 23753088, 'steps': 123713, 'loss/train': 1.1476998329162598} 11/07/2021 14:39:42 - INFO - __main__ - Step 123715: {'lr': 3.791738997093361e-05, 'samples': 23753280, 'steps': 123714, 'loss/train': 1.0879281759262085} 11/07/2021 14:39:43 - INFO - __main__ - Step 123716: {'lr': 3.791458027190181e-05, 'samples': 23753472, 'steps': 123715, 'loss/train': 0.9682210087776184} 11/07/2021 14:39:43 - INFO - __main__ - Step 123717: {'lr': 3.791177066843246e-05, 'samples': 23753664, 'steps': 123716, 'loss/train': 0.3583618402481079} 11/07/2021 14:39:43 - INFO - __main__ - Step 123718: {'lr': 3.7908961160526776e-05, 'samples': 23753856, 'steps': 123717, 'loss/train': 1.2693299055099487} 11/07/2021 14:39:44 - INFO - __main__ - Step 123719: {'lr': 3.790615174818604e-05, 'samples': 23754048, 'steps': 123718, 'loss/train': 1.2915587425231934} 11/07/2021 14:39:45 - INFO - __main__ - Step 123720: {'lr': 3.790334243141153e-05, 'samples': 23754240, 'steps': 123719, 'loss/train': 1.6515159606933594} 11/07/2021 14:39:45 - INFO - __main__ - Step 123721: {'lr': 3.790053321020448e-05, 'samples': 23754432, 'steps': 123720, 'loss/train': 1.252691388130188} 11/07/2021 14:39:46 - INFO - __main__ - Step 123722: {'lr': 3.7897724084566184e-05, 'samples': 23754624, 'steps': 123721, 'loss/train': 1.092734694480896} 11/07/2021 14:39:46 - INFO - __main__ - Step 123723: {'lr': 3.7894915054497906e-05, 'samples': 23754816, 'steps': 123722, 'loss/train': 0.393417626619339} 11/07/2021 14:39:46 - INFO - __main__ - Step 123724: {'lr': 3.7892106120000966e-05, 'samples': 23755008, 'steps': 123723, 'loss/train': 1.05901300907135} 11/07/2021 14:39:47 - INFO - __main__ - Step 123725: {'lr': 3.7889297281076515e-05, 'samples': 23755200, 'steps': 123724, 'loss/train': 1.4557499885559082} 11/07/2021 14:39:48 - INFO - __main__ - Step 123726: {'lr': 3.788648853772589e-05, 'samples': 23755392, 'steps': 123725, 'loss/train': 0.7439761757850647} 11/07/2021 14:39:48 - INFO - __main__ - Step 123727: {'lr': 3.788367988995031e-05, 'samples': 23755584, 'steps': 123726, 'loss/train': 1.1778020858764648} 11/07/2021 14:39:48 - INFO - __main__ - Step 123728: {'lr': 3.788087133775109e-05, 'samples': 23755776, 'steps': 123727, 'loss/train': 0.6353368163108826} 11/07/2021 14:39:49 - INFO - __main__ - Step 123729: {'lr': 3.787806288112947e-05, 'samples': 23755968, 'steps': 123728, 'loss/train': 0.5402214527130127} 11/07/2021 14:39:50 - INFO - __main__ - Step 123730: {'lr': 3.7875254520086694e-05, 'samples': 23756160, 'steps': 123729, 'loss/train': 1.2664440870285034} 11/07/2021 14:39:50 - INFO - __main__ - Step 123731: {'lr': 3.787244625462411e-05, 'samples': 23756352, 'steps': 123730, 'loss/train': 1.4460911750793457} 11/07/2021 14:39:51 - INFO - __main__ - Step 123732: {'lr': 3.786963808474289e-05, 'samples': 23756544, 'steps': 123731, 'loss/train': 1.3389378786087036} 11/07/2021 14:39:51 - INFO - __main__ - Step 123733: {'lr': 3.786683001044433e-05, 'samples': 23756736, 'steps': 123732, 'loss/train': 0.8533901572227478} 11/07/2021 14:39:51 - INFO - __main__ - Step 123734: {'lr': 3.786402203172973e-05, 'samples': 23756928, 'steps': 123733, 'loss/train': 1.7357763051986694} 11/07/2021 14:39:52 - INFO - __main__ - Step 123735: {'lr': 3.7861214148600303e-05, 'samples': 23757120, 'steps': 123734, 'loss/train': 1.2546461820602417} 11/07/2021 14:39:53 - INFO - __main__ - Step 123736: {'lr': 3.785840636105736e-05, 'samples': 23757312, 'steps': 123735, 'loss/train': 1.6998769044876099} 11/07/2021 14:39:53 - INFO - __main__ - Step 123737: {'lr': 3.785559866910221e-05, 'samples': 23757504, 'steps': 123736, 'loss/train': 1.3901814222335815} 11/07/2021 14:39:53 - INFO - __main__ - Step 123738: {'lr': 3.785279107273598e-05, 'samples': 23757696, 'steps': 123737, 'loss/train': 1.3627398014068604} 11/07/2021 14:39:54 - INFO - __main__ - Step 123739: {'lr': 3.784998357196001e-05, 'samples': 23757888, 'steps': 123738, 'loss/train': 1.6091084480285645} 11/07/2021 14:39:55 - INFO - __main__ - Step 123740: {'lr': 3.784717616677555e-05, 'samples': 23758080, 'steps': 123739, 'loss/train': 1.5364869832992554} 11/07/2021 14:39:55 - INFO - __main__ - Step 123741: {'lr': 3.78443688571839e-05, 'samples': 23758272, 'steps': 123740, 'loss/train': 0.690824031829834} 11/07/2021 14:39:56 - INFO - __main__ - Step 123742: {'lr': 3.7841561643186303e-05, 'samples': 23758464, 'steps': 123741, 'loss/train': 1.4222776889801025} 11/07/2021 14:39:56 - INFO - __main__ - Step 123743: {'lr': 3.783875452478403e-05, 'samples': 23758656, 'steps': 123742, 'loss/train': 0.9577237367630005} 11/07/2021 14:39:56 - INFO - __main__ - Step 123744: {'lr': 3.783594750197833e-05, 'samples': 23758848, 'steps': 123743, 'loss/train': 0.7748436331748962} 11/07/2021 14:39:57 - INFO - __main__ - Step 123745: {'lr': 3.783314057477047e-05, 'samples': 23759040, 'steps': 123744, 'loss/train': 1.4320605993270874} 11/07/2021 14:39:58 - INFO - __main__ - Step 123746: {'lr': 3.7830333743161723e-05, 'samples': 23759232, 'steps': 123745, 'loss/train': 1.4957414865493774} 11/07/2021 14:39:58 - INFO - __main__ - Step 123747: {'lr': 3.782752700715336e-05, 'samples': 23759424, 'steps': 123746, 'loss/train': 0.7999776601791382} 11/07/2021 14:39:58 - INFO - __main__ - Step 123748: {'lr': 3.782472036674664e-05, 'samples': 23759616, 'steps': 123747, 'loss/train': 1.198608636856079} 11/07/2021 14:39:59 - INFO - __main__ - Step 123749: {'lr': 3.7821913821942835e-05, 'samples': 23759808, 'steps': 123748, 'loss/train': 1.262265920639038} 11/07/2021 14:40:00 - INFO - __main__ - Step 123750: {'lr': 3.781910737274319e-05, 'samples': 23760000, 'steps': 123749, 'loss/train': 1.5550798177719116} 11/07/2021 14:40:00 - INFO - __main__ - Step 123751: {'lr': 3.781630101914904e-05, 'samples': 23760192, 'steps': 123750, 'loss/train': 1.2871577739715576} 11/07/2021 14:40:00 - INFO - __main__ - Step 123752: {'lr': 3.781349476116156e-05, 'samples': 23760384, 'steps': 123751, 'loss/train': 1.2344105243682861} 11/07/2021 14:40:01 - INFO - __main__ - Step 123753: {'lr': 3.7810688598782004e-05, 'samples': 23760576, 'steps': 123752, 'loss/train': 1.4944090843200684} 11/07/2021 14:40:01 - INFO - __main__ - Step 123754: {'lr': 3.78078825320117e-05, 'samples': 23760768, 'steps': 123753, 'loss/train': 2.118304967880249} 11/07/2021 14:40:02 - INFO - __main__ - Step 123755: {'lr': 3.7805076560851884e-05, 'samples': 23760960, 'steps': 123754, 'loss/train': 1.3762543201446533} 11/07/2021 14:40:02 - INFO - __main__ - Step 123756: {'lr': 3.780227068530381e-05, 'samples': 23761152, 'steps': 123755, 'loss/train': 0.6102837920188904} 11/07/2021 14:40:03 - INFO - __main__ - Step 123757: {'lr': 3.779946490536879e-05, 'samples': 23761344, 'steps': 123756, 'loss/train': 0.6588653922080994} 11/07/2021 14:40:03 - INFO - __main__ - Step 123758: {'lr': 3.7796659221048025e-05, 'samples': 23761536, 'steps': 123757, 'loss/train': 1.3655476570129395} 11/07/2021 14:40:03 - INFO - __main__ - Step 123759: {'lr': 3.7793853632342834e-05, 'samples': 23761728, 'steps': 123758, 'loss/train': 1.0252963304519653} 11/07/2021 14:40:04 - INFO - __main__ - Step 123760: {'lr': 3.779104813925446e-05, 'samples': 23761920, 'steps': 123759, 'loss/train': 1.3828283548355103} 11/07/2021 14:40:05 - INFO - __main__ - Step 123761: {'lr': 3.7788242741784164e-05, 'samples': 23762112, 'steps': 123760, 'loss/train': 0.6294505596160889} 11/07/2021 14:40:05 - INFO - __main__ - Step 123762: {'lr': 3.778543743993318e-05, 'samples': 23762304, 'steps': 123761, 'loss/train': 1.4743226766586304} 11/07/2021 14:40:06 - INFO - __main__ - Step 123763: {'lr': 3.778263223370285e-05, 'samples': 23762496, 'steps': 123762, 'loss/train': 1.313908576965332} 11/07/2021 14:40:06 - INFO - __main__ - Step 123764: {'lr': 3.7779827123094413e-05, 'samples': 23762688, 'steps': 123763, 'loss/train': 1.195605754852295} 11/07/2021 14:40:06 - INFO - __main__ - Step 123765: {'lr': 3.777702210810907e-05, 'samples': 23762880, 'steps': 123764, 'loss/train': 1.3171424865722656} 11/07/2021 14:40:07 - INFO - __main__ - Step 123766: {'lr': 3.777421718874813e-05, 'samples': 23763072, 'steps': 123765, 'loss/train': 1.31455659866333} 11/07/2021 14:40:08 - INFO - __main__ - Step 123767: {'lr': 3.777141236501283e-05, 'samples': 23763264, 'steps': 123766, 'loss/train': 1.0116620063781738} 11/07/2021 14:40:08 - INFO - __main__ - Step 123768: {'lr': 3.7768607636904485e-05, 'samples': 23763456, 'steps': 123767, 'loss/train': 1.4009454250335693} 11/07/2021 14:40:08 - INFO - __main__ - Step 123769: {'lr': 3.776580300442431e-05, 'samples': 23763648, 'steps': 123768, 'loss/train': 0.6373777389526367} 11/07/2021 14:40:09 - INFO - __main__ - Step 123770: {'lr': 3.776299846757358e-05, 'samples': 23763840, 'steps': 123769, 'loss/train': 1.7515697479248047} 11/07/2021 14:40:10 - INFO - __main__ - Step 123771: {'lr': 3.776019402635358e-05, 'samples': 23764032, 'steps': 123770, 'loss/train': 1.309158444404602} 11/07/2021 14:40:10 - INFO - __main__ - Step 123772: {'lr': 3.7757389680765586e-05, 'samples': 23764224, 'steps': 123771, 'loss/train': 1.0063788890838623} 11/07/2021 14:40:11 - INFO - __main__ - Step 123773: {'lr': 3.7754585430810815e-05, 'samples': 23764416, 'steps': 123772, 'loss/train': 1.1746171712875366} 11/07/2021 14:40:11 - INFO - __main__ - Step 123774: {'lr': 3.775178127649056e-05, 'samples': 23764608, 'steps': 123773, 'loss/train': 0.6375904679298401} 11/07/2021 14:40:12 - INFO - __main__ - Step 123775: {'lr': 3.774897721780607e-05, 'samples': 23764800, 'steps': 123774, 'loss/train': 0.08430052548646927} 11/07/2021 14:40:13 - INFO - __main__ - Step 123776: {'lr': 3.77461732547586e-05, 'samples': 23764992, 'steps': 123775, 'loss/train': 1.4977586269378662} 11/07/2021 14:40:13 - INFO - __main__ - Step 123777: {'lr': 3.774336938734951e-05, 'samples': 23765184, 'steps': 123776, 'loss/train': 1.2262541055679321} 11/07/2021 14:40:13 - INFO - __main__ - Step 123778: {'lr': 3.774056561557993e-05, 'samples': 23765376, 'steps': 123777, 'loss/train': 0.703905463218689} 11/07/2021 14:40:14 - INFO - __main__ - Step 123779: {'lr': 3.7737761939451163e-05, 'samples': 23765568, 'steps': 123778, 'loss/train': 0.8188656568527222} 11/07/2021 14:40:14 - INFO - __main__ - Step 123780: {'lr': 3.773495835896448e-05, 'samples': 23765760, 'steps': 123779, 'loss/train': 1.3945411443710327} 11/07/2021 14:40:15 - INFO - __main__ - Step 123781: {'lr': 3.7732154874121154e-05, 'samples': 23765952, 'steps': 123780, 'loss/train': 0.9637452960014343} 11/07/2021 14:40:16 - INFO - __main__ - Step 123782: {'lr': 3.772935148492246e-05, 'samples': 23766144, 'steps': 123781, 'loss/train': 1.546830177307129} 11/07/2021 14:40:16 - INFO - __main__ - Step 123783: {'lr': 3.7726548191369615e-05, 'samples': 23766336, 'steps': 123782, 'loss/train': 1.4267934560775757} 11/07/2021 14:40:16 - INFO - __main__ - Step 123784: {'lr': 3.7723744993463924e-05, 'samples': 23766528, 'steps': 123783, 'loss/train': 1.162623405456543} 11/07/2021 14:40:17 - INFO - __main__ - Step 123785: {'lr': 3.772094189120664e-05, 'samples': 23766720, 'steps': 123784, 'loss/train': 0.9351767897605896} 11/07/2021 14:40:18 - INFO - __main__ - Step 123786: {'lr': 3.7718138884599046e-05, 'samples': 23766912, 'steps': 123785, 'loss/train': 0.9213583469390869} 11/07/2021 14:40:18 - INFO - __main__ - Step 123787: {'lr': 3.7715335973642354e-05, 'samples': 23767104, 'steps': 123786, 'loss/train': 1.1891342401504517} 11/07/2021 14:40:18 - INFO - __main__ - Step 123788: {'lr': 3.7712533158337867e-05, 'samples': 23767296, 'steps': 123787, 'loss/train': 1.2625339031219482} 11/07/2021 14:40:19 - INFO - __main__ - Step 123789: {'lr': 3.770973043868683e-05, 'samples': 23767488, 'steps': 123788, 'loss/train': 1.4624409675598145} 11/07/2021 14:40:19 - INFO - __main__ - Step 123790: {'lr': 3.770692781469051e-05, 'samples': 23767680, 'steps': 123789, 'loss/train': 1.2800487279891968} 11/07/2021 14:40:21 - INFO - __main__ - Step 123791: {'lr': 3.770412528635023e-05, 'samples': 23767872, 'steps': 123790, 'loss/train': 1.3170620203018188} 11/07/2021 14:40:21 - INFO - __main__ - Step 123792: {'lr': 3.7701322853667144e-05, 'samples': 23768064, 'steps': 123791, 'loss/train': 1.2868746519088745} 11/07/2021 14:40:21 - INFO - __main__ - Step 123793: {'lr': 3.7698520516642576e-05, 'samples': 23768256, 'steps': 123792, 'loss/train': 1.2587931156158447} 11/07/2021 14:40:22 - INFO - __main__ - Step 123794: {'lr': 3.769571827527776e-05, 'samples': 23768448, 'steps': 123793, 'loss/train': 1.4732331037521362} 11/07/2021 14:40:22 - INFO - __main__ - Step 123795: {'lr': 3.769291612957398e-05, 'samples': 23768640, 'steps': 123794, 'loss/train': 1.3384877443313599} 11/07/2021 14:40:23 - INFO - __main__ - Step 123796: {'lr': 3.7690114079532516e-05, 'samples': 23768832, 'steps': 123795, 'loss/train': 0.999938428401947} 11/07/2021 14:40:24 - INFO - __main__ - Step 123797: {'lr': 3.768731212515458e-05, 'samples': 23769024, 'steps': 123796, 'loss/train': 1.2410162687301636} 11/07/2021 14:40:24 - INFO - __main__ - Step 123798: {'lr': 3.768451026644149e-05, 'samples': 23769216, 'steps': 123797, 'loss/train': 0.574456512928009} 11/07/2021 14:40:24 - INFO - __main__ - Step 123799: {'lr': 3.7681708503394476e-05, 'samples': 23769408, 'steps': 123798, 'loss/train': 1.2467210292816162} 11/07/2021 14:40:25 - INFO - __main__ - Step 123800: {'lr': 3.7678906836014796e-05, 'samples': 23769600, 'steps': 123799, 'loss/train': 1.461730718612671} 11/07/2021 14:40:25 - INFO - __main__ - Step 123801: {'lr': 3.767610526430373e-05, 'samples': 23769792, 'steps': 123800, 'loss/train': 1.6309415102005005} 11/07/2021 14:40:26 - INFO - __main__ - Step 123802: {'lr': 3.767330378826256e-05, 'samples': 23769984, 'steps': 123801, 'loss/train': 0.9593573808670044} 11/07/2021 14:40:27 - INFO - __main__ - Step 123803: {'lr': 3.7670502407892494e-05, 'samples': 23770176, 'steps': 123802, 'loss/train': 1.3063265085220337} 11/07/2021 14:40:27 - INFO - __main__ - Step 123804: {'lr': 3.76677011231949e-05, 'samples': 23770368, 'steps': 123803, 'loss/train': 1.6266793012619019} 11/07/2021 14:40:27 - INFO - __main__ - Step 123805: {'lr': 3.766489993417088e-05, 'samples': 23770560, 'steps': 123804, 'loss/train': 1.5942316055297852} 11/07/2021 14:40:28 - INFO - __main__ - Step 123806: {'lr': 3.7662098840821805e-05, 'samples': 23770752, 'steps': 123805, 'loss/train': 0.6427563428878784} 11/07/2021 14:40:28 - INFO - __main__ - Step 123807: {'lr': 3.7659297843148893e-05, 'samples': 23770944, 'steps': 123806, 'loss/train': 1.5071848630905151} 11/07/2021 14:40:29 - INFO - __main__ - Step 123808: {'lr': 3.765649694115344e-05, 'samples': 23771136, 'steps': 123807, 'loss/train': 0.9745342135429382} 11/07/2021 14:40:29 - INFO - __main__ - Step 123809: {'lr': 3.7653696134836685e-05, 'samples': 23771328, 'steps': 123808, 'loss/train': 1.1881744861602783} 11/07/2021 14:40:30 - INFO - __main__ - Step 123810: {'lr': 3.765089542419989e-05, 'samples': 23771520, 'steps': 123809, 'loss/train': 1.7710065841674805} 11/07/2021 14:40:30 - INFO - __main__ - Step 123811: {'lr': 3.7648094809244334e-05, 'samples': 23771712, 'steps': 123810, 'loss/train': 1.1073514223098755} 11/07/2021 14:40:30 - INFO - __main__ - Step 123812: {'lr': 3.764529428997127e-05, 'samples': 23771904, 'steps': 123811, 'loss/train': 1.0807217359542847} 11/07/2021 14:40:31 - INFO - __main__ - Step 123813: {'lr': 3.764249386638196e-05, 'samples': 23772096, 'steps': 123812, 'loss/train': 1.3934988975524902} 11/07/2021 14:40:32 - INFO - __main__ - Step 123814: {'lr': 3.7639693538477654e-05, 'samples': 23772288, 'steps': 123813, 'loss/train': 1.392153024673462} 11/07/2021 14:40:32 - INFO - __main__ - Step 123815: {'lr': 3.7636893306259636e-05, 'samples': 23772480, 'steps': 123814, 'loss/train': 1.111687421798706} 11/07/2021 14:40:32 - INFO - __main__ - Step 123816: {'lr': 3.7634093169729153e-05, 'samples': 23772672, 'steps': 123815, 'loss/train': 1.1958284378051758} 11/07/2021 14:40:33 - INFO - __main__ - Step 123817: {'lr': 3.763129312888747e-05, 'samples': 23772864, 'steps': 123816, 'loss/train': 1.0418272018432617} 11/07/2021 14:40:34 - INFO - __main__ - Step 123818: {'lr': 3.762849318373593e-05, 'samples': 23773056, 'steps': 123817, 'loss/train': 1.643347978591919} 11/07/2021 14:40:34 - INFO - __main__ - Step 123819: {'lr': 3.7625693334275626e-05, 'samples': 23773248, 'steps': 123818, 'loss/train': 0.8152331113815308} 11/07/2021 14:40:35 - INFO - __main__ - Step 123820: {'lr': 3.7622893580507914e-05, 'samples': 23773440, 'steps': 123819, 'loss/train': 1.31644868850708} 11/07/2021 14:40:35 - INFO - __main__ - Step 123821: {'lr': 3.762009392243407e-05, 'samples': 23773632, 'steps': 123820, 'loss/train': 1.0198283195495605} 11/07/2021 14:40:35 - INFO - __main__ - Step 123822: {'lr': 3.7617294360055315e-05, 'samples': 23773824, 'steps': 123821, 'loss/train': 1.2740265130996704} 11/07/2021 14:40:36 - INFO - __main__ - Step 123823: {'lr': 3.761449489337293e-05, 'samples': 23774016, 'steps': 123822, 'loss/train': 1.2630970478057861} 11/07/2021 14:40:37 - INFO - __main__ - Step 123824: {'lr': 3.761169552238816e-05, 'samples': 23774208, 'steps': 123823, 'loss/train': 1.4546664953231812} 11/07/2021 14:40:37 - INFO - __main__ - Step 123825: {'lr': 3.7608896247102314e-05, 'samples': 23774400, 'steps': 123824, 'loss/train': 1.4530880451202393} 11/07/2021 14:40:37 - INFO - __main__ - Step 123826: {'lr': 3.760609706751661e-05, 'samples': 23774592, 'steps': 123825, 'loss/train': 1.101058840751648} 11/07/2021 14:40:38 - INFO - __main__ - Step 123827: {'lr': 3.760329798363232e-05, 'samples': 23774784, 'steps': 123826, 'loss/train': 5.683681488037109} 11/07/2021 14:40:38 - INFO - __main__ - Step 123828: {'lr': 3.7600498995450705e-05, 'samples': 23774976, 'steps': 123827, 'loss/train': 1.1397788524627686} 11/07/2021 14:40:40 - INFO - __main__ - Step 123829: {'lr': 3.759770010297303e-05, 'samples': 23775168, 'steps': 123828, 'loss/train': 1.3659785985946655} 11/07/2021 14:40:40 - INFO - __main__ - Step 123830: {'lr': 3.759490130620055e-05, 'samples': 23775360, 'steps': 123829, 'loss/train': 0.24387671053409576} 11/07/2021 14:40:40 - INFO - __main__ - Step 123831: {'lr': 3.7592102605134596e-05, 'samples': 23775552, 'steps': 123830, 'loss/train': 1.0522658824920654} 11/07/2021 14:40:41 - INFO - __main__ - Step 123832: {'lr': 3.758930399977631e-05, 'samples': 23775744, 'steps': 123831, 'loss/train': 1.294580101966858} 11/07/2021 14:40:41 - INFO - __main__ - Step 123833: {'lr': 3.7586505490126986e-05, 'samples': 23775936, 'steps': 123832, 'loss/train': 1.2140237092971802} 11/07/2021 14:40:42 - INFO - __main__ - Step 123834: {'lr': 3.758370707618791e-05, 'samples': 23776128, 'steps': 123833, 'loss/train': 0.9804573655128479} 11/07/2021 14:40:42 - INFO - __main__ - Step 123835: {'lr': 3.758090875796033e-05, 'samples': 23776320, 'steps': 123834, 'loss/train': 1.2985001802444458} 11/07/2021 14:40:43 - INFO - __main__ - Step 123836: {'lr': 3.757811053544555e-05, 'samples': 23776512, 'steps': 123835, 'loss/train': 1.0975161790847778} 11/07/2021 14:40:43 - INFO - __main__ - Step 123837: {'lr': 3.7575312408644756e-05, 'samples': 23776704, 'steps': 123836, 'loss/train': 1.7160927057266235} 11/07/2021 14:40:43 - INFO - __main__ - Step 123838: {'lr': 3.757251437755926e-05, 'samples': 23776896, 'steps': 123837, 'loss/train': 1.719017744064331} 11/07/2021 14:40:44 - INFO - __main__ - Step 123839: {'lr': 3.7569716442190315e-05, 'samples': 23777088, 'steps': 123838, 'loss/train': 1.105567455291748} 11/07/2021 14:40:45 - INFO - __main__ - Step 123840: {'lr': 3.756691860253919e-05, 'samples': 23777280, 'steps': 123839, 'loss/train': 1.5436705350875854} 11/07/2021 14:40:45 - INFO - __main__ - Step 123841: {'lr': 3.7564120858607134e-05, 'samples': 23777472, 'steps': 123840, 'loss/train': 1.146998405456543} 11/07/2021 14:40:45 - INFO - __main__ - Step 123842: {'lr': 3.7561323210395434e-05, 'samples': 23777664, 'steps': 123841, 'loss/train': 1.0734870433807373} 11/07/2021 14:40:46 - INFO - __main__ - Step 123843: {'lr': 3.7558525657905294e-05, 'samples': 23777856, 'steps': 123842, 'loss/train': 1.4094791412353516} 11/07/2021 14:40:47 - INFO - __main__ - Step 123844: {'lr': 3.755572820113801e-05, 'samples': 23778048, 'steps': 123843, 'loss/train': 1.1870412826538086} 11/07/2021 14:40:47 - INFO - __main__ - Step 123845: {'lr': 3.755293084009481e-05, 'samples': 23778240, 'steps': 123844, 'loss/train': 1.4306625127792358} 11/07/2021 14:40:48 - INFO - __main__ - Step 123846: {'lr': 3.755013357477699e-05, 'samples': 23778432, 'steps': 123845, 'loss/train': 1.1811528205871582} 11/07/2021 14:40:48 - INFO - __main__ - Step 123847: {'lr': 3.754733640518582e-05, 'samples': 23778624, 'steps': 123846, 'loss/train': 1.5667657852172852} 11/07/2021 14:40:48 - INFO - __main__ - Step 123848: {'lr': 3.7544539331322514e-05, 'samples': 23778816, 'steps': 123847, 'loss/train': 1.4192003011703491} 11/07/2021 14:40:49 - INFO - __main__ - Step 123849: {'lr': 3.754174235318836e-05, 'samples': 23779008, 'steps': 123848, 'loss/train': 1.9382859468460083} 11/07/2021 14:40:50 - INFO - __main__ - Step 123850: {'lr': 3.753894547078465e-05, 'samples': 23779200, 'steps': 123849, 'loss/train': 1.2639422416687012} 11/07/2021 14:40:50 - INFO - __main__ - Step 123851: {'lr': 3.753614868411259e-05, 'samples': 23779392, 'steps': 123850, 'loss/train': 0.7588691115379333} 11/07/2021 14:40:50 - INFO - __main__ - Step 123852: {'lr': 3.7533351993173454e-05, 'samples': 23779584, 'steps': 123851, 'loss/train': 1.3848183155059814} 11/07/2021 14:40:51 - INFO - __main__ - Step 123853: {'lr': 3.75305553979686e-05, 'samples': 23779776, 'steps': 123852, 'loss/train': 1.489365577697754} 11/07/2021 14:40:51 - INFO - __main__ - Step 123854: {'lr': 3.7527758898499105e-05, 'samples': 23779968, 'steps': 123853, 'loss/train': 1.722753643989563} 11/07/2021 14:40:52 - INFO - __main__ - Step 123855: {'lr': 3.752496249476634e-05, 'samples': 23780160, 'steps': 123854, 'loss/train': 1.3937128782272339} 11/07/2021 14:40:53 - INFO - __main__ - Step 123856: {'lr': 3.752216618677157e-05, 'samples': 23780352, 'steps': 123855, 'loss/train': 1.4469490051269531} 11/07/2021 14:40:53 - INFO - __main__ - Step 123857: {'lr': 3.7519369974515994e-05, 'samples': 23780544, 'steps': 123856, 'loss/train': 1.1107654571533203} 11/07/2021 14:40:53 - INFO - __main__ - Step 123858: {'lr': 3.7516573858000945e-05, 'samples': 23780736, 'steps': 123857, 'loss/train': 1.163301944732666} 11/07/2021 14:40:54 - INFO - __main__ - Step 123859: {'lr': 3.7513777837227616e-05, 'samples': 23780928, 'steps': 123858, 'loss/train': 1.15697181224823} 11/07/2021 14:40:55 - INFO - __main__ - Step 123860: {'lr': 3.7510981912197315e-05, 'samples': 23781120, 'steps': 123859, 'loss/train': 1.4116979837417603} 11/07/2021 14:40:55 - INFO - __main__ - Step 123861: {'lr': 3.750818608291129e-05, 'samples': 23781312, 'steps': 123860, 'loss/train': 1.3251700401306152} 11/07/2021 14:40:55 - INFO - __main__ - Step 123862: {'lr': 3.750539034937081e-05, 'samples': 23781504, 'steps': 123861, 'loss/train': 0.9178657531738281} 11/07/2021 14:40:56 - INFO - __main__ - Step 123863: {'lr': 3.7502594711577105e-05, 'samples': 23781696, 'steps': 123862, 'loss/train': 0.7634811997413635} 11/07/2021 14:40:56 - INFO - __main__ - Step 123864: {'lr': 3.74997991695315e-05, 'samples': 23781888, 'steps': 123863, 'loss/train': 0.9571239948272705} 11/07/2021 14:40:57 - INFO - __main__ - Step 123865: {'lr': 3.749700372323517e-05, 'samples': 23782080, 'steps': 123864, 'loss/train': 1.0058679580688477} 11/07/2021 14:40:57 - INFO - __main__ - Step 123866: {'lr': 3.749420837268941e-05, 'samples': 23782272, 'steps': 123865, 'loss/train': 1.3333991765975952} 11/07/2021 14:40:58 - INFO - __main__ - Step 123867: {'lr': 3.7491413117895474e-05, 'samples': 23782464, 'steps': 123866, 'loss/train': 0.6493328213691711} 11/07/2021 14:40:58 - INFO - __main__ - Step 123868: {'lr': 3.748861795885461e-05, 'samples': 23782656, 'steps': 123867, 'loss/train': 0.9254019260406494} 11/07/2021 14:40:59 - INFO - __main__ - Step 123869: {'lr': 3.7485822895568125e-05, 'samples': 23782848, 'steps': 123868, 'loss/train': 1.0811071395874023} 11/07/2021 14:41:00 - INFO - __main__ - Step 123870: {'lr': 3.7483027928037236e-05, 'samples': 23783040, 'steps': 123869, 'loss/train': 1.5298556089401245} 11/07/2021 14:41:00 - INFO - __main__ - Step 123871: {'lr': 3.748023305626322e-05, 'samples': 23783232, 'steps': 123870, 'loss/train': 1.1711585521697998} 11/07/2021 14:41:00 - INFO - __main__ - Step 123872: {'lr': 3.747743828024733e-05, 'samples': 23783424, 'steps': 123871, 'loss/train': 1.5683246850967407} 11/07/2021 14:41:01 - INFO - __main__ - Step 123873: {'lr': 3.747464359999081e-05, 'samples': 23783616, 'steps': 123872, 'loss/train': 1.5747116804122925} 11/07/2021 14:41:01 - INFO - __main__ - Step 123874: {'lr': 3.747184901549497e-05, 'samples': 23783808, 'steps': 123873, 'loss/train': 1.3326917886734009} 11/07/2021 14:41:02 - INFO - __main__ - Step 123875: {'lr': 3.746905452676105e-05, 'samples': 23784000, 'steps': 123874, 'loss/train': 0.9329296350479126} 11/07/2021 14:41:02 - INFO - __main__ - Step 123876: {'lr': 3.746626013379026e-05, 'samples': 23784192, 'steps': 123875, 'loss/train': 1.40763258934021} 11/07/2021 14:41:03 - INFO - __main__ - Step 123877: {'lr': 3.746346583658392e-05, 'samples': 23784384, 'steps': 123876, 'loss/train': 0.4468100070953369} 11/07/2021 14:41:03 - INFO - __main__ - Step 123878: {'lr': 3.7460671635143216e-05, 'samples': 23784576, 'steps': 123877, 'loss/train': 1.1771440505981445} 11/07/2021 14:41:03 - INFO - __main__ - Step 123879: {'lr': 3.745787752946947e-05, 'samples': 23784768, 'steps': 123878, 'loss/train': 1.2081853151321411} 11/07/2021 14:41:04 - INFO - __main__ - Step 123880: {'lr': 3.7455083519563945e-05, 'samples': 23784960, 'steps': 123879, 'loss/train': 0.8911455273628235} 11/07/2021 14:41:05 - INFO - __main__ - Step 123881: {'lr': 3.745228960542785e-05, 'samples': 23785152, 'steps': 123880, 'loss/train': 1.178532600402832} 11/07/2021 14:41:05 - INFO - __main__ - Step 123882: {'lr': 3.74494957870625e-05, 'samples': 23785344, 'steps': 123881, 'loss/train': 1.0347051620483398} 11/07/2021 14:41:06 - INFO - __main__ - Step 123883: {'lr': 3.7446702064469094e-05, 'samples': 23785536, 'steps': 123882, 'loss/train': 0.9059107303619385} 11/07/2021 14:41:06 - INFO - __main__ - Step 123884: {'lr': 3.744390843764897e-05, 'samples': 23785728, 'steps': 123883, 'loss/train': 1.0409801006317139} 11/07/2021 14:41:06 - INFO - __main__ - Step 123885: {'lr': 3.744111490660329e-05, 'samples': 23785920, 'steps': 123884, 'loss/train': 1.3423171043395996} 11/07/2021 14:41:07 - INFO - __main__ - Step 123886: {'lr': 3.743832147133347e-05, 'samples': 23786112, 'steps': 123885, 'loss/train': 1.2621665000915527} 11/07/2021 14:41:08 - INFO - __main__ - Step 123887: {'lr': 3.7435528131840564e-05, 'samples': 23786304, 'steps': 123886, 'loss/train': 1.0556470155715942} 11/07/2021 14:41:08 - INFO - __main__ - Step 123888: {'lr': 3.7432734888125956e-05, 'samples': 23786496, 'steps': 123887, 'loss/train': 0.8584079742431641} 11/07/2021 14:41:08 - INFO - __main__ - Step 123889: {'lr': 3.742994174019088e-05, 'samples': 23786688, 'steps': 123888, 'loss/train': 1.0668998956680298} 11/07/2021 14:41:09 - INFO - __main__ - Step 123890: {'lr': 3.7427148688036566e-05, 'samples': 23786880, 'steps': 123889, 'loss/train': 0.8537676930427551} 11/07/2021 14:41:10 - INFO - __main__ - Step 123891: {'lr': 3.7424355731664306e-05, 'samples': 23787072, 'steps': 123890, 'loss/train': 1.3140352964401245} 11/07/2021 14:41:10 - INFO - __main__ - Step 123892: {'lr': 3.742156287107537e-05, 'samples': 23787264, 'steps': 123891, 'loss/train': 0.6784180998802185} 11/07/2021 14:41:10 - INFO - __main__ - Step 123893: {'lr': 3.741877010627098e-05, 'samples': 23787456, 'steps': 123892, 'loss/train': 1.7571966648101807} 11/07/2021 14:41:11 - INFO - __main__ - Step 123894: {'lr': 3.741597743725242e-05, 'samples': 23787648, 'steps': 123893, 'loss/train': 0.8061465620994568} 11/07/2021 14:41:11 - INFO - __main__ - Step 123895: {'lr': 3.7413184864020924e-05, 'samples': 23787840, 'steps': 123894, 'loss/train': 1.5185918807983398} 11/07/2021 14:41:12 - INFO - __main__ - Step 123896: {'lr': 3.741039238657778e-05, 'samples': 23788032, 'steps': 123895, 'loss/train': 1.4477838277816772} 11/07/2021 14:41:13 - INFO - __main__ - Step 123897: {'lr': 3.740760000492424e-05, 'samples': 23788224, 'steps': 123896, 'loss/train': 1.4619801044464111} 11/07/2021 14:41:13 - INFO - __main__ - Step 123898: {'lr': 3.74048077190616e-05, 'samples': 23788416, 'steps': 123897, 'loss/train': 1.1708911657333374} 11/07/2021 14:41:13 - INFO - __main__ - Step 123899: {'lr': 3.740201552899103e-05, 'samples': 23788608, 'steps': 123898, 'loss/train': 0.9935601949691772} 11/07/2021 14:41:14 - INFO - __main__ - Step 123900: {'lr': 3.739922343471383e-05, 'samples': 23788800, 'steps': 123899, 'loss/train': 1.3757802248001099} 11/07/2021 14:41:15 - INFO - __main__ - Step 123901: {'lr': 3.7396431436231256e-05, 'samples': 23788992, 'steps': 123900, 'loss/train': 0.40056341886520386} 11/07/2021 14:41:15 - INFO - __main__ - Step 123902: {'lr': 3.739363953354455e-05, 'samples': 23789184, 'steps': 123901, 'loss/train': 1.236751675605774} 11/07/2021 14:41:15 - INFO - __main__ - Step 123903: {'lr': 3.739084772665499e-05, 'samples': 23789376, 'steps': 123902, 'loss/train': 1.3716591596603394} 11/07/2021 14:41:16 - INFO - __main__ - Step 123904: {'lr': 3.738805601556386e-05, 'samples': 23789568, 'steps': 123903, 'loss/train': 1.0879104137420654} 11/07/2021 14:41:16 - INFO - __main__ - Step 123905: {'lr': 3.7385264400272376e-05, 'samples': 23789760, 'steps': 123904, 'loss/train': 1.1345504522323608} 11/07/2021 14:41:17 - INFO - __main__ - Step 123906: {'lr': 3.738247288078181e-05, 'samples': 23789952, 'steps': 123905, 'loss/train': 1.4725078344345093} 11/07/2021 14:41:17 - INFO - __main__ - Step 123907: {'lr': 3.737968145709342e-05, 'samples': 23790144, 'steps': 123906, 'loss/train': 1.3043251037597656} 11/07/2021 14:41:18 - INFO - __main__ - Step 123908: {'lr': 3.7376890129208476e-05, 'samples': 23790336, 'steps': 123907, 'loss/train': 1.4719821214675903} 11/07/2021 14:41:18 - INFO - __main__ - Step 123909: {'lr': 3.737409889712823e-05, 'samples': 23790528, 'steps': 123908, 'loss/train': 1.110624074935913} 11/07/2021 14:41:18 - INFO - __main__ - Step 123910: {'lr': 3.73713077608539e-05, 'samples': 23790720, 'steps': 123909, 'loss/train': 1.2367717027664185} 11/07/2021 14:41:20 - INFO - __main__ - Step 123911: {'lr': 3.73685167203868e-05, 'samples': 23790912, 'steps': 123910, 'loss/train': 0.8453313112258911} 11/07/2021 14:41:20 - INFO - __main__ - Step 123912: {'lr': 3.736572577572822e-05, 'samples': 23791104, 'steps': 123911, 'loss/train': 0.9838840365409851} 11/07/2021 14:41:20 - INFO - __main__ - Step 123913: {'lr': 3.736293492687931e-05, 'samples': 23791296, 'steps': 123912, 'loss/train': 1.4226690530776978} 11/07/2021 14:41:21 - INFO - __main__ - Step 123914: {'lr': 3.736014417384137e-05, 'samples': 23791488, 'steps': 123913, 'loss/train': 0.7343175411224365} 11/07/2021 14:41:21 - INFO - __main__ - Step 123915: {'lr': 3.7357353516615675e-05, 'samples': 23791680, 'steps': 123914, 'loss/train': 1.1903125047683716} 11/07/2021 14:41:22 - INFO - __main__ - Step 123916: {'lr': 3.735456295520348e-05, 'samples': 23791872, 'steps': 123915, 'loss/train': 1.6709078550338745} 11/07/2021 14:41:22 - INFO - __main__ - Step 123917: {'lr': 3.735177248960603e-05, 'samples': 23792064, 'steps': 123916, 'loss/train': 0.8959658741950989} 11/07/2021 14:41:23 - INFO - __main__ - Step 123918: {'lr': 3.7348982119824596e-05, 'samples': 23792256, 'steps': 123917, 'loss/train': 1.375898003578186} 11/07/2021 14:41:23 - INFO - __main__ - Step 123919: {'lr': 3.734619184586044e-05, 'samples': 23792448, 'steps': 123918, 'loss/train': 0.6688946485519409} 11/07/2021 14:41:23 - INFO - __main__ - Step 123920: {'lr': 3.734340166771477e-05, 'samples': 23792640, 'steps': 123919, 'loss/train': 1.268762230873108} 11/07/2021 14:41:25 - INFO - __main__ - Step 123921: {'lr': 3.734061158538893e-05, 'samples': 23792832, 'steps': 123920, 'loss/train': 1.1669645309448242} 11/07/2021 14:41:25 - INFO - __main__ - Step 123922: {'lr': 3.7337821598884106e-05, 'samples': 23793024, 'steps': 123921, 'loss/train': 1.2263023853302002} 11/07/2021 14:41:25 - INFO - __main__ - Step 123923: {'lr': 3.733503170820157e-05, 'samples': 23793216, 'steps': 123922, 'loss/train': 1.5231019258499146} 11/07/2021 14:41:26 - INFO - __main__ - Step 123924: {'lr': 3.733224191334258e-05, 'samples': 23793408, 'steps': 123923, 'loss/train': 1.2767298221588135} 11/07/2021 14:41:26 - INFO - __main__ - Step 123925: {'lr': 3.7329452214308474e-05, 'samples': 23793600, 'steps': 123924, 'loss/train': 1.4386804103851318} 11/07/2021 14:41:27 - INFO - __main__ - Step 123926: {'lr': 3.7326662611100376e-05, 'samples': 23793792, 'steps': 123925, 'loss/train': 1.1162264347076416} 11/07/2021 14:41:27 - INFO - __main__ - Step 123927: {'lr': 3.7323873103719594e-05, 'samples': 23793984, 'steps': 123926, 'loss/train': 1.4233413934707642} 11/07/2021 14:41:28 - INFO - __main__ - Step 123928: {'lr': 3.732108369216741e-05, 'samples': 23794176, 'steps': 123927, 'loss/train': 1.3807741403579712} 11/07/2021 14:41:28 - INFO - __main__ - Step 123929: {'lr': 3.731829437644507e-05, 'samples': 23794368, 'steps': 123928, 'loss/train': 1.032501220703125} 11/07/2021 14:41:28 - INFO - __main__ - Step 123930: {'lr': 3.73155051565538e-05, 'samples': 23794560, 'steps': 123929, 'loss/train': 0.9589534997940063} 11/07/2021 14:41:29 - INFO - __main__ - Step 123931: {'lr': 3.731271603249489e-05, 'samples': 23794752, 'steps': 123930, 'loss/train': 0.9606517553329468} 11/07/2021 14:41:30 - INFO - __main__ - Step 123932: {'lr': 3.7309927004269575e-05, 'samples': 23794944, 'steps': 123931, 'loss/train': 1.431997537612915} 11/07/2021 14:41:30 - INFO - __main__ - Step 123933: {'lr': 3.730713807187916e-05, 'samples': 23795136, 'steps': 123932, 'loss/train': 1.273912787437439} 11/07/2021 14:41:30 - INFO - __main__ - Step 123934: {'lr': 3.730434923532483e-05, 'samples': 23795328, 'steps': 123933, 'loss/train': 1.2259368896484375} 11/07/2021 14:41:31 - INFO - __main__ - Step 123935: {'lr': 3.7301560494607894e-05, 'samples': 23795520, 'steps': 123934, 'loss/train': 1.6382187604904175} 11/07/2021 14:41:32 - INFO - __main__ - Step 123936: {'lr': 3.72987718497296e-05, 'samples': 23795712, 'steps': 123935, 'loss/train': 1.0607715845108032} 11/07/2021 14:41:32 - INFO - __main__ - Step 123937: {'lr': 3.729598330069117e-05, 'samples': 23795904, 'steps': 123936, 'loss/train': 1.1829396486282349} 11/07/2021 14:41:33 - INFO - __main__ - Step 123938: {'lr': 3.729319484749391e-05, 'samples': 23796096, 'steps': 123937, 'loss/train': 0.8733965754508972} 11/07/2021 14:41:33 - INFO - __main__ - Step 123939: {'lr': 3.729040649013912e-05, 'samples': 23796288, 'steps': 123938, 'loss/train': 1.3855326175689697} 11/07/2021 14:41:33 - INFO - __main__ - Step 123940: {'lr': 3.7287618228627916e-05, 'samples': 23796480, 'steps': 123939, 'loss/train': 1.460957646369934} 11/07/2021 14:41:34 - INFO - __main__ - Step 123941: {'lr': 3.728483006296163e-05, 'samples': 23796672, 'steps': 123940, 'loss/train': 1.2413026094436646} 11/07/2021 14:41:35 - INFO - __main__ - Step 123942: {'lr': 3.728204199314153e-05, 'samples': 23796864, 'steps': 123941, 'loss/train': 0.8533570170402527} 11/07/2021 14:41:35 - INFO - __main__ - Step 123943: {'lr': 3.7279254019168845e-05, 'samples': 23797056, 'steps': 123942, 'loss/train': 1.5095876455307007} 11/07/2021 14:41:35 - INFO - __main__ - Step 123944: {'lr': 3.727646614104485e-05, 'samples': 23797248, 'steps': 123943, 'loss/train': 1.3685804605484009} 11/07/2021 14:41:36 - INFO - __main__ - Step 123945: {'lr': 3.7273678358770795e-05, 'samples': 23797440, 'steps': 123944, 'loss/train': 1.525754451751709} 11/07/2021 14:41:36 - INFO - __main__ - Step 123946: {'lr': 3.7270890672347955e-05, 'samples': 23797632, 'steps': 123945, 'loss/train': 1.3559091091156006} 11/07/2021 14:41:37 - INFO - __main__ - Step 123947: {'lr': 3.726810308177755e-05, 'samples': 23797824, 'steps': 123946, 'loss/train': 1.0597409009933472} 11/07/2021 14:41:37 - INFO - __main__ - Step 123948: {'lr': 3.726531558706087e-05, 'samples': 23798016, 'steps': 123947, 'loss/train': 1.0950039625167847} 11/07/2021 14:41:38 - INFO - __main__ - Step 123949: {'lr': 3.726252818819914e-05, 'samples': 23798208, 'steps': 123948, 'loss/train': 0.32436519861221313} 11/07/2021 14:41:38 - INFO - __main__ - Step 123950: {'lr': 3.7259740885193626e-05, 'samples': 23798400, 'steps': 123949, 'loss/train': 1.2483181953430176} 11/07/2021 14:41:38 - INFO - __main__ - Step 123951: {'lr': 3.72569536780456e-05, 'samples': 23798592, 'steps': 123950, 'loss/train': 1.1470139026641846} 11/07/2021 14:41:39 - INFO - __main__ - Step 123952: {'lr': 3.725416656675637e-05, 'samples': 23798784, 'steps': 123951, 'loss/train': 1.5763859748840332} 11/07/2021 14:41:40 - INFO - __main__ - Step 123953: {'lr': 3.725137955132707e-05, 'samples': 23798976, 'steps': 123952, 'loss/train': 0.9416154026985168} 11/07/2021 14:41:40 - INFO - __main__ - Step 123954: {'lr': 3.7248592631759035e-05, 'samples': 23799168, 'steps': 123953, 'loss/train': 1.0349901914596558} 11/07/2021 14:41:41 - INFO - __main__ - Step 123955: {'lr': 3.7245805808053476e-05, 'samples': 23799360, 'steps': 123954, 'loss/train': 1.306177020072937} 11/07/2021 14:41:41 - INFO - __main__ - Step 123956: {'lr': 3.724301908021169e-05, 'samples': 23799552, 'steps': 123955, 'loss/train': 1.168328046798706} 11/07/2021 14:41:42 - INFO - __main__ - Step 123957: {'lr': 3.7240232448234905e-05, 'samples': 23799744, 'steps': 123956, 'loss/train': 1.2526583671569824} 11/07/2021 14:41:42 - INFO - __main__ - Step 123958: {'lr': 3.723744591212439e-05, 'samples': 23799936, 'steps': 123957, 'loss/train': 1.2971575260162354} 11/07/2021 14:41:43 - INFO - __main__ - Step 123959: {'lr': 3.7234659471881397e-05, 'samples': 23800128, 'steps': 123958, 'loss/train': 1.4506711959838867} 11/07/2021 14:41:43 - INFO - __main__ - Step 123960: {'lr': 3.7231873127507174e-05, 'samples': 23800320, 'steps': 123959, 'loss/train': 1.0645605325698853} 11/07/2021 14:41:43 - INFO - __main__ - Step 123961: {'lr': 3.722908687900301e-05, 'samples': 23800512, 'steps': 123960, 'loss/train': 0.9080715775489807} 11/07/2021 14:41:44 - INFO - __main__ - Step 123962: {'lr': 3.722630072637012e-05, 'samples': 23800704, 'steps': 123961, 'loss/train': 1.4768502712249756} 11/07/2021 14:41:45 - INFO - __main__ - Step 123963: {'lr': 3.722351466960977e-05, 'samples': 23800896, 'steps': 123962, 'loss/train': 1.929879903793335} 11/07/2021 14:41:45 - INFO - __main__ - Step 123964: {'lr': 3.7220728708723225e-05, 'samples': 23801088, 'steps': 123963, 'loss/train': 1.340812087059021} 11/07/2021 14:41:45 - INFO - __main__ - Step 123965: {'lr': 3.721794284371174e-05, 'samples': 23801280, 'steps': 123964, 'loss/train': 1.4359384775161743} 11/07/2021 14:41:46 - INFO - __main__ - Step 123966: {'lr': 3.721515707457662e-05, 'samples': 23801472, 'steps': 123965, 'loss/train': 1.1162455081939697} 11/07/2021 14:41:47 - INFO - __main__ - Step 123967: {'lr': 3.721237140131903e-05, 'samples': 23801664, 'steps': 123966, 'loss/train': 1.3937599658966064} 11/07/2021 14:41:47 - INFO - __main__ - Step 123968: {'lr': 3.7209585823940236e-05, 'samples': 23801856, 'steps': 123967, 'loss/train': 1.500360369682312} 11/07/2021 14:41:48 - INFO - __main__ - Step 123969: {'lr': 3.7206800342441534e-05, 'samples': 23802048, 'steps': 123968, 'loss/train': 0.46686795353889465} 11/07/2021 14:41:48 - INFO - __main__ - Step 123970: {'lr': 3.7204014956824155e-05, 'samples': 23802240, 'steps': 123969, 'loss/train': 0.9905659556388855} 11/07/2021 14:41:48 - INFO - __main__ - Step 123971: {'lr': 3.720122966708936e-05, 'samples': 23802432, 'steps': 123970, 'loss/train': 0.9065704941749573} 11/07/2021 14:41:49 - INFO - __main__ - Step 123972: {'lr': 3.719844447323842e-05, 'samples': 23802624, 'steps': 123971, 'loss/train': 1.8352563381195068} 11/07/2021 14:41:50 - INFO - __main__ - Step 123973: {'lr': 3.7195659375272555e-05, 'samples': 23802816, 'steps': 123972, 'loss/train': 1.0020209550857544} 11/07/2021 14:41:50 - INFO - __main__ - Step 123974: {'lr': 3.719287437319308e-05, 'samples': 23803008, 'steps': 123973, 'loss/train': 1.4199562072753906} 11/07/2021 14:41:50 - INFO - __main__ - Step 123975: {'lr': 3.719008946700117e-05, 'samples': 23803200, 'steps': 123974, 'loss/train': 1.7791138887405396} 11/07/2021 14:41:51 - INFO - __main__ - Step 123976: {'lr': 3.7187304656698145e-05, 'samples': 23803392, 'steps': 123975, 'loss/train': 1.2828816175460815} 11/07/2021 14:41:52 - INFO - __main__ - Step 123977: {'lr': 3.718451994228525e-05, 'samples': 23803584, 'steps': 123976, 'loss/train': 1.4414476156234741} 11/07/2021 14:41:52 - INFO - __main__ - Step 123978: {'lr': 3.7181735323763707e-05, 'samples': 23803776, 'steps': 123977, 'loss/train': 1.4269037246704102} 11/07/2021 14:41:53 - INFO - __main__ - Step 123979: {'lr': 3.717895080113484e-05, 'samples': 23803968, 'steps': 123978, 'loss/train': 1.3399457931518555} 11/07/2021 14:41:53 - INFO - __main__ - Step 123980: {'lr': 3.71761663743998e-05, 'samples': 23804160, 'steps': 123979, 'loss/train': 1.0693477392196655} 11/07/2021 14:41:53 - INFO - __main__ - Step 123981: {'lr': 3.717338204355991e-05, 'samples': 23804352, 'steps': 123980, 'loss/train': 1.6220428943634033} 11/07/2021 14:41:54 - INFO - __main__ - Step 123982: {'lr': 3.7170597808616396e-05, 'samples': 23804544, 'steps': 123981, 'loss/train': 1.123082160949707} 11/07/2021 14:41:55 - INFO - __main__ - Step 123983: {'lr': 3.7167813669570535e-05, 'samples': 23804736, 'steps': 123982, 'loss/train': 1.1612049341201782} 11/07/2021 14:41:55 - INFO - __main__ - Step 123984: {'lr': 3.716502962642357e-05, 'samples': 23804928, 'steps': 123983, 'loss/train': 0.9856463670730591} 11/07/2021 14:41:55 - INFO - __main__ - Step 123985: {'lr': 3.716224567917678e-05, 'samples': 23805120, 'steps': 123984, 'loss/train': 1.3059085607528687} 11/07/2021 14:41:56 - INFO - __main__ - Step 123986: {'lr': 3.715946182783136e-05, 'samples': 23805312, 'steps': 123985, 'loss/train': 1.1835970878601074} 11/07/2021 14:41:57 - INFO - __main__ - Step 123987: {'lr': 3.7156678072388624e-05, 'samples': 23805504, 'steps': 123986, 'loss/train': 1.3830593824386597} 11/07/2021 14:41:57 - INFO - __main__ - Step 123988: {'lr': 3.71538944128498e-05, 'samples': 23805696, 'steps': 123987, 'loss/train': 1.61318838596344} 11/07/2021 14:41:57 - INFO - __main__ - Step 123989: {'lr': 3.7151110849216156e-05, 'samples': 23805888, 'steps': 123988, 'loss/train': 1.3385239839553833} 11/07/2021 14:41:58 - INFO - __main__ - Step 123990: {'lr': 3.7148327381488906e-05, 'samples': 23806080, 'steps': 123989, 'loss/train': 1.5557290315628052} 11/07/2021 14:41:58 - INFO - __main__ - Step 123991: {'lr': 3.714554400966938e-05, 'samples': 23806272, 'steps': 123990, 'loss/train': 0.80199134349823} 11/07/2021 14:41:59 - INFO - __main__ - Step 123992: {'lr': 3.714276073375875e-05, 'samples': 23806464, 'steps': 123991, 'loss/train': 0.6679530143737793} 11/07/2021 14:42:00 - INFO - __main__ - Step 123993: {'lr': 3.713997755375839e-05, 'samples': 23806656, 'steps': 123992, 'loss/train': 0.8052207231521606} 11/07/2021 14:42:00 - INFO - __main__ - Step 123994: {'lr': 3.71371944696694e-05, 'samples': 23806848, 'steps': 123993, 'loss/train': 1.4682756662368774} 11/07/2021 14:42:00 - INFO - __main__ - Step 123995: {'lr': 3.7134411481493106e-05, 'samples': 23807040, 'steps': 123994, 'loss/train': 0.7905340790748596} 11/07/2021 14:42:01 - INFO - __main__ - Step 123996: {'lr': 3.713162858923075e-05, 'samples': 23807232, 'steps': 123995, 'loss/train': 1.4931997060775757} 11/07/2021 14:42:01 - INFO - __main__ - Step 123997: {'lr': 3.7128845792883614e-05, 'samples': 23807424, 'steps': 123996, 'loss/train': 1.5507142543792725} 11/07/2021 14:42:02 - INFO - __main__ - Step 123998: {'lr': 3.712606309245295e-05, 'samples': 23807616, 'steps': 123997, 'loss/train': 1.3542087078094482} 11/07/2021 14:42:02 - INFO - __main__ - Step 123999: {'lr': 3.712328048793997e-05, 'samples': 23807808, 'steps': 123998, 'loss/train': 0.974786639213562} 11/07/2021 14:42:03 - INFO - __main__ - Step 124000: {'lr': 3.712049797934597e-05, 'samples': 23808000, 'steps': 123999, 'loss/train': 1.2136006355285645} 11/07/2021 14:42:03 - INFO - __main__ - Step 124001: {'lr': 3.711771556667218e-05, 'samples': 23808192, 'steps': 124000, 'loss/train': 0.9048500657081604} 11/07/2021 14:42:03 - INFO - __main__ - Step 124002: {'lr': 3.711493324991985e-05, 'samples': 23808384, 'steps': 124001, 'loss/train': 1.4801080226898193} 11/07/2021 14:42:04 - INFO - __main__ - Step 124003: {'lr': 3.711215102909027e-05, 'samples': 23808576, 'steps': 124002, 'loss/train': 1.070214033126831} 11/07/2021 14:42:05 - INFO - __main__ - Step 124004: {'lr': 3.710936890418468e-05, 'samples': 23808768, 'steps': 124003, 'loss/train': 1.3066911697387695} 11/07/2021 14:42:05 - INFO - __main__ - Step 124005: {'lr': 3.71065868752043e-05, 'samples': 23808960, 'steps': 124004, 'loss/train': 1.217868447303772} 11/07/2021 14:42:05 - INFO - __main__ - Step 124006: {'lr': 3.710380494215046e-05, 'samples': 23809152, 'steps': 124005, 'loss/train': 1.324234127998352} 11/07/2021 14:42:06 - INFO - __main__ - Step 124007: {'lr': 3.7101023105024305e-05, 'samples': 23809344, 'steps': 124006, 'loss/train': 1.179666519165039} 11/07/2021 14:42:07 - INFO - __main__ - Step 124008: {'lr': 3.709824136382717e-05, 'samples': 23809536, 'steps': 124007, 'loss/train': 1.4294577836990356} 11/07/2021 14:42:07 - INFO - __main__ - Step 124009: {'lr': 3.709545971856024e-05, 'samples': 23809728, 'steps': 124008, 'loss/train': 0.866707980632782} 11/07/2021 14:42:08 - INFO - __main__ - Step 124010: {'lr': 3.709267816922485e-05, 'samples': 23809920, 'steps': 124009, 'loss/train': 1.426483154296875} 11/07/2021 14:42:08 - INFO - __main__ - Step 124011: {'lr': 3.708989671582219e-05, 'samples': 23810112, 'steps': 124010, 'loss/train': 0.8596733808517456} 11/07/2021 14:42:08 - INFO - __main__ - Step 124012: {'lr': 3.7087115358353545e-05, 'samples': 23810304, 'steps': 124011, 'loss/train': 1.2475346326828003} 11/07/2021 14:42:09 - INFO - __main__ - Step 124013: {'lr': 3.708433409682016e-05, 'samples': 23810496, 'steps': 124012, 'loss/train': 0.884358823299408} 11/07/2021 14:42:10 - INFO - __main__ - Step 124014: {'lr': 3.708155293122328e-05, 'samples': 23810688, 'steps': 124013, 'loss/train': 1.4556825160980225} 11/07/2021 14:42:10 - INFO - __main__ - Step 124015: {'lr': 3.707877186156419e-05, 'samples': 23810880, 'steps': 124014, 'loss/train': 1.1131826639175415} 11/07/2021 14:42:10 - INFO - __main__ - Step 124016: {'lr': 3.707599088784411e-05, 'samples': 23811072, 'steps': 124015, 'loss/train': 1.5945796966552734} 11/07/2021 14:42:11 - INFO - __main__ - Step 124017: {'lr': 3.707321001006428e-05, 'samples': 23811264, 'steps': 124016, 'loss/train': 1.0652241706848145} 11/07/2021 14:42:12 - INFO - __main__ - Step 124018: {'lr': 3.707042922822601e-05, 'samples': 23811456, 'steps': 124017, 'loss/train': 0.969813346862793} 11/07/2021 14:42:12 - INFO - __main__ - Step 124019: {'lr': 3.706764854233055e-05, 'samples': 23811648, 'steps': 124018, 'loss/train': 0.7868967652320862} 11/07/2021 14:42:13 - INFO - __main__ - Step 124020: {'lr': 3.7064867952379066e-05, 'samples': 23811840, 'steps': 124019, 'loss/train': 1.0358184576034546} 11/07/2021 14:42:13 - INFO - __main__ - Step 124021: {'lr': 3.706208745837289e-05, 'samples': 23812032, 'steps': 124020, 'loss/train': 0.3373759686946869} 11/07/2021 14:42:13 - INFO - __main__ - Step 124022: {'lr': 3.705930706031321e-05, 'samples': 23812224, 'steps': 124021, 'loss/train': 1.3251004219055176} 11/07/2021 14:42:14 - INFO - __main__ - Step 124023: {'lr': 3.705652675820137e-05, 'samples': 23812416, 'steps': 124022, 'loss/train': 0.8786476254463196} 11/07/2021 14:42:15 - INFO - __main__ - Step 124024: {'lr': 3.705374655203855e-05, 'samples': 23812608, 'steps': 124023, 'loss/train': 1.4647434949874878} 11/07/2021 14:42:15 - INFO - __main__ - Step 124025: {'lr': 3.7050966441826014e-05, 'samples': 23812800, 'steps': 124024, 'loss/train': 1.40997314453125} 11/07/2021 14:42:15 - INFO - __main__ - Step 124026: {'lr': 3.704818642756505e-05, 'samples': 23812992, 'steps': 124025, 'loss/train': 1.196252703666687} 11/07/2021 14:42:16 - INFO - __main__ - Step 124027: {'lr': 3.704540650925686e-05, 'samples': 23813184, 'steps': 124026, 'loss/train': 1.2542816400527954} 11/07/2021 14:42:17 - INFO - __main__ - Step 124028: {'lr': 3.704262668690275e-05, 'samples': 23813376, 'steps': 124027, 'loss/train': 0.839633584022522} 11/07/2021 14:42:17 - INFO - __main__ - Step 124029: {'lr': 3.7039846960503944e-05, 'samples': 23813568, 'steps': 124028, 'loss/train': 1.4437050819396973} 11/07/2021 14:42:17 - INFO - __main__ - Step 124030: {'lr': 3.703706733006168e-05, 'samples': 23813760, 'steps': 124029, 'loss/train': 0.8077112436294556} 11/07/2021 14:42:18 - INFO - __main__ - Step 124031: {'lr': 3.7034287795577244e-05, 'samples': 23813952, 'steps': 124030, 'loss/train': 1.5310404300689697} 11/07/2021 14:42:18 - INFO - __main__ - Step 124032: {'lr': 3.703150835705185e-05, 'samples': 23814144, 'steps': 124031, 'loss/train': 1.152390718460083} 11/07/2021 14:42:19 - INFO - __main__ - Step 124033: {'lr': 3.702872901448684e-05, 'samples': 23814336, 'steps': 124032, 'loss/train': 0.9451045393943787} 11/07/2021 14:42:20 - INFO - __main__ - Step 124034: {'lr': 3.7025949767883344e-05, 'samples': 23814528, 'steps': 124033, 'loss/train': 1.1462477445602417} 11/07/2021 14:42:20 - INFO - __main__ - Step 124035: {'lr': 3.7023170617242666e-05, 'samples': 23814720, 'steps': 124034, 'loss/train': 1.1928281784057617} 11/07/2021 14:42:20 - INFO - __main__ - Step 124036: {'lr': 3.7020391562566064e-05, 'samples': 23814912, 'steps': 124035, 'loss/train': 1.3791614770889282} 11/07/2021 14:42:21 - INFO - __main__ - Step 124037: {'lr': 3.701761260385478e-05, 'samples': 23815104, 'steps': 124036, 'loss/train': 0.8403186798095703} 11/07/2021 14:42:22 - INFO - __main__ - Step 124038: {'lr': 3.701483374111009e-05, 'samples': 23815296, 'steps': 124037, 'loss/train': 0.03990115225315094} 11/07/2021 14:42:22 - INFO - __main__ - Step 124039: {'lr': 3.7012054974333216e-05, 'samples': 23815488, 'steps': 124038, 'loss/train': 1.2593061923980713} 11/07/2021 14:42:22 - INFO - __main__ - Step 124040: {'lr': 3.700927630352543e-05, 'samples': 23815680, 'steps': 124039, 'loss/train': 1.2501146793365479} 11/07/2021 14:42:23 - INFO - __main__ - Step 124041: {'lr': 3.7006497728687974e-05, 'samples': 23815872, 'steps': 124040, 'loss/train': 1.8056997060775757} 11/07/2021 14:42:23 - INFO - __main__ - Step 124042: {'lr': 3.70037192498221e-05, 'samples': 23816064, 'steps': 124041, 'loss/train': 1.7458256483078003} 11/07/2021 14:42:23 - INFO - __main__ - Step 124043: {'lr': 3.700094086692907e-05, 'samples': 23816256, 'steps': 124042, 'loss/train': 1.4007457494735718} 11/07/2021 14:42:25 - INFO - __main__ - Step 124044: {'lr': 3.6998162580010124e-05, 'samples': 23816448, 'steps': 124043, 'loss/train': 1.5214240550994873} 11/07/2021 14:42:25 - INFO - __main__ - Step 124045: {'lr': 3.699538438906652e-05, 'samples': 23816640, 'steps': 124044, 'loss/train': 1.0649484395980835} 11/07/2021 14:42:25 - INFO - __main__ - Step 124046: {'lr': 3.699260629409956e-05, 'samples': 23816832, 'steps': 124045, 'loss/train': 1.4069901704788208} 11/07/2021 14:42:26 - INFO - __main__ - Step 124047: {'lr': 3.698982829511041e-05, 'samples': 23817024, 'steps': 124046, 'loss/train': 0.767356276512146} 11/07/2021 14:42:26 - INFO - __main__ - Step 124048: {'lr': 3.698705039210035e-05, 'samples': 23817216, 'steps': 124047, 'loss/train': 1.3349779844284058} 11/07/2021 14:42:27 - INFO - __main__ - Step 124049: {'lr': 3.6984272585070615e-05, 'samples': 23817408, 'steps': 124048, 'loss/train': 1.6756268739700317} 11/07/2021 14:42:27 - INFO - __main__ - Step 124050: {'lr': 3.6981494874022495e-05, 'samples': 23817600, 'steps': 124049, 'loss/train': 1.5557115077972412} 11/07/2021 14:42:28 - INFO - __main__ - Step 124051: {'lr': 3.697871725895721e-05, 'samples': 23817792, 'steps': 124050, 'loss/train': 0.8502287268638611} 11/07/2021 14:42:28 - INFO - __main__ - Step 124052: {'lr': 3.697593973987606e-05, 'samples': 23817984, 'steps': 124051, 'loss/train': 0.20753861963748932} 11/07/2021 14:42:28 - INFO - __main__ - Step 124053: {'lr': 3.697316231678024e-05, 'samples': 23818176, 'steps': 124052, 'loss/train': 1.2992748022079468} 11/07/2021 14:42:30 - INFO - __main__ - Step 124054: {'lr': 3.6970384989671036e-05, 'samples': 23818368, 'steps': 124053, 'loss/train': 1.3219188451766968} 11/07/2021 14:42:30 - INFO - __main__ - Step 124055: {'lr': 3.6967607758549684e-05, 'samples': 23818560, 'steps': 124054, 'loss/train': 1.3645867109298706} 11/07/2021 14:42:30 - INFO - __main__ - Step 124056: {'lr': 3.696483062341743e-05, 'samples': 23818752, 'steps': 124055, 'loss/train': 1.2420775890350342} 11/07/2021 14:42:31 - INFO - __main__ - Step 124057: {'lr': 3.696205358427557e-05, 'samples': 23818944, 'steps': 124056, 'loss/train': 1.3310258388519287} 11/07/2021 14:42:31 - INFO - __main__ - Step 124058: {'lr': 3.695927664112531e-05, 'samples': 23819136, 'steps': 124057, 'loss/train': 1.5184754133224487} 11/07/2021 14:42:32 - INFO - __main__ - Step 124059: {'lr': 3.695649979396789e-05, 'samples': 23819328, 'steps': 124058, 'loss/train': 1.3353683948516846} 11/07/2021 14:42:32 - INFO - __main__ - Step 124060: {'lr': 3.695372304280464e-05, 'samples': 23819520, 'steps': 124059, 'loss/train': 1.4399261474609375} 11/07/2021 14:42:33 - INFO - __main__ - Step 124061: {'lr': 3.695094638763671e-05, 'samples': 23819712, 'steps': 124060, 'loss/train': 1.9882941246032715} 11/07/2021 14:42:33 - INFO - __main__ - Step 124062: {'lr': 3.694816982846541e-05, 'samples': 23819904, 'steps': 124061, 'loss/train': 1.7016663551330566} 11/07/2021 14:42:33 - INFO - __main__ - Step 124063: {'lr': 3.694539336529196e-05, 'samples': 23820096, 'steps': 124062, 'loss/train': 0.9996981620788574} 11/07/2021 14:42:34 - INFO - __main__ - Step 124064: {'lr': 3.694261699811763e-05, 'samples': 23820288, 'steps': 124063, 'loss/train': 1.1947373151779175} 11/07/2021 14:42:35 - INFO - __main__ - Step 124065: {'lr': 3.6939840726943677e-05, 'samples': 23820480, 'steps': 124064, 'loss/train': 1.3133169412612915} 11/07/2021 14:42:35 - INFO - __main__ - Step 124066: {'lr': 3.693706455177134e-05, 'samples': 23820672, 'steps': 124065, 'loss/train': 1.173547625541687} 11/07/2021 14:42:36 - INFO - __main__ - Step 124067: {'lr': 3.693428847260189e-05, 'samples': 23820864, 'steps': 124066, 'loss/train': 1.2869105339050293} 11/07/2021 14:42:36 - INFO - __main__ - Step 124068: {'lr': 3.693151248943652e-05, 'samples': 23821056, 'steps': 124067, 'loss/train': 1.2281675338745117} 11/07/2021 14:42:36 - INFO - __main__ - Step 124069: {'lr': 3.692873660227655e-05, 'samples': 23821248, 'steps': 124068, 'loss/train': 1.1306196451187134} 11/07/2021 14:42:37 - INFO - __main__ - Step 124070: {'lr': 3.6925960811123205e-05, 'samples': 23821440, 'steps': 124069, 'loss/train': 1.4817904233932495} 11/07/2021 14:42:38 - INFO - __main__ - Step 124071: {'lr': 3.692318511597773e-05, 'samples': 23821632, 'steps': 124070, 'loss/train': 1.3460475206375122} 11/07/2021 14:42:38 - INFO - __main__ - Step 124072: {'lr': 3.692040951684139e-05, 'samples': 23821824, 'steps': 124071, 'loss/train': 0.9944090247154236} 11/07/2021 14:42:38 - INFO - __main__ - Step 124073: {'lr': 3.691763401371545e-05, 'samples': 23822016, 'steps': 124072, 'loss/train': 1.3303775787353516} 11/07/2021 14:42:39 - INFO - __main__ - Step 124074: {'lr': 3.691485860660113e-05, 'samples': 23822208, 'steps': 124073, 'loss/train': 1.0358787775039673} 11/07/2021 14:42:40 - INFO - __main__ - Step 124075: {'lr': 3.6912083295499636e-05, 'samples': 23822400, 'steps': 124074, 'loss/train': 1.0341771841049194} 11/07/2021 14:42:40 - INFO - __main__ - Step 124076: {'lr': 3.690930808041229e-05, 'samples': 23822592, 'steps': 124075, 'loss/train': 0.5586075782775879} 11/07/2021 14:42:40 - INFO - __main__ - Step 124077: {'lr': 3.690653296134033e-05, 'samples': 23822784, 'steps': 124076, 'loss/train': 1.3897221088409424} 11/07/2021 14:42:41 - INFO - __main__ - Step 124078: {'lr': 3.6903757938284985e-05, 'samples': 23822976, 'steps': 124077, 'loss/train': 1.2730863094329834} 11/07/2021 14:42:41 - INFO - __main__ - Step 124079: {'lr': 3.690098301124753e-05, 'samples': 23823168, 'steps': 124078, 'loss/train': 1.3178966045379639} 11/07/2021 14:42:42 - INFO - __main__ - Step 124080: {'lr': 3.6898208180229184e-05, 'samples': 23823360, 'steps': 124079, 'loss/train': 1.1034897565841675} 11/07/2021 14:42:43 - INFO - __main__ - Step 124081: {'lr': 3.6895433445231243e-05, 'samples': 23823552, 'steps': 124080, 'loss/train': 1.5937719345092773} 11/07/2021 14:42:43 - INFO - __main__ - Step 124082: {'lr': 3.689265880625492e-05, 'samples': 23823744, 'steps': 124081, 'loss/train': 1.2140448093414307} 11/07/2021 14:42:43 - INFO - __main__ - Step 124083: {'lr': 3.6889884263301476e-05, 'samples': 23823936, 'steps': 124082, 'loss/train': 1.1115281581878662} 11/07/2021 14:42:44 - INFO - __main__ - Step 124084: {'lr': 3.688710981637216e-05, 'samples': 23824128, 'steps': 124083, 'loss/train': 1.3179222345352173} 11/07/2021 14:42:45 - INFO - __main__ - Step 124085: {'lr': 3.688433546546821e-05, 'samples': 23824320, 'steps': 124084, 'loss/train': 1.1861759424209595} 11/07/2021 14:42:45 - INFO - __main__ - Step 124086: {'lr': 3.688156121059091e-05, 'samples': 23824512, 'steps': 124085, 'loss/train': 0.8503082990646362} 11/07/2021 14:42:45 - INFO - __main__ - Step 124087: {'lr': 3.687878705174155e-05, 'samples': 23824704, 'steps': 124086, 'loss/train': 1.595559000968933} 11/07/2021 14:42:46 - INFO - __main__ - Step 124088: {'lr': 3.687601298892126e-05, 'samples': 23824896, 'steps': 124087, 'loss/train': 1.2712351083755493} 11/07/2021 14:42:46 - INFO - __main__ - Step 124089: {'lr': 3.687323902213133e-05, 'samples': 23825088, 'steps': 124088, 'loss/train': 1.186019778251648} 11/07/2021 14:42:47 - INFO - __main__ - Step 124090: {'lr': 3.6870465151373044e-05, 'samples': 23825280, 'steps': 124089, 'loss/train': 1.3668081760406494} 11/07/2021 14:42:47 - INFO - __main__ - Step 124091: {'lr': 3.686769137664764e-05, 'samples': 23825472, 'steps': 124090, 'loss/train': 1.081347942352295} 11/07/2021 14:42:48 - INFO - __main__ - Step 124092: {'lr': 3.6864917697956355e-05, 'samples': 23825664, 'steps': 124091, 'loss/train': 1.6121084690093994} 11/07/2021 14:42:48 - INFO - __main__ - Step 124093: {'lr': 3.686214411530048e-05, 'samples': 23825856, 'steps': 124092, 'loss/train': 1.4698840379714966} 11/07/2021 14:42:48 - INFO - __main__ - Step 124094: {'lr': 3.6859370628681195e-05, 'samples': 23826048, 'steps': 124093, 'loss/train': 1.1888896226882935} 11/07/2021 14:42:50 - INFO - __main__ - Step 124095: {'lr': 3.685659723809981e-05, 'samples': 23826240, 'steps': 124094, 'loss/train': 1.1010611057281494} 11/07/2021 14:42:50 - INFO - __main__ - Step 124096: {'lr': 3.685382394355755e-05, 'samples': 23826432, 'steps': 124095, 'loss/train': 1.2413265705108643} 11/07/2021 14:42:50 - INFO - __main__ - Step 124097: {'lr': 3.6851050745055656e-05, 'samples': 23826624, 'steps': 124096, 'loss/train': 1.2971571683883667} 11/07/2021 14:42:51 - INFO - __main__ - Step 124098: {'lr': 3.684827764259541e-05, 'samples': 23826816, 'steps': 124097, 'loss/train': 1.457143783569336} 11/07/2021 14:42:51 - INFO - __main__ - Step 124099: {'lr': 3.684550463617803e-05, 'samples': 23827008, 'steps': 124098, 'loss/train': 1.446921467781067} 11/07/2021 14:42:52 - INFO - __main__ - Step 124100: {'lr': 3.684273172580482e-05, 'samples': 23827200, 'steps': 124099, 'loss/train': 1.0876528024673462} 11/07/2021 14:42:52 - INFO - __main__ - Step 124101: {'lr': 3.6839958911476953e-05, 'samples': 23827392, 'steps': 124100, 'loss/train': 0.05548003315925598} 11/07/2021 14:42:53 - INFO - __main__ - Step 124102: {'lr': 3.68371861931957e-05, 'samples': 23827584, 'steps': 124101, 'loss/train': 0.6125876307487488} 11/07/2021 14:42:53 - INFO - __main__ - Step 124103: {'lr': 3.6834413570962314e-05, 'samples': 23827776, 'steps': 124102, 'loss/train': 1.0284736156463623} 11/07/2021 14:42:53 - INFO - __main__ - Step 124104: {'lr': 3.683164104477807e-05, 'samples': 23827968, 'steps': 124103, 'loss/train': 1.5663233995437622} 11/07/2021 14:42:54 - INFO - __main__ - Step 124105: {'lr': 3.682886861464418e-05, 'samples': 23828160, 'steps': 124104, 'loss/train': 1.111641526222229} 11/07/2021 14:42:55 - INFO - __main__ - Step 124106: {'lr': 3.6826096280561936e-05, 'samples': 23828352, 'steps': 124105, 'loss/train': 1.5659527778625488} 11/07/2021 14:42:55 - INFO - __main__ - Step 124107: {'lr': 3.682332404253255e-05, 'samples': 23828544, 'steps': 124106, 'loss/train': 1.2240104675292969} 11/07/2021 14:42:56 - INFO - __main__ - Step 124108: {'lr': 3.6820551900557275e-05, 'samples': 23828736, 'steps': 124107, 'loss/train': 1.3211815357208252} 11/07/2021 14:42:56 - INFO - __main__ - Step 124109: {'lr': 3.6817779854637386e-05, 'samples': 23828928, 'steps': 124108, 'loss/train': 1.6489579677581787} 11/07/2021 14:42:57 - INFO - __main__ - Step 124110: {'lr': 3.68150079047741e-05, 'samples': 23829120, 'steps': 124109, 'loss/train': 1.3925447463989258} 11/07/2021 14:42:57 - INFO - __main__ - Step 124111: {'lr': 3.6812236050968756e-05, 'samples': 23829312, 'steps': 124110, 'loss/train': 1.3178147077560425} 11/07/2021 14:42:58 - INFO - __main__ - Step 124112: {'lr': 3.680946429322246e-05, 'samples': 23829504, 'steps': 124111, 'loss/train': 1.251876711845398} 11/07/2021 14:42:58 - INFO - __main__ - Step 124113: {'lr': 3.680669263153655e-05, 'samples': 23829696, 'steps': 124112, 'loss/train': 1.5464131832122803} 11/07/2021 14:42:58 - INFO - __main__ - Step 124114: {'lr': 3.680392106591224e-05, 'samples': 23829888, 'steps': 124113, 'loss/train': 1.1982287168502808} 11/07/2021 14:42:59 - INFO - __main__ - Step 124115: {'lr': 3.680114959635078e-05, 'samples': 23830080, 'steps': 124114, 'loss/train': 1.3674226999282837} 11/07/2021 14:43:00 - INFO - __main__ - Step 124116: {'lr': 3.679837822285345e-05, 'samples': 23830272, 'steps': 124115, 'loss/train': 0.9206868410110474} 11/07/2021 14:43:00 - INFO - __main__ - Step 124117: {'lr': 3.6795606945421476e-05, 'samples': 23830464, 'steps': 124116, 'loss/train': 1.0456665754318237} 11/07/2021 14:43:01 - INFO - __main__ - Step 124118: {'lr': 3.6792835764056096e-05, 'samples': 23830656, 'steps': 124117, 'loss/train': 1.4150325059890747} 11/07/2021 14:43:01 - INFO - __main__ - Step 124119: {'lr': 3.679006467875859e-05, 'samples': 23830848, 'steps': 124118, 'loss/train': 1.5654221773147583} 11/07/2021 14:43:02 - INFO - __main__ - Step 124120: {'lr': 3.678729368953018e-05, 'samples': 23831040, 'steps': 124119, 'loss/train': 0.9281941652297974} 11/07/2021 14:43:02 - INFO - __main__ - Step 124121: {'lr': 3.678452279637215e-05, 'samples': 23831232, 'steps': 124120, 'loss/train': 1.497769832611084} 11/07/2021 14:43:03 - INFO - __main__ - Step 124122: {'lr': 3.6781751999285764e-05, 'samples': 23831424, 'steps': 124121, 'loss/train': 1.9248327016830444} 11/07/2021 14:43:03 - INFO - __main__ - Step 124123: {'lr': 3.677898129827217e-05, 'samples': 23831616, 'steps': 124122, 'loss/train': 1.4159870147705078} 11/07/2021 14:43:03 - INFO - __main__ - Step 124124: {'lr': 3.6776210693332676e-05, 'samples': 23831808, 'steps': 124123, 'loss/train': 1.013342022895813} 11/07/2021 14:43:04 - INFO - __main__ - Step 124125: {'lr': 3.677344018446854e-05, 'samples': 23832000, 'steps': 124124, 'loss/train': 1.1889406442642212} 11/07/2021 14:43:05 - INFO - __main__ - Step 124126: {'lr': 3.6770669771681e-05, 'samples': 23832192, 'steps': 124125, 'loss/train': 1.2299813032150269} 11/07/2021 14:43:05 - INFO - __main__ - Step 124127: {'lr': 3.67678994549713e-05, 'samples': 23832384, 'steps': 124126, 'loss/train': 1.3450974225997925} 11/07/2021 14:43:05 - INFO - __main__ - Step 124128: {'lr': 3.6765129234340695e-05, 'samples': 23832576, 'steps': 124127, 'loss/train': 1.222499966621399} 11/07/2021 14:43:06 - INFO - __main__ - Step 124129: {'lr': 3.676235910979045e-05, 'samples': 23832768, 'steps': 124128, 'loss/train': 1.6719011068344116} 11/07/2021 14:43:06 - INFO - __main__ - Step 124130: {'lr': 3.6759589081321766e-05, 'samples': 23832960, 'steps': 124129, 'loss/train': 1.7718350887298584} 11/07/2021 14:43:08 - INFO - __main__ - Step 124131: {'lr': 3.675681914893594e-05, 'samples': 23833152, 'steps': 124130, 'loss/train': 1.5461623668670654} 11/07/2021 14:43:08 - INFO - __main__ - Step 124132: {'lr': 3.675404931263421e-05, 'samples': 23833344, 'steps': 124131, 'loss/train': 5.7297210693359375} 11/07/2021 14:43:08 - INFO - __main__ - Step 124133: {'lr': 3.6751279572417836e-05, 'samples': 23833536, 'steps': 124132, 'loss/train': 5.703030586242676} 11/07/2021 14:43:09 - INFO - __main__ - Step 124134: {'lr': 3.674850992828802e-05, 'samples': 23833728, 'steps': 124133, 'loss/train': 0.9015304446220398} 11/07/2021 14:43:09 - INFO - __main__ - Step 124135: {'lr': 3.674574038024603e-05, 'samples': 23833920, 'steps': 124134, 'loss/train': 1.1300650835037231} 11/07/2021 14:43:09 - INFO - __main__ - Step 124136: {'lr': 3.6742970928293095e-05, 'samples': 23834112, 'steps': 124135, 'loss/train': 0.9842207431793213} 11/07/2021 14:43:10 - INFO - __main__ - Step 124137: {'lr': 3.674020157243052e-05, 'samples': 23834304, 'steps': 124136, 'loss/train': 1.293269395828247} 11/07/2021 14:43:11 - INFO - __main__ - Step 124138: {'lr': 3.67374323126595e-05, 'samples': 23834496, 'steps': 124137, 'loss/train': 1.3369110822677612} 11/07/2021 14:43:11 - INFO - __main__ - Step 124139: {'lr': 3.67346631489813e-05, 'samples': 23834688, 'steps': 124138, 'loss/train': 1.272962212562561} 11/07/2021 14:43:11 - INFO - __main__ - Step 124140: {'lr': 3.673189408139718e-05, 'samples': 23834880, 'steps': 124139, 'loss/train': 1.4410055875778198} 11/07/2021 14:43:12 - INFO - __main__ - Step 124141: {'lr': 3.672912510990839e-05, 'samples': 23835072, 'steps': 124140, 'loss/train': 1.1403775215148926} 11/07/2021 14:43:13 - INFO - __main__ - Step 124142: {'lr': 3.672635623451614e-05, 'samples': 23835264, 'steps': 124141, 'loss/train': 1.0228307247161865} 11/07/2021 14:43:13 - INFO - __main__ - Step 124143: {'lr': 3.6723587455221696e-05, 'samples': 23835456, 'steps': 124142, 'loss/train': 1.327019453048706} 11/07/2021 14:43:14 - INFO - __main__ - Step 124144: {'lr': 3.67208187720264e-05, 'samples': 23835648, 'steps': 124143, 'loss/train': 1.2897132635116577} 11/07/2021 14:43:14 - INFO - __main__ - Step 124145: {'lr': 3.671805018493135e-05, 'samples': 23835840, 'steps': 124144, 'loss/train': 1.5556503534317017} 11/07/2021 14:43:14 - INFO - __main__ - Step 124146: {'lr': 3.671528169393784e-05, 'samples': 23836032, 'steps': 124145, 'loss/train': 1.2438197135925293} 11/07/2021 14:43:15 - INFO - __main__ - Step 124147: {'lr': 3.6712513299047123e-05, 'samples': 23836224, 'steps': 124146, 'loss/train': 1.2610623836517334} 11/07/2021 14:43:16 - INFO - __main__ - Step 124148: {'lr': 3.6709745000260474e-05, 'samples': 23836416, 'steps': 124147, 'loss/train': 1.1575623750686646} 11/07/2021 14:43:16 - INFO - __main__ - Step 124149: {'lr': 3.670697679757912e-05, 'samples': 23836608, 'steps': 124148, 'loss/train': 1.1767276525497437} 11/07/2021 14:43:16 - INFO - __main__ - Step 124150: {'lr': 3.6704208691004324e-05, 'samples': 23836800, 'steps': 124149, 'loss/train': 1.0771631002426147} 11/07/2021 14:43:17 - INFO - __main__ - Step 124151: {'lr': 3.6701440680537295e-05, 'samples': 23836992, 'steps': 124150, 'loss/train': 0.949032187461853} 11/07/2021 14:43:18 - INFO - __main__ - Step 124152: {'lr': 3.669867276617933e-05, 'samples': 23837184, 'steps': 124151, 'loss/train': 0.7128089070320129} 11/07/2021 14:43:18 - INFO - __main__ - Step 124153: {'lr': 3.669590494793163e-05, 'samples': 23837376, 'steps': 124152, 'loss/train': 0.44508057832717896} 11/07/2021 14:43:19 - INFO - __main__ - Step 124154: {'lr': 3.669313722579548e-05, 'samples': 23837568, 'steps': 124153, 'loss/train': 1.2054649591445923} 11/07/2021 14:43:19 - INFO - __main__ - Step 124155: {'lr': 3.669036959977215e-05, 'samples': 23837760, 'steps': 124154, 'loss/train': 0.8934260010719299} 11/07/2021 14:43:19 - INFO - __main__ - Step 124156: {'lr': 3.6687602069862827e-05, 'samples': 23837952, 'steps': 124155, 'loss/train': 1.5576859712600708} 11/07/2021 14:43:20 - INFO - __main__ - Step 124157: {'lr': 3.668483463606875e-05, 'samples': 23838144, 'steps': 124156, 'loss/train': 1.659271478652954} 11/07/2021 14:43:21 - INFO - __main__ - Step 124158: {'lr': 3.668206729839118e-05, 'samples': 23838336, 'steps': 124157, 'loss/train': 0.5611768960952759} 11/07/2021 14:43:21 - INFO - __main__ - Step 124159: {'lr': 3.66793000568314e-05, 'samples': 23838528, 'steps': 124158, 'loss/train': 1.2754989862442017} 11/07/2021 14:43:22 - INFO - __main__ - Step 124160: {'lr': 3.667653291139064e-05, 'samples': 23838720, 'steps': 124159, 'loss/train': 0.43914780020713806} 11/07/2021 14:43:22 - INFO - __main__ - Step 124161: {'lr': 3.667376586207013e-05, 'samples': 23838912, 'steps': 124160, 'loss/train': 1.2087857723236084} 11/07/2021 14:43:22 - INFO - __main__ - Step 124162: {'lr': 3.6670998908871126e-05, 'samples': 23839104, 'steps': 124161, 'loss/train': 1.1839402914047241} 11/07/2021 14:43:23 - INFO - __main__ - Step 124163: {'lr': 3.6668232051794896e-05, 'samples': 23839296, 'steps': 124162, 'loss/train': 1.4753402471542358} 11/07/2021 14:43:24 - INFO - __main__ - Step 124164: {'lr': 3.666546529084266e-05, 'samples': 23839488, 'steps': 124163, 'loss/train': 1.3614250421524048} 11/07/2021 14:43:24 - INFO - __main__ - Step 124165: {'lr': 3.6662698626015676e-05, 'samples': 23839680, 'steps': 124164, 'loss/train': 1.6659948825836182} 11/07/2021 14:43:24 - INFO - __main__ - Step 124166: {'lr': 3.665993205731519e-05, 'samples': 23839872, 'steps': 124165, 'loss/train': 1.3188061714172363} 11/07/2021 14:43:25 - INFO - __main__ - Step 124167: {'lr': 3.6657165584742496e-05, 'samples': 23840064, 'steps': 124166, 'loss/train': 0.6648744940757751} 11/07/2021 14:43:26 - INFO - __main__ - Step 124168: {'lr': 3.665439920829875e-05, 'samples': 23840256, 'steps': 124167, 'loss/train': 1.512919306755066} 11/07/2021 14:43:26 - INFO - __main__ - Step 124169: {'lr': 3.665163292798521e-05, 'samples': 23840448, 'steps': 124168, 'loss/train': 1.611725926399231} 11/07/2021 14:43:27 - INFO - __main__ - Step 124170: {'lr': 3.664886674380316e-05, 'samples': 23840640, 'steps': 124169, 'loss/train': 0.6756879091262817} 11/07/2021 14:43:27 - INFO - __main__ - Step 124171: {'lr': 3.6646100655753854e-05, 'samples': 23840832, 'steps': 124170, 'loss/train': 1.19868803024292} 11/07/2021 14:43:27 - INFO - __main__ - Step 124172: {'lr': 3.664333466383851e-05, 'samples': 23841024, 'steps': 124171, 'loss/train': 1.4052252769470215} 11/07/2021 14:43:28 - INFO - __main__ - Step 124173: {'lr': 3.664056876805841e-05, 'samples': 23841216, 'steps': 124172, 'loss/train': 1.2993255853652954} 11/07/2021 14:43:29 - INFO - __main__ - Step 124174: {'lr': 3.663780296841476e-05, 'samples': 23841408, 'steps': 124173, 'loss/train': 1.6667557954788208} 11/07/2021 14:43:29 - INFO - __main__ - Step 124175: {'lr': 3.663503726490883e-05, 'samples': 23841600, 'steps': 124174, 'loss/train': 0.7035251259803772} 11/07/2021 14:43:29 - INFO - __main__ - Step 124176: {'lr': 3.663227165754185e-05, 'samples': 23841792, 'steps': 124175, 'loss/train': 1.4127482175827026} 11/07/2021 14:43:30 - INFO - __main__ - Step 124177: {'lr': 3.662950614631508e-05, 'samples': 23841984, 'steps': 124176, 'loss/train': 1.0430148839950562} 11/07/2021 14:43:30 - INFO - __main__ - Step 124178: {'lr': 3.662674073122976e-05, 'samples': 23842176, 'steps': 124177, 'loss/train': 1.5318474769592285} 11/07/2021 14:43:31 - INFO - __main__ - Step 124179: {'lr': 3.6623975412287126e-05, 'samples': 23842368, 'steps': 124178, 'loss/train': 1.320980429649353} 11/07/2021 14:43:31 - INFO - __main__ - Step 124180: {'lr': 3.662121018948847e-05, 'samples': 23842560, 'steps': 124179, 'loss/train': 1.2655799388885498} 11/07/2021 14:43:32 - INFO - __main__ - Step 124181: {'lr': 3.661844506283504e-05, 'samples': 23842752, 'steps': 124180, 'loss/train': 0.9875871539115906} 11/07/2021 14:43:32 - INFO - __main__ - Step 124182: {'lr': 3.661568003232799e-05, 'samples': 23842944, 'steps': 124181, 'loss/train': 1.1209923028945923} 11/07/2021 14:43:33 - INFO - __main__ - Step 124183: {'lr': 3.66129150979686e-05, 'samples': 23843136, 'steps': 124182, 'loss/train': 1.1813150644302368} 11/07/2021 14:43:34 - INFO - __main__ - Step 124184: {'lr': 3.661015025975817e-05, 'samples': 23843328, 'steps': 124183, 'loss/train': 1.4206188917160034} 11/07/2021 14:43:34 - INFO - __main__ - Step 124185: {'lr': 3.660738551769791e-05, 'samples': 23843520, 'steps': 124184, 'loss/train': 1.4556694030761719} 11/07/2021 14:43:34 - INFO - __main__ - Step 124186: {'lr': 3.6604620871789064e-05, 'samples': 23843712, 'steps': 124185, 'loss/train': 0.9993470311164856} 11/07/2021 14:43:35 - INFO - __main__ - Step 124187: {'lr': 3.660185632203286e-05, 'samples': 23843904, 'steps': 124186, 'loss/train': 1.266108512878418} 11/07/2021 14:43:35 - INFO - __main__ - Step 124188: {'lr': 3.659909186843061e-05, 'samples': 23844096, 'steps': 124187, 'loss/train': 1.41244375705719} 11/07/2021 14:43:36 - INFO - __main__ - Step 124189: {'lr': 3.659632751098349e-05, 'samples': 23844288, 'steps': 124188, 'loss/train': 1.3196964263916016} 11/07/2021 14:43:37 - INFO - __main__ - Step 124190: {'lr': 3.6593563249692763e-05, 'samples': 23844480, 'steps': 124189, 'loss/train': 1.2557554244995117} 11/07/2021 14:43:37 - INFO - __main__ - Step 124191: {'lr': 3.65907990845597e-05, 'samples': 23844672, 'steps': 124190, 'loss/train': 1.4018183946609497} 11/07/2021 14:43:37 - INFO - __main__ - Step 124192: {'lr': 3.6588035015585527e-05, 'samples': 23844864, 'steps': 124191, 'loss/train': 1.3159806728363037} 11/07/2021 14:43:38 - INFO - __main__ - Step 124193: {'lr': 3.658527104277148e-05, 'samples': 23845056, 'steps': 124192, 'loss/train': 0.6640650629997253} 11/07/2021 14:43:39 - INFO - __main__ - Step 124194: {'lr': 3.658250716611891e-05, 'samples': 23845248, 'steps': 124193, 'loss/train': 0.0657397136092186} 11/07/2021 14:43:39 - INFO - __main__ - Step 124195: {'lr': 3.657974338562889e-05, 'samples': 23845440, 'steps': 124194, 'loss/train': 1.5797951221466064} 11/07/2021 14:43:40 - INFO - __main__ - Step 124196: {'lr': 3.657697970130272e-05, 'samples': 23845632, 'steps': 124195, 'loss/train': 1.3496265411376953} 11/07/2021 14:43:40 - INFO - __main__ - Step 124197: {'lr': 3.657421611314171e-05, 'samples': 23845824, 'steps': 124196, 'loss/train': 1.496024489402771} 11/07/2021 14:43:40 - INFO - __main__ - Step 124198: {'lr': 3.657145262114703e-05, 'samples': 23846016, 'steps': 124197, 'loss/train': 1.7465487718582153} 11/07/2021 14:43:41 - INFO - __main__ - Step 124199: {'lr': 3.656868922531997e-05, 'samples': 23846208, 'steps': 124198, 'loss/train': 1.7229646444320679} 11/07/2021 14:43:42 - INFO - __main__ - Step 124200: {'lr': 3.656592592566177e-05, 'samples': 23846400, 'steps': 124199, 'loss/train': 0.7806426882743835} 11/07/2021 14:43:43 - INFO - __main__ - Step 124201: {'lr': 3.6563162722173696e-05, 'samples': 23846592, 'steps': 124200, 'loss/train': 0.9980756640434265} 11/07/2021 14:43:43 - INFO - __main__ - Step 124202: {'lr': 3.6560399614856934e-05, 'samples': 23846784, 'steps': 124201, 'loss/train': 0.9566059708595276} 11/07/2021 14:43:43 - INFO - __main__ - Step 124203: {'lr': 3.655763660371278e-05, 'samples': 23846976, 'steps': 124202, 'loss/train': 1.007659673690796} 11/07/2021 14:43:44 - INFO - __main__ - Step 124204: {'lr': 3.655487368874244e-05, 'samples': 23847168, 'steps': 124203, 'loss/train': 1.1740152835845947} 11/07/2021 14:43:44 - INFO - __main__ - Step 124205: {'lr': 3.65521108699472e-05, 'samples': 23847360, 'steps': 124204, 'loss/train': 0.9332311749458313} 11/07/2021 14:43:45 - INFO - __main__ - Step 124206: {'lr': 3.654934814732827e-05, 'samples': 23847552, 'steps': 124205, 'loss/train': 1.2585035562515259} 11/07/2021 14:43:45 - INFO - __main__ - Step 124207: {'lr': 3.654658552088691e-05, 'samples': 23847744, 'steps': 124206, 'loss/train': 0.8685135841369629} 11/07/2021 14:43:46 - INFO - __main__ - Step 124208: {'lr': 3.654382299062445e-05, 'samples': 23847936, 'steps': 124207, 'loss/train': 1.1114822626113892} 11/07/2021 14:43:46 - INFO - __main__ - Step 124209: {'lr': 3.654106055654197e-05, 'samples': 23848128, 'steps': 124208, 'loss/train': 1.3296774625778198} 11/07/2021 14:43:47 - INFO - __main__ - Step 124210: {'lr': 3.6538298218640796e-05, 'samples': 23848320, 'steps': 124209, 'loss/train': 1.4367940425872803} 11/07/2021 14:43:48 - INFO - __main__ - Step 124211: {'lr': 3.653553597692216e-05, 'samples': 23848512, 'steps': 124210, 'loss/train': 1.2808001041412354} 11/07/2021 14:43:48 - INFO - __main__ - Step 124212: {'lr': 3.653277383138734e-05, 'samples': 23848704, 'steps': 124211, 'loss/train': 0.7940772175788879} 11/07/2021 14:43:48 - INFO - __main__ - Step 124213: {'lr': 3.653001178203755e-05, 'samples': 23848896, 'steps': 124212, 'loss/train': 1.1441490650177002} 11/07/2021 14:43:49 - INFO - __main__ - Step 124214: {'lr': 3.652724982887404e-05, 'samples': 23849088, 'steps': 124213, 'loss/train': 1.4099942445755005} 11/07/2021 14:43:49 - INFO - __main__ - Step 124215: {'lr': 3.652448797189803e-05, 'samples': 23849280, 'steps': 124214, 'loss/train': 1.215070366859436} 11/07/2021 14:43:50 - INFO - __main__ - Step 124216: {'lr': 3.652172621111083e-05, 'samples': 23849472, 'steps': 124215, 'loss/train': 1.147278070449829} 11/07/2021 14:43:50 - INFO - __main__ - Step 124217: {'lr': 3.651896454651363e-05, 'samples': 23849664, 'steps': 124216, 'loss/train': 1.0754828453063965} 11/07/2021 14:43:51 - INFO - __main__ - Step 124218: {'lr': 3.6516202978107704e-05, 'samples': 23849856, 'steps': 124217, 'loss/train': 0.9098512530326843} 11/07/2021 14:43:51 - INFO - __main__ - Step 124219: {'lr': 3.651344150589428e-05, 'samples': 23850048, 'steps': 124218, 'loss/train': 1.1124423742294312} 11/07/2021 14:43:51 - INFO - __main__ - Step 124220: {'lr': 3.6510680129874574e-05, 'samples': 23850240, 'steps': 124219, 'loss/train': 0.9504040479660034} 11/07/2021 14:43:52 - INFO - __main__ - Step 124221: {'lr': 3.650791885004995e-05, 'samples': 23850432, 'steps': 124220, 'loss/train': 1.2170007228851318} 11/07/2021 14:43:53 - INFO - __main__ - Step 124222: {'lr': 3.6505157666421514e-05, 'samples': 23850624, 'steps': 124221, 'loss/train': 1.2555195093154907} 11/07/2021 14:43:53 - INFO - __main__ - Step 124223: {'lr': 3.6502396578990544e-05, 'samples': 23850816, 'steps': 124222, 'loss/train': 1.4023151397705078} 11/07/2021 14:43:53 - INFO - __main__ - Step 124224: {'lr': 3.649963558775829e-05, 'samples': 23851008, 'steps': 124223, 'loss/train': 1.193747639656067} 11/07/2021 14:43:54 - INFO - __main__ - Step 124225: {'lr': 3.6496874692726e-05, 'samples': 23851200, 'steps': 124224, 'loss/train': 1.1574435234069824} 11/07/2021 14:43:54 - INFO - __main__ - Step 124226: {'lr': 3.649411389389495e-05, 'samples': 23851392, 'steps': 124225, 'loss/train': 0.8393873572349548} 11/07/2021 14:43:55 - INFO - __main__ - Step 124227: {'lr': 3.649135319126634e-05, 'samples': 23851584, 'steps': 124226, 'loss/train': 1.4177947044372559} 11/07/2021 14:43:56 - INFO - __main__ - Step 124228: {'lr': 3.648859258484144e-05, 'samples': 23851776, 'steps': 124227, 'loss/train': 1.5386461019515991} 11/07/2021 14:43:56 - INFO - __main__ - Step 124229: {'lr': 3.64858320746215e-05, 'samples': 23851968, 'steps': 124228, 'loss/train': 1.1584858894348145} 11/07/2021 14:43:56 - INFO - __main__ - Step 124230: {'lr': 3.6483071660607715e-05, 'samples': 23852160, 'steps': 124229, 'loss/train': 0.9136354923248291} 11/07/2021 14:43:57 - INFO - __main__ - Step 124231: {'lr': 3.648031134280139e-05, 'samples': 23852352, 'steps': 124230, 'loss/train': 0.9778932332992554} 11/07/2021 14:43:58 - INFO - __main__ - Step 124232: {'lr': 3.647755112120374e-05, 'samples': 23852544, 'steps': 124231, 'loss/train': 1.0872185230255127} 11/07/2021 14:43:58 - INFO - __main__ - Step 124233: {'lr': 3.647479099581599e-05, 'samples': 23852736, 'steps': 124232, 'loss/train': 1.1731430292129517} 11/07/2021 14:43:58 - INFO - __main__ - Step 124234: {'lr': 3.647203096663942e-05, 'samples': 23852928, 'steps': 124233, 'loss/train': 0.9146100282669067} 11/07/2021 14:43:59 - INFO - __main__ - Step 124235: {'lr': 3.646927103367531e-05, 'samples': 23853120, 'steps': 124234, 'loss/train': 1.407256007194519} 11/07/2021 14:43:59 - INFO - __main__ - Step 124236: {'lr': 3.646651119692482e-05, 'samples': 23853312, 'steps': 124235, 'loss/train': 1.2690268754959106} 11/07/2021 14:44:00 - INFO - __main__ - Step 124237: {'lr': 3.646375145638919e-05, 'samples': 23853504, 'steps': 124236, 'loss/train': 1.664892554283142} 11/07/2021 14:44:01 - INFO - __main__ - Step 124238: {'lr': 3.6460991812069715e-05, 'samples': 23853696, 'steps': 124237, 'loss/train': 1.3778026103973389} 11/07/2021 14:44:01 - INFO - __main__ - Step 124239: {'lr': 3.64582322639676e-05, 'samples': 23853888, 'steps': 124238, 'loss/train': 0.8770082592964172} 11/07/2021 14:44:01 - INFO - __main__ - Step 124240: {'lr': 3.645547281208414e-05, 'samples': 23854080, 'steps': 124239, 'loss/train': 1.233812928199768} 11/07/2021 14:44:02 - INFO - __main__ - Step 124241: {'lr': 3.645271345642054e-05, 'samples': 23854272, 'steps': 124240, 'loss/train': 1.563847303390503} 11/07/2021 14:44:03 - INFO - __main__ - Step 124242: {'lr': 3.644995419697805e-05, 'samples': 23854464, 'steps': 124241, 'loss/train': 0.04586835578083992} 11/07/2021 14:44:03 - INFO - __main__ - Step 124243: {'lr': 3.644719503375793e-05, 'samples': 23854656, 'steps': 124242, 'loss/train': 1.3521355390548706} 11/07/2021 14:44:03 - INFO - __main__ - Step 124244: {'lr': 3.644443596676139e-05, 'samples': 23854848, 'steps': 124243, 'loss/train': 1.4848108291625977} 11/07/2021 14:44:04 - INFO - __main__ - Step 124245: {'lr': 3.644167699598969e-05, 'samples': 23855040, 'steps': 124244, 'loss/train': 1.269834280014038} 11/07/2021 14:44:04 - INFO - __main__ - Step 124246: {'lr': 3.6438918121444064e-05, 'samples': 23855232, 'steps': 124245, 'loss/train': 1.2530003786087036} 11/07/2021 14:44:05 - INFO - __main__ - Step 124247: {'lr': 3.64361593431258e-05, 'samples': 23855424, 'steps': 124246, 'loss/train': 0.9757335782051086} 11/07/2021 14:44:05 - INFO - __main__ - Step 124248: {'lr': 3.643340066103615e-05, 'samples': 23855616, 'steps': 124247, 'loss/train': 0.9605802893638611} 11/07/2021 14:44:06 - INFO - __main__ - Step 124249: {'lr': 3.643064207517624e-05, 'samples': 23855808, 'steps': 124248, 'loss/train': 1.247441291809082} 11/07/2021 14:44:06 - INFO - __main__ - Step 124250: {'lr': 3.642788358554741e-05, 'samples': 23856000, 'steps': 124249, 'loss/train': 1.0021647214889526} 11/07/2021 14:44:07 - INFO - __main__ - Step 124251: {'lr': 3.6425125192150854e-05, 'samples': 23856192, 'steps': 124250, 'loss/train': 1.4236849546432495} 11/07/2021 14:44:08 - INFO - __main__ - Step 124252: {'lr': 3.642236689498787e-05, 'samples': 23856384, 'steps': 124251, 'loss/train': 0.7506468892097473} 11/07/2021 14:44:08 - INFO - __main__ - Step 124253: {'lr': 3.6419608694059666e-05, 'samples': 23856576, 'steps': 124252, 'loss/train': 0.7790511250495911} 11/07/2021 14:44:08 - INFO - __main__ - Step 124254: {'lr': 3.641685058936747e-05, 'samples': 23856768, 'steps': 124253, 'loss/train': 1.020677924156189} 11/07/2021 14:44:09 - INFO - __main__ - Step 124255: {'lr': 3.6414092580912575e-05, 'samples': 23856960, 'steps': 124254, 'loss/train': 1.2972124814987183} 11/07/2021 14:44:09 - INFO - __main__ - Step 124256: {'lr': 3.641133466869617e-05, 'samples': 23857152, 'steps': 124255, 'loss/train': 1.0691850185394287} 11/07/2021 14:44:10 - INFO - __main__ - Step 124257: {'lr': 3.640857685271953e-05, 'samples': 23857344, 'steps': 124256, 'loss/train': 1.8184664249420166} 11/07/2021 14:44:10 - INFO - __main__ - Step 124258: {'lr': 3.640581913298388e-05, 'samples': 23857536, 'steps': 124257, 'loss/train': 1.1870348453521729} 11/07/2021 14:44:11 - INFO - __main__ - Step 124259: {'lr': 3.640306150949049e-05, 'samples': 23857728, 'steps': 124258, 'loss/train': 0.6526609063148499} 11/07/2021 14:44:11 - INFO - __main__ - Step 124260: {'lr': 3.640030398224059e-05, 'samples': 23857920, 'steps': 124259, 'loss/train': 1.1855230331420898} 11/07/2021 14:44:11 - INFO - __main__ - Step 124261: {'lr': 3.6397546551235446e-05, 'samples': 23858112, 'steps': 124260, 'loss/train': 0.9716941714286804} 11/07/2021 14:44:12 - INFO - __main__ - Step 124262: {'lr': 3.639478921647624e-05, 'samples': 23858304, 'steps': 124261, 'loss/train': 1.1365689039230347} 11/07/2021 14:44:13 - INFO - __main__ - Step 124263: {'lr': 3.639203197796423e-05, 'samples': 23858496, 'steps': 124262, 'loss/train': 0.9719514846801758} 11/07/2021 14:44:13 - INFO - __main__ - Step 124264: {'lr': 3.638927483570067e-05, 'samples': 23858688, 'steps': 124263, 'loss/train': 1.279374361038208} 11/07/2021 14:44:14 - INFO - __main__ - Step 124265: {'lr': 3.6386517789686826e-05, 'samples': 23858880, 'steps': 124264, 'loss/train': 1.3551952838897705} 11/07/2021 14:44:14 - INFO - __main__ - Step 124266: {'lr': 3.6383760839923894e-05, 'samples': 23859072, 'steps': 124265, 'loss/train': 1.188401460647583} 11/07/2021 14:44:15 - INFO - __main__ - Step 124267: {'lr': 3.638100398641317e-05, 'samples': 23859264, 'steps': 124266, 'loss/train': 1.3365247249603271} 11/07/2021 14:44:15 - INFO - __main__ - Step 124268: {'lr': 3.6378247229155865e-05, 'samples': 23859456, 'steps': 124267, 'loss/train': 1.356886863708496} 11/07/2021 14:44:16 - INFO - __main__ - Step 124269: {'lr': 3.63754905681532e-05, 'samples': 23859648, 'steps': 124268, 'loss/train': 0.9807181358337402} 11/07/2021 14:44:16 - INFO - __main__ - Step 124270: {'lr': 3.637273400340646e-05, 'samples': 23859840, 'steps': 124269, 'loss/train': 1.4826539754867554} 11/07/2021 14:44:16 - INFO - __main__ - Step 124271: {'lr': 3.636997753491689e-05, 'samples': 23860032, 'steps': 124270, 'loss/train': 0.982094943523407} 11/07/2021 14:44:17 - INFO - __main__ - Step 124272: {'lr': 3.636722116268568e-05, 'samples': 23860224, 'steps': 124271, 'loss/train': 1.866416335105896} 11/07/2021 14:44:18 - INFO - __main__ - Step 124273: {'lr': 3.6364464886714105e-05, 'samples': 23860416, 'steps': 124272, 'loss/train': 1.3255454301834106} 11/07/2021 14:44:18 - INFO - __main__ - Step 124274: {'lr': 3.636170870700342e-05, 'samples': 23860608, 'steps': 124273, 'loss/train': 1.4014374017715454} 11/07/2021 14:44:18 - INFO - __main__ - Step 124275: {'lr': 3.63589526235549e-05, 'samples': 23860800, 'steps': 124274, 'loss/train': 1.1564854383468628} 11/07/2021 14:44:19 - INFO - __main__ - Step 124276: {'lr': 3.635619663636971e-05, 'samples': 23860992, 'steps': 124275, 'loss/train': 1.2154121398925781} 11/07/2021 14:44:20 - INFO - __main__ - Step 124277: {'lr': 3.635344074544908e-05, 'samples': 23861184, 'steps': 124276, 'loss/train': 1.8327453136444092} 11/07/2021 14:44:20 - INFO - __main__ - Step 124278: {'lr': 3.635068495079433e-05, 'samples': 23861376, 'steps': 124277, 'loss/train': 1.1677591800689697} 11/07/2021 14:44:20 - INFO - __main__ - Step 124279: {'lr': 3.6347929252406654e-05, 'samples': 23861568, 'steps': 124278, 'loss/train': 1.3586267232894897} 11/07/2021 14:44:21 - INFO - __main__ - Step 124280: {'lr': 3.634517365028728e-05, 'samples': 23861760, 'steps': 124279, 'loss/train': 0.5867312550544739} 11/07/2021 14:44:21 - INFO - __main__ - Step 124281: {'lr': 3.6342418144437504e-05, 'samples': 23861952, 'steps': 124280, 'loss/train': 1.1038488149642944} 11/07/2021 14:44:22 - INFO - __main__ - Step 124282: {'lr': 3.63396627348585e-05, 'samples': 23862144, 'steps': 124281, 'loss/train': 1.482396125793457} 11/07/2021 14:44:23 - INFO - __main__ - Step 124283: {'lr': 3.6336907421551574e-05, 'samples': 23862336, 'steps': 124282, 'loss/train': 1.4007575511932373} 11/07/2021 14:44:23 - INFO - __main__ - Step 124284: {'lr': 3.633415220451794e-05, 'samples': 23862528, 'steps': 124283, 'loss/train': 1.3023210763931274} 11/07/2021 14:44:23 - INFO - __main__ - Step 124285: {'lr': 3.633139708375885e-05, 'samples': 23862720, 'steps': 124284, 'loss/train': 1.1242057085037231} 11/07/2021 14:44:24 - INFO - __main__ - Step 124286: {'lr': 3.6328642059275526e-05, 'samples': 23862912, 'steps': 124285, 'loss/train': 0.9876798987388611} 11/07/2021 14:44:24 - INFO - __main__ - Step 124287: {'lr': 3.6325887131069216e-05, 'samples': 23863104, 'steps': 124286, 'loss/train': 1.6120535135269165} 11/07/2021 14:44:25 - INFO - __main__ - Step 124288: {'lr': 3.632313229914122e-05, 'samples': 23863296, 'steps': 124287, 'loss/train': 1.2907832860946655} 11/07/2021 14:44:26 - INFO - __main__ - Step 124289: {'lr': 3.6320377563492655e-05, 'samples': 23863488, 'steps': 124288, 'loss/train': 1.3013485670089722} 11/07/2021 14:44:26 - INFO - __main__ - Step 124290: {'lr': 3.631762292412486e-05, 'samples': 23863680, 'steps': 124289, 'loss/train': 1.6460258960723877} 11/07/2021 14:44:26 - INFO - __main__ - Step 124291: {'lr': 3.6314868381039004e-05, 'samples': 23863872, 'steps': 124290, 'loss/train': 1.2399678230285645} 11/07/2021 14:44:27 - INFO - __main__ - Step 124292: {'lr': 3.6312113934236415e-05, 'samples': 23864064, 'steps': 124291, 'loss/train': 1.1772041320800781} 11/07/2021 14:44:28 - INFO - __main__ - Step 124293: {'lr': 3.630935958371826e-05, 'samples': 23864256, 'steps': 124292, 'loss/train': 1.2632389068603516} 11/07/2021 14:44:28 - INFO - __main__ - Step 124294: {'lr': 3.630660532948582e-05, 'samples': 23864448, 'steps': 124293, 'loss/train': 1.1728193759918213} 11/07/2021 14:44:28 - INFO - __main__ - Step 124295: {'lr': 3.6303851171540336e-05, 'samples': 23864640, 'steps': 124294, 'loss/train': 1.2274572849273682} 11/07/2021 14:44:29 - INFO - __main__ - Step 124296: {'lr': 3.6301097109883025e-05, 'samples': 23864832, 'steps': 124295, 'loss/train': 1.1386518478393555} 11/07/2021 14:44:29 - INFO - __main__ - Step 124297: {'lr': 3.629834314451516e-05, 'samples': 23865024, 'steps': 124296, 'loss/train': 1.736440658569336} 11/07/2021 14:44:30 - INFO - __main__ - Step 124298: {'lr': 3.629558927543794e-05, 'samples': 23865216, 'steps': 124297, 'loss/train': 0.9995026588439941} 11/07/2021 14:44:30 - INFO - __main__ - Step 124299: {'lr': 3.6292835502652636e-05, 'samples': 23865408, 'steps': 124298, 'loss/train': 1.0332013368606567} 11/07/2021 14:44:31 - INFO - __main__ - Step 124300: {'lr': 3.6290081826160505e-05, 'samples': 23865600, 'steps': 124299, 'loss/train': 1.4486364126205444} 11/07/2021 14:44:31 - INFO - __main__ - Step 124301: {'lr': 3.628732824596273e-05, 'samples': 23865792, 'steps': 124300, 'loss/train': 1.3435487747192383} 11/07/2021 14:44:32 - INFO - __main__ - Step 124302: {'lr': 3.628457476206068e-05, 'samples': 23865984, 'steps': 124301, 'loss/train': 1.3255653381347656} 11/07/2021 14:44:32 - INFO - __main__ - Step 124303: {'lr': 3.628182137445543e-05, 'samples': 23866176, 'steps': 124302, 'loss/train': 1.7001971006393433} 11/07/2021 14:44:33 - INFO - __main__ - Step 124304: {'lr': 3.6279068083148293e-05, 'samples': 23866368, 'steps': 124303, 'loss/train': 1.30991792678833} 11/07/2021 14:44:33 - INFO - __main__ - Step 124305: {'lr': 3.627631488814051e-05, 'samples': 23866560, 'steps': 124304, 'loss/train': 1.0455366373062134} 11/07/2021 14:44:34 - INFO - __main__ - Step 124306: {'lr': 3.627356178943331e-05, 'samples': 23866752, 'steps': 124305, 'loss/train': 2.129269599914551} 11/07/2021 14:44:34 - INFO - __main__ - Step 124307: {'lr': 3.627080878702796e-05, 'samples': 23866944, 'steps': 124306, 'loss/train': 1.4265508651733398} 11/07/2021 14:44:34 - INFO - __main__ - Step 124308: {'lr': 3.6268055880925686e-05, 'samples': 23867136, 'steps': 124307, 'loss/train': 1.0762156248092651} 11/07/2021 14:44:36 - INFO - __main__ - Step 124309: {'lr': 3.6265303071127716e-05, 'samples': 23867328, 'steps': 124308, 'loss/train': 1.0148686170578003} 11/07/2021 14:44:36 - INFO - __main__ - Step 124310: {'lr': 3.626255035763532e-05, 'samples': 23867520, 'steps': 124309, 'loss/train': 1.452581524848938} 11/07/2021 14:44:36 - INFO - __main__ - Step 124311: {'lr': 3.625979774044969e-05, 'samples': 23867712, 'steps': 124310, 'loss/train': 1.3602207899093628} 11/07/2021 14:44:37 - INFO - __main__ - Step 124312: {'lr': 3.625704521957213e-05, 'samples': 23867904, 'steps': 124311, 'loss/train': 1.34112548828125} 11/07/2021 14:44:37 - INFO - __main__ - Step 124313: {'lr': 3.6254292795003834e-05, 'samples': 23868096, 'steps': 124312, 'loss/train': 1.0498650074005127} 11/07/2021 14:44:38 - INFO - __main__ - Step 124314: {'lr': 3.625154046674606e-05, 'samples': 23868288, 'steps': 124313, 'loss/train': 1.1993852853775024} 11/07/2021 14:44:38 - INFO - __main__ - Step 124315: {'lr': 3.624878823480007e-05, 'samples': 23868480, 'steps': 124314, 'loss/train': 1.3664500713348389} 11/07/2021 14:44:39 - INFO - __main__ - Step 124316: {'lr': 3.624603609916707e-05, 'samples': 23868672, 'steps': 124315, 'loss/train': 1.0502251386642456} 11/07/2021 14:44:39 - INFO - __main__ - Step 124317: {'lr': 3.624328405984828e-05, 'samples': 23868864, 'steps': 124316, 'loss/train': 1.3475406169891357} 11/07/2021 14:44:39 - INFO - __main__ - Step 124318: {'lr': 3.624053211684497e-05, 'samples': 23869056, 'steps': 124317, 'loss/train': 0.8339822888374329} 11/07/2021 14:44:40 - INFO - __main__ - Step 124319: {'lr': 3.6237780270158366e-05, 'samples': 23869248, 'steps': 124318, 'loss/train': 1.0914477109909058} 11/07/2021 14:44:41 - INFO - __main__ - Step 124320: {'lr': 3.623502851978974e-05, 'samples': 23869440, 'steps': 124319, 'loss/train': 0.5897454023361206} 11/07/2021 14:44:41 - INFO - __main__ - Step 124321: {'lr': 3.6232276865740324e-05, 'samples': 23869632, 'steps': 124320, 'loss/train': 1.4066071510314941} 11/07/2021 14:44:41 - INFO - __main__ - Step 124322: {'lr': 3.6229525308011325e-05, 'samples': 23869824, 'steps': 124321, 'loss/train': 1.2026472091674805} 11/07/2021 14:44:42 - INFO - __main__ - Step 124323: {'lr': 3.6226773846604e-05, 'samples': 23870016, 'steps': 124322, 'loss/train': 1.2017099857330322} 11/07/2021 14:44:43 - INFO - __main__ - Step 124324: {'lr': 3.62240224815196e-05, 'samples': 23870208, 'steps': 124323, 'loss/train': 1.2337700128555298} 11/07/2021 14:44:43 - INFO - __main__ - Step 124325: {'lr': 3.622127121275934e-05, 'samples': 23870400, 'steps': 124324, 'loss/train': 1.3722238540649414} 11/07/2021 14:44:44 - INFO - __main__ - Step 124326: {'lr': 3.62185200403245e-05, 'samples': 23870592, 'steps': 124325, 'loss/train': 1.3211411237716675} 11/07/2021 14:44:44 - INFO - __main__ - Step 124327: {'lr': 3.6215768964216275e-05, 'samples': 23870784, 'steps': 124326, 'loss/train': 0.9687637686729431} 11/07/2021 14:44:44 - INFO - __main__ - Step 124328: {'lr': 3.6213017984435935e-05, 'samples': 23870976, 'steps': 124327, 'loss/train': 1.1239372491836548} 11/07/2021 14:44:45 - INFO - __main__ - Step 124329: {'lr': 3.621026710098477e-05, 'samples': 23871168, 'steps': 124328, 'loss/train': 1.2521330118179321} 11/07/2021 14:44:46 - INFO - __main__ - Step 124330: {'lr': 3.6207516313863904e-05, 'samples': 23871360, 'steps': 124329, 'loss/train': 0.8707616925239563} 11/07/2021 14:44:46 - INFO - __main__ - Step 124331: {'lr': 3.620476562307462e-05, 'samples': 23871552, 'steps': 124330, 'loss/train': 1.0774049758911133} 11/07/2021 14:44:46 - INFO - __main__ - Step 124332: {'lr': 3.62020150286182e-05, 'samples': 23871744, 'steps': 124331, 'loss/train': 1.2471672296524048} 11/07/2021 14:44:47 - INFO - __main__ - Step 124333: {'lr': 3.6199264530495826e-05, 'samples': 23871936, 'steps': 124332, 'loss/train': 1.1556819677352905} 11/07/2021 14:44:47 - INFO - __main__ - Step 124334: {'lr': 3.619651412870875e-05, 'samples': 23872128, 'steps': 124333, 'loss/train': 1.0315245389938354} 11/07/2021 14:44:48 - INFO - __main__ - Step 124335: {'lr': 3.6193763823258255e-05, 'samples': 23872320, 'steps': 124334, 'loss/train': 1.1583585739135742} 11/07/2021 14:44:49 - INFO - __main__ - Step 124336: {'lr': 3.6191013614145536e-05, 'samples': 23872512, 'steps': 124335, 'loss/train': 1.4911714792251587} 11/07/2021 14:44:49 - INFO - __main__ - Step 124337: {'lr': 3.618826350137186e-05, 'samples': 23872704, 'steps': 124336, 'loss/train': 0.08753739297389984} 11/07/2021 14:44:49 - INFO - __main__ - Step 124338: {'lr': 3.6185513484938455e-05, 'samples': 23872896, 'steps': 124337, 'loss/train': 1.7015464305877686} 11/07/2021 14:44:50 - INFO - __main__ - Step 124339: {'lr': 3.618276356484654e-05, 'samples': 23873088, 'steps': 124338, 'loss/train': 1.4447780847549438} 11/07/2021 14:44:51 - INFO - __main__ - Step 124340: {'lr': 3.618001374109739e-05, 'samples': 23873280, 'steps': 124339, 'loss/train': 1.3463115692138672} 11/07/2021 14:44:51 - INFO - __main__ - Step 124341: {'lr': 3.6177264013692204e-05, 'samples': 23873472, 'steps': 124340, 'loss/train': 1.5541473627090454} 11/07/2021 14:44:52 - INFO - __main__ - Step 124342: {'lr': 3.617451438263231e-05, 'samples': 23873664, 'steps': 124341, 'loss/train': 1.054261565208435} 11/07/2021 14:44:52 - INFO - __main__ - Step 124343: {'lr': 3.617176484791884e-05, 'samples': 23873856, 'steps': 124342, 'loss/train': 0.8179841637611389} 11/07/2021 14:44:52 - INFO - __main__ - Step 124344: {'lr': 3.616901540955306e-05, 'samples': 23874048, 'steps': 124343, 'loss/train': 0.3329102694988251} 11/07/2021 14:44:53 - INFO - __main__ - Step 124345: {'lr': 3.61662660675362e-05, 'samples': 23874240, 'steps': 124344, 'loss/train': 0.9810031056404114} 11/07/2021 14:44:54 - INFO - __main__ - Step 124346: {'lr': 3.6163516821869554e-05, 'samples': 23874432, 'steps': 124345, 'loss/train': 1.4586918354034424} 11/07/2021 14:44:54 - INFO - __main__ - Step 124347: {'lr': 3.616076767255433e-05, 'samples': 23874624, 'steps': 124346, 'loss/train': 1.1733973026275635} 11/07/2021 14:44:54 - INFO - __main__ - Step 124348: {'lr': 3.615801861959175e-05, 'samples': 23874816, 'steps': 124347, 'loss/train': 1.3236428499221802} 11/07/2021 14:44:55 - INFO - __main__ - Step 124349: {'lr': 3.6155269662983046e-05, 'samples': 23875008, 'steps': 124348, 'loss/train': 1.9039913415908813} 11/07/2021 14:44:56 - INFO - __main__ - Step 124350: {'lr': 3.615252080272952e-05, 'samples': 23875200, 'steps': 124349, 'loss/train': 0.9355567097663879} 11/07/2021 14:44:56 - INFO - __main__ - Step 124351: {'lr': 3.614977203883235e-05, 'samples': 23875392, 'steps': 124350, 'loss/train': 1.2357865571975708} 11/07/2021 14:44:57 - INFO - __main__ - Step 124352: {'lr': 3.6147023371292773e-05, 'samples': 23875584, 'steps': 124351, 'loss/train': 1.1730290651321411} 11/07/2021 14:44:57 - INFO - __main__ - Step 124353: {'lr': 3.6144274800112066e-05, 'samples': 23875776, 'steps': 124352, 'loss/train': 0.7596510052680969} 11/07/2021 14:44:57 - INFO - __main__ - Step 124354: {'lr': 3.614152632529147e-05, 'samples': 23875968, 'steps': 124353, 'loss/train': 1.1850467920303345} 11/07/2021 14:44:58 - INFO - __main__ - Step 124355: {'lr': 3.613877794683218e-05, 'samples': 23876160, 'steps': 124354, 'loss/train': 0.8269699811935425} 11/07/2021 14:44:59 - INFO - __main__ - Step 124356: {'lr': 3.6136029664735506e-05, 'samples': 23876352, 'steps': 124355, 'loss/train': 1.0831328630447388} 11/07/2021 14:44:59 - INFO - __main__ - Step 124357: {'lr': 3.6133281479002576e-05, 'samples': 23876544, 'steps': 124356, 'loss/train': 1.5342648029327393} 11/07/2021 14:44:59 - INFO - __main__ - Step 124358: {'lr': 3.613053338963471e-05, 'samples': 23876736, 'steps': 124357, 'loss/train': 1.6546456813812256} 11/07/2021 14:45:00 - INFO - __main__ - Step 124359: {'lr': 3.6127785396633114e-05, 'samples': 23876928, 'steps': 124358, 'loss/train': 0.9160215854644775} 11/07/2021 14:45:01 - INFO - __main__ - Step 124360: {'lr': 3.612503749999904e-05, 'samples': 23877120, 'steps': 124359, 'loss/train': 1.3991776704788208} 11/07/2021 14:45:01 - INFO - __main__ - Step 124361: {'lr': 3.6122289699733716e-05, 'samples': 23877312, 'steps': 124360, 'loss/train': 1.2167558670043945} 11/07/2021 14:45:01 - INFO - __main__ - Step 124362: {'lr': 3.611954199583839e-05, 'samples': 23877504, 'steps': 124361, 'loss/train': 0.5932101011276245} 11/07/2021 14:45:02 - INFO - __main__ - Step 124363: {'lr': 3.61167943883143e-05, 'samples': 23877696, 'steps': 124362, 'loss/train': 1.3515557050704956} 11/07/2021 14:45:02 - INFO - __main__ - Step 124364: {'lr': 3.6114046877162715e-05, 'samples': 23877888, 'steps': 124363, 'loss/train': 1.120893955230713} 11/07/2021 14:45:03 - INFO - __main__ - Step 124365: {'lr': 3.611129946238481e-05, 'samples': 23878080, 'steps': 124364, 'loss/train': 1.710034728050232} 11/07/2021 14:45:04 - INFO - __main__ - Step 124366: {'lr': 3.610855214398184e-05, 'samples': 23878272, 'steps': 124365, 'loss/train': 1.0053799152374268} 11/07/2021 14:45:04 - INFO - __main__ - Step 124367: {'lr': 3.610580492195506e-05, 'samples': 23878464, 'steps': 124366, 'loss/train': 1.1538054943084717} 11/07/2021 14:45:04 - INFO - __main__ - Step 124368: {'lr': 3.6103057796305735e-05, 'samples': 23878656, 'steps': 124367, 'loss/train': 1.097287893295288} 11/07/2021 14:45:05 - INFO - __main__ - Step 124369: {'lr': 3.61003107670351e-05, 'samples': 23878848, 'steps': 124368, 'loss/train': 1.4316160678863525} 11/07/2021 14:45:06 - INFO - __main__ - Step 124370: {'lr': 3.609756383414431e-05, 'samples': 23879040, 'steps': 124369, 'loss/train': 1.4152379035949707} 11/07/2021 14:45:06 - INFO - __main__ - Step 124371: {'lr': 3.609481699763467e-05, 'samples': 23879232, 'steps': 124370, 'loss/train': 0.8976991176605225} 11/07/2021 14:45:06 - INFO - __main__ - Step 124372: {'lr': 3.609207025750738e-05, 'samples': 23879424, 'steps': 124371, 'loss/train': 1.1897375583648682} 11/07/2021 14:45:07 - INFO - __main__ - Step 124373: {'lr': 3.6089323613763716e-05, 'samples': 23879616, 'steps': 124372, 'loss/train': 1.2202335596084595} 11/07/2021 14:45:07 - INFO - __main__ - Step 124374: {'lr': 3.608657706640492e-05, 'samples': 23879808, 'steps': 124373, 'loss/train': 1.1467176675796509} 11/07/2021 14:45:08 - INFO - __main__ - Step 124375: {'lr': 3.608383061543219e-05, 'samples': 23880000, 'steps': 124374, 'loss/train': 1.4785791635513306} 11/07/2021 14:45:08 - INFO - __main__ - Step 124376: {'lr': 3.6081084260846803e-05, 'samples': 23880192, 'steps': 124375, 'loss/train': 1.3106839656829834} 11/07/2021 14:45:09 - INFO - __main__ - Step 124377: {'lr': 3.607833800264995e-05, 'samples': 23880384, 'steps': 124376, 'loss/train': 1.263501524925232} 11/07/2021 14:45:09 - INFO - __main__ - Step 124378: {'lr': 3.607559184084291e-05, 'samples': 23880576, 'steps': 124377, 'loss/train': 1.1802027225494385} 11/07/2021 14:45:10 - INFO - __main__ - Step 124379: {'lr': 3.607284577542691e-05, 'samples': 23880768, 'steps': 124378, 'loss/train': 1.2637696266174316} 11/07/2021 14:45:11 - INFO - __main__ - Step 124380: {'lr': 3.6070099806403246e-05, 'samples': 23880960, 'steps': 124379, 'loss/train': 1.1543328762054443} 11/07/2021 14:45:11 - INFO - __main__ - Step 124381: {'lr': 3.606735393377303e-05, 'samples': 23881152, 'steps': 124380, 'loss/train': 1.0889381170272827} 11/07/2021 14:45:11 - INFO - __main__ - Step 124382: {'lr': 3.6064608157537566e-05, 'samples': 23881344, 'steps': 124381, 'loss/train': 1.2434347867965698} 11/07/2021 14:45:12 - INFO - __main__ - Step 124383: {'lr': 3.6061862477698105e-05, 'samples': 23881536, 'steps': 124382, 'loss/train': 1.6925597190856934} 11/07/2021 14:45:12 - INFO - __main__ - Step 124384: {'lr': 3.605911689425584e-05, 'samples': 23881728, 'steps': 124383, 'loss/train': 1.4249448776245117} 11/07/2021 14:45:13 - INFO - __main__ - Step 124385: {'lr': 3.605637140721205e-05, 'samples': 23881920, 'steps': 124384, 'loss/train': 1.4444966316223145} 11/07/2021 14:45:13 - INFO - __main__ - Step 124386: {'lr': 3.6053626016567945e-05, 'samples': 23882112, 'steps': 124385, 'loss/train': 0.5201152563095093} 11/07/2021 14:45:14 - INFO - __main__ - Step 124387: {'lr': 3.605088072232479e-05, 'samples': 23882304, 'steps': 124386, 'loss/train': 1.8021003007888794} 11/07/2021 14:45:14 - INFO - __main__ - Step 124388: {'lr': 3.60481355244838e-05, 'samples': 23882496, 'steps': 124387, 'loss/train': 1.4698179960250854} 11/07/2021 14:45:14 - INFO - __main__ - Step 124389: {'lr': 3.604539042304622e-05, 'samples': 23882688, 'steps': 124388, 'loss/train': 1.3978465795516968} 11/07/2021 14:45:15 - INFO - __main__ - Step 124390: {'lr': 3.6042645418013276e-05, 'samples': 23882880, 'steps': 124389, 'loss/train': 1.5401028394699097} 11/07/2021 14:45:16 - INFO - __main__ - Step 124391: {'lr': 3.6039900509386296e-05, 'samples': 23883072, 'steps': 124390, 'loss/train': 1.6632808446884155} 11/07/2021 14:45:16 - INFO - __main__ - Step 124392: {'lr': 3.603715569716634e-05, 'samples': 23883264, 'steps': 124391, 'loss/train': 1.1801730394363403} 11/07/2021 14:45:16 - INFO - __main__ - Step 124393: {'lr': 3.6034410981354764e-05, 'samples': 23883456, 'steps': 124392, 'loss/train': 1.5369789600372314} 11/07/2021 14:45:17 - INFO - __main__ - Step 124394: {'lr': 3.6031666361952794e-05, 'samples': 23883648, 'steps': 124393, 'loss/train': 1.448761224746704} 11/07/2021 14:45:17 - INFO - __main__ - Step 124395: {'lr': 3.6028921838961646e-05, 'samples': 23883840, 'steps': 124394, 'loss/train': 1.2914103269577026} 11/07/2021 14:45:18 - INFO - __main__ - Step 124396: {'lr': 3.602617741238254e-05, 'samples': 23884032, 'steps': 124395, 'loss/train': 1.4242271184921265} 11/07/2021 14:45:19 - INFO - __main__ - Step 124397: {'lr': 3.602343308221675e-05, 'samples': 23884224, 'steps': 124396, 'loss/train': 1.2812457084655762} 11/07/2021 14:45:19 - INFO - __main__ - Step 124398: {'lr': 3.6020688848465517e-05, 'samples': 23884416, 'steps': 124397, 'loss/train': 1.2551192045211792} 11/07/2021 14:45:19 - INFO - __main__ - Step 124399: {'lr': 3.601794471113004e-05, 'samples': 23884608, 'steps': 124398, 'loss/train': 1.415263056755066} 11/07/2021 14:45:20 - INFO - __main__ - Step 124400: {'lr': 3.601520067021158e-05, 'samples': 23884800, 'steps': 124399, 'loss/train': 1.5780973434448242} 11/07/2021 14:45:21 - INFO - __main__ - Step 124401: {'lr': 3.6012456725711437e-05, 'samples': 23884992, 'steps': 124400, 'loss/train': 1.4972641468048096} 11/07/2021 14:45:21 - INFO - __main__ - Step 124402: {'lr': 3.600971287763069e-05, 'samples': 23885184, 'steps': 124401, 'loss/train': 1.009102702140808} 11/07/2021 14:45:21 - INFO - __main__ - Step 124403: {'lr': 3.600696912597068e-05, 'samples': 23885376, 'steps': 124402, 'loss/train': 1.2481952905654907} 11/07/2021 14:45:22 - INFO - __main__ - Step 124404: {'lr': 3.600422547073265e-05, 'samples': 23885568, 'steps': 124403, 'loss/train': 1.469608187675476} 11/07/2021 14:45:22 - INFO - __main__ - Step 124405: {'lr': 3.600148191191779e-05, 'samples': 23885760, 'steps': 124404, 'loss/train': 1.6915360689163208} 11/07/2021 14:45:23 - INFO - __main__ - Step 124406: {'lr': 3.599873844952736e-05, 'samples': 23885952, 'steps': 124405, 'loss/train': 1.2618106603622437} 11/07/2021 14:45:23 - INFO - __main__ - Step 124407: {'lr': 3.5995995083562604e-05, 'samples': 23886144, 'steps': 124406, 'loss/train': 1.298259973526001} 11/07/2021 14:45:24 - INFO - __main__ - Step 124408: {'lr': 3.599325181402474e-05, 'samples': 23886336, 'steps': 124407, 'loss/train': 1.577540636062622} 11/07/2021 14:45:24 - INFO - __main__ - Step 124409: {'lr': 3.599050864091505e-05, 'samples': 23886528, 'steps': 124408, 'loss/train': 1.1950535774230957} 11/07/2021 14:45:25 - INFO - __main__ - Step 124410: {'lr': 3.598776556423469e-05, 'samples': 23886720, 'steps': 124409, 'loss/train': 1.6308948993682861} 11/07/2021 14:45:26 - INFO - __main__ - Step 124411: {'lr': 3.598502258398495e-05, 'samples': 23886912, 'steps': 124410, 'loss/train': 1.6730382442474365} 11/07/2021 14:45:26 - INFO - __main__ - Step 124412: {'lr': 3.598227970016712e-05, 'samples': 23887104, 'steps': 124411, 'loss/train': 1.2997030019760132} 11/07/2021 14:45:27 - INFO - __main__ - Step 124413: {'lr': 3.597953691278233e-05, 'samples': 23887296, 'steps': 124412, 'loss/train': 1.3759361505508423} 11/07/2021 14:45:27 - INFO - __main__ - Step 124414: {'lr': 3.597679422183184e-05, 'samples': 23887488, 'steps': 124413, 'loss/train': 0.9964688420295715} 11/07/2021 14:45:27 - INFO - __main__ - Step 124415: {'lr': 3.597405162731693e-05, 'samples': 23887680, 'steps': 124414, 'loss/train': 1.978069543838501} 11/07/2021 14:45:28 - INFO - __main__ - Step 124416: {'lr': 3.5971309129238766e-05, 'samples': 23887872, 'steps': 124415, 'loss/train': 1.3291023969650269} 11/07/2021 14:45:29 - INFO - __main__ - Step 124417: {'lr': 3.596856672759866e-05, 'samples': 23888064, 'steps': 124416, 'loss/train': 1.3588060140609741} 11/07/2021 14:45:29 - INFO - __main__ - Step 124418: {'lr': 3.596582442239779e-05, 'samples': 23888256, 'steps': 124417, 'loss/train': 1.6561588048934937} 11/07/2021 14:45:29 - INFO - __main__ - Step 124419: {'lr': 3.596308221363745e-05, 'samples': 23888448, 'steps': 124418, 'loss/train': 1.058531403541565} 11/07/2021 14:45:30 - INFO - __main__ - Step 124420: {'lr': 3.596034010131882e-05, 'samples': 23888640, 'steps': 124419, 'loss/train': 0.801151692867279} 11/07/2021 14:45:30 - INFO - __main__ - Step 124421: {'lr': 3.595759808544316e-05, 'samples': 23888832, 'steps': 124420, 'loss/train': 1.8615578413009644} 11/07/2021 14:45:31 - INFO - __main__ - Step 124422: {'lr': 3.595485616601171e-05, 'samples': 23889024, 'steps': 124421, 'loss/train': 1.2299389839172363} 11/07/2021 14:45:32 - INFO - __main__ - Step 124423: {'lr': 3.5952114343025754e-05, 'samples': 23889216, 'steps': 124422, 'loss/train': 1.067927598953247} 11/07/2021 14:45:32 - INFO - __main__ - Step 124424: {'lr': 3.59493726164864e-05, 'samples': 23889408, 'steps': 124423, 'loss/train': 1.1212857961654663} 11/07/2021 14:45:32 - INFO - __main__ - Step 124425: {'lr': 3.5946630986394974e-05, 'samples': 23889600, 'steps': 124424, 'loss/train': 1.432543396949768} 11/07/2021 14:45:33 - INFO - __main__ - Step 124426: {'lr': 3.594388945275271e-05, 'samples': 23889792, 'steps': 124425, 'loss/train': 1.5041905641555786} 11/07/2021 14:45:34 - INFO - __main__ - Step 124427: {'lr': 3.594114801556078e-05, 'samples': 23889984, 'steps': 124426, 'loss/train': 1.130056619644165} 11/07/2021 14:45:34 - INFO - __main__ - Step 124428: {'lr': 3.593840667482048e-05, 'samples': 23890176, 'steps': 124427, 'loss/train': 1.3652291297912598} 11/07/2021 14:45:34 - INFO - __main__ - Step 124429: {'lr': 3.593566543053306e-05, 'samples': 23890368, 'steps': 124428, 'loss/train': 1.3120391368865967} 11/07/2021 14:45:35 - INFO - __main__ - Step 124430: {'lr': 3.59329242826997e-05, 'samples': 23890560, 'steps': 124429, 'loss/train': 0.9689830541610718} 11/07/2021 14:45:35 - INFO - __main__ - Step 124431: {'lr': 3.593018323132166e-05, 'samples': 23890752, 'steps': 124430, 'loss/train': 0.9894468784332275} 11/07/2021 14:45:36 - INFO - __main__ - Step 124432: {'lr': 3.5927442276400186e-05, 'samples': 23890944, 'steps': 124431, 'loss/train': 1.0958465337753296} 11/07/2021 14:45:36 - INFO - __main__ - Step 124433: {'lr': 3.592470141793649e-05, 'samples': 23891136, 'steps': 124432, 'loss/train': 0.08986552059650421} 11/07/2021 14:45:37 - INFO - __main__ - Step 124434: {'lr': 3.592196065593184e-05, 'samples': 23891328, 'steps': 124433, 'loss/train': 1.1939316987991333} 11/07/2021 14:45:37 - INFO - __main__ - Step 124435: {'lr': 3.5919219990387445e-05, 'samples': 23891520, 'steps': 124434, 'loss/train': 1.1047252416610718} 11/07/2021 14:45:38 - INFO - __main__ - Step 124436: {'lr': 3.59164794213046e-05, 'samples': 23891712, 'steps': 124435, 'loss/train': 1.6180572509765625} 11/07/2021 14:45:39 - INFO - __main__ - Step 124437: {'lr': 3.5913738948684435e-05, 'samples': 23891904, 'steps': 124436, 'loss/train': 1.2174369096755981} 11/07/2021 14:45:39 - INFO - __main__ - Step 124438: {'lr': 3.591099857252822e-05, 'samples': 23892096, 'steps': 124437, 'loss/train': 5.748526096343994} 11/07/2021 14:45:39 - INFO - __main__ - Step 124439: {'lr': 3.590825829283723e-05, 'samples': 23892288, 'steps': 124438, 'loss/train': 1.227051854133606} 11/07/2021 14:45:40 - INFO - __main__ - Step 124440: {'lr': 3.590551810961265e-05, 'samples': 23892480, 'steps': 124439, 'loss/train': 1.851698637008667} 11/07/2021 14:45:40 - INFO - __main__ - Step 124441: {'lr': 3.5902778022855745e-05, 'samples': 23892672, 'steps': 124440, 'loss/train': 1.5159714221954346} 11/07/2021 14:45:40 - INFO - __main__ - Step 124442: {'lr': 3.590003803256775e-05, 'samples': 23892864, 'steps': 124441, 'loss/train': 0.9155612587928772} 11/07/2021 14:45:42 - INFO - __main__ - Step 124443: {'lr': 3.589729813874989e-05, 'samples': 23893056, 'steps': 124442, 'loss/train': 1.4850668907165527} 11/07/2021 14:45:42 - INFO - __main__ - Step 124444: {'lr': 3.5894558341403425e-05, 'samples': 23893248, 'steps': 124443, 'loss/train': 1.6218068599700928} 11/07/2021 14:45:43 - INFO - __main__ - Step 124445: {'lr': 3.589181864052954e-05, 'samples': 23893440, 'steps': 124444, 'loss/train': 5.503448486328125} 11/07/2021 14:45:43 - INFO - __main__ - Step 124446: {'lr': 3.588907903612951e-05, 'samples': 23893632, 'steps': 124445, 'loss/train': 5.454255104064941} 11/07/2021 14:45:43 - INFO - __main__ - Step 124447: {'lr': 3.588633952820455e-05, 'samples': 23893824, 'steps': 124446, 'loss/train': 5.485781669616699} 11/07/2021 14:45:44 - INFO - __main__ - Step 124448: {'lr': 3.5883600116755926e-05, 'samples': 23894016, 'steps': 124447, 'loss/train': 1.79083251953125} 11/07/2021 14:45:45 - INFO - __main__ - Step 124449: {'lr': 3.588086080178482e-05, 'samples': 23894208, 'steps': 124448, 'loss/train': 1.4317357540130615} 11/07/2021 14:45:45 - INFO - __main__ - Step 124450: {'lr': 3.5878121583292565e-05, 'samples': 23894400, 'steps': 124449, 'loss/train': 1.0655646324157715} 11/07/2021 14:45:46 - INFO - __main__ - Step 124451: {'lr': 3.587538246128025e-05, 'samples': 23894592, 'steps': 124450, 'loss/train': 1.384931206703186} 11/07/2021 14:45:46 - INFO - __main__ - Step 124452: {'lr': 3.58726434357492e-05, 'samples': 23894784, 'steps': 124451, 'loss/train': 1.1057534217834473} 11/07/2021 14:45:46 - INFO - __main__ - Step 124453: {'lr': 3.5869904506700636e-05, 'samples': 23894976, 'steps': 124452, 'loss/train': 1.5753138065338135} 11/07/2021 14:45:47 - INFO - __main__ - Step 124454: {'lr': 3.586716567413578e-05, 'samples': 23895168, 'steps': 124453, 'loss/train': 1.3454744815826416} 11/07/2021 14:45:48 - INFO - __main__ - Step 124455: {'lr': 3.586442693805586e-05, 'samples': 23895360, 'steps': 124454, 'loss/train': 1.7996515035629272} 11/07/2021 14:45:48 - INFO - __main__ - Step 124456: {'lr': 3.5861688298462145e-05, 'samples': 23895552, 'steps': 124455, 'loss/train': 1.1205118894577026} 11/07/2021 14:45:48 - INFO - __main__ - Step 124457: {'lr': 3.585894975535584e-05, 'samples': 23895744, 'steps': 124456, 'loss/train': 1.4266982078552246} 11/07/2021 14:45:49 - INFO - __main__ - Step 124458: {'lr': 3.5856211308738204e-05, 'samples': 23895936, 'steps': 124457, 'loss/train': 1.6244137287139893} 11/07/2021 14:45:49 - INFO - __main__ - Step 124459: {'lr': 3.585347295861044e-05, 'samples': 23896128, 'steps': 124458, 'loss/train': 1.3084912300109863} 11/07/2021 14:45:50 - INFO - __main__ - Step 124460: {'lr': 3.58507347049738e-05, 'samples': 23896320, 'steps': 124459, 'loss/train': 1.376591682434082} 11/07/2021 14:45:50 - INFO - __main__ - Step 124461: {'lr': 3.58479965478295e-05, 'samples': 23896512, 'steps': 124460, 'loss/train': 1.2333581447601318} 11/07/2021 14:45:51 - INFO - __main__ - Step 124462: {'lr': 3.584525848717882e-05, 'samples': 23896704, 'steps': 124461, 'loss/train': 0.8753380179405212} 11/07/2021 14:45:51 - INFO - __main__ - Step 124463: {'lr': 3.5842520523023007e-05, 'samples': 23896896, 'steps': 124462, 'loss/train': 1.2064006328582764} 11/07/2021 14:45:51 - INFO - __main__ - Step 124464: {'lr': 3.583978265536317e-05, 'samples': 23897088, 'steps': 124463, 'loss/train': 1.1793888807296753} 11/07/2021 14:45:53 - INFO - __main__ - Step 124465: {'lr': 3.583704488420064e-05, 'samples': 23897280, 'steps': 124464, 'loss/train': 1.102830410003662} 11/07/2021 14:45:53 - INFO - __main__ - Step 124466: {'lr': 3.583430720953665e-05, 'samples': 23897472, 'steps': 124465, 'loss/train': 1.0367307662963867} 11/07/2021 14:45:53 - INFO - __main__ - Step 124467: {'lr': 3.583156963137238e-05, 'samples': 23897664, 'steps': 124466, 'loss/train': 1.2402793169021606} 11/07/2021 14:45:54 - INFO - __main__ - Step 124468: {'lr': 3.5828832149709115e-05, 'samples': 23897856, 'steps': 124467, 'loss/train': 0.9512218832969666} 11/07/2021 14:45:54 - INFO - __main__ - Step 124469: {'lr': 3.5826094764548096e-05, 'samples': 23898048, 'steps': 124468, 'loss/train': 1.3560206890106201} 11/07/2021 14:45:55 - INFO - __main__ - Step 124470: {'lr': 3.5823357475890495e-05, 'samples': 23898240, 'steps': 124469, 'loss/train': 1.3040003776550293} 11/07/2021 14:45:55 - INFO - __main__ - Step 124471: {'lr': 3.5820620283737615e-05, 'samples': 23898432, 'steps': 124470, 'loss/train': 1.6561349630355835} 11/07/2021 14:45:56 - INFO - __main__ - Step 124472: {'lr': 3.581788318809065e-05, 'samples': 23898624, 'steps': 124471, 'loss/train': 1.637032151222229} 11/07/2021 14:45:56 - INFO - __main__ - Step 124473: {'lr': 3.581514618895082e-05, 'samples': 23898816, 'steps': 124472, 'loss/train': 1.1142797470092773} 11/07/2021 14:45:56 - INFO - __main__ - Step 124474: {'lr': 3.5812409286319404e-05, 'samples': 23899008, 'steps': 124473, 'loss/train': 1.5420581102371216} 11/07/2021 14:45:58 - INFO - __main__ - Step 124475: {'lr': 3.580967248019762e-05, 'samples': 23899200, 'steps': 124474, 'loss/train': 1.1219919919967651} 11/07/2021 14:45:58 - INFO - __main__ - Step 124476: {'lr': 3.5806935770586665e-05, 'samples': 23899392, 'steps': 124475, 'loss/train': 0.5523250699043274} 11/07/2021 14:45:58 - INFO - __main__ - Step 124477: {'lr': 3.580419915748786e-05, 'samples': 23899584, 'steps': 124476, 'loss/train': 0.9966041445732117} 11/07/2021 14:45:59 - INFO - __main__ - Step 124478: {'lr': 3.580146264090234e-05, 'samples': 23899776, 'steps': 124477, 'loss/train': 1.3393046855926514} 11/07/2021 14:45:59 - INFO - __main__ - Step 124479: {'lr': 3.579872622083139e-05, 'samples': 23899968, 'steps': 124478, 'loss/train': 1.2352771759033203} 11/07/2021 14:45:59 - INFO - __main__ - Step 124480: {'lr': 3.57959898972762e-05, 'samples': 23900160, 'steps': 124479, 'loss/train': 1.2774689197540283} 11/07/2021 14:46:01 - INFO - __main__ - Step 124481: {'lr': 3.579325367023803e-05, 'samples': 23900352, 'steps': 124480, 'loss/train': 1.2972793579101562} 11/07/2021 14:46:01 - INFO - __main__ - Step 124482: {'lr': 3.579051753971813e-05, 'samples': 23900544, 'steps': 124481, 'loss/train': 1.1501466035842896} 11/07/2021 14:46:01 - INFO - __main__ - Step 124483: {'lr': 3.578778150571768e-05, 'samples': 23900736, 'steps': 124482, 'loss/train': 1.0808464288711548} 11/07/2021 14:46:02 - INFO - __main__ - Step 124484: {'lr': 3.5785045568238e-05, 'samples': 23900928, 'steps': 124483, 'loss/train': 1.3789182901382446} 11/07/2021 14:46:02 - INFO - __main__ - Step 124485: {'lr': 3.578230972728025e-05, 'samples': 23901120, 'steps': 124484, 'loss/train': 1.1193430423736572} 11/07/2021 14:46:03 - INFO - __main__ - Step 124486: {'lr': 3.577957398284567e-05, 'samples': 23901312, 'steps': 124485, 'loss/train': 0.9501432180404663} 11/07/2021 14:46:03 - INFO - __main__ - Step 124487: {'lr': 3.5776838334935526e-05, 'samples': 23901504, 'steps': 124486, 'loss/train': 1.7216075658798218} 11/07/2021 14:46:04 - INFO - __main__ - Step 124488: {'lr': 3.577410278355103e-05, 'samples': 23901696, 'steps': 124487, 'loss/train': 1.2587000131607056} 11/07/2021 14:46:04 - INFO - __main__ - Step 124489: {'lr': 3.577136732869343e-05, 'samples': 23901888, 'steps': 124488, 'loss/train': 1.2373716831207275} 11/07/2021 14:46:04 - INFO - __main__ - Step 124490: {'lr': 3.576863197036401e-05, 'samples': 23902080, 'steps': 124489, 'loss/train': 0.9121084809303284} 11/07/2021 14:46:05 - INFO - __main__ - Step 124491: {'lr': 3.576589670856384e-05, 'samples': 23902272, 'steps': 124490, 'loss/train': 0.8852536678314209} 11/07/2021 14:46:06 - INFO - __main__ - Step 124492: {'lr': 3.576316154329429e-05, 'samples': 23902464, 'steps': 124491, 'loss/train': 1.2031450271606445} 11/07/2021 14:46:06 - INFO - __main__ - Step 124493: {'lr': 3.5760426474556546e-05, 'samples': 23902656, 'steps': 124492, 'loss/train': 0.8189623951911926} 11/07/2021 14:46:06 - INFO - __main__ - Step 124494: {'lr': 3.5757691502351836e-05, 'samples': 23902848, 'steps': 124493, 'loss/train': 1.4731781482696533} 11/07/2021 14:46:07 - INFO - __main__ - Step 124495: {'lr': 3.5754956626681404e-05, 'samples': 23903040, 'steps': 124494, 'loss/train': 1.0943667888641357} 11/07/2021 14:46:08 - INFO - __main__ - Step 124496: {'lr': 3.575222184754648e-05, 'samples': 23903232, 'steps': 124495, 'loss/train': 1.1381454467773438} 11/07/2021 14:46:08 - INFO - __main__ - Step 124497: {'lr': 3.57494871649483e-05, 'samples': 23903424, 'steps': 124496, 'loss/train': 1.411547064781189} 11/07/2021 14:46:09 - INFO - __main__ - Step 124498: {'lr': 3.5746752578888124e-05, 'samples': 23903616, 'steps': 124497, 'loss/train': 0.9535952806472778} 11/07/2021 14:46:09 - INFO - __main__ - Step 124499: {'lr': 3.574401808936712e-05, 'samples': 23903808, 'steps': 124498, 'loss/train': 1.6107938289642334} 11/07/2021 14:46:09 - INFO - __main__ - Step 124500: {'lr': 3.5741283696386575e-05, 'samples': 23904000, 'steps': 124499, 'loss/train': 1.1042653322219849} 11/07/2021 14:46:10 - INFO - __main__ - Step 124501: {'lr': 3.57385493999477e-05, 'samples': 23904192, 'steps': 124500, 'loss/train': 1.680699110031128} 11/07/2021 14:46:11 - INFO - __main__ - Step 124502: {'lr': 3.5735815200051705e-05, 'samples': 23904384, 'steps': 124501, 'loss/train': 1.2728346586227417} 11/07/2021 14:46:11 - INFO - __main__ - Step 124503: {'lr': 3.5733081096699924e-05, 'samples': 23904576, 'steps': 124502, 'loss/train': 1.4486669301986694} 11/07/2021 14:46:11 - INFO - __main__ - Step 124504: {'lr': 3.573034708989345e-05, 'samples': 23904768, 'steps': 124503, 'loss/train': 1.4543957710266113} 11/07/2021 14:46:12 - INFO - __main__ - Step 124505: {'lr': 3.572761317963358e-05, 'samples': 23904960, 'steps': 124504, 'loss/train': 1.4227246046066284} 11/07/2021 14:46:13 - INFO - __main__ - Step 124506: {'lr': 3.572487936592153e-05, 'samples': 23905152, 'steps': 124505, 'loss/train': 1.2540292739868164} 11/07/2021 14:46:14 - INFO - __main__ - Step 124507: {'lr': 3.572214564875856e-05, 'samples': 23905344, 'steps': 124506, 'loss/train': 0.6190057396888733} 11/07/2021 14:46:14 - INFO - __main__ - Step 124508: {'lr': 3.571941202814588e-05, 'samples': 23905536, 'steps': 124507, 'loss/train': 0.969629168510437} 11/07/2021 14:46:14 - INFO - __main__ - Step 124509: {'lr': 3.571667850408472e-05, 'samples': 23905728, 'steps': 124508, 'loss/train': 1.4200688600540161} 11/07/2021 14:46:15 - INFO - __main__ - Step 124510: {'lr': 3.57139450765763e-05, 'samples': 23905920, 'steps': 124509, 'loss/train': 0.22023826837539673} 11/07/2021 14:46:15 - INFO - __main__ - Step 124511: {'lr': 3.571121174562189e-05, 'samples': 23906112, 'steps': 124510, 'loss/train': 0.2636975049972534} 11/07/2021 14:46:16 - INFO - __main__ - Step 124512: {'lr': 3.570847851122269e-05, 'samples': 23906304, 'steps': 124511, 'loss/train': 1.86190927028656} 11/07/2021 14:46:17 - INFO - __main__ - Step 124513: {'lr': 3.570574537337998e-05, 'samples': 23906496, 'steps': 124512, 'loss/train': 1.0418163537979126} 11/07/2021 14:46:17 - INFO - __main__ - Step 124514: {'lr': 3.570301233209491e-05, 'samples': 23906688, 'steps': 124513, 'loss/train': 1.499216914176941} 11/07/2021 14:46:17 - INFO - __main__ - Step 124515: {'lr': 3.570027938736878e-05, 'samples': 23906880, 'steps': 124514, 'loss/train': 0.8478302955627441} 11/07/2021 14:46:18 - INFO - __main__ - Step 124516: {'lr': 3.569754653920279e-05, 'samples': 23907072, 'steps': 124515, 'loss/train': 1.6289030313491821} 11/07/2021 14:46:19 - INFO - __main__ - Step 124517: {'lr': 3.569481378759823e-05, 'samples': 23907264, 'steps': 124516, 'loss/train': 0.5835424065589905} 11/07/2021 14:46:19 - INFO - __main__ - Step 124518: {'lr': 3.569208113255623e-05, 'samples': 23907456, 'steps': 124517, 'loss/train': 1.2853366136550903} 11/07/2021 14:46:19 - INFO - __main__ - Step 124519: {'lr': 3.568934857407807e-05, 'samples': 23907648, 'steps': 124518, 'loss/train': 1.206971526145935} 11/07/2021 14:46:20 - INFO - __main__ - Step 124520: {'lr': 3.568661611216498e-05, 'samples': 23907840, 'steps': 124519, 'loss/train': 1.0374716520309448} 11/07/2021 14:46:20 - INFO - __main__ - Step 124521: {'lr': 3.568388374681819e-05, 'samples': 23908032, 'steps': 124520, 'loss/train': 1.288256049156189} 11/07/2021 14:46:21 - INFO - __main__ - Step 124522: {'lr': 3.568115147803894e-05, 'samples': 23908224, 'steps': 124521, 'loss/train': 1.2782917022705078} 11/07/2021 14:46:21 - INFO - __main__ - Step 124523: {'lr': 3.567841930582846e-05, 'samples': 23908416, 'steps': 124522, 'loss/train': 1.3989094495773315} 11/07/2021 14:46:22 - INFO - __main__ - Step 124524: {'lr': 3.5675687230187965e-05, 'samples': 23908608, 'steps': 124523, 'loss/train': 1.2344894409179688} 11/07/2021 14:46:22 - INFO - __main__ - Step 124525: {'lr': 3.567295525111872e-05, 'samples': 23908800, 'steps': 124524, 'loss/train': 0.7005478143692017} 11/07/2021 14:46:23 - INFO - __main__ - Step 124526: {'lr': 3.5670223368621914e-05, 'samples': 23908992, 'steps': 124525, 'loss/train': 1.2873866558074951} 11/07/2021 14:46:24 - INFO - __main__ - Step 124527: {'lr': 3.5667491582698805e-05, 'samples': 23909184, 'steps': 124526, 'loss/train': 1.1992090940475464} 11/07/2021 14:46:24 - INFO - __main__ - Step 124528: {'lr': 3.5664759893350605e-05, 'samples': 23909376, 'steps': 124527, 'loss/train': 1.3114023208618164} 11/07/2021 14:46:24 - INFO - __main__ - Step 124529: {'lr': 3.5662028300578574e-05, 'samples': 23909568, 'steps': 124528, 'loss/train': 1.7043408155441284} 11/07/2021 14:46:25 - INFO - __main__ - Step 124530: {'lr': 3.5659296804383986e-05, 'samples': 23909760, 'steps': 124529, 'loss/train': 1.0256907939910889} 11/07/2021 14:46:25 - INFO - __main__ - Step 124531: {'lr': 3.5656565404767946e-05, 'samples': 23909952, 'steps': 124530, 'loss/train': 1.7379080057144165} 11/07/2021 14:46:26 - INFO - __main__ - Step 124532: {'lr': 3.565383410173176e-05, 'samples': 23910144, 'steps': 124531, 'loss/train': 1.3064420223236084} 11/07/2021 14:46:26 - INFO - __main__ - Step 124533: {'lr': 3.5651102895276624e-05, 'samples': 23910336, 'steps': 124532, 'loss/train': 1.2948472499847412} 11/07/2021 14:46:27 - INFO - __main__ - Step 124534: {'lr': 3.564837178540381e-05, 'samples': 23910528, 'steps': 124533, 'loss/train': 1.4846330881118774} 11/07/2021 14:46:27 - INFO - __main__ - Step 124535: {'lr': 3.5645640772114515e-05, 'samples': 23910720, 'steps': 124534, 'loss/train': 0.7116425633430481} 11/07/2021 14:46:27 - INFO - __main__ - Step 124536: {'lr': 3.5642909855410024e-05, 'samples': 23910912, 'steps': 124535, 'loss/train': 1.3058605194091797} 11/07/2021 14:46:28 - INFO - __main__ - Step 124537: {'lr': 3.564017903529151e-05, 'samples': 23911104, 'steps': 124536, 'loss/train': 1.2090189456939697} 11/07/2021 14:46:29 - INFO - __main__ - Step 124538: {'lr': 3.563744831176022e-05, 'samples': 23911296, 'steps': 124537, 'loss/train': 1.1366571187973022} 11/07/2021 14:46:29 - INFO - __main__ - Step 124539: {'lr': 3.563471768481738e-05, 'samples': 23911488, 'steps': 124538, 'loss/train': 1.2412242889404297} 11/07/2021 14:46:29 - INFO - __main__ - Step 124540: {'lr': 3.563198715446425e-05, 'samples': 23911680, 'steps': 124539, 'loss/train': 1.4798611402511597} 11/07/2021 14:46:30 - INFO - __main__ - Step 124541: {'lr': 3.562925672070202e-05, 'samples': 23911872, 'steps': 124540, 'loss/train': 0.9929686188697815} 11/07/2021 14:46:30 - INFO - __main__ - Step 124542: {'lr': 3.562652638353195e-05, 'samples': 23912064, 'steps': 124541, 'loss/train': 0.9654789566993713} 11/07/2021 14:46:31 - INFO - __main__ - Step 124543: {'lr': 3.5623796142955246e-05, 'samples': 23912256, 'steps': 124542, 'loss/train': 1.363284945487976} 11/07/2021 14:46:32 - INFO - __main__ - Step 124544: {'lr': 3.562106599897322e-05, 'samples': 23912448, 'steps': 124543, 'loss/train': 1.2241871356964111} 11/07/2021 14:46:32 - INFO - __main__ - Step 124545: {'lr': 3.561833595158698e-05, 'samples': 23912640, 'steps': 124544, 'loss/train': 1.3053152561187744} 11/07/2021 14:46:32 - INFO - __main__ - Step 124546: {'lr': 3.56156060007978e-05, 'samples': 23912832, 'steps': 124545, 'loss/train': 0.2251705378293991} 11/07/2021 14:46:33 - INFO - __main__ - Step 124547: {'lr': 3.561287614660691e-05, 'samples': 23913024, 'steps': 124546, 'loss/train': 1.5516531467437744} 11/07/2021 14:46:34 - INFO - __main__ - Step 124548: {'lr': 3.561014638901558e-05, 'samples': 23913216, 'steps': 124547, 'loss/train': 1.18132746219635} 11/07/2021 14:46:34 - INFO - __main__ - Step 124549: {'lr': 3.560741672802498e-05, 'samples': 23913408, 'steps': 124548, 'loss/train': 1.4976646900177002} 11/07/2021 14:46:34 - INFO - __main__ - Step 124550: {'lr': 3.560468716363638e-05, 'samples': 23913600, 'steps': 124549, 'loss/train': 1.1383929252624512} 11/07/2021 14:46:35 - INFO - __main__ - Step 124551: {'lr': 3.560195769585101e-05, 'samples': 23913792, 'steps': 124550, 'loss/train': 1.6249347925186157} 11/07/2021 14:46:35 - INFO - __main__ - Step 124552: {'lr': 3.559922832467008e-05, 'samples': 23913984, 'steps': 124551, 'loss/train': 0.9842202663421631} 11/07/2021 14:46:36 - INFO - __main__ - Step 124553: {'lr': 3.5596499050094824e-05, 'samples': 23914176, 'steps': 124552, 'loss/train': 1.214299201965332} 11/07/2021 14:46:36 - INFO - __main__ - Step 124554: {'lr': 3.559376987212648e-05, 'samples': 23914368, 'steps': 124553, 'loss/train': 1.3056474924087524} 11/07/2021 14:46:37 - INFO - __main__ - Step 124555: {'lr': 3.559104079076628e-05, 'samples': 23914560, 'steps': 124554, 'loss/train': 1.280997395515442} 11/07/2021 14:46:37 - INFO - __main__ - Step 124556: {'lr': 3.558831180601546e-05, 'samples': 23914752, 'steps': 124555, 'loss/train': 1.0549546480178833} 11/07/2021 14:46:37 - INFO - __main__ - Step 124557: {'lr': 3.5585582917875284e-05, 'samples': 23914944, 'steps': 124556, 'loss/train': 1.537700891494751} 11/07/2021 14:46:39 - INFO - __main__ - Step 124558: {'lr': 3.558285412634687e-05, 'samples': 23915136, 'steps': 124557, 'loss/train': 1.332596778869629} 11/07/2021 14:46:39 - INFO - __main__ - Step 124559: {'lr': 3.558012543143152e-05, 'samples': 23915328, 'steps': 124558, 'loss/train': 1.6375670433044434} 11/07/2021 14:46:39 - INFO - __main__ - Step 124560: {'lr': 3.5577396833130465e-05, 'samples': 23915520, 'steps': 124559, 'loss/train': 1.2774425745010376} 11/07/2021 14:46:40 - INFO - __main__ - Step 124561: {'lr': 3.55746683314449e-05, 'samples': 23915712, 'steps': 124560, 'loss/train': 1.495437502861023} 11/07/2021 14:46:40 - INFO - __main__ - Step 124562: {'lr': 3.557193992637611e-05, 'samples': 23915904, 'steps': 124561, 'loss/train': 0.5562726259231567} 11/07/2021 14:46:40 - INFO - __main__ - Step 124563: {'lr': 3.556921161792528e-05, 'samples': 23916096, 'steps': 124562, 'loss/train': 1.0175915956497192} 11/07/2021 14:46:41 - INFO - __main__ - Step 124564: {'lr': 3.556648340609367e-05, 'samples': 23916288, 'steps': 124563, 'loss/train': 0.8351730704307556} 11/07/2021 14:46:42 - INFO - __main__ - Step 124565: {'lr': 3.556375529088246e-05, 'samples': 23916480, 'steps': 124564, 'loss/train': 1.0053610801696777} 11/07/2021 14:46:42 - INFO - __main__ - Step 124566: {'lr': 3.556102727229293e-05, 'samples': 23916672, 'steps': 124565, 'loss/train': 1.6827330589294434} 11/07/2021 14:46:43 - INFO - __main__ - Step 124567: {'lr': 3.5558299350326314e-05, 'samples': 23916864, 'steps': 124566, 'loss/train': 1.472299337387085} 11/07/2021 14:46:43 - INFO - __main__ - Step 124568: {'lr': 3.5555571524983787e-05, 'samples': 23917056, 'steps': 124567, 'loss/train': 1.3832459449768066} 11/07/2021 14:46:44 - INFO - __main__ - Step 124569: {'lr': 3.555284379626664e-05, 'samples': 23917248, 'steps': 124568, 'loss/train': 0.960456132888794} 11/07/2021 14:46:44 - INFO - __main__ - Step 124570: {'lr': 3.555011616417605e-05, 'samples': 23917440, 'steps': 124569, 'loss/train': 1.513155460357666} 11/07/2021 14:46:45 - INFO - __main__ - Step 124571: {'lr': 3.554738862871335e-05, 'samples': 23917632, 'steps': 124570, 'loss/train': 1.0976550579071045} 11/07/2021 14:46:45 - INFO - __main__ - Step 124572: {'lr': 3.554466118987959e-05, 'samples': 23917824, 'steps': 124571, 'loss/train': 1.626816749572754} 11/07/2021 14:46:45 - INFO - __main__ - Step 124573: {'lr': 3.554193384767612e-05, 'samples': 23918016, 'steps': 124572, 'loss/train': 1.5181077718734741} 11/07/2021 14:46:46 - INFO - __main__ - Step 124574: {'lr': 3.5539206602104165e-05, 'samples': 23918208, 'steps': 124573, 'loss/train': 1.325595498085022} 11/07/2021 14:46:47 - INFO - __main__ - Step 124575: {'lr': 3.553647945316491e-05, 'samples': 23918400, 'steps': 124574, 'loss/train': 1.034805417060852} 11/07/2021 14:46:47 - INFO - __main__ - Step 124576: {'lr': 3.55337524008596e-05, 'samples': 23918592, 'steps': 124575, 'loss/train': 1.1630074977874756} 11/07/2021 14:46:47 - INFO - __main__ - Step 124577: {'lr': 3.553102544518949e-05, 'samples': 23918784, 'steps': 124576, 'loss/train': 1.640674352645874} 11/07/2021 14:46:48 - INFO - __main__ - Step 124578: {'lr': 3.5528298586155776e-05, 'samples': 23918976, 'steps': 124577, 'loss/train': 1.3643852472305298} 11/07/2021 14:46:49 - INFO - __main__ - Step 124579: {'lr': 3.552557182375973e-05, 'samples': 23919168, 'steps': 124578, 'loss/train': 1.4325319528579712} 11/07/2021 14:46:49 - INFO - __main__ - Step 124580: {'lr': 3.5522845158002525e-05, 'samples': 23919360, 'steps': 124579, 'loss/train': 1.4451792240142822} 11/07/2021 14:46:50 - INFO - __main__ - Step 124581: {'lr': 3.552011858888543e-05, 'samples': 23919552, 'steps': 124580, 'loss/train': 1.4060498476028442} 11/07/2021 14:46:50 - INFO - __main__ - Step 124582: {'lr': 3.551739211640964e-05, 'samples': 23919744, 'steps': 124581, 'loss/train': 1.3661402463912964} 11/07/2021 14:46:50 - INFO - __main__ - Step 124583: {'lr': 3.5514665740576435e-05, 'samples': 23919936, 'steps': 124582, 'loss/train': 1.2682327032089233} 11/07/2021 14:46:51 - INFO - __main__ - Step 124584: {'lr': 3.5511939461387033e-05, 'samples': 23920128, 'steps': 124583, 'loss/train': 0.9953180551528931} 11/07/2021 14:46:52 - INFO - __main__ - Step 124585: {'lr': 3.55092132788426e-05, 'samples': 23920320, 'steps': 124584, 'loss/train': 1.2601112127304077} 11/07/2021 14:46:52 - INFO - __main__ - Step 124586: {'lr': 3.5506487192944416e-05, 'samples': 23920512, 'steps': 124585, 'loss/train': 1.5656144618988037} 11/07/2021 14:46:52 - INFO - __main__ - Step 124587: {'lr': 3.5503761203693694e-05, 'samples': 23920704, 'steps': 124586, 'loss/train': 0.8689941167831421} 11/07/2021 14:46:53 - INFO - __main__ - Step 124588: {'lr': 3.550103531109167e-05, 'samples': 23920896, 'steps': 124587, 'loss/train': 1.2896993160247803} 11/07/2021 14:46:53 - INFO - __main__ - Step 124589: {'lr': 3.549830951513955e-05, 'samples': 23921088, 'steps': 124588, 'loss/train': 1.0297259092330933} 11/07/2021 14:46:54 - INFO - __main__ - Step 124590: {'lr': 3.549558381583859e-05, 'samples': 23921280, 'steps': 124589, 'loss/train': 3.6160476207733154} 11/07/2021 14:46:55 - INFO - __main__ - Step 124591: {'lr': 3.549285821319004e-05, 'samples': 23921472, 'steps': 124590, 'loss/train': 1.257359504699707} 11/07/2021 14:46:55 - INFO - __main__ - Step 124592: {'lr': 3.549013270719506e-05, 'samples': 23921664, 'steps': 124591, 'loss/train': 1.5188660621643066} 11/07/2021 14:46:55 - INFO - __main__ - Step 124593: {'lr': 3.5487407297854936e-05, 'samples': 23921856, 'steps': 124592, 'loss/train': 0.5002409219741821} 11/07/2021 14:46:56 - INFO - __main__ - Step 124594: {'lr': 3.548468198517085e-05, 'samples': 23922048, 'steps': 124593, 'loss/train': 1.4734958410263062} 11/07/2021 14:46:57 - INFO - __main__ - Step 124595: {'lr': 3.548195676914409e-05, 'samples': 23922240, 'steps': 124594, 'loss/train': 1.2976338863372803} 11/07/2021 14:46:57 - INFO - __main__ - Step 124596: {'lr': 3.547923164977584e-05, 'samples': 23922432, 'steps': 124595, 'loss/train': 1.6086208820343018} 11/07/2021 14:46:57 - INFO - __main__ - Step 124597: {'lr': 3.5476506627067335e-05, 'samples': 23922624, 'steps': 124596, 'loss/train': 0.6334239840507507} 11/07/2021 14:46:58 - INFO - __main__ - Step 124598: {'lr': 3.547378170101986e-05, 'samples': 23922816, 'steps': 124597, 'loss/train': 1.0182645320892334} 11/07/2021 14:46:58 - INFO - __main__ - Step 124599: {'lr': 3.5471056871634544e-05, 'samples': 23923008, 'steps': 124598, 'loss/train': 1.4364184141159058} 11/07/2021 14:46:59 - INFO - __main__ - Step 124600: {'lr': 3.5468332138912626e-05, 'samples': 23923200, 'steps': 124599, 'loss/train': 1.6977053880691528} 11/07/2021 14:47:00 - INFO - __main__ - Step 124601: {'lr': 3.546560750285541e-05, 'samples': 23923392, 'steps': 124600, 'loss/train': 1.1993645429611206} 11/07/2021 14:47:00 - INFO - __main__ - Step 124602: {'lr': 3.546288296346406e-05, 'samples': 23923584, 'steps': 124601, 'loss/train': 0.9354153275489807} 11/07/2021 14:47:00 - INFO - __main__ - Step 124603: {'lr': 3.5460158520739805e-05, 'samples': 23923776, 'steps': 124602, 'loss/train': 0.8135285973548889} 11/07/2021 14:47:01 - INFO - __main__ - Step 124604: {'lr': 3.545743417468392e-05, 'samples': 23923968, 'steps': 124603, 'loss/train': 1.0716667175292969} 11/07/2021 14:47:03 - INFO - __main__ - Step 124605: {'lr': 3.545470992529759e-05, 'samples': 23924160, 'steps': 124604, 'loss/train': 1.1632177829742432} 11/07/2021 14:47:03 - INFO - __main__ - Step 124606: {'lr': 3.5451985772582076e-05, 'samples': 23924352, 'steps': 124605, 'loss/train': 1.0105624198913574} 11/07/2021 14:47:04 - INFO - __main__ - Step 124607: {'lr': 3.544926171653856e-05, 'samples': 23924544, 'steps': 124606, 'loss/train': 1.4032567739486694} 11/07/2021 14:47:04 - INFO - __main__ - Step 124608: {'lr': 3.544653775716833e-05, 'samples': 23924736, 'steps': 124607, 'loss/train': 1.368841528892517} 11/07/2021 14:47:04 - INFO - __main__ - Step 124609: {'lr': 3.544381389447254e-05, 'samples': 23924928, 'steps': 124608, 'loss/train': 1.4061905145645142} 11/07/2021 14:47:05 - INFO - __main__ - Step 124610: {'lr': 3.544109012845248e-05, 'samples': 23925120, 'steps': 124609, 'loss/train': 1.4260718822479248} 11/07/2021 14:47:05 - INFO - __main__ - Step 124611: {'lr': 3.543836645910942e-05, 'samples': 23925312, 'steps': 124610, 'loss/train': 1.700681209564209} 11/07/2021 14:47:05 - INFO - __main__ - Step 124612: {'lr': 3.543564288644443e-05, 'samples': 23925504, 'steps': 124611, 'loss/train': 1.7284247875213623} 11/07/2021 14:47:06 - INFO - __main__ - Step 124613: {'lr': 3.543291941045887e-05, 'samples': 23925696, 'steps': 124612, 'loss/train': 1.7259323596954346} 11/07/2021 14:47:07 - INFO - __main__ - Step 124614: {'lr': 3.543019603115391e-05, 'samples': 23925888, 'steps': 124613, 'loss/train': 1.7253516912460327} 11/07/2021 14:47:07 - INFO - __main__ - Step 124615: {'lr': 3.5427472748530784e-05, 'samples': 23926080, 'steps': 124614, 'loss/train': 1.7502021789550781} 11/07/2021 14:47:08 - INFO - __main__ - Step 124616: {'lr': 3.542474956259073e-05, 'samples': 23926272, 'steps': 124615, 'loss/train': 0.05634592846035957} 11/07/2021 14:47:08 - INFO - __main__ - Step 124617: {'lr': 3.542202647333498e-05, 'samples': 23926464, 'steps': 124616, 'loss/train': 1.5655555725097656} 11/07/2021 14:47:08 - INFO - __main__ - Step 124618: {'lr': 3.541930348076475e-05, 'samples': 23926656, 'steps': 124617, 'loss/train': 1.2787933349609375} 11/07/2021 14:47:09 - INFO - __main__ - Step 124619: {'lr': 3.541658058488126e-05, 'samples': 23926848, 'steps': 124618, 'loss/train': 1.1552788019180298} 11/07/2021 14:47:10 - INFO - __main__ - Step 124620: {'lr': 3.541385778568576e-05, 'samples': 23927040, 'steps': 124619, 'loss/train': 1.1185388565063477} 11/07/2021 14:47:10 - INFO - __main__ - Step 124621: {'lr': 3.541113508317945e-05, 'samples': 23927232, 'steps': 124620, 'loss/train': 1.2557549476623535} 11/07/2021 14:47:10 - INFO - __main__ - Step 124622: {'lr': 3.5408412477363596e-05, 'samples': 23927424, 'steps': 124621, 'loss/train': 1.403551697731018} 11/07/2021 14:47:11 - INFO - __main__ - Step 124623: {'lr': 3.5405689968239366e-05, 'samples': 23927616, 'steps': 124622, 'loss/train': 1.2849103212356567} 11/07/2021 14:47:12 - INFO - __main__ - Step 124624: {'lr': 3.540296755580805e-05, 'samples': 23927808, 'steps': 124623, 'loss/train': 1.1211265325546265} 11/07/2021 14:47:12 - INFO - __main__ - Step 124625: {'lr': 3.5400245240070906e-05, 'samples': 23928000, 'steps': 124624, 'loss/train': 1.287584662437439} 11/07/2021 14:47:12 - INFO - __main__ - Step 124626: {'lr': 3.539752302102903e-05, 'samples': 23928192, 'steps': 124625, 'loss/train': 1.4726066589355469} 11/07/2021 14:47:13 - INFO - __main__ - Step 124627: {'lr': 3.539480089868372e-05, 'samples': 23928384, 'steps': 124626, 'loss/train': 1.6591224670410156} 11/07/2021 14:47:13 - INFO - __main__ - Step 124628: {'lr': 3.539207887303619e-05, 'samples': 23928576, 'steps': 124627, 'loss/train': 1.5126192569732666} 11/07/2021 14:47:14 - INFO - __main__ - Step 124629: {'lr': 3.538935694408768e-05, 'samples': 23928768, 'steps': 124628, 'loss/train': 1.475695252418518} 11/07/2021 14:47:15 - INFO - __main__ - Step 124630: {'lr': 3.5386635111839425e-05, 'samples': 23928960, 'steps': 124629, 'loss/train': 1.6013071537017822} 11/07/2021 14:47:15 - INFO - __main__ - Step 124631: {'lr': 3.5383913376292626e-05, 'samples': 23929152, 'steps': 124630, 'loss/train': 1.0414401292800903} 11/07/2021 14:47:15 - INFO - __main__ - Step 124632: {'lr': 3.5381191737448556e-05, 'samples': 23929344, 'steps': 124631, 'loss/train': 1.1313793659210205} 11/07/2021 14:47:16 - INFO - __main__ - Step 124633: {'lr': 3.5378470195308374e-05, 'samples': 23929536, 'steps': 124632, 'loss/train': 1.4188803434371948} 11/07/2021 14:47:16 - INFO - __main__ - Step 124634: {'lr': 3.537574874987337e-05, 'samples': 23929728, 'steps': 124633, 'loss/train': 1.6265089511871338} 11/07/2021 14:47:17 - INFO - __main__ - Step 124635: {'lr': 3.537302740114473e-05, 'samples': 23929920, 'steps': 124634, 'loss/train': 1.3058667182922363} 11/07/2021 14:47:18 - INFO - __main__ - Step 124636: {'lr': 3.537030614912368e-05, 'samples': 23930112, 'steps': 124635, 'loss/train': 1.0614649057388306} 11/07/2021 14:47:18 - INFO - __main__ - Step 124637: {'lr': 3.536758499381146e-05, 'samples': 23930304, 'steps': 124636, 'loss/train': 1.4353333711624146} 11/07/2021 14:47:18 - INFO - __main__ - Step 124638: {'lr': 3.536486393520935e-05, 'samples': 23930496, 'steps': 124637, 'loss/train': 1.5226277112960815} 11/07/2021 14:47:19 - INFO - __main__ - Step 124639: {'lr': 3.53621429733185e-05, 'samples': 23930688, 'steps': 124638, 'loss/train': 1.4374619722366333} 11/07/2021 14:47:20 - INFO - __main__ - Step 124640: {'lr': 3.535942210814011e-05, 'samples': 23930880, 'steps': 124639, 'loss/train': 1.8858195543289185} 11/07/2021 14:47:20 - INFO - __main__ - Step 124641: {'lr': 3.5356701339675475e-05, 'samples': 23931072, 'steps': 124640, 'loss/train': 1.0183273553848267} 11/07/2021 14:47:20 - INFO - __main__ - Step 124642: {'lr': 3.5353980667925804e-05, 'samples': 23931264, 'steps': 124641, 'loss/train': 1.4173924922943115} 11/07/2021 14:47:21 - INFO - __main__ - Step 124643: {'lr': 3.53512600928923e-05, 'samples': 23931456, 'steps': 124642, 'loss/train': 1.432166337966919} 11/07/2021 14:47:21 - INFO - __main__ - Step 124644: {'lr': 3.534853961457621e-05, 'samples': 23931648, 'steps': 124643, 'loss/train': 0.6907973885536194} 11/07/2021 14:47:22 - INFO - __main__ - Step 124645: {'lr': 3.534581923297875e-05, 'samples': 23931840, 'steps': 124644, 'loss/train': 1.5388133525848389} 11/07/2021 14:47:23 - INFO - __main__ - Step 124646: {'lr': 3.5343098948101174e-05, 'samples': 23932032, 'steps': 124645, 'loss/train': 0.8098070025444031} 11/07/2021 14:47:23 - INFO - __main__ - Step 124647: {'lr': 3.534037875994467e-05, 'samples': 23932224, 'steps': 124646, 'loss/train': 1.2454403638839722} 11/07/2021 14:47:23 - INFO - __main__ - Step 124648: {'lr': 3.5337658668510546e-05, 'samples': 23932416, 'steps': 124647, 'loss/train': 0.901232898235321} 11/07/2021 14:47:24 - INFO - __main__ - Step 124649: {'lr': 3.5334938673799887e-05, 'samples': 23932608, 'steps': 124648, 'loss/train': 1.0203317403793335} 11/07/2021 14:47:25 - INFO - __main__ - Step 124650: {'lr': 3.533221877581399e-05, 'samples': 23932800, 'steps': 124649, 'loss/train': 1.7661908864974976} 11/07/2021 14:47:25 - INFO - __main__ - Step 124651: {'lr': 3.532949897455409e-05, 'samples': 23932992, 'steps': 124650, 'loss/train': 1.0287420749664307} 11/07/2021 14:47:25 - INFO - __main__ - Step 124652: {'lr': 3.532677927002142e-05, 'samples': 23933184, 'steps': 124651, 'loss/train': 1.242226004600525} 11/07/2021 14:47:26 - INFO - __main__ - Step 124653: {'lr': 3.532405966221719e-05, 'samples': 23933376, 'steps': 124652, 'loss/train': 1.2745530605316162} 11/07/2021 14:47:26 - INFO - __main__ - Step 124654: {'lr': 3.532134015114261e-05, 'samples': 23933568, 'steps': 124653, 'loss/train': 1.3626883029937744} 11/07/2021 14:47:27 - INFO - __main__ - Step 124655: {'lr': 3.5318620736798927e-05, 'samples': 23933760, 'steps': 124654, 'loss/train': 1.250970721244812} 11/07/2021 14:47:27 - INFO - __main__ - Step 124656: {'lr': 3.5315901419187364e-05, 'samples': 23933952, 'steps': 124655, 'loss/train': 1.1476107835769653} 11/07/2021 14:47:28 - INFO - __main__ - Step 124657: {'lr': 3.531318219830912e-05, 'samples': 23934144, 'steps': 124656, 'loss/train': 1.0845112800598145} 11/07/2021 14:47:28 - INFO - __main__ - Step 124658: {'lr': 3.5310463074165465e-05, 'samples': 23934336, 'steps': 124657, 'loss/train': 1.8520081043243408} 11/07/2021 14:47:28 - INFO - __main__ - Step 124659: {'lr': 3.5307744046757656e-05, 'samples': 23934528, 'steps': 124658, 'loss/train': 1.6139674186706543} 11/07/2021 14:47:29 - INFO - __main__ - Step 124660: {'lr': 3.5305025116086824e-05, 'samples': 23934720, 'steps': 124659, 'loss/train': 1.4035474061965942} 11/07/2021 14:47:30 - INFO - __main__ - Step 124661: {'lr': 3.5302306282154195e-05, 'samples': 23934912, 'steps': 124660, 'loss/train': 1.2659317255020142} 11/07/2021 14:47:30 - INFO - __main__ - Step 124662: {'lr': 3.529958754496107e-05, 'samples': 23935104, 'steps': 124661, 'loss/train': 1.5034587383270264} 11/07/2021 14:47:31 - INFO - __main__ - Step 124663: {'lr': 3.5296868904508615e-05, 'samples': 23935296, 'steps': 124662, 'loss/train': 1.2434215545654297} 11/07/2021 14:47:31 - INFO - __main__ - Step 124664: {'lr': 3.529415036079811e-05, 'samples': 23935488, 'steps': 124663, 'loss/train': 1.5083322525024414} 11/07/2021 14:47:31 - INFO - __main__ - Step 124665: {'lr': 3.529143191383072e-05, 'samples': 23935680, 'steps': 124664, 'loss/train': 1.1266798973083496} 11/07/2021 14:47:32 - INFO - __main__ - Step 124666: {'lr': 3.528871356360769e-05, 'samples': 23935872, 'steps': 124665, 'loss/train': 1.091535210609436} 11/07/2021 14:47:33 - INFO - __main__ - Step 124667: {'lr': 3.528599531013027e-05, 'samples': 23936064, 'steps': 124666, 'loss/train': 1.7866570949554443} 11/07/2021 14:47:33 - INFO - __main__ - Step 124668: {'lr': 3.5283277153399685e-05, 'samples': 23936256, 'steps': 124667, 'loss/train': 0.9117711186408997} 11/07/2021 14:47:33 - INFO - __main__ - Step 124669: {'lr': 3.52805590934171e-05, 'samples': 23936448, 'steps': 124668, 'loss/train': 1.3579057455062866} 11/07/2021 14:47:34 - INFO - __main__ - Step 124670: {'lr': 3.527784113018387e-05, 'samples': 23936640, 'steps': 124669, 'loss/train': 0.8246399760246277} 11/07/2021 14:47:35 - INFO - __main__ - Step 124671: {'lr': 3.5275123263701055e-05, 'samples': 23936832, 'steps': 124670, 'loss/train': 1.4250774383544922} 11/07/2021 14:47:35 - INFO - __main__ - Step 124672: {'lr': 3.527240549396998e-05, 'samples': 23937024, 'steps': 124671, 'loss/train': 1.3608198165893555} 11/07/2021 14:47:35 - INFO - __main__ - Step 124673: {'lr': 3.5269687820991824e-05, 'samples': 23937216, 'steps': 124672, 'loss/train': 1.1573710441589355} 11/07/2021 14:47:36 - INFO - __main__ - Step 124674: {'lr': 3.5266970244767827e-05, 'samples': 23937408, 'steps': 124673, 'loss/train': 0.27189165353775024} 11/07/2021 14:47:36 - INFO - __main__ - Step 124675: {'lr': 3.526425276529924e-05, 'samples': 23937600, 'steps': 124674, 'loss/train': 0.9674161076545715} 11/07/2021 14:47:37 - INFO - __main__ - Step 124676: {'lr': 3.526153538258725e-05, 'samples': 23937792, 'steps': 124675, 'loss/train': 1.1587934494018555} 11/07/2021 14:47:38 - INFO - __main__ - Step 124677: {'lr': 3.5258818096633115e-05, 'samples': 23937984, 'steps': 124676, 'loss/train': 1.1676474809646606} 11/07/2021 14:47:38 - INFO - __main__ - Step 124678: {'lr': 3.5256100907438054e-05, 'samples': 23938176, 'steps': 124677, 'loss/train': 1.6107985973358154} 11/07/2021 14:47:38 - INFO - __main__ - Step 124679: {'lr': 3.525338381500326e-05, 'samples': 23938368, 'steps': 124678, 'loss/train': 1.4315109252929688} 11/07/2021 14:47:39 - INFO - __main__ - Step 124680: {'lr': 3.525066681932998e-05, 'samples': 23938560, 'steps': 124679, 'loss/train': 1.1578786373138428} 11/07/2021 14:47:40 - INFO - __main__ - Step 124681: {'lr': 3.5247949920419495e-05, 'samples': 23938752, 'steps': 124680, 'loss/train': 1.2676044702529907} 11/07/2021 14:47:40 - INFO - __main__ - Step 124682: {'lr': 3.524523311827291e-05, 'samples': 23938944, 'steps': 124681, 'loss/train': 1.2890892028808594} 11/07/2021 14:47:40 - INFO - __main__ - Step 124683: {'lr': 3.52425164128915e-05, 'samples': 23939136, 'steps': 124682, 'loss/train': 1.3487845659255981} 11/07/2021 14:47:41 - INFO - __main__ - Step 124684: {'lr': 3.52397998042765e-05, 'samples': 23939328, 'steps': 124683, 'loss/train': 1.4160518646240234} 11/07/2021 14:47:41 - INFO - __main__ - Step 124685: {'lr': 3.523708329242914e-05, 'samples': 23939520, 'steps': 124684, 'loss/train': 1.2345184087753296} 11/07/2021 14:47:42 - INFO - __main__ - Step 124686: {'lr': 3.523436687735065e-05, 'samples': 23939712, 'steps': 124685, 'loss/train': 1.5471214056015015} 11/07/2021 14:47:42 - INFO - __main__ - Step 124687: {'lr': 3.5231650559042206e-05, 'samples': 23939904, 'steps': 124686, 'loss/train': 1.5110564231872559} 11/07/2021 14:47:43 - INFO - __main__ - Step 124688: {'lr': 3.52289343375051e-05, 'samples': 23940096, 'steps': 124687, 'loss/train': 1.145615577697754} 11/07/2021 14:47:43 - INFO - __main__ - Step 124689: {'lr': 3.52262182127405e-05, 'samples': 23940288, 'steps': 124688, 'loss/train': 0.9105579853057861} 11/07/2021 14:47:43 - INFO - __main__ - Step 124690: {'lr': 3.522350218474968e-05, 'samples': 23940480, 'steps': 124689, 'loss/train': 1.3688559532165527} 11/07/2021 14:47:44 - INFO - __main__ - Step 124691: {'lr': 3.522078625353381e-05, 'samples': 23940672, 'steps': 124690, 'loss/train': 1.2698911428451538} 11/07/2021 14:47:45 - INFO - __main__ - Step 124692: {'lr': 3.5218070419094195e-05, 'samples': 23940864, 'steps': 124691, 'loss/train': 1.4102036952972412} 11/07/2021 14:47:45 - INFO - __main__ - Step 124693: {'lr': 3.521535468143197e-05, 'samples': 23941056, 'steps': 124692, 'loss/train': 1.6868373155593872} 11/07/2021 14:47:46 - INFO - __main__ - Step 124694: {'lr': 3.5212639040548364e-05, 'samples': 23941248, 'steps': 124693, 'loss/train': 1.3688119649887085} 11/07/2021 14:47:46 - INFO - __main__ - Step 124695: {'lr': 3.520992349644464e-05, 'samples': 23941440, 'steps': 124694, 'loss/train': 3.022324323654175} 11/07/2021 14:47:46 - INFO - __main__ - Step 124696: {'lr': 3.5207208049122e-05, 'samples': 23941632, 'steps': 124695, 'loss/train': 0.9186707139015198} 11/07/2021 14:47:47 - INFO - __main__ - Step 124697: {'lr': 3.52044926985817e-05, 'samples': 23941824, 'steps': 124696, 'loss/train': 0.8288110494613647} 11/07/2021 14:47:48 - INFO - __main__ - Step 124698: {'lr': 3.520177744482492e-05, 'samples': 23942016, 'steps': 124697, 'loss/train': 1.2498884201049805} 11/07/2021 14:47:48 - INFO - __main__ - Step 124699: {'lr': 3.5199062287852915e-05, 'samples': 23942208, 'steps': 124698, 'loss/train': 1.1135280132293701} 11/07/2021 14:47:48 - INFO - __main__ - Step 124700: {'lr': 3.51963472276669e-05, 'samples': 23942400, 'steps': 124699, 'loss/train': 1.0480375289916992} 11/07/2021 14:47:49 - INFO - __main__ - Step 124701: {'lr': 3.519363226426808e-05, 'samples': 23942592, 'steps': 124700, 'loss/train': 1.6808662414550781} 11/07/2021 14:47:50 - INFO - __main__ - Step 124702: {'lr': 3.519091739765773e-05, 'samples': 23942784, 'steps': 124701, 'loss/train': 1.5523136854171753} 11/07/2021 14:47:50 - INFO - __main__ - Step 124703: {'lr': 3.5188202627837004e-05, 'samples': 23942976, 'steps': 124702, 'loss/train': 1.0069148540496826} 11/07/2021 14:47:51 - INFO - __main__ - Step 124704: {'lr': 3.5185487954807165e-05, 'samples': 23943168, 'steps': 124703, 'loss/train': 0.8324753046035767} 11/07/2021 14:47:51 - INFO - __main__ - Step 124705: {'lr': 3.518277337856951e-05, 'samples': 23943360, 'steps': 124704, 'loss/train': 1.355819582939148} 11/07/2021 14:47:51 - INFO - __main__ - Step 124706: {'lr': 3.51800588991251e-05, 'samples': 23943552, 'steps': 124705, 'loss/train': 1.1511138677597046} 11/07/2021 14:47:52 - INFO - __main__ - Step 124707: {'lr': 3.517734451647525e-05, 'samples': 23943744, 'steps': 124706, 'loss/train': 1.468429684638977} 11/07/2021 14:47:53 - INFO - __main__ - Step 124708: {'lr': 3.5174630230621175e-05, 'samples': 23943936, 'steps': 124707, 'loss/train': 1.5863417387008667} 11/07/2021 14:47:53 - INFO - __main__ - Step 124709: {'lr': 3.517191604156408e-05, 'samples': 23944128, 'steps': 124708, 'loss/train': 6.041017055511475} 11/07/2021 14:47:53 - INFO - __main__ - Step 124710: {'lr': 3.516920194930523e-05, 'samples': 23944320, 'steps': 124709, 'loss/train': 0.8635374307632446} 11/07/2021 14:47:54 - INFO - __main__ - Step 124711: {'lr': 3.516648795384581e-05, 'samples': 23944512, 'steps': 124710, 'loss/train': 1.2123923301696777} 11/07/2021 14:47:54 - INFO - __main__ - Step 124712: {'lr': 3.516377405518706e-05, 'samples': 23944704, 'steps': 124711, 'loss/train': 1.3522015810012817} 11/07/2021 14:47:55 - INFO - __main__ - Step 124713: {'lr': 3.516106025333018e-05, 'samples': 23944896, 'steps': 124712, 'loss/train': 1.3793784379959106} 11/07/2021 14:47:55 - INFO - __main__ - Step 124714: {'lr': 3.5158346548276435e-05, 'samples': 23945088, 'steps': 124713, 'loss/train': 1.1078025102615356} 11/07/2021 14:47:56 - INFO - __main__ - Step 124715: {'lr': 3.515563294002702e-05, 'samples': 23945280, 'steps': 124714, 'loss/train': 1.316932201385498} 11/07/2021 14:47:56 - INFO - __main__ - Step 124716: {'lr': 3.515291942858317e-05, 'samples': 23945472, 'steps': 124715, 'loss/train': 0.1529579758644104} 11/07/2021 14:47:56 - INFO - __main__ - Step 124717: {'lr': 3.515020601394608e-05, 'samples': 23945664, 'steps': 124716, 'loss/train': 1.3346627950668335} 11/07/2021 14:47:58 - INFO - __main__ - Step 124718: {'lr': 3.5147492696117e-05, 'samples': 23945856, 'steps': 124717, 'loss/train': 1.3339282274246216} 11/07/2021 14:47:58 - INFO - __main__ - Step 124719: {'lr': 3.51447794750972e-05, 'samples': 23946048, 'steps': 124718, 'loss/train': 1.3283060789108276} 11/07/2021 14:47:58 - INFO - __main__ - Step 124720: {'lr': 3.514206635088779e-05, 'samples': 23946240, 'steps': 124719, 'loss/train': 1.0299049615859985} 11/07/2021 14:47:59 - INFO - __main__ - Step 124721: {'lr': 3.513935332349005e-05, 'samples': 23946432, 'steps': 124720, 'loss/train': 1.2514196634292603} 11/07/2021 14:47:59 - INFO - __main__ - Step 124722: {'lr': 3.5136640392905205e-05, 'samples': 23946624, 'steps': 124721, 'loss/train': 1.3758643865585327} 11/07/2021 14:48:00 - INFO - __main__ - Step 124723: {'lr': 3.513392755913447e-05, 'samples': 23946816, 'steps': 124722, 'loss/train': 1.247139811515808} 11/07/2021 14:48:00 - INFO - __main__ - Step 124724: {'lr': 3.513121482217907e-05, 'samples': 23947008, 'steps': 124723, 'loss/train': 1.4268386363983154} 11/07/2021 14:48:01 - INFO - __main__ - Step 124725: {'lr': 3.512850218204022e-05, 'samples': 23947200, 'steps': 124724, 'loss/train': 1.659156084060669} 11/07/2021 14:48:01 - INFO - __main__ - Step 124726: {'lr': 3.512578963871915e-05, 'samples': 23947392, 'steps': 124725, 'loss/train': 0.9444100856781006} 11/07/2021 14:48:02 - INFO - __main__ - Step 124727: {'lr': 3.5123077192217105e-05, 'samples': 23947584, 'steps': 124726, 'loss/train': 1.8230178356170654} 11/07/2021 14:48:02 - INFO - __main__ - Step 124728: {'lr': 3.512036484253528e-05, 'samples': 23947776, 'steps': 124727, 'loss/train': 1.2542158365249634} 11/07/2021 14:48:03 - INFO - __main__ - Step 124729: {'lr': 3.5117652589674895e-05, 'samples': 23947968, 'steps': 124728, 'loss/train': 1.2833279371261597} 11/07/2021 14:48:04 - INFO - __main__ - Step 124730: {'lr': 3.51149404336372e-05, 'samples': 23948160, 'steps': 124729, 'loss/train': 1.1134501695632935} 11/07/2021 14:48:04 - INFO - __main__ - Step 124731: {'lr': 3.5112228374423375e-05, 'samples': 23948352, 'steps': 124730, 'loss/train': 1.2479872703552246} 11/07/2021 14:48:04 - INFO - __main__ - Step 124732: {'lr': 3.510951641203472e-05, 'samples': 23948544, 'steps': 124731, 'loss/train': 1.3899261951446533} 11/07/2021 14:48:05 - INFO - __main__ - Step 124733: {'lr': 3.510680454647236e-05, 'samples': 23948736, 'steps': 124732, 'loss/train': 1.7197641134262085} 11/07/2021 14:48:05 - INFO - __main__ - Step 124734: {'lr': 3.510409277773755e-05, 'samples': 23948928, 'steps': 124733, 'loss/train': 1.4770262241363525} 11/07/2021 14:48:06 - INFO - __main__ - Step 124735: {'lr': 3.51013811058315e-05, 'samples': 23949120, 'steps': 124734, 'loss/train': 0.2823501527309418} 11/07/2021 14:48:06 - INFO - __main__ - Step 124736: {'lr': 3.5098669530755465e-05, 'samples': 23949312, 'steps': 124735, 'loss/train': 1.7841978073120117} 11/07/2021 14:48:07 - INFO - __main__ - Step 124737: {'lr': 3.509595805251067e-05, 'samples': 23949504, 'steps': 124736, 'loss/train': 1.2524110078811646} 11/07/2021 14:48:07 - INFO - __main__ - Step 124738: {'lr': 3.509324667109831e-05, 'samples': 23949696, 'steps': 124737, 'loss/train': 1.5139647722244263} 11/07/2021 14:48:07 - INFO - __main__ - Step 124739: {'lr': 3.50905353865196e-05, 'samples': 23949888, 'steps': 124738, 'loss/train': 0.6901364922523499} 11/07/2021 14:48:09 - INFO - __main__ - Step 124740: {'lr': 3.508782419877579e-05, 'samples': 23950080, 'steps': 124739, 'loss/train': 1.7477586269378662} 11/07/2021 14:48:09 - INFO - __main__ - Step 124741: {'lr': 3.5085113107868104e-05, 'samples': 23950272, 'steps': 124740, 'loss/train': 1.4818497896194458} 11/07/2021 14:48:10 - INFO - __main__ - Step 124742: {'lr': 3.508240211379773e-05, 'samples': 23950464, 'steps': 124741, 'loss/train': 1.1239498853683472} 11/07/2021 14:48:10 - INFO - __main__ - Step 124743: {'lr': 3.5079691216565926e-05, 'samples': 23950656, 'steps': 124742, 'loss/train': 0.0827287808060646} 11/07/2021 14:48:10 - INFO - __main__ - Step 124744: {'lr': 3.507698041617388e-05, 'samples': 23950848, 'steps': 124743, 'loss/train': 1.7505180835723877} 11/07/2021 14:48:11 - INFO - __main__ - Step 124745: {'lr': 3.5074269712622845e-05, 'samples': 23951040, 'steps': 124744, 'loss/train': 1.6116628646850586} 11/07/2021 14:48:11 - INFO - __main__ - Step 124746: {'lr': 3.507155910591408e-05, 'samples': 23951232, 'steps': 124745, 'loss/train': 1.720371127128601} 11/07/2021 14:48:12 - INFO - __main__ - Step 124747: {'lr': 3.50688485960487e-05, 'samples': 23951424, 'steps': 124746, 'loss/train': 1.1884535551071167} 11/07/2021 14:48:12 - INFO - __main__ - Step 124748: {'lr': 3.506613818302798e-05, 'samples': 23951616, 'steps': 124747, 'loss/train': 1.4426761865615845} 11/07/2021 14:48:13 - INFO - __main__ - Step 124749: {'lr': 3.5063427866853126e-05, 'samples': 23951808, 'steps': 124748, 'loss/train': 1.62679922580719} 11/07/2021 14:48:13 - INFO - __main__ - Step 124750: {'lr': 3.506071764752539e-05, 'samples': 23952000, 'steps': 124749, 'loss/train': 1.1837444305419922} 11/07/2021 14:48:13 - INFO - __main__ - Step 124751: {'lr': 3.5058007525045984e-05, 'samples': 23952192, 'steps': 124750, 'loss/train': 0.9424183964729309} 11/07/2021 14:48:15 - INFO - __main__ - Step 124752: {'lr': 3.50552974994161e-05, 'samples': 23952384, 'steps': 124751, 'loss/train': 1.5037407875061035} 11/07/2021 14:48:15 - INFO - __main__ - Step 124753: {'lr': 3.5052587570637004e-05, 'samples': 23952576, 'steps': 124752, 'loss/train': 1.4397246837615967} 11/07/2021 14:48:15 - INFO - __main__ - Step 124754: {'lr': 3.5049877738709876e-05, 'samples': 23952768, 'steps': 124753, 'loss/train': 1.266208529472351} 11/07/2021 14:48:16 - INFO - __main__ - Step 124755: {'lr': 3.504716800363597e-05, 'samples': 23952960, 'steps': 124754, 'loss/train': 1.0485984086990356} 11/07/2021 14:48:16 - INFO - __main__ - Step 124756: {'lr': 3.50444583654165e-05, 'samples': 23953152, 'steps': 124755, 'loss/train': 0.8124158382415771} 11/07/2021 14:48:17 - INFO - __main__ - Step 124757: {'lr': 3.504174882405267e-05, 'samples': 23953344, 'steps': 124756, 'loss/train': 1.2694807052612305} 11/07/2021 14:48:17 - INFO - __main__ - Step 124758: {'lr': 3.503903937954572e-05, 'samples': 23953536, 'steps': 124757, 'loss/train': 1.2580056190490723} 11/07/2021 14:48:18 - INFO - __main__ - Step 124759: {'lr': 3.503633003189691e-05, 'samples': 23953728, 'steps': 124758, 'loss/train': 1.2117382287979126} 11/07/2021 14:48:18 - INFO - __main__ - Step 124760: {'lr': 3.503362078110736e-05, 'samples': 23953920, 'steps': 124759, 'loss/train': 1.1752220392227173} 11/07/2021 14:48:18 - INFO - __main__ - Step 124761: {'lr': 3.5030911627178336e-05, 'samples': 23954112, 'steps': 124760, 'loss/train': 1.4076811075210571} 11/07/2021 14:48:20 - INFO - __main__ - Step 124762: {'lr': 3.502820257011105e-05, 'samples': 23954304, 'steps': 124761, 'loss/train': 1.1953095197677612} 11/07/2021 14:48:20 - INFO - __main__ - Step 124763: {'lr': 3.502549360990676e-05, 'samples': 23954496, 'steps': 124762, 'loss/train': 1.7227962017059326} 11/07/2021 14:48:20 - INFO - __main__ - Step 124764: {'lr': 3.502278474656667e-05, 'samples': 23954688, 'steps': 124763, 'loss/train': 1.364123821258545} 11/07/2021 14:48:21 - INFO - __main__ - Step 124765: {'lr': 3.502007598009199e-05, 'samples': 23954880, 'steps': 124764, 'loss/train': 1.078220009803772} 11/07/2021 14:48:21 - INFO - __main__ - Step 124766: {'lr': 3.5017367310483936e-05, 'samples': 23955072, 'steps': 124765, 'loss/train': 1.4862474203109741} 11/07/2021 14:48:22 - INFO - __main__ - Step 124767: {'lr': 3.501465873774376e-05, 'samples': 23955264, 'steps': 124766, 'loss/train': 0.8815580010414124} 11/07/2021 14:48:22 - INFO - __main__ - Step 124768: {'lr': 3.501195026187265e-05, 'samples': 23955456, 'steps': 124767, 'loss/train': 1.3021199703216553} 11/07/2021 14:48:23 - INFO - __main__ - Step 124769: {'lr': 3.500924188287183e-05, 'samples': 23955648, 'steps': 124768, 'loss/train': 0.9490295648574829} 11/07/2021 14:48:23 - INFO - __main__ - Step 124770: {'lr': 3.500653360074255e-05, 'samples': 23955840, 'steps': 124769, 'loss/train': 1.1032992601394653} 11/07/2021 14:48:23 - INFO - __main__ - Step 124771: {'lr': 3.5003825415486e-05, 'samples': 23956032, 'steps': 124770, 'loss/train': 0.9255357384681702} 11/07/2021 14:48:24 - INFO - __main__ - Step 124772: {'lr': 3.5001117327103456e-05, 'samples': 23956224, 'steps': 124771, 'loss/train': 1.507263422012329} 11/07/2021 14:48:25 - INFO - __main__ - Step 124773: {'lr': 3.499840933559603e-05, 'samples': 23956416, 'steps': 124772, 'loss/train': 1.1848738193511963} 11/07/2021 14:48:25 - INFO - __main__ - Step 124774: {'lr': 3.4995701440965004e-05, 'samples': 23956608, 'steps': 124773, 'loss/train': 1.15981924533844} 11/07/2021 14:48:25 - INFO - __main__ - Step 124775: {'lr': 3.4992993643211595e-05, 'samples': 23956800, 'steps': 124774, 'loss/train': 1.4019016027450562} 11/07/2021 14:48:26 - INFO - __main__ - Step 124776: {'lr': 3.499028594233705e-05, 'samples': 23956992, 'steps': 124775, 'loss/train': 1.3728114366531372} 11/07/2021 14:48:27 - INFO - __main__ - Step 124777: {'lr': 3.4987578338342544e-05, 'samples': 23957184, 'steps': 124776, 'loss/train': 1.083038091659546} 11/07/2021 14:48:27 - INFO - __main__ - Step 124778: {'lr': 3.498487083122931e-05, 'samples': 23957376, 'steps': 124777, 'loss/train': 0.9055895805358887} 11/07/2021 14:48:28 - INFO - __main__ - Step 124779: {'lr': 3.498216342099861e-05, 'samples': 23957568, 'steps': 124778, 'loss/train': 1.3423423767089844} 11/07/2021 14:48:28 - INFO - __main__ - Step 124780: {'lr': 3.4979456107651605e-05, 'samples': 23957760, 'steps': 124779, 'loss/train': 1.3430397510528564} 11/07/2021 14:48:28 - INFO - __main__ - Step 124781: {'lr': 3.497674889118954e-05, 'samples': 23957952, 'steps': 124780, 'loss/train': 1.2880635261535645} 11/07/2021 14:48:29 - INFO - __main__ - Step 124782: {'lr': 3.497404177161362e-05, 'samples': 23958144, 'steps': 124781, 'loss/train': 1.7587082386016846} 11/07/2021 14:48:30 - INFO - __main__ - Step 124783: {'lr': 3.497133474892508e-05, 'samples': 23958336, 'steps': 124782, 'loss/train': 1.2900872230529785} 11/07/2021 14:48:30 - INFO - __main__ - Step 124784: {'lr': 3.4968627823125154e-05, 'samples': 23958528, 'steps': 124783, 'loss/train': 1.3712596893310547} 11/07/2021 14:48:30 - INFO - __main__ - Step 124785: {'lr': 3.496592099421506e-05, 'samples': 23958720, 'steps': 124784, 'loss/train': 1.6554203033447266} 11/07/2021 14:48:31 - INFO - __main__ - Step 124786: {'lr': 3.4963214262196036e-05, 'samples': 23958912, 'steps': 124785, 'loss/train': 0.9012446999549866} 11/07/2021 14:48:31 - INFO - __main__ - Step 124787: {'lr': 3.4960507627069236e-05, 'samples': 23959104, 'steps': 124786, 'loss/train': 1.3945538997650146} 11/07/2021 14:48:32 - INFO - __main__ - Step 124788: {'lr': 3.4957801088835896e-05, 'samples': 23959296, 'steps': 124787, 'loss/train': 1.3068798780441284} 11/07/2021 14:48:32 - INFO - __main__ - Step 124789: {'lr': 3.4955094647497244e-05, 'samples': 23959488, 'steps': 124788, 'loss/train': 1.4288358688354492} 11/07/2021 14:48:33 - INFO - __main__ - Step 124790: {'lr': 3.4952388303054526e-05, 'samples': 23959680, 'steps': 124789, 'loss/train': 1.6819403171539307} 11/07/2021 14:48:33 - INFO - __main__ - Step 124791: {'lr': 3.494968205550894e-05, 'samples': 23959872, 'steps': 124790, 'loss/train': 1.2343178987503052} 11/07/2021 14:48:33 - INFO - __main__ - Step 124792: {'lr': 3.4946975904861704e-05, 'samples': 23960064, 'steps': 124791, 'loss/train': 1.3241692781448364} 11/07/2021 14:48:35 - INFO - __main__ - Step 124793: {'lr': 3.494426985111404e-05, 'samples': 23960256, 'steps': 124792, 'loss/train': 1.4182974100112915} 11/07/2021 14:48:35 - INFO - __main__ - Step 124794: {'lr': 3.494156389426717e-05, 'samples': 23960448, 'steps': 124793, 'loss/train': 1.3689557313919067} 11/07/2021 14:48:35 - INFO - __main__ - Step 124795: {'lr': 3.4938858034322314e-05, 'samples': 23960640, 'steps': 124794, 'loss/train': 1.4162044525146484} 11/07/2021 14:48:36 - INFO - __main__ - Step 124796: {'lr': 3.4936152271280694e-05, 'samples': 23960832, 'steps': 124795, 'loss/train': 0.3917936384677887} 11/07/2021 14:48:36 - INFO - __main__ - Step 124797: {'lr': 3.4933446605143525e-05, 'samples': 23961024, 'steps': 124796, 'loss/train': 1.4717841148376465} 11/07/2021 14:48:37 - INFO - __main__ - Step 124798: {'lr': 3.4930741035912015e-05, 'samples': 23961216, 'steps': 124797, 'loss/train': 0.8926539421081543} 11/07/2021 14:48:37 - INFO - __main__ - Step 124799: {'lr': 3.492803556358745e-05, 'samples': 23961408, 'steps': 124798, 'loss/train': 1.2921051979064941} 11/07/2021 14:48:38 - INFO - __main__ - Step 124800: {'lr': 3.4925330188170956e-05, 'samples': 23961600, 'steps': 124799, 'loss/train': 1.3577288389205933} 11/07/2021 14:48:38 - INFO - __main__ - Step 124801: {'lr': 3.492262490966377e-05, 'samples': 23961792, 'steps': 124800, 'loss/train': 1.024158000946045} 11/07/2021 14:48:38 - INFO - __main__ - Step 124802: {'lr': 3.491991972806716e-05, 'samples': 23961984, 'steps': 124801, 'loss/train': 1.5123296976089478} 11/07/2021 14:48:40 - INFO - __main__ - Step 124803: {'lr': 3.4917214643382296e-05, 'samples': 23962176, 'steps': 124802, 'loss/train': 1.2795771360397339} 11/07/2021 14:48:40 - INFO - __main__ - Step 124804: {'lr': 3.491450965561041e-05, 'samples': 23962368, 'steps': 124803, 'loss/train': 1.2957634925842285} 11/07/2021 14:48:40 - INFO - __main__ - Step 124805: {'lr': 3.491180476475273e-05, 'samples': 23962560, 'steps': 124804, 'loss/train': 1.611060619354248} 11/07/2021 14:48:41 - INFO - __main__ - Step 124806: {'lr': 3.490909997081046e-05, 'samples': 23962752, 'steps': 124805, 'loss/train': 1.16643226146698} 11/07/2021 14:48:41 - INFO - __main__ - Step 124807: {'lr': 3.490639527378486e-05, 'samples': 23962944, 'steps': 124806, 'loss/train': 1.5253427028656006} 11/07/2021 14:48:42 - INFO - __main__ - Step 124808: {'lr': 3.49036906736771e-05, 'samples': 23963136, 'steps': 124807, 'loss/train': 1.2868399620056152} 11/07/2021 14:48:42 - INFO - __main__ - Step 124809: {'lr': 3.4900986170488425e-05, 'samples': 23963328, 'steps': 124808, 'loss/train': 1.3629939556121826} 11/07/2021 14:48:43 - INFO - __main__ - Step 124810: {'lr': 3.489828176422005e-05, 'samples': 23963520, 'steps': 124809, 'loss/train': 1.069006085395813} 11/07/2021 14:48:43 - INFO - __main__ - Step 124811: {'lr': 3.489557745487318e-05, 'samples': 23963712, 'steps': 124810, 'loss/train': 1.4287912845611572} 11/07/2021 14:48:43 - INFO - __main__ - Step 124812: {'lr': 3.489287324244905e-05, 'samples': 23963904, 'steps': 124811, 'loss/train': 0.8980966806411743} 11/07/2021 14:48:44 - INFO - __main__ - Step 124813: {'lr': 3.489016912694892e-05, 'samples': 23964096, 'steps': 124812, 'loss/train': 0.9053656458854675} 11/07/2021 14:48:45 - INFO - __main__ - Step 124814: {'lr': 3.48874651083739e-05, 'samples': 23964288, 'steps': 124813, 'loss/train': 1.0312514305114746} 11/07/2021 14:48:45 - INFO - __main__ - Step 124815: {'lr': 3.488476118672529e-05, 'samples': 23964480, 'steps': 124814, 'loss/train': 1.4734560251235962} 11/07/2021 14:48:45 - INFO - __main__ - Step 124816: {'lr': 3.488205736200428e-05, 'samples': 23964672, 'steps': 124815, 'loss/train': 1.347324252128601} 11/07/2021 14:48:46 - INFO - __main__ - Step 124817: {'lr': 3.487935363421207e-05, 'samples': 23964864, 'steps': 124816, 'loss/train': 1.4255653619766235} 11/07/2021 14:48:47 - INFO - __main__ - Step 124818: {'lr': 3.487665000334994e-05, 'samples': 23965056, 'steps': 124817, 'loss/train': 1.0426257848739624} 11/07/2021 14:48:47 - INFO - __main__ - Step 124819: {'lr': 3.487394646941905e-05, 'samples': 23965248, 'steps': 124818, 'loss/train': 1.0971283912658691} 11/07/2021 14:48:47 - INFO - __main__ - Step 124820: {'lr': 3.4871243032420644e-05, 'samples': 23965440, 'steps': 124819, 'loss/train': 1.4182900190353394} 11/07/2021 14:48:48 - INFO - __main__ - Step 124821: {'lr': 3.4868539692355956e-05, 'samples': 23965632, 'steps': 124820, 'loss/train': 1.4477509260177612} 11/07/2021 14:48:48 - INFO - __main__ - Step 124822: {'lr': 3.4865836449226166e-05, 'samples': 23965824, 'steps': 124821, 'loss/train': 1.6433959007263184} 11/07/2021 14:48:49 - INFO - __main__ - Step 124823: {'lr': 3.486313330303251e-05, 'samples': 23966016, 'steps': 124822, 'loss/train': 1.3060953617095947} 11/07/2021 14:48:50 - INFO - __main__ - Step 124824: {'lr': 3.486043025377619e-05, 'samples': 23966208, 'steps': 124823, 'loss/train': 1.2708673477172852} 11/07/2021 14:48:50 - INFO - __main__ - Step 124825: {'lr': 3.4857727301458474e-05, 'samples': 23966400, 'steps': 124824, 'loss/train': 1.5273507833480835} 11/07/2021 14:48:50 - INFO - __main__ - Step 124826: {'lr': 3.4855024446080574e-05, 'samples': 23966592, 'steps': 124825, 'loss/train': 1.8300741910934448} 11/07/2021 14:48:51 - INFO - __main__ - Step 124827: {'lr': 3.485232168764363e-05, 'samples': 23966784, 'steps': 124826, 'loss/train': 1.206321358680725} 11/07/2021 14:48:51 - INFO - __main__ - Step 124828: {'lr': 3.4849619026148915e-05, 'samples': 23966976, 'steps': 124827, 'loss/train': 0.9004486799240112} 11/07/2021 14:48:52 - INFO - __main__ - Step 124829: {'lr': 3.4846916461597656e-05, 'samples': 23967168, 'steps': 124828, 'loss/train': 1.3549072742462158} 11/07/2021 14:48:52 - INFO - __main__ - Step 124830: {'lr': 3.484421399399104e-05, 'samples': 23967360, 'steps': 124829, 'loss/train': 1.0724871158599854} 11/07/2021 14:48:53 - INFO - __main__ - Step 124831: {'lr': 3.48415116233303e-05, 'samples': 23967552, 'steps': 124830, 'loss/train': 1.3124431371688843} 11/07/2021 14:48:53 - INFO - __main__ - Step 124832: {'lr': 3.483880934961667e-05, 'samples': 23967744, 'steps': 124831, 'loss/train': 1.2973294258117676} 11/07/2021 14:48:53 - INFO - __main__ - Step 124833: {'lr': 3.483610717285135e-05, 'samples': 23967936, 'steps': 124832, 'loss/train': 1.233999490737915} 11/07/2021 14:48:55 - INFO - __main__ - Step 124834: {'lr': 3.4833405093035535e-05, 'samples': 23968128, 'steps': 124833, 'loss/train': 1.4336355924606323} 11/07/2021 14:48:55 - INFO - __main__ - Step 124835: {'lr': 3.4830703110170506e-05, 'samples': 23968320, 'steps': 124834, 'loss/train': 1.2406740188598633} 11/07/2021 14:48:55 - INFO - __main__ - Step 124836: {'lr': 3.4828001224257416e-05, 'samples': 23968512, 'steps': 124835, 'loss/train': 0.7255012392997742} 11/07/2021 14:48:56 - INFO - __main__ - Step 124837: {'lr': 3.48252994352975e-05, 'samples': 23968704, 'steps': 124836, 'loss/train': 1.9574425220489502} 11/07/2021 14:48:56 - INFO - __main__ - Step 124838: {'lr': 3.4822597743292e-05, 'samples': 23968896, 'steps': 124837, 'loss/train': 1.5065010786056519} 11/07/2021 14:48:57 - INFO - __main__ - Step 124839: {'lr': 3.48198961482421e-05, 'samples': 23969088, 'steps': 124838, 'loss/train': 1.4894503355026245} 11/07/2021 14:48:58 - INFO - __main__ - Step 124840: {'lr': 3.4817194650149124e-05, 'samples': 23969280, 'steps': 124839, 'loss/train': 1.4722403287887573} 11/07/2021 14:48:58 - INFO - __main__ - Step 124841: {'lr': 3.481449324901412e-05, 'samples': 23969472, 'steps': 124840, 'loss/train': 1.3725337982177734} 11/07/2021 14:48:58 - INFO - __main__ - Step 124842: {'lr': 3.4811791944838384e-05, 'samples': 23969664, 'steps': 124841, 'loss/train': 1.5126121044158936} 11/07/2021 14:48:59 - INFO - __main__ - Step 124843: {'lr': 3.480909073762314e-05, 'samples': 23969856, 'steps': 124842, 'loss/train': 1.3493375778198242} 11/07/2021 14:48:59 - INFO - __main__ - Step 124844: {'lr': 3.480638962736959e-05, 'samples': 23970048, 'steps': 124843, 'loss/train': 0.9043334722518921} 11/07/2021 14:49:00 - INFO - __main__ - Step 124845: {'lr': 3.480368861407898e-05, 'samples': 23970240, 'steps': 124844, 'loss/train': 1.1427816152572632} 11/07/2021 14:49:01 - INFO - __main__ - Step 124846: {'lr': 3.48009876977525e-05, 'samples': 23970432, 'steps': 124845, 'loss/train': 0.8362852334976196} 11/07/2021 14:49:01 - INFO - __main__ - Step 124847: {'lr': 3.4798286878391344e-05, 'samples': 23970624, 'steps': 124846, 'loss/train': 0.4031038284301758} 11/07/2021 14:49:01 - INFO - __main__ - Step 124848: {'lr': 3.479558615599679e-05, 'samples': 23970816, 'steps': 124847, 'loss/train': 1.4188226461410522} 11/07/2021 14:49:02 - INFO - __main__ - Step 124849: {'lr': 3.479288553057003e-05, 'samples': 23971008, 'steps': 124848, 'loss/train': 1.4469709396362305} 11/07/2021 14:49:03 - INFO - __main__ - Step 124850: {'lr': 3.479018500211226e-05, 'samples': 23971200, 'steps': 124849, 'loss/train': 1.5010212659835815} 11/07/2021 14:49:03 - INFO - __main__ - Step 124851: {'lr': 3.478748457062472e-05, 'samples': 23971392, 'steps': 124850, 'loss/train': 1.2152621746063232} 11/07/2021 14:49:03 - INFO - __main__ - Step 124852: {'lr': 3.478478423610862e-05, 'samples': 23971584, 'steps': 124851, 'loss/train': 0.936867356300354} 11/07/2021 14:49:04 - INFO - __main__ - Step 124853: {'lr': 3.4782083998565224e-05, 'samples': 23971776, 'steps': 124852, 'loss/train': 1.169081211090088} 11/07/2021 14:49:04 - INFO - __main__ - Step 124854: {'lr': 3.477938385799564e-05, 'samples': 23971968, 'steps': 124853, 'loss/train': 1.4463458061218262} 11/07/2021 14:49:04 - INFO - __main__ - Step 124855: {'lr': 3.4776683814401134e-05, 'samples': 23972160, 'steps': 124854, 'loss/train': 1.0132133960723877} 11/07/2021 14:49:05 - INFO - __main__ - Step 124856: {'lr': 3.477398386778297e-05, 'samples': 23972352, 'steps': 124855, 'loss/train': 5.617630958557129} 11/07/2021 14:49:06 - INFO - __main__ - Step 124857: {'lr': 3.477128401814228e-05, 'samples': 23972544, 'steps': 124856, 'loss/train': 1.4392549991607666} 11/07/2021 14:49:06 - INFO - __main__ - Step 124858: {'lr': 3.4768584265480354e-05, 'samples': 23972736, 'steps': 124857, 'loss/train': 1.3085821866989136} 11/07/2021 14:49:06 - INFO - __main__ - Step 124859: {'lr': 3.476588460979841e-05, 'samples': 23972928, 'steps': 124858, 'loss/train': 0.838890016078949} 11/07/2021 14:49:07 - INFO - __main__ - Step 124860: {'lr': 3.47631850510976e-05, 'samples': 23973120, 'steps': 124859, 'loss/train': 1.0655028820037842} 11/07/2021 14:49:08 - INFO - __main__ - Step 124861: {'lr': 3.476048558937919e-05, 'samples': 23973312, 'steps': 124860, 'loss/train': 1.2798709869384766} 11/07/2021 14:49:08 - INFO - __main__ - Step 124862: {'lr': 3.475778622464437e-05, 'samples': 23973504, 'steps': 124861, 'loss/train': 2.110927104949951} 11/07/2021 14:49:09 - INFO - __main__ - Step 124863: {'lr': 3.475508695689439e-05, 'samples': 23973696, 'steps': 124862, 'loss/train': 1.831733226776123} 11/07/2021 14:49:09 - INFO - __main__ - Step 124864: {'lr': 3.475238778613043e-05, 'samples': 23973888, 'steps': 124863, 'loss/train': 1.11203134059906} 11/07/2021 14:49:09 - INFO - __main__ - Step 124865: {'lr': 3.474968871235373e-05, 'samples': 23974080, 'steps': 124864, 'loss/train': 1.486536979675293} 11/07/2021 14:49:10 - INFO - __main__ - Step 124866: {'lr': 3.4746989735565506e-05, 'samples': 23974272, 'steps': 124865, 'loss/train': 5.692070484161377} 11/07/2021 14:49:11 - INFO - __main__ - Step 124867: {'lr': 3.474429085576703e-05, 'samples': 23974464, 'steps': 124866, 'loss/train': 1.2809628248214722} 11/07/2021 14:49:11 - INFO - __main__ - Step 124868: {'lr': 3.474159207295938e-05, 'samples': 23974656, 'steps': 124867, 'loss/train': 1.369125485420227} 11/07/2021 14:49:11 - INFO - __main__ - Step 124869: {'lr': 3.4738893387143837e-05, 'samples': 23974848, 'steps': 124868, 'loss/train': 1.5031416416168213} 11/07/2021 14:49:12 - INFO - __main__ - Step 124870: {'lr': 3.473619479832166e-05, 'samples': 23975040, 'steps': 124869, 'loss/train': 1.4369001388549805} 11/07/2021 14:49:12 - INFO - __main__ - Step 124871: {'lr': 3.4733496306494e-05, 'samples': 23975232, 'steps': 124870, 'loss/train': 1.2565159797668457} 11/07/2021 14:49:13 - INFO - __main__ - Step 124872: {'lr': 3.473079791166212e-05, 'samples': 23975424, 'steps': 124871, 'loss/train': 1.4275437593460083} 11/07/2021 14:49:14 - INFO - __main__ - Step 124873: {'lr': 3.472809961382723e-05, 'samples': 23975616, 'steps': 124872, 'loss/train': 1.1819037199020386} 11/07/2021 14:49:14 - INFO - __main__ - Step 124874: {'lr': 3.472540141299052e-05, 'samples': 23975808, 'steps': 124873, 'loss/train': 1.4774365425109863} 11/07/2021 14:49:14 - INFO - __main__ - Step 124875: {'lr': 3.472270330915322e-05, 'samples': 23976000, 'steps': 124874, 'loss/train': 1.1174826622009277} 11/07/2021 14:49:15 - INFO - __main__ - Step 124876: {'lr': 3.472000530231656e-05, 'samples': 23976192, 'steps': 124875, 'loss/train': 1.8448925018310547} 11/07/2021 14:49:16 - INFO - __main__ - Step 124877: {'lr': 3.471730739248174e-05, 'samples': 23976384, 'steps': 124876, 'loss/train': 1.4679869413375854} 11/07/2021 14:49:16 - INFO - __main__ - Step 124878: {'lr': 3.4714609579649975e-05, 'samples': 23976576, 'steps': 124877, 'loss/train': 1.6132798194885254} 11/07/2021 14:49:16 - INFO - __main__ - Step 124879: {'lr': 3.47119118638225e-05, 'samples': 23976768, 'steps': 124878, 'loss/train': 1.390256643295288} 11/07/2021 14:49:17 - INFO - __main__ - Step 124880: {'lr': 3.470921424500056e-05, 'samples': 23976960, 'steps': 124879, 'loss/train': 1.2784594297409058} 11/07/2021 14:49:17 - INFO - __main__ - Step 124881: {'lr': 3.470651672318525e-05, 'samples': 23977152, 'steps': 124880, 'loss/train': 1.0873712301254272} 11/07/2021 14:49:17 - INFO - __main__ - Step 124882: {'lr': 3.47038192983779e-05, 'samples': 23977344, 'steps': 124881, 'loss/train': 1.4430233240127563} 11/07/2021 14:49:19 - INFO - __main__ - Step 124883: {'lr': 3.470112197057967e-05, 'samples': 23977536, 'steps': 124882, 'loss/train': 1.6219367980957031} 11/07/2021 14:49:19 - INFO - __main__ - Step 124884: {'lr': 3.469842473979179e-05, 'samples': 23977728, 'steps': 124883, 'loss/train': 1.6675792932510376} 11/07/2021 14:49:19 - INFO - __main__ - Step 124885: {'lr': 3.469572760601547e-05, 'samples': 23977920, 'steps': 124884, 'loss/train': 1.2597262859344482} 11/07/2021 14:49:20 - INFO - __main__ - Step 124886: {'lr': 3.469303056925194e-05, 'samples': 23978112, 'steps': 124885, 'loss/train': 1.3389290571212769} 11/07/2021 14:49:20 - INFO - __main__ - Step 124887: {'lr': 3.469033362950241e-05, 'samples': 23978304, 'steps': 124886, 'loss/train': 1.3768870830535889} 11/07/2021 14:49:21 - INFO - __main__ - Step 124888: {'lr': 3.468763678676809e-05, 'samples': 23978496, 'steps': 124887, 'loss/train': 1.1098660230636597} 11/07/2021 14:49:21 - INFO - __main__ - Step 124889: {'lr': 3.468494004105019e-05, 'samples': 23978688, 'steps': 124888, 'loss/train': 1.0455403327941895} 11/07/2021 14:49:22 - INFO - __main__ - Step 124890: {'lr': 3.468224339234996e-05, 'samples': 23978880, 'steps': 124889, 'loss/train': 1.393918752670288} 11/07/2021 14:49:22 - INFO - __main__ - Step 124891: {'lr': 3.467954684066857e-05, 'samples': 23979072, 'steps': 124890, 'loss/train': 1.268263578414917} 11/07/2021 14:49:22 - INFO - __main__ - Step 124892: {'lr': 3.467685038600726e-05, 'samples': 23979264, 'steps': 124891, 'loss/train': 1.3820708990097046} 11/07/2021 14:49:24 - INFO - __main__ - Step 124893: {'lr': 3.467415402836729e-05, 'samples': 23979456, 'steps': 124892, 'loss/train': 0.9482100605964661} 11/07/2021 14:49:24 - INFO - __main__ - Step 124894: {'lr': 3.467145776774977e-05, 'samples': 23979648, 'steps': 124893, 'loss/train': 1.3885338306427002} 11/07/2021 14:49:24 - INFO - __main__ - Step 124895: {'lr': 3.466876160415597e-05, 'samples': 23979840, 'steps': 124894, 'loss/train': 1.3221088647842407} 11/07/2021 14:49:25 - INFO - __main__ - Step 124896: {'lr': 3.466606553758708e-05, 'samples': 23980032, 'steps': 124895, 'loss/train': 1.5415247678756714} 11/07/2021 14:49:25 - INFO - __main__ - Step 124897: {'lr': 3.466336956804436e-05, 'samples': 23980224, 'steps': 124896, 'loss/train': 0.5502971410751343} 11/07/2021 14:49:26 - INFO - __main__ - Step 124898: {'lr': 3.4660673695529e-05, 'samples': 23980416, 'steps': 124897, 'loss/train': 1.2914743423461914} 11/07/2021 14:49:27 - INFO - __main__ - Step 124899: {'lr': 3.4657977920042217e-05, 'samples': 23980608, 'steps': 124898, 'loss/train': 1.518429160118103} 11/07/2021 14:49:27 - INFO - __main__ - Step 124900: {'lr': 3.465528224158523e-05, 'samples': 23980800, 'steps': 124899, 'loss/train': 0.8014762997627258} 11/07/2021 14:49:27 - INFO - __main__ - Step 124901: {'lr': 3.465258666015925e-05, 'samples': 23980992, 'steps': 124900, 'loss/train': 0.9163896441459656} 11/07/2021 14:49:28 - INFO - __main__ - Step 124902: {'lr': 3.46498911757655e-05, 'samples': 23981184, 'steps': 124901, 'loss/train': 1.3607152700424194} 11/07/2021 14:49:29 - INFO - __main__ - Step 124903: {'lr': 3.4647195788405164e-05, 'samples': 23981376, 'steps': 124902, 'loss/train': 0.09345489740371704} 11/07/2021 14:49:29 - INFO - __main__ - Step 124904: {'lr': 3.4644500498079486e-05, 'samples': 23981568, 'steps': 124903, 'loss/train': 1.228279948234558} 11/07/2021 14:49:29 - INFO - __main__ - Step 124905: {'lr': 3.464180530478969e-05, 'samples': 23981760, 'steps': 124904, 'loss/train': 1.609506607055664} 11/07/2021 14:49:30 - INFO - __main__ - Step 124906: {'lr': 3.4639110208537024e-05, 'samples': 23981952, 'steps': 124905, 'loss/train': 1.2289016246795654} 11/07/2021 14:49:30 - INFO - __main__ - Step 124907: {'lr': 3.4636415209322563e-05, 'samples': 23982144, 'steps': 124906, 'loss/train': 1.2284870147705078} 11/07/2021 14:49:30 - INFO - __main__ - Step 124908: {'lr': 3.4633720307147647e-05, 'samples': 23982336, 'steps': 124907, 'loss/train': 1.2528103590011597} 11/07/2021 14:49:31 - INFO - __main__ - Step 124909: {'lr': 3.463102550201344e-05, 'samples': 23982528, 'steps': 124908, 'loss/train': 1.281020998954773} 11/07/2021 14:49:32 - INFO - __main__ - Step 124910: {'lr': 3.4628330793921166e-05, 'samples': 23982720, 'steps': 124909, 'loss/train': 1.1679925918579102} 11/07/2021 14:49:32 - INFO - __main__ - Step 124911: {'lr': 3.4625636182872036e-05, 'samples': 23982912, 'steps': 124910, 'loss/train': 1.2220340967178345} 11/07/2021 14:49:32 - INFO - __main__ - Step 124912: {'lr': 3.462294166886729e-05, 'samples': 23983104, 'steps': 124911, 'loss/train': 1.3567653894424438} 11/07/2021 14:49:33 - INFO - __main__ - Step 124913: {'lr': 3.462024725190813e-05, 'samples': 23983296, 'steps': 124912, 'loss/train': 1.3148419857025146} 11/07/2021 14:49:34 - INFO - __main__ - Step 124914: {'lr': 3.4617552931995726e-05, 'samples': 23983488, 'steps': 124913, 'loss/train': 1.558266043663025} 11/07/2021 14:49:34 - INFO - __main__ - Step 124915: {'lr': 3.461485870913137e-05, 'samples': 23983680, 'steps': 124914, 'loss/train': 1.0150318145751953} 11/07/2021 14:49:35 - INFO - __main__ - Step 124916: {'lr': 3.46121645833162e-05, 'samples': 23983872, 'steps': 124915, 'loss/train': 1.411071538925171} 11/07/2021 14:49:35 - INFO - __main__ - Step 124917: {'lr': 3.460947055455155e-05, 'samples': 23984064, 'steps': 124916, 'loss/train': 1.027470588684082} 11/07/2021 14:49:35 - INFO - __main__ - Step 124918: {'lr': 3.460677662283848e-05, 'samples': 23984256, 'steps': 124917, 'loss/train': 1.4689383506774902} 11/07/2021 14:49:36 - INFO - __main__ - Step 124919: {'lr': 3.460408278817828e-05, 'samples': 23984448, 'steps': 124918, 'loss/train': 1.1466234922409058} 11/07/2021 14:49:37 - INFO - __main__ - Step 124920: {'lr': 3.460138905057214e-05, 'samples': 23984640, 'steps': 124919, 'loss/train': 0.7380986213684082} 11/07/2021 14:49:37 - INFO - __main__ - Step 124921: {'lr': 3.4598695410021305e-05, 'samples': 23984832, 'steps': 124920, 'loss/train': 1.6478015184402466} 11/07/2021 14:49:37 - INFO - __main__ - Step 124922: {'lr': 3.459600186652698e-05, 'samples': 23985024, 'steps': 124921, 'loss/train': 1.181930661201477} 11/07/2021 14:49:38 - INFO - __main__ - Step 124923: {'lr': 3.459330842009034e-05, 'samples': 23985216, 'steps': 124922, 'loss/train': 1.369447112083435} 11/07/2021 14:49:39 - INFO - __main__ - Step 124924: {'lr': 3.459061507071265e-05, 'samples': 23985408, 'steps': 124923, 'loss/train': 1.1370697021484375} 11/07/2021 14:49:39 - INFO - __main__ - Step 124925: {'lr': 3.458792181839512e-05, 'samples': 23985600, 'steps': 124924, 'loss/train': 1.202663779258728} 11/07/2021 14:49:39 - INFO - __main__ - Step 124926: {'lr': 3.458522866313893e-05, 'samples': 23985792, 'steps': 124925, 'loss/train': 1.0704485177993774} 11/07/2021 14:49:40 - INFO - __main__ - Step 124927: {'lr': 3.458253560494531e-05, 'samples': 23985984, 'steps': 124926, 'loss/train': 1.0146243572235107} 11/07/2021 14:49:40 - INFO - __main__ - Step 124928: {'lr': 3.457984264381556e-05, 'samples': 23986176, 'steps': 124927, 'loss/train': 1.6265491247177124} 11/07/2021 14:49:41 - INFO - __main__ - Step 124929: {'lr': 3.457714977975071e-05, 'samples': 23986368, 'steps': 124928, 'loss/train': 1.4751358032226562} 11/07/2021 14:49:42 - INFO - __main__ - Step 124930: {'lr': 3.457445701275211e-05, 'samples': 23986560, 'steps': 124929, 'loss/train': 1.4171226024627686} 11/07/2021 14:49:42 - INFO - __main__ - Step 124931: {'lr': 3.457176434282091e-05, 'samples': 23986752, 'steps': 124930, 'loss/train': 1.2042500972747803} 11/07/2021 14:49:42 - INFO - __main__ - Step 124932: {'lr': 3.456907176995835e-05, 'samples': 23986944, 'steps': 124931, 'loss/train': 1.3272229433059692} 11/07/2021 14:49:43 - INFO - __main__ - Step 124933: {'lr': 3.456637929416567e-05, 'samples': 23987136, 'steps': 124932, 'loss/train': 1.2755974531173706} 11/07/2021 14:49:43 - INFO - __main__ - Step 124934: {'lr': 3.4563686915444035e-05, 'samples': 23987328, 'steps': 124933, 'loss/train': 1.6726081371307373} 11/07/2021 14:49:44 - INFO - __main__ - Step 124935: {'lr': 3.4560994633794666e-05, 'samples': 23987520, 'steps': 124934, 'loss/train': 0.9802484512329102} 11/07/2021 14:49:44 - INFO - __main__ - Step 124936: {'lr': 3.455830244921882e-05, 'samples': 23987712, 'steps': 124935, 'loss/train': 1.26675283908844} 11/07/2021 14:49:45 - INFO - __main__ - Step 124937: {'lr': 3.455561036171764e-05, 'samples': 23987904, 'steps': 124936, 'loss/train': 1.7270320653915405} 11/07/2021 14:49:45 - INFO - __main__ - Step 124938: {'lr': 3.45529183712924e-05, 'samples': 23988096, 'steps': 124937, 'loss/train': 1.4227179288864136} 11/07/2021 14:49:45 - INFO - __main__ - Step 124939: {'lr': 3.455022647794434e-05, 'samples': 23988288, 'steps': 124938, 'loss/train': 1.2689591646194458} 11/07/2021 14:49:46 - INFO - __main__ - Step 124940: {'lr': 3.454753468167457e-05, 'samples': 23988480, 'steps': 124939, 'loss/train': 1.1374521255493164} 11/07/2021 14:49:47 - INFO - __main__ - Step 124941: {'lr': 3.454484298248437e-05, 'samples': 23988672, 'steps': 124940, 'loss/train': 1.355869174003601} 11/07/2021 14:49:47 - INFO - __main__ - Step 124942: {'lr': 3.454215138037492e-05, 'samples': 23988864, 'steps': 124941, 'loss/train': 1.5153567790985107} 11/07/2021 14:49:47 - INFO - __main__ - Step 124943: {'lr': 3.4539459875347454e-05, 'samples': 23989056, 'steps': 124942, 'loss/train': 1.262718915939331} 11/07/2021 14:49:48 - INFO - __main__ - Step 124944: {'lr': 3.45367684674032e-05, 'samples': 23989248, 'steps': 124943, 'loss/train': 0.7687518000602722} 11/07/2021 14:49:49 - INFO - __main__ - Step 124945: {'lr': 3.453407715654333e-05, 'samples': 23989440, 'steps': 124944, 'loss/train': 1.2473074197769165} 11/07/2021 14:49:49 - INFO - __main__ - Step 124946: {'lr': 3.453138594276908e-05, 'samples': 23989632, 'steps': 124945, 'loss/train': 0.6369754076004028} 11/07/2021 14:49:50 - INFO - __main__ - Step 124947: {'lr': 3.452869482608167e-05, 'samples': 23989824, 'steps': 124946, 'loss/train': 1.5152114629745483} 11/07/2021 14:49:50 - INFO - __main__ - Step 124948: {'lr': 3.4526003806482325e-05, 'samples': 23990016, 'steps': 124947, 'loss/train': 1.4138118028640747} 11/07/2021 14:49:50 - INFO - __main__ - Step 124949: {'lr': 3.452331288397223e-05, 'samples': 23990208, 'steps': 124948, 'loss/train': 1.490059494972229} 11/07/2021 14:49:51 - INFO - __main__ - Step 124950: {'lr': 3.452062205855264e-05, 'samples': 23990400, 'steps': 124949, 'loss/train': 1.5142823457717896} 11/07/2021 14:49:52 - INFO - __main__ - Step 124951: {'lr': 3.451793133022468e-05, 'samples': 23990592, 'steps': 124950, 'loss/train': 1.5207513570785522} 11/07/2021 14:49:52 - INFO - __main__ - Step 124952: {'lr': 3.451524069898962e-05, 'samples': 23990784, 'steps': 124951, 'loss/train': 1.0518462657928467} 11/07/2021 14:49:52 - INFO - __main__ - Step 124953: {'lr': 3.4512550164848694e-05, 'samples': 23990976, 'steps': 124952, 'loss/train': 1.0162676572799683} 11/07/2021 14:49:53 - INFO - __main__ - Step 124954: {'lr': 3.4509859727803046e-05, 'samples': 23991168, 'steps': 124953, 'loss/train': 1.2225710153579712} 11/07/2021 14:49:54 - INFO - __main__ - Step 124955: {'lr': 3.450716938785395e-05, 'samples': 23991360, 'steps': 124954, 'loss/train': 0.515871524810791} 11/07/2021 14:49:54 - INFO - __main__ - Step 124956: {'lr': 3.45044791450026e-05, 'samples': 23991552, 'steps': 124955, 'loss/train': 1.0313962697982788} 11/07/2021 14:49:54 - INFO - __main__ - Step 124957: {'lr': 3.450178899925022e-05, 'samples': 23991744, 'steps': 124956, 'loss/train': 1.5169130563735962} 11/07/2021 14:49:55 - INFO - __main__ - Step 124958: {'lr': 3.449909895059797e-05, 'samples': 23991936, 'steps': 124957, 'loss/train': 1.5896161794662476} 11/07/2021 14:49:55 - INFO - __main__ - Step 124959: {'lr': 3.449640899904713e-05, 'samples': 23992128, 'steps': 124958, 'loss/train': 1.5039081573486328} 11/07/2021 14:49:56 - INFO - __main__ - Step 124960: {'lr': 3.449371914459887e-05, 'samples': 23992320, 'steps': 124959, 'loss/train': 1.2955944538116455} 11/07/2021 14:49:57 - INFO - __main__ - Step 124961: {'lr': 3.449102938725448e-05, 'samples': 23992512, 'steps': 124960, 'loss/train': 1.2757980823516846} 11/07/2021 14:49:57 - INFO - __main__ - Step 124962: {'lr': 3.448833972701504e-05, 'samples': 23992704, 'steps': 124961, 'loss/train': 1.5865147113800049} 11/07/2021 14:49:57 - INFO - __main__ - Step 124963: {'lr': 3.448565016388183e-05, 'samples': 23992896, 'steps': 124962, 'loss/train': 1.3176213502883911} 11/07/2021 14:49:58 - INFO - __main__ - Step 124964: {'lr': 3.4482960697856085e-05, 'samples': 23993088, 'steps': 124963, 'loss/train': 1.3237407207489014} 11/07/2021 14:49:59 - INFO - __main__ - Step 124965: {'lr': 3.448027132893897e-05, 'samples': 23993280, 'steps': 124964, 'loss/train': 1.4434064626693726} 11/07/2021 14:49:59 - INFO - __main__ - Step 124966: {'lr': 3.447758205713172e-05, 'samples': 23993472, 'steps': 124965, 'loss/train': 0.8522605299949646} 11/07/2021 14:49:59 - INFO - __main__ - Step 124967: {'lr': 3.447489288243555e-05, 'samples': 23993664, 'steps': 124966, 'loss/train': 1.9664663076400757} 11/07/2021 14:50:00 - INFO - __main__ - Step 124968: {'lr': 3.447220380485166e-05, 'samples': 23993856, 'steps': 124967, 'loss/train': 1.7352656126022339} 11/07/2021 14:50:00 - INFO - __main__ - Step 124969: {'lr': 3.4469514824381264e-05, 'samples': 23994048, 'steps': 124968, 'loss/train': 1.1601827144622803} 11/07/2021 14:50:01 - INFO - __main__ - Step 124970: {'lr': 3.44668259410256e-05, 'samples': 23994240, 'steps': 124969, 'loss/train': 0.11458183825016022} 11/07/2021 14:50:02 - INFO - __main__ - Step 124971: {'lr': 3.4464137154785854e-05, 'samples': 23994432, 'steps': 124970, 'loss/train': 1.213631510734558} 11/07/2021 14:50:02 - INFO - __main__ - Step 124972: {'lr': 3.4461448465663234e-05, 'samples': 23994624, 'steps': 124971, 'loss/train': 0.8763653039932251} 11/07/2021 14:50:02 - INFO - __main__ - Step 124973: {'lr': 3.445875987365896e-05, 'samples': 23994816, 'steps': 124972, 'loss/train': 1.2784160375595093} 11/07/2021 14:50:03 - INFO - __main__ - Step 124974: {'lr': 3.4456071378774294e-05, 'samples': 23995008, 'steps': 124973, 'loss/train': 1.2466641664505005} 11/07/2021 14:50:03 - INFO - __main__ - Step 124975: {'lr': 3.445338298101033e-05, 'samples': 23995200, 'steps': 124974, 'loss/train': 0.6966968178749084} 11/07/2021 14:50:04 - INFO - __main__ - Step 124976: {'lr': 3.4450694680368375e-05, 'samples': 23995392, 'steps': 124975, 'loss/train': 0.457965612411499} 11/07/2021 14:50:04 - INFO - __main__ - Step 124977: {'lr': 3.4448006476849594e-05, 'samples': 23995584, 'steps': 124976, 'loss/train': 2.1435344219207764} 11/07/2021 14:50:05 - INFO - __main__ - Step 124978: {'lr': 3.444531837045522e-05, 'samples': 23995776, 'steps': 124977, 'loss/train': 0.5966642498970032} 11/07/2021 14:50:05 - INFO - __main__ - Step 124979: {'lr': 3.444263036118647e-05, 'samples': 23995968, 'steps': 124978, 'loss/train': 1.1466343402862549} 11/07/2021 14:50:05 - INFO - __main__ - Step 124980: {'lr': 3.4439942449044526e-05, 'samples': 23996160, 'steps': 124979, 'loss/train': 1.028188943862915} 11/07/2021 14:50:07 - INFO - __main__ - Step 124981: {'lr': 3.4437254634030606e-05, 'samples': 23996352, 'steps': 124980, 'loss/train': 1.4760291576385498} 11/07/2021 14:50:07 - INFO - __main__ - Step 124982: {'lr': 3.443456691614597e-05, 'samples': 23996544, 'steps': 124981, 'loss/train': 1.135483741760254} 11/07/2021 14:50:07 - INFO - __main__ - Step 124983: {'lr': 3.4431879295391763e-05, 'samples': 23996736, 'steps': 124982, 'loss/train': 1.3141887187957764} 11/07/2021 14:50:08 - INFO - __main__ - Step 124984: {'lr': 3.442919177176923e-05, 'samples': 23996928, 'steps': 124983, 'loss/train': 1.9262213706970215} 11/07/2021 14:50:08 - INFO - __main__ - Step 124985: {'lr': 3.442650434527958e-05, 'samples': 23997120, 'steps': 124984, 'loss/train': 1.0443956851959229} 11/07/2021 14:50:09 - INFO - __main__ - Step 124986: {'lr': 3.442381701592404e-05, 'samples': 23997312, 'steps': 124985, 'loss/train': 1.6788995265960693} 11/07/2021 14:50:09 - INFO - __main__ - Step 124987: {'lr': 3.4421129783703764e-05, 'samples': 23997504, 'steps': 124986, 'loss/train': 1.4072916507720947} 11/07/2021 14:50:10 - INFO - __main__ - Step 124988: {'lr': 3.441844264862007e-05, 'samples': 23997696, 'steps': 124987, 'loss/train': 1.5450539588928223} 11/07/2021 14:50:10 - INFO - __main__ - Step 124989: {'lr': 3.441575561067406e-05, 'samples': 23997888, 'steps': 124988, 'loss/train': 1.3425028324127197} 11/07/2021 14:50:10 - INFO - __main__ - Step 124990: {'lr': 3.441306866986696e-05, 'samples': 23998080, 'steps': 124989, 'loss/train': 1.4621542692184448} 11/07/2021 14:50:11 - INFO - __main__ - Step 124991: {'lr': 3.4410381826200016e-05, 'samples': 23998272, 'steps': 124990, 'loss/train': 1.7079808712005615} 11/07/2021 14:50:12 - INFO - __main__ - Step 124992: {'lr': 3.440769507967445e-05, 'samples': 23998464, 'steps': 124991, 'loss/train': 1.4105298519134521} 11/07/2021 14:50:12 - INFO - __main__ - Step 124993: {'lr': 3.440500843029143e-05, 'samples': 23998656, 'steps': 124992, 'loss/train': 1.2384759187698364} 11/07/2021 14:50:12 - INFO - __main__ - Step 124994: {'lr': 3.440232187805217e-05, 'samples': 23998848, 'steps': 124993, 'loss/train': 0.5732433199882507} 11/07/2021 14:50:13 - INFO - __main__ - Step 124995: {'lr': 3.4399635422957904e-05, 'samples': 23999040, 'steps': 124994, 'loss/train': 1.3175746202468872} 11/07/2021 14:50:13 - INFO - __main__ - Step 124996: {'lr': 3.439694906500984e-05, 'samples': 23999232, 'steps': 124995, 'loss/train': 1.5365006923675537} 11/07/2021 14:50:14 - INFO - __main__ - Step 124997: {'lr': 3.439426280420921e-05, 'samples': 23999424, 'steps': 124996, 'loss/train': 1.0541815757751465} 11/07/2021 14:50:15 - INFO - __main__ - Step 124998: {'lr': 3.439157664055717e-05, 'samples': 23999616, 'steps': 124997, 'loss/train': 1.1345734596252441} 11/07/2021 14:50:15 - INFO - __main__ - Step 124999: {'lr': 3.438889057405495e-05, 'samples': 23999808, 'steps': 124998, 'loss/train': 0.7185196876525879} 11/07/2021 14:50:15 - INFO - __main__ - Step 125000: {'lr': 3.438620460470379e-05, 'samples': 24000000, 'steps': 124999, 'loss/train': 0.9713723659515381} 11/07/2021 14:50:16 - INFO - __main__ - Step 125001: {'lr': 3.438351873250492e-05, 'samples': 24000192, 'steps': 125000, 'loss/train': 1.4009156227111816} 11/07/2021 14:50:17 - INFO - __main__ - Step 125002: {'lr': 3.4380832957459476e-05, 'samples': 24000384, 'steps': 125001, 'loss/train': 1.156702995300293} 11/07/2021 14:50:17 - INFO - __main__ - Step 125003: {'lr': 3.437814727956867e-05, 'samples': 24000576, 'steps': 125002, 'loss/train': 1.5049868822097778} 11/07/2021 14:50:18 - INFO - __main__ - Step 125004: {'lr': 3.437546169883376e-05, 'samples': 24000768, 'steps': 125003, 'loss/train': 1.2197209596633911} 11/07/2021 14:50:18 - INFO - __main__ - Step 125005: {'lr': 3.4372776215255946e-05, 'samples': 24000960, 'steps': 125004, 'loss/train': 1.2113808393478394} 11/07/2021 14:50:18 - INFO - __main__ - Step 125006: {'lr': 3.437009082883641e-05, 'samples': 24001152, 'steps': 125005, 'loss/train': 1.273844599723816} 11/07/2021 14:50:20 - INFO - __main__ - Step 125007: {'lr': 3.43674055395764e-05, 'samples': 24001344, 'steps': 125006, 'loss/train': 1.3673362731933594} 11/07/2021 14:50:20 - INFO - __main__ - Step 125008: {'lr': 3.436472034747712e-05, 'samples': 24001536, 'steps': 125007, 'loss/train': 1.4859777688980103} 11/07/2021 14:50:20 - INFO - __main__ - Step 125009: {'lr': 3.436203525253975e-05, 'samples': 24001728, 'steps': 125008, 'loss/train': 1.3308359384536743} 11/07/2021 14:50:21 - INFO - __main__ - Step 125010: {'lr': 3.4359350254765527e-05, 'samples': 24001920, 'steps': 125009, 'loss/train': 1.5189746618270874} 11/07/2021 14:50:21 - INFO - __main__ - Step 125011: {'lr': 3.4356665354155656e-05, 'samples': 24002112, 'steps': 125010, 'loss/train': 1.5125706195831299} 11/07/2021 14:50:22 - INFO - __main__ - Step 125012: {'lr': 3.435398055071135e-05, 'samples': 24002304, 'steps': 125011, 'loss/train': 1.2321414947509766} 11/07/2021 14:50:23 - INFO - __main__ - Step 125013: {'lr': 3.435129584443378e-05, 'samples': 24002496, 'steps': 125012, 'loss/train': 0.6610686779022217} 11/07/2021 14:50:23 - INFO - __main__ - Step 125014: {'lr': 3.434861123532429e-05, 'samples': 24002688, 'steps': 125013, 'loss/train': 1.3390600681304932} 11/07/2021 14:50:23 - INFO - __main__ - Step 125015: {'lr': 3.434592672338391e-05, 'samples': 24002880, 'steps': 125014, 'loss/train': 1.2767785787582397} 11/07/2021 14:50:24 - INFO - __main__ - Step 125016: {'lr': 3.434324230861391e-05, 'samples': 24003072, 'steps': 125015, 'loss/train': 1.1829582452774048} 11/07/2021 14:50:24 - INFO - __main__ - Step 125017: {'lr': 3.434055799101554e-05, 'samples': 24003264, 'steps': 125016, 'loss/train': 1.2934105396270752} 11/07/2021 14:50:25 - INFO - __main__ - Step 125018: {'lr': 3.4337873770589974e-05, 'samples': 24003456, 'steps': 125017, 'loss/train': 1.4666917324066162} 11/07/2021 14:50:26 - INFO - __main__ - Step 125019: {'lr': 3.433518964733845e-05, 'samples': 24003648, 'steps': 125018, 'loss/train': 0.7169766426086426} 11/07/2021 14:50:26 - INFO - __main__ - Step 125020: {'lr': 3.433250562126214e-05, 'samples': 24003840, 'steps': 125019, 'loss/train': 1.3321324586868286} 11/07/2021 14:50:26 - INFO - __main__ - Step 125021: {'lr': 3.432982169236229e-05, 'samples': 24004032, 'steps': 125020, 'loss/train': 0.733807384967804} 11/07/2021 14:50:27 - INFO - __main__ - Step 125022: {'lr': 3.43271378606401e-05, 'samples': 24004224, 'steps': 125021, 'loss/train': 1.5734434127807617} 11/07/2021 14:50:27 - INFO - __main__ - Step 125023: {'lr': 3.432445412609678e-05, 'samples': 24004416, 'steps': 125022, 'loss/train': 1.456147313117981} 11/07/2021 14:50:28 - INFO - __main__ - Step 125024: {'lr': 3.43217704887335e-05, 'samples': 24004608, 'steps': 125023, 'loss/train': 0.9035787582397461} 11/07/2021 14:50:29 - INFO - __main__ - Step 125025: {'lr': 3.431908694855154e-05, 'samples': 24004800, 'steps': 125024, 'loss/train': 1.5194761753082275} 11/07/2021 14:50:29 - INFO - __main__ - Step 125026: {'lr': 3.431640350555204e-05, 'samples': 24004992, 'steps': 125025, 'loss/train': 1.2988039255142212} 11/07/2021 14:50:29 - INFO - __main__ - Step 125027: {'lr': 3.431372015973624e-05, 'samples': 24005184, 'steps': 125026, 'loss/train': 0.7653622627258301} 11/07/2021 14:50:30 - INFO - __main__ - Step 125028: {'lr': 3.431103691110543e-05, 'samples': 24005376, 'steps': 125027, 'loss/train': 1.2262942790985107} 11/07/2021 14:50:31 - INFO - __main__ - Step 125029: {'lr': 3.430835375966068e-05, 'samples': 24005568, 'steps': 125028, 'loss/train': 0.7962832450866699} 11/07/2021 14:50:32 - INFO - __main__ - Step 125030: {'lr': 3.430567070540325e-05, 'samples': 24005760, 'steps': 125029, 'loss/train': 2.306252956390381} 11/07/2021 14:50:32 - INFO - __main__ - Step 125031: {'lr': 3.430298774833435e-05, 'samples': 24005952, 'steps': 125030, 'loss/train': 1.256244421005249} 11/07/2021 14:50:32 - INFO - __main__ - Step 125032: {'lr': 3.43003048884552e-05, 'samples': 24006144, 'steps': 125031, 'loss/train': 0.8251751065254211} 11/07/2021 14:50:33 - INFO - __main__ - Step 125033: {'lr': 3.429762212576701e-05, 'samples': 24006336, 'steps': 125032, 'loss/train': 0.9612962603569031} 11/07/2021 14:50:33 - INFO - __main__ - Step 125034: {'lr': 3.4294939460270984e-05, 'samples': 24006528, 'steps': 125033, 'loss/train': 1.712831974029541} 11/07/2021 14:50:34 - INFO - __main__ - Step 125035: {'lr': 3.4292256891968326e-05, 'samples': 24006720, 'steps': 125034, 'loss/train': 1.6536121368408203} 11/07/2021 14:50:35 - INFO - __main__ - Step 125036: {'lr': 3.4289574420860226e-05, 'samples': 24006912, 'steps': 125035, 'loss/train': 1.6302739381790161} 11/07/2021 14:50:35 - INFO - __main__ - Step 125037: {'lr': 3.428689204694793e-05, 'samples': 24007104, 'steps': 125036, 'loss/train': 1.356165885925293} 11/07/2021 14:50:35 - INFO - __main__ - Step 125038: {'lr': 3.428420977023264e-05, 'samples': 24007296, 'steps': 125037, 'loss/train': 1.2639191150665283} 11/07/2021 14:50:36 - INFO - __main__ - Step 125039: {'lr': 3.428152759071557e-05, 'samples': 24007488, 'steps': 125038, 'loss/train': 0.4406932294368744} 11/07/2021 14:50:37 - INFO - __main__ - Step 125040: {'lr': 3.427884550839788e-05, 'samples': 24007680, 'steps': 125039, 'loss/train': 0.9383053779602051} 11/07/2021 14:50:37 - INFO - __main__ - Step 125041: {'lr': 3.427616352328089e-05, 'samples': 24007872, 'steps': 125040, 'loss/train': 1.0855404138565063} 11/07/2021 14:50:37 - INFO - __main__ - Step 125042: {'lr': 3.427348163536567e-05, 'samples': 24008064, 'steps': 125041, 'loss/train': 2.368540048599243} 11/07/2021 14:50:38 - INFO - __main__ - Step 125043: {'lr': 3.4270799844653504e-05, 'samples': 24008256, 'steps': 125042, 'loss/train': 1.1855709552764893} 11/07/2021 14:50:38 - INFO - __main__ - Step 125044: {'lr': 3.426811815114558e-05, 'samples': 24008448, 'steps': 125043, 'loss/train': 1.4820663928985596} 11/07/2021 14:50:39 - INFO - __main__ - Step 125045: {'lr': 3.42654365548431e-05, 'samples': 24008640, 'steps': 125044, 'loss/train': 0.9843706488609314} 11/07/2021 14:50:40 - INFO - __main__ - Step 125046: {'lr': 3.42627550557473e-05, 'samples': 24008832, 'steps': 125045, 'loss/train': 1.502817988395691} 11/07/2021 14:50:40 - INFO - __main__ - Step 125047: {'lr': 3.426007365385936e-05, 'samples': 24009024, 'steps': 125046, 'loss/train': 1.386558175086975} 11/07/2021 14:50:40 - INFO - __main__ - Step 125048: {'lr': 3.4257392349180516e-05, 'samples': 24009216, 'steps': 125047, 'loss/train': 1.0956735610961914} 11/07/2021 14:50:41 - INFO - __main__ - Step 125049: {'lr': 3.425471114171197e-05, 'samples': 24009408, 'steps': 125048, 'loss/train': 1.3737423419952393} 11/07/2021 14:50:41 - INFO - __main__ - Step 125050: {'lr': 3.4252030031454886e-05, 'samples': 24009600, 'steps': 125049, 'loss/train': 1.4715640544891357} 11/07/2021 14:50:42 - INFO - __main__ - Step 125051: {'lr': 3.424934901841054e-05, 'samples': 24009792, 'steps': 125050, 'loss/train': 1.2540541887283325} 11/07/2021 14:50:42 - INFO - __main__ - Step 125052: {'lr': 3.424666810258009e-05, 'samples': 24009984, 'steps': 125051, 'loss/train': 1.2171952724456787} 11/07/2021 14:50:43 - INFO - __main__ - Step 125053: {'lr': 3.424398728396477e-05, 'samples': 24010176, 'steps': 125052, 'loss/train': 1.2770179510116577} 11/07/2021 14:50:43 - INFO - __main__ - Step 125054: {'lr': 3.42413065625658e-05, 'samples': 24010368, 'steps': 125053, 'loss/train': 1.1720912456512451} 11/07/2021 14:50:43 - INFO - __main__ - Step 125055: {'lr': 3.4238625938384396e-05, 'samples': 24010560, 'steps': 125054, 'loss/train': 1.6074299812316895} 11/07/2021 14:50:44 - INFO - __main__ - Step 125056: {'lr': 3.4235945411421695e-05, 'samples': 24010752, 'steps': 125055, 'loss/train': 1.1149784326553345} 11/07/2021 14:50:45 - INFO - __main__ - Step 125057: {'lr': 3.423326498167895e-05, 'samples': 24010944, 'steps': 125056, 'loss/train': 1.3311707973480225} 11/07/2021 14:50:45 - INFO - __main__ - Step 125058: {'lr': 3.423058464915735e-05, 'samples': 24011136, 'steps': 125057, 'loss/train': 1.435324788093567} 11/07/2021 14:50:46 - INFO - __main__ - Step 125059: {'lr': 3.422790441385812e-05, 'samples': 24011328, 'steps': 125058, 'loss/train': 1.022083044052124} 11/07/2021 14:50:46 - INFO - __main__ - Step 125060: {'lr': 3.422522427578248e-05, 'samples': 24011520, 'steps': 125059, 'loss/train': 1.3324964046478271} 11/07/2021 14:50:47 - INFO - __main__ - Step 125061: {'lr': 3.422254423493162e-05, 'samples': 24011712, 'steps': 125060, 'loss/train': 1.3156944513320923} 11/07/2021 14:50:47 - INFO - __main__ - Step 125062: {'lr': 3.421986429130675e-05, 'samples': 24011904, 'steps': 125061, 'loss/train': 1.8938628435134888} 11/07/2021 14:50:48 - INFO - __main__ - Step 125063: {'lr': 3.421718444490907e-05, 'samples': 24012096, 'steps': 125062, 'loss/train': 1.8104703426361084} 11/07/2021 14:50:48 - INFO - __main__ - Step 125064: {'lr': 3.4214504695739805e-05, 'samples': 24012288, 'steps': 125063, 'loss/train': 1.9827512502670288} 11/07/2021 14:50:48 - INFO - __main__ - Step 125065: {'lr': 3.421182504380016e-05, 'samples': 24012480, 'steps': 125064, 'loss/train': 1.6616305112838745} 11/07/2021 14:50:49 - INFO - __main__ - Step 125066: {'lr': 3.4209145489091346e-05, 'samples': 24012672, 'steps': 125065, 'loss/train': 1.520662784576416} 11/07/2021 14:50:50 - INFO - __main__ - Step 125067: {'lr': 3.4206466031614535e-05, 'samples': 24012864, 'steps': 125066, 'loss/train': 1.039074420928955} 11/07/2021 14:50:50 - INFO - __main__ - Step 125068: {'lr': 3.420378667137103e-05, 'samples': 24013056, 'steps': 125067, 'loss/train': 1.1135411262512207} 11/07/2021 14:50:50 - INFO - __main__ - Step 125069: {'lr': 3.420110740836191e-05, 'samples': 24013248, 'steps': 125068, 'loss/train': 1.0106322765350342} 11/07/2021 14:50:51 - INFO - __main__ - Step 125070: {'lr': 3.419842824258845e-05, 'samples': 24013440, 'steps': 125069, 'loss/train': 0.9082858562469482} 11/07/2021 14:50:51 - INFO - __main__ - Step 125071: {'lr': 3.419574917405183e-05, 'samples': 24013632, 'steps': 125070, 'loss/train': 0.7589564919471741} 11/07/2021 14:50:52 - INFO - __main__ - Step 125072: {'lr': 3.419307020275331e-05, 'samples': 24013824, 'steps': 125071, 'loss/train': 1.0253618955612183} 11/07/2021 14:50:53 - INFO - __main__ - Step 125073: {'lr': 3.4190391328694035e-05, 'samples': 24014016, 'steps': 125072, 'loss/train': 0.9626893401145935} 11/07/2021 14:50:53 - INFO - __main__ - Step 125074: {'lr': 3.418771255187525e-05, 'samples': 24014208, 'steps': 125073, 'loss/train': 1.7489606142044067} 11/07/2021 14:50:53 - INFO - __main__ - Step 125075: {'lr': 3.4185033872298127e-05, 'samples': 24014400, 'steps': 125074, 'loss/train': 1.5229382514953613} 11/07/2021 14:50:54 - INFO - __main__ - Step 125076: {'lr': 3.418235528996391e-05, 'samples': 24014592, 'steps': 125075, 'loss/train': 1.3801876306533813} 11/07/2021 14:50:55 - INFO - __main__ - Step 125077: {'lr': 3.41796768048738e-05, 'samples': 24014784, 'steps': 125076, 'loss/train': 1.6655149459838867} 11/07/2021 14:50:55 - INFO - __main__ - Step 125078: {'lr': 3.417699841702901e-05, 'samples': 24014976, 'steps': 125077, 'loss/train': 1.3767186403274536} 11/07/2021 14:50:55 - INFO - __main__ - Step 125079: {'lr': 3.417432012643071e-05, 'samples': 24015168, 'steps': 125078, 'loss/train': 1.4290030002593994} 11/07/2021 14:50:56 - INFO - __main__ - Step 125080: {'lr': 3.4171641933080147e-05, 'samples': 24015360, 'steps': 125079, 'loss/train': 1.5217316150665283} 11/07/2021 14:50:56 - INFO - __main__ - Step 125081: {'lr': 3.4168963836978513e-05, 'samples': 24015552, 'steps': 125080, 'loss/train': 1.3434913158416748} 11/07/2021 14:50:57 - INFO - __main__ - Step 125082: {'lr': 3.416628583812706e-05, 'samples': 24015744, 'steps': 125081, 'loss/train': 1.3666940927505493} 11/07/2021 14:50:57 - INFO - __main__ - Step 125083: {'lr': 3.4163607936526896e-05, 'samples': 24015936, 'steps': 125082, 'loss/train': 0.9769893884658813} 11/07/2021 14:50:58 - INFO - __main__ - Step 125084: {'lr': 3.416093013217928e-05, 'samples': 24016128, 'steps': 125083, 'loss/train': 0.8779830932617188} 11/07/2021 14:50:58 - INFO - __main__ - Step 125085: {'lr': 3.415825242508541e-05, 'samples': 24016320, 'steps': 125084, 'loss/train': 1.4898706674575806} 11/07/2021 14:50:59 - INFO - __main__ - Step 125086: {'lr': 3.41555748152465e-05, 'samples': 24016512, 'steps': 125085, 'loss/train': 1.5939804315567017} 11/07/2021 14:51:00 - INFO - __main__ - Step 125087: {'lr': 3.415289730266377e-05, 'samples': 24016704, 'steps': 125086, 'loss/train': 1.174052357673645} 11/07/2021 14:51:00 - INFO - __main__ - Step 125088: {'lr': 3.4150219887338437e-05, 'samples': 24016896, 'steps': 125087, 'loss/train': 1.4083555936813354} 11/07/2021 14:51:01 - INFO - __main__ - Step 125089: {'lr': 3.414754256927163e-05, 'samples': 24017088, 'steps': 125088, 'loss/train': 1.2757542133331299} 11/07/2021 14:51:01 - INFO - __main__ - Step 125090: {'lr': 3.414486534846464e-05, 'samples': 24017280, 'steps': 125089, 'loss/train': 0.986873984336853} 11/07/2021 14:51:01 - INFO - __main__ - Step 125091: {'lr': 3.4142188224918656e-05, 'samples': 24017472, 'steps': 125090, 'loss/train': 1.3566256761550903} 11/07/2021 14:51:02 - INFO - __main__ - Step 125092: {'lr': 3.413951119863484e-05, 'samples': 24017664, 'steps': 125091, 'loss/train': 1.097051739692688} 11/07/2021 14:51:02 - INFO - __main__ - Step 125093: {'lr': 3.4136834269614444e-05, 'samples': 24017856, 'steps': 125092, 'loss/train': 1.657492756843567} 11/07/2021 14:51:03 - INFO - __main__ - Step 125094: {'lr': 3.4134157437858664e-05, 'samples': 24018048, 'steps': 125093, 'loss/train': 1.432663917541504} 11/07/2021 14:51:03 - INFO - __main__ - Step 125095: {'lr': 3.413148070336874e-05, 'samples': 24018240, 'steps': 125094, 'loss/train': 1.4708768129348755} 11/07/2021 14:51:04 - INFO - __main__ - Step 125096: {'lr': 3.4128804066145794e-05, 'samples': 24018432, 'steps': 125095, 'loss/train': 1.3793935775756836} 11/07/2021 14:51:04 - INFO - __main__ - Step 125097: {'lr': 3.4126127526191096e-05, 'samples': 24018624, 'steps': 125096, 'loss/train': 1.3314048051834106} 11/07/2021 14:51:04 - INFO - __main__ - Step 125098: {'lr': 3.412345108350581e-05, 'samples': 24018816, 'steps': 125097, 'loss/train': 1.3138670921325684} 11/07/2021 14:51:06 - INFO - __main__ - Step 125099: {'lr': 3.4120774738091164e-05, 'samples': 24019008, 'steps': 125098, 'loss/train': 1.3090691566467285} 11/07/2021 14:51:06 - INFO - __main__ - Step 125100: {'lr': 3.41180984899484e-05, 'samples': 24019200, 'steps': 125099, 'loss/train': 1.6072502136230469} 11/07/2021 14:51:07 - INFO - __main__ - Step 125101: {'lr': 3.411542233907866e-05, 'samples': 24019392, 'steps': 125100, 'loss/train': 5.39447546005249} 11/07/2021 14:51:07 - INFO - __main__ - Step 125102: {'lr': 3.411274628548317e-05, 'samples': 24019584, 'steps': 125101, 'loss/train': 5.337888717651367} 11/07/2021 14:51:07 - INFO - __main__ - Step 125103: {'lr': 3.4110070329163165e-05, 'samples': 24019776, 'steps': 125102, 'loss/train': 0.6767166256904602} 11/07/2021 14:51:08 - INFO - __main__ - Step 125104: {'lr': 3.4107394470119815e-05, 'samples': 24019968, 'steps': 125103, 'loss/train': 1.006584882736206} 11/07/2021 14:51:09 - INFO - __main__ - Step 125105: {'lr': 3.4104718708354357e-05, 'samples': 24020160, 'steps': 125104, 'loss/train': 1.1838359832763672} 11/07/2021 14:51:09 - INFO - __main__ - Step 125106: {'lr': 3.410204304386799e-05, 'samples': 24020352, 'steps': 125105, 'loss/train': 1.1773691177368164} 11/07/2021 14:51:09 - INFO - __main__ - Step 125107: {'lr': 3.40993674766619e-05, 'samples': 24020544, 'steps': 125106, 'loss/train': 0.9104191064834595} 11/07/2021 14:51:10 - INFO - __main__ - Step 125108: {'lr': 3.409669200673729e-05, 'samples': 24020736, 'steps': 125107, 'loss/train': 0.34177738428115845} 11/07/2021 14:51:10 - INFO - __main__ - Step 125109: {'lr': 3.409401663409545e-05, 'samples': 24020928, 'steps': 125108, 'loss/train': 1.3365328311920166} 11/07/2021 14:51:10 - INFO - __main__ - Step 125110: {'lr': 3.4091341358737456e-05, 'samples': 24021120, 'steps': 125109, 'loss/train': 1.033239722251892} 11/07/2021 14:51:11 - INFO - __main__ - Step 125111: {'lr': 3.4088666180664557e-05, 'samples': 24021312, 'steps': 125110, 'loss/train': 1.1698600053787231} 11/07/2021 14:51:12 - INFO - __main__ - Step 125112: {'lr': 3.4085991099878006e-05, 'samples': 24021504, 'steps': 125111, 'loss/train': 1.1119768619537354} 11/07/2021 14:51:12 - INFO - __main__ - Step 125113: {'lr': 3.4083316116378935e-05, 'samples': 24021696, 'steps': 125112, 'loss/train': 1.0208061933517456} 11/07/2021 14:51:12 - INFO - __main__ - Step 125114: {'lr': 3.408064123016863e-05, 'samples': 24021888, 'steps': 125113, 'loss/train': 1.5204346179962158} 11/07/2021 14:51:13 - INFO - __main__ - Step 125115: {'lr': 3.407796644124822e-05, 'samples': 24022080, 'steps': 125114, 'loss/train': 1.1962124109268188} 11/07/2021 14:51:14 - INFO - __main__ - Step 125116: {'lr': 3.407529174961896e-05, 'samples': 24022272, 'steps': 125115, 'loss/train': 1.7039414644241333} 11/07/2021 14:51:14 - INFO - __main__ - Step 125117: {'lr': 3.407261715528207e-05, 'samples': 24022464, 'steps': 125116, 'loss/train': 1.128793716430664} 11/07/2021 14:51:15 - INFO - __main__ - Step 125118: {'lr': 3.406994265823868e-05, 'samples': 24022656, 'steps': 125117, 'loss/train': 1.2069854736328125} 11/07/2021 14:51:15 - INFO - __main__ - Step 125119: {'lr': 3.406726825849007e-05, 'samples': 24022848, 'steps': 125118, 'loss/train': 1.3494938611984253} 11/07/2021 14:51:15 - INFO - __main__ - Step 125120: {'lr': 3.406459395603742e-05, 'samples': 24023040, 'steps': 125119, 'loss/train': 1.6951547861099243} 11/07/2021 14:51:17 - INFO - __main__ - Step 125121: {'lr': 3.4061919750881906e-05, 'samples': 24023232, 'steps': 125120, 'loss/train': 1.7623525857925415} 11/07/2021 14:51:17 - INFO - __main__ - Step 125122: {'lr': 3.405924564302484e-05, 'samples': 24023424, 'steps': 125121, 'loss/train': 1.6160154342651367} 11/07/2021 14:51:18 - INFO - __main__ - Step 125123: {'lr': 3.405657163246728e-05, 'samples': 24023616, 'steps': 125122, 'loss/train': 1.1632119417190552} 11/07/2021 14:51:18 - INFO - __main__ - Step 125124: {'lr': 3.40538977192105e-05, 'samples': 24023808, 'steps': 125123, 'loss/train': 2.6118996143341064} 11/07/2021 14:51:18 - INFO - __main__ - Step 125125: {'lr': 3.405122390325569e-05, 'samples': 24024000, 'steps': 125124, 'loss/train': 2.591944456100464} 11/07/2021 14:51:19 - INFO - __main__ - Step 125126: {'lr': 3.4048550184604096e-05, 'samples': 24024192, 'steps': 125125, 'loss/train': 1.3653935194015503} 11/07/2021 14:51:20 - INFO - __main__ - Step 125127: {'lr': 3.404587656325686e-05, 'samples': 24024384, 'steps': 125126, 'loss/train': 1.2448986768722534} 11/07/2021 14:51:20 - INFO - __main__ - Step 125128: {'lr': 3.4043203039215235e-05, 'samples': 24024576, 'steps': 125127, 'loss/train': 1.1647933721542358} 11/07/2021 14:51:20 - INFO - __main__ - Step 125129: {'lr': 3.404052961248042e-05, 'samples': 24024768, 'steps': 125128, 'loss/train': 1.1861828565597534} 11/07/2021 14:51:21 - INFO - __main__ - Step 125130: {'lr': 3.4037856283053584e-05, 'samples': 24024960, 'steps': 125129, 'loss/train': 1.2078883647918701} 11/07/2021 14:51:21 - INFO - __main__ - Step 125131: {'lr': 3.4035183050935976e-05, 'samples': 24025152, 'steps': 125130, 'loss/train': 1.6038517951965332} 11/07/2021 14:51:22 - INFO - __main__ - Step 125132: {'lr': 3.403250991612877e-05, 'samples': 24025344, 'steps': 125131, 'loss/train': 1.277744174003601} 11/07/2021 14:51:23 - INFO - __main__ - Step 125133: {'lr': 3.4029836878633174e-05, 'samples': 24025536, 'steps': 125132, 'loss/train': 0.8820052146911621} 11/07/2021 14:51:23 - INFO - __main__ - Step 125134: {'lr': 3.4027163938450425e-05, 'samples': 24025728, 'steps': 125133, 'loss/train': 1.4121358394622803} 11/07/2021 14:51:23 - INFO - __main__ - Step 125135: {'lr': 3.4024491095581754e-05, 'samples': 24025920, 'steps': 125134, 'loss/train': 1.5528597831726074} 11/07/2021 14:51:24 - INFO - __main__ - Step 125136: {'lr': 3.402181835002824e-05, 'samples': 24026112, 'steps': 125135, 'loss/train': 1.144220232963562} 11/07/2021 14:51:25 - INFO - __main__ - Step 125137: {'lr': 3.401914570179118e-05, 'samples': 24026304, 'steps': 125136, 'loss/train': 1.4870209693908691} 11/07/2021 14:51:25 - INFO - __main__ - Step 125138: {'lr': 3.4016473150871755e-05, 'samples': 24026496, 'steps': 125137, 'loss/train': 1.723498821258545} 11/07/2021 14:51:25 - INFO - __main__ - Step 125139: {'lr': 3.4013800697271196e-05, 'samples': 24026688, 'steps': 125138, 'loss/train': 1.259946584701538} 11/07/2021 14:51:26 - INFO - __main__ - Step 125140: {'lr': 3.401112834099066e-05, 'samples': 24026880, 'steps': 125139, 'loss/train': 1.3716148138046265} 11/07/2021 14:51:26 - INFO - __main__ - Step 125141: {'lr': 3.400845608203138e-05, 'samples': 24027072, 'steps': 125140, 'loss/train': 1.6736788749694824} 11/07/2021 14:51:27 - INFO - __main__ - Step 125142: {'lr': 3.400578392039455e-05, 'samples': 24027264, 'steps': 125141, 'loss/train': 1.3667196035385132} 11/07/2021 14:51:27 - INFO - __main__ - Step 125143: {'lr': 3.4003111856081404e-05, 'samples': 24027456, 'steps': 125142, 'loss/train': 1.7081694602966309} 11/07/2021 14:51:28 - INFO - __main__ - Step 125144: {'lr': 3.40004398890931e-05, 'samples': 24027648, 'steps': 125143, 'loss/train': 1.7579584121704102} 11/07/2021 14:51:28 - INFO - __main__ - Step 125145: {'lr': 3.399776801943089e-05, 'samples': 24027840, 'steps': 125144, 'loss/train': 1.2063919305801392} 11/07/2021 14:51:29 - INFO - __main__ - Step 125146: {'lr': 3.399509624709593e-05, 'samples': 24028032, 'steps': 125145, 'loss/train': 0.43145468831062317} 11/07/2021 14:51:30 - INFO - __main__ - Step 125147: {'lr': 3.399242457208945e-05, 'samples': 24028224, 'steps': 125146, 'loss/train': 1.4248411655426025} 11/07/2021 14:51:30 - INFO - __main__ - Step 125148: {'lr': 3.398975299441265e-05, 'samples': 24028416, 'steps': 125147, 'loss/train': 1.4774872064590454} 11/07/2021 14:51:30 - INFO - __main__ - Step 125149: {'lr': 3.398708151406679e-05, 'samples': 24028608, 'steps': 125148, 'loss/train': 1.3650267124176025} 11/07/2021 14:51:31 - INFO - __main__ - Step 125150: {'lr': 3.3984410131052965e-05, 'samples': 24028800, 'steps': 125149, 'loss/train': 1.2704256772994995} 11/07/2021 14:51:31 - INFO - __main__ - Step 125151: {'lr': 3.398173884537242e-05, 'samples': 24028992, 'steps': 125150, 'loss/train': 0.973454475402832} 11/07/2021 14:51:31 - INFO - __main__ - Step 125152: {'lr': 3.397906765702638e-05, 'samples': 24029184, 'steps': 125151, 'loss/train': 1.2515920400619507} 11/07/2021 14:51:32 - INFO - __main__ - Step 125153: {'lr': 3.397639656601606e-05, 'samples': 24029376, 'steps': 125152, 'loss/train': 1.241809368133545} 11/07/2021 14:51:33 - INFO - __main__ - Step 125154: {'lr': 3.39737255723426e-05, 'samples': 24029568, 'steps': 125153, 'loss/train': 1.6371299028396606} 11/07/2021 14:51:33 - INFO - __main__ - Step 125155: {'lr': 3.3971054676007276e-05, 'samples': 24029760, 'steps': 125154, 'loss/train': 1.83396315574646} 11/07/2021 14:51:33 - INFO - __main__ - Step 125156: {'lr': 3.396838387701126e-05, 'samples': 24029952, 'steps': 125155, 'loss/train': 1.3003039360046387} 11/07/2021 14:51:34 - INFO - __main__ - Step 125157: {'lr': 3.396571317535574e-05, 'samples': 24030144, 'steps': 125156, 'loss/train': 1.394210934638977} 11/07/2021 14:51:35 - INFO - __main__ - Step 125158: {'lr': 3.396304257104196e-05, 'samples': 24030336, 'steps': 125157, 'loss/train': 0.6973487734794617} 11/07/2021 14:51:35 - INFO - __main__ - Step 125159: {'lr': 3.3960372064071074e-05, 'samples': 24030528, 'steps': 125158, 'loss/train': 1.2497576475143433} 11/07/2021 14:51:36 - INFO - __main__ - Step 125160: {'lr': 3.395770165444431e-05, 'samples': 24030720, 'steps': 125159, 'loss/train': 1.3961886167526245} 11/07/2021 14:51:36 - INFO - __main__ - Step 125161: {'lr': 3.3955031342162904e-05, 'samples': 24030912, 'steps': 125160, 'loss/train': 1.4566850662231445} 11/07/2021 14:51:36 - INFO - __main__ - Step 125162: {'lr': 3.3952361127228046e-05, 'samples': 24031104, 'steps': 125161, 'loss/train': 1.4073759317398071} 11/07/2021 14:51:38 - INFO - __main__ - Step 125163: {'lr': 3.39496910096409e-05, 'samples': 24031296, 'steps': 125162, 'loss/train': 1.6406389474868774} 11/07/2021 14:51:38 - INFO - __main__ - Step 125164: {'lr': 3.3947020989402665e-05, 'samples': 24031488, 'steps': 125163, 'loss/train': 0.9032002091407776} 11/07/2021 14:51:38 - INFO - __main__ - Step 125165: {'lr': 3.394435106651458e-05, 'samples': 24031680, 'steps': 125164, 'loss/train': 1.5744551420211792} 11/07/2021 14:51:39 - INFO - __main__ - Step 125166: {'lr': 3.3941681240977826e-05, 'samples': 24031872, 'steps': 125165, 'loss/train': 5.745985507965088} 11/07/2021 14:51:39 - INFO - __main__ - Step 125167: {'lr': 3.39390115127936e-05, 'samples': 24032064, 'steps': 125166, 'loss/train': 1.049519419670105} 11/07/2021 14:51:39 - INFO - __main__ - Step 125168: {'lr': 3.393634188196315e-05, 'samples': 24032256, 'steps': 125167, 'loss/train': 1.3426203727722168} 11/07/2021 14:51:40 - INFO - __main__ - Step 125169: {'lr': 3.393367234848766e-05, 'samples': 24032448, 'steps': 125168, 'loss/train': 1.4826393127441406} 11/07/2021 14:51:41 - INFO - __main__ - Step 125170: {'lr': 3.393100291236831e-05, 'samples': 24032640, 'steps': 125169, 'loss/train': 1.4073307514190674} 11/07/2021 14:51:41 - INFO - __main__ - Step 125171: {'lr': 3.392833357360631e-05, 'samples': 24032832, 'steps': 125170, 'loss/train': 1.4078023433685303} 11/07/2021 14:51:41 - INFO - __main__ - Step 125172: {'lr': 3.3925664332202874e-05, 'samples': 24033024, 'steps': 125171, 'loss/train': 1.6318296194076538} 11/07/2021 14:51:42 - INFO - __main__ - Step 125173: {'lr': 3.3922995188159194e-05, 'samples': 24033216, 'steps': 125172, 'loss/train': 1.6719163656234741} 11/07/2021 14:51:43 - INFO - __main__ - Step 125174: {'lr': 3.392032614147647e-05, 'samples': 24033408, 'steps': 125173, 'loss/train': 1.0586758852005005} 11/07/2021 14:51:43 - INFO - __main__ - Step 125175: {'lr': 3.3917657192155975e-05, 'samples': 24033600, 'steps': 125174, 'loss/train': 1.5314606428146362} 11/07/2021 14:51:44 - INFO - __main__ - Step 125176: {'lr': 3.391498834019879e-05, 'samples': 24033792, 'steps': 125175, 'loss/train': 1.0870217084884644} 11/07/2021 14:51:44 - INFO - __main__ - Step 125177: {'lr': 3.39123195856062e-05, 'samples': 24033984, 'steps': 125176, 'loss/train': 1.1714550256729126} 11/07/2021 14:51:44 - INFO - __main__ - Step 125178: {'lr': 3.390965092837936e-05, 'samples': 24034176, 'steps': 125177, 'loss/train': 1.3751496076583862} 11/07/2021 14:51:45 - INFO - __main__ - Step 125179: {'lr': 3.3906982368519495e-05, 'samples': 24034368, 'steps': 125178, 'loss/train': 1.3839744329452515} 11/07/2021 14:51:46 - INFO - __main__ - Step 125180: {'lr': 3.3904313906027825e-05, 'samples': 24034560, 'steps': 125179, 'loss/train': 1.320430040359497} 11/07/2021 14:51:46 - INFO - __main__ - Step 125181: {'lr': 3.390164554090553e-05, 'samples': 24034752, 'steps': 125180, 'loss/train': 1.0286011695861816} 11/07/2021 14:51:46 - INFO - __main__ - Step 125182: {'lr': 3.389897727315383e-05, 'samples': 24034944, 'steps': 125181, 'loss/train': 1.2523295879364014} 11/07/2021 14:51:47 - INFO - __main__ - Step 125183: {'lr': 3.389630910277389e-05, 'samples': 24035136, 'steps': 125182, 'loss/train': 1.5476586818695068} 11/07/2021 14:51:48 - INFO - __main__ - Step 125184: {'lr': 3.389364102976697e-05, 'samples': 24035328, 'steps': 125183, 'loss/train': 1.2981740236282349} 11/07/2021 14:51:48 - INFO - __main__ - Step 125185: {'lr': 3.389097305413422e-05, 'samples': 24035520, 'steps': 125184, 'loss/train': 1.3165137767791748} 11/07/2021 14:51:49 - INFO - __main__ - Step 125186: {'lr': 3.388830517587693e-05, 'samples': 24035712, 'steps': 125185, 'loss/train': 2.1377227306365967} 11/07/2021 14:51:49 - INFO - __main__ - Step 125187: {'lr': 3.388563739499617e-05, 'samples': 24035904, 'steps': 125186, 'loss/train': 1.0545274019241333} 11/07/2021 14:51:49 - INFO - __main__ - Step 125188: {'lr': 3.388296971149321e-05, 'samples': 24036096, 'steps': 125187, 'loss/train': 1.7890079021453857} 11/07/2021 14:51:50 - INFO - __main__ - Step 125189: {'lr': 3.3880302125369245e-05, 'samples': 24036288, 'steps': 125188, 'loss/train': 1.7296909093856812} 11/07/2021 14:51:51 - INFO - __main__ - Step 125190: {'lr': 3.387763463662549e-05, 'samples': 24036480, 'steps': 125189, 'loss/train': 1.4475384950637817} 11/07/2021 14:51:51 - INFO - __main__ - Step 125191: {'lr': 3.387496724526312e-05, 'samples': 24036672, 'steps': 125190, 'loss/train': 1.3928889036178589} 11/07/2021 14:51:52 - INFO - __main__ - Step 125192: {'lr': 3.387229995128338e-05, 'samples': 24036864, 'steps': 125191, 'loss/train': 1.5344878435134888} 11/07/2021 14:51:52 - INFO - __main__ - Step 125193: {'lr': 3.3869632754687436e-05, 'samples': 24037056, 'steps': 125192, 'loss/train': 1.0139422416687012} 11/07/2021 14:51:52 - INFO - __main__ - Step 125194: {'lr': 3.386696565547648e-05, 'samples': 24037248, 'steps': 125193, 'loss/train': 1.4892810583114624} 11/07/2021 14:51:53 - INFO - __main__ - Step 125195: {'lr': 3.386429865365176e-05, 'samples': 24037440, 'steps': 125194, 'loss/train': 1.4935269355773926} 11/07/2021 14:51:54 - INFO - __main__ - Step 125196: {'lr': 3.3861631749214444e-05, 'samples': 24037632, 'steps': 125195, 'loss/train': 0.6307225227355957} 11/07/2021 14:51:54 - INFO - __main__ - Step 125197: {'lr': 3.385896494216578e-05, 'samples': 24037824, 'steps': 125196, 'loss/train': 1.8732603788375854} 11/07/2021 14:51:54 - INFO - __main__ - Step 125198: {'lr': 3.385629823250691e-05, 'samples': 24038016, 'steps': 125197, 'loss/train': 1.2203139066696167} 11/07/2021 14:51:55 - INFO - __main__ - Step 125199: {'lr': 3.3853631620239024e-05, 'samples': 24038208, 'steps': 125198, 'loss/train': 1.5667437314987183} 11/07/2021 14:51:55 - INFO - __main__ - Step 125200: {'lr': 3.385096510536339e-05, 'samples': 24038400, 'steps': 125199, 'loss/train': 1.0324456691741943} 11/07/2021 14:51:56 - INFO - __main__ - Step 125201: {'lr': 3.384829868788114e-05, 'samples': 24038592, 'steps': 125200, 'loss/train': 1.0440614223480225} 11/07/2021 14:51:56 - INFO - __main__ - Step 125202: {'lr': 3.384563236779353e-05, 'samples': 24038784, 'steps': 125201, 'loss/train': 1.3689095973968506} 11/07/2021 14:51:57 - INFO - __main__ - Step 125203: {'lr': 3.384296614510174e-05, 'samples': 24038976, 'steps': 125202, 'loss/train': 0.9472480416297913} 11/07/2021 14:51:57 - INFO - __main__ - Step 125204: {'lr': 3.384030001980698e-05, 'samples': 24039168, 'steps': 125203, 'loss/train': 0.6146320700645447} 11/07/2021 14:51:58 - INFO - __main__ - Step 125205: {'lr': 3.383763399191045e-05, 'samples': 24039360, 'steps': 125204, 'loss/train': 1.4662154912948608} 11/07/2021 14:51:58 - INFO - __main__ - Step 125206: {'lr': 3.383496806141334e-05, 'samples': 24039552, 'steps': 125205, 'loss/train': 0.8012204766273499} 11/07/2021 14:51:59 - INFO - __main__ - Step 125207: {'lr': 3.383230222831685e-05, 'samples': 24039744, 'steps': 125206, 'loss/train': 1.6614761352539062} 11/07/2021 14:51:59 - INFO - __main__ - Step 125208: {'lr': 3.3829636492622243e-05, 'samples': 24039936, 'steps': 125207, 'loss/train': 0.7010765075683594} 11/07/2021 14:52:00 - INFO - __main__ - Step 125209: {'lr': 3.382697085433062e-05, 'samples': 24040128, 'steps': 125208, 'loss/train': 1.8480099439620972} 11/07/2021 14:52:00 - INFO - __main__ - Step 125210: {'lr': 3.3824305313443215e-05, 'samples': 24040320, 'steps': 125209, 'loss/train': 0.9440897107124329} 11/07/2021 14:52:01 - INFO - __main__ - Step 125211: {'lr': 3.3821639869961257e-05, 'samples': 24040512, 'steps': 125210, 'loss/train': 1.5993331670761108} 11/07/2021 14:52:01 - INFO - __main__ - Step 125212: {'lr': 3.381897452388594e-05, 'samples': 24040704, 'steps': 125211, 'loss/train': 1.5353010892868042} 11/07/2021 14:52:02 - INFO - __main__ - Step 125213: {'lr': 3.381630927521845e-05, 'samples': 24040896, 'steps': 125212, 'loss/train': 1.3193029165267944} 11/07/2021 14:52:02 - INFO - __main__ - Step 125214: {'lr': 3.381364412395998e-05, 'samples': 24041088, 'steps': 125213, 'loss/train': 1.0144621133804321} 11/07/2021 14:52:02 - INFO - __main__ - Step 125215: {'lr': 3.3810979070111744e-05, 'samples': 24041280, 'steps': 125214, 'loss/train': 1.4628974199295044} 11/07/2021 14:52:03 - INFO - __main__ - Step 125216: {'lr': 3.3808314113674963e-05, 'samples': 24041472, 'steps': 125215, 'loss/train': 1.6533814668655396} 11/07/2021 14:52:04 - INFO - __main__ - Step 125217: {'lr': 3.380564925465082e-05, 'samples': 24041664, 'steps': 125216, 'loss/train': 1.2723779678344727} 11/07/2021 14:52:04 - INFO - __main__ - Step 125218: {'lr': 3.380298449304051e-05, 'samples': 24041856, 'steps': 125217, 'loss/train': 1.323150873184204} 11/07/2021 14:52:04 - INFO - __main__ - Step 125219: {'lr': 3.3800319828845294e-05, 'samples': 24042048, 'steps': 125218, 'loss/train': 0.864437997341156} 11/07/2021 14:52:05 - INFO - __main__ - Step 125220: {'lr': 3.379765526206624e-05, 'samples': 24042240, 'steps': 125219, 'loss/train': 1.093607783317566} 11/07/2021 14:52:06 - INFO - __main__ - Step 125221: {'lr': 3.379499079270465e-05, 'samples': 24042432, 'steps': 125220, 'loss/train': 1.3673053979873657} 11/07/2021 14:52:06 - INFO - __main__ - Step 125222: {'lr': 3.3792326420761715e-05, 'samples': 24042624, 'steps': 125221, 'loss/train': 1.3883039951324463} 11/07/2021 14:52:07 - INFO - __main__ - Step 125223: {'lr': 3.37896621462386e-05, 'samples': 24042816, 'steps': 125222, 'loss/train': 0.8625388145446777} 11/07/2021 14:52:07 - INFO - __main__ - Step 125224: {'lr': 3.378699796913653e-05, 'samples': 24043008, 'steps': 125223, 'loss/train': 1.2492430210113525} 11/07/2021 14:52:07 - INFO - __main__ - Step 125225: {'lr': 3.3784333889456706e-05, 'samples': 24043200, 'steps': 125224, 'loss/train': 0.9258843660354614} 11/07/2021 14:52:08 - INFO - __main__ - Step 125226: {'lr': 3.378166990720033e-05, 'samples': 24043392, 'steps': 125225, 'loss/train': 0.7582959532737732} 11/07/2021 14:52:09 - INFO - __main__ - Step 125227: {'lr': 3.377900602236858e-05, 'samples': 24043584, 'steps': 125226, 'loss/train': 1.4820754528045654} 11/07/2021 14:52:09 - INFO - __main__ - Step 125228: {'lr': 3.3776342234962676e-05, 'samples': 24043776, 'steps': 125227, 'loss/train': 0.9047059416770935} 11/07/2021 14:52:09 - INFO - __main__ - Step 125229: {'lr': 3.3773678544983836e-05, 'samples': 24043968, 'steps': 125228, 'loss/train': 0.8088140487670898} 11/07/2021 14:52:10 - INFO - __main__ - Step 125230: {'lr': 3.3771014952433286e-05, 'samples': 24044160, 'steps': 125229, 'loss/train': 2.5406510829925537} 11/07/2021 14:52:10 - INFO - __main__ - Step 125231: {'lr': 3.3768351457312105e-05, 'samples': 24044352, 'steps': 125230, 'loss/train': 1.2212971448898315} 11/07/2021 14:52:11 - INFO - __main__ - Step 125232: {'lr': 3.37656880596216e-05, 'samples': 24044544, 'steps': 125231, 'loss/train': 0.9852997064590454} 11/07/2021 14:52:11 - INFO - __main__ - Step 125233: {'lr': 3.376302475936291e-05, 'samples': 24044736, 'steps': 125232, 'loss/train': 1.349700689315796} 11/07/2021 14:52:12 - INFO - __main__ - Step 125234: {'lr': 3.3760361556537275e-05, 'samples': 24044928, 'steps': 125233, 'loss/train': 0.9634236693382263} 11/07/2021 14:52:12 - INFO - __main__ - Step 125235: {'lr': 3.37576984511459e-05, 'samples': 24045120, 'steps': 125234, 'loss/train': 1.3780571222305298} 11/07/2021 14:52:12 - INFO - __main__ - Step 125236: {'lr': 3.3755035443189946e-05, 'samples': 24045312, 'steps': 125235, 'loss/train': 0.794048011302948} 11/07/2021 14:52:13 - INFO - __main__ - Step 125237: {'lr': 3.3752372532670664e-05, 'samples': 24045504, 'steps': 125236, 'loss/train': 0.8520177006721497} 11/07/2021 14:52:14 - INFO - __main__ - Step 125238: {'lr': 3.374970971958918e-05, 'samples': 24045696, 'steps': 125237, 'loss/train': 1.3137210607528687} 11/07/2021 14:52:14 - INFO - __main__ - Step 125239: {'lr': 3.374704700394679e-05, 'samples': 24045888, 'steps': 125238, 'loss/train': 1.5866045951843262} 11/07/2021 14:52:15 - INFO - __main__ - Step 125240: {'lr': 3.374438438574462e-05, 'samples': 24046080, 'steps': 125239, 'loss/train': 1.456810712814331} 11/07/2021 14:52:15 - INFO - __main__ - Step 125241: {'lr': 3.374172186498389e-05, 'samples': 24046272, 'steps': 125240, 'loss/train': 1.295082449913025} 11/07/2021 14:52:16 - INFO - __main__ - Step 125242: {'lr': 3.3739059441665806e-05, 'samples': 24046464, 'steps': 125241, 'loss/train': 1.6216847896575928} 11/07/2021 14:52:16 - INFO - __main__ - Step 125243: {'lr': 3.373639711579163e-05, 'samples': 24046656, 'steps': 125242, 'loss/train': 1.6105877161026} 11/07/2021 14:52:17 - INFO - __main__ - Step 125244: {'lr': 3.373373488736242e-05, 'samples': 24046848, 'steps': 125243, 'loss/train': 1.3739532232284546} 11/07/2021 14:52:17 - INFO - __main__ - Step 125245: {'lr': 3.373107275637946e-05, 'samples': 24047040, 'steps': 125244, 'loss/train': 1.2346135377883911} 11/07/2021 14:52:17 - INFO - __main__ - Step 125246: {'lr': 3.3728410722843966e-05, 'samples': 24047232, 'steps': 125245, 'loss/train': 1.1882387399673462} 11/07/2021 14:52:18 - INFO - __main__ - Step 125247: {'lr': 3.372574878675708e-05, 'samples': 24047424, 'steps': 125246, 'loss/train': 1.246747374534607} 11/07/2021 14:52:19 - INFO - __main__ - Step 125248: {'lr': 3.3723086948120066e-05, 'samples': 24047616, 'steps': 125247, 'loss/train': 1.2873916625976562} 11/07/2021 14:52:19 - INFO - __main__ - Step 125249: {'lr': 3.372042520693405e-05, 'samples': 24047808, 'steps': 125248, 'loss/train': 1.4837725162506104} 11/07/2021 14:52:19 - INFO - __main__ - Step 125250: {'lr': 3.371776356320031e-05, 'samples': 24048000, 'steps': 125249, 'loss/train': 1.574913740158081} 11/07/2021 14:52:20 - INFO - __main__ - Step 125251: {'lr': 3.371510201691999e-05, 'samples': 24048192, 'steps': 125250, 'loss/train': 1.1544193029403687} 11/07/2021 14:52:20 - INFO - __main__ - Step 125252: {'lr': 3.371244056809431e-05, 'samples': 24048384, 'steps': 125251, 'loss/train': 0.9669017791748047} 11/07/2021 14:52:22 - INFO - __main__ - Step 125253: {'lr': 3.370977921672447e-05, 'samples': 24048576, 'steps': 125252, 'loss/train': 1.9012092351913452} 11/07/2021 14:52:22 - INFO - __main__ - Step 125254: {'lr': 3.370711796281167e-05, 'samples': 24048768, 'steps': 125253, 'loss/train': 1.7488635778427124} 11/07/2021 14:52:23 - INFO - __main__ - Step 125255: {'lr': 3.3704456806357085e-05, 'samples': 24048960, 'steps': 125254, 'loss/train': 1.5503137111663818} 11/07/2021 14:52:23 - INFO - __main__ - Step 125256: {'lr': 3.3701795747362014e-05, 'samples': 24049152, 'steps': 125255, 'loss/train': 1.3836787939071655} 11/07/2021 14:52:23 - INFO - __main__ - Step 125257: {'lr': 3.369913478582751e-05, 'samples': 24049344, 'steps': 125256, 'loss/train': 1.2881697416305542} 11/07/2021 14:52:24 - INFO - __main__ - Step 125258: {'lr': 3.3696473921754844e-05, 'samples': 24049536, 'steps': 125257, 'loss/train': 1.4809430837631226} 11/07/2021 14:52:25 - INFO - __main__ - Step 125259: {'lr': 3.369381315514522e-05, 'samples': 24049728, 'steps': 125258, 'loss/train': 3.593841552734375} 11/07/2021 14:52:25 - INFO - __main__ - Step 125260: {'lr': 3.36911524859998e-05, 'samples': 24049920, 'steps': 125259, 'loss/train': 1.2632145881652832} 11/07/2021 14:52:25 - INFO - __main__ - Step 125261: {'lr': 3.368849191431983e-05, 'samples': 24050112, 'steps': 125260, 'loss/train': 1.5349007844924927} 11/07/2021 14:52:26 - INFO - __main__ - Step 125262: {'lr': 3.3685831440106454e-05, 'samples': 24050304, 'steps': 125261, 'loss/train': 1.3455947637557983} 11/07/2021 14:52:26 - INFO - __main__ - Step 125263: {'lr': 3.368317106336094e-05, 'samples': 24050496, 'steps': 125262, 'loss/train': 2.0410847663879395} 11/07/2021 14:52:26 - INFO - __main__ - Step 125264: {'lr': 3.368051078408443e-05, 'samples': 24050688, 'steps': 125263, 'loss/train': 1.0106470584869385} 11/07/2021 14:52:28 - INFO - __main__ - Step 125265: {'lr': 3.367785060227816e-05, 'samples': 24050880, 'steps': 125264, 'loss/train': 1.2524185180664062} 11/07/2021 14:52:29 - INFO - __main__ - Step 125266: {'lr': 3.367519051794329e-05, 'samples': 24051072, 'steps': 125265, 'loss/train': 2.658628225326538} 11/07/2021 14:52:29 - INFO - __main__ - Step 125267: {'lr': 3.3672530531081074e-05, 'samples': 24051264, 'steps': 125266, 'loss/train': 2.652536153793335} 11/07/2021 14:52:29 - INFO - __main__ - Step 125268: {'lr': 3.3669870641692664e-05, 'samples': 24051456, 'steps': 125267, 'loss/train': 1.8227894306182861} 11/07/2021 14:52:30 - INFO - __main__ - Step 125269: {'lr': 3.366721084977925e-05, 'samples': 24051648, 'steps': 125268, 'loss/train': 1.3007385730743408} 11/07/2021 14:52:30 - INFO - __main__ - Step 125270: {'lr': 3.3664551155342145e-05, 'samples': 24051840, 'steps': 125269, 'loss/train': 1.0280652046203613} 11/07/2021 14:52:30 - INFO - __main__ - Step 125271: {'lr': 3.3661891558382366e-05, 'samples': 24052032, 'steps': 125270, 'loss/train': 1.6872566938400269} 11/07/2021 14:52:32 - INFO - __main__ - Step 125272: {'lr': 3.3659232058901227e-05, 'samples': 24052224, 'steps': 125271, 'loss/train': 0.5608728528022766} 11/07/2021 14:52:32 - INFO - __main__ - Step 125273: {'lr': 3.3656572656899864e-05, 'samples': 24052416, 'steps': 125272, 'loss/train': 0.7868756651878357} 11/07/2021 14:52:33 - INFO - __main__ - Step 125274: {'lr': 3.365391335237955e-05, 'samples': 24052608, 'steps': 125273, 'loss/train': 1.493935227394104} 11/07/2021 14:52:33 - INFO - __main__ - Step 125275: {'lr': 3.365125414534142e-05, 'samples': 24052800, 'steps': 125274, 'loss/train': 1.1487010717391968} 11/07/2021 14:52:33 - INFO - __main__ - Step 125276: {'lr': 3.364859503578671e-05, 'samples': 24052992, 'steps': 125275, 'loss/train': 1.2469722032546997} 11/07/2021 14:52:34 - INFO - __main__ - Step 125277: {'lr': 3.36459360237166e-05, 'samples': 24053184, 'steps': 125276, 'loss/train': 2.163831949234009} 11/07/2021 14:52:35 - INFO - __main__ - Step 125278: {'lr': 3.364327710913229e-05, 'samples': 24053376, 'steps': 125277, 'loss/train': 0.8471287488937378} 11/07/2021 14:52:35 - INFO - __main__ - Step 125279: {'lr': 3.364061829203499e-05, 'samples': 24053568, 'steps': 125278, 'loss/train': 1.0270024538040161} 11/07/2021 14:52:35 - INFO - __main__ - Step 125280: {'lr': 3.363795957242588e-05, 'samples': 24053760, 'steps': 125279, 'loss/train': 1.3189013004302979} 11/07/2021 14:52:36 - INFO - __main__ - Step 125281: {'lr': 3.363530095030617e-05, 'samples': 24053952, 'steps': 125280, 'loss/train': 0.8119052648544312} 11/07/2021 14:52:37 - INFO - __main__ - Step 125282: {'lr': 3.363264242567704e-05, 'samples': 24054144, 'steps': 125281, 'loss/train': 1.6079587936401367} 11/07/2021 14:52:37 - INFO - __main__ - Step 125283: {'lr': 3.362998399853978e-05, 'samples': 24054336, 'steps': 125282, 'loss/train': 1.2930301427841187} 11/07/2021 14:52:38 - INFO - __main__ - Step 125284: {'lr': 3.362732566889545e-05, 'samples': 24054528, 'steps': 125283, 'loss/train': 1.2245512008666992} 11/07/2021 14:52:38 - INFO - __main__ - Step 125285: {'lr': 3.3624667436745305e-05, 'samples': 24054720, 'steps': 125284, 'loss/train': 1.6060484647750854} 11/07/2021 14:52:38 - INFO - __main__ - Step 125286: {'lr': 3.362200930209053e-05, 'samples': 24054912, 'steps': 125285, 'loss/train': 1.3688040971755981} 11/07/2021 14:52:39 - INFO - __main__ - Step 125287: {'lr': 3.3619351264932375e-05, 'samples': 24055104, 'steps': 125286, 'loss/train': 1.4656416177749634} 11/07/2021 14:52:41 - INFO - __main__ - Step 125288: {'lr': 3.361669332527198e-05, 'samples': 24055296, 'steps': 125287, 'loss/train': 1.4081742763519287} 11/07/2021 14:52:41 - INFO - __main__ - Step 125289: {'lr': 3.3614035483110537e-05, 'samples': 24055488, 'steps': 125288, 'loss/train': 1.9659992456436157} 11/07/2021 14:52:42 - INFO - __main__ - Step 125290: {'lr': 3.36113777384493e-05, 'samples': 24055680, 'steps': 125289, 'loss/train': 1.3492703437805176} 11/07/2021 14:52:42 - INFO - __main__ - Step 125291: {'lr': 3.360872009128946e-05, 'samples': 24055872, 'steps': 125290, 'loss/train': 0.8833732604980469} 11/07/2021 14:52:42 - INFO - __main__ - Step 125292: {'lr': 3.360606254163215e-05, 'samples': 24056064, 'steps': 125291, 'loss/train': 0.797395646572113} 11/07/2021 14:52:43 - INFO - __main__ - Step 125293: {'lr': 3.360340508947862e-05, 'samples': 24056256, 'steps': 125292, 'loss/train': 1.70654296875} 11/07/2021 14:52:43 - INFO - __main__ - Step 125294: {'lr': 3.360074773483007e-05, 'samples': 24056448, 'steps': 125293, 'loss/train': 0.8181554079055786} 11/07/2021 14:52:44 - INFO - __main__ - Step 125295: {'lr': 3.3598090477687665e-05, 'samples': 24056640, 'steps': 125294, 'loss/train': 0.7384997606277466} 11/07/2021 14:52:44 - INFO - __main__ - Step 125296: {'lr': 3.359543331805265e-05, 'samples': 24056832, 'steps': 125295, 'loss/train': 1.0402648448944092} 11/07/2021 14:52:45 - INFO - __main__ - Step 125297: {'lr': 3.3592776255926217e-05, 'samples': 24057024, 'steps': 125296, 'loss/train': 2.304594039916992} 11/07/2021 14:52:45 - INFO - __main__ - Step 125298: {'lr': 3.3590119291309505e-05, 'samples': 24057216, 'steps': 125297, 'loss/train': 2.5103261470794678} 11/07/2021 14:52:45 - INFO - __main__ - Step 125299: {'lr': 3.358746242420374e-05, 'samples': 24057408, 'steps': 125298, 'loss/train': 1.7648406028747559} 11/07/2021 14:52:46 - INFO - __main__ - Step 125300: {'lr': 3.358480565461011e-05, 'samples': 24057600, 'steps': 125299, 'loss/train': 1.0323916673660278} 11/07/2021 14:52:47 - INFO - __main__ - Step 125301: {'lr': 3.358214898252987e-05, 'samples': 24057792, 'steps': 125300, 'loss/train': 1.2256402969360352} 11/07/2021 14:52:47 - INFO - __main__ - Step 125302: {'lr': 3.3579492407964126e-05, 'samples': 24057984, 'steps': 125301, 'loss/train': 1.2390908002853394} 11/07/2021 14:52:47 - INFO - __main__ - Step 125303: {'lr': 3.357683593091415e-05, 'samples': 24058176, 'steps': 125302, 'loss/train': 0.7312503457069397} 11/07/2021 14:52:48 - INFO - __main__ - Step 125304: {'lr': 3.357417955138109e-05, 'samples': 24058368, 'steps': 125303, 'loss/train': 1.5860421657562256} 11/07/2021 14:52:48 - INFO - __main__ - Step 125305: {'lr': 3.3571523269366186e-05, 'samples': 24058560, 'steps': 125304, 'loss/train': 1.323815107345581} 11/07/2021 14:52:49 - INFO - __main__ - Step 125306: {'lr': 3.3568867084870614e-05, 'samples': 24058752, 'steps': 125305, 'loss/train': 1.3584855794906616} 11/07/2021 14:52:50 - INFO - __main__ - Step 125307: {'lr': 3.356621099789556e-05, 'samples': 24058944, 'steps': 125306, 'loss/train': 1.0792179107666016} 11/07/2021 14:52:50 - INFO - __main__ - Step 125308: {'lr': 3.3563555008442244e-05, 'samples': 24059136, 'steps': 125307, 'loss/train': 1.282834768295288} 11/07/2021 14:52:50 - INFO - __main__ - Step 125309: {'lr': 3.356089911651183e-05, 'samples': 24059328, 'steps': 125308, 'loss/train': 1.357906699180603} 11/07/2021 14:52:51 - INFO - __main__ - Step 125310: {'lr': 3.355824332210561e-05, 'samples': 24059520, 'steps': 125309, 'loss/train': 1.4597766399383545} 11/07/2021 14:52:52 - INFO - __main__ - Step 125311: {'lr': 3.355558762522465e-05, 'samples': 24059712, 'steps': 125310, 'loss/train': 1.5994867086410522} 11/07/2021 14:52:52 - INFO - __main__ - Step 125312: {'lr': 3.355293202587018e-05, 'samples': 24059904, 'steps': 125311, 'loss/train': 1.2296429872512817} 11/07/2021 14:52:52 - INFO - __main__ - Step 125313: {'lr': 3.355027652404344e-05, 'samples': 24060096, 'steps': 125312, 'loss/train': 1.3802464008331299} 11/07/2021 14:52:53 - INFO - __main__ - Step 125314: {'lr': 3.3547621119745605e-05, 'samples': 24060288, 'steps': 125313, 'loss/train': 1.3623404502868652} 11/07/2021 14:52:53 - INFO - __main__ - Step 125315: {'lr': 3.354496581297786e-05, 'samples': 24060480, 'steps': 125314, 'loss/train': 1.490670919418335} 11/07/2021 14:52:54 - INFO - __main__ - Step 125316: {'lr': 3.354231060374141e-05, 'samples': 24060672, 'steps': 125315, 'loss/train': 1.3469796180725098} 11/07/2021 14:52:55 - INFO - __main__ - Step 125317: {'lr': 3.353965549203747e-05, 'samples': 24060864, 'steps': 125316, 'loss/train': 1.1161714792251587} 11/07/2021 14:52:55 - INFO - __main__ - Step 125318: {'lr': 3.353700047786723e-05, 'samples': 24061056, 'steps': 125317, 'loss/train': 1.196521520614624} 11/07/2021 14:52:55 - INFO - __main__ - Step 125319: {'lr': 3.353434556123186e-05, 'samples': 24061248, 'steps': 125318, 'loss/train': 0.32133913040161133} 11/07/2021 14:52:56 - INFO - __main__ - Step 125320: {'lr': 3.3531690742132554e-05, 'samples': 24061440, 'steps': 125319, 'loss/train': 1.535189151763916} 11/07/2021 14:52:57 - INFO - __main__ - Step 125321: {'lr': 3.3529036020570556e-05, 'samples': 24061632, 'steps': 125320, 'loss/train': 1.353886365890503} 11/07/2021 14:52:57 - INFO - __main__ - Step 125322: {'lr': 3.352638139654704e-05, 'samples': 24061824, 'steps': 125321, 'loss/train': 1.27946937084198} 11/07/2021 14:52:57 - INFO - __main__ - Step 125323: {'lr': 3.3523726870063194e-05, 'samples': 24062016, 'steps': 125322, 'loss/train': 0.6310428380966187} 11/07/2021 14:52:58 - INFO - __main__ - Step 125324: {'lr': 3.3521072441120234e-05, 'samples': 24062208, 'steps': 125323, 'loss/train': 1.3710938692092896} 11/07/2021 14:52:58 - INFO - __main__ - Step 125325: {'lr': 3.351841810971931e-05, 'samples': 24062400, 'steps': 125324, 'loss/train': 1.0995854139328003} 11/07/2021 14:52:59 - INFO - __main__ - Step 125326: {'lr': 3.351576387586167e-05, 'samples': 24062592, 'steps': 125325, 'loss/train': 1.1245120763778687} 11/07/2021 14:52:59 - INFO - __main__ - Step 125327: {'lr': 3.3513109739548465e-05, 'samples': 24062784, 'steps': 125326, 'loss/train': 1.5200998783111572} 11/07/2021 14:53:00 - INFO - __main__ - Step 125328: {'lr': 3.35104557007809e-05, 'samples': 24062976, 'steps': 125327, 'loss/train': 1.2637032270431519} 11/07/2021 14:53:00 - INFO - __main__ - Step 125329: {'lr': 3.3507801759560194e-05, 'samples': 24063168, 'steps': 125328, 'loss/train': 1.1895264387130737} 11/07/2021 14:53:00 - INFO - __main__ - Step 125330: {'lr': 3.350514791588752e-05, 'samples': 24063360, 'steps': 125329, 'loss/train': 1.5450806617736816} 11/07/2021 14:53:01 - INFO - __main__ - Step 125331: {'lr': 3.3502494169764115e-05, 'samples': 24063552, 'steps': 125330, 'loss/train': 1.1046218872070312} 11/07/2021 14:53:02 - INFO - __main__ - Step 125332: {'lr': 3.349984052119112e-05, 'samples': 24063744, 'steps': 125331, 'loss/train': 1.405009388923645} 11/07/2021 14:53:02 - INFO - __main__ - Step 125333: {'lr': 3.349718697016976e-05, 'samples': 24063936, 'steps': 125332, 'loss/train': 1.5537656545639038} 11/07/2021 14:53:03 - INFO - __main__ - Step 125334: {'lr': 3.349453351670123e-05, 'samples': 24064128, 'steps': 125333, 'loss/train': 1.2435511350631714} 11/07/2021 14:53:03 - INFO - __main__ - Step 125335: {'lr': 3.349188016078672e-05, 'samples': 24064320, 'steps': 125334, 'loss/train': 1.3759238719940186} 11/07/2021 14:53:03 - INFO - __main__ - Step 125336: {'lr': 3.348922690242742e-05, 'samples': 24064512, 'steps': 125335, 'loss/train': 1.2813904285430908} 11/07/2021 14:53:04 - INFO - __main__ - Step 125337: {'lr': 3.3486573741624615e-05, 'samples': 24064704, 'steps': 125336, 'loss/train': 0.9456472396850586} 11/07/2021 14:53:05 - INFO - __main__ - Step 125338: {'lr': 3.348392067837935e-05, 'samples': 24064896, 'steps': 125337, 'loss/train': 1.594191074371338} 11/07/2021 14:53:05 - INFO - __main__ - Step 125339: {'lr': 3.348126771269289e-05, 'samples': 24065088, 'steps': 125338, 'loss/train': 1.225036859512329} 11/07/2021 14:53:05 - INFO - __main__ - Step 125340: {'lr': 3.347861484456641e-05, 'samples': 24065280, 'steps': 125339, 'loss/train': 2.4340569972991943} 11/07/2021 14:53:06 - INFO - __main__ - Step 125341: {'lr': 3.347596207400114e-05, 'samples': 24065472, 'steps': 125340, 'loss/train': 1.2753223180770874} 11/07/2021 14:53:07 - INFO - __main__ - Step 125342: {'lr': 3.347330940099827e-05, 'samples': 24065664, 'steps': 125341, 'loss/train': 1.335310459136963} 11/07/2021 14:53:07 - INFO - __main__ - Step 125343: {'lr': 3.347065682555897e-05, 'samples': 24065856, 'steps': 125342, 'loss/train': 1.2799248695373535} 11/07/2021 14:53:07 - INFO - __main__ - Step 125344: {'lr': 3.346800434768446e-05, 'samples': 24066048, 'steps': 125343, 'loss/train': 1.3462488651275635} 11/07/2021 14:53:08 - INFO - __main__ - Step 125345: {'lr': 3.346535196737593e-05, 'samples': 24066240, 'steps': 125344, 'loss/train': 1.0865150690078735} 11/07/2021 14:53:08 - INFO - __main__ - Step 125346: {'lr': 3.346269968463456e-05, 'samples': 24066432, 'steps': 125345, 'loss/train': 1.3387982845306396} 11/07/2021 14:53:09 - INFO - __main__ - Step 125347: {'lr': 3.346004749946158e-05, 'samples': 24066624, 'steps': 125346, 'loss/train': 1.4851861000061035} 11/07/2021 14:53:10 - INFO - __main__ - Step 125348: {'lr': 3.345739541185813e-05, 'samples': 24066816, 'steps': 125347, 'loss/train': 1.3275941610336304} 11/07/2021 14:53:10 - INFO - __main__ - Step 125349: {'lr': 3.3454743421825445e-05, 'samples': 24067008, 'steps': 125348, 'loss/train': 1.5833791494369507} 11/07/2021 14:53:10 - INFO - __main__ - Step 125350: {'lr': 3.3452091529364706e-05, 'samples': 24067200, 'steps': 125349, 'loss/train': 0.8585340976715088} 11/07/2021 14:53:11 - INFO - __main__ - Step 125351: {'lr': 3.34494397344772e-05, 'samples': 24067392, 'steps': 125350, 'loss/train': 1.8668882846832275} 11/07/2021 14:53:12 - INFO - __main__ - Step 125352: {'lr': 3.3446788037163943e-05, 'samples': 24067584, 'steps': 125351, 'loss/train': 1.661379098892212} 11/07/2021 14:53:12 - INFO - __main__ - Step 125353: {'lr': 3.344413643742625e-05, 'samples': 24067776, 'steps': 125352, 'loss/train': 1.4762113094329834} 11/07/2021 14:53:12 - INFO - __main__ - Step 125354: {'lr': 3.344148493526525e-05, 'samples': 24067968, 'steps': 125353, 'loss/train': 1.4737446308135986} 11/07/2021 14:53:13 - INFO - __main__ - Step 125355: {'lr': 3.3438833530682194e-05, 'samples': 24068160, 'steps': 125354, 'loss/train': 1.3044577836990356} 11/07/2021 14:53:13 - INFO - __main__ - Step 125356: {'lr': 3.343618222367828e-05, 'samples': 24068352, 'steps': 125355, 'loss/train': 1.5475459098815918} 11/07/2021 14:53:14 - INFO - __main__ - Step 125357: {'lr': 3.343353101425467e-05, 'samples': 24068544, 'steps': 125356, 'loss/train': 1.4264590740203857} 11/07/2021 14:53:14 - INFO - __main__ - Step 125358: {'lr': 3.343087990241256e-05, 'samples': 24068736, 'steps': 125357, 'loss/train': 1.2928924560546875} 11/07/2021 14:53:15 - INFO - __main__ - Step 125359: {'lr': 3.342822888815314e-05, 'samples': 24068928, 'steps': 125358, 'loss/train': 1.6414707899093628} 11/07/2021 14:53:15 - INFO - __main__ - Step 125360: {'lr': 3.3425577971477636e-05, 'samples': 24069120, 'steps': 125359, 'loss/train': 1.088962197303772} 11/07/2021 14:53:16 - INFO - __main__ - Step 125361: {'lr': 3.342292715238723e-05, 'samples': 24069312, 'steps': 125360, 'loss/train': 1.0360119342803955} 11/07/2021 14:53:17 - INFO - __main__ - Step 125362: {'lr': 3.342027643088311e-05, 'samples': 24069504, 'steps': 125361, 'loss/train': 0.29156360030174255} 11/07/2021 14:53:17 - INFO - __main__ - Step 125363: {'lr': 3.3417625806966444e-05, 'samples': 24069696, 'steps': 125362, 'loss/train': 1.182605266571045} 11/07/2021 14:53:17 - INFO - __main__ - Step 125364: {'lr': 3.3414975280638525e-05, 'samples': 24069888, 'steps': 125363, 'loss/train': 1.2881481647491455} 11/07/2021 14:53:18 - INFO - __main__ - Step 125365: {'lr': 3.341232485190043e-05, 'samples': 24070080, 'steps': 125364, 'loss/train': 1.3209431171417236} 11/07/2021 14:53:18 - INFO - __main__ - Step 125366: {'lr': 3.3409674520753385e-05, 'samples': 24070272, 'steps': 125365, 'loss/train': 1.6911720037460327} 11/07/2021 14:53:19 - INFO - __main__ - Step 125367: {'lr': 3.3407024287198636e-05, 'samples': 24070464, 'steps': 125366, 'loss/train': 1.2018747329711914} 11/07/2021 14:53:19 - INFO - __main__ - Step 125368: {'lr': 3.340437415123729e-05, 'samples': 24070656, 'steps': 125367, 'loss/train': 1.2619285583496094} 11/07/2021 14:53:20 - INFO - __main__ - Step 125369: {'lr': 3.3401724112870625e-05, 'samples': 24070848, 'steps': 125368, 'loss/train': 1.451938271522522} 11/07/2021 14:53:20 - INFO - __main__ - Step 125370: {'lr': 3.339907417209978e-05, 'samples': 24071040, 'steps': 125369, 'loss/train': 1.7909295558929443} 11/07/2021 14:53:20 - INFO - __main__ - Step 125371: {'lr': 3.339642432892598e-05, 'samples': 24071232, 'steps': 125370, 'loss/train': 0.41487571597099304} 11/07/2021 14:53:21 - INFO - __main__ - Step 125372: {'lr': 3.339377458335041e-05, 'samples': 24071424, 'steps': 125371, 'loss/train': 1.1405749320983887} 11/07/2021 14:53:22 - INFO - __main__ - Step 125373: {'lr': 3.339112493537425e-05, 'samples': 24071616, 'steps': 125372, 'loss/train': 1.3971903324127197} 11/07/2021 14:53:22 - INFO - __main__ - Step 125374: {'lr': 3.338847538499873e-05, 'samples': 24071808, 'steps': 125373, 'loss/train': 0.9466192722320557} 11/07/2021 14:53:23 - INFO - __main__ - Step 125375: {'lr': 3.3385825932225004e-05, 'samples': 24072000, 'steps': 125374, 'loss/train': 1.1835391521453857} 11/07/2021 14:53:23 - INFO - __main__ - Step 125376: {'lr': 3.3383176577054284e-05, 'samples': 24072192, 'steps': 125375, 'loss/train': 1.1947989463806152} 11/07/2021 14:53:23 - INFO - __main__ - Step 125377: {'lr': 3.338052731948782e-05, 'samples': 24072384, 'steps': 125376, 'loss/train': 1.3381719589233398} 11/07/2021 14:53:24 - INFO - __main__ - Step 125378: {'lr': 3.337787815952667e-05, 'samples': 24072576, 'steps': 125377, 'loss/train': 1.199999213218689} 11/07/2021 14:53:25 - INFO - __main__ - Step 125379: {'lr': 3.337522909717214e-05, 'samples': 24072768, 'steps': 125378, 'loss/train': 1.0442613363265991} 11/07/2021 14:53:25 - INFO - __main__ - Step 125380: {'lr': 3.337258013242536e-05, 'samples': 24072960, 'steps': 125379, 'loss/train': 0.6776432991027832} 11/07/2021 14:53:25 - INFO - __main__ - Step 125381: {'lr': 3.336993126528759e-05, 'samples': 24073152, 'steps': 125380, 'loss/train': 1.2869808673858643} 11/07/2021 14:53:26 - INFO - __main__ - Step 125382: {'lr': 3.336728249575996e-05, 'samples': 24073344, 'steps': 125381, 'loss/train': 1.3689404726028442} 11/07/2021 14:53:27 - INFO - __main__ - Step 125383: {'lr': 3.336463382384369e-05, 'samples': 24073536, 'steps': 125382, 'loss/train': 1.4501781463623047} 11/07/2021 14:53:27 - INFO - __main__ - Step 125384: {'lr': 3.336198524953998e-05, 'samples': 24073728, 'steps': 125383, 'loss/train': 0.8597565293312073} 11/07/2021 14:53:27 - INFO - __main__ - Step 125385: {'lr': 3.3359336772849994e-05, 'samples': 24073920, 'steps': 125384, 'loss/train': 1.150602102279663} 11/07/2021 14:53:28 - INFO - __main__ - Step 125386: {'lr': 3.3356688393774984e-05, 'samples': 24074112, 'steps': 125385, 'loss/train': 1.3249292373657227} 11/07/2021 14:53:28 - INFO - __main__ - Step 125387: {'lr': 3.3354040112316076e-05, 'samples': 24074304, 'steps': 125386, 'loss/train': 1.2779043912887573} 11/07/2021 14:53:29 - INFO - __main__ - Step 125388: {'lr': 3.335139192847453e-05, 'samples': 24074496, 'steps': 125387, 'loss/train': 1.7201879024505615} 11/07/2021 14:53:30 - INFO - __main__ - Step 125389: {'lr': 3.334874384225148e-05, 'samples': 24074688, 'steps': 125388, 'loss/train': 1.4659459590911865} 11/07/2021 14:53:30 - INFO - __main__ - Step 125390: {'lr': 3.334609585364815e-05, 'samples': 24074880, 'steps': 125389, 'loss/train': 1.169593334197998} 11/07/2021 14:53:31 - INFO - __main__ - Step 125391: {'lr': 3.334344796266575e-05, 'samples': 24075072, 'steps': 125390, 'loss/train': 0.7633979320526123} 11/07/2021 14:53:31 - INFO - __main__ - Step 125392: {'lr': 3.334080016930544e-05, 'samples': 24075264, 'steps': 125391, 'loss/train': 1.2812530994415283} 11/07/2021 14:53:32 - INFO - __main__ - Step 125393: {'lr': 3.333815247356839e-05, 'samples': 24075456, 'steps': 125392, 'loss/train': 0.8687723278999329} 11/07/2021 14:53:32 - INFO - __main__ - Step 125394: {'lr': 3.333550487545583e-05, 'samples': 24075648, 'steps': 125393, 'loss/train': 0.9263911843299866} 11/07/2021 14:53:33 - INFO - __main__ - Step 125395: {'lr': 3.3332857374968966e-05, 'samples': 24075840, 'steps': 125394, 'loss/train': 1.2941069602966309} 11/07/2021 14:53:33 - INFO - __main__ - Step 125396: {'lr': 3.3330209972108976e-05, 'samples': 24076032, 'steps': 125395, 'loss/train': 1.209660291671753} 11/07/2021 14:53:33 - INFO - __main__ - Step 125397: {'lr': 3.3327562666877034e-05, 'samples': 24076224, 'steps': 125396, 'loss/train': 1.2402524948120117} 11/07/2021 14:53:34 - INFO - __main__ - Step 125398: {'lr': 3.3324915459274353e-05, 'samples': 24076416, 'steps': 125397, 'loss/train': 1.1360880136489868} 11/07/2021 14:53:35 - INFO - __main__ - Step 125399: {'lr': 3.332226834930211e-05, 'samples': 24076608, 'steps': 125398, 'loss/train': 1.4812380075454712} 11/07/2021 14:53:35 - INFO - __main__ - Step 125400: {'lr': 3.331962133696151e-05, 'samples': 24076800, 'steps': 125399, 'loss/train': 1.2573814392089844} 11/07/2021 14:53:36 - INFO - __main__ - Step 125401: {'lr': 3.331697442225376e-05, 'samples': 24076992, 'steps': 125400, 'loss/train': 1.3179394006729126} 11/07/2021 14:53:36 - INFO - __main__ - Step 125402: {'lr': 3.331432760518005e-05, 'samples': 24077184, 'steps': 125401, 'loss/train': 1.2186825275421143} 11/07/2021 14:53:36 - INFO - __main__ - Step 125403: {'lr': 3.331168088574152e-05, 'samples': 24077376, 'steps': 125402, 'loss/train': 5.666399955749512} 11/07/2021 14:53:37 - INFO - __main__ - Step 125404: {'lr': 3.33090342639395e-05, 'samples': 24077568, 'steps': 125403, 'loss/train': 1.182259440422058} 11/07/2021 14:53:38 - INFO - __main__ - Step 125405: {'lr': 3.330638773977501e-05, 'samples': 24077760, 'steps': 125404, 'loss/train': 1.445381999015808} 11/07/2021 14:53:38 - INFO - __main__ - Step 125406: {'lr': 3.3303741313249314e-05, 'samples': 24077952, 'steps': 125405, 'loss/train': 2.736642599105835} 11/07/2021 14:53:38 - INFO - __main__ - Step 125407: {'lr': 3.330109498436362e-05, 'samples': 24078144, 'steps': 125406, 'loss/train': 1.5787063837051392} 11/07/2021 14:53:39 - INFO - __main__ - Step 125408: {'lr': 3.32984487531191e-05, 'samples': 24078336, 'steps': 125407, 'loss/train': 1.2458690404891968} 11/07/2021 14:53:39 - INFO - __main__ - Step 125409: {'lr': 3.329580261951695e-05, 'samples': 24078528, 'steps': 125408, 'loss/train': 1.7179385423660278} 11/07/2021 14:53:40 - INFO - __main__ - Step 125410: {'lr': 3.329315658355839e-05, 'samples': 24078720, 'steps': 125409, 'loss/train': 1.0875455141067505} 11/07/2021 14:53:41 - INFO - __main__ - Step 125411: {'lr': 3.329051064524455e-05, 'samples': 24078912, 'steps': 125410, 'loss/train': 1.4643831253051758} 11/07/2021 14:53:41 - INFO - __main__ - Step 125412: {'lr': 3.3287864804576686e-05, 'samples': 24079104, 'steps': 125411, 'loss/train': 1.3080048561096191} 11/07/2021 14:53:41 - INFO - __main__ - Step 125413: {'lr': 3.328521906155599e-05, 'samples': 24079296, 'steps': 125412, 'loss/train': 1.3169363737106323} 11/07/2021 14:53:42 - INFO - __main__ - Step 125414: {'lr': 3.32825734161836e-05, 'samples': 24079488, 'steps': 125413, 'loss/train': 1.7661457061767578} 11/07/2021 14:53:43 - INFO - __main__ - Step 125415: {'lr': 3.327992786846073e-05, 'samples': 24079680, 'steps': 125414, 'loss/train': 1.5914908647537231} 11/07/2021 14:53:43 - INFO - __main__ - Step 125416: {'lr': 3.3277282418388595e-05, 'samples': 24079872, 'steps': 125415, 'loss/train': 1.3936316967010498} 11/07/2021 14:53:43 - INFO - __main__ - Step 125417: {'lr': 3.3274637065968365e-05, 'samples': 24080064, 'steps': 125416, 'loss/train': 1.3068021535873413} 11/07/2021 14:53:44 - INFO - __main__ - Step 125418: {'lr': 3.3271991811201303e-05, 'samples': 24080256, 'steps': 125417, 'loss/train': 1.3473235368728638} 11/07/2021 14:53:44 - INFO - __main__ - Step 125419: {'lr': 3.326934665408848e-05, 'samples': 24080448, 'steps': 125418, 'loss/train': 1.6607609987258911} 11/07/2021 14:53:45 - INFO - __main__ - Step 125420: {'lr': 3.326670159463116e-05, 'samples': 24080640, 'steps': 125419, 'loss/train': 1.3781471252441406} 11/07/2021 14:53:45 - INFO - __main__ - Step 125421: {'lr': 3.32640566328305e-05, 'samples': 24080832, 'steps': 125420, 'loss/train': 1.5539019107818604} 11/07/2021 14:53:46 - INFO - __main__ - Step 125422: {'lr': 3.3261411768687715e-05, 'samples': 24081024, 'steps': 125421, 'loss/train': 1.4505313634872437} 11/07/2021 14:53:46 - INFO - __main__ - Step 125423: {'lr': 3.325876700220401e-05, 'samples': 24081216, 'steps': 125422, 'loss/train': 1.3160748481750488} 11/07/2021 14:53:46 - INFO - __main__ - Step 125424: {'lr': 3.325612233338054e-05, 'samples': 24081408, 'steps': 125423, 'loss/train': 0.94954913854599} 11/07/2021 14:53:47 - INFO - __main__ - Step 125425: {'lr': 3.3253477762218515e-05, 'samples': 24081600, 'steps': 125424, 'loss/train': 1.0470259189605713} 11/07/2021 14:53:48 - INFO - __main__ - Step 125426: {'lr': 3.325083328871914e-05, 'samples': 24081792, 'steps': 125425, 'loss/train': 1.1439871788024902} 11/07/2021 14:53:48 - INFO - __main__ - Step 125427: {'lr': 3.3248188912883584e-05, 'samples': 24081984, 'steps': 125426, 'loss/train': 1.0581655502319336} 11/07/2021 14:53:48 - INFO - __main__ - Step 125428: {'lr': 3.3245544634713045e-05, 'samples': 24082176, 'steps': 125427, 'loss/train': 1.770446538925171} 11/07/2021 14:53:49 - INFO - __main__ - Step 125429: {'lr': 3.3242900454208746e-05, 'samples': 24082368, 'steps': 125428, 'loss/train': 1.3589298725128174} 11/07/2021 14:53:50 - INFO - __main__ - Step 125430: {'lr': 3.3240256371371845e-05, 'samples': 24082560, 'steps': 125429, 'loss/train': 0.8752182126045227} 11/07/2021 14:53:50 - INFO - __main__ - Step 125431: {'lr': 3.323761238620357e-05, 'samples': 24082752, 'steps': 125430, 'loss/train': 1.2688219547271729} 11/07/2021 14:53:51 - INFO - __main__ - Step 125432: {'lr': 3.3234968498705056e-05, 'samples': 24082944, 'steps': 125431, 'loss/train': 0.9502446055412292} 11/07/2021 14:53:51 - INFO - __main__ - Step 125433: {'lr': 3.323232470887749e-05, 'samples': 24083136, 'steps': 125432, 'loss/train': 1.2285666465759277} 11/07/2021 14:53:51 - INFO - __main__ - Step 125434: {'lr': 3.322968101672211e-05, 'samples': 24083328, 'steps': 125433, 'loss/train': 0.40368223190307617} 11/07/2021 14:53:52 - INFO - __main__ - Step 125435: {'lr': 3.322703742224009e-05, 'samples': 24083520, 'steps': 125434, 'loss/train': 1.1971999406814575} 11/07/2021 14:53:53 - INFO - __main__ - Step 125436: {'lr': 3.322439392543264e-05, 'samples': 24083712, 'steps': 125435, 'loss/train': 1.7125377655029297} 11/07/2021 14:53:53 - INFO - __main__ - Step 125437: {'lr': 3.322175052630092e-05, 'samples': 24083904, 'steps': 125436, 'loss/train': 1.0783816576004028} 11/07/2021 14:53:53 - INFO - __main__ - Step 125438: {'lr': 3.321910722484611e-05, 'samples': 24084096, 'steps': 125437, 'loss/train': 0.930374264717102} 11/07/2021 14:53:54 - INFO - __main__ - Step 125439: {'lr': 3.3216464021069454e-05, 'samples': 24084288, 'steps': 125438, 'loss/train': 1.518755316734314} 11/07/2021 14:53:54 - INFO - __main__ - Step 125440: {'lr': 3.32138209149721e-05, 'samples': 24084480, 'steps': 125439, 'loss/train': 1.3826379776000977} 11/07/2021 14:53:55 - INFO - __main__ - Step 125441: {'lr': 3.3211177906555255e-05, 'samples': 24084672, 'steps': 125440, 'loss/train': 0.9067808985710144} 11/07/2021 14:53:56 - INFO - __main__ - Step 125442: {'lr': 3.3208534995820104e-05, 'samples': 24084864, 'steps': 125441, 'loss/train': 1.5162023305892944} 11/07/2021 14:53:56 - INFO - __main__ - Step 125443: {'lr': 3.320589218276784e-05, 'samples': 24085056, 'steps': 125442, 'loss/train': 0.9989749789237976} 11/07/2021 14:53:56 - INFO - __main__ - Step 125444: {'lr': 3.320324946739975e-05, 'samples': 24085248, 'steps': 125443, 'loss/train': 0.9565309286117554} 11/07/2021 14:53:57 - INFO - __main__ - Step 125445: {'lr': 3.320060684971682e-05, 'samples': 24085440, 'steps': 125444, 'loss/train': 0.9200481176376343} 11/07/2021 14:53:58 - INFO - __main__ - Step 125446: {'lr': 3.3197964329720386e-05, 'samples': 24085632, 'steps': 125445, 'loss/train': 1.3703495264053345} 11/07/2021 14:53:58 - INFO - __main__ - Step 125447: {'lr': 3.319532190741159e-05, 'samples': 24085824, 'steps': 125446, 'loss/train': 1.931037425994873} 11/07/2021 14:53:58 - INFO - __main__ - Step 125448: {'lr': 3.319267958279165e-05, 'samples': 24086016, 'steps': 125447, 'loss/train': 1.2280257940292358} 11/07/2021 14:53:59 - INFO - __main__ - Step 125449: {'lr': 3.3190037355861737e-05, 'samples': 24086208, 'steps': 125448, 'loss/train': 1.2235645055770874} 11/07/2021 14:53:59 - INFO - __main__ - Step 125450: {'lr': 3.3187395226623035e-05, 'samples': 24086400, 'steps': 125449, 'loss/train': 1.180591106414795} 11/07/2021 14:54:00 - INFO - __main__ - Step 125451: {'lr': 3.318475319507674e-05, 'samples': 24086592, 'steps': 125450, 'loss/train': 1.6006426811218262} 11/07/2021 14:54:00 - INFO - __main__ - Step 125452: {'lr': 3.3182111261224086e-05, 'samples': 24086784, 'steps': 125451, 'loss/train': 0.6825656890869141} 11/07/2021 14:54:01 - INFO - __main__ - Step 125453: {'lr': 3.3179469425066197e-05, 'samples': 24086976, 'steps': 125452, 'loss/train': 1.350663423538208} 11/07/2021 14:54:01 - INFO - __main__ - Step 125454: {'lr': 3.3176827686604295e-05, 'samples': 24087168, 'steps': 125453, 'loss/train': 1.7484095096588135} 11/07/2021 14:54:02 - INFO - __main__ - Step 125455: {'lr': 3.3174186045839634e-05, 'samples': 24087360, 'steps': 125454, 'loss/train': 1.327993631362915} 11/07/2021 14:54:03 - INFO - __main__ - Step 125456: {'lr': 3.317154450277329e-05, 'samples': 24087552, 'steps': 125455, 'loss/train': 0.3119719326496124} 11/07/2021 14:54:03 - INFO - __main__ - Step 125457: {'lr': 3.3168903057406494e-05, 'samples': 24087744, 'steps': 125456, 'loss/train': 0.8290284276008606} 11/07/2021 14:54:03 - INFO - __main__ - Step 125458: {'lr': 3.3166261709740434e-05, 'samples': 24087936, 'steps': 125457, 'loss/train': 1.1409797668457031} 11/07/2021 14:54:04 - INFO - __main__ - Step 125459: {'lr': 3.316362045977633e-05, 'samples': 24088128, 'steps': 125458, 'loss/train': 1.2959040403366089} 11/07/2021 14:54:04 - INFO - __main__ - Step 125460: {'lr': 3.316097930751535e-05, 'samples': 24088320, 'steps': 125459, 'loss/train': 0.15488453209400177} 11/07/2021 14:54:04 - INFO - __main__ - Step 125461: {'lr': 3.3158338252958666e-05, 'samples': 24088512, 'steps': 125460, 'loss/train': 1.3786801099777222} 11/07/2021 14:54:06 - INFO - __main__ - Step 125462: {'lr': 3.315569729610751e-05, 'samples': 24088704, 'steps': 125461, 'loss/train': 1.0962374210357666} 11/07/2021 14:54:06 - INFO - __main__ - Step 125463: {'lr': 3.315305643696304e-05, 'samples': 24088896, 'steps': 125462, 'loss/train': 1.0393130779266357} 11/07/2021 14:54:06 - INFO - __main__ - Step 125464: {'lr': 3.3150415675526456e-05, 'samples': 24089088, 'steps': 125463, 'loss/train': 1.7573107481002808} 11/07/2021 14:54:07 - INFO - __main__ - Step 125465: {'lr': 3.314777501179897e-05, 'samples': 24089280, 'steps': 125464, 'loss/train': 0.6934310793876648} 11/07/2021 14:54:07 - INFO - __main__ - Step 125466: {'lr': 3.3145134445781767e-05, 'samples': 24089472, 'steps': 125465, 'loss/train': 1.5106898546218872} 11/07/2021 14:54:08 - INFO - __main__ - Step 125467: {'lr': 3.3142493977475985e-05, 'samples': 24089664, 'steps': 125466, 'loss/train': 0.8608350157737732} 11/07/2021 14:54:08 - INFO - __main__ - Step 125468: {'lr': 3.3139853606882846e-05, 'samples': 24089856, 'steps': 125467, 'loss/train': 1.0981829166412354} 11/07/2021 14:54:09 - INFO - __main__ - Step 125469: {'lr': 3.3137213334003544e-05, 'samples': 24090048, 'steps': 125468, 'loss/train': 0.9133472442626953} 11/07/2021 14:54:09 - INFO - __main__ - Step 125470: {'lr': 3.3134573158839276e-05, 'samples': 24090240, 'steps': 125469, 'loss/train': 0.7789233326911926} 11/07/2021 14:54:09 - INFO - __main__ - Step 125471: {'lr': 3.313193308139123e-05, 'samples': 24090432, 'steps': 125470, 'loss/train': 1.3643349409103394} 11/07/2021 14:54:10 - INFO - __main__ - Step 125472: {'lr': 3.312929310166057e-05, 'samples': 24090624, 'steps': 125471, 'loss/train': 2.0404555797576904} 11/07/2021 14:54:11 - INFO - __main__ - Step 125473: {'lr': 3.31266532196485e-05, 'samples': 24090816, 'steps': 125472, 'loss/train': 1.177242636680603} 11/07/2021 14:54:11 - INFO - __main__ - Step 125474: {'lr': 3.312401343535623e-05, 'samples': 24091008, 'steps': 125473, 'loss/train': 1.0764304399490356} 11/07/2021 14:54:12 - INFO - __main__ - Step 125475: {'lr': 3.3121373748784936e-05, 'samples': 24091200, 'steps': 125474, 'loss/train': 0.13985495269298553} 11/07/2021 14:54:12 - INFO - __main__ - Step 125476: {'lr': 3.3118734159935824e-05, 'samples': 24091392, 'steps': 125475, 'loss/train': 1.393154501914978} 11/07/2021 14:54:13 - INFO - __main__ - Step 125477: {'lr': 3.311609466881005e-05, 'samples': 24091584, 'steps': 125476, 'loss/train': 0.6994310617446899} 11/07/2021 14:54:13 - INFO - __main__ - Step 125478: {'lr': 3.3113455275408796e-05, 'samples': 24091776, 'steps': 125477, 'loss/train': 1.7056626081466675} 11/07/2021 14:54:14 - INFO - __main__ - Step 125479: {'lr': 3.3110815979733285e-05, 'samples': 24091968, 'steps': 125478, 'loss/train': 1.331545114517212} 11/07/2021 14:54:14 - INFO - __main__ - Step 125480: {'lr': 3.310817678178468e-05, 'samples': 24092160, 'steps': 125479, 'loss/train': 1.670133352279663} 11/07/2021 14:54:14 - INFO - __main__ - Step 125481: {'lr': 3.3105537681564185e-05, 'samples': 24092352, 'steps': 125480, 'loss/train': 0.7415451407432556} 11/07/2021 14:54:15 - INFO - __main__ - Step 125482: {'lr': 3.310289867907299e-05, 'samples': 24092544, 'steps': 125481, 'loss/train': 1.2553155422210693} 11/07/2021 14:54:16 - INFO - __main__ - Step 125483: {'lr': 3.3100259774312275e-05, 'samples': 24092736, 'steps': 125482, 'loss/train': 1.5111992359161377} 11/07/2021 14:54:16 - INFO - __main__ - Step 125484: {'lr': 3.309762096728325e-05, 'samples': 24092928, 'steps': 125483, 'loss/train': 1.1536040306091309} 11/07/2021 14:54:16 - INFO - __main__ - Step 125485: {'lr': 3.30949822579871e-05, 'samples': 24093120, 'steps': 125484, 'loss/train': 1.803595781326294} 11/07/2021 14:54:17 - INFO - __main__ - Step 125486: {'lr': 3.309234364642496e-05, 'samples': 24093312, 'steps': 125485, 'loss/train': 0.7647872567176819} 11/07/2021 14:54:18 - INFO - __main__ - Step 125487: {'lr': 3.308970513259815e-05, 'samples': 24093504, 'steps': 125486, 'loss/train': 0.9463027715682983} 11/07/2021 14:54:18 - INFO - __main__ - Step 125488: {'lr': 3.308706671650774e-05, 'samples': 24093696, 'steps': 125487, 'loss/train': 1.240871787071228} 11/07/2021 14:54:19 - INFO - __main__ - Step 125489: {'lr': 3.3084428398154926e-05, 'samples': 24093888, 'steps': 125488, 'loss/train': 1.4561702013015747} 11/07/2021 14:54:19 - INFO - __main__ - Step 125490: {'lr': 3.3081790177540896e-05, 'samples': 24094080, 'steps': 125489, 'loss/train': 1.3320164680480957} 11/07/2021 14:54:19 - INFO - __main__ - Step 125491: {'lr': 3.3079152054666885e-05, 'samples': 24094272, 'steps': 125490, 'loss/train': 0.937847375869751} 11/07/2021 14:54:20 - INFO - __main__ - Step 125492: {'lr': 3.307651402953407e-05, 'samples': 24094464, 'steps': 125491, 'loss/train': 1.2196595668792725} 11/07/2021 14:54:21 - INFO - __main__ - Step 125493: {'lr': 3.307387610214363e-05, 'samples': 24094656, 'steps': 125492, 'loss/train': 1.2068235874176025} 11/07/2021 14:54:21 - INFO - __main__ - Step 125494: {'lr': 3.3071238272496754e-05, 'samples': 24094848, 'steps': 125493, 'loss/train': 1.1163139343261719} 11/07/2021 14:54:21 - INFO - __main__ - Step 125495: {'lr': 3.306860054059463e-05, 'samples': 24095040, 'steps': 125494, 'loss/train': 1.4883652925491333} 11/07/2021 14:54:22 - INFO - __main__ - Step 125496: {'lr': 3.306596290643843e-05, 'samples': 24095232, 'steps': 125495, 'loss/train': 1.428388237953186} 11/07/2021 14:54:22 - INFO - __main__ - Step 125497: {'lr': 3.306332537002937e-05, 'samples': 24095424, 'steps': 125496, 'loss/train': 0.9024084806442261} 11/07/2021 14:54:23 - INFO - __main__ - Step 125498: {'lr': 3.306068793136868e-05, 'samples': 24095616, 'steps': 125497, 'loss/train': 1.2856311798095703} 11/07/2021 14:54:23 - INFO - __main__ - Step 125499: {'lr': 3.3058050590457436e-05, 'samples': 24095808, 'steps': 125498, 'loss/train': 0.1639234721660614} 11/07/2021 14:54:24 - INFO - __main__ - Step 125500: {'lr': 3.305541334729692e-05, 'samples': 24096000, 'steps': 125499, 'loss/train': 1.558167815208435} 11/07/2021 14:54:24 - INFO - __main__ - Step 125501: {'lr': 3.305277620188826e-05, 'samples': 24096192, 'steps': 125500, 'loss/train': 1.4992412328720093} 11/07/2021 14:54:24 - INFO - __main__ - Step 125502: {'lr': 3.305013915423266e-05, 'samples': 24096384, 'steps': 125501, 'loss/train': 1.5562047958374023} 11/07/2021 14:54:26 - INFO - __main__ - Step 125503: {'lr': 3.3047502204331334e-05, 'samples': 24096576, 'steps': 125502, 'loss/train': 1.2499581575393677} 11/07/2021 14:54:26 - INFO - __main__ - Step 125504: {'lr': 3.3044865352185454e-05, 'samples': 24096768, 'steps': 125503, 'loss/train': 0.06144466623663902} 11/07/2021 14:54:26 - INFO - __main__ - Step 125505: {'lr': 3.304222859779621e-05, 'samples': 24096960, 'steps': 125504, 'loss/train': 1.273284912109375} 11/07/2021 14:54:27 - INFO - __main__ - Step 125506: {'lr': 3.303959194116479e-05, 'samples': 24097152, 'steps': 125505, 'loss/train': 1.0378928184509277} 11/07/2021 14:54:27 - INFO - __main__ - Step 125507: {'lr': 3.30369553822924e-05, 'samples': 24097344, 'steps': 125506, 'loss/train': 1.5432740449905396} 11/07/2021 14:54:28 - INFO - __main__ - Step 125508: {'lr': 3.30343189211802e-05, 'samples': 24097536, 'steps': 125507, 'loss/train': 1.2714170217514038} 11/07/2021 14:54:28 - INFO - __main__ - Step 125509: {'lr': 3.303168255782937e-05, 'samples': 24097728, 'steps': 125508, 'loss/train': 1.4062721729278564} 11/07/2021 14:54:29 - INFO - __main__ - Step 125510: {'lr': 3.302904629224113e-05, 'samples': 24097920, 'steps': 125509, 'loss/train': 1.310033917427063} 11/07/2021 14:54:29 - INFO - __main__ - Step 125511: {'lr': 3.302641012441665e-05, 'samples': 24098112, 'steps': 125510, 'loss/train': 1.304111361503601} 11/07/2021 14:54:29 - INFO - __main__ - Step 125512: {'lr': 3.302377405435716e-05, 'samples': 24098304, 'steps': 125511, 'loss/train': 1.4498952627182007} 11/07/2021 14:54:31 - INFO - __main__ - Step 125513: {'lr': 3.3021138082063776e-05, 'samples': 24098496, 'steps': 125512, 'loss/train': 1.7624591588974} 11/07/2021 14:54:31 - INFO - __main__ - Step 125514: {'lr': 3.301850220753772e-05, 'samples': 24098688, 'steps': 125513, 'loss/train': 1.3549989461898804} 11/07/2021 14:54:31 - INFO - __main__ - Step 125515: {'lr': 3.3015866430780164e-05, 'samples': 24098880, 'steps': 125514, 'loss/train': 1.3275388479232788} 11/07/2021 14:54:32 - INFO - __main__ - Step 125516: {'lr': 3.3013230751792325e-05, 'samples': 24099072, 'steps': 125515, 'loss/train': 1.434426188468933} 11/07/2021 14:54:32 - INFO - __main__ - Step 125517: {'lr': 3.301059517057539e-05, 'samples': 24099264, 'steps': 125516, 'loss/train': 1.2328718900680542} 11/07/2021 14:54:33 - INFO - __main__ - Step 125518: {'lr': 3.3007959687130495e-05, 'samples': 24099456, 'steps': 125517, 'loss/train': 0.0714627280831337} 11/07/2021 14:54:34 - INFO - __main__ - Step 125519: {'lr': 3.300532430145889e-05, 'samples': 24099648, 'steps': 125518, 'loss/train': 1.6176800727844238} 11/07/2021 14:54:34 - INFO - __main__ - Step 125520: {'lr': 3.3002689013561734e-05, 'samples': 24099840, 'steps': 125519, 'loss/train': 1.4267909526824951} 11/07/2021 14:54:34 - INFO - __main__ - Step 125521: {'lr': 3.300005382344021e-05, 'samples': 24100032, 'steps': 125520, 'loss/train': 1.0062583684921265} 11/07/2021 14:54:35 - INFO - __main__ - Step 125522: {'lr': 3.299741873109552e-05, 'samples': 24100224, 'steps': 125521, 'loss/train': 1.1272122859954834} 11/07/2021 14:54:36 - INFO - __main__ - Step 125523: {'lr': 3.299478373652884e-05, 'samples': 24100416, 'steps': 125522, 'loss/train': 1.4333808422088623} 11/07/2021 14:54:36 - INFO - __main__ - Step 125524: {'lr': 3.299214883974136e-05, 'samples': 24100608, 'steps': 125523, 'loss/train': 1.1769086122512817} 11/07/2021 14:54:36 - INFO - __main__ - Step 125525: {'lr': 3.298951404073433e-05, 'samples': 24100800, 'steps': 125524, 'loss/train': 1.4330018758773804} 11/07/2021 14:54:37 - INFO - __main__ - Step 125526: {'lr': 3.2986879339508807e-05, 'samples': 24100992, 'steps': 125525, 'loss/train': 1.177546501159668} 11/07/2021 14:54:37 - INFO - __main__ - Step 125527: {'lr': 3.298424473606606e-05, 'samples': 24101184, 'steps': 125526, 'loss/train': 0.29344412684440613} 11/07/2021 14:54:37 - INFO - __main__ - Step 125528: {'lr': 3.298161023040727e-05, 'samples': 24101376, 'steps': 125527, 'loss/train': 1.6594380140304565} 11/07/2021 14:54:39 - INFO - __main__ - Step 125529: {'lr': 3.297897582253362e-05, 'samples': 24101568, 'steps': 125528, 'loss/train': 0.6146705150604248} 11/07/2021 14:54:39 - INFO - __main__ - Step 125530: {'lr': 3.297634151244627e-05, 'samples': 24101760, 'steps': 125529, 'loss/train': 0.4078051447868347} 11/07/2021 14:54:39 - INFO - __main__ - Step 125531: {'lr': 3.2973707300146455e-05, 'samples': 24101952, 'steps': 125530, 'loss/train': 1.1685717105865479} 11/07/2021 14:54:40 - INFO - __main__ - Step 125532: {'lr': 3.2971073185635334e-05, 'samples': 24102144, 'steps': 125531, 'loss/train': 1.9828543663024902} 11/07/2021 14:54:40 - INFO - __main__ - Step 125533: {'lr': 3.296843916891409e-05, 'samples': 24102336, 'steps': 125532, 'loss/train': 1.3438079357147217} 11/07/2021 14:54:41 - INFO - __main__ - Step 125534: {'lr': 3.2965805249983935e-05, 'samples': 24102528, 'steps': 125533, 'loss/train': 1.1895390748977661} 11/07/2021 14:54:41 - INFO - __main__ - Step 125535: {'lr': 3.296317142884603e-05, 'samples': 24102720, 'steps': 125534, 'loss/train': 1.0864177942276} 11/07/2021 14:54:42 - INFO - __main__ - Step 125536: {'lr': 3.296053770550156e-05, 'samples': 24102912, 'steps': 125535, 'loss/train': 1.6240465641021729} 11/07/2021 14:54:42 - INFO - __main__ - Step 125537: {'lr': 3.295790407995172e-05, 'samples': 24103104, 'steps': 125536, 'loss/train': 1.6315479278564453} 11/07/2021 14:54:42 - INFO - __main__ - Step 125538: {'lr': 3.2955270552197716e-05, 'samples': 24103296, 'steps': 125537, 'loss/train': 1.3761792182922363} 11/07/2021 14:54:43 - INFO - __main__ - Step 125539: {'lr': 3.295263712224078e-05, 'samples': 24103488, 'steps': 125538, 'loss/train': 0.824447512626648} 11/07/2021 14:54:44 - INFO - __main__ - Step 125540: {'lr': 3.295000379008198e-05, 'samples': 24103680, 'steps': 125539, 'loss/train': 1.4615718126296997} 11/07/2021 14:54:44 - INFO - __main__ - Step 125541: {'lr': 3.294737055572253e-05, 'samples': 24103872, 'steps': 125540, 'loss/train': 0.8816465139389038} 11/07/2021 14:54:44 - INFO - __main__ - Step 125542: {'lr': 3.294473741916368e-05, 'samples': 24104064, 'steps': 125541, 'loss/train': 1.0836735963821411} 11/07/2021 14:54:45 - INFO - __main__ - Step 125543: {'lr': 3.294210438040654e-05, 'samples': 24104256, 'steps': 125542, 'loss/train': 1.2105026245117188} 11/07/2021 14:54:46 - INFO - __main__ - Step 125544: {'lr': 3.293947143945236e-05, 'samples': 24104448, 'steps': 125543, 'loss/train': 0.3673022985458374} 11/07/2021 14:54:46 - INFO - __main__ - Step 125545: {'lr': 3.293683859630231e-05, 'samples': 24104640, 'steps': 125544, 'loss/train': 1.3213176727294922} 11/07/2021 14:54:47 - INFO - __main__ - Step 125546: {'lr': 3.293420585095758e-05, 'samples': 24104832, 'steps': 125545, 'loss/train': 1.098681926727295} 11/07/2021 14:54:47 - INFO - __main__ - Step 125547: {'lr': 3.293157320341933e-05, 'samples': 24105024, 'steps': 125546, 'loss/train': 1.185814619064331} 11/07/2021 14:54:47 - INFO - __main__ - Step 125548: {'lr': 3.2928940653688785e-05, 'samples': 24105216, 'steps': 125547, 'loss/train': 1.6489026546478271} 11/07/2021 14:54:48 - INFO - __main__ - Step 125549: {'lr': 3.292630820176709e-05, 'samples': 24105408, 'steps': 125548, 'loss/train': 1.2925983667373657} 11/07/2021 14:54:49 - INFO - __main__ - Step 125550: {'lr': 3.292367584765546e-05, 'samples': 24105600, 'steps': 125549, 'loss/train': 1.3444970846176147} 11/07/2021 14:54:49 - INFO - __main__ - Step 125551: {'lr': 3.292104359135506e-05, 'samples': 24105792, 'steps': 125550, 'loss/train': 1.225922703742981} 11/07/2021 14:54:49 - INFO - __main__ - Step 125552: {'lr': 3.2918411432867165e-05, 'samples': 24105984, 'steps': 125551, 'loss/train': 1.4222315549850464} 11/07/2021 14:54:50 - INFO - __main__ - Step 125553: {'lr': 3.291577937219281e-05, 'samples': 24106176, 'steps': 125552, 'loss/train': 1.2971324920654297} 11/07/2021 14:54:50 - INFO - __main__ - Step 125554: {'lr': 3.291314740933326e-05, 'samples': 24106368, 'steps': 125553, 'loss/train': 1.0071443319320679} 11/07/2021 14:54:51 - INFO - __main__ - Step 125555: {'lr': 3.29105155442897e-05, 'samples': 24106560, 'steps': 125554, 'loss/train': 0.7962578535079956} 11/07/2021 14:54:51 - INFO - __main__ - Step 125556: {'lr': 3.29078837770633e-05, 'samples': 24106752, 'steps': 125555, 'loss/train': 1.3502259254455566} 11/07/2021 14:54:52 - INFO - __main__ - Step 125557: {'lr': 3.290525210765527e-05, 'samples': 24106944, 'steps': 125556, 'loss/train': 0.8230602741241455} 11/07/2021 14:54:52 - INFO - __main__ - Step 125558: {'lr': 3.290262053606677e-05, 'samples': 24107136, 'steps': 125557, 'loss/train': 1.2225676774978638} 11/07/2021 14:54:52 - INFO - __main__ - Step 125559: {'lr': 3.289998906229902e-05, 'samples': 24107328, 'steps': 125558, 'loss/train': 1.341295838356018} 11/07/2021 14:54:53 - INFO - __main__ - Step 125560: {'lr': 3.289735768635316e-05, 'samples': 24107520, 'steps': 125559, 'loss/train': 1.1969239711761475} 11/07/2021 14:54:54 - INFO - __main__ - Step 125561: {'lr': 3.289472640823041e-05, 'samples': 24107712, 'steps': 125560, 'loss/train': 1.1672242879867554} 11/07/2021 14:54:54 - INFO - __main__ - Step 125562: {'lr': 3.289209522793196e-05, 'samples': 24107904, 'steps': 125561, 'loss/train': 1.0069646835327148} 11/07/2021 14:54:55 - INFO - __main__ - Step 125563: {'lr': 3.288946414545896e-05, 'samples': 24108096, 'steps': 125562, 'loss/train': 1.275742530822754} 11/07/2021 14:54:55 - INFO - __main__ - Step 125564: {'lr': 3.288683316081264e-05, 'samples': 24108288, 'steps': 125563, 'loss/train': 1.4223495721817017} 11/07/2021 14:54:56 - INFO - __main__ - Step 125565: {'lr': 3.2884202273994137e-05, 'samples': 24108480, 'steps': 125564, 'loss/train': 1.2022234201431274} 11/07/2021 14:54:56 - INFO - __main__ - Step 125566: {'lr': 3.288157148500473e-05, 'samples': 24108672, 'steps': 125565, 'loss/train': 0.8685049414634705} 11/07/2021 14:54:57 - INFO - __main__ - Step 125567: {'lr': 3.287894079384548e-05, 'samples': 24108864, 'steps': 125566, 'loss/train': 1.5550132989883423} 11/07/2021 14:54:57 - INFO - __main__ - Step 125568: {'lr': 3.287631020051765e-05, 'samples': 24109056, 'steps': 125567, 'loss/train': 1.397229790687561} 11/07/2021 14:54:57 - INFO - __main__ - Step 125569: {'lr': 3.287367970502239e-05, 'samples': 24109248, 'steps': 125568, 'loss/train': 1.4105079174041748} 11/07/2021 14:54:58 - INFO - __main__ - Step 125570: {'lr': 3.287104930736087e-05, 'samples': 24109440, 'steps': 125569, 'loss/train': 1.4588252305984497} 11/07/2021 14:54:59 - INFO - __main__ - Step 125571: {'lr': 3.286841900753434e-05, 'samples': 24109632, 'steps': 125570, 'loss/train': 1.227649450302124} 11/07/2021 14:54:59 - INFO - __main__ - Step 125572: {'lr': 3.2865788805543946e-05, 'samples': 24109824, 'steps': 125571, 'loss/train': 0.646054208278656} 11/07/2021 14:54:59 - INFO - __main__ - Step 125573: {'lr': 3.286315870139087e-05, 'samples': 24110016, 'steps': 125572, 'loss/train': 1.748223900794983} 11/07/2021 14:55:00 - INFO - __main__ - Step 125574: {'lr': 3.2860528695076305e-05, 'samples': 24110208, 'steps': 125573, 'loss/train': 1.7402889728546143} 11/07/2021 14:55:00 - INFO - __main__ - Step 125575: {'lr': 3.2857898786601446e-05, 'samples': 24110400, 'steps': 125574, 'loss/train': 0.5751084089279175} 11/07/2021 14:55:01 - INFO - __main__ - Step 125576: {'lr': 3.285526897596744e-05, 'samples': 24110592, 'steps': 125575, 'loss/train': 1.216086506843567} 11/07/2021 14:55:01 - INFO - __main__ - Step 125577: {'lr': 3.2852639263175527e-05, 'samples': 24110784, 'steps': 125576, 'loss/train': 1.3220664262771606} 11/07/2021 14:55:02 - INFO - __main__ - Step 125578: {'lr': 3.285000964822685e-05, 'samples': 24110976, 'steps': 125577, 'loss/train': 1.0837410688400269} 11/07/2021 14:55:02 - INFO - __main__ - Step 125579: {'lr': 3.284738013112265e-05, 'samples': 24111168, 'steps': 125578, 'loss/train': 1.2071183919906616} 11/07/2021 14:55:03 - INFO - __main__ - Step 125580: {'lr': 3.2844750711864044e-05, 'samples': 24111360, 'steps': 125579, 'loss/train': 1.1486313343048096} 11/07/2021 14:55:04 - INFO - __main__ - Step 125581: {'lr': 3.284212139045223e-05, 'samples': 24111552, 'steps': 125580, 'loss/train': 1.1508049964904785} 11/07/2021 14:55:04 - INFO - __main__ - Step 125582: {'lr': 3.283949216688839e-05, 'samples': 24111744, 'steps': 125581, 'loss/train': 1.4040899276733398} 11/07/2021 14:55:04 - INFO - __main__ - Step 125583: {'lr': 3.283686304117375e-05, 'samples': 24111936, 'steps': 125582, 'loss/train': 1.2121777534484863} 11/07/2021 14:55:05 - INFO - __main__ - Step 125584: {'lr': 3.2834234013309454e-05, 'samples': 24112128, 'steps': 125583, 'loss/train': 1.2574892044067383} 11/07/2021 14:55:05 - INFO - __main__ - Step 125585: {'lr': 3.283160508329669e-05, 'samples': 24112320, 'steps': 125584, 'loss/train': 1.2010587453842163} 11/07/2021 14:55:06 - INFO - __main__ - Step 125586: {'lr': 3.282897625113668e-05, 'samples': 24112512, 'steps': 125585, 'loss/train': 1.1401492357254028} 11/07/2021 14:55:06 - INFO - __main__ - Step 125587: {'lr': 3.282634751683056e-05, 'samples': 24112704, 'steps': 125586, 'loss/train': 1.5892993211746216} 11/07/2021 14:55:07 - INFO - __main__ - Step 125588: {'lr': 3.282371888037955e-05, 'samples': 24112896, 'steps': 125587, 'loss/train': 1.3180804252624512} 11/07/2021 14:55:07 - INFO - __main__ - Step 125589: {'lr': 3.2821090341784824e-05, 'samples': 24113088, 'steps': 125588, 'loss/train': 0.42646437883377075} 11/07/2021 14:55:07 - INFO - __main__ - Step 125590: {'lr': 3.281846190104754e-05, 'samples': 24113280, 'steps': 125589, 'loss/train': 1.0030324459075928} 11/07/2021 14:55:09 - INFO - __main__ - Step 125591: {'lr': 3.281583355816892e-05, 'samples': 24113472, 'steps': 125590, 'loss/train': 1.406352162361145} 11/07/2021 14:55:09 - INFO - __main__ - Step 125592: {'lr': 3.281320531315013e-05, 'samples': 24113664, 'steps': 125591, 'loss/train': 1.3298214673995972} 11/07/2021 14:55:09 - INFO - __main__ - Step 125593: {'lr': 3.281057716599242e-05, 'samples': 24113856, 'steps': 125592, 'loss/train': 1.4998698234558105} 11/07/2021 14:55:10 - INFO - __main__ - Step 125594: {'lr': 3.2807949116696876e-05, 'samples': 24114048, 'steps': 125593, 'loss/train': 1.4607011079788208} 11/07/2021 14:55:10 - INFO - __main__ - Step 125595: {'lr': 3.280532116526469e-05, 'samples': 24114240, 'steps': 125594, 'loss/train': 1.3852994441986084} 11/07/2021 14:55:11 - INFO - __main__ - Step 125596: {'lr': 3.280269331169708e-05, 'samples': 24114432, 'steps': 125595, 'loss/train': 1.2804452180862427} 11/07/2021 14:55:11 - INFO - __main__ - Step 125597: {'lr': 3.280006555599524e-05, 'samples': 24114624, 'steps': 125596, 'loss/train': 1.4210249185562134} 11/07/2021 14:55:12 - INFO - __main__ - Step 125598: {'lr': 3.279743789816031e-05, 'samples': 24114816, 'steps': 125597, 'loss/train': 1.2320072650909424} 11/07/2021 14:55:12 - INFO - __main__ - Step 125599: {'lr': 3.279481033819354e-05, 'samples': 24115008, 'steps': 125598, 'loss/train': 1.1597386598587036} 11/07/2021 14:55:12 - INFO - __main__ - Step 125600: {'lr': 3.2792182876096035e-05, 'samples': 24115200, 'steps': 125599, 'loss/train': 2.732959032058716} 11/07/2021 14:55:13 - INFO - __main__ - Step 125601: {'lr': 3.278955551186904e-05, 'samples': 24115392, 'steps': 125600, 'loss/train': 1.22111976146698} 11/07/2021 14:55:14 - INFO - __main__ - Step 125602: {'lr': 3.278692824551374e-05, 'samples': 24115584, 'steps': 125601, 'loss/train': 1.2327367067337036} 11/07/2021 14:55:14 - INFO - __main__ - Step 125603: {'lr': 3.2784301077031284e-05, 'samples': 24115776, 'steps': 125602, 'loss/train': 0.7055980563163757} 11/07/2021 14:55:14 - INFO - __main__ - Step 125604: {'lr': 3.278167400642285e-05, 'samples': 24115968, 'steps': 125603, 'loss/train': 1.2485146522521973} 11/07/2021 14:55:15 - INFO - __main__ - Step 125605: {'lr': 3.2779047033689666e-05, 'samples': 24116160, 'steps': 125604, 'loss/train': 1.4734643697738647} 11/07/2021 14:55:15 - INFO - __main__ - Step 125606: {'lr': 3.277642015883295e-05, 'samples': 24116352, 'steps': 125605, 'loss/train': 1.1082185506820679} 11/07/2021 14:55:16 - INFO - __main__ - Step 125607: {'lr': 3.277379338185374e-05, 'samples': 24116544, 'steps': 125606, 'loss/train': 0.8956972360610962} 11/07/2021 14:55:17 - INFO - __main__ - Step 125608: {'lr': 3.277116670275335e-05, 'samples': 24116736, 'steps': 125607, 'loss/train': 1.1355326175689697} 11/07/2021 14:55:17 - INFO - __main__ - Step 125609: {'lr': 3.276854012153288e-05, 'samples': 24116928, 'steps': 125608, 'loss/train': 1.3702808618545532} 11/07/2021 14:55:17 - INFO - __main__ - Step 125610: {'lr': 3.276591363819356e-05, 'samples': 24117120, 'steps': 125609, 'loss/train': 1.6064813137054443} 11/07/2021 14:55:18 - INFO - __main__ - Step 125611: {'lr': 3.276328725273658e-05, 'samples': 24117312, 'steps': 125610, 'loss/train': 1.0194733142852783} 11/07/2021 14:55:19 - INFO - __main__ - Step 125612: {'lr': 3.276066096516311e-05, 'samples': 24117504, 'steps': 125611, 'loss/train': 2.965338945388794} 11/07/2021 14:55:19 - INFO - __main__ - Step 125613: {'lr': 3.2758034775474344e-05, 'samples': 24117696, 'steps': 125612, 'loss/train': 1.0997072458267212} 11/07/2021 14:55:20 - INFO - __main__ - Step 125614: {'lr': 3.275540868367144e-05, 'samples': 24117888, 'steps': 125613, 'loss/train': 1.3999043703079224} 11/07/2021 14:55:20 - INFO - __main__ - Step 125615: {'lr': 3.275278268975559e-05, 'samples': 24118080, 'steps': 125614, 'loss/train': 1.546338677406311} 11/07/2021 14:55:20 - INFO - __main__ - Step 125616: {'lr': 3.2750156793728e-05, 'samples': 24118272, 'steps': 125615, 'loss/train': 0.9647374749183655} 11/07/2021 14:55:21 - INFO - __main__ - Step 125617: {'lr': 3.274753099558983e-05, 'samples': 24118464, 'steps': 125616, 'loss/train': 1.0799448490142822} 11/07/2021 14:55:22 - INFO - __main__ - Step 125618: {'lr': 3.2744905295342295e-05, 'samples': 24118656, 'steps': 125617, 'loss/train': 0.8421158194541931} 11/07/2021 14:55:22 - INFO - __main__ - Step 125619: {'lr': 3.274227969298657e-05, 'samples': 24118848, 'steps': 125618, 'loss/train': 0.5550857782363892} 11/07/2021 14:55:23 - INFO - __main__ - Step 125620: {'lr': 3.2739654188523786e-05, 'samples': 24119040, 'steps': 125619, 'loss/train': 1.0615652799606323} 11/07/2021 14:55:23 - INFO - __main__ - Step 125621: {'lr': 3.273702878195517e-05, 'samples': 24119232, 'steps': 125620, 'loss/train': 1.521372675895691} 11/07/2021 14:55:23 - INFO - __main__ - Step 125622: {'lr': 3.273440347328188e-05, 'samples': 24119424, 'steps': 125621, 'loss/train': 1.223109245300293} 11/07/2021 14:55:25 - INFO - __main__ - Step 125623: {'lr': 3.2731778262505116e-05, 'samples': 24119616, 'steps': 125622, 'loss/train': 0.558910608291626} 11/07/2021 14:55:25 - INFO - __main__ - Step 125624: {'lr': 3.272915314962604e-05, 'samples': 24119808, 'steps': 125623, 'loss/train': 1.6625182628631592} 11/07/2021 14:55:25 - INFO - __main__ - Step 125625: {'lr': 3.2726528134645884e-05, 'samples': 24120000, 'steps': 125624, 'loss/train': 1.2257920503616333} 11/07/2021 14:55:26 - INFO - __main__ - Step 125626: {'lr': 3.27239032175658e-05, 'samples': 24120192, 'steps': 125625, 'loss/train': 1.754591464996338} 11/07/2021 14:55:26 - INFO - __main__ - Step 125627: {'lr': 3.272127839838696e-05, 'samples': 24120384, 'steps': 125626, 'loss/train': 1.5136363506317139} 11/07/2021 14:55:27 - INFO - __main__ - Step 125628: {'lr': 3.2718653677110576e-05, 'samples': 24120576, 'steps': 125627, 'loss/train': 1.240005612373352} 11/07/2021 14:55:27 - INFO - __main__ - Step 125629: {'lr': 3.27160290537378e-05, 'samples': 24120768, 'steps': 125628, 'loss/train': 1.3086799383163452} 11/07/2021 14:55:28 - INFO - __main__ - Step 125630: {'lr': 3.2713404528269846e-05, 'samples': 24120960, 'steps': 125629, 'loss/train': 1.4971303939819336} 11/07/2021 14:55:28 - INFO - __main__ - Step 125631: {'lr': 3.271078010070786e-05, 'samples': 24121152, 'steps': 125630, 'loss/train': 1.7199410200119019} 11/07/2021 14:55:29 - INFO - __main__ - Step 125632: {'lr': 3.2708155771053045e-05, 'samples': 24121344, 'steps': 125631, 'loss/train': 1.2679370641708374} 11/07/2021 14:55:29 - INFO - __main__ - Step 125633: {'lr': 3.2705531539306635e-05, 'samples': 24121536, 'steps': 125632, 'loss/train': 0.3977450430393219} 11/07/2021 14:55:30 - INFO - __main__ - Step 125634: {'lr': 3.270290740546972e-05, 'samples': 24121728, 'steps': 125633, 'loss/train': 1.4007123708724976} 11/07/2021 14:55:30 - INFO - __main__ - Step 125635: {'lr': 3.27002833695435e-05, 'samples': 24121920, 'steps': 125634, 'loss/train': 5.636014938354492} 11/07/2021 14:55:31 - INFO - __main__ - Step 125636: {'lr': 3.2697659431529193e-05, 'samples': 24122112, 'steps': 125635, 'loss/train': 1.2949284315109253} 11/07/2021 14:55:31 - INFO - __main__ - Step 125637: {'lr': 3.2695035591427976e-05, 'samples': 24122304, 'steps': 125636, 'loss/train': 1.5335389375686646} 11/07/2021 14:55:31 - INFO - __main__ - Step 125638: {'lr': 3.2692411849241015e-05, 'samples': 24122496, 'steps': 125637, 'loss/train': 1.5992769002914429} 11/07/2021 14:55:32 - INFO - __main__ - Step 125639: {'lr': 3.2689788204969485e-05, 'samples': 24122688, 'steps': 125638, 'loss/train': 1.2579336166381836} 11/07/2021 14:55:33 - INFO - __main__ - Step 125640: {'lr': 3.26871646586146e-05, 'samples': 24122880, 'steps': 125639, 'loss/train': 1.348460078239441} 11/07/2021 14:55:33 - INFO - __main__ - Step 125641: {'lr': 3.2684541210177525e-05, 'samples': 24123072, 'steps': 125640, 'loss/train': 1.1508915424346924} 11/07/2021 14:55:34 - INFO - __main__ - Step 125642: {'lr': 3.268191785965943e-05, 'samples': 24123264, 'steps': 125641, 'loss/train': 1.4830219745635986} 11/07/2021 14:55:34 - INFO - __main__ - Step 125643: {'lr': 3.267929460706154e-05, 'samples': 24123456, 'steps': 125642, 'loss/train': 1.4529205560684204} 11/07/2021 14:55:35 - INFO - __main__ - Step 125644: {'lr': 3.267667145238498e-05, 'samples': 24123648, 'steps': 125643, 'loss/train': 1.4024447202682495} 11/07/2021 14:55:35 - INFO - __main__ - Step 125645: {'lr': 3.2674048395630954e-05, 'samples': 24123840, 'steps': 125644, 'loss/train': 1.7572695016860962} 11/07/2021 14:55:36 - INFO - __main__ - Step 125646: {'lr': 3.267142543680071e-05, 'samples': 24124032, 'steps': 125645, 'loss/train': 1.339340090751648} 11/07/2021 14:55:36 - INFO - __main__ - Step 125647: {'lr': 3.266880257589533e-05, 'samples': 24124224, 'steps': 125646, 'loss/train': 1.4172335863113403} 11/07/2021 14:55:36 - INFO - __main__ - Step 125648: {'lr': 3.2666179812916e-05, 'samples': 24124416, 'steps': 125647, 'loss/train': 0.9220601320266724} 11/07/2021 14:55:37 - INFO - __main__ - Step 125649: {'lr': 3.266355714786395e-05, 'samples': 24124608, 'steps': 125648, 'loss/train': 0.8483999967575073} 11/07/2021 14:55:38 - INFO - __main__ - Step 125650: {'lr': 3.266093458074035e-05, 'samples': 24124800, 'steps': 125649, 'loss/train': 1.348673701286316} 11/07/2021 14:55:38 - INFO - __main__ - Step 125651: {'lr': 3.265831211154638e-05, 'samples': 24124992, 'steps': 125650, 'loss/train': 1.3909400701522827} 11/07/2021 14:55:38 - INFO - __main__ - Step 125652: {'lr': 3.265568974028324e-05, 'samples': 24125184, 'steps': 125651, 'loss/train': 1.187692642211914} 11/07/2021 14:55:39 - INFO - __main__ - Step 125653: {'lr': 3.265306746695207e-05, 'samples': 24125376, 'steps': 125652, 'loss/train': 0.6641001105308533} 11/07/2021 14:55:39 - INFO - __main__ - Step 125654: {'lr': 3.2650445291554085e-05, 'samples': 24125568, 'steps': 125653, 'loss/train': 1.2108964920043945} 11/07/2021 14:55:40 - INFO - __main__ - Step 125655: {'lr': 3.2647823214090436e-05, 'samples': 24125760, 'steps': 125654, 'loss/train': 1.030134916305542} 11/07/2021 14:55:41 - INFO - __main__ - Step 125656: {'lr': 3.264520123456233e-05, 'samples': 24125952, 'steps': 125655, 'loss/train': 1.238325595855713} 11/07/2021 14:55:41 - INFO - __main__ - Step 125657: {'lr': 3.264257935297096e-05, 'samples': 24126144, 'steps': 125656, 'loss/train': 1.0909333229064941} 11/07/2021 14:55:41 - INFO - __main__ - Step 125658: {'lr': 3.263995756931748e-05, 'samples': 24126336, 'steps': 125657, 'loss/train': 1.3008614778518677} 11/07/2021 14:55:42 - INFO - __main__ - Step 125659: {'lr': 3.263733588360307e-05, 'samples': 24126528, 'steps': 125658, 'loss/train': 1.4722065925598145} 11/07/2021 14:55:43 - INFO - __main__ - Step 125660: {'lr': 3.263471429582898e-05, 'samples': 24126720, 'steps': 125659, 'loss/train': 0.18750432133674622} 11/07/2021 14:55:43 - INFO - __main__ - Step 125661: {'lr': 3.26320928059963e-05, 'samples': 24126912, 'steps': 125660, 'loss/train': 1.3070073127746582} 11/07/2021 14:55:44 - INFO - __main__ - Step 125662: {'lr': 3.2629471414106246e-05, 'samples': 24127104, 'steps': 125661, 'loss/train': 0.9632875919342041} 11/07/2021 14:55:44 - INFO - __main__ - Step 125663: {'lr': 3.262685012015998e-05, 'samples': 24127296, 'steps': 125662, 'loss/train': 1.4748395681381226} 11/07/2021 14:55:44 - INFO - __main__ - Step 125664: {'lr': 3.26242289241587e-05, 'samples': 24127488, 'steps': 125663, 'loss/train': 0.7949128746986389} 11/07/2021 14:55:45 - INFO - __main__ - Step 125665: {'lr': 3.2621607826103575e-05, 'samples': 24127680, 'steps': 125664, 'loss/train': 1.60536527633667} 11/07/2021 14:55:46 - INFO - __main__ - Step 125666: {'lr': 3.2618986825995816e-05, 'samples': 24127872, 'steps': 125665, 'loss/train': 1.2396999597549438} 11/07/2021 14:55:46 - INFO - __main__ - Step 125667: {'lr': 3.26163659238366e-05, 'samples': 24128064, 'steps': 125666, 'loss/train': 0.8519799113273621} 11/07/2021 14:55:46 - INFO - __main__ - Step 125668: {'lr': 3.2613745119627085e-05, 'samples': 24128256, 'steps': 125667, 'loss/train': 1.227903127670288} 11/07/2021 14:55:47 - INFO - __main__ - Step 125669: {'lr': 3.2611124413368445e-05, 'samples': 24128448, 'steps': 125668, 'loss/train': 1.501509428024292} 11/07/2021 14:55:48 - INFO - __main__ - Step 125670: {'lr': 3.260850380506189e-05, 'samples': 24128640, 'steps': 125669, 'loss/train': 0.190740704536438} 11/07/2021 14:55:48 - INFO - __main__ - Step 125671: {'lr': 3.260588329470859e-05, 'samples': 24128832, 'steps': 125670, 'loss/train': 1.6610231399536133} 11/07/2021 14:55:48 - INFO - __main__ - Step 125672: {'lr': 3.260326288230972e-05, 'samples': 24129024, 'steps': 125671, 'loss/train': 1.5805641412734985} 11/07/2021 14:55:49 - INFO - __main__ - Step 125673: {'lr': 3.2600642567866543e-05, 'samples': 24129216, 'steps': 125672, 'loss/train': 2.4753479957580566} 11/07/2021 14:55:49 - INFO - __main__ - Step 125674: {'lr': 3.259802235138007e-05, 'samples': 24129408, 'steps': 125673, 'loss/train': 0.6889846324920654} 11/07/2021 14:55:50 - INFO - __main__ - Step 125675: {'lr': 3.2595402232851595e-05, 'samples': 24129600, 'steps': 125674, 'loss/train': 1.2505732774734497} 11/07/2021 14:55:50 - INFO - __main__ - Step 125676: {'lr': 3.259278221228229e-05, 'samples': 24129792, 'steps': 125675, 'loss/train': 1.3008214235305786} 11/07/2021 14:55:51 - INFO - __main__ - Step 125677: {'lr': 3.259016228967329e-05, 'samples': 24129984, 'steps': 125676, 'loss/train': 1.4692661762237549} 11/07/2021 14:55:51 - INFO - __main__ - Step 125678: {'lr': 3.258754246502582e-05, 'samples': 24130176, 'steps': 125677, 'loss/train': 0.9361345767974854} 11/07/2021 14:55:51 - INFO - __main__ - Step 125679: {'lr': 3.258492273834107e-05, 'samples': 24130368, 'steps': 125678, 'loss/train': 1.0927484035491943} 11/07/2021 14:55:52 - INFO - __main__ - Step 125680: {'lr': 3.258230310962018e-05, 'samples': 24130560, 'steps': 125679, 'loss/train': 1.0656979084014893} 11/07/2021 14:55:53 - INFO - __main__ - Step 125681: {'lr': 3.2579683578864345e-05, 'samples': 24130752, 'steps': 125680, 'loss/train': 1.2769535779953003} 11/07/2021 14:55:53 - INFO - __main__ - Step 125682: {'lr': 3.2577064146074754e-05, 'samples': 24130944, 'steps': 125681, 'loss/train': 1.1541236639022827} 11/07/2021 14:55:54 - INFO - __main__ - Step 125683: {'lr': 3.25744448112526e-05, 'samples': 24131136, 'steps': 125682, 'loss/train': 1.572736144065857} 11/07/2021 14:55:54 - INFO - __main__ - Step 125684: {'lr': 3.257182557439903e-05, 'samples': 24131328, 'steps': 125683, 'loss/train': 1.439976692199707} 11/07/2021 14:55:55 - INFO - __main__ - Step 125685: {'lr': 3.256920643551523e-05, 'samples': 24131520, 'steps': 125684, 'loss/train': 1.2094870805740356} 11/07/2021 14:55:55 - INFO - __main__ - Step 125686: {'lr': 3.256658739460241e-05, 'samples': 24131712, 'steps': 125685, 'loss/train': 1.3508821725845337} 11/07/2021 14:55:56 - INFO - __main__ - Step 125687: {'lr': 3.256396845166176e-05, 'samples': 24131904, 'steps': 125686, 'loss/train': 1.3361985683441162} 11/07/2021 14:55:56 - INFO - __main__ - Step 125688: {'lr': 3.2561349606694406e-05, 'samples': 24132096, 'steps': 125687, 'loss/train': 0.8785132765769958} 11/07/2021 14:55:56 - INFO - __main__ - Step 125689: {'lr': 3.2558730859701544e-05, 'samples': 24132288, 'steps': 125688, 'loss/train': 1.0681307315826416} 11/07/2021 14:55:57 - INFO - __main__ - Step 125690: {'lr': 3.255611221068436e-05, 'samples': 24132480, 'steps': 125689, 'loss/train': 1.5088731050491333} 11/07/2021 14:55:58 - INFO - __main__ - Step 125691: {'lr': 3.2553493659644025e-05, 'samples': 24132672, 'steps': 125690, 'loss/train': 1.2142599821090698} 11/07/2021 14:55:58 - INFO - __main__ - Step 125692: {'lr': 3.255087520658173e-05, 'samples': 24132864, 'steps': 125691, 'loss/train': 0.7390680313110352} 11/07/2021 14:55:58 - INFO - __main__ - Step 125693: {'lr': 3.2548256851498675e-05, 'samples': 24133056, 'steps': 125692, 'loss/train': 1.3412532806396484} 11/07/2021 14:55:59 - INFO - __main__ - Step 125694: {'lr': 3.254563859439602e-05, 'samples': 24133248, 'steps': 125693, 'loss/train': 1.7290356159210205} 11/07/2021 14:55:59 - INFO - __main__ - Step 125695: {'lr': 3.2543020435274936e-05, 'samples': 24133440, 'steps': 125694, 'loss/train': 1.3133673667907715} 11/07/2021 14:56:00 - INFO - __main__ - Step 125696: {'lr': 3.2540402374136604e-05, 'samples': 24133632, 'steps': 125695, 'loss/train': 1.5269603729248047} 11/07/2021 14:56:00 - INFO - __main__ - Step 125697: {'lr': 3.253778441098221e-05, 'samples': 24133824, 'steps': 125696, 'loss/train': 1.3530939817428589} 11/07/2021 14:56:01 - INFO - __main__ - Step 125698: {'lr': 3.2535166545812954e-05, 'samples': 24134016, 'steps': 125697, 'loss/train': 1.130832314491272} 11/07/2021 14:56:01 - INFO - __main__ - Step 125699: {'lr': 3.253254877862996e-05, 'samples': 24134208, 'steps': 125698, 'loss/train': 1.632692813873291} 11/07/2021 14:56:01 - INFO - __main__ - Step 125700: {'lr': 3.252993110943453e-05, 'samples': 24134400, 'steps': 125699, 'loss/train': 1.1362863779067993} 11/07/2021 14:56:03 - INFO - __main__ - Step 125701: {'lr': 3.2527313538227684e-05, 'samples': 24134592, 'steps': 125700, 'loss/train': 0.3434278666973114} 11/07/2021 14:56:03 - INFO - __main__ - Step 125702: {'lr': 3.252469606501071e-05, 'samples': 24134784, 'steps': 125701, 'loss/train': 1.8258737325668335} 11/07/2021 14:56:03 - INFO - __main__ - Step 125703: {'lr': 3.2522078689784714e-05, 'samples': 24134976, 'steps': 125702, 'loss/train': 1.1196212768554688} 11/07/2021 14:56:04 - INFO - __main__ - Step 125704: {'lr': 3.251946141255094e-05, 'samples': 24135168, 'steps': 125703, 'loss/train': 1.2671382427215576} 11/07/2021 14:56:04 - INFO - __main__ - Step 125705: {'lr': 3.251684423331053e-05, 'samples': 24135360, 'steps': 125704, 'loss/train': 1.3544317483901978} 11/07/2021 14:56:05 - INFO - __main__ - Step 125706: {'lr': 3.2514227152064676e-05, 'samples': 24135552, 'steps': 125705, 'loss/train': 0.9962705969810486} 11/07/2021 14:56:05 - INFO - __main__ - Step 125707: {'lr': 3.2511610168814543e-05, 'samples': 24135744, 'steps': 125706, 'loss/train': 1.2113643884658813} 11/07/2021 14:56:06 - INFO - __main__ - Step 125708: {'lr': 3.2508993283561326e-05, 'samples': 24135936, 'steps': 125707, 'loss/train': 1.5153154134750366} 11/07/2021 14:56:06 - INFO - __main__ - Step 125709: {'lr': 3.2506376496306194e-05, 'samples': 24136128, 'steps': 125708, 'loss/train': 1.4744900465011597} 11/07/2021 14:56:07 - INFO - __main__ - Step 125710: {'lr': 3.250375980705036e-05, 'samples': 24136320, 'steps': 125709, 'loss/train': 1.3889975547790527} 11/07/2021 14:56:07 - INFO - __main__ - Step 125711: {'lr': 3.250114321579495e-05, 'samples': 24136512, 'steps': 125710, 'loss/train': 1.187231183052063} 11/07/2021 14:56:08 - INFO - __main__ - Step 125712: {'lr': 3.249852672254119e-05, 'samples': 24136704, 'steps': 125711, 'loss/train': 1.3023992776870728} 11/07/2021 14:56:08 - INFO - __main__ - Step 125713: {'lr': 3.249591032729027e-05, 'samples': 24136896, 'steps': 125712, 'loss/train': 1.1268534660339355} 11/07/2021 14:56:09 - INFO - __main__ - Step 125714: {'lr': 3.249329403004331e-05, 'samples': 24137088, 'steps': 125713, 'loss/train': 0.2809812128543854} 11/07/2021 14:56:09 - INFO - __main__ - Step 125715: {'lr': 3.249067783080148e-05, 'samples': 24137280, 'steps': 125714, 'loss/train': 0.9688840508460999} 11/07/2021 14:56:10 - INFO - __main__ - Step 125716: {'lr': 3.2488061729566e-05, 'samples': 24137472, 'steps': 125715, 'loss/train': 1.2587623596191406} 11/07/2021 14:56:10 - INFO - __main__ - Step 125717: {'lr': 3.248544572633807e-05, 'samples': 24137664, 'steps': 125716, 'loss/train': 1.1703845262527466} 11/07/2021 14:56:11 - INFO - __main__ - Step 125718: {'lr': 3.2482829821118834e-05, 'samples': 24137856, 'steps': 125717, 'loss/train': 1.4448109865188599} 11/07/2021 14:56:11 - INFO - __main__ - Step 125719: {'lr': 3.2480214013909466e-05, 'samples': 24138048, 'steps': 125718, 'loss/train': 1.0490576028823853} 11/07/2021 14:56:11 - INFO - __main__ - Step 125720: {'lr': 3.247759830471117e-05, 'samples': 24138240, 'steps': 125719, 'loss/train': 1.3641935586929321} 11/07/2021 14:56:13 - INFO - __main__ - Step 125721: {'lr': 3.2474982693525086e-05, 'samples': 24138432, 'steps': 125720, 'loss/train': 1.368215799331665} 11/07/2021 14:56:13 - INFO - __main__ - Step 125722: {'lr': 3.247236718035243e-05, 'samples': 24138624, 'steps': 125721, 'loss/train': 1.1989343166351318} 11/07/2021 14:56:13 - INFO - __main__ - Step 125723: {'lr': 3.246975176519446e-05, 'samples': 24138816, 'steps': 125722, 'loss/train': 1.0772134065628052} 11/07/2021 14:56:14 - INFO - __main__ - Step 125724: {'lr': 3.2467136448052157e-05, 'samples': 24139008, 'steps': 125723, 'loss/train': 1.0932390689849854} 11/07/2021 14:56:14 - INFO - __main__ - Step 125725: {'lr': 3.246452122892682e-05, 'samples': 24139200, 'steps': 125724, 'loss/train': 1.2410857677459717} 11/07/2021 14:56:14 - INFO - __main__ - Step 125726: {'lr': 3.246190610781963e-05, 'samples': 24139392, 'steps': 125725, 'loss/train': 1.1645698547363281} 11/07/2021 14:56:15 - INFO - __main__ - Step 125727: {'lr': 3.2459291084731726e-05, 'samples': 24139584, 'steps': 125726, 'loss/train': 1.017416000366211} 11/07/2021 14:56:16 - INFO - __main__ - Step 125728: {'lr': 3.2456676159664326e-05, 'samples': 24139776, 'steps': 125727, 'loss/train': 1.0639057159423828} 11/07/2021 14:56:16 - INFO - __main__ - Step 125729: {'lr': 3.245406133261858e-05, 'samples': 24139968, 'steps': 125728, 'loss/train': 1.010526180267334} 11/07/2021 14:56:16 - INFO - __main__ - Step 125730: {'lr': 3.24514466035957e-05, 'samples': 24140160, 'steps': 125729, 'loss/train': 0.7922136783599854} 11/07/2021 14:56:17 - INFO - __main__ - Step 125731: {'lr': 3.244883197259682e-05, 'samples': 24140352, 'steps': 125730, 'loss/train': 1.362777829170227} 11/07/2021 14:56:18 - INFO - __main__ - Step 125732: {'lr': 3.2446217439623145e-05, 'samples': 24140544, 'steps': 125731, 'loss/train': 1.4912217855453491} 11/07/2021 14:56:18 - INFO - __main__ - Step 125733: {'lr': 3.244360300467583e-05, 'samples': 24140736, 'steps': 125732, 'loss/train': 1.1721948385238647} 11/07/2021 14:56:19 - INFO - __main__ - Step 125734: {'lr': 3.244098866775613e-05, 'samples': 24140928, 'steps': 125733, 'loss/train': 0.9861537218093872} 11/07/2021 14:56:19 - INFO - __main__ - Step 125735: {'lr': 3.243837442886513e-05, 'samples': 24141120, 'steps': 125734, 'loss/train': 0.6805912852287292} 11/07/2021 14:56:19 - INFO - __main__ - Step 125736: {'lr': 3.2435760288004046e-05, 'samples': 24141312, 'steps': 125735, 'loss/train': 0.8881900310516357} 11/07/2021 14:56:20 - INFO - __main__ - Step 125737: {'lr': 3.243314624517402e-05, 'samples': 24141504, 'steps': 125736, 'loss/train': 1.2749605178833008} 11/07/2021 14:56:21 - INFO - __main__ - Step 125738: {'lr': 3.243053230037629e-05, 'samples': 24141696, 'steps': 125737, 'loss/train': 1.287515640258789} 11/07/2021 14:56:21 - INFO - __main__ - Step 125739: {'lr': 3.242791845361198e-05, 'samples': 24141888, 'steps': 125738, 'loss/train': 1.6278382539749146} 11/07/2021 14:56:21 - INFO - __main__ - Step 125740: {'lr': 3.2425304704882306e-05, 'samples': 24142080, 'steps': 125739, 'loss/train': 1.3745659589767456} 11/07/2021 14:56:22 - INFO - __main__ - Step 125741: {'lr': 3.242269105418844e-05, 'samples': 24142272, 'steps': 125740, 'loss/train': 1.2694159746170044} 11/07/2021 14:56:23 - INFO - __main__ - Step 125742: {'lr': 3.242007750153153e-05, 'samples': 24142464, 'steps': 125741, 'loss/train': 1.4279046058654785} 11/07/2021 14:56:23 - INFO - __main__ - Step 125743: {'lr': 3.2417464046912785e-05, 'samples': 24142656, 'steps': 125742, 'loss/train': 1.1776353120803833} 11/07/2021 14:56:23 - INFO - __main__ - Step 125744: {'lr': 3.241485069033337e-05, 'samples': 24142848, 'steps': 125743, 'loss/train': 0.21663545072078705} 11/07/2021 14:56:24 - INFO - __main__ - Step 125745: {'lr': 3.241223743179453e-05, 'samples': 24143040, 'steps': 125744, 'loss/train': 1.6773070096969604} 11/07/2021 14:56:24 - INFO - __main__ - Step 125746: {'lr': 3.2409624271297316e-05, 'samples': 24143232, 'steps': 125745, 'loss/train': 1.2944743633270264} 11/07/2021 14:56:25 - INFO - __main__ - Step 125747: {'lr': 3.2407011208842987e-05, 'samples': 24143424, 'steps': 125746, 'loss/train': 1.3120685815811157} 11/07/2021 14:56:26 - INFO - __main__ - Step 125748: {'lr': 3.240439824443267e-05, 'samples': 24143616, 'steps': 125747, 'loss/train': 1.6983400583267212} 11/07/2021 14:56:26 - INFO - __main__ - Step 125749: {'lr': 3.24017853780676e-05, 'samples': 24143808, 'steps': 125748, 'loss/train': 0.9306241869926453} 11/07/2021 14:56:26 - INFO - __main__ - Step 125750: {'lr': 3.239917260974892e-05, 'samples': 24144000, 'steps': 125749, 'loss/train': 1.4110107421875} 11/07/2021 14:56:27 - INFO - __main__ - Step 125751: {'lr': 3.239655993947779e-05, 'samples': 24144192, 'steps': 125750, 'loss/train': 0.7323570251464844} 11/07/2021 14:56:28 - INFO - __main__ - Step 125752: {'lr': 3.239394736725546e-05, 'samples': 24144384, 'steps': 125751, 'loss/train': 1.35401451587677} 11/07/2021 14:56:28 - INFO - __main__ - Step 125753: {'lr': 3.239133489308302e-05, 'samples': 24144576, 'steps': 125752, 'loss/train': 1.2767786979675293} 11/07/2021 14:56:28 - INFO - __main__ - Step 125754: {'lr': 3.238872251696171e-05, 'samples': 24144768, 'steps': 125753, 'loss/train': 1.379646897315979} 11/07/2021 14:56:29 - INFO - __main__ - Step 125755: {'lr': 3.238611023889265e-05, 'samples': 24144960, 'steps': 125754, 'loss/train': 1.1207796335220337} 11/07/2021 14:56:29 - INFO - __main__ - Step 125756: {'lr': 3.238349805887714e-05, 'samples': 24145152, 'steps': 125755, 'loss/train': 0.8630984425544739} 11/07/2021 14:56:29 - INFO - __main__ - Step 125757: {'lr': 3.238088597691621e-05, 'samples': 24145344, 'steps': 125756, 'loss/train': 0.8816668391227722} 11/07/2021 14:56:30 - INFO - __main__ - Step 125758: {'lr': 3.2378273993011074e-05, 'samples': 24145536, 'steps': 125757, 'loss/train': 0.5967018604278564} 11/07/2021 14:56:31 - INFO - __main__ - Step 125759: {'lr': 3.237566210716295e-05, 'samples': 24145728, 'steps': 125758, 'loss/train': 1.5802178382873535} 11/07/2021 14:56:31 - INFO - __main__ - Step 125760: {'lr': 3.237305031937299e-05, 'samples': 24145920, 'steps': 125759, 'loss/train': 1.38740873336792} 11/07/2021 14:56:32 - INFO - __main__ - Step 125761: {'lr': 3.237043862964237e-05, 'samples': 24146112, 'steps': 125760, 'loss/train': 1.3881816864013672} 11/07/2021 14:56:32 - INFO - __main__ - Step 125762: {'lr': 3.236782703797228e-05, 'samples': 24146304, 'steps': 125761, 'loss/train': 1.5942026376724243} 11/07/2021 14:56:33 - INFO - __main__ - Step 125763: {'lr': 3.2365215544363864e-05, 'samples': 24146496, 'steps': 125762, 'loss/train': 1.2618883848190308} 11/07/2021 14:56:33 - INFO - __main__ - Step 125764: {'lr': 3.236260414881836e-05, 'samples': 24146688, 'steps': 125763, 'loss/train': 1.2791107892990112} 11/07/2021 14:56:34 - INFO - __main__ - Step 125765: {'lr': 3.23599928513369e-05, 'samples': 24146880, 'steps': 125764, 'loss/train': 0.8622357845306396} 11/07/2021 14:56:34 - INFO - __main__ - Step 125766: {'lr': 3.235738165192065e-05, 'samples': 24147072, 'steps': 125765, 'loss/train': 1.2425273656845093} 11/07/2021 14:56:34 - INFO - __main__ - Step 125767: {'lr': 3.2354770550570876e-05, 'samples': 24147264, 'steps': 125766, 'loss/train': 1.3742021322250366} 11/07/2021 14:56:35 - INFO - __main__ - Step 125768: {'lr': 3.235215954728862e-05, 'samples': 24147456, 'steps': 125767, 'loss/train': 1.1690353155136108} 11/07/2021 14:56:36 - INFO - __main__ - Step 125769: {'lr': 3.234954864207512e-05, 'samples': 24147648, 'steps': 125768, 'loss/train': 1.7561489343643188} 11/07/2021 14:56:36 - INFO - __main__ - Step 125770: {'lr': 3.234693783493156e-05, 'samples': 24147840, 'steps': 125769, 'loss/train': 1.3438644409179688} 11/07/2021 14:56:36 - INFO - __main__ - Step 125771: {'lr': 3.234432712585911e-05, 'samples': 24148032, 'steps': 125770, 'loss/train': 1.2067610025405884} 11/07/2021 14:56:37 - INFO - __main__ - Step 125772: {'lr': 3.2341716514858954e-05, 'samples': 24148224, 'steps': 125771, 'loss/train': 0.9249855279922485} 11/07/2021 14:56:38 - INFO - __main__ - Step 125773: {'lr': 3.233910600193224e-05, 'samples': 24148416, 'steps': 125772, 'loss/train': 1.418448805809021} 11/07/2021 14:56:38 - INFO - __main__ - Step 125774: {'lr': 3.2336495587080187e-05, 'samples': 24148608, 'steps': 125773, 'loss/train': 1.0414420366287231} 11/07/2021 14:56:39 - INFO - __main__ - Step 125775: {'lr': 3.233388527030395e-05, 'samples': 24148800, 'steps': 125774, 'loss/train': 1.5199106931686401} 11/07/2021 14:56:39 - INFO - __main__ - Step 125776: {'lr': 3.2331275051604715e-05, 'samples': 24148992, 'steps': 125775, 'loss/train': 1.2451649904251099} 11/07/2021 14:56:39 - INFO - __main__ - Step 125777: {'lr': 3.232866493098363e-05, 'samples': 24149184, 'steps': 125776, 'loss/train': 1.3642299175262451} 11/07/2021 14:56:40 - INFO - __main__ - Step 125778: {'lr': 3.23260549084419e-05, 'samples': 24149376, 'steps': 125777, 'loss/train': 1.4536569118499756} 11/07/2021 14:56:41 - INFO - __main__ - Step 125779: {'lr': 3.232344498398068e-05, 'samples': 24149568, 'steps': 125778, 'loss/train': 1.649601936340332} 11/07/2021 14:56:41 - INFO - __main__ - Step 125780: {'lr': 3.232083515760117e-05, 'samples': 24149760, 'steps': 125779, 'loss/train': 1.4620639085769653} 11/07/2021 14:56:41 - INFO - __main__ - Step 125781: {'lr': 3.231822542930457e-05, 'samples': 24149952, 'steps': 125780, 'loss/train': 1.3645952939987183} 11/07/2021 14:56:42 - INFO - __main__ - Step 125782: {'lr': 3.231561579909198e-05, 'samples': 24150144, 'steps': 125781, 'loss/train': 1.1573209762573242} 11/07/2021 14:56:43 - INFO - __main__ - Step 125783: {'lr': 3.2313006266964595e-05, 'samples': 24150336, 'steps': 125782, 'loss/train': 1.5211726427078247} 11/07/2021 14:56:43 - INFO - __main__ - Step 125784: {'lr': 3.231039683292364e-05, 'samples': 24150528, 'steps': 125783, 'loss/train': 1.6374895572662354} 11/07/2021 14:56:43 - INFO - __main__ - Step 125785: {'lr': 3.230778749697025e-05, 'samples': 24150720, 'steps': 125784, 'loss/train': 1.1418771743774414} 11/07/2021 14:56:44 - INFO - __main__ - Step 125786: {'lr': 3.2305178259105586e-05, 'samples': 24150912, 'steps': 125785, 'loss/train': 1.3702031373977661} 11/07/2021 14:56:44 - INFO - __main__ - Step 125787: {'lr': 3.230256911933088e-05, 'samples': 24151104, 'steps': 125786, 'loss/train': 1.3102290630340576} 11/07/2021 14:56:45 - INFO - __main__ - Step 125788: {'lr': 3.229996007764727e-05, 'samples': 24151296, 'steps': 125787, 'loss/train': 1.033579707145691} 11/07/2021 14:56:46 - INFO - __main__ - Step 125789: {'lr': 3.229735113405593e-05, 'samples': 24151488, 'steps': 125788, 'loss/train': 1.244084119796753} 11/07/2021 14:56:46 - INFO - __main__ - Step 125790: {'lr': 3.229474228855805e-05, 'samples': 24151680, 'steps': 125789, 'loss/train': 1.343693494796753} 11/07/2021 14:56:46 - INFO - __main__ - Step 125791: {'lr': 3.229213354115479e-05, 'samples': 24151872, 'steps': 125790, 'loss/train': 0.7881277203559875} 11/07/2021 14:56:47 - INFO - __main__ - Step 125792: {'lr': 3.228952489184736e-05, 'samples': 24152064, 'steps': 125791, 'loss/train': 1.0713088512420654} 11/07/2021 14:56:48 - INFO - __main__ - Step 125793: {'lr': 3.2286916340636876e-05, 'samples': 24152256, 'steps': 125792, 'loss/train': 1.6920560598373413} 11/07/2021 14:56:48 - INFO - __main__ - Step 125794: {'lr': 3.228430788752465e-05, 'samples': 24152448, 'steps': 125793, 'loss/train': 0.06890112161636353} 11/07/2021 14:56:48 - INFO - __main__ - Step 125795: {'lr': 3.2281699532511674e-05, 'samples': 24152640, 'steps': 125794, 'loss/train': 1.242842197418213} 11/07/2021 14:56:49 - INFO - __main__ - Step 125796: {'lr': 3.2279091275599194e-05, 'samples': 24152832, 'steps': 125795, 'loss/train': 1.1769212484359741} 11/07/2021 14:56:49 - INFO - __main__ - Step 125797: {'lr': 3.227648311678841e-05, 'samples': 24153024, 'steps': 125796, 'loss/train': 0.899408757686615} 11/07/2021 14:56:50 - INFO - __main__ - Step 125798: {'lr': 3.227387505608048e-05, 'samples': 24153216, 'steps': 125797, 'loss/train': 1.0908491611480713} 11/07/2021 14:56:51 - INFO - __main__ - Step 125799: {'lr': 3.227126709347658e-05, 'samples': 24153408, 'steps': 125798, 'loss/train': 1.3813693523406982} 11/07/2021 14:56:51 - INFO - __main__ - Step 125800: {'lr': 3.226865922897787e-05, 'samples': 24153600, 'steps': 125799, 'loss/train': 1.0826311111450195} 11/07/2021 14:56:51 - INFO - __main__ - Step 125801: {'lr': 3.226605146258557e-05, 'samples': 24153792, 'steps': 125800, 'loss/train': 1.4046744108200073} 11/07/2021 14:56:52 - INFO - __main__ - Step 125802: {'lr': 3.226344379430082e-05, 'samples': 24153984, 'steps': 125801, 'loss/train': 0.917705774307251} 11/07/2021 14:56:53 - INFO - __main__ - Step 125803: {'lr': 3.226083622412479e-05, 'samples': 24154176, 'steps': 125802, 'loss/train': 1.0103750228881836} 11/07/2021 14:56:53 - INFO - __main__ - Step 125804: {'lr': 3.225822875205869e-05, 'samples': 24154368, 'steps': 125803, 'loss/train': 1.4128302335739136} 11/07/2021 14:56:53 - INFO - __main__ - Step 125805: {'lr': 3.225562137810364e-05, 'samples': 24154560, 'steps': 125804, 'loss/train': 1.4599567651748657} 11/07/2021 14:56:54 - INFO - __main__ - Step 125806: {'lr': 3.2253014102260895e-05, 'samples': 24154752, 'steps': 125805, 'loss/train': 0.06797361373901367} 11/07/2021 14:56:54 - INFO - __main__ - Step 125807: {'lr': 3.2250406924531544e-05, 'samples': 24154944, 'steps': 125806, 'loss/train': 1.1503299474716187} 11/07/2021 14:56:55 - INFO - __main__ - Step 125808: {'lr': 3.224779984491685e-05, 'samples': 24155136, 'steps': 125807, 'loss/train': 1.5402178764343262} 11/07/2021 14:56:56 - INFO - __main__ - Step 125809: {'lr': 3.22451928634179e-05, 'samples': 24155328, 'steps': 125808, 'loss/train': 0.9188585877418518} 11/07/2021 14:56:56 - INFO - __main__ - Step 125810: {'lr': 3.2242585980035905e-05, 'samples': 24155520, 'steps': 125809, 'loss/train': 0.9100019931793213} 11/07/2021 14:56:56 - INFO - __main__ - Step 125811: {'lr': 3.2239979194772033e-05, 'samples': 24155712, 'steps': 125810, 'loss/train': 1.3479294776916504} 11/07/2021 14:56:57 - INFO - __main__ - Step 125812: {'lr': 3.223737250762748e-05, 'samples': 24155904, 'steps': 125811, 'loss/train': 1.2871322631835938} 11/07/2021 14:56:57 - INFO - __main__ - Step 125813: {'lr': 3.2234765918603385e-05, 'samples': 24156096, 'steps': 125812, 'loss/train': 1.3060723543167114} 11/07/2021 14:56:58 - INFO - __main__ - Step 125814: {'lr': 3.2232159427700966e-05, 'samples': 24156288, 'steps': 125813, 'loss/train': 1.9915461540222168} 11/07/2021 14:56:58 - INFO - __main__ - Step 125815: {'lr': 3.2229553034921365e-05, 'samples': 24156480, 'steps': 125814, 'loss/train': 0.7453563213348389} 11/07/2021 14:56:59 - INFO - __main__ - Step 125816: {'lr': 3.222694674026577e-05, 'samples': 24156672, 'steps': 125815, 'loss/train': 0.8088968396186829} 11/07/2021 14:56:59 - INFO - __main__ - Step 125817: {'lr': 3.222434054373535e-05, 'samples': 24156864, 'steps': 125816, 'loss/train': 1.104385256767273} 11/07/2021 14:57:00 - INFO - __main__ - Step 125818: {'lr': 3.2221734445331275e-05, 'samples': 24157056, 'steps': 125817, 'loss/train': 1.3632395267486572} 11/07/2021 14:57:00 - INFO - __main__ - Step 125819: {'lr': 3.221912844505473e-05, 'samples': 24157248, 'steps': 125818, 'loss/train': 1.2575935125350952} 11/07/2021 14:57:01 - INFO - __main__ - Step 125820: {'lr': 3.2216522542906885e-05, 'samples': 24157440, 'steps': 125819, 'loss/train': 1.303808331489563} 11/07/2021 14:57:01 - INFO - __main__ - Step 125821: {'lr': 3.221391673888899e-05, 'samples': 24157632, 'steps': 125820, 'loss/train': 1.7796634435653687} 11/07/2021 14:57:02 - INFO - __main__ - Step 125822: {'lr': 3.221131103300207e-05, 'samples': 24157824, 'steps': 125821, 'loss/train': 0.835414707660675} 11/07/2021 14:57:02 - INFO - __main__ - Step 125823: {'lr': 3.220870542524737e-05, 'samples': 24158016, 'steps': 125822, 'loss/train': 1.268065333366394} 11/07/2021 14:57:03 - INFO - __main__ - Step 125824: {'lr': 3.220609991562606e-05, 'samples': 24158208, 'steps': 125823, 'loss/train': 1.279423475265503} 11/07/2021 14:57:03 - INFO - __main__ - Step 125825: {'lr': 3.220349450413934e-05, 'samples': 24158400, 'steps': 125824, 'loss/train': 1.607831358909607} 11/07/2021 14:57:04 - INFO - __main__ - Step 125826: {'lr': 3.220088919078834e-05, 'samples': 24158592, 'steps': 125825, 'loss/train': 1.1834229230880737} 11/07/2021 14:57:04 - INFO - __main__ - Step 125827: {'lr': 3.219828397557428e-05, 'samples': 24158784, 'steps': 125826, 'loss/train': 1.4088584184646606} 11/07/2021 14:57:04 - INFO - __main__ - Step 125828: {'lr': 3.2195678858498304e-05, 'samples': 24158976, 'steps': 125827, 'loss/train': 1.510187029838562} 11/07/2021 14:57:05 - INFO - __main__ - Step 125829: {'lr': 3.21930738395616e-05, 'samples': 24159168, 'steps': 125828, 'loss/train': 1.5549393892288208} 11/07/2021 14:57:06 - INFO - __main__ - Step 125830: {'lr': 3.2190468918765344e-05, 'samples': 24159360, 'steps': 125829, 'loss/train': 1.1422654390335083} 11/07/2021 14:57:06 - INFO - __main__ - Step 125831: {'lr': 3.2187864096110686e-05, 'samples': 24159552, 'steps': 125830, 'loss/train': 1.1681532859802246} 11/07/2021 14:57:07 - INFO - __main__ - Step 125832: {'lr': 3.21852593715988e-05, 'samples': 24159744, 'steps': 125831, 'loss/train': 1.0817822217941284} 11/07/2021 14:57:07 - INFO - __main__ - Step 125833: {'lr': 3.218265474523091e-05, 'samples': 24159936, 'steps': 125832, 'loss/train': 1.2055199146270752} 11/07/2021 14:57:08 - INFO - __main__ - Step 125834: {'lr': 3.218005021700815e-05, 'samples': 24160128, 'steps': 125833, 'loss/train': 1.3968089818954468} 11/07/2021 14:57:08 - INFO - __main__ - Step 125835: {'lr': 3.217744578693174e-05, 'samples': 24160320, 'steps': 125834, 'loss/train': 1.5319812297821045} 11/07/2021 14:57:09 - INFO - __main__ - Step 125836: {'lr': 3.217484145500274e-05, 'samples': 24160512, 'steps': 125835, 'loss/train': 1.069079041481018} 11/07/2021 14:57:09 - INFO - __main__ - Step 125837: {'lr': 3.2172237221222425e-05, 'samples': 24160704, 'steps': 125836, 'loss/train': 1.41275155544281} 11/07/2021 14:57:09 - INFO - __main__ - Step 125838: {'lr': 3.216963308559193e-05, 'samples': 24160896, 'steps': 125837, 'loss/train': 0.9209392666816711} 11/07/2021 14:57:10 - INFO - __main__ - Step 125839: {'lr': 3.216702904811242e-05, 'samples': 24161088, 'steps': 125838, 'loss/train': 1.1835482120513916} 11/07/2021 14:57:11 - INFO - __main__ - Step 125840: {'lr': 3.2164425108785114e-05, 'samples': 24161280, 'steps': 125839, 'loss/train': 1.124697208404541} 11/07/2021 14:57:11 - INFO - __main__ - Step 125841: {'lr': 3.2161821267611134e-05, 'samples': 24161472, 'steps': 125840, 'loss/train': 1.0683183670043945} 11/07/2021 14:57:11 - INFO - __main__ - Step 125842: {'lr': 3.21592175245917e-05, 'samples': 24161664, 'steps': 125841, 'loss/train': 1.3523565530776978} 11/07/2021 14:57:12 - INFO - __main__ - Step 125843: {'lr': 3.215661387972793e-05, 'samples': 24161856, 'steps': 125842, 'loss/train': 0.8890810608863831} 11/07/2021 14:57:13 - INFO - __main__ - Step 125844: {'lr': 3.215401033302104e-05, 'samples': 24162048, 'steps': 125843, 'loss/train': 1.5833877325057983} 11/07/2021 14:57:14 - INFO - __main__ - Step 125845: {'lr': 3.215140688447221e-05, 'samples': 24162240, 'steps': 125844, 'loss/train': 1.1081465482711792} 11/07/2021 14:57:14 - INFO - __main__ - Step 125846: {'lr': 3.214880353408256e-05, 'samples': 24162432, 'steps': 125845, 'loss/train': 2.0906970500946045} 11/07/2021 14:57:14 - INFO - __main__ - Step 125847: {'lr': 3.214620028185333e-05, 'samples': 24162624, 'steps': 125846, 'loss/train': 1.5354163646697998} 11/07/2021 14:57:15 - INFO - __main__ - Step 125848: {'lr': 3.2143597127785695e-05, 'samples': 24162816, 'steps': 125847, 'loss/train': 0.9510960578918457} 11/07/2021 14:57:15 - INFO - __main__ - Step 125849: {'lr': 3.214099407188076e-05, 'samples': 24163008, 'steps': 125848, 'loss/train': 1.5586023330688477} 11/07/2021 14:57:16 - INFO - __main__ - Step 125850: {'lr': 3.2138391114139715e-05, 'samples': 24163200, 'steps': 125849, 'loss/train': 1.3485784530639648} 11/07/2021 14:57:16 - INFO - __main__ - Step 125851: {'lr': 3.213578825456376e-05, 'samples': 24163392, 'steps': 125850, 'loss/train': 1.386326551437378} 11/07/2021 14:57:17 - INFO - __main__ - Step 125852: {'lr': 3.2133185493154025e-05, 'samples': 24163584, 'steps': 125851, 'loss/train': 1.7173149585723877} 11/07/2021 14:57:17 - INFO - __main__ - Step 125853: {'lr': 3.213058282991174e-05, 'samples': 24163776, 'steps': 125852, 'loss/train': 0.8114742040634155} 11/07/2021 14:57:17 - INFO - __main__ - Step 125854: {'lr': 3.212798026483807e-05, 'samples': 24163968, 'steps': 125853, 'loss/train': 1.4373350143432617} 11/07/2021 14:57:18 - INFO - __main__ - Step 125855: {'lr': 3.212537779793415e-05, 'samples': 24164160, 'steps': 125854, 'loss/train': 1.2822626829147339} 11/07/2021 14:57:19 - INFO - __main__ - Step 125856: {'lr': 3.2122775429201164e-05, 'samples': 24164352, 'steps': 125855, 'loss/train': 1.3812768459320068} 11/07/2021 14:57:19 - INFO - __main__ - Step 125857: {'lr': 3.2120173158640297e-05, 'samples': 24164544, 'steps': 125856, 'loss/train': 1.0997570753097534} 11/07/2021 14:57:19 - INFO - __main__ - Step 125858: {'lr': 3.211757098625273e-05, 'samples': 24164736, 'steps': 125857, 'loss/train': 1.6267762184143066} 11/07/2021 14:57:20 - INFO - __main__ - Step 125859: {'lr': 3.211496891203961e-05, 'samples': 24164928, 'steps': 125858, 'loss/train': 1.4080400466918945} 11/07/2021 14:57:21 - INFO - __main__ - Step 125860: {'lr': 3.211236693600214e-05, 'samples': 24165120, 'steps': 125859, 'loss/train': 1.1182751655578613} 11/07/2021 14:57:21 - INFO - __main__ - Step 125861: {'lr': 3.2109765058141534e-05, 'samples': 24165312, 'steps': 125860, 'loss/train': 1.2514989376068115} 11/07/2021 14:57:22 - INFO - __main__ - Step 125862: {'lr': 3.210716327845883e-05, 'samples': 24165504, 'steps': 125861, 'loss/train': 1.5702177286148071} 11/07/2021 14:57:22 - INFO - __main__ - Step 125863: {'lr': 3.210456159695527e-05, 'samples': 24165696, 'steps': 125862, 'loss/train': 0.7733402848243713} 11/07/2021 14:57:22 - INFO - __main__ - Step 125864: {'lr': 3.2101960013632056e-05, 'samples': 24165888, 'steps': 125863, 'loss/train': 1.0777370929718018} 11/07/2021 14:57:24 - INFO - __main__ - Step 125865: {'lr': 3.209935852849033e-05, 'samples': 24166080, 'steps': 125864, 'loss/train': 0.9402232766151428} 11/07/2021 14:57:24 - INFO - __main__ - Step 125866: {'lr': 3.209675714153126e-05, 'samples': 24166272, 'steps': 125865, 'loss/train': 1.536487102508545} 11/07/2021 14:57:25 - INFO - __main__ - Step 125867: {'lr': 3.2094155852756016e-05, 'samples': 24166464, 'steps': 125866, 'loss/train': 1.265724539756775} 11/07/2021 14:57:25 - INFO - __main__ - Step 125868: {'lr': 3.2091554662165815e-05, 'samples': 24166656, 'steps': 125867, 'loss/train': 1.0755988359451294} 11/07/2021 14:57:25 - INFO - __main__ - Step 125869: {'lr': 3.2088953569761796e-05, 'samples': 24166848, 'steps': 125868, 'loss/train': 1.219467282295227} 11/07/2021 14:57:26 - INFO - __main__ - Step 125870: {'lr': 3.20863525755451e-05, 'samples': 24167040, 'steps': 125869, 'loss/train': 0.7431042194366455} 11/07/2021 14:57:27 - INFO - __main__ - Step 125871: {'lr': 3.208375167951697e-05, 'samples': 24167232, 'steps': 125870, 'loss/train': 0.11075641214847565} 11/07/2021 14:57:27 - INFO - __main__ - Step 125872: {'lr': 3.208115088167851e-05, 'samples': 24167424, 'steps': 125871, 'loss/train': 0.9404196739196777} 11/07/2021 14:57:28 - INFO - __main__ - Step 125873: {'lr': 3.207855018203093e-05, 'samples': 24167616, 'steps': 125872, 'loss/train': 0.09771916270256042} 11/07/2021 14:57:28 - INFO - __main__ - Step 125874: {'lr': 3.2075949580575386e-05, 'samples': 24167808, 'steps': 125873, 'loss/train': 1.688042163848877} 11/07/2021 14:57:28 - INFO - __main__ - Step 125875: {'lr': 3.207334907731313e-05, 'samples': 24168000, 'steps': 125874, 'loss/train': 1.1509121656417847} 11/07/2021 14:57:29 - INFO - __main__ - Step 125876: {'lr': 3.207074867224519e-05, 'samples': 24168192, 'steps': 125875, 'loss/train': 1.0311591625213623} 11/07/2021 14:57:30 - INFO - __main__ - Step 125877: {'lr': 3.206814836537281e-05, 'samples': 24168384, 'steps': 125876, 'loss/train': 1.666965126991272} 11/07/2021 14:57:30 - INFO - __main__ - Step 125878: {'lr': 3.2065548156697156e-05, 'samples': 24168576, 'steps': 125877, 'loss/train': 1.2315890789031982} 11/07/2021 14:57:30 - INFO - __main__ - Step 125879: {'lr': 3.2062948046219395e-05, 'samples': 24168768, 'steps': 125878, 'loss/train': 1.5472732782363892} 11/07/2021 14:57:31 - INFO - __main__ - Step 125880: {'lr': 3.2060348033940725e-05, 'samples': 24168960, 'steps': 125879, 'loss/train': 1.415156364440918} 11/07/2021 14:57:32 - INFO - __main__ - Step 125881: {'lr': 3.205774811986231e-05, 'samples': 24169152, 'steps': 125880, 'loss/train': 1.2618374824523926} 11/07/2021 14:57:32 - INFO - __main__ - Step 125882: {'lr': 3.205514830398529e-05, 'samples': 24169344, 'steps': 125881, 'loss/train': 1.1910127401351929} 11/07/2021 14:57:32 - INFO - __main__ - Step 125883: {'lr': 3.205254858631085e-05, 'samples': 24169536, 'steps': 125882, 'loss/train': 0.7480348348617554} 11/07/2021 14:57:33 - INFO - __main__ - Step 125884: {'lr': 3.2049948966840185e-05, 'samples': 24169728, 'steps': 125883, 'loss/train': 1.4113608598709106} 11/07/2021 14:57:33 - INFO - __main__ - Step 125885: {'lr': 3.204734944557444e-05, 'samples': 24169920, 'steps': 125884, 'loss/train': 1.2644777297973633} 11/07/2021 14:57:34 - INFO - __main__ - Step 125886: {'lr': 3.2044750022514805e-05, 'samples': 24170112, 'steps': 125885, 'loss/train': 0.9999291896820068} 11/07/2021 14:57:35 - INFO - __main__ - Step 125887: {'lr': 3.204215069766245e-05, 'samples': 24170304, 'steps': 125886, 'loss/train': 1.3298897743225098} 11/07/2021 14:57:35 - INFO - __main__ - Step 125888: {'lr': 3.203955147101856e-05, 'samples': 24170496, 'steps': 125887, 'loss/train': 1.4191738367080688} 11/07/2021 14:57:35 - INFO - __main__ - Step 125889: {'lr': 3.203695234258427e-05, 'samples': 24170688, 'steps': 125888, 'loss/train': 0.7945061326026917} 11/07/2021 14:57:36 - INFO - __main__ - Step 125890: {'lr': 3.203435331236074e-05, 'samples': 24170880, 'steps': 125889, 'loss/train': 1.263504147529602} 11/07/2021 14:57:36 - INFO - __main__ - Step 125891: {'lr': 3.203175438034916e-05, 'samples': 24171072, 'steps': 125890, 'loss/train': 1.3400596380233765} 11/07/2021 14:57:37 - INFO - __main__ - Step 125892: {'lr': 3.2029155546550725e-05, 'samples': 24171264, 'steps': 125891, 'loss/train': 1.3623052835464478} 11/07/2021 14:57:37 - INFO - __main__ - Step 125893: {'lr': 3.2026556810966585e-05, 'samples': 24171456, 'steps': 125892, 'loss/train': 1.4082744121551514} 11/07/2021 14:57:38 - INFO - __main__ - Step 125894: {'lr': 3.2023958173597934e-05, 'samples': 24171648, 'steps': 125893, 'loss/train': 0.8541359305381775} 11/07/2021 14:57:38 - INFO - __main__ - Step 125895: {'lr': 3.202135963444591e-05, 'samples': 24171840, 'steps': 125894, 'loss/train': 0.5384044051170349} 11/07/2021 14:57:38 - INFO - __main__ - Step 125896: {'lr': 3.201876119351169e-05, 'samples': 24172032, 'steps': 125895, 'loss/train': 1.0337408781051636} 11/07/2021 14:57:39 - INFO - __main__ - Step 125897: {'lr': 3.2016162850796446e-05, 'samples': 24172224, 'steps': 125896, 'loss/train': 1.4058729410171509} 11/07/2021 14:57:40 - INFO - __main__ - Step 125898: {'lr': 3.201356460630137e-05, 'samples': 24172416, 'steps': 125897, 'loss/train': 1.1634140014648438} 11/07/2021 14:57:40 - INFO - __main__ - Step 125899: {'lr': 3.20109664600276e-05, 'samples': 24172608, 'steps': 125898, 'loss/train': 1.4151699542999268} 11/07/2021 14:57:40 - INFO - __main__ - Step 125900: {'lr': 3.200836841197635e-05, 'samples': 24172800, 'steps': 125899, 'loss/train': 1.076608419418335} 11/07/2021 14:57:41 - INFO - __main__ - Step 125901: {'lr': 3.2005770462148754e-05, 'samples': 24172992, 'steps': 125900, 'loss/train': 0.9005810618400574} 11/07/2021 14:57:42 - INFO - __main__ - Step 125902: {'lr': 3.200317261054605e-05, 'samples': 24173184, 'steps': 125901, 'loss/train': 1.2183470726013184} 11/07/2021 14:57:42 - INFO - __main__ - Step 125903: {'lr': 3.2000574857169284e-05, 'samples': 24173376, 'steps': 125902, 'loss/train': 0.821739137172699} 11/07/2021 14:57:43 - INFO - __main__ - Step 125904: {'lr': 3.199797720201972e-05, 'samples': 24173568, 'steps': 125903, 'loss/train': 1.3322659730911255} 11/07/2021 14:57:43 - INFO - __main__ - Step 125905: {'lr': 3.1995379645098495e-05, 'samples': 24173760, 'steps': 125904, 'loss/train': 1.0812039375305176} 11/07/2021 14:57:43 - INFO - __main__ - Step 125906: {'lr': 3.199278218640678e-05, 'samples': 24173952, 'steps': 125905, 'loss/train': 1.4478471279144287} 11/07/2021 14:57:44 - INFO - __main__ - Step 125907: {'lr': 3.1990184825945764e-05, 'samples': 24174144, 'steps': 125906, 'loss/train': 1.4920064210891724} 11/07/2021 14:57:45 - INFO - __main__ - Step 125908: {'lr': 3.1987587563716584e-05, 'samples': 24174336, 'steps': 125907, 'loss/train': 1.2045583724975586} 11/07/2021 14:57:45 - INFO - __main__ - Step 125909: {'lr': 3.1984990399720444e-05, 'samples': 24174528, 'steps': 125908, 'loss/train': 1.0746018886566162} 11/07/2021 14:57:45 - INFO - __main__ - Step 125910: {'lr': 3.198239333395852e-05, 'samples': 24174720, 'steps': 125909, 'loss/train': 1.4091451168060303} 11/07/2021 14:57:46 - INFO - __main__ - Step 125911: {'lr': 3.1979796366431946e-05, 'samples': 24174912, 'steps': 125910, 'loss/train': 1.2547918558120728} 11/07/2021 14:57:46 - INFO - __main__ - Step 125912: {'lr': 3.1977199497141926e-05, 'samples': 24175104, 'steps': 125911, 'loss/train': 1.1056960821151733} 11/07/2021 14:57:47 - INFO - __main__ - Step 125913: {'lr': 3.1974602726089594e-05, 'samples': 24175296, 'steps': 125912, 'loss/train': 1.6620707511901855} 11/07/2021 14:57:47 - INFO - __main__ - Step 125914: {'lr': 3.197200605327616e-05, 'samples': 24175488, 'steps': 125913, 'loss/train': 1.0675464868545532} 11/07/2021 14:57:48 - INFO - __main__ - Step 125915: {'lr': 3.196940947870283e-05, 'samples': 24175680, 'steps': 125914, 'loss/train': 0.700693666934967} 11/07/2021 14:57:48 - INFO - __main__ - Step 125916: {'lr': 3.196681300237067e-05, 'samples': 24175872, 'steps': 125915, 'loss/train': 1.2513068914413452} 11/07/2021 14:57:48 - INFO - __main__ - Step 125917: {'lr': 3.196421662428089e-05, 'samples': 24176064, 'steps': 125916, 'loss/train': 1.5377421379089355} 11/07/2021 14:57:50 - INFO - __main__ - Step 125918: {'lr': 3.196162034443467e-05, 'samples': 24176256, 'steps': 125917, 'loss/train': 1.407827377319336} 11/07/2021 14:57:50 - INFO - __main__ - Step 125919: {'lr': 3.195902416283317e-05, 'samples': 24176448, 'steps': 125918, 'loss/train': 1.0126336812973022} 11/07/2021 14:57:50 - INFO - __main__ - Step 125920: {'lr': 3.195642807947757e-05, 'samples': 24176640, 'steps': 125919, 'loss/train': 1.227871060371399} 11/07/2021 14:57:51 - INFO - __main__ - Step 125921: {'lr': 3.195383209436906e-05, 'samples': 24176832, 'steps': 125920, 'loss/train': 1.1775054931640625} 11/07/2021 14:57:51 - INFO - __main__ - Step 125922: {'lr': 3.195123620750878e-05, 'samples': 24177024, 'steps': 125921, 'loss/train': 0.8491548299789429} 11/07/2021 14:57:52 - INFO - __main__ - Step 125923: {'lr': 3.194864041889789e-05, 'samples': 24177216, 'steps': 125922, 'loss/train': 0.9076461791992188} 11/07/2021 14:57:52 - INFO - __main__ - Step 125924: {'lr': 3.19460447285376e-05, 'samples': 24177408, 'steps': 125923, 'loss/train': 1.2781999111175537} 11/07/2021 14:57:53 - INFO - __main__ - Step 125925: {'lr': 3.1943449136429046e-05, 'samples': 24177600, 'steps': 125924, 'loss/train': 1.4512461423873901} 11/07/2021 14:57:53 - INFO - __main__ - Step 125926: {'lr': 3.194085364257343e-05, 'samples': 24177792, 'steps': 125925, 'loss/train': 0.33227524161338806} 11/07/2021 14:57:53 - INFO - __main__ - Step 125927: {'lr': 3.193825824697189e-05, 'samples': 24177984, 'steps': 125926, 'loss/train': 1.1261117458343506} 11/07/2021 14:57:54 - INFO - __main__ - Step 125928: {'lr': 3.1935662949625574e-05, 'samples': 24178176, 'steps': 125927, 'loss/train': 1.3381288051605225} 11/07/2021 14:57:55 - INFO - __main__ - Step 125929: {'lr': 3.193306775053578e-05, 'samples': 24178368, 'steps': 125928, 'loss/train': 0.7069416046142578} 11/07/2021 14:57:55 - INFO - __main__ - Step 125930: {'lr': 3.19304726497035e-05, 'samples': 24178560, 'steps': 125929, 'loss/train': 0.8833186030387878} 11/07/2021 14:57:55 - INFO - __main__ - Step 125931: {'lr': 3.192787764712998e-05, 'samples': 24178752, 'steps': 125930, 'loss/train': 1.3437076807022095} 11/07/2021 14:57:56 - INFO - __main__ - Step 125932: {'lr': 3.192528274281642e-05, 'samples': 24178944, 'steps': 125931, 'loss/train': 1.3863173723220825} 11/07/2021 14:57:57 - INFO - __main__ - Step 125933: {'lr': 3.192268793676395e-05, 'samples': 24179136, 'steps': 125932, 'loss/train': 1.0669236183166504} 11/07/2021 14:57:57 - INFO - __main__ - Step 125934: {'lr': 3.192009322897374e-05, 'samples': 24179328, 'steps': 125933, 'loss/train': 1.4648503065109253} 11/07/2021 14:57:58 - INFO - __main__ - Step 125935: {'lr': 3.191749861944698e-05, 'samples': 24179520, 'steps': 125934, 'loss/train': 1.3659404516220093} 11/07/2021 14:57:58 - INFO - __main__ - Step 125936: {'lr': 3.191490410818484e-05, 'samples': 24179712, 'steps': 125935, 'loss/train': 1.1762261390686035} 11/07/2021 14:57:58 - INFO - __main__ - Step 125937: {'lr': 3.191230969518846e-05, 'samples': 24179904, 'steps': 125936, 'loss/train': 1.582440733909607} 11/07/2021 14:57:59 - INFO - __main__ - Step 125938: {'lr': 3.1909715380459056e-05, 'samples': 24180096, 'steps': 125937, 'loss/train': 1.3298231363296509} 11/07/2021 14:58:00 - INFO - __main__ - Step 125939: {'lr': 3.1907121163997744e-05, 'samples': 24180288, 'steps': 125938, 'loss/train': 1.429466962814331} 11/07/2021 14:58:00 - INFO - __main__ - Step 125940: {'lr': 3.190452704580574e-05, 'samples': 24180480, 'steps': 125939, 'loss/train': 1.6868672370910645} 11/07/2021 14:58:00 - INFO - __main__ - Step 125941: {'lr': 3.190193302588418e-05, 'samples': 24180672, 'steps': 125940, 'loss/train': 1.0668383836746216} 11/07/2021 14:58:01 - INFO - __main__ - Step 125942: {'lr': 3.189933910423429e-05, 'samples': 24180864, 'steps': 125941, 'loss/train': 1.5768855810165405} 11/07/2021 14:58:01 - INFO - __main__ - Step 125943: {'lr': 3.1896745280857123e-05, 'samples': 24181056, 'steps': 125942, 'loss/train': 0.9928706884384155} 11/07/2021 14:58:02 - INFO - __main__ - Step 125944: {'lr': 3.189415155575395e-05, 'samples': 24181248, 'steps': 125943, 'loss/train': 1.1734410524368286} 11/07/2021 14:58:02 - INFO - __main__ - Step 125945: {'lr': 3.1891557928925896e-05, 'samples': 24181440, 'steps': 125944, 'loss/train': 1.1814415454864502} 11/07/2021 14:58:03 - INFO - __main__ - Step 125946: {'lr': 3.188896440037412e-05, 'samples': 24181632, 'steps': 125945, 'loss/train': 1.1735742092132568} 11/07/2021 14:58:03 - INFO - __main__ - Step 125947: {'lr': 3.188637097009983e-05, 'samples': 24181824, 'steps': 125946, 'loss/train': 1.0511856079101562} 11/07/2021 14:58:03 - INFO - __main__ - Step 125948: {'lr': 3.1883777638104185e-05, 'samples': 24182016, 'steps': 125947, 'loss/train': 1.2681041955947876} 11/07/2021 14:58:04 - INFO - __main__ - Step 125949: {'lr': 3.1881184404388334e-05, 'samples': 24182208, 'steps': 125948, 'loss/train': 0.8315151929855347} 11/07/2021 14:58:05 - INFO - __main__ - Step 125950: {'lr': 3.187859126895346e-05, 'samples': 24182400, 'steps': 125949, 'loss/train': 1.3544946908950806} 11/07/2021 14:58:05 - INFO - __main__ - Step 125951: {'lr': 3.187599823180071e-05, 'samples': 24182592, 'steps': 125950, 'loss/train': 1.1097040176391602} 11/07/2021 14:58:06 - INFO - __main__ - Step 125952: {'lr': 3.187340529293129e-05, 'samples': 24182784, 'steps': 125951, 'loss/train': 1.0199562311172485} 11/07/2021 14:58:06 - INFO - __main__ - Step 125953: {'lr': 3.1870812452346324e-05, 'samples': 24182976, 'steps': 125952, 'loss/train': 1.1974080801010132} 11/07/2021 14:58:07 - INFO - __main__ - Step 125954: {'lr': 3.1868219710047025e-05, 'samples': 24183168, 'steps': 125953, 'loss/train': 1.1735879182815552} 11/07/2021 14:58:07 - INFO - __main__ - Step 125955: {'lr': 3.186562706603452e-05, 'samples': 24183360, 'steps': 125954, 'loss/train': 1.1058409214019775} 11/07/2021 14:58:08 - INFO - __main__ - Step 125956: {'lr': 3.186303452031009e-05, 'samples': 24183552, 'steps': 125955, 'loss/train': 1.737718939781189} 11/07/2021 14:58:08 - INFO - __main__ - Step 125957: {'lr': 3.186044207287472e-05, 'samples': 24183744, 'steps': 125956, 'loss/train': 1.369547724723816} 11/07/2021 14:58:08 - INFO - __main__ - Step 125958: {'lr': 3.185784972372968e-05, 'samples': 24183936, 'steps': 125957, 'loss/train': 0.9763699173927307} 11/07/2021 14:58:09 - INFO - __main__ - Step 125959: {'lr': 3.185525747287613e-05, 'samples': 24184128, 'steps': 125958, 'loss/train': 0.9938520789146423} 11/07/2021 14:58:10 - INFO - __main__ - Step 125960: {'lr': 3.1852665320315225e-05, 'samples': 24184320, 'steps': 125959, 'loss/train': 1.4839884042739868} 11/07/2021 14:58:10 - INFO - __main__ - Step 125961: {'lr': 3.185007326604814e-05, 'samples': 24184512, 'steps': 125960, 'loss/train': 1.1378743648529053} 11/07/2021 14:58:11 - INFO - __main__ - Step 125962: {'lr': 3.184748131007606e-05, 'samples': 24184704, 'steps': 125961, 'loss/train': 1.2950893640518188} 11/07/2021 14:58:11 - INFO - __main__ - Step 125963: {'lr': 3.1844889452400135e-05, 'samples': 24184896, 'steps': 125962, 'loss/train': 1.4169368743896484} 11/07/2021 14:58:11 - INFO - __main__ - Step 125964: {'lr': 3.1842297693021524e-05, 'samples': 24185088, 'steps': 125963, 'loss/train': 5.598136901855469} 11/07/2021 14:58:12 - INFO - __main__ - Step 125965: {'lr': 3.183970603194142e-05, 'samples': 24185280, 'steps': 125964, 'loss/train': 0.7383898496627808} 11/07/2021 14:58:13 - INFO - __main__ - Step 125966: {'lr': 3.183711446916099e-05, 'samples': 24185472, 'steps': 125965, 'loss/train': 1.5290344953536987} 11/07/2021 14:58:13 - INFO - __main__ - Step 125967: {'lr': 3.183452300468137e-05, 'samples': 24185664, 'steps': 125966, 'loss/train': 1.1481231451034546} 11/07/2021 14:58:13 - INFO - __main__ - Step 125968: {'lr': 3.183193163850376e-05, 'samples': 24185856, 'steps': 125967, 'loss/train': 0.8731396198272705} 11/07/2021 14:58:14 - INFO - __main__ - Step 125969: {'lr': 3.182934037062934e-05, 'samples': 24186048, 'steps': 125968, 'loss/train': 0.8963198065757751} 11/07/2021 14:58:15 - INFO - __main__ - Step 125970: {'lr': 3.182674920105924e-05, 'samples': 24186240, 'steps': 125969, 'loss/train': 1.1823816299438477} 11/07/2021 14:58:15 - INFO - __main__ - Step 125971: {'lr': 3.182415812979461e-05, 'samples': 24186432, 'steps': 125970, 'loss/train': 1.507891297340393} 11/07/2021 14:58:15 - INFO - __main__ - Step 125972: {'lr': 3.182156715683668e-05, 'samples': 24186624, 'steps': 125971, 'loss/train': 1.2971888780593872} 11/07/2021 14:58:16 - INFO - __main__ - Step 125973: {'lr': 3.181897628218655e-05, 'samples': 24186816, 'steps': 125972, 'loss/train': 0.9535319805145264} 11/07/2021 14:58:16 - INFO - __main__ - Step 125974: {'lr': 3.1816385505845455e-05, 'samples': 24187008, 'steps': 125973, 'loss/train': 0.9336350560188293} 11/07/2021 14:58:17 - INFO - __main__ - Step 125975: {'lr': 3.181379482781449e-05, 'samples': 24187200, 'steps': 125974, 'loss/train': 0.4119570851325989} 11/07/2021 14:58:17 - INFO - __main__ - Step 125976: {'lr': 3.181120424809489e-05, 'samples': 24187392, 'steps': 125975, 'loss/train': 0.9447660446166992} 11/07/2021 14:58:18 - INFO - __main__ - Step 125977: {'lr': 3.180861376668778e-05, 'samples': 24187584, 'steps': 125976, 'loss/train': 1.5710057020187378} 11/07/2021 14:58:18 - INFO - __main__ - Step 125978: {'lr': 3.180602338359437e-05, 'samples': 24187776, 'steps': 125977, 'loss/train': 1.4339051246643066} 11/07/2021 14:58:19 - INFO - __main__ - Step 125979: {'lr': 3.180343309881578e-05, 'samples': 24187968, 'steps': 125978, 'loss/train': 1.604566216468811} 11/07/2021 14:58:20 - INFO - __main__ - Step 125980: {'lr': 3.180084291235319e-05, 'samples': 24188160, 'steps': 125979, 'loss/train': 1.3421473503112793} 11/07/2021 14:58:20 - INFO - __main__ - Step 125981: {'lr': 3.1798252824207814e-05, 'samples': 24188352, 'steps': 125980, 'loss/train': 0.7630378603935242} 11/07/2021 14:58:20 - INFO - __main__ - Step 125982: {'lr': 3.179566283438076e-05, 'samples': 24188544, 'steps': 125981, 'loss/train': 0.7785152792930603} 11/07/2021 14:58:21 - INFO - __main__ - Step 125983: {'lr': 3.17930729428732e-05, 'samples': 24188736, 'steps': 125982, 'loss/train': 1.1573426723480225} 11/07/2021 14:58:21 - INFO - __main__ - Step 125984: {'lr': 3.17904831496863e-05, 'samples': 24188928, 'steps': 125983, 'loss/train': 1.7995444536209106} 11/07/2021 14:58:22 - INFO - __main__ - Step 125985: {'lr': 3.178789345482125e-05, 'samples': 24189120, 'steps': 125984, 'loss/train': 1.606287956237793} 11/07/2021 14:58:22 - INFO - __main__ - Step 125986: {'lr': 3.178530385827921e-05, 'samples': 24189312, 'steps': 125985, 'loss/train': 0.9921368360519409} 11/07/2021 14:58:23 - INFO - __main__ - Step 125987: {'lr': 3.1782714360061334e-05, 'samples': 24189504, 'steps': 125986, 'loss/train': 0.9411188960075378} 11/07/2021 14:58:23 - INFO - __main__ - Step 125988: {'lr': 3.1780124960168824e-05, 'samples': 24189696, 'steps': 125987, 'loss/train': 1.4737417697906494} 11/07/2021 14:58:23 - INFO - __main__ - Step 125989: {'lr': 3.1777535658602805e-05, 'samples': 24189888, 'steps': 125988, 'loss/train': 1.47340989112854} 11/07/2021 14:58:24 - INFO - __main__ - Step 125990: {'lr': 3.1774946455364464e-05, 'samples': 24190080, 'steps': 125989, 'loss/train': 1.0466405153274536} 11/07/2021 14:58:25 - INFO - __main__ - Step 125991: {'lr': 3.177235735045497e-05, 'samples': 24190272, 'steps': 125990, 'loss/train': 1.1619210243225098} 11/07/2021 14:58:25 - INFO - __main__ - Step 125992: {'lr': 3.1769768343875516e-05, 'samples': 24190464, 'steps': 125991, 'loss/train': 1.3132916688919067} 11/07/2021 14:58:25 - INFO - __main__ - Step 125993: {'lr': 3.176717943562721e-05, 'samples': 24190656, 'steps': 125992, 'loss/train': 0.9195170998573303} 11/07/2021 14:58:26 - INFO - __main__ - Step 125994: {'lr': 3.1764590625711244e-05, 'samples': 24190848, 'steps': 125993, 'loss/train': 1.3321110010147095} 11/07/2021 14:58:27 - INFO - __main__ - Step 125995: {'lr': 3.176200191412876e-05, 'samples': 24191040, 'steps': 125994, 'loss/train': 0.9578988552093506} 11/07/2021 14:58:27 - INFO - __main__ - Step 125996: {'lr': 3.175941330088097e-05, 'samples': 24191232, 'steps': 125995, 'loss/train': 1.3095916509628296} 11/07/2021 14:58:28 - INFO - __main__ - Step 125997: {'lr': 3.175682478596903e-05, 'samples': 24191424, 'steps': 125996, 'loss/train': 0.9377864003181458} 11/07/2021 14:58:28 - INFO - __main__ - Step 125998: {'lr': 3.175423636939409e-05, 'samples': 24191616, 'steps': 125997, 'loss/train': 1.1236417293548584} 11/07/2021 14:58:28 - INFO - __main__ - Step 125999: {'lr': 3.175164805115732e-05, 'samples': 24191808, 'steps': 125998, 'loss/train': 1.529774785041809} 11/07/2021 14:58:29 - INFO - __main__ - Step 126000: {'lr': 3.174905983125989e-05, 'samples': 24192000, 'steps': 125999, 'loss/train': 1.4057385921478271} 11/07/2021 14:58:30 - INFO - __main__ - Step 126001: {'lr': 3.174647170970296e-05, 'samples': 24192192, 'steps': 126000, 'loss/train': 0.8773496150970459} 11/07/2021 14:58:30 - INFO - __main__ - Step 126002: {'lr': 3.1743883686487704e-05, 'samples': 24192384, 'steps': 126001, 'loss/train': 0.9192690849304199} 11/07/2021 14:58:30 - INFO - __main__ - Step 126003: {'lr': 3.174129576161533e-05, 'samples': 24192576, 'steps': 126002, 'loss/train': 1.2696853876113892} 11/07/2021 14:58:31 - INFO - __main__ - Step 126004: {'lr': 3.173870793508693e-05, 'samples': 24192768, 'steps': 126003, 'loss/train': 0.9649693369865417} 11/07/2021 14:58:31 - INFO - __main__ - Step 126005: {'lr': 3.17361202069037e-05, 'samples': 24192960, 'steps': 126004, 'loss/train': 0.9790403246879578} 11/07/2021 14:58:32 - INFO - __main__ - Step 126006: {'lr': 3.173353257706677e-05, 'samples': 24193152, 'steps': 126005, 'loss/train': 1.267565131187439} 11/07/2021 14:58:32 - INFO - __main__ - Step 126007: {'lr': 3.173094504557739e-05, 'samples': 24193344, 'steps': 126006, 'loss/train': 1.0817798376083374} 11/07/2021 14:58:33 - INFO - __main__ - Step 126008: {'lr': 3.1728357612436644e-05, 'samples': 24193536, 'steps': 126007, 'loss/train': 1.3055096864700317} 11/07/2021 14:58:33 - INFO - __main__ - Step 126009: {'lr': 3.172577027764573e-05, 'samples': 24193728, 'steps': 126008, 'loss/train': 1.26987624168396} 11/07/2021 14:58:34 - INFO - __main__ - Step 126010: {'lr': 3.172318304120583e-05, 'samples': 24193920, 'steps': 126009, 'loss/train': 1.6016510725021362} 11/07/2021 14:58:35 - INFO - __main__ - Step 126011: {'lr': 3.17205959031181e-05, 'samples': 24194112, 'steps': 126010, 'loss/train': 1.5371140241622925} 11/07/2021 14:58:35 - INFO - __main__ - Step 126012: {'lr': 3.171800886338369e-05, 'samples': 24194304, 'steps': 126011, 'loss/train': 1.061624526977539} 11/07/2021 14:58:35 - INFO - __main__ - Step 126013: {'lr': 3.17154219220038e-05, 'samples': 24194496, 'steps': 126012, 'loss/train': 1.0112688541412354} 11/07/2021 14:58:36 - INFO - __main__ - Step 126014: {'lr': 3.1712835078979596e-05, 'samples': 24194688, 'steps': 126013, 'loss/train': 1.1092876195907593} 11/07/2021 14:58:36 - INFO - __main__ - Step 126015: {'lr': 3.1710248334312186e-05, 'samples': 24194880, 'steps': 126014, 'loss/train': 1.3489726781845093} 11/07/2021 14:58:37 - INFO - __main__ - Step 126016: {'lr': 3.1707661688002763e-05, 'samples': 24195072, 'steps': 126015, 'loss/train': 1.4176223278045654} 11/07/2021 14:58:38 - INFO - __main__ - Step 126017: {'lr': 3.1705075140052494e-05, 'samples': 24195264, 'steps': 126016, 'loss/train': 1.3220479488372803} 11/07/2021 14:58:38 - INFO - __main__ - Step 126018: {'lr': 3.170248869046255e-05, 'samples': 24195456, 'steps': 126017, 'loss/train': 1.286642074584961} 11/07/2021 14:58:38 - INFO - __main__ - Step 126019: {'lr': 3.169990233923412e-05, 'samples': 24195648, 'steps': 126018, 'loss/train': 0.9833983778953552} 11/07/2021 14:58:39 - INFO - __main__ - Step 126020: {'lr': 3.169731608636831e-05, 'samples': 24195840, 'steps': 126019, 'loss/train': 0.8365879654884338} 11/07/2021 14:58:40 - INFO - __main__ - Step 126021: {'lr': 3.169472993186634e-05, 'samples': 24196032, 'steps': 126020, 'loss/train': 0.5453596115112305} 11/07/2021 14:58:40 - INFO - __main__ - Step 126022: {'lr': 3.169214387572936e-05, 'samples': 24196224, 'steps': 126021, 'loss/train': 1.5364946126937866} 11/07/2021 14:58:40 - INFO - __main__ - Step 126023: {'lr': 3.1689557917958524e-05, 'samples': 24196416, 'steps': 126022, 'loss/train': 0.9241828918457031} 11/07/2021 14:58:41 - INFO - __main__ - Step 126024: {'lr': 3.1686972058555e-05, 'samples': 24196608, 'steps': 126023, 'loss/train': 1.3760226964950562} 11/07/2021 14:58:41 - INFO - __main__ - Step 126025: {'lr': 3.168438629752002e-05, 'samples': 24196800, 'steps': 126024, 'loss/train': 1.3710341453552246} 11/07/2021 14:58:42 - INFO - __main__ - Step 126026: {'lr': 3.168180063485462e-05, 'samples': 24196992, 'steps': 126025, 'loss/train': 1.217366337776184} 11/07/2021 14:58:43 - INFO - __main__ - Step 126027: {'lr': 3.167921507056004e-05, 'samples': 24197184, 'steps': 126026, 'loss/train': 1.1569559574127197} 11/07/2021 14:58:43 - INFO - __main__ - Step 126028: {'lr': 3.1676629604637434e-05, 'samples': 24197376, 'steps': 126027, 'loss/train': 0.9253084063529968} 11/07/2021 14:58:43 - INFO - __main__ - Step 126029: {'lr': 3.1674044237087973e-05, 'samples': 24197568, 'steps': 126028, 'loss/train': 1.5552090406417847} 11/07/2021 14:58:44 - INFO - __main__ - Step 126030: {'lr': 3.167145896791282e-05, 'samples': 24197760, 'steps': 126029, 'loss/train': 1.39939546585083} 11/07/2021 14:58:45 - INFO - __main__ - Step 126031: {'lr': 3.166887379711314e-05, 'samples': 24197952, 'steps': 126030, 'loss/train': 1.8047301769256592} 11/07/2021 14:58:45 - INFO - __main__ - Step 126032: {'lr': 3.16662887246901e-05, 'samples': 24198144, 'steps': 126031, 'loss/train': 1.172060251235962} 11/07/2021 14:58:45 - INFO - __main__ - Step 126033: {'lr': 3.166370375064484e-05, 'samples': 24198336, 'steps': 126032, 'loss/train': 1.4436041116714478} 11/07/2021 14:58:46 - INFO - __main__ - Step 126034: {'lr': 3.166111887497858e-05, 'samples': 24198528, 'steps': 126033, 'loss/train': 0.6720024347305298} 11/07/2021 14:58:46 - INFO - __main__ - Step 126035: {'lr': 3.1658534097692425e-05, 'samples': 24198720, 'steps': 126034, 'loss/train': 1.4460731744766235} 11/07/2021 14:58:46 - INFO - __main__ - Step 126036: {'lr': 3.1655949418787635e-05, 'samples': 24198912, 'steps': 126035, 'loss/train': 1.2894419431686401} 11/07/2021 14:58:48 - INFO - __main__ - Step 126037: {'lr': 3.165336483826523e-05, 'samples': 24199104, 'steps': 126036, 'loss/train': 1.0689305067062378} 11/07/2021 14:58:48 - INFO - __main__ - Step 126038: {'lr': 3.1650780356126455e-05, 'samples': 24199296, 'steps': 126037, 'loss/train': 0.805029571056366} 11/07/2021 14:58:48 - INFO - __main__ - Step 126039: {'lr': 3.164819597237248e-05, 'samples': 24199488, 'steps': 126038, 'loss/train': 1.2341474294662476} 11/07/2021 14:58:49 - INFO - __main__ - Step 126040: {'lr': 3.1645611687004447e-05, 'samples': 24199680, 'steps': 126039, 'loss/train': 1.2755433320999146} 11/07/2021 14:58:49 - INFO - __main__ - Step 126041: {'lr': 3.164302750002354e-05, 'samples': 24199872, 'steps': 126040, 'loss/train': 0.8558013439178467} 11/07/2021 14:58:50 - INFO - __main__ - Step 126042: {'lr': 3.1640443411430933e-05, 'samples': 24200064, 'steps': 126041, 'loss/train': 1.585488200187683} 11/07/2021 14:58:50 - INFO - __main__ - Step 126043: {'lr': 3.1637859421227735e-05, 'samples': 24200256, 'steps': 126042, 'loss/train': 1.427300214767456} 11/07/2021 14:58:51 - INFO - __main__ - Step 126044: {'lr': 3.163527552941517e-05, 'samples': 24200448, 'steps': 126043, 'loss/train': 1.3959672451019287} 11/07/2021 14:58:51 - INFO - __main__ - Step 126045: {'lr': 3.16326917359944e-05, 'samples': 24200640, 'steps': 126044, 'loss/train': 1.0218570232391357} 11/07/2021 14:58:51 - INFO - __main__ - Step 126046: {'lr': 3.163010804096653e-05, 'samples': 24200832, 'steps': 126045, 'loss/train': 1.069917917251587} 11/07/2021 14:58:52 - INFO - __main__ - Step 126047: {'lr': 3.162752444433278e-05, 'samples': 24201024, 'steps': 126046, 'loss/train': 1.3632842302322388} 11/07/2021 14:58:53 - INFO - __main__ - Step 126048: {'lr': 3.16249409460943e-05, 'samples': 24201216, 'steps': 126047, 'loss/train': 1.404054880142212} 11/07/2021 14:58:53 - INFO - __main__ - Step 126049: {'lr': 3.162235754625226e-05, 'samples': 24201408, 'steps': 126048, 'loss/train': 1.1675307750701904} 11/07/2021 14:58:53 - INFO - __main__ - Step 126050: {'lr': 3.161977424480786e-05, 'samples': 24201600, 'steps': 126049, 'loss/train': 0.9641391634941101} 11/07/2021 14:58:54 - INFO - __main__ - Step 126051: {'lr': 3.161719104176217e-05, 'samples': 24201792, 'steps': 126050, 'loss/train': 1.2228584289550781} 11/07/2021 14:58:55 - INFO - __main__ - Step 126052: {'lr': 3.161460793711637e-05, 'samples': 24201984, 'steps': 126051, 'loss/train': 0.47511279582977295} 11/07/2021 14:58:55 - INFO - __main__ - Step 126053: {'lr': 3.16120249308717e-05, 'samples': 24202176, 'steps': 126052, 'loss/train': 1.3947207927703857} 11/07/2021 14:58:56 - INFO - __main__ - Step 126054: {'lr': 3.160944202302926e-05, 'samples': 24202368, 'steps': 126053, 'loss/train': 1.1840503215789795} 11/07/2021 14:58:56 - INFO - __main__ - Step 126055: {'lr': 3.160685921359027e-05, 'samples': 24202560, 'steps': 126054, 'loss/train': 0.7879982590675354} 11/07/2021 14:58:56 - INFO - __main__ - Step 126056: {'lr': 3.160427650255582e-05, 'samples': 24202752, 'steps': 126055, 'loss/train': 1.2145127058029175} 11/07/2021 14:58:58 - INFO - __main__ - Step 126057: {'lr': 3.1601693889927116e-05, 'samples': 24202944, 'steps': 126056, 'loss/train': 0.512731671333313} 11/07/2021 14:58:59 - INFO - __main__ - Step 126058: {'lr': 3.1599111375705344e-05, 'samples': 24203136, 'steps': 126057, 'loss/train': 0.8956285119056702} 11/07/2021 14:58:59 - INFO - __main__ - Step 126059: {'lr': 3.159652895989162e-05, 'samples': 24203328, 'steps': 126058, 'loss/train': 1.1692866086959839} 11/07/2021 14:58:59 - INFO - __main__ - Step 126060: {'lr': 3.159394664248713e-05, 'samples': 24203520, 'steps': 126059, 'loss/train': 1.7197498083114624} 11/07/2021 14:59:00 - INFO - __main__ - Step 126061: {'lr': 3.159136442349306e-05, 'samples': 24203712, 'steps': 126060, 'loss/train': 1.0136913061141968} 11/07/2021 14:59:00 - INFO - __main__ - Step 126062: {'lr': 3.158878230291054e-05, 'samples': 24203904, 'steps': 126061, 'loss/train': 1.5342984199523926} 11/07/2021 14:59:00 - INFO - __main__ - Step 126063: {'lr': 3.15862002807408e-05, 'samples': 24204096, 'steps': 126062, 'loss/train': 1.7410950660705566} 11/07/2021 14:59:01 - INFO - __main__ - Step 126064: {'lr': 3.1583618356984864e-05, 'samples': 24204288, 'steps': 126063, 'loss/train': 1.152647852897644} 11/07/2021 14:59:02 - INFO - __main__ - Step 126065: {'lr': 3.158103653164402e-05, 'samples': 24204480, 'steps': 126064, 'loss/train': 0.44357913732528687} 11/07/2021 14:59:02 - INFO - __main__ - Step 126066: {'lr': 3.157845480471938e-05, 'samples': 24204672, 'steps': 126065, 'loss/train': 1.5565214157104492} 11/07/2021 14:59:02 - INFO - __main__ - Step 126067: {'lr': 3.15758731762121e-05, 'samples': 24204864, 'steps': 126066, 'loss/train': 1.25431227684021} 11/07/2021 14:59:03 - INFO - __main__ - Step 126068: {'lr': 3.157329164612338e-05, 'samples': 24205056, 'steps': 126067, 'loss/train': 2.7190730571746826} 11/07/2021 14:59:04 - INFO - __main__ - Step 126069: {'lr': 3.157071021445434e-05, 'samples': 24205248, 'steps': 126068, 'loss/train': 1.0273759365081787} 11/07/2021 14:59:04 - INFO - __main__ - Step 126070: {'lr': 3.156812888120619e-05, 'samples': 24205440, 'steps': 126069, 'loss/train': 0.6864585876464844} 11/07/2021 14:59:04 - INFO - __main__ - Step 126071: {'lr': 3.156554764638009e-05, 'samples': 24205632, 'steps': 126070, 'loss/train': 0.9368714094161987} 11/07/2021 14:59:05 - INFO - __main__ - Step 126072: {'lr': 3.156296650997714e-05, 'samples': 24205824, 'steps': 126071, 'loss/train': 1.1453723907470703} 11/07/2021 14:59:05 - INFO - __main__ - Step 126073: {'lr': 3.1560385471998575e-05, 'samples': 24206016, 'steps': 126072, 'loss/train': 1.191416621208191} 11/07/2021 14:59:06 - INFO - __main__ - Step 126074: {'lr': 3.155780453244553e-05, 'samples': 24206208, 'steps': 126073, 'loss/train': 1.0141949653625488} 11/07/2021 14:59:07 - INFO - __main__ - Step 126075: {'lr': 3.1555223691319135e-05, 'samples': 24206400, 'steps': 126074, 'loss/train': 1.232019066810608} 11/07/2021 14:59:07 - INFO - __main__ - Step 126076: {'lr': 3.155264294862062e-05, 'samples': 24206592, 'steps': 126075, 'loss/train': 1.3066189289093018} 11/07/2021 14:59:07 - INFO - __main__ - Step 126077: {'lr': 3.1550062304351146e-05, 'samples': 24206784, 'steps': 126076, 'loss/train': 2.020922899246216} 11/07/2021 14:59:08 - INFO - __main__ - Step 126078: {'lr': 3.15474817585118e-05, 'samples': 24206976, 'steps': 126077, 'loss/train': 2.3786988258361816} 11/07/2021 14:59:08 - INFO - __main__ - Step 126079: {'lr': 3.15449013111038e-05, 'samples': 24207168, 'steps': 126078, 'loss/train': 1.3837330341339111} 11/07/2021 14:59:09 - INFO - __main__ - Step 126080: {'lr': 3.154232096212828e-05, 'samples': 24207360, 'steps': 126079, 'loss/train': 5.682657718658447} 11/07/2021 14:59:10 - INFO - __main__ - Step 126081: {'lr': 3.153974071158641e-05, 'samples': 24207552, 'steps': 126080, 'loss/train': 0.04678555950522423} 11/07/2021 14:59:10 - INFO - __main__ - Step 126082: {'lr': 3.153716055947936e-05, 'samples': 24207744, 'steps': 126081, 'loss/train': 1.641542911529541} 11/07/2021 14:59:10 - INFO - __main__ - Step 126083: {'lr': 3.153458050580832e-05, 'samples': 24207936, 'steps': 126082, 'loss/train': 1.4934054613113403} 11/07/2021 14:59:11 - INFO - __main__ - Step 126084: {'lr': 3.153200055057443e-05, 'samples': 24208128, 'steps': 126083, 'loss/train': 1.380338191986084} 11/07/2021 14:59:12 - INFO - __main__ - Step 126085: {'lr': 3.152942069377881e-05, 'samples': 24208320, 'steps': 126084, 'loss/train': 1.1530234813690186} 11/07/2021 14:59:12 - INFO - __main__ - Step 126086: {'lr': 3.1526840935422686e-05, 'samples': 24208512, 'steps': 126085, 'loss/train': 1.3954335451126099} 11/07/2021 14:59:12 - INFO - __main__ - Step 126087: {'lr': 3.1524261275507196e-05, 'samples': 24208704, 'steps': 126086, 'loss/train': 1.2727128267288208} 11/07/2021 14:59:13 - INFO - __main__ - Step 126088: {'lr': 3.152168171403352e-05, 'samples': 24208896, 'steps': 126087, 'loss/train': 0.06309381127357483} 11/07/2021 14:59:13 - INFO - __main__ - Step 126089: {'lr': 3.151910225100277e-05, 'samples': 24209088, 'steps': 126088, 'loss/train': 1.452245831489563} 11/07/2021 14:59:14 - INFO - __main__ - Step 126090: {'lr': 3.151652288641621e-05, 'samples': 24209280, 'steps': 126089, 'loss/train': 1.0166867971420288} 11/07/2021 14:59:14 - INFO - __main__ - Step 126091: {'lr': 3.1513943620274876e-05, 'samples': 24209472, 'steps': 126090, 'loss/train': 1.4135693311691284} 11/07/2021 14:59:15 - INFO - __main__ - Step 126092: {'lr': 3.1511364452580017e-05, 'samples': 24209664, 'steps': 126091, 'loss/train': 1.3305954933166504} 11/07/2021 14:59:15 - INFO - __main__ - Step 126093: {'lr': 3.150878538333274e-05, 'samples': 24209856, 'steps': 126092, 'loss/train': 1.5299299955368042} 11/07/2021 14:59:15 - INFO - __main__ - Step 126094: {'lr': 3.1506206412534213e-05, 'samples': 24210048, 'steps': 126093, 'loss/train': 1.1026477813720703} 11/07/2021 14:59:16 - INFO - __main__ - Step 126095: {'lr': 3.1503627540185655e-05, 'samples': 24210240, 'steps': 126094, 'loss/train': 1.3327929973602295} 11/07/2021 14:59:17 - INFO - __main__ - Step 126096: {'lr': 3.1501048766288176e-05, 'samples': 24210432, 'steps': 126095, 'loss/train': 1.49724543094635} 11/07/2021 14:59:17 - INFO - __main__ - Step 126097: {'lr': 3.149847009084294e-05, 'samples': 24210624, 'steps': 126096, 'loss/train': 1.0338753461837769} 11/07/2021 14:59:18 - INFO - __main__ - Step 126098: {'lr': 3.149589151385113e-05, 'samples': 24210816, 'steps': 126097, 'loss/train': 1.336267113685608} 11/07/2021 14:59:18 - INFO - __main__ - Step 126099: {'lr': 3.1493313035313916e-05, 'samples': 24211008, 'steps': 126098, 'loss/train': 1.2702559232711792} 11/07/2021 14:59:18 - INFO - __main__ - Step 126100: {'lr': 3.149073465523242e-05, 'samples': 24211200, 'steps': 126099, 'loss/train': 1.7057818174362183} 11/07/2021 14:59:19 - INFO - __main__ - Step 126101: {'lr': 3.148815637360783e-05, 'samples': 24211392, 'steps': 126100, 'loss/train': 1.1867088079452515} 11/07/2021 14:59:20 - INFO - __main__ - Step 126102: {'lr': 3.148557819044131e-05, 'samples': 24211584, 'steps': 126101, 'loss/train': 1.1348272562026978} 11/07/2021 14:59:20 - INFO - __main__ - Step 126103: {'lr': 3.148300010573407e-05, 'samples': 24211776, 'steps': 126102, 'loss/train': 1.3485392332077026} 11/07/2021 14:59:20 - INFO - __main__ - Step 126104: {'lr': 3.148042211948718e-05, 'samples': 24211968, 'steps': 126103, 'loss/train': 1.2441926002502441} 11/07/2021 14:59:21 - INFO - __main__ - Step 126105: {'lr': 3.147784423170183e-05, 'samples': 24212160, 'steps': 126104, 'loss/train': 0.58872389793396} 11/07/2021 14:59:22 - INFO - __main__ - Step 126106: {'lr': 3.1475266442379166e-05, 'samples': 24212352, 'steps': 126105, 'loss/train': 0.5095863342285156} 11/07/2021 14:59:22 - INFO - __main__ - Step 126107: {'lr': 3.14726887515204e-05, 'samples': 24212544, 'steps': 126106, 'loss/train': 1.219882845878601} 11/07/2021 14:59:22 - INFO - __main__ - Step 126108: {'lr': 3.147011115912668e-05, 'samples': 24212736, 'steps': 126107, 'loss/train': 0.9340551495552063} 11/07/2021 14:59:23 - INFO - __main__ - Step 126109: {'lr': 3.146753366519914e-05, 'samples': 24212928, 'steps': 126108, 'loss/train': 0.9631221890449524} 11/07/2021 14:59:23 - INFO - __main__ - Step 126110: {'lr': 3.146495626973894e-05, 'samples': 24213120, 'steps': 126109, 'loss/train': 1.8660950660705566} 11/07/2021 14:59:24 - INFO - __main__ - Step 126111: {'lr': 3.1462378972747286e-05, 'samples': 24213312, 'steps': 126110, 'loss/train': 1.56953763961792} 11/07/2021 14:59:24 - INFO - __main__ - Step 126112: {'lr': 3.1459801774225305e-05, 'samples': 24213504, 'steps': 126111, 'loss/train': 1.1057151556015015} 11/07/2021 14:59:25 - INFO - __main__ - Step 126113: {'lr': 3.145722467417417e-05, 'samples': 24213696, 'steps': 126112, 'loss/train': 1.1004718542099} 11/07/2021 14:59:25 - INFO - __main__ - Step 126114: {'lr': 3.145464767259501e-05, 'samples': 24213888, 'steps': 126113, 'loss/train': 1.1337778568267822} 11/07/2021 14:59:25 - INFO - __main__ - Step 126115: {'lr': 3.145207076948903e-05, 'samples': 24214080, 'steps': 126114, 'loss/train': 0.9742420315742493} 11/07/2021 14:59:27 - INFO - __main__ - Step 126116: {'lr': 3.1449493964857384e-05, 'samples': 24214272, 'steps': 126115, 'loss/train': 1.433230996131897} 11/07/2021 14:59:27 - INFO - __main__ - Step 126117: {'lr': 3.1446917258701276e-05, 'samples': 24214464, 'steps': 126116, 'loss/train': 1.5002905130386353} 11/07/2021 14:59:27 - INFO - __main__ - Step 126118: {'lr': 3.1444340651021753e-05, 'samples': 24214656, 'steps': 126117, 'loss/train': 1.5196881294250488} 11/07/2021 14:59:28 - INFO - __main__ - Step 126119: {'lr': 3.144176414182004e-05, 'samples': 24214848, 'steps': 126118, 'loss/train': 1.4905747175216675} 11/07/2021 14:59:28 - INFO - __main__ - Step 126120: {'lr': 3.1439187731097305e-05, 'samples': 24215040, 'steps': 126119, 'loss/train': 1.372028112411499} 11/07/2021 14:59:29 - INFO - __main__ - Step 126121: {'lr': 3.1436611418854675e-05, 'samples': 24215232, 'steps': 126120, 'loss/train': 1.2468520402908325} 11/07/2021 14:59:30 - INFO - __main__ - Step 126122: {'lr': 3.143403520509336e-05, 'samples': 24215424, 'steps': 126121, 'loss/train': 1.1025928258895874} 11/07/2021 14:59:30 - INFO - __main__ - Step 126123: {'lr': 3.143145908981449e-05, 'samples': 24215616, 'steps': 126122, 'loss/train': 1.039284110069275} 11/07/2021 14:59:30 - INFO - __main__ - Step 126124: {'lr': 3.142888307301922e-05, 'samples': 24215808, 'steps': 126123, 'loss/train': 1.3756442070007324} 11/07/2021 14:59:31 - INFO - __main__ - Step 126125: {'lr': 3.142630715470873e-05, 'samples': 24216000, 'steps': 126124, 'loss/train': 1.1376667022705078} 11/07/2021 14:59:32 - INFO - __main__ - Step 126126: {'lr': 3.142373133488416e-05, 'samples': 24216192, 'steps': 126125, 'loss/train': 0.145346999168396} 11/07/2021 14:59:32 - INFO - __main__ - Step 126127: {'lr': 3.142115561354672e-05, 'samples': 24216384, 'steps': 126126, 'loss/train': 1.2218153476715088} 11/07/2021 14:59:32 - INFO - __main__ - Step 126128: {'lr': 3.14185799906975e-05, 'samples': 24216576, 'steps': 126127, 'loss/train': 1.137263298034668} 11/07/2021 14:59:33 - INFO - __main__ - Step 126129: {'lr': 3.141600446633772e-05, 'samples': 24216768, 'steps': 126128, 'loss/train': 1.02194082736969} 11/07/2021 14:59:33 - INFO - __main__ - Step 126130: {'lr': 3.1413429040468536e-05, 'samples': 24216960, 'steps': 126129, 'loss/train': 1.1887410879135132} 11/07/2021 14:59:34 - INFO - __main__ - Step 126131: {'lr': 3.141085371309105e-05, 'samples': 24217152, 'steps': 126130, 'loss/train': 1.583444356918335} 11/07/2021 14:59:35 - INFO - __main__ - Step 126132: {'lr': 3.140827848420647e-05, 'samples': 24217344, 'steps': 126131, 'loss/train': 1.432094693183899} 11/07/2021 14:59:35 - INFO - __main__ - Step 126133: {'lr': 3.140570335381595e-05, 'samples': 24217536, 'steps': 126132, 'loss/train': 1.2118359804153442} 11/07/2021 14:59:35 - INFO - __main__ - Step 126134: {'lr': 3.140312832192063e-05, 'samples': 24217728, 'steps': 126133, 'loss/train': 1.3783693313598633} 11/07/2021 14:59:36 - INFO - __main__ - Step 126135: {'lr': 3.1400553388521684e-05, 'samples': 24217920, 'steps': 126134, 'loss/train': 1.4021354913711548} 11/07/2021 14:59:37 - INFO - __main__ - Step 126136: {'lr': 3.139797855362031e-05, 'samples': 24218112, 'steps': 126135, 'loss/train': 1.3508697748184204} 11/07/2021 14:59:37 - INFO - __main__ - Step 126137: {'lr': 3.1395403817217587e-05, 'samples': 24218304, 'steps': 126136, 'loss/train': 1.3202682733535767} 11/07/2021 14:59:38 - INFO - __main__ - Step 126138: {'lr': 3.139282917931477e-05, 'samples': 24218496, 'steps': 126137, 'loss/train': 1.044824481010437} 11/07/2021 14:59:38 - INFO - __main__ - Step 126139: {'lr': 3.139025463991294e-05, 'samples': 24218688, 'steps': 126138, 'loss/train': 1.3339577913284302} 11/07/2021 14:59:38 - INFO - __main__ - Step 126140: {'lr': 3.138768019901328e-05, 'samples': 24218880, 'steps': 126139, 'loss/train': 1.8552641868591309} 11/07/2021 14:59:39 - INFO - __main__ - Step 126141: {'lr': 3.138510585661697e-05, 'samples': 24219072, 'steps': 126140, 'loss/train': 1.317099928855896} 11/07/2021 14:59:40 - INFO - __main__ - Step 126142: {'lr': 3.138253161272517e-05, 'samples': 24219264, 'steps': 126141, 'loss/train': 1.1360822916030884} 11/07/2021 14:59:40 - INFO - __main__ - Step 126143: {'lr': 3.1379957467339014e-05, 'samples': 24219456, 'steps': 126142, 'loss/train': 1.8520612716674805} 11/07/2021 14:59:41 - INFO - __main__ - Step 126144: {'lr': 3.137738342045973e-05, 'samples': 24219648, 'steps': 126143, 'loss/train': 0.4049614667892456} 11/07/2021 14:59:41 - INFO - __main__ - Step 126145: {'lr': 3.137480947208837e-05, 'samples': 24219840, 'steps': 126144, 'loss/train': 0.7613654732704163} 11/07/2021 14:59:41 - INFO - __main__ - Step 126146: {'lr': 3.137223562222616e-05, 'samples': 24220032, 'steps': 126145, 'loss/train': 1.308700442314148} 11/07/2021 14:59:42 - INFO - __main__ - Step 126147: {'lr': 3.1369661870874227e-05, 'samples': 24220224, 'steps': 126146, 'loss/train': 1.3893647193908691} 11/07/2021 14:59:43 - INFO - __main__ - Step 126148: {'lr': 3.1367088218033776e-05, 'samples': 24220416, 'steps': 126147, 'loss/train': 1.0902721881866455} 11/07/2021 14:59:43 - INFO - __main__ - Step 126149: {'lr': 3.1364514663705906e-05, 'samples': 24220608, 'steps': 126148, 'loss/train': 0.9678022265434265} 11/07/2021 14:59:44 - INFO - __main__ - Step 126150: {'lr': 3.136194120789185e-05, 'samples': 24220800, 'steps': 126149, 'loss/train': 1.735148549079895} 11/07/2021 14:59:44 - INFO - __main__ - Step 126151: {'lr': 3.135936785059271e-05, 'samples': 24220992, 'steps': 126150, 'loss/train': 1.1054000854492188} 11/07/2021 14:59:44 - INFO - __main__ - Step 126152: {'lr': 3.135679459180965e-05, 'samples': 24221184, 'steps': 126151, 'loss/train': 1.1356743574142456} 11/07/2021 14:59:45 - INFO - __main__ - Step 126153: {'lr': 3.1354221431543876e-05, 'samples': 24221376, 'steps': 126152, 'loss/train': 0.19323112070560455} 11/07/2021 14:59:46 - INFO - __main__ - Step 126154: {'lr': 3.135164836979651e-05, 'samples': 24221568, 'steps': 126153, 'loss/train': 1.2640681266784668} 11/07/2021 14:59:46 - INFO - __main__ - Step 126155: {'lr': 3.13490754065687e-05, 'samples': 24221760, 'steps': 126154, 'loss/train': 1.5073405504226685} 11/07/2021 14:59:46 - INFO - __main__ - Step 126156: {'lr': 3.134650254186164e-05, 'samples': 24221952, 'steps': 126155, 'loss/train': 1.9172307252883911} 11/07/2021 14:59:47 - INFO - __main__ - Step 126157: {'lr': 3.134392977567649e-05, 'samples': 24222144, 'steps': 126156, 'loss/train': 1.3731814622879028} 11/07/2021 14:59:47 - INFO - __main__ - Step 126158: {'lr': 3.134135710801436e-05, 'samples': 24222336, 'steps': 126157, 'loss/train': 1.2770414352416992} 11/07/2021 14:59:48 - INFO - __main__ - Step 126159: {'lr': 3.1338784538876454e-05, 'samples': 24222528, 'steps': 126158, 'loss/train': 1.4949108362197876} 11/07/2021 14:59:49 - INFO - __main__ - Step 126160: {'lr': 3.133621206826392e-05, 'samples': 24222720, 'steps': 126159, 'loss/train': 0.8484584093093872} 11/07/2021 14:59:49 - INFO - __main__ - Step 126161: {'lr': 3.133363969617789e-05, 'samples': 24222912, 'steps': 126160, 'loss/train': 1.0797386169433594} 11/07/2021 14:59:49 - INFO - __main__ - Step 126162: {'lr': 3.1331067422619566e-05, 'samples': 24223104, 'steps': 126161, 'loss/train': 1.4469550848007202} 11/07/2021 14:59:50 - INFO - __main__ - Step 126163: {'lr': 3.132849524759007e-05, 'samples': 24223296, 'steps': 126162, 'loss/train': 1.5462653636932373} 11/07/2021 14:59:51 - INFO - __main__ - Step 126164: {'lr': 3.132592317109059e-05, 'samples': 24223488, 'steps': 126163, 'loss/train': 1.2632118463516235} 11/07/2021 14:59:51 - INFO - __main__ - Step 126165: {'lr': 3.13233511931223e-05, 'samples': 24223680, 'steps': 126164, 'loss/train': 1.5315091609954834} 11/07/2021 14:59:52 - INFO - __main__ - Step 126166: {'lr': 3.1320779313686295e-05, 'samples': 24223872, 'steps': 126165, 'loss/train': 1.8683552742004395} 11/07/2021 14:59:52 - INFO - __main__ - Step 126167: {'lr': 3.1318207532783805e-05, 'samples': 24224064, 'steps': 126166, 'loss/train': 1.4499902725219727} 11/07/2021 14:59:52 - INFO - __main__ - Step 126168: {'lr': 3.131563585041594e-05, 'samples': 24224256, 'steps': 126167, 'loss/train': 0.9096540212631226} 11/07/2021 14:59:53 - INFO - __main__ - Step 126169: {'lr': 3.1313064266583866e-05, 'samples': 24224448, 'steps': 126168, 'loss/train': 1.2040177583694458} 11/07/2021 14:59:54 - INFO - __main__ - Step 126170: {'lr': 3.131049278128875e-05, 'samples': 24224640, 'steps': 126169, 'loss/train': 1.2936362028121948} 11/07/2021 14:59:54 - INFO - __main__ - Step 126171: {'lr': 3.1307921394531816e-05, 'samples': 24224832, 'steps': 126170, 'loss/train': 1.2798197269439697} 11/07/2021 14:59:54 - INFO - __main__ - Step 126172: {'lr': 3.1305350106314104e-05, 'samples': 24225024, 'steps': 126171, 'loss/train': 1.036184310913086} 11/07/2021 14:59:55 - INFO - __main__ - Step 126173: {'lr': 3.1302778916636824e-05, 'samples': 24225216, 'steps': 126172, 'loss/train': 1.4147847890853882} 11/07/2021 14:59:56 - INFO - __main__ - Step 126174: {'lr': 3.1300207825501134e-05, 'samples': 24225408, 'steps': 126173, 'loss/train': 1.248140811920166} 11/07/2021 14:59:56 - INFO - __main__ - Step 126175: {'lr': 3.129763683290821e-05, 'samples': 24225600, 'steps': 126174, 'loss/train': 0.563103973865509} 11/07/2021 14:59:56 - INFO - __main__ - Step 126176: {'lr': 3.129506593885917e-05, 'samples': 24225792, 'steps': 126175, 'loss/train': 0.951204240322113} 11/07/2021 14:59:57 - INFO - __main__ - Step 126177: {'lr': 3.129249514335522e-05, 'samples': 24225984, 'steps': 126176, 'loss/train': 1.7835925817489624} 11/07/2021 14:59:57 - INFO - __main__ - Step 126178: {'lr': 3.12899244463975e-05, 'samples': 24226176, 'steps': 126177, 'loss/train': 1.4356633424758911} 11/07/2021 14:59:57 - INFO - __main__ - Step 126179: {'lr': 3.1287353847987146e-05, 'samples': 24226368, 'steps': 126178, 'loss/train': 1.346877932548523} 11/07/2021 14:59:58 - INFO - __main__ - Step 126180: {'lr': 3.1284783348125347e-05, 'samples': 24226560, 'steps': 126179, 'loss/train': 1.4236241579055786} 11/07/2021 14:59:59 - INFO - __main__ - Step 126181: {'lr': 3.128221294681324e-05, 'samples': 24226752, 'steps': 126180, 'loss/train': 1.1052517890930176} 11/07/2021 14:59:59 - INFO - __main__ - Step 126182: {'lr': 3.1279642644052004e-05, 'samples': 24226944, 'steps': 126181, 'loss/train': 1.0624819993972778} 11/07/2021 15:00:00 - INFO - __main__ - Step 126183: {'lr': 3.127707243984279e-05, 'samples': 24227136, 'steps': 126182, 'loss/train': 1.1695787906646729} 11/07/2021 15:00:00 - INFO - __main__ - Step 126184: {'lr': 3.12745023341868e-05, 'samples': 24227328, 'steps': 126183, 'loss/train': 1.115105152130127} 11/07/2021 15:00:01 - INFO - __main__ - Step 126185: {'lr': 3.127193232708508e-05, 'samples': 24227520, 'steps': 126184, 'loss/train': 1.1674222946166992} 11/07/2021 15:00:01 - INFO - __main__ - Step 126186: {'lr': 3.1269362418538866e-05, 'samples': 24227712, 'steps': 126185, 'loss/train': 0.8833454251289368} 11/07/2021 15:00:02 - INFO - __main__ - Step 126187: {'lr': 3.126679260854931e-05, 'samples': 24227904, 'steps': 126186, 'loss/train': 1.6461416482925415} 11/07/2021 15:00:02 - INFO - __main__ - Step 126188: {'lr': 3.1264222897117556e-05, 'samples': 24228096, 'steps': 126187, 'loss/train': 1.3232154846191406} 11/07/2021 15:00:02 - INFO - __main__ - Step 126189: {'lr': 3.126165328424474e-05, 'samples': 24228288, 'steps': 126188, 'loss/train': 1.3314353227615356} 11/07/2021 15:00:03 - INFO - __main__ - Step 126190: {'lr': 3.1259083769932085e-05, 'samples': 24228480, 'steps': 126189, 'loss/train': 1.1880983114242554} 11/07/2021 15:00:04 - INFO - __main__ - Step 126191: {'lr': 3.12565143541807e-05, 'samples': 24228672, 'steps': 126190, 'loss/train': 1.368848204612732} 11/07/2021 15:00:04 - INFO - __main__ - Step 126192: {'lr': 3.125394503699175e-05, 'samples': 24228864, 'steps': 126191, 'loss/train': 1.5282855033874512} 11/07/2021 15:00:04 - INFO - __main__ - Step 126193: {'lr': 3.125137581836637e-05, 'samples': 24229056, 'steps': 126192, 'loss/train': 1.2515408992767334} 11/07/2021 15:00:05 - INFO - __main__ - Step 126194: {'lr': 3.1248806698305794e-05, 'samples': 24229248, 'steps': 126193, 'loss/train': 1.2730334997177124} 11/07/2021 15:00:06 - INFO - __main__ - Step 126195: {'lr': 3.124623767681109e-05, 'samples': 24229440, 'steps': 126194, 'loss/train': 1.1523844003677368} 11/07/2021 15:00:06 - INFO - __main__ - Step 126196: {'lr': 3.124366875388349e-05, 'samples': 24229632, 'steps': 126195, 'loss/train': 1.4786032438278198} 11/07/2021 15:00:06 - INFO - __main__ - Step 126197: {'lr': 3.124109992952409e-05, 'samples': 24229824, 'steps': 126196, 'loss/train': 1.6280319690704346} 11/07/2021 15:00:07 - INFO - __main__ - Step 126198: {'lr': 3.1238531203734125e-05, 'samples': 24230016, 'steps': 126197, 'loss/train': 1.151090145111084} 11/07/2021 15:00:07 - INFO - __main__ - Step 126199: {'lr': 3.123596257651467e-05, 'samples': 24230208, 'steps': 126198, 'loss/train': 0.46435073018074036} 11/07/2021 15:00:08 - INFO - __main__ - Step 126200: {'lr': 3.1233394047866904e-05, 'samples': 24230400, 'steps': 126199, 'loss/train': 0.7058010697364807} 11/07/2021 15:00:09 - INFO - __main__ - Step 126201: {'lr': 3.1230825617792006e-05, 'samples': 24230592, 'steps': 126200, 'loss/train': 1.1964362859725952} 11/07/2021 15:00:09 - INFO - __main__ - Step 126202: {'lr': 3.1228257286291115e-05, 'samples': 24230784, 'steps': 126201, 'loss/train': 1.7342556715011597} 11/07/2021 15:00:09 - INFO - __main__ - Step 126203: {'lr': 3.12256890533654e-05, 'samples': 24230976, 'steps': 126202, 'loss/train': 1.3742146492004395} 11/07/2021 15:00:10 - INFO - __main__ - Step 126204: {'lr': 3.122312091901599e-05, 'samples': 24231168, 'steps': 126203, 'loss/train': 0.44682633876800537} 11/07/2021 15:00:11 - INFO - __main__ - Step 126205: {'lr': 3.12205528832441e-05, 'samples': 24231360, 'steps': 126204, 'loss/train': 0.7522687911987305} 11/07/2021 15:00:11 - INFO - __main__ - Step 126206: {'lr': 3.121798494605083e-05, 'samples': 24231552, 'steps': 126205, 'loss/train': 1.2496248483657837} 11/07/2021 15:00:11 - INFO - __main__ - Step 126207: {'lr': 3.121541710743736e-05, 'samples': 24231744, 'steps': 126206, 'loss/train': 1.7706576585769653} 11/07/2021 15:00:12 - INFO - __main__ - Step 126208: {'lr': 3.121284936740487e-05, 'samples': 24231936, 'steps': 126207, 'loss/train': 1.2152624130249023} 11/07/2021 15:00:12 - INFO - __main__ - Step 126209: {'lr': 3.121028172595447e-05, 'samples': 24232128, 'steps': 126208, 'loss/train': 0.17122849822044373} 11/07/2021 15:00:13 - INFO - __main__ - Step 126210: {'lr': 3.120771418308735e-05, 'samples': 24232320, 'steps': 126209, 'loss/train': 1.5783509016036987} 11/07/2021 15:00:14 - INFO - __main__ - Step 126211: {'lr': 3.1205146738804705e-05, 'samples': 24232512, 'steps': 126210, 'loss/train': 1.0054670572280884} 11/07/2021 15:00:14 - INFO - __main__ - Step 126212: {'lr': 3.1202579393107585e-05, 'samples': 24232704, 'steps': 126211, 'loss/train': 1.2491505146026611} 11/07/2021 15:00:14 - INFO - __main__ - Step 126213: {'lr': 3.120001214599724e-05, 'samples': 24232896, 'steps': 126212, 'loss/train': 1.2835643291473389} 11/07/2021 15:00:15 - INFO - __main__ - Step 126214: {'lr': 3.119744499747476e-05, 'samples': 24233088, 'steps': 126213, 'loss/train': 1.211684226989746} 11/07/2021 15:00:16 - INFO - __main__ - Step 126215: {'lr': 3.1194877947541334e-05, 'samples': 24233280, 'steps': 126214, 'loss/train': 1.3336421251296997} 11/07/2021 15:00:16 - INFO - __main__ - Step 126216: {'lr': 3.1192310996198157e-05, 'samples': 24233472, 'steps': 126215, 'loss/train': 1.2944669723510742} 11/07/2021 15:00:16 - INFO - __main__ - Step 126217: {'lr': 3.11897441434463e-05, 'samples': 24233664, 'steps': 126216, 'loss/train': 5.6575927734375} 11/07/2021 15:00:17 - INFO - __main__ - Step 126218: {'lr': 3.1187177389287e-05, 'samples': 24233856, 'steps': 126217, 'loss/train': 1.2331876754760742} 11/07/2021 15:00:17 - INFO - __main__ - Step 126219: {'lr': 3.1184610733721366e-05, 'samples': 24234048, 'steps': 126218, 'loss/train': 1.111513614654541} 11/07/2021 15:00:18 - INFO - __main__ - Step 126220: {'lr': 3.1182044176750576e-05, 'samples': 24234240, 'steps': 126219, 'loss/train': 1.2166723012924194} 11/07/2021 15:00:19 - INFO - __main__ - Step 126221: {'lr': 3.117947771837579e-05, 'samples': 24234432, 'steps': 126220, 'loss/train': 1.3979320526123047} 11/07/2021 15:00:19 - INFO - __main__ - Step 126222: {'lr': 3.117691135859813e-05, 'samples': 24234624, 'steps': 126221, 'loss/train': 1.5825822353363037} 11/07/2021 15:00:20 - INFO - __main__ - Step 126223: {'lr': 3.117434509741879e-05, 'samples': 24234816, 'steps': 126222, 'loss/train': 0.8709682822227478} 11/07/2021 15:00:20 - INFO - __main__ - Step 126224: {'lr': 3.117177893483897e-05, 'samples': 24235008, 'steps': 126223, 'loss/train': 1.7305699586868286} 11/07/2021 15:00:20 - INFO - __main__ - Step 126225: {'lr': 3.116921287085972e-05, 'samples': 24235200, 'steps': 126224, 'loss/train': 1.676514983177185} 11/07/2021 15:00:21 - INFO - __main__ - Step 126226: {'lr': 3.116664690548224e-05, 'samples': 24235392, 'steps': 126225, 'loss/train': 1.4228962659835815} 11/07/2021 15:00:22 - INFO - __main__ - Step 126227: {'lr': 3.116408103870769e-05, 'samples': 24235584, 'steps': 126226, 'loss/train': 1.4123162031173706} 11/07/2021 15:00:22 - INFO - __main__ - Step 126228: {'lr': 3.116151527053723e-05, 'samples': 24235776, 'steps': 126227, 'loss/train': 1.4177038669586182} 11/07/2021 15:00:23 - INFO - __main__ - Step 126229: {'lr': 3.1158949600972015e-05, 'samples': 24235968, 'steps': 126228, 'loss/train': 1.6027421951293945} 11/07/2021 15:00:23 - INFO - __main__ - Step 126230: {'lr': 3.11563840300132e-05, 'samples': 24236160, 'steps': 126229, 'loss/train': 1.0199297666549683} 11/07/2021 15:00:23 - INFO - __main__ - Step 126231: {'lr': 3.1153818557661944e-05, 'samples': 24236352, 'steps': 126230, 'loss/train': 1.2502597570419312} 11/07/2021 15:00:24 - INFO - __main__ - Step 126232: {'lr': 3.11512531839194e-05, 'samples': 24236544, 'steps': 126231, 'loss/train': 1.042514681816101} 11/07/2021 15:00:25 - INFO - __main__ - Step 126233: {'lr': 3.1148687908786724e-05, 'samples': 24236736, 'steps': 126232, 'loss/train': 1.3397468328475952} 11/07/2021 15:00:25 - INFO - __main__ - Step 126234: {'lr': 3.114612273226508e-05, 'samples': 24236928, 'steps': 126233, 'loss/train': 1.1669812202453613} 11/07/2021 15:00:25 - INFO - __main__ - Step 126235: {'lr': 3.1143557654355585e-05, 'samples': 24237120, 'steps': 126234, 'loss/train': 1.5270212888717651} 11/07/2021 15:00:26 - INFO - __main__ - Step 126236: {'lr': 3.114099267505946e-05, 'samples': 24237312, 'steps': 126235, 'loss/train': 1.7050856351852417} 11/07/2021 15:00:27 - INFO - __main__ - Step 126237: {'lr': 3.113842779437781e-05, 'samples': 24237504, 'steps': 126236, 'loss/train': 1.1354871988296509} 11/07/2021 15:00:27 - INFO - __main__ - Step 126238: {'lr': 3.113586301231186e-05, 'samples': 24237696, 'steps': 126237, 'loss/train': 1.466529369354248} 11/07/2021 15:00:28 - INFO - __main__ - Step 126239: {'lr': 3.1133298328862666e-05, 'samples': 24237888, 'steps': 126238, 'loss/train': 1.227446436882019} 11/07/2021 15:00:28 - INFO - __main__ - Step 126240: {'lr': 3.1130733744031444e-05, 'samples': 24238080, 'steps': 126239, 'loss/train': 0.6683287024497986} 11/07/2021 15:00:28 - INFO - __main__ - Step 126241: {'lr': 3.1128169257819305e-05, 'samples': 24238272, 'steps': 126240, 'loss/train': 1.0951735973358154} 11/07/2021 15:00:29 - INFO - __main__ - Step 126242: {'lr': 3.112560487022745e-05, 'samples': 24238464, 'steps': 126241, 'loss/train': 3.241873025894165} 11/07/2021 15:00:30 - INFO - __main__ - Step 126243: {'lr': 3.112304058125704e-05, 'samples': 24238656, 'steps': 126242, 'loss/train': 1.3748607635498047} 11/07/2021 15:00:30 - INFO - __main__ - Step 126244: {'lr': 3.112047639090918e-05, 'samples': 24238848, 'steps': 126243, 'loss/train': 1.3566128015518188} 11/07/2021 15:00:31 - INFO - __main__ - Step 126245: {'lr': 3.111791229918506e-05, 'samples': 24239040, 'steps': 126244, 'loss/train': 1.623793363571167} 11/07/2021 15:00:31 - INFO - __main__ - Step 126246: {'lr': 3.111534830608584e-05, 'samples': 24239232, 'steps': 126245, 'loss/train': 1.1757285594940186} 11/07/2021 15:00:31 - INFO - __main__ - Step 126247: {'lr': 3.1112784411612667e-05, 'samples': 24239424, 'steps': 126246, 'loss/train': 1.1445071697235107} 11/07/2021 15:00:32 - INFO - __main__ - Step 126248: {'lr': 3.111022061576671e-05, 'samples': 24239616, 'steps': 126247, 'loss/train': 0.8338703513145447} 11/07/2021 15:00:33 - INFO - __main__ - Step 126249: {'lr': 3.1107656918549084e-05, 'samples': 24239808, 'steps': 126248, 'loss/train': 1.1820560693740845} 11/07/2021 15:00:33 - INFO - __main__ - Step 126250: {'lr': 3.110509331996103e-05, 'samples': 24240000, 'steps': 126249, 'loss/train': 1.460058569908142} 11/07/2021 15:00:33 - INFO - __main__ - Step 126251: {'lr': 3.1102529820003586e-05, 'samples': 24240192, 'steps': 126250, 'loss/train': 1.6699955463409424} 11/07/2021 15:00:34 - INFO - __main__ - Step 126252: {'lr': 3.109996641867799e-05, 'samples': 24240384, 'steps': 126251, 'loss/train': 0.9207051992416382} 11/07/2021 15:00:35 - INFO - __main__ - Step 126253: {'lr': 3.1097403115985326e-05, 'samples': 24240576, 'steps': 126252, 'loss/train': 0.8306469917297363} 11/07/2021 15:00:35 - INFO - __main__ - Step 126254: {'lr': 3.1094839911926824e-05, 'samples': 24240768, 'steps': 126253, 'loss/train': 0.8279229402542114} 11/07/2021 15:00:35 - INFO - __main__ - Step 126255: {'lr': 3.1092276806503615e-05, 'samples': 24240960, 'steps': 126254, 'loss/train': 1.2406593561172485} 11/07/2021 15:00:36 - INFO - __main__ - Step 126256: {'lr': 3.108971379971684e-05, 'samples': 24241152, 'steps': 126255, 'loss/train': 1.258965253829956} 11/07/2021 15:00:36 - INFO - __main__ - Step 126257: {'lr': 3.108715089156766e-05, 'samples': 24241344, 'steps': 126256, 'loss/train': 1.3229097127914429} 11/07/2021 15:00:37 - INFO - __main__ - Step 126258: {'lr': 3.108458808205725e-05, 'samples': 24241536, 'steps': 126257, 'loss/train': 1.2159663438796997} 11/07/2021 15:00:37 - INFO - __main__ - Step 126259: {'lr': 3.1082025371186704e-05, 'samples': 24241728, 'steps': 126258, 'loss/train': 1.4497431516647339} 11/07/2021 15:00:38 - INFO - __main__ - Step 126260: {'lr': 3.1079462758957264e-05, 'samples': 24241920, 'steps': 126259, 'loss/train': 1.4198639392852783} 11/07/2021 15:00:38 - INFO - __main__ - Step 126261: {'lr': 3.107690024537008e-05, 'samples': 24242112, 'steps': 126260, 'loss/train': 1.488517165184021} 11/07/2021 15:00:38 - INFO - __main__ - Step 126262: {'lr': 3.107433783042618e-05, 'samples': 24242304, 'steps': 126261, 'loss/train': 1.4996209144592285} 11/07/2021 15:00:39 - INFO - __main__ - Step 126263: {'lr': 3.107177551412685e-05, 'samples': 24242496, 'steps': 126262, 'loss/train': 1.6529954671859741} 11/07/2021 15:00:40 - INFO - __main__ - Step 126264: {'lr': 3.1069213296473166e-05, 'samples': 24242688, 'steps': 126263, 'loss/train': 1.1243711709976196} 11/07/2021 15:00:40 - INFO - __main__ - Step 126265: {'lr': 3.106665117746635e-05, 'samples': 24242880, 'steps': 126264, 'loss/train': 1.6784641742706299} 11/07/2021 15:00:41 - INFO - __main__ - Step 126266: {'lr': 3.1064089157107484e-05, 'samples': 24243072, 'steps': 126265, 'loss/train': 1.2100622653961182} 11/07/2021 15:00:41 - INFO - __main__ - Step 126267: {'lr': 3.106152723539779e-05, 'samples': 24243264, 'steps': 126266, 'loss/train': 1.2287099361419678} 11/07/2021 15:00:42 - INFO - __main__ - Step 126268: {'lr': 3.105896541233838e-05, 'samples': 24243456, 'steps': 126267, 'loss/train': 1.5097599029541016} 11/07/2021 15:00:42 - INFO - __main__ - Step 126269: {'lr': 3.1056403687930444e-05, 'samples': 24243648, 'steps': 126268, 'loss/train': 1.3618416786193848} 11/07/2021 15:00:43 - INFO - __main__ - Step 126270: {'lr': 3.10538420621751e-05, 'samples': 24243840, 'steps': 126269, 'loss/train': 1.4752663373947144} 11/07/2021 15:00:43 - INFO - __main__ - Step 126271: {'lr': 3.10512805350735e-05, 'samples': 24244032, 'steps': 126270, 'loss/train': 1.204845666885376} 11/07/2021 15:00:43 - INFO - __main__ - Step 126272: {'lr': 3.104871910662688e-05, 'samples': 24244224, 'steps': 126271, 'loss/train': 1.2281231880187988} 11/07/2021 15:00:45 - INFO - __main__ - Step 126273: {'lr': 3.104615777683628e-05, 'samples': 24244416, 'steps': 126272, 'loss/train': 1.7080645561218262} 11/07/2021 15:00:45 - INFO - __main__ - Step 126274: {'lr': 3.1043596545702905e-05, 'samples': 24244608, 'steps': 126273, 'loss/train': 1.2636798620224} 11/07/2021 15:00:45 - INFO - __main__ - Step 126275: {'lr': 3.104103541322789e-05, 'samples': 24244800, 'steps': 126274, 'loss/train': 1.5229814052581787} 11/07/2021 15:00:46 - INFO - __main__ - Step 126276: {'lr': 3.103847437941243e-05, 'samples': 24244992, 'steps': 126275, 'loss/train': 1.063579797744751} 11/07/2021 15:00:46 - INFO - __main__ - Step 126277: {'lr': 3.103591344425763e-05, 'samples': 24245184, 'steps': 126276, 'loss/train': 0.13812285661697388} 11/07/2021 15:00:46 - INFO - __main__ - Step 126278: {'lr': 3.103335260776469e-05, 'samples': 24245376, 'steps': 126277, 'loss/train': 0.12201057374477386} 11/07/2021 15:00:48 - INFO - __main__ - Step 126279: {'lr': 3.103079186993471e-05, 'samples': 24245568, 'steps': 126278, 'loss/train': 1.468117356300354} 11/07/2021 15:00:48 - INFO - __main__ - Step 126280: {'lr': 3.1028231230768896e-05, 'samples': 24245760, 'steps': 126279, 'loss/train': 1.2680073976516724} 11/07/2021 15:00:48 - INFO - __main__ - Step 126281: {'lr': 3.10256706902684e-05, 'samples': 24245952, 'steps': 126280, 'loss/train': 1.3384389877319336} 11/07/2021 15:00:49 - INFO - __main__ - Step 126282: {'lr': 3.102311024843435e-05, 'samples': 24246144, 'steps': 126281, 'loss/train': 1.6123437881469727} 11/07/2021 15:00:49 - INFO - __main__ - Step 126283: {'lr': 3.102054990526795e-05, 'samples': 24246336, 'steps': 126282, 'loss/train': 0.7099880576133728} 11/07/2021 15:00:50 - INFO - __main__ - Step 126284: {'lr': 3.101798966077024e-05, 'samples': 24246528, 'steps': 126283, 'loss/train': 1.069987416267395} 11/07/2021 15:00:50 - INFO - __main__ - Step 126285: {'lr': 3.1015429514942486e-05, 'samples': 24246720, 'steps': 126284, 'loss/train': 1.0020079612731934} 11/07/2021 15:00:51 - INFO - __main__ - Step 126286: {'lr': 3.101286946778578e-05, 'samples': 24246912, 'steps': 126285, 'loss/train': 1.2674862146377563} 11/07/2021 15:00:51 - INFO - __main__ - Step 126287: {'lr': 3.1010309519301285e-05, 'samples': 24247104, 'steps': 126286, 'loss/train': 1.495667576789856} 11/07/2021 15:00:51 - INFO - __main__ - Step 126288: {'lr': 3.1007749669490185e-05, 'samples': 24247296, 'steps': 126287, 'loss/train': 1.9164572954177856} 11/07/2021 15:00:53 - INFO - __main__ - Step 126289: {'lr': 3.1005189918353605e-05, 'samples': 24247488, 'steps': 126288, 'loss/train': 1.4312384128570557} 11/07/2021 15:00:53 - INFO - __main__ - Step 126290: {'lr': 3.10026302658927e-05, 'samples': 24247680, 'steps': 126289, 'loss/train': 1.2895528078079224} 11/07/2021 15:00:53 - INFO - __main__ - Step 126291: {'lr': 3.100007071210864e-05, 'samples': 24247872, 'steps': 126290, 'loss/train': 0.8355743288993835} 11/07/2021 15:00:54 - INFO - __main__ - Step 126292: {'lr': 3.099751125700256e-05, 'samples': 24248064, 'steps': 126291, 'loss/train': 0.9438901543617249} 11/07/2021 15:00:54 - INFO - __main__ - Step 126293: {'lr': 3.099495190057564e-05, 'samples': 24248256, 'steps': 126292, 'loss/train': 0.4725619852542877} 11/07/2021 15:00:55 - INFO - __main__ - Step 126294: {'lr': 3.099239264282905e-05, 'samples': 24248448, 'steps': 126293, 'loss/train': 1.2534345388412476} 11/07/2021 15:00:55 - INFO - __main__ - Step 126295: {'lr': 3.0989833483763857e-05, 'samples': 24248640, 'steps': 126294, 'loss/train': 0.6302972435951233} 11/07/2021 15:00:56 - INFO - __main__ - Step 126296: {'lr': 3.0987274423381256e-05, 'samples': 24248832, 'steps': 126295, 'loss/train': 0.92026686668396} 11/07/2021 15:00:56 - INFO - __main__ - Step 126297: {'lr': 3.098471546168244e-05, 'samples': 24249024, 'steps': 126296, 'loss/train': 1.3132128715515137} 11/07/2021 15:00:56 - INFO - __main__ - Step 126298: {'lr': 3.098215659866852e-05, 'samples': 24249216, 'steps': 126297, 'loss/train': 1.7559555768966675} 11/07/2021 15:00:58 - INFO - __main__ - Step 126299: {'lr': 3.097959783434065e-05, 'samples': 24249408, 'steps': 126298, 'loss/train': 1.0967223644256592} 11/07/2021 15:00:58 - INFO - __main__ - Step 126300: {'lr': 3.097703916870001e-05, 'samples': 24249600, 'steps': 126299, 'loss/train': 1.5645073652267456} 11/07/2021 15:00:59 - INFO - __main__ - Step 126301: {'lr': 3.097448060174771e-05, 'samples': 24249792, 'steps': 126300, 'loss/train': 1.3708165884017944} 11/07/2021 15:00:59 - INFO - __main__ - Step 126302: {'lr': 3.097192213348496e-05, 'samples': 24249984, 'steps': 126301, 'loss/train': 0.6780251860618591} 11/07/2021 15:00:59 - INFO - __main__ - Step 126303: {'lr': 3.096936376391285e-05, 'samples': 24250176, 'steps': 126302, 'loss/train': 1.2655187845230103} 11/07/2021 15:01:00 - INFO - __main__ - Step 126304: {'lr': 3.096680549303257e-05, 'samples': 24250368, 'steps': 126303, 'loss/train': 1.064221978187561} 11/07/2021 15:01:01 - INFO - __main__ - Step 126305: {'lr': 3.096424732084535e-05, 'samples': 24250560, 'steps': 126304, 'loss/train': 0.05409626662731171} 11/07/2021 15:01:01 - INFO - __main__ - Step 126306: {'lr': 3.096168924735218e-05, 'samples': 24250752, 'steps': 126305, 'loss/train': 0.9849985241889954} 11/07/2021 15:01:02 - INFO - __main__ - Step 126307: {'lr': 3.0959131272554316e-05, 'samples': 24250944, 'steps': 126306, 'loss/train': 1.2506427764892578} 11/07/2021 15:01:02 - INFO - __main__ - Step 126308: {'lr': 3.095657339645286e-05, 'samples': 24251136, 'steps': 126307, 'loss/train': 1.2219265699386597} 11/07/2021 15:01:02 - INFO - __main__ - Step 126309: {'lr': 3.095401561904901e-05, 'samples': 24251328, 'steps': 126308, 'loss/train': 1.226516842842102} 11/07/2021 15:01:03 - INFO - __main__ - Step 126310: {'lr': 3.0951457940343905e-05, 'samples': 24251520, 'steps': 126309, 'loss/train': 1.0273945331573486} 11/07/2021 15:01:04 - INFO - __main__ - Step 126311: {'lr': 3.0948900360338676e-05, 'samples': 24251712, 'steps': 126310, 'loss/train': 0.8663069009780884} 11/07/2021 15:01:04 - INFO - __main__ - Step 126312: {'lr': 3.09463428790345e-05, 'samples': 24251904, 'steps': 126311, 'loss/train': 1.3658709526062012} 11/07/2021 15:01:04 - INFO - __main__ - Step 126313: {'lr': 3.094378549643254e-05, 'samples': 24252096, 'steps': 126312, 'loss/train': 1.4997975826263428} 11/07/2021 15:01:05 - INFO - __main__ - Step 126314: {'lr': 3.094122821253389e-05, 'samples': 24252288, 'steps': 126313, 'loss/train': 0.9913204908370972} 11/07/2021 15:01:06 - INFO - __main__ - Step 126315: {'lr': 3.0938671027339774e-05, 'samples': 24252480, 'steps': 126314, 'loss/train': 1.2934839725494385} 11/07/2021 15:01:06 - INFO - __main__ - Step 126316: {'lr': 3.0936113940851305e-05, 'samples': 24252672, 'steps': 126315, 'loss/train': 1.4461040496826172} 11/07/2021 15:01:06 - INFO - __main__ - Step 126317: {'lr': 3.093355695306965e-05, 'samples': 24252864, 'steps': 126316, 'loss/train': 1.3827935457229614} 11/07/2021 15:01:07 - INFO - __main__ - Step 126318: {'lr': 3.0931000063995934e-05, 'samples': 24253056, 'steps': 126317, 'loss/train': 0.9234599471092224} 11/07/2021 15:01:07 - INFO - __main__ - Step 126319: {'lr': 3.0928443273631396e-05, 'samples': 24253248, 'steps': 126318, 'loss/train': 1.5881900787353516} 11/07/2021 15:01:08 - INFO - __main__ - Step 126320: {'lr': 3.092588658197706e-05, 'samples': 24253440, 'steps': 126319, 'loss/train': 0.9509207606315613} 11/07/2021 15:01:09 - INFO - __main__ - Step 126321: {'lr': 3.092332998903416e-05, 'samples': 24253632, 'steps': 126320, 'loss/train': 1.4131864309310913} 11/07/2021 15:01:09 - INFO - __main__ - Step 126322: {'lr': 3.092077349480379e-05, 'samples': 24253824, 'steps': 126321, 'loss/train': 1.172371745109558} 11/07/2021 15:01:10 - INFO - __main__ - Step 126323: {'lr': 3.091821709928716e-05, 'samples': 24254016, 'steps': 126322, 'loss/train': 1.4716612100601196} 11/07/2021 15:01:10 - INFO - __main__ - Step 126324: {'lr': 3.0915660802485394e-05, 'samples': 24254208, 'steps': 126323, 'loss/train': 0.3779688775539398} 11/07/2021 15:01:11 - INFO - __main__ - Step 126325: {'lr': 3.0913104604399666e-05, 'samples': 24254400, 'steps': 126324, 'loss/train': 0.1155799999833107} 11/07/2021 15:01:11 - INFO - __main__ - Step 126326: {'lr': 3.091054850503111e-05, 'samples': 24254592, 'steps': 126325, 'loss/train': 0.914098858833313} 11/07/2021 15:01:12 - INFO - __main__ - Step 126327: {'lr': 3.090799250438087e-05, 'samples': 24254784, 'steps': 126326, 'loss/train': 0.7180081009864807} 11/07/2021 15:01:12 - INFO - __main__ - Step 126328: {'lr': 3.0905436602450126e-05, 'samples': 24254976, 'steps': 126327, 'loss/train': 1.0922884941101074} 11/07/2021 15:01:12 - INFO - __main__ - Step 126329: {'lr': 3.090288079923997e-05, 'samples': 24255168, 'steps': 126328, 'loss/train': 0.9677011370658875} 11/07/2021 15:01:13 - INFO - __main__ - Step 126330: {'lr': 3.090032509475163e-05, 'samples': 24255360, 'steps': 126329, 'loss/train': 1.1457765102386475} 11/07/2021 15:01:14 - INFO - __main__ - Step 126331: {'lr': 3.089776948898621e-05, 'samples': 24255552, 'steps': 126330, 'loss/train': 1.2994953393936157} 11/07/2021 15:01:14 - INFO - __main__ - Step 126332: {'lr': 3.0895213981944944e-05, 'samples': 24255744, 'steps': 126331, 'loss/train': 0.7470300793647766} 11/07/2021 15:01:14 - INFO - __main__ - Step 126333: {'lr': 3.0892658573628854e-05, 'samples': 24255936, 'steps': 126332, 'loss/train': 1.1353752613067627} 11/07/2021 15:01:15 - INFO - __main__ - Step 126334: {'lr': 3.089010326403913e-05, 'samples': 24256128, 'steps': 126333, 'loss/train': 1.0219447612762451} 11/07/2021 15:01:16 - INFO - __main__ - Step 126335: {'lr': 3.088754805317695e-05, 'samples': 24256320, 'steps': 126334, 'loss/train': 1.3250864744186401} 11/07/2021 15:01:16 - INFO - __main__ - Step 126336: {'lr': 3.088499294104349e-05, 'samples': 24256512, 'steps': 126335, 'loss/train': 0.44856587052345276} 11/07/2021 15:01:17 - INFO - __main__ - Step 126337: {'lr': 3.088243792763984e-05, 'samples': 24256704, 'steps': 126336, 'loss/train': 0.7528207302093506} 11/07/2021 15:01:17 - INFO - __main__ - Step 126338: {'lr': 3.087988301296721e-05, 'samples': 24256896, 'steps': 126337, 'loss/train': 0.9655125141143799} 11/07/2021 15:01:17 - INFO - __main__ - Step 126339: {'lr': 3.087732819702668e-05, 'samples': 24257088, 'steps': 126338, 'loss/train': 0.4168552756309509} 11/07/2021 15:01:18 - INFO - __main__ - Step 126340: {'lr': 3.087477347981948e-05, 'samples': 24257280, 'steps': 126339, 'loss/train': 1.5853462219238281} 11/07/2021 15:01:19 - INFO - __main__ - Step 126341: {'lr': 3.087221886134672e-05, 'samples': 24257472, 'steps': 126340, 'loss/train': 1.383489966392517} 11/07/2021 15:01:19 - INFO - __main__ - Step 126342: {'lr': 3.0869664341609535e-05, 'samples': 24257664, 'steps': 126341, 'loss/train': 1.4472743272781372} 11/07/2021 15:01:19 - INFO - __main__ - Step 126343: {'lr': 3.086710992060912e-05, 'samples': 24257856, 'steps': 126342, 'loss/train': 1.3417322635650635} 11/07/2021 15:01:20 - INFO - __main__ - Step 126344: {'lr': 3.086455559834661e-05, 'samples': 24258048, 'steps': 126343, 'loss/train': 1.4774501323699951} 11/07/2021 15:01:21 - INFO - __main__ - Step 126345: {'lr': 3.08620013748232e-05, 'samples': 24258240, 'steps': 126344, 'loss/train': 1.7820602655410767} 11/07/2021 15:01:21 - INFO - __main__ - Step 126346: {'lr': 3.085944725003992e-05, 'samples': 24258432, 'steps': 126345, 'loss/train': 1.173646092414856} 11/07/2021 15:01:22 - INFO - __main__ - Step 126347: {'lr': 3.085689322399801e-05, 'samples': 24258624, 'steps': 126346, 'loss/train': 0.8094708919525146} 11/07/2021 15:01:22 - INFO - __main__ - Step 126348: {'lr': 3.085433929669859e-05, 'samples': 24258816, 'steps': 126347, 'loss/train': 1.450547695159912} 11/07/2021 15:01:22 - INFO - __main__ - Step 126349: {'lr': 3.0851785468142825e-05, 'samples': 24259008, 'steps': 126348, 'loss/train': 1.2890913486480713} 11/07/2021 15:01:23 - INFO - __main__ - Step 126350: {'lr': 3.0849231738331875e-05, 'samples': 24259200, 'steps': 126349, 'loss/train': 1.054819107055664} 11/07/2021 15:01:24 - INFO - __main__ - Step 126351: {'lr': 3.0846678107266854e-05, 'samples': 24259392, 'steps': 126350, 'loss/train': 0.8985877633094788} 11/07/2021 15:01:24 - INFO - __main__ - Step 126352: {'lr': 3.0844124574948953e-05, 'samples': 24259584, 'steps': 126351, 'loss/train': 1.564237356185913} 11/07/2021 15:01:24 - INFO - __main__ - Step 126353: {'lr': 3.084157114137931e-05, 'samples': 24259776, 'steps': 126352, 'loss/train': 0.03760873153805733} 11/07/2021 15:01:25 - INFO - __main__ - Step 126354: {'lr': 3.083901780655909e-05, 'samples': 24259968, 'steps': 126353, 'loss/train': 1.2053018808364868} 11/07/2021 15:01:25 - INFO - __main__ - Step 126355: {'lr': 3.083646457048941e-05, 'samples': 24260160, 'steps': 126354, 'loss/train': 1.331242322921753} 11/07/2021 15:01:26 - INFO - __main__ - Step 126356: {'lr': 3.0833911433171436e-05, 'samples': 24260352, 'steps': 126355, 'loss/train': 1.4070961475372314} 11/07/2021 15:01:27 - INFO - __main__ - Step 126357: {'lr': 3.083135839460632e-05, 'samples': 24260544, 'steps': 126356, 'loss/train': 0.9195661544799805} 11/07/2021 15:01:27 - INFO - __main__ - Step 126358: {'lr': 3.082880545479519e-05, 'samples': 24260736, 'steps': 126357, 'loss/train': 0.816559374332428} 11/07/2021 15:01:27 - INFO - __main__ - Step 126359: {'lr': 3.0826252613739306e-05, 'samples': 24260928, 'steps': 126358, 'loss/train': 1.4230583906173706} 11/07/2021 15:01:28 - INFO - __main__ - Step 126360: {'lr': 3.082369987143965e-05, 'samples': 24261120, 'steps': 126359, 'loss/train': 2.107151985168457} 11/07/2021 15:01:29 - INFO - __main__ - Step 126361: {'lr': 3.082114722789747e-05, 'samples': 24261312, 'steps': 126360, 'loss/train': 0.5051599144935608} 11/07/2021 15:01:29 - INFO - __main__ - Step 126362: {'lr': 3.0818594683113905e-05, 'samples': 24261504, 'steps': 126361, 'loss/train': 1.297560453414917} 11/07/2021 15:01:29 - INFO - __main__ - Step 126363: {'lr': 3.0816042237090085e-05, 'samples': 24261696, 'steps': 126362, 'loss/train': 0.9974119067192078} 11/07/2021 15:01:30 - INFO - __main__ - Step 126364: {'lr': 3.081348988982718e-05, 'samples': 24261888, 'steps': 126363, 'loss/train': 1.582949161529541} 11/07/2021 15:01:30 - INFO - __main__ - Step 126365: {'lr': 3.0810937641326335e-05, 'samples': 24262080, 'steps': 126364, 'loss/train': 1.484161138534546} 11/07/2021 15:01:31 - INFO - __main__ - Step 126366: {'lr': 3.080838549158871e-05, 'samples': 24262272, 'steps': 126365, 'loss/train': 0.042635973542928696} 11/07/2021 15:01:32 - INFO - __main__ - Step 126367: {'lr': 3.080583344061544e-05, 'samples': 24262464, 'steps': 126366, 'loss/train': 0.9437739849090576} 11/07/2021 15:01:32 - INFO - __main__ - Step 126368: {'lr': 3.0803281488407666e-05, 'samples': 24262656, 'steps': 126367, 'loss/train': 1.332571029663086} 11/07/2021 15:01:32 - INFO - __main__ - Step 126369: {'lr': 3.080072963496655e-05, 'samples': 24262848, 'steps': 126368, 'loss/train': 1.0554090738296509} 11/07/2021 15:01:33 - INFO - __main__ - Step 126370: {'lr': 3.079817788029324e-05, 'samples': 24263040, 'steps': 126369, 'loss/train': 0.6941094398498535} 11/07/2021 15:01:34 - INFO - __main__ - Step 126371: {'lr': 3.079562622438889e-05, 'samples': 24263232, 'steps': 126370, 'loss/train': 0.27209165692329407} 11/07/2021 15:01:34 - INFO - __main__ - Step 126372: {'lr': 3.079307466725473e-05, 'samples': 24263424, 'steps': 126371, 'loss/train': 1.3969640731811523} 11/07/2021 15:01:34 - INFO - __main__ - Step 126373: {'lr': 3.0790523208891754e-05, 'samples': 24263616, 'steps': 126372, 'loss/train': 1.618151307106018} 11/07/2021 15:01:35 - INFO - __main__ - Step 126374: {'lr': 3.0787971849301184e-05, 'samples': 24263808, 'steps': 126373, 'loss/train': 1.2333461046218872} 11/07/2021 15:01:35 - INFO - __main__ - Step 126375: {'lr': 3.0785420588484156e-05, 'samples': 24264000, 'steps': 126374, 'loss/train': 1.1726197004318237} 11/07/2021 15:01:36 - INFO - __main__ - Step 126376: {'lr': 3.0782869426441876e-05, 'samples': 24264192, 'steps': 126375, 'loss/train': 1.1898829936981201} 11/07/2021 15:01:37 - INFO - __main__ - Step 126377: {'lr': 3.078031836317541e-05, 'samples': 24264384, 'steps': 126376, 'loss/train': 1.4658613204956055} 11/07/2021 15:01:37 - INFO - __main__ - Step 126378: {'lr': 3.077776739868596e-05, 'samples': 24264576, 'steps': 126377, 'loss/train': 1.2034324407577515} 11/07/2021 15:01:37 - INFO - __main__ - Step 126379: {'lr': 3.0775216532974686e-05, 'samples': 24264768, 'steps': 126378, 'loss/train': 1.2908695936203003} 11/07/2021 15:01:38 - INFO - __main__ - Step 126380: {'lr': 3.077266576604271e-05, 'samples': 24264960, 'steps': 126379, 'loss/train': 1.5456386804580688} 11/07/2021 15:01:39 - INFO - __main__ - Step 126381: {'lr': 3.0770115097891184e-05, 'samples': 24265152, 'steps': 126380, 'loss/train': 1.4807177782058716} 11/07/2021 15:01:39 - INFO - __main__ - Step 126382: {'lr': 3.076756452852125e-05, 'samples': 24265344, 'steps': 126381, 'loss/train': 1.336440086364746} 11/07/2021 15:01:39 - INFO - __main__ - Step 126383: {'lr': 3.0765014057934085e-05, 'samples': 24265536, 'steps': 126382, 'loss/train': 1.2546311616897583} 11/07/2021 15:01:40 - INFO - __main__ - Step 126384: {'lr': 3.076246368613081e-05, 'samples': 24265728, 'steps': 126383, 'loss/train': 0.8147125244140625} 11/07/2021 15:01:40 - INFO - __main__ - Step 126385: {'lr': 3.075991341311257e-05, 'samples': 24265920, 'steps': 126384, 'loss/train': 1.3668583631515503} 11/07/2021 15:01:41 - INFO - __main__ - Step 126386: {'lr': 3.075736323888062e-05, 'samples': 24266112, 'steps': 126385, 'loss/train': 1.3651924133300781} 11/07/2021 15:01:42 - INFO - __main__ - Step 126387: {'lr': 3.075481316343595e-05, 'samples': 24266304, 'steps': 126386, 'loss/train': 0.8277021050453186} 11/07/2021 15:01:42 - INFO - __main__ - Step 126388: {'lr': 3.075226318677976e-05, 'samples': 24266496, 'steps': 126387, 'loss/train': 1.3219789266586304} 11/07/2021 15:01:42 - INFO - __main__ - Step 126389: {'lr': 3.074971330891324e-05, 'samples': 24266688, 'steps': 126388, 'loss/train': 1.3498128652572632} 11/07/2021 15:01:43 - INFO - __main__ - Step 126390: {'lr': 3.07471635298375e-05, 'samples': 24266880, 'steps': 126389, 'loss/train': 1.08281672000885} 11/07/2021 15:01:44 - INFO - __main__ - Step 126391: {'lr': 3.074461384955371e-05, 'samples': 24267072, 'steps': 126390, 'loss/train': 0.38387057185173035} 11/07/2021 15:01:44 - INFO - __main__ - Step 126392: {'lr': 3.074206426806303e-05, 'samples': 24267264, 'steps': 126391, 'loss/train': 0.9501628279685974} 11/07/2021 15:01:44 - INFO - __main__ - Step 126393: {'lr': 3.0739514785366575e-05, 'samples': 24267456, 'steps': 126392, 'loss/train': 1.2454525232315063} 11/07/2021 15:01:45 - INFO - __main__ - Step 126394: {'lr': 3.0736965401465534e-05, 'samples': 24267648, 'steps': 126393, 'loss/train': 1.079863429069519} 11/07/2021 15:01:45 - INFO - __main__ - Step 126395: {'lr': 3.0734416116360994e-05, 'samples': 24267840, 'steps': 126394, 'loss/train': 0.8105006814002991} 11/07/2021 15:01:46 - INFO - __main__ - Step 126396: {'lr': 3.073186693005417e-05, 'samples': 24268032, 'steps': 126395, 'loss/train': 1.131080150604248} 11/07/2021 15:01:46 - INFO - __main__ - Step 126397: {'lr': 3.072931784254618e-05, 'samples': 24268224, 'steps': 126396, 'loss/train': 1.345014214515686} 11/07/2021 15:01:47 - INFO - __main__ - Step 126398: {'lr': 3.0726768853838184e-05, 'samples': 24268416, 'steps': 126397, 'loss/train': 0.7105435132980347} 11/07/2021 15:01:47 - INFO - __main__ - Step 126399: {'lr': 3.0724219963931346e-05, 'samples': 24268608, 'steps': 126398, 'loss/train': 1.7588489055633545} 11/07/2021 15:01:47 - INFO - __main__ - Step 126400: {'lr': 3.072167117282676e-05, 'samples': 24268800, 'steps': 126399, 'loss/train': 1.2702686786651611} 11/07/2021 15:01:48 - INFO - __main__ - Step 126401: {'lr': 3.071912248052561e-05, 'samples': 24268992, 'steps': 126400, 'loss/train': 0.959148645401001} 11/07/2021 15:01:49 - INFO - __main__ - Step 126402: {'lr': 3.071657388702903e-05, 'samples': 24269184, 'steps': 126401, 'loss/train': 1.1926655769348145} 11/07/2021 15:01:49 - INFO - __main__ - Step 126403: {'lr': 3.0714025392338166e-05, 'samples': 24269376, 'steps': 126402, 'loss/train': 1.1085453033447266} 11/07/2021 15:01:50 - INFO - __main__ - Step 126404: {'lr': 3.0711476996454214e-05, 'samples': 24269568, 'steps': 126403, 'loss/train': 1.1117808818817139} 11/07/2021 15:01:50 - INFO - __main__ - Step 126405: {'lr': 3.070892869937825e-05, 'samples': 24269760, 'steps': 126404, 'loss/train': 0.8268205523490906} 11/07/2021 15:01:50 - INFO - __main__ - Step 126406: {'lr': 3.070638050111146e-05, 'samples': 24269952, 'steps': 126405, 'loss/train': 1.0168139934539795} 11/07/2021 15:01:51 - INFO - __main__ - Step 126407: {'lr': 3.070383240165503e-05, 'samples': 24270144, 'steps': 126406, 'loss/train': 0.5328511595726013} 11/07/2021 15:01:52 - INFO - __main__ - Step 126408: {'lr': 3.070128440101003e-05, 'samples': 24270336, 'steps': 126407, 'loss/train': 1.1762192249298096} 11/07/2021 15:01:52 - INFO - __main__ - Step 126409: {'lr': 3.0698736499177675e-05, 'samples': 24270528, 'steps': 126408, 'loss/train': 1.5080244541168213} 11/07/2021 15:01:52 - INFO - __main__ - Step 126410: {'lr': 3.069618869615906e-05, 'samples': 24270720, 'steps': 126409, 'loss/train': 0.7072645425796509} 11/07/2021 15:01:53 - INFO - __main__ - Step 126411: {'lr': 3.0693640991955375e-05, 'samples': 24270912, 'steps': 126410, 'loss/train': 1.415860652923584} 11/07/2021 15:01:54 - INFO - __main__ - Step 126412: {'lr': 3.0691093386567754e-05, 'samples': 24271104, 'steps': 126411, 'loss/train': 1.259505271911621} 11/07/2021 15:01:54 - INFO - __main__ - Step 126413: {'lr': 3.068854587999737e-05, 'samples': 24271296, 'steps': 126412, 'loss/train': 1.2793188095092773} 11/07/2021 15:01:55 - INFO - __main__ - Step 126414: {'lr': 3.068599847224532e-05, 'samples': 24271488, 'steps': 126413, 'loss/train': 1.4392012357711792} 11/07/2021 15:01:55 - INFO - __main__ - Step 126415: {'lr': 3.0683451163312754e-05, 'samples': 24271680, 'steps': 126414, 'loss/train': 0.8485211133956909} 11/07/2021 15:01:55 - INFO - __main__ - Step 126416: {'lr': 3.068090395320083e-05, 'samples': 24271872, 'steps': 126415, 'loss/train': 1.340899109840393} 11/07/2021 15:01:56 - INFO - __main__ - Step 126417: {'lr': 3.0678356841910757e-05, 'samples': 24272064, 'steps': 126416, 'loss/train': 1.1858500242233276} 11/07/2021 15:01:57 - INFO - __main__ - Step 126418: {'lr': 3.0675809829443596e-05, 'samples': 24272256, 'steps': 126417, 'loss/train': 1.194425344467163} 11/07/2021 15:01:57 - INFO - __main__ - Step 126419: {'lr': 3.067326291580053e-05, 'samples': 24272448, 'steps': 126418, 'loss/train': 1.1758077144622803} 11/07/2021 15:01:57 - INFO - __main__ - Step 126420: {'lr': 3.067071610098271e-05, 'samples': 24272640, 'steps': 126419, 'loss/train': 1.2168124914169312} 11/07/2021 15:01:58 - INFO - __main__ - Step 126421: {'lr': 3.066816938499128e-05, 'samples': 24272832, 'steps': 126420, 'loss/train': 1.2589895725250244} 11/07/2021 15:01:59 - INFO - __main__ - Step 126422: {'lr': 3.066562276782739e-05, 'samples': 24273024, 'steps': 126421, 'loss/train': 0.14172643423080444} 11/07/2021 15:01:59 - INFO - __main__ - Step 126423: {'lr': 3.066307624949219e-05, 'samples': 24273216, 'steps': 126422, 'loss/train': 1.1568233966827393} 11/07/2021 15:01:59 - INFO - __main__ - Step 126424: {'lr': 3.0660529829986824e-05, 'samples': 24273408, 'steps': 126423, 'loss/train': 1.1199586391448975} 11/07/2021 15:02:00 - INFO - __main__ - Step 126425: {'lr': 3.0657983509312424e-05, 'samples': 24273600, 'steps': 126424, 'loss/train': 1.4555360078811646} 11/07/2021 15:02:00 - INFO - __main__ - Step 126426: {'lr': 3.065543728747022e-05, 'samples': 24273792, 'steps': 126425, 'loss/train': 1.2543413639068604} 11/07/2021 15:02:00 - INFO - __main__ - Step 126427: {'lr': 3.065289116446124e-05, 'samples': 24273984, 'steps': 126426, 'loss/train': 1.483027696609497} 11/07/2021 15:02:02 - INFO - __main__ - Step 126428: {'lr': 3.0650345140286664e-05, 'samples': 24274176, 'steps': 126427, 'loss/train': 1.5036895275115967} 11/07/2021 15:02:02 - INFO - __main__ - Step 126429: {'lr': 3.0647799214947674e-05, 'samples': 24274368, 'steps': 126428, 'loss/train': 1.0967676639556885} 11/07/2021 15:02:02 - INFO - __main__ - Step 126430: {'lr': 3.06452533884454e-05, 'samples': 24274560, 'steps': 126429, 'loss/train': 0.6591377854347229} 11/07/2021 15:02:03 - INFO - __main__ - Step 126431: {'lr': 3.0642707660780976e-05, 'samples': 24274752, 'steps': 126430, 'loss/train': 1.3302158117294312} 11/07/2021 15:02:03 - INFO - __main__ - Step 126432: {'lr': 3.064016203195558e-05, 'samples': 24274944, 'steps': 126431, 'loss/train': 0.7885977029800415} 11/07/2021 15:02:04 - INFO - __main__ - Step 126433: {'lr': 3.0637616501970336e-05, 'samples': 24275136, 'steps': 126432, 'loss/train': 1.586549997329712} 11/07/2021 15:02:04 - INFO - __main__ - Step 126434: {'lr': 3.063507107082639e-05, 'samples': 24275328, 'steps': 126433, 'loss/train': 1.283143401145935} 11/07/2021 15:02:05 - INFO - __main__ - Step 126435: {'lr': 3.06325257385249e-05, 'samples': 24275520, 'steps': 126434, 'loss/train': 0.3010762929916382} 11/07/2021 15:02:05 - INFO - __main__ - Step 126436: {'lr': 3.062998050506702e-05, 'samples': 24275712, 'steps': 126435, 'loss/train': 1.2054580450057983} 11/07/2021 15:02:05 - INFO - __main__ - Step 126437: {'lr': 3.062743537045387e-05, 'samples': 24275904, 'steps': 126436, 'loss/train': 0.9202432632446289} 11/07/2021 15:02:06 - INFO - __main__ - Step 126438: {'lr': 3.062489033468663e-05, 'samples': 24276096, 'steps': 126437, 'loss/train': 1.1016017198562622} 11/07/2021 15:02:07 - INFO - __main__ - Step 126439: {'lr': 3.0622345397766424e-05, 'samples': 24276288, 'steps': 126438, 'loss/train': 0.93192058801651} 11/07/2021 15:02:07 - INFO - __main__ - Step 126440: {'lr': 3.061980055969446e-05, 'samples': 24276480, 'steps': 126439, 'loss/train': 0.9798221588134766} 11/07/2021 15:02:07 - INFO - __main__ - Step 126441: {'lr': 3.0617255820471755e-05, 'samples': 24276672, 'steps': 126440, 'loss/train': 1.3932037353515625} 11/07/2021 15:02:08 - INFO - __main__ - Step 126442: {'lr': 3.0614711180099534e-05, 'samples': 24276864, 'steps': 126441, 'loss/train': 1.5376195907592773} 11/07/2021 15:02:08 - INFO - __main__ - Step 126443: {'lr': 3.0612166638578964e-05, 'samples': 24277056, 'steps': 126442, 'loss/train': 1.2675679922103882} 11/07/2021 15:02:09 - INFO - __main__ - Step 126444: {'lr': 3.060962219591115e-05, 'samples': 24277248, 'steps': 126443, 'loss/train': 1.26675283908844} 11/07/2021 15:02:10 - INFO - __main__ - Step 126445: {'lr': 3.060707785209724e-05, 'samples': 24277440, 'steps': 126444, 'loss/train': 1.2892111539840698} 11/07/2021 15:02:10 - INFO - __main__ - Step 126446: {'lr': 3.0604533607138416e-05, 'samples': 24277632, 'steps': 126445, 'loss/train': 1.2120898962020874} 11/07/2021 15:02:10 - INFO - __main__ - Step 126447: {'lr': 3.060198946103579e-05, 'samples': 24277824, 'steps': 126446, 'loss/train': 0.9292708039283752} 11/07/2021 15:02:11 - INFO - __main__ - Step 126448: {'lr': 3.059944541379053e-05, 'samples': 24278016, 'steps': 126447, 'loss/train': 0.9769397974014282} 11/07/2021 15:02:12 - INFO - __main__ - Step 126449: {'lr': 3.059690146540378e-05, 'samples': 24278208, 'steps': 126448, 'loss/train': 0.17064204812049866} 11/07/2021 15:02:12 - INFO - __main__ - Step 126450: {'lr': 3.0594357615876675e-05, 'samples': 24278400, 'steps': 126449, 'loss/train': 1.186723232269287} 11/07/2021 15:02:12 - INFO - __main__ - Step 126451: {'lr': 3.059181386521037e-05, 'samples': 24278592, 'steps': 126450, 'loss/train': 1.2703906297683716} 11/07/2021 15:02:13 - INFO - __main__ - Step 126452: {'lr': 3.058927021340599e-05, 'samples': 24278784, 'steps': 126451, 'loss/train': 5.623509407043457} 11/07/2021 15:02:13 - INFO - __main__ - Step 126453: {'lr': 3.058672666046477e-05, 'samples': 24278976, 'steps': 126452, 'loss/train': 1.8174651861190796} 11/07/2021 15:02:14 - INFO - __main__ - Step 126454: {'lr': 3.058418320638773e-05, 'samples': 24279168, 'steps': 126453, 'loss/train': 1.162872314453125} 11/07/2021 15:02:15 - INFO - __main__ - Step 126455: {'lr': 3.058163985117607e-05, 'samples': 24279360, 'steps': 126454, 'loss/train': 0.040177203714847565} 11/07/2021 15:02:15 - INFO - __main__ - Step 126456: {'lr': 3.0579096594830936e-05, 'samples': 24279552, 'steps': 126455, 'loss/train': 1.243344783782959} 11/07/2021 15:02:15 - INFO - __main__ - Step 126457: {'lr': 3.0576553437353495e-05, 'samples': 24279744, 'steps': 126456, 'loss/train': 1.0118536949157715} 11/07/2021 15:02:16 - INFO - __main__ - Step 126458: {'lr': 3.0574010378744856e-05, 'samples': 24279936, 'steps': 126457, 'loss/train': 0.6518626809120178} 11/07/2021 15:02:16 - INFO - __main__ - Step 126459: {'lr': 3.057146741900615e-05, 'samples': 24280128, 'steps': 126458, 'loss/train': 1.4355820417404175} 11/07/2021 15:02:17 - INFO - __main__ - Step 126460: {'lr': 3.0568924558138615e-05, 'samples': 24280320, 'steps': 126459, 'loss/train': 1.1975486278533936} 11/07/2021 15:02:18 - INFO - __main__ - Step 126461: {'lr': 3.0566381796143297e-05, 'samples': 24280512, 'steps': 126460, 'loss/train': 1.3682430982589722} 11/07/2021 15:02:18 - INFO - __main__ - Step 126462: {'lr': 3.056383913302138e-05, 'samples': 24280704, 'steps': 126461, 'loss/train': 1.409468412399292} 11/07/2021 15:02:18 - INFO - __main__ - Step 126463: {'lr': 3.056129656877404e-05, 'samples': 24280896, 'steps': 126462, 'loss/train': 1.2481720447540283} 11/07/2021 15:02:19 - INFO - __main__ - Step 126464: {'lr': 3.0558754103402364e-05, 'samples': 24281088, 'steps': 126463, 'loss/train': 1.3145878314971924} 11/07/2021 15:02:20 - INFO - __main__ - Step 126465: {'lr': 3.055621173690754e-05, 'samples': 24281280, 'steps': 126464, 'loss/train': 1.1516976356506348} 11/07/2021 15:02:20 - INFO - __main__ - Step 126466: {'lr': 3.055366946929075e-05, 'samples': 24281472, 'steps': 126465, 'loss/train': 0.6572951078414917} 11/07/2021 15:02:20 - INFO - __main__ - Step 126467: {'lr': 3.055112730055304e-05, 'samples': 24281664, 'steps': 126466, 'loss/train': 1.1442644596099854} 11/07/2021 15:02:21 - INFO - __main__ - Step 126468: {'lr': 3.0548585230695617e-05, 'samples': 24281856, 'steps': 126467, 'loss/train': 1.0493924617767334} 11/07/2021 15:02:21 - INFO - __main__ - Step 126469: {'lr': 3.05460432597196e-05, 'samples': 24282048, 'steps': 126468, 'loss/train': 1.3386722803115845} 11/07/2021 15:02:22 - INFO - __main__ - Step 126470: {'lr': 3.0543501387626155e-05, 'samples': 24282240, 'steps': 126469, 'loss/train': 1.161922812461853} 11/07/2021 15:02:22 - INFO - __main__ - Step 126471: {'lr': 3.0540959614416415e-05, 'samples': 24282432, 'steps': 126470, 'loss/train': 0.9951832890510559} 11/07/2021 15:02:23 - INFO - __main__ - Step 126472: {'lr': 3.053841794009154e-05, 'samples': 24282624, 'steps': 126471, 'loss/train': 1.4558653831481934} 11/07/2021 15:02:23 - INFO - __main__ - Step 126473: {'lr': 3.053587636465266e-05, 'samples': 24282816, 'steps': 126472, 'loss/train': 0.7705415487289429} 11/07/2021 15:02:24 - INFO - __main__ - Step 126474: {'lr': 3.053333488810092e-05, 'samples': 24283008, 'steps': 126473, 'loss/train': 1.3437726497650146} 11/07/2021 15:02:24 - INFO - __main__ - Step 126475: {'lr': 3.05307935104375e-05, 'samples': 24283200, 'steps': 126474, 'loss/train': 0.881048321723938} 11/07/2021 15:02:25 - INFO - __main__ - Step 126476: {'lr': 3.0528252231663503e-05, 'samples': 24283392, 'steps': 126475, 'loss/train': 1.459252119064331} 11/07/2021 15:02:25 - INFO - __main__ - Step 126477: {'lr': 3.0525711051780095e-05, 'samples': 24283584, 'steps': 126476, 'loss/train': 0.5946940183639526} 11/07/2021 15:02:26 - INFO - __main__ - Step 126478: {'lr': 3.052316997078841e-05, 'samples': 24283776, 'steps': 126477, 'loss/train': 1.2892520427703857} 11/07/2021 15:02:26 - INFO - __main__ - Step 126479: {'lr': 3.0520628988689596e-05, 'samples': 24283968, 'steps': 126478, 'loss/train': 1.2198176383972168} 11/07/2021 15:02:27 - INFO - __main__ - Step 126480: {'lr': 3.0518088105484844e-05, 'samples': 24284160, 'steps': 126479, 'loss/train': 1.0416386127471924} 11/07/2021 15:02:27 - INFO - __main__ - Step 126481: {'lr': 3.0515547321175203e-05, 'samples': 24284352, 'steps': 126480, 'loss/train': 0.9637975692749023} 11/07/2021 15:02:28 - INFO - __main__ - Step 126482: {'lr': 3.051300663576187e-05, 'samples': 24284544, 'steps': 126481, 'loss/train': 0.9278624653816223} 11/07/2021 15:02:28 - INFO - __main__ - Step 126483: {'lr': 3.051046604924601e-05, 'samples': 24284736, 'steps': 126482, 'loss/train': 1.1695315837860107} 11/07/2021 15:02:28 - INFO - __main__ - Step 126484: {'lr': 3.0507925561628734e-05, 'samples': 24284928, 'steps': 126483, 'loss/train': 0.5963159203529358} 11/07/2021 15:02:30 - INFO - __main__ - Step 126485: {'lr': 3.050538517291121e-05, 'samples': 24285120, 'steps': 126484, 'loss/train': 1.6894032955169678} 11/07/2021 15:02:30 - INFO - __main__ - Step 126486: {'lr': 3.0502844883094544e-05, 'samples': 24285312, 'steps': 126485, 'loss/train': 1.53275465965271} 11/07/2021 15:02:30 - INFO - __main__ - Step 126487: {'lr': 3.050030469217993e-05, 'samples': 24285504, 'steps': 126486, 'loss/train': 0.15338365733623505} 11/07/2021 15:02:31 - INFO - __main__ - Step 126488: {'lr': 3.049776460016848e-05, 'samples': 24285696, 'steps': 126487, 'loss/train': 1.34798002243042} 11/07/2021 15:02:31 - INFO - __main__ - Step 126489: {'lr': 3.0495224607061362e-05, 'samples': 24285888, 'steps': 126488, 'loss/train': 1.471721887588501} 11/07/2021 15:02:32 - INFO - __main__ - Step 126490: {'lr': 3.049268471285971e-05, 'samples': 24286080, 'steps': 126489, 'loss/train': 1.1558661460876465} 11/07/2021 15:02:32 - INFO - __main__ - Step 126491: {'lr': 3.049014491756466e-05, 'samples': 24286272, 'steps': 126490, 'loss/train': 0.9474048614501953} 11/07/2021 15:02:33 - INFO - __main__ - Step 126492: {'lr': 3.0487605221177385e-05, 'samples': 24286464, 'steps': 126491, 'loss/train': 1.504366159439087} 11/07/2021 15:02:33 - INFO - __main__ - Step 126493: {'lr': 3.0485065623699044e-05, 'samples': 24286656, 'steps': 126492, 'loss/train': 0.714717447757721} 11/07/2021 15:02:33 - INFO - __main__ - Step 126494: {'lr': 3.0482526125130666e-05, 'samples': 24286848, 'steps': 126493, 'loss/train': 0.1460999697446823} 11/07/2021 15:02:34 - INFO - __main__ - Step 126495: {'lr': 3.0479986725473502e-05, 'samples': 24287040, 'steps': 126494, 'loss/train': 0.27910226583480835} 11/07/2021 15:02:35 - INFO - __main__ - Step 126496: {'lr': 3.047744742472866e-05, 'samples': 24287232, 'steps': 126495, 'loss/train': 1.2416677474975586} 11/07/2021 15:02:35 - INFO - __main__ - Step 126497: {'lr': 3.0474908222897307e-05, 'samples': 24287424, 'steps': 126496, 'loss/train': 1.2715415954589844} 11/07/2021 15:02:36 - INFO - __main__ - Step 126498: {'lr': 3.0472369119980552e-05, 'samples': 24287616, 'steps': 126497, 'loss/train': 1.4655859470367432} 11/07/2021 15:02:36 - INFO - __main__ - Step 126499: {'lr': 3.046983011597959e-05, 'samples': 24287808, 'steps': 126498, 'loss/train': 1.333992600440979} 11/07/2021 15:02:36 - INFO - __main__ - Step 126500: {'lr': 3.0467291210895504e-05, 'samples': 24288000, 'steps': 126499, 'loss/train': 1.0659558773040771} 11/07/2021 15:02:37 - INFO - __main__ - Step 126501: {'lr': 3.0464752404729485e-05, 'samples': 24288192, 'steps': 126500, 'loss/train': 1.3727097511291504} 11/07/2021 15:02:38 - INFO - __main__ - Step 126502: {'lr': 3.046221369748267e-05, 'samples': 24288384, 'steps': 126501, 'loss/train': 1.2545092105865479} 11/07/2021 15:02:38 - INFO - __main__ - Step 126503: {'lr': 3.0459675089156175e-05, 'samples': 24288576, 'steps': 126502, 'loss/train': 1.3978012800216675} 11/07/2021 15:02:38 - INFO - __main__ - Step 126504: {'lr': 3.045713657975116e-05, 'samples': 24288768, 'steps': 126503, 'loss/train': 1.3546689748764038} 11/07/2021 15:02:39 - INFO - __main__ - Step 126505: {'lr': 3.0454598169268793e-05, 'samples': 24288960, 'steps': 126504, 'loss/train': 1.2610588073730469} 11/07/2021 15:02:40 - INFO - __main__ - Step 126506: {'lr': 3.0452059857710184e-05, 'samples': 24289152, 'steps': 126505, 'loss/train': 1.0640133619308472} 11/07/2021 15:02:40 - INFO - __main__ - Step 126507: {'lr': 3.0449521645076526e-05, 'samples': 24289344, 'steps': 126506, 'loss/train': 0.9953116774559021} 11/07/2021 15:02:40 - INFO - __main__ - Step 126508: {'lr': 3.0446983531368906e-05, 'samples': 24289536, 'steps': 126507, 'loss/train': 1.1374201774597168} 11/07/2021 15:02:41 - INFO - __main__ - Step 126509: {'lr': 3.0444445516588453e-05, 'samples': 24289728, 'steps': 126508, 'loss/train': 1.4135818481445312} 11/07/2021 15:02:41 - INFO - __main__ - Step 126510: {'lr': 3.044190760073637e-05, 'samples': 24289920, 'steps': 126509, 'loss/train': 1.4309669733047485} 11/07/2021 15:02:43 - INFO - __main__ - Step 126511: {'lr': 3.043936978381376e-05, 'samples': 24290112, 'steps': 126510, 'loss/train': 1.3077963590621948} 11/07/2021 15:02:43 - INFO - __main__ - Step 126512: {'lr': 3.043683206582179e-05, 'samples': 24290304, 'steps': 126511, 'loss/train': 0.7080228328704834} 11/07/2021 15:02:43 - INFO - __main__ - Step 126513: {'lr': 3.04342944467616e-05, 'samples': 24290496, 'steps': 126512, 'loss/train': 1.3585671186447144} 11/07/2021 15:02:44 - INFO - __main__ - Step 126514: {'lr': 3.043175692663433e-05, 'samples': 24290688, 'steps': 126513, 'loss/train': 1.0900588035583496} 11/07/2021 15:02:44 - INFO - __main__ - Step 126515: {'lr': 3.0429219505441113e-05, 'samples': 24290880, 'steps': 126514, 'loss/train': 1.3723280429840088} 11/07/2021 15:02:45 - INFO - __main__ - Step 126516: {'lr': 3.042668218318309e-05, 'samples': 24291072, 'steps': 126515, 'loss/train': 0.0827801525592804} 11/07/2021 15:02:45 - INFO - __main__ - Step 126517: {'lr': 3.0424144959861428e-05, 'samples': 24291264, 'steps': 126516, 'loss/train': 1.3142012357711792} 11/07/2021 15:02:46 - INFO - __main__ - Step 126518: {'lr': 3.0421607835477262e-05, 'samples': 24291456, 'steps': 126517, 'loss/train': 1.1409677267074585} 11/07/2021 15:02:46 - INFO - __main__ - Step 126519: {'lr': 3.0419070810031786e-05, 'samples': 24291648, 'steps': 126518, 'loss/train': 1.3178690671920776} 11/07/2021 15:02:46 - INFO - __main__ - Step 126520: {'lr': 3.0416533883526026e-05, 'samples': 24291840, 'steps': 126519, 'loss/train': 1.3005826473236084} 11/07/2021 15:02:47 - INFO - __main__ - Step 126521: {'lr': 3.0413997055961206e-05, 'samples': 24292032, 'steps': 126520, 'loss/train': 0.8920913338661194} 11/07/2021 15:02:48 - INFO - __main__ - Step 126522: {'lr': 3.0411460327338436e-05, 'samples': 24292224, 'steps': 126521, 'loss/train': 1.5660649538040161} 11/07/2021 15:02:48 - INFO - __main__ - Step 126523: {'lr': 3.040892369765888e-05, 'samples': 24292416, 'steps': 126522, 'loss/train': 1.0423918962478638} 11/07/2021 15:02:49 - INFO - __main__ - Step 126524: {'lr': 3.040638716692365e-05, 'samples': 24292608, 'steps': 126523, 'loss/train': 0.03611854836344719} 11/07/2021 15:02:49 - INFO - __main__ - Step 126525: {'lr': 3.0403850735133937e-05, 'samples': 24292800, 'steps': 126524, 'loss/train': 1.300540566444397} 11/07/2021 15:02:50 - INFO - __main__ - Step 126526: {'lr': 3.0401314402290853e-05, 'samples': 24292992, 'steps': 126525, 'loss/train': 1.1344761848449707} 11/07/2021 15:02:50 - INFO - __main__ - Step 126527: {'lr': 3.039877816839556e-05, 'samples': 24293184, 'steps': 126526, 'loss/train': 0.8923770785331726} 11/07/2021 15:02:51 - INFO - __main__ - Step 126528: {'lr': 3.039624203344918e-05, 'samples': 24293376, 'steps': 126527, 'loss/train': 1.0725985765457153} 11/07/2021 15:02:51 - INFO - __main__ - Step 126529: {'lr': 3.0393705997452863e-05, 'samples': 24293568, 'steps': 126528, 'loss/train': 1.1169700622558594} 11/07/2021 15:02:51 - INFO - __main__ - Step 126530: {'lr': 3.0391170060407814e-05, 'samples': 24293760, 'steps': 126529, 'loss/train': 1.043250322341919} 11/07/2021 15:02:52 - INFO - __main__ - Step 126531: {'lr': 3.0388634222315054e-05, 'samples': 24293952, 'steps': 126530, 'loss/train': 1.3017752170562744} 11/07/2021 15:02:53 - INFO - __main__ - Step 126532: {'lr': 3.0386098483175777e-05, 'samples': 24294144, 'steps': 126531, 'loss/train': 1.340928077697754} 11/07/2021 15:02:54 - INFO - __main__ - Step 126533: {'lr': 3.038356284299115e-05, 'samples': 24294336, 'steps': 126532, 'loss/train': 1.1536465883255005} 11/07/2021 15:02:54 - INFO - __main__ - Step 126534: {'lr': 3.0381027301762287e-05, 'samples': 24294528, 'steps': 126533, 'loss/train': 1.1922959089279175} 11/07/2021 15:02:54 - INFO - __main__ - Step 126535: {'lr': 3.0378491859490375e-05, 'samples': 24294720, 'steps': 126534, 'loss/train': 1.3694392442703247} 11/07/2021 15:02:55 - INFO - __main__ - Step 126536: {'lr': 3.03759565161765e-05, 'samples': 24294912, 'steps': 126535, 'loss/train': 1.3890968561172485} 11/07/2021 15:02:56 - INFO - __main__ - Step 126537: {'lr': 3.0373421271821828e-05, 'samples': 24295104, 'steps': 126536, 'loss/train': 0.33677175641059875} 11/07/2021 15:02:56 - INFO - __main__ - Step 126538: {'lr': 3.037088612642752e-05, 'samples': 24295296, 'steps': 126537, 'loss/train': 1.0876349210739136} 11/07/2021 15:02:57 - INFO - __main__ - Step 126539: {'lr': 3.0368351079994694e-05, 'samples': 24295488, 'steps': 126538, 'loss/train': 0.9623526334762573} 11/07/2021 15:02:57 - INFO - __main__ - Step 126540: {'lr': 3.036581613252448e-05, 'samples': 24295680, 'steps': 126539, 'loss/train': 0.6391281485557556} 11/07/2021 15:02:57 - INFO - __main__ - Step 126541: {'lr': 3.0363281284018136e-05, 'samples': 24295872, 'steps': 126540, 'loss/train': 1.207403540611267} 11/07/2021 15:02:58 - INFO - __main__ - Step 126542: {'lr': 3.0360746534476626e-05, 'samples': 24296064, 'steps': 126541, 'loss/train': 1.9121631383895874} 11/07/2021 15:02:59 - INFO - __main__ - Step 126543: {'lr': 3.035821188390117e-05, 'samples': 24296256, 'steps': 126542, 'loss/train': 1.3825440406799316} 11/07/2021 15:02:59 - INFO - __main__ - Step 126544: {'lr': 3.0355677332292914e-05, 'samples': 24296448, 'steps': 126543, 'loss/train': 1.4514282941818237} 11/07/2021 15:02:59 - INFO - __main__ - Step 126545: {'lr': 3.0353142879653017e-05, 'samples': 24296640, 'steps': 126544, 'loss/train': 1.2289683818817139} 11/07/2021 15:03:00 - INFO - __main__ - Step 126546: {'lr': 3.0350608525982594e-05, 'samples': 24296832, 'steps': 126545, 'loss/train': 1.0035390853881836} 11/07/2021 15:03:01 - INFO - __main__ - Step 126547: {'lr': 3.0348074271282804e-05, 'samples': 24297024, 'steps': 126546, 'loss/train': 1.1709696054458618} 11/07/2021 15:03:01 - INFO - __main__ - Step 126548: {'lr': 3.034554011555479e-05, 'samples': 24297216, 'steps': 126547, 'loss/train': 1.0223444700241089} 11/07/2021 15:03:02 - INFO - __main__ - Step 126549: {'lr': 3.0343006058799666e-05, 'samples': 24297408, 'steps': 126548, 'loss/train': 1.5799254179000854} 11/07/2021 15:03:02 - INFO - __main__ - Step 126550: {'lr': 3.034047210101859e-05, 'samples': 24297600, 'steps': 126549, 'loss/train': 1.6719948053359985} 11/07/2021 15:03:02 - INFO - __main__ - Step 126551: {'lr': 3.033793824221279e-05, 'samples': 24297792, 'steps': 126550, 'loss/train': 1.5221035480499268} 11/07/2021 15:03:03 - INFO - __main__ - Step 126552: {'lr': 3.0335404482383256e-05, 'samples': 24297984, 'steps': 126551, 'loss/train': 1.7508885860443115} 11/07/2021 15:03:04 - INFO - __main__ - Step 126553: {'lr': 3.0332870821531188e-05, 'samples': 24298176, 'steps': 126552, 'loss/train': 0.9178496599197388} 11/07/2021 15:03:04 - INFO - __main__ - Step 126554: {'lr': 3.0330337259657755e-05, 'samples': 24298368, 'steps': 126553, 'loss/train': 0.2245146483182907} 11/07/2021 15:03:04 - INFO - __main__ - Step 126555: {'lr': 3.0327803796764086e-05, 'samples': 24298560, 'steps': 126554, 'loss/train': 1.3154176473617554} 11/07/2021 15:03:05 - INFO - __main__ - Step 126556: {'lr': 3.03252704328513e-05, 'samples': 24298752, 'steps': 126555, 'loss/train': 1.2265942096710205} 11/07/2021 15:03:06 - INFO - __main__ - Step 126557: {'lr': 3.0322737167920556e-05, 'samples': 24298944, 'steps': 126556, 'loss/train': 1.2887656688690186} 11/07/2021 15:03:06 - INFO - __main__ - Step 126558: {'lr': 3.0320204001973023e-05, 'samples': 24299136, 'steps': 126557, 'loss/train': 1.0366190671920776} 11/07/2021 15:03:06 - INFO - __main__ - Step 126559: {'lr': 3.031767093500981e-05, 'samples': 24299328, 'steps': 126558, 'loss/train': 1.6108688116073608} 11/07/2021 15:03:07 - INFO - __main__ - Step 126560: {'lr': 3.031513796703206e-05, 'samples': 24299520, 'steps': 126559, 'loss/train': 1.2883141040802002} 11/07/2021 15:03:07 - INFO - __main__ - Step 126561: {'lr': 3.03126050980409e-05, 'samples': 24299712, 'steps': 126560, 'loss/train': 1.4319454431533813} 11/07/2021 15:03:08 - INFO - __main__ - Step 126562: {'lr': 3.0310072328037564e-05, 'samples': 24299904, 'steps': 126561, 'loss/train': 1.417560338973999} 11/07/2021 15:03:08 - INFO - __main__ - Step 126563: {'lr': 3.030753965702304e-05, 'samples': 24300096, 'steps': 126562, 'loss/train': 0.9240963459014893} 11/07/2021 15:03:09 - INFO - __main__ - Step 126564: {'lr': 3.030500708499859e-05, 'samples': 24300288, 'steps': 126563, 'loss/train': 1.4126436710357666} 11/07/2021 15:03:09 - INFO - __main__ - Step 126565: {'lr': 3.030247461196528e-05, 'samples': 24300480, 'steps': 126564, 'loss/train': 0.9867686629295349} 11/07/2021 15:03:10 - INFO - __main__ - Step 126566: {'lr': 3.0299942237924317e-05, 'samples': 24300672, 'steps': 126565, 'loss/train': 1.2255760431289673} 11/07/2021 15:03:11 - INFO - __main__ - Step 126567: {'lr': 3.0297409962876775e-05, 'samples': 24300864, 'steps': 126566, 'loss/train': 1.5556952953338623} 11/07/2021 15:03:11 - INFO - __main__ - Step 126568: {'lr': 3.0294877786823854e-05, 'samples': 24301056, 'steps': 126567, 'loss/train': 1.0871987342834473} 11/07/2021 15:03:11 - INFO - __main__ - Step 126569: {'lr': 3.029234570976666e-05, 'samples': 24301248, 'steps': 126568, 'loss/train': 1.07105553150177} 11/07/2021 15:03:12 - INFO - __main__ - Step 126570: {'lr': 3.0289813731706333e-05, 'samples': 24301440, 'steps': 126569, 'loss/train': 1.4795860052108765} 11/07/2021 15:03:12 - INFO - __main__ - Step 126571: {'lr': 3.0287281852644038e-05, 'samples': 24301632, 'steps': 126570, 'loss/train': 1.191298484802246} 11/07/2021 15:03:13 - INFO - __main__ - Step 126572: {'lr': 3.0284750072580913e-05, 'samples': 24301824, 'steps': 126571, 'loss/train': 1.4941459894180298} 11/07/2021 15:03:13 - INFO - __main__ - Step 126573: {'lr': 3.0282218391518095e-05, 'samples': 24302016, 'steps': 126572, 'loss/train': 0.6148841381072998} 11/07/2021 15:03:14 - INFO - __main__ - Step 126574: {'lr': 3.0279686809456752e-05, 'samples': 24302208, 'steps': 126573, 'loss/train': 1.0219264030456543} 11/07/2021 15:03:14 - INFO - __main__ - Step 126575: {'lr': 3.0277155326397938e-05, 'samples': 24302400, 'steps': 126574, 'loss/train': 0.03577055409550667} 11/07/2021 15:03:14 - INFO - __main__ - Step 126576: {'lr': 3.0274623942342842e-05, 'samples': 24302592, 'steps': 126575, 'loss/train': 1.1591726541519165} 11/07/2021 15:03:15 - INFO - __main__ - Step 126577: {'lr': 3.0272092657292637e-05, 'samples': 24302784, 'steps': 126576, 'loss/train': 1.4606504440307617} 11/07/2021 15:03:16 - INFO - __main__ - Step 126578: {'lr': 3.026956147124843e-05, 'samples': 24302976, 'steps': 126577, 'loss/train': 1.125555396080017} 11/07/2021 15:03:16 - INFO - __main__ - Step 126579: {'lr': 3.026703038421136e-05, 'samples': 24303168, 'steps': 126578, 'loss/train': 1.2780765295028687} 11/07/2021 15:03:17 - INFO - __main__ - Step 126580: {'lr': 3.026449939618256e-05, 'samples': 24303360, 'steps': 126579, 'loss/train': 1.4161620140075684} 11/07/2021 15:03:17 - INFO - __main__ - Step 126581: {'lr': 3.02619685071632e-05, 'samples': 24303552, 'steps': 126580, 'loss/train': 1.6031641960144043} 11/07/2021 15:03:17 - INFO - __main__ - Step 126582: {'lr': 3.025943771715442e-05, 'samples': 24303744, 'steps': 126581, 'loss/train': 1.4330103397369385} 11/07/2021 15:03:18 - INFO - __main__ - Step 126583: {'lr': 3.0256907026157327e-05, 'samples': 24303936, 'steps': 126582, 'loss/train': 1.4589647054672241} 11/07/2021 15:03:19 - INFO - __main__ - Step 126584: {'lr': 3.0254376434173087e-05, 'samples': 24304128, 'steps': 126583, 'loss/train': 1.2411144971847534} 11/07/2021 15:03:19 - INFO - __main__ - Step 126585: {'lr': 3.025184594120284e-05, 'samples': 24304320, 'steps': 126584, 'loss/train': 1.3260688781738281} 11/07/2021 15:03:19 - INFO - __main__ - Step 126586: {'lr': 3.0249315547247692e-05, 'samples': 24304512, 'steps': 126585, 'loss/train': 1.492798924446106} 11/07/2021 15:03:20 - INFO - __main__ - Step 126587: {'lr': 3.0246785252308894e-05, 'samples': 24304704, 'steps': 126586, 'loss/train': 1.2474231719970703} 11/07/2021 15:03:21 - INFO - __main__ - Step 126588: {'lr': 3.024425505638745e-05, 'samples': 24304896, 'steps': 126587, 'loss/train': 1.7110215425491333} 11/07/2021 15:03:21 - INFO - __main__ - Step 126589: {'lr': 3.0241724959484544e-05, 'samples': 24305088, 'steps': 126588, 'loss/train': 1.3953865766525269} 11/07/2021 15:03:21 - INFO - __main__ - Step 126590: {'lr': 3.0239194961601323e-05, 'samples': 24305280, 'steps': 126589, 'loss/train': 1.1815513372421265} 11/07/2021 15:03:22 - INFO - __main__ - Step 126591: {'lr': 3.023666506273895e-05, 'samples': 24305472, 'steps': 126590, 'loss/train': 0.8306041359901428} 11/07/2021 15:03:22 - INFO - __main__ - Step 126592: {'lr': 3.0234135262898534e-05, 'samples': 24305664, 'steps': 126591, 'loss/train': 1.3123263120651245} 11/07/2021 15:03:23 - INFO - __main__ - Step 126593: {'lr': 3.0231605562081212e-05, 'samples': 24305856, 'steps': 126592, 'loss/train': 1.1971707344055176} 11/07/2021 15:03:24 - INFO - __main__ - Step 126594: {'lr': 3.022907596028815e-05, 'samples': 24306048, 'steps': 126593, 'loss/train': 1.6974611282348633} 11/07/2021 15:03:24 - INFO - __main__ - Step 126595: {'lr': 3.022654645752046e-05, 'samples': 24306240, 'steps': 126594, 'loss/train': 1.1667903661727905} 11/07/2021 15:03:24 - INFO - __main__ - Step 126596: {'lr': 3.0224017053779308e-05, 'samples': 24306432, 'steps': 126595, 'loss/train': 0.5525984168052673} 11/07/2021 15:03:25 - INFO - __main__ - Step 126597: {'lr': 3.0221487749065828e-05, 'samples': 24306624, 'steps': 126596, 'loss/train': 0.9690840244293213} 11/07/2021 15:03:26 - INFO - __main__ - Step 126598: {'lr': 3.0218958543381138e-05, 'samples': 24306816, 'steps': 126597, 'loss/train': 1.4102510213851929} 11/07/2021 15:03:26 - INFO - __main__ - Step 126599: {'lr': 3.0216429436726424e-05, 'samples': 24307008, 'steps': 126598, 'loss/train': 1.1942880153656006} 11/07/2021 15:03:26 - INFO - __main__ - Step 126600: {'lr': 3.021390042910277e-05, 'samples': 24307200, 'steps': 126599, 'loss/train': 1.518867015838623} 11/07/2021 15:03:27 - INFO - __main__ - Step 126601: {'lr': 3.02113715205114e-05, 'samples': 24307392, 'steps': 126600, 'loss/train': 1.3397119045257568} 11/07/2021 15:03:27 - INFO - __main__ - Step 126602: {'lr': 3.020884271095334e-05, 'samples': 24307584, 'steps': 126601, 'loss/train': 0.846734881401062} 11/07/2021 15:03:28 - INFO - __main__ - Step 126603: {'lr': 3.020631400042978e-05, 'samples': 24307776, 'steps': 126602, 'loss/train': 1.2912280559539795} 11/07/2021 15:03:28 - INFO - __main__ - Step 126604: {'lr': 3.020378538894189e-05, 'samples': 24307968, 'steps': 126603, 'loss/train': 1.3519870042800903} 11/07/2021 15:03:29 - INFO - __main__ - Step 126605: {'lr': 3.0201256876490752e-05, 'samples': 24308160, 'steps': 126604, 'loss/train': 1.2231744527816772} 11/07/2021 15:03:29 - INFO - __main__ - Step 126606: {'lr': 3.019872846307753e-05, 'samples': 24308352, 'steps': 126605, 'loss/train': 1.299729347229004} 11/07/2021 15:03:29 - INFO - __main__ - Step 126607: {'lr': 3.019620014870339e-05, 'samples': 24308544, 'steps': 126606, 'loss/train': 1.2950993776321411} 11/07/2021 15:03:30 - INFO - __main__ - Step 126608: {'lr': 3.0193671933369443e-05, 'samples': 24308736, 'steps': 126607, 'loss/train': 1.0704147815704346} 11/07/2021 15:03:31 - INFO - __main__ - Step 126609: {'lr': 3.0191143817076855e-05, 'samples': 24308928, 'steps': 126608, 'loss/train': 1.4428995847702026} 11/07/2021 15:03:31 - INFO - __main__ - Step 126610: {'lr': 3.0188615799826736e-05, 'samples': 24309120, 'steps': 126609, 'loss/train': 1.1251204013824463} 11/07/2021 15:03:32 - INFO - __main__ - Step 126611: {'lr': 3.0186087881620223e-05, 'samples': 24309312, 'steps': 126610, 'loss/train': 0.812737226486206} 11/07/2021 15:03:32 - INFO - __main__ - Step 126612: {'lr': 3.0183560062458455e-05, 'samples': 24309504, 'steps': 126611, 'loss/train': 1.2975984811782837} 11/07/2021 15:03:33 - INFO - __main__ - Step 126613: {'lr': 3.0181032342342595e-05, 'samples': 24309696, 'steps': 126612, 'loss/train': 1.1979082822799683} 11/07/2021 15:03:34 - INFO - __main__ - Step 126614: {'lr': 3.017850472127384e-05, 'samples': 24309888, 'steps': 126613, 'loss/train': 1.3745365142822266} 11/07/2021 15:03:34 - INFO - __main__ - Step 126615: {'lr': 3.0175977199253192e-05, 'samples': 24310080, 'steps': 126614, 'loss/train': 1.4860140085220337} 11/07/2021 15:03:34 - INFO - __main__ - Step 126616: {'lr': 3.0173449776281863e-05, 'samples': 24310272, 'steps': 126615, 'loss/train': 1.0408110618591309} 11/07/2021 15:03:35 - INFO - __main__ - Step 126617: {'lr': 3.017092245236097e-05, 'samples': 24310464, 'steps': 126616, 'loss/train': 0.41220730543136597} 11/07/2021 15:03:36 - INFO - __main__ - Step 126618: {'lr': 3.016839522749168e-05, 'samples': 24310656, 'steps': 126617, 'loss/train': 1.244485855102539} 11/07/2021 15:03:36 - INFO - __main__ - Step 126619: {'lr': 3.0165868101675126e-05, 'samples': 24310848, 'steps': 126618, 'loss/train': 0.8130645155906677} 11/07/2021 15:03:36 - INFO - __main__ - Step 126620: {'lr': 3.016334107491242e-05, 'samples': 24311040, 'steps': 126619, 'loss/train': 1.1126651763916016} 11/07/2021 15:03:37 - INFO - __main__ - Step 126621: {'lr': 3.016081414720473e-05, 'samples': 24311232, 'steps': 126620, 'loss/train': 1.1665599346160889} 11/07/2021 15:03:37 - INFO - __main__ - Step 126622: {'lr': 3.015828731855319e-05, 'samples': 24311424, 'steps': 126621, 'loss/train': 0.9817543029785156} 11/07/2021 15:03:38 - INFO - __main__ - Step 126623: {'lr': 3.0155760588958912e-05, 'samples': 24311616, 'steps': 126622, 'loss/train': 1.3301745653152466} 11/07/2021 15:03:38 - INFO - __main__ - Step 126624: {'lr': 3.0153233958423094e-05, 'samples': 24311808, 'steps': 126623, 'loss/train': 1.2671749591827393} 11/07/2021 15:03:39 - INFO - __main__ - Step 126625: {'lr': 3.015070742694681e-05, 'samples': 24312000, 'steps': 126624, 'loss/train': 1.3816537857055664} 11/07/2021 15:03:39 - INFO - __main__ - Step 126626: {'lr': 3.0148180994531232e-05, 'samples': 24312192, 'steps': 126625, 'loss/train': 1.1740573644638062} 11/07/2021 15:03:39 - INFO - __main__ - Step 126627: {'lr': 3.014565466117747e-05, 'samples': 24312384, 'steps': 126626, 'loss/train': 1.4007538557052612} 11/07/2021 15:03:40 - INFO - __main__ - Step 126628: {'lr': 3.0143128426886766e-05, 'samples': 24312576, 'steps': 126627, 'loss/train': 1.4427083730697632} 11/07/2021 15:03:41 - INFO - __main__ - Step 126629: {'lr': 3.01406022916601e-05, 'samples': 24312768, 'steps': 126628, 'loss/train': 1.1821156740188599} 11/07/2021 15:03:41 - INFO - __main__ - Step 126630: {'lr': 3.013807625549872e-05, 'samples': 24312960, 'steps': 126629, 'loss/train': 0.9681251049041748} 11/07/2021 15:03:42 - INFO - __main__ - Step 126631: {'lr': 3.0135550318403703e-05, 'samples': 24313152, 'steps': 126630, 'loss/train': 1.3919620513916016} 11/07/2021 15:03:42 - INFO - __main__ - Step 126632: {'lr': 3.013302448037622e-05, 'samples': 24313344, 'steps': 126631, 'loss/train': 1.1575736999511719} 11/07/2021 15:03:42 - INFO - __main__ - Step 126633: {'lr': 3.0130498741417378e-05, 'samples': 24313536, 'steps': 126632, 'loss/train': 1.2163660526275635} 11/07/2021 15:03:43 - INFO - __main__ - Step 126634: {'lr': 3.012797310152837e-05, 'samples': 24313728, 'steps': 126633, 'loss/train': 0.989356279373169} 11/07/2021 15:03:44 - INFO - __main__ - Step 126635: {'lr': 3.0125447560710312e-05, 'samples': 24313920, 'steps': 126634, 'loss/train': 1.4454668760299683} 11/07/2021 15:03:44 - INFO - __main__ - Step 126636: {'lr': 3.0122922118964306e-05, 'samples': 24314112, 'steps': 126635, 'loss/train': 1.0299174785614014} 11/07/2021 15:03:44 - INFO - __main__ - Step 126637: {'lr': 3.0120396776291554e-05, 'samples': 24314304, 'steps': 126636, 'loss/train': 1.092712163925171} 11/07/2021 15:03:45 - INFO - __main__ - Step 126638: {'lr': 3.011787153269313e-05, 'samples': 24314496, 'steps': 126637, 'loss/train': 1.5173418521881104} 11/07/2021 15:03:46 - INFO - __main__ - Step 126639: {'lr': 3.0115346388170207e-05, 'samples': 24314688, 'steps': 126638, 'loss/train': 0.9705626368522644} 11/07/2021 15:03:46 - INFO - __main__ - Step 126640: {'lr': 3.0112821342723918e-05, 'samples': 24314880, 'steps': 126639, 'loss/train': 1.0519481897354126} 11/07/2021 15:03:46 - INFO - __main__ - Step 126641: {'lr': 3.011029639635546e-05, 'samples': 24315072, 'steps': 126640, 'loss/train': 1.0749667882919312} 11/07/2021 15:03:47 - INFO - __main__ - Step 126642: {'lr': 3.0107771549065825e-05, 'samples': 24315264, 'steps': 126641, 'loss/train': 1.3087207078933716} 11/07/2021 15:03:47 - INFO - __main__ - Step 126643: {'lr': 3.0105246800856272e-05, 'samples': 24315456, 'steps': 126642, 'loss/train': 1.296673059463501} 11/07/2021 15:03:48 - INFO - __main__ - Step 126644: {'lr': 3.010272215172788e-05, 'samples': 24315648, 'steps': 126643, 'loss/train': 1.3050522804260254} 11/07/2021 15:03:49 - INFO - __main__ - Step 126645: {'lr': 3.0100197601681813e-05, 'samples': 24315840, 'steps': 126644, 'loss/train': 1.166544795036316} 11/07/2021 15:03:49 - INFO - __main__ - Step 126646: {'lr': 3.0097673150719207e-05, 'samples': 24316032, 'steps': 126645, 'loss/train': 0.9462153315544128} 11/07/2021 15:03:49 - INFO - __main__ - Step 126647: {'lr': 3.0095148798841205e-05, 'samples': 24316224, 'steps': 126646, 'loss/train': 1.4294989109039307} 11/07/2021 15:03:50 - INFO - __main__ - Step 126648: {'lr': 3.0092624546048913e-05, 'samples': 24316416, 'steps': 126647, 'loss/train': 0.6353975534439087} 11/07/2021 15:03:51 - INFO - __main__ - Step 126649: {'lr': 3.00901003923435e-05, 'samples': 24316608, 'steps': 126648, 'loss/train': 1.6058803796768188} 11/07/2021 15:03:51 - INFO - __main__ - Step 126650: {'lr': 3.00875763377261e-05, 'samples': 24316800, 'steps': 126649, 'loss/train': 1.4036719799041748} 11/07/2021 15:03:51 - INFO - __main__ - Step 126651: {'lr': 3.0085052382197857e-05, 'samples': 24316992, 'steps': 126650, 'loss/train': 1.4289031028747559} 11/07/2021 15:03:52 - INFO - __main__ - Step 126652: {'lr': 3.0082528525759878e-05, 'samples': 24317184, 'steps': 126651, 'loss/train': 1.4311244487762451} 11/07/2021 15:03:52 - INFO - __main__ - Step 126653: {'lr': 3.008000476841333e-05, 'samples': 24317376, 'steps': 126652, 'loss/train': 1.4676666259765625} 11/07/2021 15:03:53 - INFO - __main__ - Step 126654: {'lr': 3.0077481110159317e-05, 'samples': 24317568, 'steps': 126653, 'loss/train': 1.136991024017334} 11/07/2021 15:03:53 - INFO - __main__ - Step 126655: {'lr': 3.0074957550999067e-05, 'samples': 24317760, 'steps': 126654, 'loss/train': 1.371172547340393} 11/07/2021 15:03:54 - INFO - __main__ - Step 126656: {'lr': 3.0072434090933574e-05, 'samples': 24317952, 'steps': 126655, 'loss/train': 1.1126078367233276} 11/07/2021 15:03:54 - INFO - __main__ - Step 126657: {'lr': 3.006991072996407e-05, 'samples': 24318144, 'steps': 126656, 'loss/train': 1.2584309577941895} 11/07/2021 15:03:55 - INFO - __main__ - Step 126658: {'lr': 3.0067387468091678e-05, 'samples': 24318336, 'steps': 126657, 'loss/train': 1.2157467603683472} 11/07/2021 15:03:55 - INFO - __main__ - Step 126659: {'lr': 3.0064864305317517e-05, 'samples': 24318528, 'steps': 126658, 'loss/train': 1.1448469161987305} 11/07/2021 15:03:56 - INFO - __main__ - Step 126660: {'lr': 3.0062341241642725e-05, 'samples': 24318720, 'steps': 126659, 'loss/train': 1.5448168516159058} 11/07/2021 15:03:56 - INFO - __main__ - Step 126661: {'lr': 3.0059818277068467e-05, 'samples': 24318912, 'steps': 126660, 'loss/train': 1.3492615222930908} 11/07/2021 15:03:57 - INFO - __main__ - Step 126662: {'lr': 3.0057295411595854e-05, 'samples': 24319104, 'steps': 126661, 'loss/train': 1.3344943523406982} 11/07/2021 15:03:57 - INFO - __main__ - Step 126663: {'lr': 3.005477264522602e-05, 'samples': 24319296, 'steps': 126662, 'loss/train': 1.4972848892211914} 11/07/2021 15:03:58 - INFO - __main__ - Step 126664: {'lr': 3.0052249977960105e-05, 'samples': 24319488, 'steps': 126663, 'loss/train': 1.6493620872497559} 11/07/2021 15:03:58 - INFO - __main__ - Step 126665: {'lr': 3.004972740979928e-05, 'samples': 24319680, 'steps': 126664, 'loss/train': 2.1071174144744873} 11/07/2021 15:03:59 - INFO - __main__ - Step 126666: {'lr': 3.0047204940744617e-05, 'samples': 24319872, 'steps': 126665, 'loss/train': 1.0338376760482788} 11/07/2021 15:03:59 - INFO - __main__ - Step 126667: {'lr': 3.0044682570797317e-05, 'samples': 24320064, 'steps': 126666, 'loss/train': 1.3242932558059692} 11/07/2021 15:03:59 - INFO - __main__ - Step 126668: {'lr': 3.0042160299958543e-05, 'samples': 24320256, 'steps': 126667, 'loss/train': 1.4410345554351807} 11/07/2021 15:04:00 - INFO - __main__ - Step 126669: {'lr': 3.0039638128229323e-05, 'samples': 24320448, 'steps': 126668, 'loss/train': 1.6033785343170166} 11/07/2021 15:04:01 - INFO - __main__ - Step 126670: {'lr': 3.0037116055610825e-05, 'samples': 24320640, 'steps': 126669, 'loss/train': 1.0690792798995972} 11/07/2021 15:04:01 - INFO - __main__ - Step 126671: {'lr': 3.0034594082104237e-05, 'samples': 24320832, 'steps': 126670, 'loss/train': 1.1949183940887451} 11/07/2021 15:04:01 - INFO - __main__ - Step 126672: {'lr': 3.003207220771065e-05, 'samples': 24321024, 'steps': 126671, 'loss/train': 1.4557883739471436} 11/07/2021 15:04:02 - INFO - __main__ - Step 126673: {'lr': 3.002955043243122e-05, 'samples': 24321216, 'steps': 126672, 'loss/train': 1.429855227470398} 11/07/2021 15:04:03 - INFO - __main__ - Step 126674: {'lr': 3.0027028756267088e-05, 'samples': 24321408, 'steps': 126673, 'loss/train': 1.1201001405715942} 11/07/2021 15:04:03 - INFO - __main__ - Step 126675: {'lr': 3.0024507179219367e-05, 'samples': 24321600, 'steps': 126674, 'loss/train': 0.893470287322998} 11/07/2021 15:04:04 - INFO - __main__ - Step 126676: {'lr': 3.0021985701289223e-05, 'samples': 24321792, 'steps': 126675, 'loss/train': 0.8404068946838379} 11/07/2021 15:04:04 - INFO - __main__ - Step 126677: {'lr': 3.0019464322477763e-05, 'samples': 24321984, 'steps': 126676, 'loss/train': 1.5081359148025513} 11/07/2021 15:04:04 - INFO - __main__ - Step 126678: {'lr': 3.0016943042786153e-05, 'samples': 24322176, 'steps': 126677, 'loss/train': 1.3224166631698608} 11/07/2021 15:04:05 - INFO - __main__ - Step 126679: {'lr': 3.0014421862215507e-05, 'samples': 24322368, 'steps': 126678, 'loss/train': 1.0672017335891724} 11/07/2021 15:04:06 - INFO - __main__ - Step 126680: {'lr': 3.001190078076696e-05, 'samples': 24322560, 'steps': 126679, 'loss/train': 1.247156023979187} 11/07/2021 15:04:06 - INFO - __main__ - Step 126681: {'lr': 3.0009379798441676e-05, 'samples': 24322752, 'steps': 126680, 'loss/train': 1.9390184879302979} 11/07/2021 15:04:07 - INFO - __main__ - Step 126682: {'lr': 3.0006858915240798e-05, 'samples': 24322944, 'steps': 126681, 'loss/train': 1.5080416202545166} 11/07/2021 15:04:07 - INFO - __main__ - Step 126683: {'lr': 3.0004338131165404e-05, 'samples': 24323136, 'steps': 126682, 'loss/train': 1.338681936264038} 11/07/2021 15:04:07 - INFO - __main__ - Step 126684: {'lr': 3.0001817446216663e-05, 'samples': 24323328, 'steps': 126683, 'loss/train': 1.1041181087493896} 11/07/2021 15:04:08 - INFO - __main__ - Step 126685: {'lr': 2.999929686039568e-05, 'samples': 24323520, 'steps': 126684, 'loss/train': 1.2107174396514893} 11/07/2021 15:04:09 - INFO - __main__ - Step 126686: {'lr': 2.9996776373703653e-05, 'samples': 24323712, 'steps': 126685, 'loss/train': 1.0622156858444214} 11/07/2021 15:04:09 - INFO - __main__ - Step 126687: {'lr': 2.9994255986141665e-05, 'samples': 24323904, 'steps': 126686, 'loss/train': 1.413983941078186} 11/07/2021 15:04:09 - INFO - __main__ - Step 126688: {'lr': 2.999173569771088e-05, 'samples': 24324096, 'steps': 126687, 'loss/train': 1.090338945388794} 11/07/2021 15:04:10 - INFO - __main__ - Step 126689: {'lr': 2.998921550841241e-05, 'samples': 24324288, 'steps': 126688, 'loss/train': 0.04947207495570183} 11/07/2021 15:04:11 - INFO - __main__ - Step 126690: {'lr': 2.998669541824742e-05, 'samples': 24324480, 'steps': 126689, 'loss/train': 0.14002040028572083} 11/07/2021 15:04:11 - INFO - __main__ - Step 126691: {'lr': 2.9984175427217013e-05, 'samples': 24324672, 'steps': 126690, 'loss/train': 0.48592451214790344} 11/07/2021 15:04:11 - INFO - __main__ - Step 126692: {'lr': 2.9981655535322337e-05, 'samples': 24324864, 'steps': 126691, 'loss/train': 1.165930986404419} 11/07/2021 15:04:12 - INFO - __main__ - Step 126693: {'lr': 2.9979135742564557e-05, 'samples': 24325056, 'steps': 126692, 'loss/train': 0.7126807570457458} 11/07/2021 15:04:12 - INFO - __main__ - Step 126694: {'lr': 2.9976616048944776e-05, 'samples': 24325248, 'steps': 126693, 'loss/train': 1.0505731105804443} 11/07/2021 15:04:13 - INFO - __main__ - Step 126695: {'lr': 2.9974096454464195e-05, 'samples': 24325440, 'steps': 126694, 'loss/train': 1.0811420679092407} 11/07/2021 15:04:14 - INFO - __main__ - Step 126696: {'lr': 2.997157695912381e-05, 'samples': 24325632, 'steps': 126695, 'loss/train': 1.1530336141586304} 11/07/2021 15:04:14 - INFO - __main__ - Step 126697: {'lr': 2.9969057562924866e-05, 'samples': 24325824, 'steps': 126696, 'loss/train': 1.1306835412979126} 11/07/2021 15:04:14 - INFO - __main__ - Step 126698: {'lr': 2.9966538265868452e-05, 'samples': 24326016, 'steps': 126697, 'loss/train': 1.1355488300323486} 11/07/2021 15:04:15 - INFO - __main__ - Step 126699: {'lr': 2.9964019067955734e-05, 'samples': 24326208, 'steps': 126698, 'loss/train': 1.2946784496307373} 11/07/2021 15:04:16 - INFO - __main__ - Step 126700: {'lr': 2.9961499969187816e-05, 'samples': 24326400, 'steps': 126699, 'loss/train': 1.635289192199707} 11/07/2021 15:04:16 - INFO - __main__ - Step 126701: {'lr': 2.995898096956587e-05, 'samples': 24326592, 'steps': 126700, 'loss/train': 1.4430322647094727} 11/07/2021 15:04:16 - INFO - __main__ - Step 126702: {'lr': 2.9956462069091e-05, 'samples': 24326784, 'steps': 126701, 'loss/train': 0.8973352313041687} 11/07/2021 15:04:17 - INFO - __main__ - Step 126703: {'lr': 2.995394326776435e-05, 'samples': 24326976, 'steps': 126702, 'loss/train': 1.5811980962753296} 11/07/2021 15:04:17 - INFO - __main__ - Step 126704: {'lr': 2.9951424565587082e-05, 'samples': 24327168, 'steps': 126703, 'loss/train': 1.3995238542556763} 11/07/2021 15:04:18 - INFO - __main__ - Step 126705: {'lr': 2.994890596256028e-05, 'samples': 24327360, 'steps': 126704, 'loss/train': 0.6857572793960571} 11/07/2021 15:04:18 - INFO - __main__ - Step 126706: {'lr': 2.994638745868511e-05, 'samples': 24327552, 'steps': 126705, 'loss/train': 1.610209345817566} 11/07/2021 15:04:19 - INFO - __main__ - Step 126707: {'lr': 2.994386905396271e-05, 'samples': 24327744, 'steps': 126706, 'loss/train': 0.5302055478096008} 11/07/2021 15:04:19 - INFO - __main__ - Step 126708: {'lr': 2.994135074839424e-05, 'samples': 24327936, 'steps': 126707, 'loss/train': 1.2364360094070435} 11/07/2021 15:04:19 - INFO - __main__ - Step 126709: {'lr': 2.9938832541980766e-05, 'samples': 24328128, 'steps': 126708, 'loss/train': 0.721731960773468} 11/07/2021 15:04:20 - INFO - __main__ - Step 126710: {'lr': 2.9936314434723473e-05, 'samples': 24328320, 'steps': 126709, 'loss/train': 0.624435305595398} 11/07/2021 15:04:21 - INFO - __main__ - Step 126711: {'lr': 2.9933796426623445e-05, 'samples': 24328512, 'steps': 126710, 'loss/train': 1.191494107246399} 11/07/2021 15:04:21 - INFO - __main__ - Step 126712: {'lr': 2.9931278517681874e-05, 'samples': 24328704, 'steps': 126711, 'loss/train': 1.0952575206756592} 11/07/2021 15:04:21 - INFO - __main__ - Step 126713: {'lr': 2.9928760707899875e-05, 'samples': 24328896, 'steps': 126712, 'loss/train': 1.4678115844726562} 11/07/2021 15:04:22 - INFO - __main__ - Step 126714: {'lr': 2.9926242997278584e-05, 'samples': 24329088, 'steps': 126713, 'loss/train': 0.9488266706466675} 11/07/2021 15:04:22 - INFO - __main__ - Step 126715: {'lr': 2.992372538581911e-05, 'samples': 24329280, 'steps': 126714, 'loss/train': 1.641599178314209} 11/07/2021 15:04:23 - INFO - __main__ - Step 126716: {'lr': 2.992120787352265e-05, 'samples': 24329472, 'steps': 126715, 'loss/train': 0.743595540523529} 11/07/2021 15:04:24 - INFO - __main__ - Step 126717: {'lr': 2.9918690460390252e-05, 'samples': 24329664, 'steps': 126716, 'loss/train': 1.085230827331543} 11/07/2021 15:04:24 - INFO - __main__ - Step 126718: {'lr': 2.9916173146423116e-05, 'samples': 24329856, 'steps': 126717, 'loss/train': 1.3496716022491455} 11/07/2021 15:04:24 - INFO - __main__ - Step 126719: {'lr': 2.9913655931622375e-05, 'samples': 24330048, 'steps': 126718, 'loss/train': 1.1533352136611938} 11/07/2021 15:04:25 - INFO - __main__ - Step 126720: {'lr': 2.9911138815989115e-05, 'samples': 24330240, 'steps': 126719, 'loss/train': 0.9926164150238037} 11/07/2021 15:04:26 - INFO - __main__ - Step 126721: {'lr': 2.99086217995245e-05, 'samples': 24330432, 'steps': 126720, 'loss/train': 1.2520718574523926} 11/07/2021 15:04:26 - INFO - __main__ - Step 126722: {'lr': 2.9906104882229723e-05, 'samples': 24330624, 'steps': 126721, 'loss/train': 0.45746344327926636} 11/07/2021 15:04:26 - INFO - __main__ - Step 126723: {'lr': 2.9903588064105815e-05, 'samples': 24330816, 'steps': 126722, 'loss/train': 1.1958608627319336} 11/07/2021 15:04:27 - INFO - __main__ - Step 126724: {'lr': 2.9901071345153964e-05, 'samples': 24331008, 'steps': 126723, 'loss/train': 0.9233465790748596} 11/07/2021 15:04:27 - INFO - __main__ - Step 126725: {'lr': 2.989855472537528e-05, 'samples': 24331200, 'steps': 126724, 'loss/train': 1.4556033611297607} 11/07/2021 15:04:28 - INFO - __main__ - Step 126726: {'lr': 2.989603820477091e-05, 'samples': 24331392, 'steps': 126725, 'loss/train': 1.2981725931167603} 11/07/2021 15:04:28 - INFO - __main__ - Step 126727: {'lr': 2.989352178334198e-05, 'samples': 24331584, 'steps': 126726, 'loss/train': 1.2247732877731323} 11/07/2021 15:04:29 - INFO - __main__ - Step 126728: {'lr': 2.9891005461089638e-05, 'samples': 24331776, 'steps': 126727, 'loss/train': 1.0158917903900146} 11/07/2021 15:04:29 - INFO - __main__ - Step 126729: {'lr': 2.9888489238015015e-05, 'samples': 24331968, 'steps': 126728, 'loss/train': 1.6855247020721436} 11/07/2021 15:04:29 - INFO - __main__ - Step 126730: {'lr': 2.9885973114119252e-05, 'samples': 24332160, 'steps': 126729, 'loss/train': 0.9624506235122681} 11/07/2021 15:04:31 - INFO - __main__ - Step 126731: {'lr': 2.988345708940346e-05, 'samples': 24332352, 'steps': 126730, 'loss/train': 1.4148095846176147} 11/07/2021 15:04:31 - INFO - __main__ - Step 126732: {'lr': 2.9880941163868775e-05, 'samples': 24332544, 'steps': 126731, 'loss/train': 1.205998420715332} 11/07/2021 15:04:31 - INFO - __main__ - Step 126733: {'lr': 2.987842533751636e-05, 'samples': 24332736, 'steps': 126732, 'loss/train': 1.1276837587356567} 11/07/2021 15:04:32 - INFO - __main__ - Step 126734: {'lr': 2.9875909610347335e-05, 'samples': 24332928, 'steps': 126733, 'loss/train': 1.2315384149551392} 11/07/2021 15:04:32 - INFO - __main__ - Step 126735: {'lr': 2.9873393982362858e-05, 'samples': 24333120, 'steps': 126734, 'loss/train': 1.1365785598754883} 11/07/2021 15:04:33 - INFO - __main__ - Step 126736: {'lr': 2.987087845356401e-05, 'samples': 24333312, 'steps': 126735, 'loss/train': 1.6448627710342407} 11/07/2021 15:04:33 - INFO - __main__ - Step 126737: {'lr': 2.9868363023951932e-05, 'samples': 24333504, 'steps': 126736, 'loss/train': 1.4127939939498901} 11/07/2021 15:04:34 - INFO - __main__ - Step 126738: {'lr': 2.9865847693527765e-05, 'samples': 24333696, 'steps': 126737, 'loss/train': 1.2401447296142578} 11/07/2021 15:04:34 - INFO - __main__ - Step 126739: {'lr': 2.9863332462292643e-05, 'samples': 24333888, 'steps': 126738, 'loss/train': 1.9472523927688599} 11/07/2021 15:04:34 - INFO - __main__ - Step 126740: {'lr': 2.986081733024773e-05, 'samples': 24334080, 'steps': 126739, 'loss/train': 1.5430861711502075} 11/07/2021 15:04:35 - INFO - __main__ - Step 126741: {'lr': 2.9858302297394112e-05, 'samples': 24334272, 'steps': 126740, 'loss/train': 1.2346196174621582} 11/07/2021 15:04:36 - INFO - __main__ - Step 126742: {'lr': 2.9855787363732984e-05, 'samples': 24334464, 'steps': 126741, 'loss/train': 1.0128356218338013} 11/07/2021 15:04:36 - INFO - __main__ - Step 126743: {'lr': 2.9853272529265397e-05, 'samples': 24334656, 'steps': 126742, 'loss/train': 0.9197754263877869} 11/07/2021 15:04:36 - INFO - __main__ - Step 126744: {'lr': 2.9850757793992572e-05, 'samples': 24334848, 'steps': 126743, 'loss/train': 1.1845860481262207} 11/07/2021 15:04:37 - INFO - __main__ - Step 126745: {'lr': 2.9848243157915565e-05, 'samples': 24335040, 'steps': 126744, 'loss/train': 1.2239574193954468} 11/07/2021 15:04:38 - INFO - __main__ - Step 126746: {'lr': 2.9845728621035546e-05, 'samples': 24335232, 'steps': 126745, 'loss/train': 1.2510669231414795} 11/07/2021 15:04:38 - INFO - __main__ - Step 126747: {'lr': 2.9843214183353645e-05, 'samples': 24335424, 'steps': 126746, 'loss/train': 1.092146873474121} 11/07/2021 15:04:39 - INFO - __main__ - Step 126748: {'lr': 2.9840699844871005e-05, 'samples': 24335616, 'steps': 126747, 'loss/train': 1.121585488319397} 11/07/2021 15:04:39 - INFO - __main__ - Step 126749: {'lr': 2.983818560558879e-05, 'samples': 24335808, 'steps': 126748, 'loss/train': 1.141412615776062} 11/07/2021 15:04:39 - INFO - __main__ - Step 126750: {'lr': 2.9835671465508058e-05, 'samples': 24336000, 'steps': 126749, 'loss/train': 1.2972266674041748} 11/07/2021 15:04:40 - INFO - __main__ - Step 126751: {'lr': 2.983315742462997e-05, 'samples': 24336192, 'steps': 126750, 'loss/train': 1.870964527130127} 11/07/2021 15:04:41 - INFO - __main__ - Step 126752: {'lr': 2.9830643482955638e-05, 'samples': 24336384, 'steps': 126751, 'loss/train': 1.006643295288086} 11/07/2021 15:04:41 - INFO - __main__ - Step 126753: {'lr': 2.9828129640486256e-05, 'samples': 24336576, 'steps': 126752, 'loss/train': 2.311173677444458} 11/07/2021 15:04:41 - INFO - __main__ - Step 126754: {'lr': 2.9825615897222908e-05, 'samples': 24336768, 'steps': 126753, 'loss/train': 0.12810683250427246} 11/07/2021 15:04:42 - INFO - __main__ - Step 126755: {'lr': 2.9823102253166728e-05, 'samples': 24336960, 'steps': 126754, 'loss/train': 0.7439903616905212} 11/07/2021 15:04:42 - INFO - __main__ - Step 126756: {'lr': 2.982058870831886e-05, 'samples': 24337152, 'steps': 126755, 'loss/train': 1.038591742515564} 11/07/2021 15:04:43 - INFO - __main__ - Step 126757: {'lr': 2.981807526268046e-05, 'samples': 24337344, 'steps': 126756, 'loss/train': 1.5650978088378906} 11/07/2021 15:04:44 - INFO - __main__ - Step 126758: {'lr': 2.981556191625262e-05, 'samples': 24337536, 'steps': 126757, 'loss/train': 1.0605360269546509} 11/07/2021 15:04:44 - INFO - __main__ - Step 126759: {'lr': 2.9813048669036473e-05, 'samples': 24337728, 'steps': 126758, 'loss/train': 1.1268588304519653} 11/07/2021 15:04:44 - INFO - __main__ - Step 126760: {'lr': 2.9810535521033216e-05, 'samples': 24337920, 'steps': 126759, 'loss/train': 1.3044880628585815} 11/07/2021 15:04:45 - INFO - __main__ - Step 126761: {'lr': 2.98080224722439e-05, 'samples': 24338112, 'steps': 126760, 'loss/train': 0.7245055437088013} 11/07/2021 15:04:46 - INFO - __main__ - Step 126762: {'lr': 2.980550952266975e-05, 'samples': 24338304, 'steps': 126761, 'loss/train': 1.5127077102661133} 11/07/2021 15:04:46 - INFO - __main__ - Step 126763: {'lr': 2.980299667231179e-05, 'samples': 24338496, 'steps': 126762, 'loss/train': 0.03928899019956589} 11/07/2021 15:04:46 - INFO - __main__ - Step 126764: {'lr': 2.9800483921171184e-05, 'samples': 24338688, 'steps': 126763, 'loss/train': 1.7142747640609741} 11/07/2021 15:04:47 - INFO - __main__ - Step 126765: {'lr': 2.9797971269249103e-05, 'samples': 24338880, 'steps': 126764, 'loss/train': 1.1150524616241455} 11/07/2021 15:04:47 - INFO - __main__ - Step 126766: {'lr': 2.9795458716546652e-05, 'samples': 24339072, 'steps': 126765, 'loss/train': 1.1364037990570068} 11/07/2021 15:04:48 - INFO - __main__ - Step 126767: {'lr': 2.9792946263064974e-05, 'samples': 24339264, 'steps': 126766, 'loss/train': 0.919776201248169} 11/07/2021 15:04:49 - INFO - __main__ - Step 126768: {'lr': 2.9790433908805202e-05, 'samples': 24339456, 'steps': 126767, 'loss/train': 1.3249379396438599} 11/07/2021 15:04:49 - INFO - __main__ - Step 126769: {'lr': 2.9787921653768452e-05, 'samples': 24339648, 'steps': 126768, 'loss/train': 1.5604407787322998} 11/07/2021 15:04:49 - INFO - __main__ - Step 126770: {'lr': 2.9785409497955856e-05, 'samples': 24339840, 'steps': 126769, 'loss/train': 0.177085742354393} 11/07/2021 15:04:50 - INFO - __main__ - Step 126771: {'lr': 2.9782897441368585e-05, 'samples': 24340032, 'steps': 126770, 'loss/train': 1.3163635730743408} 11/07/2021 15:04:51 - INFO - __main__ - Step 126772: {'lr': 2.9780385484007717e-05, 'samples': 24340224, 'steps': 126771, 'loss/train': 1.354034662246704} 11/07/2021 15:04:51 - INFO - __main__ - Step 126773: {'lr': 2.9777873625874418e-05, 'samples': 24340416, 'steps': 126772, 'loss/train': 1.1129566431045532} 11/07/2021 15:04:52 - INFO - __main__ - Step 126774: {'lr': 2.9775361866969802e-05, 'samples': 24340608, 'steps': 126773, 'loss/train': 1.3005422353744507} 11/07/2021 15:04:52 - INFO - __main__ - Step 126775: {'lr': 2.977285020729503e-05, 'samples': 24340800, 'steps': 126774, 'loss/train': 1.3836525678634644} 11/07/2021 15:04:52 - INFO - __main__ - Step 126776: {'lr': 2.9770338646851248e-05, 'samples': 24340992, 'steps': 126775, 'loss/train': 0.9386743307113647} 11/07/2021 15:04:53 - INFO - __main__ - Step 126777: {'lr': 2.9767827185639502e-05, 'samples': 24341184, 'steps': 126776, 'loss/train': 1.3225164413452148} 11/07/2021 15:04:54 - INFO - __main__ - Step 126778: {'lr': 2.9765315823660986e-05, 'samples': 24341376, 'steps': 126777, 'loss/train': 0.6824650168418884} 11/07/2021 15:04:54 - INFO - __main__ - Step 126779: {'lr': 2.976280456091682e-05, 'samples': 24341568, 'steps': 126778, 'loss/train': 1.2822716236114502} 11/07/2021 15:04:54 - INFO - __main__ - Step 126780: {'lr': 2.9760293397408128e-05, 'samples': 24341760, 'steps': 126779, 'loss/train': 1.0479134321212769} 11/07/2021 15:04:55 - INFO - __main__ - Step 126781: {'lr': 2.9757782333136058e-05, 'samples': 24341952, 'steps': 126780, 'loss/train': 1.3641518354415894} 11/07/2021 15:04:56 - INFO - __main__ - Step 126782: {'lr': 2.9755271368101715e-05, 'samples': 24342144, 'steps': 126781, 'loss/train': 0.8512364029884338} 11/07/2021 15:04:56 - INFO - __main__ - Step 126783: {'lr': 2.9752760502306243e-05, 'samples': 24342336, 'steps': 126782, 'loss/train': 1.2116762399673462} 11/07/2021 15:04:56 - INFO - __main__ - Step 126784: {'lr': 2.97502497357508e-05, 'samples': 24342528, 'steps': 126783, 'loss/train': 1.1942644119262695} 11/07/2021 15:04:57 - INFO - __main__ - Step 126785: {'lr': 2.97477390684365e-05, 'samples': 24342720, 'steps': 126784, 'loss/train': 4.486190319061279} 11/07/2021 15:04:57 - INFO - __main__ - Step 126786: {'lr': 2.9745228500364457e-05, 'samples': 24342912, 'steps': 126785, 'loss/train': 1.6541794538497925} 11/07/2021 15:04:58 - INFO - __main__ - Step 126787: {'lr': 2.9742718031535804e-05, 'samples': 24343104, 'steps': 126786, 'loss/train': 1.1640735864639282} 11/07/2021 15:04:59 - INFO - __main__ - Step 126788: {'lr': 2.9740207661951762e-05, 'samples': 24343296, 'steps': 126787, 'loss/train': 1.3032900094985962} 11/07/2021 15:04:59 - INFO - __main__ - Step 126789: {'lr': 2.9737697391613333e-05, 'samples': 24343488, 'steps': 126788, 'loss/train': 1.4293419122695923} 11/07/2021 15:04:59 - INFO - __main__ - Step 126790: {'lr': 2.973518722052168e-05, 'samples': 24343680, 'steps': 126789, 'loss/train': 1.0184242725372314} 11/07/2021 15:05:00 - INFO - __main__ - Step 126791: {'lr': 2.9732677148677946e-05, 'samples': 24343872, 'steps': 126790, 'loss/train': 1.3297853469848633} 11/07/2021 15:05:00 - INFO - __main__ - Step 126792: {'lr': 2.9730167176083288e-05, 'samples': 24344064, 'steps': 126791, 'loss/train': 1.2524573802947998} 11/07/2021 15:05:01 - INFO - __main__ - Step 126793: {'lr': 2.9727657302738797e-05, 'samples': 24344256, 'steps': 126792, 'loss/train': 0.9470028877258301} 11/07/2021 15:05:01 - INFO - __main__ - Step 126794: {'lr': 2.9725147528645635e-05, 'samples': 24344448, 'steps': 126793, 'loss/train': 1.071668267250061} 11/07/2021 15:05:02 - INFO - __main__ - Step 126795: {'lr': 2.972263785380494e-05, 'samples': 24344640, 'steps': 126794, 'loss/train': 1.320306658744812} 11/07/2021 15:05:02 - INFO - __main__ - Step 126796: {'lr': 2.9720128278217794e-05, 'samples': 24344832, 'steps': 126795, 'loss/train': 0.6445837616920471} 11/07/2021 15:05:02 - INFO - __main__ - Step 126797: {'lr': 2.9717618801885394e-05, 'samples': 24345024, 'steps': 126796, 'loss/train': 1.0819209814071655} 11/07/2021 15:05:03 - INFO - __main__ - Step 126798: {'lr': 2.9715109424808874e-05, 'samples': 24345216, 'steps': 126797, 'loss/train': 1.1441669464111328} 11/07/2021 15:05:04 - INFO - __main__ - Step 126799: {'lr': 2.971260014698926e-05, 'samples': 24345408, 'steps': 126798, 'loss/train': 0.5490462183952332} 11/07/2021 15:05:04 - INFO - __main__ - Step 126800: {'lr': 2.971009096842775e-05, 'samples': 24345600, 'steps': 126799, 'loss/train': 0.8575505614280701} 11/07/2021 15:05:05 - INFO - __main__ - Step 126801: {'lr': 2.9707581889125506e-05, 'samples': 24345792, 'steps': 126800, 'loss/train': 1.4977308511734009} 11/07/2021 15:05:05 - INFO - __main__ - Step 126802: {'lr': 2.9705072909083587e-05, 'samples': 24345984, 'steps': 126801, 'loss/train': 1.6259123086929321} 11/07/2021 15:05:06 - INFO - __main__ - Step 126803: {'lr': 2.970256402830318e-05, 'samples': 24346176, 'steps': 126802, 'loss/train': 1.3104445934295654} 11/07/2021 15:05:07 - INFO - __main__ - Step 126804: {'lr': 2.97000552467854e-05, 'samples': 24346368, 'steps': 126803, 'loss/train': 1.2464816570281982} 11/07/2021 15:05:07 - INFO - __main__ - Step 126805: {'lr': 2.9697546564531387e-05, 'samples': 24346560, 'steps': 126804, 'loss/train': 0.09129340201616287} 11/07/2021 15:05:07 - INFO - __main__ - Step 126806: {'lr': 2.9695037981542244e-05, 'samples': 24346752, 'steps': 126805, 'loss/train': 1.143824577331543} 11/07/2021 15:05:08 - INFO - __main__ - Step 126807: {'lr': 2.9692529497819115e-05, 'samples': 24346944, 'steps': 126806, 'loss/train': 1.5696667432785034} 11/07/2021 15:05:09 - INFO - __main__ - Step 126808: {'lr': 2.9690021113363135e-05, 'samples': 24347136, 'steps': 126807, 'loss/train': 1.0292487144470215} 11/07/2021 15:05:09 - INFO - __main__ - Step 126809: {'lr': 2.9687512828175473e-05, 'samples': 24347328, 'steps': 126808, 'loss/train': 0.893883228302002} 11/07/2021 15:05:09 - INFO - __main__ - Step 126810: {'lr': 2.9685004642257178e-05, 'samples': 24347520, 'steps': 126809, 'loss/train': 1.1638859510421753} 11/07/2021 15:05:10 - INFO - __main__ - Step 126811: {'lr': 2.9682496555609424e-05, 'samples': 24347712, 'steps': 126810, 'loss/train': 1.2881656885147095} 11/07/2021 15:05:10 - INFO - __main__ - Step 126812: {'lr': 2.967998856823334e-05, 'samples': 24347904, 'steps': 126811, 'loss/train': 1.3134801387786865} 11/07/2021 15:05:11 - INFO - __main__ - Step 126813: {'lr': 2.9677480680130042e-05, 'samples': 24348096, 'steps': 126812, 'loss/train': 1.3501120805740356} 11/07/2021 15:05:12 - INFO - __main__ - Step 126814: {'lr': 2.9674972891300695e-05, 'samples': 24348288, 'steps': 126813, 'loss/train': 1.5396500825881958} 11/07/2021 15:05:12 - INFO - __main__ - Step 126815: {'lr': 2.967246520174638e-05, 'samples': 24348480, 'steps': 126814, 'loss/train': 0.9779621958732605} 11/07/2021 15:05:13 - INFO - __main__ - Step 126816: {'lr': 2.966995761146826e-05, 'samples': 24348672, 'steps': 126815, 'loss/train': 4.9911298751831055} 11/07/2021 15:05:13 - INFO - __main__ - Step 126817: {'lr': 2.9667450120467455e-05, 'samples': 24348864, 'steps': 126816, 'loss/train': 1.6016876697540283} 11/07/2021 15:05:13 - INFO - __main__ - Step 126818: {'lr': 2.9664942728745094e-05, 'samples': 24349056, 'steps': 126817, 'loss/train': 0.5825211405754089} 11/07/2021 15:05:14 - INFO - __main__ - Step 126819: {'lr': 2.9662435436302316e-05, 'samples': 24349248, 'steps': 126818, 'loss/train': 1.1969801187515259} 11/07/2021 15:05:15 - INFO - __main__ - Step 126820: {'lr': 2.965992824314029e-05, 'samples': 24349440, 'steps': 126819, 'loss/train': 1.3033316135406494} 11/07/2021 15:05:15 - INFO - __main__ - Step 126821: {'lr': 2.9657421149260066e-05, 'samples': 24349632, 'steps': 126820, 'loss/train': 1.4325575828552246} 11/07/2021 15:05:16 - INFO - __main__ - Step 126822: {'lr': 2.9654914154662786e-05, 'samples': 24349824, 'steps': 126821, 'loss/train': 0.9621691107749939} 11/07/2021 15:05:16 - INFO - __main__ - Step 126823: {'lr': 2.9652407259349616e-05, 'samples': 24350016, 'steps': 126822, 'loss/train': 0.9327643513679504} 11/07/2021 15:05:17 - INFO - __main__ - Step 126824: {'lr': 2.9649900463321668e-05, 'samples': 24350208, 'steps': 126823, 'loss/train': 0.07201182097196579} 11/07/2021 15:05:17 - INFO - __main__ - Step 126825: {'lr': 2.964739376658007e-05, 'samples': 24350400, 'steps': 126824, 'loss/train': 0.9712162017822266} 11/07/2021 15:05:18 - INFO - __main__ - Step 126826: {'lr': 2.9644887169125973e-05, 'samples': 24350592, 'steps': 126825, 'loss/train': 1.133947730064392} 11/07/2021 15:05:18 - INFO - __main__ - Step 126827: {'lr': 2.964238067096045e-05, 'samples': 24350784, 'steps': 126826, 'loss/train': 1.3803225755691528} 11/07/2021 15:05:18 - INFO - __main__ - Step 126828: {'lr': 2.96398742720847e-05, 'samples': 24350976, 'steps': 126827, 'loss/train': 1.1810426712036133} 11/07/2021 15:05:19 - INFO - __main__ - Step 126829: {'lr': 2.963736797249983e-05, 'samples': 24351168, 'steps': 126828, 'loss/train': 1.3731952905654907} 11/07/2021 15:05:20 - INFO - __main__ - Step 126830: {'lr': 2.963486177220695e-05, 'samples': 24351360, 'steps': 126829, 'loss/train': 1.233957290649414} 11/07/2021 15:05:20 - INFO - __main__ - Step 126831: {'lr': 2.9632355671207255e-05, 'samples': 24351552, 'steps': 126830, 'loss/train': 1.0346792936325073} 11/07/2021 15:05:20 - INFO - __main__ - Step 126832: {'lr': 2.9629849669501773e-05, 'samples': 24351744, 'steps': 126831, 'loss/train': 0.6313906908035278} 11/07/2021 15:05:21 - INFO - __main__ - Step 126833: {'lr': 2.962734376709167e-05, 'samples': 24351936, 'steps': 126832, 'loss/train': 1.2829265594482422} 11/07/2021 15:05:22 - INFO - __main__ - Step 126834: {'lr': 2.962483796397808e-05, 'samples': 24352128, 'steps': 126833, 'loss/train': 1.3977746963500977} 11/07/2021 15:05:22 - INFO - __main__ - Step 126835: {'lr': 2.9622332260162145e-05, 'samples': 24352320, 'steps': 126834, 'loss/train': 1.5702099800109863} 11/07/2021 15:05:22 - INFO - __main__ - Step 126836: {'lr': 2.9619826655645e-05, 'samples': 24352512, 'steps': 126835, 'loss/train': 1.0678788423538208} 11/07/2021 15:05:23 - INFO - __main__ - Step 126837: {'lr': 2.9617321150427728e-05, 'samples': 24352704, 'steps': 126836, 'loss/train': 1.4285613298416138} 11/07/2021 15:05:23 - INFO - __main__ - Step 126838: {'lr': 2.9614815744511526e-05, 'samples': 24352896, 'steps': 126837, 'loss/train': 1.046252965927124} 11/07/2021 15:05:23 - INFO - __main__ - Step 126839: {'lr': 2.9612310437897472e-05, 'samples': 24353088, 'steps': 126838, 'loss/train': 0.7920141220092773} 11/07/2021 15:05:25 - INFO - __main__ - Step 126840: {'lr': 2.9609805230586705e-05, 'samples': 24353280, 'steps': 126839, 'loss/train': 1.0873163938522339} 11/07/2021 15:05:25 - INFO - __main__ - Step 126841: {'lr': 2.9607300122580365e-05, 'samples': 24353472, 'steps': 126840, 'loss/train': 4.488575458526611} 11/07/2021 15:05:26 - INFO - __main__ - Step 126842: {'lr': 2.9604795113879563e-05, 'samples': 24353664, 'steps': 126841, 'loss/train': 1.1477972269058228} 11/07/2021 15:05:26 - INFO - __main__ - Step 126843: {'lr': 2.960229020448549e-05, 'samples': 24353856, 'steps': 126842, 'loss/train': 1.262538194656372} 11/07/2021 15:05:26 - INFO - __main__ - Step 126844: {'lr': 2.9599785394399197e-05, 'samples': 24354048, 'steps': 126843, 'loss/train': 1.0815154314041138} 11/07/2021 15:05:27 - INFO - __main__ - Step 126845: {'lr': 2.959728068362183e-05, 'samples': 24354240, 'steps': 126844, 'loss/train': 1.290226936340332} 11/07/2021 15:05:28 - INFO - __main__ - Step 126846: {'lr': 2.9594776072154523e-05, 'samples': 24354432, 'steps': 126845, 'loss/train': 1.5986977815628052} 11/07/2021 15:05:28 - INFO - __main__ - Step 126847: {'lr': 2.9592271559998413e-05, 'samples': 24354624, 'steps': 126846, 'loss/train': 1.1275253295898438} 11/07/2021 15:05:28 - INFO - __main__ - Step 126848: {'lr': 2.9589767147154613e-05, 'samples': 24354816, 'steps': 126847, 'loss/train': 1.418555498123169} 11/07/2021 15:05:29 - INFO - __main__ - Step 126849: {'lr': 2.9587262833624256e-05, 'samples': 24355008, 'steps': 126848, 'loss/train': 1.3022427558898926} 11/07/2021 15:05:30 - INFO - __main__ - Step 126850: {'lr': 2.9584758619408513e-05, 'samples': 24355200, 'steps': 126849, 'loss/train': 1.3352018594741821} 11/07/2021 15:05:30 - INFO - __main__ - Step 126851: {'lr': 2.9582254504508436e-05, 'samples': 24355392, 'steps': 126850, 'loss/train': 0.9988805055618286} 11/07/2021 15:05:30 - INFO - __main__ - Step 126852: {'lr': 2.9579750488925223e-05, 'samples': 24355584, 'steps': 126851, 'loss/train': 1.7374221086502075} 11/07/2021 15:05:31 - INFO - __main__ - Step 126853: {'lr': 2.957724657265995e-05, 'samples': 24355776, 'steps': 126852, 'loss/train': 1.2268550395965576} 11/07/2021 15:05:31 - INFO - __main__ - Step 126854: {'lr': 2.9574742755713785e-05, 'samples': 24355968, 'steps': 126853, 'loss/train': 1.3485937118530273} 11/07/2021 15:05:32 - INFO - __main__ - Step 126855: {'lr': 2.957223903808784e-05, 'samples': 24356160, 'steps': 126854, 'loss/train': 1.073049783706665} 11/07/2021 15:05:32 - INFO - __main__ - Step 126856: {'lr': 2.9569735419783306e-05, 'samples': 24356352, 'steps': 126855, 'loss/train': 1.077558994293213} 11/07/2021 15:05:33 - INFO - __main__ - Step 126857: {'lr': 2.9567231900801184e-05, 'samples': 24356544, 'steps': 126856, 'loss/train': 1.1374599933624268} 11/07/2021 15:05:33 - INFO - __main__ - Step 126858: {'lr': 2.9564728481142637e-05, 'samples': 24356736, 'steps': 126857, 'loss/train': 0.8750438094139099} 11/07/2021 15:05:34 - INFO - __main__ - Step 126859: {'lr': 2.9562225160808864e-05, 'samples': 24356928, 'steps': 126858, 'loss/train': 1.4413284063339233} 11/07/2021 15:05:35 - INFO - __main__ - Step 126860: {'lr': 2.9559721939800916e-05, 'samples': 24357120, 'steps': 126859, 'loss/train': 1.1352571249008179} 11/07/2021 15:05:35 - INFO - __main__ - Step 126861: {'lr': 2.9557218818119984e-05, 'samples': 24357312, 'steps': 126860, 'loss/train': 1.475502610206604} 11/07/2021 15:05:36 - INFO - __main__ - Step 126862: {'lr': 2.9554715795767157e-05, 'samples': 24357504, 'steps': 126861, 'loss/train': 0.40820685029029846} 11/07/2021 15:05:36 - INFO - __main__ - Step 126863: {'lr': 2.9552212872743566e-05, 'samples': 24357696, 'steps': 126862, 'loss/train': 1.4174162149429321} 11/07/2021 15:05:36 - INFO - __main__ - Step 126864: {'lr': 2.9549710049050353e-05, 'samples': 24357888, 'steps': 126863, 'loss/train': 1.7485060691833496} 11/07/2021 15:05:37 - INFO - __main__ - Step 126865: {'lr': 2.9547207324688657e-05, 'samples': 24358080, 'steps': 126864, 'loss/train': 2.328033447265625} 11/07/2021 15:05:38 - INFO - __main__ - Step 126866: {'lr': 2.9544704699659558e-05, 'samples': 24358272, 'steps': 126865, 'loss/train': 1.415738582611084} 11/07/2021 15:05:38 - INFO - __main__ - Step 126867: {'lr': 2.954220217396422e-05, 'samples': 24358464, 'steps': 126866, 'loss/train': 0.9951217770576477} 11/07/2021 15:05:39 - INFO - __main__ - Step 126868: {'lr': 2.9539699747603787e-05, 'samples': 24358656, 'steps': 126867, 'loss/train': 1.1344172954559326} 11/07/2021 15:05:39 - INFO - __main__ - Step 126869: {'lr': 2.9537197420579337e-05, 'samples': 24358848, 'steps': 126868, 'loss/train': 0.7826376557350159} 11/07/2021 15:05:39 - INFO - __main__ - Step 126870: {'lr': 2.9534695192892092e-05, 'samples': 24359040, 'steps': 126869, 'loss/train': 1.4746953248977661} 11/07/2021 15:05:40 - INFO - __main__ - Step 126871: {'lr': 2.953219306454305e-05, 'samples': 24359232, 'steps': 126870, 'loss/train': 1.1192225217819214} 11/07/2021 15:05:41 - INFO - __main__ - Step 126872: {'lr': 2.9529691035533406e-05, 'samples': 24359424, 'steps': 126871, 'loss/train': 0.9880653619766235} 11/07/2021 15:05:41 - INFO - __main__ - Step 126873: {'lr': 2.9527189105864272e-05, 'samples': 24359616, 'steps': 126872, 'loss/train': 1.3654662370681763} 11/07/2021 15:05:41 - INFO - __main__ - Step 126874: {'lr': 2.9524687275536782e-05, 'samples': 24359808, 'steps': 126873, 'loss/train': 1.0638024806976318} 11/07/2021 15:05:42 - INFO - __main__ - Step 126875: {'lr': 2.9522185544552077e-05, 'samples': 24360000, 'steps': 126874, 'loss/train': 1.5833173990249634} 11/07/2021 15:05:42 - INFO - __main__ - Step 126876: {'lr': 2.9519683912911265e-05, 'samples': 24360192, 'steps': 126875, 'loss/train': 1.0966248512268066} 11/07/2021 15:05:43 - INFO - __main__ - Step 126877: {'lr': 2.9517182380615488e-05, 'samples': 24360384, 'steps': 126876, 'loss/train': 1.6661778688430786} 11/07/2021 15:05:44 - INFO - __main__ - Step 126878: {'lr': 2.951468094766585e-05, 'samples': 24360576, 'steps': 126877, 'loss/train': 1.4325810670852661} 11/07/2021 15:05:44 - INFO - __main__ - Step 126879: {'lr': 2.9512179614063522e-05, 'samples': 24360768, 'steps': 126878, 'loss/train': 1.790384292602539} 11/07/2021 15:05:44 - INFO - __main__ - Step 126880: {'lr': 2.9509678379809583e-05, 'samples': 24360960, 'steps': 126879, 'loss/train': 1.229132890701294} 11/07/2021 15:05:45 - INFO - __main__ - Step 126881: {'lr': 2.9507177244905202e-05, 'samples': 24361152, 'steps': 126880, 'loss/train': 1.3947837352752686} 11/07/2021 15:05:46 - INFO - __main__ - Step 126882: {'lr': 2.950467620935146e-05, 'samples': 24361344, 'steps': 126881, 'loss/train': 1.3530499935150146} 11/07/2021 15:05:46 - INFO - __main__ - Step 126883: {'lr': 2.9502175273149577e-05, 'samples': 24361536, 'steps': 126882, 'loss/train': 1.3538018465042114} 11/07/2021 15:05:46 - INFO - __main__ - Step 126884: {'lr': 2.9499674436300556e-05, 'samples': 24361728, 'steps': 126883, 'loss/train': 1.894086241722107} 11/07/2021 15:05:47 - INFO - __main__ - Step 126885: {'lr': 2.949717369880556e-05, 'samples': 24361920, 'steps': 126884, 'loss/train': 1.314357042312622} 11/07/2021 15:05:47 - INFO - __main__ - Step 126886: {'lr': 2.949467306066575e-05, 'samples': 24362112, 'steps': 126885, 'loss/train': 1.3042949438095093} 11/07/2021 15:05:48 - INFO - __main__ - Step 126887: {'lr': 2.9492172521882242e-05, 'samples': 24362304, 'steps': 126886, 'loss/train': 0.9141592979431152} 11/07/2021 15:05:49 - INFO - __main__ - Step 126888: {'lr': 2.9489672082456147e-05, 'samples': 24362496, 'steps': 126887, 'loss/train': 1.115544080734253} 11/07/2021 15:05:49 - INFO - __main__ - Step 126889: {'lr': 2.948717174238863e-05, 'samples': 24362688, 'steps': 126888, 'loss/train': 0.09948873519897461} 11/07/2021 15:05:49 - INFO - __main__ - Step 126890: {'lr': 2.9484671501680772e-05, 'samples': 24362880, 'steps': 126889, 'loss/train': 1.2967112064361572} 11/07/2021 15:05:50 - INFO - __main__ - Step 126891: {'lr': 2.948217136033371e-05, 'samples': 24363072, 'steps': 126890, 'loss/train': 1.0190577507019043} 11/07/2021 15:05:51 - INFO - __main__ - Step 126892: {'lr': 2.9479671318348584e-05, 'samples': 24363264, 'steps': 126891, 'loss/train': 1.147585153579712} 11/07/2021 15:05:51 - INFO - __main__ - Step 126893: {'lr': 2.947717137572653e-05, 'samples': 24363456, 'steps': 126892, 'loss/train': 0.6402814388275146} 11/07/2021 15:05:51 - INFO - __main__ - Step 126894: {'lr': 2.9474671532468633e-05, 'samples': 24363648, 'steps': 126893, 'loss/train': 0.9762140512466431} 11/07/2021 15:05:52 - INFO - __main__ - Step 126895: {'lr': 2.947217178857606e-05, 'samples': 24363840, 'steps': 126894, 'loss/train': 1.7756990194320679} 11/07/2021 15:05:52 - INFO - __main__ - Step 126896: {'lr': 2.946967214404994e-05, 'samples': 24364032, 'steps': 126895, 'loss/train': 1.2521967887878418} 11/07/2021 15:05:53 - INFO - __main__ - Step 126897: {'lr': 2.9467172598891394e-05, 'samples': 24364224, 'steps': 126896, 'loss/train': 1.4778974056243896} 11/07/2021 15:05:54 - INFO - __main__ - Step 126898: {'lr': 2.946467315310153e-05, 'samples': 24364416, 'steps': 126897, 'loss/train': 5.660013675689697} 11/07/2021 15:05:54 - INFO - __main__ - Step 126899: {'lr': 2.9462173806681453e-05, 'samples': 24364608, 'steps': 126898, 'loss/train': 1.067379117012024} 11/07/2021 15:05:54 - INFO - __main__ - Step 126900: {'lr': 2.945967455963233e-05, 'samples': 24364800, 'steps': 126899, 'loss/train': 1.055120587348938} 11/07/2021 15:05:55 - INFO - __main__ - Step 126901: {'lr': 2.945717541195528e-05, 'samples': 24364992, 'steps': 126900, 'loss/train': 1.0861024856567383} 11/07/2021 15:05:55 - INFO - __main__ - Step 126902: {'lr': 2.94546763636514e-05, 'samples': 24365184, 'steps': 126901, 'loss/train': 1.7552142143249512} 11/07/2021 15:05:56 - INFO - __main__ - Step 126903: {'lr': 2.9452177414721865e-05, 'samples': 24365376, 'steps': 126902, 'loss/train': 1.1067558526992798} 11/07/2021 15:05:56 - INFO - __main__ - Step 126904: {'lr': 2.9449678565167752e-05, 'samples': 24365568, 'steps': 126903, 'loss/train': 0.49571576714515686} 11/07/2021 15:05:57 - INFO - __main__ - Step 126905: {'lr': 2.9447179814990234e-05, 'samples': 24365760, 'steps': 126904, 'loss/train': 1.3276365995407104} 11/07/2021 15:05:57 - INFO - __main__ - Step 126906: {'lr': 2.9444681164190385e-05, 'samples': 24365952, 'steps': 126905, 'loss/train': 1.2144293785095215} 11/07/2021 15:05:58 - INFO - __main__ - Step 126907: {'lr': 2.9442182612769375e-05, 'samples': 24366144, 'steps': 126906, 'loss/train': 1.5391980409622192} 11/07/2021 15:05:59 - INFO - __main__ - Step 126908: {'lr': 2.9439684160728315e-05, 'samples': 24366336, 'steps': 126907, 'loss/train': 1.4077885150909424} 11/07/2021 15:05:59 - INFO - __main__ - Step 126909: {'lr': 2.943718580806834e-05, 'samples': 24366528, 'steps': 126908, 'loss/train': 0.9801421761512756} 11/07/2021 15:05:59 - INFO - __main__ - Step 126910: {'lr': 2.9434687554790618e-05, 'samples': 24366720, 'steps': 126909, 'loss/train': 1.0667306184768677} 11/07/2021 15:06:00 - INFO - __main__ - Step 126911: {'lr': 2.9432189400896146e-05, 'samples': 24366912, 'steps': 126910, 'loss/train': 1.4656609296798706} 11/07/2021 15:06:00 - INFO - __main__ - Step 126912: {'lr': 2.942969134638615e-05, 'samples': 24367104, 'steps': 126911, 'loss/train': 1.4064735174179077} 11/07/2021 15:06:01 - INFO - __main__ - Step 126913: {'lr': 2.942719339126171e-05, 'samples': 24367296, 'steps': 126912, 'loss/train': 1.586273193359375} 11/07/2021 15:06:01 - INFO - __main__ - Step 126914: {'lr': 2.9424695535523987e-05, 'samples': 24367488, 'steps': 126913, 'loss/train': 1.3772951364517212} 11/07/2021 15:06:02 - INFO - __main__ - Step 126915: {'lr': 2.9422197779174098e-05, 'samples': 24367680, 'steps': 126914, 'loss/train': 1.3863133192062378} 11/07/2021 15:06:02 - INFO - __main__ - Step 126916: {'lr': 2.941970012221315e-05, 'samples': 24367872, 'steps': 126915, 'loss/train': 1.2839906215667725} 11/07/2021 15:06:02 - INFO - __main__ - Step 126917: {'lr': 2.9417202564642282e-05, 'samples': 24368064, 'steps': 126916, 'loss/train': 1.4068427085876465} 11/07/2021 15:06:03 - INFO - __main__ - Step 126918: {'lr': 2.941470510646263e-05, 'samples': 24368256, 'steps': 126917, 'loss/train': 1.1362968683242798} 11/07/2021 15:06:04 - INFO - __main__ - Step 126919: {'lr': 2.941220774767528e-05, 'samples': 24368448, 'steps': 126918, 'loss/train': 1.0061029195785522} 11/07/2021 15:06:04 - INFO - __main__ - Step 126920: {'lr': 2.940971048828142e-05, 'samples': 24368640, 'steps': 126919, 'loss/train': 1.3476675748825073} 11/07/2021 15:06:05 - INFO - __main__ - Step 126921: {'lr': 2.940721332828211e-05, 'samples': 24368832, 'steps': 126920, 'loss/train': 1.0181231498718262} 11/07/2021 15:06:05 - INFO - __main__ - Step 126922: {'lr': 2.9404716267678543e-05, 'samples': 24369024, 'steps': 126921, 'loss/train': 1.227590560913086} 11/07/2021 15:06:05 - INFO - __main__ - Step 126923: {'lr': 2.9402219306471773e-05, 'samples': 24369216, 'steps': 126922, 'loss/train': 1.066246509552002} 11/07/2021 15:06:06 - INFO - __main__ - Step 126924: {'lr': 2.939972244466302e-05, 'samples': 24369408, 'steps': 126923, 'loss/train': 1.016726016998291} 11/07/2021 15:06:07 - INFO - __main__ - Step 126925: {'lr': 2.9397225682253308e-05, 'samples': 24369600, 'steps': 126924, 'loss/train': 0.03605438396334648} 11/07/2021 15:06:07 - INFO - __main__ - Step 126926: {'lr': 2.939472901924381e-05, 'samples': 24369792, 'steps': 126925, 'loss/train': 1.3980962038040161} 11/07/2021 15:06:07 - INFO - __main__ - Step 126927: {'lr': 2.9392232455635604e-05, 'samples': 24369984, 'steps': 126926, 'loss/train': 0.8378463983535767} 11/07/2021 15:06:08 - INFO - __main__ - Step 126928: {'lr': 2.9389735991429883e-05, 'samples': 24370176, 'steps': 126927, 'loss/train': 1.2756874561309814} 11/07/2021 15:06:09 - INFO - __main__ - Step 126929: {'lr': 2.9387239626627733e-05, 'samples': 24370368, 'steps': 126928, 'loss/train': 1.3805992603302002} 11/07/2021 15:06:09 - INFO - __main__ - Step 126930: {'lr': 2.9384743361230286e-05, 'samples': 24370560, 'steps': 126929, 'loss/train': 1.581145167350769} 11/07/2021 15:06:09 - INFO - __main__ - Step 126931: {'lr': 2.938224719523869e-05, 'samples': 24370752, 'steps': 126930, 'loss/train': 0.8278349041938782} 11/07/2021 15:06:10 - INFO - __main__ - Step 126932: {'lr': 2.937975112865404e-05, 'samples': 24370944, 'steps': 126931, 'loss/train': 1.5008881092071533} 11/07/2021 15:06:10 - INFO - __main__ - Step 126933: {'lr': 2.937725516147746e-05, 'samples': 24371136, 'steps': 126932, 'loss/train': 1.2130894660949707} 11/07/2021 15:06:11 - INFO - __main__ - Step 126934: {'lr': 2.9374759293710084e-05, 'samples': 24371328, 'steps': 126933, 'loss/train': 1.2146633863449097} 11/07/2021 15:06:12 - INFO - __main__ - Step 126935: {'lr': 2.9372263525353048e-05, 'samples': 24371520, 'steps': 126934, 'loss/train': 1.0973353385925293} 11/07/2021 15:06:12 - INFO - __main__ - Step 126936: {'lr': 2.9369767856407465e-05, 'samples': 24371712, 'steps': 126935, 'loss/train': 0.5630989074707031} 11/07/2021 15:06:12 - INFO - __main__ - Step 126937: {'lr': 2.9367272286874497e-05, 'samples': 24371904, 'steps': 126936, 'loss/train': 1.3559484481811523} 11/07/2021 15:06:13 - INFO - __main__ - Step 126938: {'lr': 2.93647768167552e-05, 'samples': 24372096, 'steps': 126937, 'loss/train': 1.1386107206344604} 11/07/2021 15:06:14 - INFO - __main__ - Step 126939: {'lr': 2.9362281446050714e-05, 'samples': 24372288, 'steps': 126938, 'loss/train': 1.2945767641067505} 11/07/2021 15:06:14 - INFO - __main__ - Step 126940: {'lr': 2.9359786174762177e-05, 'samples': 24372480, 'steps': 126939, 'loss/train': 1.2098630666732788} 11/07/2021 15:06:15 - INFO - __main__ - Step 126941: {'lr': 2.9357291002890723e-05, 'samples': 24372672, 'steps': 126940, 'loss/train': 1.3008488416671753} 11/07/2021 15:06:15 - INFO - __main__ - Step 126942: {'lr': 2.9354795930437468e-05, 'samples': 24372864, 'steps': 126941, 'loss/train': 0.39308083057403564} 11/07/2021 15:06:15 - INFO - __main__ - Step 126943: {'lr': 2.9352300957403543e-05, 'samples': 24373056, 'steps': 126942, 'loss/train': 1.3524158000946045} 11/07/2021 15:06:17 - INFO - __main__ - Step 126944: {'lr': 2.9349806083790066e-05, 'samples': 24373248, 'steps': 126943, 'loss/train': 0.9262962341308594} 11/07/2021 15:06:17 - INFO - __main__ - Step 126945: {'lr': 2.934731130959814e-05, 'samples': 24373440, 'steps': 126944, 'loss/train': 1.202694058418274} 11/07/2021 15:06:17 - INFO - __main__ - Step 126946: {'lr': 2.9344816634828935e-05, 'samples': 24373632, 'steps': 126945, 'loss/train': 0.7104070782661438} 11/07/2021 15:06:18 - INFO - __main__ - Step 126947: {'lr': 2.9342322059483562e-05, 'samples': 24373824, 'steps': 126946, 'loss/train': 0.7712755799293518} 11/07/2021 15:06:18 - INFO - __main__ - Step 126948: {'lr': 2.93398275835631e-05, 'samples': 24374016, 'steps': 126947, 'loss/train': 1.0372447967529297} 11/07/2021 15:06:19 - INFO - __main__ - Step 126949: {'lr': 2.9337333207068718e-05, 'samples': 24374208, 'steps': 126948, 'loss/train': 1.4612326622009277} 11/07/2021 15:06:19 - INFO - __main__ - Step 126950: {'lr': 2.933483893000158e-05, 'samples': 24374400, 'steps': 126949, 'loss/train': 1.3652054071426392} 11/07/2021 15:06:20 - INFO - __main__ - Step 126951: {'lr': 2.933234475236271e-05, 'samples': 24374592, 'steps': 126950, 'loss/train': 1.3746718168258667} 11/07/2021 15:06:20 - INFO - __main__ - Step 126952: {'lr': 2.932985067415328e-05, 'samples': 24374784, 'steps': 126951, 'loss/train': 1.1961543560028076} 11/07/2021 15:06:20 - INFO - __main__ - Step 126953: {'lr': 2.9327356695374425e-05, 'samples': 24374976, 'steps': 126952, 'loss/train': 1.3090838193893433} 11/07/2021 15:06:21 - INFO - __main__ - Step 126954: {'lr': 2.9324862816027252e-05, 'samples': 24375168, 'steps': 126953, 'loss/train': 0.4926491379737854} 11/07/2021 15:06:22 - INFO - __main__ - Step 126955: {'lr': 2.9322369036112878e-05, 'samples': 24375360, 'steps': 126954, 'loss/train': 0.8827235698699951} 11/07/2021 15:06:22 - INFO - __main__ - Step 126956: {'lr': 2.9319875355632463e-05, 'samples': 24375552, 'steps': 126955, 'loss/train': 1.3237814903259277} 11/07/2021 15:06:22 - INFO - __main__ - Step 126957: {'lr': 2.9317381774587093e-05, 'samples': 24375744, 'steps': 126956, 'loss/train': 1.0677863359451294} 11/07/2021 15:06:23 - INFO - __main__ - Step 126958: {'lr': 2.9314888292977905e-05, 'samples': 24375936, 'steps': 126957, 'loss/train': 1.3569903373718262} 11/07/2021 15:06:24 - INFO - __main__ - Step 126959: {'lr': 2.9312394910806005e-05, 'samples': 24376128, 'steps': 126958, 'loss/train': 1.2615290880203247} 11/07/2021 15:06:24 - INFO - __main__ - Step 126960: {'lr': 2.9309901628072567e-05, 'samples': 24376320, 'steps': 126959, 'loss/train': 1.4873799085617065} 11/07/2021 15:06:25 - INFO - __main__ - Step 126961: {'lr': 2.9307408444778666e-05, 'samples': 24376512, 'steps': 126960, 'loss/train': 1.1932555437088013} 11/07/2021 15:06:25 - INFO - __main__ - Step 126962: {'lr': 2.9304915360925444e-05, 'samples': 24376704, 'steps': 126961, 'loss/train': 1.1874301433563232} 11/07/2021 15:06:25 - INFO - __main__ - Step 126963: {'lr': 2.9302422376514008e-05, 'samples': 24376896, 'steps': 126962, 'loss/train': 0.9084793329238892} 11/07/2021 15:06:26 - INFO - __main__ - Step 126964: {'lr': 2.929992949154556e-05, 'samples': 24377088, 'steps': 126963, 'loss/train': 1.5723627805709839} 11/07/2021 15:06:27 - INFO - __main__ - Step 126965: {'lr': 2.9297436706021115e-05, 'samples': 24377280, 'steps': 126964, 'loss/train': 1.1902658939361572} 11/07/2021 15:06:27 - INFO - __main__ - Step 126966: {'lr': 2.9294944019941815e-05, 'samples': 24377472, 'steps': 126965, 'loss/train': 1.0016790628433228} 11/07/2021 15:06:28 - INFO - __main__ - Step 126967: {'lr': 2.9292451433308832e-05, 'samples': 24377664, 'steps': 126966, 'loss/train': 0.9324924349784851} 11/07/2021 15:06:28 - INFO - __main__ - Step 126968: {'lr': 2.928995894612324e-05, 'samples': 24377856, 'steps': 126967, 'loss/train': 0.5078957080841064} 11/07/2021 15:06:28 - INFO - __main__ - Step 126969: {'lr': 2.928746655838621e-05, 'samples': 24378048, 'steps': 126968, 'loss/train': 1.556564450263977} 11/07/2021 15:06:29 - INFO - __main__ - Step 126970: {'lr': 2.9284974270098824e-05, 'samples': 24378240, 'steps': 126969, 'loss/train': 1.3210471868515015} 11/07/2021 15:06:30 - INFO - __main__ - Step 126971: {'lr': 2.9282482081262247e-05, 'samples': 24378432, 'steps': 126970, 'loss/train': 1.2472962141036987} 11/07/2021 15:06:30 - INFO - __main__ - Step 126972: {'lr': 2.927998999187756e-05, 'samples': 24378624, 'steps': 126971, 'loss/train': 1.3313095569610596} 11/07/2021 15:06:30 - INFO - __main__ - Step 126973: {'lr': 2.9277498001945902e-05, 'samples': 24378816, 'steps': 126972, 'loss/train': 1.053412914276123} 11/07/2021 15:06:31 - INFO - __main__ - Step 126974: {'lr': 2.927500611146841e-05, 'samples': 24379008, 'steps': 126973, 'loss/train': 1.3706159591674805} 11/07/2021 15:06:32 - INFO - __main__ - Step 126975: {'lr': 2.927251432044617e-05, 'samples': 24379200, 'steps': 126974, 'loss/train': 0.5449742078781128} 11/07/2021 15:06:32 - INFO - __main__ - Step 126976: {'lr': 2.9270022628880343e-05, 'samples': 24379392, 'steps': 126975, 'loss/train': 1.349137783050537} 11/07/2021 15:06:33 - INFO - __main__ - Step 126977: {'lr': 2.9267531036772098e-05, 'samples': 24379584, 'steps': 126976, 'loss/train': 1.2760107517242432} 11/07/2021 15:06:33 - INFO - __main__ - Step 126978: {'lr': 2.9265039544122434e-05, 'samples': 24379776, 'steps': 126977, 'loss/train': 1.3223850727081299} 11/07/2021 15:06:33 - INFO - __main__ - Step 126979: {'lr': 2.9262548150932544e-05, 'samples': 24379968, 'steps': 126978, 'loss/train': 1.2340563535690308} 11/07/2021 15:06:34 - INFO - __main__ - Step 126980: {'lr': 2.9260056857203537e-05, 'samples': 24380160, 'steps': 126979, 'loss/train': 1.2298698425292969} 11/07/2021 15:06:35 - INFO - __main__ - Step 126981: {'lr': 2.9257565662936554e-05, 'samples': 24380352, 'steps': 126980, 'loss/train': 0.8793227076530457} 11/07/2021 15:06:35 - INFO - __main__ - Step 126982: {'lr': 2.9255074568132702e-05, 'samples': 24380544, 'steps': 126981, 'loss/train': 1.538927435874939} 11/07/2021 15:06:35 - INFO - __main__ - Step 126983: {'lr': 2.925258357279309e-05, 'samples': 24380736, 'steps': 126982, 'loss/train': 1.4626708030700684} 11/07/2021 15:06:36 - INFO - __main__ - Step 126984: {'lr': 2.9250092676918887e-05, 'samples': 24380928, 'steps': 126983, 'loss/train': 1.1874629259109497} 11/07/2021 15:06:37 - INFO - __main__ - Step 126985: {'lr': 2.9247601880511176e-05, 'samples': 24381120, 'steps': 126984, 'loss/train': 1.218838095664978} 11/07/2021 15:06:37 - INFO - __main__ - Step 126986: {'lr': 2.9245111183571066e-05, 'samples': 24381312, 'steps': 126985, 'loss/train': 1.406091570854187} 11/07/2021 15:06:37 - INFO - __main__ - Step 126987: {'lr': 2.9242620586099723e-05, 'samples': 24381504, 'steps': 126986, 'loss/train': 2.217592477798462} 11/07/2021 15:06:38 - INFO - __main__ - Step 126988: {'lr': 2.9240130088098254e-05, 'samples': 24381696, 'steps': 126987, 'loss/train': 0.17078427970409393} 11/07/2021 15:06:38 - INFO - __main__ - Step 126989: {'lr': 2.9237639689567746e-05, 'samples': 24381888, 'steps': 126988, 'loss/train': 1.2414098978042603} 11/07/2021 15:06:39 - INFO - __main__ - Step 126990: {'lr': 2.923514939050939e-05, 'samples': 24382080, 'steps': 126989, 'loss/train': 0.8868499994277954} 11/07/2021 15:06:39 - INFO - __main__ - Step 126991: {'lr': 2.92326591909243e-05, 'samples': 24382272, 'steps': 126990, 'loss/train': 1.367357611656189} 11/07/2021 15:06:40 - INFO - __main__ - Step 126992: {'lr': 2.9230169090813525e-05, 'samples': 24382464, 'steps': 126991, 'loss/train': 1.3134852647781372} 11/07/2021 15:06:40 - INFO - __main__ - Step 126993: {'lr': 2.9227679090178205e-05, 'samples': 24382656, 'steps': 126992, 'loss/train': 1.4106251001358032} 11/07/2021 15:06:41 - INFO - __main__ - Step 126994: {'lr': 2.9225189189019508e-05, 'samples': 24382848, 'steps': 126993, 'loss/train': 1.5031074285507202} 11/07/2021 15:06:41 - INFO - __main__ - Step 126995: {'lr': 2.9222699387338542e-05, 'samples': 24383040, 'steps': 126994, 'loss/train': 0.8304315805435181} 11/07/2021 15:06:42 - INFO - __main__ - Step 126996: {'lr': 2.922020968513639e-05, 'samples': 24383232, 'steps': 126995, 'loss/train': 1.8839434385299683} 11/07/2021 15:06:42 - INFO - __main__ - Step 126997: {'lr': 2.9217720082414216e-05, 'samples': 24383424, 'steps': 126996, 'loss/train': 1.1875888109207153} 11/07/2021 15:06:43 - INFO - __main__ - Step 126998: {'lr': 2.9215230579173136e-05, 'samples': 24383616, 'steps': 126997, 'loss/train': 1.1037436723709106} 11/07/2021 15:06:43 - INFO - __main__ - Step 126999: {'lr': 2.921274117541428e-05, 'samples': 24383808, 'steps': 126998, 'loss/train': 1.057318925857544} 11/07/2021 15:06:44 - INFO - __main__ - Step 127000: {'lr': 2.921025187113874e-05, 'samples': 24384000, 'steps': 126999, 'loss/train': 1.3938716650009155} 11/07/2021 15:06:44 - INFO - __main__ - Step 127001: {'lr': 2.920776266634767e-05, 'samples': 24384192, 'steps': 127000, 'loss/train': 1.0745797157287598} 11/07/2021 15:06:45 - INFO - __main__ - Step 127002: {'lr': 2.9205273561042162e-05, 'samples': 24384384, 'steps': 127001, 'loss/train': 1.6203172206878662} 11/07/2021 15:06:45 - INFO - __main__ - Step 127003: {'lr': 2.920278455522335e-05, 'samples': 24384576, 'steps': 127002, 'loss/train': 1.3121589422225952} 11/07/2021 15:06:45 - INFO - __main__ - Step 127004: {'lr': 2.92002956488924e-05, 'samples': 24384768, 'steps': 127003, 'loss/train': 1.1515262126922607} 11/07/2021 15:06:46 - INFO - __main__ - Step 127005: {'lr': 2.919780684205034e-05, 'samples': 24384960, 'steps': 127004, 'loss/train': 1.7601840496063232} 11/07/2021 15:06:47 - INFO - __main__ - Step 127006: {'lr': 2.919531813469836e-05, 'samples': 24385152, 'steps': 127005, 'loss/train': 0.779609739780426} 11/07/2021 15:06:47 - INFO - __main__ - Step 127007: {'lr': 2.919282952683755e-05, 'samples': 24385344, 'steps': 127006, 'loss/train': 1.401652455329895} 11/07/2021 15:06:48 - INFO - __main__ - Step 127008: {'lr': 2.919034101846904e-05, 'samples': 24385536, 'steps': 127007, 'loss/train': 0.03173646330833435} 11/07/2021 15:06:48 - INFO - __main__ - Step 127009: {'lr': 2.9187852609593946e-05, 'samples': 24385728, 'steps': 127008, 'loss/train': 1.1446620225906372} 11/07/2021 15:06:48 - INFO - __main__ - Step 127010: {'lr': 2.918536430021343e-05, 'samples': 24385920, 'steps': 127009, 'loss/train': 1.3449437618255615} 11/07/2021 15:06:49 - INFO - __main__ - Step 127011: {'lr': 2.9182876090328548e-05, 'samples': 24386112, 'steps': 127010, 'loss/train': 1.1678407192230225} 11/07/2021 15:06:50 - INFO - __main__ - Step 127012: {'lr': 2.9180387979940464e-05, 'samples': 24386304, 'steps': 127011, 'loss/train': 0.9544244408607483} 11/07/2021 15:06:50 - INFO - __main__ - Step 127013: {'lr': 2.917789996905029e-05, 'samples': 24386496, 'steps': 127012, 'loss/train': 1.4543036222457886} 11/07/2021 15:06:50 - INFO - __main__ - Step 127014: {'lr': 2.9175412057659167e-05, 'samples': 24386688, 'steps': 127013, 'loss/train': 0.38475096225738525} 11/07/2021 15:06:51 - INFO - __main__ - Step 127015: {'lr': 2.9172924245768174e-05, 'samples': 24386880, 'steps': 127014, 'loss/train': 1.3679401874542236} 11/07/2021 15:06:52 - INFO - __main__ - Step 127016: {'lr': 2.9170436533378476e-05, 'samples': 24387072, 'steps': 127015, 'loss/train': 1.5278143882751465} 11/07/2021 15:06:52 - INFO - __main__ - Step 127017: {'lr': 2.9167948920491155e-05, 'samples': 24387264, 'steps': 127016, 'loss/train': 1.3454902172088623} 11/07/2021 15:06:53 - INFO - __main__ - Step 127018: {'lr': 2.916546140710738e-05, 'samples': 24387456, 'steps': 127017, 'loss/train': 1.2886849641799927} 11/07/2021 15:06:53 - INFO - __main__ - Step 127019: {'lr': 2.916297399322823e-05, 'samples': 24387648, 'steps': 127018, 'loss/train': 0.9842585325241089} 11/07/2021 15:06:53 - INFO - __main__ - Step 127020: {'lr': 2.916048667885479e-05, 'samples': 24387840, 'steps': 127019, 'loss/train': 1.4906851053237915} 11/07/2021 15:06:54 - INFO - __main__ - Step 127021: {'lr': 2.9157999463988255e-05, 'samples': 24388032, 'steps': 127020, 'loss/train': 1.7402328252792358} 11/07/2021 15:06:55 - INFO - __main__ - Step 127022: {'lr': 2.915551234862973e-05, 'samples': 24388224, 'steps': 127021, 'loss/train': 1.3479585647583008} 11/07/2021 15:06:55 - INFO - __main__ - Step 127023: {'lr': 2.9153025332780304e-05, 'samples': 24388416, 'steps': 127022, 'loss/train': 1.2515335083007812} 11/07/2021 15:06:55 - INFO - __main__ - Step 127024: {'lr': 2.9150538416441135e-05, 'samples': 24388608, 'steps': 127023, 'loss/train': 1.4395506381988525} 11/07/2021 15:06:56 - INFO - __main__ - Step 127025: {'lr': 2.914805159961331e-05, 'samples': 24388800, 'steps': 127024, 'loss/train': 1.4230380058288574} 11/07/2021 15:06:57 - INFO - __main__ - Step 127026: {'lr': 2.914556488229797e-05, 'samples': 24388992, 'steps': 127025, 'loss/train': 1.5677322149276733} 11/07/2021 15:06:57 - INFO - __main__ - Step 127027: {'lr': 2.914307826449622e-05, 'samples': 24389184, 'steps': 127026, 'loss/train': 0.8613438010215759} 11/07/2021 15:06:57 - INFO - __main__ - Step 127028: {'lr': 2.9140591746209198e-05, 'samples': 24389376, 'steps': 127027, 'loss/train': 1.5414406061172485} 11/07/2021 15:06:58 - INFO - __main__ - Step 127029: {'lr': 2.913810532743802e-05, 'samples': 24389568, 'steps': 127028, 'loss/train': 1.1539127826690674} 11/07/2021 15:06:58 - INFO - __main__ - Step 127030: {'lr': 2.913561900818379e-05, 'samples': 24389760, 'steps': 127029, 'loss/train': 1.2068321704864502} 11/07/2021 15:06:59 - INFO - __main__ - Step 127031: {'lr': 2.91331327884477e-05, 'samples': 24389952, 'steps': 127030, 'loss/train': 1.4989012479782104} 11/07/2021 15:07:00 - INFO - __main__ - Step 127032: {'lr': 2.9130646668230788e-05, 'samples': 24390144, 'steps': 127031, 'loss/train': 1.1705747842788696} 11/07/2021 15:07:00 - INFO - __main__ - Step 127033: {'lr': 2.912816064753415e-05, 'samples': 24390336, 'steps': 127032, 'loss/train': 0.9626666903495789} 11/07/2021 15:07:00 - INFO - __main__ - Step 127034: {'lr': 2.9125674726358993e-05, 'samples': 24390528, 'steps': 127033, 'loss/train': 1.3246148824691772} 11/07/2021 15:07:01 - INFO - __main__ - Step 127035: {'lr': 2.9123188904706387e-05, 'samples': 24390720, 'steps': 127034, 'loss/train': 1.3402787446975708} 11/07/2021 15:07:02 - INFO - __main__ - Step 127036: {'lr': 2.9120703182577452e-05, 'samples': 24390912, 'steps': 127035, 'loss/train': 1.213515043258667} 11/07/2021 15:07:02 - INFO - __main__ - Step 127037: {'lr': 2.9118217559973348e-05, 'samples': 24391104, 'steps': 127036, 'loss/train': 1.061386227607727} 11/07/2021 15:07:02 - INFO - __main__ - Step 127038: {'lr': 2.9115732036895133e-05, 'samples': 24391296, 'steps': 127037, 'loss/train': 1.1126246452331543} 11/07/2021 15:07:03 - INFO - __main__ - Step 127039: {'lr': 2.9113246613343998e-05, 'samples': 24391488, 'steps': 127038, 'loss/train': 1.2909400463104248} 11/07/2021 15:07:03 - INFO - __main__ - Step 127040: {'lr': 2.9110761289320996e-05, 'samples': 24391680, 'steps': 127039, 'loss/train': 1.3924987316131592} 11/07/2021 15:07:04 - INFO - __main__ - Step 127041: {'lr': 2.9108276064827272e-05, 'samples': 24391872, 'steps': 127040, 'loss/train': 0.9623072743415833} 11/07/2021 15:07:04 - INFO - __main__ - Step 127042: {'lr': 2.9105790939863985e-05, 'samples': 24392064, 'steps': 127041, 'loss/train': 1.3188896179199219} 11/07/2021 15:07:05 - INFO - __main__ - Step 127043: {'lr': 2.9103305914432192e-05, 'samples': 24392256, 'steps': 127042, 'loss/train': 1.2224416732788086} 11/07/2021 15:07:05 - INFO - __main__ - Step 127044: {'lr': 2.9100820988533032e-05, 'samples': 24392448, 'steps': 127043, 'loss/train': 1.2878665924072266} 11/07/2021 15:07:06 - INFO - __main__ - Step 127045: {'lr': 2.9098336162167698e-05, 'samples': 24392640, 'steps': 127044, 'loss/train': 0.05405452474951744} 11/07/2021 15:07:07 - INFO - __main__ - Step 127046: {'lr': 2.9095851435337218e-05, 'samples': 24392832, 'steps': 127045, 'loss/train': 0.5619896650314331} 11/07/2021 15:07:07 - INFO - __main__ - Step 127047: {'lr': 2.9093366808042698e-05, 'samples': 24393024, 'steps': 127046, 'loss/train': 1.40402352809906} 11/07/2021 15:07:07 - INFO - __main__ - Step 127048: {'lr': 2.9090882280285337e-05, 'samples': 24393216, 'steps': 127047, 'loss/train': 1.1240465641021729} 11/07/2021 15:07:08 - INFO - __main__ - Step 127049: {'lr': 2.9088397852066182e-05, 'samples': 24393408, 'steps': 127048, 'loss/train': 1.4197278022766113} 11/07/2021 15:07:08 - INFO - __main__ - Step 127050: {'lr': 2.908591352338641e-05, 'samples': 24393600, 'steps': 127049, 'loss/train': 1.0905507802963257} 11/07/2021 15:07:09 - INFO - __main__ - Step 127051: {'lr': 2.9083429294247092e-05, 'samples': 24393792, 'steps': 127050, 'loss/train': 1.4928030967712402} 11/07/2021 15:07:10 - INFO - __main__ - Step 127052: {'lr': 2.9080945164649376e-05, 'samples': 24393984, 'steps': 127051, 'loss/train': 2.0122265815734863} 11/07/2021 15:07:10 - INFO - __main__ - Step 127053: {'lr': 2.9078461134594392e-05, 'samples': 24394176, 'steps': 127052, 'loss/train': 1.5279806852340698} 11/07/2021 15:07:10 - INFO - __main__ - Step 127054: {'lr': 2.9075977204083252e-05, 'samples': 24394368, 'steps': 127053, 'loss/train': 0.760441243648529} 11/07/2021 15:07:11 - INFO - __main__ - Step 127055: {'lr': 2.9073493373117044e-05, 'samples': 24394560, 'steps': 127054, 'loss/train': 1.1829190254211426} 11/07/2021 15:07:11 - INFO - __main__ - Step 127056: {'lr': 2.9071009641696983e-05, 'samples': 24394752, 'steps': 127055, 'loss/train': 1.50046968460083} 11/07/2021 15:07:12 - INFO - __main__ - Step 127057: {'lr': 2.9068526009824043e-05, 'samples': 24394944, 'steps': 127056, 'loss/train': 0.8159464597702026} 11/07/2021 15:07:12 - INFO - __main__ - Step 127058: {'lr': 2.9066042477499416e-05, 'samples': 24395136, 'steps': 127057, 'loss/train': 1.255865216255188} 11/07/2021 15:07:13 - INFO - __main__ - Step 127059: {'lr': 2.9063559044724243e-05, 'samples': 24395328, 'steps': 127058, 'loss/train': 1.1310242414474487} 11/07/2021 15:07:13 - INFO - __main__ - Step 127060: {'lr': 2.9061075711499602e-05, 'samples': 24395520, 'steps': 127059, 'loss/train': 0.9755403399467468} 11/07/2021 15:07:13 - INFO - __main__ - Step 127061: {'lr': 2.9058592477826635e-05, 'samples': 24395712, 'steps': 127060, 'loss/train': 0.8873434662818909} 11/07/2021 15:07:14 - INFO - __main__ - Step 127062: {'lr': 2.9056109343706477e-05, 'samples': 24395904, 'steps': 127061, 'loss/train': 1.1580270528793335} 11/07/2021 15:07:15 - INFO - __main__ - Step 127063: {'lr': 2.9053626309140212e-05, 'samples': 24396096, 'steps': 127062, 'loss/train': 0.6997900605201721} 11/07/2021 15:07:15 - INFO - __main__ - Step 127064: {'lr': 2.9051143374128952e-05, 'samples': 24396288, 'steps': 127063, 'loss/train': 1.321681261062622} 11/07/2021 15:07:16 - INFO - __main__ - Step 127065: {'lr': 2.904866053867386e-05, 'samples': 24396480, 'steps': 127064, 'loss/train': 1.4897935390472412} 11/07/2021 15:07:16 - INFO - __main__ - Step 127066: {'lr': 2.904617780277605e-05, 'samples': 24396672, 'steps': 127065, 'loss/train': 1.50031578540802} 11/07/2021 15:07:17 - INFO - __main__ - Step 127067: {'lr': 2.9043695166436652e-05, 'samples': 24396864, 'steps': 127066, 'loss/train': 1.268962025642395} 11/07/2021 15:07:17 - INFO - __main__ - Step 127068: {'lr': 2.90412126296567e-05, 'samples': 24397056, 'steps': 127067, 'loss/train': 0.8930447697639465} 11/07/2021 15:07:18 - INFO - __main__ - Step 127069: {'lr': 2.9038730192437384e-05, 'samples': 24397248, 'steps': 127068, 'loss/train': 1.5876384973526} 11/07/2021 15:07:18 - INFO - __main__ - Step 127070: {'lr': 2.903624785477979e-05, 'samples': 24397440, 'steps': 127069, 'loss/train': 1.3861196041107178} 11/07/2021 15:07:18 - INFO - __main__ - Step 127071: {'lr': 2.9033765616685055e-05, 'samples': 24397632, 'steps': 127070, 'loss/train': 1.3951315879821777} 11/07/2021 15:07:19 - INFO - __main__ - Step 127072: {'lr': 2.9031283478154284e-05, 'samples': 24397824, 'steps': 127071, 'loss/train': 1.3898106813430786} 11/07/2021 15:07:20 - INFO - __main__ - Step 127073: {'lr': 2.9028801439188625e-05, 'samples': 24398016, 'steps': 127072, 'loss/train': 0.8186616897583008} 11/07/2021 15:07:20 - INFO - __main__ - Step 127074: {'lr': 2.9026319499789177e-05, 'samples': 24398208, 'steps': 127073, 'loss/train': 1.2574341297149658} 11/07/2021 15:07:21 - INFO - __main__ - Step 127075: {'lr': 2.902383765995706e-05, 'samples': 24398400, 'steps': 127074, 'loss/train': 1.098360300064087} 11/07/2021 15:07:21 - INFO - __main__ - Step 127076: {'lr': 2.9021355919693405e-05, 'samples': 24398592, 'steps': 127075, 'loss/train': 1.356166124343872} 11/07/2021 15:07:21 - INFO - __main__ - Step 127077: {'lr': 2.9018874278999297e-05, 'samples': 24398784, 'steps': 127076, 'loss/train': 1.7115256786346436} 11/07/2021 15:07:22 - INFO - __main__ - Step 127078: {'lr': 2.901639273787593e-05, 'samples': 24398976, 'steps': 127077, 'loss/train': 1.0047667026519775} 11/07/2021 15:07:23 - INFO - __main__ - Step 127079: {'lr': 2.901391129632433e-05, 'samples': 24399168, 'steps': 127078, 'loss/train': 1.5306442975997925} 11/07/2021 15:07:23 - INFO - __main__ - Step 127080: {'lr': 2.9011429954345636e-05, 'samples': 24399360, 'steps': 127079, 'loss/train': 1.2987583875656128} 11/07/2021 15:07:23 - INFO - __main__ - Step 127081: {'lr': 2.9008948711940985e-05, 'samples': 24399552, 'steps': 127080, 'loss/train': 1.4425350427627563} 11/07/2021 15:07:24 - INFO - __main__ - Step 127082: {'lr': 2.900646756911149e-05, 'samples': 24399744, 'steps': 127081, 'loss/train': 1.0815398693084717} 11/07/2021 15:07:25 - INFO - __main__ - Step 127083: {'lr': 2.9003986525858254e-05, 'samples': 24399936, 'steps': 127082, 'loss/train': 1.5146446228027344} 11/07/2021 15:07:25 - INFO - __main__ - Step 127084: {'lr': 2.9001505582182425e-05, 'samples': 24400128, 'steps': 127083, 'loss/train': 1.3043104410171509} 11/07/2021 15:07:25 - INFO - __main__ - Step 127085: {'lr': 2.8999024738085107e-05, 'samples': 24400320, 'steps': 127084, 'loss/train': 1.875816822052002} 11/07/2021 15:07:26 - INFO - __main__ - Step 127086: {'lr': 2.899654399356744e-05, 'samples': 24400512, 'steps': 127085, 'loss/train': 1.0414220094680786} 11/07/2021 15:07:26 - INFO - __main__ - Step 127087: {'lr': 2.89940633486305e-05, 'samples': 24400704, 'steps': 127086, 'loss/train': 1.2435873746871948} 11/07/2021 15:07:27 - INFO - __main__ - Step 127088: {'lr': 2.8991582803275435e-05, 'samples': 24400896, 'steps': 127087, 'loss/train': 1.3425467014312744} 11/07/2021 15:07:27 - INFO - __main__ - Step 127089: {'lr': 2.8989102357503376e-05, 'samples': 24401088, 'steps': 127088, 'loss/train': 1.2453378438949585} 11/07/2021 15:07:28 - INFO - __main__ - Step 127090: {'lr': 2.8986622011315382e-05, 'samples': 24401280, 'steps': 127089, 'loss/train': 1.437221884727478} 11/07/2021 15:07:28 - INFO - __main__ - Step 127091: {'lr': 2.898414176471262e-05, 'samples': 24401472, 'steps': 127090, 'loss/train': 0.9997630715370178} 11/07/2021 15:07:29 - INFO - __main__ - Step 127092: {'lr': 2.8981661617696192e-05, 'samples': 24401664, 'steps': 127091, 'loss/train': 1.1482945680618286} 11/07/2021 15:07:29 - INFO - __main__ - Step 127093: {'lr': 2.8979181570267188e-05, 'samples': 24401856, 'steps': 127092, 'loss/train': 1.283211350440979} 11/07/2021 15:07:30 - INFO - __main__ - Step 127094: {'lr': 2.897670162242677e-05, 'samples': 24402048, 'steps': 127093, 'loss/train': 0.9875978827476501} 11/07/2021 15:07:30 - INFO - __main__ - Step 127095: {'lr': 2.897422177417605e-05, 'samples': 24402240, 'steps': 127094, 'loss/train': 1.0679254531860352} 11/07/2021 15:07:31 - INFO - __main__ - Step 127096: {'lr': 2.897174202551614e-05, 'samples': 24402432, 'steps': 127095, 'loss/train': 1.316267967224121} 11/07/2021 15:07:31 - INFO - __main__ - Step 127097: {'lr': 2.896926237644812e-05, 'samples': 24402624, 'steps': 127096, 'loss/train': 1.0600627660751343} 11/07/2021 15:07:32 - INFO - __main__ - Step 127098: {'lr': 2.896678282697318e-05, 'samples': 24402816, 'steps': 127097, 'loss/train': 1.3258235454559326} 11/07/2021 15:07:32 - INFO - __main__ - Step 127099: {'lr': 2.8964303377092354e-05, 'samples': 24403008, 'steps': 127098, 'loss/train': 1.130286455154419} 11/07/2021 15:07:33 - INFO - __main__ - Step 127100: {'lr': 2.896182402680689e-05, 'samples': 24403200, 'steps': 127099, 'loss/train': 1.6314479112625122} 11/07/2021 15:07:33 - INFO - __main__ - Step 127101: {'lr': 2.8959344776117752e-05, 'samples': 24403392, 'steps': 127100, 'loss/train': 0.6267738938331604} 11/07/2021 15:07:33 - INFO - __main__ - Step 127102: {'lr': 2.8956865625026115e-05, 'samples': 24403584, 'steps': 127101, 'loss/train': 0.9709679484367371} 11/07/2021 15:07:34 - INFO - __main__ - Step 127103: {'lr': 2.895438657353311e-05, 'samples': 24403776, 'steps': 127102, 'loss/train': 1.1552103757858276} 11/07/2021 15:07:35 - INFO - __main__ - Step 127104: {'lr': 2.895190762163985e-05, 'samples': 24403968, 'steps': 127103, 'loss/train': 0.9903197884559631} 11/07/2021 15:07:35 - INFO - __main__ - Step 127105: {'lr': 2.8949428769347447e-05, 'samples': 24404160, 'steps': 127104, 'loss/train': 0.8138483762741089} 11/07/2021 15:07:35 - INFO - __main__ - Step 127106: {'lr': 2.8946950016657037e-05, 'samples': 24404352, 'steps': 127105, 'loss/train': 0.053223710507154465} 11/07/2021 15:07:36 - INFO - __main__ - Step 127107: {'lr': 2.8944471363569703e-05, 'samples': 24404544, 'steps': 127106, 'loss/train': 1.0255670547485352} 11/07/2021 15:07:36 - INFO - __main__ - Step 127108: {'lr': 2.8941992810086583e-05, 'samples': 24404736, 'steps': 127107, 'loss/train': 0.8984047770500183} 11/07/2021 15:07:37 - INFO - __main__ - Step 127109: {'lr': 2.8939514356208784e-05, 'samples': 24404928, 'steps': 127108, 'loss/train': 0.8330838084220886} 11/07/2021 15:07:38 - INFO - __main__ - Step 127110: {'lr': 2.893703600193742e-05, 'samples': 24405120, 'steps': 127109, 'loss/train': 1.9291101694107056} 11/07/2021 15:07:38 - INFO - __main__ - Step 127111: {'lr': 2.8934557747273633e-05, 'samples': 24405312, 'steps': 127110, 'loss/train': 0.9664016962051392} 11/07/2021 15:07:38 - INFO - __main__ - Step 127112: {'lr': 2.893207959221855e-05, 'samples': 24405504, 'steps': 127111, 'loss/train': 0.8864834904670715} 11/07/2021 15:07:39 - INFO - __main__ - Step 127113: {'lr': 2.8929601536773237e-05, 'samples': 24405696, 'steps': 127112, 'loss/train': 1.520140290260315} 11/07/2021 15:07:40 - INFO - __main__ - Step 127114: {'lr': 2.8927123580938823e-05, 'samples': 24405888, 'steps': 127113, 'loss/train': 0.803408145904541} 11/07/2021 15:07:40 - INFO - __main__ - Step 127115: {'lr': 2.8924645724716454e-05, 'samples': 24406080, 'steps': 127114, 'loss/train': 1.5164536237716675} 11/07/2021 15:07:40 - INFO - __main__ - Step 127116: {'lr': 2.8922167968107205e-05, 'samples': 24406272, 'steps': 127115, 'loss/train': 0.8167110085487366} 11/07/2021 15:07:41 - INFO - __main__ - Step 127117: {'lr': 2.8919690311112216e-05, 'samples': 24406464, 'steps': 127116, 'loss/train': 1.2427210807800293} 11/07/2021 15:07:41 - INFO - __main__ - Step 127118: {'lr': 2.8917212753732632e-05, 'samples': 24406656, 'steps': 127117, 'loss/train': 1.3744773864746094} 11/07/2021 15:07:42 - INFO - __main__ - Step 127119: {'lr': 2.8914735295969495e-05, 'samples': 24406848, 'steps': 127118, 'loss/train': 0.8297964930534363} 11/07/2021 15:07:43 - INFO - __main__ - Step 127120: {'lr': 2.891225793782401e-05, 'samples': 24407040, 'steps': 127119, 'loss/train': 1.1867194175720215} 11/07/2021 15:07:43 - INFO - __main__ - Step 127121: {'lr': 2.8909780679297226e-05, 'samples': 24407232, 'steps': 127120, 'loss/train': 0.9546193480491638} 11/07/2021 15:07:43 - INFO - __main__ - Step 127122: {'lr': 2.8907303520390282e-05, 'samples': 24407424, 'steps': 127121, 'loss/train': 1.0404112339019775} 11/07/2021 15:07:44 - INFO - __main__ - Step 127123: {'lr': 2.8904826461104315e-05, 'samples': 24407616, 'steps': 127122, 'loss/train': 1.4860841035842896} 11/07/2021 15:07:45 - INFO - __main__ - Step 127124: {'lr': 2.890234950144041e-05, 'samples': 24407808, 'steps': 127123, 'loss/train': 0.3840829133987427} 11/07/2021 15:07:45 - INFO - __main__ - Step 127125: {'lr': 2.8899872641399732e-05, 'samples': 24408000, 'steps': 127124, 'loss/train': 1.353623390197754} 11/07/2021 15:07:45 - INFO - __main__ - Step 127126: {'lr': 2.8897395880983334e-05, 'samples': 24408192, 'steps': 127125, 'loss/train': 1.2830729484558105} 11/07/2021 15:07:46 - INFO - __main__ - Step 127127: {'lr': 2.889491922019233e-05, 'samples': 24408384, 'steps': 127126, 'loss/train': 0.7683191299438477} 11/07/2021 15:07:46 - INFO - __main__ - Step 127128: {'lr': 2.889244265902788e-05, 'samples': 24408576, 'steps': 127127, 'loss/train': 0.9981954097747803} 11/07/2021 15:07:47 - INFO - __main__ - Step 127129: {'lr': 2.8889966197491068e-05, 'samples': 24408768, 'steps': 127128, 'loss/train': 1.4033209085464478} 11/07/2021 15:07:47 - INFO - __main__ - Step 127130: {'lr': 2.8887489835583065e-05, 'samples': 24408960, 'steps': 127129, 'loss/train': 0.79163658618927} 11/07/2021 15:07:48 - INFO - __main__ - Step 127131: {'lr': 2.888501357330492e-05, 'samples': 24409152, 'steps': 127130, 'loss/train': 1.4255127906799316} 11/07/2021 15:07:48 - INFO - __main__ - Step 127132: {'lr': 2.8882537410657773e-05, 'samples': 24409344, 'steps': 127131, 'loss/train': 0.985653281211853} 11/07/2021 15:07:48 - INFO - __main__ - Step 127133: {'lr': 2.8880061347642733e-05, 'samples': 24409536, 'steps': 127132, 'loss/train': 0.830756664276123} 11/07/2021 15:07:50 - INFO - __main__ - Step 127134: {'lr': 2.887758538426094e-05, 'samples': 24409728, 'steps': 127133, 'loss/train': 1.22691011428833} 11/07/2021 15:07:50 - INFO - __main__ - Step 127135: {'lr': 2.8875109520513505e-05, 'samples': 24409920, 'steps': 127134, 'loss/train': 1.1230189800262451} 11/07/2021 15:07:50 - INFO - __main__ - Step 127136: {'lr': 2.8872633756401505e-05, 'samples': 24410112, 'steps': 127135, 'loss/train': 2.1244399547576904} 11/07/2021 15:07:51 - INFO - __main__ - Step 127137: {'lr': 2.8870158091926113e-05, 'samples': 24410304, 'steps': 127136, 'loss/train': 0.9055362939834595} 11/07/2021 15:07:51 - INFO - __main__ - Step 127138: {'lr': 2.8867682527088405e-05, 'samples': 24410496, 'steps': 127137, 'loss/train': 1.4478884935379028} 11/07/2021 15:07:51 - INFO - __main__ - Step 127139: {'lr': 2.8865207061889555e-05, 'samples': 24410688, 'steps': 127138, 'loss/train': 0.08506624400615692} 11/07/2021 15:07:52 - INFO - __main__ - Step 127140: {'lr': 2.886273169633058e-05, 'samples': 24410880, 'steps': 127139, 'loss/train': 1.2646418809890747} 11/07/2021 15:07:53 - INFO - __main__ - Step 127141: {'lr': 2.8860256430412652e-05, 'samples': 24411072, 'steps': 127140, 'loss/train': 1.1778652667999268} 11/07/2021 15:07:53 - INFO - __main__ - Step 127142: {'lr': 2.885778126413685e-05, 'samples': 24411264, 'steps': 127141, 'loss/train': 2.020487070083618} 11/07/2021 15:07:54 - INFO - __main__ - Step 127143: {'lr': 2.8855306197504344e-05, 'samples': 24411456, 'steps': 127142, 'loss/train': 1.411188006401062} 11/07/2021 15:07:54 - INFO - __main__ - Step 127144: {'lr': 2.885283123051624e-05, 'samples': 24411648, 'steps': 127143, 'loss/train': 1.3567932844161987} 11/07/2021 15:07:55 - INFO - __main__ - Step 127145: {'lr': 2.8850356363173623e-05, 'samples': 24411840, 'steps': 127144, 'loss/train': 1.2996660470962524} 11/07/2021 15:07:55 - INFO - __main__ - Step 127146: {'lr': 2.8847881595477603e-05, 'samples': 24412032, 'steps': 127145, 'loss/train': 1.423232078552246} 11/07/2021 15:07:56 - INFO - __main__ - Step 127147: {'lr': 2.8845406927429347e-05, 'samples': 24412224, 'steps': 127146, 'loss/train': 0.04544231295585632} 11/07/2021 15:07:56 - INFO - __main__ - Step 127148: {'lr': 2.8842932359029933e-05, 'samples': 24412416, 'steps': 127147, 'loss/train': 1.4166079759597778} 11/07/2021 15:07:56 - INFO - __main__ - Step 127149: {'lr': 2.8840457890280446e-05, 'samples': 24412608, 'steps': 127148, 'loss/train': 1.3689645528793335} 11/07/2021 15:07:57 - INFO - __main__ - Step 127150: {'lr': 2.883798352118208e-05, 'samples': 24412800, 'steps': 127149, 'loss/train': 1.5149356126785278} 11/07/2021 15:07:58 - INFO - __main__ - Step 127151: {'lr': 2.883550925173589e-05, 'samples': 24412992, 'steps': 127150, 'loss/train': 1.137846827507019} 11/07/2021 15:07:58 - INFO - __main__ - Step 127152: {'lr': 2.8833035081943044e-05, 'samples': 24413184, 'steps': 127151, 'loss/train': 1.0358471870422363} 11/07/2021 15:07:59 - INFO - __main__ - Step 127153: {'lr': 2.883056101180459e-05, 'samples': 24413376, 'steps': 127152, 'loss/train': 1.3465241193771362} 11/07/2021 15:07:59 - INFO - __main__ - Step 127154: {'lr': 2.8828087041321673e-05, 'samples': 24413568, 'steps': 127153, 'loss/train': 1.0904641151428223} 11/07/2021 15:08:00 - INFO - __main__ - Step 127155: {'lr': 2.8825613170495396e-05, 'samples': 24413760, 'steps': 127154, 'loss/train': 0.9045116901397705} 11/07/2021 15:08:00 - INFO - __main__ - Step 127156: {'lr': 2.8823139399326875e-05, 'samples': 24413952, 'steps': 127155, 'loss/train': 0.6381984949111938} 11/07/2021 15:08:01 - INFO - __main__ - Step 127157: {'lr': 2.8820665727817245e-05, 'samples': 24414144, 'steps': 127156, 'loss/train': 0.7949960827827454} 11/07/2021 15:08:01 - INFO - __main__ - Step 127158: {'lr': 2.8818192155967622e-05, 'samples': 24414336, 'steps': 127157, 'loss/train': 1.1339560747146606} 11/07/2021 15:08:02 - INFO - __main__ - Step 127159: {'lr': 2.8815718683779078e-05, 'samples': 24414528, 'steps': 127158, 'loss/train': 0.9848365783691406} 11/07/2021 15:08:02 - INFO - __main__ - Step 127160: {'lr': 2.881324531125279e-05, 'samples': 24414720, 'steps': 127159, 'loss/train': 1.1336654424667358} 11/07/2021 15:08:03 - INFO - __main__ - Step 127161: {'lr': 2.8810772038389832e-05, 'samples': 24414912, 'steps': 127160, 'loss/train': 1.6027257442474365} 11/07/2021 15:08:03 - INFO - __main__ - Step 127162: {'lr': 2.880829886519132e-05, 'samples': 24415104, 'steps': 127161, 'loss/train': 1.5410892963409424} 11/07/2021 15:08:04 - INFO - __main__ - Step 127163: {'lr': 2.8805825791658385e-05, 'samples': 24415296, 'steps': 127162, 'loss/train': 0.9964122772216797} 11/07/2021 15:08:04 - INFO - __main__ - Step 127164: {'lr': 2.8803352817792118e-05, 'samples': 24415488, 'steps': 127163, 'loss/train': 1.4550057649612427} 11/07/2021 15:08:04 - INFO - __main__ - Step 127165: {'lr': 2.880087994359365e-05, 'samples': 24415680, 'steps': 127164, 'loss/train': 1.0737228393554688} 11/07/2021 15:08:06 - INFO - __main__ - Step 127166: {'lr': 2.879840716906415e-05, 'samples': 24415872, 'steps': 127165, 'loss/train': 2.310523509979248} 11/07/2021 15:08:06 - INFO - __main__ - Step 127167: {'lr': 2.879593449420462e-05, 'samples': 24416064, 'steps': 127166, 'loss/train': 1.4496201276779175} 11/07/2021 15:08:06 - INFO - __main__ - Step 127168: {'lr': 2.8793461919016217e-05, 'samples': 24416256, 'steps': 127167, 'loss/train': 0.801075279712677} 11/07/2021 15:08:07 - INFO - __main__ - Step 127169: {'lr': 2.8790989443500087e-05, 'samples': 24416448, 'steps': 127168, 'loss/train': 0.8251419067382812} 11/07/2021 15:08:07 - INFO - __main__ - Step 127170: {'lr': 2.878851706765731e-05, 'samples': 24416640, 'steps': 127169, 'loss/train': 1.3027944564819336} 11/07/2021 15:08:08 - INFO - __main__ - Step 127171: {'lr': 2.8786044791489025e-05, 'samples': 24416832, 'steps': 127170, 'loss/train': 1.2227742671966553} 11/07/2021 15:08:08 - INFO - __main__ - Step 127172: {'lr': 2.878357261499631e-05, 'samples': 24417024, 'steps': 127171, 'loss/train': 1.1960328817367554} 11/07/2021 15:08:09 - INFO - __main__ - Step 127173: {'lr': 2.8781100538180338e-05, 'samples': 24417216, 'steps': 127172, 'loss/train': 1.2062174081802368} 11/07/2021 15:08:09 - INFO - __main__ - Step 127174: {'lr': 2.877862856104216e-05, 'samples': 24417408, 'steps': 127173, 'loss/train': 1.0209957361221313} 11/07/2021 15:08:09 - INFO - __main__ - Step 127175: {'lr': 2.8776156683582938e-05, 'samples': 24417600, 'steps': 127174, 'loss/train': 1.506624460220337} 11/07/2021 15:08:11 - INFO - __main__ - Step 127176: {'lr': 2.877368490580376e-05, 'samples': 24417792, 'steps': 127175, 'loss/train': 1.26276695728302} 11/07/2021 15:08:11 - INFO - __main__ - Step 127177: {'lr': 2.8771213227705735e-05, 'samples': 24417984, 'steps': 127176, 'loss/train': 1.161794900894165} 11/07/2021 15:08:11 - INFO - __main__ - Step 127178: {'lr': 2.876874164929e-05, 'samples': 24418176, 'steps': 127177, 'loss/train': 0.9556633234024048} 11/07/2021 15:08:12 - INFO - __main__ - Step 127179: {'lr': 2.8766270170557718e-05, 'samples': 24418368, 'steps': 127178, 'loss/train': 1.1868394613265991} 11/07/2021 15:08:12 - INFO - __main__ - Step 127180: {'lr': 2.8763798791509865e-05, 'samples': 24418560, 'steps': 127179, 'loss/train': 0.7156410813331604} 11/07/2021 15:08:13 - INFO - __main__ - Step 127181: {'lr': 2.8761327512147662e-05, 'samples': 24418752, 'steps': 127180, 'loss/train': 1.5504313707351685} 11/07/2021 15:08:13 - INFO - __main__ - Step 127182: {'lr': 2.875885633247216e-05, 'samples': 24418944, 'steps': 127181, 'loss/train': 1.1048203706741333} 11/07/2021 15:08:14 - INFO - __main__ - Step 127183: {'lr': 2.8756385252484503e-05, 'samples': 24419136, 'steps': 127182, 'loss/train': 1.4596494436264038} 11/07/2021 15:08:14 - INFO - __main__ - Step 127184: {'lr': 2.8753914272185822e-05, 'samples': 24419328, 'steps': 127183, 'loss/train': 1.3240681886672974} 11/07/2021 15:08:14 - INFO - __main__ - Step 127185: {'lr': 2.8751443391577204e-05, 'samples': 24419520, 'steps': 127184, 'loss/train': 0.4519909620285034} 11/07/2021 15:08:15 - INFO - __main__ - Step 127186: {'lr': 2.8748972610659786e-05, 'samples': 24419712, 'steps': 127185, 'loss/train': 1.3829883337020874} 11/07/2021 15:08:16 - INFO - __main__ - Step 127187: {'lr': 2.874650192943465e-05, 'samples': 24419904, 'steps': 127186, 'loss/train': 1.086751103401184} 11/07/2021 15:08:16 - INFO - __main__ - Step 127188: {'lr': 2.8744031347902933e-05, 'samples': 24420096, 'steps': 127187, 'loss/train': 1.1585181951522827} 11/07/2021 15:08:17 - INFO - __main__ - Step 127189: {'lr': 2.8741560866065747e-05, 'samples': 24420288, 'steps': 127188, 'loss/train': 0.5013075470924377} 11/07/2021 15:08:17 - INFO - __main__ - Step 127190: {'lr': 2.8739090483924173e-05, 'samples': 24420480, 'steps': 127189, 'loss/train': 1.3489031791687012} 11/07/2021 15:08:18 - INFO - __main__ - Step 127191: {'lr': 2.873662020147938e-05, 'samples': 24420672, 'steps': 127190, 'loss/train': 1.2518008947372437} 11/07/2021 15:08:18 - INFO - __main__ - Step 127192: {'lr': 2.8734150018732503e-05, 'samples': 24420864, 'steps': 127191, 'loss/train': 1.2069600820541382} 11/07/2021 15:08:19 - INFO - __main__ - Step 127193: {'lr': 2.873167993568454e-05, 'samples': 24421056, 'steps': 127192, 'loss/train': 1.1200799942016602} 11/07/2021 15:08:19 - INFO - __main__ - Step 127194: {'lr': 2.872920995233666e-05, 'samples': 24421248, 'steps': 127193, 'loss/train': 1.7201669216156006} 11/07/2021 15:08:19 - INFO - __main__ - Step 127195: {'lr': 2.8726740068689998e-05, 'samples': 24421440, 'steps': 127194, 'loss/train': 1.1454371213912964} 11/07/2021 15:08:20 - INFO - __main__ - Step 127196: {'lr': 2.872427028474564e-05, 'samples': 24421632, 'steps': 127195, 'loss/train': 1.8569674491882324} 11/07/2021 15:08:21 - INFO - __main__ - Step 127197: {'lr': 2.8721800600504723e-05, 'samples': 24421824, 'steps': 127196, 'loss/train': 1.4115830659866333} 11/07/2021 15:08:21 - INFO - __main__ - Step 127198: {'lr': 2.8719331015968353e-05, 'samples': 24422016, 'steps': 127197, 'loss/train': 1.3104642629623413} 11/07/2021 15:08:21 - INFO - __main__ - Step 127199: {'lr': 2.8716861531137616e-05, 'samples': 24422208, 'steps': 127198, 'loss/train': 0.43984004855155945} 11/07/2021 15:08:22 - INFO - __main__ - Step 127200: {'lr': 2.8714392146013652e-05, 'samples': 24422400, 'steps': 127199, 'loss/train': 1.3305035829544067} 11/07/2021 15:08:22 - INFO - __main__ - Step 127201: {'lr': 2.8711922860597596e-05, 'samples': 24422592, 'steps': 127200, 'loss/train': 1.4296318292617798} 11/07/2021 15:08:23 - INFO - __main__ - Step 127202: {'lr': 2.87094536748905e-05, 'samples': 24422784, 'steps': 127201, 'loss/train': 1.0670218467712402} 11/07/2021 15:08:24 - INFO - __main__ - Step 127203: {'lr': 2.8706984588893535e-05, 'samples': 24422976, 'steps': 127202, 'loss/train': 1.0196701288223267} 11/07/2021 15:08:24 - INFO - __main__ - Step 127204: {'lr': 2.8704515602607757e-05, 'samples': 24423168, 'steps': 127203, 'loss/train': 1.1473268270492554} 11/07/2021 15:08:24 - INFO - __main__ - Step 127205: {'lr': 2.8702046716034325e-05, 'samples': 24423360, 'steps': 127204, 'loss/train': 1.2341161966323853} 11/07/2021 15:08:25 - INFO - __main__ - Step 127206: {'lr': 2.8699577929174407e-05, 'samples': 24423552, 'steps': 127205, 'loss/train': 1.1266380548477173} 11/07/2021 15:08:26 - INFO - __main__ - Step 127207: {'lr': 2.869710924202898e-05, 'samples': 24423744, 'steps': 127206, 'loss/train': 1.0697072744369507} 11/07/2021 15:08:26 - INFO - __main__ - Step 127208: {'lr': 2.8694640654599202e-05, 'samples': 24423936, 'steps': 127207, 'loss/train': 1.0445587635040283} 11/07/2021 15:08:26 - INFO - __main__ - Step 127209: {'lr': 2.8692172166886215e-05, 'samples': 24424128, 'steps': 127208, 'loss/train': 1.5087006092071533} 11/07/2021 15:08:27 - INFO - __main__ - Step 127210: {'lr': 2.868970377889113e-05, 'samples': 24424320, 'steps': 127209, 'loss/train': 1.2748597860336304} 11/07/2021 15:08:27 - INFO - __main__ - Step 127211: {'lr': 2.8687235490615054e-05, 'samples': 24424512, 'steps': 127210, 'loss/train': 0.5410304665565491} 11/07/2021 15:08:28 - INFO - __main__ - Step 127212: {'lr': 2.8684767302059074e-05, 'samples': 24424704, 'steps': 127211, 'loss/train': 1.186886191368103} 11/07/2021 15:08:28 - INFO - __main__ - Step 127213: {'lr': 2.868229921322432e-05, 'samples': 24424896, 'steps': 127212, 'loss/train': 1.8164886236190796} 11/07/2021 15:08:29 - INFO - __main__ - Step 127214: {'lr': 2.8679831224111942e-05, 'samples': 24425088, 'steps': 127213, 'loss/train': 1.149928331375122} 11/07/2021 15:08:29 - INFO - __main__ - Step 127215: {'lr': 2.8677363334722982e-05, 'samples': 24425280, 'steps': 127214, 'loss/train': 1.3688054084777832} 11/07/2021 15:08:29 - INFO - __main__ - Step 127216: {'lr': 2.867489554505859e-05, 'samples': 24425472, 'steps': 127215, 'loss/train': 1.2782557010650635} 11/07/2021 15:08:30 - INFO - __main__ - Step 127217: {'lr': 2.8672427855119893e-05, 'samples': 24425664, 'steps': 127216, 'loss/train': 1.3617979288101196} 11/07/2021 15:08:31 - INFO - __main__ - Step 127218: {'lr': 2.866996026490798e-05, 'samples': 24425856, 'steps': 127217, 'loss/train': 1.1443642377853394} 11/07/2021 15:08:31 - INFO - __main__ - Step 127219: {'lr': 2.8667492774424013e-05, 'samples': 24426048, 'steps': 127218, 'loss/train': 1.2912473678588867} 11/07/2021 15:08:32 - INFO - __main__ - Step 127220: {'lr': 2.8665025383668997e-05, 'samples': 24426240, 'steps': 127219, 'loss/train': 1.2501227855682373} 11/07/2021 15:08:32 - INFO - __main__ - Step 127221: {'lr': 2.866255809264412e-05, 'samples': 24426432, 'steps': 127220, 'loss/train': 1.233081579208374} 11/07/2021 15:08:33 - INFO - __main__ - Step 127222: {'lr': 2.8660090901350493e-05, 'samples': 24426624, 'steps': 127221, 'loss/train': 1.2497820854187012} 11/07/2021 15:08:33 - INFO - __main__ - Step 127223: {'lr': 2.8657623809789174e-05, 'samples': 24426816, 'steps': 127222, 'loss/train': 1.2541821002960205} 11/07/2021 15:08:34 - INFO - __main__ - Step 127224: {'lr': 2.8655156817961353e-05, 'samples': 24427008, 'steps': 127223, 'loss/train': 1.119343876838684} 11/07/2021 15:08:34 - INFO - __main__ - Step 127225: {'lr': 2.8652689925868087e-05, 'samples': 24427200, 'steps': 127224, 'loss/train': 1.343688726425171} 11/07/2021 15:08:35 - INFO - __main__ - Step 127226: {'lr': 2.8650223133510484e-05, 'samples': 24427392, 'steps': 127225, 'loss/train': 0.9483633637428284} 11/07/2021 15:08:36 - INFO - __main__ - Step 127227: {'lr': 2.8647756440889712e-05, 'samples': 24427584, 'steps': 127226, 'loss/train': 1.3873897790908813} 11/07/2021 15:08:36 - INFO - __main__ - Step 127228: {'lr': 2.8645289848006823e-05, 'samples': 24427776, 'steps': 127227, 'loss/train': 1.1838881969451904} 11/07/2021 15:08:36 - INFO - __main__ - Step 127229: {'lr': 2.8642823354862958e-05, 'samples': 24427968, 'steps': 127228, 'loss/train': 0.6153071522712708} 11/07/2021 15:08:37 - INFO - __main__ - Step 127230: {'lr': 2.8640356961459225e-05, 'samples': 24428160, 'steps': 127229, 'loss/train': 1.4059396982192993} 11/07/2021 15:08:37 - INFO - __main__ - Step 127231: {'lr': 2.8637890667796708e-05, 'samples': 24428352, 'steps': 127230, 'loss/train': 0.8275644779205322} 11/07/2021 15:08:37 - INFO - __main__ - Step 127232: {'lr': 2.863542447387657e-05, 'samples': 24428544, 'steps': 127231, 'loss/train': 1.364763855934143} 11/07/2021 15:08:38 - INFO - __main__ - Step 127233: {'lr': 2.8632958379699924e-05, 'samples': 24428736, 'steps': 127232, 'loss/train': 1.3133269548416138} 11/07/2021 15:08:39 - INFO - __main__ - Step 127234: {'lr': 2.8630492385267797e-05, 'samples': 24428928, 'steps': 127233, 'loss/train': 1.3174339532852173} 11/07/2021 15:08:39 - INFO - __main__ - Step 127235: {'lr': 2.8628026490581383e-05, 'samples': 24429120, 'steps': 127234, 'loss/train': 1.455689549446106} 11/07/2021 15:08:39 - INFO - __main__ - Step 127236: {'lr': 2.8625560695641735e-05, 'samples': 24429312, 'steps': 127235, 'loss/train': 1.5492024421691895} 11/07/2021 15:08:40 - INFO - __main__ - Step 127237: {'lr': 2.8623095000450015e-05, 'samples': 24429504, 'steps': 127236, 'loss/train': 0.8505935668945312} 11/07/2021 15:08:41 - INFO - __main__ - Step 127238: {'lr': 2.8620629405007287e-05, 'samples': 24429696, 'steps': 127237, 'loss/train': 0.7829328775405884} 11/07/2021 15:08:41 - INFO - __main__ - Step 127239: {'lr': 2.861816390931471e-05, 'samples': 24429888, 'steps': 127238, 'loss/train': 1.5994184017181396} 11/07/2021 15:08:42 - INFO - __main__ - Step 127240: {'lr': 2.8615698513373367e-05, 'samples': 24430080, 'steps': 127239, 'loss/train': 1.0416364669799805} 11/07/2021 15:08:42 - INFO - __main__ - Step 127241: {'lr': 2.861323321718437e-05, 'samples': 24430272, 'steps': 127240, 'loss/train': 1.2148290872573853} 11/07/2021 15:08:42 - INFO - __main__ - Step 127242: {'lr': 2.8610768020748827e-05, 'samples': 24430464, 'steps': 127241, 'loss/train': 1.5930668115615845} 11/07/2021 15:08:43 - INFO - __main__ - Step 127243: {'lr': 2.860830292406788e-05, 'samples': 24430656, 'steps': 127242, 'loss/train': 1.681972622871399} 11/07/2021 15:08:44 - INFO - __main__ - Step 127244: {'lr': 2.8605837927142607e-05, 'samples': 24430848, 'steps': 127243, 'loss/train': 1.586150050163269} 11/07/2021 15:08:44 - INFO - __main__ - Step 127245: {'lr': 2.8603373029974095e-05, 'samples': 24431040, 'steps': 127244, 'loss/train': 1.607174038887024} 11/07/2021 15:08:44 - INFO - __main__ - Step 127246: {'lr': 2.860090823256359e-05, 'samples': 24431232, 'steps': 127245, 'loss/train': 0.9712284803390503} 11/07/2021 15:08:45 - INFO - __main__ - Step 127247: {'lr': 2.8598443534912004e-05, 'samples': 24431424, 'steps': 127246, 'loss/train': 1.266545057296753} 11/07/2021 15:08:45 - INFO - __main__ - Step 127248: {'lr': 2.8595978937020567e-05, 'samples': 24431616, 'steps': 127247, 'loss/train': 1.2273993492126465} 11/07/2021 15:08:46 - INFO - __main__ - Step 127249: {'lr': 2.8593514438890357e-05, 'samples': 24431808, 'steps': 127248, 'loss/train': 0.9928236603736877} 11/07/2021 15:08:46 - INFO - __main__ - Step 127250: {'lr': 2.859105004052248e-05, 'samples': 24432000, 'steps': 127249, 'loss/train': 1.2175440788269043} 11/07/2021 15:08:47 - INFO - __main__ - Step 127251: {'lr': 2.858858574191808e-05, 'samples': 24432192, 'steps': 127250, 'loss/train': 1.6177289485931396} 11/07/2021 15:08:47 - INFO - __main__ - Step 127252: {'lr': 2.8586121543078242e-05, 'samples': 24432384, 'steps': 127251, 'loss/train': 1.3939582109451294} 11/07/2021 15:08:47 - INFO - __main__ - Step 127253: {'lr': 2.8583657444004098e-05, 'samples': 24432576, 'steps': 127252, 'loss/train': 1.4319144487380981} 11/07/2021 15:08:48 - INFO - __main__ - Step 127254: {'lr': 2.8581193444696703e-05, 'samples': 24432768, 'steps': 127253, 'loss/train': 1.377451777458191} 11/07/2021 15:08:49 - INFO - __main__ - Step 127255: {'lr': 2.8578729545157222e-05, 'samples': 24432960, 'steps': 127254, 'loss/train': 0.7894366383552551} 11/07/2021 15:08:49 - INFO - __main__ - Step 127256: {'lr': 2.857626574538677e-05, 'samples': 24433152, 'steps': 127255, 'loss/train': 1.4863322973251343} 11/07/2021 15:08:49 - INFO - __main__ - Step 127257: {'lr': 2.85738020453864e-05, 'samples': 24433344, 'steps': 127256, 'loss/train': 1.213900089263916} 11/07/2021 15:08:50 - INFO - __main__ - Step 127258: {'lr': 2.8571338445157274e-05, 'samples': 24433536, 'steps': 127257, 'loss/train': 1.1037781238555908} 11/07/2021 15:08:51 - INFO - __main__ - Step 127259: {'lr': 2.8568874944700507e-05, 'samples': 24433728, 'steps': 127258, 'loss/train': 1.3213474750518799} 11/07/2021 15:08:51 - INFO - __main__ - Step 127260: {'lr': 2.8566411544017205e-05, 'samples': 24433920, 'steps': 127259, 'loss/train': 1.2524791955947876} 11/07/2021 15:08:52 - INFO - __main__ - Step 127261: {'lr': 2.8563948243108428e-05, 'samples': 24434112, 'steps': 127260, 'loss/train': 1.544988989830017} 11/07/2021 15:08:52 - INFO - __main__ - Step 127262: {'lr': 2.856148504197531e-05, 'samples': 24434304, 'steps': 127261, 'loss/train': 1.677878975868225} 11/07/2021 15:08:52 - INFO - __main__ - Step 127263: {'lr': 2.855902194061899e-05, 'samples': 24434496, 'steps': 127262, 'loss/train': 1.3153976202011108} 11/07/2021 15:08:53 - INFO - __main__ - Step 127264: {'lr': 2.855655893904055e-05, 'samples': 24434688, 'steps': 127263, 'loss/train': 0.754905641078949} 11/07/2021 15:08:54 - INFO - __main__ - Step 127265: {'lr': 2.8554096037241102e-05, 'samples': 24434880, 'steps': 127264, 'loss/train': 1.2859172821044922} 11/07/2021 15:08:54 - INFO - __main__ - Step 127266: {'lr': 2.855163323522175e-05, 'samples': 24435072, 'steps': 127265, 'loss/train': 1.3560068607330322} 11/07/2021 15:08:54 - INFO - __main__ - Step 127267: {'lr': 2.8549170532983616e-05, 'samples': 24435264, 'steps': 127266, 'loss/train': 0.6695832014083862} 11/07/2021 15:08:55 - INFO - __main__ - Step 127268: {'lr': 2.8546707930527826e-05, 'samples': 24435456, 'steps': 127267, 'loss/train': 1.7289795875549316} 11/07/2021 15:08:56 - INFO - __main__ - Step 127269: {'lr': 2.854424542785547e-05, 'samples': 24435648, 'steps': 127268, 'loss/train': 1.0849448442459106} 11/07/2021 15:08:56 - INFO - __main__ - Step 127270: {'lr': 2.8541783024967653e-05, 'samples': 24435840, 'steps': 127269, 'loss/train': 2.4297611713409424} 11/07/2021 15:08:56 - INFO - __main__ - Step 127271: {'lr': 2.853932072186549e-05, 'samples': 24436032, 'steps': 127270, 'loss/train': 1.0936875343322754} 11/07/2021 15:08:57 - INFO - __main__ - Step 127272: {'lr': 2.853685851855009e-05, 'samples': 24436224, 'steps': 127271, 'loss/train': 1.2293951511383057} 11/07/2021 15:08:57 - INFO - __main__ - Step 127273: {'lr': 2.853439641502262e-05, 'samples': 24436416, 'steps': 127272, 'loss/train': 1.212810754776001} 11/07/2021 15:08:58 - INFO - __main__ - Step 127274: {'lr': 2.853193441128407e-05, 'samples': 24436608, 'steps': 127273, 'loss/train': 0.9449689388275146} 11/07/2021 15:08:58 - INFO - __main__ - Step 127275: {'lr': 2.852947250733562e-05, 'samples': 24436800, 'steps': 127274, 'loss/train': 0.9525312185287476} 11/07/2021 15:08:59 - INFO - __main__ - Step 127276: {'lr': 2.8527010703178398e-05, 'samples': 24436992, 'steps': 127275, 'loss/train': 1.4817450046539307} 11/07/2021 15:08:59 - INFO - __main__ - Step 127277: {'lr': 2.852454899881346e-05, 'samples': 24437184, 'steps': 127276, 'loss/train': 1.1987966299057007} 11/07/2021 15:09:00 - INFO - __main__ - Step 127278: {'lr': 2.8522087394241948e-05, 'samples': 24437376, 'steps': 127277, 'loss/train': 1.5104787349700928} 11/07/2021 15:09:01 - INFO - __main__ - Step 127279: {'lr': 2.8519625889464967e-05, 'samples': 24437568, 'steps': 127278, 'loss/train': 1.365042805671692} 11/07/2021 15:09:01 - INFO - __main__ - Step 127280: {'lr': 2.8517164484483632e-05, 'samples': 24437760, 'steps': 127279, 'loss/train': 0.7050572633743286} 11/07/2021 15:09:01 - INFO - __main__ - Step 127281: {'lr': 2.8514703179299024e-05, 'samples': 24437952, 'steps': 127280, 'loss/train': 0.9966058135032654} 11/07/2021 15:09:02 - INFO - __main__ - Step 127282: {'lr': 2.851224197391228e-05, 'samples': 24438144, 'steps': 127281, 'loss/train': 1.539258360862732} 11/07/2021 15:09:02 - INFO - __main__ - Step 127283: {'lr': 2.8509780868324507e-05, 'samples': 24438336, 'steps': 127282, 'loss/train': 1.3545297384262085} 11/07/2021 15:09:03 - INFO - __main__ - Step 127284: {'lr': 2.8507319862536824e-05, 'samples': 24438528, 'steps': 127283, 'loss/train': 1.671895980834961} 11/07/2021 15:09:03 - INFO - __main__ - Step 127285: {'lr': 2.8504858956550307e-05, 'samples': 24438720, 'steps': 127284, 'loss/train': 1.0803756713867188} 11/07/2021 15:09:04 - INFO - __main__ - Step 127286: {'lr': 2.8502398150366093e-05, 'samples': 24438912, 'steps': 127285, 'loss/train': 1.067794919013977} 11/07/2021 15:09:04 - INFO - __main__ - Step 127287: {'lr': 2.8499937443985326e-05, 'samples': 24439104, 'steps': 127286, 'loss/train': 1.1168196201324463} 11/07/2021 15:09:04 - INFO - __main__ - Step 127288: {'lr': 2.8497476837408997e-05, 'samples': 24439296, 'steps': 127287, 'loss/train': 1.1456722021102905} 11/07/2021 15:09:05 - INFO - __main__ - Step 127289: {'lr': 2.8495016330638306e-05, 'samples': 24439488, 'steps': 127288, 'loss/train': 0.7558709979057312} 11/07/2021 15:09:06 - INFO - __main__ - Step 127290: {'lr': 2.849255592367436e-05, 'samples': 24439680, 'steps': 127289, 'loss/train': 1.405903935432434} 11/07/2021 15:09:06 - INFO - __main__ - Step 127291: {'lr': 2.849009561651822e-05, 'samples': 24439872, 'steps': 127290, 'loss/train': 0.9697750210762024} 11/07/2021 15:09:07 - INFO - __main__ - Step 127292: {'lr': 2.8487635409171043e-05, 'samples': 24440064, 'steps': 127291, 'loss/train': 0.887175440788269} 11/07/2021 15:09:07 - INFO - __main__ - Step 127293: {'lr': 2.8485175301633916e-05, 'samples': 24440256, 'steps': 127292, 'loss/train': 1.3239320516586304} 11/07/2021 15:09:07 - INFO - __main__ - Step 127294: {'lr': 2.8482715293907946e-05, 'samples': 24440448, 'steps': 127293, 'loss/train': 1.3036527633666992} 11/07/2021 15:09:08 - INFO - __main__ - Step 127295: {'lr': 2.8480255385994248e-05, 'samples': 24440640, 'steps': 127294, 'loss/train': 1.1932944059371948} 11/07/2021 15:09:09 - INFO - __main__ - Step 127296: {'lr': 2.8477795577893954e-05, 'samples': 24440832, 'steps': 127295, 'loss/train': 1.7289021015167236} 11/07/2021 15:09:09 - INFO - __main__ - Step 127297: {'lr': 2.8475335869608128e-05, 'samples': 24441024, 'steps': 127296, 'loss/train': 0.9555014967918396} 11/07/2021 15:09:09 - INFO - __main__ - Step 127298: {'lr': 2.847287626113787e-05, 'samples': 24441216, 'steps': 127297, 'loss/train': 1.2219512462615967} 11/07/2021 15:09:10 - INFO - __main__ - Step 127299: {'lr': 2.8470416752484353e-05, 'samples': 24441408, 'steps': 127298, 'loss/train': 1.065458059310913} 11/07/2021 15:09:11 - INFO - __main__ - Step 127300: {'lr': 2.846795734364868e-05, 'samples': 24441600, 'steps': 127299, 'loss/train': 1.4372807741165161} 11/07/2021 15:09:11 - INFO - __main__ - Step 127301: {'lr': 2.8465498034631886e-05, 'samples': 24441792, 'steps': 127300, 'loss/train': 1.0560472011566162} 11/07/2021 15:09:11 - INFO - __main__ - Step 127302: {'lr': 2.8463038825435107e-05, 'samples': 24441984, 'steps': 127301, 'loss/train': 1.0388927459716797} 11/07/2021 15:09:12 - INFO - __main__ - Step 127303: {'lr': 2.8460579716059477e-05, 'samples': 24442176, 'steps': 127302, 'loss/train': 1.1206672191619873} 11/07/2021 15:09:12 - INFO - __main__ - Step 127304: {'lr': 2.845812070650608e-05, 'samples': 24442368, 'steps': 127303, 'loss/train': 1.2996395826339722} 11/07/2021 15:09:13 - INFO - __main__ - Step 127305: {'lr': 2.8455661796776056e-05, 'samples': 24442560, 'steps': 127304, 'loss/train': 2.2119481563568115} 11/07/2021 15:09:14 - INFO - __main__ - Step 127306: {'lr': 2.8453202986870456e-05, 'samples': 24442752, 'steps': 127305, 'loss/train': 1.2213952541351318} 11/07/2021 15:09:14 - INFO - __main__ - Step 127307: {'lr': 2.8450744276790454e-05, 'samples': 24442944, 'steps': 127306, 'loss/train': 1.150046706199646} 11/07/2021 15:09:14 - INFO - __main__ - Step 127308: {'lr': 2.844828566653712e-05, 'samples': 24443136, 'steps': 127307, 'loss/train': 0.7859076261520386} 11/07/2021 15:09:15 - INFO - __main__ - Step 127309: {'lr': 2.8445827156111576e-05, 'samples': 24443328, 'steps': 127308, 'loss/train': 1.2883261442184448} 11/07/2021 15:09:15 - INFO - __main__ - Step 127310: {'lr': 2.8443368745514926e-05, 'samples': 24443520, 'steps': 127309, 'loss/train': 1.1605267524719238} 11/07/2021 15:09:16 - INFO - __main__ - Step 127311: {'lr': 2.844091043474825e-05, 'samples': 24443712, 'steps': 127310, 'loss/train': 1.021212100982666} 11/07/2021 15:09:17 - INFO - __main__ - Step 127312: {'lr': 2.8438452223812696e-05, 'samples': 24443904, 'steps': 127311, 'loss/train': 1.3807607889175415} 11/07/2021 15:09:17 - INFO - __main__ - Step 127313: {'lr': 2.8435994112709416e-05, 'samples': 24444096, 'steps': 127312, 'loss/train': 1.281142234802246} 11/07/2021 15:09:17 - INFO - __main__ - Step 127314: {'lr': 2.8433536101439395e-05, 'samples': 24444288, 'steps': 127313, 'loss/train': 1.3041969537734985} 11/07/2021 15:09:18 - INFO - __main__ - Step 127315: {'lr': 2.8431078190003818e-05, 'samples': 24444480, 'steps': 127314, 'loss/train': 1.768510341644287} 11/07/2021 15:09:19 - INFO - __main__ - Step 127316: {'lr': 2.842862037840377e-05, 'samples': 24444672, 'steps': 127315, 'loss/train': 0.5350625514984131} 11/07/2021 15:09:19 - INFO - __main__ - Step 127317: {'lr': 2.842616266664036e-05, 'samples': 24444864, 'steps': 127316, 'loss/train': 1.211381435394287} 11/07/2021 15:09:19 - INFO - __main__ - Step 127318: {'lr': 2.84237050547147e-05, 'samples': 24445056, 'steps': 127317, 'loss/train': 2.05239200592041} 11/07/2021 15:09:20 - INFO - __main__ - Step 127319: {'lr': 2.8421247542627897e-05, 'samples': 24445248, 'steps': 127318, 'loss/train': 0.8421318531036377} 11/07/2021 15:09:20 - INFO - __main__ - Step 127320: {'lr': 2.8418790130381067e-05, 'samples': 24445440, 'steps': 127319, 'loss/train': 0.9231988787651062} 11/07/2021 15:09:20 - INFO - __main__ - Step 127321: {'lr': 2.8416332817975314e-05, 'samples': 24445632, 'steps': 127320, 'loss/train': 1.3577566146850586} 11/07/2021 15:09:21 - INFO - __main__ - Step 127322: {'lr': 2.8413875605411755e-05, 'samples': 24445824, 'steps': 127321, 'loss/train': 1.2381575107574463} 11/07/2021 15:09:22 - INFO - __main__ - Step 127323: {'lr': 2.8411418492691465e-05, 'samples': 24446016, 'steps': 127322, 'loss/train': 1.5614097118377686} 11/07/2021 15:09:22 - INFO - __main__ - Step 127324: {'lr': 2.8408961479815588e-05, 'samples': 24446208, 'steps': 127323, 'loss/train': 1.1763522624969482} 11/07/2021 15:09:22 - INFO - __main__ - Step 127325: {'lr': 2.8406504566785257e-05, 'samples': 24446400, 'steps': 127324, 'loss/train': 1.0655627250671387} 11/07/2021 15:09:23 - INFO - __main__ - Step 127326: {'lr': 2.8404047753601476e-05, 'samples': 24446592, 'steps': 127325, 'loss/train': 1.3819565773010254} 11/07/2021 15:09:24 - INFO - __main__ - Step 127327: {'lr': 2.8401591040265407e-05, 'samples': 24446784, 'steps': 127326, 'loss/train': 1.4956905841827393} 11/07/2021 15:09:24 - INFO - __main__ - Step 127328: {'lr': 2.8399134426778188e-05, 'samples': 24446976, 'steps': 127327, 'loss/train': 1.2364369630813599} 11/07/2021 15:09:24 - INFO - __main__ - Step 127329: {'lr': 2.839667791314088e-05, 'samples': 24447168, 'steps': 127328, 'loss/train': 0.9135932922363281} 11/07/2021 15:09:25 - INFO - __main__ - Step 127330: {'lr': 2.839422149935461e-05, 'samples': 24447360, 'steps': 127329, 'loss/train': 1.8211488723754883} 11/07/2021 15:09:25 - INFO - __main__ - Step 127331: {'lr': 2.839176518542047e-05, 'samples': 24447552, 'steps': 127330, 'loss/train': 1.5355005264282227} 11/07/2021 15:09:26 - INFO - __main__ - Step 127332: {'lr': 2.838930897133962e-05, 'samples': 24447744, 'steps': 127331, 'loss/train': 1.32616126537323} 11/07/2021 15:09:27 - INFO - __main__ - Step 127333: {'lr': 2.8386852857113093e-05, 'samples': 24447936, 'steps': 127332, 'loss/train': 1.105683445930481} 11/07/2021 15:09:27 - INFO - __main__ - Step 127334: {'lr': 2.838439684274205e-05, 'samples': 24448128, 'steps': 127333, 'loss/train': 1.9204657077789307} 11/07/2021 15:09:27 - INFO - __main__ - Step 127335: {'lr': 2.8381940928227574e-05, 'samples': 24448320, 'steps': 127334, 'loss/train': 1.0007163286209106} 11/07/2021 15:09:28 - INFO - __main__ - Step 127336: {'lr': 2.83794851135708e-05, 'samples': 24448512, 'steps': 127335, 'loss/train': 0.746687650680542} 11/07/2021 15:09:29 - INFO - __main__ - Step 127337: {'lr': 2.837702939877279e-05, 'samples': 24448704, 'steps': 127336, 'loss/train': 1.3749511241912842} 11/07/2021 15:09:29 - INFO - __main__ - Step 127338: {'lr': 2.8374573783834678e-05, 'samples': 24448896, 'steps': 127337, 'loss/train': 1.4586033821105957} 11/07/2021 15:09:29 - INFO - __main__ - Step 127339: {'lr': 2.8372118268757545e-05, 'samples': 24449088, 'steps': 127338, 'loss/train': 1.4879738092422485} 11/07/2021 15:09:30 - INFO - __main__ - Step 127340: {'lr': 2.8369662853542504e-05, 'samples': 24449280, 'steps': 127339, 'loss/train': 1.1421332359313965} 11/07/2021 15:09:30 - INFO - __main__ - Step 127341: {'lr': 2.8367207538190692e-05, 'samples': 24449472, 'steps': 127340, 'loss/train': 1.0797697305679321} 11/07/2021 15:09:31 - INFO - __main__ - Step 127342: {'lr': 2.836475232270319e-05, 'samples': 24449664, 'steps': 127341, 'loss/train': 1.049251914024353} 11/07/2021 15:09:31 - INFO - __main__ - Step 127343: {'lr': 2.836229720708111e-05, 'samples': 24449856, 'steps': 127342, 'loss/train': 1.5822631120681763} 11/07/2021 15:09:32 - INFO - __main__ - Step 127344: {'lr': 2.8359842191325563e-05, 'samples': 24450048, 'steps': 127343, 'loss/train': 1.7498985528945923} 11/07/2021 15:09:32 - INFO - __main__ - Step 127345: {'lr': 2.8357387275437657e-05, 'samples': 24450240, 'steps': 127344, 'loss/train': 1.1952403783798218} 11/07/2021 15:09:33 - INFO - __main__ - Step 127346: {'lr': 2.8354932459418476e-05, 'samples': 24450432, 'steps': 127345, 'loss/train': 1.4398423433303833} 11/07/2021 15:09:34 - INFO - __main__ - Step 127347: {'lr': 2.8352477743269213e-05, 'samples': 24450624, 'steps': 127346, 'loss/train': 1.227957844734192} 11/07/2021 15:09:34 - INFO - __main__ - Step 127348: {'lr': 2.8350023126990836e-05, 'samples': 24450816, 'steps': 127347, 'loss/train': 1.2450354099273682} 11/07/2021 15:09:34 - INFO - __main__ - Step 127349: {'lr': 2.8347568610584546e-05, 'samples': 24451008, 'steps': 127348, 'loss/train': 1.4981589317321777} 11/07/2021 15:09:35 - INFO - __main__ - Step 127350: {'lr': 2.834511419405139e-05, 'samples': 24451200, 'steps': 127349, 'loss/train': 1.420602798461914} 11/07/2021 15:09:35 - INFO - __main__ - Step 127351: {'lr': 2.8342659877392512e-05, 'samples': 24451392, 'steps': 127350, 'loss/train': 0.7371758222579956} 11/07/2021 15:09:36 - INFO - __main__ - Step 127352: {'lr': 2.834020566060902e-05, 'samples': 24451584, 'steps': 127351, 'loss/train': 1.6007776260375977} 11/07/2021 15:09:36 - INFO - __main__ - Step 127353: {'lr': 2.8337751543701996e-05, 'samples': 24451776, 'steps': 127352, 'loss/train': 1.4156153202056885} 11/07/2021 15:09:37 - INFO - __main__ - Step 127354: {'lr': 2.8335297526672576e-05, 'samples': 24451968, 'steps': 127353, 'loss/train': 0.15996623039245605} 11/07/2021 15:09:37 - INFO - __main__ - Step 127355: {'lr': 2.8332843609521848e-05, 'samples': 24452160, 'steps': 127354, 'loss/train': 1.075707197189331} 11/07/2021 15:09:38 - INFO - __main__ - Step 127356: {'lr': 2.8330389792250887e-05, 'samples': 24452352, 'steps': 127355, 'loss/train': 1.2514454126358032} 11/07/2021 15:09:38 - INFO - __main__ - Step 127357: {'lr': 2.8327936074860865e-05, 'samples': 24452544, 'steps': 127356, 'loss/train': 1.620068073272705} 11/07/2021 15:09:39 - INFO - __main__ - Step 127358: {'lr': 2.8325482457352918e-05, 'samples': 24452736, 'steps': 127357, 'loss/train': 1.4846150875091553} 11/07/2021 15:09:39 - INFO - __main__ - Step 127359: {'lr': 2.8323028939728018e-05, 'samples': 24452928, 'steps': 127358, 'loss/train': 1.042799949645996} 11/07/2021 15:09:40 - INFO - __main__ - Step 127360: {'lr': 2.832057552198733e-05, 'samples': 24453120, 'steps': 127359, 'loss/train': 0.8057222366333008} 11/07/2021 15:09:40 - INFO - __main__ - Step 127361: {'lr': 2.8318122204131992e-05, 'samples': 24453312, 'steps': 127360, 'loss/train': 1.2397186756134033} 11/07/2021 15:09:40 - INFO - __main__ - Step 127362: {'lr': 2.8315668986163086e-05, 'samples': 24453504, 'steps': 127361, 'loss/train': 1.521505355834961} 11/07/2021 15:09:41 - INFO - __main__ - Step 127363: {'lr': 2.8313215868081692e-05, 'samples': 24453696, 'steps': 127362, 'loss/train': 1.3455101251602173} 11/07/2021 15:09:42 - INFO - __main__ - Step 127364: {'lr': 2.8310762849888955e-05, 'samples': 24453888, 'steps': 127363, 'loss/train': 1.613576054573059} 11/07/2021 15:09:42 - INFO - __main__ - Step 127365: {'lr': 2.830830993158598e-05, 'samples': 24454080, 'steps': 127364, 'loss/train': 1.4037970304489136} 11/07/2021 15:09:42 - INFO - __main__ - Step 127366: {'lr': 2.830585711317385e-05, 'samples': 24454272, 'steps': 127365, 'loss/train': 1.0785930156707764} 11/07/2021 15:09:43 - INFO - __main__ - Step 127367: {'lr': 2.8303404394653675e-05, 'samples': 24454464, 'steps': 127366, 'loss/train': 1.0523191690444946} 11/07/2021 15:09:43 - INFO - __main__ - Step 127368: {'lr': 2.8300951776026597e-05, 'samples': 24454656, 'steps': 127367, 'loss/train': 0.7409497499465942} 11/07/2021 15:09:44 - INFO - __main__ - Step 127369: {'lr': 2.8298499257293692e-05, 'samples': 24454848, 'steps': 127368, 'loss/train': 1.0276899337768555} 11/07/2021 15:09:45 - INFO - __main__ - Step 127370: {'lr': 2.8296046838456048e-05, 'samples': 24455040, 'steps': 127369, 'loss/train': 1.3104897737503052} 11/07/2021 15:09:45 - INFO - __main__ - Step 127371: {'lr': 2.829359451951477e-05, 'samples': 24455232, 'steps': 127370, 'loss/train': 1.2782245874404907} 11/07/2021 15:09:45 - INFO - __main__ - Step 127372: {'lr': 2.8291142300470975e-05, 'samples': 24455424, 'steps': 127371, 'loss/train': 0.9695186018943787} 11/07/2021 15:09:46 - INFO - __main__ - Step 127373: {'lr': 2.8288690181325764e-05, 'samples': 24455616, 'steps': 127372, 'loss/train': 0.06487217545509338} 11/07/2021 15:09:47 - INFO - __main__ - Step 127374: {'lr': 2.8286238162080257e-05, 'samples': 24455808, 'steps': 127373, 'loss/train': 1.0924158096313477} 11/07/2021 15:09:47 - INFO - __main__ - Step 127375: {'lr': 2.828378624273556e-05, 'samples': 24456000, 'steps': 127374, 'loss/train': 1.3074824810028076} 11/07/2021 15:09:47 - INFO - __main__ - Step 127376: {'lr': 2.8281334423292755e-05, 'samples': 24456192, 'steps': 127375, 'loss/train': 1.4370425939559937} 11/07/2021 15:09:48 - INFO - __main__ - Step 127377: {'lr': 2.8278882703752952e-05, 'samples': 24456384, 'steps': 127376, 'loss/train': 1.8319308757781982} 11/07/2021 15:09:48 - INFO - __main__ - Step 127378: {'lr': 2.8276431084117288e-05, 'samples': 24456576, 'steps': 127377, 'loss/train': 1.2232800722122192} 11/07/2021 15:09:49 - INFO - __main__ - Step 127379: {'lr': 2.827397956438682e-05, 'samples': 24456768, 'steps': 127378, 'loss/train': 1.2381144762039185} 11/07/2021 15:09:50 - INFO - __main__ - Step 127380: {'lr': 2.8271528144562685e-05, 'samples': 24456960, 'steps': 127379, 'loss/train': 1.3686143159866333} 11/07/2021 15:09:50 - INFO - __main__ - Step 127381: {'lr': 2.8269076824646023e-05, 'samples': 24457152, 'steps': 127380, 'loss/train': 1.2457557916641235} 11/07/2021 15:09:50 - INFO - __main__ - Step 127382: {'lr': 2.8266625604637857e-05, 'samples': 24457344, 'steps': 127381, 'loss/train': 0.8131839632987976} 11/07/2021 15:09:51 - INFO - __main__ - Step 127383: {'lr': 2.82641744845393e-05, 'samples': 24457536, 'steps': 127382, 'loss/train': 1.4410005807876587} 11/07/2021 15:09:52 - INFO - __main__ - Step 127384: {'lr': 2.826172346435152e-05, 'samples': 24457728, 'steps': 127383, 'loss/train': 1.2461665868759155} 11/07/2021 15:09:52 - INFO - __main__ - Step 127385: {'lr': 2.8259272544075566e-05, 'samples': 24457920, 'steps': 127384, 'loss/train': 1.101196527481079} 11/07/2021 15:09:52 - INFO - __main__ - Step 127386: {'lr': 2.825682172371255e-05, 'samples': 24458112, 'steps': 127385, 'loss/train': 1.3900146484375} 11/07/2021 15:09:53 - INFO - __main__ - Step 127387: {'lr': 2.8254371003263614e-05, 'samples': 24458304, 'steps': 127386, 'loss/train': 1.4015867710113525} 11/07/2021 15:09:53 - INFO - __main__ - Step 127388: {'lr': 2.8251920382729805e-05, 'samples': 24458496, 'steps': 127387, 'loss/train': 1.125569462776184} 11/07/2021 15:09:54 - INFO - __main__ - Step 127389: {'lr': 2.82494698621123e-05, 'samples': 24458688, 'steps': 127388, 'loss/train': 0.8955537676811218} 11/07/2021 15:09:54 - INFO - __main__ - Step 127390: {'lr': 2.8247019441412143e-05, 'samples': 24458880, 'steps': 127389, 'loss/train': 1.0279241800308228} 11/07/2021 15:09:55 - INFO - __main__ - Step 127391: {'lr': 2.8244569120630447e-05, 'samples': 24459072, 'steps': 127390, 'loss/train': 1.1909877061843872} 11/07/2021 15:09:55 - INFO - __main__ - Step 127392: {'lr': 2.8242118899768325e-05, 'samples': 24459264, 'steps': 127391, 'loss/train': 0.9851958155632019} 11/07/2021 15:09:55 - INFO - __main__ - Step 127393: {'lr': 2.823966877882689e-05, 'samples': 24459456, 'steps': 127392, 'loss/train': 1.4380848407745361} 11/07/2021 15:09:56 - INFO - __main__ - Step 127394: {'lr': 2.8237218757807297e-05, 'samples': 24459648, 'steps': 127393, 'loss/train': 1.2367806434631348} 11/07/2021 15:09:57 - INFO - __main__ - Step 127395: {'lr': 2.8234768836710528e-05, 'samples': 24459840, 'steps': 127394, 'loss/train': 1.1377027034759521} 11/07/2021 15:09:57 - INFO - __main__ - Step 127396: {'lr': 2.8232319015537772e-05, 'samples': 24460032, 'steps': 127395, 'loss/train': 1.2425042390823364} 11/07/2021 15:09:58 - INFO - __main__ - Step 127397: {'lr': 2.8229869294290082e-05, 'samples': 24460224, 'steps': 127396, 'loss/train': 1.205188512802124} 11/07/2021 15:09:58 - INFO - __main__ - Step 127398: {'lr': 2.82274196729686e-05, 'samples': 24460416, 'steps': 127397, 'loss/train': 1.2285685539245605} 11/07/2021 15:09:58 - INFO - __main__ - Step 127399: {'lr': 2.8224970151574435e-05, 'samples': 24460608, 'steps': 127398, 'loss/train': 1.3948814868927002} 11/07/2021 15:09:59 - INFO - __main__ - Step 127400: {'lr': 2.822252073010867e-05, 'samples': 24460800, 'steps': 127399, 'loss/train': 1.2602953910827637} 11/07/2021 15:10:00 - INFO - __main__ - Step 127401: {'lr': 2.8220071408572412e-05, 'samples': 24460992, 'steps': 127400, 'loss/train': 1.1292091608047485} 11/07/2021 15:10:00 - INFO - __main__ - Step 127402: {'lr': 2.8217622186966747e-05, 'samples': 24461184, 'steps': 127401, 'loss/train': 1.4093436002731323} 11/07/2021 15:10:00 - INFO - __main__ - Step 127403: {'lr': 2.8215173065292837e-05, 'samples': 24461376, 'steps': 127402, 'loss/train': 1.3943629264831543} 11/07/2021 15:10:01 - INFO - __main__ - Step 127404: {'lr': 2.8212724043551714e-05, 'samples': 24461568, 'steps': 127403, 'loss/train': 1.1190706491470337} 11/07/2021 15:10:02 - INFO - __main__ - Step 127405: {'lr': 2.821027512174454e-05, 'samples': 24461760, 'steps': 127404, 'loss/train': 1.4615131616592407} 11/07/2021 15:10:02 - INFO - __main__ - Step 127406: {'lr': 2.820782629987237e-05, 'samples': 24461952, 'steps': 127405, 'loss/train': 1.487715482711792} 11/07/2021 15:10:03 - INFO - __main__ - Step 127407: {'lr': 2.8205377577936343e-05, 'samples': 24462144, 'steps': 127406, 'loss/train': 1.3584387302398682} 11/07/2021 15:10:03 - INFO - __main__ - Step 127408: {'lr': 2.8202928955937624e-05, 'samples': 24462336, 'steps': 127407, 'loss/train': 1.2882779836654663} 11/07/2021 15:10:03 - INFO - __main__ - Step 127409: {'lr': 2.8200480433877158e-05, 'samples': 24462528, 'steps': 127408, 'loss/train': 1.2979116439819336} 11/07/2021 15:10:04 - INFO - __main__ - Step 127410: {'lr': 2.8198032011756137e-05, 'samples': 24462720, 'steps': 127409, 'loss/train': 1.5824718475341797} 11/07/2021 15:10:05 - INFO - __main__ - Step 127411: {'lr': 2.819558368957567e-05, 'samples': 24462912, 'steps': 127410, 'loss/train': 1.0917692184448242} 11/07/2021 15:10:05 - INFO - __main__ - Step 127412: {'lr': 2.819313546733687e-05, 'samples': 24463104, 'steps': 127411, 'loss/train': 1.4211770296096802} 11/07/2021 15:10:05 - INFO - __main__ - Step 127413: {'lr': 2.8190687345040794e-05, 'samples': 24463296, 'steps': 127412, 'loss/train': 1.3511962890625} 11/07/2021 15:10:06 - INFO - __main__ - Step 127414: {'lr': 2.8188239322688574e-05, 'samples': 24463488, 'steps': 127413, 'loss/train': 0.9346222877502441} 11/07/2021 15:10:06 - INFO - __main__ - Step 127415: {'lr': 2.8185791400281326e-05, 'samples': 24463680, 'steps': 127414, 'loss/train': 0.945634126663208} 11/07/2021 15:10:07 - INFO - __main__ - Step 127416: {'lr': 2.81833435778201e-05, 'samples': 24463872, 'steps': 127415, 'loss/train': 1.5918792486190796} 11/07/2021 15:10:07 - INFO - __main__ - Step 127417: {'lr': 2.818089585530606e-05, 'samples': 24464064, 'steps': 127416, 'loss/train': 0.536130964756012} 11/07/2021 15:10:08 - INFO - __main__ - Step 127418: {'lr': 2.8178448232740296e-05, 'samples': 24464256, 'steps': 127417, 'loss/train': 1.4495735168457031} 11/07/2021 15:10:08 - INFO - __main__ - Step 127419: {'lr': 2.8176000710123884e-05, 'samples': 24464448, 'steps': 127418, 'loss/train': 1.0468899011611938} 11/07/2021 15:10:08 - INFO - __main__ - Step 127420: {'lr': 2.8173553287457963e-05, 'samples': 24464640, 'steps': 127419, 'loss/train': 1.1312528848648071} 11/07/2021 15:10:09 - INFO - __main__ - Step 127421: {'lr': 2.8171105964743648e-05, 'samples': 24464832, 'steps': 127420, 'loss/train': 1.778384804725647} 11/07/2021 15:10:10 - INFO - __main__ - Step 127422: {'lr': 2.816865874198196e-05, 'samples': 24465024, 'steps': 127421, 'loss/train': 1.3315538167953491} 11/07/2021 15:10:10 - INFO - __main__ - Step 127423: {'lr': 2.816621161917407e-05, 'samples': 24465216, 'steps': 127422, 'loss/train': 1.4423757791519165} 11/07/2021 15:10:11 - INFO - __main__ - Step 127424: {'lr': 2.8163764596321055e-05, 'samples': 24465408, 'steps': 127423, 'loss/train': 0.9890381097793579} 11/07/2021 15:10:11 - INFO - __main__ - Step 127425: {'lr': 2.8161317673424004e-05, 'samples': 24465600, 'steps': 127424, 'loss/train': 1.2109596729278564} 11/07/2021 15:10:12 - INFO - __main__ - Step 127426: {'lr': 2.8158870850484048e-05, 'samples': 24465792, 'steps': 127425, 'loss/train': 1.581565260887146} 11/07/2021 15:10:12 - INFO - __main__ - Step 127427: {'lr': 2.81564241275023e-05, 'samples': 24465984, 'steps': 127426, 'loss/train': 1.4466766119003296} 11/07/2021 15:10:13 - INFO - __main__ - Step 127428: {'lr': 2.8153977504479815e-05, 'samples': 24466176, 'steps': 127427, 'loss/train': 1.2381749153137207} 11/07/2021 15:10:13 - INFO - __main__ - Step 127429: {'lr': 2.8151530981417762e-05, 'samples': 24466368, 'steps': 127428, 'loss/train': 1.415959358215332} 11/07/2021 15:10:13 - INFO - __main__ - Step 127430: {'lr': 2.814908455831719e-05, 'samples': 24466560, 'steps': 127429, 'loss/train': 0.9965806007385254} 11/07/2021 15:10:14 - INFO - __main__ - Step 127431: {'lr': 2.8146638235179213e-05, 'samples': 24466752, 'steps': 127430, 'loss/train': 1.0692662000656128} 11/07/2021 15:10:15 - INFO - __main__ - Step 127432: {'lr': 2.814419201200491e-05, 'samples': 24466944, 'steps': 127431, 'loss/train': 1.152289867401123} 11/07/2021 15:10:15 - INFO - __main__ - Step 127433: {'lr': 2.814174588879545e-05, 'samples': 24467136, 'steps': 127432, 'loss/train': 1.613016963005066} 11/07/2021 15:10:15 - INFO - __main__ - Step 127434: {'lr': 2.8139299865551944e-05, 'samples': 24467328, 'steps': 127433, 'loss/train': 1.1395798921585083} 11/07/2021 15:10:16 - INFO - __main__ - Step 127435: {'lr': 2.8136853942275388e-05, 'samples': 24467520, 'steps': 127434, 'loss/train': 2.638866662979126} 11/07/2021 15:10:17 - INFO - __main__ - Step 127436: {'lr': 2.813440811896692e-05, 'samples': 24467712, 'steps': 127435, 'loss/train': 1.4294121265411377} 11/07/2021 15:10:17 - INFO - __main__ - Step 127437: {'lr': 2.813196239562768e-05, 'samples': 24467904, 'steps': 127436, 'loss/train': 1.390950322151184} 11/07/2021 15:10:18 - INFO - __main__ - Step 127438: {'lr': 2.812951677225878e-05, 'samples': 24468096, 'steps': 127437, 'loss/train': 5.406692981719971} 11/07/2021 15:10:18 - INFO - __main__ - Step 127439: {'lr': 2.812707124886127e-05, 'samples': 24468288, 'steps': 127438, 'loss/train': 1.3687231540679932} 11/07/2021 15:10:18 - INFO - __main__ - Step 127440: {'lr': 2.8124625825436263e-05, 'samples': 24468480, 'steps': 127439, 'loss/train': 1.705001950263977} 11/07/2021 15:10:19 - INFO - __main__ - Step 127441: {'lr': 2.8122180501984896e-05, 'samples': 24468672, 'steps': 127440, 'loss/train': 1.225957989692688} 11/07/2021 15:10:20 - INFO - __main__ - Step 127442: {'lr': 2.8119735278508252e-05, 'samples': 24468864, 'steps': 127441, 'loss/train': 1.6861222982406616} 11/07/2021 15:10:20 - INFO - __main__ - Step 127443: {'lr': 2.811729015500744e-05, 'samples': 24469056, 'steps': 127442, 'loss/train': 0.9600949287414551} 11/07/2021 15:10:21 - INFO - __main__ - Step 127444: {'lr': 2.811484513148352e-05, 'samples': 24469248, 'steps': 127443, 'loss/train': 1.206408977508545} 11/07/2021 15:10:21 - INFO - __main__ - Step 127445: {'lr': 2.811240020793765e-05, 'samples': 24469440, 'steps': 127444, 'loss/train': 1.4328906536102295} 11/07/2021 15:10:21 - INFO - __main__ - Step 127446: {'lr': 2.8109955384370918e-05, 'samples': 24469632, 'steps': 127445, 'loss/train': 1.358027458190918} 11/07/2021 15:10:22 - INFO - __main__ - Step 127447: {'lr': 2.81075106607844e-05, 'samples': 24469824, 'steps': 127446, 'loss/train': 1.3103018999099731} 11/07/2021 15:10:23 - INFO - __main__ - Step 127448: {'lr': 2.810506603717927e-05, 'samples': 24470016, 'steps': 127447, 'loss/train': 0.8726685047149658} 11/07/2021 15:10:23 - INFO - __main__ - Step 127449: {'lr': 2.8102621513556525e-05, 'samples': 24470208, 'steps': 127448, 'loss/train': 1.2342798709869385} 11/07/2021 15:10:23 - INFO - __main__ - Step 127450: {'lr': 2.81001770899173e-05, 'samples': 24470400, 'steps': 127449, 'loss/train': 0.42606306076049805} 11/07/2021 15:10:24 - INFO - __main__ - Step 127451: {'lr': 2.8097732766262736e-05, 'samples': 24470592, 'steps': 127450, 'loss/train': 0.5467414259910583} 11/07/2021 15:10:25 - INFO - __main__ - Step 127452: {'lr': 2.8095288542593882e-05, 'samples': 24470784, 'steps': 127451, 'loss/train': 1.2466223239898682} 11/07/2021 15:10:25 - INFO - __main__ - Step 127453: {'lr': 2.8092844418911884e-05, 'samples': 24470976, 'steps': 127452, 'loss/train': 1.3191438913345337} 11/07/2021 15:10:26 - INFO - __main__ - Step 127454: {'lr': 2.809040039521782e-05, 'samples': 24471168, 'steps': 127453, 'loss/train': 0.4469349980354309} 11/07/2021 15:10:26 - INFO - __main__ - Step 127455: {'lr': 2.80879564715128e-05, 'samples': 24471360, 'steps': 127454, 'loss/train': 1.3808361291885376} 11/07/2021 15:10:26 - INFO - __main__ - Step 127456: {'lr': 2.8085512647797934e-05, 'samples': 24471552, 'steps': 127455, 'loss/train': 0.9467670917510986} 11/07/2021 15:10:27 - INFO - __main__ - Step 127457: {'lr': 2.8083068924074305e-05, 'samples': 24471744, 'steps': 127456, 'loss/train': 1.3953697681427002} 11/07/2021 15:10:28 - INFO - __main__ - Step 127458: {'lr': 2.8080625300342998e-05, 'samples': 24471936, 'steps': 127457, 'loss/train': 1.0965896844863892} 11/07/2021 15:10:28 - INFO - __main__ - Step 127459: {'lr': 2.8078181776605176e-05, 'samples': 24472128, 'steps': 127458, 'loss/train': 1.1305546760559082} 11/07/2021 15:10:29 - INFO - __main__ - Step 127460: {'lr': 2.8075738352861868e-05, 'samples': 24472320, 'steps': 127459, 'loss/train': 1.2242878675460815} 11/07/2021 15:10:29 - INFO - __main__ - Step 127461: {'lr': 2.8073295029114265e-05, 'samples': 24472512, 'steps': 127460, 'loss/train': 0.9504767060279846} 11/07/2021 15:10:30 - INFO - __main__ - Step 127462: {'lr': 2.8070851805363367e-05, 'samples': 24472704, 'steps': 127461, 'loss/train': 0.8253231644630432} 11/07/2021 15:10:30 - INFO - __main__ - Step 127463: {'lr': 2.8068408681610312e-05, 'samples': 24472896, 'steps': 127462, 'loss/train': 1.1920521259307861} 11/07/2021 15:10:31 - INFO - __main__ - Step 127464: {'lr': 2.8065965657856212e-05, 'samples': 24473088, 'steps': 127463, 'loss/train': 0.7486048936843872} 11/07/2021 15:10:31 - INFO - __main__ - Step 127465: {'lr': 2.8063522734102175e-05, 'samples': 24473280, 'steps': 127464, 'loss/train': 1.3378194570541382} 11/07/2021 15:10:31 - INFO - __main__ - Step 127466: {'lr': 2.8061079910349284e-05, 'samples': 24473472, 'steps': 127465, 'loss/train': 1.661170482635498} 11/07/2021 15:10:32 - INFO - __main__ - Step 127467: {'lr': 2.8058637186598625e-05, 'samples': 24473664, 'steps': 127466, 'loss/train': 1.4637367725372314} 11/07/2021 15:10:33 - INFO - __main__ - Step 127468: {'lr': 2.8056194562851355e-05, 'samples': 24473856, 'steps': 127467, 'loss/train': 1.2956299781799316} 11/07/2021 15:10:33 - INFO - __main__ - Step 127469: {'lr': 2.805375203910851e-05, 'samples': 24474048, 'steps': 127468, 'loss/train': 1.4584892988204956} 11/07/2021 15:10:34 - INFO - __main__ - Step 127470: {'lr': 2.8051309615371223e-05, 'samples': 24474240, 'steps': 127469, 'loss/train': 1.3993446826934814} 11/07/2021 15:10:34 - INFO - __main__ - Step 127471: {'lr': 2.8048867291640608e-05, 'samples': 24474432, 'steps': 127470, 'loss/train': 1.4950909614562988} 11/07/2021 15:10:34 - INFO - __main__ - Step 127472: {'lr': 2.8046425067917742e-05, 'samples': 24474624, 'steps': 127471, 'loss/train': 1.2496424913406372} 11/07/2021 15:10:35 - INFO - __main__ - Step 127473: {'lr': 2.8043982944203712e-05, 'samples': 24474816, 'steps': 127472, 'loss/train': 1.0867295265197754} 11/07/2021 15:10:36 - INFO - __main__ - Step 127474: {'lr': 2.8041540920499654e-05, 'samples': 24475008, 'steps': 127473, 'loss/train': 0.9344812035560608} 11/07/2021 15:10:36 - INFO - __main__ - Step 127475: {'lr': 2.8039098996806704e-05, 'samples': 24475200, 'steps': 127474, 'loss/train': 1.1853927373886108} 11/07/2021 15:10:36 - INFO - __main__ - Step 127476: {'lr': 2.8036657173125868e-05, 'samples': 24475392, 'steps': 127475, 'loss/train': 1.4356558322906494} 11/07/2021 15:10:37 - INFO - __main__ - Step 127477: {'lr': 2.803421544945828e-05, 'samples': 24475584, 'steps': 127476, 'loss/train': 1.4372875690460205} 11/07/2021 15:10:38 - INFO - __main__ - Step 127478: {'lr': 2.8031773825805046e-05, 'samples': 24475776, 'steps': 127477, 'loss/train': 1.4648255109786987} 11/07/2021 15:10:38 - INFO - __main__ - Step 127479: {'lr': 2.8029332302167254e-05, 'samples': 24475968, 'steps': 127478, 'loss/train': 1.1612051725387573} 11/07/2021 15:10:38 - INFO - __main__ - Step 127480: {'lr': 2.802689087854604e-05, 'samples': 24476160, 'steps': 127479, 'loss/train': 2.036334276199341} 11/07/2021 15:10:39 - INFO - __main__ - Step 127481: {'lr': 2.8024449554942488e-05, 'samples': 24476352, 'steps': 127480, 'loss/train': 1.1616487503051758} 11/07/2021 15:10:39 - INFO - __main__ - Step 127482: {'lr': 2.802200833135768e-05, 'samples': 24476544, 'steps': 127481, 'loss/train': 0.8500939011573792} 11/07/2021 15:10:40 - INFO - __main__ - Step 127483: {'lr': 2.8019567207792752e-05, 'samples': 24476736, 'steps': 127482, 'loss/train': 1.1836812496185303} 11/07/2021 15:10:40 - INFO - __main__ - Step 127484: {'lr': 2.801712618424876e-05, 'samples': 24476928, 'steps': 127483, 'loss/train': 1.0783238410949707} 11/07/2021 15:10:41 - INFO - __main__ - Step 127485: {'lr': 2.801468526072684e-05, 'samples': 24477120, 'steps': 127484, 'loss/train': 1.2788667678833008} 11/07/2021 15:10:41 - INFO - __main__ - Step 127486: {'lr': 2.8012244437228053e-05, 'samples': 24477312, 'steps': 127485, 'loss/train': 1.3765960931777954} 11/07/2021 15:10:42 - INFO - __main__ - Step 127487: {'lr': 2.8009803713753555e-05, 'samples': 24477504, 'steps': 127486, 'loss/train': 1.118585467338562} 11/07/2021 15:10:42 - INFO - __main__ - Step 127488: {'lr': 2.800736309030444e-05, 'samples': 24477696, 'steps': 127487, 'loss/train': 1.687037467956543} 11/07/2021 15:10:43 - INFO - __main__ - Step 127489: {'lr': 2.800492256688175e-05, 'samples': 24477888, 'steps': 127488, 'loss/train': 1.0825490951538086} 11/07/2021 15:10:43 - INFO - __main__ - Step 127490: {'lr': 2.8002482143486608e-05, 'samples': 24478080, 'steps': 127489, 'loss/train': 0.9943603277206421} 11/07/2021 15:10:44 - INFO - __main__ - Step 127491: {'lr': 2.8000041820120114e-05, 'samples': 24478272, 'steps': 127490, 'loss/train': 1.2129042148590088} 11/07/2021 15:10:44 - INFO - __main__ - Step 127492: {'lr': 2.7997601596783386e-05, 'samples': 24478464, 'steps': 127491, 'loss/train': 0.9085214138031006} 11/07/2021 15:10:45 - INFO - __main__ - Step 127493: {'lr': 2.7995161473477498e-05, 'samples': 24478656, 'steps': 127492, 'loss/train': 1.6115130186080933} 11/07/2021 15:10:45 - INFO - __main__ - Step 127494: {'lr': 2.799272145020357e-05, 'samples': 24478848, 'steps': 127493, 'loss/train': 1.2391256093978882} 11/07/2021 15:10:46 - INFO - __main__ - Step 127495: {'lr': 2.7990281526962703e-05, 'samples': 24479040, 'steps': 127494, 'loss/train': 1.2509396076202393} 11/07/2021 15:10:46 - INFO - __main__ - Step 127496: {'lr': 2.7987841703755985e-05, 'samples': 24479232, 'steps': 127495, 'loss/train': 1.2066222429275513} 11/07/2021 15:10:46 - INFO - __main__ - Step 127497: {'lr': 2.7985401980584524e-05, 'samples': 24479424, 'steps': 127496, 'loss/train': 1.631461262702942} 11/07/2021 15:10:47 - INFO - __main__ - Step 127498: {'lr': 2.798296235744943e-05, 'samples': 24479616, 'steps': 127497, 'loss/train': 0.029564259573817253} 11/07/2021 15:10:48 - INFO - __main__ - Step 127499: {'lr': 2.7980522834351764e-05, 'samples': 24479808, 'steps': 127498, 'loss/train': 0.6553418636322021} 11/07/2021 15:10:48 - INFO - __main__ - Step 127500: {'lr': 2.7978083411292656e-05, 'samples': 24480000, 'steps': 127499, 'loss/train': 1.3191282749176025} 11/07/2021 15:10:49 - INFO - __main__ - Step 127501: {'lr': 2.797564408827319e-05, 'samples': 24480192, 'steps': 127500, 'loss/train': 1.4383914470672607} 11/07/2021 15:10:49 - INFO - __main__ - Step 127502: {'lr': 2.7973204865294533e-05, 'samples': 24480384, 'steps': 127501, 'loss/train': 0.4767230749130249} 11/07/2021 15:10:49 - INFO - __main__ - Step 127503: {'lr': 2.7970765742357684e-05, 'samples': 24480576, 'steps': 127502, 'loss/train': 1.3569128513336182} 11/07/2021 15:10:50 - INFO - __main__ - Step 127504: {'lr': 2.7968326719463753e-05, 'samples': 24480768, 'steps': 127503, 'loss/train': 1.1442145109176636} 11/07/2021 15:10:51 - INFO - __main__ - Step 127505: {'lr': 2.796588779661388e-05, 'samples': 24480960, 'steps': 127504, 'loss/train': 1.5535237789154053} 11/07/2021 15:10:51 - INFO - __main__ - Step 127506: {'lr': 2.7963448973809173e-05, 'samples': 24481152, 'steps': 127505, 'loss/train': 1.3417644500732422} 11/07/2021 15:10:51 - INFO - __main__ - Step 127507: {'lr': 2.796101025105069e-05, 'samples': 24481344, 'steps': 127506, 'loss/train': 1.3354460000991821} 11/07/2021 15:10:52 - INFO - __main__ - Step 127508: {'lr': 2.7958571628339534e-05, 'samples': 24481536, 'steps': 127507, 'loss/train': 1.4160127639770508} 11/07/2021 15:10:53 - INFO - __main__ - Step 127509: {'lr': 2.7956133105676852e-05, 'samples': 24481728, 'steps': 127508, 'loss/train': 1.2502776384353638} 11/07/2021 15:10:53 - INFO - __main__ - Step 127510: {'lr': 2.795369468306369e-05, 'samples': 24481920, 'steps': 127509, 'loss/train': 0.9528316259384155} 11/07/2021 15:10:54 - INFO - __main__ - Step 127511: {'lr': 2.7951256360501164e-05, 'samples': 24482112, 'steps': 127510, 'loss/train': 2.6465353965759277} 11/07/2021 15:10:54 - INFO - __main__ - Step 127512: {'lr': 2.7948818137990383e-05, 'samples': 24482304, 'steps': 127511, 'loss/train': 1.3445038795471191} 11/07/2021 15:10:54 - INFO - __main__ - Step 127513: {'lr': 2.794638001553243e-05, 'samples': 24482496, 'steps': 127512, 'loss/train': 1.5304371118545532} 11/07/2021 15:10:55 - INFO - __main__ - Step 127514: {'lr': 2.7943941993128442e-05, 'samples': 24482688, 'steps': 127513, 'loss/train': 1.4737378358840942} 11/07/2021 15:10:56 - INFO - __main__ - Step 127515: {'lr': 2.79415040707795e-05, 'samples': 24482880, 'steps': 127514, 'loss/train': 1.2689990997314453} 11/07/2021 15:10:56 - INFO - __main__ - Step 127516: {'lr': 2.793906624848666e-05, 'samples': 24483072, 'steps': 127515, 'loss/train': 1.096379280090332} 11/07/2021 15:10:56 - INFO - __main__ - Step 127517: {'lr': 2.7936628526251036e-05, 'samples': 24483264, 'steps': 127516, 'loss/train': 1.97903573513031} 11/07/2021 15:10:57 - INFO - __main__ - Step 127518: {'lr': 2.793419090407376e-05, 'samples': 24483456, 'steps': 127517, 'loss/train': 1.5614213943481445} 11/07/2021 15:10:58 - INFO - __main__ - Step 127519: {'lr': 2.7931753381955888e-05, 'samples': 24483648, 'steps': 127518, 'loss/train': 1.1777136325836182} 11/07/2021 15:10:58 - INFO - __main__ - Step 127520: {'lr': 2.792931595989856e-05, 'samples': 24483840, 'steps': 127519, 'loss/train': 1.1464440822601318} 11/07/2021 15:10:58 - INFO - __main__ - Step 127521: {'lr': 2.792687863790286e-05, 'samples': 24484032, 'steps': 127520, 'loss/train': 1.6192500591278076} 11/07/2021 15:10:59 - INFO - __main__ - Step 127522: {'lr': 2.7924441415969866e-05, 'samples': 24484224, 'steps': 127521, 'loss/train': 0.10025928169488907} 11/07/2021 15:10:59 - INFO - __main__ - Step 127523: {'lr': 2.792200429410069e-05, 'samples': 24484416, 'steps': 127522, 'loss/train': 1.175652265548706} 11/07/2021 15:10:59 - INFO - __main__ - Step 127524: {'lr': 2.7919567272296443e-05, 'samples': 24484608, 'steps': 127523, 'loss/train': 1.3711098432540894} 11/07/2021 15:11:01 - INFO - __main__ - Step 127525: {'lr': 2.7917130350558205e-05, 'samples': 24484800, 'steps': 127524, 'loss/train': 1.2487870454788208} 11/07/2021 15:11:01 - INFO - __main__ - Step 127526: {'lr': 2.7914693528887064e-05, 'samples': 24484992, 'steps': 127525, 'loss/train': 1.759333848953247} 11/07/2021 15:11:01 - INFO - __main__ - Step 127527: {'lr': 2.791225680728418e-05, 'samples': 24485184, 'steps': 127526, 'loss/train': 1.6674946546554565} 11/07/2021 15:11:02 - INFO - __main__ - Step 127528: {'lr': 2.7909820185750557e-05, 'samples': 24485376, 'steps': 127527, 'loss/train': 1.5837161540985107} 11/07/2021 15:11:02 - INFO - __main__ - Step 127529: {'lr': 2.790738366428744e-05, 'samples': 24485568, 'steps': 127528, 'loss/train': 1.5332847833633423} 11/07/2021 15:11:03 - INFO - __main__ - Step 127530: {'lr': 2.7904947242895744e-05, 'samples': 24485760, 'steps': 127529, 'loss/train': 0.6795066595077515} 11/07/2021 15:11:03 - INFO - __main__ - Step 127531: {'lr': 2.7902510921576668e-05, 'samples': 24485952, 'steps': 127530, 'loss/train': 1.3093249797821045} 11/07/2021 15:11:04 - INFO - __main__ - Step 127532: {'lr': 2.790007470033129e-05, 'samples': 24486144, 'steps': 127531, 'loss/train': 1.6721512079238892} 11/07/2021 15:11:04 - INFO - __main__ - Step 127533: {'lr': 2.7897638579160695e-05, 'samples': 24486336, 'steps': 127532, 'loss/train': 1.1844675540924072} 11/07/2021 15:11:04 - INFO - __main__ - Step 127534: {'lr': 2.789520255806602e-05, 'samples': 24486528, 'steps': 127533, 'loss/train': 0.9364411234855652} 11/07/2021 15:11:06 - INFO - __main__ - Step 127535: {'lr': 2.7892766637048318e-05, 'samples': 24486720, 'steps': 127534, 'loss/train': 1.3941208124160767} 11/07/2021 15:11:06 - INFO - __main__ - Step 127536: {'lr': 2.7890330816108728e-05, 'samples': 24486912, 'steps': 127535, 'loss/train': 1.1947989463806152} 11/07/2021 15:11:06 - INFO - __main__ - Step 127537: {'lr': 2.7887895095248307e-05, 'samples': 24487104, 'steps': 127536, 'loss/train': 1.3368407487869263} 11/07/2021 15:11:07 - INFO - __main__ - Step 127538: {'lr': 2.788545947446819e-05, 'samples': 24487296, 'steps': 127537, 'loss/train': 1.4523372650146484} 11/07/2021 15:11:07 - INFO - __main__ - Step 127539: {'lr': 2.788302395376946e-05, 'samples': 24487488, 'steps': 127538, 'loss/train': 1.5409590005874634} 11/07/2021 15:11:07 - INFO - __main__ - Step 127540: {'lr': 2.7880588533153202e-05, 'samples': 24487680, 'steps': 127539, 'loss/train': 5.684698104858398} 11/07/2021 15:11:08 - INFO - __main__ - Step 127541: {'lr': 2.787815321262052e-05, 'samples': 24487872, 'steps': 127540, 'loss/train': 1.1351816654205322} 11/07/2021 15:11:09 - INFO - __main__ - Step 127542: {'lr': 2.787571799217259e-05, 'samples': 24488064, 'steps': 127541, 'loss/train': 1.3223215341567993} 11/07/2021 15:11:09 - INFO - __main__ - Step 127543: {'lr': 2.7873282871810345e-05, 'samples': 24488256, 'steps': 127542, 'loss/train': 1.5079046487808228} 11/07/2021 15:11:09 - INFO - __main__ - Step 127544: {'lr': 2.7870847851534988e-05, 'samples': 24488448, 'steps': 127543, 'loss/train': 1.5413414239883423} 11/07/2021 15:11:10 - INFO - __main__ - Step 127545: {'lr': 2.786841293134762e-05, 'samples': 24488640, 'steps': 127544, 'loss/train': 1.3959287405014038} 11/07/2021 15:11:11 - INFO - __main__ - Step 127546: {'lr': 2.78659781112493e-05, 'samples': 24488832, 'steps': 127545, 'loss/train': 1.3169022798538208} 11/07/2021 15:11:11 - INFO - __main__ - Step 127547: {'lr': 2.7863543391241143e-05, 'samples': 24489024, 'steps': 127546, 'loss/train': 1.3568105697631836} 11/07/2021 15:11:11 - INFO - __main__ - Step 127548: {'lr': 2.7861108771324223e-05, 'samples': 24489216, 'steps': 127547, 'loss/train': 1.6687119007110596} 11/07/2021 15:11:12 - INFO - __main__ - Step 127549: {'lr': 2.785867425149968e-05, 'samples': 24489408, 'steps': 127548, 'loss/train': 0.7994849681854248} 11/07/2021 15:11:12 - INFO - __main__ - Step 127550: {'lr': 2.7856239831768603e-05, 'samples': 24489600, 'steps': 127549, 'loss/train': 1.4182761907577515} 11/07/2021 15:11:13 - INFO - __main__ - Step 127551: {'lr': 2.785380551213207e-05, 'samples': 24489792, 'steps': 127550, 'loss/train': 1.2431058883666992} 11/07/2021 15:11:14 - INFO - __main__ - Step 127552: {'lr': 2.785137129259116e-05, 'samples': 24489984, 'steps': 127551, 'loss/train': 0.9162675738334656} 11/07/2021 15:11:14 - INFO - __main__ - Step 127553: {'lr': 2.7848937173147014e-05, 'samples': 24490176, 'steps': 127552, 'loss/train': 1.4415347576141357} 11/07/2021 15:11:14 - INFO - __main__ - Step 127554: {'lr': 2.784650315380072e-05, 'samples': 24490368, 'steps': 127553, 'loss/train': 1.2482504844665527} 11/07/2021 15:11:15 - INFO - __main__ - Step 127555: {'lr': 2.7844069234553403e-05, 'samples': 24490560, 'steps': 127554, 'loss/train': 1.1430253982543945} 11/07/2021 15:11:16 - INFO - __main__ - Step 127556: {'lr': 2.7841635415406076e-05, 'samples': 24490752, 'steps': 127555, 'loss/train': 1.5983836650848389} 11/07/2021 15:11:16 - INFO - __main__ - Step 127557: {'lr': 2.7839201696359866e-05, 'samples': 24490944, 'steps': 127556, 'loss/train': 0.8641786575317383} 11/07/2021 15:11:16 - INFO - __main__ - Step 127558: {'lr': 2.783676807741589e-05, 'samples': 24491136, 'steps': 127557, 'loss/train': 1.4652292728424072} 11/07/2021 15:11:17 - INFO - __main__ - Step 127559: {'lr': 2.7834334558575232e-05, 'samples': 24491328, 'steps': 127558, 'loss/train': 1.2693192958831787} 11/07/2021 15:11:17 - INFO - __main__ - Step 127560: {'lr': 2.7831901139839024e-05, 'samples': 24491520, 'steps': 127559, 'loss/train': 1.4140310287475586} 11/07/2021 15:11:18 - INFO - __main__ - Step 127561: {'lr': 2.7829467821208293e-05, 'samples': 24491712, 'steps': 127560, 'loss/train': 1.571945309638977} 11/07/2021 15:11:19 - INFO - __main__ - Step 127562: {'lr': 2.782703460268421e-05, 'samples': 24491904, 'steps': 127561, 'loss/train': 1.4401148557662964} 11/07/2021 15:11:19 - INFO - __main__ - Step 127563: {'lr': 2.7824601484267798e-05, 'samples': 24492096, 'steps': 127562, 'loss/train': 1.4350448846817017} 11/07/2021 15:11:19 - INFO - __main__ - Step 127564: {'lr': 2.7822168465960222e-05, 'samples': 24492288, 'steps': 127563, 'loss/train': 1.0655694007873535} 11/07/2021 15:11:20 - INFO - __main__ - Step 127565: {'lr': 2.7819735547762542e-05, 'samples': 24492480, 'steps': 127564, 'loss/train': 1.5299932956695557} 11/07/2021 15:11:20 - INFO - __main__ - Step 127566: {'lr': 2.7817302729675863e-05, 'samples': 24492672, 'steps': 127565, 'loss/train': 1.6024243831634521} 11/07/2021 15:11:21 - INFO - __main__ - Step 127567: {'lr': 2.781487001170127e-05, 'samples': 24492864, 'steps': 127566, 'loss/train': 1.4163457155227661} 11/07/2021 15:11:21 - INFO - __main__ - Step 127568: {'lr': 2.7812437393839874e-05, 'samples': 24493056, 'steps': 127567, 'loss/train': 1.475249171257019} 11/07/2021 15:11:22 - INFO - __main__ - Step 127569: {'lr': 2.781000487609281e-05, 'samples': 24493248, 'steps': 127568, 'loss/train': 0.9926286339759827} 11/07/2021 15:11:22 - INFO - __main__ - Step 127570: {'lr': 2.7807572458461077e-05, 'samples': 24493440, 'steps': 127569, 'loss/train': 1.2002114057540894} 11/07/2021 15:11:22 - INFO - __main__ - Step 127571: {'lr': 2.7805140140945844e-05, 'samples': 24493632, 'steps': 127570, 'loss/train': 0.6815919280052185} 11/07/2021 15:11:23 - INFO - __main__ - Step 127572: {'lr': 2.7802707923548164e-05, 'samples': 24493824, 'steps': 127571, 'loss/train': 1.3210537433624268} 11/07/2021 15:11:24 - INFO - __main__ - Step 127573: {'lr': 2.7800275806269175e-05, 'samples': 24494016, 'steps': 127572, 'loss/train': 1.2742812633514404} 11/07/2021 15:11:24 - INFO - __main__ - Step 127574: {'lr': 2.7797843789109932e-05, 'samples': 24494208, 'steps': 127573, 'loss/train': 1.4294607639312744} 11/07/2021 15:11:24 - INFO - __main__ - Step 127575: {'lr': 2.7795411872071575e-05, 'samples': 24494400, 'steps': 127574, 'loss/train': 1.151281476020813} 11/07/2021 15:11:25 - INFO - __main__ - Step 127576: {'lr': 2.7792980055155155e-05, 'samples': 24494592, 'steps': 127575, 'loss/train': 0.8588740825653076} 11/07/2021 15:11:25 - INFO - __main__ - Step 127577: {'lr': 2.779054833836181e-05, 'samples': 24494784, 'steps': 127576, 'loss/train': 1.3418885469436646} 11/07/2021 15:11:26 - INFO - __main__ - Step 127578: {'lr': 2.7788116721692596e-05, 'samples': 24494976, 'steps': 127577, 'loss/train': 1.1042076349258423} 11/07/2021 15:11:27 - INFO - __main__ - Step 127579: {'lr': 2.7785685205148625e-05, 'samples': 24495168, 'steps': 127578, 'loss/train': 0.4140263795852661} 11/07/2021 15:11:27 - INFO - __main__ - Step 127580: {'lr': 2.7783253788731006e-05, 'samples': 24495360, 'steps': 127579, 'loss/train': 1.2774561643600464} 11/07/2021 15:11:27 - INFO - __main__ - Step 127581: {'lr': 2.778082247244082e-05, 'samples': 24495552, 'steps': 127580, 'loss/train': 1.2542091608047485} 11/07/2021 15:11:28 - INFO - __main__ - Step 127582: {'lr': 2.7778391256279207e-05, 'samples': 24495744, 'steps': 127581, 'loss/train': 1.2981072664260864} 11/07/2021 15:11:29 - INFO - __main__ - Step 127583: {'lr': 2.7775960140247193e-05, 'samples': 24495936, 'steps': 127582, 'loss/train': 1.3742226362228394} 11/07/2021 15:11:29 - INFO - __main__ - Step 127584: {'lr': 2.7773529124345887e-05, 'samples': 24496128, 'steps': 127583, 'loss/train': 1.2385027408599854} 11/07/2021 15:11:29 - INFO - __main__ - Step 127585: {'lr': 2.7771098208576402e-05, 'samples': 24496320, 'steps': 127584, 'loss/train': 1.1530208587646484} 11/07/2021 15:11:30 - INFO - __main__ - Step 127586: {'lr': 2.7768667392939845e-05, 'samples': 24496512, 'steps': 127585, 'loss/train': 1.671730399131775} 11/07/2021 15:11:30 - INFO - __main__ - Step 127587: {'lr': 2.7766236677437273e-05, 'samples': 24496704, 'steps': 127586, 'loss/train': 1.0529272556304932} 11/07/2021 15:11:31 - INFO - __main__ - Step 127588: {'lr': 2.7763806062069825e-05, 'samples': 24496896, 'steps': 127587, 'loss/train': 0.6753624677658081} 11/07/2021 15:11:31 - INFO - __main__ - Step 127589: {'lr': 2.776137554683858e-05, 'samples': 24497088, 'steps': 127588, 'loss/train': 1.2817531824111938} 11/07/2021 15:11:32 - INFO - __main__ - Step 127590: {'lr': 2.7758945131744624e-05, 'samples': 24497280, 'steps': 127589, 'loss/train': 1.3592911958694458} 11/07/2021 15:11:32 - INFO - __main__ - Step 127591: {'lr': 2.7756514816789035e-05, 'samples': 24497472, 'steps': 127590, 'loss/train': 1.2478893995285034} 11/07/2021 15:11:32 - INFO - __main__ - Step 127592: {'lr': 2.7754084601972955e-05, 'samples': 24497664, 'steps': 127591, 'loss/train': 1.393498182296753} 11/07/2021 15:11:34 - INFO - __main__ - Step 127593: {'lr': 2.7751654487297466e-05, 'samples': 24497856, 'steps': 127592, 'loss/train': 1.103808045387268} 11/07/2021 15:11:34 - INFO - __main__ - Step 127594: {'lr': 2.7749224472763678e-05, 'samples': 24498048, 'steps': 127593, 'loss/train': 1.4540926218032837} 11/07/2021 15:11:34 - INFO - __main__ - Step 127595: {'lr': 2.7746794558372617e-05, 'samples': 24498240, 'steps': 127594, 'loss/train': 1.4534729719161987} 11/07/2021 15:11:35 - INFO - __main__ - Step 127596: {'lr': 2.7744364744125423e-05, 'samples': 24498432, 'steps': 127595, 'loss/train': 1.5037925243377686} 11/07/2021 15:11:35 - INFO - __main__ - Step 127597: {'lr': 2.7741935030023173e-05, 'samples': 24498624, 'steps': 127596, 'loss/train': 1.4845612049102783} 11/07/2021 15:11:35 - INFO - __main__ - Step 127598: {'lr': 2.7739505416067013e-05, 'samples': 24498816, 'steps': 127597, 'loss/train': 1.2537094354629517} 11/07/2021 15:11:36 - INFO - __main__ - Step 127599: {'lr': 2.7737075902257965e-05, 'samples': 24499008, 'steps': 127598, 'loss/train': 1.5340875387191772} 11/07/2021 15:11:37 - INFO - __main__ - Step 127600: {'lr': 2.7734646488597193e-05, 'samples': 24499200, 'steps': 127599, 'loss/train': 1.2069358825683594} 11/07/2021 15:11:37 - INFO - __main__ - Step 127601: {'lr': 2.7732217175085727e-05, 'samples': 24499392, 'steps': 127600, 'loss/train': 1.2763938903808594} 11/07/2021 15:11:37 - INFO - __main__ - Step 127602: {'lr': 2.7729787961724706e-05, 'samples': 24499584, 'steps': 127601, 'loss/train': 2.1035709381103516} 11/07/2021 15:11:38 - INFO - __main__ - Step 127603: {'lr': 2.7727358848515238e-05, 'samples': 24499776, 'steps': 127602, 'loss/train': 0.9815565943717957} 11/07/2021 15:11:39 - INFO - __main__ - Step 127604: {'lr': 2.7724929835458353e-05, 'samples': 24499968, 'steps': 127603, 'loss/train': 0.9220651984214783} 11/07/2021 15:11:39 - INFO - __main__ - Step 127605: {'lr': 2.7722500922555266e-05, 'samples': 24500160, 'steps': 127604, 'loss/train': 1.3508926630020142} 11/07/2021 15:11:40 - INFO - __main__ - Step 127606: {'lr': 2.7720072109806928e-05, 'samples': 24500352, 'steps': 127605, 'loss/train': 1.278590202331543} 11/07/2021 15:11:40 - INFO - __main__ - Step 127607: {'lr': 2.77176433972145e-05, 'samples': 24500544, 'steps': 127606, 'loss/train': 1.2217016220092773} 11/07/2021 15:11:40 - INFO - __main__ - Step 127608: {'lr': 2.7715214784779065e-05, 'samples': 24500736, 'steps': 127607, 'loss/train': 1.4100183248519897} 11/07/2021 15:11:41 - INFO - __main__ - Step 127609: {'lr': 2.7712786272501705e-05, 'samples': 24500928, 'steps': 127608, 'loss/train': 1.2507867813110352} 11/07/2021 15:11:42 - INFO - __main__ - Step 127610: {'lr': 2.7710357860383563e-05, 'samples': 24501120, 'steps': 127609, 'loss/train': 1.4685240983963013} 11/07/2021 15:11:42 - INFO - __main__ - Step 127611: {'lr': 2.7707929548425687e-05, 'samples': 24501312, 'steps': 127610, 'loss/train': 0.8632720112800598} 11/07/2021 15:11:42 - INFO - __main__ - Step 127612: {'lr': 2.770550133662919e-05, 'samples': 24501504, 'steps': 127611, 'loss/train': 1.3020684719085693} 11/07/2021 15:11:43 - INFO - __main__ - Step 127613: {'lr': 2.7703073224995185e-05, 'samples': 24501696, 'steps': 127612, 'loss/train': 1.0439296960830688} 11/07/2021 15:11:44 - INFO - __main__ - Step 127614: {'lr': 2.770064521352472e-05, 'samples': 24501888, 'steps': 127613, 'loss/train': 1.1658008098602295} 11/07/2021 15:11:44 - INFO - __main__ - Step 127615: {'lr': 2.769821730221894e-05, 'samples': 24502080, 'steps': 127614, 'loss/train': 1.5085628032684326} 11/07/2021 15:11:44 - INFO - __main__ - Step 127616: {'lr': 2.7695789491078925e-05, 'samples': 24502272, 'steps': 127615, 'loss/train': 1.5565720796585083} 11/07/2021 15:11:45 - INFO - __main__ - Step 127617: {'lr': 2.769336178010573e-05, 'samples': 24502464, 'steps': 127616, 'loss/train': 1.166731834411621} 11/07/2021 15:11:45 - INFO - __main__ - Step 127618: {'lr': 2.7690934169300493e-05, 'samples': 24502656, 'steps': 127617, 'loss/train': 1.4063289165496826} 11/07/2021 15:11:46 - INFO - __main__ - Step 127619: {'lr': 2.7688506658664266e-05, 'samples': 24502848, 'steps': 127618, 'loss/train': 1.0387715101242065} 11/07/2021 15:11:47 - INFO - __main__ - Step 127620: {'lr': 2.768607924819816e-05, 'samples': 24503040, 'steps': 127619, 'loss/train': 0.38692963123321533} 11/07/2021 15:11:47 - INFO - __main__ - Step 127621: {'lr': 2.7683651937903285e-05, 'samples': 24503232, 'steps': 127620, 'loss/train': 1.139756441116333} 11/07/2021 15:11:47 - INFO - __main__ - Step 127622: {'lr': 2.7681224727780728e-05, 'samples': 24503424, 'steps': 127621, 'loss/train': 1.7144947052001953} 11/07/2021 15:11:48 - INFO - __main__ - Step 127623: {'lr': 2.7678797617831597e-05, 'samples': 24503616, 'steps': 127622, 'loss/train': 1.1436837911605835} 11/07/2021 15:11:49 - INFO - __main__ - Step 127624: {'lr': 2.7676370608056946e-05, 'samples': 24503808, 'steps': 127623, 'loss/train': 1.2606714963912964} 11/07/2021 15:11:49 - INFO - __main__ - Step 127625: {'lr': 2.767394369845791e-05, 'samples': 24504000, 'steps': 127624, 'loss/train': 1.3396283388137817} 11/07/2021 15:11:49 - INFO - __main__ - Step 127626: {'lr': 2.76715168890356e-05, 'samples': 24504192, 'steps': 127625, 'loss/train': 1.2120492458343506} 11/07/2021 15:11:50 - INFO - __main__ - Step 127627: {'lr': 2.7669090179791022e-05, 'samples': 24504384, 'steps': 127626, 'loss/train': 0.9471668004989624} 11/07/2021 15:11:50 - INFO - __main__ - Step 127628: {'lr': 2.7666663570725338e-05, 'samples': 24504576, 'steps': 127627, 'loss/train': 1.1596746444702148} 11/07/2021 15:11:51 - INFO - __main__ - Step 127629: {'lr': 2.7664237061839625e-05, 'samples': 24504768, 'steps': 127628, 'loss/train': 1.241958737373352} 11/07/2021 15:11:51 - INFO - __main__ - Step 127630: {'lr': 2.7661810653134943e-05, 'samples': 24504960, 'steps': 127629, 'loss/train': 1.2535244226455688} 11/07/2021 15:11:52 - INFO - __main__ - Step 127631: {'lr': 2.765938434461246e-05, 'samples': 24505152, 'steps': 127630, 'loss/train': 1.177711009979248} 11/07/2021 15:11:52 - INFO - __main__ - Step 127632: {'lr': 2.7656958136273196e-05, 'samples': 24505344, 'steps': 127631, 'loss/train': 1.1921976804733276} 11/07/2021 15:11:53 - INFO - __main__ - Step 127633: {'lr': 2.7654532028118294e-05, 'samples': 24505536, 'steps': 127632, 'loss/train': 1.306778907775879} 11/07/2021 15:11:53 - INFO - __main__ - Step 127634: {'lr': 2.765210602014881e-05, 'samples': 24505728, 'steps': 127633, 'loss/train': 1.4215668439865112} 11/07/2021 15:11:54 - INFO - __main__ - Step 127635: {'lr': 2.7649680112365875e-05, 'samples': 24505920, 'steps': 127634, 'loss/train': 0.8377905488014221} 11/07/2021 15:11:54 - INFO - __main__ - Step 127636: {'lr': 2.764725430477055e-05, 'samples': 24506112, 'steps': 127635, 'loss/train': 1.113342523574829} 11/07/2021 15:11:55 - INFO - __main__ - Step 127637: {'lr': 2.7644828597364003e-05, 'samples': 24506304, 'steps': 127636, 'loss/train': 1.9924238920211792} 11/07/2021 15:11:55 - INFO - __main__ - Step 127638: {'lr': 2.7642402990147224e-05, 'samples': 24506496, 'steps': 127637, 'loss/train': 1.463303804397583} 11/07/2021 15:11:56 - INFO - __main__ - Step 127639: {'lr': 2.7639977483121332e-05, 'samples': 24506688, 'steps': 127638, 'loss/train': 1.2147283554077148} 11/07/2021 15:11:56 - INFO - __main__ - Step 127640: {'lr': 2.7637552076287433e-05, 'samples': 24506880, 'steps': 127639, 'loss/train': 1.2359539270401} 11/07/2021 15:11:57 - INFO - __main__ - Step 127641: {'lr': 2.7635126769646608e-05, 'samples': 24507072, 'steps': 127640, 'loss/train': 1.5184600353240967} 11/07/2021 15:11:57 - INFO - __main__ - Step 127642: {'lr': 2.7632701563199996e-05, 'samples': 24507264, 'steps': 127641, 'loss/train': 1.2188377380371094} 11/07/2021 15:11:57 - INFO - __main__ - Step 127643: {'lr': 2.7630276456948627e-05, 'samples': 24507456, 'steps': 127642, 'loss/train': 1.8765759468078613} 11/07/2021 15:11:58 - INFO - __main__ - Step 127644: {'lr': 2.762785145089364e-05, 'samples': 24507648, 'steps': 127643, 'loss/train': 0.8693525195121765} 11/07/2021 15:11:59 - INFO - __main__ - Step 127645: {'lr': 2.762542654503611e-05, 'samples': 24507840, 'steps': 127644, 'loss/train': 1.4078489542007446} 11/07/2021 15:11:59 - INFO - __main__ - Step 127646: {'lr': 2.7623001739377153e-05, 'samples': 24508032, 'steps': 127645, 'loss/train': 1.5322017669677734} 11/07/2021 15:11:59 - INFO - __main__ - Step 127647: {'lr': 2.7620577033917793e-05, 'samples': 24508224, 'steps': 127646, 'loss/train': 1.2569177150726318} 11/07/2021 15:12:00 - INFO - __main__ - Step 127648: {'lr': 2.7618152428659198e-05, 'samples': 24508416, 'steps': 127647, 'loss/train': 0.738260805606842} 11/07/2021 15:12:00 - INFO - __main__ - Step 127649: {'lr': 2.7615727923602423e-05, 'samples': 24508608, 'steps': 127648, 'loss/train': 1.271349310874939} 11/07/2021 15:12:01 - INFO - __main__ - Step 127650: {'lr': 2.7613303518748632e-05, 'samples': 24508800, 'steps': 127649, 'loss/train': 0.9364190101623535} 11/07/2021 15:12:02 - INFO - __main__ - Step 127651: {'lr': 2.76108792140988e-05, 'samples': 24508992, 'steps': 127650, 'loss/train': 1.3256328105926514} 11/07/2021 15:12:02 - INFO - __main__ - Step 127652: {'lr': 2.7608455009654087e-05, 'samples': 24509184, 'steps': 127651, 'loss/train': 1.115069031715393} 11/07/2021 15:12:02 - INFO - __main__ - Step 127653: {'lr': 2.7606030905415552e-05, 'samples': 24509376, 'steps': 127652, 'loss/train': 1.242209792137146} 11/07/2021 15:12:03 - INFO - __main__ - Step 127654: {'lr': 2.7603606901384305e-05, 'samples': 24509568, 'steps': 127653, 'loss/train': 1.391237735748291} 11/07/2021 15:12:04 - INFO - __main__ - Step 127655: {'lr': 2.7601182997561453e-05, 'samples': 24509760, 'steps': 127654, 'loss/train': 1.1736366748809814} 11/07/2021 15:12:04 - INFO - __main__ - Step 127656: {'lr': 2.759875919394808e-05, 'samples': 24509952, 'steps': 127655, 'loss/train': 1.3174986839294434} 11/07/2021 15:12:04 - INFO - __main__ - Step 127657: {'lr': 2.759633549054527e-05, 'samples': 24510144, 'steps': 127656, 'loss/train': 0.8256440758705139} 11/07/2021 15:12:05 - INFO - __main__ - Step 127658: {'lr': 2.7593911887354108e-05, 'samples': 24510336, 'steps': 127657, 'loss/train': 1.0023162364959717} 11/07/2021 15:12:05 - INFO - __main__ - Step 127659: {'lr': 2.7591488384375697e-05, 'samples': 24510528, 'steps': 127658, 'loss/train': 1.2319239377975464} 11/07/2021 15:12:06 - INFO - __main__ - Step 127660: {'lr': 2.758906498161115e-05, 'samples': 24510720, 'steps': 127659, 'loss/train': 1.3013054132461548} 11/07/2021 15:12:06 - INFO - __main__ - Step 127661: {'lr': 2.7586641679061526e-05, 'samples': 24510912, 'steps': 127660, 'loss/train': 1.1508582830429077} 11/07/2021 15:12:07 - INFO - __main__ - Step 127662: {'lr': 2.758421847672793e-05, 'samples': 24511104, 'steps': 127661, 'loss/train': 1.675915002822876} 11/07/2021 15:12:07 - INFO - __main__ - Step 127663: {'lr': 2.7581795374611502e-05, 'samples': 24511296, 'steps': 127662, 'loss/train': 1.1463253498077393} 11/07/2021 15:12:08 - INFO - __main__ - Step 127664: {'lr': 2.7579372372713242e-05, 'samples': 24511488, 'steps': 127663, 'loss/train': 1.283659815788269} 11/07/2021 15:12:09 - INFO - __main__ - Step 127665: {'lr': 2.7576949471034257e-05, 'samples': 24511680, 'steps': 127664, 'loss/train': 1.4788169860839844} 11/07/2021 15:12:09 - INFO - __main__ - Step 127666: {'lr': 2.757452666957569e-05, 'samples': 24511872, 'steps': 127665, 'loss/train': 1.5264595746994019} 11/07/2021 15:12:09 - INFO - __main__ - Step 127667: {'lr': 2.7572103968338617e-05, 'samples': 24512064, 'steps': 127666, 'loss/train': 1.6079368591308594} 11/07/2021 15:12:10 - INFO - __main__ - Step 127668: {'lr': 2.75696813673241e-05, 'samples': 24512256, 'steps': 127667, 'loss/train': 1.0046286582946777} 11/07/2021 15:12:10 - INFO - __main__ - Step 127669: {'lr': 2.7567258866533273e-05, 'samples': 24512448, 'steps': 127668, 'loss/train': 1.3659104108810425} 11/07/2021 15:12:11 - INFO - __main__ - Step 127670: {'lr': 2.7564836465967193e-05, 'samples': 24512640, 'steps': 127669, 'loss/train': 1.1189475059509277} 11/07/2021 15:12:11 - INFO - __main__ - Step 127671: {'lr': 2.7562414165626963e-05, 'samples': 24512832, 'steps': 127670, 'loss/train': 0.6412570476531982} 11/07/2021 15:12:12 - INFO - __main__ - Step 127672: {'lr': 2.7559991965513703e-05, 'samples': 24513024, 'steps': 127671, 'loss/train': 0.2894483208656311} 11/07/2021 15:12:12 - INFO - __main__ - Step 127673: {'lr': 2.7557569865628435e-05, 'samples': 24513216, 'steps': 127672, 'loss/train': 1.3568918704986572} 11/07/2021 15:12:12 - INFO - __main__ - Step 127674: {'lr': 2.7555147865972324e-05, 'samples': 24513408, 'steps': 127673, 'loss/train': 1.1895171403884888} 11/07/2021 15:12:14 - INFO - __main__ - Step 127675: {'lr': 2.7552725966546426e-05, 'samples': 24513600, 'steps': 127674, 'loss/train': 1.2237001657485962} 11/07/2021 15:12:14 - INFO - __main__ - Step 127676: {'lr': 2.755030416735188e-05, 'samples': 24513792, 'steps': 127675, 'loss/train': 0.8974683880805969} 11/07/2021 15:12:14 - INFO - __main__ - Step 127677: {'lr': 2.7547882468389686e-05, 'samples': 24513984, 'steps': 127676, 'loss/train': 0.869926929473877} 11/07/2021 15:12:15 - INFO - __main__ - Step 127678: {'lr': 2.7545460869661005e-05, 'samples': 24514176, 'steps': 127677, 'loss/train': 1.2044134140014648} 11/07/2021 15:12:15 - INFO - __main__ - Step 127679: {'lr': 2.7543039371166867e-05, 'samples': 24514368, 'steps': 127678, 'loss/train': 1.0832297801971436} 11/07/2021 15:12:15 - INFO - __main__ - Step 127680: {'lr': 2.754061797290844e-05, 'samples': 24514560, 'steps': 127679, 'loss/train': 1.3451311588287354} 11/07/2021 15:12:16 - INFO - __main__ - Step 127681: {'lr': 2.7538196674886744e-05, 'samples': 24514752, 'steps': 127680, 'loss/train': 1.7107430696487427} 11/07/2021 15:12:17 - INFO - __main__ - Step 127682: {'lr': 2.7535775477102925e-05, 'samples': 24514944, 'steps': 127681, 'loss/train': 1.303866982460022} 11/07/2021 15:12:17 - INFO - __main__ - Step 127683: {'lr': 2.7533354379558063e-05, 'samples': 24515136, 'steps': 127682, 'loss/train': 1.2726372480392456} 11/07/2021 15:12:17 - INFO - __main__ - Step 127684: {'lr': 2.753093338225321e-05, 'samples': 24515328, 'steps': 127683, 'loss/train': 1.059308409690857} 11/07/2021 15:12:18 - INFO - __main__ - Step 127685: {'lr': 2.7528512485189507e-05, 'samples': 24515520, 'steps': 127684, 'loss/train': 0.9849502444267273} 11/07/2021 15:12:19 - INFO - __main__ - Step 127686: {'lr': 2.7526091688368033e-05, 'samples': 24515712, 'steps': 127685, 'loss/train': 1.3076441287994385} 11/07/2021 15:12:19 - INFO - __main__ - Step 127687: {'lr': 2.7523670991789845e-05, 'samples': 24515904, 'steps': 127686, 'loss/train': 1.2760366201400757} 11/07/2021 15:12:19 - INFO - __main__ - Step 127688: {'lr': 2.7521250395456054e-05, 'samples': 24516096, 'steps': 127687, 'loss/train': 1.1187312602996826} 11/07/2021 15:12:20 - INFO - __main__ - Step 127689: {'lr': 2.751882989936777e-05, 'samples': 24516288, 'steps': 127688, 'loss/train': 1.3886299133300781} 11/07/2021 15:12:20 - INFO - __main__ - Step 127690: {'lr': 2.751640950352613e-05, 'samples': 24516480, 'steps': 127689, 'loss/train': 1.0849448442459106} 11/07/2021 15:12:21 - INFO - __main__ - Step 127691: {'lr': 2.7513989207932078e-05, 'samples': 24516672, 'steps': 127690, 'loss/train': 1.2612040042877197} 11/07/2021 15:12:22 - INFO - __main__ - Step 127692: {'lr': 2.7511569012586806e-05, 'samples': 24516864, 'steps': 127691, 'loss/train': 1.361432671546936} 11/07/2021 15:12:22 - INFO - __main__ - Step 127693: {'lr': 2.75091489174914e-05, 'samples': 24517056, 'steps': 127692, 'loss/train': 1.2589677572250366} 11/07/2021 15:12:22 - INFO - __main__ - Step 127694: {'lr': 2.750672892264694e-05, 'samples': 24517248, 'steps': 127693, 'loss/train': 1.230563759803772} 11/07/2021 15:12:23 - INFO - __main__ - Step 127695: {'lr': 2.750430902805448e-05, 'samples': 24517440, 'steps': 127694, 'loss/train': 1.2824143171310425} 11/07/2021 15:12:24 - INFO - __main__ - Step 127696: {'lr': 2.750188923371519e-05, 'samples': 24517632, 'steps': 127695, 'loss/train': 1.4549864530563354} 11/07/2021 15:12:25 - INFO - __main__ - Step 127697: {'lr': 2.749946953963009e-05, 'samples': 24517824, 'steps': 127696, 'loss/train': 1.5325369834899902} 11/07/2021 15:12:25 - INFO - __main__ - Step 127698: {'lr': 2.7497049945800294e-05, 'samples': 24518016, 'steps': 127697, 'loss/train': 1.2741007804870605} 11/07/2021 15:12:25 - INFO - __main__ - Step 127699: {'lr': 2.7494630452226887e-05, 'samples': 24518208, 'steps': 127698, 'loss/train': 1.086700439453125} 11/07/2021 15:12:26 - INFO - __main__ - Step 127700: {'lr': 2.7492211058910976e-05, 'samples': 24518400, 'steps': 127699, 'loss/train': 1.7579199075698853} 11/07/2021 15:12:26 - INFO - __main__ - Step 127701: {'lr': 2.7489791765853646e-05, 'samples': 24518592, 'steps': 127700, 'loss/train': 0.07377536594867706} 11/07/2021 15:12:27 - INFO - __main__ - Step 127702: {'lr': 2.7487372573056e-05, 'samples': 24518784, 'steps': 127701, 'loss/train': 1.2415825128555298} 11/07/2021 15:12:27 - INFO - __main__ - Step 127703: {'lr': 2.748495348051913e-05, 'samples': 24518976, 'steps': 127702, 'loss/train': 1.5559585094451904} 11/07/2021 15:12:28 - INFO - __main__ - Step 127704: {'lr': 2.748253448824406e-05, 'samples': 24519168, 'steps': 127703, 'loss/train': 1.1112583875656128} 11/07/2021 15:12:28 - INFO - __main__ - Step 127705: {'lr': 2.7480115596231952e-05, 'samples': 24519360, 'steps': 127704, 'loss/train': 0.828424334526062} 11/07/2021 15:12:28 - INFO - __main__ - Step 127706: {'lr': 2.7477696804483838e-05, 'samples': 24519552, 'steps': 127705, 'loss/train': 1.269045114517212} 11/07/2021 15:12:29 - INFO - __main__ - Step 127707: {'lr': 2.747527811300085e-05, 'samples': 24519744, 'steps': 127706, 'loss/train': 1.8104618787765503} 11/07/2021 15:12:30 - INFO - __main__ - Step 127708: {'lr': 2.7472859521784076e-05, 'samples': 24519936, 'steps': 127707, 'loss/train': 0.8833493590354919} 11/07/2021 15:12:30 - INFO - __main__ - Step 127709: {'lr': 2.7470441030834597e-05, 'samples': 24520128, 'steps': 127708, 'loss/train': 0.8102421760559082} 11/07/2021 15:12:30 - INFO - __main__ - Step 127710: {'lr': 2.7468022640153494e-05, 'samples': 24520320, 'steps': 127709, 'loss/train': 1.4358564615249634} 11/07/2021 15:12:31 - INFO - __main__ - Step 127711: {'lr': 2.746560434974188e-05, 'samples': 24520512, 'steps': 127710, 'loss/train': 1.2890567779541016} 11/07/2021 15:12:32 - INFO - __main__ - Step 127712: {'lr': 2.7463186159600807e-05, 'samples': 24520704, 'steps': 127711, 'loss/train': 0.6705204248428345} 11/07/2021 15:12:32 - INFO - __main__ - Step 127713: {'lr': 2.7460768069731414e-05, 'samples': 24520896, 'steps': 127712, 'loss/train': 1.625483512878418} 11/07/2021 15:12:33 - INFO - __main__ - Step 127714: {'lr': 2.7458350080134753e-05, 'samples': 24521088, 'steps': 127713, 'loss/train': 1.2771679162979126} 11/07/2021 15:12:33 - INFO - __main__ - Step 127715: {'lr': 2.745593219081191e-05, 'samples': 24521280, 'steps': 127714, 'loss/train': 1.1075924634933472} 11/07/2021 15:12:33 - INFO - __main__ - Step 127716: {'lr': 2.7453514401764023e-05, 'samples': 24521472, 'steps': 127715, 'loss/train': 1.4031769037246704} 11/07/2021 15:12:34 - INFO - __main__ - Step 127717: {'lr': 2.7451096712992173e-05, 'samples': 24521664, 'steps': 127716, 'loss/train': 1.3057433366775513} 11/07/2021 15:12:35 - INFO - __main__ - Step 127718: {'lr': 2.744867912449739e-05, 'samples': 24521856, 'steps': 127717, 'loss/train': 1.1947706937789917} 11/07/2021 15:12:35 - INFO - __main__ - Step 127719: {'lr': 2.7446261636280777e-05, 'samples': 24522048, 'steps': 127718, 'loss/train': 1.400376796722412} 11/07/2021 15:12:35 - INFO - __main__ - Step 127720: {'lr': 2.744384424834348e-05, 'samples': 24522240, 'steps': 127719, 'loss/train': 1.6798920631408691} 11/07/2021 15:12:36 - INFO - __main__ - Step 127721: {'lr': 2.7441426960686523e-05, 'samples': 24522432, 'steps': 127720, 'loss/train': 1.5804307460784912} 11/07/2021 15:12:37 - INFO - __main__ - Step 127722: {'lr': 2.7439009773311042e-05, 'samples': 24522624, 'steps': 127721, 'loss/train': 1.7710202932357788} 11/07/2021 15:12:37 - INFO - __main__ - Step 127723: {'lr': 2.7436592686218093e-05, 'samples': 24522816, 'steps': 127722, 'loss/train': 1.2623783349990845} 11/07/2021 15:12:37 - INFO - __main__ - Step 127724: {'lr': 2.7434175699408786e-05, 'samples': 24523008, 'steps': 127723, 'loss/train': 0.9365987181663513} 11/07/2021 15:12:38 - INFO - __main__ - Step 127725: {'lr': 2.7431758812884206e-05, 'samples': 24523200, 'steps': 127724, 'loss/train': 1.0633633136749268} 11/07/2021 15:12:38 - INFO - __main__ - Step 127726: {'lr': 2.742934202664543e-05, 'samples': 24523392, 'steps': 127725, 'loss/train': 1.1461989879608154} 11/07/2021 15:12:38 - INFO - __main__ - Step 127727: {'lr': 2.7426925340693577e-05, 'samples': 24523584, 'steps': 127726, 'loss/train': 1.3681161403656006} 11/07/2021 15:12:39 - INFO - __main__ - Step 127728: {'lr': 2.742450875502972e-05, 'samples': 24523776, 'steps': 127727, 'loss/train': 1.4688551425933838} 11/07/2021 15:12:40 - INFO - __main__ - Step 127729: {'lr': 2.742209226965492e-05, 'samples': 24523968, 'steps': 127728, 'loss/train': 1.0339852571487427} 11/07/2021 15:12:40 - INFO - __main__ - Step 127730: {'lr': 2.7419675884570367e-05, 'samples': 24524160, 'steps': 127729, 'loss/train': 1.1004178524017334} 11/07/2021 15:12:41 - INFO - __main__ - Step 127731: {'lr': 2.7417259599777007e-05, 'samples': 24524352, 'steps': 127730, 'loss/train': 1.4403327703475952} 11/07/2021 15:12:41 - INFO - __main__ - Step 127732: {'lr': 2.7414843415276003e-05, 'samples': 24524544, 'steps': 127731, 'loss/train': 1.3097058534622192} 11/07/2021 15:12:42 - INFO - __main__ - Step 127733: {'lr': 2.741242733106844e-05, 'samples': 24524736, 'steps': 127732, 'loss/train': 1.3519260883331299} 11/07/2021 15:12:42 - INFO - __main__ - Step 127734: {'lr': 2.7410011347155373e-05, 'samples': 24524928, 'steps': 127733, 'loss/train': 0.9006853699684143} 11/07/2021 15:12:43 - INFO - __main__ - Step 127735: {'lr': 2.7407595463537965e-05, 'samples': 24525120, 'steps': 127734, 'loss/train': 1.108486294746399} 11/07/2021 15:12:43 - INFO - __main__ - Step 127736: {'lr': 2.7405179680217217e-05, 'samples': 24525312, 'steps': 127735, 'loss/train': 1.2907674312591553} 11/07/2021 15:12:43 - INFO - __main__ - Step 127737: {'lr': 2.7402763997194298e-05, 'samples': 24525504, 'steps': 127736, 'loss/train': 1.27446711063385} 11/07/2021 15:12:44 - INFO - __main__ - Step 127738: {'lr': 2.7400348414470227e-05, 'samples': 24525696, 'steps': 127737, 'loss/train': 0.987923264503479} 11/07/2021 15:12:45 - INFO - __main__ - Step 127739: {'lr': 2.739793293204615e-05, 'samples': 24525888, 'steps': 127738, 'loss/train': 0.9311794638633728} 11/07/2021 15:12:45 - INFO - __main__ - Step 127740: {'lr': 2.7395517549923116e-05, 'samples': 24526080, 'steps': 127739, 'loss/train': 0.9463788866996765} 11/07/2021 15:12:45 - INFO - __main__ - Step 127741: {'lr': 2.739310226810221e-05, 'samples': 24526272, 'steps': 127740, 'loss/train': 1.3258681297302246} 11/07/2021 15:12:46 - INFO - __main__ - Step 127742: {'lr': 2.739068708658457e-05, 'samples': 24526464, 'steps': 127741, 'loss/train': 1.0931330919265747} 11/07/2021 15:12:47 - INFO - __main__ - Step 127743: {'lr': 2.7388272005371222e-05, 'samples': 24526656, 'steps': 127742, 'loss/train': 1.547873854637146} 11/07/2021 15:12:47 - INFO - __main__ - Step 127744: {'lr': 2.7385857024463362e-05, 'samples': 24526848, 'steps': 127743, 'loss/train': 1.0202267169952393} 11/07/2021 15:12:47 - INFO - __main__ - Step 127745: {'lr': 2.738344214386193e-05, 'samples': 24527040, 'steps': 127744, 'loss/train': 0.9540729522705078} 11/07/2021 15:12:48 - INFO - __main__ - Step 127746: {'lr': 2.7381027363568094e-05, 'samples': 24527232, 'steps': 127745, 'loss/train': 0.8957424163818359} 11/07/2021 15:12:48 - INFO - __main__ - Step 127747: {'lr': 2.7378612683582936e-05, 'samples': 24527424, 'steps': 127746, 'loss/train': 1.3693431615829468} 11/07/2021 15:12:49 - INFO - __main__ - Step 127748: {'lr': 2.7376198103907512e-05, 'samples': 24527616, 'steps': 127747, 'loss/train': 1.4956210851669312} 11/07/2021 15:12:49 - INFO - __main__ - Step 127749: {'lr': 2.7373783624542958e-05, 'samples': 24527808, 'steps': 127748, 'loss/train': 1.5547103881835938} 11/07/2021 15:12:50 - INFO - __main__ - Step 127750: {'lr': 2.7371369245490357e-05, 'samples': 24528000, 'steps': 127749, 'loss/train': 1.4492086172103882} 11/07/2021 15:12:50 - INFO - __main__ - Step 127751: {'lr': 2.7368954966750764e-05, 'samples': 24528192, 'steps': 127750, 'loss/train': 1.2690768241882324} 11/07/2021 15:12:51 - INFO - __main__ - Step 127752: {'lr': 2.736654078832529e-05, 'samples': 24528384, 'steps': 127751, 'loss/train': 1.1253899335861206} 11/07/2021 15:12:52 - INFO - __main__ - Step 127753: {'lr': 2.7364126710215014e-05, 'samples': 24528576, 'steps': 127752, 'loss/train': 1.303969383239746} 11/07/2021 15:12:52 - INFO - __main__ - Step 127754: {'lr': 2.7361712732421023e-05, 'samples': 24528768, 'steps': 127753, 'loss/train': 1.1016967296600342} 11/07/2021 15:12:52 - INFO - __main__ - Step 127755: {'lr': 2.7359298854944396e-05, 'samples': 24528960, 'steps': 127754, 'loss/train': 1.5052309036254883} 11/07/2021 15:12:53 - INFO - __main__ - Step 127756: {'lr': 2.735688507778625e-05, 'samples': 24529152, 'steps': 127755, 'loss/train': 0.7915965914726257} 11/07/2021 15:12:53 - INFO - __main__ - Step 127757: {'lr': 2.7354471400947712e-05, 'samples': 24529344, 'steps': 127756, 'loss/train': 1.5776833295822144} 11/07/2021 15:12:53 - INFO - __main__ - Step 127758: {'lr': 2.735205782442976e-05, 'samples': 24529536, 'steps': 127757, 'loss/train': 1.4454010725021362} 11/07/2021 15:12:54 - INFO - __main__ - Step 127759: {'lr': 2.734964434823353e-05, 'samples': 24529728, 'steps': 127758, 'loss/train': 1.4768083095550537} 11/07/2021 15:12:55 - INFO - __main__ - Step 127760: {'lr': 2.7347230972360108e-05, 'samples': 24529920, 'steps': 127759, 'loss/train': 1.3327003717422485} 11/07/2021 15:12:55 - INFO - __main__ - Step 127761: {'lr': 2.7344817696810603e-05, 'samples': 24530112, 'steps': 127760, 'loss/train': 0.47301754355430603} 11/07/2021 15:12:55 - INFO - __main__ - Step 127762: {'lr': 2.7342404521586096e-05, 'samples': 24530304, 'steps': 127761, 'loss/train': 1.3599213361740112} 11/07/2021 15:12:56 - INFO - __main__ - Step 127763: {'lr': 2.733999144668764e-05, 'samples': 24530496, 'steps': 127762, 'loss/train': 1.2699495553970337} 11/07/2021 15:12:57 - INFO - __main__ - Step 127764: {'lr': 2.7337578472116348e-05, 'samples': 24530688, 'steps': 127763, 'loss/train': 1.1952643394470215} 11/07/2021 15:12:57 - INFO - __main__ - Step 127765: {'lr': 2.733516559787333e-05, 'samples': 24530880, 'steps': 127764, 'loss/train': 0.991359531879425} 11/07/2021 15:12:58 - INFO - __main__ - Step 127766: {'lr': 2.733275282395964e-05, 'samples': 24531072, 'steps': 127765, 'loss/train': 1.2997196912765503} 11/07/2021 15:12:58 - INFO - __main__ - Step 127767: {'lr': 2.733034015037636e-05, 'samples': 24531264, 'steps': 127766, 'loss/train': 1.007665753364563} 11/07/2021 15:12:58 - INFO - __main__ - Step 127768: {'lr': 2.7327927577124628e-05, 'samples': 24531456, 'steps': 127767, 'loss/train': 0.7682595252990723} 11/07/2021 15:12:59 - INFO - __main__ - Step 127769: {'lr': 2.732551510420547e-05, 'samples': 24531648, 'steps': 127768, 'loss/train': 1.301004409790039} 11/07/2021 15:13:00 - INFO - __main__ - Step 127770: {'lr': 2.7323102731620004e-05, 'samples': 24531840, 'steps': 127769, 'loss/train': 1.126265048980713} 11/07/2021 15:13:00 - INFO - __main__ - Step 127771: {'lr': 2.7320690459369356e-05, 'samples': 24532032, 'steps': 127770, 'loss/train': 1.4566909074783325} 11/07/2021 15:13:00 - INFO - __main__ - Step 127772: {'lr': 2.7318278287454534e-05, 'samples': 24532224, 'steps': 127771, 'loss/train': 1.3581982851028442} 11/07/2021 15:13:01 - INFO - __main__ - Step 127773: {'lr': 2.7315866215876644e-05, 'samples': 24532416, 'steps': 127772, 'loss/train': 1.298124074935913} 11/07/2021 15:13:02 - INFO - __main__ - Step 127774: {'lr': 2.73134542446368e-05, 'samples': 24532608, 'steps': 127773, 'loss/train': 1.0234057903289795} 11/07/2021 15:13:02 - INFO - __main__ - Step 127775: {'lr': 2.731104237373605e-05, 'samples': 24532800, 'steps': 127774, 'loss/train': 1.3301657438278198} 11/07/2021 15:13:03 - INFO - __main__ - Step 127776: {'lr': 2.730863060317554e-05, 'samples': 24532992, 'steps': 127775, 'loss/train': 1.4485955238342285} 11/07/2021 15:13:03 - INFO - __main__ - Step 127777: {'lr': 2.7306218932956317e-05, 'samples': 24533184, 'steps': 127776, 'loss/train': 2.094475030899048} 11/07/2021 15:13:03 - INFO - __main__ - Step 127778: {'lr': 2.730380736307947e-05, 'samples': 24533376, 'steps': 127777, 'loss/train': 0.7946223020553589} 11/07/2021 15:13:04 - INFO - __main__ - Step 127779: {'lr': 2.7301395893546104e-05, 'samples': 24533568, 'steps': 127778, 'loss/train': 1.0954499244689941} 11/07/2021 15:13:05 - INFO - __main__ - Step 127780: {'lr': 2.7298984524357278e-05, 'samples': 24533760, 'steps': 127779, 'loss/train': 1.5564489364624023} 11/07/2021 15:13:05 - INFO - __main__ - Step 127781: {'lr': 2.72965732555141e-05, 'samples': 24533952, 'steps': 127780, 'loss/train': 1.2663602828979492} 11/07/2021 15:13:05 - INFO - __main__ - Step 127782: {'lr': 2.7294162087017626e-05, 'samples': 24534144, 'steps': 127781, 'loss/train': 1.0716040134429932} 11/07/2021 15:13:06 - INFO - __main__ - Step 127783: {'lr': 2.729175101886899e-05, 'samples': 24534336, 'steps': 127782, 'loss/train': 1.0685806274414062} 11/07/2021 15:13:07 - INFO - __main__ - Step 127784: {'lr': 2.728934005106931e-05, 'samples': 24534528, 'steps': 127783, 'loss/train': 1.0531710386276245} 11/07/2021 15:13:07 - INFO - __main__ - Step 127785: {'lr': 2.7286929183619552e-05, 'samples': 24534720, 'steps': 127784, 'loss/train': 1.1357927322387695} 11/07/2021 15:13:07 - INFO - __main__ - Step 127786: {'lr': 2.728451841652088e-05, 'samples': 24534912, 'steps': 127785, 'loss/train': 1.136753797531128} 11/07/2021 15:13:08 - INFO - __main__ - Step 127787: {'lr': 2.7282107749774354e-05, 'samples': 24535104, 'steps': 127786, 'loss/train': 1.0260093212127686} 11/07/2021 15:13:08 - INFO - __main__ - Step 127788: {'lr': 2.727969718338108e-05, 'samples': 24535296, 'steps': 127787, 'loss/train': 1.7446209192276} 11/07/2021 15:13:08 - INFO - __main__ - Step 127789: {'lr': 2.7277286717342143e-05, 'samples': 24535488, 'steps': 127788, 'loss/train': 1.4972267150878906} 11/07/2021 15:13:10 - INFO - __main__ - Step 127790: {'lr': 2.7274876351658623e-05, 'samples': 24535680, 'steps': 127789, 'loss/train': 1.2989617586135864} 11/07/2021 15:13:10 - INFO - __main__ - Step 127791: {'lr': 2.7272466086331604e-05, 'samples': 24535872, 'steps': 127790, 'loss/train': 1.3254281282424927} 11/07/2021 15:13:10 - INFO - __main__ - Step 127792: {'lr': 2.727005592136217e-05, 'samples': 24536064, 'steps': 127791, 'loss/train': 1.1746476888656616} 11/07/2021 15:13:11 - INFO - __main__ - Step 127793: {'lr': 2.72676458567514e-05, 'samples': 24536256, 'steps': 127792, 'loss/train': 0.9726319909095764} 11/07/2021 15:13:11 - INFO - __main__ - Step 127794: {'lr': 2.7265235892500406e-05, 'samples': 24536448, 'steps': 127793, 'loss/train': 1.3436639308929443} 11/07/2021 15:13:12 - INFO - __main__ - Step 127795: {'lr': 2.7262826028610273e-05, 'samples': 24536640, 'steps': 127794, 'loss/train': 1.1161553859710693} 11/07/2021 15:13:12 - INFO - __main__ - Step 127796: {'lr': 2.726041626508205e-05, 'samples': 24536832, 'steps': 127795, 'loss/train': 1.375345230102539} 11/07/2021 15:13:13 - INFO - __main__ - Step 127797: {'lr': 2.725800660191691e-05, 'samples': 24537024, 'steps': 127796, 'loss/train': 0.3493488132953644} 11/07/2021 15:13:13 - INFO - __main__ - Step 127798: {'lr': 2.7255597039115814e-05, 'samples': 24537216, 'steps': 127797, 'loss/train': 1.2521787881851196} 11/07/2021 15:13:14 - INFO - __main__ - Step 127799: {'lr': 2.725318757667994e-05, 'samples': 24537408, 'steps': 127798, 'loss/train': 1.2393602132797241} 11/07/2021 15:13:15 - INFO - __main__ - Step 127800: {'lr': 2.7250778214610305e-05, 'samples': 24537600, 'steps': 127799, 'loss/train': 1.6361192464828491} 11/07/2021 15:13:15 - INFO - __main__ - Step 127801: {'lr': 2.7248368952908055e-05, 'samples': 24537792, 'steps': 127800, 'loss/train': 1.1743263006210327} 11/07/2021 15:13:15 - INFO - __main__ - Step 127802: {'lr': 2.724595979157424e-05, 'samples': 24537984, 'steps': 127801, 'loss/train': 1.0737690925598145} 11/07/2021 15:13:16 - INFO - __main__ - Step 127803: {'lr': 2.7243550730609967e-05, 'samples': 24538176, 'steps': 127802, 'loss/train': 1.440696358680725} 11/07/2021 15:13:16 - INFO - __main__ - Step 127804: {'lr': 2.7241141770016298e-05, 'samples': 24538368, 'steps': 127803, 'loss/train': 1.463624358177185} 11/07/2021 15:13:17 - INFO - __main__ - Step 127805: {'lr': 2.723873290979434e-05, 'samples': 24538560, 'steps': 127804, 'loss/train': 1.0356711149215698} 11/07/2021 15:13:17 - INFO - __main__ - Step 127806: {'lr': 2.7236324149945175e-05, 'samples': 24538752, 'steps': 127805, 'loss/train': 0.8853057622909546} 11/07/2021 15:13:18 - INFO - __main__ - Step 127807: {'lr': 2.7233915490469886e-05, 'samples': 24538944, 'steps': 127806, 'loss/train': 1.3186861276626587} 11/07/2021 15:13:18 - INFO - __main__ - Step 127808: {'lr': 2.7231506931369553e-05, 'samples': 24539136, 'steps': 127807, 'loss/train': 1.1919825077056885} 11/07/2021 15:13:18 - INFO - __main__ - Step 127809: {'lr': 2.7229098472645263e-05, 'samples': 24539328, 'steps': 127808, 'loss/train': 1.187421202659607} 11/07/2021 15:13:19 - INFO - __main__ - Step 127810: {'lr': 2.7226690114298125e-05, 'samples': 24539520, 'steps': 127809, 'loss/train': 1.7451289892196655} 11/07/2021 15:13:20 - INFO - __main__ - Step 127811: {'lr': 2.722428185632922e-05, 'samples': 24539712, 'steps': 127810, 'loss/train': 1.6047697067260742} 11/07/2021 15:13:20 - INFO - __main__ - Step 127812: {'lr': 2.7221873698739603e-05, 'samples': 24539904, 'steps': 127811, 'loss/train': 1.0442296266555786} 11/07/2021 15:13:20 - INFO - __main__ - Step 127813: {'lr': 2.7219465641530354e-05, 'samples': 24540096, 'steps': 127812, 'loss/train': 1.411885142326355} 11/07/2021 15:13:21 - INFO - __main__ - Step 127814: {'lr': 2.7217057684702562e-05, 'samples': 24540288, 'steps': 127813, 'loss/train': 0.7759804129600525} 11/07/2021 15:13:23 - INFO - __main__ - Step 127815: {'lr': 2.7214649828257333e-05, 'samples': 24540480, 'steps': 127814, 'loss/train': 1.4084552526474} 11/07/2021 15:13:23 - INFO - __main__ - Step 127816: {'lr': 2.7212242072195747e-05, 'samples': 24540672, 'steps': 127815, 'loss/train': 0.461764931678772} 11/07/2021 15:13:23 - INFO - __main__ - Step 127817: {'lr': 2.7209834416518892e-05, 'samples': 24540864, 'steps': 127816, 'loss/train': 0.4897724688053131} 11/07/2021 15:13:24 - INFO - __main__ - Step 127818: {'lr': 2.7207426861227845e-05, 'samples': 24541056, 'steps': 127817, 'loss/train': 1.3072932958602905} 11/07/2021 15:13:24 - INFO - __main__ - Step 127819: {'lr': 2.7205019406323694e-05, 'samples': 24541248, 'steps': 127818, 'loss/train': 1.507246732711792} 11/07/2021 15:13:24 - INFO - __main__ - Step 127820: {'lr': 2.720261205180752e-05, 'samples': 24541440, 'steps': 127819, 'loss/train': 1.6593230962753296} 11/07/2021 15:13:25 - INFO - __main__ - Step 127821: {'lr': 2.720020479768043e-05, 'samples': 24541632, 'steps': 127820, 'loss/train': 1.2970917224884033} 11/07/2021 15:13:26 - INFO - __main__ - Step 127822: {'lr': 2.7197797643943477e-05, 'samples': 24541824, 'steps': 127821, 'loss/train': 1.0906434059143066} 11/07/2021 15:13:26 - INFO - __main__ - Step 127823: {'lr': 2.719539059059775e-05, 'samples': 24542016, 'steps': 127822, 'loss/train': 1.3018841743469238} 11/07/2021 15:13:26 - INFO - __main__ - Step 127824: {'lr': 2.7192983637644386e-05, 'samples': 24542208, 'steps': 127823, 'loss/train': 1.1003859043121338} 11/07/2021 15:13:27 - INFO - __main__ - Step 127825: {'lr': 2.7190576785084408e-05, 'samples': 24542400, 'steps': 127824, 'loss/train': 1.2896645069122314} 11/07/2021 15:13:28 - INFO - __main__ - Step 127826: {'lr': 2.7188170032918875e-05, 'samples': 24542592, 'steps': 127825, 'loss/train': 0.7631842494010925} 11/07/2021 15:13:28 - INFO - __main__ - Step 127827: {'lr': 2.7185763381148948e-05, 'samples': 24542784, 'steps': 127826, 'loss/train': 0.9768900275230408} 11/07/2021 15:13:29 - INFO - __main__ - Step 127828: {'lr': 2.7183356829775658e-05, 'samples': 24542976, 'steps': 127827, 'loss/train': 1.5830949544906616} 11/07/2021 15:13:29 - INFO - __main__ - Step 127829: {'lr': 2.7180950378800113e-05, 'samples': 24543168, 'steps': 127828, 'loss/train': 0.586651086807251} 11/07/2021 15:13:29 - INFO - __main__ - Step 127830: {'lr': 2.7178544028223396e-05, 'samples': 24543360, 'steps': 127829, 'loss/train': 1.1700903177261353} 11/07/2021 15:13:30 - INFO - __main__ - Step 127831: {'lr': 2.717613777804659e-05, 'samples': 24543552, 'steps': 127830, 'loss/train': 1.2244799137115479} 11/07/2021 15:13:31 - INFO - __main__ - Step 127832: {'lr': 2.7173731628270804e-05, 'samples': 24543744, 'steps': 127831, 'loss/train': 0.9851155281066895} 11/07/2021 15:13:31 - INFO - __main__ - Step 127833: {'lr': 2.7171325578897065e-05, 'samples': 24543936, 'steps': 127832, 'loss/train': 1.0573545694351196} 11/07/2021 15:13:31 - INFO - __main__ - Step 127834: {'lr': 2.7168919629926485e-05, 'samples': 24544128, 'steps': 127833, 'loss/train': 1.5800224542617798} 11/07/2021 15:13:32 - INFO - __main__ - Step 127835: {'lr': 2.7166513781360145e-05, 'samples': 24544320, 'steps': 127834, 'loss/train': 2.432009696960449} 11/07/2021 15:13:32 - INFO - __main__ - Step 127836: {'lr': 2.716410803319916e-05, 'samples': 24544512, 'steps': 127835, 'loss/train': 1.1518923044204712} 11/07/2021 15:13:33 - INFO - __main__ - Step 127837: {'lr': 2.7161702385444575e-05, 'samples': 24544704, 'steps': 127836, 'loss/train': 1.073104739189148} 11/07/2021 15:13:34 - INFO - __main__ - Step 127838: {'lr': 2.7159296838097565e-05, 'samples': 24544896, 'steps': 127837, 'loss/train': 1.205741047859192} 11/07/2021 15:13:34 - INFO - __main__ - Step 127839: {'lr': 2.7156891391159066e-05, 'samples': 24545088, 'steps': 127838, 'loss/train': 1.339217185974121} 11/07/2021 15:13:34 - INFO - __main__ - Step 127840: {'lr': 2.715448604463022e-05, 'samples': 24545280, 'steps': 127839, 'loss/train': 1.3303614854812622} 11/07/2021 15:13:35 - INFO - __main__ - Step 127841: {'lr': 2.715208079851214e-05, 'samples': 24545472, 'steps': 127840, 'loss/train': 1.007380723953247} 11/07/2021 15:13:36 - INFO - __main__ - Step 127842: {'lr': 2.7149675652805877e-05, 'samples': 24545664, 'steps': 127841, 'loss/train': 5.665793418884277} 11/07/2021 15:13:36 - INFO - __main__ - Step 127843: {'lr': 2.7147270607512543e-05, 'samples': 24545856, 'steps': 127842, 'loss/train': 1.4090489149093628} 11/07/2021 15:13:36 - INFO - __main__ - Step 127844: {'lr': 2.7144865662633217e-05, 'samples': 24546048, 'steps': 127843, 'loss/train': 1.4571548700332642} 11/07/2021 15:13:37 - INFO - __main__ - Step 127845: {'lr': 2.7142460818168985e-05, 'samples': 24546240, 'steps': 127844, 'loss/train': 1.3079508543014526} 11/07/2021 15:13:37 - INFO - __main__ - Step 127846: {'lr': 2.71400560741209e-05, 'samples': 24546432, 'steps': 127845, 'loss/train': 1.340547800064087} 11/07/2021 15:13:37 - INFO - __main__ - Step 127847: {'lr': 2.7137651430490074e-05, 'samples': 24546624, 'steps': 127846, 'loss/train': 1.2711796760559082} 11/07/2021 15:13:39 - INFO - __main__ - Step 127848: {'lr': 2.7135246887277586e-05, 'samples': 24546816, 'steps': 127847, 'loss/train': 1.1518326997756958} 11/07/2021 15:13:39 - INFO - __main__ - Step 127849: {'lr': 2.7132842444484497e-05, 'samples': 24547008, 'steps': 127848, 'loss/train': 1.2016428709030151} 11/07/2021 15:13:39 - INFO - __main__ - Step 127850: {'lr': 2.7130438102111937e-05, 'samples': 24547200, 'steps': 127849, 'loss/train': 1.5993902683258057} 11/07/2021 15:13:40 - INFO - __main__ - Step 127851: {'lr': 2.7128033860160994e-05, 'samples': 24547392, 'steps': 127850, 'loss/train': 1.3383398056030273} 11/07/2021 15:13:40 - INFO - __main__ - Step 127852: {'lr': 2.7125629718632665e-05, 'samples': 24547584, 'steps': 127851, 'loss/train': 1.3107292652130127} 11/07/2021 15:13:41 - INFO - __main__ - Step 127853: {'lr': 2.712322567752812e-05, 'samples': 24547776, 'steps': 127852, 'loss/train': 0.08843497931957245} 11/07/2021 15:13:41 - INFO - __main__ - Step 127854: {'lr': 2.7120821736848378e-05, 'samples': 24547968, 'steps': 127853, 'loss/train': 1.4605467319488525} 11/07/2021 15:13:42 - INFO - __main__ - Step 127855: {'lr': 2.7118417896594584e-05, 'samples': 24548160, 'steps': 127854, 'loss/train': 1.8944271802902222} 11/07/2021 15:13:42 - INFO - __main__ - Step 127856: {'lr': 2.711601415676776e-05, 'samples': 24548352, 'steps': 127855, 'loss/train': 1.6121344566345215} 11/07/2021 15:13:43 - INFO - __main__ - Step 127857: {'lr': 2.7113610517369047e-05, 'samples': 24548544, 'steps': 127856, 'loss/train': 1.37639319896698} 11/07/2021 15:13:44 - INFO - __main__ - Step 127858: {'lr': 2.7111206978399472e-05, 'samples': 24548736, 'steps': 127857, 'loss/train': 1.4546153545379639} 11/07/2021 15:13:44 - INFO - __main__ - Step 127859: {'lr': 2.710880353986017e-05, 'samples': 24548928, 'steps': 127858, 'loss/train': 1.6197829246520996} 11/07/2021 15:13:44 - INFO - __main__ - Step 127860: {'lr': 2.71064002017522e-05, 'samples': 24549120, 'steps': 127859, 'loss/train': 0.6986574530601501} 11/07/2021 15:13:45 - INFO - __main__ - Step 127861: {'lr': 2.7103996964076643e-05, 'samples': 24549312, 'steps': 127860, 'loss/train': 1.7879303693771362} 11/07/2021 15:13:45 - INFO - __main__ - Step 127862: {'lr': 2.710159382683458e-05, 'samples': 24549504, 'steps': 127861, 'loss/train': 1.1041624546051025} 11/07/2021 15:13:46 - INFO - __main__ - Step 127863: {'lr': 2.7099190790027178e-05, 'samples': 24549696, 'steps': 127862, 'loss/train': 1.3977693319320679} 11/07/2021 15:13:46 - INFO - __main__ - Step 127864: {'lr': 2.709678785365535e-05, 'samples': 24549888, 'steps': 127863, 'loss/train': 1.8605678081512451} 11/07/2021 15:13:47 - INFO - __main__ - Step 127865: {'lr': 2.7094385017720297e-05, 'samples': 24550080, 'steps': 127864, 'loss/train': 1.6417365074157715} 11/07/2021 15:13:47 - INFO - __main__ - Step 127866: {'lr': 2.7091982282223065e-05, 'samples': 24550272, 'steps': 127865, 'loss/train': 1.3045766353607178} 11/07/2021 15:13:47 - INFO - __main__ - Step 127867: {'lr': 2.708957964716474e-05, 'samples': 24550464, 'steps': 127866, 'loss/train': 1.54091215133667} 11/07/2021 15:13:48 - INFO - __main__ - Step 127868: {'lr': 2.7087177112546434e-05, 'samples': 24550656, 'steps': 127867, 'loss/train': 1.254414677619934} 11/07/2021 15:13:49 - INFO - __main__ - Step 127869: {'lr': 2.70847746783692e-05, 'samples': 24550848, 'steps': 127868, 'loss/train': 0.8781160712242126} 11/07/2021 15:13:49 - INFO - __main__ - Step 127870: {'lr': 2.7082372344634095e-05, 'samples': 24551040, 'steps': 127869, 'loss/train': 1.184690237045288} 11/07/2021 15:13:49 - INFO - __main__ - Step 127871: {'lr': 2.7079970111342277e-05, 'samples': 24551232, 'steps': 127870, 'loss/train': 1.5938409566879272} 11/07/2021 15:13:50 - INFO - __main__ - Step 127872: {'lr': 2.7077567978494754e-05, 'samples': 24551424, 'steps': 127871, 'loss/train': 1.2057569026947021} 11/07/2021 15:13:51 - INFO - __main__ - Step 127873: {'lr': 2.707516594609269e-05, 'samples': 24551616, 'steps': 127872, 'loss/train': 1.4927518367767334} 11/07/2021 15:13:52 - INFO - __main__ - Step 127874: {'lr': 2.707276401413708e-05, 'samples': 24551808, 'steps': 127873, 'loss/train': 1.2897720336914062} 11/07/2021 15:13:52 - INFO - __main__ - Step 127875: {'lr': 2.7070362182629038e-05, 'samples': 24552000, 'steps': 127874, 'loss/train': 1.292522668838501} 11/07/2021 15:13:53 - INFO - __main__ - Step 127876: {'lr': 2.7067960451569645e-05, 'samples': 24552192, 'steps': 127875, 'loss/train': 1.2344404458999634} 11/07/2021 15:13:53 - INFO - __main__ - Step 127877: {'lr': 2.7065558820959985e-05, 'samples': 24552384, 'steps': 127876, 'loss/train': 1.121396780014038} 11/07/2021 15:13:53 - INFO - __main__ - Step 127878: {'lr': 2.7063157290801167e-05, 'samples': 24552576, 'steps': 127877, 'loss/train': 0.07862497866153717} 11/07/2021 15:13:54 - INFO - __main__ - Step 127879: {'lr': 2.7060755861094243e-05, 'samples': 24552768, 'steps': 127878, 'loss/train': 0.9992244839668274} 11/07/2021 15:13:55 - INFO - __main__ - Step 127880: {'lr': 2.7058354531840274e-05, 'samples': 24552960, 'steps': 127879, 'loss/train': 1.5026482343673706} 11/07/2021 15:13:55 - INFO - __main__ - Step 127881: {'lr': 2.7055953303040394e-05, 'samples': 24553152, 'steps': 127880, 'loss/train': 1.2178494930267334} 11/07/2021 15:13:55 - INFO - __main__ - Step 127882: {'lr': 2.7053552174695656e-05, 'samples': 24553344, 'steps': 127881, 'loss/train': 1.7511868476867676} 11/07/2021 15:13:56 - INFO - __main__ - Step 127883: {'lr': 2.7051151146807173e-05, 'samples': 24553536, 'steps': 127882, 'loss/train': 1.3523675203323364} 11/07/2021 15:13:57 - INFO - __main__ - Step 127884: {'lr': 2.704875021937603e-05, 'samples': 24553728, 'steps': 127883, 'loss/train': 1.3983078002929688} 11/07/2021 15:13:57 - INFO - __main__ - Step 127885: {'lr': 2.704634939240322e-05, 'samples': 24553920, 'steps': 127884, 'loss/train': 1.3624054193496704} 11/07/2021 15:13:58 - INFO - __main__ - Step 127886: {'lr': 2.7043948665889885e-05, 'samples': 24554112, 'steps': 127885, 'loss/train': 0.7540894746780396} 11/07/2021 15:13:58 - INFO - __main__ - Step 127887: {'lr': 2.7041548039837105e-05, 'samples': 24554304, 'steps': 127886, 'loss/train': 1.1880112886428833} 11/07/2021 15:13:58 - INFO - __main__ - Step 127888: {'lr': 2.7039147514245993e-05, 'samples': 24554496, 'steps': 127887, 'loss/train': 1.1766842603683472} 11/07/2021 15:13:59 - INFO - __main__ - Step 127889: {'lr': 2.7036747089117577e-05, 'samples': 24554688, 'steps': 127888, 'loss/train': 1.0484697818756104} 11/07/2021 15:14:00 - INFO - __main__ - Step 127890: {'lr': 2.703434676445296e-05, 'samples': 24554880, 'steps': 127889, 'loss/train': 1.1729692220687866} 11/07/2021 15:14:00 - INFO - __main__ - Step 127891: {'lr': 2.7031946540253233e-05, 'samples': 24555072, 'steps': 127890, 'loss/train': 1.0632843971252441} 11/07/2021 15:14:00 - INFO - __main__ - Step 127892: {'lr': 2.7029546416519445e-05, 'samples': 24555264, 'steps': 127891, 'loss/train': 1.4580320119857788} 11/07/2021 15:14:01 - INFO - __main__ - Step 127893: {'lr': 2.702714639325274e-05, 'samples': 24555456, 'steps': 127892, 'loss/train': 0.8439717292785645} 11/07/2021 15:14:01 - INFO - __main__ - Step 127894: {'lr': 2.702474647045414e-05, 'samples': 24555648, 'steps': 127893, 'loss/train': 1.7488090991973877} 11/07/2021 15:14:03 - INFO - __main__ - Step 127895: {'lr': 2.7022346648124806e-05, 'samples': 24555840, 'steps': 127894, 'loss/train': 1.2649192810058594} 11/07/2021 15:14:04 - INFO - __main__ - Step 127896: {'lr': 2.701994692626572e-05, 'samples': 24556032, 'steps': 127895, 'loss/train': 0.874180018901825} 11/07/2021 15:14:04 - INFO - __main__ - Step 127897: {'lr': 2.701754730487799e-05, 'samples': 24556224, 'steps': 127896, 'loss/train': 0.9978048205375671} 11/07/2021 15:14:04 - INFO - __main__ - Step 127898: {'lr': 2.7015147783962718e-05, 'samples': 24556416, 'steps': 127897, 'loss/train': 1.0143181085586548} 11/07/2021 15:14:05 - INFO - __main__ - Step 127899: {'lr': 2.7012748363520996e-05, 'samples': 24556608, 'steps': 127898, 'loss/train': 1.7466437816619873} 11/07/2021 15:14:05 - INFO - __main__ - Step 127900: {'lr': 2.7010349043553874e-05, 'samples': 24556800, 'steps': 127899, 'loss/train': 1.733253002166748} 11/07/2021 15:14:05 - INFO - __main__ - Step 127901: {'lr': 2.7007949824062434e-05, 'samples': 24556992, 'steps': 127900, 'loss/train': 1.7553157806396484} 11/07/2021 15:14:06 - INFO - __main__ - Step 127902: {'lr': 2.7005550705047787e-05, 'samples': 24557184, 'steps': 127901, 'loss/train': 1.528497576713562} 11/07/2021 15:14:07 - INFO - __main__ - Step 127903: {'lr': 2.700315168651099e-05, 'samples': 24557376, 'steps': 127902, 'loss/train': 1.2633585929870605} 11/07/2021 15:14:07 - INFO - __main__ - Step 127904: {'lr': 2.700075276845315e-05, 'samples': 24557568, 'steps': 127903, 'loss/train': 1.500300407409668} 11/07/2021 15:14:08 - INFO - __main__ - Step 127905: {'lr': 2.6998353950875297e-05, 'samples': 24557760, 'steps': 127904, 'loss/train': 1.0347086191177368} 11/07/2021 15:14:08 - INFO - __main__ - Step 127906: {'lr': 2.6995955233778624e-05, 'samples': 24557952, 'steps': 127905, 'loss/train': 1.1981964111328125} 11/07/2021 15:14:08 - INFO - __main__ - Step 127907: {'lr': 2.6993556617164074e-05, 'samples': 24558144, 'steps': 127906, 'loss/train': 0.5413497090339661} 11/07/2021 15:14:10 - INFO - __main__ - Step 127908: {'lr': 2.699115810103278e-05, 'samples': 24558336, 'steps': 127907, 'loss/train': 0.7704252600669861} 11/07/2021 15:14:10 - INFO - __main__ - Step 127909: {'lr': 2.6988759685385833e-05, 'samples': 24558528, 'steps': 127908, 'loss/train': 1.1786028146743774} 11/07/2021 15:14:10 - INFO - __main__ - Step 127910: {'lr': 2.698636137022431e-05, 'samples': 24558720, 'steps': 127909, 'loss/train': 1.5139529705047607} 11/07/2021 15:14:11 - INFO - __main__ - Step 127911: {'lr': 2.6983963155549296e-05, 'samples': 24558912, 'steps': 127910, 'loss/train': 1.334621787071228} 11/07/2021 15:14:11 - INFO - __main__ - Step 127912: {'lr': 2.6981565041361873e-05, 'samples': 24559104, 'steps': 127911, 'loss/train': 0.8355957269668579} 11/07/2021 15:14:11 - INFO - __main__ - Step 127913: {'lr': 2.6979167027663094e-05, 'samples': 24559296, 'steps': 127912, 'loss/train': 1.152289628982544} 11/07/2021 15:14:13 - INFO - __main__ - Step 127914: {'lr': 2.697676911445407e-05, 'samples': 24559488, 'steps': 127913, 'loss/train': 1.4320971965789795} 11/07/2021 15:14:13 - INFO - __main__ - Step 127915: {'lr': 2.6974371301735885e-05, 'samples': 24559680, 'steps': 127914, 'loss/train': 1.64071786403656} 11/07/2021 15:14:13 - INFO - __main__ - Step 127916: {'lr': 2.697197358950959e-05, 'samples': 24559872, 'steps': 127915, 'loss/train': 1.2582112550735474} 11/07/2021 15:14:14 - INFO - __main__ - Step 127917: {'lr': 2.6969575977776272e-05, 'samples': 24560064, 'steps': 127916, 'loss/train': 1.0776801109313965} 11/07/2021 15:14:14 - INFO - __main__ - Step 127918: {'lr': 2.6967178466537095e-05, 'samples': 24560256, 'steps': 127917, 'loss/train': 1.4407212734222412} 11/07/2021 15:14:15 - INFO - __main__ - Step 127919: {'lr': 2.6964781055793004e-05, 'samples': 24560448, 'steps': 127918, 'loss/train': 1.3123583793640137} 11/07/2021 15:14:15 - INFO - __main__ - Step 127920: {'lr': 2.696238374554516e-05, 'samples': 24560640, 'steps': 127919, 'loss/train': 0.1374104619026184} 11/07/2021 15:14:16 - INFO - __main__ - Step 127921: {'lr': 2.6959986535794595e-05, 'samples': 24560832, 'steps': 127920, 'loss/train': 1.1427921056747437} 11/07/2021 15:14:16 - INFO - __main__ - Step 127922: {'lr': 2.6957589426542445e-05, 'samples': 24561024, 'steps': 127921, 'loss/train': 1.1330523490905762} 11/07/2021 15:14:16 - INFO - __main__ - Step 127923: {'lr': 2.6955192417789735e-05, 'samples': 24561216, 'steps': 127922, 'loss/train': 1.2146110534667969} 11/07/2021 15:14:17 - INFO - __main__ - Step 127924: {'lr': 2.695279550953761e-05, 'samples': 24561408, 'steps': 127923, 'loss/train': 1.4108892679214478} 11/07/2021 15:14:18 - INFO - __main__ - Step 127925: {'lr': 2.6950398701787088e-05, 'samples': 24561600, 'steps': 127924, 'loss/train': 1.0497268438339233} 11/07/2021 15:14:18 - INFO - __main__ - Step 127926: {'lr': 2.6948001994539283e-05, 'samples': 24561792, 'steps': 127925, 'loss/train': 1.1923145055770874} 11/07/2021 15:14:18 - INFO - __main__ - Step 127927: {'lr': 2.6945605387795253e-05, 'samples': 24561984, 'steps': 127926, 'loss/train': 1.3937233686447144} 11/07/2021 15:14:19 - INFO - __main__ - Step 127928: {'lr': 2.6943208881556104e-05, 'samples': 24562176, 'steps': 127927, 'loss/train': 0.6016250252723694} 11/07/2021 15:14:20 - INFO - __main__ - Step 127929: {'lr': 2.694081247582289e-05, 'samples': 24562368, 'steps': 127928, 'loss/train': 1.3180224895477295} 11/07/2021 15:14:20 - INFO - __main__ - Step 127930: {'lr': 2.6938416170596725e-05, 'samples': 24562560, 'steps': 127929, 'loss/train': 1.348571538925171} 11/07/2021 15:14:21 - INFO - __main__ - Step 127931: {'lr': 2.6936019965878662e-05, 'samples': 24562752, 'steps': 127930, 'loss/train': 1.375658631324768} 11/07/2021 15:14:21 - INFO - __main__ - Step 127932: {'lr': 2.693362386166981e-05, 'samples': 24562944, 'steps': 127931, 'loss/train': 1.0439791679382324} 11/07/2021 15:14:21 - INFO - __main__ - Step 127933: {'lr': 2.6931227857971196e-05, 'samples': 24563136, 'steps': 127932, 'loss/train': 1.304913878440857} 11/07/2021 15:14:23 - INFO - __main__ - Step 127934: {'lr': 2.6928831954783934e-05, 'samples': 24563328, 'steps': 127933, 'loss/train': 0.6660281419754028} 11/07/2021 15:14:23 - INFO - __main__ - Step 127935: {'lr': 2.69264361521091e-05, 'samples': 24563520, 'steps': 127934, 'loss/train': 1.034876823425293} 11/07/2021 15:14:23 - INFO - __main__ - Step 127936: {'lr': 2.6924040449947755e-05, 'samples': 24563712, 'steps': 127935, 'loss/train': 0.9603924751281738} 11/07/2021 15:14:24 - INFO - __main__ - Step 127937: {'lr': 2.6921644848301007e-05, 'samples': 24563904, 'steps': 127936, 'loss/train': 0.7660900354385376} 11/07/2021 15:14:24 - INFO - __main__ - Step 127938: {'lr': 2.6919249347169937e-05, 'samples': 24564096, 'steps': 127937, 'loss/train': 1.172394037246704} 11/07/2021 15:14:24 - INFO - __main__ - Step 127939: {'lr': 2.6916853946555576e-05, 'samples': 24564288, 'steps': 127938, 'loss/train': 1.1078708171844482} 11/07/2021 15:14:25 - INFO - __main__ - Step 127940: {'lr': 2.6914458646459055e-05, 'samples': 24564480, 'steps': 127939, 'loss/train': 1.5225403308868408} 11/07/2021 15:14:26 - INFO - __main__ - Step 127941: {'lr': 2.6912063446881435e-05, 'samples': 24564672, 'steps': 127940, 'loss/train': 0.5612502694129944} 11/07/2021 15:14:26 - INFO - __main__ - Step 127942: {'lr': 2.6909668347823824e-05, 'samples': 24564864, 'steps': 127941, 'loss/train': 1.3298417329788208} 11/07/2021 15:14:27 - INFO - __main__ - Step 127943: {'lr': 2.6907273349287248e-05, 'samples': 24565056, 'steps': 127942, 'loss/train': 2.28031587600708} 11/07/2021 15:14:27 - INFO - __main__ - Step 127944: {'lr': 2.690487845127282e-05, 'samples': 24565248, 'steps': 127943, 'loss/train': 0.8605681657791138} 11/07/2021 15:14:28 - INFO - __main__ - Step 127945: {'lr': 2.6902483653781644e-05, 'samples': 24565440, 'steps': 127944, 'loss/train': 1.2264297008514404} 11/07/2021 15:14:28 - INFO - __main__ - Step 127946: {'lr': 2.6900088956814727e-05, 'samples': 24565632, 'steps': 127945, 'loss/train': 1.4147539138793945} 11/07/2021 15:14:29 - INFO - __main__ - Step 127947: {'lr': 2.6897694360373175e-05, 'samples': 24565824, 'steps': 127946, 'loss/train': 0.6912171840667725} 11/07/2021 15:14:29 - INFO - __main__ - Step 127948: {'lr': 2.68952998644581e-05, 'samples': 24566016, 'steps': 127947, 'loss/train': 1.310416579246521} 11/07/2021 15:14:29 - INFO - __main__ - Step 127949: {'lr': 2.6892905469070554e-05, 'samples': 24566208, 'steps': 127948, 'loss/train': 1.0995935201644897} 11/07/2021 15:14:30 - INFO - __main__ - Step 127950: {'lr': 2.6890511174211624e-05, 'samples': 24566400, 'steps': 127949, 'loss/train': 1.2557185888290405} 11/07/2021 15:14:31 - INFO - __main__ - Step 127951: {'lr': 2.688811697988239e-05, 'samples': 24566592, 'steps': 127950, 'loss/train': 1.4135857820510864} 11/07/2021 15:14:31 - INFO - __main__ - Step 127952: {'lr': 2.6885722886083903e-05, 'samples': 24566784, 'steps': 127951, 'loss/train': 1.4757001399993896} 11/07/2021 15:14:31 - INFO - __main__ - Step 127953: {'lr': 2.6883328892817305e-05, 'samples': 24566976, 'steps': 127952, 'loss/train': 1.617256999015808} 11/07/2021 15:14:32 - INFO - __main__ - Step 127954: {'lr': 2.6880935000083597e-05, 'samples': 24567168, 'steps': 127953, 'loss/train': 1.5089582204818726} 11/07/2021 15:14:33 - INFO - __main__ - Step 127955: {'lr': 2.6878541207883938e-05, 'samples': 24567360, 'steps': 127954, 'loss/train': 1.5200793743133545} 11/07/2021 15:14:33 - INFO - __main__ - Step 127956: {'lr': 2.6876147516219334e-05, 'samples': 24567552, 'steps': 127955, 'loss/train': 1.0155495405197144} 11/07/2021 15:14:34 - INFO - __main__ - Step 127957: {'lr': 2.6873753925090894e-05, 'samples': 24567744, 'steps': 127956, 'loss/train': 0.9188483357429504} 11/07/2021 15:14:34 - INFO - __main__ - Step 127958: {'lr': 2.6871360434499725e-05, 'samples': 24567936, 'steps': 127957, 'loss/train': 1.0375275611877441} 11/07/2021 15:14:34 - INFO - __main__ - Step 127959: {'lr': 2.686896704444691e-05, 'samples': 24568128, 'steps': 127958, 'loss/train': 1.0383102893829346} 11/07/2021 15:14:35 - INFO - __main__ - Step 127960: {'lr': 2.6866573754933426e-05, 'samples': 24568320, 'steps': 127959, 'loss/train': 1.511928915977478} 11/07/2021 15:14:36 - INFO - __main__ - Step 127961: {'lr': 2.686418056596046e-05, 'samples': 24568512, 'steps': 127960, 'loss/train': 1.0399607419967651} 11/07/2021 15:14:36 - INFO - __main__ - Step 127962: {'lr': 2.6861787477529016e-05, 'samples': 24568704, 'steps': 127961, 'loss/train': 0.055380694568157196} 11/07/2021 15:14:37 - INFO - __main__ - Step 127963: {'lr': 2.6859394489640225e-05, 'samples': 24568896, 'steps': 127962, 'loss/train': 1.0009111166000366} 11/07/2021 15:14:37 - INFO - __main__ - Step 127964: {'lr': 2.685700160229515e-05, 'samples': 24569088, 'steps': 127963, 'loss/train': 1.5295100212097168} 11/07/2021 15:14:37 - INFO - __main__ - Step 127965: {'lr': 2.685460881549487e-05, 'samples': 24569280, 'steps': 127964, 'loss/train': 0.7611652612686157} 11/07/2021 15:14:38 - INFO - __main__ - Step 127966: {'lr': 2.6852216129240437e-05, 'samples': 24569472, 'steps': 127965, 'loss/train': 1.5457886457443237} 11/07/2021 15:14:39 - INFO - __main__ - Step 127967: {'lr': 2.6849823543532963e-05, 'samples': 24569664, 'steps': 127966, 'loss/train': 1.1946723461151123} 11/07/2021 15:14:39 - INFO - __main__ - Step 127968: {'lr': 2.6847431058373534e-05, 'samples': 24569856, 'steps': 127967, 'loss/train': 0.6818677186965942} 11/07/2021 15:14:39 - INFO - __main__ - Step 127969: {'lr': 2.684503867376317e-05, 'samples': 24570048, 'steps': 127968, 'loss/train': 1.4982541799545288} 11/07/2021 15:14:40 - INFO - __main__ - Step 127970: {'lr': 2.684264638970302e-05, 'samples': 24570240, 'steps': 127969, 'loss/train': 1.341609001159668} 11/07/2021 15:14:41 - INFO - __main__ - Step 127971: {'lr': 2.6840254206194127e-05, 'samples': 24570432, 'steps': 127970, 'loss/train': 1.1333363056182861} 11/07/2021 15:14:41 - INFO - __main__ - Step 127972: {'lr': 2.6837862123237634e-05, 'samples': 24570624, 'steps': 127971, 'loss/train': 0.4361759126186371} 11/07/2021 15:14:42 - INFO - __main__ - Step 127973: {'lr': 2.6835470140834483e-05, 'samples': 24570816, 'steps': 127972, 'loss/train': 1.2837157249450684} 11/07/2021 15:14:42 - INFO - __main__ - Step 127974: {'lr': 2.6833078258985815e-05, 'samples': 24571008, 'steps': 127973, 'loss/train': 1.0696994066238403} 11/07/2021 15:14:42 - INFO - __main__ - Step 127975: {'lr': 2.683068647769274e-05, 'samples': 24571200, 'steps': 127974, 'loss/train': 1.298898696899414} 11/07/2021 15:14:43 - INFO - __main__ - Step 127976: {'lr': 2.682829479695631e-05, 'samples': 24571392, 'steps': 127975, 'loss/train': 0.3472585380077362} 11/07/2021 15:14:44 - INFO - __main__ - Step 127977: {'lr': 2.682590321677761e-05, 'samples': 24571584, 'steps': 127976, 'loss/train': 1.489234209060669} 11/07/2021 15:14:44 - INFO - __main__ - Step 127978: {'lr': 2.682351173715772e-05, 'samples': 24571776, 'steps': 127977, 'loss/train': 1.085405945777893} 11/07/2021 15:14:44 - INFO - __main__ - Step 127979: {'lr': 2.6821120358097694e-05, 'samples': 24571968, 'steps': 127978, 'loss/train': 1.3542039394378662} 11/07/2021 15:14:45 - INFO - __main__ - Step 127980: {'lr': 2.6818729079598648e-05, 'samples': 24572160, 'steps': 127979, 'loss/train': 1.5565369129180908} 11/07/2021 15:14:46 - INFO - __main__ - Step 127981: {'lr': 2.6816337901661603e-05, 'samples': 24572352, 'steps': 127980, 'loss/train': 1.3745132684707642} 11/07/2021 15:14:46 - INFO - __main__ - Step 127982: {'lr': 2.68139468242877e-05, 'samples': 24572544, 'steps': 127981, 'loss/train': 0.9639144539833069} 11/07/2021 15:14:46 - INFO - __main__ - Step 127983: {'lr': 2.681155584747799e-05, 'samples': 24572736, 'steps': 127982, 'loss/train': 1.7278997898101807} 11/07/2021 15:14:47 - INFO - __main__ - Step 127984: {'lr': 2.6809164971233536e-05, 'samples': 24572928, 'steps': 127983, 'loss/train': 0.9620276689529419} 11/07/2021 15:14:47 - INFO - __main__ - Step 127985: {'lr': 2.680677419555544e-05, 'samples': 24573120, 'steps': 127984, 'loss/train': 1.0450271368026733} 11/07/2021 15:14:47 - INFO - __main__ - Step 127986: {'lr': 2.6804383520444812e-05, 'samples': 24573312, 'steps': 127985, 'loss/train': 1.2784138917922974} 11/07/2021 15:14:48 - INFO - __main__ - Step 127987: {'lr': 2.680199294590263e-05, 'samples': 24573504, 'steps': 127986, 'loss/train': 1.44276762008667} 11/07/2021 15:14:49 - INFO - __main__ - Step 127988: {'lr': 2.679960247193003e-05, 'samples': 24573696, 'steps': 127987, 'loss/train': 1.1593409776687622} 11/07/2021 15:14:49 - INFO - __main__ - Step 127989: {'lr': 2.679721209852809e-05, 'samples': 24573888, 'steps': 127988, 'loss/train': 1.57772696018219} 11/07/2021 15:14:49 - INFO - __main__ - Step 127990: {'lr': 2.6794821825697895e-05, 'samples': 24574080, 'steps': 127989, 'loss/train': 0.6194378137588501} 11/07/2021 15:14:50 - INFO - __main__ - Step 127991: {'lr': 2.6792431653440473e-05, 'samples': 24574272, 'steps': 127990, 'loss/train': 1.345476746559143} 11/07/2021 15:14:51 - INFO - __main__ - Step 127992: {'lr': 2.6790041581756965e-05, 'samples': 24574464, 'steps': 127991, 'loss/train': 1.0239861011505127} 11/07/2021 15:14:51 - INFO - __main__ - Step 127993: {'lr': 2.6787651610648417e-05, 'samples': 24574656, 'steps': 127992, 'loss/train': 1.1750010251998901} 11/07/2021 15:14:52 - INFO - __main__ - Step 127994: {'lr': 2.678526174011589e-05, 'samples': 24574848, 'steps': 127993, 'loss/train': 0.9126515984535217} 11/07/2021 15:14:52 - INFO - __main__ - Step 127995: {'lr': 2.6782871970160494e-05, 'samples': 24575040, 'steps': 127994, 'loss/train': 0.44540560245513916} 11/07/2021 15:14:52 - INFO - __main__ - Step 127996: {'lr': 2.6780482300783283e-05, 'samples': 24575232, 'steps': 127995, 'loss/train': 1.2711994647979736} 11/07/2021 15:14:53 - INFO - __main__ - Step 127997: {'lr': 2.6778092731985366e-05, 'samples': 24575424, 'steps': 127996, 'loss/train': 0.9398373365402222} 11/07/2021 15:14:54 - INFO - __main__ - Step 127998: {'lr': 2.677570326376777e-05, 'samples': 24575616, 'steps': 127997, 'loss/train': 1.3386132717132568} 11/07/2021 15:14:54 - INFO - __main__ - Step 127999: {'lr': 2.677331389613166e-05, 'samples': 24575808, 'steps': 127998, 'loss/train': 1.2946027517318726} 11/07/2021 15:14:54 - INFO - __main__ - Step 128000: {'lr': 2.6770924629077987e-05, 'samples': 24576000, 'steps': 127999, 'loss/train': 0.70365971326828} 11/07/2021 15:14:55 - INFO - __main__ - Step 128001: {'lr': 2.6768535462607907e-05, 'samples': 24576192, 'steps': 128000, 'loss/train': 1.2198430299758911} 11/07/2021 15:14:56 - INFO - __main__ - Step 128002: {'lr': 2.676614639672248e-05, 'samples': 24576384, 'steps': 128001, 'loss/train': 1.2929776906967163} 11/07/2021 15:14:56 - INFO - __main__ - Step 128003: {'lr': 2.676375743142276e-05, 'samples': 24576576, 'steps': 128002, 'loss/train': 1.364205002784729} 11/07/2021 15:14:56 - INFO - __main__ - Step 128004: {'lr': 2.676136856670988e-05, 'samples': 24576768, 'steps': 128003, 'loss/train': 0.9254674911499023} 11/07/2021 15:14:57 - INFO - __main__ - Step 128005: {'lr': 2.6758979802584848e-05, 'samples': 24576960, 'steps': 128004, 'loss/train': 1.2961781024932861} 11/07/2021 15:14:57 - INFO - __main__ - Step 128006: {'lr': 2.6756591139048796e-05, 'samples': 24577152, 'steps': 128005, 'loss/train': 1.4939062595367432} 11/07/2021 15:14:58 - INFO - __main__ - Step 128007: {'lr': 2.6754202576102782e-05, 'samples': 24577344, 'steps': 128006, 'loss/train': 1.6374156475067139} 11/07/2021 15:14:59 - INFO - __main__ - Step 128008: {'lr': 2.6751814113747887e-05, 'samples': 24577536, 'steps': 128007, 'loss/train': 1.346124529838562} 11/07/2021 15:14:59 - INFO - __main__ - Step 128009: {'lr': 2.674942575198516e-05, 'samples': 24577728, 'steps': 128008, 'loss/train': 1.0776050090789795} 11/07/2021 15:14:59 - INFO - __main__ - Step 128010: {'lr': 2.6747037490815695e-05, 'samples': 24577920, 'steps': 128009, 'loss/train': 1.5404495000839233} 11/07/2021 15:15:00 - INFO - __main__ - Step 128011: {'lr': 2.6744649330240567e-05, 'samples': 24578112, 'steps': 128010, 'loss/train': 0.8129628300666809} 11/07/2021 15:15:01 - INFO - __main__ - Step 128012: {'lr': 2.674226127026086e-05, 'samples': 24578304, 'steps': 128011, 'loss/train': 1.277174949645996} 11/07/2021 15:15:01 - INFO - __main__ - Step 128013: {'lr': 2.673987331087771e-05, 'samples': 24578496, 'steps': 128012, 'loss/train': 0.7958237528800964} 11/07/2021 15:15:01 - INFO - __main__ - Step 128014: {'lr': 2.6737485452092064e-05, 'samples': 24578688, 'steps': 128013, 'loss/train': 1.2297440767288208} 11/07/2021 15:15:02 - INFO - __main__ - Step 128015: {'lr': 2.673509769390506e-05, 'samples': 24578880, 'steps': 128014, 'loss/train': 1.1088740825653076} 11/07/2021 15:15:02 - INFO - __main__ - Step 128016: {'lr': 2.6732710036317804e-05, 'samples': 24579072, 'steps': 128015, 'loss/train': 1.7027066946029663} 11/07/2021 15:15:02 - INFO - __main__ - Step 128017: {'lr': 2.6730322479331297e-05, 'samples': 24579264, 'steps': 128016, 'loss/train': 1.1959820985794067} 11/07/2021 15:15:03 - INFO - __main__ - Step 128018: {'lr': 2.672793502294671e-05, 'samples': 24579456, 'steps': 128017, 'loss/train': 1.098368763923645} 11/07/2021 15:15:04 - INFO - __main__ - Step 128019: {'lr': 2.6725547667165035e-05, 'samples': 24579648, 'steps': 128018, 'loss/train': 1.2204906940460205} 11/07/2021 15:15:04 - INFO - __main__ - Step 128020: {'lr': 2.6723160411987385e-05, 'samples': 24579840, 'steps': 128019, 'loss/train': 1.3289388418197632} 11/07/2021 15:15:04 - INFO - __main__ - Step 128021: {'lr': 2.6720773257414844e-05, 'samples': 24580032, 'steps': 128020, 'loss/train': 1.3694573640823364} 11/07/2021 15:15:05 - INFO - __main__ - Step 128022: {'lr': 2.671838620344849e-05, 'samples': 24580224, 'steps': 128021, 'loss/train': 1.3400943279266357} 11/07/2021 15:15:06 - INFO - __main__ - Step 128023: {'lr': 2.6715999250089358e-05, 'samples': 24580416, 'steps': 128022, 'loss/train': 1.4510407447814941} 11/07/2021 15:15:06 - INFO - __main__ - Step 128024: {'lr': 2.6713612397338575e-05, 'samples': 24580608, 'steps': 128023, 'loss/train': 1.1067581176757812} 11/07/2021 15:15:07 - INFO - __main__ - Step 128025: {'lr': 2.671122564519718e-05, 'samples': 24580800, 'steps': 128024, 'loss/train': 1.175380825996399} 11/07/2021 15:15:07 - INFO - __main__ - Step 128026: {'lr': 2.6708838993666302e-05, 'samples': 24580992, 'steps': 128025, 'loss/train': 0.9649370908737183} 11/07/2021 15:15:07 - INFO - __main__ - Step 128027: {'lr': 2.670645244274694e-05, 'samples': 24581184, 'steps': 128026, 'loss/train': 1.895155668258667} 11/07/2021 15:15:09 - INFO - __main__ - Step 128028: {'lr': 2.670406599244021e-05, 'samples': 24581376, 'steps': 128027, 'loss/train': 1.2726547718048096} 11/07/2021 15:15:10 - INFO - __main__ - Step 128029: {'lr': 2.670167964274717e-05, 'samples': 24581568, 'steps': 128028, 'loss/train': 1.7287628650665283} 11/07/2021 15:15:10 - INFO - __main__ - Step 128030: {'lr': 2.6699293393668918e-05, 'samples': 24581760, 'steps': 128029, 'loss/train': 1.5476548671722412} 11/07/2021 15:15:10 - INFO - __main__ - Step 128031: {'lr': 2.6696907245206515e-05, 'samples': 24581952, 'steps': 128030, 'loss/train': 1.440126657485962} 11/07/2021 15:15:11 - INFO - __main__ - Step 128032: {'lr': 2.6694521197361015e-05, 'samples': 24582144, 'steps': 128031, 'loss/train': 1.4334193468093872} 11/07/2021 15:15:11 - INFO - __main__ - Step 128033: {'lr': 2.669213525013356e-05, 'samples': 24582336, 'steps': 128032, 'loss/train': 1.084893822669983} 11/07/2021 15:15:12 - INFO - __main__ - Step 128034: {'lr': 2.6689749403525145e-05, 'samples': 24582528, 'steps': 128033, 'loss/train': 1.5132473707199097} 11/07/2021 15:15:13 - INFO - __main__ - Step 128035: {'lr': 2.6687363657536905e-05, 'samples': 24582720, 'steps': 128034, 'loss/train': 1.6126759052276611} 11/07/2021 15:15:13 - INFO - __main__ - Step 128036: {'lr': 2.66849780121699e-05, 'samples': 24582912, 'steps': 128035, 'loss/train': 1.237551212310791} 11/07/2021 15:15:13 - INFO - __main__ - Step 128037: {'lr': 2.6682592467425187e-05, 'samples': 24583104, 'steps': 128036, 'loss/train': 0.9065446257591248} 11/07/2021 15:15:14 - INFO - __main__ - Step 128038: {'lr': 2.6680207023303843e-05, 'samples': 24583296, 'steps': 128037, 'loss/train': 0.6328088641166687} 11/07/2021 15:15:14 - INFO - __main__ - Step 128039: {'lr': 2.6677821679807008e-05, 'samples': 24583488, 'steps': 128038, 'loss/train': 1.592343807220459} 11/07/2021 15:15:14 - INFO - __main__ - Step 128040: {'lr': 2.667543643693565e-05, 'samples': 24583680, 'steps': 128039, 'loss/train': 1.48020339012146} 11/07/2021 15:15:15 - INFO - __main__ - Step 128041: {'lr': 2.6673051294690914e-05, 'samples': 24583872, 'steps': 128040, 'loss/train': 0.7004969120025635} 11/07/2021 15:15:16 - INFO - __main__ - Step 128042: {'lr': 2.6670666253073823e-05, 'samples': 24584064, 'steps': 128041, 'loss/train': 1.3047012090682983} 11/07/2021 15:15:16 - INFO - __main__ - Step 128043: {'lr': 2.6668281312085513e-05, 'samples': 24584256, 'steps': 128042, 'loss/train': 1.4825634956359863} 11/07/2021 15:15:17 - INFO - __main__ - Step 128044: {'lr': 2.6665896471727015e-05, 'samples': 24584448, 'steps': 128043, 'loss/train': 1.193413257598877} 11/07/2021 15:15:17 - INFO - __main__ - Step 128045: {'lr': 2.666351173199941e-05, 'samples': 24584640, 'steps': 128044, 'loss/train': 1.2213693857192993} 11/07/2021 15:15:18 - INFO - __main__ - Step 128046: {'lr': 2.6661127092903775e-05, 'samples': 24584832, 'steps': 128045, 'loss/train': 0.3087203800678253} 11/07/2021 15:15:18 - INFO - __main__ - Step 128047: {'lr': 2.6658742554441202e-05, 'samples': 24585024, 'steps': 128046, 'loss/train': 0.8857443332672119} 11/07/2021 15:15:19 - INFO - __main__ - Step 128048: {'lr': 2.6656358116612767e-05, 'samples': 24585216, 'steps': 128047, 'loss/train': 0.9311529994010925} 11/07/2021 15:15:19 - INFO - __main__ - Step 128049: {'lr': 2.6653973779419527e-05, 'samples': 24585408, 'steps': 128048, 'loss/train': 1.0168015956878662} 11/07/2021 15:15:19 - INFO - __main__ - Step 128050: {'lr': 2.6651589542862536e-05, 'samples': 24585600, 'steps': 128049, 'loss/train': 0.757550060749054} 11/07/2021 15:15:20 - INFO - __main__ - Step 128051: {'lr': 2.6649205406942904e-05, 'samples': 24585792, 'steps': 128050, 'loss/train': 1.320420265197754} 11/07/2021 15:15:21 - INFO - __main__ - Step 128052: {'lr': 2.6646821371661717e-05, 'samples': 24585984, 'steps': 128051, 'loss/train': 1.2936878204345703} 11/07/2021 15:15:21 - INFO - __main__ - Step 128053: {'lr': 2.6644437437020052e-05, 'samples': 24586176, 'steps': 128052, 'loss/train': 0.9708200693130493} 11/07/2021 15:15:21 - INFO - __main__ - Step 128054: {'lr': 2.664205360301891e-05, 'samples': 24586368, 'steps': 128053, 'loss/train': 1.1296954154968262} 11/07/2021 15:15:22 - INFO - __main__ - Step 128055: {'lr': 2.6639669869659407e-05, 'samples': 24586560, 'steps': 128054, 'loss/train': 1.336923599243164} 11/07/2021 15:15:23 - INFO - __main__ - Step 128056: {'lr': 2.6637286236942615e-05, 'samples': 24586752, 'steps': 128055, 'loss/train': 1.3636831045150757} 11/07/2021 15:15:23 - INFO - __main__ - Step 128057: {'lr': 2.6634902704869624e-05, 'samples': 24586944, 'steps': 128056, 'loss/train': 1.5150070190429688} 11/07/2021 15:15:23 - INFO - __main__ - Step 128058: {'lr': 2.6632519273441512e-05, 'samples': 24587136, 'steps': 128057, 'loss/train': 1.2733023166656494} 11/07/2021 15:15:24 - INFO - __main__ - Step 128059: {'lr': 2.663013594265934e-05, 'samples': 24587328, 'steps': 128058, 'loss/train': 1.157301902770996} 11/07/2021 15:15:24 - INFO - __main__ - Step 128060: {'lr': 2.6627752712524157e-05, 'samples': 24587520, 'steps': 128059, 'loss/train': 1.509647250175476} 11/07/2021 15:15:25 - INFO - __main__ - Step 128061: {'lr': 2.6625369583037073e-05, 'samples': 24587712, 'steps': 128060, 'loss/train': 1.4456660747528076} 11/07/2021 15:15:26 - INFO - __main__ - Step 128062: {'lr': 2.6622986554199174e-05, 'samples': 24587904, 'steps': 128061, 'loss/train': 1.1293706893920898} 11/07/2021 15:15:26 - INFO - __main__ - Step 128063: {'lr': 2.6620603626011486e-05, 'samples': 24588096, 'steps': 128062, 'loss/train': 1.0175316333770752} 11/07/2021 15:15:26 - INFO - __main__ - Step 128064: {'lr': 2.6618220798475117e-05, 'samples': 24588288, 'steps': 128063, 'loss/train': 1.1096633672714233} 11/07/2021 15:15:27 - INFO - __main__ - Step 128065: {'lr': 2.6615838071591124e-05, 'samples': 24588480, 'steps': 128064, 'loss/train': 1.1597900390625} 11/07/2021 15:15:27 - INFO - __main__ - Step 128066: {'lr': 2.661345544536062e-05, 'samples': 24588672, 'steps': 128065, 'loss/train': 1.062257170677185} 11/07/2021 15:15:28 - INFO - __main__ - Step 128067: {'lr': 2.6611072919784624e-05, 'samples': 24588864, 'steps': 128066, 'loss/train': 0.8276917934417725} 11/07/2021 15:15:28 - INFO - __main__ - Step 128068: {'lr': 2.6608690494864225e-05, 'samples': 24589056, 'steps': 128067, 'loss/train': 1.0453861951828003} 11/07/2021 15:15:29 - INFO - __main__ - Step 128069: {'lr': 2.66063081706005e-05, 'samples': 24589248, 'steps': 128068, 'loss/train': 1.3286019563674927} 11/07/2021 15:15:29 - INFO - __main__ - Step 128070: {'lr': 2.660392594699454e-05, 'samples': 24589440, 'steps': 128069, 'loss/train': 1.5003069639205933} 11/07/2021 15:15:30 - INFO - __main__ - Step 128071: {'lr': 2.6601543824047363e-05, 'samples': 24589632, 'steps': 128070, 'loss/train': 0.652435839176178} 11/07/2021 15:15:31 - INFO - __main__ - Step 128072: {'lr': 2.6599161801760115e-05, 'samples': 24589824, 'steps': 128071, 'loss/train': 1.4585925340652466} 11/07/2021 15:15:31 - INFO - __main__ - Step 128073: {'lr': 2.659677988013384e-05, 'samples': 24590016, 'steps': 128072, 'loss/train': 0.8543286919593811} 11/07/2021 15:15:32 - INFO - __main__ - Step 128074: {'lr': 2.6594398059169607e-05, 'samples': 24590208, 'steps': 128073, 'loss/train': 1.335404396057129} 11/07/2021 15:15:32 - INFO - __main__ - Step 128075: {'lr': 2.6592016338868486e-05, 'samples': 24590400, 'steps': 128074, 'loss/train': 0.45085209608078003} 11/07/2021 15:15:32 - INFO - __main__ - Step 128076: {'lr': 2.6589634719231535e-05, 'samples': 24590592, 'steps': 128075, 'loss/train': 1.2341762781143188} 11/07/2021 15:15:34 - INFO - __main__ - Step 128077: {'lr': 2.658725320025987e-05, 'samples': 24590784, 'steps': 128076, 'loss/train': 0.09461282193660736} 11/07/2021 15:15:34 - INFO - __main__ - Step 128078: {'lr': 2.658487178195454e-05, 'samples': 24590976, 'steps': 128077, 'loss/train': 1.236715316772461} 11/07/2021 15:15:34 - INFO - __main__ - Step 128079: {'lr': 2.658249046431663e-05, 'samples': 24591168, 'steps': 128078, 'loss/train': 1.3876972198486328} 11/07/2021 15:15:35 - INFO - __main__ - Step 128080: {'lr': 2.658010924734722e-05, 'samples': 24591360, 'steps': 128079, 'loss/train': 1.230312705039978} 11/07/2021 15:15:35 - INFO - __main__ - Step 128081: {'lr': 2.6577728131047335e-05, 'samples': 24591552, 'steps': 128080, 'loss/train': 1.2779728174209595} 11/07/2021 15:15:35 - INFO - __main__ - Step 128082: {'lr': 2.657534711541809e-05, 'samples': 24591744, 'steps': 128081, 'loss/train': 1.6887234449386597} 11/07/2021 15:15:37 - INFO - __main__ - Step 128083: {'lr': 2.6572966200460513e-05, 'samples': 24591936, 'steps': 128082, 'loss/train': 1.7304726839065552} 11/07/2021 15:15:37 - INFO - __main__ - Step 128084: {'lr': 2.657058538617574e-05, 'samples': 24592128, 'steps': 128083, 'loss/train': 2.0910110473632812} 11/07/2021 15:15:37 - INFO - __main__ - Step 128085: {'lr': 2.6568204672564796e-05, 'samples': 24592320, 'steps': 128084, 'loss/train': 0.9340899586677551} 11/07/2021 15:15:38 - INFO - __main__ - Step 128086: {'lr': 2.656582405962879e-05, 'samples': 24592512, 'steps': 128085, 'loss/train': 1.4084793329238892} 11/07/2021 15:15:38 - INFO - __main__ - Step 128087: {'lr': 2.6563443547368755e-05, 'samples': 24592704, 'steps': 128086, 'loss/train': 1.3224409818649292} 11/07/2021 15:15:39 - INFO - __main__ - Step 128088: {'lr': 2.6561063135785796e-05, 'samples': 24592896, 'steps': 128087, 'loss/train': 0.49014726281166077} 11/07/2021 15:15:40 - INFO - __main__ - Step 128089: {'lr': 2.655868282488097e-05, 'samples': 24593088, 'steps': 128088, 'loss/train': 1.6068974733352661} 11/07/2021 15:15:40 - INFO - __main__ - Step 128090: {'lr': 2.6556302614655358e-05, 'samples': 24593280, 'steps': 128089, 'loss/train': 1.2619816064834595} 11/07/2021 15:15:40 - INFO - __main__ - Step 128091: {'lr': 2.6553922505110016e-05, 'samples': 24593472, 'steps': 128090, 'loss/train': 1.0896180868148804} 11/07/2021 15:15:41 - INFO - __main__ - Step 128092: {'lr': 2.6551542496246056e-05, 'samples': 24593664, 'steps': 128091, 'loss/train': 1.1580965518951416} 11/07/2021 15:15:41 - INFO - __main__ - Step 128093: {'lr': 2.6549162588064556e-05, 'samples': 24593856, 'steps': 128092, 'loss/train': 1.4914518594741821} 11/07/2021 15:15:42 - INFO - __main__ - Step 128094: {'lr': 2.654678278056649e-05, 'samples': 24594048, 'steps': 128093, 'loss/train': 1.2264516353607178} 11/07/2021 15:15:43 - INFO - __main__ - Step 128095: {'lr': 2.6544403073753027e-05, 'samples': 24594240, 'steps': 128094, 'loss/train': 1.1909271478652954} 11/07/2021 15:15:43 - INFO - __main__ - Step 128096: {'lr': 2.6542023467625186e-05, 'samples': 24594432, 'steps': 128095, 'loss/train': 1.4720908403396606} 11/07/2021 15:15:43 - INFO - __main__ - Step 128097: {'lr': 2.6539643962184058e-05, 'samples': 24594624, 'steps': 128096, 'loss/train': 1.750276803970337} 11/07/2021 15:15:44 - INFO - __main__ - Step 128098: {'lr': 2.6537264557430718e-05, 'samples': 24594816, 'steps': 128097, 'loss/train': 1.2787625789642334} 11/07/2021 15:15:45 - INFO - __main__ - Step 128099: {'lr': 2.653488525336625e-05, 'samples': 24595008, 'steps': 128098, 'loss/train': 1.3827873468399048} 11/07/2021 15:15:45 - INFO - __main__ - Step 128100: {'lr': 2.6532506049991715e-05, 'samples': 24595200, 'steps': 128099, 'loss/train': 1.6808239221572876} 11/07/2021 15:15:45 - INFO - __main__ - Step 128101: {'lr': 2.653012694730819e-05, 'samples': 24595392, 'steps': 128100, 'loss/train': 1.2227683067321777} 11/07/2021 15:15:46 - INFO - __main__ - Step 128102: {'lr': 2.6527747945316733e-05, 'samples': 24595584, 'steps': 128101, 'loss/train': 1.591880202293396} 11/07/2021 15:15:46 - INFO - __main__ - Step 128103: {'lr': 2.6525369044018422e-05, 'samples': 24595776, 'steps': 128102, 'loss/train': 0.5382452607154846} 11/07/2021 15:15:47 - INFO - __main__ - Step 128104: {'lr': 2.6522990243414314e-05, 'samples': 24595968, 'steps': 128103, 'loss/train': 1.6806939840316772} 11/07/2021 15:15:47 - INFO - __main__ - Step 128105: {'lr': 2.652061154350552e-05, 'samples': 24596160, 'steps': 128104, 'loss/train': 1.1829501390457153} 11/07/2021 15:15:48 - INFO - __main__ - Step 128106: {'lr': 2.6518232944293093e-05, 'samples': 24596352, 'steps': 128105, 'loss/train': 1.7881264686584473} 11/07/2021 15:15:48 - INFO - __main__ - Step 128107: {'lr': 2.651585444577814e-05, 'samples': 24596544, 'steps': 128106, 'loss/train': 1.5380289554595947} 11/07/2021 15:15:48 - INFO - __main__ - Step 128108: {'lr': 2.6513476047961642e-05, 'samples': 24596736, 'steps': 128107, 'loss/train': 1.0058956146240234} 11/07/2021 15:15:50 - INFO - __main__ - Step 128109: {'lr': 2.6511097750844732e-05, 'samples': 24596928, 'steps': 128108, 'loss/train': 1.0776638984680176} 11/07/2021 15:15:50 - INFO - __main__ - Step 128110: {'lr': 2.650871955442849e-05, 'samples': 24597120, 'steps': 128109, 'loss/train': 1.3259481191635132} 11/07/2021 15:15:50 - INFO - __main__ - Step 128111: {'lr': 2.6506341458713945e-05, 'samples': 24597312, 'steps': 128110, 'loss/train': 0.9672123789787292} 11/07/2021 15:15:51 - INFO - __main__ - Step 128112: {'lr': 2.6503963463702208e-05, 'samples': 24597504, 'steps': 128111, 'loss/train': 1.380702018737793} 11/07/2021 15:15:51 - INFO - __main__ - Step 128113: {'lr': 2.650158556939433e-05, 'samples': 24597696, 'steps': 128112, 'loss/train': 0.7652121186256409} 11/07/2021 15:15:52 - INFO - __main__ - Step 128114: {'lr': 2.6499207775791372e-05, 'samples': 24597888, 'steps': 128113, 'loss/train': 1.2521772384643555} 11/07/2021 15:15:52 - INFO - __main__ - Step 128115: {'lr': 2.649683008289444e-05, 'samples': 24598080, 'steps': 128114, 'loss/train': 1.4225761890411377} 11/07/2021 15:15:53 - INFO - __main__ - Step 128116: {'lr': 2.649445249070459e-05, 'samples': 24598272, 'steps': 128115, 'loss/train': 1.1174288988113403} 11/07/2021 15:15:53 - INFO - __main__ - Step 128117: {'lr': 2.6492074999222876e-05, 'samples': 24598464, 'steps': 128116, 'loss/train': 1.290568470954895} 11/07/2021 15:15:54 - INFO - __main__ - Step 128118: {'lr': 2.6489697608450407e-05, 'samples': 24598656, 'steps': 128117, 'loss/train': 1.2747689485549927} 11/07/2021 15:15:54 - INFO - __main__ - Step 128119: {'lr': 2.6487320318388214e-05, 'samples': 24598848, 'steps': 128118, 'loss/train': 1.245387315750122} 11/07/2021 15:15:55 - INFO - __main__ - Step 128120: {'lr': 2.648494312903743e-05, 'samples': 24599040, 'steps': 128119, 'loss/train': 1.275948166847229} 11/07/2021 15:15:55 - INFO - __main__ - Step 128121: {'lr': 2.648256604039906e-05, 'samples': 24599232, 'steps': 128120, 'loss/train': 1.337843894958496} 11/07/2021 15:15:56 - INFO - __main__ - Step 128122: {'lr': 2.648018905247418e-05, 'samples': 24599424, 'steps': 128121, 'loss/train': 1.1577625274658203} 11/07/2021 15:15:56 - INFO - __main__ - Step 128123: {'lr': 2.6477812165263875e-05, 'samples': 24599616, 'steps': 128122, 'loss/train': 0.9812092781066895} 11/07/2021 15:15:56 - INFO - __main__ - Step 128124: {'lr': 2.6475435378769203e-05, 'samples': 24599808, 'steps': 128123, 'loss/train': 1.5621685981750488} 11/07/2021 15:15:57 - INFO - __main__ - Step 128125: {'lr': 2.647305869299127e-05, 'samples': 24600000, 'steps': 128124, 'loss/train': 1.1204274892807007} 11/07/2021 15:15:58 - INFO - __main__ - Step 128126: {'lr': 2.647068210793113e-05, 'samples': 24600192, 'steps': 128125, 'loss/train': 1.509216547012329} 11/07/2021 15:15:58 - INFO - __main__ - Step 128127: {'lr': 2.6468305623589846e-05, 'samples': 24600384, 'steps': 128126, 'loss/train': 1.0711225271224976} 11/07/2021 15:15:59 - INFO - __main__ - Step 128128: {'lr': 2.646592923996849e-05, 'samples': 24600576, 'steps': 128127, 'loss/train': 0.9391081929206848} 11/07/2021 15:15:59 - INFO - __main__ - Step 128129: {'lr': 2.646355295706815e-05, 'samples': 24600768, 'steps': 128128, 'loss/train': 1.0804637670516968} 11/07/2021 15:16:00 - INFO - __main__ - Step 128130: {'lr': 2.6461176774889878e-05, 'samples': 24600960, 'steps': 128129, 'loss/train': 1.5715711116790771} 11/07/2021 15:16:00 - INFO - __main__ - Step 128131: {'lr': 2.6458800693434786e-05, 'samples': 24601152, 'steps': 128130, 'loss/train': 0.48494935035705566} 11/07/2021 15:16:01 - INFO - __main__ - Step 128132: {'lr': 2.6456424712703875e-05, 'samples': 24601344, 'steps': 128131, 'loss/train': 1.1276569366455078} 11/07/2021 15:16:01 - INFO - __main__ - Step 128133: {'lr': 2.6454048832698225e-05, 'samples': 24601536, 'steps': 128132, 'loss/train': 1.2881814241409302} 11/07/2021 15:16:01 - INFO - __main__ - Step 128134: {'lr': 2.6451673053418972e-05, 'samples': 24601728, 'steps': 128133, 'loss/train': 1.4144190549850464} 11/07/2021 15:16:02 - INFO - __main__ - Step 128135: {'lr': 2.6449297374867122e-05, 'samples': 24601920, 'steps': 128134, 'loss/train': 1.4652680158615112} 11/07/2021 15:16:03 - INFO - __main__ - Step 128136: {'lr': 2.6446921797043777e-05, 'samples': 24602112, 'steps': 128135, 'loss/train': 1.3709053993225098} 11/07/2021 15:16:03 - INFO - __main__ - Step 128137: {'lr': 2.644454631994997e-05, 'samples': 24602304, 'steps': 128136, 'loss/train': 1.5267188549041748} 11/07/2021 15:16:03 - INFO - __main__ - Step 128138: {'lr': 2.6442170943586836e-05, 'samples': 24602496, 'steps': 128137, 'loss/train': 1.450080394744873} 11/07/2021 15:16:04 - INFO - __main__ - Step 128139: {'lr': 2.6439795667955403e-05, 'samples': 24602688, 'steps': 128138, 'loss/train': 0.9643221497535706} 11/07/2021 15:16:05 - INFO - __main__ - Step 128140: {'lr': 2.643742049305675e-05, 'samples': 24602880, 'steps': 128139, 'loss/train': 1.1771787405014038} 11/07/2021 15:16:05 - INFO - __main__ - Step 128141: {'lr': 2.643504541889194e-05, 'samples': 24603072, 'steps': 128140, 'loss/train': 1.2663066387176514} 11/07/2021 15:16:05 - INFO - __main__ - Step 128142: {'lr': 2.6432670445462077e-05, 'samples': 24603264, 'steps': 128141, 'loss/train': 1.3697714805603027} 11/07/2021 15:16:06 - INFO - __main__ - Step 128143: {'lr': 2.6430295572768188e-05, 'samples': 24603456, 'steps': 128142, 'loss/train': 1.1951510906219482} 11/07/2021 15:16:06 - INFO - __main__ - Step 128144: {'lr': 2.6427920800811328e-05, 'samples': 24603648, 'steps': 128143, 'loss/train': 1.4424437284469604} 11/07/2021 15:16:07 - INFO - __main__ - Step 128145: {'lr': 2.6425546129592608e-05, 'samples': 24603840, 'steps': 128144, 'loss/train': 1.1589598655700684} 11/07/2021 15:16:08 - INFO - __main__ - Step 128146: {'lr': 2.642317155911311e-05, 'samples': 24604032, 'steps': 128145, 'loss/train': 1.0724257230758667} 11/07/2021 15:16:08 - INFO - __main__ - Step 128147: {'lr': 2.6420797089373866e-05, 'samples': 24604224, 'steps': 128146, 'loss/train': 1.242382526397705} 11/07/2021 15:16:08 - INFO - __main__ - Step 128148: {'lr': 2.641842272037595e-05, 'samples': 24604416, 'steps': 128147, 'loss/train': 1.3960027694702148} 11/07/2021 15:16:09 - INFO - __main__ - Step 128149: {'lr': 2.641604845212045e-05, 'samples': 24604608, 'steps': 128148, 'loss/train': 0.9680596590042114} 11/07/2021 15:16:09 - INFO - __main__ - Step 128150: {'lr': 2.641367428460842e-05, 'samples': 24604800, 'steps': 128149, 'loss/train': 1.4272878170013428} 11/07/2021 15:16:11 - INFO - __main__ - Step 128151: {'lr': 2.6411300217840966e-05, 'samples': 24604992, 'steps': 128150, 'loss/train': 1.108747959136963} 11/07/2021 15:16:11 - INFO - __main__ - Step 128152: {'lr': 2.6408926251819092e-05, 'samples': 24605184, 'steps': 128151, 'loss/train': 1.2693231105804443} 11/07/2021 15:16:11 - INFO - __main__ - Step 128153: {'lr': 2.640655238654399e-05, 'samples': 24605376, 'steps': 128152, 'loss/train': 1.2952547073364258} 11/07/2021 15:16:12 - INFO - __main__ - Step 128154: {'lr': 2.6404178622016578e-05, 'samples': 24605568, 'steps': 128153, 'loss/train': 1.5164259672164917} 11/07/2021 15:16:12 - INFO - __main__ - Step 128155: {'lr': 2.640180495823799e-05, 'samples': 24605760, 'steps': 128154, 'loss/train': 1.6657508611679077} 11/07/2021 15:16:13 - INFO - __main__ - Step 128156: {'lr': 2.639943139520931e-05, 'samples': 24605952, 'steps': 128155, 'loss/train': 1.5184590816497803} 11/07/2021 15:16:14 - INFO - __main__ - Step 128157: {'lr': 2.639705793293157e-05, 'samples': 24606144, 'steps': 128156, 'loss/train': 1.305188775062561} 11/07/2021 15:16:14 - INFO - __main__ - Step 128158: {'lr': 2.6394684571405898e-05, 'samples': 24606336, 'steps': 128157, 'loss/train': 1.1803076267242432} 11/07/2021 15:16:14 - INFO - __main__ - Step 128159: {'lr': 2.63923113106333e-05, 'samples': 24606528, 'steps': 128158, 'loss/train': 1.4379254579544067} 11/07/2021 15:16:15 - INFO - __main__ - Step 128160: {'lr': 2.6389938150614913e-05, 'samples': 24606720, 'steps': 128159, 'loss/train': 1.1677820682525635} 11/07/2021 15:16:15 - INFO - __main__ - Step 128161: {'lr': 2.6387565091351733e-05, 'samples': 24606912, 'steps': 128160, 'loss/train': 1.4832615852355957} 11/07/2021 15:16:16 - INFO - __main__ - Step 128162: {'lr': 2.6385192132844877e-05, 'samples': 24607104, 'steps': 128161, 'loss/train': 1.233046531677246} 11/07/2021 15:16:16 - INFO - __main__ - Step 128163: {'lr': 2.638281927509542e-05, 'samples': 24607296, 'steps': 128162, 'loss/train': 1.3239046335220337} 11/07/2021 15:16:17 - INFO - __main__ - Step 128164: {'lr': 2.638044651810445e-05, 'samples': 24607488, 'steps': 128163, 'loss/train': 1.591776728630066} 11/07/2021 15:16:17 - INFO - __main__ - Step 128165: {'lr': 2.6378073861872938e-05, 'samples': 24607680, 'steps': 128164, 'loss/train': 1.2498657703399658} 11/07/2021 15:16:17 - INFO - __main__ - Step 128166: {'lr': 2.6375701306402044e-05, 'samples': 24607872, 'steps': 128165, 'loss/train': 1.2469005584716797} 11/07/2021 15:16:19 - INFO - __main__ - Step 128167: {'lr': 2.6373328851692774e-05, 'samples': 24608064, 'steps': 128166, 'loss/train': 1.4954509735107422} 11/07/2021 15:16:19 - INFO - __main__ - Step 128168: {'lr': 2.6370956497746262e-05, 'samples': 24608256, 'steps': 128167, 'loss/train': 1.2794179916381836} 11/07/2021 15:16:20 - INFO - __main__ - Step 128169: {'lr': 2.6368584244563538e-05, 'samples': 24608448, 'steps': 128168, 'loss/train': 0.9586179852485657} 11/07/2021 15:16:20 - INFO - __main__ - Step 128170: {'lr': 2.636621209214568e-05, 'samples': 24608640, 'steps': 128169, 'loss/train': 1.417333960533142} 11/07/2021 15:16:20 - INFO - __main__ - Step 128171: {'lr': 2.6363840040493748e-05, 'samples': 24608832, 'steps': 128170, 'loss/train': 1.0222034454345703} 11/07/2021 15:16:21 - INFO - __main__ - Step 128172: {'lr': 2.636146808960882e-05, 'samples': 24609024, 'steps': 128171, 'loss/train': 1.2052900791168213} 11/07/2021 15:16:22 - INFO - __main__ - Step 128173: {'lr': 2.6359096239491954e-05, 'samples': 24609216, 'steps': 128172, 'loss/train': 1.3890973329544067} 11/07/2021 15:16:22 - INFO - __main__ - Step 128174: {'lr': 2.6356724490144258e-05, 'samples': 24609408, 'steps': 128173, 'loss/train': 1.338384747505188} 11/07/2021 15:16:22 - INFO - __main__ - Step 128175: {'lr': 2.6354352841566788e-05, 'samples': 24609600, 'steps': 128174, 'loss/train': 1.8230044841766357} 11/07/2021 15:16:23 - INFO - __main__ - Step 128176: {'lr': 2.635198129376057e-05, 'samples': 24609792, 'steps': 128175, 'loss/train': 1.3803236484527588} 11/07/2021 15:16:24 - INFO - __main__ - Step 128177: {'lr': 2.6349609846726684e-05, 'samples': 24609984, 'steps': 128176, 'loss/train': 1.2481285333633423} 11/07/2021 15:16:24 - INFO - __main__ - Step 128178: {'lr': 2.634723850046622e-05, 'samples': 24610176, 'steps': 128177, 'loss/train': 1.027034044265747} 11/07/2021 15:16:24 - INFO - __main__ - Step 128179: {'lr': 2.6344867254980226e-05, 'samples': 24610368, 'steps': 128178, 'loss/train': 1.4072037935256958} 11/07/2021 15:16:25 - INFO - __main__ - Step 128180: {'lr': 2.6342496110269815e-05, 'samples': 24610560, 'steps': 128179, 'loss/train': 1.543603777885437} 11/07/2021 15:16:25 - INFO - __main__ - Step 128181: {'lr': 2.6340125066335985e-05, 'samples': 24610752, 'steps': 128180, 'loss/train': 1.2852632999420166} 11/07/2021 15:16:26 - INFO - __main__ - Step 128182: {'lr': 2.6337754123179876e-05, 'samples': 24610944, 'steps': 128181, 'loss/train': 1.141222357749939} 11/07/2021 15:16:27 - INFO - __main__ - Step 128183: {'lr': 2.633538328080251e-05, 'samples': 24611136, 'steps': 128182, 'loss/train': 1.2770580053329468} 11/07/2021 15:16:27 - INFO - __main__ - Step 128184: {'lr': 2.6333012539204948e-05, 'samples': 24611328, 'steps': 128183, 'loss/train': 1.4521229267120361} 11/07/2021 15:16:27 - INFO - __main__ - Step 128185: {'lr': 2.6330641898388298e-05, 'samples': 24611520, 'steps': 128184, 'loss/train': 1.1937377452850342} 11/07/2021 15:16:28 - INFO - __main__ - Step 128186: {'lr': 2.6328271358353613e-05, 'samples': 24611712, 'steps': 128185, 'loss/train': 0.5625494718551636} 11/07/2021 15:16:28 - INFO - __main__ - Step 128187: {'lr': 2.6325900919102e-05, 'samples': 24611904, 'steps': 128186, 'loss/train': 1.2184960842132568} 11/07/2021 15:16:29 - INFO - __main__ - Step 128188: {'lr': 2.6323530580634443e-05, 'samples': 24612096, 'steps': 128187, 'loss/train': 0.16984304785728455} 11/07/2021 15:16:30 - INFO - __main__ - Step 128189: {'lr': 2.6321160342952065e-05, 'samples': 24612288, 'steps': 128188, 'loss/train': 0.8362568020820618} 11/07/2021 15:16:30 - INFO - __main__ - Step 128190: {'lr': 2.6318790206055905e-05, 'samples': 24612480, 'steps': 128189, 'loss/train': 1.373253583908081} 11/07/2021 15:16:30 - INFO - __main__ - Step 128191: {'lr': 2.6316420169947036e-05, 'samples': 24612672, 'steps': 128190, 'loss/train': 1.6577867269515991} 11/07/2021 15:16:31 - INFO - __main__ - Step 128192: {'lr': 2.6314050234626547e-05, 'samples': 24612864, 'steps': 128191, 'loss/train': 1.5334842205047607} 11/07/2021 15:16:32 - INFO - __main__ - Step 128193: {'lr': 2.631168040009549e-05, 'samples': 24613056, 'steps': 128192, 'loss/train': 1.8160030841827393} 11/07/2021 15:16:32 - INFO - __main__ - Step 128194: {'lr': 2.6309310666354948e-05, 'samples': 24613248, 'steps': 128193, 'loss/train': 1.123180866241455} 11/07/2021 15:16:32 - INFO - __main__ - Step 128195: {'lr': 2.6306941033405972e-05, 'samples': 24613440, 'steps': 128194, 'loss/train': 1.416786789894104} 11/07/2021 15:16:33 - INFO - __main__ - Step 128196: {'lr': 2.6304571501249625e-05, 'samples': 24613632, 'steps': 128195, 'loss/train': 0.9033763408660889} 11/07/2021 15:16:33 - INFO - __main__ - Step 128197: {'lr': 2.630220206988701e-05, 'samples': 24613824, 'steps': 128196, 'loss/train': 1.2276484966278076} 11/07/2021 15:16:34 - INFO - __main__ - Step 128198: {'lr': 2.6299832739319158e-05, 'samples': 24614016, 'steps': 128197, 'loss/train': 0.8460161685943604} 11/07/2021 15:16:34 - INFO - __main__ - Step 128199: {'lr': 2.629746350954715e-05, 'samples': 24614208, 'steps': 128198, 'loss/train': 1.4859650135040283} 11/07/2021 15:16:35 - INFO - __main__ - Step 128200: {'lr': 2.6295094380572064e-05, 'samples': 24614400, 'steps': 128199, 'loss/train': 1.4736170768737793} 11/07/2021 15:16:35 - INFO - __main__ - Step 128201: {'lr': 2.629272535239499e-05, 'samples': 24614592, 'steps': 128200, 'loss/train': 1.6093132495880127} 11/07/2021 15:16:35 - INFO - __main__ - Step 128202: {'lr': 2.6290356425016926e-05, 'samples': 24614784, 'steps': 128201, 'loss/train': 0.6591059565544128} 11/07/2021 15:16:37 - INFO - __main__ - Step 128203: {'lr': 2.628798759843895e-05, 'samples': 24614976, 'steps': 128202, 'loss/train': 1.3489978313446045} 11/07/2021 15:16:37 - INFO - __main__ - Step 128204: {'lr': 2.6285618872662176e-05, 'samples': 24615168, 'steps': 128203, 'loss/train': 1.3677828311920166} 11/07/2021 15:16:37 - INFO - __main__ - Step 128205: {'lr': 2.6283250247687656e-05, 'samples': 24615360, 'steps': 128204, 'loss/train': 1.413901686668396} 11/07/2021 15:16:38 - INFO - __main__ - Step 128206: {'lr': 2.6280881723516447e-05, 'samples': 24615552, 'steps': 128205, 'loss/train': 1.2412914037704468} 11/07/2021 15:16:38 - INFO - __main__ - Step 128207: {'lr': 2.6278513300149603e-05, 'samples': 24615744, 'steps': 128206, 'loss/train': 1.2734947204589844} 11/07/2021 15:16:39 - INFO - __main__ - Step 128208: {'lr': 2.6276144977588234e-05, 'samples': 24615936, 'steps': 128207, 'loss/train': 1.333852767944336} 11/07/2021 15:16:39 - INFO - __main__ - Step 128209: {'lr': 2.6273776755833367e-05, 'samples': 24616128, 'steps': 128208, 'loss/train': 1.357498288154602} 11/07/2021 15:16:40 - INFO - __main__ - Step 128210: {'lr': 2.6271408634886084e-05, 'samples': 24616320, 'steps': 128209, 'loss/train': 1.0606729984283447} 11/07/2021 15:16:40 - INFO - __main__ - Step 128211: {'lr': 2.626904061474744e-05, 'samples': 24616512, 'steps': 128210, 'loss/train': 1.2849502563476562} 11/07/2021 15:16:40 - INFO - __main__ - Step 128212: {'lr': 2.626667269541852e-05, 'samples': 24616704, 'steps': 128211, 'loss/train': 1.2358722686767578} 11/07/2021 15:16:41 - INFO - __main__ - Step 128213: {'lr': 2.6264304876900403e-05, 'samples': 24616896, 'steps': 128212, 'loss/train': 1.1078591346740723} 11/07/2021 15:16:42 - INFO - __main__ - Step 128214: {'lr': 2.6261937159194172e-05, 'samples': 24617088, 'steps': 128213, 'loss/train': 1.3964183330535889} 11/07/2021 15:16:42 - INFO - __main__ - Step 128215: {'lr': 2.6259569542300827e-05, 'samples': 24617280, 'steps': 128214, 'loss/train': 1.5473262071609497} 11/07/2021 15:16:43 - INFO - __main__ - Step 128216: {'lr': 2.625720202622145e-05, 'samples': 24617472, 'steps': 128215, 'loss/train': 1.6526507139205933} 11/07/2021 15:16:43 - INFO - __main__ - Step 128217: {'lr': 2.6254834610957124e-05, 'samples': 24617664, 'steps': 128216, 'loss/train': 0.9335987567901611} 11/07/2021 15:16:43 - INFO - __main__ - Step 128218: {'lr': 2.625246729650893e-05, 'samples': 24617856, 'steps': 128217, 'loss/train': 1.5252490043640137} 11/07/2021 15:16:44 - INFO - __main__ - Step 128219: {'lr': 2.6250100082877926e-05, 'samples': 24618048, 'steps': 128218, 'loss/train': 1.1820510625839233} 11/07/2021 15:16:45 - INFO - __main__ - Step 128220: {'lr': 2.6247732970065137e-05, 'samples': 24618240, 'steps': 128219, 'loss/train': 0.960924506187439} 11/07/2021 15:16:45 - INFO - __main__ - Step 128221: {'lr': 2.6245365958071698e-05, 'samples': 24618432, 'steps': 128220, 'loss/train': 1.3147270679473877} 11/07/2021 15:16:45 - INFO - __main__ - Step 128222: {'lr': 2.6242999046898642e-05, 'samples': 24618624, 'steps': 128221, 'loss/train': 1.27881920337677} 11/07/2021 15:16:46 - INFO - __main__ - Step 128223: {'lr': 2.6240632236547047e-05, 'samples': 24618816, 'steps': 128222, 'loss/train': 1.742671012878418} 11/07/2021 15:16:47 - INFO - __main__ - Step 128224: {'lr': 2.623826552701794e-05, 'samples': 24619008, 'steps': 128223, 'loss/train': 1.4648200273513794} 11/07/2021 15:16:47 - INFO - __main__ - Step 128225: {'lr': 2.6235898918312435e-05, 'samples': 24619200, 'steps': 128224, 'loss/train': 1.1419235467910767} 11/07/2021 15:16:48 - INFO - __main__ - Step 128226: {'lr': 2.6233532410431583e-05, 'samples': 24619392, 'steps': 128225, 'loss/train': 1.656419277191162} 11/07/2021 15:16:48 - INFO - __main__ - Step 128227: {'lr': 2.6231166003376467e-05, 'samples': 24619584, 'steps': 128226, 'loss/train': 1.4676523208618164} 11/07/2021 15:16:48 - INFO - __main__ - Step 128228: {'lr': 2.622879969714817e-05, 'samples': 24619776, 'steps': 128227, 'loss/train': 1.902531623840332} 11/07/2021 15:16:49 - INFO - __main__ - Step 128229: {'lr': 2.6226433491747665e-05, 'samples': 24619968, 'steps': 128228, 'loss/train': 1.0761951208114624} 11/07/2021 15:16:50 - INFO - __main__ - Step 128230: {'lr': 2.6224067387176058e-05, 'samples': 24620160, 'steps': 128229, 'loss/train': 1.4333665370941162} 11/07/2021 15:16:50 - INFO - __main__ - Step 128231: {'lr': 2.6221701383434464e-05, 'samples': 24620352, 'steps': 128230, 'loss/train': 1.241032600402832} 11/07/2021 15:16:50 - INFO - __main__ - Step 128232: {'lr': 2.621933548052391e-05, 'samples': 24620544, 'steps': 128231, 'loss/train': 1.2963093519210815} 11/07/2021 15:16:51 - INFO - __main__ - Step 128233: {'lr': 2.6216969678445474e-05, 'samples': 24620736, 'steps': 128232, 'loss/train': 1.3766496181488037} 11/07/2021 15:16:51 - INFO - __main__ - Step 128234: {'lr': 2.6214603977200213e-05, 'samples': 24620928, 'steps': 128233, 'loss/train': 1.2856028079986572} 11/07/2021 15:16:52 - INFO - __main__ - Step 128235: {'lr': 2.6212238376789183e-05, 'samples': 24621120, 'steps': 128234, 'loss/train': 1.0362883806228638} 11/07/2021 15:16:52 - INFO - __main__ - Step 128236: {'lr': 2.620987287721349e-05, 'samples': 24621312, 'steps': 128235, 'loss/train': 1.2058515548706055} 11/07/2021 15:16:53 - INFO - __main__ - Step 128237: {'lr': 2.620750747847417e-05, 'samples': 24621504, 'steps': 128236, 'loss/train': 1.5431452989578247} 11/07/2021 15:16:53 - INFO - __main__ - Step 128238: {'lr': 2.6205142180572295e-05, 'samples': 24621696, 'steps': 128237, 'loss/train': 1.4231771230697632} 11/07/2021 15:16:53 - INFO - __main__ - Step 128239: {'lr': 2.6202776983508925e-05, 'samples': 24621888, 'steps': 128238, 'loss/train': 1.946244716644287} 11/07/2021 15:16:55 - INFO - __main__ - Step 128240: {'lr': 2.6200411887285112e-05, 'samples': 24622080, 'steps': 128239, 'loss/train': 1.85580575466156} 11/07/2021 15:16:55 - INFO - __main__ - Step 128241: {'lr': 2.6198046891901998e-05, 'samples': 24622272, 'steps': 128240, 'loss/train': 1.0973893404006958} 11/07/2021 15:16:55 - INFO - __main__ - Step 128242: {'lr': 2.6195681997360555e-05, 'samples': 24622464, 'steps': 128241, 'loss/train': 1.5377252101898193} 11/07/2021 15:16:56 - INFO - __main__ - Step 128243: {'lr': 2.6193317203661888e-05, 'samples': 24622656, 'steps': 128242, 'loss/train': 1.2511603832244873} 11/07/2021 15:16:56 - INFO - __main__ - Step 128244: {'lr': 2.6190952510807053e-05, 'samples': 24622848, 'steps': 128243, 'loss/train': 0.9774960279464722} 11/07/2021 15:16:57 - INFO - __main__ - Step 128245: {'lr': 2.618858791879711e-05, 'samples': 24623040, 'steps': 128244, 'loss/train': 0.9303826689720154} 11/07/2021 15:16:57 - INFO - __main__ - Step 128246: {'lr': 2.618622342763316e-05, 'samples': 24623232, 'steps': 128245, 'loss/train': 1.1687861680984497} 11/07/2021 15:16:58 - INFO - __main__ - Step 128247: {'lr': 2.618385903731621e-05, 'samples': 24623424, 'steps': 128246, 'loss/train': 1.440192461013794} 11/07/2021 15:16:58 - INFO - __main__ - Step 128248: {'lr': 2.6181494747847368e-05, 'samples': 24623616, 'steps': 128247, 'loss/train': 1.5326961278915405} 11/07/2021 15:16:58 - INFO - __main__ - Step 128249: {'lr': 2.6179130559227717e-05, 'samples': 24623808, 'steps': 128248, 'loss/train': 1.5192214250564575} 11/07/2021 15:17:00 - INFO - __main__ - Step 128250: {'lr': 2.617676647145828e-05, 'samples': 24624000, 'steps': 128249, 'loss/train': 1.2786082029342651} 11/07/2021 15:17:00 - INFO - __main__ - Step 128251: {'lr': 2.6174402484540143e-05, 'samples': 24624192, 'steps': 128250, 'loss/train': 1.6575164794921875} 11/07/2021 15:17:00 - INFO - __main__ - Step 128252: {'lr': 2.6172038598474334e-05, 'samples': 24624384, 'steps': 128251, 'loss/train': 0.0841832235455513} 11/07/2021 15:17:01 - INFO - __main__ - Step 128253: {'lr': 2.616967481326199e-05, 'samples': 24624576, 'steps': 128252, 'loss/train': 1.2043907642364502} 11/07/2021 15:17:01 - INFO - __main__ - Step 128254: {'lr': 2.6167311128904136e-05, 'samples': 24624768, 'steps': 128253, 'loss/train': 1.7641801834106445} 11/07/2021 15:17:02 - INFO - __main__ - Step 128255: {'lr': 2.6164947545401858e-05, 'samples': 24624960, 'steps': 128254, 'loss/train': 1.5442804098129272} 11/07/2021 15:17:02 - INFO - __main__ - Step 128256: {'lr': 2.616258406275618e-05, 'samples': 24625152, 'steps': 128255, 'loss/train': 1.2401783466339111} 11/07/2021 15:17:03 - INFO - __main__ - Step 128257: {'lr': 2.6160220680968156e-05, 'samples': 24625344, 'steps': 128256, 'loss/train': 1.2407070398330688} 11/07/2021 15:17:03 - INFO - __main__ - Step 128258: {'lr': 2.6157857400038927e-05, 'samples': 24625536, 'steps': 128257, 'loss/train': 1.3796778917312622} 11/07/2021 15:17:04 - INFO - __main__ - Step 128259: {'lr': 2.615549421996946e-05, 'samples': 24625728, 'steps': 128258, 'loss/train': 1.3250727653503418} 11/07/2021 15:17:04 - INFO - __main__ - Step 128260: {'lr': 2.6153131140760928e-05, 'samples': 24625920, 'steps': 128259, 'loss/train': 1.312160849571228} 11/07/2021 15:17:05 - INFO - __main__ - Step 128261: {'lr': 2.6150768162414295e-05, 'samples': 24626112, 'steps': 128260, 'loss/train': 1.3212499618530273} 11/07/2021 15:17:05 - INFO - __main__ - Step 128262: {'lr': 2.6148405284930705e-05, 'samples': 24626304, 'steps': 128261, 'loss/train': 1.764062523841858} 11/07/2021 15:17:06 - INFO - __main__ - Step 128263: {'lr': 2.614604250831118e-05, 'samples': 24626496, 'steps': 128262, 'loss/train': 1.2957689762115479} 11/07/2021 15:17:06 - INFO - __main__ - Step 128264: {'lr': 2.6143679832556776e-05, 'samples': 24626688, 'steps': 128263, 'loss/train': 1.0466880798339844} 11/07/2021 15:17:07 - INFO - __main__ - Step 128265: {'lr': 2.6141317257668578e-05, 'samples': 24626880, 'steps': 128264, 'loss/train': 0.8034894466400146} 11/07/2021 15:17:07 - INFO - __main__ - Step 128266: {'lr': 2.613895478364767e-05, 'samples': 24627072, 'steps': 128265, 'loss/train': 1.364260196685791} 11/07/2021 15:17:08 - INFO - __main__ - Step 128267: {'lr': 2.61365924104951e-05, 'samples': 24627264, 'steps': 128266, 'loss/train': 1.393225073814392} 11/07/2021 15:17:08 - INFO - __main__ - Step 128268: {'lr': 2.613423013821195e-05, 'samples': 24627456, 'steps': 128267, 'loss/train': 1.141231894493103} 11/07/2021 15:17:08 - INFO - __main__ - Step 128269: {'lr': 2.6131867966799228e-05, 'samples': 24627648, 'steps': 128268, 'loss/train': 1.564410924911499} 11/07/2021 15:17:09 - INFO - __main__ - Step 128270: {'lr': 2.612950589625801e-05, 'samples': 24627840, 'steps': 128269, 'loss/train': 1.174720048904419} 11/07/2021 15:17:10 - INFO - __main__ - Step 128271: {'lr': 2.612714392658941e-05, 'samples': 24628032, 'steps': 128270, 'loss/train': 1.3795400857925415} 11/07/2021 15:17:10 - INFO - __main__ - Step 128272: {'lr': 2.612478205779445e-05, 'samples': 24628224, 'steps': 128271, 'loss/train': 1.58494234085083} 11/07/2021 15:17:10 - INFO - __main__ - Step 128273: {'lr': 2.6122420289874214e-05, 'samples': 24628416, 'steps': 128272, 'loss/train': 1.1228022575378418} 11/07/2021 15:17:11 - INFO - __main__ - Step 128274: {'lr': 2.6120058622829763e-05, 'samples': 24628608, 'steps': 128273, 'loss/train': 1.3419426679611206} 11/07/2021 15:17:12 - INFO - __main__ - Step 128275: {'lr': 2.6117697056662144e-05, 'samples': 24628800, 'steps': 128274, 'loss/train': 0.4260622262954712} 11/07/2021 15:17:12 - INFO - __main__ - Step 128276: {'lr': 2.611533559137244e-05, 'samples': 24628992, 'steps': 128275, 'loss/train': 1.2901924848556519} 11/07/2021 15:17:13 - INFO - __main__ - Step 128277: {'lr': 2.611297422696171e-05, 'samples': 24629184, 'steps': 128276, 'loss/train': 1.579382061958313} 11/07/2021 15:17:13 - INFO - __main__ - Step 128278: {'lr': 2.6110612963431036e-05, 'samples': 24629376, 'steps': 128277, 'loss/train': 1.0905710458755493} 11/07/2021 15:17:13 - INFO - __main__ - Step 128279: {'lr': 2.610825180078144e-05, 'samples': 24629568, 'steps': 128278, 'loss/train': 1.2860418558120728} 11/07/2021 15:17:15 - INFO - __main__ - Step 128280: {'lr': 2.6105890739014037e-05, 'samples': 24629760, 'steps': 128279, 'loss/train': 1.369443416595459} 11/07/2021 15:17:15 - INFO - __main__ - Step 128281: {'lr': 2.6103529778129908e-05, 'samples': 24629952, 'steps': 128280, 'loss/train': 1.3268767595291138} 11/07/2021 15:17:15 - INFO - __main__ - Step 128282: {'lr': 2.6101168918130026e-05, 'samples': 24630144, 'steps': 128281, 'loss/train': 1.314370036125183} 11/07/2021 15:17:16 - INFO - __main__ - Step 128283: {'lr': 2.6098808159015498e-05, 'samples': 24630336, 'steps': 128282, 'loss/train': 0.927548885345459} 11/07/2021 15:17:16 - INFO - __main__ - Step 128284: {'lr': 2.6096447500787378e-05, 'samples': 24630528, 'steps': 128283, 'loss/train': 0.6665692329406738} 11/07/2021 15:17:16 - INFO - __main__ - Step 128285: {'lr': 2.6094086943446753e-05, 'samples': 24630720, 'steps': 128284, 'loss/train': 1.2831374406814575} 11/07/2021 15:17:18 - INFO - __main__ - Step 128286: {'lr': 2.609172648699468e-05, 'samples': 24630912, 'steps': 128285, 'loss/train': 1.529231309890747} 11/07/2021 15:17:18 - INFO - __main__ - Step 128287: {'lr': 2.608936613143223e-05, 'samples': 24631104, 'steps': 128286, 'loss/train': 1.1609857082366943} 11/07/2021 15:17:18 - INFO - __main__ - Step 128288: {'lr': 2.608700587676044e-05, 'samples': 24631296, 'steps': 128287, 'loss/train': 1.1807886362075806} 11/07/2021 15:17:19 - INFO - __main__ - Step 128289: {'lr': 2.608464572298039e-05, 'samples': 24631488, 'steps': 128288, 'loss/train': 1.335904836654663} 11/07/2021 15:17:19 - INFO - __main__ - Step 128290: {'lr': 2.608228567009316e-05, 'samples': 24631680, 'steps': 128289, 'loss/train': 1.4915999174118042} 11/07/2021 15:17:20 - INFO - __main__ - Step 128291: {'lr': 2.607992571809978e-05, 'samples': 24631872, 'steps': 128290, 'loss/train': 0.29828447103500366} 11/07/2021 15:17:21 - INFO - __main__ - Step 128292: {'lr': 2.607756586700133e-05, 'samples': 24632064, 'steps': 128291, 'loss/train': 1.0844244956970215} 11/07/2021 15:17:21 - INFO - __main__ - Step 128293: {'lr': 2.6075206116798868e-05, 'samples': 24632256, 'steps': 128292, 'loss/train': 1.1771420240402222} 11/07/2021 15:17:21 - INFO - __main__ - Step 128294: {'lr': 2.607284646749347e-05, 'samples': 24632448, 'steps': 128293, 'loss/train': 0.9906705617904663} 11/07/2021 15:17:22 - INFO - __main__ - Step 128295: {'lr': 2.6070486919086254e-05, 'samples': 24632640, 'steps': 128294, 'loss/train': 1.351825475692749} 11/07/2021 15:17:23 - INFO - __main__ - Step 128296: {'lr': 2.6068127471578162e-05, 'samples': 24632832, 'steps': 128295, 'loss/train': 1.2091493606567383} 11/07/2021 15:17:23 - INFO - __main__ - Step 128297: {'lr': 2.606576812497033e-05, 'samples': 24633024, 'steps': 128296, 'loss/train': 0.9608190655708313} 11/07/2021 15:17:23 - INFO - __main__ - Step 128298: {'lr': 2.6063408879263785e-05, 'samples': 24633216, 'steps': 128297, 'loss/train': 0.8007802963256836} 11/07/2021 15:17:24 - INFO - __main__ - Step 128299: {'lr': 2.6061049734459637e-05, 'samples': 24633408, 'steps': 128298, 'loss/train': 1.2130013704299927} 11/07/2021 15:17:24 - INFO - __main__ - Step 128300: {'lr': 2.6058690690558912e-05, 'samples': 24633600, 'steps': 128299, 'loss/train': 1.848563313484192} 11/07/2021 15:17:24 - INFO - __main__ - Step 128301: {'lr': 2.6056331747562668e-05, 'samples': 24633792, 'steps': 128300, 'loss/train': 1.284052848815918} 11/07/2021 15:17:25 - INFO - __main__ - Step 128302: {'lr': 2.6053972905472012e-05, 'samples': 24633984, 'steps': 128301, 'loss/train': 0.664337158203125} 11/07/2021 15:17:26 - INFO - __main__ - Step 128303: {'lr': 2.6051614164287947e-05, 'samples': 24634176, 'steps': 128302, 'loss/train': 0.9367918372154236} 11/07/2021 15:17:26 - INFO - __main__ - Step 128304: {'lr': 2.6049255524011605e-05, 'samples': 24634368, 'steps': 128303, 'loss/train': 0.9861352443695068} 11/07/2021 15:17:26 - INFO - __main__ - Step 128305: {'lr': 2.6046896984643992e-05, 'samples': 24634560, 'steps': 128304, 'loss/train': 0.740806519985199} 11/07/2021 15:17:27 - INFO - __main__ - Step 128306: {'lr': 2.6044538546186213e-05, 'samples': 24634752, 'steps': 128305, 'loss/train': 2.458740234375} 11/07/2021 15:17:28 - INFO - __main__ - Step 128307: {'lr': 2.604218020863927e-05, 'samples': 24634944, 'steps': 128306, 'loss/train': 1.0973539352416992} 11/07/2021 15:17:29 - INFO - __main__ - Step 128308: {'lr': 2.6039821972004356e-05, 'samples': 24635136, 'steps': 128307, 'loss/train': 1.2634618282318115} 11/07/2021 15:17:29 - INFO - __main__ - Step 128309: {'lr': 2.603746383628236e-05, 'samples': 24635328, 'steps': 128308, 'loss/train': 0.07420972734689713} 11/07/2021 15:17:29 - INFO - __main__ - Step 128310: {'lr': 2.6035105801474444e-05, 'samples': 24635520, 'steps': 128309, 'loss/train': 1.139809250831604} 11/07/2021 15:17:30 - INFO - __main__ - Step 128311: {'lr': 2.603274786758167e-05, 'samples': 24635712, 'steps': 128310, 'loss/train': 1.2102251052856445} 11/07/2021 15:17:31 - INFO - __main__ - Step 128312: {'lr': 2.6030390034605057e-05, 'samples': 24635904, 'steps': 128311, 'loss/train': 0.6463508605957031} 11/07/2021 15:17:31 - INFO - __main__ - Step 128313: {'lr': 2.602803230254569e-05, 'samples': 24636096, 'steps': 128312, 'loss/train': 1.3802920579910278} 11/07/2021 15:17:31 - INFO - __main__ - Step 128314: {'lr': 2.6025674671404653e-05, 'samples': 24636288, 'steps': 128313, 'loss/train': 1.4695308208465576} 11/07/2021 15:17:32 - INFO - __main__ - Step 128315: {'lr': 2.6023317141182972e-05, 'samples': 24636480, 'steps': 128314, 'loss/train': 1.1879781484603882} 11/07/2021 15:17:32 - INFO - __main__ - Step 128316: {'lr': 2.6020959711881758e-05, 'samples': 24636672, 'steps': 128315, 'loss/train': 1.2746798992156982} 11/07/2021 15:17:33 - INFO - __main__ - Step 128317: {'lr': 2.601860238350201e-05, 'samples': 24636864, 'steps': 128316, 'loss/train': 1.2287380695343018} 11/07/2021 15:17:33 - INFO - __main__ - Step 128318: {'lr': 2.6016245156044865e-05, 'samples': 24637056, 'steps': 128317, 'loss/train': 0.8800534605979919} 11/07/2021 15:17:34 - INFO - __main__ - Step 128319: {'lr': 2.6013888029511294e-05, 'samples': 24637248, 'steps': 128318, 'loss/train': 1.2590692043304443} 11/07/2021 15:17:34 - INFO - __main__ - Step 128320: {'lr': 2.6011531003902438e-05, 'samples': 24637440, 'steps': 128319, 'loss/train': 1.0043365955352783} 11/07/2021 15:17:34 - INFO - __main__ - Step 128321: {'lr': 2.6009174079219323e-05, 'samples': 24637632, 'steps': 128320, 'loss/train': 1.1690750122070312} 11/07/2021 15:17:36 - INFO - __main__ - Step 128322: {'lr': 2.6006817255463083e-05, 'samples': 24637824, 'steps': 128321, 'loss/train': 1.0805442333221436} 11/07/2021 15:17:36 - INFO - __main__ - Step 128323: {'lr': 2.6004460532634638e-05, 'samples': 24638016, 'steps': 128322, 'loss/train': 1.1951087713241577} 11/07/2021 15:17:36 - INFO - __main__ - Step 128324: {'lr': 2.6002103910735152e-05, 'samples': 24638208, 'steps': 128323, 'loss/train': 1.1451313495635986} 11/07/2021 15:17:37 - INFO - __main__ - Step 128325: {'lr': 2.5999747389765656e-05, 'samples': 24638400, 'steps': 128324, 'loss/train': 1.5887629985809326} 11/07/2021 15:17:37 - INFO - __main__ - Step 128326: {'lr': 2.5997390969727196e-05, 'samples': 24638592, 'steps': 128325, 'loss/train': 1.4066247940063477} 11/07/2021 15:17:37 - INFO - __main__ - Step 128327: {'lr': 2.599503465062089e-05, 'samples': 24638784, 'steps': 128326, 'loss/train': 1.4648507833480835} 11/07/2021 15:17:38 - INFO - __main__ - Step 128328: {'lr': 2.5992678432447737e-05, 'samples': 24638976, 'steps': 128327, 'loss/train': 1.0520501136779785} 11/07/2021 15:17:39 - INFO - __main__ - Step 128329: {'lr': 2.599032231520884e-05, 'samples': 24639168, 'steps': 128328, 'loss/train': 1.5103718042373657} 11/07/2021 15:17:39 - INFO - __main__ - Step 128330: {'lr': 2.5987966298905235e-05, 'samples': 24639360, 'steps': 128329, 'loss/train': 0.6184425950050354} 11/07/2021 15:17:39 - INFO - __main__ - Step 128331: {'lr': 2.5985610383538e-05, 'samples': 24639552, 'steps': 128330, 'loss/train': 1.5878729820251465} 11/07/2021 15:17:40 - INFO - __main__ - Step 128332: {'lr': 2.598325456910819e-05, 'samples': 24639744, 'steps': 128331, 'loss/train': 1.6299922466278076} 11/07/2021 15:17:41 - INFO - __main__ - Step 128333: {'lr': 2.5980898855616886e-05, 'samples': 24639936, 'steps': 128332, 'loss/train': 1.0882635116577148} 11/07/2021 15:17:41 - INFO - __main__ - Step 128334: {'lr': 2.5978543243065116e-05, 'samples': 24640128, 'steps': 128333, 'loss/train': 1.279426097869873} 11/07/2021 15:17:41 - INFO - __main__ - Step 128335: {'lr': 2.5976187731453992e-05, 'samples': 24640320, 'steps': 128334, 'loss/train': 1.7382076978683472} 11/07/2021 15:17:42 - INFO - __main__ - Step 128336: {'lr': 2.5973832320784512e-05, 'samples': 24640512, 'steps': 128335, 'loss/train': 1.0095021724700928} 11/07/2021 15:17:42 - INFO - __main__ - Step 128337: {'lr': 2.5971477011057785e-05, 'samples': 24640704, 'steps': 128336, 'loss/train': 1.4062057733535767} 11/07/2021 15:17:43 - INFO - __main__ - Step 128338: {'lr': 2.5969121802274814e-05, 'samples': 24640896, 'steps': 128337, 'loss/train': 1.631034255027771} 11/07/2021 15:17:44 - INFO - __main__ - Step 128339: {'lr': 2.5966766694436733e-05, 'samples': 24641088, 'steps': 128338, 'loss/train': 1.6780250072479248} 11/07/2021 15:17:44 - INFO - __main__ - Step 128340: {'lr': 2.5964411687544543e-05, 'samples': 24641280, 'steps': 128339, 'loss/train': 1.0774145126342773} 11/07/2021 15:17:44 - INFO - __main__ - Step 128341: {'lr': 2.5962056781599354e-05, 'samples': 24641472, 'steps': 128340, 'loss/train': 1.137571096420288} 11/07/2021 15:17:45 - INFO - __main__ - Step 128342: {'lr': 2.595970197660219e-05, 'samples': 24641664, 'steps': 128341, 'loss/train': 1.277723789215088} 11/07/2021 15:17:46 - INFO - __main__ - Step 128343: {'lr': 2.5957347272554137e-05, 'samples': 24641856, 'steps': 128342, 'loss/train': 1.5829801559448242} 11/07/2021 15:17:46 - INFO - __main__ - Step 128344: {'lr': 2.595499266945625e-05, 'samples': 24642048, 'steps': 128343, 'loss/train': 1.330430030822754} 11/07/2021 15:17:46 - INFO - __main__ - Step 128345: {'lr': 2.5952638167309556e-05, 'samples': 24642240, 'steps': 128344, 'loss/train': 1.2798720598220825} 11/07/2021 15:17:47 - INFO - __main__ - Step 128346: {'lr': 2.595028376611519e-05, 'samples': 24642432, 'steps': 128345, 'loss/train': 1.3121720552444458} 11/07/2021 15:17:47 - INFO - __main__ - Step 128347: {'lr': 2.5947929465874127e-05, 'samples': 24642624, 'steps': 128346, 'loss/train': 1.3914854526519775} 11/07/2021 15:17:48 - INFO - __main__ - Step 128348: {'lr': 2.5945575266587502e-05, 'samples': 24642816, 'steps': 128347, 'loss/train': 1.8106499910354614} 11/07/2021 15:17:49 - INFO - __main__ - Step 128349: {'lr': 2.594322116825637e-05, 'samples': 24643008, 'steps': 128348, 'loss/train': 1.41473388671875} 11/07/2021 15:17:49 - INFO - __main__ - Step 128350: {'lr': 2.5940867170881732e-05, 'samples': 24643200, 'steps': 128349, 'loss/train': 1.3364230394363403} 11/07/2021 15:17:49 - INFO - __main__ - Step 128351: {'lr': 2.593851327446467e-05, 'samples': 24643392, 'steps': 128350, 'loss/train': 0.9374962449073792} 11/07/2021 15:17:50 - INFO - __main__ - Step 128352: {'lr': 2.5936159479006265e-05, 'samples': 24643584, 'steps': 128351, 'loss/train': 0.6271359324455261} 11/07/2021 15:17:51 - INFO - __main__ - Step 128353: {'lr': 2.5933805784507576e-05, 'samples': 24643776, 'steps': 128352, 'loss/train': 0.9434432983398438} 11/07/2021 15:17:51 - INFO - __main__ - Step 128354: {'lr': 2.5931452190969622e-05, 'samples': 24643968, 'steps': 128353, 'loss/train': 1.0822733640670776} 11/07/2021 15:17:51 - INFO - __main__ - Step 128355: {'lr': 2.5929098698393522e-05, 'samples': 24644160, 'steps': 128354, 'loss/train': 1.199425220489502} 11/07/2021 15:17:52 - INFO - __main__ - Step 128356: {'lr': 2.5926745306780324e-05, 'samples': 24644352, 'steps': 128355, 'loss/train': 1.3032070398330688} 11/07/2021 15:17:52 - INFO - __main__ - Step 128357: {'lr': 2.5924392016131058e-05, 'samples': 24644544, 'steps': 128356, 'loss/train': 1.1230244636535645} 11/07/2021 15:17:53 - INFO - __main__ - Step 128358: {'lr': 2.592203882644681e-05, 'samples': 24644736, 'steps': 128357, 'loss/train': 1.4041070938110352} 11/07/2021 15:17:53 - INFO - __main__ - Step 128359: {'lr': 2.5919685737728655e-05, 'samples': 24644928, 'steps': 128358, 'loss/train': 1.1684353351593018} 11/07/2021 15:17:54 - INFO - __main__ - Step 128360: {'lr': 2.5917332749977596e-05, 'samples': 24645120, 'steps': 128359, 'loss/train': 1.6128230094909668} 11/07/2021 15:17:54 - INFO - __main__ - Step 128361: {'lr': 2.5914979863194743e-05, 'samples': 24645312, 'steps': 128360, 'loss/train': 0.6573364734649658} 11/07/2021 15:17:54 - INFO - __main__ - Step 128362: {'lr': 2.5912627077381207e-05, 'samples': 24645504, 'steps': 128361, 'loss/train': 1.1683616638183594} 11/07/2021 15:17:55 - INFO - __main__ - Step 128363: {'lr': 2.59102743925379e-05, 'samples': 24645696, 'steps': 128362, 'loss/train': 1.4151089191436768} 11/07/2021 15:17:56 - INFO - __main__ - Step 128364: {'lr': 2.5907921808665998e-05, 'samples': 24645888, 'steps': 128363, 'loss/train': 1.4925669431686401} 11/07/2021 15:17:56 - INFO - __main__ - Step 128365: {'lr': 2.5905569325766513e-05, 'samples': 24646080, 'steps': 128364, 'loss/train': 1.1789470911026} 11/07/2021 15:17:57 - INFO - __main__ - Step 128366: {'lr': 2.590321694384054e-05, 'samples': 24646272, 'steps': 128365, 'loss/train': 0.47134262323379517} 11/07/2021 15:17:57 - INFO - __main__ - Step 128367: {'lr': 2.5900864662889102e-05, 'samples': 24646464, 'steps': 128366, 'loss/train': 1.154547095298767} 11/07/2021 15:17:57 - INFO - __main__ - Step 128368: {'lr': 2.589851248291328e-05, 'samples': 24646656, 'steps': 128367, 'loss/train': 1.1800917387008667} 11/07/2021 15:17:58 - INFO - __main__ - Step 128369: {'lr': 2.5896160403914127e-05, 'samples': 24646848, 'steps': 128368, 'loss/train': 1.597321629524231} 11/07/2021 15:17:59 - INFO - __main__ - Step 128370: {'lr': 2.58938084258927e-05, 'samples': 24647040, 'steps': 128369, 'loss/train': 1.1206485033035278} 11/07/2021 15:17:59 - INFO - __main__ - Step 128371: {'lr': 2.5891456548850056e-05, 'samples': 24647232, 'steps': 128370, 'loss/train': 1.2605197429656982} 11/07/2021 15:17:59 - INFO - __main__ - Step 128372: {'lr': 2.5889104772787274e-05, 'samples': 24647424, 'steps': 128371, 'loss/train': 1.1919564008712769} 11/07/2021 15:18:00 - INFO - __main__ - Step 128373: {'lr': 2.5886753097705412e-05, 'samples': 24647616, 'steps': 128372, 'loss/train': 1.7778505086898804} 11/07/2021 15:18:01 - INFO - __main__ - Step 128374: {'lr': 2.588440152360552e-05, 'samples': 24647808, 'steps': 128373, 'loss/train': 1.9693489074707031} 11/07/2021 15:18:01 - INFO - __main__ - Step 128375: {'lr': 2.588205005048866e-05, 'samples': 24648000, 'steps': 128374, 'loss/train': 1.4204301834106445} 11/07/2021 15:18:02 - INFO - __main__ - Step 128376: {'lr': 2.5879698678355934e-05, 'samples': 24648192, 'steps': 128375, 'loss/train': 1.3746334314346313} 11/07/2021 15:18:02 - INFO - __main__ - Step 128377: {'lr': 2.5877347407208317e-05, 'samples': 24648384, 'steps': 128376, 'loss/train': 1.4254887104034424} 11/07/2021 15:18:02 - INFO - __main__ - Step 128378: {'lr': 2.5874996237046895e-05, 'samples': 24648576, 'steps': 128377, 'loss/train': 0.8912340402603149} 11/07/2021 15:18:03 - INFO - __main__ - Step 128379: {'lr': 2.5872645167872745e-05, 'samples': 24648768, 'steps': 128378, 'loss/train': 1.5744540691375732} 11/07/2021 15:18:04 - INFO - __main__ - Step 128380: {'lr': 2.5870294199686922e-05, 'samples': 24648960, 'steps': 128379, 'loss/train': 1.0516010522842407} 11/07/2021 15:18:04 - INFO - __main__ - Step 128381: {'lr': 2.5867943332490486e-05, 'samples': 24649152, 'steps': 128380, 'loss/train': 1.4555273056030273} 11/07/2021 15:18:05 - INFO - __main__ - Step 128382: {'lr': 2.5865592566284514e-05, 'samples': 24649344, 'steps': 128381, 'loss/train': 1.5046287775039673} 11/07/2021 15:18:05 - INFO - __main__ - Step 128383: {'lr': 2.5863241901070006e-05, 'samples': 24649536, 'steps': 128382, 'loss/train': 0.7802101373672485} 11/07/2021 15:18:06 - INFO - __main__ - Step 128384: {'lr': 2.5860891336848104e-05, 'samples': 24649728, 'steps': 128383, 'loss/train': 2.0414650440216064} 11/07/2021 15:18:06 - INFO - __main__ - Step 128385: {'lr': 2.5858540873619803e-05, 'samples': 24649920, 'steps': 128384, 'loss/train': 1.0615462064743042} 11/07/2021 15:18:07 - INFO - __main__ - Step 128386: {'lr': 2.5856190511386185e-05, 'samples': 24650112, 'steps': 128385, 'loss/train': 1.8462390899658203} 11/07/2021 15:18:07 - INFO - __main__ - Step 128387: {'lr': 2.5853840250148337e-05, 'samples': 24650304, 'steps': 128386, 'loss/train': 0.9607682824134827} 11/07/2021 15:18:07 - INFO - __main__ - Step 128388: {'lr': 2.5851490089907252e-05, 'samples': 24650496, 'steps': 128387, 'loss/train': 1.3220289945602417} 11/07/2021 15:18:08 - INFO - __main__ - Step 128389: {'lr': 2.5849140030664103e-05, 'samples': 24650688, 'steps': 128388, 'loss/train': 1.5598161220550537} 11/07/2021 15:18:09 - INFO - __main__ - Step 128390: {'lr': 2.58467900724198e-05, 'samples': 24650880, 'steps': 128389, 'loss/train': 1.787968635559082} 11/07/2021 15:18:09 - INFO - __main__ - Step 128391: {'lr': 2.5844440215175485e-05, 'samples': 24651072, 'steps': 128390, 'loss/train': 0.8775414824485779} 11/07/2021 15:18:09 - INFO - __main__ - Step 128392: {'lr': 2.584209045893221e-05, 'samples': 24651264, 'steps': 128391, 'loss/train': 1.0929925441741943} 11/07/2021 15:18:10 - INFO - __main__ - Step 128393: {'lr': 2.5839740803691032e-05, 'samples': 24651456, 'steps': 128392, 'loss/train': 1.6647719144821167} 11/07/2021 15:18:10 - INFO - __main__ - Step 128394: {'lr': 2.5837391249453002e-05, 'samples': 24651648, 'steps': 128393, 'loss/train': 1.1130715608596802} 11/07/2021 15:18:11 - INFO - __main__ - Step 128395: {'lr': 2.5835041796219178e-05, 'samples': 24651840, 'steps': 128394, 'loss/train': 1.1563400030136108} 11/07/2021 15:18:12 - INFO - __main__ - Step 128396: {'lr': 2.583269244399064e-05, 'samples': 24652032, 'steps': 128395, 'loss/train': 1.3375617265701294} 11/07/2021 15:18:12 - INFO - __main__ - Step 128397: {'lr': 2.583034319276842e-05, 'samples': 24652224, 'steps': 128396, 'loss/train': 1.1803580522537231} 11/07/2021 15:18:12 - INFO - __main__ - Step 128398: {'lr': 2.5827994042553595e-05, 'samples': 24652416, 'steps': 128397, 'loss/train': 1.2356940507888794} 11/07/2021 15:18:13 - INFO - __main__ - Step 128399: {'lr': 2.582564499334722e-05, 'samples': 24652608, 'steps': 128398, 'loss/train': 1.4431519508361816} 11/07/2021 15:18:14 - INFO - __main__ - Step 128400: {'lr': 2.5823296045150406e-05, 'samples': 24652800, 'steps': 128399, 'loss/train': 1.6294807195663452} 11/07/2021 15:18:14 - INFO - __main__ - Step 128401: {'lr': 2.58209471979641e-05, 'samples': 24652992, 'steps': 128400, 'loss/train': 3.535102367401123} 11/07/2021 15:18:14 - INFO - __main__ - Step 128402: {'lr': 2.581859845178941e-05, 'samples': 24653184, 'steps': 128401, 'loss/train': 1.5013353824615479} 11/07/2021 15:18:15 - INFO - __main__ - Step 128403: {'lr': 2.581624980662739e-05, 'samples': 24653376, 'steps': 128402, 'loss/train': 1.238033652305603} 11/07/2021 15:18:15 - INFO - __main__ - Step 128404: {'lr': 2.581390126247912e-05, 'samples': 24653568, 'steps': 128403, 'loss/train': 1.1589382886886597} 11/07/2021 15:18:15 - INFO - __main__ - Step 128405: {'lr': 2.5811552819345636e-05, 'samples': 24653760, 'steps': 128404, 'loss/train': 1.377196192741394} 11/07/2021 15:18:16 - INFO - __main__ - Step 128406: {'lr': 2.5809204477228037e-05, 'samples': 24653952, 'steps': 128405, 'loss/train': 1.4187451601028442} 11/07/2021 15:18:17 - INFO - __main__ - Step 128407: {'lr': 2.580685623612733e-05, 'samples': 24654144, 'steps': 128406, 'loss/train': 1.2806850671768188} 11/07/2021 15:18:17 - INFO - __main__ - Step 128408: {'lr': 2.5804508096044593e-05, 'samples': 24654336, 'steps': 128407, 'loss/train': 1.2125474214553833} 11/07/2021 15:18:17 - INFO - __main__ - Step 128409: {'lr': 2.5802160056980884e-05, 'samples': 24654528, 'steps': 128408, 'loss/train': 1.2173981666564941} 11/07/2021 15:18:18 - INFO - __main__ - Step 128410: {'lr': 2.5799812118937256e-05, 'samples': 24654720, 'steps': 128409, 'loss/train': 1.0995242595672607} 11/07/2021 15:18:20 - INFO - __main__ - Step 128411: {'lr': 2.5797464281914845e-05, 'samples': 24654912, 'steps': 128410, 'loss/train': 1.1965370178222656} 11/07/2021 15:18:20 - INFO - __main__ - Step 128412: {'lr': 2.579511654591457e-05, 'samples': 24655104, 'steps': 128411, 'loss/train': 1.7303069829940796} 11/07/2021 15:18:21 - INFO - __main__ - Step 128413: {'lr': 2.579276891093757e-05, 'samples': 24655296, 'steps': 128412, 'loss/train': 1.6767016649246216} 11/07/2021 15:18:21 - INFO - __main__ - Step 128414: {'lr': 2.5790421376984868e-05, 'samples': 24655488, 'steps': 128413, 'loss/train': 1.717566967010498} 11/07/2021 15:18:21 - INFO - __main__ - Step 128415: {'lr': 2.578807394405755e-05, 'samples': 24655680, 'steps': 128414, 'loss/train': 1.4008933305740356} 11/07/2021 15:18:22 - INFO - __main__ - Step 128416: {'lr': 2.5785726612156668e-05, 'samples': 24655872, 'steps': 128415, 'loss/train': 1.1121023893356323} 11/07/2021 15:18:22 - INFO - __main__ - Step 128417: {'lr': 2.578337938128328e-05, 'samples': 24656064, 'steps': 128416, 'loss/train': 0.4177101254463196} 11/07/2021 15:18:23 - INFO - __main__ - Step 128418: {'lr': 2.5781032251438434e-05, 'samples': 24656256, 'steps': 128417, 'loss/train': 1.4494422674179077} 11/07/2021 15:18:23 - INFO - __main__ - Step 128419: {'lr': 2.577868522262322e-05, 'samples': 24656448, 'steps': 128418, 'loss/train': 1.4764145612716675} 11/07/2021 15:18:24 - INFO - __main__ - Step 128420: {'lr': 2.5776338294838637e-05, 'samples': 24656640, 'steps': 128419, 'loss/train': 1.6431093215942383} 11/07/2021 15:18:24 - INFO - __main__ - Step 128421: {'lr': 2.577399146808579e-05, 'samples': 24656832, 'steps': 128420, 'loss/train': 1.1833178997039795} 11/07/2021 15:18:25 - INFO - __main__ - Step 128422: {'lr': 2.5771644742365763e-05, 'samples': 24657024, 'steps': 128421, 'loss/train': 1.0445796251296997} 11/07/2021 15:18:26 - INFO - __main__ - Step 128423: {'lr': 2.5769298117679556e-05, 'samples': 24657216, 'steps': 128422, 'loss/train': 1.1406320333480835} 11/07/2021 15:18:26 - INFO - __main__ - Step 128424: {'lr': 2.57669515940282e-05, 'samples': 24657408, 'steps': 128423, 'loss/train': 1.0131840705871582} 11/07/2021 15:18:26 - INFO - __main__ - Step 128425: {'lr': 2.5764605171412825e-05, 'samples': 24657600, 'steps': 128424, 'loss/train': 1.3880890607833862} 11/07/2021 15:18:27 - INFO - __main__ - Step 128426: {'lr': 2.5762258849834462e-05, 'samples': 24657792, 'steps': 128425, 'loss/train': 1.855252742767334} 11/07/2021 15:18:27 - INFO - __main__ - Step 128427: {'lr': 2.575991262929414e-05, 'samples': 24657984, 'steps': 128426, 'loss/train': 1.4711893796920776} 11/07/2021 15:18:28 - INFO - __main__ - Step 128428: {'lr': 2.575756650979294e-05, 'samples': 24658176, 'steps': 128427, 'loss/train': 1.5048422813415527} 11/07/2021 15:18:28 - INFO - __main__ - Step 128429: {'lr': 2.5755220491331942e-05, 'samples': 24658368, 'steps': 128428, 'loss/train': 1.3822370767593384} 11/07/2021 15:18:29 - INFO - __main__ - Step 128430: {'lr': 2.575287457391218e-05, 'samples': 24658560, 'steps': 128429, 'loss/train': 1.5142303705215454} 11/07/2021 15:18:29 - INFO - __main__ - Step 128431: {'lr': 2.5750528757534697e-05, 'samples': 24658752, 'steps': 128430, 'loss/train': 1.465702772140503} 11/07/2021 15:18:29 - INFO - __main__ - Step 128432: {'lr': 2.5748183042200586e-05, 'samples': 24658944, 'steps': 128431, 'loss/train': 1.1781716346740723} 11/07/2021 15:18:31 - INFO - __main__ - Step 128433: {'lr': 2.5745837427910923e-05, 'samples': 24659136, 'steps': 128432, 'loss/train': 1.3797905445098877} 11/07/2021 15:18:31 - INFO - __main__ - Step 128434: {'lr': 2.5743491914666655e-05, 'samples': 24659328, 'steps': 128433, 'loss/train': 1.4514580965042114} 11/07/2021 15:18:31 - INFO - __main__ - Step 128435: {'lr': 2.574114650246895e-05, 'samples': 24659520, 'steps': 128434, 'loss/train': 1.4142316579818726} 11/07/2021 15:18:32 - INFO - __main__ - Step 128436: {'lr': 2.57388011913188e-05, 'samples': 24659712, 'steps': 128435, 'loss/train': 1.1263270378112793} 11/07/2021 15:18:32 - INFO - __main__ - Step 128437: {'lr': 2.5736455981217268e-05, 'samples': 24659904, 'steps': 128436, 'loss/train': 1.5690596103668213} 11/07/2021 15:18:33 - INFO - __main__ - Step 128438: {'lr': 2.5734110872165457e-05, 'samples': 24660096, 'steps': 128437, 'loss/train': 1.4873143434524536} 11/07/2021 15:18:33 - INFO - __main__ - Step 128439: {'lr': 2.573176586416437e-05, 'samples': 24660288, 'steps': 128438, 'loss/train': 0.9357027411460876} 11/07/2021 15:18:34 - INFO - __main__ - Step 128440: {'lr': 2.5729420957215118e-05, 'samples': 24660480, 'steps': 128439, 'loss/train': 1.288875937461853} 11/07/2021 15:18:34 - INFO - __main__ - Step 128441: {'lr': 2.5727076151318723e-05, 'samples': 24660672, 'steps': 128440, 'loss/train': 1.1593711376190186} 11/07/2021 15:18:34 - INFO - __main__ - Step 128442: {'lr': 2.572473144647622e-05, 'samples': 24660864, 'steps': 128441, 'loss/train': 1.2768579721450806} 11/07/2021 15:18:36 - INFO - __main__ - Step 128443: {'lr': 2.572238684268871e-05, 'samples': 24661056, 'steps': 128442, 'loss/train': 1.7169106006622314} 11/07/2021 15:18:36 - INFO - __main__ - Step 128444: {'lr': 2.5720042339957283e-05, 'samples': 24661248, 'steps': 128443, 'loss/train': 1.514435887336731} 11/07/2021 15:18:36 - INFO - __main__ - Step 128445: {'lr': 2.5717697938282907e-05, 'samples': 24661440, 'steps': 128444, 'loss/train': 1.4146524667739868} 11/07/2021 15:18:37 - INFO - __main__ - Step 128446: {'lr': 2.571535363766664e-05, 'samples': 24661632, 'steps': 128445, 'loss/train': 1.166965365409851} 11/07/2021 15:18:37 - INFO - __main__ - Step 128447: {'lr': 2.5713009438109615e-05, 'samples': 24661824, 'steps': 128446, 'loss/train': 1.1404099464416504} 11/07/2021 15:18:38 - INFO - __main__ - Step 128448: {'lr': 2.571066533961283e-05, 'samples': 24662016, 'steps': 128447, 'loss/train': 0.9132813215255737} 11/07/2021 15:18:38 - INFO - __main__ - Step 128449: {'lr': 2.570832134217735e-05, 'samples': 24662208, 'steps': 128448, 'loss/train': 1.731771469116211} 11/07/2021 15:18:39 - INFO - __main__ - Step 128450: {'lr': 2.5705977445804246e-05, 'samples': 24662400, 'steps': 128449, 'loss/train': 1.3386456966400146} 11/07/2021 15:18:39 - INFO - __main__ - Step 128451: {'lr': 2.570363365049455e-05, 'samples': 24662592, 'steps': 128450, 'loss/train': 0.7540428638458252} 11/07/2021 15:18:39 - INFO - __main__ - Step 128452: {'lr': 2.5701289956249346e-05, 'samples': 24662784, 'steps': 128451, 'loss/train': 1.1416982412338257} 11/07/2021 15:18:40 - INFO - __main__ - Step 128453: {'lr': 2.5698946363069687e-05, 'samples': 24662976, 'steps': 128452, 'loss/train': 1.3164446353912354} 11/07/2021 15:18:41 - INFO - __main__ - Step 128454: {'lr': 2.5696602870956627e-05, 'samples': 24663168, 'steps': 128453, 'loss/train': 1.2784876823425293} 11/07/2021 15:18:41 - INFO - __main__ - Step 128455: {'lr': 2.569425947991122e-05, 'samples': 24663360, 'steps': 128454, 'loss/train': 1.0911693572998047} 11/07/2021 15:18:41 - INFO - __main__ - Step 128456: {'lr': 2.569191618993455e-05, 'samples': 24663552, 'steps': 128455, 'loss/train': 1.6712204217910767} 11/07/2021 15:18:42 - INFO - __main__ - Step 128457: {'lr': 2.5689573001027588e-05, 'samples': 24663744, 'steps': 128456, 'loss/train': 1.89030122756958} 11/07/2021 15:18:42 - INFO - __main__ - Step 128458: {'lr': 2.568722991319147e-05, 'samples': 24663936, 'steps': 128457, 'loss/train': 0.8194608092308044} 11/07/2021 15:18:43 - INFO - __main__ - Step 128459: {'lr': 2.5684886926427202e-05, 'samples': 24664128, 'steps': 128458, 'loss/train': 1.3654078245162964} 11/07/2021 15:18:44 - INFO - __main__ - Step 128460: {'lr': 2.568254404073586e-05, 'samples': 24664320, 'steps': 128459, 'loss/train': 1.5289602279663086} 11/07/2021 15:18:44 - INFO - __main__ - Step 128461: {'lr': 2.568020125611853e-05, 'samples': 24664512, 'steps': 128460, 'loss/train': 1.1859326362609863} 11/07/2021 15:18:44 - INFO - __main__ - Step 128462: {'lr': 2.5677858572576205e-05, 'samples': 24664704, 'steps': 128461, 'loss/train': 1.6587198972702026} 11/07/2021 15:18:45 - INFO - __main__ - Step 128463: {'lr': 2.5675515990110005e-05, 'samples': 24664896, 'steps': 128462, 'loss/train': 1.2781754732131958} 11/07/2021 15:18:46 - INFO - __main__ - Step 128464: {'lr': 2.5673173508720947e-05, 'samples': 24665088, 'steps': 128463, 'loss/train': 1.1260374784469604} 11/07/2021 15:18:46 - INFO - __main__ - Step 128465: {'lr': 2.5670831128410093e-05, 'samples': 24665280, 'steps': 128464, 'loss/train': 1.2700332403182983} 11/07/2021 15:18:47 - INFO - __main__ - Step 128466: {'lr': 2.5668488849178496e-05, 'samples': 24665472, 'steps': 128465, 'loss/train': 0.9634284377098083} 11/07/2021 15:18:47 - INFO - __main__ - Step 128467: {'lr': 2.5666146671027206e-05, 'samples': 24665664, 'steps': 128466, 'loss/train': 0.5957033634185791} 11/07/2021 15:18:47 - INFO - __main__ - Step 128468: {'lr': 2.566380459395731e-05, 'samples': 24665856, 'steps': 128467, 'loss/train': 1.414779543876648} 11/07/2021 15:18:48 - INFO - __main__ - Step 128469: {'lr': 2.566146261796984e-05, 'samples': 24666048, 'steps': 128468, 'loss/train': 1.5641489028930664} 11/07/2021 15:18:49 - INFO - __main__ - Step 128470: {'lr': 2.5659120743065895e-05, 'samples': 24666240, 'steps': 128469, 'loss/train': 1.0815858840942383} 11/07/2021 15:18:49 - INFO - __main__ - Step 128471: {'lr': 2.5656778969246426e-05, 'samples': 24666432, 'steps': 128470, 'loss/train': 1.0710185766220093} 11/07/2021 15:18:49 - INFO - __main__ - Step 128472: {'lr': 2.5654437296512568e-05, 'samples': 24666624, 'steps': 128471, 'loss/train': 1.3936201333999634} 11/07/2021 15:18:50 - INFO - __main__ - Step 128473: {'lr': 2.5652095724865376e-05, 'samples': 24666816, 'steps': 128472, 'loss/train': 0.9564065337181091} 11/07/2021 15:18:51 - INFO - __main__ - Step 128474: {'lr': 2.564975425430585e-05, 'samples': 24667008, 'steps': 128473, 'loss/train': 0.8051006197929382} 11/07/2021 15:18:51 - INFO - __main__ - Step 128475: {'lr': 2.5647412884835103e-05, 'samples': 24667200, 'steps': 128474, 'loss/train': 1.3921912908554077} 11/07/2021 15:18:52 - INFO - __main__ - Step 128476: {'lr': 2.5645071616454157e-05, 'samples': 24667392, 'steps': 128475, 'loss/train': 1.6909555196762085} 11/07/2021 15:18:52 - INFO - __main__ - Step 128477: {'lr': 2.56427304491641e-05, 'samples': 24667584, 'steps': 128476, 'loss/train': 1.2837587594985962} 11/07/2021 15:18:52 - INFO - __main__ - Step 128478: {'lr': 2.564038938296595e-05, 'samples': 24667776, 'steps': 128477, 'loss/train': 1.400307059288025} 11/07/2021 15:18:54 - INFO - __main__ - Step 128479: {'lr': 2.56380484178608e-05, 'samples': 24667968, 'steps': 128478, 'loss/train': 1.2437084913253784} 11/07/2021 15:18:54 - INFO - __main__ - Step 128480: {'lr': 2.563570755384967e-05, 'samples': 24668160, 'steps': 128479, 'loss/train': 1.2363877296447754} 11/07/2021 15:18:55 - INFO - __main__ - Step 128481: {'lr': 2.5633366790933614e-05, 'samples': 24668352, 'steps': 128480, 'loss/train': 1.235900640487671} 11/07/2021 15:18:55 - INFO - __main__ - Step 128482: {'lr': 2.563102612911372e-05, 'samples': 24668544, 'steps': 128481, 'loss/train': 0.9465747475624084} 11/07/2021 15:18:56 - INFO - __main__ - Step 128483: {'lr': 2.5628685568391068e-05, 'samples': 24668736, 'steps': 128482, 'loss/train': 1.152992606163025} 11/07/2021 15:18:56 - INFO - __main__ - Step 128484: {'lr': 2.56263451087666e-05, 'samples': 24668928, 'steps': 128483, 'loss/train': 0.16918137669563293} 11/07/2021 15:18:56 - INFO - __main__ - Step 128485: {'lr': 2.5624004750241457e-05, 'samples': 24669120, 'steps': 128484, 'loss/train': 0.2078322172164917} 11/07/2021 15:18:57 - INFO - __main__ - Step 128486: {'lr': 2.562166449281669e-05, 'samples': 24669312, 'steps': 128485, 'loss/train': 0.9643492102622986} 11/07/2021 15:18:58 - INFO - __main__ - Step 128487: {'lr': 2.5619324336493304e-05, 'samples': 24669504, 'steps': 128486, 'loss/train': 0.5387812852859497} 11/07/2021 15:18:58 - INFO - __main__ - Step 128488: {'lr': 2.5616984281272433e-05, 'samples': 24669696, 'steps': 128487, 'loss/train': 1.538875699043274} 11/07/2021 15:18:58 - INFO - __main__ - Step 128489: {'lr': 2.5614644327155045e-05, 'samples': 24669888, 'steps': 128488, 'loss/train': 1.3706581592559814} 11/07/2021 15:18:59 - INFO - __main__ - Step 128490: {'lr': 2.5612304474142257e-05, 'samples': 24670080, 'steps': 128489, 'loss/train': 0.7613466382026672} 11/07/2021 15:19:00 - INFO - __main__ - Step 128491: {'lr': 2.560996472223509e-05, 'samples': 24670272, 'steps': 128490, 'loss/train': 1.277868390083313} 11/07/2021 15:19:00 - INFO - __main__ - Step 128492: {'lr': 2.5607625071434605e-05, 'samples': 24670464, 'steps': 128491, 'loss/train': 1.1526902914047241} 11/07/2021 15:19:00 - INFO - __main__ - Step 128493: {'lr': 2.560528552174188e-05, 'samples': 24670656, 'steps': 128492, 'loss/train': 1.206063151359558} 11/07/2021 15:19:01 - INFO - __main__ - Step 128494: {'lr': 2.560294607315794e-05, 'samples': 24670848, 'steps': 128493, 'loss/train': 1.2932065725326538} 11/07/2021 15:19:01 - INFO - __main__ - Step 128495: {'lr': 2.5600606725683846e-05, 'samples': 24671040, 'steps': 128494, 'loss/train': 1.2720999717712402} 11/07/2021 15:19:02 - INFO - __main__ - Step 128496: {'lr': 2.5598267479320676e-05, 'samples': 24671232, 'steps': 128495, 'loss/train': 1.427635908126831} 11/07/2021 15:19:02 - INFO - __main__ - Step 128497: {'lr': 2.5595928334069486e-05, 'samples': 24671424, 'steps': 128496, 'loss/train': 1.4553638696670532} 11/07/2021 15:19:03 - INFO - __main__ - Step 128498: {'lr': 2.5593589289931275e-05, 'samples': 24671616, 'steps': 128497, 'loss/train': 1.4238512516021729} 11/07/2021 15:19:03 - INFO - __main__ - Step 128499: {'lr': 2.5591250346907124e-05, 'samples': 24671808, 'steps': 128498, 'loss/train': 1.5050517320632935} 11/07/2021 15:19:04 - INFO - __main__ - Step 128500: {'lr': 2.5588911504998118e-05, 'samples': 24672000, 'steps': 128499, 'loss/train': 0.7656267285346985} 11/07/2021 15:19:05 - INFO - __main__ - Step 128501: {'lr': 2.5586572764205258e-05, 'samples': 24672192, 'steps': 128500, 'loss/train': 1.6910483837127686} 11/07/2021 15:19:05 - INFO - __main__ - Step 128502: {'lr': 2.558423412452962e-05, 'samples': 24672384, 'steps': 128501, 'loss/train': 0.08327841758728027} 11/07/2021 15:19:06 - INFO - __main__ - Step 128503: {'lr': 2.5581895585972293e-05, 'samples': 24672576, 'steps': 128502, 'loss/train': 1.557999610900879} 11/07/2021 15:19:06 - INFO - __main__ - Step 128504: {'lr': 2.5579557148534272e-05, 'samples': 24672768, 'steps': 128503, 'loss/train': 0.9526381492614746} 11/07/2021 15:19:06 - INFO - __main__ - Step 128505: {'lr': 2.557721881221664e-05, 'samples': 24672960, 'steps': 128504, 'loss/train': 1.169966459274292} 11/07/2021 15:19:07 - INFO - __main__ - Step 128506: {'lr': 2.557488057702048e-05, 'samples': 24673152, 'steps': 128505, 'loss/train': 1.4771909713745117} 11/07/2021 15:19:08 - INFO - __main__ - Step 128507: {'lr': 2.5572542442946794e-05, 'samples': 24673344, 'steps': 128506, 'loss/train': 1.3833597898483276} 11/07/2021 15:19:08 - INFO - __main__ - Step 128508: {'lr': 2.557020440999666e-05, 'samples': 24673536, 'steps': 128507, 'loss/train': 1.3178220987319946} 11/07/2021 15:19:09 - INFO - __main__ - Step 128509: {'lr': 2.556786647817111e-05, 'samples': 24673728, 'steps': 128508, 'loss/train': 2.7455105781555176} 11/07/2021 15:19:09 - INFO - __main__ - Step 128510: {'lr': 2.5565528647471274e-05, 'samples': 24673920, 'steps': 128509, 'loss/train': 1.1115306615829468} 11/07/2021 15:19:10 - INFO - __main__ - Step 128511: {'lr': 2.556319091789813e-05, 'samples': 24674112, 'steps': 128510, 'loss/train': 1.2222976684570312} 11/07/2021 15:19:10 - INFO - __main__ - Step 128512: {'lr': 2.5560853289452706e-05, 'samples': 24674304, 'steps': 128511, 'loss/train': 1.2197972536087036} 11/07/2021 15:19:11 - INFO - __main__ - Step 128513: {'lr': 2.5558515762136136e-05, 'samples': 24674496, 'steps': 128512, 'loss/train': 1.3905361890792847} 11/07/2021 15:19:11 - INFO - __main__ - Step 128514: {'lr': 2.555617833594942e-05, 'samples': 24674688, 'steps': 128513, 'loss/train': 1.2754846811294556} 11/07/2021 15:19:11 - INFO - __main__ - Step 128515: {'lr': 2.5553841010893614e-05, 'samples': 24674880, 'steps': 128514, 'loss/train': 3.2307541370391846} 11/07/2021 15:19:12 - INFO - __main__ - Step 128516: {'lr': 2.55515037869698e-05, 'samples': 24675072, 'steps': 128515, 'loss/train': 0.9557328820228577} 11/07/2021 15:19:13 - INFO - __main__ - Step 128517: {'lr': 2.5549166664179e-05, 'samples': 24675264, 'steps': 128516, 'loss/train': 1.2287302017211914} 11/07/2021 15:19:13 - INFO - __main__ - Step 128518: {'lr': 2.5546829642522307e-05, 'samples': 24675456, 'steps': 128517, 'loss/train': 1.2102572917938232} 11/07/2021 15:19:13 - INFO - __main__ - Step 128519: {'lr': 2.554449272200071e-05, 'samples': 24675648, 'steps': 128518, 'loss/train': 1.2166892290115356} 11/07/2021 15:19:14 - INFO - __main__ - Step 128520: {'lr': 2.5542155902615328e-05, 'samples': 24675840, 'steps': 128519, 'loss/train': 1.4361486434936523} 11/07/2021 15:19:14 - INFO - __main__ - Step 128521: {'lr': 2.553981918436718e-05, 'samples': 24676032, 'steps': 128520, 'loss/train': 1.6080317497253418} 11/07/2021 15:19:15 - INFO - __main__ - Step 128522: {'lr': 2.5537482567257353e-05, 'samples': 24676224, 'steps': 128521, 'loss/train': 1.1193057298660278} 11/07/2021 15:19:15 - INFO - __main__ - Step 128523: {'lr': 2.55351460512869e-05, 'samples': 24676416, 'steps': 128522, 'loss/train': 1.1803518533706665} 11/07/2021 15:19:16 - INFO - __main__ - Step 128524: {'lr': 2.5532809636456794e-05, 'samples': 24676608, 'steps': 128523, 'loss/train': 1.3499023914337158} 11/07/2021 15:19:16 - INFO - __main__ - Step 128525: {'lr': 2.5530473322768144e-05, 'samples': 24676800, 'steps': 128524, 'loss/train': 1.498783826828003} 11/07/2021 15:19:16 - INFO - __main__ - Step 128526: {'lr': 2.5528137110221977e-05, 'samples': 24676992, 'steps': 128525, 'loss/train': 0.8943772315979004} 11/07/2021 15:19:18 - INFO - __main__ - Step 128527: {'lr': 2.5525800998819404e-05, 'samples': 24677184, 'steps': 128526, 'loss/train': 1.3912140130996704} 11/07/2021 15:19:18 - INFO - __main__ - Step 128528: {'lr': 2.5523464988561425e-05, 'samples': 24677376, 'steps': 128527, 'loss/train': 0.9083548784255981} 11/07/2021 15:19:18 - INFO - __main__ - Step 128529: {'lr': 2.552112907944909e-05, 'samples': 24677568, 'steps': 128528, 'loss/train': 1.0651928186416626} 11/07/2021 15:19:19 - INFO - __main__ - Step 128530: {'lr': 2.5518793271483487e-05, 'samples': 24677760, 'steps': 128529, 'loss/train': 1.2301549911499023} 11/07/2021 15:19:19 - INFO - __main__ - Step 128531: {'lr': 2.551645756466567e-05, 'samples': 24677952, 'steps': 128530, 'loss/train': 1.4414969682693481} 11/07/2021 15:19:20 - INFO - __main__ - Step 128532: {'lr': 2.551412195899666e-05, 'samples': 24678144, 'steps': 128531, 'loss/train': 1.6333123445510864} 11/07/2021 15:19:21 - INFO - __main__ - Step 128533: {'lr': 2.551178645447752e-05, 'samples': 24678336, 'steps': 128532, 'loss/train': 1.441733479499817} 11/07/2021 15:19:21 - INFO - __main__ - Step 128534: {'lr': 2.55094510511093e-05, 'samples': 24678528, 'steps': 128533, 'loss/train': 2.5015506744384766} 11/07/2021 15:19:21 - INFO - __main__ - Step 128535: {'lr': 2.5507115748893057e-05, 'samples': 24678720, 'steps': 128534, 'loss/train': 1.474268913269043} 11/07/2021 15:19:22 - INFO - __main__ - Step 128536: {'lr': 2.5504780547829843e-05, 'samples': 24678912, 'steps': 128535, 'loss/train': 1.5494753122329712} 11/07/2021 15:19:22 - INFO - __main__ - Step 128537: {'lr': 2.5502445447920768e-05, 'samples': 24679104, 'steps': 128536, 'loss/train': 1.4316102266311646} 11/07/2021 15:19:23 - INFO - __main__ - Step 128538: {'lr': 2.550011044916678e-05, 'samples': 24679296, 'steps': 128537, 'loss/train': 1.4123479127883911} 11/07/2021 15:19:23 - INFO - __main__ - Step 128539: {'lr': 2.549777555156896e-05, 'samples': 24679488, 'steps': 128538, 'loss/train': 1.2469085454940796} 11/07/2021 15:19:24 - INFO - __main__ - Step 128540: {'lr': 2.5495440755128384e-05, 'samples': 24679680, 'steps': 128539, 'loss/train': 1.5748106241226196} 11/07/2021 15:19:24 - INFO - __main__ - Step 128541: {'lr': 2.5493106059846115e-05, 'samples': 24679872, 'steps': 128540, 'loss/train': 1.5665080547332764} 11/07/2021 15:19:24 - INFO - __main__ - Step 128542: {'lr': 2.5490771465723177e-05, 'samples': 24680064, 'steps': 128541, 'loss/train': 0.10927478969097137} 11/07/2021 15:19:26 - INFO - __main__ - Step 128543: {'lr': 2.5488436972760626e-05, 'samples': 24680256, 'steps': 128542, 'loss/train': 1.1611140966415405} 11/07/2021 15:19:26 - INFO - __main__ - Step 128544: {'lr': 2.548610258095954e-05, 'samples': 24680448, 'steps': 128543, 'loss/train': 1.1177163124084473} 11/07/2021 15:19:26 - INFO - __main__ - Step 128545: {'lr': 2.5483768290320923e-05, 'samples': 24680640, 'steps': 128544, 'loss/train': 1.1398720741271973} 11/07/2021 15:19:27 - INFO - __main__ - Step 128546: {'lr': 2.5481434100845885e-05, 'samples': 24680832, 'steps': 128545, 'loss/train': 0.8344020247459412} 11/07/2021 15:19:27 - INFO - __main__ - Step 128547: {'lr': 2.547910001253545e-05, 'samples': 24681024, 'steps': 128546, 'loss/train': 1.0060211420059204} 11/07/2021 15:19:28 - INFO - __main__ - Step 128548: {'lr': 2.5476766025390646e-05, 'samples': 24681216, 'steps': 128547, 'loss/train': 0.763846755027771} 11/07/2021 15:19:28 - INFO - __main__ - Step 128549: {'lr': 2.5474432139412555e-05, 'samples': 24681408, 'steps': 128548, 'loss/train': 1.107670783996582} 11/07/2021 15:19:29 - INFO - __main__ - Step 128550: {'lr': 2.5472098354602263e-05, 'samples': 24681600, 'steps': 128549, 'loss/train': 1.6126365661621094} 11/07/2021 15:19:29 - INFO - __main__ - Step 128551: {'lr': 2.5469764670960737e-05, 'samples': 24681792, 'steps': 128550, 'loss/train': 0.9994946122169495} 11/07/2021 15:19:29 - INFO - __main__ - Step 128552: {'lr': 2.5467431088489062e-05, 'samples': 24681984, 'steps': 128551, 'loss/train': 1.1547369956970215} 11/07/2021 15:19:31 - INFO - __main__ - Step 128553: {'lr': 2.546509760718832e-05, 'samples': 24682176, 'steps': 128552, 'loss/train': 1.0808895826339722} 11/07/2021 15:19:31 - INFO - __main__ - Step 128554: {'lr': 2.546276422705951e-05, 'samples': 24682368, 'steps': 128553, 'loss/train': 1.6081377267837524} 11/07/2021 15:19:31 - INFO - __main__ - Step 128555: {'lr': 2.546043094810374e-05, 'samples': 24682560, 'steps': 128554, 'loss/train': 1.511724591255188} 11/07/2021 15:19:32 - INFO - __main__ - Step 128556: {'lr': 2.5458097770322013e-05, 'samples': 24682752, 'steps': 128555, 'loss/train': 1.0710439682006836} 11/07/2021 15:19:32 - INFO - __main__ - Step 128557: {'lr': 2.5455764693715413e-05, 'samples': 24682944, 'steps': 128556, 'loss/train': 1.049066424369812} 11/07/2021 15:19:33 - INFO - __main__ - Step 128558: {'lr': 2.5453431718284987e-05, 'samples': 24683136, 'steps': 128557, 'loss/train': 1.1971375942230225} 11/07/2021 15:19:33 - INFO - __main__ - Step 128559: {'lr': 2.5451098844031766e-05, 'samples': 24683328, 'steps': 128558, 'loss/train': 1.1144819259643555} 11/07/2021 15:19:34 - INFO - __main__ - Step 128560: {'lr': 2.5448766070956837e-05, 'samples': 24683520, 'steps': 128559, 'loss/train': 1.5507307052612305} 11/07/2021 15:19:34 - INFO - __main__ - Step 128561: {'lr': 2.544643339906119e-05, 'samples': 24683712, 'steps': 128560, 'loss/train': 1.1060824394226074} 11/07/2021 15:19:34 - INFO - __main__ - Step 128562: {'lr': 2.5444100828345946e-05, 'samples': 24683904, 'steps': 128561, 'loss/train': 0.8911561369895935} 11/07/2021 15:19:35 - INFO - __main__ - Step 128563: {'lr': 2.544176835881212e-05, 'samples': 24684096, 'steps': 128562, 'loss/train': 1.3905680179595947} 11/07/2021 15:19:36 - INFO - __main__ - Step 128564: {'lr': 2.5439435990460807e-05, 'samples': 24684288, 'steps': 128563, 'loss/train': 0.6222538352012634} 11/07/2021 15:19:36 - INFO - __main__ - Step 128565: {'lr': 2.5437103723293e-05, 'samples': 24684480, 'steps': 128564, 'loss/train': 1.3391934633255005} 11/07/2021 15:19:36 - INFO - __main__ - Step 128566: {'lr': 2.5434771557309722e-05, 'samples': 24684672, 'steps': 128565, 'loss/train': 1.6767398118972778} 11/07/2021 15:19:37 - INFO - __main__ - Step 128567: {'lr': 2.5432439492512116e-05, 'samples': 24684864, 'steps': 128566, 'loss/train': 0.9870095252990723} 11/07/2021 15:19:37 - INFO - __main__ - Step 128568: {'lr': 2.5430107528901153e-05, 'samples': 24685056, 'steps': 128567, 'loss/train': 1.623401165008545} 11/07/2021 15:19:38 - INFO - __main__ - Step 128569: {'lr': 2.5427775666477943e-05, 'samples': 24685248, 'steps': 128568, 'loss/train': 1.1846750974655151} 11/07/2021 15:19:39 - INFO - __main__ - Step 128570: {'lr': 2.5425443905243484e-05, 'samples': 24685440, 'steps': 128569, 'loss/train': 1.1841305494308472} 11/07/2021 15:19:39 - INFO - __main__ - Step 128571: {'lr': 2.5423112245198888e-05, 'samples': 24685632, 'steps': 128570, 'loss/train': 1.1915183067321777} 11/07/2021 15:19:39 - INFO - __main__ - Step 128572: {'lr': 2.5420780686345153e-05, 'samples': 24685824, 'steps': 128571, 'loss/train': 1.2693004608154297} 11/07/2021 15:19:40 - INFO - __main__ - Step 128573: {'lr': 2.541844922868336e-05, 'samples': 24686016, 'steps': 128572, 'loss/train': 1.1044342517852783} 11/07/2021 15:19:41 - INFO - __main__ - Step 128574: {'lr': 2.541611787221454e-05, 'samples': 24686208, 'steps': 128573, 'loss/train': 1.9701268672943115} 11/07/2021 15:19:41 - INFO - __main__ - Step 128575: {'lr': 2.5413786616939743e-05, 'samples': 24686400, 'steps': 128574, 'loss/train': 1.38288414478302} 11/07/2021 15:19:41 - INFO - __main__ - Step 128576: {'lr': 2.5411455462860028e-05, 'samples': 24686592, 'steps': 128575, 'loss/train': 0.835101842880249} 11/07/2021 15:19:42 - INFO - __main__ - Step 128577: {'lr': 2.5409124409976502e-05, 'samples': 24686784, 'steps': 128576, 'loss/train': 0.8021990656852722} 11/07/2021 15:19:42 - INFO - __main__ - Step 128578: {'lr': 2.5406793458290113e-05, 'samples': 24686976, 'steps': 128577, 'loss/train': 0.7079052925109863} 11/07/2021 15:19:43 - INFO - __main__ - Step 128579: {'lr': 2.5404462607801966e-05, 'samples': 24687168, 'steps': 128578, 'loss/train': 0.8533437848091125} 11/07/2021 15:19:43 - INFO - __main__ - Step 128580: {'lr': 2.540213185851309e-05, 'samples': 24687360, 'steps': 128579, 'loss/train': 1.6256675720214844} 11/07/2021 15:19:44 - INFO - __main__ - Step 128581: {'lr': 2.539980121042454e-05, 'samples': 24687552, 'steps': 128580, 'loss/train': 0.740663468837738} 11/07/2021 15:19:44 - INFO - __main__ - Step 128582: {'lr': 2.5397470663537398e-05, 'samples': 24687744, 'steps': 128581, 'loss/train': 1.4889503717422485} 11/07/2021 15:19:44 - INFO - __main__ - Step 128583: {'lr': 2.5395140217852662e-05, 'samples': 24687936, 'steps': 128582, 'loss/train': 0.8674752116203308} 11/07/2021 15:19:45 - INFO - __main__ - Step 128584: {'lr': 2.539280987337142e-05, 'samples': 24688128, 'steps': 128583, 'loss/train': 1.6993595361709595} 11/07/2021 15:19:46 - INFO - __main__ - Step 128585: {'lr': 2.539047963009472e-05, 'samples': 24688320, 'steps': 128584, 'loss/train': 1.2797812223434448} 11/07/2021 15:19:46 - INFO - __main__ - Step 128586: {'lr': 2.538814948802359e-05, 'samples': 24688512, 'steps': 128585, 'loss/train': 0.9855076670646667} 11/07/2021 15:19:47 - INFO - __main__ - Step 128587: {'lr': 2.538581944715912e-05, 'samples': 24688704, 'steps': 128586, 'loss/train': 1.7401256561279297} 11/07/2021 15:19:47 - INFO - __main__ - Step 128588: {'lr': 2.53834895075023e-05, 'samples': 24688896, 'steps': 128587, 'loss/train': 1.9943649768829346} 11/07/2021 15:19:47 - INFO - __main__ - Step 128589: {'lr': 2.5381159669054243e-05, 'samples': 24689088, 'steps': 128588, 'loss/train': 1.4497089385986328} 11/07/2021 15:19:48 - INFO - __main__ - Step 128590: {'lr': 2.537882993181595e-05, 'samples': 24689280, 'steps': 128589, 'loss/train': 1.1778961420059204} 11/07/2021 15:19:49 - INFO - __main__ - Step 128591: {'lr': 2.5376500295788557e-05, 'samples': 24689472, 'steps': 128590, 'loss/train': 1.1711671352386475} 11/07/2021 15:19:49 - INFO - __main__ - Step 128592: {'lr': 2.537417076097298e-05, 'samples': 24689664, 'steps': 128591, 'loss/train': 1.4242476224899292} 11/07/2021 15:19:49 - INFO - __main__ - Step 128593: {'lr': 2.5371841327370332e-05, 'samples': 24689856, 'steps': 128592, 'loss/train': 1.5044150352478027} 11/07/2021 15:19:50 - INFO - __main__ - Step 128594: {'lr': 2.5369511994981693e-05, 'samples': 24690048, 'steps': 128593, 'loss/train': 1.1778180599212646} 11/07/2021 15:19:51 - INFO - __main__ - Step 128595: {'lr': 2.536718276380806e-05, 'samples': 24690240, 'steps': 128594, 'loss/train': 1.8657853603363037} 11/07/2021 15:19:51 - INFO - __main__ - Step 128596: {'lr': 2.5364853633850523e-05, 'samples': 24690432, 'steps': 128595, 'loss/train': 2.4561452865600586} 11/07/2021 15:19:52 - INFO - __main__ - Step 128597: {'lr': 2.5362524605110097e-05, 'samples': 24690624, 'steps': 128596, 'loss/train': 0.8080645799636841} 11/07/2021 15:19:52 - INFO - __main__ - Step 128598: {'lr': 2.5360195677587877e-05, 'samples': 24690816, 'steps': 128597, 'loss/train': 1.0445425510406494} 11/07/2021 15:19:52 - INFO - __main__ - Step 128599: {'lr': 2.5357866851284883e-05, 'samples': 24691008, 'steps': 128598, 'loss/train': 1.1756563186645508} 11/07/2021 15:19:54 - INFO - __main__ - Step 128600: {'lr': 2.5355538126202145e-05, 'samples': 24691200, 'steps': 128599, 'loss/train': 1.4388803243637085} 11/07/2021 15:19:54 - INFO - __main__ - Step 128601: {'lr': 2.535320950234074e-05, 'samples': 24691392, 'steps': 128600, 'loss/train': 1.3503737449645996} 11/07/2021 15:19:54 - INFO - __main__ - Step 128602: {'lr': 2.535088097970173e-05, 'samples': 24691584, 'steps': 128601, 'loss/train': 0.10159173607826233} 11/07/2021 15:19:55 - INFO - __main__ - Step 128603: {'lr': 2.5348552558286136e-05, 'samples': 24691776, 'steps': 128602, 'loss/train': 1.6087104082107544} 11/07/2021 15:19:55 - INFO - __main__ - Step 128604: {'lr': 2.5346224238095073e-05, 'samples': 24691968, 'steps': 128603, 'loss/train': 1.3228195905685425} 11/07/2021 15:19:56 - INFO - __main__ - Step 128605: {'lr': 2.5343896019129482e-05, 'samples': 24692160, 'steps': 128604, 'loss/train': 1.3515496253967285} 11/07/2021 15:19:57 - INFO - __main__ - Step 128606: {'lr': 2.534156790139047e-05, 'samples': 24692352, 'steps': 128605, 'loss/train': 1.270513892173767} 11/07/2021 15:19:57 - INFO - __main__ - Step 128607: {'lr': 2.5339239884879073e-05, 'samples': 24692544, 'steps': 128606, 'loss/train': 1.5630038976669312} 11/07/2021 15:19:57 - INFO - __main__ - Step 128608: {'lr': 2.533691196959634e-05, 'samples': 24692736, 'steps': 128607, 'loss/train': 1.180633544921875} 11/07/2021 15:19:58 - INFO - __main__ - Step 128609: {'lr': 2.533458415554335e-05, 'samples': 24692928, 'steps': 128608, 'loss/train': 1.155342698097229} 11/07/2021 15:19:59 - INFO - __main__ - Step 128610: {'lr': 2.5332256442721107e-05, 'samples': 24693120, 'steps': 128609, 'loss/train': 0.4470501244068146} 11/07/2021 15:19:59 - INFO - __main__ - Step 128611: {'lr': 2.5329928831130693e-05, 'samples': 24693312, 'steps': 128610, 'loss/train': 1.4567131996154785} 11/07/2021 15:19:59 - INFO - __main__ - Step 128612: {'lr': 2.5327601320773136e-05, 'samples': 24693504, 'steps': 128611, 'loss/train': 1.1993396282196045} 11/07/2021 15:20:00 - INFO - __main__ - Step 128613: {'lr': 2.5325273911649515e-05, 'samples': 24693696, 'steps': 128612, 'loss/train': 5.66504430770874} 11/07/2021 15:20:00 - INFO - __main__ - Step 128614: {'lr': 2.5322946603760833e-05, 'samples': 24693888, 'steps': 128613, 'loss/train': 1.094240427017212} 11/07/2021 15:20:01 - INFO - __main__ - Step 128615: {'lr': 2.5320619397108197e-05, 'samples': 24694080, 'steps': 128614, 'loss/train': 0.4564017355442047} 11/07/2021 15:20:02 - INFO - __main__ - Step 128616: {'lr': 2.5318292291692607e-05, 'samples': 24694272, 'steps': 128615, 'loss/train': 1.7774959802627563} 11/07/2021 15:20:02 - INFO - __main__ - Step 128617: {'lr': 2.531596528751512e-05, 'samples': 24694464, 'steps': 128616, 'loss/train': 1.0854991674423218} 11/07/2021 15:20:02 - INFO - __main__ - Step 128618: {'lr': 2.5313638384576843e-05, 'samples': 24694656, 'steps': 128617, 'loss/train': 1.6648164987564087} 11/07/2021 15:20:03 - INFO - __main__ - Step 128619: {'lr': 2.5311311582878722e-05, 'samples': 24694848, 'steps': 128618, 'loss/train': 1.3650190830230713} 11/07/2021 15:20:04 - INFO - __main__ - Step 128620: {'lr': 2.5308984882421866e-05, 'samples': 24695040, 'steps': 128619, 'loss/train': 0.12288787961006165} 11/07/2021 15:20:04 - INFO - __main__ - Step 128621: {'lr': 2.53066582832073e-05, 'samples': 24695232, 'steps': 128620, 'loss/train': 1.0333715677261353} 11/07/2021 15:20:05 - INFO - __main__ - Step 128622: {'lr': 2.5304331785236113e-05, 'samples': 24695424, 'steps': 128621, 'loss/train': 1.340997576713562} 11/07/2021 15:20:05 - INFO - __main__ - Step 128623: {'lr': 2.5302005388509296e-05, 'samples': 24695616, 'steps': 128622, 'loss/train': 1.2505820989608765} 11/07/2021 15:20:05 - INFO - __main__ - Step 128624: {'lr': 2.5299679093027965e-05, 'samples': 24695808, 'steps': 128623, 'loss/train': 1.1775435209274292} 11/07/2021 15:20:06 - INFO - __main__ - Step 128625: {'lr': 2.5297352898793092e-05, 'samples': 24696000, 'steps': 128624, 'loss/train': 1.1579322814941406} 11/07/2021 15:20:07 - INFO - __main__ - Step 128626: {'lr': 2.5295026805805782e-05, 'samples': 24696192, 'steps': 128625, 'loss/train': 0.7510146498680115} 11/07/2021 15:20:07 - INFO - __main__ - Step 128627: {'lr': 2.5292700814067065e-05, 'samples': 24696384, 'steps': 128626, 'loss/train': 1.717406988143921} 11/07/2021 15:20:07 - INFO - __main__ - Step 128628: {'lr': 2.5290374923577997e-05, 'samples': 24696576, 'steps': 128627, 'loss/train': 1.4092381000518799} 11/07/2021 15:20:08 - INFO - __main__ - Step 128629: {'lr': 2.52880491343396e-05, 'samples': 24696768, 'steps': 128628, 'loss/train': 1.4423856735229492} 11/07/2021 15:20:08 - INFO - __main__ - Step 128630: {'lr': 2.5285723446352963e-05, 'samples': 24696960, 'steps': 128629, 'loss/train': 0.055777840316295624} 11/07/2021 15:20:09 - INFO - __main__ - Step 128631: {'lr': 2.528339785961914e-05, 'samples': 24697152, 'steps': 128630, 'loss/train': 1.4009631872177124} 11/07/2021 15:20:10 - INFO - __main__ - Step 128632: {'lr': 2.5281072374139126e-05, 'samples': 24697344, 'steps': 128631, 'loss/train': 1.3434176445007324} 11/07/2021 15:20:10 - INFO - __main__ - Step 128633: {'lr': 2.5278746989913976e-05, 'samples': 24697536, 'steps': 128632, 'loss/train': 0.9975287914276123} 11/07/2021 15:20:10 - INFO - __main__ - Step 128634: {'lr': 2.5276421706944748e-05, 'samples': 24697728, 'steps': 128633, 'loss/train': 1.370509386062622} 11/07/2021 15:20:11 - INFO - __main__ - Step 128635: {'lr': 2.527409652523252e-05, 'samples': 24697920, 'steps': 128634, 'loss/train': 1.4876954555511475} 11/07/2021 15:20:12 - INFO - __main__ - Step 128636: {'lr': 2.5271771444778296e-05, 'samples': 24698112, 'steps': 128635, 'loss/train': 0.9639585614204407} 11/07/2021 15:20:13 - INFO - __main__ - Step 128637: {'lr': 2.5269446465583157e-05, 'samples': 24698304, 'steps': 128636, 'loss/train': 1.1247349977493286} 11/07/2021 15:20:13 - INFO - __main__ - Step 128638: {'lr': 2.526712158764813e-05, 'samples': 24698496, 'steps': 128637, 'loss/train': 1.5019575357437134} 11/07/2021 15:20:13 - INFO - __main__ - Step 128639: {'lr': 2.5264796810974265e-05, 'samples': 24698688, 'steps': 128638, 'loss/train': 1.4297322034835815} 11/07/2021 15:20:14 - INFO - __main__ - Step 128640: {'lr': 2.5262472135562627e-05, 'samples': 24698880, 'steps': 128639, 'loss/train': 1.3018523454666138} 11/07/2021 15:20:14 - INFO - __main__ - Step 128641: {'lr': 2.526014756141423e-05, 'samples': 24699072, 'steps': 128640, 'loss/train': 0.10160021483898163} 11/07/2021 15:20:15 - INFO - __main__ - Step 128642: {'lr': 2.525782308853017e-05, 'samples': 24699264, 'steps': 128641, 'loss/train': 0.11673558503389359} 11/07/2021 15:20:15 - INFO - __main__ - Step 128643: {'lr': 2.525549871691146e-05, 'samples': 24699456, 'steps': 128642, 'loss/train': 1.1054093837738037} 11/07/2021 15:20:16 - INFO - __main__ - Step 128644: {'lr': 2.5253174446559195e-05, 'samples': 24699648, 'steps': 128643, 'loss/train': 1.203536868095398} 11/07/2021 15:20:16 - INFO - __main__ - Step 128645: {'lr': 2.5250850277474315e-05, 'samples': 24699840, 'steps': 128644, 'loss/train': 1.2581597566604614} 11/07/2021 15:20:16 - INFO - __main__ - Step 128646: {'lr': 2.5248526209657953e-05, 'samples': 24700032, 'steps': 128645, 'loss/train': 1.275887131690979} 11/07/2021 15:20:18 - INFO - __main__ - Step 128647: {'lr': 2.524620224311114e-05, 'samples': 24700224, 'steps': 128646, 'loss/train': 1.2744895219802856} 11/07/2021 15:20:18 - INFO - __main__ - Step 128648: {'lr': 2.5243878377834927e-05, 'samples': 24700416, 'steps': 128647, 'loss/train': 1.1385514736175537} 11/07/2021 15:20:19 - INFO - __main__ - Step 128649: {'lr': 2.5241554613830347e-05, 'samples': 24700608, 'steps': 128648, 'loss/train': 1.2110928297042847} 11/07/2021 15:20:19 - INFO - __main__ - Step 128650: {'lr': 2.5239230951098452e-05, 'samples': 24700800, 'steps': 128649, 'loss/train': 1.767419695854187} 11/07/2021 15:20:19 - INFO - __main__ - Step 128651: {'lr': 2.5236907389640297e-05, 'samples': 24700992, 'steps': 128650, 'loss/train': 1.0504875183105469} 11/07/2021 15:20:20 - INFO - __main__ - Step 128652: {'lr': 2.5234583929456904e-05, 'samples': 24701184, 'steps': 128651, 'loss/train': 0.0464579202234745} 11/07/2021 15:20:21 - INFO - __main__ - Step 128653: {'lr': 2.5232260570549365e-05, 'samples': 24701376, 'steps': 128652, 'loss/train': 0.8842893838882446} 11/07/2021 15:20:21 - INFO - __main__ - Step 128654: {'lr': 2.5229937312918672e-05, 'samples': 24701568, 'steps': 128653, 'loss/train': 0.8963024616241455} 11/07/2021 15:20:21 - INFO - __main__ - Step 128655: {'lr': 2.5227614156565936e-05, 'samples': 24701760, 'steps': 128654, 'loss/train': 1.2401906251907349} 11/07/2021 15:20:22 - INFO - __main__ - Step 128656: {'lr': 2.522529110149213e-05, 'samples': 24701952, 'steps': 128655, 'loss/train': 1.1898918151855469} 11/07/2021 15:20:23 - INFO - __main__ - Step 128657: {'lr': 2.5222968147698366e-05, 'samples': 24702144, 'steps': 128656, 'loss/train': 1.6948541402816772} 11/07/2021 15:20:23 - INFO - __main__ - Step 128658: {'lr': 2.5220645295185724e-05, 'samples': 24702336, 'steps': 128657, 'loss/train': 1.0775561332702637} 11/07/2021 15:20:24 - INFO - __main__ - Step 128659: {'lr': 2.521832254395512e-05, 'samples': 24702528, 'steps': 128658, 'loss/train': 1.3403713703155518} 11/07/2021 15:20:24 - INFO - __main__ - Step 128660: {'lr': 2.5215999894007664e-05, 'samples': 24702720, 'steps': 128659, 'loss/train': 1.3097710609436035} 11/07/2021 15:20:24 - INFO - __main__ - Step 128661: {'lr': 2.521367734534444e-05, 'samples': 24702912, 'steps': 128660, 'loss/train': 0.6825568079948425} 11/07/2021 15:20:25 - INFO - __main__ - Step 128662: {'lr': 2.5211354897966442e-05, 'samples': 24703104, 'steps': 128661, 'loss/train': 0.6781173944473267} 11/07/2021 15:20:26 - INFO - __main__ - Step 128663: {'lr': 2.5209032551874735e-05, 'samples': 24703296, 'steps': 128662, 'loss/train': 1.3293126821517944} 11/07/2021 15:20:26 - INFO - __main__ - Step 128664: {'lr': 2.520671030707039e-05, 'samples': 24703488, 'steps': 128663, 'loss/train': 1.2321096658706665} 11/07/2021 15:20:26 - INFO - __main__ - Step 128665: {'lr': 2.5204388163554414e-05, 'samples': 24703680, 'steps': 128664, 'loss/train': 1.5246169567108154} 11/07/2021 15:20:27 - INFO - __main__ - Step 128666: {'lr': 2.5202066121327862e-05, 'samples': 24703872, 'steps': 128665, 'loss/train': 1.4885048866271973} 11/07/2021 15:20:28 - INFO - __main__ - Step 128667: {'lr': 2.519974418039181e-05, 'samples': 24704064, 'steps': 128666, 'loss/train': 1.411450982093811} 11/07/2021 15:20:28 - INFO - __main__ - Step 128668: {'lr': 2.5197422340747288e-05, 'samples': 24704256, 'steps': 128667, 'loss/train': 1.5674917697906494} 11/07/2021 15:20:28 - INFO - __main__ - Step 128669: {'lr': 2.5195100602395384e-05, 'samples': 24704448, 'steps': 128668, 'loss/train': 1.158224105834961} 11/07/2021 15:20:29 - INFO - __main__ - Step 128670: {'lr': 2.5192778965337033e-05, 'samples': 24704640, 'steps': 128669, 'loss/train': 1.0711896419525146} 11/07/2021 15:20:29 - INFO - __main__ - Step 128671: {'lr': 2.519045742957335e-05, 'samples': 24704832, 'steps': 128670, 'loss/train': 0.15936337411403656} 11/07/2021 15:20:30 - INFO - __main__ - Step 128672: {'lr': 2.518813599510539e-05, 'samples': 24705024, 'steps': 128671, 'loss/train': 0.841254472732544} 11/07/2021 15:20:30 - INFO - __main__ - Step 128673: {'lr': 2.5185814661934202e-05, 'samples': 24705216, 'steps': 128672, 'loss/train': 1.4190592765808105} 11/07/2021 15:20:31 - INFO - __main__ - Step 128674: {'lr': 2.5183493430060794e-05, 'samples': 24705408, 'steps': 128673, 'loss/train': 1.2374765872955322} 11/07/2021 15:20:31 - INFO - __main__ - Step 128675: {'lr': 2.5181172299486244e-05, 'samples': 24705600, 'steps': 128674, 'loss/train': 1.337708592414856} 11/07/2021 15:20:32 - INFO - __main__ - Step 128676: {'lr': 2.5178851270211577e-05, 'samples': 24705792, 'steps': 128675, 'loss/train': 1.2684751749038696} 11/07/2021 15:20:32 - INFO - __main__ - Step 128677: {'lr': 2.5176530342237852e-05, 'samples': 24705984, 'steps': 128676, 'loss/train': 0.6522752046585083} 11/07/2021 15:20:33 - INFO - __main__ - Step 128678: {'lr': 2.517420951556612e-05, 'samples': 24706176, 'steps': 128677, 'loss/train': 0.6808786392211914} 11/07/2021 15:20:33 - INFO - __main__ - Step 128679: {'lr': 2.5171888790197412e-05, 'samples': 24706368, 'steps': 128678, 'loss/train': 1.28631591796875} 11/07/2021 15:20:34 - INFO - __main__ - Step 128680: {'lr': 2.5169568166132834e-05, 'samples': 24706560, 'steps': 128679, 'loss/train': 1.3056455850601196} 11/07/2021 15:20:34 - INFO - __main__ - Step 128681: {'lr': 2.5167247643373332e-05, 'samples': 24706752, 'steps': 128680, 'loss/train': 1.0588184595108032} 11/07/2021 15:20:34 - INFO - __main__ - Step 128682: {'lr': 2.5164927221920014e-05, 'samples': 24706944, 'steps': 128681, 'loss/train': 1.2011686563491821} 11/07/2021 15:20:35 - INFO - __main__ - Step 128683: {'lr': 2.5162606901773883e-05, 'samples': 24707136, 'steps': 128682, 'loss/train': 0.9509779214859009} 11/07/2021 15:20:36 - INFO - __main__ - Step 128684: {'lr': 2.5160286682936047e-05, 'samples': 24707328, 'steps': 128683, 'loss/train': 1.1653151512145996} 11/07/2021 15:20:36 - INFO - __main__ - Step 128685: {'lr': 2.5157966565407475e-05, 'samples': 24707520, 'steps': 128684, 'loss/train': 1.3728629350662231} 11/07/2021 15:20:36 - INFO - __main__ - Step 128686: {'lr': 2.515564654918928e-05, 'samples': 24707712, 'steps': 128685, 'loss/train': 0.8639098405838013} 11/07/2021 15:20:37 - INFO - __main__ - Step 128687: {'lr': 2.5153326634282465e-05, 'samples': 24707904, 'steps': 128686, 'loss/train': 1.143284559249878} 11/07/2021 15:20:38 - INFO - __main__ - Step 128688: {'lr': 2.5151006820688105e-05, 'samples': 24708096, 'steps': 128687, 'loss/train': 1.2722195386886597} 11/07/2021 15:20:38 - INFO - __main__ - Step 128689: {'lr': 2.514868710840723e-05, 'samples': 24708288, 'steps': 128688, 'loss/train': 1.568285346031189} 11/07/2021 15:20:38 - INFO - __main__ - Step 128690: {'lr': 2.5146367497440898e-05, 'samples': 24708480, 'steps': 128689, 'loss/train': 1.400712013244629} 11/07/2021 15:20:39 - INFO - __main__ - Step 128691: {'lr': 2.514404798779016e-05, 'samples': 24708672, 'steps': 128690, 'loss/train': 1.3238033056259155} 11/07/2021 15:20:39 - INFO - __main__ - Step 128692: {'lr': 2.5141728579456015e-05, 'samples': 24708864, 'steps': 128691, 'loss/train': 1.0275822877883911} 11/07/2021 15:20:40 - INFO - __main__ - Step 128693: {'lr': 2.5139409272439545e-05, 'samples': 24709056, 'steps': 128692, 'loss/train': 1.479452133178711} 11/07/2021 15:20:41 - INFO - __main__ - Step 128694: {'lr': 2.513709006674178e-05, 'samples': 24709248, 'steps': 128693, 'loss/train': 0.6096610426902771} 11/07/2021 15:20:41 - INFO - __main__ - Step 128695: {'lr': 2.5134770962363774e-05, 'samples': 24709440, 'steps': 128694, 'loss/train': 0.9360098838806152} 11/07/2021 15:20:41 - INFO - __main__ - Step 128696: {'lr': 2.513245195930655e-05, 'samples': 24709632, 'steps': 128695, 'loss/train': 1.1925753355026245} 11/07/2021 15:20:42 - INFO - __main__ - Step 128697: {'lr': 2.51301330575712e-05, 'samples': 24709824, 'steps': 128696, 'loss/train': 1.1387722492218018} 11/07/2021 15:20:43 - INFO - __main__ - Step 128698: {'lr': 2.5127814257158737e-05, 'samples': 24710016, 'steps': 128697, 'loss/train': 1.253521203994751} 11/07/2021 15:20:43 - INFO - __main__ - Step 128699: {'lr': 2.51254955580702e-05, 'samples': 24710208, 'steps': 128698, 'loss/train': 1.3632879257202148} 11/07/2021 15:20:43 - INFO - __main__ - Step 128700: {'lr': 2.5123176960306665e-05, 'samples': 24710400, 'steps': 128699, 'loss/train': 1.0979822874069214} 11/07/2021 15:20:44 - INFO - __main__ - Step 128701: {'lr': 2.5120858463869188e-05, 'samples': 24710592, 'steps': 128700, 'loss/train': 1.6189674139022827} 11/07/2021 15:20:44 - INFO - __main__ - Step 128702: {'lr': 2.5118540068758743e-05, 'samples': 24710784, 'steps': 128701, 'loss/train': 1.2090626955032349} 11/07/2021 15:20:45 - INFO - __main__ - Step 128703: {'lr': 2.5116221774976382e-05, 'samples': 24710976, 'steps': 128702, 'loss/train': 1.0565370321273804} 11/07/2021 15:20:45 - INFO - __main__ - Step 128704: {'lr': 2.5113903582523218e-05, 'samples': 24711168, 'steps': 128703, 'loss/train': 1.4913402795791626} 11/07/2021 15:20:46 - INFO - __main__ - Step 128705: {'lr': 2.5111585491400246e-05, 'samples': 24711360, 'steps': 128704, 'loss/train': 1.1036427021026611} 11/07/2021 15:20:46 - INFO - __main__ - Step 128706: {'lr': 2.5109267501608524e-05, 'samples': 24711552, 'steps': 128705, 'loss/train': 1.5891774892807007} 11/07/2021 15:20:47 - INFO - __main__ - Step 128707: {'lr': 2.5106949613149104e-05, 'samples': 24711744, 'steps': 128706, 'loss/train': 0.11766353249549866} 11/07/2021 15:20:48 - INFO - __main__ - Step 128708: {'lr': 2.510463182602302e-05, 'samples': 24711936, 'steps': 128707, 'loss/train': 1.7320091724395752} 11/07/2021 15:20:48 - INFO - __main__ - Step 128709: {'lr': 2.5102314140231312e-05, 'samples': 24712128, 'steps': 128708, 'loss/train': 1.0926525592803955} 11/07/2021 15:20:48 - INFO - __main__ - Step 128710: {'lr': 2.5099996555775023e-05, 'samples': 24712320, 'steps': 128709, 'loss/train': 1.486562967300415} 11/07/2021 15:20:49 - INFO - __main__ - Step 128711: {'lr': 2.5097679072655228e-05, 'samples': 24712512, 'steps': 128710, 'loss/train': 1.0462641716003418} 11/07/2021 15:20:49 - INFO - __main__ - Step 128712: {'lr': 2.509536169087298e-05, 'samples': 24712704, 'steps': 128711, 'loss/train': 1.3076133728027344} 11/07/2021 15:20:50 - INFO - __main__ - Step 128713: {'lr': 2.5093044410429227e-05, 'samples': 24712896, 'steps': 128712, 'loss/train': 1.5080476999282837} 11/07/2021 15:20:50 - INFO - __main__ - Step 128714: {'lr': 2.5090727231325105e-05, 'samples': 24713088, 'steps': 128713, 'loss/train': 1.4240647554397583} 11/07/2021 15:20:51 - INFO - __main__ - Step 128715: {'lr': 2.5088410153561614e-05, 'samples': 24713280, 'steps': 128714, 'loss/train': 1.339318871498108} 11/07/2021 15:20:51 - INFO - __main__ - Step 128716: {'lr': 2.5086093177139835e-05, 'samples': 24713472, 'steps': 128715, 'loss/train': 1.5925880670547485} 11/07/2021 15:20:51 - INFO - __main__ - Step 128717: {'lr': 2.5083776302060768e-05, 'samples': 24713664, 'steps': 128716, 'loss/train': 1.7129273414611816} 11/07/2021 15:20:53 - INFO - __main__ - Step 128718: {'lr': 2.5081459528325496e-05, 'samples': 24713856, 'steps': 128717, 'loss/train': 1.2970905303955078} 11/07/2021 15:20:53 - INFO - __main__ - Step 128719: {'lr': 2.5079142855935043e-05, 'samples': 24714048, 'steps': 128718, 'loss/train': 1.3932280540466309} 11/07/2021 15:20:53 - INFO - __main__ - Step 128720: {'lr': 2.507682628489047e-05, 'samples': 24714240, 'steps': 128719, 'loss/train': 1.0478794574737549} 11/07/2021 15:20:54 - INFO - __main__ - Step 128721: {'lr': 2.50745098151928e-05, 'samples': 24714432, 'steps': 128720, 'loss/train': 1.5375841856002808} 11/07/2021 15:20:54 - INFO - __main__ - Step 128722: {'lr': 2.5072193446843085e-05, 'samples': 24714624, 'steps': 128721, 'loss/train': 0.10295145213603973} 11/07/2021 15:20:55 - INFO - __main__ - Step 128723: {'lr': 2.5069877179842353e-05, 'samples': 24714816, 'steps': 128722, 'loss/train': 0.8104313611984253} 11/07/2021 15:20:56 - INFO - __main__ - Step 128724: {'lr': 2.5067561014191692e-05, 'samples': 24715008, 'steps': 128723, 'loss/train': 0.7049184441566467} 11/07/2021 15:20:56 - INFO - __main__ - Step 128725: {'lr': 2.506524494989215e-05, 'samples': 24715200, 'steps': 128724, 'loss/train': 1.3169571161270142} 11/07/2021 15:20:56 - INFO - __main__ - Step 128726: {'lr': 2.5062928986944677e-05, 'samples': 24715392, 'steps': 128725, 'loss/train': 0.9069405198097229} 11/07/2021 15:20:57 - INFO - __main__ - Step 128727: {'lr': 2.5060613125350408e-05, 'samples': 24715584, 'steps': 128726, 'loss/train': 1.3281394243240356} 11/07/2021 15:20:58 - INFO - __main__ - Step 128728: {'lr': 2.505829736511034e-05, 'samples': 24715776, 'steps': 128727, 'loss/train': 1.045569896697998} 11/07/2021 15:20:58 - INFO - __main__ - Step 128729: {'lr': 2.5055981706225527e-05, 'samples': 24715968, 'steps': 128728, 'loss/train': 2.772444009780884} 11/07/2021 15:20:58 - INFO - __main__ - Step 128730: {'lr': 2.5053666148697e-05, 'samples': 24716160, 'steps': 128729, 'loss/train': 1.0713046789169312} 11/07/2021 15:20:59 - INFO - __main__ - Step 128731: {'lr': 2.5051350692525842e-05, 'samples': 24716352, 'steps': 128730, 'loss/train': 1.5088398456573486} 11/07/2021 15:20:59 - INFO - __main__ - Step 128732: {'lr': 2.504903533771308e-05, 'samples': 24716544, 'steps': 128731, 'loss/train': 1.849228024482727} 11/07/2021 15:21:00 - INFO - __main__ - Step 128733: {'lr': 2.5046720084259734e-05, 'samples': 24716736, 'steps': 128732, 'loss/train': 1.1117887496948242} 11/07/2021 15:21:01 - INFO - __main__ - Step 128734: {'lr': 2.5044404932166892e-05, 'samples': 24716928, 'steps': 128733, 'loss/train': 1.2059412002563477} 11/07/2021 15:21:01 - INFO - __main__ - Step 128735: {'lr': 2.5042089881435555e-05, 'samples': 24717120, 'steps': 128734, 'loss/train': 1.2423619031906128} 11/07/2021 15:21:01 - INFO - __main__ - Step 128736: {'lr': 2.5039774932066774e-05, 'samples': 24717312, 'steps': 128735, 'loss/train': 0.05844426900148392} 11/07/2021 15:21:02 - INFO - __main__ - Step 128737: {'lr': 2.5037460084061602e-05, 'samples': 24717504, 'steps': 128736, 'loss/train': 1.5444170236587524} 11/07/2021 15:21:03 - INFO - __main__ - Step 128738: {'lr': 2.5035145337421073e-05, 'samples': 24717696, 'steps': 128737, 'loss/train': 1.1814604997634888} 11/07/2021 15:21:03 - INFO - __main__ - Step 128739: {'lr': 2.5032830692146292e-05, 'samples': 24717888, 'steps': 128738, 'loss/train': 1.5447883605957031} 11/07/2021 15:21:04 - INFO - __main__ - Step 128740: {'lr': 2.5030516148238203e-05, 'samples': 24718080, 'steps': 128739, 'loss/train': 1.1990351676940918} 11/07/2021 15:21:04 - INFO - __main__ - Step 128741: {'lr': 2.502820170569789e-05, 'samples': 24718272, 'steps': 128740, 'loss/train': 1.4703922271728516} 11/07/2021 15:21:04 - INFO - __main__ - Step 128742: {'lr': 2.502588736452638e-05, 'samples': 24718464, 'steps': 128741, 'loss/train': 1.7061012983322144} 11/07/2021 15:21:05 - INFO - __main__ - Step 128743: {'lr': 2.5023573124724753e-05, 'samples': 24718656, 'steps': 128742, 'loss/train': 1.7535768747329712} 11/07/2021 15:21:06 - INFO - __main__ - Step 128744: {'lr': 2.5021258986294036e-05, 'samples': 24718848, 'steps': 128743, 'loss/train': 1.5550330877304077} 11/07/2021 15:21:06 - INFO - __main__ - Step 128745: {'lr': 2.501894494923526e-05, 'samples': 24719040, 'steps': 128744, 'loss/train': 1.4781370162963867} 11/07/2021 15:21:06 - INFO - __main__ - Step 128746: {'lr': 2.501663101354948e-05, 'samples': 24719232, 'steps': 128745, 'loss/train': 1.248449683189392} 11/07/2021 15:21:07 - INFO - __main__ - Step 128747: {'lr': 2.5014317179237717e-05, 'samples': 24719424, 'steps': 128746, 'loss/train': 1.4910004138946533} 11/07/2021 15:21:08 - INFO - __main__ - Step 128748: {'lr': 2.5012003446301028e-05, 'samples': 24719616, 'steps': 128747, 'loss/train': 1.2466504573822021} 11/07/2021 15:21:08 - INFO - __main__ - Step 128749: {'lr': 2.50096898147405e-05, 'samples': 24719808, 'steps': 128748, 'loss/train': 0.7255334854125977} 11/07/2021 15:21:09 - INFO - __main__ - Step 128750: {'lr': 2.5007376284557098e-05, 'samples': 24720000, 'steps': 128749, 'loss/train': 1.3310582637786865} 11/07/2021 15:21:09 - INFO - __main__ - Step 128751: {'lr': 2.500506285575191e-05, 'samples': 24720192, 'steps': 128750, 'loss/train': 1.0146658420562744} 11/07/2021 15:21:09 - INFO - __main__ - Step 128752: {'lr': 2.5002749528326014e-05, 'samples': 24720384, 'steps': 128751, 'loss/train': 1.5976107120513916} 11/07/2021 15:21:10 - INFO - __main__ - Step 128753: {'lr': 2.5000436302280354e-05, 'samples': 24720576, 'steps': 128752, 'loss/train': 1.171164631843567} 11/07/2021 15:21:11 - INFO - __main__ - Step 128754: {'lr': 2.4998123177616043e-05, 'samples': 24720768, 'steps': 128753, 'loss/train': 2.0932767391204834} 11/07/2021 15:21:11 - INFO - __main__ - Step 128755: {'lr': 2.499581015433411e-05, 'samples': 24720960, 'steps': 128754, 'loss/train': 1.1320544481277466} 11/07/2021 15:21:11 - INFO - __main__ - Step 128756: {'lr': 2.4993497232435574e-05, 'samples': 24721152, 'steps': 128755, 'loss/train': 1.2154598236083984} 11/07/2021 15:21:12 - INFO - __main__ - Step 128757: {'lr': 2.4991184411921496e-05, 'samples': 24721344, 'steps': 128756, 'loss/train': 0.4951738715171814} 11/07/2021 15:21:12 - INFO - __main__ - Step 128758: {'lr': 2.498887169279293e-05, 'samples': 24721536, 'steps': 128757, 'loss/train': 0.0946190282702446} 11/07/2021 15:21:13 - INFO - __main__ - Step 128759: {'lr': 2.49865590750509e-05, 'samples': 24721728, 'steps': 128758, 'loss/train': 0.7828315496444702} 11/07/2021 15:21:13 - INFO - __main__ - Step 128760: {'lr': 2.498424655869644e-05, 'samples': 24721920, 'steps': 128759, 'loss/train': 0.879910409450531} 11/07/2021 15:21:14 - INFO - __main__ - Step 128761: {'lr': 2.4981934143730624e-05, 'samples': 24722112, 'steps': 128760, 'loss/train': 0.5517290234565735} 11/07/2021 15:21:14 - INFO - __main__ - Step 128762: {'lr': 2.4979621830154482e-05, 'samples': 24722304, 'steps': 128761, 'loss/train': 0.5516632795333862} 11/07/2021 15:21:15 - INFO - __main__ - Step 128763: {'lr': 2.497730961796904e-05, 'samples': 24722496, 'steps': 128762, 'loss/train': 0.17891019582748413} 11/07/2021 15:21:16 - INFO - __main__ - Step 128764: {'lr': 2.497499750717533e-05, 'samples': 24722688, 'steps': 128763, 'loss/train': 0.3433561325073242} 11/07/2021 15:21:16 - INFO - __main__ - Step 128765: {'lr': 2.4972685497774485e-05, 'samples': 24722880, 'steps': 128764, 'loss/train': 1.0886448621749878} 11/07/2021 15:21:16 - INFO - __main__ - Step 128766: {'lr': 2.497037358976742e-05, 'samples': 24723072, 'steps': 128765, 'loss/train': 1.1376186609268188} 11/07/2021 15:21:17 - INFO - __main__ - Step 128767: {'lr': 2.496806178315525e-05, 'samples': 24723264, 'steps': 128766, 'loss/train': 1.5369131565093994} 11/07/2021 15:21:17 - INFO - __main__ - Step 128768: {'lr': 2.4965750077939e-05, 'samples': 24723456, 'steps': 128767, 'loss/train': 0.9512562155723572} 11/07/2021 15:21:18 - INFO - __main__ - Step 128769: {'lr': 2.4963438474119692e-05, 'samples': 24723648, 'steps': 128768, 'loss/train': 0.3221037685871124} 11/07/2021 15:21:18 - INFO - __main__ - Step 128770: {'lr': 2.4961126971698387e-05, 'samples': 24723840, 'steps': 128769, 'loss/train': 1.3173967599868774} 11/07/2021 15:21:19 - INFO - __main__ - Step 128771: {'lr': 2.4958815570676112e-05, 'samples': 24724032, 'steps': 128770, 'loss/train': 1.35137939453125} 11/07/2021 15:21:19 - INFO - __main__ - Step 128772: {'lr': 2.4956504271053946e-05, 'samples': 24724224, 'steps': 128771, 'loss/train': 1.056453824043274} 11/07/2021 15:21:19 - INFO - __main__ - Step 128773: {'lr': 2.4954193072832894e-05, 'samples': 24724416, 'steps': 128772, 'loss/train': 1.3716377019882202} 11/07/2021 15:21:20 - INFO - __main__ - Step 128774: {'lr': 2.495188197601403e-05, 'samples': 24724608, 'steps': 128773, 'loss/train': 1.4553025960922241} 11/07/2021 15:21:21 - INFO - __main__ - Step 128775: {'lr': 2.4949570980598358e-05, 'samples': 24724800, 'steps': 128774, 'loss/train': 1.6932352781295776} 11/07/2021 15:21:21 - INFO - __main__ - Step 128776: {'lr': 2.494726008658693e-05, 'samples': 24724992, 'steps': 128775, 'loss/train': 0.9313123226165771} 11/07/2021 15:21:21 - INFO - __main__ - Step 128777: {'lr': 2.4944949293980805e-05, 'samples': 24725184, 'steps': 128776, 'loss/train': 1.4466930627822876} 11/07/2021 15:21:22 - INFO - __main__ - Step 128778: {'lr': 2.494263860278101e-05, 'samples': 24725376, 'steps': 128777, 'loss/train': 1.2586696147918701} 11/07/2021 15:21:23 - INFO - __main__ - Step 128779: {'lr': 2.494032801298865e-05, 'samples': 24725568, 'steps': 128778, 'loss/train': 1.0570560693740845} 11/07/2021 15:21:23 - INFO - __main__ - Step 128780: {'lr': 2.4938017524604646e-05, 'samples': 24725760, 'steps': 128779, 'loss/train': 1.0948971509933472} 11/07/2021 15:21:24 - INFO - __main__ - Step 128781: {'lr': 2.4935707137630103e-05, 'samples': 24725952, 'steps': 128780, 'loss/train': 1.2203611135482788} 11/07/2021 15:21:24 - INFO - __main__ - Step 128782: {'lr': 2.4933396852066054e-05, 'samples': 24726144, 'steps': 128781, 'loss/train': 0.6383169293403625} 11/07/2021 15:21:25 - INFO - __main__ - Step 128783: {'lr': 2.493108666791355e-05, 'samples': 24726336, 'steps': 128782, 'loss/train': 0.3846348822116852} 11/07/2021 15:21:25 - INFO - __main__ - Step 128784: {'lr': 2.4928776585173618e-05, 'samples': 24726528, 'steps': 128783, 'loss/train': 1.1847790479660034} 11/07/2021 15:21:26 - INFO - __main__ - Step 128785: {'lr': 2.4926466603847287e-05, 'samples': 24726720, 'steps': 128784, 'loss/train': 1.4003912210464478} 11/07/2021 15:21:26 - INFO - __main__ - Step 128786: {'lr': 2.492415672393564e-05, 'samples': 24726912, 'steps': 128785, 'loss/train': 1.1567779779434204} 11/07/2021 15:21:27 - INFO - __main__ - Step 128787: {'lr': 2.4921846945439695e-05, 'samples': 24727104, 'steps': 128786, 'loss/train': 1.2043331861495972} 11/07/2021 15:21:27 - INFO - __main__ - Step 128788: {'lr': 2.4919537268360494e-05, 'samples': 24727296, 'steps': 128787, 'loss/train': 1.276174545288086} 11/07/2021 15:21:27 - INFO - __main__ - Step 128789: {'lr': 2.491722769269905e-05, 'samples': 24727488, 'steps': 128788, 'loss/train': 1.1859163045883179} 11/07/2021 15:21:28 - INFO - __main__ - Step 128790: {'lr': 2.4914918218456455e-05, 'samples': 24727680, 'steps': 128789, 'loss/train': 1.5399991273880005} 11/07/2021 15:21:29 - INFO - __main__ - Step 128791: {'lr': 2.491260884563373e-05, 'samples': 24727872, 'steps': 128790, 'loss/train': 2.05515456199646} 11/07/2021 15:21:29 - INFO - __main__ - Step 128792: {'lr': 2.4910299574231938e-05, 'samples': 24728064, 'steps': 128791, 'loss/train': 0.6768157482147217} 11/07/2021 15:21:29 - INFO - __main__ - Step 128793: {'lr': 2.4907990404252067e-05, 'samples': 24728256, 'steps': 128792, 'loss/train': 1.1393470764160156} 11/07/2021 15:21:30 - INFO - __main__ - Step 128794: {'lr': 2.490568133569515e-05, 'samples': 24728448, 'steps': 128793, 'loss/train': 1.1416805982589722} 11/07/2021 15:21:31 - INFO - __main__ - Step 128795: {'lr': 2.49033723685623e-05, 'samples': 24728640, 'steps': 128794, 'loss/train': 1.0493884086608887} 11/07/2021 15:21:31 - INFO - __main__ - Step 128796: {'lr': 2.4901063502854482e-05, 'samples': 24728832, 'steps': 128795, 'loss/train': 1.1639392375946045} 11/07/2021 15:21:32 - INFO - __main__ - Step 128797: {'lr': 2.489875473857278e-05, 'samples': 24729024, 'steps': 128796, 'loss/train': 1.5174156427383423} 11/07/2021 15:21:32 - INFO - __main__ - Step 128798: {'lr': 2.4896446075718254e-05, 'samples': 24729216, 'steps': 128797, 'loss/train': 1.5152603387832642} 11/07/2021 15:21:32 - INFO - __main__ - Step 128799: {'lr': 2.489413751429187e-05, 'samples': 24729408, 'steps': 128798, 'loss/train': 1.5020041465759277} 11/07/2021 15:21:33 - INFO - __main__ - Step 128800: {'lr': 2.489182905429474e-05, 'samples': 24729600, 'steps': 128799, 'loss/train': 1.0870047807693481} 11/07/2021 15:21:34 - INFO - __main__ - Step 128801: {'lr': 2.488952069572789e-05, 'samples': 24729792, 'steps': 128800, 'loss/train': 1.4784092903137207} 11/07/2021 15:21:34 - INFO - __main__ - Step 128802: {'lr': 2.4887212438592322e-05, 'samples': 24729984, 'steps': 128801, 'loss/train': 1.2325223684310913} 11/07/2021 15:21:34 - INFO - __main__ - Step 128803: {'lr': 2.4884904282889113e-05, 'samples': 24730176, 'steps': 128802, 'loss/train': 1.396253228187561} 11/07/2021 15:21:35 - INFO - __main__ - Step 128804: {'lr': 2.48825962286193e-05, 'samples': 24730368, 'steps': 128803, 'loss/train': 1.3341522216796875} 11/07/2021 15:21:36 - INFO - __main__ - Step 128805: {'lr': 2.4880288275783896e-05, 'samples': 24730560, 'steps': 128804, 'loss/train': 1.336694359779358} 11/07/2021 15:21:36 - INFO - __main__ - Step 128806: {'lr': 2.487798042438402e-05, 'samples': 24730752, 'steps': 128805, 'loss/train': 1.3059086799621582} 11/07/2021 15:21:37 - INFO - __main__ - Step 128807: {'lr': 2.487567267442062e-05, 'samples': 24730944, 'steps': 128806, 'loss/train': 1.182289481163025} 11/07/2021 15:21:37 - INFO - __main__ - Step 128808: {'lr': 2.4873365025894738e-05, 'samples': 24731136, 'steps': 128807, 'loss/train': 1.0719319581985474} 11/07/2021 15:21:37 - INFO - __main__ - Step 128809: {'lr': 2.487105747880747e-05, 'samples': 24731328, 'steps': 128808, 'loss/train': 1.301666259765625} 11/07/2021 15:21:38 - INFO - __main__ - Step 128810: {'lr': 2.4868750033159803e-05, 'samples': 24731520, 'steps': 128809, 'loss/train': 1.3105690479278564} 11/07/2021 15:21:39 - INFO - __main__ - Step 128811: {'lr': 2.486644268895283e-05, 'samples': 24731712, 'steps': 128810, 'loss/train': 1.0662922859191895} 11/07/2021 15:21:39 - INFO - __main__ - Step 128812: {'lr': 2.486413544618754e-05, 'samples': 24731904, 'steps': 128811, 'loss/train': 1.5135279893875122} 11/07/2021 15:21:39 - INFO - __main__ - Step 128813: {'lr': 2.4861828304865026e-05, 'samples': 24732096, 'steps': 128812, 'loss/train': 1.312330722808838} 11/07/2021 15:21:40 - INFO - __main__ - Step 128814: {'lr': 2.4859521264986277e-05, 'samples': 24732288, 'steps': 128813, 'loss/train': 0.946086585521698} 11/07/2021 15:21:40 - INFO - __main__ - Step 128815: {'lr': 2.4857214326552358e-05, 'samples': 24732480, 'steps': 128814, 'loss/train': 1.1605229377746582} 11/07/2021 15:21:41 - INFO - __main__ - Step 128816: {'lr': 2.4854907489564316e-05, 'samples': 24732672, 'steps': 128815, 'loss/train': 0.08830859512090683} 11/07/2021 15:21:42 - INFO - __main__ - Step 128817: {'lr': 2.4852600754023153e-05, 'samples': 24732864, 'steps': 128816, 'loss/train': 0.8634956479072571} 11/07/2021 15:21:42 - INFO - __main__ - Step 128818: {'lr': 2.485029411992995e-05, 'samples': 24733056, 'steps': 128817, 'loss/train': 1.194962501525879} 11/07/2021 15:21:42 - INFO - __main__ - Step 128819: {'lr': 2.4847987587285765e-05, 'samples': 24733248, 'steps': 128818, 'loss/train': 0.7799480557441711} 11/07/2021 15:21:43 - INFO - __main__ - Step 128820: {'lr': 2.4845681156091567e-05, 'samples': 24733440, 'steps': 128819, 'loss/train': 1.3232314586639404} 11/07/2021 15:21:44 - INFO - __main__ - Step 128821: {'lr': 2.4843374826348435e-05, 'samples': 24733632, 'steps': 128820, 'loss/train': 1.2701141834259033} 11/07/2021 15:21:44 - INFO - __main__ - Step 128822: {'lr': 2.4841068598057405e-05, 'samples': 24733824, 'steps': 128821, 'loss/train': 0.7451719045639038} 11/07/2021 15:21:44 - INFO - __main__ - Step 128823: {'lr': 2.4838762471219495e-05, 'samples': 24734016, 'steps': 128822, 'loss/train': 0.829639732837677} 11/07/2021 15:21:45 - INFO - __main__ - Step 128824: {'lr': 2.4836456445835766e-05, 'samples': 24734208, 'steps': 128823, 'loss/train': 1.5502113103866577} 11/07/2021 15:21:45 - INFO - __main__ - Step 128825: {'lr': 2.483415052190727e-05, 'samples': 24734400, 'steps': 128824, 'loss/train': 1.091614842414856} 11/07/2021 15:21:46 - INFO - __main__ - Step 128826: {'lr': 2.483184469943503e-05, 'samples': 24734592, 'steps': 128825, 'loss/train': 1.0850414037704468} 11/07/2021 15:21:46 - INFO - __main__ - Step 128827: {'lr': 2.482953897842008e-05, 'samples': 24734784, 'steps': 128826, 'loss/train': 1.2050280570983887} 11/07/2021 15:21:47 - INFO - __main__ - Step 128828: {'lr': 2.482723335886347e-05, 'samples': 24734976, 'steps': 128827, 'loss/train': 0.7996206283569336} 11/07/2021 15:21:47 - INFO - __main__ - Step 128829: {'lr': 2.4824927840766232e-05, 'samples': 24735168, 'steps': 128828, 'loss/train': 1.094702124595642} 11/07/2021 15:21:47 - INFO - __main__ - Step 128830: {'lr': 2.4822622424129416e-05, 'samples': 24735360, 'steps': 128829, 'loss/train': 1.3074675798416138} 11/07/2021 15:21:49 - INFO - __main__ - Step 128831: {'lr': 2.4820317108954048e-05, 'samples': 24735552, 'steps': 128830, 'loss/train': 1.1899681091308594} 11/07/2021 15:21:49 - INFO - __main__ - Step 128832: {'lr': 2.4818011895241162e-05, 'samples': 24735744, 'steps': 128831, 'loss/train': 0.9487810730934143} 11/07/2021 15:21:49 - INFO - __main__ - Step 128833: {'lr': 2.481570678299186e-05, 'samples': 24735936, 'steps': 128832, 'loss/train': 1.0987727642059326} 11/07/2021 15:21:50 - INFO - __main__ - Step 128834: {'lr': 2.481340177220706e-05, 'samples': 24736128, 'steps': 128833, 'loss/train': 1.3143419027328491} 11/07/2021 15:21:50 - INFO - __main__ - Step 128835: {'lr': 2.481109686288788e-05, 'samples': 24736320, 'steps': 128834, 'loss/train': 1.2662136554718018} 11/07/2021 15:21:51 - INFO - __main__ - Step 128836: {'lr': 2.4808792055035363e-05, 'samples': 24736512, 'steps': 128835, 'loss/train': 1.208132266998291} 11/07/2021 15:21:51 - INFO - __main__ - Step 128837: {'lr': 2.4806487348650486e-05, 'samples': 24736704, 'steps': 128836, 'loss/train': 0.7959012985229492} 11/07/2021 15:21:52 - INFO - __main__ - Step 128838: {'lr': 2.4804182743734362e-05, 'samples': 24736896, 'steps': 128837, 'loss/train': 1.5868536233901978} 11/07/2021 15:21:52 - INFO - __main__ - Step 128839: {'lr': 2.480187824028801e-05, 'samples': 24737088, 'steps': 128838, 'loss/train': 0.05893876031041145} 11/07/2021 15:21:52 - INFO - __main__ - Step 128840: {'lr': 2.479957383831244e-05, 'samples': 24737280, 'steps': 128839, 'loss/train': 1.4219752550125122} 11/07/2021 15:21:53 - INFO - __main__ - Step 128841: {'lr': 2.47972695378087e-05, 'samples': 24737472, 'steps': 128840, 'loss/train': 1.0776537656784058} 11/07/2021 15:21:54 - INFO - __main__ - Step 128842: {'lr': 2.479496533877784e-05, 'samples': 24737664, 'steps': 128841, 'loss/train': 1.20052969455719} 11/07/2021 15:21:54 - INFO - __main__ - Step 128843: {'lr': 2.47926612412209e-05, 'samples': 24737856, 'steps': 128842, 'loss/train': 1.6184415817260742} 11/07/2021 15:21:55 - INFO - __main__ - Step 128844: {'lr': 2.4790357245138895e-05, 'samples': 24738048, 'steps': 128843, 'loss/train': 1.0285797119140625} 11/07/2021 15:21:55 - INFO - __main__ - Step 128845: {'lr': 2.4788053350532885e-05, 'samples': 24738240, 'steps': 128844, 'loss/train': 1.2967411279678345} 11/07/2021 15:21:56 - INFO - __main__ - Step 128846: {'lr': 2.4785749557403952e-05, 'samples': 24738432, 'steps': 128845, 'loss/train': 1.311326503753662} 11/07/2021 15:21:56 - INFO - __main__ - Step 128847: {'lr': 2.4783445865753067e-05, 'samples': 24738624, 'steps': 128846, 'loss/train': 0.9600632190704346} 11/07/2021 15:21:57 - INFO - __main__ - Step 128848: {'lr': 2.478114227558126e-05, 'samples': 24738816, 'steps': 128847, 'loss/train': 0.9340611696243286} 11/07/2021 15:21:57 - INFO - __main__ - Step 128849: {'lr': 2.477883878688958e-05, 'samples': 24739008, 'steps': 128848, 'loss/train': 1.2117382287979126} 11/07/2021 15:21:57 - INFO - __main__ - Step 128850: {'lr': 2.477653539967911e-05, 'samples': 24739200, 'steps': 128849, 'loss/train': 1.2519395351409912} 11/07/2021 15:21:58 - INFO - __main__ - Step 128851: {'lr': 2.477423211395083e-05, 'samples': 24739392, 'steps': 128850, 'loss/train': 1.1143183708190918} 11/07/2021 15:21:59 - INFO - __main__ - Step 128852: {'lr': 2.477192892970584e-05, 'samples': 24739584, 'steps': 128851, 'loss/train': 0.8944448828697205} 11/07/2021 15:21:59 - INFO - __main__ - Step 128853: {'lr': 2.4769625846945116e-05, 'samples': 24739776, 'steps': 128852, 'loss/train': 1.3032764196395874} 11/07/2021 15:21:59 - INFO - __main__ - Step 128854: {'lr': 2.476732286566971e-05, 'samples': 24739968, 'steps': 128853, 'loss/train': 1.1679787635803223} 11/07/2021 15:22:00 - INFO - __main__ - Step 128855: {'lr': 2.476501998588071e-05, 'samples': 24740160, 'steps': 128854, 'loss/train': 1.168971061706543} 11/07/2021 15:22:00 - INFO - __main__ - Step 128856: {'lr': 2.4762717207579084e-05, 'samples': 24740352, 'steps': 128855, 'loss/train': 1.3740953207015991} 11/07/2021 15:22:01 - INFO - __main__ - Step 128857: {'lr': 2.476041453076594e-05, 'samples': 24740544, 'steps': 128856, 'loss/train': 1.1212420463562012} 11/07/2021 15:22:02 - INFO - __main__ - Step 128858: {'lr': 2.4758111955442254e-05, 'samples': 24740736, 'steps': 128857, 'loss/train': 1.2109557390213013} 11/07/2021 15:22:02 - INFO - __main__ - Step 128859: {'lr': 2.4755809481609075e-05, 'samples': 24740928, 'steps': 128858, 'loss/train': 1.0950255393981934} 11/07/2021 15:22:02 - INFO - __main__ - Step 128860: {'lr': 2.475350710926752e-05, 'samples': 24741120, 'steps': 128859, 'loss/train': 0.6388876438140869} 11/07/2021 15:22:03 - INFO - __main__ - Step 128861: {'lr': 2.4751204838418502e-05, 'samples': 24741312, 'steps': 128860, 'loss/train': 1.3697364330291748} 11/07/2021 15:22:04 - INFO - __main__ - Step 128862: {'lr': 2.47489026690631e-05, 'samples': 24741504, 'steps': 128861, 'loss/train': 1.2478121519088745} 11/07/2021 15:22:04 - INFO - __main__ - Step 128863: {'lr': 2.47466006012024e-05, 'samples': 24741696, 'steps': 128862, 'loss/train': 0.7723315954208374} 11/07/2021 15:22:04 - INFO - __main__ - Step 128864: {'lr': 2.4744298634837375e-05, 'samples': 24741888, 'steps': 128863, 'loss/train': 1.5054283142089844} 11/07/2021 15:22:05 - INFO - __main__ - Step 128865: {'lr': 2.4741996769969134e-05, 'samples': 24742080, 'steps': 128864, 'loss/train': 0.9165393114089966} 11/07/2021 15:22:05 - INFO - __main__ - Step 128866: {'lr': 2.4739695006598643e-05, 'samples': 24742272, 'steps': 128865, 'loss/train': 1.082448124885559} 11/07/2021 15:22:06 - INFO - __main__ - Step 128867: {'lr': 2.4737393344726965e-05, 'samples': 24742464, 'steps': 128866, 'loss/train': 1.444786787033081} 11/07/2021 15:22:06 - INFO - __main__ - Step 128868: {'lr': 2.473509178435515e-05, 'samples': 24742656, 'steps': 128867, 'loss/train': 1.3389718532562256} 11/07/2021 15:22:07 - INFO - __main__ - Step 128869: {'lr': 2.473279032548423e-05, 'samples': 24742848, 'steps': 128868, 'loss/train': 1.1434123516082764} 11/07/2021 15:22:07 - INFO - __main__ - Step 128870: {'lr': 2.4730488968115223e-05, 'samples': 24743040, 'steps': 128869, 'loss/train': 1.2911882400512695} 11/07/2021 15:22:07 - INFO - __main__ - Step 128871: {'lr': 2.472818771224922e-05, 'samples': 24743232, 'steps': 128870, 'loss/train': 1.2667434215545654} 11/07/2021 15:22:09 - INFO - __main__ - Step 128872: {'lr': 2.4725886557887183e-05, 'samples': 24743424, 'steps': 128871, 'loss/train': 1.3111600875854492} 11/07/2021 15:22:09 - INFO - __main__ - Step 128873: {'lr': 2.4723585505030232e-05, 'samples': 24743616, 'steps': 128872, 'loss/train': 1.3103573322296143} 11/07/2021 15:22:09 - INFO - __main__ - Step 128874: {'lr': 2.4721284553679335e-05, 'samples': 24743808, 'steps': 128873, 'loss/train': 1.2480924129486084} 11/07/2021 15:22:10 - INFO - __main__ - Step 128875: {'lr': 2.4718983703835517e-05, 'samples': 24744000, 'steps': 128874, 'loss/train': 1.1985576152801514} 11/07/2021 15:22:10 - INFO - __main__ - Step 128876: {'lr': 2.471668295549989e-05, 'samples': 24744192, 'steps': 128875, 'loss/train': 1.1779839992523193} 11/07/2021 15:22:11 - INFO - __main__ - Step 128877: {'lr': 2.4714382308673428e-05, 'samples': 24744384, 'steps': 128876, 'loss/train': 0.7235305905342102} 11/07/2021 15:22:11 - INFO - __main__ - Step 128878: {'lr': 2.471208176335718e-05, 'samples': 24744576, 'steps': 128877, 'loss/train': 1.434583067893982} 11/07/2021 15:22:12 - INFO - __main__ - Step 128879: {'lr': 2.4709781319552205e-05, 'samples': 24744768, 'steps': 128878, 'loss/train': 1.3176130056381226} 11/07/2021 15:22:12 - INFO - __main__ - Step 128880: {'lr': 2.470748097725953e-05, 'samples': 24744960, 'steps': 128879, 'loss/train': 1.2597652673721313} 11/07/2021 15:22:12 - INFO - __main__ - Step 128881: {'lr': 2.4705180736480176e-05, 'samples': 24745152, 'steps': 128880, 'loss/train': 1.503339171409607} 11/07/2021 15:22:13 - INFO - __main__ - Step 128882: {'lr': 2.4702880597215178e-05, 'samples': 24745344, 'steps': 128881, 'loss/train': 1.2931169271469116} 11/07/2021 15:22:14 - INFO - __main__ - Step 128883: {'lr': 2.4700580559465615e-05, 'samples': 24745536, 'steps': 128882, 'loss/train': 1.0603055953979492} 11/07/2021 15:22:14 - INFO - __main__ - Step 128884: {'lr': 2.4698280623232483e-05, 'samples': 24745728, 'steps': 128883, 'loss/train': 1.1466875076293945} 11/07/2021 15:22:15 - INFO - __main__ - Step 128885: {'lr': 2.4695980788516815e-05, 'samples': 24745920, 'steps': 128884, 'loss/train': 1.1648327112197876} 11/07/2021 15:22:15 - INFO - __main__ - Step 128886: {'lr': 2.4693681055319717e-05, 'samples': 24746112, 'steps': 128885, 'loss/train': 1.1263738870620728} 11/07/2021 15:22:16 - INFO - __main__ - Step 128887: {'lr': 2.4691381423642133e-05, 'samples': 24746304, 'steps': 128886, 'loss/train': 1.0502270460128784} 11/07/2021 15:22:16 - INFO - __main__ - Step 128888: {'lr': 2.468908189348512e-05, 'samples': 24746496, 'steps': 128887, 'loss/train': 1.6261959075927734} 11/07/2021 15:22:17 - INFO - __main__ - Step 128889: {'lr': 2.4686782464849733e-05, 'samples': 24746688, 'steps': 128888, 'loss/train': 1.385756015777588} 11/07/2021 15:22:17 - INFO - __main__ - Step 128890: {'lr': 2.4684483137737024e-05, 'samples': 24746880, 'steps': 128889, 'loss/train': 1.1586520671844482} 11/07/2021 15:22:17 - INFO - __main__ - Step 128891: {'lr': 2.4682183912147994e-05, 'samples': 24747072, 'steps': 128890, 'loss/train': 1.3542602062225342} 11/07/2021 15:22:18 - INFO - __main__ - Step 128892: {'lr': 2.4679884788083696e-05, 'samples': 24747264, 'steps': 128891, 'loss/train': 1.2073862552642822} 11/07/2021 15:22:19 - INFO - __main__ - Step 128893: {'lr': 2.4677585765545157e-05, 'samples': 24747456, 'steps': 128892, 'loss/train': 1.3217668533325195} 11/07/2021 15:22:19 - INFO - __main__ - Step 128894: {'lr': 2.4675286844533433e-05, 'samples': 24747648, 'steps': 128893, 'loss/train': 1.3659547567367554} 11/07/2021 15:22:19 - INFO - __main__ - Step 128895: {'lr': 2.4672988025049552e-05, 'samples': 24747840, 'steps': 128894, 'loss/train': 1.9964585304260254} 11/07/2021 15:22:20 - INFO - __main__ - Step 128896: {'lr': 2.467068930709454e-05, 'samples': 24748032, 'steps': 128895, 'loss/train': 1.135772943496704} 11/07/2021 15:22:21 - INFO - __main__ - Step 128897: {'lr': 2.466839069066945e-05, 'samples': 24748224, 'steps': 128896, 'loss/train': 1.1305721998214722} 11/07/2021 15:22:21 - INFO - __main__ - Step 128898: {'lr': 2.4666092175775283e-05, 'samples': 24748416, 'steps': 128897, 'loss/train': 0.8624625205993652} 11/07/2021 15:22:22 - INFO - __main__ - Step 128899: {'lr': 2.4663793762413096e-05, 'samples': 24748608, 'steps': 128898, 'loss/train': 0.7392169237136841} 11/07/2021 15:22:22 - INFO - __main__ - Step 128900: {'lr': 2.466149545058399e-05, 'samples': 24748800, 'steps': 128899, 'loss/train': 0.7723391652107239} 11/07/2021 15:22:22 - INFO - __main__ - Step 128901: {'lr': 2.4659197240288893e-05, 'samples': 24748992, 'steps': 128900, 'loss/train': 1.334678053855896} 11/07/2021 15:22:23 - INFO - __main__ - Step 128902: {'lr': 2.465689913152888e-05, 'samples': 24749184, 'steps': 128901, 'loss/train': 1.09536612033844} 11/07/2021 15:22:24 - INFO - __main__ - Step 128903: {'lr': 2.465460112430498e-05, 'samples': 24749376, 'steps': 128902, 'loss/train': 0.12438294291496277} 11/07/2021 15:22:24 - INFO - __main__ - Step 128904: {'lr': 2.465230321861825e-05, 'samples': 24749568, 'steps': 128903, 'loss/train': 1.2090449333190918} 11/07/2021 15:22:24 - INFO - __main__ - Step 128905: {'lr': 2.465000541446971e-05, 'samples': 24749760, 'steps': 128904, 'loss/train': 0.5669840574264526} 11/07/2021 15:22:25 - INFO - __main__ - Step 128906: {'lr': 2.4647707711860394e-05, 'samples': 24749952, 'steps': 128905, 'loss/train': 1.5202984809875488} 11/07/2021 15:22:25 - INFO - __main__ - Step 128907: {'lr': 2.4645410110791354e-05, 'samples': 24750144, 'steps': 128906, 'loss/train': 1.1793535947799683} 11/07/2021 15:22:26 - INFO - __main__ - Step 128908: {'lr': 2.4643112611263618e-05, 'samples': 24750336, 'steps': 128907, 'loss/train': 1.3611316680908203} 11/07/2021 15:22:26 - INFO - __main__ - Step 128909: {'lr': 2.464081521327821e-05, 'samples': 24750528, 'steps': 128908, 'loss/train': 1.2866666316986084} 11/07/2021 15:22:27 - INFO - __main__ - Step 128910: {'lr': 2.4638517916836188e-05, 'samples': 24750720, 'steps': 128909, 'loss/train': 1.531002402305603} 11/07/2021 15:22:27 - INFO - __main__ - Step 128911: {'lr': 2.4636220721938552e-05, 'samples': 24750912, 'steps': 128910, 'loss/train': 1.2893701791763306} 11/07/2021 15:22:27 - INFO - __main__ - Step 128912: {'lr': 2.463392362858638e-05, 'samples': 24751104, 'steps': 128911, 'loss/train': 1.4797784090042114} 11/07/2021 15:22:29 - INFO - __main__ - Step 128913: {'lr': 2.4631626636780702e-05, 'samples': 24751296, 'steps': 128912, 'loss/train': 1.3575090169906616} 11/07/2021 15:22:29 - INFO - __main__ - Step 128914: {'lr': 2.4629329746522518e-05, 'samples': 24751488, 'steps': 128913, 'loss/train': 0.3031391203403473} 11/07/2021 15:22:29 - INFO - __main__ - Step 128915: {'lr': 2.462703295781285e-05, 'samples': 24751680, 'steps': 128914, 'loss/train': 0.04065944254398346} 11/07/2021 15:22:30 - INFO - __main__ - Step 128916: {'lr': 2.4624736270652787e-05, 'samples': 24751872, 'steps': 128915, 'loss/train': 1.1889394521713257} 11/07/2021 15:22:30 - INFO - __main__ - Step 128917: {'lr': 2.4622439685043324e-05, 'samples': 24752064, 'steps': 128916, 'loss/train': 0.9625918865203857} 11/07/2021 15:22:31 - INFO - __main__ - Step 128918: {'lr': 2.462014320098552e-05, 'samples': 24752256, 'steps': 128917, 'loss/train': 1.2596163749694824} 11/07/2021 15:22:31 - INFO - __main__ - Step 128919: {'lr': 2.4617846818480394e-05, 'samples': 24752448, 'steps': 128918, 'loss/train': 4.435166835784912} 11/07/2021 15:22:32 - INFO - __main__ - Step 128920: {'lr': 2.4615550537529008e-05, 'samples': 24752640, 'steps': 128919, 'loss/train': 1.3664638996124268} 11/07/2021 15:22:32 - INFO - __main__ - Step 128921: {'lr': 2.461325435813236e-05, 'samples': 24752832, 'steps': 128920, 'loss/train': 1.5363364219665527} 11/07/2021 15:22:32 - INFO - __main__ - Step 128922: {'lr': 2.46109582802915e-05, 'samples': 24753024, 'steps': 128921, 'loss/train': 1.5405620336532593} 11/07/2021 15:22:34 - INFO - __main__ - Step 128923: {'lr': 2.460866230400749e-05, 'samples': 24753216, 'steps': 128922, 'loss/train': 0.8820419907569885} 11/07/2021 15:22:34 - INFO - __main__ - Step 128924: {'lr': 2.4606366429281325e-05, 'samples': 24753408, 'steps': 128923, 'loss/train': 0.678982138633728} 11/07/2021 15:22:34 - INFO - __main__ - Step 128925: {'lr': 2.460407065611403e-05, 'samples': 24753600, 'steps': 128924, 'loss/train': 1.4834996461868286} 11/07/2021 15:22:35 - INFO - __main__ - Step 128926: {'lr': 2.460177498450669e-05, 'samples': 24753792, 'steps': 128925, 'loss/train': 1.6446553468704224} 11/07/2021 15:22:35 - INFO - __main__ - Step 128927: {'lr': 2.4599479414460336e-05, 'samples': 24753984, 'steps': 128926, 'loss/train': 1.3049424886703491} 11/07/2021 15:22:36 - INFO - __main__ - Step 128928: {'lr': 2.459718394597596e-05, 'samples': 24754176, 'steps': 128927, 'loss/train': 0.8415346741676331} 11/07/2021 15:22:36 - INFO - __main__ - Step 128929: {'lr': 2.459488857905459e-05, 'samples': 24754368, 'steps': 128928, 'loss/train': 1.4621899127960205} 11/07/2021 15:22:37 - INFO - __main__ - Step 128930: {'lr': 2.4592593313697286e-05, 'samples': 24754560, 'steps': 128929, 'loss/train': 0.14174069464206696} 11/07/2021 15:22:37 - INFO - __main__ - Step 128931: {'lr': 2.4590298149905098e-05, 'samples': 24754752, 'steps': 128930, 'loss/train': 1.4490422010421753} 11/07/2021 15:22:37 - INFO - __main__ - Step 128932: {'lr': 2.4588003087679027e-05, 'samples': 24754944, 'steps': 128931, 'loss/train': 0.950303316116333} 11/07/2021 15:22:39 - INFO - __main__ - Step 128933: {'lr': 2.4585708127020155e-05, 'samples': 24755136, 'steps': 128932, 'loss/train': 1.3425058126449585} 11/07/2021 15:22:39 - INFO - __main__ - Step 128934: {'lr': 2.4583413267929455e-05, 'samples': 24755328, 'steps': 128933, 'loss/train': 1.3639791011810303} 11/07/2021 15:22:39 - INFO - __main__ - Step 128935: {'lr': 2.4581118510408007e-05, 'samples': 24755520, 'steps': 128934, 'loss/train': 0.7752484083175659} 11/07/2021 15:22:40 - INFO - __main__ - Step 128936: {'lr': 2.457882385445681e-05, 'samples': 24755712, 'steps': 128935, 'loss/train': 0.9489630460739136} 11/07/2021 15:22:40 - INFO - __main__ - Step 128937: {'lr': 2.4576529300076977e-05, 'samples': 24755904, 'steps': 128936, 'loss/train': 1.2904224395751953} 11/07/2021 15:22:41 - INFO - __main__ - Step 128938: {'lr': 2.457423484726942e-05, 'samples': 24756096, 'steps': 128937, 'loss/train': 0.653512716293335} 11/07/2021 15:22:42 - INFO - __main__ - Step 128939: {'lr': 2.4571940496035254e-05, 'samples': 24756288, 'steps': 128938, 'loss/train': 1.4373341798782349} 11/07/2021 15:22:42 - INFO - __main__ - Step 128940: {'lr': 2.4569646246375476e-05, 'samples': 24756480, 'steps': 128939, 'loss/train': 1.1918610334396362} 11/07/2021 15:22:42 - INFO - __main__ - Step 128941: {'lr': 2.456735209829114e-05, 'samples': 24756672, 'steps': 128940, 'loss/train': 0.9795160293579102} 11/07/2021 15:22:43 - INFO - __main__ - Step 128942: {'lr': 2.4565058051783273e-05, 'samples': 24756864, 'steps': 128941, 'loss/train': 1.3625741004943848} 11/07/2021 15:22:44 - INFO - __main__ - Step 128943: {'lr': 2.456276410685293e-05, 'samples': 24757056, 'steps': 128942, 'loss/train': 1.4180777072906494} 11/07/2021 15:22:44 - INFO - __main__ - Step 128944: {'lr': 2.4560470263501112e-05, 'samples': 24757248, 'steps': 128943, 'loss/train': 1.3531347513198853} 11/07/2021 15:22:44 - INFO - __main__ - Step 128945: {'lr': 2.455817652172887e-05, 'samples': 24757440, 'steps': 128944, 'loss/train': 1.5271549224853516} 11/07/2021 15:22:45 - INFO - __main__ - Step 128946: {'lr': 2.4555882881537235e-05, 'samples': 24757632, 'steps': 128945, 'loss/train': 1.075097680091858} 11/07/2021 15:22:45 - INFO - __main__ - Step 128947: {'lr': 2.4553589342927257e-05, 'samples': 24757824, 'steps': 128946, 'loss/train': 1.0551608800888062} 11/07/2021 15:22:46 - INFO - __main__ - Step 128948: {'lr': 2.4551295905899968e-05, 'samples': 24758016, 'steps': 128947, 'loss/train': 0.9103860855102539} 11/07/2021 15:22:46 - INFO - __main__ - Step 128949: {'lr': 2.4549002570456365e-05, 'samples': 24758208, 'steps': 128948, 'loss/train': 1.1901462078094482} 11/07/2021 15:22:47 - INFO - __main__ - Step 128950: {'lr': 2.45467093365975e-05, 'samples': 24758400, 'steps': 128949, 'loss/train': 1.2505580186843872} 11/07/2021 15:22:47 - INFO - __main__ - Step 128951: {'lr': 2.4544416204324403e-05, 'samples': 24758592, 'steps': 128950, 'loss/train': 0.05517950654029846} 11/07/2021 15:22:48 - INFO - __main__ - Step 128952: {'lr': 2.4542123173638105e-05, 'samples': 24758784, 'steps': 128951, 'loss/train': 1.3172621726989746} 11/07/2021 15:22:49 - INFO - __main__ - Step 128953: {'lr': 2.4539830244539652e-05, 'samples': 24758976, 'steps': 128952, 'loss/train': 1.3827985525131226} 11/07/2021 15:22:49 - INFO - __main__ - Step 128954: {'lr': 2.4537537417030076e-05, 'samples': 24759168, 'steps': 128953, 'loss/train': 1.132537603378296} 11/07/2021 15:22:49 - INFO - __main__ - Step 128955: {'lr': 2.4535244691110403e-05, 'samples': 24759360, 'steps': 128954, 'loss/train': 1.2736235857009888} 11/07/2021 15:22:50 - INFO - __main__ - Step 128956: {'lr': 2.4532952066781662e-05, 'samples': 24759552, 'steps': 128955, 'loss/train': 1.522396206855774} 11/07/2021 15:22:50 - INFO - __main__ - Step 128957: {'lr': 2.4530659544044905e-05, 'samples': 24759744, 'steps': 128956, 'loss/train': 1.2674541473388672} 11/07/2021 15:22:50 - INFO - __main__ - Step 128958: {'lr': 2.4528367122901157e-05, 'samples': 24759936, 'steps': 128957, 'loss/train': 1.1857606172561646} 11/07/2021 15:22:52 - INFO - __main__ - Step 128959: {'lr': 2.4526074803351507e-05, 'samples': 24760128, 'steps': 128958, 'loss/train': 2.2060387134552} 11/07/2021 15:22:52 - INFO - __main__ - Step 128960: {'lr': 2.4523782585396838e-05, 'samples': 24760320, 'steps': 128959, 'loss/train': 1.2994484901428223} 11/07/2021 15:22:52 - INFO - __main__ - Step 128961: {'lr': 2.4521490469038316e-05, 'samples': 24760512, 'steps': 128960, 'loss/train': 1.1632369756698608} 11/07/2021 15:22:53 - INFO - __main__ - Step 128962: {'lr': 2.4519198454276914e-05, 'samples': 24760704, 'steps': 128961, 'loss/train': 1.8297317028045654} 11/07/2021 15:22:53 - INFO - __main__ - Step 128963: {'lr': 2.451690654111369e-05, 'samples': 24760896, 'steps': 128962, 'loss/train': 1.0910779237747192} 11/07/2021 15:22:54 - INFO - __main__ - Step 128964: {'lr': 2.4514614729549632e-05, 'samples': 24761088, 'steps': 128963, 'loss/train': 0.9701424241065979} 11/07/2021 15:22:55 - INFO - __main__ - Step 128965: {'lr': 2.4512323019585864e-05, 'samples': 24761280, 'steps': 128964, 'loss/train': 1.1147847175598145} 11/07/2021 15:22:55 - INFO - __main__ - Step 128966: {'lr': 2.451003141122332e-05, 'samples': 24761472, 'steps': 128965, 'loss/train': 1.1498692035675049} 11/07/2021 15:22:55 - INFO - __main__ - Step 128967: {'lr': 2.4507739904463088e-05, 'samples': 24761664, 'steps': 128966, 'loss/train': 1.4939756393432617} 11/07/2021 15:22:56 - INFO - __main__ - Step 128968: {'lr': 2.4505448499306192e-05, 'samples': 24761856, 'steps': 128967, 'loss/train': 1.4470144510269165} 11/07/2021 15:22:57 - INFO - __main__ - Step 128969: {'lr': 2.450315719575369e-05, 'samples': 24762048, 'steps': 128968, 'loss/train': 1.4176650047302246} 11/07/2021 15:22:57 - INFO - __main__ - Step 128970: {'lr': 2.4500865993806605e-05, 'samples': 24762240, 'steps': 128969, 'loss/train': 1.2607464790344238} 11/07/2021 15:22:57 - INFO - __main__ - Step 128971: {'lr': 2.449857489346591e-05, 'samples': 24762432, 'steps': 128970, 'loss/train': 1.2008867263793945} 11/07/2021 15:22:58 - INFO - __main__ - Step 128972: {'lr': 2.4496283894732657e-05, 'samples': 24762624, 'steps': 128971, 'loss/train': 1.2135093212127686} 11/07/2021 15:22:58 - INFO - __main__ - Step 128973: {'lr': 2.4493992997607905e-05, 'samples': 24762816, 'steps': 128972, 'loss/train': 1.810303807258606} 11/07/2021 15:22:59 - INFO - __main__ - Step 128974: {'lr': 2.4491702202092707e-05, 'samples': 24763008, 'steps': 128973, 'loss/train': 1.142859935760498} 11/07/2021 15:22:59 - INFO - __main__ - Step 128975: {'lr': 2.4489411508188035e-05, 'samples': 24763200, 'steps': 128974, 'loss/train': 0.9233965873718262} 11/07/2021 15:23:00 - INFO - __main__ - Step 128976: {'lr': 2.4487120915894974e-05, 'samples': 24763392, 'steps': 128975, 'loss/train': 1.0312261581420898} 11/07/2021 15:23:00 - INFO - __main__ - Step 128977: {'lr': 2.4484830425214543e-05, 'samples': 24763584, 'steps': 128976, 'loss/train': 1.3724291324615479} 11/07/2021 15:23:00 - INFO - __main__ - Step 128978: {'lr': 2.448254003614775e-05, 'samples': 24763776, 'steps': 128977, 'loss/train': 1.1122583150863647} 11/07/2021 15:23:02 - INFO - __main__ - Step 128979: {'lr': 2.4480249748695645e-05, 'samples': 24763968, 'steps': 128978, 'loss/train': 1.1846818923950195} 11/07/2021 15:23:02 - INFO - __main__ - Step 128980: {'lr': 2.447795956285928e-05, 'samples': 24764160, 'steps': 128979, 'loss/train': 1.23479425907135} 11/07/2021 15:23:02 - INFO - __main__ - Step 128981: {'lr': 2.447566947863969e-05, 'samples': 24764352, 'steps': 128980, 'loss/train': 1.1733118295669556} 11/07/2021 15:23:03 - INFO - __main__ - Step 128982: {'lr': 2.447337949603784e-05, 'samples': 24764544, 'steps': 128981, 'loss/train': 1.7533519268035889} 11/07/2021 15:23:03 - INFO - __main__ - Step 128983: {'lr': 2.4471089615054814e-05, 'samples': 24764736, 'steps': 128982, 'loss/train': 1.249430537223816} 11/07/2021 15:23:04 - INFO - __main__ - Step 128984: {'lr': 2.446879983569164e-05, 'samples': 24764928, 'steps': 128983, 'loss/train': 0.058090414851903915} 11/07/2021 15:23:04 - INFO - __main__ - Step 128985: {'lr': 2.4466510157949318e-05, 'samples': 24765120, 'steps': 128984, 'loss/train': 0.6919297575950623} 11/07/2021 15:23:05 - INFO - __main__ - Step 128986: {'lr': 2.4464220581828927e-05, 'samples': 24765312, 'steps': 128985, 'loss/train': 1.0261954069137573} 11/07/2021 15:23:05 - INFO - __main__ - Step 128987: {'lr': 2.446193110733147e-05, 'samples': 24765504, 'steps': 128986, 'loss/train': 1.07106351852417} 11/07/2021 15:23:06 - INFO - __main__ - Step 128988: {'lr': 2.4459641734458e-05, 'samples': 24765696, 'steps': 128987, 'loss/train': 1.6098167896270752} 11/07/2021 15:23:07 - INFO - __main__ - Step 128989: {'lr': 2.445735246320954e-05, 'samples': 24765888, 'steps': 128988, 'loss/train': 1.5424333810806274} 11/07/2021 15:23:07 - INFO - __main__ - Step 128990: {'lr': 2.44550632935871e-05, 'samples': 24766080, 'steps': 128989, 'loss/train': 1.1112757921218872} 11/07/2021 15:23:07 - INFO - __main__ - Step 128991: {'lr': 2.4452774225591724e-05, 'samples': 24766272, 'steps': 128990, 'loss/train': 1.0514620542526245} 11/07/2021 15:23:08 - INFO - __main__ - Step 128992: {'lr': 2.445048525922447e-05, 'samples': 24766464, 'steps': 128991, 'loss/train': 1.2904767990112305} 11/07/2021 15:23:08 - INFO - __main__ - Step 128993: {'lr': 2.444819639448631e-05, 'samples': 24766656, 'steps': 128992, 'loss/train': 1.341151237487793} 11/07/2021 15:23:08 - INFO - __main__ - Step 128994: {'lr': 2.4445907631378384e-05, 'samples': 24766848, 'steps': 128993, 'loss/train': 1.2793692350387573} 11/07/2021 15:23:09 - INFO - __main__ - Step 128995: {'lr': 2.4443618969901605e-05, 'samples': 24767040, 'steps': 128994, 'loss/train': 1.0788503885269165} 11/07/2021 15:23:10 - INFO - __main__ - Step 128996: {'lr': 2.4441330410057057e-05, 'samples': 24767232, 'steps': 128995, 'loss/train': 0.43585777282714844} 11/07/2021 15:23:10 - INFO - __main__ - Step 128997: {'lr': 2.4439041951845763e-05, 'samples': 24767424, 'steps': 128996, 'loss/train': 1.0486252307891846} 11/07/2021 15:23:10 - INFO - __main__ - Step 128998: {'lr': 2.443675359526873e-05, 'samples': 24767616, 'steps': 128997, 'loss/train': 1.4149729013442993} 11/07/2021 15:23:11 - INFO - __main__ - Step 128999: {'lr': 2.4434465340327032e-05, 'samples': 24767808, 'steps': 128998, 'loss/train': 1.220171332359314} 11/07/2021 15:23:12 - INFO - __main__ - Step 129000: {'lr': 2.4432177187021704e-05, 'samples': 24768000, 'steps': 128999, 'loss/train': 1.406322717666626} 11/07/2021 15:23:12 - INFO - __main__ - Step 129001: {'lr': 2.442988913535374e-05, 'samples': 24768192, 'steps': 129000, 'loss/train': 0.9529061317443848} 11/07/2021 15:23:13 - INFO - __main__ - Step 129002: {'lr': 2.4427601185324167e-05, 'samples': 24768384, 'steps': 129001, 'loss/train': 1.5490671396255493} 11/07/2021 15:23:13 - INFO - __main__ - Step 129003: {'lr': 2.4425313336934067e-05, 'samples': 24768576, 'steps': 129002, 'loss/train': 1.0756131410598755} 11/07/2021 15:23:13 - INFO - __main__ - Step 129004: {'lr': 2.4423025590184417e-05, 'samples': 24768768, 'steps': 129003, 'loss/train': 1.642729640007019} 11/07/2021 15:23:14 - INFO - __main__ - Step 129005: {'lr': 2.4420737945076265e-05, 'samples': 24768960, 'steps': 129004, 'loss/train': 1.3917217254638672} 11/07/2021 15:23:15 - INFO - __main__ - Step 129006: {'lr': 2.441845040161067e-05, 'samples': 24769152, 'steps': 129005, 'loss/train': 0.7989662885665894} 11/07/2021 15:23:15 - INFO - __main__ - Step 129007: {'lr': 2.4416162959788683e-05, 'samples': 24769344, 'steps': 129006, 'loss/train': 1.1117159128189087} 11/07/2021 15:23:15 - INFO - __main__ - Step 129008: {'lr': 2.4413875619611254e-05, 'samples': 24769536, 'steps': 129007, 'loss/train': 0.9167174696922302} 11/07/2021 15:23:16 - INFO - __main__ - Step 129009: {'lr': 2.441158838107943e-05, 'samples': 24769728, 'steps': 129008, 'loss/train': 1.0160412788391113} 11/07/2021 15:23:17 - INFO - __main__ - Step 129010: {'lr': 2.4409301244194272e-05, 'samples': 24769920, 'steps': 129009, 'loss/train': 1.2193046808242798} 11/07/2021 15:23:17 - INFO - __main__ - Step 129011: {'lr': 2.44070142089568e-05, 'samples': 24770112, 'steps': 129010, 'loss/train': 1.0959173440933228} 11/07/2021 15:23:17 - INFO - __main__ - Step 129012: {'lr': 2.440472727536805e-05, 'samples': 24770304, 'steps': 129011, 'loss/train': 3.56217098236084} 11/07/2021 15:23:18 - INFO - __main__ - Step 129013: {'lr': 2.440244044342904e-05, 'samples': 24770496, 'steps': 129012, 'loss/train': 1.0260121822357178} 11/07/2021 15:23:18 - INFO - __main__ - Step 129014: {'lr': 2.4400153713140832e-05, 'samples': 24770688, 'steps': 129013, 'loss/train': 0.8362674117088318} 11/07/2021 15:23:19 - INFO - __main__ - Step 129015: {'lr': 2.439786708450442e-05, 'samples': 24770880, 'steps': 129014, 'loss/train': 1.1482796669006348} 11/07/2021 15:23:20 - INFO - __main__ - Step 129016: {'lr': 2.4395580557520836e-05, 'samples': 24771072, 'steps': 129015, 'loss/train': 1.062798023223877} 11/07/2021 15:23:20 - INFO - __main__ - Step 129017: {'lr': 2.439329413219113e-05, 'samples': 24771264, 'steps': 129016, 'loss/train': 1.7349237203598022} 11/07/2021 15:23:20 - INFO - __main__ - Step 129018: {'lr': 2.439100780851633e-05, 'samples': 24771456, 'steps': 129017, 'loss/train': 1.4519360065460205} 11/07/2021 15:23:21 - INFO - __main__ - Step 129019: {'lr': 2.4388721586497464e-05, 'samples': 24771648, 'steps': 129018, 'loss/train': 0.8352022767066956} 11/07/2021 15:23:21 - INFO - __main__ - Step 129020: {'lr': 2.438643546613556e-05, 'samples': 24771840, 'steps': 129019, 'loss/train': 1.0824824571609497} 11/07/2021 15:23:22 - INFO - __main__ - Step 129021: {'lr': 2.438414944743167e-05, 'samples': 24772032, 'steps': 129020, 'loss/train': 1.3600050210952759} 11/07/2021 15:23:22 - INFO - __main__ - Step 129022: {'lr': 2.4381863530386766e-05, 'samples': 24772224, 'steps': 129021, 'loss/train': 1.388651967048645} 11/07/2021 15:23:23 - INFO - __main__ - Step 129023: {'lr': 2.4379577715001934e-05, 'samples': 24772416, 'steps': 129022, 'loss/train': 1.3624716997146606} 11/07/2021 15:23:23 - INFO - __main__ - Step 129024: {'lr': 2.437729200127817e-05, 'samples': 24772608, 'steps': 129023, 'loss/train': 1.8849396705627441} 11/07/2021 15:23:23 - INFO - __main__ - Step 129025: {'lr': 2.4375006389216497e-05, 'samples': 24772800, 'steps': 129024, 'loss/train': 1.590262770652771} 11/07/2021 15:23:24 - INFO - __main__ - Step 129026: {'lr': 2.4372720878817976e-05, 'samples': 24772992, 'steps': 129025, 'loss/train': 0.9622445702552795} 11/07/2021 15:23:25 - INFO - __main__ - Step 129027: {'lr': 2.4370435470083637e-05, 'samples': 24773184, 'steps': 129026, 'loss/train': 1.4252134561538696} 11/07/2021 15:23:25 - INFO - __main__ - Step 129028: {'lr': 2.4368150163014497e-05, 'samples': 24773376, 'steps': 129027, 'loss/train': 1.070030927658081} 11/07/2021 15:23:25 - INFO - __main__ - Step 129029: {'lr': 2.4365864957611562e-05, 'samples': 24773568, 'steps': 129028, 'loss/train': 1.220698356628418} 11/07/2021 15:23:26 - INFO - __main__ - Step 129030: {'lr': 2.436357985387591e-05, 'samples': 24773760, 'steps': 129029, 'loss/train': 1.2416304349899292} 11/07/2021 15:23:27 - INFO - __main__ - Step 129031: {'lr': 2.4361294851808546e-05, 'samples': 24773952, 'steps': 129030, 'loss/train': 1.0150338411331177} 11/07/2021 15:23:28 - INFO - __main__ - Step 129032: {'lr': 2.4359009951410493e-05, 'samples': 24774144, 'steps': 129031, 'loss/train': 1.2213199138641357} 11/07/2021 15:23:28 - INFO - __main__ - Step 129033: {'lr': 2.435672515268278e-05, 'samples': 24774336, 'steps': 129032, 'loss/train': 0.06318879872560501} 11/07/2021 15:23:28 - INFO - __main__ - Step 129034: {'lr': 2.4354440455626515e-05, 'samples': 24774528, 'steps': 129033, 'loss/train': 1.2089945077896118} 11/07/2021 15:23:29 - INFO - __main__ - Step 129035: {'lr': 2.4352155860242586e-05, 'samples': 24774720, 'steps': 129034, 'loss/train': 0.9287061095237732} 11/07/2021 15:23:30 - INFO - __main__ - Step 129036: {'lr': 2.4349871366532105e-05, 'samples': 24774912, 'steps': 129035, 'loss/train': 1.4217127561569214} 11/07/2021 15:23:30 - INFO - __main__ - Step 129037: {'lr': 2.43475869744961e-05, 'samples': 24775104, 'steps': 129036, 'loss/train': 1.3705213069915771} 11/07/2021 15:23:30 - INFO - __main__ - Step 129038: {'lr': 2.4345302684135594e-05, 'samples': 24775296, 'steps': 129037, 'loss/train': 1.0272513628005981} 11/07/2021 15:23:31 - INFO - __main__ - Step 129039: {'lr': 2.434301849545159e-05, 'samples': 24775488, 'steps': 129038, 'loss/train': 1.4384843111038208} 11/07/2021 15:23:31 - INFO - __main__ - Step 129040: {'lr': 2.434073440844514e-05, 'samples': 24775680, 'steps': 129039, 'loss/train': 0.6007697582244873} 11/07/2021 15:23:32 - INFO - __main__ - Step 129041: {'lr': 2.43384504231173e-05, 'samples': 24775872, 'steps': 129040, 'loss/train': 1.2430769205093384} 11/07/2021 15:23:32 - INFO - __main__ - Step 129042: {'lr': 2.4336166539469046e-05, 'samples': 24776064, 'steps': 129041, 'loss/train': 1.0120759010314941} 11/07/2021 15:23:33 - INFO - __main__ - Step 129043: {'lr': 2.433388275750145e-05, 'samples': 24776256, 'steps': 129042, 'loss/train': 1.1947112083435059} 11/07/2021 15:23:33 - INFO - __main__ - Step 129044: {'lr': 2.4331599077215493e-05, 'samples': 24776448, 'steps': 129043, 'loss/train': 1.3269381523132324} 11/07/2021 15:23:33 - INFO - __main__ - Step 129045: {'lr': 2.4329315498612282e-05, 'samples': 24776640, 'steps': 129044, 'loss/train': 0.9182097911834717} 11/07/2021 15:23:34 - INFO - __main__ - Step 129046: {'lr': 2.4327032021692758e-05, 'samples': 24776832, 'steps': 129045, 'loss/train': 1.2472747564315796} 11/07/2021 15:23:35 - INFO - __main__ - Step 129047: {'lr': 2.4324748646458008e-05, 'samples': 24777024, 'steps': 129046, 'loss/train': 1.392113447189331} 11/07/2021 15:23:35 - INFO - __main__ - Step 129048: {'lr': 2.432246537290911e-05, 'samples': 24777216, 'steps': 129047, 'loss/train': 1.0410062074661255} 11/07/2021 15:23:36 - INFO - __main__ - Step 129049: {'lr': 2.432018220104695e-05, 'samples': 24777408, 'steps': 129048, 'loss/train': 1.0998061895370483} 11/07/2021 15:23:36 - INFO - __main__ - Step 129050: {'lr': 2.4317899130872652e-05, 'samples': 24777600, 'steps': 129049, 'loss/train': 1.2207127809524536} 11/07/2021 15:23:37 - INFO - __main__ - Step 129051: {'lr': 2.43156161623872e-05, 'samples': 24777792, 'steps': 129050, 'loss/train': 1.5228745937347412} 11/07/2021 15:23:37 - INFO - __main__ - Step 129052: {'lr': 2.4313333295591683e-05, 'samples': 24777984, 'steps': 129051, 'loss/train': 1.4749873876571655} 11/07/2021 15:23:38 - INFO - __main__ - Step 129053: {'lr': 2.4311050530487074e-05, 'samples': 24778176, 'steps': 129052, 'loss/train': 1.320990800857544} 11/07/2021 15:23:38 - INFO - __main__ - Step 129054: {'lr': 2.430876786707442e-05, 'samples': 24778368, 'steps': 129053, 'loss/train': 1.4376963376998901} 11/07/2021 15:23:38 - INFO - __main__ - Step 129055: {'lr': 2.4306485305354758e-05, 'samples': 24778560, 'steps': 129054, 'loss/train': 1.5281376838684082} 11/07/2021 15:23:39 - INFO - __main__ - Step 129056: {'lr': 2.4304202845329136e-05, 'samples': 24778752, 'steps': 129055, 'loss/train': 1.2294005155563354} 11/07/2021 15:23:40 - INFO - __main__ - Step 129057: {'lr': 2.430192048699853e-05, 'samples': 24778944, 'steps': 129056, 'loss/train': 1.3368433713912964} 11/07/2021 15:23:40 - INFO - __main__ - Step 129058: {'lr': 2.429963823036399e-05, 'samples': 24779136, 'steps': 129057, 'loss/train': 1.4015973806381226} 11/07/2021 15:23:41 - INFO - __main__ - Step 129059: {'lr': 2.4297356075426575e-05, 'samples': 24779328, 'steps': 129058, 'loss/train': 1.014078974723816} 11/07/2021 15:23:41 - INFO - __main__ - Step 129060: {'lr': 2.429507402218728e-05, 'samples': 24779520, 'steps': 129059, 'loss/train': 1.7332011461257935} 11/07/2021 15:23:41 - INFO - __main__ - Step 129061: {'lr': 2.4292792070647163e-05, 'samples': 24779712, 'steps': 129060, 'loss/train': 1.4503474235534668} 11/07/2021 15:23:42 - INFO - __main__ - Step 129062: {'lr': 2.429051022080722e-05, 'samples': 24779904, 'steps': 129061, 'loss/train': 0.4330430328845978} 11/07/2021 15:23:43 - INFO - __main__ - Step 129063: {'lr': 2.4288228472668483e-05, 'samples': 24780096, 'steps': 129062, 'loss/train': 1.1544322967529297} 11/07/2021 15:23:43 - INFO - __main__ - Step 129064: {'lr': 2.4285946826231976e-05, 'samples': 24780288, 'steps': 129063, 'loss/train': 1.2352591753005981} 11/07/2021 15:23:43 - INFO - __main__ - Step 129065: {'lr': 2.4283665281498725e-05, 'samples': 24780480, 'steps': 129064, 'loss/train': 1.3967968225479126} 11/07/2021 15:23:44 - INFO - __main__ - Step 129066: {'lr': 2.4281383838469784e-05, 'samples': 24780672, 'steps': 129065, 'loss/train': 0.7893984317779541} 11/07/2021 15:23:45 - INFO - __main__ - Step 129067: {'lr': 2.4279102497146183e-05, 'samples': 24780864, 'steps': 129066, 'loss/train': 1.1102300882339478} 11/07/2021 15:23:45 - INFO - __main__ - Step 129068: {'lr': 2.427682125752892e-05, 'samples': 24781056, 'steps': 129067, 'loss/train': 1.4197756052017212} 11/07/2021 15:23:46 - INFO - __main__ - Step 129069: {'lr': 2.427454011961905e-05, 'samples': 24781248, 'steps': 129068, 'loss/train': 0.8704010248184204} 11/07/2021 15:23:46 - INFO - __main__ - Step 129070: {'lr': 2.427225908341757e-05, 'samples': 24781440, 'steps': 129069, 'loss/train': 1.2079371213912964} 11/07/2021 15:23:46 - INFO - __main__ - Step 129071: {'lr': 2.4269978148925566e-05, 'samples': 24781632, 'steps': 129070, 'loss/train': 1.6659085750579834} 11/07/2021 15:23:47 - INFO - __main__ - Step 129072: {'lr': 2.426769731614398e-05, 'samples': 24781824, 'steps': 129071, 'loss/train': 1.022937536239624} 11/07/2021 15:23:48 - INFO - __main__ - Step 129073: {'lr': 2.4265416585073917e-05, 'samples': 24782016, 'steps': 129072, 'loss/train': 1.2269803285598755} 11/07/2021 15:23:48 - INFO - __main__ - Step 129074: {'lr': 2.426313595571636e-05, 'samples': 24782208, 'steps': 129073, 'loss/train': 0.09212709218263626} 11/07/2021 15:23:49 - INFO - __main__ - Step 129075: {'lr': 2.426085542807241e-05, 'samples': 24782400, 'steps': 129074, 'loss/train': 0.46080923080444336} 11/07/2021 15:23:49 - INFO - __main__ - Step 129076: {'lr': 2.4258575002142956e-05, 'samples': 24782592, 'steps': 129075, 'loss/train': 1.7099244594573975} 11/07/2021 15:23:49 - INFO - __main__ - Step 129077: {'lr': 2.4256294677929142e-05, 'samples': 24782784, 'steps': 129076, 'loss/train': 1.3243807554244995} 11/07/2021 15:23:50 - INFO - __main__ - Step 129078: {'lr': 2.4254014455431933e-05, 'samples': 24782976, 'steps': 129077, 'loss/train': 1.3701008558273315} 11/07/2021 15:23:51 - INFO - __main__ - Step 129079: {'lr': 2.4251734334652414e-05, 'samples': 24783168, 'steps': 129078, 'loss/train': 1.0722007751464844} 11/07/2021 15:23:51 - INFO - __main__ - Step 129080: {'lr': 2.4249454315591557e-05, 'samples': 24783360, 'steps': 129079, 'loss/train': 0.9962363243103027} 11/07/2021 15:23:51 - INFO - __main__ - Step 129081: {'lr': 2.4247174398250415e-05, 'samples': 24783552, 'steps': 129080, 'loss/train': 1.1492692232131958} 11/07/2021 15:23:52 - INFO - __main__ - Step 129082: {'lr': 2.424489458262999e-05, 'samples': 24783744, 'steps': 129081, 'loss/train': 1.3334765434265137} 11/07/2021 15:23:53 - INFO - __main__ - Step 129083: {'lr': 2.4242614868731362e-05, 'samples': 24783936, 'steps': 129082, 'loss/train': 1.049641728401184} 11/07/2021 15:23:53 - INFO - __main__ - Step 129084: {'lr': 2.424033525655553e-05, 'samples': 24784128, 'steps': 129083, 'loss/train': 1.4933836460113525} 11/07/2021 15:23:53 - INFO - __main__ - Step 129085: {'lr': 2.4238055746103494e-05, 'samples': 24784320, 'steps': 129084, 'loss/train': 1.5547672510147095} 11/07/2021 15:23:54 - INFO - __main__ - Step 129086: {'lr': 2.4235776337376337e-05, 'samples': 24784512, 'steps': 129085, 'loss/train': 1.249322772026062} 11/07/2021 15:23:54 - INFO - __main__ - Step 129087: {'lr': 2.4233497030375028e-05, 'samples': 24784704, 'steps': 129086, 'loss/train': 1.2260624170303345} 11/07/2021 15:23:55 - INFO - __main__ - Step 129088: {'lr': 2.423121782510068e-05, 'samples': 24784896, 'steps': 129087, 'loss/train': 1.2101740837097168} 11/07/2021 15:23:55 - INFO - __main__ - Step 129089: {'lr': 2.422893872155421e-05, 'samples': 24785088, 'steps': 129088, 'loss/train': 1.218027949333191} 11/07/2021 15:23:56 - INFO - __main__ - Step 129090: {'lr': 2.42266597197367e-05, 'samples': 24785280, 'steps': 129089, 'loss/train': 1.197817087173462} 11/07/2021 15:23:56 - INFO - __main__ - Step 129091: {'lr': 2.422438081964917e-05, 'samples': 24785472, 'steps': 129090, 'loss/train': 0.8578238487243652} 11/07/2021 15:23:56 - INFO - __main__ - Step 129092: {'lr': 2.422210202129266e-05, 'samples': 24785664, 'steps': 129091, 'loss/train': 1.2286654710769653} 11/07/2021 15:23:58 - INFO - __main__ - Step 129093: {'lr': 2.4219823324668184e-05, 'samples': 24785856, 'steps': 129092, 'loss/train': 1.3248766660690308} 11/07/2021 15:23:58 - INFO - __main__ - Step 129094: {'lr': 2.4217544729776774e-05, 'samples': 24786048, 'steps': 129093, 'loss/train': 1.2448759078979492} 11/07/2021 15:23:58 - INFO - __main__ - Step 129095: {'lr': 2.4215266236619432e-05, 'samples': 24786240, 'steps': 129094, 'loss/train': 1.1811376810073853} 11/07/2021 15:23:59 - INFO - __main__ - Step 129096: {'lr': 2.421298784519724e-05, 'samples': 24786432, 'steps': 129095, 'loss/train': 1.4649258852005005} 11/07/2021 15:23:59 - INFO - __main__ - Step 129097: {'lr': 2.4210709555511163e-05, 'samples': 24786624, 'steps': 129096, 'loss/train': 1.1544522047042847} 11/07/2021 15:23:59 - INFO - __main__ - Step 129098: {'lr': 2.420843136756229e-05, 'samples': 24786816, 'steps': 129097, 'loss/train': 1.2658408880233765} 11/07/2021 15:24:00 - INFO - __main__ - Step 129099: {'lr': 2.420615328135159e-05, 'samples': 24787008, 'steps': 129098, 'loss/train': 4.137935161590576} 11/07/2021 15:24:01 - INFO - __main__ - Step 129100: {'lr': 2.4203875296880117e-05, 'samples': 24787200, 'steps': 129099, 'loss/train': 1.1169242858886719} 11/07/2021 15:24:01 - INFO - __main__ - Step 129101: {'lr': 2.4201597414148873e-05, 'samples': 24787392, 'steps': 129100, 'loss/train': 1.2120071649551392} 11/07/2021 15:24:02 - INFO - __main__ - Step 129102: {'lr': 2.4199319633158967e-05, 'samples': 24787584, 'steps': 129101, 'loss/train': 0.14308537542819977} 11/07/2021 15:24:02 - INFO - __main__ - Step 129103: {'lr': 2.4197041953911342e-05, 'samples': 24787776, 'steps': 129102, 'loss/train': 1.2063292264938354} 11/07/2021 15:24:03 - INFO - __main__ - Step 129104: {'lr': 2.4194764376407024e-05, 'samples': 24787968, 'steps': 129103, 'loss/train': 1.0510386228561401} 11/07/2021 15:24:03 - INFO - __main__ - Step 129105: {'lr': 2.4192486900647044e-05, 'samples': 24788160, 'steps': 129104, 'loss/train': 1.5251948833465576} 11/07/2021 15:24:04 - INFO - __main__ - Step 129106: {'lr': 2.4190209526632478e-05, 'samples': 24788352, 'steps': 129105, 'loss/train': 1.466784954071045} 11/07/2021 15:24:04 - INFO - __main__ - Step 129107: {'lr': 2.4187932254364303e-05, 'samples': 24788544, 'steps': 129106, 'loss/train': 1.2284842729568481} 11/07/2021 15:24:04 - INFO - __main__ - Step 129108: {'lr': 2.4185655083843544e-05, 'samples': 24788736, 'steps': 129107, 'loss/train': 4.992117404937744} 11/07/2021 15:24:05 - INFO - __main__ - Step 129109: {'lr': 2.4183378015071257e-05, 'samples': 24788928, 'steps': 129108, 'loss/train': 0.6172676682472229} 11/07/2021 15:24:06 - INFO - __main__ - Step 129110: {'lr': 2.4181101048048466e-05, 'samples': 24789120, 'steps': 129109, 'loss/train': 0.04756626859307289} 11/07/2021 15:24:06 - INFO - __main__ - Step 129111: {'lr': 2.417882418277617e-05, 'samples': 24789312, 'steps': 129110, 'loss/train': 1.2840352058410645} 11/07/2021 15:24:07 - INFO - __main__ - Step 129112: {'lr': 2.417654741925543e-05, 'samples': 24789504, 'steps': 129111, 'loss/train': 0.6366530060768127} 11/07/2021 15:24:07 - INFO - __main__ - Step 129113: {'lr': 2.4174270757487238e-05, 'samples': 24789696, 'steps': 129112, 'loss/train': 1.1471315622329712} 11/07/2021 15:24:07 - INFO - __main__ - Step 129114: {'lr': 2.4171994197472652e-05, 'samples': 24789888, 'steps': 129113, 'loss/train': 1.1654490232467651} 11/07/2021 15:24:08 - INFO - __main__ - Step 129115: {'lr': 2.41697177392127e-05, 'samples': 24790080, 'steps': 129114, 'loss/train': 0.12597018480300903} 11/07/2021 15:24:09 - INFO - __main__ - Step 129116: {'lr': 2.416744138270838e-05, 'samples': 24790272, 'steps': 129115, 'loss/train': 1.1494369506835938} 11/07/2021 15:24:09 - INFO - __main__ - Step 129117: {'lr': 2.4165165127960686e-05, 'samples': 24790464, 'steps': 129116, 'loss/train': 1.4462683200836182} 11/07/2021 15:24:09 - INFO - __main__ - Step 129118: {'lr': 2.416288897497071e-05, 'samples': 24790656, 'steps': 129117, 'loss/train': 1.377591609954834} 11/07/2021 15:24:10 - INFO - __main__ - Step 129119: {'lr': 2.4160612923739444e-05, 'samples': 24790848, 'steps': 129118, 'loss/train': 0.8742223381996155} 11/07/2021 15:24:11 - INFO - __main__ - Step 129120: {'lr': 2.4158336974267918e-05, 'samples': 24791040, 'steps': 129119, 'loss/train': 1.523949384689331} 11/07/2021 15:24:11 - INFO - __main__ - Step 129121: {'lr': 2.4156061126557162e-05, 'samples': 24791232, 'steps': 129120, 'loss/train': 0.8526588082313538} 11/07/2021 15:24:11 - INFO - __main__ - Step 129122: {'lr': 2.4153785380608195e-05, 'samples': 24791424, 'steps': 129121, 'loss/train': 1.5233708620071411} 11/07/2021 15:24:12 - INFO - __main__ - Step 129123: {'lr': 2.415150973642205e-05, 'samples': 24791616, 'steps': 129122, 'loss/train': 1.2179934978485107} 11/07/2021 15:24:12 - INFO - __main__ - Step 129124: {'lr': 2.4149234193999753e-05, 'samples': 24791808, 'steps': 129123, 'loss/train': 1.2626920938491821} 11/07/2021 15:24:13 - INFO - __main__ - Step 129125: {'lr': 2.414695875334233e-05, 'samples': 24792000, 'steps': 129124, 'loss/train': 1.2150496244430542} 11/07/2021 15:24:14 - INFO - __main__ - Step 129126: {'lr': 2.414468341445081e-05, 'samples': 24792192, 'steps': 129125, 'loss/train': 1.4739729166030884} 11/07/2021 15:24:14 - INFO - __main__ - Step 129127: {'lr': 2.4142408177326185e-05, 'samples': 24792384, 'steps': 129126, 'loss/train': 1.390537977218628} 11/07/2021 15:24:14 - INFO - __main__ - Step 129128: {'lr': 2.4140133041969574e-05, 'samples': 24792576, 'steps': 129127, 'loss/train': 1.3832093477249146} 11/07/2021 15:24:15 - INFO - __main__ - Step 129129: {'lr': 2.4137858008381862e-05, 'samples': 24792768, 'steps': 129128, 'loss/train': 0.29692983627319336} 11/07/2021 15:24:16 - INFO - __main__ - Step 129130: {'lr': 2.4135583076564162e-05, 'samples': 24792960, 'steps': 129129, 'loss/train': 1.2427787780761719} 11/07/2021 15:24:16 - INFO - __main__ - Step 129131: {'lr': 2.4133308246517494e-05, 'samples': 24793152, 'steps': 129130, 'loss/train': 0.8900996446609497} 11/07/2021 15:24:16 - INFO - __main__ - Step 129132: {'lr': 2.4131033518242862e-05, 'samples': 24793344, 'steps': 129131, 'loss/train': 1.4644170999526978} 11/07/2021 15:24:17 - INFO - __main__ - Step 129133: {'lr': 2.412875889174129e-05, 'samples': 24793536, 'steps': 129132, 'loss/train': 1.136155128479004} 11/07/2021 15:24:17 - INFO - __main__ - Step 129134: {'lr': 2.412648436701384e-05, 'samples': 24793728, 'steps': 129133, 'loss/train': 1.4424288272857666} 11/07/2021 15:24:18 - INFO - __main__ - Step 129135: {'lr': 2.4124209944061476e-05, 'samples': 24793920, 'steps': 129134, 'loss/train': 0.7636460065841675} 11/07/2021 15:24:18 - INFO - __main__ - Step 129136: {'lr': 2.4121935622885284e-05, 'samples': 24794112, 'steps': 129135, 'loss/train': 1.2789241075515747} 11/07/2021 15:24:19 - INFO - __main__ - Step 129137: {'lr': 2.411966140348626e-05, 'samples': 24794304, 'steps': 129136, 'loss/train': 1.0103869438171387} 11/07/2021 15:24:19 - INFO - __main__ - Step 129138: {'lr': 2.4117387285865432e-05, 'samples': 24794496, 'steps': 129137, 'loss/train': 0.9670966267585754} 11/07/2021 15:24:19 - INFO - __main__ - Step 129139: {'lr': 2.41151132700238e-05, 'samples': 24794688, 'steps': 129138, 'loss/train': 1.4434630870819092} 11/07/2021 15:24:21 - INFO - __main__ - Step 129140: {'lr': 2.4112839355962453e-05, 'samples': 24794880, 'steps': 129139, 'loss/train': 1.7422577142715454} 11/07/2021 15:24:21 - INFO - __main__ - Step 129141: {'lr': 2.411056554368235e-05, 'samples': 24795072, 'steps': 129140, 'loss/train': 1.6897555589675903} 11/07/2021 15:24:21 - INFO - __main__ - Step 129142: {'lr': 2.410829183318458e-05, 'samples': 24795264, 'steps': 129141, 'loss/train': 1.0045311450958252} 11/07/2021 15:24:22 - INFO - __main__ - Step 129143: {'lr': 2.410601822447009e-05, 'samples': 24795456, 'steps': 129142, 'loss/train': 1.238523244857788} 11/07/2021 15:24:22 - INFO - __main__ - Step 129144: {'lr': 2.4103744717539927e-05, 'samples': 24795648, 'steps': 129143, 'loss/train': 1.495995283126831} 11/07/2021 15:24:23 - INFO - __main__ - Step 129145: {'lr': 2.410147131239515e-05, 'samples': 24795840, 'steps': 129144, 'loss/train': 1.5508953332901} 11/07/2021 15:24:23 - INFO - __main__ - Step 129146: {'lr': 2.409919800903676e-05, 'samples': 24796032, 'steps': 129145, 'loss/train': 1.2392899990081787} 11/07/2021 15:24:24 - INFO - __main__ - Step 129147: {'lr': 2.4096924807465807e-05, 'samples': 24796224, 'steps': 129146, 'loss/train': 1.479594349861145} 11/07/2021 15:24:24 - INFO - __main__ - Step 129148: {'lr': 2.409465170768327e-05, 'samples': 24796416, 'steps': 129147, 'loss/train': 1.3612020015716553} 11/07/2021 15:24:24 - INFO - __main__ - Step 129149: {'lr': 2.4092378709690193e-05, 'samples': 24796608, 'steps': 129148, 'loss/train': 1.2739169597625732} 11/07/2021 15:24:26 - INFO - __main__ - Step 129150: {'lr': 2.4090105813487612e-05, 'samples': 24796800, 'steps': 129149, 'loss/train': 1.225538969039917} 11/07/2021 15:24:26 - INFO - __main__ - Step 129151: {'lr': 2.4087833019076548e-05, 'samples': 24796992, 'steps': 129150, 'loss/train': 1.0190651416778564} 11/07/2021 15:24:26 - INFO - __main__ - Step 129152: {'lr': 2.4085560326458007e-05, 'samples': 24797184, 'steps': 129151, 'loss/train': 1.2494946718215942} 11/07/2021 15:24:27 - INFO - __main__ - Step 129153: {'lr': 2.4083287735633036e-05, 'samples': 24797376, 'steps': 129152, 'loss/train': 0.8777867555618286} 11/07/2021 15:24:27 - INFO - __main__ - Step 129154: {'lr': 2.4081015246602638e-05, 'samples': 24797568, 'steps': 129153, 'loss/train': 0.9571378231048584} 11/07/2021 15:24:28 - INFO - __main__ - Step 129155: {'lr': 2.4078742859367924e-05, 'samples': 24797760, 'steps': 129154, 'loss/train': 1.7552160024642944} 11/07/2021 15:24:28 - INFO - __main__ - Step 129156: {'lr': 2.4076470573929756e-05, 'samples': 24797952, 'steps': 129155, 'loss/train': 1.6094310283660889} 11/07/2021 15:24:29 - INFO - __main__ - Step 129157: {'lr': 2.4074198390289264e-05, 'samples': 24798144, 'steps': 129156, 'loss/train': 1.2674310207366943} 11/07/2021 15:24:29 - INFO - __main__ - Step 129158: {'lr': 2.4071926308447454e-05, 'samples': 24798336, 'steps': 129157, 'loss/train': 1.307910442352295} 11/07/2021 15:24:29 - INFO - __main__ - Step 129159: {'lr': 2.4069654328405355e-05, 'samples': 24798528, 'steps': 129158, 'loss/train': 1.326750636100769} 11/07/2021 15:24:31 - INFO - __main__ - Step 129160: {'lr': 2.406738245016396e-05, 'samples': 24798720, 'steps': 129159, 'loss/train': 1.4090352058410645} 11/07/2021 15:24:31 - INFO - __main__ - Step 129161: {'lr': 2.4065110673724357e-05, 'samples': 24798912, 'steps': 129160, 'loss/train': 0.9208015203475952} 11/07/2021 15:24:31 - INFO - __main__ - Step 129162: {'lr': 2.4062838999087484e-05, 'samples': 24799104, 'steps': 129161, 'loss/train': 1.0524368286132812} 11/07/2021 15:24:32 - INFO - __main__ - Step 129163: {'lr': 2.4060567426254427e-05, 'samples': 24799296, 'steps': 129162, 'loss/train': 1.2240151166915894} 11/07/2021 15:24:32 - INFO - __main__ - Step 129164: {'lr': 2.4058295955226183e-05, 'samples': 24799488, 'steps': 129163, 'loss/train': 1.657089114189148} 11/07/2021 15:24:33 - INFO - __main__ - Step 129165: {'lr': 2.405602458600381e-05, 'samples': 24799680, 'steps': 129164, 'loss/train': 1.1587715148925781} 11/07/2021 15:24:34 - INFO - __main__ - Step 129166: {'lr': 2.405375331858828e-05, 'samples': 24799872, 'steps': 129165, 'loss/train': 1.1552051305770874} 11/07/2021 15:24:34 - INFO - __main__ - Step 129167: {'lr': 2.4051482152980668e-05, 'samples': 24800064, 'steps': 129166, 'loss/train': 1.2963614463806152} 11/07/2021 15:24:34 - INFO - __main__ - Step 129168: {'lr': 2.4049211089181954e-05, 'samples': 24800256, 'steps': 129167, 'loss/train': 1.0438508987426758} 11/07/2021 15:24:35 - INFO - __main__ - Step 129169: {'lr': 2.4046940127193216e-05, 'samples': 24800448, 'steps': 129168, 'loss/train': 1.105702519416809} 11/07/2021 15:24:36 - INFO - __main__ - Step 129170: {'lr': 2.4044669267015402e-05, 'samples': 24800640, 'steps': 129169, 'loss/train': 0.31198549270629883} 11/07/2021 15:24:36 - INFO - __main__ - Step 129171: {'lr': 2.4042398508649587e-05, 'samples': 24800832, 'steps': 129170, 'loss/train': 1.2565348148345947} 11/07/2021 15:24:36 - INFO - __main__ - Step 129172: {'lr': 2.4040127852096775e-05, 'samples': 24801024, 'steps': 129171, 'loss/train': 1.312651515007019} 11/07/2021 15:24:37 - INFO - __main__ - Step 129173: {'lr': 2.4037857297357968e-05, 'samples': 24801216, 'steps': 129172, 'loss/train': 1.1935272216796875} 11/07/2021 15:24:37 - INFO - __main__ - Step 129174: {'lr': 2.4035586844434242e-05, 'samples': 24801408, 'steps': 129173, 'loss/train': 1.2972019910812378} 11/07/2021 15:24:37 - INFO - __main__ - Step 129175: {'lr': 2.40333164933266e-05, 'samples': 24801600, 'steps': 129174, 'loss/train': 1.5075584650039673} 11/07/2021 15:24:39 - INFO - __main__ - Step 129176: {'lr': 2.4031046244036043e-05, 'samples': 24801792, 'steps': 129175, 'loss/train': 1.220647931098938} 11/07/2021 15:24:39 - INFO - __main__ - Step 129177: {'lr': 2.402877609656362e-05, 'samples': 24801984, 'steps': 129176, 'loss/train': 1.4158220291137695} 11/07/2021 15:24:39 - INFO - __main__ - Step 129178: {'lr': 2.4026506050910333e-05, 'samples': 24802176, 'steps': 129177, 'loss/train': 1.3778998851776123} 11/07/2021 15:24:40 - INFO - __main__ - Step 129179: {'lr': 2.4024236107077214e-05, 'samples': 24802368, 'steps': 129178, 'loss/train': 0.8231167197227478} 11/07/2021 15:24:40 - INFO - __main__ - Step 129180: {'lr': 2.402196626506528e-05, 'samples': 24802560, 'steps': 129179, 'loss/train': 0.9217475652694702} 11/07/2021 15:24:41 - INFO - __main__ - Step 129181: {'lr': 2.4019696524875596e-05, 'samples': 24802752, 'steps': 129180, 'loss/train': 1.2135989665985107} 11/07/2021 15:24:41 - INFO - __main__ - Step 129182: {'lr': 2.4017426886509154e-05, 'samples': 24802944, 'steps': 129181, 'loss/train': 1.2154526710510254} 11/07/2021 15:24:42 - INFO - __main__ - Step 129183: {'lr': 2.4015157349966955e-05, 'samples': 24803136, 'steps': 129182, 'loss/train': 0.8648806214332581} 11/07/2021 15:24:42 - INFO - __main__ - Step 129184: {'lr': 2.4012887915250026e-05, 'samples': 24803328, 'steps': 129183, 'loss/train': 1.5744787454605103} 11/07/2021 15:24:42 - INFO - __main__ - Step 129185: {'lr': 2.4010618582359423e-05, 'samples': 24803520, 'steps': 129184, 'loss/train': 1.064882755279541} 11/07/2021 15:24:44 - INFO - __main__ - Step 129186: {'lr': 2.4008349351296116e-05, 'samples': 24803712, 'steps': 129185, 'loss/train': 1.0954992771148682} 11/07/2021 15:24:44 - INFO - __main__ - Step 129187: {'lr': 2.400608022206119e-05, 'samples': 24803904, 'steps': 129186, 'loss/train': 1.430604100227356} 11/07/2021 15:24:44 - INFO - __main__ - Step 129188: {'lr': 2.400381119465561e-05, 'samples': 24804096, 'steps': 129187, 'loss/train': 1.4819482564926147} 11/07/2021 15:24:45 - INFO - __main__ - Step 129189: {'lr': 2.4001542269080438e-05, 'samples': 24804288, 'steps': 129188, 'loss/train': 0.9999305605888367} 11/07/2021 15:24:45 - INFO - __main__ - Step 129190: {'lr': 2.399927344533667e-05, 'samples': 24804480, 'steps': 129189, 'loss/train': 0.7797994613647461} 11/07/2021 15:24:46 - INFO - __main__ - Step 129191: {'lr': 2.3997004723425363e-05, 'samples': 24804672, 'steps': 129190, 'loss/train': 1.2897053956985474} 11/07/2021 15:24:46 - INFO - __main__ - Step 129192: {'lr': 2.3994736103347514e-05, 'samples': 24804864, 'steps': 129191, 'loss/train': 0.6289142966270447} 11/07/2021 15:24:47 - INFO - __main__ - Step 129193: {'lr': 2.399246758510415e-05, 'samples': 24805056, 'steps': 129192, 'loss/train': 0.9221065640449524} 11/07/2021 15:24:47 - INFO - __main__ - Step 129194: {'lr': 2.399019916869627e-05, 'samples': 24805248, 'steps': 129193, 'loss/train': 1.0291385650634766} 11/07/2021 15:24:47 - INFO - __main__ - Step 129195: {'lr': 2.3987930854124985e-05, 'samples': 24805440, 'steps': 129194, 'loss/train': 1.128686785697937} 11/07/2021 15:24:48 - INFO - __main__ - Step 129196: {'lr': 2.398566264139121e-05, 'samples': 24805632, 'steps': 129195, 'loss/train': 0.8397374153137207} 11/07/2021 15:24:49 - INFO - __main__ - Step 129197: {'lr': 2.3983394530496e-05, 'samples': 24805824, 'steps': 129196, 'loss/train': 0.9369224905967712} 11/07/2021 15:24:49 - INFO - __main__ - Step 129198: {'lr': 2.398112652144038e-05, 'samples': 24806016, 'steps': 129197, 'loss/train': 1.3217965364456177} 11/07/2021 15:24:50 - INFO - __main__ - Step 129199: {'lr': 2.3978858614225386e-05, 'samples': 24806208, 'steps': 129198, 'loss/train': 1.344841718673706} 11/07/2021 15:24:50 - INFO - __main__ - Step 129200: {'lr': 2.3976590808852032e-05, 'samples': 24806400, 'steps': 129199, 'loss/train': 1.3721476793289185} 11/07/2021 15:24:51 - INFO - __main__ - Step 129201: {'lr': 2.397432310532133e-05, 'samples': 24806592, 'steps': 129200, 'loss/train': 1.1821421384811401} 11/07/2021 15:24:51 - INFO - __main__ - Step 129202: {'lr': 2.3972055503634322e-05, 'samples': 24806784, 'steps': 129201, 'loss/train': 0.9232248663902283} 11/07/2021 15:24:52 - INFO - __main__ - Step 129203: {'lr': 2.3969788003792013e-05, 'samples': 24806976, 'steps': 129202, 'loss/train': 1.3201864957809448} 11/07/2021 15:24:52 - INFO - __main__ - Step 129204: {'lr': 2.3967520605795408e-05, 'samples': 24807168, 'steps': 129203, 'loss/train': 1.4272369146347046} 11/07/2021 15:24:52 - INFO - __main__ - Step 129205: {'lr': 2.396525330964558e-05, 'samples': 24807360, 'steps': 129204, 'loss/train': 0.8010483980178833} 11/07/2021 15:24:53 - INFO - __main__ - Step 129206: {'lr': 2.396298611534356e-05, 'samples': 24807552, 'steps': 129205, 'loss/train': 1.3144841194152832} 11/07/2021 15:24:54 - INFO - __main__ - Step 129207: {'lr': 2.3960719022890264e-05, 'samples': 24807744, 'steps': 129206, 'loss/train': 0.9558438062667847} 11/07/2021 15:24:54 - INFO - __main__ - Step 129208: {'lr': 2.3958452032286805e-05, 'samples': 24807936, 'steps': 129207, 'loss/train': 1.0605720281600952} 11/07/2021 15:24:54 - INFO - __main__ - Step 129209: {'lr': 2.3956185143534177e-05, 'samples': 24808128, 'steps': 129208, 'loss/train': 1.2917382717132568} 11/07/2021 15:24:55 - INFO - __main__ - Step 129210: {'lr': 2.395391835663338e-05, 'samples': 24808320, 'steps': 129209, 'loss/train': 1.1672663688659668} 11/07/2021 15:24:56 - INFO - __main__ - Step 129211: {'lr': 2.3951651671585474e-05, 'samples': 24808512, 'steps': 129210, 'loss/train': 1.1760613918304443} 11/07/2021 15:24:56 - INFO - __main__ - Step 129212: {'lr': 2.394938508839148e-05, 'samples': 24808704, 'steps': 129211, 'loss/train': 0.7989407181739807} 11/07/2021 15:24:56 - INFO - __main__ - Step 129213: {'lr': 2.39471186070524e-05, 'samples': 24808896, 'steps': 129212, 'loss/train': 1.2818115949630737} 11/07/2021 15:24:57 - INFO - __main__ - Step 129214: {'lr': 2.3944852227569232e-05, 'samples': 24809088, 'steps': 129213, 'loss/train': 1.2686744928359985} 11/07/2021 15:24:57 - INFO - __main__ - Step 129215: {'lr': 2.394258594994306e-05, 'samples': 24809280, 'steps': 129214, 'loss/train': 1.3820973634719849} 11/07/2021 15:24:58 - INFO - __main__ - Step 129216: {'lr': 2.394031977417485e-05, 'samples': 24809472, 'steps': 129215, 'loss/train': 1.4828888177871704} 11/07/2021 15:24:59 - INFO - __main__ - Step 129217: {'lr': 2.3938053700265694e-05, 'samples': 24809664, 'steps': 129216, 'loss/train': 1.4085999727249146} 11/07/2021 15:24:59 - INFO - __main__ - Step 129218: {'lr': 2.393578772821653e-05, 'samples': 24809856, 'steps': 129217, 'loss/train': 1.6055002212524414} 11/07/2021 15:24:59 - INFO - __main__ - Step 129219: {'lr': 2.3933521858028385e-05, 'samples': 24810048, 'steps': 129218, 'loss/train': 1.2361723184585571} 11/07/2021 15:25:00 - INFO - __main__ - Step 129220: {'lr': 2.393125608970234e-05, 'samples': 24810240, 'steps': 129219, 'loss/train': 1.2270207405090332} 11/07/2021 15:25:00 - INFO - __main__ - Step 129221: {'lr': 2.3928990423239345e-05, 'samples': 24810432, 'steps': 129220, 'loss/train': 1.598254680633545} 11/07/2021 15:25:01 - INFO - __main__ - Step 129222: {'lr': 2.3926724858640475e-05, 'samples': 24810624, 'steps': 129221, 'loss/train': 1.4089235067367554} 11/07/2021 15:25:02 - INFO - __main__ - Step 129223: {'lr': 2.3924459395906763e-05, 'samples': 24810816, 'steps': 129222, 'loss/train': 1.4853729009628296} 11/07/2021 15:25:02 - INFO - __main__ - Step 129224: {'lr': 2.3922194035039174e-05, 'samples': 24811008, 'steps': 129223, 'loss/train': 1.027393102645874} 11/07/2021 15:25:02 - INFO - __main__ - Step 129225: {'lr': 2.391992877603874e-05, 'samples': 24811200, 'steps': 129224, 'loss/train': 1.4793553352355957} 11/07/2021 15:25:03 - INFO - __main__ - Step 129226: {'lr': 2.3917663618906516e-05, 'samples': 24811392, 'steps': 129225, 'loss/train': 1.4577984809875488} 11/07/2021 15:25:04 - INFO - __main__ - Step 129227: {'lr': 2.3915398563643498e-05, 'samples': 24811584, 'steps': 129226, 'loss/train': 2.252218246459961} 11/07/2021 15:25:04 - INFO - __main__ - Step 129228: {'lr': 2.391313361025077e-05, 'samples': 24811776, 'steps': 129227, 'loss/train': 1.06498384475708} 11/07/2021 15:25:04 - INFO - __main__ - Step 129229: {'lr': 2.391086875872925e-05, 'samples': 24811968, 'steps': 129228, 'loss/train': 0.22498153150081635} 11/07/2021 15:25:05 - INFO - __main__ - Step 129230: {'lr': 2.3908604009079988e-05, 'samples': 24812160, 'steps': 129229, 'loss/train': 1.6344923973083496} 11/07/2021 15:25:05 - INFO - __main__ - Step 129231: {'lr': 2.390633936130404e-05, 'samples': 24812352, 'steps': 129230, 'loss/train': 0.1355360448360443} 11/07/2021 15:25:06 - INFO - __main__ - Step 129232: {'lr': 2.390407481540238e-05, 'samples': 24812544, 'steps': 129231, 'loss/train': 0.8417227268218994} 11/07/2021 15:25:07 - INFO - __main__ - Step 129233: {'lr': 2.3901810371376066e-05, 'samples': 24812736, 'steps': 129232, 'loss/train': 1.3158338069915771} 11/07/2021 15:25:07 - INFO - __main__ - Step 129234: {'lr': 2.3899546029226116e-05, 'samples': 24812928, 'steps': 129233, 'loss/train': 0.849611222743988} 11/07/2021 15:25:07 - INFO - __main__ - Step 129235: {'lr': 2.3897281788953535e-05, 'samples': 24813120, 'steps': 129234, 'loss/train': 1.069873332977295} 11/07/2021 15:25:08 - INFO - __main__ - Step 129236: {'lr': 2.3895017650559377e-05, 'samples': 24813312, 'steps': 129235, 'loss/train': 1.5534579753875732} 11/07/2021 15:25:09 - INFO - __main__ - Step 129237: {'lr': 2.3892753614044583e-05, 'samples': 24813504, 'steps': 129236, 'loss/train': 1.3452134132385254} 11/07/2021 15:25:09 - INFO - __main__ - Step 129238: {'lr': 2.3890489679410264e-05, 'samples': 24813696, 'steps': 129237, 'loss/train': 0.5307647585868835} 11/07/2021 15:25:09 - INFO - __main__ - Step 129239: {'lr': 2.3888225846657425e-05, 'samples': 24813888, 'steps': 129238, 'loss/train': 0.8661346435546875} 11/07/2021 15:25:10 - INFO - __main__ - Step 129240: {'lr': 2.388596211578703e-05, 'samples': 24814080, 'steps': 129239, 'loss/train': 1.4350448846817017} 11/07/2021 15:25:10 - INFO - __main__ - Step 129241: {'lr': 2.3883698486800136e-05, 'samples': 24814272, 'steps': 129240, 'loss/train': 1.256723403930664} 11/07/2021 15:25:11 - INFO - __main__ - Step 129242: {'lr': 2.3881434959697746e-05, 'samples': 24814464, 'steps': 129241, 'loss/train': 1.49372398853302} 11/07/2021 15:25:11 - INFO - __main__ - Step 129243: {'lr': 2.3879171534480907e-05, 'samples': 24814656, 'steps': 129242, 'loss/train': 1.6843701601028442} 11/07/2021 15:25:12 - INFO - __main__ - Step 129244: {'lr': 2.38769082111506e-05, 'samples': 24814848, 'steps': 129243, 'loss/train': 1.122175931930542} 11/07/2021 15:25:12 - INFO - __main__ - Step 129245: {'lr': 2.38746449897079e-05, 'samples': 24815040, 'steps': 129244, 'loss/train': 1.1719238758087158} 11/07/2021 15:25:12 - INFO - __main__ - Step 129246: {'lr': 2.3872381870153782e-05, 'samples': 24815232, 'steps': 129245, 'loss/train': 0.8628831505775452} 11/07/2021 15:25:14 - INFO - __main__ - Step 129247: {'lr': 2.387011885248927e-05, 'samples': 24815424, 'steps': 129246, 'loss/train': 1.2936680316925049} 11/07/2021 15:25:14 - INFO - __main__ - Step 129248: {'lr': 2.3867855936715392e-05, 'samples': 24815616, 'steps': 129247, 'loss/train': 1.2049709558486938} 11/07/2021 15:25:14 - INFO - __main__ - Step 129249: {'lr': 2.386559312283318e-05, 'samples': 24815808, 'steps': 129248, 'loss/train': 1.3459136486053467} 11/07/2021 15:25:15 - INFO - __main__ - Step 129250: {'lr': 2.386333041084368e-05, 'samples': 24816000, 'steps': 129249, 'loss/train': 0.030395179986953735} 11/07/2021 15:25:15 - INFO - __main__ - Step 129251: {'lr': 2.3861067800747842e-05, 'samples': 24816192, 'steps': 129250, 'loss/train': 1.2773329019546509} 11/07/2021 15:25:15 - INFO - __main__ - Step 129252: {'lr': 2.3858805292546694e-05, 'samples': 24816384, 'steps': 129251, 'loss/train': 1.4257385730743408} 11/07/2021 15:25:16 - INFO - __main__ - Step 129253: {'lr': 2.3856542886241285e-05, 'samples': 24816576, 'steps': 129252, 'loss/train': 1.277342677116394} 11/07/2021 15:25:17 - INFO - __main__ - Step 129254: {'lr': 2.3854280581832642e-05, 'samples': 24816768, 'steps': 129253, 'loss/train': 1.3596935272216797} 11/07/2021 15:25:17 - INFO - __main__ - Step 129255: {'lr': 2.385201837932177e-05, 'samples': 24816960, 'steps': 129254, 'loss/train': 1.417066216468811} 11/07/2021 15:25:17 - INFO - __main__ - Step 129256: {'lr': 2.3849756278709665e-05, 'samples': 24817152, 'steps': 129255, 'loss/train': 0.9633712768554688} 11/07/2021 15:25:18 - INFO - __main__ - Step 129257: {'lr': 2.3847494279997412e-05, 'samples': 24817344, 'steps': 129256, 'loss/train': 0.9413549304008484} 11/07/2021 15:25:19 - INFO - __main__ - Step 129258: {'lr': 2.384523238318595e-05, 'samples': 24817536, 'steps': 129257, 'loss/train': 1.4827828407287598} 11/07/2021 15:25:20 - INFO - __main__ - Step 129259: {'lr': 2.3842970588276362e-05, 'samples': 24817728, 'steps': 129258, 'loss/train': 1.742149829864502} 11/07/2021 15:25:20 - INFO - __main__ - Step 129260: {'lr': 2.3840708895269626e-05, 'samples': 24817920, 'steps': 129259, 'loss/train': 1.739904522895813} 11/07/2021 15:25:20 - INFO - __main__ - Step 129261: {'lr': 2.383844730416676e-05, 'samples': 24818112, 'steps': 129260, 'loss/train': 0.9040241241455078} 11/07/2021 15:25:21 - INFO - __main__ - Step 129262: {'lr': 2.3836185814968826e-05, 'samples': 24818304, 'steps': 129261, 'loss/train': 0.7693390846252441} 11/07/2021 15:25:21 - INFO - __main__ - Step 129263: {'lr': 2.3833924427676874e-05, 'samples': 24818496, 'steps': 129262, 'loss/train': 1.0751683712005615} 11/07/2021 15:25:22 - INFO - __main__ - Step 129264: {'lr': 2.3831663142291794e-05, 'samples': 24818688, 'steps': 129263, 'loss/train': 0.5961649417877197} 11/07/2021 15:25:22 - INFO - __main__ - Step 129265: {'lr': 2.3829401958814695e-05, 'samples': 24818880, 'steps': 129264, 'loss/train': 1.185717225074768} 11/07/2021 15:25:23 - INFO - __main__ - Step 129266: {'lr': 2.3827140877246552e-05, 'samples': 24819072, 'steps': 129265, 'loss/train': 1.1290836334228516} 11/07/2021 15:25:23 - INFO - __main__ - Step 129267: {'lr': 2.3824879897588443e-05, 'samples': 24819264, 'steps': 129266, 'loss/train': 1.5449390411376953} 11/07/2021 15:25:23 - INFO - __main__ - Step 129268: {'lr': 2.3822619019841313e-05, 'samples': 24819456, 'steps': 129267, 'loss/train': 1.130516529083252} 11/07/2021 15:25:24 - INFO - __main__ - Step 129269: {'lr': 2.3820358244006246e-05, 'samples': 24819648, 'steps': 129268, 'loss/train': 1.2110404968261719} 11/07/2021 15:25:25 - INFO - __main__ - Step 129270: {'lr': 2.381809757008424e-05, 'samples': 24819840, 'steps': 129269, 'loss/train': 1.705000638961792} 11/07/2021 15:25:25 - INFO - __main__ - Step 129271: {'lr': 2.3815836998076294e-05, 'samples': 24820032, 'steps': 129270, 'loss/train': 1.2791028022766113} 11/07/2021 15:25:25 - INFO - __main__ - Step 129272: {'lr': 2.3813576527983466e-05, 'samples': 24820224, 'steps': 129271, 'loss/train': 1.6689947843551636} 11/07/2021 15:25:26 - INFO - __main__ - Step 129273: {'lr': 2.3811316159806722e-05, 'samples': 24820416, 'steps': 129272, 'loss/train': 1.2322032451629639} 11/07/2021 15:25:27 - INFO - __main__ - Step 129274: {'lr': 2.380905589354712e-05, 'samples': 24820608, 'steps': 129273, 'loss/train': 1.367461919784546} 11/07/2021 15:25:27 - INFO - __main__ - Step 129275: {'lr': 2.380679572920566e-05, 'samples': 24820800, 'steps': 129274, 'loss/train': 1.6469075679779053} 11/07/2021 15:25:27 - INFO - __main__ - Step 129276: {'lr': 2.3804535666783423e-05, 'samples': 24820992, 'steps': 129275, 'loss/train': 1.3384345769882202} 11/07/2021 15:25:28 - INFO - __main__ - Step 129277: {'lr': 2.3802275706281322e-05, 'samples': 24821184, 'steps': 129276, 'loss/train': 1.2371673583984375} 11/07/2021 15:25:28 - INFO - __main__ - Step 129278: {'lr': 2.380001584770042e-05, 'samples': 24821376, 'steps': 129277, 'loss/train': 1.7292300462722778} 11/07/2021 15:25:29 - INFO - __main__ - Step 129279: {'lr': 2.379775609104176e-05, 'samples': 24821568, 'steps': 129278, 'loss/train': 1.021072268486023} 11/07/2021 15:25:30 - INFO - __main__ - Step 129280: {'lr': 2.3795496436306324e-05, 'samples': 24821760, 'steps': 129279, 'loss/train': 1.1220561265945435} 11/07/2021 15:25:30 - INFO - __main__ - Step 129281: {'lr': 2.379323688349516e-05, 'samples': 24821952, 'steps': 129280, 'loss/train': 0.05051514506340027} 11/07/2021 15:25:30 - INFO - __main__ - Step 129282: {'lr': 2.3790977432609244e-05, 'samples': 24822144, 'steps': 129281, 'loss/train': 1.4424793720245361} 11/07/2021 15:25:31 - INFO - __main__ - Step 129283: {'lr': 2.3788718083649658e-05, 'samples': 24822336, 'steps': 129282, 'loss/train': 0.5155863165855408} 11/07/2021 15:25:31 - INFO - __main__ - Step 129284: {'lr': 2.378645883661737e-05, 'samples': 24822528, 'steps': 129283, 'loss/train': 0.9115086197853088} 11/07/2021 15:25:32 - INFO - __main__ - Step 129285: {'lr': 2.3784199691513408e-05, 'samples': 24822720, 'steps': 129284, 'loss/train': 1.3338277339935303} 11/07/2021 15:25:33 - INFO - __main__ - Step 129286: {'lr': 2.37819406483388e-05, 'samples': 24822912, 'steps': 129285, 'loss/train': 1.0714671611785889} 11/07/2021 15:25:33 - INFO - __main__ - Step 129287: {'lr': 2.3779681707094546e-05, 'samples': 24823104, 'steps': 129286, 'loss/train': 1.3232498168945312} 11/07/2021 15:25:33 - INFO - __main__ - Step 129288: {'lr': 2.37774228677817e-05, 'samples': 24823296, 'steps': 129287, 'loss/train': 1.3554050922393799} 11/07/2021 15:25:34 - INFO - __main__ - Step 129289: {'lr': 2.3775164130401234e-05, 'samples': 24823488, 'steps': 129288, 'loss/train': 1.414124608039856} 11/07/2021 15:25:35 - INFO - __main__ - Step 129290: {'lr': 2.3772905494954254e-05, 'samples': 24823680, 'steps': 129289, 'loss/train': 1.0448769330978394} 11/07/2021 15:25:35 - INFO - __main__ - Step 129291: {'lr': 2.3770646961441655e-05, 'samples': 24823872, 'steps': 129290, 'loss/train': 1.219991683959961} 11/07/2021 15:25:36 - INFO - __main__ - Step 129292: {'lr': 2.3768388529864514e-05, 'samples': 24824064, 'steps': 129291, 'loss/train': 0.24916929006576538} 11/07/2021 15:25:36 - INFO - __main__ - Step 129293: {'lr': 2.3766130200223835e-05, 'samples': 24824256, 'steps': 129292, 'loss/train': 1.3561919927597046} 11/07/2021 15:25:36 - INFO - __main__ - Step 129294: {'lr': 2.3763871972520667e-05, 'samples': 24824448, 'steps': 129293, 'loss/train': 0.15610650181770325} 11/07/2021 15:25:37 - INFO - __main__ - Step 129295: {'lr': 2.3761613846755986e-05, 'samples': 24824640, 'steps': 129294, 'loss/train': 0.96929931640625} 11/07/2021 15:25:38 - INFO - __main__ - Step 129296: {'lr': 2.3759355822930843e-05, 'samples': 24824832, 'steps': 129295, 'loss/train': 2.006768226623535} 11/07/2021 15:25:38 - INFO - __main__ - Step 129297: {'lr': 2.3757097901046244e-05, 'samples': 24825024, 'steps': 129296, 'loss/train': 1.1885576248168945} 11/07/2021 15:25:38 - INFO - __main__ - Step 129298: {'lr': 2.3754840081103206e-05, 'samples': 24825216, 'steps': 129297, 'loss/train': 0.9301033616065979} 11/07/2021 15:25:39 - INFO - __main__ - Step 129299: {'lr': 2.3752582363102737e-05, 'samples': 24825408, 'steps': 129298, 'loss/train': 1.2259551286697388} 11/07/2021 15:25:40 - INFO - __main__ - Step 129300: {'lr': 2.375032474704586e-05, 'samples': 24825600, 'steps': 129299, 'loss/train': 1.6340733766555786} 11/07/2021 15:25:40 - INFO - __main__ - Step 129301: {'lr': 2.374806723293363e-05, 'samples': 24825792, 'steps': 129300, 'loss/train': 0.9930968284606934} 11/07/2021 15:25:41 - INFO - __main__ - Step 129302: {'lr': 2.3745809820766988e-05, 'samples': 24825984, 'steps': 129301, 'loss/train': 1.3221601247787476} 11/07/2021 15:25:41 - INFO - __main__ - Step 129303: {'lr': 2.3743552510547052e-05, 'samples': 24826176, 'steps': 129302, 'loss/train': 1.0962717533111572} 11/07/2021 15:25:41 - INFO - __main__ - Step 129304: {'lr': 2.3741295302274758e-05, 'samples': 24826368, 'steps': 129303, 'loss/train': 1.516879677772522} 11/07/2021 15:25:42 - INFO - __main__ - Step 129305: {'lr': 2.3739038195951106e-05, 'samples': 24826560, 'steps': 129304, 'loss/train': 1.1639045476913452} 11/07/2021 15:25:43 - INFO - __main__ - Step 129306: {'lr': 2.3736781191577183e-05, 'samples': 24826752, 'steps': 129305, 'loss/train': 1.505062222480774} 11/07/2021 15:25:43 - INFO - __main__ - Step 129307: {'lr': 2.3734524289153958e-05, 'samples': 24826944, 'steps': 129306, 'loss/train': 1.1516878604888916} 11/07/2021 15:25:43 - INFO - __main__ - Step 129308: {'lr': 2.3732267488682458e-05, 'samples': 24827136, 'steps': 129307, 'loss/train': 1.423857569694519} 11/07/2021 15:25:44 - INFO - __main__ - Step 129309: {'lr': 2.3730010790163737e-05, 'samples': 24827328, 'steps': 129308, 'loss/train': 1.3710927963256836} 11/07/2021 15:25:45 - INFO - __main__ - Step 129310: {'lr': 2.372775419359874e-05, 'samples': 24827520, 'steps': 129309, 'loss/train': 1.3995071649551392} 11/07/2021 15:25:45 - INFO - __main__ - Step 129311: {'lr': 2.3725497698988546e-05, 'samples': 24827712, 'steps': 129310, 'loss/train': 0.794392466545105} 11/07/2021 15:25:46 - INFO - __main__ - Step 129312: {'lr': 2.372324130633416e-05, 'samples': 24827904, 'steps': 129311, 'loss/train': 1.371120810508728} 11/07/2021 15:25:46 - INFO - __main__ - Step 129313: {'lr': 2.3720985015636575e-05, 'samples': 24828096, 'steps': 129312, 'loss/train': 0.7762079238891602} 11/07/2021 15:25:46 - INFO - __main__ - Step 129314: {'lr': 2.371872882689685e-05, 'samples': 24828288, 'steps': 129313, 'loss/train': 0.6344159841537476} 11/07/2021 15:25:47 - INFO - __main__ - Step 129315: {'lr': 2.371647274011596e-05, 'samples': 24828480, 'steps': 129314, 'loss/train': 1.0461536645889282} 11/07/2021 15:25:48 - INFO - __main__ - Step 129316: {'lr': 2.371421675529492e-05, 'samples': 24828672, 'steps': 129315, 'loss/train': 1.7819761037826538} 11/07/2021 15:25:48 - INFO - __main__ - Step 129317: {'lr': 2.3711960872434825e-05, 'samples': 24828864, 'steps': 129316, 'loss/train': 1.1514415740966797} 11/07/2021 15:25:48 - INFO - __main__ - Step 129318: {'lr': 2.3709705091536555e-05, 'samples': 24829056, 'steps': 129317, 'loss/train': 1.0949666500091553} 11/07/2021 15:25:49 - INFO - __main__ - Step 129319: {'lr': 2.3707449412601224e-05, 'samples': 24829248, 'steps': 129318, 'loss/train': 1.5068424940109253} 11/07/2021 15:25:49 - INFO - __main__ - Step 129320: {'lr': 2.370519383562983e-05, 'samples': 24829440, 'steps': 129319, 'loss/train': 1.1651132106781006} 11/07/2021 15:25:50 - INFO - __main__ - Step 129321: {'lr': 2.3702938360623373e-05, 'samples': 24829632, 'steps': 129320, 'loss/train': 1.1348108053207397} 11/07/2021 15:25:51 - INFO - __main__ - Step 129322: {'lr': 2.3700682987582878e-05, 'samples': 24829824, 'steps': 129321, 'loss/train': 1.436432957649231} 11/07/2021 15:25:51 - INFO - __main__ - Step 129323: {'lr': 2.3698427716509375e-05, 'samples': 24830016, 'steps': 129322, 'loss/train': 1.5594686269760132} 11/07/2021 15:25:51 - INFO - __main__ - Step 129324: {'lr': 2.369617254740386e-05, 'samples': 24830208, 'steps': 129323, 'loss/train': 0.03774389624595642} 11/07/2021 15:25:52 - INFO - __main__ - Step 129325: {'lr': 2.3693917480267363e-05, 'samples': 24830400, 'steps': 129324, 'loss/train': 1.1302118301391602} 11/07/2021 15:25:53 - INFO - __main__ - Step 129326: {'lr': 2.3691662515100883e-05, 'samples': 24830592, 'steps': 129325, 'loss/train': 1.4749603271484375} 11/07/2021 15:25:53 - INFO - __main__ - Step 129327: {'lr': 2.3689407651905443e-05, 'samples': 24830784, 'steps': 129326, 'loss/train': 1.4218246936798096} 11/07/2021 15:25:53 - INFO - __main__ - Step 129328: {'lr': 2.3687152890682074e-05, 'samples': 24830976, 'steps': 129327, 'loss/train': 1.0987353324890137} 11/07/2021 15:25:54 - INFO - __main__ - Step 129329: {'lr': 2.3684898231431802e-05, 'samples': 24831168, 'steps': 129328, 'loss/train': 1.2099229097366333} 11/07/2021 15:25:54 - INFO - __main__ - Step 129330: {'lr': 2.368264367415565e-05, 'samples': 24831360, 'steps': 129329, 'loss/train': 1.3103339672088623} 11/07/2021 15:25:55 - INFO - __main__ - Step 129331: {'lr': 2.368038921885454e-05, 'samples': 24831552, 'steps': 129330, 'loss/train': 1.484627604484558} 11/07/2021 15:25:55 - INFO - __main__ - Step 129332: {'lr': 2.367813486552958e-05, 'samples': 24831744, 'steps': 129331, 'loss/train': 1.1456456184387207} 11/07/2021 15:25:56 - INFO - __main__ - Step 129333: {'lr': 2.3675880614181744e-05, 'samples': 24831936, 'steps': 129332, 'loss/train': 1.1778385639190674} 11/07/2021 15:25:56 - INFO - __main__ - Step 129334: {'lr': 2.3673626464812082e-05, 'samples': 24832128, 'steps': 129333, 'loss/train': 1.3676509857177734} 11/07/2021 15:25:56 - INFO - __main__ - Step 129335: {'lr': 2.3671372417421592e-05, 'samples': 24832320, 'steps': 129334, 'loss/train': 1.340385913848877} 11/07/2021 15:25:57 - INFO - __main__ - Step 129336: {'lr': 2.366911847201128e-05, 'samples': 24832512, 'steps': 129335, 'loss/train': 1.3111181259155273} 11/07/2021 15:25:58 - INFO - __main__ - Step 129337: {'lr': 2.3666864628582168e-05, 'samples': 24832704, 'steps': 129336, 'loss/train': 0.8995756506919861} 11/07/2021 15:25:58 - INFO - __main__ - Step 129338: {'lr': 2.3664610887135286e-05, 'samples': 24832896, 'steps': 129337, 'loss/train': 1.4053817987442017} 11/07/2021 15:25:58 - INFO - __main__ - Step 129339: {'lr': 2.366235724767163e-05, 'samples': 24833088, 'steps': 129338, 'loss/train': 1.2351518869400024} 11/07/2021 15:25:59 - INFO - __main__ - Step 129340: {'lr': 2.36601037101922e-05, 'samples': 24833280, 'steps': 129339, 'loss/train': 0.9709149599075317} 11/07/2021 15:26:00 - INFO - __main__ - Step 129341: {'lr': 2.365785027469808e-05, 'samples': 24833472, 'steps': 129340, 'loss/train': 1.164092779159546} 11/07/2021 15:26:00 - INFO - __main__ - Step 129342: {'lr': 2.365559694119021e-05, 'samples': 24833664, 'steps': 129341, 'loss/train': 1.6063976287841797} 11/07/2021 15:26:01 - INFO - __main__ - Step 129343: {'lr': 2.3653343709669652e-05, 'samples': 24833856, 'steps': 129342, 'loss/train': 1.3250930309295654} 11/07/2021 15:26:01 - INFO - __main__ - Step 129344: {'lr': 2.3651090580137423e-05, 'samples': 24834048, 'steps': 129343, 'loss/train': 1.379935622215271} 11/07/2021 15:26:01 - INFO - __main__ - Step 129345: {'lr': 2.3648837552594504e-05, 'samples': 24834240, 'steps': 129344, 'loss/train': 0.0392066165804863} 11/07/2021 15:26:02 - INFO - __main__ - Step 129346: {'lr': 2.3646584627041917e-05, 'samples': 24834432, 'steps': 129345, 'loss/train': 1.121275782585144} 11/07/2021 15:26:03 - INFO - __main__ - Step 129347: {'lr': 2.3644331803480663e-05, 'samples': 24834624, 'steps': 129346, 'loss/train': 1.4619635343551636} 11/07/2021 15:26:03 - INFO - __main__ - Step 129348: {'lr': 2.364207908191182e-05, 'samples': 24834816, 'steps': 129347, 'loss/train': 1.045938491821289} 11/07/2021 15:26:03 - INFO - __main__ - Step 129349: {'lr': 2.363982646233634e-05, 'samples': 24835008, 'steps': 129348, 'loss/train': 0.9896507263183594} 11/07/2021 15:26:04 - INFO - __main__ - Step 129350: {'lr': 2.363757394475527e-05, 'samples': 24835200, 'steps': 129349, 'loss/train': 0.5948420166969299} 11/07/2021 15:26:04 - INFO - __main__ - Step 129351: {'lr': 2.3635321529169585e-05, 'samples': 24835392, 'steps': 129350, 'loss/train': 1.1289782524108887} 11/07/2021 15:26:05 - INFO - __main__ - Step 129352: {'lr': 2.3633069215580366e-05, 'samples': 24835584, 'steps': 129351, 'loss/train': 1.2710845470428467} 11/07/2021 15:26:06 - INFO - __main__ - Step 129353: {'lr': 2.3630817003988587e-05, 'samples': 24835776, 'steps': 129352, 'loss/train': 1.5904256105422974} 11/07/2021 15:26:06 - INFO - __main__ - Step 129354: {'lr': 2.362856489439527e-05, 'samples': 24835968, 'steps': 129353, 'loss/train': 1.2541263103485107} 11/07/2021 15:26:06 - INFO - __main__ - Step 129355: {'lr': 2.3626312886801423e-05, 'samples': 24836160, 'steps': 129354, 'loss/train': 1.459928274154663} 11/07/2021 15:26:07 - INFO - __main__ - Step 129356: {'lr': 2.3624060981208062e-05, 'samples': 24836352, 'steps': 129355, 'loss/train': 1.2746851444244385} 11/07/2021 15:26:08 - INFO - __main__ - Step 129357: {'lr': 2.362180917761625e-05, 'samples': 24836544, 'steps': 129356, 'loss/train': 1.182978630065918} 11/07/2021 15:26:08 - INFO - __main__ - Step 129358: {'lr': 2.3619557476026925e-05, 'samples': 24836736, 'steps': 129357, 'loss/train': 1.144586205482483} 11/07/2021 15:26:08 - INFO - __main__ - Step 129359: {'lr': 2.361730587644112e-05, 'samples': 24836928, 'steps': 129358, 'loss/train': 1.278488278388977} 11/07/2021 15:26:09 - INFO - __main__ - Step 129360: {'lr': 2.3615054378859885e-05, 'samples': 24837120, 'steps': 129359, 'loss/train': 1.170271873474121} 11/07/2021 15:26:09 - INFO - __main__ - Step 129361: {'lr': 2.361280298328419e-05, 'samples': 24837312, 'steps': 129360, 'loss/train': 1.374953031539917} 11/07/2021 15:26:10 - INFO - __main__ - Step 129362: {'lr': 2.36105516897151e-05, 'samples': 24837504, 'steps': 129361, 'loss/train': 1.1626843214035034} 11/07/2021 15:26:10 - INFO - __main__ - Step 129363: {'lr': 2.3608300498153574e-05, 'samples': 24837696, 'steps': 129362, 'loss/train': 1.5459725856781006} 11/07/2021 15:26:11 - INFO - __main__ - Step 129364: {'lr': 2.3606049408600672e-05, 'samples': 24837888, 'steps': 129363, 'loss/train': 0.8959010243415833} 11/07/2021 15:26:11 - INFO - __main__ - Step 129365: {'lr': 2.3603798421057365e-05, 'samples': 24838080, 'steps': 129364, 'loss/train': 1.6951879262924194} 11/07/2021 15:26:12 - INFO - __main__ - Step 129366: {'lr': 2.3601547535524735e-05, 'samples': 24838272, 'steps': 129365, 'loss/train': 1.7250542640686035} 11/07/2021 15:26:13 - INFO - __main__ - Step 129367: {'lr': 2.359929675200373e-05, 'samples': 24838464, 'steps': 129366, 'loss/train': 0.9061518311500549} 11/07/2021 15:26:13 - INFO - __main__ - Step 129368: {'lr': 2.359704607049537e-05, 'samples': 24838656, 'steps': 129367, 'loss/train': 1.3201442956924438} 11/07/2021 15:26:13 - INFO - __main__ - Step 129369: {'lr': 2.3594795491000713e-05, 'samples': 24838848, 'steps': 129368, 'loss/train': 0.9245727062225342} 11/07/2021 15:26:14 - INFO - __main__ - Step 129370: {'lr': 2.3592545013520734e-05, 'samples': 24839040, 'steps': 129369, 'loss/train': 1.3246915340423584} 11/07/2021 15:26:14 - INFO - __main__ - Step 129371: {'lr': 2.359029463805651e-05, 'samples': 24839232, 'steps': 129370, 'loss/train': 1.2114957571029663} 11/07/2021 15:26:15 - INFO - __main__ - Step 129372: {'lr': 2.3588044364608983e-05, 'samples': 24839424, 'steps': 129371, 'loss/train': 1.1394851207733154} 11/07/2021 15:26:15 - INFO - __main__ - Step 129373: {'lr': 2.358579419317916e-05, 'samples': 24839616, 'steps': 129372, 'loss/train': 1.3823878765106201} 11/07/2021 15:26:16 - INFO - __main__ - Step 129374: {'lr': 2.358354412376809e-05, 'samples': 24839808, 'steps': 129373, 'loss/train': 1.4239264726638794} 11/07/2021 15:26:16 - INFO - __main__ - Step 129375: {'lr': 2.3581294156376805e-05, 'samples': 24840000, 'steps': 129374, 'loss/train': 1.5394583940505981} 11/07/2021 15:26:16 - INFO - __main__ - Step 129376: {'lr': 2.357904429100627e-05, 'samples': 24840192, 'steps': 129375, 'loss/train': 0.8660029172897339} 11/07/2021 15:26:18 - INFO - __main__ - Step 129377: {'lr': 2.3576794527657512e-05, 'samples': 24840384, 'steps': 129376, 'loss/train': 1.352357268333435} 11/07/2021 15:26:18 - INFO - __main__ - Step 129378: {'lr': 2.3574544866331566e-05, 'samples': 24840576, 'steps': 129377, 'loss/train': 1.3525149822235107} 11/07/2021 15:26:18 - INFO - __main__ - Step 129379: {'lr': 2.357229530702945e-05, 'samples': 24840768, 'steps': 129378, 'loss/train': 2.60840106010437} 11/07/2021 15:26:19 - INFO - __main__ - Step 129380: {'lr': 2.357004584975217e-05, 'samples': 24840960, 'steps': 129379, 'loss/train': 1.4033395051956177} 11/07/2021 15:26:19 - INFO - __main__ - Step 129381: {'lr': 2.3567796494500722e-05, 'samples': 24841152, 'steps': 129380, 'loss/train': 1.1821385622024536} 11/07/2021 15:26:20 - INFO - __main__ - Step 129382: {'lr': 2.356554724127613e-05, 'samples': 24841344, 'steps': 129381, 'loss/train': 1.1283413171768188} 11/07/2021 15:26:20 - INFO - __main__ - Step 129383: {'lr': 2.35632980900794e-05, 'samples': 24841536, 'steps': 129382, 'loss/train': 1.3885732889175415} 11/07/2021 15:26:21 - INFO - __main__ - Step 129384: {'lr': 2.3561049040911608e-05, 'samples': 24841728, 'steps': 129383, 'loss/train': 1.240735411643982} 11/07/2021 15:26:21 - INFO - __main__ - Step 129385: {'lr': 2.3558800093773675e-05, 'samples': 24841920, 'steps': 129384, 'loss/train': 1.3139338493347168} 11/07/2021 15:26:21 - INFO - __main__ - Step 129386: {'lr': 2.3556551248666623e-05, 'samples': 24842112, 'steps': 129385, 'loss/train': 1.1497056484222412} 11/07/2021 15:26:22 - INFO - __main__ - Step 129387: {'lr': 2.355430250559151e-05, 'samples': 24842304, 'steps': 129386, 'loss/train': 0.7012307643890381} 11/07/2021 15:26:23 - INFO - __main__ - Step 129388: {'lr': 2.3552053864549367e-05, 'samples': 24842496, 'steps': 129387, 'loss/train': 1.2099864482879639} 11/07/2021 15:26:23 - INFO - __main__ - Step 129389: {'lr': 2.3549805325541128e-05, 'samples': 24842688, 'steps': 129388, 'loss/train': 1.4809473752975464} 11/07/2021 15:26:24 - INFO - __main__ - Step 129390: {'lr': 2.3547556888567883e-05, 'samples': 24842880, 'steps': 129389, 'loss/train': 1.3792674541473389} 11/07/2021 15:26:24 - INFO - __main__ - Step 129391: {'lr': 2.3545308553630602e-05, 'samples': 24843072, 'steps': 129390, 'loss/train': 1.4919418096542358} 11/07/2021 15:26:25 - INFO - __main__ - Step 129392: {'lr': 2.354306032073031e-05, 'samples': 24843264, 'steps': 129391, 'loss/train': 1.3837718963623047} 11/07/2021 15:26:25 - INFO - __main__ - Step 129393: {'lr': 2.3540812189868005e-05, 'samples': 24843456, 'steps': 129392, 'loss/train': 1.076170563697815} 11/07/2021 15:26:26 - INFO - __main__ - Step 129394: {'lr': 2.3538564161044745e-05, 'samples': 24843648, 'steps': 129393, 'loss/train': 1.1147438287734985} 11/07/2021 15:26:26 - INFO - __main__ - Step 129395: {'lr': 2.35363162342615e-05, 'samples': 24843840, 'steps': 129394, 'loss/train': 0.8100666999816895} 11/07/2021 15:26:26 - INFO - __main__ - Step 129396: {'lr': 2.35340684095193e-05, 'samples': 24844032, 'steps': 129395, 'loss/train': 1.8084397315979004} 11/07/2021 15:26:27 - INFO - __main__ - Step 129397: {'lr': 2.3531820686819195e-05, 'samples': 24844224, 'steps': 129396, 'loss/train': 1.05332612991333} 11/07/2021 15:26:28 - INFO - __main__ - Step 129398: {'lr': 2.35295730661621e-05, 'samples': 24844416, 'steps': 129397, 'loss/train': 1.2586076259613037} 11/07/2021 15:26:28 - INFO - __main__ - Step 129399: {'lr': 2.3527325547549107e-05, 'samples': 24844608, 'steps': 129398, 'loss/train': 1.2334623336791992} 11/07/2021 15:26:28 - INFO - __main__ - Step 129400: {'lr': 2.3525078130981203e-05, 'samples': 24844800, 'steps': 129399, 'loss/train': 1.1688066720962524} 11/07/2021 15:26:29 - INFO - __main__ - Step 129401: {'lr': 2.3522830816459395e-05, 'samples': 24844992, 'steps': 129400, 'loss/train': 1.6249338388442993} 11/07/2021 15:26:29 - INFO - __main__ - Step 129402: {'lr': 2.3520583603984707e-05, 'samples': 24845184, 'steps': 129401, 'loss/train': 0.9727410674095154} 11/07/2021 15:26:30 - INFO - __main__ - Step 129403: {'lr': 2.3518336493558167e-05, 'samples': 24845376, 'steps': 129402, 'loss/train': 1.6922367811203003} 11/07/2021 15:26:31 - INFO - __main__ - Step 129404: {'lr': 2.3516089485180747e-05, 'samples': 24845568, 'steps': 129403, 'loss/train': 1.2480757236480713} 11/07/2021 15:26:31 - INFO - __main__ - Step 129405: {'lr': 2.3513842578853473e-05, 'samples': 24845760, 'steps': 129404, 'loss/train': 1.363444447517395} 11/07/2021 15:26:31 - INFO - __main__ - Step 129406: {'lr': 2.35115957745774e-05, 'samples': 24845952, 'steps': 129405, 'loss/train': 1.2940396070480347} 11/07/2021 15:26:32 - INFO - __main__ - Step 129407: {'lr': 2.350934907235347e-05, 'samples': 24846144, 'steps': 129406, 'loss/train': 1.1472065448760986} 11/07/2021 15:26:33 - INFO - __main__ - Step 129408: {'lr': 2.3507102472182768e-05, 'samples': 24846336, 'steps': 129407, 'loss/train': 1.4805896282196045} 11/07/2021 15:26:33 - INFO - __main__ - Step 129409: {'lr': 2.3504855974066235e-05, 'samples': 24846528, 'steps': 129408, 'loss/train': 1.2241451740264893} 11/07/2021 15:26:33 - INFO - __main__ - Step 129410: {'lr': 2.350260957800496e-05, 'samples': 24846720, 'steps': 129409, 'loss/train': 0.8838547468185425} 11/07/2021 15:26:34 - INFO - __main__ - Step 129411: {'lr': 2.350036328399993e-05, 'samples': 24846912, 'steps': 129410, 'loss/train': 1.2002594470977783} 11/07/2021 15:26:34 - INFO - __main__ - Step 129412: {'lr': 2.3498117092052103e-05, 'samples': 24847104, 'steps': 129411, 'loss/train': 1.5431649684906006} 11/07/2021 15:26:35 - INFO - __main__ - Step 129413: {'lr': 2.3495871002162523e-05, 'samples': 24847296, 'steps': 129412, 'loss/train': 1.4436973333358765} 11/07/2021 15:26:36 - INFO - __main__ - Step 129414: {'lr': 2.349362501433222e-05, 'samples': 24847488, 'steps': 129413, 'loss/train': 1.6136564016342163} 11/07/2021 15:26:36 - INFO - __main__ - Step 129415: {'lr': 2.3491379128562196e-05, 'samples': 24847680, 'steps': 129414, 'loss/train': 1.2451386451721191} 11/07/2021 15:26:36 - INFO - __main__ - Step 129416: {'lr': 2.3489133344853447e-05, 'samples': 24847872, 'steps': 129415, 'loss/train': 1.3321335315704346} 11/07/2021 15:26:37 - INFO - __main__ - Step 129417: {'lr': 2.3486887663207002e-05, 'samples': 24848064, 'steps': 129416, 'loss/train': 0.5063287019729614} 11/07/2021 15:26:37 - INFO - __main__ - Step 129418: {'lr': 2.3484642083623887e-05, 'samples': 24848256, 'steps': 129417, 'loss/train': 1.2793351411819458} 11/07/2021 15:26:38 - INFO - __main__ - Step 129419: {'lr': 2.348239660610507e-05, 'samples': 24848448, 'steps': 129418, 'loss/train': 1.2229756116867065} 11/07/2021 15:26:39 - INFO - __main__ - Step 129420: {'lr': 2.3480151230651614e-05, 'samples': 24848640, 'steps': 129419, 'loss/train': 1.1014238595962524} 11/07/2021 15:26:39 - INFO - __main__ - Step 129421: {'lr': 2.3477905957264512e-05, 'samples': 24848832, 'steps': 129420, 'loss/train': 1.1950474977493286} 11/07/2021 15:26:39 - INFO - __main__ - Step 129422: {'lr': 2.3475660785944735e-05, 'samples': 24849024, 'steps': 129421, 'loss/train': 1.135008692741394} 11/07/2021 15:26:40 - INFO - __main__ - Step 129423: {'lr': 2.347341571669337e-05, 'samples': 24849216, 'steps': 129422, 'loss/train': 0.7434635162353516} 11/07/2021 15:26:41 - INFO - __main__ - Step 129424: {'lr': 2.3471170749511413e-05, 'samples': 24849408, 'steps': 129423, 'loss/train': 1.1536661386489868} 11/07/2021 15:26:41 - INFO - __main__ - Step 129425: {'lr': 2.3468925884399806e-05, 'samples': 24849600, 'steps': 129424, 'loss/train': 1.12167227268219} 11/07/2021 15:26:41 - INFO - __main__ - Step 129426: {'lr': 2.3466681121359606e-05, 'samples': 24849792, 'steps': 129425, 'loss/train': 1.3940285444259644} 11/07/2021 15:26:42 - INFO - __main__ - Step 129427: {'lr': 2.3464436460391813e-05, 'samples': 24849984, 'steps': 129426, 'loss/train': 1.2382153272628784} 11/07/2021 15:26:42 - INFO - __main__ - Step 129428: {'lr': 2.3462191901497453e-05, 'samples': 24850176, 'steps': 129427, 'loss/train': 0.6569872498512268} 11/07/2021 15:26:43 - INFO - __main__ - Step 129429: {'lr': 2.3459947444677553e-05, 'samples': 24850368, 'steps': 129428, 'loss/train': 1.1951578855514526} 11/07/2021 15:26:44 - INFO - __main__ - Step 129430: {'lr': 2.3457703089933085e-05, 'samples': 24850560, 'steps': 129429, 'loss/train': 1.3086782693862915} 11/07/2021 15:26:44 - INFO - __main__ - Step 129431: {'lr': 2.3455458837265076e-05, 'samples': 24850752, 'steps': 129430, 'loss/train': 1.4490472078323364} 11/07/2021 15:26:44 - INFO - __main__ - Step 129432: {'lr': 2.345321468667455e-05, 'samples': 24850944, 'steps': 129431, 'loss/train': 1.3074787855148315} 11/07/2021 15:26:45 - INFO - __main__ - Step 129433: {'lr': 2.3450970638162538e-05, 'samples': 24851136, 'steps': 129432, 'loss/train': 1.294535756111145} 11/07/2021 15:26:46 - INFO - __main__ - Step 129434: {'lr': 2.3448726691729984e-05, 'samples': 24851328, 'steps': 129433, 'loss/train': 1.390640377998352} 11/07/2021 15:26:46 - INFO - __main__ - Step 129435: {'lr': 2.3446482847377965e-05, 'samples': 24851520, 'steps': 129434, 'loss/train': 1.133129358291626} 11/07/2021 15:26:46 - INFO - __main__ - Step 129436: {'lr': 2.344423910510743e-05, 'samples': 24851712, 'steps': 129435, 'loss/train': 1.5333234071731567} 11/07/2021 15:26:47 - INFO - __main__ - Step 129437: {'lr': 2.3441995464919457e-05, 'samples': 24851904, 'steps': 129436, 'loss/train': 1.3471348285675049} 11/07/2021 15:26:47 - INFO - __main__ - Step 129438: {'lr': 2.343975192681505e-05, 'samples': 24852096, 'steps': 129437, 'loss/train': 0.9110859632492065} 11/07/2021 15:26:48 - INFO - __main__ - Step 129439: {'lr': 2.3437508490795178e-05, 'samples': 24852288, 'steps': 129438, 'loss/train': 1.5661548376083374} 11/07/2021 15:26:48 - INFO - __main__ - Step 129440: {'lr': 2.3435265156860842e-05, 'samples': 24852480, 'steps': 129439, 'loss/train': 0.7923760414123535} 11/07/2021 15:26:49 - INFO - __main__ - Step 129441: {'lr': 2.3433021925013092e-05, 'samples': 24852672, 'steps': 129440, 'loss/train': 1.2570853233337402} 11/07/2021 15:26:49 - INFO - __main__ - Step 129442: {'lr': 2.3430778795252904e-05, 'samples': 24852864, 'steps': 129441, 'loss/train': 1.0405815839767456} 11/07/2021 15:26:49 - INFO - __main__ - Step 129443: {'lr': 2.342853576758133e-05, 'samples': 24853056, 'steps': 129442, 'loss/train': 0.7998762130737305} 11/07/2021 15:26:50 - INFO - __main__ - Step 129444: {'lr': 2.3426292841999374e-05, 'samples': 24853248, 'steps': 129443, 'loss/train': 1.6121834516525269} 11/07/2021 15:26:51 - INFO - __main__ - Step 129445: {'lr': 2.3424050018508003e-05, 'samples': 24853440, 'steps': 129444, 'loss/train': 1.1318261623382568} 11/07/2021 15:26:51 - INFO - __main__ - Step 129446: {'lr': 2.342180729710827e-05, 'samples': 24853632, 'steps': 129445, 'loss/train': 1.3048150539398193} 11/07/2021 15:26:52 - INFO - __main__ - Step 129447: {'lr': 2.3419564677801182e-05, 'samples': 24853824, 'steps': 129446, 'loss/train': 0.7550424933433533} 11/07/2021 15:26:52 - INFO - __main__ - Step 129448: {'lr': 2.3417322160587757e-05, 'samples': 24854016, 'steps': 129447, 'loss/train': 1.114752173423767} 11/07/2021 15:26:53 - INFO - __main__ - Step 129449: {'lr': 2.341507974546897e-05, 'samples': 24854208, 'steps': 129448, 'loss/train': 0.5453378558158875} 11/07/2021 15:26:54 - INFO - __main__ - Step 129450: {'lr': 2.341283743244585e-05, 'samples': 24854400, 'steps': 129449, 'loss/train': 0.8911813497543335} 11/07/2021 15:26:54 - INFO - __main__ - Step 129451: {'lr': 2.341059522151945e-05, 'samples': 24854592, 'steps': 129450, 'loss/train': 1.55327308177948} 11/07/2021 15:26:54 - INFO - __main__ - Step 129452: {'lr': 2.3408353112690713e-05, 'samples': 24854784, 'steps': 129451, 'loss/train': 1.3168126344680786} 11/07/2021 15:26:55 - INFO - __main__ - Step 129453: {'lr': 2.3406111105960663e-05, 'samples': 24854976, 'steps': 129452, 'loss/train': 0.06576762348413467} 11/07/2021 15:26:56 - INFO - __main__ - Step 129454: {'lr': 2.3403869201330336e-05, 'samples': 24855168, 'steps': 129453, 'loss/train': 1.7919796705245972} 11/07/2021 15:26:56 - INFO - __main__ - Step 129455: {'lr': 2.3401627398800696e-05, 'samples': 24855360, 'steps': 129454, 'loss/train': 0.8874413371086121} 11/07/2021 15:26:56 - INFO - __main__ - Step 129456: {'lr': 2.3399385698372828e-05, 'samples': 24855552, 'steps': 129455, 'loss/train': 1.339065432548523} 11/07/2021 15:26:57 - INFO - __main__ - Step 129457: {'lr': 2.3397144100047673e-05, 'samples': 24855744, 'steps': 129456, 'loss/train': 1.3557339906692505} 11/07/2021 15:26:57 - INFO - __main__ - Step 129458: {'lr': 2.339490260382626e-05, 'samples': 24855936, 'steps': 129457, 'loss/train': 1.3504512310028076} 11/07/2021 15:26:57 - INFO - __main__ - Step 129459: {'lr': 2.3392661209709647e-05, 'samples': 24856128, 'steps': 129458, 'loss/train': 1.1195741891860962} 11/07/2021 15:26:59 - INFO - __main__ - Step 129460: {'lr': 2.3390419917698776e-05, 'samples': 24856320, 'steps': 129459, 'loss/train': 1.1455069780349731} 11/07/2021 15:26:59 - INFO - __main__ - Step 129461: {'lr': 2.3388178727794666e-05, 'samples': 24856512, 'steps': 129460, 'loss/train': 0.7211686372756958} 11/07/2021 15:27:00 - INFO - __main__ - Step 129462: {'lr': 2.3385937639998383e-05, 'samples': 24856704, 'steps': 129461, 'loss/train': 1.1345032453536987} 11/07/2021 15:27:00 - INFO - __main__ - Step 129463: {'lr': 2.3383696654310892e-05, 'samples': 24856896, 'steps': 129462, 'loss/train': 0.9488596320152283} 11/07/2021 15:27:00 - INFO - __main__ - Step 129464: {'lr': 2.338145577073325e-05, 'samples': 24857088, 'steps': 129463, 'loss/train': 1.3732928037643433} 11/07/2021 15:27:01 - INFO - __main__ - Step 129465: {'lr': 2.3379214989266374e-05, 'samples': 24857280, 'steps': 129464, 'loss/train': 1.3222566843032837} 11/07/2021 15:27:02 - INFO - __main__ - Step 129466: {'lr': 2.3376974309911343e-05, 'samples': 24857472, 'steps': 129465, 'loss/train': 0.5045725703239441} 11/07/2021 15:27:02 - INFO - __main__ - Step 129467: {'lr': 2.337473373266913e-05, 'samples': 24857664, 'steps': 129466, 'loss/train': 1.6926684379577637} 11/07/2021 15:27:03 - INFO - __main__ - Step 129468: {'lr': 2.337249325754079e-05, 'samples': 24857856, 'steps': 129467, 'loss/train': 1.0978549718856812} 11/07/2021 15:27:03 - INFO - __main__ - Step 129469: {'lr': 2.3370252884527265e-05, 'samples': 24858048, 'steps': 129468, 'loss/train': 0.8575010895729065} 11/07/2021 15:27:03 - INFO - __main__ - Step 129470: {'lr': 2.336801261362964e-05, 'samples': 24858240, 'steps': 129469, 'loss/train': 1.3113036155700684} 11/07/2021 15:27:04 - INFO - __main__ - Step 129471: {'lr': 2.3365772444848886e-05, 'samples': 24858432, 'steps': 129470, 'loss/train': 1.359731674194336} 11/07/2021 15:27:05 - INFO - __main__ - Step 129472: {'lr': 2.336353237818603e-05, 'samples': 24858624, 'steps': 129471, 'loss/train': 1.091238021850586} 11/07/2021 15:27:05 - INFO - __main__ - Step 129473: {'lr': 2.3361292413642042e-05, 'samples': 24858816, 'steps': 129472, 'loss/train': 0.9425843954086304} 11/07/2021 15:27:05 - INFO - __main__ - Step 129474: {'lr': 2.335905255121798e-05, 'samples': 24859008, 'steps': 129473, 'loss/train': 1.3357059955596924} 11/07/2021 15:27:06 - INFO - __main__ - Step 129475: {'lr': 2.3356812790914866e-05, 'samples': 24859200, 'steps': 129474, 'loss/train': 0.916527509689331} 11/07/2021 15:27:07 - INFO - __main__ - Step 129476: {'lr': 2.335457313273365e-05, 'samples': 24859392, 'steps': 129475, 'loss/train': 1.4868007898330688} 11/07/2021 15:27:07 - INFO - __main__ - Step 129477: {'lr': 2.335233357667535e-05, 'samples': 24859584, 'steps': 129476, 'loss/train': 1.1915092468261719} 11/07/2021 15:27:08 - INFO - __main__ - Step 129478: {'lr': 2.3350094122741e-05, 'samples': 24859776, 'steps': 129477, 'loss/train': 1.2666497230529785} 11/07/2021 15:27:08 - INFO - __main__ - Step 129479: {'lr': 2.334785477093157e-05, 'samples': 24859968, 'steps': 129478, 'loss/train': 1.009760856628418} 11/07/2021 15:27:08 - INFO - __main__ - Step 129480: {'lr': 2.3345615521248114e-05, 'samples': 24860160, 'steps': 129479, 'loss/train': 0.9372738599777222} 11/07/2021 15:27:09 - INFO - __main__ - Step 129481: {'lr': 2.334337637369166e-05, 'samples': 24860352, 'steps': 129480, 'loss/train': 1.3360906839370728} 11/07/2021 15:27:10 - INFO - __main__ - Step 129482: {'lr': 2.334113732826315e-05, 'samples': 24860544, 'steps': 129481, 'loss/train': 0.8538959622383118} 11/07/2021 15:27:10 - INFO - __main__ - Step 129483: {'lr': 2.3338898384963616e-05, 'samples': 24860736, 'steps': 129482, 'loss/train': 1.4023009538650513} 11/07/2021 15:27:10 - INFO - __main__ - Step 129484: {'lr': 2.3336659543794103e-05, 'samples': 24860928, 'steps': 129483, 'loss/train': 1.2163969278335571} 11/07/2021 15:27:11 - INFO - __main__ - Step 129485: {'lr': 2.333442080475559e-05, 'samples': 24861120, 'steps': 129484, 'loss/train': 0.9873047471046448} 11/07/2021 15:27:11 - INFO - __main__ - Step 129486: {'lr': 2.333218216784913e-05, 'samples': 24861312, 'steps': 129485, 'loss/train': 1.3792550563812256} 11/07/2021 15:27:12 - INFO - __main__ - Step 129487: {'lr': 2.332994363307564e-05, 'samples': 24861504, 'steps': 129486, 'loss/train': 1.3736530542373657} 11/07/2021 15:27:13 - INFO - __main__ - Step 129488: {'lr': 2.332770520043617e-05, 'samples': 24861696, 'steps': 129487, 'loss/train': 1.1254310607910156} 11/07/2021 15:27:13 - INFO - __main__ - Step 129489: {'lr': 2.3325466869931754e-05, 'samples': 24861888, 'steps': 129488, 'loss/train': 1.3872936964035034} 11/07/2021 15:27:13 - INFO - __main__ - Step 129490: {'lr': 2.332322864156339e-05, 'samples': 24862080, 'steps': 129489, 'loss/train': 1.5720070600509644} 11/07/2021 15:27:14 - INFO - __main__ - Step 129491: {'lr': 2.332099051533207e-05, 'samples': 24862272, 'steps': 129490, 'loss/train': 1.7864010334014893} 11/07/2021 15:27:14 - INFO - __main__ - Step 129492: {'lr': 2.3318752491238828e-05, 'samples': 24862464, 'steps': 129491, 'loss/train': 1.2419030666351318} 11/07/2021 15:27:15 - INFO - __main__ - Step 129493: {'lr': 2.331651456928463e-05, 'samples': 24862656, 'steps': 129492, 'loss/train': 0.45985257625579834} 11/07/2021 15:27:15 - INFO - __main__ - Step 129494: {'lr': 2.3314276749470536e-05, 'samples': 24862848, 'steps': 129493, 'loss/train': 1.1379286050796509} 11/07/2021 15:27:16 - INFO - __main__ - Step 129495: {'lr': 2.3312039031797515e-05, 'samples': 24863040, 'steps': 129494, 'loss/train': 1.2560572624206543} 11/07/2021 15:27:16 - INFO - __main__ - Step 129496: {'lr': 2.3309801416266625e-05, 'samples': 24863232, 'steps': 129495, 'loss/train': 1.0518745183944702} 11/07/2021 15:27:16 - INFO - __main__ - Step 129497: {'lr': 2.330756390287886e-05, 'samples': 24863424, 'steps': 129496, 'loss/train': 1.167081594467163} 11/07/2021 15:27:18 - INFO - __main__ - Step 129498: {'lr': 2.3305326491635165e-05, 'samples': 24863616, 'steps': 129497, 'loss/train': 1.5342576503753662} 11/07/2021 15:27:18 - INFO - __main__ - Step 129499: {'lr': 2.330308918253657e-05, 'samples': 24863808, 'steps': 129498, 'loss/train': 1.2424168586730957} 11/07/2021 15:27:18 - INFO - __main__ - Step 129500: {'lr': 2.3300851975584124e-05, 'samples': 24864000, 'steps': 129499, 'loss/train': 1.3736051321029663} 11/07/2021 15:27:19 - INFO - __main__ - Step 129501: {'lr': 2.3298614870778834e-05, 'samples': 24864192, 'steps': 129500, 'loss/train': 1.3967616558074951} 11/07/2021 15:27:19 - INFO - __main__ - Step 129502: {'lr': 2.3296377868121665e-05, 'samples': 24864384, 'steps': 129501, 'loss/train': 1.033277153968811} 11/07/2021 15:27:20 - INFO - __main__ - Step 129503: {'lr': 2.3294140967613675e-05, 'samples': 24864576, 'steps': 129502, 'loss/train': 1.1503044366836548} 11/07/2021 15:27:20 - INFO - __main__ - Step 129504: {'lr': 2.3291904169255835e-05, 'samples': 24864768, 'steps': 129503, 'loss/train': 1.3421355485916138} 11/07/2021 15:27:21 - INFO - __main__ - Step 129505: {'lr': 2.3289667473049142e-05, 'samples': 24864960, 'steps': 129504, 'loss/train': 1.0810809135437012} 11/07/2021 15:27:21 - INFO - __main__ - Step 129506: {'lr': 2.3287430878994653e-05, 'samples': 24865152, 'steps': 129505, 'loss/train': 1.1070446968078613} 11/07/2021 15:27:21 - INFO - __main__ - Step 129507: {'lr': 2.328519438709334e-05, 'samples': 24865344, 'steps': 129506, 'loss/train': 0.9458224177360535} 11/07/2021 15:27:22 - INFO - __main__ - Step 129508: {'lr': 2.3282957997346282e-05, 'samples': 24865536, 'steps': 129507, 'loss/train': 1.2748849391937256} 11/07/2021 15:27:23 - INFO - __main__ - Step 129509: {'lr': 2.3280721709754344e-05, 'samples': 24865728, 'steps': 129508, 'loss/train': 1.1371853351593018} 11/07/2021 15:27:23 - INFO - __main__ - Step 129510: {'lr': 2.3278485524318632e-05, 'samples': 24865920, 'steps': 129509, 'loss/train': 1.1692169904708862} 11/07/2021 15:27:23 - INFO - __main__ - Step 129511: {'lr': 2.327624944104015e-05, 'samples': 24866112, 'steps': 129510, 'loss/train': 1.4527592658996582} 11/07/2021 15:27:24 - INFO - __main__ - Step 129512: {'lr': 2.327401345991989e-05, 'samples': 24866304, 'steps': 129511, 'loss/train': 1.1758179664611816} 11/07/2021 15:27:24 - INFO - __main__ - Step 129513: {'lr': 2.327177758095883e-05, 'samples': 24866496, 'steps': 129512, 'loss/train': 1.1388276815414429} 11/07/2021 15:27:25 - INFO - __main__ - Step 129514: {'lr': 2.326954180415805e-05, 'samples': 24866688, 'steps': 129513, 'loss/train': 0.05360109359025955} 11/07/2021 15:27:26 - INFO - __main__ - Step 129515: {'lr': 2.3267306129518496e-05, 'samples': 24866880, 'steps': 129514, 'loss/train': 1.3199968338012695} 11/07/2021 15:27:26 - INFO - __main__ - Step 129516: {'lr': 2.326507055704119e-05, 'samples': 24867072, 'steps': 129515, 'loss/train': 1.5384752750396729} 11/07/2021 15:27:26 - INFO - __main__ - Step 129517: {'lr': 2.3262835086727137e-05, 'samples': 24867264, 'steps': 129516, 'loss/train': 1.2429908514022827} 11/07/2021 15:27:27 - INFO - __main__ - Step 129518: {'lr': 2.3260599718577385e-05, 'samples': 24867456, 'steps': 129517, 'loss/train': 1.4009073972702026} 11/07/2021 15:27:28 - INFO - __main__ - Step 129519: {'lr': 2.325836445259291e-05, 'samples': 24867648, 'steps': 129518, 'loss/train': 1.1356006860733032} 11/07/2021 15:27:28 - INFO - __main__ - Step 129520: {'lr': 2.3256129288774713e-05, 'samples': 24867840, 'steps': 129519, 'loss/train': 1.6386014223098755} 11/07/2021 15:27:28 - INFO - __main__ - Step 129521: {'lr': 2.325389422712379e-05, 'samples': 24868032, 'steps': 129520, 'loss/train': 1.4380611181259155} 11/07/2021 15:27:29 - INFO - __main__ - Step 129522: {'lr': 2.325165926764114e-05, 'samples': 24868224, 'steps': 129521, 'loss/train': 1.047199010848999} 11/07/2021 15:27:29 - INFO - __main__ - Step 129523: {'lr': 2.324942441032782e-05, 'samples': 24868416, 'steps': 129522, 'loss/train': 1.2963495254516602} 11/07/2021 15:27:30 - INFO - __main__ - Step 129524: {'lr': 2.3247189655184796e-05, 'samples': 24868608, 'steps': 129523, 'loss/train': 0.6897730827331543} 11/07/2021 15:27:31 - INFO - __main__ - Step 129525: {'lr': 2.3244955002213103e-05, 'samples': 24868800, 'steps': 129524, 'loss/train': 0.9383845329284668} 11/07/2021 15:27:31 - INFO - __main__ - Step 129526: {'lr': 2.3242720451413736e-05, 'samples': 24868992, 'steps': 129525, 'loss/train': 0.07847125828266144} 11/07/2021 15:27:31 - INFO - __main__ - Step 129527: {'lr': 2.324048600278769e-05, 'samples': 24869184, 'steps': 129526, 'loss/train': 0.7567629814147949} 11/07/2021 15:27:32 - INFO - __main__ - Step 129528: {'lr': 2.3238251656335975e-05, 'samples': 24869376, 'steps': 129527, 'loss/train': 1.5055797100067139} 11/07/2021 15:27:33 - INFO - __main__ - Step 129529: {'lr': 2.3236017412059607e-05, 'samples': 24869568, 'steps': 129528, 'loss/train': 0.8484637141227722} 11/07/2021 15:27:33 - INFO - __main__ - Step 129530: {'lr': 2.323378326995962e-05, 'samples': 24869760, 'steps': 129529, 'loss/train': 1.2825127840042114} 11/07/2021 15:27:33 - INFO - __main__ - Step 129531: {'lr': 2.3231549230036954e-05, 'samples': 24869952, 'steps': 129530, 'loss/train': 1.1104573011398315} 11/07/2021 15:27:34 - INFO - __main__ - Step 129532: {'lr': 2.322931529229272e-05, 'samples': 24870144, 'steps': 129531, 'loss/train': 1.0703150033950806} 11/07/2021 15:27:34 - INFO - __main__ - Step 129533: {'lr': 2.3227081456727807e-05, 'samples': 24870336, 'steps': 129532, 'loss/train': 1.6271402835845947} 11/07/2021 15:27:35 - INFO - __main__ - Step 129534: {'lr': 2.3224847723343267e-05, 'samples': 24870528, 'steps': 129533, 'loss/train': 1.3675364255905151} 11/07/2021 15:27:35 - INFO - __main__ - Step 129535: {'lr': 2.3222614092140104e-05, 'samples': 24870720, 'steps': 129534, 'loss/train': 1.4396034479141235} 11/07/2021 15:27:36 - INFO - __main__ - Step 129536: {'lr': 2.3220380563119342e-05, 'samples': 24870912, 'steps': 129535, 'loss/train': 1.1341795921325684} 11/07/2021 15:27:36 - INFO - __main__ - Step 129537: {'lr': 2.321814713628198e-05, 'samples': 24871104, 'steps': 129536, 'loss/train': 0.619266927242279} 11/07/2021 15:27:36 - INFO - __main__ - Step 129538: {'lr': 2.3215913811629018e-05, 'samples': 24871296, 'steps': 129537, 'loss/train': 1.876544713973999} 11/07/2021 15:27:38 - INFO - __main__ - Step 129539: {'lr': 2.3213680589161457e-05, 'samples': 24871488, 'steps': 129538, 'loss/train': 1.1454094648361206} 11/07/2021 15:27:38 - INFO - __main__ - Step 129540: {'lr': 2.321144746888032e-05, 'samples': 24871680, 'steps': 129539, 'loss/train': 1.4980612993240356} 11/07/2021 15:27:38 - INFO - __main__ - Step 129541: {'lr': 2.3209214450786607e-05, 'samples': 24871872, 'steps': 129540, 'loss/train': 1.194151759147644} 11/07/2021 15:27:39 - INFO - __main__ - Step 129542: {'lr': 2.320698153488132e-05, 'samples': 24872064, 'steps': 129541, 'loss/train': 1.6460846662521362} 11/07/2021 15:27:39 - INFO - __main__ - Step 129543: {'lr': 2.3204748721165457e-05, 'samples': 24872256, 'steps': 129542, 'loss/train': 0.7207732796669006} 11/07/2021 15:27:39 - INFO - __main__ - Step 129544: {'lr': 2.3202516009640045e-05, 'samples': 24872448, 'steps': 129543, 'loss/train': 1.4060026407241821} 11/07/2021 15:27:40 - INFO - __main__ - Step 129545: {'lr': 2.3200283400306137e-05, 'samples': 24872640, 'steps': 129544, 'loss/train': 1.0143628120422363} 11/07/2021 15:27:41 - INFO - __main__ - Step 129546: {'lr': 2.3198050893164625e-05, 'samples': 24872832, 'steps': 129545, 'loss/train': 1.5557520389556885} 11/07/2021 15:27:41 - INFO - __main__ - Step 129547: {'lr': 2.319581848821656e-05, 'samples': 24873024, 'steps': 129546, 'loss/train': 1.180822491645813} 11/07/2021 15:27:41 - INFO - __main__ - Step 129548: {'lr': 2.3193586185462966e-05, 'samples': 24873216, 'steps': 129547, 'loss/train': 0.8771147727966309} 11/07/2021 15:27:42 - INFO - __main__ - Step 129549: {'lr': 2.319135398490485e-05, 'samples': 24873408, 'steps': 129548, 'loss/train': 1.3657112121582031} 11/07/2021 15:27:43 - INFO - __main__ - Step 129550: {'lr': 2.3189121886543208e-05, 'samples': 24873600, 'steps': 129549, 'loss/train': 1.006905198097229} 11/07/2021 15:27:43 - INFO - __main__ - Step 129551: {'lr': 2.3186889890379065e-05, 'samples': 24873792, 'steps': 129550, 'loss/train': 1.082696557044983} 11/07/2021 15:27:43 - INFO - __main__ - Step 129552: {'lr': 2.3184657996413395e-05, 'samples': 24873984, 'steps': 129551, 'loss/train': 0.5920249819755554} 11/07/2021 15:27:44 - INFO - __main__ - Step 129553: {'lr': 2.3182426204647193e-05, 'samples': 24874176, 'steps': 129552, 'loss/train': 1.192578673362732} 11/07/2021 15:27:45 - INFO - __main__ - Step 129554: {'lr': 2.318019451508152e-05, 'samples': 24874368, 'steps': 129553, 'loss/train': 1.3092613220214844} 11/07/2021 15:27:45 - INFO - __main__ - Step 129555: {'lr': 2.3177962927717345e-05, 'samples': 24874560, 'steps': 129554, 'loss/train': 1.4050695896148682} 11/07/2021 15:27:46 - INFO - __main__ - Step 129556: {'lr': 2.3175731442555664e-05, 'samples': 24874752, 'steps': 129555, 'loss/train': 1.356620192527771} 11/07/2021 15:27:46 - INFO - __main__ - Step 129557: {'lr': 2.3173500059597507e-05, 'samples': 24874944, 'steps': 129556, 'loss/train': 1.5187159776687622} 11/07/2021 15:27:46 - INFO - __main__ - Step 129558: {'lr': 2.3171268778843873e-05, 'samples': 24875136, 'steps': 129557, 'loss/train': 0.614760160446167} 11/07/2021 15:27:47 - INFO - __main__ - Step 129559: {'lr': 2.3169037600295817e-05, 'samples': 24875328, 'steps': 129558, 'loss/train': 1.1912217140197754} 11/07/2021 15:27:48 - INFO - __main__ - Step 129560: {'lr': 2.3166806523954252e-05, 'samples': 24875520, 'steps': 129559, 'loss/train': 0.8076275587081909} 11/07/2021 15:27:48 - INFO - __main__ - Step 129561: {'lr': 2.316457554982021e-05, 'samples': 24875712, 'steps': 129560, 'loss/train': 1.20820951461792} 11/07/2021 15:27:48 - INFO - __main__ - Step 129562: {'lr': 2.3162344677894715e-05, 'samples': 24875904, 'steps': 129561, 'loss/train': 1.2727926969528198} 11/07/2021 15:27:49 - INFO - __main__ - Step 129563: {'lr': 2.3160113908178766e-05, 'samples': 24876096, 'steps': 129562, 'loss/train': 1.0317517518997192} 11/07/2021 15:27:49 - INFO - __main__ - Step 129564: {'lr': 2.3157883240673388e-05, 'samples': 24876288, 'steps': 129563, 'loss/train': 1.177128553390503} 11/07/2021 15:27:50 - INFO - __main__ - Step 129565: {'lr': 2.315565267537953e-05, 'samples': 24876480, 'steps': 129564, 'loss/train': 1.2480896711349487} 11/07/2021 15:27:51 - INFO - __main__ - Step 129566: {'lr': 2.315342221229827e-05, 'samples': 24876672, 'steps': 129565, 'loss/train': 1.349105954170227} 11/07/2021 15:27:51 - INFO - __main__ - Step 129567: {'lr': 2.3151191851430554e-05, 'samples': 24876864, 'steps': 129566, 'loss/train': 1.40923011302948} 11/07/2021 15:27:51 - INFO - __main__ - Step 129568: {'lr': 2.3148961592777405e-05, 'samples': 24877056, 'steps': 129567, 'loss/train': 1.282936692237854} 11/07/2021 15:27:52 - INFO - __main__ - Step 129569: {'lr': 2.3146731436339856e-05, 'samples': 24877248, 'steps': 129568, 'loss/train': 1.6210395097732544} 11/07/2021 15:27:52 - INFO - __main__ - Step 129570: {'lr': 2.3144501382118878e-05, 'samples': 24877440, 'steps': 129569, 'loss/train': 1.574355125427246} 11/07/2021 15:27:53 - INFO - __main__ - Step 129571: {'lr': 2.314227143011549e-05, 'samples': 24877632, 'steps': 129570, 'loss/train': 1.7525345087051392} 11/07/2021 15:27:53 - INFO - __main__ - Step 129572: {'lr': 2.314004158033073e-05, 'samples': 24877824, 'steps': 129571, 'loss/train': 1.0118483304977417} 11/07/2021 15:27:54 - INFO - __main__ - Step 129573: {'lr': 2.3137811832765532e-05, 'samples': 24878016, 'steps': 129572, 'loss/train': 1.6964995861053467} 11/07/2021 15:27:54 - INFO - __main__ - Step 129574: {'lr': 2.3135582187420927e-05, 'samples': 24878208, 'steps': 129573, 'loss/train': 0.6972171068191528} 11/07/2021 15:27:55 - INFO - __main__ - Step 129575: {'lr': 2.3133352644297946e-05, 'samples': 24878400, 'steps': 129574, 'loss/train': 1.0315934419631958} 11/07/2021 15:27:56 - INFO - __main__ - Step 129576: {'lr': 2.3131123203397553e-05, 'samples': 24878592, 'steps': 129575, 'loss/train': 1.0393223762512207} 11/07/2021 15:27:56 - INFO - __main__ - Step 129577: {'lr': 2.312889386472078e-05, 'samples': 24878784, 'steps': 129576, 'loss/train': 1.2588731050491333} 11/07/2021 15:27:56 - INFO - __main__ - Step 129578: {'lr': 2.3126664628268652e-05, 'samples': 24878976, 'steps': 129577, 'loss/train': 1.409393548965454} 11/07/2021 15:27:57 - INFO - __main__ - Step 129579: {'lr': 2.312443549404211e-05, 'samples': 24879168, 'steps': 129578, 'loss/train': 1.2188907861709595} 11/07/2021 15:27:57 - INFO - __main__ - Step 129580: {'lr': 2.3122206462042216e-05, 'samples': 24879360, 'steps': 129579, 'loss/train': 1.4725650548934937} 11/07/2021 15:27:57 - INFO - __main__ - Step 129581: {'lr': 2.3119977532269964e-05, 'samples': 24879552, 'steps': 129580, 'loss/train': 1.5753846168518066} 11/07/2021 15:27:59 - INFO - __main__ - Step 129582: {'lr': 2.311774870472633e-05, 'samples': 24879744, 'steps': 129581, 'loss/train': 1.503690242767334} 11/07/2021 15:27:59 - INFO - __main__ - Step 129583: {'lr': 2.311551997941236e-05, 'samples': 24879936, 'steps': 129582, 'loss/train': 1.3277637958526611} 11/07/2021 15:27:59 - INFO - __main__ - Step 129584: {'lr': 2.3113291356329004e-05, 'samples': 24880128, 'steps': 129583, 'loss/train': 1.110999345779419} 11/07/2021 15:28:00 - INFO - __main__ - Step 129585: {'lr': 2.3111062835477313e-05, 'samples': 24880320, 'steps': 129584, 'loss/train': 1.3538047075271606} 11/07/2021 15:28:00 - INFO - __main__ - Step 129586: {'lr': 2.310883441685832e-05, 'samples': 24880512, 'steps': 129585, 'loss/train': 1.676740288734436} 11/07/2021 15:28:01 - INFO - __main__ - Step 129587: {'lr': 2.3106606100472937e-05, 'samples': 24880704, 'steps': 129586, 'loss/train': 1.206655502319336} 11/07/2021 15:28:02 - INFO - __main__ - Step 129588: {'lr': 2.3104377886322248e-05, 'samples': 24880896, 'steps': 129587, 'loss/train': 1.266356110572815} 11/07/2021 15:28:02 - INFO - __main__ - Step 129589: {'lr': 2.3102149774407195e-05, 'samples': 24881088, 'steps': 129588, 'loss/train': 1.2357795238494873} 11/07/2021 15:28:02 - INFO - __main__ - Step 129590: {'lr': 2.3099921764728805e-05, 'samples': 24881280, 'steps': 129589, 'loss/train': 1.1705453395843506} 11/07/2021 15:28:03 - INFO - __main__ - Step 129591: {'lr': 2.3097693857288106e-05, 'samples': 24881472, 'steps': 129590, 'loss/train': 1.6195210218429565} 11/07/2021 15:28:03 - INFO - __main__ - Step 129592: {'lr': 2.3095466052086068e-05, 'samples': 24881664, 'steps': 129591, 'loss/train': 2.152649402618408} 11/07/2021 15:28:04 - INFO - __main__ - Step 129593: {'lr': 2.309323834912372e-05, 'samples': 24881856, 'steps': 129592, 'loss/train': 0.14737968146800995} 11/07/2021 15:28:05 - INFO - __main__ - Step 129594: {'lr': 2.309101074840206e-05, 'samples': 24882048, 'steps': 129593, 'loss/train': 0.7867170572280884} 11/07/2021 15:28:05 - INFO - __main__ - Step 129595: {'lr': 2.3088783249922084e-05, 'samples': 24882240, 'steps': 129594, 'loss/train': 0.6734504103660583} 11/07/2021 15:28:05 - INFO - __main__ - Step 129596: {'lr': 2.3086555853684826e-05, 'samples': 24882432, 'steps': 129595, 'loss/train': 1.0847047567367554} 11/07/2021 15:28:06 - INFO - __main__ - Step 129597: {'lr': 2.3084328559691225e-05, 'samples': 24882624, 'steps': 129596, 'loss/train': 1.6862680912017822} 11/07/2021 15:28:07 - INFO - __main__ - Step 129598: {'lr': 2.3082101367942364e-05, 'samples': 24882816, 'steps': 129597, 'loss/train': 1.8278926610946655} 11/07/2021 15:28:07 - INFO - __main__ - Step 129599: {'lr': 2.3079874278439216e-05, 'samples': 24883008, 'steps': 129598, 'loss/train': 1.118283987045288} 11/07/2021 15:28:07 - INFO - __main__ - Step 129600: {'lr': 2.307764729118275e-05, 'samples': 24883200, 'steps': 129599, 'loss/train': 0.9208910465240479} 11/07/2021 15:28:08 - INFO - __main__ - Step 129601: {'lr': 2.3075420406173997e-05, 'samples': 24883392, 'steps': 129600, 'loss/train': 1.2843010425567627} 11/07/2021 15:28:08 - INFO - __main__ - Step 129602: {'lr': 2.307319362341395e-05, 'samples': 24883584, 'steps': 129601, 'loss/train': 1.1556099653244019} 11/07/2021 15:28:09 - INFO - __main__ - Step 129603: {'lr': 2.3070966942903616e-05, 'samples': 24883776, 'steps': 129602, 'loss/train': 1.3259963989257812} 11/07/2021 15:28:09 - INFO - __main__ - Step 129604: {'lr': 2.3068740364644015e-05, 'samples': 24883968, 'steps': 129603, 'loss/train': 0.6808444261550903} 11/07/2021 15:28:10 - INFO - __main__ - Step 129605: {'lr': 2.306651388863612e-05, 'samples': 24884160, 'steps': 129604, 'loss/train': 1.8179314136505127} 11/07/2021 15:28:10 - INFO - __main__ - Step 129606: {'lr': 2.306428751488096e-05, 'samples': 24884352, 'steps': 129605, 'loss/train': 1.429118037223816} 11/07/2021 15:28:10 - INFO - __main__ - Step 129607: {'lr': 2.3062061243379533e-05, 'samples': 24884544, 'steps': 129606, 'loss/train': 1.303316354751587} 11/07/2021 15:28:11 - INFO - __main__ - Step 129608: {'lr': 2.305983507413284e-05, 'samples': 24884736, 'steps': 129607, 'loss/train': 1.7649128437042236} 11/07/2021 15:28:12 - INFO - __main__ - Step 129609: {'lr': 2.305760900714188e-05, 'samples': 24884928, 'steps': 129608, 'loss/train': 1.4714175462722778} 11/07/2021 15:28:12 - INFO - __main__ - Step 129610: {'lr': 2.3055383042407673e-05, 'samples': 24885120, 'steps': 129609, 'loss/train': 1.5145392417907715} 11/07/2021 15:28:13 - INFO - __main__ - Step 129611: {'lr': 2.30531571799312e-05, 'samples': 24885312, 'steps': 129610, 'loss/train': 0.44356828927993774} 11/07/2021 15:28:13 - INFO - __main__ - Step 129612: {'lr': 2.3050931419713456e-05, 'samples': 24885504, 'steps': 129611, 'loss/train': 0.8479723334312439} 11/07/2021 15:28:14 - INFO - __main__ - Step 129613: {'lr': 2.3048705761755522e-05, 'samples': 24885696, 'steps': 129612, 'loss/train': 0.9436629414558411} 11/07/2021 15:28:14 - INFO - __main__ - Step 129614: {'lr': 2.304648020605829e-05, 'samples': 24885888, 'steps': 129613, 'loss/train': 1.2336176633834839} 11/07/2021 15:28:15 - INFO - __main__ - Step 129615: {'lr': 2.304425475262281e-05, 'samples': 24886080, 'steps': 129614, 'loss/train': 1.1477795839309692} 11/07/2021 15:28:15 - INFO - __main__ - Step 129616: {'lr': 2.3042029401450086e-05, 'samples': 24886272, 'steps': 129615, 'loss/train': 1.274262547492981} 11/07/2021 15:28:15 - INFO - __main__ - Step 129617: {'lr': 2.3039804152541144e-05, 'samples': 24886464, 'steps': 129616, 'loss/train': 1.1384474039077759} 11/07/2021 15:28:16 - INFO - __main__ - Step 129618: {'lr': 2.3037579005896925e-05, 'samples': 24886656, 'steps': 129617, 'loss/train': 1.0393915176391602} 11/07/2021 15:28:17 - INFO - __main__ - Step 129619: {'lr': 2.3035353961518484e-05, 'samples': 24886848, 'steps': 129618, 'loss/train': 1.1749787330627441} 11/07/2021 15:28:17 - INFO - __main__ - Step 129620: {'lr': 2.3033129019406823e-05, 'samples': 24887040, 'steps': 129619, 'loss/train': 1.537907600402832} 11/07/2021 15:28:17 - INFO - __main__ - Step 129621: {'lr': 2.303090417956294e-05, 'samples': 24887232, 'steps': 129620, 'loss/train': 1.3990592956542969} 11/07/2021 15:28:18 - INFO - __main__ - Step 129622: {'lr': 2.3028679441987804e-05, 'samples': 24887424, 'steps': 129621, 'loss/train': 1.142238974571228} 11/07/2021 15:28:18 - INFO - __main__ - Step 129623: {'lr': 2.3026454806682444e-05, 'samples': 24887616, 'steps': 129622, 'loss/train': 1.1163723468780518} 11/07/2021 15:28:19 - INFO - __main__ - Step 129624: {'lr': 2.302423027364789e-05, 'samples': 24887808, 'steps': 129623, 'loss/train': 0.9907718896865845} 11/07/2021 15:28:20 - INFO - __main__ - Step 129625: {'lr': 2.302200584288508e-05, 'samples': 24888000, 'steps': 129624, 'loss/train': 1.3057717084884644} 11/07/2021 15:28:20 - INFO - __main__ - Step 129626: {'lr': 2.3019781514395127e-05, 'samples': 24888192, 'steps': 129625, 'loss/train': 1.2667135000228882} 11/07/2021 15:28:20 - INFO - __main__ - Step 129627: {'lr': 2.301755728817889e-05, 'samples': 24888384, 'steps': 129626, 'loss/train': 1.2278560400009155} 11/07/2021 15:28:21 - INFO - __main__ - Step 129628: {'lr': 2.3015333164237456e-05, 'samples': 24888576, 'steps': 129627, 'loss/train': 1.0083552598953247} 11/07/2021 15:28:22 - INFO - __main__ - Step 129629: {'lr': 2.3013109142571792e-05, 'samples': 24888768, 'steps': 129628, 'loss/train': 0.8488309979438782} 11/07/2021 15:28:22 - INFO - __main__ - Step 129630: {'lr': 2.3010885223182925e-05, 'samples': 24888960, 'steps': 129629, 'loss/train': 1.3631367683410645} 11/07/2021 15:28:22 - INFO - __main__ - Step 129631: {'lr': 2.3008661406071856e-05, 'samples': 24889152, 'steps': 129630, 'loss/train': 1.1300673484802246} 11/07/2021 15:28:23 - INFO - __main__ - Step 129632: {'lr': 2.3006437691239585e-05, 'samples': 24889344, 'steps': 129631, 'loss/train': 1.1745069026947021} 11/07/2021 15:28:23 - INFO - __main__ - Step 129633: {'lr': 2.300421407868711e-05, 'samples': 24889536, 'steps': 129632, 'loss/train': 0.32454439997673035} 11/07/2021 15:28:24 - INFO - __main__ - Step 129634: {'lr': 2.3001990568415425e-05, 'samples': 24889728, 'steps': 129633, 'loss/train': 1.2263751029968262} 11/07/2021 15:28:25 - INFO - __main__ - Step 129635: {'lr': 2.299976716042554e-05, 'samples': 24889920, 'steps': 129634, 'loss/train': 0.030660606920719147} 11/07/2021 15:28:25 - INFO - __main__ - Step 129636: {'lr': 2.2997543854718472e-05, 'samples': 24890112, 'steps': 129635, 'loss/train': 1.609272837638855} 11/07/2021 15:28:25 - INFO - __main__ - Step 129637: {'lr': 2.299532065129517e-05, 'samples': 24890304, 'steps': 129636, 'loss/train': 1.036591649055481} 11/07/2021 15:28:26 - INFO - __main__ - Step 129638: {'lr': 2.2993097550156715e-05, 'samples': 24890496, 'steps': 129637, 'loss/train': 1.565486192703247} 11/07/2021 15:28:27 - INFO - __main__ - Step 129639: {'lr': 2.299087455130411e-05, 'samples': 24890688, 'steps': 129638, 'loss/train': 1.2289912700653076} 11/07/2021 15:28:27 - INFO - __main__ - Step 129640: {'lr': 2.2988651654738235e-05, 'samples': 24890880, 'steps': 129639, 'loss/train': 1.2656329870224} 11/07/2021 15:28:27 - INFO - __main__ - Step 129641: {'lr': 2.2986428860460206e-05, 'samples': 24891072, 'steps': 129640, 'loss/train': 1.41961669921875} 11/07/2021 15:28:28 - INFO - __main__ - Step 129642: {'lr': 2.2984206168470966e-05, 'samples': 24891264, 'steps': 129641, 'loss/train': 0.944904088973999} 11/07/2021 15:28:28 - INFO - __main__ - Step 129643: {'lr': 2.2981983578771543e-05, 'samples': 24891456, 'steps': 129642, 'loss/train': 1.5724045038223267} 11/07/2021 15:28:29 - INFO - __main__ - Step 129644: {'lr': 2.2979761091362932e-05, 'samples': 24891648, 'steps': 129643, 'loss/train': 1.849913239479065} 11/07/2021 15:28:29 - INFO - __main__ - Step 129645: {'lr': 2.2977538706246163e-05, 'samples': 24891840, 'steps': 129644, 'loss/train': 1.4564247131347656} 11/07/2021 15:28:30 - INFO - __main__ - Step 129646: {'lr': 2.2975316423422182e-05, 'samples': 24892032, 'steps': 129645, 'loss/train': 1.2312793731689453} 11/07/2021 15:28:30 - INFO - __main__ - Step 129647: {'lr': 2.297309424289204e-05, 'samples': 24892224, 'steps': 129646, 'loss/train': 1.535204291343689} 11/07/2021 15:28:30 - INFO - __main__ - Step 129648: {'lr': 2.2970872164656707e-05, 'samples': 24892416, 'steps': 129647, 'loss/train': 0.9415724277496338} 11/07/2021 15:28:32 - INFO - __main__ - Step 129649: {'lr': 2.2968650188717215e-05, 'samples': 24892608, 'steps': 129648, 'loss/train': 1.3313965797424316} 11/07/2021 15:28:32 - INFO - __main__ - Step 129650: {'lr': 2.2966428315074532e-05, 'samples': 24892800, 'steps': 129649, 'loss/train': 1.7099019289016724} 11/07/2021 15:28:32 - INFO - __main__ - Step 129651: {'lr': 2.296420654372966e-05, 'samples': 24892992, 'steps': 129650, 'loss/train': 0.9127764701843262} 11/07/2021 15:28:33 - INFO - __main__ - Step 129652: {'lr': 2.2961984874683624e-05, 'samples': 24893184, 'steps': 129651, 'loss/train': 1.731455683708191} 11/07/2021 15:28:33 - INFO - __main__ - Step 129653: {'lr': 2.2959763307937475e-05, 'samples': 24893376, 'steps': 129652, 'loss/train': 1.4609050750732422} 11/07/2021 15:28:33 - INFO - __main__ - Step 129654: {'lr': 2.295754184349208e-05, 'samples': 24893568, 'steps': 129653, 'loss/train': 1.1505540609359741} 11/07/2021 15:28:34 - INFO - __main__ - Step 129655: {'lr': 2.2955320481348548e-05, 'samples': 24893760, 'steps': 129654, 'loss/train': 1.3141753673553467} 11/07/2021 15:28:35 - INFO - __main__ - Step 129656: {'lr': 2.2953099221507816e-05, 'samples': 24893952, 'steps': 129655, 'loss/train': 1.2468674182891846} 11/07/2021 15:28:35 - INFO - __main__ - Step 129657: {'lr': 2.295087806397092e-05, 'samples': 24894144, 'steps': 129656, 'loss/train': 1.1510672569274902} 11/07/2021 15:28:35 - INFO - __main__ - Step 129658: {'lr': 2.2948657008738854e-05, 'samples': 24894336, 'steps': 129657, 'loss/train': 0.9312916994094849} 11/07/2021 15:28:36 - INFO - __main__ - Step 129659: {'lr': 2.294643605581262e-05, 'samples': 24894528, 'steps': 129658, 'loss/train': 1.0900605916976929} 11/07/2021 15:28:37 - INFO - __main__ - Step 129660: {'lr': 2.2944215205193214e-05, 'samples': 24894720, 'steps': 129659, 'loss/train': 1.3737703561782837} 11/07/2021 15:28:37 - INFO - __main__ - Step 129661: {'lr': 2.2941994456881666e-05, 'samples': 24894912, 'steps': 129660, 'loss/train': 1.075958013534546} 11/07/2021 15:28:38 - INFO - __main__ - Step 129662: {'lr': 2.2939773810878918e-05, 'samples': 24895104, 'steps': 129661, 'loss/train': 1.2906246185302734} 11/07/2021 15:28:38 - INFO - __main__ - Step 129663: {'lr': 2.2937553267186023e-05, 'samples': 24895296, 'steps': 129662, 'loss/train': 1.1460819244384766} 11/07/2021 15:28:38 - INFO - __main__ - Step 129664: {'lr': 2.2935332825803955e-05, 'samples': 24895488, 'steps': 129663, 'loss/train': 1.4564265012741089} 11/07/2021 15:28:39 - INFO - __main__ - Step 129665: {'lr': 2.2933112486733716e-05, 'samples': 24895680, 'steps': 129664, 'loss/train': 0.9403479695320129} 11/07/2021 15:28:40 - INFO - __main__ - Step 129666: {'lr': 2.2930892249976383e-05, 'samples': 24895872, 'steps': 129665, 'loss/train': 0.03860127925872803} 11/07/2021 15:28:40 - INFO - __main__ - Step 129667: {'lr': 2.2928672115532817e-05, 'samples': 24896064, 'steps': 129666, 'loss/train': 0.6624731421470642} 11/07/2021 15:28:40 - INFO - __main__ - Step 129668: {'lr': 2.2926452083404102e-05, 'samples': 24896256, 'steps': 129667, 'loss/train': 0.8185074925422668} 11/07/2021 15:28:41 - INFO - __main__ - Step 129669: {'lr': 2.292423215359121e-05, 'samples': 24896448, 'steps': 129668, 'loss/train': 1.2313166856765747} 11/07/2021 15:28:42 - INFO - __main__ - Step 129670: {'lr': 2.292201232609517e-05, 'samples': 24896640, 'steps': 129669, 'loss/train': 1.30597722530365} 11/07/2021 15:28:42 - INFO - __main__ - Step 129671: {'lr': 2.291979260091695e-05, 'samples': 24896832, 'steps': 129670, 'loss/train': 1.2138928174972534} 11/07/2021 15:28:42 - INFO - __main__ - Step 129672: {'lr': 2.2917572978057576e-05, 'samples': 24897024, 'steps': 129671, 'loss/train': 0.643681526184082} 11/07/2021 15:28:43 - INFO - __main__ - Step 129673: {'lr': 2.2915353457518052e-05, 'samples': 24897216, 'steps': 129672, 'loss/train': 1.387310266494751} 11/07/2021 15:28:43 - INFO - __main__ - Step 129674: {'lr': 2.291313403929937e-05, 'samples': 24897408, 'steps': 129673, 'loss/train': 1.3224648237228394} 11/07/2021 15:28:44 - INFO - __main__ - Step 129675: {'lr': 2.2910914723402508e-05, 'samples': 24897600, 'steps': 129674, 'loss/train': 1.1200119256973267} 11/07/2021 15:28:45 - INFO - __main__ - Step 129676: {'lr': 2.2908695509828463e-05, 'samples': 24897792, 'steps': 129675, 'loss/train': 1.030980110168457} 11/07/2021 15:28:45 - INFO - __main__ - Step 129677: {'lr': 2.290647639857829e-05, 'samples': 24897984, 'steps': 129676, 'loss/train': 0.4430571496486664} 11/07/2021 15:28:45 - INFO - __main__ - Step 129678: {'lr': 2.2904257389652933e-05, 'samples': 24898176, 'steps': 129677, 'loss/train': 1.005976676940918} 11/07/2021 15:28:46 - INFO - __main__ - Step 129679: {'lr': 2.2902038483053443e-05, 'samples': 24898368, 'steps': 129678, 'loss/train': 0.6946455240249634} 11/07/2021 15:28:47 - INFO - __main__ - Step 129680: {'lr': 2.28998196787808e-05, 'samples': 24898560, 'steps': 129679, 'loss/train': 1.0678675174713135} 11/07/2021 15:28:47 - INFO - __main__ - Step 129681: {'lr': 2.2897600976835965e-05, 'samples': 24898752, 'steps': 129680, 'loss/train': 1.3781691789627075} 11/07/2021 15:28:48 - INFO - __main__ - Step 129682: {'lr': 2.289538237721997e-05, 'samples': 24898944, 'steps': 129681, 'loss/train': 1.039242148399353} 11/07/2021 15:28:48 - INFO - __main__ - Step 129683: {'lr': 2.2893163879933816e-05, 'samples': 24899136, 'steps': 129682, 'loss/train': 0.3139842450618744} 11/07/2021 15:28:48 - INFO - __main__ - Step 129684: {'lr': 2.289094548497847e-05, 'samples': 24899328, 'steps': 129683, 'loss/train': 1.2708362340927124} 11/07/2021 15:28:49 - INFO - __main__ - Step 129685: {'lr': 2.2888727192354993e-05, 'samples': 24899520, 'steps': 129684, 'loss/train': 0.8739088177680969} 11/07/2021 15:28:50 - INFO - __main__ - Step 129686: {'lr': 2.2886509002064348e-05, 'samples': 24899712, 'steps': 129685, 'loss/train': 1.0064308643341064} 11/07/2021 15:28:50 - INFO - __main__ - Step 129687: {'lr': 2.2884290914107514e-05, 'samples': 24899904, 'steps': 129686, 'loss/train': 1.1691627502441406} 11/07/2021 15:28:51 - INFO - __main__ - Step 129688: {'lr': 2.2882072928485515e-05, 'samples': 24900096, 'steps': 129687, 'loss/train': 5.680471420288086} 11/07/2021 15:28:51 - INFO - __main__ - Step 129689: {'lr': 2.2879855045199377e-05, 'samples': 24900288, 'steps': 129688, 'loss/train': 1.6437064409255981} 11/07/2021 15:28:51 - INFO - __main__ - Step 129690: {'lr': 2.2877637264250045e-05, 'samples': 24900480, 'steps': 129689, 'loss/train': 1.3058533668518066} 11/07/2021 15:28:52 - INFO - __main__ - Step 129691: {'lr': 2.2875419585638546e-05, 'samples': 24900672, 'steps': 129690, 'loss/train': 0.9051546454429626} 11/07/2021 15:28:52 - INFO - __main__ - Step 129692: {'lr': 2.2873202009365906e-05, 'samples': 24900864, 'steps': 129691, 'loss/train': 5.155226230621338} 11/07/2021 15:28:53 - INFO - __main__ - Step 129693: {'lr': 2.2870984535433126e-05, 'samples': 24901056, 'steps': 129692, 'loss/train': 5.038218975067139} 11/07/2021 15:28:54 - INFO - __main__ - Step 129694: {'lr': 2.286876716384112e-05, 'samples': 24901248, 'steps': 129693, 'loss/train': 1.2543001174926758} 11/07/2021 15:28:54 - INFO - __main__ - Step 129695: {'lr': 2.2866549894590943e-05, 'samples': 24901440, 'steps': 129694, 'loss/train': 1.425484538078308} 11/07/2021 15:28:54 - INFO - __main__ - Step 129696: {'lr': 2.2864332727683594e-05, 'samples': 24901632, 'steps': 129695, 'loss/train': 1.155626893043518} 11/07/2021 15:28:55 - INFO - __main__ - Step 129697: {'lr': 2.2862115663120075e-05, 'samples': 24901824, 'steps': 129696, 'loss/train': 1.3086804151535034} 11/07/2021 15:28:56 - INFO - __main__ - Step 129698: {'lr': 2.285989870090141e-05, 'samples': 24902016, 'steps': 129697, 'loss/train': 0.7748323678970337} 11/07/2021 15:28:56 - INFO - __main__ - Step 129699: {'lr': 2.2857681841028545e-05, 'samples': 24902208, 'steps': 129698, 'loss/train': 0.9871715307235718} 11/07/2021 15:28:56 - INFO - __main__ - Step 129700: {'lr': 2.2855465083502503e-05, 'samples': 24902400, 'steps': 129699, 'loss/train': 1.1427280902862549} 11/07/2021 15:28:57 - INFO - __main__ - Step 129701: {'lr': 2.2853248428324258e-05, 'samples': 24902592, 'steps': 129700, 'loss/train': 1.4574923515319824} 11/07/2021 15:28:57 - INFO - __main__ - Step 129702: {'lr': 2.2851031875494867e-05, 'samples': 24902784, 'steps': 129701, 'loss/train': 1.3750044107437134} 11/07/2021 15:28:58 - INFO - __main__ - Step 129703: {'lr': 2.2848815425015297e-05, 'samples': 24902976, 'steps': 129702, 'loss/train': 1.389392614364624} 11/07/2021 15:28:58 - INFO - __main__ - Step 129704: {'lr': 2.284659907688655e-05, 'samples': 24903168, 'steps': 129703, 'loss/train': 1.285388708114624} 11/07/2021 15:28:59 - INFO - __main__ - Step 129705: {'lr': 2.2844382831109594e-05, 'samples': 24903360, 'steps': 129704, 'loss/train': 1.437054991722107} 11/07/2021 15:28:59 - INFO - __main__ - Step 129706: {'lr': 2.284216668768546e-05, 'samples': 24903552, 'steps': 129705, 'loss/train': 1.1372610330581665} 11/07/2021 15:29:00 - INFO - __main__ - Step 129707: {'lr': 2.2839950646615206e-05, 'samples': 24903744, 'steps': 129706, 'loss/train': 1.2035245895385742} 11/07/2021 15:29:01 - INFO - __main__ - Step 129708: {'lr': 2.283773470789971e-05, 'samples': 24903936, 'steps': 129707, 'loss/train': 1.2386690378189087} 11/07/2021 15:29:01 - INFO - __main__ - Step 129709: {'lr': 2.2835518871540007e-05, 'samples': 24904128, 'steps': 129708, 'loss/train': 1.4391390085220337} 11/07/2021 15:29:01 - INFO - __main__ - Step 129710: {'lr': 2.283330313753712e-05, 'samples': 24904320, 'steps': 129709, 'loss/train': 1.2239751815795898} 11/07/2021 15:29:02 - INFO - __main__ - Step 129711: {'lr': 2.283108750589205e-05, 'samples': 24904512, 'steps': 129710, 'loss/train': 1.3421217203140259} 11/07/2021 15:29:02 - INFO - __main__ - Step 129712: {'lr': 2.2828871976605798e-05, 'samples': 24904704, 'steps': 129711, 'loss/train': 1.1843620538711548} 11/07/2021 15:29:03 - INFO - __main__ - Step 129713: {'lr': 2.282665654967933e-05, 'samples': 24904896, 'steps': 129712, 'loss/train': 1.2597280740737915} 11/07/2021 15:29:03 - INFO - __main__ - Step 129714: {'lr': 2.282444122511368e-05, 'samples': 24905088, 'steps': 129713, 'loss/train': 1.4514931440353394} 11/07/2021 15:29:04 - INFO - __main__ - Step 129715: {'lr': 2.282222600290984e-05, 'samples': 24905280, 'steps': 129714, 'loss/train': 1.4138442277908325} 11/07/2021 15:29:04 - INFO - __main__ - Step 129716: {'lr': 2.2820010883068787e-05, 'samples': 24905472, 'steps': 129715, 'loss/train': 1.4604206085205078} 11/07/2021 15:29:04 - INFO - __main__ - Step 129717: {'lr': 2.2817795865591517e-05, 'samples': 24905664, 'steps': 129716, 'loss/train': 1.302251935005188} 11/07/2021 15:29:05 - INFO - __main__ - Step 129718: {'lr': 2.281558095047906e-05, 'samples': 24905856, 'steps': 129717, 'loss/train': 1.0453073978424072} 11/07/2021 15:29:06 - INFO - __main__ - Step 129719: {'lr': 2.2813366137732383e-05, 'samples': 24906048, 'steps': 129718, 'loss/train': 1.2673205137252808} 11/07/2021 15:29:06 - INFO - __main__ - Step 129720: {'lr': 2.2811151427352573e-05, 'samples': 24906240, 'steps': 129719, 'loss/train': 1.5267186164855957} 11/07/2021 15:29:07 - INFO - __main__ - Step 129721: {'lr': 2.2808936819340458e-05, 'samples': 24906432, 'steps': 129720, 'loss/train': 1.1977994441986084} 11/07/2021 15:29:07 - INFO - __main__ - Step 129722: {'lr': 2.280672231369718e-05, 'samples': 24906624, 'steps': 129721, 'loss/train': 1.2193737030029297} 11/07/2021 15:29:07 - INFO - __main__ - Step 129723: {'lr': 2.2804507910423654e-05, 'samples': 24906816, 'steps': 129722, 'loss/train': 1.1282806396484375} 11/07/2021 15:29:08 - INFO - __main__ - Step 129724: {'lr': 2.2802293609520936e-05, 'samples': 24907008, 'steps': 129723, 'loss/train': 1.2717622518539429} 11/07/2021 15:29:09 - INFO - __main__ - Step 129725: {'lr': 2.2800079410989966e-05, 'samples': 24907200, 'steps': 129724, 'loss/train': 1.5356316566467285} 11/07/2021 15:29:09 - INFO - __main__ - Step 129726: {'lr': 2.27978653148318e-05, 'samples': 24907392, 'steps': 129725, 'loss/train': 0.9400920867919922} 11/07/2021 15:29:09 - INFO - __main__ - Step 129727: {'lr': 2.279565132104741e-05, 'samples': 24907584, 'steps': 129726, 'loss/train': 1.2678889036178589} 11/07/2021 15:29:10 - INFO - __main__ - Step 129728: {'lr': 2.279343742963777e-05, 'samples': 24907776, 'steps': 129727, 'loss/train': 1.5893574953079224} 11/07/2021 15:29:11 - INFO - __main__ - Step 129729: {'lr': 2.279122364060393e-05, 'samples': 24907968, 'steps': 129728, 'loss/train': 1.6448155641555786} 11/07/2021 15:29:11 - INFO - __main__ - Step 129730: {'lr': 2.2789009953946838e-05, 'samples': 24908160, 'steps': 129729, 'loss/train': 1.3333927392959595} 11/07/2021 15:29:11 - INFO - __main__ - Step 129731: {'lr': 2.278679636966752e-05, 'samples': 24908352, 'steps': 129730, 'loss/train': 1.3075989484786987} 11/07/2021 15:29:12 - INFO - __main__ - Step 129732: {'lr': 2.2784582887766968e-05, 'samples': 24908544, 'steps': 129731, 'loss/train': 0.8958889245986938} 11/07/2021 15:29:12 - INFO - __main__ - Step 129733: {'lr': 2.278236950824622e-05, 'samples': 24908736, 'steps': 129732, 'loss/train': 1.4401732683181763} 11/07/2021 15:29:13 - INFO - __main__ - Step 129734: {'lr': 2.2780156231106186e-05, 'samples': 24908928, 'steps': 129733, 'loss/train': 1.4030407667160034} 11/07/2021 15:29:13 - INFO - __main__ - Step 129735: {'lr': 2.2777943056347923e-05, 'samples': 24909120, 'steps': 129734, 'loss/train': 1.0187135934829712} 11/07/2021 15:29:14 - INFO - __main__ - Step 129736: {'lr': 2.277572998397237e-05, 'samples': 24909312, 'steps': 129735, 'loss/train': 1.6713602542877197} 11/07/2021 15:29:14 - INFO - __main__ - Step 129737: {'lr': 2.2773517013980615e-05, 'samples': 24909504, 'steps': 129736, 'loss/train': 1.0933914184570312} 11/07/2021 15:29:14 - INFO - __main__ - Step 129738: {'lr': 2.2771304146373572e-05, 'samples': 24909696, 'steps': 129737, 'loss/train': 1.4470816850662231} 11/07/2021 15:29:15 - INFO - __main__ - Step 129739: {'lr': 2.2769091381152298e-05, 'samples': 24909888, 'steps': 129738, 'loss/train': 1.2861417531967163} 11/07/2021 15:29:16 - INFO - __main__ - Step 129740: {'lr': 2.276687871831773e-05, 'samples': 24910080, 'steps': 129739, 'loss/train': 1.3527940511703491} 11/07/2021 15:29:16 - INFO - __main__ - Step 129741: {'lr': 2.276466615787093e-05, 'samples': 24910272, 'steps': 129740, 'loss/train': 1.4344793558120728} 11/07/2021 15:29:17 - INFO - __main__ - Step 129742: {'lr': 2.2762453699812864e-05, 'samples': 24910464, 'steps': 129741, 'loss/train': 0.8053730130195618} 11/07/2021 15:29:17 - INFO - __main__ - Step 129743: {'lr': 2.2760241344144504e-05, 'samples': 24910656, 'steps': 129742, 'loss/train': 1.163966417312622} 11/07/2021 15:29:18 - INFO - __main__ - Step 129744: {'lr': 2.2758029090866937e-05, 'samples': 24910848, 'steps': 129743, 'loss/train': 0.9193429946899414} 11/07/2021 15:29:18 - INFO - __main__ - Step 129745: {'lr': 2.2755816939981078e-05, 'samples': 24911040, 'steps': 129744, 'loss/train': 1.372003436088562} 11/07/2021 15:29:19 - INFO - __main__ - Step 129746: {'lr': 2.2753604891487895e-05, 'samples': 24911232, 'steps': 129745, 'loss/train': 0.9297892451286316} 11/07/2021 15:29:19 - INFO - __main__ - Step 129747: {'lr': 2.2751392945388443e-05, 'samples': 24911424, 'steps': 129746, 'loss/train': 1.078189492225647} 11/07/2021 15:29:19 - INFO - __main__ - Step 129748: {'lr': 2.2749181101683724e-05, 'samples': 24911616, 'steps': 129747, 'loss/train': 1.288784146308899} 11/07/2021 15:29:20 - INFO - __main__ - Step 129749: {'lr': 2.2746969360374707e-05, 'samples': 24911808, 'steps': 129748, 'loss/train': 1.153128743171692} 11/07/2021 15:29:21 - INFO - __main__ - Step 129750: {'lr': 2.2744757721462395e-05, 'samples': 24912000, 'steps': 129749, 'loss/train': 1.2779895067214966} 11/07/2021 15:29:21 - INFO - __main__ - Step 129751: {'lr': 2.274254618494781e-05, 'samples': 24912192, 'steps': 129750, 'loss/train': 0.9809364080429077} 11/07/2021 15:29:21 - INFO - __main__ - Step 129752: {'lr': 2.2740334750831898e-05, 'samples': 24912384, 'steps': 129751, 'loss/train': 1.3512316942214966} 11/07/2021 15:29:22 - INFO - __main__ - Step 129753: {'lr': 2.2738123419115685e-05, 'samples': 24912576, 'steps': 129752, 'loss/train': 1.4246573448181152} 11/07/2021 15:29:23 - INFO - __main__ - Step 129754: {'lr': 2.2735912189800175e-05, 'samples': 24912768, 'steps': 129753, 'loss/train': 0.03491226211190224} 11/07/2021 15:29:23 - INFO - __main__ - Step 129755: {'lr': 2.2733701062886414e-05, 'samples': 24912960, 'steps': 129754, 'loss/train': 1.3153318166732788} 11/07/2021 15:29:24 - INFO - __main__ - Step 129756: {'lr': 2.2731490038375295e-05, 'samples': 24913152, 'steps': 129755, 'loss/train': 1.498500943183899} 11/07/2021 15:29:24 - INFO - __main__ - Step 129757: {'lr': 2.2729279116267847e-05, 'samples': 24913344, 'steps': 129756, 'loss/train': 1.5463672876358032} 11/07/2021 15:29:24 - INFO - __main__ - Step 129758: {'lr': 2.2727068296565067e-05, 'samples': 24913536, 'steps': 129757, 'loss/train': 1.6414085626602173} 11/07/2021 15:29:25 - INFO - __main__ - Step 129759: {'lr': 2.2724857579267983e-05, 'samples': 24913728, 'steps': 129758, 'loss/train': 0.8999258875846863} 11/07/2021 15:29:26 - INFO - __main__ - Step 129760: {'lr': 2.2722646964377562e-05, 'samples': 24913920, 'steps': 129759, 'loss/train': 1.0947285890579224} 11/07/2021 15:29:26 - INFO - __main__ - Step 129761: {'lr': 2.272043645189481e-05, 'samples': 24914112, 'steps': 129760, 'loss/train': 1.4569588899612427} 11/07/2021 15:29:26 - INFO - __main__ - Step 129762: {'lr': 2.2718226041820724e-05, 'samples': 24914304, 'steps': 129761, 'loss/train': 0.04431464150547981} 11/07/2021 15:29:27 - INFO - __main__ - Step 129763: {'lr': 2.2716015734156298e-05, 'samples': 24914496, 'steps': 129762, 'loss/train': 0.9064356088638306} 11/07/2021 15:29:27 - INFO - __main__ - Step 129764: {'lr': 2.2713805528902537e-05, 'samples': 24914688, 'steps': 129763, 'loss/train': 0.9760269522666931} 11/07/2021 15:29:28 - INFO - __main__ - Step 129765: {'lr': 2.271159542606041e-05, 'samples': 24914880, 'steps': 129764, 'loss/train': 0.662019670009613} 11/07/2021 15:29:28 - INFO - __main__ - Step 129766: {'lr': 2.2709385425631002e-05, 'samples': 24915072, 'steps': 129765, 'loss/train': 1.4791141748428345} 11/07/2021 15:29:29 - INFO - __main__ - Step 129767: {'lr': 2.270717552761517e-05, 'samples': 24915264, 'steps': 129766, 'loss/train': 0.9422411918640137} 11/07/2021 15:29:29 - INFO - __main__ - Step 129768: {'lr': 2.2704965732013972e-05, 'samples': 24915456, 'steps': 129767, 'loss/train': 1.0696966648101807} 11/07/2021 15:29:29 - INFO - __main__ - Step 129769: {'lr': 2.2702756038828433e-05, 'samples': 24915648, 'steps': 129768, 'loss/train': 1.6006999015808105} 11/07/2021 15:29:31 - INFO - __main__ - Step 129770: {'lr': 2.2700546448059494e-05, 'samples': 24915840, 'steps': 129769, 'loss/train': 1.5563609600067139} 11/07/2021 15:29:31 - INFO - __main__ - Step 129771: {'lr': 2.2698336959708215e-05, 'samples': 24916032, 'steps': 129770, 'loss/train': 1.2821040153503418} 11/07/2021 15:29:31 - INFO - __main__ - Step 129772: {'lr': 2.269612757377554e-05, 'samples': 24916224, 'steps': 129771, 'loss/train': 1.195591688156128} 11/07/2021 15:29:32 - INFO - __main__ - Step 129773: {'lr': 2.269391829026249e-05, 'samples': 24916416, 'steps': 129772, 'loss/train': 1.049838662147522} 11/07/2021 15:29:32 - INFO - __main__ - Step 129774: {'lr': 2.2691709109170037e-05, 'samples': 24916608, 'steps': 129773, 'loss/train': 1.4295002222061157} 11/07/2021 15:29:33 - INFO - __main__ - Step 129775: {'lr': 2.2689500030499217e-05, 'samples': 24916800, 'steps': 129774, 'loss/train': 1.288271427154541} 11/07/2021 15:29:33 - INFO - __main__ - Step 129776: {'lr': 2.2687291054251018e-05, 'samples': 24916992, 'steps': 129775, 'loss/train': 1.6225385665893555} 11/07/2021 15:29:34 - INFO - __main__ - Step 129777: {'lr': 2.268508218042639e-05, 'samples': 24917184, 'steps': 129776, 'loss/train': 1.3986917734146118} 11/07/2021 15:29:34 - INFO - __main__ - Step 129778: {'lr': 2.268287340902636e-05, 'samples': 24917376, 'steps': 129777, 'loss/train': 0.9934407472610474} 11/07/2021 15:29:34 - INFO - __main__ - Step 129779: {'lr': 2.26806647400519e-05, 'samples': 24917568, 'steps': 129778, 'loss/train': 0.8080102205276489} 11/07/2021 15:29:36 - INFO - __main__ - Step 129780: {'lr': 2.267845617350406e-05, 'samples': 24917760, 'steps': 129779, 'loss/train': 1.4654004573822021} 11/07/2021 15:29:36 - INFO - __main__ - Step 129781: {'lr': 2.2676247709383758e-05, 'samples': 24917952, 'steps': 129780, 'loss/train': 1.1477632522583008} 11/07/2021 15:29:36 - INFO - __main__ - Step 129782: {'lr': 2.267403934769205e-05, 'samples': 24918144, 'steps': 129781, 'loss/train': 1.2412645816802979} 11/07/2021 15:29:37 - INFO - __main__ - Step 129783: {'lr': 2.2671831088429908e-05, 'samples': 24918336, 'steps': 129782, 'loss/train': 0.7282606363296509} 11/07/2021 15:29:37 - INFO - __main__ - Step 129784: {'lr': 2.2669622931598356e-05, 'samples': 24918528, 'steps': 129783, 'loss/train': 1.2925347089767456} 11/07/2021 15:29:37 - INFO - __main__ - Step 129785: {'lr': 2.266741487719834e-05, 'samples': 24918720, 'steps': 129784, 'loss/train': 1.4289833307266235} 11/07/2021 15:29:38 - INFO - __main__ - Step 129786: {'lr': 2.2665206925230857e-05, 'samples': 24918912, 'steps': 129785, 'loss/train': 1.3951114416122437} 11/07/2021 15:29:39 - INFO - __main__ - Step 129787: {'lr': 2.266299907569702e-05, 'samples': 24919104, 'steps': 129786, 'loss/train': 1.4115803241729736} 11/07/2021 15:29:39 - INFO - __main__ - Step 129788: {'lr': 2.2660791328597637e-05, 'samples': 24919296, 'steps': 129787, 'loss/train': 0.7956430912017822} 11/07/2021 15:29:39 - INFO - __main__ - Step 129789: {'lr': 2.265858368393381e-05, 'samples': 24919488, 'steps': 129788, 'loss/train': 1.4411556720733643} 11/07/2021 15:29:40 - INFO - __main__ - Step 129790: {'lr': 2.2656376141706542e-05, 'samples': 24919680, 'steps': 129789, 'loss/train': 1.180056095123291} 11/07/2021 15:29:41 - INFO - __main__ - Step 129791: {'lr': 2.265416870191678e-05, 'samples': 24919872, 'steps': 129790, 'loss/train': 1.6988753080368042} 11/07/2021 15:29:41 - INFO - __main__ - Step 129792: {'lr': 2.2651961364565545e-05, 'samples': 24920064, 'steps': 129791, 'loss/train': 0.9681733250617981} 11/07/2021 15:29:42 - INFO - __main__ - Step 129793: {'lr': 2.264975412965381e-05, 'samples': 24920256, 'steps': 129792, 'loss/train': 1.1089543104171753} 11/07/2021 15:29:42 - INFO - __main__ - Step 129794: {'lr': 2.2647546997182604e-05, 'samples': 24920448, 'steps': 129793, 'loss/train': 1.0292434692382812} 11/07/2021 15:29:42 - INFO - __main__ - Step 129795: {'lr': 2.26453399671529e-05, 'samples': 24920640, 'steps': 129794, 'loss/train': 0.8709102869033813} 11/07/2021 15:29:44 - INFO - __main__ - Step 129796: {'lr': 2.2643133039565695e-05, 'samples': 24920832, 'steps': 129795, 'loss/train': 0.9984269738197327} 11/07/2021 15:29:44 - INFO - __main__ - Step 129797: {'lr': 2.2640926214422013e-05, 'samples': 24921024, 'steps': 129796, 'loss/train': 1.2279399633407593} 11/07/2021 15:29:44 - INFO - __main__ - Step 129798: {'lr': 2.2638719491722774e-05, 'samples': 24921216, 'steps': 129797, 'loss/train': 1.811454176902771} 11/07/2021 15:29:45 - INFO - __main__ - Step 129799: {'lr': 2.263651287146906e-05, 'samples': 24921408, 'steps': 129798, 'loss/train': 1.2059162855148315} 11/07/2021 15:29:45 - INFO - __main__ - Step 129800: {'lr': 2.2634306353661816e-05, 'samples': 24921600, 'steps': 129799, 'loss/train': 1.503822684288025} 11/07/2021 15:29:46 - INFO - __main__ - Step 129801: {'lr': 2.2632099938302093e-05, 'samples': 24921792, 'steps': 129800, 'loss/train': 1.022418737411499} 11/07/2021 15:29:46 - INFO - __main__ - Step 129802: {'lr': 2.262989362539078e-05, 'samples': 24921984, 'steps': 129801, 'loss/train': 1.4467523097991943} 11/07/2021 15:29:47 - INFO - __main__ - Step 129803: {'lr': 2.2627687414928933e-05, 'samples': 24922176, 'steps': 129802, 'loss/train': 1.1831129789352417} 11/07/2021 15:29:47 - INFO - __main__ - Step 129804: {'lr': 2.2625481306917523e-05, 'samples': 24922368, 'steps': 129803, 'loss/train': 1.2330695390701294} 11/07/2021 15:29:48 - INFO - __main__ - Step 129805: {'lr': 2.2623275301357577e-05, 'samples': 24922560, 'steps': 129804, 'loss/train': 0.8200371861457825} 11/07/2021 15:29:49 - INFO - __main__ - Step 129806: {'lr': 2.2621069398250095e-05, 'samples': 24922752, 'steps': 129805, 'loss/train': 1.6482793092727661} 11/07/2021 15:29:49 - INFO - __main__ - Step 129807: {'lr': 2.261886359759602e-05, 'samples': 24922944, 'steps': 129806, 'loss/train': 1.292747974395752} 11/07/2021 15:29:50 - INFO - __main__ - Step 129808: {'lr': 2.2616657899396404e-05, 'samples': 24923136, 'steps': 129807, 'loss/train': 0.04013850912451744} 11/07/2021 15:29:50 - INFO - __main__ - Step 129809: {'lr': 2.2614452303652195e-05, 'samples': 24923328, 'steps': 129808, 'loss/train': 0.915105402469635} 11/07/2021 15:29:50 - INFO - __main__ - Step 129810: {'lr': 2.2612246810364416e-05, 'samples': 24923520, 'steps': 129809, 'loss/train': 1.8210567235946655} 11/07/2021 15:29:51 - INFO - __main__ - Step 129811: {'lr': 2.2610041419534044e-05, 'samples': 24923712, 'steps': 129810, 'loss/train': 1.4957809448242188} 11/07/2021 15:29:52 - INFO - __main__ - Step 129812: {'lr': 2.2607836131162075e-05, 'samples': 24923904, 'steps': 129811, 'loss/train': 0.11999521404504776} 11/07/2021 15:29:52 - INFO - __main__ - Step 129813: {'lr': 2.2605630945249505e-05, 'samples': 24924096, 'steps': 129812, 'loss/train': 1.1112377643585205} 11/07/2021 15:29:53 - INFO - __main__ - Step 129814: {'lr': 2.2603425861797368e-05, 'samples': 24924288, 'steps': 129813, 'loss/train': 1.2293018102645874} 11/07/2021 15:29:53 - INFO - __main__ - Step 129815: {'lr': 2.26012208808066e-05, 'samples': 24924480, 'steps': 129814, 'loss/train': 0.08674643933773041} 11/07/2021 15:29:53 - INFO - __main__ - Step 129816: {'lr': 2.259901600227818e-05, 'samples': 24924672, 'steps': 129815, 'loss/train': 1.5673611164093018} 11/07/2021 15:29:54 - INFO - __main__ - Step 129817: {'lr': 2.2596811226213153e-05, 'samples': 24924864, 'steps': 129816, 'loss/train': 1.276555061340332} 11/07/2021 15:29:55 - INFO - __main__ - Step 129818: {'lr': 2.2594606552612503e-05, 'samples': 24925056, 'steps': 129817, 'loss/train': 1.6127978563308716} 11/07/2021 15:29:55 - INFO - __main__ - Step 129819: {'lr': 2.2592401981477188e-05, 'samples': 24925248, 'steps': 129818, 'loss/train': 1.6195553541183472} 11/07/2021 15:29:55 - INFO - __main__ - Step 129820: {'lr': 2.259019751280825e-05, 'samples': 24925440, 'steps': 129819, 'loss/train': 1.4276189804077148} 11/07/2021 15:29:56 - INFO - __main__ - Step 129821: {'lr': 2.2587993146606643e-05, 'samples': 24925632, 'steps': 129820, 'loss/train': 1.1775659322738647} 11/07/2021 15:29:56 - INFO - __main__ - Step 129822: {'lr': 2.258578888287338e-05, 'samples': 24925824, 'steps': 129821, 'loss/train': 1.1765552759170532} 11/07/2021 15:29:57 - INFO - __main__ - Step 129823: {'lr': 2.2583584721609456e-05, 'samples': 24926016, 'steps': 129822, 'loss/train': 1.5345829725265503} 11/07/2021 15:29:58 - INFO - __main__ - Step 129824: {'lr': 2.258138066281587e-05, 'samples': 24926208, 'steps': 129823, 'loss/train': 1.4028400182724} 11/07/2021 15:29:58 - INFO - __main__ - Step 129825: {'lr': 2.2579176706493593e-05, 'samples': 24926400, 'steps': 129824, 'loss/train': 1.1157892942428589} 11/07/2021 15:29:58 - INFO - __main__ - Step 129826: {'lr': 2.2576972852643623e-05, 'samples': 24926592, 'steps': 129825, 'loss/train': 0.8915739059448242} 11/07/2021 15:29:59 - INFO - __main__ - Step 129827: {'lr': 2.257476910126699e-05, 'samples': 24926784, 'steps': 129826, 'loss/train': 1.207912802696228} 11/07/2021 15:30:00 - INFO - __main__ - Step 129828: {'lr': 2.2572565452364663e-05, 'samples': 24926976, 'steps': 129827, 'loss/train': 1.1696642637252808} 11/07/2021 15:30:00 - INFO - __main__ - Step 129829: {'lr': 2.2570361905937614e-05, 'samples': 24927168, 'steps': 129828, 'loss/train': 1.2816970348358154} 11/07/2021 15:30:00 - INFO - __main__ - Step 129830: {'lr': 2.2568158461986844e-05, 'samples': 24927360, 'steps': 129829, 'loss/train': 1.4025287628173828} 11/07/2021 15:30:01 - INFO - __main__ - Step 129831: {'lr': 2.256595512051332e-05, 'samples': 24927552, 'steps': 129830, 'loss/train': 1.5543527603149414} 11/07/2021 15:30:01 - INFO - __main__ - Step 129832: {'lr': 2.2563751881518103e-05, 'samples': 24927744, 'steps': 129831, 'loss/train': 1.5050578117370605} 11/07/2021 15:30:02 - INFO - __main__ - Step 129833: {'lr': 2.2561548745002132e-05, 'samples': 24927936, 'steps': 129832, 'loss/train': 1.72592294216156} 11/07/2021 15:30:02 - INFO - __main__ - Step 129834: {'lr': 2.2559345710966434e-05, 'samples': 24928128, 'steps': 129833, 'loss/train': 1.0304625034332275} 11/07/2021 15:30:03 - INFO - __main__ - Step 129835: {'lr': 2.2557142779411982e-05, 'samples': 24928320, 'steps': 129834, 'loss/train': 1.4917058944702148} 11/07/2021 15:30:03 - INFO - __main__ - Step 129836: {'lr': 2.255493995033975e-05, 'samples': 24928512, 'steps': 129835, 'loss/train': 1.107121467590332} 11/07/2021 15:30:03 - INFO - __main__ - Step 129837: {'lr': 2.2552737223750786e-05, 'samples': 24928704, 'steps': 129836, 'loss/train': 1.2353804111480713} 11/07/2021 15:30:05 - INFO - __main__ - Step 129838: {'lr': 2.255053459964601e-05, 'samples': 24928896, 'steps': 129837, 'loss/train': 1.5452313423156738} 11/07/2021 15:30:05 - INFO - __main__ - Step 129839: {'lr': 2.254833207802648e-05, 'samples': 24929088, 'steps': 129838, 'loss/train': 1.3555009365081787} 11/07/2021 15:30:05 - INFO - __main__ - Step 129840: {'lr': 2.254612965889316e-05, 'samples': 24929280, 'steps': 129839, 'loss/train': 1.3434265851974487} 11/07/2021 15:30:06 - INFO - __main__ - Step 129841: {'lr': 2.2543927342247084e-05, 'samples': 24929472, 'steps': 129840, 'loss/train': 1.3484444618225098} 11/07/2021 15:30:06 - INFO - __main__ - Step 129842: {'lr': 2.2541725128089162e-05, 'samples': 24929664, 'steps': 129841, 'loss/train': 1.7076019048690796} 11/07/2021 15:30:07 - INFO - __main__ - Step 129843: {'lr': 2.2539523016420428e-05, 'samples': 24929856, 'steps': 129842, 'loss/train': 1.6702033281326294} 11/07/2021 15:30:07 - INFO - __main__ - Step 129844: {'lr': 2.2537321007241873e-05, 'samples': 24930048, 'steps': 129843, 'loss/train': 0.5905718207359314} 11/07/2021 15:30:08 - INFO - __main__ - Step 129845: {'lr': 2.2535119100554502e-05, 'samples': 24930240, 'steps': 129844, 'loss/train': 1.0990405082702637} 11/07/2021 15:30:08 - INFO - __main__ - Step 129846: {'lr': 2.2532917296359283e-05, 'samples': 24930432, 'steps': 129845, 'loss/train': 1.3178904056549072} 11/07/2021 15:30:08 - INFO - __main__ - Step 129847: {'lr': 2.253071559465722e-05, 'samples': 24930624, 'steps': 129846, 'loss/train': 1.4214600324630737} 11/07/2021 15:30:09 - INFO - __main__ - Step 129848: {'lr': 2.2528513995449307e-05, 'samples': 24930816, 'steps': 129847, 'loss/train': 0.6585802435874939} 11/07/2021 15:30:10 - INFO - __main__ - Step 129849: {'lr': 2.2526312498736544e-05, 'samples': 24931008, 'steps': 129848, 'loss/train': 1.6390408277511597} 11/07/2021 15:30:10 - INFO - __main__ - Step 129850: {'lr': 2.2524111104519905e-05, 'samples': 24931200, 'steps': 129849, 'loss/train': 1.3420722484588623} 11/07/2021 15:30:11 - INFO - __main__ - Step 129851: {'lr': 2.252190981280042e-05, 'samples': 24931392, 'steps': 129850, 'loss/train': 1.383002758026123} 11/07/2021 15:30:11 - INFO - __main__ - Step 129852: {'lr': 2.251970862357902e-05, 'samples': 24931584, 'steps': 129851, 'loss/train': 0.9620668888092041} 11/07/2021 15:30:11 - INFO - __main__ - Step 129853: {'lr': 2.2517507536856748e-05, 'samples': 24931776, 'steps': 129852, 'loss/train': 0.7262072563171387} 11/07/2021 15:30:12 - INFO - __main__ - Step 129854: {'lr': 2.2515306552634562e-05, 'samples': 24931968, 'steps': 129853, 'loss/train': 1.04094398021698} 11/07/2021 15:30:13 - INFO - __main__ - Step 129855: {'lr': 2.2513105670913523e-05, 'samples': 24932160, 'steps': 129854, 'loss/train': 1.2951292991638184} 11/07/2021 15:30:13 - INFO - __main__ - Step 129856: {'lr': 2.2510904891694524e-05, 'samples': 24932352, 'steps': 129855, 'loss/train': 1.6374168395996094} 11/07/2021 15:30:13 - INFO - __main__ - Step 129857: {'lr': 2.2508704214978583e-05, 'samples': 24932544, 'steps': 129856, 'loss/train': 0.9740048050880432} 11/07/2021 15:30:14 - INFO - __main__ - Step 129858: {'lr': 2.250650364076673e-05, 'samples': 24932736, 'steps': 129857, 'loss/train': 0.9627076387405396} 11/07/2021 15:30:15 - INFO - __main__ - Step 129859: {'lr': 2.2504303169059908e-05, 'samples': 24932928, 'steps': 129858, 'loss/train': 0.9693552851676941} 11/07/2021 15:30:15 - INFO - __main__ - Step 129860: {'lr': 2.2502102799859177e-05, 'samples': 24933120, 'steps': 129859, 'loss/train': 0.7752572894096375} 11/07/2021 15:30:16 - INFO - __main__ - Step 129861: {'lr': 2.249990253316547e-05, 'samples': 24933312, 'steps': 129860, 'loss/train': 1.1759592294692993} 11/07/2021 15:30:16 - INFO - __main__ - Step 129862: {'lr': 2.249770236897977e-05, 'samples': 24933504, 'steps': 129861, 'loss/train': 0.5145472884178162} 11/07/2021 15:30:16 - INFO - __main__ - Step 129863: {'lr': 2.2495502307303127e-05, 'samples': 24933696, 'steps': 129862, 'loss/train': 1.5910695791244507} 11/07/2021 15:30:17 - INFO - __main__ - Step 129864: {'lr': 2.2493302348136487e-05, 'samples': 24933888, 'steps': 129863, 'loss/train': 1.2784584760665894} 11/07/2021 15:30:18 - INFO - __main__ - Step 129865: {'lr': 2.249110249148087e-05, 'samples': 24934080, 'steps': 129864, 'loss/train': 1.2966543436050415} 11/07/2021 15:30:18 - INFO - __main__ - Step 129866: {'lr': 2.2488902737337254e-05, 'samples': 24934272, 'steps': 129865, 'loss/train': 1.2453899383544922} 11/07/2021 15:30:19 - INFO - __main__ - Step 129867: {'lr': 2.2486703085706606e-05, 'samples': 24934464, 'steps': 129866, 'loss/train': 1.2220611572265625} 11/07/2021 15:30:19 - INFO - __main__ - Step 129868: {'lr': 2.2484503536589984e-05, 'samples': 24934656, 'steps': 129867, 'loss/train': 1.0139672756195068} 11/07/2021 15:30:20 - INFO - __main__ - Step 129869: {'lr': 2.2482304089988303e-05, 'samples': 24934848, 'steps': 129868, 'loss/train': 1.2714512348175049} 11/07/2021 15:30:20 - INFO - __main__ - Step 129870: {'lr': 2.248010474590259e-05, 'samples': 24935040, 'steps': 129869, 'loss/train': 1.416741132736206} 11/07/2021 15:30:21 - INFO - __main__ - Step 129871: {'lr': 2.247790550433382e-05, 'samples': 24935232, 'steps': 129870, 'loss/train': 1.353295922279358} 11/07/2021 15:30:21 - INFO - __main__ - Step 129872: {'lr': 2.2475706365282984e-05, 'samples': 24935424, 'steps': 129871, 'loss/train': 1.0856900215148926} 11/07/2021 15:30:21 - INFO - __main__ - Step 129873: {'lr': 2.2473507328751085e-05, 'samples': 24935616, 'steps': 129872, 'loss/train': 1.3463139533996582} 11/07/2021 15:30:22 - INFO - __main__ - Step 129874: {'lr': 2.2471308394739127e-05, 'samples': 24935808, 'steps': 129873, 'loss/train': 1.1048030853271484} 11/07/2021 15:30:23 - INFO - __main__ - Step 129875: {'lr': 2.2469109563248103e-05, 'samples': 24936000, 'steps': 129874, 'loss/train': 0.6832454800605774} 11/07/2021 15:30:23 - INFO - __main__ - Step 129876: {'lr': 2.246691083427896e-05, 'samples': 24936192, 'steps': 129875, 'loss/train': 2.009941816329956} 11/07/2021 15:30:23 - INFO - __main__ - Step 129877: {'lr': 2.246471220783272e-05, 'samples': 24936384, 'steps': 129876, 'loss/train': 1.5214639902114868} 11/07/2021 15:30:24 - INFO - __main__ - Step 129878: {'lr': 2.2462513683910362e-05, 'samples': 24936576, 'steps': 129877, 'loss/train': 1.4162787199020386} 11/07/2021 15:30:24 - INFO - __main__ - Step 129879: {'lr': 2.246031526251291e-05, 'samples': 24936768, 'steps': 129878, 'loss/train': 1.839347004890442} 11/07/2021 15:30:25 - INFO - __main__ - Step 129880: {'lr': 2.2458116943641305e-05, 'samples': 24936960, 'steps': 129879, 'loss/train': 0.8556408882141113} 11/07/2021 15:30:25 - INFO - __main__ - Step 129881: {'lr': 2.2455918727296602e-05, 'samples': 24937152, 'steps': 129880, 'loss/train': 1.324446678161621} 11/07/2021 15:30:26 - INFO - __main__ - Step 129882: {'lr': 2.2453720613479722e-05, 'samples': 24937344, 'steps': 129881, 'loss/train': 0.9201112985610962} 11/07/2021 15:30:26 - INFO - __main__ - Step 129883: {'lr': 2.2451522602191688e-05, 'samples': 24937536, 'steps': 129882, 'loss/train': 1.1788411140441895} 11/07/2021 15:30:27 - INFO - __main__ - Step 129884: {'lr': 2.244932469343347e-05, 'samples': 24937728, 'steps': 129883, 'loss/train': 1.4671809673309326} 11/07/2021 15:30:28 - INFO - __main__ - Step 129885: {'lr': 2.24471268872061e-05, 'samples': 24937920, 'steps': 129884, 'loss/train': 1.253709316253662} 11/07/2021 15:30:28 - INFO - __main__ - Step 129886: {'lr': 2.2444929183510517e-05, 'samples': 24938112, 'steps': 129885, 'loss/train': 1.3603756427764893} 11/07/2021 15:30:28 - INFO - __main__ - Step 129887: {'lr': 2.2442731582347748e-05, 'samples': 24938304, 'steps': 129886, 'loss/train': 1.0267713069915771} 11/07/2021 15:30:29 - INFO - __main__ - Step 129888: {'lr': 2.2440534083718767e-05, 'samples': 24938496, 'steps': 129887, 'loss/train': 1.1729836463928223} 11/07/2021 15:30:29 - INFO - __main__ - Step 129889: {'lr': 2.24383366876246e-05, 'samples': 24938688, 'steps': 129888, 'loss/train': 1.983036994934082} 11/07/2021 15:30:30 - INFO - __main__ - Step 129890: {'lr': 2.243613939406616e-05, 'samples': 24938880, 'steps': 129889, 'loss/train': 1.5039358139038086} 11/07/2021 15:30:30 - INFO - __main__ - Step 129891: {'lr': 2.243394220304451e-05, 'samples': 24939072, 'steps': 129890, 'loss/train': 1.626548171043396} 11/07/2021 15:30:31 - INFO - __main__ - Step 129892: {'lr': 2.2431745114560614e-05, 'samples': 24939264, 'steps': 129891, 'loss/train': 1.2345491647720337} 11/07/2021 15:30:31 - INFO - __main__ - Step 129893: {'lr': 2.2429548128615472e-05, 'samples': 24939456, 'steps': 129892, 'loss/train': 1.1043879985809326} 11/07/2021 15:30:31 - INFO - __main__ - Step 129894: {'lr': 2.2427351245210032e-05, 'samples': 24939648, 'steps': 129893, 'loss/train': 0.8217021822929382} 11/07/2021 15:30:32 - INFO - __main__ - Step 129895: {'lr': 2.2425154464345397e-05, 'samples': 24939840, 'steps': 129894, 'loss/train': 1.5487926006317139} 11/07/2021 15:30:33 - INFO - __main__ - Step 129896: {'lr': 2.242295778602241e-05, 'samples': 24940032, 'steps': 129895, 'loss/train': 0.9323079586029053} 11/07/2021 15:30:33 - INFO - __main__ - Step 129897: {'lr': 2.2420761210242113e-05, 'samples': 24940224, 'steps': 129896, 'loss/train': 1.3634401559829712} 11/07/2021 15:30:34 - INFO - __main__ - Step 129898: {'lr': 2.2418564737005543e-05, 'samples': 24940416, 'steps': 129897, 'loss/train': 1.3088244199752808} 11/07/2021 15:30:34 - INFO - __main__ - Step 129899: {'lr': 2.241636836631364e-05, 'samples': 24940608, 'steps': 129898, 'loss/train': 1.3474907875061035} 11/07/2021 15:30:35 - INFO - __main__ - Step 129900: {'lr': 2.24141720981674e-05, 'samples': 24940800, 'steps': 129899, 'loss/train': 1.2690283060073853} 11/07/2021 15:30:35 - INFO - __main__ - Step 129901: {'lr': 2.2411975932567828e-05, 'samples': 24940992, 'steps': 129900, 'loss/train': 1.1740422248840332} 11/07/2021 15:30:36 - INFO - __main__ - Step 129902: {'lr': 2.2409779869515922e-05, 'samples': 24941184, 'steps': 129901, 'loss/train': 1.361068844795227} 11/07/2021 15:30:36 - INFO - __main__ - Step 129903: {'lr': 2.2407583909012624e-05, 'samples': 24941376, 'steps': 129902, 'loss/train': 1.809840202331543} 11/07/2021 15:30:36 - INFO - __main__ - Step 129904: {'lr': 2.2405388051058988e-05, 'samples': 24941568, 'steps': 129903, 'loss/train': 1.1663836240768433} 11/07/2021 15:30:37 - INFO - __main__ - Step 129905: {'lr': 2.240319229565596e-05, 'samples': 24941760, 'steps': 129904, 'loss/train': 1.4562758207321167} 11/07/2021 15:30:38 - INFO - __main__ - Step 129906: {'lr': 2.240099664280454e-05, 'samples': 24941952, 'steps': 129905, 'loss/train': 1.512674331665039} 11/07/2021 15:30:38 - INFO - __main__ - Step 129907: {'lr': 2.239880109250572e-05, 'samples': 24942144, 'steps': 129906, 'loss/train': 1.4034154415130615} 11/07/2021 15:30:38 - INFO - __main__ - Step 129908: {'lr': 2.2396605644760536e-05, 'samples': 24942336, 'steps': 129907, 'loss/train': 1.3914768695831299} 11/07/2021 15:30:39 - INFO - __main__ - Step 129909: {'lr': 2.23944102995699e-05, 'samples': 24942528, 'steps': 129908, 'loss/train': 1.1827459335327148} 11/07/2021 15:30:40 - INFO - __main__ - Step 129910: {'lr': 2.239221505693481e-05, 'samples': 24942720, 'steps': 129909, 'loss/train': 1.090771198272705} 11/07/2021 15:30:40 - INFO - __main__ - Step 129911: {'lr': 2.23900199168563e-05, 'samples': 24942912, 'steps': 129910, 'loss/train': 1.384418249130249} 11/07/2021 15:30:40 - INFO - __main__ - Step 129912: {'lr': 2.23878248793353e-05, 'samples': 24943104, 'steps': 129911, 'loss/train': 0.7528942227363586} 11/07/2021 15:30:41 - INFO - __main__ - Step 129913: {'lr': 2.238562994437285e-05, 'samples': 24943296, 'steps': 129912, 'loss/train': 1.2251815795898438} 11/07/2021 15:30:41 - INFO - __main__ - Step 129914: {'lr': 2.2383435111969914e-05, 'samples': 24943488, 'steps': 129913, 'loss/train': 1.1760059595108032} 11/07/2021 15:30:42 - INFO - __main__ - Step 129915: {'lr': 2.2381240382127494e-05, 'samples': 24943680, 'steps': 129914, 'loss/train': 1.1377545595169067} 11/07/2021 15:30:43 - INFO - __main__ - Step 129916: {'lr': 2.2379045754846588e-05, 'samples': 24943872, 'steps': 129915, 'loss/train': 1.1001651287078857} 11/07/2021 15:30:43 - INFO - __main__ - Step 129917: {'lr': 2.237685123012817e-05, 'samples': 24944064, 'steps': 129916, 'loss/train': 1.336625576019287} 11/07/2021 15:30:43 - INFO - __main__ - Step 129918: {'lr': 2.237465680797321e-05, 'samples': 24944256, 'steps': 129917, 'loss/train': 0.7663359642028809} 11/07/2021 15:30:44 - INFO - __main__ - Step 129919: {'lr': 2.2372462488382734e-05, 'samples': 24944448, 'steps': 129918, 'loss/train': 1.2038075923919678} 11/07/2021 15:30:44 - INFO - __main__ - Step 129920: {'lr': 2.2370268271357712e-05, 'samples': 24944640, 'steps': 129919, 'loss/train': 0.927400529384613} 11/07/2021 15:30:45 - INFO - __main__ - Step 129921: {'lr': 2.236807415689912e-05, 'samples': 24944832, 'steps': 129920, 'loss/train': 1.0487340688705444} 11/07/2021 15:30:45 - INFO - __main__ - Step 129922: {'lr': 2.236588014500804e-05, 'samples': 24945024, 'steps': 129921, 'loss/train': 1.1726168394088745} 11/07/2021 15:30:46 - INFO - __main__ - Step 129923: {'lr': 2.23636862356853e-05, 'samples': 24945216, 'steps': 129922, 'loss/train': 1.1883811950683594} 11/07/2021 15:30:46 - INFO - __main__ - Step 129924: {'lr': 2.2361492428931983e-05, 'samples': 24945408, 'steps': 129923, 'loss/train': 1.4560999870300293} 11/07/2021 15:30:47 - INFO - __main__ - Step 129925: {'lr': 2.2359298724749066e-05, 'samples': 24945600, 'steps': 129924, 'loss/train': 1.450439214706421} 11/07/2021 15:30:48 - INFO - __main__ - Step 129926: {'lr': 2.2357105123137544e-05, 'samples': 24945792, 'steps': 129925, 'loss/train': 0.9047523736953735} 11/07/2021 15:30:48 - INFO - __main__ - Step 129927: {'lr': 2.235491162409839e-05, 'samples': 24945984, 'steps': 129926, 'loss/train': 1.6916875839233398} 11/07/2021 15:30:49 - INFO - __main__ - Step 129928: {'lr': 2.2352718227632603e-05, 'samples': 24946176, 'steps': 129927, 'loss/train': 0.9462178945541382} 11/07/2021 15:30:49 - INFO - __main__ - Step 129929: {'lr': 2.235052493374118e-05, 'samples': 24946368, 'steps': 129928, 'loss/train': 1.187447190284729} 11/07/2021 15:30:49 - INFO - __main__ - Step 129930: {'lr': 2.2348331742425065e-05, 'samples': 24946560, 'steps': 129929, 'loss/train': 1.3307712078094482} 11/07/2021 15:30:50 - INFO - __main__ - Step 129931: {'lr': 2.2346138653685317e-05, 'samples': 24946752, 'steps': 129930, 'loss/train': 1.4568883180618286} 11/07/2021 15:30:51 - INFO - __main__ - Step 129932: {'lr': 2.2343945667522848e-05, 'samples': 24946944, 'steps': 129931, 'loss/train': 1.3452095985412598} 11/07/2021 15:30:51 - INFO - __main__ - Step 129933: {'lr': 2.2341752783938712e-05, 'samples': 24947136, 'steps': 129932, 'loss/train': 1.0020661354064941} 11/07/2021 15:30:51 - INFO - __main__ - Step 129934: {'lr': 2.2339560002933857e-05, 'samples': 24947328, 'steps': 129933, 'loss/train': 0.05693649500608444} 11/07/2021 15:30:52 - INFO - __main__ - Step 129935: {'lr': 2.2337367324509334e-05, 'samples': 24947520, 'steps': 129934, 'loss/train': 1.2133866548538208} 11/07/2021 15:30:52 - INFO - __main__ - Step 129936: {'lr': 2.2335174748666033e-05, 'samples': 24947712, 'steps': 129935, 'loss/train': 1.139337420463562} 11/07/2021 15:30:53 - INFO - __main__ - Step 129937: {'lr': 2.2332982275405006e-05, 'samples': 24947904, 'steps': 129936, 'loss/train': 1.4173654317855835} 11/07/2021 15:30:54 - INFO - __main__ - Step 129938: {'lr': 2.23307899047272e-05, 'samples': 24948096, 'steps': 129937, 'loss/train': 1.1582527160644531} 11/07/2021 15:30:54 - INFO - __main__ - Step 129939: {'lr': 2.232859763663364e-05, 'samples': 24948288, 'steps': 129938, 'loss/train': 1.2116072177886963} 11/07/2021 15:30:54 - INFO - __main__ - Step 129940: {'lr': 2.2326405471125272e-05, 'samples': 24948480, 'steps': 129939, 'loss/train': 1.1667273044586182} 11/07/2021 15:30:55 - INFO - __main__ - Step 129941: {'lr': 2.232421340820315e-05, 'samples': 24948672, 'steps': 129940, 'loss/train': 1.4471832513809204} 11/07/2021 15:30:56 - INFO - __main__ - Step 129942: {'lr': 2.232202144786821e-05, 'samples': 24948864, 'steps': 129941, 'loss/train': 1.1163121461868286} 11/07/2021 15:30:56 - INFO - __main__ - Step 129943: {'lr': 2.2319829590121466e-05, 'samples': 24949056, 'steps': 129942, 'loss/train': 1.0651777982711792} 11/07/2021 15:30:56 - INFO - __main__ - Step 129944: {'lr': 2.231763783496388e-05, 'samples': 24949248, 'steps': 129943, 'loss/train': 1.2827332019805908} 11/07/2021 15:30:57 - INFO - __main__ - Step 129945: {'lr': 2.231544618239645e-05, 'samples': 24949440, 'steps': 129944, 'loss/train': 1.2125195264816284} 11/07/2021 15:30:57 - INFO - __main__ - Step 129946: {'lr': 2.2313254632420148e-05, 'samples': 24949632, 'steps': 129945, 'loss/train': 1.1082103252410889} 11/07/2021 15:30:58 - INFO - __main__ - Step 129947: {'lr': 2.2311063185036007e-05, 'samples': 24949824, 'steps': 129946, 'loss/train': 0.908994734287262} 11/07/2021 15:30:58 - INFO - __main__ - Step 129948: {'lr': 2.2308871840244994e-05, 'samples': 24950016, 'steps': 129947, 'loss/train': 1.5069453716278076} 11/07/2021 15:30:59 - INFO - __main__ - Step 129949: {'lr': 2.2306680598048134e-05, 'samples': 24950208, 'steps': 129948, 'loss/train': 0.1824704110622406} 11/07/2021 15:30:59 - INFO - __main__ - Step 129950: {'lr': 2.2304489458446292e-05, 'samples': 24950400, 'steps': 129949, 'loss/train': 1.7128007411956787} 11/07/2021 15:30:59 - INFO - __main__ - Step 129951: {'lr': 2.2302298421440575e-05, 'samples': 24950592, 'steps': 129950, 'loss/train': 1.2058051824569702} 11/07/2021 15:31:00 - INFO - __main__ - Step 129952: {'lr': 2.2300107487031903e-05, 'samples': 24950784, 'steps': 129951, 'loss/train': 1.0902281999588013} 11/07/2021 15:31:01 - INFO - __main__ - Step 129953: {'lr': 2.2297916655221295e-05, 'samples': 24950976, 'steps': 129952, 'loss/train': 1.1304479837417603} 11/07/2021 15:31:01 - INFO - __main__ - Step 129954: {'lr': 2.229572592600973e-05, 'samples': 24951168, 'steps': 129953, 'loss/train': 1.0192147493362427} 11/07/2021 15:31:02 - INFO - __main__ - Step 129955: {'lr': 2.2293535299398203e-05, 'samples': 24951360, 'steps': 129954, 'loss/train': 1.267568588256836} 11/07/2021 15:31:02 - INFO - __main__ - Step 129956: {'lr': 2.229134477538769e-05, 'samples': 24951552, 'steps': 129955, 'loss/train': 1.200925350189209} 11/07/2021 15:31:02 - INFO - __main__ - Step 129957: {'lr': 2.2289154353979186e-05, 'samples': 24951744, 'steps': 129956, 'loss/train': 1.515321135520935} 11/07/2021 15:31:03 - INFO - __main__ - Step 129958: {'lr': 2.228696403517369e-05, 'samples': 24951936, 'steps': 129957, 'loss/train': 1.2154459953308105} 11/07/2021 15:31:04 - INFO - __main__ - Step 129959: {'lr': 2.228477381897215e-05, 'samples': 24952128, 'steps': 129958, 'loss/train': 1.2598968744277954} 11/07/2021 15:31:04 - INFO - __main__ - Step 129960: {'lr': 2.2282583705375587e-05, 'samples': 24952320, 'steps': 129959, 'loss/train': 1.326845645904541} 11/07/2021 15:31:04 - INFO - __main__ - Step 129961: {'lr': 2.2280393694384978e-05, 'samples': 24952512, 'steps': 129960, 'loss/train': 1.3961254358291626} 11/07/2021 15:31:05 - INFO - __main__ - Step 129962: {'lr': 2.2278203786001345e-05, 'samples': 24952704, 'steps': 129961, 'loss/train': 1.6513830423355103} 11/07/2021 15:31:06 - INFO - __main__ - Step 129963: {'lr': 2.2276013980225606e-05, 'samples': 24952896, 'steps': 129962, 'loss/train': 1.3722857236862183} 11/07/2021 15:31:06 - INFO - __main__ - Step 129964: {'lr': 2.227382427705879e-05, 'samples': 24953088, 'steps': 129963, 'loss/train': 0.9640253186225891} 11/07/2021 15:31:07 - INFO - __main__ - Step 129965: {'lr': 2.2271634676501866e-05, 'samples': 24953280, 'steps': 129964, 'loss/train': 1.7819536924362183} 11/07/2021 15:31:07 - INFO - __main__ - Step 129966: {'lr': 2.226944517855581e-05, 'samples': 24953472, 'steps': 129965, 'loss/train': 1.3800804615020752} 11/07/2021 15:31:07 - INFO - __main__ - Step 129967: {'lr': 2.226725578322167e-05, 'samples': 24953664, 'steps': 129966, 'loss/train': 1.361901044845581} 11/07/2021 15:31:08 - INFO - __main__ - Step 129968: {'lr': 2.2265066490500363e-05, 'samples': 24953856, 'steps': 129967, 'loss/train': 1.3892619609832764} 11/07/2021 15:31:09 - INFO - __main__ - Step 129969: {'lr': 2.2262877300392893e-05, 'samples': 24954048, 'steps': 129968, 'loss/train': 0.7465394735336304} 11/07/2021 15:31:09 - INFO - __main__ - Step 129970: {'lr': 2.2260688212900284e-05, 'samples': 24954240, 'steps': 129969, 'loss/train': 1.5230751037597656} 11/07/2021 15:31:09 - INFO - __main__ - Step 129971: {'lr': 2.2258499228023476e-05, 'samples': 24954432, 'steps': 129970, 'loss/train': 1.3770818710327148} 11/07/2021 15:31:10 - INFO - __main__ - Step 129972: {'lr': 2.2256310345763474e-05, 'samples': 24954624, 'steps': 129971, 'loss/train': 1.2678085565567017} 11/07/2021 15:31:11 - INFO - __main__ - Step 129973: {'lr': 2.2254121566121248e-05, 'samples': 24954816, 'steps': 129972, 'loss/train': 1.357624888420105} 11/07/2021 15:31:11 - INFO - __main__ - Step 129974: {'lr': 2.2251932889097827e-05, 'samples': 24955008, 'steps': 129973, 'loss/train': 1.0163955688476562} 11/07/2021 15:31:11 - INFO - __main__ - Step 129975: {'lr': 2.2249744314694175e-05, 'samples': 24955200, 'steps': 129974, 'loss/train': 1.1913539171218872} 11/07/2021 15:31:12 - INFO - __main__ - Step 129976: {'lr': 2.22475558429113e-05, 'samples': 24955392, 'steps': 129975, 'loss/train': 1.4549531936645508} 11/07/2021 15:31:12 - INFO - __main__ - Step 129977: {'lr': 2.224536747375011e-05, 'samples': 24955584, 'steps': 129976, 'loss/train': 0.8054185509681702} 11/07/2021 15:31:13 - INFO - __main__ - Step 129978: {'lr': 2.2243179207211665e-05, 'samples': 24955776, 'steps': 129977, 'loss/train': 1.2067779302597046} 11/07/2021 15:31:13 - INFO - __main__ - Step 129979: {'lr': 2.2240991043296938e-05, 'samples': 24955968, 'steps': 129978, 'loss/train': 1.1501147747039795} 11/07/2021 15:31:14 - INFO - __main__ - Step 129980: {'lr': 2.2238802982006868e-05, 'samples': 24956160, 'steps': 129979, 'loss/train': 1.4139511585235596} 11/07/2021 15:31:14 - INFO - __main__ - Step 129981: {'lr': 2.223661502334251e-05, 'samples': 24956352, 'steps': 129980, 'loss/train': 1.2981979846954346} 11/07/2021 15:31:15 - INFO - __main__ - Step 129982: {'lr': 2.223442716730481e-05, 'samples': 24956544, 'steps': 129981, 'loss/train': 0.7984762787818909} 11/07/2021 15:31:16 - INFO - __main__ - Step 129983: {'lr': 2.2232239413894766e-05, 'samples': 24956736, 'steps': 129982, 'loss/train': 1.5082790851593018} 11/07/2021 15:31:16 - INFO - __main__ - Step 129984: {'lr': 2.2230051763113353e-05, 'samples': 24956928, 'steps': 129983, 'loss/train': 1.3983542919158936} 11/07/2021 15:31:16 - INFO - __main__ - Step 129985: {'lr': 2.2227864214961562e-05, 'samples': 24957120, 'steps': 129984, 'loss/train': 1.2519867420196533} 11/07/2021 15:31:17 - INFO - __main__ - Step 129986: {'lr': 2.2225676769440373e-05, 'samples': 24957312, 'steps': 129985, 'loss/train': 1.1593983173370361} 11/07/2021 15:31:17 - INFO - __main__ - Step 129987: {'lr': 2.2223489426550808e-05, 'samples': 24957504, 'steps': 129986, 'loss/train': 1.1420565843582153} 11/07/2021 15:31:18 - INFO - __main__ - Step 129988: {'lr': 2.2221302186293813e-05, 'samples': 24957696, 'steps': 129987, 'loss/train': 1.4264159202575684} 11/07/2021 15:31:18 - INFO - __main__ - Step 129989: {'lr': 2.2219115048670415e-05, 'samples': 24957888, 'steps': 129988, 'loss/train': 0.5818977355957031} 11/07/2021 15:31:19 - INFO - __main__ - Step 129990: {'lr': 2.2216928013681524e-05, 'samples': 24958080, 'steps': 129989, 'loss/train': 1.1761783361434937} 11/07/2021 15:31:19 - INFO - __main__ - Step 129991: {'lr': 2.2214741081328178e-05, 'samples': 24958272, 'steps': 129990, 'loss/train': 1.442885398864746} 11/07/2021 15:31:20 - INFO - __main__ - Step 129992: {'lr': 2.2212554251611366e-05, 'samples': 24958464, 'steps': 129991, 'loss/train': 1.1820826530456543} 11/07/2021 15:31:21 - INFO - __main__ - Step 129993: {'lr': 2.2210367524532037e-05, 'samples': 24958656, 'steps': 129992, 'loss/train': 1.4186865091323853} 11/07/2021 15:31:21 - INFO - __main__ - Step 129994: {'lr': 2.2208180900091217e-05, 'samples': 24958848, 'steps': 129993, 'loss/train': 1.0942516326904297} 11/07/2021 15:31:21 - INFO - __main__ - Step 129995: {'lr': 2.220599437828988e-05, 'samples': 24959040, 'steps': 129994, 'loss/train': 1.269180178642273} 11/07/2021 15:31:22 - INFO - __main__ - Step 129996: {'lr': 2.220380795912899e-05, 'samples': 24959232, 'steps': 129995, 'loss/train': 1.0350733995437622} 11/07/2021 15:31:22 - INFO - __main__ - Step 129997: {'lr': 2.220162164260958e-05, 'samples': 24959424, 'steps': 129996, 'loss/train': 1.4890549182891846} 11/07/2021 15:31:23 - INFO - __main__ - Step 129998: {'lr': 2.219943542873257e-05, 'samples': 24959616, 'steps': 129997, 'loss/train': 1.3163820505142212} 11/07/2021 15:31:23 - INFO - __main__ - Step 129999: {'lr': 2.2197249317499003e-05, 'samples': 24959808, 'steps': 129998, 'loss/train': 0.9964282512664795} 11/07/2021 15:31:24 - INFO - __main__ - Step 130000: {'lr': 2.219506330890983e-05, 'samples': 24960000, 'steps': 129999, 'loss/train': 1.6928282976150513} 11/07/2021 15:31:24 - INFO - __main__ - Step 130001: {'lr': 2.219287740296605e-05, 'samples': 24960192, 'steps': 130000, 'loss/train': 1.2128307819366455} 11/07/2021 15:31:24 - INFO - __main__ - Step 130002: {'lr': 2.2190691599668687e-05, 'samples': 24960384, 'steps': 130001, 'loss/train': 1.0425214767456055} 11/07/2021 15:31:25 - INFO - __main__ - Step 130003: {'lr': 2.2188505899018635e-05, 'samples': 24960576, 'steps': 130002, 'loss/train': 1.2828176021575928} 11/07/2021 15:31:26 - INFO - __main__ - Step 130004: {'lr': 2.2186320301016915e-05, 'samples': 24960768, 'steps': 130003, 'loss/train': 2.3120055198669434} 11/07/2021 15:31:26 - INFO - __main__ - Step 130005: {'lr': 2.218413480566456e-05, 'samples': 24960960, 'steps': 130004, 'loss/train': 0.38484394550323486} 11/07/2021 15:31:27 - INFO - __main__ - Step 130006: {'lr': 2.2181949412962476e-05, 'samples': 24961152, 'steps': 130005, 'loss/train': 1.6508255004882812} 11/07/2021 15:31:27 - INFO - __main__ - Step 130007: {'lr': 2.2179764122911727e-05, 'samples': 24961344, 'steps': 130006, 'loss/train': 1.2915905714035034} 11/07/2021 15:31:28 - INFO - __main__ - Step 130008: {'lr': 2.2177578935513225e-05, 'samples': 24961536, 'steps': 130007, 'loss/train': 1.223225712776184} 11/07/2021 15:31:28 - INFO - __main__ - Step 130009: {'lr': 2.217539385076803e-05, 'samples': 24961728, 'steps': 130008, 'loss/train': 1.4491184949874878} 11/07/2021 15:31:29 - INFO - __main__ - Step 130010: {'lr': 2.2173208868677073e-05, 'samples': 24961920, 'steps': 130009, 'loss/train': 0.9955521821975708} 11/07/2021 15:31:29 - INFO - __main__ - Step 130011: {'lr': 2.217102398924134e-05, 'samples': 24962112, 'steps': 130010, 'loss/train': 1.4727375507354736} 11/07/2021 15:31:29 - INFO - __main__ - Step 130012: {'lr': 2.2168839212461878e-05, 'samples': 24962304, 'steps': 130011, 'loss/train': 1.299240231513977} 11/07/2021 15:31:30 - INFO - __main__ - Step 130013: {'lr': 2.2166654538339575e-05, 'samples': 24962496, 'steps': 130012, 'loss/train': 1.0654690265655518} 11/07/2021 15:31:31 - INFO - __main__ - Step 130014: {'lr': 2.216446996687546e-05, 'samples': 24962688, 'steps': 130013, 'loss/train': 1.4071745872497559} 11/07/2021 15:31:31 - INFO - __main__ - Step 130015: {'lr': 2.2162285498070533e-05, 'samples': 24962880, 'steps': 130014, 'loss/train': 1.0278111696243286} 11/07/2021 15:31:31 - INFO - __main__ - Step 130016: {'lr': 2.2160101131925735e-05, 'samples': 24963072, 'steps': 130015, 'loss/train': 0.7367668747901917} 11/07/2021 15:31:32 - INFO - __main__ - Step 130017: {'lr': 2.2157916868442126e-05, 'samples': 24963264, 'steps': 130016, 'loss/train': 1.5518313646316528} 11/07/2021 15:31:32 - INFO - __main__ - Step 130018: {'lr': 2.2155732707620614e-05, 'samples': 24963456, 'steps': 130017, 'loss/train': 1.3131479024887085} 11/07/2021 15:31:33 - INFO - __main__ - Step 130019: {'lr': 2.2153548649462203e-05, 'samples': 24963648, 'steps': 130018, 'loss/train': 1.2131198644638062} 11/07/2021 15:31:34 - INFO - __main__ - Step 130020: {'lr': 2.2151364693967918e-05, 'samples': 24963840, 'steps': 130019, 'loss/train': 1.0681768655776978} 11/07/2021 15:31:34 - INFO - __main__ - Step 130021: {'lr': 2.214918084113868e-05, 'samples': 24964032, 'steps': 130020, 'loss/train': 0.9595240354537964} 11/07/2021 15:31:34 - INFO - __main__ - Step 130022: {'lr': 2.2146997090975508e-05, 'samples': 24964224, 'steps': 130021, 'loss/train': 0.044561322778463364} 11/07/2021 15:31:35 - INFO - __main__ - Step 130023: {'lr': 2.2144813443479462e-05, 'samples': 24964416, 'steps': 130022, 'loss/train': 1.20440673828125} 11/07/2021 15:31:36 - INFO - __main__ - Step 130024: {'lr': 2.2142629898651372e-05, 'samples': 24964608, 'steps': 130023, 'loss/train': 1.3409823179244995} 11/07/2021 15:31:36 - INFO - __main__ - Step 130025: {'lr': 2.2140446456492298e-05, 'samples': 24964800, 'steps': 130024, 'loss/train': 1.5920593738555908} 11/07/2021 15:31:37 - INFO - __main__ - Step 130026: {'lr': 2.2138263117003232e-05, 'samples': 24964992, 'steps': 130025, 'loss/train': 1.2945259809494019} 11/07/2021 15:31:37 - INFO - __main__ - Step 130027: {'lr': 2.213607988018515e-05, 'samples': 24965184, 'steps': 130026, 'loss/train': 1.444514274597168} 11/07/2021 15:31:37 - INFO - __main__ - Step 130028: {'lr': 2.2133896746039024e-05, 'samples': 24965376, 'steps': 130027, 'loss/train': 1.1075866222381592} 11/07/2021 15:31:38 - INFO - __main__ - Step 130029: {'lr': 2.213171371456585e-05, 'samples': 24965568, 'steps': 130028, 'loss/train': 1.1239678859710693} 11/07/2021 15:31:39 - INFO - __main__ - Step 130030: {'lr': 2.2129530785766628e-05, 'samples': 24965760, 'steps': 130029, 'loss/train': 1.4669841527938843} 11/07/2021 15:31:39 - INFO - __main__ - Step 130031: {'lr': 2.21273479596423e-05, 'samples': 24965952, 'steps': 130030, 'loss/train': 1.4833383560180664} 11/07/2021 15:31:39 - INFO - __main__ - Step 130032: {'lr': 2.21251652361939e-05, 'samples': 24966144, 'steps': 130031, 'loss/train': 0.9506120085716248} 11/07/2021 15:31:40 - INFO - __main__ - Step 130033: {'lr': 2.2122982615422364e-05, 'samples': 24966336, 'steps': 130032, 'loss/train': 0.9804279208183289} 11/07/2021 15:31:41 - INFO - __main__ - Step 130034: {'lr': 2.2120800097328724e-05, 'samples': 24966528, 'steps': 130033, 'loss/train': 1.3326869010925293} 11/07/2021 15:31:41 - INFO - __main__ - Step 130035: {'lr': 2.2118617681913922e-05, 'samples': 24966720, 'steps': 130034, 'loss/train': 1.0955144166946411} 11/07/2021 15:31:41 - INFO - __main__ - Step 130036: {'lr': 2.2116435369178927e-05, 'samples': 24966912, 'steps': 130035, 'loss/train': 1.5064176321029663} 11/07/2021 15:31:42 - INFO - __main__ - Step 130037: {'lr': 2.211425315912477e-05, 'samples': 24967104, 'steps': 130036, 'loss/train': 1.317770004272461} 11/07/2021 15:31:42 - INFO - __main__ - Step 130038: {'lr': 2.211207105175242e-05, 'samples': 24967296, 'steps': 130037, 'loss/train': 1.2570395469665527} 11/07/2021 15:31:43 - INFO - __main__ - Step 130039: {'lr': 2.210988904706282e-05, 'samples': 24967488, 'steps': 130038, 'loss/train': 1.1180973052978516} 11/07/2021 15:31:44 - INFO - __main__ - Step 130040: {'lr': 2.2107707145057026e-05, 'samples': 24967680, 'steps': 130039, 'loss/train': 0.927672803401947} 11/07/2021 15:31:44 - INFO - __main__ - Step 130041: {'lr': 2.2105525345735954e-05, 'samples': 24967872, 'steps': 130040, 'loss/train': 0.5729820132255554} 11/07/2021 15:31:44 - INFO - __main__ - Step 130042: {'lr': 2.2103343649100633e-05, 'samples': 24968064, 'steps': 130041, 'loss/train': 1.0316053628921509} 11/07/2021 15:31:45 - INFO - __main__ - Step 130043: {'lr': 2.210116205515203e-05, 'samples': 24968256, 'steps': 130042, 'loss/train': 0.979061484336853} 11/07/2021 15:31:45 - INFO - __main__ - Step 130044: {'lr': 2.209898056389112e-05, 'samples': 24968448, 'steps': 130043, 'loss/train': 1.162644624710083} 11/07/2021 15:31:46 - INFO - __main__ - Step 130045: {'lr': 2.2096799175318926e-05, 'samples': 24968640, 'steps': 130044, 'loss/train': 1.0467475652694702} 11/07/2021 15:31:46 - INFO - __main__ - Step 130046: {'lr': 2.209461788943637e-05, 'samples': 24968832, 'steps': 130045, 'loss/train': 1.2600836753845215} 11/07/2021 15:31:47 - INFO - __main__ - Step 130047: {'lr': 2.2092436706244474e-05, 'samples': 24969024, 'steps': 130046, 'loss/train': 1.294661521911621} 11/07/2021 15:31:47 - INFO - __main__ - Step 130048: {'lr': 2.209025562574418e-05, 'samples': 24969216, 'steps': 130047, 'loss/train': 1.1693084239959717} 11/07/2021 15:31:47 - INFO - __main__ - Step 130049: {'lr': 2.2088074647936523e-05, 'samples': 24969408, 'steps': 130048, 'loss/train': 0.5283559560775757} 11/07/2021 15:31:48 - INFO - __main__ - Step 130050: {'lr': 2.208589377282244e-05, 'samples': 24969600, 'steps': 130049, 'loss/train': 0.7325101494789124} 11/07/2021 15:31:49 - INFO - __main__ - Step 130051: {'lr': 2.208371300040296e-05, 'samples': 24969792, 'steps': 130050, 'loss/train': 1.2597391605377197} 11/07/2021 15:31:49 - INFO - __main__ - Step 130052: {'lr': 2.2081532330679026e-05, 'samples': 24969984, 'steps': 130051, 'loss/train': 1.3688747882843018} 11/07/2021 15:31:49 - INFO - __main__ - Step 130053: {'lr': 2.207935176365164e-05, 'samples': 24970176, 'steps': 130052, 'loss/train': 1.2390879392623901} 11/07/2021 15:31:50 - INFO - __main__ - Step 130054: {'lr': 2.207717129932177e-05, 'samples': 24970368, 'steps': 130053, 'loss/train': 1.3055391311645508} 11/07/2021 15:31:51 - INFO - __main__ - Step 130055: {'lr': 2.2074990937690413e-05, 'samples': 24970560, 'steps': 130054, 'loss/train': 0.9732938408851624} 11/07/2021 15:31:51 - INFO - __main__ - Step 130056: {'lr': 2.2072810678758604e-05, 'samples': 24970752, 'steps': 130055, 'loss/train': 1.3799198865890503} 11/07/2021 15:31:52 - INFO - __main__ - Step 130057: {'lr': 2.2070630522527223e-05, 'samples': 24970944, 'steps': 130056, 'loss/train': 1.055692434310913} 11/07/2021 15:31:52 - INFO - __main__ - Step 130058: {'lr': 2.2068450468997302e-05, 'samples': 24971136, 'steps': 130057, 'loss/train': 0.9412559866905212} 11/07/2021 15:31:52 - INFO - __main__ - Step 130059: {'lr': 2.206627051816981e-05, 'samples': 24971328, 'steps': 130058, 'loss/train': 1.2797952890396118} 11/07/2021 15:31:54 - INFO - __main__ - Step 130060: {'lr': 2.206409067004575e-05, 'samples': 24971520, 'steps': 130059, 'loss/train': 1.04344642162323} 11/07/2021 15:31:54 - INFO - __main__ - Step 130061: {'lr': 2.206191092462609e-05, 'samples': 24971712, 'steps': 130060, 'loss/train': 1.2529302835464478} 11/07/2021 15:31:54 - INFO - __main__ - Step 130062: {'lr': 2.2059731281911826e-05, 'samples': 24971904, 'steps': 130061, 'loss/train': 0.3067210912704468} 11/07/2021 15:31:55 - INFO - __main__ - Step 130063: {'lr': 2.205755174190391e-05, 'samples': 24972096, 'steps': 130062, 'loss/train': 0.9894163608551025} 11/07/2021 15:31:55 - INFO - __main__ - Step 130064: {'lr': 2.205537230460336e-05, 'samples': 24972288, 'steps': 130063, 'loss/train': 0.4886073172092438} 11/07/2021 15:31:56 - INFO - __main__ - Step 130065: {'lr': 2.2053192970011126e-05, 'samples': 24972480, 'steps': 130064, 'loss/train': 1.349441647529602} 11/07/2021 15:31:56 - INFO - __main__ - Step 130066: {'lr': 2.2051013738128205e-05, 'samples': 24972672, 'steps': 130065, 'loss/train': 0.9818046689033508} 11/07/2021 15:31:57 - INFO - __main__ - Step 130067: {'lr': 2.20488346089556e-05, 'samples': 24972864, 'steps': 130066, 'loss/train': 1.0229136943817139} 11/07/2021 15:31:57 - INFO - __main__ - Step 130068: {'lr': 2.2046655582494245e-05, 'samples': 24973056, 'steps': 130067, 'loss/train': 1.3359726667404175} 11/07/2021 15:31:57 - INFO - __main__ - Step 130069: {'lr': 2.2044476658745177e-05, 'samples': 24973248, 'steps': 130068, 'loss/train': 1.2115222215652466} 11/07/2021 15:31:58 - INFO - __main__ - Step 130070: {'lr': 2.204229783770939e-05, 'samples': 24973440, 'steps': 130069, 'loss/train': 1.2221388816833496} 11/07/2021 15:31:59 - INFO - __main__ - Step 130071: {'lr': 2.2040119119387774e-05, 'samples': 24973632, 'steps': 130070, 'loss/train': 1.2086269855499268} 11/07/2021 15:31:59 - INFO - __main__ - Step 130072: {'lr': 2.2037940503781357e-05, 'samples': 24973824, 'steps': 130071, 'loss/train': 1.3881635665893555} 11/07/2021 15:31:59 - INFO - __main__ - Step 130073: {'lr': 2.2035761990891136e-05, 'samples': 24974016, 'steps': 130072, 'loss/train': 1.4479217529296875} 11/07/2021 15:32:00 - INFO - __main__ - Step 130074: {'lr': 2.203358358071808e-05, 'samples': 24974208, 'steps': 130073, 'loss/train': 1.634560227394104} 11/07/2021 15:32:01 - INFO - __main__ - Step 130075: {'lr': 2.2031405273263167e-05, 'samples': 24974400, 'steps': 130074, 'loss/train': 1.3240865468978882} 11/07/2021 15:32:01 - INFO - __main__ - Step 130076: {'lr': 2.202922706852739e-05, 'samples': 24974592, 'steps': 130075, 'loss/train': 1.2840973138809204} 11/07/2021 15:32:02 - INFO - __main__ - Step 130077: {'lr': 2.2027048966511724e-05, 'samples': 24974784, 'steps': 130076, 'loss/train': 1.4379189014434814} 11/07/2021 15:32:02 - INFO - __main__ - Step 130078: {'lr': 2.2024870967217142e-05, 'samples': 24974976, 'steps': 130077, 'loss/train': 1.963302731513977} 11/07/2021 15:32:02 - INFO - __main__ - Step 130079: {'lr': 2.2022693070644668e-05, 'samples': 24975168, 'steps': 130078, 'loss/train': 1.5113152265548706} 11/07/2021 15:32:03 - INFO - __main__ - Step 130080: {'lr': 2.2020515276795217e-05, 'samples': 24975360, 'steps': 130079, 'loss/train': 0.7845205664634705} 11/07/2021 15:32:04 - INFO - __main__ - Step 130081: {'lr': 2.201833758566982e-05, 'samples': 24975552, 'steps': 130080, 'loss/train': 1.224756121635437} 11/07/2021 15:32:04 - INFO - __main__ - Step 130082: {'lr': 2.2016159997269442e-05, 'samples': 24975744, 'steps': 130081, 'loss/train': 1.2809301614761353} 11/07/2021 15:32:04 - INFO - __main__ - Step 130083: {'lr': 2.2013982511595087e-05, 'samples': 24975936, 'steps': 130082, 'loss/train': 1.5366133451461792} 11/07/2021 15:32:05 - INFO - __main__ - Step 130084: {'lr': 2.2011805128647698e-05, 'samples': 24976128, 'steps': 130083, 'loss/train': 1.3644293546676636} 11/07/2021 15:32:06 - INFO - __main__ - Step 130085: {'lr': 2.200962784842825e-05, 'samples': 24976320, 'steps': 130084, 'loss/train': 0.8434613943099976} 11/07/2021 15:32:06 - INFO - __main__ - Step 130086: {'lr': 2.200745067093776e-05, 'samples': 24976512, 'steps': 130085, 'loss/train': 1.1578245162963867} 11/07/2021 15:32:07 - INFO - __main__ - Step 130087: {'lr': 2.200527359617721e-05, 'samples': 24976704, 'steps': 130086, 'loss/train': 1.3653095960617065} 11/07/2021 15:32:07 - INFO - __main__ - Step 130088: {'lr': 2.200309662414754e-05, 'samples': 24976896, 'steps': 130087, 'loss/train': 1.25606369972229} 11/07/2021 15:32:07 - INFO - __main__ - Step 130089: {'lr': 2.2000919754849745e-05, 'samples': 24977088, 'steps': 130088, 'loss/train': 0.04477894306182861} 11/07/2021 15:32:08 - INFO - __main__ - Step 130090: {'lr': 2.1998742988284858e-05, 'samples': 24977280, 'steps': 130089, 'loss/train': 1.4307705163955688} 11/07/2021 15:32:09 - INFO - __main__ - Step 130091: {'lr': 2.1996566324453794e-05, 'samples': 24977472, 'steps': 130090, 'loss/train': 1.1128149032592773} 11/07/2021 15:32:09 - INFO - __main__ - Step 130092: {'lr': 2.1994389763357548e-05, 'samples': 24977664, 'steps': 130091, 'loss/train': 1.272504448890686} 11/07/2021 15:32:09 - INFO - __main__ - Step 130093: {'lr': 2.199221330499712e-05, 'samples': 24977856, 'steps': 130092, 'loss/train': 0.7649295926094055} 11/07/2021 15:32:10 - INFO - __main__ - Step 130094: {'lr': 2.1990036949373487e-05, 'samples': 24978048, 'steps': 130093, 'loss/train': 1.0841840505599976} 11/07/2021 15:32:11 - INFO - __main__ - Step 130095: {'lr': 2.1987860696487644e-05, 'samples': 24978240, 'steps': 130094, 'loss/train': 1.7403205633163452} 11/07/2021 15:32:11 - INFO - __main__ - Step 130096: {'lr': 2.1985684546340535e-05, 'samples': 24978432, 'steps': 130095, 'loss/train': 1.0805166959762573} 11/07/2021 15:32:11 - INFO - __main__ - Step 130097: {'lr': 2.1983508498933186e-05, 'samples': 24978624, 'steps': 130096, 'loss/train': 1.4550467729568481} 11/07/2021 15:32:12 - INFO - __main__ - Step 130098: {'lr': 2.1981332554266543e-05, 'samples': 24978816, 'steps': 130097, 'loss/train': 1.0506582260131836} 11/07/2021 15:32:12 - INFO - __main__ - Step 130099: {'lr': 2.1979156712341547e-05, 'samples': 24979008, 'steps': 130098, 'loss/train': 1.4599709510803223} 11/07/2021 15:32:13 - INFO - __main__ - Step 130100: {'lr': 2.1976980973159255e-05, 'samples': 24979200, 'steps': 130099, 'loss/train': 1.5173314809799194} 11/07/2021 15:32:14 - INFO - __main__ - Step 130101: {'lr': 2.197480533672061e-05, 'samples': 24979392, 'steps': 130100, 'loss/train': 1.0887975692749023} 11/07/2021 15:32:14 - INFO - __main__ - Step 130102: {'lr': 2.197262980302661e-05, 'samples': 24979584, 'steps': 130101, 'loss/train': 0.7690575122833252} 11/07/2021 15:32:14 - INFO - __main__ - Step 130103: {'lr': 2.1970454372078202e-05, 'samples': 24979776, 'steps': 130102, 'loss/train': 1.075035572052002} 11/07/2021 15:32:15 - INFO - __main__ - Step 130104: {'lr': 2.196827904387641e-05, 'samples': 24979968, 'steps': 130103, 'loss/train': 1.0407021045684814} 11/07/2021 15:32:16 - INFO - __main__ - Step 130105: {'lr': 2.1966103818422178e-05, 'samples': 24980160, 'steps': 130104, 'loss/train': 0.9808293581008911} 11/07/2021 15:32:16 - INFO - __main__ - Step 130106: {'lr': 2.1963928695716506e-05, 'samples': 24980352, 'steps': 130105, 'loss/train': 0.9907920956611633} 11/07/2021 15:32:16 - INFO - __main__ - Step 130107: {'lr': 2.1961753675760366e-05, 'samples': 24980544, 'steps': 130106, 'loss/train': 1.5614467859268188} 11/07/2021 15:32:17 - INFO - __main__ - Step 130108: {'lr': 2.1959578758554754e-05, 'samples': 24980736, 'steps': 130107, 'loss/train': 1.2595866918563843} 11/07/2021 15:32:17 - INFO - __main__ - Step 130109: {'lr': 2.1957403944100618e-05, 'samples': 24980928, 'steps': 130108, 'loss/train': 1.076682209968567} 11/07/2021 15:32:17 - INFO - __main__ - Step 130110: {'lr': 2.195522923239901e-05, 'samples': 24981120, 'steps': 130109, 'loss/train': 1.4424299001693726} 11/07/2021 15:32:18 - INFO - __main__ - Step 130111: {'lr': 2.1953054623450817e-05, 'samples': 24981312, 'steps': 130110, 'loss/train': 1.210779070854187} 11/07/2021 15:32:19 - INFO - __main__ - Step 130112: {'lr': 2.1950880117257043e-05, 'samples': 24981504, 'steps': 130111, 'loss/train': 1.1898406744003296} 11/07/2021 15:32:19 - INFO - __main__ - Step 130113: {'lr': 2.1948705713818686e-05, 'samples': 24981696, 'steps': 130112, 'loss/train': 1.1042883396148682} 11/07/2021 15:32:19 - INFO - __main__ - Step 130114: {'lr': 2.1946531413136738e-05, 'samples': 24981888, 'steps': 130113, 'loss/train': 1.2838709354400635} 11/07/2021 15:32:20 - INFO - __main__ - Step 130115: {'lr': 2.194435721521215e-05, 'samples': 24982080, 'steps': 130114, 'loss/train': 1.5432487726211548} 11/07/2021 15:32:21 - INFO - __main__ - Step 130116: {'lr': 2.1942183120045922e-05, 'samples': 24982272, 'steps': 130115, 'loss/train': 1.3903864622116089} 11/07/2021 15:32:21 - INFO - __main__ - Step 130117: {'lr': 2.194000912763905e-05, 'samples': 24982464, 'steps': 130116, 'loss/train': 1.2730417251586914} 11/07/2021 15:32:22 - INFO - __main__ - Step 130118: {'lr': 2.1937835237992447e-05, 'samples': 24982656, 'steps': 130117, 'loss/train': 1.0347052812576294} 11/07/2021 15:32:22 - INFO - __main__ - Step 130119: {'lr': 2.1935661451107177e-05, 'samples': 24982848, 'steps': 130118, 'loss/train': 0.7326734066009521} 11/07/2021 15:32:22 - INFO - __main__ - Step 130120: {'lr': 2.1933487766984146e-05, 'samples': 24983040, 'steps': 130119, 'loss/train': 0.6899870038032532} 11/07/2021 15:32:23 - INFO - __main__ - Step 130121: {'lr': 2.1931314185624383e-05, 'samples': 24983232, 'steps': 130120, 'loss/train': 1.3318573236465454} 11/07/2021 15:32:24 - INFO - __main__ - Step 130122: {'lr': 2.192914070702884e-05, 'samples': 24983424, 'steps': 130121, 'loss/train': 1.0636909008026123} 11/07/2021 15:32:24 - INFO - __main__ - Step 130123: {'lr': 2.192696733119856e-05, 'samples': 24983616, 'steps': 130122, 'loss/train': 1.158202886581421} 11/07/2021 15:32:24 - INFO - __main__ - Step 130124: {'lr': 2.1924794058134413e-05, 'samples': 24983808, 'steps': 130123, 'loss/train': 1.1054131984710693} 11/07/2021 15:32:25 - INFO - __main__ - Step 130125: {'lr': 2.1922620887837445e-05, 'samples': 24984000, 'steps': 130124, 'loss/train': 1.175594687461853} 11/07/2021 15:32:26 - INFO - __main__ - Step 130126: {'lr': 2.1920447820308637e-05, 'samples': 24984192, 'steps': 130125, 'loss/train': 0.7473324537277222} 11/07/2021 15:32:26 - INFO - __main__ - Step 130127: {'lr': 2.1918274855548954e-05, 'samples': 24984384, 'steps': 130126, 'loss/train': 1.41968834400177} 11/07/2021 15:32:26 - INFO - __main__ - Step 130128: {'lr': 2.1916101993559338e-05, 'samples': 24984576, 'steps': 130127, 'loss/train': 1.4816815853118896} 11/07/2021 15:32:27 - INFO - __main__ - Step 130129: {'lr': 2.191392923434085e-05, 'samples': 24984768, 'steps': 130128, 'loss/train': 0.9427018761634827} 11/07/2021 15:32:27 - INFO - __main__ - Step 130130: {'lr': 2.1911756577894404e-05, 'samples': 24984960, 'steps': 130129, 'loss/train': 0.932868242263794} 11/07/2021 15:32:28 - INFO - __main__ - Step 130131: {'lr': 2.1909584024220996e-05, 'samples': 24985152, 'steps': 130130, 'loss/train': 1.2860922813415527} 11/07/2021 15:32:28 - INFO - __main__ - Step 130132: {'lr': 2.190741157332163e-05, 'samples': 24985344, 'steps': 130131, 'loss/train': 1.5307281017303467} 11/07/2021 15:32:29 - INFO - __main__ - Step 130133: {'lr': 2.190523922519727e-05, 'samples': 24985536, 'steps': 130132, 'loss/train': 1.0773345232009888} 11/07/2021 15:32:29 - INFO - __main__ - Step 130134: {'lr': 2.190306697984887e-05, 'samples': 24985728, 'steps': 130133, 'loss/train': 1.0391508340835571} 11/07/2021 15:32:30 - INFO - __main__ - Step 130135: {'lr': 2.1900894837277417e-05, 'samples': 24985920, 'steps': 130134, 'loss/train': 1.3158053159713745} 11/07/2021 15:32:31 - INFO - __main__ - Step 130136: {'lr': 2.1898722797483923e-05, 'samples': 24986112, 'steps': 130135, 'loss/train': 1.362470030784607} 11/07/2021 15:32:31 - INFO - __main__ - Step 130137: {'lr': 2.1896550860469376e-05, 'samples': 24986304, 'steps': 130136, 'loss/train': 1.4502627849578857} 11/07/2021 15:32:31 - INFO - __main__ - Step 130138: {'lr': 2.1894379026234702e-05, 'samples': 24986496, 'steps': 130137, 'loss/train': 1.1135860681533813} 11/07/2021 15:32:32 - INFO - __main__ - Step 130139: {'lr': 2.1892207294780892e-05, 'samples': 24986688, 'steps': 130138, 'loss/train': 1.3655266761779785} 11/07/2021 15:32:32 - INFO - __main__ - Step 130140: {'lr': 2.189003566610892e-05, 'samples': 24986880, 'steps': 130139, 'loss/train': 1.1330642700195312} 11/07/2021 15:32:33 - INFO - __main__ - Step 130141: {'lr': 2.1887864140219788e-05, 'samples': 24987072, 'steps': 130140, 'loss/train': 1.1157681941986084} 11/07/2021 15:32:34 - INFO - __main__ - Step 130142: {'lr': 2.1885692717114462e-05, 'samples': 24987264, 'steps': 130141, 'loss/train': 1.1821892261505127} 11/07/2021 15:32:34 - INFO - __main__ - Step 130143: {'lr': 2.188352139679392e-05, 'samples': 24987456, 'steps': 130142, 'loss/train': 1.2150236368179321} 11/07/2021 15:32:34 - INFO - __main__ - Step 130144: {'lr': 2.188135017925913e-05, 'samples': 24987648, 'steps': 130143, 'loss/train': 1.0246630907058716} 11/07/2021 15:32:35 - INFO - __main__ - Step 130145: {'lr': 2.1879179064511118e-05, 'samples': 24987840, 'steps': 130144, 'loss/train': 1.2702890634536743} 11/07/2021 15:32:36 - INFO - __main__ - Step 130146: {'lr': 2.18770080525508e-05, 'samples': 24988032, 'steps': 130145, 'loss/train': 1.216117024421692} 11/07/2021 15:32:36 - INFO - __main__ - Step 130147: {'lr': 2.187483714337918e-05, 'samples': 24988224, 'steps': 130146, 'loss/train': 2.0559403896331787} 11/07/2021 15:32:36 - INFO - __main__ - Step 130148: {'lr': 2.187266633699725e-05, 'samples': 24988416, 'steps': 130147, 'loss/train': 0.6764572858810425} 11/07/2021 15:32:37 - INFO - __main__ - Step 130149: {'lr': 2.1870495633405986e-05, 'samples': 24988608, 'steps': 130148, 'loss/train': 1.35325026512146} 11/07/2021 15:32:37 - INFO - __main__ - Step 130150: {'lr': 2.1868325032606385e-05, 'samples': 24988800, 'steps': 130149, 'loss/train': 1.440859317779541} 11/07/2021 15:32:38 - INFO - __main__ - Step 130151: {'lr': 2.1866154534599364e-05, 'samples': 24988992, 'steps': 130150, 'loss/train': 1.362333059310913} 11/07/2021 15:32:38 - INFO - __main__ - Step 130152: {'lr': 2.186398413938592e-05, 'samples': 24989184, 'steps': 130151, 'loss/train': 1.153342843055725} 11/07/2021 15:32:39 - INFO - __main__ - Step 130153: {'lr': 2.1861813846967062e-05, 'samples': 24989376, 'steps': 130152, 'loss/train': 1.0898215770721436} 11/07/2021 15:32:39 - INFO - __main__ - Step 130154: {'lr': 2.185964365734372e-05, 'samples': 24989568, 'steps': 130153, 'loss/train': 1.1132503747940063} 11/07/2021 15:32:39 - INFO - __main__ - Step 130155: {'lr': 2.185747357051693e-05, 'samples': 24989760, 'steps': 130154, 'loss/train': 1.3397043943405151} 11/07/2021 15:32:41 - INFO - __main__ - Step 130156: {'lr': 2.185530358648763e-05, 'samples': 24989952, 'steps': 130155, 'loss/train': 1.1448345184326172} 11/07/2021 15:32:41 - INFO - __main__ - Step 130157: {'lr': 2.1853133705256823e-05, 'samples': 24990144, 'steps': 130156, 'loss/train': 1.4099469184875488} 11/07/2021 15:32:41 - INFO - __main__ - Step 130158: {'lr': 2.185096392682545e-05, 'samples': 24990336, 'steps': 130157, 'loss/train': 1.5813446044921875} 11/07/2021 15:32:42 - INFO - __main__ - Step 130159: {'lr': 2.1848794251194543e-05, 'samples': 24990528, 'steps': 130158, 'loss/train': 1.8172115087509155} 11/07/2021 15:32:42 - INFO - __main__ - Step 130160: {'lr': 2.1846624678365012e-05, 'samples': 24990720, 'steps': 130159, 'loss/train': 1.0603690147399902} 11/07/2021 15:32:43 - INFO - __main__ - Step 130161: {'lr': 2.1844455208337888e-05, 'samples': 24990912, 'steps': 130160, 'loss/train': 1.4582711458206177} 11/07/2021 15:32:43 - INFO - __main__ - Step 130162: {'lr': 2.184228584111414e-05, 'samples': 24991104, 'steps': 130161, 'loss/train': 1.1088848114013672} 11/07/2021 15:32:44 - INFO - __main__ - Step 130163: {'lr': 2.184011657669474e-05, 'samples': 24991296, 'steps': 130162, 'loss/train': 0.8782729506492615} 11/07/2021 15:32:44 - INFO - __main__ - Step 130164: {'lr': 2.1837947415080688e-05, 'samples': 24991488, 'steps': 130163, 'loss/train': 1.1933941841125488} 11/07/2021 15:32:44 - INFO - __main__ - Step 130165: {'lr': 2.18357783562729e-05, 'samples': 24991680, 'steps': 130164, 'loss/train': 0.6899797320365906} 11/07/2021 15:32:46 - INFO - __main__ - Step 130166: {'lr': 2.1833609400272404e-05, 'samples': 24991872, 'steps': 130165, 'loss/train': 1.0923292636871338} 11/07/2021 15:32:46 - INFO - __main__ - Step 130167: {'lr': 2.1831440547080137e-05, 'samples': 24992064, 'steps': 130166, 'loss/train': 1.422503113746643} 11/07/2021 15:32:46 - INFO - __main__ - Step 130168: {'lr': 2.1829271796697108e-05, 'samples': 24992256, 'steps': 130167, 'loss/train': 1.081761360168457} 11/07/2021 15:32:47 - INFO - __main__ - Step 130169: {'lr': 2.1827103149124312e-05, 'samples': 24992448, 'steps': 130168, 'loss/train': 1.1852688789367676} 11/07/2021 15:32:47 - INFO - __main__ - Step 130170: {'lr': 2.1824934604362688e-05, 'samples': 24992640, 'steps': 130169, 'loss/train': 1.6296093463897705} 11/07/2021 15:32:47 - INFO - __main__ - Step 130171: {'lr': 2.1822766162413215e-05, 'samples': 24992832, 'steps': 130170, 'loss/train': 1.0255398750305176} 11/07/2021 15:32:48 - INFO - __main__ - Step 130172: {'lr': 2.182059782327689e-05, 'samples': 24993024, 'steps': 130171, 'loss/train': 1.3932167291641235} 11/07/2021 15:32:49 - INFO - __main__ - Step 130173: {'lr': 2.1818429586954707e-05, 'samples': 24993216, 'steps': 130172, 'loss/train': 0.0570182204246521} 11/07/2021 15:32:49 - INFO - __main__ - Step 130174: {'lr': 2.181626145344759e-05, 'samples': 24993408, 'steps': 130173, 'loss/train': 1.6518465280532837} 11/07/2021 15:32:50 - INFO - __main__ - Step 130175: {'lr': 2.181409342275656e-05, 'samples': 24993600, 'steps': 130174, 'loss/train': 1.464717149734497} 11/07/2021 15:32:50 - INFO - __main__ - Step 130176: {'lr': 2.1811925494882562e-05, 'samples': 24993792, 'steps': 130175, 'loss/train': 0.9845554232597351} 11/07/2021 15:32:51 - INFO - __main__ - Step 130177: {'lr': 2.1809757669826653e-05, 'samples': 24993984, 'steps': 130176, 'loss/train': 1.506103277206421} 11/07/2021 15:32:51 - INFO - __main__ - Step 130178: {'lr': 2.180758994758969e-05, 'samples': 24994176, 'steps': 130177, 'loss/train': 0.9103133678436279} 11/07/2021 15:32:52 - INFO - __main__ - Step 130179: {'lr': 2.1805422328172703e-05, 'samples': 24994368, 'steps': 130178, 'loss/train': 1.0023545026779175} 11/07/2021 15:32:52 - INFO - __main__ - Step 130180: {'lr': 2.1803254811576662e-05, 'samples': 24994560, 'steps': 130179, 'loss/train': 0.8412537574768066} 11/07/2021 15:32:52 - INFO - __main__ - Step 130181: {'lr': 2.1801087397802567e-05, 'samples': 24994752, 'steps': 130180, 'loss/train': 1.6081346273422241} 11/07/2021 15:32:54 - INFO - __main__ - Step 130182: {'lr': 2.1798920086851388e-05, 'samples': 24994944, 'steps': 130181, 'loss/train': 1.532060146331787} 11/07/2021 15:32:54 - INFO - __main__ - Step 130183: {'lr': 2.1796752878724068e-05, 'samples': 24995136, 'steps': 130182, 'loss/train': 1.4157018661499023} 11/07/2021 15:32:54 - INFO - __main__ - Step 130184: {'lr': 2.179458577342164e-05, 'samples': 24995328, 'steps': 130183, 'loss/train': 1.420865535736084} 11/07/2021 15:32:55 - INFO - __main__ - Step 130185: {'lr': 2.179241877094504e-05, 'samples': 24995520, 'steps': 130184, 'loss/train': 1.2167786359786987} 11/07/2021 15:32:55 - INFO - __main__ - Step 130186: {'lr': 2.179025187129524e-05, 'samples': 24995712, 'steps': 130185, 'loss/train': 0.9415243864059448} 11/07/2021 15:32:56 - INFO - __main__ - Step 130187: {'lr': 2.1788085074473245e-05, 'samples': 24995904, 'steps': 130186, 'loss/train': 1.5839473009109497} 11/07/2021 15:32:56 - INFO - __main__ - Step 130188: {'lr': 2.1785918380480024e-05, 'samples': 24996096, 'steps': 130187, 'loss/train': 1.2348030805587769} 11/07/2021 15:32:57 - INFO - __main__ - Step 130189: {'lr': 2.1783751789316548e-05, 'samples': 24996288, 'steps': 130188, 'loss/train': 1.192076325416565} 11/07/2021 15:32:57 - INFO - __main__ - Step 130190: {'lr': 2.178158530098376e-05, 'samples': 24996480, 'steps': 130189, 'loss/train': 1.1179726123809814} 11/07/2021 15:32:57 - INFO - __main__ - Step 130191: {'lr': 2.177941891548274e-05, 'samples': 24996672, 'steps': 130190, 'loss/train': 1.3811969757080078} 11/07/2021 15:32:58 - INFO - __main__ - Step 130192: {'lr': 2.1777252632814355e-05, 'samples': 24996864, 'steps': 130191, 'loss/train': 1.0361140966415405} 11/07/2021 15:32:59 - INFO - __main__ - Step 130193: {'lr': 2.17750864529796e-05, 'samples': 24997056, 'steps': 130192, 'loss/train': 1.0761712789535522} 11/07/2021 15:32:59 - INFO - __main__ - Step 130194: {'lr': 2.177292037597947e-05, 'samples': 24997248, 'steps': 130193, 'loss/train': 1.041387915611267} 11/07/2021 15:33:00 - INFO - __main__ - Step 130195: {'lr': 2.1770754401814947e-05, 'samples': 24997440, 'steps': 130194, 'loss/train': 1.70180082321167} 11/07/2021 15:33:00 - INFO - __main__ - Step 130196: {'lr': 2.1768588530486995e-05, 'samples': 24997632, 'steps': 130195, 'loss/train': 1.2548584938049316} 11/07/2021 15:33:01 - INFO - __main__ - Step 130197: {'lr': 2.1766422761996612e-05, 'samples': 24997824, 'steps': 130196, 'loss/train': 1.1077117919921875} 11/07/2021 15:33:01 - INFO - __main__ - Step 130198: {'lr': 2.1764257096344746e-05, 'samples': 24998016, 'steps': 130197, 'loss/train': 1.510060429573059} 11/07/2021 15:33:02 - INFO - __main__ - Step 130199: {'lr': 2.176209153353237e-05, 'samples': 24998208, 'steps': 130198, 'loss/train': 1.3447551727294922} 11/07/2021 15:33:02 - INFO - __main__ - Step 130200: {'lr': 2.1759926073560477e-05, 'samples': 24998400, 'steps': 130199, 'loss/train': 1.2952828407287598} 11/07/2021 15:33:02 - INFO - __main__ - Step 130201: {'lr': 2.175776071643007e-05, 'samples': 24998592, 'steps': 130200, 'loss/train': 1.9776350259780884} 11/07/2021 15:33:03 - INFO - __main__ - Step 130202: {'lr': 2.1755595462142062e-05, 'samples': 24998784, 'steps': 130201, 'loss/train': 1.7540192604064941} 11/07/2021 15:33:04 - INFO - __main__ - Step 130203: {'lr': 2.1753430310697458e-05, 'samples': 24998976, 'steps': 130202, 'loss/train': 1.1742759943008423} 11/07/2021 15:33:04 - INFO - __main__ - Step 130204: {'lr': 2.1751265262097307e-05, 'samples': 24999168, 'steps': 130203, 'loss/train': 0.037021785974502563} 11/07/2021 15:33:04 - INFO - __main__ - Step 130205: {'lr': 2.1749100316342447e-05, 'samples': 24999360, 'steps': 130204, 'loss/train': 0.8300470113754272} 11/07/2021 15:33:05 - INFO - __main__ - Step 130206: {'lr': 2.1746935473433927e-05, 'samples': 24999552, 'steps': 130205, 'loss/train': 1.1135647296905518} 11/07/2021 15:33:06 - INFO - __main__ - Step 130207: {'lr': 2.174477073337272e-05, 'samples': 24999744, 'steps': 130206, 'loss/train': 1.2511606216430664} 11/07/2021 15:33:06 - INFO - __main__ - Step 130208: {'lr': 2.17426060961598e-05, 'samples': 24999936, 'steps': 130207, 'loss/train': 1.0053203105926514} 11/07/2021 15:33:06 - INFO - __main__ - Step 130209: {'lr': 2.174044156179614e-05, 'samples': 25000128, 'steps': 130208, 'loss/train': 1.3640073537826538} 11/07/2021 15:33:07 - INFO - __main__ - Step 130210: {'lr': 2.1738277130282702e-05, 'samples': 25000320, 'steps': 130209, 'loss/train': 1.6068732738494873} 11/07/2021 15:33:07 - INFO - __main__ - Step 130211: {'lr': 2.1736112801620495e-05, 'samples': 25000512, 'steps': 130210, 'loss/train': 1.3469548225402832} 11/07/2021 15:33:08 - INFO - __main__ - Step 130212: {'lr': 2.173394857581046e-05, 'samples': 25000704, 'steps': 130211, 'loss/train': 1.186231255531311} 11/07/2021 15:33:09 - INFO - __main__ - Step 130213: {'lr': 2.1731784452853565e-05, 'samples': 25000896, 'steps': 130212, 'loss/train': 1.2886401414871216} 11/07/2021 15:33:09 - INFO - __main__ - Step 130214: {'lr': 2.172962043275084e-05, 'samples': 25001088, 'steps': 130213, 'loss/train': 1.5608508586883545} 11/07/2021 15:33:09 - INFO - __main__ - Step 130215: {'lr': 2.1727456515503203e-05, 'samples': 25001280, 'steps': 130214, 'loss/train': 1.195414662361145} 11/07/2021 15:33:10 - INFO - __main__ - Step 130216: {'lr': 2.172529270111165e-05, 'samples': 25001472, 'steps': 130215, 'loss/train': 1.3830527067184448} 11/07/2021 15:33:10 - INFO - __main__ - Step 130217: {'lr': 2.172312898957718e-05, 'samples': 25001664, 'steps': 130216, 'loss/train': 1.6844176054000854} 11/07/2021 15:33:11 - INFO - __main__ - Step 130218: {'lr': 2.1720965380900764e-05, 'samples': 25001856, 'steps': 130217, 'loss/train': 1.2337534427642822} 11/07/2021 15:33:11 - INFO - __main__ - Step 130219: {'lr': 2.1718801875083323e-05, 'samples': 25002048, 'steps': 130218, 'loss/train': 1.0322145223617554} 11/07/2021 15:33:12 - INFO - __main__ - Step 130220: {'lr': 2.1716638472125877e-05, 'samples': 25002240, 'steps': 130219, 'loss/train': 1.3091799020767212} 11/07/2021 15:33:12 - INFO - __main__ - Step 130221: {'lr': 2.1714475172029402e-05, 'samples': 25002432, 'steps': 130220, 'loss/train': 1.1463795900344849} 11/07/2021 15:33:12 - INFO - __main__ - Step 130222: {'lr': 2.1712311974794842e-05, 'samples': 25002624, 'steps': 130221, 'loss/train': 1.691238284111023} 11/07/2021 15:33:13 - INFO - __main__ - Step 130223: {'lr': 2.171014888042319e-05, 'samples': 25002816, 'steps': 130222, 'loss/train': 0.5932119488716125} 11/07/2021 15:33:14 - INFO - __main__ - Step 130224: {'lr': 2.170798588891543e-05, 'samples': 25003008, 'steps': 130223, 'loss/train': 0.975130021572113} 11/07/2021 15:33:14 - INFO - __main__ - Step 130225: {'lr': 2.170582300027252e-05, 'samples': 25003200, 'steps': 130224, 'loss/train': 0.8369867205619812} 11/07/2021 15:33:14 - INFO - __main__ - Step 130226: {'lr': 2.1703660214495435e-05, 'samples': 25003392, 'steps': 130225, 'loss/train': 1.5331108570098877} 11/07/2021 15:33:15 - INFO - __main__ - Step 130227: {'lr': 2.170149753158518e-05, 'samples': 25003584, 'steps': 130226, 'loss/train': 1.2813163995742798} 11/07/2021 15:33:16 - INFO - __main__ - Step 130228: {'lr': 2.169933495154269e-05, 'samples': 25003776, 'steps': 130227, 'loss/train': 1.1345559358596802} 11/07/2021 15:33:16 - INFO - __main__ - Step 130229: {'lr': 2.1697172474368977e-05, 'samples': 25003968, 'steps': 130228, 'loss/train': 1.593270182609558} 11/07/2021 15:33:17 - INFO - __main__ - Step 130230: {'lr': 2.1695010100065e-05, 'samples': 25004160, 'steps': 130229, 'loss/train': 0.6370673179626465} 11/07/2021 15:33:17 - INFO - __main__ - Step 130231: {'lr': 2.1692847828631734e-05, 'samples': 25004352, 'steps': 130230, 'loss/train': 1.0192985534667969} 11/07/2021 15:33:17 - INFO - __main__ - Step 130232: {'lr': 2.1690685660070124e-05, 'samples': 25004544, 'steps': 130231, 'loss/train': 1.0777051448822021} 11/07/2021 15:33:18 - INFO - __main__ - Step 130233: {'lr': 2.168852359438117e-05, 'samples': 25004736, 'steps': 130232, 'loss/train': 0.8117192387580872} 11/07/2021 15:33:19 - INFO - __main__ - Step 130234: {'lr': 2.168636163156587e-05, 'samples': 25004928, 'steps': 130233, 'loss/train': 1.2559781074523926} 11/07/2021 15:33:19 - INFO - __main__ - Step 130235: {'lr': 2.168419977162514e-05, 'samples': 25005120, 'steps': 130234, 'loss/train': 1.5541876554489136} 11/07/2021 15:33:19 - INFO - __main__ - Step 130236: {'lr': 2.168203801455998e-05, 'samples': 25005312, 'steps': 130235, 'loss/train': 1.6607182025909424} 11/07/2021 15:33:20 - INFO - __main__ - Step 130237: {'lr': 2.1679876360371387e-05, 'samples': 25005504, 'steps': 130236, 'loss/train': 1.2829835414886475} 11/07/2021 15:33:21 - INFO - __main__ - Step 130238: {'lr': 2.1677714809060334e-05, 'samples': 25005696, 'steps': 130237, 'loss/train': 1.329944372177124} 11/07/2021 15:33:21 - INFO - __main__ - Step 130239: {'lr': 2.1675553360627738e-05, 'samples': 25005888, 'steps': 130238, 'loss/train': 1.021140456199646} 11/07/2021 15:33:21 - INFO - __main__ - Step 130240: {'lr': 2.167339201507465e-05, 'samples': 25006080, 'steps': 130239, 'loss/train': 0.9694226980209351} 11/07/2021 15:33:22 - INFO - __main__ - Step 130241: {'lr': 2.167123077240199e-05, 'samples': 25006272, 'steps': 130240, 'loss/train': 1.4573204517364502} 11/07/2021 15:33:22 - INFO - __main__ - Step 130242: {'lr': 2.1669069632610755e-05, 'samples': 25006464, 'steps': 130241, 'loss/train': 1.3470439910888672} 11/07/2021 15:33:23 - INFO - __main__ - Step 130243: {'lr': 2.1666908595701917e-05, 'samples': 25006656, 'steps': 130242, 'loss/train': 1.6863751411437988} 11/07/2021 15:33:23 - INFO - __main__ - Step 130244: {'lr': 2.1664747661676475e-05, 'samples': 25006848, 'steps': 130243, 'loss/train': 0.936955451965332} 11/07/2021 15:33:24 - INFO - __main__ - Step 130245: {'lr': 2.1662586830535347e-05, 'samples': 25007040, 'steps': 130244, 'loss/train': 1.2555698156356812} 11/07/2021 15:33:24 - INFO - __main__ - Step 130246: {'lr': 2.1660426102279528e-05, 'samples': 25007232, 'steps': 130245, 'loss/train': 1.5835168361663818} 11/07/2021 15:33:25 - INFO - __main__ - Step 130247: {'lr': 2.1658265476910022e-05, 'samples': 25007424, 'steps': 130246, 'loss/train': 1.0177212953567505} 11/07/2021 15:33:25 - INFO - __main__ - Step 130248: {'lr': 2.165610495442774e-05, 'samples': 25007616, 'steps': 130247, 'loss/train': 1.4023303985595703} 11/07/2021 15:33:26 - INFO - __main__ - Step 130249: {'lr': 2.1653944534833713e-05, 'samples': 25007808, 'steps': 130248, 'loss/train': 1.737526297569275} 11/07/2021 15:33:26 - INFO - __main__ - Step 130250: {'lr': 2.1651784218128885e-05, 'samples': 25008000, 'steps': 130249, 'loss/train': 1.1362062692642212} 11/07/2021 15:33:27 - INFO - __main__ - Step 130251: {'lr': 2.164962400431425e-05, 'samples': 25008192, 'steps': 130250, 'loss/train': 1.2379498481750488} 11/07/2021 15:33:27 - INFO - __main__ - Step 130252: {'lr': 2.1647463893390784e-05, 'samples': 25008384, 'steps': 130251, 'loss/train': 1.214922308921814} 11/07/2021 15:33:27 - INFO - __main__ - Step 130253: {'lr': 2.164530388535943e-05, 'samples': 25008576, 'steps': 130252, 'loss/train': 0.9272392392158508} 11/07/2021 15:33:28 - INFO - __main__ - Step 130254: {'lr': 2.1643143980221157e-05, 'samples': 25008768, 'steps': 130253, 'loss/train': 0.8888728022575378} 11/07/2021 15:33:29 - INFO - __main__ - Step 130255: {'lr': 2.1640984177976995e-05, 'samples': 25008960, 'steps': 130254, 'loss/train': 1.0631407499313354} 11/07/2021 15:33:29 - INFO - __main__ - Step 130256: {'lr': 2.1638824478627862e-05, 'samples': 25009152, 'steps': 130255, 'loss/train': 1.3156318664550781} 11/07/2021 15:33:29 - INFO - __main__ - Step 130257: {'lr': 2.163666488217475e-05, 'samples': 25009344, 'steps': 130256, 'loss/train': 1.0730271339416504} 11/07/2021 15:33:30 - INFO - __main__ - Step 130258: {'lr': 2.163450538861869e-05, 'samples': 25009536, 'steps': 130257, 'loss/train': 1.3428658246994019} 11/07/2021 15:33:31 - INFO - __main__ - Step 130259: {'lr': 2.1632345997960547e-05, 'samples': 25009728, 'steps': 130258, 'loss/train': 0.5353785157203674} 11/07/2021 15:33:31 - INFO - __main__ - Step 130260: {'lr': 2.1630186710201337e-05, 'samples': 25009920, 'steps': 130259, 'loss/train': 1.3505369424819946} 11/07/2021 15:33:32 - INFO - __main__ - Step 130261: {'lr': 2.162802752534207e-05, 'samples': 25010112, 'steps': 130260, 'loss/train': 1.5478068590164185} 11/07/2021 15:33:32 - INFO - __main__ - Step 130262: {'lr': 2.1625868443383657e-05, 'samples': 25010304, 'steps': 130261, 'loss/train': 1.8898309469223022} 11/07/2021 15:33:32 - INFO - __main__ - Step 130263: {'lr': 2.1623709464327123e-05, 'samples': 25010496, 'steps': 130262, 'loss/train': 1.164844036102295} 11/07/2021 15:33:33 - INFO - __main__ - Step 130264: {'lr': 2.1621550588173417e-05, 'samples': 25010688, 'steps': 130263, 'loss/train': 0.7985774874687195} 11/07/2021 15:33:34 - INFO - __main__ - Step 130265: {'lr': 2.1619391814923504e-05, 'samples': 25010880, 'steps': 130264, 'loss/train': 1.2279839515686035} 11/07/2021 15:33:34 - INFO - __main__ - Step 130266: {'lr': 2.1617233144578364e-05, 'samples': 25011072, 'steps': 130265, 'loss/train': 1.1133228540420532} 11/07/2021 15:33:34 - INFO - __main__ - Step 130267: {'lr': 2.161507457713899e-05, 'samples': 25011264, 'steps': 130266, 'loss/train': 1.007421851158142} 11/07/2021 15:33:35 - INFO - __main__ - Step 130268: {'lr': 2.161291611260635e-05, 'samples': 25011456, 'steps': 130267, 'loss/train': 1.0160601139068604} 11/07/2021 15:33:36 - INFO - __main__ - Step 130269: {'lr': 2.1610757750981395e-05, 'samples': 25011648, 'steps': 130268, 'loss/train': 1.4048798084259033} 11/07/2021 15:33:36 - INFO - __main__ - Step 130270: {'lr': 2.1608599492265153e-05, 'samples': 25011840, 'steps': 130269, 'loss/train': 1.2088215351104736} 11/07/2021 15:33:36 - INFO - __main__ - Step 130271: {'lr': 2.1606441336458504e-05, 'samples': 25012032, 'steps': 130270, 'loss/train': 1.1578105688095093} 11/07/2021 15:33:37 - INFO - __main__ - Step 130272: {'lr': 2.1604283283562452e-05, 'samples': 25012224, 'steps': 130271, 'loss/train': 1.5388895273208618} 11/07/2021 15:33:37 - INFO - __main__ - Step 130273: {'lr': 2.1602125333578027e-05, 'samples': 25012416, 'steps': 130272, 'loss/train': 1.1326823234558105} 11/07/2021 15:33:38 - INFO - __main__ - Step 130274: {'lr': 2.159996748650614e-05, 'samples': 25012608, 'steps': 130273, 'loss/train': 1.3278104066848755} 11/07/2021 15:33:39 - INFO - __main__ - Step 130275: {'lr': 2.1597809742347762e-05, 'samples': 25012800, 'steps': 130274, 'loss/train': 1.5597361326217651} 11/07/2021 15:33:39 - INFO - __main__ - Step 130276: {'lr': 2.1595652101103895e-05, 'samples': 25012992, 'steps': 130275, 'loss/train': 1.265280842781067} 11/07/2021 15:33:39 - INFO - __main__ - Step 130277: {'lr': 2.1593494562775513e-05, 'samples': 25013184, 'steps': 130276, 'loss/train': 0.8246714472770691} 11/07/2021 15:33:40 - INFO - __main__ - Step 130278: {'lr': 2.159133712736358e-05, 'samples': 25013376, 'steps': 130277, 'loss/train': 1.1184951066970825} 11/07/2021 15:33:41 - INFO - __main__ - Step 130279: {'lr': 2.1589179794869073e-05, 'samples': 25013568, 'steps': 130278, 'loss/train': 1.1455411911010742} 11/07/2021 15:33:41 - INFO - __main__ - Step 130280: {'lr': 2.1587022565292935e-05, 'samples': 25013760, 'steps': 130279, 'loss/train': 0.5340062379837036} 11/07/2021 15:33:41 - INFO - __main__ - Step 130281: {'lr': 2.158486543863622e-05, 'samples': 25013952, 'steps': 130280, 'loss/train': 1.1978179216384888} 11/07/2021 15:33:42 - INFO - __main__ - Step 130282: {'lr': 2.1582708414899788e-05, 'samples': 25014144, 'steps': 130281, 'loss/train': 1.3316174745559692} 11/07/2021 15:33:42 - INFO - __main__ - Step 130283: {'lr': 2.1580551494084666e-05, 'samples': 25014336, 'steps': 130282, 'loss/train': 1.1828856468200684} 11/07/2021 15:33:43 - INFO - __main__ - Step 130284: {'lr': 2.1578394676191826e-05, 'samples': 25014528, 'steps': 130283, 'loss/train': 1.035933017730713} 11/07/2021 15:33:44 - INFO - __main__ - Step 130285: {'lr': 2.1576237961222238e-05, 'samples': 25014720, 'steps': 130284, 'loss/train': 1.3227219581604004} 11/07/2021 15:33:44 - INFO - __main__ - Step 130286: {'lr': 2.1574081349176876e-05, 'samples': 25014912, 'steps': 130285, 'loss/train': 1.6925448179244995} 11/07/2021 15:33:44 - INFO - __main__ - Step 130287: {'lr': 2.1571924840056684e-05, 'samples': 25015104, 'steps': 130286, 'loss/train': 0.9140137434005737} 11/07/2021 15:33:45 - INFO - __main__ - Step 130288: {'lr': 2.1569768433862687e-05, 'samples': 25015296, 'steps': 130287, 'loss/train': 1.4202067852020264} 11/07/2021 15:33:46 - INFO - __main__ - Step 130289: {'lr': 2.1567612130595797e-05, 'samples': 25015488, 'steps': 130288, 'loss/train': 1.2432929277420044} 11/07/2021 15:33:46 - INFO - __main__ - Step 130290: {'lr': 2.156545593025705e-05, 'samples': 25015680, 'steps': 130289, 'loss/train': 0.9165794849395752} 11/07/2021 15:33:46 - INFO - __main__ - Step 130291: {'lr': 2.1563299832847356e-05, 'samples': 25015872, 'steps': 130290, 'loss/train': 1.1302382946014404} 11/07/2021 15:33:47 - INFO - __main__ - Step 130292: {'lr': 2.156114383836777e-05, 'samples': 25016064, 'steps': 130291, 'loss/train': 1.5624345541000366} 11/07/2021 15:33:47 - INFO - __main__ - Step 130293: {'lr': 2.155898794681918e-05, 'samples': 25016256, 'steps': 130292, 'loss/train': 2.065662384033203} 11/07/2021 15:33:48 - INFO - __main__ - Step 130294: {'lr': 2.155683215820256e-05, 'samples': 25016448, 'steps': 130293, 'loss/train': 1.2802197933197021} 11/07/2021 15:33:48 - INFO - __main__ - Step 130295: {'lr': 2.155467647251891e-05, 'samples': 25016640, 'steps': 130294, 'loss/train': 0.9903094172477722} 11/07/2021 15:33:49 - INFO - __main__ - Step 130296: {'lr': 2.1552520889769194e-05, 'samples': 25016832, 'steps': 130295, 'loss/train': 1.2462255954742432} 11/07/2021 15:33:49 - INFO - __main__ - Step 130297: {'lr': 2.155036540995442e-05, 'samples': 25017024, 'steps': 130296, 'loss/train': 1.3579344749450684} 11/07/2021 15:33:49 - INFO - __main__ - Step 130298: {'lr': 2.15482100330755e-05, 'samples': 25017216, 'steps': 130297, 'loss/train': 0.7177237272262573} 11/07/2021 15:33:51 - INFO - __main__ - Step 130299: {'lr': 2.1546054759133432e-05, 'samples': 25017408, 'steps': 130298, 'loss/train': 0.05709152668714523} 11/07/2021 15:33:51 - INFO - __main__ - Step 130300: {'lr': 2.154389958812916e-05, 'samples': 25017600, 'steps': 130299, 'loss/train': 0.9882644414901733} 11/07/2021 15:33:51 - INFO - __main__ - Step 130301: {'lr': 2.1541744520063716e-05, 'samples': 25017792, 'steps': 130300, 'loss/train': 1.087104082107544} 11/07/2021 15:33:52 - INFO - __main__ - Step 130302: {'lr': 2.153958955493804e-05, 'samples': 25017984, 'steps': 130301, 'loss/train': 1.0462827682495117} 11/07/2021 15:33:52 - INFO - __main__ - Step 130303: {'lr': 2.1537434692753127e-05, 'samples': 25018176, 'steps': 130302, 'loss/train': 1.2854678630828857} 11/07/2021 15:33:53 - INFO - __main__ - Step 130304: {'lr': 2.153527993350987e-05, 'samples': 25018368, 'steps': 130303, 'loss/train': 1.5892456769943237} 11/07/2021 15:33:54 - INFO - __main__ - Step 130305: {'lr': 2.1533125277209327e-05, 'samples': 25018560, 'steps': 130304, 'loss/train': 1.2470741271972656} 11/07/2021 15:33:54 - INFO - __main__ - Step 130306: {'lr': 2.1530970723852404e-05, 'samples': 25018752, 'steps': 130305, 'loss/train': 0.0589136965572834} 11/07/2021 15:33:54 - INFO - __main__ - Step 130307: {'lr': 2.1528816273440084e-05, 'samples': 25018944, 'steps': 130306, 'loss/train': 1.055582880973816} 11/07/2021 15:33:55 - INFO - __main__ - Step 130308: {'lr': 2.1526661925973384e-05, 'samples': 25019136, 'steps': 130307, 'loss/train': 1.0272831916809082} 11/07/2021 15:33:56 - INFO - __main__ - Step 130309: {'lr': 2.1524507681453225e-05, 'samples': 25019328, 'steps': 130308, 'loss/train': 0.9262581467628479} 11/07/2021 15:33:56 - INFO - __main__ - Step 130310: {'lr': 2.1522353539880607e-05, 'samples': 25019520, 'steps': 130309, 'loss/train': 1.215758204460144} 11/07/2021 15:33:56 - INFO - __main__ - Step 130311: {'lr': 2.15201995012565e-05, 'samples': 25019712, 'steps': 130310, 'loss/train': 1.5672866106033325} 11/07/2021 15:33:57 - INFO - __main__ - Step 130312: {'lr': 2.1518045565581845e-05, 'samples': 25019904, 'steps': 130311, 'loss/train': 1.251894474029541} 11/07/2021 15:33:57 - INFO - __main__ - Step 130313: {'lr': 2.1515891732857646e-05, 'samples': 25020096, 'steps': 130312, 'loss/train': 1.1477893590927124} 11/07/2021 15:33:58 - INFO - __main__ - Step 130314: {'lr': 2.15137380030849e-05, 'samples': 25020288, 'steps': 130313, 'loss/train': 0.056232623755931854} 11/07/2021 15:33:59 - INFO - __main__ - Step 130315: {'lr': 2.151158437626452e-05, 'samples': 25020480, 'steps': 130314, 'loss/train': 1.6484678983688354} 11/07/2021 15:33:59 - INFO - __main__ - Step 130316: {'lr': 2.1509430852397454e-05, 'samples': 25020672, 'steps': 130315, 'loss/train': 1.4733507633209229} 11/07/2021 15:33:59 - INFO - __main__ - Step 130317: {'lr': 2.150727743148473e-05, 'samples': 25020864, 'steps': 130316, 'loss/train': 1.0541096925735474} 11/07/2021 15:34:00 - INFO - __main__ - Step 130318: {'lr': 2.1505124113527315e-05, 'samples': 25021056, 'steps': 130317, 'loss/train': 1.4307687282562256} 11/07/2021 15:34:00 - INFO - __main__ - Step 130319: {'lr': 2.1502970898526125e-05, 'samples': 25021248, 'steps': 130318, 'loss/train': 1.50302255153656} 11/07/2021 15:34:01 - INFO - __main__ - Step 130320: {'lr': 2.150081778648222e-05, 'samples': 25021440, 'steps': 130319, 'loss/train': 0.09231890738010406} 11/07/2021 15:34:02 - INFO - __main__ - Step 130321: {'lr': 2.149866477739648e-05, 'samples': 25021632, 'steps': 130320, 'loss/train': 0.4239044189453125} 11/07/2021 15:34:02 - INFO - __main__ - Step 130322: {'lr': 2.149651187126994e-05, 'samples': 25021824, 'steps': 130321, 'loss/train': 1.038953185081482} 11/07/2021 15:34:02 - INFO - __main__ - Step 130323: {'lr': 2.1494359068103543e-05, 'samples': 25022016, 'steps': 130322, 'loss/train': 1.2421238422393799} 11/07/2021 15:34:03 - INFO - __main__ - Step 130324: {'lr': 2.1492206367898254e-05, 'samples': 25022208, 'steps': 130323, 'loss/train': 1.049257755279541} 11/07/2021 15:34:04 - INFO - __main__ - Step 130325: {'lr': 2.1490053770655076e-05, 'samples': 25022400, 'steps': 130324, 'loss/train': 0.990128755569458} 11/07/2021 15:34:04 - INFO - __main__ - Step 130326: {'lr': 2.1487901276374956e-05, 'samples': 25022592, 'steps': 130325, 'loss/train': 1.2658417224884033} 11/07/2021 15:34:04 - INFO - __main__ - Step 130327: {'lr': 2.1485748885058833e-05, 'samples': 25022784, 'steps': 130326, 'loss/train': 0.7720387578010559} 11/07/2021 15:34:05 - INFO - __main__ - Step 130328: {'lr': 2.1483596596707706e-05, 'samples': 25022976, 'steps': 130327, 'loss/train': 1.1393184661865234} 11/07/2021 15:34:05 - INFO - __main__ - Step 130329: {'lr': 2.1481444411322548e-05, 'samples': 25023168, 'steps': 130328, 'loss/train': 0.6490192413330078} 11/07/2021 15:34:06 - INFO - __main__ - Step 130330: {'lr': 2.1479292328904304e-05, 'samples': 25023360, 'steps': 130329, 'loss/train': 1.1745061874389648} 11/07/2021 15:34:07 - INFO - __main__ - Step 130331: {'lr': 2.1477140349454e-05, 'samples': 25023552, 'steps': 130330, 'loss/train': 1.5824496746063232} 11/07/2021 15:34:07 - INFO - __main__ - Step 130332: {'lr': 2.147498847297255e-05, 'samples': 25023744, 'steps': 130331, 'loss/train': 1.3998831510543823} 11/07/2021 15:34:07 - INFO - __main__ - Step 130333: {'lr': 2.1472836699460957e-05, 'samples': 25023936, 'steps': 130332, 'loss/train': 0.965201735496521} 11/07/2021 15:34:08 - INFO - __main__ - Step 130334: {'lr': 2.147068502892016e-05, 'samples': 25024128, 'steps': 130333, 'loss/train': 1.05355703830719} 11/07/2021 15:34:09 - INFO - __main__ - Step 130335: {'lr': 2.1468533461351165e-05, 'samples': 25024320, 'steps': 130334, 'loss/train': 1.5088038444519043} 11/07/2021 15:34:09 - INFO - __main__ - Step 130336: {'lr': 2.1466381996754907e-05, 'samples': 25024512, 'steps': 130335, 'loss/train': 1.0448193550109863} 11/07/2021 15:34:09 - INFO - __main__ - Step 130337: {'lr': 2.1464230635132364e-05, 'samples': 25024704, 'steps': 130336, 'loss/train': 0.6863916516304016} 11/07/2021 15:34:10 - INFO - __main__ - Step 130338: {'lr': 2.1462079376484534e-05, 'samples': 25024896, 'steps': 130337, 'loss/train': 1.1362546682357788} 11/07/2021 15:34:10 - INFO - __main__ - Step 130339: {'lr': 2.1459928220812386e-05, 'samples': 25025088, 'steps': 130338, 'loss/train': 1.3412209749221802} 11/07/2021 15:34:11 - INFO - __main__ - Step 130340: {'lr': 2.1457777168116836e-05, 'samples': 25025280, 'steps': 130339, 'loss/train': 1.4711006879806519} 11/07/2021 15:34:12 - INFO - __main__ - Step 130341: {'lr': 2.1455626218398887e-05, 'samples': 25025472, 'steps': 130340, 'loss/train': 1.2074426412582397} 11/07/2021 15:34:12 - INFO - __main__ - Step 130342: {'lr': 2.145347537165951e-05, 'samples': 25025664, 'steps': 130341, 'loss/train': 1.1698131561279297} 11/07/2021 15:34:12 - INFO - __main__ - Step 130343: {'lr': 2.145132462789967e-05, 'samples': 25025856, 'steps': 130342, 'loss/train': 1.1567065715789795} 11/07/2021 15:34:13 - INFO - __main__ - Step 130344: {'lr': 2.144917398712032e-05, 'samples': 25026048, 'steps': 130343, 'loss/train': 1.1102993488311768} 11/07/2021 15:34:14 - INFO - __main__ - Step 130345: {'lr': 2.144702344932245e-05, 'samples': 25026240, 'steps': 130344, 'loss/train': 0.8091270327568054} 11/07/2021 15:34:14 - INFO - __main__ - Step 130346: {'lr': 2.1444873014507036e-05, 'samples': 25026432, 'steps': 130345, 'loss/train': 1.4514063596725464} 11/07/2021 15:34:14 - INFO - __main__ - Step 130347: {'lr': 2.1442722682675024e-05, 'samples': 25026624, 'steps': 130346, 'loss/train': 1.395912528038025} 11/07/2021 15:34:15 - INFO - __main__ - Step 130348: {'lr': 2.144057245382741e-05, 'samples': 25026816, 'steps': 130347, 'loss/train': 1.374649167060852} 11/07/2021 15:34:15 - INFO - __main__ - Step 130349: {'lr': 2.143842232796514e-05, 'samples': 25027008, 'steps': 130348, 'loss/train': 0.9935281872749329} 11/07/2021 15:34:16 - INFO - __main__ - Step 130350: {'lr': 2.1436272305089182e-05, 'samples': 25027200, 'steps': 130349, 'loss/train': 1.2754285335540771} 11/07/2021 15:34:16 - INFO - __main__ - Step 130351: {'lr': 2.1434122385200537e-05, 'samples': 25027392, 'steps': 130350, 'loss/train': 1.003481388092041} 11/07/2021 15:34:17 - INFO - __main__ - Step 130352: {'lr': 2.143197256830015e-05, 'samples': 25027584, 'steps': 130351, 'loss/train': 1.2347242832183838} 11/07/2021 15:34:17 - INFO - __main__ - Step 130353: {'lr': 2.142982285438899e-05, 'samples': 25027776, 'steps': 130352, 'loss/train': 1.2813531160354614} 11/07/2021 15:34:17 - INFO - __main__ - Step 130354: {'lr': 2.1427673243468004e-05, 'samples': 25027968, 'steps': 130353, 'loss/train': 1.2412359714508057} 11/07/2021 15:34:18 - INFO - __main__ - Step 130355: {'lr': 2.142552373553819e-05, 'samples': 25028160, 'steps': 130354, 'loss/train': 1.156895637512207} 11/07/2021 15:34:19 - INFO - __main__ - Step 130356: {'lr': 2.1423374330600487e-05, 'samples': 25028352, 'steps': 130355, 'loss/train': 1.356170654296875} 11/07/2021 15:34:19 - INFO - __main__ - Step 130357: {'lr': 2.14212250286559e-05, 'samples': 25028544, 'steps': 130356, 'loss/train': 1.139025330543518} 11/07/2021 15:34:20 - INFO - __main__ - Step 130358: {'lr': 2.141907582970537e-05, 'samples': 25028736, 'steps': 130357, 'loss/train': 0.8767833709716797} 11/07/2021 15:34:20 - INFO - __main__ - Step 130359: {'lr': 2.1416926733749896e-05, 'samples': 25028928, 'steps': 130358, 'loss/train': 1.1954213380813599} 11/07/2021 15:34:20 - INFO - __main__ - Step 130360: {'lr': 2.1414777740790425e-05, 'samples': 25029120, 'steps': 130359, 'loss/train': 1.2206870317459106} 11/07/2021 15:34:21 - INFO - __main__ - Step 130361: {'lr': 2.1412628850827898e-05, 'samples': 25029312, 'steps': 130360, 'loss/train': 2.964803695678711} 11/07/2021 15:34:22 - INFO - __main__ - Step 130362: {'lr': 2.141048006386334e-05, 'samples': 25029504, 'steps': 130361, 'loss/train': 1.2633532285690308} 11/07/2021 15:34:22 - INFO - __main__ - Step 130363: {'lr': 2.14083313798977e-05, 'samples': 25029696, 'steps': 130362, 'loss/train': 1.1431214809417725} 11/07/2021 15:34:22 - INFO - __main__ - Step 130364: {'lr': 2.1406182798931917e-05, 'samples': 25029888, 'steps': 130363, 'loss/train': 1.639408826828003} 11/07/2021 15:34:23 - INFO - __main__ - Step 130365: {'lr': 2.140403432096705e-05, 'samples': 25030080, 'steps': 130364, 'loss/train': 0.6337420344352722} 11/07/2021 15:34:24 - INFO - __main__ - Step 130366: {'lr': 2.1401885946003924e-05, 'samples': 25030272, 'steps': 130365, 'loss/train': 0.5718123912811279} 11/07/2021 15:34:24 - INFO - __main__ - Step 130367: {'lr': 2.13997376740436e-05, 'samples': 25030464, 'steps': 130366, 'loss/train': 1.1377934217453003} 11/07/2021 15:34:24 - INFO - __main__ - Step 130368: {'lr': 2.1397589505087024e-05, 'samples': 25030656, 'steps': 130367, 'loss/train': 1.1009399890899658} 11/07/2021 15:34:25 - INFO - __main__ - Step 130369: {'lr': 2.139544143913516e-05, 'samples': 25030848, 'steps': 130368, 'loss/train': 1.020348072052002} 11/07/2021 15:34:25 - INFO - __main__ - Step 130370: {'lr': 2.1393293476188985e-05, 'samples': 25031040, 'steps': 130369, 'loss/train': 1.3158364295959473} 11/07/2021 15:34:26 - INFO - __main__ - Step 130371: {'lr': 2.139114561624947e-05, 'samples': 25031232, 'steps': 130370, 'loss/train': 1.2218058109283447} 11/07/2021 15:34:26 - INFO - __main__ - Step 130372: {'lr': 2.138899785931758e-05, 'samples': 25031424, 'steps': 130371, 'loss/train': 0.8788940906524658} 11/07/2021 15:34:27 - INFO - __main__ - Step 130373: {'lr': 2.138685020539427e-05, 'samples': 25031616, 'steps': 130372, 'loss/train': 1.2029929161071777} 11/07/2021 15:34:27 - INFO - __main__ - Step 130374: {'lr': 2.1384702654480502e-05, 'samples': 25031808, 'steps': 130373, 'loss/train': 0.9889582395553589} 11/07/2021 15:34:27 - INFO - __main__ - Step 130375: {'lr': 2.138255520657728e-05, 'samples': 25032000, 'steps': 130374, 'loss/train': 2.128678560256958} 11/07/2021 15:34:29 - INFO - __main__ - Step 130376: {'lr': 2.1380407861685545e-05, 'samples': 25032192, 'steps': 130375, 'loss/train': 1.5459191799163818} 11/07/2021 15:34:29 - INFO - __main__ - Step 130377: {'lr': 2.13782606198063e-05, 'samples': 25032384, 'steps': 130376, 'loss/train': 0.9953834414482117} 11/07/2021 15:34:29 - INFO - __main__ - Step 130378: {'lr': 2.1376113480940458e-05, 'samples': 25032576, 'steps': 130377, 'loss/train': 1.387715458869934} 11/07/2021 15:34:30 - INFO - __main__ - Step 130379: {'lr': 2.1373966445089043e-05, 'samples': 25032768, 'steps': 130378, 'loss/train': 1.420861840248108} 11/07/2021 15:34:30 - INFO - __main__ - Step 130380: {'lr': 2.137181951225295e-05, 'samples': 25032960, 'steps': 130379, 'loss/train': 1.18254816532135} 11/07/2021 15:34:31 - INFO - __main__ - Step 130381: {'lr': 2.13696726824332e-05, 'samples': 25033152, 'steps': 130380, 'loss/train': 1.393890380859375} 11/07/2021 15:34:31 - INFO - __main__ - Step 130382: {'lr': 2.136752595563074e-05, 'samples': 25033344, 'steps': 130381, 'loss/train': 1.2131993770599365} 11/07/2021 15:34:32 - INFO - __main__ - Step 130383: {'lr': 2.136537933184654e-05, 'samples': 25033536, 'steps': 130382, 'loss/train': 0.4432964026927948} 11/07/2021 15:34:32 - INFO - __main__ - Step 130384: {'lr': 2.13632328110816e-05, 'samples': 25033728, 'steps': 130383, 'loss/train': 1.2409346103668213} 11/07/2021 15:34:32 - INFO - __main__ - Step 130385: {'lr': 2.136108639333684e-05, 'samples': 25033920, 'steps': 130384, 'loss/train': 0.9746070504188538} 11/07/2021 15:34:33 - INFO - __main__ - Step 130386: {'lr': 2.135894007861325e-05, 'samples': 25034112, 'steps': 130385, 'loss/train': 1.846117377281189} 11/07/2021 15:34:34 - INFO - __main__ - Step 130387: {'lr': 2.1356793866911777e-05, 'samples': 25034304, 'steps': 130386, 'loss/train': 0.9574539065361023} 11/07/2021 15:34:34 - INFO - __main__ - Step 130388: {'lr': 2.1354647758233424e-05, 'samples': 25034496, 'steps': 130387, 'loss/train': 0.9741092920303345} 11/07/2021 15:34:34 - INFO - __main__ - Step 130389: {'lr': 2.1352501752579106e-05, 'samples': 25034688, 'steps': 130388, 'loss/train': 0.9667531847953796} 11/07/2021 15:34:35 - INFO - __main__ - Step 130390: {'lr': 2.135035584994985e-05, 'samples': 25034880, 'steps': 130389, 'loss/train': 1.4725251197814941} 11/07/2021 15:34:36 - INFO - __main__ - Step 130391: {'lr': 2.1348210050346596e-05, 'samples': 25035072, 'steps': 130390, 'loss/train': 1.275933027267456} 11/07/2021 15:34:36 - INFO - __main__ - Step 130392: {'lr': 2.134606435377037e-05, 'samples': 25035264, 'steps': 130391, 'loss/train': 1.127440333366394} 11/07/2021 15:34:37 - INFO - __main__ - Step 130393: {'lr': 2.1343918760222014e-05, 'samples': 25035456, 'steps': 130392, 'loss/train': 1.102241039276123} 11/07/2021 15:34:37 - INFO - __main__ - Step 130394: {'lr': 2.1341773269702547e-05, 'samples': 25035648, 'steps': 130393, 'loss/train': 1.193291425704956} 11/07/2021 15:34:37 - INFO - __main__ - Step 130395: {'lr': 2.133962788221297e-05, 'samples': 25035840, 'steps': 130394, 'loss/train': 1.3978357315063477} 11/07/2021 15:34:38 - INFO - __main__ - Step 130396: {'lr': 2.1337482597754225e-05, 'samples': 25036032, 'steps': 130395, 'loss/train': 1.2746378183364868} 11/07/2021 15:34:39 - INFO - __main__ - Step 130397: {'lr': 2.133533741632726e-05, 'samples': 25036224, 'steps': 130396, 'loss/train': 1.2809903621673584} 11/07/2021 15:34:39 - INFO - __main__ - Step 130398: {'lr': 2.13331923379331e-05, 'samples': 25036416, 'steps': 130397, 'loss/train': 1.7171562910079956} 11/07/2021 15:34:39 - INFO - __main__ - Step 130399: {'lr': 2.1331047362572658e-05, 'samples': 25036608, 'steps': 130398, 'loss/train': 0.7720159292221069} 11/07/2021 15:34:40 - INFO - __main__ - Step 130400: {'lr': 2.132890249024691e-05, 'samples': 25036800, 'steps': 130399, 'loss/train': 0.8436639308929443} 11/07/2021 15:34:41 - INFO - __main__ - Step 130401: {'lr': 2.1326757720956826e-05, 'samples': 25036992, 'steps': 130400, 'loss/train': 1.2047293186187744} 11/07/2021 15:34:41 - INFO - __main__ - Step 130402: {'lr': 2.13246130547034e-05, 'samples': 25037184, 'steps': 130401, 'loss/train': 1.5312615633010864} 11/07/2021 15:34:41 - INFO - __main__ - Step 130403: {'lr': 2.1322468491487558e-05, 'samples': 25037376, 'steps': 130402, 'loss/train': 1.5503580570220947} 11/07/2021 15:34:42 - INFO - __main__ - Step 130404: {'lr': 2.132032403131029e-05, 'samples': 25037568, 'steps': 130403, 'loss/train': 0.5854679346084595} 11/07/2021 15:34:42 - INFO - __main__ - Step 130405: {'lr': 2.1318179674172545e-05, 'samples': 25037760, 'steps': 130404, 'loss/train': 0.8802760243415833} 11/07/2021 15:34:43 - INFO - __main__ - Step 130406: {'lr': 2.1316035420075348e-05, 'samples': 25037952, 'steps': 130405, 'loss/train': 1.5954891443252563} 11/07/2021 15:34:43 - INFO - __main__ - Step 130407: {'lr': 2.1313891269019587e-05, 'samples': 25038144, 'steps': 130406, 'loss/train': 1.0338337421417236} 11/07/2021 15:34:44 - INFO - __main__ - Step 130408: {'lr': 2.1311747221006234e-05, 'samples': 25038336, 'steps': 130407, 'loss/train': 1.0882887840270996} 11/07/2021 15:34:44 - INFO - __main__ - Step 130409: {'lr': 2.130960327603629e-05, 'samples': 25038528, 'steps': 130408, 'loss/train': 1.15382981300354} 11/07/2021 15:34:45 - INFO - __main__ - Step 130410: {'lr': 2.130745943411072e-05, 'samples': 25038720, 'steps': 130409, 'loss/train': 1.8807905912399292} 11/07/2021 15:34:46 - INFO - __main__ - Step 130411: {'lr': 2.1305315695230476e-05, 'samples': 25038912, 'steps': 130410, 'loss/train': 1.1732549667358398} 11/07/2021 15:34:47 - INFO - __main__ - Step 130412: {'lr': 2.1303172059396498e-05, 'samples': 25039104, 'steps': 130411, 'loss/train': 1.658022403717041} 11/07/2021 15:34:47 - INFO - __main__ - Step 130413: {'lr': 2.130102852660981e-05, 'samples': 25039296, 'steps': 130412, 'loss/train': 1.297275424003601} 11/07/2021 15:34:47 - INFO - __main__ - Step 130414: {'lr': 2.1298885096871363e-05, 'samples': 25039488, 'steps': 130413, 'loss/train': 1.2404907941818237} 11/07/2021 15:34:48 - INFO - __main__ - Step 130415: {'lr': 2.1296741770182066e-05, 'samples': 25039680, 'steps': 130414, 'loss/train': 1.1858965158462524} 11/07/2021 15:34:48 - INFO - __main__ - Step 130416: {'lr': 2.1294598546542977e-05, 'samples': 25039872, 'steps': 130415, 'loss/train': 1.2222386598587036} 11/07/2021 15:34:48 - INFO - __main__ - Step 130417: {'lr': 2.1292455425954983e-05, 'samples': 25040064, 'steps': 130416, 'loss/train': 1.727228045463562} 11/07/2021 15:34:49 - INFO - __main__ - Step 130418: {'lr': 2.129031240841908e-05, 'samples': 25040256, 'steps': 130417, 'loss/train': 1.8345524072647095} 11/07/2021 15:34:50 - INFO - __main__ - Step 130419: {'lr': 2.1288169493936278e-05, 'samples': 25040448, 'steps': 130418, 'loss/train': 2.2345221042633057} 11/07/2021 15:34:50 - INFO - __main__ - Step 130420: {'lr': 2.1286026682507453e-05, 'samples': 25040640, 'steps': 130419, 'loss/train': 1.9006502628326416} 11/07/2021 15:34:50 - INFO - __main__ - Step 130421: {'lr': 2.1283883974133637e-05, 'samples': 25040832, 'steps': 130420, 'loss/train': 1.2979494333267212} 11/07/2021 15:34:51 - INFO - __main__ - Step 130422: {'lr': 2.128174136881575e-05, 'samples': 25041024, 'steps': 130421, 'loss/train': 1.410201072692871} 11/07/2021 15:34:52 - INFO - __main__ - Step 130423: {'lr': 2.1279598866554783e-05, 'samples': 25041216, 'steps': 130422, 'loss/train': 0.052822478115558624} 11/07/2021 15:34:52 - INFO - __main__ - Step 130424: {'lr': 2.1277456467351713e-05, 'samples': 25041408, 'steps': 130423, 'loss/train': 1.6327399015426636} 11/07/2021 15:34:53 - INFO - __main__ - Step 130425: {'lr': 2.1275314171207484e-05, 'samples': 25041600, 'steps': 130424, 'loss/train': 1.2663869857788086} 11/07/2021 15:34:53 - INFO - __main__ - Step 130426: {'lr': 2.127317197812306e-05, 'samples': 25041792, 'steps': 130425, 'loss/train': 1.1855462789535522} 11/07/2021 15:34:53 - INFO - __main__ - Step 130427: {'lr': 2.1271029888099425e-05, 'samples': 25041984, 'steps': 130426, 'loss/train': 1.3394794464111328} 11/07/2021 15:34:54 - INFO - __main__ - Step 130428: {'lr': 2.126888790113754e-05, 'samples': 25042176, 'steps': 130427, 'loss/train': 1.4984592199325562} 11/07/2021 15:34:55 - INFO - __main__ - Step 130429: {'lr': 2.1266746017238354e-05, 'samples': 25042368, 'steps': 130428, 'loss/train': 1.1601699590682983} 11/07/2021 15:34:55 - INFO - __main__ - Step 130430: {'lr': 2.1264604236402834e-05, 'samples': 25042560, 'steps': 130429, 'loss/train': 1.1715199947357178} 11/07/2021 15:34:55 - INFO - __main__ - Step 130431: {'lr': 2.1262462558631955e-05, 'samples': 25042752, 'steps': 130430, 'loss/train': 1.762955904006958} 11/07/2021 15:34:56 - INFO - __main__ - Step 130432: {'lr': 2.1260320983926718e-05, 'samples': 25042944, 'steps': 130431, 'loss/train': 0.7305252552032471} 11/07/2021 15:34:56 - INFO - __main__ - Step 130433: {'lr': 2.1258179512288063e-05, 'samples': 25043136, 'steps': 130432, 'loss/train': 0.999819815158844} 11/07/2021 15:34:57 - INFO - __main__ - Step 130434: {'lr': 2.1256038143716905e-05, 'samples': 25043328, 'steps': 130433, 'loss/train': 1.231407880783081} 11/07/2021 15:34:58 - INFO - __main__ - Step 130435: {'lr': 2.1253896878214245e-05, 'samples': 25043520, 'steps': 130434, 'loss/train': 1.2357187271118164} 11/07/2021 15:34:58 - INFO - __main__ - Step 130436: {'lr': 2.125175571578103e-05, 'samples': 25043712, 'steps': 130435, 'loss/train': 1.175541639328003} 11/07/2021 15:34:58 - INFO - __main__ - Step 130437: {'lr': 2.124961465641828e-05, 'samples': 25043904, 'steps': 130436, 'loss/train': 0.5098694562911987} 11/07/2021 15:34:59 - INFO - __main__ - Step 130438: {'lr': 2.124747370012689e-05, 'samples': 25044096, 'steps': 130437, 'loss/train': 1.531447172164917} 11/07/2021 15:35:00 - INFO - __main__ - Step 130439: {'lr': 2.1245332846907883e-05, 'samples': 25044288, 'steps': 130438, 'loss/train': 1.2740561962127686} 11/07/2021 15:35:00 - INFO - __main__ - Step 130440: {'lr': 2.1243192096762203e-05, 'samples': 25044480, 'steps': 130439, 'loss/train': 1.2834091186523438} 11/07/2021 15:35:00 - INFO - __main__ - Step 130441: {'lr': 2.1241051449690796e-05, 'samples': 25044672, 'steps': 130440, 'loss/train': 1.2350255250930786} 11/07/2021 15:35:01 - INFO - __main__ - Step 130442: {'lr': 2.123891090569463e-05, 'samples': 25044864, 'steps': 130441, 'loss/train': 1.4367634057998657} 11/07/2021 15:35:01 - INFO - __main__ - Step 130443: {'lr': 2.1236770464774707e-05, 'samples': 25045056, 'steps': 130442, 'loss/train': 1.2389912605285645} 11/07/2021 15:35:02 - INFO - __main__ - Step 130444: {'lr': 2.1234630126931943e-05, 'samples': 25045248, 'steps': 130443, 'loss/train': 1.4419621229171753} 11/07/2021 15:35:03 - INFO - __main__ - Step 130445: {'lr': 2.1232489892167335e-05, 'samples': 25045440, 'steps': 130444, 'loss/train': 1.0949281454086304} 11/07/2021 15:35:03 - INFO - __main__ - Step 130446: {'lr': 2.123034976048188e-05, 'samples': 25045632, 'steps': 130445, 'loss/train': 1.2154232263565063} 11/07/2021 15:35:04 - INFO - __main__ - Step 130447: {'lr': 2.1228209731876476e-05, 'samples': 25045824, 'steps': 130446, 'loss/train': 0.04103468358516693} 11/07/2021 15:35:05 - INFO - __main__ - Step 130448: {'lr': 2.1226069806352084e-05, 'samples': 25046016, 'steps': 130447, 'loss/train': 1.427359700202942} 11/07/2021 15:35:05 - INFO - __main__ - Step 130449: {'lr': 2.1223929983909705e-05, 'samples': 25046208, 'steps': 130448, 'loss/train': 1.139591932296753} 11/07/2021 15:35:05 - INFO - __main__ - Step 130450: {'lr': 2.122179026455029e-05, 'samples': 25046400, 'steps': 130449, 'loss/train': 1.4116480350494385} 11/07/2021 15:35:06 - INFO - __main__ - Step 130451: {'lr': 2.1219650648274802e-05, 'samples': 25046592, 'steps': 130450, 'loss/train': 1.3434717655181885} 11/07/2021 15:35:06 - INFO - __main__ - Step 130452: {'lr': 2.1217511135084216e-05, 'samples': 25046784, 'steps': 130451, 'loss/train': 1.415553331375122} 11/07/2021 15:35:06 - INFO - __main__ - Step 130453: {'lr': 2.1215371724979478e-05, 'samples': 25046976, 'steps': 130452, 'loss/train': 0.03399108722805977} 11/07/2021 15:35:08 - INFO - __main__ - Step 130454: {'lr': 2.121323241796158e-05, 'samples': 25047168, 'steps': 130453, 'loss/train': 0.9509701132774353} 11/07/2021 15:35:08 - INFO - __main__ - Step 130455: {'lr': 2.1211093214031445e-05, 'samples': 25047360, 'steps': 130454, 'loss/train': 1.060442328453064} 11/07/2021 15:35:08 - INFO - __main__ - Step 130456: {'lr': 2.120895411319007e-05, 'samples': 25047552, 'steps': 130455, 'loss/train': 1.2427752017974854} 11/07/2021 15:35:09 - INFO - __main__ - Step 130457: {'lr': 2.1206815115438427e-05, 'samples': 25047744, 'steps': 130456, 'loss/train': 1.740445852279663} 11/07/2021 15:35:09 - INFO - __main__ - Step 130458: {'lr': 2.120467622077746e-05, 'samples': 25047936, 'steps': 130457, 'loss/train': 1.3298530578613281} 11/07/2021 15:35:10 - INFO - __main__ - Step 130459: {'lr': 2.120253742920811e-05, 'samples': 25048128, 'steps': 130458, 'loss/train': 1.143074870109558} 11/07/2021 15:35:10 - INFO - __main__ - Step 130460: {'lr': 2.120039874073143e-05, 'samples': 25048320, 'steps': 130459, 'loss/train': 1.1570184230804443} 11/07/2021 15:35:11 - INFO - __main__ - Step 130461: {'lr': 2.1198260155348286e-05, 'samples': 25048512, 'steps': 130460, 'loss/train': 1.5878112316131592} 11/07/2021 15:35:11 - INFO - __main__ - Step 130462: {'lr': 2.1196121673059647e-05, 'samples': 25048704, 'steps': 130461, 'loss/train': 1.4542478322982788} 11/07/2021 15:35:11 - INFO - __main__ - Step 130463: {'lr': 2.1193983293866515e-05, 'samples': 25048896, 'steps': 130462, 'loss/train': 1.2560902833938599} 11/07/2021 15:35:12 - INFO - __main__ - Step 130464: {'lr': 2.1191845017769856e-05, 'samples': 25049088, 'steps': 130463, 'loss/train': 1.1533623933792114} 11/07/2021 15:35:13 - INFO - __main__ - Step 130465: {'lr': 2.1189706844770618e-05, 'samples': 25049280, 'steps': 130464, 'loss/train': 1.6050394773483276} 11/07/2021 15:35:13 - INFO - __main__ - Step 130466: {'lr': 2.118756877486977e-05, 'samples': 25049472, 'steps': 130465, 'loss/train': 0.7578378915786743} 11/07/2021 15:35:13 - INFO - __main__ - Step 130467: {'lr': 2.1185430808068257e-05, 'samples': 25049664, 'steps': 130466, 'loss/train': 1.2074002027511597} 11/07/2021 15:35:14 - INFO - __main__ - Step 130468: {'lr': 2.1183292944367078e-05, 'samples': 25049856, 'steps': 130467, 'loss/train': 1.2468960285186768} 11/07/2021 15:35:15 - INFO - __main__ - Step 130469: {'lr': 2.118115518376715e-05, 'samples': 25050048, 'steps': 130468, 'loss/train': 1.1966155767440796} 11/07/2021 15:35:15 - INFO - __main__ - Step 130470: {'lr': 2.1179017526269466e-05, 'samples': 25050240, 'steps': 130469, 'loss/train': 1.2117984294891357} 11/07/2021 15:35:15 - INFO - __main__ - Step 130471: {'lr': 2.117687997187501e-05, 'samples': 25050432, 'steps': 130470, 'loss/train': 1.6060292720794678} 11/07/2021 15:35:16 - INFO - __main__ - Step 130472: {'lr': 2.1174742520584712e-05, 'samples': 25050624, 'steps': 130471, 'loss/train': 1.3850857019424438} 11/07/2021 15:35:16 - INFO - __main__ - Step 130473: {'lr': 2.117260517239958e-05, 'samples': 25050816, 'steps': 130472, 'loss/train': 1.5936559438705444} 11/07/2021 15:35:17 - INFO - __main__ - Step 130474: {'lr': 2.1170467927320496e-05, 'samples': 25051008, 'steps': 130473, 'loss/train': 1.4257597923278809} 11/07/2021 15:35:18 - INFO - __main__ - Step 130475: {'lr': 2.1168330785348465e-05, 'samples': 25051200, 'steps': 130474, 'loss/train': 1.1786848306655884} 11/07/2021 15:35:18 - INFO - __main__ - Step 130476: {'lr': 2.1166193746484487e-05, 'samples': 25051392, 'steps': 130475, 'loss/train': 0.9973737001419067} 11/07/2021 15:35:18 - INFO - __main__ - Step 130477: {'lr': 2.1164056810729443e-05, 'samples': 25051584, 'steps': 130476, 'loss/train': 0.8983599543571472} 11/07/2021 15:35:19 - INFO - __main__ - Step 130478: {'lr': 2.1161919978084364e-05, 'samples': 25051776, 'steps': 130477, 'loss/train': 1.2952684164047241} 11/07/2021 15:35:20 - INFO - __main__ - Step 130479: {'lr': 2.1159783248550198e-05, 'samples': 25051968, 'steps': 130478, 'loss/train': 1.1688350439071655} 11/07/2021 15:35:20 - INFO - __main__ - Step 130480: {'lr': 2.1157646622127907e-05, 'samples': 25052160, 'steps': 130479, 'loss/train': 1.3295565843582153} 11/07/2021 15:35:20 - INFO - __main__ - Step 130481: {'lr': 2.1155510098818443e-05, 'samples': 25052352, 'steps': 130480, 'loss/train': 1.2006763219833374} 11/07/2021 15:35:21 - INFO - __main__ - Step 130482: {'lr': 2.1153373678622773e-05, 'samples': 25052544, 'steps': 130481, 'loss/train': 1.0275590419769287} 11/07/2021 15:35:21 - INFO - __main__ - Step 130483: {'lr': 2.115123736154187e-05, 'samples': 25052736, 'steps': 130482, 'loss/train': 1.2107484340667725} 11/07/2021 15:35:22 - INFO - __main__ - Step 130484: {'lr': 2.1149101147576677e-05, 'samples': 25052928, 'steps': 130483, 'loss/train': 1.2961862087249756} 11/07/2021 15:35:23 - INFO - __main__ - Step 130485: {'lr': 2.1146965036728165e-05, 'samples': 25053120, 'steps': 130484, 'loss/train': 1.0886365175247192} 11/07/2021 15:35:23 - INFO - __main__ - Step 130486: {'lr': 2.1144829028997337e-05, 'samples': 25053312, 'steps': 130485, 'loss/train': 1.4886339902877808} 11/07/2021 15:35:23 - INFO - __main__ - Step 130487: {'lr': 2.1142693124385105e-05, 'samples': 25053504, 'steps': 130486, 'loss/train': 0.7775676250457764} 11/07/2021 15:35:24 - INFO - __main__ - Step 130488: {'lr': 2.1140557322892413e-05, 'samples': 25053696, 'steps': 130487, 'loss/train': 1.3932828903198242} 11/07/2021 15:35:25 - INFO - __main__ - Step 130489: {'lr': 2.113842162452026e-05, 'samples': 25053888, 'steps': 130488, 'loss/train': 1.5718706846237183} 11/07/2021 15:35:25 - INFO - __main__ - Step 130490: {'lr': 2.1136286029269618e-05, 'samples': 25054080, 'steps': 130489, 'loss/train': 1.4589868783950806} 11/07/2021 15:35:25 - INFO - __main__ - Step 130491: {'lr': 2.1134150537141432e-05, 'samples': 25054272, 'steps': 130490, 'loss/train': 1.6388858556747437} 11/07/2021 15:35:26 - INFO - __main__ - Step 130492: {'lr': 2.1132015148136645e-05, 'samples': 25054464, 'steps': 130491, 'loss/train': 0.3651597201824188} 11/07/2021 15:35:26 - INFO - __main__ - Step 130493: {'lr': 2.112987986225626e-05, 'samples': 25054656, 'steps': 130492, 'loss/train': 1.6351587772369385} 11/07/2021 15:35:26 - INFO - __main__ - Step 130494: {'lr': 2.112774467950121e-05, 'samples': 25054848, 'steps': 130493, 'loss/train': 0.7724878787994385} 11/07/2021 15:35:27 - INFO - __main__ - Step 130495: {'lr': 2.1125609599872475e-05, 'samples': 25055040, 'steps': 130494, 'loss/train': 1.2440118789672852} 11/07/2021 15:35:28 - INFO - __main__ - Step 130496: {'lr': 2.1123474623370998e-05, 'samples': 25055232, 'steps': 130495, 'loss/train': 1.0091400146484375} 11/07/2021 15:35:28 - INFO - __main__ - Step 130497: {'lr': 2.1121339749997748e-05, 'samples': 25055424, 'steps': 130496, 'loss/train': 0.04605034366250038} 11/07/2021 15:35:29 - INFO - __main__ - Step 130498: {'lr': 2.1119204979753696e-05, 'samples': 25055616, 'steps': 130497, 'loss/train': 1.5535860061645508} 11/07/2021 15:35:29 - INFO - __main__ - Step 130499: {'lr': 2.111707031263982e-05, 'samples': 25055808, 'steps': 130498, 'loss/train': 1.5246989727020264} 11/07/2021 15:35:30 - INFO - __main__ - Step 130500: {'lr': 2.1114935748657083e-05, 'samples': 25056000, 'steps': 130499, 'loss/train': 1.3998115062713623} 11/07/2021 15:35:30 - INFO - __main__ - Step 130501: {'lr': 2.1112801287806378e-05, 'samples': 25056192, 'steps': 130500, 'loss/train': 1.2674553394317627} 11/07/2021 15:35:31 - INFO - __main__ - Step 130502: {'lr': 2.111066693008873e-05, 'samples': 25056384, 'steps': 130501, 'loss/train': 1.273673176765442} 11/07/2021 15:35:31 - INFO - __main__ - Step 130503: {'lr': 2.1108532675505056e-05, 'samples': 25056576, 'steps': 130502, 'loss/train': 1.2188140153884888} 11/07/2021 15:35:31 - INFO - __main__ - Step 130504: {'lr': 2.1106398524056353e-05, 'samples': 25056768, 'steps': 130503, 'loss/train': 0.731601893901825} 11/07/2021 15:35:32 - INFO - __main__ - Step 130505: {'lr': 2.1104264475743595e-05, 'samples': 25056960, 'steps': 130504, 'loss/train': 1.3138458728790283} 11/07/2021 15:35:33 - INFO - __main__ - Step 130506: {'lr': 2.1102130530567697e-05, 'samples': 25057152, 'steps': 130505, 'loss/train': 1.5065407752990723} 11/07/2021 15:35:33 - INFO - __main__ - Step 130507: {'lr': 2.109999668852966e-05, 'samples': 25057344, 'steps': 130506, 'loss/train': 1.0710046291351318} 11/07/2021 15:35:33 - INFO - __main__ - Step 130508: {'lr': 2.1097862949630453e-05, 'samples': 25057536, 'steps': 130507, 'loss/train': 1.1304330825805664} 11/07/2021 15:35:34 - INFO - __main__ - Step 130509: {'lr': 2.1095729313870994e-05, 'samples': 25057728, 'steps': 130508, 'loss/train': 1.5277169942855835} 11/07/2021 15:35:35 - INFO - __main__ - Step 130510: {'lr': 2.1093595781252278e-05, 'samples': 25057920, 'steps': 130509, 'loss/train': 0.7247419953346252} 11/07/2021 15:35:35 - INFO - __main__ - Step 130511: {'lr': 2.1091462351775225e-05, 'samples': 25058112, 'steps': 130510, 'loss/train': 0.9477328062057495} 11/07/2021 15:35:36 - INFO - __main__ - Step 130512: {'lr': 2.1089329025440862e-05, 'samples': 25058304, 'steps': 130511, 'loss/train': 1.3628439903259277} 11/07/2021 15:35:36 - INFO - __main__ - Step 130513: {'lr': 2.1087195802250132e-05, 'samples': 25058496, 'steps': 130512, 'loss/train': 1.1926015615463257} 11/07/2021 15:35:36 - INFO - __main__ - Step 130514: {'lr': 2.108506268220395e-05, 'samples': 25058688, 'steps': 130513, 'loss/train': 1.3259892463684082} 11/07/2021 15:35:37 - INFO - __main__ - Step 130515: {'lr': 2.1082929665303313e-05, 'samples': 25058880, 'steps': 130514, 'loss/train': 0.9586385488510132} 11/07/2021 15:35:38 - INFO - __main__ - Step 130516: {'lr': 2.108079675154917e-05, 'samples': 25059072, 'steps': 130515, 'loss/train': 1.1975014209747314} 11/07/2021 15:35:38 - INFO - __main__ - Step 130517: {'lr': 2.1078663940942488e-05, 'samples': 25059264, 'steps': 130516, 'loss/train': 1.4966517686843872} 11/07/2021 15:35:38 - INFO - __main__ - Step 130518: {'lr': 2.1076531233484214e-05, 'samples': 25059456, 'steps': 130517, 'loss/train': 1.258492112159729} 11/07/2021 15:35:39 - INFO - __main__ - Step 130519: {'lr': 2.1074398629175345e-05, 'samples': 25059648, 'steps': 130518, 'loss/train': 1.4948049783706665} 11/07/2021 15:35:40 - INFO - __main__ - Step 130520: {'lr': 2.1072266128016797e-05, 'samples': 25059840, 'steps': 130519, 'loss/train': 1.4866386651992798} 11/07/2021 15:35:40 - INFO - __main__ - Step 130521: {'lr': 2.1070133730009573e-05, 'samples': 25060032, 'steps': 130520, 'loss/train': 0.5479388236999512} 11/07/2021 15:35:41 - INFO - __main__ - Step 130522: {'lr': 2.106800143515461e-05, 'samples': 25060224, 'steps': 130521, 'loss/train': 0.8744893074035645} 11/07/2021 15:35:41 - INFO - __main__ - Step 130523: {'lr': 2.1065869243452857e-05, 'samples': 25060416, 'steps': 130522, 'loss/train': 1.0371372699737549} 11/07/2021 15:35:41 - INFO - __main__ - Step 130524: {'lr': 2.106373715490531e-05, 'samples': 25060608, 'steps': 130523, 'loss/train': 1.1347852945327759} 11/07/2021 15:35:42 - INFO - __main__ - Step 130525: {'lr': 2.1061605169512915e-05, 'samples': 25060800, 'steps': 130524, 'loss/train': 1.2490605115890503} 11/07/2021 15:35:43 - INFO - __main__ - Step 130526: {'lr': 2.1059473287276615e-05, 'samples': 25060992, 'steps': 130525, 'loss/train': 1.57795250415802} 11/07/2021 15:35:43 - INFO - __main__ - Step 130527: {'lr': 2.1057341508197408e-05, 'samples': 25061184, 'steps': 130526, 'loss/train': 1.0976277589797974} 11/07/2021 15:35:43 - INFO - __main__ - Step 130528: {'lr': 2.1055209832276213e-05, 'samples': 25061376, 'steps': 130527, 'loss/train': 1.2535333633422852} 11/07/2021 15:35:44 - INFO - __main__ - Step 130529: {'lr': 2.1053078259514e-05, 'samples': 25061568, 'steps': 130528, 'loss/train': 0.7410780191421509} 11/07/2021 15:35:44 - INFO - __main__ - Step 130530: {'lr': 2.1050946789911733e-05, 'samples': 25061760, 'steps': 130529, 'loss/train': 1.317876935005188} 11/07/2021 15:35:45 - INFO - __main__ - Step 130531: {'lr': 2.1048815423470397e-05, 'samples': 25061952, 'steps': 130530, 'loss/train': 0.9051169753074646} 11/07/2021 15:35:45 - INFO - __main__ - Step 130532: {'lr': 2.1046684160190897e-05, 'samples': 25062144, 'steps': 130531, 'loss/train': 1.2756046056747437} 11/07/2021 15:35:46 - INFO - __main__ - Step 130533: {'lr': 2.1044553000074268e-05, 'samples': 25062336, 'steps': 130532, 'loss/train': 1.188385248184204} 11/07/2021 15:35:46 - INFO - __main__ - Step 130534: {'lr': 2.1042421943121393e-05, 'samples': 25062528, 'steps': 130533, 'loss/train': 1.2767983675003052} 11/07/2021 15:35:46 - INFO - __main__ - Step 130535: {'lr': 2.10402909893333e-05, 'samples': 25062720, 'steps': 130534, 'loss/train': 1.0312210321426392} 11/07/2021 15:35:48 - INFO - __main__ - Step 130536: {'lr': 2.1038160138710903e-05, 'samples': 25062912, 'steps': 130535, 'loss/train': 1.1424872875213623} 11/07/2021 15:35:48 - INFO - __main__ - Step 130537: {'lr': 2.103602939125518e-05, 'samples': 25063104, 'steps': 130536, 'loss/train': 1.312482476234436} 11/07/2021 15:35:48 - INFO - __main__ - Step 130538: {'lr': 2.1033898746967094e-05, 'samples': 25063296, 'steps': 130537, 'loss/train': 1.3076714277267456} 11/07/2021 15:35:49 - INFO - __main__ - Step 130539: {'lr': 2.1031768205847624e-05, 'samples': 25063488, 'steps': 130538, 'loss/train': 1.3890430927276611} 11/07/2021 15:35:49 - INFO - __main__ - Step 130540: {'lr': 2.1029637767897682e-05, 'samples': 25063680, 'steps': 130539, 'loss/train': 1.328335165977478} 11/07/2021 15:35:50 - INFO - __main__ - Step 130541: {'lr': 2.1027507433118237e-05, 'samples': 25063872, 'steps': 130540, 'loss/train': 1.3425686359405518} 11/07/2021 15:35:50 - INFO - __main__ - Step 130542: {'lr': 2.1025377201510294e-05, 'samples': 25064064, 'steps': 130541, 'loss/train': 1.9788906574249268} 11/07/2021 15:35:51 - INFO - __main__ - Step 130543: {'lr': 2.1023247073074763e-05, 'samples': 25064256, 'steps': 130542, 'loss/train': 1.1578699350357056} 11/07/2021 15:35:51 - INFO - __main__ - Step 130544: {'lr': 2.1021117047812622e-05, 'samples': 25064448, 'steps': 130543, 'loss/train': 1.5384957790374756} 11/07/2021 15:35:51 - INFO - __main__ - Step 130545: {'lr': 2.1018987125724837e-05, 'samples': 25064640, 'steps': 130544, 'loss/train': 1.407044768333435} 11/07/2021 15:35:53 - INFO - __main__ - Step 130546: {'lr': 2.1016857306812353e-05, 'samples': 25064832, 'steps': 130545, 'loss/train': 1.3113455772399902} 11/07/2021 15:35:53 - INFO - __main__ - Step 130547: {'lr': 2.1014727591076143e-05, 'samples': 25065024, 'steps': 130546, 'loss/train': 1.15486478805542} 11/07/2021 15:35:53 - INFO - __main__ - Step 130548: {'lr': 2.1012597978517178e-05, 'samples': 25065216, 'steps': 130547, 'loss/train': 1.712644338607788} 11/07/2021 15:35:54 - INFO - __main__ - Step 130549: {'lr': 2.1010468469136375e-05, 'samples': 25065408, 'steps': 130548, 'loss/train': 1.0564440488815308} 11/07/2021 15:35:54 - INFO - __main__ - Step 130550: {'lr': 2.1008339062934785e-05, 'samples': 25065600, 'steps': 130549, 'loss/train': 1.4894517660140991} 11/07/2021 15:35:54 - INFO - __main__ - Step 130551: {'lr': 2.100620975991327e-05, 'samples': 25065792, 'steps': 130550, 'loss/train': 1.3108144998550415} 11/07/2021 15:35:55 - INFO - __main__ - Step 130552: {'lr': 2.10040805600728e-05, 'samples': 25065984, 'steps': 130551, 'loss/train': 1.0102955102920532} 11/07/2021 15:35:56 - INFO - __main__ - Step 130553: {'lr': 2.100195146341438e-05, 'samples': 25066176, 'steps': 130552, 'loss/train': 1.0244978666305542} 11/07/2021 15:35:56 - INFO - __main__ - Step 130554: {'lr': 2.0999822469938923e-05, 'samples': 25066368, 'steps': 130553, 'loss/train': 1.0384957790374756} 11/07/2021 15:35:56 - INFO - __main__ - Step 130555: {'lr': 2.0997693579647426e-05, 'samples': 25066560, 'steps': 130554, 'loss/train': 0.7986382246017456} 11/07/2021 15:35:57 - INFO - __main__ - Step 130556: {'lr': 2.0995564792540832e-05, 'samples': 25066752, 'steps': 130555, 'loss/train': 0.8722521066665649} 11/07/2021 15:35:58 - INFO - __main__ - Step 130557: {'lr': 2.099343610862009e-05, 'samples': 25066944, 'steps': 130556, 'loss/train': 1.3011178970336914} 11/07/2021 15:35:58 - INFO - __main__ - Step 130558: {'lr': 2.0991307527886195e-05, 'samples': 25067136, 'steps': 130557, 'loss/train': 1.2487999200820923} 11/07/2021 15:35:59 - INFO - __main__ - Step 130559: {'lr': 2.0989179050340064e-05, 'samples': 25067328, 'steps': 130558, 'loss/train': 1.1832983493804932} 11/07/2021 15:35:59 - INFO - __main__ - Step 130560: {'lr': 2.0987050675982695e-05, 'samples': 25067520, 'steps': 130559, 'loss/train': 0.22789731621742249} 11/07/2021 15:35:59 - INFO - __main__ - Step 130561: {'lr': 2.098492240481506e-05, 'samples': 25067712, 'steps': 130560, 'loss/train': 1.1682430505752563} 11/07/2021 15:36:01 - INFO - __main__ - Step 130562: {'lr': 2.0982794236838048e-05, 'samples': 25067904, 'steps': 130561, 'loss/train': 0.810385525226593} 11/07/2021 15:36:01 - INFO - __main__ - Step 130563: {'lr': 2.0980666172052633e-05, 'samples': 25068096, 'steps': 130562, 'loss/train': 0.810076892375946} 11/07/2021 15:36:01 - INFO - __main__ - Step 130564: {'lr': 2.0978538210459837e-05, 'samples': 25068288, 'steps': 130563, 'loss/train': 1.2579869031906128} 11/07/2021 15:36:02 - INFO - __main__ - Step 130565: {'lr': 2.097641035206055e-05, 'samples': 25068480, 'steps': 130564, 'loss/train': 1.5380207300186157} 11/07/2021 15:36:02 - INFO - __main__ - Step 130566: {'lr': 2.0974282596855743e-05, 'samples': 25068672, 'steps': 130565, 'loss/train': 1.2540630102157593} 11/07/2021 15:36:03 - INFO - __main__ - Step 130567: {'lr': 2.0972154944846418e-05, 'samples': 25068864, 'steps': 130566, 'loss/train': 1.3887062072753906} 11/07/2021 15:36:03 - INFO - __main__ - Step 130568: {'lr': 2.0970027396033485e-05, 'samples': 25069056, 'steps': 130567, 'loss/train': 0.9828259944915771} 11/07/2021 15:36:04 - INFO - __main__ - Step 130569: {'lr': 2.096789995041795e-05, 'samples': 25069248, 'steps': 130568, 'loss/train': 0.908704400062561} 11/07/2021 15:36:04 - INFO - __main__ - Step 130570: {'lr': 2.0965772608000726e-05, 'samples': 25069440, 'steps': 130569, 'loss/train': 1.0128847360610962} 11/07/2021 15:36:04 - INFO - __main__ - Step 130571: {'lr': 2.0963645368782787e-05, 'samples': 25069632, 'steps': 130570, 'loss/train': 0.7763883471488953} 11/07/2021 15:36:05 - INFO - __main__ - Step 130572: {'lr': 2.0961518232765154e-05, 'samples': 25069824, 'steps': 130571, 'loss/train': 1.3751682043075562} 11/07/2021 15:36:06 - INFO - __main__ - Step 130573: {'lr': 2.0959391199948663e-05, 'samples': 25070016, 'steps': 130572, 'loss/train': 1.2451969385147095} 11/07/2021 15:36:06 - INFO - __main__ - Step 130574: {'lr': 2.095726427033434e-05, 'samples': 25070208, 'steps': 130573, 'loss/train': 1.5276252031326294} 11/07/2021 15:36:07 - INFO - __main__ - Step 130575: {'lr': 2.095513744392316e-05, 'samples': 25070400, 'steps': 130574, 'loss/train': 1.1123045682907104} 11/07/2021 15:36:07 - INFO - __main__ - Step 130576: {'lr': 2.0953010720716037e-05, 'samples': 25070592, 'steps': 130575, 'loss/train': 1.3503443002700806} 11/07/2021 15:36:08 - INFO - __main__ - Step 130577: {'lr': 2.0950884100713968e-05, 'samples': 25070784, 'steps': 130576, 'loss/train': 0.7455087900161743} 11/07/2021 15:36:08 - INFO - __main__ - Step 130578: {'lr': 2.0948757583917897e-05, 'samples': 25070976, 'steps': 130577, 'loss/train': 1.5028589963912964} 11/07/2021 15:36:09 - INFO - __main__ - Step 130579: {'lr': 2.0946631170328773e-05, 'samples': 25071168, 'steps': 130578, 'loss/train': 0.9782928228378296} 11/07/2021 15:36:09 - INFO - __main__ - Step 130580: {'lr': 2.094450485994756e-05, 'samples': 25071360, 'steps': 130579, 'loss/train': 1.6844210624694824} 11/07/2021 15:36:09 - INFO - __main__ - Step 130581: {'lr': 2.0942378652775236e-05, 'samples': 25071552, 'steps': 130580, 'loss/train': 1.5262078046798706} 11/07/2021 15:36:10 - INFO - __main__ - Step 130582: {'lr': 2.094025254881271e-05, 'samples': 25071744, 'steps': 130581, 'loss/train': 0.6495213508605957} 11/07/2021 15:36:11 - INFO - __main__ - Step 130583: {'lr': 2.0938126548061044e-05, 'samples': 25071936, 'steps': 130582, 'loss/train': 1.5268882513046265} 11/07/2021 15:36:11 - INFO - __main__ - Step 130584: {'lr': 2.0936000650521064e-05, 'samples': 25072128, 'steps': 130583, 'loss/train': 1.1135808229446411} 11/07/2021 15:36:11 - INFO - __main__ - Step 130585: {'lr': 2.0933874856193804e-05, 'samples': 25072320, 'steps': 130584, 'loss/train': 1.6983407735824585} 11/07/2021 15:36:12 - INFO - __main__ - Step 130586: {'lr': 2.0931749165080198e-05, 'samples': 25072512, 'steps': 130585, 'loss/train': 1.729676604270935} 11/07/2021 15:36:13 - INFO - __main__ - Step 130587: {'lr': 2.09296235771812e-05, 'samples': 25072704, 'steps': 130586, 'loss/train': 1.1043787002563477} 11/07/2021 15:36:13 - INFO - __main__ - Step 130588: {'lr': 2.0927498092497804e-05, 'samples': 25072896, 'steps': 130587, 'loss/train': 0.9955835938453674} 11/07/2021 15:36:14 - INFO - __main__ - Step 130589: {'lr': 2.0925372711030926e-05, 'samples': 25073088, 'steps': 130588, 'loss/train': 1.1999506950378418} 11/07/2021 15:36:14 - INFO - __main__ - Step 130590: {'lr': 2.092324743278154e-05, 'samples': 25073280, 'steps': 130589, 'loss/train': 0.8864172697067261} 11/07/2021 15:36:14 - INFO - __main__ - Step 130591: {'lr': 2.0921122257750586e-05, 'samples': 25073472, 'steps': 130590, 'loss/train': 1.4456690549850464} 11/07/2021 15:36:15 - INFO - __main__ - Step 130592: {'lr': 2.0918997185939066e-05, 'samples': 25073664, 'steps': 130591, 'loss/train': 1.7718309164047241} 11/07/2021 15:36:16 - INFO - __main__ - Step 130593: {'lr': 2.091687221734789e-05, 'samples': 25073856, 'steps': 130592, 'loss/train': 1.1637156009674072} 11/07/2021 15:36:16 - INFO - __main__ - Step 130594: {'lr': 2.0914747351978097e-05, 'samples': 25074048, 'steps': 130593, 'loss/train': 0.8844444751739502} 11/07/2021 15:36:16 - INFO - __main__ - Step 130595: {'lr': 2.0912622589830536e-05, 'samples': 25074240, 'steps': 130594, 'loss/train': 0.790757417678833} 11/07/2021 15:36:17 - INFO - __main__ - Step 130596: {'lr': 2.0910497930906215e-05, 'samples': 25074432, 'steps': 130595, 'loss/train': 0.773059070110321} 11/07/2021 15:36:18 - INFO - __main__ - Step 130597: {'lr': 2.0908373375206096e-05, 'samples': 25074624, 'steps': 130596, 'loss/train': 1.2117518186569214} 11/07/2021 15:36:18 - INFO - __main__ - Step 130598: {'lr': 2.090624892273113e-05, 'samples': 25074816, 'steps': 130597, 'loss/train': 1.219698429107666} 11/07/2021 15:36:18 - INFO - __main__ - Step 130599: {'lr': 2.090412457348226e-05, 'samples': 25075008, 'steps': 130598, 'loss/train': 1.3639576435089111} 11/07/2021 15:36:19 - INFO - __main__ - Step 130600: {'lr': 2.090200032746045e-05, 'samples': 25075200, 'steps': 130599, 'loss/train': 1.129970908164978} 11/07/2021 15:36:19 - INFO - __main__ - Step 130601: {'lr': 2.0899876184666654e-05, 'samples': 25075392, 'steps': 130600, 'loss/train': 0.41869476437568665} 11/07/2021 15:36:20 - INFO - __main__ - Step 130602: {'lr': 2.0897752145101867e-05, 'samples': 25075584, 'steps': 130601, 'loss/train': 0.8318364024162292} 11/07/2021 15:36:21 - INFO - __main__ - Step 130603: {'lr': 2.0895628208767005e-05, 'samples': 25075776, 'steps': 130602, 'loss/train': 1.144010305404663} 11/07/2021 15:36:21 - INFO - __main__ - Step 130604: {'lr': 2.0893504375663036e-05, 'samples': 25075968, 'steps': 130603, 'loss/train': 1.471121907234192} 11/07/2021 15:36:21 - INFO - __main__ - Step 130605: {'lr': 2.0891380645790936e-05, 'samples': 25076160, 'steps': 130604, 'loss/train': 1.5305187702178955} 11/07/2021 15:36:22 - INFO - __main__ - Step 130606: {'lr': 2.088925701915162e-05, 'samples': 25076352, 'steps': 130605, 'loss/train': 1.287180781364441} 11/07/2021 15:36:23 - INFO - __main__ - Step 130607: {'lr': 2.0887133495746113e-05, 'samples': 25076544, 'steps': 130606, 'loss/train': 1.296364665031433} 11/07/2021 15:36:23 - INFO - __main__ - Step 130608: {'lr': 2.0885010075575307e-05, 'samples': 25076736, 'steps': 130607, 'loss/train': 1.5802316665649414} 11/07/2021 15:36:23 - INFO - __main__ - Step 130609: {'lr': 2.0882886758640168e-05, 'samples': 25076928, 'steps': 130608, 'loss/train': 1.4828801155090332} 11/07/2021 15:36:24 - INFO - __main__ - Step 130610: {'lr': 2.0880763544941673e-05, 'samples': 25077120, 'steps': 130609, 'loss/train': 1.3469505310058594} 11/07/2021 15:36:24 - INFO - __main__ - Step 130611: {'lr': 2.0878640434480763e-05, 'samples': 25077312, 'steps': 130610, 'loss/train': 1.136623740196228} 11/07/2021 15:36:24 - INFO - __main__ - Step 130612: {'lr': 2.087651742725838e-05, 'samples': 25077504, 'steps': 130611, 'loss/train': 1.3246970176696777} 11/07/2021 15:36:25 - INFO - __main__ - Step 130613: {'lr': 2.0874394523275526e-05, 'samples': 25077696, 'steps': 130612, 'loss/train': 1.1384873390197754} 11/07/2021 15:36:26 - INFO - __main__ - Step 130614: {'lr': 2.0872271722533142e-05, 'samples': 25077888, 'steps': 130613, 'loss/train': 1.0376256704330444} 11/07/2021 15:36:26 - INFO - __main__ - Step 130615: {'lr': 2.0870149025032174e-05, 'samples': 25078080, 'steps': 130614, 'loss/train': 1.4796966314315796} 11/07/2021 15:36:26 - INFO - __main__ - Step 130616: {'lr': 2.086802643077357e-05, 'samples': 25078272, 'steps': 130615, 'loss/train': 0.9412534832954407} 11/07/2021 15:36:27 - INFO - __main__ - Step 130617: {'lr': 2.0865903939758292e-05, 'samples': 25078464, 'steps': 130616, 'loss/train': 1.2779724597930908} 11/07/2021 15:36:28 - INFO - __main__ - Step 130618: {'lr': 2.0863781551987316e-05, 'samples': 25078656, 'steps': 130617, 'loss/train': 1.0735273361206055} 11/07/2021 15:36:29 - INFO - __main__ - Step 130619: {'lr': 2.0861659267461585e-05, 'samples': 25078848, 'steps': 130618, 'loss/train': 1.4117523431777954} 11/07/2021 15:36:29 - INFO - __main__ - Step 130620: {'lr': 2.0859537086182045e-05, 'samples': 25079040, 'steps': 130619, 'loss/train': 1.348544716835022} 11/07/2021 15:36:29 - INFO - __main__ - Step 130621: {'lr': 2.0857415008149723e-05, 'samples': 25079232, 'steps': 130620, 'loss/train': 0.11173141002655029} 11/07/2021 15:36:30 - INFO - __main__ - Step 130622: {'lr': 2.0855293033365445e-05, 'samples': 25079424, 'steps': 130621, 'loss/train': 0.31259942054748535} 11/07/2021 15:36:31 - INFO - __main__ - Step 130623: {'lr': 2.085317116183025e-05, 'samples': 25079616, 'steps': 130622, 'loss/train': 0.5646777153015137} 11/07/2021 15:36:31 - INFO - __main__ - Step 130624: {'lr': 2.0851049393545068e-05, 'samples': 25079808, 'steps': 130623, 'loss/train': 1.6228909492492676} 11/07/2021 15:36:31 - INFO - __main__ - Step 130625: {'lr': 2.084892772851088e-05, 'samples': 25080000, 'steps': 130624, 'loss/train': 1.3916115760803223} 11/07/2021 15:36:32 - INFO - __main__ - Step 130626: {'lr': 2.084680616672863e-05, 'samples': 25080192, 'steps': 130625, 'loss/train': 0.825897753238678} 11/07/2021 15:36:32 - INFO - __main__ - Step 130627: {'lr': 2.084468470819928e-05, 'samples': 25080384, 'steps': 130626, 'loss/train': 1.201064944267273} 11/07/2021 15:36:33 - INFO - __main__ - Step 130628: {'lr': 2.0842563352923753e-05, 'samples': 25080576, 'steps': 130627, 'loss/train': 1.1280732154846191} 11/07/2021 15:36:34 - INFO - __main__ - Step 130629: {'lr': 2.084044210090305e-05, 'samples': 25080768, 'steps': 130628, 'loss/train': 1.0183477401733398} 11/07/2021 15:36:34 - INFO - __main__ - Step 130630: {'lr': 2.0838320952138113e-05, 'samples': 25080960, 'steps': 130629, 'loss/train': 1.3376638889312744} 11/07/2021 15:36:34 - INFO - __main__ - Step 130631: {'lr': 2.0836199906629883e-05, 'samples': 25081152, 'steps': 130630, 'loss/train': 0.46027737855911255} 11/07/2021 15:36:35 - INFO - __main__ - Step 130632: {'lr': 2.0834078964379304e-05, 'samples': 25081344, 'steps': 130631, 'loss/train': 1.271215796470642} 11/07/2021 15:36:36 - INFO - __main__ - Step 130633: {'lr': 2.083195812538738e-05, 'samples': 25081536, 'steps': 130632, 'loss/train': 1.5100266933441162} 11/07/2021 15:36:36 - INFO - __main__ - Step 130634: {'lr': 2.0829837389655078e-05, 'samples': 25081728, 'steps': 130633, 'loss/train': 1.202318787574768} 11/07/2021 15:36:36 - INFO - __main__ - Step 130635: {'lr': 2.0827716757183285e-05, 'samples': 25081920, 'steps': 130634, 'loss/train': 0.8729540109634399} 11/07/2021 15:36:37 - INFO - __main__ - Step 130636: {'lr': 2.082559622797295e-05, 'samples': 25082112, 'steps': 130635, 'loss/train': 1.0955547094345093} 11/07/2021 15:36:37 - INFO - __main__ - Step 130637: {'lr': 2.0823475802025092e-05, 'samples': 25082304, 'steps': 130636, 'loss/train': 1.0549893379211426} 11/07/2021 15:36:38 - INFO - __main__ - Step 130638: {'lr': 2.0821355479340638e-05, 'samples': 25082496, 'steps': 130637, 'loss/train': 1.3389432430267334} 11/07/2021 15:36:38 - INFO - __main__ - Step 130639: {'lr': 2.081923525992055e-05, 'samples': 25082688, 'steps': 130638, 'loss/train': 1.1190619468688965} 11/07/2021 15:36:39 - INFO - __main__ - Step 130640: {'lr': 2.0817115143765776e-05, 'samples': 25082880, 'steps': 130639, 'loss/train': 0.590031087398529} 11/07/2021 15:36:39 - INFO - __main__ - Step 130641: {'lr': 2.0814995130877256e-05, 'samples': 25083072, 'steps': 130640, 'loss/train': 2.1933727264404297} 11/07/2021 15:36:39 - INFO - __main__ - Step 130642: {'lr': 2.0812875221255967e-05, 'samples': 25083264, 'steps': 130641, 'loss/train': 1.352471113204956} 11/07/2021 15:36:41 - INFO - __main__ - Step 130643: {'lr': 2.0810755414902878e-05, 'samples': 25083456, 'steps': 130642, 'loss/train': 0.9440672397613525} 11/07/2021 15:36:41 - INFO - __main__ - Step 130644: {'lr': 2.08086357118189e-05, 'samples': 25083648, 'steps': 130643, 'loss/train': 0.773542046546936} 11/07/2021 15:36:41 - INFO - __main__ - Step 130645: {'lr': 2.0806516112005042e-05, 'samples': 25083840, 'steps': 130644, 'loss/train': 1.3255640268325806} 11/07/2021 15:36:42 - INFO - __main__ - Step 130646: {'lr': 2.0804396615462213e-05, 'samples': 25084032, 'steps': 130645, 'loss/train': 1.4960052967071533} 11/07/2021 15:36:42 - INFO - __main__ - Step 130647: {'lr': 2.0802277222191385e-05, 'samples': 25084224, 'steps': 130646, 'loss/train': 1.0533169507980347} 11/07/2021 15:36:42 - INFO - __main__ - Step 130648: {'lr': 2.080015793219356e-05, 'samples': 25084416, 'steps': 130647, 'loss/train': 0.6459395885467529} 11/07/2021 15:36:43 - INFO - __main__ - Step 130649: {'lr': 2.079803874546962e-05, 'samples': 25084608, 'steps': 130648, 'loss/train': 1.4977012872695923} 11/07/2021 15:36:44 - INFO - __main__ - Step 130650: {'lr': 2.0795919662020518e-05, 'samples': 25084800, 'steps': 130649, 'loss/train': 1.214499831199646} 11/07/2021 15:36:44 - INFO - __main__ - Step 130651: {'lr': 2.0793800681847276e-05, 'samples': 25084992, 'steps': 130650, 'loss/train': 1.1423152685165405} 11/07/2021 15:36:44 - INFO - __main__ - Step 130652: {'lr': 2.079168180495078e-05, 'samples': 25085184, 'steps': 130651, 'loss/train': 1.0431402921676636} 11/07/2021 15:36:45 - INFO - __main__ - Step 130653: {'lr': 2.0789563031332003e-05, 'samples': 25085376, 'steps': 130652, 'loss/train': 4.477959632873535} 11/07/2021 15:36:46 - INFO - __main__ - Step 130654: {'lr': 2.0787444360991948e-05, 'samples': 25085568, 'steps': 130653, 'loss/train': 1.596901297569275} 11/07/2021 15:36:46 - INFO - __main__ - Step 130655: {'lr': 2.0785325793931524e-05, 'samples': 25085760, 'steps': 130654, 'loss/train': 1.2291017770767212} 11/07/2021 15:36:46 - INFO - __main__ - Step 130656: {'lr': 2.078320733015168e-05, 'samples': 25085952, 'steps': 130655, 'loss/train': 1.6324663162231445} 11/07/2021 15:36:47 - INFO - __main__ - Step 130657: {'lr': 2.0781088969653388e-05, 'samples': 25086144, 'steps': 130656, 'loss/train': 1.9082087278366089} 11/07/2021 15:36:47 - INFO - __main__ - Step 130658: {'lr': 2.0778970712437616e-05, 'samples': 25086336, 'steps': 130657, 'loss/train': 0.9867810606956482} 11/07/2021 15:36:48 - INFO - __main__ - Step 130659: {'lr': 2.077685255850528e-05, 'samples': 25086528, 'steps': 130658, 'loss/train': 1.2779691219329834} 11/07/2021 15:36:49 - INFO - __main__ - Step 130660: {'lr': 2.0774734507857383e-05, 'samples': 25086720, 'steps': 130659, 'loss/train': 0.9261911511421204} 11/07/2021 15:36:49 - INFO - __main__ - Step 130661: {'lr': 2.0772616560494893e-05, 'samples': 25086912, 'steps': 130660, 'loss/train': 1.1032962799072266} 11/07/2021 15:36:49 - INFO - __main__ - Step 130662: {'lr': 2.0770498716418673e-05, 'samples': 25087104, 'steps': 130661, 'loss/train': 0.9135032892227173} 11/07/2021 15:36:50 - INFO - __main__ - Step 130663: {'lr': 2.076838097562972e-05, 'samples': 25087296, 'steps': 130662, 'loss/train': 1.2395061254501343} 11/07/2021 15:36:51 - INFO - __main__ - Step 130664: {'lr': 2.0766263338129003e-05, 'samples': 25087488, 'steps': 130663, 'loss/train': 1.4433070421218872} 11/07/2021 15:36:51 - INFO - __main__ - Step 130665: {'lr': 2.0764145803917472e-05, 'samples': 25087680, 'steps': 130664, 'loss/train': 1.4845099449157715} 11/07/2021 15:36:51 - INFO - __main__ - Step 130666: {'lr': 2.0762028372996093e-05, 'samples': 25087872, 'steps': 130665, 'loss/train': 1.3123705387115479} 11/07/2021 15:36:52 - INFO - __main__ - Step 130667: {'lr': 2.0759911045365788e-05, 'samples': 25088064, 'steps': 130666, 'loss/train': 1.114951252937317} 11/07/2021 15:36:52 - INFO - __main__ - Step 130668: {'lr': 2.075779382102755e-05, 'samples': 25088256, 'steps': 130667, 'loss/train': 1.1226078271865845} 11/07/2021 15:36:53 - INFO - __main__ - Step 130669: {'lr': 2.0755676699982294e-05, 'samples': 25088448, 'steps': 130668, 'loss/train': 1.5078067779541016} 11/07/2021 15:36:54 - INFO - __main__ - Step 130670: {'lr': 2.0753559682231e-05, 'samples': 25088640, 'steps': 130669, 'loss/train': 0.9618829488754272} 11/07/2021 15:36:54 - INFO - __main__ - Step 130671: {'lr': 2.0751442767774605e-05, 'samples': 25088832, 'steps': 130670, 'loss/train': 1.3828505277633667} 11/07/2021 15:36:54 - INFO - __main__ - Step 130672: {'lr': 2.074932595661408e-05, 'samples': 25089024, 'steps': 130671, 'loss/train': 1.1898797750473022} 11/07/2021 15:36:55 - INFO - __main__ - Step 130673: {'lr': 2.074720924875037e-05, 'samples': 25089216, 'steps': 130672, 'loss/train': 0.6413442492485046} 11/07/2021 15:36:55 - INFO - __main__ - Step 130674: {'lr': 2.074509264418442e-05, 'samples': 25089408, 'steps': 130673, 'loss/train': 0.7774963974952698} 11/07/2021 15:36:56 - INFO - __main__ - Step 130675: {'lr': 2.074297614291726e-05, 'samples': 25089600, 'steps': 130674, 'loss/train': 1.2892147302627563} 11/07/2021 15:36:56 - INFO - __main__ - Step 130676: {'lr': 2.0740859744949713e-05, 'samples': 25089792, 'steps': 130675, 'loss/train': 1.3445719480514526} 11/07/2021 15:36:57 - INFO - __main__ - Step 130677: {'lr': 2.0738743450282816e-05, 'samples': 25089984, 'steps': 130676, 'loss/train': 1.0390973091125488} 11/07/2021 15:36:57 - INFO - __main__ - Step 130678: {'lr': 2.073662725891748e-05, 'samples': 25090176, 'steps': 130677, 'loss/train': 0.9015554785728455} 11/07/2021 15:36:57 - INFO - __main__ - Step 130679: {'lr': 2.0734511170854704e-05, 'samples': 25090368, 'steps': 130678, 'loss/train': 1.4864808320999146} 11/07/2021 15:36:58 - INFO - __main__ - Step 130680: {'lr': 2.0732395186095403e-05, 'samples': 25090560, 'steps': 130679, 'loss/train': 0.8375775814056396} 11/07/2021 15:36:59 - INFO - __main__ - Step 130681: {'lr': 2.0730279304640552e-05, 'samples': 25090752, 'steps': 130680, 'loss/train': 1.4117286205291748} 11/07/2021 15:36:59 - INFO - __main__ - Step 130682: {'lr': 2.0728163526491124e-05, 'samples': 25090944, 'steps': 130681, 'loss/train': 1.127994418144226} 11/07/2021 15:36:59 - INFO - __main__ - Step 130683: {'lr': 2.0726047851648026e-05, 'samples': 25091136, 'steps': 130682, 'loss/train': 1.537277102470398} 11/07/2021 15:37:00 - INFO - __main__ - Step 130684: {'lr': 2.072393228011221e-05, 'samples': 25091328, 'steps': 130683, 'loss/train': 1.1157413721084595} 11/07/2021 15:37:01 - INFO - __main__ - Step 130685: {'lr': 2.0721816811884704e-05, 'samples': 25091520, 'steps': 130684, 'loss/train': 0.7605574727058411} 11/07/2021 15:37:01 - INFO - __main__ - Step 130686: {'lr': 2.071970144696636e-05, 'samples': 25091712, 'steps': 130685, 'loss/train': 0.636904776096344} 11/07/2021 15:37:02 - INFO - __main__ - Step 130687: {'lr': 2.071758618535821e-05, 'samples': 25091904, 'steps': 130686, 'loss/train': 1.3998609781265259} 11/07/2021 15:37:02 - INFO - __main__ - Step 130688: {'lr': 2.07154710270612e-05, 'samples': 25092096, 'steps': 130687, 'loss/train': 1.2519869804382324} 11/07/2021 15:37:02 - INFO - __main__ - Step 130689: {'lr': 2.071335597207624e-05, 'samples': 25092288, 'steps': 130688, 'loss/train': 1.1503807306289673} 11/07/2021 15:37:04 - INFO - __main__ - Step 130690: {'lr': 2.071124102040428e-05, 'samples': 25092480, 'steps': 130689, 'loss/train': 1.5091286897659302} 11/07/2021 15:37:05 - INFO - __main__ - Step 130691: {'lr': 2.0709126172046316e-05, 'samples': 25092672, 'steps': 130690, 'loss/train': 0.9442390203475952} 11/07/2021 15:37:05 - INFO - __main__ - Step 130692: {'lr': 2.0707011427003292e-05, 'samples': 25092864, 'steps': 130691, 'loss/train': 0.9116193652153015} 11/07/2021 15:37:05 - INFO - __main__ - Step 130693: {'lr': 2.0704896785276122e-05, 'samples': 25093056, 'steps': 130692, 'loss/train': 1.4824413061141968} 11/07/2021 15:37:06 - INFO - __main__ - Step 130694: {'lr': 2.070278224686581e-05, 'samples': 25093248, 'steps': 130693, 'loss/train': 1.01716148853302} 11/07/2021 15:37:06 - INFO - __main__ - Step 130695: {'lr': 2.0700667811773266e-05, 'samples': 25093440, 'steps': 130694, 'loss/train': 0.6707388758659363} 11/07/2021 15:37:07 - INFO - __main__ - Step 130696: {'lr': 2.0698553479999467e-05, 'samples': 25093632, 'steps': 130695, 'loss/train': 0.36301499605178833} 11/07/2021 15:37:07 - INFO - __main__ - Step 130697: {'lr': 2.069643925154538e-05, 'samples': 25093824, 'steps': 130696, 'loss/train': 1.1066848039627075} 11/07/2021 15:37:08 - INFO - __main__ - Step 130698: {'lr': 2.0694325126411923e-05, 'samples': 25094016, 'steps': 130697, 'loss/train': 0.05492736026644707} 11/07/2021 15:37:08 - INFO - __main__ - Step 130699: {'lr': 2.0692211104600066e-05, 'samples': 25094208, 'steps': 130698, 'loss/train': 1.3747868537902832} 11/07/2021 15:37:09 - INFO - __main__ - Step 130700: {'lr': 2.0690097186110756e-05, 'samples': 25094400, 'steps': 130699, 'loss/train': 0.8928928375244141} 11/07/2021 15:37:10 - INFO - __main__ - Step 130701: {'lr': 2.0687983370944936e-05, 'samples': 25094592, 'steps': 130700, 'loss/train': 1.3136285543441772} 11/07/2021 15:37:10 - INFO - __main__ - Step 130702: {'lr': 2.0685869659103658e-05, 'samples': 25094784, 'steps': 130701, 'loss/train': 1.5023630857467651} 11/07/2021 15:37:10 - INFO - __main__ - Step 130703: {'lr': 2.0683756050587697e-05, 'samples': 25094976, 'steps': 130702, 'loss/train': 0.5567538142204285} 11/07/2021 15:37:11 - INFO - __main__ - Step 130704: {'lr': 2.0681642545398145e-05, 'samples': 25095168, 'steps': 130703, 'loss/train': 1.0702996253967285} 11/07/2021 15:37:11 - INFO - __main__ - Step 130705: {'lr': 2.067952914353588e-05, 'samples': 25095360, 'steps': 130704, 'loss/train': 1.9143767356872559} 11/07/2021 15:37:12 - INFO - __main__ - Step 130706: {'lr': 2.0677415845001878e-05, 'samples': 25095552, 'steps': 130705, 'loss/train': 0.685047447681427} 11/07/2021 15:37:12 - INFO - __main__ - Step 130707: {'lr': 2.0675302649797084e-05, 'samples': 25095744, 'steps': 130706, 'loss/train': 0.9841753244400024} 11/07/2021 15:37:13 - INFO - __main__ - Step 130708: {'lr': 2.0673189557922493e-05, 'samples': 25095936, 'steps': 130707, 'loss/train': 1.6286855936050415} 11/07/2021 15:37:13 - INFO - __main__ - Step 130709: {'lr': 2.0671076569379e-05, 'samples': 25096128, 'steps': 130708, 'loss/train': 1.4214156866073608} 11/07/2021 15:37:13 - INFO - __main__ - Step 130710: {'lr': 2.0668963684167597e-05, 'samples': 25096320, 'steps': 130709, 'loss/train': 0.9698418378829956} 11/07/2021 15:37:14 - INFO - __main__ - Step 130711: {'lr': 2.0666850902289203e-05, 'samples': 25096512, 'steps': 130710, 'loss/train': 0.9355164766311646} 11/07/2021 15:37:15 - INFO - __main__ - Step 130712: {'lr': 2.066473822374479e-05, 'samples': 25096704, 'steps': 130711, 'loss/train': 0.5466125011444092} 11/07/2021 15:37:15 - INFO - __main__ - Step 130713: {'lr': 2.0662625648535326e-05, 'samples': 25096896, 'steps': 130712, 'loss/train': 1.222728967666626} 11/07/2021 15:37:16 - INFO - __main__ - Step 130714: {'lr': 2.0660513176661707e-05, 'samples': 25097088, 'steps': 130713, 'loss/train': 1.383459210395813} 11/07/2021 15:37:16 - INFO - __main__ - Step 130715: {'lr': 2.0658400808125006e-05, 'samples': 25097280, 'steps': 130714, 'loss/train': 1.1575932502746582} 11/07/2021 15:37:16 - INFO - __main__ - Step 130716: {'lr': 2.0656288542926033e-05, 'samples': 25097472, 'steps': 130715, 'loss/train': 1.595782995223999} 11/07/2021 15:37:17 - INFO - __main__ - Step 130717: {'lr': 2.065417638106579e-05, 'samples': 25097664, 'steps': 130716, 'loss/train': 1.2199071645736694} 11/07/2021 15:37:18 - INFO - __main__ - Step 130718: {'lr': 2.065206432254524e-05, 'samples': 25097856, 'steps': 130717, 'loss/train': 1.489101767539978} 11/07/2021 15:37:18 - INFO - __main__ - Step 130719: {'lr': 2.064995236736533e-05, 'samples': 25098048, 'steps': 130718, 'loss/train': 1.4646297693252563} 11/07/2021 15:37:18 - INFO - __main__ - Step 130720: {'lr': 2.064784051552701e-05, 'samples': 25098240, 'steps': 130719, 'loss/train': 0.8465964794158936} 11/07/2021 15:37:19 - INFO - __main__ - Step 130721: {'lr': 2.0645728767031246e-05, 'samples': 25098432, 'steps': 130720, 'loss/train': 1.243566632270813} 11/07/2021 15:37:20 - INFO - __main__ - Step 130722: {'lr': 2.0643617121878954e-05, 'samples': 25098624, 'steps': 130721, 'loss/train': 1.2235898971557617} 11/07/2021 15:37:20 - INFO - __main__ - Step 130723: {'lr': 2.0641505580071136e-05, 'samples': 25098816, 'steps': 130722, 'loss/train': 1.7083905935287476} 11/07/2021 15:37:21 - INFO - __main__ - Step 130724: {'lr': 2.0639394141608704e-05, 'samples': 25099008, 'steps': 130723, 'loss/train': 0.7134568095207214} 11/07/2021 15:37:21 - INFO - __main__ - Step 130725: {'lr': 2.0637282806492602e-05, 'samples': 25099200, 'steps': 130724, 'loss/train': 0.9321412444114685} 11/07/2021 15:37:21 - INFO - __main__ - Step 130726: {'lr': 2.063517157472383e-05, 'samples': 25099392, 'steps': 130725, 'loss/train': 1.521357774734497} 11/07/2021 15:37:22 - INFO - __main__ - Step 130727: {'lr': 2.0633060446303308e-05, 'samples': 25099584, 'steps': 130726, 'loss/train': 1.1776043176651} 11/07/2021 15:37:23 - INFO - __main__ - Step 130728: {'lr': 2.0630949421232002e-05, 'samples': 25099776, 'steps': 130727, 'loss/train': 1.3095283508300781} 11/07/2021 15:37:23 - INFO - __main__ - Step 130729: {'lr': 2.0628838499510832e-05, 'samples': 25099968, 'steps': 130728, 'loss/train': 1.0522229671478271} 11/07/2021 15:37:23 - INFO - __main__ - Step 130730: {'lr': 2.0626727681140766e-05, 'samples': 25100160, 'steps': 130729, 'loss/train': 1.2148174047470093} 11/07/2021 15:37:24 - INFO - __main__ - Step 130731: {'lr': 2.0624616966122777e-05, 'samples': 25100352, 'steps': 130730, 'loss/train': 0.7770007252693176} 11/07/2021 15:37:25 - INFO - __main__ - Step 130732: {'lr': 2.062250635445778e-05, 'samples': 25100544, 'steps': 130731, 'loss/train': 1.1040834188461304} 11/07/2021 15:37:25 - INFO - __main__ - Step 130733: {'lr': 2.0620395846146723e-05, 'samples': 25100736, 'steps': 130732, 'loss/train': 1.2231385707855225} 11/07/2021 15:37:25 - INFO - __main__ - Step 130734: {'lr': 2.06182854411906e-05, 'samples': 25100928, 'steps': 130733, 'loss/train': 1.1777628660202026} 11/07/2021 15:37:26 - INFO - __main__ - Step 130735: {'lr': 2.0616175139590328e-05, 'samples': 25101120, 'steps': 130734, 'loss/train': 1.4364776611328125} 11/07/2021 15:37:26 - INFO - __main__ - Step 130736: {'lr': 2.061406494134688e-05, 'samples': 25101312, 'steps': 130735, 'loss/train': 1.6130127906799316} 11/07/2021 15:37:27 - INFO - __main__ - Step 130737: {'lr': 2.061195484646117e-05, 'samples': 25101504, 'steps': 130736, 'loss/train': 1.3374946117401123} 11/07/2021 15:37:28 - INFO - __main__ - Step 130738: {'lr': 2.0609844854934197e-05, 'samples': 25101696, 'steps': 130737, 'loss/train': 1.3760579824447632} 11/07/2021 15:37:28 - INFO - __main__ - Step 130739: {'lr': 2.0607734966766877e-05, 'samples': 25101888, 'steps': 130738, 'loss/train': 1.6579079627990723} 11/07/2021 15:37:28 - INFO - __main__ - Step 130740: {'lr': 2.0605625181960187e-05, 'samples': 25102080, 'steps': 130739, 'loss/train': 1.0269898176193237} 11/07/2021 15:37:29 - INFO - __main__ - Step 130741: {'lr': 2.0603515500515036e-05, 'samples': 25102272, 'steps': 130740, 'loss/train': 1.2588223218917847} 11/07/2021 15:37:30 - INFO - __main__ - Step 130742: {'lr': 2.0601405922432455e-05, 'samples': 25102464, 'steps': 130741, 'loss/train': 1.4007889032363892} 11/07/2021 15:37:30 - INFO - __main__ - Step 130743: {'lr': 2.05992964477133e-05, 'samples': 25102656, 'steps': 130742, 'loss/train': 1.1988544464111328} 11/07/2021 15:37:31 - INFO - __main__ - Step 130744: {'lr': 2.0597187076358575e-05, 'samples': 25102848, 'steps': 130743, 'loss/train': 0.9333130121231079} 11/07/2021 15:37:31 - INFO - __main__ - Step 130745: {'lr': 2.0595077808369196e-05, 'samples': 25103040, 'steps': 130744, 'loss/train': 1.0804816484451294} 11/07/2021 15:37:31 - INFO - __main__ - Step 130746: {'lr': 2.0592968643746158e-05, 'samples': 25103232, 'steps': 130745, 'loss/train': 1.4046093225479126} 11/07/2021 15:37:32 - INFO - __main__ - Step 130747: {'lr': 2.059085958249038e-05, 'samples': 25103424, 'steps': 130746, 'loss/train': 0.1276760995388031} 11/07/2021 15:37:33 - INFO - __main__ - Step 130748: {'lr': 2.0588750624602802e-05, 'samples': 25103616, 'steps': 130747, 'loss/train': 1.3795619010925293} 11/07/2021 15:37:33 - INFO - __main__ - Step 130749: {'lr': 2.05866417700844e-05, 'samples': 25103808, 'steps': 130748, 'loss/train': 1.588415503501892} 11/07/2021 15:37:34 - INFO - __main__ - Step 130750: {'lr': 2.058453301893612e-05, 'samples': 25104000, 'steps': 130749, 'loss/train': 1.38466477394104} 11/07/2021 15:37:34 - INFO - __main__ - Step 130751: {'lr': 2.0582424371158925e-05, 'samples': 25104192, 'steps': 130750, 'loss/train': 0.5933374166488647} 11/07/2021 15:37:34 - INFO - __main__ - Step 130752: {'lr': 2.0580315826753736e-05, 'samples': 25104384, 'steps': 130751, 'loss/train': 1.1815561056137085} 11/07/2021 15:37:35 - INFO - __main__ - Step 130753: {'lr': 2.0578207385721526e-05, 'samples': 25104576, 'steps': 130752, 'loss/train': 1.0767920017242432} 11/07/2021 15:37:36 - INFO - __main__ - Step 130754: {'lr': 2.0576099048063236e-05, 'samples': 25104768, 'steps': 130753, 'loss/train': 1.2405346632003784} 11/07/2021 15:37:36 - INFO - __main__ - Step 130755: {'lr': 2.0573990813779836e-05, 'samples': 25104960, 'steps': 130754, 'loss/train': 1.128066062927246} 11/07/2021 15:37:36 - INFO - __main__ - Step 130756: {'lr': 2.057188268287222e-05, 'samples': 25105152, 'steps': 130755, 'loss/train': 0.7623453736305237} 11/07/2021 15:37:37 - INFO - __main__ - Step 130757: {'lr': 2.0569774655341378e-05, 'samples': 25105344, 'steps': 130756, 'loss/train': 1.526658296585083} 11/07/2021 15:37:38 - INFO - __main__ - Step 130758: {'lr': 2.0567666731188263e-05, 'samples': 25105536, 'steps': 130757, 'loss/train': 0.9961429238319397} 11/07/2021 15:37:38 - INFO - __main__ - Step 130759: {'lr': 2.056555891041381e-05, 'samples': 25105728, 'steps': 130758, 'loss/train': 1.029394268989563} 11/07/2021 15:37:38 - INFO - __main__ - Step 130760: {'lr': 2.0563451193018974e-05, 'samples': 25105920, 'steps': 130759, 'loss/train': 1.1237064599990845} 11/07/2021 15:37:39 - INFO - __main__ - Step 130761: {'lr': 2.0561343579004716e-05, 'samples': 25106112, 'steps': 130760, 'loss/train': 1.441641926765442} 11/07/2021 15:37:39 - INFO - __main__ - Step 130762: {'lr': 2.0559236068371984e-05, 'samples': 25106304, 'steps': 130761, 'loss/train': 0.963752806186676} 11/07/2021 15:37:40 - INFO - __main__ - Step 130763: {'lr': 2.0557128661121694e-05, 'samples': 25106496, 'steps': 130762, 'loss/train': 1.406246304512024} 11/07/2021 15:37:41 - INFO - __main__ - Step 130764: {'lr': 2.0555021357254844e-05, 'samples': 25106688, 'steps': 130763, 'loss/train': 1.332947015762329} 11/07/2021 15:37:41 - INFO - __main__ - Step 130765: {'lr': 2.0552914156772323e-05, 'samples': 25106880, 'steps': 130764, 'loss/train': 0.8718979954719543} 11/07/2021 15:37:41 - INFO - __main__ - Step 130766: {'lr': 2.0550807059675157e-05, 'samples': 25107072, 'steps': 130765, 'loss/train': 0.8712929487228394} 11/07/2021 15:37:42 - INFO - __main__ - Step 130767: {'lr': 2.0548700065964238e-05, 'samples': 25107264, 'steps': 130766, 'loss/train': 1.3081040382385254} 11/07/2021 15:37:43 - INFO - __main__ - Step 130768: {'lr': 2.054659317564056e-05, 'samples': 25107456, 'steps': 130767, 'loss/train': 1.142020344734192} 11/07/2021 15:37:43 - INFO - __main__ - Step 130769: {'lr': 2.054448638870507e-05, 'samples': 25107648, 'steps': 130768, 'loss/train': 1.2741973400115967} 11/07/2021 15:37:43 - INFO - __main__ - Step 130770: {'lr': 2.0542379705158626e-05, 'samples': 25107840, 'steps': 130769, 'loss/train': 1.4922239780426025} 11/07/2021 15:37:44 - INFO - __main__ - Step 130771: {'lr': 2.0540273125002283e-05, 'samples': 25108032, 'steps': 130770, 'loss/train': 0.9284241795539856} 11/07/2021 15:37:44 - INFO - __main__ - Step 130772: {'lr': 2.0538166648236933e-05, 'samples': 25108224, 'steps': 130771, 'loss/train': 1.3357793092727661} 11/07/2021 15:37:45 - INFO - __main__ - Step 130773: {'lr': 2.0536060274863545e-05, 'samples': 25108416, 'steps': 130772, 'loss/train': 1.046773910522461} 11/07/2021 15:37:45 - INFO - __main__ - Step 130774: {'lr': 2.053395400488309e-05, 'samples': 25108608, 'steps': 130773, 'loss/train': 0.8633443117141724} 11/07/2021 15:37:46 - INFO - __main__ - Step 130775: {'lr': 2.053184783829648e-05, 'samples': 25108800, 'steps': 130774, 'loss/train': 1.1427394151687622} 11/07/2021 15:37:46 - INFO - __main__ - Step 130776: {'lr': 2.0529741775104664e-05, 'samples': 25108992, 'steps': 130775, 'loss/train': 0.8688380122184753} 11/07/2021 15:37:47 - INFO - __main__ - Step 130777: {'lr': 2.0527635815308615e-05, 'samples': 25109184, 'steps': 130776, 'loss/train': 0.8644014000892639} 11/07/2021 15:37:48 - INFO - __main__ - Step 130778: {'lr': 2.0525529958909274e-05, 'samples': 25109376, 'steps': 130777, 'loss/train': 1.3427287340164185} 11/07/2021 15:37:48 - INFO - __main__ - Step 130779: {'lr': 2.052342420590758e-05, 'samples': 25109568, 'steps': 130778, 'loss/train': 1.3036777973175049} 11/07/2021 15:37:48 - INFO - __main__ - Step 130780: {'lr': 2.0521318556304513e-05, 'samples': 25109760, 'steps': 130779, 'loss/train': 1.2306395769119263} 11/07/2021 15:37:49 - INFO - __main__ - Step 130781: {'lr': 2.0519213010100984e-05, 'samples': 25109952, 'steps': 130780, 'loss/train': 1.1886481046676636} 11/07/2021 15:37:49 - INFO - __main__ - Step 130782: {'lr': 2.0517107567297992e-05, 'samples': 25110144, 'steps': 130781, 'loss/train': 1.320886492729187} 11/07/2021 15:37:50 - INFO - __main__ - Step 130783: {'lr': 2.051500222789643e-05, 'samples': 25110336, 'steps': 130782, 'loss/train': 0.8018700480461121} 11/07/2021 15:37:51 - INFO - __main__ - Step 130784: {'lr': 2.0512896991897235e-05, 'samples': 25110528, 'steps': 130783, 'loss/train': 1.2858017683029175} 11/07/2021 15:37:51 - INFO - __main__ - Step 130785: {'lr': 2.051079185930141e-05, 'samples': 25110720, 'steps': 130784, 'loss/train': 1.1573406457901} 11/07/2021 15:37:51 - INFO - __main__ - Step 130786: {'lr': 2.050868683010987e-05, 'samples': 25110912, 'steps': 130785, 'loss/train': 0.871648907661438} 11/07/2021 15:37:52 - INFO - __main__ - Step 130787: {'lr': 2.0506581904323585e-05, 'samples': 25111104, 'steps': 130786, 'loss/train': 1.58584725856781} 11/07/2021 15:37:52 - INFO - __main__ - Step 130788: {'lr': 2.0504477081943503e-05, 'samples': 25111296, 'steps': 130787, 'loss/train': 0.8178991079330444} 11/07/2021 15:37:53 - INFO - __main__ - Step 130789: {'lr': 2.0502372362970534e-05, 'samples': 25111488, 'steps': 130788, 'loss/train': 1.3242838382720947} 11/07/2021 15:37:54 - INFO - __main__ - Step 130790: {'lr': 2.0500267747405654e-05, 'samples': 25111680, 'steps': 130789, 'loss/train': 0.8900655508041382} 11/07/2021 15:37:54 - INFO - __main__ - Step 130791: {'lr': 2.0498163235249833e-05, 'samples': 25111872, 'steps': 130790, 'loss/train': 1.6172436475753784} 11/07/2021 15:37:54 - INFO - __main__ - Step 130792: {'lr': 2.049605882650399e-05, 'samples': 25112064, 'steps': 130791, 'loss/train': 1.4765894412994385} 11/07/2021 15:37:55 - INFO - __main__ - Step 130793: {'lr': 2.0493954521169088e-05, 'samples': 25112256, 'steps': 130792, 'loss/train': 1.3939895629882812} 11/07/2021 15:37:56 - INFO - __main__ - Step 130794: {'lr': 2.0491850319246053e-05, 'samples': 25112448, 'steps': 130793, 'loss/train': 1.4492673873901367} 11/07/2021 15:37:56 - INFO - __main__ - Step 130795: {'lr': 2.048974622073588e-05, 'samples': 25112640, 'steps': 130794, 'loss/train': 1.233960509300232} 11/07/2021 15:37:57 - INFO - __main__ - Step 130796: {'lr': 2.048764222563951e-05, 'samples': 25112832, 'steps': 130795, 'loss/train': 0.2034066617488861} 11/07/2021 15:37:57 - INFO - __main__ - Step 130797: {'lr': 2.0485538333957803e-05, 'samples': 25113024, 'steps': 130796, 'loss/train': 1.2911591529846191} 11/07/2021 15:37:57 - INFO - __main__ - Step 130798: {'lr': 2.0483434545691792e-05, 'samples': 25113216, 'steps': 130797, 'loss/train': 1.2981855869293213} 11/07/2021 15:37:58 - INFO - __main__ - Step 130799: {'lr': 2.0481330860842388e-05, 'samples': 25113408, 'steps': 130798, 'loss/train': 1.1506855487823486} 11/07/2021 15:37:59 - INFO - __main__ - Step 130800: {'lr': 2.0479227279410568e-05, 'samples': 25113600, 'steps': 130799, 'loss/train': 1.261190414428711} 11/07/2021 15:37:59 - INFO - __main__ - Step 130801: {'lr': 2.047712380139727e-05, 'samples': 25113792, 'steps': 130800, 'loss/train': 1.1907426118850708} 11/07/2021 15:37:59 - INFO - __main__ - Step 130802: {'lr': 2.0475020426803437e-05, 'samples': 25113984, 'steps': 130801, 'loss/train': 1.2577710151672363} 11/07/2021 15:38:00 - INFO - __main__ - Step 130803: {'lr': 2.0472917155630017e-05, 'samples': 25114176, 'steps': 130802, 'loss/train': 1.3137640953063965} 11/07/2021 15:38:00 - INFO - __main__ - Step 130804: {'lr': 2.047081398787795e-05, 'samples': 25114368, 'steps': 130803, 'loss/train': 1.4439085721969604} 11/07/2021 15:38:02 - INFO - __main__ - Step 130805: {'lr': 2.0468710923548212e-05, 'samples': 25114560, 'steps': 130804, 'loss/train': 0.7894514203071594} 11/07/2021 15:38:02 - INFO - __main__ - Step 130806: {'lr': 2.0466607962641714e-05, 'samples': 25114752, 'steps': 130805, 'loss/train': 0.5628148317337036} 11/07/2021 15:38:03 - INFO - __main__ - Step 130807: {'lr': 2.0464505105159432e-05, 'samples': 25114944, 'steps': 130806, 'loss/train': 1.424539566040039} 11/07/2021 15:38:03 - INFO - __main__ - Step 130808: {'lr': 2.0462402351102334e-05, 'samples': 25115136, 'steps': 130807, 'loss/train': 0.5191425085067749} 11/07/2021 15:38:03 - INFO - __main__ - Step 130809: {'lr': 2.046029970047128e-05, 'samples': 25115328, 'steps': 130808, 'loss/train': 1.226043462753296} 11/07/2021 15:38:04 - INFO - __main__ - Step 130810: {'lr': 2.04581971532673e-05, 'samples': 25115520, 'steps': 130809, 'loss/train': 0.5352382063865662} 11/07/2021 15:38:05 - INFO - __main__ - Step 130811: {'lr': 2.0456094709491306e-05, 'samples': 25115712, 'steps': 130810, 'loss/train': 0.3978894054889679} 11/07/2021 15:38:05 - INFO - __main__ - Step 130812: {'lr': 2.045399236914425e-05, 'samples': 25115904, 'steps': 130811, 'loss/train': 1.3653007745742798} 11/07/2021 15:38:05 - INFO - __main__ - Step 130813: {'lr': 2.045189013222709e-05, 'samples': 25116096, 'steps': 130812, 'loss/train': 1.411268949508667} 11/07/2021 15:38:06 - INFO - __main__ - Step 130814: {'lr': 2.0449787998740756e-05, 'samples': 25116288, 'steps': 130813, 'loss/train': 1.273542881011963} 11/07/2021 15:38:06 - INFO - __main__ - Step 130815: {'lr': 2.0447685968686207e-05, 'samples': 25116480, 'steps': 130814, 'loss/train': 1.1744118928909302} 11/07/2021 15:38:07 - INFO - __main__ - Step 130816: {'lr': 2.0445584042064397e-05, 'samples': 25116672, 'steps': 130815, 'loss/train': 1.2672985792160034} 11/07/2021 15:38:08 - INFO - __main__ - Step 130817: {'lr': 2.0443482218876264e-05, 'samples': 25116864, 'steps': 130816, 'loss/train': 1.1227812767028809} 11/07/2021 15:38:08 - INFO - __main__ - Step 130818: {'lr': 2.0441380499122725e-05, 'samples': 25117056, 'steps': 130817, 'loss/train': 1.4799062013626099} 11/07/2021 15:38:08 - INFO - __main__ - Step 130819: {'lr': 2.0439278882804835e-05, 'samples': 25117248, 'steps': 130818, 'loss/train': 1.0751982927322388} 11/07/2021 15:38:09 - INFO - __main__ - Step 130820: {'lr': 2.0437177369923427e-05, 'samples': 25117440, 'steps': 130819, 'loss/train': 1.1529017686843872} 11/07/2021 15:38:10 - INFO - __main__ - Step 130821: {'lr': 2.0435075960479443e-05, 'samples': 25117632, 'steps': 130820, 'loss/train': 1.405774474143982} 11/07/2021 15:38:10 - INFO - __main__ - Step 130822: {'lr': 2.0432974654473913e-05, 'samples': 25117824, 'steps': 130821, 'loss/train': 1.0483473539352417} 11/07/2021 15:38:11 - INFO - __main__ - Step 130823: {'lr': 2.043087345190772e-05, 'samples': 25118016, 'steps': 130822, 'loss/train': 1.1126819849014282} 11/07/2021 15:38:11 - INFO - __main__ - Step 130824: {'lr': 2.0428772352781843e-05, 'samples': 25118208, 'steps': 130823, 'loss/train': 1.1858587265014648} 11/07/2021 15:38:11 - INFO - __main__ - Step 130825: {'lr': 2.042667135709722e-05, 'samples': 25118400, 'steps': 130824, 'loss/train': 1.0003374814987183} 11/07/2021 15:38:12 - INFO - __main__ - Step 130826: {'lr': 2.0424570464854797e-05, 'samples': 25118592, 'steps': 130825, 'loss/train': 0.06842698156833649} 11/07/2021 15:38:13 - INFO - __main__ - Step 130827: {'lr': 2.0422469676055516e-05, 'samples': 25118784, 'steps': 130826, 'loss/train': 1.588634729385376} 11/07/2021 15:38:13 - INFO - __main__ - Step 130828: {'lr': 2.042036899070032e-05, 'samples': 25118976, 'steps': 130827, 'loss/train': 1.4454994201660156} 11/07/2021 15:38:13 - INFO - __main__ - Step 130829: {'lr': 2.041826840879016e-05, 'samples': 25119168, 'steps': 130828, 'loss/train': 0.5742722153663635} 11/07/2021 15:38:14 - INFO - __main__ - Step 130830: {'lr': 2.0416167930326053e-05, 'samples': 25119360, 'steps': 130829, 'loss/train': 1.2423129081726074} 11/07/2021 15:38:15 - INFO - __main__ - Step 130831: {'lr': 2.0414067555308808e-05, 'samples': 25119552, 'steps': 130830, 'loss/train': 0.974649965763092} 11/07/2021 15:38:15 - INFO - __main__ - Step 130832: {'lr': 2.0411967283739453e-05, 'samples': 25119744, 'steps': 130831, 'loss/train': 1.0136653184890747} 11/07/2021 15:38:16 - INFO - __main__ - Step 130833: {'lr': 2.040986711561893e-05, 'samples': 25119936, 'steps': 130832, 'loss/train': 1.1213451623916626} 11/07/2021 15:38:16 - INFO - __main__ - Step 130834: {'lr': 2.0407767050948155e-05, 'samples': 25120128, 'steps': 130833, 'loss/train': 1.2839435338974} 11/07/2021 15:38:16 - INFO - __main__ - Step 130835: {'lr': 2.040566708972813e-05, 'samples': 25120320, 'steps': 130834, 'loss/train': 1.1556237936019897} 11/07/2021 15:38:17 - INFO - __main__ - Step 130836: {'lr': 2.040356723195974e-05, 'samples': 25120512, 'steps': 130835, 'loss/train': 1.433265209197998} 11/07/2021 15:38:18 - INFO - __main__ - Step 130837: {'lr': 2.0401467477643987e-05, 'samples': 25120704, 'steps': 130836, 'loss/train': 1.1882598400115967} 11/07/2021 15:38:18 - INFO - __main__ - Step 130838: {'lr': 2.0399367826781755e-05, 'samples': 25120896, 'steps': 130837, 'loss/train': 0.8466320633888245} 11/07/2021 15:38:18 - INFO - __main__ - Step 130839: {'lr': 2.039726827937405e-05, 'samples': 25121088, 'steps': 130838, 'loss/train': 1.2168978452682495} 11/07/2021 15:38:19 - INFO - __main__ - Step 130840: {'lr': 2.039516883542178e-05, 'samples': 25121280, 'steps': 130839, 'loss/train': 1.519142985343933} 11/07/2021 15:38:20 - INFO - __main__ - Step 130841: {'lr': 2.0393069494925974e-05, 'samples': 25121472, 'steps': 130840, 'loss/train': 1.30730402469635} 11/07/2021 15:38:20 - INFO - __main__ - Step 130842: {'lr': 2.039097025788744e-05, 'samples': 25121664, 'steps': 130841, 'loss/train': 1.3855177164077759} 11/07/2021 15:38:21 - INFO - __main__ - Step 130843: {'lr': 2.038887112430718e-05, 'samples': 25121856, 'steps': 130842, 'loss/train': 1.386150598526001} 11/07/2021 15:38:21 - INFO - __main__ - Step 130844: {'lr': 2.0386772094186184e-05, 'samples': 25122048, 'steps': 130843, 'loss/train': 1.3747961521148682} 11/07/2021 15:38:21 - INFO - __main__ - Step 130845: {'lr': 2.0384673167525347e-05, 'samples': 25122240, 'steps': 130844, 'loss/train': 1.2165164947509766} 11/07/2021 15:38:23 - INFO - __main__ - Step 130846: {'lr': 2.0382574344325638e-05, 'samples': 25122432, 'steps': 130845, 'loss/train': 1.388514518737793} 11/07/2021 15:38:23 - INFO - __main__ - Step 130847: {'lr': 2.0380475624588e-05, 'samples': 25122624, 'steps': 130846, 'loss/train': 1.4000935554504395} 11/07/2021 15:38:23 - INFO - __main__ - Step 130848: {'lr': 2.0378377008313352e-05, 'samples': 25122816, 'steps': 130847, 'loss/train': 1.1829262971878052} 11/07/2021 15:38:24 - INFO - __main__ - Step 130849: {'lr': 2.0376278495502693e-05, 'samples': 25123008, 'steps': 130848, 'loss/train': 0.7870945930480957} 11/07/2021 15:38:24 - INFO - __main__ - Step 130850: {'lr': 2.0374180086156936e-05, 'samples': 25123200, 'steps': 130849, 'loss/train': 1.1455163955688477} 11/07/2021 15:38:24 - INFO - __main__ - Step 130851: {'lr': 2.0372081780277053e-05, 'samples': 25123392, 'steps': 130850, 'loss/train': 1.5602082014083862} 11/07/2021 15:38:25 - INFO - __main__ - Step 130852: {'lr': 2.036998357786396e-05, 'samples': 25123584, 'steps': 130851, 'loss/train': 0.7088173031806946} 11/07/2021 15:38:26 - INFO - __main__ - Step 130853: {'lr': 2.0367885478918575e-05, 'samples': 25123776, 'steps': 130852, 'loss/train': 1.1485390663146973} 11/07/2021 15:38:26 - INFO - __main__ - Step 130854: {'lr': 2.0365787483441895e-05, 'samples': 25123968, 'steps': 130853, 'loss/train': 0.6069055199623108} 11/07/2021 15:38:26 - INFO - __main__ - Step 130855: {'lr': 2.0363689591434836e-05, 'samples': 25124160, 'steps': 130854, 'loss/train': 0.9730549454689026} 11/07/2021 15:38:27 - INFO - __main__ - Step 130856: {'lr': 2.036159180289837e-05, 'samples': 25124352, 'steps': 130855, 'loss/train': 1.3881075382232666} 11/07/2021 15:38:28 - INFO - __main__ - Step 130857: {'lr': 2.0359494117833417e-05, 'samples': 25124544, 'steps': 130856, 'loss/train': 1.4000396728515625} 11/07/2021 15:38:28 - INFO - __main__ - Step 130858: {'lr': 2.035739653624094e-05, 'samples': 25124736, 'steps': 130857, 'loss/train': 0.5601944923400879} 11/07/2021 15:38:28 - INFO - __main__ - Step 130859: {'lr': 2.035529905812186e-05, 'samples': 25124928, 'steps': 130858, 'loss/train': 0.989176332950592} 11/07/2021 15:38:29 - INFO - __main__ - Step 130860: {'lr': 2.0353201683477153e-05, 'samples': 25125120, 'steps': 130859, 'loss/train': 0.6151838898658752} 11/07/2021 15:38:29 - INFO - __main__ - Step 130861: {'lr': 2.0351104412307754e-05, 'samples': 25125312, 'steps': 130860, 'loss/train': 1.1100809574127197} 11/07/2021 15:38:30 - INFO - __main__ - Step 130862: {'lr': 2.034900724461458e-05, 'samples': 25125504, 'steps': 130861, 'loss/train': 1.4202722311019897} 11/07/2021 15:38:31 - INFO - __main__ - Step 130863: {'lr': 2.0346910180398665e-05, 'samples': 25125696, 'steps': 130862, 'loss/train': 0.8085333704948425} 11/07/2021 15:38:31 - INFO - __main__ - Step 130864: {'lr': 2.0344813219660835e-05, 'samples': 25125888, 'steps': 130863, 'loss/train': 1.2756421566009521} 11/07/2021 15:38:31 - INFO - __main__ - Step 130865: {'lr': 2.0342716362402092e-05, 'samples': 25126080, 'steps': 130864, 'loss/train': 1.180293321609497} 11/07/2021 15:38:32 - INFO - __main__ - Step 130866: {'lr': 2.034061960862338e-05, 'samples': 25126272, 'steps': 130865, 'loss/train': 1.1397145986557007} 11/07/2021 15:38:33 - INFO - __main__ - Step 130867: {'lr': 2.0338522958325638e-05, 'samples': 25126464, 'steps': 130866, 'loss/train': 1.270484447479248} 11/07/2021 15:38:33 - INFO - __main__ - Step 130868: {'lr': 2.0336426411509817e-05, 'samples': 25126656, 'steps': 130867, 'loss/train': 0.9848131537437439} 11/07/2021 15:38:33 - INFO - __main__ - Step 130869: {'lr': 2.0334329968176855e-05, 'samples': 25126848, 'steps': 130868, 'loss/train': 0.7940070629119873} 11/07/2021 15:38:34 - INFO - __main__ - Step 130870: {'lr': 2.03322336283277e-05, 'samples': 25127040, 'steps': 130869, 'loss/train': 1.2072231769561768} 11/07/2021 15:38:34 - INFO - __main__ - Step 130871: {'lr': 2.0330137391963295e-05, 'samples': 25127232, 'steps': 130870, 'loss/train': 0.7949463725090027} 11/07/2021 15:38:35 - INFO - __main__ - Step 130872: {'lr': 2.0328041259084578e-05, 'samples': 25127424, 'steps': 130871, 'loss/train': 1.2754653692245483} 11/07/2021 15:38:36 - INFO - __main__ - Step 130873: {'lr': 2.0325945229692527e-05, 'samples': 25127616, 'steps': 130872, 'loss/train': 1.0204256772994995} 11/07/2021 15:38:36 - INFO - __main__ - Step 130874: {'lr': 2.032384930378803e-05, 'samples': 25127808, 'steps': 130873, 'loss/train': 0.533597469329834} 11/07/2021 15:38:36 - INFO - __main__ - Step 130875: {'lr': 2.032175348137208e-05, 'samples': 25128000, 'steps': 130874, 'loss/train': 1.240552544593811} 11/07/2021 15:38:37 - INFO - __main__ - Step 130876: {'lr': 2.0319657762445652e-05, 'samples': 25128192, 'steps': 130875, 'loss/train': 1.6246024370193481} 11/07/2021 15:38:38 - INFO - __main__ - Step 130877: {'lr': 2.0317562147009584e-05, 'samples': 25128384, 'steps': 130876, 'loss/train': 1.2331901788711548} 11/07/2021 15:38:38 - INFO - __main__ - Step 130878: {'lr': 2.0315466635064893e-05, 'samples': 25128576, 'steps': 130877, 'loss/train': 1.1985846757888794} 11/07/2021 15:38:38 - INFO - __main__ - Step 130879: {'lr': 2.0313371226612503e-05, 'samples': 25128768, 'steps': 130878, 'loss/train': 1.3747676610946655} 11/07/2021 15:38:39 - INFO - __main__ - Step 130880: {'lr': 2.0311275921653356e-05, 'samples': 25128960, 'steps': 130879, 'loss/train': 0.8138599395751953} 11/07/2021 15:38:39 - INFO - __main__ - Step 130881: {'lr': 2.030918072018842e-05, 'samples': 25129152, 'steps': 130880, 'loss/train': 1.3351951837539673} 11/07/2021 15:38:40 - INFO - __main__ - Step 130882: {'lr': 2.0307085622218585e-05, 'samples': 25129344, 'steps': 130881, 'loss/train': 1.2942078113555908} 11/07/2021 15:38:40 - INFO - __main__ - Step 130883: {'lr': 2.0304990627744878e-05, 'samples': 25129536, 'steps': 130882, 'loss/train': 1.38664972782135} 11/07/2021 15:38:41 - INFO - __main__ - Step 130884: {'lr': 2.030289573676816e-05, 'samples': 25129728, 'steps': 130883, 'loss/train': 1.4067476987838745} 11/07/2021 15:38:41 - INFO - __main__ - Step 130885: {'lr': 2.0300800949289462e-05, 'samples': 25129920, 'steps': 130884, 'loss/train': 1.4209340810775757} 11/07/2021 15:38:42 - INFO - __main__ - Step 130886: {'lr': 2.0298706265309634e-05, 'samples': 25130112, 'steps': 130885, 'loss/train': 1.2939515113830566} 11/07/2021 15:38:42 - INFO - __main__ - Step 130887: {'lr': 2.0296611684829687e-05, 'samples': 25130304, 'steps': 130886, 'loss/train': 1.0923155546188354} 11/07/2021 15:38:43 - INFO - __main__ - Step 130888: {'lr': 2.029451720785053e-05, 'samples': 25130496, 'steps': 130887, 'loss/train': 0.9242063760757446} 11/07/2021 15:38:43 - INFO - __main__ - Step 130889: {'lr': 2.029242283437313e-05, 'samples': 25130688, 'steps': 130888, 'loss/train': 1.404264211654663} 11/07/2021 15:38:44 - INFO - __main__ - Step 130890: {'lr': 2.029032856439844e-05, 'samples': 25130880, 'steps': 130889, 'loss/train': 1.2285345792770386} 11/07/2021 15:38:44 - INFO - __main__ - Step 130891: {'lr': 2.0288234397927374e-05, 'samples': 25131072, 'steps': 130890, 'loss/train': 0.8559148907661438} 11/07/2021 15:38:45 - INFO - __main__ - Step 130892: {'lr': 2.0286140334960844e-05, 'samples': 25131264, 'steps': 130891, 'loss/train': 0.720786988735199} 11/07/2021 15:38:45 - INFO - __main__ - Step 130893: {'lr': 2.028404637549988e-05, 'samples': 25131456, 'steps': 130892, 'loss/train': 1.308122992515564} 11/07/2021 15:38:46 - INFO - __main__ - Step 130894: {'lr': 2.0281952519545343e-05, 'samples': 25131648, 'steps': 130893, 'loss/train': 1.6721041202545166} 11/07/2021 15:38:46 - INFO - __main__ - Step 130895: {'lr': 2.0279858767098232e-05, 'samples': 25131840, 'steps': 130894, 'loss/train': 1.0614843368530273} 11/07/2021 15:38:46 - INFO - __main__ - Step 130896: {'lr': 2.0277765118159485e-05, 'samples': 25132032, 'steps': 130895, 'loss/train': 1.4596788883209229} 11/07/2021 15:38:47 - INFO - __main__ - Step 130897: {'lr': 2.0275671572729998e-05, 'samples': 25132224, 'steps': 130896, 'loss/train': 1.1195670366287231} 11/07/2021 15:38:48 - INFO - __main__ - Step 130898: {'lr': 2.0273578130810766e-05, 'samples': 25132416, 'steps': 130897, 'loss/train': 1.1136398315429688} 11/07/2021 15:38:48 - INFO - __main__ - Step 130899: {'lr': 2.0271484792402734e-05, 'samples': 25132608, 'steps': 130898, 'loss/train': 1.1373553276062012} 11/07/2021 15:38:49 - INFO - __main__ - Step 130900: {'lr': 2.0269391557506787e-05, 'samples': 25132800, 'steps': 130899, 'loss/train': 1.4328322410583496} 11/07/2021 15:38:49 - INFO - __main__ - Step 130901: {'lr': 2.0267298426123932e-05, 'samples': 25132992, 'steps': 130900, 'loss/train': 0.5231042504310608} 11/07/2021 15:38:49 - INFO - __main__ - Step 130902: {'lr': 2.0265205398255075e-05, 'samples': 25133184, 'steps': 130901, 'loss/train': 1.1907463073730469} 11/07/2021 15:38:50 - INFO - __main__ - Step 130903: {'lr': 2.0263112473901224e-05, 'samples': 25133376, 'steps': 130902, 'loss/train': 1.112740159034729} 11/07/2021 15:38:51 - INFO - __main__ - Step 130904: {'lr': 2.0261019653063232e-05, 'samples': 25133568, 'steps': 130903, 'loss/train': 1.4721497297286987} 11/07/2021 15:38:51 - INFO - __main__ - Step 130905: {'lr': 2.025892693574208e-05, 'samples': 25133760, 'steps': 130904, 'loss/train': 1.388861894607544} 11/07/2021 15:38:51 - INFO - __main__ - Step 130906: {'lr': 2.02568343219387e-05, 'samples': 25133952, 'steps': 130905, 'loss/train': 0.8779199719429016} 11/07/2021 15:38:52 - INFO - __main__ - Step 130907: {'lr': 2.025474181165404e-05, 'samples': 25134144, 'steps': 130906, 'loss/train': 1.5494005680084229} 11/07/2021 15:38:53 - INFO - __main__ - Step 130908: {'lr': 2.025264940488905e-05, 'samples': 25134336, 'steps': 130907, 'loss/train': 1.2747743129730225} 11/07/2021 15:38:53 - INFO - __main__ - Step 130909: {'lr': 2.0250557101644697e-05, 'samples': 25134528, 'steps': 130908, 'loss/train': 1.2126902341842651} 11/07/2021 15:38:54 - INFO - __main__ - Step 130910: {'lr': 2.0248464901921864e-05, 'samples': 25134720, 'steps': 130909, 'loss/train': 1.1413477659225464} 11/07/2021 15:38:54 - INFO - __main__ - Step 130911: {'lr': 2.024637280572153e-05, 'samples': 25134912, 'steps': 130910, 'loss/train': 1.2398295402526855} 11/07/2021 15:38:54 - INFO - __main__ - Step 130912: {'lr': 2.0244280813044663e-05, 'samples': 25135104, 'steps': 130911, 'loss/train': 1.058511734008789} 11/07/2021 15:38:55 - INFO - __main__ - Step 130913: {'lr': 2.0242188923892152e-05, 'samples': 25135296, 'steps': 130912, 'loss/train': 0.9685015678405762} 11/07/2021 15:38:56 - INFO - __main__ - Step 130914: {'lr': 2.0240097138264967e-05, 'samples': 25135488, 'steps': 130913, 'loss/train': 0.9510534405708313} 11/07/2021 15:38:56 - INFO - __main__ - Step 130915: {'lr': 2.0238005456164056e-05, 'samples': 25135680, 'steps': 130914, 'loss/train': 2.2410902976989746} 11/07/2021 15:38:56 - INFO - __main__ - Step 130916: {'lr': 2.0235913877590355e-05, 'samples': 25135872, 'steps': 130915, 'loss/train': 1.115096926689148} 11/07/2021 15:38:57 - INFO - __main__ - Step 130917: {'lr': 2.0233822402544842e-05, 'samples': 25136064, 'steps': 130916, 'loss/train': 1.496054768562317} 11/07/2021 15:38:58 - INFO - __main__ - Step 130918: {'lr': 2.0231731031028405e-05, 'samples': 25136256, 'steps': 130917, 'loss/train': 1.2767843008041382} 11/07/2021 15:38:58 - INFO - __main__ - Step 130919: {'lr': 2.0229639763041984e-05, 'samples': 25136448, 'steps': 130918, 'loss/train': 1.1509203910827637} 11/07/2021 15:38:59 - INFO - __main__ - Step 130920: {'lr': 2.0227548598586553e-05, 'samples': 25136640, 'steps': 130919, 'loss/train': 1.2024093866348267} 11/07/2021 15:38:59 - INFO - __main__ - Step 130921: {'lr': 2.0225457537663027e-05, 'samples': 25136832, 'steps': 130920, 'loss/train': 1.2189579010009766} 11/07/2021 15:38:59 - INFO - __main__ - Step 130922: {'lr': 2.0223366580272352e-05, 'samples': 25137024, 'steps': 130921, 'loss/train': 0.6073693633079529} 11/07/2021 15:39:00 - INFO - __main__ - Step 130923: {'lr': 2.0221275726415524e-05, 'samples': 25137216, 'steps': 130922, 'loss/train': 1.2251683473587036} 11/07/2021 15:39:01 - INFO - __main__ - Step 130924: {'lr': 2.0219184976093403e-05, 'samples': 25137408, 'steps': 130923, 'loss/train': 0.6898264288902283} 11/07/2021 15:39:01 - INFO - __main__ - Step 130925: {'lr': 2.021709432930699e-05, 'samples': 25137600, 'steps': 130924, 'loss/train': 0.8414203524589539} 11/07/2021 15:39:01 - INFO - __main__ - Step 130926: {'lr': 2.02150037860572e-05, 'samples': 25137792, 'steps': 130925, 'loss/train': 1.4029593467712402} 11/07/2021 15:39:02 - INFO - __main__ - Step 130927: {'lr': 2.021291334634501e-05, 'samples': 25137984, 'steps': 130926, 'loss/train': 1.0391350984573364} 11/07/2021 15:39:03 - INFO - __main__ - Step 130928: {'lr': 2.0210823010171296e-05, 'samples': 25138176, 'steps': 130927, 'loss/train': 1.3195890188217163} 11/07/2021 15:39:03 - INFO - __main__ - Step 130929: {'lr': 2.020873277753707e-05, 'samples': 25138368, 'steps': 130928, 'loss/train': 1.2639474868774414} 11/07/2021 15:39:03 - INFO - __main__ - Step 130930: {'lr': 2.020664264844327e-05, 'samples': 25138560, 'steps': 130929, 'loss/train': 1.297163963317871} 11/07/2021 15:39:04 - INFO - __main__ - Step 130931: {'lr': 2.0204552622890782e-05, 'samples': 25138752, 'steps': 130930, 'loss/train': 1.3307549953460693} 11/07/2021 15:39:04 - INFO - __main__ - Step 130932: {'lr': 2.020246270088058e-05, 'samples': 25138944, 'steps': 130931, 'loss/train': 0.9776944518089294} 11/07/2021 15:39:05 - INFO - __main__ - Step 130933: {'lr': 2.0200372882413582e-05, 'samples': 25139136, 'steps': 130932, 'loss/train': 1.407707691192627} 11/07/2021 15:39:05 - INFO - __main__ - Step 130934: {'lr': 2.0198283167490756e-05, 'samples': 25139328, 'steps': 130933, 'loss/train': 1.4739571809768677} 11/07/2021 15:39:06 - INFO - __main__ - Step 130935: {'lr': 2.0196193556113046e-05, 'samples': 25139520, 'steps': 130934, 'loss/train': 1.254456877708435} 11/07/2021 15:39:06 - INFO - __main__ - Step 130936: {'lr': 2.019410404828137e-05, 'samples': 25139712, 'steps': 130935, 'loss/train': 1.3366882801055908} 11/07/2021 15:39:07 - INFO - __main__ - Step 130937: {'lr': 2.0192014643996698e-05, 'samples': 25139904, 'steps': 130936, 'loss/train': 1.0375202894210815} 11/07/2021 15:39:07 - INFO - __main__ - Step 130938: {'lr': 2.0189925343259946e-05, 'samples': 25140096, 'steps': 130937, 'loss/train': 1.212952733039856} 11/07/2021 15:39:08 - INFO - __main__ - Step 130939: {'lr': 2.0187836146072087e-05, 'samples': 25140288, 'steps': 130938, 'loss/train': 1.4541093111038208} 11/07/2021 15:39:09 - INFO - __main__ - Step 130940: {'lr': 2.0185747052434034e-05, 'samples': 25140480, 'steps': 130939, 'loss/train': 1.9325048923492432} 11/07/2021 15:39:09 - INFO - __main__ - Step 130941: {'lr': 2.0183658062346734e-05, 'samples': 25140672, 'steps': 130940, 'loss/train': 1.4819163084030151} 11/07/2021 15:39:09 - INFO - __main__ - Step 130942: {'lr': 2.0181569175811126e-05, 'samples': 25140864, 'steps': 130941, 'loss/train': 1.0871853828430176} 11/07/2021 15:39:10 - INFO - __main__ - Step 130943: {'lr': 2.017948039282816e-05, 'samples': 25141056, 'steps': 130942, 'loss/train': 1.318535327911377} 11/07/2021 15:39:11 - INFO - __main__ - Step 130944: {'lr': 2.0177391713398802e-05, 'samples': 25141248, 'steps': 130943, 'loss/train': 1.4397978782653809} 11/07/2021 15:39:11 - INFO - __main__ - Step 130945: {'lr': 2.0175303137523942e-05, 'samples': 25141440, 'steps': 130944, 'loss/train': 1.427672266960144} 11/07/2021 15:39:11 - INFO - __main__ - Step 130946: {'lr': 2.0173214665204552e-05, 'samples': 25141632, 'steps': 130945, 'loss/train': 1.0143802165985107} 11/07/2021 15:39:12 - INFO - __main__ - Step 130947: {'lr': 2.017112629644155e-05, 'samples': 25141824, 'steps': 130946, 'loss/train': 1.2649867534637451} 11/07/2021 15:39:12 - INFO - __main__ - Step 130948: {'lr': 2.0169038031235902e-05, 'samples': 25142016, 'steps': 130947, 'loss/train': 0.9978233575820923} 11/07/2021 15:39:13 - INFO - __main__ - Step 130949: {'lr': 2.016694986958853e-05, 'samples': 25142208, 'steps': 130948, 'loss/train': 1.1634259223937988} 11/07/2021 15:39:13 - INFO - __main__ - Step 130950: {'lr': 2.0164861811500373e-05, 'samples': 25142400, 'steps': 130949, 'loss/train': 1.2807798385620117} 11/07/2021 15:39:14 - INFO - __main__ - Step 130951: {'lr': 2.0162773856972403e-05, 'samples': 25142592, 'steps': 130950, 'loss/train': 1.4780683517456055} 11/07/2021 15:39:14 - INFO - __main__ - Step 130952: {'lr': 2.016068600600554e-05, 'samples': 25142784, 'steps': 130951, 'loss/train': 1.4422942399978638} 11/07/2021 15:39:14 - INFO - __main__ - Step 130953: {'lr': 2.0158598258600726e-05, 'samples': 25142976, 'steps': 130952, 'loss/train': 1.337608814239502} 11/07/2021 15:39:15 - INFO - __main__ - Step 130954: {'lr': 2.015651061475887e-05, 'samples': 25143168, 'steps': 130953, 'loss/train': 1.1975284814834595} 11/07/2021 15:39:16 - INFO - __main__ - Step 130955: {'lr': 2.0154423074480984e-05, 'samples': 25143360, 'steps': 130954, 'loss/train': 1.070703148841858} 11/07/2021 15:39:16 - INFO - __main__ - Step 130956: {'lr': 2.0152335637767944e-05, 'samples': 25143552, 'steps': 130955, 'loss/train': 0.6909329295158386} 11/07/2021 15:39:16 - INFO - __main__ - Step 130957: {'lr': 2.0150248304620783e-05, 'samples': 25143744, 'steps': 130956, 'loss/train': 1.4290759563446045} 11/07/2021 15:39:17 - INFO - __main__ - Step 130958: {'lr': 2.0148161075040307e-05, 'samples': 25143936, 'steps': 130957, 'loss/train': 0.8931061625480652} 11/07/2021 15:39:18 - INFO - __main__ - Step 130959: {'lr': 2.0146073949027538e-05, 'samples': 25144128, 'steps': 130958, 'loss/train': 1.5231136083602905} 11/07/2021 15:39:18 - INFO - __main__ - Step 130960: {'lr': 2.014398692658337e-05, 'samples': 25144320, 'steps': 130959, 'loss/train': 0.7670860886573792} 11/07/2021 15:39:18 - INFO - __main__ - Step 130961: {'lr': 2.0141900007708797e-05, 'samples': 25144512, 'steps': 130960, 'loss/train': 1.2839727401733398} 11/07/2021 15:39:19 - INFO - __main__ - Step 130962: {'lr': 2.013981319240474e-05, 'samples': 25144704, 'steps': 130961, 'loss/train': 1.1028565168380737} 11/07/2021 15:39:19 - INFO - __main__ - Step 130963: {'lr': 2.0137726480672136e-05, 'samples': 25144896, 'steps': 130962, 'loss/train': 1.4350467920303345} 11/07/2021 15:39:20 - INFO - __main__ - Step 130964: {'lr': 2.0135639872511936e-05, 'samples': 25145088, 'steps': 130963, 'loss/train': 1.2840852737426758} 11/07/2021 15:39:21 - INFO - __main__ - Step 130965: {'lr': 2.0133553367925052e-05, 'samples': 25145280, 'steps': 130964, 'loss/train': 1.3299437761306763} 11/07/2021 15:39:21 - INFO - __main__ - Step 130966: {'lr': 2.0131466966912425e-05, 'samples': 25145472, 'steps': 130965, 'loss/train': 1.1275198459625244} 11/07/2021 15:39:21 - INFO - __main__ - Step 130967: {'lr': 2.0129380669475034e-05, 'samples': 25145664, 'steps': 130966, 'loss/train': 1.764291763305664} 11/07/2021 15:39:22 - INFO - __main__ - Step 130968: {'lr': 2.0127294475613818e-05, 'samples': 25145856, 'steps': 130967, 'loss/train': 0.9977788925170898} 11/07/2021 15:39:23 - INFO - __main__ - Step 130969: {'lr': 2.0125208385329664e-05, 'samples': 25146048, 'steps': 130968, 'loss/train': 1.0649608373641968} 11/07/2021 15:39:23 - INFO - __main__ - Step 130970: {'lr': 2.01231223986236e-05, 'samples': 25146240, 'steps': 130969, 'loss/train': 1.0767326354980469} 11/07/2021 15:39:23 - INFO - __main__ - Step 130971: {'lr': 2.012103651549646e-05, 'samples': 25146432, 'steps': 130970, 'loss/train': 1.536558985710144} 11/07/2021 15:39:24 - INFO - __main__ - Step 130972: {'lr': 2.0118950735949243e-05, 'samples': 25146624, 'steps': 130971, 'loss/train': 1.135366439819336} 11/07/2021 15:39:24 - INFO - __main__ - Step 130973: {'lr': 2.0116865059982863e-05, 'samples': 25146816, 'steps': 130972, 'loss/train': 1.521307110786438} 11/07/2021 15:39:25 - INFO - __main__ - Step 130974: {'lr': 2.0114779487598296e-05, 'samples': 25147008, 'steps': 130973, 'loss/train': 1.5502928495407104} 11/07/2021 15:39:25 - INFO - __main__ - Step 130975: {'lr': 2.0112694018796452e-05, 'samples': 25147200, 'steps': 130974, 'loss/train': 1.3778750896453857} 11/07/2021 15:39:26 - INFO - __main__ - Step 130976: {'lr': 2.0110608653578278e-05, 'samples': 25147392, 'steps': 130975, 'loss/train': 1.20766282081604} 11/07/2021 15:39:26 - INFO - __main__ - Step 130977: {'lr': 2.0108523391944715e-05, 'samples': 25147584, 'steps': 130976, 'loss/train': 1.4747366905212402} 11/07/2021 15:39:27 - INFO - __main__ - Step 130978: {'lr': 2.0106438233896712e-05, 'samples': 25147776, 'steps': 130977, 'loss/train': 1.2649540901184082} 11/07/2021 15:39:28 - INFO - __main__ - Step 130979: {'lr': 2.010435317943518e-05, 'samples': 25147968, 'steps': 130978, 'loss/train': 1.0722131729125977} 11/07/2021 15:39:28 - INFO - __main__ - Step 130980: {'lr': 2.0102268228561122e-05, 'samples': 25148160, 'steps': 130979, 'loss/train': 1.3894519805908203} 11/07/2021 15:39:28 - INFO - __main__ - Step 130981: {'lr': 2.0100183381275396e-05, 'samples': 25148352, 'steps': 130980, 'loss/train': 1.5436006784439087} 11/07/2021 15:39:29 - INFO - __main__ - Step 130982: {'lr': 2.0098098637579e-05, 'samples': 25148544, 'steps': 130981, 'loss/train': 0.9564161896705627} 11/07/2021 15:39:29 - INFO - __main__ - Step 130983: {'lr': 2.0096013997472823e-05, 'samples': 25148736, 'steps': 130982, 'loss/train': 1.0762444734573364} 11/07/2021 15:39:29 - INFO - __main__ - Step 130984: {'lr': 2.0093929460957922e-05, 'samples': 25148928, 'steps': 130983, 'loss/train': 2.279697895050049} 11/07/2021 15:39:30 - INFO - __main__ - Step 130985: {'lr': 2.0091845028035072e-05, 'samples': 25149120, 'steps': 130984, 'loss/train': 0.9571191668510437} 11/07/2021 15:39:31 - INFO - __main__ - Step 130986: {'lr': 2.00897606987053e-05, 'samples': 25149312, 'steps': 130985, 'loss/train': 1.400718331336975} 11/07/2021 15:39:31 - INFO - __main__ - Step 130987: {'lr': 2.0087676472969552e-05, 'samples': 25149504, 'steps': 130986, 'loss/train': 1.1869325637817383} 11/07/2021 15:39:31 - INFO - __main__ - Step 130988: {'lr': 2.008559235082871e-05, 'samples': 25149696, 'steps': 130987, 'loss/train': 1.2132948637008667} 11/07/2021 15:39:32 - INFO - __main__ - Step 130989: {'lr': 2.0083508332283785e-05, 'samples': 25149888, 'steps': 130988, 'loss/train': 0.04587046429514885} 11/07/2021 15:39:33 - INFO - __main__ - Step 130990: {'lr': 2.008142441733568e-05, 'samples': 25150080, 'steps': 130989, 'loss/train': 0.533982515335083} 11/07/2021 15:39:33 - INFO - __main__ - Step 130991: {'lr': 2.007934060598532e-05, 'samples': 25150272, 'steps': 130990, 'loss/train': 1.5154134035110474} 11/07/2021 15:39:34 - INFO - __main__ - Step 130992: {'lr': 2.007725689823367e-05, 'samples': 25150464, 'steps': 130991, 'loss/train': 1.5181400775909424} 11/07/2021 15:39:34 - INFO - __main__ - Step 130993: {'lr': 2.007517329408165e-05, 'samples': 25150656, 'steps': 130992, 'loss/train': 1.087639570236206} 11/07/2021 15:39:34 - INFO - __main__ - Step 130994: {'lr': 2.007308979353023e-05, 'samples': 25150848, 'steps': 130993, 'loss/train': 1.269538402557373} 11/07/2021 15:39:35 - INFO - __main__ - Step 130995: {'lr': 2.0071006396580326e-05, 'samples': 25151040, 'steps': 130994, 'loss/train': 1.0564076900482178} 11/07/2021 15:39:36 - INFO - __main__ - Step 130996: {'lr': 2.0068923103232855e-05, 'samples': 25151232, 'steps': 130995, 'loss/train': 1.0849964618682861} 11/07/2021 15:39:36 - INFO - __main__ - Step 130997: {'lr': 2.0066839913488844e-05, 'samples': 25151424, 'steps': 130996, 'loss/train': 1.1687036752700806} 11/07/2021 15:39:36 - INFO - __main__ - Step 130998: {'lr': 2.0064756827349122e-05, 'samples': 25151616, 'steps': 130997, 'loss/train': 1.0704721212387085} 11/07/2021 15:39:37 - INFO - __main__ - Step 130999: {'lr': 2.0062673844814666e-05, 'samples': 25151808, 'steps': 130998, 'loss/train': 1.3343522548675537} 11/07/2021 15:39:38 - INFO - __main__ - Step 131000: {'lr': 2.0060590965886417e-05, 'samples': 25152000, 'steps': 130999, 'loss/train': 1.5118223428726196} 11/07/2021 15:39:38 - INFO - __main__ - Step 131001: {'lr': 2.0058508190565316e-05, 'samples': 25152192, 'steps': 131000, 'loss/train': 0.7014312148094177} 11/07/2021 15:39:38 - INFO - __main__ - Step 131002: {'lr': 2.005642551885231e-05, 'samples': 25152384, 'steps': 131001, 'loss/train': 0.92454993724823} 11/07/2021 15:39:39 - INFO - __main__ - Step 131003: {'lr': 2.0054342950748344e-05, 'samples': 25152576, 'steps': 131002, 'loss/train': 1.3830331563949585} 11/07/2021 15:39:39 - INFO - __main__ - Step 131004: {'lr': 2.0052260486254332e-05, 'samples': 25152768, 'steps': 131003, 'loss/train': 1.4055626392364502} 11/07/2021 15:39:40 - INFO - __main__ - Step 131005: {'lr': 2.0050178125371216e-05, 'samples': 25152960, 'steps': 131004, 'loss/train': 0.8407405018806458} 11/07/2021 15:39:41 - INFO - __main__ - Step 131006: {'lr': 2.0048095868099942e-05, 'samples': 25153152, 'steps': 131005, 'loss/train': 1.4116549491882324} 11/07/2021 15:39:41 - INFO - __main__ - Step 131007: {'lr': 2.0046013714441452e-05, 'samples': 25153344, 'steps': 131006, 'loss/train': 1.2435495853424072} 11/07/2021 15:39:41 - INFO - __main__ - Step 131008: {'lr': 2.004393166439669e-05, 'samples': 25153536, 'steps': 131007, 'loss/train': 1.2829502820968628} 11/07/2021 15:39:42 - INFO - __main__ - Step 131009: {'lr': 2.0041849717966575e-05, 'samples': 25153728, 'steps': 131008, 'loss/train': 1.5940543413162231} 11/07/2021 15:39:43 - INFO - __main__ - Step 131010: {'lr': 2.003976787515205e-05, 'samples': 25153920, 'steps': 131009, 'loss/train': 1.2194344997406006} 11/07/2021 15:39:43 - INFO - __main__ - Step 131011: {'lr': 2.003768613595411e-05, 'samples': 25154112, 'steps': 131010, 'loss/train': 1.1251872777938843} 11/07/2021 15:39:43 - INFO - __main__ - Step 131012: {'lr': 2.003560450037359e-05, 'samples': 25154304, 'steps': 131011, 'loss/train': 1.167799949645996} 11/07/2021 15:39:44 - INFO - __main__ - Step 131013: {'lr': 2.003352296841146e-05, 'samples': 25154496, 'steps': 131012, 'loss/train': 0.07536815851926804} 11/07/2021 15:39:44 - INFO - __main__ - Step 131014: {'lr': 2.0031441540068697e-05, 'samples': 25154688, 'steps': 131013, 'loss/train': 1.287489414215088} 11/07/2021 15:39:45 - INFO - __main__ - Step 131015: {'lr': 2.002936021534621e-05, 'samples': 25154880, 'steps': 131014, 'loss/train': 0.5921758413314819} 11/07/2021 15:39:46 - INFO - __main__ - Step 131016: {'lr': 2.0027278994244946e-05, 'samples': 25155072, 'steps': 131015, 'loss/train': 1.3640670776367188} 11/07/2021 15:39:46 - INFO - __main__ - Step 131017: {'lr': 2.0025197876765848e-05, 'samples': 25155264, 'steps': 131016, 'loss/train': 0.9590416550636292} 11/07/2021 15:39:46 - INFO - __main__ - Step 131018: {'lr': 2.0023116862909836e-05, 'samples': 25155456, 'steps': 131017, 'loss/train': 1.1666004657745361} 11/07/2021 15:39:47 - INFO - __main__ - Step 131019: {'lr': 2.0021035952677874e-05, 'samples': 25155648, 'steps': 131018, 'loss/train': 0.8832435011863708} 11/07/2021 15:39:48 - INFO - __main__ - Step 131020: {'lr': 2.0018955146070882e-05, 'samples': 25155840, 'steps': 131019, 'loss/train': 1.1721537113189697} 11/07/2021 15:39:48 - INFO - __main__ - Step 131021: {'lr': 2.0016874443089806e-05, 'samples': 25156032, 'steps': 131020, 'loss/train': 0.9487287998199463} 11/07/2021 15:39:48 - INFO - __main__ - Step 131022: {'lr': 2.0014793843735557e-05, 'samples': 25156224, 'steps': 131021, 'loss/train': 1.3003332614898682} 11/07/2021 15:39:49 - INFO - __main__ - Step 131023: {'lr': 2.001271334800911e-05, 'samples': 25156416, 'steps': 131022, 'loss/train': 1.552507758140564} 11/07/2021 15:39:49 - INFO - __main__ - Step 131024: {'lr': 2.0010632955911408e-05, 'samples': 25156608, 'steps': 131023, 'loss/train': 1.416717767715454} 11/07/2021 15:39:50 - INFO - __main__ - Step 131025: {'lr': 2.0008552667443337e-05, 'samples': 25156800, 'steps': 131024, 'loss/train': 1.3231306076049805} 11/07/2021 15:39:51 - INFO - __main__ - Step 131026: {'lr': 2.000647248260587e-05, 'samples': 25156992, 'steps': 131025, 'loss/train': 1.236931562423706} 11/07/2021 15:39:51 - INFO - __main__ - Step 131027: {'lr': 2.0004392401399924e-05, 'samples': 25157184, 'steps': 131026, 'loss/train': 0.6382936835289001} 11/07/2021 15:39:51 - INFO - __main__ - Step 131028: {'lr': 2.000231242382644e-05, 'samples': 25157376, 'steps': 131027, 'loss/train': 0.9901456832885742} 11/07/2021 15:39:52 - INFO - __main__ - Step 131029: {'lr': 2.0000232549886393e-05, 'samples': 25157568, 'steps': 131028, 'loss/train': 1.1633180379867554} 11/07/2021 15:39:53 - INFO - __main__ - Step 131030: {'lr': 1.999815277958067e-05, 'samples': 25157760, 'steps': 131029, 'loss/train': 1.4575165510177612} 11/07/2021 15:39:53 - INFO - __main__ - Step 131031: {'lr': 1.999607311291024e-05, 'samples': 25157952, 'steps': 131030, 'loss/train': 1.3790478706359863} 11/07/2021 15:39:53 - INFO - __main__ - Step 131032: {'lr': 1.999399354987605e-05, 'samples': 25158144, 'steps': 131031, 'loss/train': 0.9144032001495361} 11/07/2021 15:39:54 - INFO - __main__ - Step 131033: {'lr': 1.9991914090478984e-05, 'samples': 25158336, 'steps': 131032, 'loss/train': 1.2295325994491577} 11/07/2021 15:39:54 - INFO - __main__ - Step 131034: {'lr': 1.998983473472002e-05, 'samples': 25158528, 'steps': 131033, 'loss/train': 1.2463159561157227} 11/07/2021 15:39:55 - INFO - __main__ - Step 131035: {'lr': 1.9987755482600094e-05, 'samples': 25158720, 'steps': 131034, 'loss/train': 1.6036823987960815} 11/07/2021 15:39:55 - INFO - __main__ - Step 131036: {'lr': 1.998567633412013e-05, 'samples': 25158912, 'steps': 131035, 'loss/train': 1.575805425643921} 11/07/2021 15:39:56 - INFO - __main__ - Step 131037: {'lr': 1.9983597289281092e-05, 'samples': 25159104, 'steps': 131036, 'loss/train': 0.921076774597168} 11/07/2021 15:39:56 - INFO - __main__ - Step 131038: {'lr': 1.998151834808393e-05, 'samples': 25159296, 'steps': 131037, 'loss/train': 1.077651858329773} 11/07/2021 15:39:57 - INFO - __main__ - Step 131039: {'lr': 1.997943951052947e-05, 'samples': 25159488, 'steps': 131038, 'loss/train': 1.1641643047332764} 11/07/2021 15:39:58 - INFO - __main__ - Step 131040: {'lr': 1.997736077661877e-05, 'samples': 25159680, 'steps': 131039, 'loss/train': 0.7115543484687805} 11/07/2021 15:39:58 - INFO - __main__ - Step 131041: {'lr': 1.9975282146352693e-05, 'samples': 25159872, 'steps': 131040, 'loss/train': 1.1676143407821655} 11/07/2021 15:39:58 - INFO - __main__ - Step 131042: {'lr': 1.9973203619732207e-05, 'samples': 25160064, 'steps': 131041, 'loss/train': 1.4323053359985352} 11/07/2021 15:39:59 - INFO - __main__ - Step 131043: {'lr': 1.9971125196758257e-05, 'samples': 25160256, 'steps': 131042, 'loss/train': 1.1492018699645996} 11/07/2021 15:39:59 - INFO - __main__ - Step 131044: {'lr': 1.9969046877431758e-05, 'samples': 25160448, 'steps': 131043, 'loss/train': 1.007479190826416} 11/07/2021 15:39:59 - INFO - __main__ - Step 131045: {'lr': 1.996696866175368e-05, 'samples': 25160640, 'steps': 131044, 'loss/train': 0.7684197425842285} 11/07/2021 15:40:01 - INFO - __main__ - Step 131046: {'lr': 1.9964890549724917e-05, 'samples': 25160832, 'steps': 131045, 'loss/train': 1.6039687395095825} 11/07/2021 15:40:01 - INFO - __main__ - Step 131047: {'lr': 1.9962812541346408e-05, 'samples': 25161024, 'steps': 131046, 'loss/train': 1.303106665611267} 11/07/2021 15:40:02 - INFO - __main__ - Step 131048: {'lr': 1.9960734636619128e-05, 'samples': 25161216, 'steps': 131047, 'loss/train': 0.9768722057342529} 11/07/2021 15:40:02 - INFO - __main__ - Step 131049: {'lr': 1.9958656835543988e-05, 'samples': 25161408, 'steps': 131048, 'loss/train': 1.25710928440094} 11/07/2021 15:40:02 - INFO - __main__ - Step 131050: {'lr': 1.9956579138121933e-05, 'samples': 25161600, 'steps': 131049, 'loss/train': 1.238063931465149} 11/07/2021 15:40:03 - INFO - __main__ - Step 131051: {'lr': 1.9954501544353936e-05, 'samples': 25161792, 'steps': 131050, 'loss/train': 0.12979260087013245} 11/07/2021 15:40:04 - INFO - __main__ - Step 131052: {'lr': 1.995242405424083e-05, 'samples': 25161984, 'steps': 131051, 'loss/train': 1.2702199220657349} 11/07/2021 15:40:04 - INFO - __main__ - Step 131053: {'lr': 1.9950346667783642e-05, 'samples': 25162176, 'steps': 131052, 'loss/train': 2.176255941390991} 11/07/2021 15:40:04 - INFO - __main__ - Step 131054: {'lr': 1.994826938498326e-05, 'samples': 25162368, 'steps': 131053, 'loss/train': 0.9088658690452576} 11/07/2021 15:40:05 - INFO - __main__ - Step 131055: {'lr': 1.9946192205840624e-05, 'samples': 25162560, 'steps': 131054, 'loss/train': 1.1968072652816772} 11/07/2021 15:40:05 - INFO - __main__ - Step 131056: {'lr': 1.994411513035671e-05, 'samples': 25162752, 'steps': 131055, 'loss/train': 1.1577237844467163} 11/07/2021 15:40:06 - INFO - __main__ - Step 131057: {'lr': 1.9942038158532405e-05, 'samples': 25162944, 'steps': 131056, 'loss/train': 1.2438822984695435} 11/07/2021 15:40:07 - INFO - __main__ - Step 131058: {'lr': 1.993996129036868e-05, 'samples': 25163136, 'steps': 131057, 'loss/train': 1.0349394083023071} 11/07/2021 15:40:07 - INFO - __main__ - Step 131059: {'lr': 1.993788452586645e-05, 'samples': 25163328, 'steps': 131058, 'loss/train': 1.0941835641860962} 11/07/2021 15:40:07 - INFO - __main__ - Step 131060: {'lr': 1.993580786502669e-05, 'samples': 25163520, 'steps': 131059, 'loss/train': 1.2401816844940186} 11/07/2021 15:40:08 - INFO - __main__ - Step 131061: {'lr': 1.9933731307850283e-05, 'samples': 25163712, 'steps': 131060, 'loss/train': 1.1840767860412598} 11/07/2021 15:40:09 - INFO - __main__ - Step 131062: {'lr': 1.9931654854338178e-05, 'samples': 25163904, 'steps': 131061, 'loss/train': 1.1241260766983032} 11/07/2021 15:40:09 - INFO - __main__ - Step 131063: {'lr': 1.9929578504491315e-05, 'samples': 25164096, 'steps': 131062, 'loss/train': 1.705159068107605} 11/07/2021 15:40:09 - INFO - __main__ - Step 131064: {'lr': 1.9927502258310636e-05, 'samples': 25164288, 'steps': 131063, 'loss/train': 1.1979821920394897} 11/07/2021 15:40:10 - INFO - __main__ - Step 131065: {'lr': 1.9925426115797148e-05, 'samples': 25164480, 'steps': 131064, 'loss/train': 0.8965454697608948} 11/07/2021 15:40:10 - INFO - __main__ - Step 131066: {'lr': 1.9923350076951645e-05, 'samples': 25164672, 'steps': 131065, 'loss/train': 0.9923006892204285} 11/07/2021 15:40:11 - INFO - __main__ - Step 131067: {'lr': 1.9921274141775135e-05, 'samples': 25164864, 'steps': 131066, 'loss/train': 1.2623956203460693} 11/07/2021 15:40:11 - INFO - __main__ - Step 131068: {'lr': 1.9919198310268533e-05, 'samples': 25165056, 'steps': 131067, 'loss/train': 0.8778603076934814} 11/07/2021 15:40:12 - INFO - __main__ - Step 131069: {'lr': 1.9917122582432807e-05, 'samples': 25165248, 'steps': 131068, 'loss/train': 0.42144742608070374} 11/07/2021 15:40:12 - INFO - __main__ - Step 131070: {'lr': 1.9915046958268872e-05, 'samples': 25165440, 'steps': 131069, 'loss/train': 1.1190214157104492} 11/07/2021 15:40:12 - INFO - __main__ - Step 131071: {'lr': 1.9912971437777677e-05, 'samples': 25165632, 'steps': 131070, 'loss/train': 1.1195013523101807} 11/07/2021 15:40:14 - INFO - __main__ - Step 131072: {'lr': 1.9910896020960134e-05, 'samples': 25165824, 'steps': 131071, 'loss/train': 0.7151570916175842} 11/07/2021 15:40:14 - INFO - __main__ - Step 131073: {'lr': 1.9908820707817187e-05, 'samples': 25166016, 'steps': 131072, 'loss/train': 0.9290691018104553} 11/07/2021 15:40:14 - INFO - __main__ - Step 131074: {'lr': 1.990674549834978e-05, 'samples': 25166208, 'steps': 131073, 'loss/train': 1.6413960456848145} 11/07/2021 15:40:15 - INFO - __main__ - Step 131075: {'lr': 1.9904670392558833e-05, 'samples': 25166400, 'steps': 131074, 'loss/train': 0.9425440430641174} 11/07/2021 15:40:15 - INFO - __main__ - Step 131076: {'lr': 1.990259539044531e-05, 'samples': 25166592, 'steps': 131075, 'loss/train': 0.9360536336898804} 11/07/2021 15:40:16 - INFO - __main__ - Step 131077: {'lr': 1.990052049201016e-05, 'samples': 25166784, 'steps': 131076, 'loss/train': 1.2637383937835693} 11/07/2021 15:40:16 - INFO - __main__ - Step 131078: {'lr': 1.989844569725424e-05, 'samples': 25166976, 'steps': 131077, 'loss/train': 0.9146126508712769} 11/07/2021 15:40:17 - INFO - __main__ - Step 131079: {'lr': 1.9896371006178525e-05, 'samples': 25167168, 'steps': 131078, 'loss/train': 1.5646275281906128} 11/07/2021 15:40:17 - INFO - __main__ - Step 131080: {'lr': 1.9894296418783958e-05, 'samples': 25167360, 'steps': 131079, 'loss/train': 0.7517291307449341} 11/07/2021 15:40:17 - INFO - __main__ - Step 131081: {'lr': 1.989222193507148e-05, 'samples': 25167552, 'steps': 131080, 'loss/train': 1.5493797063827515} 11/07/2021 15:40:18 - INFO - __main__ - Step 131082: {'lr': 1.989014755504201e-05, 'samples': 25167744, 'steps': 131081, 'loss/train': 1.4671008586883545} 11/07/2021 15:40:19 - INFO - __main__ - Step 131083: {'lr': 1.988807327869649e-05, 'samples': 25167936, 'steps': 131082, 'loss/train': 0.9913336634635925} 11/07/2021 15:40:19 - INFO - __main__ - Step 131084: {'lr': 1.9885999106035863e-05, 'samples': 25168128, 'steps': 131083, 'loss/train': 0.964820384979248} 11/07/2021 15:40:20 - INFO - __main__ - Step 131085: {'lr': 1.9883925037061045e-05, 'samples': 25168320, 'steps': 131084, 'loss/train': 1.3188031911849976} 11/07/2021 15:40:20 - INFO - __main__ - Step 131086: {'lr': 1.9881851071772984e-05, 'samples': 25168512, 'steps': 131085, 'loss/train': 1.1893165111541748} 11/07/2021 15:40:21 - INFO - __main__ - Step 131087: {'lr': 1.9879777210172645e-05, 'samples': 25168704, 'steps': 131086, 'loss/train': 1.5343084335327148} 11/07/2021 15:40:21 - INFO - __main__ - Step 131088: {'lr': 1.9877703452260865e-05, 'samples': 25168896, 'steps': 131087, 'loss/train': 0.8774803280830383} 11/07/2021 15:40:22 - INFO - __main__ - Step 131089: {'lr': 1.987562979803867e-05, 'samples': 25169088, 'steps': 131088, 'loss/train': 1.402564525604248} 11/07/2021 15:40:22 - INFO - __main__ - Step 131090: {'lr': 1.9873556247506946e-05, 'samples': 25169280, 'steps': 131089, 'loss/train': 1.3327116966247559} 11/07/2021 15:40:22 - INFO - __main__ - Step 131091: {'lr': 1.9871482800666667e-05, 'samples': 25169472, 'steps': 131090, 'loss/train': 1.3032689094543457} 11/07/2021 15:40:23 - INFO - __main__ - Step 131092: {'lr': 1.986940945751875e-05, 'samples': 25169664, 'steps': 131091, 'loss/train': 1.0045527219772339} 11/07/2021 15:40:24 - INFO - __main__ - Step 131093: {'lr': 1.986733621806411e-05, 'samples': 25169856, 'steps': 131092, 'loss/train': 1.4830690622329712} 11/07/2021 15:40:24 - INFO - __main__ - Step 131094: {'lr': 1.9865263082303687e-05, 'samples': 25170048, 'steps': 131093, 'loss/train': 1.5983951091766357} 11/07/2021 15:40:24 - INFO - __main__ - Step 131095: {'lr': 1.9863190050238455e-05, 'samples': 25170240, 'steps': 131094, 'loss/train': 1.0174614191055298} 11/07/2021 15:40:25 - INFO - __main__ - Step 131096: {'lr': 1.9861117121869276e-05, 'samples': 25170432, 'steps': 131095, 'loss/train': 1.6659760475158691} 11/07/2021 15:40:25 - INFO - __main__ - Step 131097: {'lr': 1.9859044297197177e-05, 'samples': 25170624, 'steps': 131096, 'loss/train': 1.2127269506454468} 11/07/2021 15:40:26 - INFO - __main__ - Step 131098: {'lr': 1.9856971576223043e-05, 'samples': 25170816, 'steps': 131097, 'loss/train': 1.5615720748901367} 11/07/2021 15:40:26 - INFO - __main__ - Step 131099: {'lr': 1.9854898958947792e-05, 'samples': 25171008, 'steps': 131098, 'loss/train': 1.2629727125167847} 11/07/2021 15:40:27 - INFO - __main__ - Step 131100: {'lr': 1.985282644537234e-05, 'samples': 25171200, 'steps': 131099, 'loss/train': 1.2745906114578247} 11/07/2021 15:40:27 - INFO - __main__ - Step 131101: {'lr': 1.9850754035497688e-05, 'samples': 25171392, 'steps': 131100, 'loss/train': 1.6379801034927368} 11/07/2021 15:40:27 - INFO - __main__ - Step 131102: {'lr': 1.984868172932472e-05, 'samples': 25171584, 'steps': 131101, 'loss/train': 1.7965202331542969} 11/07/2021 15:40:28 - INFO - __main__ - Step 131103: {'lr': 1.984660952685438e-05, 'samples': 25171776, 'steps': 131102, 'loss/train': 0.9421381950378418} 11/07/2021 15:40:29 - INFO - __main__ - Step 131104: {'lr': 1.9844537428087616e-05, 'samples': 25171968, 'steps': 131103, 'loss/train': 1.1112902164459229} 11/07/2021 15:40:29 - INFO - __main__ - Step 131105: {'lr': 1.9842465433025343e-05, 'samples': 25172160, 'steps': 131104, 'loss/train': 1.1780799627304077} 11/07/2021 15:40:29 - INFO - __main__ - Step 131106: {'lr': 1.98403935416685e-05, 'samples': 25172352, 'steps': 131105, 'loss/train': 1.243576169013977} 11/07/2021 15:40:30 - INFO - __main__ - Step 131107: {'lr': 1.9838321754018034e-05, 'samples': 25172544, 'steps': 131106, 'loss/train': 1.350975751876831} 11/07/2021 15:40:31 - INFO - __main__ - Step 131108: {'lr': 1.983625007007486e-05, 'samples': 25172736, 'steps': 131107, 'loss/train': 0.9549513459205627} 11/07/2021 15:40:31 - INFO - __main__ - Step 131109: {'lr': 1.9834178489839984e-05, 'samples': 25172928, 'steps': 131108, 'loss/train': 1.387062668800354} 11/07/2021 15:40:32 - INFO - __main__ - Step 131110: {'lr': 1.9832107013314228e-05, 'samples': 25173120, 'steps': 131109, 'loss/train': 1.1241371631622314} 11/07/2021 15:40:32 - INFO - __main__ - Step 131111: {'lr': 1.983003564049854e-05, 'samples': 25173312, 'steps': 131110, 'loss/train': 1.3102598190307617} 11/07/2021 15:40:32 - INFO - __main__ - Step 131112: {'lr': 1.982796437139392e-05, 'samples': 25173504, 'steps': 131111, 'loss/train': 1.5484068393707275} 11/07/2021 15:40:34 - INFO - __main__ - Step 131113: {'lr': 1.9825893206001256e-05, 'samples': 25173696, 'steps': 131112, 'loss/train': 1.5127447843551636} 11/07/2021 15:40:34 - INFO - __main__ - Step 131114: {'lr': 1.982382214432149e-05, 'samples': 25173888, 'steps': 131113, 'loss/train': 1.12498140335083} 11/07/2021 15:40:34 - INFO - __main__ - Step 131115: {'lr': 1.982175118635557e-05, 'samples': 25174080, 'steps': 131114, 'loss/train': 0.11080775409936905} 11/07/2021 15:40:35 - INFO - __main__ - Step 131116: {'lr': 1.9819680332104405e-05, 'samples': 25174272, 'steps': 131115, 'loss/train': 1.0212563276290894} 11/07/2021 15:40:35 - INFO - __main__ - Step 131117: {'lr': 1.9817609581568945e-05, 'samples': 25174464, 'steps': 131116, 'loss/train': 1.2912088632583618} 11/07/2021 15:40:36 - INFO - __main__ - Step 131118: {'lr': 1.9815538934750105e-05, 'samples': 25174656, 'steps': 131117, 'loss/train': 1.1550742387771606} 11/07/2021 15:40:36 - INFO - __main__ - Step 131119: {'lr': 1.9813468391648853e-05, 'samples': 25174848, 'steps': 131118, 'loss/train': 0.059879738837480545} 11/07/2021 15:40:37 - INFO - __main__ - Step 131120: {'lr': 1.9811397952266135e-05, 'samples': 25175040, 'steps': 131119, 'loss/train': 1.4299077987670898} 11/07/2021 15:40:37 - INFO - __main__ - Step 131121: {'lr': 1.9809327616602785e-05, 'samples': 25175232, 'steps': 131120, 'loss/train': 0.8454110026359558} 11/07/2021 15:40:37 - INFO - __main__ - Step 131122: {'lr': 1.9807257384659828e-05, 'samples': 25175424, 'steps': 131121, 'loss/train': 1.2074404954910278} 11/07/2021 15:40:39 - INFO - __main__ - Step 131123: {'lr': 1.980518725643815e-05, 'samples': 25175616, 'steps': 131122, 'loss/train': 1.2730779647827148} 11/07/2021 15:40:39 - INFO - __main__ - Step 131124: {'lr': 1.980311723193873e-05, 'samples': 25175808, 'steps': 131123, 'loss/train': 1.2109416723251343} 11/07/2021 15:40:39 - INFO - __main__ - Step 131125: {'lr': 1.9801047311162447e-05, 'samples': 25176000, 'steps': 131124, 'loss/train': 1.2174583673477173} 11/07/2021 15:40:40 - INFO - __main__ - Step 131126: {'lr': 1.9798977494110275e-05, 'samples': 25176192, 'steps': 131125, 'loss/train': 1.1469203233718872} 11/07/2021 15:40:40 - INFO - __main__ - Step 131127: {'lr': 1.9796907780783108e-05, 'samples': 25176384, 'steps': 131126, 'loss/train': 0.5126351118087769} 11/07/2021 15:40:41 - INFO - __main__ - Step 131128: {'lr': 1.9794838171181912e-05, 'samples': 25176576, 'steps': 131127, 'loss/train': 0.6012584567070007} 11/07/2021 15:40:41 - INFO - __main__ - Step 131129: {'lr': 1.979276866530763e-05, 'samples': 25176768, 'steps': 131128, 'loss/train': 1.4932005405426025} 11/07/2021 15:40:42 - INFO - __main__ - Step 131130: {'lr': 1.9790699263161154e-05, 'samples': 25176960, 'steps': 131129, 'loss/train': 1.6224302053451538} 11/07/2021 15:40:42 - INFO - __main__ - Step 131131: {'lr': 1.9788629964743454e-05, 'samples': 25177152, 'steps': 131130, 'loss/train': 1.0298266410827637} 11/07/2021 15:40:42 - INFO - __main__ - Step 131132: {'lr': 1.9786560770055472e-05, 'samples': 25177344, 'steps': 131131, 'loss/train': 1.5890672206878662} 11/07/2021 15:40:44 - INFO - __main__ - Step 131133: {'lr': 1.978449167909807e-05, 'samples': 25177536, 'steps': 131132, 'loss/train': 1.0564987659454346} 11/07/2021 15:40:44 - INFO - __main__ - Step 131134: {'lr': 1.978242269187222e-05, 'samples': 25177728, 'steps': 131133, 'loss/train': 0.5529794096946716} 11/07/2021 15:40:44 - INFO - __main__ - Step 131135: {'lr': 1.9780353808378866e-05, 'samples': 25177920, 'steps': 131134, 'loss/train': 1.087312936782837} 11/07/2021 15:40:45 - INFO - __main__ - Step 131136: {'lr': 1.977828502861895e-05, 'samples': 25178112, 'steps': 131135, 'loss/train': 1.4100682735443115} 11/07/2021 15:40:45 - INFO - __main__ - Step 131137: {'lr': 1.9776216352593357e-05, 'samples': 25178304, 'steps': 131136, 'loss/train': 1.1858253479003906} 11/07/2021 15:40:46 - INFO - __main__ - Step 131138: {'lr': 1.9774147780303064e-05, 'samples': 25178496, 'steps': 131137, 'loss/train': 1.3499882221221924} 11/07/2021 15:40:46 - INFO - __main__ - Step 131139: {'lr': 1.9772079311748985e-05, 'samples': 25178688, 'steps': 131138, 'loss/train': 0.6619004607200623} 11/07/2021 15:40:47 - INFO - __main__ - Step 131140: {'lr': 1.9770010946932036e-05, 'samples': 25178880, 'steps': 131139, 'loss/train': 1.157644510269165} 11/07/2021 15:40:47 - INFO - __main__ - Step 131141: {'lr': 1.976794268585319e-05, 'samples': 25179072, 'steps': 131140, 'loss/train': 1.197974443435669} 11/07/2021 15:40:47 - INFO - __main__ - Step 131142: {'lr': 1.9765874528513362e-05, 'samples': 25179264, 'steps': 131141, 'loss/train': 1.2141969203948975} 11/07/2021 15:40:48 - INFO - __main__ - Step 131143: {'lr': 1.9763806474913466e-05, 'samples': 25179456, 'steps': 131142, 'loss/train': 1.6906887292861938} 11/07/2021 15:40:49 - INFO - __main__ - Step 131144: {'lr': 1.9761738525054445e-05, 'samples': 25179648, 'steps': 131143, 'loss/train': 1.34765625} 11/07/2021 15:40:49 - INFO - __main__ - Step 131145: {'lr': 1.9759670678937275e-05, 'samples': 25179840, 'steps': 131144, 'loss/train': 1.3629188537597656} 11/07/2021 15:40:49 - INFO - __main__ - Step 131146: {'lr': 1.9757602936562813e-05, 'samples': 25180032, 'steps': 131145, 'loss/train': 0.5341352820396423} 11/07/2021 15:40:50 - INFO - __main__ - Step 131147: {'lr': 1.9755535297932004e-05, 'samples': 25180224, 'steps': 131146, 'loss/train': 1.3063626289367676} 11/07/2021 15:40:50 - INFO - __main__ - Step 131148: {'lr': 1.975346776304582e-05, 'samples': 25180416, 'steps': 131147, 'loss/train': 1.4192153215408325} 11/07/2021 15:40:51 - INFO - __main__ - Step 131149: {'lr': 1.9751400331905146e-05, 'samples': 25180608, 'steps': 131148, 'loss/train': 1.3417706489562988} 11/07/2021 15:40:52 - INFO - __main__ - Step 131150: {'lr': 1.9749333004510956e-05, 'samples': 25180800, 'steps': 131149, 'loss/train': 1.1248260736465454} 11/07/2021 15:40:52 - INFO - __main__ - Step 131151: {'lr': 1.9747265780864137e-05, 'samples': 25180992, 'steps': 131150, 'loss/train': 1.0080503225326538} 11/07/2021 15:40:52 - INFO - __main__ - Step 131152: {'lr': 1.9745198660965663e-05, 'samples': 25181184, 'steps': 131151, 'loss/train': 1.2182515859603882} 11/07/2021 15:40:53 - INFO - __main__ - Step 131153: {'lr': 1.9743131644816474e-05, 'samples': 25181376, 'steps': 131152, 'loss/train': 1.3030073642730713} 11/07/2021 15:40:54 - INFO - __main__ - Step 131154: {'lr': 1.9741064732417434e-05, 'samples': 25181568, 'steps': 131153, 'loss/train': 0.8505106568336487} 11/07/2021 15:40:54 - INFO - __main__ - Step 131155: {'lr': 1.9738997923769542e-05, 'samples': 25181760, 'steps': 131154, 'loss/train': 0.3229740560054779} 11/07/2021 15:40:54 - INFO - __main__ - Step 131156: {'lr': 1.973693121887371e-05, 'samples': 25181952, 'steps': 131155, 'loss/train': 1.3044934272766113} 11/07/2021 15:40:55 - INFO - __main__ - Step 131157: {'lr': 1.9734864617730857e-05, 'samples': 25182144, 'steps': 131156, 'loss/train': 0.12109696120023727} 11/07/2021 15:40:55 - INFO - __main__ - Step 131158: {'lr': 1.9732798120341928e-05, 'samples': 25182336, 'steps': 131157, 'loss/train': 1.208219051361084} 11/07/2021 15:40:56 - INFO - __main__ - Step 131159: {'lr': 1.9730731726707864e-05, 'samples': 25182528, 'steps': 131158, 'loss/train': 1.1250885725021362} 11/07/2021 15:40:56 - INFO - __main__ - Step 131160: {'lr': 1.9728665436829552e-05, 'samples': 25182720, 'steps': 131159, 'loss/train': 1.5409318208694458} 11/07/2021 15:40:57 - INFO - __main__ - Step 131161: {'lr': 1.9726599250707965e-05, 'samples': 25182912, 'steps': 131160, 'loss/train': 0.7929918169975281} 11/07/2021 15:40:57 - INFO - __main__ - Step 131162: {'lr': 1.9724533168343994e-05, 'samples': 25183104, 'steps': 131161, 'loss/train': 1.3054602146148682} 11/07/2021 15:40:58 - INFO - __main__ - Step 131163: {'lr': 1.972246718973861e-05, 'samples': 25183296, 'steps': 131162, 'loss/train': 1.1783066987991333} 11/07/2021 15:40:59 - INFO - __main__ - Step 131164: {'lr': 1.9720401314892722e-05, 'samples': 25183488, 'steps': 131163, 'loss/train': 1.238623857498169} 11/07/2021 15:40:59 - INFO - __main__ - Step 131165: {'lr': 1.971833554380728e-05, 'samples': 25183680, 'steps': 131164, 'loss/train': 0.8909867405891418} 11/07/2021 15:40:59 - INFO - __main__ - Step 131166: {'lr': 1.9716269876483173e-05, 'samples': 25183872, 'steps': 131165, 'loss/train': 1.3000948429107666} 11/07/2021 15:41:00 - INFO - __main__ - Step 131167: {'lr': 1.9714204312921397e-05, 'samples': 25184064, 'steps': 131166, 'loss/train': 0.7355786561965942} 11/07/2021 15:41:00 - INFO - __main__ - Step 131168: {'lr': 1.971213885312284e-05, 'samples': 25184256, 'steps': 131167, 'loss/train': 0.6198503971099854} 11/07/2021 15:41:01 - INFO - __main__ - Step 131169: {'lr': 1.971007349708842e-05, 'samples': 25184448, 'steps': 131168, 'loss/train': 1.1966888904571533} 11/07/2021 15:41:01 - INFO - __main__ - Step 131170: {'lr': 1.970800824481911e-05, 'samples': 25184640, 'steps': 131169, 'loss/train': 0.769079864025116} 11/07/2021 15:41:02 - INFO - __main__ - Step 131171: {'lr': 1.9705943096315793e-05, 'samples': 25184832, 'steps': 131170, 'loss/train': 1.2631117105484009} 11/07/2021 15:41:02 - INFO - __main__ - Step 131172: {'lr': 1.970387805157947e-05, 'samples': 25185024, 'steps': 131171, 'loss/train': 1.0520070791244507} 11/07/2021 15:41:02 - INFO - __main__ - Step 131173: {'lr': 1.9701813110611004e-05, 'samples': 25185216, 'steps': 131172, 'loss/train': 1.2404783964157104} 11/07/2021 15:41:04 - INFO - __main__ - Step 131174: {'lr': 1.9699748273411338e-05, 'samples': 25185408, 'steps': 131173, 'loss/train': 1.0663617849349976} 11/07/2021 15:41:04 - INFO - __main__ - Step 131175: {'lr': 1.9697683539981413e-05, 'samples': 25185600, 'steps': 131174, 'loss/train': 1.1013346910476685} 11/07/2021 15:41:04 - INFO - __main__ - Step 131176: {'lr': 1.969561891032215e-05, 'samples': 25185792, 'steps': 131175, 'loss/train': 1.5213900804519653} 11/07/2021 15:41:05 - INFO - __main__ - Step 131177: {'lr': 1.9693554384434487e-05, 'samples': 25185984, 'steps': 131176, 'loss/train': 1.2439533472061157} 11/07/2021 15:41:05 - INFO - __main__ - Step 131178: {'lr': 1.969148996231937e-05, 'samples': 25186176, 'steps': 131177, 'loss/train': 1.1976226568222046} 11/07/2021 15:41:06 - INFO - __main__ - Step 131179: {'lr': 1.9689425643977686e-05, 'samples': 25186368, 'steps': 131178, 'loss/train': 0.8417842388153076} 11/07/2021 15:41:06 - INFO - __main__ - Step 131180: {'lr': 1.9687361429410438e-05, 'samples': 25186560, 'steps': 131179, 'loss/train': 1.0665737390518188} 11/07/2021 15:41:07 - INFO - __main__ - Step 131181: {'lr': 1.9685297318618485e-05, 'samples': 25186752, 'steps': 131180, 'loss/train': 1.2076665163040161} 11/07/2021 15:41:07 - INFO - __main__ - Step 131182: {'lr': 1.9683233311602766e-05, 'samples': 25186944, 'steps': 131181, 'loss/train': 1.3136835098266602} 11/07/2021 15:41:08 - INFO - __main__ - Step 131183: {'lr': 1.968116940836426e-05, 'samples': 25187136, 'steps': 131182, 'loss/train': 1.1991136074066162} 11/07/2021 15:41:09 - INFO - __main__ - Step 131184: {'lr': 1.9679105608903847e-05, 'samples': 25187328, 'steps': 131183, 'loss/train': 1.0734543800354004} 11/07/2021 15:41:09 - INFO - __main__ - Step 131185: {'lr': 1.9677041913222477e-05, 'samples': 25187520, 'steps': 131184, 'loss/train': 0.6498827338218689} 11/07/2021 15:41:10 - INFO - __main__ - Step 131186: {'lr': 1.967497832132112e-05, 'samples': 25187712, 'steps': 131185, 'loss/train': 1.1788731813430786} 11/07/2021 15:41:10 - INFO - __main__ - Step 131187: {'lr': 1.9672914833200605e-05, 'samples': 25187904, 'steps': 131186, 'loss/train': 1.282141089439392} 11/07/2021 15:41:10 - INFO - __main__ - Step 131188: {'lr': 1.9670851448861937e-05, 'samples': 25188096, 'steps': 131187, 'loss/train': 0.9890609383583069} 11/07/2021 15:41:11 - INFO - __main__ - Step 131189: {'lr': 1.966878816830603e-05, 'samples': 25188288, 'steps': 131188, 'loss/train': 1.3380402326583862} 11/07/2021 15:41:12 - INFO - __main__ - Step 131190: {'lr': 1.9666724991533825e-05, 'samples': 25188480, 'steps': 131189, 'loss/train': 0.16263459622859955} 11/07/2021 15:41:12 - INFO - __main__ - Step 131191: {'lr': 1.9664661918546213e-05, 'samples': 25188672, 'steps': 131190, 'loss/train': 0.03452523425221443} 11/07/2021 15:41:12 - INFO - __main__ - Step 131192: {'lr': 1.966259894934416e-05, 'samples': 25188864, 'steps': 131191, 'loss/train': 1.0270887613296509} 11/07/2021 15:41:13 - INFO - __main__ - Step 131193: {'lr': 1.9660536083928593e-05, 'samples': 25189056, 'steps': 131192, 'loss/train': 1.2421841621398926} 11/07/2021 15:41:13 - INFO - __main__ - Step 131194: {'lr': 1.965847332230042e-05, 'samples': 25189248, 'steps': 131193, 'loss/train': 1.2028231620788574} 11/07/2021 15:41:14 - INFO - __main__ - Step 131195: {'lr': 1.965641066446061e-05, 'samples': 25189440, 'steps': 131194, 'loss/train': 0.9842961430549622} 11/07/2021 15:41:14 - INFO - __main__ - Step 131196: {'lr': 1.9654348110410057e-05, 'samples': 25189632, 'steps': 131195, 'loss/train': 1.4750090837478638} 11/07/2021 15:41:15 - INFO - __main__ - Step 131197: {'lr': 1.9652285660149677e-05, 'samples': 25189824, 'steps': 131196, 'loss/train': 1.0585720539093018} 11/07/2021 15:41:15 - INFO - __main__ - Step 131198: {'lr': 1.9650223313680437e-05, 'samples': 25190016, 'steps': 131197, 'loss/train': 1.5310066938400269} 11/07/2021 15:41:15 - INFO - __main__ - Step 131199: {'lr': 1.9648161071003312e-05, 'samples': 25190208, 'steps': 131198, 'loss/train': 1.5395963191986084} 11/07/2021 15:41:17 - INFO - __main__ - Step 131200: {'lr': 1.9646098932119104e-05, 'samples': 25190400, 'steps': 131199, 'loss/train': 1.6437594890594482} 11/07/2021 15:41:17 - INFO - __main__ - Step 131201: {'lr': 1.9644036897028815e-05, 'samples': 25190592, 'steps': 131200, 'loss/train': 0.938396692276001} 11/07/2021 15:41:17 - INFO - __main__ - Step 131202: {'lr': 1.9641974965733388e-05, 'samples': 25190784, 'steps': 131201, 'loss/train': 0.7331992983818054} 11/07/2021 15:41:18 - INFO - __main__ - Step 131203: {'lr': 1.963991313823371e-05, 'samples': 25190976, 'steps': 131202, 'loss/train': 1.1759060621261597} 11/07/2021 15:41:18 - INFO - __main__ - Step 131204: {'lr': 1.9637851414530755e-05, 'samples': 25191168, 'steps': 131203, 'loss/train': 1.3826863765716553} 11/07/2021 15:41:19 - INFO - __main__ - Step 131205: {'lr': 1.963578979462541e-05, 'samples': 25191360, 'steps': 131204, 'loss/train': 1.378764033317566} 11/07/2021 15:41:20 - INFO - __main__ - Step 131206: {'lr': 1.9633728278518614e-05, 'samples': 25191552, 'steps': 131205, 'loss/train': 0.9381831288337708} 11/07/2021 15:41:20 - INFO - __main__ - Step 131207: {'lr': 1.963166686621132e-05, 'samples': 25191744, 'steps': 131206, 'loss/train': 0.9338041543960571} 11/07/2021 15:41:20 - INFO - __main__ - Step 131208: {'lr': 1.9629605557704432e-05, 'samples': 25191936, 'steps': 131207, 'loss/train': 1.2737587690353394} 11/07/2021 15:41:21 - INFO - __main__ - Step 131209: {'lr': 1.9627544352998906e-05, 'samples': 25192128, 'steps': 131208, 'loss/train': 1.5807052850723267} 11/07/2021 15:41:22 - INFO - __main__ - Step 131210: {'lr': 1.962548325209565e-05, 'samples': 25192320, 'steps': 131209, 'loss/train': 0.9583934545516968} 11/07/2021 15:41:22 - INFO - __main__ - Step 131211: {'lr': 1.9623422254995582e-05, 'samples': 25192512, 'steps': 131210, 'loss/train': 1.502549409866333} 11/07/2021 15:41:22 - INFO - __main__ - Step 131212: {'lr': 1.9621361361699702e-05, 'samples': 25192704, 'steps': 131211, 'loss/train': 1.4108003377914429} 11/07/2021 15:41:23 - INFO - __main__ - Step 131213: {'lr': 1.9619300572208842e-05, 'samples': 25192896, 'steps': 131212, 'loss/train': 1.4261912107467651} 11/07/2021 15:41:23 - INFO - __main__ - Step 131214: {'lr': 1.9617239886523974e-05, 'samples': 25193088, 'steps': 131213, 'loss/train': 1.1389670372009277} 11/07/2021 15:41:24 - INFO - __main__ - Step 131215: {'lr': 1.9615179304645985e-05, 'samples': 25193280, 'steps': 131214, 'loss/train': 0.9378883838653564} 11/07/2021 15:41:24 - INFO - __main__ - Step 131216: {'lr': 1.9613118826575878e-05, 'samples': 25193472, 'steps': 131215, 'loss/train': 0.6235902309417725} 11/07/2021 15:41:25 - INFO - __main__ - Step 131217: {'lr': 1.9611058452314534e-05, 'samples': 25193664, 'steps': 131216, 'loss/train': 1.563736081123352} 11/07/2021 15:41:25 - INFO - __main__ - Step 131218: {'lr': 1.9608998181862903e-05, 'samples': 25193856, 'steps': 131217, 'loss/train': 1.2093673944473267} 11/07/2021 15:41:25 - INFO - __main__ - Step 131219: {'lr': 1.960693801522187e-05, 'samples': 25194048, 'steps': 131218, 'loss/train': 0.9847797751426697} 11/07/2021 15:41:26 - INFO - __main__ - Step 131220: {'lr': 1.9604877952392434e-05, 'samples': 25194240, 'steps': 131219, 'loss/train': 1.1048868894577026} 11/07/2021 15:41:27 - INFO - __main__ - Step 131221: {'lr': 1.960281799337546e-05, 'samples': 25194432, 'steps': 131220, 'loss/train': 0.7567007541656494} 11/07/2021 15:41:27 - INFO - __main__ - Step 131222: {'lr': 1.9600758138171916e-05, 'samples': 25194624, 'steps': 131221, 'loss/train': 1.342843770980835} 11/07/2021 15:41:27 - INFO - __main__ - Step 131223: {'lr': 1.9598698386782715e-05, 'samples': 25194816, 'steps': 131222, 'loss/train': 1.2419686317443848} 11/07/2021 15:41:28 - INFO - __main__ - Step 131224: {'lr': 1.959663873920878e-05, 'samples': 25195008, 'steps': 131223, 'loss/train': 1.2802975177764893} 11/07/2021 15:41:29 - INFO - __main__ - Step 131225: {'lr': 1.959457919545102e-05, 'samples': 25195200, 'steps': 131224, 'loss/train': 1.4659794569015503} 11/07/2021 15:41:29 - INFO - __main__ - Step 131226: {'lr': 1.9592519755510463e-05, 'samples': 25195392, 'steps': 131225, 'loss/train': 1.2973206043243408} 11/07/2021 15:41:30 - INFO - __main__ - Step 131227: {'lr': 1.959046041938792e-05, 'samples': 25195584, 'steps': 131226, 'loss/train': 1.250433087348938} 11/07/2021 15:41:30 - INFO - __main__ - Step 131228: {'lr': 1.9588401187084326e-05, 'samples': 25195776, 'steps': 131227, 'loss/train': 1.0227171182632446} 11/07/2021 15:41:30 - INFO - __main__ - Step 131229: {'lr': 1.958634205860066e-05, 'samples': 25195968, 'steps': 131228, 'loss/train': 1.065185546875} 11/07/2021 15:41:31 - INFO - __main__ - Step 131230: {'lr': 1.958428303393786e-05, 'samples': 25196160, 'steps': 131229, 'loss/train': 1.970374584197998} 11/07/2021 15:41:32 - INFO - __main__ - Step 131231: {'lr': 1.958222411309679e-05, 'samples': 25196352, 'steps': 131230, 'loss/train': 1.0584595203399658} 11/07/2021 15:41:32 - INFO - __main__ - Step 131232: {'lr': 1.9580165296078422e-05, 'samples': 25196544, 'steps': 131231, 'loss/train': 1.5292919874191284} 11/07/2021 15:41:33 - INFO - __main__ - Step 131233: {'lr': 1.9578106582883698e-05, 'samples': 25196736, 'steps': 131232, 'loss/train': 0.8928581476211548} 11/07/2021 15:41:33 - INFO - __main__ - Step 131234: {'lr': 1.9576047973513505e-05, 'samples': 25196928, 'steps': 131233, 'loss/train': 1.2216527462005615} 11/07/2021 15:41:34 - INFO - __main__ - Step 131235: {'lr': 1.957398946796879e-05, 'samples': 25197120, 'steps': 131234, 'loss/train': 1.370319128036499} 11/07/2021 15:41:34 - INFO - __main__ - Step 131236: {'lr': 1.9571931066250492e-05, 'samples': 25197312, 'steps': 131235, 'loss/train': 2.3063907623291016} 11/07/2021 15:41:35 - INFO - __main__ - Step 131237: {'lr': 1.9569872768359504e-05, 'samples': 25197504, 'steps': 131236, 'loss/train': 1.6416234970092773} 11/07/2021 15:41:35 - INFO - __main__ - Step 131238: {'lr': 1.9567814574296793e-05, 'samples': 25197696, 'steps': 131237, 'loss/train': 1.0484322309494019} 11/07/2021 15:41:35 - INFO - __main__ - Step 131239: {'lr': 1.9565756484063308e-05, 'samples': 25197888, 'steps': 131238, 'loss/train': 0.2578880786895752} 11/07/2021 15:41:36 - INFO - __main__ - Step 131240: {'lr': 1.9563698497659878e-05, 'samples': 25198080, 'steps': 131239, 'loss/train': 5.694448471069336} 11/07/2021 15:41:37 - INFO - __main__ - Step 131241: {'lr': 1.95616406150875e-05, 'samples': 25198272, 'steps': 131240, 'loss/train': 0.9868974685668945} 11/07/2021 15:41:37 - INFO - __main__ - Step 131242: {'lr': 1.9559582836347094e-05, 'samples': 25198464, 'steps': 131241, 'loss/train': 1.30802321434021} 11/07/2021 15:41:38 - INFO - __main__ - Step 131243: {'lr': 1.9557525161439603e-05, 'samples': 25198656, 'steps': 131242, 'loss/train': 1.3563836812973022} 11/07/2021 15:41:38 - INFO - __main__ - Step 131244: {'lr': 1.9555467590365917e-05, 'samples': 25198848, 'steps': 131243, 'loss/train': 1.1264162063598633} 11/07/2021 15:41:38 - INFO - __main__ - Step 131245: {'lr': 1.9553410123126975e-05, 'samples': 25199040, 'steps': 131244, 'loss/train': 1.4625784158706665} 11/07/2021 15:41:39 - INFO - __main__ - Step 131246: {'lr': 1.9551352759723724e-05, 'samples': 25199232, 'steps': 131245, 'loss/train': 1.26669442653656} 11/07/2021 15:41:40 - INFO - __main__ - Step 131247: {'lr': 1.954929550015705e-05, 'samples': 25199424, 'steps': 131246, 'loss/train': 1.099883794784546} 11/07/2021 15:41:40 - INFO - __main__ - Step 131248: {'lr': 1.9547238344427925e-05, 'samples': 25199616, 'steps': 131247, 'loss/train': 1.3367291688919067} 11/07/2021 15:41:40 - INFO - __main__ - Step 131249: {'lr': 1.9545181292537267e-05, 'samples': 25199808, 'steps': 131248, 'loss/train': 0.8866224884986877} 11/07/2021 15:41:41 - INFO - __main__ - Step 131250: {'lr': 1.954312434448599e-05, 'samples': 25200000, 'steps': 131249, 'loss/train': 1.5366955995559692} 11/07/2021 15:41:42 - INFO - __main__ - Step 131251: {'lr': 1.9541067500275038e-05, 'samples': 25200192, 'steps': 131250, 'loss/train': 1.3407784700393677} 11/07/2021 15:41:42 - INFO - __main__ - Step 131252: {'lr': 1.95390107599053e-05, 'samples': 25200384, 'steps': 131251, 'loss/train': 1.4696266651153564} 11/07/2021 15:41:43 - INFO - __main__ - Step 131253: {'lr': 1.9536954123377776e-05, 'samples': 25200576, 'steps': 131252, 'loss/train': 1.284767508506775} 11/07/2021 15:41:43 - INFO - __main__ - Step 131254: {'lr': 1.9534897590693323e-05, 'samples': 25200768, 'steps': 131253, 'loss/train': 1.0553630590438843} 11/07/2021 15:41:43 - INFO - __main__ - Step 131255: {'lr': 1.953284116185286e-05, 'samples': 25200960, 'steps': 131254, 'loss/train': 1.0376770496368408} 11/07/2021 15:41:44 - INFO - __main__ - Step 131256: {'lr': 1.9530784836857356e-05, 'samples': 25201152, 'steps': 131255, 'loss/train': 1.420300006866455} 11/07/2021 15:41:45 - INFO - __main__ - Step 131257: {'lr': 1.952872861570773e-05, 'samples': 25201344, 'steps': 131256, 'loss/train': 1.1142505407333374} 11/07/2021 15:41:45 - INFO - __main__ - Step 131258: {'lr': 1.9526672498404897e-05, 'samples': 25201536, 'steps': 131257, 'loss/train': 1.747212529182434} 11/07/2021 15:41:46 - INFO - __main__ - Step 131259: {'lr': 1.952461648494977e-05, 'samples': 25201728, 'steps': 131258, 'loss/train': 1.316498875617981} 11/07/2021 15:41:46 - INFO - __main__ - Step 131260: {'lr': 1.9522560575343324e-05, 'samples': 25201920, 'steps': 131259, 'loss/train': 2.0585408210754395} 11/07/2021 15:41:46 - INFO - __main__ - Step 131261: {'lr': 1.9520504769586446e-05, 'samples': 25202112, 'steps': 131260, 'loss/train': 1.0473248958587646} 11/07/2021 15:41:47 - INFO - __main__ - Step 131262: {'lr': 1.951844906768005e-05, 'samples': 25202304, 'steps': 131261, 'loss/train': 1.2861958742141724} 11/07/2021 15:41:48 - INFO - __main__ - Step 131263: {'lr': 1.951639346962511e-05, 'samples': 25202496, 'steps': 131262, 'loss/train': 1.166386365890503} 11/07/2021 15:41:48 - INFO - __main__ - Step 131264: {'lr': 1.9514337975422513e-05, 'samples': 25202688, 'steps': 131263, 'loss/train': 1.1694743633270264} 11/07/2021 15:41:49 - INFO - __main__ - Step 131265: {'lr': 1.9512282585073206e-05, 'samples': 25202880, 'steps': 131264, 'loss/train': 1.386309266090393} 11/07/2021 15:41:49 - INFO - __main__ - Step 131266: {'lr': 1.9510227298578154e-05, 'samples': 25203072, 'steps': 131265, 'loss/train': 0.7117088437080383} 11/07/2021 15:41:50 - INFO - __main__ - Step 131267: {'lr': 1.9508172115938194e-05, 'samples': 25203264, 'steps': 131266, 'loss/train': 1.0012205839157104} 11/07/2021 15:41:50 - INFO - __main__ - Step 131268: {'lr': 1.9506117037154297e-05, 'samples': 25203456, 'steps': 131267, 'loss/train': 0.979674220085144} 11/07/2021 15:41:51 - INFO - __main__ - Step 131269: {'lr': 1.950406206222738e-05, 'samples': 25203648, 'steps': 131268, 'loss/train': 1.1454912424087524} 11/07/2021 15:41:51 - INFO - __main__ - Step 131270: {'lr': 1.9502007191158355e-05, 'samples': 25203840, 'steps': 131269, 'loss/train': 1.0965495109558105} 11/07/2021 15:41:51 - INFO - __main__ - Step 131271: {'lr': 1.9499952423948198e-05, 'samples': 25204032, 'steps': 131270, 'loss/train': 1.4465619325637817} 11/07/2021 15:41:53 - INFO - __main__ - Step 131272: {'lr': 1.9497897760597792e-05, 'samples': 25204224, 'steps': 131271, 'loss/train': 1.2657171487808228} 11/07/2021 15:41:53 - INFO - __main__ - Step 131273: {'lr': 1.9495843201108087e-05, 'samples': 25204416, 'steps': 131272, 'loss/train': 1.2706667184829712} 11/07/2021 15:41:53 - INFO - __main__ - Step 131274: {'lr': 1.9493788745479996e-05, 'samples': 25204608, 'steps': 131273, 'loss/train': 1.722068190574646} 11/07/2021 15:41:54 - INFO - __main__ - Step 131275: {'lr': 1.9491734393714434e-05, 'samples': 25204800, 'steps': 131274, 'loss/train': 1.138673186302185} 11/07/2021 15:41:54 - INFO - __main__ - Step 131276: {'lr': 1.948968014581237e-05, 'samples': 25204992, 'steps': 131275, 'loss/train': 1.1090530157089233} 11/07/2021 15:41:54 - INFO - __main__ - Step 131277: {'lr': 1.9487626001774674e-05, 'samples': 25205184, 'steps': 131276, 'loss/train': 1.3399027585983276} 11/07/2021 15:41:55 - INFO - __main__ - Step 131278: {'lr': 1.9485571961602304e-05, 'samples': 25205376, 'steps': 131277, 'loss/train': 1.2174326181411743} 11/07/2021 15:41:56 - INFO - __main__ - Step 131279: {'lr': 1.948351802529616e-05, 'samples': 25205568, 'steps': 131278, 'loss/train': 0.8971495628356934} 11/07/2021 15:41:56 - INFO - __main__ - Step 131280: {'lr': 1.9481464192857264e-05, 'samples': 25205760, 'steps': 131279, 'loss/train': 1.6726731061935425} 11/07/2021 15:41:56 - INFO - __main__ - Step 131281: {'lr': 1.947941046428639e-05, 'samples': 25205952, 'steps': 131280, 'loss/train': 1.2218992710113525} 11/07/2021 15:41:57 - INFO - __main__ - Step 131282: {'lr': 1.9477356839584543e-05, 'samples': 25206144, 'steps': 131281, 'loss/train': 1.578813910484314} 11/07/2021 15:41:58 - INFO - __main__ - Step 131283: {'lr': 1.9475303318752662e-05, 'samples': 25206336, 'steps': 131282, 'loss/train': 1.2391053438186646} 11/07/2021 15:41:58 - INFO - __main__ - Step 131284: {'lr': 1.947324990179161e-05, 'samples': 25206528, 'steps': 131283, 'loss/train': 1.3991451263427734} 11/07/2021 15:41:59 - INFO - __main__ - Step 131285: {'lr': 1.947119658870239e-05, 'samples': 25206720, 'steps': 131284, 'loss/train': 1.4831408262252808} 11/07/2021 15:41:59 - INFO - __main__ - Step 131286: {'lr': 1.9469143379485882e-05, 'samples': 25206912, 'steps': 131285, 'loss/train': 1.4200228452682495} 11/07/2021 15:41:59 - INFO - __main__ - Step 131287: {'lr': 1.9467090274143035e-05, 'samples': 25207104, 'steps': 131286, 'loss/train': 1.6649671792984009} 11/07/2021 15:42:00 - INFO - __main__ - Step 131288: {'lr': 1.9465037272674734e-05, 'samples': 25207296, 'steps': 131287, 'loss/train': 1.4654299020767212} 11/07/2021 15:42:01 - INFO - __main__ - Step 131289: {'lr': 1.9462984375081953e-05, 'samples': 25207488, 'steps': 131288, 'loss/train': 0.7304221987724304} 11/07/2021 15:42:01 - INFO - __main__ - Step 131290: {'lr': 1.9460931581365583e-05, 'samples': 25207680, 'steps': 131289, 'loss/train': 1.4176914691925049} 11/07/2021 15:42:01 - INFO - __main__ - Step 131291: {'lr': 1.945887889152656e-05, 'samples': 25207872, 'steps': 131290, 'loss/train': 1.2463629245758057} 11/07/2021 15:42:02 - INFO - __main__ - Step 131292: {'lr': 1.9456826305565805e-05, 'samples': 25208064, 'steps': 131291, 'loss/train': 1.8561135530471802} 11/07/2021 15:42:02 - INFO - __main__ - Step 131293: {'lr': 1.945477382348429e-05, 'samples': 25208256, 'steps': 131292, 'loss/train': 0.9720609784126282} 11/07/2021 15:42:03 - INFO - __main__ - Step 131294: {'lr': 1.9452721445282844e-05, 'samples': 25208448, 'steps': 131293, 'loss/train': 0.13481879234313965} 11/07/2021 15:42:03 - INFO - __main__ - Step 131295: {'lr': 1.9450669170962472e-05, 'samples': 25208640, 'steps': 131294, 'loss/train': 1.5904964208602905} 11/07/2021 15:42:04 - INFO - __main__ - Step 131296: {'lr': 1.9448617000524054e-05, 'samples': 25208832, 'steps': 131295, 'loss/train': 1.568786382675171} 11/07/2021 15:42:04 - INFO - __main__ - Step 131297: {'lr': 1.9446564933968512e-05, 'samples': 25209024, 'steps': 131296, 'loss/train': 1.4413745403289795} 11/07/2021 15:42:04 - INFO - __main__ - Step 131298: {'lr': 1.9444512971296817e-05, 'samples': 25209216, 'steps': 131297, 'loss/train': 1.3868498802185059} 11/07/2021 15:42:06 - INFO - __main__ - Step 131299: {'lr': 1.9442461112509857e-05, 'samples': 25209408, 'steps': 131298, 'loss/train': 1.049233317375183} 11/07/2021 15:42:06 - INFO - __main__ - Step 131300: {'lr': 1.9440409357608575e-05, 'samples': 25209600, 'steps': 131299, 'loss/train': 1.1640703678131104} 11/07/2021 15:42:06 - INFO - __main__ - Step 131301: {'lr': 1.9438357706593884e-05, 'samples': 25209792, 'steps': 131300, 'loss/train': 1.0535129308700562} 11/07/2021 15:42:07 - INFO - __main__ - Step 131302: {'lr': 1.9436306159466704e-05, 'samples': 25209984, 'steps': 131301, 'loss/train': 1.3454701900482178} 11/07/2021 15:42:07 - INFO - __main__ - Step 131303: {'lr': 1.943425471622798e-05, 'samples': 25210176, 'steps': 131302, 'loss/train': 1.0747252702713013} 11/07/2021 15:42:08 - INFO - __main__ - Step 131304: {'lr': 1.9432203376878594e-05, 'samples': 25210368, 'steps': 131303, 'loss/train': 1.635898232460022} 11/07/2021 15:42:08 - INFO - __main__ - Step 131305: {'lr': 1.943015214141952e-05, 'samples': 25210560, 'steps': 131304, 'loss/train': 1.3400335311889648} 11/07/2021 15:42:09 - INFO - __main__ - Step 131306: {'lr': 1.9428101009851678e-05, 'samples': 25210752, 'steps': 131305, 'loss/train': 0.5061046481132507} 11/07/2021 15:42:09 - INFO - __main__ - Step 131307: {'lr': 1.9426049982176008e-05, 'samples': 25210944, 'steps': 131306, 'loss/train': 1.4910354614257812} 11/07/2021 15:42:09 - INFO - __main__ - Step 131308: {'lr': 1.9423999058393343e-05, 'samples': 25211136, 'steps': 131307, 'loss/train': 0.9775039553642273} 11/07/2021 15:42:11 - INFO - __main__ - Step 131309: {'lr': 1.9421948238504684e-05, 'samples': 25211328, 'steps': 131308, 'loss/train': 1.1670713424682617} 11/07/2021 15:42:11 - INFO - __main__ - Step 131310: {'lr': 1.9419897522510917e-05, 'samples': 25211520, 'steps': 131309, 'loss/train': 1.3885592222213745} 11/07/2021 15:42:11 - INFO - __main__ - Step 131311: {'lr': 1.9417846910413012e-05, 'samples': 25211712, 'steps': 131310, 'loss/train': 1.1954336166381836} 11/07/2021 15:42:12 - INFO - __main__ - Step 131312: {'lr': 1.9415796402211834e-05, 'samples': 25211904, 'steps': 131311, 'loss/train': 0.7008744478225708} 11/07/2021 15:42:12 - INFO - __main__ - Step 131313: {'lr': 1.9413745997908377e-05, 'samples': 25212096, 'steps': 131312, 'loss/train': 1.478037714958191} 11/07/2021 15:42:13 - INFO - __main__ - Step 131314: {'lr': 1.9411695697503506e-05, 'samples': 25212288, 'steps': 131313, 'loss/train': 0.8724270462989807} 11/07/2021 15:42:13 - INFO - __main__ - Step 131315: {'lr': 1.9409645500998163e-05, 'samples': 25212480, 'steps': 131314, 'loss/train': 1.2171909809112549} 11/07/2021 15:42:14 - INFO - __main__ - Step 131316: {'lr': 1.940759540839329e-05, 'samples': 25212672, 'steps': 131315, 'loss/train': 1.636842966079712} 11/07/2021 15:42:14 - INFO - __main__ - Step 131317: {'lr': 1.940554541968978e-05, 'samples': 25212864, 'steps': 131316, 'loss/train': 0.8869168162345886} 11/07/2021 15:42:14 - INFO - __main__ - Step 131318: {'lr': 1.940349553488857e-05, 'samples': 25213056, 'steps': 131317, 'loss/train': 1.484756350517273} 11/07/2021 15:42:15 - INFO - __main__ - Step 131319: {'lr': 1.940144575399061e-05, 'samples': 25213248, 'steps': 131318, 'loss/train': 1.8234056234359741} 11/07/2021 15:42:16 - INFO - __main__ - Step 131320: {'lr': 1.9399396076996838e-05, 'samples': 25213440, 'steps': 131319, 'loss/train': 1.1999571323394775} 11/07/2021 15:42:16 - INFO - __main__ - Step 131321: {'lr': 1.9397346503908093e-05, 'samples': 25213632, 'steps': 131320, 'loss/train': 1.2901408672332764} 11/07/2021 15:42:16 - INFO - __main__ - Step 131322: {'lr': 1.939529703472534e-05, 'samples': 25213824, 'steps': 131321, 'loss/train': 1.171659231185913} 11/07/2021 15:42:17 - INFO - __main__ - Step 131323: {'lr': 1.93932476694495e-05, 'samples': 25214016, 'steps': 131322, 'loss/train': 1.1663175821304321} 11/07/2021 15:42:18 - INFO - __main__ - Step 131324: {'lr': 1.9391198408081513e-05, 'samples': 25214208, 'steps': 131323, 'loss/train': 1.1826292276382446} 11/07/2021 15:42:18 - INFO - __main__ - Step 131325: {'lr': 1.9389149250622295e-05, 'samples': 25214400, 'steps': 131324, 'loss/train': 1.075515866279602} 11/07/2021 15:42:19 - INFO - __main__ - Step 131326: {'lr': 1.9387100197072765e-05, 'samples': 25214592, 'steps': 131325, 'loss/train': 1.1217128038406372} 11/07/2021 15:42:19 - INFO - __main__ - Step 131327: {'lr': 1.9385051247433866e-05, 'samples': 25214784, 'steps': 131326, 'loss/train': 1.632102131843567} 11/07/2021 15:42:19 - INFO - __main__ - Step 131328: {'lr': 1.9383002401706486e-05, 'samples': 25214976, 'steps': 131327, 'loss/train': 1.3655585050582886} 11/07/2021 15:42:20 - INFO - __main__ - Step 131329: {'lr': 1.9380953659891566e-05, 'samples': 25215168, 'steps': 131328, 'loss/train': 0.8591810464859009} 11/07/2021 15:42:21 - INFO - __main__ - Step 131330: {'lr': 1.937890502199005e-05, 'samples': 25215360, 'steps': 131329, 'loss/train': 1.004238247871399} 11/07/2021 15:42:21 - INFO - __main__ - Step 131331: {'lr': 1.9376856488002803e-05, 'samples': 25215552, 'steps': 131330, 'loss/train': 1.435416579246521} 11/07/2021 15:42:21 - INFO - __main__ - Step 131332: {'lr': 1.937480805793082e-05, 'samples': 25215744, 'steps': 131331, 'loss/train': 1.1938061714172363} 11/07/2021 15:42:22 - INFO - __main__ - Step 131333: {'lr': 1.937275973177502e-05, 'samples': 25215936, 'steps': 131332, 'loss/train': 1.1459282636642456} 11/07/2021 15:42:22 - INFO - __main__ - Step 131334: {'lr': 1.9370711509536258e-05, 'samples': 25216128, 'steps': 131333, 'loss/train': 1.0700832605361938} 11/07/2021 15:42:23 - INFO - __main__ - Step 131335: {'lr': 1.9368663391215484e-05, 'samples': 25216320, 'steps': 131334, 'loss/train': 0.042086802423000336} 11/07/2021 15:42:24 - INFO - __main__ - Step 131336: {'lr': 1.936661537681364e-05, 'samples': 25216512, 'steps': 131335, 'loss/train': 1.1789156198501587} 11/07/2021 15:42:24 - INFO - __main__ - Step 131337: {'lr': 1.936456746633164e-05, 'samples': 25216704, 'steps': 131336, 'loss/train': 0.9855438470840454} 11/07/2021 15:42:24 - INFO - __main__ - Step 131338: {'lr': 1.93625196597704e-05, 'samples': 25216896, 'steps': 131337, 'loss/train': 1.5788325071334839} 11/07/2021 15:42:25 - INFO - __main__ - Step 131339: {'lr': 1.9360471957130865e-05, 'samples': 25217088, 'steps': 131338, 'loss/train': 1.3529208898544312} 11/07/2021 15:42:26 - INFO - __main__ - Step 131340: {'lr': 1.9358424358413924e-05, 'samples': 25217280, 'steps': 131339, 'loss/train': 1.2449922561645508} 11/07/2021 15:42:26 - INFO - __main__ - Step 131341: {'lr': 1.9356376863620516e-05, 'samples': 25217472, 'steps': 131340, 'loss/train': 1.2748427391052246} 11/07/2021 15:42:26 - INFO - __main__ - Step 131342: {'lr': 1.9354329472751593e-05, 'samples': 25217664, 'steps': 131341, 'loss/train': 1.2753098011016846} 11/07/2021 15:42:27 - INFO - __main__ - Step 131343: {'lr': 1.9352282185808036e-05, 'samples': 25217856, 'steps': 131342, 'loss/train': 1.4242594242095947} 11/07/2021 15:42:27 - INFO - __main__ - Step 131344: {'lr': 1.9350235002790762e-05, 'samples': 25218048, 'steps': 131343, 'loss/train': 0.9964110851287842} 11/07/2021 15:42:28 - INFO - __main__ - Step 131345: {'lr': 1.9348187923700772e-05, 'samples': 25218240, 'steps': 131344, 'loss/train': 1.2334458827972412} 11/07/2021 15:42:28 - INFO - __main__ - Step 131346: {'lr': 1.93461409485389e-05, 'samples': 25218432, 'steps': 131345, 'loss/train': 1.179322361946106} 11/07/2021 15:42:29 - INFO - __main__ - Step 131347: {'lr': 1.9344094077306085e-05, 'samples': 25218624, 'steps': 131346, 'loss/train': 1.4240937232971191} 11/07/2021 15:42:29 - INFO - __main__ - Step 131348: {'lr': 1.9342047310003248e-05, 'samples': 25218816, 'steps': 131347, 'loss/train': 0.960141122341156} 11/07/2021 15:42:29 - INFO - __main__ - Step 131349: {'lr': 1.934000064663133e-05, 'samples': 25219008, 'steps': 131348, 'loss/train': 1.7089710235595703} 11/07/2021 15:42:30 - INFO - __main__ - Step 131350: {'lr': 1.9337954087191272e-05, 'samples': 25219200, 'steps': 131349, 'loss/train': 1.5984445810317993} 11/07/2021 15:42:31 - INFO - __main__ - Step 131351: {'lr': 1.9335907631683943e-05, 'samples': 25219392, 'steps': 131350, 'loss/train': 0.9653474688529968} 11/07/2021 15:42:31 - INFO - __main__ - Step 131352: {'lr': 1.9333861280110304e-05, 'samples': 25219584, 'steps': 131351, 'loss/train': 0.8050532937049866} 11/07/2021 15:42:32 - INFO - __main__ - Step 131353: {'lr': 1.9331815032471277e-05, 'samples': 25219776, 'steps': 131352, 'loss/train': 1.3799699544906616} 11/07/2021 15:42:32 - INFO - __main__ - Step 131354: {'lr': 1.932976888876778e-05, 'samples': 25219968, 'steps': 131353, 'loss/train': 0.9992281794548035} 11/07/2021 15:42:33 - INFO - __main__ - Step 131355: {'lr': 1.932772284900072e-05, 'samples': 25220160, 'steps': 131354, 'loss/train': 0.9543507099151611} 11/07/2021 15:42:33 - INFO - __main__ - Step 131356: {'lr': 1.9325676913171053e-05, 'samples': 25220352, 'steps': 131355, 'loss/train': 0.9496420621871948} 11/07/2021 15:42:34 - INFO - __main__ - Step 131357: {'lr': 1.932363108127966e-05, 'samples': 25220544, 'steps': 131356, 'loss/train': 0.6349324584007263} 11/07/2021 15:42:34 - INFO - __main__ - Step 131358: {'lr': 1.9321585353327482e-05, 'samples': 25220736, 'steps': 131357, 'loss/train': 0.9767065048217773} 11/07/2021 15:42:34 - INFO - __main__ - Step 131359: {'lr': 1.9319539729315412e-05, 'samples': 25220928, 'steps': 131358, 'loss/train': 1.3412091732025146} 11/07/2021 15:42:35 - INFO - __main__ - Step 131360: {'lr': 1.931749420924442e-05, 'samples': 25221120, 'steps': 131359, 'loss/train': 1.2324408292770386} 11/07/2021 15:42:36 - INFO - __main__ - Step 131361: {'lr': 1.9315448793115393e-05, 'samples': 25221312, 'steps': 131360, 'loss/train': 1.1096305847167969} 11/07/2021 15:42:36 - INFO - __main__ - Step 131362: {'lr': 1.9313403480929277e-05, 'samples': 25221504, 'steps': 131361, 'loss/train': 1.1536953449249268} 11/07/2021 15:42:36 - INFO - __main__ - Step 131363: {'lr': 1.9311358272686985e-05, 'samples': 25221696, 'steps': 131362, 'loss/train': 1.1119177341461182} 11/07/2021 15:42:37 - INFO - __main__ - Step 131364: {'lr': 1.930931316838941e-05, 'samples': 25221888, 'steps': 131363, 'loss/train': 1.2476204633712769} 11/07/2021 15:42:38 - INFO - __main__ - Step 131365: {'lr': 1.9307268168037516e-05, 'samples': 25222080, 'steps': 131364, 'loss/train': 1.2819856405258179} 11/07/2021 15:42:38 - INFO - __main__ - Step 131366: {'lr': 1.93052232716322e-05, 'samples': 25222272, 'steps': 131365, 'loss/train': 0.6193860769271851} 11/07/2021 15:42:38 - INFO - __main__ - Step 131367: {'lr': 1.9303178479174455e-05, 'samples': 25222464, 'steps': 131366, 'loss/train': 1.3952877521514893} 11/07/2021 15:42:39 - INFO - __main__ - Step 131368: {'lr': 1.930113379066506e-05, 'samples': 25222656, 'steps': 131367, 'loss/train': 1.1604491472244263} 11/07/2021 15:42:39 - INFO - __main__ - Step 131369: {'lr': 1.929908920610504e-05, 'samples': 25222848, 'steps': 131368, 'loss/train': 1.1238688230514526} 11/07/2021 15:42:40 - INFO - __main__ - Step 131370: {'lr': 1.929704472549529e-05, 'samples': 25223040, 'steps': 131369, 'loss/train': 1.5546399354934692} 11/07/2021 15:42:41 - INFO - __main__ - Step 131371: {'lr': 1.9295000348836717e-05, 'samples': 25223232, 'steps': 131370, 'loss/train': 1.1463547945022583} 11/07/2021 15:42:41 - INFO - __main__ - Step 131372: {'lr': 1.929295607613027e-05, 'samples': 25223424, 'steps': 131371, 'loss/train': 1.1025831699371338} 11/07/2021 15:42:41 - INFO - __main__ - Step 131373: {'lr': 1.9290911907376864e-05, 'samples': 25223616, 'steps': 131372, 'loss/train': 1.413500428199768} 11/07/2021 15:42:42 - INFO - __main__ - Step 131374: {'lr': 1.9288867842577385e-05, 'samples': 25223808, 'steps': 131373, 'loss/train': 0.9303958415985107} 11/07/2021 15:42:43 - INFO - __main__ - Step 131375: {'lr': 1.9286823881732807e-05, 'samples': 25224000, 'steps': 131374, 'loss/train': 1.4101518392562866} 11/07/2021 15:42:43 - INFO - __main__ - Step 131376: {'lr': 1.928478002484402e-05, 'samples': 25224192, 'steps': 131375, 'loss/train': 1.0627869367599487} 11/07/2021 15:42:43 - INFO - __main__ - Step 131377: {'lr': 1.928273627191193e-05, 'samples': 25224384, 'steps': 131376, 'loss/train': 1.0897666215896606} 11/07/2021 15:42:44 - INFO - __main__ - Step 131378: {'lr': 1.928069262293755e-05, 'samples': 25224576, 'steps': 131377, 'loss/train': 0.715830385684967} 11/07/2021 15:42:44 - INFO - __main__ - Step 131379: {'lr': 1.9278649077921677e-05, 'samples': 25224768, 'steps': 131378, 'loss/train': 1.655496597290039} 11/07/2021 15:42:45 - INFO - __main__ - Step 131380: {'lr': 1.9276605636865284e-05, 'samples': 25224960, 'steps': 131379, 'loss/train': 1.1837178468704224} 11/07/2021 15:42:46 - INFO - __main__ - Step 131381: {'lr': 1.927456229976929e-05, 'samples': 25225152, 'steps': 131380, 'loss/train': 0.892525851726532} 11/07/2021 15:42:46 - INFO - __main__ - Step 131382: {'lr': 1.927251906663463e-05, 'samples': 25225344, 'steps': 131381, 'loss/train': 1.5803277492523193} 11/07/2021 15:42:46 - INFO - __main__ - Step 131383: {'lr': 1.92704759374622e-05, 'samples': 25225536, 'steps': 131382, 'loss/train': 1.3450541496276855} 11/07/2021 15:42:47 - INFO - __main__ - Step 131384: {'lr': 1.9268432912252913e-05, 'samples': 25225728, 'steps': 131383, 'loss/train': 2.4236645698547363} 11/07/2021 15:42:48 - INFO - __main__ - Step 131385: {'lr': 1.9266389991007744e-05, 'samples': 25225920, 'steps': 131384, 'loss/train': 1.4959720373153687} 11/07/2021 15:42:48 - INFO - __main__ - Step 131386: {'lr': 1.9264347173727575e-05, 'samples': 25226112, 'steps': 131385, 'loss/train': 0.8566153645515442} 11/07/2021 15:42:48 - INFO - __main__ - Step 131387: {'lr': 1.9262304460413328e-05, 'samples': 25226304, 'steps': 131386, 'loss/train': 1.1107993125915527} 11/07/2021 15:42:49 - INFO - __main__ - Step 131388: {'lr': 1.9260261851065914e-05, 'samples': 25226496, 'steps': 131387, 'loss/train': 0.7911747097969055} 11/07/2021 15:42:49 - INFO - __main__ - Step 131389: {'lr': 1.9258219345686306e-05, 'samples': 25226688, 'steps': 131388, 'loss/train': 1.1700674295425415} 11/07/2021 15:42:49 - INFO - __main__ - Step 131390: {'lr': 1.9256176944275367e-05, 'samples': 25226880, 'steps': 131389, 'loss/train': 1.4023128747940063} 11/07/2021 15:42:51 - INFO - __main__ - Step 131391: {'lr': 1.925413464683401e-05, 'samples': 25227072, 'steps': 131390, 'loss/train': 1.1489194631576538} 11/07/2021 15:42:51 - INFO - __main__ - Step 131392: {'lr': 1.925209245336318e-05, 'samples': 25227264, 'steps': 131391, 'loss/train': 1.3619590997695923} 11/07/2021 15:42:51 - INFO - __main__ - Step 131393: {'lr': 1.925005036386382e-05, 'samples': 25227456, 'steps': 131392, 'loss/train': 1.381056308746338} 11/07/2021 15:42:52 - INFO - __main__ - Step 131394: {'lr': 1.924800837833679e-05, 'samples': 25227648, 'steps': 131393, 'loss/train': 1.3366373777389526} 11/07/2021 15:42:52 - INFO - __main__ - Step 131395: {'lr': 1.924596649678309e-05, 'samples': 25227840, 'steps': 131394, 'loss/train': 0.9943685531616211} 11/07/2021 15:42:53 - INFO - __main__ - Step 131396: {'lr': 1.9243924719203552e-05, 'samples': 25228032, 'steps': 131395, 'loss/train': 0.04610098898410797} 11/07/2021 15:42:54 - INFO - __main__ - Step 131397: {'lr': 1.9241883045599178e-05, 'samples': 25228224, 'steps': 131396, 'loss/train': 1.1570712327957153} 11/07/2021 15:42:54 - INFO - __main__ - Step 131398: {'lr': 1.9239841475970826e-05, 'samples': 25228416, 'steps': 131397, 'loss/train': 0.41000133752822876} 11/07/2021 15:42:54 - INFO - __main__ - Step 131399: {'lr': 1.9237800010319467e-05, 'samples': 25228608, 'steps': 131398, 'loss/train': 1.3156938552856445} 11/07/2021 15:42:55 - INFO - __main__ - Step 131400: {'lr': 1.9235758648645964e-05, 'samples': 25228800, 'steps': 131399, 'loss/train': 0.6450033783912659} 11/07/2021 15:42:56 - INFO - __main__ - Step 131401: {'lr': 1.923371739095131e-05, 'samples': 25228992, 'steps': 131400, 'loss/train': 0.9912352561950684} 11/07/2021 15:42:56 - INFO - __main__ - Step 131402: {'lr': 1.9231676237236373e-05, 'samples': 25229184, 'steps': 131401, 'loss/train': 1.4100767374038696} 11/07/2021 15:42:56 - INFO - __main__ - Step 131403: {'lr': 1.9229635187502065e-05, 'samples': 25229376, 'steps': 131402, 'loss/train': 1.5105700492858887} 11/07/2021 15:42:57 - INFO - __main__ - Step 131404: {'lr': 1.92275942417493e-05, 'samples': 25229568, 'steps': 131403, 'loss/train': 0.6827055811882019} 11/07/2021 15:42:57 - INFO - __main__ - Step 131405: {'lr': 1.9225553399979057e-05, 'samples': 25229760, 'steps': 131404, 'loss/train': 1.7570455074310303} 11/07/2021 15:42:59 - INFO - __main__ - Step 131406: {'lr': 1.9223512662192187e-05, 'samples': 25229952, 'steps': 131405, 'loss/train': 1.3206608295440674} 11/07/2021 15:42:59 - INFO - __main__ - Step 131407: {'lr': 1.922147202838967e-05, 'samples': 25230144, 'steps': 131406, 'loss/train': 1.4478670358657837} 11/07/2021 15:42:59 - INFO - __main__ - Step 131408: {'lr': 1.921943149857236e-05, 'samples': 25230336, 'steps': 131407, 'loss/train': 0.9335084557533264} 11/07/2021 15:43:00 - INFO - __main__ - Step 131409: {'lr': 1.921739107274123e-05, 'samples': 25230528, 'steps': 131408, 'loss/train': 1.057376742362976} 11/07/2021 15:43:00 - INFO - __main__ - Step 131410: {'lr': 1.9215350750897197e-05, 'samples': 25230720, 'steps': 131409, 'loss/train': 1.6558901071548462} 11/07/2021 15:43:00 - INFO - __main__ - Step 131411: {'lr': 1.921331053304115e-05, 'samples': 25230912, 'steps': 131410, 'loss/train': 1.830091953277588} 11/07/2021 15:43:01 - INFO - __main__ - Step 131412: {'lr': 1.9211270419174032e-05, 'samples': 25231104, 'steps': 131411, 'loss/train': 0.9072646498680115} 11/07/2021 15:43:02 - INFO - __main__ - Step 131413: {'lr': 1.9209230409296757e-05, 'samples': 25231296, 'steps': 131412, 'loss/train': 1.2858967781066895} 11/07/2021 15:43:02 - INFO - __main__ - Step 131414: {'lr': 1.9207190503410272e-05, 'samples': 25231488, 'steps': 131413, 'loss/train': 1.1390897035598755} 11/07/2021 15:43:02 - INFO - __main__ - Step 131415: {'lr': 1.9205150701515435e-05, 'samples': 25231680, 'steps': 131414, 'loss/train': 1.2225172519683838} 11/07/2021 15:43:03 - INFO - __main__ - Step 131416: {'lr': 1.9203111003613188e-05, 'samples': 25231872, 'steps': 131415, 'loss/train': 0.9864172339439392} 11/07/2021 15:43:04 - INFO - __main__ - Step 131417: {'lr': 1.920107140970445e-05, 'samples': 25232064, 'steps': 131416, 'loss/train': 1.4414377212524414} 11/07/2021 15:43:04 - INFO - __main__ - Step 131418: {'lr': 1.9199031919790165e-05, 'samples': 25232256, 'steps': 131417, 'loss/train': 1.9999734163284302} 11/07/2021 15:43:05 - INFO - __main__ - Step 131419: {'lr': 1.919699253387122e-05, 'samples': 25232448, 'steps': 131418, 'loss/train': 1.4931206703186035} 11/07/2021 15:43:05 - INFO - __main__ - Step 131420: {'lr': 1.919495325194856e-05, 'samples': 25232640, 'steps': 131419, 'loss/train': 1.2342382669448853} 11/07/2021 15:43:05 - INFO - __main__ - Step 131421: {'lr': 1.91929140740231e-05, 'samples': 25232832, 'steps': 131420, 'loss/train': 1.2430592775344849} 11/07/2021 15:43:06 - INFO - __main__ - Step 131422: {'lr': 1.9190875000095726e-05, 'samples': 25233024, 'steps': 131421, 'loss/train': 1.4295393228530884} 11/07/2021 15:43:07 - INFO - __main__ - Step 131423: {'lr': 1.918883603016741e-05, 'samples': 25233216, 'steps': 131422, 'loss/train': 1.3293331861495972} 11/07/2021 15:43:07 - INFO - __main__ - Step 131424: {'lr': 1.9186797164239017e-05, 'samples': 25233408, 'steps': 131423, 'loss/train': 1.5001928806304932} 11/07/2021 15:43:07 - INFO - __main__ - Step 131425: {'lr': 1.9184758402311514e-05, 'samples': 25233600, 'steps': 131424, 'loss/train': 1.3919638395309448} 11/07/2021 15:43:08 - INFO - __main__ - Step 131426: {'lr': 1.9182719744385792e-05, 'samples': 25233792, 'steps': 131425, 'loss/train': 0.9304434657096863} 11/07/2021 15:43:08 - INFO - __main__ - Step 131427: {'lr': 1.9180681190462763e-05, 'samples': 25233984, 'steps': 131426, 'loss/train': 0.9860129356384277} 11/07/2021 15:43:09 - INFO - __main__ - Step 131428: {'lr': 1.91786427405434e-05, 'samples': 25234176, 'steps': 131427, 'loss/train': 1.5312163829803467} 11/07/2021 15:43:09 - INFO - __main__ - Step 131429: {'lr': 1.917660439462854e-05, 'samples': 25234368, 'steps': 131428, 'loss/train': 1.4481256008148193} 11/07/2021 15:43:10 - INFO - __main__ - Step 131430: {'lr': 1.9174566152719147e-05, 'samples': 25234560, 'steps': 131429, 'loss/train': 1.218659520149231} 11/07/2021 15:43:10 - INFO - __main__ - Step 131431: {'lr': 1.9172528014816114e-05, 'samples': 25234752, 'steps': 131430, 'loss/train': 1.2412928342819214} 11/07/2021 15:43:11 - INFO - __main__ - Step 131432: {'lr': 1.9170489980920415e-05, 'samples': 25234944, 'steps': 131431, 'loss/train': 0.7679175138473511} 11/07/2021 15:43:12 - INFO - __main__ - Step 131433: {'lr': 1.91684520510329e-05, 'samples': 25235136, 'steps': 131432, 'loss/train': 2.087444305419922} 11/07/2021 15:43:12 - INFO - __main__ - Step 131434: {'lr': 1.916641422515453e-05, 'samples': 25235328, 'steps': 131433, 'loss/train': 1.2357491254806519} 11/07/2021 15:43:12 - INFO - __main__ - Step 131435: {'lr': 1.91643765032862e-05, 'samples': 25235520, 'steps': 131434, 'loss/train': 1.7428202629089355} 11/07/2021 15:43:13 - INFO - __main__ - Step 131436: {'lr': 1.9162338885428844e-05, 'samples': 25235712, 'steps': 131435, 'loss/train': 1.0469361543655396} 11/07/2021 15:43:13 - INFO - __main__ - Step 131437: {'lr': 1.9160301371583392e-05, 'samples': 25235904, 'steps': 131436, 'loss/train': 1.4759202003479004} 11/07/2021 15:43:14 - INFO - __main__ - Step 131438: {'lr': 1.9158263961750744e-05, 'samples': 25236096, 'steps': 131437, 'loss/train': 1.1232593059539795} 11/07/2021 15:43:14 - INFO - __main__ - Step 131439: {'lr': 1.9156226655931807e-05, 'samples': 25236288, 'steps': 131438, 'loss/train': 1.0056103467941284} 11/07/2021 15:43:15 - INFO - __main__ - Step 131440: {'lr': 1.9154189454127503e-05, 'samples': 25236480, 'steps': 131439, 'loss/train': 1.346887469291687} 11/07/2021 15:43:15 - INFO - __main__ - Step 131441: {'lr': 1.9152152356338824e-05, 'samples': 25236672, 'steps': 131440, 'loss/train': 1.1458708047866821} 11/07/2021 15:43:15 - INFO - __main__ - Step 131442: {'lr': 1.9150115362566557e-05, 'samples': 25236864, 'steps': 131441, 'loss/train': 1.7100518941879272} 11/07/2021 15:43:16 - INFO - __main__ - Step 131443: {'lr': 1.914807847281172e-05, 'samples': 25237056, 'steps': 131442, 'loss/train': 1.4499043226242065} 11/07/2021 15:43:17 - INFO - __main__ - Step 131444: {'lr': 1.9146041687075178e-05, 'samples': 25237248, 'steps': 131443, 'loss/train': 1.0729366540908813} 11/07/2021 15:43:17 - INFO - __main__ - Step 131445: {'lr': 1.9144005005357845e-05, 'samples': 25237440, 'steps': 131444, 'loss/train': 0.48788201808929443} 11/07/2021 15:43:17 - INFO - __main__ - Step 131446: {'lr': 1.9141968427660694e-05, 'samples': 25237632, 'steps': 131445, 'loss/train': 1.118618369102478} 11/07/2021 15:43:18 - INFO - __main__ - Step 131447: {'lr': 1.9139931953984587e-05, 'samples': 25237824, 'steps': 131446, 'loss/train': 1.4170478582382202} 11/07/2021 15:43:19 - INFO - __main__ - Step 131448: {'lr': 1.9137895584330488e-05, 'samples': 25238016, 'steps': 131447, 'loss/train': 0.9148749709129333} 11/07/2021 15:43:19 - INFO - __main__ - Step 131449: {'lr': 1.9135859318699266e-05, 'samples': 25238208, 'steps': 131448, 'loss/train': 1.7261778116226196} 11/07/2021 15:43:19 - INFO - __main__ - Step 131450: {'lr': 1.913382315709189e-05, 'samples': 25238400, 'steps': 131449, 'loss/train': 1.5709718465805054} 11/07/2021 15:43:20 - INFO - __main__ - Step 131451: {'lr': 1.9131787099509217e-05, 'samples': 25238592, 'steps': 131450, 'loss/train': 1.2191002368927002} 11/07/2021 15:43:20 - INFO - __main__ - Step 131452: {'lr': 1.9129751145952224e-05, 'samples': 25238784, 'steps': 131451, 'loss/train': 1.1491342782974243} 11/07/2021 15:43:20 - INFO - __main__ - Step 131453: {'lr': 1.9127715296421793e-05, 'samples': 25238976, 'steps': 131452, 'loss/train': 1.4751681089401245} 11/07/2021 15:43:22 - INFO - __main__ - Step 131454: {'lr': 1.91256795509189e-05, 'samples': 25239168, 'steps': 131453, 'loss/train': 1.05500328540802} 11/07/2021 15:43:22 - INFO - __main__ - Step 131455: {'lr': 1.9123643909444376e-05, 'samples': 25239360, 'steps': 131454, 'loss/train': 1.4646339416503906} 11/07/2021 15:43:22 - INFO - __main__ - Step 131456: {'lr': 1.9121608371999166e-05, 'samples': 25239552, 'steps': 131455, 'loss/train': 1.7497096061706543} 11/07/2021 15:43:23 - INFO - __main__ - Step 131457: {'lr': 1.9119572938584184e-05, 'samples': 25239744, 'steps': 131456, 'loss/train': 1.1254191398620605} 11/07/2021 15:43:23 - INFO - __main__ - Step 131458: {'lr': 1.9117537609200376e-05, 'samples': 25239936, 'steps': 131457, 'loss/train': 1.3314396142959595} 11/07/2021 15:43:24 - INFO - __main__ - Step 131459: {'lr': 1.9115502383848653e-05, 'samples': 25240128, 'steps': 131458, 'loss/train': 1.419579029083252} 11/07/2021 15:43:24 - INFO - __main__ - Step 131460: {'lr': 1.911346726252991e-05, 'samples': 25240320, 'steps': 131459, 'loss/train': 1.2636586427688599} 11/07/2021 15:43:25 - INFO - __main__ - Step 131461: {'lr': 1.911143224524506e-05, 'samples': 25240512, 'steps': 131460, 'loss/train': 0.9146458506584167} 11/07/2021 15:43:25 - INFO - __main__ - Step 131462: {'lr': 1.9109397331995044e-05, 'samples': 25240704, 'steps': 131461, 'loss/train': 1.0812023878097534} 11/07/2021 15:43:25 - INFO - __main__ - Step 131463: {'lr': 1.910736252278078e-05, 'samples': 25240896, 'steps': 131462, 'loss/train': 1.1793638467788696} 11/07/2021 15:43:26 - INFO - __main__ - Step 131464: {'lr': 1.9105327817603186e-05, 'samples': 25241088, 'steps': 131463, 'loss/train': 1.311732530593872} 11/07/2021 15:43:27 - INFO - __main__ - Step 131465: {'lr': 1.9103293216463147e-05, 'samples': 25241280, 'steps': 131464, 'loss/train': 0.9103041291236877} 11/07/2021 15:43:27 - INFO - __main__ - Step 131466: {'lr': 1.9101258719361607e-05, 'samples': 25241472, 'steps': 131465, 'loss/train': 1.0835318565368652} 11/07/2021 15:43:28 - INFO - __main__ - Step 131467: {'lr': 1.9099224326299484e-05, 'samples': 25241664, 'steps': 131466, 'loss/train': 1.2969876527786255} 11/07/2021 15:43:28 - INFO - __main__ - Step 131468: {'lr': 1.9097190037277724e-05, 'samples': 25241856, 'steps': 131467, 'loss/train': 0.4565490186214447} 11/07/2021 15:43:29 - INFO - __main__ - Step 131469: {'lr': 1.9095155852297152e-05, 'samples': 25242048, 'steps': 131468, 'loss/train': 0.8175955414772034} 11/07/2021 15:43:29 - INFO - __main__ - Step 131470: {'lr': 1.9093121771358772e-05, 'samples': 25242240, 'steps': 131469, 'loss/train': 3.191619873046875} 11/07/2021 15:43:30 - INFO - __main__ - Step 131471: {'lr': 1.9091087794463445e-05, 'samples': 25242432, 'steps': 131470, 'loss/train': 1.3623853921890259} 11/07/2021 15:43:30 - INFO - __main__ - Step 131472: {'lr': 1.9089053921612116e-05, 'samples': 25242624, 'steps': 131471, 'loss/train': 1.3172471523284912} 11/07/2021 15:43:30 - INFO - __main__ - Step 131473: {'lr': 1.9087020152805696e-05, 'samples': 25242816, 'steps': 131472, 'loss/train': 1.2474617958068848} 11/07/2021 15:43:31 - INFO - __main__ - Step 131474: {'lr': 1.9084986488045103e-05, 'samples': 25243008, 'steps': 131473, 'loss/train': 1.1579875946044922} 11/07/2021 15:43:32 - INFO - __main__ - Step 131475: {'lr': 1.9082952927331226e-05, 'samples': 25243200, 'steps': 131474, 'loss/train': 1.7468712329864502} 11/07/2021 15:43:32 - INFO - __main__ - Step 131476: {'lr': 1.9080919470665035e-05, 'samples': 25243392, 'steps': 131475, 'loss/train': 1.3357973098754883} 11/07/2021 15:43:33 - INFO - __main__ - Step 131477: {'lr': 1.9078886118047424e-05, 'samples': 25243584, 'steps': 131476, 'loss/train': 1.4024995565414429} 11/07/2021 15:43:33 - INFO - __main__ - Step 131478: {'lr': 1.90768528694793e-05, 'samples': 25243776, 'steps': 131477, 'loss/train': 2.051501750946045} 11/07/2021 15:43:33 - INFO - __main__ - Step 131479: {'lr': 1.9074819724961555e-05, 'samples': 25243968, 'steps': 131478, 'loss/train': 1.3320688009262085} 11/07/2021 15:43:34 - INFO - __main__ - Step 131480: {'lr': 1.9072786684495165e-05, 'samples': 25244160, 'steps': 131479, 'loss/train': 0.832563042640686} 11/07/2021 15:43:35 - INFO - __main__ - Step 131481: {'lr': 1.907075374808104e-05, 'samples': 25244352, 'steps': 131480, 'loss/train': 1.061426043510437} 11/07/2021 15:43:35 - INFO - __main__ - Step 131482: {'lr': 1.906872091572004e-05, 'samples': 25244544, 'steps': 131481, 'loss/train': 1.1809455156326294} 11/07/2021 15:43:35 - INFO - __main__ - Step 131483: {'lr': 1.9066688187413113e-05, 'samples': 25244736, 'steps': 131482, 'loss/train': 1.324014663696289} 11/07/2021 15:43:36 - INFO - __main__ - Step 131484: {'lr': 1.9064655563161142e-05, 'samples': 25244928, 'steps': 131483, 'loss/train': 1.7359814643859863} 11/07/2021 15:43:37 - INFO - __main__ - Step 131485: {'lr': 1.9062623042965105e-05, 'samples': 25245120, 'steps': 131484, 'loss/train': 1.4157912731170654} 11/07/2021 15:43:37 - INFO - __main__ - Step 131486: {'lr': 1.9060590626825887e-05, 'samples': 25245312, 'steps': 131485, 'loss/train': 1.0958948135375977} 11/07/2021 15:43:37 - INFO - __main__ - Step 131487: {'lr': 1.9058558314744374e-05, 'samples': 25245504, 'steps': 131486, 'loss/train': 1.0114151239395142} 11/07/2021 15:43:38 - INFO - __main__ - Step 131488: {'lr': 1.9056526106721537e-05, 'samples': 25245696, 'steps': 131487, 'loss/train': 1.3939845561981201} 11/07/2021 15:43:38 - INFO - __main__ - Step 131489: {'lr': 1.9054494002758243e-05, 'samples': 25245888, 'steps': 131488, 'loss/train': 0.7927002906799316} 11/07/2021 15:43:40 - INFO - __main__ - Step 131490: {'lr': 1.9052462002855457e-05, 'samples': 25246080, 'steps': 131489, 'loss/train': 1.2220408916473389} 11/07/2021 15:43:40 - INFO - __main__ - Step 131491: {'lr': 1.9050430107014073e-05, 'samples': 25246272, 'steps': 131490, 'loss/train': 1.5867078304290771} 11/07/2021 15:43:40 - INFO - __main__ - Step 131492: {'lr': 1.9048398315234973e-05, 'samples': 25246464, 'steps': 131491, 'loss/train': 0.3572998344898224} 11/07/2021 15:43:41 - INFO - __main__ - Step 131493: {'lr': 1.9046366627519102e-05, 'samples': 25246656, 'steps': 131492, 'loss/train': 1.2310144901275635} 11/07/2021 15:43:41 - INFO - __main__ - Step 131494: {'lr': 1.9044335043867405e-05, 'samples': 25246848, 'steps': 131493, 'loss/train': 0.3631381392478943} 11/07/2021 15:43:41 - INFO - __main__ - Step 131495: {'lr': 1.9042303564280773e-05, 'samples': 25247040, 'steps': 131494, 'loss/train': 0.788165271282196} 11/07/2021 15:43:42 - INFO - __main__ - Step 131496: {'lr': 1.9040272188760088e-05, 'samples': 25247232, 'steps': 131495, 'loss/train': 1.7852379083633423} 11/07/2021 15:43:43 - INFO - __main__ - Step 131497: {'lr': 1.90382409173063e-05, 'samples': 25247424, 'steps': 131496, 'loss/train': 1.3354398012161255} 11/07/2021 15:43:43 - INFO - __main__ - Step 131498: {'lr': 1.903620974992032e-05, 'samples': 25247616, 'steps': 131497, 'loss/train': 1.2692339420318604} 11/07/2021 15:43:43 - INFO - __main__ - Step 131499: {'lr': 1.9034178686603038e-05, 'samples': 25247808, 'steps': 131498, 'loss/train': 1.1577204465866089} 11/07/2021 15:43:44 - INFO - __main__ - Step 131500: {'lr': 1.9032147727355397e-05, 'samples': 25248000, 'steps': 131499, 'loss/train': 1.08790922164917} 11/07/2021 15:43:45 - INFO - __main__ - Step 131501: {'lr': 1.9030116872178316e-05, 'samples': 25248192, 'steps': 131500, 'loss/train': 0.40532004833221436} 11/07/2021 15:43:45 - INFO - __main__ - Step 131502: {'lr': 1.9028086121072708e-05, 'samples': 25248384, 'steps': 131501, 'loss/train': 1.6776866912841797} 11/07/2021 15:43:46 - INFO - __main__ - Step 131503: {'lr': 1.9026055474039462e-05, 'samples': 25248576, 'steps': 131502, 'loss/train': 1.559747576713562} 11/07/2021 15:43:46 - INFO - __main__ - Step 131504: {'lr': 1.902402493107952e-05, 'samples': 25248768, 'steps': 131503, 'loss/train': 1.1935639381408691} 11/07/2021 15:43:46 - INFO - __main__ - Step 131505: {'lr': 1.902199449219377e-05, 'samples': 25248960, 'steps': 131504, 'loss/train': 1.6393297910690308} 11/07/2021 15:43:47 - INFO - __main__ - Step 131506: {'lr': 1.901996415738319e-05, 'samples': 25249152, 'steps': 131505, 'loss/train': 1.3244789838790894} 11/07/2021 15:43:48 - INFO - __main__ - Step 131507: {'lr': 1.9017933926648606e-05, 'samples': 25249344, 'steps': 131506, 'loss/train': 1.6119011640548706} 11/07/2021 15:43:48 - INFO - __main__ - Step 131508: {'lr': 1.9015903799991048e-05, 'samples': 25249536, 'steps': 131507, 'loss/train': 1.4729059934616089} 11/07/2021 15:43:48 - INFO - __main__ - Step 131509: {'lr': 1.9013873777411288e-05, 'samples': 25249728, 'steps': 131508, 'loss/train': 1.159115195274353} 11/07/2021 15:43:49 - INFO - __main__ - Step 131510: {'lr': 1.9011843858910332e-05, 'samples': 25249920, 'steps': 131509, 'loss/train': 1.6511880159378052} 11/07/2021 15:43:50 - INFO - __main__ - Step 131511: {'lr': 1.9009814044489064e-05, 'samples': 25250112, 'steps': 131510, 'loss/train': 1.4813921451568604} 11/07/2021 15:43:50 - INFO - __main__ - Step 131512: {'lr': 1.900778433414843e-05, 'samples': 25250304, 'steps': 131511, 'loss/train': 1.1814993619918823} 11/07/2021 15:43:50 - INFO - __main__ - Step 131513: {'lr': 1.900575472788929e-05, 'samples': 25250496, 'steps': 131512, 'loss/train': 1.3476656675338745} 11/07/2021 15:43:51 - INFO - __main__ - Step 131514: {'lr': 1.9003725225712614e-05, 'samples': 25250688, 'steps': 131513, 'loss/train': 1.3927117586135864} 11/07/2021 15:43:51 - INFO - __main__ - Step 131515: {'lr': 1.9001695827619292e-05, 'samples': 25250880, 'steps': 131514, 'loss/train': 1.083522915840149} 11/07/2021 15:43:52 - INFO - __main__ - Step 131516: {'lr': 1.8999666533610266e-05, 'samples': 25251072, 'steps': 131515, 'loss/train': 1.2999516725540161} 11/07/2021 15:43:53 - INFO - __main__ - Step 131517: {'lr': 1.8997637343686397e-05, 'samples': 25251264, 'steps': 131516, 'loss/train': 1.340255856513977} 11/07/2021 15:43:53 - INFO - __main__ - Step 131518: {'lr': 1.899560825784863e-05, 'samples': 25251456, 'steps': 131517, 'loss/train': 1.1757466793060303} 11/07/2021 15:43:53 - INFO - __main__ - Step 131519: {'lr': 1.8993579276097877e-05, 'samples': 25251648, 'steps': 131518, 'loss/train': 1.336388111114502} 11/07/2021 15:43:54 - INFO - __main__ - Step 131520: {'lr': 1.899155039843506e-05, 'samples': 25251840, 'steps': 131519, 'loss/train': 0.9771153926849365} 11/07/2021 15:43:55 - INFO - __main__ - Step 131521: {'lr': 1.898952162486109e-05, 'samples': 25252032, 'steps': 131520, 'loss/train': 1.068234920501709} 11/07/2021 15:43:55 - INFO - __main__ - Step 131522: {'lr': 1.898749295537691e-05, 'samples': 25252224, 'steps': 131521, 'loss/train': 1.030605435371399} 11/07/2021 15:43:55 - INFO - __main__ - Step 131523: {'lr': 1.898546438998336e-05, 'samples': 25252416, 'steps': 131522, 'loss/train': 0.5952191948890686} 11/07/2021 15:43:56 - INFO - __main__ - Step 131524: {'lr': 1.8983435928681375e-05, 'samples': 25252608, 'steps': 131523, 'loss/train': 1.5422556400299072} 11/07/2021 15:43:56 - INFO - __main__ - Step 131525: {'lr': 1.8981407571471903e-05, 'samples': 25252800, 'steps': 131524, 'loss/train': 1.05867600440979} 11/07/2021 15:43:57 - INFO - __main__ - Step 131526: {'lr': 1.8979379318355862e-05, 'samples': 25252992, 'steps': 131525, 'loss/train': 1.3030413389205933} 11/07/2021 15:43:58 - INFO - __main__ - Step 131527: {'lr': 1.8977351169334132e-05, 'samples': 25253184, 'steps': 131526, 'loss/train': 0.8978068232536316} 11/07/2021 15:43:58 - INFO - __main__ - Step 131528: {'lr': 1.897532312440764e-05, 'samples': 25253376, 'steps': 131527, 'loss/train': 0.8422261476516724} 11/07/2021 15:43:58 - INFO - __main__ - Step 131529: {'lr': 1.897329518357732e-05, 'samples': 25253568, 'steps': 131528, 'loss/train': 1.1526927947998047} 11/07/2021 15:43:59 - INFO - __main__ - Step 131530: {'lr': 1.8971267346844067e-05, 'samples': 25253760, 'steps': 131529, 'loss/train': 1.2312551736831665} 11/07/2021 15:43:59 - INFO - __main__ - Step 131531: {'lr': 1.8969239614208765e-05, 'samples': 25253952, 'steps': 131530, 'loss/train': 1.4876785278320312} 11/07/2021 15:44:00 - INFO - __main__ - Step 131532: {'lr': 1.896721198567239e-05, 'samples': 25254144, 'steps': 131531, 'loss/train': 0.5999032855033875} 11/07/2021 15:44:00 - INFO - __main__ - Step 131533: {'lr': 1.8965184461235825e-05, 'samples': 25254336, 'steps': 131532, 'loss/train': 1.3409557342529297} 11/07/2021 15:44:01 - INFO - __main__ - Step 131534: {'lr': 1.896315704089996e-05, 'samples': 25254528, 'steps': 131533, 'loss/train': 1.407349705696106} 11/07/2021 15:44:01 - INFO - __main__ - Step 131535: {'lr': 1.8961129724665794e-05, 'samples': 25254720, 'steps': 131534, 'loss/train': 1.2166821956634521} 11/07/2021 15:44:01 - INFO - __main__ - Step 131536: {'lr': 1.8959102512534105e-05, 'samples': 25254912, 'steps': 131535, 'loss/train': 1.4339362382888794} 11/07/2021 15:44:02 - INFO - __main__ - Step 131537: {'lr': 1.895707540450592e-05, 'samples': 25255104, 'steps': 131536, 'loss/train': 0.9247488975524902} 11/07/2021 15:44:03 - INFO - __main__ - Step 131538: {'lr': 1.895504840058207e-05, 'samples': 25255296, 'steps': 131537, 'loss/train': 0.5001038312911987} 11/07/2021 15:44:03 - INFO - __main__ - Step 131539: {'lr': 1.8953021500763558e-05, 'samples': 25255488, 'steps': 131538, 'loss/train': 1.6186712980270386} 11/07/2021 15:44:03 - INFO - __main__ - Step 131540: {'lr': 1.895099470505121e-05, 'samples': 25255680, 'steps': 131539, 'loss/train': 1.1644093990325928} 11/07/2021 15:44:04 - INFO - __main__ - Step 131541: {'lr': 1.8948968013446004e-05, 'samples': 25255872, 'steps': 131540, 'loss/train': 1.6155325174331665} 11/07/2021 15:44:05 - INFO - __main__ - Step 131542: {'lr': 1.89469414259488e-05, 'samples': 25256064, 'steps': 131541, 'loss/train': 1.4311670064926147} 11/07/2021 15:44:05 - INFO - __main__ - Step 131543: {'lr': 1.8944914942560565e-05, 'samples': 25256256, 'steps': 131542, 'loss/train': 1.1604056358337402} 11/07/2021 15:44:06 - INFO - __main__ - Step 131544: {'lr': 1.8942888563282163e-05, 'samples': 25256448, 'steps': 131543, 'loss/train': 1.4703625440597534} 11/07/2021 15:44:06 - INFO - __main__ - Step 131545: {'lr': 1.8940862288114536e-05, 'samples': 25256640, 'steps': 131544, 'loss/train': 1.1268996000289917} 11/07/2021 15:44:06 - INFO - __main__ - Step 131546: {'lr': 1.89388361170586e-05, 'samples': 25256832, 'steps': 131545, 'loss/train': 1.5610408782958984} 11/07/2021 15:44:07 - INFO - __main__ - Step 131547: {'lr': 1.8936810050115272e-05, 'samples': 25257024, 'steps': 131546, 'loss/train': 1.3867188692092896} 11/07/2021 15:44:08 - INFO - __main__ - Step 131548: {'lr': 1.893478408728544e-05, 'samples': 25257216, 'steps': 131547, 'loss/train': 1.027216911315918} 11/07/2021 15:44:08 - INFO - __main__ - Step 131549: {'lr': 1.8932758228570046e-05, 'samples': 25257408, 'steps': 131548, 'loss/train': 1.2698156833648682} 11/07/2021 15:44:09 - INFO - __main__ - Step 131550: {'lr': 1.893073247396998e-05, 'samples': 25257600, 'steps': 131549, 'loss/train': 0.7474846243858337} 11/07/2021 15:44:09 - INFO - __main__ - Step 131551: {'lr': 1.892870682348613e-05, 'samples': 25257792, 'steps': 131550, 'loss/train': 1.3829982280731201} 11/07/2021 15:44:09 - INFO - __main__ - Step 131552: {'lr': 1.8926681277119467e-05, 'samples': 25257984, 'steps': 131551, 'loss/train': 1.8408896923065186} 11/07/2021 15:44:10 - INFO - __main__ - Step 131553: {'lr': 1.892465583487085e-05, 'samples': 25258176, 'steps': 131552, 'loss/train': 1.6115421056747437} 11/07/2021 15:44:11 - INFO - __main__ - Step 131554: {'lr': 1.8922630496741228e-05, 'samples': 25258368, 'steps': 131553, 'loss/train': 0.23325401544570923} 11/07/2021 15:44:11 - INFO - __main__ - Step 131555: {'lr': 1.892060526273151e-05, 'samples': 25258560, 'steps': 131554, 'loss/train': 1.1151173114776611} 11/07/2021 15:44:11 - INFO - __main__ - Step 131556: {'lr': 1.891858013284259e-05, 'samples': 25258752, 'steps': 131555, 'loss/train': 0.9873237013816833} 11/07/2021 15:44:12 - INFO - __main__ - Step 131557: {'lr': 1.8916555107075405e-05, 'samples': 25258944, 'steps': 131556, 'loss/train': 1.5196353197097778} 11/07/2021 15:44:13 - INFO - __main__ - Step 131558: {'lr': 1.891453018543085e-05, 'samples': 25259136, 'steps': 131557, 'loss/train': 1.3354835510253906} 11/07/2021 15:44:13 - INFO - __main__ - Step 131559: {'lr': 1.8912505367909837e-05, 'samples': 25259328, 'steps': 131558, 'loss/train': 1.6291381120681763} 11/07/2021 15:44:13 - INFO - __main__ - Step 131560: {'lr': 1.8910480654513285e-05, 'samples': 25259520, 'steps': 131559, 'loss/train': 1.1248944997787476} 11/07/2021 15:44:14 - INFO - __main__ - Step 131561: {'lr': 1.8908456045242107e-05, 'samples': 25259712, 'steps': 131560, 'loss/train': 1.2879652976989746} 11/07/2021 15:44:14 - INFO - __main__ - Step 131562: {'lr': 1.8906431540097247e-05, 'samples': 25259904, 'steps': 131561, 'loss/train': 5.847967624664307} 11/07/2021 15:44:15 - INFO - __main__ - Step 131563: {'lr': 1.8904407139079565e-05, 'samples': 25260096, 'steps': 131562, 'loss/train': 1.8458682298660278} 11/07/2021 15:44:15 - INFO - __main__ - Step 131564: {'lr': 1.890238284218998e-05, 'samples': 25260288, 'steps': 131563, 'loss/train': 1.4922863245010376} 11/07/2021 15:44:16 - INFO - __main__ - Step 131565: {'lr': 1.8900358649429405e-05, 'samples': 25260480, 'steps': 131564, 'loss/train': 1.1008201837539673} 11/07/2021 15:44:16 - INFO - __main__ - Step 131566: {'lr': 1.8898334560798757e-05, 'samples': 25260672, 'steps': 131565, 'loss/train': 0.6764383912086487} 11/07/2021 15:44:17 - INFO - __main__ - Step 131567: {'lr': 1.8896310576298952e-05, 'samples': 25260864, 'steps': 131566, 'loss/train': 0.986129105091095} 11/07/2021 15:44:17 - INFO - __main__ - Step 131568: {'lr': 1.8894286695930934e-05, 'samples': 25261056, 'steps': 131567, 'loss/train': 1.4280216693878174} 11/07/2021 15:44:18 - INFO - __main__ - Step 131569: {'lr': 1.889226291969556e-05, 'samples': 25261248, 'steps': 131568, 'loss/train': 1.1873120069503784} 11/07/2021 15:44:18 - INFO - __main__ - Step 131570: {'lr': 1.8890239247593755e-05, 'samples': 25261440, 'steps': 131569, 'loss/train': 1.341163992881775} 11/07/2021 15:44:19 - INFO - __main__ - Step 131571: {'lr': 1.8888215679626454e-05, 'samples': 25261632, 'steps': 131570, 'loss/train': 1.3985933065414429} 11/07/2021 15:44:19 - INFO - __main__ - Step 131572: {'lr': 1.8886192215794573e-05, 'samples': 25261824, 'steps': 131571, 'loss/train': 1.2965041399002075} 11/07/2021 15:44:19 - INFO - __main__ - Step 131573: {'lr': 1.8884168856098975e-05, 'samples': 25262016, 'steps': 131572, 'loss/train': 0.7457861304283142} 11/07/2021 15:44:20 - INFO - __main__ - Step 131574: {'lr': 1.8882145600540633e-05, 'samples': 25262208, 'steps': 131573, 'loss/train': 1.4802491664886475} 11/07/2021 15:44:21 - INFO - __main__ - Step 131575: {'lr': 1.8880122449120463e-05, 'samples': 25262400, 'steps': 131574, 'loss/train': 1.3321906328201294} 11/07/2021 15:44:21 - INFO - __main__ - Step 131576: {'lr': 1.8878099401839293e-05, 'samples': 25262592, 'steps': 131575, 'loss/train': 1.1603516340255737} 11/07/2021 15:44:21 - INFO - __main__ - Step 131577: {'lr': 1.88760764586981e-05, 'samples': 25262784, 'steps': 131576, 'loss/train': 1.347609519958496} 11/07/2021 15:44:22 - INFO - __main__ - Step 131578: {'lr': 1.887405361969777e-05, 'samples': 25262976, 'steps': 131577, 'loss/train': 0.8147166967391968} 11/07/2021 15:44:23 - INFO - __main__ - Step 131579: {'lr': 1.8872030884839243e-05, 'samples': 25263168, 'steps': 131578, 'loss/train': 1.1866270303726196} 11/07/2021 15:44:23 - INFO - __main__ - Step 131580: {'lr': 1.887000825412338e-05, 'samples': 25263360, 'steps': 131579, 'loss/train': 1.3756386041641235} 11/07/2021 15:44:23 - INFO - __main__ - Step 131581: {'lr': 1.8867985727551163e-05, 'samples': 25263552, 'steps': 131580, 'loss/train': 2.22760009765625} 11/07/2021 15:44:24 - INFO - __main__ - Step 131582: {'lr': 1.886596330512344e-05, 'samples': 25263744, 'steps': 131581, 'loss/train': 1.7101434469223022} 11/07/2021 15:44:24 - INFO - __main__ - Step 131583: {'lr': 1.8863940986841133e-05, 'samples': 25263936, 'steps': 131582, 'loss/train': 1.6312408447265625} 11/07/2021 15:44:25 - INFO - __main__ - Step 131584: {'lr': 1.8861918772705182e-05, 'samples': 25264128, 'steps': 131583, 'loss/train': 0.9132221937179565} 11/07/2021 15:44:26 - INFO - __main__ - Step 131585: {'lr': 1.885989666271651e-05, 'samples': 25264320, 'steps': 131584, 'loss/train': 1.4059065580368042} 11/07/2021 15:44:26 - INFO - __main__ - Step 131586: {'lr': 1.885787465687597e-05, 'samples': 25264512, 'steps': 131585, 'loss/train': 1.2872591018676758} 11/07/2021 15:44:26 - INFO - __main__ - Step 131587: {'lr': 1.8855852755184505e-05, 'samples': 25264704, 'steps': 131586, 'loss/train': 1.257272720336914} 11/07/2021 15:44:27 - INFO - __main__ - Step 131588: {'lr': 1.8853830957643037e-05, 'samples': 25264896, 'steps': 131587, 'loss/train': 0.9115383625030518} 11/07/2021 15:44:27 - INFO - __main__ - Step 131589: {'lr': 1.8851809264252507e-05, 'samples': 25265088, 'steps': 131588, 'loss/train': 1.385115623474121} 11/07/2021 15:44:28 - INFO - __main__ - Step 131590: {'lr': 1.8849787675013747e-05, 'samples': 25265280, 'steps': 131589, 'loss/train': 1.4464895725250244} 11/07/2021 15:44:28 - INFO - __main__ - Step 131591: {'lr': 1.88477661899277e-05, 'samples': 25265472, 'steps': 131590, 'loss/train': 1.3931975364685059} 11/07/2021 15:44:29 - INFO - __main__ - Step 131592: {'lr': 1.8845744808995285e-05, 'samples': 25265664, 'steps': 131591, 'loss/train': 1.3956090211868286} 11/07/2021 15:44:29 - INFO - __main__ - Step 131593: {'lr': 1.8843723532217415e-05, 'samples': 25265856, 'steps': 131592, 'loss/train': 1.2335666418075562} 11/07/2021 15:44:29 - INFO - __main__ - Step 131594: {'lr': 1.884170235959498e-05, 'samples': 25266048, 'steps': 131593, 'loss/train': 0.6484060883522034} 11/07/2021 15:44:30 - INFO - __main__ - Step 131595: {'lr': 1.8839681291128924e-05, 'samples': 25266240, 'steps': 131594, 'loss/train': 0.983921468257904} 11/07/2021 15:44:31 - INFO - __main__ - Step 131596: {'lr': 1.8837660326820132e-05, 'samples': 25266432, 'steps': 131595, 'loss/train': 0.7379147410392761} 11/07/2021 15:44:31 - INFO - __main__ - Step 131597: {'lr': 1.8835639466669526e-05, 'samples': 25266624, 'steps': 131596, 'loss/train': 1.225166916847229} 11/07/2021 15:44:31 - INFO - __main__ - Step 131598: {'lr': 1.883361871067801e-05, 'samples': 25266816, 'steps': 131597, 'loss/train': 1.0319758653640747} 11/07/2021 15:44:32 - INFO - __main__ - Step 131599: {'lr': 1.8831598058846488e-05, 'samples': 25267008, 'steps': 131598, 'loss/train': 1.5812819004058838} 11/07/2021 15:44:33 - INFO - __main__ - Step 131600: {'lr': 1.8829577511175894e-05, 'samples': 25267200, 'steps': 131599, 'loss/train': 1.6550767421722412} 11/07/2021 15:44:33 - INFO - __main__ - Step 131601: {'lr': 1.8827557067667146e-05, 'samples': 25267392, 'steps': 131600, 'loss/train': 1.166285514831543} 11/07/2021 15:44:34 - INFO - __main__ - Step 131602: {'lr': 1.8825536728321157e-05, 'samples': 25267584, 'steps': 131601, 'loss/train': 1.249401569366455} 11/07/2021 15:44:34 - INFO - __main__ - Step 131603: {'lr': 1.8823516493138764e-05, 'samples': 25267776, 'steps': 131602, 'loss/train': 1.2997686862945557} 11/07/2021 15:44:34 - INFO - __main__ - Step 131604: {'lr': 1.882149636212094e-05, 'samples': 25267968, 'steps': 131603, 'loss/train': 1.3404661417007446} 11/07/2021 15:44:35 - INFO - __main__ - Step 131605: {'lr': 1.8819476335268566e-05, 'samples': 25268160, 'steps': 131604, 'loss/train': 1.6202346086502075} 11/07/2021 15:44:36 - INFO - __main__ - Step 131606: {'lr': 1.8817456412582563e-05, 'samples': 25268352, 'steps': 131605, 'loss/train': 1.8467684984207153} 11/07/2021 15:44:36 - INFO - __main__ - Step 131607: {'lr': 1.8815436594063872e-05, 'samples': 25268544, 'steps': 131606, 'loss/train': 1.531410574913025} 11/07/2021 15:44:36 - INFO - __main__ - Step 131608: {'lr': 1.8813416879713358e-05, 'samples': 25268736, 'steps': 131607, 'loss/train': 1.3557568788528442} 11/07/2021 15:44:37 - INFO - __main__ - Step 131609: {'lr': 1.881139726953196e-05, 'samples': 25268928, 'steps': 131608, 'loss/train': 1.7102313041687012} 11/07/2021 15:44:38 - INFO - __main__ - Step 131610: {'lr': 1.8809377763520598e-05, 'samples': 25269120, 'steps': 131609, 'loss/train': 0.908286988735199} 11/07/2021 15:44:38 - INFO - __main__ - Step 131611: {'lr': 1.8807358361680128e-05, 'samples': 25269312, 'steps': 131610, 'loss/train': 0.8180080652236938} 11/07/2021 15:44:39 - INFO - __main__ - Step 131612: {'lr': 1.8805339064011524e-05, 'samples': 25269504, 'steps': 131611, 'loss/train': 1.201716661453247} 11/07/2021 15:44:39 - INFO - __main__ - Step 131613: {'lr': 1.8803319870515643e-05, 'samples': 25269696, 'steps': 131612, 'loss/train': 1.2884960174560547} 11/07/2021 15:44:39 - INFO - __main__ - Step 131614: {'lr': 1.8801300781193465e-05, 'samples': 25269888, 'steps': 131613, 'loss/train': 1.100298523902893} 11/07/2021 15:44:40 - INFO - __main__ - Step 131615: {'lr': 1.8799281796045842e-05, 'samples': 25270080, 'steps': 131614, 'loss/train': 1.0882130861282349} 11/07/2021 15:44:41 - INFO - __main__ - Step 131616: {'lr': 1.8797262915073664e-05, 'samples': 25270272, 'steps': 131615, 'loss/train': 0.9160617589950562} 11/07/2021 15:44:41 - INFO - __main__ - Step 131617: {'lr': 1.8795244138277877e-05, 'samples': 25270464, 'steps': 131616, 'loss/train': 0.731616735458374} 11/07/2021 15:44:41 - INFO - __main__ - Step 131618: {'lr': 1.8793225465659368e-05, 'samples': 25270656, 'steps': 131617, 'loss/train': 1.275641679763794} 11/07/2021 15:44:42 - INFO - __main__ - Step 131619: {'lr': 1.879120689721908e-05, 'samples': 25270848, 'steps': 131618, 'loss/train': 0.5545017123222351} 11/07/2021 15:44:42 - INFO - __main__ - Step 131620: {'lr': 1.8789188432957933e-05, 'samples': 25271040, 'steps': 131619, 'loss/train': 1.4223902225494385} 11/07/2021 15:44:43 - INFO - __main__ - Step 131621: {'lr': 1.878717007287678e-05, 'samples': 25271232, 'steps': 131620, 'loss/train': 1.533160924911499} 11/07/2021 15:44:43 - INFO - __main__ - Step 131622: {'lr': 1.878515181697657e-05, 'samples': 25271424, 'steps': 131621, 'loss/train': 1.3661829233169556} 11/07/2021 15:44:44 - INFO - __main__ - Step 131623: {'lr': 1.878313366525819e-05, 'samples': 25271616, 'steps': 131622, 'loss/train': 1.7131344079971313} 11/07/2021 15:44:44 - INFO - __main__ - Step 131624: {'lr': 1.878111561772258e-05, 'samples': 25271808, 'steps': 131623, 'loss/train': 1.9104000329971313} 11/07/2021 15:44:44 - INFO - __main__ - Step 131625: {'lr': 1.8779097674370664e-05, 'samples': 25272000, 'steps': 131624, 'loss/train': 1.3743486404418945} 11/07/2021 15:44:45 - INFO - __main__ - Step 131626: {'lr': 1.877707983520327e-05, 'samples': 25272192, 'steps': 131625, 'loss/train': 1.5267452001571655} 11/07/2021 15:44:46 - INFO - __main__ - Step 131627: {'lr': 1.8775062100221367e-05, 'samples': 25272384, 'steps': 131626, 'loss/train': 1.338162899017334} 11/07/2021 15:44:46 - INFO - __main__ - Step 131628: {'lr': 1.8773044469425847e-05, 'samples': 25272576, 'steps': 131627, 'loss/train': 1.742396354675293} 11/07/2021 15:44:47 - INFO - __main__ - Step 131629: {'lr': 1.8771026942817627e-05, 'samples': 25272768, 'steps': 131628, 'loss/train': 1.2141820192337036} 11/07/2021 15:44:47 - INFO - __main__ - Step 131630: {'lr': 1.8769009520397616e-05, 'samples': 25272960, 'steps': 131629, 'loss/train': 1.5802667140960693} 11/07/2021 15:44:48 - INFO - __main__ - Step 131631: {'lr': 1.876699220216671e-05, 'samples': 25273152, 'steps': 131630, 'loss/train': 1.552351951599121} 11/07/2021 15:44:48 - INFO - __main__ - Step 131632: {'lr': 1.876497498812585e-05, 'samples': 25273344, 'steps': 131631, 'loss/train': 1.272195816040039} 11/07/2021 15:44:49 - INFO - __main__ - Step 131633: {'lr': 1.8762957878275895e-05, 'samples': 25273536, 'steps': 131632, 'loss/train': 0.8989475965499878} 11/07/2021 15:44:49 - INFO - __main__ - Step 131634: {'lr': 1.8760940872617816e-05, 'samples': 25273728, 'steps': 131633, 'loss/train': 1.3655929565429688} 11/07/2021 15:44:49 - INFO - __main__ - Step 131635: {'lr': 1.8758923971152473e-05, 'samples': 25273920, 'steps': 131634, 'loss/train': 1.415037989616394} 11/07/2021 15:44:50 - INFO - __main__ - Step 131636: {'lr': 1.8756907173880816e-05, 'samples': 25274112, 'steps': 131635, 'loss/train': 1.0770165920257568} 11/07/2021 15:44:51 - INFO - __main__ - Step 131637: {'lr': 1.87548904808037e-05, 'samples': 25274304, 'steps': 131636, 'loss/train': 1.3237643241882324} 11/07/2021 15:44:51 - INFO - __main__ - Step 131638: {'lr': 1.875287389192207e-05, 'samples': 25274496, 'steps': 131637, 'loss/train': 1.2069034576416016} 11/07/2021 15:44:51 - INFO - __main__ - Step 131639: {'lr': 1.875085740723684e-05, 'samples': 25274688, 'steps': 131638, 'loss/train': 1.5675654411315918} 11/07/2021 15:44:52 - INFO - __main__ - Step 131640: {'lr': 1.8748841026748868e-05, 'samples': 25274880, 'steps': 131639, 'loss/train': 1.2811588048934937} 11/07/2021 15:44:53 - INFO - __main__ - Step 131641: {'lr': 1.8746824750459133e-05, 'samples': 25275072, 'steps': 131640, 'loss/train': 1.0794920921325684} 11/07/2021 15:44:53 - INFO - __main__ - Step 131642: {'lr': 1.8744808578368495e-05, 'samples': 25275264, 'steps': 131641, 'loss/train': 1.187435269355774} 11/07/2021 15:44:54 - INFO - __main__ - Step 131643: {'lr': 1.8742792510477864e-05, 'samples': 25275456, 'steps': 131642, 'loss/train': 2.0622127056121826} 11/07/2021 15:44:54 - INFO - __main__ - Step 131644: {'lr': 1.8740776546788187e-05, 'samples': 25275648, 'steps': 131643, 'loss/train': 2.0396671295166016} 11/07/2021 15:44:54 - INFO - __main__ - Step 131645: {'lr': 1.8738760687300323e-05, 'samples': 25275840, 'steps': 131644, 'loss/train': 1.158564567565918} 11/07/2021 15:44:55 - INFO - __main__ - Step 131646: {'lr': 1.873674493201524e-05, 'samples': 25276032, 'steps': 131645, 'loss/train': 1.4781551361083984} 11/07/2021 15:44:56 - INFO - __main__ - Step 131647: {'lr': 1.873472928093381e-05, 'samples': 25276224, 'steps': 131646, 'loss/train': 1.0331119298934937} 11/07/2021 15:44:56 - INFO - __main__ - Step 131648: {'lr': 1.873271373405694e-05, 'samples': 25276416, 'steps': 131647, 'loss/train': 1.5127629041671753} 11/07/2021 15:44:57 - INFO - __main__ - Step 131649: {'lr': 1.873069829138552e-05, 'samples': 25276608, 'steps': 131648, 'loss/train': 1.4285963773727417} 11/07/2021 15:44:57 - INFO - __main__ - Step 131650: {'lr': 1.872868295292049e-05, 'samples': 25276800, 'steps': 131649, 'loss/train': 1.2163496017456055} 11/07/2021 15:44:57 - INFO - __main__ - Step 131651: {'lr': 1.8726667718662717e-05, 'samples': 25276992, 'steps': 131650, 'loss/train': 1.1816831827163696} 11/07/2021 15:44:58 - INFO - __main__ - Step 131652: {'lr': 1.8724652588613165e-05, 'samples': 25277184, 'steps': 131651, 'loss/train': 1.0976221561431885} 11/07/2021 15:44:59 - INFO - __main__ - Step 131653: {'lr': 1.8722637562772706e-05, 'samples': 25277376, 'steps': 131652, 'loss/train': 0.9459156394004822} 11/07/2021 15:44:59 - INFO - __main__ - Step 131654: {'lr': 1.8720622641142272e-05, 'samples': 25277568, 'steps': 131653, 'loss/train': 1.2354892492294312} 11/07/2021 15:44:59 - INFO - __main__ - Step 131655: {'lr': 1.8718607823722756e-05, 'samples': 25277760, 'steps': 131654, 'loss/train': 1.2139084339141846} 11/07/2021 15:45:00 - INFO - __main__ - Step 131656: {'lr': 1.8716593110515045e-05, 'samples': 25277952, 'steps': 131655, 'loss/train': 1.5435845851898193} 11/07/2021 15:45:01 - INFO - __main__ - Step 131657: {'lr': 1.8714578501520085e-05, 'samples': 25278144, 'steps': 131656, 'loss/train': 1.1105788946151733} 11/07/2021 15:45:01 - INFO - __main__ - Step 131658: {'lr': 1.8712563996738817e-05, 'samples': 25278336, 'steps': 131657, 'loss/train': 1.3661020994186401} 11/07/2021 15:45:02 - INFO - __main__ - Step 131659: {'lr': 1.8710549596172048e-05, 'samples': 25278528, 'steps': 131658, 'loss/train': 1.70622980594635} 11/07/2021 15:45:02 - INFO - __main__ - Step 131660: {'lr': 1.8708535299820723e-05, 'samples': 25278720, 'steps': 131659, 'loss/train': 1.2505031824111938} 11/07/2021 15:45:02 - INFO - __main__ - Step 131661: {'lr': 1.870652110768578e-05, 'samples': 25278912, 'steps': 131660, 'loss/train': 2.0926454067230225} 11/07/2021 15:45:03 - INFO - __main__ - Step 131662: {'lr': 1.8704507019768085e-05, 'samples': 25279104, 'steps': 131661, 'loss/train': 1.4397963285446167} 11/07/2021 15:45:04 - INFO - __main__ - Step 131663: {'lr': 1.8702493036068608e-05, 'samples': 25279296, 'steps': 131662, 'loss/train': 1.3961820602416992} 11/07/2021 15:45:04 - INFO - __main__ - Step 131664: {'lr': 1.870047915658818e-05, 'samples': 25279488, 'steps': 131663, 'loss/train': 0.4557932913303375} 11/07/2021 15:45:04 - INFO - __main__ - Step 131665: {'lr': 1.8698465381327773e-05, 'samples': 25279680, 'steps': 131664, 'loss/train': 1.2837941646575928} 11/07/2021 15:45:05 - INFO - __main__ - Step 131666: {'lr': 1.869645171028825e-05, 'samples': 25279872, 'steps': 131665, 'loss/train': 1.0307666063308716} 11/07/2021 15:45:06 - INFO - __main__ - Step 131667: {'lr': 1.8694438143470548e-05, 'samples': 25280064, 'steps': 131666, 'loss/train': 1.1878374814987183} 11/07/2021 15:45:06 - INFO - __main__ - Step 131668: {'lr': 1.8692424680875562e-05, 'samples': 25280256, 'steps': 131667, 'loss/train': 1.3968299627304077} 11/07/2021 15:45:06 - INFO - __main__ - Step 131669: {'lr': 1.869041132250421e-05, 'samples': 25280448, 'steps': 131668, 'loss/train': 1.5304059982299805} 11/07/2021 15:45:07 - INFO - __main__ - Step 131670: {'lr': 1.8688398068357426e-05, 'samples': 25280640, 'steps': 131669, 'loss/train': 1.12772536277771} 11/07/2021 15:45:07 - INFO - __main__ - Step 131671: {'lr': 1.8686384918436023e-05, 'samples': 25280832, 'steps': 131670, 'loss/train': 1.4357479810714722} 11/07/2021 15:45:08 - INFO - __main__ - Step 131672: {'lr': 1.8684371872740967e-05, 'samples': 25281024, 'steps': 131671, 'loss/train': 1.3946799039840698} 11/07/2021 15:45:09 - INFO - __main__ - Step 131673: {'lr': 1.868235893127318e-05, 'samples': 25281216, 'steps': 131672, 'loss/train': 0.8483985066413879} 11/07/2021 15:45:09 - INFO - __main__ - Step 131674: {'lr': 1.8680346094033546e-05, 'samples': 25281408, 'steps': 131673, 'loss/train': 1.059526801109314} 11/07/2021 15:45:09 - INFO - __main__ - Step 131675: {'lr': 1.867833336102298e-05, 'samples': 25281600, 'steps': 131674, 'loss/train': 1.486845850944519} 11/07/2021 15:45:10 - INFO - __main__ - Step 131676: {'lr': 1.8676320732242403e-05, 'samples': 25281792, 'steps': 131675, 'loss/train': 1.5259482860565186} 11/07/2021 15:45:10 - INFO - __main__ - Step 131677: {'lr': 1.867430820769267e-05, 'samples': 25281984, 'steps': 131676, 'loss/train': 1.6147394180297852} 11/07/2021 15:45:11 - INFO - __main__ - Step 131678: {'lr': 1.8672295787374754e-05, 'samples': 25282176, 'steps': 131677, 'loss/train': 1.0314993858337402} 11/07/2021 15:45:11 - INFO - __main__ - Step 131679: {'lr': 1.8670283471289518e-05, 'samples': 25282368, 'steps': 131678, 'loss/train': 1.2148511409759521} 11/07/2021 15:45:12 - INFO - __main__ - Step 131680: {'lr': 1.8668271259437873e-05, 'samples': 25282560, 'steps': 131679, 'loss/train': 1.523643970489502} 11/07/2021 15:45:12 - INFO - __main__ - Step 131681: {'lr': 1.8666259151820768e-05, 'samples': 25282752, 'steps': 131680, 'loss/train': 1.425205945968628} 11/07/2021 15:45:12 - INFO - __main__ - Step 131682: {'lr': 1.8664247148439033e-05, 'samples': 25282944, 'steps': 131681, 'loss/train': 1.063014268875122} 11/07/2021 15:45:13 - INFO - __main__ - Step 131683: {'lr': 1.8662235249293697e-05, 'samples': 25283136, 'steps': 131682, 'loss/train': 1.4342584609985352} 11/07/2021 15:45:14 - INFO - __main__ - Step 131684: {'lr': 1.8660223454385533e-05, 'samples': 25283328, 'steps': 131683, 'loss/train': 1.448857307434082} 11/07/2021 15:45:14 - INFO - __main__ - Step 131685: {'lr': 1.8658211763715514e-05, 'samples': 25283520, 'steps': 131684, 'loss/train': 1.1679415702819824} 11/07/2021 15:45:14 - INFO - __main__ - Step 131686: {'lr': 1.8656200177284503e-05, 'samples': 25283712, 'steps': 131685, 'loss/train': 1.4984078407287598} 11/07/2021 15:45:15 - INFO - __main__ - Step 131687: {'lr': 1.865418869509347e-05, 'samples': 25283904, 'steps': 131686, 'loss/train': 1.315793752670288} 11/07/2021 15:45:16 - INFO - __main__ - Step 131688: {'lr': 1.8652177317143275e-05, 'samples': 25284096, 'steps': 131687, 'loss/train': 1.6441401243209839} 11/07/2021 15:45:16 - INFO - __main__ - Step 131689: {'lr': 1.8650166043434837e-05, 'samples': 25284288, 'steps': 131688, 'loss/train': 1.0366002321243286} 11/07/2021 15:45:17 - INFO - __main__ - Step 131690: {'lr': 1.8648154873969093e-05, 'samples': 25284480, 'steps': 131689, 'loss/train': 0.8807560205459595} 11/07/2021 15:45:17 - INFO - __main__ - Step 131691: {'lr': 1.864614380874688e-05, 'samples': 25284672, 'steps': 131690, 'loss/train': 1.6778863668441772} 11/07/2021 15:45:17 - INFO - __main__ - Step 131692: {'lr': 1.864413284776917e-05, 'samples': 25284864, 'steps': 131691, 'loss/train': 0.7960876822471619} 11/07/2021 15:45:19 - INFO - __main__ - Step 131693: {'lr': 1.864212199103685e-05, 'samples': 25285056, 'steps': 131692, 'loss/train': 0.5110241770744324} 11/07/2021 15:45:19 - INFO - __main__ - Step 131694: {'lr': 1.8640111238550778e-05, 'samples': 25285248, 'steps': 131693, 'loss/train': 1.3290177583694458} 11/07/2021 15:45:19 - INFO - __main__ - Step 131695: {'lr': 1.863810059031193e-05, 'samples': 25285440, 'steps': 131694, 'loss/train': 1.227929711341858} 11/07/2021 15:45:20 - INFO - __main__ - Step 131696: {'lr': 1.8636090046321246e-05, 'samples': 25285632, 'steps': 131695, 'loss/train': 0.8904291987419128} 11/07/2021 15:45:20 - INFO - __main__ - Step 131697: {'lr': 1.863407960657951e-05, 'samples': 25285824, 'steps': 131696, 'loss/train': 1.1313408613204956} 11/07/2021 15:45:20 - INFO - __main__ - Step 131698: {'lr': 1.8632069271087683e-05, 'samples': 25286016, 'steps': 131697, 'loss/train': 1.1790122985839844} 11/07/2021 15:45:21 - INFO - __main__ - Step 131699: {'lr': 1.863005903984666e-05, 'samples': 25286208, 'steps': 131698, 'loss/train': 1.0878804922103882} 11/07/2021 15:45:22 - INFO - __main__ - Step 131700: {'lr': 1.8628048912857382e-05, 'samples': 25286400, 'steps': 131699, 'loss/train': 1.4610350131988525} 11/07/2021 15:45:22 - INFO - __main__ - Step 131701: {'lr': 1.862603889012074e-05, 'samples': 25286592, 'steps': 131700, 'loss/train': 0.8693850636482239} 11/07/2021 15:45:22 - INFO - __main__ - Step 131702: {'lr': 1.862402897163762e-05, 'samples': 25286784, 'steps': 131701, 'loss/train': 1.1360341310501099} 11/07/2021 15:45:23 - INFO - __main__ - Step 131703: {'lr': 1.8622019157408938e-05, 'samples': 25286976, 'steps': 131702, 'loss/train': 1.2245228290557861} 11/07/2021 15:45:24 - INFO - __main__ - Step 131704: {'lr': 1.8620009447435636e-05, 'samples': 25287168, 'steps': 131703, 'loss/train': 1.058889389038086} 11/07/2021 15:45:24 - INFO - __main__ - Step 131705: {'lr': 1.861799984171855e-05, 'samples': 25287360, 'steps': 131704, 'loss/train': 1.263048529624939} 11/07/2021 15:45:24 - INFO - __main__ - Step 131706: {'lr': 1.8615990340258653e-05, 'samples': 25287552, 'steps': 131705, 'loss/train': 1.1748013496398926} 11/07/2021 15:45:25 - INFO - __main__ - Step 131707: {'lr': 1.86139809430568e-05, 'samples': 25287744, 'steps': 131706, 'loss/train': 1.3143867254257202} 11/07/2021 15:45:25 - INFO - __main__ - Step 131708: {'lr': 1.8611971650113913e-05, 'samples': 25287936, 'steps': 131707, 'loss/train': 1.0860875844955444} 11/07/2021 15:45:26 - INFO - __main__ - Step 131709: {'lr': 1.8609962461430928e-05, 'samples': 25288128, 'steps': 131708, 'loss/train': 0.9178457856178284} 11/07/2021 15:45:27 - INFO - __main__ - Step 131710: {'lr': 1.860795337700874e-05, 'samples': 25288320, 'steps': 131709, 'loss/train': 1.2302616834640503} 11/07/2021 15:45:27 - INFO - __main__ - Step 131711: {'lr': 1.860594439684821e-05, 'samples': 25288512, 'steps': 131710, 'loss/train': 0.9009495973587036} 11/07/2021 15:45:27 - INFO - __main__ - Step 131712: {'lr': 1.8603935520950242e-05, 'samples': 25288704, 'steps': 131711, 'loss/train': 1.2606652975082397} 11/07/2021 15:45:28 - INFO - __main__ - Step 131713: {'lr': 1.8601926749315794e-05, 'samples': 25288896, 'steps': 131712, 'loss/train': 1.614911437034607} 11/07/2021 15:45:28 - INFO - __main__ - Step 131714: {'lr': 1.859991808194575e-05, 'samples': 25289088, 'steps': 131713, 'loss/train': 1.2866451740264893} 11/07/2021 15:45:29 - INFO - __main__ - Step 131715: {'lr': 1.859790951884102e-05, 'samples': 25289280, 'steps': 131714, 'loss/train': 1.4656563997268677} 11/07/2021 15:45:29 - INFO - __main__ - Step 131716: {'lr': 1.8595901060002474e-05, 'samples': 25289472, 'steps': 131715, 'loss/train': 1.4304317235946655} 11/07/2021 15:45:30 - INFO - __main__ - Step 131717: {'lr': 1.8593892705431075e-05, 'samples': 25289664, 'steps': 131716, 'loss/train': 0.9310936331748962} 11/07/2021 15:45:30 - INFO - __main__ - Step 131718: {'lr': 1.8591884455127662e-05, 'samples': 25289856, 'steps': 131717, 'loss/train': 1.8210041522979736} 11/07/2021 15:45:30 - INFO - __main__ - Step 131719: {'lr': 1.8589876309093202e-05, 'samples': 25290048, 'steps': 131718, 'loss/train': 1.1347987651824951} 11/07/2021 15:45:32 - INFO - __main__ - Step 131720: {'lr': 1.8587868267328556e-05, 'samples': 25290240, 'steps': 131719, 'loss/train': 0.9067209362983704} 11/07/2021 15:45:32 - INFO - __main__ - Step 131721: {'lr': 1.858586032983467e-05, 'samples': 25290432, 'steps': 131720, 'loss/train': 1.1566882133483887} 11/07/2021 15:45:32 - INFO - __main__ - Step 131722: {'lr': 1.8583852496612404e-05, 'samples': 25290624, 'steps': 131721, 'loss/train': 1.2294831275939941} 11/07/2021 15:45:33 - INFO - __main__ - Step 131723: {'lr': 1.8581844767662727e-05, 'samples': 25290816, 'steps': 131722, 'loss/train': 0.7430464625358582} 11/07/2021 15:45:33 - INFO - __main__ - Step 131724: {'lr': 1.8579837142986472e-05, 'samples': 25291008, 'steps': 131723, 'loss/train': 1.251355767250061} 11/07/2021 15:45:34 - INFO - __main__ - Step 131725: {'lr': 1.8577829622584557e-05, 'samples': 25291200, 'steps': 131724, 'loss/train': 1.485832691192627} 11/07/2021 15:45:34 - INFO - __main__ - Step 131726: {'lr': 1.8575822206457897e-05, 'samples': 25291392, 'steps': 131725, 'loss/train': 1.3245370388031006} 11/07/2021 15:45:35 - INFO - __main__ - Step 131727: {'lr': 1.857381489460741e-05, 'samples': 25291584, 'steps': 131726, 'loss/train': 1.0937796831130981} 11/07/2021 15:45:35 - INFO - __main__ - Step 131728: {'lr': 1.857180768703398e-05, 'samples': 25291776, 'steps': 131727, 'loss/train': 0.9849844574928284} 11/07/2021 15:45:35 - INFO - __main__ - Step 131729: {'lr': 1.8569800583738556e-05, 'samples': 25291968, 'steps': 131728, 'loss/train': 0.9425235390663147} 11/07/2021 15:45:37 - INFO - __main__ - Step 131730: {'lr': 1.8567793584721964e-05, 'samples': 25292160, 'steps': 131729, 'loss/train': 1.5039927959442139} 11/07/2021 15:45:37 - INFO - __main__ - Step 131731: {'lr': 1.856578668998518e-05, 'samples': 25292352, 'steps': 131730, 'loss/train': 0.9435480833053589} 11/07/2021 15:45:37 - INFO - __main__ - Step 131732: {'lr': 1.8563779899529064e-05, 'samples': 25292544, 'steps': 131731, 'loss/train': 1.5279086828231812} 11/07/2021 15:45:38 - INFO - __main__ - Step 131733: {'lr': 1.856177321335456e-05, 'samples': 25292736, 'steps': 131732, 'loss/train': 1.2577389478683472} 11/07/2021 15:45:38 - INFO - __main__ - Step 131734: {'lr': 1.8559766631462525e-05, 'samples': 25292928, 'steps': 131733, 'loss/train': 0.9272286891937256} 11/07/2021 15:45:38 - INFO - __main__ - Step 131735: {'lr': 1.855776015385391e-05, 'samples': 25293120, 'steps': 131734, 'loss/train': 0.049869105219841} 11/07/2021 15:45:39 - INFO - __main__ - Step 131736: {'lr': 1.8555753780529593e-05, 'samples': 25293312, 'steps': 131735, 'loss/train': 1.3558752536773682} 11/07/2021 15:45:40 - INFO - __main__ - Step 131737: {'lr': 1.8553747511490498e-05, 'samples': 25293504, 'steps': 131736, 'loss/train': 1.5060820579528809} 11/07/2021 15:45:40 - INFO - __main__ - Step 131738: {'lr': 1.855174134673751e-05, 'samples': 25293696, 'steps': 131737, 'loss/train': 0.5530993342399597} 11/07/2021 15:45:41 - INFO - __main__ - Step 131739: {'lr': 1.8549735286271517e-05, 'samples': 25293888, 'steps': 131738, 'loss/train': 1.540128469467163} 11/07/2021 15:45:41 - INFO - __main__ - Step 131740: {'lr': 1.8547729330093437e-05, 'samples': 25294080, 'steps': 131739, 'loss/train': 1.2349470853805542} 11/07/2021 15:45:42 - INFO - __main__ - Step 131741: {'lr': 1.8545723478204184e-05, 'samples': 25294272, 'steps': 131740, 'loss/train': 1.0699876546859741} 11/07/2021 15:45:42 - INFO - __main__ - Step 131742: {'lr': 1.8543717730604647e-05, 'samples': 25294464, 'steps': 131741, 'loss/train': 1.447965383529663} 11/07/2021 15:45:43 - INFO - __main__ - Step 131743: {'lr': 1.8541712087295743e-05, 'samples': 25294656, 'steps': 131742, 'loss/train': 1.2555694580078125} 11/07/2021 15:45:43 - INFO - __main__ - Step 131744: {'lr': 1.8539706548278383e-05, 'samples': 25294848, 'steps': 131743, 'loss/train': 1.1638716459274292} 11/07/2021 15:45:43 - INFO - __main__ - Step 131745: {'lr': 1.8537701113553464e-05, 'samples': 25295040, 'steps': 131744, 'loss/train': 0.8875115513801575} 11/07/2021 15:45:45 - INFO - __main__ - Step 131746: {'lr': 1.8535695783121864e-05, 'samples': 25295232, 'steps': 131745, 'loss/train': 1.7675563097000122} 11/07/2021 15:45:45 - INFO - __main__ - Step 131747: {'lr': 1.8533690556984507e-05, 'samples': 25295424, 'steps': 131746, 'loss/train': 1.393232822418213} 11/07/2021 15:45:45 - INFO - __main__ - Step 131748: {'lr': 1.8531685435142302e-05, 'samples': 25295616, 'steps': 131747, 'loss/train': 1.3404932022094727} 11/07/2021 15:45:46 - INFO - __main__ - Step 131749: {'lr': 1.8529680417596173e-05, 'samples': 25295808, 'steps': 131748, 'loss/train': 1.6626217365264893} 11/07/2021 15:45:46 - INFO - __main__ - Step 131750: {'lr': 1.852767550434703e-05, 'samples': 25296000, 'steps': 131749, 'loss/train': 1.0990016460418701} 11/07/2021 15:45:47 - INFO - __main__ - Step 131751: {'lr': 1.852567069539568e-05, 'samples': 25296192, 'steps': 131750, 'loss/train': 0.8495815396308899} 11/07/2021 15:45:48 - INFO - __main__ - Step 131752: {'lr': 1.8523665990743093e-05, 'samples': 25296384, 'steps': 131751, 'loss/train': 2.034224510192871} 11/07/2021 15:45:48 - INFO - __main__ - Step 131753: {'lr': 1.8521661390390187e-05, 'samples': 25296576, 'steps': 131752, 'loss/train': 1.3091024160385132} 11/07/2021 15:45:48 - INFO - __main__ - Step 131754: {'lr': 1.8519656894337848e-05, 'samples': 25296768, 'steps': 131753, 'loss/train': 0.6036501526832581} 11/07/2021 15:45:49 - INFO - __main__ - Step 131755: {'lr': 1.8517652502586968e-05, 'samples': 25296960, 'steps': 131754, 'loss/train': 1.2833210229873657} 11/07/2021 15:45:49 - INFO - __main__ - Step 131756: {'lr': 1.8515648215138485e-05, 'samples': 25297152, 'steps': 131755, 'loss/train': 1.2633824348449707} 11/07/2021 15:45:50 - INFO - __main__ - Step 131757: {'lr': 1.8513644031993267e-05, 'samples': 25297344, 'steps': 131756, 'loss/train': 1.4328935146331787} 11/07/2021 15:45:50 - INFO - __main__ - Step 131758: {'lr': 1.851163995315222e-05, 'samples': 25297536, 'steps': 131757, 'loss/train': 1.272261619567871} 11/07/2021 15:45:51 - INFO - __main__ - Step 131759: {'lr': 1.850963597861627e-05, 'samples': 25297728, 'steps': 131758, 'loss/train': 1.1430184841156006} 11/07/2021 15:45:51 - INFO - __main__ - Step 131760: {'lr': 1.8507632108386268e-05, 'samples': 25297920, 'steps': 131759, 'loss/train': 1.3780276775360107} 11/07/2021 15:45:51 - INFO - __main__ - Step 131761: {'lr': 1.8505628342463194e-05, 'samples': 25298112, 'steps': 131760, 'loss/train': 1.1137990951538086} 11/07/2021 15:45:52 - INFO - __main__ - Step 131762: {'lr': 1.85036246808479e-05, 'samples': 25298304, 'steps': 131761, 'loss/train': 1.430168628692627} 11/07/2021 15:45:53 - INFO - __main__ - Step 131763: {'lr': 1.8501621123541314e-05, 'samples': 25298496, 'steps': 131762, 'loss/train': 0.6474961638450623} 11/07/2021 15:45:53 - INFO - __main__ - Step 131764: {'lr': 1.8499617670544337e-05, 'samples': 25298688, 'steps': 131763, 'loss/train': 0.8444777131080627} 11/07/2021 15:45:54 - INFO - __main__ - Step 131765: {'lr': 1.849761432185784e-05, 'samples': 25298880, 'steps': 131764, 'loss/train': 1.5858603715896606} 11/07/2021 15:45:54 - INFO - __main__ - Step 131766: {'lr': 1.849561107748274e-05, 'samples': 25299072, 'steps': 131765, 'loss/train': 1.1282801628112793} 11/07/2021 15:45:55 - INFO - __main__ - Step 131767: {'lr': 1.8493607937419943e-05, 'samples': 25299264, 'steps': 131766, 'loss/train': 0.8848604559898376} 11/07/2021 15:45:55 - INFO - __main__ - Step 131768: {'lr': 1.8491604901670372e-05, 'samples': 25299456, 'steps': 131767, 'loss/train': 1.1753002405166626} 11/07/2021 15:45:56 - INFO - __main__ - Step 131769: {'lr': 1.8489601970234888e-05, 'samples': 25299648, 'steps': 131768, 'loss/train': 1.0553748607635498} 11/07/2021 15:45:56 - INFO - __main__ - Step 131770: {'lr': 1.8487599143114432e-05, 'samples': 25299840, 'steps': 131769, 'loss/train': 1.1759330034255981} 11/07/2021 15:45:56 - INFO - __main__ - Step 131771: {'lr': 1.8485596420309892e-05, 'samples': 25300032, 'steps': 131770, 'loss/train': 1.352189064025879} 11/07/2021 15:45:57 - INFO - __main__ - Step 131772: {'lr': 1.848359380182216e-05, 'samples': 25300224, 'steps': 131771, 'loss/train': 0.882630467414856} 11/07/2021 15:45:58 - INFO - __main__ - Step 131773: {'lr': 1.8481591287652143e-05, 'samples': 25300416, 'steps': 131772, 'loss/train': 0.9387694001197815} 11/07/2021 15:45:58 - INFO - __main__ - Step 131774: {'lr': 1.8479588877800767e-05, 'samples': 25300608, 'steps': 131773, 'loss/train': 1.1675946712493896} 11/07/2021 15:45:59 - INFO - __main__ - Step 131775: {'lr': 1.847758657226889e-05, 'samples': 25300800, 'steps': 131774, 'loss/train': 1.411636471748352} 11/07/2021 15:45:59 - INFO - __main__ - Step 131776: {'lr': 1.8475584371057452e-05, 'samples': 25300992, 'steps': 131775, 'loss/train': 0.9498498439788818} 11/07/2021 15:46:00 - INFO - __main__ - Step 131777: {'lr': 1.84735822741674e-05, 'samples': 25301184, 'steps': 131776, 'loss/train': 1.553251028060913} 11/07/2021 15:46:00 - INFO - __main__ - Step 131778: {'lr': 1.847158028159954e-05, 'samples': 25301376, 'steps': 131777, 'loss/train': 0.888656497001648} 11/07/2021 15:46:01 - INFO - __main__ - Step 131779: {'lr': 1.846957839335478e-05, 'samples': 25301568, 'steps': 131778, 'loss/train': 1.5795438289642334} 11/07/2021 15:46:01 - INFO - __main__ - Step 131780: {'lr': 1.8467576609434073e-05, 'samples': 25301760, 'steps': 131779, 'loss/train': 1.4719346761703491} 11/07/2021 15:46:01 - INFO - __main__ - Step 131781: {'lr': 1.8465574929838302e-05, 'samples': 25301952, 'steps': 131780, 'loss/train': 1.167454719543457} 11/07/2021 15:46:02 - INFO - __main__ - Step 131782: {'lr': 1.8463573354568385e-05, 'samples': 25302144, 'steps': 131781, 'loss/train': 1.4467779397964478} 11/07/2021 15:46:04 - INFO - __main__ - Step 131783: {'lr': 1.8461571883625184e-05, 'samples': 25302336, 'steps': 131782, 'loss/train': 1.3558696508407593} 11/07/2021 15:46:04 - INFO - __main__ - Step 131784: {'lr': 1.845957051700964e-05, 'samples': 25302528, 'steps': 131783, 'loss/train': 2.0193634033203125} 11/07/2021 15:46:05 - INFO - __main__ - Step 131785: {'lr': 1.845756925472264e-05, 'samples': 25302720, 'steps': 131784, 'loss/train': 1.2730047702789307} 11/07/2021 15:46:05 - INFO - __main__ - Step 131786: {'lr': 1.84555680967651e-05, 'samples': 25302912, 'steps': 131785, 'loss/train': 1.4871145486831665} 11/07/2021 15:46:05 - INFO - __main__ - Step 131787: {'lr': 1.8453567043137886e-05, 'samples': 25303104, 'steps': 131786, 'loss/train': 1.2362908124923706} 11/07/2021 15:46:06 - INFO - __main__ - Step 131788: {'lr': 1.8451566093841938e-05, 'samples': 25303296, 'steps': 131787, 'loss/train': 1.1524837017059326} 11/07/2021 15:46:06 - INFO - __main__ - Step 131789: {'lr': 1.8449565248878115e-05, 'samples': 25303488, 'steps': 131788, 'loss/train': 0.7900775671005249} 11/07/2021 15:46:06 - INFO - __main__ - Step 131790: {'lr': 1.8447564508247362e-05, 'samples': 25303680, 'steps': 131789, 'loss/train': 0.6894140839576721} 11/07/2021 15:46:08 - INFO - __main__ - Step 131791: {'lr': 1.844556387195062e-05, 'samples': 25303872, 'steps': 131790, 'loss/train': 0.8724527359008789} 11/07/2021 15:46:08 - INFO - __main__ - Step 131792: {'lr': 1.8443563339988674e-05, 'samples': 25304064, 'steps': 131791, 'loss/train': 1.4811177253723145} 11/07/2021 15:46:08 - INFO - __main__ - Step 131793: {'lr': 1.8441562912362487e-05, 'samples': 25304256, 'steps': 131792, 'loss/train': 1.7049294710159302} 11/07/2021 15:46:09 - INFO - __main__ - Step 131794: {'lr': 1.843956258907295e-05, 'samples': 25304448, 'steps': 131793, 'loss/train': 1.51331627368927} 11/07/2021 15:46:09 - INFO - __main__ - Step 131795: {'lr': 1.843756237012098e-05, 'samples': 25304640, 'steps': 131794, 'loss/train': 1.2398231029510498} 11/07/2021 15:46:10 - INFO - __main__ - Step 131796: {'lr': 1.843556225550749e-05, 'samples': 25304832, 'steps': 131795, 'loss/train': 1.9917806386947632} 11/07/2021 15:46:10 - INFO - __main__ - Step 131797: {'lr': 1.843356224523335e-05, 'samples': 25305024, 'steps': 131796, 'loss/train': 1.2315713167190552} 11/07/2021 15:46:11 - INFO - __main__ - Step 131798: {'lr': 1.843156233929946e-05, 'samples': 25305216, 'steps': 131797, 'loss/train': 0.9363423585891724} 11/07/2021 15:46:11 - INFO - __main__ - Step 131799: {'lr': 1.842956253770675e-05, 'samples': 25305408, 'steps': 131798, 'loss/train': 1.3812265396118164} 11/07/2021 15:46:11 - INFO - __main__ - Step 131800: {'lr': 1.84275628404561e-05, 'samples': 25305600, 'steps': 131799, 'loss/train': 1.6951788663864136} 11/07/2021 15:46:12 - INFO - __main__ - Step 131801: {'lr': 1.8425563247548405e-05, 'samples': 25305792, 'steps': 131800, 'loss/train': 1.295752763748169} 11/07/2021 15:46:13 - INFO - __main__ - Step 131802: {'lr': 1.84235637589846e-05, 'samples': 25305984, 'steps': 131801, 'loss/train': 0.5526770353317261} 11/07/2021 15:46:13 - INFO - __main__ - Step 131803: {'lr': 1.8421564374765555e-05, 'samples': 25306176, 'steps': 131802, 'loss/train': 1.7827376127243042} 11/07/2021 15:46:13 - INFO - __main__ - Step 131804: {'lr': 1.8419565094892234e-05, 'samples': 25306368, 'steps': 131803, 'loss/train': 1.228957176208496} 11/07/2021 15:46:14 - INFO - __main__ - Step 131805: {'lr': 1.8417565919365413e-05, 'samples': 25306560, 'steps': 131804, 'loss/train': 0.4928780496120453} 11/07/2021 15:46:15 - INFO - __main__ - Step 131806: {'lr': 1.841556684818607e-05, 'samples': 25306752, 'steps': 131805, 'loss/train': 1.1055917739868164} 11/07/2021 15:46:15 - INFO - __main__ - Step 131807: {'lr': 1.841356788135512e-05, 'samples': 25306944, 'steps': 131806, 'loss/train': 1.2688441276550293} 11/07/2021 15:46:15 - INFO - __main__ - Step 131808: {'lr': 1.841156901887342e-05, 'samples': 25307136, 'steps': 131807, 'loss/train': 1.5334680080413818} 11/07/2021 15:46:16 - INFO - __main__ - Step 131809: {'lr': 1.8409570260741914e-05, 'samples': 25307328, 'steps': 131808, 'loss/train': 1.0901997089385986} 11/07/2021 15:46:16 - INFO - __main__ - Step 131810: {'lr': 1.8407571606961466e-05, 'samples': 25307520, 'steps': 131809, 'loss/train': 1.3423930406570435} 11/07/2021 15:46:17 - INFO - __main__ - Step 131811: {'lr': 1.8405573057532987e-05, 'samples': 25307712, 'steps': 131810, 'loss/train': 1.1083914041519165} 11/07/2021 15:46:18 - INFO - __main__ - Step 131812: {'lr': 1.8403574612457398e-05, 'samples': 25307904, 'steps': 131811, 'loss/train': 0.8714402914047241} 11/07/2021 15:46:18 - INFO - __main__ - Step 131813: {'lr': 1.840157627173558e-05, 'samples': 25308096, 'steps': 131812, 'loss/train': 1.2437074184417725} 11/07/2021 15:46:18 - INFO - __main__ - Step 131814: {'lr': 1.839957803536846e-05, 'samples': 25308288, 'steps': 131813, 'loss/train': 1.3085490465164185} 11/07/2021 15:46:19 - INFO - __main__ - Step 131815: {'lr': 1.839757990335689e-05, 'samples': 25308480, 'steps': 131814, 'loss/train': 1.332926630973816} 11/07/2021 15:46:19 - INFO - __main__ - Step 131816: {'lr': 1.8395581875701782e-05, 'samples': 25308672, 'steps': 131815, 'loss/train': 1.3422642946243286} 11/07/2021 15:46:20 - INFO - __main__ - Step 131817: {'lr': 1.839358395240412e-05, 'samples': 25308864, 'steps': 131816, 'loss/train': 1.2576899528503418} 11/07/2021 15:46:20 - INFO - __main__ - Step 131818: {'lr': 1.8391586133464698e-05, 'samples': 25309056, 'steps': 131817, 'loss/train': 1.369944453239441} 11/07/2021 15:46:21 - INFO - __main__ - Step 131819: {'lr': 1.8389588418884438e-05, 'samples': 25309248, 'steps': 131818, 'loss/train': 1.3393431901931763} 11/07/2021 15:46:21 - INFO - __main__ - Step 131820: {'lr': 1.8387590808664255e-05, 'samples': 25309440, 'steps': 131819, 'loss/train': 1.3842216730117798} 11/07/2021 15:46:21 - INFO - __main__ - Step 131821: {'lr': 1.8385593302805066e-05, 'samples': 25309632, 'steps': 131820, 'loss/train': 0.8435797691345215} 11/07/2021 15:46:22 - INFO - __main__ - Step 131822: {'lr': 1.8383595901307726e-05, 'samples': 25309824, 'steps': 131821, 'loss/train': 1.072823166847229} 11/07/2021 15:46:23 - INFO - __main__ - Step 131823: {'lr': 1.8381598604173183e-05, 'samples': 25310016, 'steps': 131822, 'loss/train': 1.4107868671417236} 11/07/2021 15:46:23 - INFO - __main__ - Step 131824: {'lr': 1.8379601411402326e-05, 'samples': 25310208, 'steps': 131823, 'loss/train': 1.1971330642700195} 11/07/2021 15:46:24 - INFO - __main__ - Step 131825: {'lr': 1.837760432299601e-05, 'samples': 25310400, 'steps': 131824, 'loss/train': 1.2649866342544556} 11/07/2021 15:46:24 - INFO - __main__ - Step 131826: {'lr': 1.8375607338955213e-05, 'samples': 25310592, 'steps': 131825, 'loss/train': 0.9717959761619568} 11/07/2021 15:46:25 - INFO - __main__ - Step 131827: {'lr': 1.8373610459280767e-05, 'samples': 25310784, 'steps': 131826, 'loss/train': 1.7483882904052734} 11/07/2021 15:46:25 - INFO - __main__ - Step 131828: {'lr': 1.837161368397361e-05, 'samples': 25310976, 'steps': 131827, 'loss/train': 1.096676230430603} 11/07/2021 15:46:26 - INFO - __main__ - Step 131829: {'lr': 1.8369617013034635e-05, 'samples': 25311168, 'steps': 131828, 'loss/train': 1.5287917852401733} 11/07/2021 15:46:26 - INFO - __main__ - Step 131830: {'lr': 1.836762044646473e-05, 'samples': 25311360, 'steps': 131829, 'loss/train': 1.0104538202285767} 11/07/2021 15:46:26 - INFO - __main__ - Step 131831: {'lr': 1.8365623984264833e-05, 'samples': 25311552, 'steps': 131830, 'loss/train': 0.9540254473686218} 11/07/2021 15:46:27 - INFO - __main__ - Step 131832: {'lr': 1.8363627626435758e-05, 'samples': 25311744, 'steps': 131831, 'loss/train': 0.39652204513549805} 11/07/2021 15:46:28 - INFO - __main__ - Step 131833: {'lr': 1.83616313729785e-05, 'samples': 25311936, 'steps': 131832, 'loss/train': 1.6577885150909424} 11/07/2021 15:46:28 - INFO - __main__ - Step 131834: {'lr': 1.835963522389389e-05, 'samples': 25312128, 'steps': 131833, 'loss/train': 1.1211808919906616} 11/07/2021 15:46:28 - INFO - __main__ - Step 131835: {'lr': 1.8357639179182846e-05, 'samples': 25312320, 'steps': 131834, 'loss/train': 0.8930896520614624} 11/07/2021 15:46:29 - INFO - __main__ - Step 131836: {'lr': 1.835564323884628e-05, 'samples': 25312512, 'steps': 131835, 'loss/train': 1.193034291267395} 11/07/2021 15:46:30 - INFO - __main__ - Step 131837: {'lr': 1.8353647402885114e-05, 'samples': 25312704, 'steps': 131836, 'loss/train': 0.6603561043739319} 11/07/2021 15:46:30 - INFO - __main__ - Step 131838: {'lr': 1.835165167130018e-05, 'samples': 25312896, 'steps': 131837, 'loss/train': 1.1458083391189575} 11/07/2021 15:46:31 - INFO - __main__ - Step 131839: {'lr': 1.8349656044092444e-05, 'samples': 25313088, 'steps': 131838, 'loss/train': 1.5488442182540894} 11/07/2021 15:46:31 - INFO - __main__ - Step 131840: {'lr': 1.8347660521262772e-05, 'samples': 25313280, 'steps': 131839, 'loss/train': 1.1646034717559814} 11/07/2021 15:46:31 - INFO - __main__ - Step 131841: {'lr': 1.8345665102812077e-05, 'samples': 25313472, 'steps': 131840, 'loss/train': 1.4485328197479248} 11/07/2021 15:46:32 - INFO - __main__ - Step 131842: {'lr': 1.8343669788741245e-05, 'samples': 25313664, 'steps': 131841, 'loss/train': 1.4212218523025513} 11/07/2021 15:46:33 - INFO - __main__ - Step 131843: {'lr': 1.834167457905117e-05, 'samples': 25313856, 'steps': 131842, 'loss/train': 1.1791330575942993} 11/07/2021 15:46:33 - INFO - __main__ - Step 131844: {'lr': 1.833967947374282e-05, 'samples': 25314048, 'steps': 131843, 'loss/train': 1.7620872259140015} 11/07/2021 15:46:33 - INFO - __main__ - Step 131845: {'lr': 1.8337684472816974e-05, 'samples': 25314240, 'steps': 131844, 'loss/train': 1.3770864009857178} 11/07/2021 15:46:34 - INFO - __main__ - Step 131846: {'lr': 1.83356895762746e-05, 'samples': 25314432, 'steps': 131845, 'loss/train': 1.218245506286621} 11/07/2021 15:46:35 - INFO - __main__ - Step 131847: {'lr': 1.833369478411659e-05, 'samples': 25314624, 'steps': 131846, 'loss/train': 1.3034747838974} 11/07/2021 15:46:35 - INFO - __main__ - Step 131848: {'lr': 1.8331700096343858e-05, 'samples': 25314816, 'steps': 131847, 'loss/train': 1.099119782447815} 11/07/2021 15:46:36 - INFO - __main__ - Step 131849: {'lr': 1.8329705512957263e-05, 'samples': 25315008, 'steps': 131848, 'loss/train': 1.414033055305481} 11/07/2021 15:46:36 - INFO - __main__ - Step 131850: {'lr': 1.832771103395775e-05, 'samples': 25315200, 'steps': 131849, 'loss/train': 1.6808829307556152} 11/07/2021 15:46:36 - INFO - __main__ - Step 131851: {'lr': 1.832571665934618e-05, 'samples': 25315392, 'steps': 131850, 'loss/train': 1.828238606452942} 11/07/2021 15:46:37 - INFO - __main__ - Step 131852: {'lr': 1.8323722389123472e-05, 'samples': 25315584, 'steps': 131851, 'loss/train': 1.196628212928772} 11/07/2021 15:46:38 - INFO - __main__ - Step 131853: {'lr': 1.8321728223290534e-05, 'samples': 25315776, 'steps': 131852, 'loss/train': 1.125872254371643} 11/07/2021 15:46:38 - INFO - __main__ - Step 131854: {'lr': 1.8319734161848233e-05, 'samples': 25315968, 'steps': 131853, 'loss/train': 1.5872567892074585} 11/07/2021 15:46:38 - INFO - __main__ - Step 131855: {'lr': 1.8317740204797485e-05, 'samples': 25316160, 'steps': 131854, 'loss/train': 0.5967668294906616} 11/07/2021 15:46:39 - INFO - __main__ - Step 131856: {'lr': 1.83157463521392e-05, 'samples': 25316352, 'steps': 131855, 'loss/train': 1.3283157348632812} 11/07/2021 15:46:40 - INFO - __main__ - Step 131857: {'lr': 1.8313752603874246e-05, 'samples': 25316544, 'steps': 131856, 'loss/train': 1.471001148223877} 11/07/2021 15:46:40 - INFO - __main__ - Step 131858: {'lr': 1.831175896000359e-05, 'samples': 25316736, 'steps': 131857, 'loss/train': 1.4706894159317017} 11/07/2021 15:46:41 - INFO - __main__ - Step 131859: {'lr': 1.8309765420528036e-05, 'samples': 25316928, 'steps': 131858, 'loss/train': 1.0494831800460815} 11/07/2021 15:46:41 - INFO - __main__ - Step 131860: {'lr': 1.830777198544853e-05, 'samples': 25317120, 'steps': 131859, 'loss/train': 1.6095391511917114} 11/07/2021 15:46:41 - INFO - __main__ - Step 131861: {'lr': 1.8305778654765985e-05, 'samples': 25317312, 'steps': 131860, 'loss/train': 1.1612324714660645} 11/07/2021 15:46:42 - INFO - __main__ - Step 131862: {'lr': 1.830378542848124e-05, 'samples': 25317504, 'steps': 131861, 'loss/train': 1.0041258335113525} 11/07/2021 15:46:43 - INFO - __main__ - Step 131863: {'lr': 1.830179230659526e-05, 'samples': 25317696, 'steps': 131862, 'loss/train': 1.2738063335418701} 11/07/2021 15:46:43 - INFO - __main__ - Step 131864: {'lr': 1.8299799289108938e-05, 'samples': 25317888, 'steps': 131863, 'loss/train': 1.1225104331970215} 11/07/2021 15:46:43 - INFO - __main__ - Step 131865: {'lr': 1.8297806376023102e-05, 'samples': 25318080, 'steps': 131864, 'loss/train': 1.6204853057861328} 11/07/2021 15:46:44 - INFO - __main__ - Step 131866: {'lr': 1.8295813567338725e-05, 'samples': 25318272, 'steps': 131865, 'loss/train': 1.4329105615615845} 11/07/2021 15:46:44 - INFO - __main__ - Step 131867: {'lr': 1.82938208630567e-05, 'samples': 25318464, 'steps': 131866, 'loss/train': 1.2147573232650757} 11/07/2021 15:46:45 - INFO - __main__ - Step 131868: {'lr': 1.829182826317788e-05, 'samples': 25318656, 'steps': 131867, 'loss/train': 1.376247525215149} 11/07/2021 15:46:45 - INFO - __main__ - Step 131869: {'lr': 1.8289835767703183e-05, 'samples': 25318848, 'steps': 131868, 'loss/train': 1.4786083698272705} 11/07/2021 15:46:46 - INFO - __main__ - Step 131870: {'lr': 1.8287843376633528e-05, 'samples': 25319040, 'steps': 131869, 'loss/train': 1.5283896923065186} 11/07/2021 15:46:46 - INFO - __main__ - Step 131871: {'lr': 1.8285851089969803e-05, 'samples': 25319232, 'steps': 131870, 'loss/train': 1.5524414777755737} 11/07/2021 15:46:46 - INFO - __main__ - Step 131872: {'lr': 1.828385890771289e-05, 'samples': 25319424, 'steps': 131871, 'loss/train': 1.1732279062271118} 11/07/2021 15:46:48 - INFO - __main__ - Step 131873: {'lr': 1.8281866829863687e-05, 'samples': 25319616, 'steps': 131872, 'loss/train': 1.7434660196304321} 11/07/2021 15:46:48 - INFO - __main__ - Step 131874: {'lr': 1.82798748564231e-05, 'samples': 25319808, 'steps': 131873, 'loss/train': 1.270755410194397} 11/07/2021 15:46:48 - INFO - __main__ - Step 131875: {'lr': 1.8277882987391996e-05, 'samples': 25320000, 'steps': 131874, 'loss/train': 1.4136369228363037} 11/07/2021 15:46:49 - INFO - __main__ - Step 131876: {'lr': 1.8275891222771347e-05, 'samples': 25320192, 'steps': 131875, 'loss/train': 0.8366661071777344} 11/07/2021 15:46:49 - INFO - __main__ - Step 131877: {'lr': 1.827389956256198e-05, 'samples': 25320384, 'steps': 131876, 'loss/train': 1.5460519790649414} 11/07/2021 15:46:50 - INFO - __main__ - Step 131878: {'lr': 1.8271908006764814e-05, 'samples': 25320576, 'steps': 131877, 'loss/train': 0.3628748655319214} 11/07/2021 15:46:51 - INFO - __main__ - Step 131879: {'lr': 1.8269916555380767e-05, 'samples': 25320768, 'steps': 131878, 'loss/train': 0.9952349066734314} 11/07/2021 15:46:51 - INFO - __main__ - Step 131880: {'lr': 1.8267925208410725e-05, 'samples': 25320960, 'steps': 131879, 'loss/train': 1.2439827919006348} 11/07/2021 15:46:51 - INFO - __main__ - Step 131881: {'lr': 1.8265933965855574e-05, 'samples': 25321152, 'steps': 131880, 'loss/train': 1.1378482580184937} 11/07/2021 15:46:52 - INFO - __main__ - Step 131882: {'lr': 1.8263942827716206e-05, 'samples': 25321344, 'steps': 131881, 'loss/train': 1.354370355606079} 11/07/2021 15:46:52 - INFO - __main__ - Step 131883: {'lr': 1.8261951793993565e-05, 'samples': 25321536, 'steps': 131882, 'loss/train': 1.43056321144104} 11/07/2021 15:46:53 - INFO - __main__ - Step 131884: {'lr': 1.825996086468848e-05, 'samples': 25321728, 'steps': 131883, 'loss/train': 1.1716561317443848} 11/07/2021 15:46:54 - INFO - __main__ - Step 131885: {'lr': 1.8257970039801897e-05, 'samples': 25321920, 'steps': 131884, 'loss/train': 1.3495492935180664} 11/07/2021 15:46:54 - INFO - __main__ - Step 131886: {'lr': 1.8255979319334676e-05, 'samples': 25322112, 'steps': 131885, 'loss/train': 1.1150635480880737} 11/07/2021 15:46:54 - INFO - __main__ - Step 131887: {'lr': 1.825398870328773e-05, 'samples': 25322304, 'steps': 131886, 'loss/train': 1.5762532949447632} 11/07/2021 15:46:55 - INFO - __main__ - Step 131888: {'lr': 1.8251998191661982e-05, 'samples': 25322496, 'steps': 131887, 'loss/train': 0.5276896953582764} 11/07/2021 15:46:56 - INFO - __main__ - Step 131889: {'lr': 1.825000778445829e-05, 'samples': 25322688, 'steps': 131888, 'loss/train': 1.15291428565979} 11/07/2021 15:46:56 - INFO - __main__ - Step 131890: {'lr': 1.8248017481677593e-05, 'samples': 25322880, 'steps': 131889, 'loss/train': 1.3318374156951904} 11/07/2021 15:46:57 - INFO - __main__ - Step 131891: {'lr': 1.8246027283320756e-05, 'samples': 25323072, 'steps': 131890, 'loss/train': 1.5128083229064941} 11/07/2021 15:46:57 - INFO - __main__ - Step 131892: {'lr': 1.8244037189388664e-05, 'samples': 25323264, 'steps': 131891, 'loss/train': 1.0297125577926636} 11/07/2021 15:46:57 - INFO - __main__ - Step 131893: {'lr': 1.8242047199882233e-05, 'samples': 25323456, 'steps': 131892, 'loss/train': 1.4833662509918213} 11/07/2021 15:46:59 - INFO - __main__ - Step 131894: {'lr': 1.824005731480241e-05, 'samples': 25323648, 'steps': 131893, 'loss/train': 1.4387602806091309} 11/07/2021 15:46:59 - INFO - __main__ - Step 131895: {'lr': 1.823806753415e-05, 'samples': 25323840, 'steps': 131894, 'loss/train': 1.4315510988235474} 11/07/2021 15:46:59 - INFO - __main__ - Step 131896: {'lr': 1.8236077857925944e-05, 'samples': 25324032, 'steps': 131895, 'loss/train': 1.2780154943466187} 11/07/2021 15:47:00 - INFO - __main__ - Step 131897: {'lr': 1.8234088286131127e-05, 'samples': 25324224, 'steps': 131896, 'loss/train': 1.098307728767395} 11/07/2021 15:47:00 - INFO - __main__ - Step 131898: {'lr': 1.8232098818766474e-05, 'samples': 25324416, 'steps': 131897, 'loss/train': 0.8921128511428833} 11/07/2021 15:47:01 - INFO - __main__ - Step 131899: {'lr': 1.823010945583284e-05, 'samples': 25324608, 'steps': 131898, 'loss/train': 1.5213897228240967} 11/07/2021 15:47:01 - INFO - __main__ - Step 131900: {'lr': 1.822812019733114e-05, 'samples': 25324800, 'steps': 131899, 'loss/train': 1.0042505264282227} 11/07/2021 15:47:02 - INFO - __main__ - Step 131901: {'lr': 1.822613104326229e-05, 'samples': 25324992, 'steps': 131900, 'loss/train': 1.129673957824707} 11/07/2021 15:47:02 - INFO - __main__ - Step 131902: {'lr': 1.8224141993627153e-05, 'samples': 25325184, 'steps': 131901, 'loss/train': 1.4523845911026} 11/07/2021 15:47:02 - INFO - __main__ - Step 131903: {'lr': 1.8222153048426643e-05, 'samples': 25325376, 'steps': 131902, 'loss/train': 0.9828292727470398} 11/07/2021 15:47:03 - INFO - __main__ - Step 131904: {'lr': 1.822016420766165e-05, 'samples': 25325568, 'steps': 131903, 'loss/train': 1.4750996828079224} 11/07/2021 15:47:04 - INFO - __main__ - Step 131905: {'lr': 1.8218175471333116e-05, 'samples': 25325760, 'steps': 131904, 'loss/train': 3.321774959564209} 11/07/2021 15:47:04 - INFO - __main__ - Step 131906: {'lr': 1.8216186839441874e-05, 'samples': 25325952, 'steps': 131905, 'loss/train': 1.4910626411437988} 11/07/2021 15:47:04 - INFO - __main__ - Step 131907: {'lr': 1.821419831198881e-05, 'samples': 25326144, 'steps': 131906, 'loss/train': 0.9013728499412537} 11/07/2021 15:47:05 - INFO - __main__ - Step 131908: {'lr': 1.8212209888974874e-05, 'samples': 25326336, 'steps': 131907, 'loss/train': 1.3265323638916016} 11/07/2021 15:47:05 - INFO - __main__ - Step 131909: {'lr': 1.8210221570400946e-05, 'samples': 25326528, 'steps': 131908, 'loss/train': 0.9086816310882568} 11/07/2021 15:47:06 - INFO - __main__ - Step 131910: {'lr': 1.8208233356267895e-05, 'samples': 25326720, 'steps': 131909, 'loss/train': 1.4730015993118286} 11/07/2021 15:47:07 - INFO - __main__ - Step 131911: {'lr': 1.820624524657666e-05, 'samples': 25326912, 'steps': 131910, 'loss/train': 1.4648314714431763} 11/07/2021 15:47:07 - INFO - __main__ - Step 131912: {'lr': 1.82042572413281e-05, 'samples': 25327104, 'steps': 131911, 'loss/train': 1.1146107912063599} 11/07/2021 15:47:07 - INFO - __main__ - Step 131913: {'lr': 1.820226934052313e-05, 'samples': 25327296, 'steps': 131912, 'loss/train': 1.3534014225006104} 11/07/2021 15:47:08 - INFO - __main__ - Step 131914: {'lr': 1.8200281544162643e-05, 'samples': 25327488, 'steps': 131913, 'loss/train': 1.2575955390930176} 11/07/2021 15:47:09 - INFO - __main__ - Step 131915: {'lr': 1.819829385224753e-05, 'samples': 25327680, 'steps': 131914, 'loss/train': 1.8920085430145264} 11/07/2021 15:47:09 - INFO - __main__ - Step 131916: {'lr': 1.8196306264778722e-05, 'samples': 25327872, 'steps': 131915, 'loss/train': 1.40664541721344} 11/07/2021 15:47:09 - INFO - __main__ - Step 131917: {'lr': 1.8194318781757036e-05, 'samples': 25328064, 'steps': 131916, 'loss/train': 1.1192317008972168} 11/07/2021 15:47:10 - INFO - __main__ - Step 131918: {'lr': 1.819233140318341e-05, 'samples': 25328256, 'steps': 131917, 'loss/train': 1.538116455078125} 11/07/2021 15:47:10 - INFO - __main__ - Step 131919: {'lr': 1.8190344129058763e-05, 'samples': 25328448, 'steps': 131918, 'loss/train': 1.327014446258545} 11/07/2021 15:47:11 - INFO - __main__ - Step 131920: {'lr': 1.818835695938395e-05, 'samples': 25328640, 'steps': 131919, 'loss/train': 1.3071357011795044} 11/07/2021 15:47:11 - INFO - __main__ - Step 131921: {'lr': 1.818636989415992e-05, 'samples': 25328832, 'steps': 131920, 'loss/train': 1.5515785217285156} 11/07/2021 15:47:12 - INFO - __main__ - Step 131922: {'lr': 1.8184382933387505e-05, 'samples': 25329024, 'steps': 131921, 'loss/train': 1.3070961236953735} 11/07/2021 15:47:12 - INFO - __main__ - Step 131923: {'lr': 1.818239607706762e-05, 'samples': 25329216, 'steps': 131922, 'loss/train': 1.2711963653564453} 11/07/2021 15:47:12 - INFO - __main__ - Step 131924: {'lr': 1.8180409325201208e-05, 'samples': 25329408, 'steps': 131923, 'loss/train': 0.8968076109886169} 11/07/2021 15:47:13 - INFO - __main__ - Step 131925: {'lr': 1.8178422677789102e-05, 'samples': 25329600, 'steps': 131924, 'loss/train': 1.0007359981536865} 11/07/2021 15:47:14 - INFO - __main__ - Step 131926: {'lr': 1.8176436134832276e-05, 'samples': 25329792, 'steps': 131925, 'loss/train': 1.278369426727295} 11/07/2021 15:47:14 - INFO - __main__ - Step 131927: {'lr': 1.8174449696331502e-05, 'samples': 25329984, 'steps': 131926, 'loss/train': 1.4699286222457886} 11/07/2021 15:47:14 - INFO - __main__ - Step 131928: {'lr': 1.8172463362287757e-05, 'samples': 25330176, 'steps': 131927, 'loss/train': 1.4872037172317505} 11/07/2021 15:47:15 - INFO - __main__ - Step 131929: {'lr': 1.8170477132701923e-05, 'samples': 25330368, 'steps': 131928, 'loss/train': 1.589889407157898} 11/07/2021 15:47:16 - INFO - __main__ - Step 131930: {'lr': 1.8168491007574923e-05, 'samples': 25330560, 'steps': 131929, 'loss/train': 1.1932399272918701} 11/07/2021 15:47:16 - INFO - __main__ - Step 131931: {'lr': 1.8166504986907585e-05, 'samples': 25330752, 'steps': 131930, 'loss/train': 1.0530226230621338} 11/07/2021 15:47:17 - INFO - __main__ - Step 131932: {'lr': 1.816451907070085e-05, 'samples': 25330944, 'steps': 131931, 'loss/train': 1.2301865816116333} 11/07/2021 15:47:17 - INFO - __main__ - Step 131933: {'lr': 1.8162533258955615e-05, 'samples': 25331136, 'steps': 131932, 'loss/train': 1.448918342590332} 11/07/2021 15:47:17 - INFO - __main__ - Step 131934: {'lr': 1.8160547551672764e-05, 'samples': 25331328, 'steps': 131933, 'loss/train': 0.7985072135925293} 11/07/2021 15:47:18 - INFO - __main__ - Step 131935: {'lr': 1.815856194885318e-05, 'samples': 25331520, 'steps': 131934, 'loss/train': 0.7842907905578613} 11/07/2021 15:47:19 - INFO - __main__ - Step 131936: {'lr': 1.8156576450497786e-05, 'samples': 25331712, 'steps': 131935, 'loss/train': 1.3778369426727295} 11/07/2021 15:47:19 - INFO - __main__ - Step 131937: {'lr': 1.8154591056607465e-05, 'samples': 25331904, 'steps': 131936, 'loss/train': 1.531198501586914} 11/07/2021 15:47:20 - INFO - __main__ - Step 131938: {'lr': 1.8152605767183138e-05, 'samples': 25332096, 'steps': 131937, 'loss/train': 1.5911651849746704} 11/07/2021 15:47:20 - INFO - __main__ - Step 131939: {'lr': 1.8150620582225637e-05, 'samples': 25332288, 'steps': 131938, 'loss/train': 1.3372905254364014} 11/07/2021 15:47:20 - INFO - __main__ - Step 131940: {'lr': 1.814863550173587e-05, 'samples': 25332480, 'steps': 131939, 'loss/train': 1.4291982650756836} 11/07/2021 15:47:21 - INFO - __main__ - Step 131941: {'lr': 1.8146650525714763e-05, 'samples': 25332672, 'steps': 131940, 'loss/train': 1.1406878232955933} 11/07/2021 15:47:22 - INFO - __main__ - Step 131942: {'lr': 1.81446656541632e-05, 'samples': 25332864, 'steps': 131941, 'loss/train': 1.527762770652771} 11/07/2021 15:47:22 - INFO - __main__ - Step 131943: {'lr': 1.8142680887082068e-05, 'samples': 25333056, 'steps': 131942, 'loss/train': 1.2374407052993774} 11/07/2021 15:47:22 - INFO - __main__ - Step 131944: {'lr': 1.814069622447226e-05, 'samples': 25333248, 'steps': 131943, 'loss/train': 1.5962669849395752} 11/07/2021 15:47:23 - INFO - __main__ - Step 131945: {'lr': 1.8138711666334683e-05, 'samples': 25333440, 'steps': 131944, 'loss/train': 1.1223145723342896} 11/07/2021 15:47:23 - INFO - __main__ - Step 131946: {'lr': 1.8136727212670233e-05, 'samples': 25333632, 'steps': 131945, 'loss/train': 1.1781080961227417} 11/07/2021 15:47:24 - INFO - __main__ - Step 131947: {'lr': 1.81347428634798e-05, 'samples': 25333824, 'steps': 131946, 'loss/train': 0.7715721130371094} 11/07/2021 15:47:24 - INFO - __main__ - Step 131948: {'lr': 1.813275861876426e-05, 'samples': 25334016, 'steps': 131947, 'loss/train': 1.4674150943756104} 11/07/2021 15:47:25 - INFO - __main__ - Step 131949: {'lr': 1.8130774478524516e-05, 'samples': 25334208, 'steps': 131948, 'loss/train': 1.5196524858474731} 11/07/2021 15:47:25 - INFO - __main__ - Step 131950: {'lr': 1.8128790442761473e-05, 'samples': 25334400, 'steps': 131949, 'loss/train': 1.0300153493881226} 11/07/2021 15:47:25 - INFO - __main__ - Step 131951: {'lr': 1.8126806511476024e-05, 'samples': 25334592, 'steps': 131950, 'loss/train': 1.2344521284103394} 11/07/2021 15:47:27 - INFO - __main__ - Step 131952: {'lr': 1.8124822684669084e-05, 'samples': 25334784, 'steps': 131951, 'loss/train': 1.3570195436477661} 11/07/2021 15:47:27 - INFO - __main__ - Step 131953: {'lr': 1.8122838962341514e-05, 'samples': 25334976, 'steps': 131952, 'loss/train': 0.7841466665267944} 11/07/2021 15:47:27 - INFO - __main__ - Step 131954: {'lr': 1.8120855344494176e-05, 'samples': 25335168, 'steps': 131953, 'loss/train': 1.2988197803497314} 11/07/2021 15:47:28 - INFO - __main__ - Step 131955: {'lr': 1.8118871831128038e-05, 'samples': 25335360, 'steps': 131954, 'loss/train': 1.4282597303390503} 11/07/2021 15:47:28 - INFO - __main__ - Step 131956: {'lr': 1.8116888422243932e-05, 'samples': 25335552, 'steps': 131955, 'loss/train': 1.4081742763519287} 11/07/2021 15:47:29 - INFO - __main__ - Step 131957: {'lr': 1.811490511784278e-05, 'samples': 25335744, 'steps': 131956, 'loss/train': 1.00153386592865} 11/07/2021 15:47:29 - INFO - __main__ - Step 131958: {'lr': 1.811292191792549e-05, 'samples': 25335936, 'steps': 131957, 'loss/train': 1.2829368114471436} 11/07/2021 15:47:30 - INFO - __main__ - Step 131959: {'lr': 1.8110938822492902e-05, 'samples': 25336128, 'steps': 131958, 'loss/train': 2.1547889709472656} 11/07/2021 15:47:30 - INFO - __main__ - Step 131960: {'lr': 1.8108955831545982e-05, 'samples': 25336320, 'steps': 131959, 'loss/train': 1.6733152866363525} 11/07/2021 15:47:31 - INFO - __main__ - Step 131961: {'lr': 1.810697294508559e-05, 'samples': 25336512, 'steps': 131960, 'loss/train': 1.5735437870025635} 11/07/2021 15:47:31 - INFO - __main__ - Step 131962: {'lr': 1.8104990163112596e-05, 'samples': 25336704, 'steps': 131961, 'loss/train': 1.3265001773834229} 11/07/2021 15:47:32 - INFO - __main__ - Step 131963: {'lr': 1.810300748562793e-05, 'samples': 25336896, 'steps': 131962, 'loss/train': 1.390694499015808} 11/07/2021 15:47:32 - INFO - __main__ - Step 131964: {'lr': 1.8101024912632465e-05, 'samples': 25337088, 'steps': 131963, 'loss/train': 1.1273623704910278} 11/07/2021 15:47:33 - INFO - __main__ - Step 131965: {'lr': 1.8099042444127135e-05, 'samples': 25337280, 'steps': 131964, 'loss/train': 1.329344391822815} 11/07/2021 15:47:33 - INFO - __main__ - Step 131966: {'lr': 1.8097060080112776e-05, 'samples': 25337472, 'steps': 131965, 'loss/train': 1.5266495943069458} 11/07/2021 15:47:33 - INFO - __main__ - Step 131967: {'lr': 1.809507782059028e-05, 'samples': 25337664, 'steps': 131966, 'loss/train': 1.3387506008148193} 11/07/2021 15:47:34 - INFO - __main__ - Step 131968: {'lr': 1.809309566556058e-05, 'samples': 25337856, 'steps': 131967, 'loss/train': 1.526100516319275} 11/07/2021 15:47:35 - INFO - __main__ - Step 131969: {'lr': 1.809111361502455e-05, 'samples': 25338048, 'steps': 131968, 'loss/train': 0.35157105326652527} 11/07/2021 15:47:35 - INFO - __main__ - Step 131970: {'lr': 1.8089131668983072e-05, 'samples': 25338240, 'steps': 131969, 'loss/train': 1.375744104385376} 11/07/2021 15:47:35 - INFO - __main__ - Step 131971: {'lr': 1.808714982743706e-05, 'samples': 25338432, 'steps': 131970, 'loss/train': 1.2925031185150146} 11/07/2021 15:47:36 - INFO - __main__ - Step 131972: {'lr': 1.8085168090387404e-05, 'samples': 25338624, 'steps': 131971, 'loss/train': 1.50471031665802} 11/07/2021 15:47:37 - INFO - __main__ - Step 131973: {'lr': 1.8083186457834994e-05, 'samples': 25338816, 'steps': 131972, 'loss/train': 1.3163995742797852} 11/07/2021 15:47:37 - INFO - __main__ - Step 131974: {'lr': 1.8081204929780714e-05, 'samples': 25339008, 'steps': 131973, 'loss/train': 1.4245820045471191} 11/07/2021 15:47:37 - INFO - __main__ - Step 131975: {'lr': 1.8079223506225484e-05, 'samples': 25339200, 'steps': 131974, 'loss/train': 1.4353976249694824} 11/07/2021 15:47:38 - INFO - __main__ - Step 131976: {'lr': 1.8077242187170163e-05, 'samples': 25339392, 'steps': 131975, 'loss/train': 1.7564784288406372} 11/07/2021 15:47:38 - INFO - __main__ - Step 131977: {'lr': 1.8075260972615638e-05, 'samples': 25339584, 'steps': 131976, 'loss/train': 0.9944172501564026} 11/07/2021 15:47:39 - INFO - __main__ - Step 131978: {'lr': 1.8073279862562854e-05, 'samples': 25339776, 'steps': 131977, 'loss/train': 1.0664660930633545} 11/07/2021 15:47:40 - INFO - __main__ - Step 131979: {'lr': 1.80712988570127e-05, 'samples': 25339968, 'steps': 131978, 'loss/train': 1.0681390762329102} 11/07/2021 15:47:40 - INFO - __main__ - Step 131980: {'lr': 1.8069317955966002e-05, 'samples': 25340160, 'steps': 131979, 'loss/train': 0.10497671365737915} 11/07/2021 15:47:40 - INFO - __main__ - Step 131981: {'lr': 1.8067337159423686e-05, 'samples': 25340352, 'steps': 131980, 'loss/train': 1.110968828201294} 11/07/2021 15:47:41 - INFO - __main__ - Step 131982: {'lr': 1.8065356467386635e-05, 'samples': 25340544, 'steps': 131981, 'loss/train': 1.3798021078109741} 11/07/2021 15:47:42 - INFO - __main__ - Step 131983: {'lr': 1.8063375879855764e-05, 'samples': 25340736, 'steps': 131982, 'loss/train': 1.2361582517623901} 11/07/2021 15:47:42 - INFO - __main__ - Step 131984: {'lr': 1.8061395396831964e-05, 'samples': 25340928, 'steps': 131983, 'loss/train': 0.6876171231269836} 11/07/2021 15:47:43 - INFO - __main__ - Step 131985: {'lr': 1.805941501831612e-05, 'samples': 25341120, 'steps': 131984, 'loss/train': 1.2986115217208862} 11/07/2021 15:47:43 - INFO - __main__ - Step 131986: {'lr': 1.8057434744309125e-05, 'samples': 25341312, 'steps': 131985, 'loss/train': 1.713118314743042} 11/07/2021 15:47:43 - INFO - __main__ - Step 131987: {'lr': 1.8055454574811863e-05, 'samples': 25341504, 'steps': 131986, 'loss/train': 1.5096631050109863} 11/07/2021 15:47:44 - INFO - __main__ - Step 131988: {'lr': 1.8053474509825253e-05, 'samples': 25341696, 'steps': 131987, 'loss/train': 1.230866551399231} 11/07/2021 15:47:45 - INFO - __main__ - Step 131989: {'lr': 1.8051494549350152e-05, 'samples': 25341888, 'steps': 131988, 'loss/train': 1.1075239181518555} 11/07/2021 15:47:45 - INFO - __main__ - Step 131990: {'lr': 1.8049514693387475e-05, 'samples': 25342080, 'steps': 131989, 'loss/train': 1.1706866025924683} 11/07/2021 15:47:45 - INFO - __main__ - Step 131991: {'lr': 1.804753494193809e-05, 'samples': 25342272, 'steps': 131990, 'loss/train': 1.0759050846099854} 11/07/2021 15:47:46 - INFO - __main__ - Step 131992: {'lr': 1.8045555295002957e-05, 'samples': 25342464, 'steps': 131991, 'loss/train': 0.9122575521469116} 11/07/2021 15:47:46 - INFO - __main__ - Step 131993: {'lr': 1.804357575258289e-05, 'samples': 25342656, 'steps': 131992, 'loss/train': 1.2774028778076172} 11/07/2021 15:47:47 - INFO - __main__ - Step 131994: {'lr': 1.8041596314678804e-05, 'samples': 25342848, 'steps': 131993, 'loss/train': 1.5788482427597046} 11/07/2021 15:47:47 - INFO - __main__ - Step 131995: {'lr': 1.8039616981291586e-05, 'samples': 25343040, 'steps': 131994, 'loss/train': 0.9420141577720642} 11/07/2021 15:47:48 - INFO - __main__ - Step 131996: {'lr': 1.8037637752422148e-05, 'samples': 25343232, 'steps': 131995, 'loss/train': 1.7015904188156128} 11/07/2021 15:47:48 - INFO - __main__ - Step 131997: {'lr': 1.8035658628071355e-05, 'samples': 25343424, 'steps': 131996, 'loss/train': 1.4627807140350342} 11/07/2021 15:47:49 - INFO - __main__ - Step 131998: {'lr': 1.803367960824012e-05, 'samples': 25343616, 'steps': 131997, 'loss/train': 1.1671013832092285} 11/07/2021 15:47:50 - INFO - __main__ - Step 131999: {'lr': 1.803170069292934e-05, 'samples': 25343808, 'steps': 131998, 'loss/train': 1.2019708156585693} 11/07/2021 15:47:50 - INFO - __main__ - Step 132000: {'lr': 1.8029721882139887e-05, 'samples': 25344000, 'steps': 131999, 'loss/train': 1.332590937614441} 11/07/2021 15:47:50 - INFO - __main__ - Step 132001: {'lr': 1.8027743175872664e-05, 'samples': 25344192, 'steps': 132000, 'loss/train': 2.063779354095459} 11/07/2021 15:47:51 - INFO - __main__ - Step 132002: {'lr': 1.802576457412858e-05, 'samples': 25344384, 'steps': 132001, 'loss/train': 1.5227329730987549} 11/07/2021 15:47:51 - INFO - __main__ - Step 132003: {'lr': 1.8023786076908495e-05, 'samples': 25344576, 'steps': 132002, 'loss/train': 1.14496648311615} 11/07/2021 15:47:52 - INFO - __main__ - Step 132004: {'lr': 1.80218076842133e-05, 'samples': 25344768, 'steps': 132003, 'loss/train': 1.5909672975540161} 11/07/2021 15:47:52 - INFO - __main__ - Step 132005: {'lr': 1.8019829396043908e-05, 'samples': 25344960, 'steps': 132004, 'loss/train': 0.9797396659851074} 11/07/2021 15:47:53 - INFO - __main__ - Step 132006: {'lr': 1.8017851212401238e-05, 'samples': 25345152, 'steps': 132005, 'loss/train': 1.4343032836914062} 11/07/2021 15:47:53 - INFO - __main__ - Step 132007: {'lr': 1.8015873133286093e-05, 'samples': 25345344, 'steps': 132006, 'loss/train': 1.148972749710083} 11/07/2021 15:47:54 - INFO - __main__ - Step 132008: {'lr': 1.8013895158699415e-05, 'samples': 25345536, 'steps': 132007, 'loss/train': 0.6749100685119629} 11/07/2021 15:47:54 - INFO - __main__ - Step 132009: {'lr': 1.8011917288642126e-05, 'samples': 25345728, 'steps': 132008, 'loss/train': 1.640318751335144} 11/07/2021 15:47:55 - INFO - __main__ - Step 132010: {'lr': 1.8009939523115055e-05, 'samples': 25345920, 'steps': 132009, 'loss/train': 1.346348762512207} 11/07/2021 15:47:55 - INFO - __main__ - Step 132011: {'lr': 1.8007961862119144e-05, 'samples': 25346112, 'steps': 132010, 'loss/train': 1.4873560667037964} 11/07/2021 15:47:56 - INFO - __main__ - Step 132012: {'lr': 1.800598430565528e-05, 'samples': 25346304, 'steps': 132011, 'loss/train': 1.3313628435134888} 11/07/2021 15:47:56 - INFO - __main__ - Step 132013: {'lr': 1.8004006853724303e-05, 'samples': 25346496, 'steps': 132012, 'loss/train': 1.7517558336257935} 11/07/2021 15:47:57 - INFO - __main__ - Step 132014: {'lr': 1.800202950632718e-05, 'samples': 25346688, 'steps': 132013, 'loss/train': 1.3360254764556885} 11/07/2021 15:47:57 - INFO - __main__ - Step 132015: {'lr': 1.800005226346474e-05, 'samples': 25346880, 'steps': 132014, 'loss/train': 1.2417073249816895} 11/07/2021 15:47:58 - INFO - __main__ - Step 132016: {'lr': 1.7998075125137874e-05, 'samples': 25347072, 'steps': 132015, 'loss/train': 1.1668161153793335} 11/07/2021 15:47:58 - INFO - __main__ - Step 132017: {'lr': 1.799609809134753e-05, 'samples': 25347264, 'steps': 132016, 'loss/train': 1.3530621528625488} 11/07/2021 15:47:58 - INFO - __main__ - Step 132018: {'lr': 1.7994121162094562e-05, 'samples': 25347456, 'steps': 132017, 'loss/train': 1.5417239665985107} 11/07/2021 15:47:59 - INFO - __main__ - Step 132019: {'lr': 1.799214433737989e-05, 'samples': 25347648, 'steps': 132018, 'loss/train': 0.7205768823623657} 11/07/2021 15:48:00 - INFO - __main__ - Step 132020: {'lr': 1.7990167617204346e-05, 'samples': 25347840, 'steps': 132019, 'loss/train': 1.2552602291107178} 11/07/2021 15:48:00 - INFO - __main__ - Step 132021: {'lr': 1.7988191001568844e-05, 'samples': 25348032, 'steps': 132020, 'loss/train': 1.0711177587509155} 11/07/2021 15:48:01 - INFO - __main__ - Step 132022: {'lr': 1.7986214490474275e-05, 'samples': 25348224, 'steps': 132021, 'loss/train': 1.5597528219223022} 11/07/2021 15:48:01 - INFO - __main__ - Step 132023: {'lr': 1.798423808392155e-05, 'samples': 25348416, 'steps': 132022, 'loss/train': 1.0081114768981934} 11/07/2021 15:48:01 - INFO - __main__ - Step 132024: {'lr': 1.7982261781911562e-05, 'samples': 25348608, 'steps': 132023, 'loss/train': 0.6469451189041138} 11/07/2021 15:48:02 - INFO - __main__ - Step 132025: {'lr': 1.798028558444517e-05, 'samples': 25348800, 'steps': 132024, 'loss/train': 0.29367589950561523} 11/07/2021 15:48:03 - INFO - __main__ - Step 132026: {'lr': 1.7978309491523294e-05, 'samples': 25348992, 'steps': 132025, 'loss/train': 1.5459022521972656} 11/07/2021 15:48:03 - INFO - __main__ - Step 132027: {'lr': 1.7976333503146786e-05, 'samples': 25349184, 'steps': 132026, 'loss/train': 1.372126817703247} 11/07/2021 15:48:03 - INFO - __main__ - Step 132028: {'lr': 1.7974357619316567e-05, 'samples': 25349376, 'steps': 132027, 'loss/train': 1.1038438081741333} 11/07/2021 15:48:04 - INFO - __main__ - Step 132029: {'lr': 1.7972381840033554e-05, 'samples': 25349568, 'steps': 132028, 'loss/train': 0.8002528548240662} 11/07/2021 15:48:05 - INFO - __main__ - Step 132030: {'lr': 1.7970406165298574e-05, 'samples': 25349760, 'steps': 132029, 'loss/train': 1.1776012182235718} 11/07/2021 15:48:05 - INFO - __main__ - Step 132031: {'lr': 1.796843059511255e-05, 'samples': 25349952, 'steps': 132030, 'loss/train': 1.1863889694213867} 11/07/2021 15:48:05 - INFO - __main__ - Step 132032: {'lr': 1.7966455129476367e-05, 'samples': 25350144, 'steps': 132031, 'loss/train': 0.08693652600049973} 11/07/2021 15:48:06 - INFO - __main__ - Step 132033: {'lr': 1.796447976839097e-05, 'samples': 25350336, 'steps': 132032, 'loss/train': 1.3936253786087036} 11/07/2021 15:48:06 - INFO - __main__ - Step 132034: {'lr': 1.796250451185716e-05, 'samples': 25350528, 'steps': 132033, 'loss/train': 1.9551053047180176} 11/07/2021 15:48:07 - INFO - __main__ - Step 132035: {'lr': 1.796052935987588e-05, 'samples': 25350720, 'steps': 132034, 'loss/train': 1.5863006114959717} 11/07/2021 15:48:08 - INFO - __main__ - Step 132036: {'lr': 1.7958554312447973e-05, 'samples': 25350912, 'steps': 132035, 'loss/train': 1.5404129028320312} 11/07/2021 15:48:08 - INFO - __main__ - Step 132037: {'lr': 1.7956579369574372e-05, 'samples': 25351104, 'steps': 132036, 'loss/train': 0.937261164188385} 11/07/2021 15:48:08 - INFO - __main__ - Step 132038: {'lr': 1.7954604531255968e-05, 'samples': 25351296, 'steps': 132037, 'loss/train': 1.4413548707962036} 11/07/2021 15:48:09 - INFO - __main__ - Step 132039: {'lr': 1.7952629797493623e-05, 'samples': 25351488, 'steps': 132038, 'loss/train': 1.020665168762207} 11/07/2021 15:48:10 - INFO - __main__ - Step 132040: {'lr': 1.7950655168288256e-05, 'samples': 25351680, 'steps': 132039, 'loss/train': 1.0870763063430786} 11/07/2021 15:48:10 - INFO - __main__ - Step 132041: {'lr': 1.7948680643640718e-05, 'samples': 25351872, 'steps': 132040, 'loss/train': 1.3777605295181274} 11/07/2021 15:48:10 - INFO - __main__ - Step 132042: {'lr': 1.7946706223551963e-05, 'samples': 25352064, 'steps': 132041, 'loss/train': 0.2968837320804596} 11/07/2021 15:48:11 - INFO - __main__ - Step 132043: {'lr': 1.7944731908022815e-05, 'samples': 25352256, 'steps': 132042, 'loss/train': 1.1362241506576538} 11/07/2021 15:48:11 - INFO - __main__ - Step 132044: {'lr': 1.7942757697054196e-05, 'samples': 25352448, 'steps': 132043, 'loss/train': 0.9740577936172485} 11/07/2021 15:48:11 - INFO - __main__ - Step 132045: {'lr': 1.794078359064699e-05, 'samples': 25352640, 'steps': 132044, 'loss/train': 1.139897108078003} 11/07/2021 15:48:12 - INFO - __main__ - Step 132046: {'lr': 1.793880958880212e-05, 'samples': 25352832, 'steps': 132045, 'loss/train': 1.2034660577774048} 11/07/2021 15:48:13 - INFO - __main__ - Step 132047: {'lr': 1.793683569152041e-05, 'samples': 25353024, 'steps': 132046, 'loss/train': 1.3973238468170166} 11/07/2021 15:48:13 - INFO - __main__ - Step 132048: {'lr': 1.7934861898802778e-05, 'samples': 25353216, 'steps': 132047, 'loss/train': 1.0538579225540161} 11/07/2021 15:48:14 - INFO - __main__ - Step 132049: {'lr': 1.7932888210650117e-05, 'samples': 25353408, 'steps': 132048, 'loss/train': 1.1372030973434448} 11/07/2021 15:48:14 - INFO - __main__ - Step 132050: {'lr': 1.7930914627063312e-05, 'samples': 25353600, 'steps': 132049, 'loss/train': 1.3491976261138916} 11/07/2021 15:48:15 - INFO - __main__ - Step 132051: {'lr': 1.792894114804325e-05, 'samples': 25353792, 'steps': 132050, 'loss/train': 1.3084688186645508} 11/07/2021 15:48:15 - INFO - __main__ - Step 132052: {'lr': 1.792696777359082e-05, 'samples': 25353984, 'steps': 132051, 'loss/train': 1.370374321937561} 11/07/2021 15:48:16 - INFO - __main__ - Step 132053: {'lr': 1.792499450370694e-05, 'samples': 25354176, 'steps': 132052, 'loss/train': 1.387528896331787} 11/07/2021 15:48:16 - INFO - __main__ - Step 132054: {'lr': 1.7923021338392463e-05, 'samples': 25354368, 'steps': 132053, 'loss/train': 0.7279184460639954} 11/07/2021 15:48:16 - INFO - __main__ - Step 132055: {'lr': 1.7921048277648287e-05, 'samples': 25354560, 'steps': 132054, 'loss/train': 1.5557554960250854} 11/07/2021 15:48:17 - INFO - __main__ - Step 132056: {'lr': 1.7919075321475327e-05, 'samples': 25354752, 'steps': 132055, 'loss/train': 1.4468191862106323} 11/07/2021 15:48:18 - INFO - __main__ - Step 132057: {'lr': 1.7917102469874435e-05, 'samples': 25354944, 'steps': 132056, 'loss/train': 1.3307636976242065} 11/07/2021 15:48:18 - INFO - __main__ - Step 132058: {'lr': 1.7915129722846506e-05, 'samples': 25355136, 'steps': 132057, 'loss/train': 0.9996932744979858} 11/07/2021 15:48:18 - INFO - __main__ - Step 132059: {'lr': 1.7913157080392513e-05, 'samples': 25355328, 'steps': 132058, 'loss/train': 1.2121195793151855} 11/07/2021 15:48:19 - INFO - __main__ - Step 132060: {'lr': 1.7911184542513197e-05, 'samples': 25355520, 'steps': 132059, 'loss/train': 1.3662725687026978} 11/07/2021 15:48:20 - INFO - __main__ - Step 132061: {'lr': 1.7909212109209538e-05, 'samples': 25355712, 'steps': 132060, 'loss/train': 1.609397292137146} 11/07/2021 15:48:20 - INFO - __main__ - Step 132062: {'lr': 1.7907239780482392e-05, 'samples': 25355904, 'steps': 132061, 'loss/train': 0.8091583847999573} 11/07/2021 15:48:21 - INFO - __main__ - Step 132063: {'lr': 1.790526755633268e-05, 'samples': 25356096, 'steps': 132062, 'loss/train': 0.7314538359642029} 11/07/2021 15:48:21 - INFO - __main__ - Step 132064: {'lr': 1.790329543676125e-05, 'samples': 25356288, 'steps': 132063, 'loss/train': 2.6477115154266357} 11/07/2021 15:48:21 - INFO - __main__ - Step 132065: {'lr': 1.7901323421769035e-05, 'samples': 25356480, 'steps': 132064, 'loss/train': 1.3807170391082764} 11/07/2021 15:48:23 - INFO - __main__ - Step 132066: {'lr': 1.7899351511356883e-05, 'samples': 25356672, 'steps': 132065, 'loss/train': 1.3232150077819824} 11/07/2021 15:48:23 - INFO - __main__ - Step 132067: {'lr': 1.7897379705525714e-05, 'samples': 25356864, 'steps': 132066, 'loss/train': 1.2420274019241333} 11/07/2021 15:48:23 - INFO - __main__ - Step 132068: {'lr': 1.7895408004276388e-05, 'samples': 25357056, 'steps': 132067, 'loss/train': 1.7273858785629272} 11/07/2021 15:48:24 - INFO - __main__ - Step 132069: {'lr': 1.789343640760982e-05, 'samples': 25357248, 'steps': 132068, 'loss/train': 1.1763043403625488} 11/07/2021 15:48:24 - INFO - __main__ - Step 132070: {'lr': 1.78914649155269e-05, 'samples': 25357440, 'steps': 132069, 'loss/train': 1.2197391986846924} 11/07/2021 15:48:24 - INFO - __main__ - Step 132071: {'lr': 1.7889493528028487e-05, 'samples': 25357632, 'steps': 132070, 'loss/train': 1.5657292604446411} 11/07/2021 15:48:25 - INFO - __main__ - Step 132072: {'lr': 1.78875222451155e-05, 'samples': 25357824, 'steps': 132071, 'loss/train': 0.6864472031593323} 11/07/2021 15:48:26 - INFO - __main__ - Step 132073: {'lr': 1.7885551066788852e-05, 'samples': 25358016, 'steps': 132072, 'loss/train': 1.3035029172897339} 11/07/2021 15:48:26 - INFO - __main__ - Step 132074: {'lr': 1.7883579993049347e-05, 'samples': 25358208, 'steps': 132073, 'loss/train': 1.489046573638916} 11/07/2021 15:48:26 - INFO - __main__ - Step 132075: {'lr': 1.7881609023897933e-05, 'samples': 25358400, 'steps': 132074, 'loss/train': 1.4811277389526367} 11/07/2021 15:48:27 - INFO - __main__ - Step 132076: {'lr': 1.7879638159335464e-05, 'samples': 25358592, 'steps': 132075, 'loss/train': 1.3730249404907227} 11/07/2021 15:48:28 - INFO - __main__ - Step 132077: {'lr': 1.787766739936286e-05, 'samples': 25358784, 'steps': 132076, 'loss/train': 1.0936070680618286} 11/07/2021 15:48:28 - INFO - __main__ - Step 132078: {'lr': 1.7875696743980986e-05, 'samples': 25358976, 'steps': 132077, 'loss/train': 1.2299786806106567} 11/07/2021 15:48:28 - INFO - __main__ - Step 132079: {'lr': 1.787372619319075e-05, 'samples': 25359168, 'steps': 132078, 'loss/train': 0.8136963844299316} 11/07/2021 15:48:29 - INFO - __main__ - Step 132080: {'lr': 1.787175574699304e-05, 'samples': 25359360, 'steps': 132079, 'loss/train': 1.4664697647094727} 11/07/2021 15:48:29 - INFO - __main__ - Step 132081: {'lr': 1.7869785405388724e-05, 'samples': 25359552, 'steps': 132080, 'loss/train': 0.9591220617294312} 11/07/2021 15:48:30 - INFO - __main__ - Step 132082: {'lr': 1.786781516837868e-05, 'samples': 25359744, 'steps': 132081, 'loss/train': 1.5790435075759888} 11/07/2021 15:48:31 - INFO - __main__ - Step 132083: {'lr': 1.786584503596386e-05, 'samples': 25359936, 'steps': 132082, 'loss/train': 1.2088886499404907} 11/07/2021 15:48:31 - INFO - __main__ - Step 132084: {'lr': 1.786387500814507e-05, 'samples': 25360128, 'steps': 132083, 'loss/train': 1.2266610860824585} 11/07/2021 15:48:31 - INFO - __main__ - Step 132085: {'lr': 1.786190508492325e-05, 'samples': 25360320, 'steps': 132084, 'loss/train': 1.4085084199905396} 11/07/2021 15:48:32 - INFO - __main__ - Step 132086: {'lr': 1.7859935266299336e-05, 'samples': 25360512, 'steps': 132085, 'loss/train': 1.3414198160171509} 11/07/2021 15:48:33 - INFO - __main__ - Step 132087: {'lr': 1.7857965552274093e-05, 'samples': 25360704, 'steps': 132086, 'loss/train': 0.17793500423431396} 11/07/2021 15:48:33 - INFO - __main__ - Step 132088: {'lr': 1.7855995942848453e-05, 'samples': 25360896, 'steps': 132087, 'loss/train': 0.8917834758758545} 11/07/2021 15:48:33 - INFO - __main__ - Step 132089: {'lr': 1.7854026438023336e-05, 'samples': 25361088, 'steps': 132088, 'loss/train': 0.9905845522880554} 11/07/2021 15:48:34 - INFO - __main__ - Step 132090: {'lr': 1.78520570377996e-05, 'samples': 25361280, 'steps': 132089, 'loss/train': 1.4865503311157227} 11/07/2021 15:48:34 - INFO - __main__ - Step 132091: {'lr': 1.7850087742178168e-05, 'samples': 25361472, 'steps': 132090, 'loss/train': 1.2339859008789062} 11/07/2021 15:48:35 - INFO - __main__ - Step 132092: {'lr': 1.7848118551159892e-05, 'samples': 25361664, 'steps': 132091, 'loss/train': 1.2430895566940308} 11/07/2021 15:48:35 - INFO - __main__ - Step 132093: {'lr': 1.7846149464745666e-05, 'samples': 25361856, 'steps': 132092, 'loss/train': 1.0905119180679321} 11/07/2021 15:48:36 - INFO - __main__ - Step 132094: {'lr': 1.78441804829364e-05, 'samples': 25362048, 'steps': 132093, 'loss/train': 1.044770359992981} 11/07/2021 15:48:36 - INFO - __main__ - Step 132095: {'lr': 1.7842211605732932e-05, 'samples': 25362240, 'steps': 132094, 'loss/train': 2.0600249767303467} 11/07/2021 15:48:37 - INFO - __main__ - Step 132096: {'lr': 1.7840242833136207e-05, 'samples': 25362432, 'steps': 132095, 'loss/train': 1.0785763263702393} 11/07/2021 15:48:38 - INFO - __main__ - Step 132097: {'lr': 1.7838274165147078e-05, 'samples': 25362624, 'steps': 132096, 'loss/train': 1.1055912971496582} 11/07/2021 15:48:38 - INFO - __main__ - Step 132098: {'lr': 1.783630560176644e-05, 'samples': 25362816, 'steps': 132097, 'loss/train': 1.183279037475586} 11/07/2021 15:48:38 - INFO - __main__ - Step 132099: {'lr': 1.7834337142995178e-05, 'samples': 25363008, 'steps': 132098, 'loss/train': 1.4573616981506348} 11/07/2021 15:48:39 - INFO - __main__ - Step 132100: {'lr': 1.7832368788834236e-05, 'samples': 25363200, 'steps': 132099, 'loss/train': 1.465149998664856} 11/07/2021 15:48:39 - INFO - __main__ - Step 132101: {'lr': 1.783040053928439e-05, 'samples': 25363392, 'steps': 132100, 'loss/train': 1.2579498291015625} 11/07/2021 15:48:39 - INFO - __main__ - Step 132102: {'lr': 1.7828432394346588e-05, 'samples': 25363584, 'steps': 132101, 'loss/train': 1.3933204412460327} 11/07/2021 15:48:40 - INFO - __main__ - Step 132103: {'lr': 1.7826464354021714e-05, 'samples': 25363776, 'steps': 132102, 'loss/train': 1.4111491441726685} 11/07/2021 15:48:41 - INFO - __main__ - Step 132104: {'lr': 1.782449641831066e-05, 'samples': 25363968, 'steps': 132103, 'loss/train': 1.2973060607910156} 11/07/2021 15:48:41 - INFO - __main__ - Step 132105: {'lr': 1.7822528587214282e-05, 'samples': 25364160, 'steps': 132104, 'loss/train': 1.0963091850280762} 11/07/2021 15:48:41 - INFO - __main__ - Step 132106: {'lr': 1.7820560860733526e-05, 'samples': 25364352, 'steps': 132105, 'loss/train': 1.202226161956787} 11/07/2021 15:48:42 - INFO - __main__ - Step 132107: {'lr': 1.7818593238869197e-05, 'samples': 25364544, 'steps': 132106, 'loss/train': 1.0309780836105347} 11/07/2021 15:48:43 - INFO - __main__ - Step 132108: {'lr': 1.7816625721622265e-05, 'samples': 25364736, 'steps': 132107, 'loss/train': 1.3432948589324951} 11/07/2021 15:48:43 - INFO - __main__ - Step 132109: {'lr': 1.7814658308993563e-05, 'samples': 25364928, 'steps': 132108, 'loss/train': 0.9574604034423828} 11/07/2021 15:48:43 - INFO - __main__ - Step 132110: {'lr': 1.7812691000983983e-05, 'samples': 25365120, 'steps': 132109, 'loss/train': 1.4897651672363281} 11/07/2021 15:48:44 - INFO - __main__ - Step 132111: {'lr': 1.7810723797594434e-05, 'samples': 25365312, 'steps': 132110, 'loss/train': 1.5056945085525513} 11/07/2021 15:48:44 - INFO - __main__ - Step 132112: {'lr': 1.7808756698825784e-05, 'samples': 25365504, 'steps': 132111, 'loss/train': 1.4757604598999023} 11/07/2021 15:48:45 - INFO - __main__ - Step 132113: {'lr': 1.780678970467897e-05, 'samples': 25365696, 'steps': 132112, 'loss/train': 0.875765860080719} 11/07/2021 15:48:46 - INFO - __main__ - Step 132114: {'lr': 1.7804822815154804e-05, 'samples': 25365888, 'steps': 132113, 'loss/train': 1.2215358018875122} 11/07/2021 15:48:46 - INFO - __main__ - Step 132115: {'lr': 1.7802856030254196e-05, 'samples': 25366080, 'steps': 132114, 'loss/train': 0.3790520131587982} 11/07/2021 15:48:46 - INFO - __main__ - Step 132116: {'lr': 1.7800889349978033e-05, 'samples': 25366272, 'steps': 132115, 'loss/train': 1.2574594020843506} 11/07/2021 15:48:47 - INFO - __main__ - Step 132117: {'lr': 1.779892277432721e-05, 'samples': 25366464, 'steps': 132116, 'loss/train': 1.2417290210723877} 11/07/2021 15:48:48 - INFO - __main__ - Step 132118: {'lr': 1.779695630330261e-05, 'samples': 25366656, 'steps': 132117, 'loss/train': 1.3339053392410278} 11/07/2021 15:48:48 - INFO - __main__ - Step 132119: {'lr': 1.7794989936905093e-05, 'samples': 25366848, 'steps': 132118, 'loss/train': 1.3537042140960693} 11/07/2021 15:48:48 - INFO - __main__ - Step 132120: {'lr': 1.7793023675135607e-05, 'samples': 25367040, 'steps': 132119, 'loss/train': 1.3000441789627075} 11/07/2021 15:48:49 - INFO - __main__ - Step 132121: {'lr': 1.7791057517994978e-05, 'samples': 25367232, 'steps': 132120, 'loss/train': 1.375375509262085} 11/07/2021 15:48:49 - INFO - __main__ - Step 132122: {'lr': 1.778909146548413e-05, 'samples': 25367424, 'steps': 132121, 'loss/train': 1.3498769998550415} 11/07/2021 15:48:50 - INFO - __main__ - Step 132123: {'lr': 1.778712551760392e-05, 'samples': 25367616, 'steps': 132122, 'loss/train': 1.4030729532241821} 11/07/2021 15:48:50 - INFO - __main__ - Step 132124: {'lr': 1.778515967435526e-05, 'samples': 25367808, 'steps': 132123, 'loss/train': 2.2103052139282227} 11/07/2021 15:48:51 - INFO - __main__ - Step 132125: {'lr': 1.778319393573902e-05, 'samples': 25368000, 'steps': 132124, 'loss/train': 1.5762650966644287} 11/07/2021 15:48:51 - INFO - __main__ - Step 132126: {'lr': 1.7781228301756102e-05, 'samples': 25368192, 'steps': 132125, 'loss/train': 1.6615432500839233} 11/07/2021 15:48:52 - INFO - __main__ - Step 132127: {'lr': 1.7779262772407407e-05, 'samples': 25368384, 'steps': 132126, 'loss/train': 2.103727340698242} 11/07/2021 15:48:53 - INFO - __main__ - Step 132128: {'lr': 1.777729734769373e-05, 'samples': 25368576, 'steps': 132127, 'loss/train': 1.1909786462783813} 11/07/2021 15:48:53 - INFO - __main__ - Step 132129: {'lr': 1.7775332027616052e-05, 'samples': 25368768, 'steps': 132128, 'loss/train': 0.12675841152668} 11/07/2021 15:48:53 - INFO - __main__ - Step 132130: {'lr': 1.7773366812175202e-05, 'samples': 25368960, 'steps': 132129, 'loss/train': 1.0204793214797974} 11/07/2021 15:48:54 - INFO - __main__ - Step 132131: {'lr': 1.777140170137212e-05, 'samples': 25369152, 'steps': 132130, 'loss/train': 1.0188932418823242} 11/07/2021 15:48:54 - INFO - __main__ - Step 132132: {'lr': 1.7769436695207643e-05, 'samples': 25369344, 'steps': 132131, 'loss/train': 1.1000643968582153} 11/07/2021 15:48:55 - INFO - __main__ - Step 132133: {'lr': 1.776747179368268e-05, 'samples': 25369536, 'steps': 132132, 'loss/train': 1.0186254978179932} 11/07/2021 15:48:56 - INFO - __main__ - Step 132134: {'lr': 1.7765506996798104e-05, 'samples': 25369728, 'steps': 132133, 'loss/train': 1.2224462032318115} 11/07/2021 15:48:56 - INFO - __main__ - Step 132135: {'lr': 1.776354230455479e-05, 'samples': 25369920, 'steps': 132134, 'loss/train': 0.3057420551776886} 11/07/2021 15:48:56 - INFO - __main__ - Step 132136: {'lr': 1.7761577716953664e-05, 'samples': 25370112, 'steps': 132135, 'loss/train': 1.2482985258102417} 11/07/2021 15:48:57 - INFO - __main__ - Step 132137: {'lr': 1.775961323399558e-05, 'samples': 25370304, 'steps': 132136, 'loss/train': 1.3620655536651611} 11/07/2021 15:48:57 - INFO - __main__ - Step 132138: {'lr': 1.775764885568143e-05, 'samples': 25370496, 'steps': 132137, 'loss/train': 0.8726410269737244} 11/07/2021 15:48:58 - INFO - __main__ - Step 132139: {'lr': 1.77556845820121e-05, 'samples': 25370688, 'steps': 132138, 'loss/train': 1.3015209436416626} 11/07/2021 15:48:58 - INFO - __main__ - Step 132140: {'lr': 1.7753720412988532e-05, 'samples': 25370880, 'steps': 132139, 'loss/train': 1.2786822319030762} 11/07/2021 15:48:59 - INFO - __main__ - Step 132141: {'lr': 1.775175634861148e-05, 'samples': 25371072, 'steps': 132140, 'loss/train': 1.2058537006378174} 11/07/2021 15:48:59 - INFO - __main__ - Step 132142: {'lr': 1.7749792388881942e-05, 'samples': 25371264, 'steps': 132141, 'loss/train': 1.582973599433899} 11/07/2021 15:49:00 - INFO - __main__ - Step 132143: {'lr': 1.7747828533800746e-05, 'samples': 25371456, 'steps': 132142, 'loss/train': 1.9116202592849731} 11/07/2021 15:49:01 - INFO - __main__ - Step 132144: {'lr': 1.7745864783368786e-05, 'samples': 25371648, 'steps': 132143, 'loss/train': 1.233319878578186} 11/07/2021 15:49:01 - INFO - __main__ - Step 132145: {'lr': 1.7743901137586947e-05, 'samples': 25371840, 'steps': 132144, 'loss/train': 1.5315698385238647} 11/07/2021 15:49:02 - INFO - __main__ - Step 132146: {'lr': 1.7741937596456147e-05, 'samples': 25372032, 'steps': 132145, 'loss/train': 1.4288915395736694} 11/07/2021 15:49:02 - INFO - __main__ - Step 132147: {'lr': 1.7739974159977244e-05, 'samples': 25372224, 'steps': 132146, 'loss/train': 2.289247989654541} 11/07/2021 15:49:02 - INFO - __main__ - Step 132148: {'lr': 1.77380108281511e-05, 'samples': 25372416, 'steps': 132147, 'loss/train': 1.3953111171722412} 11/07/2021 15:49:03 - INFO - __main__ - Step 132149: {'lr': 1.773604760097866e-05, 'samples': 25372608, 'steps': 132148, 'loss/train': 1.539921760559082} 11/07/2021 15:49:04 - INFO - __main__ - Step 132150: {'lr': 1.7734084478460756e-05, 'samples': 25372800, 'steps': 132149, 'loss/train': 1.834219217300415} 11/07/2021 15:49:04 - INFO - __main__ - Step 132151: {'lr': 1.773212146059827e-05, 'samples': 25372992, 'steps': 132150, 'loss/train': 1.0493335723876953} 11/07/2021 15:49:04 - INFO - __main__ - Step 132152: {'lr': 1.7730158547392156e-05, 'samples': 25373184, 'steps': 132151, 'loss/train': 1.2917211055755615} 11/07/2021 15:49:05 - INFO - __main__ - Step 132153: {'lr': 1.772819573884324e-05, 'samples': 25373376, 'steps': 132152, 'loss/train': 1.4152929782867432} 11/07/2021 15:49:06 - INFO - __main__ - Step 132154: {'lr': 1.772623303495238e-05, 'samples': 25373568, 'steps': 132153, 'loss/train': 0.9799885153770447} 11/07/2021 15:49:06 - INFO - __main__ - Step 132155: {'lr': 1.77242704357205e-05, 'samples': 25373760, 'steps': 132154, 'loss/train': 1.1277552843093872} 11/07/2021 15:49:07 - INFO - __main__ - Step 132156: {'lr': 1.7722307941148486e-05, 'samples': 25373952, 'steps': 132155, 'loss/train': 0.4280405640602112} 11/07/2021 15:49:07 - INFO - __main__ - Step 132157: {'lr': 1.772034555123722e-05, 'samples': 25374144, 'steps': 132156, 'loss/train': 1.3715486526489258} 11/07/2021 15:49:07 - INFO - __main__ - Step 132158: {'lr': 1.771838326598757e-05, 'samples': 25374336, 'steps': 132157, 'loss/train': 1.2723183631896973} 11/07/2021 15:49:08 - INFO - __main__ - Step 132159: {'lr': 1.7716421085400446e-05, 'samples': 25374528, 'steps': 132158, 'loss/train': 0.762882649898529} 11/07/2021 15:49:09 - INFO - __main__ - Step 132160: {'lr': 1.7714459009476712e-05, 'samples': 25374720, 'steps': 132159, 'loss/train': 1.6572155952453613} 11/07/2021 15:49:09 - INFO - __main__ - Step 132161: {'lr': 1.7712497038217258e-05, 'samples': 25374912, 'steps': 132160, 'loss/train': 1.397033929824829} 11/07/2021 15:49:09 - INFO - __main__ - Step 132162: {'lr': 1.7710535171622992e-05, 'samples': 25375104, 'steps': 132161, 'loss/train': 1.56415593624115} 11/07/2021 15:49:10 - INFO - __main__ - Step 132163: {'lr': 1.7708573409694757e-05, 'samples': 25375296, 'steps': 132162, 'loss/train': 1.3002045154571533} 11/07/2021 15:49:10 - INFO - __main__ - Step 132164: {'lr': 1.770661175243346e-05, 'samples': 25375488, 'steps': 132163, 'loss/train': 1.3403892517089844} 11/07/2021 15:49:11 - INFO - __main__ - Step 132165: {'lr': 1.7704650199839968e-05, 'samples': 25375680, 'steps': 132164, 'loss/train': 1.0521832704544067} 11/07/2021 15:49:12 - INFO - __main__ - Step 132166: {'lr': 1.7702688751915165e-05, 'samples': 25375872, 'steps': 132165, 'loss/train': 1.5777755975723267} 11/07/2021 15:49:12 - INFO - __main__ - Step 132167: {'lr': 1.770072740865997e-05, 'samples': 25376064, 'steps': 132166, 'loss/train': 1.4047362804412842} 11/07/2021 15:49:12 - INFO - __main__ - Step 132168: {'lr': 1.7698766170075208e-05, 'samples': 25376256, 'steps': 132167, 'loss/train': 0.796167254447937} 11/07/2021 15:49:13 - INFO - __main__ - Step 132169: {'lr': 1.7696805036161832e-05, 'samples': 25376448, 'steps': 132168, 'loss/train': 0.996609091758728} 11/07/2021 15:49:14 - INFO - __main__ - Step 132170: {'lr': 1.7694844006920675e-05, 'samples': 25376640, 'steps': 132169, 'loss/train': 1.040115475654602} 11/07/2021 15:49:14 - INFO - __main__ - Step 132171: {'lr': 1.7692883082352617e-05, 'samples': 25376832, 'steps': 132170, 'loss/train': 1.0881774425506592} 11/07/2021 15:49:14 - INFO - __main__ - Step 132172: {'lr': 1.7690922262458607e-05, 'samples': 25377024, 'steps': 132171, 'loss/train': 1.500204086303711} 11/07/2021 15:49:15 - INFO - __main__ - Step 132173: {'lr': 1.768896154723948e-05, 'samples': 25377216, 'steps': 132172, 'loss/train': 1.188974380493164} 11/07/2021 15:49:15 - INFO - __main__ - Step 132174: {'lr': 1.768700093669612e-05, 'samples': 25377408, 'steps': 132173, 'loss/train': 1.6476753950119019} 11/07/2021 15:49:16 - INFO - __main__ - Step 132175: {'lr': 1.7685040430829387e-05, 'samples': 25377600, 'steps': 132174, 'loss/train': 1.5342386960983276} 11/07/2021 15:49:16 - INFO - __main__ - Step 132176: {'lr': 1.7683080029640196e-05, 'samples': 25377792, 'steps': 132175, 'loss/train': 1.0555669069290161} 11/07/2021 15:49:17 - INFO - __main__ - Step 132177: {'lr': 1.7681119733129413e-05, 'samples': 25377984, 'steps': 132176, 'loss/train': 1.3322007656097412} 11/07/2021 15:49:17 - INFO - __main__ - Step 132178: {'lr': 1.767915954129795e-05, 'samples': 25378176, 'steps': 132177, 'loss/train': 1.1002819538116455} 11/07/2021 15:49:17 - INFO - __main__ - Step 132179: {'lr': 1.767719945414667e-05, 'samples': 25378368, 'steps': 132178, 'loss/train': 1.430343747138977} 11/07/2021 15:49:19 - INFO - __main__ - Step 132180: {'lr': 1.7675239471676456e-05, 'samples': 25378560, 'steps': 132179, 'loss/train': 1.3799504041671753} 11/07/2021 15:49:19 - INFO - __main__ - Step 132181: {'lr': 1.7673279593888174e-05, 'samples': 25378752, 'steps': 132180, 'loss/train': 1.280895709991455} 11/07/2021 15:49:19 - INFO - __main__ - Step 132182: {'lr': 1.7671319820782766e-05, 'samples': 25378944, 'steps': 132181, 'loss/train': 0.8099910616874695} 11/07/2021 15:49:20 - INFO - __main__ - Step 132183: {'lr': 1.7669360152361062e-05, 'samples': 25379136, 'steps': 132182, 'loss/train': 1.3912752866744995} 11/07/2021 15:49:20 - INFO - __main__ - Step 132184: {'lr': 1.7667400588623984e-05, 'samples': 25379328, 'steps': 132183, 'loss/train': 1.143808126449585} 11/07/2021 15:49:21 - INFO - __main__ - Step 132185: {'lr': 1.766544112957236e-05, 'samples': 25379520, 'steps': 132184, 'loss/train': 1.4763518571853638} 11/07/2021 15:49:21 - INFO - __main__ - Step 132186: {'lr': 1.76634817752071e-05, 'samples': 25379712, 'steps': 132185, 'loss/train': 1.5645866394042969} 11/07/2021 15:49:22 - INFO - __main__ - Step 132187: {'lr': 1.7661522525529107e-05, 'samples': 25379904, 'steps': 132186, 'loss/train': 1.3661587238311768} 11/07/2021 15:49:22 - INFO - __main__ - Step 132188: {'lr': 1.765956338053923e-05, 'samples': 25380096, 'steps': 132187, 'loss/train': 1.409630298614502} 11/07/2021 15:49:22 - INFO - __main__ - Step 132189: {'lr': 1.765760434023836e-05, 'samples': 25380288, 'steps': 132188, 'loss/train': 1.4405690431594849} 11/07/2021 15:49:23 - INFO - __main__ - Step 132190: {'lr': 1.7655645404627412e-05, 'samples': 25380480, 'steps': 132189, 'loss/train': 1.2896982431411743} 11/07/2021 15:49:24 - INFO - __main__ - Step 132191: {'lr': 1.765368657370722e-05, 'samples': 25380672, 'steps': 132190, 'loss/train': 1.509154200553894} 11/07/2021 15:49:24 - INFO - __main__ - Step 132192: {'lr': 1.7651727847478703e-05, 'samples': 25380864, 'steps': 132191, 'loss/train': 1.0785781145095825} 11/07/2021 15:49:25 - INFO - __main__ - Step 132193: {'lr': 1.7649769225942746e-05, 'samples': 25381056, 'steps': 132192, 'loss/train': 1.4806404113769531} 11/07/2021 15:49:25 - INFO - __main__ - Step 132194: {'lr': 1.764781070910021e-05, 'samples': 25381248, 'steps': 132193, 'loss/train': 1.1323367357254028} 11/07/2021 15:49:25 - INFO - __main__ - Step 132195: {'lr': 1.764585229695201e-05, 'samples': 25381440, 'steps': 132194, 'loss/train': 1.4257327318191528} 11/07/2021 15:49:26 - INFO - __main__ - Step 132196: {'lr': 1.7643893989498976e-05, 'samples': 25381632, 'steps': 132195, 'loss/train': 1.425140380859375} 11/07/2021 15:49:27 - INFO - __main__ - Step 132197: {'lr': 1.7641935786742003e-05, 'samples': 25381824, 'steps': 132196, 'loss/train': 1.202623963356018} 11/07/2021 15:49:27 - INFO - __main__ - Step 132198: {'lr': 1.7639977688682002e-05, 'samples': 25382016, 'steps': 132197, 'loss/train': 1.1363749504089355} 11/07/2021 15:49:27 - INFO - __main__ - Step 132199: {'lr': 1.7638019695319834e-05, 'samples': 25382208, 'steps': 132198, 'loss/train': 1.5810614824295044} 11/07/2021 15:49:28 - INFO - __main__ - Step 132200: {'lr': 1.7636061806656417e-05, 'samples': 25382400, 'steps': 132199, 'loss/train': 1.727929949760437} 11/07/2021 15:49:29 - INFO - __main__ - Step 132201: {'lr': 1.763410402269258e-05, 'samples': 25382592, 'steps': 132200, 'loss/train': 1.5938632488250732} 11/07/2021 15:49:29 - INFO - __main__ - Step 132202: {'lr': 1.7632146343429216e-05, 'samples': 25382784, 'steps': 132201, 'loss/train': 1.08375084400177} 11/07/2021 15:49:29 - INFO - __main__ - Step 132203: {'lr': 1.763018876886724e-05, 'samples': 25382976, 'steps': 132202, 'loss/train': 1.1466501951217651} 11/07/2021 15:49:30 - INFO - __main__ - Step 132204: {'lr': 1.7628231299007536e-05, 'samples': 25383168, 'steps': 132203, 'loss/train': 1.1952579021453857} 11/07/2021 15:49:30 - INFO - __main__ - Step 132205: {'lr': 1.7626273933850938e-05, 'samples': 25383360, 'steps': 132204, 'loss/train': 1.2310236692428589} 11/07/2021 15:49:31 - INFO - __main__ - Step 132206: {'lr': 1.7624316673398366e-05, 'samples': 25383552, 'steps': 132205, 'loss/train': 1.4009900093078613} 11/07/2021 15:49:31 - INFO - __main__ - Step 132207: {'lr': 1.7622359517650705e-05, 'samples': 25383744, 'steps': 132206, 'loss/train': 1.1431756019592285} 11/07/2021 15:49:32 - INFO - __main__ - Step 132208: {'lr': 1.7620402466608814e-05, 'samples': 25383936, 'steps': 132207, 'loss/train': 1.3285671472549438} 11/07/2021 15:49:32 - INFO - __main__ - Step 132209: {'lr': 1.7618445520273556e-05, 'samples': 25384128, 'steps': 132208, 'loss/train': 1.2756696939468384} 11/07/2021 15:49:33 - INFO - __main__ - Step 132210: {'lr': 1.7616488678645874e-05, 'samples': 25384320, 'steps': 132209, 'loss/train': 1.072708249092102} 11/07/2021 15:49:34 - INFO - __main__ - Step 132211: {'lr': 1.7614531941726603e-05, 'samples': 25384512, 'steps': 132210, 'loss/train': 1.4852359294891357} 11/07/2021 15:49:34 - INFO - __main__ - Step 132212: {'lr': 1.7612575309516626e-05, 'samples': 25384704, 'steps': 132211, 'loss/train': 1.180122971534729} 11/07/2021 15:49:34 - INFO - __main__ - Step 132213: {'lr': 1.7610618782016836e-05, 'samples': 25384896, 'steps': 132212, 'loss/train': 1.2740122079849243} 11/07/2021 15:49:35 - INFO - __main__ - Step 132214: {'lr': 1.7608662359228146e-05, 'samples': 25385088, 'steps': 132213, 'loss/train': 1.3622931241989136} 11/07/2021 15:49:35 - INFO - __main__ - Step 132215: {'lr': 1.7606706041151387e-05, 'samples': 25385280, 'steps': 132214, 'loss/train': 0.805972158908844} 11/07/2021 15:49:36 - INFO - __main__ - Step 132216: {'lr': 1.7604749827787452e-05, 'samples': 25385472, 'steps': 132215, 'loss/train': 1.4420238733291626} 11/07/2021 15:49:36 - INFO - __main__ - Step 132217: {'lr': 1.7602793719137228e-05, 'samples': 25385664, 'steps': 132216, 'loss/train': 1.3845983743667603} 11/07/2021 15:49:37 - INFO - __main__ - Step 132218: {'lr': 1.76008377152016e-05, 'samples': 25385856, 'steps': 132217, 'loss/train': 1.4704444408416748} 11/07/2021 15:49:37 - INFO - __main__ - Step 132219: {'lr': 1.759888181598146e-05, 'samples': 25386048, 'steps': 132218, 'loss/train': 1.4420452117919922} 11/07/2021 15:49:37 - INFO - __main__ - Step 132220: {'lr': 1.7596926021477695e-05, 'samples': 25386240, 'steps': 132219, 'loss/train': 1.21333646774292} 11/07/2021 15:49:38 - INFO - __main__ - Step 132221: {'lr': 1.7594970331691192e-05, 'samples': 25386432, 'steps': 132220, 'loss/train': 1.383949637413025} 11/07/2021 15:49:39 - INFO - __main__ - Step 132222: {'lr': 1.7593014746622755e-05, 'samples': 25386624, 'steps': 132221, 'loss/train': 1.4124541282653809} 11/07/2021 15:49:39 - INFO - __main__ - Step 132223: {'lr': 1.7591059266273328e-05, 'samples': 25386816, 'steps': 132222, 'loss/train': 1.236230731010437} 11/07/2021 15:49:40 - INFO - __main__ - Step 132224: {'lr': 1.7589103890643776e-05, 'samples': 25387008, 'steps': 132223, 'loss/train': 1.4111109972000122} 11/07/2021 15:49:40 - INFO - __main__ - Step 132225: {'lr': 1.758714861973501e-05, 'samples': 25387200, 'steps': 132224, 'loss/train': 1.4524967670440674} 11/07/2021 15:49:40 - INFO - __main__ - Step 132226: {'lr': 1.7585193453547864e-05, 'samples': 25387392, 'steps': 132225, 'loss/train': 1.444976806640625} 11/07/2021 15:49:41 - INFO - __main__ - Step 132227: {'lr': 1.7583238392083256e-05, 'samples': 25387584, 'steps': 132226, 'loss/train': 0.789876401424408} 11/07/2021 15:49:42 - INFO - __main__ - Step 132228: {'lr': 1.7581283435342044e-05, 'samples': 25387776, 'steps': 132227, 'loss/train': 1.2616465091705322} 11/07/2021 15:49:42 - INFO - __main__ - Step 132229: {'lr': 1.7579328583325142e-05, 'samples': 25387968, 'steps': 132228, 'loss/train': 1.1991839408874512} 11/07/2021 15:49:42 - INFO - __main__ - Step 132230: {'lr': 1.757737383603339e-05, 'samples': 25388160, 'steps': 132229, 'loss/train': 1.4649018049240112} 11/07/2021 15:49:43 - INFO - __main__ - Step 132231: {'lr': 1.7575419193467695e-05, 'samples': 25388352, 'steps': 132230, 'loss/train': 1.3838789463043213} 11/07/2021 15:49:44 - INFO - __main__ - Step 132232: {'lr': 1.7573464655628924e-05, 'samples': 25388544, 'steps': 132231, 'loss/train': 0.713299572467804} 11/07/2021 15:49:44 - INFO - __main__ - Step 132233: {'lr': 1.757151022251796e-05, 'samples': 25388736, 'steps': 132232, 'loss/train': 1.1756185293197632} 11/07/2021 15:49:45 - INFO - __main__ - Step 132234: {'lr': 1.7569555894135723e-05, 'samples': 25388928, 'steps': 132233, 'loss/train': 1.0543968677520752} 11/07/2021 15:49:45 - INFO - __main__ - Step 132235: {'lr': 1.7567601670483048e-05, 'samples': 25389120, 'steps': 132234, 'loss/train': 1.375550627708435} 11/07/2021 15:49:45 - INFO - __main__ - Step 132236: {'lr': 1.7565647551560786e-05, 'samples': 25389312, 'steps': 132235, 'loss/train': 1.2748777866363525} 11/07/2021 15:49:46 - INFO - __main__ - Step 132237: {'lr': 1.756369353736989e-05, 'samples': 25389504, 'steps': 132236, 'loss/train': 1.6050636768341064} 11/07/2021 15:49:47 - INFO - __main__ - Step 132238: {'lr': 1.7561739627911187e-05, 'samples': 25389696, 'steps': 132237, 'loss/train': 1.3817147016525269} 11/07/2021 15:49:47 - INFO - __main__ - Step 132239: {'lr': 1.755978582318557e-05, 'samples': 25389888, 'steps': 132238, 'loss/train': 1.5792454481124878} 11/07/2021 15:49:47 - INFO - __main__ - Step 132240: {'lr': 1.755783212319395e-05, 'samples': 25390080, 'steps': 132239, 'loss/train': 1.7167359590530396} 11/07/2021 15:49:48 - INFO - __main__ - Step 132241: {'lr': 1.7555878527937163e-05, 'samples': 25390272, 'steps': 132240, 'loss/train': 1.0033553838729858} 11/07/2021 15:49:48 - INFO - __main__ - Step 132242: {'lr': 1.7553925037416125e-05, 'samples': 25390464, 'steps': 132241, 'loss/train': 1.5870808362960815} 11/07/2021 15:49:49 - INFO - __main__ - Step 132243: {'lr': 1.7551971651631694e-05, 'samples': 25390656, 'steps': 132242, 'loss/train': 1.2570754289627075} 11/07/2021 15:49:49 - INFO - __main__ - Step 132244: {'lr': 1.7550018370584757e-05, 'samples': 25390848, 'steps': 132243, 'loss/train': 1.4011304378509521} 11/07/2021 15:49:50 - INFO - __main__ - Step 132245: {'lr': 1.754806519427621e-05, 'samples': 25391040, 'steps': 132244, 'loss/train': 1.2103098630905151} 11/07/2021 15:49:50 - INFO - __main__ - Step 132246: {'lr': 1.7546112122706903e-05, 'samples': 25391232, 'steps': 132245, 'loss/train': 1.3105520009994507} 11/07/2021 15:49:50 - INFO - __main__ - Step 132247: {'lr': 1.754415915587773e-05, 'samples': 25391424, 'steps': 132246, 'loss/train': 0.9875726103782654} 11/07/2021 15:49:52 - INFO - __main__ - Step 132248: {'lr': 1.754220629378961e-05, 'samples': 25391616, 'steps': 132247, 'loss/train': 0.9729985594749451} 11/07/2021 15:49:52 - INFO - __main__ - Step 132249: {'lr': 1.754025353644334e-05, 'samples': 25391808, 'steps': 132248, 'loss/train': 1.3794583082199097} 11/07/2021 15:49:52 - INFO - __main__ - Step 132250: {'lr': 1.753830088383987e-05, 'samples': 25392000, 'steps': 132249, 'loss/train': 1.2714896202087402} 11/07/2021 15:49:53 - INFO - __main__ - Step 132251: {'lr': 1.7536348335980028e-05, 'samples': 25392192, 'steps': 132250, 'loss/train': 0.9810245633125305} 11/07/2021 15:49:53 - INFO - __main__ - Step 132252: {'lr': 1.7534395892864734e-05, 'samples': 25392384, 'steps': 132251, 'loss/train': 0.7896389961242676} 11/07/2021 15:49:54 - INFO - __main__ - Step 132253: {'lr': 1.7532443554494848e-05, 'samples': 25392576, 'steps': 132252, 'loss/train': 1.7075474262237549} 11/07/2021 15:49:54 - INFO - __main__ - Step 132254: {'lr': 1.7530491320871256e-05, 'samples': 25392768, 'steps': 132253, 'loss/train': 1.4943749904632568} 11/07/2021 15:49:55 - INFO - __main__ - Step 132255: {'lr': 1.7528539191994847e-05, 'samples': 25392960, 'steps': 132254, 'loss/train': 2.5787432193756104} 11/07/2021 15:49:55 - INFO - __main__ - Step 132256: {'lr': 1.752658716786648e-05, 'samples': 25393152, 'steps': 132255, 'loss/train': 1.5998286008834839} 11/07/2021 15:49:55 - INFO - __main__ - Step 132257: {'lr': 1.7524635248487046e-05, 'samples': 25393344, 'steps': 132256, 'loss/train': 1.5543800592422485} 11/07/2021 15:49:56 - INFO - __main__ - Step 132258: {'lr': 1.7522683433857432e-05, 'samples': 25393536, 'steps': 132257, 'loss/train': 1.1975334882736206} 11/07/2021 15:49:57 - INFO - __main__ - Step 132259: {'lr': 1.75207317239785e-05, 'samples': 25393728, 'steps': 132258, 'loss/train': 1.5937517881393433} 11/07/2021 15:49:57 - INFO - __main__ - Step 132260: {'lr': 1.7518780118851165e-05, 'samples': 25393920, 'steps': 132259, 'loss/train': 1.028466820716858} 11/07/2021 15:49:57 - INFO - __main__ - Step 132261: {'lr': 1.7516828618476283e-05, 'samples': 25394112, 'steps': 132260, 'loss/train': 1.6936215162277222} 11/07/2021 15:49:58 - INFO - __main__ - Step 132262: {'lr': 1.751487722285472e-05, 'samples': 25394304, 'steps': 132261, 'loss/train': 1.3251985311508179} 11/07/2021 15:49:58 - INFO - __main__ - Step 132263: {'lr': 1.751292593198736e-05, 'samples': 25394496, 'steps': 132262, 'loss/train': 1.280163288116455} 11/07/2021 15:49:59 - INFO - __main__ - Step 132264: {'lr': 1.751097474587507e-05, 'samples': 25394688, 'steps': 132263, 'loss/train': 1.5441203117370605} 11/07/2021 15:50:00 - INFO - __main__ - Step 132265: {'lr': 1.7509023664518758e-05, 'samples': 25394880, 'steps': 132264, 'loss/train': 0.8131648302078247} 11/07/2021 15:50:00 - INFO - __main__ - Step 132266: {'lr': 1.750707268791929e-05, 'samples': 25395072, 'steps': 132265, 'loss/train': 1.097285270690918} 11/07/2021 15:50:00 - INFO - __main__ - Step 132267: {'lr': 1.7505121816077552e-05, 'samples': 25395264, 'steps': 132266, 'loss/train': 1.3782728910446167} 11/07/2021 15:50:01 - INFO - __main__ - Step 132268: {'lr': 1.750317104899443e-05, 'samples': 25395456, 'steps': 132267, 'loss/train': 1.2505087852478027} 11/07/2021 15:50:02 - INFO - __main__ - Step 132269: {'lr': 1.7501220386670763e-05, 'samples': 25395648, 'steps': 132268, 'loss/train': 1.6715099811553955} 11/07/2021 15:50:02 - INFO - __main__ - Step 132270: {'lr': 1.749926982910749e-05, 'samples': 25395840, 'steps': 132269, 'loss/train': 1.373154640197754} 11/07/2021 15:50:02 - INFO - __main__ - Step 132271: {'lr': 1.7497319376305444e-05, 'samples': 25396032, 'steps': 132270, 'loss/train': 1.2382752895355225} 11/07/2021 15:50:03 - INFO - __main__ - Step 132272: {'lr': 1.7495369028265514e-05, 'samples': 25396224, 'steps': 132271, 'loss/train': 0.5433831810951233} 11/07/2021 15:50:03 - INFO - __main__ - Step 132273: {'lr': 1.7493418784988584e-05, 'samples': 25396416, 'steps': 132272, 'loss/train': 1.522895336151123} 11/07/2021 15:50:04 - INFO - __main__ - Step 132274: {'lr': 1.749146864647552e-05, 'samples': 25396608, 'steps': 132273, 'loss/train': 1.8301937580108643} 11/07/2021 15:50:04 - INFO - __main__ - Step 132275: {'lr': 1.7489518612727268e-05, 'samples': 25396800, 'steps': 132274, 'loss/train': 1.121106505393982} 11/07/2021 15:50:05 - INFO - __main__ - Step 132276: {'lr': 1.74875686837446e-05, 'samples': 25396992, 'steps': 132275, 'loss/train': 1.256129264831543} 11/07/2021 15:50:05 - INFO - __main__ - Step 132277: {'lr': 1.748561885952846e-05, 'samples': 25397184, 'steps': 132276, 'loss/train': 1.2157138586044312} 11/07/2021 15:50:06 - INFO - __main__ - Step 132278: {'lr': 1.7483669140079705e-05, 'samples': 25397376, 'steps': 132277, 'loss/train': 1.6837258338928223} 11/07/2021 15:50:07 - INFO - __main__ - Step 132279: {'lr': 1.748171952539923e-05, 'samples': 25397568, 'steps': 132278, 'loss/train': 1.2017669677734375} 11/07/2021 15:50:07 - INFO - __main__ - Step 132280: {'lr': 1.7479770015487895e-05, 'samples': 25397760, 'steps': 132279, 'loss/train': 1.1059561967849731} 11/07/2021 15:50:07 - INFO - __main__ - Step 132281: {'lr': 1.7477820610346584e-05, 'samples': 25397952, 'steps': 132280, 'loss/train': 1.3437066078186035} 11/07/2021 15:50:08 - INFO - __main__ - Step 132282: {'lr': 1.7475871309976187e-05, 'samples': 25398144, 'steps': 132281, 'loss/train': 1.529368281364441} 11/07/2021 15:50:08 - INFO - __main__ - Step 132283: {'lr': 1.7473922114377565e-05, 'samples': 25398336, 'steps': 132282, 'loss/train': 1.3505057096481323} 11/07/2021 15:50:09 - INFO - __main__ - Step 132284: {'lr': 1.7471973023551606e-05, 'samples': 25398528, 'steps': 132283, 'loss/train': 1.1840471029281616} 11/07/2021 15:50:09 - INFO - __main__ - Step 132285: {'lr': 1.74700240374992e-05, 'samples': 25398720, 'steps': 132284, 'loss/train': 1.9821754693984985} 11/07/2021 15:50:10 - INFO - __main__ - Step 132286: {'lr': 1.74680751562212e-05, 'samples': 25398912, 'steps': 132285, 'loss/train': 1.3491884469985962} 11/07/2021 15:50:10 - INFO - __main__ - Step 132287: {'lr': 1.7466126379718507e-05, 'samples': 25399104, 'steps': 132286, 'loss/train': 1.484886884689331} 11/07/2021 15:50:10 - INFO - __main__ - Step 132288: {'lr': 1.7464177707992023e-05, 'samples': 25399296, 'steps': 132287, 'loss/train': 1.012039065361023} 11/07/2021 15:50:11 - INFO - __main__ - Step 132289: {'lr': 1.7462229141042563e-05, 'samples': 25399488, 'steps': 132288, 'loss/train': 2.238940715789795} 11/07/2021 15:50:12 - INFO - __main__ - Step 132290: {'lr': 1.7460280678871037e-05, 'samples': 25399680, 'steps': 132289, 'loss/train': 1.3213860988616943} 11/07/2021 15:50:12 - INFO - __main__ - Step 132291: {'lr': 1.745833232147831e-05, 'samples': 25399872, 'steps': 132290, 'loss/train': 1.4229035377502441} 11/07/2021 15:50:13 - INFO - __main__ - Step 132292: {'lr': 1.7456384068865267e-05, 'samples': 25400064, 'steps': 132291, 'loss/train': 1.6790592670440674} 11/07/2021 15:50:13 - INFO - __main__ - Step 132293: {'lr': 1.7454435921032797e-05, 'samples': 25400256, 'steps': 132292, 'loss/train': 1.0114983320236206} 11/07/2021 15:50:14 - INFO - __main__ - Step 132294: {'lr': 1.745248787798176e-05, 'samples': 25400448, 'steps': 132293, 'loss/train': 1.3035147190093994} 11/07/2021 15:50:14 - INFO - __main__ - Step 132295: {'lr': 1.7450539939713044e-05, 'samples': 25400640, 'steps': 132294, 'loss/train': 1.4521371126174927} 11/07/2021 15:50:15 - INFO - __main__ - Step 132296: {'lr': 1.744859210622754e-05, 'samples': 25400832, 'steps': 132295, 'loss/train': 1.5491708517074585} 11/07/2021 15:50:15 - INFO - __main__ - Step 132297: {'lr': 1.7446644377526078e-05, 'samples': 25401024, 'steps': 132296, 'loss/train': 1.1974238157272339} 11/07/2021 15:50:15 - INFO - __main__ - Step 132298: {'lr': 1.7444696753609602e-05, 'samples': 25401216, 'steps': 132297, 'loss/train': 1.276222586631775} 11/07/2021 15:50:16 - INFO - __main__ - Step 132299: {'lr': 1.7442749234478942e-05, 'samples': 25401408, 'steps': 132298, 'loss/train': 1.1011494398117065} 11/07/2021 15:50:17 - INFO - __main__ - Step 132300: {'lr': 1.7440801820134993e-05, 'samples': 25401600, 'steps': 132299, 'loss/train': 1.6630948781967163} 11/07/2021 15:50:17 - INFO - __main__ - Step 132301: {'lr': 1.7438854510578695e-05, 'samples': 25401792, 'steps': 132300, 'loss/train': 1.519823670387268} 11/07/2021 15:50:17 - INFO - __main__ - Step 132302: {'lr': 1.7436907305810795e-05, 'samples': 25401984, 'steps': 132301, 'loss/train': 1.43874192237854} 11/07/2021 15:50:18 - INFO - __main__ - Step 132303: {'lr': 1.743496020583224e-05, 'samples': 25402176, 'steps': 132302, 'loss/train': 1.62063729763031} 11/07/2021 15:50:18 - INFO - __main__ - Step 132304: {'lr': 1.7433013210643917e-05, 'samples': 25402368, 'steps': 132303, 'loss/train': 1.5408011674880981} 11/07/2021 15:50:19 - INFO - __main__ - Step 132305: {'lr': 1.7431066320246657e-05, 'samples': 25402560, 'steps': 132304, 'loss/train': 0.7178860306739807} 11/07/2021 15:50:20 - INFO - __main__ - Step 132306: {'lr': 1.7429119534641407e-05, 'samples': 25402752, 'steps': 132305, 'loss/train': 1.184333086013794} 11/07/2021 15:50:20 - INFO - __main__ - Step 132307: {'lr': 1.7427172853828972e-05, 'samples': 25402944, 'steps': 132306, 'loss/train': 1.297568440437317} 11/07/2021 15:50:20 - INFO - __main__ - Step 132308: {'lr': 1.742522627781029e-05, 'samples': 25403136, 'steps': 132307, 'loss/train': 1.1082082986831665} 11/07/2021 15:50:21 - INFO - __main__ - Step 132309: {'lr': 1.74232798065862e-05, 'samples': 25403328, 'steps': 132308, 'loss/train': 1.4337953329086304} 11/07/2021 15:50:22 - INFO - __main__ - Step 132310: {'lr': 1.7421333440157587e-05, 'samples': 25403520, 'steps': 132309, 'loss/train': 1.3859398365020752} 11/07/2021 15:50:22 - INFO - __main__ - Step 132311: {'lr': 1.7419387178525342e-05, 'samples': 25403712, 'steps': 132310, 'loss/train': 1.0448468923568726} 11/07/2021 15:50:22 - INFO - __main__ - Step 132312: {'lr': 1.741744102169035e-05, 'samples': 25403904, 'steps': 132311, 'loss/train': 1.4972829818725586} 11/07/2021 15:50:23 - INFO - __main__ - Step 132313: {'lr': 1.7415494969653445e-05, 'samples': 25404096, 'steps': 132312, 'loss/train': 1.4667407274246216} 11/07/2021 15:50:23 - INFO - __main__ - Step 132314: {'lr': 1.7413549022415544e-05, 'samples': 25404288, 'steps': 132313, 'loss/train': 1.2763376235961914} 11/07/2021 15:50:24 - INFO - __main__ - Step 132315: {'lr': 1.741160317997753e-05, 'samples': 25404480, 'steps': 132314, 'loss/train': 1.3763004541397095} 11/07/2021 15:50:24 - INFO - __main__ - Step 132316: {'lr': 1.7409657442340215e-05, 'samples': 25404672, 'steps': 132315, 'loss/train': 1.4417924880981445} 11/07/2021 15:50:25 - INFO - __main__ - Step 132317: {'lr': 1.740771180950454e-05, 'samples': 25404864, 'steps': 132316, 'loss/train': 1.4647510051727295} 11/07/2021 15:50:25 - INFO - __main__ - Step 132318: {'lr': 1.7405766281471365e-05, 'samples': 25405056, 'steps': 132317, 'loss/train': 1.5872697830200195} 11/07/2021 15:50:25 - INFO - __main__ - Step 132319: {'lr': 1.7403820858241547e-05, 'samples': 25405248, 'steps': 132318, 'loss/train': 0.9568372368812561} 11/07/2021 15:50:27 - INFO - __main__ - Step 132320: {'lr': 1.740187553981598e-05, 'samples': 25405440, 'steps': 132319, 'loss/train': 1.0969856977462769} 11/07/2021 15:50:27 - INFO - __main__ - Step 132321: {'lr': 1.7399930326195523e-05, 'samples': 25405632, 'steps': 132320, 'loss/train': 1.2058782577514648} 11/07/2021 15:50:27 - INFO - __main__ - Step 132322: {'lr': 1.739798521738109e-05, 'samples': 25405824, 'steps': 132321, 'loss/train': 1.1252830028533936} 11/07/2021 15:50:28 - INFO - __main__ - Step 132323: {'lr': 1.7396040213373542e-05, 'samples': 25406016, 'steps': 132322, 'loss/train': 1.3163317441940308} 11/07/2021 15:50:28 - INFO - __main__ - Step 132324: {'lr': 1.7394095314173742e-05, 'samples': 25406208, 'steps': 132323, 'loss/train': 1.3674721717834473} 11/07/2021 15:50:29 - INFO - __main__ - Step 132325: {'lr': 1.7392150519782574e-05, 'samples': 25406400, 'steps': 132324, 'loss/train': 1.534916639328003} 11/07/2021 15:50:29 - INFO - __main__ - Step 132326: {'lr': 1.73902058302009e-05, 'samples': 25406592, 'steps': 132325, 'loss/train': 1.3558002710342407} 11/07/2021 15:50:30 - INFO - __main__ - Step 132327: {'lr': 1.738826124542961e-05, 'samples': 25406784, 'steps': 132326, 'loss/train': 1.7352032661437988} 11/07/2021 15:50:30 - INFO - __main__ - Step 132328: {'lr': 1.7386316765469645e-05, 'samples': 25406976, 'steps': 132327, 'loss/train': 1.2214993238449097} 11/07/2021 15:50:31 - INFO - __main__ - Step 132329: {'lr': 1.7384372390321756e-05, 'samples': 25407168, 'steps': 132328, 'loss/train': 1.191636562347412} 11/07/2021 15:50:31 - INFO - __main__ - Step 132330: {'lr': 1.7382428119986887e-05, 'samples': 25407360, 'steps': 132329, 'loss/train': 1.228333592414856} 11/07/2021 15:50:32 - INFO - __main__ - Step 132331: {'lr': 1.7380483954465898e-05, 'samples': 25407552, 'steps': 132330, 'loss/train': 1.350742220878601} 11/07/2021 15:50:32 - INFO - __main__ - Step 132332: {'lr': 1.7378539893759675e-05, 'samples': 25407744, 'steps': 132331, 'loss/train': 1.2416802644729614} 11/07/2021 15:50:33 - INFO - __main__ - Step 132333: {'lr': 1.7376595937869084e-05, 'samples': 25407936, 'steps': 132332, 'loss/train': 1.2844946384429932} 11/07/2021 15:50:33 - INFO - __main__ - Step 132334: {'lr': 1.7374652086795033e-05, 'samples': 25408128, 'steps': 132333, 'loss/train': 1.0949208736419678} 11/07/2021 15:50:33 - INFO - __main__ - Step 132335: {'lr': 1.7372708340538364e-05, 'samples': 25408320, 'steps': 132334, 'loss/train': 1.561747431755066} 11/07/2021 15:50:34 - INFO - __main__ - Step 132336: {'lr': 1.7370764699099956e-05, 'samples': 25408512, 'steps': 132335, 'loss/train': 1.264702558517456} 11/07/2021 15:50:35 - INFO - __main__ - Step 132337: {'lr': 1.7368821162480702e-05, 'samples': 25408704, 'steps': 132336, 'loss/train': 1.361401915550232} 11/07/2021 15:50:35 - INFO - __main__ - Step 132338: {'lr': 1.7366877730681464e-05, 'samples': 25408896, 'steps': 132337, 'loss/train': 1.6628044843673706} 11/07/2021 15:50:35 - INFO - __main__ - Step 132339: {'lr': 1.7364934403703126e-05, 'samples': 25409088, 'steps': 132338, 'loss/train': 1.1586580276489258} 11/07/2021 15:50:36 - INFO - __main__ - Step 132340: {'lr': 1.736299118154655e-05, 'samples': 25409280, 'steps': 132339, 'loss/train': 1.7195816040039062} 11/07/2021 15:50:37 - INFO - __main__ - Step 132341: {'lr': 1.7361048064212626e-05, 'samples': 25409472, 'steps': 132340, 'loss/train': 1.5680851936340332} 11/07/2021 15:50:37 - INFO - __main__ - Step 132342: {'lr': 1.735910505170224e-05, 'samples': 25409664, 'steps': 132341, 'loss/train': 1.2448917627334595} 11/07/2021 15:50:38 - INFO - __main__ - Step 132343: {'lr': 1.735716214401625e-05, 'samples': 25409856, 'steps': 132342, 'loss/train': 1.775418996810913} 11/07/2021 15:50:38 - INFO - __main__ - Step 132344: {'lr': 1.7355219341155498e-05, 'samples': 25410048, 'steps': 132343, 'loss/train': 1.2190968990325928} 11/07/2021 15:50:38 - INFO - __main__ - Step 132345: {'lr': 1.735327664312092e-05, 'samples': 25410240, 'steps': 132344, 'loss/train': 1.8020353317260742} 11/07/2021 15:50:39 - INFO - __main__ - Step 132346: {'lr': 1.735133404991335e-05, 'samples': 25410432, 'steps': 132345, 'loss/train': 1.0585891008377075} 11/07/2021 15:50:40 - INFO - __main__ - Step 132347: {'lr': 1.73493915615337e-05, 'samples': 25410624, 'steps': 132346, 'loss/train': 1.6492034196853638} 11/07/2021 15:50:40 - INFO - __main__ - Step 132348: {'lr': 1.734744917798281e-05, 'samples': 25410816, 'steps': 132347, 'loss/train': 1.1536633968353271} 11/07/2021 15:50:40 - INFO - __main__ - Step 132349: {'lr': 1.7345506899261566e-05, 'samples': 25411008, 'steps': 132348, 'loss/train': 1.2660648822784424} 11/07/2021 15:50:41 - INFO - __main__ - Step 132350: {'lr': 1.7343564725370853e-05, 'samples': 25411200, 'steps': 132349, 'loss/train': 1.3388211727142334} 11/07/2021 15:50:42 - INFO - __main__ - Step 132351: {'lr': 1.7341622656311533e-05, 'samples': 25411392, 'steps': 132350, 'loss/train': 1.459006667137146} 11/07/2021 15:50:42 - INFO - __main__ - Step 132352: {'lr': 1.733968069208447e-05, 'samples': 25411584, 'steps': 132351, 'loss/train': 1.4061905145645142} 11/07/2021 15:50:42 - INFO - __main__ - Step 132353: {'lr': 1.73377388326906e-05, 'samples': 25411776, 'steps': 132352, 'loss/train': 1.4053831100463867} 11/07/2021 15:50:43 - INFO - __main__ - Step 132354: {'lr': 1.733579707813071e-05, 'samples': 25411968, 'steps': 132353, 'loss/train': 0.8007068634033203} 11/07/2021 15:50:43 - INFO - __main__ - Step 132355: {'lr': 1.7333855428405792e-05, 'samples': 25412160, 'steps': 132354, 'loss/train': 1.29423189163208} 11/07/2021 15:50:44 - INFO - __main__ - Step 132356: {'lr': 1.73319138835166e-05, 'samples': 25412352, 'steps': 132355, 'loss/train': 1.6419200897216797} 11/07/2021 15:50:44 - INFO - __main__ - Step 132357: {'lr': 1.732997244346407e-05, 'samples': 25412544, 'steps': 132356, 'loss/train': 0.5909327268600464} 11/07/2021 15:50:45 - INFO - __main__ - Step 132358: {'lr': 1.7328031108249044e-05, 'samples': 25412736, 'steps': 132357, 'loss/train': 1.3498769998550415} 11/07/2021 15:50:45 - INFO - __main__ - Step 132359: {'lr': 1.7326089877872404e-05, 'samples': 25412928, 'steps': 132358, 'loss/train': 1.181363821029663} 11/07/2021 15:50:46 - INFO - __main__ - Step 132360: {'lr': 1.7324148752335067e-05, 'samples': 25413120, 'steps': 132359, 'loss/train': 1.8180720806121826} 11/07/2021 15:50:46 - INFO - __main__ - Step 132361: {'lr': 1.732220773163787e-05, 'samples': 25413312, 'steps': 132360, 'loss/train': 1.588916301727295} 11/07/2021 15:50:47 - INFO - __main__ - Step 132362: {'lr': 1.7320266815781695e-05, 'samples': 25413504, 'steps': 132361, 'loss/train': 1.3870824575424194} 11/07/2021 15:50:47 - INFO - __main__ - Step 132363: {'lr': 1.7318326004767404e-05, 'samples': 25413696, 'steps': 132362, 'loss/train': 0.8337051272392273} 11/07/2021 15:50:48 - INFO - __main__ - Step 132364: {'lr': 1.7316385298595917e-05, 'samples': 25413888, 'steps': 132363, 'loss/train': 1.4290655851364136} 11/07/2021 15:50:48 - INFO - __main__ - Step 132365: {'lr': 1.7314444697268034e-05, 'samples': 25414080, 'steps': 132364, 'loss/train': 1.191589593887329} 11/07/2021 15:50:49 - INFO - __main__ - Step 132366: {'lr': 1.73125042007847e-05, 'samples': 25414272, 'steps': 132365, 'loss/train': 1.4408233165740967} 11/07/2021 15:50:49 - INFO - __main__ - Step 132367: {'lr': 1.731056380914675e-05, 'samples': 25414464, 'steps': 132366, 'loss/train': 1.5355918407440186} 11/07/2021 15:50:50 - INFO - __main__ - Step 132368: {'lr': 1.7308623522355073e-05, 'samples': 25414656, 'steps': 132367, 'loss/train': 1.485575795173645} 11/07/2021 15:50:50 - INFO - __main__ - Step 132369: {'lr': 1.730668334041058e-05, 'samples': 25414848, 'steps': 132368, 'loss/train': 1.3868317604064941} 11/07/2021 15:50:50 - INFO - __main__ - Step 132370: {'lr': 1.7304743263314078e-05, 'samples': 25415040, 'steps': 132369, 'loss/train': 1.4148668050765991} 11/07/2021 15:50:51 - INFO - __main__ - Step 132371: {'lr': 1.7302803291066455e-05, 'samples': 25415232, 'steps': 132370, 'loss/train': 1.3884979486465454} 11/07/2021 15:50:52 - INFO - __main__ - Step 132372: {'lr': 1.7300863423668602e-05, 'samples': 25415424, 'steps': 132371, 'loss/train': 1.4062527418136597} 11/07/2021 15:50:52 - INFO - __main__ - Step 132373: {'lr': 1.7298923661121373e-05, 'samples': 25415616, 'steps': 132372, 'loss/train': 0.677778422832489} 11/07/2021 15:50:53 - INFO - __main__ - Step 132374: {'lr': 1.7296984003425666e-05, 'samples': 25415808, 'steps': 132373, 'loss/train': 1.1322286128997803} 11/07/2021 15:50:53 - INFO - __main__ - Step 132375: {'lr': 1.7295044450582358e-05, 'samples': 25416000, 'steps': 132374, 'loss/train': 1.4253170490264893} 11/07/2021 15:50:53 - INFO - __main__ - Step 132376: {'lr': 1.729310500259229e-05, 'samples': 25416192, 'steps': 132375, 'loss/train': 1.2980149984359741} 11/07/2021 15:50:54 - INFO - __main__ - Step 132377: {'lr': 1.7291165659456375e-05, 'samples': 25416384, 'steps': 132376, 'loss/train': 1.3889118432998657} 11/07/2021 15:50:55 - INFO - __main__ - Step 132378: {'lr': 1.7289226421175476e-05, 'samples': 25416576, 'steps': 132377, 'loss/train': 1.2967215776443481} 11/07/2021 15:50:55 - INFO - __main__ - Step 132379: {'lr': 1.7287287287750446e-05, 'samples': 25416768, 'steps': 132378, 'loss/train': 1.1384150981903076} 11/07/2021 15:50:55 - INFO - __main__ - Step 132380: {'lr': 1.7285348259182182e-05, 'samples': 25416960, 'steps': 132379, 'loss/train': 1.1648885011672974} 11/07/2021 15:50:56 - INFO - __main__ - Step 132381: {'lr': 1.728340933547157e-05, 'samples': 25417152, 'steps': 132380, 'loss/train': 1.76192307472229} 11/07/2021 15:50:57 - INFO - __main__ - Step 132382: {'lr': 1.7281470516619464e-05, 'samples': 25417344, 'steps': 132381, 'loss/train': 1.4141011238098145} 11/07/2021 15:50:57 - INFO - __main__ - Step 132383: {'lr': 1.7279531802626704e-05, 'samples': 25417536, 'steps': 132382, 'loss/train': 1.3963204622268677} 11/07/2021 15:50:58 - INFO - __main__ - Step 132384: {'lr': 1.7277593193494227e-05, 'samples': 25417728, 'steps': 132383, 'loss/train': 5.64238977432251} 11/07/2021 15:50:58 - INFO - __main__ - Step 132385: {'lr': 1.7275654689222847e-05, 'samples': 25417920, 'steps': 132384, 'loss/train': 1.276476263999939} 11/07/2021 15:50:58 - INFO - __main__ - Step 132386: {'lr': 1.7273716289813472e-05, 'samples': 25418112, 'steps': 132385, 'loss/train': 1.2356442213058472} 11/07/2021 15:50:59 - INFO - __main__ - Step 132387: {'lr': 1.727177799526697e-05, 'samples': 25418304, 'steps': 132386, 'loss/train': 1.2193772792816162} 11/07/2021 15:51:00 - INFO - __main__ - Step 132388: {'lr': 1.726983980558419e-05, 'samples': 25418496, 'steps': 132387, 'loss/train': 1.3548669815063477} 11/07/2021 15:51:00 - INFO - __main__ - Step 132389: {'lr': 1.726790172076606e-05, 'samples': 25418688, 'steps': 132388, 'loss/train': 1.1292672157287598} 11/07/2021 15:51:00 - INFO - __main__ - Step 132390: {'lr': 1.7265963740813405e-05, 'samples': 25418880, 'steps': 132389, 'loss/train': 0.912097692489624} 11/07/2021 15:51:01 - INFO - __main__ - Step 132391: {'lr': 1.7264025865727145e-05, 'samples': 25419072, 'steps': 132390, 'loss/train': 1.5484659671783447} 11/07/2021 15:51:01 - INFO - __main__ - Step 132392: {'lr': 1.7262088095508083e-05, 'samples': 25419264, 'steps': 132391, 'loss/train': 1.1178104877471924} 11/07/2021 15:51:02 - INFO - __main__ - Step 132393: {'lr': 1.7260150430157162e-05, 'samples': 25419456, 'steps': 132392, 'loss/train': 1.3218291997909546} 11/07/2021 15:51:03 - INFO - __main__ - Step 132394: {'lr': 1.7258212869675215e-05, 'samples': 25419648, 'steps': 132393, 'loss/train': 2.240424156188965} 11/07/2021 15:51:03 - INFO - __main__ - Step 132395: {'lr': 1.7256275414063133e-05, 'samples': 25419840, 'steps': 132394, 'loss/train': 1.104554295539856} 11/07/2021 15:51:03 - INFO - __main__ - Step 132396: {'lr': 1.7254338063321827e-05, 'samples': 25420032, 'steps': 132395, 'loss/train': 1.240342140197754} 11/07/2021 15:51:04 - INFO - __main__ - Step 132397: {'lr': 1.725240081745205e-05, 'samples': 25420224, 'steps': 132396, 'loss/train': 1.6087788343429565} 11/07/2021 15:51:05 - INFO - __main__ - Step 132398: {'lr': 1.7250463676454775e-05, 'samples': 25420416, 'steps': 132397, 'loss/train': 1.25370454788208} 11/07/2021 15:51:05 - INFO - __main__ - Step 132399: {'lr': 1.7248526640330857e-05, 'samples': 25420608, 'steps': 132398, 'loss/train': 1.190384030342102} 11/07/2021 15:51:05 - INFO - __main__ - Step 132400: {'lr': 1.7246589709081162e-05, 'samples': 25420800, 'steps': 132399, 'loss/train': 0.6907903552055359} 11/07/2021 15:51:06 - INFO - __main__ - Step 132401: {'lr': 1.7244652882706546e-05, 'samples': 25420992, 'steps': 132400, 'loss/train': 1.3200476169586182} 11/07/2021 15:51:06 - INFO - __main__ - Step 132402: {'lr': 1.72427161612079e-05, 'samples': 25421184, 'steps': 132401, 'loss/train': 1.242648720741272} 11/07/2021 15:51:07 - INFO - __main__ - Step 132403: {'lr': 1.724077954458611e-05, 'samples': 25421376, 'steps': 132402, 'loss/train': 1.111441969871521} 11/07/2021 15:51:08 - INFO - __main__ - Step 132404: {'lr': 1.723884303284201e-05, 'samples': 25421568, 'steps': 132403, 'loss/train': 1.0919361114501953} 11/07/2021 15:51:08 - INFO - __main__ - Step 132405: {'lr': 1.723690662597652e-05, 'samples': 25421760, 'steps': 132404, 'loss/train': 1.5135573148727417} 11/07/2021 15:51:08 - INFO - __main__ - Step 132406: {'lr': 1.7234970323990463e-05, 'samples': 25421952, 'steps': 132405, 'loss/train': 1.1827752590179443} 11/07/2021 15:51:09 - INFO - __main__ - Step 132407: {'lr': 1.7233034126884762e-05, 'samples': 25422144, 'steps': 132406, 'loss/train': 1.2763233184814453} 11/07/2021 15:51:09 - INFO - __main__ - Step 132408: {'lr': 1.723109803466025e-05, 'samples': 25422336, 'steps': 132407, 'loss/train': 0.948814332485199} 11/07/2021 15:51:10 - INFO - __main__ - Step 132409: {'lr': 1.7229162047317836e-05, 'samples': 25422528, 'steps': 132408, 'loss/train': 1.8025131225585938} 11/07/2021 15:51:11 - INFO - __main__ - Step 132410: {'lr': 1.7227226164858362e-05, 'samples': 25422720, 'steps': 132409, 'loss/train': 0.5806668996810913} 11/07/2021 15:51:11 - INFO - __main__ - Step 132411: {'lr': 1.7225290387282683e-05, 'samples': 25422912, 'steps': 132410, 'loss/train': 0.9432635307312012} 11/07/2021 15:51:11 - INFO - __main__ - Step 132412: {'lr': 1.7223354714591715e-05, 'samples': 25423104, 'steps': 132411, 'loss/train': 1.2781789302825928} 11/07/2021 15:51:12 - INFO - __main__ - Step 132413: {'lr': 1.7221419146786293e-05, 'samples': 25423296, 'steps': 132412, 'loss/train': 1.301999807357788} 11/07/2021 15:51:13 - INFO - __main__ - Step 132414: {'lr': 1.721948368386733e-05, 'samples': 25423488, 'steps': 132413, 'loss/train': 1.8287498950958252} 11/07/2021 15:51:13 - INFO - __main__ - Step 132415: {'lr': 1.7217548325835662e-05, 'samples': 25423680, 'steps': 132414, 'loss/train': 1.6234040260314941} 11/07/2021 15:51:13 - INFO - __main__ - Step 132416: {'lr': 1.721561307269218e-05, 'samples': 25423872, 'steps': 132415, 'loss/train': 1.5088881254196167} 11/07/2021 15:51:14 - INFO - __main__ - Step 132417: {'lr': 1.7213677924437733e-05, 'samples': 25424064, 'steps': 132416, 'loss/train': 1.2188687324523926} 11/07/2021 15:51:14 - INFO - __main__ - Step 132418: {'lr': 1.7211742881073245e-05, 'samples': 25424256, 'steps': 132417, 'loss/train': 0.9430855512619019} 11/07/2021 15:51:15 - INFO - __main__ - Step 132419: {'lr': 1.720980794259952e-05, 'samples': 25424448, 'steps': 132418, 'loss/train': 1.521698236465454} 11/07/2021 15:51:15 - INFO - __main__ - Step 132420: {'lr': 1.720787310901753e-05, 'samples': 25424640, 'steps': 132419, 'loss/train': 1.4140019416809082} 11/07/2021 15:51:16 - INFO - __main__ - Step 132421: {'lr': 1.7205938380328023e-05, 'samples': 25424832, 'steps': 132420, 'loss/train': 1.6063854694366455} 11/07/2021 15:51:16 - INFO - __main__ - Step 132422: {'lr': 1.7204003756531917e-05, 'samples': 25425024, 'steps': 132421, 'loss/train': 1.4172708988189697} 11/07/2021 15:51:16 - INFO - __main__ - Step 132423: {'lr': 1.7202069237630124e-05, 'samples': 25425216, 'steps': 132422, 'loss/train': 1.6785211563110352} 11/07/2021 15:51:18 - INFO - __main__ - Step 132424: {'lr': 1.7200134823623455e-05, 'samples': 25425408, 'steps': 132423, 'loss/train': 1.1673544645309448} 11/07/2021 15:51:18 - INFO - __main__ - Step 132425: {'lr': 1.7198200514512848e-05, 'samples': 25425600, 'steps': 132424, 'loss/train': 1.4664907455444336} 11/07/2021 15:51:18 - INFO - __main__ - Step 132426: {'lr': 1.719626631029911e-05, 'samples': 25425792, 'steps': 132425, 'loss/train': 0.9333040118217468} 11/07/2021 15:51:19 - INFO - __main__ - Step 132427: {'lr': 1.7194332210983154e-05, 'samples': 25425984, 'steps': 132426, 'loss/train': 1.521268606185913} 11/07/2021 15:51:19 - INFO - __main__ - Step 132428: {'lr': 1.7192398216565846e-05, 'samples': 25426176, 'steps': 132427, 'loss/train': 1.027456283569336} 11/07/2021 15:51:19 - INFO - __main__ - Step 132429: {'lr': 1.7190464327048043e-05, 'samples': 25426368, 'steps': 132428, 'loss/train': 1.188511848449707} 11/07/2021 15:51:20 - INFO - __main__ - Step 132430: {'lr': 1.7188530542430608e-05, 'samples': 25426560, 'steps': 132429, 'loss/train': 0.45589199662208557} 11/07/2021 15:51:21 - INFO - __main__ - Step 132431: {'lr': 1.7186596862714483e-05, 'samples': 25426752, 'steps': 132430, 'loss/train': 1.2319432497024536} 11/07/2021 15:51:21 - INFO - __main__ - Step 132432: {'lr': 1.7184663287900472e-05, 'samples': 25426944, 'steps': 132431, 'loss/train': 1.4118893146514893} 11/07/2021 15:51:21 - INFO - __main__ - Step 132433: {'lr': 1.7182729817989434e-05, 'samples': 25427136, 'steps': 132432, 'loss/train': 1.248238444328308} 11/07/2021 15:51:22 - INFO - __main__ - Step 132434: {'lr': 1.7180796452982262e-05, 'samples': 25427328, 'steps': 132433, 'loss/train': 1.3155629634857178} 11/07/2021 15:51:23 - INFO - __main__ - Step 132435: {'lr': 1.717886319287984e-05, 'samples': 25427520, 'steps': 132434, 'loss/train': 1.1604535579681396} 11/07/2021 15:51:23 - INFO - __main__ - Step 132436: {'lr': 1.7176930037683002e-05, 'samples': 25427712, 'steps': 132435, 'loss/train': 1.0276020765304565} 11/07/2021 15:51:24 - INFO - __main__ - Step 132437: {'lr': 1.7174996987392666e-05, 'samples': 25427904, 'steps': 132436, 'loss/train': 1.072784423828125} 11/07/2021 15:51:24 - INFO - __main__ - Step 132438: {'lr': 1.7173064042009688e-05, 'samples': 25428096, 'steps': 132437, 'loss/train': 1.0982489585876465} 11/07/2021 15:51:24 - INFO - __main__ - Step 132439: {'lr': 1.7171131201534932e-05, 'samples': 25428288, 'steps': 132438, 'loss/train': 1.8714466094970703} 11/07/2021 15:51:25 - INFO - __main__ - Step 132440: {'lr': 1.7169198465969287e-05, 'samples': 25428480, 'steps': 132439, 'loss/train': 1.2928400039672852} 11/07/2021 15:51:26 - INFO - __main__ - Step 132441: {'lr': 1.7167265835313584e-05, 'samples': 25428672, 'steps': 132440, 'loss/train': 1.5911502838134766} 11/07/2021 15:51:26 - INFO - __main__ - Step 132442: {'lr': 1.7165333309568763e-05, 'samples': 25428864, 'steps': 132441, 'loss/train': 1.708234429359436} 11/07/2021 15:51:26 - INFO - __main__ - Step 132443: {'lr': 1.7163400888735638e-05, 'samples': 25429056, 'steps': 132442, 'loss/train': 1.3894445896148682} 11/07/2021 15:51:27 - INFO - __main__ - Step 132444: {'lr': 1.7161468572815057e-05, 'samples': 25429248, 'steps': 132443, 'loss/train': 1.0145395994186401} 11/07/2021 15:51:28 - INFO - __main__ - Step 132445: {'lr': 1.7159536361807947e-05, 'samples': 25429440, 'steps': 132444, 'loss/train': 0.5008413195610046} 11/07/2021 15:51:28 - INFO - __main__ - Step 132446: {'lr': 1.7157604255715138e-05, 'samples': 25429632, 'steps': 132445, 'loss/train': 2.0948498249053955} 11/07/2021 15:51:29 - INFO - __main__ - Step 132447: {'lr': 1.7155672254537513e-05, 'samples': 25429824, 'steps': 132446, 'loss/train': 1.1926637887954712} 11/07/2021 15:51:29 - INFO - __main__ - Step 132448: {'lr': 1.7153740358275964e-05, 'samples': 25430016, 'steps': 132447, 'loss/train': 1.3212827444076538} 11/07/2021 15:51:29 - INFO - __main__ - Step 132449: {'lr': 1.7151808566931355e-05, 'samples': 25430208, 'steps': 132448, 'loss/train': 0.6023219227790833} 11/07/2021 15:51:30 - INFO - __main__ - Step 132450: {'lr': 1.714987688050454e-05, 'samples': 25430400, 'steps': 132449, 'loss/train': 1.41428542137146} 11/07/2021 15:51:31 - INFO - __main__ - Step 132451: {'lr': 1.714794529899641e-05, 'samples': 25430592, 'steps': 132450, 'loss/train': 1.4727704524993896} 11/07/2021 15:51:31 - INFO - __main__ - Step 132452: {'lr': 1.7146013822407796e-05, 'samples': 25430784, 'steps': 132451, 'loss/train': 1.0070418119430542} 11/07/2021 15:51:31 - INFO - __main__ - Step 132453: {'lr': 1.7144082450739647e-05, 'samples': 25430976, 'steps': 132452, 'loss/train': 0.8086435198783875} 11/07/2021 15:51:32 - INFO - __main__ - Step 132454: {'lr': 1.714215118399276e-05, 'samples': 25431168, 'steps': 132453, 'loss/train': 0.5785719752311707} 11/07/2021 15:51:33 - INFO - __main__ - Step 132455: {'lr': 1.7140220022168e-05, 'samples': 25431360, 'steps': 132454, 'loss/train': 1.5858910083770752} 11/07/2021 15:51:33 - INFO - __main__ - Step 132456: {'lr': 1.7138288965266284e-05, 'samples': 25431552, 'steps': 132455, 'loss/train': 1.503206491470337} 11/07/2021 15:51:33 - INFO - __main__ - Step 132457: {'lr': 1.7136358013288445e-05, 'samples': 25431744, 'steps': 132456, 'loss/train': 1.270784854888916} 11/07/2021 15:51:34 - INFO - __main__ - Step 132458: {'lr': 1.7134427166235366e-05, 'samples': 25431936, 'steps': 132457, 'loss/train': 0.8257114887237549} 11/07/2021 15:51:34 - INFO - __main__ - Step 132459: {'lr': 1.713249642410794e-05, 'samples': 25432128, 'steps': 132458, 'loss/train': 1.4619054794311523} 11/07/2021 15:51:35 - INFO - __main__ - Step 132460: {'lr': 1.7130565786906997e-05, 'samples': 25432320, 'steps': 132459, 'loss/train': 0.4314478635787964} 11/07/2021 15:51:36 - INFO - __main__ - Step 132461: {'lr': 1.7128635254633455e-05, 'samples': 25432512, 'steps': 132460, 'loss/train': 1.449709415435791} 11/07/2021 15:51:36 - INFO - __main__ - Step 132462: {'lr': 1.712670482728815e-05, 'samples': 25432704, 'steps': 132461, 'loss/train': 1.1789735555648804} 11/07/2021 15:51:36 - INFO - __main__ - Step 132463: {'lr': 1.7124774504871933e-05, 'samples': 25432896, 'steps': 132462, 'loss/train': 1.4782273769378662} 11/07/2021 15:51:37 - INFO - __main__ - Step 132464: {'lr': 1.712284428738575e-05, 'samples': 25433088, 'steps': 132463, 'loss/train': 1.4083729982376099} 11/07/2021 15:51:38 - INFO - __main__ - Step 132465: {'lr': 1.7120914174830388e-05, 'samples': 25433280, 'steps': 132464, 'loss/train': 1.2409688234329224} 11/07/2021 15:51:38 - INFO - __main__ - Step 132466: {'lr': 1.7118984167206753e-05, 'samples': 25433472, 'steps': 132465, 'loss/train': 1.1347143650054932} 11/07/2021 15:51:38 - INFO - __main__ - Step 132467: {'lr': 1.7117054264515708e-05, 'samples': 25433664, 'steps': 132466, 'loss/train': 1.183364987373352} 11/07/2021 15:51:39 - INFO - __main__ - Step 132468: {'lr': 1.7115124466758113e-05, 'samples': 25433856, 'steps': 132467, 'loss/train': 1.3063973188400269} 11/07/2021 15:51:39 - INFO - __main__ - Step 132469: {'lr': 1.711319477393486e-05, 'samples': 25434048, 'steps': 132468, 'loss/train': 1.3986467123031616} 11/07/2021 15:51:39 - INFO - __main__ - Step 132470: {'lr': 1.7111265186046803e-05, 'samples': 25434240, 'steps': 132469, 'loss/train': 1.4073467254638672} 11/07/2021 15:51:40 - INFO - __main__ - Step 132471: {'lr': 1.7109335703094807e-05, 'samples': 25434432, 'steps': 132470, 'loss/train': 1.0673294067382812} 11/07/2021 15:51:41 - INFO - __main__ - Step 132472: {'lr': 1.710740632507976e-05, 'samples': 25434624, 'steps': 132471, 'loss/train': 0.6563354730606079} 11/07/2021 15:51:41 - INFO - __main__ - Step 132473: {'lr': 1.7105477052002522e-05, 'samples': 25434816, 'steps': 132472, 'loss/train': 1.5786261558532715} 11/07/2021 15:51:42 - INFO - __main__ - Step 132474: {'lr': 1.7103547883863978e-05, 'samples': 25435008, 'steps': 132473, 'loss/train': 0.950226902961731} 11/07/2021 15:51:42 - INFO - __main__ - Step 132475: {'lr': 1.7101618820664966e-05, 'samples': 25435200, 'steps': 132474, 'loss/train': 1.1973249912261963} 11/07/2021 15:51:43 - INFO - __main__ - Step 132476: {'lr': 1.7099689862406397e-05, 'samples': 25435392, 'steps': 132475, 'loss/train': 1.2249821424484253} 11/07/2021 15:51:43 - INFO - __main__ - Step 132477: {'lr': 1.709776100908908e-05, 'samples': 25435584, 'steps': 132476, 'loss/train': 0.9697874784469604} 11/07/2021 15:51:44 - INFO - __main__ - Step 132478: {'lr': 1.709583226071393e-05, 'samples': 25435776, 'steps': 132477, 'loss/train': 1.504903793334961} 11/07/2021 15:51:44 - INFO - __main__ - Step 132479: {'lr': 1.7093903617281803e-05, 'samples': 25435968, 'steps': 132478, 'loss/train': 1.5013822317123413} 11/07/2021 15:51:44 - INFO - __main__ - Step 132480: {'lr': 1.7091975078793566e-05, 'samples': 25436160, 'steps': 132479, 'loss/train': 1.3826817274093628} 11/07/2021 15:51:46 - INFO - __main__ - Step 132481: {'lr': 1.7090046645250102e-05, 'samples': 25436352, 'steps': 132480, 'loss/train': 1.1675742864608765} 11/07/2021 15:51:46 - INFO - __main__ - Step 132482: {'lr': 1.7088118316652245e-05, 'samples': 25436544, 'steps': 132481, 'loss/train': 1.0631086826324463} 11/07/2021 15:51:46 - INFO - __main__ - Step 132483: {'lr': 1.7086190093000913e-05, 'samples': 25436736, 'steps': 132482, 'loss/train': 0.8094980120658875} 11/07/2021 15:51:47 - INFO - __main__ - Step 132484: {'lr': 1.708426197429694e-05, 'samples': 25436928, 'steps': 132483, 'loss/train': 1.2788478136062622} 11/07/2021 15:51:47 - INFO - __main__ - Step 132485: {'lr': 1.7082333960541208e-05, 'samples': 25437120, 'steps': 132484, 'loss/train': 1.4623210430145264} 11/07/2021 15:51:48 - INFO - __main__ - Step 132486: {'lr': 1.7080406051734553e-05, 'samples': 25437312, 'steps': 132485, 'loss/train': 1.453451156616211} 11/07/2021 15:51:48 - INFO - __main__ - Step 132487: {'lr': 1.7078478247877892e-05, 'samples': 25437504, 'steps': 132486, 'loss/train': 1.7781095504760742} 11/07/2021 15:51:49 - INFO - __main__ - Step 132488: {'lr': 1.7076550548972087e-05, 'samples': 25437696, 'steps': 132487, 'loss/train': 1.3475122451782227} 11/07/2021 15:51:49 - INFO - __main__ - Step 132489: {'lr': 1.7074622955017994e-05, 'samples': 25437888, 'steps': 132488, 'loss/train': 0.830274224281311} 11/07/2021 15:51:49 - INFO - __main__ - Step 132490: {'lr': 1.7072695466016504e-05, 'samples': 25438080, 'steps': 132489, 'loss/train': 1.1788287162780762} 11/07/2021 15:51:50 - INFO - __main__ - Step 132491: {'lr': 1.7070768081968447e-05, 'samples': 25438272, 'steps': 132490, 'loss/train': 0.5567868947982788} 11/07/2021 15:51:51 - INFO - __main__ - Step 132492: {'lr': 1.7068840802874685e-05, 'samples': 25438464, 'steps': 132491, 'loss/train': 1.4418202638626099} 11/07/2021 15:51:51 - INFO - __main__ - Step 132493: {'lr': 1.7066913628736107e-05, 'samples': 25438656, 'steps': 132492, 'loss/train': 1.8607913255691528} 11/07/2021 15:51:51 - INFO - __main__ - Step 132494: {'lr': 1.7064986559553602e-05, 'samples': 25438848, 'steps': 132493, 'loss/train': 1.1088980436325073} 11/07/2021 15:51:52 - INFO - __main__ - Step 132495: {'lr': 1.7063059595328e-05, 'samples': 25439040, 'steps': 132494, 'loss/train': 1.6765557527542114} 11/07/2021 15:51:53 - INFO - __main__ - Step 132496: {'lr': 1.706113273606022e-05, 'samples': 25439232, 'steps': 132495, 'loss/train': 0.8363342881202698} 11/07/2021 15:51:53 - INFO - __main__ - Step 132497: {'lr': 1.7059205981751062e-05, 'samples': 25439424, 'steps': 132496, 'loss/train': 1.528095006942749} 11/07/2021 15:51:54 - INFO - __main__ - Step 132498: {'lr': 1.7057279332401447e-05, 'samples': 25439616, 'steps': 132497, 'loss/train': 0.8846888542175293} 11/07/2021 15:51:54 - INFO - __main__ - Step 132499: {'lr': 1.7055352788012235e-05, 'samples': 25439808, 'steps': 132498, 'loss/train': 1.3571950197219849} 11/07/2021 15:51:54 - INFO - __main__ - Step 132500: {'lr': 1.7053426348584283e-05, 'samples': 25440000, 'steps': 132499, 'loss/train': 1.1135696172714233} 11/07/2021 15:51:55 - INFO - __main__ - Step 132501: {'lr': 1.7051500014118455e-05, 'samples': 25440192, 'steps': 132500, 'loss/train': 1.1561264991760254} 11/07/2021 15:51:56 - INFO - __main__ - Step 132502: {'lr': 1.7049573784615635e-05, 'samples': 25440384, 'steps': 132501, 'loss/train': 0.8630293011665344} 11/07/2021 15:51:56 - INFO - __main__ - Step 132503: {'lr': 1.7047647660076714e-05, 'samples': 25440576, 'steps': 132502, 'loss/train': 1.5509332418441772} 11/07/2021 15:51:56 - INFO - __main__ - Step 132504: {'lr': 1.70457216405025e-05, 'samples': 25440768, 'steps': 132503, 'loss/train': 1.292166829109192} 11/07/2021 15:51:57 - INFO - __main__ - Step 132505: {'lr': 1.7043795725893874e-05, 'samples': 25440960, 'steps': 132504, 'loss/train': 1.3573397397994995} 11/07/2021 15:51:58 - INFO - __main__ - Step 132506: {'lr': 1.704186991625173e-05, 'samples': 25441152, 'steps': 132505, 'loss/train': 1.696309208869934} 11/07/2021 15:51:58 - INFO - __main__ - Step 132507: {'lr': 1.7039944211576924e-05, 'samples': 25441344, 'steps': 132506, 'loss/train': 1.3162509202957153} 11/07/2021 15:51:58 - INFO - __main__ - Step 132508: {'lr': 1.703801861187032e-05, 'samples': 25441536, 'steps': 132507, 'loss/train': 1.5374226570129395} 11/07/2021 15:51:59 - INFO - __main__ - Step 132509: {'lr': 1.703609311713278e-05, 'samples': 25441728, 'steps': 132508, 'loss/train': 1.2524123191833496} 11/07/2021 15:51:59 - INFO - __main__ - Step 132510: {'lr': 1.7034167727365213e-05, 'samples': 25441920, 'steps': 132509, 'loss/train': 1.0840612649917603} 11/07/2021 15:52:00 - INFO - __main__ - Step 132511: {'lr': 1.703224244256843e-05, 'samples': 25442112, 'steps': 132510, 'loss/train': 1.2598143815994263} 11/07/2021 15:52:01 - INFO - __main__ - Step 132512: {'lr': 1.7030317262743317e-05, 'samples': 25442304, 'steps': 132511, 'loss/train': 1.33149254322052} 11/07/2021 15:52:01 - INFO - __main__ - Step 132513: {'lr': 1.7028392187890762e-05, 'samples': 25442496, 'steps': 132512, 'loss/train': 1.2809503078460693} 11/07/2021 15:52:01 - INFO - __main__ - Step 132514: {'lr': 1.7026467218011627e-05, 'samples': 25442688, 'steps': 132513, 'loss/train': 1.263272762298584} 11/07/2021 15:52:02 - INFO - __main__ - Step 132515: {'lr': 1.702454235310677e-05, 'samples': 25442880, 'steps': 132514, 'loss/train': 0.8567332625389099} 11/07/2021 15:52:03 - INFO - __main__ - Step 132516: {'lr': 1.7022617593177026e-05, 'samples': 25443072, 'steps': 132515, 'loss/train': 0.7950100302696228} 11/07/2021 15:52:03 - INFO - __main__ - Step 132517: {'lr': 1.7020692938223365e-05, 'samples': 25443264, 'steps': 132516, 'loss/train': 1.3753300905227661} 11/07/2021 15:52:04 - INFO - __main__ - Step 132518: {'lr': 1.7018768388246536e-05, 'samples': 25443456, 'steps': 132517, 'loss/train': 1.0644642114639282} 11/07/2021 15:52:04 - INFO - __main__ - Step 132519: {'lr': 1.7016843943247457e-05, 'samples': 25443648, 'steps': 132518, 'loss/train': 1.3175849914550781} 11/07/2021 15:52:04 - INFO - __main__ - Step 132520: {'lr': 1.7014919603227013e-05, 'samples': 25443840, 'steps': 132519, 'loss/train': 1.1932927370071411} 11/07/2021 15:52:05 - INFO - __main__ - Step 132521: {'lr': 1.7012995368186012e-05, 'samples': 25444032, 'steps': 132520, 'loss/train': 1.2052271366119385} 11/07/2021 15:52:06 - INFO - __main__ - Step 132522: {'lr': 1.7011071238125398e-05, 'samples': 25444224, 'steps': 132521, 'loss/train': 0.9454622864723206} 11/07/2021 15:52:06 - INFO - __main__ - Step 132523: {'lr': 1.7009147213045972e-05, 'samples': 25444416, 'steps': 132522, 'loss/train': 0.714533805847168} 11/07/2021 15:52:06 - INFO - __main__ - Step 132524: {'lr': 1.7007223292948654e-05, 'samples': 25444608, 'steps': 132523, 'loss/train': 0.8554298281669617} 11/07/2021 15:52:07 - INFO - __main__ - Step 132525: {'lr': 1.7005299477834245e-05, 'samples': 25444800, 'steps': 132524, 'loss/train': 1.2583963871002197} 11/07/2021 15:52:07 - INFO - __main__ - Step 132526: {'lr': 1.7003375767703693e-05, 'samples': 25444992, 'steps': 132525, 'loss/train': 0.7178595066070557} 11/07/2021 15:52:08 - INFO - __main__ - Step 132527: {'lr': 1.70014521625578e-05, 'samples': 25445184, 'steps': 132526, 'loss/train': 1.5064597129821777} 11/07/2021 15:52:09 - INFO - __main__ - Step 132528: {'lr': 1.6999528662397483e-05, 'samples': 25445376, 'steps': 132527, 'loss/train': 1.468516230583191} 11/07/2021 15:52:09 - INFO - __main__ - Step 132529: {'lr': 1.699760526722355e-05, 'samples': 25445568, 'steps': 132528, 'loss/train': 1.660072922706604} 11/07/2021 15:52:09 - INFO - __main__ - Step 132530: {'lr': 1.6995681977036965e-05, 'samples': 25445760, 'steps': 132529, 'loss/train': 1.292112946510315} 11/07/2021 15:52:10 - INFO - __main__ - Step 132531: {'lr': 1.6993758791838483e-05, 'samples': 25445952, 'steps': 132530, 'loss/train': 1.4985407590866089} 11/07/2021 15:52:11 - INFO - __main__ - Step 132532: {'lr': 1.6991835711629016e-05, 'samples': 25446144, 'steps': 132531, 'loss/train': 0.7052136063575745} 11/07/2021 15:52:11 - INFO - __main__ - Step 132533: {'lr': 1.698991273640943e-05, 'samples': 25446336, 'steps': 132532, 'loss/train': 1.2895127534866333} 11/07/2021 15:52:11 - INFO - __main__ - Step 132534: {'lr': 1.698798986618061e-05, 'samples': 25446528, 'steps': 132533, 'loss/train': 1.7238038778305054} 11/07/2021 15:52:12 - INFO - __main__ - Step 132535: {'lr': 1.6986067100943386e-05, 'samples': 25446720, 'steps': 132534, 'loss/train': 1.4161087274551392} 11/07/2021 15:52:12 - INFO - __main__ - Step 132536: {'lr': 1.698414444069865e-05, 'samples': 25446912, 'steps': 132535, 'loss/train': 1.3509268760681152} 11/07/2021 15:52:12 - INFO - __main__ - Step 132537: {'lr': 1.6982221885447263e-05, 'samples': 25447104, 'steps': 132536, 'loss/train': 1.2812137603759766} 11/07/2021 15:52:13 - INFO - __main__ - Step 132538: {'lr': 1.698029943519011e-05, 'samples': 25447296, 'steps': 132537, 'loss/train': 1.7391631603240967} 11/07/2021 15:52:14 - INFO - __main__ - Step 132539: {'lr': 1.6978377089928028e-05, 'samples': 25447488, 'steps': 132538, 'loss/train': 1.3179316520690918} 11/07/2021 15:52:14 - INFO - __main__ - Step 132540: {'lr': 1.697645484966187e-05, 'samples': 25447680, 'steps': 132539, 'loss/train': 1.1171648502349854} 11/07/2021 15:52:14 - INFO - __main__ - Step 132541: {'lr': 1.697453271439256e-05, 'samples': 25447872, 'steps': 132540, 'loss/train': 1.4181022644042969} 11/07/2021 15:52:15 - INFO - __main__ - Step 132542: {'lr': 1.69726106841209e-05, 'samples': 25448064, 'steps': 132541, 'loss/train': 1.226853370666504} 11/07/2021 15:52:16 - INFO - __main__ - Step 132543: {'lr': 1.697068875884783e-05, 'samples': 25448256, 'steps': 132542, 'loss/train': 0.5666098594665527} 11/07/2021 15:52:16 - INFO - __main__ - Step 132544: {'lr': 1.696876693857416e-05, 'samples': 25448448, 'steps': 132543, 'loss/train': 1.526261806488037} 11/07/2021 15:52:17 - INFO - __main__ - Step 132545: {'lr': 1.6966845223300746e-05, 'samples': 25448640, 'steps': 132544, 'loss/train': 1.2692846059799194} 11/07/2021 15:52:17 - INFO - __main__ - Step 132546: {'lr': 1.696492361302848e-05, 'samples': 25448832, 'steps': 132545, 'loss/train': 1.4006644487380981} 11/07/2021 15:52:17 - INFO - __main__ - Step 132547: {'lr': 1.6963002107758223e-05, 'samples': 25449024, 'steps': 132546, 'loss/train': 1.341161847114563} 11/07/2021 15:52:18 - INFO - __main__ - Step 132548: {'lr': 1.6961080707490833e-05, 'samples': 25449216, 'steps': 132547, 'loss/train': 1.2417129278182983} 11/07/2021 15:52:19 - INFO - __main__ - Step 132549: {'lr': 1.6959159412227198e-05, 'samples': 25449408, 'steps': 132548, 'loss/train': 1.212149977684021} 11/07/2021 15:52:19 - INFO - __main__ - Step 132550: {'lr': 1.6957238221968153e-05, 'samples': 25449600, 'steps': 132549, 'loss/train': 0.6014787554740906} 11/07/2021 15:52:19 - INFO - __main__ - Step 132551: {'lr': 1.695531713671458e-05, 'samples': 25449792, 'steps': 132550, 'loss/train': 1.1119205951690674} 11/07/2021 15:52:20 - INFO - __main__ - Step 132552: {'lr': 1.695339615646735e-05, 'samples': 25449984, 'steps': 132551, 'loss/train': 1.404288649559021} 11/07/2021 15:52:21 - INFO - __main__ - Step 132553: {'lr': 1.6951475281227343e-05, 'samples': 25450176, 'steps': 132552, 'loss/train': 1.3452167510986328} 11/07/2021 15:52:21 - INFO - __main__ - Step 132554: {'lr': 1.6949554510995392e-05, 'samples': 25450368, 'steps': 132553, 'loss/train': 1.3134422302246094} 11/07/2021 15:52:22 - INFO - __main__ - Step 132555: {'lr': 1.694763384577236e-05, 'samples': 25450560, 'steps': 132554, 'loss/train': 1.5072494745254517} 11/07/2021 15:52:22 - INFO - __main__ - Step 132556: {'lr': 1.6945713285559162e-05, 'samples': 25450752, 'steps': 132555, 'loss/train': 1.0353672504425049} 11/07/2021 15:52:23 - INFO - __main__ - Step 132557: {'lr': 1.694379283035663e-05, 'samples': 25450944, 'steps': 132556, 'loss/train': 0.6507968306541443} 11/07/2021 15:52:23 - INFO - __main__ - Step 132558: {'lr': 1.6941872480165625e-05, 'samples': 25451136, 'steps': 132557, 'loss/train': 1.5814077854156494} 11/07/2021 15:52:24 - INFO - __main__ - Step 132559: {'lr': 1.6939952234986983e-05, 'samples': 25451328, 'steps': 132558, 'loss/train': 1.3492647409439087} 11/07/2021 15:52:24 - INFO - __main__ - Step 132560: {'lr': 1.6938032094821615e-05, 'samples': 25451520, 'steps': 132559, 'loss/train': 1.0741748809814453} 11/07/2021 15:52:25 - INFO - __main__ - Step 132561: {'lr': 1.6936112059670383e-05, 'samples': 25451712, 'steps': 132560, 'loss/train': 1.2890764474868774} 11/07/2021 15:52:25 - INFO - __main__ - Step 132562: {'lr': 1.693419212953415e-05, 'samples': 25451904, 'steps': 132561, 'loss/train': 1.7396467924118042} 11/07/2021 15:52:25 - INFO - __main__ - Step 132563: {'lr': 1.693227230441374e-05, 'samples': 25452096, 'steps': 132562, 'loss/train': 1.46140718460083} 11/07/2021 15:52:27 - INFO - __main__ - Step 132564: {'lr': 1.693035258431008e-05, 'samples': 25452288, 'steps': 132563, 'loss/train': 1.1983500719070435} 11/07/2021 15:52:27 - INFO - __main__ - Step 132565: {'lr': 1.6928432969224e-05, 'samples': 25452480, 'steps': 132564, 'loss/train': 1.3582135438919067} 11/07/2021 15:52:27 - INFO - __main__ - Step 132566: {'lr': 1.6926513459156357e-05, 'samples': 25452672, 'steps': 132565, 'loss/train': 1.1573364734649658} 11/07/2021 15:52:28 - INFO - __main__ - Step 132567: {'lr': 1.6924594054108066e-05, 'samples': 25452864, 'steps': 132566, 'loss/train': 0.5347856879234314} 11/07/2021 15:52:28 - INFO - __main__ - Step 132568: {'lr': 1.692267475407991e-05, 'samples': 25453056, 'steps': 132567, 'loss/train': 1.1886779069900513} 11/07/2021 15:52:28 - INFO - __main__ - Step 132569: {'lr': 1.6920755559072827e-05, 'samples': 25453248, 'steps': 132568, 'loss/train': 1.2041903734207153} 11/07/2021 15:52:30 - INFO - __main__ - Step 132570: {'lr': 1.6918836469087707e-05, 'samples': 25453440, 'steps': 132569, 'loss/train': 0.7928212881088257} 11/07/2021 15:52:30 - INFO - __main__ - Step 132571: {'lr': 1.69169174841253e-05, 'samples': 25453632, 'steps': 132570, 'loss/train': 1.365330457687378} 11/07/2021 15:52:31 - INFO - __main__ - Step 132572: {'lr': 1.691499860418655e-05, 'samples': 25453824, 'steps': 132571, 'loss/train': 1.4143239259719849} 11/07/2021 15:52:31 - INFO - __main__ - Step 132573: {'lr': 1.6913079829272288e-05, 'samples': 25454016, 'steps': 132572, 'loss/train': 1.6162790060043335} 11/07/2021 15:52:31 - INFO - __main__ - Step 132574: {'lr': 1.6911161159383403e-05, 'samples': 25454208, 'steps': 132573, 'loss/train': 1.513338565826416} 11/07/2021 15:52:32 - INFO - __main__ - Step 132575: {'lr': 1.6909242594520756e-05, 'samples': 25454400, 'steps': 132574, 'loss/train': 1.046903371810913} 11/07/2021 15:52:33 - INFO - __main__ - Step 132576: {'lr': 1.690732413468521e-05, 'samples': 25454592, 'steps': 132575, 'loss/train': 1.910815715789795} 11/07/2021 15:52:33 - INFO - __main__ - Step 132577: {'lr': 1.690540577987762e-05, 'samples': 25454784, 'steps': 132576, 'loss/train': 2.1344552040100098} 11/07/2021 15:52:34 - INFO - __main__ - Step 132578: {'lr': 1.690348753009885e-05, 'samples': 25454976, 'steps': 132577, 'loss/train': 1.2878144979476929} 11/07/2021 15:52:34 - INFO - __main__ - Step 132579: {'lr': 1.6901569385349785e-05, 'samples': 25455168, 'steps': 132578, 'loss/train': 1.591432809829712} 11/07/2021 15:52:34 - INFO - __main__ - Step 132580: {'lr': 1.689965134563126e-05, 'samples': 25455360, 'steps': 132579, 'loss/train': 1.086908221244812} 11/07/2021 15:52:35 - INFO - __main__ - Step 132581: {'lr': 1.6897733410944166e-05, 'samples': 25455552, 'steps': 132580, 'loss/train': 0.6734838485717773} 11/07/2021 15:52:36 - INFO - __main__ - Step 132582: {'lr': 1.689581558128936e-05, 'samples': 25455744, 'steps': 132581, 'loss/train': 0.8881520628929138} 11/07/2021 15:52:36 - INFO - __main__ - Step 132583: {'lr': 1.68938978566677e-05, 'samples': 25455936, 'steps': 132582, 'loss/train': 1.1389272212982178} 11/07/2021 15:52:37 - INFO - __main__ - Step 132584: {'lr': 1.689198023708008e-05, 'samples': 25456128, 'steps': 132583, 'loss/train': 1.5091198682785034} 11/07/2021 15:52:37 - INFO - __main__ - Step 132585: {'lr': 1.68900627225273e-05, 'samples': 25456320, 'steps': 132584, 'loss/train': 1.288718819618225} 11/07/2021 15:52:37 - INFO - __main__ - Step 132586: {'lr': 1.6888145313010277e-05, 'samples': 25456512, 'steps': 132585, 'loss/train': 0.8429829478263855} 11/07/2021 15:52:38 - INFO - __main__ - Step 132587: {'lr': 1.6886228008529846e-05, 'samples': 25456704, 'steps': 132586, 'loss/train': 1.1891977787017822} 11/07/2021 15:52:39 - INFO - __main__ - Step 132588: {'lr': 1.6884310809086895e-05, 'samples': 25456896, 'steps': 132587, 'loss/train': 1.6023327112197876} 11/07/2021 15:52:39 - INFO - __main__ - Step 132589: {'lr': 1.6882393714682254e-05, 'samples': 25457088, 'steps': 132588, 'loss/train': 1.4889098405838013} 11/07/2021 15:52:39 - INFO - __main__ - Step 132590: {'lr': 1.688047672531684e-05, 'samples': 25457280, 'steps': 132589, 'loss/train': 1.1109334230422974} 11/07/2021 15:52:40 - INFO - __main__ - Step 132591: {'lr': 1.687855984099146e-05, 'samples': 25457472, 'steps': 132590, 'loss/train': 1.1206622123718262} 11/07/2021 15:52:41 - INFO - __main__ - Step 132592: {'lr': 1.6876643061707026e-05, 'samples': 25457664, 'steps': 132591, 'loss/train': 1.1614452600479126} 11/07/2021 15:52:41 - INFO - __main__ - Step 132593: {'lr': 1.6874726387464347e-05, 'samples': 25457856, 'steps': 132592, 'loss/train': 0.514036238193512} 11/07/2021 15:52:42 - INFO - __main__ - Step 132594: {'lr': 1.6872809818264334e-05, 'samples': 25458048, 'steps': 132593, 'loss/train': 1.4706602096557617} 11/07/2021 15:52:42 - INFO - __main__ - Step 132595: {'lr': 1.6870893354107852e-05, 'samples': 25458240, 'steps': 132594, 'loss/train': 1.6438883543014526} 11/07/2021 15:52:42 - INFO - __main__ - Step 132596: {'lr': 1.6868976994995734e-05, 'samples': 25458432, 'steps': 132595, 'loss/train': 1.1892638206481934} 11/07/2021 15:52:43 - INFO - __main__ - Step 132597: {'lr': 1.686706074092889e-05, 'samples': 25458624, 'steps': 132596, 'loss/train': 1.539808988571167} 11/07/2021 15:52:44 - INFO - __main__ - Step 132598: {'lr': 1.6865144591908134e-05, 'samples': 25458816, 'steps': 132597, 'loss/train': 1.2774103879928589} 11/07/2021 15:52:44 - INFO - __main__ - Step 132599: {'lr': 1.686322854793432e-05, 'samples': 25459008, 'steps': 132598, 'loss/train': 1.3455570936203003} 11/07/2021 15:52:44 - INFO - __main__ - Step 132600: {'lr': 1.686131260900836e-05, 'samples': 25459200, 'steps': 132599, 'loss/train': 1.3983365297317505} 11/07/2021 15:52:45 - INFO - __main__ - Step 132601: {'lr': 1.6859396775131098e-05, 'samples': 25459392, 'steps': 132600, 'loss/train': 1.413623571395874} 11/07/2021 15:52:46 - INFO - __main__ - Step 132602: {'lr': 1.6857481046303358e-05, 'samples': 25459584, 'steps': 132601, 'loss/train': 1.6592711210250854} 11/07/2021 15:52:46 - INFO - __main__ - Step 132603: {'lr': 1.6855565422526058e-05, 'samples': 25459776, 'steps': 132602, 'loss/train': 1.210447907447815} 11/07/2021 15:52:47 - INFO - __main__ - Step 132604: {'lr': 1.685364990380006e-05, 'samples': 25459968, 'steps': 132603, 'loss/train': 1.3412169218063354} 11/07/2021 15:52:47 - INFO - __main__ - Step 132605: {'lr': 1.6851734490126196e-05, 'samples': 25460160, 'steps': 132604, 'loss/train': 1.0084834098815918} 11/07/2021 15:52:47 - INFO - __main__ - Step 132606: {'lr': 1.6849819181505355e-05, 'samples': 25460352, 'steps': 132605, 'loss/train': 1.014394998550415} 11/07/2021 15:52:48 - INFO - __main__ - Step 132607: {'lr': 1.6847903977938366e-05, 'samples': 25460544, 'steps': 132606, 'loss/train': 1.3420284986495972} 11/07/2021 15:52:49 - INFO - __main__ - Step 132608: {'lr': 1.684598887942612e-05, 'samples': 25460736, 'steps': 132607, 'loss/train': 1.2553632259368896} 11/07/2021 15:52:49 - INFO - __main__ - Step 132609: {'lr': 1.684407388596948e-05, 'samples': 25460928, 'steps': 132608, 'loss/train': 1.639649510383606} 11/07/2021 15:52:49 - INFO - __main__ - Step 132610: {'lr': 1.68421589975693e-05, 'samples': 25461120, 'steps': 132609, 'loss/train': 1.4666063785552979} 11/07/2021 15:52:50 - INFO - __main__ - Step 132611: {'lr': 1.6840244214226503e-05, 'samples': 25461312, 'steps': 132610, 'loss/train': 1.5020887851715088} 11/07/2021 15:52:51 - INFO - __main__ - Step 132612: {'lr': 1.6838329535941832e-05, 'samples': 25461504, 'steps': 132611, 'loss/train': 1.711557149887085} 11/07/2021 15:52:51 - INFO - __main__ - Step 132613: {'lr': 1.6836414962716206e-05, 'samples': 25461696, 'steps': 132612, 'loss/train': 1.2954610586166382} 11/07/2021 15:52:51 - INFO - __main__ - Step 132614: {'lr': 1.6834500494550513e-05, 'samples': 25461888, 'steps': 132613, 'loss/train': 1.2028571367263794} 11/07/2021 15:52:52 - INFO - __main__ - Step 132615: {'lr': 1.683258613144559e-05, 'samples': 25462080, 'steps': 132614, 'loss/train': 1.3328132629394531} 11/07/2021 15:52:52 - INFO - __main__ - Step 132616: {'lr': 1.6830671873402288e-05, 'samples': 25462272, 'steps': 132615, 'loss/train': 0.7766487002372742} 11/07/2021 15:52:52 - INFO - __main__ - Step 132617: {'lr': 1.68287577204215e-05, 'samples': 25462464, 'steps': 132616, 'loss/train': 1.2944895029067993} 11/07/2021 15:52:54 - INFO - __main__ - Step 132618: {'lr': 1.682684367250409e-05, 'samples': 25462656, 'steps': 132617, 'loss/train': 1.8191699981689453} 11/07/2021 15:52:54 - INFO - __main__ - Step 132619: {'lr': 1.6824929729650886e-05, 'samples': 25462848, 'steps': 132618, 'loss/train': 1.2792131900787354} 11/07/2021 15:52:54 - INFO - __main__ - Step 132620: {'lr': 1.6823015891862775e-05, 'samples': 25463040, 'steps': 132619, 'loss/train': 1.4576064348220825} 11/07/2021 15:52:55 - INFO - __main__ - Step 132621: {'lr': 1.682110215914062e-05, 'samples': 25463232, 'steps': 132620, 'loss/train': 1.2147811651229858} 11/07/2021 15:52:55 - INFO - __main__ - Step 132622: {'lr': 1.6819188531485287e-05, 'samples': 25463424, 'steps': 132621, 'loss/train': 1.0308001041412354} 11/07/2021 15:52:56 - INFO - __main__ - Step 132623: {'lr': 1.6817275008897624e-05, 'samples': 25463616, 'steps': 132622, 'loss/train': 1.3028262853622437} 11/07/2021 15:52:56 - INFO - __main__ - Step 132624: {'lr': 1.681536159137853e-05, 'samples': 25463808, 'steps': 132623, 'loss/train': 0.6459027528762817} 11/07/2021 15:52:57 - INFO - __main__ - Step 132625: {'lr': 1.6813448278928805e-05, 'samples': 25464000, 'steps': 132624, 'loss/train': 0.9912625551223755} 11/07/2021 15:52:57 - INFO - __main__ - Step 132626: {'lr': 1.681153507154934e-05, 'samples': 25464192, 'steps': 132625, 'loss/train': 0.8314739465713501} 11/07/2021 15:52:57 - INFO - __main__ - Step 132627: {'lr': 1.680962196924099e-05, 'samples': 25464384, 'steps': 132626, 'loss/train': 1.4183392524719238} 11/07/2021 15:52:59 - INFO - __main__ - Step 132628: {'lr': 1.6807708972004622e-05, 'samples': 25464576, 'steps': 132627, 'loss/train': 1.2167987823486328} 11/07/2021 15:52:59 - INFO - __main__ - Step 132629: {'lr': 1.680579607984112e-05, 'samples': 25464768, 'steps': 132628, 'loss/train': 1.502816915512085} 11/07/2021 15:53:00 - INFO - __main__ - Step 132630: {'lr': 1.6803883292751314e-05, 'samples': 25464960, 'steps': 132629, 'loss/train': 1.379033088684082} 11/07/2021 15:53:00 - INFO - __main__ - Step 132631: {'lr': 1.68019706107361e-05, 'samples': 25465152, 'steps': 132630, 'loss/train': 1.2230521440505981} 11/07/2021 15:53:00 - INFO - __main__ - Step 132632: {'lr': 1.6800058033796307e-05, 'samples': 25465344, 'steps': 132631, 'loss/train': 0.057456862181425095} 11/07/2021 15:53:01 - INFO - __main__ - Step 132633: {'lr': 1.679814556193279e-05, 'samples': 25465536, 'steps': 132632, 'loss/train': 0.6552048921585083} 11/07/2021 15:53:02 - INFO - __main__ - Step 132634: {'lr': 1.6796233195146447e-05, 'samples': 25465728, 'steps': 132633, 'loss/train': 1.4586182832717896} 11/07/2021 15:53:02 - INFO - __main__ - Step 132635: {'lr': 1.6794320933438135e-05, 'samples': 25465920, 'steps': 132634, 'loss/train': 1.479195475578308} 11/07/2021 15:53:03 - INFO - __main__ - Step 132636: {'lr': 1.6792408776808682e-05, 'samples': 25466112, 'steps': 132635, 'loss/train': 1.0507466793060303} 11/07/2021 15:53:03 - INFO - __main__ - Step 132637: {'lr': 1.679049672525898e-05, 'samples': 25466304, 'steps': 132636, 'loss/train': 1.3884131908416748} 11/07/2021 15:53:03 - INFO - __main__ - Step 132638: {'lr': 1.678858477878992e-05, 'samples': 25466496, 'steps': 132637, 'loss/train': 1.6809978485107422} 11/07/2021 15:53:04 - INFO - __main__ - Step 132639: {'lr': 1.6786672937402296e-05, 'samples': 25466688, 'steps': 132638, 'loss/train': 1.5980275869369507} 11/07/2021 15:53:05 - INFO - __main__ - Step 132640: {'lr': 1.678476120109698e-05, 'samples': 25466880, 'steps': 132639, 'loss/train': 1.2181400060653687} 11/07/2021 15:53:05 - INFO - __main__ - Step 132641: {'lr': 1.6782849569874858e-05, 'samples': 25467072, 'steps': 132640, 'loss/train': 1.33975088596344} 11/07/2021 15:53:05 - INFO - __main__ - Step 132642: {'lr': 1.6780938043736788e-05, 'samples': 25467264, 'steps': 132641, 'loss/train': 1.4053353071212769} 11/07/2021 15:53:06 - INFO - __main__ - Step 132643: {'lr': 1.677902662268363e-05, 'samples': 25467456, 'steps': 132642, 'loss/train': 1.2618986368179321} 11/07/2021 15:53:07 - INFO - __main__ - Step 132644: {'lr': 1.6777115306716244e-05, 'samples': 25467648, 'steps': 132643, 'loss/train': 1.2428208589553833} 11/07/2021 15:53:07 - INFO - __main__ - Step 132645: {'lr': 1.6775204095835496e-05, 'samples': 25467840, 'steps': 132644, 'loss/train': 1.2234694957733154} 11/07/2021 15:53:07 - INFO - __main__ - Step 132646: {'lr': 1.677329299004224e-05, 'samples': 25468032, 'steps': 132645, 'loss/train': 1.6191034317016602} 11/07/2021 15:53:08 - INFO - __main__ - Step 132647: {'lr': 1.6771381989337337e-05, 'samples': 25468224, 'steps': 132646, 'loss/train': 0.915910542011261} 11/07/2021 15:53:08 - INFO - __main__ - Step 132648: {'lr': 1.676947109372165e-05, 'samples': 25468416, 'steps': 132647, 'loss/train': 1.4414995908737183} 11/07/2021 15:53:09 - INFO - __main__ - Step 132649: {'lr': 1.676756030319604e-05, 'samples': 25468608, 'steps': 132648, 'loss/train': 1.2438329458236694} 11/07/2021 15:53:10 - INFO - __main__ - Step 132650: {'lr': 1.6765649617761364e-05, 'samples': 25468800, 'steps': 132649, 'loss/train': 1.3969727754592896} 11/07/2021 15:53:10 - INFO - __main__ - Step 132651: {'lr': 1.6763739037418514e-05, 'samples': 25468992, 'steps': 132650, 'loss/train': 1.8581639528274536} 11/07/2021 15:53:10 - INFO - __main__ - Step 132652: {'lr': 1.676182856216832e-05, 'samples': 25469184, 'steps': 132651, 'loss/train': 1.2037391662597656} 11/07/2021 15:53:11 - INFO - __main__ - Step 132653: {'lr': 1.6759918192011613e-05, 'samples': 25469376, 'steps': 132652, 'loss/train': 1.299317479133606} 11/07/2021 15:53:12 - INFO - __main__ - Step 132654: {'lr': 1.6758007926949314e-05, 'samples': 25469568, 'steps': 132653, 'loss/train': 1.3118114471435547} 11/07/2021 15:53:12 - INFO - __main__ - Step 132655: {'lr': 1.6756097766982254e-05, 'samples': 25469760, 'steps': 132654, 'loss/train': 1.0007922649383545} 11/07/2021 15:53:12 - INFO - __main__ - Step 132656: {'lr': 1.6754187712111262e-05, 'samples': 25469952, 'steps': 132655, 'loss/train': 1.2351797819137573} 11/07/2021 15:53:13 - INFO - __main__ - Step 132657: {'lr': 1.6752277762337285e-05, 'samples': 25470144, 'steps': 132656, 'loss/train': 1.3586584329605103} 11/07/2021 15:53:13 - INFO - __main__ - Step 132658: {'lr': 1.6750367917661103e-05, 'samples': 25470336, 'steps': 132657, 'loss/train': 1.0412589311599731} 11/07/2021 15:53:14 - INFO - __main__ - Step 132659: {'lr': 1.6748458178083597e-05, 'samples': 25470528, 'steps': 132658, 'loss/train': 1.2417199611663818} 11/07/2021 15:53:14 - INFO - __main__ - Step 132660: {'lr': 1.6746548543605662e-05, 'samples': 25470720, 'steps': 132659, 'loss/train': 1.123828411102295} 11/07/2021 15:53:15 - INFO - __main__ - Step 132661: {'lr': 1.6744639014228126e-05, 'samples': 25470912, 'steps': 132660, 'loss/train': 1.333870768547058} 11/07/2021 15:53:15 - INFO - __main__ - Step 132662: {'lr': 1.674272958995185e-05, 'samples': 25471104, 'steps': 132661, 'loss/train': 1.329858422279358} 11/07/2021 15:53:15 - INFO - __main__ - Step 132663: {'lr': 1.6740820270777696e-05, 'samples': 25471296, 'steps': 132662, 'loss/train': 1.3470226526260376} 11/07/2021 15:53:16 - INFO - __main__ - Step 132664: {'lr': 1.673891105670658e-05, 'samples': 25471488, 'steps': 132663, 'loss/train': 1.4871424436569214} 11/07/2021 15:53:17 - INFO - __main__ - Step 132665: {'lr': 1.6737001947739278e-05, 'samples': 25471680, 'steps': 132664, 'loss/train': 1.2900899648666382} 11/07/2021 15:53:17 - INFO - __main__ - Step 132666: {'lr': 1.673509294387665e-05, 'samples': 25471872, 'steps': 132665, 'loss/train': 1.0749412775039673} 11/07/2021 15:53:18 - INFO - __main__ - Step 132667: {'lr': 1.6733184045119616e-05, 'samples': 25472064, 'steps': 132666, 'loss/train': 0.6813309192657471} 11/07/2021 15:53:18 - INFO - __main__ - Step 132668: {'lr': 1.6731275251469003e-05, 'samples': 25472256, 'steps': 132667, 'loss/train': 1.1254581212997437} 11/07/2021 15:53:18 - INFO - __main__ - Step 132669: {'lr': 1.6729366562925676e-05, 'samples': 25472448, 'steps': 132668, 'loss/train': 1.5760326385498047} 11/07/2021 15:53:19 - INFO - __main__ - Step 132670: {'lr': 1.6727457979490517e-05, 'samples': 25472640, 'steps': 132669, 'loss/train': 1.3126986026763916} 11/07/2021 15:53:20 - INFO - __main__ - Step 132671: {'lr': 1.6725549501164338e-05, 'samples': 25472832, 'steps': 132670, 'loss/train': 1.7190899848937988} 11/07/2021 15:53:20 - INFO - __main__ - Step 132672: {'lr': 1.6723641127948053e-05, 'samples': 25473024, 'steps': 132671, 'loss/train': 1.377536416053772} 11/07/2021 15:53:20 - INFO - __main__ - Step 132673: {'lr': 1.6721732859842464e-05, 'samples': 25473216, 'steps': 132672, 'loss/train': 1.4056357145309448} 11/07/2021 15:53:21 - INFO - __main__ - Step 132674: {'lr': 1.671982469684849e-05, 'samples': 25473408, 'steps': 132673, 'loss/train': 0.6411304473876953} 11/07/2021 15:53:22 - INFO - __main__ - Step 132675: {'lr': 1.6717916638966963e-05, 'samples': 25473600, 'steps': 132674, 'loss/train': 1.217347502708435} 11/07/2021 15:53:23 - INFO - __main__ - Step 132676: {'lr': 1.6716008686198713e-05, 'samples': 25473792, 'steps': 132675, 'loss/train': 1.4393752813339233} 11/07/2021 15:53:23 - INFO - __main__ - Step 132677: {'lr': 1.671410083854466e-05, 'samples': 25473984, 'steps': 132676, 'loss/train': 1.4785621166229248} 11/07/2021 15:53:23 - INFO - __main__ - Step 132678: {'lr': 1.6712193096005662e-05, 'samples': 25474176, 'steps': 132677, 'loss/train': 0.13026514649391174} 11/07/2021 15:53:24 - INFO - __main__ - Step 132679: {'lr': 1.6710285458582524e-05, 'samples': 25474368, 'steps': 132678, 'loss/train': 1.7868242263793945} 11/07/2021 15:53:25 - INFO - __main__ - Step 132680: {'lr': 1.670837792627611e-05, 'samples': 25474560, 'steps': 132679, 'loss/train': 1.6242398023605347} 11/07/2021 15:53:26 - INFO - __main__ - Step 132681: {'lr': 1.67064704990873e-05, 'samples': 25474752, 'steps': 132680, 'loss/train': 1.4670926332473755} 11/07/2021 15:53:26 - INFO - __main__ - Step 132682: {'lr': 1.670456317701696e-05, 'samples': 25474944, 'steps': 132681, 'loss/train': 1.7306058406829834} 11/07/2021 15:53:26 - INFO - __main__ - Step 132683: {'lr': 1.6702655960065956e-05, 'samples': 25475136, 'steps': 132682, 'loss/train': 5.582643985748291} 11/07/2021 15:53:27 - INFO - __main__ - Step 132684: {'lr': 1.6700748848235137e-05, 'samples': 25475328, 'steps': 132683, 'loss/train': 1.4107624292373657} 11/07/2021 15:53:27 - INFO - __main__ - Step 132685: {'lr': 1.669884184152534e-05, 'samples': 25475520, 'steps': 132684, 'loss/train': 1.410884141921997} 11/07/2021 15:53:27 - INFO - __main__ - Step 132686: {'lr': 1.6696934939937458e-05, 'samples': 25475712, 'steps': 132685, 'loss/train': 1.0701065063476562} 11/07/2021 15:53:28 - INFO - __main__ - Step 132687: {'lr': 1.6695028143472347e-05, 'samples': 25475904, 'steps': 132686, 'loss/train': 1.4053833484649658} 11/07/2021 15:53:29 - INFO - __main__ - Step 132688: {'lr': 1.6693121452130838e-05, 'samples': 25476096, 'steps': 132687, 'loss/train': 1.5240346193313599} 11/07/2021 15:53:29 - INFO - __main__ - Step 132689: {'lr': 1.6691214865913852e-05, 'samples': 25476288, 'steps': 132688, 'loss/train': 1.1429194211959839} 11/07/2021 15:53:29 - INFO - __main__ - Step 132690: {'lr': 1.668930838482216e-05, 'samples': 25476480, 'steps': 132689, 'loss/train': 1.2325963973999023} 11/07/2021 15:53:30 - INFO - __main__ - Step 132691: {'lr': 1.6687402008856683e-05, 'samples': 25476672, 'steps': 132690, 'loss/train': 1.213744878768921} 11/07/2021 15:53:31 - INFO - __main__ - Step 132692: {'lr': 1.6685495738018253e-05, 'samples': 25476864, 'steps': 132691, 'loss/train': 1.496881127357483} 11/07/2021 15:53:31 - INFO - __main__ - Step 132693: {'lr': 1.668358957230773e-05, 'samples': 25477056, 'steps': 132692, 'loss/train': 1.4286541938781738} 11/07/2021 15:53:32 - INFO - __main__ - Step 132694: {'lr': 1.6681683511725997e-05, 'samples': 25477248, 'steps': 132693, 'loss/train': 1.2925176620483398} 11/07/2021 15:53:32 - INFO - __main__ - Step 132695: {'lr': 1.6679777556273868e-05, 'samples': 25477440, 'steps': 132694, 'loss/train': 0.6110157370567322} 11/07/2021 15:53:32 - INFO - __main__ - Step 132696: {'lr': 1.6677871705952253e-05, 'samples': 25477632, 'steps': 132695, 'loss/train': 0.9290660619735718} 11/07/2021 15:53:33 - INFO - __main__ - Step 132697: {'lr': 1.6675965960761986e-05, 'samples': 25477824, 'steps': 132696, 'loss/train': 1.3320441246032715} 11/07/2021 15:53:34 - INFO - __main__ - Step 132698: {'lr': 1.6674060320703927e-05, 'samples': 25478016, 'steps': 132697, 'loss/train': 1.4587489366531372} 11/07/2021 15:53:34 - INFO - __main__ - Step 132699: {'lr': 1.6672154785778937e-05, 'samples': 25478208, 'steps': 132698, 'loss/train': 1.2613083124160767} 11/07/2021 15:53:34 - INFO - __main__ - Step 132700: {'lr': 1.6670249355987933e-05, 'samples': 25478400, 'steps': 132699, 'loss/train': 1.232570767402649} 11/07/2021 15:53:35 - INFO - __main__ - Step 132701: {'lr': 1.6668344031331662e-05, 'samples': 25478592, 'steps': 132700, 'loss/train': 1.45110285282135} 11/07/2021 15:53:36 - INFO - __main__ - Step 132702: {'lr': 1.6666438811811014e-05, 'samples': 25478784, 'steps': 132701, 'loss/train': 1.1953341960906982} 11/07/2021 15:53:36 - INFO - __main__ - Step 132703: {'lr': 1.6664533697426874e-05, 'samples': 25478976, 'steps': 132702, 'loss/train': 1.456337571144104} 11/07/2021 15:53:37 - INFO - __main__ - Step 132704: {'lr': 1.666262868818011e-05, 'samples': 25479168, 'steps': 132703, 'loss/train': 1.3731741905212402} 11/07/2021 15:53:37 - INFO - __main__ - Step 132705: {'lr': 1.6660723784071573e-05, 'samples': 25479360, 'steps': 132704, 'loss/train': 1.536211609840393} 11/07/2021 15:53:37 - INFO - __main__ - Step 132706: {'lr': 1.6658818985102075e-05, 'samples': 25479552, 'steps': 132705, 'loss/train': 1.1840357780456543} 11/07/2021 15:53:38 - INFO - __main__ - Step 132707: {'lr': 1.6656914291272558e-05, 'samples': 25479744, 'steps': 132706, 'loss/train': 1.4964630603790283} 11/07/2021 15:53:39 - INFO - __main__ - Step 132708: {'lr': 1.6655009702583794e-05, 'samples': 25479936, 'steps': 132707, 'loss/train': 0.8054669499397278} 11/07/2021 15:53:39 - INFO - __main__ - Step 132709: {'lr': 1.6653105219036708e-05, 'samples': 25480128, 'steps': 132708, 'loss/train': 1.5093355178833008} 11/07/2021 15:53:39 - INFO - __main__ - Step 132710: {'lr': 1.6651200840632123e-05, 'samples': 25480320, 'steps': 132709, 'loss/train': 1.3522964715957642} 11/07/2021 15:53:40 - INFO - __main__ - Step 132711: {'lr': 1.6649296567370938e-05, 'samples': 25480512, 'steps': 132710, 'loss/train': 1.1016443967819214} 11/07/2021 15:53:40 - INFO - __main__ - Step 132712: {'lr': 1.6647392399253947e-05, 'samples': 25480704, 'steps': 132711, 'loss/train': 1.0808919668197632} 11/07/2021 15:53:41 - INFO - __main__ - Step 132713: {'lr': 1.6645488336282044e-05, 'samples': 25480896, 'steps': 132712, 'loss/train': 1.276245355606079} 11/07/2021 15:53:41 - INFO - __main__ - Step 132714: {'lr': 1.664358437845609e-05, 'samples': 25481088, 'steps': 132713, 'loss/train': 1.4325895309448242} 11/07/2021 15:53:42 - INFO - __main__ - Step 132715: {'lr': 1.6641680525776914e-05, 'samples': 25481280, 'steps': 132714, 'loss/train': 1.1912717819213867} 11/07/2021 15:53:42 - INFO - __main__ - Step 132716: {'lr': 1.6639776778245436e-05, 'samples': 25481472, 'steps': 132715, 'loss/train': 1.351104497909546} 11/07/2021 15:53:42 - INFO - __main__ - Step 132717: {'lr': 1.663787313586243e-05, 'samples': 25481664, 'steps': 132716, 'loss/train': 1.3815388679504395} 11/07/2021 15:53:44 - INFO - __main__ - Step 132718: {'lr': 1.6635969598628812e-05, 'samples': 25481856, 'steps': 132717, 'loss/train': 1.475860357284546} 11/07/2021 15:53:44 - INFO - __main__ - Step 132719: {'lr': 1.6634066166545418e-05, 'samples': 25482048, 'steps': 132718, 'loss/train': 1.8234305381774902} 11/07/2021 15:53:44 - INFO - __main__ - Step 132720: {'lr': 1.6632162839613134e-05, 'samples': 25482240, 'steps': 132719, 'loss/train': 0.6547654867172241} 11/07/2021 15:53:45 - INFO - __main__ - Step 132721: {'lr': 1.6630259617832795e-05, 'samples': 25482432, 'steps': 132720, 'loss/train': 0.6565466523170471} 11/07/2021 15:53:45 - INFO - __main__ - Step 132722: {'lr': 1.6628356501205283e-05, 'samples': 25482624, 'steps': 132721, 'loss/train': 1.5093938112258911} 11/07/2021 15:53:46 - INFO - __main__ - Step 132723: {'lr': 1.662645348973138e-05, 'samples': 25482816, 'steps': 132722, 'loss/train': 1.2518161535263062} 11/07/2021 15:53:47 - INFO - __main__ - Step 132724: {'lr': 1.6624550583412028e-05, 'samples': 25483008, 'steps': 132723, 'loss/train': 1.0508832931518555} 11/07/2021 15:53:47 - INFO - __main__ - Step 132725: {'lr': 1.6622647782248036e-05, 'samples': 25483200, 'steps': 132724, 'loss/train': 1.2330478429794312} 11/07/2021 15:53:47 - INFO - __main__ - Step 132726: {'lr': 1.6620745086240285e-05, 'samples': 25483392, 'steps': 132725, 'loss/train': 1.405532717704773} 11/07/2021 15:53:48 - INFO - __main__ - Step 132727: {'lr': 1.6618842495389614e-05, 'samples': 25483584, 'steps': 132726, 'loss/train': 0.5949479341506958} 11/07/2021 15:53:48 - INFO - __main__ - Step 132728: {'lr': 1.6616940009696907e-05, 'samples': 25483776, 'steps': 132727, 'loss/train': 1.3835930824279785} 11/07/2021 15:53:49 - INFO - __main__ - Step 132729: {'lr': 1.6615037629163e-05, 'samples': 25483968, 'steps': 132728, 'loss/train': 0.9979850053787231} 11/07/2021 15:53:49 - INFO - __main__ - Step 132730: {'lr': 1.661313535378875e-05, 'samples': 25484160, 'steps': 132729, 'loss/train': 1.122421145439148} 11/07/2021 15:53:50 - INFO - __main__ - Step 132731: {'lr': 1.661123318357502e-05, 'samples': 25484352, 'steps': 132730, 'loss/train': 1.2674384117126465} 11/07/2021 15:53:50 - INFO - __main__ - Step 132732: {'lr': 1.660933111852267e-05, 'samples': 25484544, 'steps': 132731, 'loss/train': 1.2333306074142456} 11/07/2021 15:53:50 - INFO - __main__ - Step 132733: {'lr': 1.6607429158632587e-05, 'samples': 25484736, 'steps': 132732, 'loss/train': 1.4979372024536133} 11/07/2021 15:53:52 - INFO - __main__ - Step 132734: {'lr': 1.6605527303905548e-05, 'samples': 25484928, 'steps': 132733, 'loss/train': 0.9808763861656189} 11/07/2021 15:53:52 - INFO - __main__ - Step 132735: {'lr': 1.660362555434247e-05, 'samples': 25485120, 'steps': 132734, 'loss/train': 1.469772219657898} 11/07/2021 15:53:52 - INFO - __main__ - Step 132736: {'lr': 1.660172390994419e-05, 'samples': 25485312, 'steps': 132735, 'loss/train': 1.482478380203247} 11/07/2021 15:53:53 - INFO - __main__ - Step 132737: {'lr': 1.6599822370711586e-05, 'samples': 25485504, 'steps': 132736, 'loss/train': 0.9371505379676819} 11/07/2021 15:53:53 - INFO - __main__ - Step 132738: {'lr': 1.65979209366455e-05, 'samples': 25485696, 'steps': 132737, 'loss/train': 1.0895590782165527} 11/07/2021 15:53:54 - INFO - __main__ - Step 132739: {'lr': 1.659601960774676e-05, 'samples': 25485888, 'steps': 132738, 'loss/train': 1.3317604064941406} 11/07/2021 15:53:54 - INFO - __main__ - Step 132740: {'lr': 1.659411838401628e-05, 'samples': 25486080, 'steps': 132739, 'loss/train': 1.1676545143127441} 11/07/2021 15:53:55 - INFO - __main__ - Step 132741: {'lr': 1.6592217265454874e-05, 'samples': 25486272, 'steps': 132740, 'loss/train': 0.9427229166030884} 11/07/2021 15:53:55 - INFO - __main__ - Step 132742: {'lr': 1.659031625206342e-05, 'samples': 25486464, 'steps': 132741, 'loss/train': 1.4059430360794067} 11/07/2021 15:53:55 - INFO - __main__ - Step 132743: {'lr': 1.6588415343842782e-05, 'samples': 25486656, 'steps': 132742, 'loss/train': 0.2278216928243637} 11/07/2021 15:53:57 - INFO - __main__ - Step 132744: {'lr': 1.6586514540793795e-05, 'samples': 25486848, 'steps': 132743, 'loss/train': 1.084481120109558} 11/07/2021 15:53:57 - INFO - __main__ - Step 132745: {'lr': 1.6584613842917346e-05, 'samples': 25487040, 'steps': 132744, 'loss/train': 1.510790228843689} 11/07/2021 15:53:57 - INFO - __main__ - Step 132746: {'lr': 1.6582713250214236e-05, 'samples': 25487232, 'steps': 132745, 'loss/train': 0.8559746742248535} 11/07/2021 15:53:58 - INFO - __main__ - Step 132747: {'lr': 1.658081276268536e-05, 'samples': 25487424, 'steps': 132746, 'loss/train': 1.2998007535934448} 11/07/2021 15:53:58 - INFO - __main__ - Step 132748: {'lr': 1.6578912380331574e-05, 'samples': 25487616, 'steps': 132747, 'loss/train': 1.3421058654785156} 11/07/2021 15:53:59 - INFO - __main__ - Step 132749: {'lr': 1.657701210315371e-05, 'samples': 25487808, 'steps': 132748, 'loss/train': 1.2307864427566528} 11/07/2021 15:53:59 - INFO - __main__ - Step 132750: {'lr': 1.657511193115266e-05, 'samples': 25488000, 'steps': 132749, 'loss/train': 1.1345007419586182} 11/07/2021 15:54:00 - INFO - __main__ - Step 132751: {'lr': 1.657321186432925e-05, 'samples': 25488192, 'steps': 132750, 'loss/train': 1.2711495161056519} 11/07/2021 15:54:00 - INFO - __main__ - Step 132752: {'lr': 1.657131190268435e-05, 'samples': 25488384, 'steps': 132751, 'loss/train': 1.4261250495910645} 11/07/2021 15:54:00 - INFO - __main__ - Step 132753: {'lr': 1.6569412046218814e-05, 'samples': 25488576, 'steps': 132752, 'loss/train': 1.4732059240341187} 11/07/2021 15:54:02 - INFO - __main__ - Step 132754: {'lr': 1.65675122949335e-05, 'samples': 25488768, 'steps': 132753, 'loss/train': 1.3351749181747437} 11/07/2021 15:54:02 - INFO - __main__ - Step 132755: {'lr': 1.6565612648829276e-05, 'samples': 25488960, 'steps': 132754, 'loss/train': 1.1214624643325806} 11/07/2021 15:54:02 - INFO - __main__ - Step 132756: {'lr': 1.656371310790697e-05, 'samples': 25489152, 'steps': 132755, 'loss/train': 0.9434307217597961} 11/07/2021 15:54:03 - INFO - __main__ - Step 132757: {'lr': 1.656181367216744e-05, 'samples': 25489344, 'steps': 132756, 'loss/train': 0.999129056930542} 11/07/2021 15:54:03 - INFO - __main__ - Step 132758: {'lr': 1.655991434161158e-05, 'samples': 25489536, 'steps': 132757, 'loss/train': 1.3113353252410889} 11/07/2021 15:54:03 - INFO - __main__ - Step 132759: {'lr': 1.655801511624025e-05, 'samples': 25489728, 'steps': 132758, 'loss/train': 1.3149030208587646} 11/07/2021 15:54:04 - INFO - __main__ - Step 132760: {'lr': 1.655611599605425e-05, 'samples': 25489920, 'steps': 132759, 'loss/train': 1.7078500986099243} 11/07/2021 15:54:05 - INFO - __main__ - Step 132761: {'lr': 1.6554216981054443e-05, 'samples': 25490112, 'steps': 132760, 'loss/train': 0.9321094751358032} 11/07/2021 15:54:05 - INFO - __main__ - Step 132762: {'lr': 1.655231807124172e-05, 'samples': 25490304, 'steps': 132761, 'loss/train': 1.3575681447982788} 11/07/2021 15:54:05 - INFO - __main__ - Step 132763: {'lr': 1.6550419266616907e-05, 'samples': 25490496, 'steps': 132762, 'loss/train': 1.5701171159744263} 11/07/2021 15:54:06 - INFO - __main__ - Step 132764: {'lr': 1.654852056718087e-05, 'samples': 25490688, 'steps': 132763, 'loss/train': 1.2099767923355103} 11/07/2021 15:54:07 - INFO - __main__ - Step 132765: {'lr': 1.6546621972934467e-05, 'samples': 25490880, 'steps': 132764, 'loss/train': 0.9464409351348877} 11/07/2021 15:54:07 - INFO - __main__ - Step 132766: {'lr': 1.654472348387856e-05, 'samples': 25491072, 'steps': 132765, 'loss/train': 1.0315784215927124} 11/07/2021 15:54:08 - INFO - __main__ - Step 132767: {'lr': 1.6542825100014007e-05, 'samples': 25491264, 'steps': 132766, 'loss/train': 1.1463358402252197} 11/07/2021 15:54:08 - INFO - __main__ - Step 132768: {'lr': 1.6540926821341646e-05, 'samples': 25491456, 'steps': 132767, 'loss/train': 0.8745507597923279} 11/07/2021 15:54:08 - INFO - __main__ - Step 132769: {'lr': 1.653902864786233e-05, 'samples': 25491648, 'steps': 132768, 'loss/train': 1.4386780261993408} 11/07/2021 15:54:10 - INFO - __main__ - Step 132770: {'lr': 1.6537130579576924e-05, 'samples': 25491840, 'steps': 132769, 'loss/train': 0.7687498331069946} 11/07/2021 15:54:10 - INFO - __main__ - Step 132771: {'lr': 1.6535232616486318e-05, 'samples': 25492032, 'steps': 132770, 'loss/train': 1.2556538581848145} 11/07/2021 15:54:10 - INFO - __main__ - Step 132772: {'lr': 1.653333475859134e-05, 'samples': 25492224, 'steps': 132771, 'loss/train': 1.3947603702545166} 11/07/2021 15:54:11 - INFO - __main__ - Step 132773: {'lr': 1.6531437005892825e-05, 'samples': 25492416, 'steps': 132772, 'loss/train': 0.8273944854736328} 11/07/2021 15:54:11 - INFO - __main__ - Step 132774: {'lr': 1.6529539358391605e-05, 'samples': 25492608, 'steps': 132773, 'loss/train': 1.467710256576538} 11/07/2021 15:54:12 - INFO - __main__ - Step 132775: {'lr': 1.6527641816088597e-05, 'samples': 25492800, 'steps': 132774, 'loss/train': 2.046084403991699} 11/07/2021 15:54:12 - INFO - __main__ - Step 132776: {'lr': 1.652574437898463e-05, 'samples': 25492992, 'steps': 132775, 'loss/train': 1.6832128763198853} 11/07/2021 15:54:13 - INFO - __main__ - Step 132777: {'lr': 1.652384704708057e-05, 'samples': 25493184, 'steps': 132776, 'loss/train': 1.235222339630127} 11/07/2021 15:54:13 - INFO - __main__ - Step 132778: {'lr': 1.6521949820377246e-05, 'samples': 25493376, 'steps': 132777, 'loss/train': 0.6499102115631104} 11/07/2021 15:54:13 - INFO - __main__ - Step 132779: {'lr': 1.652005269887552e-05, 'samples': 25493568, 'steps': 132778, 'loss/train': 1.664064884185791} 11/07/2021 15:54:14 - INFO - __main__ - Step 132780: {'lr': 1.651815568257628e-05, 'samples': 25493760, 'steps': 132779, 'loss/train': 1.0802276134490967} 11/07/2021 15:54:15 - INFO - __main__ - Step 132781: {'lr': 1.651625877148033e-05, 'samples': 25493952, 'steps': 132780, 'loss/train': 1.251137614250183} 11/07/2021 15:54:15 - INFO - __main__ - Step 132782: {'lr': 1.651436196558856e-05, 'samples': 25494144, 'steps': 132781, 'loss/train': 0.616269588470459} 11/07/2021 15:54:16 - INFO - __main__ - Step 132783: {'lr': 1.651246526490183e-05, 'samples': 25494336, 'steps': 132782, 'loss/train': 1.448642611503601} 11/07/2021 15:54:16 - INFO - __main__ - Step 132784: {'lr': 1.6510568669420967e-05, 'samples': 25494528, 'steps': 132783, 'loss/train': 1.0347942113876343} 11/07/2021 15:54:17 - INFO - __main__ - Step 132785: {'lr': 1.6508672179146892e-05, 'samples': 25494720, 'steps': 132784, 'loss/train': 0.9909622669219971} 11/07/2021 15:54:17 - INFO - __main__ - Step 132786: {'lr': 1.6506775794080358e-05, 'samples': 25494912, 'steps': 132785, 'loss/train': 0.14729459583759308} 11/07/2021 15:54:18 - INFO - __main__ - Step 132787: {'lr': 1.6504879514222248e-05, 'samples': 25495104, 'steps': 132786, 'loss/train': 0.6729322075843811} 11/07/2021 15:54:18 - INFO - __main__ - Step 132788: {'lr': 1.650298333957348e-05, 'samples': 25495296, 'steps': 132787, 'loss/train': 1.0102566480636597} 11/07/2021 15:54:18 - INFO - __main__ - Step 132789: {'lr': 1.650108727013483e-05, 'samples': 25495488, 'steps': 132788, 'loss/train': 2.313636064529419} 11/07/2021 15:54:19 - INFO - __main__ - Step 132790: {'lr': 1.6499191305907184e-05, 'samples': 25495680, 'steps': 132789, 'loss/train': 1.254756212234497} 11/07/2021 15:54:20 - INFO - __main__ - Step 132791: {'lr': 1.6497295446891407e-05, 'samples': 25495872, 'steps': 132790, 'loss/train': 0.7919303178787231} 11/07/2021 15:54:20 - INFO - __main__ - Step 132792: {'lr': 1.649539969308836e-05, 'samples': 25496064, 'steps': 132791, 'loss/train': 1.2626205682754517} 11/07/2021 15:54:21 - INFO - __main__ - Step 132793: {'lr': 1.649350404449887e-05, 'samples': 25496256, 'steps': 132792, 'loss/train': 1.2232378721237183} 11/07/2021 15:54:21 - INFO - __main__ - Step 132794: {'lr': 1.64916085011238e-05, 'samples': 25496448, 'steps': 132793, 'loss/train': 1.1233094930648804} 11/07/2021 15:54:21 - INFO - __main__ - Step 132795: {'lr': 1.648971306296404e-05, 'samples': 25496640, 'steps': 132794, 'loss/train': 0.8703613877296448} 11/07/2021 15:54:22 - INFO - __main__ - Step 132796: {'lr': 1.6487817730020365e-05, 'samples': 25496832, 'steps': 132795, 'loss/train': 0.8191754817962646} 11/07/2021 15:54:23 - INFO - __main__ - Step 132797: {'lr': 1.6485922502293693e-05, 'samples': 25497024, 'steps': 132796, 'loss/train': 1.5389525890350342} 11/07/2021 15:54:23 - INFO - __main__ - Step 132798: {'lr': 1.648402737978488e-05, 'samples': 25497216, 'steps': 132797, 'loss/train': 1.397964596748352} 11/07/2021 15:54:23 - INFO - __main__ - Step 132799: {'lr': 1.6482132362494794e-05, 'samples': 25497408, 'steps': 132798, 'loss/train': 1.0298835039138794} 11/07/2021 15:54:24 - INFO - __main__ - Step 132800: {'lr': 1.6480237450424206e-05, 'samples': 25497600, 'steps': 132799, 'loss/train': 1.987168788909912} 11/07/2021 15:54:25 - INFO - __main__ - Step 132801: {'lr': 1.6478342643574006e-05, 'samples': 25497792, 'steps': 132800, 'loss/train': 1.1394717693328857} 11/07/2021 15:54:25 - INFO - __main__ - Step 132802: {'lr': 1.6476447941945082e-05, 'samples': 25497984, 'steps': 132801, 'loss/train': 1.167667031288147} 11/07/2021 15:54:26 - INFO - __main__ - Step 132803: {'lr': 1.647455334553827e-05, 'samples': 25498176, 'steps': 132802, 'loss/train': 1.1211196184158325} 11/07/2021 15:54:26 - INFO - __main__ - Step 132804: {'lr': 1.6472658854354423e-05, 'samples': 25498368, 'steps': 132803, 'loss/train': 1.6323906183242798} 11/07/2021 15:54:26 - INFO - __main__ - Step 132805: {'lr': 1.647076446839438e-05, 'samples': 25498560, 'steps': 132804, 'loss/train': 0.9528706669807434} 11/07/2021 15:54:27 - INFO - __main__ - Step 132806: {'lr': 1.6468870187658996e-05, 'samples': 25498752, 'steps': 132805, 'loss/train': 1.5044214725494385} 11/07/2021 15:54:28 - INFO - __main__ - Step 132807: {'lr': 1.6466976012149137e-05, 'samples': 25498944, 'steps': 132806, 'loss/train': 1.0111931562423706} 11/07/2021 15:54:28 - INFO - __main__ - Step 132808: {'lr': 1.646508194186569e-05, 'samples': 25499136, 'steps': 132807, 'loss/train': 1.0798733234405518} 11/07/2021 15:54:28 - INFO - __main__ - Step 132809: {'lr': 1.646318797680943e-05, 'samples': 25499328, 'steps': 132808, 'loss/train': 0.8659196496009827} 11/07/2021 15:54:29 - INFO - __main__ - Step 132810: {'lr': 1.6461294116981272e-05, 'samples': 25499520, 'steps': 132809, 'loss/train': 1.2434427738189697} 11/07/2021 15:54:29 - INFO - __main__ - Step 132811: {'lr': 1.6459400362382054e-05, 'samples': 25499712, 'steps': 132810, 'loss/train': 1.1235790252685547} 11/07/2021 15:54:30 - INFO - __main__ - Step 132812: {'lr': 1.6457506713012687e-05, 'samples': 25499904, 'steps': 132811, 'loss/train': 0.9669278860092163} 11/07/2021 15:54:31 - INFO - __main__ - Step 132813: {'lr': 1.6455613168873895e-05, 'samples': 25500096, 'steps': 132812, 'loss/train': 0.03370874747633934} 11/07/2021 15:54:31 - INFO - __main__ - Step 132814: {'lr': 1.6453719729966594e-05, 'samples': 25500288, 'steps': 132813, 'loss/train': 1.2565542459487915} 11/07/2021 15:54:31 - INFO - __main__ - Step 132815: {'lr': 1.6451826396291668e-05, 'samples': 25500480, 'steps': 132814, 'loss/train': 0.6549372673034668} 11/07/2021 15:54:32 - INFO - __main__ - Step 132816: {'lr': 1.644993316784993e-05, 'samples': 25500672, 'steps': 132815, 'loss/train': 0.17806176841259003} 11/07/2021 15:54:33 - INFO - __main__ - Step 132817: {'lr': 1.644804004464226e-05, 'samples': 25500864, 'steps': 132816, 'loss/train': 1.4458165168762207} 11/07/2021 15:54:33 - INFO - __main__ - Step 132818: {'lr': 1.6446147026669493e-05, 'samples': 25501056, 'steps': 132817, 'loss/train': 1.2906725406646729} 11/07/2021 15:54:33 - INFO - __main__ - Step 132819: {'lr': 1.6444254113932466e-05, 'samples': 25501248, 'steps': 132818, 'loss/train': 0.9591747522354126} 11/07/2021 15:54:34 - INFO - __main__ - Step 132820: {'lr': 1.6442361306432092e-05, 'samples': 25501440, 'steps': 132819, 'loss/train': 1.3651050329208374} 11/07/2021 15:54:34 - INFO - __main__ - Step 132821: {'lr': 1.6440468604169172e-05, 'samples': 25501632, 'steps': 132820, 'loss/train': 1.5400218963623047} 11/07/2021 15:54:35 - INFO - __main__ - Step 132822: {'lr': 1.6438576007144547e-05, 'samples': 25501824, 'steps': 132821, 'loss/train': 1.1812200546264648} 11/07/2021 15:54:36 - INFO - __main__ - Step 132823: {'lr': 1.6436683515359126e-05, 'samples': 25502016, 'steps': 132822, 'loss/train': 1.1782621145248413} 11/07/2021 15:54:36 - INFO - __main__ - Step 132824: {'lr': 1.6434791128813714e-05, 'samples': 25502208, 'steps': 132823, 'loss/train': 1.2485687732696533} 11/07/2021 15:54:36 - INFO - __main__ - Step 132825: {'lr': 1.6432898847509204e-05, 'samples': 25502400, 'steps': 132824, 'loss/train': 1.5379281044006348} 11/07/2021 15:54:37 - INFO - __main__ - Step 132826: {'lr': 1.643100667144645e-05, 'samples': 25502592, 'steps': 132825, 'loss/train': 1.2545145750045776} 11/07/2021 15:54:38 - INFO - __main__ - Step 132827: {'lr': 1.6429114600626238e-05, 'samples': 25502784, 'steps': 132826, 'loss/train': 1.2627649307250977} 11/07/2021 15:54:38 - INFO - __main__ - Step 132828: {'lr': 1.6427222635049475e-05, 'samples': 25502976, 'steps': 132827, 'loss/train': 1.216719627380371} 11/07/2021 15:54:38 - INFO - __main__ - Step 132829: {'lr': 1.6425330774716973e-05, 'samples': 25503168, 'steps': 132828, 'loss/train': 1.2054318189620972} 11/07/2021 15:54:39 - INFO - __main__ - Step 132830: {'lr': 1.6423439019629642e-05, 'samples': 25503360, 'steps': 132829, 'loss/train': 1.1777719259262085} 11/07/2021 15:54:39 - INFO - __main__ - Step 132831: {'lr': 1.6421547369788293e-05, 'samples': 25503552, 'steps': 132830, 'loss/train': 1.1163240671157837} 11/07/2021 15:54:40 - INFO - __main__ - Step 132832: {'lr': 1.6419655825193807e-05, 'samples': 25503744, 'steps': 132831, 'loss/train': 1.360827922821045} 11/07/2021 15:54:40 - INFO - __main__ - Step 132833: {'lr': 1.6417764385846996e-05, 'samples': 25503936, 'steps': 132832, 'loss/train': 1.1194511651992798} 11/07/2021 15:54:41 - INFO - __main__ - Step 132834: {'lr': 1.6415873051748742e-05, 'samples': 25504128, 'steps': 132833, 'loss/train': 0.5810201168060303} 11/07/2021 15:54:41 - INFO - __main__ - Step 132835: {'lr': 1.641398182289991e-05, 'samples': 25504320, 'steps': 132834, 'loss/train': 1.536744236946106} 11/07/2021 15:54:41 - INFO - __main__ - Step 132836: {'lr': 1.64120906993013e-05, 'samples': 25504512, 'steps': 132835, 'loss/train': 0.9115467667579651} 11/07/2021 15:54:42 - INFO - __main__ - Step 132837: {'lr': 1.6410199680953806e-05, 'samples': 25504704, 'steps': 132836, 'loss/train': 1.2509616613388062} 11/07/2021 15:54:43 - INFO - __main__ - Step 132838: {'lr': 1.6408308767858286e-05, 'samples': 25504896, 'steps': 132837, 'loss/train': 1.8460158109664917} 11/07/2021 15:54:43 - INFO - __main__ - Step 132839: {'lr': 1.640641796001563e-05, 'samples': 25505088, 'steps': 132838, 'loss/train': 1.4715076684951782} 11/07/2021 15:54:44 - INFO - __main__ - Step 132840: {'lr': 1.6404527257426583e-05, 'samples': 25505280, 'steps': 132839, 'loss/train': 1.0234264135360718} 11/07/2021 15:54:44 - INFO - __main__ - Step 132841: {'lr': 1.6402636660092037e-05, 'samples': 25505472, 'steps': 132840, 'loss/train': 1.2246137857437134} 11/07/2021 15:54:45 - INFO - __main__ - Step 132842: {'lr': 1.6400746168012874e-05, 'samples': 25505664, 'steps': 132841, 'loss/train': 1.4724072217941284} 11/07/2021 15:54:45 - INFO - __main__ - Step 132843: {'lr': 1.639885578118991e-05, 'samples': 25505856, 'steps': 132842, 'loss/train': 0.8283597826957703} 11/07/2021 15:54:46 - INFO - __main__ - Step 132844: {'lr': 1.639696549962405e-05, 'samples': 25506048, 'steps': 132843, 'loss/train': 1.6922471523284912} 11/07/2021 15:54:46 - INFO - __main__ - Step 132845: {'lr': 1.6395075323316077e-05, 'samples': 25506240, 'steps': 132844, 'loss/train': 1.3964438438415527} 11/07/2021 15:54:46 - INFO - __main__ - Step 132846: {'lr': 1.6393185252266906e-05, 'samples': 25506432, 'steps': 132845, 'loss/train': 1.066694974899292} 11/07/2021 15:54:47 - INFO - __main__ - Step 132847: {'lr': 1.6391295286477344e-05, 'samples': 25506624, 'steps': 132846, 'loss/train': 1.2468675374984741} 11/07/2021 15:54:48 - INFO - __main__ - Step 132848: {'lr': 1.6389405425948274e-05, 'samples': 25506816, 'steps': 132847, 'loss/train': 1.2550063133239746} 11/07/2021 15:54:48 - INFO - __main__ - Step 132849: {'lr': 1.638751567068053e-05, 'samples': 25507008, 'steps': 132848, 'loss/train': 1.3201545476913452} 11/07/2021 15:54:48 - INFO - __main__ - Step 132850: {'lr': 1.6385626020674975e-05, 'samples': 25507200, 'steps': 132849, 'loss/train': 1.1486330032348633} 11/07/2021 15:54:49 - INFO - __main__ - Step 132851: {'lr': 1.6383736475932416e-05, 'samples': 25507392, 'steps': 132850, 'loss/train': 1.3867329359054565} 11/07/2021 15:54:50 - INFO - __main__ - Step 132852: {'lr': 1.638184703645379e-05, 'samples': 25507584, 'steps': 132851, 'loss/train': 1.0173723697662354} 11/07/2021 15:54:50 - INFO - __main__ - Step 132853: {'lr': 1.6379957702239907e-05, 'samples': 25507776, 'steps': 132852, 'loss/train': 1.4553321599960327} 11/07/2021 15:54:51 - INFO - __main__ - Step 132854: {'lr': 1.637806847329157e-05, 'samples': 25507968, 'steps': 132853, 'loss/train': 0.1528453528881073} 11/07/2021 15:54:51 - INFO - __main__ - Step 132855: {'lr': 1.6376179349609664e-05, 'samples': 25508160, 'steps': 132854, 'loss/train': 2.3912699222564697} 11/07/2021 15:54:51 - INFO - __main__ - Step 132856: {'lr': 1.637429033119506e-05, 'samples': 25508352, 'steps': 132855, 'loss/train': 1.2954559326171875} 11/07/2021 15:54:52 - INFO - __main__ - Step 132857: {'lr': 1.6372401418048604e-05, 'samples': 25508544, 'steps': 132856, 'loss/train': 1.5755860805511475} 11/07/2021 15:54:53 - INFO - __main__ - Step 132858: {'lr': 1.637051261017114e-05, 'samples': 25508736, 'steps': 132857, 'loss/train': 1.5189075469970703} 11/07/2021 15:54:53 - INFO - __main__ - Step 132859: {'lr': 1.6368623907563494e-05, 'samples': 25508928, 'steps': 132858, 'loss/train': 0.737744927406311} 11/07/2021 15:54:53 - INFO - __main__ - Step 132860: {'lr': 1.6366735310226562e-05, 'samples': 25509120, 'steps': 132859, 'loss/train': 1.0820086002349854} 11/07/2021 15:54:54 - INFO - __main__ - Step 132861: {'lr': 1.6364846818161167e-05, 'samples': 25509312, 'steps': 132860, 'loss/train': 1.1871862411499023} 11/07/2021 15:54:55 - INFO - __main__ - Step 132862: {'lr': 1.6362958431368175e-05, 'samples': 25509504, 'steps': 132861, 'loss/train': 1.2923270463943481} 11/07/2021 15:54:55 - INFO - __main__ - Step 132863: {'lr': 1.636107014984842e-05, 'samples': 25509696, 'steps': 132862, 'loss/train': 1.3110607862472534} 11/07/2021 15:54:55 - INFO - __main__ - Step 132864: {'lr': 1.6359181973602755e-05, 'samples': 25509888, 'steps': 132863, 'loss/train': 1.271411418914795} 11/07/2021 15:54:56 - INFO - __main__ - Step 132865: {'lr': 1.635729390263205e-05, 'samples': 25510080, 'steps': 132864, 'loss/train': 1.298101782798767} 11/07/2021 15:54:56 - INFO - __main__ - Step 132866: {'lr': 1.6355405936937156e-05, 'samples': 25510272, 'steps': 132865, 'loss/train': 1.0293000936508179} 11/07/2021 15:54:57 - INFO - __main__ - Step 132867: {'lr': 1.6353518076518887e-05, 'samples': 25510464, 'steps': 132866, 'loss/train': 0.8121083974838257} 11/07/2021 15:54:58 - INFO - __main__ - Step 132868: {'lr': 1.6351630321378124e-05, 'samples': 25510656, 'steps': 132867, 'loss/train': 0.8218778967857361} 11/07/2021 15:54:58 - INFO - __main__ - Step 132869: {'lr': 1.6349742671515705e-05, 'samples': 25510848, 'steps': 132868, 'loss/train': 2.044487953186035} 11/07/2021 15:54:58 - INFO - __main__ - Step 132870: {'lr': 1.6347855126932515e-05, 'samples': 25511040, 'steps': 132869, 'loss/train': 1.0533726215362549} 11/07/2021 15:54:59 - INFO - __main__ - Step 132871: {'lr': 1.634596768762933e-05, 'samples': 25511232, 'steps': 132870, 'loss/train': 1.7599453926086426} 11/07/2021 15:54:59 - INFO - __main__ - Step 132872: {'lr': 1.634408035360707e-05, 'samples': 25511424, 'steps': 132871, 'loss/train': 1.238645315170288} 11/07/2021 15:55:00 - INFO - __main__ - Step 132873: {'lr': 1.6342193124866566e-05, 'samples': 25511616, 'steps': 132872, 'loss/train': 1.676576018333435} 11/07/2021 15:55:00 - INFO - __main__ - Step 132874: {'lr': 1.634030600140865e-05, 'samples': 25511808, 'steps': 132873, 'loss/train': 1.4702706336975098} 11/07/2021 15:55:01 - INFO - __main__ - Step 132875: {'lr': 1.633841898323421e-05, 'samples': 25512000, 'steps': 132874, 'loss/train': 1.3592718839645386} 11/07/2021 15:55:01 - INFO - __main__ - Step 132876: {'lr': 1.6336532070344053e-05, 'samples': 25512192, 'steps': 132875, 'loss/train': 1.2687759399414062} 11/07/2021 15:55:01 - INFO - __main__ - Step 132877: {'lr': 1.6334645262739033e-05, 'samples': 25512384, 'steps': 132876, 'loss/train': 1.0955886840820312} 11/07/2021 15:55:02 - INFO - __main__ - Step 132878: {'lr': 1.6332758560420046e-05, 'samples': 25512576, 'steps': 132877, 'loss/train': 1.0877951383590698} 11/07/2021 15:55:03 - INFO - __main__ - Step 132879: {'lr': 1.6330871963387895e-05, 'samples': 25512768, 'steps': 132878, 'loss/train': 1.2403134107589722} 11/07/2021 15:55:03 - INFO - __main__ - Step 132880: {'lr': 1.632898547164352e-05, 'samples': 25512960, 'steps': 132879, 'loss/train': 0.6791658997535706} 11/07/2021 15:55:03 - INFO - __main__ - Step 132881: {'lr': 1.632709908518762e-05, 'samples': 25513152, 'steps': 132880, 'loss/train': 1.5611810684204102} 11/07/2021 15:55:04 - INFO - __main__ - Step 132882: {'lr': 1.6325212804021133e-05, 'samples': 25513344, 'steps': 132881, 'loss/train': 1.5614573955535889} 11/07/2021 15:55:05 - INFO - __main__ - Step 132883: {'lr': 1.6323326628144897e-05, 'samples': 25513536, 'steps': 132882, 'loss/train': 1.5111520290374756} 11/07/2021 15:55:05 - INFO - __main__ - Step 132884: {'lr': 1.6321440557559768e-05, 'samples': 25513728, 'steps': 132883, 'loss/train': 1.29118812084198} 11/07/2021 15:55:06 - INFO - __main__ - Step 132885: {'lr': 1.6319554592266612e-05, 'samples': 25513920, 'steps': 132884, 'loss/train': 1.1966662406921387} 11/07/2021 15:55:06 - INFO - __main__ - Step 132886: {'lr': 1.6317668732266228e-05, 'samples': 25514112, 'steps': 132885, 'loss/train': 0.929701566696167} 11/07/2021 15:55:06 - INFO - __main__ - Step 132887: {'lr': 1.6315782977559506e-05, 'samples': 25514304, 'steps': 132886, 'loss/train': 1.1773654222488403} 11/07/2021 15:55:07 - INFO - __main__ - Step 132888: {'lr': 1.6313897328147308e-05, 'samples': 25514496, 'steps': 132887, 'loss/train': 1.3489657640457153} 11/07/2021 15:55:08 - INFO - __main__ - Step 132889: {'lr': 1.631201178403044e-05, 'samples': 25514688, 'steps': 132888, 'loss/train': 1.3398733139038086} 11/07/2021 15:55:08 - INFO - __main__ - Step 132890: {'lr': 1.6310126345209785e-05, 'samples': 25514880, 'steps': 132889, 'loss/train': 1.4305304288864136} 11/07/2021 15:55:08 - INFO - __main__ - Step 132891: {'lr': 1.6308241011686154e-05, 'samples': 25515072, 'steps': 132890, 'loss/train': 1.9948828220367432} 11/07/2021 15:55:09 - INFO - __main__ - Step 132892: {'lr': 1.630635578346046e-05, 'samples': 25515264, 'steps': 132891, 'loss/train': 1.0886590480804443} 11/07/2021 15:55:10 - INFO - __main__ - Step 132893: {'lr': 1.6304470660533532e-05, 'samples': 25515456, 'steps': 132892, 'loss/train': 1.0713098049163818} 11/07/2021 15:55:10 - INFO - __main__ - Step 132894: {'lr': 1.6302585642906153e-05, 'samples': 25515648, 'steps': 132893, 'loss/train': 0.04130937159061432} 11/07/2021 15:55:11 - INFO - __main__ - Step 132895: {'lr': 1.6300700730579237e-05, 'samples': 25515840, 'steps': 132894, 'loss/train': 1.1725777387619019} 11/07/2021 15:55:11 - INFO - __main__ - Step 132896: {'lr': 1.6298815923553644e-05, 'samples': 25516032, 'steps': 132895, 'loss/train': 1.2847907543182373} 11/07/2021 15:55:12 - INFO - __main__ - Step 132897: {'lr': 1.629693122183018e-05, 'samples': 25516224, 'steps': 132896, 'loss/train': 0.8657029271125793} 11/07/2021 15:55:13 - INFO - __main__ - Step 132898: {'lr': 1.6295046625409705e-05, 'samples': 25516416, 'steps': 132897, 'loss/train': 1.0460747480392456} 11/07/2021 15:55:13 - INFO - __main__ - Step 132899: {'lr': 1.6293162134293077e-05, 'samples': 25516608, 'steps': 132898, 'loss/train': 0.8223926424980164} 11/07/2021 15:55:14 - INFO - __main__ - Step 132900: {'lr': 1.6291277748481133e-05, 'samples': 25516800, 'steps': 132899, 'loss/train': 0.9684690237045288} 11/07/2021 15:55:14 - INFO - __main__ - Step 132901: {'lr': 1.6289393467974757e-05, 'samples': 25516992, 'steps': 132900, 'loss/train': 1.2021851539611816} 11/07/2021 15:55:14 - INFO - __main__ - Step 132902: {'lr': 1.6287509292774754e-05, 'samples': 25517184, 'steps': 132901, 'loss/train': 0.9134561419487} 11/07/2021 15:55:15 - INFO - __main__ - Step 132903: {'lr': 1.6285625222882016e-05, 'samples': 25517376, 'steps': 132902, 'loss/train': 0.40382975339889526} 11/07/2021 15:55:16 - INFO - __main__ - Step 132904: {'lr': 1.6283741258297348e-05, 'samples': 25517568, 'steps': 132903, 'loss/train': 0.07374582439661026} 11/07/2021 15:55:16 - INFO - __main__ - Step 132905: {'lr': 1.6281857399021632e-05, 'samples': 25517760, 'steps': 132904, 'loss/train': 1.5437265634536743} 11/07/2021 15:55:16 - INFO - __main__ - Step 132906: {'lr': 1.6279973645055735e-05, 'samples': 25517952, 'steps': 132905, 'loss/train': 1.1750491857528687} 11/07/2021 15:55:17 - INFO - __main__ - Step 132907: {'lr': 1.627808999640043e-05, 'samples': 25518144, 'steps': 132906, 'loss/train': 1.187008261680603} 11/07/2021 15:55:17 - INFO - __main__ - Step 132908: {'lr': 1.6276206453056634e-05, 'samples': 25518336, 'steps': 132907, 'loss/train': 1.3894692659378052} 11/07/2021 15:55:18 - INFO - __main__ - Step 132909: {'lr': 1.6274323015025156e-05, 'samples': 25518528, 'steps': 132908, 'loss/train': 1.7443112134933472} 11/07/2021 15:55:19 - INFO - __main__ - Step 132910: {'lr': 1.627243968230685e-05, 'samples': 25518720, 'steps': 132909, 'loss/train': 0.08770480751991272} 11/07/2021 15:55:19 - INFO - __main__ - Step 132911: {'lr': 1.6270556454902608e-05, 'samples': 25518912, 'steps': 132910, 'loss/train': 0.739896833896637} 11/07/2021 15:55:19 - INFO - __main__ - Step 132912: {'lr': 1.6268673332813206e-05, 'samples': 25519104, 'steps': 132911, 'loss/train': 1.1785329580307007} 11/07/2021 15:55:20 - INFO - __main__ - Step 132913: {'lr': 1.626679031603956e-05, 'samples': 25519296, 'steps': 132912, 'loss/train': 0.8993450403213501} 11/07/2021 15:55:21 - INFO - __main__ - Step 132914: {'lr': 1.6264907404582502e-05, 'samples': 25519488, 'steps': 132913, 'loss/train': 1.4804130792617798} 11/07/2021 15:55:21 - INFO - __main__ - Step 132915: {'lr': 1.6263024598442837e-05, 'samples': 25519680, 'steps': 132914, 'loss/train': 0.9908188581466675} 11/07/2021 15:55:21 - INFO - __main__ - Step 132916: {'lr': 1.6261141897621483e-05, 'samples': 25519872, 'steps': 132915, 'loss/train': 1.3419957160949707} 11/07/2021 15:55:22 - INFO - __main__ - Step 132917: {'lr': 1.6259259302119217e-05, 'samples': 25520064, 'steps': 132916, 'loss/train': 1.212352991104126} 11/07/2021 15:55:22 - INFO - __main__ - Step 132918: {'lr': 1.6257376811936953e-05, 'samples': 25520256, 'steps': 132917, 'loss/train': 1.1223305463790894} 11/07/2021 15:55:23 - INFO - __main__ - Step 132919: {'lr': 1.6255494427075497e-05, 'samples': 25520448, 'steps': 132918, 'loss/train': 1.216562271118164} 11/07/2021 15:55:24 - INFO - __main__ - Step 132920: {'lr': 1.6253612147535734e-05, 'samples': 25520640, 'steps': 132919, 'loss/train': 0.5058793425559998} 11/07/2021 15:55:24 - INFO - __main__ - Step 132921: {'lr': 1.6251729973318473e-05, 'samples': 25520832, 'steps': 132920, 'loss/train': 0.8328882455825806} 11/07/2021 15:55:25 - INFO - __main__ - Step 132922: {'lr': 1.624984790442455e-05, 'samples': 25521024, 'steps': 132921, 'loss/train': 1.355825662612915} 11/07/2021 15:55:25 - INFO - __main__ - Step 132923: {'lr': 1.624796594085484e-05, 'samples': 25521216, 'steps': 132922, 'loss/train': 0.6593784689903259} 11/07/2021 15:55:26 - INFO - __main__ - Step 132924: {'lr': 1.624608408261022e-05, 'samples': 25521408, 'steps': 132923, 'loss/train': 1.4536137580871582} 11/07/2021 15:55:26 - INFO - __main__ - Step 132925: {'lr': 1.6244202329691483e-05, 'samples': 25521600, 'steps': 132924, 'loss/train': 0.95860356092453} 11/07/2021 15:55:27 - INFO - __main__ - Step 132926: {'lr': 1.624232068209949e-05, 'samples': 25521792, 'steps': 132925, 'loss/train': 1.5190842151641846} 11/07/2021 15:55:27 - INFO - __main__ - Step 132927: {'lr': 1.6240439139835112e-05, 'samples': 25521984, 'steps': 132926, 'loss/train': 1.3652935028076172} 11/07/2021 15:55:27 - INFO - __main__ - Step 132928: {'lr': 1.6238557702899198e-05, 'samples': 25522176, 'steps': 132927, 'loss/train': 1.0827457904815674} 11/07/2021 15:55:29 - INFO - __main__ - Step 132929: {'lr': 1.6236676371292557e-05, 'samples': 25522368, 'steps': 132928, 'loss/train': 1.507455587387085} 11/07/2021 15:55:29 - INFO - __main__ - Step 132930: {'lr': 1.6234795145016078e-05, 'samples': 25522560, 'steps': 132929, 'loss/train': 0.9133584499359131} 11/07/2021 15:55:29 - INFO - __main__ - Step 132931: {'lr': 1.6232914024070593e-05, 'samples': 25522752, 'steps': 132930, 'loss/train': 1.5399928092956543} 11/07/2021 15:55:30 - INFO - __main__ - Step 132932: {'lr': 1.6231033008456965e-05, 'samples': 25522944, 'steps': 132931, 'loss/train': 1.1901718378067017} 11/07/2021 15:55:30 - INFO - __main__ - Step 132933: {'lr': 1.6229152098176047e-05, 'samples': 25523136, 'steps': 132932, 'loss/train': 0.2149202972650528} 11/07/2021 15:55:31 - INFO - __main__ - Step 132934: {'lr': 1.6227271293228624e-05, 'samples': 25523328, 'steps': 132933, 'loss/train': 1.2100188732147217} 11/07/2021 15:55:31 - INFO - __main__ - Step 132935: {'lr': 1.622539059361558e-05, 'samples': 25523520, 'steps': 132934, 'loss/train': 1.3436098098754883} 11/07/2021 15:55:32 - INFO - __main__ - Step 132936: {'lr': 1.6223509999337805e-05, 'samples': 25523712, 'steps': 132935, 'loss/train': 1.5007824897766113} 11/07/2021 15:55:32 - INFO - __main__ - Step 132937: {'lr': 1.6221629510396074e-05, 'samples': 25523904, 'steps': 132936, 'loss/train': 0.7418285608291626} 11/07/2021 15:55:33 - INFO - __main__ - Step 132938: {'lr': 1.6219749126791278e-05, 'samples': 25524096, 'steps': 132937, 'loss/train': 1.2025898694992065} 11/07/2021 15:55:33 - INFO - __main__ - Step 132939: {'lr': 1.621786884852425e-05, 'samples': 25524288, 'steps': 132938, 'loss/train': 1.274379849433899} 11/07/2021 15:55:34 - INFO - __main__ - Step 132940: {'lr': 1.6215988675595843e-05, 'samples': 25524480, 'steps': 132939, 'loss/train': 1.3598928451538086} 11/07/2021 15:55:34 - INFO - __main__ - Step 132941: {'lr': 1.62141086080069e-05, 'samples': 25524672, 'steps': 132940, 'loss/train': 0.9038366079330444} 11/07/2021 15:55:35 - INFO - __main__ - Step 132942: {'lr': 1.6212228645758302e-05, 'samples': 25524864, 'steps': 132941, 'loss/train': 1.2691081762313843} 11/07/2021 15:55:35 - INFO - __main__ - Step 132943: {'lr': 1.621034878885083e-05, 'samples': 25525056, 'steps': 132942, 'loss/train': 1.2948145866394043} 11/07/2021 15:55:35 - INFO - __main__ - Step 132944: {'lr': 1.6208469037285402e-05, 'samples': 25525248, 'steps': 132943, 'loss/train': 1.4555546045303345} 11/07/2021 15:55:36 - INFO - __main__ - Step 132945: {'lr': 1.6206589391062787e-05, 'samples': 25525440, 'steps': 132944, 'loss/train': 1.4918079376220703} 11/07/2021 15:55:37 - INFO - __main__ - Step 132946: {'lr': 1.620470985018391e-05, 'samples': 25525632, 'steps': 132945, 'loss/train': 1.0872963666915894} 11/07/2021 15:55:37 - INFO - __main__ - Step 132947: {'lr': 1.6202830414649623e-05, 'samples': 25525824, 'steps': 132946, 'loss/train': 1.4028455018997192} 11/07/2021 15:55:37 - INFO - __main__ - Step 132948: {'lr': 1.620095108446068e-05, 'samples': 25526016, 'steps': 132947, 'loss/train': 1.2909730672836304} 11/07/2021 15:55:38 - INFO - __main__ - Step 132949: {'lr': 1.619907185961797e-05, 'samples': 25526208, 'steps': 132948, 'loss/train': 1.33651602268219} 11/07/2021 15:55:39 - INFO - __main__ - Step 132950: {'lr': 1.6197192740122352e-05, 'samples': 25526400, 'steps': 132949, 'loss/train': 1.6566226482391357} 11/07/2021 15:55:39 - INFO - __main__ - Step 132951: {'lr': 1.6195313725974686e-05, 'samples': 25526592, 'steps': 132950, 'loss/train': 0.04404425248503685} 11/07/2021 15:55:40 - INFO - __main__ - Step 132952: {'lr': 1.6193434817175804e-05, 'samples': 25526784, 'steps': 132951, 'loss/train': 1.3542388677597046} 11/07/2021 15:55:40 - INFO - __main__ - Step 132953: {'lr': 1.6191556013726544e-05, 'samples': 25526976, 'steps': 132952, 'loss/train': 1.0475085973739624} 11/07/2021 15:55:40 - INFO - __main__ - Step 132954: {'lr': 1.618967731562779e-05, 'samples': 25527168, 'steps': 132953, 'loss/train': 1.0611495971679688} 11/07/2021 15:55:42 - INFO - __main__ - Step 132955: {'lr': 1.6187798722880315e-05, 'samples': 25527360, 'steps': 132954, 'loss/train': 0.10168693959712982} 11/07/2021 15:55:42 - INFO - __main__ - Step 132956: {'lr': 1.618592023548504e-05, 'samples': 25527552, 'steps': 132955, 'loss/train': 1.2885364294052124} 11/07/2021 15:55:42 - INFO - __main__ - Step 132957: {'lr': 1.6184041853442773e-05, 'samples': 25527744, 'steps': 132956, 'loss/train': 0.3489677309989929} 11/07/2021 15:55:43 - INFO - __main__ - Step 132958: {'lr': 1.6182163576754394e-05, 'samples': 25527936, 'steps': 132957, 'loss/train': 0.7540330290794373} 11/07/2021 15:55:43 - INFO - __main__ - Step 132959: {'lr': 1.6180285405420713e-05, 'samples': 25528128, 'steps': 132958, 'loss/train': 1.3720022439956665} 11/07/2021 15:55:44 - INFO - __main__ - Step 132960: {'lr': 1.6178407339442565e-05, 'samples': 25528320, 'steps': 132959, 'loss/train': 0.9495032429695129} 11/07/2021 15:55:45 - INFO - __main__ - Step 132961: {'lr': 1.617652937882083e-05, 'samples': 25528512, 'steps': 132960, 'loss/train': 0.8284100890159607} 11/07/2021 15:55:45 - INFO - __main__ - Step 132962: {'lr': 1.6174651523556322e-05, 'samples': 25528704, 'steps': 132961, 'loss/train': 1.1891216039657593} 11/07/2021 15:55:45 - INFO - __main__ - Step 132963: {'lr': 1.6172773773649924e-05, 'samples': 25528896, 'steps': 132962, 'loss/train': 1.3148893117904663} 11/07/2021 15:55:46 - INFO - __main__ - Step 132964: {'lr': 1.617089612910247e-05, 'samples': 25529088, 'steps': 132963, 'loss/train': 1.4044952392578125} 11/07/2021 15:55:46 - INFO - __main__ - Step 132965: {'lr': 1.616901858991479e-05, 'samples': 25529280, 'steps': 132964, 'loss/train': 1.6384670734405518} 11/07/2021 15:55:47 - INFO - __main__ - Step 132966: {'lr': 1.616714115608775e-05, 'samples': 25529472, 'steps': 132965, 'loss/train': 1.2003225088119507} 11/07/2021 15:55:47 - INFO - __main__ - Step 132967: {'lr': 1.6165263827622206e-05, 'samples': 25529664, 'steps': 132966, 'loss/train': 0.5807993412017822} 11/07/2021 15:55:48 - INFO - __main__ - Step 132968: {'lr': 1.6163386604518965e-05, 'samples': 25529856, 'steps': 132967, 'loss/train': 1.3054959774017334} 11/07/2021 15:55:48 - INFO - __main__ - Step 132969: {'lr': 1.6161509486778914e-05, 'samples': 25530048, 'steps': 132968, 'loss/train': 1.1870051622390747} 11/07/2021 15:55:48 - INFO - __main__ - Step 132970: {'lr': 1.6159632474402887e-05, 'samples': 25530240, 'steps': 132969, 'loss/train': 1.2259447574615479} 11/07/2021 15:55:49 - INFO - __main__ - Step 132971: {'lr': 1.6157755567391684e-05, 'samples': 25530432, 'steps': 132970, 'loss/train': 1.2250221967697144} 11/07/2021 15:55:50 - INFO - __main__ - Step 132972: {'lr': 1.6155878765746203e-05, 'samples': 25530624, 'steps': 132971, 'loss/train': 1.627832055091858} 11/07/2021 15:55:50 - INFO - __main__ - Step 132973: {'lr': 1.6154002069467266e-05, 'samples': 25530816, 'steps': 132972, 'loss/train': 1.3295302391052246} 11/07/2021 15:55:50 - INFO - __main__ - Step 132974: {'lr': 1.6152125478555742e-05, 'samples': 25531008, 'steps': 132973, 'loss/train': 1.371070384979248} 11/07/2021 15:55:51 - INFO - __main__ - Step 132975: {'lr': 1.615024899301243e-05, 'samples': 25531200, 'steps': 132974, 'loss/train': 1.404374361038208} 11/07/2021 15:55:52 - INFO - __main__ - Step 132976: {'lr': 1.6148372612838247e-05, 'samples': 25531392, 'steps': 132975, 'loss/train': 1.2271164655685425} 11/07/2021 15:55:52 - INFO - __main__ - Step 132977: {'lr': 1.6146496338033974e-05, 'samples': 25531584, 'steps': 132976, 'loss/train': 1.0433493852615356} 11/07/2021 15:55:53 - INFO - __main__ - Step 132978: {'lr': 1.6144620168600495e-05, 'samples': 25531776, 'steps': 132977, 'loss/train': 0.07133501768112183} 11/07/2021 15:55:53 - INFO - __main__ - Step 132979: {'lr': 1.6142744104538615e-05, 'samples': 25531968, 'steps': 132978, 'loss/train': 1.1611193418502808} 11/07/2021 15:55:53 - INFO - __main__ - Step 132980: {'lr': 1.614086814584928e-05, 'samples': 25532160, 'steps': 132979, 'loss/train': 0.6923078894615173} 11/07/2021 15:55:54 - INFO - __main__ - Step 132981: {'lr': 1.6138992292533183e-05, 'samples': 25532352, 'steps': 132980, 'loss/train': 1.3843879699707031} 11/07/2021 15:55:55 - INFO - __main__ - Step 132982: {'lr': 1.6137116544591267e-05, 'samples': 25532544, 'steps': 132981, 'loss/train': 1.1227163076400757} 11/07/2021 15:55:55 - INFO - __main__ - Step 132983: {'lr': 1.6135240902024366e-05, 'samples': 25532736, 'steps': 132982, 'loss/train': 1.225989580154419} 11/07/2021 15:55:55 - INFO - __main__ - Step 132984: {'lr': 1.6133365364833314e-05, 'samples': 25532928, 'steps': 132983, 'loss/train': 0.0375186987221241} 11/07/2021 15:55:56 - INFO - __main__ - Step 132985: {'lr': 1.6131489933018968e-05, 'samples': 25533120, 'steps': 132984, 'loss/train': 1.3989499807357788} 11/07/2021 15:55:56 - INFO - __main__ - Step 132986: {'lr': 1.612961460658216e-05, 'samples': 25533312, 'steps': 132985, 'loss/train': 0.7980923652648926} 11/07/2021 15:55:57 - INFO - __main__ - Step 132987: {'lr': 1.6127739385523727e-05, 'samples': 25533504, 'steps': 132986, 'loss/train': 1.0675805807113647} 11/07/2021 15:55:58 - INFO - __main__ - Step 132988: {'lr': 1.6125864269844527e-05, 'samples': 25533696, 'steps': 132987, 'loss/train': 1.3129078149795532} 11/07/2021 15:55:58 - INFO - __main__ - Step 132989: {'lr': 1.612398925954539e-05, 'samples': 25533888, 'steps': 132988, 'loss/train': 1.330824613571167} 11/07/2021 15:55:58 - INFO - __main__ - Step 132990: {'lr': 1.612211435462721e-05, 'samples': 25534080, 'steps': 132989, 'loss/train': 1.2031710147857666} 11/07/2021 15:55:59 - INFO - __main__ - Step 132991: {'lr': 1.6120239555090816e-05, 'samples': 25534272, 'steps': 132990, 'loss/train': 1.2849880456924438} 11/07/2021 15:56:00 - INFO - __main__ - Step 132992: {'lr': 1.6118364860936986e-05, 'samples': 25534464, 'steps': 132991, 'loss/train': 1.7130789756774902} 11/07/2021 15:56:00 - INFO - __main__ - Step 132993: {'lr': 1.6116490272166607e-05, 'samples': 25534656, 'steps': 132992, 'loss/train': 1.2543927431106567} 11/07/2021 15:56:01 - INFO - __main__ - Step 132994: {'lr': 1.6114615788780568e-05, 'samples': 25534848, 'steps': 132993, 'loss/train': 1.0648423433303833} 11/07/2021 15:56:01 - INFO - __main__ - Step 132995: {'lr': 1.6112741410779647e-05, 'samples': 25535040, 'steps': 132994, 'loss/train': 1.0183712244033813} 11/07/2021 15:56:01 - INFO - __main__ - Step 132996: {'lr': 1.6110867138164702e-05, 'samples': 25535232, 'steps': 132995, 'loss/train': 1.5342109203338623} 11/07/2021 15:56:02 - INFO - __main__ - Step 132997: {'lr': 1.6108992970936597e-05, 'samples': 25535424, 'steps': 132996, 'loss/train': 1.2707676887512207} 11/07/2021 15:56:03 - INFO - __main__ - Step 132998: {'lr': 1.6107118909096193e-05, 'samples': 25535616, 'steps': 132997, 'loss/train': 1.3304946422576904} 11/07/2021 15:56:03 - INFO - __main__ - Step 132999: {'lr': 1.6105244952644288e-05, 'samples': 25535808, 'steps': 132998, 'loss/train': 1.2132107019424438} 11/07/2021 15:56:03 - INFO - __main__ - Step 133000: {'lr': 1.6103371101581778e-05, 'samples': 25536000, 'steps': 132999, 'loss/train': 0.7823420166969299} 11/07/2021 15:56:04 - INFO - __main__ - Step 133001: {'lr': 1.610149735590949e-05, 'samples': 25536192, 'steps': 133000, 'loss/train': 1.0382874011993408} 11/07/2021 15:56:05 - INFO - __main__ - Step 133002: {'lr': 1.609962371562823e-05, 'samples': 25536384, 'steps': 133001, 'loss/train': 1.2912518978118896} 11/07/2021 15:56:05 - INFO - __main__ - Step 133003: {'lr': 1.6097750180738864e-05, 'samples': 25536576, 'steps': 133002, 'loss/train': 0.9849892258644104} 11/07/2021 15:56:05 - INFO - __main__ - Step 133004: {'lr': 1.6095876751242246e-05, 'samples': 25536768, 'steps': 133003, 'loss/train': 1.5516352653503418} 11/07/2021 15:56:06 - INFO - __main__ - Step 133005: {'lr': 1.6094003427139237e-05, 'samples': 25536960, 'steps': 133004, 'loss/train': 1.9998425245285034} 11/07/2021 15:56:06 - INFO - __main__ - Step 133006: {'lr': 1.6092130208430644e-05, 'samples': 25537152, 'steps': 133005, 'loss/train': 0.9937106370925903} 11/07/2021 15:56:07 - INFO - __main__ - Step 133007: {'lr': 1.6090257095117327e-05, 'samples': 25537344, 'steps': 133006, 'loss/train': 1.1321407556533813} 11/07/2021 15:56:07 - INFO - __main__ - Step 133008: {'lr': 1.608838408720012e-05, 'samples': 25537536, 'steps': 133007, 'loss/train': 1.5315505266189575} 11/07/2021 15:56:08 - INFO - __main__ - Step 133009: {'lr': 1.608651118467988e-05, 'samples': 25537728, 'steps': 133008, 'loss/train': 1.0800989866256714} 11/07/2021 15:56:08 - INFO - __main__ - Step 133010: {'lr': 1.608463838755747e-05, 'samples': 25537920, 'steps': 133009, 'loss/train': 1.0239863395690918} 11/07/2021 15:56:08 - INFO - __main__ - Step 133011: {'lr': 1.6082765695833696e-05, 'samples': 25538112, 'steps': 133010, 'loss/train': 1.524574637413025} 11/07/2021 15:56:10 - INFO - __main__ - Step 133012: {'lr': 1.6080893109509415e-05, 'samples': 25538304, 'steps': 133011, 'loss/train': 1.1090298891067505} 11/07/2021 15:56:10 - INFO - __main__ - Step 133013: {'lr': 1.607902062858549e-05, 'samples': 25538496, 'steps': 133012, 'loss/train': 0.9503096342086792} 11/07/2021 15:56:10 - INFO - __main__ - Step 133014: {'lr': 1.607714825306278e-05, 'samples': 25538688, 'steps': 133013, 'loss/train': 0.49818214774131775} 11/07/2021 15:56:11 - INFO - __main__ - Step 133015: {'lr': 1.607527598294206e-05, 'samples': 25538880, 'steps': 133014, 'loss/train': 1.17464017868042} 11/07/2021 15:56:11 - INFO - __main__ - Step 133016: {'lr': 1.6073403818224197e-05, 'samples': 25539072, 'steps': 133015, 'loss/train': 1.2001800537109375} 11/07/2021 15:56:12 - INFO - __main__ - Step 133017: {'lr': 1.6071531758910047e-05, 'samples': 25539264, 'steps': 133016, 'loss/train': 1.545039176940918} 11/07/2021 15:56:12 - INFO - __main__ - Step 133018: {'lr': 1.606965980500047e-05, 'samples': 25539456, 'steps': 133017, 'loss/train': 1.3927620649337769} 11/07/2021 15:56:13 - INFO - __main__ - Step 133019: {'lr': 1.60677879564963e-05, 'samples': 25539648, 'steps': 133018, 'loss/train': 1.2482792139053345} 11/07/2021 15:56:13 - INFO - __main__ - Step 133020: {'lr': 1.606591621339837e-05, 'samples': 25539840, 'steps': 133019, 'loss/train': 1.475862979888916} 11/07/2021 15:56:14 - INFO - __main__ - Step 133021: {'lr': 1.606404457570751e-05, 'samples': 25540032, 'steps': 133020, 'loss/train': 1.494140625} 11/07/2021 15:56:15 - INFO - __main__ - Step 133022: {'lr': 1.606217304342461e-05, 'samples': 25540224, 'steps': 133021, 'loss/train': 1.257987141609192} 11/07/2021 15:56:15 - INFO - __main__ - Step 133023: {'lr': 1.6060301616550477e-05, 'samples': 25540416, 'steps': 133022, 'loss/train': 1.15786612033844} 11/07/2021 15:56:15 - INFO - __main__ - Step 133024: {'lr': 1.605843029508594e-05, 'samples': 25540608, 'steps': 133023, 'loss/train': 0.992961049079895} 11/07/2021 15:56:16 - INFO - __main__ - Step 133025: {'lr': 1.605655907903189e-05, 'samples': 25540800, 'steps': 133024, 'loss/train': 1.4798016548156738} 11/07/2021 15:56:16 - INFO - __main__ - Step 133026: {'lr': 1.605468796838913e-05, 'samples': 25540992, 'steps': 133025, 'loss/train': 0.7953253388404846} 11/07/2021 15:56:17 - INFO - __main__ - Step 133027: {'lr': 1.6052816963158552e-05, 'samples': 25541184, 'steps': 133026, 'loss/train': 1.3085989952087402} 11/07/2021 15:56:18 - INFO - __main__ - Step 133028: {'lr': 1.6050946063340953e-05, 'samples': 25541376, 'steps': 133027, 'loss/train': 1.2392736673355103} 11/07/2021 15:56:18 - INFO - __main__ - Step 133029: {'lr': 1.6049075268937176e-05, 'samples': 25541568, 'steps': 133028, 'loss/train': 0.9246427416801453} 11/07/2021 15:56:18 - INFO - __main__ - Step 133030: {'lr': 1.6047204579948072e-05, 'samples': 25541760, 'steps': 133029, 'loss/train': 0.9026484489440918} 11/07/2021 15:56:19 - INFO - __main__ - Step 133031: {'lr': 1.604533399637448e-05, 'samples': 25541952, 'steps': 133030, 'loss/train': 1.022884488105774} 11/07/2021 15:56:19 - INFO - __main__ - Step 133032: {'lr': 1.6043463518217256e-05, 'samples': 25542144, 'steps': 133031, 'loss/train': 0.879848301410675} 11/07/2021 15:56:20 - INFO - __main__ - Step 133033: {'lr': 1.6041593145477234e-05, 'samples': 25542336, 'steps': 133032, 'loss/train': 1.3638662099838257} 11/07/2021 15:56:20 - INFO - __main__ - Step 133034: {'lr': 1.603972287815525e-05, 'samples': 25542528, 'steps': 133033, 'loss/train': 1.2470307350158691} 11/07/2021 15:56:21 - INFO - __main__ - Step 133035: {'lr': 1.6037852716252188e-05, 'samples': 25542720, 'steps': 133034, 'loss/train': 1.1109386682510376} 11/07/2021 15:56:21 - INFO - __main__ - Step 133036: {'lr': 1.6035982659768827e-05, 'samples': 25542912, 'steps': 133035, 'loss/train': 1.3206688165664673} 11/07/2021 15:56:21 - INFO - __main__ - Step 133037: {'lr': 1.6034112708706056e-05, 'samples': 25543104, 'steps': 133036, 'loss/train': 1.1812872886657715} 11/07/2021 15:56:22 - INFO - __main__ - Step 133038: {'lr': 1.6032242863064706e-05, 'samples': 25543296, 'steps': 133037, 'loss/train': 1.0985262393951416} 11/07/2021 15:56:23 - INFO - __main__ - Step 133039: {'lr': 1.603037312284561e-05, 'samples': 25543488, 'steps': 133038, 'loss/train': 1.2346971035003662} 11/07/2021 15:56:23 - INFO - __main__ - Step 133040: {'lr': 1.602850348804963e-05, 'samples': 25543680, 'steps': 133039, 'loss/train': 1.5293450355529785} 11/07/2021 15:56:23 - INFO - __main__ - Step 133041: {'lr': 1.6026633958677623e-05, 'samples': 25543872, 'steps': 133040, 'loss/train': 1.04268479347229} 11/07/2021 15:56:24 - INFO - __main__ - Step 133042: {'lr': 1.602476453473037e-05, 'samples': 25544064, 'steps': 133041, 'loss/train': 1.4030344486236572} 11/07/2021 15:56:25 - INFO - __main__ - Step 133043: {'lr': 1.6022895216208756e-05, 'samples': 25544256, 'steps': 133042, 'loss/train': 0.9352913498878479} 11/07/2021 15:56:25 - INFO - __main__ - Step 133044: {'lr': 1.6021026003113587e-05, 'samples': 25544448, 'steps': 133043, 'loss/train': 0.09959053248167038} 11/07/2021 15:56:26 - INFO - __main__ - Step 133045: {'lr': 1.6019156895445753e-05, 'samples': 25544640, 'steps': 133044, 'loss/train': 1.4252971410751343} 11/07/2021 15:56:26 - INFO - __main__ - Step 133046: {'lr': 1.6017287893206083e-05, 'samples': 25544832, 'steps': 133045, 'loss/train': 1.2510418891906738} 11/07/2021 15:56:26 - INFO - __main__ - Step 133047: {'lr': 1.6015418996395415e-05, 'samples': 25545024, 'steps': 133046, 'loss/train': 1.3762394189834595} 11/07/2021 15:56:27 - INFO - __main__ - Step 133048: {'lr': 1.6013550205014576e-05, 'samples': 25545216, 'steps': 133047, 'loss/train': 1.3769437074661255} 11/07/2021 15:56:28 - INFO - __main__ - Step 133049: {'lr': 1.60116815190644e-05, 'samples': 25545408, 'steps': 133048, 'loss/train': 1.2014485597610474} 11/07/2021 15:56:28 - INFO - __main__ - Step 133050: {'lr': 1.600981293854578e-05, 'samples': 25545600, 'steps': 133049, 'loss/train': 0.416734516620636} 11/07/2021 15:56:28 - INFO - __main__ - Step 133051: {'lr': 1.600794446345952e-05, 'samples': 25545792, 'steps': 133050, 'loss/train': 0.8163602948188782} 11/07/2021 15:56:29 - INFO - __main__ - Step 133052: {'lr': 1.600607609380647e-05, 'samples': 25545984, 'steps': 133051, 'loss/train': 1.195405125617981} 11/07/2021 15:56:30 - INFO - __main__ - Step 133053: {'lr': 1.6004207829587474e-05, 'samples': 25546176, 'steps': 133052, 'loss/train': 1.365769624710083} 11/07/2021 15:56:30 - INFO - __main__ - Step 133054: {'lr': 1.6002339670803417e-05, 'samples': 25546368, 'steps': 133053, 'loss/train': 1.4899145364761353} 11/07/2021 15:56:30 - INFO - __main__ - Step 133055: {'lr': 1.600047161745505e-05, 'samples': 25546560, 'steps': 133054, 'loss/train': 1.2657619714736938} 11/07/2021 15:56:31 - INFO - __main__ - Step 133056: {'lr': 1.5998603669543256e-05, 'samples': 25546752, 'steps': 133055, 'loss/train': 0.8641316294670105} 11/07/2021 15:56:31 - INFO - __main__ - Step 133057: {'lr': 1.599673582706887e-05, 'samples': 25546944, 'steps': 133056, 'loss/train': 1.1244678497314453} 11/07/2021 15:56:32 - INFO - __main__ - Step 133058: {'lr': 1.5994868090032756e-05, 'samples': 25547136, 'steps': 133057, 'loss/train': 1.3411222696304321} 11/07/2021 15:56:33 - INFO - __main__ - Step 133059: {'lr': 1.599300045843574e-05, 'samples': 25547328, 'steps': 133058, 'loss/train': 1.5082521438598633} 11/07/2021 15:56:33 - INFO - __main__ - Step 133060: {'lr': 1.5991132932278663e-05, 'samples': 25547520, 'steps': 133059, 'loss/train': 0.0496220700442791} 11/07/2021 15:56:33 - INFO - __main__ - Step 133061: {'lr': 1.5989265511562378e-05, 'samples': 25547712, 'steps': 133060, 'loss/train': 0.9019319415092468} 11/07/2021 15:56:34 - INFO - __main__ - Step 133062: {'lr': 1.5987398196287722e-05, 'samples': 25547904, 'steps': 133061, 'loss/train': 1.2251229286193848} 11/07/2021 15:56:35 - INFO - __main__ - Step 133063: {'lr': 1.5985530986455526e-05, 'samples': 25548096, 'steps': 133062, 'loss/train': 1.3901387453079224} 11/07/2021 15:56:35 - INFO - __main__ - Step 133064: {'lr': 1.598366388206665e-05, 'samples': 25548288, 'steps': 133063, 'loss/train': 1.1042718887329102} 11/07/2021 15:56:35 - INFO - __main__ - Step 133065: {'lr': 1.5981796883121903e-05, 'samples': 25548480, 'steps': 133064, 'loss/train': 0.8397325277328491} 11/07/2021 15:56:36 - INFO - __main__ - Step 133066: {'lr': 1.5979929989622167e-05, 'samples': 25548672, 'steps': 133065, 'loss/train': 1.5048506259918213} 11/07/2021 15:56:36 - INFO - __main__ - Step 133067: {'lr': 1.597806320156825e-05, 'samples': 25548864, 'steps': 133066, 'loss/train': 1.4005980491638184} 11/07/2021 15:56:36 - INFO - __main__ - Step 133068: {'lr': 1.5976196518961038e-05, 'samples': 25549056, 'steps': 133067, 'loss/train': 1.0270363092422485} 11/07/2021 15:56:38 - INFO - __main__ - Step 133069: {'lr': 1.5974329941801314e-05, 'samples': 25549248, 'steps': 133068, 'loss/train': 1.2669931650161743} 11/07/2021 15:56:38 - INFO - __main__ - Step 133070: {'lr': 1.597246347008996e-05, 'samples': 25549440, 'steps': 133069, 'loss/train': 1.0808417797088623} 11/07/2021 15:56:38 - INFO - __main__ - Step 133071: {'lr': 1.5970597103827782e-05, 'samples': 25549632, 'steps': 133070, 'loss/train': 0.9019258618354797} 11/07/2021 15:56:39 - INFO - __main__ - Step 133072: {'lr': 1.5968730843015643e-05, 'samples': 25549824, 'steps': 133071, 'loss/train': 1.3257288932800293} 11/07/2021 15:56:39 - INFO - __main__ - Step 133073: {'lr': 1.5966864687654404e-05, 'samples': 25550016, 'steps': 133072, 'loss/train': 0.6872527003288269} 11/07/2021 15:56:40 - INFO - __main__ - Step 133074: {'lr': 1.5964998637744867e-05, 'samples': 25550208, 'steps': 133073, 'loss/train': 2.2180328369140625} 11/07/2021 15:56:40 - INFO - __main__ - Step 133075: {'lr': 1.596313269328789e-05, 'samples': 25550400, 'steps': 133074, 'loss/train': 1.4509285688400269} 11/07/2021 15:56:41 - INFO - __main__ - Step 133076: {'lr': 1.596126685428431e-05, 'samples': 25550592, 'steps': 133075, 'loss/train': 1.5596959590911865} 11/07/2021 15:56:41 - INFO - __main__ - Step 133077: {'lr': 1.595940112073499e-05, 'samples': 25550784, 'steps': 133076, 'loss/train': 1.6082814931869507} 11/07/2021 15:56:41 - INFO - __main__ - Step 133078: {'lr': 1.595753549264073e-05, 'samples': 25550976, 'steps': 133077, 'loss/train': 1.075413465499878} 11/07/2021 15:56:42 - INFO - __main__ - Step 133079: {'lr': 1.5955669970002418e-05, 'samples': 25551168, 'steps': 133078, 'loss/train': 1.1443299055099487} 11/07/2021 15:56:43 - INFO - __main__ - Step 133080: {'lr': 1.5953804552820834e-05, 'samples': 25551360, 'steps': 133079, 'loss/train': 1.1681772470474243} 11/07/2021 15:56:43 - INFO - __main__ - Step 133081: {'lr': 1.595193924109692e-05, 'samples': 25551552, 'steps': 133080, 'loss/train': 1.0056456327438354} 11/07/2021 15:56:43 - INFO - __main__ - Step 133082: {'lr': 1.5950074034831398e-05, 'samples': 25551744, 'steps': 133081, 'loss/train': 1.7217867374420166} 11/07/2021 15:56:44 - INFO - __main__ - Step 133083: {'lr': 1.5948208934025182e-05, 'samples': 25551936, 'steps': 133082, 'loss/train': 1.5222162008285522} 11/07/2021 15:56:45 - INFO - __main__ - Step 133084: {'lr': 1.594634393867908e-05, 'samples': 25552128, 'steps': 133083, 'loss/train': 1.5138286352157593} 11/07/2021 15:56:45 - INFO - __main__ - Step 133085: {'lr': 1.5944479048793925e-05, 'samples': 25552320, 'steps': 133084, 'loss/train': 1.311475157737732} 11/07/2021 15:56:46 - INFO - __main__ - Step 133086: {'lr': 1.5942614264370605e-05, 'samples': 25552512, 'steps': 133085, 'loss/train': 1.463428020477295} 11/07/2021 15:56:46 - INFO - __main__ - Step 133087: {'lr': 1.594074958540992e-05, 'samples': 25552704, 'steps': 133086, 'loss/train': 1.2313079833984375} 11/07/2021 15:56:46 - INFO - __main__ - Step 133088: {'lr': 1.593888501191271e-05, 'samples': 25552896, 'steps': 133087, 'loss/train': 1.327865719795227} 11/07/2021 15:56:47 - INFO - __main__ - Step 133089: {'lr': 1.5937020543879853e-05, 'samples': 25553088, 'steps': 133088, 'loss/train': 1.1624006032943726} 11/07/2021 15:56:48 - INFO - __main__ - Step 133090: {'lr': 1.5935156181312138e-05, 'samples': 25553280, 'steps': 133089, 'loss/train': 1.1182971000671387} 11/07/2021 15:56:48 - INFO - __main__ - Step 133091: {'lr': 1.5933291924210447e-05, 'samples': 25553472, 'steps': 133090, 'loss/train': 1.2503890991210938} 11/07/2021 15:56:49 - INFO - __main__ - Step 133092: {'lr': 1.5931427772575585e-05, 'samples': 25553664, 'steps': 133091, 'loss/train': 1.0744770765304565} 11/07/2021 15:56:49 - INFO - __main__ - Step 133093: {'lr': 1.5929563726408415e-05, 'samples': 25553856, 'steps': 133092, 'loss/train': 0.9904694557189941} 11/07/2021 15:56:50 - INFO - __main__ - Step 133094: {'lr': 1.5927699785709792e-05, 'samples': 25554048, 'steps': 133093, 'loss/train': 0.9762406349182129} 11/07/2021 15:56:50 - INFO - __main__ - Step 133095: {'lr': 1.5925835950480554e-05, 'samples': 25554240, 'steps': 133094, 'loss/train': 1.2967180013656616} 11/07/2021 15:56:51 - INFO - __main__ - Step 133096: {'lr': 1.5923972220721477e-05, 'samples': 25554432, 'steps': 133095, 'loss/train': 1.3252537250518799} 11/07/2021 15:56:51 - INFO - __main__ - Step 133097: {'lr': 1.592210859643345e-05, 'samples': 25554624, 'steps': 133096, 'loss/train': 1.0457310676574707} 11/07/2021 15:56:51 - INFO - __main__ - Step 133098: {'lr': 1.592024507761733e-05, 'samples': 25554816, 'steps': 133097, 'loss/train': 0.9320858120918274} 11/07/2021 15:56:52 - INFO - __main__ - Step 133099: {'lr': 1.5918381664273925e-05, 'samples': 25555008, 'steps': 133098, 'loss/train': 0.7160381078720093} 11/07/2021 15:56:53 - INFO - __main__ - Step 133100: {'lr': 1.5916518356404063e-05, 'samples': 25555200, 'steps': 133099, 'loss/train': 0.9702885150909424} 11/07/2021 15:56:53 - INFO - __main__ - Step 133101: {'lr': 1.591465515400864e-05, 'samples': 25555392, 'steps': 133100, 'loss/train': 1.4165140390396118} 11/07/2021 15:56:53 - INFO - __main__ - Step 133102: {'lr': 1.5912792057088455e-05, 'samples': 25555584, 'steps': 133101, 'loss/train': 1.1240742206573486} 11/07/2021 15:56:54 - INFO - __main__ - Step 133103: {'lr': 1.591092906564434e-05, 'samples': 25555776, 'steps': 133102, 'loss/train': 1.5508270263671875} 11/07/2021 15:56:55 - INFO - __main__ - Step 133104: {'lr': 1.5909066179677135e-05, 'samples': 25555968, 'steps': 133103, 'loss/train': 0.8997632265090942} 11/07/2021 15:56:55 - INFO - __main__ - Step 133105: {'lr': 1.590720339918772e-05, 'samples': 25556160, 'steps': 133104, 'loss/train': 1.262403130531311} 11/07/2021 15:56:55 - INFO - __main__ - Step 133106: {'lr': 1.5905340724176904e-05, 'samples': 25556352, 'steps': 133105, 'loss/train': 1.2758944034576416} 11/07/2021 15:56:56 - INFO - __main__ - Step 133107: {'lr': 1.5903478154645517e-05, 'samples': 25556544, 'steps': 133106, 'loss/train': 1.2922754287719727} 11/07/2021 15:56:56 - INFO - __main__ - Step 133108: {'lr': 1.590161569059445e-05, 'samples': 25556736, 'steps': 133107, 'loss/train': 1.4385063648223877} 11/07/2021 15:56:57 - INFO - __main__ - Step 133109: {'lr': 1.589975333202445e-05, 'samples': 25556928, 'steps': 133108, 'loss/train': 0.7749698162078857} 11/07/2021 15:56:58 - INFO - __main__ - Step 133110: {'lr': 1.5897891078936438e-05, 'samples': 25557120, 'steps': 133109, 'loss/train': 1.1629588603973389} 11/07/2021 15:56:58 - INFO - __main__ - Step 133111: {'lr': 1.5896028931331213e-05, 'samples': 25557312, 'steps': 133110, 'loss/train': 0.6893428564071655} 11/07/2021 15:56:58 - INFO - __main__ - Step 133112: {'lr': 1.589416688920964e-05, 'samples': 25557504, 'steps': 133111, 'loss/train': 0.9097509980201721} 11/07/2021 15:56:59 - INFO - __main__ - Step 133113: {'lr': 1.589230495257252e-05, 'samples': 25557696, 'steps': 133112, 'loss/train': 0.6955121755599976} 11/07/2021 15:57:00 - INFO - __main__ - Step 133114: {'lr': 1.589044312142071e-05, 'samples': 25557888, 'steps': 133113, 'loss/train': 1.0376428365707397} 11/07/2021 15:57:00 - INFO - __main__ - Step 133115: {'lr': 1.588858139575508e-05, 'samples': 25558080, 'steps': 133114, 'loss/train': 1.270887851715088} 11/07/2021 15:57:00 - INFO - __main__ - Step 133116: {'lr': 1.588671977557643e-05, 'samples': 25558272, 'steps': 133115, 'loss/train': 1.5385900735855103} 11/07/2021 15:57:01 - INFO - __main__ - Step 133117: {'lr': 1.588485826088559e-05, 'samples': 25558464, 'steps': 133116, 'loss/train': 1.3431998491287231} 11/07/2021 15:57:01 - INFO - __main__ - Step 133118: {'lr': 1.5882996851683456e-05, 'samples': 25558656, 'steps': 133117, 'loss/train': 1.065360426902771} 11/07/2021 15:57:02 - INFO - __main__ - Step 133119: {'lr': 1.58811355479708e-05, 'samples': 25558848, 'steps': 133118, 'loss/train': 1.3284752368927002} 11/07/2021 15:57:02 - INFO - __main__ - Step 133120: {'lr': 1.5879274349748508e-05, 'samples': 25559040, 'steps': 133119, 'loss/train': 0.7466753721237183} 11/07/2021 15:57:03 - INFO - __main__ - Step 133121: {'lr': 1.5877413257017413e-05, 'samples': 25559232, 'steps': 133120, 'loss/train': 1.4428492784500122} 11/07/2021 15:57:03 - INFO - __main__ - Step 133122: {'lr': 1.5875552269778353e-05, 'samples': 25559424, 'steps': 133121, 'loss/train': 1.0213080644607544} 11/07/2021 15:57:03 - INFO - __main__ - Step 133123: {'lr': 1.5873691388032158e-05, 'samples': 25559616, 'steps': 133122, 'loss/train': 2.1018295288085938} 11/07/2021 15:57:04 - INFO - __main__ - Step 133124: {'lr': 1.587183061177963e-05, 'samples': 25559808, 'steps': 133123, 'loss/train': 1.0058958530426025} 11/07/2021 15:57:05 - INFO - __main__ - Step 133125: {'lr': 1.5869969941021662e-05, 'samples': 25560000, 'steps': 133124, 'loss/train': 0.4872019588947296} 11/07/2021 15:57:05 - INFO - __main__ - Step 133126: {'lr': 1.5868109375759055e-05, 'samples': 25560192, 'steps': 133125, 'loss/train': 1.5082098245620728} 11/07/2021 15:57:05 - INFO - __main__ - Step 133127: {'lr': 1.586624891599267e-05, 'samples': 25560384, 'steps': 133126, 'loss/train': 1.030882716178894} 11/07/2021 15:57:06 - INFO - __main__ - Step 133128: {'lr': 1.586438856172334e-05, 'samples': 25560576, 'steps': 133127, 'loss/train': 1.381474494934082} 11/07/2021 15:57:06 - INFO - __main__ - Step 133129: {'lr': 1.586252831295193e-05, 'samples': 25560768, 'steps': 133128, 'loss/train': 1.4210737943649292} 11/07/2021 15:57:07 - INFO - __main__ - Step 133130: {'lr': 1.586066816967921e-05, 'samples': 25560960, 'steps': 133129, 'loss/train': 1.5890179872512817} 11/07/2021 15:57:08 - INFO - __main__ - Step 133131: {'lr': 1.58588081319061e-05, 'samples': 25561152, 'steps': 133130, 'loss/train': 1.013222098350525} 11/07/2021 15:57:08 - INFO - __main__ - Step 133132: {'lr': 1.5856948199633375e-05, 'samples': 25561344, 'steps': 133131, 'loss/train': 0.05110315978527069} 11/07/2021 15:57:08 - INFO - __main__ - Step 133133: {'lr': 1.5855088372861898e-05, 'samples': 25561536, 'steps': 133132, 'loss/train': 1.251723289489746} 11/07/2021 15:57:09 - INFO - __main__ - Step 133134: {'lr': 1.5853228651592498e-05, 'samples': 25561728, 'steps': 133133, 'loss/train': 1.127142310142517} 11/07/2021 15:57:10 - INFO - __main__ - Step 133135: {'lr': 1.585136903582607e-05, 'samples': 25561920, 'steps': 133134, 'loss/train': 1.199053168296814} 11/07/2021 15:57:10 - INFO - __main__ - Step 133136: {'lr': 1.5849509525563353e-05, 'samples': 25562112, 'steps': 133135, 'loss/train': 1.1404308080673218} 11/07/2021 15:57:11 - INFO - __main__ - Step 133137: {'lr': 1.5847650120805246e-05, 'samples': 25562304, 'steps': 133136, 'loss/train': 1.4570517539978027} 11/07/2021 15:57:11 - INFO - __main__ - Step 133138: {'lr': 1.5845790821552576e-05, 'samples': 25562496, 'steps': 133137, 'loss/train': 1.2349134683609009} 11/07/2021 15:57:11 - INFO - __main__ - Step 133139: {'lr': 1.5843931627806174e-05, 'samples': 25562688, 'steps': 133138, 'loss/train': 1.4548300504684448} 11/07/2021 15:57:12 - INFO - __main__ - Step 133140: {'lr': 1.5842072539566878e-05, 'samples': 25562880, 'steps': 133139, 'loss/train': 1.5016255378723145} 11/07/2021 15:57:13 - INFO - __main__ - Step 133141: {'lr': 1.5840213556835543e-05, 'samples': 25563072, 'steps': 133140, 'loss/train': 0.9451732039451599} 11/07/2021 15:57:13 - INFO - __main__ - Step 133142: {'lr': 1.5838354679612977e-05, 'samples': 25563264, 'steps': 133141, 'loss/train': 1.5951330661773682} 11/07/2021 15:57:13 - INFO - __main__ - Step 133143: {'lr': 1.583649590790004e-05, 'samples': 25563456, 'steps': 133142, 'loss/train': 0.9733403325080872} 11/07/2021 15:57:14 - INFO - __main__ - Step 133144: {'lr': 1.5834637241697565e-05, 'samples': 25563648, 'steps': 133143, 'loss/train': 1.622950792312622} 11/07/2021 15:57:15 - INFO - __main__ - Step 133145: {'lr': 1.583277868100641e-05, 'samples': 25563840, 'steps': 133144, 'loss/train': 1.4599158763885498} 11/07/2021 15:57:15 - INFO - __main__ - Step 133146: {'lr': 1.5830920225827355e-05, 'samples': 25564032, 'steps': 133145, 'loss/train': 1.348791241645813} 11/07/2021 15:57:15 - INFO - __main__ - Step 133147: {'lr': 1.5829061876161317e-05, 'samples': 25564224, 'steps': 133146, 'loss/train': 1.7429015636444092} 11/07/2021 15:57:16 - INFO - __main__ - Step 133148: {'lr': 1.5827203632009095e-05, 'samples': 25564416, 'steps': 133147, 'loss/train': 1.0843182802200317} 11/07/2021 15:57:16 - INFO - __main__ - Step 133149: {'lr': 1.58253454933715e-05, 'samples': 25564608, 'steps': 133148, 'loss/train': 2.4710605144500732} 11/07/2021 15:57:17 - INFO - __main__ - Step 133150: {'lr': 1.5823487460249365e-05, 'samples': 25564800, 'steps': 133149, 'loss/train': 1.7819428443908691} 11/07/2021 15:57:18 - INFO - __main__ - Step 133151: {'lr': 1.582162953264357e-05, 'samples': 25564992, 'steps': 133150, 'loss/train': 1.4742324352264404} 11/07/2021 15:57:18 - INFO - __main__ - Step 133152: {'lr': 1.5819771710554958e-05, 'samples': 25565184, 'steps': 133151, 'loss/train': 1.227167010307312} 11/07/2021 15:57:18 - INFO - __main__ - Step 133153: {'lr': 1.5817913993984302e-05, 'samples': 25565376, 'steps': 133152, 'loss/train': 1.249245285987854} 11/07/2021 15:57:19 - INFO - __main__ - Step 133154: {'lr': 1.5816056382932515e-05, 'samples': 25565568, 'steps': 133153, 'loss/train': 1.4842956066131592} 11/07/2021 15:57:19 - INFO - __main__ - Step 133155: {'lr': 1.5814198877400376e-05, 'samples': 25565760, 'steps': 133154, 'loss/train': 1.2430484294891357} 11/07/2021 15:57:20 - INFO - __main__ - Step 133156: {'lr': 1.5812341477388748e-05, 'samples': 25565952, 'steps': 133155, 'loss/train': 1.2962570190429688} 11/07/2021 15:57:20 - INFO - __main__ - Step 133157: {'lr': 1.581048418289849e-05, 'samples': 25566144, 'steps': 133156, 'loss/train': 1.414963960647583} 11/07/2021 15:57:21 - INFO - __main__ - Step 133158: {'lr': 1.5808626993930403e-05, 'samples': 25566336, 'steps': 133157, 'loss/train': 0.9687652587890625} 11/07/2021 15:57:21 - INFO - __main__ - Step 133159: {'lr': 1.5806769910485325e-05, 'samples': 25566528, 'steps': 133158, 'loss/train': 1.1971356868743896} 11/07/2021 15:57:21 - INFO - __main__ - Step 133160: {'lr': 1.5804912932564086e-05, 'samples': 25566720, 'steps': 133159, 'loss/train': 1.2225459814071655} 11/07/2021 15:57:23 - INFO - __main__ - Step 133161: {'lr': 1.5803056060167577e-05, 'samples': 25566912, 'steps': 133160, 'loss/train': 1.2648566961288452} 11/07/2021 15:57:23 - INFO - __main__ - Step 133162: {'lr': 1.5801199293296597e-05, 'samples': 25567104, 'steps': 133161, 'loss/train': 1.2542427778244019} 11/07/2021 15:57:23 - INFO - __main__ - Step 133163: {'lr': 1.5799342631951984e-05, 'samples': 25567296, 'steps': 133162, 'loss/train': 1.4706038236618042} 11/07/2021 15:57:24 - INFO - __main__ - Step 133164: {'lr': 1.5797486076134544e-05, 'samples': 25567488, 'steps': 133163, 'loss/train': 1.4481074810028076} 11/07/2021 15:57:24 - INFO - __main__ - Step 133165: {'lr': 1.5795629625845158e-05, 'samples': 25567680, 'steps': 133164, 'loss/train': 1.4592878818511963} 11/07/2021 15:57:25 - INFO - __main__ - Step 133166: {'lr': 1.579377328108464e-05, 'samples': 25567872, 'steps': 133165, 'loss/train': 1.5551633834838867} 11/07/2021 15:57:25 - INFO - __main__ - Step 133167: {'lr': 1.5791917041853843e-05, 'samples': 25568064, 'steps': 133166, 'loss/train': 1.2585879564285278} 11/07/2021 15:57:26 - INFO - __main__ - Step 133168: {'lr': 1.5790060908153575e-05, 'samples': 25568256, 'steps': 133167, 'loss/train': 1.255296230316162} 11/07/2021 15:57:26 - INFO - __main__ - Step 133169: {'lr': 1.5788204879984697e-05, 'samples': 25568448, 'steps': 133168, 'loss/train': 1.609779953956604} 11/07/2021 15:57:26 - INFO - __main__ - Step 133170: {'lr': 1.5786348957348068e-05, 'samples': 25568640, 'steps': 133169, 'loss/train': 1.0115764141082764} 11/07/2021 15:57:27 - INFO - __main__ - Step 133171: {'lr': 1.5784493140244467e-05, 'samples': 25568832, 'steps': 133170, 'loss/train': 1.2008533477783203} 11/07/2021 15:57:28 - INFO - __main__ - Step 133172: {'lr': 1.578263742867475e-05, 'samples': 25569024, 'steps': 133171, 'loss/train': 1.4078960418701172} 11/07/2021 15:57:28 - INFO - __main__ - Step 133173: {'lr': 1.5780781822639785e-05, 'samples': 25569216, 'steps': 133172, 'loss/train': 1.4213231801986694} 11/07/2021 15:57:28 - INFO - __main__ - Step 133174: {'lr': 1.577892632214037e-05, 'samples': 25569408, 'steps': 133173, 'loss/train': 1.2161924839019775} 11/07/2021 15:57:29 - INFO - __main__ - Step 133175: {'lr': 1.5777070927177422e-05, 'samples': 25569600, 'steps': 133174, 'loss/train': 0.7668255567550659} 11/07/2021 15:57:29 - INFO - __main__ - Step 133176: {'lr': 1.577521563775164e-05, 'samples': 25569792, 'steps': 133175, 'loss/train': 1.1189378499984741} 11/07/2021 15:57:30 - INFO - __main__ - Step 133177: {'lr': 1.577336045386396e-05, 'samples': 25569984, 'steps': 133176, 'loss/train': 0.963010847568512} 11/07/2021 15:57:31 - INFO - __main__ - Step 133178: {'lr': 1.5771505375515167e-05, 'samples': 25570176, 'steps': 133177, 'loss/train': 1.5142403841018677} 11/07/2021 15:57:31 - INFO - __main__ - Step 133179: {'lr': 1.5769650402706144e-05, 'samples': 25570368, 'steps': 133178, 'loss/train': 0.9410682916641235} 11/07/2021 15:57:31 - INFO - __main__ - Step 133180: {'lr': 1.576779553543767e-05, 'samples': 25570560, 'steps': 133179, 'loss/train': 1.0481185913085938} 11/07/2021 15:57:32 - INFO - __main__ - Step 133181: {'lr': 1.5765940773710628e-05, 'samples': 25570752, 'steps': 133180, 'loss/train': 0.9708167910575867} 11/07/2021 15:57:33 - INFO - __main__ - Step 133182: {'lr': 1.576408611752586e-05, 'samples': 25570944, 'steps': 133181, 'loss/train': 1.2322359085083008} 11/07/2021 15:57:33 - INFO - __main__ - Step 133183: {'lr': 1.576223156688414e-05, 'samples': 25571136, 'steps': 133182, 'loss/train': 1.5104323625564575} 11/07/2021 15:57:33 - INFO - __main__ - Step 133184: {'lr': 1.576037712178638e-05, 'samples': 25571328, 'steps': 133183, 'loss/train': 1.3389086723327637} 11/07/2021 15:57:34 - INFO - __main__ - Step 133185: {'lr': 1.575852278223336e-05, 'samples': 25571520, 'steps': 133184, 'loss/train': 1.1989880800247192} 11/07/2021 15:57:34 - INFO - __main__ - Step 133186: {'lr': 1.5756668548225938e-05, 'samples': 25571712, 'steps': 133185, 'loss/train': 0.7245270609855652} 11/07/2021 15:57:35 - INFO - __main__ - Step 133187: {'lr': 1.575481441976495e-05, 'samples': 25571904, 'steps': 133186, 'loss/train': 1.414485216140747} 11/07/2021 15:57:36 - INFO - __main__ - Step 133188: {'lr': 1.575296039685123e-05, 'samples': 25572096, 'steps': 133187, 'loss/train': 1.4435780048370361} 11/07/2021 15:57:36 - INFO - __main__ - Step 133189: {'lr': 1.575110647948563e-05, 'samples': 25572288, 'steps': 133188, 'loss/train': 1.5755926370620728} 11/07/2021 15:57:37 - INFO - __main__ - Step 133190: {'lr': 1.574925266766894e-05, 'samples': 25572480, 'steps': 133189, 'loss/train': 1.7420059442520142} 11/07/2021 15:57:37 - INFO - __main__ - Step 133191: {'lr': 1.574739896140204e-05, 'samples': 25572672, 'steps': 133190, 'loss/train': 1.724274754524231} 11/07/2021 15:57:37 - INFO - __main__ - Step 133192: {'lr': 1.574554536068573e-05, 'samples': 25572864, 'steps': 133191, 'loss/train': 1.1059376001358032} 11/07/2021 15:57:38 - INFO - __main__ - Step 133193: {'lr': 1.5743691865520853e-05, 'samples': 25573056, 'steps': 133192, 'loss/train': 0.9810830354690552} 11/07/2021 15:57:39 - INFO - __main__ - Step 133194: {'lr': 1.5741838475908266e-05, 'samples': 25573248, 'steps': 133193, 'loss/train': 1.5504817962646484} 11/07/2021 15:57:39 - INFO - __main__ - Step 133195: {'lr': 1.57399851918488e-05, 'samples': 25573440, 'steps': 133194, 'loss/train': 1.261936902999878} 11/07/2021 15:57:39 - INFO - __main__ - Step 133196: {'lr': 1.5738132013343288e-05, 'samples': 25573632, 'steps': 133195, 'loss/train': 1.078011155128479} 11/07/2021 15:57:40 - INFO - __main__ - Step 133197: {'lr': 1.5736278940392533e-05, 'samples': 25573824, 'steps': 133196, 'loss/train': 1.1711077690124512} 11/07/2021 15:57:40 - INFO - __main__ - Step 133198: {'lr': 1.57344259729974e-05, 'samples': 25574016, 'steps': 133197, 'loss/train': 1.603498935699463} 11/07/2021 15:57:41 - INFO - __main__ - Step 133199: {'lr': 1.573257311115872e-05, 'samples': 25574208, 'steps': 133198, 'loss/train': 1.4125354290008545} 11/07/2021 15:57:41 - INFO - __main__ - Step 133200: {'lr': 1.5730720354877355e-05, 'samples': 25574400, 'steps': 133199, 'loss/train': 1.0495502948760986} 11/07/2021 15:57:42 - INFO - __main__ - Step 133201: {'lr': 1.5728867704154075e-05, 'samples': 25574592, 'steps': 133200, 'loss/train': 1.0090161561965942} 11/07/2021 15:57:42 - INFO - __main__ - Step 133202: {'lr': 1.5727015158989834e-05, 'samples': 25574784, 'steps': 133201, 'loss/train': 1.3456752300262451} 11/07/2021 15:57:42 - INFO - __main__ - Step 133203: {'lr': 1.5725162719385315e-05, 'samples': 25574976, 'steps': 133202, 'loss/train': 0.9863357543945312} 11/07/2021 15:57:44 - INFO - __main__ - Step 133204: {'lr': 1.5723310385341417e-05, 'samples': 25575168, 'steps': 133203, 'loss/train': 1.5115361213684082} 11/07/2021 15:57:44 - INFO - __main__ - Step 133205: {'lr': 1.5721458156858993e-05, 'samples': 25575360, 'steps': 133204, 'loss/train': 1.2962899208068848} 11/07/2021 15:57:44 - INFO - __main__ - Step 133206: {'lr': 1.571960603393885e-05, 'samples': 25575552, 'steps': 133205, 'loss/train': 1.2761640548706055} 11/07/2021 15:57:45 - INFO - __main__ - Step 133207: {'lr': 1.5717754016581874e-05, 'samples': 25575744, 'steps': 133206, 'loss/train': 0.8622039556503296} 11/07/2021 15:57:45 - INFO - __main__ - Step 133208: {'lr': 1.571590210478882e-05, 'samples': 25575936, 'steps': 133207, 'loss/train': 1.3529384136199951} 11/07/2021 15:57:46 - INFO - __main__ - Step 133209: {'lr': 1.57140502985606e-05, 'samples': 25576128, 'steps': 133208, 'loss/train': 1.2344673871994019} 11/07/2021 15:57:46 - INFO - __main__ - Step 133210: {'lr': 1.5712198597897992e-05, 'samples': 25576320, 'steps': 133209, 'loss/train': 1.2934980392456055} 11/07/2021 15:57:47 - INFO - __main__ - Step 133211: {'lr': 1.5710347002801856e-05, 'samples': 25576512, 'steps': 133210, 'loss/train': 1.4641833305358887} 11/07/2021 15:57:47 - INFO - __main__ - Step 133212: {'lr': 1.5708495513273025e-05, 'samples': 25576704, 'steps': 133211, 'loss/train': 0.865738034248352} 11/07/2021 15:57:47 - INFO - __main__ - Step 133213: {'lr': 1.5706644129312332e-05, 'samples': 25576896, 'steps': 133212, 'loss/train': 0.037603847682476044} 11/07/2021 15:57:48 - INFO - __main__ - Step 133214: {'lr': 1.570479285092061e-05, 'samples': 25577088, 'steps': 133213, 'loss/train': 0.947493314743042} 11/07/2021 15:57:49 - INFO - __main__ - Step 133215: {'lr': 1.5702941678098688e-05, 'samples': 25577280, 'steps': 133214, 'loss/train': 0.9727096557617188} 11/07/2021 15:57:49 - INFO - __main__ - Step 133216: {'lr': 1.5701090610847456e-05, 'samples': 25577472, 'steps': 133215, 'loss/train': 1.2777491807937622} 11/07/2021 15:57:50 - INFO - __main__ - Step 133217: {'lr': 1.569923964916764e-05, 'samples': 25577664, 'steps': 133216, 'loss/train': 0.9706855416297913} 11/07/2021 15:57:50 - INFO - __main__ - Step 133218: {'lr': 1.5697388793060123e-05, 'samples': 25577856, 'steps': 133217, 'loss/train': 1.3836232423782349} 11/07/2021 15:57:51 - INFO - __main__ - Step 133219: {'lr': 1.569553804252577e-05, 'samples': 25578048, 'steps': 133218, 'loss/train': 0.061754558235406876} 11/07/2021 15:57:51 - INFO - __main__ - Step 133220: {'lr': 1.5693687397565383e-05, 'samples': 25578240, 'steps': 133219, 'loss/train': 0.7123624682426453} 11/07/2021 15:57:52 - INFO - __main__ - Step 133221: {'lr': 1.569183685817982e-05, 'samples': 25578432, 'steps': 133220, 'loss/train': 1.2998173236846924} 11/07/2021 15:57:52 - INFO - __main__ - Step 133222: {'lr': 1.568998642436989e-05, 'samples': 25578624, 'steps': 133221, 'loss/train': 1.3963268995285034} 11/07/2021 15:57:53 - INFO - __main__ - Step 133223: {'lr': 1.5688136096136425e-05, 'samples': 25578816, 'steps': 133222, 'loss/train': 1.3496954441070557} 11/07/2021 15:57:53 - INFO - __main__ - Step 133224: {'lr': 1.5686285873480282e-05, 'samples': 25579008, 'steps': 133223, 'loss/train': 1.5392636060714722} 11/07/2021 15:57:54 - INFO - __main__ - Step 133225: {'lr': 1.5684435756402273e-05, 'samples': 25579200, 'steps': 133224, 'loss/train': 1.2650830745697021} 11/07/2021 15:57:54 - INFO - __main__ - Step 133226: {'lr': 1.568258574490325e-05, 'samples': 25579392, 'steps': 133225, 'loss/train': 1.2534939050674438} 11/07/2021 15:57:55 - INFO - __main__ - Step 133227: {'lr': 1.5680735838984077e-05, 'samples': 25579584, 'steps': 133226, 'loss/train': 0.9455736875534058} 11/07/2021 15:57:55 - INFO - __main__ - Step 133228: {'lr': 1.5678886038645507e-05, 'samples': 25579776, 'steps': 133227, 'loss/train': 1.279036045074463} 11/07/2021 15:57:55 - INFO - __main__ - Step 133229: {'lr': 1.5677036343888422e-05, 'samples': 25579968, 'steps': 133228, 'loss/train': 1.200405478477478} 11/07/2021 15:57:56 - INFO - __main__ - Step 133230: {'lr': 1.567518675471366e-05, 'samples': 25580160, 'steps': 133229, 'loss/train': 1.7286063432693481} 11/07/2021 15:57:57 - INFO - __main__ - Step 133231: {'lr': 1.5673337271122025e-05, 'samples': 25580352, 'steps': 133230, 'loss/train': 1.5176337957382202} 11/07/2021 15:57:57 - INFO - __main__ - Step 133232: {'lr': 1.5671487893114374e-05, 'samples': 25580544, 'steps': 133231, 'loss/train': 0.9880375862121582} 11/07/2021 15:57:57 - INFO - __main__ - Step 133233: {'lr': 1.566963862069151e-05, 'samples': 25580736, 'steps': 133232, 'loss/train': 0.5805540084838867} 11/07/2021 15:57:58 - INFO - __main__ - Step 133234: {'lr': 1.566778945385433e-05, 'samples': 25580928, 'steps': 133233, 'loss/train': 1.247307300567627} 11/07/2021 15:57:59 - INFO - __main__ - Step 133235: {'lr': 1.5665940392603604e-05, 'samples': 25581120, 'steps': 133234, 'loss/train': 1.0568877458572388} 11/07/2021 15:57:59 - INFO - __main__ - Step 133236: {'lr': 1.5664091436940198e-05, 'samples': 25581312, 'steps': 133235, 'loss/train': 1.467051386833191} 11/07/2021 15:58:00 - INFO - __main__ - Step 133237: {'lr': 1.5662242586864967e-05, 'samples': 25581504, 'steps': 133236, 'loss/train': 1.3661859035491943} 11/07/2021 15:58:00 - INFO - __main__ - Step 133238: {'lr': 1.566039384237866e-05, 'samples': 25581696, 'steps': 133237, 'loss/train': 1.5602885484695435} 11/07/2021 15:58:00 - INFO - __main__ - Step 133239: {'lr': 1.56585452034822e-05, 'samples': 25581888, 'steps': 133238, 'loss/train': 1.477757215499878} 11/07/2021 15:58:01 - INFO - __main__ - Step 133240: {'lr': 1.565669667017636e-05, 'samples': 25582080, 'steps': 133239, 'loss/train': 1.1240023374557495} 11/07/2021 15:58:02 - INFO - __main__ - Step 133241: {'lr': 1.5654848242461995e-05, 'samples': 25582272, 'steps': 133240, 'loss/train': 0.8746653199195862} 11/07/2021 15:58:02 - INFO - __main__ - Step 133242: {'lr': 1.565299992033997e-05, 'samples': 25582464, 'steps': 133241, 'loss/train': 1.2112903594970703} 11/07/2021 15:58:02 - INFO - __main__ - Step 133243: {'lr': 1.565115170381104e-05, 'samples': 25582656, 'steps': 133242, 'loss/train': 1.2879501581192017} 11/07/2021 15:58:03 - INFO - __main__ - Step 133244: {'lr': 1.564930359287611e-05, 'samples': 25582848, 'steps': 133243, 'loss/train': 1.102864146232605} 11/07/2021 15:58:04 - INFO - __main__ - Step 133245: {'lr': 1.5647455587535997e-05, 'samples': 25583040, 'steps': 133244, 'loss/train': 0.6739013195037842} 11/07/2021 15:58:04 - INFO - __main__ - Step 133246: {'lr': 1.5645607687791523e-05, 'samples': 25583232, 'steps': 133245, 'loss/train': 1.3458675146102905} 11/07/2021 15:58:05 - INFO - __main__ - Step 133247: {'lr': 1.56437598936435e-05, 'samples': 25583424, 'steps': 133246, 'loss/train': 1.0731439590454102} 11/07/2021 15:58:05 - INFO - __main__ - Step 133248: {'lr': 1.564191220509284e-05, 'samples': 25583616, 'steps': 133247, 'loss/train': 0.9899431467056274} 11/07/2021 15:58:05 - INFO - __main__ - Step 133249: {'lr': 1.5640064622140266e-05, 'samples': 25583808, 'steps': 133248, 'loss/train': 0.9015091061592102} 11/07/2021 15:58:07 - INFO - __main__ - Step 133250: {'lr': 1.5638217144786664e-05, 'samples': 25584000, 'steps': 133249, 'loss/train': 0.5371196866035461} 11/07/2021 15:58:07 - INFO - __main__ - Step 133251: {'lr': 1.5636369773032873e-05, 'samples': 25584192, 'steps': 133250, 'loss/train': 1.2681128978729248} 11/07/2021 15:58:07 - INFO - __main__ - Step 133252: {'lr': 1.5634522506879716e-05, 'samples': 25584384, 'steps': 133251, 'loss/train': 1.0163867473602295} 11/07/2021 15:58:08 - INFO - __main__ - Step 133253: {'lr': 1.5632675346328033e-05, 'samples': 25584576, 'steps': 133252, 'loss/train': 0.5120621919631958} 11/07/2021 15:58:08 - INFO - __main__ - Step 133254: {'lr': 1.563082829137863e-05, 'samples': 25584768, 'steps': 133253, 'loss/train': 0.633613109588623} 11/07/2021 15:58:08 - INFO - __main__ - Step 133255: {'lr': 1.562898134203239e-05, 'samples': 25584960, 'steps': 133254, 'loss/train': 1.826279878616333} 11/07/2021 15:58:09 - INFO - __main__ - Step 133256: {'lr': 1.562713449829009e-05, 'samples': 25585152, 'steps': 133255, 'loss/train': 1.1578449010849} 11/07/2021 15:58:10 - INFO - __main__ - Step 133257: {'lr': 1.5625287760152597e-05, 'samples': 25585344, 'steps': 133256, 'loss/train': 1.0852687358856201} 11/07/2021 15:58:10 - INFO - __main__ - Step 133258: {'lr': 1.5623441127620708e-05, 'samples': 25585536, 'steps': 133257, 'loss/train': 1.058019995689392} 11/07/2021 15:58:11 - INFO - __main__ - Step 133259: {'lr': 1.5621594600695342e-05, 'samples': 25585728, 'steps': 133258, 'loss/train': 1.1573538780212402} 11/07/2021 15:58:11 - INFO - __main__ - Step 133260: {'lr': 1.5619748179377225e-05, 'samples': 25585920, 'steps': 133259, 'loss/train': 1.0646404027938843} 11/07/2021 15:58:11 - INFO - __main__ - Step 133261: {'lr': 1.561790186366724e-05, 'samples': 25586112, 'steps': 133260, 'loss/train': 1.1060576438903809} 11/07/2021 15:58:12 - INFO - __main__ - Step 133262: {'lr': 1.561605565356622e-05, 'samples': 25586304, 'steps': 133261, 'loss/train': 1.278346061706543} 11/07/2021 15:58:13 - INFO - __main__ - Step 133263: {'lr': 1.561420954907497e-05, 'samples': 25586496, 'steps': 133262, 'loss/train': 0.9802129864692688} 11/07/2021 15:58:13 - INFO - __main__ - Step 133264: {'lr': 1.5612363550194352e-05, 'samples': 25586688, 'steps': 133263, 'loss/train': 0.9812164306640625} 11/07/2021 15:58:13 - INFO - __main__ - Step 133265: {'lr': 1.5610517656925173e-05, 'samples': 25586880, 'steps': 133264, 'loss/train': 1.2563745975494385} 11/07/2021 15:58:14 - INFO - __main__ - Step 133266: {'lr': 1.5608671869268286e-05, 'samples': 25587072, 'steps': 133265, 'loss/train': 1.1180747747421265} 11/07/2021 15:58:15 - INFO - __main__ - Step 133267: {'lr': 1.5606826187224506e-05, 'samples': 25587264, 'steps': 133266, 'loss/train': 1.1067101955413818} 11/07/2021 15:58:15 - INFO - __main__ - Step 133268: {'lr': 1.5604980610794685e-05, 'samples': 25587456, 'steps': 133267, 'loss/train': 0.9902024865150452} 11/07/2021 15:58:15 - INFO - __main__ - Step 133269: {'lr': 1.560313513997963e-05, 'samples': 25587648, 'steps': 133268, 'loss/train': 1.2129137516021729} 11/07/2021 15:58:16 - INFO - __main__ - Step 133270: {'lr': 1.560128977478023e-05, 'samples': 25587840, 'steps': 133269, 'loss/train': 1.2452095746994019} 11/07/2021 15:58:16 - INFO - __main__ - Step 133271: {'lr': 1.5599444515197235e-05, 'samples': 25588032, 'steps': 133270, 'loss/train': 1.240206241607666} 11/07/2021 15:58:17 - INFO - __main__ - Step 133272: {'lr': 1.5597599361231534e-05, 'samples': 25588224, 'steps': 133271, 'loss/train': 1.8581398725509644} 11/07/2021 15:58:18 - INFO - __main__ - Step 133273: {'lr': 1.5595754312883903e-05, 'samples': 25588416, 'steps': 133272, 'loss/train': 1.308066487312317} 11/07/2021 15:58:18 - INFO - __main__ - Step 133274: {'lr': 1.559390937015523e-05, 'samples': 25588608, 'steps': 133273, 'loss/train': 0.3945031762123108} 11/07/2021 15:58:18 - INFO - __main__ - Step 133275: {'lr': 1.559206453304632e-05, 'samples': 25588800, 'steps': 133274, 'loss/train': 1.2593176364898682} 11/07/2021 15:58:19 - INFO - __main__ - Step 133276: {'lr': 1.5590219801558002e-05, 'samples': 25588992, 'steps': 133275, 'loss/train': 1.4771440029144287} 11/07/2021 15:58:20 - INFO - __main__ - Step 133277: {'lr': 1.5588375175691116e-05, 'samples': 25589184, 'steps': 133276, 'loss/train': 1.1999213695526123} 11/07/2021 15:58:20 - INFO - __main__ - Step 133278: {'lr': 1.558653065544649e-05, 'samples': 25589376, 'steps': 133277, 'loss/train': 1.0856213569641113} 11/07/2021 15:58:20 - INFO - __main__ - Step 133279: {'lr': 1.5584686240824957e-05, 'samples': 25589568, 'steps': 133278, 'loss/train': 1.5423663854599} 11/07/2021 15:58:21 - INFO - __main__ - Step 133280: {'lr': 1.558284193182735e-05, 'samples': 25589760, 'steps': 133279, 'loss/train': 1.37490713596344} 11/07/2021 15:58:21 - INFO - __main__ - Step 133281: {'lr': 1.5580997728454478e-05, 'samples': 25589952, 'steps': 133280, 'loss/train': 1.0716253519058228} 11/07/2021 15:58:22 - INFO - __main__ - Step 133282: {'lr': 1.557915363070722e-05, 'samples': 25590144, 'steps': 133281, 'loss/train': 1.0519154071807861} 11/07/2021 15:58:22 - INFO - __main__ - Step 133283: {'lr': 1.5577309638586418e-05, 'samples': 25590336, 'steps': 133282, 'loss/train': 1.1767008304595947} 11/07/2021 15:58:23 - INFO - __main__ - Step 133284: {'lr': 1.5575465752092786e-05, 'samples': 25590528, 'steps': 133283, 'loss/train': 1.4879416227340698} 11/07/2021 15:58:23 - INFO - __main__ - Step 133285: {'lr': 1.5573621971227276e-05, 'samples': 25590720, 'steps': 133284, 'loss/train': 1.2762091159820557} 11/07/2021 15:58:24 - INFO - __main__ - Step 133286: {'lr': 1.5571778295990658e-05, 'samples': 25590912, 'steps': 133285, 'loss/train': 1.2268069982528687} 11/07/2021 15:58:24 - INFO - __main__ - Step 133287: {'lr': 1.5569934726383768e-05, 'samples': 25591104, 'steps': 133286, 'loss/train': 1.345582365989685} 11/07/2021 15:58:25 - INFO - __main__ - Step 133288: {'lr': 1.5568091262407463e-05, 'samples': 25591296, 'steps': 133287, 'loss/train': 1.0987067222595215} 11/07/2021 15:58:25 - INFO - __main__ - Step 133289: {'lr': 1.5566247904062553e-05, 'samples': 25591488, 'steps': 133288, 'loss/train': 0.7915785908699036} 11/07/2021 15:58:26 - INFO - __main__ - Step 133290: {'lr': 1.5564404651349868e-05, 'samples': 25591680, 'steps': 133289, 'loss/train': 1.1662968397140503} 11/07/2021 15:58:26 - INFO - __main__ - Step 133291: {'lr': 1.556256150427024e-05, 'samples': 25591872, 'steps': 133290, 'loss/train': 1.0990532636642456} 11/07/2021 15:58:27 - INFO - __main__ - Step 133292: {'lr': 1.556071846282453e-05, 'samples': 25592064, 'steps': 133291, 'loss/train': 1.1789212226867676} 11/07/2021 15:58:27 - INFO - __main__ - Step 133293: {'lr': 1.5558875527013518e-05, 'samples': 25592256, 'steps': 133292, 'loss/train': 1.36897611618042} 11/07/2021 15:58:28 - INFO - __main__ - Step 133294: {'lr': 1.555703269683806e-05, 'samples': 25592448, 'steps': 133293, 'loss/train': 1.412627935409546} 11/07/2021 15:58:28 - INFO - __main__ - Step 133295: {'lr': 1.555518997229899e-05, 'samples': 25592640, 'steps': 133294, 'loss/train': 1.170596957206726} 11/07/2021 15:58:28 - INFO - __main__ - Step 133296: {'lr': 1.5553347353397197e-05, 'samples': 25592832, 'steps': 133295, 'loss/train': 1.1945685148239136} 11/07/2021 15:58:30 - INFO - __main__ - Step 133297: {'lr': 1.5551504840133372e-05, 'samples': 25593024, 'steps': 133296, 'loss/train': 0.5309562683105469} 11/07/2021 15:58:30 - INFO - __main__ - Step 133298: {'lr': 1.5549662432508437e-05, 'samples': 25593216, 'steps': 133297, 'loss/train': 1.180662989616394} 11/07/2021 15:58:30 - INFO - __main__ - Step 133299: {'lr': 1.5547820130523222e-05, 'samples': 25593408, 'steps': 133298, 'loss/train': 0.9782919883728027} 11/07/2021 15:58:31 - INFO - __main__ - Step 133300: {'lr': 1.554597793417853e-05, 'samples': 25593600, 'steps': 133299, 'loss/train': 1.0718166828155518} 11/07/2021 15:58:31 - INFO - __main__ - Step 133301: {'lr': 1.5544135843475194e-05, 'samples': 25593792, 'steps': 133300, 'loss/train': 1.4738534688949585} 11/07/2021 15:58:31 - INFO - __main__ - Step 133302: {'lr': 1.5542293858414047e-05, 'samples': 25593984, 'steps': 133301, 'loss/train': 1.158553957939148} 11/07/2021 15:58:33 - INFO - __main__ - Step 133303: {'lr': 1.5540451978995924e-05, 'samples': 25594176, 'steps': 133302, 'loss/train': 1.1382149457931519} 11/07/2021 15:58:33 - INFO - __main__ - Step 133304: {'lr': 1.553861020522168e-05, 'samples': 25594368, 'steps': 133303, 'loss/train': 1.2614926099777222} 11/07/2021 15:58:33 - INFO - __main__ - Step 133305: {'lr': 1.55367685370921e-05, 'samples': 25594560, 'steps': 133304, 'loss/train': 1.8056328296661377} 11/07/2021 15:58:34 - INFO - __main__ - Step 133306: {'lr': 1.553492697460804e-05, 'samples': 25594752, 'steps': 133305, 'loss/train': 1.3405449390411377} 11/07/2021 15:58:34 - INFO - __main__ - Step 133307: {'lr': 1.553308551777033e-05, 'samples': 25594944, 'steps': 133306, 'loss/train': 1.289125680923462} 11/07/2021 15:58:35 - INFO - __main__ - Step 133308: {'lr': 1.5531244166579778e-05, 'samples': 25595136, 'steps': 133307, 'loss/train': 1.3626574277877808} 11/07/2021 15:58:36 - INFO - __main__ - Step 133309: {'lr': 1.5529402921037243e-05, 'samples': 25595328, 'steps': 133308, 'loss/train': 1.3505644798278809} 11/07/2021 15:58:36 - INFO - __main__ - Step 133310: {'lr': 1.552756178114359e-05, 'samples': 25595520, 'steps': 133309, 'loss/train': 0.8778926134109497} 11/07/2021 15:58:36 - INFO - __main__ - Step 133311: {'lr': 1.5525720746899535e-05, 'samples': 25595712, 'steps': 133310, 'loss/train': 1.2008309364318848} 11/07/2021 15:58:37 - INFO - __main__ - Step 133312: {'lr': 1.5523879818305996e-05, 'samples': 25595904, 'steps': 133311, 'loss/train': 0.940147340297699} 11/07/2021 15:58:38 - INFO - __main__ - Step 133313: {'lr': 1.5522038995363753e-05, 'samples': 25596096, 'steps': 133312, 'loss/train': 0.050168029963970184} 11/07/2021 15:58:38 - INFO - __main__ - Step 133314: {'lr': 1.552019827807369e-05, 'samples': 25596288, 'steps': 133313, 'loss/train': 0.8869742751121521} 11/07/2021 15:58:38 - INFO - __main__ - Step 133315: {'lr': 1.5518357666436584e-05, 'samples': 25596480, 'steps': 133314, 'loss/train': 1.3280733823776245} 11/07/2021 15:58:39 - INFO - __main__ - Step 133316: {'lr': 1.5516517160453298e-05, 'samples': 25596672, 'steps': 133315, 'loss/train': 1.190535306930542} 11/07/2021 15:58:39 - INFO - __main__ - Step 133317: {'lr': 1.5514676760124636e-05, 'samples': 25596864, 'steps': 133316, 'loss/train': 1.0364487171173096} 11/07/2021 15:58:40 - INFO - __main__ - Step 133318: {'lr': 1.551283646545146e-05, 'samples': 25597056, 'steps': 133317, 'loss/train': 1.2605116367340088} 11/07/2021 15:58:40 - INFO - __main__ - Step 133319: {'lr': 1.551099627643457e-05, 'samples': 25597248, 'steps': 133318, 'loss/train': 0.9826151728630066} 11/07/2021 15:58:41 - INFO - __main__ - Step 133320: {'lr': 1.55091561930748e-05, 'samples': 25597440, 'steps': 133319, 'loss/train': 0.031404200941324234} 11/07/2021 15:58:41 - INFO - __main__ - Step 133321: {'lr': 1.5507316215373018e-05, 'samples': 25597632, 'steps': 133320, 'loss/train': 1.1585544347763062} 11/07/2021 15:58:41 - INFO - __main__ - Step 133322: {'lr': 1.5505476343329992e-05, 'samples': 25597824, 'steps': 133321, 'loss/train': 1.4644098281860352} 11/07/2021 15:58:43 - INFO - __main__ - Step 133323: {'lr': 1.5503636576946646e-05, 'samples': 25598016, 'steps': 133322, 'loss/train': 1.524029016494751} 11/07/2021 15:58:43 - INFO - __main__ - Step 133324: {'lr': 1.5501796916223664e-05, 'samples': 25598208, 'steps': 133323, 'loss/train': 0.9626952409744263} 11/07/2021 15:58:43 - INFO - __main__ - Step 133325: {'lr': 1.5499957361161997e-05, 'samples': 25598400, 'steps': 133324, 'loss/train': 1.3039357662200928} 11/07/2021 15:58:44 - INFO - __main__ - Step 133326: {'lr': 1.5498117911762395e-05, 'samples': 25598592, 'steps': 133325, 'loss/train': 1.1249220371246338} 11/07/2021 15:58:44 - INFO - __main__ - Step 133327: {'lr': 1.5496278568025742e-05, 'samples': 25598784, 'steps': 133326, 'loss/train': 1.2931283712387085} 11/07/2021 15:58:45 - INFO - __main__ - Step 133328: {'lr': 1.5494439329952844e-05, 'samples': 25598976, 'steps': 133327, 'loss/train': 1.3299938440322876} 11/07/2021 15:58:45 - INFO - __main__ - Step 133329: {'lr': 1.549260019754453e-05, 'samples': 25599168, 'steps': 133328, 'loss/train': 1.4014053344726562} 11/07/2021 15:58:46 - INFO - __main__ - Step 133330: {'lr': 1.5490761170801643e-05, 'samples': 25599360, 'steps': 133329, 'loss/train': 1.29013991355896} 11/07/2021 15:58:46 - INFO - __main__ - Step 133331: {'lr': 1.5488922249724978e-05, 'samples': 25599552, 'steps': 133330, 'loss/train': 1.1644545793533325} 11/07/2021 15:58:46 - INFO - __main__ - Step 133332: {'lr': 1.54870834343154e-05, 'samples': 25599744, 'steps': 133331, 'loss/train': 1.5787686109542847} 11/07/2021 15:58:47 - INFO - __main__ - Step 133333: {'lr': 1.5485244724573717e-05, 'samples': 25599936, 'steps': 133332, 'loss/train': 1.5885015726089478} 11/07/2021 15:58:48 - INFO - __main__ - Step 133334: {'lr': 1.5483406120500782e-05, 'samples': 25600128, 'steps': 133333, 'loss/train': 1.4705021381378174} 11/07/2021 15:58:48 - INFO - __main__ - Step 133335: {'lr': 1.5481567622097376e-05, 'samples': 25600320, 'steps': 133334, 'loss/train': 1.374005675315857} 11/07/2021 15:58:49 - INFO - __main__ - Step 133336: {'lr': 1.5479729229364385e-05, 'samples': 25600512, 'steps': 133335, 'loss/train': 1.0539137125015259} 11/07/2021 15:58:49 - INFO - __main__ - Step 133337: {'lr': 1.5477890942302618e-05, 'samples': 25600704, 'steps': 133336, 'loss/train': 1.1702529191970825} 11/07/2021 15:58:50 - INFO - __main__ - Step 133338: {'lr': 1.547605276091288e-05, 'samples': 25600896, 'steps': 133337, 'loss/train': 1.4605427980422974} 11/07/2021 15:58:50 - INFO - __main__ - Step 133339: {'lr': 1.5474214685196027e-05, 'samples': 25601088, 'steps': 133338, 'loss/train': 1.0646215677261353} 11/07/2021 15:58:51 - INFO - __main__ - Step 133340: {'lr': 1.5472376715152837e-05, 'samples': 25601280, 'steps': 133339, 'loss/train': 0.9209778904914856} 11/07/2021 15:58:51 - INFO - __main__ - Step 133341: {'lr': 1.54705388507842e-05, 'samples': 25601472, 'steps': 133340, 'loss/train': 1.8594884872436523} 11/07/2021 15:58:51 - INFO - __main__ - Step 133342: {'lr': 1.546870109209092e-05, 'samples': 25601664, 'steps': 133341, 'loss/train': 1.298503041267395} 11/07/2021 15:58:53 - INFO - __main__ - Step 133343: {'lr': 1.5466863439073804e-05, 'samples': 25601856, 'steps': 133342, 'loss/train': 1.2084490060806274} 11/07/2021 15:58:53 - INFO - __main__ - Step 133344: {'lr': 1.546502589173371e-05, 'samples': 25602048, 'steps': 133343, 'loss/train': 1.2199875116348267} 11/07/2021 15:58:53 - INFO - __main__ - Step 133345: {'lr': 1.546318845007147e-05, 'samples': 25602240, 'steps': 133344, 'loss/train': 0.8970987200737} 11/07/2021 15:58:54 - INFO - __main__ - Step 133346: {'lr': 1.546135111408789e-05, 'samples': 25602432, 'steps': 133345, 'loss/train': 1.223444938659668} 11/07/2021 15:58:54 - INFO - __main__ - Step 133347: {'lr': 1.5459513883783805e-05, 'samples': 25602624, 'steps': 133346, 'loss/train': 1.2062304019927979} 11/07/2021 15:58:54 - INFO - __main__ - Step 133348: {'lr': 1.5457676759160015e-05, 'samples': 25602816, 'steps': 133347, 'loss/train': 1.397937297821045} 11/07/2021 15:58:56 - INFO - __main__ - Step 133349: {'lr': 1.5455839740217416e-05, 'samples': 25603008, 'steps': 133348, 'loss/train': 0.06994263082742691} 11/07/2021 15:58:56 - INFO - __main__ - Step 133350: {'lr': 1.545400282695683e-05, 'samples': 25603200, 'steps': 133349, 'loss/train': 1.2858717441558838} 11/07/2021 15:58:56 - INFO - __main__ - Step 133351: {'lr': 1.5452166019378987e-05, 'samples': 25603392, 'steps': 133350, 'loss/train': 1.0362467765808105} 11/07/2021 15:58:57 - INFO - __main__ - Step 133352: {'lr': 1.54503293174848e-05, 'samples': 25603584, 'steps': 133351, 'loss/train': 1.1645169258117676} 11/07/2021 15:58:57 - INFO - __main__ - Step 133353: {'lr': 1.5448492721275075e-05, 'samples': 25603776, 'steps': 133352, 'loss/train': 1.1392532587051392} 11/07/2021 15:58:58 - INFO - __main__ - Step 133354: {'lr': 1.5446656230750645e-05, 'samples': 25603968, 'steps': 133353, 'loss/train': 1.6709321737289429} 11/07/2021 15:58:58 - INFO - __main__ - Step 133355: {'lr': 1.5444819845912313e-05, 'samples': 25604160, 'steps': 133354, 'loss/train': 1.5616968870162964} 11/07/2021 15:58:59 - INFO - __main__ - Step 133356: {'lr': 1.5442983566760937e-05, 'samples': 25604352, 'steps': 133355, 'loss/train': 1.3462061882019043} 11/07/2021 15:58:59 - INFO - __main__ - Step 133357: {'lr': 1.544114739329733e-05, 'samples': 25604544, 'steps': 133356, 'loss/train': 1.1343238353729248} 11/07/2021 15:58:59 - INFO - __main__ - Step 133358: {'lr': 1.5439311325522344e-05, 'samples': 25604736, 'steps': 133357, 'loss/train': 1.5766531229019165} 11/07/2021 15:59:00 - INFO - __main__ - Step 133359: {'lr': 1.543747536343676e-05, 'samples': 25604928, 'steps': 133358, 'loss/train': 0.8964548707008362} 11/07/2021 15:59:01 - INFO - __main__ - Step 133360: {'lr': 1.543563950704144e-05, 'samples': 25605120, 'steps': 133359, 'loss/train': 1.1921885013580322} 11/07/2021 15:59:01 - INFO - __main__ - Step 133361: {'lr': 1.5433803756337185e-05, 'samples': 25605312, 'steps': 133360, 'loss/train': 1.1096781492233276} 11/07/2021 15:59:01 - INFO - __main__ - Step 133362: {'lr': 1.5431968111324856e-05, 'samples': 25605504, 'steps': 133361, 'loss/train': 5.519468307495117} 11/07/2021 15:59:02 - INFO - __main__ - Step 133363: {'lr': 1.5430132572005263e-05, 'samples': 25605696, 'steps': 133362, 'loss/train': 1.385573387145996} 11/07/2021 15:59:03 - INFO - __main__ - Step 133364: {'lr': 1.542829713837926e-05, 'samples': 25605888, 'steps': 133363, 'loss/train': 0.8978381156921387} 11/07/2021 15:59:03 - INFO - __main__ - Step 133365: {'lr': 1.54264618104476e-05, 'samples': 25606080, 'steps': 133364, 'loss/train': 1.278573989868164} 11/07/2021 15:59:04 - INFO - __main__ - Step 133366: {'lr': 1.542462658821117e-05, 'samples': 25606272, 'steps': 133365, 'loss/train': 0.4718656837940216} 11/07/2021 15:59:04 - INFO - __main__ - Step 133367: {'lr': 1.5422791471670777e-05, 'samples': 25606464, 'steps': 133366, 'loss/train': 1.3261065483093262} 11/07/2021 15:59:04 - INFO - __main__ - Step 133368: {'lr': 1.542095646082728e-05, 'samples': 25606656, 'steps': 133367, 'loss/train': 1.5266095399856567} 11/07/2021 15:59:05 - INFO - __main__ - Step 133369: {'lr': 1.5419121555681454e-05, 'samples': 25606848, 'steps': 133368, 'loss/train': 1.5146926641464233} 11/07/2021 15:59:06 - INFO - __main__ - Step 133370: {'lr': 1.541728675623416e-05, 'samples': 25607040, 'steps': 133369, 'loss/train': 1.1436898708343506} 11/07/2021 15:59:06 - INFO - __main__ - Step 133371: {'lr': 1.5415452062486207e-05, 'samples': 25607232, 'steps': 133370, 'loss/train': 1.2406367063522339} 11/07/2021 15:59:06 - INFO - __main__ - Step 133372: {'lr': 1.5413617474438452e-05, 'samples': 25607424, 'steps': 133371, 'loss/train': 1.198809266090393} 11/07/2021 15:59:07 - INFO - __main__ - Step 133373: {'lr': 1.54117829920917e-05, 'samples': 25607616, 'steps': 133372, 'loss/train': 1.149152159690857} 11/07/2021 15:59:07 - INFO - __main__ - Step 133374: {'lr': 1.5409948615446758e-05, 'samples': 25607808, 'steps': 133373, 'loss/train': 1.048872470855713} 11/07/2021 15:59:08 - INFO - __main__ - Step 133375: {'lr': 1.5408114344504482e-05, 'samples': 25608000, 'steps': 133374, 'loss/train': 1.4956938028335571} 11/07/2021 15:59:09 - INFO - __main__ - Step 133376: {'lr': 1.540628017926568e-05, 'samples': 25608192, 'steps': 133375, 'loss/train': 1.2757869958877563} 11/07/2021 15:59:09 - INFO - __main__ - Step 133377: {'lr': 1.540444611973124e-05, 'samples': 25608384, 'steps': 133376, 'loss/train': 1.2229783535003662} 11/07/2021 15:59:09 - INFO - __main__ - Step 133378: {'lr': 1.5402612165901913e-05, 'samples': 25608576, 'steps': 133377, 'loss/train': 1.103337287902832} 11/07/2021 15:59:10 - INFO - __main__ - Step 133379: {'lr': 1.540077831777853e-05, 'samples': 25608768, 'steps': 133378, 'loss/train': 1.2284501791000366} 11/07/2021 15:59:11 - INFO - __main__ - Step 133380: {'lr': 1.539894457536192e-05, 'samples': 25608960, 'steps': 133379, 'loss/train': 0.5220436453819275} 11/07/2021 15:59:11 - INFO - __main__ - Step 133381: {'lr': 1.5397110938652953e-05, 'samples': 25609152, 'steps': 133380, 'loss/train': 1.0932565927505493} 11/07/2021 15:59:11 - INFO - __main__ - Step 133382: {'lr': 1.5395277407652426e-05, 'samples': 25609344, 'steps': 133381, 'loss/train': 1.4916112422943115} 11/07/2021 15:59:12 - INFO - __main__ - Step 133383: {'lr': 1.5393443982361143e-05, 'samples': 25609536, 'steps': 133382, 'loss/train': 1.63218355178833} 11/07/2021 15:59:12 - INFO - __main__ - Step 133384: {'lr': 1.5391610662779972e-05, 'samples': 25609728, 'steps': 133383, 'loss/train': 0.9568156599998474} 11/07/2021 15:59:13 - INFO - __main__ - Step 133385: {'lr': 1.5389777448909735e-05, 'samples': 25609920, 'steps': 133384, 'loss/train': 1.5122613906860352} 11/07/2021 15:59:13 - INFO - __main__ - Step 133386: {'lr': 1.538794434075122e-05, 'samples': 25610112, 'steps': 133385, 'loss/train': 1.695731282234192} 11/07/2021 15:59:14 - INFO - __main__ - Step 133387: {'lr': 1.538611133830528e-05, 'samples': 25610304, 'steps': 133386, 'loss/train': 1.3950645923614502} 11/07/2021 15:59:14 - INFO - __main__ - Step 133388: {'lr': 1.5384278441572754e-05, 'samples': 25610496, 'steps': 133387, 'loss/train': 1.3382737636566162} 11/07/2021 15:59:15 - INFO - __main__ - Step 133389: {'lr': 1.538244565055444e-05, 'samples': 25610688, 'steps': 133388, 'loss/train': 1.265391230583191} 11/07/2021 15:59:16 - INFO - __main__ - Step 133390: {'lr': 1.53806129652512e-05, 'samples': 25610880, 'steps': 133389, 'loss/train': 1.2324469089508057} 11/07/2021 15:59:17 - INFO - __main__ - Step 133391: {'lr': 1.5378780385663816e-05, 'samples': 25611072, 'steps': 133390, 'loss/train': 1.7871687412261963} 11/07/2021 15:59:17 - INFO - __main__ - Step 133392: {'lr': 1.5376947911793143e-05, 'samples': 25611264, 'steps': 133391, 'loss/train': 1.204947829246521} 11/07/2021 15:59:17 - INFO - __main__ - Step 133393: {'lr': 1.537511554363996e-05, 'samples': 25611456, 'steps': 133392, 'loss/train': 1.1047695875167847} 11/07/2021 15:59:18 - INFO - __main__ - Step 133394: {'lr': 1.5373283281205158e-05, 'samples': 25611648, 'steps': 133393, 'loss/train': 1.4890614748001099} 11/07/2021 15:59:18 - INFO - __main__ - Step 133395: {'lr': 1.537145112448954e-05, 'samples': 25611840, 'steps': 133394, 'loss/train': 1.223502516746521} 11/07/2021 15:59:18 - INFO - __main__ - Step 133396: {'lr': 1.5369619073493906e-05, 'samples': 25612032, 'steps': 133395, 'loss/train': 1.7321161031723022} 11/07/2021 15:59:19 - INFO - __main__ - Step 133397: {'lr': 1.5367787128219124e-05, 'samples': 25612224, 'steps': 133396, 'loss/train': 1.7510584592819214} 11/07/2021 15:59:20 - INFO - __main__ - Step 133398: {'lr': 1.5365955288665967e-05, 'samples': 25612416, 'steps': 133397, 'loss/train': 1.5940018892288208} 11/07/2021 15:59:20 - INFO - __main__ - Step 133399: {'lr': 1.5364123554835295e-05, 'samples': 25612608, 'steps': 133398, 'loss/train': 1.2110244035720825} 11/07/2021 15:59:20 - INFO - __main__ - Step 133400: {'lr': 1.5362291926727945e-05, 'samples': 25612800, 'steps': 133399, 'loss/train': 1.3391791582107544} 11/07/2021 15:59:21 - INFO - __main__ - Step 133401: {'lr': 1.5360460404344717e-05, 'samples': 25612992, 'steps': 133400, 'loss/train': 0.9584820866584778} 11/07/2021 15:59:21 - INFO - __main__ - Step 133402: {'lr': 1.5358628987686447e-05, 'samples': 25613184, 'steps': 133401, 'loss/train': 1.5962257385253906} 11/07/2021 15:59:22 - INFO - __main__ - Step 133403: {'lr': 1.5356797676753965e-05, 'samples': 25613376, 'steps': 133402, 'loss/train': 0.45758160948753357} 11/07/2021 15:59:23 - INFO - __main__ - Step 133404: {'lr': 1.5354966471548105e-05, 'samples': 25613568, 'steps': 133403, 'loss/train': 0.9392993450164795} 11/07/2021 15:59:23 - INFO - __main__ - Step 133405: {'lr': 1.535313537206967e-05, 'samples': 25613760, 'steps': 133404, 'loss/train': 0.8518125414848328} 11/07/2021 15:59:23 - INFO - __main__ - Step 133406: {'lr': 1.535130437831947e-05, 'samples': 25613952, 'steps': 133405, 'loss/train': 1.3582690954208374} 11/07/2021 15:59:24 - INFO - __main__ - Step 133407: {'lr': 1.534947349029836e-05, 'samples': 25614144, 'steps': 133406, 'loss/train': 0.9760210514068604} 11/07/2021 15:59:25 - INFO - __main__ - Step 133408: {'lr': 1.534764270800715e-05, 'samples': 25614336, 'steps': 133407, 'loss/train': 0.7721223831176758} 11/07/2021 15:59:25 - INFO - __main__ - Step 133409: {'lr': 1.5345812031446665e-05, 'samples': 25614528, 'steps': 133408, 'loss/train': 1.2695097923278809} 11/07/2021 15:59:25 - INFO - __main__ - Step 133410: {'lr': 1.5343981460617745e-05, 'samples': 25614720, 'steps': 133409, 'loss/train': 1.351932406425476} 11/07/2021 15:59:26 - INFO - __main__ - Step 133411: {'lr': 1.534215099552119e-05, 'samples': 25614912, 'steps': 133410, 'loss/train': 1.3308594226837158} 11/07/2021 15:59:26 - INFO - __main__ - Step 133412: {'lr': 1.534032063615787e-05, 'samples': 25615104, 'steps': 133411, 'loss/train': 2.0059499740600586} 11/07/2021 15:59:27 - INFO - __main__ - Step 133413: {'lr': 1.5338490382528575e-05, 'samples': 25615296, 'steps': 133412, 'loss/train': 0.8636791110038757} 11/07/2021 15:59:28 - INFO - __main__ - Step 133414: {'lr': 1.533666023463412e-05, 'samples': 25615488, 'steps': 133413, 'loss/train': 1.3302905559539795} 11/07/2021 15:59:28 - INFO - __main__ - Step 133415: {'lr': 1.5334830192475362e-05, 'samples': 25615680, 'steps': 133414, 'loss/train': 1.3756898641586304} 11/07/2021 15:59:28 - INFO - __main__ - Step 133416: {'lr': 1.533300025605308e-05, 'samples': 25615872, 'steps': 133415, 'loss/train': 0.12384244054555893} 11/07/2021 15:59:29 - INFO - __main__ - Step 133417: {'lr': 1.533117042536819e-05, 'samples': 25616064, 'steps': 133416, 'loss/train': 2.6882400512695312} 11/07/2021 15:59:30 - INFO - __main__ - Step 133418: {'lr': 1.5329340700421412e-05, 'samples': 25616256, 'steps': 133417, 'loss/train': 1.2098441123962402} 11/07/2021 15:59:30 - INFO - __main__ - Step 133419: {'lr': 1.532751108121361e-05, 'samples': 25616448, 'steps': 133418, 'loss/train': 1.1684740781784058} 11/07/2021 15:59:31 - INFO - __main__ - Step 133420: {'lr': 1.5325681567745608e-05, 'samples': 25616640, 'steps': 133419, 'loss/train': 1.3751649856567383} 11/07/2021 15:59:31 - INFO - __main__ - Step 133421: {'lr': 1.532385216001822e-05, 'samples': 25616832, 'steps': 133420, 'loss/train': 1.2825039625167847} 11/07/2021 15:59:31 - INFO - __main__ - Step 133422: {'lr': 1.5322022858032304e-05, 'samples': 25617024, 'steps': 133421, 'loss/train': 0.20885154604911804} 11/07/2021 15:59:32 - INFO - __main__ - Step 133423: {'lr': 1.532019366178866e-05, 'samples': 25617216, 'steps': 133422, 'loss/train': 1.3836865425109863} 11/07/2021 15:59:33 - INFO - __main__ - Step 133424: {'lr': 1.53183645712881e-05, 'samples': 25617408, 'steps': 133423, 'loss/train': 0.43155935406684875} 11/07/2021 15:59:33 - INFO - __main__ - Step 133425: {'lr': 1.5316535586531482e-05, 'samples': 25617600, 'steps': 133424, 'loss/train': 1.1254016160964966} 11/07/2021 15:59:34 - INFO - __main__ - Step 133426: {'lr': 1.531470670751961e-05, 'samples': 25617792, 'steps': 133425, 'loss/train': 1.503491997718811} 11/07/2021 15:59:34 - INFO - __main__ - Step 133427: {'lr': 1.531287793425329e-05, 'samples': 25617984, 'steps': 133426, 'loss/train': 1.0744863748550415} 11/07/2021 15:59:35 - INFO - __main__ - Step 133428: {'lr': 1.531104926673338e-05, 'samples': 25618176, 'steps': 133427, 'loss/train': 0.5906508564949036} 11/07/2021 15:59:35 - INFO - __main__ - Step 133429: {'lr': 1.5309220704960687e-05, 'samples': 25618368, 'steps': 133428, 'loss/train': 0.8716502785682678} 11/07/2021 15:59:36 - INFO - __main__ - Step 133430: {'lr': 1.530739224893604e-05, 'samples': 25618560, 'steps': 133429, 'loss/train': 1.3968929052352905} 11/07/2021 15:59:36 - INFO - __main__ - Step 133431: {'lr': 1.5305563898660307e-05, 'samples': 25618752, 'steps': 133430, 'loss/train': 1.3714078664779663} 11/07/2021 15:59:36 - INFO - __main__ - Step 133432: {'lr': 1.5303735654134204e-05, 'samples': 25618944, 'steps': 133431, 'loss/train': 0.9939188361167908} 11/07/2021 15:59:37 - INFO - __main__ - Step 133433: {'lr': 1.530190751535865e-05, 'samples': 25619136, 'steps': 133432, 'loss/train': 1.1209250688552856} 11/07/2021 15:59:38 - INFO - __main__ - Step 133434: {'lr': 1.5300079482334416e-05, 'samples': 25619328, 'steps': 133433, 'loss/train': 1.061737298965454} 11/07/2021 15:59:38 - INFO - __main__ - Step 133435: {'lr': 1.5298251555062342e-05, 'samples': 25619520, 'steps': 133434, 'loss/train': 1.3627872467041016} 11/07/2021 15:59:38 - INFO - __main__ - Step 133436: {'lr': 1.529642373354326e-05, 'samples': 25619712, 'steps': 133435, 'loss/train': 1.25464928150177} 11/07/2021 15:59:39 - INFO - __main__ - Step 133437: {'lr': 1.5294596017777968e-05, 'samples': 25619904, 'steps': 133436, 'loss/train': 1.248350739479065} 11/07/2021 15:59:39 - INFO - __main__ - Step 133438: {'lr': 1.5292768407767332e-05, 'samples': 25620096, 'steps': 133437, 'loss/train': 0.619701087474823} 11/07/2021 15:59:40 - INFO - __main__ - Step 133439: {'lr': 1.5290940903512158e-05, 'samples': 25620288, 'steps': 133438, 'loss/train': 1.4515228271484375} 11/07/2021 15:59:40 - INFO - __main__ - Step 133440: {'lr': 1.528911350501325e-05, 'samples': 25620480, 'steps': 133439, 'loss/train': 0.6858823895454407} 11/07/2021 15:59:41 - INFO - __main__ - Step 133441: {'lr': 1.5287286212271434e-05, 'samples': 25620672, 'steps': 133440, 'loss/train': 1.1708323955535889} 11/07/2021 15:59:41 - INFO - __main__ - Step 133442: {'lr': 1.528545902528755e-05, 'samples': 25620864, 'steps': 133441, 'loss/train': 1.2577451467514038} 11/07/2021 15:59:42 - INFO - __main__ - Step 133443: {'lr': 1.528363194406243e-05, 'samples': 25621056, 'steps': 133442, 'loss/train': 1.4684494733810425} 11/07/2021 15:59:43 - INFO - __main__ - Step 133444: {'lr': 1.5281804968596935e-05, 'samples': 25621248, 'steps': 133443, 'loss/train': 1.4337486028671265} 11/07/2021 15:59:43 - INFO - __main__ - Step 133445: {'lr': 1.5279978098891756e-05, 'samples': 25621440, 'steps': 133444, 'loss/train': 1.2910977602005005} 11/07/2021 15:59:43 - INFO - __main__ - Step 133446: {'lr': 1.5278151334947837e-05, 'samples': 25621632, 'steps': 133445, 'loss/train': 0.60901939868927} 11/07/2021 15:59:44 - INFO - __main__ - Step 133447: {'lr': 1.5276324676765956e-05, 'samples': 25621824, 'steps': 133446, 'loss/train': 1.610901951789856} 11/07/2021 15:59:44 - INFO - __main__ - Step 133448: {'lr': 1.527449812434692e-05, 'samples': 25622016, 'steps': 133447, 'loss/train': 1.3506908416748047} 11/07/2021 15:59:45 - INFO - __main__ - Step 133449: {'lr': 1.5272671677691584e-05, 'samples': 25622208, 'steps': 133448, 'loss/train': 1.054682970046997} 11/07/2021 15:59:45 - INFO - __main__ - Step 133450: {'lr': 1.527084533680076e-05, 'samples': 25622400, 'steps': 133449, 'loss/train': 1.9981392621994019} 11/07/2021 15:59:46 - INFO - __main__ - Step 133451: {'lr': 1.5269019101675275e-05, 'samples': 25622592, 'steps': 133450, 'loss/train': 1.073248267173767} 11/07/2021 15:59:46 - INFO - __main__ - Step 133452: {'lr': 1.5267192972315937e-05, 'samples': 25622784, 'steps': 133451, 'loss/train': 0.8997324109077454} 11/07/2021 15:59:46 - INFO - __main__ - Step 133453: {'lr': 1.5265366948723574e-05, 'samples': 25622976, 'steps': 133452, 'loss/train': 1.4336532354354858} 11/07/2021 15:59:48 - INFO - __main__ - Step 133454: {'lr': 1.5263541030899053e-05, 'samples': 25623168, 'steps': 133453, 'loss/train': 1.4207180738449097} 11/07/2021 15:59:48 - INFO - __main__ - Step 133455: {'lr': 1.526171521884312e-05, 'samples': 25623360, 'steps': 133454, 'loss/train': 0.12469340115785599} 11/07/2021 15:59:48 - INFO - __main__ - Step 133456: {'lr': 1.5259889512556634e-05, 'samples': 25623552, 'steps': 133455, 'loss/train': 1.264792799949646} 11/07/2021 15:59:49 - INFO - __main__ - Step 133457: {'lr': 1.525806391204046e-05, 'samples': 25623744, 'steps': 133456, 'loss/train': 0.6973235607147217} 11/07/2021 15:59:49 - INFO - __main__ - Step 133458: {'lr': 1.5256238417295371e-05, 'samples': 25623936, 'steps': 133457, 'loss/train': 0.5898681879043579} 11/07/2021 15:59:50 - INFO - __main__ - Step 133459: {'lr': 1.52544130283222e-05, 'samples': 25624128, 'steps': 133458, 'loss/train': 1.2510044574737549} 11/07/2021 15:59:51 - INFO - __main__ - Step 133460: {'lr': 1.5252587745121727e-05, 'samples': 25624320, 'steps': 133459, 'loss/train': 0.696232795715332} 11/07/2021 15:59:51 - INFO - __main__ - Step 133461: {'lr': 1.5250762567694836e-05, 'samples': 25624512, 'steps': 133460, 'loss/train': 1.235356092453003} 11/07/2021 15:59:51 - INFO - __main__ - Step 133462: {'lr': 1.5248937496042337e-05, 'samples': 25624704, 'steps': 133461, 'loss/train': 1.0801645517349243} 11/07/2021 15:59:52 - INFO - __main__ - Step 133463: {'lr': 1.5247112530165058e-05, 'samples': 25624896, 'steps': 133462, 'loss/train': 0.9789373874664307} 11/07/2021 15:59:53 - INFO - __main__ - Step 133464: {'lr': 1.5245287670063779e-05, 'samples': 25625088, 'steps': 133463, 'loss/train': 1.499046802520752} 11/07/2021 15:59:53 - INFO - __main__ - Step 133465: {'lr': 1.5243462915739359e-05, 'samples': 25625280, 'steps': 133464, 'loss/train': 1.0483229160308838} 11/07/2021 15:59:53 - INFO - __main__ - Step 133466: {'lr': 1.5241638267192603e-05, 'samples': 25625472, 'steps': 133465, 'loss/train': 1.1522443294525146} 11/07/2021 15:59:54 - INFO - __main__ - Step 133467: {'lr': 1.5239813724424345e-05, 'samples': 25625664, 'steps': 133466, 'loss/train': 1.464613676071167} 11/07/2021 15:59:54 - INFO - __main__ - Step 133468: {'lr': 1.5237989287435417e-05, 'samples': 25625856, 'steps': 133467, 'loss/train': 1.4858627319335938} 11/07/2021 15:59:55 - INFO - __main__ - Step 133469: {'lr': 1.5236164956226622e-05, 'samples': 25626048, 'steps': 133468, 'loss/train': 1.7248876094818115} 11/07/2021 15:59:55 - INFO - __main__ - Step 133470: {'lr': 1.5234340730798795e-05, 'samples': 25626240, 'steps': 133469, 'loss/train': 1.2221448421478271} 11/07/2021 15:59:56 - INFO - __main__ - Step 133471: {'lr': 1.5232516611152797e-05, 'samples': 25626432, 'steps': 133470, 'loss/train': 0.5976595282554626} 11/07/2021 15:59:56 - INFO - __main__ - Step 133472: {'lr': 1.5230692597289348e-05, 'samples': 25626624, 'steps': 133471, 'loss/train': 1.1462793350219727} 11/07/2021 15:59:56 - INFO - __main__ - Step 133473: {'lr': 1.5228868689209335e-05, 'samples': 25626816, 'steps': 133472, 'loss/train': 1.6537848711013794} 11/07/2021 15:59:58 - INFO - __main__ - Step 133474: {'lr': 1.5227044886913566e-05, 'samples': 25627008, 'steps': 133473, 'loss/train': 0.9005470871925354} 11/07/2021 15:59:58 - INFO - __main__ - Step 133475: {'lr': 1.5225221190402871e-05, 'samples': 25627200, 'steps': 133474, 'loss/train': 0.7283817529678345} 11/07/2021 15:59:58 - INFO - __main__ - Step 133476: {'lr': 1.5223397599678057e-05, 'samples': 25627392, 'steps': 133475, 'loss/train': 1.1304653882980347} 11/07/2021 15:59:59 - INFO - __main__ - Step 133477: {'lr': 1.5221574114739956e-05, 'samples': 25627584, 'steps': 133476, 'loss/train': 1.043939232826233} 11/07/2021 15:59:59 - INFO - __main__ - Step 133478: {'lr': 1.5219750735589427e-05, 'samples': 25627776, 'steps': 133477, 'loss/train': 1.401223063468933} 11/07/2021 15:59:59 - INFO - __main__ - Step 133479: {'lr': 1.5217927462227222e-05, 'samples': 25627968, 'steps': 133478, 'loss/train': 1.580401062965393} 11/07/2021 16:00:00 - INFO - __main__ - Step 133480: {'lr': 1.5216104294654198e-05, 'samples': 25628160, 'steps': 133479, 'loss/train': 1.236075758934021} 11/07/2021 16:00:01 - INFO - __main__ - Step 133481: {'lr': 1.5214281232871191e-05, 'samples': 25628352, 'steps': 133480, 'loss/train': 1.1229389905929565} 11/07/2021 16:00:01 - INFO - __main__ - Step 133482: {'lr': 1.5212458276878976e-05, 'samples': 25628544, 'steps': 133481, 'loss/train': 0.9787396192550659} 11/07/2021 16:00:01 - INFO - __main__ - Step 133483: {'lr': 1.5210635426678443e-05, 'samples': 25628736, 'steps': 133482, 'loss/train': 0.9037294387817383} 11/07/2021 16:00:02 - INFO - __main__ - Step 133484: {'lr': 1.5208812682270396e-05, 'samples': 25628928, 'steps': 133483, 'loss/train': 1.142722487449646} 11/07/2021 16:00:03 - INFO - __main__ - Step 133485: {'lr': 1.5206990043655583e-05, 'samples': 25629120, 'steps': 133484, 'loss/train': 1.088908314704895} 11/07/2021 16:00:03 - INFO - __main__ - Step 133486: {'lr': 1.5205167510834894e-05, 'samples': 25629312, 'steps': 133485, 'loss/train': 1.1255732774734497} 11/07/2021 16:00:03 - INFO - __main__ - Step 133487: {'lr': 1.5203345083809133e-05, 'samples': 25629504, 'steps': 133486, 'loss/train': 1.1344082355499268} 11/07/2021 16:00:04 - INFO - __main__ - Step 133488: {'lr': 1.5201522762579106e-05, 'samples': 25629696, 'steps': 133487, 'loss/train': 1.344766616821289} 11/07/2021 16:00:04 - INFO - __main__ - Step 133489: {'lr': 1.5199700547145673e-05, 'samples': 25629888, 'steps': 133488, 'loss/train': 1.5276762247085571} 11/07/2021 16:00:05 - INFO - __main__ - Step 133490: {'lr': 1.519787843750961e-05, 'samples': 25630080, 'steps': 133489, 'loss/train': 0.9695435166358948} 11/07/2021 16:00:06 - INFO - __main__ - Step 133491: {'lr': 1.5196056433671778e-05, 'samples': 25630272, 'steps': 133490, 'loss/train': 1.1852537393569946} 11/07/2021 16:00:06 - INFO - __main__ - Step 133492: {'lr': 1.5194234535632955e-05, 'samples': 25630464, 'steps': 133491, 'loss/train': 1.2191462516784668} 11/07/2021 16:00:06 - INFO - __main__ - Step 133493: {'lr': 1.5192412743394002e-05, 'samples': 25630656, 'steps': 133492, 'loss/train': 0.9292805194854736} 11/07/2021 16:00:07 - INFO - __main__ - Step 133494: {'lr': 1.5190591056955722e-05, 'samples': 25630848, 'steps': 133493, 'loss/train': 0.9211809039115906} 11/07/2021 16:00:08 - INFO - __main__ - Step 133495: {'lr': 1.5188769476318976e-05, 'samples': 25631040, 'steps': 133494, 'loss/train': 1.1371848583221436} 11/07/2021 16:00:08 - INFO - __main__ - Step 133496: {'lr': 1.5186948001484513e-05, 'samples': 25631232, 'steps': 133495, 'loss/train': 1.5433330535888672} 11/07/2021 16:00:08 - INFO - __main__ - Step 133497: {'lr': 1.5185126632453194e-05, 'samples': 25631424, 'steps': 133496, 'loss/train': 1.2166736125946045} 11/07/2021 16:00:09 - INFO - __main__ - Step 133498: {'lr': 1.5183305369225825e-05, 'samples': 25631616, 'steps': 133497, 'loss/train': 1.376258373260498} 11/07/2021 16:00:09 - INFO - __main__ - Step 133499: {'lr': 1.5181484211803238e-05, 'samples': 25631808, 'steps': 133498, 'loss/train': 1.154576063156128} 11/07/2021 16:00:10 - INFO - __main__ - Step 133500: {'lr': 1.5179663160186263e-05, 'samples': 25632000, 'steps': 133499, 'loss/train': 1.0950305461883545} 11/07/2021 16:00:10 - INFO - __main__ - Step 133501: {'lr': 1.5177842214375681e-05, 'samples': 25632192, 'steps': 133500, 'loss/train': 1.251041293144226} 11/07/2021 16:00:11 - INFO - __main__ - Step 133502: {'lr': 1.5176021374372351e-05, 'samples': 25632384, 'steps': 133501, 'loss/train': 1.0905073881149292} 11/07/2021 16:00:11 - INFO - __main__ - Step 133503: {'lr': 1.5174200640177105e-05, 'samples': 25632576, 'steps': 133502, 'loss/train': 0.9483540058135986} 11/07/2021 16:00:11 - INFO - __main__ - Step 133504: {'lr': 1.517238001179072e-05, 'samples': 25632768, 'steps': 133503, 'loss/train': 0.6024878025054932} 11/07/2021 16:00:13 - INFO - __main__ - Step 133505: {'lr': 1.517055948921403e-05, 'samples': 25632960, 'steps': 133504, 'loss/train': 1.3655858039855957} 11/07/2021 16:00:13 - INFO - __main__ - Step 133506: {'lr': 1.5168739072447896e-05, 'samples': 25633152, 'steps': 133505, 'loss/train': 1.7139979600906372} 11/07/2021 16:00:13 - INFO - __main__ - Step 133507: {'lr': 1.5166918761493092e-05, 'samples': 25633344, 'steps': 133506, 'loss/train': 1.3150752782821655} 11/07/2021 16:00:14 - INFO - __main__ - Step 133508: {'lr': 1.5165098556350426e-05, 'samples': 25633536, 'steps': 133507, 'loss/train': 1.741762399673462} 11/07/2021 16:00:14 - INFO - __main__ - Step 133509: {'lr': 1.5163278457020758e-05, 'samples': 25633728, 'steps': 133508, 'loss/train': 1.1863975524902344} 11/07/2021 16:00:15 - INFO - __main__ - Step 133510: {'lr': 1.516145846350489e-05, 'samples': 25633920, 'steps': 133509, 'loss/train': 1.75288987159729} 11/07/2021 16:00:15 - INFO - __main__ - Step 133511: {'lr': 1.515963857580363e-05, 'samples': 25634112, 'steps': 133510, 'loss/train': 1.145567774772644} 11/07/2021 16:00:16 - INFO - __main__ - Step 133512: {'lr': 1.5157818793917839e-05, 'samples': 25634304, 'steps': 133511, 'loss/train': 1.2406673431396484} 11/07/2021 16:00:16 - INFO - __main__ - Step 133513: {'lr': 1.5155999117848291e-05, 'samples': 25634496, 'steps': 133512, 'loss/train': 1.032729983329773} 11/07/2021 16:00:16 - INFO - __main__ - Step 133514: {'lr': 1.5154179547595847e-05, 'samples': 25634688, 'steps': 133513, 'loss/train': 1.1115635633468628} 11/07/2021 16:00:17 - INFO - __main__ - Step 133515: {'lr': 1.5152360083161288e-05, 'samples': 25634880, 'steps': 133514, 'loss/train': 1.1884323358535767} 11/07/2021 16:00:18 - INFO - __main__ - Step 133516: {'lr': 1.5150540724545469e-05, 'samples': 25635072, 'steps': 133515, 'loss/train': 1.2737842798233032} 11/07/2021 16:00:18 - INFO - __main__ - Step 133517: {'lr': 1.51487214717492e-05, 'samples': 25635264, 'steps': 133516, 'loss/train': 1.201399803161621} 11/07/2021 16:00:19 - INFO - __main__ - Step 133518: {'lr': 1.5146902324773282e-05, 'samples': 25635456, 'steps': 133517, 'loss/train': 1.0111677646636963} 11/07/2021 16:00:19 - INFO - __main__ - Step 133519: {'lr': 1.5145083283618522e-05, 'samples': 25635648, 'steps': 133518, 'loss/train': 1.2328811883926392} 11/07/2021 16:00:20 - INFO - __main__ - Step 133520: {'lr': 1.514326434828578e-05, 'samples': 25635840, 'steps': 133519, 'loss/train': 0.6551700234413147} 11/07/2021 16:00:20 - INFO - __main__ - Step 133521: {'lr': 1.5141445518775859e-05, 'samples': 25636032, 'steps': 133520, 'loss/train': 1.3393820524215698} 11/07/2021 16:00:21 - INFO - __main__ - Step 133522: {'lr': 1.5139626795089595e-05, 'samples': 25636224, 'steps': 133521, 'loss/train': 0.9169344305992126} 11/07/2021 16:00:21 - INFO - __main__ - Step 133523: {'lr': 1.5137808177227763e-05, 'samples': 25636416, 'steps': 133522, 'loss/train': 1.1705726385116577} 11/07/2021 16:00:21 - INFO - __main__ - Step 133524: {'lr': 1.5135989665191225e-05, 'samples': 25636608, 'steps': 133523, 'loss/train': 0.9307583570480347} 11/07/2021 16:00:22 - INFO - __main__ - Step 133525: {'lr': 1.5134171258980783e-05, 'samples': 25636800, 'steps': 133524, 'loss/train': 1.573422908782959} 11/07/2021 16:00:23 - INFO - __main__ - Step 133526: {'lr': 1.5132352958597245e-05, 'samples': 25636992, 'steps': 133525, 'loss/train': 1.2648118734359741} 11/07/2021 16:00:23 - INFO - __main__ - Step 133527: {'lr': 1.513053476404147e-05, 'samples': 25637184, 'steps': 133526, 'loss/train': 0.7685335874557495} 11/07/2021 16:00:24 - INFO - __main__ - Step 133528: {'lr': 1.5128716675314263e-05, 'samples': 25637376, 'steps': 133527, 'loss/train': 0.8966537117958069} 11/07/2021 16:00:24 - INFO - __main__ - Step 133529: {'lr': 1.5126898692416402e-05, 'samples': 25637568, 'steps': 133528, 'loss/train': 1.1092514991760254} 11/07/2021 16:00:24 - INFO - __main__ - Step 133530: {'lr': 1.5125080815348747e-05, 'samples': 25637760, 'steps': 133529, 'loss/train': 1.2965407371520996} 11/07/2021 16:00:25 - INFO - __main__ - Step 133531: {'lr': 1.51232630441121e-05, 'samples': 25637952, 'steps': 133530, 'loss/train': 1.4364588260650635} 11/07/2021 16:00:26 - INFO - __main__ - Step 133532: {'lr': 1.51214453787073e-05, 'samples': 25638144, 'steps': 133531, 'loss/train': 1.4709070920944214} 11/07/2021 16:00:26 - INFO - __main__ - Step 133533: {'lr': 1.5119627819135118e-05, 'samples': 25638336, 'steps': 133532, 'loss/train': 1.4895626306533813} 11/07/2021 16:00:27 - INFO - __main__ - Step 133534: {'lr': 1.5117810365396444e-05, 'samples': 25638528, 'steps': 133533, 'loss/train': 1.3920022249221802} 11/07/2021 16:00:27 - INFO - __main__ - Step 133535: {'lr': 1.511599301749203e-05, 'samples': 25638720, 'steps': 133534, 'loss/train': 1.3131017684936523} 11/07/2021 16:00:27 - INFO - __main__ - Step 133536: {'lr': 1.5114175775422761e-05, 'samples': 25638912, 'steps': 133535, 'loss/train': 0.5969184637069702} 11/07/2021 16:00:28 - INFO - __main__ - Step 133537: {'lr': 1.5112358639189388e-05, 'samples': 25639104, 'steps': 133536, 'loss/train': 3.882204055786133} 11/07/2021 16:00:29 - INFO - __main__ - Step 133538: {'lr': 1.5110541608792772e-05, 'samples': 25639296, 'steps': 133537, 'loss/train': 1.7639299631118774} 11/07/2021 16:00:29 - INFO - __main__ - Step 133539: {'lr': 1.5108724684233771e-05, 'samples': 25639488, 'steps': 133538, 'loss/train': 1.1752954721450806} 11/07/2021 16:00:29 - INFO - __main__ - Step 133540: {'lr': 1.5106907865513109e-05, 'samples': 25639680, 'steps': 133539, 'loss/train': 1.557336688041687} 11/07/2021 16:00:30 - INFO - __main__ - Step 133541: {'lr': 1.5105091152631645e-05, 'samples': 25639872, 'steps': 133540, 'loss/train': 1.4434378147125244} 11/07/2021 16:00:31 - INFO - __main__ - Step 133542: {'lr': 1.5103274545590185e-05, 'samples': 25640064, 'steps': 133541, 'loss/train': 1.327152132987976} 11/07/2021 16:00:31 - INFO - __main__ - Step 133543: {'lr': 1.5101458044389588e-05, 'samples': 25640256, 'steps': 133542, 'loss/train': 1.9250235557556152} 11/07/2021 16:00:31 - INFO - __main__ - Step 133544: {'lr': 1.509964164903066e-05, 'samples': 25640448, 'steps': 133543, 'loss/train': 1.6312130689620972} 11/07/2021 16:00:32 - INFO - __main__ - Step 133545: {'lr': 1.5097825359514178e-05, 'samples': 25640640, 'steps': 133544, 'loss/train': 1.3443386554718018} 11/07/2021 16:00:32 - INFO - __main__ - Step 133546: {'lr': 1.5096009175841003e-05, 'samples': 25640832, 'steps': 133545, 'loss/train': 0.6402928233146667} 11/07/2021 16:00:33 - INFO - __main__ - Step 133547: {'lr': 1.5094193098011939e-05, 'samples': 25641024, 'steps': 133546, 'loss/train': 0.03694244474172592} 11/07/2021 16:00:33 - INFO - __main__ - Step 133548: {'lr': 1.5092377126027818e-05, 'samples': 25641216, 'steps': 133547, 'loss/train': 1.6745821237564087} 11/07/2021 16:00:34 - INFO - __main__ - Step 133549: {'lr': 1.5090561259889447e-05, 'samples': 25641408, 'steps': 133548, 'loss/train': 1.0885188579559326} 11/07/2021 16:00:34 - INFO - __main__ - Step 133550: {'lr': 1.5088745499597628e-05, 'samples': 25641600, 'steps': 133549, 'loss/train': 1.3226524591445923} 11/07/2021 16:00:35 - INFO - __main__ - Step 133551: {'lr': 1.5086929845153224e-05, 'samples': 25641792, 'steps': 133550, 'loss/train': 0.08871004730463028} 11/07/2021 16:00:35 - INFO - __main__ - Step 133552: {'lr': 1.508511429655704e-05, 'samples': 25641984, 'steps': 133551, 'loss/train': 1.4700627326965332} 11/07/2021 16:00:36 - INFO - __main__ - Step 133553: {'lr': 1.508329885380985e-05, 'samples': 25642176, 'steps': 133552, 'loss/train': 1.4123828411102295} 11/07/2021 16:00:36 - INFO - __main__ - Step 133554: {'lr': 1.5081483516912493e-05, 'samples': 25642368, 'steps': 133553, 'loss/train': 0.6037673950195312} 11/07/2021 16:00:37 - INFO - __main__ - Step 133555: {'lr': 1.5079668285865795e-05, 'samples': 25642560, 'steps': 133554, 'loss/train': 1.445207953453064} 11/07/2021 16:00:37 - INFO - __main__ - Step 133556: {'lr': 1.5077853160670563e-05, 'samples': 25642752, 'steps': 133555, 'loss/train': 1.3644516468048096} 11/07/2021 16:00:37 - INFO - __main__ - Step 133557: {'lr': 1.5076038141327659e-05, 'samples': 25642944, 'steps': 133556, 'loss/train': 1.181258201599121} 11/07/2021 16:00:38 - INFO - __main__ - Step 133558: {'lr': 1.507422322783783e-05, 'samples': 25643136, 'steps': 133557, 'loss/train': 1.1917445659637451} 11/07/2021 16:00:39 - INFO - __main__ - Step 133559: {'lr': 1.5072408420201939e-05, 'samples': 25643328, 'steps': 133558, 'loss/train': 1.0921417474746704} 11/07/2021 16:00:39 - INFO - __main__ - Step 133560: {'lr': 1.5070593718420817e-05, 'samples': 25643520, 'steps': 133559, 'loss/train': 1.313685417175293} 11/07/2021 16:00:39 - INFO - __main__ - Step 133561: {'lr': 1.5068779122495241e-05, 'samples': 25643712, 'steps': 133560, 'loss/train': 1.6331886053085327} 11/07/2021 16:00:40 - INFO - __main__ - Step 133562: {'lr': 1.5066964632426045e-05, 'samples': 25643904, 'steps': 133561, 'loss/train': 0.7696554064750671} 11/07/2021 16:00:41 - INFO - __main__ - Step 133563: {'lr': 1.506515024821406e-05, 'samples': 25644096, 'steps': 133562, 'loss/train': 1.3510578870773315} 11/07/2021 16:00:41 - INFO - __main__ - Step 133564: {'lr': 1.5063335969860093e-05, 'samples': 25644288, 'steps': 133563, 'loss/train': 1.4150351285934448} 11/07/2021 16:00:41 - INFO - __main__ - Step 133565: {'lr': 1.5061521797365002e-05, 'samples': 25644480, 'steps': 133564, 'loss/train': 1.4155738353729248} 11/07/2021 16:00:42 - INFO - __main__ - Step 133566: {'lr': 1.5059707730729511e-05, 'samples': 25644672, 'steps': 133565, 'loss/train': 0.2865176200866699} 11/07/2021 16:00:42 - INFO - __main__ - Step 133567: {'lr': 1.5057893769954505e-05, 'samples': 25644864, 'steps': 133566, 'loss/train': 1.0388368368148804} 11/07/2021 16:00:43 - INFO - __main__ - Step 133568: {'lr': 1.5056079915040793e-05, 'samples': 25645056, 'steps': 133567, 'loss/train': 1.1766433715820312} 11/07/2021 16:00:44 - INFO - __main__ - Step 133569: {'lr': 1.5054266165989177e-05, 'samples': 25645248, 'steps': 133568, 'loss/train': 1.0313228368759155} 11/07/2021 16:00:44 - INFO - __main__ - Step 133570: {'lr': 1.505245252280049e-05, 'samples': 25645440, 'steps': 133569, 'loss/train': 1.4556564092636108} 11/07/2021 16:00:44 - INFO - __main__ - Step 133571: {'lr': 1.5050638985475512e-05, 'samples': 25645632, 'steps': 133570, 'loss/train': 1.0117782354354858} 11/07/2021 16:00:45 - INFO - __main__ - Step 133572: {'lr': 1.5048825554015127e-05, 'samples': 25645824, 'steps': 133571, 'loss/train': 0.48384833335876465} 11/07/2021 16:00:46 - INFO - __main__ - Step 133573: {'lr': 1.5047012228420088e-05, 'samples': 25646016, 'steps': 133572, 'loss/train': 1.2216804027557373} 11/07/2021 16:00:46 - INFO - __main__ - Step 133574: {'lr': 1.5045199008691251e-05, 'samples': 25646208, 'steps': 133573, 'loss/train': 1.0623501539230347} 11/07/2021 16:00:47 - INFO - __main__ - Step 133575: {'lr': 1.5043385894829425e-05, 'samples': 25646400, 'steps': 133574, 'loss/train': 1.2822424173355103} 11/07/2021 16:00:47 - INFO - __main__ - Step 133576: {'lr': 1.5041572886835442e-05, 'samples': 25646592, 'steps': 133575, 'loss/train': 1.3842965364456177} 11/07/2021 16:00:47 - INFO - __main__ - Step 133577: {'lr': 1.5039759984710078e-05, 'samples': 25646784, 'steps': 133576, 'loss/train': 1.3921135663986206} 11/07/2021 16:00:49 - INFO - __main__ - Step 133578: {'lr': 1.5037947188454166e-05, 'samples': 25646976, 'steps': 133577, 'loss/train': 0.07509411871433258} 11/07/2021 16:00:49 - INFO - __main__ - Step 133579: {'lr': 1.5036134498068593e-05, 'samples': 25647168, 'steps': 133578, 'loss/train': 1.367257833480835} 11/07/2021 16:00:49 - INFO - __main__ - Step 133580: {'lr': 1.5034321913554055e-05, 'samples': 25647360, 'steps': 133579, 'loss/train': 1.1431751251220703} 11/07/2021 16:00:50 - INFO - __main__ - Step 133581: {'lr': 1.5032509434911411e-05, 'samples': 25647552, 'steps': 133580, 'loss/train': 1.5402910709381104} 11/07/2021 16:00:50 - INFO - __main__ - Step 133582: {'lr': 1.5030697062141524e-05, 'samples': 25647744, 'steps': 133581, 'loss/train': 1.1609033346176147} 11/07/2021 16:00:51 - INFO - __main__ - Step 133583: {'lr': 1.5028884795245167e-05, 'samples': 25647936, 'steps': 133582, 'loss/train': 1.0944080352783203} 11/07/2021 16:00:51 - INFO - __main__ - Step 133584: {'lr': 1.5027072634223148e-05, 'samples': 25648128, 'steps': 133583, 'loss/train': 1.6099741458892822} 11/07/2021 16:00:52 - INFO - __main__ - Step 133585: {'lr': 1.5025260579076328e-05, 'samples': 25648320, 'steps': 133584, 'loss/train': 0.790341854095459} 11/07/2021 16:00:52 - INFO - __main__ - Step 133586: {'lr': 1.5023448629805509e-05, 'samples': 25648512, 'steps': 133585, 'loss/train': 1.1521481275558472} 11/07/2021 16:00:52 - INFO - __main__ - Step 133587: {'lr': 1.5021636786411469e-05, 'samples': 25648704, 'steps': 133586, 'loss/train': 0.9199879765510559} 11/07/2021 16:00:53 - INFO - __main__ - Step 133588: {'lr': 1.501982504889507e-05, 'samples': 25648896, 'steps': 133587, 'loss/train': 1.2427873611450195} 11/07/2021 16:00:54 - INFO - __main__ - Step 133589: {'lr': 1.5018013417257115e-05, 'samples': 25649088, 'steps': 133588, 'loss/train': 1.2123862504959106} 11/07/2021 16:00:54 - INFO - __main__ - Step 133590: {'lr': 1.501620189149841e-05, 'samples': 25649280, 'steps': 133589, 'loss/train': 1.0904055833816528} 11/07/2021 16:00:54 - INFO - __main__ - Step 133591: {'lr': 1.5014390471619787e-05, 'samples': 25649472, 'steps': 133590, 'loss/train': 0.5363612174987793} 11/07/2021 16:00:55 - INFO - __main__ - Step 133592: {'lr': 1.5012579157622081e-05, 'samples': 25649664, 'steps': 133591, 'loss/train': 1.6812316179275513} 11/07/2021 16:00:56 - INFO - __main__ - Step 133593: {'lr': 1.5010767949506065e-05, 'samples': 25649856, 'steps': 133592, 'loss/train': 1.1744012832641602} 11/07/2021 16:00:56 - INFO - __main__ - Step 133594: {'lr': 1.5008956847272549e-05, 'samples': 25650048, 'steps': 133593, 'loss/train': 1.631417989730835} 11/07/2021 16:00:57 - INFO - __main__ - Step 133595: {'lr': 1.500714585092236e-05, 'samples': 25650240, 'steps': 133594, 'loss/train': 0.9565736651420593} 11/07/2021 16:00:57 - INFO - __main__ - Step 133596: {'lr': 1.5005334960456363e-05, 'samples': 25650432, 'steps': 133595, 'loss/train': 1.4983576536178589} 11/07/2021 16:00:57 - INFO - __main__ - Step 133597: {'lr': 1.5003524175875306e-05, 'samples': 25650624, 'steps': 133596, 'loss/train': 0.9956684112548828} 11/07/2021 16:00:58 - INFO - __main__ - Step 133598: {'lr': 1.5001713497180047e-05, 'samples': 25650816, 'steps': 133597, 'loss/train': 1.2966609001159668} 11/07/2021 16:00:59 - INFO - __main__ - Step 133599: {'lr': 1.4999902924371367e-05, 'samples': 25651008, 'steps': 133598, 'loss/train': 1.0654723644256592} 11/07/2021 16:00:59 - INFO - __main__ - Step 133600: {'lr': 1.4998092457450124e-05, 'samples': 25651200, 'steps': 133599, 'loss/train': 1.7239304780960083} 11/07/2021 16:00:59 - INFO - __main__ - Step 133601: {'lr': 1.4996282096417125e-05, 'samples': 25651392, 'steps': 133600, 'loss/train': 1.2238550186157227} 11/07/2021 16:01:00 - INFO - __main__ - Step 133602: {'lr': 1.4994471841273144e-05, 'samples': 25651584, 'steps': 133601, 'loss/train': 1.3915750980377197} 11/07/2021 16:01:01 - INFO - __main__ - Step 133603: {'lr': 1.4992661692019072e-05, 'samples': 25651776, 'steps': 133602, 'loss/train': 1.4702924489974976} 11/07/2021 16:01:01 - INFO - __main__ - Step 133604: {'lr': 1.4990851648655629e-05, 'samples': 25651968, 'steps': 133603, 'loss/train': 0.6123383641242981} 11/07/2021 16:01:02 - INFO - __main__ - Step 133605: {'lr': 1.4989041711183732e-05, 'samples': 25652160, 'steps': 133604, 'loss/train': 1.2145378589630127} 11/07/2021 16:01:02 - INFO - __main__ - Step 133606: {'lr': 1.4987231879604157e-05, 'samples': 25652352, 'steps': 133605, 'loss/train': 1.1242533922195435} 11/07/2021 16:01:02 - INFO - __main__ - Step 133607: {'lr': 1.4985422153917654e-05, 'samples': 25652544, 'steps': 133606, 'loss/train': 1.24008047580719} 11/07/2021 16:01:03 - INFO - __main__ - Step 133608: {'lr': 1.4983612534125113e-05, 'samples': 25652736, 'steps': 133607, 'loss/train': 1.1142677068710327} 11/07/2021 16:01:04 - INFO - __main__ - Step 133609: {'lr': 1.4981803020227336e-05, 'samples': 25652928, 'steps': 133608, 'loss/train': 1.3556023836135864} 11/07/2021 16:01:04 - INFO - __main__ - Step 133610: {'lr': 1.497999361222513e-05, 'samples': 25653120, 'steps': 133609, 'loss/train': 0.548197329044342} 11/07/2021 16:01:04 - INFO - __main__ - Step 133611: {'lr': 1.4978184310119297e-05, 'samples': 25653312, 'steps': 133610, 'loss/train': 1.4736881256103516} 11/07/2021 16:01:05 - INFO - __main__ - Step 133612: {'lr': 1.4976375113910672e-05, 'samples': 25653504, 'steps': 133611, 'loss/train': 1.0582512617111206} 11/07/2021 16:01:05 - INFO - __main__ - Step 133613: {'lr': 1.4974566023600089e-05, 'samples': 25653696, 'steps': 133612, 'loss/train': 1.179755687713623} 11/07/2021 16:01:06 - INFO - __main__ - Step 133614: {'lr': 1.4972757039188322e-05, 'samples': 25653888, 'steps': 133613, 'loss/train': 1.2932720184326172} 11/07/2021 16:01:07 - INFO - __main__ - Step 133615: {'lr': 1.4970948160676206e-05, 'samples': 25654080, 'steps': 133614, 'loss/train': 1.1017427444458008} 11/07/2021 16:01:07 - INFO - __main__ - Step 133616: {'lr': 1.4969139388064545e-05, 'samples': 25654272, 'steps': 133615, 'loss/train': 1.3793165683746338} 11/07/2021 16:01:07 - INFO - __main__ - Step 133617: {'lr': 1.4967330721354171e-05, 'samples': 25654464, 'steps': 133616, 'loss/train': 1.0492870807647705} 11/07/2021 16:01:08 - INFO - __main__ - Step 133618: {'lr': 1.496552216054589e-05, 'samples': 25654656, 'steps': 133617, 'loss/train': 0.8125311732292175} 11/07/2021 16:01:09 - INFO - __main__ - Step 133619: {'lr': 1.4963713705640564e-05, 'samples': 25654848, 'steps': 133618, 'loss/train': 1.1753357648849487} 11/07/2021 16:01:09 - INFO - __main__ - Step 133620: {'lr': 1.4961905356638911e-05, 'samples': 25655040, 'steps': 133619, 'loss/train': 1.8505820035934448} 11/07/2021 16:01:10 - INFO - __main__ - Step 133621: {'lr': 1.4960097113541793e-05, 'samples': 25655232, 'steps': 133620, 'loss/train': 1.107725739479065} 11/07/2021 16:01:10 - INFO - __main__ - Step 133622: {'lr': 1.4958288976350043e-05, 'samples': 25655424, 'steps': 133621, 'loss/train': 0.8289036154747009} 11/07/2021 16:01:10 - INFO - __main__ - Step 133623: {'lr': 1.4956480945064467e-05, 'samples': 25655616, 'steps': 133622, 'loss/train': 1.0095322132110596} 11/07/2021 16:01:11 - INFO - __main__ - Step 133624: {'lr': 1.4954673019685866e-05, 'samples': 25655808, 'steps': 133623, 'loss/train': 0.5871303677558899} 11/07/2021 16:01:12 - INFO - __main__ - Step 133625: {'lr': 1.495286520021505e-05, 'samples': 25656000, 'steps': 133624, 'loss/train': 1.4830904006958008} 11/07/2021 16:01:12 - INFO - __main__ - Step 133626: {'lr': 1.4951057486652846e-05, 'samples': 25656192, 'steps': 133625, 'loss/train': 1.4075483083724976} 11/07/2021 16:01:12 - INFO - __main__ - Step 133627: {'lr': 1.4949249879000092e-05, 'samples': 25656384, 'steps': 133626, 'loss/train': 1.3433263301849365} 11/07/2021 16:01:13 - INFO - __main__ - Step 133628: {'lr': 1.4947442377257563e-05, 'samples': 25656576, 'steps': 133627, 'loss/train': 1.2930152416229248} 11/07/2021 16:01:14 - INFO - __main__ - Step 133629: {'lr': 1.4945634981426093e-05, 'samples': 25656768, 'steps': 133628, 'loss/train': 1.8922675848007202} 11/07/2021 16:01:14 - INFO - __main__ - Step 133630: {'lr': 1.4943827691506485e-05, 'samples': 25656960, 'steps': 133629, 'loss/train': 0.9402651190757751} 11/07/2021 16:01:15 - INFO - __main__ - Step 133631: {'lr': 1.494202050749957e-05, 'samples': 25657152, 'steps': 133630, 'loss/train': 1.260586142539978} 11/07/2021 16:01:15 - INFO - __main__ - Step 133632: {'lr': 1.4940213429406185e-05, 'samples': 25657344, 'steps': 133631, 'loss/train': 1.4456958770751953} 11/07/2021 16:01:15 - INFO - __main__ - Step 133633: {'lr': 1.4938406457227078e-05, 'samples': 25657536, 'steps': 133632, 'loss/train': 1.4262571334838867} 11/07/2021 16:01:16 - INFO - __main__ - Step 133634: {'lr': 1.4936599590963079e-05, 'samples': 25657728, 'steps': 133633, 'loss/train': 1.57718825340271} 11/07/2021 16:01:17 - INFO - __main__ - Step 133635: {'lr': 1.4934792830615051e-05, 'samples': 25657920, 'steps': 133634, 'loss/train': 1.4080173969268799} 11/07/2021 16:01:17 - INFO - __main__ - Step 133636: {'lr': 1.4932986176183772e-05, 'samples': 25658112, 'steps': 133635, 'loss/train': 0.8116325736045837} 11/07/2021 16:01:17 - INFO - __main__ - Step 133637: {'lr': 1.4931179627670045e-05, 'samples': 25658304, 'steps': 133636, 'loss/train': 1.2340916395187378} 11/07/2021 16:01:18 - INFO - __main__ - Step 133638: {'lr': 1.492937318507473e-05, 'samples': 25658496, 'steps': 133637, 'loss/train': 1.2378686666488647} 11/07/2021 16:01:18 - INFO - __main__ - Step 133639: {'lr': 1.4927566848398577e-05, 'samples': 25658688, 'steps': 133638, 'loss/train': 1.2252998352050781} 11/07/2021 16:01:19 - INFO - __main__ - Step 133640: {'lr': 1.4925760617642448e-05, 'samples': 25658880, 'steps': 133639, 'loss/train': 1.1730080842971802} 11/07/2021 16:01:19 - INFO - __main__ - Step 133641: {'lr': 1.4923954492807146e-05, 'samples': 25659072, 'steps': 133640, 'loss/train': 1.2526299953460693} 11/07/2021 16:01:20 - INFO - __main__ - Step 133642: {'lr': 1.4922148473893504e-05, 'samples': 25659264, 'steps': 133641, 'loss/train': 1.2620221376419067} 11/07/2021 16:01:20 - INFO - __main__ - Step 133643: {'lr': 1.49203425609023e-05, 'samples': 25659456, 'steps': 133642, 'loss/train': 1.0602794885635376} 11/07/2021 16:01:21 - INFO - __main__ - Step 133644: {'lr': 1.4918536753834339e-05, 'samples': 25659648, 'steps': 133643, 'loss/train': 1.1545356512069702} 11/07/2021 16:01:22 - INFO - __main__ - Step 133645: {'lr': 1.491673105269048e-05, 'samples': 25659840, 'steps': 133644, 'loss/train': 1.0306438207626343} 11/07/2021 16:01:22 - INFO - __main__ - Step 133646: {'lr': 1.4914925457471556e-05, 'samples': 25660032, 'steps': 133645, 'loss/train': 1.6145652532577515} 11/07/2021 16:01:22 - INFO - __main__ - Step 133647: {'lr': 1.4913119968178291e-05, 'samples': 25660224, 'steps': 133646, 'loss/train': 0.45884332060813904} 11/07/2021 16:01:23 - INFO - __main__ - Step 133648: {'lr': 1.4911314584811543e-05, 'samples': 25660416, 'steps': 133647, 'loss/train': 1.1123521327972412} 11/07/2021 16:01:23 - INFO - __main__ - Step 133649: {'lr': 1.4909509307372144e-05, 'samples': 25660608, 'steps': 133648, 'loss/train': 1.0590379238128662} 11/07/2021 16:01:23 - INFO - __main__ - Step 133650: {'lr': 1.4907704135860872e-05, 'samples': 25660800, 'steps': 133649, 'loss/train': 1.279904842376709} 11/07/2021 16:01:24 - INFO - __main__ - Step 133651: {'lr': 1.490589907027859e-05, 'samples': 25660992, 'steps': 133650, 'loss/train': 1.3126522302627563} 11/07/2021 16:01:25 - INFO - __main__ - Step 133652: {'lr': 1.4904094110626042e-05, 'samples': 25661184, 'steps': 133651, 'loss/train': 1.466873288154602} 11/07/2021 16:01:25 - INFO - __main__ - Step 133653: {'lr': 1.4902289256904122e-05, 'samples': 25661376, 'steps': 133652, 'loss/train': 1.1294869184494019} 11/07/2021 16:01:25 - INFO - __main__ - Step 133654: {'lr': 1.4900484509113576e-05, 'samples': 25661568, 'steps': 133653, 'loss/train': 1.045506477355957} 11/07/2021 16:01:26 - INFO - __main__ - Step 133655: {'lr': 1.4898679867255265e-05, 'samples': 25661760, 'steps': 133654, 'loss/train': 1.337867021560669} 11/07/2021 16:01:27 - INFO - __main__ - Step 133656: {'lr': 1.4896875331329968e-05, 'samples': 25661952, 'steps': 133655, 'loss/train': 0.7756513953208923} 11/07/2021 16:01:27 - INFO - __main__ - Step 133657: {'lr': 1.4895070901338515e-05, 'samples': 25662144, 'steps': 133656, 'loss/train': 1.3823513984680176} 11/07/2021 16:01:27 - INFO - __main__ - Step 133658: {'lr': 1.4893266577281711e-05, 'samples': 25662336, 'steps': 133657, 'loss/train': 1.4237054586410522} 11/07/2021 16:01:28 - INFO - __main__ - Step 133659: {'lr': 1.489146235916039e-05, 'samples': 25662528, 'steps': 133658, 'loss/train': 0.9623485803604126} 11/07/2021 16:01:28 - INFO - __main__ - Step 133660: {'lr': 1.4889658246975357e-05, 'samples': 25662720, 'steps': 133659, 'loss/train': 1.7553619146347046} 11/07/2021 16:01:29 - INFO - __main__ - Step 133661: {'lr': 1.4887854240727388e-05, 'samples': 25662912, 'steps': 133660, 'loss/train': 1.4981192350387573} 11/07/2021 16:01:30 - INFO - __main__ - Step 133662: {'lr': 1.4886050340417318e-05, 'samples': 25663104, 'steps': 133661, 'loss/train': 1.2380298376083374} 11/07/2021 16:01:30 - INFO - __main__ - Step 133663: {'lr': 1.4884246546045976e-05, 'samples': 25663296, 'steps': 133662, 'loss/train': 1.387298345565796} 11/07/2021 16:01:30 - INFO - __main__ - Step 133664: {'lr': 1.488244285761417e-05, 'samples': 25663488, 'steps': 133663, 'loss/train': 0.5896843075752258} 11/07/2021 16:01:31 - INFO - __main__ - Step 133665: {'lr': 1.4880639275122704e-05, 'samples': 25663680, 'steps': 133664, 'loss/train': 1.1072179079055786} 11/07/2021 16:01:32 - INFO - __main__ - Step 133666: {'lr': 1.4878835798572383e-05, 'samples': 25663872, 'steps': 133665, 'loss/train': 0.045588407665491104} 11/07/2021 16:01:32 - INFO - __main__ - Step 133667: {'lr': 1.4877032427964038e-05, 'samples': 25664064, 'steps': 133666, 'loss/train': 1.2698994874954224} 11/07/2021 16:01:32 - INFO - __main__ - Step 133668: {'lr': 1.4875229163298476e-05, 'samples': 25664256, 'steps': 133667, 'loss/train': 1.1659457683563232} 11/07/2021 16:01:33 - INFO - __main__ - Step 133669: {'lr': 1.4873426004576502e-05, 'samples': 25664448, 'steps': 133668, 'loss/train': 1.259810209274292} 11/07/2021 16:01:33 - INFO - __main__ - Step 133670: {'lr': 1.4871622951798946e-05, 'samples': 25664640, 'steps': 133669, 'loss/train': 0.8819175362586975} 11/07/2021 16:01:35 - INFO - __main__ - Step 133671: {'lr': 1.4869820004966589e-05, 'samples': 25664832, 'steps': 133670, 'loss/train': 1.0948220491409302} 11/07/2021 16:01:35 - INFO - __main__ - Step 133672: {'lr': 1.486801716408029e-05, 'samples': 25665024, 'steps': 133671, 'loss/train': 1.1784077882766724} 11/07/2021 16:01:35 - INFO - __main__ - Step 133673: {'lr': 1.486621442914085e-05, 'samples': 25665216, 'steps': 133672, 'loss/train': 1.2468059062957764} 11/07/2021 16:01:36 - INFO - __main__ - Step 133674: {'lr': 1.4864411800149024e-05, 'samples': 25665408, 'steps': 133673, 'loss/train': 1.1404377222061157} 11/07/2021 16:01:36 - INFO - __main__ - Step 133675: {'lr': 1.486260927710567e-05, 'samples': 25665600, 'steps': 133674, 'loss/train': 1.530545949935913} 11/07/2021 16:01:37 - INFO - __main__ - Step 133676: {'lr': 1.4860806860011594e-05, 'samples': 25665792, 'steps': 133675, 'loss/train': 0.03876110166311264} 11/07/2021 16:01:37 - INFO - __main__ - Step 133677: {'lr': 1.4859004548867628e-05, 'samples': 25665984, 'steps': 133676, 'loss/train': 1.3922425508499146} 11/07/2021 16:01:38 - INFO - __main__ - Step 133678: {'lr': 1.4857202343674548e-05, 'samples': 25666176, 'steps': 133677, 'loss/train': 1.1401125192642212} 11/07/2021 16:01:38 - INFO - __main__ - Step 133679: {'lr': 1.4855400244433187e-05, 'samples': 25666368, 'steps': 133678, 'loss/train': 1.3195725679397583} 11/07/2021 16:01:38 - INFO - __main__ - Step 133680: {'lr': 1.4853598251144379e-05, 'samples': 25666560, 'steps': 133679, 'loss/train': 1.2190403938293457} 11/07/2021 16:01:39 - INFO - __main__ - Step 133681: {'lr': 1.4851796363808872e-05, 'samples': 25666752, 'steps': 133680, 'loss/train': 1.1262004375457764} 11/07/2021 16:01:40 - INFO - __main__ - Step 133682: {'lr': 1.4849994582427556e-05, 'samples': 25666944, 'steps': 133681, 'loss/train': 1.106351613998413} 11/07/2021 16:01:40 - INFO - __main__ - Step 133683: {'lr': 1.4848192907001178e-05, 'samples': 25667136, 'steps': 133682, 'loss/train': 1.0656951665878296} 11/07/2021 16:01:40 - INFO - __main__ - Step 133684: {'lr': 1.4846391337530574e-05, 'samples': 25667328, 'steps': 133683, 'loss/train': 1.022316813468933} 11/07/2021 16:01:41 - INFO - __main__ - Step 133685: {'lr': 1.4844589874016573e-05, 'samples': 25667520, 'steps': 133684, 'loss/train': 1.3356460332870483} 11/07/2021 16:01:41 - INFO - __main__ - Step 133686: {'lr': 1.4842788516459981e-05, 'samples': 25667712, 'steps': 133685, 'loss/train': 1.6725008487701416} 11/07/2021 16:01:42 - INFO - __main__ - Step 133687: {'lr': 1.4840987264861606e-05, 'samples': 25667904, 'steps': 133686, 'loss/train': 1.4696178436279297} 11/07/2021 16:01:43 - INFO - __main__ - Step 133688: {'lr': 1.483918611922222e-05, 'samples': 25668096, 'steps': 133687, 'loss/train': 1.2991317510604858} 11/07/2021 16:01:43 - INFO - __main__ - Step 133689: {'lr': 1.4837385079542659e-05, 'samples': 25668288, 'steps': 133688, 'loss/train': 0.8758544325828552} 11/07/2021 16:01:43 - INFO - __main__ - Step 133690: {'lr': 1.4835584145823783e-05, 'samples': 25668480, 'steps': 133689, 'loss/train': 1.4666067361831665} 11/07/2021 16:01:44 - INFO - __main__ - Step 133691: {'lr': 1.483378331806634e-05, 'samples': 25668672, 'steps': 133690, 'loss/train': 0.7848942279815674} 11/07/2021 16:01:45 - INFO - __main__ - Step 133692: {'lr': 1.4831982596271164e-05, 'samples': 25668864, 'steps': 133691, 'loss/train': 1.1719286441802979} 11/07/2021 16:01:45 - INFO - __main__ - Step 133693: {'lr': 1.4830181980439061e-05, 'samples': 25669056, 'steps': 133692, 'loss/train': 1.3990033864974976} 11/07/2021 16:01:45 - INFO - __main__ - Step 133694: {'lr': 1.4828381470570861e-05, 'samples': 25669248, 'steps': 133693, 'loss/train': 1.1657028198242188} 11/07/2021 16:01:46 - INFO - __main__ - Step 133695: {'lr': 1.4826581066667372e-05, 'samples': 25669440, 'steps': 133694, 'loss/train': 1.14614737033844} 11/07/2021 16:01:46 - INFO - __main__ - Step 133696: {'lr': 1.4824780768729369e-05, 'samples': 25669632, 'steps': 133695, 'loss/train': 1.1605526208877563} 11/07/2021 16:01:47 - INFO - __main__ - Step 133697: {'lr': 1.482298057675771e-05, 'samples': 25669824, 'steps': 133696, 'loss/train': 1.2055679559707642} 11/07/2021 16:01:47 - INFO - __main__ - Step 133698: {'lr': 1.4821180490753205e-05, 'samples': 25670016, 'steps': 133697, 'loss/train': 1.3441550731658936} 11/07/2021 16:01:48 - INFO - __main__ - Step 133699: {'lr': 1.481938051071663e-05, 'samples': 25670208, 'steps': 133698, 'loss/train': 0.8391377925872803} 11/07/2021 16:01:48 - INFO - __main__ - Step 133700: {'lr': 1.4817580636648842e-05, 'samples': 25670400, 'steps': 133699, 'loss/train': 1.49673330783844} 11/07/2021 16:01:49 - INFO - __main__ - Step 133701: {'lr': 1.4815780868550593e-05, 'samples': 25670592, 'steps': 133700, 'loss/train': 1.2511035203933716} 11/07/2021 16:01:50 - INFO - __main__ - Step 133702: {'lr': 1.4813981206422716e-05, 'samples': 25670784, 'steps': 133701, 'loss/train': 0.44580620527267456} 11/07/2021 16:01:50 - INFO - __main__ - Step 133703: {'lr': 1.4812181650266043e-05, 'samples': 25670976, 'steps': 133702, 'loss/train': 0.962001621723175} 11/07/2021 16:01:50 - INFO - __main__ - Step 133704: {'lr': 1.4810382200081351e-05, 'samples': 25671168, 'steps': 133703, 'loss/train': 1.0933204889297485} 11/07/2021 16:01:51 - INFO - __main__ - Step 133705: {'lr': 1.4808582855869501e-05, 'samples': 25671360, 'steps': 133704, 'loss/train': 1.2516615390777588} 11/07/2021 16:01:51 - INFO - __main__ - Step 133706: {'lr': 1.4806783617631242e-05, 'samples': 25671552, 'steps': 133705, 'loss/train': 0.726714015007019} 11/07/2021 16:01:52 - INFO - __main__ - Step 133707: {'lr': 1.4804984485367434e-05, 'samples': 25671744, 'steps': 133706, 'loss/train': 1.356138825416565} 11/07/2021 16:01:52 - INFO - __main__ - Step 133708: {'lr': 1.4803185459078882e-05, 'samples': 25671936, 'steps': 133707, 'loss/train': 1.5253446102142334} 11/07/2021 16:01:53 - INFO - __main__ - Step 133709: {'lr': 1.4801386538766365e-05, 'samples': 25672128, 'steps': 133708, 'loss/train': 0.6713560223579407} 11/07/2021 16:01:53 - INFO - __main__ - Step 133710: {'lr': 1.4799587724430741e-05, 'samples': 25672320, 'steps': 133709, 'loss/train': 1.1433210372924805} 11/07/2021 16:01:53 - INFO - __main__ - Step 133711: {'lr': 1.4797789016072761e-05, 'samples': 25672512, 'steps': 133710, 'loss/train': 1.250730276107788} 11/07/2021 16:01:55 - INFO - __main__ - Step 133712: {'lr': 1.4795990413693283e-05, 'samples': 25672704, 'steps': 133711, 'loss/train': 1.0832970142364502} 11/07/2021 16:01:55 - INFO - __main__ - Step 133713: {'lr': 1.4794191917293143e-05, 'samples': 25672896, 'steps': 133712, 'loss/train': 1.378027081489563} 11/07/2021 16:01:55 - INFO - __main__ - Step 133714: {'lr': 1.479239352687306e-05, 'samples': 25673088, 'steps': 133713, 'loss/train': 0.7907803654670715} 11/07/2021 16:01:56 - INFO - __main__ - Step 133715: {'lr': 1.4790595242433925e-05, 'samples': 25673280, 'steps': 133714, 'loss/train': 0.0358082614839077} 11/07/2021 16:01:56 - INFO - __main__ - Step 133716: {'lr': 1.4788797063976483e-05, 'samples': 25673472, 'steps': 133715, 'loss/train': 1.120009183883667} 11/07/2021 16:01:57 - INFO - __main__ - Step 133717: {'lr': 1.47869989915016e-05, 'samples': 25673664, 'steps': 133716, 'loss/train': 0.373333215713501} 11/07/2021 16:01:57 - INFO - __main__ - Step 133718: {'lr': 1.4785201025010048e-05, 'samples': 25673856, 'steps': 133717, 'loss/train': 1.4069042205810547} 11/07/2021 16:01:58 - INFO - __main__ - Step 133719: {'lr': 1.4783403164502663e-05, 'samples': 25674048, 'steps': 133718, 'loss/train': 1.098186731338501} 11/07/2021 16:01:58 - INFO - __main__ - Step 133720: {'lr': 1.4781605409980248e-05, 'samples': 25674240, 'steps': 133719, 'loss/train': 0.9913355112075806} 11/07/2021 16:01:58 - INFO - __main__ - Step 133721: {'lr': 1.4779807761443637e-05, 'samples': 25674432, 'steps': 133720, 'loss/train': 1.1269526481628418} 11/07/2021 16:01:59 - INFO - __main__ - Step 133722: {'lr': 1.4778010218893578e-05, 'samples': 25674624, 'steps': 133721, 'loss/train': 1.2504079341888428} 11/07/2021 16:02:00 - INFO - __main__ - Step 133723: {'lr': 1.4776212782330933e-05, 'samples': 25674816, 'steps': 133722, 'loss/train': 0.8602614998817444} 11/07/2021 16:02:00 - INFO - __main__ - Step 133724: {'lr': 1.4774415451756506e-05, 'samples': 25675008, 'steps': 133723, 'loss/train': 1.3035792112350464} 11/07/2021 16:02:00 - INFO - __main__ - Step 133725: {'lr': 1.4772618227171074e-05, 'samples': 25675200, 'steps': 133724, 'loss/train': 1.2561708688735962} 11/07/2021 16:02:01 - INFO - __main__ - Step 133726: {'lr': 1.4770821108575499e-05, 'samples': 25675392, 'steps': 133725, 'loss/train': 0.9816964268684387} 11/07/2021 16:02:01 - INFO - __main__ - Step 133727: {'lr': 1.4769024095970584e-05, 'samples': 25675584, 'steps': 133726, 'loss/train': 0.03347034007310867} 11/07/2021 16:02:02 - INFO - __main__ - Step 133728: {'lr': 1.476722718935708e-05, 'samples': 25675776, 'steps': 133727, 'loss/train': 1.484189510345459} 11/07/2021 16:02:03 - INFO - __main__ - Step 133729: {'lr': 1.4765430388735818e-05, 'samples': 25675968, 'steps': 133728, 'loss/train': 0.9872894287109375} 11/07/2021 16:02:03 - INFO - __main__ - Step 133730: {'lr': 1.476363369410766e-05, 'samples': 25676160, 'steps': 133729, 'loss/train': 1.0890363454818726} 11/07/2021 16:02:03 - INFO - __main__ - Step 133731: {'lr': 1.4761837105473352e-05, 'samples': 25676352, 'steps': 133730, 'loss/train': 1.5163379907608032} 11/07/2021 16:02:04 - INFO - __main__ - Step 133732: {'lr': 1.4760040622833731e-05, 'samples': 25676544, 'steps': 133731, 'loss/train': 1.4387445449829102} 11/07/2021 16:02:05 - INFO - __main__ - Step 133733: {'lr': 1.47582442461896e-05, 'samples': 25676736, 'steps': 133732, 'loss/train': 1.2691539525985718} 11/07/2021 16:02:05 - INFO - __main__ - Step 133734: {'lr': 1.4756447975541793e-05, 'samples': 25676928, 'steps': 133733, 'loss/train': 1.045445442199707} 11/07/2021 16:02:06 - INFO - __main__ - Step 133735: {'lr': 1.4754651810891084e-05, 'samples': 25677120, 'steps': 133734, 'loss/train': 1.3261598348617554} 11/07/2021 16:02:06 - INFO - __main__ - Step 133736: {'lr': 1.475285575223831e-05, 'samples': 25677312, 'steps': 133735, 'loss/train': 0.7481061220169067} 11/07/2021 16:02:06 - INFO - __main__ - Step 133737: {'lr': 1.475105979958427e-05, 'samples': 25677504, 'steps': 133736, 'loss/train': 1.2479195594787598} 11/07/2021 16:02:07 - INFO - __main__ - Step 133738: {'lr': 1.4749263952929775e-05, 'samples': 25677696, 'steps': 133737, 'loss/train': 1.1882567405700684} 11/07/2021 16:02:08 - INFO - __main__ - Step 133739: {'lr': 1.4747468212275628e-05, 'samples': 25677888, 'steps': 133738, 'loss/train': 1.3299809694290161} 11/07/2021 16:02:08 - INFO - __main__ - Step 133740: {'lr': 1.474567257762266e-05, 'samples': 25678080, 'steps': 133739, 'loss/train': 1.0939714908599854} 11/07/2021 16:02:08 - INFO - __main__ - Step 133741: {'lr': 1.4743877048971649e-05, 'samples': 25678272, 'steps': 133740, 'loss/train': 1.1215744018554688} 11/07/2021 16:02:09 - INFO - __main__ - Step 133742: {'lr': 1.47420816263234e-05, 'samples': 25678464, 'steps': 133741, 'loss/train': 1.2840125560760498} 11/07/2021 16:02:10 - INFO - __main__ - Step 133743: {'lr': 1.4740286309678747e-05, 'samples': 25678656, 'steps': 133742, 'loss/train': 1.2497304677963257} 11/07/2021 16:02:10 - INFO - __main__ - Step 133744: {'lr': 1.4738491099038492e-05, 'samples': 25678848, 'steps': 133743, 'loss/train': 1.468143343925476} 11/07/2021 16:02:10 - INFO - __main__ - Step 133745: {'lr': 1.4736695994403443e-05, 'samples': 25679040, 'steps': 133744, 'loss/train': 0.8425701856613159} 11/07/2021 16:02:11 - INFO - __main__ - Step 133746: {'lr': 1.4734900995774403e-05, 'samples': 25679232, 'steps': 133745, 'loss/train': 0.6785398125648499} 11/07/2021 16:02:11 - INFO - __main__ - Step 133747: {'lr': 1.4733106103152205e-05, 'samples': 25679424, 'steps': 133746, 'loss/train': 1.3230843544006348} 11/07/2021 16:02:12 - INFO - __main__ - Step 133748: {'lr': 1.4731311316537626e-05, 'samples': 25679616, 'steps': 133747, 'loss/train': 2.369260549545288} 11/07/2021 16:02:12 - INFO - __main__ - Step 133749: {'lr': 1.4729516635931473e-05, 'samples': 25679808, 'steps': 133748, 'loss/train': 1.4040006399154663} 11/07/2021 16:02:13 - INFO - __main__ - Step 133750: {'lr': 1.4727722061334602e-05, 'samples': 25680000, 'steps': 133749, 'loss/train': 1.6031690835952759} 11/07/2021 16:02:13 - INFO - __main__ - Step 133751: {'lr': 1.4725927592747768e-05, 'samples': 25680192, 'steps': 133750, 'loss/train': 1.1754621267318726} 11/07/2021 16:02:14 - INFO - __main__ - Step 133752: {'lr': 1.4724133230171798e-05, 'samples': 25680384, 'steps': 133751, 'loss/train': 1.4599472284317017} 11/07/2021 16:02:14 - INFO - __main__ - Step 133753: {'lr': 1.472233897360753e-05, 'samples': 25680576, 'steps': 133752, 'loss/train': 1.0728411674499512} 11/07/2021 16:02:15 - INFO - __main__ - Step 133754: {'lr': 1.4720544823055738e-05, 'samples': 25680768, 'steps': 133753, 'loss/train': 1.5306274890899658} 11/07/2021 16:02:15 - INFO - __main__ - Step 133755: {'lr': 1.4718750778517227e-05, 'samples': 25680960, 'steps': 133754, 'loss/train': 1.3086756467819214} 11/07/2021 16:02:16 - INFO - __main__ - Step 133756: {'lr': 1.4716956839992802e-05, 'samples': 25681152, 'steps': 133755, 'loss/train': 1.358784556388855} 11/07/2021 16:02:16 - INFO - __main__ - Step 133757: {'lr': 1.4715163007483295e-05, 'samples': 25681344, 'steps': 133756, 'loss/train': 1.9388792514801025} 11/07/2021 16:02:16 - INFO - __main__ - Step 133758: {'lr': 1.4713369280989513e-05, 'samples': 25681536, 'steps': 133757, 'loss/train': 0.5832361578941345} 11/07/2021 16:02:17 - INFO - __main__ - Step 133759: {'lr': 1.4711575660512233e-05, 'samples': 25681728, 'steps': 133758, 'loss/train': 1.3170363903045654} 11/07/2021 16:02:18 - INFO - __main__ - Step 133760: {'lr': 1.4709782146052314e-05, 'samples': 25681920, 'steps': 133759, 'loss/train': 1.1731324195861816} 11/07/2021 16:02:18 - INFO - __main__ - Step 133761: {'lr': 1.4707988737610506e-05, 'samples': 25682112, 'steps': 133760, 'loss/train': 1.724103331565857} 11/07/2021 16:02:18 - INFO - __main__ - Step 133762: {'lr': 1.4706195435187669e-05, 'samples': 25682304, 'steps': 133761, 'loss/train': 1.1852959394454956} 11/07/2021 16:02:19 - INFO - __main__ - Step 133763: {'lr': 1.4704402238784581e-05, 'samples': 25682496, 'steps': 133762, 'loss/train': 1.63604736328125} 11/07/2021 16:02:20 - INFO - __main__ - Step 133764: {'lr': 1.4702609148402102e-05, 'samples': 25682688, 'steps': 133763, 'loss/train': 1.1413723230361938} 11/07/2021 16:02:20 - INFO - __main__ - Step 133765: {'lr': 1.4700816164040982e-05, 'samples': 25682880, 'steps': 133764, 'loss/train': 1.4128425121307373} 11/07/2021 16:02:21 - INFO - __main__ - Step 133766: {'lr': 1.4699023285701996e-05, 'samples': 25683072, 'steps': 133765, 'loss/train': 1.381700038909912} 11/07/2021 16:02:21 - INFO - __main__ - Step 133767: {'lr': 1.4697230513386033e-05, 'samples': 25683264, 'steps': 133766, 'loss/train': 1.0188931226730347} 11/07/2021 16:02:21 - INFO - __main__ - Step 133768: {'lr': 1.4695437847093845e-05, 'samples': 25683456, 'steps': 133767, 'loss/train': 1.48099684715271} 11/07/2021 16:02:22 - INFO - __main__ - Step 133769: {'lr': 1.469364528682629e-05, 'samples': 25683648, 'steps': 133768, 'loss/train': 1.0190317630767822} 11/07/2021 16:02:23 - INFO - __main__ - Step 133770: {'lr': 1.4691852832584118e-05, 'samples': 25683840, 'steps': 133769, 'loss/train': 1.4596595764160156} 11/07/2021 16:02:23 - INFO - __main__ - Step 133771: {'lr': 1.469006048436819e-05, 'samples': 25684032, 'steps': 133770, 'loss/train': 1.0900698900222778} 11/07/2021 16:02:23 - INFO - __main__ - Step 133772: {'lr': 1.4688268242179282e-05, 'samples': 25684224, 'steps': 133771, 'loss/train': 1.2043814659118652} 11/07/2021 16:02:24 - INFO - __main__ - Step 133773: {'lr': 1.4686476106018198e-05, 'samples': 25684416, 'steps': 133772, 'loss/train': 0.5916121602058411} 11/07/2021 16:02:25 - INFO - __main__ - Step 133774: {'lr': 1.4684684075885773e-05, 'samples': 25684608, 'steps': 133773, 'loss/train': 1.0565922260284424} 11/07/2021 16:02:25 - INFO - __main__ - Step 133775: {'lr': 1.4682892151782811e-05, 'samples': 25684800, 'steps': 133774, 'loss/train': 1.0471291542053223} 11/07/2021 16:02:25 - INFO - __main__ - Step 133776: {'lr': 1.468110033371009e-05, 'samples': 25684992, 'steps': 133775, 'loss/train': 1.1272553205490112} 11/07/2021 16:02:26 - INFO - __main__ - Step 133777: {'lr': 1.4679308621668441e-05, 'samples': 25685184, 'steps': 133776, 'loss/train': 1.3126202821731567} 11/07/2021 16:02:26 - INFO - __main__ - Step 133778: {'lr': 1.4677517015658642e-05, 'samples': 25685376, 'steps': 133777, 'loss/train': 0.645753026008606} 11/07/2021 16:02:27 - INFO - __main__ - Step 133779: {'lr': 1.4675725515681526e-05, 'samples': 25685568, 'steps': 133778, 'loss/train': 1.0793521404266357} 11/07/2021 16:02:28 - INFO - __main__ - Step 133780: {'lr': 1.4673934121737925e-05, 'samples': 25685760, 'steps': 133779, 'loss/train': 1.2276225090026855} 11/07/2021 16:02:28 - INFO - __main__ - Step 133781: {'lr': 1.467214283382859e-05, 'samples': 25685952, 'steps': 133780, 'loss/train': 1.2125747203826904} 11/07/2021 16:02:28 - INFO - __main__ - Step 133782: {'lr': 1.467035165195435e-05, 'samples': 25686144, 'steps': 133781, 'loss/train': 1.5188323259353638} 11/07/2021 16:02:29 - INFO - __main__ - Step 133783: {'lr': 1.4668560576116042e-05, 'samples': 25686336, 'steps': 133782, 'loss/train': 1.1384512186050415} 11/07/2021 16:02:30 - INFO - __main__ - Step 133784: {'lr': 1.4666769606314439e-05, 'samples': 25686528, 'steps': 133783, 'loss/train': 1.4029395580291748} 11/07/2021 16:02:30 - INFO - __main__ - Step 133785: {'lr': 1.466497874255035e-05, 'samples': 25686720, 'steps': 133784, 'loss/train': 1.2654348611831665} 11/07/2021 16:02:30 - INFO - __main__ - Step 133786: {'lr': 1.4663187984824633e-05, 'samples': 25686912, 'steps': 133785, 'loss/train': 1.456743597984314} 11/07/2021 16:02:31 - INFO - __main__ - Step 133787: {'lr': 1.466139733313801e-05, 'samples': 25687104, 'steps': 133786, 'loss/train': 1.3870173692703247} 11/07/2021 16:02:31 - INFO - __main__ - Step 133788: {'lr': 1.4659606787491341e-05, 'samples': 25687296, 'steps': 133787, 'loss/train': 1.4972915649414062} 11/07/2021 16:02:32 - INFO - __main__ - Step 133789: {'lr': 1.4657816347885433e-05, 'samples': 25687488, 'steps': 133788, 'loss/train': 1.7078396081924438} 11/07/2021 16:02:32 - INFO - __main__ - Step 133790: {'lr': 1.4656026014321062e-05, 'samples': 25687680, 'steps': 133789, 'loss/train': 0.8613266348838806} 11/07/2021 16:02:33 - INFO - __main__ - Step 133791: {'lr': 1.4654235786799059e-05, 'samples': 25687872, 'steps': 133790, 'loss/train': 1.081244945526123} 11/07/2021 16:02:33 - INFO - __main__ - Step 133792: {'lr': 1.465244566532023e-05, 'samples': 25688064, 'steps': 133791, 'loss/train': 1.4409337043762207} 11/07/2021 16:02:33 - INFO - __main__ - Step 133793: {'lr': 1.4650655649885353e-05, 'samples': 25688256, 'steps': 133792, 'loss/train': 1.473207712173462} 11/07/2021 16:02:34 - INFO - __main__ - Step 133794: {'lr': 1.4648865740495287e-05, 'samples': 25688448, 'steps': 133793, 'loss/train': 0.9100611209869385} 11/07/2021 16:02:35 - INFO - __main__ - Step 133795: {'lr': 1.4647075937150811e-05, 'samples': 25688640, 'steps': 133794, 'loss/train': 1.6249312162399292} 11/07/2021 16:02:35 - INFO - __main__ - Step 133796: {'lr': 1.46452862398527e-05, 'samples': 25688832, 'steps': 133795, 'loss/train': 1.5610129833221436} 11/07/2021 16:02:35 - INFO - __main__ - Step 133797: {'lr': 1.4643496648601873e-05, 'samples': 25689024, 'steps': 133796, 'loss/train': 1.2893550395965576} 11/07/2021 16:02:36 - INFO - __main__ - Step 133798: {'lr': 1.4641707163398993e-05, 'samples': 25689216, 'steps': 133797, 'loss/train': 0.926374077796936} 11/07/2021 16:02:37 - INFO - __main__ - Step 133799: {'lr': 1.463991778424492e-05, 'samples': 25689408, 'steps': 133798, 'loss/train': 1.097580075263977} 11/07/2021 16:02:37 - INFO - __main__ - Step 133800: {'lr': 1.4638128511140465e-05, 'samples': 25689600, 'steps': 133799, 'loss/train': 0.9124215245246887} 11/07/2021 16:02:38 - INFO - __main__ - Step 133801: {'lr': 1.4636339344086453e-05, 'samples': 25689792, 'steps': 133800, 'loss/train': 1.4560275077819824} 11/07/2021 16:02:38 - INFO - __main__ - Step 133802: {'lr': 1.4634550283083665e-05, 'samples': 25689984, 'steps': 133801, 'loss/train': 0.9901097416877747} 11/07/2021 16:02:38 - INFO - __main__ - Step 133803: {'lr': 1.4632761328132932e-05, 'samples': 25690176, 'steps': 133802, 'loss/train': 1.1810940504074097} 11/07/2021 16:02:39 - INFO - __main__ - Step 133804: {'lr': 1.4630972479235032e-05, 'samples': 25690368, 'steps': 133803, 'loss/train': 1.2687174081802368} 11/07/2021 16:02:40 - INFO - __main__ - Step 133805: {'lr': 1.46291837363908e-05, 'samples': 25690560, 'steps': 133804, 'loss/train': 1.5626697540283203} 11/07/2021 16:02:40 - INFO - __main__ - Step 133806: {'lr': 1.462739509960101e-05, 'samples': 25690752, 'steps': 133805, 'loss/train': 1.1628378629684448} 11/07/2021 16:02:40 - INFO - __main__ - Step 133807: {'lr': 1.4625606568866495e-05, 'samples': 25690944, 'steps': 133806, 'loss/train': 0.6469231247901917} 11/07/2021 16:02:41 - INFO - __main__ - Step 133808: {'lr': 1.4623818144188062e-05, 'samples': 25691136, 'steps': 133807, 'loss/train': 1.0832850933074951} 11/07/2021 16:02:42 - INFO - __main__ - Step 133809: {'lr': 1.4622029825566485e-05, 'samples': 25691328, 'steps': 133808, 'loss/train': 1.1806720495224} 11/07/2021 16:02:42 - INFO - __main__ - Step 133810: {'lr': 1.4620241613002599e-05, 'samples': 25691520, 'steps': 133809, 'loss/train': 1.5436465740203857} 11/07/2021 16:02:42 - INFO - __main__ - Step 133811: {'lr': 1.4618453506497182e-05, 'samples': 25691712, 'steps': 133810, 'loss/train': 1.1869556903839111} 11/07/2021 16:02:43 - INFO - __main__ - Step 133812: {'lr': 1.4616665506051064e-05, 'samples': 25691904, 'steps': 133811, 'loss/train': 1.0897530317306519} 11/07/2021 16:02:43 - INFO - __main__ - Step 133813: {'lr': 1.4614877611665051e-05, 'samples': 25692096, 'steps': 133812, 'loss/train': 0.9669438600540161} 11/07/2021 16:02:44 - INFO - __main__ - Step 133814: {'lr': 1.4613089823339947e-05, 'samples': 25692288, 'steps': 133813, 'loss/train': 1.3589195013046265} 11/07/2021 16:02:44 - INFO - __main__ - Step 133815: {'lr': 1.4611302141076533e-05, 'samples': 25692480, 'steps': 133814, 'loss/train': 1.378463625907898} 11/07/2021 16:02:45 - INFO - __main__ - Step 133816: {'lr': 1.4609514564875637e-05, 'samples': 25692672, 'steps': 133815, 'loss/train': 1.5320631265640259} 11/07/2021 16:02:45 - INFO - __main__ - Step 133817: {'lr': 1.4607727094738067e-05, 'samples': 25692864, 'steps': 133816, 'loss/train': 1.4736475944519043} 11/07/2021 16:02:45 - INFO - __main__ - Step 133818: {'lr': 1.4605939730664625e-05, 'samples': 25693056, 'steps': 133817, 'loss/train': 1.0339500904083252} 11/07/2021 16:02:47 - INFO - __main__ - Step 133819: {'lr': 1.4604152472656118e-05, 'samples': 25693248, 'steps': 133818, 'loss/train': 0.28889724612236023} 11/07/2021 16:02:47 - INFO - __main__ - Step 133820: {'lr': 1.460236532071335e-05, 'samples': 25693440, 'steps': 133819, 'loss/train': 1.5555905103683472} 11/07/2021 16:02:47 - INFO - __main__ - Step 133821: {'lr': 1.4600578274837128e-05, 'samples': 25693632, 'steps': 133820, 'loss/train': 1.0348691940307617} 11/07/2021 16:02:48 - INFO - __main__ - Step 133822: {'lr': 1.4598791335028255e-05, 'samples': 25693824, 'steps': 133821, 'loss/train': 1.3683860301971436} 11/07/2021 16:02:48 - INFO - __main__ - Step 133823: {'lr': 1.4597004501287509e-05, 'samples': 25694016, 'steps': 133822, 'loss/train': 0.5701240301132202} 11/07/2021 16:02:49 - INFO - __main__ - Step 133824: {'lr': 1.4595217773615749e-05, 'samples': 25694208, 'steps': 133823, 'loss/train': 1.461211919784546} 11/07/2021 16:02:49 - INFO - __main__ - Step 133825: {'lr': 1.4593431152013725e-05, 'samples': 25694400, 'steps': 133824, 'loss/train': 1.626176118850708} 11/07/2021 16:02:50 - INFO - __main__ - Step 133826: {'lr': 1.459164463648227e-05, 'samples': 25694592, 'steps': 133825, 'loss/train': 1.2405083179473877} 11/07/2021 16:02:50 - INFO - __main__ - Step 133827: {'lr': 1.458985822702219e-05, 'samples': 25694784, 'steps': 133826, 'loss/train': 1.380664348602295} 11/07/2021 16:02:50 - INFO - __main__ - Step 133828: {'lr': 1.4588071923634317e-05, 'samples': 25694976, 'steps': 133827, 'loss/train': 0.24318930506706238} 11/07/2021 16:02:51 - INFO - __main__ - Step 133829: {'lr': 1.4586285726319399e-05, 'samples': 25695168, 'steps': 133828, 'loss/train': 1.6610347032546997} 11/07/2021 16:02:52 - INFO - __main__ - Step 133830: {'lr': 1.458449963507827e-05, 'samples': 25695360, 'steps': 133829, 'loss/train': 1.268506646156311} 11/07/2021 16:02:52 - INFO - __main__ - Step 133831: {'lr': 1.4582713649911734e-05, 'samples': 25695552, 'steps': 133830, 'loss/train': 1.1551238298416138} 11/07/2021 16:02:53 - INFO - __main__ - Step 133832: {'lr': 1.4580927770820568e-05, 'samples': 25695744, 'steps': 133831, 'loss/train': 1.249380111694336} 11/07/2021 16:02:53 - INFO - __main__ - Step 133833: {'lr': 1.4579141997805635e-05, 'samples': 25695936, 'steps': 133832, 'loss/train': 1.1138790845870972} 11/07/2021 16:02:53 - INFO - __main__ - Step 133834: {'lr': 1.4577356330867735e-05, 'samples': 25696128, 'steps': 133833, 'loss/train': 1.3377034664154053} 11/07/2021 16:02:54 - INFO - __main__ - Step 133835: {'lr': 1.4575570770007623e-05, 'samples': 25696320, 'steps': 133834, 'loss/train': 1.1111468076705933} 11/07/2021 16:02:55 - INFO - __main__ - Step 133836: {'lr': 1.4573785315226101e-05, 'samples': 25696512, 'steps': 133835, 'loss/train': 1.7006821632385254} 11/07/2021 16:02:55 - INFO - __main__ - Step 133837: {'lr': 1.457199996652403e-05, 'samples': 25696704, 'steps': 133836, 'loss/train': 1.1979049444198608} 11/07/2021 16:02:55 - INFO - __main__ - Step 133838: {'lr': 1.457021472390216e-05, 'samples': 25696896, 'steps': 133837, 'loss/train': 1.1638108491897583} 11/07/2021 16:02:56 - INFO - __main__ - Step 133839: {'lr': 1.456842958736132e-05, 'samples': 25697088, 'steps': 133838, 'loss/train': 1.1216715574264526} 11/07/2021 16:02:57 - INFO - __main__ - Step 133840: {'lr': 1.456664455690232e-05, 'samples': 25697280, 'steps': 133839, 'loss/train': 1.2636935710906982} 11/07/2021 16:02:57 - INFO - __main__ - Step 133841: {'lr': 1.4564859632525961e-05, 'samples': 25697472, 'steps': 133840, 'loss/train': 1.0674508810043335} 11/07/2021 16:02:57 - INFO - __main__ - Step 133842: {'lr': 1.4563074814233025e-05, 'samples': 25697664, 'steps': 133841, 'loss/train': 1.2243375778198242} 11/07/2021 16:02:58 - INFO - __main__ - Step 133843: {'lr': 1.4561290102024337e-05, 'samples': 25697856, 'steps': 133842, 'loss/train': 1.3629803657531738} 11/07/2021 16:02:58 - INFO - __main__ - Step 133844: {'lr': 1.455950549590071e-05, 'samples': 25698048, 'steps': 133843, 'loss/train': 1.253298044204712} 11/07/2021 16:02:59 - INFO - __main__ - Step 133845: {'lr': 1.4557720995862944e-05, 'samples': 25698240, 'steps': 133844, 'loss/train': 1.2195218801498413} 11/07/2021 16:03:00 - INFO - __main__ - Step 133846: {'lr': 1.4555936601911818e-05, 'samples': 25698432, 'steps': 133845, 'loss/train': 1.2312191724777222} 11/07/2021 16:03:00 - INFO - __main__ - Step 133847: {'lr': 1.4554152314048164e-05, 'samples': 25698624, 'steps': 133846, 'loss/train': 0.9669734835624695} 11/07/2021 16:03:00 - INFO - __main__ - Step 133848: {'lr': 1.4552368132272815e-05, 'samples': 25698816, 'steps': 133847, 'loss/train': 1.355119228363037} 11/07/2021 16:03:01 - INFO - __main__ - Step 133849: {'lr': 1.455058405658649e-05, 'samples': 25699008, 'steps': 133848, 'loss/train': 1.1344826221466064} 11/07/2021 16:03:02 - INFO - __main__ - Step 133850: {'lr': 1.4548800086990027e-05, 'samples': 25699200, 'steps': 133849, 'loss/train': 1.115915060043335} 11/07/2021 16:03:02 - INFO - __main__ - Step 133851: {'lr': 1.4547016223484255e-05, 'samples': 25699392, 'steps': 133850, 'loss/train': 0.18121324479579926} 11/07/2021 16:03:02 - INFO - __main__ - Step 133852: {'lr': 1.454523246606998e-05, 'samples': 25699584, 'steps': 133851, 'loss/train': 1.3219541311264038} 11/07/2021 16:03:03 - INFO - __main__ - Step 133853: {'lr': 1.4543448814747978e-05, 'samples': 25699776, 'steps': 133852, 'loss/train': 0.9708024859428406} 11/07/2021 16:03:03 - INFO - __main__ - Step 133854: {'lr': 1.4541665269519056e-05, 'samples': 25699968, 'steps': 133853, 'loss/train': 0.7132051587104797} 11/07/2021 16:03:04 - INFO - __main__ - Step 133855: {'lr': 1.4539881830384045e-05, 'samples': 25700160, 'steps': 133854, 'loss/train': 1.4681458473205566} 11/07/2021 16:03:04 - INFO - __main__ - Step 133856: {'lr': 1.4538098497343694e-05, 'samples': 25700352, 'steps': 133855, 'loss/train': 0.909195601940155} 11/07/2021 16:03:05 - INFO - __main__ - Step 133857: {'lr': 1.4536315270398864e-05, 'samples': 25700544, 'steps': 133856, 'loss/train': 1.0023177862167358} 11/07/2021 16:03:05 - INFO - __main__ - Step 133858: {'lr': 1.453453214955036e-05, 'samples': 25700736, 'steps': 133857, 'loss/train': 1.0290006399154663} 11/07/2021 16:03:05 - INFO - __main__ - Step 133859: {'lr': 1.4532749134798934e-05, 'samples': 25700928, 'steps': 133858, 'loss/train': 1.5745296478271484} 11/07/2021 16:03:06 - INFO - __main__ - Step 133860: {'lr': 1.4530966226145414e-05, 'samples': 25701120, 'steps': 133859, 'loss/train': 1.4535698890686035} 11/07/2021 16:03:07 - INFO - __main__ - Step 133861: {'lr': 1.4529183423590663e-05, 'samples': 25701312, 'steps': 133860, 'loss/train': 1.2110191583633423} 11/07/2021 16:03:07 - INFO - __main__ - Step 133862: {'lr': 1.4527400727135375e-05, 'samples': 25701504, 'steps': 133861, 'loss/train': 1.499800205230713} 11/07/2021 16:03:08 - INFO - __main__ - Step 133863: {'lr': 1.4525618136780411e-05, 'samples': 25701696, 'steps': 133862, 'loss/train': 1.4491674900054932} 11/07/2021 16:03:08 - INFO - __main__ - Step 133864: {'lr': 1.4523835652526602e-05, 'samples': 25701888, 'steps': 133863, 'loss/train': 1.2035421133041382} 11/07/2021 16:03:08 - INFO - __main__ - Step 133865: {'lr': 1.4522053274374669e-05, 'samples': 25702080, 'steps': 133864, 'loss/train': 1.2948580980300903} 11/07/2021 16:03:09 - INFO - __main__ - Step 133866: {'lr': 1.4520271002325503e-05, 'samples': 25702272, 'steps': 133865, 'loss/train': 1.545723557472229} 11/07/2021 16:03:10 - INFO - __main__ - Step 133867: {'lr': 1.451848883637985e-05, 'samples': 25702464, 'steps': 133866, 'loss/train': 1.3451217412948608} 11/07/2021 16:03:10 - INFO - __main__ - Step 133868: {'lr': 1.451670677653852e-05, 'samples': 25702656, 'steps': 133867, 'loss/train': 1.977089762687683} 11/07/2021 16:03:10 - INFO - __main__ - Step 133869: {'lr': 1.4514924822802367e-05, 'samples': 25702848, 'steps': 133868, 'loss/train': 1.0207146406173706} 11/07/2021 16:03:11 - INFO - __main__ - Step 133870: {'lr': 1.4513142975172117e-05, 'samples': 25703040, 'steps': 133869, 'loss/train': 0.6904116868972778} 11/07/2021 16:03:12 - INFO - __main__ - Step 133871: {'lr': 1.4511361233648629e-05, 'samples': 25703232, 'steps': 133870, 'loss/train': 1.1470320224761963} 11/07/2021 16:03:12 - INFO - __main__ - Step 133872: {'lr': 1.450957959823268e-05, 'samples': 25703424, 'steps': 133871, 'loss/train': 0.9741038084030151} 11/07/2021 16:03:13 - INFO - __main__ - Step 133873: {'lr': 1.4507798068925076e-05, 'samples': 25703616, 'steps': 133872, 'loss/train': 0.5353037118911743} 11/07/2021 16:03:13 - INFO - __main__ - Step 133874: {'lr': 1.450601664572665e-05, 'samples': 25703808, 'steps': 133873, 'loss/train': 1.1447536945343018} 11/07/2021 16:03:13 - INFO - __main__ - Step 133875: {'lr': 1.4504235328638204e-05, 'samples': 25704000, 'steps': 133874, 'loss/train': 1.080679178237915} 11/07/2021 16:03:14 - INFO - __main__ - Step 133876: {'lr': 1.4502454117660464e-05, 'samples': 25704192, 'steps': 133875, 'loss/train': 1.0168187618255615} 11/07/2021 16:03:15 - INFO - __main__ - Step 133877: {'lr': 1.4500673012794285e-05, 'samples': 25704384, 'steps': 133876, 'loss/train': 1.1915825605392456} 11/07/2021 16:03:15 - INFO - __main__ - Step 133878: {'lr': 1.4498892014040477e-05, 'samples': 25704576, 'steps': 133877, 'loss/train': 0.972008228302002} 11/07/2021 16:03:15 - INFO - __main__ - Step 133879: {'lr': 1.4497111121399842e-05, 'samples': 25704768, 'steps': 133878, 'loss/train': 0.7224009037017822} 11/07/2021 16:03:16 - INFO - __main__ - Step 133880: {'lr': 1.4495330334873185e-05, 'samples': 25704960, 'steps': 133879, 'loss/train': 1.1731536388397217} 11/07/2021 16:03:17 - INFO - __main__ - Step 133881: {'lr': 1.4493549654461257e-05, 'samples': 25705152, 'steps': 133880, 'loss/train': 1.3056238889694214} 11/07/2021 16:03:17 - INFO - __main__ - Step 133882: {'lr': 1.4491769080164946e-05, 'samples': 25705344, 'steps': 133881, 'loss/train': 1.4675785303115845} 11/07/2021 16:03:18 - INFO - __main__ - Step 133883: {'lr': 1.4489988611984973e-05, 'samples': 25705536, 'steps': 133882, 'loss/train': 1.2215465307235718} 11/07/2021 16:03:18 - INFO - __main__ - Step 133884: {'lr': 1.4488208249922197e-05, 'samples': 25705728, 'steps': 133883, 'loss/train': 1.3441671133041382} 11/07/2021 16:03:18 - INFO - __main__ - Step 133885: {'lr': 1.4486427993977397e-05, 'samples': 25705920, 'steps': 133884, 'loss/train': 1.2378326654434204} 11/07/2021 16:03:19 - INFO - __main__ - Step 133886: {'lr': 1.4484647844151377e-05, 'samples': 25706112, 'steps': 133885, 'loss/train': 1.0969425439834595} 11/07/2021 16:03:20 - INFO - __main__ - Step 133887: {'lr': 1.4482867800444944e-05, 'samples': 25706304, 'steps': 133886, 'loss/train': 1.333870768547058} 11/07/2021 16:03:20 - INFO - __main__ - Step 133888: {'lr': 1.4481087862858927e-05, 'samples': 25706496, 'steps': 133887, 'loss/train': 1.109471082687378} 11/07/2021 16:03:20 - INFO - __main__ - Step 133889: {'lr': 1.4479308031394079e-05, 'samples': 25706688, 'steps': 133888, 'loss/train': 1.226239800453186} 11/07/2021 16:03:21 - INFO - __main__ - Step 133890: {'lr': 1.4477528306051201e-05, 'samples': 25706880, 'steps': 133889, 'loss/train': 1.4766514301300049} 11/07/2021 16:03:22 - INFO - __main__ - Step 133891: {'lr': 1.447574868683113e-05, 'samples': 25707072, 'steps': 133890, 'loss/train': 1.6992287635803223} 11/07/2021 16:03:22 - INFO - __main__ - Step 133892: {'lr': 1.447396917373464e-05, 'samples': 25707264, 'steps': 133891, 'loss/train': 1.0577846765518188} 11/07/2021 16:03:23 - INFO - __main__ - Step 133893: {'lr': 1.4472189766762538e-05, 'samples': 25707456, 'steps': 133892, 'loss/train': 1.3897548913955688} 11/07/2021 16:03:23 - INFO - __main__ - Step 133894: {'lr': 1.447041046591563e-05, 'samples': 25707648, 'steps': 133893, 'loss/train': 1.0602363348007202} 11/07/2021 16:03:23 - INFO - __main__ - Step 133895: {'lr': 1.4468631271194742e-05, 'samples': 25707840, 'steps': 133894, 'loss/train': 1.034550428390503} 11/07/2021 16:03:24 - INFO - __main__ - Step 133896: {'lr': 1.446685218260066e-05, 'samples': 25708032, 'steps': 133895, 'loss/train': 1.3359884023666382} 11/07/2021 16:03:25 - INFO - __main__ - Step 133897: {'lr': 1.4465073200134154e-05, 'samples': 25708224, 'steps': 133896, 'loss/train': 1.2569489479064941} 11/07/2021 16:03:25 - INFO - __main__ - Step 133898: {'lr': 1.4463294323796062e-05, 'samples': 25708416, 'steps': 133897, 'loss/train': 1.4071286916732788} 11/07/2021 16:03:25 - INFO - __main__ - Step 133899: {'lr': 1.4461515553587185e-05, 'samples': 25708608, 'steps': 133898, 'loss/train': 0.9071282148361206} 11/07/2021 16:03:26 - INFO - __main__ - Step 133900: {'lr': 1.4459736889508302e-05, 'samples': 25708800, 'steps': 133899, 'loss/train': 1.0326833724975586} 11/07/2021 16:03:27 - INFO - __main__ - Step 133901: {'lr': 1.4457958331560245e-05, 'samples': 25708992, 'steps': 133900, 'loss/train': 1.283739447593689} 11/07/2021 16:03:27 - INFO - __main__ - Step 133902: {'lr': 1.445617987974379e-05, 'samples': 25709184, 'steps': 133901, 'loss/train': 1.361826777458191} 11/07/2021 16:03:27 - INFO - __main__ - Step 133903: {'lr': 1.4454401534059746e-05, 'samples': 25709376, 'steps': 133902, 'loss/train': 1.1546692848205566} 11/07/2021 16:03:28 - INFO - __main__ - Step 133904: {'lr': 1.4452623294508888e-05, 'samples': 25709568, 'steps': 133903, 'loss/train': 0.6418929100036621} 11/07/2021 16:03:28 - INFO - __main__ - Step 133905: {'lr': 1.4450845161092074e-05, 'samples': 25709760, 'steps': 133904, 'loss/train': 1.2914448976516724} 11/07/2021 16:03:28 - INFO - __main__ - Step 133906: {'lr': 1.4449067133810057e-05, 'samples': 25709952, 'steps': 133905, 'loss/train': 1.0178148746490479} 11/07/2021 16:03:29 - INFO - __main__ - Step 133907: {'lr': 1.4447289212663667e-05, 'samples': 25710144, 'steps': 133906, 'loss/train': 1.2316300868988037} 11/07/2021 16:03:30 - INFO - __main__ - Step 133908: {'lr': 1.4445511397653682e-05, 'samples': 25710336, 'steps': 133907, 'loss/train': 1.7224115133285522} 11/07/2021 16:03:30 - INFO - __main__ - Step 133909: {'lr': 1.4443733688780908e-05, 'samples': 25710528, 'steps': 133908, 'loss/train': 1.8809291124343872} 11/07/2021 16:03:30 - INFO - __main__ - Step 133910: {'lr': 1.4441956086046177e-05, 'samples': 25710720, 'steps': 133909, 'loss/train': 0.9696530699729919} 11/07/2021 16:03:31 - INFO - __main__ - Step 133911: {'lr': 1.4440178589450237e-05, 'samples': 25710912, 'steps': 133910, 'loss/train': 1.0607945919036865} 11/07/2021 16:03:32 - INFO - __main__ - Step 133912: {'lr': 1.443840119899395e-05, 'samples': 25711104, 'steps': 133911, 'loss/train': 1.474153995513916} 11/07/2021 16:03:32 - INFO - __main__ - Step 133913: {'lr': 1.4436623914678065e-05, 'samples': 25711296, 'steps': 133912, 'loss/train': 0.9295707941055298} 11/07/2021 16:03:33 - INFO - __main__ - Step 133914: {'lr': 1.4434846736503415e-05, 'samples': 25711488, 'steps': 133913, 'loss/train': 1.2475281953811646} 11/07/2021 16:03:33 - INFO - __main__ - Step 133915: {'lr': 1.4433069664470805e-05, 'samples': 25711680, 'steps': 133914, 'loss/train': 0.6676127910614014} 11/07/2021 16:03:33 - INFO - __main__ - Step 133916: {'lr': 1.4431292698580984e-05, 'samples': 25711872, 'steps': 133915, 'loss/train': 0.927419900894165} 11/07/2021 16:03:34 - INFO - __main__ - Step 133917: {'lr': 1.442951583883481e-05, 'samples': 25712064, 'steps': 133916, 'loss/train': 0.9434414505958557} 11/07/2021 16:03:35 - INFO - __main__ - Step 133918: {'lr': 1.4427739085233038e-05, 'samples': 25712256, 'steps': 133917, 'loss/train': 1.4740655422210693} 11/07/2021 16:03:35 - INFO - __main__ - Step 133919: {'lr': 1.4425962437776497e-05, 'samples': 25712448, 'steps': 133918, 'loss/train': 0.9640668630599976} 11/07/2021 16:03:35 - INFO - __main__ - Step 133920: {'lr': 1.442418589646599e-05, 'samples': 25712640, 'steps': 133919, 'loss/train': 1.2782191038131714} 11/07/2021 16:03:36 - INFO - __main__ - Step 133921: {'lr': 1.4422409461302299e-05, 'samples': 25712832, 'steps': 133920, 'loss/train': 1.561051368713379} 11/07/2021 16:03:37 - INFO - __main__ - Step 133922: {'lr': 1.4420633132286254e-05, 'samples': 25713024, 'steps': 133921, 'loss/train': 1.2052714824676514} 11/07/2021 16:03:37 - INFO - __main__ - Step 133923: {'lr': 1.4418856909418604e-05, 'samples': 25713216, 'steps': 133922, 'loss/train': 1.4940497875213623} 11/07/2021 16:03:37 - INFO - __main__ - Step 133924: {'lr': 1.441708079270021e-05, 'samples': 25713408, 'steps': 133923, 'loss/train': 1.32832932472229} 11/07/2021 16:03:38 - INFO - __main__ - Step 133925: {'lr': 1.441530478213185e-05, 'samples': 25713600, 'steps': 133924, 'loss/train': 1.7734627723693848} 11/07/2021 16:03:38 - INFO - __main__ - Step 133926: {'lr': 1.4413528877714298e-05, 'samples': 25713792, 'steps': 133925, 'loss/train': 1.3328536748886108} 11/07/2021 16:03:40 - INFO - __main__ - Step 133927: {'lr': 1.4411753079448365e-05, 'samples': 25713984, 'steps': 133926, 'loss/train': 1.2378534078598022} 11/07/2021 16:03:40 - INFO - __main__ - Step 133928: {'lr': 1.4409977387334932e-05, 'samples': 25714176, 'steps': 133927, 'loss/train': 0.7782494425773621} 11/07/2021 16:03:40 - INFO - __main__ - Step 133929: {'lr': 1.4408201801374671e-05, 'samples': 25714368, 'steps': 133928, 'loss/train': 1.1937196254730225} 11/07/2021 16:03:41 - INFO - __main__ - Step 133930: {'lr': 1.440642632156844e-05, 'samples': 25714560, 'steps': 133929, 'loss/train': 1.565979242324829} 11/07/2021 16:03:41 - INFO - __main__ - Step 133931: {'lr': 1.4404650947917042e-05, 'samples': 25714752, 'steps': 133930, 'loss/train': 1.749930500984192} 11/07/2021 16:03:42 - INFO - __main__ - Step 133932: {'lr': 1.4402875680421257e-05, 'samples': 25714944, 'steps': 133931, 'loss/train': 1.7341824769973755} 11/07/2021 16:03:42 - INFO - __main__ - Step 133933: {'lr': 1.4401100519081917e-05, 'samples': 25715136, 'steps': 133932, 'loss/train': 1.3851557970046997} 11/07/2021 16:03:43 - INFO - __main__ - Step 133934: {'lr': 1.4399325463899799e-05, 'samples': 25715328, 'steps': 133933, 'loss/train': 1.2737430334091187} 11/07/2021 16:03:43 - INFO - __main__ - Step 133935: {'lr': 1.4397550514875707e-05, 'samples': 25715520, 'steps': 133934, 'loss/train': 1.0966941118240356} 11/07/2021 16:03:44 - INFO - __main__ - Step 133936: {'lr': 1.4395775672010447e-05, 'samples': 25715712, 'steps': 133935, 'loss/train': 1.1358675956726074} 11/07/2021 16:03:44 - INFO - __main__ - Step 133937: {'lr': 1.4394000935304824e-05, 'samples': 25715904, 'steps': 133936, 'loss/train': 0.9564775824546814} 11/07/2021 16:03:45 - INFO - __main__ - Step 133938: {'lr': 1.4392226304759615e-05, 'samples': 25716096, 'steps': 133937, 'loss/train': 1.3555690050125122} 11/07/2021 16:03:45 - INFO - __main__ - Step 133939: {'lr': 1.4390451780375625e-05, 'samples': 25716288, 'steps': 133938, 'loss/train': 1.4246752262115479} 11/07/2021 16:03:46 - INFO - __main__ - Step 133940: {'lr': 1.4388677362153685e-05, 'samples': 25716480, 'steps': 133939, 'loss/train': 0.9265779256820679} 11/07/2021 16:03:46 - INFO - __main__ - Step 133941: {'lr': 1.4386903050094575e-05, 'samples': 25716672, 'steps': 133940, 'loss/train': 1.1209659576416016} 11/07/2021 16:03:46 - INFO - __main__ - Step 133942: {'lr': 1.4385128844199097e-05, 'samples': 25716864, 'steps': 133941, 'loss/train': 1.351564884185791} 11/07/2021 16:03:47 - INFO - __main__ - Step 133943: {'lr': 1.4383354744468031e-05, 'samples': 25717056, 'steps': 133942, 'loss/train': 1.5640324354171753} 11/07/2021 16:03:48 - INFO - __main__ - Step 133944: {'lr': 1.4381580750902179e-05, 'samples': 25717248, 'steps': 133943, 'loss/train': 1.25026535987854} 11/07/2021 16:03:48 - INFO - __main__ - Step 133945: {'lr': 1.4379806863502348e-05, 'samples': 25717440, 'steps': 133944, 'loss/train': 0.6862229704856873} 11/07/2021 16:03:48 - INFO - __main__ - Step 133946: {'lr': 1.4378033082269343e-05, 'samples': 25717632, 'steps': 133945, 'loss/train': 0.4069133400917053} 11/07/2021 16:03:49 - INFO - __main__ - Step 133947: {'lr': 1.4376259407203967e-05, 'samples': 25717824, 'steps': 133946, 'loss/train': 1.3129725456237793} 11/07/2021 16:03:50 - INFO - __main__ - Step 133948: {'lr': 1.4374485838307027e-05, 'samples': 25718016, 'steps': 133947, 'loss/train': 1.0514566898345947} 11/07/2021 16:03:51 - INFO - __main__ - Step 133949: {'lr': 1.4372712375579272e-05, 'samples': 25718208, 'steps': 133948, 'loss/train': 0.9480503797531128} 11/07/2021 16:03:51 - INFO - __main__ - Step 133950: {'lr': 1.4370939019021561e-05, 'samples': 25718400, 'steps': 133949, 'loss/train': 1.0650073289871216} 11/07/2021 16:03:51 - INFO - __main__ - Step 133951: {'lr': 1.4369165768634673e-05, 'samples': 25718592, 'steps': 133950, 'loss/train': 0.785493016242981} 11/07/2021 16:03:52 - INFO - __main__ - Step 133952: {'lr': 1.4367392624419384e-05, 'samples': 25718784, 'steps': 133951, 'loss/train': 1.4095470905303955} 11/07/2021 16:03:52 - INFO - __main__ - Step 133953: {'lr': 1.4365619586376527e-05, 'samples': 25718976, 'steps': 133952, 'loss/train': 1.1940101385116577} 11/07/2021 16:03:53 - INFO - __main__ - Step 133954: {'lr': 1.4363846654506879e-05, 'samples': 25719168, 'steps': 133953, 'loss/train': 0.0796893984079361} 11/07/2021 16:03:54 - INFO - __main__ - Step 133955: {'lr': 1.4362073828811273e-05, 'samples': 25719360, 'steps': 133954, 'loss/train': 1.2664873600006104} 11/07/2021 16:03:54 - INFO - __main__ - Step 133956: {'lr': 1.4360301109290459e-05, 'samples': 25719552, 'steps': 133955, 'loss/train': 4.181712627410889} 11/07/2021 16:03:54 - INFO - __main__ - Step 133957: {'lr': 1.4358528495945266e-05, 'samples': 25719744, 'steps': 133956, 'loss/train': 0.8023971319198608} 11/07/2021 16:03:55 - INFO - __main__ - Step 133958: {'lr': 1.4356755988776448e-05, 'samples': 25719936, 'steps': 133957, 'loss/train': 1.3407713174819946} 11/07/2021 16:03:55 - INFO - __main__ - Step 133959: {'lr': 1.4354983587784864e-05, 'samples': 25720128, 'steps': 133958, 'loss/train': 1.5173286199569702} 11/07/2021 16:03:56 - INFO - __main__ - Step 133960: {'lr': 1.4353211292971292e-05, 'samples': 25720320, 'steps': 133959, 'loss/train': 1.2130851745605469} 11/07/2021 16:03:56 - INFO - __main__ - Step 133961: {'lr': 1.4351439104336534e-05, 'samples': 25720512, 'steps': 133960, 'loss/train': 1.5010429620742798} 11/07/2021 16:03:57 - INFO - __main__ - Step 133962: {'lr': 1.4349667021881369e-05, 'samples': 25720704, 'steps': 133961, 'loss/train': 0.9274657368659973} 11/07/2021 16:03:57 - INFO - __main__ - Step 133963: {'lr': 1.4347895045606602e-05, 'samples': 25720896, 'steps': 133962, 'loss/train': 1.1147868633270264} 11/07/2021 16:03:57 - INFO - __main__ - Step 133964: {'lr': 1.4346123175513037e-05, 'samples': 25721088, 'steps': 133963, 'loss/train': 1.220323920249939} 11/07/2021 16:03:58 - INFO - __main__ - Step 133965: {'lr': 1.434435141160148e-05, 'samples': 25721280, 'steps': 133964, 'loss/train': 1.33692467212677} 11/07/2021 16:03:59 - INFO - __main__ - Step 133966: {'lr': 1.434257975387271e-05, 'samples': 25721472, 'steps': 133965, 'loss/train': 1.1998693943023682} 11/07/2021 16:03:59 - INFO - __main__ - Step 133967: {'lr': 1.4340808202327555e-05, 'samples': 25721664, 'steps': 133966, 'loss/train': 1.3427921533584595} 11/07/2021 16:03:59 - INFO - __main__ - Step 133968: {'lr': 1.4339036756966766e-05, 'samples': 25721856, 'steps': 133967, 'loss/train': 1.1216163635253906} 11/07/2021 16:04:00 - INFO - __main__ - Step 133969: {'lr': 1.4337265417791234e-05, 'samples': 25722048, 'steps': 133968, 'loss/train': 1.5656269788742065} 11/07/2021 16:04:01 - INFO - __main__ - Step 133970: {'lr': 1.4335494184801651e-05, 'samples': 25722240, 'steps': 133969, 'loss/train': 1.3341939449310303} 11/07/2021 16:04:01 - INFO - __main__ - Step 133971: {'lr': 1.4333723057998876e-05, 'samples': 25722432, 'steps': 133970, 'loss/train': 1.0210485458374023} 11/07/2021 16:04:01 - INFO - __main__ - Step 133972: {'lr': 1.4331952037383662e-05, 'samples': 25722624, 'steps': 133971, 'loss/train': 1.292093276977539} 11/07/2021 16:04:02 - INFO - __main__ - Step 133973: {'lr': 1.4330181122956838e-05, 'samples': 25722816, 'steps': 133972, 'loss/train': 1.9059933423995972} 11/07/2021 16:04:02 - INFO - __main__ - Step 133974: {'lr': 1.4328410314719209e-05, 'samples': 25723008, 'steps': 133973, 'loss/train': 1.195204734802246} 11/07/2021 16:04:03 - INFO - __main__ - Step 133975: {'lr': 1.4326639612671554e-05, 'samples': 25723200, 'steps': 133974, 'loss/train': 1.346976399421692} 11/07/2021 16:04:04 - INFO - __main__ - Step 133976: {'lr': 1.4324869016814679e-05, 'samples': 25723392, 'steps': 133975, 'loss/train': 1.080264925956726} 11/07/2021 16:04:04 - INFO - __main__ - Step 133977: {'lr': 1.4323098527149386e-05, 'samples': 25723584, 'steps': 133976, 'loss/train': 1.2473169565200806} 11/07/2021 16:04:04 - INFO - __main__ - Step 133978: {'lr': 1.4321328143676454e-05, 'samples': 25723776, 'steps': 133977, 'loss/train': 1.7721580266952515} 11/07/2021 16:04:05 - INFO - __main__ - Step 133979: {'lr': 1.4319557866396715e-05, 'samples': 25723968, 'steps': 133978, 'loss/train': 0.9520217180252075} 11/07/2021 16:04:05 - INFO - __main__ - Step 133980: {'lr': 1.4317787695310918e-05, 'samples': 25724160, 'steps': 133979, 'loss/train': 1.1330326795578003} 11/07/2021 16:04:06 - INFO - __main__ - Step 133981: {'lr': 1.4316017630419926e-05, 'samples': 25724352, 'steps': 133980, 'loss/train': 1.3692125082015991} 11/07/2021 16:04:07 - INFO - __main__ - Step 133982: {'lr': 1.4314247671724511e-05, 'samples': 25724544, 'steps': 133981, 'loss/train': 1.3523658514022827} 11/07/2021 16:04:07 - INFO - __main__ - Step 133983: {'lr': 1.4312477819225428e-05, 'samples': 25724736, 'steps': 133982, 'loss/train': 0.8727238178253174} 11/07/2021 16:04:07 - INFO - __main__ - Step 133984: {'lr': 1.4310708072923506e-05, 'samples': 25724928, 'steps': 133983, 'loss/train': 1.565901756286621} 11/07/2021 16:04:08 - INFO - __main__ - Step 133985: {'lr': 1.4308938432819523e-05, 'samples': 25725120, 'steps': 133984, 'loss/train': 1.0132824182510376} 11/07/2021 16:04:09 - INFO - __main__ - Step 133986: {'lr': 1.430716889891434e-05, 'samples': 25725312, 'steps': 133985, 'loss/train': 1.485001564025879} 11/07/2021 16:04:09 - INFO - __main__ - Step 133987: {'lr': 1.430539947120868e-05, 'samples': 25725504, 'steps': 133986, 'loss/train': 0.47568783164024353} 11/07/2021 16:04:09 - INFO - __main__ - Step 133988: {'lr': 1.4303630149703373e-05, 'samples': 25725696, 'steps': 133987, 'loss/train': 1.218958854675293} 11/07/2021 16:04:10 - INFO - __main__ - Step 133989: {'lr': 1.4301860934399197e-05, 'samples': 25725888, 'steps': 133988, 'loss/train': 1.8597972393035889} 11/07/2021 16:04:10 - INFO - __main__ - Step 133990: {'lr': 1.4300091825296985e-05, 'samples': 25726080, 'steps': 133989, 'loss/train': 1.0071394443511963} 11/07/2021 16:04:11 - INFO - __main__ - Step 133991: {'lr': 1.4298322822397514e-05, 'samples': 25726272, 'steps': 133990, 'loss/train': 1.3717316389083862} 11/07/2021 16:04:12 - INFO - __main__ - Step 133992: {'lr': 1.4296553925701589e-05, 'samples': 25726464, 'steps': 133991, 'loss/train': 0.7578328847885132} 11/07/2021 16:04:12 - INFO - __main__ - Step 133993: {'lr': 1.4294785135209987e-05, 'samples': 25726656, 'steps': 133992, 'loss/train': 1.0316828489303589} 11/07/2021 16:04:12 - INFO - __main__ - Step 133994: {'lr': 1.4293016450923513e-05, 'samples': 25726848, 'steps': 133993, 'loss/train': 1.273125410079956} 11/07/2021 16:04:13 - INFO - __main__ - Step 133995: {'lr': 1.4291247872843e-05, 'samples': 25727040, 'steps': 133994, 'loss/train': 1.2501146793365479} 11/07/2021 16:04:14 - INFO - __main__ - Step 133996: {'lr': 1.4289479400969224e-05, 'samples': 25727232, 'steps': 133995, 'loss/train': 1.3007519245147705} 11/07/2021 16:04:14 - INFO - __main__ - Step 133997: {'lr': 1.4287711035302936e-05, 'samples': 25727424, 'steps': 133996, 'loss/train': 1.3785148859024048} 11/07/2021 16:04:15 - INFO - __main__ - Step 133998: {'lr': 1.4285942775844968e-05, 'samples': 25727616, 'steps': 133997, 'loss/train': 1.3974190950393677} 11/07/2021 16:04:15 - INFO - __main__ - Step 133999: {'lr': 1.4284174622596125e-05, 'samples': 25727808, 'steps': 133998, 'loss/train': 0.8598229289054871} 11/07/2021 16:04:16 - INFO - __main__ - Step 134000: {'lr': 1.4282406575557184e-05, 'samples': 25728000, 'steps': 133999, 'loss/train': 0.6182685494422913} 11/07/2021 16:04:17 - INFO - __main__ - Step 134001: {'lr': 1.428063863472895e-05, 'samples': 25728192, 'steps': 134000, 'loss/train': 0.9368383884429932} 11/07/2021 16:04:17 - INFO - __main__ - Step 134002: {'lr': 1.4278870800112226e-05, 'samples': 25728384, 'steps': 134001, 'loss/train': 1.3708882331848145} 11/07/2021 16:04:17 - INFO - __main__ - Step 134003: {'lr': 1.427710307170782e-05, 'samples': 25728576, 'steps': 134002, 'loss/train': 1.363186001777649} 11/07/2021 16:04:18 - INFO - __main__ - Step 134004: {'lr': 1.4275335449516509e-05, 'samples': 25728768, 'steps': 134003, 'loss/train': 1.1828449964523315} 11/07/2021 16:04:18 - INFO - __main__ - Step 134005: {'lr': 1.4273567933539094e-05, 'samples': 25728960, 'steps': 134004, 'loss/train': 1.1970049142837524} 11/07/2021 16:04:18 - INFO - __main__ - Step 134006: {'lr': 1.4271800523776356e-05, 'samples': 25729152, 'steps': 134005, 'loss/train': 1.760241985321045} 11/07/2021 16:04:19 - INFO - __main__ - Step 134007: {'lr': 1.4270033220229128e-05, 'samples': 25729344, 'steps': 134006, 'loss/train': 1.3153002262115479} 11/07/2021 16:04:20 - INFO - __main__ - Step 134008: {'lr': 1.4268266022898186e-05, 'samples': 25729536, 'steps': 134007, 'loss/train': 1.261224389076233} 11/07/2021 16:04:20 - INFO - __main__ - Step 134009: {'lr': 1.4266498931784332e-05, 'samples': 25729728, 'steps': 134008, 'loss/train': 0.7228826284408569} 11/07/2021 16:04:21 - INFO - __main__ - Step 134010: {'lr': 1.4264731946888349e-05, 'samples': 25729920, 'steps': 134009, 'loss/train': 1.982736587524414} 11/07/2021 16:04:21 - INFO - __main__ - Step 134011: {'lr': 1.4262965068211036e-05, 'samples': 25730112, 'steps': 134010, 'loss/train': 1.0234318971633911} 11/07/2021 16:04:22 - INFO - __main__ - Step 134012: {'lr': 1.4261198295753203e-05, 'samples': 25730304, 'steps': 134011, 'loss/train': 0.9097875952720642} 11/07/2021 16:04:22 - INFO - __main__ - Step 134013: {'lr': 1.4259431629515624e-05, 'samples': 25730496, 'steps': 134012, 'loss/train': 1.314989447593689} 11/07/2021 16:04:23 - INFO - __main__ - Step 134014: {'lr': 1.4257665069499104e-05, 'samples': 25730688, 'steps': 134013, 'loss/train': 1.3790202140808105} 11/07/2021 16:04:23 - INFO - __main__ - Step 134015: {'lr': 1.425589861570442e-05, 'samples': 25730880, 'steps': 134014, 'loss/train': 1.3521790504455566} 11/07/2021 16:04:23 - INFO - __main__ - Step 134016: {'lr': 1.4254132268132435e-05, 'samples': 25731072, 'steps': 134015, 'loss/train': 1.1838659048080444} 11/07/2021 16:04:24 - INFO - __main__ - Step 134017: {'lr': 1.425236602678387e-05, 'samples': 25731264, 'steps': 134016, 'loss/train': 1.1341161727905273} 11/07/2021 16:04:25 - INFO - __main__ - Step 134018: {'lr': 1.4250599891659555e-05, 'samples': 25731456, 'steps': 134017, 'loss/train': 1.4087573289871216} 11/07/2021 16:04:25 - INFO - __main__ - Step 134019: {'lr': 1.4248833862760296e-05, 'samples': 25731648, 'steps': 134018, 'loss/train': 1.1276825666427612} 11/07/2021 16:04:25 - INFO - __main__ - Step 134020: {'lr': 1.4247067940086871e-05, 'samples': 25731840, 'steps': 134019, 'loss/train': 1.3065485954284668} 11/07/2021 16:04:26 - INFO - __main__ - Step 134021: {'lr': 1.4245302123640059e-05, 'samples': 25732032, 'steps': 134020, 'loss/train': 1.3259737491607666} 11/07/2021 16:04:27 - INFO - __main__ - Step 134022: {'lr': 1.4243536413420744e-05, 'samples': 25732224, 'steps': 134021, 'loss/train': 1.2909131050109863} 11/07/2021 16:04:27 - INFO - __main__ - Step 134023: {'lr': 1.4241770809429593e-05, 'samples': 25732416, 'steps': 134022, 'loss/train': 1.0316414833068848} 11/07/2021 16:04:28 - INFO - __main__ - Step 134024: {'lr': 1.424000531166747e-05, 'samples': 25732608, 'steps': 134023, 'loss/train': 0.7568811774253845} 11/07/2021 16:04:28 - INFO - __main__ - Step 134025: {'lr': 1.4238239920135176e-05, 'samples': 25732800, 'steps': 134024, 'loss/train': 1.294651985168457} 11/07/2021 16:04:28 - INFO - __main__ - Step 134026: {'lr': 1.4236474634833463e-05, 'samples': 25732992, 'steps': 134025, 'loss/train': 1.2383729219436646} 11/07/2021 16:04:29 - INFO - __main__ - Step 134027: {'lr': 1.423470945576319e-05, 'samples': 25733184, 'steps': 134026, 'loss/train': 1.1930912733078003} 11/07/2021 16:04:30 - INFO - __main__ - Step 134028: {'lr': 1.4232944382925106e-05, 'samples': 25733376, 'steps': 134027, 'loss/train': 1.0250329971313477} 11/07/2021 16:04:30 - INFO - __main__ - Step 134029: {'lr': 1.4231179416320017e-05, 'samples': 25733568, 'steps': 134028, 'loss/train': 0.8589625954627991} 11/07/2021 16:04:30 - INFO - __main__ - Step 134030: {'lr': 1.42294145559487e-05, 'samples': 25733760, 'steps': 134029, 'loss/train': 0.9546224474906921} 11/07/2021 16:04:31 - INFO - __main__ - Step 134031: {'lr': 1.4227649801811987e-05, 'samples': 25733952, 'steps': 134030, 'loss/train': 1.7853944301605225} 11/07/2021 16:04:32 - INFO - __main__ - Step 134032: {'lr': 1.4225885153910684e-05, 'samples': 25734144, 'steps': 134031, 'loss/train': 1.827026128768921} 11/07/2021 16:04:32 - INFO - __main__ - Step 134033: {'lr': 1.4224120612245566e-05, 'samples': 25734336, 'steps': 134032, 'loss/train': 1.177283763885498} 11/07/2021 16:04:32 - INFO - __main__ - Step 134034: {'lr': 1.4222356176817387e-05, 'samples': 25734528, 'steps': 134033, 'loss/train': 1.1615104675292969} 11/07/2021 16:04:33 - INFO - __main__ - Step 134035: {'lr': 1.4220591847626974e-05, 'samples': 25734720, 'steps': 134034, 'loss/train': 1.2690976858139038} 11/07/2021 16:04:33 - INFO - __main__ - Step 134036: {'lr': 1.4218827624675134e-05, 'samples': 25734912, 'steps': 134035, 'loss/train': 1.2410413026809692} 11/07/2021 16:04:35 - INFO - __main__ - Step 134037: {'lr': 1.4217063507962647e-05, 'samples': 25735104, 'steps': 134036, 'loss/train': 1.5124561786651611} 11/07/2021 16:04:35 - INFO - __main__ - Step 134038: {'lr': 1.4215299497490314e-05, 'samples': 25735296, 'steps': 134037, 'loss/train': 1.0100210905075073} 11/07/2021 16:04:35 - INFO - __main__ - Step 134039: {'lr': 1.4213535593258914e-05, 'samples': 25735488, 'steps': 134038, 'loss/train': 1.2113018035888672} 11/07/2021 16:04:36 - INFO - __main__ - Step 134040: {'lr': 1.4211771795269279e-05, 'samples': 25735680, 'steps': 134039, 'loss/train': 1.2293215990066528} 11/07/2021 16:04:36 - INFO - __main__ - Step 134041: {'lr': 1.4210008103522159e-05, 'samples': 25735872, 'steps': 134040, 'loss/train': 0.05420321971178055} 11/07/2021 16:04:37 - INFO - __main__ - Step 134042: {'lr': 1.4208244518018387e-05, 'samples': 25736064, 'steps': 134041, 'loss/train': 0.8434706926345825} 11/07/2021 16:04:37 - INFO - __main__ - Step 134043: {'lr': 1.420648103875874e-05, 'samples': 25736256, 'steps': 134042, 'loss/train': 1.4742835760116577} 11/07/2021 16:04:38 - INFO - __main__ - Step 134044: {'lr': 1.4204717665744049e-05, 'samples': 25736448, 'steps': 134043, 'loss/train': 0.7452788949012756} 11/07/2021 16:04:38 - INFO - __main__ - Step 134045: {'lr': 1.4202954398975038e-05, 'samples': 25736640, 'steps': 134044, 'loss/train': 0.9187545776367188} 11/07/2021 16:04:38 - INFO - __main__ - Step 134046: {'lr': 1.4201191238452537e-05, 'samples': 25736832, 'steps': 134045, 'loss/train': 1.256571650505066} 11/07/2021 16:04:39 - INFO - __main__ - Step 134047: {'lr': 1.4199428184177326e-05, 'samples': 25737024, 'steps': 134046, 'loss/train': 1.0076136589050293} 11/07/2021 16:04:40 - INFO - __main__ - Step 134048: {'lr': 1.4197665236150237e-05, 'samples': 25737216, 'steps': 134047, 'loss/train': 1.249436855316162} 11/07/2021 16:04:40 - INFO - __main__ - Step 134049: {'lr': 1.4195902394372045e-05, 'samples': 25737408, 'steps': 134048, 'loss/train': 1.2223883867263794} 11/07/2021 16:04:40 - INFO - __main__ - Step 134050: {'lr': 1.419413965884353e-05, 'samples': 25737600, 'steps': 134049, 'loss/train': 1.479224443435669} 11/07/2021 16:04:41 - INFO - __main__ - Step 134051: {'lr': 1.4192377029565496e-05, 'samples': 25737792, 'steps': 134050, 'loss/train': 1.2316583395004272} 11/07/2021 16:04:41 - INFO - __main__ - Step 134052: {'lr': 1.4190614506538719e-05, 'samples': 25737984, 'steps': 134051, 'loss/train': 1.4644757509231567} 11/07/2021 16:04:42 - INFO - __main__ - Step 134053: {'lr': 1.4188852089764031e-05, 'samples': 25738176, 'steps': 134052, 'loss/train': 1.216347336769104} 11/07/2021 16:04:43 - INFO - __main__ - Step 134054: {'lr': 1.4187089779242212e-05, 'samples': 25738368, 'steps': 134053, 'loss/train': 1.3015353679656982} 11/07/2021 16:04:43 - INFO - __main__ - Step 134055: {'lr': 1.4185327574974094e-05, 'samples': 25738560, 'steps': 134054, 'loss/train': 1.5504554510116577} 11/07/2021 16:04:43 - INFO - __main__ - Step 134056: {'lr': 1.418356547696037e-05, 'samples': 25738752, 'steps': 134055, 'loss/train': 0.7510446310043335} 11/07/2021 16:04:44 - INFO - __main__ - Step 134057: {'lr': 1.4181803485201899e-05, 'samples': 25738944, 'steps': 134056, 'loss/train': 1.3634963035583496} 11/07/2021 16:04:45 - INFO - __main__ - Step 134058: {'lr': 1.418004159969946e-05, 'samples': 25739136, 'steps': 134057, 'loss/train': 1.1950920820236206} 11/07/2021 16:04:45 - INFO - __main__ - Step 134059: {'lr': 1.4178279820453887e-05, 'samples': 25739328, 'steps': 134058, 'loss/train': 1.1412088871002197} 11/07/2021 16:04:45 - INFO - __main__ - Step 134060: {'lr': 1.4176518147465927e-05, 'samples': 25739520, 'steps': 134059, 'loss/train': 1.2970601320266724} 11/07/2021 16:04:46 - INFO - __main__ - Step 134061: {'lr': 1.4174756580736387e-05, 'samples': 25739712, 'steps': 134060, 'loss/train': 1.353571891784668} 11/07/2021 16:04:46 - INFO - __main__ - Step 134062: {'lr': 1.4172995120266042e-05, 'samples': 25739904, 'steps': 134061, 'loss/train': 0.4248502552509308} 11/07/2021 16:04:47 - INFO - __main__ - Step 134063: {'lr': 1.4171233766055724e-05, 'samples': 25740096, 'steps': 134062, 'loss/train': 1.2603676319122314} 11/07/2021 16:04:47 - INFO - __main__ - Step 134064: {'lr': 1.4169472518106214e-05, 'samples': 25740288, 'steps': 134063, 'loss/train': 0.9233127236366272} 11/07/2021 16:04:48 - INFO - __main__ - Step 134065: {'lr': 1.4167711376418313e-05, 'samples': 25740480, 'steps': 134064, 'loss/train': 1.1177595853805542} 11/07/2021 16:04:48 - INFO - __main__ - Step 134066: {'lr': 1.41659503409928e-05, 'samples': 25740672, 'steps': 134065, 'loss/train': 0.9967106580734253} 11/07/2021 16:04:48 - INFO - __main__ - Step 134067: {'lr': 1.416418941183048e-05, 'samples': 25740864, 'steps': 134066, 'loss/train': 1.2056430578231812} 11/07/2021 16:04:50 - INFO - __main__ - Step 134068: {'lr': 1.4162428588932103e-05, 'samples': 25741056, 'steps': 134067, 'loss/train': 0.9687670469284058} 11/07/2021 16:04:50 - INFO - __main__ - Step 134069: {'lr': 1.41606678722985e-05, 'samples': 25741248, 'steps': 134068, 'loss/train': 1.3786942958831787} 11/07/2021 16:04:50 - INFO - __main__ - Step 134070: {'lr': 1.4158907261930477e-05, 'samples': 25741440, 'steps': 134069, 'loss/train': 1.2693426609039307} 11/07/2021 16:04:51 - INFO - __main__ - Step 134071: {'lr': 1.4157146757828809e-05, 'samples': 25741632, 'steps': 134070, 'loss/train': 1.0361521244049072} 11/07/2021 16:04:51 - INFO - __main__ - Step 134072: {'lr': 1.4155386359994276e-05, 'samples': 25741824, 'steps': 134071, 'loss/train': 1.4863399267196655} 11/07/2021 16:04:52 - INFO - __main__ - Step 134073: {'lr': 1.4153626068427683e-05, 'samples': 25742016, 'steps': 134072, 'loss/train': 2.0024027824401855} 11/07/2021 16:04:53 - INFO - __main__ - Step 134074: {'lr': 1.4151865883129861e-05, 'samples': 25742208, 'steps': 134073, 'loss/train': 1.3732513189315796} 11/07/2021 16:04:53 - INFO - __main__ - Step 134075: {'lr': 1.4150105804101532e-05, 'samples': 25742400, 'steps': 134074, 'loss/train': 1.2358098030090332} 11/07/2021 16:04:53 - INFO - __main__ - Step 134076: {'lr': 1.4148345831343584e-05, 'samples': 25742592, 'steps': 134075, 'loss/train': 1.8574950695037842} 11/07/2021 16:04:54 - INFO - __main__ - Step 134077: {'lr': 1.4146585964856713e-05, 'samples': 25742784, 'steps': 134076, 'loss/train': 1.6004445552825928} 11/07/2021 16:04:55 - INFO - __main__ - Step 134078: {'lr': 1.4144826204641747e-05, 'samples': 25742976, 'steps': 134077, 'loss/train': 1.4553933143615723} 11/07/2021 16:04:55 - INFO - __main__ - Step 134079: {'lr': 1.4143066550699469e-05, 'samples': 25743168, 'steps': 134078, 'loss/train': 1.1959584951400757} 11/07/2021 16:04:56 - INFO - __main__ - Step 134080: {'lr': 1.4141307003030706e-05, 'samples': 25743360, 'steps': 134079, 'loss/train': 0.49403271079063416} 11/07/2021 16:04:56 - INFO - __main__ - Step 134081: {'lr': 1.4139547561636213e-05, 'samples': 25743552, 'steps': 134080, 'loss/train': 0.7147261500358582} 11/07/2021 16:04:56 - INFO - __main__ - Step 134082: {'lr': 1.4137788226516818e-05, 'samples': 25743744, 'steps': 134081, 'loss/train': 1.4791712760925293} 11/07/2021 16:04:57 - INFO - __main__ - Step 134083: {'lr': 1.41360289976733e-05, 'samples': 25743936, 'steps': 134082, 'loss/train': 1.05892813205719} 11/07/2021 16:04:58 - INFO - __main__ - Step 134084: {'lr': 1.4134269875106438e-05, 'samples': 25744128, 'steps': 134083, 'loss/train': 0.4783177971839905} 11/07/2021 16:04:58 - INFO - __main__ - Step 134085: {'lr': 1.4132510858817032e-05, 'samples': 25744320, 'steps': 134084, 'loss/train': 0.9136859774589539} 11/07/2021 16:04:59 - INFO - __main__ - Step 134086: {'lr': 1.4130751948805865e-05, 'samples': 25744512, 'steps': 134085, 'loss/train': 1.3269996643066406} 11/07/2021 16:04:59 - INFO - __main__ - Step 134087: {'lr': 1.4128993145073764e-05, 'samples': 25744704, 'steps': 134086, 'loss/train': 0.9855039119720459} 11/07/2021 16:04:59 - INFO - __main__ - Step 134088: {'lr': 1.4127234447621483e-05, 'samples': 25744896, 'steps': 134087, 'loss/train': 1.437451958656311} 11/07/2021 16:05:00 - INFO - __main__ - Step 134089: {'lr': 1.4125475856449853e-05, 'samples': 25745088, 'steps': 134088, 'loss/train': 1.4234801530838013} 11/07/2021 16:05:01 - INFO - __main__ - Step 134090: {'lr': 1.4123717371559652e-05, 'samples': 25745280, 'steps': 134089, 'loss/train': 1.4323482513427734} 11/07/2021 16:05:01 - INFO - __main__ - Step 134091: {'lr': 1.4121958992951628e-05, 'samples': 25745472, 'steps': 134090, 'loss/train': 1.619430661201477} 11/07/2021 16:05:01 - INFO - __main__ - Step 134092: {'lr': 1.4120200720626642e-05, 'samples': 25745664, 'steps': 134091, 'loss/train': 1.474448800086975} 11/07/2021 16:05:02 - INFO - __main__ - Step 134093: {'lr': 1.4118442554585415e-05, 'samples': 25745856, 'steps': 134092, 'loss/train': 1.0784099102020264} 11/07/2021 16:05:03 - INFO - __main__ - Step 134094: {'lr': 1.4116684494828807e-05, 'samples': 25746048, 'steps': 134093, 'loss/train': 1.1095943450927734} 11/07/2021 16:05:03 - INFO - __main__ - Step 134095: {'lr': 1.411492654135757e-05, 'samples': 25746240, 'steps': 134094, 'loss/train': 1.6973845958709717} 11/07/2021 16:05:03 - INFO - __main__ - Step 134096: {'lr': 1.4113168694172508e-05, 'samples': 25746432, 'steps': 134095, 'loss/train': 1.4105393886566162} 11/07/2021 16:05:04 - INFO - __main__ - Step 134097: {'lr': 1.4111410953274422e-05, 'samples': 25746624, 'steps': 134096, 'loss/train': 3.083177328109741} 11/07/2021 16:05:04 - INFO - __main__ - Step 134098: {'lr': 1.4109653318664067e-05, 'samples': 25746816, 'steps': 134097, 'loss/train': 1.4034593105316162} 11/07/2021 16:05:05 - INFO - __main__ - Step 134099: {'lr': 1.4107895790342273e-05, 'samples': 25747008, 'steps': 134098, 'loss/train': 0.8132426738739014} 11/07/2021 16:05:06 - INFO - __main__ - Step 134100: {'lr': 1.4106138368309845e-05, 'samples': 25747200, 'steps': 134099, 'loss/train': 1.0929720401763916} 11/07/2021 16:05:06 - INFO - __main__ - Step 134101: {'lr': 1.4104381052567534e-05, 'samples': 25747392, 'steps': 134100, 'loss/train': 0.9126145243644714} 11/07/2021 16:05:06 - INFO - __main__ - Step 134102: {'lr': 1.410262384311614e-05, 'samples': 25747584, 'steps': 134101, 'loss/train': 0.9893049001693726} 11/07/2021 16:05:07 - INFO - __main__ - Step 134103: {'lr': 1.4100866739956503e-05, 'samples': 25747776, 'steps': 134102, 'loss/train': 1.4262622594833374} 11/07/2021 16:05:07 - INFO - __main__ - Step 134104: {'lr': 1.409910974308934e-05, 'samples': 25747968, 'steps': 134103, 'loss/train': 1.2506906986236572} 11/07/2021 16:05:08 - INFO - __main__ - Step 134105: {'lr': 1.4097352852515482e-05, 'samples': 25748160, 'steps': 134104, 'loss/train': 0.7969270348548889} 11/07/2021 16:05:08 - INFO - __main__ - Step 134106: {'lr': 1.409559606823571e-05, 'samples': 25748352, 'steps': 134105, 'loss/train': 1.062925934791565} 11/07/2021 16:05:09 - INFO - __main__ - Step 134107: {'lr': 1.4093839390250856e-05, 'samples': 25748544, 'steps': 134106, 'loss/train': 1.7982518672943115} 11/07/2021 16:05:09 - INFO - __main__ - Step 134108: {'lr': 1.4092082818561642e-05, 'samples': 25748736, 'steps': 134107, 'loss/train': 1.0079528093338013} 11/07/2021 16:05:09 - INFO - __main__ - Step 134109: {'lr': 1.4090326353168897e-05, 'samples': 25748928, 'steps': 134108, 'loss/train': 1.6892668008804321} 11/07/2021 16:05:11 - INFO - __main__ - Step 134110: {'lr': 1.408856999407343e-05, 'samples': 25749120, 'steps': 134109, 'loss/train': 1.149394154548645} 11/07/2021 16:05:11 - INFO - __main__ - Step 134111: {'lr': 1.4086813741275989e-05, 'samples': 25749312, 'steps': 134110, 'loss/train': 1.5041868686676025} 11/07/2021 16:05:11 - INFO - __main__ - Step 134112: {'lr': 1.4085057594777407e-05, 'samples': 25749504, 'steps': 134111, 'loss/train': 0.9419237375259399} 11/07/2021 16:05:12 - INFO - __main__ - Step 134113: {'lr': 1.4083301554578431e-05, 'samples': 25749696, 'steps': 134112, 'loss/train': 1.319014072418213} 11/07/2021 16:05:12 - INFO - __main__ - Step 134114: {'lr': 1.4081545620679925e-05, 'samples': 25749888, 'steps': 134113, 'loss/train': 1.055820107460022} 11/07/2021 16:05:13 - INFO - __main__ - Step 134115: {'lr': 1.4079789793082609e-05, 'samples': 25750080, 'steps': 134114, 'loss/train': 1.601607084274292} 11/07/2021 16:05:13 - INFO - __main__ - Step 134116: {'lr': 1.4078034071787289e-05, 'samples': 25750272, 'steps': 134115, 'loss/train': 1.3305323123931885} 11/07/2021 16:05:14 - INFO - __main__ - Step 134117: {'lr': 1.4076278456794822e-05, 'samples': 25750464, 'steps': 134116, 'loss/train': 1.5219413042068481} 11/07/2021 16:05:14 - INFO - __main__ - Step 134118: {'lr': 1.4074522948105878e-05, 'samples': 25750656, 'steps': 134117, 'loss/train': 1.5533391237258911} 11/07/2021 16:05:14 - INFO - __main__ - Step 134119: {'lr': 1.4072767545721344e-05, 'samples': 25750848, 'steps': 134118, 'loss/train': 1.190872073173523} 11/07/2021 16:05:16 - INFO - __main__ - Step 134120: {'lr': 1.4071012249641967e-05, 'samples': 25751040, 'steps': 134119, 'loss/train': 1.2492201328277588} 11/07/2021 16:05:16 - INFO - __main__ - Step 134121: {'lr': 1.4069257059868556e-05, 'samples': 25751232, 'steps': 134120, 'loss/train': 1.357456922531128} 11/07/2021 16:05:16 - INFO - __main__ - Step 134122: {'lr': 1.4067501976401887e-05, 'samples': 25751424, 'steps': 134121, 'loss/train': 0.4921827018260956} 11/07/2021 16:05:17 - INFO - __main__ - Step 134123: {'lr': 1.4065746999242763e-05, 'samples': 25751616, 'steps': 134122, 'loss/train': 1.2063244581222534} 11/07/2021 16:05:17 - INFO - __main__ - Step 134124: {'lr': 1.406399212839199e-05, 'samples': 25751808, 'steps': 134123, 'loss/train': 0.08082025498151779} 11/07/2021 16:05:18 - INFO - __main__ - Step 134125: {'lr': 1.4062237363850316e-05, 'samples': 25752000, 'steps': 134124, 'loss/train': 1.3905552625656128} 11/07/2021 16:05:18 - INFO - __main__ - Step 134126: {'lr': 1.4060482705618577e-05, 'samples': 25752192, 'steps': 134125, 'loss/train': 1.1722406148910522} 11/07/2021 16:05:19 - INFO - __main__ - Step 134127: {'lr': 1.4058728153697548e-05, 'samples': 25752384, 'steps': 134126, 'loss/train': 1.3220438957214355} 11/07/2021 16:05:19 - INFO - __main__ - Step 134128: {'lr': 1.4056973708088006e-05, 'samples': 25752576, 'steps': 134127, 'loss/train': 1.4993009567260742} 11/07/2021 16:05:19 - INFO - __main__ - Step 134129: {'lr': 1.4055219368790728e-05, 'samples': 25752768, 'steps': 134128, 'loss/train': 0.8314728736877441} 11/07/2021 16:05:21 - INFO - __main__ - Step 134130: {'lr': 1.4053465135806603e-05, 'samples': 25752960, 'steps': 134129, 'loss/train': 1.1536979675292969} 11/07/2021 16:05:21 - INFO - __main__ - Step 134131: {'lr': 1.4051711009136297e-05, 'samples': 25753152, 'steps': 134130, 'loss/train': 1.3065558671951294} 11/07/2021 16:05:21 - INFO - __main__ - Step 134132: {'lr': 1.4049956988780644e-05, 'samples': 25753344, 'steps': 134131, 'loss/train': 1.2379213571548462} 11/07/2021 16:05:22 - INFO - __main__ - Step 134133: {'lr': 1.4048203074740417e-05, 'samples': 25753536, 'steps': 134132, 'loss/train': 1.471121072769165} 11/07/2021 16:05:22 - INFO - __main__ - Step 134134: {'lr': 1.4046449267016453e-05, 'samples': 25753728, 'steps': 134133, 'loss/train': 1.2595888376235962} 11/07/2021 16:05:22 - INFO - __main__ - Step 134135: {'lr': 1.4044695565609528e-05, 'samples': 25753920, 'steps': 134134, 'loss/train': 1.0910136699676514} 11/07/2021 16:05:23 - INFO - __main__ - Step 134136: {'lr': 1.4042941970520418e-05, 'samples': 25754112, 'steps': 134135, 'loss/train': 5.6962504386901855} 11/07/2021 16:05:24 - INFO - __main__ - Step 134137: {'lr': 1.40411884817499e-05, 'samples': 25754304, 'steps': 134136, 'loss/train': 1.3912343978881836} 11/07/2021 16:05:24 - INFO - __main__ - Step 134138: {'lr': 1.4039435099298781e-05, 'samples': 25754496, 'steps': 134137, 'loss/train': 1.3686513900756836} 11/07/2021 16:05:25 - INFO - __main__ - Step 134139: {'lr': 1.4037681823167864e-05, 'samples': 25754688, 'steps': 134138, 'loss/train': 0.8956264853477478} 11/07/2021 16:05:25 - INFO - __main__ - Step 134140: {'lr': 1.4035928653357926e-05, 'samples': 25754880, 'steps': 134139, 'loss/train': 1.1107444763183594} 11/07/2021 16:05:25 - INFO - __main__ - Step 134141: {'lr': 1.4034175589869747e-05, 'samples': 25755072, 'steps': 134140, 'loss/train': 0.9930597543716431} 11/07/2021 16:05:27 - INFO - __main__ - Step 134142: {'lr': 1.4032422632704157e-05, 'samples': 25755264, 'steps': 134141, 'loss/train': 0.047563280910253525} 11/07/2021 16:05:27 - INFO - __main__ - Step 134143: {'lr': 1.4030669781861905e-05, 'samples': 25755456, 'steps': 134142, 'loss/train': 0.7530154585838318} 11/07/2021 16:05:28 - INFO - __main__ - Step 134144: {'lr': 1.4028917037343797e-05, 'samples': 25755648, 'steps': 134143, 'loss/train': 1.1823197603225708} 11/07/2021 16:05:28 - INFO - __main__ - Step 134145: {'lr': 1.402716439915061e-05, 'samples': 25755840, 'steps': 134144, 'loss/train': 1.73277747631073} 11/07/2021 16:05:28 - INFO - __main__ - Step 134146: {'lr': 1.4025411867283123e-05, 'samples': 25756032, 'steps': 134145, 'loss/train': 1.442949652671814} 11/07/2021 16:05:29 - INFO - __main__ - Step 134147: {'lr': 1.4023659441742165e-05, 'samples': 25756224, 'steps': 134146, 'loss/train': 1.2998461723327637} 11/07/2021 16:05:30 - INFO - __main__ - Step 134148: {'lr': 1.4021907122528487e-05, 'samples': 25756416, 'steps': 134147, 'loss/train': 1.252145767211914} 11/07/2021 16:05:30 - INFO - __main__ - Step 134149: {'lr': 1.4020154909642896e-05, 'samples': 25756608, 'steps': 134148, 'loss/train': 1.509015679359436} 11/07/2021 16:05:31 - INFO - __main__ - Step 134150: {'lr': 1.4018402803086195e-05, 'samples': 25756800, 'steps': 134149, 'loss/train': 1.1366254091262817} 11/07/2021 16:05:31 - INFO - __main__ - Step 134151: {'lr': 1.401665080285916e-05, 'samples': 25756992, 'steps': 134150, 'loss/train': 1.0019334554672241} 11/07/2021 16:05:31 - INFO - __main__ - Step 134152: {'lr': 1.401489890896257e-05, 'samples': 25757184, 'steps': 134151, 'loss/train': 1.329712152481079} 11/07/2021 16:05:32 - INFO - __main__ - Step 134153: {'lr': 1.401314712139723e-05, 'samples': 25757376, 'steps': 134152, 'loss/train': 0.977933406829834} 11/07/2021 16:05:33 - INFO - __main__ - Step 134154: {'lr': 1.4011395440163916e-05, 'samples': 25757568, 'steps': 134153, 'loss/train': 0.582118570804596} 11/07/2021 16:05:33 - INFO - __main__ - Step 134155: {'lr': 1.4009643865263432e-05, 'samples': 25757760, 'steps': 134154, 'loss/train': 1.2186752557754517} 11/07/2021 16:05:33 - INFO - __main__ - Step 134156: {'lr': 1.4007892396696586e-05, 'samples': 25757952, 'steps': 134155, 'loss/train': 1.335908055305481} 11/07/2021 16:05:34 - INFO - __main__ - Step 134157: {'lr': 1.4006141034464155e-05, 'samples': 25758144, 'steps': 134156, 'loss/train': 1.426626443862915} 11/07/2021 16:05:34 - INFO - __main__ - Step 134158: {'lr': 1.4004389778566857e-05, 'samples': 25758336, 'steps': 134157, 'loss/train': 1.1279326677322388} 11/07/2021 16:05:35 - INFO - __main__ - Step 134159: {'lr': 1.4002638629005582e-05, 'samples': 25758528, 'steps': 134158, 'loss/train': 1.1437768936157227} 11/07/2021 16:05:35 - INFO - __main__ - Step 134160: {'lr': 1.4000887585781052e-05, 'samples': 25758720, 'steps': 134159, 'loss/train': 1.1611006259918213} 11/07/2021 16:05:36 - INFO - __main__ - Step 134161: {'lr': 1.3999136648894073e-05, 'samples': 25758912, 'steps': 134160, 'loss/train': 0.9535543918609619} 11/07/2021 16:05:36 - INFO - __main__ - Step 134162: {'lr': 1.3997385818345449e-05, 'samples': 25759104, 'steps': 134161, 'loss/train': 1.0386255979537964} 11/07/2021 16:05:37 - INFO - __main__ - Step 134163: {'lr': 1.3995635094135983e-05, 'samples': 25759296, 'steps': 134162, 'loss/train': 1.5721272230148315} 11/07/2021 16:05:38 - INFO - __main__ - Step 134164: {'lr': 1.3993884476266427e-05, 'samples': 25759488, 'steps': 134163, 'loss/train': 0.9785748720169067} 11/07/2021 16:05:38 - INFO - __main__ - Step 134165: {'lr': 1.3992133964737585e-05, 'samples': 25759680, 'steps': 134164, 'loss/train': 1.407926082611084} 11/07/2021 16:05:38 - INFO - __main__ - Step 134166: {'lr': 1.3990383559550235e-05, 'samples': 25759872, 'steps': 134165, 'loss/train': 1.4611384868621826} 11/07/2021 16:05:39 - INFO - __main__ - Step 134167: {'lr': 1.398863326070518e-05, 'samples': 25760064, 'steps': 134166, 'loss/train': 1.3117691278457642} 11/07/2021 16:05:39 - INFO - __main__ - Step 134168: {'lr': 1.39868830682032e-05, 'samples': 25760256, 'steps': 134167, 'loss/train': 1.2696588039398193} 11/07/2021 16:05:40 - INFO - __main__ - Step 134169: {'lr': 1.3985132982045095e-05, 'samples': 25760448, 'steps': 134168, 'loss/train': 1.4475955963134766} 11/07/2021 16:05:40 - INFO - __main__ - Step 134170: {'lr': 1.3983383002231704e-05, 'samples': 25760640, 'steps': 134169, 'loss/train': 1.3164805173873901} 11/07/2021 16:05:41 - INFO - __main__ - Step 134171: {'lr': 1.3981633128763687e-05, 'samples': 25760832, 'steps': 134170, 'loss/train': 1.3534351587295532} 11/07/2021 16:05:41 - INFO - __main__ - Step 134172: {'lr': 1.3979883361641938e-05, 'samples': 25761024, 'steps': 134171, 'loss/train': 1.1198279857635498} 11/07/2021 16:05:41 - INFO - __main__ - Step 134173: {'lr': 1.3978133700867202e-05, 'samples': 25761216, 'steps': 134172, 'loss/train': 0.9415073394775391} 11/07/2021 16:05:43 - INFO - __main__ - Step 134174: {'lr': 1.3976384146440257e-05, 'samples': 25761408, 'steps': 134173, 'loss/train': 1.7529914379119873} 11/07/2021 16:05:43 - INFO - __main__ - Step 134175: {'lr': 1.3974634698361937e-05, 'samples': 25761600, 'steps': 134174, 'loss/train': 1.2413049936294556} 11/07/2021 16:05:43 - INFO - __main__ - Step 134176: {'lr': 1.3972885356632992e-05, 'samples': 25761792, 'steps': 134175, 'loss/train': 1.1386181116104126} 11/07/2021 16:05:44 - INFO - __main__ - Step 134177: {'lr': 1.3971136121254224e-05, 'samples': 25761984, 'steps': 134176, 'loss/train': 5.871616363525391} 11/07/2021 16:05:44 - INFO - __main__ - Step 134178: {'lr': 1.3969386992226413e-05, 'samples': 25762176, 'steps': 134177, 'loss/train': 1.182376503944397} 11/07/2021 16:05:45 - INFO - __main__ - Step 134179: {'lr': 1.3967637969550362e-05, 'samples': 25762368, 'steps': 134178, 'loss/train': 1.5146232843399048} 11/07/2021 16:05:45 - INFO - __main__ - Step 134180: {'lr': 1.3965889053226849e-05, 'samples': 25762560, 'steps': 134179, 'loss/train': 1.2270283699035645} 11/07/2021 16:05:46 - INFO - __main__ - Step 134181: {'lr': 1.3964140243256651e-05, 'samples': 25762752, 'steps': 134180, 'loss/train': 0.13277150690555573} 11/07/2021 16:05:46 - INFO - __main__ - Step 134182: {'lr': 1.39623915396406e-05, 'samples': 25762944, 'steps': 134181, 'loss/train': 1.2208759784698486} 11/07/2021 16:05:47 - INFO - __main__ - Step 134183: {'lr': 1.396064294237942e-05, 'samples': 25763136, 'steps': 134182, 'loss/train': 1.4534132480621338} 11/07/2021 16:05:47 - INFO - __main__ - Step 134184: {'lr': 1.3958894451473997e-05, 'samples': 25763328, 'steps': 134183, 'loss/train': 1.2957074642181396} 11/07/2021 16:05:48 - INFO - __main__ - Step 134185: {'lr': 1.3957146066924998e-05, 'samples': 25763520, 'steps': 134184, 'loss/train': 1.3952809572219849} 11/07/2021 16:05:48 - INFO - __main__ - Step 134186: {'lr': 1.3955397788733281e-05, 'samples': 25763712, 'steps': 134185, 'loss/train': 1.0891088247299194} 11/07/2021 16:05:49 - INFO - __main__ - Step 134187: {'lr': 1.39536496168996e-05, 'samples': 25763904, 'steps': 134186, 'loss/train': 1.3143409490585327} 11/07/2021 16:05:49 - INFO - __main__ - Step 134188: {'lr': 1.3951901551424783e-05, 'samples': 25764096, 'steps': 134187, 'loss/train': 1.598029375076294} 11/07/2021 16:05:49 - INFO - __main__ - Step 134189: {'lr': 1.3950153592309583e-05, 'samples': 25764288, 'steps': 134188, 'loss/train': 1.2221779823303223} 11/07/2021 16:05:51 - INFO - __main__ - Step 134190: {'lr': 1.3948405739554804e-05, 'samples': 25764480, 'steps': 134189, 'loss/train': 1.3670856952667236} 11/07/2021 16:05:51 - INFO - __main__ - Step 134191: {'lr': 1.3946657993161222e-05, 'samples': 25764672, 'steps': 134190, 'loss/train': 1.2548116445541382} 11/07/2021 16:05:51 - INFO - __main__ - Step 134192: {'lr': 1.3944910353129642e-05, 'samples': 25764864, 'steps': 134191, 'loss/train': 1.3345789909362793} 11/07/2021 16:05:52 - INFO - __main__ - Step 134193: {'lr': 1.3943162819460842e-05, 'samples': 25765056, 'steps': 134192, 'loss/train': 0.33744028210639954} 11/07/2021 16:05:52 - INFO - __main__ - Step 134194: {'lr': 1.3941415392155627e-05, 'samples': 25765248, 'steps': 134193, 'loss/train': 0.5550865530967712} 11/07/2021 16:05:53 - INFO - __main__ - Step 134195: {'lr': 1.3939668071214744e-05, 'samples': 25765440, 'steps': 134194, 'loss/train': 1.3900657892227173} 11/07/2021 16:05:53 - INFO - __main__ - Step 134196: {'lr': 1.3937920856639003e-05, 'samples': 25765632, 'steps': 134195, 'loss/train': 0.9516313672065735} 11/07/2021 16:05:54 - INFO - __main__ - Step 134197: {'lr': 1.3936173748429259e-05, 'samples': 25765824, 'steps': 134196, 'loss/train': 0.568999707698822} 11/07/2021 16:05:54 - INFO - __main__ - Step 134198: {'lr': 1.3934426746586183e-05, 'samples': 25766016, 'steps': 134197, 'loss/train': 1.3804888725280762} 11/07/2021 16:05:54 - INFO - __main__ - Step 134199: {'lr': 1.3932679851110602e-05, 'samples': 25766208, 'steps': 134198, 'loss/train': 1.5186368227005005} 11/07/2021 16:05:56 - INFO - __main__ - Step 134200: {'lr': 1.3930933062003299e-05, 'samples': 25766400, 'steps': 134199, 'loss/train': 1.667466163635254} 11/07/2021 16:05:56 - INFO - __main__ - Step 134201: {'lr': 1.3929186379265101e-05, 'samples': 25766592, 'steps': 134200, 'loss/train': 0.9414048194885254} 11/07/2021 16:05:56 - INFO - __main__ - Step 134202: {'lr': 1.3927439802896762e-05, 'samples': 25766784, 'steps': 134201, 'loss/train': 0.9473355412483215} 11/07/2021 16:05:57 - INFO - __main__ - Step 134203: {'lr': 1.3925693332899058e-05, 'samples': 25766976, 'steps': 134202, 'loss/train': 1.2123419046401978} 11/07/2021 16:05:57 - INFO - __main__ - Step 134204: {'lr': 1.3923946969272822e-05, 'samples': 25767168, 'steps': 134203, 'loss/train': 1.0816165208816528} 11/07/2021 16:05:58 - INFO - __main__ - Step 134205: {'lr': 1.3922200712018801e-05, 'samples': 25767360, 'steps': 134204, 'loss/train': 0.607019305229187} 11/07/2021 16:05:58 - INFO - __main__ - Step 134206: {'lr': 1.3920454561137775e-05, 'samples': 25767552, 'steps': 134205, 'loss/train': 1.4251958131790161} 11/07/2021 16:05:59 - INFO - __main__ - Step 134207: {'lr': 1.3918708516630573e-05, 'samples': 25767744, 'steps': 134206, 'loss/train': 1.4518295526504517} 11/07/2021 16:05:59 - INFO - __main__ - Step 134208: {'lr': 1.391696257849795e-05, 'samples': 25767936, 'steps': 134207, 'loss/train': 1.4926568269729614} 11/07/2021 16:06:00 - INFO - __main__ - Step 134209: {'lr': 1.3915216746740705e-05, 'samples': 25768128, 'steps': 134208, 'loss/train': 1.9371402263641357} 11/07/2021 16:06:00 - INFO - __main__ - Step 134210: {'lr': 1.391347102135962e-05, 'samples': 25768320, 'steps': 134209, 'loss/train': 1.0765314102172852} 11/07/2021 16:06:01 - INFO - __main__ - Step 134211: {'lr': 1.3911725402355496e-05, 'samples': 25768512, 'steps': 134210, 'loss/train': 1.0388661623001099} 11/07/2021 16:06:01 - INFO - __main__ - Step 134212: {'lr': 1.3909979889729084e-05, 'samples': 25768704, 'steps': 134211, 'loss/train': 1.8726744651794434} 11/07/2021 16:06:02 - INFO - __main__ - Step 134213: {'lr': 1.3908234483481219e-05, 'samples': 25768896, 'steps': 134212, 'loss/train': 0.9499653577804565} 11/07/2021 16:06:02 - INFO - __main__ - Step 134214: {'lr': 1.3906489183612619e-05, 'samples': 25769088, 'steps': 134213, 'loss/train': 1.1663066148757935} 11/07/2021 16:06:02 - INFO - __main__ - Step 134215: {'lr': 1.3904743990124146e-05, 'samples': 25769280, 'steps': 134214, 'loss/train': 1.3909904956817627} 11/07/2021 16:06:03 - INFO - __main__ - Step 134216: {'lr': 1.3902998903016523e-05, 'samples': 25769472, 'steps': 134215, 'loss/train': 1.5684036016464233} 11/07/2021 16:06:04 - INFO - __main__ - Step 134217: {'lr': 1.390125392229058e-05, 'samples': 25769664, 'steps': 134216, 'loss/train': 1.1757756471633911} 11/07/2021 16:06:04 - INFO - __main__ - Step 134218: {'lr': 1.3899509047947095e-05, 'samples': 25769856, 'steps': 134217, 'loss/train': 1.7702624797821045} 11/07/2021 16:06:05 - INFO - __main__ - Step 134219: {'lr': 1.3897764279986847e-05, 'samples': 25770048, 'steps': 134218, 'loss/train': 1.1802059412002563} 11/07/2021 16:06:05 - INFO - __main__ - Step 134220: {'lr': 1.389601961841061e-05, 'samples': 25770240, 'steps': 134219, 'loss/train': 1.3563592433929443} 11/07/2021 16:06:06 - INFO - __main__ - Step 134221: {'lr': 1.3894275063219192e-05, 'samples': 25770432, 'steps': 134220, 'loss/train': 1.4050430059432983} 11/07/2021 16:06:06 - INFO - __main__ - Step 134222: {'lr': 1.3892530614413368e-05, 'samples': 25770624, 'steps': 134221, 'loss/train': 1.2481988668441772} 11/07/2021 16:06:07 - INFO - __main__ - Step 134223: {'lr': 1.3890786271993915e-05, 'samples': 25770816, 'steps': 134222, 'loss/train': 1.2645124197006226} 11/07/2021 16:06:07 - INFO - __main__ - Step 134224: {'lr': 1.3889042035961697e-05, 'samples': 25771008, 'steps': 134223, 'loss/train': 1.5573780536651611} 11/07/2021 16:06:07 - INFO - __main__ - Step 134225: {'lr': 1.3887297906317375e-05, 'samples': 25771200, 'steps': 134224, 'loss/train': 1.3587855100631714} 11/07/2021 16:06:08 - INFO - __main__ - Step 134226: {'lr': 1.3885553883061786e-05, 'samples': 25771392, 'steps': 134225, 'loss/train': 1.4033763408660889} 11/07/2021 16:06:09 - INFO - __main__ - Step 134227: {'lr': 1.3883809966195731e-05, 'samples': 25771584, 'steps': 134226, 'loss/train': 1.5971627235412598} 11/07/2021 16:06:09 - INFO - __main__ - Step 134228: {'lr': 1.3882066155719991e-05, 'samples': 25771776, 'steps': 134227, 'loss/train': 0.5745930075645447} 11/07/2021 16:06:09 - INFO - __main__ - Step 134229: {'lr': 1.3880322451635342e-05, 'samples': 25771968, 'steps': 134228, 'loss/train': 1.5823378562927246} 11/07/2021 16:06:10 - INFO - __main__ - Step 134230: {'lr': 1.3878578853942586e-05, 'samples': 25772160, 'steps': 134229, 'loss/train': 1.05757474899292} 11/07/2021 16:06:10 - INFO - __main__ - Step 134231: {'lr': 1.3876835362642504e-05, 'samples': 25772352, 'steps': 134230, 'loss/train': 1.2619560956954956} 11/07/2021 16:06:11 - INFO - __main__ - Step 134232: {'lr': 1.3875091977735871e-05, 'samples': 25772544, 'steps': 134231, 'loss/train': 0.4687250554561615} 11/07/2021 16:06:12 - INFO - __main__ - Step 134233: {'lr': 1.3873348699223465e-05, 'samples': 25772736, 'steps': 134232, 'loss/train': 1.4285602569580078} 11/07/2021 16:06:12 - INFO - __main__ - Step 134234: {'lr': 1.387160552710609e-05, 'samples': 25772928, 'steps': 134233, 'loss/train': 0.7171581387519836} 11/07/2021 16:06:12 - INFO - __main__ - Step 134235: {'lr': 1.3869862461384525e-05, 'samples': 25773120, 'steps': 134234, 'loss/train': 1.6106623411178589} 11/07/2021 16:06:13 - INFO - __main__ - Step 134236: {'lr': 1.3868119502059573e-05, 'samples': 25773312, 'steps': 134235, 'loss/train': 1.2108933925628662} 11/07/2021 16:06:14 - INFO - __main__ - Step 134237: {'lr': 1.3866376649131984e-05, 'samples': 25773504, 'steps': 134236, 'loss/train': 1.4534116983413696} 11/07/2021 16:06:14 - INFO - __main__ - Step 134238: {'lr': 1.386463390260259e-05, 'samples': 25773696, 'steps': 134237, 'loss/train': 1.4149119853973389} 11/07/2021 16:06:14 - INFO - __main__ - Step 134239: {'lr': 1.3862891262472144e-05, 'samples': 25773888, 'steps': 134238, 'loss/train': 1.2988258600234985} 11/07/2021 16:06:15 - INFO - __main__ - Step 134240: {'lr': 1.3861148728741418e-05, 'samples': 25774080, 'steps': 134239, 'loss/train': 0.9676976203918457} 11/07/2021 16:06:15 - INFO - __main__ - Step 134241: {'lr': 1.385940630141122e-05, 'samples': 25774272, 'steps': 134240, 'loss/train': 1.473081350326538} 11/07/2021 16:06:16 - INFO - __main__ - Step 134242: {'lr': 1.3857663980482299e-05, 'samples': 25774464, 'steps': 134241, 'loss/train': 1.5701286792755127} 11/07/2021 16:06:17 - INFO - __main__ - Step 134243: {'lr': 1.3855921765955514e-05, 'samples': 25774656, 'steps': 134242, 'loss/train': 1.544694185256958} 11/07/2021 16:06:17 - INFO - __main__ - Step 134244: {'lr': 1.3854179657831589e-05, 'samples': 25774848, 'steps': 134243, 'loss/train': 1.2205162048339844} 11/07/2021 16:06:17 - INFO - __main__ - Step 134245: {'lr': 1.3852437656111327e-05, 'samples': 25775040, 'steps': 134244, 'loss/train': 1.3907808065414429} 11/07/2021 16:06:18 - INFO - __main__ - Step 134246: {'lr': 1.3850695760795507e-05, 'samples': 25775232, 'steps': 134245, 'loss/train': 1.3846954107284546} 11/07/2021 16:06:19 - INFO - __main__ - Step 134247: {'lr': 1.3848953971884932e-05, 'samples': 25775424, 'steps': 134246, 'loss/train': 1.0010327100753784} 11/07/2021 16:06:19 - INFO - __main__ - Step 134248: {'lr': 1.3847212289380351e-05, 'samples': 25775616, 'steps': 134247, 'loss/train': 1.5827871561050415} 11/07/2021 16:06:19 - INFO - __main__ - Step 134249: {'lr': 1.3845470713282599e-05, 'samples': 25775808, 'steps': 134248, 'loss/train': 1.474801778793335} 11/07/2021 16:06:20 - INFO - __main__ - Step 134250: {'lr': 1.3843729243592424e-05, 'samples': 25776000, 'steps': 134249, 'loss/train': 1.0419179201126099} 11/07/2021 16:06:20 - INFO - __main__ - Step 134251: {'lr': 1.384198788031063e-05, 'samples': 25776192, 'steps': 134250, 'loss/train': 1.4511815309524536} 11/07/2021 16:06:21 - INFO - __main__ - Step 134252: {'lr': 1.3840246623437997e-05, 'samples': 25776384, 'steps': 134251, 'loss/train': 1.358208417892456} 11/07/2021 16:06:22 - INFO - __main__ - Step 134253: {'lr': 1.3838505472975271e-05, 'samples': 25776576, 'steps': 134252, 'loss/train': 1.325749158859253} 11/07/2021 16:06:22 - INFO - __main__ - Step 134254: {'lr': 1.3836764428923287e-05, 'samples': 25776768, 'steps': 134253, 'loss/train': 1.330554723739624} 11/07/2021 16:06:22 - INFO - __main__ - Step 134255: {'lr': 1.3835023491282823e-05, 'samples': 25776960, 'steps': 134254, 'loss/train': 0.6147536635398865} 11/07/2021 16:06:23 - INFO - __main__ - Step 134256: {'lr': 1.3833282660054652e-05, 'samples': 25777152, 'steps': 134255, 'loss/train': 0.6431734561920166} 11/07/2021 16:06:23 - INFO - __main__ - Step 134257: {'lr': 1.3831541935239555e-05, 'samples': 25777344, 'steps': 134256, 'loss/train': 1.1430027484893799} 11/07/2021 16:06:24 - INFO - __main__ - Step 134258: {'lr': 1.3829801316838309e-05, 'samples': 25777536, 'steps': 134257, 'loss/train': 1.1301077604293823} 11/07/2021 16:06:25 - INFO - __main__ - Step 134259: {'lr': 1.3828060804851716e-05, 'samples': 25777728, 'steps': 134258, 'loss/train': 1.548638105392456} 11/07/2021 16:06:25 - INFO - __main__ - Step 134260: {'lr': 1.3826320399280557e-05, 'samples': 25777920, 'steps': 134259, 'loss/train': 1.2455785274505615} 11/07/2021 16:06:25 - INFO - __main__ - Step 134261: {'lr': 1.3824580100125605e-05, 'samples': 25778112, 'steps': 134260, 'loss/train': 1.189034342765808} 11/07/2021 16:06:26 - INFO - __main__ - Step 134262: {'lr': 1.382283990738767e-05, 'samples': 25778304, 'steps': 134261, 'loss/train': 1.5744414329528809} 11/07/2021 16:06:27 - INFO - __main__ - Step 134263: {'lr': 1.3821099821067496e-05, 'samples': 25778496, 'steps': 134262, 'loss/train': 1.3212860822677612} 11/07/2021 16:06:27 - INFO - __main__ - Step 134264: {'lr': 1.3819359841165946e-05, 'samples': 25778688, 'steps': 134263, 'loss/train': 0.46960726380348206} 11/07/2021 16:06:27 - INFO - __main__ - Step 134265: {'lr': 1.3817619967683714e-05, 'samples': 25778880, 'steps': 134264, 'loss/train': 1.0662161111831665} 11/07/2021 16:06:28 - INFO - __main__ - Step 134266: {'lr': 1.3815880200621605e-05, 'samples': 25779072, 'steps': 134265, 'loss/train': 1.1834871768951416} 11/07/2021 16:06:28 - INFO - __main__ - Step 134267: {'lr': 1.3814140539980424e-05, 'samples': 25779264, 'steps': 134266, 'loss/train': 1.1642276048660278} 11/07/2021 16:06:29 - INFO - __main__ - Step 134268: {'lr': 1.3812400985760947e-05, 'samples': 25779456, 'steps': 134267, 'loss/train': 1.0182431936264038} 11/07/2021 16:06:30 - INFO - __main__ - Step 134269: {'lr': 1.3810661537963953e-05, 'samples': 25779648, 'steps': 134268, 'loss/train': 1.49028480052948} 11/07/2021 16:06:30 - INFO - __main__ - Step 134270: {'lr': 1.3808922196590217e-05, 'samples': 25779840, 'steps': 134269, 'loss/train': 1.2802412509918213} 11/07/2021 16:06:30 - INFO - __main__ - Step 134271: {'lr': 1.3807182961640574e-05, 'samples': 25780032, 'steps': 134270, 'loss/train': 1.5487068891525269} 11/07/2021 16:06:31 - INFO - __main__ - Step 134272: {'lr': 1.3805443833115745e-05, 'samples': 25780224, 'steps': 134271, 'loss/train': 1.3313484191894531} 11/07/2021 16:06:31 - INFO - __main__ - Step 134273: {'lr': 1.3803704811016533e-05, 'samples': 25780416, 'steps': 134272, 'loss/train': 1.3978474140167236} 11/07/2021 16:06:32 - INFO - __main__ - Step 134274: {'lr': 1.3801965895343716e-05, 'samples': 25780608, 'steps': 134273, 'loss/train': 1.10469651222229} 11/07/2021 16:06:32 - INFO - __main__ - Step 134275: {'lr': 1.3800227086098127e-05, 'samples': 25780800, 'steps': 134274, 'loss/train': 0.980184018611908} 11/07/2021 16:06:33 - INFO - __main__ - Step 134276: {'lr': 1.3798488383280488e-05, 'samples': 25780992, 'steps': 134275, 'loss/train': 0.5234159827232361} 11/07/2021 16:06:33 - INFO - __main__ - Step 134277: {'lr': 1.3796749786891604e-05, 'samples': 25781184, 'steps': 134276, 'loss/train': 0.055887509137392044} 11/07/2021 16:06:33 - INFO - __main__ - Step 134278: {'lr': 1.3795011296932281e-05, 'samples': 25781376, 'steps': 134277, 'loss/train': 1.2028955221176147} 11/07/2021 16:06:35 - INFO - __main__ - Step 134279: {'lr': 1.3793272913403265e-05, 'samples': 25781568, 'steps': 134278, 'loss/train': 1.365314245223999} 11/07/2021 16:06:35 - INFO - __main__ - Step 134280: {'lr': 1.3791534636305365e-05, 'samples': 25781760, 'steps': 134279, 'loss/train': 1.4922760725021362} 11/07/2021 16:06:36 - INFO - __main__ - Step 134281: {'lr': 1.3789796465639326e-05, 'samples': 25781952, 'steps': 134280, 'loss/train': 1.4978196620941162} 11/07/2021 16:06:36 - INFO - __main__ - Step 134282: {'lr': 1.3788058401405984e-05, 'samples': 25782144, 'steps': 134281, 'loss/train': 1.6482322216033936} 11/07/2021 16:06:36 - INFO - __main__ - Step 134283: {'lr': 1.3786320443606088e-05, 'samples': 25782336, 'steps': 134282, 'loss/train': 1.6833064556121826} 11/07/2021 16:06:37 - INFO - __main__ - Step 134284: {'lr': 1.3784582592240442e-05, 'samples': 25782528, 'steps': 134283, 'loss/train': 1.3654587268829346} 11/07/2021 16:06:38 - INFO - __main__ - Step 134285: {'lr': 1.3782844847309795e-05, 'samples': 25782720, 'steps': 134284, 'loss/train': 1.249522089958191} 11/07/2021 16:06:38 - INFO - __main__ - Step 134286: {'lr': 1.378110720881498e-05, 'samples': 25782912, 'steps': 134285, 'loss/train': 1.2739033699035645} 11/07/2021 16:06:38 - INFO - __main__ - Step 134287: {'lr': 1.3779369676756747e-05, 'samples': 25783104, 'steps': 134286, 'loss/train': 1.2561349868774414} 11/07/2021 16:06:39 - INFO - __main__ - Step 134288: {'lr': 1.3777632251135874e-05, 'samples': 25783296, 'steps': 134287, 'loss/train': 1.5555905103683472} 11/07/2021 16:06:39 - INFO - __main__ - Step 134289: {'lr': 1.3775894931953165e-05, 'samples': 25783488, 'steps': 134288, 'loss/train': 1.312331199645996} 11/07/2021 16:06:40 - INFO - __main__ - Step 134290: {'lr': 1.3774157719209369e-05, 'samples': 25783680, 'steps': 134289, 'loss/train': 0.7096983194351196} 11/07/2021 16:06:40 - INFO - __main__ - Step 134291: {'lr': 1.3772420612905345e-05, 'samples': 25783872, 'steps': 134290, 'loss/train': 0.8894631862640381} 11/07/2021 16:06:41 - INFO - __main__ - Step 134292: {'lr': 1.377068361304179e-05, 'samples': 25784064, 'steps': 134291, 'loss/train': 1.4660614728927612} 11/07/2021 16:06:41 - INFO - __main__ - Step 134293: {'lr': 1.3768946719619507e-05, 'samples': 25784256, 'steps': 134292, 'loss/train': 1.3300378322601318} 11/07/2021 16:06:41 - INFO - __main__ - Step 134294: {'lr': 1.3767209932639302e-05, 'samples': 25784448, 'steps': 134293, 'loss/train': 0.9686667919158936} 11/07/2021 16:06:43 - INFO - __main__ - Step 134295: {'lr': 1.376547325210195e-05, 'samples': 25784640, 'steps': 134294, 'loss/train': 1.3545525074005127} 11/07/2021 16:06:43 - INFO - __main__ - Step 134296: {'lr': 1.3763736678008233e-05, 'samples': 25784832, 'steps': 134295, 'loss/train': 0.767326295375824} 11/07/2021 16:06:44 - INFO - __main__ - Step 134297: {'lr': 1.3762000210358921e-05, 'samples': 25785024, 'steps': 134296, 'loss/train': 0.13027602434158325} 11/07/2021 16:06:44 - INFO - __main__ - Step 134298: {'lr': 1.3760263849154826e-05, 'samples': 25785216, 'steps': 134297, 'loss/train': 1.2238242626190186} 11/07/2021 16:06:44 - INFO - __main__ - Step 134299: {'lr': 1.3758527594396691e-05, 'samples': 25785408, 'steps': 134298, 'loss/train': 1.2449617385864258} 11/07/2021 16:06:45 - INFO - __main__ - Step 134300: {'lr': 1.3756791446085327e-05, 'samples': 25785600, 'steps': 134299, 'loss/train': 1.1023290157318115} 11/07/2021 16:06:46 - INFO - __main__ - Step 134301: {'lr': 1.3755055404221505e-05, 'samples': 25785792, 'steps': 134300, 'loss/train': 1.4835680723190308} 11/07/2021 16:06:46 - INFO - __main__ - Step 134302: {'lr': 1.3753319468806036e-05, 'samples': 25785984, 'steps': 134301, 'loss/train': 1.5604822635650635} 11/07/2021 16:06:46 - INFO - __main__ - Step 134303: {'lr': 1.3751583639839638e-05, 'samples': 25786176, 'steps': 134302, 'loss/train': 1.5218745470046997} 11/07/2021 16:06:47 - INFO - __main__ - Step 134304: {'lr': 1.3749847917323143e-05, 'samples': 25786368, 'steps': 134303, 'loss/train': 0.9788736701011658} 11/07/2021 16:06:47 - INFO - __main__ - Step 134305: {'lr': 1.3748112301257331e-05, 'samples': 25786560, 'steps': 134304, 'loss/train': 0.9913754463195801} 11/07/2021 16:06:48 - INFO - __main__ - Step 134306: {'lr': 1.3746376791642951e-05, 'samples': 25786752, 'steps': 134305, 'loss/train': 1.3919991254806519} 11/07/2021 16:06:49 - INFO - __main__ - Step 134307: {'lr': 1.3744641388480805e-05, 'samples': 25786944, 'steps': 134306, 'loss/train': 0.37480172514915466} 11/07/2021 16:06:49 - INFO - __main__ - Step 134308: {'lr': 1.3742906091771702e-05, 'samples': 25787136, 'steps': 134307, 'loss/train': 1.4050638675689697} 11/07/2021 16:06:49 - INFO - __main__ - Step 134309: {'lr': 1.3741170901516386e-05, 'samples': 25787328, 'steps': 134308, 'loss/train': 0.9628568887710571} 11/07/2021 16:06:50 - INFO - __main__ - Step 134310: {'lr': 1.3739435817715668e-05, 'samples': 25787520, 'steps': 134309, 'loss/train': 0.8390266299247742} 11/07/2021 16:06:51 - INFO - __main__ - Step 134311: {'lr': 1.3737700840370293e-05, 'samples': 25787712, 'steps': 134310, 'loss/train': 1.473936676979065} 11/07/2021 16:06:51 - INFO - __main__ - Step 134312: {'lr': 1.3735965969481095e-05, 'samples': 25787904, 'steps': 134311, 'loss/train': 1.1823360919952393} 11/07/2021 16:06:51 - INFO - __main__ - Step 134313: {'lr': 1.3734231205048826e-05, 'samples': 25788096, 'steps': 134312, 'loss/train': 1.5516674518585205} 11/07/2021 16:06:52 - INFO - __main__ - Step 134314: {'lr': 1.373249654707423e-05, 'samples': 25788288, 'steps': 134313, 'loss/train': 1.3552628755569458} 11/07/2021 16:06:52 - INFO - __main__ - Step 134315: {'lr': 1.3730761995558144e-05, 'samples': 25788480, 'steps': 134314, 'loss/train': 1.262979507446289} 11/07/2021 16:06:53 - INFO - __main__ - Step 134316: {'lr': 1.3729027550501316e-05, 'samples': 25788672, 'steps': 134315, 'loss/train': 0.857634961605072} 11/07/2021 16:06:54 - INFO - __main__ - Step 134317: {'lr': 1.372729321190455e-05, 'samples': 25788864, 'steps': 134316, 'loss/train': 1.0529797077178955} 11/07/2021 16:06:54 - INFO - __main__ - Step 134318: {'lr': 1.3725558979768627e-05, 'samples': 25789056, 'steps': 134317, 'loss/train': 1.5150299072265625} 11/07/2021 16:06:54 - INFO - __main__ - Step 134319: {'lr': 1.3723824854094318e-05, 'samples': 25789248, 'steps': 134318, 'loss/train': 1.1842316389083862} 11/07/2021 16:06:55 - INFO - __main__ - Step 134320: {'lr': 1.3722090834882407e-05, 'samples': 25789440, 'steps': 134319, 'loss/train': 1.2045668363571167} 11/07/2021 16:06:55 - INFO - __main__ - Step 134321: {'lr': 1.3720356922133665e-05, 'samples': 25789632, 'steps': 134320, 'loss/train': 1.3782382011413574} 11/07/2021 16:06:56 - INFO - __main__ - Step 134322: {'lr': 1.37186231158489e-05, 'samples': 25789824, 'steps': 134321, 'loss/train': 0.22522366046905518} 11/07/2021 16:06:56 - INFO - __main__ - Step 134323: {'lr': 1.3716889416028916e-05, 'samples': 25790016, 'steps': 134322, 'loss/train': 1.5355504751205444} 11/07/2021 16:06:57 - INFO - __main__ - Step 134324: {'lr': 1.3715155822674408e-05, 'samples': 25790208, 'steps': 134323, 'loss/train': 0.9631935358047485} 11/07/2021 16:06:57 - INFO - __main__ - Step 134325: {'lr': 1.3713422335786207e-05, 'samples': 25790400, 'steps': 134324, 'loss/train': 1.4877132177352905} 11/07/2021 16:06:57 - INFO - __main__ - Step 134326: {'lr': 1.3711688955365092e-05, 'samples': 25790592, 'steps': 134325, 'loss/train': 1.1575790643692017} 11/07/2021 16:06:59 - INFO - __main__ - Step 134327: {'lr': 1.370995568141184e-05, 'samples': 25790784, 'steps': 134326, 'loss/train': 2.2085788249969482} 11/07/2021 16:06:59 - INFO - __main__ - Step 134328: {'lr': 1.3708222513927226e-05, 'samples': 25790976, 'steps': 134327, 'loss/train': 1.0828415155410767} 11/07/2021 16:06:59 - INFO - __main__ - Step 134329: {'lr': 1.3706489452912057e-05, 'samples': 25791168, 'steps': 134328, 'loss/train': 0.7296547293663025} 11/07/2021 16:07:00 - INFO - __main__ - Step 134330: {'lr': 1.3704756498367111e-05, 'samples': 25791360, 'steps': 134329, 'loss/train': 0.9023435115814209} 11/07/2021 16:07:00 - INFO - __main__ - Step 134331: {'lr': 1.3703023650293133e-05, 'samples': 25791552, 'steps': 134330, 'loss/train': 0.7953723669052124} 11/07/2021 16:07:01 - INFO - __main__ - Step 134332: {'lr': 1.3701290908690933e-05, 'samples': 25791744, 'steps': 134331, 'loss/train': 0.7975114583969116} 11/07/2021 16:07:01 - INFO - __main__ - Step 134333: {'lr': 1.3699558273561286e-05, 'samples': 25791936, 'steps': 134332, 'loss/train': 1.2769728899002075} 11/07/2021 16:07:02 - INFO - __main__ - Step 134334: {'lr': 1.3697825744904995e-05, 'samples': 25792128, 'steps': 134333, 'loss/train': 1.244446039199829} 11/07/2021 16:07:02 - INFO - __main__ - Step 134335: {'lr': 1.3696093322722785e-05, 'samples': 25792320, 'steps': 134334, 'loss/train': 1.6032077074050903} 11/07/2021 16:07:02 - INFO - __main__ - Step 134336: {'lr': 1.3694361007015488e-05, 'samples': 25792512, 'steps': 134335, 'loss/train': 1.2413671016693115} 11/07/2021 16:07:04 - INFO - __main__ - Step 134337: {'lr': 1.3692628797783851e-05, 'samples': 25792704, 'steps': 134336, 'loss/train': 1.6150832176208496} 11/07/2021 16:07:04 - INFO - __main__ - Step 134338: {'lr': 1.3690896695028682e-05, 'samples': 25792896, 'steps': 134337, 'loss/train': 1.241702675819397} 11/07/2021 16:07:04 - INFO - __main__ - Step 134339: {'lr': 1.3689164698750728e-05, 'samples': 25793088, 'steps': 134338, 'loss/train': 1.478219747543335} 11/07/2021 16:07:05 - INFO - __main__ - Step 134340: {'lr': 1.3687432808950795e-05, 'samples': 25793280, 'steps': 134339, 'loss/train': 1.2682172060012817} 11/07/2021 16:07:05 - INFO - __main__ - Step 134341: {'lr': 1.368570102562966e-05, 'samples': 25793472, 'steps': 134340, 'loss/train': 1.536264181137085} 11/07/2021 16:07:05 - INFO - __main__ - Step 134342: {'lr': 1.3683969348788129e-05, 'samples': 25793664, 'steps': 134341, 'loss/train': 1.4969033002853394} 11/07/2021 16:07:06 - INFO - __main__ - Step 134343: {'lr': 1.368223777842692e-05, 'samples': 25793856, 'steps': 134342, 'loss/train': 1.4668316841125488} 11/07/2021 16:07:07 - INFO - __main__ - Step 134344: {'lr': 1.368050631454687e-05, 'samples': 25794048, 'steps': 134343, 'loss/train': 0.47401273250579834} 11/07/2021 16:07:07 - INFO - __main__ - Step 134345: {'lr': 1.3678774957148754e-05, 'samples': 25794240, 'steps': 134344, 'loss/train': 1.011567234992981} 11/07/2021 16:07:07 - INFO - __main__ - Step 134346: {'lr': 1.3677043706233294e-05, 'samples': 25794432, 'steps': 134345, 'loss/train': 1.383966326713562} 11/07/2021 16:07:08 - INFO - __main__ - Step 134347: {'lr': 1.3675312561801323e-05, 'samples': 25794624, 'steps': 134346, 'loss/train': 1.3400862216949463} 11/07/2021 16:07:09 - INFO - __main__ - Step 134348: {'lr': 1.3673581523853618e-05, 'samples': 25794816, 'steps': 134347, 'loss/train': 1.3477343320846558} 11/07/2021 16:07:09 - INFO - __main__ - Step 134349: {'lr': 1.3671850592390956e-05, 'samples': 25795008, 'steps': 134348, 'loss/train': 1.2610868215560913} 11/07/2021 16:07:10 - INFO - __main__ - Step 134350: {'lr': 1.3670119767414085e-05, 'samples': 25795200, 'steps': 134349, 'loss/train': 1.3963665962219238} 11/07/2021 16:07:10 - INFO - __main__ - Step 134351: {'lr': 1.366838904892384e-05, 'samples': 25795392, 'steps': 134350, 'loss/train': 1.0066086053848267} 11/07/2021 16:07:10 - INFO - __main__ - Step 134352: {'lr': 1.3666658436920942e-05, 'samples': 25795584, 'steps': 134351, 'loss/train': 1.1827280521392822} 11/07/2021 16:07:11 - INFO - __main__ - Step 134353: {'lr': 1.3664927931406223e-05, 'samples': 25795776, 'steps': 134352, 'loss/train': 1.2492059469223022} 11/07/2021 16:07:12 - INFO - __main__ - Step 134354: {'lr': 1.3663197532380433e-05, 'samples': 25795968, 'steps': 134353, 'loss/train': 1.4198689460754395} 11/07/2021 16:07:12 - INFO - __main__ - Step 134355: {'lr': 1.366146723984435e-05, 'samples': 25796160, 'steps': 134354, 'loss/train': 1.1872243881225586} 11/07/2021 16:07:12 - INFO - __main__ - Step 134356: {'lr': 1.3659737053798776e-05, 'samples': 25796352, 'steps': 134355, 'loss/train': 1.2877249717712402} 11/07/2021 16:07:13 - INFO - __main__ - Step 134357: {'lr': 1.365800697424449e-05, 'samples': 25796544, 'steps': 134356, 'loss/train': 1.0844829082489014} 11/07/2021 16:07:14 - INFO - __main__ - Step 134358: {'lr': 1.3656277001182243e-05, 'samples': 25796736, 'steps': 134357, 'loss/train': 0.1754869669675827} 11/07/2021 16:07:14 - INFO - __main__ - Step 134359: {'lr': 1.3654547134612866e-05, 'samples': 25796928, 'steps': 134358, 'loss/train': 1.5066173076629639} 11/07/2021 16:07:15 - INFO - __main__ - Step 134360: {'lr': 1.3652817374537052e-05, 'samples': 25797120, 'steps': 134359, 'loss/train': 0.8651261329650879} 11/07/2021 16:07:15 - INFO - __main__ - Step 134361: {'lr': 1.3651087720955662e-05, 'samples': 25797312, 'steps': 134360, 'loss/train': 1.5914784669876099} 11/07/2021 16:07:15 - INFO - __main__ - Step 134362: {'lr': 1.364935817386942e-05, 'samples': 25797504, 'steps': 134361, 'loss/train': 1.7521063089370728} 11/07/2021 16:07:16 - INFO - __main__ - Step 134363: {'lr': 1.3647628733279154e-05, 'samples': 25797696, 'steps': 134362, 'loss/train': 1.0446662902832031} 11/07/2021 16:07:17 - INFO - __main__ - Step 134364: {'lr': 1.3645899399185591e-05, 'samples': 25797888, 'steps': 134363, 'loss/train': 1.127249002456665} 11/07/2021 16:07:17 - INFO - __main__ - Step 134365: {'lr': 1.364417017158956e-05, 'samples': 25798080, 'steps': 134364, 'loss/train': 1.9617936611175537} 11/07/2021 16:07:17 - INFO - __main__ - Step 134366: {'lr': 1.364244105049181e-05, 'samples': 25798272, 'steps': 134365, 'loss/train': 1.5970680713653564} 11/07/2021 16:07:18 - INFO - __main__ - Step 134367: {'lr': 1.3640712035893149e-05, 'samples': 25798464, 'steps': 134366, 'loss/train': 1.212058186531067} 11/07/2021 16:07:18 - INFO - __main__ - Step 134368: {'lr': 1.3638983127794296e-05, 'samples': 25798656, 'steps': 134367, 'loss/train': 1.028299331665039} 11/07/2021 16:07:19 - INFO - __main__ - Step 134369: {'lr': 1.3637254326196113e-05, 'samples': 25798848, 'steps': 134368, 'loss/train': 1.9403700828552246} 11/07/2021 16:07:20 - INFO - __main__ - Step 134370: {'lr': 1.3635525631099294e-05, 'samples': 25799040, 'steps': 134369, 'loss/train': 1.356067419052124} 11/07/2021 16:07:20 - INFO - __main__ - Step 134371: {'lr': 1.3633797042504698e-05, 'samples': 25799232, 'steps': 134370, 'loss/train': 1.2177388668060303} 11/07/2021 16:07:20 - INFO - __main__ - Step 134372: {'lr': 1.3632068560413075e-05, 'samples': 25799424, 'steps': 134371, 'loss/train': 1.280942678451538} 11/07/2021 16:07:21 - INFO - __main__ - Step 134373: {'lr': 1.3630340184825174e-05, 'samples': 25799616, 'steps': 134372, 'loss/train': 1.5550084114074707} 11/07/2021 16:07:22 - INFO - __main__ - Step 134374: {'lr': 1.3628611915741773e-05, 'samples': 25799808, 'steps': 134373, 'loss/train': 0.407035768032074} 11/07/2021 16:07:22 - INFO - __main__ - Step 134375: {'lr': 1.3626883753163704e-05, 'samples': 25800000, 'steps': 134374, 'loss/train': 1.1403039693832397} 11/07/2021 16:07:22 - INFO - __main__ - Step 134376: {'lr': 1.362515569709169e-05, 'samples': 25800192, 'steps': 134375, 'loss/train': 1.197690725326538} 11/07/2021 16:07:23 - INFO - __main__ - Step 134377: {'lr': 1.3623427747526535e-05, 'samples': 25800384, 'steps': 134376, 'loss/train': 0.9390093684196472} 11/07/2021 16:07:23 - INFO - __main__ - Step 134378: {'lr': 1.3621699904469043e-05, 'samples': 25800576, 'steps': 134377, 'loss/train': 1.3249197006225586} 11/07/2021 16:07:24 - INFO - __main__ - Step 134379: {'lr': 1.3619972167919937e-05, 'samples': 25800768, 'steps': 134378, 'loss/train': 1.561358094215393} 11/07/2021 16:07:25 - INFO - __main__ - Step 134380: {'lr': 1.361824453788002e-05, 'samples': 25800960, 'steps': 134379, 'loss/train': 1.5322951078414917} 11/07/2021 16:07:25 - INFO - __main__ - Step 134381: {'lr': 1.3616517014350099e-05, 'samples': 25801152, 'steps': 134380, 'loss/train': 1.2324315309524536} 11/07/2021 16:07:25 - INFO - __main__ - Step 134382: {'lr': 1.3614789597330896e-05, 'samples': 25801344, 'steps': 134381, 'loss/train': 2.179018259048462} 11/07/2021 16:07:26 - INFO - __main__ - Step 134383: {'lr': 1.3613062286823241e-05, 'samples': 25801536, 'steps': 134382, 'loss/train': 1.7597310543060303} 11/07/2021 16:07:27 - INFO - __main__ - Step 134384: {'lr': 1.3611335082827886e-05, 'samples': 25801728, 'steps': 134383, 'loss/train': 1.2573670148849487} 11/07/2021 16:07:27 - INFO - __main__ - Step 134385: {'lr': 1.3609607985345662e-05, 'samples': 25801920, 'steps': 134384, 'loss/train': 1.0185691118240356} 11/07/2021 16:07:27 - INFO - __main__ - Step 134386: {'lr': 1.3607880994377263e-05, 'samples': 25802112, 'steps': 134385, 'loss/train': 1.1304659843444824} 11/07/2021 16:07:28 - INFO - __main__ - Step 134387: {'lr': 1.3606154109923497e-05, 'samples': 25802304, 'steps': 134386, 'loss/train': 1.1776812076568604} 11/07/2021 16:07:28 - INFO - __main__ - Step 134388: {'lr': 1.3604427331985164e-05, 'samples': 25802496, 'steps': 134387, 'loss/train': 1.3639956712722778} 11/07/2021 16:07:29 - INFO - __main__ - Step 134389: {'lr': 1.3602700660563017e-05, 'samples': 25802688, 'steps': 134388, 'loss/train': 1.34427011013031} 11/07/2021 16:07:29 - INFO - __main__ - Step 134390: {'lr': 1.3600974095657858e-05, 'samples': 25802880, 'steps': 134389, 'loss/train': 1.5265544652938843} 11/07/2021 16:07:30 - INFO - __main__ - Step 134391: {'lr': 1.3599247637270439e-05, 'samples': 25803072, 'steps': 134390, 'loss/train': 1.156206488609314} 11/07/2021 16:07:30 - INFO - __main__ - Step 134392: {'lr': 1.3597521285401537e-05, 'samples': 25803264, 'steps': 134391, 'loss/train': 1.4195395708084106} 11/07/2021 16:07:30 - INFO - __main__ - Step 134393: {'lr': 1.3595795040051955e-05, 'samples': 25803456, 'steps': 134392, 'loss/train': 1.2836912870407104} 11/07/2021 16:07:31 - INFO - __main__ - Step 134394: {'lr': 1.3594068901222473e-05, 'samples': 25803648, 'steps': 134393, 'loss/train': 1.3174749612808228} 11/07/2021 16:07:32 - INFO - __main__ - Step 134395: {'lr': 1.3592342868913865e-05, 'samples': 25803840, 'steps': 134394, 'loss/train': 0.7522329092025757} 11/07/2021 16:07:32 - INFO - __main__ - Step 134396: {'lr': 1.3590616943126882e-05, 'samples': 25804032, 'steps': 134395, 'loss/train': 1.249650478363037} 11/07/2021 16:07:33 - INFO - __main__ - Step 134397: {'lr': 1.3588891123862302e-05, 'samples': 25804224, 'steps': 134396, 'loss/train': 1.2689136266708374} 11/07/2021 16:07:33 - INFO - __main__ - Step 134398: {'lr': 1.3587165411120928e-05, 'samples': 25804416, 'steps': 134397, 'loss/train': 1.2883706092834473} 11/07/2021 16:07:34 - INFO - __main__ - Step 134399: {'lr': 1.3585439804903594e-05, 'samples': 25804608, 'steps': 134398, 'loss/train': 1.1746445894241333} 11/07/2021 16:07:34 - INFO - __main__ - Step 134400: {'lr': 1.3583714305210936e-05, 'samples': 25804800, 'steps': 134399, 'loss/train': 1.7446383237838745} 11/07/2021 16:07:35 - INFO - __main__ - Step 134401: {'lr': 1.3581988912043846e-05, 'samples': 25804992, 'steps': 134400, 'loss/train': 1.1184062957763672} 11/07/2021 16:07:35 - INFO - __main__ - Step 134402: {'lr': 1.3580263625403044e-05, 'samples': 25805184, 'steps': 134401, 'loss/train': 1.4204827547073364} 11/07/2021 16:07:35 - INFO - __main__ - Step 134403: {'lr': 1.3578538445289335e-05, 'samples': 25805376, 'steps': 134402, 'loss/train': 0.7134221792221069} 11/07/2021 16:07:36 - INFO - __main__ - Step 134404: {'lr': 1.3576813371703467e-05, 'samples': 25805568, 'steps': 134403, 'loss/train': 1.3960659503936768} 11/07/2021 16:07:37 - INFO - __main__ - Step 134405: {'lr': 1.3575088404646274e-05, 'samples': 25805760, 'steps': 134404, 'loss/train': 1.331763744354248} 11/07/2021 16:07:37 - INFO - __main__ - Step 134406: {'lr': 1.3573363544118478e-05, 'samples': 25805952, 'steps': 134405, 'loss/train': 0.9116047024726868} 11/07/2021 16:07:37 - INFO - __main__ - Step 134407: {'lr': 1.3571638790120856e-05, 'samples': 25806144, 'steps': 134406, 'loss/train': 1.497756838798523} 11/07/2021 16:07:38 - INFO - __main__ - Step 134408: {'lr': 1.3569914142654238e-05, 'samples': 25806336, 'steps': 134407, 'loss/train': 1.4381980895996094} 11/07/2021 16:07:38 - INFO - __main__ - Step 134409: {'lr': 1.356818960171935e-05, 'samples': 25806528, 'steps': 134408, 'loss/train': 1.178903579711914} 11/07/2021 16:07:39 - INFO - __main__ - Step 134410: {'lr': 1.3566465167316994e-05, 'samples': 25806720, 'steps': 134409, 'loss/train': 1.4703165292739868} 11/07/2021 16:07:40 - INFO - __main__ - Step 134411: {'lr': 1.3564740839447947e-05, 'samples': 25806912, 'steps': 134410, 'loss/train': 0.644037663936615} 11/07/2021 16:07:40 - INFO - __main__ - Step 134412: {'lr': 1.3563016618113017e-05, 'samples': 25807104, 'steps': 134411, 'loss/train': 1.3532183170318604} 11/07/2021 16:07:40 - INFO - __main__ - Step 134413: {'lr': 1.3561292503312894e-05, 'samples': 25807296, 'steps': 134412, 'loss/train': 1.141481637954712} 11/07/2021 16:07:41 - INFO - __main__ - Step 134414: {'lr': 1.3559568495048413e-05, 'samples': 25807488, 'steps': 134413, 'loss/train': 1.7426711320877075} 11/07/2021 16:07:42 - INFO - __main__ - Step 134415: {'lr': 1.3557844593320323e-05, 'samples': 25807680, 'steps': 134414, 'loss/train': 0.5262126326560974} 11/07/2021 16:07:42 - INFO - __main__ - Step 134416: {'lr': 1.3556120798129429e-05, 'samples': 25807872, 'steps': 134415, 'loss/train': 0.923084020614624} 11/07/2021 16:07:42 - INFO - __main__ - Step 134417: {'lr': 1.3554397109476507e-05, 'samples': 25808064, 'steps': 134416, 'loss/train': 1.2475136518478394} 11/07/2021 16:07:43 - INFO - __main__ - Step 134418: {'lr': 1.3552673527362336e-05, 'samples': 25808256, 'steps': 134417, 'loss/train': 1.4231395721435547} 11/07/2021 16:07:43 - INFO - __main__ - Step 134419: {'lr': 1.3550950051787663e-05, 'samples': 25808448, 'steps': 134418, 'loss/train': 1.3735231161117554} 11/07/2021 16:07:44 - INFO - __main__ - Step 134420: {'lr': 1.3549226682753296e-05, 'samples': 25808640, 'steps': 134419, 'loss/train': 1.101194143295288} 11/07/2021 16:07:45 - INFO - __main__ - Step 134421: {'lr': 1.3547503420259983e-05, 'samples': 25808832, 'steps': 134420, 'loss/train': 1.3492684364318848} 11/07/2021 16:07:45 - INFO - __main__ - Step 134422: {'lr': 1.3545780264308527e-05, 'samples': 25809024, 'steps': 134421, 'loss/train': 1.511507511138916} 11/07/2021 16:07:45 - INFO - __main__ - Step 134423: {'lr': 1.3544057214899679e-05, 'samples': 25809216, 'steps': 134422, 'loss/train': 1.5213645696640015} 11/07/2021 16:07:46 - INFO - __main__ - Step 134424: {'lr': 1.3542334272034245e-05, 'samples': 25809408, 'steps': 134423, 'loss/train': 1.146377682685852} 11/07/2021 16:07:47 - INFO - __main__ - Step 134425: {'lr': 1.3540611435712974e-05, 'samples': 25809600, 'steps': 134424, 'loss/train': 1.3293349742889404} 11/07/2021 16:07:47 - INFO - __main__ - Step 134426: {'lr': 1.3538888705936697e-05, 'samples': 25809792, 'steps': 134425, 'loss/train': 0.9705086946487427} 11/07/2021 16:07:47 - INFO - __main__ - Step 134427: {'lr': 1.3537166082706137e-05, 'samples': 25809984, 'steps': 134426, 'loss/train': 1.3927664756774902} 11/07/2021 16:07:48 - INFO - __main__ - Step 134428: {'lr': 1.3535443566022044e-05, 'samples': 25810176, 'steps': 134427, 'loss/train': 1.3698339462280273} 11/07/2021 16:07:48 - INFO - __main__ - Step 134429: {'lr': 1.3533721155885248e-05, 'samples': 25810368, 'steps': 134428, 'loss/train': 1.3237004280090332} 11/07/2021 16:07:48 - INFO - __main__ - Step 134430: {'lr': 1.3531998852296501e-05, 'samples': 25810560, 'steps': 134429, 'loss/train': 1.0927594900131226} 11/07/2021 16:07:50 - INFO - __main__ - Step 134431: {'lr': 1.3530276655256607e-05, 'samples': 25810752, 'steps': 134430, 'loss/train': 1.6511142253875732} 11/07/2021 16:07:50 - INFO - __main__ - Step 134432: {'lr': 1.3528554564766287e-05, 'samples': 25810944, 'steps': 134431, 'loss/train': 1.1014066934585571} 11/07/2021 16:07:50 - INFO - __main__ - Step 134433: {'lr': 1.3526832580826375e-05, 'samples': 25811136, 'steps': 134432, 'loss/train': 1.230562686920166} 11/07/2021 16:07:51 - INFO - __main__ - Step 134434: {'lr': 1.3525110703437621e-05, 'samples': 25811328, 'steps': 134433, 'loss/train': 1.4445481300354004} 11/07/2021 16:07:51 - INFO - __main__ - Step 134435: {'lr': 1.35233889326008e-05, 'samples': 25811520, 'steps': 134434, 'loss/train': 1.3713217973709106} 11/07/2021 16:07:52 - INFO - __main__ - Step 134436: {'lr': 1.352166726831669e-05, 'samples': 25811712, 'steps': 134435, 'loss/train': 1.0798776149749756} 11/07/2021 16:07:53 - INFO - __main__ - Step 134437: {'lr': 1.3519945710586067e-05, 'samples': 25811904, 'steps': 134436, 'loss/train': 1.157713532447815} 11/07/2021 16:07:53 - INFO - __main__ - Step 134438: {'lr': 1.3518224259409711e-05, 'samples': 25812096, 'steps': 134437, 'loss/train': 1.1295424699783325} 11/07/2021 16:07:53 - INFO - __main__ - Step 134439: {'lr': 1.3516502914788426e-05, 'samples': 25812288, 'steps': 134438, 'loss/train': 0.9139367938041687} 11/07/2021 16:07:54 - INFO - __main__ - Step 134440: {'lr': 1.3514781676722932e-05, 'samples': 25812480, 'steps': 134439, 'loss/train': 1.353400468826294} 11/07/2021 16:07:55 - INFO - __main__ - Step 134441: {'lr': 1.3513060545214034e-05, 'samples': 25812672, 'steps': 134440, 'loss/train': 1.3120100498199463} 11/07/2021 16:07:55 - INFO - __main__ - Step 134442: {'lr': 1.3511339520262484e-05, 'samples': 25812864, 'steps': 134441, 'loss/train': 1.2791787385940552} 11/07/2021 16:07:56 - INFO - __main__ - Step 134443: {'lr': 1.3509618601869055e-05, 'samples': 25813056, 'steps': 134442, 'loss/train': 1.519295334815979} 11/07/2021 16:07:56 - INFO - __main__ - Step 134444: {'lr': 1.3507897790034584e-05, 'samples': 25813248, 'steps': 134443, 'loss/train': 1.397506594657898} 11/07/2021 16:07:56 - INFO - __main__ - Step 134445: {'lr': 1.350617708475979e-05, 'samples': 25813440, 'steps': 134444, 'loss/train': 0.9343209266662598} 11/07/2021 16:07:57 - INFO - __main__ - Step 134446: {'lr': 1.3504456486045453e-05, 'samples': 25813632, 'steps': 134445, 'loss/train': 1.7353154420852661} 11/07/2021 16:07:58 - INFO - __main__ - Step 134447: {'lr': 1.3502735993892373e-05, 'samples': 25813824, 'steps': 134446, 'loss/train': 1.7445651292800903} 11/07/2021 16:07:58 - INFO - __main__ - Step 134448: {'lr': 1.3501015608301303e-05, 'samples': 25814016, 'steps': 134447, 'loss/train': 1.6164578199386597} 11/07/2021 16:07:58 - INFO - __main__ - Step 134449: {'lr': 1.349929532927302e-05, 'samples': 25814208, 'steps': 134448, 'loss/train': 1.3322339057922363} 11/07/2021 16:07:59 - INFO - __main__ - Step 134450: {'lr': 1.34975751568083e-05, 'samples': 25814400, 'steps': 134449, 'loss/train': 1.1316413879394531} 11/07/2021 16:07:59 - INFO - __main__ - Step 134451: {'lr': 1.349585509090795e-05, 'samples': 25814592, 'steps': 134450, 'loss/train': 1.3131476640701294} 11/07/2021 16:08:00 - INFO - __main__ - Step 134452: {'lr': 1.3494135131572688e-05, 'samples': 25814784, 'steps': 134451, 'loss/train': 1.0995858907699585} 11/07/2021 16:08:01 - INFO - __main__ - Step 134453: {'lr': 1.3492415278803377e-05, 'samples': 25814976, 'steps': 134452, 'loss/train': 1.2861119508743286} 11/07/2021 16:08:01 - INFO - __main__ - Step 134454: {'lr': 1.3490695532600682e-05, 'samples': 25815168, 'steps': 134453, 'loss/train': 1.050310730934143} 11/07/2021 16:08:01 - INFO - __main__ - Step 134455: {'lr': 1.3488975892965439e-05, 'samples': 25815360, 'steps': 134454, 'loss/train': 1.3522922992706299} 11/07/2021 16:08:02 - INFO - __main__ - Step 134456: {'lr': 1.348725635989842e-05, 'samples': 25815552, 'steps': 134455, 'loss/train': 1.137355089187622} 11/07/2021 16:08:04 - INFO - __main__ - Step 134457: {'lr': 1.3485536933400377e-05, 'samples': 25815744, 'steps': 134456, 'loss/train': 1.3768757581710815} 11/07/2021 16:08:04 - INFO - __main__ - Step 134458: {'lr': 1.3483817613472116e-05, 'samples': 25815936, 'steps': 134457, 'loss/train': 1.3867418766021729} 11/07/2021 16:08:04 - INFO - __main__ - Step 134459: {'lr': 1.3482098400114384e-05, 'samples': 25816128, 'steps': 134458, 'loss/train': 1.0521960258483887} 11/07/2021 16:08:05 - INFO - __main__ - Step 134460: {'lr': 1.3480379293327987e-05, 'samples': 25816320, 'steps': 134459, 'loss/train': 1.745163917541504} 11/07/2021 16:08:05 - INFO - __main__ - Step 134461: {'lr': 1.3478660293113675e-05, 'samples': 25816512, 'steps': 134460, 'loss/train': 1.6962934732437134} 11/07/2021 16:08:05 - INFO - __main__ - Step 134462: {'lr': 1.3476941399472226e-05, 'samples': 25816704, 'steps': 134461, 'loss/train': 1.6715008020401} 11/07/2021 16:08:06 - INFO - __main__ - Step 134463: {'lr': 1.3475222612404414e-05, 'samples': 25816896, 'steps': 134462, 'loss/train': 1.2588329315185547} 11/07/2021 16:08:07 - INFO - __main__ - Step 134464: {'lr': 1.347350393191102e-05, 'samples': 25817088, 'steps': 134463, 'loss/train': 1.3621883392333984} 11/07/2021 16:08:07 - INFO - __main__ - Step 134465: {'lr': 1.3471785357992816e-05, 'samples': 25817280, 'steps': 134464, 'loss/train': 1.3994553089141846} 11/07/2021 16:08:07 - INFO - __main__ - Step 134466: {'lr': 1.3470066890650611e-05, 'samples': 25817472, 'steps': 134465, 'loss/train': 1.143204689025879} 11/07/2021 16:08:08 - INFO - __main__ - Step 134467: {'lr': 1.34683485298851e-05, 'samples': 25817664, 'steps': 134466, 'loss/train': 1.4627647399902344} 11/07/2021 16:08:08 - INFO - __main__ - Step 134468: {'lr': 1.3466630275697111e-05, 'samples': 25817856, 'steps': 134467, 'loss/train': 1.2854102849960327} 11/07/2021 16:08:09 - INFO - __main__ - Step 134469: {'lr': 1.3464912128087426e-05, 'samples': 25818048, 'steps': 134468, 'loss/train': 1.469225287437439} 11/07/2021 16:08:09 - INFO - __main__ - Step 134470: {'lr': 1.3463194087056763e-05, 'samples': 25818240, 'steps': 134469, 'loss/train': 1.1694674491882324} 11/07/2021 16:08:10 - INFO - __main__ - Step 134471: {'lr': 1.3461476152605956e-05, 'samples': 25818432, 'steps': 134470, 'loss/train': 1.6274285316467285} 11/07/2021 16:08:10 - INFO - __main__ - Step 134472: {'lr': 1.3459758324735755e-05, 'samples': 25818624, 'steps': 134471, 'loss/train': 1.000145673751831} 11/07/2021 16:08:11 - INFO - __main__ - Step 134473: {'lr': 1.3458040603446936e-05, 'samples': 25818816, 'steps': 134472, 'loss/train': 0.7146166563034058} 11/07/2021 16:08:12 - INFO - __main__ - Step 134474: {'lr': 1.3456322988740277e-05, 'samples': 25819008, 'steps': 134473, 'loss/train': 1.3233612775802612} 11/07/2021 16:08:12 - INFO - __main__ - Step 134475: {'lr': 1.3454605480616556e-05, 'samples': 25819200, 'steps': 134474, 'loss/train': 1.0887898206710815} 11/07/2021 16:08:12 - INFO - __main__ - Step 134476: {'lr': 1.345288807907652e-05, 'samples': 25819392, 'steps': 134475, 'loss/train': 1.4732905626296997} 11/07/2021 16:08:13 - INFO - __main__ - Step 134477: {'lr': 1.3451170784120975e-05, 'samples': 25819584, 'steps': 134476, 'loss/train': 0.8485139608383179} 11/07/2021 16:08:13 - INFO - __main__ - Step 134478: {'lr': 1.34494535957507e-05, 'samples': 25819776, 'steps': 134477, 'loss/train': 0.864989697933197} 11/07/2021 16:08:13 - INFO - __main__ - Step 134479: {'lr': 1.3447736513966413e-05, 'samples': 25819968, 'steps': 134478, 'loss/train': 1.2488983869552612} 11/07/2021 16:08:14 - INFO - __main__ - Step 134480: {'lr': 1.3446019538768977e-05, 'samples': 25820160, 'steps': 134479, 'loss/train': 1.554747462272644} 11/07/2021 16:08:15 - INFO - __main__ - Step 134481: {'lr': 1.3444302670159086e-05, 'samples': 25820352, 'steps': 134480, 'loss/train': 0.7482250928878784} 11/07/2021 16:08:15 - INFO - __main__ - Step 134482: {'lr': 1.3442585908137545e-05, 'samples': 25820544, 'steps': 134481, 'loss/train': 1.2736214399337769} 11/07/2021 16:08:15 - INFO - __main__ - Step 134483: {'lr': 1.3440869252705101e-05, 'samples': 25820736, 'steps': 134482, 'loss/train': 0.9015673398971558} 11/07/2021 16:08:16 - INFO - __main__ - Step 134484: {'lr': 1.3439152703862589e-05, 'samples': 25820928, 'steps': 134483, 'loss/train': 1.0589954853057861} 11/07/2021 16:08:17 - INFO - __main__ - Step 134485: {'lr': 1.3437436261610703e-05, 'samples': 25821120, 'steps': 134484, 'loss/train': 1.6121306419372559} 11/07/2021 16:08:17 - INFO - __main__ - Step 134486: {'lr': 1.3435719925950302e-05, 'samples': 25821312, 'steps': 134485, 'loss/train': 1.7221901416778564} 11/07/2021 16:08:18 - INFO - __main__ - Step 134487: {'lr': 1.343400369688208e-05, 'samples': 25821504, 'steps': 134486, 'loss/train': 1.2784814834594727} 11/07/2021 16:08:18 - INFO - __main__ - Step 134488: {'lr': 1.3432287574406843e-05, 'samples': 25821696, 'steps': 134487, 'loss/train': 1.274434208869934} 11/07/2021 16:08:18 - INFO - __main__ - Step 134489: {'lr': 1.3430571558525394e-05, 'samples': 25821888, 'steps': 134488, 'loss/train': 1.2758103609085083} 11/07/2021 16:08:19 - INFO - __main__ - Step 134490: {'lr': 1.3428855649238458e-05, 'samples': 25822080, 'steps': 134489, 'loss/train': 1.171980381011963} 11/07/2021 16:08:20 - INFO - __main__ - Step 134491: {'lr': 1.3427139846546837e-05, 'samples': 25822272, 'steps': 134490, 'loss/train': 1.601218819618225} 11/07/2021 16:08:20 - INFO - __main__ - Step 134492: {'lr': 1.342542415045131e-05, 'samples': 25822464, 'steps': 134491, 'loss/train': 1.434605360031128} 11/07/2021 16:08:20 - INFO - __main__ - Step 134493: {'lr': 1.3423708560952652e-05, 'samples': 25822656, 'steps': 134492, 'loss/train': 1.1314849853515625} 11/07/2021 16:08:21 - INFO - __main__ - Step 134494: {'lr': 1.3421993078051586e-05, 'samples': 25822848, 'steps': 134493, 'loss/train': 1.2218345403671265} 11/07/2021 16:08:22 - INFO - __main__ - Step 134495: {'lr': 1.3420277701748917e-05, 'samples': 25823040, 'steps': 134494, 'loss/train': 1.0213896036148071} 11/07/2021 16:08:22 - INFO - __main__ - Step 134496: {'lr': 1.3418562432045423e-05, 'samples': 25823232, 'steps': 134495, 'loss/train': 1.002561092376709} 11/07/2021 16:08:23 - INFO - __main__ - Step 134497: {'lr': 1.3416847268941879e-05, 'samples': 25823424, 'steps': 134496, 'loss/train': 1.461146593093872} 11/07/2021 16:08:23 - INFO - __main__ - Step 134498: {'lr': 1.3415132212439062e-05, 'samples': 25823616, 'steps': 134497, 'loss/train': 1.3491169214248657} 11/07/2021 16:08:23 - INFO - __main__ - Step 134499: {'lr': 1.3413417262537726e-05, 'samples': 25823808, 'steps': 134498, 'loss/train': 1.3393274545669556} 11/07/2021 16:08:24 - INFO - __main__ - Step 134500: {'lr': 1.3411702419238642e-05, 'samples': 25824000, 'steps': 134499, 'loss/train': 1.3525466918945312} 11/07/2021 16:08:26 - INFO - __main__ - Step 134501: {'lr': 1.3409987682542618e-05, 'samples': 25824192, 'steps': 134500, 'loss/train': 0.5197932720184326} 11/07/2021 16:08:26 - INFO - __main__ - Step 134502: {'lr': 1.3408273052450377e-05, 'samples': 25824384, 'steps': 134501, 'loss/train': 1.531556248664856} 11/07/2021 16:08:26 - INFO - __main__ - Step 134503: {'lr': 1.340655852896272e-05, 'samples': 25824576, 'steps': 134502, 'loss/train': 1.445380449295044} 11/07/2021 16:08:27 - INFO - __main__ - Step 134504: {'lr': 1.3404844112080427e-05, 'samples': 25824768, 'steps': 134503, 'loss/train': 1.1267513036727905} 11/07/2021 16:08:27 - INFO - __main__ - Step 134505: {'lr': 1.3403129801804276e-05, 'samples': 25824960, 'steps': 134504, 'loss/train': 1.0509649515151978} 11/07/2021 16:08:27 - INFO - __main__ - Step 134506: {'lr': 1.3401415598135041e-05, 'samples': 25825152, 'steps': 134505, 'loss/train': 0.1821962296962738} 11/07/2021 16:08:28 - INFO - __main__ - Step 134507: {'lr': 1.3399701501073447e-05, 'samples': 25825344, 'steps': 134506, 'loss/train': 0.14426958560943604} 11/07/2021 16:08:29 - INFO - __main__ - Step 134508: {'lr': 1.3397987510620296e-05, 'samples': 25825536, 'steps': 134507, 'loss/train': 0.9612802863121033} 11/07/2021 16:08:29 - INFO - __main__ - Step 134509: {'lr': 1.339627362677634e-05, 'samples': 25825728, 'steps': 134508, 'loss/train': 1.458284616470337} 11/07/2021 16:08:29 - INFO - __main__ - Step 134510: {'lr': 1.339455984954241e-05, 'samples': 25825920, 'steps': 134509, 'loss/train': 0.9743942022323608} 11/07/2021 16:08:30 - INFO - __main__ - Step 134511: {'lr': 1.3392846178919228e-05, 'samples': 25826112, 'steps': 134510, 'loss/train': 1.261187195777893} 11/07/2021 16:08:31 - INFO - __main__ - Step 134512: {'lr': 1.339113261490757e-05, 'samples': 25826304, 'steps': 134511, 'loss/train': 1.3371435403823853} 11/07/2021 16:08:31 - INFO - __main__ - Step 134513: {'lr': 1.3389419157508216e-05, 'samples': 25826496, 'steps': 134512, 'loss/train': 1.478323221206665} 11/07/2021 16:08:31 - INFO - __main__ - Step 134514: {'lr': 1.3387705806721939e-05, 'samples': 25826688, 'steps': 134513, 'loss/train': 1.4704526662826538} 11/07/2021 16:08:32 - INFO - __main__ - Step 134515: {'lr': 1.3385992562549493e-05, 'samples': 25826880, 'steps': 134514, 'loss/train': 1.2259196043014526} 11/07/2021 16:08:32 - INFO - __main__ - Step 134516: {'lr': 1.3384279424991708e-05, 'samples': 25827072, 'steps': 134515, 'loss/train': 0.9774549007415771} 11/07/2021 16:08:33 - INFO - __main__ - Step 134517: {'lr': 1.3382566394049278e-05, 'samples': 25827264, 'steps': 134516, 'loss/train': 1.5935630798339844} 11/07/2021 16:08:34 - INFO - __main__ - Step 134518: {'lr': 1.3380853469723036e-05, 'samples': 25827456, 'steps': 134517, 'loss/train': 1.2399870157241821} 11/07/2021 16:08:34 - INFO - __main__ - Step 134519: {'lr': 1.3379140652013733e-05, 'samples': 25827648, 'steps': 134518, 'loss/train': 1.5147815942764282} 11/07/2021 16:08:34 - INFO - __main__ - Step 134520: {'lr': 1.3377427940922144e-05, 'samples': 25827840, 'steps': 134519, 'loss/train': 1.1791049242019653} 11/07/2021 16:08:35 - INFO - __main__ - Step 134521: {'lr': 1.3375715336449018e-05, 'samples': 25828032, 'steps': 134520, 'loss/train': 1.3285009860992432} 11/07/2021 16:08:35 - INFO - __main__ - Step 134522: {'lr': 1.3374002838595162e-05, 'samples': 25828224, 'steps': 134521, 'loss/train': 1.5977556705474854} 11/07/2021 16:08:36 - INFO - __main__ - Step 134523: {'lr': 1.3372290447361296e-05, 'samples': 25828416, 'steps': 134522, 'loss/train': 1.2308143377304077} 11/07/2021 16:08:36 - INFO - __main__ - Step 134524: {'lr': 1.3370578162748253e-05, 'samples': 25828608, 'steps': 134523, 'loss/train': 1.1254247426986694} 11/07/2021 16:08:37 - INFO - __main__ - Step 134525: {'lr': 1.3368865984756756e-05, 'samples': 25828800, 'steps': 134524, 'loss/train': 1.4797441959381104} 11/07/2021 16:08:37 - INFO - __main__ - Step 134526: {'lr': 1.3367153913387609e-05, 'samples': 25828992, 'steps': 134525, 'loss/train': 1.7465640306472778} 11/07/2021 16:08:37 - INFO - __main__ - Step 134527: {'lr': 1.336544194864156e-05, 'samples': 25829184, 'steps': 134526, 'loss/train': 1.5056132078170776} 11/07/2021 16:08:38 - INFO - __main__ - Step 134528: {'lr': 1.3363730090519388e-05, 'samples': 25829376, 'steps': 134527, 'loss/train': 1.4362494945526123} 11/07/2021 16:08:39 - INFO - __main__ - Step 134529: {'lr': 1.336201833902187e-05, 'samples': 25829568, 'steps': 134528, 'loss/train': 1.7174062728881836} 11/07/2021 16:08:39 - INFO - __main__ - Step 134530: {'lr': 1.3360306694149781e-05, 'samples': 25829760, 'steps': 134529, 'loss/train': 1.2393829822540283} 11/07/2021 16:08:40 - INFO - __main__ - Step 134531: {'lr': 1.3358595155903903e-05, 'samples': 25829952, 'steps': 134530, 'loss/train': 1.2532768249511719} 11/07/2021 16:08:40 - INFO - __main__ - Step 134532: {'lr': 1.3356883724284952e-05, 'samples': 25830144, 'steps': 134531, 'loss/train': 1.60928213596344} 11/07/2021 16:08:41 - INFO - __main__ - Step 134533: {'lr': 1.3355172399293792e-05, 'samples': 25830336, 'steps': 134532, 'loss/train': 1.3145387172698975} 11/07/2021 16:08:41 - INFO - __main__ - Step 134534: {'lr': 1.3353461180931115e-05, 'samples': 25830528, 'steps': 134533, 'loss/train': 1.141695499420166} 11/07/2021 16:08:42 - INFO - __main__ - Step 134535: {'lr': 1.3351750069197699e-05, 'samples': 25830720, 'steps': 134534, 'loss/train': 1.9033379554748535} 11/07/2021 16:08:42 - INFO - __main__ - Step 134536: {'lr': 1.3350039064094349e-05, 'samples': 25830912, 'steps': 134535, 'loss/train': 1.0976790189743042} 11/07/2021 16:08:42 - INFO - __main__ - Step 134537: {'lr': 1.3348328165621815e-05, 'samples': 25831104, 'steps': 134536, 'loss/train': 1.4311184883117676} 11/07/2021 16:08:43 - INFO - __main__ - Step 134538: {'lr': 1.3346617373780873e-05, 'samples': 25831296, 'steps': 134537, 'loss/train': 0.9401372075080872} 11/07/2021 16:08:44 - INFO - __main__ - Step 134539: {'lr': 1.33449066885723e-05, 'samples': 25831488, 'steps': 134538, 'loss/train': 1.798484206199646} 11/07/2021 16:08:44 - INFO - __main__ - Step 134540: {'lr': 1.3343196109996847e-05, 'samples': 25831680, 'steps': 134539, 'loss/train': 0.9662876129150391} 11/07/2021 16:08:44 - INFO - __main__ - Step 134541: {'lr': 1.3341485638055289e-05, 'samples': 25831872, 'steps': 134540, 'loss/train': 1.0659831762313843} 11/07/2021 16:08:45 - INFO - __main__ - Step 134542: {'lr': 1.3339775272748433e-05, 'samples': 25832064, 'steps': 134541, 'loss/train': 0.698032021522522} 11/07/2021 16:08:45 - INFO - __main__ - Step 134543: {'lr': 1.3338065014077e-05, 'samples': 25832256, 'steps': 134542, 'loss/train': 1.2075392007827759} 11/07/2021 16:08:47 - INFO - __main__ - Step 134544: {'lr': 1.3336354862041794e-05, 'samples': 25832448, 'steps': 134543, 'loss/train': 0.3954601585865021} 11/07/2021 16:08:47 - INFO - __main__ - Step 134545: {'lr': 1.3334644816643565e-05, 'samples': 25832640, 'steps': 134544, 'loss/train': 1.6241191625595093} 11/07/2021 16:08:47 - INFO - __main__ - Step 134546: {'lr': 1.333293487788309e-05, 'samples': 25832832, 'steps': 134545, 'loss/train': 0.2846899926662445} 11/07/2021 16:08:48 - INFO - __main__ - Step 134547: {'lr': 1.3331225045761203e-05, 'samples': 25833024, 'steps': 134546, 'loss/train': 1.265499234199524} 11/07/2021 16:08:48 - INFO - __main__ - Step 134548: {'lr': 1.3329515320278568e-05, 'samples': 25833216, 'steps': 134547, 'loss/train': 2.416381597518921} 11/07/2021 16:08:48 - INFO - __main__ - Step 134549: {'lr': 1.3327805701435992e-05, 'samples': 25833408, 'steps': 134548, 'loss/train': 1.2934730052947998} 11/07/2021 16:08:49 - INFO - __main__ - Step 134550: {'lr': 1.3326096189234248e-05, 'samples': 25833600, 'steps': 134549, 'loss/train': 1.301682472229004} 11/07/2021 16:08:50 - INFO - __main__ - Step 134551: {'lr': 1.3324386783674147e-05, 'samples': 25833792, 'steps': 134550, 'loss/train': 1.5601806640625} 11/07/2021 16:08:50 - INFO - __main__ - Step 134552: {'lr': 1.3322677484756379e-05, 'samples': 25833984, 'steps': 134551, 'loss/train': 1.4764223098754883} 11/07/2021 16:08:50 - INFO - __main__ - Step 134553: {'lr': 1.3320968292481806e-05, 'samples': 25834176, 'steps': 134552, 'loss/train': 1.2384260892868042} 11/07/2021 16:08:51 - INFO - __main__ - Step 134554: {'lr': 1.331925920685112e-05, 'samples': 25834368, 'steps': 134553, 'loss/train': 1.1175616979599} 11/07/2021 16:08:52 - INFO - __main__ - Step 134555: {'lr': 1.3317550227865127e-05, 'samples': 25834560, 'steps': 134554, 'loss/train': 1.1047070026397705} 11/07/2021 16:08:52 - INFO - __main__ - Step 134556: {'lr': 1.3315841355524605e-05, 'samples': 25834752, 'steps': 134555, 'loss/train': 1.497990608215332} 11/07/2021 16:08:53 - INFO - __main__ - Step 134557: {'lr': 1.3314132589830303e-05, 'samples': 25834944, 'steps': 134556, 'loss/train': 1.4497921466827393} 11/07/2021 16:08:53 - INFO - __main__ - Step 134558: {'lr': 1.3312423930783024e-05, 'samples': 25835136, 'steps': 134557, 'loss/train': 1.284635066986084} 11/07/2021 16:08:53 - INFO - __main__ - Step 134559: {'lr': 1.331071537838352e-05, 'samples': 25835328, 'steps': 134558, 'loss/train': 1.2768266201019287} 11/07/2021 16:08:54 - INFO - __main__ - Step 134560: {'lr': 1.3309006932632539e-05, 'samples': 25835520, 'steps': 134559, 'loss/train': 1.3909687995910645} 11/07/2021 16:08:55 - INFO - __main__ - Step 134561: {'lr': 1.3307298593530858e-05, 'samples': 25835712, 'steps': 134560, 'loss/train': 1.9666087627410889} 11/07/2021 16:08:55 - INFO - __main__ - Step 134562: {'lr': 1.3305590361079255e-05, 'samples': 25835904, 'steps': 134561, 'loss/train': 1.3614953756332397} 11/07/2021 16:08:55 - INFO - __main__ - Step 134563: {'lr': 1.3303882235278508e-05, 'samples': 25836096, 'steps': 134562, 'loss/train': 1.6351877450942993} 11/07/2021 16:08:56 - INFO - __main__ - Step 134564: {'lr': 1.3302174216129364e-05, 'samples': 25836288, 'steps': 134563, 'loss/train': 1.1723346710205078} 11/07/2021 16:08:56 - INFO - __main__ - Step 134565: {'lr': 1.330046630363263e-05, 'samples': 25836480, 'steps': 134564, 'loss/train': 0.48952510952949524} 11/07/2021 16:08:57 - INFO - __main__ - Step 134566: {'lr': 1.3298758497789026e-05, 'samples': 25836672, 'steps': 134565, 'loss/train': 1.4364140033721924} 11/07/2021 16:08:57 - INFO - __main__ - Step 134567: {'lr': 1.3297050798599358e-05, 'samples': 25836864, 'steps': 134566, 'loss/train': 0.9377339482307434} 11/07/2021 16:08:58 - INFO - __main__ - Step 134568: {'lr': 1.3295343206064402e-05, 'samples': 25837056, 'steps': 134567, 'loss/train': 1.2589728832244873} 11/07/2021 16:08:58 - INFO - __main__ - Step 134569: {'lr': 1.329363572018491e-05, 'samples': 25837248, 'steps': 134568, 'loss/train': 1.408158302307129} 11/07/2021 16:08:58 - INFO - __main__ - Step 134570: {'lr': 1.3291928340961684e-05, 'samples': 25837440, 'steps': 134569, 'loss/train': 1.5039838552474976} 11/07/2021 16:09:00 - INFO - __main__ - Step 134571: {'lr': 1.329022106839542e-05, 'samples': 25837632, 'steps': 134570, 'loss/train': 1.6353645324707031} 11/07/2021 16:09:00 - INFO - __main__ - Step 134572: {'lr': 1.3288513902486921e-05, 'samples': 25837824, 'steps': 134571, 'loss/train': 0.9965518712997437} 11/07/2021 16:09:01 - INFO - __main__ - Step 134573: {'lr': 1.3286806843236992e-05, 'samples': 25838016, 'steps': 134572, 'loss/train': 1.2775968313217163} 11/07/2021 16:09:01 - INFO - __main__ - Step 134574: {'lr': 1.3285099890646357e-05, 'samples': 25838208, 'steps': 134573, 'loss/train': 1.1574292182922363} 11/07/2021 16:09:01 - INFO - __main__ - Step 134575: {'lr': 1.328339304471579e-05, 'samples': 25838400, 'steps': 134574, 'loss/train': 1.6566318273544312} 11/07/2021 16:09:02 - INFO - __main__ - Step 134576: {'lr': 1.32816863054461e-05, 'samples': 25838592, 'steps': 134575, 'loss/train': 1.054391622543335} 11/07/2021 16:09:03 - INFO - __main__ - Step 134577: {'lr': 1.3279979672838032e-05, 'samples': 25838784, 'steps': 134576, 'loss/train': 0.5639268159866333} 11/07/2021 16:09:03 - INFO - __main__ - Step 134578: {'lr': 1.327827314689234e-05, 'samples': 25838976, 'steps': 134577, 'loss/train': 1.2090849876403809} 11/07/2021 16:09:03 - INFO - __main__ - Step 134579: {'lr': 1.3276566727609795e-05, 'samples': 25839168, 'steps': 134578, 'loss/train': 1.3478150367736816} 11/07/2021 16:09:04 - INFO - __main__ - Step 134580: {'lr': 1.327486041499118e-05, 'samples': 25839360, 'steps': 134579, 'loss/train': 1.4638546705245972} 11/07/2021 16:09:05 - INFO - __main__ - Step 134581: {'lr': 1.3273154209037297e-05, 'samples': 25839552, 'steps': 134580, 'loss/train': 1.3836115598678589} 11/07/2021 16:09:05 - INFO - __main__ - Step 134582: {'lr': 1.3271448109748867e-05, 'samples': 25839744, 'steps': 134581, 'loss/train': 1.3039556741714478} 11/07/2021 16:09:06 - INFO - __main__ - Step 134583: {'lr': 1.3269742117126643e-05, 'samples': 25839936, 'steps': 134582, 'loss/train': 1.4233318567276} 11/07/2021 16:09:06 - INFO - __main__ - Step 134584: {'lr': 1.3268036231171426e-05, 'samples': 25840128, 'steps': 134583, 'loss/train': 1.0356885194778442} 11/07/2021 16:09:06 - INFO - __main__ - Step 134585: {'lr': 1.3266330451883969e-05, 'samples': 25840320, 'steps': 134584, 'loss/train': 1.377963900566101} 11/07/2021 16:09:07 - INFO - __main__ - Step 134586: {'lr': 1.3264624779265072e-05, 'samples': 25840512, 'steps': 134585, 'loss/train': 1.2739158868789673} 11/07/2021 16:09:08 - INFO - __main__ - Step 134587: {'lr': 1.3262919213315461e-05, 'samples': 25840704, 'steps': 134586, 'loss/train': 0.971830427646637} 11/07/2021 16:09:08 - INFO - __main__ - Step 134588: {'lr': 1.326121375403594e-05, 'samples': 25840896, 'steps': 134587, 'loss/train': 1.4819117784500122} 11/07/2021 16:09:08 - INFO - __main__ - Step 134589: {'lr': 1.3259508401427256e-05, 'samples': 25841088, 'steps': 134588, 'loss/train': 1.374852180480957} 11/07/2021 16:09:09 - INFO - __main__ - Step 134590: {'lr': 1.3257803155490189e-05, 'samples': 25841280, 'steps': 134589, 'loss/train': 1.3098706007003784} 11/07/2021 16:09:10 - INFO - __main__ - Step 134591: {'lr': 1.3256098016225515e-05, 'samples': 25841472, 'steps': 134590, 'loss/train': 1.1483008861541748} 11/07/2021 16:09:10 - INFO - __main__ - Step 134592: {'lr': 1.3254392983634011e-05, 'samples': 25841664, 'steps': 134591, 'loss/train': 1.4321602582931519} 11/07/2021 16:09:11 - INFO - __main__ - Step 134593: {'lr': 1.3252688057716373e-05, 'samples': 25841856, 'steps': 134592, 'loss/train': 0.9110382199287415} 11/07/2021 16:09:11 - INFO - __main__ - Step 134594: {'lr': 1.3250983238473457e-05, 'samples': 25842048, 'steps': 134593, 'loss/train': 0.4078730046749115} 11/07/2021 16:09:11 - INFO - __main__ - Step 134595: {'lr': 1.324927852590596e-05, 'samples': 25842240, 'steps': 134594, 'loss/train': 1.1852279901504517} 11/07/2021 16:09:12 - INFO - __main__ - Step 134596: {'lr': 1.3247573920014717e-05, 'samples': 25842432, 'steps': 134595, 'loss/train': 1.6110610961914062} 11/07/2021 16:09:13 - INFO - __main__ - Step 134597: {'lr': 1.3245869420800444e-05, 'samples': 25842624, 'steps': 134596, 'loss/train': 0.7052357196807861} 11/07/2021 16:09:13 - INFO - __main__ - Step 134598: {'lr': 1.3244165028263921e-05, 'samples': 25842816, 'steps': 134597, 'loss/train': 1.3780665397644043} 11/07/2021 16:09:14 - INFO - __main__ - Step 134599: {'lr': 1.3242460742405926e-05, 'samples': 25843008, 'steps': 134598, 'loss/train': 1.2564444541931152} 11/07/2021 16:09:14 - INFO - __main__ - Step 134600: {'lr': 1.3240756563227235e-05, 'samples': 25843200, 'steps': 134599, 'loss/train': 1.2964674234390259} 11/07/2021 16:09:14 - INFO - __main__ - Step 134601: {'lr': 1.3239052490728626e-05, 'samples': 25843392, 'steps': 134600, 'loss/train': 1.4510159492492676} 11/07/2021 16:09:16 - INFO - __main__ - Step 134602: {'lr': 1.323734852491082e-05, 'samples': 25843584, 'steps': 134601, 'loss/train': 0.6362419128417969} 11/07/2021 16:09:16 - INFO - __main__ - Step 134603: {'lr': 1.3235644665774649e-05, 'samples': 25843776, 'steps': 134602, 'loss/train': 1.2049506902694702} 11/07/2021 16:09:16 - INFO - __main__ - Step 134604: {'lr': 1.3233940913320807e-05, 'samples': 25843968, 'steps': 134603, 'loss/train': 1.228906512260437} 11/07/2021 16:09:17 - INFO - __main__ - Step 134605: {'lr': 1.3232237267550101e-05, 'samples': 25844160, 'steps': 134604, 'loss/train': 0.9543282985687256} 11/07/2021 16:09:17 - INFO - __main__ - Step 134606: {'lr': 1.3230533728463278e-05, 'samples': 25844352, 'steps': 134605, 'loss/train': 1.4007633924484253} 11/07/2021 16:09:17 - INFO - __main__ - Step 134607: {'lr': 1.3228830296061146e-05, 'samples': 25844544, 'steps': 134606, 'loss/train': 1.3616917133331299} 11/07/2021 16:09:18 - INFO - __main__ - Step 134608: {'lr': 1.322712697034445e-05, 'samples': 25844736, 'steps': 134607, 'loss/train': 0.11063151061534882} 11/07/2021 16:09:19 - INFO - __main__ - Step 134609: {'lr': 1.3225423751313942e-05, 'samples': 25844928, 'steps': 134608, 'loss/train': 0.8864167928695679} 11/07/2021 16:09:19 - INFO - __main__ - Step 134610: {'lr': 1.3223720638970427e-05, 'samples': 25845120, 'steps': 134609, 'loss/train': 1.4069452285766602} 11/07/2021 16:09:19 - INFO - __main__ - Step 134611: {'lr': 1.3222017633314625e-05, 'samples': 25845312, 'steps': 134610, 'loss/train': 1.5998749732971191} 11/07/2021 16:09:20 - INFO - __main__ - Step 134612: {'lr': 1.3220314734347344e-05, 'samples': 25845504, 'steps': 134611, 'loss/train': 1.4714601039886475} 11/07/2021 16:09:21 - INFO - __main__ - Step 134613: {'lr': 1.321861194206933e-05, 'samples': 25845696, 'steps': 134612, 'loss/train': 1.2321017980575562} 11/07/2021 16:09:21 - INFO - __main__ - Step 134614: {'lr': 1.321690925648139e-05, 'samples': 25845888, 'steps': 134613, 'loss/train': 1.0324429273605347} 11/07/2021 16:09:22 - INFO - __main__ - Step 134615: {'lr': 1.3215206677584218e-05, 'samples': 25846080, 'steps': 134614, 'loss/train': 0.08135740458965302} 11/07/2021 16:09:22 - INFO - __main__ - Step 134616: {'lr': 1.3213504205378617e-05, 'samples': 25846272, 'steps': 134615, 'loss/train': 1.1409646272659302} 11/07/2021 16:09:22 - INFO - __main__ - Step 134617: {'lr': 1.3211801839865367e-05, 'samples': 25846464, 'steps': 134616, 'loss/train': 0.8926801681518555} 11/07/2021 16:09:24 - INFO - __main__ - Step 134618: {'lr': 1.3210099581045215e-05, 'samples': 25846656, 'steps': 134617, 'loss/train': 1.232724666595459} 11/07/2021 16:09:24 - INFO - __main__ - Step 134619: {'lr': 1.3208397428918967e-05, 'samples': 25846848, 'steps': 134618, 'loss/train': 1.3830374479293823} 11/07/2021 16:09:24 - INFO - __main__ - Step 134620: {'lr': 1.3206695383487343e-05, 'samples': 25847040, 'steps': 134619, 'loss/train': 1.3742272853851318} 11/07/2021 16:09:25 - INFO - __main__ - Step 134621: {'lr': 1.3204993444751123e-05, 'samples': 25847232, 'steps': 134620, 'loss/train': 1.461037278175354} 11/07/2021 16:09:25 - INFO - __main__ - Step 134622: {'lr': 1.3203291612711082e-05, 'samples': 25847424, 'steps': 134621, 'loss/train': 1.7751230001449585} 11/07/2021 16:09:25 - INFO - __main__ - Step 134623: {'lr': 1.320158988736797e-05, 'samples': 25847616, 'steps': 134622, 'loss/train': 1.6203995943069458} 11/07/2021 16:09:26 - INFO - __main__ - Step 134624: {'lr': 1.3199888268722593e-05, 'samples': 25847808, 'steps': 134623, 'loss/train': 2.217658042907715} 11/07/2021 16:09:27 - INFO - __main__ - Step 134625: {'lr': 1.3198186756775671e-05, 'samples': 25848000, 'steps': 134624, 'loss/train': 1.5770035982131958} 11/07/2021 16:09:27 - INFO - __main__ - Step 134626: {'lr': 1.319648535152801e-05, 'samples': 25848192, 'steps': 134625, 'loss/train': 1.6623305082321167} 11/07/2021 16:09:28 - INFO - __main__ - Step 134627: {'lr': 1.3194784052980385e-05, 'samples': 25848384, 'steps': 134626, 'loss/train': 0.4825218617916107} 11/07/2021 16:09:28 - INFO - __main__ - Step 134628: {'lr': 1.3193082861133493e-05, 'samples': 25848576, 'steps': 134627, 'loss/train': 1.5459825992584229} 11/07/2021 16:09:29 - INFO - __main__ - Step 134629: {'lr': 1.3191381775988137e-05, 'samples': 25848768, 'steps': 134628, 'loss/train': 1.2472691535949707} 11/07/2021 16:09:29 - INFO - __main__ - Step 134630: {'lr': 1.3189680797545122e-05, 'samples': 25848960, 'steps': 134629, 'loss/train': 0.9453253149986267} 11/07/2021 16:09:30 - INFO - __main__ - Step 134631: {'lr': 1.318797992580517e-05, 'samples': 25849152, 'steps': 134630, 'loss/train': 1.3889620304107666} 11/07/2021 16:09:30 - INFO - __main__ - Step 134632: {'lr': 1.3186279160769032e-05, 'samples': 25849344, 'steps': 134631, 'loss/train': 1.7625125646591187} 11/07/2021 16:09:30 - INFO - __main__ - Step 134633: {'lr': 1.318457850243754e-05, 'samples': 25849536, 'steps': 134632, 'loss/train': 1.6953701972961426} 11/07/2021 16:09:31 - INFO - __main__ - Step 134634: {'lr': 1.3182877950811411e-05, 'samples': 25849728, 'steps': 134633, 'loss/train': 1.1202882528305054} 11/07/2021 16:09:32 - INFO - __main__ - Step 134635: {'lr': 1.3181177505891428e-05, 'samples': 25849920, 'steps': 134634, 'loss/train': 1.1584089994430542} 11/07/2021 16:09:32 - INFO - __main__ - Step 134636: {'lr': 1.317947716767834e-05, 'samples': 25850112, 'steps': 134635, 'loss/train': 1.2660324573516846} 11/07/2021 16:09:32 - INFO - __main__ - Step 134637: {'lr': 1.317777693617292e-05, 'samples': 25850304, 'steps': 134636, 'loss/train': 1.3100756406784058} 11/07/2021 16:09:33 - INFO - __main__ - Step 134638: {'lr': 1.3176076811375947e-05, 'samples': 25850496, 'steps': 134637, 'loss/train': 1.385596752166748} 11/07/2021 16:09:33 - INFO - __main__ - Step 134639: {'lr': 1.3174376793288173e-05, 'samples': 25850688, 'steps': 134638, 'loss/train': 2.5001730918884277} 11/07/2021 16:09:34 - INFO - __main__ - Step 134640: {'lr': 1.31726768819104e-05, 'samples': 25850880, 'steps': 134639, 'loss/train': 0.8148883581161499} 11/07/2021 16:09:35 - INFO - __main__ - Step 134641: {'lr': 1.317097707724338e-05, 'samples': 25851072, 'steps': 134640, 'loss/train': 0.7957866191864014} 11/07/2021 16:09:35 - INFO - __main__ - Step 134642: {'lr': 1.3169277379287803e-05, 'samples': 25851264, 'steps': 134641, 'loss/train': 1.1438971757888794} 11/07/2021 16:09:35 - INFO - __main__ - Step 134643: {'lr': 1.3167577788044532e-05, 'samples': 25851456, 'steps': 134642, 'loss/train': 1.4702144861221313} 11/07/2021 16:09:36 - INFO - __main__ - Step 134644: {'lr': 1.3165878303514289e-05, 'samples': 25851648, 'steps': 134643, 'loss/train': 1.6561236381530762} 11/07/2021 16:09:37 - INFO - __main__ - Step 134645: {'lr': 1.3164178925697824e-05, 'samples': 25851840, 'steps': 134644, 'loss/train': 1.3989049196243286} 11/07/2021 16:09:37 - INFO - __main__ - Step 134646: {'lr': 1.3162479654595938e-05, 'samples': 25852032, 'steps': 134645, 'loss/train': 1.8258323669433594} 11/07/2021 16:09:38 - INFO - __main__ - Step 134647: {'lr': 1.3160780490209384e-05, 'samples': 25852224, 'steps': 134646, 'loss/train': 1.5555500984191895} 11/07/2021 16:09:38 - INFO - __main__ - Step 134648: {'lr': 1.3159081432538939e-05, 'samples': 25852416, 'steps': 134647, 'loss/train': 1.608778476715088} 11/07/2021 16:09:38 - INFO - __main__ - Step 134649: {'lr': 1.315738248158535e-05, 'samples': 25852608, 'steps': 134648, 'loss/train': 1.2274858951568604} 11/07/2021 16:09:40 - INFO - __main__ - Step 134650: {'lr': 1.3155683637349398e-05, 'samples': 25852800, 'steps': 134649, 'loss/train': 1.488991618156433} 11/07/2021 16:09:40 - INFO - __main__ - Step 134651: {'lr': 1.3153984899831827e-05, 'samples': 25852992, 'steps': 134650, 'loss/train': 1.2924625873565674} 11/07/2021 16:09:40 - INFO - __main__ - Step 134652: {'lr': 1.315228626903342e-05, 'samples': 25853184, 'steps': 134651, 'loss/train': 1.4478296041488647} 11/07/2021 16:09:41 - INFO - __main__ - Step 134653: {'lr': 1.3150587744954923e-05, 'samples': 25853376, 'steps': 134652, 'loss/train': 1.620406150817871} 11/07/2021 16:09:41 - INFO - __main__ - Step 134654: {'lr': 1.3148889327597169e-05, 'samples': 25853568, 'steps': 134653, 'loss/train': 0.09294909238815308} 11/07/2021 16:09:42 - INFO - __main__ - Step 134655: {'lr': 1.3147191016960853e-05, 'samples': 25853760, 'steps': 134654, 'loss/train': 1.818591833114624} 11/07/2021 16:09:42 - INFO - __main__ - Step 134656: {'lr': 1.3145492813046722e-05, 'samples': 25853952, 'steps': 134655, 'loss/train': 1.3748944997787476} 11/07/2021 16:09:43 - INFO - __main__ - Step 134657: {'lr': 1.3143794715855584e-05, 'samples': 25854144, 'steps': 134656, 'loss/train': 1.0837836265563965} 11/07/2021 16:09:43 - INFO - __main__ - Step 134658: {'lr': 1.3142096725388214e-05, 'samples': 25854336, 'steps': 134657, 'loss/train': 1.2195508480072021} 11/07/2021 16:09:44 - INFO - __main__ - Step 134659: {'lr': 1.3140398841645362e-05, 'samples': 25854528, 'steps': 134658, 'loss/train': 1.0849146842956543} 11/07/2021 16:09:44 - INFO - __main__ - Step 134660: {'lr': 1.3138701064627778e-05, 'samples': 25854720, 'steps': 134659, 'loss/train': 1.2749278545379639} 11/07/2021 16:09:45 - INFO - __main__ - Step 134661: {'lr': 1.3137003394336239e-05, 'samples': 25854912, 'steps': 134660, 'loss/train': 1.167035698890686} 11/07/2021 16:09:45 - INFO - __main__ - Step 134662: {'lr': 1.313530583077152e-05, 'samples': 25855104, 'steps': 134661, 'loss/train': 1.0389471054077148} 11/07/2021 16:09:46 - INFO - __main__ - Step 134663: {'lr': 1.3133608373934374e-05, 'samples': 25855296, 'steps': 134662, 'loss/train': 1.0256961584091187} 11/07/2021 16:09:46 - INFO - __main__ - Step 134664: {'lr': 1.3131911023825577e-05, 'samples': 25855488, 'steps': 134663, 'loss/train': 1.400849461555481} 11/07/2021 16:09:46 - INFO - __main__ - Step 134665: {'lr': 1.3130213780445877e-05, 'samples': 25855680, 'steps': 134664, 'loss/train': 1.2442857027053833} 11/07/2021 16:09:48 - INFO - __main__ - Step 134666: {'lr': 1.3128516643796023e-05, 'samples': 25855872, 'steps': 134665, 'loss/train': 1.382136583328247} 11/07/2021 16:09:48 - INFO - __main__ - Step 134667: {'lr': 1.312681961387685e-05, 'samples': 25856064, 'steps': 134666, 'loss/train': 1.351227879524231} 11/07/2021 16:09:48 - INFO - __main__ - Step 134668: {'lr': 1.3125122690689079e-05, 'samples': 25856256, 'steps': 134667, 'loss/train': 1.690517544746399} 11/07/2021 16:09:49 - INFO - __main__ - Step 134669: {'lr': 1.3123425874233458e-05, 'samples': 25856448, 'steps': 134668, 'loss/train': 1.3910622596740723} 11/07/2021 16:09:49 - INFO - __main__ - Step 134670: {'lr': 1.3121729164510766e-05, 'samples': 25856640, 'steps': 134669, 'loss/train': 1.3052984476089478} 11/07/2021 16:09:50 - INFO - __main__ - Step 134671: {'lr': 1.312003256152175e-05, 'samples': 25856832, 'steps': 134670, 'loss/train': 1.5560500621795654} 11/07/2021 16:09:50 - INFO - __main__ - Step 134672: {'lr': 1.311833606526719e-05, 'samples': 25857024, 'steps': 134671, 'loss/train': 1.5320041179656982} 11/07/2021 16:09:51 - INFO - __main__ - Step 134673: {'lr': 1.311663967574786e-05, 'samples': 25857216, 'steps': 134672, 'loss/train': 1.697941780090332} 11/07/2021 16:09:51 - INFO - __main__ - Step 134674: {'lr': 1.311494339296454e-05, 'samples': 25857408, 'steps': 134673, 'loss/train': 1.1506493091583252} 11/07/2021 16:09:51 - INFO - __main__ - Step 134675: {'lr': 1.311324721691795e-05, 'samples': 25857600, 'steps': 134674, 'loss/train': 1.3948071002960205} 11/07/2021 16:09:52 - INFO - __main__ - Step 134676: {'lr': 1.3111551147608868e-05, 'samples': 25857792, 'steps': 134675, 'loss/train': 1.7896441221237183} 11/07/2021 16:09:53 - INFO - __main__ - Step 134677: {'lr': 1.3109855185038072e-05, 'samples': 25857984, 'steps': 134676, 'loss/train': 0.7795642614364624} 11/07/2021 16:09:53 - INFO - __main__ - Step 134678: {'lr': 1.3108159329206337e-05, 'samples': 25858176, 'steps': 134677, 'loss/train': 0.9417490363121033} 11/07/2021 16:09:53 - INFO - __main__ - Step 134679: {'lr': 1.3106463580114386e-05, 'samples': 25858368, 'steps': 134678, 'loss/train': 1.5502700805664062} 11/07/2021 16:09:54 - INFO - __main__ - Step 134680: {'lr': 1.3104767937763024e-05, 'samples': 25858560, 'steps': 134679, 'loss/train': 1.5206937789916992} 11/07/2021 16:09:55 - INFO - __main__ - Step 134681: {'lr': 1.3103072402153027e-05, 'samples': 25858752, 'steps': 134680, 'loss/train': 1.154349446296692} 11/07/2021 16:09:55 - INFO - __main__ - Step 134682: {'lr': 1.3101376973285089e-05, 'samples': 25858944, 'steps': 134681, 'loss/train': 0.5192427039146423} 11/07/2021 16:09:55 - INFO - __main__ - Step 134683: {'lr': 1.3099681651160018e-05, 'samples': 25859136, 'steps': 134682, 'loss/train': 1.3087234497070312} 11/07/2021 16:09:56 - INFO - __main__ - Step 134684: {'lr': 1.3097986435778559e-05, 'samples': 25859328, 'steps': 134683, 'loss/train': 1.6110694408416748} 11/07/2021 16:09:56 - INFO - __main__ - Step 134685: {'lr': 1.3096291327141518e-05, 'samples': 25859520, 'steps': 134684, 'loss/train': 1.3968558311462402} 11/07/2021 16:09:57 - INFO - __main__ - Step 134686: {'lr': 1.3094596325249619e-05, 'samples': 25859712, 'steps': 134685, 'loss/train': 1.0757533311843872} 11/07/2021 16:09:58 - INFO - __main__ - Step 134687: {'lr': 1.3092901430103638e-05, 'samples': 25859904, 'steps': 134686, 'loss/train': 1.5501846075057983} 11/07/2021 16:09:58 - INFO - __main__ - Step 134688: {'lr': 1.309120664170435e-05, 'samples': 25860096, 'steps': 134687, 'loss/train': 1.5078376531600952} 11/07/2021 16:09:58 - INFO - __main__ - Step 134689: {'lr': 1.3089511960052508e-05, 'samples': 25860288, 'steps': 134688, 'loss/train': 1.541727066040039} 11/07/2021 16:09:59 - INFO - __main__ - Step 134690: {'lr': 1.308781738514886e-05, 'samples': 25860480, 'steps': 134689, 'loss/train': 1.3868790864944458} 11/07/2021 16:09:59 - INFO - __main__ - Step 134691: {'lr': 1.3086122916994208e-05, 'samples': 25860672, 'steps': 134690, 'loss/train': 1.4908965826034546} 11/07/2021 16:10:00 - INFO - __main__ - Step 134692: {'lr': 1.3084428555589279e-05, 'samples': 25860864, 'steps': 134691, 'loss/train': 1.0963332653045654} 11/07/2021 16:10:00 - INFO - __main__ - Step 134693: {'lr': 1.3082734300934872e-05, 'samples': 25861056, 'steps': 134692, 'loss/train': 0.9609072208404541} 11/07/2021 16:10:01 - INFO - __main__ - Step 134694: {'lr': 1.3081040153031688e-05, 'samples': 25861248, 'steps': 134693, 'loss/train': 1.336383581161499} 11/07/2021 16:10:01 - INFO - __main__ - Step 134695: {'lr': 1.3079346111880607e-05, 'samples': 25861440, 'steps': 134694, 'loss/train': 1.3651396036148071} 11/07/2021 16:10:01 - INFO - __main__ - Step 134696: {'lr': 1.3077652177482246e-05, 'samples': 25861632, 'steps': 134695, 'loss/train': 1.2268319129943848} 11/07/2021 16:10:02 - INFO - __main__ - Step 134697: {'lr': 1.3075958349837463e-05, 'samples': 25861824, 'steps': 134696, 'loss/train': 1.1747756004333496} 11/07/2021 16:10:03 - INFO - __main__ - Step 134698: {'lr': 1.3074264628947008e-05, 'samples': 25862016, 'steps': 134697, 'loss/train': 0.9211385846138} 11/07/2021 16:10:03 - INFO - __main__ - Step 134699: {'lr': 1.3072571014811602e-05, 'samples': 25862208, 'steps': 134698, 'loss/train': 1.362883448600769} 11/07/2021 16:10:03 - INFO - __main__ - Step 134700: {'lr': 1.3070877507432049e-05, 'samples': 25862400, 'steps': 134699, 'loss/train': 0.9713283777236938} 11/07/2021 16:10:04 - INFO - __main__ - Step 134701: {'lr': 1.3069184106809128e-05, 'samples': 25862592, 'steps': 134700, 'loss/train': 1.3268072605133057} 11/07/2021 16:10:05 - INFO - __main__ - Step 134702: {'lr': 1.3067490812943562e-05, 'samples': 25862784, 'steps': 134701, 'loss/train': 0.8797556757926941} 11/07/2021 16:10:05 - INFO - __main__ - Step 134703: {'lr': 1.3065797625836124e-05, 'samples': 25862976, 'steps': 134702, 'loss/train': 1.6688863039016724} 11/07/2021 16:10:06 - INFO - __main__ - Step 134704: {'lr': 1.3064104545487565e-05, 'samples': 25863168, 'steps': 134703, 'loss/train': 1.529528021812439} 11/07/2021 16:10:06 - INFO - __main__ - Step 134705: {'lr': 1.3062411571898691e-05, 'samples': 25863360, 'steps': 134704, 'loss/train': 1.6563090085983276} 11/07/2021 16:10:06 - INFO - __main__ - Step 134706: {'lr': 1.3060718705070223e-05, 'samples': 25863552, 'steps': 134705, 'loss/train': 0.9730424880981445} 11/07/2021 16:10:07 - INFO - __main__ - Step 134707: {'lr': 1.305902594500294e-05, 'samples': 25863744, 'steps': 134706, 'loss/train': 1.340207815170288} 11/07/2021 16:10:08 - INFO - __main__ - Step 134708: {'lr': 1.3057333291697644e-05, 'samples': 25863936, 'steps': 134707, 'loss/train': 1.5110427141189575} 11/07/2021 16:10:08 - INFO - __main__ - Step 134709: {'lr': 1.3055640745155028e-05, 'samples': 25864128, 'steps': 134708, 'loss/train': 1.3341630697250366} 11/07/2021 16:10:08 - INFO - __main__ - Step 134710: {'lr': 1.3053948305375874e-05, 'samples': 25864320, 'steps': 134709, 'loss/train': 1.5068798065185547} 11/07/2021 16:10:09 - INFO - __main__ - Step 134711: {'lr': 1.3052255972360954e-05, 'samples': 25864512, 'steps': 134710, 'loss/train': 1.556067705154419} 11/07/2021 16:10:10 - INFO - __main__ - Step 134712: {'lr': 1.3050563746111022e-05, 'samples': 25864704, 'steps': 134711, 'loss/train': 1.5215060710906982} 11/07/2021 16:10:10 - INFO - __main__ - Step 134713: {'lr': 1.3048871626626879e-05, 'samples': 25864896, 'steps': 134712, 'loss/train': 1.7053782939910889} 11/07/2021 16:10:10 - INFO - __main__ - Step 134714: {'lr': 1.304717961390925e-05, 'samples': 25865088, 'steps': 134713, 'loss/train': 1.385064721107483} 11/07/2021 16:10:11 - INFO - __main__ - Step 134715: {'lr': 1.304548770795888e-05, 'samples': 25865280, 'steps': 134714, 'loss/train': 1.5081229209899902} 11/07/2021 16:10:11 - INFO - __main__ - Step 134716: {'lr': 1.3043795908776579e-05, 'samples': 25865472, 'steps': 134715, 'loss/train': 0.7033602595329285} 11/07/2021 16:10:11 - INFO - __main__ - Step 134717: {'lr': 1.3042104216363065e-05, 'samples': 25865664, 'steps': 134716, 'loss/train': 1.2203065156936646} 11/07/2021 16:10:13 - INFO - __main__ - Step 134718: {'lr': 1.3040412630719145e-05, 'samples': 25865856, 'steps': 134717, 'loss/train': 1.4719111919403076} 11/07/2021 16:10:13 - INFO - __main__ - Step 134719: {'lr': 1.3038721151845567e-05, 'samples': 25866048, 'steps': 134718, 'loss/train': 1.172887921333313} 11/07/2021 16:10:13 - INFO - __main__ - Step 134720: {'lr': 1.3037029779743054e-05, 'samples': 25866240, 'steps': 134719, 'loss/train': 0.8503167629241943} 11/07/2021 16:10:14 - INFO - __main__ - Step 134721: {'lr': 1.3035338514412409e-05, 'samples': 25866432, 'steps': 134720, 'loss/train': 1.3501228094100952} 11/07/2021 16:10:14 - INFO - __main__ - Step 134722: {'lr': 1.3033647355854439e-05, 'samples': 25866624, 'steps': 134721, 'loss/train': 1.1813255548477173} 11/07/2021 16:10:15 - INFO - __main__ - Step 134723: {'lr': 1.3031956304069808e-05, 'samples': 25866816, 'steps': 134722, 'loss/train': 0.9847546219825745} 11/07/2021 16:10:16 - INFO - __main__ - Step 134724: {'lr': 1.3030265359059296e-05, 'samples': 25867008, 'steps': 134723, 'loss/train': 1.7324113845825195} 11/07/2021 16:10:16 - INFO - __main__ - Step 134725: {'lr': 1.3028574520823732e-05, 'samples': 25867200, 'steps': 134724, 'loss/train': 1.0369023084640503} 11/07/2021 16:10:16 - INFO - __main__ - Step 134726: {'lr': 1.3026883789363786e-05, 'samples': 25867392, 'steps': 134725, 'loss/train': 1.3334892988204956} 11/07/2021 16:10:17 - INFO - __main__ - Step 134727: {'lr': 1.3025193164680316e-05, 'samples': 25867584, 'steps': 134726, 'loss/train': 1.4014531373977661} 11/07/2021 16:10:18 - INFO - __main__ - Step 134728: {'lr': 1.3023502646774017e-05, 'samples': 25867776, 'steps': 134727, 'loss/train': 1.4832768440246582} 11/07/2021 16:10:18 - INFO - __main__ - Step 134729: {'lr': 1.3021812235645664e-05, 'samples': 25867968, 'steps': 134728, 'loss/train': 1.535878300666809} 11/07/2021 16:10:18 - INFO - __main__ - Step 134730: {'lr': 1.3020121931296036e-05, 'samples': 25868160, 'steps': 134729, 'loss/train': 0.7401750087738037} 11/07/2021 16:10:19 - INFO - __main__ - Step 134731: {'lr': 1.3018431733725882e-05, 'samples': 25868352, 'steps': 134730, 'loss/train': 1.1593940258026123} 11/07/2021 16:10:19 - INFO - __main__ - Step 134732: {'lr': 1.3016741642935953e-05, 'samples': 25868544, 'steps': 134731, 'loss/train': 1.9723937511444092} 11/07/2021 16:10:20 - INFO - __main__ - Step 134733: {'lr': 1.3015051658927051e-05, 'samples': 25868736, 'steps': 134732, 'loss/train': 1.2884310483932495} 11/07/2021 16:10:21 - INFO - __main__ - Step 134734: {'lr': 1.3013361781699873e-05, 'samples': 25868928, 'steps': 134733, 'loss/train': 1.1760940551757812} 11/07/2021 16:10:21 - INFO - __main__ - Step 134735: {'lr': 1.3011672011255277e-05, 'samples': 25869120, 'steps': 134734, 'loss/train': 0.42986592650413513} 11/07/2021 16:10:21 - INFO - __main__ - Step 134736: {'lr': 1.3009982347593929e-05, 'samples': 25869312, 'steps': 134735, 'loss/train': 1.5938916206359863} 11/07/2021 16:10:22 - INFO - __main__ - Step 134737: {'lr': 1.3008292790716608e-05, 'samples': 25869504, 'steps': 134736, 'loss/train': 1.235451340675354} 11/07/2021 16:10:23 - INFO - __main__ - Step 134738: {'lr': 1.3006603340624118e-05, 'samples': 25869696, 'steps': 134737, 'loss/train': 1.1353774070739746} 11/07/2021 16:10:23 - INFO - __main__ - Step 134739: {'lr': 1.300491399731718e-05, 'samples': 25869888, 'steps': 134738, 'loss/train': 1.6008996963500977} 11/07/2021 16:10:23 - INFO - __main__ - Step 134740: {'lr': 1.3003224760796572e-05, 'samples': 25870080, 'steps': 134739, 'loss/train': 1.1517997980117798} 11/07/2021 16:10:24 - INFO - __main__ - Step 134741: {'lr': 1.3001535631063071e-05, 'samples': 25870272, 'steps': 134740, 'loss/train': 1.1225765943527222} 11/07/2021 16:10:24 - INFO - __main__ - Step 134742: {'lr': 1.2999846608117399e-05, 'samples': 25870464, 'steps': 134741, 'loss/train': 1.7161825895309448} 11/07/2021 16:10:24 - INFO - __main__ - Step 134743: {'lr': 1.299815769196036e-05, 'samples': 25870656, 'steps': 134742, 'loss/train': 1.7360919713974} 11/07/2021 16:10:26 - INFO - __main__ - Step 134744: {'lr': 1.2996468882592677e-05, 'samples': 25870848, 'steps': 134743, 'loss/train': 1.3752554655075073} 11/07/2021 16:10:26 - INFO - __main__ - Step 134745: {'lr': 1.2994780180015125e-05, 'samples': 25871040, 'steps': 134744, 'loss/train': 1.3291860818862915} 11/07/2021 16:10:26 - INFO - __main__ - Step 134746: {'lr': 1.2993091584228483e-05, 'samples': 25871232, 'steps': 134745, 'loss/train': 1.0417606830596924} 11/07/2021 16:10:27 - INFO - __main__ - Step 134747: {'lr': 1.2991403095233473e-05, 'samples': 25871424, 'steps': 134746, 'loss/train': 1.2870421409606934} 11/07/2021 16:10:27 - INFO - __main__ - Step 134748: {'lr': 1.2989714713030953e-05, 'samples': 25871616, 'steps': 134747, 'loss/train': 1.0921937227249146} 11/07/2021 16:10:28 - INFO - __main__ - Step 134749: {'lr': 1.2988026437621537e-05, 'samples': 25871808, 'steps': 134748, 'loss/train': 1.360261082649231} 11/07/2021 16:10:28 - INFO - __main__ - Step 134750: {'lr': 1.2986338269006082e-05, 'samples': 25872000, 'steps': 134749, 'loss/train': 0.8205714225769043} 11/07/2021 16:10:29 - INFO - __main__ - Step 134751: {'lr': 1.2984650207185312e-05, 'samples': 25872192, 'steps': 134750, 'loss/train': 1.6685978174209595} 11/07/2021 16:10:29 - INFO - __main__ - Step 134752: {'lr': 1.2982962252160003e-05, 'samples': 25872384, 'steps': 134751, 'loss/train': 1.8694363832473755} 11/07/2021 16:10:29 - INFO - __main__ - Step 134753: {'lr': 1.2981274403930932e-05, 'samples': 25872576, 'steps': 134752, 'loss/train': 0.8212910294532776} 11/07/2021 16:10:30 - INFO - __main__ - Step 134754: {'lr': 1.297958666249882e-05, 'samples': 25872768, 'steps': 134753, 'loss/train': 1.3778246641159058} 11/07/2021 16:10:31 - INFO - __main__ - Step 134755: {'lr': 1.2977899027864449e-05, 'samples': 25872960, 'steps': 134754, 'loss/train': 0.7869024872779846} 11/07/2021 16:10:31 - INFO - __main__ - Step 134756: {'lr': 1.297621150002859e-05, 'samples': 25873152, 'steps': 134755, 'loss/train': 1.6104954481124878} 11/07/2021 16:10:31 - INFO - __main__ - Step 134757: {'lr': 1.2974524078991995e-05, 'samples': 25873344, 'steps': 134756, 'loss/train': 1.0038294792175293} 11/07/2021 16:10:32 - INFO - __main__ - Step 134758: {'lr': 1.2972836764755413e-05, 'samples': 25873536, 'steps': 134757, 'loss/train': 1.5255440473556519} 11/07/2021 16:10:33 - INFO - __main__ - Step 134759: {'lr': 1.2971149557319623e-05, 'samples': 25873728, 'steps': 134758, 'loss/train': 1.5998514890670776} 11/07/2021 16:10:33 - INFO - __main__ - Step 134760: {'lr': 1.2969462456685372e-05, 'samples': 25873920, 'steps': 134759, 'loss/train': 1.3713481426239014} 11/07/2021 16:10:34 - INFO - __main__ - Step 134761: {'lr': 1.296777546285341e-05, 'samples': 25874112, 'steps': 134760, 'loss/train': 1.0938972234725952} 11/07/2021 16:10:34 - INFO - __main__ - Step 134762: {'lr': 1.296608857582457e-05, 'samples': 25874304, 'steps': 134761, 'loss/train': 1.1000462770462036} 11/07/2021 16:10:34 - INFO - __main__ - Step 134763: {'lr': 1.2964401795599489e-05, 'samples': 25874496, 'steps': 134762, 'loss/train': 1.447908878326416} 11/07/2021 16:10:35 - INFO - __main__ - Step 134764: {'lr': 1.2962715122179003e-05, 'samples': 25874688, 'steps': 134763, 'loss/train': 0.7347546219825745} 11/07/2021 16:10:36 - INFO - __main__ - Step 134765: {'lr': 1.2961028555563858e-05, 'samples': 25874880, 'steps': 134764, 'loss/train': 1.2955999374389648} 11/07/2021 16:10:36 - INFO - __main__ - Step 134766: {'lr': 1.2959342095754833e-05, 'samples': 25875072, 'steps': 134765, 'loss/train': 0.89357590675354} 11/07/2021 16:10:36 - INFO - __main__ - Step 134767: {'lr': 1.2957655742752649e-05, 'samples': 25875264, 'steps': 134766, 'loss/train': 1.8795100450515747} 11/07/2021 16:10:37 - INFO - __main__ - Step 134768: {'lr': 1.2955969496558111e-05, 'samples': 25875456, 'steps': 134767, 'loss/train': 1.1213316917419434} 11/07/2021 16:10:38 - INFO - __main__ - Step 134769: {'lr': 1.2954283357171943e-05, 'samples': 25875648, 'steps': 134768, 'loss/train': 1.2625309228897095} 11/07/2021 16:10:38 - INFO - __main__ - Step 134770: {'lr': 1.2952597324594916e-05, 'samples': 25875840, 'steps': 134769, 'loss/train': 1.0530054569244385} 11/07/2021 16:10:39 - INFO - __main__ - Step 134771: {'lr': 1.2950911398827786e-05, 'samples': 25876032, 'steps': 134770, 'loss/train': 1.5645390748977661} 11/07/2021 16:10:39 - INFO - __main__ - Step 134772: {'lr': 1.2949225579871327e-05, 'samples': 25876224, 'steps': 134771, 'loss/train': 1.1171433925628662} 11/07/2021 16:10:39 - INFO - __main__ - Step 134773: {'lr': 1.2947539867726288e-05, 'samples': 25876416, 'steps': 134772, 'loss/train': 1.3620150089263916} 11/07/2021 16:10:40 - INFO - __main__ - Step 134774: {'lr': 1.294585426239342e-05, 'samples': 25876608, 'steps': 134773, 'loss/train': 1.3385035991668701} 11/07/2021 16:10:41 - INFO - __main__ - Step 134775: {'lr': 1.2944168763873526e-05, 'samples': 25876800, 'steps': 134774, 'loss/train': 1.148057222366333} 11/07/2021 16:10:41 - INFO - __main__ - Step 134776: {'lr': 1.2942483372167302e-05, 'samples': 25876992, 'steps': 134775, 'loss/train': 1.0261671543121338} 11/07/2021 16:10:41 - INFO - __main__ - Step 134777: {'lr': 1.2940798087275523e-05, 'samples': 25877184, 'steps': 134776, 'loss/train': 1.3700577020645142} 11/07/2021 16:10:42 - INFO - __main__ - Step 134778: {'lr': 1.2939112909198996e-05, 'samples': 25877376, 'steps': 134777, 'loss/train': 0.9359004497528076} 11/07/2021 16:10:42 - INFO - __main__ - Step 134779: {'lr': 1.2937427837938415e-05, 'samples': 25877568, 'steps': 134778, 'loss/train': 1.210614800453186} 11/07/2021 16:10:43 - INFO - __main__ - Step 134780: {'lr': 1.2935742873494582e-05, 'samples': 25877760, 'steps': 134779, 'loss/train': 1.3444691896438599} 11/07/2021 16:10:43 - INFO - __main__ - Step 134781: {'lr': 1.293405801586825e-05, 'samples': 25877952, 'steps': 134780, 'loss/train': 1.229155421257019} 11/07/2021 16:10:44 - INFO - __main__ - Step 134782: {'lr': 1.2932373265060165e-05, 'samples': 25878144, 'steps': 134781, 'loss/train': 1.2245984077453613} 11/07/2021 16:10:44 - INFO - __main__ - Step 134783: {'lr': 1.2930688621071107e-05, 'samples': 25878336, 'steps': 134782, 'loss/train': 1.143988847732544} 11/07/2021 16:10:44 - INFO - __main__ - Step 134784: {'lr': 1.2929004083901824e-05, 'samples': 25878528, 'steps': 134783, 'loss/train': 0.9828055500984192} 11/07/2021 16:10:46 - INFO - __main__ - Step 134785: {'lr': 1.2927319653553066e-05, 'samples': 25878720, 'steps': 134784, 'loss/train': 0.17510904371738434} 11/07/2021 16:10:46 - INFO - __main__ - Step 134786: {'lr': 1.292563533002558e-05, 'samples': 25878912, 'steps': 134785, 'loss/train': 0.9683189988136292} 11/07/2021 16:10:46 - INFO - __main__ - Step 134787: {'lr': 1.2923951113320175e-05, 'samples': 25879104, 'steps': 134786, 'loss/train': 1.1486338376998901} 11/07/2021 16:10:47 - INFO - __main__ - Step 134788: {'lr': 1.2922267003437571e-05, 'samples': 25879296, 'steps': 134787, 'loss/train': 1.217489242553711} 11/07/2021 16:10:47 - INFO - __main__ - Step 134789: {'lr': 1.2920583000378573e-05, 'samples': 25879488, 'steps': 134788, 'loss/train': 0.9214939475059509} 11/07/2021 16:10:48 - INFO - __main__ - Step 134790: {'lr': 1.2918899104143844e-05, 'samples': 25879680, 'steps': 134789, 'loss/train': 1.3533412218093872} 11/07/2021 16:10:48 - INFO - __main__ - Step 134791: {'lr': 1.2917215314734221e-05, 'samples': 25879872, 'steps': 134790, 'loss/train': 1.3483279943466187} 11/07/2021 16:10:49 - INFO - __main__ - Step 134792: {'lr': 1.2915531632150423e-05, 'samples': 25880064, 'steps': 134791, 'loss/train': 1.1214323043823242} 11/07/2021 16:10:49 - INFO - __main__ - Step 134793: {'lr': 1.2913848056393258e-05, 'samples': 25880256, 'steps': 134792, 'loss/train': 1.1219063997268677} 11/07/2021 16:10:49 - INFO - __main__ - Step 134794: {'lr': 1.2912164587463442e-05, 'samples': 25880448, 'steps': 134793, 'loss/train': 1.3916800022125244} 11/07/2021 16:10:50 - INFO - __main__ - Step 134795: {'lr': 1.291048122536173e-05, 'samples': 25880640, 'steps': 134794, 'loss/train': 0.9739313125610352} 11/07/2021 16:10:51 - INFO - __main__ - Step 134796: {'lr': 1.2908797970088926e-05, 'samples': 25880832, 'steps': 134795, 'loss/train': 1.443356990814209} 11/07/2021 16:10:51 - INFO - __main__ - Step 134797: {'lr': 1.2907114821645749e-05, 'samples': 25881024, 'steps': 134796, 'loss/train': 0.9463968276977539} 11/07/2021 16:10:51 - INFO - __main__ - Step 134798: {'lr': 1.290543178003295e-05, 'samples': 25881216, 'steps': 134797, 'loss/train': 1.1029675006866455} 11/07/2021 16:10:52 - INFO - __main__ - Step 134799: {'lr': 1.2903748845251306e-05, 'samples': 25881408, 'steps': 134798, 'loss/train': 0.8651047945022583} 11/07/2021 16:10:53 - INFO - __main__ - Step 134800: {'lr': 1.2902066017301595e-05, 'samples': 25881600, 'steps': 134799, 'loss/train': 1.3206506967544556} 11/07/2021 16:10:53 - INFO - __main__ - Step 134801: {'lr': 1.2900383296184538e-05, 'samples': 25881792, 'steps': 134800, 'loss/train': 1.2021732330322266} 11/07/2021 16:10:54 - INFO - __main__ - Step 134802: {'lr': 1.2898700681900966e-05, 'samples': 25881984, 'steps': 134801, 'loss/train': 1.5280578136444092} 11/07/2021 16:10:54 - INFO - __main__ - Step 134803: {'lr': 1.2897018174451519e-05, 'samples': 25882176, 'steps': 134802, 'loss/train': 1.6390373706817627} 11/07/2021 16:10:54 - INFO - __main__ - Step 134804: {'lr': 1.289533577383703e-05, 'samples': 25882368, 'steps': 134803, 'loss/train': 0.8469497561454773} 11/07/2021 16:10:55 - INFO - __main__ - Step 134805: {'lr': 1.2893653480058248e-05, 'samples': 25882560, 'steps': 134804, 'loss/train': 1.5158437490463257} 11/07/2021 16:10:56 - INFO - __main__ - Step 134806: {'lr': 1.2891971293115923e-05, 'samples': 25882752, 'steps': 134805, 'loss/train': 1.3810878992080688} 11/07/2021 16:10:56 - INFO - __main__ - Step 134807: {'lr': 1.2890289213010803e-05, 'samples': 25882944, 'steps': 134806, 'loss/train': 1.3449970483779907} 11/07/2021 16:10:56 - INFO - __main__ - Step 134808: {'lr': 1.2888607239743666e-05, 'samples': 25883136, 'steps': 134807, 'loss/train': 1.1474287509918213} 11/07/2021 16:10:57 - INFO - __main__ - Step 134809: {'lr': 1.288692537331529e-05, 'samples': 25883328, 'steps': 134808, 'loss/train': 1.8185296058654785} 11/07/2021 16:10:57 - INFO - __main__ - Step 134810: {'lr': 1.2885243613726366e-05, 'samples': 25883520, 'steps': 134809, 'loss/train': 1.0617432594299316} 11/07/2021 16:10:59 - INFO - __main__ - Step 134811: {'lr': 1.288356196097773e-05, 'samples': 25883712, 'steps': 134810, 'loss/train': 1.124085783958435} 11/07/2021 16:10:59 - INFO - __main__ - Step 134812: {'lr': 1.2881880415070074e-05, 'samples': 25883904, 'steps': 134811, 'loss/train': 0.8630475401878357} 11/07/2021 16:10:59 - INFO - __main__ - Step 134813: {'lr': 1.2880198976004203e-05, 'samples': 25884096, 'steps': 134812, 'loss/train': 1.0625325441360474} 11/07/2021 16:11:00 - INFO - __main__ - Step 134814: {'lr': 1.2878517643780841e-05, 'samples': 25884288, 'steps': 134813, 'loss/train': 1.5996226072311401} 11/07/2021 16:11:00 - INFO - __main__ - Step 134815: {'lr': 1.287683641840076e-05, 'samples': 25884480, 'steps': 134814, 'loss/train': 0.6043087840080261} 11/07/2021 16:11:00 - INFO - __main__ - Step 134816: {'lr': 1.2875155299864771e-05, 'samples': 25884672, 'steps': 134815, 'loss/train': 1.7398905754089355} 11/07/2021 16:11:01 - INFO - __main__ - Step 134817: {'lr': 1.2873474288173536e-05, 'samples': 25884864, 'steps': 134816, 'loss/train': 1.6235522031784058} 11/07/2021 16:11:02 - INFO - __main__ - Step 134818: {'lr': 1.2871793383327835e-05, 'samples': 25885056, 'steps': 134817, 'loss/train': 1.3574274778366089} 11/07/2021 16:11:02 - INFO - __main__ - Step 134819: {'lr': 1.287011258532847e-05, 'samples': 25885248, 'steps': 134818, 'loss/train': 1.2056418657302856} 11/07/2021 16:11:02 - INFO - __main__ - Step 134820: {'lr': 1.2868431894176164e-05, 'samples': 25885440, 'steps': 134819, 'loss/train': 1.9715707302093506} 11/07/2021 16:11:03 - INFO - __main__ - Step 134821: {'lr': 1.2866751309871665e-05, 'samples': 25885632, 'steps': 134820, 'loss/train': 1.406848430633545} 11/07/2021 16:11:03 - INFO - __main__ - Step 134822: {'lr': 1.2865070832415782e-05, 'samples': 25885824, 'steps': 134821, 'loss/train': 1.1033084392547607} 11/07/2021 16:11:04 - INFO - __main__ - Step 134823: {'lr': 1.2863390461809204e-05, 'samples': 25886016, 'steps': 134822, 'loss/train': 1.5581411123275757} 11/07/2021 16:11:05 - INFO - __main__ - Step 134824: {'lr': 1.2861710198052739e-05, 'samples': 25886208, 'steps': 134823, 'loss/train': 1.3526517152786255} 11/07/2021 16:11:05 - INFO - __main__ - Step 134825: {'lr': 1.2860030041147135e-05, 'samples': 25886400, 'steps': 134824, 'loss/train': 1.1261019706726074} 11/07/2021 16:11:05 - INFO - __main__ - Step 134826: {'lr': 1.2858349991093144e-05, 'samples': 25886592, 'steps': 134825, 'loss/train': 1.4715633392333984} 11/07/2021 16:11:06 - INFO - __main__ - Step 134827: {'lr': 1.2856670047891512e-05, 'samples': 25886784, 'steps': 134826, 'loss/train': 1.534654140472412} 11/07/2021 16:11:07 - INFO - __main__ - Step 134828: {'lr': 1.2854990211543044e-05, 'samples': 25886976, 'steps': 134827, 'loss/train': 1.411892056465149} 11/07/2021 16:11:07 - INFO - __main__ - Step 134829: {'lr': 1.2853310482048409e-05, 'samples': 25887168, 'steps': 134828, 'loss/train': 1.1454613208770752} 11/07/2021 16:11:07 - INFO - __main__ - Step 134830: {'lr': 1.2851630859408437e-05, 'samples': 25887360, 'steps': 134829, 'loss/train': 1.2113189697265625} 11/07/2021 16:11:08 - INFO - __main__ - Step 134831: {'lr': 1.284995134362385e-05, 'samples': 25887552, 'steps': 134830, 'loss/train': 1.1481940746307373} 11/07/2021 16:11:08 - INFO - __main__ - Step 134832: {'lr': 1.2848271934695398e-05, 'samples': 25887744, 'steps': 134831, 'loss/train': 1.4227086305618286} 11/07/2021 16:11:09 - INFO - __main__ - Step 134833: {'lr': 1.2846592632623888e-05, 'samples': 25887936, 'steps': 134832, 'loss/train': 1.3994667530059814} 11/07/2021 16:11:09 - INFO - __main__ - Step 134834: {'lr': 1.2844913437410011e-05, 'samples': 25888128, 'steps': 134833, 'loss/train': 1.1866354942321777} 11/07/2021 16:11:10 - INFO - __main__ - Step 134835: {'lr': 1.28432343490546e-05, 'samples': 25888320, 'steps': 134834, 'loss/train': 1.3979986906051636} 11/07/2021 16:11:10 - INFO - __main__ - Step 134836: {'lr': 1.2841555367558322e-05, 'samples': 25888512, 'steps': 134835, 'loss/train': 1.293241024017334} 11/07/2021 16:11:11 - INFO - __main__ - Step 134837: {'lr': 1.283987649292201e-05, 'samples': 25888704, 'steps': 134836, 'loss/train': 1.4411990642547607} 11/07/2021 16:11:12 - INFO - __main__ - Step 134838: {'lr': 1.2838197725146384e-05, 'samples': 25888896, 'steps': 134837, 'loss/train': 0.17353922128677368} 11/07/2021 16:11:12 - INFO - __main__ - Step 134839: {'lr': 1.2836519064232221e-05, 'samples': 25889088, 'steps': 134838, 'loss/train': 1.31904137134552} 11/07/2021 16:11:12 - INFO - __main__ - Step 134840: {'lr': 1.2834840510180245e-05, 'samples': 25889280, 'steps': 134839, 'loss/train': 1.3344497680664062} 11/07/2021 16:11:13 - INFO - __main__ - Step 134841: {'lr': 1.2833162062991232e-05, 'samples': 25889472, 'steps': 134840, 'loss/train': 1.4117369651794434} 11/07/2021 16:11:13 - INFO - __main__ - Step 134842: {'lr': 1.2831483722665931e-05, 'samples': 25889664, 'steps': 134841, 'loss/train': 1.3523935079574585} 11/07/2021 16:11:13 - INFO - __main__ - Step 134843: {'lr': 1.2829805489205092e-05, 'samples': 25889856, 'steps': 134842, 'loss/train': 1.2212395668029785} 11/07/2021 16:11:14 - INFO - __main__ - Step 134844: {'lr': 1.2828127362609522e-05, 'samples': 25890048, 'steps': 134843, 'loss/train': 1.6736384630203247} 11/07/2021 16:11:15 - INFO - __main__ - Step 134845: {'lr': 1.2826449342879908e-05, 'samples': 25890240, 'steps': 134844, 'loss/train': 1.581305742263794} 11/07/2021 16:11:15 - INFO - __main__ - Step 134846: {'lr': 1.2824771430017035e-05, 'samples': 25890432, 'steps': 134845, 'loss/train': 1.4932397603988647} 11/07/2021 16:11:15 - INFO - __main__ - Step 134847: {'lr': 1.2823093624021676e-05, 'samples': 25890624, 'steps': 134846, 'loss/train': 1.0488781929016113} 11/07/2021 16:11:16 - INFO - __main__ - Step 134848: {'lr': 1.2821415924894554e-05, 'samples': 25890816, 'steps': 134847, 'loss/train': 1.2015304565429688} 11/07/2021 16:11:17 - INFO - __main__ - Step 134849: {'lr': 1.2819738332636444e-05, 'samples': 25891008, 'steps': 134848, 'loss/train': 1.3351500034332275} 11/07/2021 16:11:17 - INFO - __main__ - Step 134850: {'lr': 1.2818060847248125e-05, 'samples': 25891200, 'steps': 134849, 'loss/train': 1.0528594255447388} 11/07/2021 16:11:17 - INFO - __main__ - Step 134851: {'lr': 1.281638346873032e-05, 'samples': 25891392, 'steps': 134850, 'loss/train': 1.2488963603973389} 11/07/2021 16:11:18 - INFO - __main__ - Step 134852: {'lr': 1.2814706197083775e-05, 'samples': 25891584, 'steps': 134851, 'loss/train': 1.2537533044815063} 11/07/2021 16:11:18 - INFO - __main__ - Step 134853: {'lr': 1.281302903230927e-05, 'samples': 25891776, 'steps': 134852, 'loss/train': 1.1792442798614502} 11/07/2021 16:11:19 - INFO - __main__ - Step 134854: {'lr': 1.2811351974407553e-05, 'samples': 25891968, 'steps': 134853, 'loss/train': 0.14917217195034027} 11/07/2021 16:11:20 - INFO - __main__ - Step 134855: {'lr': 1.2809675023379375e-05, 'samples': 25892160, 'steps': 134854, 'loss/train': 1.3749210834503174} 11/07/2021 16:11:20 - INFO - __main__ - Step 134856: {'lr': 1.280799817922551e-05, 'samples': 25892352, 'steps': 134855, 'loss/train': 1.463535189628601} 11/07/2021 16:11:20 - INFO - __main__ - Step 134857: {'lr': 1.2806321441946683e-05, 'samples': 25892544, 'steps': 134856, 'loss/train': 1.0907377004623413} 11/07/2021 16:11:21 - INFO - __main__ - Step 134858: {'lr': 1.2804644811543698e-05, 'samples': 25892736, 'steps': 134857, 'loss/train': 1.7158759832382202} 11/07/2021 16:11:22 - INFO - __main__ - Step 134859: {'lr': 1.2802968288017247e-05, 'samples': 25892928, 'steps': 134858, 'loss/train': 1.435095191001892} 11/07/2021 16:11:22 - INFO - __main__ - Step 134860: {'lr': 1.2801291871368137e-05, 'samples': 25893120, 'steps': 134859, 'loss/train': 1.5923510789871216} 11/07/2021 16:11:22 - INFO - __main__ - Step 134861: {'lr': 1.2799615561597145e-05, 'samples': 25893312, 'steps': 134860, 'loss/train': 0.9983268976211548} 11/07/2021 16:11:23 - INFO - __main__ - Step 134862: {'lr': 1.2797939358704936e-05, 'samples': 25893504, 'steps': 134861, 'loss/train': 1.381702184677124} 11/07/2021 16:11:23 - INFO - __main__ - Step 134863: {'lr': 1.2796263262692315e-05, 'samples': 25893696, 'steps': 134862, 'loss/train': 1.4791808128356934} 11/07/2021 16:11:24 - INFO - __main__ - Step 134864: {'lr': 1.2794587273560033e-05, 'samples': 25893888, 'steps': 134863, 'loss/train': 1.2036523818969727} 11/07/2021 16:11:25 - INFO - __main__ - Step 134865: {'lr': 1.2792911391308865e-05, 'samples': 25894080, 'steps': 134864, 'loss/train': 1.4967896938323975} 11/07/2021 16:11:25 - INFO - __main__ - Step 134866: {'lr': 1.2791235615939535e-05, 'samples': 25894272, 'steps': 134865, 'loss/train': 1.0937964916229248} 11/07/2021 16:11:25 - INFO - __main__ - Step 134867: {'lr': 1.2789559947452845e-05, 'samples': 25894464, 'steps': 134866, 'loss/train': 1.6236459016799927} 11/07/2021 16:11:26 - INFO - __main__ - Step 134868: {'lr': 1.278788438584949e-05, 'samples': 25894656, 'steps': 134867, 'loss/train': 1.2501405477523804} 11/07/2021 16:11:27 - INFO - __main__ - Step 134869: {'lr': 1.2786208931130249e-05, 'samples': 25894848, 'steps': 134868, 'loss/train': 0.19150589406490326} 11/07/2021 16:11:27 - INFO - __main__ - Step 134870: {'lr': 1.2784533583295898e-05, 'samples': 25895040, 'steps': 134869, 'loss/train': 0.6460882425308228} 11/07/2021 16:11:27 - INFO - __main__ - Step 134871: {'lr': 1.2782858342347186e-05, 'samples': 25895232, 'steps': 134870, 'loss/train': 1.6131576299667358} 11/07/2021 16:11:28 - INFO - __main__ - Step 134872: {'lr': 1.2781183208284864e-05, 'samples': 25895424, 'steps': 134871, 'loss/train': 1.5586036443710327} 11/07/2021 16:11:28 - INFO - __main__ - Step 134873: {'lr': 1.2779508181109651e-05, 'samples': 25895616, 'steps': 134872, 'loss/train': 0.8492622971534729} 11/07/2021 16:11:29 - INFO - __main__ - Step 134874: {'lr': 1.2777833260822353e-05, 'samples': 25895808, 'steps': 134873, 'loss/train': 1.496443510055542} 11/07/2021 16:11:30 - INFO - __main__ - Step 134875: {'lr': 1.2776158447423692e-05, 'samples': 25896000, 'steps': 134874, 'loss/train': 1.2250454425811768} 11/07/2021 16:11:30 - INFO - __main__ - Step 134876: {'lr': 1.2774483740914416e-05, 'samples': 25896192, 'steps': 134875, 'loss/train': 0.5619969367980957} 11/07/2021 16:11:30 - INFO - __main__ - Step 134877: {'lr': 1.2772809141295333e-05, 'samples': 25896384, 'steps': 134876, 'loss/train': 1.3525415658950806} 11/07/2021 16:11:31 - INFO - __main__ - Step 134878: {'lr': 1.2771134648567134e-05, 'samples': 25896576, 'steps': 134877, 'loss/train': 1.1889479160308838} 11/07/2021 16:11:32 - INFO - __main__ - Step 134879: {'lr': 1.2769460262730598e-05, 'samples': 25896768, 'steps': 134878, 'loss/train': 1.374812126159668} 11/07/2021 16:11:32 - INFO - __main__ - Step 134880: {'lr': 1.27677859837865e-05, 'samples': 25896960, 'steps': 134879, 'loss/train': 1.497150182723999} 11/07/2021 16:11:32 - INFO - __main__ - Step 134881: {'lr': 1.2766111811735564e-05, 'samples': 25897152, 'steps': 134880, 'loss/train': 1.3814265727996826} 11/07/2021 16:11:33 - INFO - __main__ - Step 134882: {'lr': 1.2764437746578567e-05, 'samples': 25897344, 'steps': 134881, 'loss/train': 1.5581690073013306} 11/07/2021 16:11:33 - INFO - __main__ - Step 134883: {'lr': 1.2762763788316284e-05, 'samples': 25897536, 'steps': 134882, 'loss/train': 1.2090656757354736} 11/07/2021 16:11:34 - INFO - __main__ - Step 134884: {'lr': 1.276108993694941e-05, 'samples': 25897728, 'steps': 134883, 'loss/train': 1.5109453201293945} 11/07/2021 16:11:35 - INFO - __main__ - Step 134885: {'lr': 1.2759416192478724e-05, 'samples': 25897920, 'steps': 134884, 'loss/train': 1.3855353593826294} 11/07/2021 16:11:35 - INFO - __main__ - Step 134886: {'lr': 1.2757742554904972e-05, 'samples': 25898112, 'steps': 134885, 'loss/train': 1.3489707708358765} 11/07/2021 16:11:35 - INFO - __main__ - Step 134887: {'lr': 1.2756069024228934e-05, 'samples': 25898304, 'steps': 134886, 'loss/train': 1.5418834686279297} 11/07/2021 16:11:36 - INFO - __main__ - Step 134888: {'lr': 1.275439560045133e-05, 'samples': 25898496, 'steps': 134887, 'loss/train': 1.2755507230758667} 11/07/2021 16:11:36 - INFO - __main__ - Step 134889: {'lr': 1.2752722283572966e-05, 'samples': 25898688, 'steps': 134888, 'loss/train': 1.455265998840332} 11/07/2021 16:11:37 - INFO - __main__ - Step 134890: {'lr': 1.2751049073594533e-05, 'samples': 25898880, 'steps': 134889, 'loss/train': 1.1276488304138184} 11/07/2021 16:11:37 - INFO - __main__ - Step 134891: {'lr': 1.2749375970516841e-05, 'samples': 25899072, 'steps': 134890, 'loss/train': 1.0421886444091797} 11/07/2021 16:11:38 - INFO - __main__ - Step 134892: {'lr': 1.2747702974340609e-05, 'samples': 25899264, 'steps': 134891, 'loss/train': 1.5399373769760132} 11/07/2021 16:11:38 - INFO - __main__ - Step 134893: {'lr': 1.2746030085066613e-05, 'samples': 25899456, 'steps': 134892, 'loss/train': 0.9123199582099915} 11/07/2021 16:11:39 - INFO - __main__ - Step 134894: {'lr': 1.2744357302695576e-05, 'samples': 25899648, 'steps': 134893, 'loss/train': 1.6140023469924927} 11/07/2021 16:11:39 - INFO - __main__ - Step 134895: {'lr': 1.2742684627228274e-05, 'samples': 25899840, 'steps': 134894, 'loss/train': 1.4452741146087646} 11/07/2021 16:11:40 - INFO - __main__ - Step 134896: {'lr': 1.2741012058665514e-05, 'samples': 25900032, 'steps': 134895, 'loss/train': 1.0353888273239136} 11/07/2021 16:11:40 - INFO - __main__ - Step 134897: {'lr': 1.2739339597007932e-05, 'samples': 25900224, 'steps': 134896, 'loss/train': 1.476210594177246} 11/07/2021 16:11:41 - INFO - __main__ - Step 134898: {'lr': 1.2737667242256363e-05, 'samples': 25900416, 'steps': 134897, 'loss/train': 1.1492382287979126} 11/07/2021 16:11:41 - INFO - __main__ - Step 134899: {'lr': 1.2735994994411527e-05, 'samples': 25900608, 'steps': 134898, 'loss/train': 1.6204783916473389} 11/07/2021 16:11:41 - INFO - __main__ - Step 134900: {'lr': 1.2734322853474173e-05, 'samples': 25900800, 'steps': 134899, 'loss/train': 1.2749701738357544} 11/07/2021 16:11:43 - INFO - __main__ - Step 134901: {'lr': 1.2732650819445107e-05, 'samples': 25900992, 'steps': 134900, 'loss/train': 1.401441216468811} 11/07/2021 16:11:43 - INFO - __main__ - Step 134902: {'lr': 1.2730978892325024e-05, 'samples': 25901184, 'steps': 134901, 'loss/train': 1.070945382118225} 11/07/2021 16:11:43 - INFO - __main__ - Step 134903: {'lr': 1.2729307072114699e-05, 'samples': 25901376, 'steps': 134902, 'loss/train': 4.243330478668213} 11/07/2021 16:11:44 - INFO - __main__ - Step 134904: {'lr': 1.2727635358814909e-05, 'samples': 25901568, 'steps': 134903, 'loss/train': 1.260129451751709} 11/07/2021 16:11:44 - INFO - __main__ - Step 134905: {'lr': 1.2725963752426379e-05, 'samples': 25901760, 'steps': 134904, 'loss/train': 1.316531777381897} 11/07/2021 16:11:44 - INFO - __main__ - Step 134906: {'lr': 1.2724292252949853e-05, 'samples': 25901952, 'steps': 134905, 'loss/train': 0.8173484206199646} 11/07/2021 16:11:46 - INFO - __main__ - Step 134907: {'lr': 1.2722620860386114e-05, 'samples': 25902144, 'steps': 134906, 'loss/train': 1.3408111333847046} 11/07/2021 16:11:47 - INFO - __main__ - Step 134908: {'lr': 1.272094957473588e-05, 'samples': 25902336, 'steps': 134907, 'loss/train': 1.3579071760177612} 11/07/2021 16:11:47 - INFO - __main__ - Step 134909: {'lr': 1.2719278395999956e-05, 'samples': 25902528, 'steps': 134908, 'loss/train': 1.3691651821136475} 11/07/2021 16:11:47 - INFO - __main__ - Step 134910: {'lr': 1.2717607324179064e-05, 'samples': 25902720, 'steps': 134909, 'loss/train': 0.617475688457489} 11/07/2021 16:11:48 - INFO - __main__ - Step 134911: {'lr': 1.2715936359273956e-05, 'samples': 25902912, 'steps': 134910, 'loss/train': 1.3593031167984009} 11/07/2021 16:11:48 - INFO - __main__ - Step 134912: {'lr': 1.2714265501285349e-05, 'samples': 25903104, 'steps': 134911, 'loss/train': 1.7324185371398926} 11/07/2021 16:11:49 - INFO - __main__ - Step 134913: {'lr': 1.2712594750214052e-05, 'samples': 25903296, 'steps': 134912, 'loss/train': 1.727643609046936} 11/07/2021 16:11:50 - INFO - __main__ - Step 134914: {'lr': 1.2710924106060812e-05, 'samples': 25903488, 'steps': 134913, 'loss/train': 1.721448540687561} 11/07/2021 16:11:50 - INFO - __main__ - Step 134915: {'lr': 1.2709253568826352e-05, 'samples': 25903680, 'steps': 134914, 'loss/train': 1.0215809345245361} 11/07/2021 16:11:50 - INFO - __main__ - Step 134916: {'lr': 1.2707583138511448e-05, 'samples': 25903872, 'steps': 134915, 'loss/train': 0.8229004144668579} 11/07/2021 16:11:51 - INFO - __main__ - Step 134917: {'lr': 1.2705912815116821e-05, 'samples': 25904064, 'steps': 134916, 'loss/train': 0.5812721252441406} 11/07/2021 16:11:51 - INFO - __main__ - Step 134918: {'lr': 1.270424259864328e-05, 'samples': 25904256, 'steps': 134917, 'loss/train': 1.3929243087768555} 11/07/2021 16:11:52 - INFO - __main__ - Step 134919: {'lr': 1.2702572489091541e-05, 'samples': 25904448, 'steps': 134918, 'loss/train': 1.3316141366958618} 11/07/2021 16:11:52 - INFO - __main__ - Step 134920: {'lr': 1.2700902486462356e-05, 'samples': 25904640, 'steps': 134919, 'loss/train': 1.3721493482589722} 11/07/2021 16:11:53 - INFO - __main__ - Step 134921: {'lr': 1.2699232590756476e-05, 'samples': 25904832, 'steps': 134920, 'loss/train': 1.1483830213546753} 11/07/2021 16:11:53 - INFO - __main__ - Step 134922: {'lr': 1.269756280197465e-05, 'samples': 25905024, 'steps': 134921, 'loss/train': 1.5868685245513916} 11/07/2021 16:11:53 - INFO - __main__ - Step 134923: {'lr': 1.2695893120117707e-05, 'samples': 25905216, 'steps': 134922, 'loss/train': 1.4498571157455444} 11/07/2021 16:11:54 - INFO - __main__ - Step 134924: {'lr': 1.2694223545186262e-05, 'samples': 25905408, 'steps': 134923, 'loss/train': 1.6824661493301392} 11/07/2021 16:11:55 - INFO - __main__ - Step 134925: {'lr': 1.2692554077181173e-05, 'samples': 25905600, 'steps': 134924, 'loss/train': 1.2405872344970703} 11/07/2021 16:11:55 - INFO - __main__ - Step 134926: {'lr': 1.2690884716103135e-05, 'samples': 25905792, 'steps': 134925, 'loss/train': 1.4223320484161377} 11/07/2021 16:11:55 - INFO - __main__ - Step 134927: {'lr': 1.2689215461952924e-05, 'samples': 25905984, 'steps': 134926, 'loss/train': 1.4625873565673828} 11/07/2021 16:11:56 - INFO - __main__ - Step 134928: {'lr': 1.2687546314731291e-05, 'samples': 25906176, 'steps': 134927, 'loss/train': 1.4284873008728027} 11/07/2021 16:11:56 - INFO - __main__ - Step 134929: {'lr': 1.2685877274438984e-05, 'samples': 25906368, 'steps': 134928, 'loss/train': 1.6630303859710693} 11/07/2021 16:11:57 - INFO - __main__ - Step 134930: {'lr': 1.2684208341076781e-05, 'samples': 25906560, 'steps': 134929, 'loss/train': 0.9985319972038269} 11/07/2021 16:11:58 - INFO - __main__ - Step 134931: {'lr': 1.2682539514645402e-05, 'samples': 25906752, 'steps': 134930, 'loss/train': 1.541703701019287} 11/07/2021 16:11:58 - INFO - __main__ - Step 134932: {'lr': 1.26808707951456e-05, 'samples': 25906944, 'steps': 134931, 'loss/train': 1.3005475997924805} 11/07/2021 16:11:58 - INFO - __main__ - Step 134933: {'lr': 1.2679202182578148e-05, 'samples': 25907136, 'steps': 134932, 'loss/train': 1.2675689458847046} 11/07/2021 16:11:59 - INFO - __main__ - Step 134934: {'lr': 1.267753367694377e-05, 'samples': 25907328, 'steps': 134933, 'loss/train': 1.2030112743377686} 11/07/2021 16:12:00 - INFO - __main__ - Step 134935: {'lr': 1.2675865278243243e-05, 'samples': 25907520, 'steps': 134934, 'loss/train': 0.3266187012195587} 11/07/2021 16:12:00 - INFO - __main__ - Step 134936: {'lr': 1.2674196986477288e-05, 'samples': 25907712, 'steps': 134935, 'loss/train': 1.4516592025756836} 11/07/2021 16:12:00 - INFO - __main__ - Step 134937: {'lr': 1.2672528801646738e-05, 'samples': 25907904, 'steps': 134936, 'loss/train': 1.710021734237671} 11/07/2021 16:12:01 - INFO - __main__ - Step 134938: {'lr': 1.2670860723752231e-05, 'samples': 25908096, 'steps': 134937, 'loss/train': 1.0390781164169312} 11/07/2021 16:12:01 - INFO - __main__ - Step 134939: {'lr': 1.26691927527946e-05, 'samples': 25908288, 'steps': 134938, 'loss/train': 0.8472369313240051} 11/07/2021 16:12:02 - INFO - __main__ - Step 134940: {'lr': 1.266752488877454e-05, 'samples': 25908480, 'steps': 134939, 'loss/train': 1.6788691282272339} 11/07/2021 16:12:03 - INFO - __main__ - Step 134941: {'lr': 1.2665857131692853e-05, 'samples': 25908672, 'steps': 134940, 'loss/train': 1.4433711767196655} 11/07/2021 16:12:03 - INFO - __main__ - Step 134942: {'lr': 1.2664189481550236e-05, 'samples': 25908864, 'steps': 134941, 'loss/train': 0.649669349193573} 11/07/2021 16:12:03 - INFO - __main__ - Step 134943: {'lr': 1.266252193834752e-05, 'samples': 25909056, 'steps': 134942, 'loss/train': 1.4184370040893555} 11/07/2021 16:12:04 - INFO - __main__ - Step 134944: {'lr': 1.2660854502085373e-05, 'samples': 25909248, 'steps': 134943, 'loss/train': 1.2269195318222046} 11/07/2021 16:12:05 - INFO - __main__ - Step 134945: {'lr': 1.2659187172764597e-05, 'samples': 25909440, 'steps': 134944, 'loss/train': 1.4780107736587524} 11/07/2021 16:12:05 - INFO - __main__ - Step 134946: {'lr': 1.2657519950385914e-05, 'samples': 25909632, 'steps': 134945, 'loss/train': 1.308503270149231} 11/07/2021 16:12:05 - INFO - __main__ - Step 134947: {'lr': 1.2655852834950104e-05, 'samples': 25909824, 'steps': 134946, 'loss/train': 0.9033694863319397} 11/07/2021 16:12:06 - INFO - __main__ - Step 134948: {'lr': 1.2654185826457914e-05, 'samples': 25910016, 'steps': 134947, 'loss/train': 1.4584356546401978} 11/07/2021 16:12:06 - INFO - __main__ - Step 134949: {'lr': 1.2652518924910067e-05, 'samples': 25910208, 'steps': 134948, 'loss/train': 1.6042824983596802} 11/07/2021 16:12:07 - INFO - __main__ - Step 134950: {'lr': 1.2650852130307367e-05, 'samples': 25910400, 'steps': 134949, 'loss/train': 1.464131236076355} 11/07/2021 16:12:07 - INFO - __main__ - Step 134951: {'lr': 1.2649185442650507e-05, 'samples': 25910592, 'steps': 134950, 'loss/train': 0.9376916885375977} 11/07/2021 16:12:08 - INFO - __main__ - Step 134952: {'lr': 1.2647518861940266e-05, 'samples': 25910784, 'steps': 134951, 'loss/train': 0.6143465638160706} 11/07/2021 16:12:08 - INFO - __main__ - Step 134953: {'lr': 1.2645852388177365e-05, 'samples': 25910976, 'steps': 134952, 'loss/train': 1.4627506732940674} 11/07/2021 16:12:08 - INFO - __main__ - Step 134954: {'lr': 1.2644186021362608e-05, 'samples': 25911168, 'steps': 134953, 'loss/train': 1.0933310985565186} 11/07/2021 16:12:09 - INFO - __main__ - Step 134955: {'lr': 1.264251976149669e-05, 'samples': 25911360, 'steps': 134954, 'loss/train': 1.4543802738189697} 11/07/2021 16:12:10 - INFO - __main__ - Step 134956: {'lr': 1.2640853608580416e-05, 'samples': 25911552, 'steps': 134955, 'loss/train': 1.422697901725769} 11/07/2021 16:12:10 - INFO - __main__ - Step 134957: {'lr': 1.2639187562614507e-05, 'samples': 25911744, 'steps': 134956, 'loss/train': 1.3523969650268555} 11/07/2021 16:12:11 - INFO - __main__ - Step 134958: {'lr': 1.2637521623599713e-05, 'samples': 25911936, 'steps': 134957, 'loss/train': 0.9405666589736938} 11/07/2021 16:12:11 - INFO - __main__ - Step 134959: {'lr': 1.2635855791536782e-05, 'samples': 25912128, 'steps': 134958, 'loss/train': 1.7095574140548706} 11/07/2021 16:12:11 - INFO - __main__ - Step 134960: {'lr': 1.2634190066426466e-05, 'samples': 25912320, 'steps': 134959, 'loss/train': 0.6135030388832092} 11/07/2021 16:12:12 - INFO - __main__ - Step 134961: {'lr': 1.263252444826954e-05, 'samples': 25912512, 'steps': 134960, 'loss/train': 1.1531929969787598} 11/07/2021 16:12:13 - INFO - __main__ - Step 134962: {'lr': 1.2630858937066725e-05, 'samples': 25912704, 'steps': 134961, 'loss/train': 1.539517879486084} 11/07/2021 16:12:13 - INFO - __main__ - Step 134963: {'lr': 1.26291935328188e-05, 'samples': 25912896, 'steps': 134962, 'loss/train': 1.5433464050292969} 11/07/2021 16:12:13 - INFO - __main__ - Step 134964: {'lr': 1.2627528235526514e-05, 'samples': 25913088, 'steps': 134963, 'loss/train': 1.0222822427749634} 11/07/2021 16:12:14 - INFO - __main__ - Step 134965: {'lr': 1.262586304519056e-05, 'samples': 25913280, 'steps': 134964, 'loss/train': 0.700542151927948} 11/07/2021 16:12:15 - INFO - __main__ - Step 134966: {'lr': 1.2624197961811745e-05, 'samples': 25913472, 'steps': 134965, 'loss/train': 0.9643478989601135} 11/07/2021 16:12:15 - INFO - __main__ - Step 134967: {'lr': 1.262253298539079e-05, 'samples': 25913664, 'steps': 134966, 'loss/train': 1.6409857273101807} 11/07/2021 16:12:16 - INFO - __main__ - Step 134968: {'lr': 1.2620868115928469e-05, 'samples': 25913856, 'steps': 134967, 'loss/train': 1.9833309650421143} 11/07/2021 16:12:16 - INFO - __main__ - Step 134969: {'lr': 1.2619203353425506e-05, 'samples': 25914048, 'steps': 134968, 'loss/train': 1.5122026205062866} 11/07/2021 16:12:16 - INFO - __main__ - Step 134970: {'lr': 1.261753869788268e-05, 'samples': 25914240, 'steps': 134969, 'loss/train': 0.7253329753875732} 11/07/2021 16:12:17 - INFO - __main__ - Step 134971: {'lr': 1.2615874149300737e-05, 'samples': 25914432, 'steps': 134970, 'loss/train': 1.2457234859466553} 11/07/2021 16:12:18 - INFO - __main__ - Step 134972: {'lr': 1.2614209707680401e-05, 'samples': 25914624, 'steps': 134971, 'loss/train': 1.0035768747329712} 11/07/2021 16:12:18 - INFO - __main__ - Step 134973: {'lr': 1.2612545373022449e-05, 'samples': 25914816, 'steps': 134972, 'loss/train': 1.3931148052215576} 11/07/2021 16:12:18 - INFO - __main__ - Step 134974: {'lr': 1.261088114532763e-05, 'samples': 25915008, 'steps': 134973, 'loss/train': 0.8405476808547974} 11/07/2021 16:12:19 - INFO - __main__ - Step 134975: {'lr': 1.2609217024596664e-05, 'samples': 25915200, 'steps': 134974, 'loss/train': 0.8239151835441589} 11/07/2021 16:12:20 - INFO - __main__ - Step 134976: {'lr': 1.260755301083033e-05, 'samples': 25915392, 'steps': 134975, 'loss/train': 1.3474042415618896} 11/07/2021 16:12:20 - INFO - __main__ - Step 134977: {'lr': 1.2605889104029406e-05, 'samples': 25915584, 'steps': 134976, 'loss/train': 1.2518229484558105} 11/07/2021 16:12:21 - INFO - __main__ - Step 134978: {'lr': 1.2604225304194584e-05, 'samples': 25915776, 'steps': 134977, 'loss/train': 0.969771683216095} 11/07/2021 16:12:21 - INFO - __main__ - Step 134979: {'lr': 1.2602561611326613e-05, 'samples': 25915968, 'steps': 134978, 'loss/train': 1.3560168743133545} 11/07/2021 16:12:21 - INFO - __main__ - Step 134980: {'lr': 1.2600898025426272e-05, 'samples': 25916160, 'steps': 134979, 'loss/train': 1.101119041442871} 11/07/2021 16:12:22 - INFO - __main__ - Step 134981: {'lr': 1.259923454649431e-05, 'samples': 25916352, 'steps': 134980, 'loss/train': 0.7582244873046875} 11/07/2021 16:12:23 - INFO - __main__ - Step 134982: {'lr': 1.2597571174531447e-05, 'samples': 25916544, 'steps': 134981, 'loss/train': 0.9626346826553345} 11/07/2021 16:12:23 - INFO - __main__ - Step 134983: {'lr': 1.259590790953849e-05, 'samples': 25916736, 'steps': 134982, 'loss/train': 1.3004575967788696} 11/07/2021 16:12:23 - INFO - __main__ - Step 134984: {'lr': 1.2594244751516133e-05, 'samples': 25916928, 'steps': 134983, 'loss/train': 1.2309041023254395} 11/07/2021 16:12:24 - INFO - __main__ - Step 134985: {'lr': 1.2592581700465122e-05, 'samples': 25917120, 'steps': 134984, 'loss/train': 1.103615164756775} 11/07/2021 16:12:24 - INFO - __main__ - Step 134986: {'lr': 1.2590918756386266e-05, 'samples': 25917312, 'steps': 134985, 'loss/train': 1.2810789346694946} 11/07/2021 16:12:25 - INFO - __main__ - Step 134987: {'lr': 1.2589255919280257e-05, 'samples': 25917504, 'steps': 134986, 'loss/train': 1.9869143962860107} 11/07/2021 16:12:25 - INFO - __main__ - Step 134988: {'lr': 1.25875931891479e-05, 'samples': 25917696, 'steps': 134987, 'loss/train': 0.823602020740509} 11/07/2021 16:12:26 - INFO - __main__ - Step 134989: {'lr': 1.258593056598989e-05, 'samples': 25917888, 'steps': 134988, 'loss/train': 1.6352664232254028} 11/07/2021 16:12:26 - INFO - __main__ - Step 134990: {'lr': 1.2584268049807001e-05, 'samples': 25918080, 'steps': 134989, 'loss/train': 2.128723382949829} 11/07/2021 16:12:26 - INFO - __main__ - Step 134991: {'lr': 1.2582605640599987e-05, 'samples': 25918272, 'steps': 134990, 'loss/train': 1.2533929347991943} 11/07/2021 16:12:28 - INFO - __main__ - Step 134992: {'lr': 1.2580943338369565e-05, 'samples': 25918464, 'steps': 134991, 'loss/train': 1.1021307706832886} 11/07/2021 16:12:28 - INFO - __main__ - Step 134993: {'lr': 1.2579281143116516e-05, 'samples': 25918656, 'steps': 134992, 'loss/train': 1.6473604440689087} 11/07/2021 16:12:28 - INFO - __main__ - Step 134994: {'lr': 1.257761905484156e-05, 'samples': 25918848, 'steps': 134993, 'loss/train': 1.1419823169708252} 11/07/2021 16:12:29 - INFO - __main__ - Step 134995: {'lr': 1.2575957073545503e-05, 'samples': 25919040, 'steps': 134994, 'loss/train': 1.513907551765442} 11/07/2021 16:12:29 - INFO - __main__ - Step 134996: {'lr': 1.2574295199229007e-05, 'samples': 25919232, 'steps': 134995, 'loss/train': 1.7313560247421265} 11/07/2021 16:12:30 - INFO - __main__ - Step 134997: {'lr': 1.257263343189291e-05, 'samples': 25919424, 'steps': 134996, 'loss/train': 1.2643868923187256} 11/07/2021 16:12:30 - INFO - __main__ - Step 134998: {'lr': 1.2570971771537903e-05, 'samples': 25919616, 'steps': 134997, 'loss/train': 0.042405933141708374} 11/07/2021 16:12:31 - INFO - __main__ - Step 134999: {'lr': 1.2569310218164765e-05, 'samples': 25919808, 'steps': 134998, 'loss/train': 1.1821857690811157} 11/07/2021 16:12:31 - INFO - __main__ - Step 135000: {'lr': 1.2567648771774215e-05, 'samples': 25920000, 'steps': 134999, 'loss/train': 1.248957633972168} 11/07/2021 16:12:31 - INFO - __main__ - Evaluating and saving model checkpoint 11/07/2021 16:15:48 - INFO - __main__ - Step 135000: {'loss/eval': 1.2335491180419922, 'perplexity': 3.4333934783935547} 11/07/2021 16:16:05 - WARNING - huggingface_hub.repository - Several commits (9) will be pushed upstream. 11/07/2021 16:16:05 - WARNING - huggingface_hub.repository - The progress bars may be unreliable. 11/07/2021 16:16:26 - WARNING - huggingface_hub.repository - To https://huggingface.co/lvwerra/codeparrot-small b58b427..5724e17 proud-haze-135 -> proud-haze-135 11/07/2021 16:16:28 - INFO - __main__ - Step 135001: {'lr': 1.2565987432367032e-05, 'samples': 25920192, 'steps': 135000, 'loss/train': 1.6798568964004517} 11/07/2021 16:16:29 - INFO - __main__ - Step 135002: {'lr': 1.2564326199943937e-05, 'samples': 25920384, 'steps': 135001, 'loss/train': 1.2824721336364746} 11/07/2021 16:16:29 - INFO - __main__ - Step 135003: {'lr': 1.2562665074505708e-05, 'samples': 25920576, 'steps': 135002, 'loss/train': 0.9927428960800171} 11/07/2021 16:16:30 - INFO - __main__ - Step 135004: {'lr': 1.2561004056053093e-05, 'samples': 25920768, 'steps': 135003, 'loss/train': 0.7610594630241394} 11/07/2021 16:16:30 - INFO - __main__ - Step 135005: {'lr': 1.2559343144586816e-05, 'samples': 25920960, 'steps': 135004, 'loss/train': 1.5371102094650269} 11/07/2021 16:16:30 - INFO - __main__ - Step 135006: {'lr': 1.2557682340107624e-05, 'samples': 25921152, 'steps': 135005, 'loss/train': 1.205280065536499} 11/07/2021 16:16:32 - INFO - __main__ - Step 135007: {'lr': 1.2556021642616267e-05, 'samples': 25921344, 'steps': 135006, 'loss/train': 1.3128596544265747} 11/07/2021 16:16:32 - INFO - __main__ - Step 135008: {'lr': 1.2554361052113522e-05, 'samples': 25921536, 'steps': 135007, 'loss/train': 1.6063355207443237} 11/07/2021 16:16:32 - INFO - __main__ - Step 135009: {'lr': 1.2552700568600084e-05, 'samples': 25921728, 'steps': 135008, 'loss/train': 1.645625352859497} 11/07/2021 16:16:33 - INFO - __main__ - Step 135010: {'lr': 1.2551040192076784e-05, 'samples': 25921920, 'steps': 135009, 'loss/train': 1.3162504434585571} 11/07/2021 16:16:33 - INFO - __main__ - Step 135011: {'lr': 1.2549379922544291e-05, 'samples': 25922112, 'steps': 135010, 'loss/train': 1.0465974807739258} 11/07/2021 16:16:33 - INFO - __main__ - Step 135012: {'lr': 1.2547719760003379e-05, 'samples': 25922304, 'steps': 135011, 'loss/train': 1.2932697534561157} 11/07/2021 16:16:34 - INFO - __main__ - Step 135013: {'lr': 1.2546059704454798e-05, 'samples': 25922496, 'steps': 135012, 'loss/train': 1.3780603408813477} 11/07/2021 16:16:35 - INFO - __main__ - Step 135014: {'lr': 1.2544399755899328e-05, 'samples': 25922688, 'steps': 135013, 'loss/train': 1.4957404136657715} 11/07/2021 16:16:35 - INFO - __main__ - Step 135015: {'lr': 1.2542739914337658e-05, 'samples': 25922880, 'steps': 135014, 'loss/train': 1.3764392137527466} 11/07/2021 16:16:35 - INFO - __main__ - Step 135016: {'lr': 1.2541080179770569e-05, 'samples': 25923072, 'steps': 135015, 'loss/train': 1.3807737827301025} 11/07/2021 16:16:36 - INFO - __main__ - Step 135017: {'lr': 1.2539420552198866e-05, 'samples': 25923264, 'steps': 135016, 'loss/train': 0.7749372124671936} 11/07/2021 16:16:37 - INFO - __main__ - Step 135018: {'lr': 1.2537761031623184e-05, 'samples': 25923456, 'steps': 135017, 'loss/train': 1.2637197971343994} 11/07/2021 16:16:37 - INFO - __main__ - Step 135019: {'lr': 1.2536101618044305e-05, 'samples': 25923648, 'steps': 135018, 'loss/train': 1.081217885017395} 11/07/2021 16:16:37 - INFO - __main__ - Step 135020: {'lr': 1.2534442311463001e-05, 'samples': 25923840, 'steps': 135019, 'loss/train': 0.5918198823928833} 11/07/2021 16:16:38 - INFO - __main__ - Step 135021: {'lr': 1.2532783111880025e-05, 'samples': 25924032, 'steps': 135020, 'loss/train': 0.49363407492637634} 11/07/2021 16:16:38 - INFO - __main__ - Step 135022: {'lr': 1.2531124019296125e-05, 'samples': 25924224, 'steps': 135021, 'loss/train': 0.755631148815155} 11/07/2021 16:16:40 - INFO - __main__ - Step 135023: {'lr': 1.2529465033712023e-05, 'samples': 25924416, 'steps': 135022, 'loss/train': 1.0002045631408691} 11/07/2021 16:16:40 - INFO - __main__ - Step 135024: {'lr': 1.2527806155128469e-05, 'samples': 25924608, 'steps': 135023, 'loss/train': 0.9055296182632446} 11/07/2021 16:16:40 - INFO - __main__ - Step 135025: {'lr': 1.2526147383546238e-05, 'samples': 25924800, 'steps': 135024, 'loss/train': 1.228479266166687} 11/07/2021 16:16:41 - INFO - __main__ - Step 135026: {'lr': 1.2524488718966054e-05, 'samples': 25924992, 'steps': 135025, 'loss/train': 2.6583170890808105} 11/07/2021 16:16:41 - INFO - __main__ - Step 135027: {'lr': 1.2522830161388666e-05, 'samples': 25925184, 'steps': 135026, 'loss/train': 0.8403827548027039} 11/07/2021 16:16:41 - INFO - __main__ - Step 135028: {'lr': 1.2521171710814822e-05, 'samples': 25925376, 'steps': 135027, 'loss/train': 1.0109283924102783} 11/07/2021 16:16:42 - INFO - __main__ - Step 135029: {'lr': 1.2519513367245273e-05, 'samples': 25925568, 'steps': 135028, 'loss/train': 1.6520923376083374} 11/07/2021 16:16:43 - INFO - __main__ - Step 135030: {'lr': 1.2517855130680766e-05, 'samples': 25925760, 'steps': 135029, 'loss/train': 1.365203857421875} 11/07/2021 16:16:43 - INFO - __main__ - Step 135031: {'lr': 1.251619700112211e-05, 'samples': 25925952, 'steps': 135030, 'loss/train': 0.44362136721611023} 11/07/2021 16:16:43 - INFO - __main__ - Step 135032: {'lr': 1.251453897856994e-05, 'samples': 25926144, 'steps': 135031, 'loss/train': 1.965047836303711} 11/07/2021 16:16:44 - INFO - __main__ - Step 135033: {'lr': 1.2512881063025033e-05, 'samples': 25926336, 'steps': 135032, 'loss/train': 1.1884535551071167} 11/07/2021 16:16:45 - INFO - __main__ - Step 135034: {'lr': 1.2511223254488197e-05, 'samples': 25926528, 'steps': 135033, 'loss/train': 1.4467498064041138} 11/07/2021 16:16:46 - INFO - __main__ - Step 135035: {'lr': 1.2509565552960122e-05, 'samples': 25926720, 'steps': 135034, 'loss/train': 1.0878639221191406} 11/07/2021 16:16:46 - INFO - __main__ - Step 135036: {'lr': 1.2507907958441561e-05, 'samples': 25926912, 'steps': 135035, 'loss/train': 1.6711223125457764} 11/07/2021 16:16:46 - INFO - __main__ - Step 135037: {'lr': 1.2506250470933262e-05, 'samples': 25927104, 'steps': 135036, 'loss/train': 1.5263806581497192} 11/07/2021 16:16:47 - INFO - __main__ - Step 135038: {'lr': 1.2504593090436e-05, 'samples': 25927296, 'steps': 135037, 'loss/train': 1.2756544351577759} 11/07/2021 16:16:47 - INFO - __main__ - Step 135039: {'lr': 1.25029358169505e-05, 'samples': 25927488, 'steps': 135038, 'loss/train': 1.0898422002792358} 11/07/2021 16:16:48 - INFO - __main__ - Step 135040: {'lr': 1.2501278650477537e-05, 'samples': 25927680, 'steps': 135039, 'loss/train': 1.2483460903167725} 11/07/2021 16:16:48 - INFO - __main__ - Step 135041: {'lr': 1.2499621591017807e-05, 'samples': 25927872, 'steps': 135040, 'loss/train': 1.4370094537734985} 11/07/2021 16:16:49 - INFO - __main__ - Step 135042: {'lr': 1.2497964638572085e-05, 'samples': 25928064, 'steps': 135041, 'loss/train': 1.3597644567489624} 11/07/2021 16:16:49 - INFO - __main__ - Step 135043: {'lr': 1.249630779314112e-05, 'samples': 25928256, 'steps': 135042, 'loss/train': 1.503480076789856} 11/07/2021 16:16:49 - INFO - __main__ - Step 135044: {'lr': 1.2494651054725693e-05, 'samples': 25928448, 'steps': 135043, 'loss/train': 1.229190468788147} 11/07/2021 16:16:51 - INFO - __main__ - Step 135045: {'lr': 1.2492994423326465e-05, 'samples': 25928640, 'steps': 135044, 'loss/train': 1.42711341381073} 11/07/2021 16:16:51 - INFO - __main__ - Step 135046: {'lr': 1.2491337898944217e-05, 'samples': 25928832, 'steps': 135045, 'loss/train': 1.3905162811279297} 11/07/2021 16:16:52 - INFO - __main__ - Step 135047: {'lr': 1.2489681481579723e-05, 'samples': 25929024, 'steps': 135046, 'loss/train': 1.3413023948669434} 11/07/2021 16:16:52 - INFO - __main__ - Step 135048: {'lr': 1.2488025171233707e-05, 'samples': 25929216, 'steps': 135047, 'loss/train': 1.733249306678772} 11/07/2021 16:16:52 - INFO - __main__ - Step 135049: {'lr': 1.2486368967906946e-05, 'samples': 25929408, 'steps': 135048, 'loss/train': 1.3460432291030884} 11/07/2021 16:16:53 - INFO - __main__ - Step 135050: {'lr': 1.2484712871600135e-05, 'samples': 25929600, 'steps': 135049, 'loss/train': 1.4023873805999756} 11/07/2021 16:16:54 - INFO - __main__ - Step 135051: {'lr': 1.2483056882314076e-05, 'samples': 25929792, 'steps': 135050, 'loss/train': 0.9864641427993774} 11/07/2021 16:16:54 - INFO - __main__ - Step 135052: {'lr': 1.2481401000049463e-05, 'samples': 25929984, 'steps': 135051, 'loss/train': 0.9704720973968506} 11/07/2021 16:16:54 - INFO - __main__ - Step 135053: {'lr': 1.2479745224807049e-05, 'samples': 25930176, 'steps': 135052, 'loss/train': 1.7990117073059082} 11/07/2021 16:16:55 - INFO - __main__ - Step 135054: {'lr': 1.2478089556587635e-05, 'samples': 25930368, 'steps': 135053, 'loss/train': 1.062474250793457} 11/07/2021 16:16:55 - INFO - __main__ - Step 135055: {'lr': 1.2476433995391916e-05, 'samples': 25930560, 'steps': 135054, 'loss/train': 1.24918794631958} 11/07/2021 16:16:56 - INFO - __main__ - Step 135056: {'lr': 1.2474778541220644e-05, 'samples': 25930752, 'steps': 135055, 'loss/train': 1.4080990552902222} 11/07/2021 16:16:56 - INFO - __main__ - Step 135057: {'lr': 1.2473123194074564e-05, 'samples': 25930944, 'steps': 135056, 'loss/train': 1.2299377918243408} 11/07/2021 16:16:57 - INFO - __main__ - Step 135058: {'lr': 1.2471467953954486e-05, 'samples': 25931136, 'steps': 135057, 'loss/train': 1.213752269744873} 11/07/2021 16:16:57 - INFO - __main__ - Step 135059: {'lr': 1.2469812820861044e-05, 'samples': 25931328, 'steps': 135058, 'loss/train': 0.8855772018432617} 11/07/2021 16:16:57 - INFO - __main__ - Step 135060: {'lr': 1.2468157794795042e-05, 'samples': 25931520, 'steps': 135059, 'loss/train': 1.020999550819397} 11/07/2021 16:16:59 - INFO - __main__ - Step 135061: {'lr': 1.2466502875757236e-05, 'samples': 25931712, 'steps': 135060, 'loss/train': 1.3931124210357666} 11/07/2021 16:16:59 - INFO - __main__ - Step 135062: {'lr': 1.2464848063748341e-05, 'samples': 25931904, 'steps': 135061, 'loss/train': 1.5271512269973755} 11/07/2021 16:16:59 - INFO - __main__ - Step 135063: {'lr': 1.2463193358769137e-05, 'samples': 25932096, 'steps': 135062, 'loss/train': 1.6274030208587646} 11/07/2021 16:17:00 - INFO - __main__ - Step 135064: {'lr': 1.2461538760820346e-05, 'samples': 25932288, 'steps': 135063, 'loss/train': 1.04976487159729} 11/07/2021 16:17:00 - INFO - __main__ - Step 135065: {'lr': 1.2459884269902716e-05, 'samples': 25932480, 'steps': 135064, 'loss/train': 0.965180516242981} 11/07/2021 16:17:01 - INFO - __main__ - Step 135066: {'lr': 1.2458229886016998e-05, 'samples': 25932672, 'steps': 135065, 'loss/train': 0.9075781106948853} 11/07/2021 16:17:01 - INFO - __main__ - Step 135067: {'lr': 1.245657560916394e-05, 'samples': 25932864, 'steps': 135066, 'loss/train': 1.4869657754898071} 11/07/2021 16:17:02 - INFO - __main__ - Step 135068: {'lr': 1.2454921439344291e-05, 'samples': 25933056, 'steps': 135067, 'loss/train': 1.1505367755889893} 11/07/2021 16:17:02 - INFO - __main__ - Step 135069: {'lr': 1.2453267376558774e-05, 'samples': 25933248, 'steps': 135068, 'loss/train': 1.4826006889343262} 11/07/2021 16:17:03 - INFO - __main__ - Step 135070: {'lr': 1.2451613420808138e-05, 'samples': 25933440, 'steps': 135069, 'loss/train': 1.0891519784927368} 11/07/2021 16:17:03 - INFO - __main__ - Step 135071: {'lr': 1.2449959572093189e-05, 'samples': 25933632, 'steps': 135070, 'loss/train': 1.275869369506836} 11/07/2021 16:17:04 - INFO - __main__ - Step 135072: {'lr': 1.244830583041459e-05, 'samples': 25933824, 'steps': 135071, 'loss/train': 1.4817147254943848} 11/07/2021 16:17:04 - INFO - __main__ - Step 135073: {'lr': 1.2446652195773123e-05, 'samples': 25934016, 'steps': 135072, 'loss/train': 1.493996024131775} 11/07/2021 16:17:05 - INFO - __main__ - Step 135074: {'lr': 1.2444998668169533e-05, 'samples': 25934208, 'steps': 135073, 'loss/train': 1.504859447479248} 11/07/2021 16:17:05 - INFO - __main__ - Step 135075: {'lr': 1.2443345247604542e-05, 'samples': 25934400, 'steps': 135074, 'loss/train': 1.5118379592895508} 11/07/2021 16:17:05 - INFO - __main__ - Step 135076: {'lr': 1.244169193407893e-05, 'samples': 25934592, 'steps': 135075, 'loss/train': 1.2594751119613647} 11/07/2021 16:17:06 - INFO - __main__ - Step 135077: {'lr': 1.2440038727593418e-05, 'samples': 25934784, 'steps': 135076, 'loss/train': 1.5442560911178589} 11/07/2021 16:17:07 - INFO - __main__ - Step 135078: {'lr': 1.2438385628148751e-05, 'samples': 25934976, 'steps': 135077, 'loss/train': 1.6811840534210205} 11/07/2021 16:17:07 - INFO - __main__ - Step 135079: {'lr': 1.2436732635745711e-05, 'samples': 25935168, 'steps': 135078, 'loss/train': 0.9047268629074097} 11/07/2021 16:17:08 - INFO - __main__ - Step 135080: {'lr': 1.2435079750384992e-05, 'samples': 25935360, 'steps': 135079, 'loss/train': 1.7334729433059692} 11/07/2021 16:17:08 - INFO - __main__ - Step 135081: {'lr': 1.2433426972067341e-05, 'samples': 25935552, 'steps': 135080, 'loss/train': 1.3414268493652344} 11/07/2021 16:17:09 - INFO - __main__ - Step 135082: {'lr': 1.2431774300793564e-05, 'samples': 25935744, 'steps': 135081, 'loss/train': 1.1169989109039307} 11/07/2021 16:17:09 - INFO - __main__ - Step 135083: {'lr': 1.2430121736564325e-05, 'samples': 25935936, 'steps': 135082, 'loss/train': 1.3072046041488647} 11/07/2021 16:17:10 - INFO - __main__ - Step 135084: {'lr': 1.2428469279380433e-05, 'samples': 25936128, 'steps': 135083, 'loss/train': 1.1814078092575073} 11/07/2021 16:17:10 - INFO - __main__ - Step 135085: {'lr': 1.2426816929242634e-05, 'samples': 25936320, 'steps': 135084, 'loss/train': 1.0469050407409668} 11/07/2021 16:17:10 - INFO - __main__ - Step 135086: {'lr': 1.2425164686151596e-05, 'samples': 25936512, 'steps': 135085, 'loss/train': 1.3258172273635864} 11/07/2021 16:17:11 - INFO - __main__ - Step 135087: {'lr': 1.2423512550108151e-05, 'samples': 25936704, 'steps': 135086, 'loss/train': 1.303104043006897} 11/07/2021 16:17:12 - INFO - __main__ - Step 135088: {'lr': 1.2421860521112966e-05, 'samples': 25936896, 'steps': 135087, 'loss/train': 1.297392725944519} 11/07/2021 16:17:12 - INFO - __main__ - Step 135089: {'lr': 1.2420208599166843e-05, 'samples': 25937088, 'steps': 135088, 'loss/train': 1.0922415256500244} 11/07/2021 16:17:12 - INFO - __main__ - Step 135090: {'lr': 1.2418556784270508e-05, 'samples': 25937280, 'steps': 135089, 'loss/train': 1.4914780855178833} 11/07/2021 16:17:13 - INFO - __main__ - Step 135091: {'lr': 1.2416905076424706e-05, 'samples': 25937472, 'steps': 135090, 'loss/train': 1.3851112127304077} 11/07/2021 16:17:14 - INFO - __main__ - Step 135092: {'lr': 1.2415253475630161e-05, 'samples': 25937664, 'steps': 135091, 'loss/train': 0.8830716609954834} 11/07/2021 16:17:14 - INFO - __main__ - Step 135093: {'lr': 1.2413601981887652e-05, 'samples': 25937856, 'steps': 135092, 'loss/train': 0.8867608904838562} 11/07/2021 16:17:15 - INFO - __main__ - Step 135094: {'lr': 1.2411950595197923e-05, 'samples': 25938048, 'steps': 135093, 'loss/train': 1.2958080768585205} 11/07/2021 16:17:15 - INFO - __main__ - Step 135095: {'lr': 1.2410299315561673e-05, 'samples': 25938240, 'steps': 135094, 'loss/train': 0.03190302103757858} 11/07/2021 16:17:15 - INFO - __main__ - Step 135096: {'lr': 1.2408648142979707e-05, 'samples': 25938432, 'steps': 135095, 'loss/train': 1.4514219760894775} 11/07/2021 16:17:16 - INFO - __main__ - Step 135097: {'lr': 1.2406997077452742e-05, 'samples': 25938624, 'steps': 135096, 'loss/train': 1.7260081768035889} 11/07/2021 16:17:17 - INFO - __main__ - Step 135098: {'lr': 1.2405346118981504e-05, 'samples': 25938816, 'steps': 135097, 'loss/train': 1.1117368936538696} 11/07/2021 16:17:17 - INFO - __main__ - Step 135099: {'lr': 1.240369526756674e-05, 'samples': 25939008, 'steps': 135098, 'loss/train': 1.312318205833435} 11/07/2021 16:17:17 - INFO - __main__ - Step 135100: {'lr': 1.24020445232092e-05, 'samples': 25939200, 'steps': 135099, 'loss/train': 1.3440790176391602} 11/07/2021 16:17:18 - INFO - __main__ - Step 135101: {'lr': 1.2400393885909633e-05, 'samples': 25939392, 'steps': 135100, 'loss/train': 1.718983769416809} 11/07/2021 16:17:18 - INFO - __main__ - Step 135102: {'lr': 1.239874335566879e-05, 'samples': 25939584, 'steps': 135101, 'loss/train': 1.5250964164733887} 11/07/2021 16:17:19 - INFO - __main__ - Step 135103: {'lr': 1.239709293248742e-05, 'samples': 25939776, 'steps': 135102, 'loss/train': 1.0584796667099} 11/07/2021 16:17:19 - INFO - __main__ - Step 135104: {'lr': 1.2395442616366243e-05, 'samples': 25939968, 'steps': 135103, 'loss/train': 1.481325626373291} 11/07/2021 16:17:20 - INFO - __main__ - Step 135105: {'lr': 1.239379240730601e-05, 'samples': 25940160, 'steps': 135104, 'loss/train': 1.46134352684021} 11/07/2021 16:17:20 - INFO - __main__ - Step 135106: {'lr': 1.2392142305307469e-05, 'samples': 25940352, 'steps': 135105, 'loss/train': 0.686915934085846} 11/07/2021 16:17:21 - INFO - __main__ - Step 135107: {'lr': 1.2390492310371343e-05, 'samples': 25940544, 'steps': 135106, 'loss/train': 1.2546484470367432} 11/07/2021 16:17:22 - INFO - __main__ - Step 135108: {'lr': 1.2388842422498464e-05, 'samples': 25940736, 'steps': 135107, 'loss/train': 1.341391921043396} 11/07/2021 16:17:22 - INFO - __main__ - Step 135109: {'lr': 1.2387192641689444e-05, 'samples': 25940928, 'steps': 135108, 'loss/train': 0.8601222038269043} 11/07/2021 16:17:23 - INFO - __main__ - Step 135110: {'lr': 1.2385542967945113e-05, 'samples': 25941120, 'steps': 135109, 'loss/train': 1.5352660417556763} 11/07/2021 16:17:23 - INFO - __main__ - Step 135111: {'lr': 1.2383893401266166e-05, 'samples': 25941312, 'steps': 135110, 'loss/train': 1.7120970487594604} 11/07/2021 16:17:23 - INFO - __main__ - Step 135112: {'lr': 1.2382243941653382e-05, 'samples': 25941504, 'steps': 135111, 'loss/train': 1.3671680688858032} 11/07/2021 16:17:24 - INFO - __main__ - Step 135113: {'lr': 1.238059458910748e-05, 'samples': 25941696, 'steps': 135112, 'loss/train': 0.9364462494850159} 11/07/2021 16:17:24 - INFO - __main__ - Step 135114: {'lr': 1.2378945343629238e-05, 'samples': 25941888, 'steps': 135113, 'loss/train': 1.057992696762085} 11/07/2021 16:17:25 - INFO - __main__ - Step 135115: {'lr': 1.237729620521935e-05, 'samples': 25942080, 'steps': 135114, 'loss/train': 1.2100497484207153} 11/07/2021 16:17:25 - INFO - __main__ - Step 135116: {'lr': 1.2375647173878596e-05, 'samples': 25942272, 'steps': 135115, 'loss/train': 1.3404165506362915} 11/07/2021 16:17:26 - INFO - __main__ - Step 135117: {'lr': 1.2373998249607721e-05, 'samples': 25942464, 'steps': 135116, 'loss/train': 1.352951169013977} 11/07/2021 16:17:26 - INFO - __main__ - Step 135118: {'lr': 1.2372349432407449e-05, 'samples': 25942656, 'steps': 135117, 'loss/train': 1.4958609342575073} 11/07/2021 16:17:27 - INFO - __main__ - Step 135119: {'lr': 1.2370700722278555e-05, 'samples': 25942848, 'steps': 135118, 'loss/train': 1.683405876159668} 11/07/2021 16:17:28 - INFO - __main__ - Step 135120: {'lr': 1.2369052119221736e-05, 'samples': 25943040, 'steps': 135119, 'loss/train': 1.3071339130401611} 11/07/2021 16:17:28 - INFO - __main__ - Step 135121: {'lr': 1.2367403623237738e-05, 'samples': 25943232, 'steps': 135120, 'loss/train': 1.4543859958648682} 11/07/2021 16:17:28 - INFO - __main__ - Step 135122: {'lr': 1.2365755234327341e-05, 'samples': 25943424, 'steps': 135121, 'loss/train': 1.9854482412338257} 11/07/2021 16:17:29 - INFO - __main__ - Step 135123: {'lr': 1.2364106952491267e-05, 'samples': 25943616, 'steps': 135122, 'loss/train': 1.3878759145736694} 11/07/2021 16:17:29 - INFO - __main__ - Step 135124: {'lr': 1.2362458777730235e-05, 'samples': 25943808, 'steps': 135123, 'loss/train': 1.4876755475997925} 11/07/2021 16:17:30 - INFO - __main__ - Step 135125: {'lr': 1.2360810710045024e-05, 'samples': 25944000, 'steps': 135124, 'loss/train': 1.4244381189346313} 11/07/2021 16:17:30 - INFO - __main__ - Step 135126: {'lr': 1.235916274943638e-05, 'samples': 25944192, 'steps': 135125, 'loss/train': 1.4188073873519897} 11/07/2021 16:17:31 - INFO - __main__ - Step 135127: {'lr': 1.2357514895905003e-05, 'samples': 25944384, 'steps': 135126, 'loss/train': 0.9094980359077454} 11/07/2021 16:17:31 - INFO - __main__ - Step 135128: {'lr': 1.2355867149451694e-05, 'samples': 25944576, 'steps': 135127, 'loss/train': 1.1555670499801636} 11/07/2021 16:17:31 - INFO - __main__ - Step 135129: {'lr': 1.2354219510077147e-05, 'samples': 25944768, 'steps': 135128, 'loss/train': 1.6465119123458862} 11/07/2021 16:17:32 - INFO - __main__ - Step 135130: {'lr': 1.235257197778214e-05, 'samples': 25944960, 'steps': 135129, 'loss/train': 1.2732369899749756} 11/07/2021 16:17:33 - INFO - __main__ - Step 135131: {'lr': 1.2350924552567394e-05, 'samples': 25945152, 'steps': 135130, 'loss/train': 0.7675484418869019} 11/07/2021 16:17:33 - INFO - __main__ - Step 135132: {'lr': 1.2349277234433632e-05, 'samples': 25945344, 'steps': 135131, 'loss/train': 1.3176171779632568} 11/07/2021 16:17:34 - INFO - __main__ - Step 135133: {'lr': 1.2347630023381628e-05, 'samples': 25945536, 'steps': 135132, 'loss/train': 1.4416487216949463} 11/07/2021 16:17:34 - INFO - __main__ - Step 135134: {'lr': 1.2345982919412108e-05, 'samples': 25945728, 'steps': 135133, 'loss/train': 1.1428427696228027} 11/07/2021 16:17:34 - INFO - __main__ - Step 135135: {'lr': 1.2344335922525845e-05, 'samples': 25945920, 'steps': 135134, 'loss/train': 0.9004318118095398} 11/07/2021 16:17:35 - INFO - __main__ - Step 135136: {'lr': 1.2342689032723537e-05, 'samples': 25946112, 'steps': 135135, 'loss/train': 1.3568928241729736} 11/07/2021 16:17:36 - INFO - __main__ - Step 135137: {'lr': 1.2341042250005929e-05, 'samples': 25946304, 'steps': 135136, 'loss/train': 1.3104387521743774} 11/07/2021 16:17:36 - INFO - __main__ - Step 135138: {'lr': 1.2339395574373801e-05, 'samples': 25946496, 'steps': 135137, 'loss/train': 1.7921360731124878} 11/07/2021 16:17:36 - INFO - __main__ - Step 135139: {'lr': 1.2337749005827876e-05, 'samples': 25946688, 'steps': 135138, 'loss/train': 1.1057522296905518} 11/07/2021 16:17:37 - INFO - __main__ - Step 135140: {'lr': 1.2336102544368926e-05, 'samples': 25946880, 'steps': 135139, 'loss/train': 1.642526388168335} 11/07/2021 16:17:38 - INFO - __main__ - Step 135141: {'lr': 1.2334456189997623e-05, 'samples': 25947072, 'steps': 135140, 'loss/train': 1.3991906642913818} 11/07/2021 16:17:38 - INFO - __main__ - Step 135142: {'lr': 1.233280994271474e-05, 'samples': 25947264, 'steps': 135141, 'loss/train': 1.553209900856018} 11/07/2021 16:17:38 - INFO - __main__ - Step 135143: {'lr': 1.2331163802521029e-05, 'samples': 25947456, 'steps': 135142, 'loss/train': 1.4600228071212769} 11/07/2021 16:17:39 - INFO - __main__ - Step 135144: {'lr': 1.2329517769417237e-05, 'samples': 25947648, 'steps': 135143, 'loss/train': 1.4660699367523193} 11/07/2021 16:17:39 - INFO - __main__ - Step 135145: {'lr': 1.2327871843404087e-05, 'samples': 25947840, 'steps': 135144, 'loss/train': 1.1009526252746582} 11/07/2021 16:17:40 - INFO - __main__ - Step 135146: {'lr': 1.2326226024482329e-05, 'samples': 25948032, 'steps': 135145, 'loss/train': 0.03827543556690216} 11/07/2021 16:17:40 - INFO - __main__ - Step 135147: {'lr': 1.2324580312652738e-05, 'samples': 25948224, 'steps': 135146, 'loss/train': 1.4719784259796143} 11/07/2021 16:17:41 - INFO - __main__ - Step 135148: {'lr': 1.2322934707915983e-05, 'samples': 25948416, 'steps': 135147, 'loss/train': 1.7896572351455688} 11/07/2021 16:17:41 - INFO - __main__ - Step 135149: {'lr': 1.2321289210272868e-05, 'samples': 25948608, 'steps': 135148, 'loss/train': 1.03617262840271} 11/07/2021 16:17:42 - INFO - __main__ - Step 135150: {'lr': 1.2319643819724113e-05, 'samples': 25948800, 'steps': 135149, 'loss/train': 1.1674667596817017} 11/07/2021 16:17:43 - INFO - __main__ - Step 135151: {'lr': 1.2317998536270441e-05, 'samples': 25948992, 'steps': 135150, 'loss/train': 2.9631948471069336} 11/07/2021 16:17:43 - INFO - __main__ - Step 135152: {'lr': 1.2316353359912657e-05, 'samples': 25949184, 'steps': 135151, 'loss/train': 1.4894578456878662} 11/07/2021 16:17:43 - INFO - __main__ - Step 135153: {'lr': 1.2314708290651427e-05, 'samples': 25949376, 'steps': 135152, 'loss/train': 0.9059475660324097} 11/07/2021 16:17:44 - INFO - __main__ - Step 135154: {'lr': 1.2313063328487501e-05, 'samples': 25949568, 'steps': 135153, 'loss/train': 1.3015331029891968} 11/07/2021 16:17:44 - INFO - __main__ - Step 135155: {'lr': 1.2311418473421654e-05, 'samples': 25949760, 'steps': 135154, 'loss/train': 1.1655874252319336} 11/07/2021 16:17:45 - INFO - __main__ - Step 135156: {'lr': 1.230977372545461e-05, 'samples': 25949952, 'steps': 135155, 'loss/train': 0.8545206785202026} 11/07/2021 16:17:45 - INFO - __main__ - Step 135157: {'lr': 1.2308129084587144e-05, 'samples': 25950144, 'steps': 135156, 'loss/train': 1.0461499691009521} 11/07/2021 16:17:46 - INFO - __main__ - Step 135158: {'lr': 1.2306484550819924e-05, 'samples': 25950336, 'steps': 135157, 'loss/train': 1.738667368888855} 11/07/2021 16:17:46 - INFO - __main__ - Step 135159: {'lr': 1.2304840124153755e-05, 'samples': 25950528, 'steps': 135158, 'loss/train': 1.2522555589675903} 11/07/2021 16:17:46 - INFO - __main__ - Step 135160: {'lr': 1.2303195804589357e-05, 'samples': 25950720, 'steps': 135159, 'loss/train': 1.1294143199920654} 11/07/2021 16:17:48 - INFO - __main__ - Step 135161: {'lr': 1.230155159212748e-05, 'samples': 25950912, 'steps': 135160, 'loss/train': 2.0991108417510986} 11/07/2021 16:17:48 - INFO - __main__ - Step 135162: {'lr': 1.2299907486768847e-05, 'samples': 25951104, 'steps': 135161, 'loss/train': 1.4365276098251343} 11/07/2021 16:17:48 - INFO - __main__ - Step 135163: {'lr': 1.2298263488514205e-05, 'samples': 25951296, 'steps': 135162, 'loss/train': 1.090226650238037} 11/07/2021 16:17:49 - INFO - __main__ - Step 135164: {'lr': 1.2296619597364278e-05, 'samples': 25951488, 'steps': 135163, 'loss/train': 1.5126222372055054} 11/07/2021 16:17:49 - INFO - __main__ - Step 135165: {'lr': 1.2294975813319898e-05, 'samples': 25951680, 'steps': 135164, 'loss/train': 0.5257989764213562} 11/07/2021 16:17:49 - INFO - __main__ - Step 135166: {'lr': 1.2293332136381675e-05, 'samples': 25951872, 'steps': 135165, 'loss/train': 1.4892191886901855} 11/07/2021 16:17:51 - INFO - __main__ - Step 135167: {'lr': 1.2291688566550413e-05, 'samples': 25952064, 'steps': 135166, 'loss/train': 1.328481912612915} 11/07/2021 16:17:51 - INFO - __main__ - Step 135168: {'lr': 1.2290045103826863e-05, 'samples': 25952256, 'steps': 135167, 'loss/train': 1.2385026216506958} 11/07/2021 16:17:51 - INFO - __main__ - Step 135169: {'lr': 1.228840174821172e-05, 'samples': 25952448, 'steps': 135168, 'loss/train': 0.9539862275123596} 11/07/2021 16:17:52 - INFO - __main__ - Step 135170: {'lr': 1.2286758499705785e-05, 'samples': 25952640, 'steps': 135169, 'loss/train': 1.3803377151489258} 11/07/2021 16:17:52 - INFO - __main__ - Step 135171: {'lr': 1.2285115358309756e-05, 'samples': 25952832, 'steps': 135170, 'loss/train': 0.9308826923370361} 11/07/2021 16:17:53 - INFO - __main__ - Step 135172: {'lr': 1.228347232402438e-05, 'samples': 25953024, 'steps': 135171, 'loss/train': 1.3350974321365356} 11/07/2021 16:17:53 - INFO - __main__ - Step 135173: {'lr': 1.2281829396850409e-05, 'samples': 25953216, 'steps': 135172, 'loss/train': 1.1601303815841675} 11/07/2021 16:17:54 - INFO - __main__ - Step 135174: {'lr': 1.2280186576788588e-05, 'samples': 25953408, 'steps': 135173, 'loss/train': 1.5677231550216675} 11/07/2021 16:17:54 - INFO - __main__ - Step 135175: {'lr': 1.2278543863839642e-05, 'samples': 25953600, 'steps': 135174, 'loss/train': 0.5164360404014587} 11/07/2021 16:17:54 - INFO - __main__ - Step 135176: {'lr': 1.227690125800432e-05, 'samples': 25953792, 'steps': 135175, 'loss/train': 1.1338999271392822} 11/07/2021 16:17:55 - INFO - __main__ - Step 135177: {'lr': 1.2275258759283342e-05, 'samples': 25953984, 'steps': 135176, 'loss/train': 1.2369372844696045} 11/07/2021 16:17:56 - INFO - __main__ - Step 135178: {'lr': 1.2273616367677459e-05, 'samples': 25954176, 'steps': 135177, 'loss/train': 1.1620670557022095} 11/07/2021 16:17:56 - INFO - __main__ - Step 135179: {'lr': 1.2271974083187476e-05, 'samples': 25954368, 'steps': 135178, 'loss/train': 0.6647518873214722} 11/07/2021 16:17:56 - INFO - __main__ - Step 135180: {'lr': 1.227033190581403e-05, 'samples': 25954560, 'steps': 135179, 'loss/train': 1.0126852989196777} 11/07/2021 16:17:57 - INFO - __main__ - Step 135181: {'lr': 1.22686898355579e-05, 'samples': 25954752, 'steps': 135180, 'loss/train': 1.1058834791183472} 11/07/2021 16:17:57 - INFO - __main__ - Step 135182: {'lr': 1.2267047872419834e-05, 'samples': 25954944, 'steps': 135181, 'loss/train': 2.005882501602173} 11/07/2021 16:17:58 - INFO - __main__ - Step 135183: {'lr': 1.2265406016400554e-05, 'samples': 25955136, 'steps': 135182, 'loss/train': 1.692211627960205} 11/07/2021 16:17:59 - INFO - __main__ - Step 135184: {'lr': 1.226376426750081e-05, 'samples': 25955328, 'steps': 135183, 'loss/train': 1.0590317249298096} 11/07/2021 16:17:59 - INFO - __main__ - Step 135185: {'lr': 1.2262122625721377e-05, 'samples': 25955520, 'steps': 135184, 'loss/train': 1.4505399465560913} 11/07/2021 16:17:59 - INFO - __main__ - Step 135186: {'lr': 1.2260481091062925e-05, 'samples': 25955712, 'steps': 135185, 'loss/train': 1.222041368484497} 11/07/2021 16:18:00 - INFO - __main__ - Step 135187: {'lr': 1.2258839663526256e-05, 'samples': 25955904, 'steps': 135186, 'loss/train': 1.1924911737442017} 11/07/2021 16:18:01 - INFO - __main__ - Step 135188: {'lr': 1.2257198343112091e-05, 'samples': 25956096, 'steps': 135187, 'loss/train': 1.0227104425430298} 11/07/2021 16:18:01 - INFO - __main__ - Step 135189: {'lr': 1.2255557129821154e-05, 'samples': 25956288, 'steps': 135188, 'loss/train': 1.343488097190857} 11/07/2021 16:18:01 - INFO - __main__ - Step 135190: {'lr': 1.2253916023654193e-05, 'samples': 25956480, 'steps': 135189, 'loss/train': 1.4673658609390259} 11/07/2021 16:18:02 - INFO - __main__ - Step 135191: {'lr': 1.2252275024611958e-05, 'samples': 25956672, 'steps': 135190, 'loss/train': 1.4323927164077759} 11/07/2021 16:18:02 - INFO - __main__ - Step 135192: {'lr': 1.2250634132695198e-05, 'samples': 25956864, 'steps': 135191, 'loss/train': 1.3824574947357178} 11/07/2021 16:18:03 - INFO - __main__ - Step 135193: {'lr': 1.2248993347904607e-05, 'samples': 25957056, 'steps': 135192, 'loss/train': 1.941829800605774} 11/07/2021 16:18:04 - INFO - __main__ - Step 135194: {'lr': 1.2247352670240936e-05, 'samples': 25957248, 'steps': 135193, 'loss/train': 0.9565936923027039} 11/07/2021 16:18:04 - INFO - __main__ - Step 135195: {'lr': 1.2245712099704958e-05, 'samples': 25957440, 'steps': 135194, 'loss/train': 1.260667085647583} 11/07/2021 16:18:04 - INFO - __main__ - Step 135196: {'lr': 1.2244071636297399e-05, 'samples': 25957632, 'steps': 135195, 'loss/train': 1.2266654968261719} 11/07/2021 16:18:05 - INFO - __main__ - Step 135197: {'lr': 1.2242431280018978e-05, 'samples': 25957824, 'steps': 135196, 'loss/train': 1.185529351234436} 11/07/2021 16:18:06 - INFO - __main__ - Step 135198: {'lr': 1.2240791030870446e-05, 'samples': 25958016, 'steps': 135197, 'loss/train': 1.1788297891616821} 11/07/2021 16:18:06 - INFO - __main__ - Step 135199: {'lr': 1.223915088885255e-05, 'samples': 25958208, 'steps': 135198, 'loss/train': 1.105293869972229} 11/07/2021 16:18:06 - INFO - __main__ - Step 135200: {'lr': 1.2237510853966044e-05, 'samples': 25958400, 'steps': 135199, 'loss/train': 1.476065754890442} 11/07/2021 16:18:07 - INFO - __main__ - Step 135201: {'lr': 1.2235870926211617e-05, 'samples': 25958592, 'steps': 135200, 'loss/train': 1.1976794004440308} 11/07/2021 16:18:07 - INFO - __main__ - Step 135202: {'lr': 1.2234231105590048e-05, 'samples': 25958784, 'steps': 135201, 'loss/train': 1.4920928478240967} 11/07/2021 16:18:08 - INFO - __main__ - Step 135203: {'lr': 1.223259139210206e-05, 'samples': 25958976, 'steps': 135202, 'loss/train': 0.8967857360839844} 11/07/2021 16:18:08 - INFO - __main__ - Step 135204: {'lr': 1.22309517857484e-05, 'samples': 25959168, 'steps': 135203, 'loss/train': 1.328546404838562} 11/07/2021 16:18:09 - INFO - __main__ - Step 135205: {'lr': 1.222931228652982e-05, 'samples': 25959360, 'steps': 135204, 'loss/train': 0.5553337931632996} 11/07/2021 16:18:09 - INFO - __main__ - Step 135206: {'lr': 1.2227672894447067e-05, 'samples': 25959552, 'steps': 135205, 'loss/train': 1.4767874479293823} 11/07/2021 16:18:09 - INFO - __main__ - Step 135207: {'lr': 1.2226033609500809e-05, 'samples': 25959744, 'steps': 135206, 'loss/train': 1.3807142972946167} 11/07/2021 16:18:10 - INFO - __main__ - Step 135208: {'lr': 1.2224394431691849e-05, 'samples': 25959936, 'steps': 135207, 'loss/train': 0.7872051000595093} 11/07/2021 16:18:12 - INFO - __main__ - Step 135209: {'lr': 1.222275536102091e-05, 'samples': 25960128, 'steps': 135208, 'loss/train': 0.9937995076179504} 11/07/2021 16:18:12 - INFO - __main__ - Step 135210: {'lr': 1.2221116397488712e-05, 'samples': 25960320, 'steps': 135209, 'loss/train': 1.3616646528244019} 11/07/2021 16:18:13 - INFO - __main__ - Step 135211: {'lr': 1.2219477541096008e-05, 'samples': 25960512, 'steps': 135210, 'loss/train': 0.8608160614967346} 11/07/2021 16:18:13 - INFO - __main__ - Step 135212: {'lr': 1.2217838791843544e-05, 'samples': 25960704, 'steps': 135211, 'loss/train': 1.1375707387924194} 11/07/2021 16:18:13 - INFO - __main__ - Step 135213: {'lr': 1.221620014973207e-05, 'samples': 25960896, 'steps': 135212, 'loss/train': 1.622380256652832} 11/07/2021 16:18:14 - INFO - __main__ - Step 135214: {'lr': 1.2214561614762281e-05, 'samples': 25961088, 'steps': 135213, 'loss/train': 1.6463074684143066} 11/07/2021 16:18:14 - INFO - __main__ - Step 135215: {'lr': 1.2212923186934955e-05, 'samples': 25961280, 'steps': 135214, 'loss/train': 1.3447837829589844} 11/07/2021 16:18:15 - INFO - __main__ - Step 135216: {'lr': 1.2211284866250811e-05, 'samples': 25961472, 'steps': 135215, 'loss/train': 1.1450272798538208} 11/07/2021 16:18:15 - INFO - __main__ - Step 135217: {'lr': 1.2209646652710599e-05, 'samples': 25961664, 'steps': 135216, 'loss/train': 1.2230576276779175} 11/07/2021 16:18:16 - INFO - __main__ - Step 135218: {'lr': 1.2208008546315042e-05, 'samples': 25961856, 'steps': 135217, 'loss/train': 1.0495827198028564} 11/07/2021 16:18:16 - INFO - __main__ - Step 135219: {'lr': 1.2206370547064916e-05, 'samples': 25962048, 'steps': 135218, 'loss/train': 0.9447384476661682} 11/07/2021 16:18:16 - INFO - __main__ - Step 135220: {'lr': 1.2204732654960914e-05, 'samples': 25962240, 'steps': 135219, 'loss/train': 1.5867635011672974} 11/07/2021 16:18:18 - INFO - __main__ - Step 135221: {'lr': 1.220309487000379e-05, 'samples': 25962432, 'steps': 135220, 'loss/train': 1.4266809225082397} 11/07/2021 16:18:18 - INFO - __main__ - Step 135222: {'lr': 1.220145719219426e-05, 'samples': 25962624, 'steps': 135221, 'loss/train': 1.4714590311050415} 11/07/2021 16:18:18 - INFO - __main__ - Step 135223: {'lr': 1.2199819621533103e-05, 'samples': 25962816, 'steps': 135222, 'loss/train': 2.0346009731292725} 11/07/2021 16:18:19 - INFO - __main__ - Step 135224: {'lr': 1.2198182158021042e-05, 'samples': 25963008, 'steps': 135223, 'loss/train': 1.2211432456970215} 11/07/2021 16:18:19 - INFO - __main__ - Step 135225: {'lr': 1.21965448016588e-05, 'samples': 25963200, 'steps': 135224, 'loss/train': 1.4095314741134644} 11/07/2021 16:18:20 - INFO - __main__ - Step 135226: {'lr': 1.2194907552447121e-05, 'samples': 25963392, 'steps': 135225, 'loss/train': 0.8967146873474121} 11/07/2021 16:18:21 - INFO - __main__ - Step 135227: {'lr': 1.2193270410386758e-05, 'samples': 25963584, 'steps': 135226, 'loss/train': 1.1137174367904663} 11/07/2021 16:18:21 - INFO - __main__ - Step 135228: {'lr': 1.2191633375478434e-05, 'samples': 25963776, 'steps': 135227, 'loss/train': 1.2854071855545044} 11/07/2021 16:18:21 - INFO - __main__ - Step 135229: {'lr': 1.2189996447722896e-05, 'samples': 25963968, 'steps': 135228, 'loss/train': 1.1181578636169434} 11/07/2021 16:18:22 - INFO - __main__ - Step 135230: {'lr': 1.2188359627120866e-05, 'samples': 25964160, 'steps': 135229, 'loss/train': 1.1591566801071167} 11/07/2021 16:18:23 - INFO - __main__ - Step 135231: {'lr': 1.2186722913673093e-05, 'samples': 25964352, 'steps': 135230, 'loss/train': 1.3983285427093506} 11/07/2021 16:18:23 - INFO - __main__ - Step 135232: {'lr': 1.2185086307380355e-05, 'samples': 25964544, 'steps': 135231, 'loss/train': 1.215029001235962} 11/07/2021 16:18:23 - INFO - __main__ - Step 135233: {'lr': 1.218344980824329e-05, 'samples': 25964736, 'steps': 135232, 'loss/train': 1.3357264995574951} 11/07/2021 16:18:24 - INFO - __main__ - Step 135234: {'lr': 1.2181813416262732e-05, 'samples': 25964928, 'steps': 135233, 'loss/train': 1.901018738746643} 11/07/2021 16:18:24 - INFO - __main__ - Step 135235: {'lr': 1.2180177131439347e-05, 'samples': 25965120, 'steps': 135234, 'loss/train': 0.7743856906890869} 11/07/2021 16:18:24 - INFO - __main__ - Step 135236: {'lr': 1.217854095377391e-05, 'samples': 25965312, 'steps': 135235, 'loss/train': 0.9984018206596375} 11/07/2021 16:18:25 - INFO - __main__ - Step 135237: {'lr': 1.217690488326717e-05, 'samples': 25965504, 'steps': 135236, 'loss/train': 1.2106927633285522} 11/07/2021 16:18:26 - INFO - __main__ - Step 135238: {'lr': 1.2175268919919823e-05, 'samples': 25965696, 'steps': 135237, 'loss/train': 1.3692418336868286} 11/07/2021 16:18:26 - INFO - __main__ - Step 135239: {'lr': 1.2173633063732648e-05, 'samples': 25965888, 'steps': 135238, 'loss/train': 1.270953893661499} 11/07/2021 16:18:27 - INFO - __main__ - Step 135240: {'lr': 1.2171997314706362e-05, 'samples': 25966080, 'steps': 135239, 'loss/train': 1.3354945182800293} 11/07/2021 16:18:27 - INFO - __main__ - Step 135241: {'lr': 1.217036167284169e-05, 'samples': 25966272, 'steps': 135240, 'loss/train': 1.6389700174331665} 11/07/2021 16:18:28 - INFO - __main__ - Step 135242: {'lr': 1.216872613813938e-05, 'samples': 25966464, 'steps': 135241, 'loss/train': 1.0930882692337036} 11/07/2021 16:18:28 - INFO - __main__ - Step 135243: {'lr': 1.2167090710600182e-05, 'samples': 25966656, 'steps': 135242, 'loss/train': 1.4596387147903442} 11/07/2021 16:18:29 - INFO - __main__ - Step 135244: {'lr': 1.2165455390224844e-05, 'samples': 25966848, 'steps': 135243, 'loss/train': 1.5859965085983276} 11/07/2021 16:18:29 - INFO - __main__ - Step 135245: {'lr': 1.2163820177014062e-05, 'samples': 25967040, 'steps': 135244, 'loss/train': 1.1361795663833618} 11/07/2021 16:18:29 - INFO - __main__ - Step 135246: {'lr': 1.2162185070968612e-05, 'samples': 25967232, 'steps': 135245, 'loss/train': 1.0663836002349854} 11/07/2021 16:18:30 - INFO - __main__ - Step 135247: {'lr': 1.2160550072089216e-05, 'samples': 25967424, 'steps': 135246, 'loss/train': 5.668532371520996} 11/07/2021 16:18:31 - INFO - __main__ - Step 135248: {'lr': 1.2158915180376568e-05, 'samples': 25967616, 'steps': 135247, 'loss/train': 1.2614766359329224} 11/07/2021 16:18:31 - INFO - __main__ - Step 135249: {'lr': 1.2157280395831471e-05, 'samples': 25967808, 'steps': 135248, 'loss/train': 1.3761194944381714} 11/07/2021 16:18:31 - INFO - __main__ - Step 135250: {'lr': 1.2155645718454622e-05, 'samples': 25968000, 'steps': 135249, 'loss/train': 1.0055313110351562} 11/07/2021 16:18:32 - INFO - __main__ - Step 135251: {'lr': 1.215401114824674e-05, 'samples': 25968192, 'steps': 135250, 'loss/train': 1.0418249368667603} 11/07/2021 16:18:32 - INFO - __main__ - Step 135252: {'lr': 1.2152376685208633e-05, 'samples': 25968384, 'steps': 135251, 'loss/train': 1.726844072341919} 11/07/2021 16:18:33 - INFO - __main__ - Step 135253: {'lr': 1.2150742329340964e-05, 'samples': 25968576, 'steps': 135252, 'loss/train': 1.7223682403564453} 11/07/2021 16:18:34 - INFO - __main__ - Step 135254: {'lr': 1.2149108080644539e-05, 'samples': 25968768, 'steps': 135253, 'loss/train': 1.3888300657272339} 11/07/2021 16:18:34 - INFO - __main__ - Step 135255: {'lr': 1.2147473939120024e-05, 'samples': 25968960, 'steps': 135254, 'loss/train': 1.528988242149353} 11/07/2021 16:18:34 - INFO - __main__ - Step 135256: {'lr': 1.2145839904768197e-05, 'samples': 25969152, 'steps': 135255, 'loss/train': 1.1111083030700684} 11/07/2021 16:18:35 - INFO - __main__ - Step 135257: {'lr': 1.2144205977589778e-05, 'samples': 25969344, 'steps': 135256, 'loss/train': 1.3904719352722168} 11/07/2021 16:18:36 - INFO - __main__ - Step 135258: {'lr': 1.214257215758552e-05, 'samples': 25969536, 'steps': 135257, 'loss/train': 1.634677529335022} 11/07/2021 16:18:36 - INFO - __main__ - Step 135259: {'lr': 1.2140938444756167e-05, 'samples': 25969728, 'steps': 135258, 'loss/train': 0.1735350638628006} 11/07/2021 16:18:36 - INFO - __main__ - Step 135260: {'lr': 1.2139304839102417e-05, 'samples': 25969920, 'steps': 135259, 'loss/train': 1.1920303106307983} 11/07/2021 16:18:37 - INFO - __main__ - Step 135261: {'lr': 1.2137671340625018e-05, 'samples': 25970112, 'steps': 135260, 'loss/train': 1.399727702140808} 11/07/2021 16:18:37 - INFO - __main__ - Step 135262: {'lr': 1.2136037949324718e-05, 'samples': 25970304, 'steps': 135261, 'loss/train': 0.912204921245575} 11/07/2021 16:18:38 - INFO - __main__ - Step 135263: {'lr': 1.2134404665202242e-05, 'samples': 25970496, 'steps': 135262, 'loss/train': 0.7850894927978516} 11/07/2021 16:18:39 - INFO - __main__ - Step 135264: {'lr': 1.2132771488258337e-05, 'samples': 25970688, 'steps': 135263, 'loss/train': 0.03385191783308983} 11/07/2021 16:18:39 - INFO - __main__ - Step 135265: {'lr': 1.2131138418493753e-05, 'samples': 25970880, 'steps': 135264, 'loss/train': 1.2209478616714478} 11/07/2021 16:18:39 - INFO - __main__ - Step 135266: {'lr': 1.2129505455909184e-05, 'samples': 25971072, 'steps': 135265, 'loss/train': 1.0998179912567139} 11/07/2021 16:18:40 - INFO - __main__ - Step 135267: {'lr': 1.212787260050538e-05, 'samples': 25971264, 'steps': 135266, 'loss/train': 1.387648105621338} 11/07/2021 16:18:41 - INFO - __main__ - Step 135268: {'lr': 1.2126239852283116e-05, 'samples': 25971456, 'steps': 135267, 'loss/train': 1.0820425748825073} 11/07/2021 16:18:41 - INFO - __main__ - Step 135269: {'lr': 1.2124607211243088e-05, 'samples': 25971648, 'steps': 135268, 'loss/train': 1.0987969636917114} 11/07/2021 16:18:41 - INFO - __main__ - Step 135270: {'lr': 1.2122974677386017e-05, 'samples': 25971840, 'steps': 135269, 'loss/train': 1.371515154838562} 11/07/2021 16:18:42 - INFO - __main__ - Step 135271: {'lr': 1.2121342250712708e-05, 'samples': 25972032, 'steps': 135270, 'loss/train': 1.73945951461792} 11/07/2021 16:18:42 - INFO - __main__ - Step 135272: {'lr': 1.2119709931223828e-05, 'samples': 25972224, 'steps': 135271, 'loss/train': 1.0466722249984741} 11/07/2021 16:18:43 - INFO - __main__ - Step 135273: {'lr': 1.2118077718920151e-05, 'samples': 25972416, 'steps': 135272, 'loss/train': 1.1872605085372925} 11/07/2021 16:18:43 - INFO - __main__ - Step 135274: {'lr': 1.2116445613802374e-05, 'samples': 25972608, 'steps': 135273, 'loss/train': 1.1800575256347656} 11/07/2021 16:18:44 - INFO - __main__ - Step 135275: {'lr': 1.2114813615871273e-05, 'samples': 25972800, 'steps': 135274, 'loss/train': 1.100616216659546} 11/07/2021 16:18:44 - INFO - __main__ - Step 135276: {'lr': 1.211318172512757e-05, 'samples': 25972992, 'steps': 135275, 'loss/train': 0.9081578850746155} 11/07/2021 16:18:45 - INFO - __main__ - Step 135277: {'lr': 1.2111549941571958e-05, 'samples': 25973184, 'steps': 135276, 'loss/train': 0.7516806721687317} 11/07/2021 16:18:45 - INFO - __main__ - Step 135278: {'lr': 1.2109918265205245e-05, 'samples': 25973376, 'steps': 135277, 'loss/train': 1.3656413555145264} 11/07/2021 16:18:46 - INFO - __main__ - Step 135279: {'lr': 1.210828669602812e-05, 'samples': 25973568, 'steps': 135278, 'loss/train': 1.1567459106445312} 11/07/2021 16:18:47 - INFO - __main__ - Step 135280: {'lr': 1.2106655234041337e-05, 'samples': 25973760, 'steps': 135279, 'loss/train': 1.5191524028778076} 11/07/2021 16:18:47 - INFO - __main__ - Step 135281: {'lr': 1.2105023879245613e-05, 'samples': 25973952, 'steps': 135280, 'loss/train': 1.2727545499801636} 11/07/2021 16:18:47 - INFO - __main__ - Step 135282: {'lr': 1.2103392631641701e-05, 'samples': 25974144, 'steps': 135281, 'loss/train': 0.8816916346549988} 11/07/2021 16:18:48 - INFO - __main__ - Step 135283: {'lr': 1.2101761491230324e-05, 'samples': 25974336, 'steps': 135282, 'loss/train': 1.749528408050537} 11/07/2021 16:18:48 - INFO - __main__ - Step 135284: {'lr': 1.2100130458012226e-05, 'samples': 25974528, 'steps': 135283, 'loss/train': 0.4747309982776642} 11/07/2021 16:18:49 - INFO - __main__ - Step 135285: {'lr': 1.209849953198816e-05, 'samples': 25974720, 'steps': 135284, 'loss/train': 1.6815627813339233} 11/07/2021 16:18:50 - INFO - __main__ - Step 135286: {'lr': 1.2096868713158848e-05, 'samples': 25974912, 'steps': 135285, 'loss/train': 1.4315184354782104} 11/07/2021 16:18:50 - INFO - __main__ - Step 135287: {'lr': 1.2095238001524982e-05, 'samples': 25975104, 'steps': 135286, 'loss/train': 1.4689027070999146} 11/07/2021 16:18:50 - INFO - __main__ - Step 135288: {'lr': 1.2093607397087341e-05, 'samples': 25975296, 'steps': 135287, 'loss/train': 1.357156753540039} 11/07/2021 16:18:51 - INFO - __main__ - Step 135289: {'lr': 1.2091976899846646e-05, 'samples': 25975488, 'steps': 135288, 'loss/train': 1.808677077293396} 11/07/2021 16:18:52 - INFO - __main__ - Step 135290: {'lr': 1.2090346509803618e-05, 'samples': 25975680, 'steps': 135289, 'loss/train': 1.5418965816497803} 11/07/2021 16:18:52 - INFO - __main__ - Step 135291: {'lr': 1.2088716226959035e-05, 'samples': 25975872, 'steps': 135290, 'loss/train': 1.2265511751174927} 11/07/2021 16:18:52 - INFO - __main__ - Step 135292: {'lr': 1.208708605131359e-05, 'samples': 25976064, 'steps': 135291, 'loss/train': 1.3345671892166138} 11/07/2021 16:18:53 - INFO - __main__ - Step 135293: {'lr': 1.2085455982868033e-05, 'samples': 25976256, 'steps': 135292, 'loss/train': 1.2837361097335815} 11/07/2021 16:18:53 - INFO - __main__ - Step 135294: {'lr': 1.2083826021623112e-05, 'samples': 25976448, 'steps': 135293, 'loss/train': 1.215901494026184} 11/07/2021 16:18:54 - INFO - __main__ - Step 135295: {'lr': 1.2082196167579524e-05, 'samples': 25976640, 'steps': 135294, 'loss/train': 1.1629165410995483} 11/07/2021 16:18:54 - INFO - __main__ - Step 135296: {'lr': 1.2080566420738042e-05, 'samples': 25976832, 'steps': 135295, 'loss/train': 0.7577112913131714} 11/07/2021 16:18:55 - INFO - __main__ - Step 135297: {'lr': 1.2078936781099392e-05, 'samples': 25977024, 'steps': 135296, 'loss/train': 1.531657099723816} 11/07/2021 16:18:55 - INFO - __main__ - Step 135298: {'lr': 1.2077307248664294e-05, 'samples': 25977216, 'steps': 135297, 'loss/train': 1.4168705940246582} 11/07/2021 16:18:56 - INFO - __main__ - Step 135299: {'lr': 1.2075677823433495e-05, 'samples': 25977408, 'steps': 135298, 'loss/train': 1.2654615640640259} 11/07/2021 16:18:57 - INFO - __main__ - Step 135300: {'lr': 1.2074048505407747e-05, 'samples': 25977600, 'steps': 135299, 'loss/train': 1.172400712966919} 11/07/2021 16:18:57 - INFO - __main__ - Step 135301: {'lr': 1.2072419294587717e-05, 'samples': 25977792, 'steps': 135300, 'loss/train': 1.1265686750411987} 11/07/2021 16:18:57 - INFO - __main__ - Step 135302: {'lr': 1.2070790190974207e-05, 'samples': 25977984, 'steps': 135301, 'loss/train': 1.48321533203125} 11/07/2021 16:18:58 - INFO - __main__ - Step 135303: {'lr': 1.2069161194567913e-05, 'samples': 25978176, 'steps': 135302, 'loss/train': 1.6688258647918701} 11/07/2021 16:18:58 - INFO - __main__ - Step 135304: {'lr': 1.2067532305369611e-05, 'samples': 25978368, 'steps': 135303, 'loss/train': 0.37360838055610657} 11/07/2021 16:18:58 - INFO - __main__ - Step 135305: {'lr': 1.2065903523379968e-05, 'samples': 25978560, 'steps': 135304, 'loss/train': 1.6855653524398804} 11/07/2021 16:18:59 - INFO - __main__ - Step 135306: {'lr': 1.2064274848599788e-05, 'samples': 25978752, 'steps': 135305, 'loss/train': 1.1671653985977173} 11/07/2021 16:19:00 - INFO - __main__ - Step 135307: {'lr': 1.2062646281029765e-05, 'samples': 25978944, 'steps': 135306, 'loss/train': 1.409596562385559} 11/07/2021 16:19:00 - INFO - __main__ - Step 135308: {'lr': 1.2061017820670622e-05, 'samples': 25979136, 'steps': 135307, 'loss/train': 1.4859857559204102} 11/07/2021 16:19:00 - INFO - __main__ - Step 135309: {'lr': 1.2059389467523135e-05, 'samples': 25979328, 'steps': 135308, 'loss/train': 1.498705506324768} 11/07/2021 16:19:01 - INFO - __main__ - Step 135310: {'lr': 1.2057761221588025e-05, 'samples': 25979520, 'steps': 135309, 'loss/train': 1.5519375801086426} 11/07/2021 16:19:02 - INFO - __main__ - Step 135311: {'lr': 1.2056133082865989e-05, 'samples': 25979712, 'steps': 135310, 'loss/train': 1.3409314155578613} 11/07/2021 16:19:02 - INFO - __main__ - Step 135312: {'lr': 1.20545050513578e-05, 'samples': 25979904, 'steps': 135311, 'loss/train': 1.3839584589004517} 11/07/2021 16:19:03 - INFO - __main__ - Step 135313: {'lr': 1.205287712706421e-05, 'samples': 25980096, 'steps': 135312, 'loss/train': 1.0892620086669922} 11/07/2021 16:19:03 - INFO - __main__ - Step 135314: {'lr': 1.2051249309985912e-05, 'samples': 25980288, 'steps': 135313, 'loss/train': 1.0914572477340698} 11/07/2021 16:19:03 - INFO - __main__ - Step 135315: {'lr': 1.2049621600123629e-05, 'samples': 25980480, 'steps': 135314, 'loss/train': 1.0495047569274902} 11/07/2021 16:19:04 - INFO - __main__ - Step 135316: {'lr': 1.2047993997478108e-05, 'samples': 25980672, 'steps': 135315, 'loss/train': 1.0905869007110596} 11/07/2021 16:19:05 - INFO - __main__ - Step 135317: {'lr': 1.20463665020501e-05, 'samples': 25980864, 'steps': 135316, 'loss/train': 1.4645562171936035} 11/07/2021 16:19:05 - INFO - __main__ - Step 135318: {'lr': 1.2044739113840325e-05, 'samples': 25981056, 'steps': 135317, 'loss/train': 1.4474223852157593} 11/07/2021 16:19:05 - INFO - __main__ - Step 135319: {'lr': 1.2043111832849508e-05, 'samples': 25981248, 'steps': 135318, 'loss/train': 1.1149972677230835} 11/07/2021 16:19:06 - INFO - __main__ - Step 135320: {'lr': 1.2041484659078394e-05, 'samples': 25981440, 'steps': 135319, 'loss/train': 1.2755886316299438} 11/07/2021 16:19:07 - INFO - __main__ - Step 135321: {'lr': 1.2039857592527736e-05, 'samples': 25981632, 'steps': 135320, 'loss/train': 1.4493415355682373} 11/07/2021 16:19:07 - INFO - __main__ - Step 135322: {'lr': 1.2038230633198228e-05, 'samples': 25981824, 'steps': 135321, 'loss/train': 1.3471482992172241} 11/07/2021 16:19:08 - INFO - __main__ - Step 135323: {'lr': 1.2036603781090617e-05, 'samples': 25982016, 'steps': 135322, 'loss/train': 1.5002580881118774} 11/07/2021 16:19:08 - INFO - __main__ - Step 135324: {'lr': 1.2034977036205652e-05, 'samples': 25982208, 'steps': 135323, 'loss/train': 1.0600417852401733} 11/07/2021 16:19:08 - INFO - __main__ - Step 135325: {'lr': 1.2033350398544057e-05, 'samples': 25982400, 'steps': 135324, 'loss/train': 1.6013225317001343} 11/07/2021 16:19:09 - INFO - __main__ - Step 135326: {'lr': 1.2031723868106553e-05, 'samples': 25982592, 'steps': 135325, 'loss/train': 1.158591866493225} 11/07/2021 16:19:10 - INFO - __main__ - Step 135327: {'lr': 1.2030097444893917e-05, 'samples': 25982784, 'steps': 135326, 'loss/train': 0.8087483048439026} 11/07/2021 16:19:10 - INFO - __main__ - Step 135328: {'lr': 1.2028471128906814e-05, 'samples': 25982976, 'steps': 135327, 'loss/train': 1.3204816579818726} 11/07/2021 16:19:10 - INFO - __main__ - Step 135329: {'lr': 1.2026844920146024e-05, 'samples': 25983168, 'steps': 135328, 'loss/train': 1.4394574165344238} 11/07/2021 16:19:11 - INFO - __main__ - Step 135330: {'lr': 1.2025218818612238e-05, 'samples': 25983360, 'steps': 135329, 'loss/train': 1.3705565929412842} 11/07/2021 16:19:12 - INFO - __main__ - Step 135331: {'lr': 1.2023592824306234e-05, 'samples': 25983552, 'steps': 135330, 'loss/train': 0.8490267992019653} 11/07/2021 16:19:12 - INFO - __main__ - Step 135332: {'lr': 1.2021966937228734e-05, 'samples': 25983744, 'steps': 135331, 'loss/train': 0.8577817678451538} 11/07/2021 16:19:13 - INFO - __main__ - Step 135333: {'lr': 1.202034115738046e-05, 'samples': 25983936, 'steps': 135332, 'loss/train': 0.6184898018836975} 11/07/2021 16:19:13 - INFO - __main__ - Step 135334: {'lr': 1.2018715484762133e-05, 'samples': 25984128, 'steps': 135333, 'loss/train': 1.154670238494873} 11/07/2021 16:19:13 - INFO - __main__ - Step 135335: {'lr': 1.2017089919374501e-05, 'samples': 25984320, 'steps': 135334, 'loss/train': 1.515115737915039} 11/07/2021 16:19:14 - INFO - __main__ - Step 135336: {'lr': 1.2015464461218317e-05, 'samples': 25984512, 'steps': 135335, 'loss/train': 1.7609225511550903} 11/07/2021 16:19:15 - INFO - __main__ - Step 135337: {'lr': 1.2013839110294273e-05, 'samples': 25984704, 'steps': 135336, 'loss/train': 1.0381718873977661} 11/07/2021 16:19:15 - INFO - __main__ - Step 135338: {'lr': 1.2012213866603144e-05, 'samples': 25984896, 'steps': 135337, 'loss/train': 1.2726600170135498} 11/07/2021 16:19:15 - INFO - __main__ - Step 135339: {'lr': 1.20105887301456e-05, 'samples': 25985088, 'steps': 135338, 'loss/train': 1.6042379140853882} 11/07/2021 16:19:16 - INFO - __main__ - Step 135340: {'lr': 1.2008963700922471e-05, 'samples': 25985280, 'steps': 135339, 'loss/train': 1.35220468044281} 11/07/2021 16:19:16 - INFO - __main__ - Step 135341: {'lr': 1.2007338778934395e-05, 'samples': 25985472, 'steps': 135340, 'loss/train': 0.8904796838760376} 11/07/2021 16:19:17 - INFO - __main__ - Step 135342: {'lr': 1.2005713964182152e-05, 'samples': 25985664, 'steps': 135341, 'loss/train': 1.8456802368164062} 11/07/2021 16:19:18 - INFO - __main__ - Step 135343: {'lr': 1.2004089256666434e-05, 'samples': 25985856, 'steps': 135342, 'loss/train': 1.8679890632629395} 11/07/2021 16:19:18 - INFO - __main__ - Step 135344: {'lr': 1.2002464656388019e-05, 'samples': 25986048, 'steps': 135343, 'loss/train': 0.35980066657066345} 11/07/2021 16:19:18 - INFO - __main__ - Step 135345: {'lr': 1.2000840163347625e-05, 'samples': 25986240, 'steps': 135344, 'loss/train': 1.0330960750579834} 11/07/2021 16:19:19 - INFO - __main__ - Step 135346: {'lr': 1.1999215777545979e-05, 'samples': 25986432, 'steps': 135345, 'loss/train': 1.4596476554870605} 11/07/2021 16:19:19 - INFO - __main__ - Step 135347: {'lr': 1.1997591498983801e-05, 'samples': 25986624, 'steps': 135346, 'loss/train': 1.4901869297027588} 11/07/2021 16:19:20 - INFO - __main__ - Step 135348: {'lr': 1.1995967327661839e-05, 'samples': 25986816, 'steps': 135347, 'loss/train': 1.6851459741592407} 11/07/2021 16:19:20 - INFO - __main__ - Step 135349: {'lr': 1.1994343263580843e-05, 'samples': 25987008, 'steps': 135348, 'loss/train': 1.1932110786437988} 11/07/2021 16:19:21 - INFO - __main__ - Step 135350: {'lr': 1.1992719306741506e-05, 'samples': 25987200, 'steps': 135349, 'loss/train': 1.395734190940857} 11/07/2021 16:19:21 - INFO - __main__ - Step 135351: {'lr': 1.1991095457144551e-05, 'samples': 25987392, 'steps': 135350, 'loss/train': 1.0335239171981812} 11/07/2021 16:19:21 - INFO - __main__ - Step 135352: {'lr': 1.1989471714790784e-05, 'samples': 25987584, 'steps': 135351, 'loss/train': 1.1773396730422974} 11/07/2021 16:19:23 - INFO - __main__ - Step 135353: {'lr': 1.1987848079680896e-05, 'samples': 25987776, 'steps': 135352, 'loss/train': 1.7386728525161743} 11/07/2021 16:19:23 - INFO - __main__ - Step 135354: {'lr': 1.1986224551815583e-05, 'samples': 25987968, 'steps': 135353, 'loss/train': 0.8596045970916748} 11/07/2021 16:19:23 - INFO - __main__ - Step 135355: {'lr': 1.1984601131195593e-05, 'samples': 25988160, 'steps': 135354, 'loss/train': 1.3661881685256958} 11/07/2021 16:19:24 - INFO - __main__ - Step 135356: {'lr': 1.1982977817821677e-05, 'samples': 25988352, 'steps': 135355, 'loss/train': 1.5463366508483887} 11/07/2021 16:19:24 - INFO - __main__ - Step 135357: {'lr': 1.1981354611694556e-05, 'samples': 25988544, 'steps': 135356, 'loss/train': 1.431862711906433} 11/07/2021 16:19:25 - INFO - __main__ - Step 135358: {'lr': 1.197973151281498e-05, 'samples': 25988736, 'steps': 135357, 'loss/train': 0.9441678524017334} 11/07/2021 16:19:25 - INFO - __main__ - Step 135359: {'lr': 1.197810852118364e-05, 'samples': 25988928, 'steps': 135358, 'loss/train': 1.316064476966858} 11/07/2021 16:19:26 - INFO - __main__ - Step 135360: {'lr': 1.1976485636801315e-05, 'samples': 25989120, 'steps': 135359, 'loss/train': 1.192608118057251} 11/07/2021 16:19:26 - INFO - __main__ - Step 135361: {'lr': 1.19748628596687e-05, 'samples': 25989312, 'steps': 135360, 'loss/train': 1.254820466041565} 11/07/2021 16:19:26 - INFO - __main__ - Step 135362: {'lr': 1.1973240189786516e-05, 'samples': 25989504, 'steps': 135361, 'loss/train': 1.0520914793014526} 11/07/2021 16:19:27 - INFO - __main__ - Step 135363: {'lr': 1.197161762715554e-05, 'samples': 25989696, 'steps': 135362, 'loss/train': 1.7723714113235474} 11/07/2021 16:19:28 - INFO - __main__ - Step 135364: {'lr': 1.1969995171776493e-05, 'samples': 25989888, 'steps': 135363, 'loss/train': 1.2307099103927612} 11/07/2021 16:19:28 - INFO - __main__ - Step 135365: {'lr': 1.196837282365007e-05, 'samples': 25990080, 'steps': 135364, 'loss/train': 1.1233782768249512} 11/07/2021 16:19:29 - INFO - __main__ - Step 135366: {'lr': 1.1966750582777076e-05, 'samples': 25990272, 'steps': 135365, 'loss/train': 1.577817440032959} 11/07/2021 16:19:29 - INFO - __main__ - Step 135367: {'lr': 1.1965128449158146e-05, 'samples': 25990464, 'steps': 135366, 'loss/train': 1.1560426950454712} 11/07/2021 16:19:29 - INFO - __main__ - Step 135368: {'lr': 1.1963506422794063e-05, 'samples': 25990656, 'steps': 135367, 'loss/train': 1.4296314716339111} 11/07/2021 16:19:30 - INFO - __main__ - Step 135369: {'lr': 1.1961884503685544e-05, 'samples': 25990848, 'steps': 135368, 'loss/train': 5.348316192626953} 11/07/2021 16:19:31 - INFO - __main__ - Step 135370: {'lr': 1.196026269183334e-05, 'samples': 25991040, 'steps': 135369, 'loss/train': 1.1499065160751343} 11/07/2021 16:19:31 - INFO - __main__ - Step 135371: {'lr': 1.1958640987238146e-05, 'samples': 25991232, 'steps': 135370, 'loss/train': 1.4432164430618286} 11/07/2021 16:19:31 - INFO - __main__ - Step 135372: {'lr': 1.1957019389900736e-05, 'samples': 25991424, 'steps': 135371, 'loss/train': 1.3153777122497559} 11/07/2021 16:19:32 - INFO - __main__ - Step 135373: {'lr': 1.1955397899821807e-05, 'samples': 25991616, 'steps': 135372, 'loss/train': 1.4833418130874634} 11/07/2021 16:19:33 - INFO - __main__ - Step 135374: {'lr': 1.1953776517002107e-05, 'samples': 25991808, 'steps': 135373, 'loss/train': 1.2054685354232788} 11/07/2021 16:19:33 - INFO - __main__ - Step 135375: {'lr': 1.1952155241442358e-05, 'samples': 25992000, 'steps': 135374, 'loss/train': 0.9320528507232666} 11/07/2021 16:19:33 - INFO - __main__ - Step 135376: {'lr': 1.1950534073143309e-05, 'samples': 25992192, 'steps': 135375, 'loss/train': 1.3387373685836792} 11/07/2021 16:19:34 - INFO - __main__ - Step 135377: {'lr': 1.194891301210571e-05, 'samples': 25992384, 'steps': 135376, 'loss/train': 1.1948007345199585} 11/07/2021 16:19:34 - INFO - __main__ - Step 135378: {'lr': 1.19472920583302e-05, 'samples': 25992576, 'steps': 135377, 'loss/train': 1.658854365348816} 11/07/2021 16:19:36 - INFO - __main__ - Step 135379: {'lr': 1.1945671211817582e-05, 'samples': 25992768, 'steps': 135378, 'loss/train': 1.9166792631149292} 11/07/2021 16:19:36 - INFO - __main__ - Step 135380: {'lr': 1.194405047256858e-05, 'samples': 25992960, 'steps': 135379, 'loss/train': 1.2658944129943848} 11/07/2021 16:19:36 - INFO - __main__ - Step 135381: {'lr': 1.1942429840583884e-05, 'samples': 25993152, 'steps': 135380, 'loss/train': 1.4794583320617676} 11/07/2021 16:19:37 - INFO - __main__ - Step 135382: {'lr': 1.1940809315864275e-05, 'samples': 25993344, 'steps': 135381, 'loss/train': 1.4106208086013794} 11/07/2021 16:19:37 - INFO - __main__ - Step 135383: {'lr': 1.1939188898410474e-05, 'samples': 25993536, 'steps': 135382, 'loss/train': 0.9047016501426697} 11/07/2021 16:19:37 - INFO - __main__ - Step 135384: {'lr': 1.1937568588223202e-05, 'samples': 25993728, 'steps': 135383, 'loss/train': 1.3618885278701782} 11/07/2021 16:19:38 - INFO - __main__ - Step 135385: {'lr': 1.193594838530318e-05, 'samples': 25993920, 'steps': 135384, 'loss/train': 1.6072194576263428} 11/07/2021 16:19:39 - INFO - __main__ - Step 135386: {'lr': 1.1934328289651131e-05, 'samples': 25994112, 'steps': 135385, 'loss/train': 1.2522145509719849} 11/07/2021 16:19:39 - INFO - __main__ - Step 135387: {'lr': 1.1932708301267858e-05, 'samples': 25994304, 'steps': 135386, 'loss/train': 1.5783685445785522} 11/07/2021 16:19:40 - INFO - __main__ - Step 135388: {'lr': 1.1931088420153974e-05, 'samples': 25994496, 'steps': 135387, 'loss/train': 1.5567642450332642} 11/07/2021 16:19:40 - INFO - __main__ - Step 135389: {'lr': 1.192946864631031e-05, 'samples': 25994688, 'steps': 135388, 'loss/train': 0.717771589756012} 11/07/2021 16:19:40 - INFO - __main__ - Step 135390: {'lr': 1.1927848979737505e-05, 'samples': 25994880, 'steps': 135389, 'loss/train': 1.5904935598373413} 11/07/2021 16:19:41 - INFO - __main__ - Step 135391: {'lr': 1.1926229420436362e-05, 'samples': 25995072, 'steps': 135390, 'loss/train': 1.140519618988037} 11/07/2021 16:19:42 - INFO - __main__ - Step 135392: {'lr': 1.1924609968407579e-05, 'samples': 25995264, 'steps': 135391, 'loss/train': 0.9725397825241089} 11/07/2021 16:19:42 - INFO - __main__ - Step 135393: {'lr': 1.1922990623651902e-05, 'samples': 25995456, 'steps': 135392, 'loss/train': 1.1002498865127563} 11/07/2021 16:19:42 - INFO - __main__ - Step 135394: {'lr': 1.1921371386170054e-05, 'samples': 25995648, 'steps': 135393, 'loss/train': 1.7734147310256958} 11/07/2021 16:19:43 - INFO - __main__ - Step 135395: {'lr': 1.1919752255962756e-05, 'samples': 25995840, 'steps': 135394, 'loss/train': 1.7334210872650146} 11/07/2021 16:19:44 - INFO - __main__ - Step 135396: {'lr': 1.1918133233030731e-05, 'samples': 25996032, 'steps': 135395, 'loss/train': 1.611961841583252} 11/07/2021 16:19:44 - INFO - __main__ - Step 135397: {'lr': 1.1916514317374755e-05, 'samples': 25996224, 'steps': 135396, 'loss/train': 1.4598093032836914} 11/07/2021 16:19:44 - INFO - __main__ - Step 135398: {'lr': 1.1914895508995521e-05, 'samples': 25996416, 'steps': 135397, 'loss/train': 1.5406957864761353} 11/07/2021 16:19:45 - INFO - __main__ - Step 135399: {'lr': 1.1913276807893753e-05, 'samples': 25996608, 'steps': 135398, 'loss/train': 1.401227355003357} 11/07/2021 16:19:45 - INFO - __main__ - Step 135400: {'lr': 1.191165821407017e-05, 'samples': 25996800, 'steps': 135399, 'loss/train': 1.0855149030685425} 11/07/2021 16:19:46 - INFO - __main__ - Step 135401: {'lr': 1.1910039727525524e-05, 'samples': 25996992, 'steps': 135400, 'loss/train': 1.6564921140670776} 11/07/2021 16:19:47 - INFO - __main__ - Step 135402: {'lr': 1.1908421348260563e-05, 'samples': 25997184, 'steps': 135401, 'loss/train': 1.173321008682251} 11/07/2021 16:19:47 - INFO - __main__ - Step 135403: {'lr': 1.1906803076275951e-05, 'samples': 25997376, 'steps': 135402, 'loss/train': 0.9077402949333191} 11/07/2021 16:19:47 - INFO - __main__ - Step 135404: {'lr': 1.1905184911572498e-05, 'samples': 25997568, 'steps': 135403, 'loss/train': 1.2446706295013428} 11/07/2021 16:19:48 - INFO - __main__ - Step 135405: {'lr': 1.1903566854150866e-05, 'samples': 25997760, 'steps': 135404, 'loss/train': 1.7444937229156494} 11/07/2021 16:19:50 - INFO - __main__ - Step 135406: {'lr': 1.1901948904011834e-05, 'samples': 25997952, 'steps': 135405, 'loss/train': 1.0971659421920776} 11/07/2021 16:19:50 - INFO - __main__ - Step 135407: {'lr': 1.1900331061156094e-05, 'samples': 25998144, 'steps': 135406, 'loss/train': 1.2139188051223755} 11/07/2021 16:19:50 - INFO - __main__ - Step 135408: {'lr': 1.1898713325584399e-05, 'samples': 25998336, 'steps': 135407, 'loss/train': 1.7470518350601196} 11/07/2021 16:19:51 - INFO - __main__ - Step 135409: {'lr': 1.1897095697297522e-05, 'samples': 25998528, 'steps': 135408, 'loss/train': 1.7356867790222168} 11/07/2021 16:19:51 - INFO - __main__ - Step 135410: {'lr': 1.1895478176296076e-05, 'samples': 25998720, 'steps': 135409, 'loss/train': 1.7302474975585938} 11/07/2021 16:19:51 - INFO - __main__ - Step 135411: {'lr': 1.1893860762580866e-05, 'samples': 25998912, 'steps': 135410, 'loss/train': 1.3163864612579346} 11/07/2021 16:19:52 - INFO - __main__ - Step 135412: {'lr': 1.1892243456152613e-05, 'samples': 25999104, 'steps': 135411, 'loss/train': 1.5276018381118774} 11/07/2021 16:19:53 - INFO - __main__ - Step 135413: {'lr': 1.1890626257012038e-05, 'samples': 25999296, 'steps': 135412, 'loss/train': 1.3326078653335571} 11/07/2021 16:19:53 - INFO - __main__ - Step 135414: {'lr': 1.1889009165159865e-05, 'samples': 25999488, 'steps': 135413, 'loss/train': 1.0974019765853882} 11/07/2021 16:19:54 - INFO - __main__ - Step 135415: {'lr': 1.188739218059684e-05, 'samples': 25999680, 'steps': 135414, 'loss/train': 1.0128642320632935} 11/07/2021 16:19:54 - INFO - __main__ - Step 135416: {'lr': 1.188577530332366e-05, 'samples': 25999872, 'steps': 135415, 'loss/train': 1.8137965202331543} 11/07/2021 16:19:54 - INFO - __main__ - Step 135417: {'lr': 1.18841585333411e-05, 'samples': 26000064, 'steps': 135416, 'loss/train': 1.0487949848175049} 11/07/2021 16:19:55 - INFO - __main__ - Step 135418: {'lr': 1.1882541870649854e-05, 'samples': 26000256, 'steps': 135417, 'loss/train': 1.3415952920913696} 11/07/2021 16:19:56 - INFO - __main__ - Step 135419: {'lr': 1.1880925315250645e-05, 'samples': 26000448, 'steps': 135418, 'loss/train': 0.3508140444755554} 11/07/2021 16:19:56 - INFO - __main__ - Step 135420: {'lr': 1.187930886714425e-05, 'samples': 26000640, 'steps': 135419, 'loss/train': 1.3275854587554932} 11/07/2021 16:19:56 - INFO - __main__ - Step 135421: {'lr': 1.187769252633139e-05, 'samples': 26000832, 'steps': 135420, 'loss/train': 1.610580325126648} 11/07/2021 16:19:57 - INFO - __main__ - Step 135422: {'lr': 1.1876076292812704e-05, 'samples': 26001024, 'steps': 135421, 'loss/train': 1.6721563339233398} 11/07/2021 16:19:58 - INFO - __main__ - Step 135423: {'lr': 1.1874460166589024e-05, 'samples': 26001216, 'steps': 135422, 'loss/train': 1.2531054019927979} 11/07/2021 16:19:58 - INFO - __main__ - Step 135424: {'lr': 1.1872844147661015e-05, 'samples': 26001408, 'steps': 135423, 'loss/train': 0.7496057748794556} 11/07/2021 16:19:58 - INFO - __main__ - Step 135425: {'lr': 1.187122823602943e-05, 'samples': 26001600, 'steps': 135424, 'loss/train': 0.19541223347187042} 11/07/2021 16:19:59 - INFO - __main__ - Step 135426: {'lr': 1.1869612431694987e-05, 'samples': 26001792, 'steps': 135425, 'loss/train': 1.6092561483383179} 11/07/2021 16:19:59 - INFO - __main__ - Step 135427: {'lr': 1.1867996734658438e-05, 'samples': 26001984, 'steps': 135426, 'loss/train': 1.060539722442627} 11/07/2021 16:19:59 - INFO - __main__ - Step 135428: {'lr': 1.1866381144920475e-05, 'samples': 26002176, 'steps': 135427, 'loss/train': 1.2649413347244263} 11/07/2021 16:20:01 - INFO - __main__ - Step 135429: {'lr': 1.1864765662481874e-05, 'samples': 26002368, 'steps': 135428, 'loss/train': 1.9343372583389282} 11/07/2021 16:20:01 - INFO - __main__ - Step 135430: {'lr': 1.1863150287343305e-05, 'samples': 26002560, 'steps': 135429, 'loss/train': 1.4700347185134888} 11/07/2021 16:20:01 - INFO - __main__ - Step 135431: {'lr': 1.1861535019505543e-05, 'samples': 26002752, 'steps': 135430, 'loss/train': 1.4985945224761963} 11/07/2021 16:20:02 - INFO - __main__ - Step 135432: {'lr': 1.1859919858969308e-05, 'samples': 26002944, 'steps': 135431, 'loss/train': 1.1875102519989014} 11/07/2021 16:20:02 - INFO - __main__ - Step 135433: {'lr': 1.1858304805735297e-05, 'samples': 26003136, 'steps': 135432, 'loss/train': 1.483963131904602} 11/07/2021 16:20:03 - INFO - __main__ - Step 135434: {'lr': 1.1856689859804315e-05, 'samples': 26003328, 'steps': 135433, 'loss/train': 0.7451053857803345} 11/07/2021 16:20:03 - INFO - __main__ - Step 135435: {'lr': 1.185507502117697e-05, 'samples': 26003520, 'steps': 135434, 'loss/train': 1.2521741390228271} 11/07/2021 16:20:04 - INFO - __main__ - Step 135436: {'lr': 1.1853460289854068e-05, 'samples': 26003712, 'steps': 135435, 'loss/train': 1.3979345560073853} 11/07/2021 16:20:04 - INFO - __main__ - Step 135437: {'lr': 1.185184566583633e-05, 'samples': 26003904, 'steps': 135436, 'loss/train': 1.4490883350372314} 11/07/2021 16:20:04 - INFO - __main__ - Step 135438: {'lr': 1.1850231149124479e-05, 'samples': 26004096, 'steps': 135437, 'loss/train': 1.3243931531906128} 11/07/2021 16:20:06 - INFO - __main__ - Step 135439: {'lr': 1.1848616739719208e-05, 'samples': 26004288, 'steps': 135438, 'loss/train': 1.342621922492981} 11/07/2021 16:20:06 - INFO - __main__ - Step 135440: {'lr': 1.1847002437621323e-05, 'samples': 26004480, 'steps': 135439, 'loss/train': 1.1205976009368896} 11/07/2021 16:20:06 - INFO - __main__ - Step 135441: {'lr': 1.1845388242831462e-05, 'samples': 26004672, 'steps': 135440, 'loss/train': 1.3556212186813354} 11/07/2021 16:20:07 - INFO - __main__ - Step 135442: {'lr': 1.1843774155350429e-05, 'samples': 26004864, 'steps': 135441, 'loss/train': 1.5059634447097778} 11/07/2021 16:20:07 - INFO - __main__ - Step 135443: {'lr': 1.1842160175178889e-05, 'samples': 26005056, 'steps': 135442, 'loss/train': 1.3226360082626343} 11/07/2021 16:20:07 - INFO - __main__ - Step 135444: {'lr': 1.1840546302317596e-05, 'samples': 26005248, 'steps': 135443, 'loss/train': 1.589189052581787} 11/07/2021 16:20:08 - INFO - __main__ - Step 135445: {'lr': 1.1838932536767295e-05, 'samples': 26005440, 'steps': 135444, 'loss/train': 1.0400927066802979} 11/07/2021 16:20:09 - INFO - __main__ - Step 135446: {'lr': 1.183731887852868e-05, 'samples': 26005632, 'steps': 135445, 'loss/train': 1.9721081256866455} 11/07/2021 16:20:09 - INFO - __main__ - Step 135447: {'lr': 1.1835705327602503e-05, 'samples': 26005824, 'steps': 135446, 'loss/train': 1.186622977256775} 11/07/2021 16:20:09 - INFO - __main__ - Step 135448: {'lr': 1.1834091883989513e-05, 'samples': 26006016, 'steps': 135447, 'loss/train': 1.3498060703277588} 11/07/2021 16:20:10 - INFO - __main__ - Step 135449: {'lr': 1.1832478547690373e-05, 'samples': 26006208, 'steps': 135448, 'loss/train': 1.053752064704895} 11/07/2021 16:20:11 - INFO - __main__ - Step 135450: {'lr': 1.1830865318705863e-05, 'samples': 26006400, 'steps': 135449, 'loss/train': 1.1536301374435425} 11/07/2021 16:20:11 - INFO - __main__ - Step 135451: {'lr': 1.1829252197036649e-05, 'samples': 26006592, 'steps': 135450, 'loss/train': 1.3928192853927612} 11/07/2021 16:20:11 - INFO - __main__ - Step 135452: {'lr': 1.1827639182683536e-05, 'samples': 26006784, 'steps': 135451, 'loss/train': 1.222410798072815} 11/07/2021 16:20:12 - INFO - __main__ - Step 135453: {'lr': 1.1826026275647189e-05, 'samples': 26006976, 'steps': 135452, 'loss/train': 1.3446846008300781} 11/07/2021 16:20:12 - INFO - __main__ - Step 135454: {'lr': 1.1824413475928386e-05, 'samples': 26007168, 'steps': 135453, 'loss/train': 1.1990811824798584} 11/07/2021 16:20:13 - INFO - __main__ - Step 135455: {'lr': 1.1822800783527794e-05, 'samples': 26007360, 'steps': 135454, 'loss/train': 1.334486484527588} 11/07/2021 16:20:14 - INFO - __main__ - Step 135456: {'lr': 1.1821188198446187e-05, 'samples': 26007552, 'steps': 135455, 'loss/train': 0.9627283215522766} 11/07/2021 16:20:14 - INFO - __main__ - Step 135457: {'lr': 1.181957572068429e-05, 'samples': 26007744, 'steps': 135456, 'loss/train': 1.2921619415283203} 11/07/2021 16:20:14 - INFO - __main__ - Step 135458: {'lr': 1.1817963350242794e-05, 'samples': 26007936, 'steps': 135457, 'loss/train': 1.5541749000549316} 11/07/2021 16:20:15 - INFO - __main__ - Step 135459: {'lr': 1.1816351087122479e-05, 'samples': 26008128, 'steps': 135458, 'loss/train': 1.3695061206817627} 11/07/2021 16:20:17 - INFO - __main__ - Step 135460: {'lr': 1.181473893132401e-05, 'samples': 26008320, 'steps': 135459, 'loss/train': 1.3805749416351318} 11/07/2021 16:20:17 - INFO - __main__ - Step 135461: {'lr': 1.181312688284819e-05, 'samples': 26008512, 'steps': 135460, 'loss/train': 1.954480528831482} 11/07/2021 16:20:17 - INFO - __main__ - Step 135462: {'lr': 1.181151494169566e-05, 'samples': 26008704, 'steps': 135461, 'loss/train': 1.4758977890014648} 11/07/2021 16:20:18 - INFO - __main__ - Step 135463: {'lr': 1.1809903107867198e-05, 'samples': 26008896, 'steps': 135462, 'loss/train': 1.443830966949463} 11/07/2021 16:20:18 - INFO - __main__ - Step 135464: {'lr': 1.1808291381363522e-05, 'samples': 26009088, 'steps': 135463, 'loss/train': 1.0473750829696655} 11/07/2021 16:20:18 - INFO - __main__ - Step 135465: {'lr': 1.1806679762185329e-05, 'samples': 26009280, 'steps': 135464, 'loss/train': 1.0591610670089722} 11/07/2021 16:20:19 - INFO - __main__ - Step 135466: {'lr': 1.1805068250333395e-05, 'samples': 26009472, 'steps': 135465, 'loss/train': 1.7090990543365479} 11/07/2021 16:20:20 - INFO - __main__ - Step 135467: {'lr': 1.1803456845808413e-05, 'samples': 26009664, 'steps': 135466, 'loss/train': 1.240109920501709} 11/07/2021 16:20:20 - INFO - __main__ - Step 135468: {'lr': 1.1801845548611106e-05, 'samples': 26009856, 'steps': 135467, 'loss/train': 1.1193368434906006} 11/07/2021 16:20:20 - INFO - __main__ - Step 135469: {'lr': 1.1800234358742223e-05, 'samples': 26010048, 'steps': 135468, 'loss/train': 1.0983139276504517} 11/07/2021 16:20:21 - INFO - __main__ - Step 135470: {'lr': 1.1798623276202486e-05, 'samples': 26010240, 'steps': 135469, 'loss/train': 1.3250682353973389} 11/07/2021 16:20:21 - INFO - __main__ - Step 135471: {'lr': 1.1797012300992616e-05, 'samples': 26010432, 'steps': 135470, 'loss/train': 1.142865538597107} 11/07/2021 16:20:21 - INFO - __main__ - Step 135472: {'lr': 1.1795401433113306e-05, 'samples': 26010624, 'steps': 135471, 'loss/train': 1.2926826477050781} 11/07/2021 16:20:23 - INFO - __main__ - Step 135473: {'lr': 1.1793790672565336e-05, 'samples': 26010816, 'steps': 135472, 'loss/train': 1.1645270586013794} 11/07/2021 16:20:23 - INFO - __main__ - Step 135474: {'lr': 1.1792180019349452e-05, 'samples': 26011008, 'steps': 135473, 'loss/train': 1.2548843622207642} 11/07/2021 16:20:23 - INFO - __main__ - Step 135475: {'lr': 1.1790569473466295e-05, 'samples': 26011200, 'steps': 135474, 'loss/train': 0.723686695098877} 11/07/2021 16:20:24 - INFO - __main__ - Step 135476: {'lr': 1.1788959034916614e-05, 'samples': 26011392, 'steps': 135475, 'loss/train': 1.3303331136703491} 11/07/2021 16:20:24 - INFO - __main__ - Step 135477: {'lr': 1.1787348703701157e-05, 'samples': 26011584, 'steps': 135476, 'loss/train': 1.5611934661865234} 11/07/2021 16:20:25 - INFO - __main__ - Step 135478: {'lr': 1.1785738479820673e-05, 'samples': 26011776, 'steps': 135477, 'loss/train': 1.8349666595458984} 11/07/2021 16:20:25 - INFO - __main__ - Step 135479: {'lr': 1.178412836327583e-05, 'samples': 26011968, 'steps': 135478, 'loss/train': 1.5016038417816162} 11/07/2021 16:20:26 - INFO - __main__ - Step 135480: {'lr': 1.1782518354067379e-05, 'samples': 26012160, 'steps': 135479, 'loss/train': 1.2828108072280884} 11/07/2021 16:20:26 - INFO - __main__ - Step 135481: {'lr': 1.1780908452196065e-05, 'samples': 26012352, 'steps': 135480, 'loss/train': 0.5687029957771301} 11/07/2021 16:20:26 - INFO - __main__ - Step 135482: {'lr': 1.1779298657662613e-05, 'samples': 26012544, 'steps': 135481, 'loss/train': 1.7141611576080322} 11/07/2021 16:20:28 - INFO - __main__ - Step 135483: {'lr': 1.1777688970467714e-05, 'samples': 26012736, 'steps': 135482, 'loss/train': 1.2299891710281372} 11/07/2021 16:20:28 - INFO - __main__ - Step 135484: {'lr': 1.1776079390612093e-05, 'samples': 26012928, 'steps': 135483, 'loss/train': 1.3086594343185425} 11/07/2021 16:20:28 - INFO - __main__ - Step 135485: {'lr': 1.1774469918096525e-05, 'samples': 26013120, 'steps': 135484, 'loss/train': 1.623218059539795} 11/07/2021 16:20:29 - INFO - __main__ - Step 135486: {'lr': 1.1772860552921677e-05, 'samples': 26013312, 'steps': 135485, 'loss/train': 1.5075713396072388} 11/07/2021 16:20:29 - INFO - __main__ - Step 135487: {'lr': 1.1771251295088325e-05, 'samples': 26013504, 'steps': 135486, 'loss/train': 1.1546231508255005} 11/07/2021 16:20:30 - INFO - __main__ - Step 135488: {'lr': 1.176964214459722e-05, 'samples': 26013696, 'steps': 135487, 'loss/train': 0.913125216960907} 11/07/2021 16:20:30 - INFO - __main__ - Step 135489: {'lr': 1.1768033101448972e-05, 'samples': 26013888, 'steps': 135488, 'loss/train': 1.66291081905365} 11/07/2021 16:20:31 - INFO - __main__ - Step 135490: {'lr': 1.1766424165644385e-05, 'samples': 26014080, 'steps': 135489, 'loss/train': 1.611246943473816} 11/07/2021 16:20:31 - INFO - __main__ - Step 135491: {'lr': 1.1764815337184154e-05, 'samples': 26014272, 'steps': 135490, 'loss/train': 1.1498479843139648} 11/07/2021 16:20:31 - INFO - __main__ - Step 135492: {'lr': 1.1763206616069056e-05, 'samples': 26014464, 'steps': 135491, 'loss/train': 1.6814570426940918} 11/07/2021 16:20:32 - INFO - __main__ - Step 135493: {'lr': 1.1761598002299756e-05, 'samples': 26014656, 'steps': 135492, 'loss/train': 1.3214752674102783} 11/07/2021 16:20:33 - INFO - __main__ - Step 135494: {'lr': 1.1759989495877005e-05, 'samples': 26014848, 'steps': 135493, 'loss/train': 1.7512274980545044} 11/07/2021 16:20:33 - INFO - __main__ - Step 135495: {'lr': 1.1758381096801524e-05, 'samples': 26015040, 'steps': 135494, 'loss/train': 1.0215866565704346} 11/07/2021 16:20:33 - INFO - __main__ - Step 135496: {'lr': 1.1756772805074061e-05, 'samples': 26015232, 'steps': 135495, 'loss/train': 1.3746618032455444} 11/07/2021 16:20:34 - INFO - __main__ - Step 135497: {'lr': 1.1755164620695314e-05, 'samples': 26015424, 'steps': 135496, 'loss/train': 1.5633397102355957} 11/07/2021 16:20:34 - INFO - __main__ - Step 135498: {'lr': 1.1753556543666e-05, 'samples': 26015616, 'steps': 135497, 'loss/train': 1.3086090087890625} 11/07/2021 16:20:35 - INFO - __main__ - Step 135499: {'lr': 1.1751948573986843e-05, 'samples': 26015808, 'steps': 135498, 'loss/train': 1.5123586654663086} 11/07/2021 16:20:36 - INFO - __main__ - Step 135500: {'lr': 1.175034071165862e-05, 'samples': 26016000, 'steps': 135499, 'loss/train': 1.3773154020309448} 11/07/2021 16:20:36 - INFO - __main__ - Step 135501: {'lr': 1.1748732956682024e-05, 'samples': 26016192, 'steps': 135500, 'loss/train': 1.2105755805969238} 11/07/2021 16:20:36 - INFO - __main__ - Step 135502: {'lr': 1.174712530905775e-05, 'samples': 26016384, 'steps': 135501, 'loss/train': 1.260170340538025} 11/07/2021 16:20:37 - INFO - __main__ - Step 135503: {'lr': 1.1745517768786545e-05, 'samples': 26016576, 'steps': 135502, 'loss/train': 0.9831538796424866} 11/07/2021 16:20:38 - INFO - __main__ - Step 135504: {'lr': 1.1743910335869135e-05, 'samples': 26016768, 'steps': 135503, 'loss/train': 1.2974809408187866} 11/07/2021 16:20:38 - INFO - __main__ - Step 135505: {'lr': 1.174230301030621e-05, 'samples': 26016960, 'steps': 135504, 'loss/train': 1.003860354423523} 11/07/2021 16:20:38 - INFO - __main__ - Step 135506: {'lr': 1.1740695792098577e-05, 'samples': 26017152, 'steps': 135505, 'loss/train': 1.3921678066253662} 11/07/2021 16:20:39 - INFO - __main__ - Step 135507: {'lr': 1.1739088681246873e-05, 'samples': 26017344, 'steps': 135506, 'loss/train': 1.1644716262817383} 11/07/2021 16:20:39 - INFO - __main__ - Step 135508: {'lr': 1.1737481677751877e-05, 'samples': 26017536, 'steps': 135507, 'loss/train': 0.7668763399124146} 11/07/2021 16:20:40 - INFO - __main__ - Step 135509: {'lr': 1.1735874781614281e-05, 'samples': 26017728, 'steps': 135508, 'loss/train': 1.2749264240264893} 11/07/2021 16:20:41 - INFO - __main__ - Step 135510: {'lr': 1.1734267992834834e-05, 'samples': 26017920, 'steps': 135509, 'loss/train': 1.2330207824707031} 11/07/2021 16:20:41 - INFO - __main__ - Step 135511: {'lr': 1.173266131141426e-05, 'samples': 26018112, 'steps': 135510, 'loss/train': 1.2921051979064941} 11/07/2021 16:20:41 - INFO - __main__ - Step 135512: {'lr': 1.1731054737353252e-05, 'samples': 26018304, 'steps': 135511, 'loss/train': 0.9379370212554932} 11/07/2021 16:20:42 - INFO - __main__ - Step 135513: {'lr': 1.1729448270652559e-05, 'samples': 26018496, 'steps': 135512, 'loss/train': 0.9494053721427917} 11/07/2021 16:20:43 - INFO - __main__ - Step 135514: {'lr': 1.1727841911312902e-05, 'samples': 26018688, 'steps': 135513, 'loss/train': 0.7414166331291199} 11/07/2021 16:20:43 - INFO - __main__ - Step 135515: {'lr': 1.1726235659335032e-05, 'samples': 26018880, 'steps': 135514, 'loss/train': 1.4608765840530396} 11/07/2021 16:20:43 - INFO - __main__ - Step 135516: {'lr': 1.1724629514719642e-05, 'samples': 26019072, 'steps': 135515, 'loss/train': 1.2993113994598389} 11/07/2021 16:20:44 - INFO - __main__ - Step 135517: {'lr': 1.1723023477467426e-05, 'samples': 26019264, 'steps': 135516, 'loss/train': 1.6079480648040771} 11/07/2021 16:20:44 - INFO - __main__ - Step 135518: {'lr': 1.1721417547579134e-05, 'samples': 26019456, 'steps': 135517, 'loss/train': 1.3645615577697754} 11/07/2021 16:20:45 - INFO - __main__ - Step 135519: {'lr': 1.1719811725055513e-05, 'samples': 26019648, 'steps': 135518, 'loss/train': 0.9841102361679077} 11/07/2021 16:20:45 - INFO - __main__ - Step 135520: {'lr': 1.1718206009897259e-05, 'samples': 26019840, 'steps': 135519, 'loss/train': 1.1083239316940308} 11/07/2021 16:20:46 - INFO - __main__ - Step 135521: {'lr': 1.1716600402105092e-05, 'samples': 26020032, 'steps': 135520, 'loss/train': 1.1100976467132568} 11/07/2021 16:20:46 - INFO - __main__ - Step 135522: {'lr': 1.1714994901679765e-05, 'samples': 26020224, 'steps': 135521, 'loss/train': 1.3159338235855103} 11/07/2021 16:20:46 - INFO - __main__ - Step 135523: {'lr': 1.1713389508621969e-05, 'samples': 26020416, 'steps': 135522, 'loss/train': 1.6622551679611206} 11/07/2021 16:20:48 - INFO - __main__ - Step 135524: {'lr': 1.1711784222932453e-05, 'samples': 26020608, 'steps': 135523, 'loss/train': 0.9294440150260925} 11/07/2021 16:20:48 - INFO - __main__ - Step 135525: {'lr': 1.171017904461194e-05, 'samples': 26020800, 'steps': 135524, 'loss/train': 1.4554908275604248} 11/07/2021 16:20:48 - INFO - __main__ - Step 135526: {'lr': 1.1708573973661125e-05, 'samples': 26020992, 'steps': 135525, 'loss/train': 1.3808578252792358} 11/07/2021 16:20:49 - INFO - __main__ - Step 135527: {'lr': 1.1706969010080754e-05, 'samples': 26021184, 'steps': 135526, 'loss/train': 0.5964181423187256} 11/07/2021 16:20:49 - INFO - __main__ - Step 135528: {'lr': 1.1705364153871578e-05, 'samples': 26021376, 'steps': 135527, 'loss/train': 1.828322410583496} 11/07/2021 16:20:49 - INFO - __main__ - Step 135529: {'lr': 1.1703759405034265e-05, 'samples': 26021568, 'steps': 135528, 'loss/train': 0.3774632513523102} 11/07/2021 16:20:50 - INFO - __main__ - Step 135530: {'lr': 1.1702154763569562e-05, 'samples': 26021760, 'steps': 135529, 'loss/train': 1.0280767679214478} 11/07/2021 16:20:51 - INFO - __main__ - Step 135531: {'lr': 1.170055022947819e-05, 'samples': 26021952, 'steps': 135530, 'loss/train': 1.306445837020874} 11/07/2021 16:20:51 - INFO - __main__ - Step 135532: {'lr': 1.1698945802760873e-05, 'samples': 26022144, 'steps': 135531, 'loss/train': 1.3953760862350464} 11/07/2021 16:20:51 - INFO - __main__ - Step 135533: {'lr': 1.1697341483418306e-05, 'samples': 26022336, 'steps': 135532, 'loss/train': 0.9095454812049866} 11/07/2021 16:20:52 - INFO - __main__ - Step 135534: {'lr': 1.1695737271451263e-05, 'samples': 26022528, 'steps': 135533, 'loss/train': 1.5054157972335815} 11/07/2021 16:20:53 - INFO - __main__ - Step 135535: {'lr': 1.1694133166860438e-05, 'samples': 26022720, 'steps': 135534, 'loss/train': 1.1443450450897217} 11/07/2021 16:20:54 - INFO - __main__ - Step 135536: {'lr': 1.1692529169646582e-05, 'samples': 26022912, 'steps': 135535, 'loss/train': 1.1591914892196655} 11/07/2021 16:20:54 - INFO - __main__ - Step 135537: {'lr': 1.169092527981036e-05, 'samples': 26023104, 'steps': 135536, 'loss/train': 0.5236815810203552} 11/07/2021 16:20:54 - INFO - __main__ - Step 135538: {'lr': 1.1689321497352552e-05, 'samples': 26023296, 'steps': 135537, 'loss/train': 0.49768102169036865} 11/07/2021 16:20:55 - INFO - __main__ - Step 135539: {'lr': 1.1687717822273847e-05, 'samples': 26023488, 'steps': 135538, 'loss/train': 1.531158685684204} 11/07/2021 16:20:56 - INFO - __main__ - Step 135540: {'lr': 1.1686114254574998e-05, 'samples': 26023680, 'steps': 135539, 'loss/train': 1.362999677658081} 11/07/2021 16:20:56 - INFO - __main__ - Step 135541: {'lr': 1.1684510794256698e-05, 'samples': 26023872, 'steps': 135540, 'loss/train': 0.5759204626083374} 11/07/2021 16:20:57 - INFO - __main__ - Step 135542: {'lr': 1.1682907441319695e-05, 'samples': 26024064, 'steps': 135541, 'loss/train': 1.731531023979187} 11/07/2021 16:20:57 - INFO - __main__ - Step 135543: {'lr': 1.1681304195764686e-05, 'samples': 26024256, 'steps': 135542, 'loss/train': 2.1361892223358154} 11/07/2021 16:20:57 - INFO - __main__ - Step 135544: {'lr': 1.167970105759239e-05, 'samples': 26024448, 'steps': 135543, 'loss/train': 1.1947540044784546} 11/07/2021 16:20:58 - INFO - __main__ - Step 135545: {'lr': 1.1678098026803557e-05, 'samples': 26024640, 'steps': 135544, 'loss/train': 1.1451823711395264} 11/07/2021 16:20:59 - INFO - __main__ - Step 135546: {'lr': 1.1676495103398883e-05, 'samples': 26024832, 'steps': 135545, 'loss/train': 0.15549226105213165} 11/07/2021 16:20:59 - INFO - __main__ - Step 135547: {'lr': 1.1674892287379113e-05, 'samples': 26025024, 'steps': 135546, 'loss/train': 1.3537862300872803} 11/07/2021 16:21:00 - INFO - __main__ - Step 135548: {'lr': 1.1673289578744945e-05, 'samples': 26025216, 'steps': 135547, 'loss/train': 1.4416077136993408} 11/07/2021 16:21:00 - INFO - __main__ - Step 135549: {'lr': 1.1671686977497126e-05, 'samples': 26025408, 'steps': 135548, 'loss/train': 1.3611621856689453} 11/07/2021 16:21:00 - INFO - __main__ - Step 135550: {'lr': 1.1670084483636378e-05, 'samples': 26025600, 'steps': 135549, 'loss/train': 1.5043284893035889} 11/07/2021 16:21:01 - INFO - __main__ - Step 135551: {'lr': 1.1668482097163396e-05, 'samples': 26025792, 'steps': 135550, 'loss/train': 1.4912397861480713} 11/07/2021 16:21:02 - INFO - __main__ - Step 135552: {'lr': 1.166687981807893e-05, 'samples': 26025984, 'steps': 135551, 'loss/train': 1.2462400197982788} 11/07/2021 16:21:02 - INFO - __main__ - Step 135553: {'lr': 1.1665277646383672e-05, 'samples': 26026176, 'steps': 135552, 'loss/train': 1.45615816116333} 11/07/2021 16:21:02 - INFO - __main__ - Step 135554: {'lr': 1.166367558207837e-05, 'samples': 26026368, 'steps': 135553, 'loss/train': 1.467131495475769} 11/07/2021 16:21:03 - INFO - __main__ - Step 135555: {'lr': 1.1662073625163777e-05, 'samples': 26026560, 'steps': 135554, 'loss/train': 1.2850122451782227} 11/07/2021 16:21:03 - INFO - __main__ - Step 135556: {'lr': 1.166047177564053e-05, 'samples': 26026752, 'steps': 135555, 'loss/train': 1.0464520454406738} 11/07/2021 16:21:04 - INFO - __main__ - Step 135557: {'lr': 1.1658870033509405e-05, 'samples': 26026944, 'steps': 135556, 'loss/train': 1.268729329109192} 11/07/2021 16:21:04 - INFO - __main__ - Step 135558: {'lr': 1.1657268398771126e-05, 'samples': 26027136, 'steps': 135557, 'loss/train': 1.5029581785202026} 11/07/2021 16:21:05 - INFO - __main__ - Step 135559: {'lr': 1.1655666871426384e-05, 'samples': 26027328, 'steps': 135558, 'loss/train': 1.6728875637054443} 11/07/2021 16:21:05 - INFO - __main__ - Step 135560: {'lr': 1.1654065451475932e-05, 'samples': 26027520, 'steps': 135559, 'loss/train': 1.4044629335403442} 11/07/2021 16:21:05 - INFO - __main__ - Step 135561: {'lr': 1.1652464138920488e-05, 'samples': 26027712, 'steps': 135560, 'loss/train': 1.200547456741333} 11/07/2021 16:21:07 - INFO - __main__ - Step 135562: {'lr': 1.1650862933760747e-05, 'samples': 26027904, 'steps': 135561, 'loss/train': 1.1216074228286743} 11/07/2021 16:21:07 - INFO - __main__ - Step 135563: {'lr': 1.164926183599746e-05, 'samples': 26028096, 'steps': 135562, 'loss/train': 1.5792951583862305} 11/07/2021 16:21:07 - INFO - __main__ - Step 135564: {'lr': 1.1647660845631347e-05, 'samples': 26028288, 'steps': 135563, 'loss/train': 1.642229676246643} 11/07/2021 16:21:08 - INFO - __main__ - Step 135565: {'lr': 1.1646059962663102e-05, 'samples': 26028480, 'steps': 135564, 'loss/train': 1.2950758934020996} 11/07/2021 16:21:08 - INFO - __main__ - Step 135566: {'lr': 1.1644459187093476e-05, 'samples': 26028672, 'steps': 135565, 'loss/train': 0.4543752670288086} 11/07/2021 16:21:09 - INFO - __main__ - Step 135567: {'lr': 1.1642858518923161e-05, 'samples': 26028864, 'steps': 135566, 'loss/train': 1.0647799968719482} 11/07/2021 16:21:09 - INFO - __main__ - Step 135568: {'lr': 1.1641257958152906e-05, 'samples': 26029056, 'steps': 135567, 'loss/train': 0.9595513939857483} 11/07/2021 16:21:10 - INFO - __main__ - Step 135569: {'lr': 1.1639657504783463e-05, 'samples': 26029248, 'steps': 135568, 'loss/train': 1.1561000347137451} 11/07/2021 16:21:10 - INFO - __main__ - Step 135570: {'lr': 1.1638057158815468e-05, 'samples': 26029440, 'steps': 135569, 'loss/train': 1.1915842294692993} 11/07/2021 16:21:11 - INFO - __main__ - Step 135571: {'lr': 1.16364569202497e-05, 'samples': 26029632, 'steps': 135570, 'loss/train': 0.6431447267532349} 11/07/2021 16:21:12 - INFO - __main__ - Step 135572: {'lr': 1.1634856789086851e-05, 'samples': 26029824, 'steps': 135571, 'loss/train': 1.0095192193984985} 11/07/2021 16:21:12 - INFO - __main__ - Step 135573: {'lr': 1.163325676532767e-05, 'samples': 26030016, 'steps': 135572, 'loss/train': 1.3335771560668945} 11/07/2021 16:21:12 - INFO - __main__ - Step 135574: {'lr': 1.1631656848972855e-05, 'samples': 26030208, 'steps': 135573, 'loss/train': 0.9294772744178772} 11/07/2021 16:21:13 - INFO - __main__ - Step 135575: {'lr': 1.163005704002315e-05, 'samples': 26030400, 'steps': 135574, 'loss/train': 1.5251104831695557} 11/07/2021 16:21:13 - INFO - __main__ - Step 135576: {'lr': 1.1628457338479254e-05, 'samples': 26030592, 'steps': 135575, 'loss/train': 1.545179009437561} 11/07/2021 16:21:14 - INFO - __main__ - Step 135577: {'lr': 1.1626857744341913e-05, 'samples': 26030784, 'steps': 135576, 'loss/train': 1.1729825735092163} 11/07/2021 16:21:15 - INFO - __main__ - Step 135578: {'lr': 1.162525825761182e-05, 'samples': 26030976, 'steps': 135577, 'loss/train': 1.1395548582077026} 11/07/2021 16:21:15 - INFO - __main__ - Step 135579: {'lr': 1.16236588782897e-05, 'samples': 26031168, 'steps': 135578, 'loss/train': 5.5370049476623535} 11/07/2021 16:21:15 - INFO - __main__ - Step 135580: {'lr': 1.1622059606376274e-05, 'samples': 26031360, 'steps': 135579, 'loss/train': 1.497712254524231} 11/07/2021 16:21:16 - INFO - __main__ - Step 135581: {'lr': 1.1620460441872288e-05, 'samples': 26031552, 'steps': 135580, 'loss/train': 1.069351077079773} 11/07/2021 16:21:16 - INFO - __main__ - Step 135582: {'lr': 1.1618861384778467e-05, 'samples': 26031744, 'steps': 135581, 'loss/train': 0.9459696412086487} 11/07/2021 16:21:17 - INFO - __main__ - Step 135583: {'lr': 1.1617262435095477e-05, 'samples': 26031936, 'steps': 135582, 'loss/train': 0.7521494030952454} 11/07/2021 16:21:18 - INFO - __main__ - Step 135584: {'lr': 1.1615663592824065e-05, 'samples': 26032128, 'steps': 135583, 'loss/train': 1.4986110925674438} 11/07/2021 16:21:18 - INFO - __main__ - Step 135585: {'lr': 1.1614064857964984e-05, 'samples': 26032320, 'steps': 135584, 'loss/train': 1.3300776481628418} 11/07/2021 16:21:18 - INFO - __main__ - Step 135586: {'lr': 1.1612466230518898e-05, 'samples': 26032512, 'steps': 135585, 'loss/train': 1.6917062997817993} 11/07/2021 16:21:19 - INFO - __main__ - Step 135587: {'lr': 1.1610867710486556e-05, 'samples': 26032704, 'steps': 135586, 'loss/train': 1.9537049531936646} 11/07/2021 16:21:20 - INFO - __main__ - Step 135588: {'lr': 1.160926929786868e-05, 'samples': 26032896, 'steps': 135587, 'loss/train': 1.1685404777526855} 11/07/2021 16:21:20 - INFO - __main__ - Step 135589: {'lr': 1.1607670992665992e-05, 'samples': 26033088, 'steps': 135588, 'loss/train': 0.7726439833641052} 11/07/2021 16:21:20 - INFO - __main__ - Step 135590: {'lr': 1.1606072794879213e-05, 'samples': 26033280, 'steps': 135589, 'loss/train': 0.8353034853935242} 11/07/2021 16:21:21 - INFO - __main__ - Step 135591: {'lr': 1.1604474704509065e-05, 'samples': 26033472, 'steps': 135590, 'loss/train': 1.4343844652175903} 11/07/2021 16:21:21 - INFO - __main__ - Step 135592: {'lr': 1.1602876721556271e-05, 'samples': 26033664, 'steps': 135591, 'loss/train': 1.4147956371307373} 11/07/2021 16:21:22 - INFO - __main__ - Step 135593: {'lr': 1.1601278846021524e-05, 'samples': 26033856, 'steps': 135592, 'loss/train': 1.0042798519134521} 11/07/2021 16:21:22 - INFO - __main__ - Step 135594: {'lr': 1.1599681077905543e-05, 'samples': 26034048, 'steps': 135593, 'loss/train': 1.4904969930648804} 11/07/2021 16:21:23 - INFO - __main__ - Step 135595: {'lr': 1.1598083417209138e-05, 'samples': 26034240, 'steps': 135594, 'loss/train': 0.8028585314750671} 11/07/2021 16:21:23 - INFO - __main__ - Step 135596: {'lr': 1.1596485863932915e-05, 'samples': 26034432, 'steps': 135595, 'loss/train': 0.8231572508811951} 11/07/2021 16:21:23 - INFO - __main__ - Step 135597: {'lr': 1.1594888418077626e-05, 'samples': 26034624, 'steps': 135596, 'loss/train': 1.246738076210022} 11/07/2021 16:21:25 - INFO - __main__ - Step 135598: {'lr': 1.1593291079643992e-05, 'samples': 26034816, 'steps': 135597, 'loss/train': 1.0420113801956177} 11/07/2021 16:21:25 - INFO - __main__ - Step 135599: {'lr': 1.1591693848632762e-05, 'samples': 26035008, 'steps': 135598, 'loss/train': 1.5010627508163452} 11/07/2021 16:21:25 - INFO - __main__ - Step 135600: {'lr': 1.159009672504463e-05, 'samples': 26035200, 'steps': 135599, 'loss/train': 1.3446322679519653} 11/07/2021 16:21:26 - INFO - __main__ - Step 135601: {'lr': 1.1588499708880318e-05, 'samples': 26035392, 'steps': 135600, 'loss/train': 0.8548798561096191} 11/07/2021 16:21:26 - INFO - __main__ - Step 135602: {'lr': 1.1586902800140547e-05, 'samples': 26035584, 'steps': 135601, 'loss/train': 1.1983096599578857} 11/07/2021 16:21:27 - INFO - __main__ - Step 135603: {'lr': 1.158530599882604e-05, 'samples': 26035776, 'steps': 135602, 'loss/train': 0.6315560936927795} 11/07/2021 16:21:27 - INFO - __main__ - Step 135604: {'lr': 1.1583709304937518e-05, 'samples': 26035968, 'steps': 135603, 'loss/train': 1.0614652633666992} 11/07/2021 16:21:28 - INFO - __main__ - Step 135605: {'lr': 1.15821127184757e-05, 'samples': 26036160, 'steps': 135604, 'loss/train': 1.2788312435150146} 11/07/2021 16:21:28 - INFO - __main__ - Step 135606: {'lr': 1.1580516239441314e-05, 'samples': 26036352, 'steps': 135605, 'loss/train': 1.2419028282165527} 11/07/2021 16:21:28 - INFO - __main__ - Step 135607: {'lr': 1.1578919867835047e-05, 'samples': 26036544, 'steps': 135606, 'loss/train': 1.324536919593811} 11/07/2021 16:21:29 - INFO - __main__ - Step 135608: {'lr': 1.1577323603657652e-05, 'samples': 26036736, 'steps': 135607, 'loss/train': 1.5149356126785278} 11/07/2021 16:21:30 - INFO - __main__ - Step 135609: {'lr': 1.1575727446909878e-05, 'samples': 26036928, 'steps': 135608, 'loss/train': 0.9170591831207275} 11/07/2021 16:21:30 - INFO - __main__ - Step 135610: {'lr': 1.1574131397592335e-05, 'samples': 26037120, 'steps': 135609, 'loss/train': 0.8056329488754272} 11/07/2021 16:21:30 - INFO - __main__ - Step 135611: {'lr': 1.157253545570583e-05, 'samples': 26037312, 'steps': 135610, 'loss/train': 1.4274933338165283} 11/07/2021 16:21:31 - INFO - __main__ - Step 135612: {'lr': 1.157093962125108e-05, 'samples': 26037504, 'steps': 135611, 'loss/train': 1.0913020372390747} 11/07/2021 16:21:32 - INFO - __main__ - Step 135613: {'lr': 1.1569343894228756e-05, 'samples': 26037696, 'steps': 135612, 'loss/train': 1.2864468097686768} 11/07/2021 16:21:32 - INFO - __main__ - Step 135614: {'lr': 1.1567748274639634e-05, 'samples': 26037888, 'steps': 135613, 'loss/train': 1.4584442377090454} 11/07/2021 16:21:33 - INFO - __main__ - Step 135615: {'lr': 1.1566152762484378e-05, 'samples': 26038080, 'steps': 135614, 'loss/train': 1.4912174940109253} 11/07/2021 16:21:33 - INFO - __main__ - Step 135616: {'lr': 1.1564557357763739e-05, 'samples': 26038272, 'steps': 135615, 'loss/train': 0.979218602180481} 11/07/2021 16:21:33 - INFO - __main__ - Step 135617: {'lr': 1.1562962060478437e-05, 'samples': 26038464, 'steps': 135616, 'loss/train': 1.3944003582000732} 11/07/2021 16:21:34 - INFO - __main__ - Step 135618: {'lr': 1.1561366870629198e-05, 'samples': 26038656, 'steps': 135617, 'loss/train': 1.3113760948181152} 11/07/2021 16:21:35 - INFO - __main__ - Step 135619: {'lr': 1.155977178821671e-05, 'samples': 26038848, 'steps': 135618, 'loss/train': 1.4322983026504517} 11/07/2021 16:21:35 - INFO - __main__ - Step 135620: {'lr': 1.1558176813241728e-05, 'samples': 26039040, 'steps': 135619, 'loss/train': 1.3155248165130615} 11/07/2021 16:21:36 - INFO - __main__ - Step 135621: {'lr': 1.1556581945704913e-05, 'samples': 26039232, 'steps': 135620, 'loss/train': 1.3381518125534058} 11/07/2021 16:21:36 - INFO - __main__ - Step 135622: {'lr': 1.1554987185607102e-05, 'samples': 26039424, 'steps': 135621, 'loss/train': 1.2977855205535889} 11/07/2021 16:21:36 - INFO - __main__ - Step 135623: {'lr': 1.1553392532948875e-05, 'samples': 26039616, 'steps': 135622, 'loss/train': 0.9106414914131165} 11/07/2021 16:21:37 - INFO - __main__ - Step 135624: {'lr': 1.155179798773101e-05, 'samples': 26039808, 'steps': 135623, 'loss/train': 1.7905049324035645} 11/07/2021 16:21:38 - INFO - __main__ - Step 135625: {'lr': 1.155020354995423e-05, 'samples': 26040000, 'steps': 135624, 'loss/train': 1.1034507751464844} 11/07/2021 16:21:38 - INFO - __main__ - Step 135626: {'lr': 1.1548609219619254e-05, 'samples': 26040192, 'steps': 135625, 'loss/train': 1.5063238143920898} 11/07/2021 16:21:38 - INFO - __main__ - Step 135627: {'lr': 1.1547014996726806e-05, 'samples': 26040384, 'steps': 135626, 'loss/train': 0.9938121438026428} 11/07/2021 16:21:39 - INFO - __main__ - Step 135628: {'lr': 1.1545420881277579e-05, 'samples': 26040576, 'steps': 135627, 'loss/train': 1.1438809633255005} 11/07/2021 16:21:40 - INFO - __main__ - Step 135629: {'lr': 1.1543826873272296e-05, 'samples': 26040768, 'steps': 135628, 'loss/train': 1.0304206609725952} 11/07/2021 16:21:40 - INFO - __main__ - Step 135630: {'lr': 1.1542232972711703e-05, 'samples': 26040960, 'steps': 135629, 'loss/train': 1.075308918952942} 11/07/2021 16:21:40 - INFO - __main__ - Step 135631: {'lr': 1.1540639179596468e-05, 'samples': 26041152, 'steps': 135630, 'loss/train': 0.407845139503479} 11/07/2021 16:21:41 - INFO - __main__ - Step 135632: {'lr': 1.1539045493927369e-05, 'samples': 26041344, 'steps': 135631, 'loss/train': 1.256483554840088} 11/07/2021 16:21:41 - INFO - __main__ - Step 135633: {'lr': 1.15374519157051e-05, 'samples': 26041536, 'steps': 135632, 'loss/train': 1.2917704582214355} 11/07/2021 16:21:42 - INFO - __main__ - Step 135634: {'lr': 1.1535858444930408e-05, 'samples': 26041728, 'steps': 135633, 'loss/train': 0.9283514618873596} 11/07/2021 16:21:42 - INFO - __main__ - Step 135635: {'lr': 1.1534265081603933e-05, 'samples': 26041920, 'steps': 135634, 'loss/train': 1.2883052825927734} 11/07/2021 16:21:43 - INFO - __main__ - Step 135636: {'lr': 1.1532671825726455e-05, 'samples': 26042112, 'steps': 135635, 'loss/train': 1.5087766647338867} 11/07/2021 16:21:43 - INFO - __main__ - Step 135637: {'lr': 1.1531078677298661e-05, 'samples': 26042304, 'steps': 135636, 'loss/train': 1.1721062660217285} 11/07/2021 16:21:43 - INFO - __main__ - Step 135638: {'lr': 1.152948563632128e-05, 'samples': 26042496, 'steps': 135637, 'loss/train': 1.125170350074768} 11/07/2021 16:21:44 - INFO - __main__ - Step 135639: {'lr': 1.1527892702795029e-05, 'samples': 26042688, 'steps': 135638, 'loss/train': 1.7131725549697876} 11/07/2021 16:21:46 - INFO - __main__ - Step 135640: {'lr': 1.1526299876720658e-05, 'samples': 26042880, 'steps': 135639, 'loss/train': 1.3077493906021118} 11/07/2021 16:21:46 - INFO - __main__ - Step 135641: {'lr': 1.1524707158098834e-05, 'samples': 26043072, 'steps': 135640, 'loss/train': 1.4243431091308594} 11/07/2021 16:21:46 - INFO - __main__ - Step 135642: {'lr': 1.1523114546930307e-05, 'samples': 26043264, 'steps': 135641, 'loss/train': 1.077000617980957} 11/07/2021 16:21:47 - INFO - __main__ - Step 135643: {'lr': 1.1521522043215772e-05, 'samples': 26043456, 'steps': 135642, 'loss/train': 1.5374321937561035} 11/07/2021 16:21:47 - INFO - __main__ - Step 135644: {'lr': 1.1519929646955973e-05, 'samples': 26043648, 'steps': 135643, 'loss/train': 1.409267783164978} 11/07/2021 16:21:47 - INFO - __main__ - Step 135645: {'lr': 1.1518337358151636e-05, 'samples': 26043840, 'steps': 135644, 'loss/train': 0.853496789932251} 11/07/2021 16:21:48 - INFO - __main__ - Step 135646: {'lr': 1.1516745176803427e-05, 'samples': 26044032, 'steps': 135645, 'loss/train': 0.8111945986747742} 11/07/2021 16:21:49 - INFO - __main__ - Step 135647: {'lr': 1.1515153102912097e-05, 'samples': 26044224, 'steps': 135646, 'loss/train': 1.5536348819732666} 11/07/2021 16:21:49 - INFO - __main__ - Step 135648: {'lr': 1.1513561136478334e-05, 'samples': 26044416, 'steps': 135647, 'loss/train': 0.7910428047180176} 11/07/2021 16:21:49 - INFO - __main__ - Step 135649: {'lr': 1.1511969277502921e-05, 'samples': 26044608, 'steps': 135648, 'loss/train': 1.5773409605026245} 11/07/2021 16:21:50 - INFO - __main__ - Step 135650: {'lr': 1.1510377525986492e-05, 'samples': 26044800, 'steps': 135649, 'loss/train': 1.3009686470031738} 11/07/2021 16:21:50 - INFO - __main__ - Step 135651: {'lr': 1.1508785881929856e-05, 'samples': 26044992, 'steps': 135650, 'loss/train': 1.3650174140930176} 11/07/2021 16:21:51 - INFO - __main__ - Step 135652: {'lr': 1.150719434533365e-05, 'samples': 26045184, 'steps': 135651, 'loss/train': 1.102670669555664} 11/07/2021 16:21:52 - INFO - __main__ - Step 135653: {'lr': 1.1505602916198621e-05, 'samples': 26045376, 'steps': 135652, 'loss/train': 1.3040225505828857} 11/07/2021 16:21:52 - INFO - __main__ - Step 135654: {'lr': 1.1504011594525492e-05, 'samples': 26045568, 'steps': 135653, 'loss/train': 1.239525556564331} 11/07/2021 16:21:52 - INFO - __main__ - Step 135655: {'lr': 1.1502420380314988e-05, 'samples': 26045760, 'steps': 135654, 'loss/train': 1.1155062913894653} 11/07/2021 16:21:53 - INFO - __main__ - Step 135656: {'lr': 1.1500829273567826e-05, 'samples': 26045952, 'steps': 135655, 'loss/train': 1.5331447124481201} 11/07/2021 16:21:54 - INFO - __main__ - Step 135657: {'lr': 1.1499238274284673e-05, 'samples': 26046144, 'steps': 135656, 'loss/train': 1.2631568908691406} 11/07/2021 16:21:54 - INFO - __main__ - Step 135658: {'lr': 1.149764738246631e-05, 'samples': 26046336, 'steps': 135657, 'loss/train': 1.6325969696044922} 11/07/2021 16:21:54 - INFO - __main__ - Step 135659: {'lr': 1.1496056598113398e-05, 'samples': 26046528, 'steps': 135658, 'loss/train': 1.2937837839126587} 11/07/2021 16:21:55 - INFO - __main__ - Step 135660: {'lr': 1.1494465921226688e-05, 'samples': 26046720, 'steps': 135659, 'loss/train': 1.9881243705749512} 11/07/2021 16:21:55 - INFO - __main__ - Step 135661: {'lr': 1.1492875351806903e-05, 'samples': 26046912, 'steps': 135660, 'loss/train': 1.222364068031311} 11/07/2021 16:21:56 - INFO - __main__ - Step 135662: {'lr': 1.1491284889854765e-05, 'samples': 26047104, 'steps': 135661, 'loss/train': 1.4917796850204468} 11/07/2021 16:21:56 - INFO - __main__ - Step 135663: {'lr': 1.1489694535370937e-05, 'samples': 26047296, 'steps': 135662, 'loss/train': 1.8960005044937134} 11/07/2021 16:21:57 - INFO - __main__ - Step 135664: {'lr': 1.1488104288356199e-05, 'samples': 26047488, 'steps': 135663, 'loss/train': 1.422839641571045} 11/07/2021 16:21:57 - INFO - __main__ - Step 135665: {'lr': 1.1486514148811216e-05, 'samples': 26047680, 'steps': 135664, 'loss/train': 1.4719754457473755} 11/07/2021 16:21:58 - INFO - __main__ - Step 135666: {'lr': 1.1484924116736739e-05, 'samples': 26047872, 'steps': 135665, 'loss/train': 0.6140120029449463} 11/07/2021 16:21:58 - INFO - __main__ - Step 135667: {'lr': 1.1483334192133515e-05, 'samples': 26048064, 'steps': 135666, 'loss/train': 0.9575623273849487} 11/07/2021 16:21:59 - INFO - __main__ - Step 135668: {'lr': 1.1481744375002185e-05, 'samples': 26048256, 'steps': 135667, 'loss/train': 1.8342273235321045} 11/07/2021 16:21:59 - INFO - __main__ - Step 135669: {'lr': 1.1480154665343496e-05, 'samples': 26048448, 'steps': 135668, 'loss/train': 0.4560762941837311} 11/07/2021 16:22:00 - INFO - __main__ - Step 135670: {'lr': 1.1478565063158197e-05, 'samples': 26048640, 'steps': 135669, 'loss/train': 1.1715350151062012} 11/07/2021 16:22:00 - INFO - __main__ - Step 135671: {'lr': 1.1476975568446928e-05, 'samples': 26048832, 'steps': 135670, 'loss/train': 1.1622428894042969} 11/07/2021 16:22:01 - INFO - __main__ - Step 135672: {'lr': 1.1475386181210495e-05, 'samples': 26049024, 'steps': 135671, 'loss/train': 1.0686008930206299} 11/07/2021 16:22:01 - INFO - __main__ - Step 135673: {'lr': 1.1473796901449535e-05, 'samples': 26049216, 'steps': 135672, 'loss/train': 1.3494552373886108} 11/07/2021 16:22:02 - INFO - __main__ - Step 135674: {'lr': 1.1472207729164824e-05, 'samples': 26049408, 'steps': 135673, 'loss/train': 0.956142008304596} 11/07/2021 16:22:02 - INFO - __main__ - Step 135675: {'lr': 1.1470618664357057e-05, 'samples': 26049600, 'steps': 135674, 'loss/train': 1.3294038772583008} 11/07/2021 16:22:02 - INFO - __main__ - Step 135676: {'lr': 1.1469029707026957e-05, 'samples': 26049792, 'steps': 135675, 'loss/train': 1.2166391611099243} 11/07/2021 16:22:03 - INFO - __main__ - Step 135677: {'lr': 1.1467440857175216e-05, 'samples': 26049984, 'steps': 135676, 'loss/train': 0.6198204159736633} 11/07/2021 16:22:04 - INFO - __main__ - Step 135678: {'lr': 1.1465852114802584e-05, 'samples': 26050176, 'steps': 135677, 'loss/train': 1.3078320026397705} 11/07/2021 16:22:04 - INFO - __main__ - Step 135679: {'lr': 1.1464263479909754e-05, 'samples': 26050368, 'steps': 135678, 'loss/train': 1.167129397392273} 11/07/2021 16:22:04 - INFO - __main__ - Step 135680: {'lr': 1.1462674952497421e-05, 'samples': 26050560, 'steps': 135679, 'loss/train': 1.201867699623108} 11/07/2021 16:22:05 - INFO - __main__ - Step 135681: {'lr': 1.1461086532566334e-05, 'samples': 26050752, 'steps': 135680, 'loss/train': 1.403152346611023} 11/07/2021 16:22:06 - INFO - __main__ - Step 135682: {'lr': 1.1459498220117214e-05, 'samples': 26050944, 'steps': 135681, 'loss/train': 0.9652767181396484} 11/07/2021 16:22:06 - INFO - __main__ - Step 135683: {'lr': 1.1457910015150758e-05, 'samples': 26051136, 'steps': 135682, 'loss/train': 1.4604854583740234} 11/07/2021 16:22:07 - INFO - __main__ - Step 135684: {'lr': 1.1456321917667684e-05, 'samples': 26051328, 'steps': 135683, 'loss/train': 1.2930513620376587} 11/07/2021 16:22:07 - INFO - __main__ - Step 135685: {'lr': 1.1454733927668687e-05, 'samples': 26051520, 'steps': 135684, 'loss/train': 1.332000970840454} 11/07/2021 16:22:07 - INFO - __main__ - Step 135686: {'lr': 1.1453146045154545e-05, 'samples': 26051712, 'steps': 135685, 'loss/train': 1.33024263381958} 11/07/2021 16:22:08 - INFO - __main__ - Step 135687: {'lr': 1.1451558270125923e-05, 'samples': 26051904, 'steps': 135686, 'loss/train': 1.1320360898971558} 11/07/2021 16:22:09 - INFO - __main__ - Step 135688: {'lr': 1.1449970602583543e-05, 'samples': 26052096, 'steps': 135687, 'loss/train': 1.1416352987289429} 11/07/2021 16:22:09 - INFO - __main__ - Step 135689: {'lr': 1.1448383042528127e-05, 'samples': 26052288, 'steps': 135688, 'loss/train': 0.885408341884613} 11/07/2021 16:22:09 - INFO - __main__ - Step 135690: {'lr': 1.1446795589960396e-05, 'samples': 26052480, 'steps': 135689, 'loss/train': 1.6007142066955566} 11/07/2021 16:22:10 - INFO - __main__ - Step 135691: {'lr': 1.1445208244881044e-05, 'samples': 26052672, 'steps': 135690, 'loss/train': 1.2785193920135498} 11/07/2021 16:22:10 - INFO - __main__ - Step 135692: {'lr': 1.1443621007290822e-05, 'samples': 26052864, 'steps': 135691, 'loss/train': 0.8679123520851135} 11/07/2021 16:22:11 - INFO - __main__ - Step 135693: {'lr': 1.1442033877190395e-05, 'samples': 26053056, 'steps': 135692, 'loss/train': 0.5721511244773865} 11/07/2021 16:22:11 - INFO - __main__ - Step 135694: {'lr': 1.1440446854580511e-05, 'samples': 26053248, 'steps': 135693, 'loss/train': 1.0758668184280396} 11/07/2021 16:22:12 - INFO - __main__ - Step 135695: {'lr': 1.1438859939461893e-05, 'samples': 26053440, 'steps': 135694, 'loss/train': 1.6123712062835693} 11/07/2021 16:22:12 - INFO - __main__ - Step 135696: {'lr': 1.1437273131835234e-05, 'samples': 26053632, 'steps': 135695, 'loss/train': 1.215234398841858} 11/07/2021 16:22:12 - INFO - __main__ - Step 135697: {'lr': 1.143568643170126e-05, 'samples': 26053824, 'steps': 135696, 'loss/train': 1.300635814666748} 11/07/2021 16:22:14 - INFO - __main__ - Step 135698: {'lr': 1.1434099839060685e-05, 'samples': 26054016, 'steps': 135697, 'loss/train': 1.4105960130691528} 11/07/2021 16:22:14 - INFO - __main__ - Step 135699: {'lr': 1.1432513353914236e-05, 'samples': 26054208, 'steps': 135698, 'loss/train': 0.7831587195396423} 11/07/2021 16:22:14 - INFO - __main__ - Step 135700: {'lr': 1.1430926976262606e-05, 'samples': 26054400, 'steps': 135699, 'loss/train': 1.0845805406570435} 11/07/2021 16:22:15 - INFO - __main__ - Step 135701: {'lr': 1.1429340706106516e-05, 'samples': 26054592, 'steps': 135700, 'loss/train': 1.2984423637390137} 11/07/2021 16:22:15 - INFO - __main__ - Step 135702: {'lr': 1.1427754543446661e-05, 'samples': 26054784, 'steps': 135701, 'loss/train': 1.3766778707504272} 11/07/2021 16:22:16 - INFO - __main__ - Step 135703: {'lr': 1.1426168488283845e-05, 'samples': 26054976, 'steps': 135702, 'loss/train': 1.4018023014068604} 11/07/2021 16:22:16 - INFO - __main__ - Step 135704: {'lr': 1.1424582540618678e-05, 'samples': 26055168, 'steps': 135703, 'loss/train': 1.5731626749038696} 11/07/2021 16:22:17 - INFO - __main__ - Step 135705: {'lr': 1.142299670045191e-05, 'samples': 26055360, 'steps': 135704, 'loss/train': 1.1846345663070679} 11/07/2021 16:22:17 - INFO - __main__ - Step 135706: {'lr': 1.1421410967784234e-05, 'samples': 26055552, 'steps': 135705, 'loss/train': 1.42076575756073} 11/07/2021 16:22:17 - INFO - __main__ - Step 135707: {'lr': 1.141982534261643e-05, 'samples': 26055744, 'steps': 135706, 'loss/train': 1.30851149559021} 11/07/2021 16:22:19 - INFO - __main__ - Step 135708: {'lr': 1.1418239824949133e-05, 'samples': 26055936, 'steps': 135707, 'loss/train': 1.335372805595398} 11/07/2021 16:22:19 - INFO - __main__ - Step 135709: {'lr': 1.1416654414783123e-05, 'samples': 26056128, 'steps': 135708, 'loss/train': 0.8766849040985107} 11/07/2021 16:22:19 - INFO - __main__ - Step 135710: {'lr': 1.1415069112119065e-05, 'samples': 26056320, 'steps': 135709, 'loss/train': 0.9534925222396851} 11/07/2021 16:22:20 - INFO - __main__ - Step 135711: {'lr': 1.1413483916957706e-05, 'samples': 26056512, 'steps': 135710, 'loss/train': 1.402580976486206} 11/07/2021 16:22:20 - INFO - __main__ - Step 135712: {'lr': 1.1411898829299772e-05, 'samples': 26056704, 'steps': 135711, 'loss/train': 1.030491828918457} 11/07/2021 16:22:20 - INFO - __main__ - Step 135713: {'lr': 1.1410313849145926e-05, 'samples': 26056896, 'steps': 135712, 'loss/train': 1.2359212636947632} 11/07/2021 16:22:22 - INFO - __main__ - Step 135714: {'lr': 1.1408728976496918e-05, 'samples': 26057088, 'steps': 135713, 'loss/train': 0.8364384174346924} 11/07/2021 16:22:22 - INFO - __main__ - Step 135715: {'lr': 1.1407144211353443e-05, 'samples': 26057280, 'steps': 135714, 'loss/train': 1.3842636346817017} 11/07/2021 16:22:22 - INFO - __main__ - Step 135716: {'lr': 1.1405559553716277e-05, 'samples': 26057472, 'steps': 135715, 'loss/train': 1.3646447658538818} 11/07/2021 16:22:23 - INFO - __main__ - Step 135717: {'lr': 1.140397500358606e-05, 'samples': 26057664, 'steps': 135716, 'loss/train': 1.4215322732925415} 11/07/2021 16:22:23 - INFO - __main__ - Step 135718: {'lr': 1.1402390560963511e-05, 'samples': 26057856, 'steps': 135717, 'loss/train': 1.3924758434295654} 11/07/2021 16:22:25 - INFO - __main__ - Step 135719: {'lr': 1.1400806225849352e-05, 'samples': 26058048, 'steps': 135718, 'loss/train': 1.1169387102127075} 11/07/2021 16:22:25 - INFO - __main__ - Step 135720: {'lr': 1.1399221998244336e-05, 'samples': 26058240, 'steps': 135719, 'loss/train': 1.1154979467391968} 11/07/2021 16:22:26 - INFO - __main__ - Step 135721: {'lr': 1.1397637878149154e-05, 'samples': 26058432, 'steps': 135720, 'loss/train': 1.226962685585022} 11/07/2021 16:22:26 - INFO - __main__ - Step 135722: {'lr': 1.139605386556447e-05, 'samples': 26058624, 'steps': 135721, 'loss/train': 1.0979763269424438} 11/07/2021 16:22:26 - INFO - __main__ - Step 135723: {'lr': 1.1394469960491094e-05, 'samples': 26058816, 'steps': 135722, 'loss/train': 1.6086114645004272} 11/07/2021 16:22:27 - INFO - __main__ - Step 135724: {'lr': 1.1392886162929661e-05, 'samples': 26059008, 'steps': 135723, 'loss/train': 1.5915813446044922} 11/07/2021 16:22:28 - INFO - __main__ - Step 135725: {'lr': 1.1391302472880921e-05, 'samples': 26059200, 'steps': 135724, 'loss/train': 0.526020884513855} 11/07/2021 16:22:28 - INFO - __main__ - Step 135726: {'lr': 1.1389718890345568e-05, 'samples': 26059392, 'steps': 135725, 'loss/train': 1.4224432706832886} 11/07/2021 16:22:28 - INFO - __main__ - Step 135727: {'lr': 1.1388135415324324e-05, 'samples': 26059584, 'steps': 135726, 'loss/train': 1.3858126401901245} 11/07/2021 16:22:29 - INFO - __main__ - Step 135728: {'lr': 1.1386552047817911e-05, 'samples': 26059776, 'steps': 135727, 'loss/train': 1.4190950393676758} 11/07/2021 16:22:29 - INFO - __main__ - Step 135729: {'lr': 1.138496878782702e-05, 'samples': 26059968, 'steps': 135728, 'loss/train': 1.2710236310958862} 11/07/2021 16:22:30 - INFO - __main__ - Step 135730: {'lr': 1.1383385635352433e-05, 'samples': 26060160, 'steps': 135729, 'loss/train': 1.1781412363052368} 11/07/2021 16:22:30 - INFO - __main__ - Step 135731: {'lr': 1.1381802590394785e-05, 'samples': 26060352, 'steps': 135730, 'loss/train': 1.2585958242416382} 11/07/2021 16:22:31 - INFO - __main__ - Step 135732: {'lr': 1.1380219652954799e-05, 'samples': 26060544, 'steps': 135731, 'loss/train': 1.507380723953247} 11/07/2021 16:22:31 - INFO - __main__ - Step 135733: {'lr': 1.1378636823033195e-05, 'samples': 26060736, 'steps': 135732, 'loss/train': 1.5178035497665405} 11/07/2021 16:22:32 - INFO - __main__ - Step 135734: {'lr': 1.1377054100630723e-05, 'samples': 26060928, 'steps': 135733, 'loss/train': 1.4396376609802246} 11/07/2021 16:22:32 - INFO - __main__ - Step 135735: {'lr': 1.137547148574805e-05, 'samples': 26061120, 'steps': 135734, 'loss/train': 1.2821917533874512} 11/07/2021 16:22:33 - INFO - __main__ - Step 135736: {'lr': 1.1373888978385899e-05, 'samples': 26061312, 'steps': 135735, 'loss/train': 1.3842240571975708} 11/07/2021 16:22:33 - INFO - __main__ - Step 135737: {'lr': 1.1372306578544988e-05, 'samples': 26061504, 'steps': 135736, 'loss/train': 1.123157262802124} 11/07/2021 16:22:34 - INFO - __main__ - Step 135738: {'lr': 1.137072428622607e-05, 'samples': 26061696, 'steps': 135737, 'loss/train': 1.116175889968872} 11/07/2021 16:22:34 - INFO - __main__ - Step 135739: {'lr': 1.136914210142978e-05, 'samples': 26061888, 'steps': 135738, 'loss/train': 1.5278267860412598} 11/07/2021 16:22:35 - INFO - __main__ - Step 135740: {'lr': 1.1367560024156899e-05, 'samples': 26062080, 'steps': 135739, 'loss/train': 1.2692883014678955} 11/07/2021 16:22:36 - INFO - __main__ - Step 135741: {'lr': 1.1365978054408115e-05, 'samples': 26062272, 'steps': 135740, 'loss/train': 0.9019208550453186} 11/07/2021 16:22:37 - INFO - __main__ - Step 135742: {'lr': 1.1364396192184129e-05, 'samples': 26062464, 'steps': 135741, 'loss/train': 1.7547017335891724} 11/07/2021 16:22:37 - INFO - __main__ - Step 135743: {'lr': 1.1362814437485686e-05, 'samples': 26062656, 'steps': 135742, 'loss/train': 1.762292742729187} 11/07/2021 16:22:37 - INFO - __main__ - Step 135744: {'lr': 1.1361232790313452e-05, 'samples': 26062848, 'steps': 135743, 'loss/train': 1.7463499307632446} 11/07/2021 16:22:38 - INFO - __main__ - Step 135745: {'lr': 1.135965125066818e-05, 'samples': 26063040, 'steps': 135744, 'loss/train': 1.280273675918579} 11/07/2021 16:22:38 - INFO - __main__ - Step 135746: {'lr': 1.1358069818550532e-05, 'samples': 26063232, 'steps': 135745, 'loss/train': 1.1413494348526} 11/07/2021 16:22:38 - INFO - __main__ - Step 135747: {'lr': 1.1356488493961286e-05, 'samples': 26063424, 'steps': 135746, 'loss/train': 1.4983696937561035} 11/07/2021 16:22:39 - INFO - __main__ - Step 135748: {'lr': 1.1354907276901111e-05, 'samples': 26063616, 'steps': 135747, 'loss/train': 0.9089025855064392} 11/07/2021 16:22:40 - INFO - __main__ - Step 135749: {'lr': 1.1353326167370725e-05, 'samples': 26063808, 'steps': 135748, 'loss/train': 1.3406524658203125} 11/07/2021 16:22:40 - INFO - __main__ - Step 135750: {'lr': 1.135174516537088e-05, 'samples': 26064000, 'steps': 135749, 'loss/train': 1.543060064315796} 11/07/2021 16:22:40 - INFO - __main__ - Step 135751: {'lr': 1.1350164270902214e-05, 'samples': 26064192, 'steps': 135750, 'loss/train': 1.212560772895813} 11/07/2021 16:22:41 - INFO - __main__ - Step 135752: {'lr': 1.1348583483965502e-05, 'samples': 26064384, 'steps': 135751, 'loss/train': 1.4385368824005127} 11/07/2021 16:22:42 - INFO - __main__ - Step 135753: {'lr': 1.134700280456144e-05, 'samples': 26064576, 'steps': 135752, 'loss/train': 1.3648028373718262} 11/07/2021 16:22:42 - INFO - __main__ - Step 135754: {'lr': 1.1345422232690721e-05, 'samples': 26064768, 'steps': 135753, 'loss/train': 1.4625482559204102} 11/07/2021 16:22:43 - INFO - __main__ - Step 135755: {'lr': 1.1343841768354097e-05, 'samples': 26064960, 'steps': 135754, 'loss/train': 1.4597246646881104} 11/07/2021 16:22:43 - INFO - __main__ - Step 135756: {'lr': 1.134226141155223e-05, 'samples': 26065152, 'steps': 135755, 'loss/train': 1.4549299478530884} 11/07/2021 16:22:43 - INFO - __main__ - Step 135757: {'lr': 1.1340681162285898e-05, 'samples': 26065344, 'steps': 135756, 'loss/train': 1.603572130203247} 11/07/2021 16:22:44 - INFO - __main__ - Step 135758: {'lr': 1.1339101020555742e-05, 'samples': 26065536, 'steps': 135757, 'loss/train': 1.634710431098938} 11/07/2021 16:22:45 - INFO - __main__ - Step 135759: {'lr': 1.1337520986362509e-05, 'samples': 26065728, 'steps': 135758, 'loss/train': 0.9574289321899414} 11/07/2021 16:22:45 - INFO - __main__ - Step 135760: {'lr': 1.1335941059706895e-05, 'samples': 26065920, 'steps': 135759, 'loss/train': 1.5947469472885132} 11/07/2021 16:22:45 - INFO - __main__ - Step 135761: {'lr': 1.1334361240589647e-05, 'samples': 26066112, 'steps': 135760, 'loss/train': 1.3581825494766235} 11/07/2021 16:22:46 - INFO - __main__ - Step 135762: {'lr': 1.1332781529011433e-05, 'samples': 26066304, 'steps': 135761, 'loss/train': 1.3979806900024414} 11/07/2021 16:22:47 - INFO - __main__ - Step 135763: {'lr': 1.1331201924972972e-05, 'samples': 26066496, 'steps': 135762, 'loss/train': 0.9706791639328003} 11/07/2021 16:22:47 - INFO - __main__ - Step 135764: {'lr': 1.1329622428475017e-05, 'samples': 26066688, 'steps': 135763, 'loss/train': 0.8832399845123291} 11/07/2021 16:22:48 - INFO - __main__ - Step 135765: {'lr': 1.1328043039518232e-05, 'samples': 26066880, 'steps': 135764, 'loss/train': 1.3610172271728516} 11/07/2021 16:22:48 - INFO - __main__ - Step 135766: {'lr': 1.1326463758103339e-05, 'samples': 26067072, 'steps': 135765, 'loss/train': 0.9008601307868958} 11/07/2021 16:22:48 - INFO - __main__ - Step 135767: {'lr': 1.1324884584231087e-05, 'samples': 26067264, 'steps': 135766, 'loss/train': 1.542677640914917} 11/07/2021 16:22:49 - INFO - __main__ - Step 135768: {'lr': 1.1323305517902144e-05, 'samples': 26067456, 'steps': 135767, 'loss/train': 1.4517124891281128} 11/07/2021 16:22:50 - INFO - __main__ - Step 135769: {'lr': 1.132172655911723e-05, 'samples': 26067648, 'steps': 135768, 'loss/train': 0.7762079834938049} 11/07/2021 16:22:50 - INFO - __main__ - Step 135770: {'lr': 1.132014770787712e-05, 'samples': 26067840, 'steps': 135769, 'loss/train': 1.488993525505066} 11/07/2021 16:22:50 - INFO - __main__ - Step 135771: {'lr': 1.1318568964182403e-05, 'samples': 26068032, 'steps': 135770, 'loss/train': 1.6192647218704224} 11/07/2021 16:22:51 - INFO - __main__ - Step 135772: {'lr': 1.1316990328033878e-05, 'samples': 26068224, 'steps': 135771, 'loss/train': 1.4689916372299194} 11/07/2021 16:22:52 - INFO - __main__ - Step 135773: {'lr': 1.1315411799432213e-05, 'samples': 26068416, 'steps': 135772, 'loss/train': 1.5345185995101929} 11/07/2021 16:22:52 - INFO - __main__ - Step 135774: {'lr': 1.1313833378378158e-05, 'samples': 26068608, 'steps': 135773, 'loss/train': 1.2135300636291504} 11/07/2021 16:22:52 - INFO - __main__ - Step 135775: {'lr': 1.1312255064872407e-05, 'samples': 26068800, 'steps': 135774, 'loss/train': 1.3305368423461914} 11/07/2021 16:22:53 - INFO - __main__ - Step 135776: {'lr': 1.131067685891568e-05, 'samples': 26068992, 'steps': 135775, 'loss/train': 1.256069302558899} 11/07/2021 16:22:53 - INFO - __main__ - Step 135777: {'lr': 1.1309098760508646e-05, 'samples': 26069184, 'steps': 135776, 'loss/train': 1.0390169620513916} 11/07/2021 16:22:53 - INFO - __main__ - Step 135778: {'lr': 1.130752076965208e-05, 'samples': 26069376, 'steps': 135777, 'loss/train': 0.7695204019546509} 11/07/2021 16:22:55 - INFO - __main__ - Step 135779: {'lr': 1.1305942886346648e-05, 'samples': 26069568, 'steps': 135778, 'loss/train': 1.2736785411834717} 11/07/2021 16:22:55 - INFO - __main__ - Step 135780: {'lr': 1.1304365110593073e-05, 'samples': 26069760, 'steps': 135779, 'loss/train': 1.3375877141952515} 11/07/2021 16:22:55 - INFO - __main__ - Step 135781: {'lr': 1.1302787442392077e-05, 'samples': 26069952, 'steps': 135780, 'loss/train': 1.3262853622436523} 11/07/2021 16:22:56 - INFO - __main__ - Step 135782: {'lr': 1.1301209881744351e-05, 'samples': 26070144, 'steps': 135781, 'loss/train': 0.8551815152168274} 11/07/2021 16:22:56 - INFO - __main__ - Step 135783: {'lr': 1.1299632428650647e-05, 'samples': 26070336, 'steps': 135782, 'loss/train': 1.001765489578247} 11/07/2021 16:22:57 - INFO - __main__ - Step 135784: {'lr': 1.129805508311163e-05, 'samples': 26070528, 'steps': 135783, 'loss/train': 1.0306817293167114} 11/07/2021 16:22:57 - INFO - __main__ - Step 135785: {'lr': 1.1296477845128022e-05, 'samples': 26070720, 'steps': 135784, 'loss/train': 1.3165454864501953} 11/07/2021 16:22:58 - INFO - __main__ - Step 135786: {'lr': 1.1294900714700545e-05, 'samples': 26070912, 'steps': 135785, 'loss/train': 1.1674573421478271} 11/07/2021 16:22:58 - INFO - __main__ - Step 135787: {'lr': 1.1293323691829893e-05, 'samples': 26071104, 'steps': 135786, 'loss/train': 1.6411473751068115} 11/07/2021 16:22:58 - INFO - __main__ - Step 135788: {'lr': 1.129174677651676e-05, 'samples': 26071296, 'steps': 135787, 'loss/train': 1.2430380582809448} 11/07/2021 16:22:59 - INFO - __main__ - Step 135789: {'lr': 1.1290169968761922e-05, 'samples': 26071488, 'steps': 135788, 'loss/train': 0.950523316860199} 11/07/2021 16:23:00 - INFO - __main__ - Step 135790: {'lr': 1.1288593268566016e-05, 'samples': 26071680, 'steps': 135789, 'loss/train': 1.322239637374878} 11/07/2021 16:23:00 - INFO - __main__ - Step 135791: {'lr': 1.1287016675929823e-05, 'samples': 26071872, 'steps': 135790, 'loss/train': 0.7569808959960938} 11/07/2021 16:23:00 - INFO - __main__ - Step 135792: {'lr': 1.128544019085398e-05, 'samples': 26072064, 'steps': 135791, 'loss/train': 1.8879588842391968} 11/07/2021 16:23:01 - INFO - __main__ - Step 135793: {'lr': 1.1283863813339262e-05, 'samples': 26072256, 'steps': 135792, 'loss/train': 1.3339530229568481} 11/07/2021 16:23:02 - INFO - __main__ - Step 135794: {'lr': 1.1282287543386338e-05, 'samples': 26072448, 'steps': 135793, 'loss/train': 1.3536362648010254} 11/07/2021 16:23:02 - INFO - __main__ - Step 135795: {'lr': 1.1280711380995928e-05, 'samples': 26072640, 'steps': 135794, 'loss/train': 1.0348364114761353} 11/07/2021 16:23:03 - INFO - __main__ - Step 135796: {'lr': 1.1279135326168755e-05, 'samples': 26072832, 'steps': 135795, 'loss/train': 1.4416687488555908} 11/07/2021 16:23:03 - INFO - __main__ - Step 135797: {'lr': 1.1277559378905567e-05, 'samples': 26073024, 'steps': 135796, 'loss/train': 0.9223677515983582} 11/07/2021 16:23:03 - INFO - __main__ - Step 135798: {'lr': 1.1275983539206975e-05, 'samples': 26073216, 'steps': 135797, 'loss/train': 1.025512933731079} 11/07/2021 16:23:04 - INFO - __main__ - Step 135799: {'lr': 1.1274407807073728e-05, 'samples': 26073408, 'steps': 135798, 'loss/train': 1.387266993522644} 11/07/2021 16:23:05 - INFO - __main__ - Step 135800: {'lr': 1.1272832182506576e-05, 'samples': 26073600, 'steps': 135799, 'loss/train': 1.3281521797180176} 11/07/2021 16:23:05 - INFO - __main__ - Step 135801: {'lr': 1.1271256665506185e-05, 'samples': 26073792, 'steps': 135800, 'loss/train': 1.6841367483139038} 11/07/2021 16:23:05 - INFO - __main__ - Step 135802: {'lr': 1.1269681256073277e-05, 'samples': 26073984, 'steps': 135801, 'loss/train': 1.40491783618927} 11/07/2021 16:23:06 - INFO - __main__ - Step 135803: {'lr': 1.1268105954208573e-05, 'samples': 26074176, 'steps': 135802, 'loss/train': 1.5760459899902344} 11/07/2021 16:23:07 - INFO - __main__ - Step 135804: {'lr': 1.1266530759912797e-05, 'samples': 26074368, 'steps': 135803, 'loss/train': 1.0718752145767212} 11/07/2021 16:23:07 - INFO - __main__ - Step 135805: {'lr': 1.1264955673186611e-05, 'samples': 26074560, 'steps': 135804, 'loss/train': 0.789828360080719} 11/07/2021 16:23:08 - INFO - __main__ - Step 135806: {'lr': 1.1263380694030767e-05, 'samples': 26074752, 'steps': 135805, 'loss/train': 1.3909852504730225} 11/07/2021 16:23:08 - INFO - __main__ - Step 135807: {'lr': 1.1261805822445959e-05, 'samples': 26074944, 'steps': 135806, 'loss/train': 1.2865417003631592} 11/07/2021 16:23:08 - INFO - __main__ - Step 135808: {'lr': 1.1260231058432879e-05, 'samples': 26075136, 'steps': 135807, 'loss/train': 0.933961808681488} 11/07/2021 16:23:09 - INFO - __main__ - Step 135809: {'lr': 1.125865640199228e-05, 'samples': 26075328, 'steps': 135808, 'loss/train': 1.377108097076416} 11/07/2021 16:23:10 - INFO - __main__ - Step 135810: {'lr': 1.1257081853124822e-05, 'samples': 26075520, 'steps': 135809, 'loss/train': 1.4731978178024292} 11/07/2021 16:23:10 - INFO - __main__ - Step 135811: {'lr': 1.125550741183129e-05, 'samples': 26075712, 'steps': 135810, 'loss/train': 1.590374231338501} 11/07/2021 16:23:10 - INFO - __main__ - Step 135812: {'lr': 1.1253933078112316e-05, 'samples': 26075904, 'steps': 135811, 'loss/train': 1.2866297960281372} 11/07/2021 16:23:11 - INFO - __main__ - Step 135813: {'lr': 1.1252358851968624e-05, 'samples': 26076096, 'steps': 135812, 'loss/train': 1.2557326555252075} 11/07/2021 16:23:11 - INFO - __main__ - Step 135814: {'lr': 1.1250784733400937e-05, 'samples': 26076288, 'steps': 135813, 'loss/train': 1.1563910245895386} 11/07/2021 16:23:12 - INFO - __main__ - Step 135815: {'lr': 1.1249210722409975e-05, 'samples': 26076480, 'steps': 135814, 'loss/train': 1.2688161134719849} 11/07/2021 16:23:13 - INFO - __main__ - Step 135816: {'lr': 1.1247636818996405e-05, 'samples': 26076672, 'steps': 135815, 'loss/train': 1.382609486579895} 11/07/2021 16:23:13 - INFO - __main__ - Step 135817: {'lr': 1.1246063023160974e-05, 'samples': 26076864, 'steps': 135816, 'loss/train': 0.8894914984703064} 11/07/2021 16:23:13 - INFO - __main__ - Step 135818: {'lr': 1.1244489334904407e-05, 'samples': 26077056, 'steps': 135817, 'loss/train': 1.4493046998977661} 11/07/2021 16:23:14 - INFO - __main__ - Step 135819: {'lr': 1.1242915754227367e-05, 'samples': 26077248, 'steps': 135818, 'loss/train': 1.2702103853225708} 11/07/2021 16:23:15 - INFO - __main__ - Step 135820: {'lr': 1.124134228113058e-05, 'samples': 26077440, 'steps': 135819, 'loss/train': 1.3683810234069824} 11/07/2021 16:23:15 - INFO - __main__ - Step 135821: {'lr': 1.123976891561479e-05, 'samples': 26077632, 'steps': 135820, 'loss/train': 1.4582951068878174} 11/07/2021 16:23:15 - INFO - __main__ - Step 135822: {'lr': 1.123819565768064e-05, 'samples': 26077824, 'steps': 135821, 'loss/train': 1.193463921546936} 11/07/2021 16:23:16 - INFO - __main__ - Step 135823: {'lr': 1.1236622507328903e-05, 'samples': 26078016, 'steps': 135822, 'loss/train': 0.9644692540168762} 11/07/2021 16:23:16 - INFO - __main__ - Step 135824: {'lr': 1.1235049464560276e-05, 'samples': 26078208, 'steps': 135823, 'loss/train': 1.323904275894165} 11/07/2021 16:23:16 - INFO - __main__ - Step 135825: {'lr': 1.1233476529375425e-05, 'samples': 26078400, 'steps': 135824, 'loss/train': 1.2843283414840698} 11/07/2021 16:23:17 - INFO - __main__ - Step 135826: {'lr': 1.123190370177507e-05, 'samples': 26078592, 'steps': 135825, 'loss/train': 1.3915655612945557} 11/07/2021 16:23:18 - INFO - __main__ - Step 135827: {'lr': 1.1230330981759962e-05, 'samples': 26078784, 'steps': 135826, 'loss/train': 1.7700034379959106} 11/07/2021 16:23:18 - INFO - __main__ - Step 135828: {'lr': 1.1228758369330766e-05, 'samples': 26078976, 'steps': 135827, 'loss/train': 1.285169005393982} 11/07/2021 16:23:18 - INFO - __main__ - Step 135829: {'lr': 1.1227185864488205e-05, 'samples': 26079168, 'steps': 135828, 'loss/train': 0.7227658033370972} 11/07/2021 16:23:19 - INFO - __main__ - Step 135830: {'lr': 1.1225613467232998e-05, 'samples': 26079360, 'steps': 135829, 'loss/train': 1.3610081672668457} 11/07/2021 16:23:20 - INFO - __main__ - Step 135831: {'lr': 1.1224041177565841e-05, 'samples': 26079552, 'steps': 135830, 'loss/train': 1.0325630903244019} 11/07/2021 16:23:20 - INFO - __main__ - Step 135832: {'lr': 1.1222468995487428e-05, 'samples': 26079744, 'steps': 135831, 'loss/train': 1.2406115531921387} 11/07/2021 16:23:21 - INFO - __main__ - Step 135833: {'lr': 1.1220896920998507e-05, 'samples': 26079936, 'steps': 135832, 'loss/train': 1.3234206438064575} 11/07/2021 16:23:21 - INFO - __main__ - Step 135834: {'lr': 1.1219324954099774e-05, 'samples': 26080128, 'steps': 135833, 'loss/train': 1.4832239151000977} 11/07/2021 16:23:21 - INFO - __main__ - Step 135835: {'lr': 1.1217753094791922e-05, 'samples': 26080320, 'steps': 135834, 'loss/train': 1.1567996740341187} 11/07/2021 16:23:23 - INFO - __main__ - Step 135836: {'lr': 1.1216181343075643e-05, 'samples': 26080512, 'steps': 135835, 'loss/train': 1.0627127885818481} 11/07/2021 16:23:23 - INFO - __main__ - Step 135837: {'lr': 1.1214609698951717e-05, 'samples': 26080704, 'steps': 135836, 'loss/train': 1.88348388671875} 11/07/2021 16:23:24 - INFO - __main__ - Step 135838: {'lr': 1.121303816242078e-05, 'samples': 26080896, 'steps': 135837, 'loss/train': 1.1451513767242432} 11/07/2021 16:23:24 - INFO - __main__ - Step 135839: {'lr': 1.1211466733483556e-05, 'samples': 26081088, 'steps': 135838, 'loss/train': 0.7188524007797241} 11/07/2021 16:23:24 - INFO - __main__ - Step 135840: {'lr': 1.1209895412140764e-05, 'samples': 26081280, 'steps': 135839, 'loss/train': 0.6617701649665833} 11/07/2021 16:23:25 - INFO - __main__ - Step 135841: {'lr': 1.12083241983931e-05, 'samples': 26081472, 'steps': 135840, 'loss/train': 1.2949559688568115} 11/07/2021 16:23:25 - INFO - __main__ - Step 135842: {'lr': 1.1206753092241284e-05, 'samples': 26081664, 'steps': 135841, 'loss/train': 1.1299484968185425} 11/07/2021 16:23:27 - INFO - __main__ - Step 135843: {'lr': 1.1205182093686011e-05, 'samples': 26081856, 'steps': 135842, 'loss/train': 1.4447115659713745} 11/07/2021 16:23:27 - INFO - __main__ - Step 135844: {'lr': 1.1203611202728004e-05, 'samples': 26082048, 'steps': 135843, 'loss/train': 1.272745132446289} 11/07/2021 16:23:27 - INFO - __main__ - Step 135845: {'lr': 1.1202040419367982e-05, 'samples': 26082240, 'steps': 135844, 'loss/train': 2.704756021499634} 11/07/2021 16:23:28 - INFO - __main__ - Step 135846: {'lr': 1.1200469743606612e-05, 'samples': 26082432, 'steps': 135845, 'loss/train': 0.23708315193653107} 11/07/2021 16:23:28 - INFO - __main__ - Step 135847: {'lr': 1.1198899175444643e-05, 'samples': 26082624, 'steps': 135846, 'loss/train': 1.2957425117492676} 11/07/2021 16:23:29 - INFO - __main__ - Step 135848: {'lr': 1.1197328714882743e-05, 'samples': 26082816, 'steps': 135847, 'loss/train': 0.936664879322052} 11/07/2021 16:23:29 - INFO - __main__ - Step 135849: {'lr': 1.119575836192166e-05, 'samples': 26083008, 'steps': 135848, 'loss/train': 1.4391146898269653} 11/07/2021 16:23:30 - INFO - __main__ - Step 135850: {'lr': 1.1194188116562087e-05, 'samples': 26083200, 'steps': 135849, 'loss/train': 1.4195338487625122} 11/07/2021 16:23:30 - INFO - __main__ - Step 135851: {'lr': 1.1192617978804748e-05, 'samples': 26083392, 'steps': 135850, 'loss/train': 1.073644757270813} 11/07/2021 16:23:31 - INFO - __main__ - Step 135852: {'lr': 1.1191047948650306e-05, 'samples': 26083584, 'steps': 135851, 'loss/train': 1.4183064699172974} 11/07/2021 16:23:31 - INFO - __main__ - Step 135853: {'lr': 1.1189478026099487e-05, 'samples': 26083776, 'steps': 135852, 'loss/train': 1.1923762559890747} 11/07/2021 16:23:32 - INFO - __main__ - Step 135854: {'lr': 1.1187908211153008e-05, 'samples': 26083968, 'steps': 135853, 'loss/train': 1.1548984050750732} 11/07/2021 16:23:32 - INFO - __main__ - Step 135855: {'lr': 1.1186338503811566e-05, 'samples': 26084160, 'steps': 135854, 'loss/train': 1.2615872621536255} 11/07/2021 16:23:33 - INFO - __main__ - Step 135856: {'lr': 1.1184768904075882e-05, 'samples': 26084352, 'steps': 135855, 'loss/train': 1.3687715530395508} 11/07/2021 16:23:33 - INFO - __main__ - Step 135857: {'lr': 1.1183199411946648e-05, 'samples': 26084544, 'steps': 135856, 'loss/train': 1.3370330333709717} 11/07/2021 16:23:33 - INFO - __main__ - Step 135858: {'lr': 1.118163002742459e-05, 'samples': 26084736, 'steps': 135857, 'loss/train': 1.7077851295471191} 11/07/2021 16:23:34 - INFO - __main__ - Step 135859: {'lr': 1.1180060750510396e-05, 'samples': 26084928, 'steps': 135858, 'loss/train': 1.36056649684906} 11/07/2021 16:23:35 - INFO - __main__ - Step 135860: {'lr': 1.1178491581204791e-05, 'samples': 26085120, 'steps': 135859, 'loss/train': 1.217451810836792} 11/07/2021 16:23:35 - INFO - __main__ - Step 135861: {'lr': 1.117692251950847e-05, 'samples': 26085312, 'steps': 135860, 'loss/train': 1.2704914808273315} 11/07/2021 16:23:35 - INFO - __main__ - Step 135862: {'lr': 1.1175353565422125e-05, 'samples': 26085504, 'steps': 135861, 'loss/train': 2.14909291267395} 11/07/2021 16:23:36 - INFO - __main__ - Step 135863: {'lr': 1.1173784718946506e-05, 'samples': 26085696, 'steps': 135862, 'loss/train': 1.2094011306762695} 11/07/2021 16:23:37 - INFO - __main__ - Step 135864: {'lr': 1.1172215980082307e-05, 'samples': 26085888, 'steps': 135863, 'loss/train': 1.0923010110855103} 11/07/2021 16:23:37 - INFO - __main__ - Step 135865: {'lr': 1.117064734883022e-05, 'samples': 26086080, 'steps': 135864, 'loss/train': 1.30128014087677} 11/07/2021 16:23:37 - INFO - __main__ - Step 135866: {'lr': 1.1169078825190915e-05, 'samples': 26086272, 'steps': 135865, 'loss/train': 1.4337890148162842} 11/07/2021 16:23:38 - INFO - __main__ - Step 135867: {'lr': 1.1167510409165166e-05, 'samples': 26086464, 'steps': 135866, 'loss/train': 1.7896745204925537} 11/07/2021 16:23:38 - INFO - __main__ - Step 135868: {'lr': 1.116594210075364e-05, 'samples': 26086656, 'steps': 135867, 'loss/train': 1.453705906867981} 11/07/2021 16:23:39 - INFO - __main__ - Step 135869: {'lr': 1.1164373899957059e-05, 'samples': 26086848, 'steps': 135868, 'loss/train': 1.6064375638961792} 11/07/2021 16:23:39 - INFO - __main__ - Step 135870: {'lr': 1.1162805806776117e-05, 'samples': 26087040, 'steps': 135869, 'loss/train': 1.4941506385803223} 11/07/2021 16:23:40 - INFO - __main__ - Step 135871: {'lr': 1.1161237821211534e-05, 'samples': 26087232, 'steps': 135870, 'loss/train': 1.1227607727050781} 11/07/2021 16:23:40 - INFO - __main__ - Step 135872: {'lr': 1.1159669943264006e-05, 'samples': 26087424, 'steps': 135871, 'loss/train': 1.8249189853668213} 11/07/2021 16:23:41 - INFO - __main__ - Step 135873: {'lr': 1.1158102172934254e-05, 'samples': 26087616, 'steps': 135872, 'loss/train': 1.5226432085037231} 11/07/2021 16:23:42 - INFO - __main__ - Step 135874: {'lr': 1.1156534510222971e-05, 'samples': 26087808, 'steps': 135873, 'loss/train': 1.4794219732284546} 11/07/2021 16:23:42 - INFO - __main__ - Step 135875: {'lr': 1.1154966955130853e-05, 'samples': 26088000, 'steps': 135874, 'loss/train': 1.3004575967788696} 11/07/2021 16:23:43 - INFO - __main__ - Step 135876: {'lr': 1.1153399507658646e-05, 'samples': 26088192, 'steps': 135875, 'loss/train': 1.2384660243988037} 11/07/2021 16:23:43 - INFO - __main__ - Step 135877: {'lr': 1.1151832167807018e-05, 'samples': 26088384, 'steps': 135876, 'loss/train': 1.4932277202606201} 11/07/2021 16:23:43 - INFO - __main__ - Step 135878: {'lr': 1.115026493557672e-05, 'samples': 26088576, 'steps': 135877, 'loss/train': 1.046754240989685} 11/07/2021 16:23:44 - INFO - __main__ - Step 135879: {'lr': 1.1148697810968416e-05, 'samples': 26088768, 'steps': 135878, 'loss/train': 1.0622568130493164} 11/07/2021 16:23:45 - INFO - __main__ - Step 135880: {'lr': 1.1147130793982802e-05, 'samples': 26088960, 'steps': 135879, 'loss/train': 1.0289242267608643} 11/07/2021 16:23:45 - INFO - __main__ - Step 135881: {'lr': 1.1145563884620625e-05, 'samples': 26089152, 'steps': 135880, 'loss/train': 1.3200455904006958} 11/07/2021 16:23:45 - INFO - __main__ - Step 135882: {'lr': 1.114399708288255e-05, 'samples': 26089344, 'steps': 135881, 'loss/train': 1.1749809980392456} 11/07/2021 16:23:46 - INFO - __main__ - Step 135883: {'lr': 1.1142430388769304e-05, 'samples': 26089536, 'steps': 135882, 'loss/train': 1.1216493844985962} 11/07/2021 16:23:46 - INFO - __main__ - Step 135884: {'lr': 1.1140863802281604e-05, 'samples': 26089728, 'steps': 135883, 'loss/train': 1.1848024129867554} 11/07/2021 16:23:47 - INFO - __main__ - Step 135885: {'lr': 1.1139297323420144e-05, 'samples': 26089920, 'steps': 135884, 'loss/train': 1.5805184841156006} 11/07/2021 16:23:47 - INFO - __main__ - Step 135886: {'lr': 1.113773095218562e-05, 'samples': 26090112, 'steps': 135885, 'loss/train': 0.6484628915786743} 11/07/2021 16:23:48 - INFO - __main__ - Step 135887: {'lr': 1.1136164688578753e-05, 'samples': 26090304, 'steps': 135886, 'loss/train': 1.436737060546875} 11/07/2021 16:23:48 - INFO - __main__ - Step 135888: {'lr': 1.1134598532600265e-05, 'samples': 26090496, 'steps': 135887, 'loss/train': 1.3085384368896484} 11/07/2021 16:23:48 - INFO - __main__ - Step 135889: {'lr': 1.113303248425082e-05, 'samples': 26090688, 'steps': 135888, 'loss/train': 1.1650985479354858} 11/07/2021 16:23:50 - INFO - __main__ - Step 135890: {'lr': 1.1131466543531144e-05, 'samples': 26090880, 'steps': 135889, 'loss/train': 1.1215577125549316} 11/07/2021 16:23:50 - INFO - __main__ - Step 135891: {'lr': 1.1129900710441981e-05, 'samples': 26091072, 'steps': 135890, 'loss/train': 1.6781333684921265} 11/07/2021 16:23:50 - INFO - __main__ - Step 135892: {'lr': 1.1128334984983973e-05, 'samples': 26091264, 'steps': 135891, 'loss/train': 1.1237504482269287} 11/07/2021 16:23:51 - INFO - __main__ - Step 135893: {'lr': 1.112676936715784e-05, 'samples': 26091456, 'steps': 135892, 'loss/train': 1.4934967756271362} 11/07/2021 16:23:51 - INFO - __main__ - Step 135894: {'lr': 1.1125203856964305e-05, 'samples': 26091648, 'steps': 135893, 'loss/train': 1.2569754123687744} 11/07/2021 16:23:52 - INFO - __main__ - Step 135895: {'lr': 1.112363845440406e-05, 'samples': 26091840, 'steps': 135894, 'loss/train': 1.318381905555725} 11/07/2021 16:23:53 - INFO - __main__ - Step 135896: {'lr': 1.1122073159477802e-05, 'samples': 26092032, 'steps': 135895, 'loss/train': 1.230330467224121} 11/07/2021 16:23:53 - INFO - __main__ - Step 135897: {'lr': 1.1120507972186277e-05, 'samples': 26092224, 'steps': 135896, 'loss/train': 1.1664772033691406} 11/07/2021 16:23:53 - INFO - __main__ - Step 135898: {'lr': 1.1118942892530154e-05, 'samples': 26092416, 'steps': 135897, 'loss/train': 1.3076491355895996} 11/07/2021 16:23:54 - INFO - __main__ - Step 135899: {'lr': 1.111737792051018e-05, 'samples': 26092608, 'steps': 135898, 'loss/train': 1.49860680103302} 11/07/2021 16:23:55 - INFO - __main__ - Step 135900: {'lr': 1.1115813056126995e-05, 'samples': 26092800, 'steps': 135899, 'loss/train': 1.116560697555542} 11/07/2021 16:23:55 - INFO - __main__ - Step 135901: {'lr': 1.1114248299381346e-05, 'samples': 26092992, 'steps': 135900, 'loss/train': 1.298383116722107} 11/07/2021 16:23:55 - INFO - __main__ - Step 135902: {'lr': 1.111268365027393e-05, 'samples': 26093184, 'steps': 135901, 'loss/train': 1.5361210107803345} 11/07/2021 16:23:56 - INFO - __main__ - Step 135903: {'lr': 1.1111119108805495e-05, 'samples': 26093376, 'steps': 135902, 'loss/train': 1.259149193763733} 11/07/2021 16:23:56 - INFO - __main__ - Step 135904: {'lr': 1.1109554674976651e-05, 'samples': 26093568, 'steps': 135903, 'loss/train': 1.1571927070617676} 11/07/2021 16:23:56 - INFO - __main__ - Step 135905: {'lr': 1.1107990348788178e-05, 'samples': 26093760, 'steps': 135904, 'loss/train': 1.2547963857650757} 11/07/2021 16:23:57 - INFO - __main__ - Step 135906: {'lr': 1.1106426130240738e-05, 'samples': 26093952, 'steps': 135905, 'loss/train': 1.375842571258545} 11/07/2021 16:23:58 - INFO - __main__ - Step 135907: {'lr': 1.1104862019335055e-05, 'samples': 26094144, 'steps': 135906, 'loss/train': 1.346784234046936} 11/07/2021 16:23:58 - INFO - __main__ - Step 135908: {'lr': 1.110329801607185e-05, 'samples': 26094336, 'steps': 135907, 'loss/train': 1.3007221221923828} 11/07/2021 16:23:59 - INFO - __main__ - Step 135909: {'lr': 1.1101734120451818e-05, 'samples': 26094528, 'steps': 135908, 'loss/train': 1.6352767944335938} 11/07/2021 16:23:59 - INFO - __main__ - Step 135910: {'lr': 1.110017033247565e-05, 'samples': 26094720, 'steps': 135909, 'loss/train': 1.353910207748413} 11/07/2021 16:24:00 - INFO - __main__ - Step 135911: {'lr': 1.1098606652144045e-05, 'samples': 26094912, 'steps': 135910, 'loss/train': 0.7908491492271423} 11/07/2021 16:24:00 - INFO - __main__ - Step 135912: {'lr': 1.1097043079457747e-05, 'samples': 26095104, 'steps': 135911, 'loss/train': 1.4622541666030884} 11/07/2021 16:24:01 - INFO - __main__ - Step 135913: {'lr': 1.1095479614417425e-05, 'samples': 26095296, 'steps': 135912, 'loss/train': 1.0851231813430786} 11/07/2021 16:24:01 - INFO - __main__ - Step 135914: {'lr': 1.10939162570238e-05, 'samples': 26095488, 'steps': 135913, 'loss/train': 1.4364128112792969} 11/07/2021 16:24:01 - INFO - __main__ - Step 135915: {'lr': 1.1092353007277567e-05, 'samples': 26095680, 'steps': 135914, 'loss/train': 1.1298596858978271} 11/07/2021 16:24:02 - INFO - __main__ - Step 135916: {'lr': 1.1090789865179417e-05, 'samples': 26095872, 'steps': 135915, 'loss/train': 1.737837553024292} 11/07/2021 16:24:03 - INFO - __main__ - Step 135917: {'lr': 1.1089226830730075e-05, 'samples': 26096064, 'steps': 135916, 'loss/train': 1.3139835596084595} 11/07/2021 16:24:03 - INFO - __main__ - Step 135918: {'lr': 1.1087663903930262e-05, 'samples': 26096256, 'steps': 135917, 'loss/train': 1.1767408847808838} 11/07/2021 16:24:03 - INFO - __main__ - Step 135919: {'lr': 1.1086101084780642e-05, 'samples': 26096448, 'steps': 135918, 'loss/train': 1.4173767566680908} 11/07/2021 16:24:04 - INFO - __main__ - Step 135920: {'lr': 1.1084538373281939e-05, 'samples': 26096640, 'steps': 135919, 'loss/train': 0.9860000610351562} 11/07/2021 16:24:05 - INFO - __main__ - Step 135921: {'lr': 1.1082975769434845e-05, 'samples': 26096832, 'steps': 135920, 'loss/train': 1.6400622129440308} 11/07/2021 16:24:05 - INFO - __main__ - Step 135922: {'lr': 1.1081413273240109e-05, 'samples': 26097024, 'steps': 135921, 'loss/train': 1.2697639465332031} 11/07/2021 16:24:06 - INFO - __main__ - Step 135923: {'lr': 1.1079850884698373e-05, 'samples': 26097216, 'steps': 135922, 'loss/train': 1.264234185218811} 11/07/2021 16:24:06 - INFO - __main__ - Step 135924: {'lr': 1.1078288603810383e-05, 'samples': 26097408, 'steps': 135923, 'loss/train': 1.1288925409317017} 11/07/2021 16:24:06 - INFO - __main__ - Step 135925: {'lr': 1.1076726430576833e-05, 'samples': 26097600, 'steps': 135924, 'loss/train': 1.0544384717941284} 11/07/2021 16:24:07 - INFO - __main__ - Step 135926: {'lr': 1.1075164364998418e-05, 'samples': 26097792, 'steps': 135925, 'loss/train': 1.026249885559082} 11/07/2021 16:24:08 - INFO - __main__ - Step 135927: {'lr': 1.1073602407075861e-05, 'samples': 26097984, 'steps': 135926, 'loss/train': 1.3145687580108643} 11/07/2021 16:24:08 - INFO - __main__ - Step 135928: {'lr': 1.1072040556809826e-05, 'samples': 26098176, 'steps': 135927, 'loss/train': 0.9264113903045654} 11/07/2021 16:24:08 - INFO - __main__ - Step 135929: {'lr': 1.1070478814201035e-05, 'samples': 26098368, 'steps': 135928, 'loss/train': 1.7695808410644531} 11/07/2021 16:24:09 - INFO - __main__ - Step 135930: {'lr': 1.106891717925021e-05, 'samples': 26098560, 'steps': 135929, 'loss/train': 1.629459261894226} 11/07/2021 16:24:09 - INFO - __main__ - Step 135931: {'lr': 1.1067355651958072e-05, 'samples': 26098752, 'steps': 135930, 'loss/train': 1.919874906539917} 11/07/2021 16:24:10 - INFO - __main__ - Step 135932: {'lr': 1.1065794232325261e-05, 'samples': 26098944, 'steps': 135931, 'loss/train': 1.052774429321289} 11/07/2021 16:24:10 - INFO - __main__ - Step 135933: {'lr': 1.1064232920352523e-05, 'samples': 26099136, 'steps': 135932, 'loss/train': 1.2740932703018188} 11/07/2021 16:24:11 - INFO - __main__ - Step 135934: {'lr': 1.1062671716040556e-05, 'samples': 26099328, 'steps': 135933, 'loss/train': 1.4317675828933716} 11/07/2021 16:24:11 - INFO - __main__ - Step 135935: {'lr': 1.106111061939008e-05, 'samples': 26099520, 'steps': 135934, 'loss/train': 1.083873987197876} 11/07/2021 16:24:12 - INFO - __main__ - Step 135936: {'lr': 1.1059549630401788e-05, 'samples': 26099712, 'steps': 135935, 'loss/train': 0.9640196561813354} 11/07/2021 16:24:12 - INFO - __main__ - Step 135937: {'lr': 1.1057988749076347e-05, 'samples': 26099904, 'steps': 135936, 'loss/train': 1.305806040763855} 11/07/2021 16:24:13 - INFO - __main__ - Step 135938: {'lr': 1.1056427975414508e-05, 'samples': 26100096, 'steps': 135937, 'loss/train': 1.4679425954818726} 11/07/2021 16:24:13 - INFO - __main__ - Step 135939: {'lr': 1.1054867309416932e-05, 'samples': 26100288, 'steps': 135938, 'loss/train': 1.5653325319290161} 11/07/2021 16:24:14 - INFO - __main__ - Step 135940: {'lr': 1.1053306751084375e-05, 'samples': 26100480, 'steps': 135939, 'loss/train': 2.069354772567749} 11/07/2021 16:24:14 - INFO - __main__ - Step 135941: {'lr': 1.105174630041747e-05, 'samples': 26100672, 'steps': 135940, 'loss/train': 1.2276313304901123} 11/07/2021 16:24:15 - INFO - __main__ - Step 135942: {'lr': 1.1050185957416997e-05, 'samples': 26100864, 'steps': 135941, 'loss/train': 1.1166315078735352} 11/07/2021 16:24:15 - INFO - __main__ - Step 135943: {'lr': 1.1048625722083622e-05, 'samples': 26101056, 'steps': 135942, 'loss/train': 0.9213332533836365} 11/07/2021 16:24:16 - INFO - __main__ - Step 135944: {'lr': 1.1047065594418038e-05, 'samples': 26101248, 'steps': 135943, 'loss/train': 1.3526712656021118} 11/07/2021 16:24:16 - INFO - __main__ - Step 135945: {'lr': 1.104550557442094e-05, 'samples': 26101440, 'steps': 135944, 'loss/train': 1.5509140491485596} 11/07/2021 16:24:16 - INFO - __main__ - Step 135946: {'lr': 1.1043945662093074e-05, 'samples': 26101632, 'steps': 135945, 'loss/train': 0.8488993048667908} 11/07/2021 16:24:18 - INFO - __main__ - Step 135947: {'lr': 1.1042385857435139e-05, 'samples': 26101824, 'steps': 135946, 'loss/train': 0.5976818203926086} 11/07/2021 16:24:18 - INFO - __main__ - Step 135948: {'lr': 1.1040826160447797e-05, 'samples': 26102016, 'steps': 135947, 'loss/train': 1.137189507484436} 11/07/2021 16:24:18 - INFO - __main__ - Step 135949: {'lr': 1.1039266571131773e-05, 'samples': 26102208, 'steps': 135948, 'loss/train': 1.276670217514038} 11/07/2021 16:24:19 - INFO - __main__ - Step 135950: {'lr': 1.1037707089487759e-05, 'samples': 26102400, 'steps': 135949, 'loss/train': 1.3396425247192383} 11/07/2021 16:24:19 - INFO - __main__ - Step 135951: {'lr': 1.1036147715516448e-05, 'samples': 26102592, 'steps': 135950, 'loss/train': 1.3388789892196655} 11/07/2021 16:24:20 - INFO - __main__ - Step 135952: {'lr': 1.103458844921859e-05, 'samples': 26102784, 'steps': 135951, 'loss/train': 0.5991871356964111} 11/07/2021 16:24:20 - INFO - __main__ - Step 135953: {'lr': 1.1033029290594855e-05, 'samples': 26102976, 'steps': 135952, 'loss/train': 1.576806664466858} 11/07/2021 16:24:21 - INFO - __main__ - Step 135954: {'lr': 1.103147023964593e-05, 'samples': 26103168, 'steps': 135953, 'loss/train': 1.4622650146484375} 11/07/2021 16:24:21 - INFO - __main__ - Step 135955: {'lr': 1.1029911296372569e-05, 'samples': 26103360, 'steps': 135954, 'loss/train': 1.3660430908203125} 11/07/2021 16:24:21 - INFO - __main__ - Step 135956: {'lr': 1.102835246077541e-05, 'samples': 26103552, 'steps': 135955, 'loss/train': 1.3965903520584106} 11/07/2021 16:24:22 - INFO - __main__ - Step 135957: {'lr': 1.1026793732855229e-05, 'samples': 26103744, 'steps': 135956, 'loss/train': 1.6428638696670532} 11/07/2021 16:24:23 - INFO - __main__ - Step 135958: {'lr': 1.1025235112612691e-05, 'samples': 26103936, 'steps': 135957, 'loss/train': 1.1151448488235474} 11/07/2021 16:24:23 - INFO - __main__ - Step 135959: {'lr': 1.1023676600048465e-05, 'samples': 26104128, 'steps': 135958, 'loss/train': 1.440635085105896} 11/07/2021 16:24:24 - INFO - __main__ - Step 135960: {'lr': 1.1022118195163272e-05, 'samples': 26104320, 'steps': 135959, 'loss/train': 0.9425734877586365} 11/07/2021 16:24:24 - INFO - __main__ - Step 135961: {'lr': 1.1020559897957832e-05, 'samples': 26104512, 'steps': 135960, 'loss/train': 1.2859517335891724} 11/07/2021 16:24:25 - INFO - __main__ - Step 135962: {'lr': 1.1019001708432841e-05, 'samples': 26104704, 'steps': 135961, 'loss/train': 1.0409389734268188} 11/07/2021 16:24:25 - INFO - __main__ - Step 135963: {'lr': 1.101744362658902e-05, 'samples': 26104896, 'steps': 135962, 'loss/train': 1.400393009185791} 11/07/2021 16:24:26 - INFO - __main__ - Step 135964: {'lr': 1.1015885652427032e-05, 'samples': 26105088, 'steps': 135963, 'loss/train': 1.0293604135513306} 11/07/2021 16:24:26 - INFO - __main__ - Step 135965: {'lr': 1.1014327785947604e-05, 'samples': 26105280, 'steps': 135964, 'loss/train': 1.2993381023406982} 11/07/2021 16:24:26 - INFO - __main__ - Step 135966: {'lr': 1.1012770027151425e-05, 'samples': 26105472, 'steps': 135965, 'loss/train': 1.2360873222351074} 11/07/2021 16:24:27 - INFO - __main__ - Step 135967: {'lr': 1.1011212376039193e-05, 'samples': 26105664, 'steps': 135966, 'loss/train': 1.5484520196914673} 11/07/2021 16:24:28 - INFO - __main__ - Step 135968: {'lr': 1.1009654832611627e-05, 'samples': 26105856, 'steps': 135967, 'loss/train': 1.0246039628982544} 11/07/2021 16:24:28 - INFO - __main__ - Step 135969: {'lr': 1.1008097396869448e-05, 'samples': 26106048, 'steps': 135968, 'loss/train': 1.6305508613586426} 11/07/2021 16:24:28 - INFO - __main__ - Step 135970: {'lr': 1.1006540068813297e-05, 'samples': 26106240, 'steps': 135969, 'loss/train': 1.0176851749420166} 11/07/2021 16:24:29 - INFO - __main__ - Step 135971: {'lr': 1.1004982848443951e-05, 'samples': 26106432, 'steps': 135970, 'loss/train': 1.5143083333969116} 11/07/2021 16:24:29 - INFO - __main__ - Step 135972: {'lr': 1.1003425735762073e-05, 'samples': 26106624, 'steps': 135971, 'loss/train': 1.2302467823028564} 11/07/2021 16:24:30 - INFO - __main__ - Step 135973: {'lr': 1.1001868730768334e-05, 'samples': 26106816, 'steps': 135972, 'loss/train': 1.4121992588043213} 11/07/2021 16:24:31 - INFO - __main__ - Step 135974: {'lr': 1.1000311833463478e-05, 'samples': 26107008, 'steps': 135973, 'loss/train': 1.2488853931427002} 11/07/2021 16:24:31 - INFO - __main__ - Step 135975: {'lr': 1.0998755043848174e-05, 'samples': 26107200, 'steps': 135974, 'loss/train': 0.887565553188324} 11/07/2021 16:24:31 - INFO - __main__ - Step 135976: {'lr': 1.0997198361923172e-05, 'samples': 26107392, 'steps': 135975, 'loss/train': 1.4317946434020996} 11/07/2021 16:24:32 - INFO - __main__ - Step 135977: {'lr': 1.0995641787689137e-05, 'samples': 26107584, 'steps': 135976, 'loss/train': 1.7324588298797607} 11/07/2021 16:24:33 - INFO - __main__ - Step 135978: {'lr': 1.0994085321146763e-05, 'samples': 26107776, 'steps': 135977, 'loss/train': 1.471434235572815} 11/07/2021 16:24:33 - INFO - __main__ - Step 135979: {'lr': 1.0992528962296772e-05, 'samples': 26107968, 'steps': 135978, 'loss/train': 1.6350090503692627} 11/07/2021 16:24:33 - INFO - __main__ - Step 135980: {'lr': 1.0990972711139857e-05, 'samples': 26108160, 'steps': 135979, 'loss/train': 0.9829590916633606} 11/07/2021 16:24:34 - INFO - __main__ - Step 135981: {'lr': 1.0989416567676713e-05, 'samples': 26108352, 'steps': 135980, 'loss/train': 1.553018569946289} 11/07/2021 16:24:34 - INFO - __main__ - Step 135982: {'lr': 1.0987860531908062e-05, 'samples': 26108544, 'steps': 135981, 'loss/train': 0.899009644985199} 11/07/2021 16:24:35 - INFO - __main__ - Step 135983: {'lr': 1.0986304603834595e-05, 'samples': 26108736, 'steps': 135982, 'loss/train': 1.8798460960388184} 11/07/2021 16:24:35 - INFO - __main__ - Step 135984: {'lr': 1.098474878345701e-05, 'samples': 26108928, 'steps': 135983, 'loss/train': 1.6894962787628174} 11/07/2021 16:24:36 - INFO - __main__ - Step 135985: {'lr': 1.0983193070776055e-05, 'samples': 26109120, 'steps': 135984, 'loss/train': 1.4784181118011475} 11/07/2021 16:24:36 - INFO - __main__ - Step 135986: {'lr': 1.098163746579231e-05, 'samples': 26109312, 'steps': 135985, 'loss/train': 1.2890688180923462} 11/07/2021 16:24:37 - INFO - __main__ - Step 135987: {'lr': 1.0980081968506584e-05, 'samples': 26109504, 'steps': 135986, 'loss/train': 1.4680942296981812} 11/07/2021 16:24:38 - INFO - __main__ - Step 135988: {'lr': 1.097852657891954e-05, 'samples': 26109696, 'steps': 135987, 'loss/train': 1.3710263967514038} 11/07/2021 16:24:38 - INFO - __main__ - Step 135989: {'lr': 1.0976971297031873e-05, 'samples': 26109888, 'steps': 135988, 'loss/train': 1.2343313694000244} 11/07/2021 16:24:38 - INFO - __main__ - Step 135990: {'lr': 1.0975416122844306e-05, 'samples': 26110080, 'steps': 135989, 'loss/train': 1.284414529800415} 11/07/2021 16:24:39 - INFO - __main__ - Step 135991: {'lr': 1.0973861056357532e-05, 'samples': 26110272, 'steps': 135990, 'loss/train': 0.8724243640899658} 11/07/2021 16:24:39 - INFO - __main__ - Step 135992: {'lr': 1.0972306097572244e-05, 'samples': 26110464, 'steps': 135991, 'loss/train': 1.6659480333328247} 11/07/2021 16:24:40 - INFO - __main__ - Step 135993: {'lr': 1.0970751246489135e-05, 'samples': 26110656, 'steps': 135992, 'loss/train': 1.7854419946670532} 11/07/2021 16:24:40 - INFO - __main__ - Step 135994: {'lr': 1.096919650310893e-05, 'samples': 26110848, 'steps': 135993, 'loss/train': 1.6473886966705322} 11/07/2021 16:24:41 - INFO - __main__ - Step 135995: {'lr': 1.0967641867432322e-05, 'samples': 26111040, 'steps': 135994, 'loss/train': 1.4190911054611206} 11/07/2021 16:24:41 - INFO - __main__ - Step 135996: {'lr': 1.0966087339460001e-05, 'samples': 26111232, 'steps': 135995, 'loss/train': 0.9373067021369934} 11/07/2021 16:24:41 - INFO - __main__ - Step 135997: {'lr': 1.0964532919192666e-05, 'samples': 26111424, 'steps': 135996, 'loss/train': 1.613310694694519} 11/07/2021 16:24:43 - INFO - __main__ - Step 135998: {'lr': 1.0962978606631007e-05, 'samples': 26111616, 'steps': 135997, 'loss/train': 1.1103758811950684} 11/07/2021 16:24:43 - INFO - __main__ - Step 135999: {'lr': 1.0961424401775805e-05, 'samples': 26111808, 'steps': 135998, 'loss/train': 1.6099750995635986} 11/07/2021 16:24:44 - INFO - __main__ - Step 136000: {'lr': 1.095987030462764e-05, 'samples': 26112000, 'steps': 135999, 'loss/train': 0.6269552111625671} 11/07/2021 16:24:44 - INFO - __main__ - Step 136001: {'lr': 1.0958316315187289e-05, 'samples': 26112192, 'steps': 136000, 'loss/train': 0.724662184715271} 11/07/2021 16:24:44 - INFO - __main__ - Step 136002: {'lr': 1.095676243345542e-05, 'samples': 26112384, 'steps': 136001, 'loss/train': 1.222910761833191} 11/07/2021 16:24:45 - INFO - __main__ - Step 136003: {'lr': 1.0955208659432752e-05, 'samples': 26112576, 'steps': 136002, 'loss/train': 1.2915112972259521} 11/07/2021 16:24:46 - INFO - __main__ - Step 136004: {'lr': 1.0953654993119982e-05, 'samples': 26112768, 'steps': 136003, 'loss/train': 0.8742798566818237} 11/07/2021 16:24:46 - INFO - __main__ - Step 136005: {'lr': 1.0952101434517803e-05, 'samples': 26112960, 'steps': 136004, 'loss/train': 1.1946970224380493} 11/07/2021 16:24:46 - INFO - __main__ - Step 136006: {'lr': 1.0950547983626907e-05, 'samples': 26113152, 'steps': 136005, 'loss/train': 1.4912012815475464} 11/07/2021 16:24:47 - INFO - __main__ - Step 136007: {'lr': 1.0948994640448018e-05, 'samples': 26113344, 'steps': 136006, 'loss/train': 1.5001477003097534} 11/07/2021 16:24:47 - INFO - __main__ - Step 136008: {'lr': 1.09474414049818e-05, 'samples': 26113536, 'steps': 136007, 'loss/train': 1.2537329196929932} 11/07/2021 16:24:47 - INFO - __main__ - Step 136009: {'lr': 1.0945888277229005e-05, 'samples': 26113728, 'steps': 136008, 'loss/train': 1.0071651935577393} 11/07/2021 16:24:49 - INFO - __main__ - Step 136010: {'lr': 1.0944335257190296e-05, 'samples': 26113920, 'steps': 136009, 'loss/train': 1.1117295026779175} 11/07/2021 16:24:49 - INFO - __main__ - Step 136011: {'lr': 1.094278234486637e-05, 'samples': 26114112, 'steps': 136010, 'loss/train': 1.810054898262024} 11/07/2021 16:24:49 - INFO - __main__ - Step 136012: {'lr': 1.0941229540257974e-05, 'samples': 26114304, 'steps': 136011, 'loss/train': 1.2417514324188232} 11/07/2021 16:24:50 - INFO - __main__ - Step 136013: {'lr': 1.0939676843365748e-05, 'samples': 26114496, 'steps': 136012, 'loss/train': 1.3012884855270386} 11/07/2021 16:24:50 - INFO - __main__ - Step 136014: {'lr': 1.0938124254190412e-05, 'samples': 26114688, 'steps': 136013, 'loss/train': 0.9868929386138916} 11/07/2021 16:24:51 - INFO - __main__ - Step 136015: {'lr': 1.0936571772732662e-05, 'samples': 26114880, 'steps': 136014, 'loss/train': 1.2325401306152344} 11/07/2021 16:24:52 - INFO - __main__ - Step 136016: {'lr': 1.0935019398993218e-05, 'samples': 26115072, 'steps': 136015, 'loss/train': 1.607596516609192} 11/07/2021 16:24:52 - INFO - __main__ - Step 136017: {'lr': 1.0933467132972747e-05, 'samples': 26115264, 'steps': 136016, 'loss/train': 0.5137472152709961} 11/07/2021 16:24:52 - INFO - __main__ - Step 136018: {'lr': 1.0931914974671969e-05, 'samples': 26115456, 'steps': 136017, 'loss/train': 1.5856891870498657} 11/07/2021 16:24:53 - INFO - __main__ - Step 136019: {'lr': 1.093036292409158e-05, 'samples': 26115648, 'steps': 136018, 'loss/train': 1.1912888288497925} 11/07/2021 16:24:54 - INFO - __main__ - Step 136020: {'lr': 1.09288109812323e-05, 'samples': 26115840, 'steps': 136019, 'loss/train': 3.1832971572875977} 11/07/2021 16:24:54 - INFO - __main__ - Step 136021: {'lr': 1.0927259146094798e-05, 'samples': 26116032, 'steps': 136020, 'loss/train': 1.4989163875579834} 11/07/2021 16:24:54 - INFO - __main__ - Step 136022: {'lr': 1.0925707418679765e-05, 'samples': 26116224, 'steps': 136021, 'loss/train': 1.3472297191619873} 11/07/2021 16:24:55 - INFO - __main__ - Step 136023: {'lr': 1.092415579898795e-05, 'samples': 26116416, 'steps': 136022, 'loss/train': 1.2513880729675293} 11/07/2021 16:24:55 - INFO - __main__ - Step 136024: {'lr': 1.0922604287019994e-05, 'samples': 26116608, 'steps': 136023, 'loss/train': 1.2824125289916992} 11/07/2021 16:24:56 - INFO - __main__ - Step 136025: {'lr': 1.0921052882776645e-05, 'samples': 26116800, 'steps': 136024, 'loss/train': 0.9044393301010132} 11/07/2021 16:24:56 - INFO - __main__ - Step 136026: {'lr': 1.0919501586258595e-05, 'samples': 26116992, 'steps': 136025, 'loss/train': 1.3093823194503784} 11/07/2021 16:24:57 - INFO - __main__ - Step 136027: {'lr': 1.0917950397466513e-05, 'samples': 26117184, 'steps': 136026, 'loss/train': 1.3616862297058105} 11/07/2021 16:24:57 - INFO - __main__ - Step 136028: {'lr': 1.0916399316401094e-05, 'samples': 26117376, 'steps': 136027, 'loss/train': 0.7951229810714722} 11/07/2021 16:24:58 - INFO - __main__ - Step 136029: {'lr': 1.0914848343063083e-05, 'samples': 26117568, 'steps': 136028, 'loss/train': 0.9836952686309814} 11/07/2021 16:24:59 - INFO - __main__ - Step 136030: {'lr': 1.091329747745312e-05, 'samples': 26117760, 'steps': 136029, 'loss/train': 1.5250229835510254} 11/07/2021 16:24:59 - INFO - __main__ - Step 136031: {'lr': 1.0911746719571958e-05, 'samples': 26117952, 'steps': 136030, 'loss/train': 1.6242269277572632} 11/07/2021 16:24:59 - INFO - __main__ - Step 136032: {'lr': 1.0910196069420286e-05, 'samples': 26118144, 'steps': 136031, 'loss/train': 1.3884594440460205} 11/07/2021 16:25:00 - INFO - __main__ - Step 136033: {'lr': 1.0908645526998745e-05, 'samples': 26118336, 'steps': 136032, 'loss/train': 1.3458746671676636} 11/07/2021 16:25:00 - INFO - __main__ - Step 136034: {'lr': 1.0907095092308111e-05, 'samples': 26118528, 'steps': 136033, 'loss/train': 1.1534696817398071} 11/07/2021 16:25:00 - INFO - __main__ - Step 136035: {'lr': 1.0905544765349052e-05, 'samples': 26118720, 'steps': 136034, 'loss/train': 0.9096169471740723} 11/07/2021 16:25:01 - INFO - __main__ - Step 136036: {'lr': 1.090399454612226e-05, 'samples': 26118912, 'steps': 136035, 'loss/train': 1.5011122226715088} 11/07/2021 16:25:02 - INFO - __main__ - Step 136037: {'lr': 1.0902444434628427e-05, 'samples': 26119104, 'steps': 136036, 'loss/train': 1.2778856754302979} 11/07/2021 16:25:02 - INFO - __main__ - Step 136038: {'lr': 1.090089443086828e-05, 'samples': 26119296, 'steps': 136037, 'loss/train': 1.0556584596633911} 11/07/2021 16:25:02 - INFO - __main__ - Step 136039: {'lr': 1.0899344534842537e-05, 'samples': 26119488, 'steps': 136038, 'loss/train': 1.2031880617141724} 11/07/2021 16:25:03 - INFO - __main__ - Step 136040: {'lr': 1.0897794746551808e-05, 'samples': 26119680, 'steps': 136039, 'loss/train': 2.234074831008911} 11/07/2021 16:25:04 - INFO - __main__ - Step 136041: {'lr': 1.0896245065996845e-05, 'samples': 26119872, 'steps': 136040, 'loss/train': 1.1345266103744507} 11/07/2021 16:25:04 - INFO - __main__ - Step 136042: {'lr': 1.089469549317834e-05, 'samples': 26120064, 'steps': 136041, 'loss/train': 0.7681853175163269} 11/07/2021 16:25:05 - INFO - __main__ - Step 136043: {'lr': 1.0893146028097018e-05, 'samples': 26120256, 'steps': 136042, 'loss/train': 1.4235498905181885} 11/07/2021 16:25:05 - INFO - __main__ - Step 136044: {'lr': 1.089159667075354e-05, 'samples': 26120448, 'steps': 136043, 'loss/train': 1.2844810485839844} 11/07/2021 16:25:05 - INFO - __main__ - Step 136045: {'lr': 1.089004742114863e-05, 'samples': 26120640, 'steps': 136044, 'loss/train': 0.7875595688819885} 11/07/2021 16:25:06 - INFO - __main__ - Step 136046: {'lr': 1.0888498279282955e-05, 'samples': 26120832, 'steps': 136045, 'loss/train': 2.2127232551574707} 11/07/2021 16:25:07 - INFO - __main__ - Step 136047: {'lr': 1.0886949245157262e-05, 'samples': 26121024, 'steps': 136046, 'loss/train': 0.2830113470554352} 11/07/2021 16:25:07 - INFO - __main__ - Step 136048: {'lr': 1.0885400318772192e-05, 'samples': 26121216, 'steps': 136047, 'loss/train': 1.6807587146759033} 11/07/2021 16:25:07 - INFO - __main__ - Step 136049: {'lr': 1.0883851500128495e-05, 'samples': 26121408, 'steps': 136048, 'loss/train': 1.4941964149475098} 11/07/2021 16:25:08 - INFO - __main__ - Step 136050: {'lr': 1.0882302789226833e-05, 'samples': 26121600, 'steps': 136049, 'loss/train': 1.419342279434204} 11/07/2021 16:25:09 - INFO - __main__ - Step 136051: {'lr': 1.0880754186067904e-05, 'samples': 26121792, 'steps': 136050, 'loss/train': 1.482743740081787} 11/07/2021 16:25:09 - INFO - __main__ - Step 136052: {'lr': 1.0879205690652428e-05, 'samples': 26121984, 'steps': 136051, 'loss/train': 5.65468692779541} 11/07/2021 16:25:10 - INFO - __main__ - Step 136053: {'lr': 1.0877657302981125e-05, 'samples': 26122176, 'steps': 136052, 'loss/train': 1.4112244844436646} 11/07/2021 16:25:10 - INFO - __main__ - Step 136054: {'lr': 1.087610902305461e-05, 'samples': 26122368, 'steps': 136053, 'loss/train': 1.5762730836868286} 11/07/2021 16:25:10 - INFO - __main__ - Step 136055: {'lr': 1.0874560850873655e-05, 'samples': 26122560, 'steps': 136054, 'loss/train': 1.5135222673416138} 11/07/2021 16:25:11 - INFO - __main__ - Step 136056: {'lr': 1.087301278643893e-05, 'samples': 26122752, 'steps': 136055, 'loss/train': 1.4752066135406494} 11/07/2021 16:25:12 - INFO - __main__ - Step 136057: {'lr': 1.0871464829751126e-05, 'samples': 26122944, 'steps': 136056, 'loss/train': 1.0672423839569092} 11/07/2021 16:25:12 - INFO - __main__ - Step 136058: {'lr': 1.086991698081094e-05, 'samples': 26123136, 'steps': 136057, 'loss/train': 1.3761041164398193} 11/07/2021 16:25:12 - INFO - __main__ - Step 136059: {'lr': 1.086836923961912e-05, 'samples': 26123328, 'steps': 136058, 'loss/train': 1.8215714693069458} 11/07/2021 16:25:13 - INFO - __main__ - Step 136060: {'lr': 1.0866821606176274e-05, 'samples': 26123520, 'steps': 136059, 'loss/train': 1.330074667930603} 11/07/2021 16:25:13 - INFO - __main__ - Step 136061: {'lr': 1.0865274080483185e-05, 'samples': 26123712, 'steps': 136060, 'loss/train': 1.5550107955932617} 11/07/2021 16:25:14 - INFO - __main__ - Step 136062: {'lr': 1.0863726662540484e-05, 'samples': 26123904, 'steps': 136061, 'loss/train': 1.1513752937316895} 11/07/2021 16:25:15 - INFO - __main__ - Step 136063: {'lr': 1.0862179352348928e-05, 'samples': 26124096, 'steps': 136062, 'loss/train': 1.6476315259933472} 11/07/2021 16:25:15 - INFO - __main__ - Step 136064: {'lr': 1.086063214990915e-05, 'samples': 26124288, 'steps': 136063, 'loss/train': 0.6262418031692505} 11/07/2021 16:25:15 - INFO - __main__ - Step 136065: {'lr': 1.0859085055221901e-05, 'samples': 26124480, 'steps': 136064, 'loss/train': 0.9805464744567871} 11/07/2021 16:25:16 - INFO - __main__ - Step 136066: {'lr': 1.0857538068287903e-05, 'samples': 26124672, 'steps': 136065, 'loss/train': 1.5807585716247559} 11/07/2021 16:25:17 - INFO - __main__ - Step 136067: {'lr': 1.0855991189107767e-05, 'samples': 26124864, 'steps': 136066, 'loss/train': 1.4609580039978027} 11/07/2021 16:25:17 - INFO - __main__ - Step 136068: {'lr': 1.0854444417682213e-05, 'samples': 26125056, 'steps': 136067, 'loss/train': 1.483200192451477} 11/07/2021 16:25:17 - INFO - __main__ - Step 136069: {'lr': 1.0852897754011964e-05, 'samples': 26125248, 'steps': 136068, 'loss/train': 1.6044657230377197} 11/07/2021 16:25:18 - INFO - __main__ - Step 136070: {'lr': 1.0851351198097715e-05, 'samples': 26125440, 'steps': 136069, 'loss/train': 1.3764426708221436} 11/07/2021 16:25:18 - INFO - __main__ - Step 136071: {'lr': 1.0849804749940156e-05, 'samples': 26125632, 'steps': 136070, 'loss/train': 1.3418515920639038} 11/07/2021 16:25:19 - INFO - __main__ - Step 136072: {'lr': 1.0848258409539985e-05, 'samples': 26125824, 'steps': 136071, 'loss/train': 1.9178704023361206} 11/07/2021 16:25:19 - INFO - __main__ - Step 136073: {'lr': 1.0846712176897893e-05, 'samples': 26126016, 'steps': 136072, 'loss/train': 1.100514531135559} 11/07/2021 16:25:20 - INFO - __main__ - Step 136074: {'lr': 1.0845166052014604e-05, 'samples': 26126208, 'steps': 136073, 'loss/train': 1.1261621713638306} 11/07/2021 16:25:20 - INFO - __main__ - Step 136075: {'lr': 1.0843620034890756e-05, 'samples': 26126400, 'steps': 136074, 'loss/train': 0.9665152430534363} 11/07/2021 16:25:20 - INFO - __main__ - Step 136076: {'lr': 1.0842074125527096e-05, 'samples': 26126592, 'steps': 136075, 'loss/train': 1.527390718460083} 11/07/2021 16:25:21 - INFO - __main__ - Step 136077: {'lr': 1.084052832392432e-05, 'samples': 26126784, 'steps': 136076, 'loss/train': 1.1702656745910645} 11/07/2021 16:25:22 - INFO - __main__ - Step 136078: {'lr': 1.0838982630083122e-05, 'samples': 26126976, 'steps': 136077, 'loss/train': 1.2285902500152588} 11/07/2021 16:25:23 - INFO - __main__ - Step 136079: {'lr': 1.0837437044004195e-05, 'samples': 26127168, 'steps': 136078, 'loss/train': 1.6770622730255127} 11/07/2021 16:25:23 - INFO - __main__ - Step 136080: {'lr': 1.0835891565688205e-05, 'samples': 26127360, 'steps': 136079, 'loss/train': 1.3968505859375} 11/07/2021 16:25:23 - INFO - __main__ - Step 136081: {'lr': 1.0834346195135874e-05, 'samples': 26127552, 'steps': 136080, 'loss/train': 1.2307252883911133} 11/07/2021 16:25:24 - INFO - __main__ - Step 136082: {'lr': 1.083280093234787e-05, 'samples': 26127744, 'steps': 136081, 'loss/train': 1.047673225402832} 11/07/2021 16:25:24 - INFO - __main__ - Step 136083: {'lr': 1.0831255777324939e-05, 'samples': 26127936, 'steps': 136082, 'loss/train': 0.6835839748382568} 11/07/2021 16:25:25 - INFO - __main__ - Step 136084: {'lr': 1.082971073006775e-05, 'samples': 26128128, 'steps': 136083, 'loss/train': 0.9724396467208862} 11/07/2021 16:25:25 - INFO - __main__ - Step 136085: {'lr': 1.0828165790577022e-05, 'samples': 26128320, 'steps': 136084, 'loss/train': 1.3674794435501099} 11/07/2021 16:25:26 - INFO - __main__ - Step 136086: {'lr': 1.0826620958853423e-05, 'samples': 26128512, 'steps': 136085, 'loss/train': 1.0890132188796997} 11/07/2021 16:25:26 - INFO - __main__ - Step 136087: {'lr': 1.0825076234897646e-05, 'samples': 26128704, 'steps': 136086, 'loss/train': 1.39522385597229} 11/07/2021 16:25:26 - INFO - __main__ - Step 136088: {'lr': 1.0823531618710385e-05, 'samples': 26128896, 'steps': 136087, 'loss/train': 1.4253243207931519} 11/07/2021 16:25:27 - INFO - __main__ - Step 136089: {'lr': 1.0821987110292364e-05, 'samples': 26129088, 'steps': 136088, 'loss/train': 0.8430856466293335} 11/07/2021 16:25:28 - INFO - __main__ - Step 136090: {'lr': 1.0820442709644273e-05, 'samples': 26129280, 'steps': 136089, 'loss/train': 1.2049833536148071} 11/07/2021 16:25:28 - INFO - __main__ - Step 136091: {'lr': 1.0818898416766809e-05, 'samples': 26129472, 'steps': 136090, 'loss/train': 1.355657935142517} 11/07/2021 16:25:29 - INFO - __main__ - Step 136092: {'lr': 1.0817354231660636e-05, 'samples': 26129664, 'steps': 136091, 'loss/train': 1.5379618406295776} 11/07/2021 16:25:29 - INFO - __main__ - Step 136093: {'lr': 1.0815810154326506e-05, 'samples': 26129856, 'steps': 136092, 'loss/train': 1.5092211961746216} 11/07/2021 16:25:30 - INFO - __main__ - Step 136094: {'lr': 1.0814266184765053e-05, 'samples': 26130048, 'steps': 136093, 'loss/train': 1.4530023336410522} 11/07/2021 16:25:30 - INFO - __main__ - Step 136095: {'lr': 1.0812722322977003e-05, 'samples': 26130240, 'steps': 136094, 'loss/train': 0.8591222763061523} 11/07/2021 16:25:31 - INFO - __main__ - Step 136096: {'lr': 1.0811178568963077e-05, 'samples': 26130432, 'steps': 136095, 'loss/train': 0.9937345385551453} 11/07/2021 16:25:31 - INFO - __main__ - Step 136097: {'lr': 1.080963492272391e-05, 'samples': 26130624, 'steps': 136096, 'loss/train': 2.887129783630371} 11/07/2021 16:25:31 - INFO - __main__ - Step 136098: {'lr': 1.0808091384260226e-05, 'samples': 26130816, 'steps': 136097, 'loss/train': 1.3678991794586182} 11/07/2021 16:25:33 - INFO - __main__ - Step 136099: {'lr': 1.0806547953572749e-05, 'samples': 26131008, 'steps': 136098, 'loss/train': 0.964330792427063} 11/07/2021 16:25:33 - INFO - __main__ - Step 136100: {'lr': 1.080500463066214e-05, 'samples': 26131200, 'steps': 136099, 'loss/train': 1.4159467220306396} 11/07/2021 16:25:33 - INFO - __main__ - Step 136101: {'lr': 1.0803461415529098e-05, 'samples': 26131392, 'steps': 136100, 'loss/train': 1.6375794410705566} 11/07/2021 16:25:34 - INFO - __main__ - Step 136102: {'lr': 1.0801918308174313e-05, 'samples': 26131584, 'steps': 136101, 'loss/train': 0.44296833872795105} 11/07/2021 16:25:34 - INFO - __main__ - Step 136103: {'lr': 1.080037530859851e-05, 'samples': 26131776, 'steps': 136102, 'loss/train': 1.6647268533706665} 11/07/2021 16:25:34 - INFO - __main__ - Step 136104: {'lr': 1.0798832416802379e-05, 'samples': 26131968, 'steps': 136103, 'loss/train': 0.9100825190544128} 11/07/2021 16:25:35 - INFO - __main__ - Step 136105: {'lr': 1.0797289632786589e-05, 'samples': 26132160, 'steps': 136104, 'loss/train': 1.3305031061172485} 11/07/2021 16:25:36 - INFO - __main__ - Step 136106: {'lr': 1.0795746956551888e-05, 'samples': 26132352, 'steps': 136105, 'loss/train': 0.8402105569839478} 11/07/2021 16:25:36 - INFO - __main__ - Step 136107: {'lr': 1.0794204388098889e-05, 'samples': 26132544, 'steps': 136106, 'loss/train': 1.1993772983551025} 11/07/2021 16:25:37 - INFO - __main__ - Step 136108: {'lr': 1.0792661927428338e-05, 'samples': 26132736, 'steps': 136107, 'loss/train': 1.2818595170974731} 11/07/2021 16:25:37 - INFO - __main__ - Step 136109: {'lr': 1.0791119574540903e-05, 'samples': 26132928, 'steps': 136108, 'loss/train': 1.277653455734253} 11/07/2021 16:25:37 - INFO - __main__ - Step 136110: {'lr': 1.0789577329437334e-05, 'samples': 26133120, 'steps': 136109, 'loss/train': 1.455833911895752} 11/07/2021 16:25:38 - INFO - __main__ - Step 136111: {'lr': 1.0788035192118267e-05, 'samples': 26133312, 'steps': 136110, 'loss/train': 1.1427061557769775} 11/07/2021 16:25:39 - INFO - __main__ - Step 136112: {'lr': 1.0786493162584427e-05, 'samples': 26133504, 'steps': 136111, 'loss/train': 1.0897347927093506} 11/07/2021 16:25:39 - INFO - __main__ - Step 136113: {'lr': 1.0784951240836505e-05, 'samples': 26133696, 'steps': 136112, 'loss/train': 1.183571696281433} 11/07/2021 16:25:39 - INFO - __main__ - Step 136114: {'lr': 1.0783409426875168e-05, 'samples': 26133888, 'steps': 136113, 'loss/train': 1.2405847311019897} 11/07/2021 16:25:40 - INFO - __main__ - Step 136115: {'lr': 1.0781867720701167e-05, 'samples': 26134080, 'steps': 136114, 'loss/train': 1.320215106010437} 11/07/2021 16:25:41 - INFO - __main__ - Step 136116: {'lr': 1.0780326122315164e-05, 'samples': 26134272, 'steps': 136115, 'loss/train': 1.2631921768188477} 11/07/2021 16:25:41 - INFO - __main__ - Step 136117: {'lr': 1.077878463171783e-05, 'samples': 26134464, 'steps': 136116, 'loss/train': 1.6508569717407227} 11/07/2021 16:25:42 - INFO - __main__ - Step 136118: {'lr': 1.0777243248909913e-05, 'samples': 26134656, 'steps': 136117, 'loss/train': 1.736167311668396} 11/07/2021 16:25:42 - INFO - __main__ - Step 136119: {'lr': 1.0775701973892049e-05, 'samples': 26134848, 'steps': 136118, 'loss/train': 1.452592134475708} 11/07/2021 16:25:42 - INFO - __main__ - Step 136120: {'lr': 1.0774160806665017e-05, 'samples': 26135040, 'steps': 136119, 'loss/train': 1.5699893236160278} 11/07/2021 16:25:43 - INFO - __main__ - Step 136121: {'lr': 1.07726197472294e-05, 'samples': 26135232, 'steps': 136120, 'loss/train': 2.255418062210083} 11/07/2021 16:25:44 - INFO - __main__ - Step 136122: {'lr': 1.0771078795585976e-05, 'samples': 26135424, 'steps': 136121, 'loss/train': 1.1501095294952393} 11/07/2021 16:25:44 - INFO - __main__ - Step 136123: {'lr': 1.0769537951735409e-05, 'samples': 26135616, 'steps': 136122, 'loss/train': 1.1700242757797241} 11/07/2021 16:25:44 - INFO - __main__ - Step 136124: {'lr': 1.0767997215678393e-05, 'samples': 26135808, 'steps': 136123, 'loss/train': 0.7596436738967896} 11/07/2021 16:25:45 - INFO - __main__ - Step 136125: {'lr': 1.0766456587415623e-05, 'samples': 26136000, 'steps': 136124, 'loss/train': 5.691653251647949} 11/07/2021 16:25:45 - INFO - __main__ - Step 136126: {'lr': 1.0764916066947795e-05, 'samples': 26136192, 'steps': 136125, 'loss/train': 1.384940266609192} 11/07/2021 16:25:46 - INFO - __main__ - Step 136127: {'lr': 1.0763375654275598e-05, 'samples': 26136384, 'steps': 136126, 'loss/train': 1.2869164943695068} 11/07/2021 16:25:47 - INFO - __main__ - Step 136128: {'lr': 1.0761835349399729e-05, 'samples': 26136576, 'steps': 136127, 'loss/train': 1.5672709941864014} 11/07/2021 16:25:47 - INFO - __main__ - Step 136129: {'lr': 1.0760295152320909e-05, 'samples': 26136768, 'steps': 136128, 'loss/train': 1.2668293714523315} 11/07/2021 16:25:47 - INFO - __main__ - Step 136130: {'lr': 1.0758755063039776e-05, 'samples': 26136960, 'steps': 136129, 'loss/train': 1.4684898853302002} 11/07/2021 16:25:48 - INFO - __main__ - Step 136131: {'lr': 1.0757215081557081e-05, 'samples': 26137152, 'steps': 136130, 'loss/train': 1.5081318616867065} 11/07/2021 16:25:49 - INFO - __main__ - Step 136132: {'lr': 1.0755675207873489e-05, 'samples': 26137344, 'steps': 136131, 'loss/train': 1.1513007879257202} 11/07/2021 16:25:49 - INFO - __main__ - Step 136133: {'lr': 1.075413544198972e-05, 'samples': 26137536, 'steps': 136132, 'loss/train': 1.579275131225586} 11/07/2021 16:25:49 - INFO - __main__ - Step 136134: {'lr': 1.0752595783906444e-05, 'samples': 26137728, 'steps': 136133, 'loss/train': 1.4669722318649292} 11/07/2021 16:25:50 - INFO - __main__ - Step 136135: {'lr': 1.0751056233624324e-05, 'samples': 26137920, 'steps': 136134, 'loss/train': 1.3990440368652344} 11/07/2021 16:25:50 - INFO - __main__ - Step 136136: {'lr': 1.0749516791144082e-05, 'samples': 26138112, 'steps': 136135, 'loss/train': 0.5798874497413635} 11/07/2021 16:25:51 - INFO - __main__ - Step 136137: {'lr': 1.074797745646644e-05, 'samples': 26138304, 'steps': 136136, 'loss/train': 1.308834433555603} 11/07/2021 16:25:52 - INFO - __main__ - Step 136138: {'lr': 1.0746438229592065e-05, 'samples': 26138496, 'steps': 136137, 'loss/train': 1.325027585029602} 11/07/2021 16:25:52 - INFO - __main__ - Step 136139: {'lr': 1.074489911052165e-05, 'samples': 26138688, 'steps': 136138, 'loss/train': 1.6891610622406006} 11/07/2021 16:25:52 - INFO - __main__ - Step 136140: {'lr': 1.0743360099255888e-05, 'samples': 26138880, 'steps': 136139, 'loss/train': 1.1681369543075562} 11/07/2021 16:25:53 - INFO - __main__ - Step 136141: {'lr': 1.0741821195795476e-05, 'samples': 26139072, 'steps': 136140, 'loss/train': 0.8251140713691711} 11/07/2021 16:25:53 - INFO - __main__ - Step 136142: {'lr': 1.0740282400141106e-05, 'samples': 26139264, 'steps': 136141, 'loss/train': 1.33977472782135} 11/07/2021 16:25:54 - INFO - __main__ - Step 136143: {'lr': 1.073874371229347e-05, 'samples': 26139456, 'steps': 136142, 'loss/train': 1.6290712356567383} 11/07/2021 16:25:54 - INFO - __main__ - Step 136144: {'lr': 1.0737205132253264e-05, 'samples': 26139648, 'steps': 136143, 'loss/train': 1.4815956354141235} 11/07/2021 16:25:55 - INFO - __main__ - Step 136145: {'lr': 1.0735666660021181e-05, 'samples': 26139840, 'steps': 136144, 'loss/train': 1.0818578004837036} 11/07/2021 16:25:55 - INFO - __main__ - Step 136146: {'lr': 1.0734128295597916e-05, 'samples': 26140032, 'steps': 136145, 'loss/train': 1.4058430194854736} 11/07/2021 16:25:55 - INFO - __main__ - Step 136147: {'lr': 1.073259003898419e-05, 'samples': 26140224, 'steps': 136146, 'loss/train': 1.3755768537521362} 11/07/2021 16:25:57 - INFO - __main__ - Step 136148: {'lr': 1.0731051890180644e-05, 'samples': 26140416, 'steps': 136147, 'loss/train': 0.9939178228378296} 11/07/2021 16:25:57 - INFO - __main__ - Step 136149: {'lr': 1.0729513849187994e-05, 'samples': 26140608, 'steps': 136148, 'loss/train': 1.3507367372512817} 11/07/2021 16:25:57 - INFO - __main__ - Step 136150: {'lr': 1.0727975916006938e-05, 'samples': 26140800, 'steps': 136149, 'loss/train': 1.4445792436599731} 11/07/2021 16:25:58 - INFO - __main__ - Step 136151: {'lr': 1.0726438090638141e-05, 'samples': 26140992, 'steps': 136150, 'loss/train': 2.1677463054656982} 11/07/2021 16:25:58 - INFO - __main__ - Step 136152: {'lr': 1.0724900373082324e-05, 'samples': 26141184, 'steps': 136151, 'loss/train': 0.8665309548377991} 11/07/2021 16:25:58 - INFO - __main__ - Step 136153: {'lr': 1.0723362763340184e-05, 'samples': 26141376, 'steps': 136152, 'loss/train': 1.0096826553344727} 11/07/2021 16:25:59 - INFO - __main__ - Step 136154: {'lr': 1.072182526141241e-05, 'samples': 26141568, 'steps': 136153, 'loss/train': 1.279800534248352} 11/07/2021 16:26:00 - INFO - __main__ - Step 136155: {'lr': 1.0720287867299699e-05, 'samples': 26141760, 'steps': 136154, 'loss/train': 1.0575209856033325} 11/07/2021 16:26:00 - INFO - __main__ - Step 136156: {'lr': 1.0718750581002717e-05, 'samples': 26141952, 'steps': 136155, 'loss/train': 1.5306342840194702} 11/07/2021 16:26:00 - INFO - __main__ - Step 136157: {'lr': 1.0717213402522157e-05, 'samples': 26142144, 'steps': 136156, 'loss/train': 1.3769912719726562} 11/07/2021 16:26:01 - INFO - __main__ - Step 136158: {'lr': 1.0715676331858743e-05, 'samples': 26142336, 'steps': 136157, 'loss/train': 1.043434977531433} 11/07/2021 16:26:02 - INFO - __main__ - Step 136159: {'lr': 1.0714139369013164e-05, 'samples': 26142528, 'steps': 136158, 'loss/train': 1.6550281047821045} 11/07/2021 16:26:02 - INFO - __main__ - Step 136160: {'lr': 1.071260251398612e-05, 'samples': 26142720, 'steps': 136159, 'loss/train': 1.1495712995529175} 11/07/2021 16:26:03 - INFO - __main__ - Step 136161: {'lr': 1.0711065766778272e-05, 'samples': 26142912, 'steps': 136160, 'loss/train': 1.3864212036132812} 11/07/2021 16:26:03 - INFO - __main__ - Step 136162: {'lr': 1.0709529127390315e-05, 'samples': 26143104, 'steps': 136161, 'loss/train': 1.1773700714111328} 11/07/2021 16:26:04 - INFO - __main__ - Step 136163: {'lr': 1.0707992595822946e-05, 'samples': 26143296, 'steps': 136162, 'loss/train': 1.2894086837768555} 11/07/2021 16:26:05 - INFO - __main__ - Step 136164: {'lr': 1.0706456172076855e-05, 'samples': 26143488, 'steps': 136163, 'loss/train': 1.0154048204421997} 11/07/2021 16:26:05 - INFO - __main__ - Step 136165: {'lr': 1.070491985615274e-05, 'samples': 26143680, 'steps': 136164, 'loss/train': 1.1880784034729004} 11/07/2021 16:26:05 - INFO - __main__ - Step 136166: {'lr': 1.0703383648051318e-05, 'samples': 26143872, 'steps': 136165, 'loss/train': 1.534030795097351} 11/07/2021 16:26:06 - INFO - __main__ - Step 136167: {'lr': 1.0701847547773258e-05, 'samples': 26144064, 'steps': 136166, 'loss/train': 1.016560673713684} 11/07/2021 16:26:06 - INFO - __main__ - Step 136168: {'lr': 1.0700311555319225e-05, 'samples': 26144256, 'steps': 136167, 'loss/train': 1.3538193702697754} 11/07/2021 16:26:06 - INFO - __main__ - Step 136169: {'lr': 1.069877567068997e-05, 'samples': 26144448, 'steps': 136168, 'loss/train': 1.5881595611572266} 11/07/2021 16:26:07 - INFO - __main__ - Step 136170: {'lr': 1.0697239893886129e-05, 'samples': 26144640, 'steps': 136169, 'loss/train': 1.106368899345398} 11/07/2021 16:26:08 - INFO - __main__ - Step 136171: {'lr': 1.0695704224908453e-05, 'samples': 26144832, 'steps': 136170, 'loss/train': 1.1540794372558594} 11/07/2021 16:26:08 - INFO - __main__ - Step 136172: {'lr': 1.0694168663757609e-05, 'samples': 26145024, 'steps': 136171, 'loss/train': 1.5928759574890137} 11/07/2021 16:26:08 - INFO - __main__ - Step 136173: {'lr': 1.0692633210434233e-05, 'samples': 26145216, 'steps': 136172, 'loss/train': 1.1107325553894043} 11/07/2021 16:26:09 - INFO - __main__ - Step 136174: {'lr': 1.0691097864939075e-05, 'samples': 26145408, 'steps': 136173, 'loss/train': 1.4258977174758911} 11/07/2021 16:26:10 - INFO - __main__ - Step 136175: {'lr': 1.068956262727283e-05, 'samples': 26145600, 'steps': 136174, 'loss/train': 1.3975188732147217} 11/07/2021 16:26:11 - INFO - __main__ - Step 136176: {'lr': 1.0688027497436165e-05, 'samples': 26145792, 'steps': 136175, 'loss/train': 1.3202695846557617} 11/07/2021 16:26:11 - INFO - __main__ - Step 136177: {'lr': 1.0686492475429798e-05, 'samples': 26145984, 'steps': 136176, 'loss/train': 1.619093656539917} 11/07/2021 16:26:11 - INFO - __main__ - Step 136178: {'lr': 1.068495756125437e-05, 'samples': 26146176, 'steps': 136177, 'loss/train': 1.0415385961532593} 11/07/2021 16:26:12 - INFO - __main__ - Step 136179: {'lr': 1.0683422754910632e-05, 'samples': 26146368, 'steps': 136178, 'loss/train': 1.7271116971969604} 11/07/2021 16:26:12 - INFO - __main__ - Step 136180: {'lr': 1.0681888056399247e-05, 'samples': 26146560, 'steps': 136179, 'loss/train': 2.4212467670440674} 11/07/2021 16:26:13 - INFO - __main__ - Step 136181: {'lr': 1.068035346572091e-05, 'samples': 26146752, 'steps': 136180, 'loss/train': 1.425045132637024} 11/07/2021 16:26:13 - INFO - __main__ - Step 136182: {'lr': 1.0678818982876315e-05, 'samples': 26146944, 'steps': 136181, 'loss/train': 0.8607906699180603} 11/07/2021 16:26:14 - INFO - __main__ - Step 136183: {'lr': 1.0677284607866183e-05, 'samples': 26147136, 'steps': 136182, 'loss/train': 1.3696939945220947} 11/07/2021 16:26:14 - INFO - __main__ - Step 136184: {'lr': 1.0675750340691126e-05, 'samples': 26147328, 'steps': 136183, 'loss/train': 1.2124711275100708} 11/07/2021 16:26:14 - INFO - __main__ - Step 136185: {'lr': 1.0674216181351893e-05, 'samples': 26147520, 'steps': 136184, 'loss/train': 1.6211090087890625} 11/07/2021 16:26:15 - INFO - __main__ - Step 136186: {'lr': 1.0672682129849176e-05, 'samples': 26147712, 'steps': 136185, 'loss/train': 1.1135327816009521} 11/07/2021 16:26:16 - INFO - __main__ - Step 136187: {'lr': 1.0671148186183644e-05, 'samples': 26147904, 'steps': 136186, 'loss/train': 0.9048489332199097} 11/07/2021 16:26:16 - INFO - __main__ - Step 136188: {'lr': 1.0669614350356016e-05, 'samples': 26148096, 'steps': 136187, 'loss/train': 0.8751630783081055} 11/07/2021 16:26:16 - INFO - __main__ - Step 136189: {'lr': 1.0668080622366932e-05, 'samples': 26148288, 'steps': 136188, 'loss/train': 1.2606719732284546} 11/07/2021 16:26:17 - INFO - __main__ - Step 136190: {'lr': 1.0666547002217141e-05, 'samples': 26148480, 'steps': 136189, 'loss/train': 1.4070215225219727} 11/07/2021 16:26:18 - INFO - __main__ - Step 136191: {'lr': 1.066501348990731e-05, 'samples': 26148672, 'steps': 136190, 'loss/train': 1.2599565982818604} 11/07/2021 16:26:18 - INFO - __main__ - Step 136192: {'lr': 1.0663480085438132e-05, 'samples': 26148864, 'steps': 136191, 'loss/train': 0.6881777048110962} 11/07/2021 16:26:19 - INFO - __main__ - Step 136193: {'lr': 1.06619467888103e-05, 'samples': 26149056, 'steps': 136192, 'loss/train': 1.1122715473175049} 11/07/2021 16:26:19 - INFO - __main__ - Step 136194: {'lr': 1.0660413600024538e-05, 'samples': 26149248, 'steps': 136193, 'loss/train': 0.7695769667625427} 11/07/2021 16:26:19 - INFO - __main__ - Step 136195: {'lr': 1.0658880519081454e-05, 'samples': 26149440, 'steps': 136194, 'loss/train': 1.4309544563293457} 11/07/2021 16:26:20 - INFO - __main__ - Step 136196: {'lr': 1.0657347545981772e-05, 'samples': 26149632, 'steps': 136195, 'loss/train': 1.1954010725021362} 11/07/2021 16:26:21 - INFO - __main__ - Step 136197: {'lr': 1.0655814680726211e-05, 'samples': 26149824, 'steps': 136196, 'loss/train': 1.173592209815979} 11/07/2021 16:26:21 - INFO - __main__ - Step 136198: {'lr': 1.0654281923315468e-05, 'samples': 26150016, 'steps': 136197, 'loss/train': 1.608807921409607} 11/07/2021 16:26:21 - INFO - __main__ - Step 136199: {'lr': 1.0652749273750179e-05, 'samples': 26150208, 'steps': 136198, 'loss/train': 1.4803812503814697} 11/07/2021 16:26:22 - INFO - __main__ - Step 136200: {'lr': 1.0651216732031094e-05, 'samples': 26150400, 'steps': 136199, 'loss/train': 1.272822618484497} 11/07/2021 16:26:23 - INFO - __main__ - Step 136201: {'lr': 1.0649684298158852e-05, 'samples': 26150592, 'steps': 136200, 'loss/train': 1.145187497138977} 11/07/2021 16:26:23 - INFO - __main__ - Step 136202: {'lr': 1.0648151972134201e-05, 'samples': 26150784, 'steps': 136201, 'loss/train': 1.323333978652954} 11/07/2021 16:26:24 - INFO - __main__ - Step 136203: {'lr': 1.0646619753957781e-05, 'samples': 26150976, 'steps': 136202, 'loss/train': 1.680857539176941} 11/07/2021 16:26:24 - INFO - __main__ - Step 136204: {'lr': 1.0645087643630286e-05, 'samples': 26151168, 'steps': 136203, 'loss/train': 1.1969388723373413} 11/07/2021 16:26:24 - INFO - __main__ - Step 136205: {'lr': 1.0643555641152463e-05, 'samples': 26151360, 'steps': 136204, 'loss/train': 1.5385613441467285} 11/07/2021 16:26:25 - INFO - __main__ - Step 136206: {'lr': 1.0642023746524953e-05, 'samples': 26151552, 'steps': 136205, 'loss/train': 1.8780885934829712} 11/07/2021 16:26:26 - INFO - __main__ - Step 136207: {'lr': 1.0640491959748422e-05, 'samples': 26151744, 'steps': 136206, 'loss/train': 1.2467459440231323} 11/07/2021 16:26:26 - INFO - __main__ - Step 136208: {'lr': 1.063896028082359e-05, 'samples': 26151936, 'steps': 136207, 'loss/train': 1.1093318462371826} 11/07/2021 16:26:26 - INFO - __main__ - Step 136209: {'lr': 1.063742870975118e-05, 'samples': 26152128, 'steps': 136208, 'loss/train': 1.6625349521636963} 11/07/2021 16:26:27 - INFO - __main__ - Step 136210: {'lr': 1.0635897246531829e-05, 'samples': 26152320, 'steps': 136209, 'loss/train': 1.002406120300293} 11/07/2021 16:26:28 - INFO - __main__ - Step 136211: {'lr': 1.063436589116626e-05, 'samples': 26152512, 'steps': 136210, 'loss/train': 1.2206352949142456} 11/07/2021 16:26:29 - INFO - __main__ - Step 136212: {'lr': 1.0632834643655136e-05, 'samples': 26152704, 'steps': 136211, 'loss/train': 1.7255566120147705} 11/07/2021 16:26:29 - INFO - __main__ - Step 136213: {'lr': 1.0631303503999185e-05, 'samples': 26152896, 'steps': 136212, 'loss/train': 2.108372211456299} 11/07/2021 16:26:29 - INFO - __main__ - Step 136214: {'lr': 1.062977247219904e-05, 'samples': 26153088, 'steps': 136213, 'loss/train': 2.2971463203430176} 11/07/2021 16:26:30 - INFO - __main__ - Step 136215: {'lr': 1.062824154825548e-05, 'samples': 26153280, 'steps': 136214, 'loss/train': 1.3268036842346191} 11/07/2021 16:26:30 - INFO - __main__ - Step 136216: {'lr': 1.0626710732169115e-05, 'samples': 26153472, 'steps': 136215, 'loss/train': 2.5692217350006104} 11/07/2021 16:26:31 - INFO - __main__ - Step 136217: {'lr': 1.0625180023940668e-05, 'samples': 26153664, 'steps': 136216, 'loss/train': 1.701784372329712} 11/07/2021 16:26:32 - INFO - __main__ - Step 136218: {'lr': 1.0623649423570802e-05, 'samples': 26153856, 'steps': 136217, 'loss/train': 1.620986819267273} 11/07/2021 16:26:32 - INFO - __main__ - Step 136219: {'lr': 1.0622118931060215e-05, 'samples': 26154048, 'steps': 136218, 'loss/train': 0.5073795318603516} 11/07/2021 16:26:32 - INFO - __main__ - Step 136220: {'lr': 1.0620588546409626e-05, 'samples': 26154240, 'steps': 136219, 'loss/train': 0.5520867109298706} 11/07/2021 16:26:33 - INFO - __main__ - Step 136221: {'lr': 1.0619058269619703e-05, 'samples': 26154432, 'steps': 136220, 'loss/train': 1.3913894891738892} 11/07/2021 16:26:34 - INFO - __main__ - Step 136222: {'lr': 1.0617528100691138e-05, 'samples': 26154624, 'steps': 136221, 'loss/train': 1.808426022529602} 11/07/2021 16:26:34 - INFO - __main__ - Step 136223: {'lr': 1.0615998039624624e-05, 'samples': 26154816, 'steps': 136222, 'loss/train': 1.0848217010498047} 11/07/2021 16:26:34 - INFO - __main__ - Step 136224: {'lr': 1.0614468086420858e-05, 'samples': 26155008, 'steps': 136223, 'loss/train': 1.2932932376861572} 11/07/2021 16:26:35 - INFO - __main__ - Step 136225: {'lr': 1.0612938241080505e-05, 'samples': 26155200, 'steps': 136224, 'loss/train': 1.568148136138916} 11/07/2021 16:26:35 - INFO - __main__ - Step 136226: {'lr': 1.0611408503604259e-05, 'samples': 26155392, 'steps': 136225, 'loss/train': 1.1861835718154907} 11/07/2021 16:26:35 - INFO - __main__ - Step 136227: {'lr': 1.0609878873992867e-05, 'samples': 26155584, 'steps': 136226, 'loss/train': 1.4982870817184448} 11/07/2021 16:26:36 - INFO - __main__ - Step 136228: {'lr': 1.0608349352246915e-05, 'samples': 26155776, 'steps': 136227, 'loss/train': 1.2698994874954224} 11/07/2021 16:26:37 - INFO - __main__ - Step 136229: {'lr': 1.0606819938367179e-05, 'samples': 26155968, 'steps': 136228, 'loss/train': 1.336193323135376} 11/07/2021 16:26:37 - INFO - __main__ - Step 136230: {'lr': 1.0605290632354298e-05, 'samples': 26156160, 'steps': 136229, 'loss/train': 1.6034983396530151} 11/07/2021 16:26:38 - INFO - __main__ - Step 136231: {'lr': 1.0603761434208964e-05, 'samples': 26156352, 'steps': 136230, 'loss/train': 1.6735143661499023} 11/07/2021 16:26:38 - INFO - __main__ - Step 136232: {'lr': 1.06022323439319e-05, 'samples': 26156544, 'steps': 136231, 'loss/train': 1.1000622510910034} 11/07/2021 16:26:39 - INFO - __main__ - Step 136233: {'lr': 1.0600703361523772e-05, 'samples': 26156736, 'steps': 136232, 'loss/train': 1.289878249168396} 11/07/2021 16:26:39 - INFO - __main__ - Step 136234: {'lr': 1.0599174486985275e-05, 'samples': 26156928, 'steps': 136233, 'loss/train': 1.6291744709014893} 11/07/2021 16:26:40 - INFO - __main__ - Step 136235: {'lr': 1.0597645720317101e-05, 'samples': 26157120, 'steps': 136234, 'loss/train': 0.9670987129211426} 11/07/2021 16:26:40 - INFO - __main__ - Step 136236: {'lr': 1.0596117061519916e-05, 'samples': 26157312, 'steps': 136235, 'loss/train': 1.8093523979187012} 11/07/2021 16:26:40 - INFO - __main__ - Step 136237: {'lr': 1.0594588510594445e-05, 'samples': 26157504, 'steps': 136236, 'loss/train': 1.5177644491195679} 11/07/2021 16:26:41 - INFO - __main__ - Step 136238: {'lr': 1.059306006754135e-05, 'samples': 26157696, 'steps': 136237, 'loss/train': 1.357299566268921} 11/07/2021 16:26:42 - INFO - __main__ - Step 136239: {'lr': 1.0591531732361325e-05, 'samples': 26157888, 'steps': 136238, 'loss/train': 1.6683803796768188} 11/07/2021 16:26:42 - INFO - __main__ - Step 136240: {'lr': 1.0590003505055069e-05, 'samples': 26158080, 'steps': 136239, 'loss/train': 1.3131816387176514} 11/07/2021 16:26:42 - INFO - __main__ - Step 136241: {'lr': 1.0588475385623298e-05, 'samples': 26158272, 'steps': 136240, 'loss/train': 1.4455759525299072} 11/07/2021 16:26:43 - INFO - __main__ - Step 136242: {'lr': 1.0586947374066625e-05, 'samples': 26158464, 'steps': 136241, 'loss/train': 1.5322585105895996} 11/07/2021 16:26:44 - INFO - __main__ - Step 136243: {'lr': 1.058541947038577e-05, 'samples': 26158656, 'steps': 136242, 'loss/train': 0.8185425996780396} 11/07/2021 16:26:44 - INFO - __main__ - Step 136244: {'lr': 1.0583891674581458e-05, 'samples': 26158848, 'steps': 136243, 'loss/train': 1.2122507095336914} 11/07/2021 16:26:45 - INFO - __main__ - Step 136245: {'lr': 1.0582363986654325e-05, 'samples': 26159040, 'steps': 136244, 'loss/train': 1.1366925239562988} 11/07/2021 16:26:45 - INFO - __main__ - Step 136246: {'lr': 1.0580836406605094e-05, 'samples': 26159232, 'steps': 136245, 'loss/train': 1.2165162563323975} 11/07/2021 16:26:45 - INFO - __main__ - Step 136247: {'lr': 1.0579308934434456e-05, 'samples': 26159424, 'steps': 136246, 'loss/train': 0.96684730052948} 11/07/2021 16:26:46 - INFO - __main__ - Step 136248: {'lr': 1.057778157014308e-05, 'samples': 26159616, 'steps': 136247, 'loss/train': 1.1525229215621948} 11/07/2021 16:26:47 - INFO - __main__ - Step 136249: {'lr': 1.0576254313731632e-05, 'samples': 26159808, 'steps': 136248, 'loss/train': 1.4348716735839844} 11/07/2021 16:26:47 - INFO - __main__ - Step 136250: {'lr': 1.0574727165200859e-05, 'samples': 26160000, 'steps': 136249, 'loss/train': 1.3994691371917725} 11/07/2021 16:26:47 - INFO - __main__ - Step 136251: {'lr': 1.0573200124551401e-05, 'samples': 26160192, 'steps': 136250, 'loss/train': 1.4868687391281128} 11/07/2021 16:26:48 - INFO - __main__ - Step 136252: {'lr': 1.0571673191783982e-05, 'samples': 26160384, 'steps': 136251, 'loss/train': 1.26215660572052} 11/07/2021 16:26:48 - INFO - __main__ - Step 136253: {'lr': 1.0570146366899263e-05, 'samples': 26160576, 'steps': 136252, 'loss/train': 1.2252671718597412} 11/07/2021 16:26:49 - INFO - __main__ - Step 136254: {'lr': 1.0568619649897998e-05, 'samples': 26160768, 'steps': 136253, 'loss/train': 0.8873447179794312} 11/07/2021 16:26:49 - INFO - __main__ - Step 136255: {'lr': 1.0567093040780767e-05, 'samples': 26160960, 'steps': 136254, 'loss/train': 1.2078065872192383} 11/07/2021 16:26:50 - INFO - __main__ - Step 136256: {'lr': 1.0565566539548293e-05, 'samples': 26161152, 'steps': 136255, 'loss/train': 1.2124141454696655} 11/07/2021 16:26:50 - INFO - __main__ - Step 136257: {'lr': 1.0564040146201299e-05, 'samples': 26161344, 'steps': 136256, 'loss/train': 1.2612123489379883} 11/07/2021 16:26:51 - INFO - __main__ - Step 136258: {'lr': 1.0562513860740447e-05, 'samples': 26161536, 'steps': 136257, 'loss/train': 1.3147939443588257} 11/07/2021 16:26:52 - INFO - __main__ - Step 136259: {'lr': 1.0560987683166435e-05, 'samples': 26161728, 'steps': 136258, 'loss/train': 1.2896912097930908} 11/07/2021 16:26:52 - INFO - __main__ - Step 136260: {'lr': 1.0559461613479954e-05, 'samples': 26161920, 'steps': 136259, 'loss/train': 1.3549318313598633} 11/07/2021 16:26:53 - INFO - __main__ - Step 136261: {'lr': 1.0557935651681671e-05, 'samples': 26162112, 'steps': 136260, 'loss/train': 1.738875389099121} 11/07/2021 16:26:53 - INFO - __main__ - Step 136262: {'lr': 1.0556409797772282e-05, 'samples': 26162304, 'steps': 136261, 'loss/train': 1.3583980798721313} 11/07/2021 16:26:53 - INFO - __main__ - Step 136263: {'lr': 1.0554884051752506e-05, 'samples': 26162496, 'steps': 136262, 'loss/train': 1.6071571111679077} 11/07/2021 16:26:54 - INFO - __main__ - Step 136264: {'lr': 1.0553358413622983e-05, 'samples': 26162688, 'steps': 136263, 'loss/train': 1.2745604515075684} 11/07/2021 16:26:55 - INFO - __main__ - Step 136265: {'lr': 1.0551832883384432e-05, 'samples': 26162880, 'steps': 136264, 'loss/train': 1.4292924404144287} 11/07/2021 16:26:55 - INFO - __main__ - Step 136266: {'lr': 1.0550307461037523e-05, 'samples': 26163072, 'steps': 136265, 'loss/train': 0.649205207824707} 11/07/2021 16:26:55 - INFO - __main__ - Step 136267: {'lr': 1.0548782146582947e-05, 'samples': 26163264, 'steps': 136266, 'loss/train': 1.252924919128418} 11/07/2021 16:26:56 - INFO - __main__ - Step 136268: {'lr': 1.0547256940021426e-05, 'samples': 26163456, 'steps': 136267, 'loss/train': 1.2735660076141357} 11/07/2021 16:26:57 - INFO - __main__ - Step 136269: {'lr': 1.0545731841353601e-05, 'samples': 26163648, 'steps': 136268, 'loss/train': 1.1374421119689941} 11/07/2021 16:26:57 - INFO - __main__ - Step 136270: {'lr': 1.0544206850580163e-05, 'samples': 26163840, 'steps': 136269, 'loss/train': 0.9407646656036377} 11/07/2021 16:26:58 - INFO - __main__ - Step 136271: {'lr': 1.0542681967701806e-05, 'samples': 26164032, 'steps': 136270, 'loss/train': 1.1890285015106201} 11/07/2021 16:26:58 - INFO - __main__ - Step 136272: {'lr': 1.0541157192719253e-05, 'samples': 26164224, 'steps': 136271, 'loss/train': 1.296242594718933} 11/07/2021 16:26:58 - INFO - __main__ - Step 136273: {'lr': 1.0539632525633113e-05, 'samples': 26164416, 'steps': 136272, 'loss/train': 1.3055766820907593} 11/07/2021 16:26:59 - INFO - __main__ - Step 136274: {'lr': 1.0538107966444138e-05, 'samples': 26164608, 'steps': 136273, 'loss/train': 0.8535577654838562} 11/07/2021 16:27:00 - INFO - __main__ - Step 136275: {'lr': 1.053658351515302e-05, 'samples': 26164800, 'steps': 136274, 'loss/train': 1.3155916929244995} 11/07/2021 16:27:00 - INFO - __main__ - Step 136276: {'lr': 1.0535059171760397e-05, 'samples': 26164992, 'steps': 136275, 'loss/train': 1.5811052322387695} 11/07/2021 16:27:00 - INFO - __main__ - Step 136277: {'lr': 1.0533534936266992e-05, 'samples': 26165184, 'steps': 136276, 'loss/train': 0.4198194146156311} 11/07/2021 16:27:01 - INFO - __main__ - Step 136278: {'lr': 1.053201080867347e-05, 'samples': 26165376, 'steps': 136277, 'loss/train': 1.3769639730453491} 11/07/2021 16:27:01 - INFO - __main__ - Step 136279: {'lr': 1.0530486788980526e-05, 'samples': 26165568, 'steps': 136278, 'loss/train': 1.6759142875671387} 11/07/2021 16:27:02 - INFO - __main__ - Step 136280: {'lr': 1.0528962877188853e-05, 'samples': 26165760, 'steps': 136279, 'loss/train': 1.1940340995788574} 11/07/2021 16:27:03 - INFO - __main__ - Step 136281: {'lr': 1.0527439073299172e-05, 'samples': 26165952, 'steps': 136280, 'loss/train': 1.5516297817230225} 11/07/2021 16:27:03 - INFO - __main__ - Step 136282: {'lr': 1.0525915377312095e-05, 'samples': 26166144, 'steps': 136281, 'loss/train': 1.6396411657333374} 11/07/2021 16:27:03 - INFO - __main__ - Step 136283: {'lr': 1.0524391789228343e-05, 'samples': 26166336, 'steps': 136282, 'loss/train': 1.3159213066101074} 11/07/2021 16:27:04 - INFO - __main__ - Step 136284: {'lr': 1.0522868309048612e-05, 'samples': 26166528, 'steps': 136283, 'loss/train': 2.0329666137695312} 11/07/2021 16:27:05 - INFO - __main__ - Step 136285: {'lr': 1.0521344936773592e-05, 'samples': 26166720, 'steps': 136284, 'loss/train': 1.076885461807251} 11/07/2021 16:27:05 - INFO - __main__ - Step 136286: {'lr': 1.051982167240395e-05, 'samples': 26166912, 'steps': 136285, 'loss/train': 0.9302129149436951} 11/07/2021 16:27:06 - INFO - __main__ - Step 136287: {'lr': 1.0518298515940355e-05, 'samples': 26167104, 'steps': 136286, 'loss/train': 1.193410038948059} 11/07/2021 16:27:06 - INFO - __main__ - Step 136288: {'lr': 1.0516775467383555e-05, 'samples': 26167296, 'steps': 136287, 'loss/train': 1.3324081897735596} 11/07/2021 16:27:06 - INFO - __main__ - Step 136289: {'lr': 1.0515252526734187e-05, 'samples': 26167488, 'steps': 136288, 'loss/train': 1.471845030784607} 11/07/2021 16:27:08 - INFO - __main__ - Step 136290: {'lr': 1.0513729693992947e-05, 'samples': 26167680, 'steps': 136289, 'loss/train': 0.959330677986145} 11/07/2021 16:27:08 - INFO - __main__ - Step 136291: {'lr': 1.0512206969160526e-05, 'samples': 26167872, 'steps': 136290, 'loss/train': 1.2500994205474854} 11/07/2021 16:27:08 - INFO - __main__ - Step 136292: {'lr': 1.051068435223762e-05, 'samples': 26168064, 'steps': 136291, 'loss/train': 1.6880651712417603} 11/07/2021 16:27:09 - INFO - __main__ - Step 136293: {'lr': 1.0509161843224896e-05, 'samples': 26168256, 'steps': 136292, 'loss/train': 0.24971887469291687} 11/07/2021 16:27:09 - INFO - __main__ - Step 136294: {'lr': 1.0507639442123046e-05, 'samples': 26168448, 'steps': 136293, 'loss/train': 1.5604828596115112} 11/07/2021 16:27:10 - INFO - __main__ - Step 136295: {'lr': 1.0506117148932793e-05, 'samples': 26168640, 'steps': 136294, 'loss/train': 1.3732887506484985} 11/07/2021 16:27:11 - INFO - __main__ - Step 136296: {'lr': 1.0504594963654745e-05, 'samples': 26168832, 'steps': 136295, 'loss/train': 1.3518483638763428} 11/07/2021 16:27:11 - INFO - __main__ - Step 136297: {'lr': 1.0503072886289627e-05, 'samples': 26169024, 'steps': 136296, 'loss/train': 1.3341917991638184} 11/07/2021 16:27:11 - INFO - __main__ - Step 136298: {'lr': 1.050155091683816e-05, 'samples': 26169216, 'steps': 136297, 'loss/train': 0.7795333862304688} 11/07/2021 16:27:12 - INFO - __main__ - Step 136299: {'lr': 1.050002905530098e-05, 'samples': 26169408, 'steps': 136298, 'loss/train': 1.235713243484497} 11/07/2021 16:27:13 - INFO - __main__ - Step 136300: {'lr': 1.0498507301678784e-05, 'samples': 26169600, 'steps': 136299, 'loss/train': 0.9178443551063538} 11/07/2021 16:27:13 - INFO - __main__ - Step 136301: {'lr': 1.0496985655972264e-05, 'samples': 26169792, 'steps': 136300, 'loss/train': 0.9984259605407715} 11/07/2021 16:27:13 - INFO - __main__ - Step 136302: {'lr': 1.0495464118182112e-05, 'samples': 26169984, 'steps': 136301, 'loss/train': 0.8039814829826355} 11/07/2021 16:27:14 - INFO - __main__ - Step 136303: {'lr': 1.0493942688309027e-05, 'samples': 26170176, 'steps': 136302, 'loss/train': 1.435092806816101} 11/07/2021 16:27:14 - INFO - __main__ - Step 136304: {'lr': 1.0492421366353644e-05, 'samples': 26170368, 'steps': 136303, 'loss/train': 1.1135761737823486} 11/07/2021 16:27:15 - INFO - __main__ - Step 136305: {'lr': 1.0490900152316712e-05, 'samples': 26170560, 'steps': 136304, 'loss/train': 0.767559826374054} 11/07/2021 16:27:16 - INFO - __main__ - Step 136306: {'lr': 1.0489379046198872e-05, 'samples': 26170752, 'steps': 136305, 'loss/train': 1.2043946981430054} 11/07/2021 16:27:17 - INFO - __main__ - Step 136307: {'lr': 1.0487858048000815e-05, 'samples': 26170944, 'steps': 136306, 'loss/train': 1.5368531942367554} 11/07/2021 16:27:17 - INFO - __main__ - Step 136308: {'lr': 1.0486337157723264e-05, 'samples': 26171136, 'steps': 136307, 'loss/train': 5.671844959259033} 11/07/2021 16:27:17 - INFO - __main__ - Step 136309: {'lr': 1.0484816375366829e-05, 'samples': 26171328, 'steps': 136308, 'loss/train': 1.429598093032837} 11/07/2021 16:27:18 - INFO - __main__ - Step 136310: {'lr': 1.048329570093226e-05, 'samples': 26171520, 'steps': 136309, 'loss/train': 1.226805329322815} 11/07/2021 16:27:18 - INFO - __main__ - Step 136311: {'lr': 1.0481775134420224e-05, 'samples': 26171712, 'steps': 136310, 'loss/train': 1.6078777313232422} 11/07/2021 16:27:18 - INFO - __main__ - Step 136312: {'lr': 1.0480254675831413e-05, 'samples': 26171904, 'steps': 136311, 'loss/train': 1.2102015018463135} 11/07/2021 16:27:19 - INFO - __main__ - Step 136313: {'lr': 1.0478734325166467e-05, 'samples': 26172096, 'steps': 136312, 'loss/train': 1.4377015829086304} 11/07/2021 16:27:20 - INFO - __main__ - Step 136314: {'lr': 1.0477214082426135e-05, 'samples': 26172288, 'steps': 136313, 'loss/train': 1.017520546913147} 11/07/2021 16:27:20 - INFO - __main__ - Step 136315: {'lr': 1.047569394761108e-05, 'samples': 26172480, 'steps': 136314, 'loss/train': 1.46040678024292} 11/07/2021 16:27:21 - INFO - __main__ - Step 136316: {'lr': 1.0474173920721974e-05, 'samples': 26172672, 'steps': 136315, 'loss/train': 1.488016128540039} 11/07/2021 16:27:21 - INFO - __main__ - Step 136317: {'lr': 1.047265400175948e-05, 'samples': 26172864, 'steps': 136316, 'loss/train': 1.0785170793533325} 11/07/2021 16:27:21 - INFO - __main__ - Step 136318: {'lr': 1.0471134190724345e-05, 'samples': 26173056, 'steps': 136317, 'loss/train': 1.549801230430603} 11/07/2021 16:27:22 - INFO - __main__ - Step 136319: {'lr': 1.0469614487617212e-05, 'samples': 26173248, 'steps': 136318, 'loss/train': 1.4696403741836548} 11/07/2021 16:27:23 - INFO - __main__ - Step 136320: {'lr': 1.0468094892438773e-05, 'samples': 26173440, 'steps': 136319, 'loss/train': 1.890316128730774} 11/07/2021 16:27:23 - INFO - __main__ - Step 136321: {'lr': 1.046657540518975e-05, 'samples': 26173632, 'steps': 136320, 'loss/train': 1.22859787940979} 11/07/2021 16:27:24 - INFO - __main__ - Step 136322: {'lr': 1.046505602587075e-05, 'samples': 26173824, 'steps': 136321, 'loss/train': 1.030562400817871} 11/07/2021 16:27:24 - INFO - __main__ - Step 136323: {'lr': 1.0463536754482528e-05, 'samples': 26174016, 'steps': 136322, 'loss/train': 1.286825180053711} 11/07/2021 16:27:24 - INFO - __main__ - Step 136324: {'lr': 1.0462017591025718e-05, 'samples': 26174208, 'steps': 136323, 'loss/train': 1.7234069108963013} 11/07/2021 16:27:25 - INFO - __main__ - Step 136325: {'lr': 1.0460498535501018e-05, 'samples': 26174400, 'steps': 136324, 'loss/train': 1.5693635940551758} 11/07/2021 16:27:26 - INFO - __main__ - Step 136326: {'lr': 1.0458979587909145e-05, 'samples': 26174592, 'steps': 136325, 'loss/train': 3.19270920753479} 11/07/2021 16:27:26 - INFO - __main__ - Step 136327: {'lr': 1.0457460748250742e-05, 'samples': 26174784, 'steps': 136326, 'loss/train': 1.3351162672042847} 11/07/2021 16:27:26 - INFO - __main__ - Step 136328: {'lr': 1.0455942016526498e-05, 'samples': 26174976, 'steps': 136327, 'loss/train': 1.6355170011520386} 11/07/2021 16:27:27 - INFO - __main__ - Step 136329: {'lr': 1.0454423392737138e-05, 'samples': 26175168, 'steps': 136328, 'loss/train': 1.2458653450012207} 11/07/2021 16:27:27 - INFO - __main__ - Step 136330: {'lr': 1.0452904876883302e-05, 'samples': 26175360, 'steps': 136329, 'loss/train': 0.47025230526924133} 11/07/2021 16:27:28 - INFO - __main__ - Step 136331: {'lr': 1.0451386468965707e-05, 'samples': 26175552, 'steps': 136330, 'loss/train': 1.6610264778137207} 11/07/2021 16:27:28 - INFO - __main__ - Step 136332: {'lr': 1.0449868168984995e-05, 'samples': 26175744, 'steps': 136331, 'loss/train': 1.320154070854187} 11/07/2021 16:27:29 - INFO - __main__ - Step 136333: {'lr': 1.0448349976941913e-05, 'samples': 26175936, 'steps': 136332, 'loss/train': 1.027477502822876} 11/07/2021 16:27:29 - INFO - __main__ - Step 136334: {'lr': 1.0446831892837072e-05, 'samples': 26176128, 'steps': 136333, 'loss/train': 1.150498867034912} 11/07/2021 16:27:29 - INFO - __main__ - Step 136335: {'lr': 1.0445313916671223e-05, 'samples': 26176320, 'steps': 136334, 'loss/train': 1.2063486576080322} 11/07/2021 16:27:30 - INFO - __main__ - Step 136336: {'lr': 1.0443796048445003e-05, 'samples': 26176512, 'steps': 136335, 'loss/train': 1.0244828462600708} 11/07/2021 16:27:31 - INFO - __main__ - Step 136337: {'lr': 1.0442278288159135e-05, 'samples': 26176704, 'steps': 136336, 'loss/train': 1.272934079170227} 11/07/2021 16:27:31 - INFO - __main__ - Step 136338: {'lr': 1.0440760635814256e-05, 'samples': 26176896, 'steps': 136337, 'loss/train': 1.348644733428955} 11/07/2021 16:27:31 - INFO - __main__ - Step 136339: {'lr': 1.043924309141106e-05, 'samples': 26177088, 'steps': 136338, 'loss/train': 0.9453842043876648} 11/07/2021 16:27:32 - INFO - __main__ - Step 136340: {'lr': 1.043772565495027e-05, 'samples': 26177280, 'steps': 136339, 'loss/train': 1.6203144788742065} 11/07/2021 16:27:33 - INFO - __main__ - Step 136341: {'lr': 1.0436208326432522e-05, 'samples': 26177472, 'steps': 136340, 'loss/train': 1.4297926425933838} 11/07/2021 16:27:33 - INFO - __main__ - Step 136342: {'lr': 1.043469110585854e-05, 'samples': 26177664, 'steps': 136341, 'loss/train': 1.0545482635498047} 11/07/2021 16:27:34 - INFO - __main__ - Step 136343: {'lr': 1.0433173993228962e-05, 'samples': 26177856, 'steps': 136342, 'loss/train': 1.1904797554016113} 11/07/2021 16:27:34 - INFO - __main__ - Step 136344: {'lr': 1.0431656988544536e-05, 'samples': 26178048, 'steps': 136343, 'loss/train': 0.9136726260185242} 11/07/2021 16:27:34 - INFO - __main__ - Step 136345: {'lr': 1.0430140091805872e-05, 'samples': 26178240, 'steps': 136344, 'loss/train': 1.355919361114502} 11/07/2021 16:27:35 - INFO - __main__ - Step 136346: {'lr': 1.0428623303013723e-05, 'samples': 26178432, 'steps': 136345, 'loss/train': 1.4048435688018799} 11/07/2021 16:27:36 - INFO - __main__ - Step 136347: {'lr': 1.0427106622168726e-05, 'samples': 26178624, 'steps': 136346, 'loss/train': 1.3307102918624878} 11/07/2021 16:27:36 - INFO - __main__ - Step 136348: {'lr': 1.04255900492716e-05, 'samples': 26178816, 'steps': 136347, 'loss/train': 1.589573621749878} 11/07/2021 16:27:36 - INFO - __main__ - Step 136349: {'lr': 1.0424073584322985e-05, 'samples': 26179008, 'steps': 136348, 'loss/train': 1.2770488262176514} 11/07/2021 16:27:37 - INFO - __main__ - Step 136350: {'lr': 1.0422557227323575e-05, 'samples': 26179200, 'steps': 136349, 'loss/train': 1.587385654449463} 11/07/2021 16:27:37 - INFO - __main__ - Step 136351: {'lr': 1.0421040978274065e-05, 'samples': 26179392, 'steps': 136350, 'loss/train': 1.8441518545150757} 11/07/2021 16:27:38 - INFO - __main__ - Step 136352: {'lr': 1.0419524837175149e-05, 'samples': 26179584, 'steps': 136351, 'loss/train': 1.3579638004302979} 11/07/2021 16:27:39 - INFO - __main__ - Step 136353: {'lr': 1.041800880402749e-05, 'samples': 26179776, 'steps': 136352, 'loss/train': 1.274314522743225} 11/07/2021 16:27:39 - INFO - __main__ - Step 136354: {'lr': 1.0416492878831785e-05, 'samples': 26179968, 'steps': 136353, 'loss/train': 1.1506235599517822} 11/07/2021 16:27:39 - INFO - __main__ - Step 136355: {'lr': 1.0414977061588726e-05, 'samples': 26180160, 'steps': 136354, 'loss/train': 1.0139758586883545} 11/07/2021 16:27:40 - INFO - __main__ - Step 136356: {'lr': 1.0413461352298953e-05, 'samples': 26180352, 'steps': 136355, 'loss/train': 1.2805461883544922} 11/07/2021 16:27:41 - INFO - __main__ - Step 136357: {'lr': 1.0411945750963186e-05, 'samples': 26180544, 'steps': 136356, 'loss/train': 1.5315613746643066} 11/07/2021 16:27:42 - INFO - __main__ - Step 136358: {'lr': 1.0410430257582121e-05, 'samples': 26180736, 'steps': 136357, 'loss/train': 0.6912451386451721} 11/07/2021 16:27:42 - INFO - __main__ - Step 136359: {'lr': 1.0408914872156395e-05, 'samples': 26180928, 'steps': 136358, 'loss/train': 1.5022218227386475} 11/07/2021 16:27:42 - INFO - __main__ - Step 136360: {'lr': 1.0407399594686701e-05, 'samples': 26181120, 'steps': 136359, 'loss/train': 0.9106904864311218} 11/07/2021 16:27:43 - INFO - __main__ - Step 136361: {'lr': 1.0405884425173762e-05, 'samples': 26181312, 'steps': 136360, 'loss/train': 1.1572855710983276} 11/07/2021 16:27:43 - INFO - __main__ - Step 136362: {'lr': 1.0404369363618271e-05, 'samples': 26181504, 'steps': 136361, 'loss/train': 1.2120387554168701} 11/07/2021 16:27:43 - INFO - __main__ - Step 136363: {'lr': 1.0402854410020812e-05, 'samples': 26181696, 'steps': 136362, 'loss/train': 1.7403234243392944} 11/07/2021 16:27:45 - INFO - __main__ - Step 136364: {'lr': 1.0401339564382162e-05, 'samples': 26181888, 'steps': 136363, 'loss/train': 1.6893272399902344} 11/07/2021 16:27:45 - INFO - __main__ - Step 136365: {'lr': 1.0399824826702958e-05, 'samples': 26182080, 'steps': 136364, 'loss/train': 1.3363635540008545} 11/07/2021 16:27:45 - INFO - __main__ - Step 136366: {'lr': 1.0398310196983896e-05, 'samples': 26182272, 'steps': 136365, 'loss/train': 1.525311827659607} 11/07/2021 16:27:46 - INFO - __main__ - Step 136367: {'lr': 1.039679567522564e-05, 'samples': 26182464, 'steps': 136366, 'loss/train': 0.9365721940994263} 11/07/2021 16:27:46 - INFO - __main__ - Step 136368: {'lr': 1.0395281261428913e-05, 'samples': 26182656, 'steps': 136367, 'loss/train': 0.8270352482795715} 11/07/2021 16:27:47 - INFO - __main__ - Step 136369: {'lr': 1.039376695559438e-05, 'samples': 26182848, 'steps': 136368, 'loss/train': 1.2133151292800903} 11/07/2021 16:27:47 - INFO - __main__ - Step 136370: {'lr': 1.0392252757722709e-05, 'samples': 26183040, 'steps': 136369, 'loss/train': 1.1297906637191772} 11/07/2021 16:27:48 - INFO - __main__ - Step 136371: {'lr': 1.0390738667814592e-05, 'samples': 26183232, 'steps': 136370, 'loss/train': 1.2048431634902954} 11/07/2021 16:27:48 - INFO - __main__ - Step 136372: {'lr': 1.0389224685870697e-05, 'samples': 26183424, 'steps': 136371, 'loss/train': 1.564154863357544} 11/07/2021 16:27:48 - INFO - __main__ - Step 136373: {'lr': 1.0387710811891744e-05, 'samples': 26183616, 'steps': 136372, 'loss/train': 0.9512173533439636} 11/07/2021 16:27:49 - INFO - __main__ - Step 136374: {'lr': 1.038619704587837e-05, 'samples': 26183808, 'steps': 136373, 'loss/train': 1.5103845596313477} 11/07/2021 16:27:50 - INFO - __main__ - Step 136375: {'lr': 1.03846833878313e-05, 'samples': 26184000, 'steps': 136374, 'loss/train': 0.6108931303024292} 11/07/2021 16:27:51 - INFO - __main__ - Step 136376: {'lr': 1.03831698377512e-05, 'samples': 26184192, 'steps': 136375, 'loss/train': 1.6840887069702148} 11/07/2021 16:27:51 - INFO - __main__ - Step 136377: {'lr': 1.0381656395638733e-05, 'samples': 26184384, 'steps': 136376, 'loss/train': 1.6340187788009644} 11/07/2021 16:27:51 - INFO - __main__ - Step 136378: {'lr': 1.0380143061494568e-05, 'samples': 26184576, 'steps': 136377, 'loss/train': 0.33273041248321533} 11/07/2021 16:27:52 - INFO - __main__ - Step 136379: {'lr': 1.0378629835319452e-05, 'samples': 26184768, 'steps': 136378, 'loss/train': 1.0402759313583374} 11/07/2021 16:27:52 - INFO - __main__ - Step 136380: {'lr': 1.0377116717113998e-05, 'samples': 26184960, 'steps': 136379, 'loss/train': 1.059626579284668} 11/07/2021 16:27:53 - INFO - __main__ - Step 136381: {'lr': 1.0375603706878928e-05, 'samples': 26185152, 'steps': 136380, 'loss/train': 1.4995503425598145} 11/07/2021 16:27:53 - INFO - __main__ - Step 136382: {'lr': 1.0374090804614905e-05, 'samples': 26185344, 'steps': 136381, 'loss/train': 1.349961519241333} 11/07/2021 16:27:54 - INFO - __main__ - Step 136383: {'lr': 1.0372578010322626e-05, 'samples': 26185536, 'steps': 136382, 'loss/train': 1.0953747034072876} 11/07/2021 16:27:54 - INFO - __main__ - Step 136384: {'lr': 1.0371065324002781e-05, 'samples': 26185728, 'steps': 136383, 'loss/train': 1.062570333480835} 11/07/2021 16:27:54 - INFO - __main__ - Step 136385: {'lr': 1.0369552745656014e-05, 'samples': 26185920, 'steps': 136384, 'loss/train': 1.541205883026123} 11/07/2021 16:27:55 - INFO - __main__ - Step 136386: {'lr': 1.0368040275283042e-05, 'samples': 26186112, 'steps': 136385, 'loss/train': 1.597298264503479} 11/07/2021 16:27:56 - INFO - __main__ - Step 136387: {'lr': 1.0366527912884533e-05, 'samples': 26186304, 'steps': 136386, 'loss/train': 1.234057903289795} 11/07/2021 16:27:56 - INFO - __main__ - Step 136388: {'lr': 1.0365015658461152e-05, 'samples': 26186496, 'steps': 136387, 'loss/train': 1.274930715560913} 11/07/2021 16:27:56 - INFO - __main__ - Step 136389: {'lr': 1.036350351201365e-05, 'samples': 26186688, 'steps': 136388, 'loss/train': 0.9593856930732727} 11/07/2021 16:27:57 - INFO - __main__ - Step 136390: {'lr': 1.036199147354261e-05, 'samples': 26186880, 'steps': 136389, 'loss/train': 1.639417052268982} 11/07/2021 16:27:58 - INFO - __main__ - Step 136391: {'lr': 1.0360479543048778e-05, 'samples': 26187072, 'steps': 136390, 'loss/train': 0.9913906455039978} 11/07/2021 16:27:58 - INFO - __main__ - Step 136392: {'lr': 1.0358967720532796e-05, 'samples': 26187264, 'steps': 136391, 'loss/train': 1.4661504030227661} 11/07/2021 16:27:59 - INFO - __main__ - Step 136393: {'lr': 1.0357456005995358e-05, 'samples': 26187456, 'steps': 136392, 'loss/train': 1.8399873971939087} 11/07/2021 16:27:59 - INFO - __main__ - Step 136394: {'lr': 1.0355944399437184e-05, 'samples': 26187648, 'steps': 136393, 'loss/train': 1.4033852815628052} 11/07/2021 16:27:59 - INFO - __main__ - Step 136395: {'lr': 1.0354432900858912e-05, 'samples': 26187840, 'steps': 136394, 'loss/train': 1.6550287008285522} 11/07/2021 16:28:01 - INFO - __main__ - Step 136396: {'lr': 1.0352921510261209e-05, 'samples': 26188032, 'steps': 136395, 'loss/train': 0.7970246076583862} 11/07/2021 16:28:02 - INFO - __main__ - Step 136397: {'lr': 1.0351410227644797e-05, 'samples': 26188224, 'steps': 136396, 'loss/train': 1.0051734447479248} 11/07/2021 16:28:02 - INFO - __main__ - Step 136398: {'lr': 1.0349899053010342e-05, 'samples': 26188416, 'steps': 136397, 'loss/train': 1.1581189632415771} 11/07/2021 16:28:02 - INFO - __main__ - Step 136399: {'lr': 1.0348387986358537e-05, 'samples': 26188608, 'steps': 136398, 'loss/train': 1.2196071147918701} 11/07/2021 16:28:03 - INFO - __main__ - Step 136400: {'lr': 1.0346877027690049e-05, 'samples': 26188800, 'steps': 136399, 'loss/train': 0.6888424754142761} 11/07/2021 16:28:03 - INFO - __main__ - Step 136401: {'lr': 1.0345366177005544e-05, 'samples': 26188992, 'steps': 136400, 'loss/train': 0.5594998598098755} 11/07/2021 16:28:03 - INFO - __main__ - Step 136402: {'lr': 1.034385543430577e-05, 'samples': 26189184, 'steps': 136401, 'loss/train': 0.7044044733047485} 11/07/2021 16:28:05 - INFO - __main__ - Step 136403: {'lr': 1.0342344799591314e-05, 'samples': 26189376, 'steps': 136402, 'loss/train': 1.2978260517120361} 11/07/2021 16:28:05 - INFO - __main__ - Step 136404: {'lr': 1.0340834272862893e-05, 'samples': 26189568, 'steps': 136403, 'loss/train': 0.838833212852478} 11/07/2021 16:28:05 - INFO - __main__ - Step 136405: {'lr': 1.0339323854121203e-05, 'samples': 26189760, 'steps': 136404, 'loss/train': 1.3213804960250854} 11/07/2021 16:28:06 - INFO - __main__ - Step 136406: {'lr': 1.033781354336691e-05, 'samples': 26189952, 'steps': 136405, 'loss/train': 1.1586354970932007} 11/07/2021 16:28:06 - INFO - __main__ - Step 136407: {'lr': 1.0336303340600706e-05, 'samples': 26190144, 'steps': 136406, 'loss/train': 1.4462324380874634} 11/07/2021 16:28:06 - INFO - __main__ - Step 136408: {'lr': 1.033479324582326e-05, 'samples': 26190336, 'steps': 136407, 'loss/train': 1.6366403102874756} 11/07/2021 16:28:07 - INFO - __main__ - Step 136409: {'lr': 1.0333283259035264e-05, 'samples': 26190528, 'steps': 136408, 'loss/train': 1.0386658906936646} 11/07/2021 16:28:08 - INFO - __main__ - Step 136410: {'lr': 1.0331773380237386e-05, 'samples': 26190720, 'steps': 136409, 'loss/train': 1.7623642683029175} 11/07/2021 16:28:08 - INFO - __main__ - Step 136411: {'lr': 1.0330263609430318e-05, 'samples': 26190912, 'steps': 136410, 'loss/train': 1.7631036043167114} 11/07/2021 16:28:08 - INFO - __main__ - Step 136412: {'lr': 1.0328753946614728e-05, 'samples': 26191104, 'steps': 136411, 'loss/train': 1.0401928424835205} 11/07/2021 16:28:09 - INFO - __main__ - Step 136413: {'lr': 1.0327244391791307e-05, 'samples': 26191296, 'steps': 136412, 'loss/train': 1.5253602266311646} 11/07/2021 16:28:10 - INFO - __main__ - Step 136414: {'lr': 1.0325734944960725e-05, 'samples': 26191488, 'steps': 136413, 'loss/train': 1.6534597873687744} 11/07/2021 16:28:10 - INFO - __main__ - Step 136415: {'lr': 1.0324225606123672e-05, 'samples': 26191680, 'steps': 136414, 'loss/train': 1.4578975439071655} 11/07/2021 16:28:10 - INFO - __main__ - Step 136416: {'lr': 1.0322716375280843e-05, 'samples': 26191872, 'steps': 136415, 'loss/train': 1.276833176612854} 11/07/2021 16:28:11 - INFO - __main__ - Step 136417: {'lr': 1.0321207252432908e-05, 'samples': 26192064, 'steps': 136416, 'loss/train': 1.3749572038650513} 11/07/2021 16:28:11 - INFO - __main__ - Step 136418: {'lr': 1.0319698237580499e-05, 'samples': 26192256, 'steps': 136417, 'loss/train': 1.5892233848571777} 11/07/2021 16:28:12 - INFO - __main__ - Step 136419: {'lr': 1.0318189330724342e-05, 'samples': 26192448, 'steps': 136418, 'loss/train': 1.1973720788955688} 11/07/2021 16:28:13 - INFO - __main__ - Step 136420: {'lr': 1.031668053186513e-05, 'samples': 26192640, 'steps': 136419, 'loss/train': 1.7492724657058716} 11/07/2021 16:28:13 - INFO - __main__ - Step 136421: {'lr': 1.03151718410035e-05, 'samples': 26192832, 'steps': 136420, 'loss/train': 1.39484703540802} 11/07/2021 16:28:13 - INFO - __main__ - Step 136422: {'lr': 1.0313663258140177e-05, 'samples': 26193024, 'steps': 136421, 'loss/train': 1.1710494756698608} 11/07/2021 16:28:14 - INFO - __main__ - Step 136423: {'lr': 1.0312154783275824e-05, 'samples': 26193216, 'steps': 136422, 'loss/train': 1.564199686050415} 11/07/2021 16:28:14 - INFO - __main__ - Step 136424: {'lr': 1.0310646416411078e-05, 'samples': 26193408, 'steps': 136423, 'loss/train': 1.6040234565734863} 11/07/2021 16:28:15 - INFO - __main__ - Step 136425: {'lr': 1.0309138157546693e-05, 'samples': 26193600, 'steps': 136424, 'loss/train': 1.0110019445419312} 11/07/2021 16:28:15 - INFO - __main__ - Step 136426: {'lr': 1.0307630006683305e-05, 'samples': 26193792, 'steps': 136425, 'loss/train': 1.2576268911361694} 11/07/2021 16:28:16 - INFO - __main__ - Step 136427: {'lr': 1.030612196382158e-05, 'samples': 26193984, 'steps': 136426, 'loss/train': 1.0315593481063843} 11/07/2021 16:28:16 - INFO - __main__ - Step 136428: {'lr': 1.030461402896224e-05, 'samples': 26194176, 'steps': 136427, 'loss/train': 1.2040830850601196} 11/07/2021 16:28:16 - INFO - __main__ - Step 136429: {'lr': 1.030310620210595e-05, 'samples': 26194368, 'steps': 136428, 'loss/train': 1.5097616910934448} 11/07/2021 16:28:18 - INFO - __main__ - Step 136430: {'lr': 1.0301598483253377e-05, 'samples': 26194560, 'steps': 136429, 'loss/train': 1.4452707767486572} 11/07/2021 16:28:18 - INFO - __main__ - Step 136431: {'lr': 1.0300090872405188e-05, 'samples': 26194752, 'steps': 136430, 'loss/train': 1.155710220336914} 11/07/2021 16:28:18 - INFO - __main__ - Step 136432: {'lr': 1.0298583369562076e-05, 'samples': 26194944, 'steps': 136431, 'loss/train': 1.3180546760559082} 11/07/2021 16:28:19 - INFO - __main__ - Step 136433: {'lr': 1.0297075974724734e-05, 'samples': 26195136, 'steps': 136432, 'loss/train': 0.7440516352653503} 11/07/2021 16:28:19 - INFO - __main__ - Step 136434: {'lr': 1.029556868789383e-05, 'samples': 26195328, 'steps': 136433, 'loss/train': 0.9411868453025818} 11/07/2021 16:28:20 - INFO - __main__ - Step 136435: {'lr': 1.029406150907003e-05, 'samples': 26195520, 'steps': 136434, 'loss/train': 1.2959340810775757} 11/07/2021 16:28:20 - INFO - __main__ - Step 136436: {'lr': 1.0292554438254054e-05, 'samples': 26195712, 'steps': 136435, 'loss/train': 1.5283591747283936} 11/07/2021 16:28:21 - INFO - __main__ - Step 136437: {'lr': 1.0291047475446514e-05, 'samples': 26195904, 'steps': 136436, 'loss/train': 1.5176669359207153} 11/07/2021 16:28:21 - INFO - __main__ - Step 136438: {'lr': 1.0289540620648157e-05, 'samples': 26196096, 'steps': 136437, 'loss/train': 1.2531133890151978} 11/07/2021 16:28:21 - INFO - __main__ - Step 136439: {'lr': 1.0288033873859627e-05, 'samples': 26196288, 'steps': 136438, 'loss/train': 1.4090083837509155} 11/07/2021 16:28:22 - INFO - __main__ - Step 136440: {'lr': 1.028652723508161e-05, 'samples': 26196480, 'steps': 136439, 'loss/train': 1.132277250289917} 11/07/2021 16:28:23 - INFO - __main__ - Step 136441: {'lr': 1.0285020704314835e-05, 'samples': 26196672, 'steps': 136440, 'loss/train': 1.1993910074234009} 11/07/2021 16:28:23 - INFO - __main__ - Step 136442: {'lr': 1.028351428155988e-05, 'samples': 26196864, 'steps': 136441, 'loss/train': 1.688712239265442} 11/07/2021 16:28:23 - INFO - __main__ - Step 136443: {'lr': 1.028200796681747e-05, 'samples': 26197056, 'steps': 136442, 'loss/train': 1.3925596475601196} 11/07/2021 16:28:24 - INFO - __main__ - Step 136444: {'lr': 1.0280501760088295e-05, 'samples': 26197248, 'steps': 136443, 'loss/train': 1.2935805320739746} 11/07/2021 16:28:25 - INFO - __main__ - Step 136445: {'lr': 1.0278995661373024e-05, 'samples': 26197440, 'steps': 136444, 'loss/train': 1.1327580213546753} 11/07/2021 16:28:25 - INFO - __main__ - Step 136446: {'lr': 1.0277489670672352e-05, 'samples': 26197632, 'steps': 136445, 'loss/train': 1.3832662105560303} 11/07/2021 16:28:26 - INFO - __main__ - Step 136447: {'lr': 1.0275983787986943e-05, 'samples': 26197824, 'steps': 136446, 'loss/train': 1.6173738241195679} 11/07/2021 16:28:26 - INFO - __main__ - Step 136448: {'lr': 1.0274478013317461e-05, 'samples': 26198016, 'steps': 136447, 'loss/train': 0.9747700691223145} 11/07/2021 16:28:26 - INFO - __main__ - Step 136449: {'lr': 1.0272972346664605e-05, 'samples': 26198208, 'steps': 136448, 'loss/train': 1.5088077783584595} 11/07/2021 16:28:27 - INFO - __main__ - Step 136450: {'lr': 1.0271466788029066e-05, 'samples': 26198400, 'steps': 136449, 'loss/train': 1.3951729536056519} 11/07/2021 16:28:28 - INFO - __main__ - Step 136451: {'lr': 1.0269961337411482e-05, 'samples': 26198592, 'steps': 136450, 'loss/train': 1.2564926147460938} 11/07/2021 16:28:28 - INFO - __main__ - Step 136452: {'lr': 1.0268455994812604e-05, 'samples': 26198784, 'steps': 136451, 'loss/train': 1.3579257726669312} 11/07/2021 16:28:29 - INFO - __main__ - Step 136453: {'lr': 1.0266950760233012e-05, 'samples': 26198976, 'steps': 136452, 'loss/train': 0.1866033524274826} 11/07/2021 16:28:29 - INFO - __main__ - Step 136454: {'lr': 1.0265445633673432e-05, 'samples': 26199168, 'steps': 136453, 'loss/train': 1.9340068101882935} 11/07/2021 16:28:29 - INFO - __main__ - Step 136455: {'lr': 1.0263940615134553e-05, 'samples': 26199360, 'steps': 136454, 'loss/train': 1.3141865730285645} 11/07/2021 16:28:30 - INFO - __main__ - Step 136456: {'lr': 1.0262435704617045e-05, 'samples': 26199552, 'steps': 136455, 'loss/train': 1.3710063695907593} 11/07/2021 16:28:31 - INFO - __main__ - Step 136457: {'lr': 1.0260930902121574e-05, 'samples': 26199744, 'steps': 136456, 'loss/train': 0.589084267616272} 11/07/2021 16:28:31 - INFO - __main__ - Step 136458: {'lr': 1.0259426207648831e-05, 'samples': 26199936, 'steps': 136457, 'loss/train': 1.1568711996078491} 11/07/2021 16:28:31 - INFO - __main__ - Step 136459: {'lr': 1.0257921621199484e-05, 'samples': 26200128, 'steps': 136458, 'loss/train': 1.0291764736175537} 11/07/2021 16:28:32 - INFO - __main__ - Step 136460: {'lr': 1.0256417142774227e-05, 'samples': 26200320, 'steps': 136459, 'loss/train': 1.5169953107833862} 11/07/2021 16:28:33 - INFO - __main__ - Step 136461: {'lr': 1.0254912772373725e-05, 'samples': 26200512, 'steps': 136460, 'loss/train': 1.6151111125946045} 11/07/2021 16:28:33 - INFO - __main__ - Step 136462: {'lr': 1.0253408509998701e-05, 'samples': 26200704, 'steps': 136461, 'loss/train': 1.3061705827713013} 11/07/2021 16:28:34 - INFO - __main__ - Step 136463: {'lr': 1.0251904355649766e-05, 'samples': 26200896, 'steps': 136462, 'loss/train': 0.9741088151931763} 11/07/2021 16:28:34 - INFO - __main__ - Step 136464: {'lr': 1.0250400309327612e-05, 'samples': 26201088, 'steps': 136463, 'loss/train': 1.384292721748352} 11/07/2021 16:28:34 - INFO - __main__ - Step 136465: {'lr': 1.0248896371032906e-05, 'samples': 26201280, 'steps': 136464, 'loss/train': 1.5077520608901978} 11/07/2021 16:28:35 - INFO - __main__ - Step 136466: {'lr': 1.024739254076637e-05, 'samples': 26201472, 'steps': 136465, 'loss/train': 0.9598815441131592} 11/07/2021 16:28:36 - INFO - __main__ - Step 136467: {'lr': 1.0245888818528671e-05, 'samples': 26201664, 'steps': 136466, 'loss/train': 1.474195122718811} 11/07/2021 16:28:36 - INFO - __main__ - Step 136468: {'lr': 1.0244385204320471e-05, 'samples': 26201856, 'steps': 136467, 'loss/train': 2.567732810974121} 11/07/2021 16:28:36 - INFO - __main__ - Step 136469: {'lr': 1.0242881698142442e-05, 'samples': 26202048, 'steps': 136468, 'loss/train': 1.0140832662582397} 11/07/2021 16:28:37 - INFO - __main__ - Step 136470: {'lr': 1.0241378299995275e-05, 'samples': 26202240, 'steps': 136469, 'loss/train': 1.3507499694824219} 11/07/2021 16:28:37 - INFO - __main__ - Step 136471: {'lr': 1.0239875009879635e-05, 'samples': 26202432, 'steps': 136470, 'loss/train': 1.131066083908081} 11/07/2021 16:28:38 - INFO - __main__ - Step 136472: {'lr': 1.0238371827796217e-05, 'samples': 26202624, 'steps': 136471, 'loss/train': 0.2251196652650833} 11/07/2021 16:28:38 - INFO - __main__ - Step 136473: {'lr': 1.0236868753745715e-05, 'samples': 26202816, 'steps': 136472, 'loss/train': 1.412522554397583} 11/07/2021 16:28:39 - INFO - __main__ - Step 136474: {'lr': 1.023536578772874e-05, 'samples': 26203008, 'steps': 136473, 'loss/train': 1.2961151599884033} 11/07/2021 16:28:39 - INFO - __main__ - Step 136475: {'lr': 1.0233862929746012e-05, 'samples': 26203200, 'steps': 136474, 'loss/train': 1.5611811876296997} 11/07/2021 16:28:39 - INFO - __main__ - Step 136476: {'lr': 1.0232360179798228e-05, 'samples': 26203392, 'steps': 136475, 'loss/train': 1.3036378622055054} 11/07/2021 16:28:41 - INFO - __main__ - Step 136477: {'lr': 1.0230857537886023e-05, 'samples': 26203584, 'steps': 136476, 'loss/train': 1.6737886667251587} 11/07/2021 16:28:41 - INFO - __main__ - Step 136478: {'lr': 1.0229355004010094e-05, 'samples': 26203776, 'steps': 136477, 'loss/train': 0.8758240938186646} 11/07/2021 16:28:41 - INFO - __main__ - Step 136479: {'lr': 1.0227852578171132e-05, 'samples': 26203968, 'steps': 136478, 'loss/train': 1.09952974319458} 11/07/2021 16:28:42 - INFO - __main__ - Step 136480: {'lr': 1.0226350260369777e-05, 'samples': 26204160, 'steps': 136479, 'loss/train': 1.012953281402588} 11/07/2021 16:28:42 - INFO - __main__ - Step 136481: {'lr': 1.022484805060675e-05, 'samples': 26204352, 'steps': 136480, 'loss/train': 1.5816162824630737} 11/07/2021 16:28:43 - INFO - __main__ - Step 136482: {'lr': 1.022334594888269e-05, 'samples': 26204544, 'steps': 136481, 'loss/train': 1.3903785943984985} 11/07/2021 16:28:44 - INFO - __main__ - Step 136483: {'lr': 1.022184395519829e-05, 'samples': 26204736, 'steps': 136482, 'loss/train': 1.6408940553665161} 11/07/2021 16:28:44 - INFO - __main__ - Step 136484: {'lr': 1.0220342069554272e-05, 'samples': 26204928, 'steps': 136483, 'loss/train': 1.496094822883606} 11/07/2021 16:28:44 - INFO - __main__ - Step 136485: {'lr': 1.021884029195122e-05, 'samples': 26205120, 'steps': 136484, 'loss/train': 0.14996911585330963} 11/07/2021 16:28:45 - INFO - __main__ - Step 136486: {'lr': 1.0217338622389883e-05, 'samples': 26205312, 'steps': 136485, 'loss/train': 0.31528788805007935} 11/07/2021 16:28:45 - INFO - __main__ - Step 136487: {'lr': 1.0215837060870897e-05, 'samples': 26205504, 'steps': 136486, 'loss/train': 1.289595127105713} 11/07/2021 16:28:46 - INFO - __main__ - Step 136488: {'lr': 1.0214335607394959e-05, 'samples': 26205696, 'steps': 136487, 'loss/train': 1.383098840713501} 11/07/2021 16:28:46 - INFO - __main__ - Step 136489: {'lr': 1.0212834261962733e-05, 'samples': 26205888, 'steps': 136488, 'loss/train': 1.0651754140853882} 11/07/2021 16:28:47 - INFO - __main__ - Step 136490: {'lr': 1.0211333024574915e-05, 'samples': 26206080, 'steps': 136489, 'loss/train': 1.285850167274475} 11/07/2021 16:28:47 - INFO - __main__ - Step 136491: {'lr': 1.020983189523217e-05, 'samples': 26206272, 'steps': 136490, 'loss/train': 5.664328575134277} 11/07/2021 16:28:47 - INFO - __main__ - Step 136492: {'lr': 1.0208330873935161e-05, 'samples': 26206464, 'steps': 136491, 'loss/train': 0.6334317922592163} 11/07/2021 16:28:48 - INFO - __main__ - Step 136493: {'lr': 1.0206829960684589e-05, 'samples': 26206656, 'steps': 136492, 'loss/train': 1.4628132581710815} 11/07/2021 16:28:49 - INFO - __main__ - Step 136494: {'lr': 1.0205329155481113e-05, 'samples': 26206848, 'steps': 136493, 'loss/train': 1.240997314453125} 11/07/2021 16:28:49 - INFO - __main__ - Step 136495: {'lr': 1.0203828458325432e-05, 'samples': 26207040, 'steps': 136494, 'loss/train': 1.1172878742218018} 11/07/2021 16:28:50 - INFO - __main__ - Step 136496: {'lr': 1.0202327869218208e-05, 'samples': 26207232, 'steps': 136495, 'loss/train': 1.1718989610671997} 11/07/2021 16:28:50 - INFO - __main__ - Step 136497: {'lr': 1.0200827388160112e-05, 'samples': 26207424, 'steps': 136496, 'loss/train': 1.0750185251235962} 11/07/2021 16:28:50 - INFO - __main__ - Step 136498: {'lr': 1.0199327015151805e-05, 'samples': 26207616, 'steps': 136497, 'loss/train': 1.226137638092041} 11/07/2021 16:28:51 - INFO - __main__ - Step 136499: {'lr': 1.0197826750193983e-05, 'samples': 26207808, 'steps': 136498, 'loss/train': 1.5397729873657227} 11/07/2021 16:28:52 - INFO - __main__ - Step 136500: {'lr': 1.0196326593287342e-05, 'samples': 26208000, 'steps': 136499, 'loss/train': 1.1282020807266235} 11/07/2021 16:28:52 - INFO - __main__ - Step 136501: {'lr': 1.0194826544432518e-05, 'samples': 26208192, 'steps': 136500, 'loss/train': 1.081178903579712} 11/07/2021 16:28:52 - INFO - __main__ - Step 136502: {'lr': 1.0193326603630204e-05, 'samples': 26208384, 'steps': 136501, 'loss/train': 1.6160025596618652} 11/07/2021 16:28:53 - INFO - __main__ - Step 136503: {'lr': 1.019182677088107e-05, 'samples': 26208576, 'steps': 136502, 'loss/train': 1.1813510656356812} 11/07/2021 16:28:54 - INFO - __main__ - Step 136504: {'lr': 1.0190327046185805e-05, 'samples': 26208768, 'steps': 136503, 'loss/train': 0.9221071004867554} 11/07/2021 16:28:54 - INFO - __main__ - Step 136505: {'lr': 1.0188827429545078e-05, 'samples': 26208960, 'steps': 136504, 'loss/train': 1.3815386295318604} 11/07/2021 16:28:54 - INFO - __main__ - Step 136506: {'lr': 1.0187327920959556e-05, 'samples': 26209152, 'steps': 136505, 'loss/train': 0.9762648344039917} 11/07/2021 16:28:55 - INFO - __main__ - Step 136507: {'lr': 1.018582852042993e-05, 'samples': 26209344, 'steps': 136506, 'loss/train': 1.4041714668273926} 11/07/2021 16:28:55 - INFO - __main__ - Step 136508: {'lr': 1.0184329227956867e-05, 'samples': 26209536, 'steps': 136507, 'loss/train': 1.488706350326538} 11/07/2021 16:28:57 - INFO - __main__ - Step 136509: {'lr': 1.0182830043541063e-05, 'samples': 26209728, 'steps': 136508, 'loss/train': 1.0880788564682007} 11/07/2021 16:28:57 - INFO - __main__ - Step 136510: {'lr': 1.0181330967183184e-05, 'samples': 26209920, 'steps': 136509, 'loss/train': 0.07733719795942307} 11/07/2021 16:28:57 - INFO - __main__ - Step 136511: {'lr': 1.0179831998883893e-05, 'samples': 26210112, 'steps': 136510, 'loss/train': 1.4532991647720337} 11/07/2021 16:28:58 - INFO - __main__ - Step 136512: {'lr': 1.017833313864383e-05, 'samples': 26210304, 'steps': 136511, 'loss/train': 0.6479577422142029} 11/07/2021 16:28:58 - INFO - __main__ - Step 136513: {'lr': 1.0176834386463746e-05, 'samples': 26210496, 'steps': 136512, 'loss/train': 1.5673247575759888} 11/07/2021 16:28:59 - INFO - __main__ - Step 136514: {'lr': 1.0175335742344249e-05, 'samples': 26210688, 'steps': 136513, 'loss/train': 1.0539764165878296} 11/07/2021 16:28:59 - INFO - __main__ - Step 136515: {'lr': 1.0173837206286064e-05, 'samples': 26210880, 'steps': 136514, 'loss/train': 1.4569567441940308} 11/07/2021 16:29:00 - INFO - __main__ - Step 136516: {'lr': 1.0172338778289852e-05, 'samples': 26211072, 'steps': 136515, 'loss/train': 1.2969111204147339} 11/07/2021 16:29:00 - INFO - __main__ - Step 136517: {'lr': 1.0170840458356257e-05, 'samples': 26211264, 'steps': 136516, 'loss/train': 1.590401291847229} 11/07/2021 16:29:00 - INFO - __main__ - Step 136518: {'lr': 1.0169342246485996e-05, 'samples': 26211456, 'steps': 136517, 'loss/train': 0.5389702320098877} 11/07/2021 16:29:02 - INFO - __main__ - Step 136519: {'lr': 1.0167844142679739e-05, 'samples': 26211648, 'steps': 136518, 'loss/train': 1.5251810550689697} 11/07/2021 16:29:02 - INFO - __main__ - Step 136520: {'lr': 1.016634614693815e-05, 'samples': 26211840, 'steps': 136519, 'loss/train': 1.5400396585464478} 11/07/2021 16:29:02 - INFO - __main__ - Step 136521: {'lr': 1.0164848259261894e-05, 'samples': 26212032, 'steps': 136520, 'loss/train': 0.4036003649234772} 11/07/2021 16:29:03 - INFO - __main__ - Step 136522: {'lr': 1.0163350479651668e-05, 'samples': 26212224, 'steps': 136521, 'loss/train': 1.7544224262237549} 11/07/2021 16:29:03 - INFO - __main__ - Step 136523: {'lr': 1.0161852808108135e-05, 'samples': 26212416, 'steps': 136522, 'loss/train': 1.122972011566162} 11/07/2021 16:29:03 - INFO - __main__ - Step 136524: {'lr': 1.0160355244631964e-05, 'samples': 26212608, 'steps': 136523, 'loss/train': 1.5337532758712769} 11/07/2021 16:29:04 - INFO - __main__ - Step 136525: {'lr': 1.0158857789223846e-05, 'samples': 26212800, 'steps': 136524, 'loss/train': 1.2658159732818604} 11/07/2021 16:29:05 - INFO - __main__ - Step 136526: {'lr': 1.0157360441884423e-05, 'samples': 26212992, 'steps': 136525, 'loss/train': 1.3432693481445312} 11/07/2021 16:29:05 - INFO - __main__ - Step 136527: {'lr': 1.0155863202614412e-05, 'samples': 26213184, 'steps': 136526, 'loss/train': 1.2075698375701904} 11/07/2021 16:29:05 - INFO - __main__ - Step 136528: {'lr': 1.0154366071414457e-05, 'samples': 26213376, 'steps': 136527, 'loss/train': 1.5829098224639893} 11/07/2021 16:29:06 - INFO - __main__ - Step 136529: {'lr': 1.0152869048285246e-05, 'samples': 26213568, 'steps': 136528, 'loss/train': 1.406766414642334} 11/07/2021 16:29:07 - INFO - __main__ - Step 136530: {'lr': 1.0151372133227448e-05, 'samples': 26213760, 'steps': 136529, 'loss/train': 1.4750418663024902} 11/07/2021 16:29:07 - INFO - __main__ - Step 136531: {'lr': 1.014987532624173e-05, 'samples': 26213952, 'steps': 136530, 'loss/train': 1.1045253276824951} 11/07/2021 16:29:07 - INFO - __main__ - Step 136532: {'lr': 1.0148378627328787e-05, 'samples': 26214144, 'steps': 136531, 'loss/train': 1.6012382507324219} 11/07/2021 16:29:08 - INFO - __main__ - Step 136533: {'lr': 1.0146882036489307e-05, 'samples': 26214336, 'steps': 136532, 'loss/train': 1.1337867975234985} 11/07/2021 16:29:08 - INFO - __main__ - Step 136534: {'lr': 1.0145385553723906e-05, 'samples': 26214528, 'steps': 136533, 'loss/train': 1.14753258228302} 11/07/2021 16:29:09 - INFO - __main__ - Step 136535: {'lr': 1.0143889179033305e-05, 'samples': 26214720, 'steps': 136534, 'loss/train': 1.407990574836731} 11/07/2021 16:29:10 - INFO - __main__ - Step 136536: {'lr': 1.014239291241817e-05, 'samples': 26214912, 'steps': 136535, 'loss/train': 1.0839787721633911} 11/07/2021 16:29:10 - INFO - __main__ - Step 136537: {'lr': 1.014089675387922e-05, 'samples': 26215104, 'steps': 136536, 'loss/train': 1.5665160417556763} 11/07/2021 16:29:10 - INFO - __main__ - Step 136538: {'lr': 1.0139400703417012e-05, 'samples': 26215296, 'steps': 136537, 'loss/train': 0.9837374091148376} 11/07/2021 16:29:11 - INFO - __main__ - Step 136539: {'lr': 1.0137904761032324e-05, 'samples': 26215488, 'steps': 136538, 'loss/train': 1.2892552614212036} 11/07/2021 16:29:12 - INFO - __main__ - Step 136540: {'lr': 1.0136408926725765e-05, 'samples': 26215680, 'steps': 136539, 'loss/train': 1.174245834350586} 11/07/2021 16:29:12 - INFO - __main__ - Step 136541: {'lr': 1.0134913200498058e-05, 'samples': 26215872, 'steps': 136540, 'loss/train': 1.2424677610397339} 11/07/2021 16:29:12 - INFO - __main__ - Step 136542: {'lr': 1.013341758234984e-05, 'samples': 26216064, 'steps': 136541, 'loss/train': 1.1612521409988403} 11/07/2021 16:29:13 - INFO - __main__ - Step 136543: {'lr': 1.0131922072281835e-05, 'samples': 26216256, 'steps': 136542, 'loss/train': 1.3135340213775635} 11/07/2021 16:29:13 - INFO - __main__ - Step 136544: {'lr': 1.013042667029465e-05, 'samples': 26216448, 'steps': 136543, 'loss/train': 1.4090055227279663} 11/07/2021 16:29:14 - INFO - __main__ - Step 136545: {'lr': 1.0128931376389011e-05, 'samples': 26216640, 'steps': 136544, 'loss/train': 0.8433499336242676} 11/07/2021 16:29:14 - INFO - __main__ - Step 136546: {'lr': 1.0127436190565582e-05, 'samples': 26216832, 'steps': 136545, 'loss/train': 1.2605395317077637} 11/07/2021 16:29:15 - INFO - __main__ - Step 136547: {'lr': 1.0125941112824998e-05, 'samples': 26217024, 'steps': 136546, 'loss/train': 0.6681340336799622} 11/07/2021 16:29:15 - INFO - __main__ - Step 136548: {'lr': 1.0124446143167987e-05, 'samples': 26217216, 'steps': 136547, 'loss/train': 1.4059066772460938} 11/07/2021 16:29:15 - INFO - __main__ - Step 136549: {'lr': 1.0122951281595182e-05, 'samples': 26217408, 'steps': 136548, 'loss/train': 1.4673470258712769} 11/07/2021 16:29:16 - INFO - __main__ - Step 136550: {'lr': 1.0121456528107337e-05, 'samples': 26217600, 'steps': 136549, 'loss/train': 1.5192623138427734} 11/07/2021 16:29:17 - INFO - __main__ - Step 136551: {'lr': 1.0119961882705003e-05, 'samples': 26217792, 'steps': 136550, 'loss/train': 1.379194736480713} 11/07/2021 16:29:17 - INFO - __main__ - Step 136552: {'lr': 1.0118467345388932e-05, 'samples': 26217984, 'steps': 136551, 'loss/train': 1.8892319202423096} 11/07/2021 16:29:18 - INFO - __main__ - Step 136553: {'lr': 1.0116972916159762e-05, 'samples': 26218176, 'steps': 136552, 'loss/train': 1.7134264707565308} 11/07/2021 16:29:18 - INFO - __main__ - Step 136554: {'lr': 1.0115478595018185e-05, 'samples': 26218368, 'steps': 136553, 'loss/train': 1.3741971254348755} 11/07/2021 16:29:18 - INFO - __main__ - Step 136555: {'lr': 1.0113984381964869e-05, 'samples': 26218560, 'steps': 136554, 'loss/train': 1.2118229866027832} 11/07/2021 16:29:19 - INFO - __main__ - Step 136556: {'lr': 1.0112490277000509e-05, 'samples': 26218752, 'steps': 136555, 'loss/train': 1.0696616172790527} 11/07/2021 16:29:20 - INFO - __main__ - Step 136557: {'lr': 1.011099628012574e-05, 'samples': 26218944, 'steps': 136556, 'loss/train': 1.286994218826294} 11/07/2021 16:29:20 - INFO - __main__ - Step 136558: {'lr': 1.0109502391341257e-05, 'samples': 26219136, 'steps': 136557, 'loss/train': 1.5676143169403076} 11/07/2021 16:29:20 - INFO - __main__ - Step 136559: {'lr': 1.0108008610647728e-05, 'samples': 26219328, 'steps': 136558, 'loss/train': 1.3859730958938599} 11/07/2021 16:29:21 - INFO - __main__ - Step 136560: {'lr': 1.0106514938045847e-05, 'samples': 26219520, 'steps': 136559, 'loss/train': 0.9156005382537842} 11/07/2021 16:29:22 - INFO - __main__ - Step 136561: {'lr': 1.0105021373536249e-05, 'samples': 26219712, 'steps': 136560, 'loss/train': 1.368485450744629} 11/07/2021 16:29:22 - INFO - __main__ - Step 136562: {'lr': 1.0103527917119631e-05, 'samples': 26219904, 'steps': 136561, 'loss/train': 1.4203500747680664} 11/07/2021 16:29:23 - INFO - __main__ - Step 136563: {'lr': 1.0102034568796687e-05, 'samples': 26220096, 'steps': 136562, 'loss/train': 0.9743785858154297} 11/07/2021 16:29:23 - INFO - __main__ - Step 136564: {'lr': 1.0100541328568053e-05, 'samples': 26220288, 'steps': 136563, 'loss/train': 1.4676703214645386} 11/07/2021 16:29:24 - INFO - __main__ - Step 136565: {'lr': 1.0099048196434397e-05, 'samples': 26220480, 'steps': 136564, 'loss/train': 1.3216813802719116} 11/07/2021 16:29:24 - INFO - __main__ - Step 136566: {'lr': 1.009755517239641e-05, 'samples': 26220672, 'steps': 136565, 'loss/train': 1.7449793815612793} 11/07/2021 16:29:25 - INFO - __main__ - Step 136567: {'lr': 1.0096062256454764e-05, 'samples': 26220864, 'steps': 136566, 'loss/train': 0.9614728689193726} 11/07/2021 16:29:25 - INFO - __main__ - Step 136568: {'lr': 1.009456944861012e-05, 'samples': 26221056, 'steps': 136567, 'loss/train': 1.4504369497299194} 11/07/2021 16:29:26 - INFO - __main__ - Step 136569: {'lr': 1.0093076748863173e-05, 'samples': 26221248, 'steps': 136568, 'loss/train': 1.3587437868118286} 11/07/2021 16:29:26 - INFO - __main__ - Step 136570: {'lr': 1.009158415721459e-05, 'samples': 26221440, 'steps': 136569, 'loss/train': 1.59022855758667} 11/07/2021 16:29:26 - INFO - __main__ - Step 136571: {'lr': 1.0090091673665036e-05, 'samples': 26221632, 'steps': 136570, 'loss/train': 1.4724904298782349} 11/07/2021 16:29:27 - INFO - __main__ - Step 136572: {'lr': 1.0088599298215179e-05, 'samples': 26221824, 'steps': 136571, 'loss/train': 1.3808155059814453} 11/07/2021 16:29:28 - INFO - __main__ - Step 136573: {'lr': 1.0087107030865684e-05, 'samples': 26222016, 'steps': 136572, 'loss/train': 1.2852864265441895} 11/07/2021 16:29:28 - INFO - __main__ - Step 136574: {'lr': 1.0085614871617271e-05, 'samples': 26222208, 'steps': 136573, 'loss/train': 1.0585441589355469} 11/07/2021 16:29:28 - INFO - __main__ - Step 136575: {'lr': 1.0084122820470554e-05, 'samples': 26222400, 'steps': 136574, 'loss/train': 1.2285979986190796} 11/07/2021 16:29:29 - INFO - __main__ - Step 136576: {'lr': 1.0082630877426224e-05, 'samples': 26222592, 'steps': 136575, 'loss/train': 1.2695672512054443} 11/07/2021 16:29:29 - INFO - __main__ - Step 136577: {'lr': 1.0081139042485005e-05, 'samples': 26222784, 'steps': 136576, 'loss/train': 1.154728651046753} 11/07/2021 16:29:30 - INFO - __main__ - Step 136578: {'lr': 1.0079647315647478e-05, 'samples': 26222976, 'steps': 136577, 'loss/train': 0.897691547870636} 11/07/2021 16:29:31 - INFO - __main__ - Step 136579: {'lr': 1.0078155696914365e-05, 'samples': 26223168, 'steps': 136578, 'loss/train': 1.6467863321304321} 11/07/2021 16:29:31 - INFO - __main__ - Step 136580: {'lr': 1.0076664186286333e-05, 'samples': 26223360, 'steps': 136579, 'loss/train': 1.7208986282348633} 11/07/2021 16:29:31 - INFO - __main__ - Step 136581: {'lr': 1.0075172783764048e-05, 'samples': 26223552, 'steps': 136580, 'loss/train': 1.354325294494629} 11/07/2021 16:29:32 - INFO - __main__ - Step 136582: {'lr': 1.0073681489348202e-05, 'samples': 26223744, 'steps': 136581, 'loss/train': 1.4077749252319336} 11/07/2021 16:29:33 - INFO - __main__ - Step 136583: {'lr': 1.0072190303039436e-05, 'samples': 26223936, 'steps': 136582, 'loss/train': 1.0227514505386353} 11/07/2021 16:29:33 - INFO - __main__ - Step 136584: {'lr': 1.0070699224838442e-05, 'samples': 26224128, 'steps': 136583, 'loss/train': 1.7158546447753906} 11/07/2021 16:29:34 - INFO - __main__ - Step 136585: {'lr': 1.0069208254745888e-05, 'samples': 26224320, 'steps': 136584, 'loss/train': 1.3119391202926636} 11/07/2021 16:29:34 - INFO - __main__ - Step 136586: {'lr': 1.0067717392762466e-05, 'samples': 26224512, 'steps': 136585, 'loss/train': 1.3993333578109741} 11/07/2021 16:29:34 - INFO - __main__ - Step 136587: {'lr': 1.0066226638888815e-05, 'samples': 26224704, 'steps': 136586, 'loss/train': 1.3722970485687256} 11/07/2021 16:29:35 - INFO - __main__ - Step 136588: {'lr': 1.0064735993125601e-05, 'samples': 26224896, 'steps': 136587, 'loss/train': 1.4022499322891235} 11/07/2021 16:29:36 - INFO - __main__ - Step 136589: {'lr': 1.0063245455473546e-05, 'samples': 26225088, 'steps': 136588, 'loss/train': 0.320951908826828} 11/07/2021 16:29:36 - INFO - __main__ - Step 136590: {'lr': 1.0061755025933317e-05, 'samples': 26225280, 'steps': 136589, 'loss/train': 1.101621150970459} 11/07/2021 16:29:37 - INFO - __main__ - Step 136591: {'lr': 1.0060264704505496e-05, 'samples': 26225472, 'steps': 136590, 'loss/train': 1.7456600666046143} 11/07/2021 16:29:37 - INFO - __main__ - Step 136592: {'lr': 1.0058774491190859e-05, 'samples': 26225664, 'steps': 136591, 'loss/train': 1.7409542798995972} 11/07/2021 16:29:37 - INFO - __main__ - Step 136593: {'lr': 1.0057284385990018e-05, 'samples': 26225856, 'steps': 136592, 'loss/train': 1.3788336515426636} 11/07/2021 16:29:38 - INFO - __main__ - Step 136594: {'lr': 1.005579438890364e-05, 'samples': 26226048, 'steps': 136593, 'loss/train': 1.0778391361236572} 11/07/2021 16:29:39 - INFO - __main__ - Step 136595: {'lr': 1.0054304499932443e-05, 'samples': 26226240, 'steps': 136594, 'loss/train': 1.4610658884048462} 11/07/2021 16:29:39 - INFO - __main__ - Step 136596: {'lr': 1.0052814719077068e-05, 'samples': 26226432, 'steps': 136595, 'loss/train': 1.699560523033142} 11/07/2021 16:29:39 - INFO - __main__ - Step 136597: {'lr': 1.0051325046338211e-05, 'samples': 26226624, 'steps': 136596, 'loss/train': 1.5352927446365356} 11/07/2021 16:29:40 - INFO - __main__ - Step 136598: {'lr': 1.0049835481716508e-05, 'samples': 26226816, 'steps': 136597, 'loss/train': 1.1605108976364136} 11/07/2021 16:29:41 - INFO - __main__ - Step 136599: {'lr': 1.0048346025212624e-05, 'samples': 26227008, 'steps': 136598, 'loss/train': 1.3543701171875} 11/07/2021 16:29:41 - INFO - __main__ - Step 136600: {'lr': 1.0046856676827282e-05, 'samples': 26227200, 'steps': 136599, 'loss/train': 1.0723334550857544} 11/07/2021 16:29:41 - INFO - __main__ - Step 136601: {'lr': 1.004536743656112e-05, 'samples': 26227392, 'steps': 136600, 'loss/train': 1.3325777053833008} 11/07/2021 16:29:42 - INFO - __main__ - Step 136602: {'lr': 1.0043878304414805e-05, 'samples': 26227584, 'steps': 136601, 'loss/train': 1.4007614850997925} 11/07/2021 16:29:42 - INFO - __main__ - Step 136603: {'lr': 1.0042389280389031e-05, 'samples': 26227776, 'steps': 136602, 'loss/train': 1.457775354385376} 11/07/2021 16:29:43 - INFO - __main__ - Step 136604: {'lr': 1.0040900364484461e-05, 'samples': 26227968, 'steps': 136603, 'loss/train': 1.0229092836380005} 11/07/2021 16:29:44 - INFO - __main__ - Step 136605: {'lr': 1.0039411556701738e-05, 'samples': 26228160, 'steps': 136604, 'loss/train': 1.479509711265564} 11/07/2021 16:29:44 - INFO - __main__ - Step 136606: {'lr': 1.0037922857041554e-05, 'samples': 26228352, 'steps': 136605, 'loss/train': 2.0597827434539795} 11/07/2021 16:29:44 - INFO - __main__ - Step 136607: {'lr': 1.0036434265504574e-05, 'samples': 26228544, 'steps': 136606, 'loss/train': 1.3402577638626099} 11/07/2021 16:29:45 - INFO - __main__ - Step 136608: {'lr': 1.0034945782091493e-05, 'samples': 26228736, 'steps': 136607, 'loss/train': 1.119892954826355} 11/07/2021 16:29:46 - INFO - __main__ - Step 136609: {'lr': 1.0033457406802949e-05, 'samples': 26228928, 'steps': 136608, 'loss/train': 1.421849250793457} 11/07/2021 16:29:46 - INFO - __main__ - Step 136610: {'lr': 1.0031969139639636e-05, 'samples': 26229120, 'steps': 136609, 'loss/train': 0.8800232410430908} 11/07/2021 16:29:47 - INFO - __main__ - Step 136611: {'lr': 1.0030480980602191e-05, 'samples': 26229312, 'steps': 136610, 'loss/train': 1.1769195795059204} 11/07/2021 16:29:47 - INFO - __main__ - Step 136612: {'lr': 1.0028992929691338e-05, 'samples': 26229504, 'steps': 136611, 'loss/train': 1.1520626544952393} 11/07/2021 16:29:47 - INFO - __main__ - Step 136613: {'lr': 1.0027504986907687e-05, 'samples': 26229696, 'steps': 136612, 'loss/train': 1.1403610706329346} 11/07/2021 16:29:48 - INFO - __main__ - Step 136614: {'lr': 1.002601715225196e-05, 'samples': 26229888, 'steps': 136613, 'loss/train': 1.7790553569793701} 11/07/2021 16:29:49 - INFO - __main__ - Step 136615: {'lr': 1.0024529425724793e-05, 'samples': 26230080, 'steps': 136614, 'loss/train': 1.1469076871871948} 11/07/2021 16:29:49 - INFO - __main__ - Step 136616: {'lr': 1.0023041807326883e-05, 'samples': 26230272, 'steps': 136615, 'loss/train': 0.8715529441833496} 11/07/2021 16:29:49 - INFO - __main__ - Step 136617: {'lr': 1.0021554297058922e-05, 'samples': 26230464, 'steps': 136616, 'loss/train': 0.24404236674308777} 11/07/2021 16:29:50 - INFO - __main__ - Step 136618: {'lr': 1.0020066894921493e-05, 'samples': 26230656, 'steps': 136617, 'loss/train': 1.1034244298934937} 11/07/2021 16:29:51 - INFO - __main__ - Step 136619: {'lr': 1.0018579600915346e-05, 'samples': 26230848, 'steps': 136618, 'loss/train': 1.0684274435043335} 11/07/2021 16:29:51 - INFO - __main__ - Step 136620: {'lr': 1.001709241504109e-05, 'samples': 26231040, 'steps': 136619, 'loss/train': 1.2316575050354004} 11/07/2021 16:29:51 - INFO - __main__ - Step 136621: {'lr': 1.001560533729945e-05, 'samples': 26231232, 'steps': 136620, 'loss/train': 1.3387588262557983} 11/07/2021 16:29:52 - INFO - __main__ - Step 136622: {'lr': 1.0014118367691089e-05, 'samples': 26231424, 'steps': 136621, 'loss/train': 1.6674717664718628} 11/07/2021 16:29:52 - INFO - __main__ - Step 136623: {'lr': 1.0012631506216647e-05, 'samples': 26231616, 'steps': 136622, 'loss/train': 0.4989858567714691} 11/07/2021 16:29:52 - INFO - __main__ - Step 136624: {'lr': 1.0011144752876816e-05, 'samples': 26231808, 'steps': 136623, 'loss/train': 1.3872902393341064} 11/07/2021 16:29:53 - INFO - __main__ - Step 136625: {'lr': 1.0009658107672237e-05, 'samples': 26232000, 'steps': 136624, 'loss/train': 1.2869987487792969} 11/07/2021 16:29:54 - INFO - __main__ - Step 136626: {'lr': 1.000817157060363e-05, 'samples': 26232192, 'steps': 136625, 'loss/train': 1.4328892230987549} 11/07/2021 16:29:54 - INFO - __main__ - Step 136627: {'lr': 1.0006685141671635e-05, 'samples': 26232384, 'steps': 136626, 'loss/train': 1.417452335357666} 11/07/2021 16:29:54 - INFO - __main__ - Step 136628: {'lr': 1.0005198820876915e-05, 'samples': 26232576, 'steps': 136627, 'loss/train': 1.3159968852996826} 11/07/2021 16:29:55 - INFO - __main__ - Step 136629: {'lr': 1.000371260822014e-05, 'samples': 26232768, 'steps': 136628, 'loss/train': 1.077599287033081} 11/07/2021 16:29:56 - INFO - __main__ - Step 136630: {'lr': 1.0002226503702e-05, 'samples': 26232960, 'steps': 136629, 'loss/train': 1.235666275024414} 11/07/2021 16:29:56 - INFO - __main__ - Step 136631: {'lr': 1.000074050732319e-05, 'samples': 26233152, 'steps': 136630, 'loss/train': 1.5961358547210693} 11/07/2021 16:29:56 - INFO - __main__ - Step 136632: {'lr': 9.999254619084298e-06, 'samples': 26233344, 'steps': 136631, 'loss/train': 1.3302221298217773} 11/07/2021 16:29:57 - INFO - __main__ - Step 136633: {'lr': 9.997768838986065e-06, 'samples': 26233536, 'steps': 136632, 'loss/train': 1.3698956966400146} 11/07/2021 16:29:57 - INFO - __main__ - Step 136634: {'lr': 9.996283167029108e-06, 'samples': 26233728, 'steps': 136633, 'loss/train': 0.8186970949172974} 11/07/2021 16:29:58 - INFO - __main__ - Step 136635: {'lr': 9.994797603214117e-06, 'samples': 26233920, 'steps': 136634, 'loss/train': 1.2843406200408936} 11/07/2021 16:29:59 - INFO - __main__ - Step 136636: {'lr': 9.993312147541788e-06, 'samples': 26234112, 'steps': 136635, 'loss/train': 1.1493818759918213} 11/07/2021 16:29:59 - INFO - __main__ - Step 136637: {'lr': 9.99182680001276e-06, 'samples': 26234304, 'steps': 136636, 'loss/train': 1.5959599018096924} 11/07/2021 16:29:59 - INFO - __main__ - Step 136638: {'lr': 9.990341560627725e-06, 'samples': 26234496, 'steps': 136637, 'loss/train': 0.38158538937568665} 11/07/2021 16:30:00 - INFO - __main__ - Step 136639: {'lr': 9.988856429387321e-06, 'samples': 26234688, 'steps': 136638, 'loss/train': 1.071962833404541} 11/07/2021 16:30:01 - INFO - __main__ - Step 136640: {'lr': 9.987371406292244e-06, 'samples': 26234880, 'steps': 136639, 'loss/train': 1.5867886543273926} 11/07/2021 16:30:01 - INFO - __main__ - Step 136641: {'lr': 9.985886491343132e-06, 'samples': 26235072, 'steps': 136640, 'loss/train': 1.3833849430084229} 11/07/2021 16:30:01 - INFO - __main__ - Step 136642: {'lr': 9.984401684540706e-06, 'samples': 26235264, 'steps': 136641, 'loss/train': 1.1847952604293823} 11/07/2021 16:30:02 - INFO - __main__ - Step 136643: {'lr': 9.982916985885575e-06, 'samples': 26235456, 'steps': 136642, 'loss/train': 1.3339433670043945} 11/07/2021 16:30:02 - INFO - __main__ - Step 136644: {'lr': 9.981432395378493e-06, 'samples': 26235648, 'steps': 136643, 'loss/train': 1.4665648937225342} 11/07/2021 16:30:03 - INFO - __main__ - Step 136645: {'lr': 9.979947913020037e-06, 'samples': 26235840, 'steps': 136644, 'loss/train': 1.879041314125061} 11/07/2021 16:30:04 - INFO - __main__ - Step 136646: {'lr': 9.978463538810905e-06, 'samples': 26236032, 'steps': 136645, 'loss/train': 0.8542148470878601} 11/07/2021 16:30:04 - INFO - __main__ - Step 136647: {'lr': 9.97697927275179e-06, 'samples': 26236224, 'steps': 136646, 'loss/train': 1.255265235900879} 11/07/2021 16:30:04 - INFO - __main__ - Step 136648: {'lr': 9.97549511484333e-06, 'samples': 26236416, 'steps': 136647, 'loss/train': 0.6290729641914368} 11/07/2021 16:30:05 - INFO - __main__ - Step 136649: {'lr': 9.97401106508622e-06, 'samples': 26236608, 'steps': 136648, 'loss/train': 1.74509859085083} 11/07/2021 16:30:06 - INFO - __main__ - Step 136650: {'lr': 9.972527123481122e-06, 'samples': 26236800, 'steps': 136649, 'loss/train': 1.0800244808197021} 11/07/2021 16:30:06 - INFO - __main__ - Step 136651: {'lr': 9.971043290028681e-06, 'samples': 26236992, 'steps': 136650, 'loss/train': 1.2801461219787598} 11/07/2021 16:30:06 - INFO - __main__ - Step 136652: {'lr': 9.969559564729586e-06, 'samples': 26237184, 'steps': 136651, 'loss/train': 1.4477040767669678} 11/07/2021 16:30:07 - INFO - __main__ - Step 136653: {'lr': 9.968075947584503e-06, 'samples': 26237376, 'steps': 136652, 'loss/train': 1.4575023651123047} 11/07/2021 16:30:07 - INFO - __main__ - Step 136654: {'lr': 9.966592438594102e-06, 'samples': 26237568, 'steps': 136653, 'loss/train': 1.0891317129135132} 11/07/2021 16:30:08 - INFO - __main__ - Step 136655: {'lr': 9.965109037759045e-06, 'samples': 26237760, 'steps': 136654, 'loss/train': 1.4106088876724243} 11/07/2021 16:30:09 - INFO - __main__ - Step 136656: {'lr': 9.963625745080029e-06, 'samples': 26237952, 'steps': 136655, 'loss/train': 1.4544777870178223} 11/07/2021 16:30:09 - INFO - __main__ - Step 136657: {'lr': 9.96214256055769e-06, 'samples': 26238144, 'steps': 136656, 'loss/train': 1.457389235496521} 11/07/2021 16:30:09 - INFO - __main__ - Step 136658: {'lr': 9.960659484192724e-06, 'samples': 26238336, 'steps': 136657, 'loss/train': 1.720950961112976} 11/07/2021 16:30:10 - INFO - __main__ - Step 136659: {'lr': 9.959176515985768e-06, 'samples': 26238528, 'steps': 136658, 'loss/train': 0.9998958110809326} 11/07/2021 16:30:10 - INFO - __main__ - Step 136660: {'lr': 9.957693655937488e-06, 'samples': 26238720, 'steps': 136659, 'loss/train': 1.4195116758346558} 11/07/2021 16:30:11 - INFO - __main__ - Step 136661: {'lr': 9.95621090404858e-06, 'samples': 26238912, 'steps': 136660, 'loss/train': 1.5097990036010742} 11/07/2021 16:30:11 - INFO - __main__ - Step 136662: {'lr': 9.954728260319679e-06, 'samples': 26239104, 'steps': 136661, 'loss/train': 0.7407291531562805} 11/07/2021 16:30:12 - INFO - __main__ - Step 136663: {'lr': 9.953245724751481e-06, 'samples': 26239296, 'steps': 136662, 'loss/train': 1.5125983953475952} 11/07/2021 16:30:12 - INFO - __main__ - Step 136664: {'lr': 9.951763297344652e-06, 'samples': 26239488, 'steps': 136663, 'loss/train': 0.8580541610717773} 11/07/2021 16:30:12 - INFO - __main__ - Step 136665: {'lr': 9.950280978099856e-06, 'samples': 26239680, 'steps': 136664, 'loss/train': 1.4127517938613892} 11/07/2021 16:30:13 - INFO - __main__ - Step 136666: {'lr': 9.948798767017763e-06, 'samples': 26239872, 'steps': 136665, 'loss/train': 1.2480417490005493} 11/07/2021 16:30:14 - INFO - __main__ - Step 136667: {'lr': 9.94731666409901e-06, 'samples': 26240064, 'steps': 136666, 'loss/train': 1.3449381589889526} 11/07/2021 16:30:14 - INFO - __main__ - Step 136668: {'lr': 9.945834669344317e-06, 'samples': 26240256, 'steps': 136667, 'loss/train': 1.1896677017211914} 11/07/2021 16:30:14 - INFO - __main__ - Step 136669: {'lr': 9.944352782754324e-06, 'samples': 26240448, 'steps': 136668, 'loss/train': 1.562712550163269} 11/07/2021 16:30:15 - INFO - __main__ - Step 136670: {'lr': 9.942871004329695e-06, 'samples': 26240640, 'steps': 136669, 'loss/train': 1.3108253479003906} 11/07/2021 16:30:16 - INFO - __main__ - Step 136671: {'lr': 9.941389334071154e-06, 'samples': 26240832, 'steps': 136670, 'loss/train': 1.1770042181015015} 11/07/2021 16:30:16 - INFO - __main__ - Step 136672: {'lr': 9.939907771979257e-06, 'samples': 26241024, 'steps': 136671, 'loss/train': 0.8002156615257263} 11/07/2021 16:30:17 - INFO - __main__ - Step 136673: {'lr': 9.93842631805475e-06, 'samples': 26241216, 'steps': 136672, 'loss/train': 1.032739281654358} 11/07/2021 16:30:17 - INFO - __main__ - Step 136674: {'lr': 9.9369449722983e-06, 'samples': 26241408, 'steps': 136673, 'loss/train': 1.2373487949371338} 11/07/2021 16:30:17 - INFO - __main__ - Step 136675: {'lr': 9.93546373471052e-06, 'samples': 26241600, 'steps': 136674, 'loss/train': 1.3824470043182373} 11/07/2021 16:30:18 - INFO - __main__ - Step 136676: {'lr': 9.933982605292157e-06, 'samples': 26241792, 'steps': 136675, 'loss/train': 1.091524600982666} 11/07/2021 16:30:19 - INFO - __main__ - Step 136677: {'lr': 9.932501584043796e-06, 'samples': 26241984, 'steps': 136676, 'loss/train': 1.124096155166626} 11/07/2021 16:30:19 - INFO - __main__ - Step 136678: {'lr': 9.931020670966185e-06, 'samples': 26242176, 'steps': 136677, 'loss/train': 1.230808138847351} 11/07/2021 16:30:19 - INFO - __main__ - Step 136679: {'lr': 9.929539866059933e-06, 'samples': 26242368, 'steps': 136678, 'loss/train': 0.9784823656082153} 11/07/2021 16:30:20 - INFO - __main__ - Step 136680: {'lr': 9.92805916932571e-06, 'samples': 26242560, 'steps': 136679, 'loss/train': 1.6368955373764038} 11/07/2021 16:30:21 - INFO - __main__ - Step 136681: {'lr': 9.926578580764234e-06, 'samples': 26242752, 'steps': 136680, 'loss/train': 0.945492684841156} 11/07/2021 16:30:21 - INFO - __main__ - Step 136682: {'lr': 9.925098100376117e-06, 'samples': 26242944, 'steps': 136681, 'loss/train': 1.1707738637924194} 11/07/2021 16:30:21 - INFO - __main__ - Step 136683: {'lr': 9.923617728162026e-06, 'samples': 26243136, 'steps': 136682, 'loss/train': 1.1022430658340454} 11/07/2021 16:30:22 - INFO - __main__ - Step 136684: {'lr': 9.92213746412271e-06, 'samples': 26243328, 'steps': 136683, 'loss/train': 1.497639775276184} 11/07/2021 16:30:22 - INFO - __main__ - Step 136685: {'lr': 9.920657308258724e-06, 'samples': 26243520, 'steps': 136684, 'loss/train': 1.3057535886764526} 11/07/2021 16:30:23 - INFO - __main__ - Step 136686: {'lr': 9.91917726057079e-06, 'samples': 26243712, 'steps': 136685, 'loss/train': 1.4090861082077026} 11/07/2021 16:30:24 - INFO - __main__ - Step 136687: {'lr': 9.917697321059599e-06, 'samples': 26243904, 'steps': 136686, 'loss/train': 1.2262195348739624} 11/07/2021 16:30:24 - INFO - __main__ - Step 136688: {'lr': 9.916217489725737e-06, 'samples': 26244096, 'steps': 136687, 'loss/train': 1.3945249319076538} 11/07/2021 16:30:24 - INFO - __main__ - Step 136689: {'lr': 9.914737766569953e-06, 'samples': 26244288, 'steps': 136688, 'loss/train': 1.3090076446533203} 11/07/2021 16:30:25 - INFO - __main__ - Step 136690: {'lr': 9.913258151592886e-06, 'samples': 26244480, 'steps': 136689, 'loss/train': 1.7253601551055908} 11/07/2021 16:30:25 - INFO - __main__ - Step 136691: {'lr': 9.9117786447952e-06, 'samples': 26244672, 'steps': 136690, 'loss/train': 0.6928254961967468} 11/07/2021 16:30:26 - INFO - __main__ - Step 136692: {'lr': 9.910299246177562e-06, 'samples': 26244864, 'steps': 136691, 'loss/train': 1.30897855758667} 11/07/2021 16:30:26 - INFO - __main__ - Step 136693: {'lr': 9.908819955740611e-06, 'samples': 26245056, 'steps': 136692, 'loss/train': 1.528348445892334} 11/07/2021 16:30:27 - INFO - __main__ - Step 136694: {'lr': 9.907340773485068e-06, 'samples': 26245248, 'steps': 136693, 'loss/train': 1.681390404701233} 11/07/2021 16:30:27 - INFO - __main__ - Step 136695: {'lr': 9.905861699411572e-06, 'samples': 26245440, 'steps': 136694, 'loss/train': 1.670967698097229} 11/07/2021 16:30:27 - INFO - __main__ - Step 136696: {'lr': 9.904382733520788e-06, 'samples': 26245632, 'steps': 136695, 'loss/train': 1.088552474975586} 11/07/2021 16:30:29 - INFO - __main__ - Step 136697: {'lr': 9.902903875813385e-06, 'samples': 26245824, 'steps': 136696, 'loss/train': 1.481606125831604} 11/07/2021 16:30:29 - INFO - __main__ - Step 136698: {'lr': 9.901425126290054e-06, 'samples': 26246016, 'steps': 136697, 'loss/train': 1.6454312801361084} 11/07/2021 16:30:29 - INFO - __main__ - Step 136699: {'lr': 9.899946484951405e-06, 'samples': 26246208, 'steps': 136698, 'loss/train': 1.1285231113433838} 11/07/2021 16:30:30 - INFO - __main__ - Step 136700: {'lr': 9.898467951798134e-06, 'samples': 26246400, 'steps': 136699, 'loss/train': 1.2758828401565552} 11/07/2021 16:30:30 - INFO - __main__ - Step 136701: {'lr': 9.896989526830907e-06, 'samples': 26246592, 'steps': 136700, 'loss/train': 1.5511292219161987} 11/07/2021 16:30:31 - INFO - __main__ - Step 136702: {'lr': 9.89551121005039e-06, 'samples': 26246784, 'steps': 136701, 'loss/train': 1.8007019758224487} 11/07/2021 16:30:31 - INFO - __main__ - Step 136703: {'lr': 9.894033001457275e-06, 'samples': 26246976, 'steps': 136702, 'loss/train': 1.6566308736801147} 11/07/2021 16:30:32 - INFO - __main__ - Step 136704: {'lr': 9.892554901052175e-06, 'samples': 26247168, 'steps': 136703, 'loss/train': 1.6983996629714966} 11/07/2021 16:30:32 - INFO - __main__ - Step 136705: {'lr': 9.891076908835783e-06, 'samples': 26247360, 'steps': 136704, 'loss/train': 1.2476078271865845} 11/07/2021 16:30:32 - INFO - __main__ - Step 136706: {'lr': 9.889599024808794e-06, 'samples': 26247552, 'steps': 136705, 'loss/train': 1.5347900390625} 11/07/2021 16:30:33 - INFO - __main__ - Step 136707: {'lr': 9.888121248971815e-06, 'samples': 26247744, 'steps': 136706, 'loss/train': 1.4060145616531372} 11/07/2021 16:30:34 - INFO - __main__ - Step 136708: {'lr': 9.886643581325571e-06, 'samples': 26247936, 'steps': 136707, 'loss/train': 1.2323691844940186} 11/07/2021 16:30:34 - INFO - __main__ - Step 136709: {'lr': 9.885166021870728e-06, 'samples': 26248128, 'steps': 136708, 'loss/train': 1.1949622631072998} 11/07/2021 16:30:34 - INFO - __main__ - Step 136710: {'lr': 9.883688570607869e-06, 'samples': 26248320, 'steps': 136709, 'loss/train': 1.1018024682998657} 11/07/2021 16:30:35 - INFO - __main__ - Step 136711: {'lr': 9.88221122753774e-06, 'samples': 26248512, 'steps': 136710, 'loss/train': 1.3455414772033691} 11/07/2021 16:30:36 - INFO - __main__ - Step 136712: {'lr': 9.880733992660956e-06, 'samples': 26248704, 'steps': 136711, 'loss/train': 1.6362942457199097} 11/07/2021 16:30:36 - INFO - __main__ - Step 136713: {'lr': 9.879256865978237e-06, 'samples': 26248896, 'steps': 136712, 'loss/train': 1.5034321546554565} 11/07/2021 16:30:37 - INFO - __main__ - Step 136714: {'lr': 9.877779847490192e-06, 'samples': 26249088, 'steps': 136713, 'loss/train': 1.3308085203170776} 11/07/2021 16:30:37 - INFO - __main__ - Step 136715: {'lr': 9.876302937197546e-06, 'samples': 26249280, 'steps': 136714, 'loss/train': 1.1961108446121216} 11/07/2021 16:30:37 - INFO - __main__ - Step 136716: {'lr': 9.874826135100907e-06, 'samples': 26249472, 'steps': 136715, 'loss/train': 1.252340316772461} 11/07/2021 16:30:38 - INFO - __main__ - Step 136717: {'lr': 9.873349441200968e-06, 'samples': 26249664, 'steps': 136716, 'loss/train': 0.9315025210380554} 11/07/2021 16:30:39 - INFO - __main__ - Step 136718: {'lr': 9.871872855498399e-06, 'samples': 26249856, 'steps': 136717, 'loss/train': 1.3535492420196533} 11/07/2021 16:30:39 - INFO - __main__ - Step 136719: {'lr': 9.870396377993862e-06, 'samples': 26250048, 'steps': 136718, 'loss/train': 1.1214232444763184} 11/07/2021 16:30:40 - INFO - __main__ - Step 136720: {'lr': 9.868920008688054e-06, 'samples': 26250240, 'steps': 136719, 'loss/train': 0.9693838357925415} 11/07/2021 16:30:40 - INFO - __main__ - Step 136721: {'lr': 9.867443747581556e-06, 'samples': 26250432, 'steps': 136720, 'loss/train': 0.07684940844774246} 11/07/2021 16:30:40 - INFO - __main__ - Step 136722: {'lr': 9.865967594675091e-06, 'samples': 26250624, 'steps': 136721, 'loss/train': 1.6184824705123901} 11/07/2021 16:30:41 - INFO - __main__ - Step 136723: {'lr': 9.864491549969323e-06, 'samples': 26250816, 'steps': 136722, 'loss/train': 0.4997015595436096} 11/07/2021 16:30:42 - INFO - __main__ - Step 136724: {'lr': 9.863015613464892e-06, 'samples': 26251008, 'steps': 136723, 'loss/train': 1.3526430130004883} 11/07/2021 16:30:42 - INFO - __main__ - Step 136725: {'lr': 9.861539785162493e-06, 'samples': 26251200, 'steps': 136724, 'loss/train': 1.4252649545669556} 11/07/2021 16:30:42 - INFO - __main__ - Step 136726: {'lr': 9.86006406506279e-06, 'samples': 26251392, 'steps': 136725, 'loss/train': 1.4303839206695557} 11/07/2021 16:30:43 - INFO - __main__ - Step 136727: {'lr': 9.858588453166423e-06, 'samples': 26251584, 'steps': 136726, 'loss/train': 1.1653170585632324} 11/07/2021 16:30:44 - INFO - __main__ - Step 136728: {'lr': 9.857112949474056e-06, 'samples': 26251776, 'steps': 136727, 'loss/train': 1.749674916267395} 11/07/2021 16:30:44 - INFO - __main__ - Step 136729: {'lr': 9.855637553986385e-06, 'samples': 26251968, 'steps': 136728, 'loss/train': 1.4418542385101318} 11/07/2021 16:30:45 - INFO - __main__ - Step 136730: {'lr': 9.854162266704047e-06, 'samples': 26252160, 'steps': 136729, 'loss/train': 1.1952258348464966} 11/07/2021 16:30:45 - INFO - __main__ - Step 136731: {'lr': 9.852687087627765e-06, 'samples': 26252352, 'steps': 136730, 'loss/train': 1.700740098953247} 11/07/2021 16:30:46 - INFO - __main__ - Step 136732: {'lr': 9.851212016758121e-06, 'samples': 26252544, 'steps': 136731, 'loss/train': 1.8654659986495972} 11/07/2021 16:30:47 - INFO - __main__ - Step 136733: {'lr': 9.84973705409581e-06, 'samples': 26252736, 'steps': 136732, 'loss/train': 1.4443672895431519} 11/07/2021 16:30:47 - INFO - __main__ - Step 136734: {'lr': 9.848262199641522e-06, 'samples': 26252928, 'steps': 136733, 'loss/train': 1.0783538818359375} 11/07/2021 16:30:48 - INFO - __main__ - Step 136735: {'lr': 9.846787453395872e-06, 'samples': 26253120, 'steps': 136734, 'loss/train': 2.170011281967163} 11/07/2021 16:30:48 - INFO - __main__ - Step 136736: {'lr': 9.845312815359553e-06, 'samples': 26253312, 'steps': 136735, 'loss/train': 1.7371324300765991} 11/07/2021 16:30:48 - INFO - __main__ - Step 136737: {'lr': 9.843838285533258e-06, 'samples': 26253504, 'steps': 136736, 'loss/train': 1.6691244840621948} 11/07/2021 16:30:49 - INFO - __main__ - Step 136738: {'lr': 9.842363863917597e-06, 'samples': 26253696, 'steps': 136737, 'loss/train': 1.2596886157989502} 11/07/2021 16:30:49 - INFO - __main__ - Step 136739: {'lr': 9.840889550513293e-06, 'samples': 26253888, 'steps': 136738, 'loss/train': 1.2428287267684937} 11/07/2021 16:30:50 - INFO - __main__ - Step 136740: {'lr': 9.839415345320957e-06, 'samples': 26254080, 'steps': 136739, 'loss/train': 0.5658249855041504} 11/07/2021 16:30:50 - INFO - __main__ - Step 136741: {'lr': 9.837941248341253e-06, 'samples': 26254272, 'steps': 136740, 'loss/train': 1.5904277563095093} 11/07/2021 16:30:51 - INFO - __main__ - Step 136742: {'lr': 9.836467259574933e-06, 'samples': 26254464, 'steps': 136741, 'loss/train': 1.1407049894332886} 11/07/2021 16:30:51 - INFO - __main__ - Step 136743: {'lr': 9.83499337902255e-06, 'samples': 26254656, 'steps': 136742, 'loss/train': 1.0089002847671509} 11/07/2021 16:30:52 - INFO - __main__ - Step 136744: {'lr': 9.833519606684826e-06, 'samples': 26254848, 'steps': 136743, 'loss/train': 1.0931721925735474} 11/07/2021 16:30:53 - INFO - __main__ - Step 136745: {'lr': 9.8320459425624e-06, 'samples': 26255040, 'steps': 136744, 'loss/train': 1.4560623168945312} 11/07/2021 16:30:53 - INFO - __main__ - Step 136746: {'lr': 9.83057238665594e-06, 'samples': 26255232, 'steps': 136745, 'loss/train': 1.4248830080032349} 11/07/2021 16:30:53 - INFO - __main__ - Step 136747: {'lr': 9.829098938966136e-06, 'samples': 26255424, 'steps': 136746, 'loss/train': 1.1803332567214966} 11/07/2021 16:30:54 - INFO - __main__ - Step 136748: {'lr': 9.8276255994936e-06, 'samples': 26255616, 'steps': 136747, 'loss/train': 1.4835562705993652} 11/07/2021 16:30:54 - INFO - __main__ - Step 136749: {'lr': 9.826152368239056e-06, 'samples': 26255808, 'steps': 136748, 'loss/train': 1.3291789293289185} 11/07/2021 16:30:55 - INFO - __main__ - Step 136750: {'lr': 9.824679245203138e-06, 'samples': 26256000, 'steps': 136749, 'loss/train': 1.2091561555862427} 11/07/2021 16:30:55 - INFO - __main__ - Step 136751: {'lr': 9.823206230386517e-06, 'samples': 26256192, 'steps': 136750, 'loss/train': 1.3117533922195435} 11/07/2021 16:30:56 - INFO - __main__ - Step 136752: {'lr': 9.821733323789855e-06, 'samples': 26256384, 'steps': 136751, 'loss/train': 1.4890888929367065} 11/07/2021 16:30:56 - INFO - __main__ - Step 136753: {'lr': 9.820260525413848e-06, 'samples': 26256576, 'steps': 136752, 'loss/train': 1.2364836931228638} 11/07/2021 16:30:56 - INFO - __main__ - Step 136754: {'lr': 9.81878783525908e-06, 'samples': 26256768, 'steps': 136753, 'loss/train': 1.4053642749786377} 11/07/2021 16:30:57 - INFO - __main__ - Step 136755: {'lr': 9.81731525332627e-06, 'samples': 26256960, 'steps': 136754, 'loss/train': 1.1055856943130493} 11/07/2021 16:30:58 - INFO - __main__ - Step 136756: {'lr': 9.815842779616086e-06, 'samples': 26257152, 'steps': 136755, 'loss/train': 1.006015419960022} 11/07/2021 16:30:58 - INFO - __main__ - Step 136757: {'lr': 9.814370414129136e-06, 'samples': 26257344, 'steps': 136756, 'loss/train': 1.3089299201965332} 11/07/2021 16:30:58 - INFO - __main__ - Step 136758: {'lr': 9.812898156866174e-06, 'samples': 26257536, 'steps': 136757, 'loss/train': 1.271177887916565} 11/07/2021 16:30:59 - INFO - __main__ - Step 136759: {'lr': 9.811426007827779e-06, 'samples': 26257728, 'steps': 136758, 'loss/train': 1.3463815450668335} 11/07/2021 16:30:59 - INFO - __main__ - Step 136760: {'lr': 9.809953967014645e-06, 'samples': 26257920, 'steps': 136759, 'loss/train': 1.0407510995864868} 11/07/2021 16:31:00 - INFO - __main__ - Step 136761: {'lr': 9.808482034427468e-06, 'samples': 26258112, 'steps': 136760, 'loss/train': 1.4747902154922485} 11/07/2021 16:31:01 - INFO - __main__ - Step 136762: {'lr': 9.807010210066858e-06, 'samples': 26258304, 'steps': 136761, 'loss/train': 1.5003702640533447} 11/07/2021 16:31:01 - INFO - __main__ - Step 136763: {'lr': 9.805538493933508e-06, 'samples': 26258496, 'steps': 136762, 'loss/train': 1.3847085237503052} 11/07/2021 16:31:01 - INFO - __main__ - Step 136764: {'lr': 9.804066886028084e-06, 'samples': 26258688, 'steps': 136763, 'loss/train': 1.2065004110336304} 11/07/2021 16:31:02 - INFO - __main__ - Step 136765: {'lr': 9.802595386351281e-06, 'samples': 26258880, 'steps': 136764, 'loss/train': 0.8497106432914734} 11/07/2021 16:31:03 - INFO - __main__ - Step 136766: {'lr': 9.801123994903654e-06, 'samples': 26259072, 'steps': 136765, 'loss/train': 1.2188955545425415} 11/07/2021 16:31:03 - INFO - __main__ - Step 136767: {'lr': 9.79965271168598e-06, 'samples': 26259264, 'steps': 136766, 'loss/train': 1.4381248950958252} 11/07/2021 16:31:03 - INFO - __main__ - Step 136768: {'lr': 9.79818153669884e-06, 'samples': 26259456, 'steps': 136767, 'loss/train': 0.4342951774597168} 11/07/2021 16:31:04 - INFO - __main__ - Step 136769: {'lr': 9.79671046994296e-06, 'samples': 26259648, 'steps': 136768, 'loss/train': 1.790955662727356} 11/07/2021 16:31:04 - INFO - __main__ - Step 136770: {'lr': 9.795239511418946e-06, 'samples': 26259840, 'steps': 136769, 'loss/train': 1.36793053150177} 11/07/2021 16:31:05 - INFO - __main__ - Step 136771: {'lr': 9.793768661127522e-06, 'samples': 26260032, 'steps': 136770, 'loss/train': 1.3637733459472656} 11/07/2021 16:31:06 - INFO - __main__ - Step 136772: {'lr': 9.7922979190693e-06, 'samples': 26260224, 'steps': 136771, 'loss/train': 1.2428901195526123} 11/07/2021 16:31:06 - INFO - __main__ - Step 136773: {'lr': 9.790827285244969e-06, 'samples': 26260416, 'steps': 136772, 'loss/train': 1.2810673713684082} 11/07/2021 16:31:06 - INFO - __main__ - Step 136774: {'lr': 9.789356759655171e-06, 'samples': 26260608, 'steps': 136773, 'loss/train': 0.9503474831581116} 11/07/2021 16:31:07 - INFO - __main__ - Step 136775: {'lr': 9.7878863423006e-06, 'samples': 26260800, 'steps': 136774, 'loss/train': 1.412257194519043} 11/07/2021 16:31:08 - INFO - __main__ - Step 136776: {'lr': 9.786416033181894e-06, 'samples': 26260992, 'steps': 136775, 'loss/train': 1.4615874290466309} 11/07/2021 16:31:08 - INFO - __main__ - Step 136777: {'lr': 9.784945832299719e-06, 'samples': 26261184, 'steps': 136776, 'loss/train': 1.1897494792938232} 11/07/2021 16:31:08 - INFO - __main__ - Step 136778: {'lr': 9.78347573965474e-06, 'samples': 26261376, 'steps': 136777, 'loss/train': 5.675485134124756} 11/07/2021 16:31:09 - INFO - __main__ - Step 136779: {'lr': 9.782005755247653e-06, 'samples': 26261568, 'steps': 136778, 'loss/train': 1.0732187032699585} 11/07/2021 16:31:09 - INFO - __main__ - Step 136780: {'lr': 9.780535879079066e-06, 'samples': 26261760, 'steps': 136779, 'loss/train': 0.9500952363014221} 11/07/2021 16:31:09 - INFO - __main__ - Step 136781: {'lr': 9.779066111149648e-06, 'samples': 26261952, 'steps': 136780, 'loss/train': 1.2651824951171875} 11/07/2021 16:31:11 - INFO - __main__ - Step 136782: {'lr': 9.77759645146009e-06, 'samples': 26262144, 'steps': 136781, 'loss/train': 1.7194421291351318} 11/07/2021 16:31:11 - INFO - __main__ - Step 136783: {'lr': 9.776126900011034e-06, 'samples': 26262336, 'steps': 136782, 'loss/train': 1.3669896125793457} 11/07/2021 16:31:11 - INFO - __main__ - Step 136784: {'lr': 9.774657456803143e-06, 'samples': 26262528, 'steps': 136783, 'loss/train': 1.2286478281021118} 11/07/2021 16:31:12 - INFO - __main__ - Step 136785: {'lr': 9.773188121837084e-06, 'samples': 26262720, 'steps': 136784, 'loss/train': 1.698364496231079} 11/07/2021 16:31:12 - INFO - __main__ - Step 136786: {'lr': 9.771718895113523e-06, 'samples': 26262912, 'steps': 136785, 'loss/train': 0.7351346015930176} 11/07/2021 16:31:13 - INFO - __main__ - Step 136787: {'lr': 9.770249776633128e-06, 'samples': 26263104, 'steps': 136786, 'loss/train': 1.262783169746399} 11/07/2021 16:31:13 - INFO - __main__ - Step 136788: {'lr': 9.768780766396535e-06, 'samples': 26263296, 'steps': 136787, 'loss/train': 1.5497233867645264} 11/07/2021 16:31:14 - INFO - __main__ - Step 136789: {'lr': 9.767311864404438e-06, 'samples': 26263488, 'steps': 136788, 'loss/train': 0.716904878616333} 11/07/2021 16:31:14 - INFO - __main__ - Step 136790: {'lr': 9.765843070657476e-06, 'samples': 26263680, 'steps': 136789, 'loss/train': 1.298353910446167} 11/07/2021 16:31:14 - INFO - __main__ - Step 136791: {'lr': 9.764374385156316e-06, 'samples': 26263872, 'steps': 136790, 'loss/train': 1.3235499858856201} 11/07/2021 16:31:16 - INFO - __main__ - Step 136792: {'lr': 9.762905807901651e-06, 'samples': 26264064, 'steps': 136791, 'loss/train': 1.3532673120498657} 11/07/2021 16:31:16 - INFO - __main__ - Step 136793: {'lr': 9.761437338894092e-06, 'samples': 26264256, 'steps': 136792, 'loss/train': 1.526702880859375} 11/07/2021 16:31:16 - INFO - __main__ - Step 136794: {'lr': 9.759968978134303e-06, 'samples': 26264448, 'steps': 136793, 'loss/train': 1.8891229629516602} 11/07/2021 16:31:17 - INFO - __main__ - Step 136795: {'lr': 9.758500725622982e-06, 'samples': 26264640, 'steps': 136794, 'loss/train': 1.0499242544174194} 11/07/2021 16:31:17 - INFO - __main__ - Step 136796: {'lr': 9.757032581360791e-06, 'samples': 26264832, 'steps': 136795, 'loss/train': 1.4141099452972412} 11/07/2021 16:31:17 - INFO - __main__ - Step 136797: {'lr': 9.755564545348345e-06, 'samples': 26265024, 'steps': 136796, 'loss/train': 1.0130772590637207} 11/07/2021 16:31:18 - INFO - __main__ - Step 136798: {'lr': 9.754096617586333e-06, 'samples': 26265216, 'steps': 136797, 'loss/train': 1.7246602773666382} 11/07/2021 16:31:19 - INFO - __main__ - Step 136799: {'lr': 9.752628798075424e-06, 'samples': 26265408, 'steps': 136798, 'loss/train': 1.43467116355896} 11/07/2021 16:31:19 - INFO - __main__ - Step 136800: {'lr': 9.751161086816285e-06, 'samples': 26265600, 'steps': 136799, 'loss/train': 1.1780354976654053} 11/07/2021 16:31:20 - INFO - __main__ - Step 136801: {'lr': 9.749693483809551e-06, 'samples': 26265792, 'steps': 136800, 'loss/train': 1.0906379222869873} 11/07/2021 16:31:20 - INFO - __main__ - Step 136802: {'lr': 9.74822598905592e-06, 'samples': 26265984, 'steps': 136801, 'loss/train': 1.120574712753296} 11/07/2021 16:31:20 - INFO - __main__ - Step 136803: {'lr': 9.746758602556e-06, 'samples': 26266176, 'steps': 136802, 'loss/train': 1.198532223701477} 11/07/2021 16:31:21 - INFO - __main__ - Step 136804: {'lr': 9.745291324310513e-06, 'samples': 26266368, 'steps': 136803, 'loss/train': 1.5310555696487427} 11/07/2021 16:31:22 - INFO - __main__ - Step 136805: {'lr': 9.743824154320096e-06, 'samples': 26266560, 'steps': 136804, 'loss/train': 1.4094313383102417} 11/07/2021 16:31:22 - INFO - __main__ - Step 136806: {'lr': 9.742357092585391e-06, 'samples': 26266752, 'steps': 136805, 'loss/train': 0.5036410689353943} 11/07/2021 16:31:23 - INFO - __main__ - Step 136807: {'lr': 9.740890139107062e-06, 'samples': 26266944, 'steps': 136806, 'loss/train': 1.9319610595703125} 11/07/2021 16:31:23 - INFO - __main__ - Step 136808: {'lr': 9.739423293885774e-06, 'samples': 26267136, 'steps': 136807, 'loss/train': 1.4170857667922974} 11/07/2021 16:31:24 - INFO - __main__ - Step 136809: {'lr': 9.737956556922223e-06, 'samples': 26267328, 'steps': 136808, 'loss/train': 1.269345998764038} 11/07/2021 16:31:24 - INFO - __main__ - Step 136810: {'lr': 9.736489928217019e-06, 'samples': 26267520, 'steps': 136809, 'loss/train': 1.07863450050354} 11/07/2021 16:31:25 - INFO - __main__ - Step 136811: {'lr': 9.735023407770826e-06, 'samples': 26267712, 'steps': 136810, 'loss/train': 1.267351746559143} 11/07/2021 16:31:25 - INFO - __main__ - Step 136812: {'lr': 9.73355699558437e-06, 'samples': 26267904, 'steps': 136811, 'loss/train': 1.1027125120162964} 11/07/2021 16:31:25 - INFO - __main__ - Step 136813: {'lr': 9.73209069165823e-06, 'samples': 26268096, 'steps': 136812, 'loss/train': 1.579569935798645} 11/07/2021 16:31:26 - INFO - __main__ - Step 136814: {'lr': 9.73062449599313e-06, 'samples': 26268288, 'steps': 136813, 'loss/train': 1.1278822422027588} 11/07/2021 16:31:27 - INFO - __main__ - Step 136815: {'lr': 9.729158408589678e-06, 'samples': 26268480, 'steps': 136814, 'loss/train': 1.729017734527588} 11/07/2021 16:31:27 - INFO - __main__ - Step 136816: {'lr': 9.72769242944857e-06, 'samples': 26268672, 'steps': 136815, 'loss/train': 1.59040105342865} 11/07/2021 16:31:27 - INFO - __main__ - Step 136817: {'lr': 9.726226558570444e-06, 'samples': 26268864, 'steps': 136816, 'loss/train': 0.9975262880325317} 11/07/2021 16:31:28 - INFO - __main__ - Step 136818: {'lr': 9.724760795955994e-06, 'samples': 26269056, 'steps': 136817, 'loss/train': 0.9633104801177979} 11/07/2021 16:31:28 - INFO - __main__ - Step 136819: {'lr': 9.723295141605888e-06, 'samples': 26269248, 'steps': 136818, 'loss/train': 1.7224005460739136} 11/07/2021 16:31:29 - INFO - __main__ - Step 136820: {'lr': 9.721829595520703e-06, 'samples': 26269440, 'steps': 136819, 'loss/train': 1.1695454120635986} 11/07/2021 16:31:30 - INFO - __main__ - Step 136821: {'lr': 9.720364157701168e-06, 'samples': 26269632, 'steps': 136820, 'loss/train': 1.5189919471740723} 11/07/2021 16:31:30 - INFO - __main__ - Step 136822: {'lr': 9.718898828147943e-06, 'samples': 26269824, 'steps': 136821, 'loss/train': 1.4661649465560913} 11/07/2021 16:31:30 - INFO - __main__ - Step 136823: {'lr': 9.717433606861642e-06, 'samples': 26270016, 'steps': 136822, 'loss/train': 0.558757483959198} 11/07/2021 16:31:31 - INFO - __main__ - Step 136824: {'lr': 9.715968493842986e-06, 'samples': 26270208, 'steps': 136823, 'loss/train': 1.226559042930603} 11/07/2021 16:31:32 - INFO - __main__ - Step 136825: {'lr': 9.714503489092614e-06, 'samples': 26270400, 'steps': 136824, 'loss/train': 1.61769437789917} 11/07/2021 16:31:32 - INFO - __main__ - Step 136826: {'lr': 9.713038592611162e-06, 'samples': 26270592, 'steps': 136825, 'loss/train': 1.022381067276001} 11/07/2021 16:31:32 - INFO - __main__ - Step 136827: {'lr': 9.711573804399299e-06, 'samples': 26270784, 'steps': 136826, 'loss/train': 1.4473018646240234} 11/07/2021 16:31:33 - INFO - __main__ - Step 136828: {'lr': 9.710109124457717e-06, 'samples': 26270976, 'steps': 136827, 'loss/train': 1.2332651615142822} 11/07/2021 16:31:33 - INFO - __main__ - Step 136829: {'lr': 9.708644552787028e-06, 'samples': 26271168, 'steps': 136828, 'loss/train': 1.3214795589447021} 11/07/2021 16:31:34 - INFO - __main__ - Step 136830: {'lr': 9.707180089387923e-06, 'samples': 26271360, 'steps': 136829, 'loss/train': 1.7917187213897705} 11/07/2021 16:31:35 - INFO - __main__ - Step 136831: {'lr': 9.705715734261044e-06, 'samples': 26271552, 'steps': 136830, 'loss/train': 0.9986461400985718} 11/07/2021 16:31:35 - INFO - __main__ - Step 136832: {'lr': 9.704251487407112e-06, 'samples': 26271744, 'steps': 136831, 'loss/train': 1.1314239501953125} 11/07/2021 16:31:35 - INFO - __main__ - Step 136833: {'lr': 9.702787348826708e-06, 'samples': 26271936, 'steps': 136832, 'loss/train': 1.3734630346298218} 11/07/2021 16:31:36 - INFO - __main__ - Step 136834: {'lr': 9.7013233185205e-06, 'samples': 26272128, 'steps': 136833, 'loss/train': 1.3490132093429565} 11/07/2021 16:31:37 - INFO - __main__ - Step 136835: {'lr': 9.69985939648918e-06, 'samples': 26272320, 'steps': 136834, 'loss/train': 1.3078486919403076} 11/07/2021 16:31:37 - INFO - __main__ - Step 136836: {'lr': 9.698395582733389e-06, 'samples': 26272512, 'steps': 136835, 'loss/train': 1.4576725959777832} 11/07/2021 16:31:37 - INFO - __main__ - Step 136837: {'lr': 9.696931877253817e-06, 'samples': 26272704, 'steps': 136836, 'loss/train': 1.3840495347976685} 11/07/2021 16:31:38 - INFO - __main__ - Step 136838: {'lr': 9.695468280051078e-06, 'samples': 26272896, 'steps': 136837, 'loss/train': 1.4029489755630493} 11/07/2021 16:31:38 - INFO - __main__ - Step 136839: {'lr': 9.694004791125839e-06, 'samples': 26273088, 'steps': 136838, 'loss/train': 1.2790635824203491} 11/07/2021 16:31:39 - INFO - __main__ - Step 136840: {'lr': 9.69254141047879e-06, 'samples': 26273280, 'steps': 136839, 'loss/train': 1.866371750831604} 11/07/2021 16:31:39 - INFO - __main__ - Step 136841: {'lr': 9.691078138110571e-06, 'samples': 26273472, 'steps': 136840, 'loss/train': 1.4488166570663452} 11/07/2021 16:31:40 - INFO - __main__ - Step 136842: {'lr': 9.689614974021849e-06, 'samples': 26273664, 'steps': 136841, 'loss/train': 1.7467784881591797} 11/07/2021 16:31:40 - INFO - __main__ - Step 136843: {'lr': 9.688151918213289e-06, 'samples': 26273856, 'steps': 136842, 'loss/train': 1.3530346155166626} 11/07/2021 16:31:40 - INFO - __main__ - Step 136844: {'lr': 9.686688970685502e-06, 'samples': 26274048, 'steps': 136843, 'loss/train': 1.3299099206924438} 11/07/2021 16:31:42 - INFO - __main__ - Step 136845: {'lr': 9.685226131439211e-06, 'samples': 26274240, 'steps': 136844, 'loss/train': 1.4099892377853394} 11/07/2021 16:31:42 - INFO - __main__ - Step 136846: {'lr': 9.683763400475081e-06, 'samples': 26274432, 'steps': 136845, 'loss/train': 1.1584099531173706} 11/07/2021 16:31:42 - INFO - __main__ - Step 136847: {'lr': 9.682300777793723e-06, 'samples': 26274624, 'steps': 136846, 'loss/train': 1.1899470090866089} 11/07/2021 16:31:43 - INFO - __main__ - Step 136848: {'lr': 9.680838263395775e-06, 'samples': 26274816, 'steps': 136847, 'loss/train': 0.9583634734153748} 11/07/2021 16:31:43 - INFO - __main__ - Step 136849: {'lr': 9.679375857281959e-06, 'samples': 26275008, 'steps': 136848, 'loss/train': 1.7351144552230835} 11/07/2021 16:31:43 - INFO - __main__ - Step 136850: {'lr': 9.677913559452912e-06, 'samples': 26275200, 'steps': 136849, 'loss/train': 1.3135417699813843} 11/07/2021 16:31:44 - INFO - __main__ - Step 136851: {'lr': 9.676451369909273e-06, 'samples': 26275392, 'steps': 136850, 'loss/train': 1.6418529748916626} 11/07/2021 16:31:45 - INFO - __main__ - Step 136852: {'lr': 9.67498928865171e-06, 'samples': 26275584, 'steps': 136851, 'loss/train': 1.4310482740402222} 11/07/2021 16:31:45 - INFO - __main__ - Step 136853: {'lr': 9.673527315680886e-06, 'samples': 26275776, 'steps': 136852, 'loss/train': 1.1718146800994873} 11/07/2021 16:31:45 - INFO - __main__ - Step 136854: {'lr': 9.672065450997496e-06, 'samples': 26275968, 'steps': 136853, 'loss/train': 0.8688152432441711} 11/07/2021 16:31:46 - INFO - __main__ - Step 136855: {'lr': 9.670603694602126e-06, 'samples': 26276160, 'steps': 136854, 'loss/train': 1.1399677991867065} 11/07/2021 16:31:47 - INFO - __main__ - Step 136856: {'lr': 9.669142046495494e-06, 'samples': 26276352, 'steps': 136855, 'loss/train': 1.0738645792007446} 11/07/2021 16:31:47 - INFO - __main__ - Step 136857: {'lr': 9.667680506678239e-06, 'samples': 26276544, 'steps': 136856, 'loss/train': 1.546407699584961} 11/07/2021 16:31:47 - INFO - __main__ - Step 136858: {'lr': 9.666219075151027e-06, 'samples': 26276736, 'steps': 136857, 'loss/train': 1.0842281579971313} 11/07/2021 16:31:48 - INFO - __main__ - Step 136859: {'lr': 9.664757751914527e-06, 'samples': 26276928, 'steps': 136858, 'loss/train': 1.4551186561584473} 11/07/2021 16:31:48 - INFO - __main__ - Step 136860: {'lr': 9.663296536969346e-06, 'samples': 26277120, 'steps': 136859, 'loss/train': 1.0968949794769287} 11/07/2021 16:31:49 - INFO - __main__ - Step 136861: {'lr': 9.661835430316151e-06, 'samples': 26277312, 'steps': 136860, 'loss/train': 1.9637064933776855} 11/07/2021 16:31:50 - INFO - __main__ - Step 136862: {'lr': 9.660374431955665e-06, 'samples': 26277504, 'steps': 136861, 'loss/train': 0.8730409741401672} 11/07/2021 16:31:50 - INFO - __main__ - Step 136863: {'lr': 9.658913541888498e-06, 'samples': 26277696, 'steps': 136862, 'loss/train': 1.2977005243301392} 11/07/2021 16:31:50 - INFO - __main__ - Step 136864: {'lr': 9.657452760115287e-06, 'samples': 26277888, 'steps': 136863, 'loss/train': 1.286852478981018} 11/07/2021 16:31:51 - INFO - __main__ - Step 136865: {'lr': 9.655992086636756e-06, 'samples': 26278080, 'steps': 136864, 'loss/train': 1.5441771745681763} 11/07/2021 16:31:51 - INFO - __main__ - Step 136866: {'lr': 9.654531521453513e-06, 'samples': 26278272, 'steps': 136865, 'loss/train': 0.8710697889328003} 11/07/2021 16:31:52 - INFO - __main__ - Step 136867: {'lr': 9.653071064566226e-06, 'samples': 26278464, 'steps': 136866, 'loss/train': 0.7388492226600647} 11/07/2021 16:31:53 - INFO - __main__ - Step 136868: {'lr': 9.651610715975561e-06, 'samples': 26278656, 'steps': 136867, 'loss/train': 1.322670578956604} 11/07/2021 16:31:53 - INFO - __main__ - Step 136869: {'lr': 9.650150475682158e-06, 'samples': 26278848, 'steps': 136868, 'loss/train': 0.06400944292545319} 11/07/2021 16:31:53 - INFO - __main__ - Step 136870: {'lr': 9.648690343686706e-06, 'samples': 26279040, 'steps': 136869, 'loss/train': 1.22221839427948} 11/07/2021 16:31:54 - INFO - __main__ - Step 136871: {'lr': 9.64723031998982e-06, 'samples': 26279232, 'steps': 136870, 'loss/train': 1.560205101966858} 11/07/2021 16:31:55 - INFO - __main__ - Step 136872: {'lr': 9.645770404592219e-06, 'samples': 26279424, 'steps': 136871, 'loss/train': 1.396913766860962} 11/07/2021 16:31:55 - INFO - __main__ - Step 136873: {'lr': 9.644310597494516e-06, 'samples': 26279616, 'steps': 136872, 'loss/train': 1.3331217765808105} 11/07/2021 16:31:56 - INFO - __main__ - Step 136874: {'lr': 9.642850898697375e-06, 'samples': 26279808, 'steps': 136873, 'loss/train': 0.9746885299682617} 11/07/2021 16:31:56 - INFO - __main__ - Step 136875: {'lr': 9.641391308201463e-06, 'samples': 26280000, 'steps': 136874, 'loss/train': 0.7178313732147217} 11/07/2021 16:31:56 - INFO - __main__ - Step 136876: {'lr': 9.63993182600742e-06, 'samples': 26280192, 'steps': 136875, 'loss/train': 1.2191112041473389} 11/07/2021 16:31:57 - INFO - __main__ - Step 136877: {'lr': 9.63847245211591e-06, 'samples': 26280384, 'steps': 136876, 'loss/train': 1.2009665966033936} 11/07/2021 16:31:58 - INFO - __main__ - Step 136878: {'lr': 9.637013186527599e-06, 'samples': 26280576, 'steps': 136877, 'loss/train': 1.3844342231750488} 11/07/2021 16:31:58 - INFO - __main__ - Step 136879: {'lr': 9.635554029243127e-06, 'samples': 26280768, 'steps': 136878, 'loss/train': 1.8162181377410889} 11/07/2021 16:31:58 - INFO - __main__ - Step 136880: {'lr': 9.634094980263186e-06, 'samples': 26280960, 'steps': 136879, 'loss/train': 1.0036368370056152} 11/07/2021 16:31:59 - INFO - __main__ - Step 136881: {'lr': 9.632636039588389e-06, 'samples': 26281152, 'steps': 136880, 'loss/train': 1.3306070566177368} 11/07/2021 16:31:59 - INFO - __main__ - Step 136882: {'lr': 9.631177207219427e-06, 'samples': 26281344, 'steps': 136881, 'loss/train': 1.1011008024215698} 11/07/2021 16:32:00 - INFO - __main__ - Step 136883: {'lr': 9.629718483156968e-06, 'samples': 26281536, 'steps': 136882, 'loss/train': 1.4186620712280273} 11/07/2021 16:32:01 - INFO - __main__ - Step 136884: {'lr': 9.628259867401623e-06, 'samples': 26281728, 'steps': 136883, 'loss/train': 0.5784137845039368} 11/07/2021 16:32:01 - INFO - __main__ - Step 136885: {'lr': 9.626801359954085e-06, 'samples': 26281920, 'steps': 136884, 'loss/train': 1.2201416492462158} 11/07/2021 16:32:01 - INFO - __main__ - Step 136886: {'lr': 9.625342960815048e-06, 'samples': 26282112, 'steps': 136885, 'loss/train': 1.487064003944397} 11/07/2021 16:32:02 - INFO - __main__ - Step 136887: {'lr': 9.623884669985068e-06, 'samples': 26282304, 'steps': 136886, 'loss/train': 1.6693147420883179} 11/07/2021 16:32:03 - INFO - __main__ - Step 136888: {'lr': 9.622426487464863e-06, 'samples': 26282496, 'steps': 136887, 'loss/train': 1.2211295366287231} 11/07/2021 16:32:03 - INFO - __main__ - Step 136889: {'lr': 9.620968413255077e-06, 'samples': 26282688, 'steps': 136888, 'loss/train': 1.2344021797180176} 11/07/2021 16:32:03 - INFO - __main__ - Step 136890: {'lr': 9.6195104473564e-06, 'samples': 26282880, 'steps': 136889, 'loss/train': 1.0773550271987915} 11/07/2021 16:32:04 - INFO - __main__ - Step 136891: {'lr': 9.618052589769443e-06, 'samples': 26283072, 'steps': 136890, 'loss/train': 0.6772016286849976} 11/07/2021 16:32:04 - INFO - __main__ - Step 136892: {'lr': 9.616594840494875e-06, 'samples': 26283264, 'steps': 136891, 'loss/train': 1.2420742511749268} 11/07/2021 16:32:06 - INFO - __main__ - Step 136893: {'lr': 9.615137199533358e-06, 'samples': 26283456, 'steps': 136892, 'loss/train': 1.2304413318634033} 11/07/2021 16:32:06 - INFO - __main__ - Step 136894: {'lr': 9.613679666885562e-06, 'samples': 26283648, 'steps': 136893, 'loss/train': 1.462432861328125} 11/07/2021 16:32:06 - INFO - __main__ - Step 136895: {'lr': 9.61222224255215e-06, 'samples': 26283840, 'steps': 136894, 'loss/train': 1.5502101182937622} 11/07/2021 16:32:07 - INFO - __main__ - Step 136896: {'lr': 9.610764926533733e-06, 'samples': 26284032, 'steps': 136895, 'loss/train': 1.7044651508331299} 11/07/2021 16:32:07 - INFO - __main__ - Step 136897: {'lr': 9.609307718831006e-06, 'samples': 26284224, 'steps': 136896, 'loss/train': 0.9281290173530579} 11/07/2021 16:32:07 - INFO - __main__ - Step 136898: {'lr': 9.607850619444608e-06, 'samples': 26284416, 'steps': 136897, 'loss/train': 1.2535488605499268} 11/07/2021 16:32:08 - INFO - __main__ - Step 136899: {'lr': 9.606393628375204e-06, 'samples': 26284608, 'steps': 136898, 'loss/train': 1.3495513200759888} 11/07/2021 16:32:09 - INFO - __main__ - Step 136900: {'lr': 9.604936745623488e-06, 'samples': 26284800, 'steps': 136899, 'loss/train': 0.19279050827026367} 11/07/2021 16:32:09 - INFO - __main__ - Step 136901: {'lr': 9.603479971190044e-06, 'samples': 26284992, 'steps': 136900, 'loss/train': 1.495041012763977} 11/07/2021 16:32:09 - INFO - __main__ - Step 136902: {'lr': 9.602023305075564e-06, 'samples': 26285184, 'steps': 136901, 'loss/train': 1.291307806968689} 11/07/2021 16:32:10 - INFO - __main__ - Step 136903: {'lr': 9.600566747280714e-06, 'samples': 26285376, 'steps': 136902, 'loss/train': 0.8616219758987427} 11/07/2021 16:32:10 - INFO - __main__ - Step 136904: {'lr': 9.599110297806135e-06, 'samples': 26285568, 'steps': 136903, 'loss/train': 1.0814779996871948} 11/07/2021 16:32:11 - INFO - __main__ - Step 136905: {'lr': 9.597653956652464e-06, 'samples': 26285760, 'steps': 136904, 'loss/train': 1.2640646696090698} 11/07/2021 16:32:11 - INFO - __main__ - Step 136906: {'lr': 9.596197723820394e-06, 'samples': 26285952, 'steps': 136905, 'loss/train': 1.327504277229309} 11/07/2021 16:32:12 - INFO - __main__ - Step 136907: {'lr': 9.594741599310564e-06, 'samples': 26286144, 'steps': 136906, 'loss/train': 1.4831550121307373} 11/07/2021 16:32:12 - INFO - __main__ - Step 136908: {'lr': 9.59328558312364e-06, 'samples': 26286336, 'steps': 136907, 'loss/train': 1.4548441171646118} 11/07/2021 16:32:13 - INFO - __main__ - Step 136909: {'lr': 9.591829675260288e-06, 'samples': 26286528, 'steps': 136908, 'loss/train': 1.3879331350326538} 11/07/2021 16:32:14 - INFO - __main__ - Step 136910: {'lr': 9.59037387572112e-06, 'samples': 26286720, 'steps': 136909, 'loss/train': 1.29970383644104} 11/07/2021 16:32:14 - INFO - __main__ - Step 136911: {'lr': 9.588918184506829e-06, 'samples': 26286912, 'steps': 136910, 'loss/train': 1.4761236906051636} 11/07/2021 16:32:14 - INFO - __main__ - Step 136912: {'lr': 9.58746260161808e-06, 'samples': 26287104, 'steps': 136911, 'loss/train': 0.6335599422454834} 11/07/2021 16:32:15 - INFO - __main__ - Step 136913: {'lr': 9.586007127055513e-06, 'samples': 26287296, 'steps': 136912, 'loss/train': 1.050858736038208} 11/07/2021 16:32:15 - INFO - __main__ - Step 136914: {'lr': 9.584551760819764e-06, 'samples': 26287488, 'steps': 136913, 'loss/train': 1.109098196029663} 11/07/2021 16:32:16 - INFO - __main__ - Step 136915: {'lr': 9.583096502911503e-06, 'samples': 26287680, 'steps': 136914, 'loss/train': 2.0212018489837646} 11/07/2021 16:32:17 - INFO - __main__ - Step 136916: {'lr': 9.581641353331393e-06, 'samples': 26287872, 'steps': 136915, 'loss/train': 0.463416188955307} 11/07/2021 16:32:17 - INFO - __main__ - Step 136917: {'lr': 9.580186312080074e-06, 'samples': 26288064, 'steps': 136916, 'loss/train': 5.690675258636475} 11/07/2021 16:32:17 - INFO - __main__ - Step 136918: {'lr': 9.57873137915824e-06, 'samples': 26288256, 'steps': 136917, 'loss/train': 1.2654732465744019} 11/07/2021 16:32:18 - INFO - __main__ - Step 136919: {'lr': 9.577276554566499e-06, 'samples': 26288448, 'steps': 136918, 'loss/train': 0.9755619168281555} 11/07/2021 16:32:18 - INFO - __main__ - Step 136920: {'lr': 9.575821838305521e-06, 'samples': 26288640, 'steps': 136919, 'loss/train': 1.2613986730575562} 11/07/2021 16:32:19 - INFO - __main__ - Step 136921: {'lr': 9.57436723037597e-06, 'samples': 26288832, 'steps': 136920, 'loss/train': 1.0704059600830078} 11/07/2021 16:32:19 - INFO - __main__ - Step 136922: {'lr': 9.572912730778511e-06, 'samples': 26289024, 'steps': 136921, 'loss/train': 0.9377416968345642} 11/07/2021 16:32:20 - INFO - __main__ - Step 136923: {'lr': 9.571458339513783e-06, 'samples': 26289216, 'steps': 136922, 'loss/train': 1.3805075883865356} 11/07/2021 16:32:20 - INFO - __main__ - Step 136924: {'lr': 9.570004056582454e-06, 'samples': 26289408, 'steps': 136923, 'loss/train': 1.0157890319824219} 11/07/2021 16:32:20 - INFO - __main__ - Step 136925: {'lr': 9.568549881985161e-06, 'samples': 26289600, 'steps': 136924, 'loss/train': 1.2166928052902222} 11/07/2021 16:32:22 - INFO - __main__ - Step 136926: {'lr': 9.567095815722598e-06, 'samples': 26289792, 'steps': 136925, 'loss/train': 1.2848840951919556} 11/07/2021 16:32:22 - INFO - __main__ - Step 136927: {'lr': 9.565641857795376e-06, 'samples': 26289984, 'steps': 136926, 'loss/train': 1.4100135564804077} 11/07/2021 16:32:22 - INFO - __main__ - Step 136928: {'lr': 9.564188008204134e-06, 'samples': 26290176, 'steps': 136927, 'loss/train': 1.2030072212219238} 11/07/2021 16:32:23 - INFO - __main__ - Step 136929: {'lr': 9.562734266949591e-06, 'samples': 26290368, 'steps': 136928, 'loss/train': 1.260185956954956} 11/07/2021 16:32:23 - INFO - __main__ - Step 136930: {'lr': 9.56128063403236e-06, 'samples': 26290560, 'steps': 136929, 'loss/train': 1.1868278980255127} 11/07/2021 16:32:24 - INFO - __main__ - Step 136931: {'lr': 9.559827109453106e-06, 'samples': 26290752, 'steps': 136930, 'loss/train': 1.3350093364715576} 11/07/2021 16:32:25 - INFO - __main__ - Step 136932: {'lr': 9.558373693212468e-06, 'samples': 26290944, 'steps': 136931, 'loss/train': 1.6278620958328247} 11/07/2021 16:32:25 - INFO - __main__ - Step 136933: {'lr': 9.55692038531114e-06, 'samples': 26291136, 'steps': 136932, 'loss/train': 1.6820143461227417} 11/07/2021 16:32:25 - INFO - __main__ - Step 136934: {'lr': 9.555467185749733e-06, 'samples': 26291328, 'steps': 136933, 'loss/train': 1.151458740234375} 11/07/2021 16:32:26 - INFO - __main__ - Step 136935: {'lr': 9.55401409452894e-06, 'samples': 26291520, 'steps': 136934, 'loss/train': 1.1667481660842896} 11/07/2021 16:32:26 - INFO - __main__ - Step 136936: {'lr': 9.552561111649372e-06, 'samples': 26291712, 'steps': 136935, 'loss/train': 1.4766544103622437} 11/07/2021 16:32:27 - INFO - __main__ - Step 136937: {'lr': 9.55110823711175e-06, 'samples': 26291904, 'steps': 136936, 'loss/train': 1.0323193073272705} 11/07/2021 16:32:27 - INFO - __main__ - Step 136938: {'lr': 9.549655470916657e-06, 'samples': 26292096, 'steps': 136937, 'loss/train': 0.8460630774497986} 11/07/2021 16:32:28 - INFO - __main__ - Step 136939: {'lr': 9.548202813064788e-06, 'samples': 26292288, 'steps': 136938, 'loss/train': 1.4144363403320312} 11/07/2021 16:32:28 - INFO - __main__ - Step 136940: {'lr': 9.546750263556808e-06, 'samples': 26292480, 'steps': 136939, 'loss/train': 1.2257236242294312} 11/07/2021 16:32:28 - INFO - __main__ - Step 136941: {'lr': 9.545297822393357e-06, 'samples': 26292672, 'steps': 136940, 'loss/train': 1.4608769416809082} 11/07/2021 16:32:29 - INFO - __main__ - Step 136942: {'lr': 9.543845489575043e-06, 'samples': 26292864, 'steps': 136941, 'loss/train': 0.7593738436698914} 11/07/2021 16:32:30 - INFO - __main__ - Step 136943: {'lr': 9.54239326510259e-06, 'samples': 26293056, 'steps': 136942, 'loss/train': 1.0140916109085083} 11/07/2021 16:32:30 - INFO - __main__ - Step 136944: {'lr': 9.540941148976606e-06, 'samples': 26293248, 'steps': 136943, 'loss/train': 1.1692568063735962} 11/07/2021 16:32:30 - INFO - __main__ - Step 136945: {'lr': 9.53948914119776e-06, 'samples': 26293440, 'steps': 136944, 'loss/train': 1.4247785806655884} 11/07/2021 16:32:31 - INFO - __main__ - Step 136946: {'lr': 9.538037241766717e-06, 'samples': 26293632, 'steps': 136945, 'loss/train': 1.1456724405288696} 11/07/2021 16:32:31 - INFO - __main__ - Step 136947: {'lr': 9.536585450684142e-06, 'samples': 26293824, 'steps': 136946, 'loss/train': 0.814092755317688} 11/07/2021 16:32:32 - INFO - __main__ - Step 136948: {'lr': 9.53513376795065e-06, 'samples': 26294016, 'steps': 136947, 'loss/train': 0.9355370402336121} 11/07/2021 16:32:33 - INFO - __main__ - Step 136949: {'lr': 9.533682193566928e-06, 'samples': 26294208, 'steps': 136948, 'loss/train': 1.4138833284378052} 11/07/2021 16:32:33 - INFO - __main__ - Step 136950: {'lr': 9.532230727533592e-06, 'samples': 26294400, 'steps': 136949, 'loss/train': 1.4950873851776123} 11/07/2021 16:32:33 - INFO - __main__ - Step 136951: {'lr': 9.53077936985136e-06, 'samples': 26294592, 'steps': 136950, 'loss/train': 1.0558383464813232} 11/07/2021 16:32:34 - INFO - __main__ - Step 136952: {'lr': 9.52932812052082e-06, 'samples': 26294784, 'steps': 136951, 'loss/train': 1.3712435960769653} 11/07/2021 16:32:35 - INFO - __main__ - Step 136953: {'lr': 9.527876979542688e-06, 'samples': 26294976, 'steps': 136952, 'loss/train': 1.1759648323059082} 11/07/2021 16:32:35 - INFO - __main__ - Step 136954: {'lr': 9.526425946917577e-06, 'samples': 26295168, 'steps': 136953, 'loss/train': 1.355371356010437} 11/07/2021 16:32:35 - INFO - __main__ - Step 136955: {'lr': 9.524975022646127e-06, 'samples': 26295360, 'steps': 136954, 'loss/train': 0.9177355170249939} 11/07/2021 16:32:36 - INFO - __main__ - Step 136956: {'lr': 9.523524206729001e-06, 'samples': 26295552, 'steps': 136955, 'loss/train': 1.188301920890808} 11/07/2021 16:32:36 - INFO - __main__ - Step 136957: {'lr': 9.522073499166895e-06, 'samples': 26295744, 'steps': 136956, 'loss/train': 1.3059881925582886} 11/07/2021 16:32:37 - INFO - __main__ - Step 136958: {'lr': 9.520622899960418e-06, 'samples': 26295936, 'steps': 136957, 'loss/train': 1.2025792598724365} 11/07/2021 16:32:38 - INFO - __main__ - Step 136959: {'lr': 9.519172409110238e-06, 'samples': 26296128, 'steps': 136958, 'loss/train': 1.5569528341293335} 11/07/2021 16:32:38 - INFO - __main__ - Step 136960: {'lr': 9.517722026616993e-06, 'samples': 26296320, 'steps': 136959, 'loss/train': 1.228353500366211} 11/07/2021 16:32:38 - INFO - __main__ - Step 136961: {'lr': 9.516271752481376e-06, 'samples': 26296512, 'steps': 136960, 'loss/train': 1.150259017944336} 11/07/2021 16:32:39 - INFO - __main__ - Step 136962: {'lr': 9.514821586703998e-06, 'samples': 26296704, 'steps': 136961, 'loss/train': 1.116127371788025} 11/07/2021 16:32:39 - INFO - __main__ - Step 136963: {'lr': 9.513371529285525e-06, 'samples': 26296896, 'steps': 136962, 'loss/train': 0.6703917980194092} 11/07/2021 16:32:40 - INFO - __main__ - Step 136964: {'lr': 9.511921580226651e-06, 'samples': 26297088, 'steps': 136963, 'loss/train': 0.9270676374435425} 11/07/2021 16:32:41 - INFO - __main__ - Step 136965: {'lr': 9.51047173952796e-06, 'samples': 26297280, 'steps': 136964, 'loss/train': 1.032195806503296} 11/07/2021 16:32:41 - INFO - __main__ - Step 136966: {'lr': 9.509022007190143e-06, 'samples': 26297472, 'steps': 136965, 'loss/train': 1.1312063932418823} 11/07/2021 16:32:41 - INFO - __main__ - Step 136967: {'lr': 9.507572383213898e-06, 'samples': 26297664, 'steps': 136966, 'loss/train': 0.07310379296541214} 11/07/2021 16:32:42 - INFO - __main__ - Step 136968: {'lr': 9.506122867599775e-06, 'samples': 26297856, 'steps': 136967, 'loss/train': 1.417590856552124} 11/07/2021 16:32:43 - INFO - __main__ - Step 136969: {'lr': 9.5046734603485e-06, 'samples': 26298048, 'steps': 136968, 'loss/train': 2.764132022857666} 11/07/2021 16:32:43 - INFO - __main__ - Step 136970: {'lr': 9.50322416146071e-06, 'samples': 26298240, 'steps': 136969, 'loss/train': 0.9993659853935242} 11/07/2021 16:32:43 - INFO - __main__ - Step 136971: {'lr': 9.501774970937044e-06, 'samples': 26298432, 'steps': 136970, 'loss/train': 1.767940878868103} 11/07/2021 16:32:44 - INFO - __main__ - Step 136972: {'lr': 9.500325888778166e-06, 'samples': 26298624, 'steps': 136971, 'loss/train': 0.8118339776992798} 11/07/2021 16:32:44 - INFO - __main__ - Step 136973: {'lr': 9.498876914984745e-06, 'samples': 26298816, 'steps': 136972, 'loss/train': 1.0084978342056274} 11/07/2021 16:32:45 - INFO - __main__ - Step 136974: {'lr': 9.497428049557416e-06, 'samples': 26299008, 'steps': 136973, 'loss/train': 1.7293580770492554} 11/07/2021 16:32:46 - INFO - __main__ - Step 136975: {'lr': 9.49597929249682e-06, 'samples': 26299200, 'steps': 136974, 'loss/train': 1.1843416690826416} 11/07/2021 16:32:46 - INFO - __main__ - Step 136976: {'lr': 9.49453064380365e-06, 'samples': 26299392, 'steps': 136975, 'loss/train': 1.3231821060180664} 11/07/2021 16:32:46 - INFO - __main__ - Step 136977: {'lr': 9.493082103478518e-06, 'samples': 26299584, 'steps': 136976, 'loss/train': 1.187849998474121} 11/07/2021 16:32:47 - INFO - __main__ - Step 136978: {'lr': 9.491633671522116e-06, 'samples': 26299776, 'steps': 136977, 'loss/train': 1.0029542446136475} 11/07/2021 16:32:47 - INFO - __main__ - Step 136979: {'lr': 9.490185347935055e-06, 'samples': 26299968, 'steps': 136978, 'loss/train': 1.7465293407440186} 11/07/2021 16:32:48 - INFO - __main__ - Step 136980: {'lr': 9.488737132718e-06, 'samples': 26300160, 'steps': 136979, 'loss/train': 0.6331982016563416} 11/07/2021 16:32:48 - INFO - __main__ - Step 136981: {'lr': 9.487289025871593e-06, 'samples': 26300352, 'steps': 136980, 'loss/train': 0.7948840856552124} 11/07/2021 16:32:49 - INFO - __main__ - Step 136982: {'lr': 9.485841027396524e-06, 'samples': 26300544, 'steps': 136981, 'loss/train': 1.0774303674697876} 11/07/2021 16:32:49 - INFO - __main__ - Step 136983: {'lr': 9.484393137293406e-06, 'samples': 26300736, 'steps': 136982, 'loss/train': 1.3134664297103882} 11/07/2021 16:32:49 - INFO - __main__ - Step 136984: {'lr': 9.482945355562934e-06, 'samples': 26300928, 'steps': 136983, 'loss/train': 1.2228444814682007} 11/07/2021 16:32:51 - INFO - __main__ - Step 136985: {'lr': 9.481497682205713e-06, 'samples': 26301120, 'steps': 136984, 'loss/train': 1.301138162612915} 11/07/2021 16:32:51 - INFO - __main__ - Step 136986: {'lr': 9.480050117222417e-06, 'samples': 26301312, 'steps': 136985, 'loss/train': 1.238535761833191} 11/07/2021 16:32:51 - INFO - __main__ - Step 136987: {'lr': 9.478602660613706e-06, 'samples': 26301504, 'steps': 136986, 'loss/train': 1.6233253479003906} 11/07/2021 16:32:52 - INFO - __main__ - Step 136988: {'lr': 9.477155312380249e-06, 'samples': 26301696, 'steps': 136987, 'loss/train': 1.1891615390777588} 11/07/2021 16:32:52 - INFO - __main__ - Step 136989: {'lr': 9.475708072522681e-06, 'samples': 26301888, 'steps': 136988, 'loss/train': 1.2006033658981323} 11/07/2021 16:32:53 - INFO - __main__ - Step 136990: {'lr': 9.474260941041618e-06, 'samples': 26302080, 'steps': 136989, 'loss/train': 1.3115047216415405} 11/07/2021 16:32:53 - INFO - __main__ - Step 136991: {'lr': 9.472813917937722e-06, 'samples': 26302272, 'steps': 136990, 'loss/train': 1.4026554822921753} 11/07/2021 16:32:54 - INFO - __main__ - Step 136992: {'lr': 9.47136700321169e-06, 'samples': 26302464, 'steps': 136991, 'loss/train': 1.1770590543746948} 11/07/2021 16:32:54 - INFO - __main__ - Step 136993: {'lr': 9.469920196864157e-06, 'samples': 26302656, 'steps': 136992, 'loss/train': 0.9952117800712585} 11/07/2021 16:32:54 - INFO - __main__ - Step 136994: {'lr': 9.468473498895735e-06, 'samples': 26302848, 'steps': 136993, 'loss/train': 1.515442967414856} 11/07/2021 16:32:55 - INFO - __main__ - Step 136995: {'lr': 9.46702690930712e-06, 'samples': 26303040, 'steps': 136994, 'loss/train': 1.2495120763778687} 11/07/2021 16:32:56 - INFO - __main__ - Step 136996: {'lr': 9.465580428098974e-06, 'samples': 26303232, 'steps': 136995, 'loss/train': 0.7887546420097351} 11/07/2021 16:32:56 - INFO - __main__ - Step 136997: {'lr': 9.464134055271911e-06, 'samples': 26303424, 'steps': 136996, 'loss/train': 1.687744379043579} 11/07/2021 16:32:57 - INFO - __main__ - Step 136998: {'lr': 9.462687790826568e-06, 'samples': 26303616, 'steps': 136997, 'loss/train': 1.5756720304489136} 11/07/2021 16:32:57 - INFO - __main__ - Step 136999: {'lr': 9.461241634763667e-06, 'samples': 26303808, 'steps': 136998, 'loss/train': 1.3458517789840698} 11/07/2021 16:32:58 - INFO - __main__ - Step 137000: {'lr': 9.45979558708382e-06, 'samples': 26304000, 'steps': 136999, 'loss/train': 1.2699170112609863} 11/07/2021 16:32:58 - INFO - __main__ - Step 137001: {'lr': 9.458349647787662e-06, 'samples': 26304192, 'steps': 137000, 'loss/train': 1.232543706893921} 11/07/2021 16:32:59 - INFO - __main__ - Step 137002: {'lr': 9.456903816875861e-06, 'samples': 26304384, 'steps': 137001, 'loss/train': 1.3190560340881348} 11/07/2021 16:32:59 - INFO - __main__ - Step 137003: {'lr': 9.455458094349084e-06, 'samples': 26304576, 'steps': 137002, 'loss/train': 1.3238295316696167} 11/07/2021 16:32:59 - INFO - __main__ - Step 137004: {'lr': 9.45401248020794e-06, 'samples': 26304768, 'steps': 137003, 'loss/train': 0.7974807620048523} 11/07/2021 16:33:00 - INFO - __main__ - Step 137005: {'lr': 9.452566974453098e-06, 'samples': 26304960, 'steps': 137004, 'loss/train': 0.8865166902542114} 11/07/2021 16:33:01 - INFO - __main__ - Step 137006: {'lr': 9.451121577085247e-06, 'samples': 26305152, 'steps': 137005, 'loss/train': 1.2930877208709717} 11/07/2021 16:33:01 - INFO - __main__ - Step 137007: {'lr': 9.449676288105002e-06, 'samples': 26305344, 'steps': 137006, 'loss/train': 0.688584566116333} 11/07/2021 16:33:01 - INFO - __main__ - Step 137008: {'lr': 9.448231107513e-06, 'samples': 26305536, 'steps': 137007, 'loss/train': 1.1073987483978271} 11/07/2021 16:33:02 - INFO - __main__ - Step 137009: {'lr': 9.446786035309935e-06, 'samples': 26305728, 'steps': 137008, 'loss/train': 0.8000568747520447} 11/07/2021 16:33:03 - INFO - __main__ - Step 137010: {'lr': 9.445341071496416e-06, 'samples': 26305920, 'steps': 137009, 'loss/train': 0.7892572283744812} 11/07/2021 16:33:03 - INFO - __main__ - Step 137011: {'lr': 9.443896216073167e-06, 'samples': 26306112, 'steps': 137010, 'loss/train': 1.176794171333313} 11/07/2021 16:33:04 - INFO - __main__ - Step 137012: {'lr': 9.442451469040741e-06, 'samples': 26306304, 'steps': 137011, 'loss/train': 0.9913642406463623} 11/07/2021 16:33:04 - INFO - __main__ - Step 137013: {'lr': 9.441006830399834e-06, 'samples': 26306496, 'steps': 137012, 'loss/train': 1.5286551713943481} 11/07/2021 16:33:04 - INFO - __main__ - Step 137014: {'lr': 9.439562300151112e-06, 'samples': 26306688, 'steps': 137013, 'loss/train': 0.8478291034698486} 11/07/2021 16:33:05 - INFO - __main__ - Step 137015: {'lr': 9.438117878295182e-06, 'samples': 26306880, 'steps': 137014, 'loss/train': 1.5972492694854736} 11/07/2021 16:33:06 - INFO - __main__ - Step 137016: {'lr': 9.436673564832744e-06, 'samples': 26307072, 'steps': 137015, 'loss/train': 1.4223015308380127} 11/07/2021 16:33:06 - INFO - __main__ - Step 137017: {'lr': 9.43522935976443e-06, 'samples': 26307264, 'steps': 137016, 'loss/train': 0.9278798699378967} 11/07/2021 16:33:06 - INFO - __main__ - Step 137018: {'lr': 9.433785263090883e-06, 'samples': 26307456, 'steps': 137017, 'loss/train': 1.1599483489990234} 11/07/2021 16:33:07 - INFO - __main__ - Step 137019: {'lr': 9.432341274812767e-06, 'samples': 26307648, 'steps': 137018, 'loss/train': 0.38555335998535156} 11/07/2021 16:33:07 - INFO - __main__ - Step 137020: {'lr': 9.430897394930721e-06, 'samples': 26307840, 'steps': 137019, 'loss/train': 1.5013744831085205} 11/07/2021 16:33:08 - INFO - __main__ - Step 137021: {'lr': 9.42945362344541e-06, 'samples': 26308032, 'steps': 137020, 'loss/train': 1.0464762449264526} 11/07/2021 16:33:08 - INFO - __main__ - Step 137022: {'lr': 9.428009960357504e-06, 'samples': 26308224, 'steps': 137021, 'loss/train': 1.456389307975769} 11/07/2021 16:33:09 - INFO - __main__ - Step 137023: {'lr': 9.426566405667581e-06, 'samples': 26308416, 'steps': 137022, 'loss/train': 1.0079333782196045} 11/07/2021 16:33:09 - INFO - __main__ - Step 137024: {'lr': 9.425122959376337e-06, 'samples': 26308608, 'steps': 137023, 'loss/train': 1.635650873184204} 11/07/2021 16:33:09 - INFO - __main__ - Step 137025: {'lr': 9.423679621484438e-06, 'samples': 26308800, 'steps': 137024, 'loss/train': 1.450881838798523} 11/07/2021 16:33:11 - INFO - __main__ - Step 137026: {'lr': 9.422236391992495e-06, 'samples': 26308992, 'steps': 137025, 'loss/train': 0.9440540671348572} 11/07/2021 16:33:11 - INFO - __main__ - Step 137027: {'lr': 9.420793270901202e-06, 'samples': 26309184, 'steps': 137026, 'loss/train': 1.448738694190979} 11/07/2021 16:33:11 - INFO - __main__ - Step 137028: {'lr': 9.419350258211168e-06, 'samples': 26309376, 'steps': 137027, 'loss/train': 1.3477965593338013} 11/07/2021 16:33:12 - INFO - __main__ - Step 137029: {'lr': 9.41790735392306e-06, 'samples': 26309568, 'steps': 137028, 'loss/train': 1.553197979927063} 11/07/2021 16:33:12 - INFO - __main__ - Step 137030: {'lr': 9.416464558037546e-06, 'samples': 26309760, 'steps': 137029, 'loss/train': 1.3121306896209717} 11/07/2021 16:33:13 - INFO - __main__ - Step 137031: {'lr': 9.415021870555262e-06, 'samples': 26309952, 'steps': 137030, 'loss/train': 1.313671350479126} 11/07/2021 16:33:13 - INFO - __main__ - Step 137032: {'lr': 9.413579291476848e-06, 'samples': 26310144, 'steps': 137031, 'loss/train': 1.431710124015808} 11/07/2021 16:33:14 - INFO - __main__ - Step 137033: {'lr': 9.412136820802942e-06, 'samples': 26310336, 'steps': 137032, 'loss/train': 1.002617597579956} 11/07/2021 16:33:14 - INFO - __main__ - Step 137034: {'lr': 9.410694458534264e-06, 'samples': 26310528, 'steps': 137033, 'loss/train': 2.748305320739746} 11/07/2021 16:33:15 - INFO - __main__ - Step 137035: {'lr': 9.409252204671399e-06, 'samples': 26310720, 'steps': 137034, 'loss/train': 0.6952771544456482} 11/07/2021 16:33:16 - INFO - __main__ - Step 137036: {'lr': 9.40781005921501e-06, 'samples': 26310912, 'steps': 137035, 'loss/train': 0.7358689308166504} 11/07/2021 16:33:16 - INFO - __main__ - Step 137037: {'lr': 9.40636802216574e-06, 'samples': 26311104, 'steps': 137036, 'loss/train': 1.4701272249221802} 11/07/2021 16:33:16 - INFO - __main__ - Step 137038: {'lr': 9.404926093524225e-06, 'samples': 26311296, 'steps': 137037, 'loss/train': 0.434855580329895} 11/07/2021 16:33:17 - INFO - __main__ - Step 137039: {'lr': 9.403484273291157e-06, 'samples': 26311488, 'steps': 137038, 'loss/train': 0.9989266395568848} 11/07/2021 16:33:17 - INFO - __main__ - Step 137040: {'lr': 9.402042561467177e-06, 'samples': 26311680, 'steps': 137039, 'loss/train': 1.2499579191207886} 11/07/2021 16:33:18 - INFO - __main__ - Step 137041: {'lr': 9.400600958052923e-06, 'samples': 26311872, 'steps': 137040, 'loss/train': 1.3430140018463135} 11/07/2021 16:33:18 - INFO - __main__ - Step 137042: {'lr': 9.399159463049034e-06, 'samples': 26312064, 'steps': 137041, 'loss/train': 1.2669346332550049} 11/07/2021 16:33:19 - INFO - __main__ - Step 137043: {'lr': 9.397718076456174e-06, 'samples': 26312256, 'steps': 137042, 'loss/train': 0.9600937962532043} 11/07/2021 16:33:19 - INFO - __main__ - Step 137044: {'lr': 9.396276798274983e-06, 'samples': 26312448, 'steps': 137043, 'loss/train': 0.9961363077163696} 11/07/2021 16:33:19 - INFO - __main__ - Step 137045: {'lr': 9.394835628506127e-06, 'samples': 26312640, 'steps': 137044, 'loss/train': 1.1911067962646484} 11/07/2021 16:33:20 - INFO - __main__ - Step 137046: {'lr': 9.393394567150243e-06, 'samples': 26312832, 'steps': 137045, 'loss/train': 1.1819590330123901} 11/07/2021 16:33:21 - INFO - __main__ - Step 137047: {'lr': 9.391953614208026e-06, 'samples': 26313024, 'steps': 137046, 'loss/train': 0.4308485686779022} 11/07/2021 16:33:21 - INFO - __main__ - Step 137048: {'lr': 9.390512769680032e-06, 'samples': 26313216, 'steps': 137047, 'loss/train': 1.0969256162643433} 11/07/2021 16:33:22 - INFO - __main__ - Step 137049: {'lr': 9.38907203356698e-06, 'samples': 26313408, 'steps': 137048, 'loss/train': 1.3516976833343506} 11/07/2021 16:33:22 - INFO - __main__ - Step 137050: {'lr': 9.387631405869485e-06, 'samples': 26313600, 'steps': 137049, 'loss/train': 1.0666885375976562} 11/07/2021 16:33:22 - INFO - __main__ - Step 137051: {'lr': 9.386190886588208e-06, 'samples': 26313792, 'steps': 137050, 'loss/train': 1.2133699655532837} 11/07/2021 16:33:23 - INFO - __main__ - Step 137052: {'lr': 9.384750475723792e-06, 'samples': 26313984, 'steps': 137051, 'loss/train': 0.6960039734840393} 11/07/2021 16:33:24 - INFO - __main__ - Step 137053: {'lr': 9.3833101732769e-06, 'samples': 26314176, 'steps': 137052, 'loss/train': 0.7673358917236328} 11/07/2021 16:33:24 - INFO - __main__ - Step 137054: {'lr': 9.381869979248198e-06, 'samples': 26314368, 'steps': 137053, 'loss/train': 1.0500967502593994} 11/07/2021 16:33:24 - INFO - __main__ - Step 137055: {'lr': 9.380429893638298e-06, 'samples': 26314560, 'steps': 137054, 'loss/train': 1.526657223701477} 11/07/2021 16:33:25 - INFO - __main__ - Step 137056: {'lr': 9.378989916447866e-06, 'samples': 26314752, 'steps': 137055, 'loss/train': 1.46775221824646} 11/07/2021 16:33:26 - INFO - __main__ - Step 137057: {'lr': 9.377550047677541e-06, 'samples': 26314944, 'steps': 137056, 'loss/train': 0.988021194934845} 11/07/2021 16:33:26 - INFO - __main__ - Step 137058: {'lr': 9.37611028732796e-06, 'samples': 26315136, 'steps': 137057, 'loss/train': 1.192449927330017} 11/07/2021 16:33:27 - INFO - __main__ - Step 137059: {'lr': 9.374670635399819e-06, 'samples': 26315328, 'steps': 137058, 'loss/train': 0.9760972857475281} 11/07/2021 16:33:27 - INFO - __main__ - Step 137060: {'lr': 9.373231091893752e-06, 'samples': 26315520, 'steps': 137059, 'loss/train': 1.7394336462020874} 11/07/2021 16:33:27 - INFO - __main__ - Step 137061: {'lr': 9.371791656810402e-06, 'samples': 26315712, 'steps': 137060, 'loss/train': 0.027460573241114616} 11/07/2021 16:33:28 - INFO - __main__ - Step 137062: {'lr': 9.370352330150378e-06, 'samples': 26315904, 'steps': 137061, 'loss/train': 1.4316177368164062} 11/07/2021 16:33:29 - INFO - __main__ - Step 137063: {'lr': 9.368913111914345e-06, 'samples': 26316096, 'steps': 137062, 'loss/train': 1.0472651720046997} 11/07/2021 16:33:29 - INFO - __main__ - Step 137064: {'lr': 9.367474002102998e-06, 'samples': 26316288, 'steps': 137063, 'loss/train': 1.1748074293136597} 11/07/2021 16:33:29 - INFO - __main__ - Step 137065: {'lr': 9.36603500071695e-06, 'samples': 26316480, 'steps': 137064, 'loss/train': 1.14303719997406} 11/07/2021 16:33:30 - INFO - __main__ - Step 137066: {'lr': 9.364596107756834e-06, 'samples': 26316672, 'steps': 137065, 'loss/train': 1.270209789276123} 11/07/2021 16:33:31 - INFO - __main__ - Step 137067: {'lr': 9.36315732322332e-06, 'samples': 26316864, 'steps': 137066, 'loss/train': 1.2959766387939453} 11/07/2021 16:33:31 - INFO - __main__ - Step 137068: {'lr': 9.361718647117073e-06, 'samples': 26317056, 'steps': 137067, 'loss/train': 1.604539394378662} 11/07/2021 16:33:31 - INFO - __main__ - Step 137069: {'lr': 9.360280079438705e-06, 'samples': 26317248, 'steps': 137068, 'loss/train': 0.18692262470722198} 11/07/2021 16:33:32 - INFO - __main__ - Step 137070: {'lr': 9.358841620188879e-06, 'samples': 26317440, 'steps': 137069, 'loss/train': 1.5481892824172974} 11/07/2021 16:33:32 - INFO - __main__ - Step 137071: {'lr': 9.357403269368264e-06, 'samples': 26317632, 'steps': 137070, 'loss/train': 1.3573330640792847} 11/07/2021 16:33:32 - INFO - __main__ - Step 137072: {'lr': 9.35596502697747e-06, 'samples': 26317824, 'steps': 137071, 'loss/train': 1.442799687385559} 11/07/2021 16:33:34 - INFO - __main__ - Step 137073: {'lr': 9.35452689301719e-06, 'samples': 26318016, 'steps': 137072, 'loss/train': 1.1986521482467651} 11/07/2021 16:33:34 - INFO - __main__ - Step 137074: {'lr': 9.353088867488064e-06, 'samples': 26318208, 'steps': 137073, 'loss/train': 1.2786626815795898} 11/07/2021 16:33:34 - INFO - __main__ - Step 137075: {'lr': 9.3516509503907e-06, 'samples': 26318400, 'steps': 137074, 'loss/train': 1.469544529914856} 11/07/2021 16:33:35 - INFO - __main__ - Step 137076: {'lr': 9.350213141725739e-06, 'samples': 26318592, 'steps': 137075, 'loss/train': 1.3102562427520752} 11/07/2021 16:33:35 - INFO - __main__ - Step 137077: {'lr': 9.348775441493873e-06, 'samples': 26318784, 'steps': 137076, 'loss/train': 1.1109607219696045} 11/07/2021 16:33:36 - INFO - __main__ - Step 137078: {'lr': 9.347337849695742e-06, 'samples': 26318976, 'steps': 137077, 'loss/train': 1.3347198963165283} 11/07/2021 16:33:36 - INFO - __main__ - Step 137079: {'lr': 9.34590036633201e-06, 'samples': 26319168, 'steps': 137078, 'loss/train': 1.3768061399459839} 11/07/2021 16:33:37 - INFO - __main__ - Step 137080: {'lr': 9.344462991403263e-06, 'samples': 26319360, 'steps': 137079, 'loss/train': 1.0143364667892456} 11/07/2021 16:33:37 - INFO - __main__ - Step 137081: {'lr': 9.343025724910192e-06, 'samples': 26319552, 'steps': 137080, 'loss/train': 1.1728557348251343} 11/07/2021 16:33:38 - INFO - __main__ - Step 137082: {'lr': 9.341588566853465e-06, 'samples': 26319744, 'steps': 137081, 'loss/train': 0.9486132264137268} 11/07/2021 16:33:39 - INFO - __main__ - Step 137083: {'lr': 9.340151517233691e-06, 'samples': 26319936, 'steps': 137082, 'loss/train': 1.3267477750778198} 11/07/2021 16:33:39 - INFO - __main__ - Step 137084: {'lr': 9.338714576051538e-06, 'samples': 26320128, 'steps': 137083, 'loss/train': 1.4844681024551392} 11/07/2021 16:33:40 - INFO - __main__ - Step 137085: {'lr': 9.337277743307642e-06, 'samples': 26320320, 'steps': 137084, 'loss/train': 0.3914954960346222} 11/07/2021 16:33:40 - INFO - __main__ - Step 137086: {'lr': 9.335841019002644e-06, 'samples': 26320512, 'steps': 137085, 'loss/train': 1.3315519094467163} 11/07/2021 16:33:40 - INFO - __main__ - Step 137087: {'lr': 9.334404403137209e-06, 'samples': 26320704, 'steps': 137086, 'loss/train': 1.6369726657867432} 11/07/2021 16:33:41 - INFO - __main__ - Step 137088: {'lr': 9.332967895712002e-06, 'samples': 26320896, 'steps': 137087, 'loss/train': 1.7423701286315918} 11/07/2021 16:33:42 - INFO - __main__ - Step 137089: {'lr': 9.331531496727635e-06, 'samples': 26321088, 'steps': 137088, 'loss/train': 1.0486325025558472} 11/07/2021 16:33:42 - INFO - __main__ - Step 137090: {'lr': 9.330095206184747e-06, 'samples': 26321280, 'steps': 137089, 'loss/train': 0.9570795297622681} 11/07/2021 16:33:42 - INFO - __main__ - Step 137091: {'lr': 9.328659024084002e-06, 'samples': 26321472, 'steps': 137090, 'loss/train': 1.4216277599334717} 11/07/2021 16:33:43 - INFO - __main__ - Step 137092: {'lr': 9.327222950426068e-06, 'samples': 26321664, 'steps': 137091, 'loss/train': 0.6825554370880127} 11/07/2021 16:33:43 - INFO - __main__ - Step 137093: {'lr': 9.325786985211554e-06, 'samples': 26321856, 'steps': 137092, 'loss/train': 0.4619743227958679} 11/07/2021 16:33:44 - INFO - __main__ - Step 137094: {'lr': 9.324351128441155e-06, 'samples': 26322048, 'steps': 137093, 'loss/train': 1.6282659769058228} 11/07/2021 16:33:44 - INFO - __main__ - Step 137095: {'lr': 9.322915380115454e-06, 'samples': 26322240, 'steps': 137094, 'loss/train': 1.3781378269195557} 11/07/2021 16:33:45 - INFO - __main__ - Step 137096: {'lr': 9.321479740235172e-06, 'samples': 26322432, 'steps': 137095, 'loss/train': 1.4079077243804932} 11/07/2021 16:33:45 - INFO - __main__ - Step 137097: {'lr': 9.320044208800893e-06, 'samples': 26322624, 'steps': 137096, 'loss/train': 2.4069952964782715} 11/07/2021 16:33:46 - INFO - __main__ - Step 137098: {'lr': 9.318608785813281e-06, 'samples': 26322816, 'steps': 137097, 'loss/train': 1.2872872352600098} 11/07/2021 16:33:46 - INFO - __main__ - Step 137099: {'lr': 9.317173471273005e-06, 'samples': 26323008, 'steps': 137098, 'loss/train': 1.5734628438949585} 11/07/2021 16:33:47 - INFO - __main__ - Step 137100: {'lr': 9.315738265180702e-06, 'samples': 26323200, 'steps': 137099, 'loss/train': 0.7010256052017212} 11/07/2021 16:33:47 - INFO - __main__ - Step 137101: {'lr': 9.314303167537008e-06, 'samples': 26323392, 'steps': 137100, 'loss/train': 1.2654094696044922} 11/07/2021 16:33:48 - INFO - __main__ - Step 137102: {'lr': 9.312868178342593e-06, 'samples': 26323584, 'steps': 137101, 'loss/train': 1.3057377338409424} 11/07/2021 16:33:48 - INFO - __main__ - Step 137103: {'lr': 9.311433297598066e-06, 'samples': 26323776, 'steps': 137102, 'loss/train': 1.327391266822815} 11/07/2021 16:33:48 - INFO - __main__ - Step 137104: {'lr': 9.309998525304064e-06, 'samples': 26323968, 'steps': 137103, 'loss/train': 1.2710216045379639} 11/07/2021 16:33:49 - INFO - __main__ - Step 137105: {'lr': 9.308563861461311e-06, 'samples': 26324160, 'steps': 137104, 'loss/train': 1.5339114665985107} 11/07/2021 16:33:50 - INFO - __main__ - Step 137106: {'lr': 9.307129306070362e-06, 'samples': 26324352, 'steps': 137105, 'loss/train': 1.412033200263977} 11/07/2021 16:33:50 - INFO - __main__ - Step 137107: {'lr': 9.305694859131935e-06, 'samples': 26324544, 'steps': 137106, 'loss/train': 1.228474497795105} 11/07/2021 16:33:50 - INFO - __main__ - Step 137108: {'lr': 9.304260520646645e-06, 'samples': 26324736, 'steps': 137107, 'loss/train': 0.885399341583252} 11/07/2021 16:33:51 - INFO - __main__ - Step 137109: {'lr': 9.30282629061513e-06, 'samples': 26324928, 'steps': 137108, 'loss/train': 1.2980120182037354} 11/07/2021 16:33:52 - INFO - __main__ - Step 137110: {'lr': 9.301392169038052e-06, 'samples': 26325120, 'steps': 137109, 'loss/train': 1.3382384777069092} 11/07/2021 16:33:52 - INFO - __main__ - Step 137111: {'lr': 9.299958155916056e-06, 'samples': 26325312, 'steps': 137110, 'loss/train': 0.9399216771125793} 11/07/2021 16:33:53 - INFO - __main__ - Step 137112: {'lr': 9.298524251249774e-06, 'samples': 26325504, 'steps': 137111, 'loss/train': 1.0856863260269165} 11/07/2021 16:33:53 - INFO - __main__ - Step 137113: {'lr': 9.297090455039874e-06, 'samples': 26325696, 'steps': 137112, 'loss/train': 1.3505090475082397} 11/07/2021 16:33:53 - INFO - __main__ - Step 137114: {'lr': 9.295656767286997e-06, 'samples': 26325888, 'steps': 137113, 'loss/train': 1.5352727174758911} 11/07/2021 16:33:54 - INFO - __main__ - Step 137115: {'lr': 9.294223187991806e-06, 'samples': 26326080, 'steps': 137114, 'loss/train': 0.6133520007133484} 11/07/2021 16:33:55 - INFO - __main__ - Step 137116: {'lr': 9.292789717154887e-06, 'samples': 26326272, 'steps': 137115, 'loss/train': 1.3010259866714478} 11/07/2021 16:33:55 - INFO - __main__ - Step 137117: {'lr': 9.291356354776931e-06, 'samples': 26326464, 'steps': 137116, 'loss/train': 2.093855381011963} 11/07/2021 16:33:56 - INFO - __main__ - Step 137118: {'lr': 9.289923100858576e-06, 'samples': 26326656, 'steps': 137117, 'loss/train': 0.9574427604675293} 11/07/2021 16:33:56 - INFO - __main__ - Step 137119: {'lr': 9.288489955400465e-06, 'samples': 26326848, 'steps': 137118, 'loss/train': 1.227710247039795} 11/07/2021 16:33:57 - INFO - __main__ - Step 137120: {'lr': 9.287056918403231e-06, 'samples': 26327040, 'steps': 137119, 'loss/train': 1.9140454530715942} 11/07/2021 16:33:57 - INFO - __main__ - Step 137121: {'lr': 9.285623989867541e-06, 'samples': 26327232, 'steps': 137120, 'loss/train': 0.8910666108131409} 11/07/2021 16:33:58 - INFO - __main__ - Step 137122: {'lr': 9.284191169794038e-06, 'samples': 26327424, 'steps': 137121, 'loss/train': 0.963636040687561} 11/07/2021 16:33:58 - INFO - __main__ - Step 137123: {'lr': 9.282758458183383e-06, 'samples': 26327616, 'steps': 137122, 'loss/train': 1.4515742063522339} 11/07/2021 16:33:58 - INFO - __main__ - Step 137124: {'lr': 9.281325855036188e-06, 'samples': 26327808, 'steps': 137123, 'loss/train': 1.5812556743621826} 11/07/2021 16:33:59 - INFO - __main__ - Step 137125: {'lr': 9.279893360353093e-06, 'samples': 26328000, 'steps': 137124, 'loss/train': 1.1713801622390747} 11/07/2021 16:34:00 - INFO - __main__ - Step 137126: {'lr': 9.27846097413479e-06, 'samples': 26328192, 'steps': 137125, 'loss/train': 0.9620557427406311} 11/07/2021 16:34:00 - INFO - __main__ - Step 137127: {'lr': 9.27702869638189e-06, 'samples': 26328384, 'steps': 137126, 'loss/train': 0.8890747427940369} 11/07/2021 16:34:00 - INFO - __main__ - Step 137128: {'lr': 9.275596527095087e-06, 'samples': 26328576, 'steps': 137127, 'loss/train': 1.1606203317642212} 11/07/2021 16:34:01 - INFO - __main__ - Step 137129: {'lr': 9.274164466274937e-06, 'samples': 26328768, 'steps': 137128, 'loss/train': 1.3361161947250366} 11/07/2021 16:34:02 - INFO - __main__ - Step 137130: {'lr': 9.272732513922132e-06, 'samples': 26328960, 'steps': 137129, 'loss/train': 1.390215277671814} 11/07/2021 16:34:02 - INFO - __main__ - Step 137131: {'lr': 9.271300670037341e-06, 'samples': 26329152, 'steps': 137130, 'loss/train': 1.1173487901687622} 11/07/2021 16:34:02 - INFO - __main__ - Step 137132: {'lr': 9.269868934621173e-06, 'samples': 26329344, 'steps': 137131, 'loss/train': 1.398127555847168} 11/07/2021 16:34:03 - INFO - __main__ - Step 137133: {'lr': 9.268437307674293e-06, 'samples': 26329536, 'steps': 137132, 'loss/train': 0.8765532970428467} 11/07/2021 16:34:03 - INFO - __main__ - Step 137134: {'lr': 9.267005789197341e-06, 'samples': 26329728, 'steps': 137133, 'loss/train': 1.5273327827453613} 11/07/2021 16:34:04 - INFO - __main__ - Step 137135: {'lr': 9.265574379190955e-06, 'samples': 26329920, 'steps': 137134, 'loss/train': 1.1337287425994873} 11/07/2021 16:34:05 - INFO - __main__ - Step 137136: {'lr': 9.264143077655773e-06, 'samples': 26330112, 'steps': 137135, 'loss/train': 0.9134779572486877} 11/07/2021 16:34:05 - INFO - __main__ - Step 137137: {'lr': 9.262711884592462e-06, 'samples': 26330304, 'steps': 137136, 'loss/train': 0.5116821527481079} 11/07/2021 16:34:05 - INFO - __main__ - Step 137138: {'lr': 9.261280800001686e-06, 'samples': 26330496, 'steps': 137137, 'loss/train': 1.3134379386901855} 11/07/2021 16:34:06 - INFO - __main__ - Step 137139: {'lr': 9.259849823884031e-06, 'samples': 26330688, 'steps': 137138, 'loss/train': 1.2814937829971313} 11/07/2021 16:34:06 - INFO - __main__ - Step 137140: {'lr': 9.258418956240189e-06, 'samples': 26330880, 'steps': 137139, 'loss/train': 1.172891616821289} 11/07/2021 16:34:07 - INFO - __main__ - Step 137141: {'lr': 9.25698819707077e-06, 'samples': 26331072, 'steps': 137140, 'loss/train': 1.3369568586349487} 11/07/2021 16:34:07 - INFO - __main__ - Step 137142: {'lr': 9.255557546376498e-06, 'samples': 26331264, 'steps': 137141, 'loss/train': 0.8456249237060547} 11/07/2021 16:34:08 - INFO - __main__ - Step 137143: {'lr': 9.254127004157897e-06, 'samples': 26331456, 'steps': 137142, 'loss/train': 1.2325118780136108} 11/07/2021 16:34:08 - INFO - __main__ - Step 137144: {'lr': 9.252696570415691e-06, 'samples': 26331648, 'steps': 137143, 'loss/train': 1.2993767261505127} 11/07/2021 16:34:08 - INFO - __main__ - Step 137145: {'lr': 9.25126624515049e-06, 'samples': 26331840, 'steps': 137144, 'loss/train': 0.7795166969299316} 11/07/2021 16:34:09 - INFO - __main__ - Step 137146: {'lr': 9.249836028362962e-06, 'samples': 26332032, 'steps': 137145, 'loss/train': 1.8151772022247314} 11/07/2021 16:34:10 - INFO - __main__ - Step 137147: {'lr': 9.248405920053743e-06, 'samples': 26332224, 'steps': 137146, 'loss/train': 1.259210228919983} 11/07/2021 16:34:10 - INFO - __main__ - Step 137148: {'lr': 9.246975920223471e-06, 'samples': 26332416, 'steps': 137147, 'loss/train': 1.4781907796859741} 11/07/2021 16:34:10 - INFO - __main__ - Step 137149: {'lr': 9.245546028872814e-06, 'samples': 26332608, 'steps': 137148, 'loss/train': 1.1279855966567993} 11/07/2021 16:34:11 - INFO - __main__ - Step 137150: {'lr': 9.244116246002382e-06, 'samples': 26332800, 'steps': 137149, 'loss/train': 1.225103497505188} 11/07/2021 16:34:12 - INFO - __main__ - Step 137151: {'lr': 9.242686571612841e-06, 'samples': 26332992, 'steps': 137150, 'loss/train': 1.4008878469467163} 11/07/2021 16:34:12 - INFO - __main__ - Step 137152: {'lr': 9.241257005704828e-06, 'samples': 26333184, 'steps': 137151, 'loss/train': 1.46623957157135} 11/07/2021 16:34:13 - INFO - __main__ - Step 137153: {'lr': 9.239827548278983e-06, 'samples': 26333376, 'steps': 137152, 'loss/train': 1.5356545448303223} 11/07/2021 16:34:13 - INFO - __main__ - Step 137154: {'lr': 9.238398199335974e-06, 'samples': 26333568, 'steps': 137153, 'loss/train': 1.419562816619873} 11/07/2021 16:34:13 - INFO - __main__ - Step 137155: {'lr': 9.236968958876435e-06, 'samples': 26333760, 'steps': 137154, 'loss/train': 0.9347634315490723} 11/07/2021 16:34:14 - INFO - __main__ - Step 137156: {'lr': 9.23553982690098e-06, 'samples': 26333952, 'steps': 137155, 'loss/train': 1.2479487657546997} 11/07/2021 16:34:15 - INFO - __main__ - Step 137157: {'lr': 9.2341108034103e-06, 'samples': 26334144, 'steps': 137156, 'loss/train': 0.16600747406482697} 11/07/2021 16:34:15 - INFO - __main__ - Step 137158: {'lr': 9.232681888404981e-06, 'samples': 26334336, 'steps': 137157, 'loss/train': 0.8805684447288513} 11/07/2021 16:34:15 - INFO - __main__ - Step 137159: {'lr': 9.231253081885715e-06, 'samples': 26334528, 'steps': 137158, 'loss/train': 1.1124365329742432} 11/07/2021 16:34:16 - INFO - __main__ - Step 137160: {'lr': 9.229824383853141e-06, 'samples': 26334720, 'steps': 137159, 'loss/train': 1.3949143886566162} 11/07/2021 16:34:17 - INFO - __main__ - Step 137161: {'lr': 9.22839579430787e-06, 'samples': 26334912, 'steps': 137160, 'loss/train': 0.849983811378479} 11/07/2021 16:34:17 - INFO - __main__ - Step 137162: {'lr': 9.226967313250595e-06, 'samples': 26335104, 'steps': 137161, 'loss/train': 1.1104217767715454} 11/07/2021 16:34:17 - INFO - __main__ - Step 137163: {'lr': 9.225538940681927e-06, 'samples': 26335296, 'steps': 137162, 'loss/train': 1.4320350885391235} 11/07/2021 16:34:18 - INFO - __main__ - Step 137164: {'lr': 9.224110676602504e-06, 'samples': 26335488, 'steps': 137163, 'loss/train': 1.046335220336914} 11/07/2021 16:34:18 - INFO - __main__ - Step 137165: {'lr': 9.222682521012966e-06, 'samples': 26335680, 'steps': 137164, 'loss/train': 1.1312587261199951} 11/07/2021 16:34:19 - INFO - __main__ - Step 137166: {'lr': 9.221254473914004e-06, 'samples': 26335872, 'steps': 137165, 'loss/train': 1.188008189201355} 11/07/2021 16:34:20 - INFO - __main__ - Step 137167: {'lr': 9.21982653530623e-06, 'samples': 26336064, 'steps': 137166, 'loss/train': 0.31160110235214233} 11/07/2021 16:34:20 - INFO - __main__ - Step 137168: {'lr': 9.218398705190285e-06, 'samples': 26336256, 'steps': 137167, 'loss/train': 1.21698796749115} 11/07/2021 16:34:20 - INFO - __main__ - Step 137169: {'lr': 9.216970983566803e-06, 'samples': 26336448, 'steps': 137168, 'loss/train': 1.226285457611084} 11/07/2021 16:34:21 - INFO - __main__ - Step 137170: {'lr': 9.215543370436452e-06, 'samples': 26336640, 'steps': 137169, 'loss/train': 1.1349440813064575} 11/07/2021 16:34:22 - INFO - __main__ - Step 137171: {'lr': 9.214115865799843e-06, 'samples': 26336832, 'steps': 137170, 'loss/train': 1.3320541381835938} 11/07/2021 16:34:22 - INFO - __main__ - Step 137172: {'lr': 9.212688469657643e-06, 'samples': 26337024, 'steps': 137171, 'loss/train': 1.1641908884048462} 11/07/2021 16:34:22 - INFO - __main__ - Step 137173: {'lr': 9.211261182010488e-06, 'samples': 26337216, 'steps': 137172, 'loss/train': 1.5956107378005981} 11/07/2021 16:34:23 - INFO - __main__ - Step 137174: {'lr': 9.209834002859018e-06, 'samples': 26337408, 'steps': 137173, 'loss/train': 1.1829071044921875} 11/07/2021 16:34:23 - INFO - __main__ - Step 137175: {'lr': 9.2084069322039e-06, 'samples': 26337600, 'steps': 137174, 'loss/train': 1.8845287561416626} 11/07/2021 16:34:24 - INFO - __main__ - Step 137176: {'lr': 9.206979970045742e-06, 'samples': 26337792, 'steps': 137175, 'loss/train': 0.906486451625824} 11/07/2021 16:34:24 - INFO - __main__ - Step 137177: {'lr': 9.205553116385212e-06, 'samples': 26337984, 'steps': 137176, 'loss/train': 1.1242996454238892} 11/07/2021 16:34:25 - INFO - __main__ - Step 137178: {'lr': 9.204126371222921e-06, 'samples': 26338176, 'steps': 137177, 'loss/train': 1.4219645261764526} 11/07/2021 16:34:25 - INFO - __main__ - Step 137179: {'lr': 9.20269973455956e-06, 'samples': 26338368, 'steps': 137178, 'loss/train': 0.9647974371910095} 11/07/2021 16:34:25 - INFO - __main__ - Step 137180: {'lr': 9.201273206395743e-06, 'samples': 26338560, 'steps': 137179, 'loss/train': 1.2309901714324951} 11/07/2021 16:34:27 - INFO - __main__ - Step 137181: {'lr': 9.199846786732108e-06, 'samples': 26338752, 'steps': 137180, 'loss/train': 1.0658659934997559} 11/07/2021 16:34:27 - INFO - __main__ - Step 137182: {'lr': 9.198420475569346e-06, 'samples': 26338944, 'steps': 137181, 'loss/train': 1.2376502752304077} 11/07/2021 16:34:27 - INFO - __main__ - Step 137183: {'lr': 9.196994272908016e-06, 'samples': 26339136, 'steps': 137182, 'loss/train': 0.5967800617218018} 11/07/2021 16:34:28 - INFO - __main__ - Step 137184: {'lr': 9.195568178748809e-06, 'samples': 26339328, 'steps': 137183, 'loss/train': 1.4686540365219116} 11/07/2021 16:34:28 - INFO - __main__ - Step 137185: {'lr': 9.194142193092392e-06, 'samples': 26339520, 'steps': 137184, 'loss/train': 1.4778056144714355} 11/07/2021 16:34:28 - INFO - __main__ - Step 137186: {'lr': 9.192716315939349e-06, 'samples': 26339712, 'steps': 137185, 'loss/train': 1.1220506429672241} 11/07/2021 16:34:29 - INFO - __main__ - Step 137187: {'lr': 9.191290547290343e-06, 'samples': 26339904, 'steps': 137186, 'loss/train': 0.7701860070228577} 11/07/2021 16:34:30 - INFO - __main__ - Step 137188: {'lr': 9.189864887146044e-06, 'samples': 26340096, 'steps': 137187, 'loss/train': 1.192718744277954} 11/07/2021 16:34:30 - INFO - __main__ - Step 137189: {'lr': 9.188439335507087e-06, 'samples': 26340288, 'steps': 137188, 'loss/train': 1.2832690477371216} 11/07/2021 16:34:30 - INFO - __main__ - Step 137190: {'lr': 9.187013892374085e-06, 'samples': 26340480, 'steps': 137189, 'loss/train': 1.341437578201294} 11/07/2021 16:34:31 - INFO - __main__ - Step 137191: {'lr': 9.185588557747704e-06, 'samples': 26340672, 'steps': 137190, 'loss/train': 1.3995305299758911} 11/07/2021 16:34:32 - INFO - __main__ - Step 137192: {'lr': 9.18416333162858e-06, 'samples': 26340864, 'steps': 137191, 'loss/train': 1.4387218952178955} 11/07/2021 16:34:32 - INFO - __main__ - Step 137193: {'lr': 9.182738214017355e-06, 'samples': 26341056, 'steps': 137192, 'loss/train': 1.4074627161026} 11/07/2021 16:34:32 - INFO - __main__ - Step 137194: {'lr': 9.181313204914665e-06, 'samples': 26341248, 'steps': 137193, 'loss/train': 1.6131585836410522} 11/07/2021 16:34:33 - INFO - __main__ - Step 137195: {'lr': 9.179888304321205e-06, 'samples': 26341440, 'steps': 137194, 'loss/train': 1.3087794780731201} 11/07/2021 16:34:33 - INFO - __main__ - Step 137196: {'lr': 9.178463512237528e-06, 'samples': 26341632, 'steps': 137195, 'loss/train': 0.6126501560211182} 11/07/2021 16:34:34 - INFO - __main__ - Step 137197: {'lr': 9.17703882866433e-06, 'samples': 26341824, 'steps': 137196, 'loss/train': 1.1586400270462036} 11/07/2021 16:34:35 - INFO - __main__ - Step 137198: {'lr': 9.17561425360222e-06, 'samples': 26342016, 'steps': 137197, 'loss/train': 1.2825735807418823} 11/07/2021 16:34:35 - INFO - __main__ - Step 137199: {'lr': 9.174189787051896e-06, 'samples': 26342208, 'steps': 137198, 'loss/train': 0.7497508525848389} 11/07/2021 16:34:35 - INFO - __main__ - Step 137200: {'lr': 9.172765429013935e-06, 'samples': 26342400, 'steps': 137199, 'loss/train': 1.0672509670257568} 11/07/2021 16:34:36 - INFO - __main__ - Step 137201: {'lr': 9.171341179489035e-06, 'samples': 26342592, 'steps': 137200, 'loss/train': 1.4965647459030151} 11/07/2021 16:34:37 - INFO - __main__ - Step 137202: {'lr': 9.169917038477805e-06, 'samples': 26342784, 'steps': 137201, 'loss/train': 1.102005958557129} 11/07/2021 16:34:37 - INFO - __main__ - Step 137203: {'lr': 9.168493005980882e-06, 'samples': 26342976, 'steps': 137202, 'loss/train': 1.843698263168335} 11/07/2021 16:34:37 - INFO - __main__ - Step 137204: {'lr': 9.167069081998936e-06, 'samples': 26343168, 'steps': 137203, 'loss/train': 1.457945704460144} 11/07/2021 16:34:38 - INFO - __main__ - Step 137205: {'lr': 9.165645266532573e-06, 'samples': 26343360, 'steps': 137204, 'loss/train': 1.0672193765640259} 11/07/2021 16:34:38 - INFO - __main__ - Step 137206: {'lr': 9.164221559582464e-06, 'samples': 26343552, 'steps': 137205, 'loss/train': 1.4953258037567139} 11/07/2021 16:34:39 - INFO - __main__ - Step 137207: {'lr': 9.162797961149244e-06, 'samples': 26343744, 'steps': 137206, 'loss/train': 1.2422842979431152} 11/07/2021 16:34:39 - INFO - __main__ - Step 137208: {'lr': 9.161374471233552e-06, 'samples': 26343936, 'steps': 137207, 'loss/train': 1.2171144485473633} 11/07/2021 16:34:40 - INFO - __main__ - Step 137209: {'lr': 9.159951089836055e-06, 'samples': 26344128, 'steps': 137208, 'loss/train': 1.1157103776931763} 11/07/2021 16:34:40 - INFO - __main__ - Step 137210: {'lr': 9.158527816957334e-06, 'samples': 26344320, 'steps': 137209, 'loss/train': 0.9108512997627258} 11/07/2021 16:34:41 - INFO - __main__ - Step 137211: {'lr': 9.157104652598059e-06, 'samples': 26344512, 'steps': 137210, 'loss/train': 1.399949312210083} 11/07/2021 16:34:42 - INFO - __main__ - Step 137212: {'lr': 9.155681596758892e-06, 'samples': 26344704, 'steps': 137211, 'loss/train': 0.8032096028327942} 11/07/2021 16:34:42 - INFO - __main__ - Step 137213: {'lr': 9.154258649440445e-06, 'samples': 26344896, 'steps': 137212, 'loss/train': 1.001500129699707} 11/07/2021 16:34:42 - INFO - __main__ - Step 137214: {'lr': 9.152835810643384e-06, 'samples': 26345088, 'steps': 137213, 'loss/train': 1.094832181930542} 11/07/2021 16:34:43 - INFO - __main__ - Step 137215: {'lr': 9.15141308036832e-06, 'samples': 26345280, 'steps': 137214, 'loss/train': 1.1482164859771729} 11/07/2021 16:34:43 - INFO - __main__ - Step 137216: {'lr': 9.14999045861592e-06, 'samples': 26345472, 'steps': 137215, 'loss/train': 1.2463287115097046} 11/07/2021 16:34:44 - INFO - __main__ - Step 137217: {'lr': 9.14856794538685e-06, 'samples': 26345664, 'steps': 137216, 'loss/train': 1.040697693824768} 11/07/2021 16:34:44 - INFO - __main__ - Step 137218: {'lr': 9.14714554068169e-06, 'samples': 26345856, 'steps': 137217, 'loss/train': 1.3550612926483154} 11/07/2021 16:34:45 - INFO - __main__ - Step 137219: {'lr': 9.145723244501108e-06, 'samples': 26346048, 'steps': 137218, 'loss/train': 1.4818629026412964} 11/07/2021 16:34:45 - INFO - __main__ - Step 137220: {'lr': 9.144301056845744e-06, 'samples': 26346240, 'steps': 137219, 'loss/train': 1.5612763166427612} 11/07/2021 16:34:45 - INFO - __main__ - Step 137221: {'lr': 9.142878977716235e-06, 'samples': 26346432, 'steps': 137220, 'loss/train': 1.1989129781723022} 11/07/2021 16:34:47 - INFO - __main__ - Step 137222: {'lr': 9.141457007113274e-06, 'samples': 26346624, 'steps': 137221, 'loss/train': 1.2275941371917725} 11/07/2021 16:34:48 - INFO - __main__ - Step 137223: {'lr': 9.140035145037417e-06, 'samples': 26346816, 'steps': 137222, 'loss/train': 1.5498946905136108} 11/07/2021 16:34:48 - INFO - __main__ - Step 137224: {'lr': 9.138613391489358e-06, 'samples': 26347008, 'steps': 137223, 'loss/train': 1.7010093927383423} 11/07/2021 16:34:48 - INFO - __main__ - Step 137225: {'lr': 9.13719174646971e-06, 'samples': 26347200, 'steps': 137224, 'loss/train': 1.666106104850769} 11/07/2021 16:34:49 - INFO - __main__ - Step 137226: {'lr': 9.135770209979133e-06, 'samples': 26347392, 'steps': 137225, 'loss/train': 1.7019920349121094} 11/07/2021 16:34:49 - INFO - __main__ - Step 137227: {'lr': 9.13434878201827e-06, 'samples': 26347584, 'steps': 137226, 'loss/train': 1.9750186204910278} 11/07/2021 16:34:49 - INFO - __main__ - Step 137228: {'lr': 9.132927462587731e-06, 'samples': 26347776, 'steps': 137227, 'loss/train': 1.345660924911499} 11/07/2021 16:34:50 - INFO - __main__ - Step 137229: {'lr': 9.131506251688182e-06, 'samples': 26347968, 'steps': 137228, 'loss/train': 1.2620153427124023} 11/07/2021 16:34:51 - INFO - __main__ - Step 137230: {'lr': 9.130085149320289e-06, 'samples': 26348160, 'steps': 137229, 'loss/train': 1.2425676584243774} 11/07/2021 16:34:51 - INFO - __main__ - Step 137231: {'lr': 9.128664155484633e-06, 'samples': 26348352, 'steps': 137230, 'loss/train': 2.020725727081299} 11/07/2021 16:34:51 - INFO - __main__ - Step 137232: {'lr': 9.127243270181884e-06, 'samples': 26348544, 'steps': 137231, 'loss/train': 1.4661513566970825} 11/07/2021 16:34:52 - INFO - __main__ - Step 137233: {'lr': 9.125822493412677e-06, 'samples': 26348736, 'steps': 137232, 'loss/train': 1.420859932899475} 11/07/2021 16:34:53 - INFO - __main__ - Step 137234: {'lr': 9.12440182517768e-06, 'samples': 26348928, 'steps': 137233, 'loss/train': 1.2616323232650757} 11/07/2021 16:34:53 - INFO - __main__ - Step 137235: {'lr': 9.122981265477504e-06, 'samples': 26349120, 'steps': 137234, 'loss/train': 1.5361721515655518} 11/07/2021 16:34:54 - INFO - __main__ - Step 137236: {'lr': 9.121560814312813e-06, 'samples': 26349312, 'steps': 137235, 'loss/train': 1.640406608581543} 11/07/2021 16:34:54 - INFO - __main__ - Step 137237: {'lr': 9.120140471684219e-06, 'samples': 26349504, 'steps': 137236, 'loss/train': 1.4018833637237549} 11/07/2021 16:34:54 - INFO - __main__ - Step 137238: {'lr': 9.11872023759236e-06, 'samples': 26349696, 'steps': 137237, 'loss/train': 1.330592393875122} 11/07/2021 16:34:55 - INFO - __main__ - Step 137239: {'lr': 9.117300112037902e-06, 'samples': 26349888, 'steps': 137238, 'loss/train': 0.8656807541847229} 11/07/2021 16:34:56 - INFO - __main__ - Step 137240: {'lr': 9.115880095021456e-06, 'samples': 26350080, 'steps': 137239, 'loss/train': 0.9233191013336182} 11/07/2021 16:34:56 - INFO - __main__ - Step 137241: {'lr': 9.114460186543688e-06, 'samples': 26350272, 'steps': 137240, 'loss/train': 0.7652855515480042} 11/07/2021 16:34:56 - INFO - __main__ - Step 137242: {'lr': 9.113040386605209e-06, 'samples': 26350464, 'steps': 137241, 'loss/train': 0.9783901572227478} 11/07/2021 16:34:57 - INFO - __main__ - Step 137243: {'lr': 9.111620695206685e-06, 'samples': 26350656, 'steps': 137242, 'loss/train': 1.48123037815094} 11/07/2021 16:34:59 - INFO - __main__ - Step 137244: {'lr': 9.110201112348753e-06, 'samples': 26350848, 'steps': 137243, 'loss/train': 1.0171918869018555} 11/07/2021 16:34:59 - INFO - __main__ - Step 137245: {'lr': 9.108781638032055e-06, 'samples': 26351040, 'steps': 137244, 'loss/train': 0.7939260601997375} 11/07/2021 16:35:00 - INFO - __main__ - Step 137246: {'lr': 9.107362272257197e-06, 'samples': 26351232, 'steps': 137245, 'loss/train': 0.5320292711257935} 11/07/2021 16:35:00 - INFO - __main__ - Step 137247: {'lr': 9.105943015024904e-06, 'samples': 26351424, 'steps': 137246, 'loss/train': 0.47210144996643066} 11/07/2021 16:35:00 - INFO - __main__ - Step 137248: {'lr': 9.104523866335702e-06, 'samples': 26351616, 'steps': 137247, 'loss/train': 0.7876442074775696} 11/07/2021 16:35:01 - INFO - __main__ - Step 137249: {'lr': 9.103104826190311e-06, 'samples': 26351808, 'steps': 137248, 'loss/train': 0.5468829274177551} 11/07/2021 16:35:01 - INFO - __main__ - Step 137250: {'lr': 9.101685894589318e-06, 'samples': 26352000, 'steps': 137249, 'loss/train': 1.9461933374404907} 11/07/2021 16:35:01 - INFO - __main__ - Step 137251: {'lr': 9.100267071533386e-06, 'samples': 26352192, 'steps': 137250, 'loss/train': 1.4839195013046265} 11/07/2021 16:35:03 - INFO - __main__ - Step 137252: {'lr': 9.098848357023182e-06, 'samples': 26352384, 'steps': 137251, 'loss/train': 1.4566224813461304} 11/07/2021 16:35:03 - INFO - __main__ - Step 137253: {'lr': 9.097429751059316e-06, 'samples': 26352576, 'steps': 137252, 'loss/train': 1.5568526983261108} 11/07/2021 16:35:03 - INFO - __main__ - Step 137254: {'lr': 9.0960112536424e-06, 'samples': 26352768, 'steps': 137253, 'loss/train': 2.0570945739746094} 11/07/2021 16:35:04 - INFO - __main__ - Step 137255: {'lr': 9.094592864773126e-06, 'samples': 26352960, 'steps': 137254, 'loss/train': 0.9152356386184692} 11/07/2021 16:35:04 - INFO - __main__ - Step 137256: {'lr': 9.093174584452107e-06, 'samples': 26353152, 'steps': 137255, 'loss/train': 1.2345809936523438} 11/07/2021 16:35:05 - INFO - __main__ - Step 137257: {'lr': 9.091756412680008e-06, 'samples': 26353344, 'steps': 137256, 'loss/train': 1.0050452947616577} 11/07/2021 16:35:05 - INFO - __main__ - Step 137258: {'lr': 9.09033834945744e-06, 'samples': 26353536, 'steps': 137257, 'loss/train': 0.5196020007133484} 11/07/2021 16:35:06 - INFO - __main__ - Step 137259: {'lr': 9.088920394785038e-06, 'samples': 26353728, 'steps': 137258, 'loss/train': 1.9587390422821045} 11/07/2021 16:35:06 - INFO - __main__ - Step 137260: {'lr': 9.087502548663446e-06, 'samples': 26353920, 'steps': 137259, 'loss/train': 1.3685334920883179} 11/07/2021 16:35:07 - INFO - __main__ - Step 137261: {'lr': 9.086084811093326e-06, 'samples': 26354112, 'steps': 137260, 'loss/train': 1.4323670864105225} 11/07/2021 16:35:07 - INFO - __main__ - Step 137262: {'lr': 9.084667182075262e-06, 'samples': 26354304, 'steps': 137261, 'loss/train': 1.2129361629486084} 11/07/2021 16:35:07 - INFO - __main__ - Step 137263: {'lr': 9.08324966160995e-06, 'samples': 26354496, 'steps': 137262, 'loss/train': 0.9828220009803772} 11/07/2021 16:35:08 - INFO - __main__ - Step 137264: {'lr': 9.081832249698024e-06, 'samples': 26354688, 'steps': 137263, 'loss/train': 1.2468171119689941} 11/07/2021 16:35:09 - INFO - __main__ - Step 137265: {'lr': 9.080414946340071e-06, 'samples': 26354880, 'steps': 137264, 'loss/train': 1.5062495470046997} 11/07/2021 16:35:09 - INFO - __main__ - Step 137266: {'lr': 9.078997751536783e-06, 'samples': 26355072, 'steps': 137265, 'loss/train': 1.2616387605667114} 11/07/2021 16:35:09 - INFO - __main__ - Step 137267: {'lr': 9.077580665288799e-06, 'samples': 26355264, 'steps': 137266, 'loss/train': 0.9232638478279114} 11/07/2021 16:35:10 - INFO - __main__ - Step 137268: {'lr': 9.076163687596728e-06, 'samples': 26355456, 'steps': 137267, 'loss/train': 1.223512053489685} 11/07/2021 16:35:11 - INFO - __main__ - Step 137269: {'lr': 9.074746818461237e-06, 'samples': 26355648, 'steps': 137268, 'loss/train': 1.1709846258163452} 11/07/2021 16:35:11 - INFO - __main__ - Step 137270: {'lr': 9.073330057882939e-06, 'samples': 26355840, 'steps': 137269, 'loss/train': 1.0913310050964355} 11/07/2021 16:35:12 - INFO - __main__ - Step 137271: {'lr': 9.071913405862443e-06, 'samples': 26356032, 'steps': 137270, 'loss/train': 1.0222373008728027} 11/07/2021 16:35:12 - INFO - __main__ - Step 137272: {'lr': 9.070496862400468e-06, 'samples': 26356224, 'steps': 137271, 'loss/train': 0.984302818775177} 11/07/2021 16:35:12 - INFO - __main__ - Step 137273: {'lr': 9.069080427497572e-06, 'samples': 26356416, 'steps': 137272, 'loss/train': 1.0898486375808716} 11/07/2021 16:35:13 - INFO - __main__ - Step 137274: {'lr': 9.067664101154476e-06, 'samples': 26356608, 'steps': 137273, 'loss/train': 1.3538404703140259} 11/07/2021 16:35:14 - INFO - __main__ - Step 137275: {'lr': 9.066247883371736e-06, 'samples': 26356800, 'steps': 137274, 'loss/train': 1.461632251739502} 11/07/2021 16:35:14 - INFO - __main__ - Step 137276: {'lr': 9.064831774150045e-06, 'samples': 26356992, 'steps': 137275, 'loss/train': 0.9492358565330505} 11/07/2021 16:35:14 - INFO - __main__ - Step 137277: {'lr': 9.063415773490014e-06, 'samples': 26357184, 'steps': 137276, 'loss/train': 1.2868765592575073} 11/07/2021 16:35:15 - INFO - __main__ - Step 137278: {'lr': 9.06199988139228e-06, 'samples': 26357376, 'steps': 137277, 'loss/train': 1.2950356006622314} 11/07/2021 16:35:16 - INFO - __main__ - Step 137279: {'lr': 9.06058409785751e-06, 'samples': 26357568, 'steps': 137278, 'loss/train': 1.1169415712356567} 11/07/2021 16:35:16 - INFO - __main__ - Step 137280: {'lr': 9.059168422886344e-06, 'samples': 26357760, 'steps': 137279, 'loss/train': 0.655487596988678} 11/07/2021 16:35:16 - INFO - __main__ - Step 137281: {'lr': 9.057752856479362e-06, 'samples': 26357952, 'steps': 137280, 'loss/train': 1.5077892541885376} 11/07/2021 16:35:17 - INFO - __main__ - Step 137282: {'lr': 9.05633739863726e-06, 'samples': 26358144, 'steps': 137281, 'loss/train': 0.4425649046897888} 11/07/2021 16:35:17 - INFO - __main__ - Step 137283: {'lr': 9.05492204936062e-06, 'samples': 26358336, 'steps': 137282, 'loss/train': 1.2993804216384888} 11/07/2021 16:35:18 - INFO - __main__ - Step 137284: {'lr': 9.053506808650136e-06, 'samples': 26358528, 'steps': 137283, 'loss/train': 1.7593446969985962} 11/07/2021 16:35:18 - INFO - __main__ - Step 137285: {'lr': 9.052091676506419e-06, 'samples': 26358720, 'steps': 137284, 'loss/train': 1.4092479944229126} 11/07/2021 16:35:19 - INFO - __main__ - Step 137286: {'lr': 9.050676652930134e-06, 'samples': 26358912, 'steps': 137285, 'loss/train': 1.2653515338897705} 11/07/2021 16:35:19 - INFO - __main__ - Step 137287: {'lr': 9.049261737921866e-06, 'samples': 26359104, 'steps': 137286, 'loss/train': 1.4316154718399048} 11/07/2021 16:35:20 - INFO - __main__ - Step 137288: {'lr': 9.047846931482306e-06, 'samples': 26359296, 'steps': 137287, 'loss/train': 1.4085981845855713} 11/07/2021 16:35:21 - INFO - __main__ - Step 137289: {'lr': 9.04643223361204e-06, 'samples': 26359488, 'steps': 137288, 'loss/train': 1.1704134941101074} 11/07/2021 16:35:21 - INFO - __main__ - Step 137290: {'lr': 9.045017644311787e-06, 'samples': 26359680, 'steps': 137289, 'loss/train': 1.1249982118606567} 11/07/2021 16:35:21 - INFO - __main__ - Step 137291: {'lr': 9.043603163582104e-06, 'samples': 26359872, 'steps': 137290, 'loss/train': 0.9667947888374329} 11/07/2021 16:35:22 - INFO - __main__ - Step 137292: {'lr': 9.042188791423628e-06, 'samples': 26360064, 'steps': 137291, 'loss/train': 0.8101268410682678} 11/07/2021 16:35:22 - INFO - __main__ - Step 137293: {'lr': 9.040774527837054e-06, 'samples': 26360256, 'steps': 137292, 'loss/train': 1.2086846828460693} 11/07/2021 16:35:23 - INFO - __main__ - Step 137294: {'lr': 9.039360372822964e-06, 'samples': 26360448, 'steps': 137293, 'loss/train': 0.7474721074104309} 11/07/2021 16:35:23 - INFO - __main__ - Step 137295: {'lr': 9.037946326382025e-06, 'samples': 26360640, 'steps': 137294, 'loss/train': 1.4694839715957642} 11/07/2021 16:35:24 - INFO - __main__ - Step 137296: {'lr': 9.036532388514873e-06, 'samples': 26360832, 'steps': 137295, 'loss/train': 1.5436595678329468} 11/07/2021 16:35:24 - INFO - __main__ - Step 137297: {'lr': 9.03511855922215e-06, 'samples': 26361024, 'steps': 137296, 'loss/train': 1.0777068138122559} 11/07/2021 16:35:24 - INFO - __main__ - Step 137298: {'lr': 9.033704838504492e-06, 'samples': 26361216, 'steps': 137297, 'loss/train': 1.5910323858261108} 11/07/2021 16:35:26 - INFO - __main__ - Step 137299: {'lr': 9.03229122636251e-06, 'samples': 26361408, 'steps': 137298, 'loss/train': 1.4422563314437866} 11/07/2021 16:35:26 - INFO - __main__ - Step 137300: {'lr': 9.030877722796843e-06, 'samples': 26361600, 'steps': 137299, 'loss/train': 1.3290033340454102} 11/07/2021 16:35:26 - INFO - __main__ - Step 137301: {'lr': 9.029464327808185e-06, 'samples': 26361792, 'steps': 137300, 'loss/train': 0.5520823001861572} 11/07/2021 16:35:27 - INFO - __main__ - Step 137302: {'lr': 9.028051041397089e-06, 'samples': 26361984, 'steps': 137301, 'loss/train': 1.173396110534668} 11/07/2021 16:35:27 - INFO - __main__ - Step 137303: {'lr': 9.026637863564307e-06, 'samples': 26362176, 'steps': 137302, 'loss/train': 1.328078031539917} 11/07/2021 16:35:27 - INFO - __main__ - Step 137304: {'lr': 9.025224794310339e-06, 'samples': 26362368, 'steps': 137303, 'loss/train': 1.1813298463821411} 11/07/2021 16:35:28 - INFO - __main__ - Step 137305: {'lr': 9.023811833635903e-06, 'samples': 26362560, 'steps': 137304, 'loss/train': 1.162799596786499} 11/07/2021 16:35:29 - INFO - __main__ - Step 137306: {'lr': 9.022398981541613e-06, 'samples': 26362752, 'steps': 137305, 'loss/train': 1.241944432258606} 11/07/2021 16:35:29 - INFO - __main__ - Step 137307: {'lr': 9.020986238028105e-06, 'samples': 26362944, 'steps': 137306, 'loss/train': 1.1914139986038208} 11/07/2021 16:35:29 - INFO - __main__ - Step 137308: {'lr': 9.019573603096048e-06, 'samples': 26363136, 'steps': 137307, 'loss/train': 1.3874651193618774} 11/07/2021 16:35:30 - INFO - __main__ - Step 137309: {'lr': 9.018161076746023e-06, 'samples': 26363328, 'steps': 137308, 'loss/train': 0.8667387366294861} 11/07/2021 16:35:31 - INFO - __main__ - Step 137310: {'lr': 9.016748658978723e-06, 'samples': 26363520, 'steps': 137309, 'loss/train': 1.9138456583023071} 11/07/2021 16:35:31 - INFO - __main__ - Step 137311: {'lr': 9.015336349794734e-06, 'samples': 26363712, 'steps': 137310, 'loss/train': 1.453348159790039} 11/07/2021 16:35:32 - INFO - __main__ - Step 137312: {'lr': 9.013924149194746e-06, 'samples': 26363904, 'steps': 137311, 'loss/train': 1.027488350868225} 11/07/2021 16:35:32 - INFO - __main__ - Step 137313: {'lr': 9.012512057179345e-06, 'samples': 26364096, 'steps': 137312, 'loss/train': 1.289856195449829} 11/07/2021 16:35:32 - INFO - __main__ - Step 137314: {'lr': 9.011100073749167e-06, 'samples': 26364288, 'steps': 137313, 'loss/train': 1.3038097620010376} 11/07/2021 16:35:34 - INFO - __main__ - Step 137315: {'lr': 9.009688198904908e-06, 'samples': 26364480, 'steps': 137314, 'loss/train': 1.1324377059936523} 11/07/2021 16:35:34 - INFO - __main__ - Step 137316: {'lr': 9.008276432647178e-06, 'samples': 26364672, 'steps': 137315, 'loss/train': 1.2547723054885864} 11/07/2021 16:35:34 - INFO - __main__ - Step 137317: {'lr': 9.006864774976559e-06, 'samples': 26364864, 'steps': 137316, 'loss/train': 1.2367746829986572} 11/07/2021 16:35:35 - INFO - __main__ - Step 137318: {'lr': 9.005453225893745e-06, 'samples': 26365056, 'steps': 137317, 'loss/train': 1.0834290981292725} 11/07/2021 16:35:35 - INFO - __main__ - Step 137319: {'lr': 9.00404178539932e-06, 'samples': 26365248, 'steps': 137318, 'loss/train': 1.625543475151062} 11/07/2021 16:35:35 - INFO - __main__ - Step 137320: {'lr': 9.002630453494004e-06, 'samples': 26365440, 'steps': 137319, 'loss/train': 1.3492904901504517} 11/07/2021 16:35:36 - INFO - __main__ - Step 137321: {'lr': 9.001219230178354e-06, 'samples': 26365632, 'steps': 137320, 'loss/train': 1.452185869216919} 11/07/2021 16:35:37 - INFO - __main__ - Step 137322: {'lr': 8.999808115453034e-06, 'samples': 26365824, 'steps': 137321, 'loss/train': 1.8582350015640259} 11/07/2021 16:35:37 - INFO - __main__ - Step 137323: {'lr': 8.998397109318684e-06, 'samples': 26366016, 'steps': 137322, 'loss/train': 1.5082100629806519} 11/07/2021 16:35:37 - INFO - __main__ - Step 137324: {'lr': 8.99698621177597e-06, 'samples': 26366208, 'steps': 137323, 'loss/train': 0.8483145236968994} 11/07/2021 16:35:38 - INFO - __main__ - Step 137325: {'lr': 8.995575422825448e-06, 'samples': 26366400, 'steps': 137324, 'loss/train': 1.6882961988449097} 11/07/2021 16:35:39 - INFO - __main__ - Step 137326: {'lr': 8.994164742467837e-06, 'samples': 26366592, 'steps': 137325, 'loss/train': 1.2673364877700806} 11/07/2021 16:35:39 - INFO - __main__ - Step 137327: {'lr': 8.992754170703721e-06, 'samples': 26366784, 'steps': 137326, 'loss/train': 0.9053921103477478} 11/07/2021 16:35:40 - INFO - __main__ - Step 137328: {'lr': 8.991343707533738e-06, 'samples': 26366976, 'steps': 137327, 'loss/train': 1.5173752307891846} 11/07/2021 16:35:40 - INFO - __main__ - Step 137329: {'lr': 8.989933352958557e-06, 'samples': 26367168, 'steps': 137328, 'loss/train': 1.5628434419631958} 11/07/2021 16:35:40 - INFO - __main__ - Step 137330: {'lr': 8.988523106978813e-06, 'samples': 26367360, 'steps': 137329, 'loss/train': 1.5343974828720093} 11/07/2021 16:35:41 - INFO - __main__ - Step 137331: {'lr': 8.98711296959509e-06, 'samples': 26367552, 'steps': 137330, 'loss/train': 1.2181280851364136} 11/07/2021 16:35:42 - INFO - __main__ - Step 137332: {'lr': 8.985702940808054e-06, 'samples': 26367744, 'steps': 137331, 'loss/train': 1.6037601232528687} 11/07/2021 16:35:42 - INFO - __main__ - Step 137333: {'lr': 8.984293020618373e-06, 'samples': 26367936, 'steps': 137332, 'loss/train': 1.2807042598724365} 11/07/2021 16:35:42 - INFO - __main__ - Step 137334: {'lr': 8.982883209026598e-06, 'samples': 26368128, 'steps': 137333, 'loss/train': 1.4277551174163818} 11/07/2021 16:35:43 - INFO - __main__ - Step 137335: {'lr': 8.981473506033456e-06, 'samples': 26368320, 'steps': 137334, 'loss/train': 1.2133959531784058} 11/07/2021 16:35:44 - INFO - __main__ - Step 137336: {'lr': 8.980063911639524e-06, 'samples': 26368512, 'steps': 137335, 'loss/train': 1.1964287757873535} 11/07/2021 16:35:44 - INFO - __main__ - Step 137337: {'lr': 8.978654425845472e-06, 'samples': 26368704, 'steps': 137336, 'loss/train': 1.1156072616577148} 11/07/2021 16:35:44 - INFO - __main__ - Step 137338: {'lr': 8.977245048651911e-06, 'samples': 26368896, 'steps': 137337, 'loss/train': 1.0988236665725708} 11/07/2021 16:35:45 - INFO - __main__ - Step 137339: {'lr': 8.975835780059477e-06, 'samples': 26369088, 'steps': 137338, 'loss/train': 1.5657163858413696} 11/07/2021 16:35:45 - INFO - __main__ - Step 137340: {'lr': 8.97442662006881e-06, 'samples': 26369280, 'steps': 137339, 'loss/train': 1.4523824453353882} 11/07/2021 16:35:46 - INFO - __main__ - Step 137341: {'lr': 8.973017568680547e-06, 'samples': 26369472, 'steps': 137340, 'loss/train': 1.4143378734588623} 11/07/2021 16:35:47 - INFO - __main__ - Step 137342: {'lr': 8.971608625895356e-06, 'samples': 26369664, 'steps': 137341, 'loss/train': 1.3153977394104004} 11/07/2021 16:35:47 - INFO - __main__ - Step 137343: {'lr': 8.970199791713818e-06, 'samples': 26369856, 'steps': 137342, 'loss/train': 1.3577219247817993} 11/07/2021 16:35:47 - INFO - __main__ - Step 137344: {'lr': 8.9687910661366e-06, 'samples': 26370048, 'steps': 137343, 'loss/train': 1.269587755203247} 11/07/2021 16:35:48 - INFO - __main__ - Step 137345: {'lr': 8.967382449164313e-06, 'samples': 26370240, 'steps': 137344, 'loss/train': 1.2120537757873535} 11/07/2021 16:35:49 - INFO - __main__ - Step 137346: {'lr': 8.965973940797595e-06, 'samples': 26370432, 'steps': 137345, 'loss/train': 1.2228261232376099} 11/07/2021 16:35:49 - INFO - __main__ - Step 137347: {'lr': 8.964565541037084e-06, 'samples': 26370624, 'steps': 137346, 'loss/train': 1.381270170211792} 11/07/2021 16:35:49 - INFO - __main__ - Step 137348: {'lr': 8.963157249883447e-06, 'samples': 26370816, 'steps': 137347, 'loss/train': 1.0991028547286987} 11/07/2021 16:35:50 - INFO - __main__ - Step 137349: {'lr': 8.961749067337266e-06, 'samples': 26371008, 'steps': 137348, 'loss/train': 1.0864475965499878} 11/07/2021 16:35:50 - INFO - __main__ - Step 137350: {'lr': 8.960340993399208e-06, 'samples': 26371200, 'steps': 137349, 'loss/train': 1.0473403930664062} 11/07/2021 16:35:51 - INFO - __main__ - Step 137351: {'lr': 8.95893302806991e-06, 'samples': 26371392, 'steps': 137350, 'loss/train': 1.0701053142547607} 11/07/2021 16:35:51 - INFO - __main__ - Step 137352: {'lr': 8.957525171349983e-06, 'samples': 26371584, 'steps': 137351, 'loss/train': 1.0844703912734985} 11/07/2021 16:35:52 - INFO - __main__ - Step 137353: {'lr': 8.956117423240096e-06, 'samples': 26371776, 'steps': 137352, 'loss/train': 0.4416721761226654} 11/07/2021 16:35:52 - INFO - __main__ - Step 137354: {'lr': 8.954709783740855e-06, 'samples': 26371968, 'steps': 137353, 'loss/train': 1.0431160926818848} 11/07/2021 16:35:52 - INFO - __main__ - Step 137355: {'lr': 8.953302252852902e-06, 'samples': 26372160, 'steps': 137354, 'loss/train': 1.204431176185608} 11/07/2021 16:35:54 - INFO - __main__ - Step 137356: {'lr': 8.951894830576873e-06, 'samples': 26372352, 'steps': 137355, 'loss/train': 1.2554728984832764} 11/07/2021 16:35:54 - INFO - __main__ - Step 137357: {'lr': 8.950487516913407e-06, 'samples': 26372544, 'steps': 137356, 'loss/train': 1.2983242273330688} 11/07/2021 16:35:54 - INFO - __main__ - Step 137358: {'lr': 8.949080311863117e-06, 'samples': 26372736, 'steps': 137357, 'loss/train': 1.6517119407653809} 11/07/2021 16:35:55 - INFO - __main__ - Step 137359: {'lr': 8.947673215426666e-06, 'samples': 26372928, 'steps': 137358, 'loss/train': 1.3925299644470215} 11/07/2021 16:35:55 - INFO - __main__ - Step 137360: {'lr': 8.946266227604666e-06, 'samples': 26373120, 'steps': 137359, 'loss/train': 1.557184100151062} 11/07/2021 16:35:55 - INFO - __main__ - Step 137361: {'lr': 8.944859348397754e-06, 'samples': 26373312, 'steps': 137360, 'loss/train': 0.9694837927818298} 11/07/2021 16:35:56 - INFO - __main__ - Step 137362: {'lr': 8.943452577806571e-06, 'samples': 26373504, 'steps': 137361, 'loss/train': 0.8794465661048889} 11/07/2021 16:35:57 - INFO - __main__ - Step 137363: {'lr': 8.942045915831753e-06, 'samples': 26373696, 'steps': 137362, 'loss/train': 1.0577638149261475} 11/07/2021 16:35:57 - INFO - __main__ - Step 137364: {'lr': 8.940639362473913e-06, 'samples': 26373888, 'steps': 137363, 'loss/train': 1.3336046934127808} 11/07/2021 16:35:57 - INFO - __main__ - Step 137365: {'lr': 8.939232917733713e-06, 'samples': 26374080, 'steps': 137364, 'loss/train': 1.090819001197815} 11/07/2021 16:35:58 - INFO - __main__ - Step 137366: {'lr': 8.937826581611769e-06, 'samples': 26374272, 'steps': 137365, 'loss/train': 0.3458479046821594} 11/07/2021 16:35:59 - INFO - __main__ - Step 137367: {'lr': 8.936420354108743e-06, 'samples': 26374464, 'steps': 137366, 'loss/train': 1.2909204959869385} 11/07/2021 16:35:59 - INFO - __main__ - Step 137368: {'lr': 8.93501423522522e-06, 'samples': 26374656, 'steps': 137367, 'loss/train': 1.4893126487731934} 11/07/2021 16:35:59 - INFO - __main__ - Step 137369: {'lr': 8.933608224961865e-06, 'samples': 26374848, 'steps': 137368, 'loss/train': 1.0241203308105469} 11/07/2021 16:36:00 - INFO - __main__ - Step 137370: {'lr': 8.932202323319343e-06, 'samples': 26375040, 'steps': 137369, 'loss/train': 1.026477336883545} 11/07/2021 16:36:00 - INFO - __main__ - Step 137371: {'lr': 8.930796530298212e-06, 'samples': 26375232, 'steps': 137370, 'loss/train': 1.4062139987945557} 11/07/2021 16:36:01 - INFO - __main__ - Step 137372: {'lr': 8.929390845899165e-06, 'samples': 26375424, 'steps': 137371, 'loss/train': 1.205973744392395} 11/07/2021 16:36:02 - INFO - __main__ - Step 137373: {'lr': 8.927985270122785e-06, 'samples': 26375616, 'steps': 137372, 'loss/train': 0.963722825050354} 11/07/2021 16:36:02 - INFO - __main__ - Step 137374: {'lr': 8.926579802969764e-06, 'samples': 26375808, 'steps': 137373, 'loss/train': 1.1325057744979858} 11/07/2021 16:36:02 - INFO - __main__ - Step 137375: {'lr': 8.925174444440687e-06, 'samples': 26376000, 'steps': 137374, 'loss/train': 0.7156855463981628} 11/07/2021 16:36:03 - INFO - __main__ - Step 137376: {'lr': 8.923769194536218e-06, 'samples': 26376192, 'steps': 137375, 'loss/train': 0.9668420553207397} 11/07/2021 16:36:04 - INFO - __main__ - Step 137377: {'lr': 8.92236405325697e-06, 'samples': 26376384, 'steps': 137376, 'loss/train': 1.6692180633544922} 11/07/2021 16:36:04 - INFO - __main__ - Step 137378: {'lr': 8.920959020603581e-06, 'samples': 26376576, 'steps': 137377, 'loss/train': 0.9947736263275146} 11/07/2021 16:36:04 - INFO - __main__ - Step 137379: {'lr': 8.919554096576687e-06, 'samples': 26376768, 'steps': 137378, 'loss/train': 0.7554694414138794} 11/07/2021 16:36:05 - INFO - __main__ - Step 137380: {'lr': 8.91814928117693e-06, 'samples': 26376960, 'steps': 137379, 'loss/train': 1.0537664890289307} 11/07/2021 16:36:05 - INFO - __main__ - Step 137381: {'lr': 8.916744574404945e-06, 'samples': 26377152, 'steps': 137380, 'loss/train': 1.63579261302948} 11/07/2021 16:36:06 - INFO - __main__ - Step 137382: {'lr': 8.915339976261316e-06, 'samples': 26377344, 'steps': 137381, 'loss/train': 1.3585034608840942} 11/07/2021 16:36:06 - INFO - __main__ - Step 137383: {'lr': 8.913935486746765e-06, 'samples': 26377536, 'steps': 137382, 'loss/train': 1.3789029121398926} 11/07/2021 16:36:07 - INFO - __main__ - Step 137384: {'lr': 8.912531105861876e-06, 'samples': 26377728, 'steps': 137383, 'loss/train': 1.281812071800232} 11/07/2021 16:36:07 - INFO - __main__ - Step 137385: {'lr': 8.911126833607258e-06, 'samples': 26377920, 'steps': 137384, 'loss/train': 1.3946105241775513} 11/07/2021 16:36:08 - INFO - __main__ - Step 137386: {'lr': 8.90972266998355e-06, 'samples': 26378112, 'steps': 137385, 'loss/train': 0.956156849861145} 11/07/2021 16:36:09 - INFO - __main__ - Step 137387: {'lr': 8.908318614991417e-06, 'samples': 26378304, 'steps': 137386, 'loss/train': 0.7619863748550415} 11/07/2021 16:36:09 - INFO - __main__ - Step 137388: {'lr': 8.906914668631472e-06, 'samples': 26378496, 'steps': 137387, 'loss/train': 0.8576258420944214} 11/07/2021 16:36:09 - INFO - __main__ - Step 137389: {'lr': 8.905510830904351e-06, 'samples': 26378688, 'steps': 137388, 'loss/train': 1.5684858560562134} 11/07/2021 16:36:10 - INFO - __main__ - Step 137390: {'lr': 8.904107101810693e-06, 'samples': 26378880, 'steps': 137389, 'loss/train': 1.9027211666107178} 11/07/2021 16:36:10 - INFO - __main__ - Step 137391: {'lr': 8.90270348135111e-06, 'samples': 26379072, 'steps': 137390, 'loss/train': 0.06907250732183456} 11/07/2021 16:36:11 - INFO - __main__ - Step 137392: {'lr': 8.901299969526266e-06, 'samples': 26379264, 'steps': 137391, 'loss/train': 1.3047709465026855} 11/07/2021 16:36:12 - INFO - __main__ - Step 137393: {'lr': 8.899896566336745e-06, 'samples': 26379456, 'steps': 137392, 'loss/train': 1.5060900449752808} 11/07/2021 16:36:12 - INFO - __main__ - Step 137394: {'lr': 8.898493271783242e-06, 'samples': 26379648, 'steps': 137393, 'loss/train': 0.3025842308998108} 11/07/2021 16:36:12 - INFO - __main__ - Step 137395: {'lr': 8.897090085866338e-06, 'samples': 26379840, 'steps': 137394, 'loss/train': 1.3417702913284302} 11/07/2021 16:36:13 - INFO - __main__ - Step 137396: {'lr': 8.8956870085867e-06, 'samples': 26380032, 'steps': 137395, 'loss/train': 0.886598527431488} 11/07/2021 16:36:14 - INFO - __main__ - Step 137397: {'lr': 8.894284039944966e-06, 'samples': 26380224, 'steps': 137396, 'loss/train': 1.254569172859192} 11/07/2021 16:36:14 - INFO - __main__ - Step 137398: {'lr': 8.89288117994172e-06, 'samples': 26380416, 'steps': 137397, 'loss/train': 1.1868693828582764} 11/07/2021 16:36:14 - INFO - __main__ - Step 137399: {'lr': 8.891478428577627e-06, 'samples': 26380608, 'steps': 137398, 'loss/train': 1.201311707496643} 11/07/2021 16:36:15 - INFO - __main__ - Step 137400: {'lr': 8.890075785853297e-06, 'samples': 26380800, 'steps': 137399, 'loss/train': 1.3897935152053833} 11/07/2021 16:36:15 - INFO - __main__ - Step 137401: {'lr': 8.888673251769396e-06, 'samples': 26380992, 'steps': 137400, 'loss/train': 1.396255373954773} 11/07/2021 16:36:16 - INFO - __main__ - Step 137402: {'lr': 8.887270826326537e-06, 'samples': 26381184, 'steps': 137401, 'loss/train': 1.0068196058273315} 11/07/2021 16:36:17 - INFO - __main__ - Step 137403: {'lr': 8.88586850952533e-06, 'samples': 26381376, 'steps': 137402, 'loss/train': 1.2219923734664917} 11/07/2021 16:36:17 - INFO - __main__ - Step 137404: {'lr': 8.88446630136644e-06, 'samples': 26381568, 'steps': 137403, 'loss/train': 0.5344817638397217} 11/07/2021 16:36:17 - INFO - __main__ - Step 137405: {'lr': 8.883064201850506e-06, 'samples': 26381760, 'steps': 137404, 'loss/train': 1.1471346616744995} 11/07/2021 16:36:18 - INFO - __main__ - Step 137406: {'lr': 8.881662210978136e-06, 'samples': 26381952, 'steps': 137405, 'loss/train': 1.404489517211914} 11/07/2021 16:36:19 - INFO - __main__ - Step 137407: {'lr': 8.880260328749973e-06, 'samples': 26382144, 'steps': 137406, 'loss/train': 1.3211992979049683} 11/07/2021 16:36:19 - INFO - __main__ - Step 137408: {'lr': 8.878858555166624e-06, 'samples': 26382336, 'steps': 137407, 'loss/train': 1.2125442028045654} 11/07/2021 16:36:19 - INFO - __main__ - Step 137409: {'lr': 8.877456890228758e-06, 'samples': 26382528, 'steps': 137408, 'loss/train': 0.5614283680915833} 11/07/2021 16:36:20 - INFO - __main__ - Step 137410: {'lr': 8.876055333937012e-06, 'samples': 26382720, 'steps': 137409, 'loss/train': 1.0016599893569946} 11/07/2021 16:36:20 - INFO - __main__ - Step 137411: {'lr': 8.874653886291967e-06, 'samples': 26382912, 'steps': 137410, 'loss/train': 0.7723680138587952} 11/07/2021 16:36:21 - INFO - __main__ - Step 137412: {'lr': 8.873252547294264e-06, 'samples': 26383104, 'steps': 137411, 'loss/train': 1.3807636499404907} 11/07/2021 16:36:21 - INFO - __main__ - Step 137413: {'lr': 8.871851316944569e-06, 'samples': 26383296, 'steps': 137412, 'loss/train': 1.0695806741714478} 11/07/2021 16:36:22 - INFO - __main__ - Step 137414: {'lr': 8.870450195243518e-06, 'samples': 26383488, 'steps': 137413, 'loss/train': 1.4675605297088623} 11/07/2021 16:36:22 - INFO - __main__ - Step 137415: {'lr': 8.869049182191696e-06, 'samples': 26383680, 'steps': 137414, 'loss/train': 1.0691226720809937} 11/07/2021 16:36:22 - INFO - __main__ - Step 137416: {'lr': 8.867648277789769e-06, 'samples': 26383872, 'steps': 137415, 'loss/train': 1.303730845451355} 11/07/2021 16:36:23 - INFO - __main__ - Step 137417: {'lr': 8.866247482038348e-06, 'samples': 26384064, 'steps': 137416, 'loss/train': 1.1330302953720093} 11/07/2021 16:36:24 - INFO - __main__ - Step 137418: {'lr': 8.864846794938069e-06, 'samples': 26384256, 'steps': 137417, 'loss/train': 1.205001711845398} 11/07/2021 16:36:24 - INFO - __main__ - Step 137419: {'lr': 8.863446216489573e-06, 'samples': 26384448, 'steps': 137418, 'loss/train': 1.2525123357772827} 11/07/2021 16:36:24 - INFO - __main__ - Step 137420: {'lr': 8.862045746693498e-06, 'samples': 26384640, 'steps': 137419, 'loss/train': 1.1257506608963013} 11/07/2021 16:36:25 - INFO - __main__ - Step 137421: {'lr': 8.860645385550481e-06, 'samples': 26384832, 'steps': 137420, 'loss/train': 0.2690235674381256} 11/07/2021 16:36:26 - INFO - __main__ - Step 137422: {'lr': 8.859245133061105e-06, 'samples': 26385024, 'steps': 137421, 'loss/train': 1.0084584951400757} 11/07/2021 16:36:26 - INFO - __main__ - Step 137423: {'lr': 8.857844989226039e-06, 'samples': 26385216, 'steps': 137422, 'loss/train': 1.2494115829467773} 11/07/2021 16:36:27 - INFO - __main__ - Step 137424: {'lr': 8.856444954045945e-06, 'samples': 26385408, 'steps': 137423, 'loss/train': 1.2107150554656982} 11/07/2021 16:36:27 - INFO - __main__ - Step 137425: {'lr': 8.85504502752138e-06, 'samples': 26385600, 'steps': 137424, 'loss/train': 1.3413821458816528} 11/07/2021 16:36:27 - INFO - __main__ - Step 137426: {'lr': 8.85364520965301e-06, 'samples': 26385792, 'steps': 137425, 'loss/train': 0.840765118598938} 11/07/2021 16:36:28 - INFO - __main__ - Step 137427: {'lr': 8.852245500441474e-06, 'samples': 26385984, 'steps': 137426, 'loss/train': 1.4200310707092285} 11/07/2021 16:36:29 - INFO - __main__ - Step 137428: {'lr': 8.850845899887383e-06, 'samples': 26386176, 'steps': 137427, 'loss/train': 1.0215928554534912} 11/07/2021 16:36:29 - INFO - __main__ - Step 137429: {'lr': 8.849446407991401e-06, 'samples': 26386368, 'steps': 137428, 'loss/train': 0.6657443046569824} 11/07/2021 16:36:29 - INFO - __main__ - Step 137430: {'lr': 8.848047024754114e-06, 'samples': 26386560, 'steps': 137429, 'loss/train': 0.9397507905960083} 11/07/2021 16:36:30 - INFO - __main__ - Step 137431: {'lr': 8.846647750176184e-06, 'samples': 26386752, 'steps': 137430, 'loss/train': 0.8647831082344055} 11/07/2021 16:36:30 - INFO - __main__ - Step 137432: {'lr': 8.845248584258252e-06, 'samples': 26386944, 'steps': 137431, 'loss/train': 1.3009848594665527} 11/07/2021 16:36:31 - INFO - __main__ - Step 137433: {'lr': 8.843849527000902e-06, 'samples': 26387136, 'steps': 137432, 'loss/train': 1.9015687704086304} 11/07/2021 16:36:32 - INFO - __main__ - Step 137434: {'lr': 8.842450578404798e-06, 'samples': 26387328, 'steps': 137433, 'loss/train': 1.0886750221252441} 11/07/2021 16:36:32 - INFO - __main__ - Step 137435: {'lr': 8.84105173847058e-06, 'samples': 26387520, 'steps': 137434, 'loss/train': 1.2258875370025635} 11/07/2021 16:36:32 - INFO - __main__ - Step 137436: {'lr': 8.839653007198856e-06, 'samples': 26387712, 'steps': 137435, 'loss/train': 1.6222814321517944} 11/07/2021 16:36:33 - INFO - __main__ - Step 137437: {'lr': 8.838254384590294e-06, 'samples': 26387904, 'steps': 137436, 'loss/train': 0.5710427165031433} 11/07/2021 16:36:34 - INFO - __main__ - Step 137438: {'lr': 8.83685587064545e-06, 'samples': 26388096, 'steps': 137437, 'loss/train': 1.3319660425186157} 11/07/2021 16:36:34 - INFO - __main__ - Step 137439: {'lr': 8.835457465364988e-06, 'samples': 26388288, 'steps': 137438, 'loss/train': 1.0135694742202759} 11/07/2021 16:36:35 - INFO - __main__ - Step 137440: {'lr': 8.834059168749575e-06, 'samples': 26388480, 'steps': 137439, 'loss/train': 1.0987951755523682} 11/07/2021 16:36:35 - INFO - __main__ - Step 137441: {'lr': 8.832660980799795e-06, 'samples': 26388672, 'steps': 137440, 'loss/train': 1.7696504592895508} 11/07/2021 16:36:35 - INFO - __main__ - Step 137442: {'lr': 8.831262901516313e-06, 'samples': 26388864, 'steps': 137441, 'loss/train': 1.561927080154419} 11/07/2021 16:36:36 - INFO - __main__ - Step 137443: {'lr': 8.829864930899739e-06, 'samples': 26389056, 'steps': 137442, 'loss/train': 0.8129285573959351} 11/07/2021 16:36:37 - INFO - __main__ - Step 137444: {'lr': 8.828467068950713e-06, 'samples': 26389248, 'steps': 137443, 'loss/train': 1.4413141012191772} 11/07/2021 16:36:37 - INFO - __main__ - Step 137445: {'lr': 8.827069315669844e-06, 'samples': 26389440, 'steps': 137444, 'loss/train': 1.2560274600982666} 11/07/2021 16:36:37 - INFO - __main__ - Step 137446: {'lr': 8.825671671057773e-06, 'samples': 26389632, 'steps': 137445, 'loss/train': 0.7415681481361389} 11/07/2021 16:36:38 - INFO - __main__ - Step 137447: {'lr': 8.824274135115135e-06, 'samples': 26389824, 'steps': 137446, 'loss/train': 1.0178618431091309} 11/07/2021 16:36:39 - INFO - __main__ - Step 137448: {'lr': 8.82287670784257e-06, 'samples': 26390016, 'steps': 137447, 'loss/train': 1.6416219472885132} 11/07/2021 16:36:39 - INFO - __main__ - Step 137449: {'lr': 8.821479389240688e-06, 'samples': 26390208, 'steps': 137448, 'loss/train': 1.2446430921554565} 11/07/2021 16:36:39 - INFO - __main__ - Step 137450: {'lr': 8.820082179310102e-06, 'samples': 26390400, 'steps': 137449, 'loss/train': 0.7148875594139099} 11/07/2021 16:36:40 - INFO - __main__ - Step 137451: {'lr': 8.818685078051531e-06, 'samples': 26390592, 'steps': 137450, 'loss/train': 1.066440463066101} 11/07/2021 16:36:40 - INFO - __main__ - Step 137452: {'lr': 8.817288085465502e-06, 'samples': 26390784, 'steps': 137451, 'loss/train': 0.469267874956131} 11/07/2021 16:36:41 - INFO - __main__ - Step 137453: {'lr': 8.815891201552655e-06, 'samples': 26390976, 'steps': 137452, 'loss/train': 1.196619987487793} 11/07/2021 16:36:41 - INFO - __main__ - Step 137454: {'lr': 8.814494426313685e-06, 'samples': 26391168, 'steps': 137453, 'loss/train': 1.1117480993270874} 11/07/2021 16:36:42 - INFO - __main__ - Step 137455: {'lr': 8.813097759749145e-06, 'samples': 26391360, 'steps': 137454, 'loss/train': 1.355571985244751} 11/07/2021 16:36:42 - INFO - __main__ - Step 137456: {'lr': 8.811701201859729e-06, 'samples': 26391552, 'steps': 137455, 'loss/train': 1.202976107597351} 11/07/2021 16:36:43 - INFO - __main__ - Step 137457: {'lr': 8.81030475264602e-06, 'samples': 26391744, 'steps': 137456, 'loss/train': 1.1707074642181396} 11/07/2021 16:36:44 - INFO - __main__ - Step 137458: {'lr': 8.808908412108685e-06, 'samples': 26391936, 'steps': 137457, 'loss/train': 1.1091862916946411} 11/07/2021 16:36:44 - INFO - __main__ - Step 137459: {'lr': 8.807512180248334e-06, 'samples': 26392128, 'steps': 137458, 'loss/train': 1.4077248573303223} 11/07/2021 16:36:44 - INFO - __main__ - Step 137460: {'lr': 8.806116057065578e-06, 'samples': 26392320, 'steps': 137459, 'loss/train': 0.40220871567726135} 11/07/2021 16:36:45 - INFO - __main__ - Step 137461: {'lr': 8.804720042561082e-06, 'samples': 26392512, 'steps': 137460, 'loss/train': 0.3761148154735565} 11/07/2021 16:36:45 - INFO - __main__ - Step 137462: {'lr': 8.80332413673543e-06, 'samples': 26392704, 'steps': 137461, 'loss/train': 1.2398662567138672} 11/07/2021 16:36:45 - INFO - __main__ - Step 137463: {'lr': 8.801928339589288e-06, 'samples': 26392896, 'steps': 137462, 'loss/train': 1.3765902519226074} 11/07/2021 16:36:46 - INFO - __main__ - Step 137464: {'lr': 8.800532651123323e-06, 'samples': 26393088, 'steps': 137463, 'loss/train': 1.1965842247009277} 11/07/2021 16:36:47 - INFO - __main__ - Step 137465: {'lr': 8.799137071338087e-06, 'samples': 26393280, 'steps': 137464, 'loss/train': 0.8637139797210693} 11/07/2021 16:36:47 - INFO - __main__ - Step 137466: {'lr': 8.797741600234222e-06, 'samples': 26393472, 'steps': 137465, 'loss/train': 0.8180498480796814} 11/07/2021 16:36:47 - INFO - __main__ - Step 137467: {'lr': 8.796346237812364e-06, 'samples': 26393664, 'steps': 137466, 'loss/train': 1.372283935546875} 11/07/2021 16:36:48 - INFO - __main__ - Step 137468: {'lr': 8.79495098407318e-06, 'samples': 26393856, 'steps': 137467, 'loss/train': 0.7037883996963501} 11/07/2021 16:36:49 - INFO - __main__ - Step 137469: {'lr': 8.793555839017253e-06, 'samples': 26394048, 'steps': 137468, 'loss/train': 1.5311049222946167} 11/07/2021 16:36:49 - INFO - __main__ - Step 137470: {'lr': 8.792160802645222e-06, 'samples': 26394240, 'steps': 137469, 'loss/train': 1.351503849029541} 11/07/2021 16:36:49 - INFO - __main__ - Step 137471: {'lr': 8.790765874957724e-06, 'samples': 26394432, 'steps': 137470, 'loss/train': 1.0038862228393555} 11/07/2021 16:36:50 - INFO - __main__ - Step 137472: {'lr': 8.789371055955398e-06, 'samples': 26394624, 'steps': 137471, 'loss/train': 1.2021087408065796} 11/07/2021 16:36:51 - INFO - __main__ - Step 137473: {'lr': 8.787976345638827e-06, 'samples': 26394816, 'steps': 137472, 'loss/train': 1.132435917854309} 11/07/2021 16:36:51 - INFO - __main__ - Step 137474: {'lr': 8.786581744008704e-06, 'samples': 26395008, 'steps': 137473, 'loss/train': 1.4433236122131348} 11/07/2021 16:36:52 - INFO - __main__ - Step 137475: {'lr': 8.785187251065613e-06, 'samples': 26395200, 'steps': 137474, 'loss/train': 0.43668293952941895} 11/07/2021 16:36:52 - INFO - __main__ - Step 137476: {'lr': 8.783792866810191e-06, 'samples': 26395392, 'steps': 137475, 'loss/train': 1.0191833972930908} 11/07/2021 16:36:52 - INFO - __main__ - Step 137477: {'lr': 8.782398591243079e-06, 'samples': 26395584, 'steps': 137476, 'loss/train': 0.5512829422950745} 11/07/2021 16:36:53 - INFO - __main__ - Step 137478: {'lr': 8.781004424364913e-06, 'samples': 26395776, 'steps': 137477, 'loss/train': 0.6897361874580383} 11/07/2021 16:36:54 - INFO - __main__ - Step 137479: {'lr': 8.779610366176304e-06, 'samples': 26395968, 'steps': 137478, 'loss/train': 1.3039106130599976} 11/07/2021 16:36:54 - INFO - __main__ - Step 137480: {'lr': 8.778216416677864e-06, 'samples': 26396160, 'steps': 137479, 'loss/train': 1.3544727563858032} 11/07/2021 16:36:54 - INFO - __main__ - Step 137481: {'lr': 8.77682257587023e-06, 'samples': 26396352, 'steps': 137480, 'loss/train': 1.281306266784668} 11/07/2021 16:36:55 - INFO - __main__ - Step 137482: {'lr': 8.775428843754041e-06, 'samples': 26396544, 'steps': 137481, 'loss/train': 1.2959511280059814} 11/07/2021 16:36:55 - INFO - __main__ - Step 137483: {'lr': 8.774035220329907e-06, 'samples': 26396736, 'steps': 137482, 'loss/train': 1.3958594799041748} 11/07/2021 16:36:56 - INFO - __main__ - Step 137484: {'lr': 8.772641705598494e-06, 'samples': 26396928, 'steps': 137483, 'loss/train': 1.231770634651184} 11/07/2021 16:36:56 - INFO - __main__ - Step 137485: {'lr': 8.771248299560386e-06, 'samples': 26397120, 'steps': 137484, 'loss/train': 1.326941967010498} 11/07/2021 16:36:57 - INFO - __main__ - Step 137486: {'lr': 8.769855002216249e-06, 'samples': 26397312, 'steps': 137485, 'loss/train': 1.242586612701416} 11/07/2021 16:36:57 - INFO - __main__ - Step 137487: {'lr': 8.768461813566692e-06, 'samples': 26397504, 'steps': 137486, 'loss/train': 1.3925639390945435} 11/07/2021 16:36:57 - INFO - __main__ - Step 137488: {'lr': 8.767068733612327e-06, 'samples': 26397696, 'steps': 137487, 'loss/train': 0.9353419542312622} 11/07/2021 16:36:59 - INFO - __main__ - Step 137489: {'lr': 8.76567576235382e-06, 'samples': 26397888, 'steps': 137488, 'loss/train': 1.1316381692886353} 11/07/2021 16:36:59 - INFO - __main__ - Step 137490: {'lr': 8.764282899791754e-06, 'samples': 26398080, 'steps': 137489, 'loss/train': 1.2211811542510986} 11/07/2021 16:36:59 - INFO - __main__ - Step 137491: {'lr': 8.762890145926822e-06, 'samples': 26398272, 'steps': 137490, 'loss/train': 1.428818702697754} 11/07/2021 16:37:00 - INFO - __main__ - Step 137492: {'lr': 8.761497500759579e-06, 'samples': 26398464, 'steps': 137491, 'loss/train': 1.7427122592926025} 11/07/2021 16:37:00 - INFO - __main__ - Step 137493: {'lr': 8.760104964290693e-06, 'samples': 26398656, 'steps': 137492, 'loss/train': 1.4490450620651245} 11/07/2021 16:37:01 - INFO - __main__ - Step 137494: {'lr': 8.758712536520746e-06, 'samples': 26398848, 'steps': 137493, 'loss/train': 1.4074680805206299} 11/07/2021 16:37:01 - INFO - __main__ - Step 137495: {'lr': 8.757320217450432e-06, 'samples': 26399040, 'steps': 137494, 'loss/train': 1.2231221199035645} 11/07/2021 16:37:02 - INFO - __main__ - Step 137496: {'lr': 8.755928007080333e-06, 'samples': 26399232, 'steps': 137495, 'loss/train': 1.3712105751037598} 11/07/2021 16:37:02 - INFO - __main__ - Step 137497: {'lr': 8.754535905411115e-06, 'samples': 26399424, 'steps': 137496, 'loss/train': 1.1132426261901855} 11/07/2021 16:37:02 - INFO - __main__ - Step 137498: {'lr': 8.753143912443363e-06, 'samples': 26399616, 'steps': 137497, 'loss/train': 1.7630926370620728} 11/07/2021 16:37:03 - INFO - __main__ - Step 137499: {'lr': 8.751752028177712e-06, 'samples': 26399808, 'steps': 137498, 'loss/train': 1.2108887434005737} 11/07/2021 16:37:04 - INFO - __main__ - Step 137500: {'lr': 8.750360252614802e-06, 'samples': 26400000, 'steps': 137499, 'loss/train': 1.3697046041488647} 11/07/2021 16:37:04 - INFO - __main__ - Step 137501: {'lr': 8.748968585755273e-06, 'samples': 26400192, 'steps': 137500, 'loss/train': 1.2430393695831299} 11/07/2021 16:37:05 - INFO - __main__ - Step 137502: {'lr': 8.747577027599735e-06, 'samples': 26400384, 'steps': 137501, 'loss/train': 0.05986207723617554} 11/07/2021 16:37:05 - INFO - __main__ - Step 137503: {'lr': 8.746185578148796e-06, 'samples': 26400576, 'steps': 137502, 'loss/train': 1.448029637336731} 11/07/2021 16:37:06 - INFO - __main__ - Step 137504: {'lr': 8.744794237403125e-06, 'samples': 26400768, 'steps': 137503, 'loss/train': 0.9399271011352539} 11/07/2021 16:37:06 - INFO - __main__ - Step 137505: {'lr': 8.743403005363331e-06, 'samples': 26400960, 'steps': 137504, 'loss/train': 0.4136463403701782} 11/07/2021 16:37:07 - INFO - __main__ - Step 137506: {'lr': 8.742011882030026e-06, 'samples': 26401152, 'steps': 137505, 'loss/train': 1.2297152280807495} 11/07/2021 16:37:07 - INFO - __main__ - Step 137507: {'lr': 8.740620867403848e-06, 'samples': 26401344, 'steps': 137506, 'loss/train': 1.0186762809753418} 11/07/2021 16:37:07 - INFO - __main__ - Step 137508: {'lr': 8.739229961485406e-06, 'samples': 26401536, 'steps': 137507, 'loss/train': 1.3024224042892456} 11/07/2021 16:37:08 - INFO - __main__ - Step 137509: {'lr': 8.737839164275368e-06, 'samples': 26401728, 'steps': 137508, 'loss/train': 0.03763458877801895} 11/07/2021 16:37:09 - INFO - __main__ - Step 137510: {'lr': 8.736448475774317e-06, 'samples': 26401920, 'steps': 137509, 'loss/train': 1.4416781663894653} 11/07/2021 16:37:09 - INFO - __main__ - Step 137511: {'lr': 8.735057895982918e-06, 'samples': 26402112, 'steps': 137510, 'loss/train': 1.3262628316879272} 11/07/2021 16:37:09 - INFO - __main__ - Step 137512: {'lr': 8.733667424901754e-06, 'samples': 26402304, 'steps': 137511, 'loss/train': 0.8098776340484619} 11/07/2021 16:37:10 - INFO - __main__ - Step 137513: {'lr': 8.73227706253149e-06, 'samples': 26402496, 'steps': 137512, 'loss/train': 1.2503856420516968} 11/07/2021 16:37:10 - INFO - __main__ - Step 137514: {'lr': 8.73088680887274e-06, 'samples': 26402688, 'steps': 137513, 'loss/train': 0.6727802157402039} 11/07/2021 16:37:11 - INFO - __main__ - Step 137515: {'lr': 8.72949666392614e-06, 'samples': 26402880, 'steps': 137514, 'loss/train': 1.2898688316345215} 11/07/2021 16:37:12 - INFO - __main__ - Step 137516: {'lr': 8.728106627692327e-06, 'samples': 26403072, 'steps': 137515, 'loss/train': 1.2578716278076172} 11/07/2021 16:37:12 - INFO - __main__ - Step 137517: {'lr': 8.72671670017186e-06, 'samples': 26403264, 'steps': 137516, 'loss/train': 1.1433677673339844} 11/07/2021 16:37:12 - INFO - __main__ - Step 137518: {'lr': 8.725326881365431e-06, 'samples': 26403456, 'steps': 137517, 'loss/train': 1.389976143836975} 11/07/2021 16:37:13 - INFO - __main__ - Step 137519: {'lr': 8.723937171273649e-06, 'samples': 26403648, 'steps': 137518, 'loss/train': 1.274928092956543} 11/07/2021 16:37:14 - INFO - __main__ - Step 137520: {'lr': 8.722547569897126e-06, 'samples': 26403840, 'steps': 137519, 'loss/train': 1.4507393836975098} 11/07/2021 16:37:14 - INFO - __main__ - Step 137521: {'lr': 8.721158077236502e-06, 'samples': 26404032, 'steps': 137520, 'loss/train': 1.4976545572280884} 11/07/2021 16:37:14 - INFO - __main__ - Step 137522: {'lr': 8.719768693292413e-06, 'samples': 26404224, 'steps': 137521, 'loss/train': 1.519310712814331} 11/07/2021 16:37:15 - INFO - __main__ - Step 137523: {'lr': 8.71837941806547e-06, 'samples': 26404416, 'steps': 137522, 'loss/train': 1.2954463958740234} 11/07/2021 16:37:15 - INFO - __main__ - Step 137524: {'lr': 8.716990251556284e-06, 'samples': 26404608, 'steps': 137523, 'loss/train': 1.644501805305481} 11/07/2021 16:37:16 - INFO - __main__ - Step 137525: {'lr': 8.715601193765522e-06, 'samples': 26404800, 'steps': 137524, 'loss/train': 0.8226993083953857} 11/07/2021 16:37:16 - INFO - __main__ - Step 137526: {'lr': 8.714212244693764e-06, 'samples': 26404992, 'steps': 137525, 'loss/train': 1.1385283470153809} 11/07/2021 16:37:17 - INFO - __main__ - Step 137527: {'lr': 8.712823404341708e-06, 'samples': 26405184, 'steps': 137526, 'loss/train': 1.323683500289917} 11/07/2021 16:37:17 - INFO - __main__ - Step 137528: {'lr': 8.711434672709878e-06, 'samples': 26405376, 'steps': 137527, 'loss/train': 0.5346939563751221} 11/07/2021 16:37:17 - INFO - __main__ - Step 137529: {'lr': 8.71004604979897e-06, 'samples': 26405568, 'steps': 137528, 'loss/train': 1.2596817016601562} 11/07/2021 16:37:18 - INFO - __main__ - Step 137530: {'lr': 8.708657535609593e-06, 'samples': 26405760, 'steps': 137529, 'loss/train': 1.1520371437072754} 11/07/2021 16:37:19 - INFO - __main__ - Step 137531: {'lr': 8.70726913014236e-06, 'samples': 26405952, 'steps': 137530, 'loss/train': 1.25456702709198} 11/07/2021 16:37:19 - INFO - __main__ - Step 137532: {'lr': 8.705880833397934e-06, 'samples': 26406144, 'steps': 137531, 'loss/train': 1.5472538471221924} 11/07/2021 16:37:20 - INFO - __main__ - Step 137533: {'lr': 8.704492645376872e-06, 'samples': 26406336, 'steps': 137532, 'loss/train': 0.296092689037323} 11/07/2021 16:37:20 - INFO - __main__ - Step 137534: {'lr': 8.703104566079866e-06, 'samples': 26406528, 'steps': 137533, 'loss/train': 1.3472120761871338} 11/07/2021 16:37:21 - INFO - __main__ - Step 137535: {'lr': 8.70171659550753e-06, 'samples': 26406720, 'steps': 137534, 'loss/train': 1.030178427696228} 11/07/2021 16:37:21 - INFO - __main__ - Step 137536: {'lr': 8.700328733660473e-06, 'samples': 26406912, 'steps': 137535, 'loss/train': 1.371339201927185} 11/07/2021 16:37:22 - INFO - __main__ - Step 137537: {'lr': 8.698940980539333e-06, 'samples': 26407104, 'steps': 137536, 'loss/train': 1.3120285272598267} 11/07/2021 16:37:22 - INFO - __main__ - Step 137538: {'lr': 8.69755333614472e-06, 'samples': 26407296, 'steps': 137537, 'loss/train': 1.4129670858383179} 11/07/2021 16:37:22 - INFO - __main__ - Step 137539: {'lr': 8.696165800477246e-06, 'samples': 26407488, 'steps': 137538, 'loss/train': 1.4398962259292603} 11/07/2021 16:37:23 - INFO - __main__ - Step 137540: {'lr': 8.694778373537576e-06, 'samples': 26407680, 'steps': 137539, 'loss/train': 1.2746615409851074} 11/07/2021 16:37:24 - INFO - __main__ - Step 137541: {'lr': 8.693391055326294e-06, 'samples': 26407872, 'steps': 137540, 'loss/train': 0.709980845451355} 11/07/2021 16:37:24 - INFO - __main__ - Step 137542: {'lr': 8.692003845844066e-06, 'samples': 26408064, 'steps': 137541, 'loss/train': 0.9983959197998047} 11/07/2021 16:37:25 - INFO - __main__ - Step 137543: {'lr': 8.6906167450915e-06, 'samples': 26408256, 'steps': 137542, 'loss/train': 1.3391672372817993} 11/07/2021 16:37:25 - INFO - __main__ - Step 137544: {'lr': 8.689229753069183e-06, 'samples': 26408448, 'steps': 137543, 'loss/train': 1.3772873878479004} 11/07/2021 16:37:25 - INFO - __main__ - Step 137545: {'lr': 8.687842869777806e-06, 'samples': 26408640, 'steps': 137544, 'loss/train': 1.3853152990341187} 11/07/2021 16:37:26 - INFO - __main__ - Step 137546: {'lr': 8.686456095217954e-06, 'samples': 26408832, 'steps': 137545, 'loss/train': 1.2897824048995972} 11/07/2021 16:37:27 - INFO - __main__ - Step 137547: {'lr': 8.685069429390264e-06, 'samples': 26409024, 'steps': 137546, 'loss/train': 1.0013322830200195} 11/07/2021 16:37:27 - INFO - __main__ - Step 137548: {'lr': 8.6836828722954e-06, 'samples': 26409216, 'steps': 137547, 'loss/train': 1.358132243156433} 11/07/2021 16:37:27 - INFO - __main__ - Step 137549: {'lr': 8.682296423933894e-06, 'samples': 26409408, 'steps': 137548, 'loss/train': 0.9302381277084351} 11/07/2021 16:37:28 - INFO - __main__ - Step 137550: {'lr': 8.680910084306437e-06, 'samples': 26409600, 'steps': 137549, 'loss/train': 0.916493833065033} 11/07/2021 16:37:28 - INFO - __main__ - Step 137551: {'lr': 8.67952385341364e-06, 'samples': 26409792, 'steps': 137550, 'loss/train': 0.7636890411376953} 11/07/2021 16:37:29 - INFO - __main__ - Step 137552: {'lr': 8.678137731256114e-06, 'samples': 26409984, 'steps': 137551, 'loss/train': 1.2664824724197388} 11/07/2021 16:37:30 - INFO - __main__ - Step 137553: {'lr': 8.676751717834497e-06, 'samples': 26410176, 'steps': 137552, 'loss/train': 1.2268108129501343} 11/07/2021 16:37:30 - INFO - __main__ - Step 137554: {'lr': 8.675365813149428e-06, 'samples': 26410368, 'steps': 137553, 'loss/train': 1.5132291316986084} 11/07/2021 16:37:30 - INFO - __main__ - Step 137555: {'lr': 8.673980017201488e-06, 'samples': 26410560, 'steps': 137554, 'loss/train': 1.7249127626419067} 11/07/2021 16:37:31 - INFO - __main__ - Step 137556: {'lr': 8.672594329991345e-06, 'samples': 26410752, 'steps': 137555, 'loss/train': 0.755914032459259} 11/07/2021 16:37:32 - INFO - __main__ - Step 137557: {'lr': 8.67120875151961e-06, 'samples': 26410944, 'steps': 137556, 'loss/train': 1.100510597229004} 11/07/2021 16:37:32 - INFO - __main__ - Step 137558: {'lr': 8.669823281786893e-06, 'samples': 26411136, 'steps': 137557, 'loss/train': 1.516588568687439} 11/07/2021 16:37:32 - INFO - __main__ - Step 137559: {'lr': 8.66843792079386e-06, 'samples': 26411328, 'steps': 137558, 'loss/train': 1.668936848640442} 11/07/2021 16:37:33 - INFO - __main__ - Step 137560: {'lr': 8.667052668541092e-06, 'samples': 26411520, 'steps': 137559, 'loss/train': 1.140468955039978} 11/07/2021 16:37:33 - INFO - __main__ - Step 137561: {'lr': 8.665667525029202e-06, 'samples': 26411712, 'steps': 137560, 'loss/train': 3.2827892303466797} 11/07/2021 16:37:34 - INFO - __main__ - Step 137562: {'lr': 8.664282490258857e-06, 'samples': 26411904, 'steps': 137561, 'loss/train': 0.9056341052055359} 11/07/2021 16:37:34 - INFO - __main__ - Step 137563: {'lr': 8.662897564230637e-06, 'samples': 26412096, 'steps': 137562, 'loss/train': 1.3113288879394531} 11/07/2021 16:37:35 - INFO - __main__ - Step 137564: {'lr': 8.661512746945211e-06, 'samples': 26412288, 'steps': 137563, 'loss/train': 1.3324679136276245} 11/07/2021 16:37:35 - INFO - __main__ - Step 137565: {'lr': 8.660128038403187e-06, 'samples': 26412480, 'steps': 137564, 'loss/train': 1.2563596963882446} 11/07/2021 16:37:36 - INFO - __main__ - Step 137566: {'lr': 8.658743438605176e-06, 'samples': 26412672, 'steps': 137565, 'loss/train': 0.9368733167648315} 11/07/2021 16:37:36 - INFO - __main__ - Step 137567: {'lr': 8.657358947551819e-06, 'samples': 26412864, 'steps': 137566, 'loss/train': 1.2540329694747925} 11/07/2021 16:37:37 - INFO - __main__ - Step 137568: {'lr': 8.655974565243724e-06, 'samples': 26413056, 'steps': 137567, 'loss/train': 1.8710980415344238} 11/07/2021 16:37:37 - INFO - __main__ - Step 137569: {'lr': 8.65459029168153e-06, 'samples': 26413248, 'steps': 137568, 'loss/train': 1.709012508392334} 11/07/2021 16:37:38 - INFO - __main__ - Step 137570: {'lr': 8.653206126865847e-06, 'samples': 26413440, 'steps': 137569, 'loss/train': 1.409237265586853} 11/07/2021 16:37:38 - INFO - __main__ - Step 137571: {'lr': 8.651822070797317e-06, 'samples': 26413632, 'steps': 137570, 'loss/train': 1.0426197052001953} 11/07/2021 16:37:38 - INFO - __main__ - Step 137572: {'lr': 8.650438123476573e-06, 'samples': 26413824, 'steps': 137571, 'loss/train': 1.371561050415039} 11/07/2021 16:37:40 - INFO - __main__ - Step 137573: {'lr': 8.649054284904201e-06, 'samples': 26414016, 'steps': 137572, 'loss/train': 1.9867295026779175} 11/07/2021 16:37:40 - INFO - __main__ - Step 137574: {'lr': 8.64767055508081e-06, 'samples': 26414208, 'steps': 137573, 'loss/train': 0.36091646552085876} 11/07/2021 16:37:40 - INFO - __main__ - Step 137575: {'lr': 8.64628693400707e-06, 'samples': 26414400, 'steps': 137574, 'loss/train': 1.148996114730835} 11/07/2021 16:37:41 - INFO - __main__ - Step 137576: {'lr': 8.644903421683587e-06, 'samples': 26414592, 'steps': 137575, 'loss/train': 1.4100788831710815} 11/07/2021 16:37:41 - INFO - __main__ - Step 137577: {'lr': 8.643520018111e-06, 'samples': 26414784, 'steps': 137576, 'loss/train': 2.123145818710327} 11/07/2021 16:37:42 - INFO - __main__ - Step 137578: {'lr': 8.642136723289923e-06, 'samples': 26414976, 'steps': 137577, 'loss/train': 0.7546897530555725} 11/07/2021 16:37:42 - INFO - __main__ - Step 137579: {'lr': 8.640753537220935e-06, 'samples': 26415168, 'steps': 137578, 'loss/train': 1.2478290796279907} 11/07/2021 16:37:43 - INFO - __main__ - Step 137580: {'lr': 8.639370459904733e-06, 'samples': 26415360, 'steps': 137579, 'loss/train': 1.2836318016052246} 11/07/2021 16:37:43 - INFO - __main__ - Step 137581: {'lr': 8.637987491341897e-06, 'samples': 26415552, 'steps': 137580, 'loss/train': 1.5259320735931396} 11/07/2021 16:37:43 - INFO - __main__ - Step 137582: {'lr': 8.636604631533068e-06, 'samples': 26415744, 'steps': 137581, 'loss/train': 1.0317682027816772} 11/07/2021 16:37:44 - INFO - __main__ - Step 137583: {'lr': 8.635221880478855e-06, 'samples': 26415936, 'steps': 137582, 'loss/train': 1.0626221895217896} 11/07/2021 16:37:45 - INFO - __main__ - Step 137584: {'lr': 8.63383923817987e-06, 'samples': 26416128, 'steps': 137583, 'loss/train': 1.4896761178970337} 11/07/2021 16:37:45 - INFO - __main__ - Step 137585: {'lr': 8.632456704636804e-06, 'samples': 26416320, 'steps': 137584, 'loss/train': 1.280288815498352} 11/07/2021 16:37:45 - INFO - __main__ - Step 137586: {'lr': 8.631074279850187e-06, 'samples': 26416512, 'steps': 137585, 'loss/train': 1.2551707029342651} 11/07/2021 16:37:46 - INFO - __main__ - Step 137587: {'lr': 8.629691963820685e-06, 'samples': 26416704, 'steps': 137586, 'loss/train': 1.1908528804779053} 11/07/2021 16:37:47 - INFO - __main__ - Step 137588: {'lr': 8.628309756548935e-06, 'samples': 26416896, 'steps': 137587, 'loss/train': 1.2060840129852295} 11/07/2021 16:37:47 - INFO - __main__ - Step 137589: {'lr': 8.626927658035522e-06, 'samples': 26417088, 'steps': 137588, 'loss/train': 1.3224495649337769} 11/07/2021 16:37:48 - INFO - __main__ - Step 137590: {'lr': 8.62554566828111e-06, 'samples': 26417280, 'steps': 137589, 'loss/train': 5.673499584197998} 11/07/2021 16:37:48 - INFO - __main__ - Step 137591: {'lr': 8.624163787286282e-06, 'samples': 26417472, 'steps': 137590, 'loss/train': 1.1321215629577637} 11/07/2021 16:37:48 - INFO - __main__ - Step 137592: {'lr': 8.622782015051678e-06, 'samples': 26417664, 'steps': 137591, 'loss/train': 0.7982601523399353} 11/07/2021 16:37:49 - INFO - __main__ - Step 137593: {'lr': 8.621400351577962e-06, 'samples': 26417856, 'steps': 137592, 'loss/train': 0.17687202990055084} 11/07/2021 16:37:50 - INFO - __main__ - Step 137594: {'lr': 8.620018796865691e-06, 'samples': 26418048, 'steps': 137593, 'loss/train': 0.6257336735725403} 11/07/2021 16:37:50 - INFO - __main__ - Step 137595: {'lr': 8.618637350915503e-06, 'samples': 26418240, 'steps': 137594, 'loss/train': 1.2537418603897095} 11/07/2021 16:37:51 - INFO - __main__ - Step 137596: {'lr': 8.617256013728037e-06, 'samples': 26418432, 'steps': 137595, 'loss/train': 1.7697068452835083} 11/07/2021 16:37:51 - INFO - __main__ - Step 137597: {'lr': 8.61587478530393e-06, 'samples': 26418624, 'steps': 137596, 'loss/train': 0.8513680696487427} 11/07/2021 16:37:51 - INFO - __main__ - Step 137598: {'lr': 8.614493665643763e-06, 'samples': 26418816, 'steps': 137597, 'loss/train': 1.2143040895462036} 11/07/2021 16:37:52 - INFO - __main__ - Step 137599: {'lr': 8.613112654748233e-06, 'samples': 26419008, 'steps': 137598, 'loss/train': 1.0776199102401733} 11/07/2021 16:37:53 - INFO - __main__ - Step 137600: {'lr': 8.611731752617868e-06, 'samples': 26419200, 'steps': 137599, 'loss/train': 1.0830987691879272} 11/07/2021 16:37:53 - INFO - __main__ - Step 137601: {'lr': 8.610350959253332e-06, 'samples': 26419392, 'steps': 137600, 'loss/train': 0.8108770251274109} 11/07/2021 16:37:53 - INFO - __main__ - Step 137602: {'lr': 8.608970274655264e-06, 'samples': 26419584, 'steps': 137601, 'loss/train': 1.5999653339385986} 11/07/2021 16:37:54 - INFO - __main__ - Step 137603: {'lr': 8.607589698824247e-06, 'samples': 26419776, 'steps': 137602, 'loss/train': 0.8519894480705261} 11/07/2021 16:37:55 - INFO - __main__ - Step 137604: {'lr': 8.60620923176092e-06, 'samples': 26419968, 'steps': 137603, 'loss/train': 1.3456059694290161} 11/07/2021 16:37:55 - INFO - __main__ - Step 137605: {'lr': 8.604828873465919e-06, 'samples': 26420160, 'steps': 137604, 'loss/train': 0.9930843710899353} 11/07/2021 16:37:55 - INFO - __main__ - Step 137606: {'lr': 8.603448623939857e-06, 'samples': 26420352, 'steps': 137605, 'loss/train': 1.4791717529296875} 11/07/2021 16:37:56 - INFO - __main__ - Step 137607: {'lr': 8.602068483183373e-06, 'samples': 26420544, 'steps': 137606, 'loss/train': 1.3955299854278564} 11/07/2021 16:37:56 - INFO - __main__ - Step 137608: {'lr': 8.600688451197047e-06, 'samples': 26420736, 'steps': 137607, 'loss/train': 1.3205689191818237} 11/07/2021 16:37:57 - INFO - __main__ - Step 137609: {'lr': 8.599308527981548e-06, 'samples': 26420928, 'steps': 137608, 'loss/train': 1.2584919929504395} 11/07/2021 16:37:57 - INFO - __main__ - Step 137610: {'lr': 8.597928713537456e-06, 'samples': 26421120, 'steps': 137609, 'loss/train': 0.6518523097038269} 11/07/2021 16:37:58 - INFO - __main__ - Step 137611: {'lr': 8.596549007865411e-06, 'samples': 26421312, 'steps': 137610, 'loss/train': 0.9607048034667969} 11/07/2021 16:37:58 - INFO - __main__ - Step 137612: {'lr': 8.59516941096608e-06, 'samples': 26421504, 'steps': 137611, 'loss/train': 0.9422702193260193} 11/07/2021 16:37:59 - INFO - __main__ - Step 137613: {'lr': 8.593789922840018e-06, 'samples': 26421696, 'steps': 137612, 'loss/train': 1.5973175764083862} 11/07/2021 16:37:59 - INFO - __main__ - Step 137614: {'lr': 8.592410543487833e-06, 'samples': 26421888, 'steps': 137613, 'loss/train': 1.0473413467407227} 11/07/2021 16:38:00 - INFO - __main__ - Step 137615: {'lr': 8.591031272910222e-06, 'samples': 26422080, 'steps': 137614, 'loss/train': 1.4880805015563965} 11/07/2021 16:38:00 - INFO - __main__ - Step 137616: {'lr': 8.589652111107738e-06, 'samples': 26422272, 'steps': 137615, 'loss/train': 0.8482583165168762} 11/07/2021 16:38:01 - INFO - __main__ - Step 137617: {'lr': 8.588273058081047e-06, 'samples': 26422464, 'steps': 137616, 'loss/train': 0.8016936779022217} 11/07/2021 16:38:01 - INFO - __main__ - Step 137618: {'lr': 8.586894113830761e-06, 'samples': 26422656, 'steps': 137617, 'loss/train': 1.2463858127593994} 11/07/2021 16:38:01 - INFO - __main__ - Step 137619: {'lr': 8.585515278357492e-06, 'samples': 26422848, 'steps': 137618, 'loss/train': 0.8123329281806946} 11/07/2021 16:38:02 - INFO - __main__ - Step 137620: {'lr': 8.584136551661847e-06, 'samples': 26423040, 'steps': 137619, 'loss/train': 0.971051812171936} 11/07/2021 16:38:03 - INFO - __main__ - Step 137621: {'lr': 8.582757933744495e-06, 'samples': 26423232, 'steps': 137620, 'loss/train': 1.2247624397277832} 11/07/2021 16:38:03 - INFO - __main__ - Step 137622: {'lr': 8.58137942460599e-06, 'samples': 26423424, 'steps': 137621, 'loss/train': 1.2288727760314941} 11/07/2021 16:38:03 - INFO - __main__ - Step 137623: {'lr': 8.580001024247025e-06, 'samples': 26423616, 'steps': 137622, 'loss/train': 1.2983877658843994} 11/07/2021 16:38:04 - INFO - __main__ - Step 137624: {'lr': 8.578622732668156e-06, 'samples': 26423808, 'steps': 137623, 'loss/train': 0.7665138840675354} 11/07/2021 16:38:05 - INFO - __main__ - Step 137625: {'lr': 8.57724454987005e-06, 'samples': 26424000, 'steps': 137624, 'loss/train': 1.1715900897979736} 11/07/2021 16:38:05 - INFO - __main__ - Step 137626: {'lr': 8.575866475853344e-06, 'samples': 26424192, 'steps': 137625, 'loss/train': 1.7980093955993652} 11/07/2021 16:38:06 - INFO - __main__ - Step 137627: {'lr': 8.574488510618594e-06, 'samples': 26424384, 'steps': 137626, 'loss/train': 1.5294922590255737} 11/07/2021 16:38:06 - INFO - __main__ - Step 137628: {'lr': 8.573110654166466e-06, 'samples': 26424576, 'steps': 137627, 'loss/train': 1.4792264699935913} 11/07/2021 16:38:06 - INFO - __main__ - Step 137629: {'lr': 8.571732906497542e-06, 'samples': 26424768, 'steps': 137628, 'loss/train': 1.0918786525726318} 11/07/2021 16:38:07 - INFO - __main__ - Step 137630: {'lr': 8.57035526761249e-06, 'samples': 26424960, 'steps': 137629, 'loss/train': 1.7982739210128784} 11/07/2021 16:38:08 - INFO - __main__ - Step 137631: {'lr': 8.568977737511918e-06, 'samples': 26425152, 'steps': 137630, 'loss/train': 1.32456636428833} 11/07/2021 16:38:08 - INFO - __main__ - Step 137632: {'lr': 8.56760031619641e-06, 'samples': 26425344, 'steps': 137631, 'loss/train': 0.43973836302757263} 11/07/2021 16:38:08 - INFO - __main__ - Step 137633: {'lr': 8.566223003666635e-06, 'samples': 26425536, 'steps': 137632, 'loss/train': 0.5296491384506226} 11/07/2021 16:38:09 - INFO - __main__ - Step 137634: {'lr': 8.564845799923199e-06, 'samples': 26425728, 'steps': 137633, 'loss/train': 1.0786693096160889} 11/07/2021 16:38:10 - INFO - __main__ - Step 137635: {'lr': 8.563468704966716e-06, 'samples': 26425920, 'steps': 137634, 'loss/train': 1.5735822916030884} 11/07/2021 16:38:10 - INFO - __main__ - Step 137636: {'lr': 8.562091718797793e-06, 'samples': 26426112, 'steps': 137635, 'loss/train': 0.6797624230384827} 11/07/2021 16:38:11 - INFO - __main__ - Step 137637: {'lr': 8.560714841417072e-06, 'samples': 26426304, 'steps': 137636, 'loss/train': 1.309207797050476} 11/07/2021 16:38:11 - INFO - __main__ - Step 137638: {'lr': 8.559338072825162e-06, 'samples': 26426496, 'steps': 137637, 'loss/train': 1.0271508693695068} 11/07/2021 16:38:11 - INFO - __main__ - Step 137639: {'lr': 8.55796141302273e-06, 'samples': 26426688, 'steps': 137638, 'loss/train': 0.8826794624328613} 11/07/2021 16:38:12 - INFO - __main__ - Step 137640: {'lr': 8.556584862010331e-06, 'samples': 26426880, 'steps': 137639, 'loss/train': 1.536661982536316} 11/07/2021 16:38:13 - INFO - __main__ - Step 137641: {'lr': 8.555208419788601e-06, 'samples': 26427072, 'steps': 137640, 'loss/train': 1.327177882194519} 11/07/2021 16:38:13 - INFO - __main__ - Step 137642: {'lr': 8.553832086358182e-06, 'samples': 26427264, 'steps': 137641, 'loss/train': 1.1159837245941162} 11/07/2021 16:38:13 - INFO - __main__ - Step 137643: {'lr': 8.552455861719655e-06, 'samples': 26427456, 'steps': 137642, 'loss/train': 1.3718063831329346} 11/07/2021 16:38:14 - INFO - __main__ - Step 137644: {'lr': 8.551079745873686e-06, 'samples': 26427648, 'steps': 137643, 'loss/train': 1.0340447425842285} 11/07/2021 16:38:14 - INFO - __main__ - Step 137645: {'lr': 8.549703738820858e-06, 'samples': 26427840, 'steps': 137644, 'loss/train': 1.453708529472351} 11/07/2021 16:38:15 - INFO - __main__ - Step 137646: {'lr': 8.54832784056181e-06, 'samples': 26428032, 'steps': 137645, 'loss/train': 1.4037387371063232} 11/07/2021 16:38:15 - INFO - __main__ - Step 137647: {'lr': 8.54695205109718e-06, 'samples': 26428224, 'steps': 137646, 'loss/train': 0.8355200290679932} 11/07/2021 16:38:16 - INFO - __main__ - Step 137648: {'lr': 8.54557637042755e-06, 'samples': 26428416, 'steps': 137647, 'loss/train': 1.2180954217910767} 11/07/2021 16:38:16 - INFO - __main__ - Step 137649: {'lr': 8.544200798553559e-06, 'samples': 26428608, 'steps': 137648, 'loss/train': 0.8758627772331238} 11/07/2021 16:38:17 - INFO - __main__ - Step 137650: {'lr': 8.542825335475817e-06, 'samples': 26428800, 'steps': 137649, 'loss/train': 1.1229439973831177} 11/07/2021 16:38:18 - INFO - __main__ - Step 137651: {'lr': 8.541449981194965e-06, 'samples': 26428992, 'steps': 137650, 'loss/train': 1.321415901184082} 11/07/2021 16:38:18 - INFO - __main__ - Step 137652: {'lr': 8.540074735711639e-06, 'samples': 26429184, 'steps': 137651, 'loss/train': 1.2150261402130127} 11/07/2021 16:38:19 - INFO - __main__ - Step 137653: {'lr': 8.538699599026395e-06, 'samples': 26429376, 'steps': 137652, 'loss/train': 0.7961384654045105} 11/07/2021 16:38:19 - INFO - __main__ - Step 137654: {'lr': 8.537324571139898e-06, 'samples': 26429568, 'steps': 137653, 'loss/train': 0.9163202047348022} 11/07/2021 16:38:19 - INFO - __main__ - Step 137655: {'lr': 8.535949652052732e-06, 'samples': 26429760, 'steps': 137654, 'loss/train': 1.138547420501709} 11/07/2021 16:38:20 - INFO - __main__ - Step 137656: {'lr': 8.534574841765563e-06, 'samples': 26429952, 'steps': 137655, 'loss/train': 1.847033977508545} 11/07/2021 16:38:21 - INFO - __main__ - Step 137657: {'lr': 8.533200140278975e-06, 'samples': 26430144, 'steps': 137656, 'loss/train': 1.4394484758377075} 11/07/2021 16:38:21 - INFO - __main__ - Step 137658: {'lr': 8.531825547593602e-06, 'samples': 26430336, 'steps': 137657, 'loss/train': 0.9810031652450562} 11/07/2021 16:38:22 - INFO - __main__ - Step 137659: {'lr': 8.530451063710088e-06, 'samples': 26430528, 'steps': 137658, 'loss/train': 0.8417865037918091} 11/07/2021 16:38:22 - INFO - __main__ - Step 137660: {'lr': 8.529076688628984e-06, 'samples': 26430720, 'steps': 137659, 'loss/train': 0.7420225739479065} 11/07/2021 16:38:22 - INFO - __main__ - Step 137661: {'lr': 8.527702422350985e-06, 'samples': 26430912, 'steps': 137660, 'loss/train': 1.6098406314849854} 11/07/2021 16:38:24 - INFO - __main__ - Step 137662: {'lr': 8.526328264876676e-06, 'samples': 26431104, 'steps': 137661, 'loss/train': 1.2642042636871338} 11/07/2021 16:38:24 - INFO - __main__ - Step 137663: {'lr': 8.524954216206666e-06, 'samples': 26431296, 'steps': 137662, 'loss/train': 1.4032835960388184} 11/07/2021 16:38:25 - INFO - __main__ - Step 137664: {'lr': 8.523580276341591e-06, 'samples': 26431488, 'steps': 137663, 'loss/train': 1.4574545621871948} 11/07/2021 16:38:25 - INFO - __main__ - Step 137665: {'lr': 8.522206445282038e-06, 'samples': 26431680, 'steps': 137664, 'loss/train': 0.913759708404541} 11/07/2021 16:38:25 - INFO - __main__ - Step 137666: {'lr': 8.520832723028727e-06, 'samples': 26431872, 'steps': 137665, 'loss/train': 0.38548487424850464} 11/07/2021 16:38:26 - INFO - __main__ - Step 137667: {'lr': 8.519459109582128e-06, 'samples': 26432064, 'steps': 137666, 'loss/train': 1.1649683713912964} 11/07/2021 16:38:27 - INFO - __main__ - Step 137668: {'lr': 8.518085604942965e-06, 'samples': 26432256, 'steps': 137667, 'loss/train': 1.3805906772613525} 11/07/2021 16:38:27 - INFO - __main__ - Step 137669: {'lr': 8.516712209111822e-06, 'samples': 26432448, 'steps': 137668, 'loss/train': 1.2789318561553955} 11/07/2021 16:38:28 - INFO - __main__ - Step 137670: {'lr': 8.515338922089333e-06, 'samples': 26432640, 'steps': 137669, 'loss/train': 0.99848473072052} 11/07/2021 16:38:28 - INFO - __main__ - Step 137671: {'lr': 8.513965743876084e-06, 'samples': 26432832, 'steps': 137670, 'loss/train': 1.5390901565551758} 11/07/2021 16:38:28 - INFO - __main__ - Step 137672: {'lr': 8.512592674472714e-06, 'samples': 26433024, 'steps': 137671, 'loss/train': 1.3934649229049683} 11/07/2021 16:38:29 - INFO - __main__ - Step 137673: {'lr': 8.511219713879859e-06, 'samples': 26433216, 'steps': 137672, 'loss/train': 1.1490181684494019} 11/07/2021 16:38:30 - INFO - __main__ - Step 137674: {'lr': 8.50984686209813e-06, 'samples': 26433408, 'steps': 137673, 'loss/train': 0.5483201146125793} 11/07/2021 16:38:30 - INFO - __main__ - Step 137675: {'lr': 8.508474119128112e-06, 'samples': 26433600, 'steps': 137674, 'loss/train': 1.6120727062225342} 11/07/2021 16:38:30 - INFO - __main__ - Step 137676: {'lr': 8.50710148497047e-06, 'samples': 26433792, 'steps': 137675, 'loss/train': 1.464660406112671} 11/07/2021 16:38:31 - INFO - __main__ - Step 137677: {'lr': 8.505728959625787e-06, 'samples': 26433984, 'steps': 137676, 'loss/train': 1.121443271636963} 11/07/2021 16:38:32 - INFO - __main__ - Step 137678: {'lr': 8.504356543094699e-06, 'samples': 26434176, 'steps': 137677, 'loss/train': 1.2140886783599854} 11/07/2021 16:38:32 - INFO - __main__ - Step 137679: {'lr': 8.502984235377848e-06, 'samples': 26434368, 'steps': 137678, 'loss/train': 1.4648659229278564} 11/07/2021 16:38:33 - INFO - __main__ - Step 137680: {'lr': 8.501612036475815e-06, 'samples': 26434560, 'steps': 137679, 'loss/train': 0.08742816001176834} 11/07/2021 16:38:33 - INFO - __main__ - Step 137681: {'lr': 8.50023994638921e-06, 'samples': 26434752, 'steps': 137680, 'loss/train': 1.3856513500213623} 11/07/2021 16:38:33 - INFO - __main__ - Step 137682: {'lr': 8.498867965118673e-06, 'samples': 26434944, 'steps': 137681, 'loss/train': 1.1965711116790771} 11/07/2021 16:38:35 - INFO - __main__ - Step 137683: {'lr': 8.497496092664813e-06, 'samples': 26435136, 'steps': 137682, 'loss/train': 1.301505446434021} 11/07/2021 16:38:35 - INFO - __main__ - Step 137684: {'lr': 8.49612432902827e-06, 'samples': 26435328, 'steps': 137683, 'loss/train': 0.8830795288085938} 11/07/2021 16:38:35 - INFO - __main__ - Step 137685: {'lr': 8.494752674209655e-06, 'samples': 26435520, 'steps': 137684, 'loss/train': 1.0495773553848267} 11/07/2021 16:38:36 - INFO - __main__ - Step 137686: {'lr': 8.493381128209548e-06, 'samples': 26435712, 'steps': 137685, 'loss/train': 1.5618540048599243} 11/07/2021 16:38:36 - INFO - __main__ - Step 137687: {'lr': 8.492009691028619e-06, 'samples': 26435904, 'steps': 137686, 'loss/train': 1.3597781658172607} 11/07/2021 16:38:37 - INFO - __main__ - Step 137688: {'lr': 8.490638362667446e-06, 'samples': 26436096, 'steps': 137687, 'loss/train': 1.3736923933029175} 11/07/2021 16:38:37 - INFO - __main__ - Step 137689: {'lr': 8.489267143126672e-06, 'samples': 26436288, 'steps': 137688, 'loss/train': 1.0659083127975464} 11/07/2021 16:38:38 - INFO - __main__ - Step 137690: {'lr': 8.487896032406933e-06, 'samples': 26436480, 'steps': 137689, 'loss/train': 1.5397181510925293} 11/07/2021 16:38:38 - INFO - __main__ - Step 137691: {'lr': 8.486525030508784e-06, 'samples': 26436672, 'steps': 137690, 'loss/train': 1.0713964700698853} 11/07/2021 16:38:38 - INFO - __main__ - Step 137692: {'lr': 8.485154137432894e-06, 'samples': 26436864, 'steps': 137691, 'loss/train': 0.8893342018127441} 11/07/2021 16:38:40 - INFO - __main__ - Step 137693: {'lr': 8.483783353179896e-06, 'samples': 26437056, 'steps': 137692, 'loss/train': 1.3967382907867432} 11/07/2021 16:38:40 - INFO - __main__ - Step 137694: {'lr': 8.482412677750351e-06, 'samples': 26437248, 'steps': 137693, 'loss/train': 1.9590109586715698} 11/07/2021 16:38:40 - INFO - __main__ - Step 137695: {'lr': 8.481042111144921e-06, 'samples': 26437440, 'steps': 137694, 'loss/train': 1.4112986326217651} 11/07/2021 16:38:41 - INFO - __main__ - Step 137696: {'lr': 8.47967165336419e-06, 'samples': 26437632, 'steps': 137695, 'loss/train': 1.5011464357376099} 11/07/2021 16:38:41 - INFO - __main__ - Step 137697: {'lr': 8.47830130440877e-06, 'samples': 26437824, 'steps': 137696, 'loss/train': 1.3141921758651733} 11/07/2021 16:38:42 - INFO - __main__ - Step 137698: {'lr': 8.476931064279324e-06, 'samples': 26438016, 'steps': 137697, 'loss/train': 0.34531813859939575} 11/07/2021 16:38:42 - INFO - __main__ - Step 137699: {'lr': 8.475560932976467e-06, 'samples': 26438208, 'steps': 137698, 'loss/train': 1.29184889793396} 11/07/2021 16:38:43 - INFO - __main__ - Step 137700: {'lr': 8.47419091050075e-06, 'samples': 26438400, 'steps': 137699, 'loss/train': 1.5574171543121338} 11/07/2021 16:38:43 - INFO - __main__ - Step 137701: {'lr': 8.47282099685287e-06, 'samples': 26438592, 'steps': 137700, 'loss/train': 1.4076263904571533} 11/07/2021 16:38:43 - INFO - __main__ - Step 137702: {'lr': 8.471451192033409e-06, 'samples': 26438784, 'steps': 137701, 'loss/train': 1.0835247039794922} 11/07/2021 16:38:44 - INFO - __main__ - Step 137703: {'lr': 8.470081496042946e-06, 'samples': 26438976, 'steps': 137702, 'loss/train': 1.4734978675842285} 11/07/2021 16:38:45 - INFO - __main__ - Step 137704: {'lr': 8.468711908882183e-06, 'samples': 26439168, 'steps': 137703, 'loss/train': 1.2950176000595093} 11/07/2021 16:38:45 - INFO - __main__ - Step 137705: {'lr': 8.467342430551666e-06, 'samples': 26439360, 'steps': 137704, 'loss/train': 1.5426654815673828} 11/07/2021 16:38:45 - INFO - __main__ - Step 137706: {'lr': 8.465973061052067e-06, 'samples': 26439552, 'steps': 137705, 'loss/train': 0.6544806957244873} 11/07/2021 16:38:46 - INFO - __main__ - Step 137707: {'lr': 8.464603800383968e-06, 'samples': 26439744, 'steps': 137706, 'loss/train': 0.8451513648033142} 11/07/2021 16:38:47 - INFO - __main__ - Step 137708: {'lr': 8.46323464854798e-06, 'samples': 26439936, 'steps': 137707, 'loss/train': 1.3013577461242676} 11/07/2021 16:38:47 - INFO - __main__ - Step 137709: {'lr': 8.461865605544712e-06, 'samples': 26440128, 'steps': 137708, 'loss/train': 1.2172417640686035} 11/07/2021 16:38:48 - INFO - __main__ - Step 137710: {'lr': 8.460496671374828e-06, 'samples': 26440320, 'steps': 137709, 'loss/train': 1.172226071357727} 11/07/2021 16:38:48 - INFO - __main__ - Step 137711: {'lr': 8.459127846038889e-06, 'samples': 26440512, 'steps': 137710, 'loss/train': 1.6809743642807007} 11/07/2021 16:38:48 - INFO - __main__ - Step 137712: {'lr': 8.457759129537557e-06, 'samples': 26440704, 'steps': 137711, 'loss/train': 0.47296372056007385} 11/07/2021 16:38:49 - INFO - __main__ - Step 137713: {'lr': 8.456390521871415e-06, 'samples': 26440896, 'steps': 137712, 'loss/train': 1.0600993633270264} 11/07/2021 16:38:50 - INFO - __main__ - Step 137714: {'lr': 8.45502202304113e-06, 'samples': 26441088, 'steps': 137713, 'loss/train': 0.9574078321456909} 11/07/2021 16:38:50 - INFO - __main__ - Step 137715: {'lr': 8.45365363304726e-06, 'samples': 26441280, 'steps': 137714, 'loss/train': 1.076338291168213} 11/07/2021 16:38:51 - INFO - __main__ - Step 137716: {'lr': 8.452285351890437e-06, 'samples': 26441472, 'steps': 137715, 'loss/train': 0.7156243920326233} 11/07/2021 16:38:51 - INFO - __main__ - Step 137717: {'lr': 8.450917179571306e-06, 'samples': 26441664, 'steps': 137716, 'loss/train': 1.0965403318405151} 11/07/2021 16:38:51 - INFO - __main__ - Step 137718: {'lr': 8.449549116090472e-06, 'samples': 26441856, 'steps': 137717, 'loss/train': 1.392529010772705} 11/07/2021 16:38:52 - INFO - __main__ - Step 137719: {'lr': 8.44818116144852e-06, 'samples': 26442048, 'steps': 137718, 'loss/train': 1.118012547492981} 11/07/2021 16:38:53 - INFO - __main__ - Step 137720: {'lr': 8.446813315646146e-06, 'samples': 26442240, 'steps': 137719, 'loss/train': 1.470406413078308} 11/07/2021 16:38:53 - INFO - __main__ - Step 137721: {'lr': 8.445445578683847e-06, 'samples': 26442432, 'steps': 137720, 'loss/train': 0.9848471283912659} 11/07/2021 16:38:53 - INFO - __main__ - Step 137722: {'lr': 8.444077950562318e-06, 'samples': 26442624, 'steps': 137721, 'loss/train': 1.3410998582839966} 11/07/2021 16:38:54 - INFO - __main__ - Step 137723: {'lr': 8.442710431282169e-06, 'samples': 26442816, 'steps': 137722, 'loss/train': 1.044102430343628} 11/07/2021 16:38:55 - INFO - __main__ - Step 137724: {'lr': 8.441343020844011e-06, 'samples': 26443008, 'steps': 137723, 'loss/train': 0.68837571144104} 11/07/2021 16:38:56 - INFO - __main__ - Step 137725: {'lr': 8.439975719248454e-06, 'samples': 26443200, 'steps': 137724, 'loss/train': 1.0979490280151367} 11/07/2021 16:38:56 - INFO - __main__ - Step 137726: {'lr': 8.438608526496111e-06, 'samples': 26443392, 'steps': 137725, 'loss/train': 1.4615763425827026} 11/07/2021 16:38:56 - INFO - __main__ - Step 137727: {'lr': 8.43724144258759e-06, 'samples': 26443584, 'steps': 137726, 'loss/train': 1.4552441835403442} 11/07/2021 16:38:57 - INFO - __main__ - Step 137728: {'lr': 8.43587446752353e-06, 'samples': 26443776, 'steps': 137727, 'loss/train': 1.6425492763519287} 11/07/2021 16:38:57 - INFO - __main__ - Step 137729: {'lr': 8.434507601304542e-06, 'samples': 26443968, 'steps': 137728, 'loss/train': 0.8925405740737915} 11/07/2021 16:38:58 - INFO - __main__ - Step 137730: {'lr': 8.433140843931236e-06, 'samples': 26444160, 'steps': 137729, 'loss/train': 0.12648500502109528} 11/07/2021 16:38:58 - INFO - __main__ - Step 137731: {'lr': 8.431774195404224e-06, 'samples': 26444352, 'steps': 137730, 'loss/train': 1.8947688341140747} 11/07/2021 16:38:59 - INFO - __main__ - Step 137732: {'lr': 8.430407655724143e-06, 'samples': 26444544, 'steps': 137731, 'loss/train': 1.0978654623031616} 11/07/2021 16:38:59 - INFO - __main__ - Step 137733: {'lr': 8.429041224891604e-06, 'samples': 26444736, 'steps': 137732, 'loss/train': 0.8685097098350525} 11/07/2021 16:39:00 - INFO - __main__ - Step 137734: {'lr': 8.42767490290719e-06, 'samples': 26444928, 'steps': 137733, 'loss/train': 1.3267978429794312} 11/07/2021 16:39:01 - INFO - __main__ - Step 137735: {'lr': 8.42630868977154e-06, 'samples': 26445120, 'steps': 137734, 'loss/train': 1.2536112070083618} 11/07/2021 16:39:01 - INFO - __main__ - Step 137736: {'lr': 8.424942585485263e-06, 'samples': 26445312, 'steps': 137735, 'loss/train': 1.2060531377792358} 11/07/2021 16:39:01 - INFO - __main__ - Step 137737: {'lr': 8.42357659004897e-06, 'samples': 26445504, 'steps': 137736, 'loss/train': 1.0376124382019043} 11/07/2021 16:39:02 - INFO - __main__ - Step 137738: {'lr': 8.422210703463302e-06, 'samples': 26445696, 'steps': 137737, 'loss/train': 1.0911765098571777} 11/07/2021 16:39:02 - INFO - __main__ - Step 137739: {'lr': 8.420844925728838e-06, 'samples': 26445888, 'steps': 137738, 'loss/train': 1.0038766860961914} 11/07/2021 16:39:03 - INFO - __main__ - Step 137740: {'lr': 8.419479256846247e-06, 'samples': 26446080, 'steps': 137739, 'loss/train': 1.1317517757415771} 11/07/2021 16:39:03 - INFO - __main__ - Step 137741: {'lr': 8.418113696816083e-06, 'samples': 26446272, 'steps': 137740, 'loss/train': 1.5203860998153687} 11/07/2021 16:39:04 - INFO - __main__ - Step 137742: {'lr': 8.416748245638984e-06, 'samples': 26446464, 'steps': 137741, 'loss/train': 0.6703207492828369} 11/07/2021 16:39:04 - INFO - __main__ - Step 137743: {'lr': 8.415382903315588e-06, 'samples': 26446656, 'steps': 137742, 'loss/train': 0.8663391470909119} 11/07/2021 16:39:04 - INFO - __main__ - Step 137744: {'lr': 8.414017669846507e-06, 'samples': 26446848, 'steps': 137743, 'loss/train': 1.5488677024841309} 11/07/2021 16:39:06 - INFO - __main__ - Step 137745: {'lr': 8.412652545232325e-06, 'samples': 26447040, 'steps': 137744, 'loss/train': 1.57046639919281} 11/07/2021 16:39:07 - INFO - __main__ - Step 137746: {'lr': 8.41128752947365e-06, 'samples': 26447232, 'steps': 137745, 'loss/train': 1.590279221534729} 11/07/2021 16:39:07 - INFO - __main__ - Step 137747: {'lr': 8.409922622571175e-06, 'samples': 26447424, 'steps': 137746, 'loss/train': 1.3203306198120117} 11/07/2021 16:39:07 - INFO - __main__ - Step 137748: {'lr': 8.408557824525432e-06, 'samples': 26447616, 'steps': 137747, 'loss/train': 0.1379656046628952} 11/07/2021 16:39:08 - INFO - __main__ - Step 137749: {'lr': 8.407193135337055e-06, 'samples': 26447808, 'steps': 137748, 'loss/train': 0.949552059173584} 11/07/2021 16:39:09 - INFO - __main__ - Step 137750: {'lr': 8.405828555006683e-06, 'samples': 26448000, 'steps': 137749, 'loss/train': 2.0286738872528076} 11/07/2021 16:39:09 - INFO - __main__ - Step 137751: {'lr': 8.404464083534901e-06, 'samples': 26448192, 'steps': 137750, 'loss/train': 1.5943028926849365} 11/07/2021 16:39:10 - INFO - __main__ - Step 137752: {'lr': 8.403099720922347e-06, 'samples': 26448384, 'steps': 137751, 'loss/train': 1.489646077156067} 11/07/2021 16:39:10 - INFO - __main__ - Step 137753: {'lr': 8.40173546716963e-06, 'samples': 26448576, 'steps': 137752, 'loss/train': 0.18296682834625244} 11/07/2021 16:39:10 - INFO - __main__ - Step 137754: {'lr': 8.400371322277362e-06, 'samples': 26448768, 'steps': 137753, 'loss/train': 1.525024652481079} 11/07/2021 16:39:11 - INFO - __main__ - Step 137755: {'lr': 8.399007286246153e-06, 'samples': 26448960, 'steps': 137754, 'loss/train': 0.7926852107048035} 11/07/2021 16:39:12 - INFO - __main__ - Step 137756: {'lr': 8.39764335907664e-06, 'samples': 26449152, 'steps': 137755, 'loss/train': 1.5560554265975952} 11/07/2021 16:39:12 - INFO - __main__ - Step 137757: {'lr': 8.396279540769409e-06, 'samples': 26449344, 'steps': 137756, 'loss/train': 0.9651577472686768} 11/07/2021 16:39:12 - INFO - __main__ - Step 137758: {'lr': 8.394915831325095e-06, 'samples': 26449536, 'steps': 137757, 'loss/train': 1.2852766513824463} 11/07/2021 16:39:13 - INFO - __main__ - Step 137759: {'lr': 8.393552230744283e-06, 'samples': 26449728, 'steps': 137758, 'loss/train': 1.2874804735183716} 11/07/2021 16:39:14 - INFO - __main__ - Step 137760: {'lr': 8.39218873902764e-06, 'samples': 26449920, 'steps': 137759, 'loss/train': 1.1725614070892334} 11/07/2021 16:39:14 - INFO - __main__ - Step 137761: {'lr': 8.390825356175746e-06, 'samples': 26450112, 'steps': 137760, 'loss/train': 1.2146999835968018} 11/07/2021 16:39:15 - INFO - __main__ - Step 137762: {'lr': 8.389462082189187e-06, 'samples': 26450304, 'steps': 137761, 'loss/train': 1.2552136182785034} 11/07/2021 16:39:15 - INFO - __main__ - Step 137763: {'lr': 8.388098917068626e-06, 'samples': 26450496, 'steps': 137762, 'loss/train': 1.2836867570877075} 11/07/2021 16:39:15 - INFO - __main__ - Step 137764: {'lr': 8.386735860814649e-06, 'samples': 26450688, 'steps': 137763, 'loss/train': 0.8264093995094299} 11/07/2021 16:39:16 - INFO - __main__ - Step 137765: {'lr': 8.385372913427892e-06, 'samples': 26450880, 'steps': 137764, 'loss/train': 1.086832046508789} 11/07/2021 16:39:17 - INFO - __main__ - Step 137766: {'lr': 8.384010074908965e-06, 'samples': 26451072, 'steps': 137765, 'loss/train': 1.1627835035324097} 11/07/2021 16:39:17 - INFO - __main__ - Step 137767: {'lr': 8.382647345258454e-06, 'samples': 26451264, 'steps': 137766, 'loss/train': 0.47974061965942383} 11/07/2021 16:39:17 - INFO - __main__ - Step 137768: {'lr': 8.381284724476995e-06, 'samples': 26451456, 'steps': 137767, 'loss/train': 1.551268458366394} 11/07/2021 16:39:18 - INFO - __main__ - Step 137769: {'lr': 8.3799222125652e-06, 'samples': 26451648, 'steps': 137768, 'loss/train': 1.4600837230682373} 11/07/2021 16:39:19 - INFO - __main__ - Step 137770: {'lr': 8.378559809523705e-06, 'samples': 26451840, 'steps': 137769, 'loss/train': 2.6537599563598633} 11/07/2021 16:39:19 - INFO - __main__ - Step 137771: {'lr': 8.377197515353097e-06, 'samples': 26452032, 'steps': 137770, 'loss/train': 1.42970871925354} 11/07/2021 16:39:19 - INFO - __main__ - Step 137772: {'lr': 8.375835330053982e-06, 'samples': 26452224, 'steps': 137771, 'loss/train': 1.1900665760040283} 11/07/2021 16:39:20 - INFO - __main__ - Step 137773: {'lr': 8.374473253627001e-06, 'samples': 26452416, 'steps': 137772, 'loss/train': 1.3388853073120117} 11/07/2021 16:39:20 - INFO - __main__ - Step 137774: {'lr': 8.373111286072765e-06, 'samples': 26452608, 'steps': 137773, 'loss/train': 1.2715634107589722} 11/07/2021 16:39:21 - INFO - __main__ - Step 137775: {'lr': 8.371749427391857e-06, 'samples': 26452800, 'steps': 137774, 'loss/train': 2.134425401687622} 11/07/2021 16:39:21 - INFO - __main__ - Step 137776: {'lr': 8.370387677584912e-06, 'samples': 26452992, 'steps': 137775, 'loss/train': 1.0358268022537231} 11/07/2021 16:39:22 - INFO - __main__ - Step 137777: {'lr': 8.369026036652517e-06, 'samples': 26453184, 'steps': 137776, 'loss/train': 1.354178786277771} 11/07/2021 16:39:22 - INFO - __main__ - Step 137778: {'lr': 8.367664504595334e-06, 'samples': 26453376, 'steps': 137777, 'loss/train': 0.9535612463951111} 11/07/2021 16:39:23 - INFO - __main__ - Step 137779: {'lr': 8.36630308141395e-06, 'samples': 26453568, 'steps': 137778, 'loss/train': 1.7397823333740234} 11/07/2021 16:39:23 - INFO - __main__ - Step 137780: {'lr': 8.364941767109003e-06, 'samples': 26453760, 'steps': 137779, 'loss/train': 1.3217322826385498} 11/07/2021 16:39:24 - INFO - __main__ - Step 137781: {'lr': 8.363580561681045e-06, 'samples': 26453952, 'steps': 137780, 'loss/train': 1.2728519439697266} 11/07/2021 16:39:24 - INFO - __main__ - Step 137782: {'lr': 8.362219465130745e-06, 'samples': 26454144, 'steps': 137781, 'loss/train': 0.9612738490104675} 11/07/2021 16:39:25 - INFO - __main__ - Step 137783: {'lr': 8.36085847745871e-06, 'samples': 26454336, 'steps': 137782, 'loss/train': 1.2989832162857056} 11/07/2021 16:39:25 - INFO - __main__ - Step 137784: {'lr': 8.359497598665555e-06, 'samples': 26454528, 'steps': 137783, 'loss/train': 1.2953416109085083} 11/07/2021 16:39:25 - INFO - __main__ - Step 137785: {'lr': 8.358136828751888e-06, 'samples': 26454720, 'steps': 137784, 'loss/train': 1.0862549543380737} 11/07/2021 16:39:26 - INFO - __main__ - Step 137786: {'lr': 8.356776167718267e-06, 'samples': 26454912, 'steps': 137785, 'loss/train': 1.2038394212722778} 11/07/2021 16:39:27 - INFO - __main__ - Step 137787: {'lr': 8.355415615565381e-06, 'samples': 26455104, 'steps': 137786, 'loss/train': 1.1561620235443115} 11/07/2021 16:39:27 - INFO - __main__ - Step 137788: {'lr': 8.354055172293817e-06, 'samples': 26455296, 'steps': 137787, 'loss/train': 1.136642575263977} 11/07/2021 16:39:27 - INFO - __main__ - Step 137789: {'lr': 8.352694837904184e-06, 'samples': 26455488, 'steps': 137788, 'loss/train': 1.2511568069458008} 11/07/2021 16:39:28 - INFO - __main__ - Step 137790: {'lr': 8.351334612397093e-06, 'samples': 26455680, 'steps': 137789, 'loss/train': 1.3211346864700317} 11/07/2021 16:39:29 - INFO - __main__ - Step 137791: {'lr': 8.349974495773182e-06, 'samples': 26455872, 'steps': 137790, 'loss/train': 1.3021167516708374} 11/07/2021 16:39:29 - INFO - __main__ - Step 137792: {'lr': 8.348614488033008e-06, 'samples': 26456064, 'steps': 137791, 'loss/train': 1.16098153591156} 11/07/2021 16:39:30 - INFO - __main__ - Step 137793: {'lr': 8.347254589177234e-06, 'samples': 26456256, 'steps': 137792, 'loss/train': 1.1780179738998413} 11/07/2021 16:39:30 - INFO - __main__ - Step 137794: {'lr': 8.345894799206471e-06, 'samples': 26456448, 'steps': 137793, 'loss/train': 1.276511788368225} 11/07/2021 16:39:30 - INFO - __main__ - Step 137795: {'lr': 8.344535118121333e-06, 'samples': 26456640, 'steps': 137794, 'loss/train': 1.3721858263015747} 11/07/2021 16:39:32 - INFO - __main__ - Step 137796: {'lr': 8.343175545922399e-06, 'samples': 26456832, 'steps': 137795, 'loss/train': 1.1435397863388062} 11/07/2021 16:39:32 - INFO - __main__ - Step 137797: {'lr': 8.34181608261031e-06, 'samples': 26457024, 'steps': 137796, 'loss/train': 1.1372554302215576} 11/07/2021 16:39:32 - INFO - __main__ - Step 137798: {'lr': 8.340456728185647e-06, 'samples': 26457216, 'steps': 137797, 'loss/train': 1.1646785736083984} 11/07/2021 16:39:33 - INFO - __main__ - Step 137799: {'lr': 8.33909748264905e-06, 'samples': 26457408, 'steps': 137798, 'loss/train': 0.11282099783420563} 11/07/2021 16:39:33 - INFO - __main__ - Step 137800: {'lr': 8.337738346001128e-06, 'samples': 26457600, 'steps': 137799, 'loss/train': 1.3552974462509155} 11/07/2021 16:39:34 - INFO - __main__ - Step 137801: {'lr': 8.336379318242521e-06, 'samples': 26457792, 'steps': 137800, 'loss/train': 1.3809826374053955} 11/07/2021 16:39:34 - INFO - __main__ - Step 137802: {'lr': 8.335020399373784e-06, 'samples': 26457984, 'steps': 137801, 'loss/train': 1.006208896636963} 11/07/2021 16:39:35 - INFO - __main__ - Step 137803: {'lr': 8.333661589395553e-06, 'samples': 26458176, 'steps': 137802, 'loss/train': 1.2005298137664795} 11/07/2021 16:39:35 - INFO - __main__ - Step 137804: {'lr': 8.332302888308441e-06, 'samples': 26458368, 'steps': 137803, 'loss/train': 1.5611279010772705} 11/07/2021 16:39:35 - INFO - __main__ - Step 137805: {'lr': 8.330944296113085e-06, 'samples': 26458560, 'steps': 137804, 'loss/train': 1.1964317560195923} 11/07/2021 16:39:36 - INFO - __main__ - Step 137806: {'lr': 8.329585812810097e-06, 'samples': 26458752, 'steps': 137805, 'loss/train': 1.4976880550384521} 11/07/2021 16:39:37 - INFO - __main__ - Step 137807: {'lr': 8.328227438400032e-06, 'samples': 26458944, 'steps': 137806, 'loss/train': 0.041956957429647446} 11/07/2021 16:39:37 - INFO - __main__ - Step 137808: {'lr': 8.326869172883555e-06, 'samples': 26459136, 'steps': 137807, 'loss/train': 1.2664568424224854} 11/07/2021 16:39:38 - INFO - __main__ - Step 137809: {'lr': 8.32551101626125e-06, 'samples': 26459328, 'steps': 137808, 'loss/train': 1.1506469249725342} 11/07/2021 16:39:38 - INFO - __main__ - Step 137810: {'lr': 8.324152968533755e-06, 'samples': 26459520, 'steps': 137809, 'loss/train': 0.667718231678009} 11/07/2021 16:39:38 - INFO - __main__ - Step 137811: {'lr': 8.322795029701651e-06, 'samples': 26459712, 'steps': 137810, 'loss/train': 1.2357373237609863} 11/07/2021 16:39:39 - INFO - __main__ - Step 137812: {'lr': 8.321437199765552e-06, 'samples': 26459904, 'steps': 137811, 'loss/train': 1.4127464294433594} 11/07/2021 16:39:40 - INFO - __main__ - Step 137813: {'lr': 8.320079478726123e-06, 'samples': 26460096, 'steps': 137812, 'loss/train': 1.8061062097549438} 11/07/2021 16:39:40 - INFO - __main__ - Step 137814: {'lr': 8.318721866583917e-06, 'samples': 26460288, 'steps': 137813, 'loss/train': 1.2443394660949707} 11/07/2021 16:39:40 - INFO - __main__ - Step 137815: {'lr': 8.317364363339547e-06, 'samples': 26460480, 'steps': 137814, 'loss/train': 0.7185616493225098} 11/07/2021 16:39:41 - INFO - __main__ - Step 137816: {'lr': 8.316006968993678e-06, 'samples': 26460672, 'steps': 137815, 'loss/train': 0.5714501738548279} 11/07/2021 16:39:42 - INFO - __main__ - Step 137817: {'lr': 8.314649683546893e-06, 'samples': 26460864, 'steps': 137816, 'loss/train': 1.2755168676376343} 11/07/2021 16:39:42 - INFO - __main__ - Step 137818: {'lr': 8.313292506999775e-06, 'samples': 26461056, 'steps': 137817, 'loss/train': 1.1557234525680542} 11/07/2021 16:39:42 - INFO - __main__ - Step 137819: {'lr': 8.311935439352964e-06, 'samples': 26461248, 'steps': 137818, 'loss/train': 0.9175376892089844} 11/07/2021 16:39:43 - INFO - __main__ - Step 137820: {'lr': 8.31057848060704e-06, 'samples': 26461440, 'steps': 137819, 'loss/train': 1.2484787702560425} 11/07/2021 16:39:43 - INFO - __main__ - Step 137821: {'lr': 8.30922163076267e-06, 'samples': 26461632, 'steps': 137820, 'loss/train': 0.8120062351226807} 11/07/2021 16:39:44 - INFO - __main__ - Step 137822: {'lr': 8.30786488982041e-06, 'samples': 26461824, 'steps': 137821, 'loss/train': 1.5643484592437744} 11/07/2021 16:39:45 - INFO - __main__ - Step 137823: {'lr': 8.306508257780926e-06, 'samples': 26462016, 'steps': 137822, 'loss/train': 1.3172733783721924} 11/07/2021 16:39:45 - INFO - __main__ - Step 137824: {'lr': 8.305151734644773e-06, 'samples': 26462208, 'steps': 137823, 'loss/train': 1.2881133556365967} 11/07/2021 16:39:45 - INFO - __main__ - Step 137825: {'lr': 8.303795320412616e-06, 'samples': 26462400, 'steps': 137824, 'loss/train': 1.6131477355957031} 11/07/2021 16:39:46 - INFO - __main__ - Step 137826: {'lr': 8.302439015085012e-06, 'samples': 26462592, 'steps': 137825, 'loss/train': 0.8413287401199341} 11/07/2021 16:39:47 - INFO - __main__ - Step 137827: {'lr': 8.301082818662626e-06, 'samples': 26462784, 'steps': 137826, 'loss/train': 1.212311863899231} 11/07/2021 16:39:47 - INFO - __main__ - Step 137828: {'lr': 8.29972673114604e-06, 'samples': 26462976, 'steps': 137827, 'loss/train': 1.6159694194793701} 11/07/2021 16:39:47 - INFO - __main__ - Step 137829: {'lr': 8.298370752535866e-06, 'samples': 26463168, 'steps': 137828, 'loss/train': 0.6158146858215332} 11/07/2021 16:39:48 - INFO - __main__ - Step 137830: {'lr': 8.297014882832687e-06, 'samples': 26463360, 'steps': 137829, 'loss/train': 1.009566068649292} 11/07/2021 16:39:48 - INFO - __main__ - Step 137831: {'lr': 8.295659122037168e-06, 'samples': 26463552, 'steps': 137830, 'loss/train': 1.2722722291946411} 11/07/2021 16:39:49 - INFO - __main__ - Step 137832: {'lr': 8.294303470149894e-06, 'samples': 26463744, 'steps': 137831, 'loss/train': 1.5096491575241089} 11/07/2021 16:39:49 - INFO - __main__ - Step 137833: {'lr': 8.292947927171473e-06, 'samples': 26463936, 'steps': 137832, 'loss/train': 1.4361953735351562} 11/07/2021 16:39:50 - INFO - __main__ - Step 137834: {'lr': 8.291592493102517e-06, 'samples': 26464128, 'steps': 137833, 'loss/train': 1.2454783916473389} 11/07/2021 16:39:50 - INFO - __main__ - Step 137835: {'lr': 8.290237167943637e-06, 'samples': 26464320, 'steps': 137834, 'loss/train': 1.4677796363830566} 11/07/2021 16:39:50 - INFO - __main__ - Step 137836: {'lr': 8.288881951695443e-06, 'samples': 26464512, 'steps': 137835, 'loss/train': 1.5104912519454956} 11/07/2021 16:39:51 - INFO - __main__ - Step 137837: {'lr': 8.287526844358572e-06, 'samples': 26464704, 'steps': 137836, 'loss/train': 1.0686516761779785} 11/07/2021 16:39:52 - INFO - __main__ - Step 137838: {'lr': 8.286171845933583e-06, 'samples': 26464896, 'steps': 137837, 'loss/train': 1.6875569820404053} 11/07/2021 16:39:52 - INFO - __main__ - Step 137839: {'lr': 8.284816956421138e-06, 'samples': 26465088, 'steps': 137838, 'loss/train': 1.2288928031921387} 11/07/2021 16:39:53 - INFO - __main__ - Step 137840: {'lr': 8.283462175821821e-06, 'samples': 26465280, 'steps': 137839, 'loss/train': 1.6502965688705444} 11/07/2021 16:39:53 - INFO - __main__ - Step 137841: {'lr': 8.28210750413627e-06, 'samples': 26465472, 'steps': 137840, 'loss/train': 1.433634638786316} 11/07/2021 16:39:53 - INFO - __main__ - Step 137842: {'lr': 8.280752941365044e-06, 'samples': 26465664, 'steps': 137841, 'loss/train': 1.0420691967010498} 11/07/2021 16:39:54 - INFO - __main__ - Step 137843: {'lr': 8.279398487508776e-06, 'samples': 26465856, 'steps': 137842, 'loss/train': 0.2752953767776489} 11/07/2021 16:39:55 - INFO - __main__ - Step 137844: {'lr': 8.278044142568081e-06, 'samples': 26466048, 'steps': 137843, 'loss/train': 1.3553788661956787} 11/07/2021 16:39:55 - INFO - __main__ - Step 137845: {'lr': 8.276689906543566e-06, 'samples': 26466240, 'steps': 137844, 'loss/train': 1.093527913093567} 11/07/2021 16:39:55 - INFO - __main__ - Step 137846: {'lr': 8.275335779435845e-06, 'samples': 26466432, 'steps': 137845, 'loss/train': 1.0632084608078003} 11/07/2021 16:39:56 - INFO - __main__ - Step 137847: {'lr': 8.273981761245525e-06, 'samples': 26466624, 'steps': 137846, 'loss/train': 1.2091325521469116} 11/07/2021 16:39:57 - INFO - __main__ - Step 137848: {'lr': 8.272627851973246e-06, 'samples': 26466816, 'steps': 137847, 'loss/train': 1.247051477432251} 11/07/2021 16:39:57 - INFO - __main__ - Step 137849: {'lr': 8.271274051619566e-06, 'samples': 26467008, 'steps': 137848, 'loss/train': 1.4503099918365479} 11/07/2021 16:39:57 - INFO - __main__ - Step 137850: {'lr': 8.269920360185118e-06, 'samples': 26467200, 'steps': 137849, 'loss/train': 1.1786552667617798} 11/07/2021 16:39:58 - INFO - __main__ - Step 137851: {'lr': 8.268566777670516e-06, 'samples': 26467392, 'steps': 137850, 'loss/train': 1.4199576377868652} 11/07/2021 16:39:58 - INFO - __main__ - Step 137852: {'lr': 8.267213304076371e-06, 'samples': 26467584, 'steps': 137851, 'loss/train': 1.4050257205963135} 11/07/2021 16:39:59 - INFO - __main__ - Step 137853: {'lr': 8.265859939403292e-06, 'samples': 26467776, 'steps': 137852, 'loss/train': 1.187874436378479} 11/07/2021 16:40:00 - INFO - __main__ - Step 137854: {'lr': 8.264506683651918e-06, 'samples': 26467968, 'steps': 137853, 'loss/train': 0.7401227951049805} 11/07/2021 16:40:00 - INFO - __main__ - Step 137855: {'lr': 8.263153536822804e-06, 'samples': 26468160, 'steps': 137854, 'loss/train': 1.4166775941848755} 11/07/2021 16:40:00 - INFO - __main__ - Step 137856: {'lr': 8.261800498916561e-06, 'samples': 26468352, 'steps': 137855, 'loss/train': 1.3209306001663208} 11/07/2021 16:40:01 - INFO - __main__ - Step 137857: {'lr': 8.260447569933827e-06, 'samples': 26468544, 'steps': 137856, 'loss/train': 1.2415046691894531} 11/07/2021 16:40:02 - INFO - __main__ - Step 137858: {'lr': 8.259094749875213e-06, 'samples': 26468736, 'steps': 137857, 'loss/train': 1.4171078205108643} 11/07/2021 16:40:02 - INFO - __main__ - Step 137859: {'lr': 8.257742038741327e-06, 'samples': 26468928, 'steps': 137858, 'loss/train': 0.8585596084594727} 11/07/2021 16:40:02 - INFO - __main__ - Step 137860: {'lr': 8.256389436532757e-06, 'samples': 26469120, 'steps': 137859, 'loss/train': 1.3093597888946533} 11/07/2021 16:40:03 - INFO - __main__ - Step 137861: {'lr': 8.255036943250139e-06, 'samples': 26469312, 'steps': 137860, 'loss/train': 1.2386130094528198} 11/07/2021 16:40:03 - INFO - __main__ - Step 137862: {'lr': 8.253684558894053e-06, 'samples': 26469504, 'steps': 137861, 'loss/train': 1.5050700902938843} 11/07/2021 16:40:04 - INFO - __main__ - Step 137863: {'lr': 8.252332283465142e-06, 'samples': 26469696, 'steps': 137862, 'loss/train': 1.6125977039337158} 11/07/2021 16:40:05 - INFO - __main__ - Step 137864: {'lr': 8.250980116964013e-06, 'samples': 26469888, 'steps': 137863, 'loss/train': 1.1580384969711304} 11/07/2021 16:40:05 - INFO - __main__ - Step 137865: {'lr': 8.249628059391251e-06, 'samples': 26470080, 'steps': 137864, 'loss/train': 1.3488211631774902} 11/07/2021 16:40:05 - INFO - __main__ - Step 137866: {'lr': 8.248276110747465e-06, 'samples': 26470272, 'steps': 137865, 'loss/train': 1.5337669849395752} 11/07/2021 16:40:06 - INFO - __main__ - Step 137867: {'lr': 8.246924271033268e-06, 'samples': 26470464, 'steps': 137866, 'loss/train': 1.3749207258224487} 11/07/2021 16:40:07 - INFO - __main__ - Step 137868: {'lr': 8.245572540249324e-06, 'samples': 26470656, 'steps': 137867, 'loss/train': 1.3858734369277954} 11/07/2021 16:40:07 - INFO - __main__ - Step 137869: {'lr': 8.244220918396162e-06, 'samples': 26470848, 'steps': 137868, 'loss/train': 1.4529030323028564} 11/07/2021 16:40:07 - INFO - __main__ - Step 137870: {'lr': 8.242869405474418e-06, 'samples': 26471040, 'steps': 137869, 'loss/train': 1.2358059883117676} 11/07/2021 16:40:08 - INFO - __main__ - Step 137871: {'lr': 8.241518001484705e-06, 'samples': 26471232, 'steps': 137870, 'loss/train': 2.603693723678589} 11/07/2021 16:40:08 - INFO - __main__ - Step 137872: {'lr': 8.24016670642766e-06, 'samples': 26471424, 'steps': 137871, 'loss/train': 1.7182608842849731} 11/07/2021 16:40:09 - INFO - __main__ - Step 137873: {'lr': 8.238815520303838e-06, 'samples': 26471616, 'steps': 137872, 'loss/train': 1.2367223501205444} 11/07/2021 16:40:09 - INFO - __main__ - Step 137874: {'lr': 8.237464443113879e-06, 'samples': 26471808, 'steps': 137873, 'loss/train': 1.0653800964355469} 11/07/2021 16:40:10 - INFO - __main__ - Step 137875: {'lr': 8.236113474858393e-06, 'samples': 26472000, 'steps': 137874, 'loss/train': 0.11889683455228806} 11/07/2021 16:40:10 - INFO - __main__ - Step 137876: {'lr': 8.234762615537988e-06, 'samples': 26472192, 'steps': 137875, 'loss/train': 1.3565118312835693} 11/07/2021 16:40:10 - INFO - __main__ - Step 137877: {'lr': 8.23341186515325e-06, 'samples': 26472384, 'steps': 137876, 'loss/train': 1.1907795667648315} 11/07/2021 16:40:12 - INFO - __main__ - Step 137878: {'lr': 8.232061223704817e-06, 'samples': 26472576, 'steps': 137877, 'loss/train': 1.4905675649642944} 11/07/2021 16:40:12 - INFO - __main__ - Step 137879: {'lr': 8.2307106911933e-06, 'samples': 26472768, 'steps': 137878, 'loss/train': 1.1211469173431396} 11/07/2021 16:40:12 - INFO - __main__ - Step 137880: {'lr': 8.229360267619279e-06, 'samples': 26472960, 'steps': 137879, 'loss/train': 0.8419032096862793} 11/07/2021 16:40:13 - INFO - __main__ - Step 137881: {'lr': 8.228009952983395e-06, 'samples': 26473152, 'steps': 137880, 'loss/train': 1.3297134637832642} 11/07/2021 16:40:13 - INFO - __main__ - Step 137882: {'lr': 8.22665974728623e-06, 'samples': 26473344, 'steps': 137881, 'loss/train': 2.0266499519348145} 11/07/2021 16:40:14 - INFO - __main__ - Step 137883: {'lr': 8.225309650528395e-06, 'samples': 26473536, 'steps': 137882, 'loss/train': 1.1770747900009155} 11/07/2021 16:40:14 - INFO - __main__ - Step 137884: {'lr': 8.223959662710501e-06, 'samples': 26473728, 'steps': 137883, 'loss/train': 1.4249211549758911} 11/07/2021 16:40:15 - INFO - __main__ - Step 137885: {'lr': 8.222609783833157e-06, 'samples': 26473920, 'steps': 137884, 'loss/train': 1.3009576797485352} 11/07/2021 16:40:15 - INFO - __main__ - Step 137886: {'lr': 8.221260013896976e-06, 'samples': 26474112, 'steps': 137885, 'loss/train': 1.0649269819259644} 11/07/2021 16:40:15 - INFO - __main__ - Step 137887: {'lr': 8.219910352902565e-06, 'samples': 26474304, 'steps': 137886, 'loss/train': 1.2350525856018066} 11/07/2021 16:40:16 - INFO - __main__ - Step 137888: {'lr': 8.21856080085054e-06, 'samples': 26474496, 'steps': 137887, 'loss/train': 1.2408320903778076} 11/07/2021 16:40:17 - INFO - __main__ - Step 137889: {'lr': 8.217211357741506e-06, 'samples': 26474688, 'steps': 137888, 'loss/train': 1.2019044160842896} 11/07/2021 16:40:17 - INFO - __main__ - Step 137890: {'lr': 8.215862023576049e-06, 'samples': 26474880, 'steps': 137889, 'loss/train': 1.150106430053711} 11/07/2021 16:40:18 - INFO - __main__ - Step 137891: {'lr': 8.214512798354806e-06, 'samples': 26475072, 'steps': 137890, 'loss/train': 1.020194411277771} 11/07/2021 16:40:18 - INFO - __main__ - Step 137892: {'lr': 8.213163682078361e-06, 'samples': 26475264, 'steps': 137891, 'loss/train': 1.8969799280166626} 11/07/2021 16:40:18 - INFO - __main__ - Step 137893: {'lr': 8.211814674747354e-06, 'samples': 26475456, 'steps': 137892, 'loss/train': 1.1365658044815063} 11/07/2021 16:40:20 - INFO - __main__ - Step 137894: {'lr': 8.210465776362364e-06, 'samples': 26475648, 'steps': 137893, 'loss/train': 1.5054981708526611} 11/07/2021 16:40:20 - INFO - __main__ - Step 137895: {'lr': 8.209116986924004e-06, 'samples': 26475840, 'steps': 137894, 'loss/train': 1.303384780883789} 11/07/2021 16:40:20 - INFO - __main__ - Step 137896: {'lr': 8.207768306432883e-06, 'samples': 26476032, 'steps': 137895, 'loss/train': 1.4673556089401245} 11/07/2021 16:40:21 - INFO - __main__ - Step 137897: {'lr': 8.206419734889614e-06, 'samples': 26476224, 'steps': 137896, 'loss/train': 0.049410756677389145} 11/07/2021 16:40:21 - INFO - __main__ - Step 137898: {'lr': 8.205071272294807e-06, 'samples': 26476416, 'steps': 137897, 'loss/train': 1.4186711311340332} 11/07/2021 16:40:22 - INFO - __main__ - Step 137899: {'lr': 8.203722918649042e-06, 'samples': 26476608, 'steps': 137898, 'loss/train': 1.2474533319473267} 11/07/2021 16:40:22 - INFO - __main__ - Step 137900: {'lr': 8.20237467395296e-06, 'samples': 26476800, 'steps': 137899, 'loss/train': 0.9805604815483093} 11/07/2021 16:40:23 - INFO - __main__ - Step 137901: {'lr': 8.201026538207146e-06, 'samples': 26476992, 'steps': 137900, 'loss/train': 0.779588520526886} 11/07/2021 16:40:23 - INFO - __main__ - Step 137902: {'lr': 8.199678511412234e-06, 'samples': 26477184, 'steps': 137901, 'loss/train': 1.4695011377334595} 11/07/2021 16:40:24 - INFO - __main__ - Step 137903: {'lr': 8.198330593568808e-06, 'samples': 26477376, 'steps': 137902, 'loss/train': 1.2930378913879395} 11/07/2021 16:40:25 - INFO - __main__ - Step 137904: {'lr': 8.196982784677482e-06, 'samples': 26477568, 'steps': 137903, 'loss/train': 0.891297459602356} 11/07/2021 16:40:25 - INFO - __main__ - Step 137905: {'lr': 8.195635084738862e-06, 'samples': 26477760, 'steps': 137904, 'loss/train': 1.157989263534546} 11/07/2021 16:40:25 - INFO - __main__ - Step 137906: {'lr': 8.19428749375356e-06, 'samples': 26477952, 'steps': 137905, 'loss/train': 1.1986552476882935} 11/07/2021 16:40:26 - INFO - __main__ - Step 137907: {'lr': 8.192940011722189e-06, 'samples': 26478144, 'steps': 137906, 'loss/train': 1.0945591926574707} 11/07/2021 16:40:26 - INFO - __main__ - Step 137908: {'lr': 8.191592638645384e-06, 'samples': 26478336, 'steps': 137907, 'loss/train': 1.2276482582092285} 11/07/2021 16:40:27 - INFO - __main__ - Step 137909: {'lr': 8.190245374523674e-06, 'samples': 26478528, 'steps': 137908, 'loss/train': 1.2721686363220215} 11/07/2021 16:40:27 - INFO - __main__ - Step 137910: {'lr': 8.1888982193577e-06, 'samples': 26478720, 'steps': 137909, 'loss/train': 1.7058284282684326} 11/07/2021 16:40:28 - INFO - __main__ - Step 137911: {'lr': 8.187551173148095e-06, 'samples': 26478912, 'steps': 137910, 'loss/train': 1.1530160903930664} 11/07/2021 16:40:28 - INFO - __main__ - Step 137912: {'lr': 8.186204235895417e-06, 'samples': 26479104, 'steps': 137911, 'loss/train': 1.2214844226837158} 11/07/2021 16:40:29 - INFO - __main__ - Step 137913: {'lr': 8.184857407600332e-06, 'samples': 26479296, 'steps': 137912, 'loss/train': 1.1660269498825073} 11/07/2021 16:40:29 - INFO - __main__ - Step 137914: {'lr': 8.183510688263424e-06, 'samples': 26479488, 'steps': 137913, 'loss/train': 1.2666398286819458} 11/07/2021 16:40:30 - INFO - __main__ - Step 137915: {'lr': 8.182164077885273e-06, 'samples': 26479680, 'steps': 137914, 'loss/train': 0.9439259171485901} 11/07/2021 16:40:30 - INFO - __main__ - Step 137916: {'lr': 8.18081757646652e-06, 'samples': 26479872, 'steps': 137915, 'loss/train': 0.9490886926651001} 11/07/2021 16:40:31 - INFO - __main__ - Step 137917: {'lr': 8.179471184007748e-06, 'samples': 26480064, 'steps': 137916, 'loss/train': 1.112806797027588} 11/07/2021 16:40:31 - INFO - __main__ - Step 137918: {'lr': 8.178124900509593e-06, 'samples': 26480256, 'steps': 137917, 'loss/train': 1.1863139867782593} 11/07/2021 16:40:31 - INFO - __main__ - Step 137919: {'lr': 8.17677872597264e-06, 'samples': 26480448, 'steps': 137918, 'loss/train': 1.2260940074920654} 11/07/2021 16:40:32 - INFO - __main__ - Step 137920: {'lr': 8.175432660397496e-06, 'samples': 26480640, 'steps': 137919, 'loss/train': 1.372708797454834} 11/07/2021 16:40:33 - INFO - __main__ - Step 137921: {'lr': 8.174086703784779e-06, 'samples': 26480832, 'steps': 137920, 'loss/train': 1.1095609664916992} 11/07/2021 16:40:33 - INFO - __main__ - Step 137922: {'lr': 8.172740856135092e-06, 'samples': 26481024, 'steps': 137921, 'loss/train': 1.1813994646072388} 11/07/2021 16:40:33 - INFO - __main__ - Step 137923: {'lr': 8.171395117449022e-06, 'samples': 26481216, 'steps': 137922, 'loss/train': 1.3964160680770874} 11/07/2021 16:40:34 - INFO - __main__ - Step 137924: {'lr': 8.170049487727177e-06, 'samples': 26481408, 'steps': 137923, 'loss/train': 1.4620877504348755} 11/07/2021 16:40:35 - INFO - __main__ - Step 137925: {'lr': 8.16870396697017e-06, 'samples': 26481600, 'steps': 137924, 'loss/train': 1.3389750719070435} 11/07/2021 16:40:35 - INFO - __main__ - Step 137926: {'lr': 8.167358555178639e-06, 'samples': 26481792, 'steps': 137925, 'loss/train': 1.211728572845459} 11/07/2021 16:40:36 - INFO - __main__ - Step 137927: {'lr': 8.166013252353167e-06, 'samples': 26481984, 'steps': 137926, 'loss/train': 1.0989607572555542} 11/07/2021 16:40:36 - INFO - __main__ - Step 137928: {'lr': 8.164668058494334e-06, 'samples': 26482176, 'steps': 137927, 'loss/train': 1.6181063652038574} 11/07/2021 16:40:36 - INFO - __main__ - Step 137929: {'lr': 8.163322973602782e-06, 'samples': 26482368, 'steps': 137928, 'loss/train': 1.1454355716705322} 11/07/2021 16:40:38 - INFO - __main__ - Step 137930: {'lr': 8.161977997679093e-06, 'samples': 26482560, 'steps': 137929, 'loss/train': 1.567449927330017} 11/07/2021 16:40:38 - INFO - __main__ - Step 137931: {'lr': 8.160633130723904e-06, 'samples': 26482752, 'steps': 137930, 'loss/train': 0.9018440246582031} 11/07/2021 16:40:38 - INFO - __main__ - Step 137932: {'lr': 8.1592883727378e-06, 'samples': 26482944, 'steps': 137931, 'loss/train': 1.010187029838562} 11/07/2021 16:40:39 - INFO - __main__ - Step 137933: {'lr': 8.157943723721389e-06, 'samples': 26483136, 'steps': 137932, 'loss/train': 1.508713960647583} 11/07/2021 16:40:39 - INFO - __main__ - Step 137934: {'lr': 8.156599183675256e-06, 'samples': 26483328, 'steps': 137933, 'loss/train': 2.7429633140563965} 11/07/2021 16:40:40 - INFO - __main__ - Step 137935: {'lr': 8.155254752600067e-06, 'samples': 26483520, 'steps': 137934, 'loss/train': 1.1891270875930786} 11/07/2021 16:40:41 - INFO - __main__ - Step 137936: {'lr': 8.153910430496375e-06, 'samples': 26483712, 'steps': 137935, 'loss/train': 1.3857042789459229} 11/07/2021 16:40:41 - INFO - __main__ - Step 137937: {'lr': 8.152566217364793e-06, 'samples': 26483904, 'steps': 137936, 'loss/train': 1.194496989250183} 11/07/2021 16:40:41 - INFO - __main__ - Step 137938: {'lr': 8.15122211320593e-06, 'samples': 26484096, 'steps': 137937, 'loss/train': 1.5430861711502075} 11/07/2021 16:40:42 - INFO - __main__ - Step 137939: {'lr': 8.149878118020371e-06, 'samples': 26484288, 'steps': 137938, 'loss/train': 1.094005823135376} 11/07/2021 16:40:43 - INFO - __main__ - Step 137940: {'lr': 8.14853423180878e-06, 'samples': 26484480, 'steps': 137939, 'loss/train': 0.9173237681388855} 11/07/2021 16:40:43 - INFO - __main__ - Step 137941: {'lr': 8.147190454571712e-06, 'samples': 26484672, 'steps': 137940, 'loss/train': 1.3123762607574463} 11/07/2021 16:40:44 - INFO - __main__ - Step 137942: {'lr': 8.14584678630978e-06, 'samples': 26484864, 'steps': 137941, 'loss/train': 1.5153683423995972} 11/07/2021 16:40:44 - INFO - __main__ - Step 137943: {'lr': 8.14450322702362e-06, 'samples': 26485056, 'steps': 137942, 'loss/train': 1.2760436534881592} 11/07/2021 16:40:44 - INFO - __main__ - Step 137944: {'lr': 8.143159776713788e-06, 'samples': 26485248, 'steps': 137943, 'loss/train': 1.1763310432434082} 11/07/2021 16:40:45 - INFO - __main__ - Step 137945: {'lr': 8.141816435380923e-06, 'samples': 26485440, 'steps': 137944, 'loss/train': 0.9733001589775085} 11/07/2021 16:40:46 - INFO - __main__ - Step 137946: {'lr': 8.140473203025633e-06, 'samples': 26485632, 'steps': 137945, 'loss/train': 1.3844585418701172} 11/07/2021 16:40:46 - INFO - __main__ - Step 137947: {'lr': 8.139130079648505e-06, 'samples': 26485824, 'steps': 137946, 'loss/train': 1.3309623003005981} 11/07/2021 16:40:46 - INFO - __main__ - Step 137948: {'lr': 8.137787065250202e-06, 'samples': 26486016, 'steps': 137947, 'loss/train': 1.4829459190368652} 11/07/2021 16:40:47 - INFO - __main__ - Step 137949: {'lr': 8.136444159831225e-06, 'samples': 26486208, 'steps': 137948, 'loss/train': 1.276607632637024} 11/07/2021 16:40:48 - INFO - __main__ - Step 137950: {'lr': 8.135101363392266e-06, 'samples': 26486400, 'steps': 137949, 'loss/train': 1.2804450988769531} 11/07/2021 16:40:48 - INFO - __main__ - Step 137951: {'lr': 8.133758675933856e-06, 'samples': 26486592, 'steps': 137950, 'loss/train': 0.38918378949165344} 11/07/2021 16:40:49 - INFO - __main__ - Step 137952: {'lr': 8.132416097456686e-06, 'samples': 26486784, 'steps': 137951, 'loss/train': 1.0144259929656982} 11/07/2021 16:40:49 - INFO - __main__ - Step 137953: {'lr': 8.131073627961283e-06, 'samples': 26486976, 'steps': 137952, 'loss/train': 1.1520601511001587} 11/07/2021 16:40:49 - INFO - __main__ - Step 137954: {'lr': 8.129731267448287e-06, 'samples': 26487168, 'steps': 137953, 'loss/train': 1.4040828943252563} 11/07/2021 16:40:50 - INFO - __main__ - Step 137955: {'lr': 8.128389015918309e-06, 'samples': 26487360, 'steps': 137954, 'loss/train': 0.6227376461029053} 11/07/2021 16:40:51 - INFO - __main__ - Step 137956: {'lr': 8.127046873371958e-06, 'samples': 26487552, 'steps': 137955, 'loss/train': 1.18320894241333} 11/07/2021 16:40:51 - INFO - __main__ - Step 137957: {'lr': 8.125704839809816e-06, 'samples': 26487744, 'steps': 137956, 'loss/train': 0.07955396175384521} 11/07/2021 16:40:52 - INFO - __main__ - Step 137958: {'lr': 8.124362915232497e-06, 'samples': 26487936, 'steps': 137957, 'loss/train': 1.335862398147583} 11/07/2021 16:40:52 - INFO - __main__ - Step 137959: {'lr': 8.123021099640637e-06, 'samples': 26488128, 'steps': 137958, 'loss/train': 1.2632824182510376} 11/07/2021 16:40:53 - INFO - __main__ - Step 137960: {'lr': 8.121679393034765e-06, 'samples': 26488320, 'steps': 137959, 'loss/train': 1.1624997854232788} 11/07/2021 16:40:54 - INFO - __main__ - Step 137961: {'lr': 8.120337795415573e-06, 'samples': 26488512, 'steps': 137960, 'loss/train': 1.1098434925079346} 11/07/2021 16:40:54 - INFO - __main__ - Step 137962: {'lr': 8.118996306783616e-06, 'samples': 26488704, 'steps': 137961, 'loss/train': 1.3900065422058105} 11/07/2021 16:40:54 - INFO - __main__ - Step 137963: {'lr': 8.117654927139505e-06, 'samples': 26488896, 'steps': 137962, 'loss/train': 1.1958801746368408} 11/07/2021 16:40:55 - INFO - __main__ - Step 137964: {'lr': 8.116313656483825e-06, 'samples': 26489088, 'steps': 137963, 'loss/train': 1.7544571161270142} 11/07/2021 16:40:55 - INFO - __main__ - Step 137965: {'lr': 8.114972494817242e-06, 'samples': 26489280, 'steps': 137964, 'loss/train': 1.3633534908294678} 11/07/2021 16:40:56 - INFO - __main__ - Step 137966: {'lr': 8.11363144214028e-06, 'samples': 26489472, 'steps': 137965, 'loss/train': 1.2621430158615112} 11/07/2021 16:40:56 - INFO - __main__ - Step 137967: {'lr': 8.112290498453607e-06, 'samples': 26489664, 'steps': 137966, 'loss/train': 1.1528464555740356} 11/07/2021 16:40:57 - INFO - __main__ - Step 137968: {'lr': 8.110949663757777e-06, 'samples': 26489856, 'steps': 137967, 'loss/train': 0.8715443015098572} 11/07/2021 16:40:57 - INFO - __main__ - Step 137969: {'lr': 8.10960893805343e-06, 'samples': 26490048, 'steps': 137968, 'loss/train': 1.2354497909545898} 11/07/2021 16:40:57 - INFO - __main__ - Step 137970: {'lr': 8.108268321341179e-06, 'samples': 26490240, 'steps': 137969, 'loss/train': 1.203884243965149} 11/07/2021 16:40:59 - INFO - __main__ - Step 137971: {'lr': 8.106927813621601e-06, 'samples': 26490432, 'steps': 137970, 'loss/train': 1.0786534547805786} 11/07/2021 16:40:59 - INFO - __main__ - Step 137972: {'lr': 8.105587414895283e-06, 'samples': 26490624, 'steps': 137971, 'loss/train': 1.2221417427062988} 11/07/2021 16:40:59 - INFO - __main__ - Step 137973: {'lr': 8.104247125162889e-06, 'samples': 26490816, 'steps': 137972, 'loss/train': 1.4180271625518799} 11/07/2021 16:41:00 - INFO - __main__ - Step 137974: {'lr': 8.102906944424976e-06, 'samples': 26491008, 'steps': 137973, 'loss/train': 1.2907519340515137} 11/07/2021 16:41:00 - INFO - __main__ - Step 137975: {'lr': 8.101566872682181e-06, 'samples': 26491200, 'steps': 137974, 'loss/train': 0.9927247762680054} 11/07/2021 16:41:00 - INFO - __main__ - Step 137976: {'lr': 8.10022690993506e-06, 'samples': 26491392, 'steps': 137975, 'loss/train': 0.6685069799423218} 11/07/2021 16:41:01 - INFO - __main__ - Step 137977: {'lr': 8.09888705618425e-06, 'samples': 26491584, 'steps': 137976, 'loss/train': 1.235238790512085} 11/07/2021 16:41:02 - INFO - __main__ - Step 137978: {'lr': 8.097547311430364e-06, 'samples': 26491776, 'steps': 137977, 'loss/train': 0.8033630847930908} 11/07/2021 16:41:02 - INFO - __main__ - Step 137979: {'lr': 8.096207675673955e-06, 'samples': 26491968, 'steps': 137978, 'loss/train': 5.674821376800537} 11/07/2021 16:41:02 - INFO - __main__ - Step 137980: {'lr': 8.09486814891569e-06, 'samples': 26492160, 'steps': 137979, 'loss/train': 1.0044583082199097} 11/07/2021 16:41:03 - INFO - __main__ - Step 137981: {'lr': 8.093528731156153e-06, 'samples': 26492352, 'steps': 137980, 'loss/train': 1.3708528280258179} 11/07/2021 16:41:04 - INFO - __main__ - Step 137982: {'lr': 8.092189422395897e-06, 'samples': 26492544, 'steps': 137981, 'loss/train': 1.1973133087158203} 11/07/2021 16:41:04 - INFO - __main__ - Step 137983: {'lr': 8.090850222635614e-06, 'samples': 26492736, 'steps': 137982, 'loss/train': 1.3327475786209106} 11/07/2021 16:41:05 - INFO - __main__ - Step 137984: {'lr': 8.089511131875838e-06, 'samples': 26492928, 'steps': 137983, 'loss/train': 1.3170430660247803} 11/07/2021 16:41:05 - INFO - __main__ - Step 137985: {'lr': 8.088172150117201e-06, 'samples': 26493120, 'steps': 137984, 'loss/train': 1.7668272256851196} 11/07/2021 16:41:05 - INFO - __main__ - Step 137986: {'lr': 8.086833277360289e-06, 'samples': 26493312, 'steps': 137985, 'loss/train': 1.3026001453399658} 11/07/2021 16:41:07 - INFO - __main__ - Step 137987: {'lr': 8.085494513605713e-06, 'samples': 26493504, 'steps': 137986, 'loss/train': 0.92491614818573} 11/07/2021 16:41:08 - INFO - __main__ - Step 137988: {'lr': 8.084155858854108e-06, 'samples': 26493696, 'steps': 137987, 'loss/train': 1.1139024496078491} 11/07/2021 16:41:08 - INFO - __main__ - Step 137989: {'lr': 8.08281731310606e-06, 'samples': 26493888, 'steps': 137988, 'loss/train': 1.1919673681259155} 11/07/2021 16:41:08 - INFO - __main__ - Step 137990: {'lr': 8.081478876362126e-06, 'samples': 26494080, 'steps': 137989, 'loss/train': 1.093482494354248} 11/07/2021 16:41:09 - INFO - __main__ - Step 137991: {'lr': 8.080140548622966e-06, 'samples': 26494272, 'steps': 137990, 'loss/train': 1.7498149871826172} 11/07/2021 16:41:09 - INFO - __main__ - Step 137992: {'lr': 8.07880232988914e-06, 'samples': 26494464, 'steps': 137991, 'loss/train': 0.8307532072067261} 11/07/2021 16:41:09 - INFO - __main__ - Step 137993: {'lr': 8.077464220161285e-06, 'samples': 26494656, 'steps': 137992, 'loss/train': 0.83493971824646} 11/07/2021 16:41:10 - INFO - __main__ - Step 137994: {'lr': 8.076126219439984e-06, 'samples': 26494848, 'steps': 137993, 'loss/train': 1.148851990699768} 11/07/2021 16:41:11 - INFO - __main__ - Step 137995: {'lr': 8.074788327725873e-06, 'samples': 26495040, 'steps': 137994, 'loss/train': 0.6350134015083313} 11/07/2021 16:41:11 - INFO - __main__ - Step 137996: {'lr': 8.073450545019484e-06, 'samples': 26495232, 'steps': 137995, 'loss/train': 1.0378552675247192} 11/07/2021 16:41:11 - INFO - __main__ - Step 137997: {'lr': 8.072112871321508e-06, 'samples': 26495424, 'steps': 137996, 'loss/train': 0.9528510570526123} 11/07/2021 16:41:12 - INFO - __main__ - Step 137998: {'lr': 8.070775306632472e-06, 'samples': 26495616, 'steps': 137997, 'loss/train': 0.8630245923995972} 11/07/2021 16:41:12 - INFO - __main__ - Step 137999: {'lr': 8.069437850953043e-06, 'samples': 26495808, 'steps': 137998, 'loss/train': 1.4823566675186157} 11/07/2021 16:41:13 - INFO - __main__ - Step 138000: {'lr': 8.068100504283776e-06, 'samples': 26496000, 'steps': 137999, 'loss/train': 1.1735506057739258} 11/07/2021 16:41:14 - INFO - __main__ - Step 138001: {'lr': 8.066763266625281e-06, 'samples': 26496192, 'steps': 138000, 'loss/train': 0.9324312210083008} 11/07/2021 16:41:14 - INFO - __main__ - Step 138002: {'lr': 8.0654261379782e-06, 'samples': 26496384, 'steps': 138001, 'loss/train': 1.7903448343276978} 11/07/2021 16:41:14 - INFO - __main__ - Step 138003: {'lr': 8.064089118343082e-06, 'samples': 26496576, 'steps': 138002, 'loss/train': 1.0617595911026} 11/07/2021 16:41:15 - INFO - __main__ - Step 138004: {'lr': 8.062752207720541e-06, 'samples': 26496768, 'steps': 138003, 'loss/train': 0.8959036469459534} 11/07/2021 16:41:16 - INFO - __main__ - Step 138005: {'lr': 8.061415406111217e-06, 'samples': 26496960, 'steps': 138004, 'loss/train': 0.9095622301101685} 11/07/2021 16:41:16 - INFO - __main__ - Step 138006: {'lr': 8.060078713515661e-06, 'samples': 26497152, 'steps': 138005, 'loss/train': 1.4646943807601929} 11/07/2021 16:41:16 - INFO - __main__ - Step 138007: {'lr': 8.058742129934515e-06, 'samples': 26497344, 'steps': 138006, 'loss/train': 1.5506192445755005} 11/07/2021 16:41:17 - INFO - __main__ - Step 138008: {'lr': 8.05740565536836e-06, 'samples': 26497536, 'steps': 138007, 'loss/train': 1.8804606199264526} 11/07/2021 16:41:17 - INFO - __main__ - Step 138009: {'lr': 8.056069289817807e-06, 'samples': 26497728, 'steps': 138008, 'loss/train': 1.6037894487380981} 11/07/2021 16:41:18 - INFO - __main__ - Step 138010: {'lr': 8.05473303328344e-06, 'samples': 26497920, 'steps': 138009, 'loss/train': 5.689696311950684} 11/07/2021 16:41:19 - INFO - __main__ - Step 138011: {'lr': 8.053396885765896e-06, 'samples': 26498112, 'steps': 138010, 'loss/train': 0.8669865727424622} 11/07/2021 16:41:19 - INFO - __main__ - Step 138012: {'lr': 8.052060847265757e-06, 'samples': 26498304, 'steps': 138011, 'loss/train': 1.1570394039154053} 11/07/2021 16:41:19 - INFO - __main__ - Step 138013: {'lr': 8.050724917783635e-06, 'samples': 26498496, 'steps': 138012, 'loss/train': 1.350688099861145} 11/07/2021 16:41:20 - INFO - __main__ - Step 138014: {'lr': 8.049389097320087e-06, 'samples': 26498688, 'steps': 138013, 'loss/train': 1.25532865524292} 11/07/2021 16:41:20 - INFO - __main__ - Step 138015: {'lr': 8.048053385875803e-06, 'samples': 26498880, 'steps': 138014, 'loss/train': 1.3426413536071777} 11/07/2021 16:41:21 - INFO - __main__ - Step 138016: {'lr': 8.046717783451312e-06, 'samples': 26499072, 'steps': 138015, 'loss/train': 1.1469502449035645} 11/07/2021 16:41:21 - INFO - __main__ - Step 138017: {'lr': 8.045382290047225e-06, 'samples': 26499264, 'steps': 138016, 'loss/train': 1.0615825653076172} 11/07/2021 16:41:22 - INFO - __main__ - Step 138018: {'lr': 8.044046905664155e-06, 'samples': 26499456, 'steps': 138017, 'loss/train': 0.03835948556661606} 11/07/2021 16:41:22 - INFO - __main__ - Step 138019: {'lr': 8.042711630302707e-06, 'samples': 26499648, 'steps': 138018, 'loss/train': 1.406929850578308} 11/07/2021 16:41:23 - INFO - __main__ - Step 138020: {'lr': 8.041376463963496e-06, 'samples': 26499840, 'steps': 138019, 'loss/train': 1.4329389333724976} 11/07/2021 16:41:24 - INFO - __main__ - Step 138021: {'lr': 8.040041406647075e-06, 'samples': 26500032, 'steps': 138020, 'loss/train': 1.087497591972351} 11/07/2021 16:41:24 - INFO - __main__ - Step 138022: {'lr': 8.038706458354084e-06, 'samples': 26500224, 'steps': 138021, 'loss/train': 1.0471478700637817} 11/07/2021 16:41:25 - INFO - __main__ - Step 138023: {'lr': 8.037371619085132e-06, 'samples': 26500416, 'steps': 138022, 'loss/train': 1.2249021530151367} 11/07/2021 16:41:25 - INFO - __main__ - Step 138024: {'lr': 8.036036888840804e-06, 'samples': 26500608, 'steps': 138023, 'loss/train': 1.4123194217681885} 11/07/2021 16:41:25 - INFO - __main__ - Step 138025: {'lr': 8.034702267621708e-06, 'samples': 26500800, 'steps': 138024, 'loss/train': 1.3629988431930542} 11/07/2021 16:41:26 - INFO - __main__ - Step 138026: {'lr': 8.033367755428428e-06, 'samples': 26500992, 'steps': 138025, 'loss/train': 1.329280138015747} 11/07/2021 16:41:27 - INFO - __main__ - Step 138027: {'lr': 8.032033352261576e-06, 'samples': 26501184, 'steps': 138026, 'loss/train': 1.2220975160598755} 11/07/2021 16:41:27 - INFO - __main__ - Step 138028: {'lr': 8.030699058121788e-06, 'samples': 26501376, 'steps': 138027, 'loss/train': 1.1126922369003296} 11/07/2021 16:41:27 - INFO - __main__ - Step 138029: {'lr': 8.02936487300962e-06, 'samples': 26501568, 'steps': 138028, 'loss/train': 0.8965775370597839} 11/07/2021 16:41:28 - INFO - __main__ - Step 138030: {'lr': 8.028030796925684e-06, 'samples': 26501760, 'steps': 138029, 'loss/train': 1.3844457864761353} 11/07/2021 16:41:28 - INFO - __main__ - Step 138031: {'lr': 8.026696829870589e-06, 'samples': 26501952, 'steps': 138030, 'loss/train': 1.171614170074463} 11/07/2021 16:41:29 - INFO - __main__ - Step 138032: {'lr': 8.025362971844918e-06, 'samples': 26502144, 'steps': 138031, 'loss/train': 1.3362126350402832} 11/07/2021 16:41:29 - INFO - __main__ - Step 138033: {'lr': 8.024029222849284e-06, 'samples': 26502336, 'steps': 138032, 'loss/train': 1.3108857870101929} 11/07/2021 16:41:30 - INFO - __main__ - Step 138034: {'lr': 8.022695582884266e-06, 'samples': 26502528, 'steps': 138033, 'loss/train': 1.057707667350769} 11/07/2021 16:41:30 - INFO - __main__ - Step 138035: {'lr': 8.021362051950532e-06, 'samples': 26502720, 'steps': 138034, 'loss/train': 0.07080969214439392} 11/07/2021 16:41:31 - INFO - __main__ - Step 138036: {'lr': 8.02002863004861e-06, 'samples': 26502912, 'steps': 138035, 'loss/train': 1.3078089952468872} 11/07/2021 16:41:32 - INFO - __main__ - Step 138037: {'lr': 8.018695317179137e-06, 'samples': 26503104, 'steps': 138036, 'loss/train': 1.689756155014038} 11/07/2021 16:41:32 - INFO - __main__ - Step 138038: {'lr': 8.017362113342697e-06, 'samples': 26503296, 'steps': 138037, 'loss/train': 1.2872847318649292} 11/07/2021 16:41:32 - INFO - __main__ - Step 138039: {'lr': 8.0160290185399e-06, 'samples': 26503488, 'steps': 138038, 'loss/train': 1.1619681119918823} 11/07/2021 16:41:33 - INFO - __main__ - Step 138040: {'lr': 8.014696032771356e-06, 'samples': 26503680, 'steps': 138039, 'loss/train': 1.3440443277359009} 11/07/2021 16:41:33 - INFO - __main__ - Step 138041: {'lr': 8.01336315603765e-06, 'samples': 26503872, 'steps': 138040, 'loss/train': 1.35250723361969} 11/07/2021 16:41:35 - INFO - __main__ - Step 138042: {'lr': 8.01203038833942e-06, 'samples': 26504064, 'steps': 138041, 'loss/train': 0.9250983595848083} 11/07/2021 16:41:35 - INFO - __main__ - Step 138043: {'lr': 8.010697729677218e-06, 'samples': 26504256, 'steps': 138042, 'loss/train': 1.3741850852966309} 11/07/2021 16:41:35 - INFO - __main__ - Step 138044: {'lr': 8.009365180051658e-06, 'samples': 26504448, 'steps': 138043, 'loss/train': 0.9245524406433105} 11/07/2021 16:41:36 - INFO - __main__ - Step 138045: {'lr': 8.008032739463322e-06, 'samples': 26504640, 'steps': 138044, 'loss/train': 1.3656584024429321} 11/07/2021 16:41:36 - INFO - __main__ - Step 138046: {'lr': 8.006700407912848e-06, 'samples': 26504832, 'steps': 138045, 'loss/train': 0.12280923128128052} 11/07/2021 16:41:37 - INFO - __main__ - Step 138047: {'lr': 8.005368185400818e-06, 'samples': 26505024, 'steps': 138046, 'loss/train': 1.4280928373336792} 11/07/2021 16:41:38 - INFO - __main__ - Step 138048: {'lr': 8.004036071927844e-06, 'samples': 26505216, 'steps': 138047, 'loss/train': 0.9743027687072754} 11/07/2021 16:41:38 - INFO - __main__ - Step 138049: {'lr': 8.002704067494509e-06, 'samples': 26505408, 'steps': 138048, 'loss/train': 1.3558984994888306} 11/07/2021 16:41:38 - INFO - __main__ - Step 138050: {'lr': 8.001372172101422e-06, 'samples': 26505600, 'steps': 138049, 'loss/train': 1.1748219728469849} 11/07/2021 16:41:39 - INFO - __main__ - Step 138051: {'lr': 8.000040385749196e-06, 'samples': 26505792, 'steps': 138050, 'loss/train': 2.0079288482666016} 11/07/2021 16:41:39 - INFO - __main__ - Step 138052: {'lr': 7.998708708438384e-06, 'samples': 26505984, 'steps': 138051, 'loss/train': 1.1836298704147339} 11/07/2021 16:41:40 - INFO - __main__ - Step 138053: {'lr': 7.997377140169681e-06, 'samples': 26506176, 'steps': 138052, 'loss/train': 1.1587780714035034} 11/07/2021 16:41:40 - INFO - __main__ - Step 138054: {'lr': 7.996045680943587e-06, 'samples': 26506368, 'steps': 138053, 'loss/train': 1.8397923707962036} 11/07/2021 16:41:41 - INFO - __main__ - Step 138055: {'lr': 7.994714330760738e-06, 'samples': 26506560, 'steps': 138054, 'loss/train': 1.236493468284607} 11/07/2021 16:41:41 - INFO - __main__ - Step 138056: {'lr': 7.99338308962172e-06, 'samples': 26506752, 'steps': 138055, 'loss/train': 1.2635117769241333} 11/07/2021 16:41:42 - INFO - __main__ - Step 138057: {'lr': 7.992051957527169e-06, 'samples': 26506944, 'steps': 138056, 'loss/train': 1.2212170362472534} 11/07/2021 16:41:42 - INFO - __main__ - Step 138058: {'lr': 7.990720934477668e-06, 'samples': 26507136, 'steps': 138057, 'loss/train': 1.1599507331848145} 11/07/2021 16:41:43 - INFO - __main__ - Step 138059: {'lr': 7.989390020473802e-06, 'samples': 26507328, 'steps': 138058, 'loss/train': 0.862735390663147} 11/07/2021 16:41:43 - INFO - __main__ - Step 138060: {'lr': 7.98805921551618e-06, 'samples': 26507520, 'steps': 138059, 'loss/train': 1.0656416416168213} 11/07/2021 16:41:44 - INFO - __main__ - Step 138061: {'lr': 7.986728519605413e-06, 'samples': 26507712, 'steps': 138060, 'loss/train': 1.1682770252227783} 11/07/2021 16:41:44 - INFO - __main__ - Step 138062: {'lr': 7.985397932742083e-06, 'samples': 26507904, 'steps': 138061, 'loss/train': 1.643847107887268} 11/07/2021 16:41:45 - INFO - __main__ - Step 138063: {'lr': 7.984067454926802e-06, 'samples': 26508096, 'steps': 138062, 'loss/train': 1.1175386905670166} 11/07/2021 16:41:45 - INFO - __main__ - Step 138064: {'lr': 7.982737086160209e-06, 'samples': 26508288, 'steps': 138063, 'loss/train': 0.761568546295166} 11/07/2021 16:41:46 - INFO - __main__ - Step 138065: {'lr': 7.981406826442827e-06, 'samples': 26508480, 'steps': 138064, 'loss/train': 1.3178324699401855} 11/07/2021 16:41:46 - INFO - __main__ - Step 138066: {'lr': 7.980076675775272e-06, 'samples': 26508672, 'steps': 138065, 'loss/train': 1.5641206502914429} 11/07/2021 16:41:46 - INFO - __main__ - Step 138067: {'lr': 7.97874663415818e-06, 'samples': 26508864, 'steps': 138066, 'loss/train': 1.1728953123092651} 11/07/2021 16:41:47 - INFO - __main__ - Step 138068: {'lr': 7.977416701592104e-06, 'samples': 26509056, 'steps': 138067, 'loss/train': 1.1046044826507568} 11/07/2021 16:41:48 - INFO - __main__ - Step 138069: {'lr': 7.976086878077687e-06, 'samples': 26509248, 'steps': 138068, 'loss/train': 1.360101342201233} 11/07/2021 16:41:48 - INFO - __main__ - Step 138070: {'lr': 7.97475716361551e-06, 'samples': 26509440, 'steps': 138069, 'loss/train': 0.8957763910293579} 11/07/2021 16:41:48 - INFO - __main__ - Step 138071: {'lr': 7.97342755820618e-06, 'samples': 26509632, 'steps': 138070, 'loss/train': 1.1391229629516602} 11/07/2021 16:41:49 - INFO - __main__ - Step 138072: {'lr': 7.972098061850258e-06, 'samples': 26509824, 'steps': 138071, 'loss/train': 1.0732102394104004} 11/07/2021 16:41:50 - INFO - __main__ - Step 138073: {'lr': 7.970768674548407e-06, 'samples': 26510016, 'steps': 138072, 'loss/train': 1.0538647174835205} 11/07/2021 16:41:51 - INFO - __main__ - Step 138074: {'lr': 7.969439396301182e-06, 'samples': 26510208, 'steps': 138073, 'loss/train': 1.8615050315856934} 11/07/2021 16:41:51 - INFO - __main__ - Step 138075: {'lr': 7.968110227109221e-06, 'samples': 26510400, 'steps': 138074, 'loss/train': 1.1823896169662476} 11/07/2021 16:41:51 - INFO - __main__ - Step 138076: {'lr': 7.96678116697308e-06, 'samples': 26510592, 'steps': 138075, 'loss/train': 1.4807398319244385} 11/07/2021 16:41:52 - INFO - __main__ - Step 138077: {'lr': 7.965452215893342e-06, 'samples': 26510784, 'steps': 138076, 'loss/train': 1.1756395101547241} 11/07/2021 16:41:52 - INFO - __main__ - Step 138078: {'lr': 7.964123373870646e-06, 'samples': 26510976, 'steps': 138077, 'loss/train': 1.3260743618011475} 11/07/2021 16:41:53 - INFO - __main__ - Step 138079: {'lr': 7.962794640905602e-06, 'samples': 26511168, 'steps': 138078, 'loss/train': 0.09236077219247818} 11/07/2021 16:41:53 - INFO - __main__ - Step 138080: {'lr': 7.961466016998764e-06, 'samples': 26511360, 'steps': 138079, 'loss/train': 1.303261160850525} 11/07/2021 16:41:54 - INFO - __main__ - Step 138081: {'lr': 7.960137502150772e-06, 'samples': 26511552, 'steps': 138080, 'loss/train': 1.530465841293335} 11/07/2021 16:41:54 - INFO - __main__ - Step 138082: {'lr': 7.958809096362207e-06, 'samples': 26511744, 'steps': 138081, 'loss/train': 1.2418618202209473} 11/07/2021 16:41:54 - INFO - __main__ - Step 138083: {'lr': 7.957480799633654e-06, 'samples': 26511936, 'steps': 138082, 'loss/train': 1.1957110166549683} 11/07/2021 16:41:56 - INFO - __main__ - Step 138084: {'lr': 7.956152611965723e-06, 'samples': 26512128, 'steps': 138083, 'loss/train': 0.535368025302887} 11/07/2021 16:41:56 - INFO - __main__ - Step 138085: {'lr': 7.954824533359023e-06, 'samples': 26512320, 'steps': 138084, 'loss/train': 1.1779448986053467} 11/07/2021 16:41:56 - INFO - __main__ - Step 138086: {'lr': 7.953496563814166e-06, 'samples': 26512512, 'steps': 138085, 'loss/train': 0.31427863240242004} 11/07/2021 16:41:57 - INFO - __main__ - Step 138087: {'lr': 7.952168703331708e-06, 'samples': 26512704, 'steps': 138086, 'loss/train': 1.3691320419311523} 11/07/2021 16:41:57 - INFO - __main__ - Step 138088: {'lr': 7.950840951912285e-06, 'samples': 26512896, 'steps': 138087, 'loss/train': 1.4758421182632446} 11/07/2021 16:41:58 - INFO - __main__ - Step 138089: {'lr': 7.949513309556456e-06, 'samples': 26513088, 'steps': 138088, 'loss/train': 0.9081668257713318} 11/07/2021 16:41:58 - INFO - __main__ - Step 138090: {'lr': 7.948185776264854e-06, 'samples': 26513280, 'steps': 138089, 'loss/train': 0.8240480422973633} 11/07/2021 16:41:59 - INFO - __main__ - Step 138091: {'lr': 7.946858352038038e-06, 'samples': 26513472, 'steps': 138090, 'loss/train': 1.2585519552230835} 11/07/2021 16:41:59 - INFO - __main__ - Step 138092: {'lr': 7.945531036876647e-06, 'samples': 26513664, 'steps': 138091, 'loss/train': 1.064591407775879} 11/07/2021 16:42:00 - INFO - __main__ - Step 138093: {'lr': 7.944203830781288e-06, 'samples': 26513856, 'steps': 138092, 'loss/train': 0.7653442025184631} 11/07/2021 16:42:01 - INFO - __main__ - Step 138094: {'lr': 7.942876733752491e-06, 'samples': 26514048, 'steps': 138093, 'loss/train': 1.442185640335083} 11/07/2021 16:42:01 - INFO - __main__ - Step 138095: {'lr': 7.941549745790922e-06, 'samples': 26514240, 'steps': 138094, 'loss/train': 1.4370598793029785} 11/07/2021 16:42:01 - INFO - __main__ - Step 138096: {'lr': 7.940222866897162e-06, 'samples': 26514432, 'steps': 138095, 'loss/train': 1.2628995180130005} 11/07/2021 16:42:02 - INFO - __main__ - Step 138097: {'lr': 7.938896097071824e-06, 'samples': 26514624, 'steps': 138096, 'loss/train': 1.4906814098358154} 11/07/2021 16:42:02 - INFO - __main__ - Step 138098: {'lr': 7.937569436315462e-06, 'samples': 26514816, 'steps': 138097, 'loss/train': 1.3102705478668213} 11/07/2021 16:42:03 - INFO - __main__ - Step 138099: {'lr': 7.936242884628686e-06, 'samples': 26515008, 'steps': 138098, 'loss/train': 1.3804861307144165} 11/07/2021 16:42:03 - INFO - __main__ - Step 138100: {'lr': 7.934916442012109e-06, 'samples': 26515200, 'steps': 138099, 'loss/train': 1.1583178043365479} 11/07/2021 16:42:04 - INFO - __main__ - Step 138101: {'lr': 7.933590108466337e-06, 'samples': 26515392, 'steps': 138100, 'loss/train': 1.079983115196228} 11/07/2021 16:42:04 - INFO - __main__ - Step 138102: {'lr': 7.93226388399193e-06, 'samples': 26515584, 'steps': 138101, 'loss/train': 1.2636311054229736} 11/07/2021 16:42:04 - INFO - __main__ - Step 138103: {'lr': 7.930937768589524e-06, 'samples': 26515776, 'steps': 138102, 'loss/train': 0.9343209266662598} 11/07/2021 16:42:06 - INFO - __main__ - Step 138104: {'lr': 7.929611762259702e-06, 'samples': 26515968, 'steps': 138103, 'loss/train': 1.0772695541381836} 11/07/2021 16:42:06 - INFO - __main__ - Step 138105: {'lr': 7.928285865003048e-06, 'samples': 26516160, 'steps': 138104, 'loss/train': 1.1193485260009766} 11/07/2021 16:42:06 - INFO - __main__ - Step 138106: {'lr': 7.926960076820172e-06, 'samples': 26516352, 'steps': 138105, 'loss/train': 1.5471227169036865} 11/07/2021 16:42:07 - INFO - __main__ - Step 138107: {'lr': 7.925634397711685e-06, 'samples': 26516544, 'steps': 138106, 'loss/train': 1.3167325258255005} 11/07/2021 16:42:07 - INFO - __main__ - Step 138108: {'lr': 7.924308827678166e-06, 'samples': 26516736, 'steps': 138107, 'loss/train': 0.9927034378051758} 11/07/2021 16:42:08 - INFO - __main__ - Step 138109: {'lr': 7.922983366720231e-06, 'samples': 26516928, 'steps': 138108, 'loss/train': 0.8707624077796936} 11/07/2021 16:42:08 - INFO - __main__ - Step 138110: {'lr': 7.92165801483849e-06, 'samples': 26517120, 'steps': 138109, 'loss/train': 1.2331044673919678} 11/07/2021 16:42:09 - INFO - __main__ - Step 138111: {'lr': 7.920332772033467e-06, 'samples': 26517312, 'steps': 138110, 'loss/train': 1.2285666465759277} 11/07/2021 16:42:09 - INFO - __main__ - Step 138112: {'lr': 7.91900763830583e-06, 'samples': 26517504, 'steps': 138111, 'loss/train': 1.2217977046966553} 11/07/2021 16:42:09 - INFO - __main__ - Step 138113: {'lr': 7.917682613656135e-06, 'samples': 26517696, 'steps': 138112, 'loss/train': 1.324545979499817} 11/07/2021 16:42:10 - INFO - __main__ - Step 138114: {'lr': 7.916357698084992e-06, 'samples': 26517888, 'steps': 138113, 'loss/train': 1.2143577337265015} 11/07/2021 16:42:11 - INFO - __main__ - Step 138115: {'lr': 7.91503289159301e-06, 'samples': 26518080, 'steps': 138114, 'loss/train': 1.4823507070541382} 11/07/2021 16:42:11 - INFO - __main__ - Step 138116: {'lr': 7.913708194180803e-06, 'samples': 26518272, 'steps': 138115, 'loss/train': 0.9049156904220581} 11/07/2021 16:42:11 - INFO - __main__ - Step 138117: {'lr': 7.912383605848922e-06, 'samples': 26518464, 'steps': 138116, 'loss/train': 1.0082155466079712} 11/07/2021 16:42:12 - INFO - __main__ - Step 138118: {'lr': 7.91105912659798e-06, 'samples': 26518656, 'steps': 138117, 'loss/train': 1.354322910308838} 11/07/2021 16:42:12 - INFO - __main__ - Step 138119: {'lr': 7.909734756428589e-06, 'samples': 26518848, 'steps': 138118, 'loss/train': 1.6168580055236816} 11/07/2021 16:42:14 - INFO - __main__ - Step 138120: {'lr': 7.908410495341328e-06, 'samples': 26519040, 'steps': 138119, 'loss/train': 1.5113651752471924} 11/07/2021 16:42:14 - INFO - __main__ - Step 138121: {'lr': 7.907086343336812e-06, 'samples': 26519232, 'steps': 138120, 'loss/train': 1.2270690202713013} 11/07/2021 16:42:14 - INFO - __main__ - Step 138122: {'lr': 7.905762300415592e-06, 'samples': 26519424, 'steps': 138121, 'loss/train': 1.4048768281936646} 11/07/2021 16:42:15 - INFO - __main__ - Step 138123: {'lr': 7.904438366578364e-06, 'samples': 26519616, 'steps': 138122, 'loss/train': 1.2175843715667725} 11/07/2021 16:42:15 - INFO - __main__ - Step 138124: {'lr': 7.903114541825628e-06, 'samples': 26519808, 'steps': 138123, 'loss/train': 1.0213805437088013} 11/07/2021 16:42:15 - INFO - __main__ - Step 138125: {'lr': 7.901790826158022e-06, 'samples': 26520000, 'steps': 138124, 'loss/train': 0.4467918872833252} 11/07/2021 16:42:16 - INFO - __main__ - Step 138126: {'lr': 7.900467219576102e-06, 'samples': 26520192, 'steps': 138125, 'loss/train': 0.8324394226074219} 11/07/2021 16:42:17 - INFO - __main__ - Step 138127: {'lr': 7.899143722080532e-06, 'samples': 26520384, 'steps': 138126, 'loss/train': 0.7007185220718384} 11/07/2021 16:42:17 - INFO - __main__ - Step 138128: {'lr': 7.89782033367184e-06, 'samples': 26520576, 'steps': 138127, 'loss/train': 1.0980311632156372} 11/07/2021 16:42:17 - INFO - __main__ - Step 138129: {'lr': 7.896497054350665e-06, 'samples': 26520768, 'steps': 138128, 'loss/train': 1.2651009559631348} 11/07/2021 16:42:18 - INFO - __main__ - Step 138130: {'lr': 7.895173884117591e-06, 'samples': 26520960, 'steps': 138129, 'loss/train': 1.7314157485961914} 11/07/2021 16:42:19 - INFO - __main__ - Step 138131: {'lr': 7.893850822973226e-06, 'samples': 26521152, 'steps': 138130, 'loss/train': 1.0385586023330688} 11/07/2021 16:42:19 - INFO - __main__ - Step 138132: {'lr': 7.892527870918153e-06, 'samples': 26521344, 'steps': 138131, 'loss/train': 1.187246561050415} 11/07/2021 16:42:20 - INFO - __main__ - Step 138133: {'lr': 7.891205027952958e-06, 'samples': 26521536, 'steps': 138132, 'loss/train': 1.6003987789154053} 11/07/2021 16:42:20 - INFO - __main__ - Step 138134: {'lr': 7.889882294078277e-06, 'samples': 26521728, 'steps': 138133, 'loss/train': 1.1514863967895508} 11/07/2021 16:42:20 - INFO - __main__ - Step 138135: {'lr': 7.888559669294664e-06, 'samples': 26521920, 'steps': 138134, 'loss/train': 1.201690435409546} 11/07/2021 16:42:21 - INFO - __main__ - Step 138136: {'lr': 7.887237153602761e-06, 'samples': 26522112, 'steps': 138135, 'loss/train': 0.692778468132019} 11/07/2021 16:42:22 - INFO - __main__ - Step 138137: {'lr': 7.885914747003093e-06, 'samples': 26522304, 'steps': 138136, 'loss/train': 0.7663267850875854} 11/07/2021 16:42:22 - INFO - __main__ - Step 138138: {'lr': 7.884592449496298e-06, 'samples': 26522496, 'steps': 138137, 'loss/train': 1.4235446453094482} 11/07/2021 16:42:22 - INFO - __main__ - Step 138139: {'lr': 7.883270261082987e-06, 'samples': 26522688, 'steps': 138138, 'loss/train': 0.7128851413726807} 11/07/2021 16:42:23 - INFO - __main__ - Step 138140: {'lr': 7.881948181763715e-06, 'samples': 26522880, 'steps': 138139, 'loss/train': 0.33377739787101746} 11/07/2021 16:42:24 - INFO - __main__ - Step 138141: {'lr': 7.880626211539121e-06, 'samples': 26523072, 'steps': 138140, 'loss/train': 1.2850298881530762} 11/07/2021 16:42:24 - INFO - __main__ - Step 138142: {'lr': 7.87930435040976e-06, 'samples': 26523264, 'steps': 138141, 'loss/train': 2.2053518295288086} 11/07/2021 16:42:25 - INFO - __main__ - Step 138143: {'lr': 7.87798259837627e-06, 'samples': 26523456, 'steps': 138142, 'loss/train': 1.5199520587921143} 11/07/2021 16:42:25 - INFO - __main__ - Step 138144: {'lr': 7.876660955439208e-06, 'samples': 26523648, 'steps': 138143, 'loss/train': 1.4134846925735474} 11/07/2021 16:42:25 - INFO - __main__ - Step 138145: {'lr': 7.875339421599181e-06, 'samples': 26523840, 'steps': 138144, 'loss/train': 0.9603142142295837} 11/07/2021 16:42:26 - INFO - __main__ - Step 138146: {'lr': 7.874017996856803e-06, 'samples': 26524032, 'steps': 138145, 'loss/train': 0.8566007018089294} 11/07/2021 16:42:27 - INFO - __main__ - Step 138147: {'lr': 7.872696681212654e-06, 'samples': 26524224, 'steps': 138146, 'loss/train': 1.1082011461257935} 11/07/2021 16:42:27 - INFO - __main__ - Step 138148: {'lr': 7.871375474667347e-06, 'samples': 26524416, 'steps': 138147, 'loss/train': 1.5962231159210205} 11/07/2021 16:42:27 - INFO - __main__ - Step 138149: {'lr': 7.870054377221436e-06, 'samples': 26524608, 'steps': 138148, 'loss/train': 1.2453618049621582} 11/07/2021 16:42:28 - INFO - __main__ - Step 138150: {'lr': 7.868733388875587e-06, 'samples': 26524800, 'steps': 138149, 'loss/train': 2.784761428833008} 11/07/2021 16:42:29 - INFO - __main__ - Step 138151: {'lr': 7.867412509630329e-06, 'samples': 26524992, 'steps': 138150, 'loss/train': 1.1727765798568726} 11/07/2021 16:42:29 - INFO - __main__ - Step 138152: {'lr': 7.86609173948627e-06, 'samples': 26525184, 'steps': 138151, 'loss/train': 1.0293539762496948} 11/07/2021 16:42:29 - INFO - __main__ - Step 138153: {'lr': 7.864771078443994e-06, 'samples': 26525376, 'steps': 138152, 'loss/train': 0.2810194492340088} 11/07/2021 16:42:30 - INFO - __main__ - Step 138154: {'lr': 7.86345052650414e-06, 'samples': 26525568, 'steps': 138153, 'loss/train': 1.3651790618896484} 11/07/2021 16:42:30 - INFO - __main__ - Step 138155: {'lr': 7.862130083667263e-06, 'samples': 26525760, 'steps': 138154, 'loss/train': 0.9502419233322144} 11/07/2021 16:42:31 - INFO - __main__ - Step 138156: {'lr': 7.860809749934e-06, 'samples': 26525952, 'steps': 138155, 'loss/train': 1.6860333681106567} 11/07/2021 16:42:32 - INFO - __main__ - Step 138157: {'lr': 7.85948952530488e-06, 'samples': 26526144, 'steps': 138156, 'loss/train': 1.2392204999923706} 11/07/2021 16:42:32 - INFO - __main__ - Step 138158: {'lr': 7.85816940978057e-06, 'samples': 26526336, 'steps': 138157, 'loss/train': 1.3916953802108765} 11/07/2021 16:42:32 - INFO - __main__ - Step 138159: {'lr': 7.856849403361621e-06, 'samples': 26526528, 'steps': 138158, 'loss/train': 0.9483413100242615} 11/07/2021 16:42:33 - INFO - __main__ - Step 138160: {'lr': 7.855529506048647e-06, 'samples': 26526720, 'steps': 138159, 'loss/train': 1.111126184463501} 11/07/2021 16:42:34 - INFO - __main__ - Step 138161: {'lr': 7.854209717842232e-06, 'samples': 26526912, 'steps': 138160, 'loss/train': 1.3577669858932495} 11/07/2021 16:42:34 - INFO - __main__ - Step 138162: {'lr': 7.852890038742955e-06, 'samples': 26527104, 'steps': 138161, 'loss/train': 1.1612762212753296} 11/07/2021 16:42:35 - INFO - __main__ - Step 138163: {'lr': 7.851570468751485e-06, 'samples': 26527296, 'steps': 138162, 'loss/train': 1.2970972061157227} 11/07/2021 16:42:35 - INFO - __main__ - Step 138164: {'lr': 7.85025100786832e-06, 'samples': 26527488, 'steps': 138163, 'loss/train': 1.1873611211776733} 11/07/2021 16:42:35 - INFO - __main__ - Step 138165: {'lr': 7.848931656094072e-06, 'samples': 26527680, 'steps': 138164, 'loss/train': 1.6699621677398682} 11/07/2021 16:42:36 - INFO - __main__ - Step 138166: {'lr': 7.847612413429406e-06, 'samples': 26527872, 'steps': 138165, 'loss/train': 0.06487719714641571} 11/07/2021 16:42:37 - INFO - __main__ - Step 138167: {'lr': 7.846293279874823e-06, 'samples': 26528064, 'steps': 138166, 'loss/train': 0.9444244503974915} 11/07/2021 16:42:37 - INFO - __main__ - Step 138168: {'lr': 7.844974255430987e-06, 'samples': 26528256, 'steps': 138167, 'loss/train': 1.5493189096450806} 11/07/2021 16:42:37 - INFO - __main__ - Step 138169: {'lr': 7.843655340098483e-06, 'samples': 26528448, 'steps': 138168, 'loss/train': 0.45807352662086487} 11/07/2021 16:42:38 - INFO - __main__ - Step 138170: {'lr': 7.842336533877865e-06, 'samples': 26528640, 'steps': 138169, 'loss/train': 1.3958300352096558} 11/07/2021 16:42:38 - INFO - __main__ - Step 138171: {'lr': 7.84101783676977e-06, 'samples': 26528832, 'steps': 138170, 'loss/train': 0.8491644263267517} 11/07/2021 16:42:39 - INFO - __main__ - Step 138172: {'lr': 7.839699248774757e-06, 'samples': 26529024, 'steps': 138171, 'loss/train': 1.5109827518463135} 11/07/2021 16:42:40 - INFO - __main__ - Step 138173: {'lr': 7.83838076989346e-06, 'samples': 26529216, 'steps': 138172, 'loss/train': 1.4536018371582031} 11/07/2021 16:42:40 - INFO - __main__ - Step 138174: {'lr': 7.837062400126437e-06, 'samples': 26529408, 'steps': 138173, 'loss/train': 1.4995976686477661} 11/07/2021 16:42:40 - INFO - __main__ - Step 138175: {'lr': 7.835744139474299e-06, 'samples': 26529600, 'steps': 138174, 'loss/train': 1.325795292854309} 11/07/2021 16:42:41 - INFO - __main__ - Step 138176: {'lr': 7.834425987937655e-06, 'samples': 26529792, 'steps': 138175, 'loss/train': 0.8412637114524841} 11/07/2021 16:42:42 - INFO - __main__ - Step 138177: {'lr': 7.833107945517087e-06, 'samples': 26529984, 'steps': 138176, 'loss/train': 1.1806049346923828} 11/07/2021 16:42:42 - INFO - __main__ - Step 138178: {'lr': 7.83179001221318e-06, 'samples': 26530176, 'steps': 138177, 'loss/train': 1.9921138286590576} 11/07/2021 16:42:42 - INFO - __main__ - Step 138179: {'lr': 7.830472188026516e-06, 'samples': 26530368, 'steps': 138178, 'loss/train': 1.479194164276123} 11/07/2021 16:42:43 - INFO - __main__ - Step 138180: {'lr': 7.829154472957706e-06, 'samples': 26530560, 'steps': 138179, 'loss/train': 1.4218558073043823} 11/07/2021 16:42:43 - INFO - __main__ - Step 138181: {'lr': 7.827836867007333e-06, 'samples': 26530752, 'steps': 138180, 'loss/train': 0.9737928509712219} 11/07/2021 16:42:44 - INFO - __main__ - Step 138182: {'lr': 7.826519370176006e-06, 'samples': 26530944, 'steps': 138181, 'loss/train': 1.0562562942504883} 11/07/2021 16:42:45 - INFO - __main__ - Step 138183: {'lr': 7.825201982464309e-06, 'samples': 26531136, 'steps': 138182, 'loss/train': 1.0211427211761475} 11/07/2021 16:42:45 - INFO - __main__ - Step 138184: {'lr': 7.823884703872852e-06, 'samples': 26531328, 'steps': 138183, 'loss/train': 1.7258830070495605} 11/07/2021 16:42:45 - INFO - __main__ - Step 138185: {'lr': 7.822567534402219e-06, 'samples': 26531520, 'steps': 138184, 'loss/train': 1.5196621417999268} 11/07/2021 16:42:46 - INFO - __main__ - Step 138186: {'lr': 7.821250474052965e-06, 'samples': 26531712, 'steps': 138185, 'loss/train': 1.4447280168533325} 11/07/2021 16:42:47 - INFO - __main__ - Step 138187: {'lr': 7.819933522825757e-06, 'samples': 26531904, 'steps': 138186, 'loss/train': 1.0818499326705933} 11/07/2021 16:42:47 - INFO - __main__ - Step 138188: {'lr': 7.818616680721147e-06, 'samples': 26532096, 'steps': 138187, 'loss/train': 1.2183688879013062} 11/07/2021 16:42:47 - INFO - __main__ - Step 138189: {'lr': 7.817299947739692e-06, 'samples': 26532288, 'steps': 138188, 'loss/train': 1.2758718729019165} 11/07/2021 16:42:48 - INFO - __main__ - Step 138190: {'lr': 7.815983323882087e-06, 'samples': 26532480, 'steps': 138189, 'loss/train': 0.8991718888282776} 11/07/2021 16:42:48 - INFO - __main__ - Step 138191: {'lr': 7.81466680914883e-06, 'samples': 26532672, 'steps': 138190, 'loss/train': 0.7494106292724609} 11/07/2021 16:42:49 - INFO - __main__ - Step 138192: {'lr': 7.813350403540559e-06, 'samples': 26532864, 'steps': 138191, 'loss/train': 1.2346395254135132} 11/07/2021 16:42:49 - INFO - __main__ - Step 138193: {'lr': 7.812034107057831e-06, 'samples': 26533056, 'steps': 138192, 'loss/train': 2.4074459075927734} 11/07/2021 16:42:50 - INFO - __main__ - Step 138194: {'lr': 7.810717919701282e-06, 'samples': 26533248, 'steps': 138193, 'loss/train': 1.419466257095337} 11/07/2021 16:42:50 - INFO - __main__ - Step 138195: {'lr': 7.809401841471469e-06, 'samples': 26533440, 'steps': 138194, 'loss/train': 1.607017159461975} 11/07/2021 16:42:50 - INFO - __main__ - Step 138196: {'lr': 7.808085872369003e-06, 'samples': 26533632, 'steps': 138195, 'loss/train': 1.544091820716858} 11/07/2021 16:42:51 - INFO - __main__ - Step 138197: {'lr': 7.806770012394493e-06, 'samples': 26533824, 'steps': 138196, 'loss/train': 1.4075285196304321} 11/07/2021 16:42:52 - INFO - __main__ - Step 138198: {'lr': 7.805454261548495e-06, 'samples': 26534016, 'steps': 138197, 'loss/train': 1.7127294540405273} 11/07/2021 16:42:52 - INFO - __main__ - Step 138199: {'lr': 7.804138619831647e-06, 'samples': 26534208, 'steps': 138198, 'loss/train': 1.31856107711792} 11/07/2021 16:42:53 - INFO - __main__ - Step 138200: {'lr': 7.802823087244504e-06, 'samples': 26534400, 'steps': 138199, 'loss/train': 1.1755913496017456} 11/07/2021 16:42:53 - INFO - __main__ - Step 138201: {'lr': 7.801507663787676e-06, 'samples': 26534592, 'steps': 138200, 'loss/train': 1.301986813545227} 11/07/2021 16:42:53 - INFO - __main__ - Step 138202: {'lr': 7.800192349461749e-06, 'samples': 26534784, 'steps': 138201, 'loss/train': 1.1509500741958618} 11/07/2021 16:42:54 - INFO - __main__ - Step 138203: {'lr': 7.798877144267302e-06, 'samples': 26534976, 'steps': 138202, 'loss/train': 1.4003878831863403} 11/07/2021 16:42:55 - INFO - __main__ - Step 138204: {'lr': 7.797562048204976e-06, 'samples': 26535168, 'steps': 138203, 'loss/train': 1.10122549533844} 11/07/2021 16:42:55 - INFO - __main__ - Step 138205: {'lr': 7.796247061275324e-06, 'samples': 26535360, 'steps': 138204, 'loss/train': 1.2249144315719604} 11/07/2021 16:42:55 - INFO - __main__ - Step 138206: {'lr': 7.79493218347893e-06, 'samples': 26535552, 'steps': 138205, 'loss/train': 0.6997278928756714} 11/07/2021 16:42:56 - INFO - __main__ - Step 138207: {'lr': 7.793617414816406e-06, 'samples': 26535744, 'steps': 138206, 'loss/train': 1.4279735088348389} 11/07/2021 16:42:57 - INFO - __main__ - Step 138208: {'lr': 7.792302755288332e-06, 'samples': 26535936, 'steps': 138207, 'loss/train': 1.347392201423645} 11/07/2021 16:42:57 - INFO - __main__ - Step 138209: {'lr': 7.790988204895321e-06, 'samples': 26536128, 'steps': 138208, 'loss/train': 0.9488916993141174} 11/07/2021 16:42:58 - INFO - __main__ - Step 138210: {'lr': 7.789673763637956e-06, 'samples': 26536320, 'steps': 138209, 'loss/train': 1.6051069498062134} 11/07/2021 16:42:58 - INFO - __main__ - Step 138211: {'lr': 7.788359431516818e-06, 'samples': 26536512, 'steps': 138210, 'loss/train': 1.1928200721740723} 11/07/2021 16:42:58 - INFO - __main__ - Step 138212: {'lr': 7.787045208532517e-06, 'samples': 26536704, 'steps': 138211, 'loss/train': 1.201285481452942} 11/07/2021 16:42:59 - INFO - __main__ - Step 138213: {'lr': 7.78573109468561e-06, 'samples': 26536896, 'steps': 138212, 'loss/train': 1.1158418655395508} 11/07/2021 16:43:00 - INFO - __main__ - Step 138214: {'lr': 7.784417089976737e-06, 'samples': 26537088, 'steps': 138213, 'loss/train': 1.2281192541122437} 11/07/2021 16:43:00 - INFO - __main__ - Step 138215: {'lr': 7.783103194406477e-06, 'samples': 26537280, 'steps': 138214, 'loss/train': 1.2186540365219116} 11/07/2021 16:43:00 - INFO - __main__ - Step 138216: {'lr': 7.781789407975386e-06, 'samples': 26537472, 'steps': 138215, 'loss/train': 1.0483791828155518} 11/07/2021 16:43:01 - INFO - __main__ - Step 138217: {'lr': 7.780475730684133e-06, 'samples': 26537664, 'steps': 138216, 'loss/train': 2.1290249824523926} 11/07/2021 16:43:02 - INFO - __main__ - Step 138218: {'lr': 7.779162162533215e-06, 'samples': 26537856, 'steps': 138217, 'loss/train': 1.1549173593521118} 11/07/2021 16:43:03 - INFO - __main__ - Step 138219: {'lr': 7.777848703523272e-06, 'samples': 26538048, 'steps': 138218, 'loss/train': 2.249504804611206} 11/07/2021 16:43:03 - INFO - __main__ - Step 138220: {'lr': 7.776535353654912e-06, 'samples': 26538240, 'steps': 138219, 'loss/train': 0.7340319156646729} 11/07/2021 16:43:03 - INFO - __main__ - Step 138221: {'lr': 7.775222112928692e-06, 'samples': 26538432, 'steps': 138220, 'loss/train': 1.2643342018127441} 11/07/2021 16:43:04 - INFO - __main__ - Step 138222: {'lr': 7.773908981345224e-06, 'samples': 26538624, 'steps': 138221, 'loss/train': 1.0489485263824463} 11/07/2021 16:43:04 - INFO - __main__ - Step 138223: {'lr': 7.772595958905088e-06, 'samples': 26538816, 'steps': 138222, 'loss/train': 1.2978005409240723} 11/07/2021 16:43:05 - INFO - __main__ - Step 138224: {'lr': 7.771283045608895e-06, 'samples': 26539008, 'steps': 138223, 'loss/train': 0.6688159108161926} 11/07/2021 16:43:05 - INFO - __main__ - Step 138225: {'lr': 7.769970241457202e-06, 'samples': 26539200, 'steps': 138224, 'loss/train': 1.8671491146087646} 11/07/2021 16:43:06 - INFO - __main__ - Step 138226: {'lr': 7.768657546450648e-06, 'samples': 26539392, 'steps': 138225, 'loss/train': 1.1169673204421997} 11/07/2021 16:43:06 - INFO - __main__ - Step 138227: {'lr': 7.767344960589784e-06, 'samples': 26539584, 'steps': 138226, 'loss/train': 1.0894700288772583} 11/07/2021 16:43:06 - INFO - __main__ - Step 138228: {'lr': 7.766032483875224e-06, 'samples': 26539776, 'steps': 138227, 'loss/train': 1.477367877960205} 11/07/2021 16:43:08 - INFO - __main__ - Step 138229: {'lr': 7.76472011630755e-06, 'samples': 26539968, 'steps': 138228, 'loss/train': 1.4406943321228027} 11/07/2021 16:43:08 - INFO - __main__ - Step 138230: {'lr': 7.763407857887344e-06, 'samples': 26540160, 'steps': 138229, 'loss/train': 1.1284592151641846} 11/07/2021 16:43:08 - INFO - __main__ - Step 138231: {'lr': 7.762095708615247e-06, 'samples': 26540352, 'steps': 138230, 'loss/train': 1.456426978111267} 11/07/2021 16:43:09 - INFO - __main__ - Step 138232: {'lr': 7.760783668491784e-06, 'samples': 26540544, 'steps': 138231, 'loss/train': 0.7474150657653809} 11/07/2021 16:43:09 - INFO - __main__ - Step 138233: {'lr': 7.759471737517565e-06, 'samples': 26540736, 'steps': 138232, 'loss/train': 1.3473014831542969} 11/07/2021 16:43:10 - INFO - __main__ - Step 138234: {'lr': 7.758159915693203e-06, 'samples': 26540928, 'steps': 138233, 'loss/train': 1.436363697052002} 11/07/2021 16:43:10 - INFO - __main__ - Step 138235: {'lr': 7.756848203019279e-06, 'samples': 26541120, 'steps': 138234, 'loss/train': 1.039911150932312} 11/07/2021 16:43:11 - INFO - __main__ - Step 138236: {'lr': 7.75553659949635e-06, 'samples': 26541312, 'steps': 138235, 'loss/train': 0.9546405076980591} 11/07/2021 16:43:11 - INFO - __main__ - Step 138237: {'lr': 7.754225105125079e-06, 'samples': 26541504, 'steps': 138236, 'loss/train': 1.288235068321228} 11/07/2021 16:43:11 - INFO - __main__ - Step 138238: {'lr': 7.752913719905996e-06, 'samples': 26541696, 'steps': 138237, 'loss/train': 1.2333272695541382} 11/07/2021 16:43:13 - INFO - __main__ - Step 138239: {'lr': 7.751602443839712e-06, 'samples': 26541888, 'steps': 138238, 'loss/train': 0.9743587374687195} 11/07/2021 16:43:13 - INFO - __main__ - Step 138240: {'lr': 7.750291276926807e-06, 'samples': 26542080, 'steps': 138239, 'loss/train': 1.2501410245895386} 11/07/2021 16:43:14 - INFO - __main__ - Step 138241: {'lr': 7.748980219167895e-06, 'samples': 26542272, 'steps': 138240, 'loss/train': 1.2458598613739014} 11/07/2021 16:43:14 - INFO - __main__ - Step 138242: {'lr': 7.747669270563557e-06, 'samples': 26542464, 'steps': 138241, 'loss/train': 0.09632467478513718} 11/07/2021 16:43:14 - INFO - __main__ - Step 138243: {'lr': 7.746358431114375e-06, 'samples': 26542656, 'steps': 138242, 'loss/train': 0.9516955018043518} 11/07/2021 16:43:15 - INFO - __main__ - Step 138244: {'lr': 7.74504770082099e-06, 'samples': 26542848, 'steps': 138243, 'loss/train': 0.06414791941642761} 11/07/2021 16:43:16 - INFO - __main__ - Step 138245: {'lr': 7.743737079683899e-06, 'samples': 26543040, 'steps': 138244, 'loss/train': 1.3246742486953735} 11/07/2021 16:43:16 - INFO - __main__ - Step 138246: {'lr': 7.742426567703741e-06, 'samples': 26543232, 'steps': 138245, 'loss/train': 1.0821994543075562} 11/07/2021 16:43:16 - INFO - __main__ - Step 138247: {'lr': 7.74111616488113e-06, 'samples': 26543424, 'steps': 138246, 'loss/train': 1.280509352684021} 11/07/2021 16:43:17 - INFO - __main__ - Step 138248: {'lr': 7.739805871216616e-06, 'samples': 26543616, 'steps': 138247, 'loss/train': 1.4209973812103271} 11/07/2021 16:43:18 - INFO - __main__ - Step 138249: {'lr': 7.738495686710812e-06, 'samples': 26543808, 'steps': 138248, 'loss/train': 1.0148375034332275} 11/07/2021 16:43:18 - INFO - __main__ - Step 138250: {'lr': 7.7371856113643e-06, 'samples': 26544000, 'steps': 138249, 'loss/train': 1.2832363843917847} 11/07/2021 16:43:19 - INFO - __main__ - Step 138251: {'lr': 7.735875645177693e-06, 'samples': 26544192, 'steps': 138250, 'loss/train': 1.1793500185012817} 11/07/2021 16:43:19 - INFO - __main__ - Step 138252: {'lr': 7.734565788151543e-06, 'samples': 26544384, 'steps': 138251, 'loss/train': 0.9684511423110962} 11/07/2021 16:43:19 - INFO - __main__ - Step 138253: {'lr': 7.733256040286463e-06, 'samples': 26544576, 'steps': 138252, 'loss/train': 1.4745690822601318} 11/07/2021 16:43:20 - INFO - __main__ - Step 138254: {'lr': 7.731946401583034e-06, 'samples': 26544768, 'steps': 138253, 'loss/train': 1.169036865234375} 11/07/2021 16:43:21 - INFO - __main__ - Step 138255: {'lr': 7.730636872041841e-06, 'samples': 26544960, 'steps': 138254, 'loss/train': 1.4793082475662231} 11/07/2021 16:43:21 - INFO - __main__ - Step 138256: {'lr': 7.72932745166352e-06, 'samples': 26545152, 'steps': 138255, 'loss/train': 0.8193822503089905} 11/07/2021 16:43:21 - INFO - __main__ - Step 138257: {'lr': 7.728018140448629e-06, 'samples': 26545344, 'steps': 138256, 'loss/train': 1.1924747228622437} 11/07/2021 16:43:22 - INFO - __main__ - Step 138258: {'lr': 7.72670893839772e-06, 'samples': 26545536, 'steps': 138257, 'loss/train': 1.3209673166275024} 11/07/2021 16:43:22 - INFO - __main__ - Step 138259: {'lr': 7.725399845511433e-06, 'samples': 26545728, 'steps': 138258, 'loss/train': 1.4776721000671387} 11/07/2021 16:43:23 - INFO - __main__ - Step 138260: {'lr': 7.72409086179035e-06, 'samples': 26545920, 'steps': 138259, 'loss/train': 1.4730455875396729} 11/07/2021 16:43:24 - INFO - __main__ - Step 138261: {'lr': 7.722781987235028e-06, 'samples': 26546112, 'steps': 138260, 'loss/train': 1.2350317239761353} 11/07/2021 16:43:24 - INFO - __main__ - Step 138262: {'lr': 7.721473221846104e-06, 'samples': 26546304, 'steps': 138261, 'loss/train': 1.1466748714447021} 11/07/2021 16:43:24 - INFO - __main__ - Step 138263: {'lr': 7.720164565624132e-06, 'samples': 26546496, 'steps': 138262, 'loss/train': 1.5784010887145996} 11/07/2021 16:43:25 - INFO - __main__ - Step 138264: {'lr': 7.718856018569725e-06, 'samples': 26546688, 'steps': 138263, 'loss/train': 1.1500014066696167} 11/07/2021 16:43:25 - INFO - __main__ - Step 138265: {'lr': 7.717547580683437e-06, 'samples': 26546880, 'steps': 138264, 'loss/train': 0.5941764116287231} 11/07/2021 16:43:26 - INFO - __main__ - Step 138266: {'lr': 7.716239251965906e-06, 'samples': 26547072, 'steps': 138265, 'loss/train': 1.2339059114456177} 11/07/2021 16:43:27 - INFO - __main__ - Step 138267: {'lr': 7.714931032417716e-06, 'samples': 26547264, 'steps': 138266, 'loss/train': 1.2202736139297485} 11/07/2021 16:43:27 - INFO - __main__ - Step 138268: {'lr': 7.713622922039392e-06, 'samples': 26547456, 'steps': 138267, 'loss/train': 1.2921977043151855} 11/07/2021 16:43:27 - INFO - __main__ - Step 138269: {'lr': 7.712314920831603e-06, 'samples': 26547648, 'steps': 138268, 'loss/train': 1.3485815525054932} 11/07/2021 16:43:28 - INFO - __main__ - Step 138270: {'lr': 7.711007028794902e-06, 'samples': 26547840, 'steps': 138269, 'loss/train': 1.5038548707962036} 11/07/2021 16:43:28 - INFO - __main__ - Step 138271: {'lr': 7.70969924592993e-06, 'samples': 26548032, 'steps': 138270, 'loss/train': 1.2150311470031738} 11/07/2021 16:43:29 - INFO - __main__ - Step 138272: {'lr': 7.708391572237183e-06, 'samples': 26548224, 'steps': 138271, 'loss/train': 1.2335052490234375} 11/07/2021 16:43:29 - INFO - __main__ - Step 138273: {'lr': 7.707084007717274e-06, 'samples': 26548416, 'steps': 138272, 'loss/train': 1.7258669137954712} 11/07/2021 16:43:30 - INFO - __main__ - Step 138274: {'lr': 7.705776552370842e-06, 'samples': 26548608, 'steps': 138273, 'loss/train': 1.2365926504135132} 11/07/2021 16:43:30 - INFO - __main__ - Step 138275: {'lr': 7.704469206198439e-06, 'samples': 26548800, 'steps': 138274, 'loss/train': 1.2117632627487183} 11/07/2021 16:43:30 - INFO - __main__ - Step 138276: {'lr': 7.703161969200678e-06, 'samples': 26548992, 'steps': 138275, 'loss/train': 1.004430890083313} 11/07/2021 16:43:31 - INFO - __main__ - Step 138277: {'lr': 7.701854841378114e-06, 'samples': 26549184, 'steps': 138276, 'loss/train': 1.1643110513687134} 11/07/2021 16:43:32 - INFO - __main__ - Step 138278: {'lr': 7.700547822731357e-06, 'samples': 26549376, 'steps': 138277, 'loss/train': 0.9055137038230896} 11/07/2021 16:43:32 - INFO - __main__ - Step 138279: {'lr': 7.699240913260992e-06, 'samples': 26549568, 'steps': 138278, 'loss/train': 0.9070257544517517} 11/07/2021 16:43:32 - INFO - __main__ - Step 138280: {'lr': 7.697934112967625e-06, 'samples': 26549760, 'steps': 138279, 'loss/train': 1.1652758121490479} 11/07/2021 16:43:33 - INFO - __main__ - Step 138281: {'lr': 7.696627421851816e-06, 'samples': 26549952, 'steps': 138280, 'loss/train': 1.2049559354782104} 11/07/2021 16:43:34 - INFO - __main__ - Step 138282: {'lr': 7.695320839914171e-06, 'samples': 26550144, 'steps': 138281, 'loss/train': 1.6175670623779297} 11/07/2021 16:43:34 - INFO - __main__ - Step 138283: {'lr': 7.694014367155277e-06, 'samples': 26550336, 'steps': 138282, 'loss/train': 1.2729339599609375} 11/07/2021 16:43:35 - INFO - __main__ - Step 138284: {'lr': 7.692708003575743e-06, 'samples': 26550528, 'steps': 138283, 'loss/train': 1.251272201538086} 11/07/2021 16:43:35 - INFO - __main__ - Step 138285: {'lr': 7.691401749176125e-06, 'samples': 26550720, 'steps': 138284, 'loss/train': 2.7029104232788086} 11/07/2021 16:43:35 - INFO - __main__ - Step 138286: {'lr': 7.690095603957003e-06, 'samples': 26550912, 'steps': 138285, 'loss/train': 1.1883569955825806} 11/07/2021 16:43:36 - INFO - __main__ - Step 138287: {'lr': 7.688789567918991e-06, 'samples': 26551104, 'steps': 138286, 'loss/train': 1.2101130485534668} 11/07/2021 16:43:37 - INFO - __main__ - Step 138288: {'lr': 7.687483641062697e-06, 'samples': 26551296, 'steps': 138287, 'loss/train': 0.6306008696556091} 11/07/2021 16:43:37 - INFO - __main__ - Step 138289: {'lr': 7.68617782338865e-06, 'samples': 26551488, 'steps': 138288, 'loss/train': 1.1923977136611938} 11/07/2021 16:43:37 - INFO - __main__ - Step 138290: {'lr': 7.68487211489749e-06, 'samples': 26551680, 'steps': 138289, 'loss/train': 0.9556211233139038} 11/07/2021 16:43:38 - INFO - __main__ - Step 138291: {'lr': 7.683566515589769e-06, 'samples': 26551872, 'steps': 138290, 'loss/train': 1.1892300844192505} 11/07/2021 16:43:39 - INFO - __main__ - Step 138292: {'lr': 7.682261025466124e-06, 'samples': 26552064, 'steps': 138291, 'loss/train': 1.572617769241333} 11/07/2021 16:43:39 - INFO - __main__ - Step 138293: {'lr': 7.680955644527089e-06, 'samples': 26552256, 'steps': 138292, 'loss/train': 1.2763310670852661} 11/07/2021 16:43:40 - INFO - __main__ - Step 138294: {'lr': 7.679650372773268e-06, 'samples': 26552448, 'steps': 138293, 'loss/train': 1.5747075080871582} 11/07/2021 16:43:40 - INFO - __main__ - Step 138295: {'lr': 7.678345210205273e-06, 'samples': 26552640, 'steps': 138294, 'loss/train': 1.5074584484100342} 11/07/2021 16:43:40 - INFO - __main__ - Step 138296: {'lr': 7.677040156823689e-06, 'samples': 26552832, 'steps': 138295, 'loss/train': 1.3759846687316895} 11/07/2021 16:43:41 - INFO - __main__ - Step 138297: {'lr': 7.67573521262907e-06, 'samples': 26553024, 'steps': 138296, 'loss/train': 1.1952885389328003} 11/07/2021 16:43:42 - INFO - __main__ - Step 138298: {'lr': 7.674430377622054e-06, 'samples': 26553216, 'steps': 138297, 'loss/train': 1.0404694080352783} 11/07/2021 16:43:42 - INFO - __main__ - Step 138299: {'lr': 7.67312565180317e-06, 'samples': 26553408, 'steps': 138298, 'loss/train': 1.5830787420272827} 11/07/2021 16:43:42 - INFO - __main__ - Step 138300: {'lr': 7.671821035173054e-06, 'samples': 26553600, 'steps': 138299, 'loss/train': 1.0639898777008057} 11/07/2021 16:43:43 - INFO - __main__ - Step 138301: {'lr': 7.670516527732263e-06, 'samples': 26553792, 'steps': 138300, 'loss/train': 0.05111364647746086} 11/07/2021 16:43:44 - INFO - __main__ - Step 138302: {'lr': 7.669212129481407e-06, 'samples': 26553984, 'steps': 138301, 'loss/train': 1.1826231479644775} 11/07/2021 16:43:44 - INFO - __main__ - Step 138303: {'lr': 7.667907840421068e-06, 'samples': 26554176, 'steps': 138302, 'loss/train': 1.5541144609451294} 11/07/2021 16:43:44 - INFO - __main__ - Step 138304: {'lr': 7.666603660551802e-06, 'samples': 26554368, 'steps': 138303, 'loss/train': 0.8862033486366272} 11/07/2021 16:43:45 - INFO - __main__ - Step 138305: {'lr': 7.665299589874247e-06, 'samples': 26554560, 'steps': 138304, 'loss/train': 1.3352723121643066} 11/07/2021 16:43:45 - INFO - __main__ - Step 138306: {'lr': 7.663995628388986e-06, 'samples': 26554752, 'steps': 138305, 'loss/train': 1.1017532348632812} 11/07/2021 16:43:46 - INFO - __main__ - Step 138307: {'lr': 7.662691776096548e-06, 'samples': 26554944, 'steps': 138306, 'loss/train': 1.3533357381820679} 11/07/2021 16:43:47 - INFO - __main__ - Step 138308: {'lr': 7.661388032997596e-06, 'samples': 26555136, 'steps': 138307, 'loss/train': 1.340671420097351} 11/07/2021 16:43:47 - INFO - __main__ - Step 138309: {'lr': 7.660084399092659e-06, 'samples': 26555328, 'steps': 138308, 'loss/train': 1.5136851072311401} 11/07/2021 16:43:47 - INFO - __main__ - Step 138310: {'lr': 7.658780874382376e-06, 'samples': 26555520, 'steps': 138309, 'loss/train': 1.3637281656265259} 11/07/2021 16:43:48 - INFO - __main__ - Step 138311: {'lr': 7.657477458867302e-06, 'samples': 26555712, 'steps': 138310, 'loss/train': 1.4221607446670532} 11/07/2021 16:43:48 - INFO - __main__ - Step 138312: {'lr': 7.656174152548018e-06, 'samples': 26555904, 'steps': 138311, 'loss/train': 1.0340735912322998} 11/07/2021 16:43:49 - INFO - __main__ - Step 138313: {'lr': 7.654870955425136e-06, 'samples': 26556096, 'steps': 138312, 'loss/train': 0.4154829978942871} 11/07/2021 16:43:49 - INFO - __main__ - Step 138314: {'lr': 7.653567867499212e-06, 'samples': 26556288, 'steps': 138313, 'loss/train': 1.1983811855316162} 11/07/2021 16:43:50 - INFO - __main__ - Step 138315: {'lr': 7.652264888770854e-06, 'samples': 26556480, 'steps': 138314, 'loss/train': 0.709363579750061} 11/07/2021 16:43:50 - INFO - __main__ - Step 138316: {'lr': 7.650962019240648e-06, 'samples': 26556672, 'steps': 138315, 'loss/train': 1.0301153659820557} 11/07/2021 16:43:50 - INFO - __main__ - Step 138317: {'lr': 7.649659258909175e-06, 'samples': 26556864, 'steps': 138316, 'loss/train': 1.4375157356262207} 11/07/2021 16:43:52 - INFO - __main__ - Step 138318: {'lr': 7.648356607777018e-06, 'samples': 26557056, 'steps': 138317, 'loss/train': 0.11972752213478088} 11/07/2021 16:43:52 - INFO - __main__ - Step 138319: {'lr': 7.647054065844788e-06, 'samples': 26557248, 'steps': 138318, 'loss/train': 1.3817455768585205} 11/07/2021 16:43:52 - INFO - __main__ - Step 138320: {'lr': 7.64575163311304e-06, 'samples': 26557440, 'steps': 138319, 'loss/train': 0.9974932074546814} 11/07/2021 16:43:53 - INFO - __main__ - Step 138321: {'lr': 7.644449309582385e-06, 'samples': 26557632, 'steps': 138320, 'loss/train': 1.3286082744598389} 11/07/2021 16:43:53 - INFO - __main__ - Step 138322: {'lr': 7.643147095253434e-06, 'samples': 26557824, 'steps': 138321, 'loss/train': 1.3346140384674072} 11/07/2021 16:43:54 - INFO - __main__ - Step 138323: {'lr': 7.641844990126711e-06, 'samples': 26558016, 'steps': 138322, 'loss/train': 1.1514370441436768} 11/07/2021 16:43:54 - INFO - __main__ - Step 138324: {'lr': 7.640542994202832e-06, 'samples': 26558208, 'steps': 138323, 'loss/train': 1.3262090682983398} 11/07/2021 16:43:55 - INFO - __main__ - Step 138325: {'lr': 7.639241107482376e-06, 'samples': 26558400, 'steps': 138324, 'loss/train': 1.1983332633972168} 11/07/2021 16:43:55 - INFO - __main__ - Step 138326: {'lr': 7.637939329965954e-06, 'samples': 26558592, 'steps': 138325, 'loss/train': 1.3752700090408325} 11/07/2021 16:43:55 - INFO - __main__ - Step 138327: {'lr': 7.636637661654122e-06, 'samples': 26558784, 'steps': 138326, 'loss/train': 1.1914218664169312} 11/07/2021 16:43:56 - INFO - __main__ - Step 138328: {'lr': 7.63533610254749e-06, 'samples': 26558976, 'steps': 138327, 'loss/train': 1.271105408668518} 11/07/2021 16:43:57 - INFO - __main__ - Step 138329: {'lr': 7.634034652646643e-06, 'samples': 26559168, 'steps': 138328, 'loss/train': 1.2568503618240356} 11/07/2021 16:43:57 - INFO - __main__ - Step 138330: {'lr': 7.632733311952134e-06, 'samples': 26559360, 'steps': 138329, 'loss/train': 1.4444268941879272} 11/07/2021 16:43:57 - INFO - __main__ - Step 138331: {'lr': 7.631432080464602e-06, 'samples': 26559552, 'steps': 138330, 'loss/train': 1.0748618841171265} 11/07/2021 16:43:58 - INFO - __main__ - Step 138332: {'lr': 7.6301309581846e-06, 'samples': 26559744, 'steps': 138331, 'loss/train': 1.0860759019851685} 11/07/2021 16:43:59 - INFO - __main__ - Step 138333: {'lr': 7.628829945112742e-06, 'samples': 26559936, 'steps': 138332, 'loss/train': 1.0568265914916992} 11/07/2021 16:43:59 - INFO - __main__ - Step 138334: {'lr': 7.627529041249554e-06, 'samples': 26560128, 'steps': 138333, 'loss/train': 0.8979945778846741} 11/07/2021 16:44:00 - INFO - __main__ - Step 138335: {'lr': 7.6262282465956735e-06, 'samples': 26560320, 'steps': 138334, 'loss/train': 1.5949889421463013} 11/07/2021 16:44:00 - INFO - __main__ - Step 138336: {'lr': 7.624927561151684e-06, 'samples': 26560512, 'steps': 138335, 'loss/train': 0.9102537035942078} 11/07/2021 16:44:00 - INFO - __main__ - Step 138337: {'lr': 7.623626984918142e-06, 'samples': 26560704, 'steps': 138336, 'loss/train': 1.4303922653198242} 11/07/2021 16:44:01 - INFO - __main__ - Step 138338: {'lr': 7.622326517895683e-06, 'samples': 26560896, 'steps': 138337, 'loss/train': 1.3369687795639038} 11/07/2021 16:44:02 - INFO - __main__ - Step 138339: {'lr': 7.6210261600848375e-06, 'samples': 26561088, 'steps': 138338, 'loss/train': 1.4355993270874023} 11/07/2021 16:44:02 - INFO - __main__ - Step 138340: {'lr': 7.6197259114862415e-06, 'samples': 26561280, 'steps': 138339, 'loss/train': 1.3270684480667114} 11/07/2021 16:44:02 - INFO - __main__ - Step 138341: {'lr': 7.618425772100424e-06, 'samples': 26561472, 'steps': 138340, 'loss/train': 1.475553274154663} 11/07/2021 16:44:03 - INFO - __main__ - Step 138342: {'lr': 7.6171257419280216e-06, 'samples': 26561664, 'steps': 138341, 'loss/train': 0.9094234108924866} 11/07/2021 16:44:03 - INFO - __main__ - Step 138343: {'lr': 7.615825820969618e-06, 'samples': 26561856, 'steps': 138342, 'loss/train': 1.373570442199707} 11/07/2021 16:44:04 - INFO - __main__ - Step 138344: {'lr': 7.614526009225797e-06, 'samples': 26562048, 'steps': 138343, 'loss/train': 1.1639573574066162} 11/07/2021 16:44:05 - INFO - __main__ - Step 138345: {'lr': 7.613226306697085e-06, 'samples': 26562240, 'steps': 138344, 'loss/train': 1.2249675989151} 11/07/2021 16:44:05 - INFO - __main__ - Step 138346: {'lr': 7.611926713384121e-06, 'samples': 26562432, 'steps': 138345, 'loss/train': 1.392830491065979} 11/07/2021 16:44:05 - INFO - __main__ - Step 138347: {'lr': 7.610627229287514e-06, 'samples': 26562624, 'steps': 138346, 'loss/train': 0.849307656288147} 11/07/2021 16:44:06 - INFO - __main__ - Step 138348: {'lr': 7.609327854407794e-06, 'samples': 26562816, 'steps': 138347, 'loss/train': 1.2385269403457642} 11/07/2021 16:44:07 - INFO - __main__ - Step 138349: {'lr': 7.608028588745569e-06, 'samples': 26563008, 'steps': 138348, 'loss/train': 0.6783626079559326} 11/07/2021 16:44:07 - INFO - __main__ - Step 138350: {'lr': 7.606729432301424e-06, 'samples': 26563200, 'steps': 138349, 'loss/train': 0.6078838109970093} 11/07/2021 16:44:08 - INFO - __main__ - Step 138351: {'lr': 7.6054303850759395e-06, 'samples': 26563392, 'steps': 138350, 'loss/train': 1.134668231010437} 11/07/2021 16:44:08 - INFO - __main__ - Step 138352: {'lr': 7.604131447069729e-06, 'samples': 26563584, 'steps': 138351, 'loss/train': 1.1322652101516724} 11/07/2021 16:44:08 - INFO - __main__ - Step 138353: {'lr': 7.602832618283345e-06, 'samples': 26563776, 'steps': 138352, 'loss/train': 0.5090171098709106} 11/07/2021 16:44:09 - INFO - __main__ - Step 138354: {'lr': 7.6015338987173724e-06, 'samples': 26563968, 'steps': 138353, 'loss/train': 0.3406889736652374} 11/07/2021 16:44:10 - INFO - __main__ - Step 138355: {'lr': 7.600235288372448e-06, 'samples': 26564160, 'steps': 138354, 'loss/train': 1.2602914571762085} 11/07/2021 16:44:10 - INFO - __main__ - Step 138356: {'lr': 7.5989367872491e-06, 'samples': 26564352, 'steps': 138355, 'loss/train': 1.5236107110977173} 11/07/2021 16:44:10 - INFO - __main__ - Step 138357: {'lr': 7.5976383953479115e-06, 'samples': 26564544, 'steps': 138356, 'loss/train': 1.1748180389404297} 11/07/2021 16:44:11 - INFO - __main__ - Step 138358: {'lr': 7.596340112669492e-06, 'samples': 26564736, 'steps': 138357, 'loss/train': 0.9382304549217224} 11/07/2021 16:44:12 - INFO - __main__ - Step 138359: {'lr': 7.5950419392144254e-06, 'samples': 26564928, 'steps': 138358, 'loss/train': 0.8206194043159485} 11/07/2021 16:44:12 - INFO - __main__ - Step 138360: {'lr': 7.593743874983295e-06, 'samples': 26565120, 'steps': 138359, 'loss/train': 1.0625474452972412} 11/07/2021 16:44:12 - INFO - __main__ - Step 138361: {'lr': 7.592445919976681e-06, 'samples': 26565312, 'steps': 138360, 'loss/train': 1.185194730758667} 11/07/2021 16:44:13 - INFO - __main__ - Step 138362: {'lr': 7.59114807419517e-06, 'samples': 26565504, 'steps': 138361, 'loss/train': 1.2372034788131714} 11/07/2021 16:44:13 - INFO - __main__ - Step 138363: {'lr': 7.589850337639343e-06, 'samples': 26565696, 'steps': 138362, 'loss/train': 1.6639765501022339} 11/07/2021 16:44:14 - INFO - __main__ - Step 138364: {'lr': 7.588552710309809e-06, 'samples': 26565888, 'steps': 138363, 'loss/train': 1.593521237373352} 11/07/2021 16:44:15 - INFO - __main__ - Step 138365: {'lr': 7.587255192207126e-06, 'samples': 26566080, 'steps': 138364, 'loss/train': 1.056755781173706} 11/07/2021 16:44:15 - INFO - __main__ - Step 138366: {'lr': 7.585957783331876e-06, 'samples': 26566272, 'steps': 138365, 'loss/train': 1.1496912240982056} 11/07/2021 16:44:15 - INFO - __main__ - Step 138367: {'lr': 7.584660483684669e-06, 'samples': 26566464, 'steps': 138366, 'loss/train': 1.354425072669983} 11/07/2021 16:44:16 - INFO - __main__ - Step 138368: {'lr': 7.5833632932660325e-06, 'samples': 26566656, 'steps': 138367, 'loss/train': 1.1186890602111816} 11/07/2021 16:44:17 - INFO - __main__ - Step 138369: {'lr': 7.582066212076632e-06, 'samples': 26566848, 'steps': 138368, 'loss/train': 0.9226511120796204} 11/07/2021 16:44:17 - INFO - __main__ - Step 138370: {'lr': 7.580769240116997e-06, 'samples': 26567040, 'steps': 138369, 'loss/train': 1.1971739530563354} 11/07/2021 16:44:17 - INFO - __main__ - Step 138371: {'lr': 7.579472377387708e-06, 'samples': 26567232, 'steps': 138370, 'loss/train': 1.4617879390716553} 11/07/2021 16:44:18 - INFO - __main__ - Step 138372: {'lr': 7.578175623889405e-06, 'samples': 26567424, 'steps': 138371, 'loss/train': 1.1310211420059204} 11/07/2021 16:44:18 - INFO - __main__ - Step 138373: {'lr': 7.5768789796226144e-06, 'samples': 26567616, 'steps': 138372, 'loss/train': 0.9417533874511719} 11/07/2021 16:44:19 - INFO - __main__ - Step 138374: {'lr': 7.57558244458792e-06, 'samples': 26567808, 'steps': 138373, 'loss/train': 0.6987670660018921} 11/07/2021 16:44:19 - INFO - __main__ - Step 138375: {'lr': 7.574286018785959e-06, 'samples': 26568000, 'steps': 138374, 'loss/train': 1.3414978981018066} 11/07/2021 16:44:20 - INFO - __main__ - Step 138376: {'lr': 7.572989702217259e-06, 'samples': 26568192, 'steps': 138375, 'loss/train': 1.0480924844741821} 11/07/2021 16:44:20 - INFO - __main__ - Step 138377: {'lr': 7.571693494882459e-06, 'samples': 26568384, 'steps': 138376, 'loss/train': 1.5703976154327393} 11/07/2021 16:44:20 - INFO - __main__ - Step 138378: {'lr': 7.570397396782114e-06, 'samples': 26568576, 'steps': 138377, 'loss/train': 1.3481345176696777} 11/07/2021 16:44:21 - INFO - __main__ - Step 138379: {'lr': 7.569101407916806e-06, 'samples': 26568768, 'steps': 138378, 'loss/train': 1.4206002950668335} 11/07/2021 16:44:22 - INFO - __main__ - Step 138380: {'lr': 7.567805528287092e-06, 'samples': 26568960, 'steps': 138379, 'loss/train': 0.914145290851593} 11/07/2021 16:44:22 - INFO - __main__ - Step 138381: {'lr': 7.566509757893608e-06, 'samples': 26569152, 'steps': 138380, 'loss/train': 1.296454906463623} 11/07/2021 16:44:23 - INFO - __main__ - Step 138382: {'lr': 7.565214096736883e-06, 'samples': 26569344, 'steps': 138381, 'loss/train': 1.404468059539795} 11/07/2021 16:44:23 - INFO - __main__ - Step 138383: {'lr': 7.563918544817555e-06, 'samples': 26569536, 'steps': 138382, 'loss/train': 0.9608911871910095} 11/07/2021 16:44:24 - INFO - __main__ - Step 138384: {'lr': 7.562623102136179e-06, 'samples': 26569728, 'steps': 138383, 'loss/train': 1.3629226684570312} 11/07/2021 16:44:24 - INFO - __main__ - Step 138385: {'lr': 7.561327768693366e-06, 'samples': 26569920, 'steps': 138384, 'loss/train': 0.8655726909637451} 11/07/2021 16:44:25 - INFO - __main__ - Step 138386: {'lr': 7.5600325444896426e-06, 'samples': 26570112, 'steps': 138385, 'loss/train': 1.5464189052581787} 11/07/2021 16:44:25 - INFO - __main__ - Step 138387: {'lr': 7.558737429525647e-06, 'samples': 26570304, 'steps': 138386, 'loss/train': 1.1440626382827759} 11/07/2021 16:44:25 - INFO - __main__ - Step 138388: {'lr': 7.557442423801935e-06, 'samples': 26570496, 'steps': 138387, 'loss/train': 1.2266392707824707} 11/07/2021 16:44:26 - INFO - __main__ - Step 138389: {'lr': 7.5561475273190905e-06, 'samples': 26570688, 'steps': 138388, 'loss/train': 0.9872320294380188} 11/07/2021 16:44:27 - INFO - __main__ - Step 138390: {'lr': 7.5548527400777224e-06, 'samples': 26570880, 'steps': 138389, 'loss/train': 0.49301761388778687} 11/07/2021 16:44:27 - INFO - __main__ - Step 138391: {'lr': 7.553558062078386e-06, 'samples': 26571072, 'steps': 138390, 'loss/train': 1.1601563692092896} 11/07/2021 16:44:27 - INFO - __main__ - Step 138392: {'lr': 7.55226349332172e-06, 'samples': 26571264, 'steps': 138391, 'loss/train': 1.292283535003662} 11/07/2021 16:44:28 - INFO - __main__ - Step 138393: {'lr': 7.5509690338082244e-06, 'samples': 26571456, 'steps': 138392, 'loss/train': 1.6512144804000854} 11/07/2021 16:44:29 - INFO - __main__ - Step 138394: {'lr': 7.54967468353851e-06, 'samples': 26571648, 'steps': 138393, 'loss/train': 1.2602202892303467} 11/07/2021 16:44:29 - INFO - __main__ - Step 138395: {'lr': 7.548380442513186e-06, 'samples': 26571840, 'steps': 138394, 'loss/train': 0.902192234992981} 11/07/2021 16:44:30 - INFO - __main__ - Step 138396: {'lr': 7.5470863107328095e-06, 'samples': 26572032, 'steps': 138395, 'loss/train': 0.9509066343307495} 11/07/2021 16:44:30 - INFO - __main__ - Step 138397: {'lr': 7.5457922881979616e-06, 'samples': 26572224, 'steps': 138396, 'loss/train': 1.6549967527389526} 11/07/2021 16:44:30 - INFO - __main__ - Step 138398: {'lr': 7.544498374909281e-06, 'samples': 26572416, 'steps': 138397, 'loss/train': 1.2589757442474365} 11/07/2021 16:44:31 - INFO - __main__ - Step 138399: {'lr': 7.543204570867268e-06, 'samples': 26572608, 'steps': 138398, 'loss/train': 1.0129523277282715} 11/07/2021 16:44:32 - INFO - __main__ - Step 138400: {'lr': 7.541910876072561e-06, 'samples': 26572800, 'steps': 138399, 'loss/train': 1.0949798822402954} 11/07/2021 16:44:32 - INFO - __main__ - Step 138401: {'lr': 7.540617290525742e-06, 'samples': 26572992, 'steps': 138400, 'loss/train': 1.125745415687561} 11/07/2021 16:44:32 - INFO - __main__ - Step 138402: {'lr': 7.539323814227339e-06, 'samples': 26573184, 'steps': 138401, 'loss/train': 0.99437415599823} 11/07/2021 16:44:33 - INFO - __main__ - Step 138403: {'lr': 7.538030447178018e-06, 'samples': 26573376, 'steps': 138402, 'loss/train': 1.3157652616500854} 11/07/2021 16:44:33 - INFO - __main__ - Step 138404: {'lr': 7.536737189378306e-06, 'samples': 26573568, 'steps': 138403, 'loss/train': 1.2905738353729248} 11/07/2021 16:44:34 - INFO - __main__ - Step 138405: {'lr': 7.535444040828815e-06, 'samples': 26573760, 'steps': 138404, 'loss/train': 1.1768440008163452} 11/07/2021 16:44:34 - INFO - __main__ - Step 138406: {'lr': 7.534151001530099e-06, 'samples': 26573952, 'steps': 138405, 'loss/train': 1.0179928541183472} 11/07/2021 16:44:35 - INFO - __main__ - Step 138407: {'lr': 7.53285807148274e-06, 'samples': 26574144, 'steps': 138406, 'loss/train': 1.3406659364700317} 11/07/2021 16:44:35 - INFO - __main__ - Step 138408: {'lr': 7.531565250687322e-06, 'samples': 26574336, 'steps': 138407, 'loss/train': 0.8955969214439392} 11/07/2021 16:44:35 - INFO - __main__ - Step 138409: {'lr': 7.530272539144456e-06, 'samples': 26574528, 'steps': 138408, 'loss/train': 1.1026692390441895} 11/07/2021 16:44:37 - INFO - __main__ - Step 138410: {'lr': 7.528979936854724e-06, 'samples': 26574720, 'steps': 138409, 'loss/train': 1.3843721151351929} 11/07/2021 16:44:37 - INFO - __main__ - Step 138411: {'lr': 7.527687443818681e-06, 'samples': 26574912, 'steps': 138410, 'loss/train': 1.2320829629898071} 11/07/2021 16:44:37 - INFO - __main__ - Step 138412: {'lr': 7.526395060036911e-06, 'samples': 26575104, 'steps': 138411, 'loss/train': 0.5067291855812073} 11/07/2021 16:44:38 - INFO - __main__ - Step 138413: {'lr': 7.525102785509996e-06, 'samples': 26575296, 'steps': 138412, 'loss/train': 1.1145192384719849} 11/07/2021 16:44:38 - INFO - __main__ - Step 138414: {'lr': 7.523810620238547e-06, 'samples': 26575488, 'steps': 138413, 'loss/train': 1.5385997295379639} 11/07/2021 16:44:39 - INFO - __main__ - Step 138415: {'lr': 7.522518564223119e-06, 'samples': 26575680, 'steps': 138414, 'loss/train': 0.7632546424865723} 11/07/2021 16:44:39 - INFO - __main__ - Step 138416: {'lr': 7.521226617464322e-06, 'samples': 26575872, 'steps': 138415, 'loss/train': 1.3750474452972412} 11/07/2021 16:44:40 - INFO - __main__ - Step 138417: {'lr': 7.519934779962684e-06, 'samples': 26576064, 'steps': 138416, 'loss/train': 1.4048702716827393} 11/07/2021 16:44:40 - INFO - __main__ - Step 138418: {'lr': 7.518643051718843e-06, 'samples': 26576256, 'steps': 138417, 'loss/train': 1.2166677713394165} 11/07/2021 16:44:40 - INFO - __main__ - Step 138419: {'lr': 7.5173514327333825e-06, 'samples': 26576448, 'steps': 138418, 'loss/train': 1.1245392560958862} 11/07/2021 16:44:41 - INFO - __main__ - Step 138420: {'lr': 7.516059923006829e-06, 'samples': 26576640, 'steps': 138419, 'loss/train': 1.2808380126953125} 11/07/2021 16:44:42 - INFO - __main__ - Step 138421: {'lr': 7.514768522539822e-06, 'samples': 26576832, 'steps': 138420, 'loss/train': 1.0135964155197144} 11/07/2021 16:44:42 - INFO - __main__ - Step 138422: {'lr': 7.5134772313328884e-06, 'samples': 26577024, 'steps': 138421, 'loss/train': 1.2705215215682983} 11/07/2021 16:44:43 - INFO - __main__ - Step 138423: {'lr': 7.512186049386666e-06, 'samples': 26577216, 'steps': 138422, 'loss/train': 1.7541996240615845} 11/07/2021 16:44:43 - INFO - __main__ - Step 138424: {'lr': 7.510894976701682e-06, 'samples': 26577408, 'steps': 138423, 'loss/train': 0.5639904737472534} 11/07/2021 16:44:44 - INFO - __main__ - Step 138425: {'lr': 7.509604013278576e-06, 'samples': 26577600, 'steps': 138424, 'loss/train': 0.7540420293807983} 11/07/2021 16:44:44 - INFO - __main__ - Step 138426: {'lr': 7.5083131591178745e-06, 'samples': 26577792, 'steps': 138425, 'loss/train': 1.2994731664657593} 11/07/2021 16:44:45 - INFO - __main__ - Step 138427: {'lr': 7.5070224142202155e-06, 'samples': 26577984, 'steps': 138426, 'loss/train': 0.4066222608089447} 11/07/2021 16:44:45 - INFO - __main__ - Step 138428: {'lr': 7.505731778586128e-06, 'samples': 26578176, 'steps': 138427, 'loss/train': 0.8778418898582458} 11/07/2021 16:44:45 - INFO - __main__ - Step 138429: {'lr': 7.504441252216221e-06, 'samples': 26578368, 'steps': 138428, 'loss/train': 1.378525972366333} 11/07/2021 16:44:46 - INFO - __main__ - Step 138430: {'lr': 7.50315083511105e-06, 'samples': 26578560, 'steps': 138429, 'loss/train': 1.1449031829833984} 11/07/2021 16:44:47 - INFO - __main__ - Step 138431: {'lr': 7.501860527271254e-06, 'samples': 26578752, 'steps': 138430, 'loss/train': 1.534035563468933} 11/07/2021 16:44:47 - INFO - __main__ - Step 138432: {'lr': 7.500570328697387e-06, 'samples': 26578944, 'steps': 138431, 'loss/train': 1.4568383693695068} 11/07/2021 16:44:47 - INFO - __main__ - Step 138433: {'lr': 7.499280239389977e-06, 'samples': 26579136, 'steps': 138432, 'loss/train': 1.3333487510681152} 11/07/2021 16:44:48 - INFO - __main__ - Step 138434: {'lr': 7.49799025934969e-06, 'samples': 26579328, 'steps': 138433, 'loss/train': 1.2911555767059326} 11/07/2021 16:44:48 - INFO - __main__ - Step 138435: {'lr': 7.496700388577027e-06, 'samples': 26579520, 'steps': 138434, 'loss/train': 1.113328456878662} 11/07/2021 16:44:49 - INFO - __main__ - Step 138436: {'lr': 7.495410627072624e-06, 'samples': 26579712, 'steps': 138435, 'loss/train': 1.2692819833755493} 11/07/2021 16:44:50 - INFO - __main__ - Step 138437: {'lr': 7.494120974837065e-06, 'samples': 26579904, 'steps': 138436, 'loss/train': 1.4143108129501343} 11/07/2021 16:44:50 - INFO - __main__ - Step 138438: {'lr': 7.492831431870878e-06, 'samples': 26580096, 'steps': 138437, 'loss/train': 1.313600778579712} 11/07/2021 16:44:50 - INFO - __main__ - Step 138439: {'lr': 7.4915419981747e-06, 'samples': 26580288, 'steps': 138438, 'loss/train': 0.7053871154785156} 11/07/2021 16:44:51 - INFO - __main__ - Step 138440: {'lr': 7.490252673749087e-06, 'samples': 26580480, 'steps': 138439, 'loss/train': 1.2182821035385132} 11/07/2021 16:44:52 - INFO - __main__ - Step 138441: {'lr': 7.488963458594622e-06, 'samples': 26580672, 'steps': 138440, 'loss/train': 1.1202539205551147} 11/07/2021 16:44:52 - INFO - __main__ - Step 138442: {'lr': 7.487674352711915e-06, 'samples': 26580864, 'steps': 138441, 'loss/train': 1.2581489086151123} 11/07/2021 16:44:52 - INFO - __main__ - Step 138443: {'lr': 7.486385356101494e-06, 'samples': 26581056, 'steps': 138442, 'loss/train': 1.420412540435791} 11/07/2021 16:44:53 - INFO - __main__ - Step 138444: {'lr': 7.485096468763969e-06, 'samples': 26581248, 'steps': 138443, 'loss/train': 1.4154084920883179} 11/07/2021 16:44:53 - INFO - __main__ - Step 138445: {'lr': 7.483807690699896e-06, 'samples': 26581440, 'steps': 138444, 'loss/train': 1.1168562173843384} 11/07/2021 16:44:54 - INFO - __main__ - Step 138446: {'lr': 7.482519021909939e-06, 'samples': 26581632, 'steps': 138445, 'loss/train': 1.4599180221557617} 11/07/2021 16:44:54 - INFO - __main__ - Step 138447: {'lr': 7.481230462394573e-06, 'samples': 26581824, 'steps': 138446, 'loss/train': 1.2906155586242676} 11/07/2021 16:44:55 - INFO - __main__ - Step 138448: {'lr': 7.479942012154406e-06, 'samples': 26582016, 'steps': 138447, 'loss/train': 1.0386178493499756} 11/07/2021 16:44:55 - INFO - __main__ - Step 138449: {'lr': 7.478653671190078e-06, 'samples': 26582208, 'steps': 138448, 'loss/train': 1.6638058423995972} 11/07/2021 16:44:55 - INFO - __main__ - Step 138450: {'lr': 7.4773654395020875e-06, 'samples': 26582400, 'steps': 138449, 'loss/train': 1.2865173816680908} 11/07/2021 16:44:57 - INFO - __main__ - Step 138451: {'lr': 7.476077317091073e-06, 'samples': 26582592, 'steps': 138450, 'loss/train': 1.3005056381225586} 11/07/2021 16:44:57 - INFO - __main__ - Step 138452: {'lr': 7.474789303957591e-06, 'samples': 26582784, 'steps': 138451, 'loss/train': 1.940355658531189} 11/07/2021 16:44:57 - INFO - __main__ - Step 138453: {'lr': 7.473501400102223e-06, 'samples': 26582976, 'steps': 138452, 'loss/train': 1.3487879037857056} 11/07/2021 16:44:58 - INFO - __main__ - Step 138454: {'lr': 7.472213605525552e-06, 'samples': 26583168, 'steps': 138453, 'loss/train': 1.1633268594741821} 11/07/2021 16:44:58 - INFO - __main__ - Step 138455: {'lr': 7.470925920228161e-06, 'samples': 26583360, 'steps': 138454, 'loss/train': 1.6998060941696167} 11/07/2021 16:44:59 - INFO - __main__ - Step 138456: {'lr': 7.469638344210633e-06, 'samples': 26583552, 'steps': 138455, 'loss/train': 0.9042196869850159} 11/07/2021 16:44:59 - INFO - __main__ - Step 138457: {'lr': 7.468350877473551e-06, 'samples': 26583744, 'steps': 138456, 'loss/train': 1.2301982641220093} 11/07/2021 16:45:00 - INFO - __main__ - Step 138458: {'lr': 7.46706352001747e-06, 'samples': 26583936, 'steps': 138457, 'loss/train': 1.3310571908950806} 11/07/2021 16:45:00 - INFO - __main__ - Step 138459: {'lr': 7.465776271843028e-06, 'samples': 26584128, 'steps': 138458, 'loss/train': 1.316955327987671} 11/07/2021 16:45:00 - INFO - __main__ - Step 138460: {'lr': 7.464489132950724e-06, 'samples': 26584320, 'steps': 138459, 'loss/train': 1.5011107921600342} 11/07/2021 16:45:01 - INFO - __main__ - Step 138461: {'lr': 7.463202103341171e-06, 'samples': 26584512, 'steps': 138460, 'loss/train': 1.210526466369629} 11/07/2021 16:45:02 - INFO - __main__ - Step 138462: {'lr': 7.4619151830149774e-06, 'samples': 26584704, 'steps': 138461, 'loss/train': 1.2399828433990479} 11/07/2021 16:45:02 - INFO - __main__ - Step 138463: {'lr': 7.4606283719726995e-06, 'samples': 26584896, 'steps': 138462, 'loss/train': 1.1592556238174438} 11/07/2021 16:45:03 - INFO - __main__ - Step 138464: {'lr': 7.45934167021492e-06, 'samples': 26585088, 'steps': 138463, 'loss/train': 1.1468236446380615} 11/07/2021 16:45:03 - INFO - __main__ - Step 138465: {'lr': 7.4580550777422205e-06, 'samples': 26585280, 'steps': 138464, 'loss/train': 1.505786657333374} 11/07/2021 16:45:04 - INFO - __main__ - Step 138466: {'lr': 7.456768594555158e-06, 'samples': 26585472, 'steps': 138465, 'loss/train': 0.8476134538650513} 11/07/2021 16:45:04 - INFO - __main__ - Step 138467: {'lr': 7.455482220654342e-06, 'samples': 26585664, 'steps': 138466, 'loss/train': 1.744113564491272} 11/07/2021 16:45:05 - INFO - __main__ - Step 138468: {'lr': 7.454195956040355e-06, 'samples': 26585856, 'steps': 138467, 'loss/train': 0.9503555297851562} 11/07/2021 16:45:05 - INFO - __main__ - Step 138469: {'lr': 7.452909800713753e-06, 'samples': 26586048, 'steps': 138468, 'loss/train': 0.9025232791900635} 11/07/2021 16:45:05 - INFO - __main__ - Step 138470: {'lr': 7.451623754675147e-06, 'samples': 26586240, 'steps': 138469, 'loss/train': 1.2324849367141724} 11/07/2021 16:45:06 - INFO - __main__ - Step 138471: {'lr': 7.45033781792509e-06, 'samples': 26586432, 'steps': 138470, 'loss/train': 1.3324966430664062} 11/07/2021 16:45:07 - INFO - __main__ - Step 138472: {'lr': 7.449051990464139e-06, 'samples': 26586624, 'steps': 138471, 'loss/train': 1.2262866497039795} 11/07/2021 16:45:07 - INFO - __main__ - Step 138473: {'lr': 7.4477662722929604e-06, 'samples': 26586816, 'steps': 138472, 'loss/train': 0.8633838891983032} 11/07/2021 16:45:07 - INFO - __main__ - Step 138474: {'lr': 7.446480663412053e-06, 'samples': 26587008, 'steps': 138473, 'loss/train': 1.167197823524475} 11/07/2021 16:45:08 - INFO - __main__ - Step 138475: {'lr': 7.4451951638219995e-06, 'samples': 26587200, 'steps': 138474, 'loss/train': 1.148893117904663} 11/07/2021 16:45:08 - INFO - __main__ - Step 138476: {'lr': 7.443909773523411e-06, 'samples': 26587392, 'steps': 138475, 'loss/train': 1.2335968017578125} 11/07/2021 16:45:09 - INFO - __main__ - Step 138477: {'lr': 7.442624492516842e-06, 'samples': 26587584, 'steps': 138476, 'loss/train': 1.5278680324554443} 11/07/2021 16:45:10 - INFO - __main__ - Step 138478: {'lr': 7.441339320802876e-06, 'samples': 26587776, 'steps': 138477, 'loss/train': 1.4109894037246704} 11/07/2021 16:45:10 - INFO - __main__ - Step 138479: {'lr': 7.440054258382123e-06, 'samples': 26587968, 'steps': 138478, 'loss/train': 1.1597988605499268} 11/07/2021 16:45:10 - INFO - __main__ - Step 138480: {'lr': 7.438769305255111e-06, 'samples': 26588160, 'steps': 138479, 'loss/train': 1.2756427526474} 11/07/2021 16:45:11 - INFO - __main__ - Step 138481: {'lr': 7.437484461422478e-06, 'samples': 26588352, 'steps': 138480, 'loss/train': 1.1611477136611938} 11/07/2021 16:45:12 - INFO - __main__ - Step 138482: {'lr': 7.4361997268847514e-06, 'samples': 26588544, 'steps': 138481, 'loss/train': 1.2584426403045654} 11/07/2021 16:45:13 - INFO - __main__ - Step 138483: {'lr': 7.434915101642542e-06, 'samples': 26588736, 'steps': 138482, 'loss/train': 1.388146162033081} 11/07/2021 16:45:13 - INFO - __main__ - Step 138484: {'lr': 7.433630585696405e-06, 'samples': 26588928, 'steps': 138483, 'loss/train': 1.1453803777694702} 11/07/2021 16:45:13 - INFO - __main__ - Step 138485: {'lr': 7.432346179046923e-06, 'samples': 26589120, 'steps': 138484, 'loss/train': 1.2928920984268188} 11/07/2021 16:45:14 - INFO - __main__ - Step 138486: {'lr': 7.431061881694734e-06, 'samples': 26589312, 'steps': 138485, 'loss/train': 1.3796906471252441} 11/07/2021 16:45:14 - INFO - __main__ - Step 138487: {'lr': 7.42977769364031e-06, 'samples': 26589504, 'steps': 138486, 'loss/train': 1.1615527868270874} 11/07/2021 16:45:16 - INFO - __main__ - Step 138488: {'lr': 7.4284936148843185e-06, 'samples': 26589696, 'steps': 138487, 'loss/train': 1.7366622686386108} 11/07/2021 16:45:16 - INFO - __main__ - Step 138489: {'lr': 7.427209645427285e-06, 'samples': 26589888, 'steps': 138488, 'loss/train': 1.4023077487945557} 11/07/2021 16:45:16 - INFO - __main__ - Step 138490: {'lr': 7.425925785269822e-06, 'samples': 26590080, 'steps': 138489, 'loss/train': 0.06980116665363312} 11/07/2021 16:45:17 - INFO - __main__ - Step 138491: {'lr': 7.424642034412482e-06, 'samples': 26590272, 'steps': 138490, 'loss/train': 1.7821145057678223} 11/07/2021 16:45:17 - INFO - __main__ - Step 138492: {'lr': 7.42335839285585e-06, 'samples': 26590464, 'steps': 138491, 'loss/train': 1.53273344039917} 11/07/2021 16:45:18 - INFO - __main__ - Step 138493: {'lr': 7.422074860600509e-06, 'samples': 26590656, 'steps': 138492, 'loss/train': 1.4017754793167114} 11/07/2021 16:45:18 - INFO - __main__ - Step 138494: {'lr': 7.4207914376470395e-06, 'samples': 26590848, 'steps': 138493, 'loss/train': 0.9877405166625977} 11/07/2021 16:45:19 - INFO - __main__ - Step 138495: {'lr': 7.4195081239960275e-06, 'samples': 26591040, 'steps': 138494, 'loss/train': 0.9124326705932617} 11/07/2021 16:45:19 - INFO - __main__ - Step 138496: {'lr': 7.418224919648026e-06, 'samples': 26591232, 'steps': 138495, 'loss/train': 1.3907033205032349} 11/07/2021 16:45:19 - INFO - __main__ - Step 138497: {'lr': 7.416941824603646e-06, 'samples': 26591424, 'steps': 138496, 'loss/train': 1.4058046340942383} 11/07/2021 16:45:20 - INFO - __main__ - Step 138498: {'lr': 7.4156588388634425e-06, 'samples': 26591616, 'steps': 138497, 'loss/train': 1.1206748485565186} 11/07/2021 16:45:21 - INFO - __main__ - Step 138499: {'lr': 7.414375962427999e-06, 'samples': 26591808, 'steps': 138498, 'loss/train': 1.998777151107788} 11/07/2021 16:45:21 - INFO - __main__ - Step 138500: {'lr': 7.413093195297926e-06, 'samples': 26592000, 'steps': 138499, 'loss/train': 1.391373872756958} 11/07/2021 16:45:21 - INFO - __main__ - Step 138501: {'lr': 7.411810537473751e-06, 'samples': 26592192, 'steps': 138500, 'loss/train': 1.721417784690857} 11/07/2021 16:45:22 - INFO - __main__ - Step 138502: {'lr': 7.410527988956056e-06, 'samples': 26592384, 'steps': 138501, 'loss/train': 1.3386659622192383} 11/07/2021 16:45:22 - INFO - __main__ - Step 138503: {'lr': 7.409245549745425e-06, 'samples': 26592576, 'steps': 138502, 'loss/train': 1.1688646078109741} 11/07/2021 16:45:23 - INFO - __main__ - Step 138504: {'lr': 7.407963219842467e-06, 'samples': 26592768, 'steps': 138503, 'loss/train': 1.216385841369629} 11/07/2021 16:45:24 - INFO - __main__ - Step 138505: {'lr': 7.406680999247739e-06, 'samples': 26592960, 'steps': 138504, 'loss/train': 1.1505383253097534} 11/07/2021 16:45:24 - INFO - __main__ - Step 138506: {'lr': 7.405398887961795e-06, 'samples': 26593152, 'steps': 138505, 'loss/train': 1.1483047008514404} 11/07/2021 16:45:24 - INFO - __main__ - Step 138507: {'lr': 7.404116885985246e-06, 'samples': 26593344, 'steps': 138506, 'loss/train': 1.3468561172485352} 11/07/2021 16:45:25 - INFO - __main__ - Step 138508: {'lr': 7.402834993318647e-06, 'samples': 26593536, 'steps': 138507, 'loss/train': 0.05464718118309975} 11/07/2021 16:45:26 - INFO - __main__ - Step 138509: {'lr': 7.4015532099626085e-06, 'samples': 26593728, 'steps': 138508, 'loss/train': 1.403070092201233} 11/07/2021 16:45:26 - INFO - __main__ - Step 138510: {'lr': 7.4002715359176855e-06, 'samples': 26593920, 'steps': 138509, 'loss/train': 1.5250905752182007} 11/07/2021 16:45:26 - INFO - __main__ - Step 138511: {'lr': 7.398989971184433e-06, 'samples': 26594112, 'steps': 138510, 'loss/train': 1.3007407188415527} 11/07/2021 16:45:27 - INFO - __main__ - Step 138512: {'lr': 7.397708515763463e-06, 'samples': 26594304, 'steps': 138511, 'loss/train': 1.007300615310669} 11/07/2021 16:45:27 - INFO - __main__ - Step 138513: {'lr': 7.396427169655384e-06, 'samples': 26594496, 'steps': 138512, 'loss/train': 1.1878535747528076} 11/07/2021 16:45:28 - INFO - __main__ - Step 138514: {'lr': 7.395145932860669e-06, 'samples': 26594688, 'steps': 138513, 'loss/train': 0.9658278822898865} 11/07/2021 16:45:29 - INFO - __main__ - Step 138515: {'lr': 7.393864805379985e-06, 'samples': 26594880, 'steps': 138514, 'loss/train': 1.0542844533920288} 11/07/2021 16:45:29 - INFO - __main__ - Step 138516: {'lr': 7.392583787213886e-06, 'samples': 26595072, 'steps': 138515, 'loss/train': 1.5439656972885132} 11/07/2021 16:45:29 - INFO - __main__ - Step 138517: {'lr': 7.391302878362927e-06, 'samples': 26595264, 'steps': 138516, 'loss/train': 1.1982951164245605} 11/07/2021 16:45:30 - INFO - __main__ - Step 138518: {'lr': 7.390022078827718e-06, 'samples': 26595456, 'steps': 138517, 'loss/train': 0.8816930651664734} 11/07/2021 16:45:31 - INFO - __main__ - Step 138519: {'lr': 7.388741388608816e-06, 'samples': 26595648, 'steps': 138518, 'loss/train': 1.589868426322937} 11/07/2021 16:45:31 - INFO - __main__ - Step 138520: {'lr': 7.387460807706803e-06, 'samples': 26595840, 'steps': 138519, 'loss/train': 1.6877059936523438} 11/07/2021 16:45:32 - INFO - __main__ - Step 138521: {'lr': 7.386180336122261e-06, 'samples': 26596032, 'steps': 138520, 'loss/train': 1.4331607818603516} 11/07/2021 16:45:32 - INFO - __main__ - Step 138522: {'lr': 7.384899973855746e-06, 'samples': 26596224, 'steps': 138521, 'loss/train': 1.5031061172485352} 11/07/2021 16:45:32 - INFO - __main__ - Step 138523: {'lr': 7.383619720907869e-06, 'samples': 26596416, 'steps': 138522, 'loss/train': 1.5012776851654053} 11/07/2021 16:45:33 - INFO - __main__ - Step 138524: {'lr': 7.382339577279185e-06, 'samples': 26596608, 'steps': 138523, 'loss/train': 0.4853067696094513} 11/07/2021 16:45:34 - INFO - __main__ - Step 138525: {'lr': 7.381059542970276e-06, 'samples': 26596800, 'steps': 138524, 'loss/train': 1.1235511302947998} 11/07/2021 16:45:34 - INFO - __main__ - Step 138526: {'lr': 7.379779617981752e-06, 'samples': 26596992, 'steps': 138525, 'loss/train': 1.162192463874817} 11/07/2021 16:45:34 - INFO - __main__ - Step 138527: {'lr': 7.378499802314115e-06, 'samples': 26597184, 'steps': 138526, 'loss/train': 1.2988719940185547} 11/07/2021 16:45:35 - INFO - __main__ - Step 138528: {'lr': 7.377220095967974e-06, 'samples': 26597376, 'steps': 138527, 'loss/train': 1.3897472620010376} 11/07/2021 16:45:36 - INFO - __main__ - Step 138529: {'lr': 7.3759404989439396e-06, 'samples': 26597568, 'steps': 138528, 'loss/train': 1.1665873527526855} 11/07/2021 16:45:36 - INFO - __main__ - Step 138530: {'lr': 7.3746610112425394e-06, 'samples': 26597760, 'steps': 138529, 'loss/train': 1.0109095573425293} 11/07/2021 16:45:37 - INFO - __main__ - Step 138531: {'lr': 7.373381632864384e-06, 'samples': 26597952, 'steps': 138530, 'loss/train': 1.0866502523422241} 11/07/2021 16:45:37 - INFO - __main__ - Step 138532: {'lr': 7.372102363810029e-06, 'samples': 26598144, 'steps': 138531, 'loss/train': 1.2696071863174438} 11/07/2021 16:45:37 - INFO - __main__ - Step 138533: {'lr': 7.370823204080085e-06, 'samples': 26598336, 'steps': 138532, 'loss/train': 1.5600910186767578} 11/07/2021 16:45:38 - INFO - __main__ - Step 138534: {'lr': 7.369544153675078e-06, 'samples': 26598528, 'steps': 138533, 'loss/train': 1.5309057235717773} 11/07/2021 16:45:39 - INFO - __main__ - Step 138535: {'lr': 7.368265212595621e-06, 'samples': 26598720, 'steps': 138534, 'loss/train': 1.2313158512115479} 11/07/2021 16:45:39 - INFO - __main__ - Step 138536: {'lr': 7.366986380842295e-06, 'samples': 26598912, 'steps': 138535, 'loss/train': 1.1464930772781372} 11/07/2021 16:45:39 - INFO - __main__ - Step 138537: {'lr': 7.3657076584156265e-06, 'samples': 26599104, 'steps': 138536, 'loss/train': 1.4330413341522217} 11/07/2021 16:45:40 - INFO - __main__ - Step 138538: {'lr': 7.364429045316256e-06, 'samples': 26599296, 'steps': 138537, 'loss/train': 1.5037864446640015} 11/07/2021 16:45:41 - INFO - __main__ - Step 138539: {'lr': 7.363150541544711e-06, 'samples': 26599488, 'steps': 138538, 'loss/train': 1.2424925565719604} 11/07/2021 16:45:41 - INFO - __main__ - Step 138540: {'lr': 7.361872147101628e-06, 'samples': 26599680, 'steps': 138539, 'loss/train': 1.4375094175338745} 11/07/2021 16:45:42 - INFO - __main__ - Step 138541: {'lr': 7.360593861987508e-06, 'samples': 26599872, 'steps': 138540, 'loss/train': 0.758124828338623} 11/07/2021 16:45:42 - INFO - __main__ - Step 138542: {'lr': 7.3593156862029605e-06, 'samples': 26600064, 'steps': 138541, 'loss/train': 0.6697014570236206} 11/07/2021 16:45:42 - INFO - __main__ - Step 138543: {'lr': 7.358037619748542e-06, 'samples': 26600256, 'steps': 138542, 'loss/train': 1.0842722654342651} 11/07/2021 16:45:43 - INFO - __main__ - Step 138544: {'lr': 7.35675966262489e-06, 'samples': 26600448, 'steps': 138543, 'loss/train': 1.19651460647583} 11/07/2021 16:45:44 - INFO - __main__ - Step 138545: {'lr': 7.355481814832504e-06, 'samples': 26600640, 'steps': 138544, 'loss/train': 1.2742173671722412} 11/07/2021 16:45:44 - INFO - __main__ - Step 138546: {'lr': 7.3542040763719955e-06, 'samples': 26600832, 'steps': 138545, 'loss/train': 0.909546971321106} 11/07/2021 16:45:44 - INFO - __main__ - Step 138547: {'lr': 7.352926447243946e-06, 'samples': 26601024, 'steps': 138546, 'loss/train': 1.135678768157959} 11/07/2021 16:45:45 - INFO - __main__ - Step 138548: {'lr': 7.3516489274489116e-06, 'samples': 26601216, 'steps': 138547, 'loss/train': 0.8166192173957825} 11/07/2021 16:45:46 - INFO - __main__ - Step 138549: {'lr': 7.350371516987503e-06, 'samples': 26601408, 'steps': 138548, 'loss/train': 1.322216272354126} 11/07/2021 16:45:46 - INFO - __main__ - Step 138550: {'lr': 7.349094215860275e-06, 'samples': 26601600, 'steps': 138549, 'loss/train': 1.2651983499526978} 11/07/2021 16:45:47 - INFO - __main__ - Step 138551: {'lr': 7.347817024067782e-06, 'samples': 26601792, 'steps': 138550, 'loss/train': 0.9740868806838989} 11/07/2021 16:45:47 - INFO - __main__ - Step 138552: {'lr': 7.346539941610608e-06, 'samples': 26601984, 'steps': 138551, 'loss/train': 1.2655174732208252} 11/07/2021 16:45:47 - INFO - __main__ - Step 138553: {'lr': 7.345262968489391e-06, 'samples': 26602176, 'steps': 138552, 'loss/train': 1.191491961479187} 11/07/2021 16:45:48 - INFO - __main__ - Step 138554: {'lr': 7.343986104704603e-06, 'samples': 26602368, 'steps': 138553, 'loss/train': 1.270389437675476} 11/07/2021 16:45:49 - INFO - __main__ - Step 138555: {'lr': 7.34270935025691e-06, 'samples': 26602560, 'steps': 138554, 'loss/train': 1.2846299409866333} 11/07/2021 16:45:49 - INFO - __main__ - Step 138556: {'lr': 7.341432705146811e-06, 'samples': 26602752, 'steps': 138555, 'loss/train': 1.3191298246383667} 11/07/2021 16:45:49 - INFO - __main__ - Step 138557: {'lr': 7.340156169374917e-06, 'samples': 26602944, 'steps': 138556, 'loss/train': 0.04085048288106918} 11/07/2021 16:45:50 - INFO - __main__ - Step 138558: {'lr': 7.338879742941839e-06, 'samples': 26603136, 'steps': 138557, 'loss/train': 1.1648753881454468} 11/07/2021 16:45:51 - INFO - __main__ - Step 138559: {'lr': 7.337603425848077e-06, 'samples': 26603328, 'steps': 138558, 'loss/train': 1.3519542217254639} 11/07/2021 16:45:51 - INFO - __main__ - Step 138560: {'lr': 7.336327218094269e-06, 'samples': 26603520, 'steps': 138559, 'loss/train': 1.143386960029602} 11/07/2021 16:45:51 - INFO - __main__ - Step 138561: {'lr': 7.3350511196809684e-06, 'samples': 26603712, 'steps': 138560, 'loss/train': 1.1182547807693481} 11/07/2021 16:45:52 - INFO - __main__ - Step 138562: {'lr': 7.333775130608733e-06, 'samples': 26603904, 'steps': 138561, 'loss/train': 1.3678518533706665} 11/07/2021 16:45:52 - INFO - __main__ - Step 138563: {'lr': 7.332499250878172e-06, 'samples': 26604096, 'steps': 138562, 'loss/train': 1.308606743812561} 11/07/2021 16:45:53 - INFO - __main__ - Step 138564: {'lr': 7.331223480489841e-06, 'samples': 26604288, 'steps': 138563, 'loss/train': 1.1253880262374878} 11/07/2021 16:45:54 - INFO - __main__ - Step 138565: {'lr': 7.329947819444294e-06, 'samples': 26604480, 'steps': 138564, 'loss/train': 1.8945260047912598} 11/07/2021 16:45:54 - INFO - __main__ - Step 138566: {'lr': 7.3286722677421424e-06, 'samples': 26604672, 'steps': 138565, 'loss/train': 1.210925579071045} 11/07/2021 16:45:54 - INFO - __main__ - Step 138567: {'lr': 7.3273968253839695e-06, 'samples': 26604864, 'steps': 138566, 'loss/train': 1.1184624433517456} 11/07/2021 16:45:55 - INFO - __main__ - Step 138568: {'lr': 7.32612149237033e-06, 'samples': 26605056, 'steps': 138567, 'loss/train': 1.7996127605438232} 11/07/2021 16:45:55 - INFO - __main__ - Step 138569: {'lr': 7.324846268701752e-06, 'samples': 26605248, 'steps': 138568, 'loss/train': 0.5376297235488892} 11/07/2021 16:45:56 - INFO - __main__ - Step 138570: {'lr': 7.323571154378872e-06, 'samples': 26605440, 'steps': 138569, 'loss/train': 1.3435602188110352} 11/07/2021 16:45:56 - INFO - __main__ - Step 138571: {'lr': 7.322296149402246e-06, 'samples': 26605632, 'steps': 138570, 'loss/train': 1.603914499282837} 11/07/2021 16:45:57 - INFO - __main__ - Step 138572: {'lr': 7.321021253772458e-06, 'samples': 26605824, 'steps': 138571, 'loss/train': 1.5930348634719849} 11/07/2021 16:45:57 - INFO - __main__ - Step 138573: {'lr': 7.3197464674900624e-06, 'samples': 26606016, 'steps': 138572, 'loss/train': 0.4519197642803192} 11/07/2021 16:45:57 - INFO - __main__ - Step 138574: {'lr': 7.318471790555642e-06, 'samples': 26606208, 'steps': 138573, 'loss/train': 1.6570452451705933} 11/07/2021 16:45:59 - INFO - __main__ - Step 138575: {'lr': 7.317197222969779e-06, 'samples': 26606400, 'steps': 138574, 'loss/train': 1.5472692251205444} 11/07/2021 16:45:59 - INFO - __main__ - Step 138576: {'lr': 7.31592276473303e-06, 'samples': 26606592, 'steps': 138575, 'loss/train': 1.3178167343139648} 11/07/2021 16:45:59 - INFO - __main__ - Step 138577: {'lr': 7.314648415846004e-06, 'samples': 26606784, 'steps': 138576, 'loss/train': 1.880993366241455} 11/07/2021 16:46:00 - INFO - __main__ - Step 138578: {'lr': 7.3133741763092285e-06, 'samples': 26606976, 'steps': 138577, 'loss/train': 0.7364013195037842} 11/07/2021 16:46:00 - INFO - __main__ - Step 138579: {'lr': 7.3121000461233155e-06, 'samples': 26607168, 'steps': 138578, 'loss/train': 0.04714124649763107} 11/07/2021 16:46:01 - INFO - __main__ - Step 138580: {'lr': 7.310826025288847e-06, 'samples': 26607360, 'steps': 138579, 'loss/train': 1.406151533126831} 11/07/2021 16:46:01 - INFO - __main__ - Step 138581: {'lr': 7.3095521138063505e-06, 'samples': 26607552, 'steps': 138580, 'loss/train': 1.1579896211624146} 11/07/2021 16:46:02 - INFO - __main__ - Step 138582: {'lr': 7.308278311676436e-06, 'samples': 26607744, 'steps': 138581, 'loss/train': 1.5052703619003296} 11/07/2021 16:46:02 - INFO - __main__ - Step 138583: {'lr': 7.30700461889966e-06, 'samples': 26607936, 'steps': 138582, 'loss/train': 1.5943248271942139} 11/07/2021 16:46:02 - INFO - __main__ - Step 138584: {'lr': 7.305731035476604e-06, 'samples': 26608128, 'steps': 138583, 'loss/train': 1.4801334142684937} 11/07/2021 16:46:04 - INFO - __main__ - Step 138585: {'lr': 7.304457561407823e-06, 'samples': 26608320, 'steps': 138584, 'loss/train': 1.0003783702850342} 11/07/2021 16:46:04 - INFO - __main__ - Step 138586: {'lr': 7.30318419669393e-06, 'samples': 26608512, 'steps': 138585, 'loss/train': 1.3258169889450073} 11/07/2021 16:46:04 - INFO - __main__ - Step 138587: {'lr': 7.301910941335477e-06, 'samples': 26608704, 'steps': 138586, 'loss/train': 1.2897908687591553} 11/07/2021 16:46:05 - INFO - __main__ - Step 138588: {'lr': 7.30063779533302e-06, 'samples': 26608896, 'steps': 138587, 'loss/train': 1.202229619026184} 11/07/2021 16:46:05 - INFO - __main__ - Step 138589: {'lr': 7.299364758687144e-06, 'samples': 26609088, 'steps': 138588, 'loss/train': 1.589185118675232} 11/07/2021 16:46:06 - INFO - __main__ - Step 138590: {'lr': 7.298091831398457e-06, 'samples': 26609280, 'steps': 138589, 'loss/train': 1.1372939348220825} 11/07/2021 16:46:07 - INFO - __main__ - Step 138591: {'lr': 7.296819013467515e-06, 'samples': 26609472, 'steps': 138590, 'loss/train': 1.4908733367919922} 11/07/2021 16:46:07 - INFO - __main__ - Step 138592: {'lr': 7.2955463048948735e-06, 'samples': 26609664, 'steps': 138591, 'loss/train': 1.3825469017028809} 11/07/2021 16:46:07 - INFO - __main__ - Step 138593: {'lr': 7.294273705681087e-06, 'samples': 26609856, 'steps': 138592, 'loss/train': 1.230646014213562} 11/07/2021 16:46:08 - INFO - __main__ - Step 138594: {'lr': 7.293001215826767e-06, 'samples': 26610048, 'steps': 138593, 'loss/train': 1.4233336448669434} 11/07/2021 16:46:09 - INFO - __main__ - Step 138595: {'lr': 7.291728835332468e-06, 'samples': 26610240, 'steps': 138594, 'loss/train': 1.6905194520950317} 11/07/2021 16:46:09 - INFO - __main__ - Step 138596: {'lr': 7.2904565641988e-06, 'samples': 26610432, 'steps': 138595, 'loss/train': 1.1140700578689575} 11/07/2021 16:46:09 - INFO - __main__ - Step 138597: {'lr': 7.289184402426263e-06, 'samples': 26610624, 'steps': 138596, 'loss/train': 1.2095074653625488} 11/07/2021 16:46:10 - INFO - __main__ - Step 138598: {'lr': 7.287912350015497e-06, 'samples': 26610816, 'steps': 138597, 'loss/train': 1.2139075994491577} 11/07/2021 16:46:10 - INFO - __main__ - Step 138599: {'lr': 7.286640406967054e-06, 'samples': 26611008, 'steps': 138598, 'loss/train': 1.6368967294692993} 11/07/2021 16:46:11 - INFO - __main__ - Step 138600: {'lr': 7.28536857328152e-06, 'samples': 26611200, 'steps': 138599, 'loss/train': 1.256898283958435} 11/07/2021 16:46:11 - INFO - __main__ - Step 138601: {'lr': 7.2840968489594205e-06, 'samples': 26611392, 'steps': 138600, 'loss/train': 1.0425126552581787} 11/07/2021 16:46:12 - INFO - __main__ - Step 138602: {'lr': 7.282825234001422e-06, 'samples': 26611584, 'steps': 138601, 'loss/train': 1.375959873199463} 11/07/2021 16:46:12 - INFO - __main__ - Step 138603: {'lr': 7.281553728407997e-06, 'samples': 26611776, 'steps': 138602, 'loss/train': 1.3202815055847168} 11/07/2021 16:46:13 - INFO - __main__ - Step 138604: {'lr': 7.280282332179755e-06, 'samples': 26611968, 'steps': 138603, 'loss/train': 1.1774920225143433} 11/07/2021 16:46:14 - INFO - __main__ - Step 138605: {'lr': 7.279011045317252e-06, 'samples': 26612160, 'steps': 138604, 'loss/train': 1.3294658660888672} 11/07/2021 16:46:14 - INFO - __main__ - Step 138606: {'lr': 7.277739867821098e-06, 'samples': 26612352, 'steps': 138605, 'loss/train': 0.4847676157951355} 11/07/2021 16:46:14 - INFO - __main__ - Step 138607: {'lr': 7.2764687996918765e-06, 'samples': 26612544, 'steps': 138606, 'loss/train': 1.045597791671753} 11/07/2021 16:46:15 - INFO - __main__ - Step 138608: {'lr': 7.275197840930087e-06, 'samples': 26612736, 'steps': 138607, 'loss/train': 1.2826405763626099} 11/07/2021 16:46:15 - INFO - __main__ - Step 138609: {'lr': 7.273926991536367e-06, 'samples': 26612928, 'steps': 138608, 'loss/train': 1.1371712684631348} 11/07/2021 16:46:15 - INFO - __main__ - Step 138610: {'lr': 7.272656251511273e-06, 'samples': 26613120, 'steps': 138609, 'loss/train': 1.0566738843917847} 11/07/2021 16:46:17 - INFO - __main__ - Step 138611: {'lr': 7.271385620855387e-06, 'samples': 26613312, 'steps': 138610, 'loss/train': 0.6226484179496765} 11/07/2021 16:46:17 - INFO - __main__ - Step 138612: {'lr': 7.270115099569291e-06, 'samples': 26613504, 'steps': 138611, 'loss/train': 1.1443787813186646} 11/07/2021 16:46:17 - INFO - __main__ - Step 138613: {'lr': 7.2688446876534865e-06, 'samples': 26613696, 'steps': 138612, 'loss/train': 1.2319985628128052} 11/07/2021 16:46:18 - INFO - __main__ - Step 138614: {'lr': 7.267574385108611e-06, 'samples': 26613888, 'steps': 138613, 'loss/train': 1.0724083185195923} 11/07/2021 16:46:18 - INFO - __main__ - Step 138615: {'lr': 7.266304191935219e-06, 'samples': 26614080, 'steps': 138614, 'loss/train': 0.8859825730323792} 11/07/2021 16:46:19 - INFO - __main__ - Step 138616: {'lr': 7.265034108133894e-06, 'samples': 26614272, 'steps': 138615, 'loss/train': 0.03949522227048874} 11/07/2021 16:46:19 - INFO - __main__ - Step 138617: {'lr': 7.263764133705192e-06, 'samples': 26614464, 'steps': 138616, 'loss/train': 1.4502824544906616} 11/07/2021 16:46:20 - INFO - __main__ - Step 138618: {'lr': 7.262494268649694e-06, 'samples': 26614656, 'steps': 138617, 'loss/train': 1.9087374210357666} 11/07/2021 16:46:20 - INFO - __main__ - Step 138619: {'lr': 7.261224512967956e-06, 'samples': 26614848, 'steps': 138618, 'loss/train': 1.3212538957595825} 11/07/2021 16:46:20 - INFO - __main__ - Step 138620: {'lr': 7.2599548666605895e-06, 'samples': 26615040, 'steps': 138619, 'loss/train': 0.7259055972099304} 11/07/2021 16:46:22 - INFO - __main__ - Step 138621: {'lr': 7.258685329728121e-06, 'samples': 26615232, 'steps': 138620, 'loss/train': 1.2803010940551758} 11/07/2021 16:46:22 - INFO - __main__ - Step 138622: {'lr': 7.2574159021711325e-06, 'samples': 26615424, 'steps': 138621, 'loss/train': 1.502387523651123} 11/07/2021 16:46:22 - INFO - __main__ - Step 138623: {'lr': 7.256146583990264e-06, 'samples': 26615616, 'steps': 138622, 'loss/train': 0.8594983220100403} 11/07/2021 16:46:23 - INFO - __main__ - Step 138624: {'lr': 7.254877375185987e-06, 'samples': 26615808, 'steps': 138623, 'loss/train': 1.4731132984161377} 11/07/2021 16:46:23 - INFO - __main__ - Step 138625: {'lr': 7.253608275758911e-06, 'samples': 26616000, 'steps': 138624, 'loss/train': 0.05351502448320389} 11/07/2021 16:46:24 - INFO - __main__ - Step 138626: {'lr': 7.252339285709619e-06, 'samples': 26616192, 'steps': 138625, 'loss/train': 1.5268173217773438} 11/07/2021 16:46:25 - INFO - __main__ - Step 138627: {'lr': 7.251070405038696e-06, 'samples': 26616384, 'steps': 138626, 'loss/train': 0.9992127418518066} 11/07/2021 16:46:25 - INFO - __main__ - Step 138628: {'lr': 7.249801633746666e-06, 'samples': 26616576, 'steps': 138627, 'loss/train': 1.4787458181381226} 11/07/2021 16:46:25 - INFO - __main__ - Step 138629: {'lr': 7.248532971834143e-06, 'samples': 26616768, 'steps': 138628, 'loss/train': 1.8182889223098755} 11/07/2021 16:46:26 - INFO - __main__ - Step 138630: {'lr': 7.24726441930168e-06, 'samples': 26616960, 'steps': 138629, 'loss/train': 1.6288269758224487} 11/07/2021 16:46:26 - INFO - __main__ - Step 138631: {'lr': 7.245995976149861e-06, 'samples': 26617152, 'steps': 138630, 'loss/train': 1.1163573265075684} 11/07/2021 16:46:27 - INFO - __main__ - Step 138632: {'lr': 7.244727642379267e-06, 'samples': 26617344, 'steps': 138631, 'loss/train': 0.9607428312301636} 11/07/2021 16:46:27 - INFO - __main__ - Step 138633: {'lr': 7.243459417990428e-06, 'samples': 26617536, 'steps': 138632, 'loss/train': 0.7462661266326904} 11/07/2021 16:46:28 - INFO - __main__ - Step 138634: {'lr': 7.24219130298398e-06, 'samples': 26617728, 'steps': 138633, 'loss/train': 1.2592653036117554} 11/07/2021 16:46:28 - INFO - __main__ - Step 138635: {'lr': 7.240923297360397e-06, 'samples': 26617920, 'steps': 138634, 'loss/train': 1.5515468120574951} 11/07/2021 16:46:28 - INFO - __main__ - Step 138636: {'lr': 7.239655401120343e-06, 'samples': 26618112, 'steps': 138635, 'loss/train': 1.2416746616363525} 11/07/2021 16:46:29 - INFO - __main__ - Step 138637: {'lr': 7.2383876142643465e-06, 'samples': 26618304, 'steps': 138636, 'loss/train': 0.5580950975418091} 11/07/2021 16:46:30 - INFO - __main__ - Step 138638: {'lr': 7.237119936792991e-06, 'samples': 26618496, 'steps': 138637, 'loss/train': 1.525032639503479} 11/07/2021 16:46:30 - INFO - __main__ - Step 138639: {'lr': 7.23585236870683e-06, 'samples': 26618688, 'steps': 138638, 'loss/train': 0.5750301480293274} 11/07/2021 16:46:30 - INFO - __main__ - Step 138640: {'lr': 7.2345849100064475e-06, 'samples': 26618880, 'steps': 138639, 'loss/train': 1.4112858772277832} 11/07/2021 16:46:31 - INFO - __main__ - Step 138641: {'lr': 7.233317560692427e-06, 'samples': 26619072, 'steps': 138640, 'loss/train': 1.0823934078216553} 11/07/2021 16:46:32 - INFO - __main__ - Step 138642: {'lr': 7.232050320765321e-06, 'samples': 26619264, 'steps': 138641, 'loss/train': 0.962265133857727} 11/07/2021 16:46:32 - INFO - __main__ - Step 138643: {'lr': 7.230783190225687e-06, 'samples': 26619456, 'steps': 138642, 'loss/train': 1.4805846214294434} 11/07/2021 16:46:33 - INFO - __main__ - Step 138644: {'lr': 7.229516169074135e-06, 'samples': 26619648, 'steps': 138643, 'loss/train': 1.405085563659668} 11/07/2021 16:46:33 - INFO - __main__ - Step 138645: {'lr': 7.22824925731122e-06, 'samples': 26619840, 'steps': 138644, 'loss/train': 1.2624118328094482} 11/07/2021 16:46:33 - INFO - __main__ - Step 138646: {'lr': 7.2269824549374974e-06, 'samples': 26620032, 'steps': 138645, 'loss/train': 0.4477655291557312} 11/07/2021 16:46:34 - INFO - __main__ - Step 138647: {'lr': 7.225715761953605e-06, 'samples': 26620224, 'steps': 138646, 'loss/train': 1.4424717426300049} 11/07/2021 16:46:35 - INFO - __main__ - Step 138648: {'lr': 7.224449178360015e-06, 'samples': 26620416, 'steps': 138647, 'loss/train': 1.3125348091125488} 11/07/2021 16:46:35 - INFO - __main__ - Step 138649: {'lr': 7.2231827041573386e-06, 'samples': 26620608, 'steps': 138648, 'loss/train': 0.5240727663040161} 11/07/2021 16:46:35 - INFO - __main__ - Step 138650: {'lr': 7.221916339346157e-06, 'samples': 26620800, 'steps': 138649, 'loss/train': 1.306967854499817} 11/07/2021 16:46:36 - INFO - __main__ - Step 138651: {'lr': 7.220650083927027e-06, 'samples': 26620992, 'steps': 138650, 'loss/train': 1.3632166385650635} 11/07/2021 16:46:37 - INFO - __main__ - Step 138652: {'lr': 7.219383937900503e-06, 'samples': 26621184, 'steps': 138651, 'loss/train': 1.278330683708191} 11/07/2021 16:46:37 - INFO - __main__ - Step 138653: {'lr': 7.218117901267224e-06, 'samples': 26621376, 'steps': 138652, 'loss/train': 1.2317848205566406} 11/07/2021 16:46:38 - INFO - __main__ - Step 138654: {'lr': 7.216851974027689e-06, 'samples': 26621568, 'steps': 138653, 'loss/train': 0.790755569934845} 11/07/2021 16:46:38 - INFO - __main__ - Step 138655: {'lr': 7.215586156182508e-06, 'samples': 26621760, 'steps': 138654, 'loss/train': 1.2298911809921265} 11/07/2021 16:46:38 - INFO - __main__ - Step 138656: {'lr': 7.214320447732209e-06, 'samples': 26621952, 'steps': 138655, 'loss/train': 1.2508493661880493} 11/07/2021 16:46:39 - INFO - __main__ - Step 138657: {'lr': 7.213054848677403e-06, 'samples': 26622144, 'steps': 138656, 'loss/train': 0.9953100085258484} 11/07/2021 16:46:40 - INFO - __main__ - Step 138658: {'lr': 7.211789359018672e-06, 'samples': 26622336, 'steps': 138657, 'loss/train': 1.5938361883163452} 11/07/2021 16:46:40 - INFO - __main__ - Step 138659: {'lr': 7.210523978756545e-06, 'samples': 26622528, 'steps': 138658, 'loss/train': 1.421116828918457} 11/07/2021 16:46:40 - INFO - __main__ - Step 138660: {'lr': 7.209258707891603e-06, 'samples': 26622720, 'steps': 138659, 'loss/train': 0.9219837188720703} 11/07/2021 16:46:41 - INFO - __main__ - Step 138661: {'lr': 7.207993546424457e-06, 'samples': 26622912, 'steps': 138660, 'loss/train': 1.1379578113555908} 11/07/2021 16:46:42 - INFO - __main__ - Step 138662: {'lr': 7.2067284943556075e-06, 'samples': 26623104, 'steps': 138661, 'loss/train': 1.3248012065887451} 11/07/2021 16:46:42 - INFO - __main__ - Step 138663: {'lr': 7.205463551685665e-06, 'samples': 26623296, 'steps': 138662, 'loss/train': 1.6024528741836548} 11/07/2021 16:46:43 - INFO - __main__ - Step 138664: {'lr': 7.2041987184152115e-06, 'samples': 26623488, 'steps': 138663, 'loss/train': 0.9214326739311218} 11/07/2021 16:46:43 - INFO - __main__ - Step 138665: {'lr': 7.202933994544775e-06, 'samples': 26623680, 'steps': 138664, 'loss/train': 1.0940892696380615} 11/07/2021 16:46:43 - INFO - __main__ - Step 138666: {'lr': 7.201669380074965e-06, 'samples': 26623872, 'steps': 138665, 'loss/train': 1.4617398977279663} 11/07/2021 16:46:44 - INFO - __main__ - Step 138667: {'lr': 7.200404875006311e-06, 'samples': 26624064, 'steps': 138666, 'loss/train': 0.1831066608428955} 11/07/2021 16:46:45 - INFO - __main__ - Step 138668: {'lr': 7.199140479339422e-06, 'samples': 26624256, 'steps': 138667, 'loss/train': 1.2002607583999634} 11/07/2021 16:46:45 - INFO - __main__ - Step 138669: {'lr': 7.197876193074882e-06, 'samples': 26624448, 'steps': 138668, 'loss/train': 1.0605179071426392} 11/07/2021 16:46:45 - INFO - __main__ - Step 138670: {'lr': 7.196612016213189e-06, 'samples': 26624640, 'steps': 138669, 'loss/train': 1.5716921091079712} 11/07/2021 16:46:46 - INFO - __main__ - Step 138671: {'lr': 7.195347948754982e-06, 'samples': 26624832, 'steps': 138670, 'loss/train': 0.6836035251617432} 11/07/2021 16:46:46 - INFO - __main__ - Step 138672: {'lr': 7.194083990700789e-06, 'samples': 26625024, 'steps': 138671, 'loss/train': 1.4999258518218994} 11/07/2021 16:46:47 - INFO - __main__ - Step 138673: {'lr': 7.1928201420512205e-06, 'samples': 26625216, 'steps': 138672, 'loss/train': 0.046525683254003525} 11/07/2021 16:46:48 - INFO - __main__ - Step 138674: {'lr': 7.191556402806832e-06, 'samples': 26625408, 'steps': 138673, 'loss/train': 0.9954272508621216} 11/07/2021 16:46:48 - INFO - __main__ - Step 138675: {'lr': 7.1902927729681485e-06, 'samples': 26625600, 'steps': 138674, 'loss/train': 1.2003873586654663} 11/07/2021 16:46:48 - INFO - __main__ - Step 138676: {'lr': 7.189029252535784e-06, 'samples': 26625792, 'steps': 138675, 'loss/train': 1.7336599826812744} 11/07/2021 16:46:49 - INFO - __main__ - Step 138677: {'lr': 7.187765841510291e-06, 'samples': 26625984, 'steps': 138676, 'loss/train': 1.0133180618286133} 11/07/2021 16:46:50 - INFO - __main__ - Step 138678: {'lr': 7.186502539892226e-06, 'samples': 26626176, 'steps': 138677, 'loss/train': 1.6108999252319336} 11/07/2021 16:46:50 - INFO - __main__ - Step 138679: {'lr': 7.185239347682199e-06, 'samples': 26626368, 'steps': 138678, 'loss/train': 1.2762705087661743} 11/07/2021 16:46:51 - INFO - __main__ - Step 138680: {'lr': 7.183976264880737e-06, 'samples': 26626560, 'steps': 138679, 'loss/train': 1.4944257736206055} 11/07/2021 16:46:51 - INFO - __main__ - Step 138681: {'lr': 7.182713291488452e-06, 'samples': 26626752, 'steps': 138680, 'loss/train': 1.7234601974487305} 11/07/2021 16:46:51 - INFO - __main__ - Step 138682: {'lr': 7.18145042750587e-06, 'samples': 26626944, 'steps': 138681, 'loss/train': 1.4167816638946533} 11/07/2021 16:46:52 - INFO - __main__ - Step 138683: {'lr': 7.180187672933603e-06, 'samples': 26627136, 'steps': 138682, 'loss/train': 0.625313401222229} 11/07/2021 16:46:53 - INFO - __main__ - Step 138684: {'lr': 7.178925027772177e-06, 'samples': 26627328, 'steps': 138683, 'loss/train': 1.538744568824768} 11/07/2021 16:46:53 - INFO - __main__ - Step 138685: {'lr': 7.177662492022174e-06, 'samples': 26627520, 'steps': 138684, 'loss/train': 1.340307354927063} 11/07/2021 16:46:53 - INFO - __main__ - Step 138686: {'lr': 7.17640006568418e-06, 'samples': 26627712, 'steps': 138685, 'loss/train': 0.09733877331018448} 11/07/2021 16:46:54 - INFO - __main__ - Step 138687: {'lr': 7.175137748758748e-06, 'samples': 26627904, 'steps': 138686, 'loss/train': 1.2997101545333862} 11/07/2021 16:46:54 - INFO - __main__ - Step 138688: {'lr': 7.1738755412464884e-06, 'samples': 26628096, 'steps': 138687, 'loss/train': 2.116103410720825} 11/07/2021 16:46:55 - INFO - __main__ - Step 138689: {'lr': 7.172613443147902e-06, 'samples': 26628288, 'steps': 138688, 'loss/train': 1.4954019784927368} 11/07/2021 16:46:56 - INFO - __main__ - Step 138690: {'lr': 7.171351454463598e-06, 'samples': 26628480, 'steps': 138689, 'loss/train': 1.3089160919189453} 11/07/2021 16:46:56 - INFO - __main__ - Step 138691: {'lr': 7.170089575194133e-06, 'samples': 26628672, 'steps': 138690, 'loss/train': 0.27208980917930603} 11/07/2021 16:46:56 - INFO - __main__ - Step 138692: {'lr': 7.1688278053400615e-06, 'samples': 26628864, 'steps': 138691, 'loss/train': 1.5109418630599976} 11/07/2021 16:46:57 - INFO - __main__ - Step 138693: {'lr': 7.167566144901993e-06, 'samples': 26629056, 'steps': 138692, 'loss/train': 1.4158192873001099} 11/07/2021 16:46:58 - INFO - __main__ - Step 138694: {'lr': 7.166304593880457e-06, 'samples': 26629248, 'steps': 138693, 'loss/train': 0.9677733778953552} 11/07/2021 16:46:58 - INFO - __main__ - Step 138695: {'lr': 7.165043152276035e-06, 'samples': 26629440, 'steps': 138694, 'loss/train': 1.2162625789642334} 11/07/2021 16:46:59 - INFO - __main__ - Step 138696: {'lr': 7.16378182008931e-06, 'samples': 26629632, 'steps': 138695, 'loss/train': 1.5671175718307495} 11/07/2021 16:46:59 - INFO - __main__ - Step 138697: {'lr': 7.16252059732081e-06, 'samples': 26629824, 'steps': 138696, 'loss/train': 1.3993946313858032} 11/07/2021 16:46:59 - INFO - __main__ - Step 138698: {'lr': 7.161259483971172e-06, 'samples': 26630016, 'steps': 138697, 'loss/train': 1.3886088132858276} 11/07/2021 16:47:00 - INFO - __main__ - Step 138699: {'lr': 7.159998480040897e-06, 'samples': 26630208, 'steps': 138698, 'loss/train': 1.1038174629211426} 11/07/2021 16:47:01 - INFO - __main__ - Step 138700: {'lr': 7.158737585530567e-06, 'samples': 26630400, 'steps': 138699, 'loss/train': 1.2381078004837036} 11/07/2021 16:47:01 - INFO - __main__ - Step 138701: {'lr': 7.157476800440821e-06, 'samples': 26630592, 'steps': 138700, 'loss/train': 1.4566569328308105} 11/07/2021 16:47:01 - INFO - __main__ - Step 138702: {'lr': 7.1562161247721305e-06, 'samples': 26630784, 'steps': 138701, 'loss/train': 1.2725856304168701} 11/07/2021 16:47:02 - INFO - __main__ - Step 138703: {'lr': 7.154955558525078e-06, 'samples': 26630976, 'steps': 138702, 'loss/train': 1.5253950357437134} 11/07/2021 16:47:03 - INFO - __main__ - Step 138704: {'lr': 7.153695101700275e-06, 'samples': 26631168, 'steps': 138703, 'loss/train': 1.4247510433197021} 11/07/2021 16:47:04 - INFO - __main__ - Step 138705: {'lr': 7.152434754298276e-06, 'samples': 26631360, 'steps': 138704, 'loss/train': 1.3612397909164429} 11/07/2021 16:47:04 - INFO - __main__ - Step 138706: {'lr': 7.151174516319636e-06, 'samples': 26631552, 'steps': 138705, 'loss/train': 1.7327581644058228} 11/07/2021 16:47:05 - INFO - __main__ - Step 138707: {'lr': 7.149914387764938e-06, 'samples': 26631744, 'steps': 138706, 'loss/train': 1.1912058591842651} 11/07/2021 16:47:05 - INFO - __main__ - Step 138708: {'lr': 7.148654368634738e-06, 'samples': 26631936, 'steps': 138707, 'loss/train': 1.3557097911834717} 11/07/2021 16:47:05 - INFO - __main__ - Step 138709: {'lr': 7.14739445892959e-06, 'samples': 26632128, 'steps': 138708, 'loss/train': 1.3870213031768799} 11/07/2021 16:47:06 - INFO - __main__ - Step 138710: {'lr': 7.146134658650106e-06, 'samples': 26632320, 'steps': 138709, 'loss/train': 1.3877248764038086} 11/07/2021 16:47:07 - INFO - __main__ - Step 138711: {'lr': 7.14487496779681e-06, 'samples': 26632512, 'steps': 138710, 'loss/train': 0.7028882503509521} 11/07/2021 16:47:07 - INFO - __main__ - Step 138712: {'lr': 7.143615386370289e-06, 'samples': 26632704, 'steps': 138711, 'loss/train': 0.437483012676239} 11/07/2021 16:47:08 - INFO - __main__ - Step 138713: {'lr': 7.142355914371096e-06, 'samples': 26632896, 'steps': 138712, 'loss/train': 1.0363545417785645} 11/07/2021 16:47:08 - INFO - __main__ - Step 138714: {'lr': 7.141096551799814e-06, 'samples': 26633088, 'steps': 138713, 'loss/train': 0.28070810437202454} 11/07/2021 16:47:08 - INFO - __main__ - Step 138715: {'lr': 7.139837298657054e-06, 'samples': 26633280, 'steps': 138714, 'loss/train': 1.0304200649261475} 11/07/2021 16:47:09 - INFO - __main__ - Step 138716: {'lr': 7.138578154943287e-06, 'samples': 26633472, 'steps': 138715, 'loss/train': 2.808852434158325} 11/07/2021 16:47:10 - INFO - __main__ - Step 138717: {'lr': 7.137319120659125e-06, 'samples': 26633664, 'steps': 138716, 'loss/train': 1.2387648820877075} 11/07/2021 16:47:10 - INFO - __main__ - Step 138718: {'lr': 7.13606019580515e-06, 'samples': 26633856, 'steps': 138717, 'loss/train': 0.9833104014396667} 11/07/2021 16:47:10 - INFO - __main__ - Step 138719: {'lr': 7.134801380381916e-06, 'samples': 26634048, 'steps': 138718, 'loss/train': 1.0094668865203857} 11/07/2021 16:47:11 - INFO - __main__ - Step 138720: {'lr': 7.133542674390009e-06, 'samples': 26634240, 'steps': 138719, 'loss/train': 1.0690064430236816} 11/07/2021 16:47:12 - INFO - __main__ - Step 138721: {'lr': 7.132284077829953e-06, 'samples': 26634432, 'steps': 138720, 'loss/train': 1.6302542686462402} 11/07/2021 16:47:12 - INFO - __main__ - Step 138722: {'lr': 7.131025590702361e-06, 'samples': 26634624, 'steps': 138721, 'loss/train': 1.4587433338165283} 11/07/2021 16:47:13 - INFO - __main__ - Step 138723: {'lr': 7.129767213007787e-06, 'samples': 26634816, 'steps': 138722, 'loss/train': 1.0655772686004639} 11/07/2021 16:47:13 - INFO - __main__ - Step 138724: {'lr': 7.1285089447467865e-06, 'samples': 26635008, 'steps': 138723, 'loss/train': 1.0370748043060303} 11/07/2021 16:47:13 - INFO - __main__ - Step 138725: {'lr': 7.127250785919914e-06, 'samples': 26635200, 'steps': 138724, 'loss/train': 1.6643034219741821} 11/07/2021 16:47:14 - INFO - __main__ - Step 138726: {'lr': 7.125992736527753e-06, 'samples': 26635392, 'steps': 138725, 'loss/train': 1.2555245161056519} 11/07/2021 16:47:15 - INFO - __main__ - Step 138727: {'lr': 7.124734796570887e-06, 'samples': 26635584, 'steps': 138726, 'loss/train': 1.4723252058029175} 11/07/2021 16:47:15 - INFO - __main__ - Step 138728: {'lr': 7.123476966049896e-06, 'samples': 26635776, 'steps': 138727, 'loss/train': 0.9908653497695923} 11/07/2021 16:47:15 - INFO - __main__ - Step 138729: {'lr': 7.122219244965311e-06, 'samples': 26635968, 'steps': 138728, 'loss/train': 1.0003150701522827} 11/07/2021 16:47:16 - INFO - __main__ - Step 138730: {'lr': 7.120961633317685e-06, 'samples': 26636160, 'steps': 138729, 'loss/train': 1.5000405311584473} 11/07/2021 16:47:17 - INFO - __main__ - Step 138731: {'lr': 7.119704131107602e-06, 'samples': 26636352, 'steps': 138730, 'loss/train': 1.2717233896255493} 11/07/2021 16:47:17 - INFO - __main__ - Step 138732: {'lr': 7.118446738335616e-06, 'samples': 26636544, 'steps': 138731, 'loss/train': 1.1772336959838867} 11/07/2021 16:47:17 - INFO - __main__ - Step 138733: {'lr': 7.117189455002338e-06, 'samples': 26636736, 'steps': 138732, 'loss/train': 0.9520431756973267} 11/07/2021 16:47:18 - INFO - __main__ - Step 138734: {'lr': 7.115932281108295e-06, 'samples': 26636928, 'steps': 138733, 'loss/train': 0.6053853034973145} 11/07/2021 16:47:18 - INFO - __main__ - Step 138735: {'lr': 7.114675216654071e-06, 'samples': 26637120, 'steps': 138734, 'loss/train': 1.4675180912017822} 11/07/2021 16:47:19 - INFO - __main__ - Step 138736: {'lr': 7.11341826164022e-06, 'samples': 26637312, 'steps': 138735, 'loss/train': 0.22384923696517944} 11/07/2021 16:47:20 - INFO - __main__ - Step 138737: {'lr': 7.112161416067326e-06, 'samples': 26637504, 'steps': 138736, 'loss/train': 0.9443438649177551} 11/07/2021 16:47:20 - INFO - __main__ - Step 138738: {'lr': 7.110904679935942e-06, 'samples': 26637696, 'steps': 138737, 'loss/train': 0.6756911873817444} 11/07/2021 16:47:20 - INFO - __main__ - Step 138739: {'lr': 7.109648053246626e-06, 'samples': 26637888, 'steps': 138738, 'loss/train': 1.1906448602676392} 11/07/2021 16:47:21 - INFO - __main__ - Step 138740: {'lr': 7.108391535999959e-06, 'samples': 26638080, 'steps': 138739, 'loss/train': 0.9472803473472595} 11/07/2021 16:47:21 - INFO - __main__ - Step 138741: {'lr': 7.107135128196496e-06, 'samples': 26638272, 'steps': 138740, 'loss/train': 1.5276484489440918} 11/07/2021 16:47:22 - INFO - __main__ - Step 138742: {'lr': 7.105878829836848e-06, 'samples': 26638464, 'steps': 138741, 'loss/train': 1.2449954748153687} 11/07/2021 16:47:22 - INFO - __main__ - Step 138743: {'lr': 7.104622640921516e-06, 'samples': 26638656, 'steps': 138742, 'loss/train': 1.1118417978286743} 11/07/2021 16:47:23 - INFO - __main__ - Step 138744: {'lr': 7.10336656145108e-06, 'samples': 26638848, 'steps': 138743, 'loss/train': 1.507412075996399} 11/07/2021 16:47:23 - INFO - __main__ - Step 138745: {'lr': 7.1021105914261255e-06, 'samples': 26639040, 'steps': 138744, 'loss/train': 1.2834138870239258} 11/07/2021 16:47:23 - INFO - __main__ - Step 138746: {'lr': 7.100854730847234e-06, 'samples': 26639232, 'steps': 138745, 'loss/train': 1.172605276107788} 11/07/2021 16:47:24 - INFO - __main__ - Step 138747: {'lr': 7.0995989797149055e-06, 'samples': 26639424, 'steps': 138746, 'loss/train': 1.3408360481262207} 11/07/2021 16:47:25 - INFO - __main__ - Step 138748: {'lr': 7.098343338029778e-06, 'samples': 26639616, 'steps': 138747, 'loss/train': 0.9358780384063721} 11/07/2021 16:47:25 - INFO - __main__ - Step 138749: {'lr': 7.09708780579238e-06, 'samples': 26639808, 'steps': 138748, 'loss/train': 0.8246721625328064} 11/07/2021 16:47:25 - INFO - __main__ - Step 138750: {'lr': 7.0958323830032926e-06, 'samples': 26640000, 'steps': 138749, 'loss/train': 0.8080664277076721} 11/07/2021 16:47:26 - INFO - __main__ - Step 138751: {'lr': 7.094577069663072e-06, 'samples': 26640192, 'steps': 138750, 'loss/train': 1.394027590751648} 11/07/2021 16:47:27 - INFO - __main__ - Step 138752: {'lr': 7.093321865772301e-06, 'samples': 26640384, 'steps': 138751, 'loss/train': 0.7801473736763} 11/07/2021 16:47:27 - INFO - __main__ - Step 138753: {'lr': 7.092066771331507e-06, 'samples': 26640576, 'steps': 138752, 'loss/train': 1.722522497177124} 11/07/2021 16:47:28 - INFO - __main__ - Step 138754: {'lr': 7.0908117863413e-06, 'samples': 26640768, 'steps': 138753, 'loss/train': 1.6586322784423828} 11/07/2021 16:47:28 - INFO - __main__ - Step 138755: {'lr': 7.089556910802236e-06, 'samples': 26640960, 'steps': 138754, 'loss/train': 1.1851946115493774} 11/07/2021 16:47:28 - INFO - __main__ - Step 138756: {'lr': 7.088302144714842e-06, 'samples': 26641152, 'steps': 138755, 'loss/train': 1.3818327188491821} 11/07/2021 16:47:29 - INFO - __main__ - Step 138757: {'lr': 7.087047488079729e-06, 'samples': 26641344, 'steps': 138756, 'loss/train': 1.5979200601577759} 11/07/2021 16:47:30 - INFO - __main__ - Step 138758: {'lr': 7.085792940897423e-06, 'samples': 26641536, 'steps': 138757, 'loss/train': 1.1496847867965698} 11/07/2021 16:47:30 - INFO - __main__ - Step 138759: {'lr': 7.084538503168508e-06, 'samples': 26641728, 'steps': 138758, 'loss/train': 1.1483781337738037} 11/07/2021 16:47:31 - INFO - __main__ - Step 138760: {'lr': 7.083284174893567e-06, 'samples': 26641920, 'steps': 138759, 'loss/train': 1.3986599445343018} 11/07/2021 16:47:31 - INFO - __main__ - Step 138761: {'lr': 7.082029956073155e-06, 'samples': 26642112, 'steps': 138760, 'loss/train': 1.0062482357025146} 11/07/2021 16:47:31 - INFO - __main__ - Step 138762: {'lr': 7.080775846707826e-06, 'samples': 26642304, 'steps': 138761, 'loss/train': 1.4975039958953857} 11/07/2021 16:47:32 - INFO - __main__ - Step 138763: {'lr': 7.079521846798137e-06, 'samples': 26642496, 'steps': 138762, 'loss/train': 0.9204018115997314} 11/07/2021 16:47:33 - INFO - __main__ - Step 138764: {'lr': 7.0782679563446694e-06, 'samples': 26642688, 'steps': 138763, 'loss/train': 0.8975769281387329} 11/07/2021 16:47:33 - INFO - __main__ - Step 138765: {'lr': 7.0770141753480065e-06, 'samples': 26642880, 'steps': 138764, 'loss/train': 0.8203284740447998} 11/07/2021 16:47:33 - INFO - __main__ - Step 138766: {'lr': 7.0757605038086755e-06, 'samples': 26643072, 'steps': 138765, 'loss/train': 1.099849820137024} 11/07/2021 16:47:34 - INFO - __main__ - Step 138767: {'lr': 7.07450694172726e-06, 'samples': 26643264, 'steps': 138766, 'loss/train': 0.6879099011421204} 11/07/2021 16:47:35 - INFO - __main__ - Step 138768: {'lr': 7.0732534891043424e-06, 'samples': 26643456, 'steps': 138767, 'loss/train': 1.9455629587173462} 11/07/2021 16:47:35 - INFO - __main__ - Step 138769: {'lr': 7.072000145940449e-06, 'samples': 26643648, 'steps': 138768, 'loss/train': 1.4928317070007324} 11/07/2021 16:47:35 - INFO - __main__ - Step 138770: {'lr': 7.0707469122361645e-06, 'samples': 26643840, 'steps': 138769, 'loss/train': 1.101669192314148} 11/07/2021 16:47:36 - INFO - __main__ - Step 138771: {'lr': 7.069493787992071e-06, 'samples': 26644032, 'steps': 138770, 'loss/train': 1.4303412437438965} 11/07/2021 16:47:36 - INFO - __main__ - Step 138772: {'lr': 7.068240773208695e-06, 'samples': 26644224, 'steps': 138771, 'loss/train': 1.1098355054855347} 11/07/2021 16:47:37 - INFO - __main__ - Step 138773: {'lr': 7.06698786788662e-06, 'samples': 26644416, 'steps': 138772, 'loss/train': 1.445162296295166} 11/07/2021 16:47:38 - INFO - __main__ - Step 138774: {'lr': 7.065735072026402e-06, 'samples': 26644608, 'steps': 138773, 'loss/train': 1.2410320043563843} 11/07/2021 16:47:38 - INFO - __main__ - Step 138775: {'lr': 7.064482385628651e-06, 'samples': 26644800, 'steps': 138774, 'loss/train': 2.622772693634033} 11/07/2021 16:47:38 - INFO - __main__ - Step 138776: {'lr': 7.063229808693867e-06, 'samples': 26644992, 'steps': 138775, 'loss/train': 0.18586891889572144} 11/07/2021 16:47:39 - INFO - __main__ - Step 138777: {'lr': 7.061977341222631e-06, 'samples': 26645184, 'steps': 138776, 'loss/train': 1.036202311515808} 11/07/2021 16:47:39 - INFO - __main__ - Step 138778: {'lr': 7.060724983215555e-06, 'samples': 26645376, 'steps': 138777, 'loss/train': 1.4370747804641724} 11/07/2021 16:47:40 - INFO - __main__ - Step 138779: {'lr': 7.059472734673139e-06, 'samples': 26645568, 'steps': 138778, 'loss/train': 1.6685197353363037} 11/07/2021 16:47:40 - INFO - __main__ - Step 138780: {'lr': 7.0582205955959934e-06, 'samples': 26645760, 'steps': 138779, 'loss/train': 1.1437159776687622} 11/07/2021 16:47:41 - INFO - __main__ - Step 138781: {'lr': 7.056968565984645e-06, 'samples': 26645952, 'steps': 138780, 'loss/train': 1.0892233848571777} 11/07/2021 16:47:41 - INFO - __main__ - Step 138782: {'lr': 7.0557166458397326e-06, 'samples': 26646144, 'steps': 138781, 'loss/train': 1.3220258951187134} 11/07/2021 16:47:42 - INFO - __main__ - Step 138783: {'lr': 7.054464835161728e-06, 'samples': 26646336, 'steps': 138782, 'loss/train': 1.3397976160049438} 11/07/2021 16:47:43 - INFO - __main__ - Step 138784: {'lr': 7.053213133951214e-06, 'samples': 26646528, 'steps': 138783, 'loss/train': 1.164280891418457} 11/07/2021 16:47:43 - INFO - __main__ - Step 138785: {'lr': 7.051961542208801e-06, 'samples': 26646720, 'steps': 138784, 'loss/train': 1.3099275827407837} 11/07/2021 16:47:43 - INFO - __main__ - Step 138786: {'lr': 7.050710059934989e-06, 'samples': 26646912, 'steps': 138785, 'loss/train': 1.3177871704101562} 11/07/2021 16:47:44 - INFO - __main__ - Step 138787: {'lr': 7.0494586871304166e-06, 'samples': 26647104, 'steps': 138786, 'loss/train': 1.1701563596725464} 11/07/2021 16:47:44 - INFO - __main__ - Step 138788: {'lr': 7.0482074237955825e-06, 'samples': 26647296, 'steps': 138787, 'loss/train': 0.668626606464386} 11/07/2021 16:47:45 - INFO - __main__ - Step 138789: {'lr': 7.0469562699310985e-06, 'samples': 26647488, 'steps': 138788, 'loss/train': 1.0628623962402344} 11/07/2021 16:47:46 - INFO - __main__ - Step 138790: {'lr': 7.045705225537491e-06, 'samples': 26647680, 'steps': 138789, 'loss/train': 1.2593348026275635} 11/07/2021 16:47:46 - INFO - __main__ - Step 138791: {'lr': 7.044454290615343e-06, 'samples': 26647872, 'steps': 138790, 'loss/train': 1.1008782386779785} 11/07/2021 16:47:46 - INFO - __main__ - Step 138792: {'lr': 7.04320346516521e-06, 'samples': 26648064, 'steps': 138791, 'loss/train': 0.8878816962242126} 11/07/2021 16:47:47 - INFO - __main__ - Step 138793: {'lr': 7.041952749187675e-06, 'samples': 26648256, 'steps': 138792, 'loss/train': 1.359113097190857} 11/07/2021 16:47:48 - INFO - __main__ - Step 138794: {'lr': 7.040702142683292e-06, 'samples': 26648448, 'steps': 138793, 'loss/train': 1.784274935722351} 11/07/2021 16:47:48 - INFO - __main__ - Step 138795: {'lr': 7.039451645652617e-06, 'samples': 26648640, 'steps': 138794, 'loss/train': 1.0892972946166992} 11/07/2021 16:47:48 - INFO - __main__ - Step 138796: {'lr': 7.038201258096205e-06, 'samples': 26648832, 'steps': 138795, 'loss/train': 1.2197487354278564} 11/07/2021 16:47:49 - INFO - __main__ - Step 138797: {'lr': 7.0369509800146395e-06, 'samples': 26649024, 'steps': 138796, 'loss/train': 1.2303940057754517} 11/07/2021 16:47:49 - INFO - __main__ - Step 138798: {'lr': 7.035700811408474e-06, 'samples': 26649216, 'steps': 138797, 'loss/train': 1.2214213609695435} 11/07/2021 16:47:50 - INFO - __main__ - Step 138799: {'lr': 7.034450752278265e-06, 'samples': 26649408, 'steps': 138798, 'loss/train': 1.1540184020996094} 11/07/2021 16:47:51 - INFO - __main__ - Step 138800: {'lr': 7.033200802624567e-06, 'samples': 26649600, 'steps': 138799, 'loss/train': 1.723486065864563} 11/07/2021 16:47:51 - INFO - __main__ - Step 138801: {'lr': 7.031950962447992e-06, 'samples': 26649792, 'steps': 138800, 'loss/train': 0.6443910598754883} 11/07/2021 16:47:51 - INFO - __main__ - Step 138802: {'lr': 7.030701231749037e-06, 'samples': 26649984, 'steps': 138801, 'loss/train': 1.3099956512451172} 11/07/2021 16:47:52 - INFO - __main__ - Step 138803: {'lr': 7.0294516105283146e-06, 'samples': 26650176, 'steps': 138802, 'loss/train': 1.055010199546814} 11/07/2021 16:47:53 - INFO - __main__ - Step 138804: {'lr': 7.028202098786379e-06, 'samples': 26650368, 'steps': 138803, 'loss/train': 1.2181118726730347} 11/07/2021 16:47:53 - INFO - __main__ - Step 138805: {'lr': 7.026952696523786e-06, 'samples': 26650560, 'steps': 138804, 'loss/train': 1.528554081916809} 11/07/2021 16:47:54 - INFO - __main__ - Step 138806: {'lr': 7.02570340374109e-06, 'samples': 26650752, 'steps': 138805, 'loss/train': 1.210053563117981} 11/07/2021 16:47:54 - INFO - __main__ - Step 138807: {'lr': 7.024454220438875e-06, 'samples': 26650944, 'steps': 138806, 'loss/train': 1.107251763343811} 11/07/2021 16:47:54 - INFO - __main__ - Step 138808: {'lr': 7.023205146617667e-06, 'samples': 26651136, 'steps': 138807, 'loss/train': 0.21393784880638123} 11/07/2021 16:47:55 - INFO - __main__ - Step 138809: {'lr': 7.021956182278105e-06, 'samples': 26651328, 'steps': 138808, 'loss/train': 1.4329805374145508} 11/07/2021 16:47:56 - INFO - __main__ - Step 138810: {'lr': 7.0207073274206615e-06, 'samples': 26651520, 'steps': 138809, 'loss/train': 1.2896240949630737} 11/07/2021 16:47:56 - INFO - __main__ - Step 138811: {'lr': 7.019458582045946e-06, 'samples': 26651712, 'steps': 138810, 'loss/train': 1.0871599912643433} 11/07/2021 16:47:56 - INFO - __main__ - Step 138812: {'lr': 7.0182099461545135e-06, 'samples': 26651904, 'steps': 138811, 'loss/train': 1.0326876640319824} 11/07/2021 16:47:57 - INFO - __main__ - Step 138813: {'lr': 7.01696141974692e-06, 'samples': 26652096, 'steps': 138812, 'loss/train': 1.1263524293899536} 11/07/2021 16:47:58 - INFO - __main__ - Step 138814: {'lr': 7.01571300282372e-06, 'samples': 26652288, 'steps': 138813, 'loss/train': 0.3955998122692108} 11/07/2021 16:47:58 - INFO - __main__ - Step 138815: {'lr': 7.014464695385525e-06, 'samples': 26652480, 'steps': 138814, 'loss/train': 1.0573513507843018} 11/07/2021 16:47:59 - INFO - __main__ - Step 138816: {'lr': 7.013216497432834e-06, 'samples': 26652672, 'steps': 138815, 'loss/train': 1.39877450466156} 11/07/2021 16:47:59 - INFO - __main__ - Step 138817: {'lr': 7.011968408966258e-06, 'samples': 26652864, 'steps': 138816, 'loss/train': 1.454489827156067} 11/07/2021 16:47:59 - INFO - __main__ - Step 138818: {'lr': 7.010720429986322e-06, 'samples': 26653056, 'steps': 138817, 'loss/train': 0.9759959578514099} 11/07/2021 16:48:00 - INFO - __main__ - Step 138819: {'lr': 7.009472560493613e-06, 'samples': 26653248, 'steps': 138818, 'loss/train': 1.323804497718811} 11/07/2021 16:48:01 - INFO - __main__ - Step 138820: {'lr': 7.008224800488683e-06, 'samples': 26653440, 'steps': 138819, 'loss/train': 1.058984637260437} 11/07/2021 16:48:01 - INFO - __main__ - Step 138821: {'lr': 7.006977149972088e-06, 'samples': 26653632, 'steps': 138820, 'loss/train': 1.4989750385284424} 11/07/2021 16:48:01 - INFO - __main__ - Step 138822: {'lr': 7.005729608944439e-06, 'samples': 26653824, 'steps': 138821, 'loss/train': 1.2065538167953491} 11/07/2021 16:48:02 - INFO - __main__ - Step 138823: {'lr': 7.004482177406235e-06, 'samples': 26654016, 'steps': 138822, 'loss/train': 1.6988825798034668} 11/07/2021 16:48:02 - INFO - __main__ - Step 138824: {'lr': 7.0032348553580595e-06, 'samples': 26654208, 'steps': 138823, 'loss/train': 1.4939452409744263} 11/07/2021 16:48:03 - INFO - __main__ - Step 138825: {'lr': 7.001987642800467e-06, 'samples': 26654400, 'steps': 138824, 'loss/train': 1.3770967721939087} 11/07/2021 16:48:04 - INFO - __main__ - Step 138826: {'lr': 7.000740539734041e-06, 'samples': 26654592, 'steps': 138825, 'loss/train': 1.4185088872909546} 11/07/2021 16:48:04 - INFO - __main__ - Step 138827: {'lr': 6.999493546159336e-06, 'samples': 26654784, 'steps': 138826, 'loss/train': 1.1741939783096313} 11/07/2021 16:48:04 - INFO - __main__ - Step 138828: {'lr': 6.998246662076907e-06, 'samples': 26654976, 'steps': 138827, 'loss/train': 0.6781278848648071} 11/07/2021 16:48:05 - INFO - __main__ - Step 138829: {'lr': 6.99699988748731e-06, 'samples': 26655168, 'steps': 138828, 'loss/train': 1.5787017345428467} 11/07/2021 16:48:06 - INFO - __main__ - Step 138830: {'lr': 6.995753222391099e-06, 'samples': 26655360, 'steps': 138829, 'loss/train': 1.5287972688674927} 11/07/2021 16:48:06 - INFO - __main__ - Step 138831: {'lr': 6.994506666788886e-06, 'samples': 26655552, 'steps': 138830, 'loss/train': 1.4040766954421997} 11/07/2021 16:48:06 - INFO - __main__ - Step 138832: {'lr': 6.993260220681169e-06, 'samples': 26655744, 'steps': 138831, 'loss/train': 1.122201681137085} 11/07/2021 16:48:07 - INFO - __main__ - Step 138833: {'lr': 6.99201388406856e-06, 'samples': 26655936, 'steps': 138832, 'loss/train': 1.272420048713684} 11/07/2021 16:48:07 - INFO - __main__ - Step 138834: {'lr': 6.990767656951585e-06, 'samples': 26656128, 'steps': 138833, 'loss/train': 1.0441875457763672} 11/07/2021 16:48:08 - INFO - __main__ - Step 138835: {'lr': 6.989521539330829e-06, 'samples': 26656320, 'steps': 138834, 'loss/train': 1.3315924406051636} 11/07/2021 16:48:08 - INFO - __main__ - Step 138836: {'lr': 6.9882755312068725e-06, 'samples': 26656512, 'steps': 138835, 'loss/train': 1.078637719154358} 11/07/2021 16:48:09 - INFO - __main__ - Step 138837: {'lr': 6.987029632580216e-06, 'samples': 26656704, 'steps': 138836, 'loss/train': 0.7104277610778809} 11/07/2021 16:48:09 - INFO - __main__ - Step 138838: {'lr': 6.985783843451471e-06, 'samples': 26656896, 'steps': 138837, 'loss/train': 1.177881121635437} 11/07/2021 16:48:09 - INFO - __main__ - Step 138839: {'lr': 6.984538163821164e-06, 'samples': 26657088, 'steps': 138838, 'loss/train': 1.2418705224990845} 11/07/2021 16:48:11 - INFO - __main__ - Step 138840: {'lr': 6.983292593689877e-06, 'samples': 26657280, 'steps': 138839, 'loss/train': 1.6305112838745117} 11/07/2021 16:48:11 - INFO - __main__ - Step 138841: {'lr': 6.982047133058167e-06, 'samples': 26657472, 'steps': 138840, 'loss/train': 1.6846773624420166} 11/07/2021 16:48:11 - INFO - __main__ - Step 138842: {'lr': 6.980801781926616e-06, 'samples': 26657664, 'steps': 138841, 'loss/train': 1.311991572380066} 11/07/2021 16:48:12 - INFO - __main__ - Step 138843: {'lr': 6.97955654029575e-06, 'samples': 26657856, 'steps': 138842, 'loss/train': 1.138973355293274} 11/07/2021 16:48:12 - INFO - __main__ - Step 138844: {'lr': 6.978311408166127e-06, 'samples': 26658048, 'steps': 138843, 'loss/train': 1.184097409248352} 11/07/2021 16:48:12 - INFO - __main__ - Step 138845: {'lr': 6.9770663855383555e-06, 'samples': 26658240, 'steps': 138844, 'loss/train': 1.0071457624435425} 11/07/2021 16:48:14 - INFO - __main__ - Step 138846: {'lr': 6.9758214724129634e-06, 'samples': 26658432, 'steps': 138845, 'loss/train': 1.4668267965316772} 11/07/2021 16:48:14 - INFO - __main__ - Step 138847: {'lr': 6.974576668790505e-06, 'samples': 26658624, 'steps': 138846, 'loss/train': 1.173463225364685} 11/07/2021 16:48:14 - INFO - __main__ - Step 138848: {'lr': 6.973331974671593e-06, 'samples': 26658816, 'steps': 138847, 'loss/train': 1.5467681884765625} 11/07/2021 16:48:15 - INFO - __main__ - Step 138849: {'lr': 6.972087390056697e-06, 'samples': 26659008, 'steps': 138848, 'loss/train': 1.1501810550689697} 11/07/2021 16:48:15 - INFO - __main__ - Step 138850: {'lr': 6.970842914946457e-06, 'samples': 26659200, 'steps': 138849, 'loss/train': 1.2991141080856323} 11/07/2021 16:48:16 - INFO - __main__ - Step 138851: {'lr': 6.969598549341372e-06, 'samples': 26659392, 'steps': 138850, 'loss/train': 1.6597578525543213} 11/07/2021 16:48:17 - INFO - __main__ - Step 138852: {'lr': 6.968354293242052e-06, 'samples': 26659584, 'steps': 138851, 'loss/train': 1.6087592840194702} 11/07/2021 16:48:17 - INFO - __main__ - Step 138853: {'lr': 6.9671101466490525e-06, 'samples': 26659776, 'steps': 138852, 'loss/train': 0.767359733581543} 11/07/2021 16:48:17 - INFO - __main__ - Step 138854: {'lr': 6.965866109562929e-06, 'samples': 26659968, 'steps': 138853, 'loss/train': 0.63294517993927} 11/07/2021 16:48:18 - INFO - __main__ - Step 138855: {'lr': 6.964622181984209e-06, 'samples': 26660160, 'steps': 138854, 'loss/train': 1.3319473266601562} 11/07/2021 16:48:19 - INFO - __main__ - Step 138856: {'lr': 6.963378363913503e-06, 'samples': 26660352, 'steps': 138855, 'loss/train': 1.44522225856781} 11/07/2021 16:48:19 - INFO - __main__ - Step 138857: {'lr': 6.962134655351337e-06, 'samples': 26660544, 'steps': 138856, 'loss/train': 1.1923553943634033} 11/07/2021 16:48:19 - INFO - __main__ - Step 138858: {'lr': 6.9608910562982686e-06, 'samples': 26660736, 'steps': 138857, 'loss/train': 0.04025191813707352} 11/07/2021 16:48:20 - INFO - __main__ - Step 138859: {'lr': 6.959647566754934e-06, 'samples': 26660928, 'steps': 138858, 'loss/train': 1.3122633695602417} 11/07/2021 16:48:20 - INFO - __main__ - Step 138860: {'lr': 6.958404186721779e-06, 'samples': 26661120, 'steps': 138859, 'loss/train': 1.1839289665222168} 11/07/2021 16:48:22 - INFO - __main__ - Step 138861: {'lr': 6.957160916199412e-06, 'samples': 26661312, 'steps': 138860, 'loss/train': 1.896193504333496} 11/07/2021 16:48:22 - INFO - __main__ - Step 138862: {'lr': 6.955917755188418e-06, 'samples': 26661504, 'steps': 138861, 'loss/train': 0.9409072995185852} 11/07/2021 16:48:22 - INFO - __main__ - Step 138863: {'lr': 6.954674703689323e-06, 'samples': 26661696, 'steps': 138862, 'loss/train': 0.25685933232307434} 11/07/2021 16:48:23 - INFO - __main__ - Step 138864: {'lr': 6.953431761702711e-06, 'samples': 26661888, 'steps': 138863, 'loss/train': 1.3062798976898193} 11/07/2021 16:48:23 - INFO - __main__ - Step 138865: {'lr': 6.952188929229136e-06, 'samples': 26662080, 'steps': 138864, 'loss/train': 0.2346845120191574} 11/07/2021 16:48:24 - INFO - __main__ - Step 138866: {'lr': 6.950946206269127e-06, 'samples': 26662272, 'steps': 138865, 'loss/train': 1.0291643142700195} 11/07/2021 16:48:24 - INFO - __main__ - Step 138867: {'lr': 6.949703592823292e-06, 'samples': 26662464, 'steps': 138866, 'loss/train': 0.925878643989563} 11/07/2021 16:48:25 - INFO - __main__ - Step 138868: {'lr': 6.948461088892188e-06, 'samples': 26662656, 'steps': 138867, 'loss/train': 1.3625062704086304} 11/07/2021 16:48:25 - INFO - __main__ - Step 138869: {'lr': 6.947218694476315e-06, 'samples': 26662848, 'steps': 138868, 'loss/train': 0.9926415681838989} 11/07/2021 16:48:25 - INFO - __main__ - Step 138870: {'lr': 6.945976409576338e-06, 'samples': 26663040, 'steps': 138869, 'loss/train': 1.123972773551941} 11/07/2021 16:48:26 - INFO - __main__ - Step 138871: {'lr': 6.944734234192701e-06, 'samples': 26663232, 'steps': 138870, 'loss/train': 1.4378949403762817} 11/07/2021 16:48:27 - INFO - __main__ - Step 138872: {'lr': 6.943492168326043e-06, 'samples': 26663424, 'steps': 138871, 'loss/train': 0.6751130223274231} 11/07/2021 16:48:27 - INFO - __main__ - Step 138873: {'lr': 6.942250211976864e-06, 'samples': 26663616, 'steps': 138872, 'loss/train': 1.4876066446304321} 11/07/2021 16:48:28 - INFO - __main__ - Step 138874: {'lr': 6.941008365145773e-06, 'samples': 26663808, 'steps': 138873, 'loss/train': 0.04232487455010414} 11/07/2021 16:48:28 - INFO - __main__ - Step 138875: {'lr': 6.939766627833327e-06, 'samples': 26664000, 'steps': 138874, 'loss/train': 0.24564918875694275} 11/07/2021 16:48:29 - INFO - __main__ - Step 138876: {'lr': 6.9385250000400526e-06, 'samples': 26664192, 'steps': 138875, 'loss/train': 0.9539720416069031} 11/07/2021 16:48:29 - INFO - __main__ - Step 138877: {'lr': 6.937283481766532e-06, 'samples': 26664384, 'steps': 138876, 'loss/train': 1.0621267557144165} 11/07/2021 16:48:30 - INFO - __main__ - Step 138878: {'lr': 6.936042073013321e-06, 'samples': 26664576, 'steps': 138877, 'loss/train': 1.3052191734313965} 11/07/2021 16:48:30 - INFO - __main__ - Step 138879: {'lr': 6.934800773780975e-06, 'samples': 26664768, 'steps': 138878, 'loss/train': 1.0024484395980835} 11/07/2021 16:48:30 - INFO - __main__ - Step 138880: {'lr': 6.933559584070076e-06, 'samples': 26664960, 'steps': 138879, 'loss/train': 1.6544498205184937} 11/07/2021 16:48:32 - INFO - __main__ - Step 138881: {'lr': 6.93231850388118e-06, 'samples': 26665152, 'steps': 138880, 'loss/train': 1.44025456905365} 11/07/2021 16:48:32 - INFO - __main__ - Step 138882: {'lr': 6.931077533214786e-06, 'samples': 26665344, 'steps': 138881, 'loss/train': 0.09611542522907257} 11/07/2021 16:48:32 - INFO - __main__ - Step 138883: {'lr': 6.929836672071532e-06, 'samples': 26665536, 'steps': 138882, 'loss/train': 0.9477154016494751} 11/07/2021 16:48:33 - INFO - __main__ - Step 138884: {'lr': 6.928595920451919e-06, 'samples': 26665728, 'steps': 138883, 'loss/train': 1.3405619859695435} 11/07/2021 16:48:33 - INFO - __main__ - Step 138885: {'lr': 6.927355278356529e-06, 'samples': 26665920, 'steps': 138884, 'loss/train': 0.677914559841156} 11/07/2021 16:48:34 - INFO - __main__ - Step 138886: {'lr': 6.926114745785916e-06, 'samples': 26666112, 'steps': 138885, 'loss/train': 1.0300203561782837} 11/07/2021 16:48:34 - INFO - __main__ - Step 138887: {'lr': 6.924874322740665e-06, 'samples': 26666304, 'steps': 138886, 'loss/train': 1.2524144649505615} 11/07/2021 16:48:35 - INFO - __main__ - Step 138888: {'lr': 6.923634009221303e-06, 'samples': 26666496, 'steps': 138887, 'loss/train': 1.1062872409820557} 11/07/2021 16:48:35 - INFO - __main__ - Step 138889: {'lr': 6.922393805228411e-06, 'samples': 26666688, 'steps': 138888, 'loss/train': 1.7290279865264893} 11/07/2021 16:48:35 - INFO - __main__ - Step 138890: {'lr': 6.921153710762518e-06, 'samples': 26666880, 'steps': 138889, 'loss/train': 0.9907609224319458} 11/07/2021 16:48:36 - INFO - __main__ - Step 138891: {'lr': 6.919913725824234e-06, 'samples': 26667072, 'steps': 138890, 'loss/train': 0.9172926545143127} 11/07/2021 16:48:37 - INFO - __main__ - Step 138892: {'lr': 6.918673850414087e-06, 'samples': 26667264, 'steps': 138891, 'loss/train': 0.8529251217842102} 11/07/2021 16:48:37 - INFO - __main__ - Step 138893: {'lr': 6.917434084532604e-06, 'samples': 26667456, 'steps': 138892, 'loss/train': 0.8027434349060059} 11/07/2021 16:48:38 - INFO - __main__ - Step 138894: {'lr': 6.916194428180395e-06, 'samples': 26667648, 'steps': 138893, 'loss/train': 1.4589611291885376} 11/07/2021 16:48:38 - INFO - __main__ - Step 138895: {'lr': 6.9149548813579875e-06, 'samples': 26667840, 'steps': 138894, 'loss/train': 1.0544049739837646} 11/07/2021 16:48:38 - INFO - __main__ - Step 138896: {'lr': 6.913715444065937e-06, 'samples': 26668032, 'steps': 138895, 'loss/train': 1.7336547374725342} 11/07/2021 16:48:39 - INFO - __main__ - Step 138897: {'lr': 6.912476116304828e-06, 'samples': 26668224, 'steps': 138896, 'loss/train': 1.2892725467681885} 11/07/2021 16:48:40 - INFO - __main__ - Step 138898: {'lr': 6.911236898075213e-06, 'samples': 26668416, 'steps': 138897, 'loss/train': 1.4270330667495728} 11/07/2021 16:48:40 - INFO - __main__ - Step 138899: {'lr': 6.909997789377648e-06, 'samples': 26668608, 'steps': 138898, 'loss/train': 0.9609357714653015} 11/07/2021 16:48:41 - INFO - __main__ - Step 138900: {'lr': 6.90875879021266e-06, 'samples': 26668800, 'steps': 138899, 'loss/train': 0.9563473463058472} 11/07/2021 16:48:41 - INFO - __main__ - Step 138901: {'lr': 6.907519900580861e-06, 'samples': 26668992, 'steps': 138900, 'loss/train': 1.1779457330703735} 11/07/2021 16:48:41 - INFO - __main__ - Step 138902: {'lr': 6.906281120482777e-06, 'samples': 26669184, 'steps': 138901, 'loss/train': 1.2468911409378052} 11/07/2021 16:48:42 - INFO - __main__ - Step 138903: {'lr': 6.905042449918991e-06, 'samples': 26669376, 'steps': 138902, 'loss/train': 0.7444061040878296} 11/07/2021 16:48:43 - INFO - __main__ - Step 138904: {'lr': 6.903803888890003e-06, 'samples': 26669568, 'steps': 138903, 'loss/train': 1.1243120431900024} 11/07/2021 16:48:43 - INFO - __main__ - Step 138905: {'lr': 6.902565437396424e-06, 'samples': 26669760, 'steps': 138904, 'loss/train': 0.4128488302230835} 11/07/2021 16:48:43 - INFO - __main__ - Step 138906: {'lr': 6.901327095438809e-06, 'samples': 26669952, 'steps': 138905, 'loss/train': 0.6044281721115112} 11/07/2021 16:48:44 - INFO - __main__ - Step 138907: {'lr': 6.900088863017684e-06, 'samples': 26670144, 'steps': 138906, 'loss/train': 0.5648091435432434} 11/07/2021 16:48:45 - INFO - __main__ - Step 138908: {'lr': 6.898850740133633e-06, 'samples': 26670336, 'steps': 138907, 'loss/train': 1.2357807159423828} 11/07/2021 16:48:45 - INFO - __main__ - Step 138909: {'lr': 6.897612726787212e-06, 'samples': 26670528, 'steps': 138908, 'loss/train': 1.096412181854248} 11/07/2021 16:48:46 - INFO - __main__ - Step 138910: {'lr': 6.896374822978974e-06, 'samples': 26670720, 'steps': 138909, 'loss/train': 1.5812733173370361} 11/07/2021 16:48:46 - INFO - __main__ - Step 138911: {'lr': 6.895137028709475e-06, 'samples': 26670912, 'steps': 138910, 'loss/train': 1.1359459161758423} 11/07/2021 16:48:46 - INFO - __main__ - Step 138912: {'lr': 6.893899343979299e-06, 'samples': 26671104, 'steps': 138911, 'loss/train': 1.695116639137268} 11/07/2021 16:48:47 - INFO - __main__ - Step 138913: {'lr': 6.892661768788944e-06, 'samples': 26671296, 'steps': 138912, 'loss/train': 1.29329514503479} 11/07/2021 16:48:48 - INFO - __main__ - Step 138914: {'lr': 6.891424303139021e-06, 'samples': 26671488, 'steps': 138913, 'loss/train': 1.13572359085083} 11/07/2021 16:48:48 - INFO - __main__ - Step 138915: {'lr': 6.890186947030086e-06, 'samples': 26671680, 'steps': 138914, 'loss/train': 1.4006088972091675} 11/07/2021 16:48:48 - INFO - __main__ - Step 138916: {'lr': 6.888949700462693e-06, 'samples': 26671872, 'steps': 138915, 'loss/train': 1.1775174140930176} 11/07/2021 16:48:49 - INFO - __main__ - Step 138917: {'lr': 6.887712563437371e-06, 'samples': 26672064, 'steps': 138916, 'loss/train': 0.9936534762382507} 11/07/2021 16:48:50 - INFO - __main__ - Step 138918: {'lr': 6.886475535954673e-06, 'samples': 26672256, 'steps': 138917, 'loss/train': 1.1372406482696533} 11/07/2021 16:48:50 - INFO - __main__ - Step 138919: {'lr': 6.885238618015183e-06, 'samples': 26672448, 'steps': 138918, 'loss/train': 1.615579605102539} 11/07/2021 16:48:51 - INFO - __main__ - Step 138920: {'lr': 6.884001809619455e-06, 'samples': 26672640, 'steps': 138919, 'loss/train': 1.1758793592453003} 11/07/2021 16:48:51 - INFO - __main__ - Step 138921: {'lr': 6.8827651107680745e-06, 'samples': 26672832, 'steps': 138920, 'loss/train': 1.1864116191864014} 11/07/2021 16:48:51 - INFO - __main__ - Step 138922: {'lr': 6.881528521461539e-06, 'samples': 26673024, 'steps': 138921, 'loss/train': 1.3456388711929321} 11/07/2021 16:48:52 - INFO - __main__ - Step 138923: {'lr': 6.880292041700431e-06, 'samples': 26673216, 'steps': 138922, 'loss/train': 1.1713718175888062} 11/07/2021 16:48:53 - INFO - __main__ - Step 138924: {'lr': 6.879055671485334e-06, 'samples': 26673408, 'steps': 138923, 'loss/train': 1.35267174243927} 11/07/2021 16:48:53 - INFO - __main__ - Step 138925: {'lr': 6.877819410816749e-06, 'samples': 26673600, 'steps': 138924, 'loss/train': 1.2191805839538574} 11/07/2021 16:48:54 - INFO - __main__ - Step 138926: {'lr': 6.876583259695285e-06, 'samples': 26673792, 'steps': 138925, 'loss/train': 1.793023943901062} 11/07/2021 16:48:54 - INFO - __main__ - Step 138927: {'lr': 6.875347218121497e-06, 'samples': 26673984, 'steps': 138926, 'loss/train': 0.9310736656188965} 11/07/2021 16:48:54 - INFO - __main__ - Step 138928: {'lr': 6.874111286095913e-06, 'samples': 26674176, 'steps': 138927, 'loss/train': 0.060132645070552826} 11/07/2021 16:48:55 - INFO - __main__ - Step 138929: {'lr': 6.872875463619088e-06, 'samples': 26674368, 'steps': 138928, 'loss/train': 1.2876425981521606} 11/07/2021 16:48:56 - INFO - __main__ - Step 138930: {'lr': 6.871639750691633e-06, 'samples': 26674560, 'steps': 138929, 'loss/train': 1.1928939819335938} 11/07/2021 16:48:56 - INFO - __main__ - Step 138931: {'lr': 6.870404147314047e-06, 'samples': 26674752, 'steps': 138930, 'loss/train': 1.4424502849578857} 11/07/2021 16:48:56 - INFO - __main__ - Step 138932: {'lr': 6.8691686534869126e-06, 'samples': 26674944, 'steps': 138931, 'loss/train': 0.5849564671516418} 11/07/2021 16:48:57 - INFO - __main__ - Step 138933: {'lr': 6.867933269210757e-06, 'samples': 26675136, 'steps': 138932, 'loss/train': 1.127909779548645} 11/07/2021 16:48:58 - INFO - __main__ - Step 138934: {'lr': 6.8666979944861655e-06, 'samples': 26675328, 'steps': 138933, 'loss/train': 1.2470089197158813} 11/07/2021 16:48:58 - INFO - __main__ - Step 138935: {'lr': 6.86546282931369e-06, 'samples': 26675520, 'steps': 138934, 'loss/train': 1.4738390445709229} 11/07/2021 16:48:59 - INFO - __main__ - Step 138936: {'lr': 6.864227773693888e-06, 'samples': 26675712, 'steps': 138935, 'loss/train': 1.2026439905166626} 11/07/2021 16:48:59 - INFO - __main__ - Step 138937: {'lr': 6.862992827627312e-06, 'samples': 26675904, 'steps': 138936, 'loss/train': 1.173466444015503} 11/07/2021 16:48:59 - INFO - __main__ - Step 138938: {'lr': 6.86175799111452e-06, 'samples': 26676096, 'steps': 138937, 'loss/train': 1.240141749382019} 11/07/2021 16:49:01 - INFO - __main__ - Step 138939: {'lr': 6.860523264156065e-06, 'samples': 26676288, 'steps': 138938, 'loss/train': 1.2638291120529175} 11/07/2021 16:49:01 - INFO - __main__ - Step 138940: {'lr': 6.859288646752504e-06, 'samples': 26676480, 'steps': 138939, 'loss/train': 1.348771572113037} 11/07/2021 16:49:01 - INFO - __main__ - Step 138941: {'lr': 6.8580541389043905e-06, 'samples': 26676672, 'steps': 138940, 'loss/train': 0.8842095136642456} 11/07/2021 16:49:02 - INFO - __main__ - Step 138942: {'lr': 6.856819740612308e-06, 'samples': 26676864, 'steps': 138941, 'loss/train': 0.12531565129756927} 11/07/2021 16:49:02 - INFO - __main__ - Step 138943: {'lr': 6.855585451876783e-06, 'samples': 26677056, 'steps': 138942, 'loss/train': 0.9078782200813293} 11/07/2021 16:49:03 - INFO - __main__ - Step 138944: {'lr': 6.854351272698373e-06, 'samples': 26677248, 'steps': 138943, 'loss/train': 1.4737943410873413} 11/07/2021 16:49:03 - INFO - __main__ - Step 138945: {'lr': 6.853117203077658e-06, 'samples': 26677440, 'steps': 138944, 'loss/train': 1.388197898864746} 11/07/2021 16:49:04 - INFO - __main__ - Step 138946: {'lr': 6.851883243015139e-06, 'samples': 26677632, 'steps': 138945, 'loss/train': 1.242492437362671} 11/07/2021 16:49:04 - INFO - __main__ - Step 138947: {'lr': 6.850649392511426e-06, 'samples': 26677824, 'steps': 138946, 'loss/train': 1.2746810913085938} 11/07/2021 16:49:04 - INFO - __main__ - Step 138948: {'lr': 6.849415651567076e-06, 'samples': 26678016, 'steps': 138947, 'loss/train': 1.370296597480774} 11/07/2021 16:49:05 - INFO - __main__ - Step 138949: {'lr': 6.848182020182614e-06, 'samples': 26678208, 'steps': 138948, 'loss/train': 1.287134051322937} 11/07/2021 16:49:06 - INFO - __main__ - Step 138950: {'lr': 6.8469484983585965e-06, 'samples': 26678400, 'steps': 138949, 'loss/train': 1.1546889543533325} 11/07/2021 16:49:06 - INFO - __main__ - Step 138951: {'lr': 6.845715086095605e-06, 'samples': 26678592, 'steps': 138950, 'loss/train': 1.4247467517852783} 11/07/2021 16:49:06 - INFO - __main__ - Step 138952: {'lr': 6.8444817833941684e-06, 'samples': 26678784, 'steps': 138951, 'loss/train': 1.1983423233032227} 11/07/2021 16:49:07 - INFO - __main__ - Step 138953: {'lr': 6.843248590254869e-06, 'samples': 26678976, 'steps': 138952, 'loss/train': 1.3642665147781372} 11/07/2021 16:49:08 - INFO - __main__ - Step 138954: {'lr': 6.842015506678262e-06, 'samples': 26679168, 'steps': 138953, 'loss/train': 1.2280690670013428} 11/07/2021 16:49:08 - INFO - __main__ - Step 138955: {'lr': 6.840782532664875e-06, 'samples': 26679360, 'steps': 138954, 'loss/train': 1.2669156789779663} 11/07/2021 16:49:09 - INFO - __main__ - Step 138956: {'lr': 6.839549668215289e-06, 'samples': 26679552, 'steps': 138955, 'loss/train': 0.8780088424682617} 11/07/2021 16:49:09 - INFO - __main__ - Step 138957: {'lr': 6.838316913330062e-06, 'samples': 26679744, 'steps': 138956, 'loss/train': 1.6521360874176025} 11/07/2021 16:49:09 - INFO - __main__ - Step 138958: {'lr': 6.837084268009719e-06, 'samples': 26679936, 'steps': 138957, 'loss/train': 1.228028655052185} 11/07/2021 16:49:10 - INFO - __main__ - Step 138959: {'lr': 6.835851732254816e-06, 'samples': 26680128, 'steps': 138958, 'loss/train': 1.0508143901824951} 11/07/2021 16:49:11 - INFO - __main__ - Step 138960: {'lr': 6.8346193060659645e-06, 'samples': 26680320, 'steps': 138959, 'loss/train': 1.5833848714828491} 11/07/2021 16:49:11 - INFO - __main__ - Step 138961: {'lr': 6.833386989443635e-06, 'samples': 26680512, 'steps': 138960, 'loss/train': 1.3733843564987183} 11/07/2021 16:49:11 - INFO - __main__ - Step 138962: {'lr': 6.832154782388466e-06, 'samples': 26680704, 'steps': 138961, 'loss/train': 1.043505072593689} 11/07/2021 16:49:12 - INFO - __main__ - Step 138963: {'lr': 6.830922684900959e-06, 'samples': 26680896, 'steps': 138962, 'loss/train': 1.6416680812835693} 11/07/2021 16:49:13 - INFO - __main__ - Step 138964: {'lr': 6.829690696981694e-06, 'samples': 26681088, 'steps': 138963, 'loss/train': 0.6049063205718994} 11/07/2021 16:49:13 - INFO - __main__ - Step 138965: {'lr': 6.8284588186312e-06, 'samples': 26681280, 'steps': 138964, 'loss/train': 1.7050877809524536} 11/07/2021 16:49:14 - INFO - __main__ - Step 138966: {'lr': 6.827227049850088e-06, 'samples': 26681472, 'steps': 138965, 'loss/train': 1.348645567893982} 11/07/2021 16:49:14 - INFO - __main__ - Step 138967: {'lr': 6.825995390638828e-06, 'samples': 26681664, 'steps': 138966, 'loss/train': 1.5318995714187622} 11/07/2021 16:49:14 - INFO - __main__ - Step 138968: {'lr': 6.82476384099806e-06, 'samples': 26681856, 'steps': 138967, 'loss/train': 1.350536584854126} 11/07/2021 16:49:15 - INFO - __main__ - Step 138969: {'lr': 6.823532400928284e-06, 'samples': 26682048, 'steps': 138968, 'loss/train': 1.4626972675323486} 11/07/2021 16:49:16 - INFO - __main__ - Step 138970: {'lr': 6.8223010704300816e-06, 'samples': 26682240, 'steps': 138969, 'loss/train': 0.6509256362915039} 11/07/2021 16:49:16 - INFO - __main__ - Step 138971: {'lr': 6.821069849504008e-06, 'samples': 26682432, 'steps': 138970, 'loss/train': 1.5021594762802124} 11/07/2021 16:49:17 - INFO - __main__ - Step 138972: {'lr': 6.8198387381505915e-06, 'samples': 26682624, 'steps': 138971, 'loss/train': 1.3470553159713745} 11/07/2021 16:49:17 - INFO - __main__ - Step 138973: {'lr': 6.818607736370386e-06, 'samples': 26682816, 'steps': 138972, 'loss/train': 0.047513194382190704} 11/07/2021 16:49:17 - INFO - __main__ - Step 138974: {'lr': 6.817376844163975e-06, 'samples': 26683008, 'steps': 138973, 'loss/train': 0.15108932554721832} 11/07/2021 16:49:18 - INFO - __main__ - Step 138975: {'lr': 6.816146061531914e-06, 'samples': 26683200, 'steps': 138974, 'loss/train': 1.5439109802246094} 11/07/2021 16:49:19 - INFO - __main__ - Step 138976: {'lr': 6.81491538847473e-06, 'samples': 26683392, 'steps': 138975, 'loss/train': 1.2058099508285522} 11/07/2021 16:49:19 - INFO - __main__ - Step 138977: {'lr': 6.813684824993005e-06, 'samples': 26683584, 'steps': 138976, 'loss/train': 0.9987470507621765} 11/07/2021 16:49:20 - INFO - __main__ - Step 138978: {'lr': 6.8124543710872674e-06, 'samples': 26683776, 'steps': 138977, 'loss/train': 0.946687638759613} 11/07/2021 16:49:20 - INFO - __main__ - Step 138979: {'lr': 6.8112240267581e-06, 'samples': 26683968, 'steps': 138978, 'loss/train': 1.1434348821640015} 11/07/2021 16:49:21 - INFO - __main__ - Step 138980: {'lr': 6.80999379200603e-06, 'samples': 26684160, 'steps': 138979, 'loss/train': 1.30095636844635} 11/07/2021 16:49:21 - INFO - __main__ - Step 138981: {'lr': 6.808763666831641e-06, 'samples': 26684352, 'steps': 138980, 'loss/train': 1.0106613636016846} 11/07/2021 16:49:22 - INFO - __main__ - Step 138982: {'lr': 6.807533651235459e-06, 'samples': 26684544, 'steps': 138981, 'loss/train': 1.087647557258606} 11/07/2021 16:49:22 - INFO - __main__ - Step 138983: {'lr': 6.80630374521804e-06, 'samples': 26684736, 'steps': 138982, 'loss/train': 1.059249758720398} 11/07/2021 16:49:22 - INFO - __main__ - Step 138984: {'lr': 6.805073948779994e-06, 'samples': 26684928, 'steps': 138983, 'loss/train': 0.7178373336791992} 11/07/2021 16:49:24 - INFO - __main__ - Step 138985: {'lr': 6.803844261921793e-06, 'samples': 26685120, 'steps': 138984, 'loss/train': 1.2035090923309326} 11/07/2021 16:49:24 - INFO - __main__ - Step 138986: {'lr': 6.8026146846440205e-06, 'samples': 26685312, 'steps': 138985, 'loss/train': 1.8057525157928467} 11/07/2021 16:49:24 - INFO - __main__ - Step 138987: {'lr': 6.801385216947231e-06, 'samples': 26685504, 'steps': 138986, 'loss/train': 0.8875778317451477} 11/07/2021 16:49:25 - INFO - __main__ - Step 138988: {'lr': 6.800155858832008e-06, 'samples': 26685696, 'steps': 138987, 'loss/train': 1.405226707458496} 11/07/2021 16:49:25 - INFO - __main__ - Step 138989: {'lr': 6.798926610298878e-06, 'samples': 26685888, 'steps': 138988, 'loss/train': 1.2723783254623413} 11/07/2021 16:49:26 - INFO - __main__ - Step 138990: {'lr': 6.7976974713483685e-06, 'samples': 26686080, 'steps': 138989, 'loss/train': 0.04453745856881142} 11/07/2021 16:49:26 - INFO - __main__ - Step 138991: {'lr': 6.7964684419810906e-06, 'samples': 26686272, 'steps': 138990, 'loss/train': 1.466178297996521} 11/07/2021 16:49:27 - INFO - __main__ - Step 138992: {'lr': 6.795239522197572e-06, 'samples': 26686464, 'steps': 138991, 'loss/train': 1.2367326021194458} 11/07/2021 16:49:27 - INFO - __main__ - Step 138993: {'lr': 6.7940107119983665e-06, 'samples': 26686656, 'steps': 138992, 'loss/train': 1.119428277015686} 11/07/2021 16:49:28 - INFO - __main__ - Step 138994: {'lr': 6.792782011384002e-06, 'samples': 26686848, 'steps': 138993, 'loss/train': 1.0459191799163818} 11/07/2021 16:49:29 - INFO - __main__ - Step 138995: {'lr': 6.79155342035509e-06, 'samples': 26687040, 'steps': 138994, 'loss/train': 0.42277395725250244} 11/07/2021 16:49:29 - INFO - __main__ - Step 138996: {'lr': 6.790324938912129e-06, 'samples': 26687232, 'steps': 138995, 'loss/train': 1.2875186204910278} 11/07/2021 16:49:29 - INFO - __main__ - Step 138997: {'lr': 6.78909656705573e-06, 'samples': 26687424, 'steps': 138996, 'loss/train': 0.6626763939857483} 11/07/2021 16:49:30 - INFO - __main__ - Step 138998: {'lr': 6.787868304786393e-06, 'samples': 26687616, 'steps': 138997, 'loss/train': 1.5787690877914429} 11/07/2021 16:49:30 - INFO - __main__ - Step 138999: {'lr': 6.7866401521046724e-06, 'samples': 26687808, 'steps': 138998, 'loss/train': 1.1874816417694092} 11/07/2021 16:49:30 - INFO - __main__ - Step 139000: {'lr': 6.785412109011152e-06, 'samples': 26688000, 'steps': 138999, 'loss/train': 1.568562388420105} 11/07/2021 16:49:31 - INFO - __main__ - Step 139001: {'lr': 6.784184175506358e-06, 'samples': 26688192, 'steps': 139000, 'loss/train': 1.1389845609664917} 11/07/2021 16:49:32 - INFO - __main__ - Step 139002: {'lr': 6.782956351590874e-06, 'samples': 26688384, 'steps': 139001, 'loss/train': 1.2778565883636475} 11/07/2021 16:49:32 - INFO - __main__ - Step 139003: {'lr': 6.7817286372652274e-06, 'samples': 26688576, 'steps': 139002, 'loss/train': 1.1025320291519165} 11/07/2021 16:49:32 - INFO - __main__ - Step 139004: {'lr': 6.780501032529973e-06, 'samples': 26688768, 'steps': 139003, 'loss/train': 0.029069803655147552} 11/07/2021 16:49:33 - INFO - __main__ - Step 139005: {'lr': 6.7792735373856936e-06, 'samples': 26688960, 'steps': 139004, 'loss/train': 1.201503872871399} 11/07/2021 16:49:34 - INFO - __main__ - Step 139006: {'lr': 6.77804615183289e-06, 'samples': 26689152, 'steps': 139005, 'loss/train': 0.8685140013694763} 11/07/2021 16:49:34 - INFO - __main__ - Step 139007: {'lr': 6.77681887587217e-06, 'samples': 26689344, 'steps': 139006, 'loss/train': 1.0233088731765747} 11/07/2021 16:49:34 - INFO - __main__ - Step 139008: {'lr': 6.7755917095040645e-06, 'samples': 26689536, 'steps': 139007, 'loss/train': 1.3355076313018799} 11/07/2021 16:49:35 - INFO - __main__ - Step 139009: {'lr': 6.774364652729098e-06, 'samples': 26689728, 'steps': 139008, 'loss/train': 1.2672656774520874} 11/07/2021 16:49:35 - INFO - __main__ - Step 139010: {'lr': 6.77313770554791e-06, 'samples': 26689920, 'steps': 139009, 'loss/train': 1.42082679271698} 11/07/2021 16:49:36 - INFO - __main__ - Step 139011: {'lr': 6.771910867960945e-06, 'samples': 26690112, 'steps': 139010, 'loss/train': 1.2648957967758179} 11/07/2021 16:49:36 - INFO - __main__ - Step 139012: {'lr': 6.770684139968814e-06, 'samples': 26690304, 'steps': 139011, 'loss/train': 1.5726782083511353} 11/07/2021 16:49:37 - INFO - __main__ - Step 139013: {'lr': 6.7694575215720425e-06, 'samples': 26690496, 'steps': 139012, 'loss/train': 1.4020062685012817} 11/07/2021 16:49:37 - INFO - __main__ - Step 139014: {'lr': 6.768231012771214e-06, 'samples': 26690688, 'steps': 139013, 'loss/train': 2.0726370811462402} 11/07/2021 16:49:38 - INFO - __main__ - Step 139015: {'lr': 6.767004613566858e-06, 'samples': 26690880, 'steps': 139014, 'loss/train': 1.4842864274978638} 11/07/2021 16:49:39 - INFO - __main__ - Step 139016: {'lr': 6.7657783239595536e-06, 'samples': 26691072, 'steps': 139015, 'loss/train': 1.5030969381332397} 11/07/2021 16:49:39 - INFO - __main__ - Step 139017: {'lr': 6.7645521439498316e-06, 'samples': 26691264, 'steps': 139016, 'loss/train': 0.6619526743888855} 11/07/2021 16:49:39 - INFO - __main__ - Step 139018: {'lr': 6.7633260735382455e-06, 'samples': 26691456, 'steps': 139017, 'loss/train': 1.1167157888412476} 11/07/2021 16:49:40 - INFO - __main__ - Step 139019: {'lr': 6.76210011272535e-06, 'samples': 26691648, 'steps': 139018, 'loss/train': 0.9810742735862732} 11/07/2021 16:49:40 - INFO - __main__ - Step 139020: {'lr': 6.760874261511674e-06, 'samples': 26691840, 'steps': 139019, 'loss/train': 1.1304668188095093} 11/07/2021 16:49:41 - INFO - __main__ - Step 139021: {'lr': 6.759648519897826e-06, 'samples': 26692032, 'steps': 139020, 'loss/train': 1.7338414192199707} 11/07/2021 16:49:41 - INFO - __main__ - Step 139022: {'lr': 6.7584228878843355e-06, 'samples': 26692224, 'steps': 139021, 'loss/train': 1.4306906461715698} 11/07/2021 16:49:42 - INFO - __main__ - Step 139023: {'lr': 6.757197365471729e-06, 'samples': 26692416, 'steps': 139022, 'loss/train': 1.0142548084259033} 11/07/2021 16:49:42 - INFO - __main__ - Step 139024: {'lr': 6.755971952660589e-06, 'samples': 26692608, 'steps': 139023, 'loss/train': 1.4401905536651611} 11/07/2021 16:49:42 - INFO - __main__ - Step 139025: {'lr': 6.754746649451443e-06, 'samples': 26692800, 'steps': 139024, 'loss/train': 1.496820092201233} 11/07/2021 16:49:44 - INFO - __main__ - Step 139026: {'lr': 6.753521455844847e-06, 'samples': 26692992, 'steps': 139025, 'loss/train': 1.0347871780395508} 11/07/2021 16:49:44 - INFO - __main__ - Step 139027: {'lr': 6.752296371841382e-06, 'samples': 26693184, 'steps': 139026, 'loss/train': 0.930253267288208} 11/07/2021 16:49:44 - INFO - __main__ - Step 139028: {'lr': 6.75107139744155e-06, 'samples': 26693376, 'steps': 139027, 'loss/train': 1.3555047512054443} 11/07/2021 16:49:45 - INFO - __main__ - Step 139029: {'lr': 6.749846532645959e-06, 'samples': 26693568, 'steps': 139028, 'loss/train': 1.3054358959197998} 11/07/2021 16:49:45 - INFO - __main__ - Step 139030: {'lr': 6.7486217774551106e-06, 'samples': 26693760, 'steps': 139029, 'loss/train': 0.8540902733802795} 11/07/2021 16:49:46 - INFO - __main__ - Step 139031: {'lr': 6.747397131869587e-06, 'samples': 26693952, 'steps': 139030, 'loss/train': 1.6176434755325317} 11/07/2021 16:49:46 - INFO - __main__ - Step 139032: {'lr': 6.746172595889943e-06, 'samples': 26694144, 'steps': 139031, 'loss/train': 0.9738638997077942} 11/07/2021 16:49:47 - INFO - __main__ - Step 139033: {'lr': 6.744948169516707e-06, 'samples': 26694336, 'steps': 139032, 'loss/train': 1.0240787267684937} 11/07/2021 16:49:47 - INFO - __main__ - Step 139034: {'lr': 6.743723852750461e-06, 'samples': 26694528, 'steps': 139033, 'loss/train': 1.523693323135376} 11/07/2021 16:49:47 - INFO - __main__ - Step 139035: {'lr': 6.742499645591732e-06, 'samples': 26694720, 'steps': 139034, 'loss/train': 1.3023958206176758} 11/07/2021 16:49:48 - INFO - __main__ - Step 139036: {'lr': 6.741275548041076e-06, 'samples': 26694912, 'steps': 139035, 'loss/train': 0.8065170049667358} 11/07/2021 16:49:49 - INFO - __main__ - Step 139037: {'lr': 6.7400515600990754e-06, 'samples': 26695104, 'steps': 139036, 'loss/train': 1.0818254947662354} 11/07/2021 16:49:49 - INFO - __main__ - Step 139038: {'lr': 6.738827681766202e-06, 'samples': 26695296, 'steps': 139037, 'loss/train': 0.9494635462760925} 11/07/2021 16:49:49 - INFO - __main__ - Step 139039: {'lr': 6.737603913043094e-06, 'samples': 26695488, 'steps': 139038, 'loss/train': 0.9723314642906189} 11/07/2021 16:49:50 - INFO - __main__ - Step 139040: {'lr': 6.736380253930252e-06, 'samples': 26695680, 'steps': 139039, 'loss/train': 1.3377821445465088} 11/07/2021 16:49:51 - INFO - __main__ - Step 139041: {'lr': 6.735156704428258e-06, 'samples': 26695872, 'steps': 139040, 'loss/train': 1.2434343099594116} 11/07/2021 16:49:51 - INFO - __main__ - Step 139042: {'lr': 6.733933264537639e-06, 'samples': 26696064, 'steps': 139041, 'loss/train': 1.041127324104309} 11/07/2021 16:49:52 - INFO - __main__ - Step 139043: {'lr': 6.732709934258951e-06, 'samples': 26696256, 'steps': 139042, 'loss/train': 0.456666499376297} 11/07/2021 16:49:52 - INFO - __main__ - Step 139044: {'lr': 6.73148671359275e-06, 'samples': 26696448, 'steps': 139043, 'loss/train': 0.9705220460891724} 11/07/2021 16:49:52 - INFO - __main__ - Step 139045: {'lr': 6.7302636025395884e-06, 'samples': 26696640, 'steps': 139044, 'loss/train': 1.712236762046814} 11/07/2021 16:49:53 - INFO - __main__ - Step 139046: {'lr': 6.729040601100022e-06, 'samples': 26696832, 'steps': 139045, 'loss/train': 2.039492607116699} 11/07/2021 16:49:54 - INFO - __main__ - Step 139047: {'lr': 6.727817709274581e-06, 'samples': 26697024, 'steps': 139046, 'loss/train': 1.1581133604049683} 11/07/2021 16:49:54 - INFO - __main__ - Step 139048: {'lr': 6.726594927063845e-06, 'samples': 26697216, 'steps': 139047, 'loss/train': 0.9502219557762146} 11/07/2021 16:49:54 - INFO - __main__ - Step 139049: {'lr': 6.725372254468343e-06, 'samples': 26697408, 'steps': 139048, 'loss/train': 1.0391638278961182} 11/07/2021 16:49:55 - INFO - __main__ - Step 139050: {'lr': 6.724149691488657e-06, 'samples': 26697600, 'steps': 139049, 'loss/train': 1.1761445999145508} 11/07/2021 16:49:56 - INFO - __main__ - Step 139051: {'lr': 6.722927238125315e-06, 'samples': 26697792, 'steps': 139050, 'loss/train': 1.5628317594528198} 11/07/2021 16:49:56 - INFO - __main__ - Step 139052: {'lr': 6.721704894378844e-06, 'samples': 26697984, 'steps': 139051, 'loss/train': 1.2415130138397217} 11/07/2021 16:49:56 - INFO - __main__ - Step 139053: {'lr': 6.720482660249827e-06, 'samples': 26698176, 'steps': 139052, 'loss/train': 1.0087448358535767} 11/07/2021 16:49:57 - INFO - __main__ - Step 139054: {'lr': 6.719260535738819e-06, 'samples': 26698368, 'steps': 139053, 'loss/train': 1.3707233667373657} 11/07/2021 16:49:57 - INFO - __main__ - Step 139055: {'lr': 6.7180385208463476e-06, 'samples': 26698560, 'steps': 139054, 'loss/train': 1.2060751914978027} 11/07/2021 16:49:57 - INFO - __main__ - Step 139056: {'lr': 6.716816615572968e-06, 'samples': 26698752, 'steps': 139055, 'loss/train': 1.374407172203064} 11/07/2021 16:49:59 - INFO - __main__ - Step 139057: {'lr': 6.715594819919235e-06, 'samples': 26698944, 'steps': 139056, 'loss/train': 0.92844158411026} 11/07/2021 16:49:59 - INFO - __main__ - Step 139058: {'lr': 6.714373133885704e-06, 'samples': 26699136, 'steps': 139057, 'loss/train': 1.142436146736145} 11/07/2021 16:50:00 - INFO - __main__ - Step 139059: {'lr': 6.71315155747293e-06, 'samples': 26699328, 'steps': 139058, 'loss/train': 1.0149540901184082} 11/07/2021 16:50:00 - INFO - __main__ - Step 139060: {'lr': 6.7119300906814394e-06, 'samples': 26699520, 'steps': 139059, 'loss/train': 1.2061651945114136} 11/07/2021 16:50:00 - INFO - __main__ - Step 139061: {'lr': 6.7107087335118165e-06, 'samples': 26699712, 'steps': 139060, 'loss/train': 1.7273274660110474} 11/07/2021 16:50:01 - INFO - __main__ - Step 139062: {'lr': 6.7094874859645885e-06, 'samples': 26699904, 'steps': 139061, 'loss/train': 1.2971171140670776} 11/07/2021 16:50:02 - INFO - __main__ - Step 139063: {'lr': 6.7082663480403096e-06, 'samples': 26700096, 'steps': 139062, 'loss/train': 1.2836332321166992} 11/07/2021 16:50:02 - INFO - __main__ - Step 139064: {'lr': 6.707045319739563e-06, 'samples': 26700288, 'steps': 139063, 'loss/train': 1.3865890502929688} 11/07/2021 16:50:02 - INFO - __main__ - Step 139065: {'lr': 6.705824401062821e-06, 'samples': 26700480, 'steps': 139064, 'loss/train': 1.0840595960617065} 11/07/2021 16:50:03 - INFO - __main__ - Step 139066: {'lr': 6.704603592010694e-06, 'samples': 26700672, 'steps': 139065, 'loss/train': 0.8102603554725647} 11/07/2021 16:50:03 - INFO - __main__ - Step 139067: {'lr': 6.703382892583737e-06, 'samples': 26700864, 'steps': 139066, 'loss/train': 1.602455973625183} 11/07/2021 16:50:04 - INFO - __main__ - Step 139068: {'lr': 6.70216230278245e-06, 'samples': 26701056, 'steps': 139067, 'loss/train': 0.547849178314209} 11/07/2021 16:50:05 - INFO - __main__ - Step 139069: {'lr': 6.700941822607443e-06, 'samples': 26701248, 'steps': 139068, 'loss/train': 1.3330323696136475} 11/07/2021 16:50:05 - INFO - __main__ - Step 139070: {'lr': 6.699721452059215e-06, 'samples': 26701440, 'steps': 139069, 'loss/train': 1.260255217552185} 11/07/2021 16:50:05 - INFO - __main__ - Step 139071: {'lr': 6.698501191138351e-06, 'samples': 26701632, 'steps': 139070, 'loss/train': 1.026437759399414} 11/07/2021 16:50:06 - INFO - __main__ - Step 139072: {'lr': 6.697281039845377e-06, 'samples': 26701824, 'steps': 139071, 'loss/train': 1.4111618995666504} 11/07/2021 16:50:07 - INFO - __main__ - Step 139073: {'lr': 6.696060998180875e-06, 'samples': 26702016, 'steps': 139072, 'loss/train': 1.4447252750396729} 11/07/2021 16:50:07 - INFO - __main__ - Step 139074: {'lr': 6.694841066145346e-06, 'samples': 26702208, 'steps': 139073, 'loss/train': 0.8757238388061523} 11/07/2021 16:50:07 - INFO - __main__ - Step 139075: {'lr': 6.693621243739373e-06, 'samples': 26702400, 'steps': 139074, 'loss/train': 1.4521465301513672} 11/07/2021 16:50:08 - INFO - __main__ - Step 139076: {'lr': 6.69240153096351e-06, 'samples': 26702592, 'steps': 139075, 'loss/train': 1.0483795404434204} 11/07/2021 16:50:08 - INFO - __main__ - Step 139077: {'lr': 6.691181927818285e-06, 'samples': 26702784, 'steps': 139076, 'loss/train': 1.3403575420379639} 11/07/2021 16:50:09 - INFO - __main__ - Step 139078: {'lr': 6.689962434304309e-06, 'samples': 26702976, 'steps': 139077, 'loss/train': 1.2685046195983887} 11/07/2021 16:50:09 - INFO - __main__ - Step 139079: {'lr': 6.688743050422025e-06, 'samples': 26703168, 'steps': 139078, 'loss/train': 1.0369155406951904} 11/07/2021 16:50:10 - INFO - __main__ - Step 139080: {'lr': 6.687523776172072e-06, 'samples': 26703360, 'steps': 139079, 'loss/train': 0.8315116167068481} 11/07/2021 16:50:10 - INFO - __main__ - Step 139081: {'lr': 6.6863046115549495e-06, 'samples': 26703552, 'steps': 139080, 'loss/train': 1.3925095796585083} 11/07/2021 16:50:10 - INFO - __main__ - Step 139082: {'lr': 6.685085556571213e-06, 'samples': 26703744, 'steps': 139081, 'loss/train': 0.19300533831119537} 11/07/2021 16:50:11 - INFO - __main__ - Step 139083: {'lr': 6.6838666112214176e-06, 'samples': 26703936, 'steps': 139082, 'loss/train': 0.8768734931945801} 11/07/2021 16:50:12 - INFO - __main__ - Step 139084: {'lr': 6.682647775506146e-06, 'samples': 26704128, 'steps': 139083, 'loss/train': 1.5343232154846191} 11/07/2021 16:50:12 - INFO - __main__ - Step 139085: {'lr': 6.6814290494258965e-06, 'samples': 26704320, 'steps': 139084, 'loss/train': 0.9289339184761047} 11/07/2021 16:50:13 - INFO - __main__ - Step 139086: {'lr': 6.680210432981254e-06, 'samples': 26704512, 'steps': 139085, 'loss/train': 1.3108769655227661} 11/07/2021 16:50:13 - INFO - __main__ - Step 139087: {'lr': 6.678991926172745e-06, 'samples': 26704704, 'steps': 139086, 'loss/train': 1.1754564046859741} 11/07/2021 16:50:13 - INFO - __main__ - Step 139088: {'lr': 6.677773529000925e-06, 'samples': 26704896, 'steps': 139087, 'loss/train': 0.993010938167572} 11/07/2021 16:50:15 - INFO - __main__ - Step 139089: {'lr': 6.676555241466348e-06, 'samples': 26705088, 'steps': 139088, 'loss/train': 1.5271369218826294} 11/07/2021 16:50:15 - INFO - __main__ - Step 139090: {'lr': 6.67533706356957e-06, 'samples': 26705280, 'steps': 139089, 'loss/train': 1.3016307353973389} 11/07/2021 16:50:15 - INFO - __main__ - Step 139091: {'lr': 6.674118995311146e-06, 'samples': 26705472, 'steps': 139090, 'loss/train': 1.2187737226486206} 11/07/2021 16:50:16 - INFO - __main__ - Step 139092: {'lr': 6.672901036691575e-06, 'samples': 26705664, 'steps': 139091, 'loss/train': 1.2811455726623535} 11/07/2021 16:50:16 - INFO - __main__ - Step 139093: {'lr': 6.671683187711469e-06, 'samples': 26705856, 'steps': 139092, 'loss/train': 1.287393569946289} 11/07/2021 16:50:17 - INFO - __main__ - Step 139094: {'lr': 6.6704654483713265e-06, 'samples': 26706048, 'steps': 139093, 'loss/train': 1.1308279037475586} 11/07/2021 16:50:17 - INFO - __main__ - Step 139095: {'lr': 6.669247818671731e-06, 'samples': 26706240, 'steps': 139094, 'loss/train': 0.04644764959812164} 11/07/2021 16:50:18 - INFO - __main__ - Step 139096: {'lr': 6.6680302986132094e-06, 'samples': 26706432, 'steps': 139095, 'loss/train': 0.15210847556591034} 11/07/2021 16:50:18 - INFO - __main__ - Step 139097: {'lr': 6.666812888196316e-06, 'samples': 26706624, 'steps': 139096, 'loss/train': 1.030463695526123} 11/07/2021 16:50:18 - INFO - __main__ - Step 139098: {'lr': 6.6655955874215805e-06, 'samples': 26706816, 'steps': 139097, 'loss/train': 1.0121577978134155} 11/07/2021 16:50:19 - INFO - __main__ - Step 139099: {'lr': 6.664378396289611e-06, 'samples': 26707008, 'steps': 139098, 'loss/train': 1.0430018901824951} 11/07/2021 16:50:20 - INFO - __main__ - Step 139100: {'lr': 6.663161314800909e-06, 'samples': 26707200, 'steps': 139099, 'loss/train': 1.3453770875930786} 11/07/2021 16:50:20 - INFO - __main__ - Step 139101: {'lr': 6.661944342956e-06, 'samples': 26707392, 'steps': 139100, 'loss/train': 1.1929373741149902} 11/07/2021 16:50:20 - INFO - __main__ - Step 139102: {'lr': 6.660727480755496e-06, 'samples': 26707584, 'steps': 139101, 'loss/train': 0.5105741620063782} 11/07/2021 16:50:21 - INFO - __main__ - Step 139103: {'lr': 6.659510728199897e-06, 'samples': 26707776, 'steps': 139102, 'loss/train': 1.3604319095611572} 11/07/2021 16:50:22 - INFO - __main__ - Step 139104: {'lr': 6.658294085289784e-06, 'samples': 26707968, 'steps': 139103, 'loss/train': 1.0408086776733398} 11/07/2021 16:50:22 - INFO - __main__ - Step 139105: {'lr': 6.657077552025714e-06, 'samples': 26708160, 'steps': 139104, 'loss/train': 1.1473276615142822} 11/07/2021 16:50:23 - INFO - __main__ - Step 139106: {'lr': 6.655861128408186e-06, 'samples': 26708352, 'steps': 139105, 'loss/train': 1.8756754398345947} 11/07/2021 16:50:23 - INFO - __main__ - Step 139107: {'lr': 6.654644814437755e-06, 'samples': 26708544, 'steps': 139106, 'loss/train': 1.4119088649749756} 11/07/2021 16:50:23 - INFO - __main__ - Step 139108: {'lr': 6.653428610114975e-06, 'samples': 26708736, 'steps': 139107, 'loss/train': 1.0976483821868896} 11/07/2021 16:50:24 - INFO - __main__ - Step 139109: {'lr': 6.652212515440431e-06, 'samples': 26708928, 'steps': 139108, 'loss/train': 0.9022614359855652} 11/07/2021 16:50:25 - INFO - __main__ - Step 139110: {'lr': 6.650996530414649e-06, 'samples': 26709120, 'steps': 139109, 'loss/train': 1.2931162118911743} 11/07/2021 16:50:25 - INFO - __main__ - Step 139111: {'lr': 6.649780655038156e-06, 'samples': 26709312, 'steps': 139110, 'loss/train': 1.5654144287109375} 11/07/2021 16:50:26 - INFO - __main__ - Step 139112: {'lr': 6.648564889311509e-06, 'samples': 26709504, 'steps': 139111, 'loss/train': 1.45720636844635} 11/07/2021 16:50:26 - INFO - __main__ - Step 139113: {'lr': 6.647349233235289e-06, 'samples': 26709696, 'steps': 139112, 'loss/train': 0.342839777469635} 11/07/2021 16:50:26 - INFO - __main__ - Step 139114: {'lr': 6.646133686809997e-06, 'samples': 26709888, 'steps': 139113, 'loss/train': 1.5735821723937988} 11/07/2021 16:50:27 - INFO - __main__ - Step 139115: {'lr': 6.644918250036214e-06, 'samples': 26710080, 'steps': 139114, 'loss/train': 1.1435482501983643} 11/07/2021 16:50:28 - INFO - __main__ - Step 139116: {'lr': 6.6437029229144684e-06, 'samples': 26710272, 'steps': 139115, 'loss/train': 1.4031115770339966} 11/07/2021 16:50:28 - INFO - __main__ - Step 139117: {'lr': 6.642487705445344e-06, 'samples': 26710464, 'steps': 139116, 'loss/train': 1.3987326622009277} 11/07/2021 16:50:28 - INFO - __main__ - Step 139118: {'lr': 6.6412725976293386e-06, 'samples': 26710656, 'steps': 139117, 'loss/train': 1.4011286497116089} 11/07/2021 16:50:29 - INFO - __main__ - Step 139119: {'lr': 6.640057599467036e-06, 'samples': 26710848, 'steps': 139118, 'loss/train': 0.20065249502658844} 11/07/2021 16:50:30 - INFO - __main__ - Step 139120: {'lr': 6.638842710958937e-06, 'samples': 26711040, 'steps': 139119, 'loss/train': 1.0490094423294067} 11/07/2021 16:50:30 - INFO - __main__ - Step 139121: {'lr': 6.637627932105622e-06, 'samples': 26711232, 'steps': 139120, 'loss/train': 0.7342522740364075} 11/07/2021 16:50:30 - INFO - __main__ - Step 139122: {'lr': 6.636413262907648e-06, 'samples': 26711424, 'steps': 139121, 'loss/train': 1.2575455904006958} 11/07/2021 16:50:31 - INFO - __main__ - Step 139123: {'lr': 6.635198703365569e-06, 'samples': 26711616, 'steps': 139122, 'loss/train': 1.6285896301269531} 11/07/2021 16:50:31 - INFO - __main__ - Step 139124: {'lr': 6.6339842534798855e-06, 'samples': 26711808, 'steps': 139123, 'loss/train': 1.3828904628753662} 11/07/2021 16:50:32 - INFO - __main__ - Step 139125: {'lr': 6.63276991325118e-06, 'samples': 26712000, 'steps': 139124, 'loss/train': 1.1505440473556519} 11/07/2021 16:50:33 - INFO - __main__ - Step 139126: {'lr': 6.6315556826800075e-06, 'samples': 26712192, 'steps': 139125, 'loss/train': 1.4206044673919678} 11/07/2021 16:50:33 - INFO - __main__ - Step 139127: {'lr': 6.630341561766867e-06, 'samples': 26712384, 'steps': 139126, 'loss/train': 1.0266444683074951} 11/07/2021 16:50:33 - INFO - __main__ - Step 139128: {'lr': 6.629127550512398e-06, 'samples': 26712576, 'steps': 139127, 'loss/train': 1.4525502920150757} 11/07/2021 16:50:34 - INFO - __main__ - Step 139129: {'lr': 6.6279136489170445e-06, 'samples': 26712768, 'steps': 139128, 'loss/train': 1.1767627000808716} 11/07/2021 16:50:35 - INFO - __main__ - Step 139130: {'lr': 6.626699856981416e-06, 'samples': 26712960, 'steps': 139129, 'loss/train': 1.0876566171646118} 11/07/2021 16:50:35 - INFO - __main__ - Step 139131: {'lr': 6.625486174706013e-06, 'samples': 26713152, 'steps': 139130, 'loss/train': 1.2034574747085571} 11/07/2021 16:50:35 - INFO - __main__ - Step 139132: {'lr': 6.624272602091447e-06, 'samples': 26713344, 'steps': 139131, 'loss/train': 1.0217946767807007} 11/07/2021 16:50:36 - INFO - __main__ - Step 139133: {'lr': 6.623059139138188e-06, 'samples': 26713536, 'steps': 139132, 'loss/train': 1.149643063545227} 11/07/2021 16:50:36 - INFO - __main__ - Step 139134: {'lr': 6.621845785846848e-06, 'samples': 26713728, 'steps': 139133, 'loss/train': 0.8749340772628784} 11/07/2021 16:50:37 - INFO - __main__ - Step 139135: {'lr': 6.620632542217953e-06, 'samples': 26713920, 'steps': 139134, 'loss/train': 1.635353446006775} 11/07/2021 16:50:38 - INFO - __main__ - Step 139136: {'lr': 6.619419408252031e-06, 'samples': 26714112, 'steps': 139135, 'loss/train': 0.5181225538253784} 11/07/2021 16:50:38 - INFO - __main__ - Step 139137: {'lr': 6.618206383949638e-06, 'samples': 26714304, 'steps': 139136, 'loss/train': 1.5249124765396118} 11/07/2021 16:50:38 - INFO - __main__ - Step 139138: {'lr': 6.616993469311355e-06, 'samples': 26714496, 'steps': 139137, 'loss/train': 1.0765600204467773} 11/07/2021 16:50:39 - INFO - __main__ - Step 139139: {'lr': 6.615780664337684e-06, 'samples': 26714688, 'steps': 139138, 'loss/train': 1.551082730293274} 11/07/2021 16:50:40 - INFO - __main__ - Step 139140: {'lr': 6.614567969029206e-06, 'samples': 26714880, 'steps': 139139, 'loss/train': 1.5490838289260864} 11/07/2021 16:50:40 - INFO - __main__ - Step 139141: {'lr': 6.613355383386421e-06, 'samples': 26715072, 'steps': 139140, 'loss/train': 0.8800719976425171} 11/07/2021 16:50:41 - INFO - __main__ - Step 139142: {'lr': 6.612142907409885e-06, 'samples': 26715264, 'steps': 139141, 'loss/train': 0.2927168607711792} 11/07/2021 16:50:41 - INFO - __main__ - Step 139143: {'lr': 6.610930541100208e-06, 'samples': 26715456, 'steps': 139142, 'loss/train': 1.396134853363037} 11/07/2021 16:50:41 - INFO - __main__ - Step 139144: {'lr': 6.6097182844578605e-06, 'samples': 26715648, 'steps': 139143, 'loss/train': 1.2417281866073608} 11/07/2021 16:50:42 - INFO - __main__ - Step 139145: {'lr': 6.6085061374834e-06, 'samples': 26715840, 'steps': 139144, 'loss/train': 1.1118148565292358} 11/07/2021 16:50:43 - INFO - __main__ - Step 139146: {'lr': 6.607294100177435e-06, 'samples': 26716032, 'steps': 139145, 'loss/train': 0.8418751358985901} 11/07/2021 16:50:43 - INFO - __main__ - Step 139147: {'lr': 6.6060821725404395e-06, 'samples': 26716224, 'steps': 139146, 'loss/train': 0.6900070309638977} 11/07/2021 16:50:43 - INFO - __main__ - Step 139148: {'lr': 6.604870354572995e-06, 'samples': 26716416, 'steps': 139147, 'loss/train': 1.1427356004714966} 11/07/2021 16:50:44 - INFO - __main__ - Step 139149: {'lr': 6.603658646275629e-06, 'samples': 26716608, 'steps': 139148, 'loss/train': 1.3248448371887207} 11/07/2021 16:50:44 - INFO - __main__ - Step 139150: {'lr': 6.6024470476489515e-06, 'samples': 26716800, 'steps': 139149, 'loss/train': 1.0953203439712524} 11/07/2021 16:50:45 - INFO - __main__ - Step 139151: {'lr': 6.601235558693408e-06, 'samples': 26716992, 'steps': 139150, 'loss/train': 1.3943140506744385} 11/07/2021 16:50:46 - INFO - __main__ - Step 139152: {'lr': 6.60002417940958e-06, 'samples': 26717184, 'steps': 139151, 'loss/train': 0.4465072751045227} 11/07/2021 16:50:46 - INFO - __main__ - Step 139153: {'lr': 6.598812909798052e-06, 'samples': 26717376, 'steps': 139152, 'loss/train': 1.5732245445251465} 11/07/2021 16:50:46 - INFO - __main__ - Step 139154: {'lr': 6.597601749859322e-06, 'samples': 26717568, 'steps': 139153, 'loss/train': 1.5029979944229126} 11/07/2021 16:50:47 - INFO - __main__ - Step 139155: {'lr': 6.596390699593974e-06, 'samples': 26717760, 'steps': 139154, 'loss/train': 1.4411123991012573} 11/07/2021 16:50:48 - INFO - __main__ - Step 139156: {'lr': 6.595179759002534e-06, 'samples': 26717952, 'steps': 139155, 'loss/train': 1.6443133354187012} 11/07/2021 16:50:48 - INFO - __main__ - Step 139157: {'lr': 6.593968928085531e-06, 'samples': 26718144, 'steps': 139156, 'loss/train': 1.3650826215744019} 11/07/2021 16:50:48 - INFO - __main__ - Step 139158: {'lr': 6.592758206843547e-06, 'samples': 26718336, 'steps': 139157, 'loss/train': 1.3561710119247437} 11/07/2021 16:50:49 - INFO - __main__ - Step 139159: {'lr': 6.591547595277109e-06, 'samples': 26718528, 'steps': 139158, 'loss/train': 1.4585872888565063} 11/07/2021 16:50:49 - INFO - __main__ - Step 139160: {'lr': 6.590337093386772e-06, 'samples': 26718720, 'steps': 139159, 'loss/train': 1.10319185256958} 11/07/2021 16:50:50 - INFO - __main__ - Step 139161: {'lr': 6.589126701173092e-06, 'samples': 26718912, 'steps': 139160, 'loss/train': 1.5017648935317993} 11/07/2021 16:50:50 - INFO - __main__ - Step 139162: {'lr': 6.587916418636569e-06, 'samples': 26719104, 'steps': 139161, 'loss/train': 1.8081008195877075} 11/07/2021 16:50:51 - INFO - __main__ - Step 139163: {'lr': 6.586706245777757e-06, 'samples': 26719296, 'steps': 139162, 'loss/train': 0.6796047687530518} 11/07/2021 16:50:51 - INFO - __main__ - Step 139164: {'lr': 6.585496182597239e-06, 'samples': 26719488, 'steps': 139163, 'loss/train': 1.3750147819519043} 11/07/2021 16:50:51 - INFO - __main__ - Step 139165: {'lr': 6.584286229095543e-06, 'samples': 26719680, 'steps': 139164, 'loss/train': 0.9716690182685852} 11/07/2021 16:50:53 - INFO - __main__ - Step 139166: {'lr': 6.583076385273196e-06, 'samples': 26719872, 'steps': 139165, 'loss/train': 1.1454083919525146} 11/07/2021 16:50:53 - INFO - __main__ - Step 139167: {'lr': 6.58186665113078e-06, 'samples': 26720064, 'steps': 139166, 'loss/train': 1.3238755464553833} 11/07/2021 16:50:53 - INFO - __main__ - Step 139168: {'lr': 6.580657026668796e-06, 'samples': 26720256, 'steps': 139167, 'loss/train': 1.5268237590789795} 11/07/2021 16:50:54 - INFO - __main__ - Step 139169: {'lr': 6.579447511887826e-06, 'samples': 26720448, 'steps': 139168, 'loss/train': 1.3018754720687866} 11/07/2021 16:50:54 - INFO - __main__ - Step 139170: {'lr': 6.578238106788398e-06, 'samples': 26720640, 'steps': 139169, 'loss/train': 1.124124526977539} 11/07/2021 16:50:55 - INFO - __main__ - Step 139171: {'lr': 6.577028811371039e-06, 'samples': 26720832, 'steps': 139170, 'loss/train': 1.2437015771865845} 11/07/2021 16:50:55 - INFO - __main__ - Step 139172: {'lr': 6.575819625636359e-06, 'samples': 26721024, 'steps': 139171, 'loss/train': 1.582335114479065} 11/07/2021 16:50:56 - INFO - __main__ - Step 139173: {'lr': 6.574610549584831e-06, 'samples': 26721216, 'steps': 139172, 'loss/train': 1.2482117414474487} 11/07/2021 16:50:56 - INFO - __main__ - Step 139174: {'lr': 6.573401583217037e-06, 'samples': 26721408, 'steps': 139173, 'loss/train': 1.5063477754592896} 11/07/2021 16:50:56 - INFO - __main__ - Step 139175: {'lr': 6.572192726533505e-06, 'samples': 26721600, 'steps': 139174, 'loss/train': 1.2507524490356445} 11/07/2021 16:50:58 - INFO - __main__ - Step 139176: {'lr': 6.570983979534789e-06, 'samples': 26721792, 'steps': 139175, 'loss/train': 1.1878660917282104} 11/07/2021 16:50:58 - INFO - __main__ - Step 139177: {'lr': 6.569775342221418e-06, 'samples': 26721984, 'steps': 139176, 'loss/train': 0.9974116086959839} 11/07/2021 16:50:58 - INFO - __main__ - Step 139178: {'lr': 6.568566814593974e-06, 'samples': 26722176, 'steps': 139177, 'loss/train': 1.3321408033370972} 11/07/2021 16:50:59 - INFO - __main__ - Step 139179: {'lr': 6.567358396652956e-06, 'samples': 26722368, 'steps': 139178, 'loss/train': 1.0141781568527222} 11/07/2021 16:50:59 - INFO - __main__ - Step 139180: {'lr': 6.5661500883989476e-06, 'samples': 26722560, 'steps': 139179, 'loss/train': 1.0572714805603027} 11/07/2021 16:50:59 - INFO - __main__ - Step 139181: {'lr': 6.564941889832449e-06, 'samples': 26722752, 'steps': 139180, 'loss/train': 1.2556893825531006} 11/07/2021 16:51:00 - INFO - __main__ - Step 139182: {'lr': 6.563733800954069e-06, 'samples': 26722944, 'steps': 139181, 'loss/train': 1.346535325050354} 11/07/2021 16:51:01 - INFO - __main__ - Step 139183: {'lr': 6.562525821764281e-06, 'samples': 26723136, 'steps': 139182, 'loss/train': 1.1211447715759277} 11/07/2021 16:51:01 - INFO - __main__ - Step 139184: {'lr': 6.561317952263668e-06, 'samples': 26723328, 'steps': 139183, 'loss/train': 0.9191237688064575} 11/07/2021 16:51:01 - INFO - __main__ - Step 139185: {'lr': 6.560110192452812e-06, 'samples': 26723520, 'steps': 139184, 'loss/train': 1.2801693677902222} 11/07/2021 16:51:02 - INFO - __main__ - Step 139186: {'lr': 6.5589025423321565e-06, 'samples': 26723712, 'steps': 139185, 'loss/train': 0.9964375495910645} 11/07/2021 16:51:03 - INFO - __main__ - Step 139187: {'lr': 6.5576950019023415e-06, 'samples': 26723904, 'steps': 139186, 'loss/train': 1.513369083404541} 11/07/2021 16:51:03 - INFO - __main__ - Step 139188: {'lr': 6.556487571163838e-06, 'samples': 26724096, 'steps': 139187, 'loss/train': 1.5539896488189697} 11/07/2021 16:51:03 - INFO - __main__ - Step 139189: {'lr': 6.555280250117257e-06, 'samples': 26724288, 'steps': 139188, 'loss/train': 1.6598787307739258} 11/07/2021 16:51:04 - INFO - __main__ - Step 139190: {'lr': 6.554073038763097e-06, 'samples': 26724480, 'steps': 139189, 'loss/train': 1.3455939292907715} 11/07/2021 16:51:04 - INFO - __main__ - Step 139191: {'lr': 6.552865937101887e-06, 'samples': 26724672, 'steps': 139190, 'loss/train': 0.6965353488922119} 11/07/2021 16:51:05 - INFO - __main__ - Step 139192: {'lr': 6.551658945134237e-06, 'samples': 26724864, 'steps': 139191, 'loss/train': 1.295003056526184} 11/07/2021 16:51:06 - INFO - __main__ - Step 139193: {'lr': 6.550452062860646e-06, 'samples': 26725056, 'steps': 139192, 'loss/train': 1.4453339576721191} 11/07/2021 16:51:06 - INFO - __main__ - Step 139194: {'lr': 6.5492452902816415e-06, 'samples': 26725248, 'steps': 139193, 'loss/train': 1.2564589977264404} 11/07/2021 16:51:06 - INFO - __main__ - Step 139195: {'lr': 6.548038627397807e-06, 'samples': 26725440, 'steps': 139194, 'loss/train': 1.5100593566894531} 11/07/2021 16:51:07 - INFO - __main__ - Step 139196: {'lr': 6.5468320742096684e-06, 'samples': 26725632, 'steps': 139195, 'loss/train': 1.1057450771331787} 11/07/2021 16:51:08 - INFO - __main__ - Step 139197: {'lr': 6.545625630717783e-06, 'samples': 26725824, 'steps': 139196, 'loss/train': 1.4810537099838257} 11/07/2021 16:51:08 - INFO - __main__ - Step 139198: {'lr': 6.5444192969226765e-06, 'samples': 26726016, 'steps': 139197, 'loss/train': 1.1104450225830078} 11/07/2021 16:51:08 - INFO - __main__ - Step 139199: {'lr': 6.543213072824905e-06, 'samples': 26726208, 'steps': 139198, 'loss/train': 1.36050283908844} 11/07/2021 16:51:09 - INFO - __main__ - Step 139200: {'lr': 6.542006958424996e-06, 'samples': 26726400, 'steps': 139199, 'loss/train': 1.2474676370620728} 11/07/2021 16:51:09 - INFO - __main__ - Step 139201: {'lr': 6.540800953723502e-06, 'samples': 26726592, 'steps': 139200, 'loss/train': 1.3429745435714722} 11/07/2021 16:51:10 - INFO - __main__ - Step 139202: {'lr': 6.539595058720954e-06, 'samples': 26726784, 'steps': 139201, 'loss/train': 1.646437406539917} 11/07/2021 16:51:10 - INFO - __main__ - Step 139203: {'lr': 6.538389273417933e-06, 'samples': 26726976, 'steps': 139202, 'loss/train': 1.3112725019454956} 11/07/2021 16:51:11 - INFO - __main__ - Step 139204: {'lr': 6.537183597814938e-06, 'samples': 26727168, 'steps': 139203, 'loss/train': 1.4154956340789795} 11/07/2021 16:51:11 - INFO - __main__ - Step 139205: {'lr': 6.535978031912526e-06, 'samples': 26727360, 'steps': 139204, 'loss/train': 1.1832668781280518} 11/07/2021 16:51:12 - INFO - __main__ - Step 139206: {'lr': 6.534772575711251e-06, 'samples': 26727552, 'steps': 139205, 'loss/train': 1.4488551616668701} 11/07/2021 16:51:12 - INFO - __main__ - Step 139207: {'lr': 6.53356722921164e-06, 'samples': 26727744, 'steps': 139206, 'loss/train': 0.9829655289649963} 11/07/2021 16:51:13 - INFO - __main__ - Step 139208: {'lr': 6.532361992414277e-06, 'samples': 26727936, 'steps': 139207, 'loss/train': 1.2975788116455078} 11/07/2021 16:51:13 - INFO - __main__ - Step 139209: {'lr': 6.531156865319659e-06, 'samples': 26728128, 'steps': 139208, 'loss/train': 1.3509963750839233} 11/07/2021 16:51:14 - INFO - __main__ - Step 139210: {'lr': 6.529951847928317e-06, 'samples': 26728320, 'steps': 139209, 'loss/train': 1.564755916595459} 11/07/2021 16:51:14 - INFO - __main__ - Step 139211: {'lr': 6.528746940240859e-06, 'samples': 26728512, 'steps': 139210, 'loss/train': 1.2798329591751099} 11/07/2021 16:51:14 - INFO - __main__ - Step 139212: {'lr': 6.527542142257814e-06, 'samples': 26728704, 'steps': 139211, 'loss/train': 1.1056140661239624} 11/07/2021 16:51:15 - INFO - __main__ - Step 139213: {'lr': 6.526337453979653e-06, 'samples': 26728896, 'steps': 139212, 'loss/train': 0.6942272186279297} 11/07/2021 16:51:16 - INFO - __main__ - Step 139214: {'lr': 6.525132875406986e-06, 'samples': 26729088, 'steps': 139213, 'loss/train': 1.645990252494812} 11/07/2021 16:51:16 - INFO - __main__ - Step 139215: {'lr': 6.523928406540341e-06, 'samples': 26729280, 'steps': 139214, 'loss/train': 1.0103024244308472} 11/07/2021 16:51:16 - INFO - __main__ - Step 139216: {'lr': 6.5227240473802466e-06, 'samples': 26729472, 'steps': 139215, 'loss/train': 1.2099595069885254} 11/07/2021 16:51:17 - INFO - __main__ - Step 139217: {'lr': 6.521519797927255e-06, 'samples': 26729664, 'steps': 139216, 'loss/train': 0.8024139404296875} 11/07/2021 16:51:18 - INFO - __main__ - Step 139218: {'lr': 6.520315658181897e-06, 'samples': 26729856, 'steps': 139217, 'loss/train': 1.228328824043274} 11/07/2021 16:51:18 - INFO - __main__ - Step 139219: {'lr': 6.519111628144753e-06, 'samples': 26730048, 'steps': 139218, 'loss/train': 1.3376363515853882} 11/07/2021 16:51:18 - INFO - __main__ - Step 139220: {'lr': 6.517907707816323e-06, 'samples': 26730240, 'steps': 139219, 'loss/train': 1.020680546760559} 11/07/2021 16:51:19 - INFO - __main__ - Step 139221: {'lr': 6.516703897197163e-06, 'samples': 26730432, 'steps': 139220, 'loss/train': 1.3731662034988403} 11/07/2021 16:51:19 - INFO - __main__ - Step 139222: {'lr': 6.515500196287827e-06, 'samples': 26730624, 'steps': 139221, 'loss/train': 1.0702263116836548} 11/07/2021 16:51:20 - INFO - __main__ - Step 139223: {'lr': 6.514296605088871e-06, 'samples': 26730816, 'steps': 139222, 'loss/train': 1.4763333797454834} 11/07/2021 16:51:20 - INFO - __main__ - Step 139224: {'lr': 6.513093123600794e-06, 'samples': 26731008, 'steps': 139223, 'loss/train': 1.3359736204147339} 11/07/2021 16:51:21 - INFO - __main__ - Step 139225: {'lr': 6.511889751824151e-06, 'samples': 26731200, 'steps': 139224, 'loss/train': 1.0747270584106445} 11/07/2021 16:51:21 - INFO - __main__ - Step 139226: {'lr': 6.510686489759527e-06, 'samples': 26731392, 'steps': 139225, 'loss/train': 0.966978907585144} 11/07/2021 16:51:22 - INFO - __main__ - Step 139227: {'lr': 6.509483337407418e-06, 'samples': 26731584, 'steps': 139226, 'loss/train': 1.3704904317855835} 11/07/2021 16:51:23 - INFO - __main__ - Step 139228: {'lr': 6.508280294768354e-06, 'samples': 26731776, 'steps': 139227, 'loss/train': 1.0103812217712402} 11/07/2021 16:51:23 - INFO - __main__ - Step 139229: {'lr': 6.507077361842917e-06, 'samples': 26731968, 'steps': 139228, 'loss/train': 1.1630247831344604} 11/07/2021 16:51:23 - INFO - __main__ - Step 139230: {'lr': 6.505874538631635e-06, 'samples': 26732160, 'steps': 139229, 'loss/train': 1.2588893175125122} 11/07/2021 16:51:24 - INFO - __main__ - Step 139231: {'lr': 6.5046718251350335e-06, 'samples': 26732352, 'steps': 139230, 'loss/train': 1.1858773231506348} 11/07/2021 16:51:24 - INFO - __main__ - Step 139232: {'lr': 6.503469221353697e-06, 'samples': 26732544, 'steps': 139231, 'loss/train': 0.19916170835494995} 11/07/2021 16:51:25 - INFO - __main__ - Step 139233: {'lr': 6.5022667272881256e-06, 'samples': 26732736, 'steps': 139232, 'loss/train': 1.0130517482757568} 11/07/2021 16:51:26 - INFO - __main__ - Step 139234: {'lr': 6.5010643429388724e-06, 'samples': 26732928, 'steps': 139233, 'loss/train': 1.2905751466751099} 11/07/2021 16:51:26 - INFO - __main__ - Step 139235: {'lr': 6.499862068306495e-06, 'samples': 26733120, 'steps': 139234, 'loss/train': 1.7246451377868652} 11/07/2021 16:51:27 - INFO - __main__ - Step 139236: {'lr': 6.49865990339149e-06, 'samples': 26733312, 'steps': 139235, 'loss/train': 0.9996336102485657} 11/07/2021 16:51:27 - INFO - __main__ - Step 139237: {'lr': 6.497457848194472e-06, 'samples': 26733504, 'steps': 139236, 'loss/train': 1.0626846551895142} 11/07/2021 16:51:27 - INFO - __main__ - Step 139238: {'lr': 6.496255902715908e-06, 'samples': 26733696, 'steps': 139237, 'loss/train': 0.9861428737640381} 11/07/2021 16:51:28 - INFO - __main__ - Step 139239: {'lr': 6.495054066956413e-06, 'samples': 26733888, 'steps': 139238, 'loss/train': 0.7578033804893494} 11/07/2021 16:51:29 - INFO - __main__ - Step 139240: {'lr': 6.493852340916456e-06, 'samples': 26734080, 'steps': 139239, 'loss/train': 1.2096002101898193} 11/07/2021 16:51:29 - INFO - __main__ - Step 139241: {'lr': 6.492650724596621e-06, 'samples': 26734272, 'steps': 139240, 'loss/train': 1.4274933338165283} 11/07/2021 16:51:29 - INFO - __main__ - Step 139242: {'lr': 6.491449217997436e-06, 'samples': 26734464, 'steps': 139241, 'loss/train': 1.2747485637664795} 11/07/2021 16:51:30 - INFO - __main__ - Step 139243: {'lr': 6.490247821119455e-06, 'samples': 26734656, 'steps': 139242, 'loss/train': 1.25399649143219} 11/07/2021 16:51:31 - INFO - __main__ - Step 139244: {'lr': 6.489046533963205e-06, 'samples': 26734848, 'steps': 139243, 'loss/train': 1.6742147207260132} 11/07/2021 16:51:31 - INFO - __main__ - Step 139245: {'lr': 6.487845356529243e-06, 'samples': 26735040, 'steps': 139244, 'loss/train': 1.5113989114761353} 11/07/2021 16:51:32 - INFO - __main__ - Step 139246: {'lr': 6.4866442888180945e-06, 'samples': 26735232, 'steps': 139245, 'loss/train': 0.9267975687980652} 11/07/2021 16:51:32 - INFO - __main__ - Step 139247: {'lr': 6.485443330830287e-06, 'samples': 26735424, 'steps': 139246, 'loss/train': 1.4041517972946167} 11/07/2021 16:51:32 - INFO - __main__ - Step 139248: {'lr': 6.484242482566404e-06, 'samples': 26735616, 'steps': 139247, 'loss/train': 1.1584266424179077} 11/07/2021 16:51:33 - INFO - __main__ - Step 139249: {'lr': 6.483041744026946e-06, 'samples': 26735808, 'steps': 139248, 'loss/train': 1.5194873809814453} 11/07/2021 16:51:34 - INFO - __main__ - Step 139250: {'lr': 6.481841115212495e-06, 'samples': 26736000, 'steps': 139249, 'loss/train': 1.1962602138519287} 11/07/2021 16:51:34 - INFO - __main__ - Step 139251: {'lr': 6.48064059612355e-06, 'samples': 26736192, 'steps': 139250, 'loss/train': 1.0467032194137573} 11/07/2021 16:51:34 - INFO - __main__ - Step 139252: {'lr': 6.479440186760693e-06, 'samples': 26736384, 'steps': 139251, 'loss/train': 1.6555591821670532} 11/07/2021 16:51:35 - INFO - __main__ - Step 139253: {'lr': 6.478239887124426e-06, 'samples': 26736576, 'steps': 139252, 'loss/train': 1.0611737966537476} 11/07/2021 16:51:36 - INFO - __main__ - Step 139254: {'lr': 6.477039697215331e-06, 'samples': 26736768, 'steps': 139253, 'loss/train': 1.4039466381072998} 11/07/2021 16:51:36 - INFO - __main__ - Step 139255: {'lr': 6.4758396170338796e-06, 'samples': 26736960, 'steps': 139254, 'loss/train': 1.4730831384658813} 11/07/2021 16:51:37 - INFO - __main__ - Step 139256: {'lr': 6.4746396465806824e-06, 'samples': 26737152, 'steps': 139255, 'loss/train': 1.0134267807006836} 11/07/2021 16:51:37 - INFO - __main__ - Step 139257: {'lr': 6.473439785856239e-06, 'samples': 26737344, 'steps': 139256, 'loss/train': 1.3315668106079102} 11/07/2021 16:51:37 - INFO - __main__ - Step 139258: {'lr': 6.4722400348611325e-06, 'samples': 26737536, 'steps': 139257, 'loss/train': 0.8238503336906433} 11/07/2021 16:51:38 - INFO - __main__ - Step 139259: {'lr': 6.471040393595862e-06, 'samples': 26737728, 'steps': 139258, 'loss/train': 0.632785975933075} 11/07/2021 16:51:39 - INFO - __main__ - Step 139260: {'lr': 6.469840862060983e-06, 'samples': 26737920, 'steps': 139259, 'loss/train': 1.293076992034912} 11/07/2021 16:51:39 - INFO - __main__ - Step 139261: {'lr': 6.468641440257023e-06, 'samples': 26738112, 'steps': 139260, 'loss/train': 1.4340816736221313} 11/07/2021 16:51:39 - INFO - __main__ - Step 139262: {'lr': 6.467442128184537e-06, 'samples': 26738304, 'steps': 139261, 'loss/train': 0.36692970991134644} 11/07/2021 16:51:40 - INFO - __main__ - Step 139263: {'lr': 6.466242925844079e-06, 'samples': 26738496, 'steps': 139262, 'loss/train': 1.0916188955307007} 11/07/2021 16:51:40 - INFO - __main__ - Step 139264: {'lr': 6.465043833236178e-06, 'samples': 26738688, 'steps': 139263, 'loss/train': 0.7873523235321045} 11/07/2021 16:51:41 - INFO - __main__ - Step 139265: {'lr': 6.463844850361361e-06, 'samples': 26738880, 'steps': 139264, 'loss/train': 1.4646313190460205} 11/07/2021 16:51:41 - INFO - __main__ - Step 139266: {'lr': 6.462645977220183e-06, 'samples': 26739072, 'steps': 139265, 'loss/train': 1.462762713432312} 11/07/2021 16:51:42 - INFO - __main__ - Step 139267: {'lr': 6.461447213813171e-06, 'samples': 26739264, 'steps': 139266, 'loss/train': 1.5431736707687378} 11/07/2021 16:51:42 - INFO - __main__ - Step 139268: {'lr': 6.46024856014088e-06, 'samples': 26739456, 'steps': 139267, 'loss/train': 1.5982719659805298} 11/07/2021 16:51:42 - INFO - __main__ - Step 139269: {'lr': 6.459050016203838e-06, 'samples': 26739648, 'steps': 139268, 'loss/train': 1.0640398263931274} 11/07/2021 16:51:43 - INFO - __main__ - Step 139270: {'lr': 6.4578515820026e-06, 'samples': 26739840, 'steps': 139269, 'loss/train': 1.411085605621338} 11/07/2021 16:51:44 - INFO - __main__ - Step 139271: {'lr': 6.456653257537665e-06, 'samples': 26740032, 'steps': 139270, 'loss/train': 1.273200273513794} 11/07/2021 16:51:44 - INFO - __main__ - Step 139272: {'lr': 6.455455042809644e-06, 'samples': 26740224, 'steps': 139271, 'loss/train': 1.183526873588562} 11/07/2021 16:51:45 - INFO - __main__ - Step 139273: {'lr': 6.454256937819008e-06, 'samples': 26740416, 'steps': 139272, 'loss/train': 1.1547335386276245} 11/07/2021 16:51:45 - INFO - __main__ - Step 139274: {'lr': 6.453058942566342e-06, 'samples': 26740608, 'steps': 139273, 'loss/train': 1.059961199760437} 11/07/2021 16:51:46 - INFO - __main__ - Step 139275: {'lr': 6.451861057052144e-06, 'samples': 26740800, 'steps': 139274, 'loss/train': 1.2950797080993652} 11/07/2021 16:51:46 - INFO - __main__ - Step 139276: {'lr': 6.450663281276998e-06, 'samples': 26740992, 'steps': 139275, 'loss/train': 1.1944352388381958} 11/07/2021 16:51:47 - INFO - __main__ - Step 139277: {'lr': 6.449465615241429e-06, 'samples': 26741184, 'steps': 139276, 'loss/train': 1.1281229257583618} 11/07/2021 16:51:47 - INFO - __main__ - Step 139278: {'lr': 6.448268058945966e-06, 'samples': 26741376, 'steps': 139277, 'loss/train': 1.6175556182861328} 11/07/2021 16:51:47 - INFO - __main__ - Step 139279: {'lr': 6.447070612391193e-06, 'samples': 26741568, 'steps': 139278, 'loss/train': 1.4411970376968384} 11/07/2021 16:51:48 - INFO - __main__ - Step 139280: {'lr': 6.445873275577579e-06, 'samples': 26741760, 'steps': 139279, 'loss/train': 1.5698304176330566} 11/07/2021 16:51:49 - INFO - __main__ - Step 139281: {'lr': 6.444676048505682e-06, 'samples': 26741952, 'steps': 139280, 'loss/train': 1.1699676513671875} 11/07/2021 16:51:49 - INFO - __main__ - Step 139282: {'lr': 6.443478931176056e-06, 'samples': 26742144, 'steps': 139281, 'loss/train': 1.2215832471847534} 11/07/2021 16:51:49 - INFO - __main__ - Step 139283: {'lr': 6.442281923589255e-06, 'samples': 26742336, 'steps': 139282, 'loss/train': 1.6882282495498657} 11/07/2021 16:51:50 - INFO - __main__ - Step 139284: {'lr': 6.441085025745808e-06, 'samples': 26742528, 'steps': 139283, 'loss/train': 0.7911263704299927} 11/07/2021 16:51:51 - INFO - __main__ - Step 139285: {'lr': 6.439888237646241e-06, 'samples': 26742720, 'steps': 139284, 'loss/train': 1.458080530166626} 11/07/2021 16:51:52 - INFO - __main__ - Step 139286: {'lr': 6.43869155929111e-06, 'samples': 26742912, 'steps': 139285, 'loss/train': 1.4298739433288574} 11/07/2021 16:51:52 - INFO - __main__ - Step 139287: {'lr': 6.437494990680914e-06, 'samples': 26743104, 'steps': 139286, 'loss/train': 1.0947452783584595} 11/07/2021 16:51:52 - INFO - __main__ - Step 139288: {'lr': 6.436298531816265e-06, 'samples': 26743296, 'steps': 139287, 'loss/train': 1.1469610929489136} 11/07/2021 16:51:53 - INFO - __main__ - Step 139289: {'lr': 6.435102182697633e-06, 'samples': 26743488, 'steps': 139288, 'loss/train': 1.0646754503250122} 11/07/2021 16:51:53 - INFO - __main__ - Step 139290: {'lr': 6.4339059433256016e-06, 'samples': 26743680, 'steps': 139289, 'loss/train': 1.0052387714385986} 11/07/2021 16:51:54 - INFO - __main__ - Step 139291: {'lr': 6.432709813700699e-06, 'samples': 26743872, 'steps': 139290, 'loss/train': 1.1487001180648804} 11/07/2021 16:51:54 - INFO - __main__ - Step 139292: {'lr': 6.431513793823451e-06, 'samples': 26744064, 'steps': 139291, 'loss/train': 1.4007139205932617} 11/07/2021 16:51:55 - INFO - __main__ - Step 139293: {'lr': 6.430317883694414e-06, 'samples': 26744256, 'steps': 139292, 'loss/train': 1.0672187805175781} 11/07/2021 16:51:55 - INFO - __main__ - Step 139294: {'lr': 6.429122083314115e-06, 'samples': 26744448, 'steps': 139293, 'loss/train': 0.7931724786758423} 11/07/2021 16:51:55 - INFO - __main__ - Step 139295: {'lr': 6.427926392683081e-06, 'samples': 26744640, 'steps': 139294, 'loss/train': 1.5475420951843262} 11/07/2021 16:51:56 - INFO - __main__ - Step 139296: {'lr': 6.426730811801867e-06, 'samples': 26744832, 'steps': 139295, 'loss/train': 1.2095777988433838} 11/07/2021 16:51:57 - INFO - __main__ - Step 139297: {'lr': 6.425535340671001e-06, 'samples': 26745024, 'steps': 139296, 'loss/train': 1.2907963991165161} 11/07/2021 16:51:57 - INFO - __main__ - Step 139298: {'lr': 6.424339979291066e-06, 'samples': 26745216, 'steps': 139297, 'loss/train': 1.4855440855026245} 11/07/2021 16:51:58 - INFO - __main__ - Step 139299: {'lr': 6.423144727662534e-06, 'samples': 26745408, 'steps': 139298, 'loss/train': 1.5615119934082031} 11/07/2021 16:51:58 - INFO - __main__ - Step 139300: {'lr': 6.421949585785986e-06, 'samples': 26745600, 'steps': 139299, 'loss/train': 1.1406136751174927} 11/07/2021 16:51:59 - INFO - __main__ - Step 139301: {'lr': 6.4207545536619226e-06, 'samples': 26745792, 'steps': 139300, 'loss/train': 1.3394486904144287} 11/07/2021 16:51:59 - INFO - __main__ - Step 139302: {'lr': 6.419559631290928e-06, 'samples': 26745984, 'steps': 139301, 'loss/train': 1.1508944034576416} 11/07/2021 16:52:00 - INFO - __main__ - Step 139303: {'lr': 6.418364818673528e-06, 'samples': 26746176, 'steps': 139302, 'loss/train': 1.0968912839889526} 11/07/2021 16:52:00 - INFO - __main__ - Step 139304: {'lr': 6.41717011581025e-06, 'samples': 26746368, 'steps': 139303, 'loss/train': 1.3191357851028442} 11/07/2021 16:52:00 - INFO - __main__ - Step 139305: {'lr': 6.41597552270165e-06, 'samples': 26746560, 'steps': 139304, 'loss/train': 1.3122397661209106} 11/07/2021 16:52:01 - INFO - __main__ - Step 139306: {'lr': 6.414781039348255e-06, 'samples': 26746752, 'steps': 139305, 'loss/train': 1.388149380683899} 11/07/2021 16:52:02 - INFO - __main__ - Step 139307: {'lr': 6.4135866657505924e-06, 'samples': 26746944, 'steps': 139306, 'loss/train': 1.2645156383514404} 11/07/2021 16:52:02 - INFO - __main__ - Step 139308: {'lr': 6.4123924019091896e-06, 'samples': 26747136, 'steps': 139307, 'loss/train': 1.5454264879226685} 11/07/2021 16:52:02 - INFO - __main__ - Step 139309: {'lr': 6.411198247824601e-06, 'samples': 26747328, 'steps': 139308, 'loss/train': 1.3919397592544556} 11/07/2021 16:52:03 - INFO - __main__ - Step 139310: {'lr': 6.4100042034973825e-06, 'samples': 26747520, 'steps': 139309, 'loss/train': 1.3051987886428833} 11/07/2021 16:52:04 - INFO - __main__ - Step 139311: {'lr': 6.408810268928062e-06, 'samples': 26747712, 'steps': 139310, 'loss/train': 1.4442851543426514} 11/07/2021 16:52:04 - INFO - __main__ - Step 139312: {'lr': 6.407616444117164e-06, 'samples': 26747904, 'steps': 139311, 'loss/train': 1.2959176301956177} 11/07/2021 16:52:04 - INFO - __main__ - Step 139313: {'lr': 6.406422729065248e-06, 'samples': 26748096, 'steps': 139312, 'loss/train': 2.011352300643921} 11/07/2021 16:52:05 - INFO - __main__ - Step 139314: {'lr': 6.405229123772838e-06, 'samples': 26748288, 'steps': 139313, 'loss/train': 1.4554550647735596} 11/07/2021 16:52:05 - INFO - __main__ - Step 139315: {'lr': 6.4040356282404626e-06, 'samples': 26748480, 'steps': 139314, 'loss/train': 1.3937162160873413} 11/07/2021 16:52:06 - INFO - __main__ - Step 139316: {'lr': 6.4028422424686486e-06, 'samples': 26748672, 'steps': 139315, 'loss/train': 0.4948905408382416} 11/07/2021 16:52:06 - INFO - __main__ - Step 139317: {'lr': 6.4016489664579795e-06, 'samples': 26748864, 'steps': 139316, 'loss/train': 1.5871968269348145} 11/07/2021 16:52:07 - INFO - __main__ - Step 139318: {'lr': 6.400455800208982e-06, 'samples': 26749056, 'steps': 139317, 'loss/train': 0.9227327704429626} 11/07/2021 16:52:07 - INFO - __main__ - Step 139319: {'lr': 6.3992627437221565e-06, 'samples': 26749248, 'steps': 139318, 'loss/train': 1.2908084392547607} 11/07/2021 16:52:08 - INFO - __main__ - Step 139320: {'lr': 6.398069796998113e-06, 'samples': 26749440, 'steps': 139319, 'loss/train': 1.5389436483383179} 11/07/2021 16:52:09 - INFO - __main__ - Step 139321: {'lr': 6.396876960037296e-06, 'samples': 26749632, 'steps': 139320, 'loss/train': 1.4128628969192505} 11/07/2021 16:52:09 - INFO - __main__ - Step 139322: {'lr': 6.395684232840287e-06, 'samples': 26749824, 'steps': 139321, 'loss/train': 1.3267920017242432} 11/07/2021 16:52:09 - INFO - __main__ - Step 139323: {'lr': 6.394491615407616e-06, 'samples': 26750016, 'steps': 139322, 'loss/train': 1.206915259361267} 11/07/2021 16:52:10 - INFO - __main__ - Step 139324: {'lr': 6.393299107739836e-06, 'samples': 26750208, 'steps': 139323, 'loss/train': 0.9640383720397949} 11/07/2021 16:52:10 - INFO - __main__ - Step 139325: {'lr': 6.392106709837475e-06, 'samples': 26750400, 'steps': 139324, 'loss/train': 1.0161340236663818} 11/07/2021 16:52:11 - INFO - __main__ - Step 139326: {'lr': 6.390914421701088e-06, 'samples': 26750592, 'steps': 139325, 'loss/train': 1.2912219762802124} 11/07/2021 16:52:11 - INFO - __main__ - Step 139327: {'lr': 6.389722243331175e-06, 'samples': 26750784, 'steps': 139326, 'loss/train': 1.4111170768737793} 11/07/2021 16:52:12 - INFO - __main__ - Step 139328: {'lr': 6.388530174728319e-06, 'samples': 26750976, 'steps': 139327, 'loss/train': 0.044540077447891235} 11/07/2021 16:52:12 - INFO - __main__ - Step 139329: {'lr': 6.387338215893018e-06, 'samples': 26751168, 'steps': 139328, 'loss/train': 1.3290627002716064} 11/07/2021 16:52:12 - INFO - __main__ - Step 139330: {'lr': 6.386146366825829e-06, 'samples': 26751360, 'steps': 139329, 'loss/train': 0.789925217628479} 11/07/2021 16:52:14 - INFO - __main__ - Step 139331: {'lr': 6.384954627527251e-06, 'samples': 26751552, 'steps': 139330, 'loss/train': 1.3645902872085571} 11/07/2021 16:52:15 - INFO - __main__ - Step 139332: {'lr': 6.383762997997894e-06, 'samples': 26751744, 'steps': 139331, 'loss/train': 1.2554609775543213} 11/07/2021 16:52:15 - INFO - __main__ - Step 139333: {'lr': 6.382571478238258e-06, 'samples': 26751936, 'steps': 139332, 'loss/train': 1.4098405838012695} 11/07/2021 16:52:15 - INFO - __main__ - Step 139334: {'lr': 6.381380068248844e-06, 'samples': 26752128, 'steps': 139333, 'loss/train': 1.7348772287368774} 11/07/2021 16:52:16 - INFO - __main__ - Step 139335: {'lr': 6.380188768030232e-06, 'samples': 26752320, 'steps': 139334, 'loss/train': 1.7282812595367432} 11/07/2021 16:52:16 - INFO - __main__ - Step 139336: {'lr': 6.378997577582951e-06, 'samples': 26752512, 'steps': 139335, 'loss/train': 1.740593671798706} 11/07/2021 16:52:16 - INFO - __main__ - Step 139337: {'lr': 6.377806496907557e-06, 'samples': 26752704, 'steps': 139336, 'loss/train': 1.339949369430542} 11/07/2021 16:52:18 - INFO - __main__ - Step 139338: {'lr': 6.37661552600452e-06, 'samples': 26752896, 'steps': 139337, 'loss/train': 1.1736735105514526} 11/07/2021 16:52:18 - INFO - __main__ - Step 139339: {'lr': 6.375424664874452e-06, 'samples': 26753088, 'steps': 139338, 'loss/train': 1.3916382789611816} 11/07/2021 16:52:18 - INFO - __main__ - Step 139340: {'lr': 6.374233913517852e-06, 'samples': 26753280, 'steps': 139339, 'loss/train': 0.9063202142715454} 11/07/2021 16:52:19 - INFO - __main__ - Step 139341: {'lr': 6.373043271935247e-06, 'samples': 26753472, 'steps': 139340, 'loss/train': 1.120537519454956} 11/07/2021 16:52:19 - INFO - __main__ - Step 139342: {'lr': 6.371852740127193e-06, 'samples': 26753664, 'steps': 139341, 'loss/train': 0.2057264745235443} 11/07/2021 16:52:19 - INFO - __main__ - Step 139343: {'lr': 6.370662318094245e-06, 'samples': 26753856, 'steps': 139342, 'loss/train': 1.1430033445358276} 11/07/2021 16:52:21 - INFO - __main__ - Step 139344: {'lr': 6.369472005836901e-06, 'samples': 26754048, 'steps': 139343, 'loss/train': 1.505474328994751} 11/07/2021 16:52:21 - INFO - __main__ - Step 139345: {'lr': 6.368281803355691e-06, 'samples': 26754240, 'steps': 139344, 'loss/train': 1.2948812246322632} 11/07/2021 16:52:21 - INFO - __main__ - Step 139346: {'lr': 6.367091710651196e-06, 'samples': 26754432, 'steps': 139345, 'loss/train': 1.1072701215744019} 11/07/2021 16:52:22 - INFO - __main__ - Step 139347: {'lr': 6.365901727723972e-06, 'samples': 26754624, 'steps': 139346, 'loss/train': 1.5417646169662476} 11/07/2021 16:52:22 - INFO - __main__ - Step 139348: {'lr': 6.364711854574462e-06, 'samples': 26754816, 'steps': 139347, 'loss/train': 1.2330173254013062} 11/07/2021 16:52:23 - INFO - __main__ - Step 139349: {'lr': 6.363522091203278e-06, 'samples': 26755008, 'steps': 139348, 'loss/train': 1.7175275087356567} 11/07/2021 16:52:23 - INFO - __main__ - Step 139350: {'lr': 6.362332437610918e-06, 'samples': 26755200, 'steps': 139349, 'loss/train': 1.233462929725647} 11/07/2021 16:52:24 - INFO - __main__ - Step 139351: {'lr': 6.361142893797911e-06, 'samples': 26755392, 'steps': 139350, 'loss/train': 1.132408857345581} 11/07/2021 16:52:24 - INFO - __main__ - Step 139352: {'lr': 6.359953459764839e-06, 'samples': 26755584, 'steps': 139351, 'loss/train': 1.0465153455734253} 11/07/2021 16:52:24 - INFO - __main__ - Step 139353: {'lr': 6.358764135512202e-06, 'samples': 26755776, 'steps': 139352, 'loss/train': 1.271104097366333} 11/07/2021 16:52:25 - INFO - __main__ - Step 139354: {'lr': 6.357574921040554e-06, 'samples': 26755968, 'steps': 139353, 'loss/train': 1.0683367252349854} 11/07/2021 16:52:26 - INFO - __main__ - Step 139355: {'lr': 6.356385816350424e-06, 'samples': 26756160, 'steps': 139354, 'loss/train': 1.1225768327713013} 11/07/2021 16:52:26 - INFO - __main__ - Step 139356: {'lr': 6.355196821442338e-06, 'samples': 26756352, 'steps': 139355, 'loss/train': 1.0015162229537964} 11/07/2021 16:52:26 - INFO - __main__ - Step 139357: {'lr': 6.354007936316853e-06, 'samples': 26756544, 'steps': 139356, 'loss/train': 1.081733226776123} 11/07/2021 16:52:27 - INFO - __main__ - Step 139358: {'lr': 6.352819160974465e-06, 'samples': 26756736, 'steps': 139357, 'loss/train': 1.3368779420852661} 11/07/2021 16:52:28 - INFO - __main__ - Step 139359: {'lr': 6.351630495415761e-06, 'samples': 26756928, 'steps': 139358, 'loss/train': 1.3064669370651245} 11/07/2021 16:52:28 - INFO - __main__ - Step 139360: {'lr': 6.350441939641266e-06, 'samples': 26757120, 'steps': 139359, 'loss/train': 1.2434769868850708} 11/07/2021 16:52:29 - INFO - __main__ - Step 139361: {'lr': 6.349253493651508e-06, 'samples': 26757312, 'steps': 139360, 'loss/train': 1.0544334650039673} 11/07/2021 16:52:29 - INFO - __main__ - Step 139362: {'lr': 6.348065157446986e-06, 'samples': 26757504, 'steps': 139361, 'loss/train': 1.4470711946487427} 11/07/2021 16:52:29 - INFO - __main__ - Step 139363: {'lr': 6.346876931028256e-06, 'samples': 26757696, 'steps': 139362, 'loss/train': 0.8726140260696411} 11/07/2021 16:52:30 - INFO - __main__ - Step 139364: {'lr': 6.3456888143959e-06, 'samples': 26757888, 'steps': 139363, 'loss/train': 1.8136459589004517} 11/07/2021 16:52:31 - INFO - __main__ - Step 139365: {'lr': 6.344500807550391e-06, 'samples': 26758080, 'steps': 139364, 'loss/train': 1.0906466245651245} 11/07/2021 16:52:31 - INFO - __main__ - Step 139366: {'lr': 6.343312910492282e-06, 'samples': 26758272, 'steps': 139365, 'loss/train': 1.3102067708969116} 11/07/2021 16:52:31 - INFO - __main__ - Step 139367: {'lr': 6.342125123222131e-06, 'samples': 26758464, 'steps': 139366, 'loss/train': 1.313571810722351} 11/07/2021 16:52:32 - INFO - __main__ - Step 139368: {'lr': 6.340937445740463e-06, 'samples': 26758656, 'steps': 139367, 'loss/train': 1.037156105041504} 11/07/2021 16:52:33 - INFO - __main__ - Step 139369: {'lr': 6.339749878047807e-06, 'samples': 26758848, 'steps': 139368, 'loss/train': 0.8697133660316467} 11/07/2021 16:52:33 - INFO - __main__ - Step 139370: {'lr': 6.338562420144689e-06, 'samples': 26759040, 'steps': 139369, 'loss/train': 1.1986525058746338} 11/07/2021 16:52:33 - INFO - __main__ - Step 139371: {'lr': 6.3373750720316645e-06, 'samples': 26759232, 'steps': 139370, 'loss/train': 0.7289814949035645} 11/07/2021 16:52:34 - INFO - __main__ - Step 139372: {'lr': 6.336187833709261e-06, 'samples': 26759424, 'steps': 139371, 'loss/train': 1.3633711338043213} 11/07/2021 16:52:34 - INFO - __main__ - Step 139373: {'lr': 6.335000705178034e-06, 'samples': 26759616, 'steps': 139372, 'loss/train': 1.543317198753357} 11/07/2021 16:52:35 - INFO - __main__ - Step 139374: {'lr': 6.333813686438456e-06, 'samples': 26759808, 'steps': 139373, 'loss/train': 0.899112343788147} 11/07/2021 16:52:35 - INFO - __main__ - Step 139375: {'lr': 6.332626777491107e-06, 'samples': 26760000, 'steps': 139374, 'loss/train': 1.2878189086914062} 11/07/2021 16:52:36 - INFO - __main__ - Step 139376: {'lr': 6.331439978336545e-06, 'samples': 26760192, 'steps': 139375, 'loss/train': 1.4673230648040771} 11/07/2021 16:52:36 - INFO - __main__ - Step 139377: {'lr': 6.330253288975241e-06, 'samples': 26760384, 'steps': 139376, 'loss/train': 1.03940749168396} 11/07/2021 16:52:37 - INFO - __main__ - Step 139378: {'lr': 6.329066709407777e-06, 'samples': 26760576, 'steps': 139377, 'loss/train': 0.4913143515586853} 11/07/2021 16:52:38 - INFO - __main__ - Step 139379: {'lr': 6.327880239634681e-06, 'samples': 26760768, 'steps': 139378, 'loss/train': 1.6424397230148315} 11/07/2021 16:52:38 - INFO - __main__ - Step 139380: {'lr': 6.326693879656481e-06, 'samples': 26760960, 'steps': 139379, 'loss/train': 1.525457739830017} 11/07/2021 16:52:38 - INFO - __main__ - Step 139381: {'lr': 6.3255076294737035e-06, 'samples': 26761152, 'steps': 139380, 'loss/train': 0.9532141089439392} 11/07/2021 16:52:39 - INFO - __main__ - Step 139382: {'lr': 6.324321489086904e-06, 'samples': 26761344, 'steps': 139381, 'loss/train': 0.3567613363265991} 11/07/2021 16:52:39 - INFO - __main__ - Step 139383: {'lr': 6.32313545849661e-06, 'samples': 26761536, 'steps': 139382, 'loss/train': 1.3484057188034058} 11/07/2021 16:52:40 - INFO - __main__ - Step 139384: {'lr': 6.321949537703319e-06, 'samples': 26761728, 'steps': 139383, 'loss/train': 0.03738735616207123} 11/07/2021 16:52:40 - INFO - __main__ - Step 139385: {'lr': 6.320763726707618e-06, 'samples': 26761920, 'steps': 139384, 'loss/train': 1.5496069192886353} 11/07/2021 16:52:41 - INFO - __main__ - Step 139386: {'lr': 6.3195780255100585e-06, 'samples': 26762112, 'steps': 139385, 'loss/train': 1.5418124198913574} 11/07/2021 16:52:41 - INFO - __main__ - Step 139387: {'lr': 6.3183924341110865e-06, 'samples': 26762304, 'steps': 139386, 'loss/train': 1.497504472732544} 11/07/2021 16:52:42 - INFO - __main__ - Step 139388: {'lr': 6.3172069525113115e-06, 'samples': 26762496, 'steps': 139387, 'loss/train': 0.6452668309211731} 11/07/2021 16:52:42 - INFO - __main__ - Step 139389: {'lr': 6.316021580711234e-06, 'samples': 26762688, 'steps': 139388, 'loss/train': 1.322221040725708} 11/07/2021 16:52:43 - INFO - __main__ - Step 139390: {'lr': 6.314836318711381e-06, 'samples': 26762880, 'steps': 139389, 'loss/train': 1.3373178243637085} 11/07/2021 16:52:43 - INFO - __main__ - Step 139391: {'lr': 6.313651166512308e-06, 'samples': 26763072, 'steps': 139390, 'loss/train': 1.2170768976211548} 11/07/2021 16:52:44 - INFO - __main__ - Step 139392: {'lr': 6.3124661241145685e-06, 'samples': 26763264, 'steps': 139391, 'loss/train': 1.476599097251892} 11/07/2021 16:52:44 - INFO - __main__ - Step 139393: {'lr': 6.311281191518637e-06, 'samples': 26763456, 'steps': 139392, 'loss/train': 1.0670835971832275} 11/07/2021 16:52:44 - INFO - __main__ - Step 139394: {'lr': 6.3100963687251215e-06, 'samples': 26763648, 'steps': 139393, 'loss/train': 1.0384796857833862} 11/07/2021 16:52:46 - INFO - __main__ - Step 139395: {'lr': 6.308911655734495e-06, 'samples': 26763840, 'steps': 139394, 'loss/train': 1.0494654178619385} 11/07/2021 16:52:46 - INFO - __main__ - Step 139396: {'lr': 6.307727052547285e-06, 'samples': 26764032, 'steps': 139395, 'loss/train': 0.7093831896781921} 11/07/2021 16:52:47 - INFO - __main__ - Step 139397: {'lr': 6.306542559164102e-06, 'samples': 26764224, 'steps': 139396, 'loss/train': 0.6207001805305481} 11/07/2021 16:52:47 - INFO - __main__ - Step 139398: {'lr': 6.305358175585419e-06, 'samples': 26764416, 'steps': 139397, 'loss/train': 0.036854762583971024} 11/07/2021 16:52:47 - INFO - __main__ - Step 139399: {'lr': 6.304173901811761e-06, 'samples': 26764608, 'steps': 139398, 'loss/train': 0.26934027671813965} 11/07/2021 16:52:48 - INFO - __main__ - Step 139400: {'lr': 6.302989737843712e-06, 'samples': 26764800, 'steps': 139399, 'loss/train': 0.3011823296546936} 11/07/2021 16:52:49 - INFO - __main__ - Step 139401: {'lr': 6.301805683681744e-06, 'samples': 26764992, 'steps': 139400, 'loss/train': 1.3751665353775024} 11/07/2021 16:52:49 - INFO - __main__ - Step 139402: {'lr': 6.30062173932644e-06, 'samples': 26765184, 'steps': 139401, 'loss/train': 1.1046490669250488} 11/07/2021 16:52:49 - INFO - __main__ - Step 139403: {'lr': 6.299437904778299e-06, 'samples': 26765376, 'steps': 139402, 'loss/train': 1.1703603267669678} 11/07/2021 16:52:50 - INFO - __main__ - Step 139404: {'lr': 6.2982541800378765e-06, 'samples': 26765568, 'steps': 139403, 'loss/train': 1.173089861869812} 11/07/2021 16:52:51 - INFO - __main__ - Step 139405: {'lr': 6.2970705651057005e-06, 'samples': 26765760, 'steps': 139404, 'loss/train': 1.4493162631988525} 11/07/2021 16:52:51 - INFO - __main__ - Step 139406: {'lr': 6.295887059982297e-06, 'samples': 26765952, 'steps': 139405, 'loss/train': 1.1700632572174072} 11/07/2021 16:52:52 - INFO - __main__ - Step 139407: {'lr': 6.294703664668222e-06, 'samples': 26766144, 'steps': 139406, 'loss/train': 1.3257229328155518} 11/07/2021 16:52:52 - INFO - __main__ - Step 139408: {'lr': 6.293520379164003e-06, 'samples': 26766336, 'steps': 139407, 'loss/train': 1.2145891189575195} 11/07/2021 16:52:52 - INFO - __main__ - Step 139409: {'lr': 6.292337203470139e-06, 'samples': 26766528, 'steps': 139408, 'loss/train': 1.1993985176086426} 11/07/2021 16:52:53 - INFO - __main__ - Step 139410: {'lr': 6.291154137587213e-06, 'samples': 26766720, 'steps': 139409, 'loss/train': 0.39490264654159546} 11/07/2021 16:52:54 - INFO - __main__ - Step 139411: {'lr': 6.289971181515697e-06, 'samples': 26766912, 'steps': 139410, 'loss/train': 1.265356421470642} 11/07/2021 16:52:54 - INFO - __main__ - Step 139412: {'lr': 6.288788335256174e-06, 'samples': 26767104, 'steps': 139411, 'loss/train': 1.307380199432373} 11/07/2021 16:52:54 - INFO - __main__ - Step 139413: {'lr': 6.2876055988091705e-06, 'samples': 26767296, 'steps': 139412, 'loss/train': 1.0543999671936035} 11/07/2021 16:52:55 - INFO - __main__ - Step 139414: {'lr': 6.286422972175215e-06, 'samples': 26767488, 'steps': 139413, 'loss/train': 1.4791276454925537} 11/07/2021 16:52:56 - INFO - __main__ - Step 139415: {'lr': 6.285240455354807e-06, 'samples': 26767680, 'steps': 139414, 'loss/train': 1.7754696607589722} 11/07/2021 16:52:56 - INFO - __main__ - Step 139416: {'lr': 6.284058048348529e-06, 'samples': 26767872, 'steps': 139415, 'loss/train': 1.6341912746429443} 11/07/2021 16:52:57 - INFO - __main__ - Step 139417: {'lr': 6.282875751156908e-06, 'samples': 26768064, 'steps': 139416, 'loss/train': 1.4816350936889648} 11/07/2021 16:52:57 - INFO - __main__ - Step 139418: {'lr': 6.281693563780444e-06, 'samples': 26768256, 'steps': 139417, 'loss/train': 1.1442041397094727} 11/07/2021 16:52:57 - INFO - __main__ - Step 139419: {'lr': 6.28051148621972e-06, 'samples': 26768448, 'steps': 139418, 'loss/train': 0.8552833795547485} 11/07/2021 16:52:58 - INFO - __main__ - Step 139420: {'lr': 6.279329518475207e-06, 'samples': 26768640, 'steps': 139419, 'loss/train': 1.0262486934661865} 11/07/2021 16:52:59 - INFO - __main__ - Step 139421: {'lr': 6.278147660547462e-06, 'samples': 26768832, 'steps': 139420, 'loss/train': 0.036844510585069656} 11/07/2021 16:52:59 - INFO - __main__ - Step 139422: {'lr': 6.276965912437038e-06, 'samples': 26769024, 'steps': 139421, 'loss/train': 1.207910418510437} 11/07/2021 16:52:59 - INFO - __main__ - Step 139423: {'lr': 6.275784274144436e-06, 'samples': 26769216, 'steps': 139422, 'loss/train': 0.9265379309654236} 11/07/2021 16:53:00 - INFO - __main__ - Step 139424: {'lr': 6.274602745670211e-06, 'samples': 26769408, 'steps': 139423, 'loss/train': 1.0869041681289673} 11/07/2021 16:53:00 - INFO - __main__ - Step 139425: {'lr': 6.273421327014889e-06, 'samples': 26769600, 'steps': 139424, 'loss/train': 1.2867474555969238} 11/07/2021 16:53:01 - INFO - __main__ - Step 139426: {'lr': 6.272240018178998e-06, 'samples': 26769792, 'steps': 139425, 'loss/train': 1.3354763984680176} 11/07/2021 16:53:01 - INFO - __main__ - Step 139427: {'lr': 6.271058819163094e-06, 'samples': 26769984, 'steps': 139426, 'loss/train': 1.7426295280456543} 11/07/2021 16:53:02 - INFO - __main__ - Step 139428: {'lr': 6.269877729967677e-06, 'samples': 26770176, 'steps': 139427, 'loss/train': 1.2316675186157227} 11/07/2021 16:53:02 - INFO - __main__ - Step 139429: {'lr': 6.268696750593272e-06, 'samples': 26770368, 'steps': 139428, 'loss/train': 1.3181859254837036} 11/07/2021 16:53:02 - INFO - __main__ - Step 139430: {'lr': 6.267515881040492e-06, 'samples': 26770560, 'steps': 139429, 'loss/train': 1.327242374420166} 11/07/2021 16:53:04 - INFO - __main__ - Step 139431: {'lr': 6.266335121309752e-06, 'samples': 26770752, 'steps': 139430, 'loss/train': 1.0874935388565063} 11/07/2021 16:53:04 - INFO - __main__ - Step 139432: {'lr': 6.2651544714016625e-06, 'samples': 26770944, 'steps': 139431, 'loss/train': 1.0358818769454956} 11/07/2021 16:53:04 - INFO - __main__ - Step 139433: {'lr': 6.263973931316724e-06, 'samples': 26771136, 'steps': 139432, 'loss/train': 1.2485262155532837} 11/07/2021 16:53:05 - INFO - __main__ - Step 139434: {'lr': 6.262793501055491e-06, 'samples': 26771328, 'steps': 139433, 'loss/train': 1.0790830850601196} 11/07/2021 16:53:05 - INFO - __main__ - Step 139435: {'lr': 6.261613180618464e-06, 'samples': 26771520, 'steps': 139434, 'loss/train': 1.4791978597640991} 11/07/2021 16:53:06 - INFO - __main__ - Step 139436: {'lr': 6.260432970006197e-06, 'samples': 26771712, 'steps': 139435, 'loss/train': 1.1248836517333984} 11/07/2021 16:53:07 - INFO - __main__ - Step 139437: {'lr': 6.259252869219218e-06, 'samples': 26771904, 'steps': 139436, 'loss/train': 1.1366581916809082} 11/07/2021 16:53:07 - INFO - __main__ - Step 139438: {'lr': 6.258072878258053e-06, 'samples': 26772096, 'steps': 139437, 'loss/train': 1.2162635326385498} 11/07/2021 16:53:07 - INFO - __main__ - Step 139439: {'lr': 6.2568929971232595e-06, 'samples': 26772288, 'steps': 139438, 'loss/train': 0.8077369332313538} 11/07/2021 16:53:08 - INFO - __main__ - Step 139440: {'lr': 6.2557132258153345e-06, 'samples': 26772480, 'steps': 139439, 'loss/train': 0.9527719020843506} 11/07/2021 16:53:09 - INFO - __main__ - Step 139441: {'lr': 6.254533564334864e-06, 'samples': 26772672, 'steps': 139440, 'loss/train': 1.065591812133789} 11/07/2021 16:53:09 - INFO - __main__ - Step 139442: {'lr': 6.253354012682288e-06, 'samples': 26772864, 'steps': 139441, 'loss/train': 1.219761610031128} 11/07/2021 16:53:09 - INFO - __main__ - Step 139443: {'lr': 6.252174570858193e-06, 'samples': 26773056, 'steps': 139442, 'loss/train': 1.040008783340454} 11/07/2021 16:53:10 - INFO - __main__ - Step 139444: {'lr': 6.250995238863133e-06, 'samples': 26773248, 'steps': 139443, 'loss/train': 1.5830364227294922} 11/07/2021 16:53:10 - INFO - __main__ - Step 139445: {'lr': 6.2498160166975794e-06, 'samples': 26773440, 'steps': 139444, 'loss/train': 1.219664216041565} 11/07/2021 16:53:11 - INFO - __main__ - Step 139446: {'lr': 6.248636904362115e-06, 'samples': 26773632, 'steps': 139445, 'loss/train': 1.4045578241348267} 11/07/2021 16:53:11 - INFO - __main__ - Step 139447: {'lr': 6.247457901857267e-06, 'samples': 26773824, 'steps': 139446, 'loss/train': 1.2752554416656494} 11/07/2021 16:53:12 - INFO - __main__ - Step 139448: {'lr': 6.246279009183536e-06, 'samples': 26774016, 'steps': 139447, 'loss/train': 1.0172587633132935} 11/07/2021 16:53:12 - INFO - __main__ - Step 139449: {'lr': 6.245100226341477e-06, 'samples': 26774208, 'steps': 139448, 'loss/train': 1.0579943656921387} 11/07/2021 16:53:12 - INFO - __main__ - Step 139450: {'lr': 6.243921553331616e-06, 'samples': 26774400, 'steps': 139449, 'loss/train': 1.4330588579177856} 11/07/2021 16:53:14 - INFO - __main__ - Step 139451: {'lr': 6.242742990154482e-06, 'samples': 26774592, 'steps': 139450, 'loss/train': 1.4232124090194702} 11/07/2021 16:53:14 - INFO - __main__ - Step 139452: {'lr': 6.241564536810601e-06, 'samples': 26774784, 'steps': 139451, 'loss/train': 0.829399824142456} 11/07/2021 16:53:14 - INFO - __main__ - Step 139453: {'lr': 6.240386193300502e-06, 'samples': 26774976, 'steps': 139452, 'loss/train': 1.3164448738098145} 11/07/2021 16:53:15 - INFO - __main__ - Step 139454: {'lr': 6.239207959624766e-06, 'samples': 26775168, 'steps': 139453, 'loss/train': 0.9500367641448975} 11/07/2021 16:53:15 - INFO - __main__ - Step 139455: {'lr': 6.238029835783837e-06, 'samples': 26775360, 'steps': 139454, 'loss/train': 1.3185908794403076} 11/07/2021 16:53:15 - INFO - __main__ - Step 139456: {'lr': 6.2368518217783e-06, 'samples': 26775552, 'steps': 139455, 'loss/train': 1.4654340744018555} 11/07/2021 16:53:16 - INFO - __main__ - Step 139457: {'lr': 6.235673917608681e-06, 'samples': 26775744, 'steps': 139456, 'loss/train': 1.2546581029891968} 11/07/2021 16:53:17 - INFO - __main__ - Step 139458: {'lr': 6.234496123275507e-06, 'samples': 26775936, 'steps': 139457, 'loss/train': 1.6231017112731934} 11/07/2021 16:53:17 - INFO - __main__ - Step 139459: {'lr': 6.233318438779306e-06, 'samples': 26776128, 'steps': 139458, 'loss/train': 1.3912676572799683} 11/07/2021 16:53:17 - INFO - __main__ - Step 139460: {'lr': 6.232140864120606e-06, 'samples': 26776320, 'steps': 139459, 'loss/train': 1.8850047588348389} 11/07/2021 16:53:18 - INFO - __main__ - Step 139461: {'lr': 6.230963399299933e-06, 'samples': 26776512, 'steps': 139460, 'loss/train': 1.2280235290527344} 11/07/2021 16:53:19 - INFO - __main__ - Step 139462: {'lr': 6.229786044317842e-06, 'samples': 26776704, 'steps': 139461, 'loss/train': 1.4694230556488037} 11/07/2021 16:53:19 - INFO - __main__ - Step 139463: {'lr': 6.228608799174834e-06, 'samples': 26776896, 'steps': 139462, 'loss/train': 1.2926104068756104} 11/07/2021 16:53:20 - INFO - __main__ - Step 139464: {'lr': 6.227431663871463e-06, 'samples': 26777088, 'steps': 139463, 'loss/train': 1.334626317024231} 11/07/2021 16:53:20 - INFO - __main__ - Step 139465: {'lr': 6.226254638408257e-06, 'samples': 26777280, 'steps': 139464, 'loss/train': 1.2733012437820435} 11/07/2021 16:53:20 - INFO - __main__ - Step 139466: {'lr': 6.225077722785716e-06, 'samples': 26777472, 'steps': 139465, 'loss/train': 1.4366778135299683} 11/07/2021 16:53:21 - INFO - __main__ - Step 139467: {'lr': 6.223900917004421e-06, 'samples': 26777664, 'steps': 139466, 'loss/train': 1.4310226440429688} 11/07/2021 16:53:22 - INFO - __main__ - Step 139468: {'lr': 6.222724221064874e-06, 'samples': 26777856, 'steps': 139467, 'loss/train': 1.264375925064087} 11/07/2021 16:53:22 - INFO - __main__ - Step 139469: {'lr': 6.221547634967601e-06, 'samples': 26778048, 'steps': 139468, 'loss/train': 1.5351001024246216} 11/07/2021 16:53:22 - INFO - __main__ - Step 139470: {'lr': 6.2203711587131284e-06, 'samples': 26778240, 'steps': 139469, 'loss/train': 1.2935926914215088} 11/07/2021 16:53:23 - INFO - __main__ - Step 139471: {'lr': 6.219194792301985e-06, 'samples': 26778432, 'steps': 139470, 'loss/train': 1.3259317874908447} 11/07/2021 16:53:24 - INFO - __main__ - Step 139472: {'lr': 6.218018535734726e-06, 'samples': 26778624, 'steps': 139471, 'loss/train': 1.5664457082748413} 11/07/2021 16:53:24 - INFO - __main__ - Step 139473: {'lr': 6.216842389011851e-06, 'samples': 26778816, 'steps': 139472, 'loss/train': 1.1842697858810425} 11/07/2021 16:53:24 - INFO - __main__ - Step 139474: {'lr': 6.215666352133914e-06, 'samples': 26779008, 'steps': 139473, 'loss/train': 1.3835285902023315} 11/07/2021 16:53:25 - INFO - __main__ - Step 139475: {'lr': 6.214490425101443e-06, 'samples': 26779200, 'steps': 139474, 'loss/train': 1.0747004747390747} 11/07/2021 16:53:25 - INFO - __main__ - Step 139476: {'lr': 6.213314607914966e-06, 'samples': 26779392, 'steps': 139475, 'loss/train': 1.3525766134262085} 11/07/2021 16:53:26 - INFO - __main__ - Step 139477: {'lr': 6.2121389005749815e-06, 'samples': 26779584, 'steps': 139476, 'loss/train': 0.6878137588500977} 11/07/2021 16:53:26 - INFO - __main__ - Step 139478: {'lr': 6.210963303082073e-06, 'samples': 26779776, 'steps': 139477, 'loss/train': 1.215018391609192} 11/07/2021 16:53:27 - INFO - __main__ - Step 139479: {'lr': 6.209787815436713e-06, 'samples': 26779968, 'steps': 139478, 'loss/train': 1.3553979396820068} 11/07/2021 16:53:27 - INFO - __main__ - Step 139480: {'lr': 6.208612437639482e-06, 'samples': 26780160, 'steps': 139479, 'loss/train': 1.308188796043396} 11/07/2021 16:53:27 - INFO - __main__ - Step 139481: {'lr': 6.20743716969091e-06, 'samples': 26780352, 'steps': 139480, 'loss/train': 1.1649107933044434} 11/07/2021 16:53:29 - INFO - __main__ - Step 139482: {'lr': 6.206262011591468e-06, 'samples': 26780544, 'steps': 139481, 'loss/train': 1.2648462057113647} 11/07/2021 16:53:29 - INFO - __main__ - Step 139483: {'lr': 6.205086963341738e-06, 'samples': 26780736, 'steps': 139482, 'loss/train': 1.1773266792297363} 11/07/2021 16:53:29 - INFO - __main__ - Step 139484: {'lr': 6.203912024942248e-06, 'samples': 26780928, 'steps': 139483, 'loss/train': 1.7195971012115479} 11/07/2021 16:53:30 - INFO - __main__ - Step 139485: {'lr': 6.202737196393471e-06, 'samples': 26781120, 'steps': 139484, 'loss/train': 1.584341049194336} 11/07/2021 16:53:30 - INFO - __main__ - Step 139486: {'lr': 6.201562477696016e-06, 'samples': 26781312, 'steps': 139485, 'loss/train': 1.5461647510528564} 11/07/2021 16:53:30 - INFO - __main__ - Step 139487: {'lr': 6.200387868850355e-06, 'samples': 26781504, 'steps': 139486, 'loss/train': 1.5371441841125488} 11/07/2021 16:53:31 - INFO - __main__ - Step 139488: {'lr': 6.199213369857043e-06, 'samples': 26781696, 'steps': 139487, 'loss/train': 1.057295322418213} 11/07/2021 16:53:32 - INFO - __main__ - Step 139489: {'lr': 6.198038980716608e-06, 'samples': 26781888, 'steps': 139488, 'loss/train': 1.4527291059494019} 11/07/2021 16:53:32 - INFO - __main__ - Step 139490: {'lr': 6.1968647014295495e-06, 'samples': 26782080, 'steps': 139489, 'loss/train': 1.2720274925231934} 11/07/2021 16:53:33 - INFO - __main__ - Step 139491: {'lr': 6.195690531996451e-06, 'samples': 26782272, 'steps': 139490, 'loss/train': 1.2507144212722778} 11/07/2021 16:53:33 - INFO - __main__ - Step 139492: {'lr': 6.19451647241781e-06, 'samples': 26782464, 'steps': 139491, 'loss/train': 1.363171100616455} 11/07/2021 16:53:34 - INFO - __main__ - Step 139493: {'lr': 6.1933425226941566e-06, 'samples': 26782656, 'steps': 139492, 'loss/train': 1.0708867311477661} 11/07/2021 16:53:34 - INFO - __main__ - Step 139494: {'lr': 6.192168682826016e-06, 'samples': 26782848, 'steps': 139493, 'loss/train': 0.9385887384414673} 11/07/2021 16:53:35 - INFO - __main__ - Step 139495: {'lr': 6.1909949528139445e-06, 'samples': 26783040, 'steps': 139494, 'loss/train': 1.5189603567123413} 11/07/2021 16:53:35 - INFO - __main__ - Step 139496: {'lr': 6.1898213326584126e-06, 'samples': 26783232, 'steps': 139495, 'loss/train': 1.1029891967773438} 11/07/2021 16:53:35 - INFO - __main__ - Step 139497: {'lr': 6.1886478223600055e-06, 'samples': 26783424, 'steps': 139496, 'loss/train': 0.8835906386375427} 11/07/2021 16:53:36 - INFO - __main__ - Step 139498: {'lr': 6.18747442191922e-06, 'samples': 26783616, 'steps': 139497, 'loss/train': 0.97804856300354} 11/07/2021 16:53:37 - INFO - __main__ - Step 139499: {'lr': 6.186301131336586e-06, 'samples': 26783808, 'steps': 139498, 'loss/train': 1.6879106760025024} 11/07/2021 16:53:37 - INFO - __main__ - Step 139500: {'lr': 6.185127950612657e-06, 'samples': 26784000, 'steps': 139499, 'loss/train': 1.050852656364441} 11/07/2021 16:53:37 - INFO - __main__ - Step 139501: {'lr': 6.1839548797479605e-06, 'samples': 26784192, 'steps': 139500, 'loss/train': 1.3497673273086548} 11/07/2021 16:53:38 - INFO - __main__ - Step 139502: {'lr': 6.182781918742997e-06, 'samples': 26784384, 'steps': 139501, 'loss/train': 0.8258057832717896} 11/07/2021 16:53:38 - INFO - __main__ - Step 139503: {'lr': 6.181609067598293e-06, 'samples': 26784576, 'steps': 139502, 'loss/train': 1.6317988634109497} 11/07/2021 16:53:39 - INFO - __main__ - Step 139504: {'lr': 6.180436326314403e-06, 'samples': 26784768, 'steps': 139503, 'loss/train': 1.5014547109603882} 11/07/2021 16:53:40 - INFO - __main__ - Step 139505: {'lr': 6.179263694891857e-06, 'samples': 26784960, 'steps': 139504, 'loss/train': 1.3259974718093872} 11/07/2021 16:53:40 - INFO - __main__ - Step 139506: {'lr': 6.178091173331179e-06, 'samples': 26785152, 'steps': 139505, 'loss/train': 0.4526408016681671} 11/07/2021 16:53:40 - INFO - __main__ - Step 139507: {'lr': 6.1769187616328715e-06, 'samples': 26785344, 'steps': 139506, 'loss/train': 1.4368401765823364} 11/07/2021 16:53:41 - INFO - __main__ - Step 139508: {'lr': 6.1757464597975155e-06, 'samples': 26785536, 'steps': 139507, 'loss/train': 1.1761959791183472} 11/07/2021 16:53:42 - INFO - __main__ - Step 139509: {'lr': 6.174574267825584e-06, 'samples': 26785728, 'steps': 139508, 'loss/train': 1.0703494548797607} 11/07/2021 16:53:42 - INFO - __main__ - Step 139510: {'lr': 6.173402185717631e-06, 'samples': 26785920, 'steps': 139509, 'loss/train': 1.2223856449127197} 11/07/2021 16:53:43 - INFO - __main__ - Step 139511: {'lr': 6.172230213474156e-06, 'samples': 26786112, 'steps': 139510, 'loss/train': 1.3521976470947266} 11/07/2021 16:53:43 - INFO - __main__ - Step 139512: {'lr': 6.171058351095743e-06, 'samples': 26786304, 'steps': 139511, 'loss/train': 1.0099334716796875} 11/07/2021 16:53:43 - INFO - __main__ - Step 139513: {'lr': 6.1698865985828636e-06, 'samples': 26786496, 'steps': 139512, 'loss/train': 0.6823744773864746} 11/07/2021 16:53:44 - INFO - __main__ - Step 139514: {'lr': 6.1687149559361e-06, 'samples': 26786688, 'steps': 139513, 'loss/train': 1.1947598457336426} 11/07/2021 16:53:45 - INFO - __main__ - Step 139515: {'lr': 6.167543423155925e-06, 'samples': 26786880, 'steps': 139514, 'loss/train': 1.4796665906906128} 11/07/2021 16:53:45 - INFO - __main__ - Step 139516: {'lr': 6.166372000242893e-06, 'samples': 26787072, 'steps': 139515, 'loss/train': 1.127637505531311} 11/07/2021 16:53:45 - INFO - __main__ - Step 139517: {'lr': 6.165200687197531e-06, 'samples': 26787264, 'steps': 139516, 'loss/train': 1.119896650314331} 11/07/2021 16:53:46 - INFO - __main__ - Step 139518: {'lr': 6.1640294840203944e-06, 'samples': 26787456, 'steps': 139517, 'loss/train': 1.436779260635376} 11/07/2021 16:53:47 - INFO - __main__ - Step 139519: {'lr': 6.1628583907119565e-06, 'samples': 26787648, 'steps': 139518, 'loss/train': 1.5050272941589355} 11/07/2021 16:53:47 - INFO - __main__ - Step 139520: {'lr': 6.16168740727277e-06, 'samples': 26787840, 'steps': 139519, 'loss/train': 1.2018593549728394} 11/07/2021 16:53:48 - INFO - __main__ - Step 139521: {'lr': 6.160516533703392e-06, 'samples': 26788032, 'steps': 139520, 'loss/train': 1.2581820487976074} 11/07/2021 16:53:48 - INFO - __main__ - Step 139522: {'lr': 6.159345770004321e-06, 'samples': 26788224, 'steps': 139521, 'loss/train': 1.3060152530670166} 11/07/2021 16:53:48 - INFO - __main__ - Step 139523: {'lr': 6.158175116176057e-06, 'samples': 26788416, 'steps': 139522, 'loss/train': 1.196274995803833} 11/07/2021 16:53:49 - INFO - __main__ - Step 139524: {'lr': 6.157004572219182e-06, 'samples': 26788608, 'steps': 139523, 'loss/train': 1.3733240365982056} 11/07/2021 16:53:50 - INFO - __main__ - Step 139525: {'lr': 6.1558341381341696e-06, 'samples': 26788800, 'steps': 139524, 'loss/train': 1.367688536643982} 11/07/2021 16:53:50 - INFO - __main__ - Step 139526: {'lr': 6.154663813921602e-06, 'samples': 26788992, 'steps': 139525, 'loss/train': 0.035717736929655075} 11/07/2021 16:53:50 - INFO - __main__ - Step 139527: {'lr': 6.1534935995819775e-06, 'samples': 26789184, 'steps': 139526, 'loss/train': 1.5999687910079956} 11/07/2021 16:53:51 - INFO - __main__ - Step 139528: {'lr': 6.152323495115797e-06, 'samples': 26789376, 'steps': 139527, 'loss/train': 1.3628443479537964} 11/07/2021 16:53:51 - INFO - __main__ - Step 139529: {'lr': 6.151153500523643e-06, 'samples': 26789568, 'steps': 139528, 'loss/train': 1.6082991361618042} 11/07/2021 16:53:52 - INFO - __main__ - Step 139530: {'lr': 6.149983615806015e-06, 'samples': 26789760, 'steps': 139529, 'loss/train': 1.1553609371185303} 11/07/2021 16:53:52 - INFO - __main__ - Step 139531: {'lr': 6.148813840963441e-06, 'samples': 26789952, 'steps': 139530, 'loss/train': 0.29953062534332275} 11/07/2021 16:53:53 - INFO - __main__ - Step 139532: {'lr': 6.147644175996447e-06, 'samples': 26790144, 'steps': 139531, 'loss/train': 0.36374005675315857} 11/07/2021 16:53:53 - INFO - __main__ - Step 139533: {'lr': 6.146474620905534e-06, 'samples': 26790336, 'steps': 139532, 'loss/train': 1.0085039138793945} 11/07/2021 16:53:54 - INFO - __main__ - Step 139534: {'lr': 6.145305175691285e-06, 'samples': 26790528, 'steps': 139533, 'loss/train': 1.6562707424163818} 11/07/2021 16:53:55 - INFO - __main__ - Step 139535: {'lr': 6.1441358403542255e-06, 'samples': 26790720, 'steps': 139534, 'loss/train': 1.2131303548812866} 11/07/2021 16:53:55 - INFO - __main__ - Step 139536: {'lr': 6.142966614894829e-06, 'samples': 26790912, 'steps': 139535, 'loss/train': 0.8387482762336731} 11/07/2021 16:53:55 - INFO - __main__ - Step 139537: {'lr': 6.14179749931365e-06, 'samples': 26791104, 'steps': 139536, 'loss/train': 0.9551588892936707} 11/07/2021 16:53:56 - INFO - __main__ - Step 139538: {'lr': 6.1406284936111885e-06, 'samples': 26791296, 'steps': 139537, 'loss/train': 1.19436776638031} 11/07/2021 16:53:56 - INFO - __main__ - Step 139539: {'lr': 6.139459597788027e-06, 'samples': 26791488, 'steps': 139538, 'loss/train': 1.6483235359191895} 11/07/2021 16:53:57 - INFO - __main__ - Step 139540: {'lr': 6.1382908118446655e-06, 'samples': 26791680, 'steps': 139539, 'loss/train': 0.48629969358444214} 11/07/2021 16:53:58 - INFO - __main__ - Step 139541: {'lr': 6.1371221357816035e-06, 'samples': 26791872, 'steps': 139540, 'loss/train': 1.1448646783828735} 11/07/2021 16:53:58 - INFO - __main__ - Step 139542: {'lr': 6.135953569599395e-06, 'samples': 26792064, 'steps': 139541, 'loss/train': 0.08000705391168594} 11/07/2021 16:53:58 - INFO - __main__ - Step 139543: {'lr': 6.134785113298569e-06, 'samples': 26792256, 'steps': 139542, 'loss/train': 1.3131773471832275} 11/07/2021 16:53:59 - INFO - __main__ - Step 139544: {'lr': 6.133616766879651e-06, 'samples': 26792448, 'steps': 139543, 'loss/train': 1.1961416006088257} 11/07/2021 16:54:00 - INFO - __main__ - Step 139545: {'lr': 6.132448530343171e-06, 'samples': 26792640, 'steps': 139544, 'loss/train': 1.252254843711853} 11/07/2021 16:54:00 - INFO - __main__ - Step 139546: {'lr': 6.1312804036896265e-06, 'samples': 26792832, 'steps': 139545, 'loss/train': 1.165264368057251} 11/07/2021 16:54:01 - INFO - __main__ - Step 139547: {'lr': 6.130112386919573e-06, 'samples': 26793024, 'steps': 139546, 'loss/train': 1.3392505645751953} 11/07/2021 16:54:01 - INFO - __main__ - Step 139548: {'lr': 6.128944480033538e-06, 'samples': 26793216, 'steps': 139547, 'loss/train': 1.4788261651992798} 11/07/2021 16:54:01 - INFO - __main__ - Step 139549: {'lr': 6.1277766830320215e-06, 'samples': 26793408, 'steps': 139548, 'loss/train': 1.3392456769943237} 11/07/2021 16:54:02 - INFO - __main__ - Step 139550: {'lr': 6.126608995915578e-06, 'samples': 26793600, 'steps': 139549, 'loss/train': 0.7565128803253174} 11/07/2021 16:54:03 - INFO - __main__ - Step 139551: {'lr': 6.1254414186847075e-06, 'samples': 26793792, 'steps': 139550, 'loss/train': 1.4869277477264404} 11/07/2021 16:54:03 - INFO - __main__ - Step 139552: {'lr': 6.124273951339965e-06, 'samples': 26793984, 'steps': 139551, 'loss/train': 1.2768328189849854} 11/07/2021 16:54:03 - INFO - __main__ - Step 139553: {'lr': 6.12310659388185e-06, 'samples': 26794176, 'steps': 139552, 'loss/train': 1.5579016208648682} 11/07/2021 16:54:04 - INFO - __main__ - Step 139554: {'lr': 6.12193934631089e-06, 'samples': 26794368, 'steps': 139553, 'loss/train': 1.4662691354751587} 11/07/2021 16:54:05 - INFO - __main__ - Step 139555: {'lr': 6.1207722086276394e-06, 'samples': 26794560, 'steps': 139554, 'loss/train': 0.9261882901191711} 11/07/2021 16:54:05 - INFO - __main__ - Step 139556: {'lr': 6.119605180832599e-06, 'samples': 26794752, 'steps': 139555, 'loss/train': 1.1153090000152588} 11/07/2021 16:54:06 - INFO - __main__ - Step 139557: {'lr': 6.1184382629262956e-06, 'samples': 26794944, 'steps': 139556, 'loss/train': 1.4964195489883423} 11/07/2021 16:54:06 - INFO - __main__ - Step 139558: {'lr': 6.117271454909257e-06, 'samples': 26795136, 'steps': 139557, 'loss/train': 0.231853649020195} 11/07/2021 16:54:06 - INFO - __main__ - Step 139559: {'lr': 6.116104756782037e-06, 'samples': 26795328, 'steps': 139558, 'loss/train': 0.8903438448905945} 11/07/2021 16:54:07 - INFO - __main__ - Step 139560: {'lr': 6.114938168545109e-06, 'samples': 26795520, 'steps': 139559, 'loss/train': 1.146031141281128} 11/07/2021 16:54:08 - INFO - __main__ - Step 139561: {'lr': 6.113771690199027e-06, 'samples': 26795712, 'steps': 139560, 'loss/train': 1.3458504676818848} 11/07/2021 16:54:08 - INFO - __main__ - Step 139562: {'lr': 6.112605321744374e-06, 'samples': 26795904, 'steps': 139561, 'loss/train': 1.287131428718567} 11/07/2021 16:54:08 - INFO - __main__ - Step 139563: {'lr': 6.111439063181568e-06, 'samples': 26796096, 'steps': 139562, 'loss/train': 1.3503170013427734} 11/07/2021 16:54:09 - INFO - __main__ - Step 139564: {'lr': 6.110272914511189e-06, 'samples': 26796288, 'steps': 139563, 'loss/train': 1.208053469657898} 11/07/2021 16:54:09 - INFO - __main__ - Step 139565: {'lr': 6.109106875733739e-06, 'samples': 26796480, 'steps': 139564, 'loss/train': 1.302463173866272} 11/07/2021 16:54:10 - INFO - __main__ - Step 139566: {'lr': 6.107940946849799e-06, 'samples': 26796672, 'steps': 139565, 'loss/train': 1.5085015296936035} 11/07/2021 16:54:11 - INFO - __main__ - Step 139567: {'lr': 6.106775127859815e-06, 'samples': 26796864, 'steps': 139566, 'loss/train': 1.329789638519287} 11/07/2021 16:54:11 - INFO - __main__ - Step 139568: {'lr': 6.105609418764396e-06, 'samples': 26797056, 'steps': 139567, 'loss/train': 1.2559434175491333} 11/07/2021 16:54:11 - INFO - __main__ - Step 139569: {'lr': 6.104443819563987e-06, 'samples': 26797248, 'steps': 139568, 'loss/train': 0.059571556746959686} 11/07/2021 16:54:12 - INFO - __main__ - Step 139570: {'lr': 6.103278330259171e-06, 'samples': 26797440, 'steps': 139569, 'loss/train': 1.2178610563278198} 11/07/2021 16:54:13 - INFO - __main__ - Step 139571: {'lr': 6.102112950850475e-06, 'samples': 26797632, 'steps': 139570, 'loss/train': 0.6476883888244629} 11/07/2021 16:54:13 - INFO - __main__ - Step 139572: {'lr': 6.10094768133837e-06, 'samples': 26797824, 'steps': 139571, 'loss/train': 1.183524250984192} 11/07/2021 16:54:13 - INFO - __main__ - Step 139573: {'lr': 6.099782521723413e-06, 'samples': 26798016, 'steps': 139572, 'loss/train': 1.209731936454773} 11/07/2021 16:54:14 - INFO - __main__ - Step 139574: {'lr': 6.098617472006157e-06, 'samples': 26798208, 'steps': 139573, 'loss/train': 0.8965895175933838} 11/07/2021 16:54:14 - INFO - __main__ - Step 139575: {'lr': 6.0974525321871035e-06, 'samples': 26798400, 'steps': 139574, 'loss/train': 1.2892582416534424} 11/07/2021 16:54:15 - INFO - __main__ - Step 139576: {'lr': 6.096287702266778e-06, 'samples': 26798592, 'steps': 139575, 'loss/train': 0.4629965126514435} 11/07/2021 16:54:15 - INFO - __main__ - Step 139577: {'lr': 6.095122982245682e-06, 'samples': 26798784, 'steps': 139576, 'loss/train': 1.1269073486328125} 11/07/2021 16:54:16 - INFO - __main__ - Step 139578: {'lr': 6.093958372124342e-06, 'samples': 26798976, 'steps': 139577, 'loss/train': 1.1523232460021973} 11/07/2021 16:54:16 - INFO - __main__ - Step 139579: {'lr': 6.09279387190334e-06, 'samples': 26799168, 'steps': 139578, 'loss/train': 1.3000683784484863} 11/07/2021 16:54:17 - INFO - __main__ - Step 139580: {'lr': 6.091629481583122e-06, 'samples': 26799360, 'steps': 139579, 'loss/train': 1.0399609804153442} 11/07/2021 16:54:18 - INFO - __main__ - Step 139581: {'lr': 6.090465201164269e-06, 'samples': 26799552, 'steps': 139580, 'loss/train': 1.1771267652511597} 11/07/2021 16:54:18 - INFO - __main__ - Step 139582: {'lr': 6.089301030647309e-06, 'samples': 26799744, 'steps': 139581, 'loss/train': 1.4579486846923828} 11/07/2021 16:54:18 - INFO - __main__ - Step 139583: {'lr': 6.088136970032715e-06, 'samples': 26799936, 'steps': 139582, 'loss/train': 1.3769716024398804} 11/07/2021 16:54:19 - INFO - __main__ - Step 139584: {'lr': 6.086973019321041e-06, 'samples': 26800128, 'steps': 139583, 'loss/train': 1.1868948936462402} 11/07/2021 16:54:19 - INFO - __main__ - Step 139585: {'lr': 6.0858091785128415e-06, 'samples': 26800320, 'steps': 139584, 'loss/train': 1.1684377193450928} 11/07/2021 16:54:20 - INFO - __main__ - Step 139586: {'lr': 6.0846454476085885e-06, 'samples': 26800512, 'steps': 139585, 'loss/train': 1.256164312362671} 11/07/2021 16:54:21 - INFO - __main__ - Step 139587: {'lr': 6.083481826608839e-06, 'samples': 26800704, 'steps': 139586, 'loss/train': 0.036827582865953445} 11/07/2021 16:54:21 - INFO - __main__ - Step 139588: {'lr': 6.082318315514118e-06, 'samples': 26800896, 'steps': 139587, 'loss/train': 1.0147521495819092} 11/07/2021 16:54:21 - INFO - __main__ - Step 139589: {'lr': 6.081154914324955e-06, 'samples': 26801088, 'steps': 139588, 'loss/train': 1.127002477645874} 11/07/2021 16:54:22 - INFO - __main__ - Step 139590: {'lr': 6.079991623041848e-06, 'samples': 26801280, 'steps': 139589, 'loss/train': 1.2162364721298218} 11/07/2021 16:54:22 - INFO - __main__ - Step 139591: {'lr': 6.0788284416653235e-06, 'samples': 26801472, 'steps': 139590, 'loss/train': 0.8717607855796814} 11/07/2021 16:54:23 - INFO - __main__ - Step 139592: {'lr': 6.077665370195912e-06, 'samples': 26801664, 'steps': 139591, 'loss/train': 1.3572388887405396} 11/07/2021 16:54:23 - INFO - __main__ - Step 139593: {'lr': 6.0765024086341384e-06, 'samples': 26801856, 'steps': 139592, 'loss/train': 1.351558804512024} 11/07/2021 16:54:24 - INFO - __main__ - Step 139594: {'lr': 6.0753395569805304e-06, 'samples': 26802048, 'steps': 139593, 'loss/train': 0.9546968340873718} 11/07/2021 16:54:24 - INFO - __main__ - Step 139595: {'lr': 6.074176815235615e-06, 'samples': 26802240, 'steps': 139594, 'loss/train': 1.4250543117523193} 11/07/2021 16:54:24 - INFO - __main__ - Step 139596: {'lr': 6.073014183399894e-06, 'samples': 26802432, 'steps': 139595, 'loss/train': 1.2466799020767212} 11/07/2021 16:54:25 - INFO - __main__ - Step 139597: {'lr': 6.07185166147392e-06, 'samples': 26802624, 'steps': 139596, 'loss/train': 1.745084524154663} 11/07/2021 16:54:26 - INFO - __main__ - Step 139598: {'lr': 6.070689249458222e-06, 'samples': 26802816, 'steps': 139597, 'loss/train': 1.3574618101119995} 11/07/2021 16:54:26 - INFO - __main__ - Step 139599: {'lr': 6.069526947353299e-06, 'samples': 26803008, 'steps': 139598, 'loss/train': 1.040579080581665} 11/07/2021 16:54:26 - INFO - __main__ - Step 139600: {'lr': 6.068364755159678e-06, 'samples': 26803200, 'steps': 139599, 'loss/train': 1.114575743675232} 11/07/2021 16:54:27 - INFO - __main__ - Step 139601: {'lr': 6.067202672877886e-06, 'samples': 26803392, 'steps': 139600, 'loss/train': 1.5346518754959106} 11/07/2021 16:54:28 - INFO - __main__ - Step 139602: {'lr': 6.06604070050848e-06, 'samples': 26803584, 'steps': 139601, 'loss/train': 1.4774231910705566} 11/07/2021 16:54:28 - INFO - __main__ - Step 139603: {'lr': 6.064878838051902e-06, 'samples': 26803776, 'steps': 139602, 'loss/train': 1.5177316665649414} 11/07/2021 16:54:29 - INFO - __main__ - Step 139604: {'lr': 6.063717085508763e-06, 'samples': 26803968, 'steps': 139603, 'loss/train': 0.4777734875679016} 11/07/2021 16:54:29 - INFO - __main__ - Step 139605: {'lr': 6.062555442879508e-06, 'samples': 26804160, 'steps': 139604, 'loss/train': 1.3679600954055786} 11/07/2021 16:54:29 - INFO - __main__ - Step 139606: {'lr': 6.061393910164747e-06, 'samples': 26804352, 'steps': 139605, 'loss/train': 1.055587887763977} 11/07/2021 16:54:30 - INFO - __main__ - Step 139607: {'lr': 6.060232487364925e-06, 'samples': 26804544, 'steps': 139606, 'loss/train': 1.339532494544983} 11/07/2021 16:54:31 - INFO - __main__ - Step 139608: {'lr': 6.059071174480623e-06, 'samples': 26804736, 'steps': 139607, 'loss/train': 1.138903260231018} 11/07/2021 16:54:31 - INFO - __main__ - Step 139609: {'lr': 6.057909971512315e-06, 'samples': 26804928, 'steps': 139608, 'loss/train': 1.1933166980743408} 11/07/2021 16:54:31 - INFO - __main__ - Step 139610: {'lr': 6.056748878460555e-06, 'samples': 26805120, 'steps': 139609, 'loss/train': 1.4869881868362427} 11/07/2021 16:54:32 - INFO - __main__ - Step 139611: {'lr': 6.0555878953258704e-06, 'samples': 26805312, 'steps': 139610, 'loss/train': 1.2574949264526367} 11/07/2021 16:54:33 - INFO - __main__ - Step 139612: {'lr': 6.054427022108761e-06, 'samples': 26805504, 'steps': 139611, 'loss/train': 1.0878522396087646} 11/07/2021 16:54:33 - INFO - __main__ - Step 139613: {'lr': 6.053266258809781e-06, 'samples': 26805696, 'steps': 139612, 'loss/train': 1.3818281888961792} 11/07/2021 16:54:33 - INFO - __main__ - Step 139614: {'lr': 6.052105605429403e-06, 'samples': 26805888, 'steps': 139613, 'loss/train': 1.0005296468734741} 11/07/2021 16:54:34 - INFO - __main__ - Step 139615: {'lr': 6.050945061968238e-06, 'samples': 26806080, 'steps': 139614, 'loss/train': 0.8265625238418579} 11/07/2021 16:54:34 - INFO - __main__ - Step 139616: {'lr': 6.049784628426702e-06, 'samples': 26806272, 'steps': 139615, 'loss/train': 2.5572402477264404} 11/07/2021 16:54:35 - INFO - __main__ - Step 139617: {'lr': 6.048624304805378e-06, 'samples': 26806464, 'steps': 139616, 'loss/train': 1.3192832469940186} 11/07/2021 16:54:36 - INFO - __main__ - Step 139618: {'lr': 6.047464091104793e-06, 'samples': 26806656, 'steps': 139617, 'loss/train': 1.1261224746704102} 11/07/2021 16:54:36 - INFO - __main__ - Step 139619: {'lr': 6.046303987325446e-06, 'samples': 26806848, 'steps': 139618, 'loss/train': 1.1688933372497559} 11/07/2021 16:54:36 - INFO - __main__ - Step 139620: {'lr': 6.045143993467867e-06, 'samples': 26807040, 'steps': 139619, 'loss/train': 1.380424976348877} 11/07/2021 16:54:37 - INFO - __main__ - Step 139621: {'lr': 6.0439841095325795e-06, 'samples': 26807232, 'steps': 139620, 'loss/train': 1.320696473121643} 11/07/2021 16:54:38 - INFO - __main__ - Step 139622: {'lr': 6.042824335520114e-06, 'samples': 26807424, 'steps': 139621, 'loss/train': 1.597536563873291} 11/07/2021 16:54:38 - INFO - __main__ - Step 139623: {'lr': 6.041664671430996e-06, 'samples': 26807616, 'steps': 139622, 'loss/train': 1.114051103591919} 11/07/2021 16:54:38 - INFO - __main__ - Step 139624: {'lr': 6.040505117265727e-06, 'samples': 26807808, 'steps': 139623, 'loss/train': 0.06398974359035492} 11/07/2021 16:54:39 - INFO - __main__ - Step 139625: {'lr': 6.0393456730248595e-06, 'samples': 26808000, 'steps': 139624, 'loss/train': 1.1686195135116577} 11/07/2021 16:54:39 - INFO - __main__ - Step 139626: {'lr': 6.038186338708868e-06, 'samples': 26808192, 'steps': 139625, 'loss/train': 1.76750910282135} 11/07/2021 16:54:40 - INFO - __main__ - Step 139627: {'lr': 6.0370271143183335e-06, 'samples': 26808384, 'steps': 139626, 'loss/train': 1.1316038370132446} 11/07/2021 16:54:40 - INFO - __main__ - Step 139628: {'lr': 6.035867999853728e-06, 'samples': 26808576, 'steps': 139627, 'loss/train': 1.2756645679473877} 11/07/2021 16:54:41 - INFO - __main__ - Step 139629: {'lr': 6.0347089953156355e-06, 'samples': 26808768, 'steps': 139628, 'loss/train': 1.4250129461288452} 11/07/2021 16:54:41 - INFO - __main__ - Step 139630: {'lr': 6.033550100704526e-06, 'samples': 26808960, 'steps': 139629, 'loss/train': 1.0188207626342773} 11/07/2021 16:54:41 - INFO - __main__ - Step 139631: {'lr': 6.032391316020902e-06, 'samples': 26809152, 'steps': 139630, 'loss/train': 0.9078286290168762} 11/07/2021 16:54:43 - INFO - __main__ - Step 139632: {'lr': 6.031232641265344e-06, 'samples': 26809344, 'steps': 139631, 'loss/train': 1.5704233646392822} 11/07/2021 16:54:43 - INFO - __main__ - Step 139633: {'lr': 6.030074076438325e-06, 'samples': 26809536, 'steps': 139632, 'loss/train': 1.2940473556518555} 11/07/2021 16:54:44 - INFO - __main__ - Step 139634: {'lr': 6.028915621540398e-06, 'samples': 26809728, 'steps': 139633, 'loss/train': 0.49866268038749695} 11/07/2021 16:54:44 - INFO - __main__ - Step 139635: {'lr': 6.027757276572093e-06, 'samples': 26809920, 'steps': 139634, 'loss/train': 0.44172507524490356} 11/07/2021 16:54:44 - INFO - __main__ - Step 139636: {'lr': 6.026599041533909e-06, 'samples': 26810112, 'steps': 139635, 'loss/train': 1.176435112953186} 11/07/2021 16:54:45 - INFO - __main__ - Step 139637: {'lr': 6.025440916426372e-06, 'samples': 26810304, 'steps': 139636, 'loss/train': 1.4103235006332397} 11/07/2021 16:54:46 - INFO - __main__ - Step 139638: {'lr': 6.024282901249984e-06, 'samples': 26810496, 'steps': 139637, 'loss/train': 0.8745039105415344} 11/07/2021 16:54:46 - INFO - __main__ - Step 139639: {'lr': 6.023124996005325e-06, 'samples': 26810688, 'steps': 139638, 'loss/train': 1.3171532154083252} 11/07/2021 16:54:47 - INFO - __main__ - Step 139640: {'lr': 6.0219672006928684e-06, 'samples': 26810880, 'steps': 139639, 'loss/train': 0.9050981998443604} 11/07/2021 16:54:47 - INFO - __main__ - Step 139641: {'lr': 6.020809515313141e-06, 'samples': 26811072, 'steps': 139640, 'loss/train': 1.6956239938735962} 11/07/2021 16:54:47 - INFO - __main__ - Step 139642: {'lr': 6.0196519398667e-06, 'samples': 26811264, 'steps': 139641, 'loss/train': 1.4018917083740234} 11/07/2021 16:54:48 - INFO - __main__ - Step 139643: {'lr': 6.018494474354014e-06, 'samples': 26811456, 'steps': 139642, 'loss/train': 0.7072728276252747} 11/07/2021 16:54:49 - INFO - __main__ - Step 139644: {'lr': 6.01733711877564e-06, 'samples': 26811648, 'steps': 139643, 'loss/train': 0.8666209578514099} 11/07/2021 16:54:49 - INFO - __main__ - Step 139645: {'lr': 6.016179873132077e-06, 'samples': 26811840, 'steps': 139644, 'loss/train': 1.4868918657302856} 11/07/2021 16:54:49 - INFO - __main__ - Step 139646: {'lr': 6.015022737423853e-06, 'samples': 26812032, 'steps': 139645, 'loss/train': 1.0373247861862183} 11/07/2021 16:54:50 - INFO - __main__ - Step 139647: {'lr': 6.013865711651495e-06, 'samples': 26812224, 'steps': 139646, 'loss/train': 1.393404483795166} 11/07/2021 16:54:50 - INFO - __main__ - Step 139648: {'lr': 6.01270879581553e-06, 'samples': 26812416, 'steps': 139647, 'loss/train': 1.2959458827972412} 11/07/2021 16:54:51 - INFO - __main__ - Step 139649: {'lr': 6.011551989916486e-06, 'samples': 26812608, 'steps': 139648, 'loss/train': 1.0416604280471802} 11/07/2021 16:54:52 - INFO - __main__ - Step 139650: {'lr': 6.010395293954863e-06, 'samples': 26812800, 'steps': 139649, 'loss/train': 1.139768123626709} 11/07/2021 16:54:52 - INFO - __main__ - Step 139651: {'lr': 6.009238707931186e-06, 'samples': 26812992, 'steps': 139650, 'loss/train': 1.1529566049575806} 11/07/2021 16:54:52 - INFO - __main__ - Step 139652: {'lr': 6.008082231845985e-06, 'samples': 26813184, 'steps': 139651, 'loss/train': 1.2207046747207642} 11/07/2021 16:54:53 - INFO - __main__ - Step 139653: {'lr': 6.006925865699786e-06, 'samples': 26813376, 'steps': 139652, 'loss/train': 1.5339651107788086} 11/07/2021 16:54:53 - INFO - __main__ - Step 139654: {'lr': 6.005769609493089e-06, 'samples': 26813568, 'steps': 139653, 'loss/train': 5.633831977844238} 11/07/2021 16:54:54 - INFO - __main__ - Step 139655: {'lr': 6.0046134632264495e-06, 'samples': 26813760, 'steps': 139654, 'loss/train': 1.3968133926391602} 11/07/2021 16:54:54 - INFO - __main__ - Step 139656: {'lr': 6.003457426900366e-06, 'samples': 26813952, 'steps': 139655, 'loss/train': 1.3639369010925293} 11/07/2021 16:54:55 - INFO - __main__ - Step 139657: {'lr': 6.002301500515339e-06, 'samples': 26814144, 'steps': 139656, 'loss/train': 1.229170560836792} 11/07/2021 16:54:55 - INFO - __main__ - Step 139658: {'lr': 6.0011456840718955e-06, 'samples': 26814336, 'steps': 139657, 'loss/train': 1.325547695159912} 11/07/2021 16:54:55 - INFO - __main__ - Step 139659: {'lr': 5.9999899775705916e-06, 'samples': 26814528, 'steps': 139658, 'loss/train': 1.5785291194915771} 11/07/2021 16:54:57 - INFO - __main__ - Step 139660: {'lr': 5.998834381011925e-06, 'samples': 26814720, 'steps': 139659, 'loss/train': 0.7268579602241516} 11/07/2021 16:54:57 - INFO - __main__ - Step 139661: {'lr': 5.997678894396424e-06, 'samples': 26814912, 'steps': 139660, 'loss/train': 1.3041455745697021} 11/07/2021 16:54:57 - INFO - __main__ - Step 139662: {'lr': 5.996523517724589e-06, 'samples': 26815104, 'steps': 139661, 'loss/train': 1.2810739278793335} 11/07/2021 16:54:58 - INFO - __main__ - Step 139663: {'lr': 5.995368250996947e-06, 'samples': 26815296, 'steps': 139662, 'loss/train': 1.3864905834197998} 11/07/2021 16:54:58 - INFO - __main__ - Step 139664: {'lr': 5.9942130942140516e-06, 'samples': 26815488, 'steps': 139663, 'loss/train': 0.9343996047973633} 11/07/2021 16:54:59 - INFO - __main__ - Step 139665: {'lr': 5.993058047376376e-06, 'samples': 26815680, 'steps': 139664, 'loss/train': 1.4427191019058228} 11/07/2021 16:54:59 - INFO - __main__ - Step 139666: {'lr': 5.9919031104845035e-06, 'samples': 26815872, 'steps': 139665, 'loss/train': 1.0868408679962158} 11/07/2021 16:55:00 - INFO - __main__ - Step 139667: {'lr': 5.990748283538877e-06, 'samples': 26816064, 'steps': 139666, 'loss/train': 0.4968287944793701} 11/07/2021 16:55:00 - INFO - __main__ - Step 139668: {'lr': 5.98959356654008e-06, 'samples': 26816256, 'steps': 139667, 'loss/train': 1.2403289079666138} 11/07/2021 16:55:00 - INFO - __main__ - Step 139669: {'lr': 5.988438959488584e-06, 'samples': 26816448, 'steps': 139668, 'loss/train': 1.799087405204773} 11/07/2021 16:55:03 - INFO - __main__ - Step 139670: {'lr': 5.987284462384945e-06, 'samples': 26816640, 'steps': 139669, 'loss/train': 1.8791377544403076} 11/07/2021 16:55:03 - INFO - __main__ - Step 139671: {'lr': 5.986130075229662e-06, 'samples': 26816832, 'steps': 139670, 'loss/train': 1.5727654695510864} 11/07/2021 16:55:04 - INFO - __main__ - Step 139672: {'lr': 5.984975798023262e-06, 'samples': 26817024, 'steps': 139671, 'loss/train': 1.2527108192443848} 11/07/2021 16:55:04 - INFO - __main__ - Step 139673: {'lr': 5.983821630766273e-06, 'samples': 26817216, 'steps': 139672, 'loss/train': 0.9560865759849548} 11/07/2021 16:55:04 - INFO - __main__ - Step 139674: {'lr': 5.982667573459194e-06, 'samples': 26817408, 'steps': 139673, 'loss/train': 1.4679549932479858} 11/07/2021 16:55:05 - INFO - __main__ - Step 139675: {'lr': 5.98151362610258e-06, 'samples': 26817600, 'steps': 139674, 'loss/train': 0.8461706638336182} 11/07/2021 16:55:05 - INFO - __main__ - Step 139676: {'lr': 5.980359788696904e-06, 'samples': 26817792, 'steps': 139675, 'loss/train': 1.7177348136901855} 11/07/2021 16:55:05 - INFO - __main__ - Step 139677: {'lr': 5.979206061242776e-06, 'samples': 26817984, 'steps': 139676, 'loss/train': 1.6960217952728271} 11/07/2021 16:55:06 - INFO - __main__ - Step 139678: {'lr': 5.978052443740584e-06, 'samples': 26818176, 'steps': 139677, 'loss/train': 1.7369976043701172} 11/07/2021 16:55:07 - INFO - __main__ - Step 139679: {'lr': 5.976898936190939e-06, 'samples': 26818368, 'steps': 139678, 'loss/train': 1.5525866746902466} 11/07/2021 16:55:07 - INFO - __main__ - Step 139680: {'lr': 5.97574553859434e-06, 'samples': 26818560, 'steps': 139679, 'loss/train': 0.8890566825866699} 11/07/2021 16:55:08 - INFO - __main__ - Step 139681: {'lr': 5.974592250951316e-06, 'samples': 26818752, 'steps': 139680, 'loss/train': 1.2342536449432373} 11/07/2021 16:55:08 - INFO - __main__ - Step 139682: {'lr': 5.973439073262366e-06, 'samples': 26818944, 'steps': 139681, 'loss/train': 0.9096999764442444} 11/07/2021 16:55:08 - INFO - __main__ - Step 139683: {'lr': 5.972286005527988e-06, 'samples': 26819136, 'steps': 139682, 'loss/train': 1.4552273750305176} 11/07/2021 16:55:09 - INFO - __main__ - Step 139684: {'lr': 5.9711330477487666e-06, 'samples': 26819328, 'steps': 139683, 'loss/train': 1.3663203716278076} 11/07/2021 16:55:10 - INFO - __main__ - Step 139685: {'lr': 5.969980199925174e-06, 'samples': 26819520, 'steps': 139684, 'loss/train': 1.0410798788070679} 11/07/2021 16:55:10 - INFO - __main__ - Step 139686: {'lr': 5.968827462057763e-06, 'samples': 26819712, 'steps': 139685, 'loss/train': 1.2856365442276} 11/07/2021 16:55:10 - INFO - __main__ - Step 139687: {'lr': 5.967674834147035e-06, 'samples': 26819904, 'steps': 139686, 'loss/train': 1.5676593780517578} 11/07/2021 16:55:11 - INFO - __main__ - Step 139688: {'lr': 5.96652231619349e-06, 'samples': 26820096, 'steps': 139687, 'loss/train': 1.6226418018341064} 11/07/2021 16:55:12 - INFO - __main__ - Step 139689: {'lr': 5.9653699081976545e-06, 'samples': 26820288, 'steps': 139688, 'loss/train': 1.5671312808990479} 11/07/2021 16:55:12 - INFO - __main__ - Step 139690: {'lr': 5.964217610160055e-06, 'samples': 26820480, 'steps': 139689, 'loss/train': 1.2544971704483032} 11/07/2021 16:55:12 - INFO - __main__ - Step 139691: {'lr': 5.963065422081249e-06, 'samples': 26820672, 'steps': 139690, 'loss/train': 1.6306395530700684} 11/07/2021 16:55:13 - INFO - __main__ - Step 139692: {'lr': 5.961913343961678e-06, 'samples': 26820864, 'steps': 139691, 'loss/train': 1.486158847808838} 11/07/2021 16:55:13 - INFO - __main__ - Step 139693: {'lr': 5.960761375801927e-06, 'samples': 26821056, 'steps': 139692, 'loss/train': 1.4021940231323242} 11/07/2021 16:55:14 - INFO - __main__ - Step 139694: {'lr': 5.959609517602493e-06, 'samples': 26821248, 'steps': 139693, 'loss/train': 0.8294188380241394} 11/07/2021 16:55:15 - INFO - __main__ - Step 139695: {'lr': 5.958457769363879e-06, 'samples': 26821440, 'steps': 139694, 'loss/train': 1.2296671867370605} 11/07/2021 16:55:15 - INFO - __main__ - Step 139696: {'lr': 5.957306131086609e-06, 'samples': 26821632, 'steps': 139695, 'loss/train': 1.0446674823760986} 11/07/2021 16:55:15 - INFO - __main__ - Step 139697: {'lr': 5.956154602771241e-06, 'samples': 26821824, 'steps': 139696, 'loss/train': 0.7453622221946716} 11/07/2021 16:55:16 - INFO - __main__ - Step 139698: {'lr': 5.955003184418273e-06, 'samples': 26822016, 'steps': 139697, 'loss/train': 1.2203452587127686} 11/07/2021 16:55:16 - INFO - __main__ - Step 139699: {'lr': 5.953851876028177e-06, 'samples': 26822208, 'steps': 139698, 'loss/train': 1.0661251544952393} 11/07/2021 16:55:18 - INFO - __main__ - Step 139700: {'lr': 5.952700677601536e-06, 'samples': 26822400, 'steps': 139699, 'loss/train': 0.9709337949752808} 11/07/2021 16:55:18 - INFO - __main__ - Step 139701: {'lr': 5.951549589138822e-06, 'samples': 26822592, 'steps': 139700, 'loss/train': 1.1612991094589233} 11/07/2021 16:55:18 - INFO - __main__ - Step 139702: {'lr': 5.9503986106405895e-06, 'samples': 26822784, 'steps': 139701, 'loss/train': 1.2455353736877441} 11/07/2021 16:55:19 - INFO - __main__ - Step 139703: {'lr': 5.949247742107311e-06, 'samples': 26822976, 'steps': 139702, 'loss/train': 1.7984400987625122} 11/07/2021 16:55:19 - INFO - __main__ - Step 139704: {'lr': 5.948096983539569e-06, 'samples': 26823168, 'steps': 139703, 'loss/train': 0.35243576765060425} 11/07/2021 16:55:19 - INFO - __main__ - Step 139705: {'lr': 5.946946334937836e-06, 'samples': 26823360, 'steps': 139704, 'loss/train': 1.3747475147247314} 11/07/2021 16:55:20 - INFO - __main__ - Step 139706: {'lr': 5.945795796302638e-06, 'samples': 26823552, 'steps': 139705, 'loss/train': 0.4207613468170166} 11/07/2021 16:55:21 - INFO - __main__ - Step 139707: {'lr': 5.9446453676345045e-06, 'samples': 26823744, 'steps': 139706, 'loss/train': 1.218497395515442} 11/07/2021 16:55:21 - INFO - __main__ - Step 139708: {'lr': 5.943495048933961e-06, 'samples': 26823936, 'steps': 139707, 'loss/train': 1.3162360191345215} 11/07/2021 16:55:21 - INFO - __main__ - Step 139709: {'lr': 5.942344840201508e-06, 'samples': 26824128, 'steps': 139708, 'loss/train': 1.1853878498077393} 11/07/2021 16:55:22 - INFO - __main__ - Step 139710: {'lr': 5.941194741437672e-06, 'samples': 26824320, 'steps': 139709, 'loss/train': 1.2116258144378662} 11/07/2021 16:55:23 - INFO - __main__ - Step 139711: {'lr': 5.9400447526429535e-06, 'samples': 26824512, 'steps': 139710, 'loss/train': 1.3231757879257202} 11/07/2021 16:55:23 - INFO - __main__ - Step 139712: {'lr': 5.938894873817879e-06, 'samples': 26824704, 'steps': 139711, 'loss/train': 0.7008940577507019} 11/07/2021 16:55:23 - INFO - __main__ - Step 139713: {'lr': 5.937745104962977e-06, 'samples': 26824896, 'steps': 139712, 'loss/train': 1.21067214012146} 11/07/2021 16:55:24 - INFO - __main__ - Step 139714: {'lr': 5.936595446078774e-06, 'samples': 26825088, 'steps': 139713, 'loss/train': 1.1343858242034912} 11/07/2021 16:55:24 - INFO - __main__ - Step 139715: {'lr': 5.93544589716577e-06, 'samples': 26825280, 'steps': 139714, 'loss/train': 1.2119572162628174} 11/07/2021 16:55:25 - INFO - __main__ - Step 139716: {'lr': 5.934296458224464e-06, 'samples': 26825472, 'steps': 139715, 'loss/train': 1.21440589427948} 11/07/2021 16:55:26 - INFO - __main__ - Step 139717: {'lr': 5.93314712925544e-06, 'samples': 26825664, 'steps': 139716, 'loss/train': 1.1486116647720337} 11/07/2021 16:55:26 - INFO - __main__ - Step 139718: {'lr': 5.931997910259141e-06, 'samples': 26825856, 'steps': 139717, 'loss/train': 1.1407053470611572} 11/07/2021 16:55:26 - INFO - __main__ - Step 139719: {'lr': 5.930848801236122e-06, 'samples': 26826048, 'steps': 139718, 'loss/train': 1.0811388492584229} 11/07/2021 16:55:27 - INFO - __main__ - Step 139720: {'lr': 5.929699802186911e-06, 'samples': 26826240, 'steps': 139719, 'loss/train': 1.3913135528564453} 11/07/2021 16:55:28 - INFO - __main__ - Step 139721: {'lr': 5.928550913112008e-06, 'samples': 26826432, 'steps': 139720, 'loss/train': 1.273470163345337} 11/07/2021 16:55:28 - INFO - __main__ - Step 139722: {'lr': 5.927402134011911e-06, 'samples': 26826624, 'steps': 139721, 'loss/train': 1.082757830619812} 11/07/2021 16:55:28 - INFO - __main__ - Step 139723: {'lr': 5.926253464887204e-06, 'samples': 26826816, 'steps': 139722, 'loss/train': 1.4100227355957031} 11/07/2021 16:55:29 - INFO - __main__ - Step 139724: {'lr': 5.925104905738332e-06, 'samples': 26827008, 'steps': 139723, 'loss/train': 1.0583826303482056} 11/07/2021 16:55:29 - INFO - __main__ - Step 139725: {'lr': 5.9239564565658485e-06, 'samples': 26827200, 'steps': 139724, 'loss/train': 0.8488168716430664} 11/07/2021 16:55:30 - INFO - __main__ - Step 139726: {'lr': 5.922808117370254e-06, 'samples': 26827392, 'steps': 139725, 'loss/train': 0.48895159363746643} 11/07/2021 16:55:31 - INFO - __main__ - Step 139727: {'lr': 5.921659888152075e-06, 'samples': 26827584, 'steps': 139726, 'loss/train': 0.6651192903518677} 11/07/2021 16:55:31 - INFO - __main__ - Step 139728: {'lr': 5.92051176891184e-06, 'samples': 26827776, 'steps': 139727, 'loss/train': 1.468629240989685} 11/07/2021 16:55:31 - INFO - __main__ - Step 139729: {'lr': 5.919363759650049e-06, 'samples': 26827968, 'steps': 139728, 'loss/train': 0.9950185418128967} 11/07/2021 16:55:32 - INFO - __main__ - Step 139730: {'lr': 5.918215860367227e-06, 'samples': 26828160, 'steps': 139729, 'loss/train': 2.1542251110076904} 11/07/2021 16:55:33 - INFO - __main__ - Step 139731: {'lr': 5.917068071063902e-06, 'samples': 26828352, 'steps': 139730, 'loss/train': 1.53778076171875} 11/07/2021 16:55:33 - INFO - __main__ - Step 139732: {'lr': 5.915920391740548e-06, 'samples': 26828544, 'steps': 139731, 'loss/train': 1.5811108350753784} 11/07/2021 16:55:33 - INFO - __main__ - Step 139733: {'lr': 5.914772822397746e-06, 'samples': 26828736, 'steps': 139732, 'loss/train': 0.8124833106994629} 11/07/2021 16:55:34 - INFO - __main__ - Step 139734: {'lr': 5.913625363035969e-06, 'samples': 26828928, 'steps': 139733, 'loss/train': 1.2685935497283936} 11/07/2021 16:55:34 - INFO - __main__ - Step 139735: {'lr': 5.912478013655742e-06, 'samples': 26829120, 'steps': 139734, 'loss/train': 1.369620442390442} 11/07/2021 16:55:35 - INFO - __main__ - Step 139736: {'lr': 5.911330774257623e-06, 'samples': 26829312, 'steps': 139735, 'loss/train': 1.629082202911377} 11/07/2021 16:55:36 - INFO - __main__ - Step 139737: {'lr': 5.910183644842054e-06, 'samples': 26829504, 'steps': 139736, 'loss/train': 1.2064015865325928} 11/07/2021 16:55:36 - INFO - __main__ - Step 139738: {'lr': 5.909036625409592e-06, 'samples': 26829696, 'steps': 139737, 'loss/train': 1.2683868408203125} 11/07/2021 16:55:36 - INFO - __main__ - Step 139739: {'lr': 5.907889715960762e-06, 'samples': 26829888, 'steps': 139738, 'loss/train': 1.2298741340637207} 11/07/2021 16:55:37 - INFO - __main__ - Step 139740: {'lr': 5.9067429164960665e-06, 'samples': 26830080, 'steps': 139739, 'loss/train': 0.9161929488182068} 11/07/2021 16:55:38 - INFO - __main__ - Step 139741: {'lr': 5.90559622701603e-06, 'samples': 26830272, 'steps': 139740, 'loss/train': 0.9829636216163635} 11/07/2021 16:55:38 - INFO - __main__ - Step 139742: {'lr': 5.904449647521154e-06, 'samples': 26830464, 'steps': 139741, 'loss/train': 1.3356248140335083} 11/07/2021 16:55:38 - INFO - __main__ - Step 139743: {'lr': 5.903303178011965e-06, 'samples': 26830656, 'steps': 139742, 'loss/train': 1.2846298217773438} 11/07/2021 16:55:39 - INFO - __main__ - Step 139744: {'lr': 5.90215681848899e-06, 'samples': 26830848, 'steps': 139743, 'loss/train': 1.3857921361923218} 11/07/2021 16:55:39 - INFO - __main__ - Step 139745: {'lr': 5.90101056895273e-06, 'samples': 26831040, 'steps': 139744, 'loss/train': 1.4395430088043213} 11/07/2021 16:55:39 - INFO - __main__ - Step 139746: {'lr': 5.899864429403712e-06, 'samples': 26831232, 'steps': 139745, 'loss/train': 0.9243510961532593} 11/07/2021 16:55:41 - INFO - __main__ - Step 139747: {'lr': 5.898718399842435e-06, 'samples': 26831424, 'steps': 139746, 'loss/train': 1.460442066192627} 11/07/2021 16:55:41 - INFO - __main__ - Step 139748: {'lr': 5.897572480269453e-06, 'samples': 26831616, 'steps': 139747, 'loss/train': 1.2706876993179321} 11/07/2021 16:55:41 - INFO - __main__ - Step 139749: {'lr': 5.89642667068524e-06, 'samples': 26831808, 'steps': 139748, 'loss/train': 1.3923285007476807} 11/07/2021 16:55:42 - INFO - __main__ - Step 139750: {'lr': 5.89528097109035e-06, 'samples': 26832000, 'steps': 139749, 'loss/train': 1.403555989265442} 11/07/2021 16:55:42 - INFO - __main__ - Step 139751: {'lr': 5.8941353814852825e-06, 'samples': 26832192, 'steps': 139750, 'loss/train': 1.3586453199386597} 11/07/2021 16:55:43 - INFO - __main__ - Step 139752: {'lr': 5.892989901870538e-06, 'samples': 26832384, 'steps': 139751, 'loss/train': 1.088430404663086} 11/07/2021 16:55:43 - INFO - __main__ - Step 139753: {'lr': 5.891844532246643e-06, 'samples': 26832576, 'steps': 139752, 'loss/train': 1.063808560371399} 11/07/2021 16:55:44 - INFO - __main__ - Step 139754: {'lr': 5.890699272614097e-06, 'samples': 26832768, 'steps': 139753, 'loss/train': 0.9456320405006409} 11/07/2021 16:55:44 - INFO - __main__ - Step 139755: {'lr': 5.8895541229734566e-06, 'samples': 26832960, 'steps': 139754, 'loss/train': 1.4394434690475464} 11/07/2021 16:55:44 - INFO - __main__ - Step 139756: {'lr': 5.888409083325219e-06, 'samples': 26833152, 'steps': 139755, 'loss/train': 1.312744140625} 11/07/2021 16:55:46 - INFO - __main__ - Step 139757: {'lr': 5.8872641536698856e-06, 'samples': 26833344, 'steps': 139756, 'loss/train': 1.689226746559143} 11/07/2021 16:55:46 - INFO - __main__ - Step 139758: {'lr': 5.886119334007984e-06, 'samples': 26833536, 'steps': 139757, 'loss/train': 0.9719366431236267} 11/07/2021 16:55:47 - INFO - __main__ - Step 139759: {'lr': 5.8849746243400395e-06, 'samples': 26833728, 'steps': 139758, 'loss/train': 1.6518415212631226} 11/07/2021 16:55:47 - INFO - __main__ - Step 139760: {'lr': 5.883830024666553e-06, 'samples': 26833920, 'steps': 139759, 'loss/train': 1.2822849750518799} 11/07/2021 16:55:47 - INFO - __main__ - Step 139761: {'lr': 5.882685534988053e-06, 'samples': 26834112, 'steps': 139760, 'loss/train': 0.10477413237094879} 11/07/2021 16:55:48 - INFO - __main__ - Step 139762: {'lr': 5.881541155305037e-06, 'samples': 26834304, 'steps': 139761, 'loss/train': 0.9134489297866821} 11/07/2021 16:55:49 - INFO - __main__ - Step 139763: {'lr': 5.880396885618061e-06, 'samples': 26834496, 'steps': 139762, 'loss/train': 1.2453373670578003} 11/07/2021 16:55:49 - INFO - __main__ - Step 139764: {'lr': 5.879252725927598e-06, 'samples': 26834688, 'steps': 139763, 'loss/train': 1.5333613157272339} 11/07/2021 16:55:49 - INFO - __main__ - Step 139765: {'lr': 5.8781086762341455e-06, 'samples': 26834880, 'steps': 139764, 'loss/train': 1.944799542427063} 11/07/2021 16:55:50 - INFO - __main__ - Step 139766: {'lr': 5.8769647365382875e-06, 'samples': 26835072, 'steps': 139765, 'loss/train': 1.074033498764038} 11/07/2021 16:55:50 - INFO - __main__ - Step 139767: {'lr': 5.8758209068404675e-06, 'samples': 26835264, 'steps': 139766, 'loss/train': 0.797667384147644} 11/07/2021 16:55:51 - INFO - __main__ - Step 139768: {'lr': 5.87467718714127e-06, 'samples': 26835456, 'steps': 139767, 'loss/train': 1.3468923568725586} 11/07/2021 16:55:51 - INFO - __main__ - Step 139769: {'lr': 5.873533577441164e-06, 'samples': 26835648, 'steps': 139768, 'loss/train': 1.1025298833847046} 11/07/2021 16:55:52 - INFO - __main__ - Step 139770: {'lr': 5.872390077740653e-06, 'samples': 26835840, 'steps': 139769, 'loss/train': 1.3230682611465454} 11/07/2021 16:55:52 - INFO - __main__ - Step 139771: {'lr': 5.871246688040316e-06, 'samples': 26836032, 'steps': 139770, 'loss/train': 1.4016193151474} 11/07/2021 16:55:53 - INFO - __main__ - Step 139772: {'lr': 5.870103408340599e-06, 'samples': 26836224, 'steps': 139771, 'loss/train': 1.500889539718628} 11/07/2021 16:55:54 - INFO - __main__ - Step 139773: {'lr': 5.868960238642057e-06, 'samples': 26836416, 'steps': 139772, 'loss/train': 1.2207282781600952} 11/07/2021 16:55:54 - INFO - __main__ - Step 139774: {'lr': 5.86781717894519e-06, 'samples': 26836608, 'steps': 139773, 'loss/train': 0.8691325783729553} 11/07/2021 16:55:54 - INFO - __main__ - Step 139775: {'lr': 5.866674229250524e-06, 'samples': 26836800, 'steps': 139774, 'loss/train': 1.5401333570480347} 11/07/2021 16:55:55 - INFO - __main__ - Step 139776: {'lr': 5.865531389558559e-06, 'samples': 26836992, 'steps': 139775, 'loss/train': 1.0574229955673218} 11/07/2021 16:55:55 - INFO - __main__ - Step 139777: {'lr': 5.864388659869824e-06, 'samples': 26837184, 'steps': 139776, 'loss/train': 1.587717890739441} 11/07/2021 16:55:56 - INFO - __main__ - Step 139778: {'lr': 5.863246040184844e-06, 'samples': 26837376, 'steps': 139777, 'loss/train': 0.8171401023864746} 11/07/2021 16:55:57 - INFO - __main__ - Step 139779: {'lr': 5.862103530504092e-06, 'samples': 26837568, 'steps': 139778, 'loss/train': 0.5304549932479858} 11/07/2021 16:55:57 - INFO - __main__ - Step 139780: {'lr': 5.860961130828124e-06, 'samples': 26837760, 'steps': 139779, 'loss/train': 1.4677305221557617} 11/07/2021 16:55:57 - INFO - __main__ - Step 139781: {'lr': 5.85981884115741e-06, 'samples': 26837952, 'steps': 139780, 'loss/train': 1.0949020385742188} 11/07/2021 16:55:58 - INFO - __main__ - Step 139782: {'lr': 5.858676661492535e-06, 'samples': 26838144, 'steps': 139781, 'loss/train': 1.3326119184494019} 11/07/2021 16:55:58 - INFO - __main__ - Step 139783: {'lr': 5.85753459183394e-06, 'samples': 26838336, 'steps': 139782, 'loss/train': 1.4397364854812622} 11/07/2021 16:55:59 - INFO - __main__ - Step 139784: {'lr': 5.856392632182184e-06, 'samples': 26838528, 'steps': 139783, 'loss/train': 1.582048773765564} 11/07/2021 16:56:00 - INFO - __main__ - Step 139785: {'lr': 5.855250782537791e-06, 'samples': 26838720, 'steps': 139784, 'loss/train': 4.949529647827148} 11/07/2021 16:56:00 - INFO - __main__ - Step 139786: {'lr': 5.8541090429012346e-06, 'samples': 26838912, 'steps': 139785, 'loss/train': 1.2157188653945923} 11/07/2021 16:56:00 - INFO - __main__ - Step 139787: {'lr': 5.852967413273042e-06, 'samples': 26839104, 'steps': 139786, 'loss/train': 0.8657254576683044} 11/07/2021 16:56:01 - INFO - __main__ - Step 139788: {'lr': 5.8518258936537394e-06, 'samples': 26839296, 'steps': 139787, 'loss/train': 1.0971578359603882} 11/07/2021 16:56:01 - INFO - __main__ - Step 139789: {'lr': 5.850684484043856e-06, 'samples': 26839488, 'steps': 139788, 'loss/train': 1.6056100130081177} 11/07/2021 16:56:02 - INFO - __main__ - Step 139790: {'lr': 5.84954318444389e-06, 'samples': 26839680, 'steps': 139789, 'loss/train': 1.0866631269454956} 11/07/2021 16:56:03 - INFO - __main__ - Step 139791: {'lr': 5.848401994854341e-06, 'samples': 26839872, 'steps': 139790, 'loss/train': 1.204818606376648} 11/07/2021 16:56:03 - INFO - __main__ - Step 139792: {'lr': 5.847260915275737e-06, 'samples': 26840064, 'steps': 139791, 'loss/train': 2.3124990463256836} 11/07/2021 16:56:03 - INFO - __main__ - Step 139793: {'lr': 5.846119945708578e-06, 'samples': 26840256, 'steps': 139792, 'loss/train': 1.0355472564697266} 11/07/2021 16:56:04 - INFO - __main__ - Step 139794: {'lr': 5.8449790861533906e-06, 'samples': 26840448, 'steps': 139793, 'loss/train': 1.103738784790039} 11/07/2021 16:56:05 - INFO - __main__ - Step 139795: {'lr': 5.843838336610674e-06, 'samples': 26840640, 'steps': 139794, 'loss/train': 1.4812955856323242} 11/07/2021 16:56:05 - INFO - __main__ - Step 139796: {'lr': 5.842697697080984e-06, 'samples': 26840832, 'steps': 139795, 'loss/train': 1.1551381349563599} 11/07/2021 16:56:05 - INFO - __main__ - Step 139797: {'lr': 5.8415571675647925e-06, 'samples': 26841024, 'steps': 139796, 'loss/train': 1.3677871227264404} 11/07/2021 16:56:06 - INFO - __main__ - Step 139798: {'lr': 5.840416748062627e-06, 'samples': 26841216, 'steps': 139797, 'loss/train': 0.5423839688301086} 11/07/2021 16:56:06 - INFO - __main__ - Step 139799: {'lr': 5.839276438575014e-06, 'samples': 26841408, 'steps': 139798, 'loss/train': 1.5238142013549805} 11/07/2021 16:56:07 - INFO - __main__ - Step 139800: {'lr': 5.838136239102454e-06, 'samples': 26841600, 'steps': 139799, 'loss/train': 1.5199673175811768} 11/07/2021 16:56:08 - INFO - __main__ - Step 139801: {'lr': 5.836996149645446e-06, 'samples': 26841792, 'steps': 139800, 'loss/train': 0.5076759457588196} 11/07/2021 16:56:08 - INFO - __main__ - Step 139802: {'lr': 5.835856170204517e-06, 'samples': 26841984, 'steps': 139801, 'loss/train': 1.1488323211669922} 11/07/2021 16:56:08 - INFO - __main__ - Step 139803: {'lr': 5.834716300780197e-06, 'samples': 26842176, 'steps': 139802, 'loss/train': 1.515330195426941} 11/07/2021 16:56:09 - INFO - __main__ - Step 139804: {'lr': 5.8335765413730094e-06, 'samples': 26842368, 'steps': 139803, 'loss/train': 1.2783828973770142} 11/07/2021 16:56:09 - INFO - __main__ - Step 139805: {'lr': 5.8324368919834285e-06, 'samples': 26842560, 'steps': 139804, 'loss/train': 1.4628957509994507} 11/07/2021 16:56:10 - INFO - __main__ - Step 139806: {'lr': 5.8312973526119806e-06, 'samples': 26842752, 'steps': 139805, 'loss/train': 1.4848341941833496} 11/07/2021 16:56:10 - INFO - __main__ - Step 139807: {'lr': 5.830157923259166e-06, 'samples': 26842944, 'steps': 139806, 'loss/train': 0.9358699917793274} 11/07/2021 16:56:11 - INFO - __main__ - Step 139808: {'lr': 5.82901860392554e-06, 'samples': 26843136, 'steps': 139807, 'loss/train': 1.3288275003433228} 11/07/2021 16:56:11 - INFO - __main__ - Step 139809: {'lr': 5.827879394611574e-06, 'samples': 26843328, 'steps': 139808, 'loss/train': 1.3026583194732666} 11/07/2021 16:56:11 - INFO - __main__ - Step 139810: {'lr': 5.826740295317795e-06, 'samples': 26843520, 'steps': 139809, 'loss/train': 1.1998573541641235} 11/07/2021 16:56:13 - INFO - __main__ - Step 139811: {'lr': 5.825601306044703e-06, 'samples': 26843712, 'steps': 139810, 'loss/train': 1.545479655265808} 11/07/2021 16:56:13 - INFO - __main__ - Step 139812: {'lr': 5.824462426792854e-06, 'samples': 26843904, 'steps': 139811, 'loss/train': 1.3926033973693848} 11/07/2021 16:56:13 - INFO - __main__ - Step 139813: {'lr': 5.823323657562745e-06, 'samples': 26844096, 'steps': 139812, 'loss/train': 1.3970272541046143} 11/07/2021 16:56:14 - INFO - __main__ - Step 139814: {'lr': 5.822184998354852e-06, 'samples': 26844288, 'steps': 139813, 'loss/train': 1.017878532409668} 11/07/2021 16:56:14 - INFO - __main__ - Step 139815: {'lr': 5.821046449169726e-06, 'samples': 26844480, 'steps': 139814, 'loss/train': 1.088010549545288} 11/07/2021 16:56:15 - INFO - __main__ - Step 139816: {'lr': 5.819908010007868e-06, 'samples': 26844672, 'steps': 139815, 'loss/train': 1.6285799741744995} 11/07/2021 16:56:16 - INFO - __main__ - Step 139817: {'lr': 5.8187696808698065e-06, 'samples': 26844864, 'steps': 139816, 'loss/train': 1.0736242532730103} 11/07/2021 16:56:16 - INFO - __main__ - Step 139818: {'lr': 5.81763146175604e-06, 'samples': 26845056, 'steps': 139817, 'loss/train': 1.3202643394470215} 11/07/2021 16:56:16 - INFO - __main__ - Step 139819: {'lr': 5.816493352667041e-06, 'samples': 26845248, 'steps': 139818, 'loss/train': 1.3471765518188477} 11/07/2021 16:56:17 - INFO - __main__ - Step 139820: {'lr': 5.815355353603391e-06, 'samples': 26845440, 'steps': 139819, 'loss/train': 1.0797349214553833} 11/07/2021 16:56:18 - INFO - __main__ - Step 139821: {'lr': 5.814217464565563e-06, 'samples': 26845632, 'steps': 139820, 'loss/train': 1.280181884765625} 11/07/2021 16:56:18 - INFO - __main__ - Step 139822: {'lr': 5.813079685554084e-06, 'samples': 26845824, 'steps': 139821, 'loss/train': 0.940671443939209} 11/07/2021 16:56:18 - INFO - __main__ - Step 139823: {'lr': 5.8119420165694824e-06, 'samples': 26846016, 'steps': 139822, 'loss/train': 1.6647952795028687} 11/07/2021 16:56:19 - INFO - __main__ - Step 139824: {'lr': 5.8108044576122285e-06, 'samples': 26846208, 'steps': 139823, 'loss/train': 1.2923723459243774} 11/07/2021 16:56:19 - INFO - __main__ - Step 139825: {'lr': 5.809667008682851e-06, 'samples': 26846400, 'steps': 139824, 'loss/train': 1.5831772089004517} 11/07/2021 16:56:20 - INFO - __main__ - Step 139826: {'lr': 5.808529669781903e-06, 'samples': 26846592, 'steps': 139825, 'loss/train': 1.403663992881775} 11/07/2021 16:56:20 - INFO - __main__ - Step 139827: {'lr': 5.807392440909831e-06, 'samples': 26846784, 'steps': 139826, 'loss/train': 1.293834924697876} 11/07/2021 16:56:21 - INFO - __main__ - Step 139828: {'lr': 5.8062553220671885e-06, 'samples': 26846976, 'steps': 139827, 'loss/train': 1.3577899932861328} 11/07/2021 16:56:21 - INFO - __main__ - Step 139829: {'lr': 5.805118313254476e-06, 'samples': 26847168, 'steps': 139828, 'loss/train': 1.2579503059387207} 11/07/2021 16:56:22 - INFO - __main__ - Step 139830: {'lr': 5.80398141447222e-06, 'samples': 26847360, 'steps': 139829, 'loss/train': 0.877252459526062} 11/07/2021 16:56:23 - INFO - __main__ - Step 139831: {'lr': 5.802844625720949e-06, 'samples': 26847552, 'steps': 139830, 'loss/train': 1.307787299156189} 11/07/2021 16:56:23 - INFO - __main__ - Step 139832: {'lr': 5.801707947001106e-06, 'samples': 26847744, 'steps': 139831, 'loss/train': 1.318365216255188} 11/07/2021 16:56:23 - INFO - __main__ - Step 139833: {'lr': 5.8005713783132744e-06, 'samples': 26847936, 'steps': 139832, 'loss/train': 1.2195030450820923} 11/07/2021 16:56:24 - INFO - __main__ - Step 139834: {'lr': 5.799434919657897e-06, 'samples': 26848128, 'steps': 139833, 'loss/train': 1.1999441385269165} 11/07/2021 16:56:24 - INFO - __main__ - Step 139835: {'lr': 5.798298571035559e-06, 'samples': 26848320, 'steps': 139834, 'loss/train': 1.2622884511947632} 11/07/2021 16:56:24 - INFO - __main__ - Step 139836: {'lr': 5.797162332446731e-06, 'samples': 26848512, 'steps': 139835, 'loss/train': 1.4072179794311523} 11/07/2021 16:56:25 - INFO - __main__ - Step 139837: {'lr': 5.796026203891913e-06, 'samples': 26848704, 'steps': 139836, 'loss/train': 1.0934263467788696} 11/07/2021 16:56:26 - INFO - __main__ - Step 139838: {'lr': 5.79489018537166e-06, 'samples': 26848896, 'steps': 139837, 'loss/train': 1.2560089826583862} 11/07/2021 16:56:26 - INFO - __main__ - Step 139839: {'lr': 5.793754276886443e-06, 'samples': 26849088, 'steps': 139838, 'loss/train': 1.592359185218811} 11/07/2021 16:56:26 - INFO - __main__ - Step 139840: {'lr': 5.792618478436817e-06, 'samples': 26849280, 'steps': 139839, 'loss/train': 1.366459608078003} 11/07/2021 16:56:27 - INFO - __main__ - Step 139841: {'lr': 5.791482790023256e-06, 'samples': 26849472, 'steps': 139840, 'loss/train': 1.178861141204834} 11/07/2021 16:56:28 - INFO - __main__ - Step 139842: {'lr': 5.790347211646285e-06, 'samples': 26849664, 'steps': 139841, 'loss/train': 1.4100139141082764} 11/07/2021 16:56:28 - INFO - __main__ - Step 139843: {'lr': 5.789211743306405e-06, 'samples': 26849856, 'steps': 139842, 'loss/train': 1.2155230045318604} 11/07/2021 16:56:29 - INFO - __main__ - Step 139844: {'lr': 5.788076385004171e-06, 'samples': 26850048, 'steps': 139843, 'loss/train': 0.7840135097503662} 11/07/2021 16:56:29 - INFO - __main__ - Step 139845: {'lr': 5.786941136740054e-06, 'samples': 26850240, 'steps': 139844, 'loss/train': 1.170481562614441} 11/07/2021 16:56:29 - INFO - __main__ - Step 139846: {'lr': 5.7858059985145536e-06, 'samples': 26850432, 'steps': 139845, 'loss/train': 1.7292070388793945} 11/07/2021 16:56:30 - INFO - __main__ - Step 139847: {'lr': 5.784670970328198e-06, 'samples': 26850624, 'steps': 139846, 'loss/train': 1.3476831912994385} 11/07/2021 16:56:31 - INFO - __main__ - Step 139848: {'lr': 5.783536052181515e-06, 'samples': 26850816, 'steps': 139847, 'loss/train': 1.4872561693191528} 11/07/2021 16:56:31 - INFO - __main__ - Step 139849: {'lr': 5.782401244074975e-06, 'samples': 26851008, 'steps': 139848, 'loss/train': 1.4384750127792358} 11/07/2021 16:56:31 - INFO - __main__ - Step 139850: {'lr': 5.781266546009134e-06, 'samples': 26851200, 'steps': 139849, 'loss/train': 1.4883474111557007} 11/07/2021 16:56:32 - INFO - __main__ - Step 139851: {'lr': 5.780131957984492e-06, 'samples': 26851392, 'steps': 139850, 'loss/train': 1.3649470806121826} 11/07/2021 16:56:33 - INFO - __main__ - Step 139852: {'lr': 5.778997480001547e-06, 'samples': 26851584, 'steps': 139851, 'loss/train': 1.0640050172805786} 11/07/2021 16:56:34 - INFO - __main__ - Step 139853: {'lr': 5.777863112060827e-06, 'samples': 26851776, 'steps': 139852, 'loss/train': 0.9020656943321228} 11/07/2021 16:56:34 - INFO - __main__ - Step 139854: {'lr': 5.776728854162833e-06, 'samples': 26851968, 'steps': 139853, 'loss/train': 1.2275595664978027} 11/07/2021 16:56:34 - INFO - __main__ - Step 139855: {'lr': 5.775594706308063e-06, 'samples': 26852160, 'steps': 139854, 'loss/train': 0.6069278120994568} 11/07/2021 16:56:35 - INFO - __main__ - Step 139856: {'lr': 5.7744606684970444e-06, 'samples': 26852352, 'steps': 139855, 'loss/train': 0.9315148591995239} 11/07/2021 16:56:36 - INFO - __main__ - Step 139857: {'lr': 5.773326740730306e-06, 'samples': 26852544, 'steps': 139856, 'loss/train': 0.05148935317993164} 11/07/2021 16:56:36 - INFO - __main__ - Step 139858: {'lr': 5.772192923008318e-06, 'samples': 26852736, 'steps': 139857, 'loss/train': 1.3529807329177856} 11/07/2021 16:56:36 - INFO - __main__ - Step 139859: {'lr': 5.771059215331637e-06, 'samples': 26852928, 'steps': 139858, 'loss/train': 0.9950188398361206} 11/07/2021 16:56:37 - INFO - __main__ - Step 139860: {'lr': 5.769925617700705e-06, 'samples': 26853120, 'steps': 139859, 'loss/train': 1.1505433320999146} 11/07/2021 16:56:37 - INFO - __main__ - Step 139861: {'lr': 5.768792130116108e-06, 'samples': 26853312, 'steps': 139860, 'loss/train': 1.391923427581787} 11/07/2021 16:56:38 - INFO - __main__ - Step 139862: {'lr': 5.7676587525783144e-06, 'samples': 26853504, 'steps': 139861, 'loss/train': 1.046412467956543} 11/07/2021 16:56:39 - INFO - __main__ - Step 139863: {'lr': 5.766525485087826e-06, 'samples': 26853696, 'steps': 139862, 'loss/train': 1.3669294118881226} 11/07/2021 16:56:39 - INFO - __main__ - Step 139864: {'lr': 5.7653923276451965e-06, 'samples': 26853888, 'steps': 139863, 'loss/train': 1.795312762260437} 11/07/2021 16:56:39 - INFO - __main__ - Step 139865: {'lr': 5.764259280250899e-06, 'samples': 26854080, 'steps': 139864, 'loss/train': 1.0256165266036987} 11/07/2021 16:56:40 - INFO - __main__ - Step 139866: {'lr': 5.763126342905461e-06, 'samples': 26854272, 'steps': 139865, 'loss/train': 1.2778244018554688} 11/07/2021 16:56:40 - INFO - __main__ - Step 139867: {'lr': 5.761993515609409e-06, 'samples': 26854464, 'steps': 139866, 'loss/train': 2.90146803855896} 11/07/2021 16:56:41 - INFO - __main__ - Step 139868: {'lr': 5.760860798363216e-06, 'samples': 26854656, 'steps': 139867, 'loss/train': 0.047507982701063156} 11/07/2021 16:56:41 - INFO - __main__ - Step 139869: {'lr': 5.759728191167407e-06, 'samples': 26854848, 'steps': 139868, 'loss/train': 1.3498883247375488} 11/07/2021 16:56:42 - INFO - __main__ - Step 139870: {'lr': 5.758595694022484e-06, 'samples': 26855040, 'steps': 139869, 'loss/train': 1.3088629245758057} 11/07/2021 16:56:42 - INFO - __main__ - Step 139871: {'lr': 5.757463306929028e-06, 'samples': 26855232, 'steps': 139870, 'loss/train': 0.9053886532783508} 11/07/2021 16:56:42 - INFO - __main__ - Step 139872: {'lr': 5.75633102988743e-06, 'samples': 26855424, 'steps': 139871, 'loss/train': 1.1261566877365112} 11/07/2021 16:56:43 - INFO - __main__ - Step 139873: {'lr': 5.75519886289827e-06, 'samples': 26855616, 'steps': 139872, 'loss/train': 1.3951853513717651} 11/07/2021 16:56:44 - INFO - __main__ - Step 139874: {'lr': 5.754066805962077e-06, 'samples': 26855808, 'steps': 139873, 'loss/train': 0.09904265403747559} 11/07/2021 16:56:44 - INFO - __main__ - Step 139875: {'lr': 5.752934859079295e-06, 'samples': 26856000, 'steps': 139874, 'loss/train': 1.313240885734558} 11/07/2021 16:56:45 - INFO - __main__ - Step 139876: {'lr': 5.751803022250479e-06, 'samples': 26856192, 'steps': 139875, 'loss/train': 0.9643576741218567} 11/07/2021 16:56:45 - INFO - __main__ - Step 139877: {'lr': 5.750671295476157e-06, 'samples': 26856384, 'steps': 139876, 'loss/train': 1.0875928401947021} 11/07/2021 16:56:46 - INFO - __main__ - Step 139878: {'lr': 5.7495396787567984e-06, 'samples': 26856576, 'steps': 139877, 'loss/train': 0.9583622813224792} 11/07/2021 16:56:46 - INFO - __main__ - Step 139879: {'lr': 5.748408172092933e-06, 'samples': 26856768, 'steps': 139878, 'loss/train': 1.1629753112792969} 11/07/2021 16:56:47 - INFO - __main__ - Step 139880: {'lr': 5.747276775485033e-06, 'samples': 26856960, 'steps': 139879, 'loss/train': 1.218180775642395} 11/07/2021 16:56:47 - INFO - __main__ - Step 139881: {'lr': 5.746145488933679e-06, 'samples': 26857152, 'steps': 139880, 'loss/train': 1.3975803852081299} 11/07/2021 16:56:47 - INFO - __main__ - Step 139882: {'lr': 5.745014312439345e-06, 'samples': 26857344, 'steps': 139881, 'loss/train': 1.25668203830719} 11/07/2021 16:56:48 - INFO - __main__ - Step 139883: {'lr': 5.743883246002501e-06, 'samples': 26857536, 'steps': 139882, 'loss/train': 1.216389775276184} 11/07/2021 16:56:49 - INFO - __main__ - Step 139884: {'lr': 5.74275228962376e-06, 'samples': 26857728, 'steps': 139883, 'loss/train': 1.316116213798523} 11/07/2021 16:56:49 - INFO - __main__ - Step 139885: {'lr': 5.741621443303507e-06, 'samples': 26857920, 'steps': 139884, 'loss/train': 0.8296110033988953} 11/07/2021 16:56:49 - INFO - __main__ - Step 139886: {'lr': 5.7404907070423286e-06, 'samples': 26858112, 'steps': 139885, 'loss/train': 1.0244731903076172} 11/07/2021 16:56:50 - INFO - __main__ - Step 139887: {'lr': 5.739360080840722e-06, 'samples': 26858304, 'steps': 139886, 'loss/train': 1.1681489944458008} 11/07/2021 16:56:51 - INFO - __main__ - Step 139888: {'lr': 5.738229564699188e-06, 'samples': 26858496, 'steps': 139887, 'loss/train': 1.5581544637680054} 11/07/2021 16:56:51 - INFO - __main__ - Step 139889: {'lr': 5.737099158618225e-06, 'samples': 26858688, 'steps': 139888, 'loss/train': 1.1917942762374878} 11/07/2021 16:56:52 - INFO - __main__ - Step 139890: {'lr': 5.7359688625983616e-06, 'samples': 26858880, 'steps': 139889, 'loss/train': 1.184585690498352} 11/07/2021 16:56:52 - INFO - __main__ - Step 139891: {'lr': 5.7348386766400975e-06, 'samples': 26859072, 'steps': 139890, 'loss/train': 1.406082034111023} 11/07/2021 16:56:52 - INFO - __main__ - Step 139892: {'lr': 5.733708600743959e-06, 'samples': 26859264, 'steps': 139891, 'loss/train': 1.0878398418426514} 11/07/2021 16:56:54 - INFO - __main__ - Step 139893: {'lr': 5.732578634910446e-06, 'samples': 26859456, 'steps': 139892, 'loss/train': 1.2659943103790283} 11/07/2021 16:56:54 - INFO - __main__ - Step 139894: {'lr': 5.7314487791400305e-06, 'samples': 26859648, 'steps': 139893, 'loss/train': 1.5704578161239624} 11/07/2021 16:56:54 - INFO - __main__ - Step 139895: {'lr': 5.730319033433295e-06, 'samples': 26859840, 'steps': 139894, 'loss/train': 1.358097791671753} 11/07/2021 16:56:55 - INFO - __main__ - Step 139896: {'lr': 5.7291893977906855e-06, 'samples': 26860032, 'steps': 139895, 'loss/train': 1.4626458883285522} 11/07/2021 16:56:55 - INFO - __main__ - Step 139897: {'lr': 5.728059872212754e-06, 'samples': 26860224, 'steps': 139896, 'loss/train': 0.09207943081855774} 11/07/2021 16:56:56 - INFO - __main__ - Step 139898: {'lr': 5.726930456699975e-06, 'samples': 26860416, 'steps': 139897, 'loss/train': 1.450693964958191} 11/07/2021 16:56:57 - INFO - __main__ - Step 139899: {'lr': 5.725801151252874e-06, 'samples': 26860608, 'steps': 139898, 'loss/train': 1.5381981134414673} 11/07/2021 16:56:57 - INFO - __main__ - Step 139900: {'lr': 5.724671955871951e-06, 'samples': 26860800, 'steps': 139899, 'loss/train': 1.5463751554489136} 11/07/2021 16:56:57 - INFO - __main__ - Step 139901: {'lr': 5.723542870557735e-06, 'samples': 26860992, 'steps': 139900, 'loss/train': 1.0729916095733643} 11/07/2021 16:56:58 - INFO - __main__ - Step 139902: {'lr': 5.7224138953107245e-06, 'samples': 26861184, 'steps': 139901, 'loss/train': 1.121440052986145} 11/07/2021 16:56:58 - INFO - __main__ - Step 139903: {'lr': 5.72128503013139e-06, 'samples': 26861376, 'steps': 139902, 'loss/train': 1.6185486316680908} 11/07/2021 16:56:59 - INFO - __main__ - Step 139904: {'lr': 5.720156275020316e-06, 'samples': 26861568, 'steps': 139903, 'loss/train': 0.2951335310935974} 11/07/2021 16:57:00 - INFO - __main__ - Step 139905: {'lr': 5.719027629977946e-06, 'samples': 26861760, 'steps': 139904, 'loss/train': 1.6657332181930542} 11/07/2021 16:57:00 - INFO - __main__ - Step 139906: {'lr': 5.717899095004808e-06, 'samples': 26861952, 'steps': 139905, 'loss/train': 1.2817151546478271} 11/07/2021 16:57:00 - INFO - __main__ - Step 139907: {'lr': 5.7167706701014286e-06, 'samples': 26862144, 'steps': 139906, 'loss/train': 1.2798770666122437} 11/07/2021 16:57:01 - INFO - __main__ - Step 139908: {'lr': 5.715642355268308e-06, 'samples': 26862336, 'steps': 139907, 'loss/train': 1.5296695232391357} 11/07/2021 16:57:02 - INFO - __main__ - Step 139909: {'lr': 5.7145141505059454e-06, 'samples': 26862528, 'steps': 139908, 'loss/train': 1.2204715013504028} 11/07/2021 16:57:02 - INFO - __main__ - Step 139910: {'lr': 5.71338605581484e-06, 'samples': 26862720, 'steps': 139909, 'loss/train': 1.1821695566177368} 11/07/2021 16:57:02 - INFO - __main__ - Step 139911: {'lr': 5.712258071195547e-06, 'samples': 26862912, 'steps': 139910, 'loss/train': 1.3383979797363281} 11/07/2021 16:57:03 - INFO - __main__ - Step 139912: {'lr': 5.711130196648512e-06, 'samples': 26863104, 'steps': 139911, 'loss/train': 1.5159430503845215} 11/07/2021 16:57:03 - INFO - __main__ - Step 139913: {'lr': 5.710002432174261e-06, 'samples': 26863296, 'steps': 139912, 'loss/train': 1.1840955018997192} 11/07/2021 16:57:04 - INFO - __main__ - Step 139914: {'lr': 5.708874777773349e-06, 'samples': 26863488, 'steps': 139913, 'loss/train': 1.7322325706481934} 11/07/2021 16:57:04 - INFO - __main__ - Step 139915: {'lr': 5.70774723344622e-06, 'samples': 26863680, 'steps': 139914, 'loss/train': 1.6083567142486572} 11/07/2021 16:57:05 - INFO - __main__ - Step 139916: {'lr': 5.70661979919343e-06, 'samples': 26863872, 'steps': 139915, 'loss/train': 1.2249480485916138} 11/07/2021 16:57:05 - INFO - __main__ - Step 139917: {'lr': 5.7054924750154505e-06, 'samples': 26864064, 'steps': 139916, 'loss/train': 1.0802339315414429} 11/07/2021 16:57:05 - INFO - __main__ - Step 139918: {'lr': 5.704365260912808e-06, 'samples': 26864256, 'steps': 139917, 'loss/train': 0.47074946761131287} 11/07/2021 16:57:07 - INFO - __main__ - Step 139919: {'lr': 5.703238156886004e-06, 'samples': 26864448, 'steps': 139918, 'loss/train': 1.6071070432662964} 11/07/2021 16:57:07 - INFO - __main__ - Step 139920: {'lr': 5.702111162935564e-06, 'samples': 26864640, 'steps': 139919, 'loss/train': 1.248415231704712} 11/07/2021 16:57:07 - INFO - __main__ - Step 139921: {'lr': 5.700984279061988e-06, 'samples': 26864832, 'steps': 139920, 'loss/train': 1.3491190671920776} 11/07/2021 16:57:08 - INFO - __main__ - Step 139922: {'lr': 5.699857505265749e-06, 'samples': 26865024, 'steps': 139921, 'loss/train': 1.2056344747543335} 11/07/2021 16:57:08 - INFO - __main__ - Step 139923: {'lr': 5.698730841547428e-06, 'samples': 26865216, 'steps': 139922, 'loss/train': 1.1680750846862793} 11/07/2021 16:57:09 - INFO - __main__ - Step 139924: {'lr': 5.697604287907471e-06, 'samples': 26865408, 'steps': 139923, 'loss/train': 1.093336582183838} 11/07/2021 16:57:09 - INFO - __main__ - Step 139925: {'lr': 5.6964778443464035e-06, 'samples': 26865600, 'steps': 139924, 'loss/train': 1.12249755859375} 11/07/2021 16:57:10 - INFO - __main__ - Step 139926: {'lr': 5.695351510864727e-06, 'samples': 26865792, 'steps': 139925, 'loss/train': 1.3610447645187378} 11/07/2021 16:57:10 - INFO - __main__ - Step 139927: {'lr': 5.6942252874629395e-06, 'samples': 26865984, 'steps': 139926, 'loss/train': 0.9387858510017395} 11/07/2021 16:57:10 - INFO - __main__ - Step 139928: {'lr': 5.693099174141597e-06, 'samples': 26866176, 'steps': 139927, 'loss/train': 1.687925100326538} 11/07/2021 16:57:11 - INFO - __main__ - Step 139929: {'lr': 5.691973170901144e-06, 'samples': 26866368, 'steps': 139928, 'loss/train': 0.8849765062332153} 11/07/2021 16:57:12 - INFO - __main__ - Step 139930: {'lr': 5.690847277742134e-06, 'samples': 26866560, 'steps': 139929, 'loss/train': 1.6749238967895508} 11/07/2021 16:57:12 - INFO - __main__ - Step 139931: {'lr': 5.6897214946650676e-06, 'samples': 26866752, 'steps': 139930, 'loss/train': 1.385767936706543} 11/07/2021 16:57:12 - INFO - __main__ - Step 139932: {'lr': 5.688595821670417e-06, 'samples': 26866944, 'steps': 139931, 'loss/train': 1.166579008102417} 11/07/2021 16:57:13 - INFO - __main__ - Step 139933: {'lr': 5.687470258758737e-06, 'samples': 26867136, 'steps': 139932, 'loss/train': 1.275405764579773} 11/07/2021 16:57:13 - INFO - __main__ - Step 139934: {'lr': 5.6863448059305266e-06, 'samples': 26867328, 'steps': 139933, 'loss/train': 0.8607516288757324} 11/07/2021 16:57:14 - INFO - __main__ - Step 139935: {'lr': 5.6852194631862585e-06, 'samples': 26867520, 'steps': 139934, 'loss/train': 1.0476843118667603} 11/07/2021 16:57:15 - INFO - __main__ - Step 139936: {'lr': 5.68409423052646e-06, 'samples': 26867712, 'steps': 139935, 'loss/train': 1.1261557340621948} 11/07/2021 16:57:15 - INFO - __main__ - Step 139937: {'lr': 5.68296910795163e-06, 'samples': 26867904, 'steps': 139936, 'loss/train': 1.3954654932022095} 11/07/2021 16:57:15 - INFO - __main__ - Step 139938: {'lr': 5.681844095462296e-06, 'samples': 26868096, 'steps': 139937, 'loss/train': 1.214661955833435} 11/07/2021 16:57:16 - INFO - __main__ - Step 139939: {'lr': 5.680719193058959e-06, 'samples': 26868288, 'steps': 139938, 'loss/train': 0.5212255716323853} 11/07/2021 16:57:17 - INFO - __main__ - Step 139940: {'lr': 5.679594400742117e-06, 'samples': 26868480, 'steps': 139939, 'loss/train': 1.4607449769973755} 11/07/2021 16:57:17 - INFO - __main__ - Step 139941: {'lr': 5.678469718512269e-06, 'samples': 26868672, 'steps': 139940, 'loss/train': 1.5959701538085938} 11/07/2021 16:57:17 - INFO - __main__ - Step 139942: {'lr': 5.677345146369944e-06, 'samples': 26868864, 'steps': 139941, 'loss/train': 1.2843679189682007} 11/07/2021 16:57:18 - INFO - __main__ - Step 139943: {'lr': 5.676220684315614e-06, 'samples': 26869056, 'steps': 139942, 'loss/train': 1.1281887292861938} 11/07/2021 16:57:18 - INFO - __main__ - Step 139944: {'lr': 5.675096332349833e-06, 'samples': 26869248, 'steps': 139943, 'loss/train': 1.101239562034607} 11/07/2021 16:57:19 - INFO - __main__ - Step 139945: {'lr': 5.6739720904731005e-06, 'samples': 26869440, 'steps': 139944, 'loss/train': 1.5956836938858032} 11/07/2021 16:57:20 - INFO - __main__ - Step 139946: {'lr': 5.67284795868589e-06, 'samples': 26869632, 'steps': 139945, 'loss/train': 1.0440572500228882} 11/07/2021 16:57:20 - INFO - __main__ - Step 139947: {'lr': 5.671723936988698e-06, 'samples': 26869824, 'steps': 139946, 'loss/train': 1.5404599905014038} 11/07/2021 16:57:20 - INFO - __main__ - Step 139948: {'lr': 5.670600025382083e-06, 'samples': 26870016, 'steps': 139947, 'loss/train': 1.0579109191894531} 11/07/2021 16:57:21 - INFO - __main__ - Step 139949: {'lr': 5.669476223866515e-06, 'samples': 26870208, 'steps': 139948, 'loss/train': 1.1374598741531372} 11/07/2021 16:57:22 - INFO - __main__ - Step 139950: {'lr': 5.668352532442494e-06, 'samples': 26870400, 'steps': 139949, 'loss/train': 1.5413345098495483} 11/07/2021 16:57:22 - INFO - __main__ - Step 139951: {'lr': 5.667228951110575e-06, 'samples': 26870592, 'steps': 139950, 'loss/train': 1.0230695009231567} 11/07/2021 16:57:22 - INFO - __main__ - Step 139952: {'lr': 5.666105479871203e-06, 'samples': 26870784, 'steps': 139951, 'loss/train': 0.6699736714363098} 11/07/2021 16:57:23 - INFO - __main__ - Step 139953: {'lr': 5.664982118724932e-06, 'samples': 26870976, 'steps': 139952, 'loss/train': 1.1291968822479248} 11/07/2021 16:57:23 - INFO - __main__ - Step 139954: {'lr': 5.663858867672261e-06, 'samples': 26871168, 'steps': 139953, 'loss/train': 1.109904170036316} 11/07/2021 16:57:24 - INFO - __main__ - Step 139955: {'lr': 5.662735726713664e-06, 'samples': 26871360, 'steps': 139954, 'loss/train': 1.4811118841171265} 11/07/2021 16:57:24 - INFO - __main__ - Step 139956: {'lr': 5.661612695849694e-06, 'samples': 26871552, 'steps': 139955, 'loss/train': 1.4831085205078125} 11/07/2021 16:57:25 - INFO - __main__ - Step 139957: {'lr': 5.660489775080824e-06, 'samples': 26871744, 'steps': 139956, 'loss/train': 1.1196848154067993} 11/07/2021 16:57:25 - INFO - __main__ - Step 139958: {'lr': 5.659366964407553e-06, 'samples': 26871936, 'steps': 139957, 'loss/train': 1.2128835916519165} 11/07/2021 16:57:25 - INFO - __main__ - Step 139959: {'lr': 5.658244263830381e-06, 'samples': 26872128, 'steps': 139958, 'loss/train': 1.2994228601455688} 11/07/2021 16:57:27 - INFO - __main__ - Step 139960: {'lr': 5.657121673349863e-06, 'samples': 26872320, 'steps': 139959, 'loss/train': 1.3421134948730469} 11/07/2021 16:57:27 - INFO - __main__ - Step 139961: {'lr': 5.6559991929664715e-06, 'samples': 26872512, 'steps': 139960, 'loss/train': 1.0668054819107056} 11/07/2021 16:57:27 - INFO - __main__ - Step 139962: {'lr': 5.654876822680704e-06, 'samples': 26872704, 'steps': 139961, 'loss/train': 1.3076270818710327} 11/07/2021 16:57:28 - INFO - __main__ - Step 139963: {'lr': 5.653754562493091e-06, 'samples': 26872896, 'steps': 139962, 'loss/train': 1.658194899559021} 11/07/2021 16:57:28 - INFO - __main__ - Step 139964: {'lr': 5.65263241240413e-06, 'samples': 26873088, 'steps': 139963, 'loss/train': 1.280985951423645} 11/07/2021 16:57:29 - INFO - __main__ - Step 139965: {'lr': 5.65151037241432e-06, 'samples': 26873280, 'steps': 139964, 'loss/train': 0.8520583510398865} 11/07/2021 16:57:29 - INFO - __main__ - Step 139966: {'lr': 5.650388442524162e-06, 'samples': 26873472, 'steps': 139965, 'loss/train': 1.5014739036560059} 11/07/2021 16:57:30 - INFO - __main__ - Step 139967: {'lr': 5.649266622734184e-06, 'samples': 26873664, 'steps': 139966, 'loss/train': 0.882989764213562} 11/07/2021 16:57:30 - INFO - __main__ - Step 139968: {'lr': 5.648144913044856e-06, 'samples': 26873856, 'steps': 139967, 'loss/train': 0.9131788611412048} 11/07/2021 16:57:30 - INFO - __main__ - Step 139969: {'lr': 5.647023313456706e-06, 'samples': 26874048, 'steps': 139968, 'loss/train': 1.4008698463439941} 11/07/2021 16:57:32 - INFO - __main__ - Step 139970: {'lr': 5.645901823970234e-06, 'samples': 26874240, 'steps': 139969, 'loss/train': 1.4964226484298706} 11/07/2021 16:57:32 - INFO - __main__ - Step 139971: {'lr': 5.644780444585968e-06, 'samples': 26874432, 'steps': 139970, 'loss/train': 1.4441839456558228} 11/07/2021 16:57:32 - INFO - __main__ - Step 139972: {'lr': 5.6436591753043776e-06, 'samples': 26874624, 'steps': 139971, 'loss/train': 1.2832962274551392} 11/07/2021 16:57:33 - INFO - __main__ - Step 139973: {'lr': 5.642538016125992e-06, 'samples': 26874816, 'steps': 139972, 'loss/train': 1.369773268699646} 11/07/2021 16:57:33 - INFO - __main__ - Step 139974: {'lr': 5.641416967051283e-06, 'samples': 26875008, 'steps': 139973, 'loss/train': 1.2170170545578003} 11/07/2021 16:57:34 - INFO - __main__ - Step 139975: {'lr': 5.640296028080805e-06, 'samples': 26875200, 'steps': 139974, 'loss/train': 1.3873975276947021} 11/07/2021 16:57:34 - INFO - __main__ - Step 139976: {'lr': 5.63917519921503e-06, 'samples': 26875392, 'steps': 139975, 'loss/train': 1.4791227579116821} 11/07/2021 16:57:35 - INFO - __main__ - Step 139977: {'lr': 5.638054480454485e-06, 'samples': 26875584, 'steps': 139976, 'loss/train': 1.3111639022827148} 11/07/2021 16:57:35 - INFO - __main__ - Step 139978: {'lr': 5.636933871799671e-06, 'samples': 26875776, 'steps': 139977, 'loss/train': 1.630928635597229} 11/07/2021 16:57:35 - INFO - __main__ - Step 139979: {'lr': 5.6358133732510585e-06, 'samples': 26875968, 'steps': 139978, 'loss/train': 1.3182510137557983} 11/07/2021 16:57:36 - INFO - __main__ - Step 139980: {'lr': 5.634692984809175e-06, 'samples': 26876160, 'steps': 139979, 'loss/train': 0.9028451442718506} 11/07/2021 16:57:37 - INFO - __main__ - Step 139981: {'lr': 5.633572706474549e-06, 'samples': 26876352, 'steps': 139980, 'loss/train': 1.2532247304916382} 11/07/2021 16:57:37 - INFO - __main__ - Step 139982: {'lr': 5.632452538247651e-06, 'samples': 26876544, 'steps': 139981, 'loss/train': 1.3389978408813477} 11/07/2021 16:57:37 - INFO - __main__ - Step 139983: {'lr': 5.631332480128981e-06, 'samples': 26876736, 'steps': 139982, 'loss/train': 1.472499966621399} 11/07/2021 16:57:38 - INFO - __main__ - Step 139984: {'lr': 5.6302125321190946e-06, 'samples': 26876928, 'steps': 139983, 'loss/train': 1.0306974649429321} 11/07/2021 16:57:40 - INFO - __main__ - Step 139985: {'lr': 5.629092694218435e-06, 'samples': 26877120, 'steps': 139984, 'loss/train': 1.5446302890777588} 11/07/2021 16:57:40 - INFO - __main__ - Step 139986: {'lr': 5.627972966427558e-06, 'samples': 26877312, 'steps': 139985, 'loss/train': 1.622493028640747} 11/07/2021 16:57:40 - INFO - __main__ - Step 139987: {'lr': 5.626853348746936e-06, 'samples': 26877504, 'steps': 139986, 'loss/train': 1.0557676553726196} 11/07/2021 16:57:41 - INFO - __main__ - Step 139988: {'lr': 5.625733841177094e-06, 'samples': 26877696, 'steps': 139987, 'loss/train': 1.1954866647720337} 11/07/2021 16:57:41 - INFO - __main__ - Step 139989: {'lr': 5.624614443718506e-06, 'samples': 26877888, 'steps': 139988, 'loss/train': 3.758086681365967} 11/07/2021 16:57:41 - INFO - __main__ - Step 139990: {'lr': 5.6234951563716995e-06, 'samples': 26878080, 'steps': 139989, 'loss/train': 3.7396774291992188} 11/07/2021 16:57:42 - INFO - __main__ - Step 139991: {'lr': 5.622375979137201e-06, 'samples': 26878272, 'steps': 139990, 'loss/train': 1.2579914331436157} 11/07/2021 16:57:43 - INFO - __main__ - Step 139992: {'lr': 5.621256912015482e-06, 'samples': 26878464, 'steps': 139991, 'loss/train': 0.727709949016571} 11/07/2021 16:57:43 - INFO - __main__ - Step 139993: {'lr': 5.620137955007043e-06, 'samples': 26878656, 'steps': 139992, 'loss/train': 0.4169858992099762} 11/07/2021 16:57:43 - INFO - __main__ - Step 139994: {'lr': 5.619019108112383e-06, 'samples': 26878848, 'steps': 139993, 'loss/train': 1.5525305271148682} 11/07/2021 16:57:44 - INFO - __main__ - Step 139995: {'lr': 5.61790037133203e-06, 'samples': 26879040, 'steps': 139994, 'loss/train': 1.3481099605560303} 11/07/2021 16:57:44 - INFO - __main__ - Step 139996: {'lr': 5.616781744666511e-06, 'samples': 26879232, 'steps': 139995, 'loss/train': 1.186768651008606} 11/07/2021 16:57:45 - INFO - __main__ - Step 139997: {'lr': 5.615663228116269e-06, 'samples': 26879424, 'steps': 139996, 'loss/train': 1.3759431838989258} 11/07/2021 16:57:45 - INFO - __main__ - Step 139998: {'lr': 5.614544821681833e-06, 'samples': 26879616, 'steps': 139997, 'loss/train': 0.7946931719779968} 11/07/2021 16:57:46 - INFO - __main__ - Step 139999: {'lr': 5.613426525363729e-06, 'samples': 26879808, 'steps': 139998, 'loss/train': 1.57685124874115} 11/07/2021 16:57:46 - INFO - __main__ - Step 140000: {'lr': 5.612308339162431e-06, 'samples': 26880000, 'steps': 139999, 'loss/train': 0.8284565806388855} 11/07/2021 16:57:47 - INFO - __main__ - Step 140001: {'lr': 5.611190263078464e-06, 'samples': 26880192, 'steps': 140000, 'loss/train': 0.8656111359596252} 11/07/2021 16:57:48 - INFO - __main__ - Step 140002: {'lr': 5.610072297112329e-06, 'samples': 26880384, 'steps': 140001, 'loss/train': 1.4651985168457031} 11/07/2021 16:57:48 - INFO - __main__ - Step 140003: {'lr': 5.6089544412644964e-06, 'samples': 26880576, 'steps': 140002, 'loss/train': 1.672175407409668} 11/07/2021 16:57:48 - INFO - __main__ - Step 140004: {'lr': 5.607836695535523e-06, 'samples': 26880768, 'steps': 140003, 'loss/train': 0.41325250267982483} 11/07/2021 16:57:49 - INFO - __main__ - Step 140005: {'lr': 5.606719059925908e-06, 'samples': 26880960, 'steps': 140004, 'loss/train': 1.2506115436553955} 11/07/2021 16:57:49 - INFO - __main__ - Step 140006: {'lr': 5.605601534436094e-06, 'samples': 26881152, 'steps': 140005, 'loss/train': 0.7840285301208496} 11/07/2021 16:57:49 - INFO - __main__ - Step 140007: {'lr': 5.604484119066639e-06, 'samples': 26881344, 'steps': 140006, 'loss/train': 1.4897812604904175} 11/07/2021 16:57:50 - INFO - __main__ - Step 140008: {'lr': 5.603366813818039e-06, 'samples': 26881536, 'steps': 140007, 'loss/train': 1.1954212188720703} 11/07/2021 16:57:51 - INFO - __main__ - Step 140009: {'lr': 5.6022496186907966e-06, 'samples': 26881728, 'steps': 140008, 'loss/train': 1.5977269411087036} 11/07/2021 16:57:51 - INFO - __main__ - Step 140010: {'lr': 5.6011325336853824e-06, 'samples': 26881920, 'steps': 140009, 'loss/train': 1.928367018699646} 11/07/2021 16:57:52 - INFO - __main__ - Step 140011: {'lr': 5.600015558802352e-06, 'samples': 26882112, 'steps': 140010, 'loss/train': 1.0059764385223389} 11/07/2021 16:57:52 - INFO - __main__ - Step 140012: {'lr': 5.598898694042148e-06, 'samples': 26882304, 'steps': 140011, 'loss/train': 1.6012732982635498} 11/07/2021 16:57:53 - INFO - __main__ - Step 140013: {'lr': 5.597781939405355e-06, 'samples': 26882496, 'steps': 140012, 'loss/train': 1.3593007326126099} 11/07/2021 16:57:53 - INFO - __main__ - Step 140014: {'lr': 5.596665294892389e-06, 'samples': 26882688, 'steps': 140013, 'loss/train': 1.0184367895126343} 11/07/2021 16:57:54 - INFO - __main__ - Step 140015: {'lr': 5.595548760503832e-06, 'samples': 26882880, 'steps': 140014, 'loss/train': 1.0392484664916992} 11/07/2021 16:57:54 - INFO - __main__ - Step 140016: {'lr': 5.594432336240129e-06, 'samples': 26883072, 'steps': 140015, 'loss/train': 1.5925344228744507} 11/07/2021 16:57:54 - INFO - __main__ - Step 140017: {'lr': 5.59331602210178e-06, 'samples': 26883264, 'steps': 140016, 'loss/train': 1.456783652305603} 11/07/2021 16:57:55 - INFO - __main__ - Step 140018: {'lr': 5.592199818089339e-06, 'samples': 26883456, 'steps': 140017, 'loss/train': 1.1291842460632324} 11/07/2021 16:57:56 - INFO - __main__ - Step 140019: {'lr': 5.591083724203305e-06, 'samples': 26883648, 'steps': 140018, 'loss/train': 1.6010282039642334} 11/07/2021 16:57:56 - INFO - __main__ - Step 140020: {'lr': 5.589967740444124e-06, 'samples': 26883840, 'steps': 140019, 'loss/train': 1.632413625717163} 11/07/2021 16:57:56 - INFO - __main__ - Step 140021: {'lr': 5.58885186681235e-06, 'samples': 26884032, 'steps': 140020, 'loss/train': 1.5161086320877075} 11/07/2021 16:57:57 - INFO - __main__ - Step 140022: {'lr': 5.587736103308455e-06, 'samples': 26884224, 'steps': 140021, 'loss/train': 1.5268474817276} 11/07/2021 16:57:58 - INFO - __main__ - Step 140023: {'lr': 5.586620449932966e-06, 'samples': 26884416, 'steps': 140022, 'loss/train': 1.9140723943710327} 11/07/2021 16:57:58 - INFO - __main__ - Step 140024: {'lr': 5.585504906686356e-06, 'samples': 26884608, 'steps': 140023, 'loss/train': 1.257991075515747} 11/07/2021 16:57:58 - INFO - __main__ - Step 140025: {'lr': 5.584389473569152e-06, 'samples': 26884800, 'steps': 140024, 'loss/train': 1.159529685974121} 11/07/2021 16:57:59 - INFO - __main__ - Step 140026: {'lr': 5.5832741505818515e-06, 'samples': 26884992, 'steps': 140025, 'loss/train': 1.3766597509384155} 11/07/2021 16:57:59 - INFO - __main__ - Step 140027: {'lr': 5.582158937724957e-06, 'samples': 26885184, 'steps': 140026, 'loss/train': 1.8054592609405518} 11/07/2021 16:58:00 - INFO - __main__ - Step 140028: {'lr': 5.581043834998967e-06, 'samples': 26885376, 'steps': 140027, 'loss/train': 1.3513275384902954} 11/07/2021 16:58:01 - INFO - __main__ - Step 140029: {'lr': 5.57992884240438e-06, 'samples': 26885568, 'steps': 140028, 'loss/train': 1.3664671182632446} 11/07/2021 16:58:01 - INFO - __main__ - Step 140030: {'lr': 5.5788139599417255e-06, 'samples': 26885760, 'steps': 140029, 'loss/train': 1.1117537021636963} 11/07/2021 16:58:01 - INFO - __main__ - Step 140031: {'lr': 5.577699187611474e-06, 'samples': 26885952, 'steps': 140030, 'loss/train': 1.611396074295044} 11/07/2021 16:58:02 - INFO - __main__ - Step 140032: {'lr': 5.57658452541418e-06, 'samples': 26886144, 'steps': 140031, 'loss/train': 1.4654113054275513} 11/07/2021 16:58:03 - INFO - __main__ - Step 140033: {'lr': 5.575469973350261e-06, 'samples': 26886336, 'steps': 140032, 'loss/train': 1.2882527112960815} 11/07/2021 16:58:03 - INFO - __main__ - Step 140034: {'lr': 5.5743555314202724e-06, 'samples': 26886528, 'steps': 140033, 'loss/train': 1.2101342678070068} 11/07/2021 16:58:03 - INFO - __main__ - Step 140035: {'lr': 5.573241199624685e-06, 'samples': 26886720, 'steps': 140034, 'loss/train': 1.222485899925232} 11/07/2021 16:58:04 - INFO - __main__ - Step 140036: {'lr': 5.572126977964053e-06, 'samples': 26886912, 'steps': 140035, 'loss/train': 0.969295859336853} 11/07/2021 16:58:04 - INFO - __main__ - Step 140037: {'lr': 5.5710128664388516e-06, 'samples': 26887104, 'steps': 140036, 'loss/train': 5.692290782928467} 11/07/2021 16:58:05 - INFO - __main__ - Step 140038: {'lr': 5.56989886504955e-06, 'samples': 26887296, 'steps': 140037, 'loss/train': 0.9980440735816956} 11/07/2021 16:58:06 - INFO - __main__ - Step 140039: {'lr': 5.5687849737967035e-06, 'samples': 26887488, 'steps': 140038, 'loss/train': 1.2193223237991333} 11/07/2021 16:58:06 - INFO - __main__ - Step 140040: {'lr': 5.567671192680785e-06, 'samples': 26887680, 'steps': 140039, 'loss/train': 1.467738151550293} 11/07/2021 16:58:06 - INFO - __main__ - Step 140041: {'lr': 5.5665575217022925e-06, 'samples': 26887872, 'steps': 140040, 'loss/train': 1.0419777631759644} 11/07/2021 16:58:07 - INFO - __main__ - Step 140042: {'lr': 5.565443960861755e-06, 'samples': 26888064, 'steps': 140041, 'loss/train': 1.4543812274932861} 11/07/2021 16:58:07 - INFO - __main__ - Step 140043: {'lr': 5.564330510159643e-06, 'samples': 26888256, 'steps': 140042, 'loss/train': 1.2230814695358276} 11/07/2021 16:58:08 - INFO - __main__ - Step 140044: {'lr': 5.563217169596485e-06, 'samples': 26888448, 'steps': 140043, 'loss/train': 1.506882667541504} 11/07/2021 16:58:08 - INFO - __main__ - Step 140045: {'lr': 5.562103939172752e-06, 'samples': 26888640, 'steps': 140044, 'loss/train': 1.8654705286026} 11/07/2021 16:58:09 - INFO - __main__ - Step 140046: {'lr': 5.560990818889e-06, 'samples': 26888832, 'steps': 140045, 'loss/train': 1.2825734615325928} 11/07/2021 16:58:09 - INFO - __main__ - Step 140047: {'lr': 5.559877808745673e-06, 'samples': 26889024, 'steps': 140046, 'loss/train': 0.4332890212535858} 11/07/2021 16:58:09 - INFO - __main__ - Step 140048: {'lr': 5.558764908743269e-06, 'samples': 26889216, 'steps': 140047, 'loss/train': 1.3863661289215088} 11/07/2021 16:58:10 - INFO - __main__ - Step 140049: {'lr': 5.557652118882344e-06, 'samples': 26889408, 'steps': 140048, 'loss/train': 1.291071891784668} 11/07/2021 16:58:11 - INFO - __main__ - Step 140050: {'lr': 5.556539439163344e-06, 'samples': 26889600, 'steps': 140049, 'loss/train': 1.3482377529144287} 11/07/2021 16:58:11 - INFO - __main__ - Step 140051: {'lr': 5.555426869586821e-06, 'samples': 26889792, 'steps': 140050, 'loss/train': 1.3724738359451294} 11/07/2021 16:58:11 - INFO - __main__ - Step 140052: {'lr': 5.554314410153221e-06, 'samples': 26889984, 'steps': 140051, 'loss/train': 0.9723540544509888} 11/07/2021 16:58:12 - INFO - __main__ - Step 140053: {'lr': 5.553202060863099e-06, 'samples': 26890176, 'steps': 140052, 'loss/train': 1.1826112270355225} 11/07/2021 16:58:13 - INFO - __main__ - Step 140054: {'lr': 5.552089821716927e-06, 'samples': 26890368, 'steps': 140053, 'loss/train': 1.1571224927902222} 11/07/2021 16:58:13 - INFO - __main__ - Step 140055: {'lr': 5.550977692715203e-06, 'samples': 26890560, 'steps': 140054, 'loss/train': 1.188915729522705} 11/07/2021 16:58:14 - INFO - __main__ - Step 140056: {'lr': 5.549865673858428e-06, 'samples': 26890752, 'steps': 140055, 'loss/train': 1.1646393537521362} 11/07/2021 16:58:14 - INFO - __main__ - Step 140057: {'lr': 5.54875376514713e-06, 'samples': 26890944, 'steps': 140056, 'loss/train': 1.2341680526733398} 11/07/2021 16:58:14 - INFO - __main__ - Step 140058: {'lr': 5.547641966581779e-06, 'samples': 26891136, 'steps': 140057, 'loss/train': 1.8242086172103882} 11/07/2021 16:58:15 - INFO - __main__ - Step 140059: {'lr': 5.546530278162931e-06, 'samples': 26891328, 'steps': 140058, 'loss/train': 1.4515414237976074} 11/07/2021 16:58:16 - INFO - __main__ - Step 140060: {'lr': 5.545418699891003e-06, 'samples': 26891520, 'steps': 140059, 'loss/train': 1.0480550527572632} 11/07/2021 16:58:16 - INFO - __main__ - Step 140061: {'lr': 5.544307231766549e-06, 'samples': 26891712, 'steps': 140060, 'loss/train': 0.9828872680664062} 11/07/2021 16:58:16 - INFO - __main__ - Step 140062: {'lr': 5.5431958737900415e-06, 'samples': 26891904, 'steps': 140061, 'loss/train': 1.4359396696090698} 11/07/2021 16:58:17 - INFO - __main__ - Step 140063: {'lr': 5.542084625962007e-06, 'samples': 26892096, 'steps': 140062, 'loss/train': 1.256051778793335} 11/07/2021 16:58:17 - INFO - __main__ - Step 140064: {'lr': 5.540973488282947e-06, 'samples': 26892288, 'steps': 140063, 'loss/train': 1.1365591287612915} 11/07/2021 16:58:18 - INFO - __main__ - Step 140065: {'lr': 5.539862460753331e-06, 'samples': 26892480, 'steps': 140064, 'loss/train': 1.2657852172851562} 11/07/2021 16:58:19 - INFO - __main__ - Step 140066: {'lr': 5.5387515433737155e-06, 'samples': 26892672, 'steps': 140065, 'loss/train': 1.6450414657592773} 11/07/2021 16:58:19 - INFO - __main__ - Step 140067: {'lr': 5.537640736144545e-06, 'samples': 26892864, 'steps': 140066, 'loss/train': 1.699015736579895} 11/07/2021 16:58:19 - INFO - __main__ - Step 140068: {'lr': 5.536530039066317e-06, 'samples': 26893056, 'steps': 140067, 'loss/train': 1.10368013381958} 11/07/2021 16:58:20 - INFO - __main__ - Step 140069: {'lr': 5.5354194521395896e-06, 'samples': 26893248, 'steps': 140068, 'loss/train': 0.7015876770019531} 11/07/2021 16:58:21 - INFO - __main__ - Step 140070: {'lr': 5.534308975364832e-06, 'samples': 26893440, 'steps': 140069, 'loss/train': 1.4427965879440308} 11/07/2021 16:58:21 - INFO - __main__ - Step 140071: {'lr': 5.533198608742518e-06, 'samples': 26893632, 'steps': 140070, 'loss/train': 1.4790796041488647} 11/07/2021 16:58:21 - INFO - __main__ - Step 140072: {'lr': 5.532088352273173e-06, 'samples': 26893824, 'steps': 140071, 'loss/train': 0.6849888563156128} 11/07/2021 16:58:22 - INFO - __main__ - Step 140073: {'lr': 5.5309782059573544e-06, 'samples': 26894016, 'steps': 140072, 'loss/train': 1.299936056137085} 11/07/2021 16:58:22 - INFO - __main__ - Step 140074: {'lr': 5.529868169795449e-06, 'samples': 26894208, 'steps': 140073, 'loss/train': 1.4146517515182495} 11/07/2021 16:58:23 - INFO - __main__ - Step 140075: {'lr': 5.528758243788012e-06, 'samples': 26894400, 'steps': 140074, 'loss/train': 1.6313815116882324} 11/07/2021 16:58:24 - INFO - __main__ - Step 140076: {'lr': 5.527648427935572e-06, 'samples': 26894592, 'steps': 140075, 'loss/train': 1.802841305732727} 11/07/2021 16:58:24 - INFO - __main__ - Step 140077: {'lr': 5.526538722238572e-06, 'samples': 26894784, 'steps': 140076, 'loss/train': 1.2396804094314575} 11/07/2021 16:58:24 - INFO - __main__ - Step 140078: {'lr': 5.52542912669754e-06, 'samples': 26894976, 'steps': 140077, 'loss/train': 1.1446278095245361} 11/07/2021 16:58:25 - INFO - __main__ - Step 140079: {'lr': 5.524319641313003e-06, 'samples': 26895168, 'steps': 140078, 'loss/train': 1.172031044960022} 11/07/2021 16:58:26 - INFO - __main__ - Step 140080: {'lr': 5.523210266085404e-06, 'samples': 26895360, 'steps': 140079, 'loss/train': 1.0794459581375122} 11/07/2021 16:58:26 - INFO - __main__ - Step 140081: {'lr': 5.5221010010153006e-06, 'samples': 26895552, 'steps': 140080, 'loss/train': 0.8537068367004395} 11/07/2021 16:58:26 - INFO - __main__ - Step 140082: {'lr': 5.520991846103163e-06, 'samples': 26895744, 'steps': 140081, 'loss/train': 1.505678653717041} 11/07/2021 16:58:27 - INFO - __main__ - Step 140083: {'lr': 5.519882801349491e-06, 'samples': 26895936, 'steps': 140082, 'loss/train': 1.0852354764938354} 11/07/2021 16:58:27 - INFO - __main__ - Step 140084: {'lr': 5.518773866754784e-06, 'samples': 26896128, 'steps': 140083, 'loss/train': 1.9322186708450317} 11/07/2021 16:58:28 - INFO - __main__ - Step 140085: {'lr': 5.517665042319542e-06, 'samples': 26896320, 'steps': 140084, 'loss/train': 1.2885386943817139} 11/07/2021 16:58:28 - INFO - __main__ - Step 140086: {'lr': 5.516556328044292e-06, 'samples': 26896512, 'steps': 140085, 'loss/train': 1.6725200414657593} 11/07/2021 16:58:29 - INFO - __main__ - Step 140087: {'lr': 5.515447723929479e-06, 'samples': 26896704, 'steps': 140086, 'loss/train': 0.6608256101608276} 11/07/2021 16:58:29 - INFO - __main__ - Step 140088: {'lr': 5.514339229975656e-06, 'samples': 26896896, 'steps': 140087, 'loss/train': 1.5422371625900269} 11/07/2021 16:58:30 - INFO - __main__ - Step 140089: {'lr': 5.51323084618327e-06, 'samples': 26897088, 'steps': 140088, 'loss/train': 0.8608591556549072} 11/07/2021 16:58:31 - INFO - __main__ - Step 140090: {'lr': 5.512122572552875e-06, 'samples': 26897280, 'steps': 140089, 'loss/train': 0.991786539554596} 11/07/2021 16:58:31 - INFO - __main__ - Step 140091: {'lr': 5.511014409084942e-06, 'samples': 26897472, 'steps': 140090, 'loss/train': 0.7128795385360718} 11/07/2021 16:58:31 - INFO - __main__ - Step 140092: {'lr': 5.509906355779942e-06, 'samples': 26897664, 'steps': 140091, 'loss/train': 0.7804053425788879} 11/07/2021 16:58:32 - INFO - __main__ - Step 140093: {'lr': 5.508798412638433e-06, 'samples': 26897856, 'steps': 140092, 'loss/train': 0.9306735396385193} 11/07/2021 16:58:32 - INFO - __main__ - Step 140094: {'lr': 5.507690579660884e-06, 'samples': 26898048, 'steps': 140093, 'loss/train': 1.3297209739685059} 11/07/2021 16:58:33 - INFO - __main__ - Step 140095: {'lr': 5.506582856847797e-06, 'samples': 26898240, 'steps': 140094, 'loss/train': 1.4256895780563354} 11/07/2021 16:58:34 - INFO - __main__ - Step 140096: {'lr': 5.505475244199671e-06, 'samples': 26898432, 'steps': 140095, 'loss/train': 1.2851169109344482} 11/07/2021 16:58:34 - INFO - __main__ - Step 140097: {'lr': 5.504367741717003e-06, 'samples': 26898624, 'steps': 140096, 'loss/train': 1.8711206912994385} 11/07/2021 16:58:34 - INFO - __main__ - Step 140098: {'lr': 5.503260349400296e-06, 'samples': 26898816, 'steps': 140097, 'loss/train': 1.3692270517349243} 11/07/2021 16:58:35 - INFO - __main__ - Step 140099: {'lr': 5.502153067250076e-06, 'samples': 26899008, 'steps': 140098, 'loss/train': 1.0795921087265015} 11/07/2021 16:58:36 - INFO - __main__ - Step 140100: {'lr': 5.50104589526676e-06, 'samples': 26899200, 'steps': 140099, 'loss/train': 1.1396207809448242} 11/07/2021 16:58:36 - INFO - __main__ - Step 140101: {'lr': 5.499938833450929e-06, 'samples': 26899392, 'steps': 140100, 'loss/train': 0.8847519159317017} 11/07/2021 16:58:36 - INFO - __main__ - Step 140102: {'lr': 5.498831881803057e-06, 'samples': 26899584, 'steps': 140101, 'loss/train': 1.0138713121414185} 11/07/2021 16:58:37 - INFO - __main__ - Step 140103: {'lr': 5.497725040323614e-06, 'samples': 26899776, 'steps': 140102, 'loss/train': 0.996558666229248} 11/07/2021 16:58:37 - INFO - __main__ - Step 140104: {'lr': 5.496618309013129e-06, 'samples': 26899968, 'steps': 140103, 'loss/train': 1.459717869758606} 11/07/2021 16:58:38 - INFO - __main__ - Step 140105: {'lr': 5.495511687872102e-06, 'samples': 26900160, 'steps': 140104, 'loss/train': 1.5512582063674927} 11/07/2021 16:58:38 - INFO - __main__ - Step 140106: {'lr': 5.494405176901029e-06, 'samples': 26900352, 'steps': 140105, 'loss/train': 1.293642520904541} 11/07/2021 16:58:39 - INFO - __main__ - Step 140107: {'lr': 5.493298776100413e-06, 'samples': 26900544, 'steps': 140106, 'loss/train': 1.0986032485961914} 11/07/2021 16:58:39 - INFO - __main__ - Step 140108: {'lr': 5.492192485470726e-06, 'samples': 26900736, 'steps': 140107, 'loss/train': 1.3695415258407593} 11/07/2021 16:58:39 - INFO - __main__ - Step 140109: {'lr': 5.491086305012493e-06, 'samples': 26900928, 'steps': 140108, 'loss/train': 1.5791300535202026} 11/07/2021 16:58:40 - INFO - __main__ - Step 140110: {'lr': 5.4899802347261885e-06, 'samples': 26901120, 'steps': 140109, 'loss/train': 1.569046974182129} 11/07/2021 16:58:41 - INFO - __main__ - Step 140111: {'lr': 5.488874274612338e-06, 'samples': 26901312, 'steps': 140110, 'loss/train': 0.9618206024169922} 11/07/2021 16:58:41 - INFO - __main__ - Step 140112: {'lr': 5.487768424671441e-06, 'samples': 26901504, 'steps': 140111, 'loss/train': 0.9820562601089478} 11/07/2021 16:58:42 - INFO - __main__ - Step 140113: {'lr': 5.486662684903971e-06, 'samples': 26901696, 'steps': 140112, 'loss/train': 1.1913092136383057} 11/07/2021 16:58:42 - INFO - __main__ - Step 140114: {'lr': 5.485557055310453e-06, 'samples': 26901888, 'steps': 140113, 'loss/train': 1.2267287969589233} 11/07/2021 16:58:42 - INFO - __main__ - Step 140115: {'lr': 5.484451535891333e-06, 'samples': 26902080, 'steps': 140114, 'loss/train': 1.1237925291061401} 11/07/2021 16:58:43 - INFO - __main__ - Step 140116: {'lr': 5.483346126647165e-06, 'samples': 26902272, 'steps': 140115, 'loss/train': 1.0537735223770142} 11/07/2021 16:58:44 - INFO - __main__ - Step 140117: {'lr': 5.482240827578422e-06, 'samples': 26902464, 'steps': 140116, 'loss/train': 1.4179356098175049} 11/07/2021 16:58:44 - INFO - __main__ - Step 140118: {'lr': 5.481135638685631e-06, 'samples': 26902656, 'steps': 140117, 'loss/train': 1.4985771179199219} 11/07/2021 16:58:44 - INFO - __main__ - Step 140119: {'lr': 5.480030559969235e-06, 'samples': 26902848, 'steps': 140118, 'loss/train': 1.7852221727371216} 11/07/2021 16:58:45 - INFO - __main__ - Step 140120: {'lr': 5.47892559142979e-06, 'samples': 26903040, 'steps': 140119, 'loss/train': 1.2207975387573242} 11/07/2021 16:58:46 - INFO - __main__ - Step 140121: {'lr': 5.477820733067768e-06, 'samples': 26903232, 'steps': 140120, 'loss/train': 1.5690302848815918} 11/07/2021 16:58:46 - INFO - __main__ - Step 140122: {'lr': 5.47671598488364e-06, 'samples': 26903424, 'steps': 140121, 'loss/train': 1.7116529941558838} 11/07/2021 16:58:46 - INFO - __main__ - Step 140123: {'lr': 5.475611346877962e-06, 'samples': 26903616, 'steps': 140122, 'loss/train': 1.0795727968215942} 11/07/2021 16:58:47 - INFO - __main__ - Step 140124: {'lr': 5.474506819051178e-06, 'samples': 26903808, 'steps': 140123, 'loss/train': 0.9993423819541931} 11/07/2021 16:58:47 - INFO - __main__ - Step 140125: {'lr': 5.473402401403815e-06, 'samples': 26904000, 'steps': 140124, 'loss/train': 1.3132870197296143} 11/07/2021 16:58:48 - INFO - __main__ - Step 140126: {'lr': 5.472298093936373e-06, 'samples': 26904192, 'steps': 140125, 'loss/train': 1.3431371450424194} 11/07/2021 16:58:48 - INFO - __main__ - Step 140127: {'lr': 5.471193896649324e-06, 'samples': 26904384, 'steps': 140126, 'loss/train': 1.549763798713684} 11/07/2021 16:58:49 - INFO - __main__ - Step 140128: {'lr': 5.470089809543194e-06, 'samples': 26904576, 'steps': 140127, 'loss/train': 1.3675744533538818} 11/07/2021 16:58:49 - INFO - __main__ - Step 140129: {'lr': 5.468985832618456e-06, 'samples': 26904768, 'steps': 140128, 'loss/train': 1.1601402759552002} 11/07/2021 16:58:50 - INFO - __main__ - Step 140130: {'lr': 5.4678819658756376e-06, 'samples': 26904960, 'steps': 140129, 'loss/train': 1.3046709299087524} 11/07/2021 16:58:51 - INFO - __main__ - Step 140131: {'lr': 5.46677820931521e-06, 'samples': 26905152, 'steps': 140130, 'loss/train': 0.9686427712440491} 11/07/2021 16:58:51 - INFO - __main__ - Step 140132: {'lr': 5.465674562937672e-06, 'samples': 26905344, 'steps': 140131, 'loss/train': 1.6775298118591309} 11/07/2021 16:58:51 - INFO - __main__ - Step 140133: {'lr': 5.464571026743525e-06, 'samples': 26905536, 'steps': 140132, 'loss/train': 1.2776967287063599} 11/07/2021 16:58:52 - INFO - __main__ - Step 140134: {'lr': 5.4634676007332685e-06, 'samples': 26905728, 'steps': 140133, 'loss/train': 1.1840542554855347} 11/07/2021 16:58:52 - INFO - __main__ - Step 140135: {'lr': 5.462364284907428e-06, 'samples': 26905920, 'steps': 140134, 'loss/train': 1.6512373685836792} 11/07/2021 16:58:52 - INFO - __main__ - Step 140136: {'lr': 5.46126107926645e-06, 'samples': 26906112, 'steps': 140135, 'loss/train': 1.029977560043335} 11/07/2021 16:58:53 - INFO - __main__ - Step 140137: {'lr': 5.460157983810832e-06, 'samples': 26906304, 'steps': 140136, 'loss/train': 0.9762575626373291} 11/07/2021 16:58:54 - INFO - __main__ - Step 140138: {'lr': 5.45905499854113e-06, 'samples': 26906496, 'steps': 140137, 'loss/train': 1.6613752841949463} 11/07/2021 16:58:54 - INFO - __main__ - Step 140139: {'lr': 5.457952123457788e-06, 'samples': 26906688, 'steps': 140138, 'loss/train': 1.0453351736068726} 11/07/2021 16:58:54 - INFO - __main__ - Step 140140: {'lr': 5.4568493585613335e-06, 'samples': 26906880, 'steps': 140139, 'loss/train': 1.4399181604385376} 11/07/2021 16:58:55 - INFO - __main__ - Step 140141: {'lr': 5.455746703852238e-06, 'samples': 26907072, 'steps': 140140, 'loss/train': 1.1742628812789917} 11/07/2021 16:58:56 - INFO - __main__ - Step 140142: {'lr': 5.454644159331029e-06, 'samples': 26907264, 'steps': 140141, 'loss/train': 2.016199827194214} 11/07/2021 16:58:56 - INFO - __main__ - Step 140143: {'lr': 5.453541724998151e-06, 'samples': 26907456, 'steps': 140142, 'loss/train': 0.6671898365020752} 11/07/2021 16:58:56 - INFO - __main__ - Step 140144: {'lr': 5.452439400854159e-06, 'samples': 26907648, 'steps': 140143, 'loss/train': 1.1786158084869385} 11/07/2021 16:58:57 - INFO - __main__ - Step 140145: {'lr': 5.451337186899496e-06, 'samples': 26907840, 'steps': 140144, 'loss/train': 1.2449356317520142} 11/07/2021 16:58:57 - INFO - __main__ - Step 140146: {'lr': 5.450235083134719e-06, 'samples': 26908032, 'steps': 140145, 'loss/train': 1.338836669921875} 11/07/2021 16:58:58 - INFO - __main__ - Step 140147: {'lr': 5.449133089560271e-06, 'samples': 26908224, 'steps': 140146, 'loss/train': 1.7658687829971313} 11/07/2021 16:58:59 - INFO - __main__ - Step 140148: {'lr': 5.448031206176679e-06, 'samples': 26908416, 'steps': 140147, 'loss/train': 1.017313003540039} 11/07/2021 16:58:59 - INFO - __main__ - Step 140149: {'lr': 5.446929432984415e-06, 'samples': 26908608, 'steps': 140148, 'loss/train': 0.9780689477920532} 11/07/2021 16:58:59 - INFO - __main__ - Step 140150: {'lr': 5.445827769984007e-06, 'samples': 26908800, 'steps': 140149, 'loss/train': 1.2330704927444458} 11/07/2021 16:59:00 - INFO - __main__ - Step 140151: {'lr': 5.444726217175927e-06, 'samples': 26908992, 'steps': 140150, 'loss/train': 1.2022342681884766} 11/07/2021 16:59:01 - INFO - __main__ - Step 140152: {'lr': 5.443624774560674e-06, 'samples': 26909184, 'steps': 140151, 'loss/train': 1.2955652475357056} 11/07/2021 16:59:01 - INFO - __main__ - Step 140153: {'lr': 5.4425234421388025e-06, 'samples': 26909376, 'steps': 140152, 'loss/train': 1.3098654747009277} 11/07/2021 16:59:01 - INFO - __main__ - Step 140154: {'lr': 5.441422219910702e-06, 'samples': 26909568, 'steps': 140153, 'loss/train': 0.7601386904716492} 11/07/2021 16:59:02 - INFO - __main__ - Step 140155: {'lr': 5.440321107876928e-06, 'samples': 26909760, 'steps': 140154, 'loss/train': 1.6072440147399902} 11/07/2021 16:59:02 - INFO - __main__ - Step 140156: {'lr': 5.439220106037979e-06, 'samples': 26909952, 'steps': 140155, 'loss/train': 1.3219900131225586} 11/07/2021 16:59:03 - INFO - __main__ - Step 140157: {'lr': 5.438119214394355e-06, 'samples': 26910144, 'steps': 140156, 'loss/train': 0.4486042261123657} 11/07/2021 16:59:04 - INFO - __main__ - Step 140158: {'lr': 5.437018432946528e-06, 'samples': 26910336, 'steps': 140157, 'loss/train': 0.3272995352745056} 11/07/2021 16:59:04 - INFO - __main__ - Step 140159: {'lr': 5.435917761694998e-06, 'samples': 26910528, 'steps': 140158, 'loss/train': 1.310979962348938} 11/07/2021 16:59:04 - INFO - __main__ - Step 140160: {'lr': 5.434817200640291e-06, 'samples': 26910720, 'steps': 140159, 'loss/train': 1.6604015827178955} 11/07/2021 16:59:05 - INFO - __main__ - Step 140161: {'lr': 5.433716749782852e-06, 'samples': 26910912, 'steps': 140160, 'loss/train': 1.4579203128814697} 11/07/2021 16:59:06 - INFO - __main__ - Step 140162: {'lr': 5.4326164091232365e-06, 'samples': 26911104, 'steps': 140161, 'loss/train': 1.556479811668396} 11/07/2021 16:59:06 - INFO - __main__ - Step 140163: {'lr': 5.431516178661888e-06, 'samples': 26911296, 'steps': 140162, 'loss/train': 1.3401966094970703} 11/07/2021 16:59:06 - INFO - __main__ - Step 140164: {'lr': 5.430416058399335e-06, 'samples': 26911488, 'steps': 140163, 'loss/train': 1.2335984706878662} 11/07/2021 16:59:07 - INFO - __main__ - Step 140165: {'lr': 5.429316048336047e-06, 'samples': 26911680, 'steps': 140164, 'loss/train': 1.3189486265182495} 11/07/2021 16:59:07 - INFO - __main__ - Step 140166: {'lr': 5.4282161484725534e-06, 'samples': 26911872, 'steps': 140165, 'loss/train': 1.1037534475326538} 11/07/2021 16:59:08 - INFO - __main__ - Step 140167: {'lr': 5.427116358809353e-06, 'samples': 26912064, 'steps': 140166, 'loss/train': 1.4106004238128662} 11/07/2021 16:59:08 - INFO - __main__ - Step 140168: {'lr': 5.42601667934689e-06, 'samples': 26912256, 'steps': 140167, 'loss/train': 1.3004989624023438} 11/07/2021 16:59:09 - INFO - __main__ - Step 140169: {'lr': 5.424917110085692e-06, 'samples': 26912448, 'steps': 140168, 'loss/train': 0.9221832752227783} 11/07/2021 16:59:09 - INFO - __main__ - Step 140170: {'lr': 5.423817651026258e-06, 'samples': 26912640, 'steps': 140169, 'loss/train': 1.0823655128479004} 11/07/2021 16:59:09 - INFO - __main__ - Step 140171: {'lr': 5.4227183021690605e-06, 'samples': 26912832, 'steps': 140170, 'loss/train': 1.3396620750427246} 11/07/2021 16:59:10 - INFO - __main__ - Step 140172: {'lr': 5.421619063514627e-06, 'samples': 26913024, 'steps': 140171, 'loss/train': 1.355090618133545} 11/07/2021 16:59:11 - INFO - __main__ - Step 140173: {'lr': 5.420519935063456e-06, 'samples': 26913216, 'steps': 140172, 'loss/train': 1.4928147792816162} 11/07/2021 16:59:12 - INFO - __main__ - Step 140174: {'lr': 5.419420916815993e-06, 'samples': 26913408, 'steps': 140173, 'loss/train': 1.5366781949996948} 11/07/2021 16:59:12 - INFO - __main__ - Step 140175: {'lr': 5.418322008772791e-06, 'samples': 26913600, 'steps': 140174, 'loss/train': 1.395032525062561} 11/07/2021 16:59:12 - INFO - __main__ - Step 140176: {'lr': 5.4172232109342965e-06, 'samples': 26913792, 'steps': 140175, 'loss/train': 0.9558656215667725} 11/07/2021 16:59:13 - INFO - __main__ - Step 140177: {'lr': 5.416124523301036e-06, 'samples': 26913984, 'steps': 140176, 'loss/train': 1.288888931274414} 11/07/2021 16:59:14 - INFO - __main__ - Step 140178: {'lr': 5.415025945873481e-06, 'samples': 26914176, 'steps': 140177, 'loss/train': 0.10948576778173447} 11/07/2021 16:59:14 - INFO - __main__ - Step 140179: {'lr': 5.4139274786521585e-06, 'samples': 26914368, 'steps': 140178, 'loss/train': 1.1287137269973755} 11/07/2021 16:59:15 - INFO - __main__ - Step 140180: {'lr': 5.4128291216375695e-06, 'samples': 26914560, 'steps': 140179, 'loss/train': 1.4705592393875122} 11/07/2021 16:59:15 - INFO - __main__ - Step 140181: {'lr': 5.411730874830156e-06, 'samples': 26914752, 'steps': 140180, 'loss/train': 0.5001085996627808} 11/07/2021 16:59:15 - INFO - __main__ - Step 140182: {'lr': 5.4106327382304475e-06, 'samples': 26914944, 'steps': 140181, 'loss/train': 1.1012098789215088} 11/07/2021 16:59:16 - INFO - __main__ - Step 140183: {'lr': 5.409534711838943e-06, 'samples': 26915136, 'steps': 140182, 'loss/train': 1.439710021018982} 11/07/2021 16:59:17 - INFO - __main__ - Step 140184: {'lr': 5.408436795656113e-06, 'samples': 26915328, 'steps': 140183, 'loss/train': 1.173565149307251} 11/07/2021 16:59:17 - INFO - __main__ - Step 140185: {'lr': 5.4073389896824584e-06, 'samples': 26915520, 'steps': 140184, 'loss/train': 1.279097557067871} 11/07/2021 16:59:17 - INFO - __main__ - Step 140186: {'lr': 5.406241293918507e-06, 'samples': 26915712, 'steps': 140185, 'loss/train': 1.3795831203460693} 11/07/2021 16:59:18 - INFO - __main__ - Step 140187: {'lr': 5.405143708364702e-06, 'samples': 26915904, 'steps': 140186, 'loss/train': 1.1372592449188232} 11/07/2021 16:59:18 - INFO - __main__ - Step 140188: {'lr': 5.404046233021598e-06, 'samples': 26916096, 'steps': 140187, 'loss/train': 1.5011725425720215} 11/07/2021 16:59:18 - INFO - __main__ - Step 140189: {'lr': 5.402948867889612e-06, 'samples': 26916288, 'steps': 140188, 'loss/train': 1.4946413040161133} 11/07/2021 16:59:20 - INFO - __main__ - Step 140190: {'lr': 5.401851612969328e-06, 'samples': 26916480, 'steps': 140189, 'loss/train': 1.5128521919250488} 11/07/2021 16:59:20 - INFO - __main__ - Step 140191: {'lr': 5.40075446826116e-06, 'samples': 26916672, 'steps': 140190, 'loss/train': 1.3707612752914429} 11/07/2021 16:59:21 - INFO - __main__ - Step 140192: {'lr': 5.399657433765693e-06, 'samples': 26916864, 'steps': 140191, 'loss/train': 1.1328282356262207} 11/07/2021 16:59:21 - INFO - __main__ - Step 140193: {'lr': 5.398560509483314e-06, 'samples': 26917056, 'steps': 140192, 'loss/train': 1.08378005027771} 11/07/2021 16:59:21 - INFO - __main__ - Step 140194: {'lr': 5.397463695414578e-06, 'samples': 26917248, 'steps': 140193, 'loss/train': 1.343386173248291} 11/07/2021 16:59:22 - INFO - __main__ - Step 140195: {'lr': 5.396366991559987e-06, 'samples': 26917440, 'steps': 140194, 'loss/train': 1.7245769500732422} 11/07/2021 16:59:23 - INFO - __main__ - Step 140196: {'lr': 5.39527039792001e-06, 'samples': 26917632, 'steps': 140195, 'loss/train': 2.418994665145874} 11/07/2021 16:59:23 - INFO - __main__ - Step 140197: {'lr': 5.39417391449512e-06, 'samples': 26917824, 'steps': 140196, 'loss/train': 1.464741587638855} 11/07/2021 16:59:23 - INFO - __main__ - Step 140198: {'lr': 5.3930775412858734e-06, 'samples': 26918016, 'steps': 140197, 'loss/train': 1.1644359827041626} 11/07/2021 16:59:24 - INFO - __main__ - Step 140199: {'lr': 5.39198127829274e-06, 'samples': 26918208, 'steps': 140198, 'loss/train': 1.3046274185180664} 11/07/2021 16:59:24 - INFO - __main__ - Step 140200: {'lr': 5.3908851255161654e-06, 'samples': 26918400, 'steps': 140199, 'loss/train': 1.0880422592163086} 11/07/2021 16:59:26 - INFO - __main__ - Step 140201: {'lr': 5.389789082956731e-06, 'samples': 26918592, 'steps': 140200, 'loss/train': 1.1336301565170288} 11/07/2021 16:59:26 - INFO - __main__ - Step 140202: {'lr': 5.388693150614854e-06, 'samples': 26918784, 'steps': 140201, 'loss/train': 0.9548583030700684} 11/07/2021 16:59:26 - INFO - __main__ - Step 140203: {'lr': 5.38759732849109e-06, 'samples': 26918976, 'steps': 140202, 'loss/train': 1.3203513622283936} 11/07/2021 16:59:27 - INFO - __main__ - Step 140204: {'lr': 5.386501616585854e-06, 'samples': 26919168, 'steps': 140203, 'loss/train': 1.6969900131225586} 11/07/2021 16:59:27 - INFO - __main__ - Step 140205: {'lr': 5.385406014899702e-06, 'samples': 26919360, 'steps': 140204, 'loss/train': 1.4503146409988403} 11/07/2021 16:59:27 - INFO - __main__ - Step 140206: {'lr': 5.384310523433133e-06, 'samples': 26919552, 'steps': 140205, 'loss/train': 1.3543425798416138} 11/07/2021 16:59:29 - INFO - __main__ - Step 140207: {'lr': 5.3832151421865925e-06, 'samples': 26919744, 'steps': 140206, 'loss/train': 1.3082513809204102} 11/07/2021 16:59:29 - INFO - __main__ - Step 140208: {'lr': 5.382119871160607e-06, 'samples': 26919936, 'steps': 140207, 'loss/train': 1.1332437992095947} 11/07/2021 16:59:29 - INFO - __main__ - Step 140209: {'lr': 5.381024710355675e-06, 'samples': 26920128, 'steps': 140208, 'loss/train': 1.6273454427719116} 11/07/2021 16:59:30 - INFO - __main__ - Step 140210: {'lr': 5.3799296597722705e-06, 'samples': 26920320, 'steps': 140209, 'loss/train': 0.8647668361663818} 11/07/2021 16:59:30 - INFO - __main__ - Step 140211: {'lr': 5.378834719410891e-06, 'samples': 26920512, 'steps': 140210, 'loss/train': 1.555039882659912} 11/07/2021 16:59:30 - INFO - __main__ - Step 140212: {'lr': 5.37773988927201e-06, 'samples': 26920704, 'steps': 140211, 'loss/train': 1.3602455854415894} 11/07/2021 16:59:31 - INFO - __main__ - Step 140213: {'lr': 5.376645169356181e-06, 'samples': 26920896, 'steps': 140212, 'loss/train': 1.3144617080688477} 11/07/2021 16:59:32 - INFO - __main__ - Step 140214: {'lr': 5.375550559663878e-06, 'samples': 26921088, 'steps': 140213, 'loss/train': 3.0295913219451904} 11/07/2021 16:59:32 - INFO - __main__ - Step 140215: {'lr': 5.374456060195543e-06, 'samples': 26921280, 'steps': 140214, 'loss/train': 1.4333829879760742} 11/07/2021 16:59:32 - INFO - __main__ - Step 140216: {'lr': 5.373361670951704e-06, 'samples': 26921472, 'steps': 140215, 'loss/train': 1.0512995719909668} 11/07/2021 16:59:33 - INFO - __main__ - Step 140217: {'lr': 5.372267391932861e-06, 'samples': 26921664, 'steps': 140216, 'loss/train': 1.3185158967971802} 11/07/2021 16:59:34 - INFO - __main__ - Step 140218: {'lr': 5.371173223139514e-06, 'samples': 26921856, 'steps': 140217, 'loss/train': 1.366152048110962} 11/07/2021 16:59:34 - INFO - __main__ - Step 140219: {'lr': 5.370079164572106e-06, 'samples': 26922048, 'steps': 140218, 'loss/train': 0.7852049469947815} 11/07/2021 16:59:34 - INFO - __main__ - Step 140220: {'lr': 5.368985216231193e-06, 'samples': 26922240, 'steps': 140219, 'loss/train': 1.4387626647949219} 11/07/2021 16:59:35 - INFO - __main__ - Step 140221: {'lr': 5.367891378117218e-06, 'samples': 26922432, 'steps': 140220, 'loss/train': 0.8466717600822449} 11/07/2021 16:59:35 - INFO - __main__ - Step 140222: {'lr': 5.3667976502307095e-06, 'samples': 26922624, 'steps': 140221, 'loss/train': 1.2818833589553833} 11/07/2021 16:59:36 - INFO - __main__ - Step 140223: {'lr': 5.365704032572166e-06, 'samples': 26922816, 'steps': 140222, 'loss/train': 0.9737740755081177} 11/07/2021 16:59:37 - INFO - __main__ - Step 140224: {'lr': 5.364610525142033e-06, 'samples': 26923008, 'steps': 140223, 'loss/train': 1.523059368133545} 11/07/2021 16:59:37 - INFO - __main__ - Step 140225: {'lr': 5.363517127940864e-06, 'samples': 26923200, 'steps': 140224, 'loss/train': 1.3444550037384033} 11/07/2021 16:59:37 - INFO - __main__ - Step 140226: {'lr': 5.362423840969105e-06, 'samples': 26923392, 'steps': 140225, 'loss/train': 1.5647023916244507} 11/07/2021 16:59:38 - INFO - __main__ - Step 140227: {'lr': 5.361330664227254e-06, 'samples': 26923584, 'steps': 140226, 'loss/train': 0.4394557476043701} 11/07/2021 16:59:39 - INFO - __main__ - Step 140228: {'lr': 5.360237597715811e-06, 'samples': 26923776, 'steps': 140227, 'loss/train': 0.5707634091377258} 11/07/2021 16:59:39 - INFO - __main__ - Step 140229: {'lr': 5.359144641435276e-06, 'samples': 26923968, 'steps': 140228, 'loss/train': 1.3691649436950684} 11/07/2021 16:59:39 - INFO - __main__ - Step 140230: {'lr': 5.358051795386121e-06, 'samples': 26924160, 'steps': 140229, 'loss/train': 1.4701664447784424} 11/07/2021 16:59:40 - INFO - __main__ - Step 140231: {'lr': 5.356959059568872e-06, 'samples': 26924352, 'steps': 140230, 'loss/train': 0.976089596748352} 11/07/2021 16:59:40 - INFO - __main__ - Step 140232: {'lr': 5.355866433983975e-06, 'samples': 26924544, 'steps': 140231, 'loss/train': 1.0319368839263916} 11/07/2021 16:59:41 - INFO - __main__ - Step 140233: {'lr': 5.3547739186319836e-06, 'samples': 26924736, 'steps': 140232, 'loss/train': 1.4534175395965576} 11/07/2021 16:59:41 - INFO - __main__ - Step 140234: {'lr': 5.353681513513342e-06, 'samples': 26924928, 'steps': 140233, 'loss/train': 1.252241611480713} 11/07/2021 16:59:42 - INFO - __main__ - Step 140235: {'lr': 5.352589218628551e-06, 'samples': 26925120, 'steps': 140234, 'loss/train': 1.4432026147842407} 11/07/2021 16:59:42 - INFO - __main__ - Step 140236: {'lr': 5.351497033978137e-06, 'samples': 26925312, 'steps': 140235, 'loss/train': 1.4170215129852295} 11/07/2021 16:59:42 - INFO - __main__ - Step 140237: {'lr': 5.350404959562544e-06, 'samples': 26925504, 'steps': 140236, 'loss/train': 1.2637015581130981} 11/07/2021 16:59:43 - INFO - __main__ - Step 140238: {'lr': 5.3493129953822715e-06, 'samples': 26925696, 'steps': 140237, 'loss/train': 1.374834656715393} 11/07/2021 16:59:44 - INFO - __main__ - Step 140239: {'lr': 5.34822114143782e-06, 'samples': 26925888, 'steps': 140238, 'loss/train': 1.2893191576004028} 11/07/2021 16:59:44 - INFO - __main__ - Step 140240: {'lr': 5.347129397729689e-06, 'samples': 26926080, 'steps': 140239, 'loss/train': 1.3455082178115845} 11/07/2021 16:59:45 - INFO - __main__ - Step 140241: {'lr': 5.346037764258377e-06, 'samples': 26926272, 'steps': 140240, 'loss/train': 1.6993719339370728} 11/07/2021 16:59:45 - INFO - __main__ - Step 140242: {'lr': 5.344946241024356e-06, 'samples': 26926464, 'steps': 140241, 'loss/train': 1.1225403547286987} 11/07/2021 16:59:45 - INFO - __main__ - Step 140243: {'lr': 5.343854828028127e-06, 'samples': 26926656, 'steps': 140242, 'loss/train': 0.29628437757492065} 11/07/2021 16:59:46 - INFO - __main__ - Step 140244: {'lr': 5.342763525270189e-06, 'samples': 26926848, 'steps': 140243, 'loss/train': 1.0999538898468018} 11/07/2021 16:59:47 - INFO - __main__ - Step 140245: {'lr': 5.341672332751013e-06, 'samples': 26927040, 'steps': 140244, 'loss/train': 0.8924398422241211} 11/07/2021 16:59:47 - INFO - __main__ - Step 140246: {'lr': 5.340581250471127e-06, 'samples': 26927232, 'steps': 140245, 'loss/train': 1.3044378757476807} 11/07/2021 16:59:47 - INFO - __main__ - Step 140247: {'lr': 5.3394902784310025e-06, 'samples': 26927424, 'steps': 140246, 'loss/train': 1.6055904626846313} 11/07/2021 16:59:48 - INFO - __main__ - Step 140248: {'lr': 5.338399416631112e-06, 'samples': 26927616, 'steps': 140247, 'loss/train': 1.0560921430587769} 11/07/2021 16:59:49 - INFO - __main__ - Step 140249: {'lr': 5.337308665071983e-06, 'samples': 26927808, 'steps': 140248, 'loss/train': 1.471171259880066} 11/07/2021 16:59:49 - INFO - __main__ - Step 140250: {'lr': 5.336218023754058e-06, 'samples': 26928000, 'steps': 140249, 'loss/train': 1.4389207363128662} 11/07/2021 16:59:50 - INFO - __main__ - Step 140251: {'lr': 5.335127492677866e-06, 'samples': 26928192, 'steps': 140250, 'loss/train': 1.059586524963379} 11/07/2021 16:59:50 - INFO - __main__ - Step 140252: {'lr': 5.334037071843878e-06, 'samples': 26928384, 'steps': 140251, 'loss/train': 1.3426907062530518} 11/07/2021 16:59:50 - INFO - __main__ - Step 140253: {'lr': 5.3329467612526216e-06, 'samples': 26928576, 'steps': 140252, 'loss/train': 1.2958332300186157} 11/07/2021 16:59:51 - INFO - __main__ - Step 140254: {'lr': 5.331856560904541e-06, 'samples': 26928768, 'steps': 140253, 'loss/train': 1.238543152809143} 11/07/2021 16:59:52 - INFO - __main__ - Step 140255: {'lr': 5.330766470800164e-06, 'samples': 26928960, 'steps': 140254, 'loss/train': 0.8868178129196167} 11/07/2021 16:59:52 - INFO - __main__ - Step 140256: {'lr': 5.329676490939989e-06, 'samples': 26929152, 'steps': 140255, 'loss/train': 1.278421401977539} 11/07/2021 16:59:52 - INFO - __main__ - Step 140257: {'lr': 5.328586621324461e-06, 'samples': 26929344, 'steps': 140256, 'loss/train': 1.510390281677246} 11/07/2021 16:59:53 - INFO - __main__ - Step 140258: {'lr': 5.327496861954106e-06, 'samples': 26929536, 'steps': 140257, 'loss/train': 1.2672297954559326} 11/07/2021 16:59:54 - INFO - __main__ - Step 140259: {'lr': 5.326407212829398e-06, 'samples': 26929728, 'steps': 140258, 'loss/train': 1.106475591659546} 11/07/2021 16:59:54 - INFO - __main__ - Step 140260: {'lr': 5.325317673950836e-06, 'samples': 26929920, 'steps': 140259, 'loss/train': 1.2476409673690796} 11/07/2021 16:59:55 - INFO - __main__ - Step 140261: {'lr': 5.3242282453189186e-06, 'samples': 26930112, 'steps': 140260, 'loss/train': 1.588348150253296} 11/07/2021 16:59:55 - INFO - __main__ - Step 140262: {'lr': 5.323138926934118e-06, 'samples': 26930304, 'steps': 140261, 'loss/train': 1.192468523979187} 11/07/2021 16:59:55 - INFO - __main__ - Step 140263: {'lr': 5.3220497187969345e-06, 'samples': 26930496, 'steps': 140262, 'loss/train': 1.1400277614593506} 11/07/2021 16:59:56 - INFO - __main__ - Step 140264: {'lr': 5.320960620907866e-06, 'samples': 26930688, 'steps': 140263, 'loss/train': 1.188838243484497} 11/07/2021 16:59:57 - INFO - __main__ - Step 140265: {'lr': 5.319871633267415e-06, 'samples': 26930880, 'steps': 140264, 'loss/train': 1.2731614112854004} 11/07/2021 16:59:57 - INFO - __main__ - Step 140266: {'lr': 5.318782755876023e-06, 'samples': 26931072, 'steps': 140265, 'loss/train': 1.4839400053024292} 11/07/2021 16:59:57 - INFO - __main__ - Step 140267: {'lr': 5.317693988734218e-06, 'samples': 26931264, 'steps': 140266, 'loss/train': 1.595439076423645} 11/07/2021 16:59:58 - INFO - __main__ - Step 140268: {'lr': 5.3166053318425e-06, 'samples': 26931456, 'steps': 140267, 'loss/train': 1.5838003158569336} 11/07/2021 16:59:58 - INFO - __main__ - Step 140269: {'lr': 5.315516785201313e-06, 'samples': 26931648, 'steps': 140268, 'loss/train': 1.4294650554656982} 11/07/2021 16:59:59 - INFO - __main__ - Step 140270: {'lr': 5.314428348811212e-06, 'samples': 26931840, 'steps': 140269, 'loss/train': 1.3081295490264893} 11/07/2021 16:59:59 - INFO - __main__ - Step 140271: {'lr': 5.313340022672642e-06, 'samples': 26932032, 'steps': 140270, 'loss/train': 1.2130825519561768} 11/07/2021 17:00:00 - INFO - __main__ - Step 140272: {'lr': 5.312251806786101e-06, 'samples': 26932224, 'steps': 140271, 'loss/train': 1.1531072854995728} 11/07/2021 17:00:00 - INFO - __main__ - Step 140273: {'lr': 5.311163701152089e-06, 'samples': 26932416, 'steps': 140272, 'loss/train': 1.470332145690918} 11/07/2021 17:00:00 - INFO - __main__ - Step 140274: {'lr': 5.310075705771105e-06, 'samples': 26932608, 'steps': 140273, 'loss/train': 1.3369559049606323} 11/07/2021 17:00:01 - INFO - __main__ - Step 140275: {'lr': 5.308987820643596e-06, 'samples': 26932800, 'steps': 140274, 'loss/train': 1.2724435329437256} 11/07/2021 17:00:02 - INFO - __main__ - Step 140276: {'lr': 5.307900045770114e-06, 'samples': 26932992, 'steps': 140275, 'loss/train': 1.357765793800354} 11/07/2021 17:00:02 - INFO - __main__ - Step 140277: {'lr': 5.306812381151077e-06, 'samples': 26933184, 'steps': 140276, 'loss/train': 0.6745786070823669} 11/07/2021 17:00:03 - INFO - __main__ - Step 140278: {'lr': 5.305724826787039e-06, 'samples': 26933376, 'steps': 140277, 'loss/train': 0.9647443294525146} 11/07/2021 17:00:03 - INFO - __main__ - Step 140279: {'lr': 5.304637382678446e-06, 'samples': 26933568, 'steps': 140278, 'loss/train': 0.15682193636894226} 11/07/2021 17:00:04 - INFO - __main__ - Step 140280: {'lr': 5.303550048825823e-06, 'samples': 26933760, 'steps': 140279, 'loss/train': 1.5936279296875} 11/07/2021 17:00:04 - INFO - __main__ - Step 140281: {'lr': 5.302462825229642e-06, 'samples': 26933952, 'steps': 140280, 'loss/train': 1.4151508808135986} 11/07/2021 17:00:05 - INFO - __main__ - Step 140282: {'lr': 5.301375711890405e-06, 'samples': 26934144, 'steps': 140281, 'loss/train': 1.2305265665054321} 11/07/2021 17:00:05 - INFO - __main__ - Step 140283: {'lr': 5.300288708808582e-06, 'samples': 26934336, 'steps': 140282, 'loss/train': 1.4343178272247314} 11/07/2021 17:00:05 - INFO - __main__ - Step 140284: {'lr': 5.299201815984672e-06, 'samples': 26934528, 'steps': 140283, 'loss/train': 1.37746262550354} 11/07/2021 17:00:06 - INFO - __main__ - Step 140285: {'lr': 5.298115033419176e-06, 'samples': 26934720, 'steps': 140284, 'loss/train': 1.2908755540847778} 11/07/2021 17:00:07 - INFO - __main__ - Step 140286: {'lr': 5.297028361112566e-06, 'samples': 26934912, 'steps': 140285, 'loss/train': 0.7312090992927551} 11/07/2021 17:00:07 - INFO - __main__ - Step 140287: {'lr': 5.29594179906534e-06, 'samples': 26935104, 'steps': 140286, 'loss/train': 1.3573932647705078} 11/07/2021 17:00:07 - INFO - __main__ - Step 140288: {'lr': 5.294855347277999e-06, 'samples': 26935296, 'steps': 140287, 'loss/train': 1.2978861331939697} 11/07/2021 17:00:08 - INFO - __main__ - Step 140289: {'lr': 5.293769005751014e-06, 'samples': 26935488, 'steps': 140288, 'loss/train': 1.4715303182601929} 11/07/2021 17:00:09 - INFO - __main__ - Step 140290: {'lr': 5.292682774484858e-06, 'samples': 26935680, 'steps': 140289, 'loss/train': 1.5601180791854858} 11/07/2021 17:00:09 - INFO - __main__ - Step 140291: {'lr': 5.2915966534800575e-06, 'samples': 26935872, 'steps': 140290, 'loss/train': 1.3472704887390137} 11/07/2021 17:00:10 - INFO - __main__ - Step 140292: {'lr': 5.2905106427370845e-06, 'samples': 26936064, 'steps': 140291, 'loss/train': 1.5150043964385986} 11/07/2021 17:00:10 - INFO - __main__ - Step 140293: {'lr': 5.289424742256438e-06, 'samples': 26936256, 'steps': 140292, 'loss/train': 1.7879409790039062} 11/07/2021 17:00:10 - INFO - __main__ - Step 140294: {'lr': 5.2883389520385905e-06, 'samples': 26936448, 'steps': 140293, 'loss/train': 1.1835482120513916} 11/07/2021 17:00:11 - INFO - __main__ - Step 140295: {'lr': 5.287253272084069e-06, 'samples': 26936640, 'steps': 140294, 'loss/train': 0.8732261061668396} 11/07/2021 17:00:12 - INFO - __main__ - Step 140296: {'lr': 5.286167702393291e-06, 'samples': 26936832, 'steps': 140295, 'loss/train': 1.168378233909607} 11/07/2021 17:00:12 - INFO - __main__ - Step 140297: {'lr': 5.2850822429668375e-06, 'samples': 26937024, 'steps': 140296, 'loss/train': 1.2449692487716675} 11/07/2021 17:00:12 - INFO - __main__ - Step 140298: {'lr': 5.283996893805126e-06, 'samples': 26937216, 'steps': 140297, 'loss/train': 1.3360569477081299} 11/07/2021 17:00:13 - INFO - __main__ - Step 140299: {'lr': 5.282911654908656e-06, 'samples': 26937408, 'steps': 140298, 'loss/train': 1.5443809032440186} 11/07/2021 17:00:13 - INFO - __main__ - Step 140300: {'lr': 5.2818265262779275e-06, 'samples': 26937600, 'steps': 140299, 'loss/train': 1.1159074306488037} 11/07/2021 17:00:14 - INFO - __main__ - Step 140301: {'lr': 5.2807415079134946e-06, 'samples': 26937792, 'steps': 140300, 'loss/train': 1.9223805665969849} 11/07/2021 17:00:14 - INFO - __main__ - Step 140302: {'lr': 5.279656599815718e-06, 'samples': 26937984, 'steps': 140301, 'loss/train': 1.9711054563522339} 11/07/2021 17:00:15 - INFO - __main__ - Step 140303: {'lr': 5.278571801985183e-06, 'samples': 26938176, 'steps': 140302, 'loss/train': 1.495884656906128} 11/07/2021 17:00:15 - INFO - __main__ - Step 140304: {'lr': 5.27748711442233e-06, 'samples': 26938368, 'steps': 140303, 'loss/train': 0.7717480659484863} 11/07/2021 17:00:16 - INFO - __main__ - Step 140305: {'lr': 5.276402537127662e-06, 'samples': 26938560, 'steps': 140304, 'loss/train': 1.4721322059631348} 11/07/2021 17:00:17 - INFO - __main__ - Step 140306: {'lr': 5.275318070101676e-06, 'samples': 26938752, 'steps': 140305, 'loss/train': 1.1712822914123535} 11/07/2021 17:00:17 - INFO - __main__ - Step 140307: {'lr': 5.274233713344845e-06, 'samples': 26938944, 'steps': 140306, 'loss/train': 1.0519194602966309} 11/07/2021 17:00:17 - INFO - __main__ - Step 140308: {'lr': 5.273149466857696e-06, 'samples': 26939136, 'steps': 140307, 'loss/train': 1.6620690822601318} 11/07/2021 17:00:18 - INFO - __main__ - Step 140309: {'lr': 5.272065330640674e-06, 'samples': 26939328, 'steps': 140308, 'loss/train': 1.9964361190795898} 11/07/2021 17:00:18 - INFO - __main__ - Step 140310: {'lr': 5.270981304694278e-06, 'samples': 26939520, 'steps': 140309, 'loss/train': 0.6311318278312683} 11/07/2021 17:00:19 - INFO - __main__ - Step 140311: {'lr': 5.269897389019007e-06, 'samples': 26939712, 'steps': 140310, 'loss/train': 1.6708383560180664} 11/07/2021 17:00:20 - INFO - __main__ - Step 140312: {'lr': 5.268813583615334e-06, 'samples': 26939904, 'steps': 140311, 'loss/train': 1.12363600730896} 11/07/2021 17:00:20 - INFO - __main__ - Step 140313: {'lr': 5.267729888483758e-06, 'samples': 26940096, 'steps': 140312, 'loss/train': 1.0712906122207642} 11/07/2021 17:00:20 - INFO - __main__ - Step 140314: {'lr': 5.266646303624778e-06, 'samples': 26940288, 'steps': 140313, 'loss/train': 0.9914337396621704} 11/07/2021 17:00:21 - INFO - __main__ - Step 140315: {'lr': 5.265562829038895e-06, 'samples': 26940480, 'steps': 140314, 'loss/train': 1.2859612703323364} 11/07/2021 17:00:22 - INFO - __main__ - Step 140316: {'lr': 5.264479464726524e-06, 'samples': 26940672, 'steps': 140315, 'loss/train': 0.9756931662559509} 11/07/2021 17:00:22 - INFO - __main__ - Step 140317: {'lr': 5.263396210688248e-06, 'samples': 26940864, 'steps': 140316, 'loss/train': 1.297029733657837} 11/07/2021 17:00:22 - INFO - __main__ - Step 140318: {'lr': 5.262313066924457e-06, 'samples': 26941056, 'steps': 140317, 'loss/train': 1.1633384227752686} 11/07/2021 17:00:23 - INFO - __main__ - Step 140319: {'lr': 5.261230033435732e-06, 'samples': 26941248, 'steps': 140318, 'loss/train': 2.116497755050659} 11/07/2021 17:00:23 - INFO - __main__ - Step 140320: {'lr': 5.260147110222491e-06, 'samples': 26941440, 'steps': 140319, 'loss/train': 1.0919196605682373} 11/07/2021 17:00:24 - INFO - __main__ - Step 140321: {'lr': 5.259064297285287e-06, 'samples': 26941632, 'steps': 140320, 'loss/train': 0.80912846326828} 11/07/2021 17:00:24 - INFO - __main__ - Step 140322: {'lr': 5.257981594624539e-06, 'samples': 26941824, 'steps': 140321, 'loss/train': 1.0858997106552124} 11/07/2021 17:00:25 - INFO - __main__ - Step 140323: {'lr': 5.2568990022407725e-06, 'samples': 26942016, 'steps': 140322, 'loss/train': 1.3393689393997192} 11/07/2021 17:00:25 - INFO - __main__ - Step 140324: {'lr': 5.255816520134488e-06, 'samples': 26942208, 'steps': 140323, 'loss/train': 0.754555881023407} 11/07/2021 17:00:25 - INFO - __main__ - Step 140325: {'lr': 5.254734148306156e-06, 'samples': 26942400, 'steps': 140324, 'loss/train': 1.3852695226669312} 11/07/2021 17:00:26 - INFO - __main__ - Step 140326: {'lr': 5.25365188675625e-06, 'samples': 26942592, 'steps': 140325, 'loss/train': 0.8618209958076477} 11/07/2021 17:00:27 - INFO - __main__ - Step 140327: {'lr': 5.252569735485269e-06, 'samples': 26942784, 'steps': 140326, 'loss/train': 1.0264689922332764} 11/07/2021 17:00:27 - INFO - __main__ - Step 140328: {'lr': 5.25148769449374e-06, 'samples': 26942976, 'steps': 140327, 'loss/train': 1.2820405960083008} 11/07/2021 17:00:28 - INFO - __main__ - Step 140329: {'lr': 5.250405763782079e-06, 'samples': 26943168, 'steps': 140328, 'loss/train': 0.4220086932182312} 11/07/2021 17:00:28 - INFO - __main__ - Step 140330: {'lr': 5.249323943350815e-06, 'samples': 26943360, 'steps': 140329, 'loss/train': 0.8752583265304565} 11/07/2021 17:00:28 - INFO - __main__ - Step 140331: {'lr': 5.2482422332004175e-06, 'samples': 26943552, 'steps': 140330, 'loss/train': 0.9922376275062561} 11/07/2021 17:00:29 - INFO - __main__ - Step 140332: {'lr': 5.247160633331388e-06, 'samples': 26943744, 'steps': 140331, 'loss/train': 1.2592169046401978} 11/07/2021 17:00:30 - INFO - __main__ - Step 140333: {'lr': 5.246079143744226e-06, 'samples': 26943936, 'steps': 140332, 'loss/train': 1.2179667949676514} 11/07/2021 17:00:30 - INFO - __main__ - Step 140334: {'lr': 5.244997764439402e-06, 'samples': 26944128, 'steps': 140333, 'loss/train': 0.905376672744751} 11/07/2021 17:00:30 - INFO - __main__ - Step 140335: {'lr': 5.243916495417389e-06, 'samples': 26944320, 'steps': 140334, 'loss/train': 1.1946709156036377} 11/07/2021 17:00:31 - INFO - __main__ - Step 140336: {'lr': 5.242835336678714e-06, 'samples': 26944512, 'steps': 140335, 'loss/train': 1.1395814418792725} 11/07/2021 17:00:32 - INFO - __main__ - Step 140337: {'lr': 5.241754288223821e-06, 'samples': 26944704, 'steps': 140336, 'loss/train': 1.6657556295394897} 11/07/2021 17:00:32 - INFO - __main__ - Step 140338: {'lr': 5.24067335005321e-06, 'samples': 26944896, 'steps': 140337, 'loss/train': 1.0169469118118286} 11/07/2021 17:00:32 - INFO - __main__ - Step 140339: {'lr': 5.239592522167408e-06, 'samples': 26945088, 'steps': 140338, 'loss/train': 0.9954442381858826} 11/07/2021 17:00:33 - INFO - __main__ - Step 140340: {'lr': 5.238511804566831e-06, 'samples': 26945280, 'steps': 140339, 'loss/train': 1.3389921188354492} 11/07/2021 17:00:33 - INFO - __main__ - Step 140341: {'lr': 5.237431197252063e-06, 'samples': 26945472, 'steps': 140340, 'loss/train': 1.2350155115127563} 11/07/2021 17:00:34 - INFO - __main__ - Step 140342: {'lr': 5.236350700223491e-06, 'samples': 26945664, 'steps': 140341, 'loss/train': 1.2256014347076416} 11/07/2021 17:00:34 - INFO - __main__ - Step 140343: {'lr': 5.235270313481644e-06, 'samples': 26945856, 'steps': 140342, 'loss/train': 0.856502115726471} 11/07/2021 17:00:35 - INFO - __main__ - Step 140344: {'lr': 5.234190037026992e-06, 'samples': 26946048, 'steps': 140343, 'loss/train': 1.0708869695663452} 11/07/2021 17:00:35 - INFO - __main__ - Step 140345: {'lr': 5.233109870860037e-06, 'samples': 26946240, 'steps': 140344, 'loss/train': 1.1893740892410278} 11/07/2021 17:00:36 - INFO - __main__ - Step 140346: {'lr': 5.232029814981276e-06, 'samples': 26946432, 'steps': 140345, 'loss/train': 1.4651786088943481} 11/07/2021 17:00:36 - INFO - __main__ - Step 140347: {'lr': 5.23094986939121e-06, 'samples': 26946624, 'steps': 140346, 'loss/train': 0.6644596457481384} 11/07/2021 17:00:37 - INFO - __main__ - Step 140348: {'lr': 5.2298700340902565e-06, 'samples': 26946816, 'steps': 140347, 'loss/train': 1.3146075010299683} 11/07/2021 17:00:37 - INFO - __main__ - Step 140349: {'lr': 5.228790309078968e-06, 'samples': 26947008, 'steps': 140348, 'loss/train': 1.2406286001205444} 11/07/2021 17:00:38 - INFO - __main__ - Step 140350: {'lr': 5.227710694357818e-06, 'samples': 26947200, 'steps': 140349, 'loss/train': 0.7291215658187866} 11/07/2021 17:00:38 - INFO - __main__ - Step 140351: {'lr': 5.226631189927278e-06, 'samples': 26947392, 'steps': 140350, 'loss/train': 1.2590469121932983} 11/07/2021 17:00:38 - INFO - __main__ - Step 140352: {'lr': 5.2255517957878475e-06, 'samples': 26947584, 'steps': 140351, 'loss/train': 1.16264808177948} 11/07/2021 17:00:41 - INFO - __main__ - Step 140353: {'lr': 5.224472511939999e-06, 'samples': 26947776, 'steps': 140352, 'loss/train': 1.7772371768951416} 11/07/2021 17:00:41 - INFO - __main__ - Step 140354: {'lr': 5.22339333838423e-06, 'samples': 26947968, 'steps': 140353, 'loss/train': 1.300607681274414} 11/07/2021 17:00:41 - INFO - __main__ - Step 140355: {'lr': 5.222314275121043e-06, 'samples': 26948160, 'steps': 140354, 'loss/train': 0.3780193328857422} 11/07/2021 17:00:42 - INFO - __main__ - Step 140356: {'lr': 5.22123532215088e-06, 'samples': 26948352, 'steps': 140355, 'loss/train': 0.43994078040122986} 11/07/2021 17:00:42 - INFO - __main__ - Step 140357: {'lr': 5.220156479474242e-06, 'samples': 26948544, 'steps': 140356, 'loss/train': 0.46744075417518616} 11/07/2021 17:00:42 - INFO - __main__ - Step 140358: {'lr': 5.219077747091627e-06, 'samples': 26948736, 'steps': 140357, 'loss/train': 0.513845682144165} 11/07/2021 17:00:43 - INFO - __main__ - Step 140359: {'lr': 5.217999125003536e-06, 'samples': 26948928, 'steps': 140358, 'loss/train': 1.8808246850967407} 11/07/2021 17:00:44 - INFO - __main__ - Step 140360: {'lr': 5.216920613210413e-06, 'samples': 26949120, 'steps': 140359, 'loss/train': 0.3927803933620453} 11/07/2021 17:00:44 - INFO - __main__ - Step 140361: {'lr': 5.215842211712785e-06, 'samples': 26949312, 'steps': 140360, 'loss/train': 1.1823935508728027} 11/07/2021 17:00:44 - INFO - __main__ - Step 140362: {'lr': 5.214763920511123e-06, 'samples': 26949504, 'steps': 140361, 'loss/train': 0.6512784957885742} 11/07/2021 17:00:45 - INFO - __main__ - Step 140363: {'lr': 5.2136857396059e-06, 'samples': 26949696, 'steps': 140362, 'loss/train': 1.6495956182479858} 11/07/2021 17:00:45 - INFO - __main__ - Step 140364: {'lr': 5.212607668997615e-06, 'samples': 26949888, 'steps': 140363, 'loss/train': 1.0576632022857666} 11/07/2021 17:00:46 - INFO - __main__ - Step 140365: {'lr': 5.211529708686741e-06, 'samples': 26950080, 'steps': 140364, 'loss/train': 1.741041660308838} 11/07/2021 17:00:47 - INFO - __main__ - Step 140366: {'lr': 5.210451858673804e-06, 'samples': 26950272, 'steps': 140365, 'loss/train': 0.09420565515756607} 11/07/2021 17:00:47 - INFO - __main__ - Step 140367: {'lr': 5.209374118959248e-06, 'samples': 26950464, 'steps': 140366, 'loss/train': 1.034958839416504} 11/07/2021 17:00:47 - INFO - __main__ - Step 140368: {'lr': 5.208296489543573e-06, 'samples': 26950656, 'steps': 140367, 'loss/train': 1.2068394422531128} 11/07/2021 17:00:48 - INFO - __main__ - Step 140369: {'lr': 5.207218970427252e-06, 'samples': 26950848, 'steps': 140368, 'loss/train': 1.455683708190918} 11/07/2021 17:00:49 - INFO - __main__ - Step 140370: {'lr': 5.206141561610783e-06, 'samples': 26951040, 'steps': 140369, 'loss/train': 1.2551064491271973} 11/07/2021 17:00:49 - INFO - __main__ - Step 140371: {'lr': 5.205064263094666e-06, 'samples': 26951232, 'steps': 140370, 'loss/train': 1.3644040822982788} 11/07/2021 17:00:50 - INFO - __main__ - Step 140372: {'lr': 5.203987074879346e-06, 'samples': 26951424, 'steps': 140371, 'loss/train': 1.254861831665039} 11/07/2021 17:00:50 - INFO - __main__ - Step 140373: {'lr': 5.202909996965349e-06, 'samples': 26951616, 'steps': 140372, 'loss/train': 1.157201886177063} 11/07/2021 17:00:50 - INFO - __main__ - Step 140374: {'lr': 5.201833029353121e-06, 'samples': 26951808, 'steps': 140373, 'loss/train': 1.0858876705169678} 11/07/2021 17:00:51 - INFO - __main__ - Step 140375: {'lr': 5.200756172043186e-06, 'samples': 26952000, 'steps': 140374, 'loss/train': 1.4381574392318726} 11/07/2021 17:00:52 - INFO - __main__ - Step 140376: {'lr': 5.199679425036019e-06, 'samples': 26952192, 'steps': 140375, 'loss/train': 1.1513426303863525} 11/07/2021 17:00:52 - INFO - __main__ - Step 140377: {'lr': 5.198602788332091e-06, 'samples': 26952384, 'steps': 140376, 'loss/train': 1.5038012266159058} 11/07/2021 17:00:52 - INFO - __main__ - Step 140378: {'lr': 5.197526261931901e-06, 'samples': 26952576, 'steps': 140377, 'loss/train': 1.2605106830596924} 11/07/2021 17:00:53 - INFO - __main__ - Step 140379: {'lr': 5.196449845835921e-06, 'samples': 26952768, 'steps': 140378, 'loss/train': 1.342084527015686} 11/07/2021 17:00:53 - INFO - __main__ - Step 140380: {'lr': 5.195373540044651e-06, 'samples': 26952960, 'steps': 140379, 'loss/train': 1.3615537881851196} 11/07/2021 17:00:54 - INFO - __main__ - Step 140381: {'lr': 5.194297344558535e-06, 'samples': 26953152, 'steps': 140380, 'loss/train': 1.4124820232391357} 11/07/2021 17:00:55 - INFO - __main__ - Step 140382: {'lr': 5.193221259378156e-06, 'samples': 26953344, 'steps': 140381, 'loss/train': 0.936513364315033} 11/07/2021 17:00:55 - INFO - __main__ - Step 140383: {'lr': 5.192145284503902e-06, 'samples': 26953536, 'steps': 140382, 'loss/train': 1.596134901046753} 11/07/2021 17:00:55 - INFO - __main__ - Step 140384: {'lr': 5.191069419936273e-06, 'samples': 26953728, 'steps': 140383, 'loss/train': 1.2300100326538086} 11/07/2021 17:00:56 - INFO - __main__ - Step 140385: {'lr': 5.1899936656757685e-06, 'samples': 26953920, 'steps': 140384, 'loss/train': 1.6027483940124512} 11/07/2021 17:00:57 - INFO - __main__ - Step 140386: {'lr': 5.188918021722888e-06, 'samples': 26954112, 'steps': 140385, 'loss/train': 1.8183033466339111} 11/07/2021 17:00:57 - INFO - __main__ - Step 140387: {'lr': 5.187842488078104e-06, 'samples': 26954304, 'steps': 140386, 'loss/train': 1.2100926637649536} 11/07/2021 17:00:57 - INFO - __main__ - Step 140388: {'lr': 5.186767064741915e-06, 'samples': 26954496, 'steps': 140387, 'loss/train': 1.313698410987854} 11/07/2021 17:00:58 - INFO - __main__ - Step 140389: {'lr': 5.185691751714766e-06, 'samples': 26954688, 'steps': 140388, 'loss/train': 1.4631294012069702} 11/07/2021 17:00:58 - INFO - __main__ - Step 140390: {'lr': 5.1846165489971846e-06, 'samples': 26954880, 'steps': 140389, 'loss/train': 1.334754467010498} 11/07/2021 17:00:59 - INFO - __main__ - Step 140391: {'lr': 5.183541456589613e-06, 'samples': 26955072, 'steps': 140390, 'loss/train': 1.527948021888733} 11/07/2021 17:00:59 - INFO - __main__ - Step 140392: {'lr': 5.182466474492581e-06, 'samples': 26955264, 'steps': 140391, 'loss/train': 1.183270812034607} 11/07/2021 17:01:00 - INFO - __main__ - Step 140393: {'lr': 5.181391602706531e-06, 'samples': 26955456, 'steps': 140392, 'loss/train': 0.5154502391815186} 11/07/2021 17:01:00 - INFO - __main__ - Step 140394: {'lr': 5.180316841231991e-06, 'samples': 26955648, 'steps': 140393, 'loss/train': 1.4441243410110474} 11/07/2021 17:01:00 - INFO - __main__ - Step 140395: {'lr': 5.179242190069433e-06, 'samples': 26955840, 'steps': 140394, 'loss/train': 1.4041794538497925} 11/07/2021 17:01:02 - INFO - __main__ - Step 140396: {'lr': 5.178167649219329e-06, 'samples': 26956032, 'steps': 140395, 'loss/train': 1.3494800329208374} 11/07/2021 17:01:02 - INFO - __main__ - Step 140397: {'lr': 5.177093218682122e-06, 'samples': 26956224, 'steps': 140396, 'loss/train': 0.5485601425170898} 11/07/2021 17:01:02 - INFO - __main__ - Step 140398: {'lr': 5.1760188984583675e-06, 'samples': 26956416, 'steps': 140397, 'loss/train': 1.2639909982681274} 11/07/2021 17:01:03 - INFO - __main__ - Step 140399: {'lr': 5.174944688548538e-06, 'samples': 26956608, 'steps': 140398, 'loss/train': 1.424593448638916} 11/07/2021 17:01:03 - INFO - __main__ - Step 140400: {'lr': 5.173870588953078e-06, 'samples': 26956800, 'steps': 140399, 'loss/train': 0.80418860912323} 11/07/2021 17:01:03 - INFO - __main__ - Step 140401: {'lr': 5.172796599672486e-06, 'samples': 26956992, 'steps': 140400, 'loss/train': 1.1140360832214355} 11/07/2021 17:01:04 - INFO - __main__ - Step 140402: {'lr': 5.171722720707262e-06, 'samples': 26957184, 'steps': 140401, 'loss/train': 1.5044152736663818} 11/07/2021 17:01:05 - INFO - __main__ - Step 140403: {'lr': 5.170648952057877e-06, 'samples': 26957376, 'steps': 140402, 'loss/train': 0.9486138224601746} 11/07/2021 17:01:05 - INFO - __main__ - Step 140404: {'lr': 5.169575293724832e-06, 'samples': 26957568, 'steps': 140403, 'loss/train': 1.2005550861358643} 11/07/2021 17:01:05 - INFO - __main__ - Step 140405: {'lr': 5.168501745708598e-06, 'samples': 26957760, 'steps': 140404, 'loss/train': 0.6828961968421936} 11/07/2021 17:01:06 - INFO - __main__ - Step 140406: {'lr': 5.167428308009647e-06, 'samples': 26957952, 'steps': 140405, 'loss/train': 1.066625714302063} 11/07/2021 17:01:07 - INFO - __main__ - Step 140407: {'lr': 5.166354980628479e-06, 'samples': 26958144, 'steps': 140406, 'loss/train': 1.1675381660461426} 11/07/2021 17:01:07 - INFO - __main__ - Step 140408: {'lr': 5.165281763565594e-06, 'samples': 26958336, 'steps': 140407, 'loss/train': 1.3041263818740845} 11/07/2021 17:01:08 - INFO - __main__ - Step 140409: {'lr': 5.1642086568214615e-06, 'samples': 26958528, 'steps': 140408, 'loss/train': 1.0930802822113037} 11/07/2021 17:01:08 - INFO - __main__ - Step 140410: {'lr': 5.163135660396528e-06, 'samples': 26958720, 'steps': 140409, 'loss/train': 1.3654634952545166} 11/07/2021 17:01:08 - INFO - __main__ - Step 140411: {'lr': 5.162062774291321e-06, 'samples': 26958912, 'steps': 140410, 'loss/train': 1.5594866275787354} 11/07/2021 17:01:09 - INFO - __main__ - Step 140412: {'lr': 5.16098999850631e-06, 'samples': 26959104, 'steps': 140411, 'loss/train': 1.7766233682632446} 11/07/2021 17:01:10 - INFO - __main__ - Step 140413: {'lr': 5.159917333041969e-06, 'samples': 26959296, 'steps': 140412, 'loss/train': 1.1741389036178589} 11/07/2021 17:01:10 - INFO - __main__ - Step 140414: {'lr': 5.1588447778987966e-06, 'samples': 26959488, 'steps': 140413, 'loss/train': 5.643012046813965} 11/07/2021 17:01:10 - INFO - __main__ - Step 140415: {'lr': 5.157772333077265e-06, 'samples': 26959680, 'steps': 140414, 'loss/train': 1.3128982782363892} 11/07/2021 17:01:11 - INFO - __main__ - Step 140416: {'lr': 5.156699998577846e-06, 'samples': 26959872, 'steps': 140415, 'loss/train': 1.4863147735595703} 11/07/2021 17:01:12 - INFO - __main__ - Step 140417: {'lr': 5.155627774401067e-06, 'samples': 26960064, 'steps': 140416, 'loss/train': 1.395634412765503} 11/07/2021 17:01:12 - INFO - __main__ - Step 140418: {'lr': 5.1545556605473724e-06, 'samples': 26960256, 'steps': 140417, 'loss/train': 1.5469095706939697} 11/07/2021 17:01:12 - INFO - __main__ - Step 140419: {'lr': 5.15348365701726e-06, 'samples': 26960448, 'steps': 140418, 'loss/train': 1.1609631776809692} 11/07/2021 17:01:13 - INFO - __main__ - Step 140420: {'lr': 5.152411763811232e-06, 'samples': 26960640, 'steps': 140419, 'loss/train': 1.1210713386535645} 11/07/2021 17:01:13 - INFO - __main__ - Step 140421: {'lr': 5.151339980929731e-06, 'samples': 26960832, 'steps': 140420, 'loss/train': 1.3285150527954102} 11/07/2021 17:01:14 - INFO - __main__ - Step 140422: {'lr': 5.150268308373257e-06, 'samples': 26961024, 'steps': 140421, 'loss/train': 1.382049322128296} 11/07/2021 17:01:15 - INFO - __main__ - Step 140423: {'lr': 5.14919674614231e-06, 'samples': 26961216, 'steps': 140422, 'loss/train': 1.6031166315078735} 11/07/2021 17:01:15 - INFO - __main__ - Step 140424: {'lr': 5.148125294237332e-06, 'samples': 26961408, 'steps': 140423, 'loss/train': 1.2717825174331665} 11/07/2021 17:01:15 - INFO - __main__ - Step 140425: {'lr': 5.147053952658826e-06, 'samples': 26961600, 'steps': 140424, 'loss/train': 1.2323062419891357} 11/07/2021 17:01:16 - INFO - __main__ - Step 140426: {'lr': 5.145982721407316e-06, 'samples': 26961792, 'steps': 140425, 'loss/train': 1.100763201713562} 11/07/2021 17:01:16 - INFO - __main__ - Step 140427: {'lr': 5.144911600483221e-06, 'samples': 26961984, 'steps': 140426, 'loss/train': 1.1052782535552979} 11/07/2021 17:01:17 - INFO - __main__ - Step 140428: {'lr': 5.143840589887039e-06, 'samples': 26962176, 'steps': 140427, 'loss/train': 1.365991234779358} 11/07/2021 17:01:18 - INFO - __main__ - Step 140429: {'lr': 5.14276968961927e-06, 'samples': 26962368, 'steps': 140428, 'loss/train': 1.8312896490097046} 11/07/2021 17:01:18 - INFO - __main__ - Step 140430: {'lr': 5.141698899680414e-06, 'samples': 26962560, 'steps': 140429, 'loss/train': 1.6368638277053833} 11/07/2021 17:01:18 - INFO - __main__ - Step 140431: {'lr': 5.140628220070914e-06, 'samples': 26962752, 'steps': 140430, 'loss/train': 1.618958830833435} 11/07/2021 17:01:19 - INFO - __main__ - Step 140432: {'lr': 5.139557650791271e-06, 'samples': 26962944, 'steps': 140431, 'loss/train': 0.8824868202209473} 11/07/2021 17:01:20 - INFO - __main__ - Step 140433: {'lr': 5.1384871918419565e-06, 'samples': 26963136, 'steps': 140432, 'loss/train': 1.1745126247406006} 11/07/2021 17:01:20 - INFO - __main__ - Step 140434: {'lr': 5.137416843223469e-06, 'samples': 26963328, 'steps': 140433, 'loss/train': 1.2830402851104736} 11/07/2021 17:01:20 - INFO - __main__ - Step 140435: {'lr': 5.136346604936282e-06, 'samples': 26963520, 'steps': 140434, 'loss/train': 1.0361871719360352} 11/07/2021 17:01:21 - INFO - __main__ - Step 140436: {'lr': 5.135276476980893e-06, 'samples': 26963712, 'steps': 140435, 'loss/train': 1.6123499870300293} 11/07/2021 17:01:21 - INFO - __main__ - Step 140437: {'lr': 5.134206459357748e-06, 'samples': 26963904, 'steps': 140436, 'loss/train': 1.1015071868896484} 11/07/2021 17:01:22 - INFO - __main__ - Step 140438: {'lr': 5.133136552067374e-06, 'samples': 26964096, 'steps': 140437, 'loss/train': 1.261631965637207} 11/07/2021 17:01:23 - INFO - __main__ - Step 140439: {'lr': 5.132066755110215e-06, 'samples': 26964288, 'steps': 140438, 'loss/train': 1.2864205837249756} 11/07/2021 17:01:23 - INFO - __main__ - Step 140440: {'lr': 5.130997068486742e-06, 'samples': 26964480, 'steps': 140439, 'loss/train': 1.3259090185165405} 11/07/2021 17:01:23 - INFO - __main__ - Step 140441: {'lr': 5.129927492197511e-06, 'samples': 26964672, 'steps': 140440, 'loss/train': 1.4268218278884888} 11/07/2021 17:01:24 - INFO - __main__ - Step 140442: {'lr': 5.1288580262429105e-06, 'samples': 26964864, 'steps': 140441, 'loss/train': 1.2557027339935303} 11/07/2021 17:01:25 - INFO - __main__ - Step 140443: {'lr': 5.127788670623496e-06, 'samples': 26965056, 'steps': 140442, 'loss/train': 1.3600249290466309} 11/07/2021 17:01:25 - INFO - __main__ - Step 140444: {'lr': 5.12671942533971e-06, 'samples': 26965248, 'steps': 140443, 'loss/train': 1.6031770706176758} 11/07/2021 17:01:25 - INFO - __main__ - Step 140445: {'lr': 5.125650290392053e-06, 'samples': 26965440, 'steps': 140444, 'loss/train': 1.2660548686981201} 11/07/2021 17:01:26 - INFO - __main__ - Step 140446: {'lr': 5.1245812657809976e-06, 'samples': 26965632, 'steps': 140445, 'loss/train': 2.0513205528259277} 11/07/2021 17:01:26 - INFO - __main__ - Step 140447: {'lr': 5.123512351507042e-06, 'samples': 26965824, 'steps': 140446, 'loss/train': 1.5643372535705566} 11/07/2021 17:01:27 - INFO - __main__ - Step 140448: {'lr': 5.1224435475706325e-06, 'samples': 26966016, 'steps': 140447, 'loss/train': 4.981987953186035} 11/07/2021 17:01:28 - INFO - __main__ - Step 140449: {'lr': 5.121374853972294e-06, 'samples': 26966208, 'steps': 140448, 'loss/train': 1.162127137184143} 11/07/2021 17:01:28 - INFO - __main__ - Step 140450: {'lr': 5.120306270712472e-06, 'samples': 26966400, 'steps': 140449, 'loss/train': 4.362430095672607} 11/07/2021 17:01:28 - INFO - __main__ - Step 140451: {'lr': 5.119237797791665e-06, 'samples': 26966592, 'steps': 140450, 'loss/train': 1.1445413827896118} 11/07/2021 17:01:29 - INFO - __main__ - Step 140452: {'lr': 5.118169435210346e-06, 'samples': 26966784, 'steps': 140451, 'loss/train': 0.8982679843902588} 11/07/2021 17:01:29 - INFO - __main__ - Step 140453: {'lr': 5.117101182968986e-06, 'samples': 26966976, 'steps': 140452, 'loss/train': 0.04585092142224312} 11/07/2021 17:01:30 - INFO - __main__ - Step 140454: {'lr': 5.116033041068113e-06, 'samples': 26967168, 'steps': 140453, 'loss/train': 1.393532395362854} 11/07/2021 17:01:31 - INFO - __main__ - Step 140455: {'lr': 5.114965009508143e-06, 'samples': 26967360, 'steps': 140454, 'loss/train': 1.244634747505188} 11/07/2021 17:01:31 - INFO - __main__ - Step 140456: {'lr': 5.113897088289604e-06, 'samples': 26967552, 'steps': 140455, 'loss/train': 1.3458667993545532} 11/07/2021 17:01:31 - INFO - __main__ - Step 140457: {'lr': 5.112829277412967e-06, 'samples': 26967744, 'steps': 140456, 'loss/train': 1.3554588556289673} 11/07/2021 17:01:32 - INFO - __main__ - Step 140458: {'lr': 5.111761576878704e-06, 'samples': 26967936, 'steps': 140457, 'loss/train': 1.1641833782196045} 11/07/2021 17:01:32 - INFO - __main__ - Step 140459: {'lr': 5.110693986687315e-06, 'samples': 26968128, 'steps': 140458, 'loss/train': 1.120538592338562} 11/07/2021 17:01:33 - INFO - __main__ - Step 140460: {'lr': 5.109626506839271e-06, 'samples': 26968320, 'steps': 140459, 'loss/train': 1.2750842571258545} 11/07/2021 17:01:33 - INFO - __main__ - Step 140461: {'lr': 5.108559137335045e-06, 'samples': 26968512, 'steps': 140460, 'loss/train': 1.3161382675170898} 11/07/2021 17:01:34 - INFO - __main__ - Step 140462: {'lr': 5.107491878175135e-06, 'samples': 26968704, 'steps': 140461, 'loss/train': 1.496795892715454} 11/07/2021 17:01:34 - INFO - __main__ - Step 140463: {'lr': 5.1064247293599875e-06, 'samples': 26968896, 'steps': 140462, 'loss/train': 1.0566322803497314} 11/07/2021 17:01:34 - INFO - __main__ - Step 140464: {'lr': 5.105357690890128e-06, 'samples': 26969088, 'steps': 140463, 'loss/train': 1.1155730485916138} 11/07/2021 17:01:35 - INFO - __main__ - Step 140465: {'lr': 5.104290762766e-06, 'samples': 26969280, 'steps': 140464, 'loss/train': 0.4838675558567047} 11/07/2021 17:01:36 - INFO - __main__ - Step 140466: {'lr': 5.103223944988078e-06, 'samples': 26969472, 'steps': 140465, 'loss/train': 0.5875265002250671} 11/07/2021 17:01:36 - INFO - __main__ - Step 140467: {'lr': 5.102157237556887e-06, 'samples': 26969664, 'steps': 140466, 'loss/train': 0.6790622472763062} 11/07/2021 17:01:37 - INFO - __main__ - Step 140468: {'lr': 5.1010906404728994e-06, 'samples': 26969856, 'steps': 140467, 'loss/train': 1.1682419776916504} 11/07/2021 17:01:37 - INFO - __main__ - Step 140469: {'lr': 5.10002415373656e-06, 'samples': 26970048, 'steps': 140468, 'loss/train': 1.6017537117004395} 11/07/2021 17:01:38 - INFO - __main__ - Step 140470: {'lr': 5.098957777348367e-06, 'samples': 26970240, 'steps': 140469, 'loss/train': 1.305539846420288} 11/07/2021 17:01:38 - INFO - __main__ - Step 140471: {'lr': 5.097891511308822e-06, 'samples': 26970432, 'steps': 140470, 'loss/train': 1.3564233779907227} 11/07/2021 17:01:39 - INFO - __main__ - Step 140472: {'lr': 5.096825355618395e-06, 'samples': 26970624, 'steps': 140471, 'loss/train': 1.114585518836975} 11/07/2021 17:01:39 - INFO - __main__ - Step 140473: {'lr': 5.095759310277559e-06, 'samples': 26970816, 'steps': 140472, 'loss/train': 0.95037841796875} 11/07/2021 17:01:39 - INFO - __main__ - Step 140474: {'lr': 5.094693375286785e-06, 'samples': 26971008, 'steps': 140473, 'loss/train': 1.3717347383499146} 11/07/2021 17:01:40 - INFO - __main__ - Step 140475: {'lr': 5.093627550646545e-06, 'samples': 26971200, 'steps': 140474, 'loss/train': 1.151566743850708} 11/07/2021 17:01:41 - INFO - __main__ - Step 140476: {'lr': 5.09256183635734e-06, 'samples': 26971392, 'steps': 140475, 'loss/train': 1.210998296737671} 11/07/2021 17:01:41 - INFO - __main__ - Step 140477: {'lr': 5.091496232419668e-06, 'samples': 26971584, 'steps': 140476, 'loss/train': 1.333581805229187} 11/07/2021 17:01:41 - INFO - __main__ - Step 140478: {'lr': 5.090430738833973e-06, 'samples': 26971776, 'steps': 140477, 'loss/train': 1.3547375202178955} 11/07/2021 17:01:42 - INFO - __main__ - Step 140479: {'lr': 5.089365355600756e-06, 'samples': 26971968, 'steps': 140478, 'loss/train': 1.1126734018325806} 11/07/2021 17:01:43 - INFO - __main__ - Step 140480: {'lr': 5.088300082720487e-06, 'samples': 26972160, 'steps': 140479, 'loss/train': 1.079721450805664} 11/07/2021 17:01:43 - INFO - __main__ - Step 140481: {'lr': 5.087234920193667e-06, 'samples': 26972352, 'steps': 140480, 'loss/train': 1.1117440462112427} 11/07/2021 17:01:43 - INFO - __main__ - Step 140482: {'lr': 5.0861698680207405e-06, 'samples': 26972544, 'steps': 140481, 'loss/train': 1.1525920629501343} 11/07/2021 17:01:44 - INFO - __main__ - Step 140483: {'lr': 5.0851049262022606e-06, 'samples': 26972736, 'steps': 140482, 'loss/train': 1.0553964376449585} 11/07/2021 17:01:44 - INFO - __main__ - Step 140484: {'lr': 5.08404009473859e-06, 'samples': 26972928, 'steps': 140483, 'loss/train': 0.7396302819252014} 11/07/2021 17:01:45 - INFO - __main__ - Step 140485: {'lr': 5.082975373630283e-06, 'samples': 26973120, 'steps': 140484, 'loss/train': 1.419571876525879} 11/07/2021 17:01:46 - INFO - __main__ - Step 140486: {'lr': 5.081910762877812e-06, 'samples': 26973312, 'steps': 140485, 'loss/train': 1.0860260725021362} 11/07/2021 17:01:46 - INFO - __main__ - Step 140487: {'lr': 5.0808462624816755e-06, 'samples': 26973504, 'steps': 140486, 'loss/train': 1.1883416175842285} 11/07/2021 17:01:46 - INFO - __main__ - Step 140488: {'lr': 5.0797818724422905e-06, 'samples': 26973696, 'steps': 140487, 'loss/train': 1.8032175302505493} 11/07/2021 17:01:47 - INFO - __main__ - Step 140489: {'lr': 5.0787175927602125e-06, 'samples': 26973888, 'steps': 140488, 'loss/train': 1.4825806617736816} 11/07/2021 17:01:48 - INFO - __main__ - Step 140490: {'lr': 5.0776534234358576e-06, 'samples': 26974080, 'steps': 140489, 'loss/train': 1.48613703250885} 11/07/2021 17:01:48 - INFO - __main__ - Step 140491: {'lr': 5.076589364469752e-06, 'samples': 26974272, 'steps': 140490, 'loss/train': 1.330350399017334} 11/07/2021 17:01:48 - INFO - __main__ - Step 140492: {'lr': 5.075525415862342e-06, 'samples': 26974464, 'steps': 140491, 'loss/train': 1.0654059648513794} 11/07/2021 17:01:49 - INFO - __main__ - Step 140493: {'lr': 5.074461577614125e-06, 'samples': 26974656, 'steps': 140492, 'loss/train': 1.4346619844436646} 11/07/2021 17:01:49 - INFO - __main__ - Step 140494: {'lr': 5.073397849725603e-06, 'samples': 26974848, 'steps': 140493, 'loss/train': 1.7640128135681152} 11/07/2021 17:01:49 - INFO - __main__ - Step 140495: {'lr': 5.07233423219719e-06, 'samples': 26975040, 'steps': 140494, 'loss/train': 1.4912890195846558} 11/07/2021 17:01:50 - INFO - __main__ - Step 140496: {'lr': 5.0712707250294145e-06, 'samples': 26975232, 'steps': 140495, 'loss/train': 1.458267331123352} 11/07/2021 17:01:51 - INFO - __main__ - Step 140497: {'lr': 5.070207328222748e-06, 'samples': 26975424, 'steps': 140496, 'loss/train': 1.2559436559677124} 11/07/2021 17:01:51 - INFO - __main__ - Step 140498: {'lr': 5.069144041777662e-06, 'samples': 26975616, 'steps': 140497, 'loss/train': 1.3082917928695679} 11/07/2021 17:01:51 - INFO - __main__ - Step 140499: {'lr': 5.068080865694658e-06, 'samples': 26975808, 'steps': 140498, 'loss/train': 1.360979676246643} 11/07/2021 17:01:52 - INFO - __main__ - Step 140500: {'lr': 5.0670177999741775e-06, 'samples': 26976000, 'steps': 140499, 'loss/train': 1.5501375198364258} 11/07/2021 17:01:53 - INFO - __main__ - Step 140501: {'lr': 5.065954844616722e-06, 'samples': 26976192, 'steps': 140500, 'loss/train': 0.933594286441803} 11/07/2021 17:01:53 - INFO - __main__ - Step 140502: {'lr': 5.064891999622761e-06, 'samples': 26976384, 'steps': 140501, 'loss/train': 1.5407987833023071} 11/07/2021 17:01:54 - INFO - __main__ - Step 140503: {'lr': 5.063829264992797e-06, 'samples': 26976576, 'steps': 140502, 'loss/train': 1.4683129787445068} 11/07/2021 17:01:54 - INFO - __main__ - Step 140504: {'lr': 5.062766640727301e-06, 'samples': 26976768, 'steps': 140503, 'loss/train': 1.3593759536743164} 11/07/2021 17:01:54 - INFO - __main__ - Step 140505: {'lr': 5.061704126826744e-06, 'samples': 26976960, 'steps': 140504, 'loss/train': 1.3695001602172852} 11/07/2021 17:01:55 - INFO - __main__ - Step 140506: {'lr': 5.06064172329157e-06, 'samples': 26977152, 'steps': 140505, 'loss/train': 1.688956618309021} 11/07/2021 17:01:56 - INFO - __main__ - Step 140507: {'lr': 5.059579430122307e-06, 'samples': 26977344, 'steps': 140506, 'loss/train': 1.5112547874450684} 11/07/2021 17:01:56 - INFO - __main__ - Step 140508: {'lr': 5.0585172473194275e-06, 'samples': 26977536, 'steps': 140507, 'loss/train': 1.4936903715133667} 11/07/2021 17:01:56 - INFO - __main__ - Step 140509: {'lr': 5.0574551748833746e-06, 'samples': 26977728, 'steps': 140508, 'loss/train': 1.3501077890396118} 11/07/2021 17:01:57 - INFO - __main__ - Step 140510: {'lr': 5.056393212814675e-06, 'samples': 26977920, 'steps': 140509, 'loss/train': 1.4623043537139893} 11/07/2021 17:01:58 - INFO - __main__ - Step 140511: {'lr': 5.055331361113774e-06, 'samples': 26978112, 'steps': 140510, 'loss/train': 0.7674014568328857} 11/07/2021 17:01:58 - INFO - __main__ - Step 140512: {'lr': 5.054269619781171e-06, 'samples': 26978304, 'steps': 140511, 'loss/train': 1.4197815656661987} 11/07/2021 17:01:58 - INFO - __main__ - Step 140513: {'lr': 5.053207988817338e-06, 'samples': 26978496, 'steps': 140512, 'loss/train': 1.2924551963806152} 11/07/2021 17:01:59 - INFO - __main__ - Step 140514: {'lr': 5.052146468222746e-06, 'samples': 26978688, 'steps': 140513, 'loss/train': 1.1783499717712402} 11/07/2021 17:01:59 - INFO - __main__ - Step 140515: {'lr': 5.051085057997868e-06, 'samples': 26978880, 'steps': 140514, 'loss/train': 1.4419819116592407} 11/07/2021 17:02:00 - INFO - __main__ - Step 140516: {'lr': 5.050023758143202e-06, 'samples': 26979072, 'steps': 140515, 'loss/train': 1.623806118965149} 11/07/2021 17:02:01 - INFO - __main__ - Step 140517: {'lr': 5.048962568659221e-06, 'samples': 26979264, 'steps': 140516, 'loss/train': 1.2071666717529297} 11/07/2021 17:02:01 - INFO - __main__ - Step 140518: {'lr': 5.0479014895463695e-06, 'samples': 26979456, 'steps': 140517, 'loss/train': 1.3579448461532593} 11/07/2021 17:02:01 - INFO - __main__ - Step 140519: {'lr': 5.0468405208051736e-06, 'samples': 26979648, 'steps': 140518, 'loss/train': 1.2520036697387695} 11/07/2021 17:02:02 - INFO - __main__ - Step 140520: {'lr': 5.045779662436078e-06, 'samples': 26979840, 'steps': 140519, 'loss/train': 1.501245141029358} 11/07/2021 17:02:03 - INFO - __main__ - Step 140521: {'lr': 5.044718914439583e-06, 'samples': 26980032, 'steps': 140520, 'loss/train': 1.1116660833358765} 11/07/2021 17:02:03 - INFO - __main__ - Step 140522: {'lr': 5.04365827681616e-06, 'samples': 26980224, 'steps': 140521, 'loss/train': 1.6057137250900269} 11/07/2021 17:02:03 - INFO - __main__ - Step 140523: {'lr': 5.04259774956628e-06, 'samples': 26980416, 'steps': 140522, 'loss/train': 1.2632040977478027} 11/07/2021 17:02:04 - INFO - __main__ - Step 140524: {'lr': 5.041537332690443e-06, 'samples': 26980608, 'steps': 140523, 'loss/train': 1.2013434171676636} 11/07/2021 17:02:04 - INFO - __main__ - Step 140525: {'lr': 5.040477026189094e-06, 'samples': 26980800, 'steps': 140524, 'loss/train': 1.0752302408218384} 11/07/2021 17:02:04 - INFO - __main__ - Step 140526: {'lr': 5.039416830062732e-06, 'samples': 26980992, 'steps': 140525, 'loss/train': 0.818056583404541} 11/07/2021 17:02:05 - INFO - __main__ - Step 140527: {'lr': 5.038356744311828e-06, 'samples': 26981184, 'steps': 140526, 'loss/train': 1.4687663316726685} 11/07/2021 17:02:06 - INFO - __main__ - Step 140528: {'lr': 5.037296768936855e-06, 'samples': 26981376, 'steps': 140527, 'loss/train': 1.5782315731048584} 11/07/2021 17:02:06 - INFO - __main__ - Step 140529: {'lr': 5.0362369039382845e-06, 'samples': 26981568, 'steps': 140528, 'loss/train': 1.3884830474853516} 11/07/2021 17:02:06 - INFO - __main__ - Step 140530: {'lr': 5.035177149316644e-06, 'samples': 26981760, 'steps': 140529, 'loss/train': 1.3251954317092896} 11/07/2021 17:02:07 - INFO - __main__ - Step 140531: {'lr': 5.034117505072349e-06, 'samples': 26981952, 'steps': 140530, 'loss/train': 1.1095365285873413} 11/07/2021 17:02:08 - INFO - __main__ - Step 140532: {'lr': 5.033057971205901e-06, 'samples': 26982144, 'steps': 140531, 'loss/train': 1.160041093826294} 11/07/2021 17:02:08 - INFO - __main__ - Step 140533: {'lr': 5.03199854771777e-06, 'samples': 26982336, 'steps': 140532, 'loss/train': 1.5141401290893555} 11/07/2021 17:02:09 - INFO - __main__ - Step 140534: {'lr': 5.030939234608428e-06, 'samples': 26982528, 'steps': 140533, 'loss/train': 1.2062429189682007} 11/07/2021 17:02:09 - INFO - __main__ - Step 140535: {'lr': 5.029880031878404e-06, 'samples': 26982720, 'steps': 140534, 'loss/train': 1.3282461166381836} 11/07/2021 17:02:09 - INFO - __main__ - Step 140536: {'lr': 5.028820939528111e-06, 'samples': 26982912, 'steps': 140535, 'loss/train': 0.7246084213256836} 11/07/2021 17:02:10 - INFO - __main__ - Step 140537: {'lr': 5.027761957558053e-06, 'samples': 26983104, 'steps': 140536, 'loss/train': 1.345505952835083} 11/07/2021 17:02:11 - INFO - __main__ - Step 140538: {'lr': 5.026703085968698e-06, 'samples': 26983296, 'steps': 140537, 'loss/train': 1.3217800855636597} 11/07/2021 17:02:11 - INFO - __main__ - Step 140539: {'lr': 5.0256443247605476e-06, 'samples': 26983488, 'steps': 140538, 'loss/train': 1.5785009860992432} 11/07/2021 17:02:11 - INFO - __main__ - Step 140540: {'lr': 5.024585673934045e-06, 'samples': 26983680, 'steps': 140539, 'loss/train': 1.3501737117767334} 11/07/2021 17:02:12 - INFO - __main__ - Step 140541: {'lr': 5.023527133489691e-06, 'samples': 26983872, 'steps': 140540, 'loss/train': 1.0744718313217163} 11/07/2021 17:02:13 - INFO - __main__ - Step 140542: {'lr': 5.022468703427957e-06, 'samples': 26984064, 'steps': 140541, 'loss/train': 1.5225698947906494} 11/07/2021 17:02:13 - INFO - __main__ - Step 140543: {'lr': 5.021410383749342e-06, 'samples': 26984256, 'steps': 140542, 'loss/train': 1.057174801826477} 11/07/2021 17:02:13 - INFO - __main__ - Step 140544: {'lr': 5.020352174454263e-06, 'samples': 26984448, 'steps': 140543, 'loss/train': 0.7279711961746216} 11/07/2021 17:02:14 - INFO - __main__ - Step 140545: {'lr': 5.019294075543246e-06, 'samples': 26984640, 'steps': 140544, 'loss/train': 1.3329081535339355} 11/07/2021 17:02:14 - INFO - __main__ - Step 140546: {'lr': 5.018236087016764e-06, 'samples': 26984832, 'steps': 140545, 'loss/train': 1.1300032138824463} 11/07/2021 17:02:15 - INFO - __main__ - Step 140547: {'lr': 5.017178208875262e-06, 'samples': 26985024, 'steps': 140546, 'loss/train': 1.5279561281204224} 11/07/2021 17:02:16 - INFO - __main__ - Step 140548: {'lr': 5.016120441119265e-06, 'samples': 26985216, 'steps': 140547, 'loss/train': 0.957642674446106} 11/07/2021 17:02:16 - INFO - __main__ - Step 140549: {'lr': 5.015062783749191e-06, 'samples': 26985408, 'steps': 140548, 'loss/train': 1.2843992710113525} 11/07/2021 17:02:16 - INFO - __main__ - Step 140550: {'lr': 5.0140052367655676e-06, 'samples': 26985600, 'steps': 140549, 'loss/train': 1.6064437627792358} 11/07/2021 17:02:17 - INFO - __main__ - Step 140551: {'lr': 5.012947800168866e-06, 'samples': 26985792, 'steps': 140550, 'loss/train': 0.7599531412124634} 11/07/2021 17:02:18 - INFO - __main__ - Step 140552: {'lr': 5.01189047395953e-06, 'samples': 26985984, 'steps': 140551, 'loss/train': 1.2681515216827393} 11/07/2021 17:02:18 - INFO - __main__ - Step 140553: {'lr': 5.010833258138059e-06, 'samples': 26986176, 'steps': 140552, 'loss/train': 1.1982364654541016} 11/07/2021 17:02:18 - INFO - __main__ - Step 140554: {'lr': 5.009776152704926e-06, 'samples': 26986368, 'steps': 140553, 'loss/train': 1.5732465982437134} 11/07/2021 17:02:19 - INFO - __main__ - Step 140555: {'lr': 5.00871915766063e-06, 'samples': 26986560, 'steps': 140554, 'loss/train': 1.3606996536254883} 11/07/2021 17:02:19 - INFO - __main__ - Step 140556: {'lr': 5.007662273005586e-06, 'samples': 26986752, 'steps': 140555, 'loss/train': 0.7811370491981506} 11/07/2021 17:02:20 - INFO - __main__ - Step 140557: {'lr': 5.0066054987403516e-06, 'samples': 26986944, 'steps': 140556, 'loss/train': 0.6932734847068787} 11/07/2021 17:02:21 - INFO - __main__ - Step 140558: {'lr': 5.005548834865342e-06, 'samples': 26987136, 'steps': 140557, 'loss/train': 1.2230826616287231} 11/07/2021 17:02:21 - INFO - __main__ - Step 140559: {'lr': 5.004492281381057e-06, 'samples': 26987328, 'steps': 140558, 'loss/train': 1.4595519304275513} 11/07/2021 17:02:21 - INFO - __main__ - Step 140560: {'lr': 5.0034358382879395e-06, 'samples': 26987520, 'steps': 140559, 'loss/train': 1.0809177160263062} 11/07/2021 17:02:22 - INFO - __main__ - Step 140561: {'lr': 5.002379505586518e-06, 'samples': 26987712, 'steps': 140560, 'loss/train': 0.9774028658866882} 11/07/2021 17:02:22 - INFO - __main__ - Step 140562: {'lr': 5.001323283277237e-06, 'samples': 26987904, 'steps': 140561, 'loss/train': 1.3987733125686646} 11/07/2021 17:02:23 - INFO - __main__ - Step 140563: {'lr': 5.000267171360595e-06, 'samples': 26988096, 'steps': 140562, 'loss/train': 0.9620789885520935} 11/07/2021 17:02:23 - INFO - __main__ - Step 140564: {'lr': 4.999211169837037e-06, 'samples': 26988288, 'steps': 140563, 'loss/train': 1.0255731344223022} 11/07/2021 17:02:24 - INFO - __main__ - Step 140565: {'lr': 4.998155278707034e-06, 'samples': 26988480, 'steps': 140564, 'loss/train': 1.1940579414367676} 11/07/2021 17:02:24 - INFO - __main__ - Step 140566: {'lr': 4.997099497971114e-06, 'samples': 26988672, 'steps': 140565, 'loss/train': 1.1238688230514526} 11/07/2021 17:02:24 - INFO - __main__ - Step 140567: {'lr': 4.996043827629693e-06, 'samples': 26988864, 'steps': 140566, 'loss/train': 1.1439170837402344} 11/07/2021 17:02:25 - INFO - __main__ - Step 140568: {'lr': 4.9949882676832984e-06, 'samples': 26989056, 'steps': 140567, 'loss/train': 1.1303683519363403} 11/07/2021 17:02:26 - INFO - __main__ - Step 140569: {'lr': 4.993932818132374e-06, 'samples': 26989248, 'steps': 140568, 'loss/train': 0.5385122895240784} 11/07/2021 17:02:26 - INFO - __main__ - Step 140570: {'lr': 4.99287747897742e-06, 'samples': 26989440, 'steps': 140569, 'loss/train': 1.5542834997177124} 11/07/2021 17:02:26 - INFO - __main__ - Step 140571: {'lr': 4.99182225021888e-06, 'samples': 26989632, 'steps': 140570, 'loss/train': 1.225793480873108} 11/07/2021 17:02:27 - INFO - __main__ - Step 140572: {'lr': 4.990767131857227e-06, 'samples': 26989824, 'steps': 140571, 'loss/train': 1.1050729751586914} 11/07/2021 17:02:28 - INFO - __main__ - Step 140573: {'lr': 4.989712123892959e-06, 'samples': 26990016, 'steps': 140572, 'loss/train': 1.5927776098251343} 11/07/2021 17:02:28 - INFO - __main__ - Step 140574: {'lr': 4.988657226326576e-06, 'samples': 26990208, 'steps': 140573, 'loss/train': 1.7405768632888794} 11/07/2021 17:02:29 - INFO - __main__ - Step 140575: {'lr': 4.9876024391584955e-06, 'samples': 26990400, 'steps': 140574, 'loss/train': 1.1781151294708252} 11/07/2021 17:02:29 - INFO - __main__ - Step 140576: {'lr': 4.986547762389215e-06, 'samples': 26990592, 'steps': 140575, 'loss/train': 1.2995396852493286} 11/07/2021 17:02:29 - INFO - __main__ - Step 140577: {'lr': 4.985493196019236e-06, 'samples': 26990784, 'steps': 140576, 'loss/train': 1.4268144369125366} 11/07/2021 17:02:30 - INFO - __main__ - Step 140578: {'lr': 4.984438740049002e-06, 'samples': 26990976, 'steps': 140577, 'loss/train': 0.5897637605667114} 11/07/2021 17:02:31 - INFO - __main__ - Step 140579: {'lr': 4.983384394478985e-06, 'samples': 26991168, 'steps': 140578, 'loss/train': 1.434222936630249} 11/07/2021 17:02:31 - INFO - __main__ - Step 140580: {'lr': 4.982330159309684e-06, 'samples': 26991360, 'steps': 140579, 'loss/train': 1.420149564743042} 11/07/2021 17:02:31 - INFO - __main__ - Step 140581: {'lr': 4.981276034541571e-06, 'samples': 26991552, 'steps': 140580, 'loss/train': 1.3643031120300293} 11/07/2021 17:02:32 - INFO - __main__ - Step 140582: {'lr': 4.980222020175118e-06, 'samples': 26991744, 'steps': 140581, 'loss/train': 0.9779305458068848} 11/07/2021 17:02:33 - INFO - __main__ - Step 140583: {'lr': 4.9791681162108245e-06, 'samples': 26991936, 'steps': 140582, 'loss/train': 1.2157924175262451} 11/07/2021 17:02:33 - INFO - __main__ - Step 140584: {'lr': 4.9781143226490795e-06, 'samples': 26992128, 'steps': 140583, 'loss/train': 1.4471418857574463} 11/07/2021 17:02:33 - INFO - __main__ - Step 140585: {'lr': 4.977060639490438e-06, 'samples': 26992320, 'steps': 140584, 'loss/train': 1.151460886001587} 11/07/2021 17:02:34 - INFO - __main__ - Step 140586: {'lr': 4.9760070667353714e-06, 'samples': 26992512, 'steps': 140585, 'loss/train': 1.4044536352157593} 11/07/2021 17:02:34 - INFO - __main__ - Step 140587: {'lr': 4.974953604384297e-06, 'samples': 26992704, 'steps': 140586, 'loss/train': 0.9551594853401184} 11/07/2021 17:02:35 - INFO - __main__ - Step 140588: {'lr': 4.973900252437768e-06, 'samples': 26992896, 'steps': 140587, 'loss/train': 1.1666375398635864} 11/07/2021 17:02:36 - INFO - __main__ - Step 140589: {'lr': 4.972847010896175e-06, 'samples': 26993088, 'steps': 140588, 'loss/train': 1.308172583580017} 11/07/2021 17:02:36 - INFO - __main__ - Step 140590: {'lr': 4.971793879760072e-06, 'samples': 26993280, 'steps': 140589, 'loss/train': 1.1061623096466064} 11/07/2021 17:02:36 - INFO - __main__ - Step 140591: {'lr': 4.970740859029876e-06, 'samples': 26993472, 'steps': 140590, 'loss/train': 1.3046998977661133} 11/07/2021 17:02:37 - INFO - __main__ - Step 140592: {'lr': 4.969687948706087e-06, 'samples': 26993664, 'steps': 140591, 'loss/train': 1.1278413534164429} 11/07/2021 17:02:37 - INFO - __main__ - Step 140593: {'lr': 4.968635148789175e-06, 'samples': 26993856, 'steps': 140592, 'loss/train': 1.4591621160507202} 11/07/2021 17:02:38 - INFO - __main__ - Step 140594: {'lr': 4.96758245927964e-06, 'samples': 26994048, 'steps': 140593, 'loss/train': 0.8097019791603088} 11/07/2021 17:02:39 - INFO - __main__ - Step 140595: {'lr': 4.9665298801779e-06, 'samples': 26994240, 'steps': 140594, 'loss/train': 1.7217177152633667} 11/07/2021 17:02:39 - INFO - __main__ - Step 140596: {'lr': 4.965477411484481e-06, 'samples': 26994432, 'steps': 140595, 'loss/train': 1.6055994033813477} 11/07/2021 17:02:39 - INFO - __main__ - Step 140597: {'lr': 4.964425053199828e-06, 'samples': 26994624, 'steps': 140596, 'loss/train': 0.9435338973999023} 11/07/2021 17:02:40 - INFO - __main__ - Step 140598: {'lr': 4.96337280532444e-06, 'samples': 26994816, 'steps': 140597, 'loss/train': 1.32722806930542} 11/07/2021 17:02:40 - INFO - __main__ - Step 140599: {'lr': 4.9623206678587606e-06, 'samples': 26995008, 'steps': 140598, 'loss/train': 1.1872401237487793} 11/07/2021 17:02:41 - INFO - __main__ - Step 140600: {'lr': 4.9612686408032625e-06, 'samples': 26995200, 'steps': 140599, 'loss/train': 1.392848253250122} 11/07/2021 17:02:42 - INFO - __main__ - Step 140601: {'lr': 4.960216724158445e-06, 'samples': 26995392, 'steps': 140600, 'loss/train': 1.4132990837097168} 11/07/2021 17:02:42 - INFO - __main__ - Step 140602: {'lr': 4.95916491792478e-06, 'samples': 26995584, 'steps': 140601, 'loss/train': 1.4484180212020874} 11/07/2021 17:02:42 - INFO - __main__ - Step 140603: {'lr': 4.958113222102739e-06, 'samples': 26995776, 'steps': 140602, 'loss/train': 0.3276442289352417} 11/07/2021 17:02:43 - INFO - __main__ - Step 140604: {'lr': 4.957061636692767e-06, 'samples': 26995968, 'steps': 140603, 'loss/train': 0.24603895843029022} 11/07/2021 17:02:44 - INFO - __main__ - Step 140605: {'lr': 4.95601016169539e-06, 'samples': 26996160, 'steps': 140604, 'loss/train': 1.0610164403915405} 11/07/2021 17:02:44 - INFO - __main__ - Step 140606: {'lr': 4.954958797111025e-06, 'samples': 26996352, 'steps': 140605, 'loss/train': 1.7492669820785522} 11/07/2021 17:02:44 - INFO - __main__ - Step 140607: {'lr': 4.9539075429402e-06, 'samples': 26996544, 'steps': 140606, 'loss/train': 1.5423697233200073} 11/07/2021 17:02:45 - INFO - __main__ - Step 140608: {'lr': 4.9528563991833585e-06, 'samples': 26996736, 'steps': 140607, 'loss/train': 0.6173700094223022} 11/07/2021 17:02:45 - INFO - __main__ - Step 140609: {'lr': 4.951805365840972e-06, 'samples': 26996928, 'steps': 140608, 'loss/train': 0.45673584938049316} 11/07/2021 17:02:46 - INFO - __main__ - Step 140610: {'lr': 4.950754442913541e-06, 'samples': 26997120, 'steps': 140609, 'loss/train': 1.0949021577835083} 11/07/2021 17:02:47 - INFO - __main__ - Step 140611: {'lr': 4.949703630401509e-06, 'samples': 26997312, 'steps': 140610, 'loss/train': 1.4388408660888672} 11/07/2021 17:02:47 - INFO - __main__ - Step 140612: {'lr': 4.948652928305347e-06, 'samples': 26997504, 'steps': 140611, 'loss/train': 1.163126826286316} 11/07/2021 17:02:47 - INFO - __main__ - Step 140613: {'lr': 4.947602336625529e-06, 'samples': 26997696, 'steps': 140612, 'loss/train': 1.5033385753631592} 11/07/2021 17:02:48 - INFO - __main__ - Step 140614: {'lr': 4.946551855362552e-06, 'samples': 26997888, 'steps': 140613, 'loss/train': 1.4856911897659302} 11/07/2021 17:02:49 - INFO - __main__ - Step 140615: {'lr': 4.94550148451689e-06, 'samples': 26998080, 'steps': 140614, 'loss/train': 1.1632018089294434} 11/07/2021 17:02:49 - INFO - __main__ - Step 140616: {'lr': 4.944451224088986e-06, 'samples': 26998272, 'steps': 140615, 'loss/train': 1.4439128637313843} 11/07/2021 17:02:49 - INFO - __main__ - Step 140617: {'lr': 4.94340107407934e-06, 'samples': 26998464, 'steps': 140616, 'loss/train': 1.66404390335083} 11/07/2021 17:02:50 - INFO - __main__ - Step 140618: {'lr': 4.942351034488424e-06, 'samples': 26998656, 'steps': 140617, 'loss/train': 1.573129415512085} 11/07/2021 17:02:50 - INFO - __main__ - Step 140619: {'lr': 4.941301105316682e-06, 'samples': 26998848, 'steps': 140618, 'loss/train': 1.159739375114441} 11/07/2021 17:02:51 - INFO - __main__ - Step 140620: {'lr': 4.940251286564612e-06, 'samples': 26999040, 'steps': 140619, 'loss/train': 1.3830335140228271} 11/07/2021 17:02:51 - INFO - __main__ - Step 140621: {'lr': 4.939201578232716e-06, 'samples': 26999232, 'steps': 140620, 'loss/train': 0.7965039610862732} 11/07/2021 17:02:52 - INFO - __main__ - Step 140622: {'lr': 4.9381519803213815e-06, 'samples': 26999424, 'steps': 140621, 'loss/train': 1.833078145980835} 11/07/2021 17:02:52 - INFO - __main__ - Step 140623: {'lr': 4.937102492831164e-06, 'samples': 26999616, 'steps': 140622, 'loss/train': 1.3829690217971802} 11/07/2021 17:02:53 - INFO - __main__ - Step 140624: {'lr': 4.936053115762534e-06, 'samples': 26999808, 'steps': 140623, 'loss/train': 0.7336937189102173} 11/07/2021 17:02:53 - INFO - __main__ - Step 140625: {'lr': 4.93500384911591e-06, 'samples': 27000000, 'steps': 140624, 'loss/train': 1.5294981002807617} 11/07/2021 17:02:54 - INFO - __main__ - Step 140626: {'lr': 4.93395469289179e-06, 'samples': 27000192, 'steps': 140625, 'loss/train': 1.3285844326019287} 11/07/2021 17:02:54 - INFO - __main__ - Step 140627: {'lr': 4.932905647090647e-06, 'samples': 27000384, 'steps': 140626, 'loss/train': 1.3108937740325928} 11/07/2021 17:02:55 - INFO - __main__ - Step 140628: {'lr': 4.9318567117129505e-06, 'samples': 27000576, 'steps': 140627, 'loss/train': 1.545557975769043} 11/07/2021 17:02:55 - INFO - __main__ - Step 140629: {'lr': 4.930807886759176e-06, 'samples': 27000768, 'steps': 140628, 'loss/train': 1.2352306842803955} 11/07/2021 17:02:55 - INFO - __main__ - Step 140630: {'lr': 4.92975917222982e-06, 'samples': 27000960, 'steps': 140629, 'loss/train': 1.4874069690704346} 11/07/2021 17:02:56 - INFO - __main__ - Step 140631: {'lr': 4.9287105681253e-06, 'samples': 27001152, 'steps': 140630, 'loss/train': 0.9846442937850952} 11/07/2021 17:02:57 - INFO - __main__ - Step 140632: {'lr': 4.927662074446143e-06, 'samples': 27001344, 'steps': 140631, 'loss/train': 1.2551888227462769} 11/07/2021 17:02:57 - INFO - __main__ - Step 140633: {'lr': 4.926613691192794e-06, 'samples': 27001536, 'steps': 140632, 'loss/train': 0.7261644005775452} 11/07/2021 17:02:57 - INFO - __main__ - Step 140634: {'lr': 4.925565418365752e-06, 'samples': 27001728, 'steps': 140633, 'loss/train': 0.9452007412910461} 11/07/2021 17:02:58 - INFO - __main__ - Step 140635: {'lr': 4.9245172559654325e-06, 'samples': 27001920, 'steps': 140634, 'loss/train': 1.2093267440795898} 11/07/2021 17:02:59 - INFO - __main__ - Step 140636: {'lr': 4.923469203992365e-06, 'samples': 27002112, 'steps': 140635, 'loss/train': 1.0991544723510742} 11/07/2021 17:02:59 - INFO - __main__ - Step 140637: {'lr': 4.922421262447019e-06, 'samples': 27002304, 'steps': 140636, 'loss/train': 1.1000428199768066} 11/07/2021 17:03:00 - INFO - __main__ - Step 140638: {'lr': 4.921373431329812e-06, 'samples': 27002496, 'steps': 140637, 'loss/train': 1.3364495038986206} 11/07/2021 17:03:00 - INFO - __main__ - Step 140639: {'lr': 4.920325710641271e-06, 'samples': 27002688, 'steps': 140638, 'loss/train': 1.3200737237930298} 11/07/2021 17:03:00 - INFO - __main__ - Step 140640: {'lr': 4.919278100381841e-06, 'samples': 27002880, 'steps': 140639, 'loss/train': 1.2319566011428833} 11/07/2021 17:03:01 - INFO - __main__ - Step 140641: {'lr': 4.9182306005520205e-06, 'samples': 27003072, 'steps': 140640, 'loss/train': 1.5062676668167114} 11/07/2021 17:03:02 - INFO - __main__ - Step 140642: {'lr': 4.9171832111522545e-06, 'samples': 27003264, 'steps': 140641, 'loss/train': 0.9655126333236694} 11/07/2021 17:03:02 - INFO - __main__ - Step 140643: {'lr': 4.916135932183013e-06, 'samples': 27003456, 'steps': 140642, 'loss/train': 1.4374399185180664} 11/07/2021 17:03:02 - INFO - __main__ - Step 140644: {'lr': 4.9150887636447705e-06, 'samples': 27003648, 'steps': 140643, 'loss/train': 1.3723150491714478} 11/07/2021 17:03:03 - INFO - __main__ - Step 140645: {'lr': 4.914041705538025e-06, 'samples': 27003840, 'steps': 140644, 'loss/train': 1.418485164642334} 11/07/2021 17:03:04 - INFO - __main__ - Step 140646: {'lr': 4.91299475786322e-06, 'samples': 27004032, 'steps': 140645, 'loss/train': 1.5194032192230225} 11/07/2021 17:03:04 - INFO - __main__ - Step 140647: {'lr': 4.911947920620857e-06, 'samples': 27004224, 'steps': 140646, 'loss/train': 1.4427257776260376} 11/07/2021 17:03:04 - INFO - __main__ - Step 140648: {'lr': 4.910901193811351e-06, 'samples': 27004416, 'steps': 140647, 'loss/train': 1.4103615283966064} 11/07/2021 17:03:05 - INFO - __main__ - Step 140649: {'lr': 4.909854577435257e-06, 'samples': 27004608, 'steps': 140648, 'loss/train': 1.3053760528564453} 11/07/2021 17:03:05 - INFO - __main__ - Step 140650: {'lr': 4.908808071492965e-06, 'samples': 27004800, 'steps': 140649, 'loss/train': 1.549443006515503} 11/07/2021 17:03:07 - INFO - __main__ - Step 140651: {'lr': 4.907761675985029e-06, 'samples': 27004992, 'steps': 140650, 'loss/train': 0.8116984367370605} 11/07/2021 17:03:07 - INFO - __main__ - Step 140652: {'lr': 4.906715390911837e-06, 'samples': 27005184, 'steps': 140651, 'loss/train': 1.4377703666687012} 11/07/2021 17:03:07 - INFO - __main__ - Step 140653: {'lr': 4.905669216273889e-06, 'samples': 27005376, 'steps': 140652, 'loss/train': 0.21115639805793762} 11/07/2021 17:03:08 - INFO - __main__ - Step 140654: {'lr': 4.904623152071686e-06, 'samples': 27005568, 'steps': 140653, 'loss/train': 1.5700104236602783} 11/07/2021 17:03:08 - INFO - __main__ - Step 140655: {'lr': 4.90357719830567e-06, 'samples': 27005760, 'steps': 140654, 'loss/train': 0.9730591177940369} 11/07/2021 17:03:08 - INFO - __main__ - Step 140656: {'lr': 4.902531354976314e-06, 'samples': 27005952, 'steps': 140655, 'loss/train': 1.6091928482055664} 11/07/2021 17:03:10 - INFO - __main__ - Step 140657: {'lr': 4.90148562208409e-06, 'samples': 27006144, 'steps': 140656, 'loss/train': 1.3143407106399536} 11/07/2021 17:03:10 - INFO - __main__ - Step 140658: {'lr': 4.900439999629469e-06, 'samples': 27006336, 'steps': 140657, 'loss/train': 1.918505072593689} 11/07/2021 17:03:10 - INFO - __main__ - Step 140659: {'lr': 4.899394487612951e-06, 'samples': 27006528, 'steps': 140658, 'loss/train': 1.0543313026428223} 11/07/2021 17:03:11 - INFO - __main__ - Step 140660: {'lr': 4.898349086034981e-06, 'samples': 27006720, 'steps': 140659, 'loss/train': 0.975000262260437} 11/07/2021 17:03:11 - INFO - __main__ - Step 140661: {'lr': 4.89730379489603e-06, 'samples': 27006912, 'steps': 140660, 'loss/train': 1.1419135332107544} 11/07/2021 17:03:12 - INFO - __main__ - Step 140662: {'lr': 4.896258614196569e-06, 'samples': 27007104, 'steps': 140661, 'loss/train': 1.8581806421279907} 11/07/2021 17:03:12 - INFO - __main__ - Step 140663: {'lr': 4.8952135439370715e-06, 'samples': 27007296, 'steps': 140662, 'loss/train': 1.2116076946258545} 11/07/2021 17:03:13 - INFO - __main__ - Step 140664: {'lr': 4.894168584118009e-06, 'samples': 27007488, 'steps': 140663, 'loss/train': 0.8974142670631409} 11/07/2021 17:03:13 - INFO - __main__ - Step 140665: {'lr': 4.893123734739852e-06, 'samples': 27007680, 'steps': 140664, 'loss/train': 1.033457636833191} 11/07/2021 17:03:13 - INFO - __main__ - Step 140666: {'lr': 4.892078995803073e-06, 'samples': 27007872, 'steps': 140665, 'loss/train': 1.0428186655044556} 11/07/2021 17:03:14 - INFO - __main__ - Step 140667: {'lr': 4.891034367308145e-06, 'samples': 27008064, 'steps': 140666, 'loss/train': 1.2150307893753052} 11/07/2021 17:03:15 - INFO - __main__ - Step 140668: {'lr': 4.8899898492555106e-06, 'samples': 27008256, 'steps': 140667, 'loss/train': 1.1180051565170288} 11/07/2021 17:03:15 - INFO - __main__ - Step 140669: {'lr': 4.888945441645698e-06, 'samples': 27008448, 'steps': 140668, 'loss/train': 1.2039128541946411} 11/07/2021 17:03:15 - INFO - __main__ - Step 140670: {'lr': 4.8879011444791235e-06, 'samples': 27008640, 'steps': 140669, 'loss/train': 0.09924500435590744} 11/07/2021 17:03:16 - INFO - __main__ - Step 140671: {'lr': 4.886856957756286e-06, 'samples': 27008832, 'steps': 140670, 'loss/train': 1.0791950225830078} 11/07/2021 17:03:17 - INFO - __main__ - Step 140672: {'lr': 4.88581288147763e-06, 'samples': 27009024, 'steps': 140671, 'loss/train': 0.8366789221763611} 11/07/2021 17:03:17 - INFO - __main__ - Step 140673: {'lr': 4.8847689156436555e-06, 'samples': 27009216, 'steps': 140672, 'loss/train': 1.2839852571487427} 11/07/2021 17:03:18 - INFO - __main__ - Step 140674: {'lr': 4.883725060254834e-06, 'samples': 27009408, 'steps': 140673, 'loss/train': 1.4239284992218018} 11/07/2021 17:03:18 - INFO - __main__ - Step 140675: {'lr': 4.88268131531161e-06, 'samples': 27009600, 'steps': 140674, 'loss/train': 1.26962411403656} 11/07/2021 17:03:18 - INFO - __main__ - Step 140676: {'lr': 4.881637680814483e-06, 'samples': 27009792, 'steps': 140675, 'loss/train': 1.3933687210083008} 11/07/2021 17:03:20 - INFO - __main__ - Step 140677: {'lr': 4.880594156763896e-06, 'samples': 27009984, 'steps': 140676, 'loss/train': 0.6826162338256836} 11/07/2021 17:03:20 - INFO - __main__ - Step 140678: {'lr': 4.879550743160349e-06, 'samples': 27010176, 'steps': 140677, 'loss/train': 0.8977209329605103} 11/07/2021 17:03:20 - INFO - __main__ - Step 140679: {'lr': 4.8785074400042596e-06, 'samples': 27010368, 'steps': 140678, 'loss/train': 1.048097014427185} 11/07/2021 17:03:21 - INFO - __main__ - Step 140680: {'lr': 4.877464247296154e-06, 'samples': 27010560, 'steps': 140679, 'loss/train': 1.358144998550415} 11/07/2021 17:03:21 - INFO - __main__ - Step 140681: {'lr': 4.876421165036477e-06, 'samples': 27010752, 'steps': 140680, 'loss/train': 1.2760409116744995} 11/07/2021 17:03:21 - INFO - __main__ - Step 140682: {'lr': 4.8753781932256995e-06, 'samples': 27010944, 'steps': 140681, 'loss/train': 0.31509262323379517} 11/07/2021 17:03:22 - INFO - __main__ - Step 140683: {'lr': 4.874335331864293e-06, 'samples': 27011136, 'steps': 140682, 'loss/train': 0.496659517288208} 11/07/2021 17:03:23 - INFO - __main__ - Step 140684: {'lr': 4.873292580952732e-06, 'samples': 27011328, 'steps': 140683, 'loss/train': 1.8345110416412354} 11/07/2021 17:03:23 - INFO - __main__ - Step 140685: {'lr': 4.872249940491486e-06, 'samples': 27011520, 'steps': 140684, 'loss/train': 1.216846227645874} 11/07/2021 17:03:23 - INFO - __main__ - Step 140686: {'lr': 4.871207410481027e-06, 'samples': 27011712, 'steps': 140685, 'loss/train': 1.1309101581573486} 11/07/2021 17:03:24 - INFO - __main__ - Step 140687: {'lr': 4.8701649909217995e-06, 'samples': 27011904, 'steps': 140686, 'loss/train': 0.370355486869812} 11/07/2021 17:03:25 - INFO - __main__ - Step 140688: {'lr': 4.869122681814303e-06, 'samples': 27012096, 'steps': 140687, 'loss/train': 1.429862380027771} 11/07/2021 17:03:25 - INFO - __main__ - Step 140689: {'lr': 4.86808048315901e-06, 'samples': 27012288, 'steps': 140688, 'loss/train': 1.24050772190094} 11/07/2021 17:03:26 - INFO - __main__ - Step 140690: {'lr': 4.867038394956363e-06, 'samples': 27012480, 'steps': 140689, 'loss/train': 1.4547696113586426} 11/07/2021 17:03:26 - INFO - __main__ - Step 140691: {'lr': 4.865996417206864e-06, 'samples': 27012672, 'steps': 140690, 'loss/train': 1.0516480207443237} 11/07/2021 17:03:27 - INFO - __main__ - Step 140692: {'lr': 4.8649545499109545e-06, 'samples': 27012864, 'steps': 140691, 'loss/train': 1.5745768547058105} 11/07/2021 17:03:27 - INFO - __main__ - Step 140693: {'lr': 4.863912793069109e-06, 'samples': 27013056, 'steps': 140692, 'loss/train': 1.0302109718322754} 11/07/2021 17:03:28 - INFO - __main__ - Step 140694: {'lr': 4.862871146681797e-06, 'samples': 27013248, 'steps': 140693, 'loss/train': 1.2965635061264038} 11/07/2021 17:03:28 - INFO - __main__ - Step 140695: {'lr': 4.8618296107494906e-06, 'samples': 27013440, 'steps': 140694, 'loss/train': 1.5750428438186646} 11/07/2021 17:03:29 - INFO - __main__ - Step 140696: {'lr': 4.860788185272663e-06, 'samples': 27013632, 'steps': 140695, 'loss/train': 1.1481571197509766} 11/07/2021 17:03:29 - INFO - __main__ - Step 140697: {'lr': 4.859746870251786e-06, 'samples': 27013824, 'steps': 140696, 'loss/train': 1.5951666831970215} 11/07/2021 17:03:30 - INFO - __main__ - Step 140698: {'lr': 4.858705665687329e-06, 'samples': 27014016, 'steps': 140697, 'loss/train': 1.3805131912231445} 11/07/2021 17:03:30 - INFO - __main__ - Step 140699: {'lr': 4.8576645715797394e-06, 'samples': 27014208, 'steps': 140698, 'loss/train': 1.2400968074798584} 11/07/2021 17:03:31 - INFO - __main__ - Step 140700: {'lr': 4.856623587929515e-06, 'samples': 27014400, 'steps': 140699, 'loss/train': 0.8295935988426208} 11/07/2021 17:03:31 - INFO - __main__ - Step 140701: {'lr': 4.855582714737128e-06, 'samples': 27014592, 'steps': 140700, 'loss/train': 1.261768102645874} 11/07/2021 17:03:31 - INFO - __main__ - Step 140702: {'lr': 4.854541952003022e-06, 'samples': 27014784, 'steps': 140701, 'loss/train': 1.059788465499878} 11/07/2021 17:03:33 - INFO - __main__ - Step 140703: {'lr': 4.85350129972767e-06, 'samples': 27014976, 'steps': 140702, 'loss/train': 0.49114781618118286} 11/07/2021 17:03:33 - INFO - __main__ - Step 140704: {'lr': 4.85246075791157e-06, 'samples': 27015168, 'steps': 140703, 'loss/train': 1.2078641653060913} 11/07/2021 17:03:33 - INFO - __main__ - Step 140705: {'lr': 4.8514203265551395e-06, 'samples': 27015360, 'steps': 140704, 'loss/train': 0.2268139123916626} 11/07/2021 17:03:34 - INFO - __main__ - Step 140706: {'lr': 4.850380005658878e-06, 'samples': 27015552, 'steps': 140705, 'loss/train': 1.012483835220337} 11/07/2021 17:03:34 - INFO - __main__ - Step 140707: {'lr': 4.849339795223257e-06, 'samples': 27015744, 'steps': 140706, 'loss/train': 1.5348763465881348} 11/07/2021 17:03:34 - INFO - __main__ - Step 140708: {'lr': 4.848299695248748e-06, 'samples': 27015936, 'steps': 140707, 'loss/train': 1.7199651002883911} 11/07/2021 17:03:36 - INFO - __main__ - Step 140709: {'lr': 4.8472597057357955e-06, 'samples': 27016128, 'steps': 140708, 'loss/train': 1.1603137254714966} 11/07/2021 17:03:36 - INFO - __main__ - Step 140710: {'lr': 4.8462198266849e-06, 'samples': 27016320, 'steps': 140709, 'loss/train': 1.294149398803711} 11/07/2021 17:03:36 - INFO - __main__ - Step 140711: {'lr': 4.845180058096504e-06, 'samples': 27016512, 'steps': 140710, 'loss/train': 1.7225881814956665} 11/07/2021 17:03:37 - INFO - __main__ - Step 140712: {'lr': 4.8441403999711085e-06, 'samples': 27016704, 'steps': 140711, 'loss/train': 1.0562560558319092} 11/07/2021 17:03:37 - INFO - __main__ - Step 140713: {'lr': 4.843100852309157e-06, 'samples': 27016896, 'steps': 140712, 'loss/train': 1.2177830934524536} 11/07/2021 17:03:37 - INFO - __main__ - Step 140714: {'lr': 4.842061415111093e-06, 'samples': 27017088, 'steps': 140713, 'loss/train': 0.29750242829322815} 11/07/2021 17:03:38 - INFO - __main__ - Step 140715: {'lr': 4.841022088377445e-06, 'samples': 27017280, 'steps': 140714, 'loss/train': 0.974868655204773} 11/07/2021 17:03:39 - INFO - __main__ - Step 140716: {'lr': 4.839982872108628e-06, 'samples': 27017472, 'steps': 140715, 'loss/train': 0.7294365763664246} 11/07/2021 17:03:39 - INFO - __main__ - Step 140717: {'lr': 4.838943766305143e-06, 'samples': 27017664, 'steps': 140716, 'loss/train': 1.2779533863067627} 11/07/2021 17:03:39 - INFO - __main__ - Step 140718: {'lr': 4.837904770967461e-06, 'samples': 27017856, 'steps': 140717, 'loss/train': 1.5212595462799072} 11/07/2021 17:03:40 - INFO - __main__ - Step 140719: {'lr': 4.836865886095998e-06, 'samples': 27018048, 'steps': 140718, 'loss/train': 1.8451666831970215} 11/07/2021 17:03:41 - INFO - __main__ - Step 140720: {'lr': 4.835827111691282e-06, 'samples': 27018240, 'steps': 140719, 'loss/train': 0.9411516189575195} 11/07/2021 17:03:41 - INFO - __main__ - Step 140721: {'lr': 4.834788447753758e-06, 'samples': 27018432, 'steps': 140720, 'loss/train': 0.6831653714179993} 11/07/2021 17:03:42 - INFO - __main__ - Step 140722: {'lr': 4.833749894283896e-06, 'samples': 27018624, 'steps': 140721, 'loss/train': 1.340980887413025} 11/07/2021 17:03:42 - INFO - __main__ - Step 140723: {'lr': 4.832711451282168e-06, 'samples': 27018816, 'steps': 140722, 'loss/train': 0.876626193523407} 11/07/2021 17:03:42 - INFO - __main__ - Step 140724: {'lr': 4.831673118749019e-06, 'samples': 27019008, 'steps': 140723, 'loss/train': 2.194852590560913} 11/07/2021 17:03:43 - INFO - __main__ - Step 140725: {'lr': 4.830634896684949e-06, 'samples': 27019200, 'steps': 140724, 'loss/train': 1.067135214805603} 11/07/2021 17:03:44 - INFO - __main__ - Step 140726: {'lr': 4.829596785090401e-06, 'samples': 27019392, 'steps': 140725, 'loss/train': 1.434791088104248} 11/07/2021 17:03:44 - INFO - __main__ - Step 140727: {'lr': 4.828558783965875e-06, 'samples': 27019584, 'steps': 140726, 'loss/train': 1.4144794940948486} 11/07/2021 17:03:44 - INFO - __main__ - Step 140728: {'lr': 4.827520893311788e-06, 'samples': 27019776, 'steps': 140727, 'loss/train': 1.405422329902649} 11/07/2021 17:03:45 - INFO - __main__ - Step 140729: {'lr': 4.8264831131286655e-06, 'samples': 27019968, 'steps': 140728, 'loss/train': 1.0828857421875} 11/07/2021 17:03:46 - INFO - __main__ - Step 140730: {'lr': 4.825445443416954e-06, 'samples': 27020160, 'steps': 140729, 'loss/train': 1.0315442085266113} 11/07/2021 17:03:46 - INFO - __main__ - Step 140731: {'lr': 4.824407884177095e-06, 'samples': 27020352, 'steps': 140730, 'loss/train': 1.5778969526290894} 11/07/2021 17:03:46 - INFO - __main__ - Step 140732: {'lr': 4.823370435409563e-06, 'samples': 27020544, 'steps': 140731, 'loss/train': 1.0972338914871216} 11/07/2021 17:03:47 - INFO - __main__ - Step 140733: {'lr': 4.822333097114856e-06, 'samples': 27020736, 'steps': 140732, 'loss/train': 1.6933735609054565} 11/07/2021 17:03:47 - INFO - __main__ - Step 140734: {'lr': 4.82129586929339e-06, 'samples': 27020928, 'steps': 140733, 'loss/train': 0.9690759181976318} 11/07/2021 17:03:47 - INFO - __main__ - Step 140735: {'lr': 4.820258751945694e-06, 'samples': 27021120, 'steps': 140734, 'loss/train': 0.6583340764045715} 11/07/2021 17:03:48 - INFO - __main__ - Step 140736: {'lr': 4.819221745072211e-06, 'samples': 27021312, 'steps': 140735, 'loss/train': 1.3277475833892822} 11/07/2021 17:03:49 - INFO - __main__ - Step 140737: {'lr': 4.818184848673384e-06, 'samples': 27021504, 'steps': 140736, 'loss/train': 1.1092631816864014} 11/07/2021 17:03:49 - INFO - __main__ - Step 140738: {'lr': 4.817148062749716e-06, 'samples': 27021696, 'steps': 140737, 'loss/train': 1.190661072731018} 11/07/2021 17:03:50 - INFO - __main__ - Step 140739: {'lr': 4.816111387301647e-06, 'samples': 27021888, 'steps': 140738, 'loss/train': 1.4657477140426636} 11/07/2021 17:03:50 - INFO - __main__ - Step 140740: {'lr': 4.815074822329651e-06, 'samples': 27022080, 'steps': 140739, 'loss/train': 1.0882625579833984} 11/07/2021 17:03:51 - INFO - __main__ - Step 140741: {'lr': 4.814038367834228e-06, 'samples': 27022272, 'steps': 140740, 'loss/train': 1.2126715183258057} 11/07/2021 17:03:51 - INFO - __main__ - Step 140742: {'lr': 4.813002023815793e-06, 'samples': 27022464, 'steps': 140741, 'loss/train': 1.2365344762802124} 11/07/2021 17:03:52 - INFO - __main__ - Step 140743: {'lr': 4.8119657902748195e-06, 'samples': 27022656, 'steps': 140742, 'loss/train': 1.0515962839126587} 11/07/2021 17:03:52 - INFO - __main__ - Step 140744: {'lr': 4.810929667211805e-06, 'samples': 27022848, 'steps': 140743, 'loss/train': 1.3015425205230713} 11/07/2021 17:03:52 - INFO - __main__ - Step 140745: {'lr': 4.809893654627223e-06, 'samples': 27023040, 'steps': 140744, 'loss/train': 1.4506274461746216} 11/07/2021 17:03:53 - INFO - __main__ - Step 140746: {'lr': 4.808857752521489e-06, 'samples': 27023232, 'steps': 140745, 'loss/train': 1.8173776865005493} 11/07/2021 17:03:54 - INFO - __main__ - Step 140747: {'lr': 4.807821960895104e-06, 'samples': 27023424, 'steps': 140746, 'loss/train': 1.4215240478515625} 11/07/2021 17:03:54 - INFO - __main__ - Step 140748: {'lr': 4.806786279748538e-06, 'samples': 27023616, 'steps': 140747, 'loss/train': 0.8797392249107361} 11/07/2021 17:03:54 - INFO - __main__ - Step 140749: {'lr': 4.805750709082263e-06, 'samples': 27023808, 'steps': 140748, 'loss/train': 1.4242188930511475} 11/07/2021 17:03:55 - INFO - __main__ - Step 140750: {'lr': 4.8047152488967235e-06, 'samples': 27024000, 'steps': 140749, 'loss/train': 1.149765968322754} 11/07/2021 17:03:56 - INFO - __main__ - Step 140751: {'lr': 4.803679899192393e-06, 'samples': 27024192, 'steps': 140750, 'loss/train': 1.195122480392456} 11/07/2021 17:03:56 - INFO - __main__ - Step 140752: {'lr': 4.802644659969741e-06, 'samples': 27024384, 'steps': 140751, 'loss/train': 1.3720184564590454} 11/07/2021 17:03:56 - INFO - __main__ - Step 140753: {'lr': 4.80160953122924e-06, 'samples': 27024576, 'steps': 140752, 'loss/train': 0.9289634823799133} 11/07/2021 17:03:57 - INFO - __main__ - Step 140754: {'lr': 4.800574512971334e-06, 'samples': 27024768, 'steps': 140753, 'loss/train': 1.164465069770813} 11/07/2021 17:03:57 - INFO - __main__ - Step 140755: {'lr': 4.7995396051965234e-06, 'samples': 27024960, 'steps': 140754, 'loss/train': 1.4251718521118164} 11/07/2021 17:03:58 - INFO - __main__ - Step 140756: {'lr': 4.798504807905252e-06, 'samples': 27025152, 'steps': 140755, 'loss/train': 1.248702883720398} 11/07/2021 17:03:59 - INFO - __main__ - Step 140757: {'lr': 4.797470121097991e-06, 'samples': 27025344, 'steps': 140756, 'loss/train': 1.1514396667480469} 11/07/2021 17:03:59 - INFO - __main__ - Step 140758: {'lr': 4.796435544775185e-06, 'samples': 27025536, 'steps': 140757, 'loss/train': 1.202960729598999} 11/07/2021 17:03:59 - INFO - __main__ - Step 140759: {'lr': 4.795401078937334e-06, 'samples': 27025728, 'steps': 140758, 'loss/train': 1.659677505493164} 11/07/2021 17:04:00 - INFO - __main__ - Step 140760: {'lr': 4.794366723584908e-06, 'samples': 27025920, 'steps': 140759, 'loss/train': 1.4062929153442383} 11/07/2021 17:04:01 - INFO - __main__ - Step 140761: {'lr': 4.793332478718354e-06, 'samples': 27026112, 'steps': 140760, 'loss/train': 0.9828885197639465} 11/07/2021 17:04:01 - INFO - __main__ - Step 140762: {'lr': 4.792298344338142e-06, 'samples': 27026304, 'steps': 140761, 'loss/train': 1.3117629289627075} 11/07/2021 17:04:01 - INFO - __main__ - Step 140763: {'lr': 4.7912643204447155e-06, 'samples': 27026496, 'steps': 140762, 'loss/train': 1.3194314241409302} 11/07/2021 17:04:02 - INFO - __main__ - Step 140764: {'lr': 4.790230407038576e-06, 'samples': 27026688, 'steps': 140763, 'loss/train': 1.5065218210220337} 11/07/2021 17:04:02 - INFO - __main__ - Step 140765: {'lr': 4.789196604120166e-06, 'samples': 27026880, 'steps': 140764, 'loss/train': 1.089391827583313} 11/07/2021 17:04:02 - INFO - __main__ - Step 140766: {'lr': 4.788162911689986e-06, 'samples': 27027072, 'steps': 140765, 'loss/train': 1.2838006019592285} 11/07/2021 17:04:04 - INFO - __main__ - Step 140767: {'lr': 4.787129329748452e-06, 'samples': 27027264, 'steps': 140766, 'loss/train': 0.2809026539325714} 11/07/2021 17:04:04 - INFO - __main__ - Step 140768: {'lr': 4.786095858296035e-06, 'samples': 27027456, 'steps': 140767, 'loss/train': 1.4023228883743286} 11/07/2021 17:04:04 - INFO - __main__ - Step 140769: {'lr': 4.785062497333264e-06, 'samples': 27027648, 'steps': 140768, 'loss/train': 1.5142303705215454} 11/07/2021 17:04:05 - INFO - __main__ - Step 140770: {'lr': 4.784029246860528e-06, 'samples': 27027840, 'steps': 140769, 'loss/train': 0.8873171806335449} 11/07/2021 17:04:05 - INFO - __main__ - Step 140771: {'lr': 4.782996106878323e-06, 'samples': 27028032, 'steps': 140770, 'loss/train': 1.3799818754196167} 11/07/2021 17:04:06 - INFO - __main__ - Step 140772: {'lr': 4.7819630773871245e-06, 'samples': 27028224, 'steps': 140771, 'loss/train': 1.1616029739379883} 11/07/2021 17:04:06 - INFO - __main__ - Step 140773: {'lr': 4.780930158387431e-06, 'samples': 27028416, 'steps': 140772, 'loss/train': 1.381282925605774} 11/07/2021 17:04:07 - INFO - __main__ - Step 140774: {'lr': 4.779897349879602e-06, 'samples': 27028608, 'steps': 140773, 'loss/train': 1.2988053560256958} 11/07/2021 17:04:07 - INFO - __main__ - Step 140775: {'lr': 4.778864651864195e-06, 'samples': 27028800, 'steps': 140774, 'loss/train': 1.1081119775772095} 11/07/2021 17:04:07 - INFO - __main__ - Step 140776: {'lr': 4.777832064341653e-06, 'samples': 27028992, 'steps': 140775, 'loss/train': 0.2427780032157898} 11/07/2021 17:04:08 - INFO - __main__ - Step 140777: {'lr': 4.77679958731242e-06, 'samples': 27029184, 'steps': 140776, 'loss/train': 1.2604572772979736} 11/07/2021 17:04:09 - INFO - __main__ - Step 140778: {'lr': 4.7757672207769945e-06, 'samples': 27029376, 'steps': 140777, 'loss/train': 1.3372327089309692} 11/07/2021 17:04:09 - INFO - __main__ - Step 140779: {'lr': 4.774734964735794e-06, 'samples': 27029568, 'steps': 140778, 'loss/train': 1.085033893585205} 11/07/2021 17:04:10 - INFO - __main__ - Step 140780: {'lr': 4.7737028191893465e-06, 'samples': 27029760, 'steps': 140779, 'loss/train': 1.30923593044281} 11/07/2021 17:04:10 - INFO - __main__ - Step 140781: {'lr': 4.772670784138067e-06, 'samples': 27029952, 'steps': 140780, 'loss/train': 1.2771391868591309} 11/07/2021 17:04:11 - INFO - __main__ - Step 140782: {'lr': 4.771638859582455e-06, 'samples': 27030144, 'steps': 140781, 'loss/train': 1.2970482110977173} 11/07/2021 17:04:11 - INFO - __main__ - Step 140783: {'lr': 4.770607045522929e-06, 'samples': 27030336, 'steps': 140782, 'loss/train': 1.4451395273208618} 11/07/2021 17:04:12 - INFO - __main__ - Step 140784: {'lr': 4.769575341960014e-06, 'samples': 27030528, 'steps': 140783, 'loss/train': 0.4957500994205475} 11/07/2021 17:04:12 - INFO - __main__ - Step 140785: {'lr': 4.768543748894155e-06, 'samples': 27030720, 'steps': 140784, 'loss/train': 1.915226936340332} 11/07/2021 17:04:12 - INFO - __main__ - Step 140786: {'lr': 4.767512266325769e-06, 'samples': 27030912, 'steps': 140785, 'loss/train': 1.1349362134933472} 11/07/2021 17:04:13 - INFO - __main__ - Step 140787: {'lr': 4.766480894255382e-06, 'samples': 27031104, 'steps': 140786, 'loss/train': 1.2151939868927002} 11/07/2021 17:04:14 - INFO - __main__ - Step 140788: {'lr': 4.7654496326834105e-06, 'samples': 27031296, 'steps': 140787, 'loss/train': 1.4436813592910767} 11/07/2021 17:04:14 - INFO - __main__ - Step 140789: {'lr': 4.764418481610355e-06, 'samples': 27031488, 'steps': 140788, 'loss/train': 1.7361228466033936} 11/07/2021 17:04:14 - INFO - __main__ - Step 140790: {'lr': 4.763387441036687e-06, 'samples': 27031680, 'steps': 140789, 'loss/train': 1.165564775466919} 11/07/2021 17:04:15 - INFO - __main__ - Step 140791: {'lr': 4.762356510962823e-06, 'samples': 27031872, 'steps': 140790, 'loss/train': 1.675992727279663} 11/07/2021 17:04:15 - INFO - __main__ - Step 140792: {'lr': 4.761325691389262e-06, 'samples': 27032064, 'steps': 140791, 'loss/train': 1.210250735282898} 11/07/2021 17:04:16 - INFO - __main__ - Step 140793: {'lr': 4.760294982316477e-06, 'samples': 27032256, 'steps': 140792, 'loss/train': 1.29947829246521} 11/07/2021 17:04:16 - INFO - __main__ - Step 140794: {'lr': 4.75926438374491e-06, 'samples': 27032448, 'steps': 140793, 'loss/train': 1.3991553783416748} 11/07/2021 17:04:17 - INFO - __main__ - Step 140795: {'lr': 4.758233895675035e-06, 'samples': 27032640, 'steps': 140794, 'loss/train': 1.5561333894729614} 11/07/2021 17:04:17 - INFO - __main__ - Step 140796: {'lr': 4.757203518107323e-06, 'samples': 27032832, 'steps': 140795, 'loss/train': 1.3440746068954468} 11/07/2021 17:04:17 - INFO - __main__ - Step 140797: {'lr': 4.756173251042217e-06, 'samples': 27033024, 'steps': 140796, 'loss/train': 1.2044326066970825} 11/07/2021 17:04:19 - INFO - __main__ - Step 140798: {'lr': 4.755143094480191e-06, 'samples': 27033216, 'steps': 140797, 'loss/train': 1.113837718963623} 11/07/2021 17:04:19 - INFO - __main__ - Step 140799: {'lr': 4.7541130484217435e-06, 'samples': 27033408, 'steps': 140798, 'loss/train': 1.2629200220108032} 11/07/2021 17:04:19 - INFO - __main__ - Step 140800: {'lr': 4.753083112867291e-06, 'samples': 27033600, 'steps': 140799, 'loss/train': 0.6073750853538513} 11/07/2021 17:04:20 - INFO - __main__ - Step 140801: {'lr': 4.752053287817332e-06, 'samples': 27033792, 'steps': 140800, 'loss/train': 1.0921456813812256} 11/07/2021 17:04:20 - INFO - __main__ - Step 140802: {'lr': 4.751023573272284e-06, 'samples': 27033984, 'steps': 140801, 'loss/train': 1.0635772943496704} 11/07/2021 17:04:21 - INFO - __main__ - Step 140803: {'lr': 4.749993969232647e-06, 'samples': 27034176, 'steps': 140802, 'loss/train': 1.571060299873352} 11/07/2021 17:04:21 - INFO - __main__ - Step 140804: {'lr': 4.748964475698892e-06, 'samples': 27034368, 'steps': 140803, 'loss/train': 1.492427110671997} 11/07/2021 17:04:22 - INFO - __main__ - Step 140805: {'lr': 4.747935092671435e-06, 'samples': 27034560, 'steps': 140804, 'loss/train': 1.4294623136520386} 11/07/2021 17:04:22 - INFO - __main__ - Step 140806: {'lr': 4.746905820150804e-06, 'samples': 27034752, 'steps': 140805, 'loss/train': 1.1356275081634521} 11/07/2021 17:04:22 - INFO - __main__ - Step 140807: {'lr': 4.745876658137443e-06, 'samples': 27034944, 'steps': 140806, 'loss/train': 1.3257509469985962} 11/07/2021 17:04:23 - INFO - __main__ - Step 140808: {'lr': 4.744847606631769e-06, 'samples': 27035136, 'steps': 140807, 'loss/train': 1.0705536603927612} 11/07/2021 17:04:24 - INFO - __main__ - Step 140809: {'lr': 4.743818665634309e-06, 'samples': 27035328, 'steps': 140808, 'loss/train': 1.6042720079421997} 11/07/2021 17:04:24 - INFO - __main__ - Step 140810: {'lr': 4.742789835145506e-06, 'samples': 27035520, 'steps': 140809, 'loss/train': 1.3092471361160278} 11/07/2021 17:04:25 - INFO - __main__ - Step 140811: {'lr': 4.741761115165805e-06, 'samples': 27035712, 'steps': 140810, 'loss/train': 1.402199625968933} 11/07/2021 17:04:25 - INFO - __main__ - Step 140812: {'lr': 4.740732505695677e-06, 'samples': 27035904, 'steps': 140811, 'loss/train': 1.640988826751709} 11/07/2021 17:04:26 - INFO - __main__ - Step 140813: {'lr': 4.739704006735596e-06, 'samples': 27036096, 'steps': 140812, 'loss/train': 1.3080891370773315} 11/07/2021 17:04:26 - INFO - __main__ - Step 140814: {'lr': 4.7386756182860315e-06, 'samples': 27036288, 'steps': 140813, 'loss/train': 0.9664053320884705} 11/07/2021 17:04:27 - INFO - __main__ - Step 140815: {'lr': 4.737647340347429e-06, 'samples': 27036480, 'steps': 140814, 'loss/train': 1.4346280097961426} 11/07/2021 17:04:27 - INFO - __main__ - Step 140816: {'lr': 4.736619172920231e-06, 'samples': 27036672, 'steps': 140815, 'loss/train': 0.15575110912322998} 11/07/2021 17:04:27 - INFO - __main__ - Step 140817: {'lr': 4.7355911160049666e-06, 'samples': 27036864, 'steps': 140816, 'loss/train': 1.0067697763442993} 11/07/2021 17:04:28 - INFO - __main__ - Step 140818: {'lr': 4.7345631696020245e-06, 'samples': 27037056, 'steps': 140817, 'loss/train': 1.0859376192092896} 11/07/2021 17:04:29 - INFO - __main__ - Step 140819: {'lr': 4.73353533371193e-06, 'samples': 27037248, 'steps': 140818, 'loss/train': 1.0488266944885254} 11/07/2021 17:04:29 - INFO - __main__ - Step 140820: {'lr': 4.732507608335101e-06, 'samples': 27037440, 'steps': 140819, 'loss/train': 1.128475546836853} 11/07/2021 17:04:30 - INFO - __main__ - Step 140821: {'lr': 4.731479993472038e-06, 'samples': 27037632, 'steps': 140820, 'loss/train': 1.3156731128692627} 11/07/2021 17:04:30 - INFO - __main__ - Step 140822: {'lr': 4.730452489123183e-06, 'samples': 27037824, 'steps': 140821, 'loss/train': 1.621568202972412} 11/07/2021 17:04:30 - INFO - __main__ - Step 140823: {'lr': 4.729425095288981e-06, 'samples': 27038016, 'steps': 140822, 'loss/train': 1.5943148136138916} 11/07/2021 17:04:31 - INFO - __main__ - Step 140824: {'lr': 4.728397811969931e-06, 'samples': 27038208, 'steps': 140823, 'loss/train': 1.4209877252578735} 11/07/2021 17:04:32 - INFO - __main__ - Step 140825: {'lr': 4.727370639166506e-06, 'samples': 27038400, 'steps': 140824, 'loss/train': 1.0326322317123413} 11/07/2021 17:04:32 - INFO - __main__ - Step 140826: {'lr': 4.726343576879122e-06, 'samples': 27038592, 'steps': 140825, 'loss/train': 1.2664295434951782} 11/07/2021 17:04:32 - INFO - __main__ - Step 140827: {'lr': 4.72531662510825e-06, 'samples': 27038784, 'steps': 140826, 'loss/train': 1.029078483581543} 11/07/2021 17:04:33 - INFO - __main__ - Step 140828: {'lr': 4.724289783854363e-06, 'samples': 27038976, 'steps': 140827, 'loss/train': 1.0480762720108032} 11/07/2021 17:04:33 - INFO - __main__ - Step 140829: {'lr': 4.723263053117932e-06, 'samples': 27039168, 'steps': 140828, 'loss/train': 1.4402574300765991} 11/07/2021 17:04:34 - INFO - __main__ - Step 140830: {'lr': 4.722236432899429e-06, 'samples': 27039360, 'steps': 140829, 'loss/train': 1.2614870071411133} 11/07/2021 17:04:35 - INFO - __main__ - Step 140831: {'lr': 4.721209923199271e-06, 'samples': 27039552, 'steps': 140830, 'loss/train': 0.7553324103355408} 11/07/2021 17:04:35 - INFO - __main__ - Step 140832: {'lr': 4.720183524017984e-06, 'samples': 27039744, 'steps': 140831, 'loss/train': 1.2416199445724487} 11/07/2021 17:04:35 - INFO - __main__ - Step 140833: {'lr': 4.7191572353559584e-06, 'samples': 27039936, 'steps': 140832, 'loss/train': 1.1929739713668823} 11/07/2021 17:04:36 - INFO - __main__ - Step 140834: {'lr': 4.71813105721372e-06, 'samples': 27040128, 'steps': 140833, 'loss/train': 1.0176290273666382} 11/07/2021 17:04:37 - INFO - __main__ - Step 140835: {'lr': 4.717104989591714e-06, 'samples': 27040320, 'steps': 140834, 'loss/train': 1.5620936155319214} 11/07/2021 17:04:37 - INFO - __main__ - Step 140836: {'lr': 4.716079032490384e-06, 'samples': 27040512, 'steps': 140835, 'loss/train': 1.1794811487197876} 11/07/2021 17:04:37 - INFO - __main__ - Step 140837: {'lr': 4.715053185910201e-06, 'samples': 27040704, 'steps': 140836, 'loss/train': 1.6337294578552246} 11/07/2021 17:04:38 - INFO - __main__ - Step 140838: {'lr': 4.7140274498516375e-06, 'samples': 27040896, 'steps': 140837, 'loss/train': 1.422528624534607} 11/07/2021 17:04:38 - INFO - __main__ - Step 140839: {'lr': 4.713001824315166e-06, 'samples': 27041088, 'steps': 140838, 'loss/train': 1.6937466859817505} 11/07/2021 17:04:39 - INFO - __main__ - Step 140840: {'lr': 4.711976309301231e-06, 'samples': 27041280, 'steps': 140839, 'loss/train': 0.34019288420677185} 11/07/2021 17:04:39 - INFO - __main__ - Step 140841: {'lr': 4.7109509048102735e-06, 'samples': 27041472, 'steps': 140840, 'loss/train': 1.1898478269577026} 11/07/2021 17:04:40 - INFO - __main__ - Step 140842: {'lr': 4.709925610842769e-06, 'samples': 27041664, 'steps': 140841, 'loss/train': 1.3582029342651367} 11/07/2021 17:04:40 - INFO - __main__ - Step 140843: {'lr': 4.708900427399188e-06, 'samples': 27041856, 'steps': 140842, 'loss/train': 1.421828031539917} 11/07/2021 17:04:41 - INFO - __main__ - Step 140844: {'lr': 4.707875354480001e-06, 'samples': 27042048, 'steps': 140843, 'loss/train': 0.6795441508293152} 11/07/2021 17:04:41 - INFO - __main__ - Step 140845: {'lr': 4.706850392085682e-06, 'samples': 27042240, 'steps': 140844, 'loss/train': 1.3019459247589111} 11/07/2021 17:04:42 - INFO - __main__ - Step 140846: {'lr': 4.705825540216646e-06, 'samples': 27042432, 'steps': 140845, 'loss/train': 1.278032660484314} 11/07/2021 17:04:42 - INFO - __main__ - Step 140847: {'lr': 4.7048007988733655e-06, 'samples': 27042624, 'steps': 140846, 'loss/train': 1.5296977758407593} 11/07/2021 17:04:43 - INFO - __main__ - Step 140848: {'lr': 4.703776168056339e-06, 'samples': 27042816, 'steps': 140847, 'loss/train': 1.2466627359390259} 11/07/2021 17:04:43 - INFO - __main__ - Step 140849: {'lr': 4.702751647765985e-06, 'samples': 27043008, 'steps': 140848, 'loss/train': 1.4317413568496704} 11/07/2021 17:04:44 - INFO - __main__ - Step 140850: {'lr': 4.701727238002801e-06, 'samples': 27043200, 'steps': 140849, 'loss/train': 1.5671731233596802} 11/07/2021 17:04:44 - INFO - __main__ - Step 140851: {'lr': 4.700702938767259e-06, 'samples': 27043392, 'steps': 140850, 'loss/train': 1.4331867694854736} 11/07/2021 17:04:45 - INFO - __main__ - Step 140852: {'lr': 4.699678750059777e-06, 'samples': 27043584, 'steps': 140851, 'loss/train': 1.3044830560684204} 11/07/2021 17:04:45 - INFO - __main__ - Step 140853: {'lr': 4.698654671880825e-06, 'samples': 27043776, 'steps': 140852, 'loss/train': 1.7229357957839966} 11/07/2021 17:04:45 - INFO - __main__ - Step 140854: {'lr': 4.697630704230877e-06, 'samples': 27043968, 'steps': 140853, 'loss/train': 1.3740543127059937} 11/07/2021 17:04:46 - INFO - __main__ - Step 140855: {'lr': 4.6966068471104014e-06, 'samples': 27044160, 'steps': 140854, 'loss/train': 1.1539697647094727} 11/07/2021 17:04:47 - INFO - __main__ - Step 140856: {'lr': 4.695583100519818e-06, 'samples': 27044352, 'steps': 140855, 'loss/train': 0.8779641389846802} 11/07/2021 17:04:47 - INFO - __main__ - Step 140857: {'lr': 4.694559464459652e-06, 'samples': 27044544, 'steps': 140856, 'loss/train': 1.2386926412582397} 11/07/2021 17:04:48 - INFO - __main__ - Step 140858: {'lr': 4.6935359389303214e-06, 'samples': 27044736, 'steps': 140857, 'loss/train': 1.6513245105743408} 11/07/2021 17:04:48 - INFO - __main__ - Step 140859: {'lr': 4.692512523932296e-06, 'samples': 27044928, 'steps': 140858, 'loss/train': 1.40189528465271} 11/07/2021 17:04:48 - INFO - __main__ - Step 140860: {'lr': 4.69148921946605e-06, 'samples': 27045120, 'steps': 140859, 'loss/train': 1.1449748277664185} 11/07/2021 17:04:50 - INFO - __main__ - Step 140861: {'lr': 4.690466025531998e-06, 'samples': 27045312, 'steps': 140860, 'loss/train': 1.2714555263519287} 11/07/2021 17:04:50 - INFO - __main__ - Step 140862: {'lr': 4.689442942130667e-06, 'samples': 27045504, 'steps': 140861, 'loss/train': 1.6357054710388184} 11/07/2021 17:04:50 - INFO - __main__ - Step 140863: {'lr': 4.688419969262503e-06, 'samples': 27045696, 'steps': 140862, 'loss/train': 1.5264201164245605} 11/07/2021 17:04:51 - INFO - __main__ - Step 140864: {'lr': 4.687397106927921e-06, 'samples': 27045888, 'steps': 140863, 'loss/train': 1.281237244606018} 11/07/2021 17:04:51 - INFO - __main__ - Step 140865: {'lr': 4.686374355127421e-06, 'samples': 27046080, 'steps': 140864, 'loss/train': 1.705886960029602} 11/07/2021 17:04:52 - INFO - __main__ - Step 140866: {'lr': 4.6853517138614746e-06, 'samples': 27046272, 'steps': 140865, 'loss/train': 1.417914628982544} 11/07/2021 17:04:52 - INFO - __main__ - Step 140867: {'lr': 4.684329183130498e-06, 'samples': 27046464, 'steps': 140866, 'loss/train': 1.2000364065170288} 11/07/2021 17:04:53 - INFO - __main__ - Step 140868: {'lr': 4.683306762934991e-06, 'samples': 27046656, 'steps': 140867, 'loss/train': 1.3921784162521362} 11/07/2021 17:04:53 - INFO - __main__ - Step 140869: {'lr': 4.682284453275399e-06, 'samples': 27046848, 'steps': 140868, 'loss/train': 0.9406264424324036} 11/07/2021 17:04:53 - INFO - __main__ - Step 140870: {'lr': 4.681262254152191e-06, 'samples': 27047040, 'steps': 140869, 'loss/train': 0.3148778975009918} 11/07/2021 17:04:54 - INFO - __main__ - Step 140871: {'lr': 4.680240165565785e-06, 'samples': 27047232, 'steps': 140870, 'loss/train': 1.2148298025131226} 11/07/2021 17:04:55 - INFO - __main__ - Step 140872: {'lr': 4.67921818751671e-06, 'samples': 27047424, 'steps': 140871, 'loss/train': 1.332646369934082} 11/07/2021 17:04:55 - INFO - __main__ - Step 140873: {'lr': 4.678196320005379e-06, 'samples': 27047616, 'steps': 140872, 'loss/train': 1.0996677875518799} 11/07/2021 17:04:56 - INFO - __main__ - Step 140874: {'lr': 4.677174563032294e-06, 'samples': 27047808, 'steps': 140873, 'loss/train': 1.3835179805755615} 11/07/2021 17:04:56 - INFO - __main__ - Step 140875: {'lr': 4.67615291659787e-06, 'samples': 27048000, 'steps': 140874, 'loss/train': 0.9928702712059021} 11/07/2021 17:04:56 - INFO - __main__ - Step 140876: {'lr': 4.675131380702579e-06, 'samples': 27048192, 'steps': 140875, 'loss/train': 1.3619004487991333} 11/07/2021 17:04:57 - INFO - __main__ - Step 140877: {'lr': 4.674109955346894e-06, 'samples': 27048384, 'steps': 140876, 'loss/train': 0.6701020002365112} 11/07/2021 17:04:58 - INFO - __main__ - Step 140878: {'lr': 4.673088640531259e-06, 'samples': 27048576, 'steps': 140877, 'loss/train': 0.9533061981201172} 11/07/2021 17:04:58 - INFO - __main__ - Step 140879: {'lr': 4.672067436256172e-06, 'samples': 27048768, 'steps': 140878, 'loss/train': 1.0153714418411255} 11/07/2021 17:04:59 - INFO - __main__ - Step 140880: {'lr': 4.6710463425220506e-06, 'samples': 27048960, 'steps': 140879, 'loss/train': 0.9492987990379333} 11/07/2021 17:04:59 - INFO - __main__ - Step 140881: {'lr': 4.670025359329367e-06, 'samples': 27049152, 'steps': 140880, 'loss/train': 1.6536588668823242} 11/07/2021 17:05:00 - INFO - __main__ - Step 140882: {'lr': 4.669004486678591e-06, 'samples': 27049344, 'steps': 140881, 'loss/train': 0.8762497305870056} 11/07/2021 17:05:00 - INFO - __main__ - Step 140883: {'lr': 4.667983724570168e-06, 'samples': 27049536, 'steps': 140882, 'loss/train': 0.9816457033157349} 11/07/2021 17:05:01 - INFO - __main__ - Step 140884: {'lr': 4.666963073004571e-06, 'samples': 27049728, 'steps': 140883, 'loss/train': 1.386509895324707} 11/07/2021 17:05:01 - INFO - __main__ - Step 140885: {'lr': 4.665942531982242e-06, 'samples': 27049920, 'steps': 140884, 'loss/train': 1.6230487823486328} 11/07/2021 17:05:01 - INFO - __main__ - Step 140886: {'lr': 4.664922101503683e-06, 'samples': 27050112, 'steps': 140885, 'loss/train': 1.2410386800765991} 11/07/2021 17:05:02 - INFO - __main__ - Step 140887: {'lr': 4.66390178156928e-06, 'samples': 27050304, 'steps': 140886, 'loss/train': 1.5470161437988281} 11/07/2021 17:05:03 - INFO - __main__ - Step 140888: {'lr': 4.662881572179561e-06, 'samples': 27050496, 'steps': 140887, 'loss/train': 1.0498720407485962} 11/07/2021 17:05:03 - INFO - __main__ - Step 140889: {'lr': 4.6618614733349716e-06, 'samples': 27050688, 'steps': 140888, 'loss/train': 1.413394570350647} 11/07/2021 17:05:03 - INFO - __main__ - Step 140890: {'lr': 4.660841485035955e-06, 'samples': 27050880, 'steps': 140889, 'loss/train': 1.39169180393219} 11/07/2021 17:05:04 - INFO - __main__ - Step 140891: {'lr': 4.659821607282983e-06, 'samples': 27051072, 'steps': 140890, 'loss/train': 1.3127058744430542} 11/07/2021 17:05:05 - INFO - __main__ - Step 140892: {'lr': 4.658801840076499e-06, 'samples': 27051264, 'steps': 140891, 'loss/train': 1.2186472415924072} 11/07/2021 17:05:05 - INFO - __main__ - Step 140893: {'lr': 4.657782183416976e-06, 'samples': 27051456, 'steps': 140892, 'loss/train': 0.944646418094635} 11/07/2021 17:05:06 - INFO - __main__ - Step 140894: {'lr': 4.656762637304884e-06, 'samples': 27051648, 'steps': 140893, 'loss/train': 1.0695083141326904} 11/07/2021 17:05:06 - INFO - __main__ - Step 140895: {'lr': 4.655743201740642e-06, 'samples': 27051840, 'steps': 140894, 'loss/train': 1.1238828897476196} 11/07/2021 17:05:06 - INFO - __main__ - Step 140896: {'lr': 4.654723876724748e-06, 'samples': 27052032, 'steps': 140895, 'loss/train': 1.5308985710144043} 11/07/2021 17:05:07 - INFO - __main__ - Step 140897: {'lr': 4.6537046622576735e-06, 'samples': 27052224, 'steps': 140896, 'loss/train': 1.385218858718872} 11/07/2021 17:05:08 - INFO - __main__ - Step 140898: {'lr': 4.652685558339808e-06, 'samples': 27052416, 'steps': 140897, 'loss/train': 1.22709059715271} 11/07/2021 17:05:08 - INFO - __main__ - Step 140899: {'lr': 4.651666564971679e-06, 'samples': 27052608, 'steps': 140898, 'loss/train': 1.2782058715820312} 11/07/2021 17:05:08 - INFO - __main__ - Step 140900: {'lr': 4.6506476821537305e-06, 'samples': 27052800, 'steps': 140899, 'loss/train': 0.8778427243232727} 11/07/2021 17:05:09 - INFO - __main__ - Step 140901: {'lr': 4.649628909886406e-06, 'samples': 27052992, 'steps': 140900, 'loss/train': 0.973590075969696} 11/07/2021 17:05:09 - INFO - __main__ - Step 140902: {'lr': 4.648610248170176e-06, 'samples': 27053184, 'steps': 140901, 'loss/train': 1.4857808351516724} 11/07/2021 17:05:10 - INFO - __main__ - Step 140903: {'lr': 4.647591697005488e-06, 'samples': 27053376, 'steps': 140902, 'loss/train': 1.5392769575119019} 11/07/2021 17:05:10 - INFO - __main__ - Step 140904: {'lr': 4.646573256392811e-06, 'samples': 27053568, 'steps': 140903, 'loss/train': 0.5324598550796509} 11/07/2021 17:05:11 - INFO - __main__ - Step 140905: {'lr': 4.645554926332618e-06, 'samples': 27053760, 'steps': 140904, 'loss/train': 0.817613959312439} 11/07/2021 17:05:11 - INFO - __main__ - Step 140906: {'lr': 4.644536706825353e-06, 'samples': 27053952, 'steps': 140905, 'loss/train': 1.6522880792617798} 11/07/2021 17:05:11 - INFO - __main__ - Step 140907: {'lr': 4.64351859787146e-06, 'samples': 27054144, 'steps': 140906, 'loss/train': 1.0192415714263916} 11/07/2021 17:05:13 - INFO - __main__ - Step 140908: {'lr': 4.642500599471411e-06, 'samples': 27054336, 'steps': 140907, 'loss/train': 1.3330063819885254} 11/07/2021 17:05:13 - INFO - __main__ - Step 140909: {'lr': 4.641482711625678e-06, 'samples': 27054528, 'steps': 140908, 'loss/train': 1.5675325393676758} 11/07/2021 17:05:13 - INFO - __main__ - Step 140910: {'lr': 4.640464934334704e-06, 'samples': 27054720, 'steps': 140909, 'loss/train': 1.4719839096069336} 11/07/2021 17:05:14 - INFO - __main__ - Step 140911: {'lr': 4.639447267598934e-06, 'samples': 27054912, 'steps': 140910, 'loss/train': 1.412675142288208} 11/07/2021 17:05:14 - INFO - __main__ - Step 140912: {'lr': 4.63842971141884e-06, 'samples': 27055104, 'steps': 140911, 'loss/train': 1.4757355451583862} 11/07/2021 17:05:15 - INFO - __main__ - Step 140913: {'lr': 4.637412265794894e-06, 'samples': 27055296, 'steps': 140912, 'loss/train': 1.2541369199752808} 11/07/2021 17:05:15 - INFO - __main__ - Step 140914: {'lr': 4.636394930727539e-06, 'samples': 27055488, 'steps': 140913, 'loss/train': 1.4408681392669678} 11/07/2021 17:05:16 - INFO - __main__ - Step 140915: {'lr': 4.635377706217248e-06, 'samples': 27055680, 'steps': 140914, 'loss/train': 1.446502447128296} 11/07/2021 17:05:16 - INFO - __main__ - Step 140916: {'lr': 4.634360592264463e-06, 'samples': 27055872, 'steps': 140915, 'loss/train': 1.486100435256958} 11/07/2021 17:05:16 - INFO - __main__ - Step 140917: {'lr': 4.633343588869659e-06, 'samples': 27056064, 'steps': 140916, 'loss/train': 1.7967495918273926} 11/07/2021 17:05:18 - INFO - __main__ - Step 140918: {'lr': 4.632326696033279e-06, 'samples': 27056256, 'steps': 140917, 'loss/train': 1.2096580266952515} 11/07/2021 17:05:18 - INFO - __main__ - Step 140919: {'lr': 4.631309913755766e-06, 'samples': 27056448, 'steps': 140918, 'loss/train': 1.3758848905563354} 11/07/2021 17:05:18 - INFO - __main__ - Step 140920: {'lr': 4.6302932420376474e-06, 'samples': 27056640, 'steps': 140919, 'loss/train': 1.1795941591262817} 11/07/2021 17:05:19 - INFO - __main__ - Step 140921: {'lr': 4.629276680879285e-06, 'samples': 27056832, 'steps': 140920, 'loss/train': 0.6866194605827332} 11/07/2021 17:05:19 - INFO - __main__ - Step 140922: {'lr': 4.628260230281206e-06, 'samples': 27057024, 'steps': 140921, 'loss/train': 1.2821764945983887} 11/07/2021 17:05:19 - INFO - __main__ - Step 140923: {'lr': 4.627243890243854e-06, 'samples': 27057216, 'steps': 140922, 'loss/train': 1.5800294876098633} 11/07/2021 17:05:20 - INFO - __main__ - Step 140924: {'lr': 4.6262276607676454e-06, 'samples': 27057408, 'steps': 140923, 'loss/train': 0.6979626417160034} 11/07/2021 17:05:21 - INFO - __main__ - Step 140925: {'lr': 4.625211541853108e-06, 'samples': 27057600, 'steps': 140924, 'loss/train': 1.7620065212249756} 11/07/2021 17:05:21 - INFO - __main__ - Step 140926: {'lr': 4.624195533500658e-06, 'samples': 27057792, 'steps': 140925, 'loss/train': 1.1953675746917725} 11/07/2021 17:05:21 - INFO - __main__ - Step 140927: {'lr': 4.623179635710739e-06, 'samples': 27057984, 'steps': 140926, 'loss/train': 1.371856927871704} 11/07/2021 17:05:22 - INFO - __main__ - Step 140928: {'lr': 4.622163848483823e-06, 'samples': 27058176, 'steps': 140927, 'loss/train': 1.9375567436218262} 11/07/2021 17:05:23 - INFO - __main__ - Step 140929: {'lr': 4.621148171820411e-06, 'samples': 27058368, 'steps': 140928, 'loss/train': 1.5336967706680298} 11/07/2021 17:05:23 - INFO - __main__ - Step 140930: {'lr': 4.620132605720889e-06, 'samples': 27058560, 'steps': 140929, 'loss/train': 1.206673502922058} 11/07/2021 17:05:24 - INFO - __main__ - Step 140931: {'lr': 4.619117150185759e-06, 'samples': 27058752, 'steps': 140930, 'loss/train': 0.5471276640892029} 11/07/2021 17:05:24 - INFO - __main__ - Step 140932: {'lr': 4.618101805215491e-06, 'samples': 27058944, 'steps': 140931, 'loss/train': 1.2919220924377441} 11/07/2021 17:05:24 - INFO - __main__ - Step 140933: {'lr': 4.6170865708105025e-06, 'samples': 27059136, 'steps': 140932, 'loss/train': 1.523440957069397} 11/07/2021 17:05:25 - INFO - __main__ - Step 140934: {'lr': 4.616071446971265e-06, 'samples': 27059328, 'steps': 140933, 'loss/train': 1.5816043615341187} 11/07/2021 17:05:26 - INFO - __main__ - Step 140935: {'lr': 4.615056433698251e-06, 'samples': 27059520, 'steps': 140934, 'loss/train': 1.3262484073638916} 11/07/2021 17:05:26 - INFO - __main__ - Step 140936: {'lr': 4.6140415309919026e-06, 'samples': 27059712, 'steps': 140935, 'loss/train': 1.2444671392440796} 11/07/2021 17:05:26 - INFO - __main__ - Step 140937: {'lr': 4.613026738852666e-06, 'samples': 27059904, 'steps': 140936, 'loss/train': 1.2929285764694214} 11/07/2021 17:05:27 - INFO - __main__ - Step 140938: {'lr': 4.612012057281012e-06, 'samples': 27060096, 'steps': 140937, 'loss/train': 1.2075746059417725} 11/07/2021 17:05:28 - INFO - __main__ - Step 140939: {'lr': 4.610997486277413e-06, 'samples': 27060288, 'steps': 140938, 'loss/train': 1.809948205947876} 11/07/2021 17:05:28 - INFO - __main__ - Step 140940: {'lr': 4.609983025842313e-06, 'samples': 27060480, 'steps': 140939, 'loss/train': 1.0811411142349243} 11/07/2021 17:05:28 - INFO - __main__ - Step 140941: {'lr': 4.608968675976155e-06, 'samples': 27060672, 'steps': 140940, 'loss/train': 1.352412462234497} 11/07/2021 17:05:29 - INFO - __main__ - Step 140942: {'lr': 4.607954436679412e-06, 'samples': 27060864, 'steps': 140941, 'loss/train': 1.3179386854171753} 11/07/2021 17:05:29 - INFO - __main__ - Step 140943: {'lr': 4.606940307952529e-06, 'samples': 27061056, 'steps': 140942, 'loss/train': 1.0843126773834229} 11/07/2021 17:05:30 - INFO - __main__ - Step 140944: {'lr': 4.605926289796003e-06, 'samples': 27061248, 'steps': 140943, 'loss/train': 1.4373259544372559} 11/07/2021 17:05:31 - INFO - __main__ - Step 140945: {'lr': 4.604912382210224e-06, 'samples': 27061440, 'steps': 140944, 'loss/train': 1.3674204349517822} 11/07/2021 17:05:31 - INFO - __main__ - Step 140946: {'lr': 4.603898585195721e-06, 'samples': 27061632, 'steps': 140945, 'loss/train': 1.109850287437439} 11/07/2021 17:05:31 - INFO - __main__ - Step 140947: {'lr': 4.602884898752907e-06, 'samples': 27061824, 'steps': 140946, 'loss/train': 0.3413761854171753} 11/07/2021 17:05:32 - INFO - __main__ - Step 140948: {'lr': 4.601871322882229e-06, 'samples': 27062016, 'steps': 140947, 'loss/train': 1.458303451538086} 11/07/2021 17:05:33 - INFO - __main__ - Step 140949: {'lr': 4.600857857584184e-06, 'samples': 27062208, 'steps': 140948, 'loss/train': 1.5006141662597656} 11/07/2021 17:05:33 - INFO - __main__ - Step 140950: {'lr': 4.599844502859191e-06, 'samples': 27062400, 'steps': 140949, 'loss/train': 1.37466299533844} 11/07/2021 17:05:33 - INFO - __main__ - Step 140951: {'lr': 4.598831258707719e-06, 'samples': 27062592, 'steps': 140950, 'loss/train': 1.5130456686019897} 11/07/2021 17:05:34 - INFO - __main__ - Step 140952: {'lr': 4.597818125130215e-06, 'samples': 27062784, 'steps': 140951, 'loss/train': 0.9250797629356384} 11/07/2021 17:05:34 - INFO - __main__ - Step 140953: {'lr': 4.59680510212715e-06, 'samples': 27062976, 'steps': 140952, 'loss/train': 1.48830246925354} 11/07/2021 17:05:35 - INFO - __main__ - Step 140954: {'lr': 4.595792189698994e-06, 'samples': 27063168, 'steps': 140953, 'loss/train': 1.414387822151184} 11/07/2021 17:05:36 - INFO - __main__ - Step 140955: {'lr': 4.594779387846193e-06, 'samples': 27063360, 'steps': 140954, 'loss/train': 1.1561260223388672} 11/07/2021 17:05:36 - INFO - __main__ - Step 140956: {'lr': 4.593766696569162e-06, 'samples': 27063552, 'steps': 140955, 'loss/train': 0.8831173181533813} 11/07/2021 17:05:36 - INFO - __main__ - Step 140957: {'lr': 4.592754115868431e-06, 'samples': 27063744, 'steps': 140956, 'loss/train': 1.3207287788391113} 11/07/2021 17:05:37 - INFO - __main__ - Step 140958: {'lr': 4.591741645744385e-06, 'samples': 27063936, 'steps': 140957, 'loss/train': 2.9372689723968506} 11/07/2021 17:05:37 - INFO - __main__ - Step 140959: {'lr': 4.590729286197554e-06, 'samples': 27064128, 'steps': 140958, 'loss/train': 1.512363314628601} 11/07/2021 17:05:38 - INFO - __main__ - Step 140960: {'lr': 4.589717037228353e-06, 'samples': 27064320, 'steps': 140959, 'loss/train': 1.1385483741760254} 11/07/2021 17:05:39 - INFO - __main__ - Step 140961: {'lr': 4.5887048988371985e-06, 'samples': 27064512, 'steps': 140960, 'loss/train': 1.3331986665725708} 11/07/2021 17:05:39 - INFO - __main__ - Step 140962: {'lr': 4.587692871024618e-06, 'samples': 27064704, 'steps': 140961, 'loss/train': 1.2496007680892944} 11/07/2021 17:05:39 - INFO - __main__ - Step 140963: {'lr': 4.586680953791028e-06, 'samples': 27064896, 'steps': 140962, 'loss/train': 1.4084925651550293} 11/07/2021 17:05:40 - INFO - __main__ - Step 140964: {'lr': 4.585669147136873e-06, 'samples': 27065088, 'steps': 140963, 'loss/train': 0.7734197378158569} 11/07/2021 17:05:41 - INFO - __main__ - Step 140965: {'lr': 4.584657451062652e-06, 'samples': 27065280, 'steps': 140964, 'loss/train': 1.189849853515625} 11/07/2021 17:05:41 - INFO - __main__ - Step 140966: {'lr': 4.58364586556878e-06, 'samples': 27065472, 'steps': 140965, 'loss/train': 1.2753745317459106} 11/07/2021 17:05:41 - INFO - __main__ - Step 140967: {'lr': 4.582634390655732e-06, 'samples': 27065664, 'steps': 140966, 'loss/train': 0.8860864639282227} 11/07/2021 17:05:42 - INFO - __main__ - Step 140968: {'lr': 4.581623026323978e-06, 'samples': 27065856, 'steps': 140967, 'loss/train': 1.38002347946167} 11/07/2021 17:05:42 - INFO - __main__ - Step 140969: {'lr': 4.580611772573934e-06, 'samples': 27066048, 'steps': 140968, 'loss/train': 2.1515324115753174} 11/07/2021 17:05:43 - INFO - __main__ - Step 140970: {'lr': 4.5796006294061e-06, 'samples': 27066240, 'steps': 140969, 'loss/train': 0.2712504267692566} 11/07/2021 17:05:43 - INFO - __main__ - Step 140971: {'lr': 4.57858959682092e-06, 'samples': 27066432, 'steps': 140970, 'loss/train': 1.0543889999389648} 11/07/2021 17:05:44 - INFO - __main__ - Step 140972: {'lr': 4.577578674818811e-06, 'samples': 27066624, 'steps': 140971, 'loss/train': 0.6798655986785889} 11/07/2021 17:05:44 - INFO - __main__ - Step 140973: {'lr': 4.5765678634003e-06, 'samples': 27066816, 'steps': 140972, 'loss/train': 1.3533074855804443} 11/07/2021 17:05:44 - INFO - __main__ - Step 140974: {'lr': 4.575557162565774e-06, 'samples': 27067008, 'steps': 140973, 'loss/train': 1.6788218021392822} 11/07/2021 17:05:46 - INFO - __main__ - Step 140975: {'lr': 4.574546572315708e-06, 'samples': 27067200, 'steps': 140974, 'loss/train': 1.3625680208206177} 11/07/2021 17:05:46 - INFO - __main__ - Step 140976: {'lr': 4.573536092650571e-06, 'samples': 27067392, 'steps': 140975, 'loss/train': 1.2425373792648315} 11/07/2021 17:05:46 - INFO - __main__ - Step 140977: {'lr': 4.572525723570809e-06, 'samples': 27067584, 'steps': 140976, 'loss/train': 1.5227785110473633} 11/07/2021 17:05:47 - INFO - __main__ - Step 140978: {'lr': 4.571515465076864e-06, 'samples': 27067776, 'steps': 140977, 'loss/train': 1.591447114944458} 11/07/2021 17:05:47 - INFO - __main__ - Step 140979: {'lr': 4.570505317169238e-06, 'samples': 27067968, 'steps': 140978, 'loss/train': 3.6802830696105957} 11/07/2021 17:05:47 - INFO - __main__ - Step 140980: {'lr': 4.569495279848345e-06, 'samples': 27068160, 'steps': 140979, 'loss/train': 1.0853205919265747} 11/07/2021 17:05:48 - INFO - __main__ - Step 140981: {'lr': 4.568485353114632e-06, 'samples': 27068352, 'steps': 140980, 'loss/train': 0.9262933731079102} 11/07/2021 17:05:49 - INFO - __main__ - Step 140982: {'lr': 4.567475536968596e-06, 'samples': 27068544, 'steps': 140981, 'loss/train': 1.2999286651611328} 11/07/2021 17:05:49 - INFO - __main__ - Step 140983: {'lr': 4.566465831410655e-06, 'samples': 27068736, 'steps': 140982, 'loss/train': 0.8933275938034058} 11/07/2021 17:05:50 - INFO - __main__ - Step 140984: {'lr': 4.5654562364412786e-06, 'samples': 27068928, 'steps': 140983, 'loss/train': 1.2922940254211426} 11/07/2021 17:05:50 - INFO - __main__ - Step 140985: {'lr': 4.564446752060914e-06, 'samples': 27069120, 'steps': 140984, 'loss/train': 1.9964898824691772} 11/07/2021 17:05:51 - INFO - __main__ - Step 140986: {'lr': 4.56343737827003e-06, 'samples': 27069312, 'steps': 140985, 'loss/train': 1.2736977338790894} 11/07/2021 17:05:51 - INFO - __main__ - Step 140987: {'lr': 4.5624281150691e-06, 'samples': 27069504, 'steps': 140986, 'loss/train': 0.7611567378044128} 11/07/2021 17:05:52 - INFO - __main__ - Step 140988: {'lr': 4.561418962458513e-06, 'samples': 27069696, 'steps': 140987, 'loss/train': 1.305756688117981} 11/07/2021 17:05:52 - INFO - __main__ - Step 140989: {'lr': 4.560409920438796e-06, 'samples': 27069888, 'steps': 140988, 'loss/train': 1.0380473136901855} 11/07/2021 17:05:52 - INFO - __main__ - Step 140990: {'lr': 4.559400989010337e-06, 'samples': 27070080, 'steps': 140989, 'loss/train': 1.2169212102890015} 11/07/2021 17:05:53 - INFO - __main__ - Step 140991: {'lr': 4.5583921681736365e-06, 'samples': 27070272, 'steps': 140990, 'loss/train': 1.5160127878189087} 11/07/2021 17:05:54 - INFO - __main__ - Step 140992: {'lr': 4.557383457929137e-06, 'samples': 27070464, 'steps': 140991, 'loss/train': 1.2020070552825928} 11/07/2021 17:05:54 - INFO - __main__ - Step 140993: {'lr': 4.556374858277312e-06, 'samples': 27070656, 'steps': 140992, 'loss/train': 1.1261377334594727} 11/07/2021 17:05:54 - INFO - __main__ - Step 140994: {'lr': 4.555366369218578e-06, 'samples': 27070848, 'steps': 140993, 'loss/train': 0.673716127872467} 11/07/2021 17:05:55 - INFO - __main__ - Step 140995: {'lr': 4.554357990753405e-06, 'samples': 27071040, 'steps': 140994, 'loss/train': 1.2844244241714478} 11/07/2021 17:05:55 - INFO - __main__ - Step 140996: {'lr': 4.553349722882266e-06, 'samples': 27071232, 'steps': 140995, 'loss/train': 1.340876817703247} 11/07/2021 17:05:56 - INFO - __main__ - Step 140997: {'lr': 4.552341565605578e-06, 'samples': 27071424, 'steps': 140996, 'loss/train': 1.3456637859344482} 11/07/2021 17:05:56 - INFO - __main__ - Step 140998: {'lr': 4.551333518923867e-06, 'samples': 27071616, 'steps': 140997, 'loss/train': 1.1755448579788208} 11/07/2021 17:05:57 - INFO - __main__ - Step 140999: {'lr': 4.550325582837495e-06, 'samples': 27071808, 'steps': 140998, 'loss/train': 1.491976261138916} 11/07/2021 17:05:57 - INFO - __main__ - Step 141000: {'lr': 4.5493177573469605e-06, 'samples': 27072000, 'steps': 140999, 'loss/train': 1.1136256456375122} 11/07/2021 17:05:57 - INFO - __main__ - Step 141001: {'lr': 4.548310042452736e-06, 'samples': 27072192, 'steps': 141000, 'loss/train': 1.3662660121917725} 11/07/2021 17:05:59 - INFO - __main__ - Step 141002: {'lr': 4.547302438155238e-06, 'samples': 27072384, 'steps': 141001, 'loss/train': 1.408588171005249} 11/07/2021 17:05:59 - INFO - __main__ - Step 141003: {'lr': 4.5462949444549374e-06, 'samples': 27072576, 'steps': 141002, 'loss/train': 1.0799304246902466} 11/07/2021 17:05:59 - INFO - __main__ - Step 141004: {'lr': 4.545287561352279e-06, 'samples': 27072768, 'steps': 141003, 'loss/train': 1.20368230342865} 11/07/2021 17:06:00 - INFO - __main__ - Step 141005: {'lr': 4.5442802888477355e-06, 'samples': 27072960, 'steps': 141004, 'loss/train': 1.894527792930603} 11/07/2021 17:06:00 - INFO - __main__ - Step 141006: {'lr': 4.543273126941749e-06, 'samples': 27073152, 'steps': 141005, 'loss/train': 1.3507453203201294} 11/07/2021 17:06:01 - INFO - __main__ - Step 141007: {'lr': 4.542266075634793e-06, 'samples': 27073344, 'steps': 141006, 'loss/train': 1.1839275360107422} 11/07/2021 17:06:02 - INFO - __main__ - Step 141008: {'lr': 4.541259134927284e-06, 'samples': 27073536, 'steps': 141007, 'loss/train': 1.5358966588974} 11/07/2021 17:06:02 - INFO - __main__ - Step 141009: {'lr': 4.540252304819748e-06, 'samples': 27073728, 'steps': 141008, 'loss/train': 0.6354786157608032} 11/07/2021 17:06:02 - INFO - __main__ - Step 141010: {'lr': 4.539245585312546e-06, 'samples': 27073920, 'steps': 141009, 'loss/train': 0.8279646039009094} 11/07/2021 17:06:03 - INFO - __main__ - Step 141011: {'lr': 4.5382389764061506e-06, 'samples': 27074112, 'steps': 141010, 'loss/train': 0.803709089756012} 11/07/2021 17:06:04 - INFO - __main__ - Step 141012: {'lr': 4.537232478101061e-06, 'samples': 27074304, 'steps': 141011, 'loss/train': 1.595360279083252} 11/07/2021 17:06:04 - INFO - __main__ - Step 141013: {'lr': 4.536226090397694e-06, 'samples': 27074496, 'steps': 141012, 'loss/train': 1.0356035232543945} 11/07/2021 17:06:05 - INFO - __main__ - Step 141014: {'lr': 4.535219813296521e-06, 'samples': 27074688, 'steps': 141013, 'loss/train': 1.558254599571228} 11/07/2021 17:06:05 - INFO - __main__ - Step 141015: {'lr': 4.534213646798013e-06, 'samples': 27074880, 'steps': 141014, 'loss/train': 1.3684190511703491} 11/07/2021 17:06:05 - INFO - __main__ - Step 141016: {'lr': 4.533207590902561e-06, 'samples': 27075072, 'steps': 141015, 'loss/train': 0.5566559433937073} 11/07/2021 17:06:06 - INFO - __main__ - Step 141017: {'lr': 4.53220164561069e-06, 'samples': 27075264, 'steps': 141016, 'loss/train': 1.026716947555542} 11/07/2021 17:06:07 - INFO - __main__ - Step 141018: {'lr': 4.531195810922817e-06, 'samples': 27075456, 'steps': 141017, 'loss/train': 1.6568049192428589} 11/07/2021 17:06:07 - INFO - __main__ - Step 141019: {'lr': 4.530190086839386e-06, 'samples': 27075648, 'steps': 141018, 'loss/train': 0.9749072194099426} 11/07/2021 17:06:07 - INFO - __main__ - Step 141020: {'lr': 4.5291844733608975e-06, 'samples': 27075840, 'steps': 141019, 'loss/train': 1.606966495513916} 11/07/2021 17:06:08 - INFO - __main__ - Step 141021: {'lr': 4.52817897048774e-06, 'samples': 27076032, 'steps': 141020, 'loss/train': 1.1017184257507324} 11/07/2021 17:06:08 - INFO - __main__ - Step 141022: {'lr': 4.527173578220384e-06, 'samples': 27076224, 'steps': 141021, 'loss/train': 1.3677959442138672} 11/07/2021 17:06:09 - INFO - __main__ - Step 141023: {'lr': 4.52616829655933e-06, 'samples': 27076416, 'steps': 141022, 'loss/train': 1.5730682611465454} 11/07/2021 17:06:09 - INFO - __main__ - Step 141024: {'lr': 4.525163125504967e-06, 'samples': 27076608, 'steps': 141023, 'loss/train': 2.0601799488067627} 11/07/2021 17:06:10 - INFO - __main__ - Step 141025: {'lr': 4.524158065057793e-06, 'samples': 27076800, 'steps': 141024, 'loss/train': 1.3033016920089722} 11/07/2021 17:06:10 - INFO - __main__ - Step 141026: {'lr': 4.523153115218226e-06, 'samples': 27076992, 'steps': 141025, 'loss/train': 1.276179313659668} 11/07/2021 17:06:11 - INFO - __main__ - Step 141027: {'lr': 4.522148275986765e-06, 'samples': 27077184, 'steps': 141026, 'loss/train': 1.3215519189834595} 11/07/2021 17:06:12 - INFO - __main__ - Step 141028: {'lr': 4.521143547363826e-06, 'samples': 27077376, 'steps': 141027, 'loss/train': 0.7411032915115356} 11/07/2021 17:06:12 - INFO - __main__ - Step 141029: {'lr': 4.520138929349854e-06, 'samples': 27077568, 'steps': 141028, 'loss/train': 1.04676353931427} 11/07/2021 17:06:12 - INFO - __main__ - Step 141030: {'lr': 4.519134421945348e-06, 'samples': 27077760, 'steps': 141029, 'loss/train': 1.4851237535476685} 11/07/2021 17:06:13 - INFO - __main__ - Step 141031: {'lr': 4.518130025150724e-06, 'samples': 27077952, 'steps': 141030, 'loss/train': 1.5009522438049316} 11/07/2021 17:06:13 - INFO - __main__ - Step 141032: {'lr': 4.517125738966455e-06, 'samples': 27078144, 'steps': 141031, 'loss/train': 1.3583871126174927} 11/07/2021 17:06:14 - INFO - __main__ - Step 141033: {'lr': 4.516121563392955e-06, 'samples': 27078336, 'steps': 141032, 'loss/train': 1.0361950397491455} 11/07/2021 17:06:15 - INFO - __main__ - Step 141034: {'lr': 4.515117498430698e-06, 'samples': 27078528, 'steps': 141033, 'loss/train': 1.1682969331741333} 11/07/2021 17:06:15 - INFO - __main__ - Step 141035: {'lr': 4.514113544080156e-06, 'samples': 27078720, 'steps': 141034, 'loss/train': 1.213483214378357} 11/07/2021 17:06:15 - INFO - __main__ - Step 141036: {'lr': 4.513109700341772e-06, 'samples': 27078912, 'steps': 141035, 'loss/train': 1.1147538423538208} 11/07/2021 17:06:16 - INFO - __main__ - Step 141037: {'lr': 4.5121059672159906e-06, 'samples': 27079104, 'steps': 141036, 'loss/train': 1.3137757778167725} 11/07/2021 17:06:17 - INFO - __main__ - Step 141038: {'lr': 4.511102344703255e-06, 'samples': 27079296, 'steps': 141037, 'loss/train': 1.3882863521575928} 11/07/2021 17:06:17 - INFO - __main__ - Step 141039: {'lr': 4.510098832804038e-06, 'samples': 27079488, 'steps': 141038, 'loss/train': 1.4325947761535645} 11/07/2021 17:06:17 - INFO - __main__ - Step 141040: {'lr': 4.509095431518784e-06, 'samples': 27079680, 'steps': 141039, 'loss/train': 1.0674933195114136} 11/07/2021 17:06:18 - INFO - __main__ - Step 141041: {'lr': 4.508092140847936e-06, 'samples': 27079872, 'steps': 141040, 'loss/train': 1.353209376335144} 11/07/2021 17:06:18 - INFO - __main__ - Step 141042: {'lr': 4.507088960791967e-06, 'samples': 27080064, 'steps': 141041, 'loss/train': 1.3392690420150757} 11/07/2021 17:06:18 - INFO - __main__ - Step 141043: {'lr': 4.506085891351319e-06, 'samples': 27080256, 'steps': 141042, 'loss/train': 1.2318650484085083} 11/07/2021 17:06:20 - INFO - __main__ - Step 141044: {'lr': 4.505082932526411e-06, 'samples': 27080448, 'steps': 141043, 'loss/train': 1.345490574836731} 11/07/2021 17:06:20 - INFO - __main__ - Step 141045: {'lr': 4.504080084317741e-06, 'samples': 27080640, 'steps': 141044, 'loss/train': 0.9789386987686157} 11/07/2021 17:06:20 - INFO - __main__ - Step 141046: {'lr': 4.503077346725754e-06, 'samples': 27080832, 'steps': 141045, 'loss/train': 0.9740573763847351} 11/07/2021 17:06:21 - INFO - __main__ - Step 141047: {'lr': 4.502074719750865e-06, 'samples': 27081024, 'steps': 141046, 'loss/train': 1.1904821395874023} 11/07/2021 17:06:21 - INFO - __main__ - Step 141048: {'lr': 4.501072203393575e-06, 'samples': 27081216, 'steps': 141047, 'loss/train': 1.7859041690826416} 11/07/2021 17:06:22 - INFO - __main__ - Step 141049: {'lr': 4.500069797654299e-06, 'samples': 27081408, 'steps': 141048, 'loss/train': 0.4233951270580292} 11/07/2021 17:06:22 - INFO - __main__ - Step 141050: {'lr': 4.49906750253351e-06, 'samples': 27081600, 'steps': 141049, 'loss/train': 0.5432690978050232} 11/07/2021 17:06:23 - INFO - __main__ - Step 141051: {'lr': 4.498065318031652e-06, 'samples': 27081792, 'steps': 141050, 'loss/train': 1.1021113395690918} 11/07/2021 17:06:23 - INFO - __main__ - Step 141052: {'lr': 4.497063244149196e-06, 'samples': 27081984, 'steps': 141051, 'loss/train': 1.3392376899719238} 11/07/2021 17:06:23 - INFO - __main__ - Step 141053: {'lr': 4.49606128088656e-06, 'samples': 27082176, 'steps': 141052, 'loss/train': 1.6235346794128418} 11/07/2021 17:06:24 - INFO - __main__ - Step 141054: {'lr': 4.495059428244213e-06, 'samples': 27082368, 'steps': 141053, 'loss/train': 1.0325541496276855} 11/07/2021 17:06:25 - INFO - __main__ - Step 141055: {'lr': 4.494057686222603e-06, 'samples': 27082560, 'steps': 141054, 'loss/train': 0.7880937457084656} 11/07/2021 17:06:25 - INFO - __main__ - Step 141056: {'lr': 4.49305605482217e-06, 'samples': 27082752, 'steps': 141055, 'loss/train': 1.810654878616333} 11/07/2021 17:06:25 - INFO - __main__ - Step 141057: {'lr': 4.492054534043389e-06, 'samples': 27082944, 'steps': 141056, 'loss/train': 1.4769694805145264} 11/07/2021 17:06:26 - INFO - __main__ - Step 141058: {'lr': 4.491053123886702e-06, 'samples': 27083136, 'steps': 141057, 'loss/train': 1.3151302337646484} 11/07/2021 17:06:27 - INFO - __main__ - Step 141059: {'lr': 4.490051824352553e-06, 'samples': 27083328, 'steps': 141058, 'loss/train': 1.2096999883651733} 11/07/2021 17:06:27 - INFO - __main__ - Step 141060: {'lr': 4.4890506354414165e-06, 'samples': 27083520, 'steps': 141059, 'loss/train': 1.2571313381195068} 11/07/2021 17:06:28 - INFO - __main__ - Step 141061: {'lr': 4.488049557153706e-06, 'samples': 27083712, 'steps': 141060, 'loss/train': 1.2343273162841797} 11/07/2021 17:06:28 - INFO - __main__ - Step 141062: {'lr': 4.4870485894898675e-06, 'samples': 27083904, 'steps': 141061, 'loss/train': 1.1366404294967651} 11/07/2021 17:06:28 - INFO - __main__ - Step 141063: {'lr': 4.4860477324504265e-06, 'samples': 27084096, 'steps': 141062, 'loss/train': 1.2690963745117188} 11/07/2021 17:06:29 - INFO - __main__ - Step 141064: {'lr': 4.485046986035746e-06, 'samples': 27084288, 'steps': 141063, 'loss/train': 0.1911131888628006} 11/07/2021 17:06:30 - INFO - __main__ - Step 141065: {'lr': 4.4840463502463235e-06, 'samples': 27084480, 'steps': 141064, 'loss/train': 1.4929120540618896} 11/07/2021 17:06:30 - INFO - __main__ - Step 141066: {'lr': 4.483045825082604e-06, 'samples': 27084672, 'steps': 141065, 'loss/train': 0.9809763431549072} 11/07/2021 17:06:30 - INFO - __main__ - Step 141067: {'lr': 4.48204541054506e-06, 'samples': 27084864, 'steps': 141066, 'loss/train': 1.4803160429000854} 11/07/2021 17:06:31 - INFO - __main__ - Step 141068: {'lr': 4.481045106634107e-06, 'samples': 27085056, 'steps': 141067, 'loss/train': 1.356432557106018} 11/07/2021 17:06:31 - INFO - __main__ - Step 141069: {'lr': 4.48004491335019e-06, 'samples': 27085248, 'steps': 141068, 'loss/train': 1.3674808740615845} 11/07/2021 17:06:32 - INFO - __main__ - Step 141070: {'lr': 4.479044830693779e-06, 'samples': 27085440, 'steps': 141069, 'loss/train': 1.4739705324172974} 11/07/2021 17:06:33 - INFO - __main__ - Step 141071: {'lr': 4.47804485866532e-06, 'samples': 27085632, 'steps': 141070, 'loss/train': 1.0425350666046143} 11/07/2021 17:06:33 - INFO - __main__ - Step 141072: {'lr': 4.477044997265284e-06, 'samples': 27085824, 'steps': 141071, 'loss/train': 1.3637205362319946} 11/07/2021 17:06:33 - INFO - __main__ - Step 141073: {'lr': 4.476045246494087e-06, 'samples': 27086016, 'steps': 141072, 'loss/train': 1.230858564376831} 11/07/2021 17:06:34 - INFO - __main__ - Step 141074: {'lr': 4.475045606352174e-06, 'samples': 27086208, 'steps': 141073, 'loss/train': 1.0093538761138916} 11/07/2021 17:06:35 - INFO - __main__ - Step 141075: {'lr': 4.474046076840044e-06, 'samples': 27086400, 'steps': 141074, 'loss/train': 1.2040131092071533} 11/07/2021 17:06:35 - INFO - __main__ - Step 141076: {'lr': 4.473046657958113e-06, 'samples': 27086592, 'steps': 141075, 'loss/train': 1.5486711263656616} 11/07/2021 17:06:35 - INFO - __main__ - Step 141077: {'lr': 4.4720473497068535e-06, 'samples': 27086784, 'steps': 141076, 'loss/train': 1.7121028900146484} 11/07/2021 17:06:36 - INFO - __main__ - Step 141078: {'lr': 4.471048152086682e-06, 'samples': 27086976, 'steps': 141077, 'loss/train': 1.0471833944320679} 11/07/2021 17:06:36 - INFO - __main__ - Step 141079: {'lr': 4.4700490650980695e-06, 'samples': 27087168, 'steps': 141078, 'loss/train': 1.5430446863174438} 11/07/2021 17:06:37 - INFO - __main__ - Step 141080: {'lr': 4.469050088741461e-06, 'samples': 27087360, 'steps': 141079, 'loss/train': 1.3299533128738403} 11/07/2021 17:06:37 - INFO - __main__ - Step 141081: {'lr': 4.468051223017355e-06, 'samples': 27087552, 'steps': 141080, 'loss/train': 0.6390939950942993} 11/07/2021 17:06:38 - INFO - __main__ - Step 141082: {'lr': 4.467052467926114e-06, 'samples': 27087744, 'steps': 141081, 'loss/train': 1.5085350275039673} 11/07/2021 17:06:38 - INFO - __main__ - Step 141083: {'lr': 4.466053823468236e-06, 'samples': 27087936, 'steps': 141082, 'loss/train': 1.157873511314392} 11/07/2021 17:06:38 - INFO - __main__ - Step 141084: {'lr': 4.465055289644166e-06, 'samples': 27088128, 'steps': 141083, 'loss/train': 1.3954774141311646} 11/07/2021 17:06:39 - INFO - __main__ - Step 141085: {'lr': 4.464056866454347e-06, 'samples': 27088320, 'steps': 141084, 'loss/train': 0.9839330911636353} 11/07/2021 17:06:40 - INFO - __main__ - Step 141086: {'lr': 4.463058553899224e-06, 'samples': 27088512, 'steps': 141085, 'loss/train': 1.6287336349487305} 11/07/2021 17:06:40 - INFO - __main__ - Step 141087: {'lr': 4.462060351979297e-06, 'samples': 27088704, 'steps': 141086, 'loss/train': 1.2888394594192505} 11/07/2021 17:06:40 - INFO - __main__ - Step 141088: {'lr': 4.4610622606949535e-06, 'samples': 27088896, 'steps': 141087, 'loss/train': 1.587041974067688} 11/07/2021 17:06:41 - INFO - __main__ - Step 141089: {'lr': 4.460064280046666e-06, 'samples': 27089088, 'steps': 141088, 'loss/train': 1.3622411489486694} 11/07/2021 17:06:42 - INFO - __main__ - Step 141090: {'lr': 4.459066410034878e-06, 'samples': 27089280, 'steps': 141089, 'loss/train': 1.2523761987686157} 11/07/2021 17:06:42 - INFO - __main__ - Step 141091: {'lr': 4.458068650660035e-06, 'samples': 27089472, 'steps': 141090, 'loss/train': 1.5365720987319946} 11/07/2021 17:06:43 - INFO - __main__ - Step 141092: {'lr': 4.457071001922636e-06, 'samples': 27089664, 'steps': 141091, 'loss/train': 1.3165149688720703} 11/07/2021 17:06:43 - INFO - __main__ - Step 141093: {'lr': 4.456073463823068e-06, 'samples': 27089856, 'steps': 141092, 'loss/train': 1.1951916217803955} 11/07/2021 17:06:43 - INFO - __main__ - Step 141094: {'lr': 4.455076036361805e-06, 'samples': 27090048, 'steps': 141093, 'loss/train': 1.1863219738006592} 11/07/2021 17:06:45 - INFO - __main__ - Step 141095: {'lr': 4.4540787195393176e-06, 'samples': 27090240, 'steps': 141094, 'loss/train': 1.428005337715149} 11/07/2021 17:06:45 - INFO - __main__ - Step 141096: {'lr': 4.453081513355994e-06, 'samples': 27090432, 'steps': 141095, 'loss/train': 1.2331241369247437} 11/07/2021 17:06:45 - INFO - __main__ - Step 141097: {'lr': 4.452084417812363e-06, 'samples': 27090624, 'steps': 141096, 'loss/train': 1.0953298807144165} 11/07/2021 17:06:46 - INFO - __main__ - Step 141098: {'lr': 4.451087432908813e-06, 'samples': 27090816, 'steps': 141097, 'loss/train': 1.4274747371673584} 11/07/2021 17:06:46 - INFO - __main__ - Step 141099: {'lr': 4.450090558645814e-06, 'samples': 27091008, 'steps': 141098, 'loss/train': 0.9420353174209595} 11/07/2021 17:06:47 - INFO - __main__ - Step 141100: {'lr': 4.449093795023812e-06, 'samples': 27091200, 'steps': 141099, 'loss/train': 1.344778060913086} 11/07/2021 17:06:48 - INFO - __main__ - Step 141101: {'lr': 4.44809714204325e-06, 'samples': 27091392, 'steps': 141100, 'loss/train': 0.5578677654266357} 11/07/2021 17:06:48 - INFO - __main__ - Step 141102: {'lr': 4.447100599704601e-06, 'samples': 27091584, 'steps': 141101, 'loss/train': 1.2700389623641968} 11/07/2021 17:06:48 - INFO - __main__ - Step 141103: {'lr': 4.446104168008308e-06, 'samples': 27091776, 'steps': 141102, 'loss/train': 1.7361775636672974} 11/07/2021 17:06:49 - INFO - __main__ - Step 141104: {'lr': 4.445107846954788e-06, 'samples': 27091968, 'steps': 141103, 'loss/train': 0.8043045997619629} 11/07/2021 17:06:49 - INFO - __main__ - Step 141105: {'lr': 4.444111636544512e-06, 'samples': 27092160, 'steps': 141104, 'loss/train': 1.3586833477020264} 11/07/2021 17:06:50 - INFO - __main__ - Step 141106: {'lr': 4.443115536777953e-06, 'samples': 27092352, 'steps': 141105, 'loss/train': 1.5745139122009277} 11/07/2021 17:06:50 - INFO - __main__ - Step 141107: {'lr': 4.4421195476555265e-06, 'samples': 27092544, 'steps': 141106, 'loss/train': 1.677897572517395} 11/07/2021 17:06:51 - INFO - __main__ - Step 141108: {'lr': 4.441123669177705e-06, 'samples': 27092736, 'steps': 141107, 'loss/train': 1.0738720893859863} 11/07/2021 17:06:51 - INFO - __main__ - Step 141109: {'lr': 4.440127901344932e-06, 'samples': 27092928, 'steps': 141108, 'loss/train': 1.15213942527771} 11/07/2021 17:06:51 - INFO - __main__ - Step 141110: {'lr': 4.4391322441576236e-06, 'samples': 27093120, 'steps': 141109, 'loss/train': 0.9040024876594543} 11/07/2021 17:06:52 - INFO - __main__ - Step 141111: {'lr': 4.4381366976162516e-06, 'samples': 27093312, 'steps': 141110, 'loss/train': 1.3067580461502075} 11/07/2021 17:06:53 - INFO - __main__ - Step 141112: {'lr': 4.437141261721261e-06, 'samples': 27093504, 'steps': 141111, 'loss/train': 0.6530557870864868} 11/07/2021 17:06:53 - INFO - __main__ - Step 141113: {'lr': 4.436145936473124e-06, 'samples': 27093696, 'steps': 141112, 'loss/train': 0.7638456225395203} 11/07/2021 17:06:54 - INFO - __main__ - Step 141114: {'lr': 4.435150721872256e-06, 'samples': 27093888, 'steps': 141113, 'loss/train': 1.345474123954773} 11/07/2021 17:06:54 - INFO - __main__ - Step 141115: {'lr': 4.434155617919127e-06, 'samples': 27094080, 'steps': 141114, 'loss/train': 1.1571290493011475} 11/07/2021 17:06:55 - INFO - __main__ - Step 141116: {'lr': 4.433160624614185e-06, 'samples': 27094272, 'steps': 141115, 'loss/train': 1.2349621057510376} 11/07/2021 17:06:55 - INFO - __main__ - Step 141117: {'lr': 4.432165741957872e-06, 'samples': 27094464, 'steps': 141116, 'loss/train': 1.4857710599899292} 11/07/2021 17:06:56 - INFO - __main__ - Step 141118: {'lr': 4.431170969950632e-06, 'samples': 27094656, 'steps': 141117, 'loss/train': 0.7707303166389465} 11/07/2021 17:06:56 - INFO - __main__ - Step 141119: {'lr': 4.430176308592909e-06, 'samples': 27094848, 'steps': 141118, 'loss/train': 1.3470758199691772} 11/07/2021 17:06:56 - INFO - __main__ - Step 141120: {'lr': 4.429181757885148e-06, 'samples': 27095040, 'steps': 141119, 'loss/train': 1.5554057359695435} 11/07/2021 17:06:57 - INFO - __main__ - Step 141121: {'lr': 4.4281873178278475e-06, 'samples': 27095232, 'steps': 141120, 'loss/train': 1.246880292892456} 11/07/2021 17:06:58 - INFO - __main__ - Step 141122: {'lr': 4.4271929884213965e-06, 'samples': 27095424, 'steps': 141121, 'loss/train': 1.3101195096969604} 11/07/2021 17:06:58 - INFO - __main__ - Step 141123: {'lr': 4.42619876966624e-06, 'samples': 27095616, 'steps': 141122, 'loss/train': 1.407211184501648} 11/07/2021 17:06:58 - INFO - __main__ - Step 141124: {'lr': 4.425204661562876e-06, 'samples': 27095808, 'steps': 141123, 'loss/train': 1.3405522108078003} 11/07/2021 17:06:59 - INFO - __main__ - Step 141125: {'lr': 4.424210664111722e-06, 'samples': 27096000, 'steps': 141124, 'loss/train': 1.0305492877960205} 11/07/2021 17:06:59 - INFO - __main__ - Step 141126: {'lr': 4.423216777313221e-06, 'samples': 27096192, 'steps': 141125, 'loss/train': 1.437605381011963} 11/07/2021 17:07:00 - INFO - __main__ - Step 141127: {'lr': 4.422223001167819e-06, 'samples': 27096384, 'steps': 141126, 'loss/train': 1.4010117053985596} 11/07/2021 17:07:01 - INFO - __main__ - Step 141128: {'lr': 4.421229335675986e-06, 'samples': 27096576, 'steps': 141127, 'loss/train': 0.23683404922485352} 11/07/2021 17:07:01 - INFO - __main__ - Step 141129: {'lr': 4.4202357808381664e-06, 'samples': 27096768, 'steps': 141128, 'loss/train': 0.9483622908592224} 11/07/2021 17:07:01 - INFO - __main__ - Step 141130: {'lr': 4.419242336654805e-06, 'samples': 27096960, 'steps': 141129, 'loss/train': 1.658276915550232} 11/07/2021 17:07:02 - INFO - __main__ - Step 141131: {'lr': 4.4182490031263175e-06, 'samples': 27097152, 'steps': 141130, 'loss/train': 1.014966607093811} 11/07/2021 17:07:03 - INFO - __main__ - Step 141132: {'lr': 4.4172557802532046e-06, 'samples': 27097344, 'steps': 141131, 'loss/train': 0.9117335677146912} 11/07/2021 17:07:03 - INFO - __main__ - Step 141133: {'lr': 4.416262668035853e-06, 'samples': 27097536, 'steps': 141132, 'loss/train': 1.235319972038269} 11/07/2021 17:07:04 - INFO - __main__ - Step 141134: {'lr': 4.415269666474763e-06, 'samples': 27097728, 'steps': 141133, 'loss/train': 1.4039924144744873} 11/07/2021 17:07:04 - INFO - __main__ - Step 141135: {'lr': 4.4142767755703805e-06, 'samples': 27097920, 'steps': 141134, 'loss/train': 1.155704379081726} 11/07/2021 17:07:04 - INFO - __main__ - Step 141136: {'lr': 4.41328399532312e-06, 'samples': 27098112, 'steps': 141135, 'loss/train': 1.5249531269073486} 11/07/2021 17:07:05 - INFO - __main__ - Step 141137: {'lr': 4.412291325733453e-06, 'samples': 27098304, 'steps': 141136, 'loss/train': 1.0484119653701782} 11/07/2021 17:07:06 - INFO - __main__ - Step 141138: {'lr': 4.411298766801797e-06, 'samples': 27098496, 'steps': 141137, 'loss/train': 0.7566105127334595} 11/07/2021 17:07:06 - INFO - __main__ - Step 141139: {'lr': 4.410306318528623e-06, 'samples': 27098688, 'steps': 141138, 'loss/train': 1.4919687509536743} 11/07/2021 17:07:07 - INFO - __main__ - Step 141140: {'lr': 4.409313980914376e-06, 'samples': 27098880, 'steps': 141139, 'loss/train': 1.6564116477966309} 11/07/2021 17:07:07 - INFO - __main__ - Step 141141: {'lr': 4.408321753959527e-06, 'samples': 27099072, 'steps': 141140, 'loss/train': 1.3871155977249146} 11/07/2021 17:07:07 - INFO - __main__ - Step 141142: {'lr': 4.407329637664464e-06, 'samples': 27099264, 'steps': 141141, 'loss/train': 1.2867823839187622} 11/07/2021 17:07:08 - INFO - __main__ - Step 141143: {'lr': 4.406337632029689e-06, 'samples': 27099456, 'steps': 141142, 'loss/train': 1.1598846912384033} 11/07/2021 17:07:09 - INFO - __main__ - Step 141144: {'lr': 4.405345737055616e-06, 'samples': 27099648, 'steps': 141143, 'loss/train': 1.5607638359069824} 11/07/2021 17:07:09 - INFO - __main__ - Step 141145: {'lr': 4.404353952742718e-06, 'samples': 27099840, 'steps': 141144, 'loss/train': 0.4152504801750183} 11/07/2021 17:07:09 - INFO - __main__ - Step 141146: {'lr': 4.403362279091411e-06, 'samples': 27100032, 'steps': 141145, 'loss/train': 1.7358489036560059} 11/07/2021 17:07:10 - INFO - __main__ - Step 141147: {'lr': 4.402370716102166e-06, 'samples': 27100224, 'steps': 141146, 'loss/train': 1.5565389394760132} 11/07/2021 17:07:11 - INFO - __main__ - Step 141148: {'lr': 4.401379263775457e-06, 'samples': 27100416, 'steps': 141147, 'loss/train': 1.090652585029602} 11/07/2021 17:07:11 - INFO - __main__ - Step 141149: {'lr': 4.4003879221116706e-06, 'samples': 27100608, 'steps': 141148, 'loss/train': 0.9202014207839966} 11/07/2021 17:07:11 - INFO - __main__ - Step 141150: {'lr': 4.3993966911112795e-06, 'samples': 27100800, 'steps': 141149, 'loss/train': 1.0882600545883179} 11/07/2021 17:07:12 - INFO - __main__ - Step 141151: {'lr': 4.3984055707747274e-06, 'samples': 27100992, 'steps': 141150, 'loss/train': 2.2499430179595947} 11/07/2021 17:07:12 - INFO - __main__ - Step 141152: {'lr': 4.397414561102458e-06, 'samples': 27101184, 'steps': 141151, 'loss/train': 1.4319441318511963} 11/07/2021 17:07:13 - INFO - __main__ - Step 141153: {'lr': 4.396423662094917e-06, 'samples': 27101376, 'steps': 141152, 'loss/train': 2.105635404586792} 11/07/2021 17:07:14 - INFO - __main__ - Step 141154: {'lr': 4.395432873752575e-06, 'samples': 27101568, 'steps': 141153, 'loss/train': 1.2332589626312256} 11/07/2021 17:07:14 - INFO - __main__ - Step 141155: {'lr': 4.394442196075848e-06, 'samples': 27101760, 'steps': 141154, 'loss/train': 1.2102816104888916} 11/07/2021 17:07:14 - INFO - __main__ - Step 141156: {'lr': 4.393451629065209e-06, 'samples': 27101952, 'steps': 141155, 'loss/train': 1.1708078384399414} 11/07/2021 17:07:15 - INFO - __main__ - Step 141157: {'lr': 4.392461172721074e-06, 'samples': 27102144, 'steps': 141156, 'loss/train': 1.181403636932373} 11/07/2021 17:07:16 - INFO - __main__ - Step 141158: {'lr': 4.391470827043942e-06, 'samples': 27102336, 'steps': 141157, 'loss/train': 1.1191155910491943} 11/07/2021 17:07:17 - INFO - __main__ - Step 141159: {'lr': 4.390480592034174e-06, 'samples': 27102528, 'steps': 141158, 'loss/train': 1.093964695930481} 11/07/2021 17:07:17 - INFO - __main__ - Step 141160: {'lr': 4.389490467692297e-06, 'samples': 27102720, 'steps': 141159, 'loss/train': 0.77314293384552} 11/07/2021 17:07:17 - INFO - __main__ - Step 141161: {'lr': 4.388500454018729e-06, 'samples': 27102912, 'steps': 141160, 'loss/train': 1.5964876413345337} 11/07/2021 17:07:18 - INFO - __main__ - Step 141162: {'lr': 4.387510551013912e-06, 'samples': 27103104, 'steps': 141161, 'loss/train': 1.5913267135620117} 11/07/2021 17:07:18 - INFO - __main__ - Step 141163: {'lr': 4.386520758678292e-06, 'samples': 27103296, 'steps': 141162, 'loss/train': 1.3480803966522217} 11/07/2021 17:07:18 - INFO - __main__ - Step 141164: {'lr': 4.385531077012311e-06, 'samples': 27103488, 'steps': 141163, 'loss/train': 1.1745527982711792} 11/07/2021 17:07:19 - INFO - __main__ - Step 141165: {'lr': 4.384541506016415e-06, 'samples': 27103680, 'steps': 141164, 'loss/train': 1.497001051902771} 11/07/2021 17:07:20 - INFO - __main__ - Step 141166: {'lr': 4.383552045691047e-06, 'samples': 27103872, 'steps': 141165, 'loss/train': 1.151129126548767} 11/07/2021 17:07:20 - INFO - __main__ - Step 141167: {'lr': 4.3825626960366515e-06, 'samples': 27104064, 'steps': 141166, 'loss/train': 1.3220072984695435} 11/07/2021 17:07:20 - INFO - __main__ - Step 141168: {'lr': 4.3815734570537005e-06, 'samples': 27104256, 'steps': 141167, 'loss/train': 0.9961921572685242} 11/07/2021 17:07:21 - INFO - __main__ - Step 141169: {'lr': 4.380584328742637e-06, 'samples': 27104448, 'steps': 141168, 'loss/train': 1.4456520080566406} 11/07/2021 17:07:22 - INFO - __main__ - Step 141170: {'lr': 4.379595311103879e-06, 'samples': 27104640, 'steps': 141169, 'loss/train': 1.3201465606689453} 11/07/2021 17:07:22 - INFO - __main__ - Step 141171: {'lr': 4.378606404137869e-06, 'samples': 27104832, 'steps': 141170, 'loss/train': 0.8931124806404114} 11/07/2021 17:07:22 - INFO - __main__ - Step 141172: {'lr': 4.37761760784508e-06, 'samples': 27105024, 'steps': 141171, 'loss/train': 1.1462759971618652} 11/07/2021 17:07:23 - INFO - __main__ - Step 141173: {'lr': 4.376628922225956e-06, 'samples': 27105216, 'steps': 141172, 'loss/train': 1.3895741701126099} 11/07/2021 17:07:23 - INFO - __main__ - Step 141174: {'lr': 4.3756403472809126e-06, 'samples': 27105408, 'steps': 141173, 'loss/train': 0.9876969456672668} 11/07/2021 17:07:24 - INFO - __main__ - Step 141175: {'lr': 4.37465188301045e-06, 'samples': 27105600, 'steps': 141174, 'loss/train': 1.3466753959655762} 11/07/2021 17:07:25 - INFO - __main__ - Step 141176: {'lr': 4.373663529414957e-06, 'samples': 27105792, 'steps': 141175, 'loss/train': 1.6924539804458618} 11/07/2021 17:07:25 - INFO - __main__ - Step 141177: {'lr': 4.3726752864949036e-06, 'samples': 27105984, 'steps': 141176, 'loss/train': 1.3946514129638672} 11/07/2021 17:07:25 - INFO - __main__ - Step 141178: {'lr': 4.371687154250737e-06, 'samples': 27106176, 'steps': 141177, 'loss/train': 1.7416306734085083} 11/07/2021 17:07:26 - INFO - __main__ - Step 141179: {'lr': 4.3706991326828985e-06, 'samples': 27106368, 'steps': 141178, 'loss/train': 0.9102336764335632} 11/07/2021 17:07:27 - INFO - __main__ - Step 141180: {'lr': 4.369711221791805e-06, 'samples': 27106560, 'steps': 141179, 'loss/train': 1.2672178745269775} 11/07/2021 17:07:27 - INFO - __main__ - Step 141181: {'lr': 4.368723421577958e-06, 'samples': 27106752, 'steps': 141180, 'loss/train': 1.3991626501083374} 11/07/2021 17:07:28 - INFO - __main__ - Step 141182: {'lr': 4.3677357320417724e-06, 'samples': 27106944, 'steps': 141181, 'loss/train': 0.921986997127533} 11/07/2021 17:07:28 - INFO - __main__ - Step 141183: {'lr': 4.3667481531836915e-06, 'samples': 27107136, 'steps': 141182, 'loss/train': 1.6206347942352295} 11/07/2021 17:07:28 - INFO - __main__ - Step 141184: {'lr': 4.365760685004161e-06, 'samples': 27107328, 'steps': 141183, 'loss/train': 1.3516994714736938} 11/07/2021 17:07:29 - INFO - __main__ - Step 141185: {'lr': 4.364773327503624e-06, 'samples': 27107520, 'steps': 141184, 'loss/train': 1.3090208768844604} 11/07/2021 17:07:30 - INFO - __main__ - Step 141186: {'lr': 4.363786080682525e-06, 'samples': 27107712, 'steps': 141185, 'loss/train': 1.4334545135498047} 11/07/2021 17:07:31 - INFO - __main__ - Step 141187: {'lr': 4.362798944541308e-06, 'samples': 27107904, 'steps': 141186, 'loss/train': 0.3446611762046814} 11/07/2021 17:07:31 - INFO - __main__ - Step 141188: {'lr': 4.3618119190804716e-06, 'samples': 27108096, 'steps': 141187, 'loss/train': 1.0970216989517212} 11/07/2021 17:07:31 - INFO - __main__ - Step 141189: {'lr': 4.3608250043003785e-06, 'samples': 27108288, 'steps': 141188, 'loss/train': 2.1269843578338623} 11/07/2021 17:07:32 - INFO - __main__ - Step 141190: {'lr': 4.359838200201499e-06, 'samples': 27108480, 'steps': 141189, 'loss/train': 0.6506363749504089} 11/07/2021 17:07:33 - INFO - __main__ - Step 141191: {'lr': 4.358851506784306e-06, 'samples': 27108672, 'steps': 141190, 'loss/train': 1.4896653890609741} 11/07/2021 17:07:33 - INFO - __main__ - Step 141192: {'lr': 4.357864924049188e-06, 'samples': 27108864, 'steps': 141191, 'loss/train': 1.0339014530181885} 11/07/2021 17:07:33 - INFO - __main__ - Step 141193: {'lr': 4.356878451996671e-06, 'samples': 27109056, 'steps': 141192, 'loss/train': 1.182232141494751} 11/07/2021 17:07:34 - INFO - __main__ - Step 141194: {'lr': 4.355892090627117e-06, 'samples': 27109248, 'steps': 141193, 'loss/train': 0.870433509349823} 11/07/2021 17:07:34 - INFO - __main__ - Step 141195: {'lr': 4.354905839941026e-06, 'samples': 27109440, 'steps': 141194, 'loss/train': 0.9511042833328247} 11/07/2021 17:07:35 - INFO - __main__ - Step 141196: {'lr': 4.353919699938813e-06, 'samples': 27109632, 'steps': 141195, 'loss/train': 1.1744489669799805} 11/07/2021 17:07:36 - INFO - __main__ - Step 141197: {'lr': 4.352933670620951e-06, 'samples': 27109824, 'steps': 141196, 'loss/train': 0.37968096137046814} 11/07/2021 17:07:36 - INFO - __main__ - Step 141198: {'lr': 4.3519477519878555e-06, 'samples': 27110016, 'steps': 141197, 'loss/train': 1.153294563293457} 11/07/2021 17:07:36 - INFO - __main__ - Step 141199: {'lr': 4.350961944039972e-06, 'samples': 27110208, 'steps': 141198, 'loss/train': 1.2642667293548584} 11/07/2021 17:07:37 - INFO - __main__ - Step 141200: {'lr': 4.3499762467777705e-06, 'samples': 27110400, 'steps': 141199, 'loss/train': 1.292771816253662} 11/07/2021 17:07:38 - INFO - __main__ - Step 141201: {'lr': 4.348990660201668e-06, 'samples': 27110592, 'steps': 141200, 'loss/train': 1.3651785850524902} 11/07/2021 17:07:38 - INFO - __main__ - Step 141202: {'lr': 4.3480051843121374e-06, 'samples': 27110784, 'steps': 141201, 'loss/train': 0.5755302906036377} 11/07/2021 17:07:38 - INFO - __main__ - Step 141203: {'lr': 4.347019819109593e-06, 'samples': 27110976, 'steps': 141202, 'loss/train': 1.2990752458572388} 11/07/2021 17:07:39 - INFO - __main__ - Step 141204: {'lr': 4.346034564594509e-06, 'samples': 27111168, 'steps': 141203, 'loss/train': 1.5046992301940918} 11/07/2021 17:07:39 - INFO - __main__ - Step 141205: {'lr': 4.345049420767272e-06, 'samples': 27111360, 'steps': 141204, 'loss/train': 1.3021152019500732} 11/07/2021 17:07:40 - INFO - __main__ - Step 141206: {'lr': 4.3440643876284105e-06, 'samples': 27111552, 'steps': 141205, 'loss/train': 1.56157386302948} 11/07/2021 17:07:40 - INFO - __main__ - Step 141207: {'lr': 4.343079465178285e-06, 'samples': 27111744, 'steps': 141206, 'loss/train': 1.4594782590866089} 11/07/2021 17:07:41 - INFO - __main__ - Step 141208: {'lr': 4.342094653417394e-06, 'samples': 27111936, 'steps': 141207, 'loss/train': 1.5707738399505615} 11/07/2021 17:07:41 - INFO - __main__ - Step 141209: {'lr': 4.3411099523461565e-06, 'samples': 27112128, 'steps': 141208, 'loss/train': 1.3415452241897583} 11/07/2021 17:07:41 - INFO - __main__ - Step 141210: {'lr': 4.340125361965014e-06, 'samples': 27112320, 'steps': 141209, 'loss/train': 1.4153378009796143} 11/07/2021 17:07:43 - INFO - __main__ - Step 141211: {'lr': 4.339140882274439e-06, 'samples': 27112512, 'steps': 141210, 'loss/train': 1.2849640846252441} 11/07/2021 17:07:43 - INFO - __main__ - Step 141212: {'lr': 4.338156513274849e-06, 'samples': 27112704, 'steps': 141211, 'loss/train': 1.1135307550430298} 11/07/2021 17:07:43 - INFO - __main__ - Step 141213: {'lr': 4.3371722549667146e-06, 'samples': 27112896, 'steps': 141212, 'loss/train': 1.5158025026321411} 11/07/2021 17:07:44 - INFO - __main__ - Step 141214: {'lr': 4.336188107350425e-06, 'samples': 27113088, 'steps': 141213, 'loss/train': 1.4027841091156006} 11/07/2021 17:07:44 - INFO - __main__ - Step 141215: {'lr': 4.3352040704265075e-06, 'samples': 27113280, 'steps': 141214, 'loss/train': 1.1900664567947388} 11/07/2021 17:07:44 - INFO - __main__ - Step 141216: {'lr': 4.334220144195323e-06, 'samples': 27113472, 'steps': 141215, 'loss/train': 1.2300747632980347} 11/07/2021 17:07:45 - INFO - __main__ - Step 141217: {'lr': 4.3332363286573415e-06, 'samples': 27113664, 'steps': 141216, 'loss/train': 0.8172418475151062} 11/07/2021 17:07:46 - INFO - __main__ - Step 141218: {'lr': 4.332252623813038e-06, 'samples': 27113856, 'steps': 141217, 'loss/train': 0.9959660172462463} 11/07/2021 17:07:46 - INFO - __main__ - Step 141219: {'lr': 4.331269029662799e-06, 'samples': 27114048, 'steps': 141218, 'loss/train': 1.3238905668258667} 11/07/2021 17:07:47 - INFO - __main__ - Step 141220: {'lr': 4.330285546207125e-06, 'samples': 27114240, 'steps': 141219, 'loss/train': 1.466942310333252} 11/07/2021 17:07:47 - INFO - __main__ - Step 141221: {'lr': 4.329302173446404e-06, 'samples': 27114432, 'steps': 141220, 'loss/train': 1.6056795120239258} 11/07/2021 17:07:48 - INFO - __main__ - Step 141222: {'lr': 4.3283189113811346e-06, 'samples': 27114624, 'steps': 141221, 'loss/train': 1.460079312324524} 11/07/2021 17:07:48 - INFO - __main__ - Step 141223: {'lr': 4.327335760011736e-06, 'samples': 27114816, 'steps': 141222, 'loss/train': 1.5503538846969604} 11/07/2021 17:07:49 - INFO - __main__ - Step 141224: {'lr': 4.326352719338622e-06, 'samples': 27115008, 'steps': 141223, 'loss/train': 1.1683827638626099} 11/07/2021 17:07:49 - INFO - __main__ - Step 141225: {'lr': 4.3253697893622935e-06, 'samples': 27115200, 'steps': 141224, 'loss/train': 1.359566569328308} 11/07/2021 17:07:49 - INFO - __main__ - Step 141226: {'lr': 4.324386970083138e-06, 'samples': 27115392, 'steps': 141225, 'loss/train': 1.9233185052871704} 11/07/2021 17:07:51 - INFO - __main__ - Step 141227: {'lr': 4.323404261501629e-06, 'samples': 27115584, 'steps': 141226, 'loss/train': 1.1268506050109863} 11/07/2021 17:07:51 - INFO - __main__ - Step 141228: {'lr': 4.322421663618209e-06, 'samples': 27115776, 'steps': 141227, 'loss/train': 1.20869779586792} 11/07/2021 17:07:51 - INFO - __main__ - Step 141229: {'lr': 4.32143917643335e-06, 'samples': 27115968, 'steps': 141228, 'loss/train': 1.2629534006118774} 11/07/2021 17:07:52 - INFO - __main__ - Step 141230: {'lr': 4.320456799947414e-06, 'samples': 27116160, 'steps': 141229, 'loss/train': 1.2182456254959106} 11/07/2021 17:07:52 - INFO - __main__ - Step 141231: {'lr': 4.3194745341609e-06, 'samples': 27116352, 'steps': 141230, 'loss/train': 1.3451790809631348} 11/07/2021 17:07:53 - INFO - __main__ - Step 141232: {'lr': 4.318492379074224e-06, 'samples': 27116544, 'steps': 141231, 'loss/train': 1.1215802431106567} 11/07/2021 17:07:53 - INFO - __main__ - Step 141233: {'lr': 4.317510334687858e-06, 'samples': 27116736, 'steps': 141232, 'loss/train': 0.4364990293979645} 11/07/2021 17:07:54 - INFO - __main__ - Step 141234: {'lr': 4.316528401002246e-06, 'samples': 27116928, 'steps': 141233, 'loss/train': 1.625029444694519} 11/07/2021 17:07:54 - INFO - __main__ - Step 141235: {'lr': 4.3155465780177765e-06, 'samples': 27117120, 'steps': 141234, 'loss/train': 1.6921316385269165} 11/07/2021 17:07:54 - INFO - __main__ - Step 141236: {'lr': 4.3145648657349765e-06, 'samples': 27117312, 'steps': 141235, 'loss/train': 1.347169041633606} 11/07/2021 17:07:55 - INFO - __main__ - Step 141237: {'lr': 4.313583264154208e-06, 'samples': 27117504, 'steps': 141236, 'loss/train': 5.268321514129639} 11/07/2021 17:07:56 - INFO - __main__ - Step 141238: {'lr': 4.3126017732759705e-06, 'samples': 27117696, 'steps': 141237, 'loss/train': 0.9714957475662231} 11/07/2021 17:07:56 - INFO - __main__ - Step 141239: {'lr': 4.311620393100651e-06, 'samples': 27117888, 'steps': 141238, 'loss/train': 1.1665681600570679} 11/07/2021 17:07:56 - INFO - __main__ - Step 141240: {'lr': 4.310639123628751e-06, 'samples': 27118080, 'steps': 141239, 'loss/train': 0.7246345281600952} 11/07/2021 17:07:57 - INFO - __main__ - Step 141241: {'lr': 4.309657964860686e-06, 'samples': 27118272, 'steps': 141240, 'loss/train': 0.7104424834251404} 11/07/2021 17:07:57 - INFO - __main__ - Step 141242: {'lr': 4.3086769167969e-06, 'samples': 27118464, 'steps': 141241, 'loss/train': 0.8608599901199341} 11/07/2021 17:07:58 - INFO - __main__ - Step 141243: {'lr': 4.307695979437837e-06, 'samples': 27118656, 'steps': 141242, 'loss/train': 1.3633337020874023} 11/07/2021 17:07:59 - INFO - __main__ - Step 141244: {'lr': 4.3067151527839134e-06, 'samples': 27118848, 'steps': 141243, 'loss/train': 1.5223276615142822} 11/07/2021 17:07:59 - INFO - __main__ - Step 141245: {'lr': 4.305734436835601e-06, 'samples': 27119040, 'steps': 141244, 'loss/train': 1.1994625329971313} 11/07/2021 17:07:59 - INFO - __main__ - Step 141246: {'lr': 4.304753831593345e-06, 'samples': 27119232, 'steps': 141245, 'loss/train': 1.2679035663604736} 11/07/2021 17:08:00 - INFO - __main__ - Step 141247: {'lr': 4.30377333705756e-06, 'samples': 27119424, 'steps': 141246, 'loss/train': 1.5249048471450806} 11/07/2021 17:08:01 - INFO - __main__ - Step 141248: {'lr': 4.302792953228718e-06, 'samples': 27119616, 'steps': 141247, 'loss/train': 1.3844058513641357} 11/07/2021 17:08:02 - INFO - __main__ - Step 141249: {'lr': 4.301812680107208e-06, 'samples': 27119808, 'steps': 141248, 'loss/train': 1.2922616004943848} 11/07/2021 17:08:02 - INFO - __main__ - Step 141250: {'lr': 4.3008325176935596e-06, 'samples': 27120000, 'steps': 141249, 'loss/train': 1.1137644052505493} 11/07/2021 17:08:02 - INFO - __main__ - Step 141251: {'lr': 4.29985246598813e-06, 'samples': 27120192, 'steps': 141250, 'loss/train': 1.2963447570800781} 11/07/2021 17:08:03 - INFO - __main__ - Step 141252: {'lr': 4.298872524991421e-06, 'samples': 27120384, 'steps': 141251, 'loss/train': 0.9501886367797852} 11/07/2021 17:08:04 - INFO - __main__ - Step 141253: {'lr': 4.29789269470382e-06, 'samples': 27120576, 'steps': 141252, 'loss/train': 0.513279914855957} 11/07/2021 17:08:04 - INFO - __main__ - Step 141254: {'lr': 4.296912975125827e-06, 'samples': 27120768, 'steps': 141253, 'loss/train': 1.7078239917755127} 11/07/2021 17:08:04 - INFO - __main__ - Step 141255: {'lr': 4.295933366257832e-06, 'samples': 27120960, 'steps': 141254, 'loss/train': 0.7830960750579834} 11/07/2021 17:08:05 - INFO - __main__ - Step 141256: {'lr': 4.294953868100332e-06, 'samples': 27121152, 'steps': 141255, 'loss/train': 1.2921475172042847} 11/07/2021 17:08:05 - INFO - __main__ - Step 141257: {'lr': 4.293974480653718e-06, 'samples': 27121344, 'steps': 141256, 'loss/train': 0.9254564642906189} 11/07/2021 17:08:06 - INFO - __main__ - Step 141258: {'lr': 4.292995203918432e-06, 'samples': 27121536, 'steps': 141257, 'loss/train': 2.0666072368621826} 11/07/2021 17:08:07 - INFO - __main__ - Step 141259: {'lr': 4.292016037894919e-06, 'samples': 27121728, 'steps': 141258, 'loss/train': 1.024787425994873} 11/07/2021 17:08:07 - INFO - __main__ - Step 141260: {'lr': 4.291036982583651e-06, 'samples': 27121920, 'steps': 141259, 'loss/train': 1.4694322347640991} 11/07/2021 17:08:07 - INFO - __main__ - Step 141261: {'lr': 4.290058037985045e-06, 'samples': 27122112, 'steps': 141260, 'loss/train': 0.621557354927063} 11/07/2021 17:08:08 - INFO - __main__ - Step 141262: {'lr': 4.289079204099572e-06, 'samples': 27122304, 'steps': 141261, 'loss/train': 1.4066013097763062} 11/07/2021 17:08:08 - INFO - __main__ - Step 141263: {'lr': 4.28810048092762e-06, 'samples': 27122496, 'steps': 141262, 'loss/train': 0.6664243340492249} 11/07/2021 17:08:09 - INFO - __main__ - Step 141264: {'lr': 4.287121868469662e-06, 'samples': 27122688, 'steps': 141263, 'loss/train': 0.9927290081977844} 11/07/2021 17:08:09 - INFO - __main__ - Step 141265: {'lr': 4.286143366726142e-06, 'samples': 27122880, 'steps': 141264, 'loss/train': 0.8150977492332458} 11/07/2021 17:08:10 - INFO - __main__ - Step 141266: {'lr': 4.2851649756975034e-06, 'samples': 27123072, 'steps': 141265, 'loss/train': 1.6524296998977661} 11/07/2021 17:08:10 - INFO - __main__ - Step 141267: {'lr': 4.284186695384163e-06, 'samples': 27123264, 'steps': 141266, 'loss/train': 1.4787243604660034} 11/07/2021 17:08:10 - INFO - __main__ - Step 141268: {'lr': 4.2832085257865915e-06, 'samples': 27123456, 'steps': 141267, 'loss/train': 1.0536495447158813} 11/07/2021 17:08:12 - INFO - __main__ - Step 141269: {'lr': 4.282230466905207e-06, 'samples': 27123648, 'steps': 141268, 'loss/train': 1.2430142164230347} 11/07/2021 17:08:12 - INFO - __main__ - Step 141270: {'lr': 4.281252518740452e-06, 'samples': 27123840, 'steps': 141269, 'loss/train': 1.4892957210540771} 11/07/2021 17:08:12 - INFO - __main__ - Step 141271: {'lr': 4.280274681292773e-06, 'samples': 27124032, 'steps': 141270, 'loss/train': 1.1485086679458618} 11/07/2021 17:08:13 - INFO - __main__ - Step 141272: {'lr': 4.279296954562612e-06, 'samples': 27124224, 'steps': 141271, 'loss/train': 1.0764625072479248} 11/07/2021 17:08:13 - INFO - __main__ - Step 141273: {'lr': 4.278319338550413e-06, 'samples': 27124416, 'steps': 141272, 'loss/train': 1.5770978927612305} 11/07/2021 17:08:14 - INFO - __main__ - Step 141274: {'lr': 4.277341833256593e-06, 'samples': 27124608, 'steps': 141273, 'loss/train': 1.1462129354476929} 11/07/2021 17:08:14 - INFO - __main__ - Step 141275: {'lr': 4.276364438681624e-06, 'samples': 27124800, 'steps': 141274, 'loss/train': 1.201772928237915} 11/07/2021 17:08:15 - INFO - __main__ - Step 141276: {'lr': 4.275387154825949e-06, 'samples': 27124992, 'steps': 141275, 'loss/train': 1.493618130683899} 11/07/2021 17:08:15 - INFO - __main__ - Step 141277: {'lr': 4.274409981689958e-06, 'samples': 27125184, 'steps': 141276, 'loss/train': 0.9479673504829407} 11/07/2021 17:08:15 - INFO - __main__ - Step 141278: {'lr': 4.273432919274178e-06, 'samples': 27125376, 'steps': 141277, 'loss/train': 1.5459636449813843} 11/07/2021 17:08:16 - INFO - __main__ - Step 141279: {'lr': 4.27245596757897e-06, 'samples': 27125568, 'steps': 141278, 'loss/train': 1.4858119487762451} 11/07/2021 17:08:17 - INFO - __main__ - Step 141280: {'lr': 4.271479126604805e-06, 'samples': 27125760, 'steps': 141279, 'loss/train': 1.1349152326583862} 11/07/2021 17:08:17 - INFO - __main__ - Step 141281: {'lr': 4.2705023963520996e-06, 'samples': 27125952, 'steps': 141280, 'loss/train': 1.3827877044677734} 11/07/2021 17:08:17 - INFO - __main__ - Step 141282: {'lr': 4.269525776821326e-06, 'samples': 27126144, 'steps': 141281, 'loss/train': 0.6477691531181335} 11/07/2021 17:08:18 - INFO - __main__ - Step 141283: {'lr': 4.2685492680129e-06, 'samples': 27126336, 'steps': 141282, 'loss/train': 1.2082242965698242} 11/07/2021 17:08:18 - INFO - __main__ - Step 141284: {'lr': 4.2675728699272946e-06, 'samples': 27126528, 'steps': 141283, 'loss/train': 1.1851589679718018} 11/07/2021 17:08:19 - INFO - __main__ - Step 141285: {'lr': 4.266596582564925e-06, 'samples': 27126720, 'steps': 141284, 'loss/train': 1.5299172401428223} 11/07/2021 17:08:20 - INFO - __main__ - Step 141286: {'lr': 4.265620405926235e-06, 'samples': 27126912, 'steps': 141285, 'loss/train': 0.842440664768219} 11/07/2021 17:08:20 - INFO - __main__ - Step 141287: {'lr': 4.264644340011642e-06, 'samples': 27127104, 'steps': 141286, 'loss/train': 1.1652013063430786} 11/07/2021 17:08:20 - INFO - __main__ - Step 141288: {'lr': 4.263668384821645e-06, 'samples': 27127296, 'steps': 141287, 'loss/train': 0.07607757300138474} 11/07/2021 17:08:21 - INFO - __main__ - Step 141289: {'lr': 4.262692540356633e-06, 'samples': 27127488, 'steps': 141288, 'loss/train': 1.4861741065979004} 11/07/2021 17:08:22 - INFO - __main__ - Step 141290: {'lr': 4.2617168066170495e-06, 'samples': 27127680, 'steps': 141289, 'loss/train': 0.9411960244178772} 11/07/2021 17:08:22 - INFO - __main__ - Step 141291: {'lr': 4.2607411836033675e-06, 'samples': 27127872, 'steps': 141290, 'loss/train': 0.8033814430236816} 11/07/2021 17:08:23 - INFO - __main__ - Step 141292: {'lr': 4.259765671315974e-06, 'samples': 27128064, 'steps': 141291, 'loss/train': 1.0172637701034546} 11/07/2021 17:08:23 - INFO - __main__ - Step 141293: {'lr': 4.2587902697553695e-06, 'samples': 27128256, 'steps': 141292, 'loss/train': 1.2350516319274902} 11/07/2021 17:08:23 - INFO - __main__ - Step 141294: {'lr': 4.257814978921942e-06, 'samples': 27128448, 'steps': 141293, 'loss/train': 0.0748821496963501} 11/07/2021 17:08:25 - INFO - __main__ - Step 141295: {'lr': 4.256839798816137e-06, 'samples': 27128640, 'steps': 141294, 'loss/train': 1.1972169876098633} 11/07/2021 17:08:25 - INFO - __main__ - Step 141296: {'lr': 4.255864729438425e-06, 'samples': 27128832, 'steps': 141295, 'loss/train': 1.4146686792373657} 11/07/2021 17:08:25 - INFO - __main__ - Step 141297: {'lr': 4.254889770789222e-06, 'samples': 27129024, 'steps': 141296, 'loss/train': 0.3854982852935791} 11/07/2021 17:08:26 - INFO - __main__ - Step 141298: {'lr': 4.253914922868973e-06, 'samples': 27129216, 'steps': 141297, 'loss/train': 1.152289628982544} 11/07/2021 17:08:26 - INFO - __main__ - Step 141299: {'lr': 4.252940185678123e-06, 'samples': 27129408, 'steps': 141298, 'loss/train': 2.545156478881836} 11/07/2021 17:08:27 - INFO - __main__ - Step 141300: {'lr': 4.251965559217141e-06, 'samples': 27129600, 'steps': 141299, 'loss/train': 1.3758069276809692} 11/07/2021 17:08:27 - INFO - __main__ - Step 141301: {'lr': 4.250991043486391e-06, 'samples': 27129792, 'steps': 141300, 'loss/train': 1.4943599700927734} 11/07/2021 17:08:28 - INFO - __main__ - Step 141302: {'lr': 4.250016638486342e-06, 'samples': 27129984, 'steps': 141301, 'loss/train': 1.2215408086776733} 11/07/2021 17:08:28 - INFO - __main__ - Step 141303: {'lr': 4.249042344217469e-06, 'samples': 27130176, 'steps': 141302, 'loss/train': 1.8795000314712524} 11/07/2021 17:08:28 - INFO - __main__ - Step 141304: {'lr': 4.248068160680185e-06, 'samples': 27130368, 'steps': 141303, 'loss/train': 1.5156867504119873} 11/07/2021 17:08:30 - INFO - __main__ - Step 141305: {'lr': 4.247094087874909e-06, 'samples': 27130560, 'steps': 141304, 'loss/train': 1.3716771602630615} 11/07/2021 17:08:30 - INFO - __main__ - Step 141306: {'lr': 4.246120125802111e-06, 'samples': 27130752, 'steps': 141305, 'loss/train': 0.19085252285003662} 11/07/2021 17:08:30 - INFO - __main__ - Step 141307: {'lr': 4.245146274462208e-06, 'samples': 27130944, 'steps': 141306, 'loss/train': 1.3861620426177979} 11/07/2021 17:08:31 - INFO - __main__ - Step 141308: {'lr': 4.244172533855673e-06, 'samples': 27131136, 'steps': 141307, 'loss/train': 1.5576553344726562} 11/07/2021 17:08:31 - INFO - __main__ - Step 141309: {'lr': 4.243198903982892e-06, 'samples': 27131328, 'steps': 141308, 'loss/train': 1.57831609249115} 11/07/2021 17:08:32 - INFO - __main__ - Step 141310: {'lr': 4.242225384844367e-06, 'samples': 27131520, 'steps': 141309, 'loss/train': 0.9893909096717834} 11/07/2021 17:08:32 - INFO - __main__ - Step 141311: {'lr': 4.241251976440514e-06, 'samples': 27131712, 'steps': 141310, 'loss/train': 0.8863540887832642} 11/07/2021 17:08:33 - INFO - __main__ - Step 141312: {'lr': 4.24027867877172e-06, 'samples': 27131904, 'steps': 141311, 'loss/train': 0.9656922221183777} 11/07/2021 17:08:33 - INFO - __main__ - Step 141313: {'lr': 4.2393054918384855e-06, 'samples': 27132096, 'steps': 141312, 'loss/train': 1.301392912864685} 11/07/2021 17:08:33 - INFO - __main__ - Step 141314: {'lr': 4.2383324156412e-06, 'samples': 27132288, 'steps': 141313, 'loss/train': 1.2709472179412842} 11/07/2021 17:08:35 - INFO - __main__ - Step 141315: {'lr': 4.237359450180362e-06, 'samples': 27132480, 'steps': 141314, 'loss/train': 1.4152966737747192} 11/07/2021 17:08:35 - INFO - __main__ - Step 141316: {'lr': 4.23638659545636e-06, 'samples': 27132672, 'steps': 141315, 'loss/train': 1.133202314376831} 11/07/2021 17:08:35 - INFO - __main__ - Step 141317: {'lr': 4.235413851469666e-06, 'samples': 27132864, 'steps': 141316, 'loss/train': 1.2836394309997559} 11/07/2021 17:08:36 - INFO - __main__ - Step 141318: {'lr': 4.234441218220669e-06, 'samples': 27133056, 'steps': 141317, 'loss/train': 1.418775200843811} 11/07/2021 17:08:36 - INFO - __main__ - Step 141319: {'lr': 4.2334686957098685e-06, 'samples': 27133248, 'steps': 141318, 'loss/train': 1.3695216178894043} 11/07/2021 17:08:36 - INFO - __main__ - Step 141320: {'lr': 4.2324962839376815e-06, 'samples': 27133440, 'steps': 141319, 'loss/train': 0.8352346420288086} 11/07/2021 17:08:37 - INFO - __main__ - Step 141321: {'lr': 4.231523982904523e-06, 'samples': 27133632, 'steps': 141320, 'loss/train': 1.6494863033294678} 11/07/2021 17:08:38 - INFO - __main__ - Step 141322: {'lr': 4.230551792610837e-06, 'samples': 27133824, 'steps': 141321, 'loss/train': 0.909075140953064} 11/07/2021 17:08:38 - INFO - __main__ - Step 141323: {'lr': 4.2295797130570965e-06, 'samples': 27134016, 'steps': 141322, 'loss/train': 1.2075313329696655} 11/07/2021 17:08:38 - INFO - __main__ - Step 141324: {'lr': 4.228607744243717e-06, 'samples': 27134208, 'steps': 141323, 'loss/train': 1.235835075378418} 11/07/2021 17:08:39 - INFO - __main__ - Step 141325: {'lr': 4.227635886171116e-06, 'samples': 27134400, 'steps': 141324, 'loss/train': 1.1892045736312866} 11/07/2021 17:08:40 - INFO - __main__ - Step 141326: {'lr': 4.226664138839764e-06, 'samples': 27134592, 'steps': 141325, 'loss/train': 1.50801420211792} 11/07/2021 17:08:40 - INFO - __main__ - Step 141327: {'lr': 4.22569250225005e-06, 'samples': 27134784, 'steps': 141326, 'loss/train': 0.6661449074745178} 11/07/2021 17:08:41 - INFO - __main__ - Step 141328: {'lr': 4.224720976402474e-06, 'samples': 27134976, 'steps': 141327, 'loss/train': 1.3152490854263306} 11/07/2021 17:08:41 - INFO - __main__ - Step 141329: {'lr': 4.223749561297452e-06, 'samples': 27135168, 'steps': 141328, 'loss/train': 1.284403920173645} 11/07/2021 17:08:41 - INFO - __main__ - Step 141330: {'lr': 4.2227782569354e-06, 'samples': 27135360, 'steps': 141329, 'loss/train': 0.7327048182487488} 11/07/2021 17:08:42 - INFO - __main__ - Step 141331: {'lr': 4.22180706331679e-06, 'samples': 27135552, 'steps': 141330, 'loss/train': 1.5352941751480103} 11/07/2021 17:08:43 - INFO - __main__ - Step 141332: {'lr': 4.220835980442011e-06, 'samples': 27135744, 'steps': 141331, 'loss/train': 1.020729422569275} 11/07/2021 17:08:43 - INFO - __main__ - Step 141333: {'lr': 4.2198650083115634e-06, 'samples': 27135936, 'steps': 141332, 'loss/train': 1.5186054706573486} 11/07/2021 17:08:43 - INFO - __main__ - Step 141334: {'lr': 4.218894146925833e-06, 'samples': 27136128, 'steps': 141333, 'loss/train': 1.4444152116775513} 11/07/2021 17:08:44 - INFO - __main__ - Step 141335: {'lr': 4.217923396285295e-06, 'samples': 27136320, 'steps': 141334, 'loss/train': 1.4126524925231934} 11/07/2021 17:08:44 - INFO - __main__ - Step 141336: {'lr': 4.216952756390363e-06, 'samples': 27136512, 'steps': 141335, 'loss/train': 1.3349604606628418} 11/07/2021 17:08:45 - INFO - __main__ - Step 141337: {'lr': 4.215982227241483e-06, 'samples': 27136704, 'steps': 141336, 'loss/train': 1.0884536504745483} 11/07/2021 17:08:45 - INFO - __main__ - Step 141338: {'lr': 4.21501180883907e-06, 'samples': 27136896, 'steps': 141337, 'loss/train': 0.5214104056358337} 11/07/2021 17:08:46 - INFO - __main__ - Step 141339: {'lr': 4.214041501183596e-06, 'samples': 27137088, 'steps': 141338, 'loss/train': 1.1472936868667603} 11/07/2021 17:08:46 - INFO - __main__ - Step 141340: {'lr': 4.213071304275451e-06, 'samples': 27137280, 'steps': 141339, 'loss/train': 1.3331148624420166} 11/07/2021 17:08:47 - INFO - __main__ - Step 141341: {'lr': 4.212101218115133e-06, 'samples': 27137472, 'steps': 141340, 'loss/train': 0.8537743091583252} 11/07/2021 17:08:48 - INFO - __main__ - Step 141342: {'lr': 4.211131242703031e-06, 'samples': 27137664, 'steps': 141341, 'loss/train': 1.2697219848632812} 11/07/2021 17:08:48 - INFO - __main__ - Step 141343: {'lr': 4.210161378039618e-06, 'samples': 27137856, 'steps': 141342, 'loss/train': 1.1429036855697632} 11/07/2021 17:08:48 - INFO - __main__ - Step 141344: {'lr': 4.209191624125308e-06, 'samples': 27138048, 'steps': 141343, 'loss/train': 1.724770188331604} 11/07/2021 17:08:49 - INFO - __main__ - Step 141345: {'lr': 4.208221980960547e-06, 'samples': 27138240, 'steps': 141344, 'loss/train': 1.5674155950546265} 11/07/2021 17:08:49 - INFO - __main__ - Step 141346: {'lr': 4.207252448545751e-06, 'samples': 27138432, 'steps': 141345, 'loss/train': 0.995057225227356} 11/07/2021 17:08:50 - INFO - __main__ - Step 141347: {'lr': 4.206283026881391e-06, 'samples': 27138624, 'steps': 141346, 'loss/train': 1.1930102109909058} 11/07/2021 17:08:50 - INFO - __main__ - Step 141348: {'lr': 4.205313715967884e-06, 'samples': 27138816, 'steps': 141347, 'loss/train': 1.460007667541504} 11/07/2021 17:08:51 - INFO - __main__ - Step 141349: {'lr': 4.204344515805674e-06, 'samples': 27139008, 'steps': 141348, 'loss/train': 1.4140125513076782} 11/07/2021 17:08:51 - INFO - __main__ - Step 141350: {'lr': 4.203375426395206e-06, 'samples': 27139200, 'steps': 141349, 'loss/train': 1.5487823486328125} 11/07/2021 17:08:51 - INFO - __main__ - Step 141351: {'lr': 4.202406447736895e-06, 'samples': 27139392, 'steps': 141350, 'loss/train': 1.5143797397613525} 11/07/2021 17:08:53 - INFO - __main__ - Step 141352: {'lr': 4.201437579831158e-06, 'samples': 27139584, 'steps': 141351, 'loss/train': 1.4357199668884277} 11/07/2021 17:08:53 - INFO - __main__ - Step 141353: {'lr': 4.200468822678493e-06, 'samples': 27139776, 'steps': 141352, 'loss/train': 1.2624528408050537} 11/07/2021 17:08:53 - INFO - __main__ - Step 141354: {'lr': 4.199500176279291e-06, 'samples': 27139968, 'steps': 141353, 'loss/train': 1.3677270412445068} 11/07/2021 17:08:54 - INFO - __main__ - Step 141355: {'lr': 4.198531640633996e-06, 'samples': 27140160, 'steps': 141354, 'loss/train': 1.1949690580368042} 11/07/2021 17:08:54 - INFO - __main__ - Step 141356: {'lr': 4.19756321574305e-06, 'samples': 27140352, 'steps': 141355, 'loss/train': 1.6624321937561035} 11/07/2021 17:08:55 - INFO - __main__ - Step 141357: {'lr': 4.196594901606898e-06, 'samples': 27140544, 'steps': 141356, 'loss/train': 0.6116508841514587} 11/07/2021 17:08:55 - INFO - __main__ - Step 141358: {'lr': 4.195626698225957e-06, 'samples': 27140736, 'steps': 141357, 'loss/train': 1.2921223640441895} 11/07/2021 17:08:56 - INFO - __main__ - Step 141359: {'lr': 4.194658605600698e-06, 'samples': 27140928, 'steps': 141358, 'loss/train': 1.3564105033874512} 11/07/2021 17:08:56 - INFO - __main__ - Step 141360: {'lr': 4.193690623731511e-06, 'samples': 27141120, 'steps': 141359, 'loss/train': 1.2978514432907104} 11/07/2021 17:08:56 - INFO - __main__ - Step 141361: {'lr': 4.192722752618866e-06, 'samples': 27141312, 'steps': 141360, 'loss/train': 0.8957082629203796} 11/07/2021 17:08:57 - INFO - __main__ - Step 141362: {'lr': 4.19175499226318e-06, 'samples': 27141504, 'steps': 141361, 'loss/train': 1.182142734527588} 11/07/2021 17:08:58 - INFO - __main__ - Step 141363: {'lr': 4.190787342664898e-06, 'samples': 27141696, 'steps': 141362, 'loss/train': 1.273145318031311} 11/07/2021 17:08:58 - INFO - __main__ - Step 141364: {'lr': 4.189819803824463e-06, 'samples': 27141888, 'steps': 141363, 'loss/train': 0.7772733569145203} 11/07/2021 17:08:59 - INFO - __main__ - Step 141365: {'lr': 4.188852375742292e-06, 'samples': 27142080, 'steps': 141364, 'loss/train': 1.5597856044769287} 11/07/2021 17:08:59 - INFO - __main__ - Step 141366: {'lr': 4.187885058418828e-06, 'samples': 27142272, 'steps': 141365, 'loss/train': 0.8011024594306946} 11/07/2021 17:08:59 - INFO - __main__ - Step 141367: {'lr': 4.186917851854516e-06, 'samples': 27142464, 'steps': 141366, 'loss/train': 1.3924751281738281} 11/07/2021 17:09:00 - INFO - __main__ - Step 141368: {'lr': 4.185950756049772e-06, 'samples': 27142656, 'steps': 141367, 'loss/train': 1.2780301570892334} 11/07/2021 17:09:01 - INFO - __main__ - Step 141369: {'lr': 4.184983771005041e-06, 'samples': 27142848, 'steps': 141368, 'loss/train': 1.0005474090576172} 11/07/2021 17:09:01 - INFO - __main__ - Step 141370: {'lr': 4.184016896720793e-06, 'samples': 27143040, 'steps': 141369, 'loss/train': 1.1745209693908691} 11/07/2021 17:09:01 - INFO - __main__ - Step 141371: {'lr': 4.183050133197419e-06, 'samples': 27143232, 'steps': 141370, 'loss/train': 0.5793346166610718} 11/07/2021 17:09:02 - INFO - __main__ - Step 141372: {'lr': 4.182083480435362e-06, 'samples': 27143424, 'steps': 141371, 'loss/train': 1.6233686208724976} 11/07/2021 17:09:03 - INFO - __main__ - Step 141373: {'lr': 4.181116938435065e-06, 'samples': 27143616, 'steps': 141372, 'loss/train': 1.1565927267074585} 11/07/2021 17:09:03 - INFO - __main__ - Step 141374: {'lr': 4.180150507196973e-06, 'samples': 27143808, 'steps': 141373, 'loss/train': 0.890661358833313} 11/07/2021 17:09:04 - INFO - __main__ - Step 141375: {'lr': 4.1791841867215016e-06, 'samples': 27144000, 'steps': 141374, 'loss/train': 1.6428395509719849} 11/07/2021 17:09:04 - INFO - __main__ - Step 141376: {'lr': 4.178217977009097e-06, 'samples': 27144192, 'steps': 141375, 'loss/train': 1.0568498373031616} 11/07/2021 17:09:04 - INFO - __main__ - Step 141377: {'lr': 4.177251878060229e-06, 'samples': 27144384, 'steps': 141376, 'loss/train': 1.0210161209106445} 11/07/2021 17:09:05 - INFO - __main__ - Step 141378: {'lr': 4.176285889875259e-06, 'samples': 27144576, 'steps': 141377, 'loss/train': 1.4980264902114868} 11/07/2021 17:09:06 - INFO - __main__ - Step 141379: {'lr': 4.175320012454686e-06, 'samples': 27144768, 'steps': 141378, 'loss/train': 1.3657805919647217} 11/07/2021 17:09:06 - INFO - __main__ - Step 141380: {'lr': 4.1743542457989005e-06, 'samples': 27144960, 'steps': 141379, 'loss/train': 0.7943114638328552} 11/07/2021 17:09:06 - INFO - __main__ - Step 141381: {'lr': 4.173388589908372e-06, 'samples': 27145152, 'steps': 141380, 'loss/train': 1.6019949913024902} 11/07/2021 17:09:07 - INFO - __main__ - Step 141382: {'lr': 4.172423044783518e-06, 'samples': 27145344, 'steps': 141381, 'loss/train': 0.5825318098068237} 11/07/2021 17:09:08 - INFO - __main__ - Step 141383: {'lr': 4.171457610424756e-06, 'samples': 27145536, 'steps': 141382, 'loss/train': 1.175033688545227} 11/07/2021 17:09:08 - INFO - __main__ - Step 141384: {'lr': 4.170492286832556e-06, 'samples': 27145728, 'steps': 141383, 'loss/train': 1.5822597742080688} 11/07/2021 17:09:08 - INFO - __main__ - Step 141385: {'lr': 4.169527074007335e-06, 'samples': 27145920, 'steps': 141384, 'loss/train': 1.2406522035598755} 11/07/2021 17:09:09 - INFO - __main__ - Step 141386: {'lr': 4.168561971949536e-06, 'samples': 27146112, 'steps': 141385, 'loss/train': 1.3558725118637085} 11/07/2021 17:09:09 - INFO - __main__ - Step 141387: {'lr': 4.167596980659605e-06, 'samples': 27146304, 'steps': 141386, 'loss/train': 0.9984948039054871} 11/07/2021 17:09:10 - INFO - __main__ - Step 141388: {'lr': 4.166632100137957e-06, 'samples': 27146496, 'steps': 141387, 'loss/train': 1.2036609649658203} 11/07/2021 17:09:11 - INFO - __main__ - Step 141389: {'lr': 4.165667330385009e-06, 'samples': 27146688, 'steps': 141388, 'loss/train': 1.3497103452682495} 11/07/2021 17:09:11 - INFO - __main__ - Step 141390: {'lr': 4.16470267140126e-06, 'samples': 27146880, 'steps': 141389, 'loss/train': 1.2343324422836304} 11/07/2021 17:09:11 - INFO - __main__ - Step 141391: {'lr': 4.163738123187072e-06, 'samples': 27147072, 'steps': 141390, 'loss/train': 1.1974560022354126} 11/07/2021 17:09:12 - INFO - __main__ - Step 141392: {'lr': 4.162773685742888e-06, 'samples': 27147264, 'steps': 141391, 'loss/train': 0.9765795469284058} 11/07/2021 17:09:12 - INFO - __main__ - Step 141393: {'lr': 4.161809359069207e-06, 'samples': 27147456, 'steps': 141392, 'loss/train': 1.4169023036956787} 11/07/2021 17:09:13 - INFO - __main__ - Step 141394: {'lr': 4.160845143166392e-06, 'samples': 27147648, 'steps': 141393, 'loss/train': 0.9315038919448853} 11/07/2021 17:09:13 - INFO - __main__ - Step 141395: {'lr': 4.159881038034913e-06, 'samples': 27147840, 'steps': 141394, 'loss/train': 1.2628153562545776} 11/07/2021 17:09:14 - INFO - __main__ - Step 141396: {'lr': 4.158917043675214e-06, 'samples': 27148032, 'steps': 141395, 'loss/train': 1.435559868812561} 11/07/2021 17:09:14 - INFO - __main__ - Step 141397: {'lr': 4.157953160087685e-06, 'samples': 27148224, 'steps': 141396, 'loss/train': 0.9300529360771179} 11/07/2021 17:09:14 - INFO - __main__ - Step 141398: {'lr': 4.156989387272797e-06, 'samples': 27148416, 'steps': 141397, 'loss/train': 1.727115273475647} 11/07/2021 17:09:15 - INFO - __main__ - Step 141399: {'lr': 4.156025725230994e-06, 'samples': 27148608, 'steps': 141398, 'loss/train': 1.774735927581787} 11/07/2021 17:09:16 - INFO - __main__ - Step 141400: {'lr': 4.155062173962693e-06, 'samples': 27148800, 'steps': 141399, 'loss/train': 1.2802798748016357} 11/07/2021 17:09:16 - INFO - __main__ - Step 141401: {'lr': 4.1540987334683086e-06, 'samples': 27148992, 'steps': 141400, 'loss/train': 1.3921558856964111} 11/07/2021 17:09:16 - INFO - __main__ - Step 141402: {'lr': 4.153135403748287e-06, 'samples': 27149184, 'steps': 141401, 'loss/train': 1.2841302156448364} 11/07/2021 17:09:17 - INFO - __main__ - Step 141403: {'lr': 4.152172184803099e-06, 'samples': 27149376, 'steps': 141402, 'loss/train': 0.9247944355010986} 11/07/2021 17:09:18 - INFO - __main__ - Step 141404: {'lr': 4.151209076633133e-06, 'samples': 27149568, 'steps': 141403, 'loss/train': 1.0661545991897583} 11/07/2021 17:09:18 - INFO - __main__ - Step 141405: {'lr': 4.150246079238834e-06, 'samples': 27149760, 'steps': 141404, 'loss/train': 0.7691270709037781} 11/07/2021 17:09:19 - INFO - __main__ - Step 141406: {'lr': 4.149283192620645e-06, 'samples': 27149952, 'steps': 141405, 'loss/train': 1.859308123588562} 11/07/2021 17:09:19 - INFO - __main__ - Step 141407: {'lr': 4.148320416779011e-06, 'samples': 27150144, 'steps': 141406, 'loss/train': 1.5303804874420166} 11/07/2021 17:09:19 - INFO - __main__ - Step 141408: {'lr': 4.14735775171432e-06, 'samples': 27150336, 'steps': 141407, 'loss/train': 1.4714462757110596} 11/07/2021 17:09:20 - INFO - __main__ - Step 141409: {'lr': 4.1463951974270715e-06, 'samples': 27150528, 'steps': 141408, 'loss/train': 1.4425373077392578} 11/07/2021 17:09:21 - INFO - __main__ - Step 141410: {'lr': 4.145432753917627e-06, 'samples': 27150720, 'steps': 141409, 'loss/train': 1.396346926689148} 11/07/2021 17:09:21 - INFO - __main__ - Step 141411: {'lr': 4.144470421186486e-06, 'samples': 27150912, 'steps': 141410, 'loss/train': 1.3697696924209595} 11/07/2021 17:09:21 - INFO - __main__ - Step 141412: {'lr': 4.143508199234036e-06, 'samples': 27151104, 'steps': 141411, 'loss/train': 1.4698619842529297} 11/07/2021 17:09:22 - INFO - __main__ - Step 141413: {'lr': 4.142546088060722e-06, 'samples': 27151296, 'steps': 141412, 'loss/train': 1.4853595495224} 11/07/2021 17:09:23 - INFO - __main__ - Step 141414: {'lr': 4.141584087666988e-06, 'samples': 27151488, 'steps': 141413, 'loss/train': 1.2361116409301758} 11/07/2021 17:09:23 - INFO - __main__ - Step 141415: {'lr': 4.140622198053251e-06, 'samples': 27151680, 'steps': 141414, 'loss/train': 1.3930096626281738} 11/07/2021 17:09:23 - INFO - __main__ - Step 141416: {'lr': 4.139660419219981e-06, 'samples': 27151872, 'steps': 141415, 'loss/train': 1.102473258972168} 11/07/2021 17:09:24 - INFO - __main__ - Step 141417: {'lr': 4.138698751167597e-06, 'samples': 27152064, 'steps': 141416, 'loss/train': 1.4993332624435425} 11/07/2021 17:09:24 - INFO - __main__ - Step 141418: {'lr': 4.137737193896484e-06, 'samples': 27152256, 'steps': 141417, 'loss/train': 1.5284357070922852} 11/07/2021 17:09:25 - INFO - __main__ - Step 141419: {'lr': 4.136775747407145e-06, 'samples': 27152448, 'steps': 141418, 'loss/train': 1.568004846572876} 11/07/2021 17:09:26 - INFO - __main__ - Step 141420: {'lr': 4.135814411699967e-06, 'samples': 27152640, 'steps': 141419, 'loss/train': 1.0655993223190308} 11/07/2021 17:09:26 - INFO - __main__ - Step 141421: {'lr': 4.1348531867753945e-06, 'samples': 27152832, 'steps': 141420, 'loss/train': 1.7279266119003296} 11/07/2021 17:09:26 - INFO - __main__ - Step 141422: {'lr': 4.133892072633844e-06, 'samples': 27153024, 'steps': 141421, 'loss/train': 1.1775809526443481} 11/07/2021 17:09:27 - INFO - __main__ - Step 141423: {'lr': 4.132931069275786e-06, 'samples': 27153216, 'steps': 141422, 'loss/train': 1.0972918272018433} 11/07/2021 17:09:27 - INFO - __main__ - Step 141424: {'lr': 4.131970176701638e-06, 'samples': 27153408, 'steps': 141423, 'loss/train': 1.2942423820495605} 11/07/2021 17:09:28 - INFO - __main__ - Step 141425: {'lr': 4.1310093949118444e-06, 'samples': 27153600, 'steps': 141424, 'loss/train': 1.826538324356079} 11/07/2021 17:09:28 - INFO - __main__ - Step 141426: {'lr': 4.130048723906793e-06, 'samples': 27153792, 'steps': 141425, 'loss/train': 1.6399117708206177} 11/07/2021 17:09:29 - INFO - __main__ - Step 141427: {'lr': 4.129088163686956e-06, 'samples': 27153984, 'steps': 141426, 'loss/train': 1.6473833322525024} 11/07/2021 17:09:29 - INFO - __main__ - Step 141428: {'lr': 4.128127714252777e-06, 'samples': 27154176, 'steps': 141427, 'loss/train': 1.4460787773132324} 11/07/2021 17:09:29 - INFO - __main__ - Step 141429: {'lr': 4.127167375604646e-06, 'samples': 27154368, 'steps': 141428, 'loss/train': 0.7231354117393494} 11/07/2021 17:09:31 - INFO - __main__ - Step 141430: {'lr': 4.126207147743061e-06, 'samples': 27154560, 'steps': 141429, 'loss/train': 1.3616969585418701} 11/07/2021 17:09:31 - INFO - __main__ - Step 141431: {'lr': 4.125247030668383e-06, 'samples': 27154752, 'steps': 141430, 'loss/train': 1.5689303874969482} 11/07/2021 17:09:31 - INFO - __main__ - Step 141432: {'lr': 4.124287024381057e-06, 'samples': 27154944, 'steps': 141431, 'loss/train': 1.165781021118164} 11/07/2021 17:09:32 - INFO - __main__ - Step 141433: {'lr': 4.123327128881555e-06, 'samples': 27155136, 'steps': 141432, 'loss/train': 1.2443832159042358} 11/07/2021 17:09:32 - INFO - __main__ - Step 141434: {'lr': 4.1223673441702916e-06, 'samples': 27155328, 'steps': 141433, 'loss/train': 1.470565676689148} 11/07/2021 17:09:33 - INFO - __main__ - Step 141435: {'lr': 4.121407670247684e-06, 'samples': 27155520, 'steps': 141434, 'loss/train': 1.0111550092697144} 11/07/2021 17:09:33 - INFO - __main__ - Step 141436: {'lr': 4.120448107114177e-06, 'samples': 27155712, 'steps': 141435, 'loss/train': 1.3125170469284058} 11/07/2021 17:09:34 - INFO - __main__ - Step 141437: {'lr': 4.1194886547702145e-06, 'samples': 27155904, 'steps': 141436, 'loss/train': 1.1642487049102783} 11/07/2021 17:09:34 - INFO - __main__ - Step 141438: {'lr': 4.118529313216185e-06, 'samples': 27156096, 'steps': 141437, 'loss/train': 1.0484052896499634} 11/07/2021 17:09:34 - INFO - __main__ - Step 141439: {'lr': 4.117570082452587e-06, 'samples': 27156288, 'steps': 141438, 'loss/train': 1.4610490798950195} 11/07/2021 17:09:36 - INFO - __main__ - Step 141440: {'lr': 4.1166109624798106e-06, 'samples': 27156480, 'steps': 141439, 'loss/train': 1.5902594327926636} 11/07/2021 17:09:36 - INFO - __main__ - Step 141441: {'lr': 4.1156519532982716e-06, 'samples': 27156672, 'steps': 141440, 'loss/train': 0.9383496642112732} 11/07/2021 17:09:37 - INFO - __main__ - Step 141442: {'lr': 4.114693054908441e-06, 'samples': 27156864, 'steps': 141441, 'loss/train': 0.5123030543327332} 11/07/2021 17:09:37 - INFO - __main__ - Step 141443: {'lr': 4.1137342673107366e-06, 'samples': 27157056, 'steps': 141442, 'loss/train': 0.9833778142929077} 11/07/2021 17:09:37 - INFO - __main__ - Step 141444: {'lr': 4.112775590505602e-06, 'samples': 27157248, 'steps': 141443, 'loss/train': 1.2697360515594482} 11/07/2021 17:09:38 - INFO - __main__ - Step 141445: {'lr': 4.111817024493453e-06, 'samples': 27157440, 'steps': 141444, 'loss/train': 1.040461778640747} 11/07/2021 17:09:39 - INFO - __main__ - Step 141446: {'lr': 4.110858569274706e-06, 'samples': 27157632, 'steps': 141445, 'loss/train': 1.357329249382019} 11/07/2021 17:09:39 - INFO - __main__ - Step 141447: {'lr': 4.109900224849833e-06, 'samples': 27157824, 'steps': 141446, 'loss/train': 1.0896657705307007} 11/07/2021 17:09:39 - INFO - __main__ - Step 141448: {'lr': 4.108941991219222e-06, 'samples': 27158016, 'steps': 141447, 'loss/train': 0.9133830666542053} 11/07/2021 17:09:40 - INFO - __main__ - Step 141449: {'lr': 4.1079838683833474e-06, 'samples': 27158208, 'steps': 141448, 'loss/train': 1.623825192451477} 11/07/2021 17:09:41 - INFO - __main__ - Step 141450: {'lr': 4.107025856342595e-06, 'samples': 27158400, 'steps': 141449, 'loss/train': 1.5041488409042358} 11/07/2021 17:09:41 - INFO - __main__ - Step 141451: {'lr': 4.106067955097437e-06, 'samples': 27158592, 'steps': 141450, 'loss/train': 1.3122838735580444} 11/07/2021 17:09:42 - INFO - __main__ - Step 141452: {'lr': 4.10511016464829e-06, 'samples': 27158784, 'steps': 141451, 'loss/train': 1.6819933652877808} 11/07/2021 17:09:42 - INFO - __main__ - Step 141453: {'lr': 4.104152484995599e-06, 'samples': 27158976, 'steps': 141452, 'loss/train': 1.9030230045318604} 11/07/2021 17:09:42 - INFO - __main__ - Step 141454: {'lr': 4.10319491613978e-06, 'samples': 27159168, 'steps': 141453, 'loss/train': 1.3950830698013306} 11/07/2021 17:09:43 - INFO - __main__ - Step 141455: {'lr': 4.102237458081249e-06, 'samples': 27159360, 'steps': 141454, 'loss/train': 2.7347512245178223} 11/07/2021 17:09:44 - INFO - __main__ - Step 141456: {'lr': 4.101280110820477e-06, 'samples': 27159552, 'steps': 141455, 'loss/train': 1.497658371925354} 11/07/2021 17:09:44 - INFO - __main__ - Step 141457: {'lr': 4.100322874357882e-06, 'samples': 27159744, 'steps': 141456, 'loss/train': 1.5966089963912964} 11/07/2021 17:09:44 - INFO - __main__ - Step 141458: {'lr': 4.09936574869385e-06, 'samples': 27159936, 'steps': 141457, 'loss/train': 1.3623777627944946} 11/07/2021 17:09:45 - INFO - __main__ - Step 141459: {'lr': 4.098408733828856e-06, 'samples': 27160128, 'steps': 141458, 'loss/train': 0.6587188839912415} 11/07/2021 17:09:45 - INFO - __main__ - Step 141460: {'lr': 4.097451829763343e-06, 'samples': 27160320, 'steps': 141459, 'loss/train': 1.130037784576416} 11/07/2021 17:09:46 - INFO - __main__ - Step 141461: {'lr': 4.0964950364977274e-06, 'samples': 27160512, 'steps': 141460, 'loss/train': 1.3261511325836182} 11/07/2021 17:09:46 - INFO - __main__ - Step 141462: {'lr': 4.0955383540324244e-06, 'samples': 27160704, 'steps': 141461, 'loss/train': 1.0631240606307983} 11/07/2021 17:09:47 - INFO - __main__ - Step 141463: {'lr': 4.094581782367879e-06, 'samples': 27160896, 'steps': 141462, 'loss/train': 1.4800167083740234} 11/07/2021 17:09:47 - INFO - __main__ - Step 141464: {'lr': 4.093625321504507e-06, 'samples': 27161088, 'steps': 141463, 'loss/train': 1.0273503065109253} 11/07/2021 17:09:48 - INFO - __main__ - Step 141465: {'lr': 4.0926689714427534e-06, 'samples': 27161280, 'steps': 141464, 'loss/train': 1.3941566944122314} 11/07/2021 17:09:49 - INFO - __main__ - Step 141466: {'lr': 4.091712732183062e-06, 'samples': 27161472, 'steps': 141465, 'loss/train': 1.288442850112915} 11/07/2021 17:09:49 - INFO - __main__ - Step 141467: {'lr': 4.090756603725848e-06, 'samples': 27161664, 'steps': 141466, 'loss/train': 0.8188799023628235} 11/07/2021 17:09:49 - INFO - __main__ - Step 141468: {'lr': 4.089800586071557e-06, 'samples': 27161856, 'steps': 141467, 'loss/train': 1.1317204236984253} 11/07/2021 17:09:50 - INFO - __main__ - Step 141469: {'lr': 4.088844679220604e-06, 'samples': 27162048, 'steps': 141468, 'loss/train': 1.289552092552185} 11/07/2021 17:09:50 - INFO - __main__ - Step 141470: {'lr': 4.087888883173407e-06, 'samples': 27162240, 'steps': 141469, 'loss/train': 1.256334900856018} 11/07/2021 17:09:51 - INFO - __main__ - Step 141471: {'lr': 4.086933197930437e-06, 'samples': 27162432, 'steps': 141470, 'loss/train': 0.8981807231903076} 11/07/2021 17:09:51 - INFO - __main__ - Step 141472: {'lr': 4.085977623492082e-06, 'samples': 27162624, 'steps': 141471, 'loss/train': 1.3386622667312622} 11/07/2021 17:09:52 - INFO - __main__ - Step 141473: {'lr': 4.085022159858787e-06, 'samples': 27162816, 'steps': 141472, 'loss/train': 1.2276495695114136} 11/07/2021 17:09:52 - INFO - __main__ - Step 141474: {'lr': 4.084066807030995e-06, 'samples': 27163008, 'steps': 141473, 'loss/train': 1.5783066749572754} 11/07/2021 17:09:52 - INFO - __main__ - Step 141475: {'lr': 4.083111565009124e-06, 'samples': 27163200, 'steps': 141474, 'loss/train': 1.665966272354126} 11/07/2021 17:09:53 - INFO - __main__ - Step 141476: {'lr': 4.082156433793588e-06, 'samples': 27163392, 'steps': 141475, 'loss/train': 0.6175978779792786} 11/07/2021 17:09:54 - INFO - __main__ - Step 141477: {'lr': 4.08120141338486e-06, 'samples': 27163584, 'steps': 141476, 'loss/train': 1.0084398984909058} 11/07/2021 17:09:54 - INFO - __main__ - Step 141478: {'lr': 4.080246503783358e-06, 'samples': 27163776, 'steps': 141477, 'loss/train': 1.3521077632904053} 11/07/2021 17:09:55 - INFO - __main__ - Step 141479: {'lr': 4.079291704989496e-06, 'samples': 27163968, 'steps': 141478, 'loss/train': 1.9316949844360352} 11/07/2021 17:09:55 - INFO - __main__ - Step 141480: {'lr': 4.078337017003692e-06, 'samples': 27164160, 'steps': 141479, 'loss/train': 1.0302541255950928} 11/07/2021 17:09:55 - INFO - __main__ - Step 141481: {'lr': 4.077382439826416e-06, 'samples': 27164352, 'steps': 141480, 'loss/train': 1.041216254234314} 11/07/2021 17:09:56 - INFO - __main__ - Step 141482: {'lr': 4.076427973458058e-06, 'samples': 27164544, 'steps': 141481, 'loss/train': 1.7245588302612305} 11/07/2021 17:09:57 - INFO - __main__ - Step 141483: {'lr': 4.075473617899062e-06, 'samples': 27164736, 'steps': 141482, 'loss/train': 1.4938995838165283} 11/07/2021 17:09:57 - INFO - __main__ - Step 141484: {'lr': 4.074519373149899e-06, 'samples': 27164928, 'steps': 141483, 'loss/train': 1.6703664064407349} 11/07/2021 17:09:57 - INFO - __main__ - Step 141485: {'lr': 4.073565239210958e-06, 'samples': 27165120, 'steps': 141484, 'loss/train': 1.4073867797851562} 11/07/2021 17:09:58 - INFO - __main__ - Step 141486: {'lr': 4.072611216082656e-06, 'samples': 27165312, 'steps': 141485, 'loss/train': 0.8688284158706665} 11/07/2021 17:09:59 - INFO - __main__ - Step 141487: {'lr': 4.071657303765436e-06, 'samples': 27165504, 'steps': 141486, 'loss/train': 0.891402542591095} 11/07/2021 17:09:59 - INFO - __main__ - Step 141488: {'lr': 4.070703502259743e-06, 'samples': 27165696, 'steps': 141487, 'loss/train': 0.9777089953422546} 11/07/2021 17:09:59 - INFO - __main__ - Step 141489: {'lr': 4.069749811565965e-06, 'samples': 27165888, 'steps': 141488, 'loss/train': 1.430432915687561} 11/07/2021 17:10:00 - INFO - __main__ - Step 141490: {'lr': 4.068796231684602e-06, 'samples': 27166080, 'steps': 141489, 'loss/train': 1.3724881410598755} 11/07/2021 17:10:00 - INFO - __main__ - Step 141491: {'lr': 4.067842762616014e-06, 'samples': 27166272, 'steps': 141490, 'loss/train': 1.4439104795455933} 11/07/2021 17:10:01 - INFO - __main__ - Step 141492: {'lr': 4.066889404360702e-06, 'samples': 27166464, 'steps': 141491, 'loss/train': 1.0958213806152344} 11/07/2021 17:10:02 - INFO - __main__ - Step 141493: {'lr': 4.0659361569190254e-06, 'samples': 27166656, 'steps': 141492, 'loss/train': 1.3772085905075073} 11/07/2021 17:10:02 - INFO - __main__ - Step 141494: {'lr': 4.064983020291429e-06, 'samples': 27166848, 'steps': 141493, 'loss/train': 1.0016224384307861} 11/07/2021 17:10:02 - INFO - __main__ - Step 141495: {'lr': 4.064029994478385e-06, 'samples': 27167040, 'steps': 141494, 'loss/train': 0.9176458716392517} 11/07/2021 17:10:03 - INFO - __main__ - Step 141496: {'lr': 4.063077079480282e-06, 'samples': 27167232, 'steps': 141495, 'loss/train': 1.4304720163345337} 11/07/2021 17:10:04 - INFO - __main__ - Step 141497: {'lr': 4.062124275297563e-06, 'samples': 27167424, 'steps': 141496, 'loss/train': 1.3337675333023071} 11/07/2021 17:10:04 - INFO - __main__ - Step 141498: {'lr': 4.061171581930673e-06, 'samples': 27167616, 'steps': 141497, 'loss/train': 1.463760256767273} 11/07/2021 17:10:04 - INFO - __main__ - Step 141499: {'lr': 4.06021899938e-06, 'samples': 27167808, 'steps': 141498, 'loss/train': 1.3907418251037598} 11/07/2021 17:10:05 - INFO - __main__ - Step 141500: {'lr': 4.059266527646016e-06, 'samples': 27168000, 'steps': 141499, 'loss/train': 1.2105755805969238} 11/07/2021 17:10:05 - INFO - __main__ - Step 141501: {'lr': 4.05831416672911e-06, 'samples': 27168192, 'steps': 141500, 'loss/train': 1.3665379285812378} 11/07/2021 17:10:06 - INFO - __main__ - Step 141502: {'lr': 4.0573619166297536e-06, 'samples': 27168384, 'steps': 141501, 'loss/train': 1.0006731748580933} 11/07/2021 17:10:06 - INFO - __main__ - Step 141503: {'lr': 4.0564097773483356e-06, 'samples': 27168576, 'steps': 141502, 'loss/train': 1.4911121129989624} 11/07/2021 17:10:07 - INFO - __main__ - Step 141504: {'lr': 4.055457748885299e-06, 'samples': 27168768, 'steps': 141503, 'loss/train': 1.1064260005950928} 11/07/2021 17:10:07 - INFO - __main__ - Step 141505: {'lr': 4.05450583124109e-06, 'samples': 27168960, 'steps': 141504, 'loss/train': 1.278484582901001} 11/07/2021 17:10:07 - INFO - __main__ - Step 141506: {'lr': 4.053554024416123e-06, 'samples': 27169152, 'steps': 141505, 'loss/train': 1.1256572008132935} 11/07/2021 17:10:08 - INFO - __main__ - Step 141507: {'lr': 4.052602328410842e-06, 'samples': 27169344, 'steps': 141506, 'loss/train': 1.4772826433181763} 11/07/2021 17:10:09 - INFO - __main__ - Step 141508: {'lr': 4.051650743225666e-06, 'samples': 27169536, 'steps': 141507, 'loss/train': 1.0161974430084229} 11/07/2021 17:10:09 - INFO - __main__ - Step 141509: {'lr': 4.050699268861008e-06, 'samples': 27169728, 'steps': 141508, 'loss/train': 1.3834154605865479} 11/07/2021 17:10:09 - INFO - __main__ - Step 141510: {'lr': 4.049747905317314e-06, 'samples': 27169920, 'steps': 141509, 'loss/train': 0.8817458748817444} 11/07/2021 17:10:10 - INFO - __main__ - Step 141511: {'lr': 4.048796652594999e-06, 'samples': 27170112, 'steps': 141510, 'loss/train': 1.2009185552597046} 11/07/2021 17:10:10 - INFO - __main__ - Step 141512: {'lr': 4.047845510694509e-06, 'samples': 27170304, 'steps': 141511, 'loss/train': 1.2968616485595703} 11/07/2021 17:10:11 - INFO - __main__ - Step 141513: {'lr': 4.046894479616259e-06, 'samples': 27170496, 'steps': 141512, 'loss/train': 0.3410485088825226} 11/07/2021 17:10:12 - INFO - __main__ - Step 141514: {'lr': 4.045943559360693e-06, 'samples': 27170688, 'steps': 141513, 'loss/train': 1.4021743535995483} 11/07/2021 17:10:12 - INFO - __main__ - Step 141515: {'lr': 4.044992749928228e-06, 'samples': 27170880, 'steps': 141514, 'loss/train': 1.5848987102508545} 11/07/2021 17:10:12 - INFO - __main__ - Step 141516: {'lr': 4.04404205131928e-06, 'samples': 27171072, 'steps': 141515, 'loss/train': 1.3114854097366333} 11/07/2021 17:10:13 - INFO - __main__ - Step 141517: {'lr': 4.043091463534321e-06, 'samples': 27171264, 'steps': 141516, 'loss/train': 1.0294972658157349} 11/07/2021 17:10:14 - INFO - __main__ - Step 141518: {'lr': 4.0421409865737115e-06, 'samples': 27171456, 'steps': 141517, 'loss/train': 1.0662487745285034} 11/07/2021 17:10:14 - INFO - __main__ - Step 141519: {'lr': 4.041190620437951e-06, 'samples': 27171648, 'steps': 141518, 'loss/train': 1.4795098304748535} 11/07/2021 17:10:14 - INFO - __main__ - Step 141520: {'lr': 4.040240365127401e-06, 'samples': 27171840, 'steps': 141519, 'loss/train': 0.7920264005661011} 11/07/2021 17:10:15 - INFO - __main__ - Step 141521: {'lr': 4.039290220642533e-06, 'samples': 27172032, 'steps': 141520, 'loss/train': 1.23479163646698} 11/07/2021 17:10:15 - INFO - __main__ - Step 141522: {'lr': 4.038340186983791e-06, 'samples': 27172224, 'steps': 141521, 'loss/train': 1.4085183143615723} 11/07/2021 17:10:16 - INFO - __main__ - Step 141523: {'lr': 4.037390264151564e-06, 'samples': 27172416, 'steps': 141522, 'loss/train': 0.9887630939483643} 11/07/2021 17:10:17 - INFO - __main__ - Step 141524: {'lr': 4.036440452146267e-06, 'samples': 27172608, 'steps': 141523, 'loss/train': 1.2016466856002808} 11/07/2021 17:10:17 - INFO - __main__ - Step 141525: {'lr': 4.035490750968401e-06, 'samples': 27172800, 'steps': 141524, 'loss/train': 0.9867158532142639} 11/07/2021 17:10:17 - INFO - __main__ - Step 141526: {'lr': 4.034541160618327e-06, 'samples': 27172992, 'steps': 141525, 'loss/train': 1.3692067861557007} 11/07/2021 17:10:18 - INFO - __main__ - Step 141527: {'lr': 4.033591681096488e-06, 'samples': 27173184, 'steps': 141526, 'loss/train': 1.2283562421798706} 11/07/2021 17:10:19 - INFO - __main__ - Step 141528: {'lr': 4.032642312403329e-06, 'samples': 27173376, 'steps': 141527, 'loss/train': 1.0814168453216553} 11/07/2021 17:10:19 - INFO - __main__ - Step 141529: {'lr': 4.0316930545392376e-06, 'samples': 27173568, 'steps': 141528, 'loss/train': 0.5089592337608337} 11/07/2021 17:10:20 - INFO - __main__ - Step 141530: {'lr': 4.030743907504686e-06, 'samples': 27173760, 'steps': 141529, 'loss/train': 1.0240453481674194} 11/07/2021 17:10:20 - INFO - __main__ - Step 141531: {'lr': 4.029794871300091e-06, 'samples': 27173952, 'steps': 141530, 'loss/train': 1.6459414958953857} 11/07/2021 17:10:20 - INFO - __main__ - Step 141532: {'lr': 4.028845945925868e-06, 'samples': 27174144, 'steps': 141531, 'loss/train': 1.2215303182601929} 11/07/2021 17:10:21 - INFO - __main__ - Step 141533: {'lr': 4.027897131382463e-06, 'samples': 27174336, 'steps': 141532, 'loss/train': 1.9906374216079712} 11/07/2021 17:10:22 - INFO - __main__ - Step 141534: {'lr': 4.02694842767029e-06, 'samples': 27174528, 'steps': 141533, 'loss/train': 1.3013055324554443} 11/07/2021 17:10:22 - INFO - __main__ - Step 141535: {'lr': 4.025999834789768e-06, 'samples': 27174720, 'steps': 141534, 'loss/train': 1.3495579957962036} 11/07/2021 17:10:22 - INFO - __main__ - Step 141536: {'lr': 4.025051352741366e-06, 'samples': 27174912, 'steps': 141535, 'loss/train': 0.8228867053985596} 11/07/2021 17:10:23 - INFO - __main__ - Step 141537: {'lr': 4.024102981525446e-06, 'samples': 27175104, 'steps': 141536, 'loss/train': 1.4979126453399658} 11/07/2021 17:10:24 - INFO - __main__ - Step 141538: {'lr': 4.0231547211424805e-06, 'samples': 27175296, 'steps': 141537, 'loss/train': 1.5197267532348633} 11/07/2021 17:10:24 - INFO - __main__ - Step 141539: {'lr': 4.0222065715928845e-06, 'samples': 27175488, 'steps': 141538, 'loss/train': 1.130186915397644} 11/07/2021 17:10:25 - INFO - __main__ - Step 141540: {'lr': 4.021258532877075e-06, 'samples': 27175680, 'steps': 141539, 'loss/train': 1.4287141561508179} 11/07/2021 17:10:25 - INFO - __main__ - Step 141541: {'lr': 4.020310604995498e-06, 'samples': 27175872, 'steps': 141540, 'loss/train': 0.7951191663742065} 11/07/2021 17:10:25 - INFO - __main__ - Step 141542: {'lr': 4.019362787948566e-06, 'samples': 27176064, 'steps': 141541, 'loss/train': 1.2535152435302734} 11/07/2021 17:10:26 - INFO - __main__ - Step 141543: {'lr': 4.018415081736726e-06, 'samples': 27176256, 'steps': 141542, 'loss/train': 0.03994724527001381} 11/07/2021 17:10:27 - INFO - __main__ - Step 141544: {'lr': 4.017467486360393e-06, 'samples': 27176448, 'steps': 141543, 'loss/train': 1.1741406917572021} 11/07/2021 17:10:27 - INFO - __main__ - Step 141545: {'lr': 4.016520001819984e-06, 'samples': 27176640, 'steps': 141544, 'loss/train': 1.4943783283233643} 11/07/2021 17:10:27 - INFO - __main__ - Step 141546: {'lr': 4.015572628115943e-06, 'samples': 27176832, 'steps': 141545, 'loss/train': 0.9824565649032593} 11/07/2021 17:10:28 - INFO - __main__ - Step 141547: {'lr': 4.014625365248714e-06, 'samples': 27177024, 'steps': 141546, 'loss/train': 1.647679090499878} 11/07/2021 17:10:29 - INFO - __main__ - Step 141548: {'lr': 4.013678213218685e-06, 'samples': 27177216, 'steps': 141547, 'loss/train': 1.1380962133407593} 11/07/2021 17:10:29 - INFO - __main__ - Step 141549: {'lr': 4.012731172026274e-06, 'samples': 27177408, 'steps': 141548, 'loss/train': 1.404737949371338} 11/07/2021 17:10:30 - INFO - __main__ - Step 141550: {'lr': 4.011784241671923e-06, 'samples': 27177600, 'steps': 141549, 'loss/train': 1.56475830078125} 11/07/2021 17:10:30 - INFO - __main__ - Step 141551: {'lr': 4.010837422156105e-06, 'samples': 27177792, 'steps': 141550, 'loss/train': 1.2782676219940186} 11/07/2021 17:10:30 - INFO - __main__ - Step 141552: {'lr': 4.009890713479181e-06, 'samples': 27177984, 'steps': 141551, 'loss/train': 1.445137858390808} 11/07/2021 17:10:31 - INFO - __main__ - Step 141553: {'lr': 4.008944115641594e-06, 'samples': 27178176, 'steps': 141552, 'loss/train': 0.7710831761360168} 11/07/2021 17:10:32 - INFO - __main__ - Step 141554: {'lr': 4.007997628643817e-06, 'samples': 27178368, 'steps': 141553, 'loss/train': 1.100628137588501} 11/07/2021 17:10:32 - INFO - __main__ - Step 141555: {'lr': 4.00705125248621e-06, 'samples': 27178560, 'steps': 141554, 'loss/train': 1.5202336311340332} 11/07/2021 17:10:33 - INFO - __main__ - Step 141556: {'lr': 4.006104987169246e-06, 'samples': 27178752, 'steps': 141555, 'loss/train': 1.3922187089920044} 11/07/2021 17:10:33 - INFO - __main__ - Step 141557: {'lr': 4.005158832693312e-06, 'samples': 27178944, 'steps': 141556, 'loss/train': 1.3342925310134888} 11/07/2021 17:10:33 - INFO - __main__ - Step 141558: {'lr': 4.004212789058909e-06, 'samples': 27179136, 'steps': 141557, 'loss/train': 1.0762543678283691} 11/07/2021 17:10:34 - INFO - __main__ - Step 141559: {'lr': 4.0032668562663685e-06, 'samples': 27179328, 'steps': 141558, 'loss/train': 1.3627065420150757} 11/07/2021 17:10:35 - INFO - __main__ - Step 141560: {'lr': 4.002321034316164e-06, 'samples': 27179520, 'steps': 141559, 'loss/train': 1.0895942449569702} 11/07/2021 17:10:35 - INFO - __main__ - Step 141561: {'lr': 4.001375323208739e-06, 'samples': 27179712, 'steps': 141560, 'loss/train': 1.353954553604126} 11/07/2021 17:10:35 - INFO - __main__ - Step 141562: {'lr': 4.000429722944482e-06, 'samples': 27179904, 'steps': 141561, 'loss/train': 1.247206449508667} 11/07/2021 17:10:36 - INFO - __main__ - Step 141563: {'lr': 3.999484233523837e-06, 'samples': 27180096, 'steps': 141562, 'loss/train': 1.0628645420074463} 11/07/2021 17:10:37 - INFO - __main__ - Step 141564: {'lr': 3.9985388549472205e-06, 'samples': 27180288, 'steps': 141563, 'loss/train': 1.476744532585144} 11/07/2021 17:10:37 - INFO - __main__ - Step 141565: {'lr': 3.997593587215076e-06, 'samples': 27180480, 'steps': 141564, 'loss/train': 1.3222317695617676} 11/07/2021 17:10:37 - INFO - __main__ - Step 141566: {'lr': 3.996648430327821e-06, 'samples': 27180672, 'steps': 141565, 'loss/train': 1.355832815170288} 11/07/2021 17:10:38 - INFO - __main__ - Step 141567: {'lr': 3.99570338428587e-06, 'samples': 27180864, 'steps': 141566, 'loss/train': 1.6075363159179688} 11/07/2021 17:10:38 - INFO - __main__ - Step 141568: {'lr': 3.994758449089669e-06, 'samples': 27181056, 'steps': 141567, 'loss/train': 0.566460907459259} 11/07/2021 17:10:38 - INFO - __main__ - Step 141569: {'lr': 3.993813624739634e-06, 'samples': 27181248, 'steps': 141568, 'loss/train': 0.6586307287216187} 11/07/2021 17:10:40 - INFO - __main__ - Step 141570: {'lr': 3.992868911236181e-06, 'samples': 27181440, 'steps': 141569, 'loss/train': 0.936235249042511} 11/07/2021 17:10:40 - INFO - __main__ - Step 141571: {'lr': 3.991924308579753e-06, 'samples': 27181632, 'steps': 141570, 'loss/train': 1.4964455366134644} 11/07/2021 17:10:40 - INFO - __main__ - Step 141572: {'lr': 3.990979816770768e-06, 'samples': 27181824, 'steps': 141571, 'loss/train': 1.4709851741790771} 11/07/2021 17:10:41 - INFO - __main__ - Step 141573: {'lr': 3.990035435809641e-06, 'samples': 27182016, 'steps': 141572, 'loss/train': 1.4122393131256104} 11/07/2021 17:10:41 - INFO - __main__ - Step 141574: {'lr': 3.989091165696817e-06, 'samples': 27182208, 'steps': 141573, 'loss/train': 1.2748677730560303} 11/07/2021 17:10:42 - INFO - __main__ - Step 141575: {'lr': 3.988147006432713e-06, 'samples': 27182400, 'steps': 141574, 'loss/train': 1.2750515937805176} 11/07/2021 17:10:42 - INFO - __main__ - Step 141576: {'lr': 3.987202958017744e-06, 'samples': 27182592, 'steps': 141575, 'loss/train': 1.5339537858963013} 11/07/2021 17:10:43 - INFO - __main__ - Step 141577: {'lr': 3.986259020452354e-06, 'samples': 27182784, 'steps': 141576, 'loss/train': 1.3782331943511963} 11/07/2021 17:10:43 - INFO - __main__ - Step 141578: {'lr': 3.98531519373696e-06, 'samples': 27182976, 'steps': 141577, 'loss/train': 1.198997139930725} 11/07/2021 17:10:43 - INFO - __main__ - Step 141579: {'lr': 3.984371477871979e-06, 'samples': 27183168, 'steps': 141578, 'loss/train': 1.5591577291488647} 11/07/2021 17:10:44 - INFO - __main__ - Step 141580: {'lr': 3.983427872857881e-06, 'samples': 27183360, 'steps': 141579, 'loss/train': 1.231576681137085} 11/07/2021 17:10:45 - INFO - __main__ - Step 141581: {'lr': 3.982484378695e-06, 'samples': 27183552, 'steps': 141580, 'loss/train': 1.5020663738250732} 11/07/2021 17:10:45 - INFO - __main__ - Step 141582: {'lr': 3.981540995383864e-06, 'samples': 27183744, 'steps': 141581, 'loss/train': 1.4180269241333008} 11/07/2021 17:10:46 - INFO - __main__ - Step 141583: {'lr': 3.9805977229248054e-06, 'samples': 27183936, 'steps': 141582, 'loss/train': 0.6253988742828369} 11/07/2021 17:10:46 - INFO - __main__ - Step 141584: {'lr': 3.979654561318324e-06, 'samples': 27184128, 'steps': 141583, 'loss/train': 1.8933696746826172} 11/07/2021 17:10:47 - INFO - __main__ - Step 141585: {'lr': 3.9787115105647806e-06, 'samples': 27184320, 'steps': 141584, 'loss/train': 2.0787720680236816} 11/07/2021 17:10:47 - INFO - __main__ - Step 141586: {'lr': 3.977768570664675e-06, 'samples': 27184512, 'steps': 141585, 'loss/train': 0.6662040948867798} 11/07/2021 17:10:48 - INFO - __main__ - Step 141587: {'lr': 3.976825741618367e-06, 'samples': 27184704, 'steps': 141586, 'loss/train': 1.4726989269256592} 11/07/2021 17:10:48 - INFO - __main__ - Step 141588: {'lr': 3.975883023426302e-06, 'samples': 27184896, 'steps': 141587, 'loss/train': 1.3857436180114746} 11/07/2021 17:10:49 - INFO - __main__ - Step 141589: {'lr': 3.974940416088896e-06, 'samples': 27185088, 'steps': 141588, 'loss/train': 0.05048297345638275} 11/07/2021 17:10:50 - INFO - __main__ - Step 141590: {'lr': 3.97399791960662e-06, 'samples': 27185280, 'steps': 141589, 'loss/train': 1.3064805269241333} 11/07/2021 17:10:50 - INFO - __main__ - Step 141591: {'lr': 3.9730555339798355e-06, 'samples': 27185472, 'steps': 141590, 'loss/train': 1.552980661392212} 11/07/2021 17:10:50 - INFO - __main__ - Step 141592: {'lr': 3.972113259209015e-06, 'samples': 27185664, 'steps': 141591, 'loss/train': 1.0941301584243774} 11/07/2021 17:10:51 - INFO - __main__ - Step 141593: {'lr': 3.9711710952945736e-06, 'samples': 27185856, 'steps': 141592, 'loss/train': 1.3664348125457764} 11/07/2021 17:10:51 - INFO - __main__ - Step 141594: {'lr': 3.9702290422369005e-06, 'samples': 27186048, 'steps': 141593, 'loss/train': 0.9335242509841919} 11/07/2021 17:10:52 - INFO - __main__ - Step 141595: {'lr': 3.96928710003644e-06, 'samples': 27186240, 'steps': 141594, 'loss/train': 1.0960992574691772} 11/07/2021 17:10:53 - INFO - __main__ - Step 141596: {'lr': 3.968345268693635e-06, 'samples': 27186432, 'steps': 141595, 'loss/train': 1.0796117782592773} 11/07/2021 17:10:53 - INFO - __main__ - Step 141597: {'lr': 3.967403548208903e-06, 'samples': 27186624, 'steps': 141596, 'loss/train': 1.328992486000061} 11/07/2021 17:10:53 - INFO - __main__ - Step 141598: {'lr': 3.96646193858266e-06, 'samples': 27186816, 'steps': 141597, 'loss/train': 1.5488533973693848} 11/07/2021 17:10:54 - INFO - __main__ - Step 141599: {'lr': 3.965520439815323e-06, 'samples': 27187008, 'steps': 141598, 'loss/train': 1.3139533996582031} 11/07/2021 17:10:55 - INFO - __main__ - Step 141600: {'lr': 3.964579051907335e-06, 'samples': 27187200, 'steps': 141599, 'loss/train': 0.11642036586999893} 11/07/2021 17:10:55 - INFO - __main__ - Step 141601: {'lr': 3.963637774859114e-06, 'samples': 27187392, 'steps': 141600, 'loss/train': 1.441933035850525} 11/07/2021 17:10:55 - INFO - __main__ - Step 141602: {'lr': 3.9626966086710735e-06, 'samples': 27187584, 'steps': 141601, 'loss/train': 1.4892741441726685} 11/07/2021 17:10:56 - INFO - __main__ - Step 141603: {'lr': 3.96175555334366e-06, 'samples': 27187776, 'steps': 141602, 'loss/train': 1.1527175903320312} 11/07/2021 17:10:56 - INFO - __main__ - Step 141604: {'lr': 3.960814608877261e-06, 'samples': 27187968, 'steps': 141603, 'loss/train': 0.9892758131027222} 11/07/2021 17:10:57 - INFO - __main__ - Step 141605: {'lr': 3.959873775272349e-06, 'samples': 27188160, 'steps': 141604, 'loss/train': 1.2775987386703491} 11/07/2021 17:10:58 - INFO - __main__ - Step 141606: {'lr': 3.958933052529312e-06, 'samples': 27188352, 'steps': 141605, 'loss/train': 0.7587562799453735} 11/07/2021 17:10:58 - INFO - __main__ - Step 141607: {'lr': 3.957992440648567e-06, 'samples': 27188544, 'steps': 141606, 'loss/train': 1.411408543586731} 11/07/2021 17:10:58 - INFO - __main__ - Step 141608: {'lr': 3.957051939630557e-06, 'samples': 27188736, 'steps': 141607, 'loss/train': 1.3833152055740356} 11/07/2021 17:10:59 - INFO - __main__ - Step 141609: {'lr': 3.956111549475699e-06, 'samples': 27188928, 'steps': 141608, 'loss/train': 0.9372619986534119} 11/07/2021 17:11:00 - INFO - __main__ - Step 141610: {'lr': 3.955171270184438e-06, 'samples': 27189120, 'steps': 141609, 'loss/train': 0.7353643774986267} 11/07/2021 17:11:00 - INFO - __main__ - Step 141611: {'lr': 3.954231101757188e-06, 'samples': 27189312, 'steps': 141610, 'loss/train': 0.9804461598396301} 11/07/2021 17:11:01 - INFO - __main__ - Step 141612: {'lr': 3.953291044194341e-06, 'samples': 27189504, 'steps': 141611, 'loss/train': 0.1981913298368454} 11/07/2021 17:11:01 - INFO - __main__ - Step 141613: {'lr': 3.952351097496337e-06, 'samples': 27189696, 'steps': 141612, 'loss/train': 1.2998371124267578} 11/07/2021 17:11:01 - INFO - __main__ - Step 141614: {'lr': 3.951411261663623e-06, 'samples': 27189888, 'steps': 141613, 'loss/train': 0.5832651257514954} 11/07/2021 17:11:02 - INFO - __main__ - Step 141615: {'lr': 3.950471536696615e-06, 'samples': 27190080, 'steps': 141614, 'loss/train': 2.174443006515503} 11/07/2021 17:11:03 - INFO - __main__ - Step 141616: {'lr': 3.9495319225957e-06, 'samples': 27190272, 'steps': 141615, 'loss/train': 2.2129690647125244} 11/07/2021 17:11:03 - INFO - __main__ - Step 141617: {'lr': 3.948592419361352e-06, 'samples': 27190464, 'steps': 141616, 'loss/train': 1.5286493301391602} 11/07/2021 17:11:04 - INFO - __main__ - Step 141618: {'lr': 3.947653026993958e-06, 'samples': 27190656, 'steps': 141617, 'loss/train': 1.1039952039718628} 11/07/2021 17:11:04 - INFO - __main__ - Step 141619: {'lr': 3.9467137454939905e-06, 'samples': 27190848, 'steps': 141618, 'loss/train': 1.0606428384780884} 11/07/2021 17:11:04 - INFO - __main__ - Step 141620: {'lr': 3.94577457486181e-06, 'samples': 27191040, 'steps': 141619, 'loss/train': 1.0417507886886597} 11/07/2021 17:11:05 - INFO - __main__ - Step 141621: {'lr': 3.944835515097861e-06, 'samples': 27191232, 'steps': 141620, 'loss/train': 0.7659996747970581} 11/07/2021 17:11:06 - INFO - __main__ - Step 141622: {'lr': 3.9438965662025875e-06, 'samples': 27191424, 'steps': 141621, 'loss/train': 1.5503952503204346} 11/07/2021 17:11:06 - INFO - __main__ - Step 141623: {'lr': 3.942957728176377e-06, 'samples': 27191616, 'steps': 141622, 'loss/train': 0.9284815192222595} 11/07/2021 17:11:07 - INFO - __main__ - Step 141624: {'lr': 3.942019001019675e-06, 'samples': 27191808, 'steps': 141623, 'loss/train': 1.6165231466293335} 11/07/2021 17:11:07 - INFO - __main__ - Step 141625: {'lr': 3.941080384732926e-06, 'samples': 27192000, 'steps': 141624, 'loss/train': 1.6318604946136475} 11/07/2021 17:11:07 - INFO - __main__ - Step 141626: {'lr': 3.9401418793165165e-06, 'samples': 27192192, 'steps': 141625, 'loss/train': 1.258481740951538} 11/07/2021 17:11:08 - INFO - __main__ - Step 141627: {'lr': 3.939203484770865e-06, 'samples': 27192384, 'steps': 141626, 'loss/train': 1.3896609544754028} 11/07/2021 17:11:09 - INFO - __main__ - Step 141628: {'lr': 3.938265201096442e-06, 'samples': 27192576, 'steps': 141627, 'loss/train': 1.2513972520828247} 11/07/2021 17:11:09 - INFO - __main__ - Step 141629: {'lr': 3.937327028293608e-06, 'samples': 27192768, 'steps': 141628, 'loss/train': 1.2124937772750854} 11/07/2021 17:11:10 - INFO - __main__ - Step 141630: {'lr': 3.936388966362836e-06, 'samples': 27192960, 'steps': 141629, 'loss/train': 1.0261729955673218} 11/07/2021 17:11:10 - INFO - __main__ - Step 141631: {'lr': 3.935451015304514e-06, 'samples': 27193152, 'steps': 141630, 'loss/train': 1.1494770050048828} 11/07/2021 17:11:10 - INFO - __main__ - Step 141632: {'lr': 3.934513175119114e-06, 'samples': 27193344, 'steps': 141631, 'loss/train': 1.0900205373764038} 11/07/2021 17:11:11 - INFO - __main__ - Step 141633: {'lr': 3.933575445807025e-06, 'samples': 27193536, 'steps': 141632, 'loss/train': 0.4135739803314209} 11/07/2021 17:11:12 - INFO - __main__ - Step 141634: {'lr': 3.932637827368635e-06, 'samples': 27193728, 'steps': 141633, 'loss/train': 1.0987621545791626} 11/07/2021 17:11:12 - INFO - __main__ - Step 141635: {'lr': 3.931700319804415e-06, 'samples': 27193920, 'steps': 141634, 'loss/train': 1.102086067199707} 11/07/2021 17:11:12 - INFO - __main__ - Step 141636: {'lr': 3.930762923114783e-06, 'samples': 27194112, 'steps': 141635, 'loss/train': 1.5515429973602295} 11/07/2021 17:11:13 - INFO - __main__ - Step 141637: {'lr': 3.929825637300155e-06, 'samples': 27194304, 'steps': 141636, 'loss/train': 1.2583715915679932} 11/07/2021 17:11:14 - INFO - __main__ - Step 141638: {'lr': 3.928888462360919e-06, 'samples': 27194496, 'steps': 141637, 'loss/train': 1.4674416780471802} 11/07/2021 17:11:14 - INFO - __main__ - Step 141639: {'lr': 3.927951398297547e-06, 'samples': 27194688, 'steps': 141638, 'loss/train': 1.1874405145645142} 11/07/2021 17:11:14 - INFO - __main__ - Step 141640: {'lr': 3.927014445110455e-06, 'samples': 27194880, 'steps': 141639, 'loss/train': 1.0243359804153442} 11/07/2021 17:11:15 - INFO - __main__ - Step 141641: {'lr': 3.926077602800032e-06, 'samples': 27195072, 'steps': 141640, 'loss/train': 1.3138271570205688} 11/07/2021 17:11:15 - INFO - __main__ - Step 141642: {'lr': 3.9251408713667505e-06, 'samples': 27195264, 'steps': 141641, 'loss/train': 1.3438849449157715} 11/07/2021 17:11:16 - INFO - __main__ - Step 141643: {'lr': 3.92420425081097e-06, 'samples': 27195456, 'steps': 141642, 'loss/train': 1.137992024421692} 11/07/2021 17:11:17 - INFO - __main__ - Step 141644: {'lr': 3.923267741133163e-06, 'samples': 27195648, 'steps': 141643, 'loss/train': 0.6901651620864868} 11/07/2021 17:11:17 - INFO - __main__ - Step 141645: {'lr': 3.9223313423337455e-06, 'samples': 27195840, 'steps': 141644, 'loss/train': 0.6432854533195496} 11/07/2021 17:11:17 - INFO - __main__ - Step 141646: {'lr': 3.921395054413135e-06, 'samples': 27196032, 'steps': 141645, 'loss/train': 0.9496861100196838} 11/07/2021 17:11:18 - INFO - __main__ - Step 141647: {'lr': 3.9204588773717185e-06, 'samples': 27196224, 'steps': 141646, 'loss/train': 1.3197782039642334} 11/07/2021 17:11:19 - INFO - __main__ - Step 141648: {'lr': 3.9195228112099415e-06, 'samples': 27196416, 'steps': 141647, 'loss/train': 1.2396440505981445} 11/07/2021 17:11:19 - INFO - __main__ - Step 141649: {'lr': 3.918586855928247e-06, 'samples': 27196608, 'steps': 141648, 'loss/train': 1.4607160091400146} 11/07/2021 17:11:19 - INFO - __main__ - Step 141650: {'lr': 3.917651011527051e-06, 'samples': 27196800, 'steps': 141649, 'loss/train': 1.2628742456436157} 11/07/2021 17:11:20 - INFO - __main__ - Step 141651: {'lr': 3.916715278006744e-06, 'samples': 27196992, 'steps': 141650, 'loss/train': 1.0703130960464478} 11/07/2021 17:11:20 - INFO - __main__ - Step 141652: {'lr': 3.915779655367768e-06, 'samples': 27197184, 'steps': 141651, 'loss/train': 1.1896870136260986} 11/07/2021 17:11:20 - INFO - __main__ - Step 141653: {'lr': 3.914844143610541e-06, 'samples': 27197376, 'steps': 141652, 'loss/train': 1.8108402490615845} 11/07/2021 17:11:22 - INFO - __main__ - Step 141654: {'lr': 3.9139087427355055e-06, 'samples': 27197568, 'steps': 141653, 'loss/train': 1.3891481161117554} 11/07/2021 17:11:22 - INFO - __main__ - Step 141655: {'lr': 3.9129734527430515e-06, 'samples': 27197760, 'steps': 141654, 'loss/train': 1.7330821752548218} 11/07/2021 17:11:22 - INFO - __main__ - Step 141656: {'lr': 3.912038273633622e-06, 'samples': 27197952, 'steps': 141655, 'loss/train': 1.3629382848739624} 11/07/2021 17:11:23 - INFO - __main__ - Step 141657: {'lr': 3.911103205407635e-06, 'samples': 27198144, 'steps': 141656, 'loss/train': 1.8519010543823242} 11/07/2021 17:11:23 - INFO - __main__ - Step 141658: {'lr': 3.910168248065504e-06, 'samples': 27198336, 'steps': 141657, 'loss/train': 1.3640387058258057} 11/07/2021 17:11:24 - INFO - __main__ - Step 141659: {'lr': 3.909233401607648e-06, 'samples': 27198528, 'steps': 141658, 'loss/train': 1.1625169515609741} 11/07/2021 17:11:24 - INFO - __main__ - Step 141660: {'lr': 3.90829866603451e-06, 'samples': 27198720, 'steps': 141659, 'loss/train': 1.412068486213684} 11/07/2021 17:11:25 - INFO - __main__ - Step 141661: {'lr': 3.907364041346478e-06, 'samples': 27198912, 'steps': 141660, 'loss/train': 1.0787558555603027} 11/07/2021 17:11:25 - INFO - __main__ - Step 141662: {'lr': 3.906429527543997e-06, 'samples': 27199104, 'steps': 141661, 'loss/train': 1.3550000190734863} 11/07/2021 17:11:25 - INFO - __main__ - Step 141663: {'lr': 3.9054951246274835e-06, 'samples': 27199296, 'steps': 141662, 'loss/train': 1.2341045141220093} 11/07/2021 17:11:26 - INFO - __main__ - Step 141664: {'lr': 3.904560832597354e-06, 'samples': 27199488, 'steps': 141663, 'loss/train': 1.4477264881134033} 11/07/2021 17:11:27 - INFO - __main__ - Step 141665: {'lr': 3.903626651454023e-06, 'samples': 27199680, 'steps': 141664, 'loss/train': 1.1516972780227661} 11/07/2021 17:11:27 - INFO - __main__ - Step 141666: {'lr': 3.902692581197936e-06, 'samples': 27199872, 'steps': 141665, 'loss/train': 1.218177080154419} 11/07/2021 17:11:27 - INFO - __main__ - Step 141667: {'lr': 3.901758621829482e-06, 'samples': 27200064, 'steps': 141666, 'loss/train': 0.9676464200019836} 11/07/2021 17:11:28 - INFO - __main__ - Step 141668: {'lr': 3.900824773349105e-06, 'samples': 27200256, 'steps': 141667, 'loss/train': 1.5893417596817017} 11/07/2021 17:11:29 - INFO - __main__ - Step 141669: {'lr': 3.899891035757219e-06, 'samples': 27200448, 'steps': 141668, 'loss/train': 1.5319459438323975} 11/07/2021 17:11:29 - INFO - __main__ - Step 141670: {'lr': 3.898957409054243e-06, 'samples': 27200640, 'steps': 141669, 'loss/train': 1.356919288635254} 11/07/2021 17:11:30 - INFO - __main__ - Step 141671: {'lr': 3.898023893240593e-06, 'samples': 27200832, 'steps': 141670, 'loss/train': 1.6085219383239746} 11/07/2021 17:11:30 - INFO - __main__ - Step 141672: {'lr': 3.897090488316712e-06, 'samples': 27201024, 'steps': 141671, 'loss/train': 1.465740442276001} 11/07/2021 17:11:30 - INFO - __main__ - Step 141673: {'lr': 3.896157194283018e-06, 'samples': 27201216, 'steps': 141672, 'loss/train': 1.8812236785888672} 11/07/2021 17:11:31 - INFO - __main__ - Step 141674: {'lr': 3.895224011139869e-06, 'samples': 27201408, 'steps': 141673, 'loss/train': 0.4191858470439911} 11/07/2021 17:11:32 - INFO - __main__ - Step 141675: {'lr': 3.894290938887768e-06, 'samples': 27201600, 'steps': 141674, 'loss/train': 1.296119213104248} 11/07/2021 17:11:32 - INFO - __main__ - Step 141676: {'lr': 3.893357977527101e-06, 'samples': 27201792, 'steps': 141675, 'loss/train': 1.0888111591339111} 11/07/2021 17:11:32 - INFO - __main__ - Step 141677: {'lr': 3.892425127058286e-06, 'samples': 27201984, 'steps': 141676, 'loss/train': 1.3658047914505005} 11/07/2021 17:11:33 - INFO - __main__ - Step 141678: {'lr': 3.891492387481738e-06, 'samples': 27202176, 'steps': 141677, 'loss/train': 1.1606625318527222} 11/07/2021 17:11:33 - INFO - __main__ - Step 141679: {'lr': 3.890559758797901e-06, 'samples': 27202368, 'steps': 141678, 'loss/train': 1.5996099710464478} 11/07/2021 17:11:34 - INFO - __main__ - Step 141680: {'lr': 3.889627241007165e-06, 'samples': 27202560, 'steps': 141679, 'loss/train': 0.7716629505157471} 11/07/2021 17:11:35 - INFO - __main__ - Step 141681: {'lr': 3.888694834109974e-06, 'samples': 27202752, 'steps': 141680, 'loss/train': 1.404894471168518} 11/07/2021 17:11:35 - INFO - __main__ - Step 141682: {'lr': 3.887762538106715e-06, 'samples': 27202944, 'steps': 141681, 'loss/train': 1.3055768013000488} 11/07/2021 17:11:35 - INFO - __main__ - Step 141683: {'lr': 3.886830352997861e-06, 'samples': 27203136, 'steps': 141682, 'loss/train': 1.0815876722335815} 11/07/2021 17:11:36 - INFO - __main__ - Step 141684: {'lr': 3.8858982787838005e-06, 'samples': 27203328, 'steps': 141683, 'loss/train': 1.0660909414291382} 11/07/2021 17:11:37 - INFO - __main__ - Step 141685: {'lr': 3.88496631546495e-06, 'samples': 27203520, 'steps': 141684, 'loss/train': 0.9022493958473206} 11/07/2021 17:11:37 - INFO - __main__ - Step 141686: {'lr': 3.8840344630417526e-06, 'samples': 27203712, 'steps': 141685, 'loss/train': 0.8876247406005859} 11/07/2021 17:11:37 - INFO - __main__ - Step 141687: {'lr': 3.883102721514597e-06, 'samples': 27203904, 'steps': 141686, 'loss/train': 1.5855923891067505} 11/07/2021 17:11:38 - INFO - __main__ - Step 141688: {'lr': 3.8821710908839295e-06, 'samples': 27204096, 'steps': 141687, 'loss/train': 1.80308997631073} 11/07/2021 17:11:38 - INFO - __main__ - Step 141689: {'lr': 3.881239571150136e-06, 'samples': 27204288, 'steps': 141688, 'loss/train': 1.2248140573501587} 11/07/2021 17:11:39 - INFO - __main__ - Step 141690: {'lr': 3.880308162313662e-06, 'samples': 27204480, 'steps': 141689, 'loss/train': 0.4429817795753479} 11/07/2021 17:11:39 - INFO - __main__ - Step 141691: {'lr': 3.879376864374923e-06, 'samples': 27204672, 'steps': 141690, 'loss/train': 1.0091724395751953} 11/07/2021 17:11:40 - INFO - __main__ - Step 141692: {'lr': 3.878445677334363e-06, 'samples': 27204864, 'steps': 141691, 'loss/train': 0.8886504173278809} 11/07/2021 17:11:40 - INFO - __main__ - Step 141693: {'lr': 3.877514601192345e-06, 'samples': 27205056, 'steps': 141692, 'loss/train': 1.4671849012374878} 11/07/2021 17:11:41 - INFO - __main__ - Step 141694: {'lr': 3.876583635949338e-06, 'samples': 27205248, 'steps': 141693, 'loss/train': 1.1759003400802612} 11/07/2021 17:11:42 - INFO - __main__ - Step 141695: {'lr': 3.8756527816057596e-06, 'samples': 27205440, 'steps': 141694, 'loss/train': 1.3745603561401367} 11/07/2021 17:11:42 - INFO - __main__ - Step 141696: {'lr': 3.8747220381619705e-06, 'samples': 27205632, 'steps': 141695, 'loss/train': 1.7963210344314575} 11/07/2021 17:11:42 - INFO - __main__ - Step 141697: {'lr': 3.8737914056184706e-06, 'samples': 27205824, 'steps': 141696, 'loss/train': 1.3598932027816772} 11/07/2021 17:11:43 - INFO - __main__ - Step 141698: {'lr': 3.8728608839756205e-06, 'samples': 27206016, 'steps': 141697, 'loss/train': 1.276527762413025} 11/07/2021 17:11:43 - INFO - __main__ - Step 141699: {'lr': 3.871930473233892e-06, 'samples': 27206208, 'steps': 141698, 'loss/train': 1.3809317350387573} 11/07/2021 17:11:43 - INFO - __main__ - Step 141700: {'lr': 3.871000173393674e-06, 'samples': 27206400, 'steps': 141699, 'loss/train': 1.3028454780578613} 11/07/2021 17:11:44 - INFO - __main__ - Step 141701: {'lr': 3.870069984455355e-06, 'samples': 27206592, 'steps': 141700, 'loss/train': 1.5603337287902832} 11/07/2021 17:11:45 - INFO - __main__ - Step 141702: {'lr': 3.8691399064194055e-06, 'samples': 27206784, 'steps': 141701, 'loss/train': 1.5160348415374756} 11/07/2021 17:11:45 - INFO - __main__ - Step 141703: {'lr': 3.868209939286216e-06, 'samples': 27206976, 'steps': 141702, 'loss/train': 1.418325424194336} 11/07/2021 17:11:45 - INFO - __main__ - Step 141704: {'lr': 3.8672800830562016e-06, 'samples': 27207168, 'steps': 141703, 'loss/train': 1.114676594734192} 11/07/2021 17:11:46 - INFO - __main__ - Step 141705: {'lr': 3.866350337729807e-06, 'samples': 27207360, 'steps': 141704, 'loss/train': 1.491434097290039} 11/07/2021 17:11:47 - INFO - __main__ - Step 141706: {'lr': 3.865420703307421e-06, 'samples': 27207552, 'steps': 141705, 'loss/train': 2.0639116764068604} 11/07/2021 17:11:48 - INFO - __main__ - Step 141707: {'lr': 3.864491179789487e-06, 'samples': 27207744, 'steps': 141706, 'loss/train': 1.013355016708374} 11/07/2021 17:11:48 - INFO - __main__ - Step 141708: {'lr': 3.863561767176421e-06, 'samples': 27207936, 'steps': 141707, 'loss/train': 0.866156280040741} 11/07/2021 17:11:48 - INFO - __main__ - Step 141709: {'lr': 3.862632465468613e-06, 'samples': 27208128, 'steps': 141708, 'loss/train': 1.3497223854064941} 11/07/2021 17:11:49 - INFO - __main__ - Step 141710: {'lr': 3.861703274666534e-06, 'samples': 27208320, 'steps': 141709, 'loss/train': 1.4281327724456787} 11/07/2021 17:11:50 - INFO - __main__ - Step 141711: {'lr': 3.860774194770572e-06, 'samples': 27208512, 'steps': 141710, 'loss/train': 0.8604257702827454} 11/07/2021 17:11:50 - INFO - __main__ - Step 141712: {'lr': 3.859845225781117e-06, 'samples': 27208704, 'steps': 141711, 'loss/train': 1.322885513305664} 11/07/2021 17:11:51 - INFO - __main__ - Step 141713: {'lr': 3.858916367698667e-06, 'samples': 27208896, 'steps': 141712, 'loss/train': 1.0436054468154907} 11/07/2021 17:11:51 - INFO - __main__ - Step 141714: {'lr': 3.857987620523557e-06, 'samples': 27209088, 'steps': 141713, 'loss/train': 1.486232876777649} 11/07/2021 17:11:51 - INFO - __main__ - Step 141715: {'lr': 3.8570589842562564e-06, 'samples': 27209280, 'steps': 141714, 'loss/train': 1.188028335571289} 11/07/2021 17:11:52 - INFO - __main__ - Step 141716: {'lr': 3.856130458897156e-06, 'samples': 27209472, 'steps': 141715, 'loss/train': 0.8814691305160522} 11/07/2021 17:11:53 - INFO - __main__ - Step 141717: {'lr': 3.85520204444667e-06, 'samples': 27209664, 'steps': 141716, 'loss/train': 1.5106908082962036} 11/07/2021 17:11:53 - INFO - __main__ - Step 141718: {'lr': 3.854273740905245e-06, 'samples': 27209856, 'steps': 141717, 'loss/train': 1.3754374980926514} 11/07/2021 17:11:53 - INFO - __main__ - Step 141719: {'lr': 3.853345548273296e-06, 'samples': 27210048, 'steps': 141718, 'loss/train': 0.5197484493255615} 11/07/2021 17:11:54 - INFO - __main__ - Step 141720: {'lr': 3.852417466551211e-06, 'samples': 27210240, 'steps': 141719, 'loss/train': 0.5812214612960815} 11/07/2021 17:11:54 - INFO - __main__ - Step 141721: {'lr': 3.851489495739435e-06, 'samples': 27210432, 'steps': 141720, 'loss/train': 1.1615676879882812} 11/07/2021 17:11:55 - INFO - __main__ - Step 141722: {'lr': 3.850561635838385e-06, 'samples': 27210624, 'steps': 141721, 'loss/train': 1.499994158744812} 11/07/2021 17:11:55 - INFO - __main__ - Step 141723: {'lr': 3.8496338868484744e-06, 'samples': 27210816, 'steps': 141722, 'loss/train': 1.2071980237960815} 11/07/2021 17:11:56 - INFO - __main__ - Step 141724: {'lr': 3.848706248770123e-06, 'samples': 27211008, 'steps': 141723, 'loss/train': 0.8965361714363098} 11/07/2021 17:11:56 - INFO - __main__ - Step 141725: {'lr': 3.847778721603745e-06, 'samples': 27211200, 'steps': 141724, 'loss/train': 1.5728529691696167} 11/07/2021 17:11:56 - INFO - __main__ - Step 141726: {'lr': 3.846851305349786e-06, 'samples': 27211392, 'steps': 141725, 'loss/train': 1.1948193311691284} 11/07/2021 17:11:58 - INFO - __main__ - Step 141727: {'lr': 3.845924000008605e-06, 'samples': 27211584, 'steps': 141726, 'loss/train': 0.5330202579498291} 11/07/2021 17:11:58 - INFO - __main__ - Step 141728: {'lr': 3.844996805580648e-06, 'samples': 27211776, 'steps': 141727, 'loss/train': 1.0140070915222168} 11/07/2021 17:11:58 - INFO - __main__ - Step 141729: {'lr': 3.844069722066329e-06, 'samples': 27211968, 'steps': 141728, 'loss/train': 1.0151349306106567} 11/07/2021 17:11:59 - INFO - __main__ - Step 141730: {'lr': 3.843142749466094e-06, 'samples': 27212160, 'steps': 141729, 'loss/train': 1.5089659690856934} 11/07/2021 17:11:59 - INFO - __main__ - Step 141731: {'lr': 3.842215887780332e-06, 'samples': 27212352, 'steps': 141730, 'loss/train': 0.9764630794525146} 11/07/2021 17:12:00 - INFO - __main__ - Step 141732: {'lr': 3.841289137009485e-06, 'samples': 27212544, 'steps': 141731, 'loss/train': 1.3287420272827148} 11/07/2021 17:12:00 - INFO - __main__ - Step 141733: {'lr': 3.840362497153943e-06, 'samples': 27212736, 'steps': 141732, 'loss/train': 1.523417592048645} 11/07/2021 17:12:01 - INFO - __main__ - Step 141734: {'lr': 3.839435968214122e-06, 'samples': 27212928, 'steps': 141733, 'loss/train': 1.7943826913833618} 11/07/2021 17:12:01 - INFO - __main__ - Step 141735: {'lr': 3.838509550190466e-06, 'samples': 27213120, 'steps': 141734, 'loss/train': 1.0940951108932495} 11/07/2021 17:12:01 - INFO - __main__ - Step 141736: {'lr': 3.837583243083393e-06, 'samples': 27213312, 'steps': 141735, 'loss/train': 0.6764024496078491} 11/07/2021 17:12:02 - INFO - __main__ - Step 141737: {'lr': 3.836657046893288e-06, 'samples': 27213504, 'steps': 141736, 'loss/train': 1.3776309490203857} 11/07/2021 17:12:03 - INFO - __main__ - Step 141738: {'lr': 3.835730961620571e-06, 'samples': 27213696, 'steps': 141737, 'loss/train': 1.5072728395462036} 11/07/2021 17:12:03 - INFO - __main__ - Step 141739: {'lr': 3.8348049872657105e-06, 'samples': 27213888, 'steps': 141738, 'loss/train': 1.1215574741363525} 11/07/2021 17:12:04 - INFO - __main__ - Step 141740: {'lr': 3.83387912382907e-06, 'samples': 27214080, 'steps': 141739, 'loss/train': 0.8294079303741455} 11/07/2021 17:12:04 - INFO - __main__ - Step 141741: {'lr': 3.832953371311093e-06, 'samples': 27214272, 'steps': 141740, 'loss/train': 1.4248682260513306} 11/07/2021 17:12:05 - INFO - __main__ - Step 141742: {'lr': 3.8320277297121955e-06, 'samples': 27214464, 'steps': 141741, 'loss/train': 1.6906052827835083} 11/07/2021 17:12:05 - INFO - __main__ - Step 141743: {'lr': 3.8311021990327654e-06, 'samples': 27214656, 'steps': 141742, 'loss/train': 0.810661256313324} 11/07/2021 17:12:06 - INFO - __main__ - Step 141744: {'lr': 3.830176779273248e-06, 'samples': 27214848, 'steps': 141743, 'loss/train': 1.4420690536499023} 11/07/2021 17:12:06 - INFO - __main__ - Step 141745: {'lr': 3.829251470434059e-06, 'samples': 27215040, 'steps': 141744, 'loss/train': 0.9034472703933716} 11/07/2021 17:12:06 - INFO - __main__ - Step 141746: {'lr': 3.828326272515614e-06, 'samples': 27215232, 'steps': 141745, 'loss/train': 0.8735635280609131} 11/07/2021 17:12:07 - INFO - __main__ - Step 141747: {'lr': 3.827401185518331e-06, 'samples': 27215424, 'steps': 141746, 'loss/train': 1.0241318941116333} 11/07/2021 17:12:08 - INFO - __main__ - Step 141748: {'lr': 3.826476209442598e-06, 'samples': 27215616, 'steps': 141747, 'loss/train': 1.3011976480484009} 11/07/2021 17:12:08 - INFO - __main__ - Step 141749: {'lr': 3.825551344288886e-06, 'samples': 27215808, 'steps': 141748, 'loss/train': 1.0816055536270142} 11/07/2021 17:12:09 - INFO - __main__ - Step 141750: {'lr': 3.824626590057556e-06, 'samples': 27216000, 'steps': 141749, 'loss/train': 1.6408817768096924} 11/07/2021 17:12:09 - INFO - __main__ - Step 141751: {'lr': 3.823701946749053e-06, 'samples': 27216192, 'steps': 141750, 'loss/train': 1.5064927339553833} 11/07/2021 17:12:09 - INFO - __main__ - Step 141752: {'lr': 3.822777414363793e-06, 'samples': 27216384, 'steps': 141751, 'loss/train': 1.283774733543396} 11/07/2021 17:12:11 - INFO - __main__ - Step 141753: {'lr': 3.821852992902219e-06, 'samples': 27216576, 'steps': 141752, 'loss/train': 1.5650759935379028} 11/07/2021 17:12:11 - INFO - __main__ - Step 141754: {'lr': 3.820928682364722e-06, 'samples': 27216768, 'steps': 141753, 'loss/train': 0.7582300901412964} 11/07/2021 17:12:11 - INFO - __main__ - Step 141755: {'lr': 3.820004482751688e-06, 'samples': 27216960, 'steps': 141754, 'loss/train': 1.197695016860962} 11/07/2021 17:12:12 - INFO - __main__ - Step 141756: {'lr': 3.8190803940635624e-06, 'samples': 27217152, 'steps': 141755, 'loss/train': 0.8803466558456421} 11/07/2021 17:12:12 - INFO - __main__ - Step 141757: {'lr': 3.818156416300761e-06, 'samples': 27217344, 'steps': 141756, 'loss/train': 1.294869065284729} 11/07/2021 17:12:13 - INFO - __main__ - Step 141758: {'lr': 3.8172325494637004e-06, 'samples': 27217536, 'steps': 141757, 'loss/train': 0.8159918189048767} 11/07/2021 17:12:13 - INFO - __main__ - Step 141759: {'lr': 3.816308793552798e-06, 'samples': 27217728, 'steps': 141758, 'loss/train': 1.255479335784912} 11/07/2021 17:12:14 - INFO - __main__ - Step 141760: {'lr': 3.815385148568467e-06, 'samples': 27217920, 'steps': 141759, 'loss/train': 1.3338260650634766} 11/07/2021 17:12:14 - INFO - __main__ - Step 141761: {'lr': 3.8144616145111276e-06, 'samples': 27218112, 'steps': 141760, 'loss/train': 0.8972194194793701} 11/07/2021 17:12:14 - INFO - __main__ - Step 141762: {'lr': 3.8135381913811662e-06, 'samples': 27218304, 'steps': 141761, 'loss/train': 1.245100736618042} 11/07/2021 17:12:15 - INFO - __main__ - Step 141763: {'lr': 3.8126148791790547e-06, 'samples': 27218496, 'steps': 141762, 'loss/train': 1.2824968099594116} 11/07/2021 17:12:16 - INFO - __main__ - Step 141764: {'lr': 3.8116916779051827e-06, 'samples': 27218688, 'steps': 141763, 'loss/train': 1.2278457880020142} 11/07/2021 17:12:16 - INFO - __main__ - Step 141765: {'lr': 3.810768587559965e-06, 'samples': 27218880, 'steps': 141764, 'loss/train': 0.4531492292881012} 11/07/2021 17:12:16 - INFO - __main__ - Step 141766: {'lr': 3.8098456081437917e-06, 'samples': 27219072, 'steps': 141765, 'loss/train': 1.2118955850601196} 11/07/2021 17:12:17 - INFO - __main__ - Step 141767: {'lr': 3.8089227396571337e-06, 'samples': 27219264, 'steps': 141766, 'loss/train': 1.6071562767028809} 11/07/2021 17:12:18 - INFO - __main__ - Step 141768: {'lr': 3.8079999821003797e-06, 'samples': 27219456, 'steps': 141767, 'loss/train': 1.0062638521194458} 11/07/2021 17:12:18 - INFO - __main__ - Step 141769: {'lr': 3.8070773354739187e-06, 'samples': 27219648, 'steps': 141768, 'loss/train': 1.4963871240615845} 11/07/2021 17:12:18 - INFO - __main__ - Step 141770: {'lr': 3.8061547997781943e-06, 'samples': 27219840, 'steps': 141769, 'loss/train': 1.5946654081344604} 11/07/2021 17:12:19 - INFO - __main__ - Step 141771: {'lr': 3.805232375013595e-06, 'samples': 27220032, 'steps': 141770, 'loss/train': 1.4108860492706299} 11/07/2021 17:12:19 - INFO - __main__ - Step 141772: {'lr': 3.8043100611805935e-06, 'samples': 27220224, 'steps': 141771, 'loss/train': 1.2602559328079224} 11/07/2021 17:12:20 - INFO - __main__ - Step 141773: {'lr': 3.80338785827955e-06, 'samples': 27220416, 'steps': 141772, 'loss/train': 1.1900922060012817} 11/07/2021 17:12:21 - INFO - __main__ - Step 141774: {'lr': 3.8024657663109087e-06, 'samples': 27220608, 'steps': 141773, 'loss/train': 1.4183154106140137} 11/07/2021 17:12:21 - INFO - __main__ - Step 141775: {'lr': 3.8015437852750857e-06, 'samples': 27220800, 'steps': 141774, 'loss/train': 1.464518666267395} 11/07/2021 17:12:21 - INFO - __main__ - Step 141776: {'lr': 3.8006219151724695e-06, 'samples': 27220992, 'steps': 141775, 'loss/train': 0.15366120636463165} 11/07/2021 17:12:22 - INFO - __main__ - Step 141777: {'lr': 3.799700156003505e-06, 'samples': 27221184, 'steps': 141776, 'loss/train': 1.491884469985962} 11/07/2021 17:12:23 - INFO - __main__ - Step 141778: {'lr': 3.798778507768608e-06, 'samples': 27221376, 'steps': 141777, 'loss/train': 1.1389806270599365} 11/07/2021 17:12:23 - INFO - __main__ - Step 141779: {'lr': 3.7978569704681666e-06, 'samples': 27221568, 'steps': 141778, 'loss/train': 1.2665327787399292} 11/07/2021 17:12:23 - INFO - __main__ - Step 141780: {'lr': 3.7969355441026254e-06, 'samples': 27221760, 'steps': 141779, 'loss/train': 1.0880705118179321} 11/07/2021 17:12:24 - INFO - __main__ - Step 141781: {'lr': 3.796014228672373e-06, 'samples': 27221952, 'steps': 141780, 'loss/train': 1.1869410276412964} 11/07/2021 17:12:24 - INFO - __main__ - Step 141782: {'lr': 3.7950930241778536e-06, 'samples': 27222144, 'steps': 141781, 'loss/train': 1.3125410079956055} 11/07/2021 17:12:25 - INFO - __main__ - Step 141783: {'lr': 3.7941719306194556e-06, 'samples': 27222336, 'steps': 141782, 'loss/train': 1.2165607213974} 11/07/2021 17:12:25 - INFO - __main__ - Step 141784: {'lr': 3.7932509479975954e-06, 'samples': 27222528, 'steps': 141783, 'loss/train': 1.2410213947296143} 11/07/2021 17:12:26 - INFO - __main__ - Step 141785: {'lr': 3.792330076312689e-06, 'samples': 27222720, 'steps': 141784, 'loss/train': 1.661592721939087} 11/07/2021 17:12:26 - INFO - __main__ - Step 141786: {'lr': 3.791409315565181e-06, 'samples': 27222912, 'steps': 141785, 'loss/train': 1.077893614768982} 11/07/2021 17:12:26 - INFO - __main__ - Step 141787: {'lr': 3.790488665755459e-06, 'samples': 27223104, 'steps': 141786, 'loss/train': 1.2435718774795532} 11/07/2021 17:12:28 - INFO - __main__ - Step 141788: {'lr': 3.789568126883941e-06, 'samples': 27223296, 'steps': 141787, 'loss/train': 1.02601957321167} 11/07/2021 17:12:28 - INFO - __main__ - Step 141789: {'lr': 3.7886476989510423e-06, 'samples': 27223488, 'steps': 141788, 'loss/train': 0.7671656012535095} 11/07/2021 17:12:28 - INFO - __main__ - Step 141790: {'lr': 3.787727381957179e-06, 'samples': 27223680, 'steps': 141789, 'loss/train': 1.0378763675689697} 11/07/2021 17:12:29 - INFO - __main__ - Step 141791: {'lr': 3.786807175902768e-06, 'samples': 27223872, 'steps': 141790, 'loss/train': 0.9960227012634277} 11/07/2021 17:12:29 - INFO - __main__ - Step 141792: {'lr': 3.785887080788225e-06, 'samples': 27224064, 'steps': 141791, 'loss/train': 1.331379771232605} 11/07/2021 17:12:29 - INFO - __main__ - Step 141793: {'lr': 3.784967096613995e-06, 'samples': 27224256, 'steps': 141792, 'loss/train': 1.1275477409362793} 11/07/2021 17:12:30 - INFO - __main__ - Step 141794: {'lr': 3.78404722338041e-06, 'samples': 27224448, 'steps': 141793, 'loss/train': 1.148646593093872} 11/07/2021 17:12:31 - INFO - __main__ - Step 141795: {'lr': 3.7831274610879705e-06, 'samples': 27224640, 'steps': 141794, 'loss/train': 0.6777094006538391} 11/07/2021 17:12:31 - INFO - __main__ - Step 141796: {'lr': 3.7822078097370095e-06, 'samples': 27224832, 'steps': 141795, 'loss/train': 1.5993036031723022} 11/07/2021 17:12:31 - INFO - __main__ - Step 141797: {'lr': 3.781288269328026e-06, 'samples': 27225024, 'steps': 141796, 'loss/train': 1.1394431591033936} 11/07/2021 17:12:32 - INFO - __main__ - Step 141798: {'lr': 3.7803688398613812e-06, 'samples': 27225216, 'steps': 141797, 'loss/train': 1.7014334201812744} 11/07/2021 17:12:33 - INFO - __main__ - Step 141799: {'lr': 3.779449521337491e-06, 'samples': 27225408, 'steps': 141798, 'loss/train': 1.3033815622329712} 11/07/2021 17:12:33 - INFO - __main__ - Step 141800: {'lr': 3.7785303137568007e-06, 'samples': 27225600, 'steps': 141799, 'loss/train': 0.7895970940589905} 11/07/2021 17:12:33 - INFO - __main__ - Step 141801: {'lr': 3.7776112171196976e-06, 'samples': 27225792, 'steps': 141800, 'loss/train': 1.4676264524459839} 11/07/2021 17:12:34 - INFO - __main__ - Step 141802: {'lr': 3.7766922314265985e-06, 'samples': 27225984, 'steps': 141801, 'loss/train': 1.1601243019104004} 11/07/2021 17:12:34 - INFO - __main__ - Step 141803: {'lr': 3.7757733566779197e-06, 'samples': 27226176, 'steps': 141802, 'loss/train': 1.3065226078033447} 11/07/2021 17:12:35 - INFO - __main__ - Step 141804: {'lr': 3.7748545928740775e-06, 'samples': 27226368, 'steps': 141803, 'loss/train': 1.392745852470398} 11/07/2021 17:12:35 - INFO - __main__ - Step 141805: {'lr': 3.773935940015516e-06, 'samples': 27226560, 'steps': 141804, 'loss/train': 1.5050050020217896} 11/07/2021 17:12:36 - INFO - __main__ - Step 141806: {'lr': 3.773017398102596e-06, 'samples': 27226752, 'steps': 141805, 'loss/train': 1.4583169221878052} 11/07/2021 17:12:36 - INFO - __main__ - Step 141807: {'lr': 3.772098967135762e-06, 'samples': 27226944, 'steps': 141806, 'loss/train': 1.2637841701507568} 11/07/2021 17:12:37 - INFO - __main__ - Step 141808: {'lr': 3.771180647115402e-06, 'samples': 27227136, 'steps': 141807, 'loss/train': 1.5723251104354858} 11/07/2021 17:12:38 - INFO - __main__ - Step 141809: {'lr': 3.7702624380419604e-06, 'samples': 27227328, 'steps': 141808, 'loss/train': 1.4499261379241943} 11/07/2021 17:12:38 - INFO - __main__ - Step 141810: {'lr': 3.769344339915853e-06, 'samples': 27227520, 'steps': 141809, 'loss/train': 1.3690769672393799} 11/07/2021 17:12:38 - INFO - __main__ - Step 141811: {'lr': 3.7684263527374697e-06, 'samples': 27227712, 'steps': 141810, 'loss/train': 1.4688507318496704} 11/07/2021 17:12:39 - INFO - __main__ - Step 141812: {'lr': 3.7675084765072255e-06, 'samples': 27227904, 'steps': 141811, 'loss/train': 1.3781110048294067} 11/07/2021 17:12:39 - INFO - __main__ - Step 141813: {'lr': 3.7665907112255373e-06, 'samples': 27228096, 'steps': 141812, 'loss/train': 1.2391763925552368} 11/07/2021 17:12:39 - INFO - __main__ - Step 141814: {'lr': 3.765673056892821e-06, 'samples': 27228288, 'steps': 141813, 'loss/train': 1.2089636325836182} 11/07/2021 17:12:40 - INFO - __main__ - Step 141815: {'lr': 3.7647555135095215e-06, 'samples': 27228480, 'steps': 141814, 'loss/train': 1.3283259868621826} 11/07/2021 17:12:41 - INFO - __main__ - Step 141816: {'lr': 3.7638380810760265e-06, 'samples': 27228672, 'steps': 141815, 'loss/train': 1.4618412256240845} 11/07/2021 17:12:41 - INFO - __main__ - Step 141817: {'lr': 3.762920759592725e-06, 'samples': 27228864, 'steps': 141816, 'loss/train': 1.1797775030136108} 11/07/2021 17:12:41 - INFO - __main__ - Step 141818: {'lr': 3.7620035490600337e-06, 'samples': 27229056, 'steps': 141817, 'loss/train': 1.3305021524429321} 11/07/2021 17:12:42 - INFO - __main__ - Step 141819: {'lr': 3.7610864494784234e-06, 'samples': 27229248, 'steps': 141818, 'loss/train': 0.9595531821250916} 11/07/2021 17:12:43 - INFO - __main__ - Step 141820: {'lr': 3.7601694608482283e-06, 'samples': 27229440, 'steps': 141819, 'loss/train': 1.0837351083755493} 11/07/2021 17:12:43 - INFO - __main__ - Step 141821: {'lr': 3.7592525831699197e-06, 'samples': 27229632, 'steps': 141820, 'loss/train': 1.080356240272522} 11/07/2021 17:12:44 - INFO - __main__ - Step 141822: {'lr': 3.7583358164439143e-06, 'samples': 27229824, 'steps': 141821, 'loss/train': 1.296917200088501} 11/07/2021 17:12:44 - INFO - __main__ - Step 141823: {'lr': 3.757419160670572e-06, 'samples': 27230016, 'steps': 141822, 'loss/train': 1.5998033285140991} 11/07/2021 17:12:44 - INFO - __main__ - Step 141824: {'lr': 3.756502615850338e-06, 'samples': 27230208, 'steps': 141823, 'loss/train': 1.1726206541061401} 11/07/2021 17:12:45 - INFO - __main__ - Step 141825: {'lr': 3.755586181983628e-06, 'samples': 27230400, 'steps': 141824, 'loss/train': 1.5405282974243164} 11/07/2021 17:12:46 - INFO - __main__ - Step 141826: {'lr': 3.754669859070886e-06, 'samples': 27230592, 'steps': 141825, 'loss/train': 2.1875534057617188} 11/07/2021 17:12:46 - INFO - __main__ - Step 141827: {'lr': 3.7537536471124733e-06, 'samples': 27230784, 'steps': 141826, 'loss/train': 1.1209901571273804} 11/07/2021 17:12:46 - INFO - __main__ - Step 141828: {'lr': 3.752837546108806e-06, 'samples': 27230976, 'steps': 141827, 'loss/train': 0.9706765413284302} 11/07/2021 17:12:47 - INFO - __main__ - Step 141829: {'lr': 3.7519215560603006e-06, 'samples': 27231168, 'steps': 141828, 'loss/train': 1.1702930927276611} 11/07/2021 17:12:48 - INFO - __main__ - Step 141830: {'lr': 3.7510056769673726e-06, 'samples': 27231360, 'steps': 141829, 'loss/train': 0.7165502905845642} 11/07/2021 17:12:48 - INFO - __main__ - Step 141831: {'lr': 3.7500899088304672e-06, 'samples': 27231552, 'steps': 141830, 'loss/train': 1.4030439853668213} 11/07/2021 17:12:49 - INFO - __main__ - Step 141832: {'lr': 3.7491742516499728e-06, 'samples': 27231744, 'steps': 141831, 'loss/train': 1.5668303966522217} 11/07/2021 17:12:49 - INFO - __main__ - Step 141833: {'lr': 3.748258705426277e-06, 'samples': 27231936, 'steps': 141832, 'loss/train': 1.3879554271697998} 11/07/2021 17:12:49 - INFO - __main__ - Step 141834: {'lr': 3.7473432701598255e-06, 'samples': 27232128, 'steps': 141833, 'loss/train': 1.2021872997283936} 11/07/2021 17:12:50 - INFO - __main__ - Step 141835: {'lr': 3.746427945851033e-06, 'samples': 27232320, 'steps': 141834, 'loss/train': 1.4204890727996826} 11/07/2021 17:12:51 - INFO - __main__ - Step 141836: {'lr': 3.745512732500289e-06, 'samples': 27232512, 'steps': 141835, 'loss/train': 1.6456691026687622} 11/07/2021 17:12:51 - INFO - __main__ - Step 141837: {'lr': 3.7445976301080376e-06, 'samples': 27232704, 'steps': 141836, 'loss/train': 1.5204882621765137} 11/07/2021 17:12:51 - INFO - __main__ - Step 141838: {'lr': 3.743682638674667e-06, 'samples': 27232896, 'steps': 141837, 'loss/train': 1.4676589965820312} 11/07/2021 17:12:52 - INFO - __main__ - Step 141839: {'lr': 3.742767758200566e-06, 'samples': 27233088, 'steps': 141838, 'loss/train': 1.135431170463562} 11/07/2021 17:12:53 - INFO - __main__ - Step 141840: {'lr': 3.7418529886861787e-06, 'samples': 27233280, 'steps': 141839, 'loss/train': 1.1947717666625977} 11/07/2021 17:12:53 - INFO - __main__ - Step 141841: {'lr': 3.740938330131921e-06, 'samples': 27233472, 'steps': 141840, 'loss/train': 1.7282789945602417} 11/07/2021 17:12:53 - INFO - __main__ - Step 141842: {'lr': 3.74002378253821e-06, 'samples': 27233664, 'steps': 141841, 'loss/train': 1.33830988407135} 11/07/2021 17:12:54 - INFO - __main__ - Step 141843: {'lr': 3.7391093459054336e-06, 'samples': 27233856, 'steps': 141842, 'loss/train': 0.862578809261322} 11/07/2021 17:12:54 - INFO - __main__ - Step 141844: {'lr': 3.7381950202340087e-06, 'samples': 27234048, 'steps': 141843, 'loss/train': 1.2517789602279663} 11/07/2021 17:12:54 - INFO - __main__ - Step 141845: {'lr': 3.7372808055243514e-06, 'samples': 27234240, 'steps': 141844, 'loss/train': 1.8371636867523193} 11/07/2021 17:12:56 - INFO - __main__ - Step 141846: {'lr': 3.7363667017768776e-06, 'samples': 27234432, 'steps': 141845, 'loss/train': 1.5115838050842285} 11/07/2021 17:12:57 - INFO - __main__ - Step 141847: {'lr': 3.7354527089920044e-06, 'samples': 27234624, 'steps': 141846, 'loss/train': 1.491485834121704} 11/07/2021 17:12:57 - INFO - __main__ - Step 141848: {'lr': 3.7345388271701477e-06, 'samples': 27234816, 'steps': 141847, 'loss/train': 1.0528894662857056} 11/07/2021 17:12:57 - INFO - __main__ - Step 141849: {'lr': 3.733625056311696e-06, 'samples': 27235008, 'steps': 141848, 'loss/train': 1.520082712173462} 11/07/2021 17:12:58 - INFO - __main__ - Step 141850: {'lr': 3.7327113964170656e-06, 'samples': 27235200, 'steps': 141849, 'loss/train': 1.4342999458312988} 11/07/2021 17:12:59 - INFO - __main__ - Step 141851: {'lr': 3.7317978474866733e-06, 'samples': 27235392, 'steps': 141850, 'loss/train': 1.1149115562438965} 11/07/2021 17:12:59 - INFO - __main__ - Step 141852: {'lr': 3.730884409520935e-06, 'samples': 27235584, 'steps': 141851, 'loss/train': 1.0628063678741455} 11/07/2021 17:13:00 - INFO - __main__ - Step 141853: {'lr': 3.729971082520267e-06, 'samples': 27235776, 'steps': 141852, 'loss/train': 0.11149768531322479} 11/07/2021 17:13:00 - INFO - __main__ - Step 141854: {'lr': 3.7290578664850583e-06, 'samples': 27235968, 'steps': 141853, 'loss/train': 0.9180381894111633} 11/07/2021 17:13:00 - INFO - __main__ - Step 141855: {'lr': 3.7281447614157526e-06, 'samples': 27236160, 'steps': 141854, 'loss/train': 1.3088427782058716} 11/07/2021 17:13:01 - INFO - __main__ - Step 141856: {'lr': 3.7272317673127388e-06, 'samples': 27236352, 'steps': 141855, 'loss/train': 1.2345284223556519} 11/07/2021 17:13:02 - INFO - __main__ - Step 141857: {'lr': 3.726318884176433e-06, 'samples': 27236544, 'steps': 141856, 'loss/train': 1.6492705345153809} 11/07/2021 17:13:02 - INFO - __main__ - Step 141858: {'lr': 3.725406112007251e-06, 'samples': 27236736, 'steps': 141857, 'loss/train': 0.8836938142776489} 11/07/2021 17:13:02 - INFO - __main__ - Step 141859: {'lr': 3.724493450805583e-06, 'samples': 27236928, 'steps': 141858, 'loss/train': 1.2267156839370728} 11/07/2021 17:13:03 - INFO - __main__ - Step 141860: {'lr': 3.7235809005718713e-06, 'samples': 27237120, 'steps': 141859, 'loss/train': 1.0947210788726807} 11/07/2021 17:13:03 - INFO - __main__ - Step 141861: {'lr': 3.722668461306533e-06, 'samples': 27237312, 'steps': 141860, 'loss/train': 1.2841286659240723} 11/07/2021 17:13:04 - INFO - __main__ - Step 141862: {'lr': 3.721756133009957e-06, 'samples': 27237504, 'steps': 141861, 'loss/train': 1.162513017654419} 11/07/2021 17:13:05 - INFO - __main__ - Step 141863: {'lr': 3.7208439156825313e-06, 'samples': 27237696, 'steps': 141862, 'loss/train': 1.261956810951233} 11/07/2021 17:13:05 - INFO - __main__ - Step 141864: {'lr': 3.7199318093247e-06, 'samples': 27237888, 'steps': 141863, 'loss/train': 1.4545807838439941} 11/07/2021 17:13:05 - INFO - __main__ - Step 141865: {'lr': 3.7190198139368803e-06, 'samples': 27238080, 'steps': 141864, 'loss/train': 0.9689874053001404} 11/07/2021 17:13:06 - INFO - __main__ - Step 141866: {'lr': 3.7181079295194598e-06, 'samples': 27238272, 'steps': 141865, 'loss/train': 1.159136176109314} 11/07/2021 17:13:07 - INFO - __main__ - Step 141867: {'lr': 3.717196156072855e-06, 'samples': 27238464, 'steps': 141866, 'loss/train': 1.1913199424743652} 11/07/2021 17:13:07 - INFO - __main__ - Step 141868: {'lr': 3.7162844935974825e-06, 'samples': 27238656, 'steps': 141867, 'loss/train': 0.8698520660400391} 11/07/2021 17:13:07 - INFO - __main__ - Step 141869: {'lr': 3.715372942093759e-06, 'samples': 27238848, 'steps': 141868, 'loss/train': 1.0292677879333496} 11/07/2021 17:13:08 - INFO - __main__ - Step 141870: {'lr': 3.7144615015620997e-06, 'samples': 27239040, 'steps': 141869, 'loss/train': 1.5545071363449097} 11/07/2021 17:13:08 - INFO - __main__ - Step 141871: {'lr': 3.7135501720028943e-06, 'samples': 27239232, 'steps': 141870, 'loss/train': 1.1100753545761108} 11/07/2021 17:13:09 - INFO - __main__ - Step 141872: {'lr': 3.7126389534165306e-06, 'samples': 27239424, 'steps': 141871, 'loss/train': 1.4402413368225098} 11/07/2021 17:13:09 - INFO - __main__ - Step 141873: {'lr': 3.7117278458034807e-06, 'samples': 27239616, 'steps': 141872, 'loss/train': 1.6155917644500732} 11/07/2021 17:13:10 - INFO - __main__ - Step 141874: {'lr': 3.7108168491641615e-06, 'samples': 27239808, 'steps': 141873, 'loss/train': 1.2097046375274658} 11/07/2021 17:13:10 - INFO - __main__ - Step 141875: {'lr': 3.7099059634989053e-06, 'samples': 27240000, 'steps': 141874, 'loss/train': 0.8191941976547241} 11/07/2021 17:13:10 - INFO - __main__ - Step 141876: {'lr': 3.708995188808156e-06, 'samples': 27240192, 'steps': 141875, 'loss/train': 0.7478453516960144} 11/07/2021 17:13:12 - INFO - __main__ - Step 141877: {'lr': 3.708084525092359e-06, 'samples': 27240384, 'steps': 141876, 'loss/train': 1.865283727645874} 11/07/2021 17:13:12 - INFO - __main__ - Step 141878: {'lr': 3.707173972351874e-06, 'samples': 27240576, 'steps': 141877, 'loss/train': 0.5584873557090759} 11/07/2021 17:13:12 - INFO - __main__ - Step 141879: {'lr': 3.706263530587145e-06, 'samples': 27240768, 'steps': 141878, 'loss/train': 1.1014734506607056} 11/07/2021 17:13:13 - INFO - __main__ - Step 141880: {'lr': 3.705353199798589e-06, 'samples': 27240960, 'steps': 141879, 'loss/train': 1.1412848234176636} 11/07/2021 17:13:13 - INFO - __main__ - Step 141881: {'lr': 3.704442979986594e-06, 'samples': 27241152, 'steps': 141880, 'loss/train': 1.4255061149597168} 11/07/2021 17:13:13 - INFO - __main__ - Step 141882: {'lr': 3.703532871151549e-06, 'samples': 27241344, 'steps': 141881, 'loss/train': 1.4620356559753418} 11/07/2021 17:13:15 - INFO - __main__ - Step 141883: {'lr': 3.702622873293926e-06, 'samples': 27241536, 'steps': 141882, 'loss/train': 1.148549199104309} 11/07/2021 17:13:15 - INFO - __main__ - Step 141884: {'lr': 3.701712986414085e-06, 'samples': 27241728, 'steps': 141883, 'loss/train': 1.0946468114852905} 11/07/2021 17:13:15 - INFO - __main__ - Step 141885: {'lr': 3.7008032105124433e-06, 'samples': 27241920, 'steps': 141884, 'loss/train': 4.096440315246582} 11/07/2021 17:13:16 - INFO - __main__ - Step 141886: {'lr': 3.6998935455894444e-06, 'samples': 27242112, 'steps': 141885, 'loss/train': 1.0472108125686646} 11/07/2021 17:13:16 - INFO - __main__ - Step 141887: {'lr': 3.6989839916454494e-06, 'samples': 27242304, 'steps': 141886, 'loss/train': 1.2667814493179321} 11/07/2021 17:13:16 - INFO - __main__ - Step 141888: {'lr': 3.6980745486809296e-06, 'samples': 27242496, 'steps': 141887, 'loss/train': 1.4461404085159302} 11/07/2021 17:13:17 - INFO - __main__ - Step 141889: {'lr': 3.6971652166962187e-06, 'samples': 27242688, 'steps': 141888, 'loss/train': 1.3122329711914062} 11/07/2021 17:13:18 - INFO - __main__ - Step 141890: {'lr': 3.6962559956917606e-06, 'samples': 27242880, 'steps': 141889, 'loss/train': 1.6096512079238892} 11/07/2021 17:13:18 - INFO - __main__ - Step 141891: {'lr': 3.6953468856679996e-06, 'samples': 27243072, 'steps': 141890, 'loss/train': 1.216264009475708} 11/07/2021 17:13:18 - INFO - __main__ - Step 141892: {'lr': 3.694437886625296e-06, 'samples': 27243264, 'steps': 141891, 'loss/train': 1.8673992156982422} 11/07/2021 17:13:19 - INFO - __main__ - Step 141893: {'lr': 3.693528998564066e-06, 'samples': 27243456, 'steps': 141892, 'loss/train': 1.217932939529419} 11/07/2021 17:13:20 - INFO - __main__ - Step 141894: {'lr': 3.692620221484755e-06, 'samples': 27243648, 'steps': 141893, 'loss/train': 1.051261305809021} 11/07/2021 17:13:20 - INFO - __main__ - Step 141895: {'lr': 3.6917115553877224e-06, 'samples': 27243840, 'steps': 141894, 'loss/train': 1.1630334854125977} 11/07/2021 17:13:20 - INFO - __main__ - Step 141896: {'lr': 3.6908030002734127e-06, 'samples': 27244032, 'steps': 141895, 'loss/train': 0.9376685619354248} 11/07/2021 17:13:21 - INFO - __main__ - Step 141897: {'lr': 3.6898945561422424e-06, 'samples': 27244224, 'steps': 141896, 'loss/train': 1.1957231760025024} 11/07/2021 17:13:21 - INFO - __main__ - Step 141898: {'lr': 3.6889862229946004e-06, 'samples': 27244416, 'steps': 141897, 'loss/train': 0.1478184014558792} 11/07/2021 17:13:22 - INFO - __main__ - Step 141899: {'lr': 3.6880780008308747e-06, 'samples': 27244608, 'steps': 141898, 'loss/train': 1.673539400100708} 11/07/2021 17:13:23 - INFO - __main__ - Step 141900: {'lr': 3.6871698896515373e-06, 'samples': 27244800, 'steps': 141899, 'loss/train': 1.0653289556503296} 11/07/2021 17:13:23 - INFO - __main__ - Step 141901: {'lr': 3.6862618894569488e-06, 'samples': 27244992, 'steps': 141900, 'loss/train': 0.4576829969882965} 11/07/2021 17:13:23 - INFO - __main__ - Step 141902: {'lr': 3.6853540002475262e-06, 'samples': 27245184, 'steps': 141901, 'loss/train': 0.7869049906730652} 11/07/2021 17:13:24 - INFO - __main__ - Step 141903: {'lr': 3.684446222023685e-06, 'samples': 27245376, 'steps': 141902, 'loss/train': 1.3970229625701904} 11/07/2021 17:13:25 - INFO - __main__ - Step 141904: {'lr': 3.6835385547858148e-06, 'samples': 27245568, 'steps': 141903, 'loss/train': 1.32867431640625} 11/07/2021 17:13:25 - INFO - __main__ - Step 141905: {'lr': 3.6826309985343585e-06, 'samples': 27245760, 'steps': 141904, 'loss/train': 1.3304240703582764} 11/07/2021 17:13:25 - INFO - __main__ - Step 141906: {'lr': 3.681723553269706e-06, 'samples': 27245952, 'steps': 141905, 'loss/train': 1.175204873085022} 11/07/2021 17:13:26 - INFO - __main__ - Step 141907: {'lr': 3.6808162189922446e-06, 'samples': 27246144, 'steps': 141906, 'loss/train': 1.1503740549087524} 11/07/2021 17:13:26 - INFO - __main__ - Step 141908: {'lr': 3.6799089957024468e-06, 'samples': 27246336, 'steps': 141907, 'loss/train': 1.4851799011230469} 11/07/2021 17:13:27 - INFO - __main__ - Step 141909: {'lr': 3.6790018834006457e-06, 'samples': 27246528, 'steps': 141908, 'loss/train': 1.4143842458724976} 11/07/2021 17:13:27 - INFO - __main__ - Step 141910: {'lr': 3.678094882087313e-06, 'samples': 27246720, 'steps': 141909, 'loss/train': 1.5374921560287476} 11/07/2021 17:13:28 - INFO - __main__ - Step 141911: {'lr': 3.6771879917628094e-06, 'samples': 27246912, 'steps': 141910, 'loss/train': 0.8817831873893738} 11/07/2021 17:13:28 - INFO - __main__ - Step 141912: {'lr': 3.676281212427579e-06, 'samples': 27247104, 'steps': 141911, 'loss/train': 1.1283332109451294} 11/07/2021 17:13:28 - INFO - __main__ - Step 141913: {'lr': 3.675374544081983e-06, 'samples': 27247296, 'steps': 141912, 'loss/train': 1.310847520828247} 11/07/2021 17:13:29 - INFO - __main__ - Step 141914: {'lr': 3.6744679867265208e-06, 'samples': 27247488, 'steps': 141913, 'loss/train': 1.5222522020339966} 11/07/2021 17:13:30 - INFO - __main__ - Step 141915: {'lr': 3.6735615403614977e-06, 'samples': 27247680, 'steps': 141914, 'loss/train': 1.5407474040985107} 11/07/2021 17:13:30 - INFO - __main__ - Step 141916: {'lr': 3.672655204987385e-06, 'samples': 27247872, 'steps': 141915, 'loss/train': 1.2960041761398315} 11/07/2021 17:13:31 - INFO - __main__ - Step 141917: {'lr': 3.6717489806045725e-06, 'samples': 27248064, 'steps': 141916, 'loss/train': 1.3033576011657715} 11/07/2021 17:13:31 - INFO - __main__ - Step 141918: {'lr': 3.6708428672134475e-06, 'samples': 27248256, 'steps': 141917, 'loss/train': 1.406219482421875} 11/07/2021 17:13:31 - INFO - __main__ - Step 141919: {'lr': 3.669936864814455e-06, 'samples': 27248448, 'steps': 141918, 'loss/train': 1.616262674331665} 11/07/2021 17:13:32 - INFO - __main__ - Step 141920: {'lr': 3.669030973407983e-06, 'samples': 27248640, 'steps': 141919, 'loss/train': 1.5126254558563232} 11/07/2021 17:13:33 - INFO - __main__ - Step 141921: {'lr': 3.668125192994448e-06, 'samples': 27248832, 'steps': 141920, 'loss/train': 1.1860628128051758} 11/07/2021 17:13:33 - INFO - __main__ - Step 141922: {'lr': 3.6672195235742667e-06, 'samples': 27249024, 'steps': 141921, 'loss/train': 1.5104260444641113} 11/07/2021 17:13:33 - INFO - __main__ - Step 141923: {'lr': 3.6663139651477993e-06, 'samples': 27249216, 'steps': 141922, 'loss/train': 1.101530909538269} 11/07/2021 17:13:34 - INFO - __main__ - Step 141924: {'lr': 3.6654085177155183e-06, 'samples': 27249408, 'steps': 141923, 'loss/train': 0.6029496788978577} 11/07/2021 17:13:35 - INFO - __main__ - Step 141925: {'lr': 3.664503181277812e-06, 'samples': 27249600, 'steps': 141924, 'loss/train': 1.7550969123840332} 11/07/2021 17:13:35 - INFO - __main__ - Step 141926: {'lr': 3.6635979558350685e-06, 'samples': 27249792, 'steps': 141925, 'loss/train': 1.1241413354873657} 11/07/2021 17:13:36 - INFO - __main__ - Step 141927: {'lr': 3.6626928413877046e-06, 'samples': 27249984, 'steps': 141926, 'loss/train': 0.8933755159378052} 11/07/2021 17:13:36 - INFO - __main__ - Step 141928: {'lr': 3.661787837936137e-06, 'samples': 27250176, 'steps': 141927, 'loss/train': 1.0329991579055786} 11/07/2021 17:13:36 - INFO - __main__ - Step 141929: {'lr': 3.6608829454807537e-06, 'samples': 27250368, 'steps': 141928, 'loss/train': 1.3385262489318848} 11/07/2021 17:13:37 - INFO - __main__ - Step 141930: {'lr': 3.659978164021971e-06, 'samples': 27250560, 'steps': 141929, 'loss/train': 1.4397997856140137} 11/07/2021 17:13:38 - INFO - __main__ - Step 141931: {'lr': 3.6590734935602053e-06, 'samples': 27250752, 'steps': 141930, 'loss/train': 1.4189472198486328} 11/07/2021 17:13:38 - INFO - __main__ - Step 141932: {'lr': 3.6581689340958733e-06, 'samples': 27250944, 'steps': 141931, 'loss/train': 0.9898349642753601} 11/07/2021 17:13:38 - INFO - __main__ - Step 141933: {'lr': 3.6572644856293636e-06, 'samples': 27251136, 'steps': 141932, 'loss/train': 1.2877277135849} 11/07/2021 17:13:39 - INFO - __main__ - Step 141934: {'lr': 3.656360148161092e-06, 'samples': 27251328, 'steps': 141933, 'loss/train': 1.0794447660446167} 11/07/2021 17:13:40 - INFO - __main__ - Step 141935: {'lr': 3.6554559216914475e-06, 'samples': 27251520, 'steps': 141934, 'loss/train': 1.0008493661880493} 11/07/2021 17:13:40 - INFO - __main__ - Step 141936: {'lr': 3.6545518062208736e-06, 'samples': 27251712, 'steps': 141935, 'loss/train': 1.234704613685608} 11/07/2021 17:13:40 - INFO - __main__ - Step 141937: {'lr': 3.6536478017497323e-06, 'samples': 27251904, 'steps': 141936, 'loss/train': 0.9916374087333679} 11/07/2021 17:13:41 - INFO - __main__ - Step 141938: {'lr': 3.6527439082784665e-06, 'samples': 27252096, 'steps': 141937, 'loss/train': 1.0167546272277832} 11/07/2021 17:13:41 - INFO - __main__ - Step 141939: {'lr': 3.651840125807493e-06, 'samples': 27252288, 'steps': 141938, 'loss/train': 1.5014824867248535} 11/07/2021 17:13:42 - INFO - __main__ - Step 141940: {'lr': 3.6509364543371724e-06, 'samples': 27252480, 'steps': 141939, 'loss/train': 0.9008221626281738} 11/07/2021 17:13:43 - INFO - __main__ - Step 141941: {'lr': 3.650032893867977e-06, 'samples': 27252672, 'steps': 141940, 'loss/train': 1.3609899282455444} 11/07/2021 17:13:43 - INFO - __main__ - Step 141942: {'lr': 3.64912944440024e-06, 'samples': 27252864, 'steps': 141941, 'loss/train': 0.9819684624671936} 11/07/2021 17:13:43 - INFO - __main__ - Step 141943: {'lr': 3.6482261059344046e-06, 'samples': 27253056, 'steps': 141942, 'loss/train': 0.4880847632884979} 11/07/2021 17:13:44 - INFO - __main__ - Step 141944: {'lr': 3.647322878470888e-06, 'samples': 27253248, 'steps': 141943, 'loss/train': 1.0544401407241821} 11/07/2021 17:13:44 - INFO - __main__ - Step 141945: {'lr': 3.6464197620100783e-06, 'samples': 27253440, 'steps': 141944, 'loss/train': 1.1527694463729858} 11/07/2021 17:13:45 - INFO - __main__ - Step 141946: {'lr': 3.6455167565523916e-06, 'samples': 27253632, 'steps': 141945, 'loss/train': 1.3306174278259277} 11/07/2021 17:13:45 - INFO - __main__ - Step 141947: {'lr': 3.644613862098245e-06, 'samples': 27253824, 'steps': 141946, 'loss/train': 1.2584171295166016} 11/07/2021 17:13:46 - INFO - __main__ - Step 141948: {'lr': 3.6437110786480265e-06, 'samples': 27254016, 'steps': 141947, 'loss/train': 1.3434098958969116} 11/07/2021 17:13:46 - INFO - __main__ - Step 141949: {'lr': 3.642808406202125e-06, 'samples': 27254208, 'steps': 141948, 'loss/train': 1.2649039030075073} 11/07/2021 17:13:46 - INFO - __main__ - Step 141950: {'lr': 3.641905844761012e-06, 'samples': 27254400, 'steps': 141949, 'loss/train': 1.6173174381256104} 11/07/2021 17:13:47 - INFO - __main__ - Step 141951: {'lr': 3.6410033943250207e-06, 'samples': 27254592, 'steps': 141950, 'loss/train': 1.4913582801818848} 11/07/2021 17:13:48 - INFO - __main__ - Step 141952: {'lr': 3.6401010548946234e-06, 'samples': 27254784, 'steps': 141951, 'loss/train': 1.2825230360031128} 11/07/2021 17:13:48 - INFO - __main__ - Step 141953: {'lr': 3.6391988264701804e-06, 'samples': 27254976, 'steps': 141952, 'loss/train': 1.1602814197540283} 11/07/2021 17:13:48 - INFO - __main__ - Step 141954: {'lr': 3.638296709052108e-06, 'samples': 27255168, 'steps': 141953, 'loss/train': 1.5018678903579712} 11/07/2021 17:13:49 - INFO - __main__ - Step 141955: {'lr': 3.6373947026408505e-06, 'samples': 27255360, 'steps': 141954, 'loss/train': 1.3609133958816528} 11/07/2021 17:13:50 - INFO - __main__ - Step 141956: {'lr': 3.6364928072367407e-06, 'samples': 27255552, 'steps': 141955, 'loss/train': 1.2247474193572998} 11/07/2021 17:13:50 - INFO - __main__ - Step 141957: {'lr': 3.635591022840251e-06, 'samples': 27255744, 'steps': 141956, 'loss/train': 1.3293685913085938} 11/07/2021 17:13:51 - INFO - __main__ - Step 141958: {'lr': 3.634689349451742e-06, 'samples': 27255936, 'steps': 141957, 'loss/train': 1.3791881799697876} 11/07/2021 17:13:51 - INFO - __main__ - Step 141959: {'lr': 3.6337877870716574e-06, 'samples': 27256128, 'steps': 141958, 'loss/train': 1.179450273513794} 11/07/2021 17:13:51 - INFO - __main__ - Step 141960: {'lr': 3.6328863357003582e-06, 'samples': 27256320, 'steps': 141959, 'loss/train': 1.3646228313446045} 11/07/2021 17:13:52 - INFO - __main__ - Step 141961: {'lr': 3.6319849953383164e-06, 'samples': 27256512, 'steps': 141960, 'loss/train': 1.1328577995300293} 11/07/2021 17:13:53 - INFO - __main__ - Step 141962: {'lr': 3.631083765985865e-06, 'samples': 27256704, 'steps': 141961, 'loss/train': 1.5596235990524292} 11/07/2021 17:13:53 - INFO - __main__ - Step 141963: {'lr': 3.6301826476434764e-06, 'samples': 27256896, 'steps': 141962, 'loss/train': 0.8423997163772583} 11/07/2021 17:13:53 - INFO - __main__ - Step 141964: {'lr': 3.6292816403115104e-06, 'samples': 27257088, 'steps': 141963, 'loss/train': 1.390034556388855} 11/07/2021 17:13:54 - INFO - __main__ - Step 141965: {'lr': 3.628380743990384e-06, 'samples': 27257280, 'steps': 141964, 'loss/train': 1.307847261428833} 11/07/2021 17:13:55 - INFO - __main__ - Step 141966: {'lr': 3.627479958680513e-06, 'samples': 27257472, 'steps': 141965, 'loss/train': 1.373599886894226} 11/07/2021 17:13:55 - INFO - __main__ - Step 141967: {'lr': 3.6265792843822863e-06, 'samples': 27257664, 'steps': 141966, 'loss/train': 1.333953857421875} 11/07/2021 17:13:56 - INFO - __main__ - Step 141968: {'lr': 3.6256787210961485e-06, 'samples': 27257856, 'steps': 141967, 'loss/train': 0.8307627439498901} 11/07/2021 17:13:56 - INFO - __main__ - Step 141969: {'lr': 3.6247782688224596e-06, 'samples': 27258048, 'steps': 141968, 'loss/train': 1.0873032808303833} 11/07/2021 17:13:56 - INFO - __main__ - Step 141970: {'lr': 3.623877927561664e-06, 'samples': 27258240, 'steps': 141969, 'loss/train': 1.5648263692855835} 11/07/2021 17:13:57 - INFO - __main__ - Step 141971: {'lr': 3.6229776973141228e-06, 'samples': 27258432, 'steps': 141970, 'loss/train': 1.1458240747451782} 11/07/2021 17:13:58 - INFO - __main__ - Step 141972: {'lr': 3.6220775780802794e-06, 'samples': 27258624, 'steps': 141971, 'loss/train': 1.2151778936386108} 11/07/2021 17:13:58 - INFO - __main__ - Step 141973: {'lr': 3.6211775698605232e-06, 'samples': 27258816, 'steps': 141972, 'loss/train': 0.8051931262016296} 11/07/2021 17:13:58 - INFO - __main__ - Step 141974: {'lr': 3.620277672655242e-06, 'samples': 27259008, 'steps': 141973, 'loss/train': 1.488478183746338} 11/07/2021 17:13:59 - INFO - __main__ - Step 141975: {'lr': 3.6193778864648805e-06, 'samples': 27259200, 'steps': 141974, 'loss/train': 0.3597372770309448} 11/07/2021 17:13:59 - INFO - __main__ - Step 141976: {'lr': 3.618478211289827e-06, 'samples': 27259392, 'steps': 141975, 'loss/train': 1.2476595640182495} 11/07/2021 17:14:00 - INFO - __main__ - Step 141977: {'lr': 3.6175786471304706e-06, 'samples': 27259584, 'steps': 141976, 'loss/train': 0.8425622582435608} 11/07/2021 17:14:00 - INFO - __main__ - Step 141978: {'lr': 3.6166791939872544e-06, 'samples': 27259776, 'steps': 141977, 'loss/train': 1.1128250360488892} 11/07/2021 17:14:01 - INFO - __main__ - Step 141979: {'lr': 3.61577985186054e-06, 'samples': 27259968, 'steps': 141978, 'loss/train': 1.2715033292770386} 11/07/2021 17:14:01 - INFO - __main__ - Step 141980: {'lr': 3.6148806207507714e-06, 'samples': 27260160, 'steps': 141979, 'loss/train': 1.2248948812484741} 11/07/2021 17:14:01 - INFO - __main__ - Step 141981: {'lr': 3.6139815006583367e-06, 'samples': 27260352, 'steps': 141980, 'loss/train': 1.304992914199829} 11/07/2021 17:14:02 - INFO - __main__ - Step 141982: {'lr': 3.6130824915836246e-06, 'samples': 27260544, 'steps': 141981, 'loss/train': 1.3572125434875488} 11/07/2021 17:14:03 - INFO - __main__ - Step 141983: {'lr': 3.61218359352708e-06, 'samples': 27260736, 'steps': 141982, 'loss/train': 1.6053742170333862} 11/07/2021 17:14:03 - INFO - __main__ - Step 141984: {'lr': 3.6112848064890626e-06, 'samples': 27260928, 'steps': 141983, 'loss/train': 1.1796085834503174} 11/07/2021 17:14:04 - INFO - __main__ - Step 141985: {'lr': 3.610386130469989e-06, 'samples': 27261120, 'steps': 141984, 'loss/train': 1.176135778427124} 11/07/2021 17:14:04 - INFO - __main__ - Step 141986: {'lr': 3.6094875654702765e-06, 'samples': 27261312, 'steps': 141985, 'loss/train': 1.3698760271072388} 11/07/2021 17:14:05 - INFO - __main__ - Step 141987: {'lr': 3.6085891114903402e-06, 'samples': 27261504, 'steps': 141986, 'loss/train': 1.460477352142334} 11/07/2021 17:14:05 - INFO - __main__ - Step 141988: {'lr': 3.6076907685305693e-06, 'samples': 27261696, 'steps': 141987, 'loss/train': 1.2489674091339111} 11/07/2021 17:14:06 - INFO - __main__ - Step 141989: {'lr': 3.60679253659138e-06, 'samples': 27261888, 'steps': 141988, 'loss/train': 1.4375176429748535} 11/07/2021 17:14:06 - INFO - __main__ - Step 141990: {'lr': 3.605894415673133e-06, 'samples': 27262080, 'steps': 141989, 'loss/train': 1.3720098733901978} 11/07/2021 17:14:06 - INFO - __main__ - Step 141991: {'lr': 3.604996405776301e-06, 'samples': 27262272, 'steps': 141990, 'loss/train': 0.7359317541122437} 11/07/2021 17:14:07 - INFO - __main__ - Step 141992: {'lr': 3.6040985069012433e-06, 'samples': 27262464, 'steps': 141991, 'loss/train': 1.5935593843460083} 11/07/2021 17:14:08 - INFO - __main__ - Step 141993: {'lr': 3.6032007190483773e-06, 'samples': 27262656, 'steps': 141992, 'loss/train': 1.6410852670669556} 11/07/2021 17:14:08 - INFO - __main__ - Step 141994: {'lr': 3.602303042218119e-06, 'samples': 27262848, 'steps': 141993, 'loss/train': 1.5436503887176514} 11/07/2021 17:14:08 - INFO - __main__ - Step 141995: {'lr': 3.601405476410857e-06, 'samples': 27263040, 'steps': 141994, 'loss/train': 1.254608154296875} 11/07/2021 17:14:09 - INFO - __main__ - Step 141996: {'lr': 3.6005080216269804e-06, 'samples': 27263232, 'steps': 141995, 'loss/train': 1.2628413438796997} 11/07/2021 17:14:10 - INFO - __main__ - Step 141997: {'lr': 3.5996106778669326e-06, 'samples': 27263424, 'steps': 141996, 'loss/train': 1.4068403244018555} 11/07/2021 17:14:10 - INFO - __main__ - Step 141998: {'lr': 3.598713445131102e-06, 'samples': 27263616, 'steps': 141997, 'loss/train': 1.4803452491760254} 11/07/2021 17:14:11 - INFO - __main__ - Step 141999: {'lr': 3.597816323419878e-06, 'samples': 27263808, 'steps': 141998, 'loss/train': 1.2615071535110474} 11/07/2021 17:14:11 - INFO - __main__ - Step 142000: {'lr': 3.5969193127336762e-06, 'samples': 27264000, 'steps': 141999, 'loss/train': 0.7151772975921631} 11/07/2021 17:14:11 - INFO - __main__ - Step 142001: {'lr': 3.5960224130728858e-06, 'samples': 27264192, 'steps': 142000, 'loss/train': 1.2325923442840576} 11/07/2021 17:14:12 - INFO - __main__ - Step 142002: {'lr': 3.5951256244379506e-06, 'samples': 27264384, 'steps': 142001, 'loss/train': 1.266796588897705} 11/07/2021 17:14:13 - INFO - __main__ - Step 142003: {'lr': 3.594228946829231e-06, 'samples': 27264576, 'steps': 142002, 'loss/train': 1.2309346199035645} 11/07/2021 17:14:13 - INFO - __main__ - Step 142004: {'lr': 3.593332380247144e-06, 'samples': 27264768, 'steps': 142003, 'loss/train': 1.0643059015274048} 11/07/2021 17:14:13 - INFO - __main__ - Step 142005: {'lr': 3.5924359246921055e-06, 'samples': 27264960, 'steps': 142004, 'loss/train': 1.3402063846588135} 11/07/2021 17:14:14 - INFO - __main__ - Step 142006: {'lr': 3.591539580164532e-06, 'samples': 27265152, 'steps': 142005, 'loss/train': 1.2576545476913452} 11/07/2021 17:14:15 - INFO - __main__ - Step 142007: {'lr': 3.590643346664785e-06, 'samples': 27265344, 'steps': 142006, 'loss/train': 0.6447149515151978} 11/07/2021 17:14:15 - INFO - __main__ - Step 142008: {'lr': 3.5897472241933072e-06, 'samples': 27265536, 'steps': 142007, 'loss/train': 0.7781431674957275} 11/07/2021 17:14:15 - INFO - __main__ - Step 142009: {'lr': 3.588851212750488e-06, 'samples': 27265728, 'steps': 142008, 'loss/train': 1.3319108486175537} 11/07/2021 17:14:16 - INFO - __main__ - Step 142010: {'lr': 3.5879553123367438e-06, 'samples': 27265920, 'steps': 142009, 'loss/train': 1.5842782258987427} 11/07/2021 17:14:16 - INFO - __main__ - Step 142011: {'lr': 3.587059522952435e-06, 'samples': 27266112, 'steps': 142010, 'loss/train': 1.598928451538086} 11/07/2021 17:14:17 - INFO - __main__ - Step 142012: {'lr': 3.586163844598006e-06, 'samples': 27266304, 'steps': 142011, 'loss/train': 1.2912007570266724} 11/07/2021 17:14:18 - INFO - __main__ - Step 142013: {'lr': 3.5852682772738456e-06, 'samples': 27266496, 'steps': 142012, 'loss/train': 1.597283959388733} 11/07/2021 17:14:18 - INFO - __main__ - Step 142014: {'lr': 3.5843728209803695e-06, 'samples': 27266688, 'steps': 142013, 'loss/train': 1.1395370960235596} 11/07/2021 17:14:18 - INFO - __main__ - Step 142015: {'lr': 3.5834774757179665e-06, 'samples': 27266880, 'steps': 142014, 'loss/train': 1.3470423221588135} 11/07/2021 17:14:19 - INFO - __main__ - Step 142016: {'lr': 3.582582241487026e-06, 'samples': 27267072, 'steps': 142015, 'loss/train': 1.863046407699585} 11/07/2021 17:14:19 - INFO - __main__ - Step 142017: {'lr': 3.581687118287991e-06, 'samples': 27267264, 'steps': 142016, 'loss/train': 0.9069520235061646} 11/07/2021 17:14:20 - INFO - __main__ - Step 142018: {'lr': 3.5807921061212503e-06, 'samples': 27267456, 'steps': 142017, 'loss/train': 1.2110668420791626} 11/07/2021 17:14:20 - INFO - __main__ - Step 142019: {'lr': 3.5798972049871926e-06, 'samples': 27267648, 'steps': 142018, 'loss/train': 0.6867234110832214} 11/07/2021 17:14:21 - INFO - __main__ - Step 142020: {'lr': 3.5790024148862345e-06, 'samples': 27267840, 'steps': 142019, 'loss/train': 1.5563069581985474} 11/07/2021 17:14:21 - INFO - __main__ - Step 142021: {'lr': 3.578107735818792e-06, 'samples': 27268032, 'steps': 142020, 'loss/train': 0.6539595723152161} 11/07/2021 17:14:21 - INFO - __main__ - Step 142022: {'lr': 3.5772131677852536e-06, 'samples': 27268224, 'steps': 142021, 'loss/train': 0.7153881788253784} 11/07/2021 17:14:22 - INFO - __main__ - Step 142023: {'lr': 3.5763187107860083e-06, 'samples': 27268416, 'steps': 142022, 'loss/train': 1.4420875310897827} 11/07/2021 17:14:23 - INFO - __main__ - Step 142024: {'lr': 3.5754243648214445e-06, 'samples': 27268608, 'steps': 142023, 'loss/train': 0.050808753818273544} 11/07/2021 17:14:23 - INFO - __main__ - Step 142025: {'lr': 3.5745301298920343e-06, 'samples': 27268800, 'steps': 142024, 'loss/train': 1.3995044231414795} 11/07/2021 17:14:24 - INFO - __main__ - Step 142026: {'lr': 3.5736360059981098e-06, 'samples': 27268992, 'steps': 142025, 'loss/train': 1.3780362606048584} 11/07/2021 17:14:24 - INFO - __main__ - Step 142027: {'lr': 3.572741993140116e-06, 'samples': 27269184, 'steps': 142026, 'loss/train': 1.5914779901504517} 11/07/2021 17:14:25 - INFO - __main__ - Step 142028: {'lr': 3.5718480913184136e-06, 'samples': 27269376, 'steps': 142027, 'loss/train': 1.416524052619934} 11/07/2021 17:14:25 - INFO - __main__ - Step 142029: {'lr': 3.5709543005334745e-06, 'samples': 27269568, 'steps': 142028, 'loss/train': 1.456498384475708} 11/07/2021 17:14:26 - INFO - __main__ - Step 142030: {'lr': 3.5700606207856313e-06, 'samples': 27269760, 'steps': 142029, 'loss/train': 1.6688251495361328} 11/07/2021 17:14:26 - INFO - __main__ - Step 142031: {'lr': 3.5691670520753283e-06, 'samples': 27269952, 'steps': 142030, 'loss/train': 1.0884923934936523} 11/07/2021 17:14:26 - INFO - __main__ - Step 142032: {'lr': 3.5682735944029542e-06, 'samples': 27270144, 'steps': 142031, 'loss/train': 1.0735936164855957} 11/07/2021 17:14:27 - INFO - __main__ - Step 142033: {'lr': 3.5673802477689255e-06, 'samples': 27270336, 'steps': 142032, 'loss/train': 1.61405348777771} 11/07/2021 17:14:28 - INFO - __main__ - Step 142034: {'lr': 3.5664870121736028e-06, 'samples': 27270528, 'steps': 142033, 'loss/train': 0.564548671245575} 11/07/2021 17:14:28 - INFO - __main__ - Step 142035: {'lr': 3.5655938876174575e-06, 'samples': 27270720, 'steps': 142034, 'loss/train': 1.3697885274887085} 11/07/2021 17:14:29 - INFO - __main__ - Step 142036: {'lr': 3.5647008741008235e-06, 'samples': 27270912, 'steps': 142035, 'loss/train': 1.8481760025024414} 11/07/2021 17:14:29 - INFO - __main__ - Step 142037: {'lr': 3.5638079716241446e-06, 'samples': 27271104, 'steps': 142036, 'loss/train': 1.3973745107650757} 11/07/2021 17:14:31 - INFO - __main__ - Step 142038: {'lr': 3.562915180187809e-06, 'samples': 27271296, 'steps': 142037, 'loss/train': 1.2496839761734009} 11/07/2021 17:14:32 - INFO - __main__ - Step 142039: {'lr': 3.5620224997922337e-06, 'samples': 27271488, 'steps': 142038, 'loss/train': 1.107820987701416} 11/07/2021 17:14:32 - INFO - __main__ - Step 142040: {'lr': 3.561129930437779e-06, 'samples': 27271680, 'steps': 142039, 'loss/train': 1.1459952592849731} 11/07/2021 17:14:32 - INFO - __main__ - Step 142041: {'lr': 3.560237472124889e-06, 'samples': 27271872, 'steps': 142040, 'loss/train': 1.2404077053070068} 11/07/2021 17:14:33 - INFO - __main__ - Step 142042: {'lr': 3.559345124853952e-06, 'samples': 27272064, 'steps': 142041, 'loss/train': 1.200998067855835} 11/07/2021 17:14:33 - INFO - __main__ - Step 142043: {'lr': 3.5584528886253853e-06, 'samples': 27272256, 'steps': 142042, 'loss/train': 1.2956429719924927} 11/07/2021 17:14:34 - INFO - __main__ - Step 142044: {'lr': 3.5575607634395492e-06, 'samples': 27272448, 'steps': 142043, 'loss/train': 1.479299783706665} 11/07/2021 17:14:34 - INFO - __main__ - Step 142045: {'lr': 3.5566687492969153e-06, 'samples': 27272640, 'steps': 142044, 'loss/train': 1.708937644958496} 11/07/2021 17:14:34 - INFO - __main__ - Step 142046: {'lr': 3.555776846197817e-06, 'samples': 27272832, 'steps': 142045, 'loss/train': 1.7292296886444092} 11/07/2021 17:14:35 - INFO - __main__ - Step 142047: {'lr': 3.5548850541426702e-06, 'samples': 27273024, 'steps': 142046, 'loss/train': 0.5505019426345825} 11/07/2021 17:14:36 - INFO - __main__ - Step 142048: {'lr': 3.5539933731319196e-06, 'samples': 27273216, 'steps': 142047, 'loss/train': 1.813782811164856} 11/07/2021 17:14:36 - INFO - __main__ - Step 142049: {'lr': 3.5531018031659255e-06, 'samples': 27273408, 'steps': 142048, 'loss/train': 1.3949395418167114} 11/07/2021 17:14:36 - INFO - __main__ - Step 142050: {'lr': 3.552210344245105e-06, 'samples': 27273600, 'steps': 142049, 'loss/train': 0.9861950278282166} 11/07/2021 17:14:37 - INFO - __main__ - Step 142051: {'lr': 3.5513189963698456e-06, 'samples': 27273792, 'steps': 142050, 'loss/train': 1.491127610206604} 11/07/2021 17:14:37 - INFO - __main__ - Step 142052: {'lr': 3.5504277595405645e-06, 'samples': 27273984, 'steps': 142051, 'loss/train': 1.2167637348175049} 11/07/2021 17:14:38 - INFO - __main__ - Step 142053: {'lr': 3.5495366337576497e-06, 'samples': 27274176, 'steps': 142052, 'loss/train': 1.5432265996932983} 11/07/2021 17:14:39 - INFO - __main__ - Step 142054: {'lr': 3.548645619021518e-06, 'samples': 27274368, 'steps': 142053, 'loss/train': 1.7722090482711792} 11/07/2021 17:14:39 - INFO - __main__ - Step 142055: {'lr': 3.5477547153325573e-06, 'samples': 27274560, 'steps': 142054, 'loss/train': 1.2517012357711792} 11/07/2021 17:14:39 - INFO - __main__ - Step 142056: {'lr': 3.546863922691157e-06, 'samples': 27274752, 'steps': 142055, 'loss/train': 1.3506226539611816} 11/07/2021 17:14:40 - INFO - __main__ - Step 142057: {'lr': 3.5459732410977606e-06, 'samples': 27274944, 'steps': 142056, 'loss/train': 1.534045696258545} 11/07/2021 17:14:41 - INFO - __main__ - Step 142058: {'lr': 3.5450826705527574e-06, 'samples': 27275136, 'steps': 142057, 'loss/train': 1.2470756769180298} 11/07/2021 17:14:42 - INFO - __main__ - Step 142059: {'lr': 3.5441922110565074e-06, 'samples': 27275328, 'steps': 142058, 'loss/train': 1.176367163658142} 11/07/2021 17:14:42 - INFO - __main__ - Step 142060: {'lr': 3.543301862609455e-06, 'samples': 27275520, 'steps': 142059, 'loss/train': 2.7527823448181152} 11/07/2021 17:14:42 - INFO - __main__ - Step 142061: {'lr': 3.5424116252119885e-06, 'samples': 27275712, 'steps': 142060, 'loss/train': 1.2985634803771973} 11/07/2021 17:14:43 - INFO - __main__ - Step 142062: {'lr': 3.541521498864525e-06, 'samples': 27275904, 'steps': 142061, 'loss/train': 0.8384346961975098} 11/07/2021 17:14:44 - INFO - __main__ - Step 142063: {'lr': 3.5406314835674524e-06, 'samples': 27276096, 'steps': 142062, 'loss/train': 1.1482181549072266} 11/07/2021 17:14:44 - INFO - __main__ - Step 142064: {'lr': 3.5397415793211317e-06, 'samples': 27276288, 'steps': 142063, 'loss/train': 0.977854311466217} 11/07/2021 17:14:44 - INFO - __main__ - Step 142065: {'lr': 3.538851786126035e-06, 'samples': 27276480, 'steps': 142064, 'loss/train': 1.2239922285079956} 11/07/2021 17:14:45 - INFO - __main__ - Step 142066: {'lr': 3.537962103982495e-06, 'samples': 27276672, 'steps': 142065, 'loss/train': 1.197359323501587} 11/07/2021 17:14:45 - INFO - __main__ - Step 142067: {'lr': 3.5370725328909557e-06, 'samples': 27276864, 'steps': 142066, 'loss/train': 1.3144967555999756} 11/07/2021 17:14:45 - INFO - __main__ - Step 142068: {'lr': 3.536183072851834e-06, 'samples': 27277056, 'steps': 142067, 'loss/train': 1.4993597269058228} 11/07/2021 17:14:46 - INFO - __main__ - Step 142069: {'lr': 3.5352937238654627e-06, 'samples': 27277248, 'steps': 142068, 'loss/train': 1.4884066581726074} 11/07/2021 17:14:47 - INFO - __main__ - Step 142070: {'lr': 3.534404485932313e-06, 'samples': 27277440, 'steps': 142069, 'loss/train': 0.4313313364982605} 11/07/2021 17:14:47 - INFO - __main__ - Step 142071: {'lr': 3.533515359052747e-06, 'samples': 27277632, 'steps': 142070, 'loss/train': 1.1298803091049194} 11/07/2021 17:14:48 - INFO - __main__ - Step 142072: {'lr': 3.53262634322718e-06, 'samples': 27277824, 'steps': 142071, 'loss/train': 1.2402127981185913} 11/07/2021 17:14:48 - INFO - __main__ - Step 142073: {'lr': 3.5317374384560286e-06, 'samples': 27278016, 'steps': 142072, 'loss/train': 0.8443664908409119} 11/07/2021 17:14:49 - INFO - __main__ - Step 142074: {'lr': 3.5308486447396537e-06, 'samples': 27278208, 'steps': 142073, 'loss/train': 1.152526617050171} 11/07/2021 17:14:49 - INFO - __main__ - Step 142075: {'lr': 3.5299599620784716e-06, 'samples': 27278400, 'steps': 142074, 'loss/train': 1.3103587627410889} 11/07/2021 17:14:50 - INFO - __main__ - Step 142076: {'lr': 3.529071390472899e-06, 'samples': 27278592, 'steps': 142075, 'loss/train': 1.2101632356643677} 11/07/2021 17:14:50 - INFO - __main__ - Step 142077: {'lr': 3.5281829299233237e-06, 'samples': 27278784, 'steps': 142076, 'loss/train': 0.8564345836639404} 11/07/2021 17:14:50 - INFO - __main__ - Step 142078: {'lr': 3.5272945804301347e-06, 'samples': 27278976, 'steps': 142077, 'loss/train': 1.3799455165863037} 11/07/2021 17:14:52 - INFO - __main__ - Step 142079: {'lr': 3.5264063419937486e-06, 'samples': 27279168, 'steps': 142078, 'loss/train': 1.8181322813034058} 11/07/2021 17:14:52 - INFO - __main__ - Step 142080: {'lr': 3.5255182146145535e-06, 'samples': 27279360, 'steps': 142079, 'loss/train': 1.2033567428588867} 11/07/2021 17:14:52 - INFO - __main__ - Step 142081: {'lr': 3.5246301982929385e-06, 'samples': 27279552, 'steps': 142080, 'loss/train': 1.4908559322357178} 11/07/2021 17:14:53 - INFO - __main__ - Step 142082: {'lr': 3.5237422930293474e-06, 'samples': 27279744, 'steps': 142081, 'loss/train': 1.3829742670059204} 11/07/2021 17:14:53 - INFO - __main__ - Step 142083: {'lr': 3.5228544988241682e-06, 'samples': 27279936, 'steps': 142082, 'loss/train': 1.2210980653762817} 11/07/2021 17:14:54 - INFO - __main__ - Step 142084: {'lr': 3.5219668156777906e-06, 'samples': 27280128, 'steps': 142083, 'loss/train': 1.4965405464172363} 11/07/2021 17:14:54 - INFO - __main__ - Step 142085: {'lr': 3.521079243590575e-06, 'samples': 27280320, 'steps': 142084, 'loss/train': 1.7148659229278564} 11/07/2021 17:14:55 - INFO - __main__ - Step 142086: {'lr': 3.5201917825629646e-06, 'samples': 27280512, 'steps': 142085, 'loss/train': 0.9739064574241638} 11/07/2021 17:14:55 - INFO - __main__ - Step 142087: {'lr': 3.5193044325953772e-06, 'samples': 27280704, 'steps': 142086, 'loss/train': 1.2709687948226929} 11/07/2021 17:14:55 - INFO - __main__ - Step 142088: {'lr': 3.5184171936881724e-06, 'samples': 27280896, 'steps': 142087, 'loss/train': 1.7747938632965088} 11/07/2021 17:14:57 - INFO - __main__ - Step 142089: {'lr': 3.5175300658417677e-06, 'samples': 27281088, 'steps': 142088, 'loss/train': 0.18981748819351196} 11/07/2021 17:14:57 - INFO - __main__ - Step 142090: {'lr': 3.5166430490565506e-06, 'samples': 27281280, 'steps': 142089, 'loss/train': 1.211336374282837} 11/07/2021 17:14:57 - INFO - __main__ - Step 142091: {'lr': 3.515756143332938e-06, 'samples': 27281472, 'steps': 142090, 'loss/train': 1.2553035020828247} 11/07/2021 17:14:58 - INFO - __main__ - Step 142092: {'lr': 3.514869348671318e-06, 'samples': 27281664, 'steps': 142091, 'loss/train': 1.3641986846923828} 11/07/2021 17:14:58 - INFO - __main__ - Step 142093: {'lr': 3.5139826650721073e-06, 'samples': 27281856, 'steps': 142092, 'loss/train': 1.1426702737808228} 11/07/2021 17:14:58 - INFO - __main__ - Step 142094: {'lr': 3.513096092535667e-06, 'samples': 27282048, 'steps': 142093, 'loss/train': 1.7175533771514893} 11/07/2021 17:15:00 - INFO - __main__ - Step 142095: {'lr': 3.5122096310624686e-06, 'samples': 27282240, 'steps': 142094, 'loss/train': 1.1043094396591187} 11/07/2021 17:15:00 - INFO - __main__ - Step 142096: {'lr': 3.511323280652817e-06, 'samples': 27282432, 'steps': 142095, 'loss/train': 1.403062105178833} 11/07/2021 17:15:00 - INFO - __main__ - Step 142097: {'lr': 3.5104370413071853e-06, 'samples': 27282624, 'steps': 142096, 'loss/train': 1.2197281122207642} 11/07/2021 17:15:01 - INFO - __main__ - Step 142098: {'lr': 3.5095509130259327e-06, 'samples': 27282816, 'steps': 142097, 'loss/train': 1.0334078073501587} 11/07/2021 17:15:01 - INFO - __main__ - Step 142099: {'lr': 3.508664895809477e-06, 'samples': 27283008, 'steps': 142098, 'loss/train': 1.1227047443389893} 11/07/2021 17:15:02 - INFO - __main__ - Step 142100: {'lr': 3.5077789896582336e-06, 'samples': 27283200, 'steps': 142099, 'loss/train': 0.49946585297584534} 11/07/2021 17:15:02 - INFO - __main__ - Step 142101: {'lr': 3.5068931945725637e-06, 'samples': 27283392, 'steps': 142100, 'loss/train': 1.73313307762146} 11/07/2021 17:15:03 - INFO - __main__ - Step 142102: {'lr': 3.5060075105528833e-06, 'samples': 27283584, 'steps': 142101, 'loss/train': 1.4743255376815796} 11/07/2021 17:15:03 - INFO - __main__ - Step 142103: {'lr': 3.505121937599581e-06, 'samples': 27283776, 'steps': 142102, 'loss/train': 1.4429831504821777} 11/07/2021 17:15:03 - INFO - __main__ - Step 142104: {'lr': 3.504236475713074e-06, 'samples': 27283968, 'steps': 142103, 'loss/train': 1.254664659500122} 11/07/2021 17:15:05 - INFO - __main__ - Step 142105: {'lr': 3.5033511248937777e-06, 'samples': 27284160, 'steps': 142104, 'loss/train': 1.0699735879898071} 11/07/2021 17:15:05 - INFO - __main__ - Step 142106: {'lr': 3.502465885142053e-06, 'samples': 27284352, 'steps': 142105, 'loss/train': 1.4503601789474487} 11/07/2021 17:15:05 - INFO - __main__ - Step 142107: {'lr': 3.501580756458317e-06, 'samples': 27284544, 'steps': 142106, 'loss/train': 1.1796694993972778} 11/07/2021 17:15:06 - INFO - __main__ - Step 142108: {'lr': 3.5006957388429572e-06, 'samples': 27284736, 'steps': 142107, 'loss/train': 1.4046359062194824} 11/07/2021 17:15:06 - INFO - __main__ - Step 142109: {'lr': 3.4998108322963627e-06, 'samples': 27284928, 'steps': 142108, 'loss/train': 1.1061742305755615} 11/07/2021 17:15:06 - INFO - __main__ - Step 142110: {'lr': 3.498926036818978e-06, 'samples': 27285120, 'steps': 142109, 'loss/train': 1.4366575479507446} 11/07/2021 17:15:07 - INFO - __main__ - Step 142111: {'lr': 3.4980413524111632e-06, 'samples': 27285312, 'steps': 142110, 'loss/train': 1.6157323122024536} 11/07/2021 17:15:08 - INFO - __main__ - Step 142112: {'lr': 3.497156779073335e-06, 'samples': 27285504, 'steps': 142111, 'loss/train': 1.0239019393920898} 11/07/2021 17:15:08 - INFO - __main__ - Step 142113: {'lr': 3.496272316805882e-06, 'samples': 27285696, 'steps': 142112, 'loss/train': 1.0774630308151245} 11/07/2021 17:15:08 - INFO - __main__ - Step 142114: {'lr': 3.4953879656091923e-06, 'samples': 27285888, 'steps': 142113, 'loss/train': 1.372241497039795} 11/07/2021 17:15:09 - INFO - __main__ - Step 142115: {'lr': 3.494503725483683e-06, 'samples': 27286080, 'steps': 142114, 'loss/train': 1.1138789653778076} 11/07/2021 17:15:10 - INFO - __main__ - Step 142116: {'lr': 3.4936195964297424e-06, 'samples': 27286272, 'steps': 142115, 'loss/train': 0.8976489901542664} 11/07/2021 17:15:10 - INFO - __main__ - Step 142117: {'lr': 3.4927355784477866e-06, 'samples': 27286464, 'steps': 142116, 'loss/train': 1.5578229427337646} 11/07/2021 17:15:10 - INFO - __main__ - Step 142118: {'lr': 3.4918516715382044e-06, 'samples': 27286656, 'steps': 142117, 'loss/train': 1.4136250019073486} 11/07/2021 17:15:11 - INFO - __main__ - Step 142119: {'lr': 3.490967875701384e-06, 'samples': 27286848, 'steps': 142118, 'loss/train': 1.0834412574768066} 11/07/2021 17:15:11 - INFO - __main__ - Step 142120: {'lr': 3.4900841909377145e-06, 'samples': 27287040, 'steps': 142119, 'loss/train': 1.2294321060180664} 11/07/2021 17:15:12 - INFO - __main__ - Step 142121: {'lr': 3.489200617247612e-06, 'samples': 27287232, 'steps': 142120, 'loss/train': 0.6298584938049316} 11/07/2021 17:15:13 - INFO - __main__ - Step 142122: {'lr': 3.488317154631493e-06, 'samples': 27287424, 'steps': 142121, 'loss/train': 0.03983626514673233} 11/07/2021 17:15:13 - INFO - __main__ - Step 142123: {'lr': 3.487433803089718e-06, 'samples': 27287616, 'steps': 142122, 'loss/train': 1.36705482006073} 11/07/2021 17:15:13 - INFO - __main__ - Step 142124: {'lr': 3.4865505626227033e-06, 'samples': 27287808, 'steps': 142123, 'loss/train': 1.1076711416244507} 11/07/2021 17:15:14 - INFO - __main__ - Step 142125: {'lr': 3.485667433230838e-06, 'samples': 27288000, 'steps': 142124, 'loss/train': 1.435789942741394} 11/07/2021 17:15:15 - INFO - __main__ - Step 142126: {'lr': 3.484784414914538e-06, 'samples': 27288192, 'steps': 142125, 'loss/train': 1.1607879400253296} 11/07/2021 17:15:15 - INFO - __main__ - Step 142127: {'lr': 3.4839015076741644e-06, 'samples': 27288384, 'steps': 142126, 'loss/train': 1.5727320909500122} 11/07/2021 17:15:16 - INFO - __main__ - Step 142128: {'lr': 3.483018711510161e-06, 'samples': 27288576, 'steps': 142127, 'loss/train': 1.2349772453308105} 11/07/2021 17:15:16 - INFO - __main__ - Step 142129: {'lr': 3.4821360264229165e-06, 'samples': 27288768, 'steps': 142128, 'loss/train': 1.2793432474136353} 11/07/2021 17:15:16 - INFO - __main__ - Step 142130: {'lr': 3.481253452412819e-06, 'samples': 27288960, 'steps': 142129, 'loss/train': 0.9651134014129639} 11/07/2021 17:15:17 - INFO - __main__ - Step 142131: {'lr': 3.4803709894802582e-06, 'samples': 27289152, 'steps': 142130, 'loss/train': 1.321060061454773} 11/07/2021 17:15:18 - INFO - __main__ - Step 142132: {'lr': 3.479488637625622e-06, 'samples': 27289344, 'steps': 142131, 'loss/train': 1.6994717121124268} 11/07/2021 17:15:18 - INFO - __main__ - Step 142133: {'lr': 3.478606396849354e-06, 'samples': 27289536, 'steps': 142132, 'loss/train': 1.039830207824707} 11/07/2021 17:15:18 - INFO - __main__ - Step 142134: {'lr': 3.4777242671517885e-06, 'samples': 27289728, 'steps': 142133, 'loss/train': 1.3558086156845093} 11/07/2021 17:15:19 - INFO - __main__ - Step 142135: {'lr': 3.4768422485333684e-06, 'samples': 27289920, 'steps': 142134, 'loss/train': 1.1990559101104736} 11/07/2021 17:15:20 - INFO - __main__ - Step 142136: {'lr': 3.4759603409944827e-06, 'samples': 27290112, 'steps': 142135, 'loss/train': 1.462756633758545} 11/07/2021 17:15:20 - INFO - __main__ - Step 142137: {'lr': 3.4750785445355206e-06, 'samples': 27290304, 'steps': 142136, 'loss/train': 1.507062315940857} 11/07/2021 17:15:21 - INFO - __main__ - Step 142138: {'lr': 3.4741968591568975e-06, 'samples': 27290496, 'steps': 142137, 'loss/train': 0.9709648489952087} 11/07/2021 17:15:21 - INFO - __main__ - Step 142139: {'lr': 3.4733152848589742e-06, 'samples': 27290688, 'steps': 142138, 'loss/train': 1.084580659866333} 11/07/2021 17:15:21 - INFO - __main__ - Step 142140: {'lr': 3.472433821642196e-06, 'samples': 27290880, 'steps': 142139, 'loss/train': 1.2670880556106567} 11/07/2021 17:15:22 - INFO - __main__ - Step 142141: {'lr': 3.4715524695069223e-06, 'samples': 27291072, 'steps': 142140, 'loss/train': 1.3683457374572754} 11/07/2021 17:15:23 - INFO - __main__ - Step 142142: {'lr': 3.4706712284535424e-06, 'samples': 27291264, 'steps': 142141, 'loss/train': 1.2998921871185303} 11/07/2021 17:15:23 - INFO - __main__ - Step 142143: {'lr': 3.469790098482528e-06, 'samples': 27291456, 'steps': 142142, 'loss/train': 0.9246242642402649} 11/07/2021 17:15:23 - INFO - __main__ - Step 142144: {'lr': 3.4689090795941847e-06, 'samples': 27291648, 'steps': 142143, 'loss/train': 1.0845856666564941} 11/07/2021 17:15:24 - INFO - __main__ - Step 142145: {'lr': 3.468028171788956e-06, 'samples': 27291840, 'steps': 142144, 'loss/train': 1.2091621160507202} 11/07/2021 17:15:24 - INFO - __main__ - Step 142146: {'lr': 3.467147375067231e-06, 'samples': 27292032, 'steps': 142145, 'loss/train': 1.3780114650726318} 11/07/2021 17:15:25 - INFO - __main__ - Step 142147: {'lr': 3.466266689429398e-06, 'samples': 27292224, 'steps': 142146, 'loss/train': 1.3538398742675781} 11/07/2021 17:15:26 - INFO - __main__ - Step 142148: {'lr': 3.4653861148758457e-06, 'samples': 27292416, 'steps': 142147, 'loss/train': 0.7669196128845215} 11/07/2021 17:15:26 - INFO - __main__ - Step 142149: {'lr': 3.4645056514070183e-06, 'samples': 27292608, 'steps': 142148, 'loss/train': 1.2557475566864014} 11/07/2021 17:15:26 - INFO - __main__ - Step 142150: {'lr': 3.4636252990232487e-06, 'samples': 27292800, 'steps': 142149, 'loss/train': 1.267081618309021} 11/07/2021 17:15:27 - INFO - __main__ - Step 142151: {'lr': 3.4627450577249808e-06, 'samples': 27292992, 'steps': 142150, 'loss/train': 1.762975811958313} 11/07/2021 17:15:28 - INFO - __main__ - Step 142152: {'lr': 3.4618649275126034e-06, 'samples': 27293184, 'steps': 142151, 'loss/train': 1.1769338846206665} 11/07/2021 17:15:28 - INFO - __main__ - Step 142153: {'lr': 3.4609849083864777e-06, 'samples': 27293376, 'steps': 142152, 'loss/train': 1.3518720865249634} 11/07/2021 17:15:28 - INFO - __main__ - Step 142154: {'lr': 3.460105000347047e-06, 'samples': 27293568, 'steps': 142153, 'loss/train': 1.079576015472412} 11/07/2021 17:15:29 - INFO - __main__ - Step 142155: {'lr': 3.4592252033947003e-06, 'samples': 27293760, 'steps': 142154, 'loss/train': 1.398964762687683} 11/07/2021 17:15:29 - INFO - __main__ - Step 142156: {'lr': 3.458345517529826e-06, 'samples': 27293952, 'steps': 142155, 'loss/train': 1.2183831930160522} 11/07/2021 17:15:30 - INFO - __main__ - Step 142157: {'lr': 3.457465942752813e-06, 'samples': 27294144, 'steps': 142156, 'loss/train': 1.2834733724594116} 11/07/2021 17:15:31 - INFO - __main__ - Step 142158: {'lr': 3.45658647906405e-06, 'samples': 27294336, 'steps': 142157, 'loss/train': 1.1831096410751343} 11/07/2021 17:15:31 - INFO - __main__ - Step 142159: {'lr': 3.455707126463925e-06, 'samples': 27294528, 'steps': 142158, 'loss/train': 0.48157843947410583} 11/07/2021 17:15:31 - INFO - __main__ - Step 142160: {'lr': 3.4548278849528823e-06, 'samples': 27294720, 'steps': 142159, 'loss/train': 1.0042423009872437} 11/07/2021 17:15:32 - INFO - __main__ - Step 142161: {'lr': 3.453948754531283e-06, 'samples': 27294912, 'steps': 142160, 'loss/train': 1.4148228168487549} 11/07/2021 17:15:33 - INFO - __main__ - Step 142162: {'lr': 3.453069735199543e-06, 'samples': 27295104, 'steps': 142161, 'loss/train': 1.548638939857483} 11/07/2021 17:15:33 - INFO - __main__ - Step 142163: {'lr': 3.4521908269580236e-06, 'samples': 27295296, 'steps': 142162, 'loss/train': 1.493224024772644} 11/07/2021 17:15:33 - INFO - __main__ - Step 142164: {'lr': 3.451312029807141e-06, 'samples': 27295488, 'steps': 142163, 'loss/train': 0.9495099186897278} 11/07/2021 17:15:34 - INFO - __main__ - Step 142165: {'lr': 3.4504333437473113e-06, 'samples': 27295680, 'steps': 142164, 'loss/train': 1.6898174285888672} 11/07/2021 17:15:34 - INFO - __main__ - Step 142166: {'lr': 3.4495547687788953e-06, 'samples': 27295872, 'steps': 142165, 'loss/train': 0.7986466884613037} 11/07/2021 17:15:35 - INFO - __main__ - Step 142167: {'lr': 3.4486763049023096e-06, 'samples': 27296064, 'steps': 142166, 'loss/train': 1.2919561862945557} 11/07/2021 17:15:35 - INFO - __main__ - Step 142168: {'lr': 3.4477979521179426e-06, 'samples': 27296256, 'steps': 142167, 'loss/train': 1.4838839769363403} 11/07/2021 17:15:36 - INFO - __main__ - Step 142169: {'lr': 3.4469197104262106e-06, 'samples': 27296448, 'steps': 142168, 'loss/train': 1.1905417442321777} 11/07/2021 17:15:36 - INFO - __main__ - Step 142170: {'lr': 3.4460415798275023e-06, 'samples': 27296640, 'steps': 142169, 'loss/train': 1.1824976205825806} 11/07/2021 17:15:36 - INFO - __main__ - Step 142171: {'lr': 3.4451635603221787e-06, 'samples': 27296832, 'steps': 142170, 'loss/train': 0.4598195254802704} 11/07/2021 17:15:37 - INFO - __main__ - Step 142172: {'lr': 3.4442856519106556e-06, 'samples': 27297024, 'steps': 142171, 'loss/train': 1.7813079357147217} 11/07/2021 17:15:38 - INFO - __main__ - Step 142173: {'lr': 3.4434078545933502e-06, 'samples': 27297216, 'steps': 142172, 'loss/train': 1.1937437057495117} 11/07/2021 17:15:38 - INFO - __main__ - Step 142174: {'lr': 3.4425301683706225e-06, 'samples': 27297408, 'steps': 142173, 'loss/train': 1.2351586818695068} 11/07/2021 17:15:39 - INFO - __main__ - Step 142175: {'lr': 3.4416525932428887e-06, 'samples': 27297600, 'steps': 142174, 'loss/train': 1.4242725372314453} 11/07/2021 17:15:39 - INFO - __main__ - Step 142176: {'lr': 3.440775129210538e-06, 'samples': 27297792, 'steps': 142175, 'loss/train': 1.1221270561218262} 11/07/2021 17:15:40 - INFO - __main__ - Step 142177: {'lr': 3.439897776273987e-06, 'samples': 27297984, 'steps': 142176, 'loss/train': 1.2443076372146606} 11/07/2021 17:15:40 - INFO - __main__ - Step 142178: {'lr': 3.4390205344335955e-06, 'samples': 27298176, 'steps': 142177, 'loss/train': 1.5732693672180176} 11/07/2021 17:15:41 - INFO - __main__ - Step 142179: {'lr': 3.4381434036897806e-06, 'samples': 27298368, 'steps': 142178, 'loss/train': 1.0914371013641357} 11/07/2021 17:15:41 - INFO - __main__ - Step 142180: {'lr': 3.4372663840429587e-06, 'samples': 27298560, 'steps': 142179, 'loss/train': 1.0081218481063843} 11/07/2021 17:15:41 - INFO - __main__ - Step 142181: {'lr': 3.436389475493462e-06, 'samples': 27298752, 'steps': 142180, 'loss/train': 1.0602221488952637} 11/07/2021 17:15:42 - INFO - __main__ - Step 142182: {'lr': 3.4355126780417634e-06, 'samples': 27298944, 'steps': 142181, 'loss/train': 1.3392224311828613} 11/07/2021 17:15:43 - INFO - __main__ - Step 142183: {'lr': 3.434635991688195e-06, 'samples': 27299136, 'steps': 142182, 'loss/train': 1.0529030561447144} 11/07/2021 17:15:43 - INFO - __main__ - Step 142184: {'lr': 3.433759416433174e-06, 'samples': 27299328, 'steps': 142183, 'loss/train': 1.443138599395752} 11/07/2021 17:15:44 - INFO - __main__ - Step 142185: {'lr': 3.4328829522771168e-06, 'samples': 27299520, 'steps': 142184, 'loss/train': 1.491267442703247} 11/07/2021 17:15:44 - INFO - __main__ - Step 142186: {'lr': 3.4320065992203554e-06, 'samples': 27299712, 'steps': 142185, 'loss/train': 1.3172117471694946} 11/07/2021 17:15:44 - INFO - __main__ - Step 142187: {'lr': 3.4311303572633624e-06, 'samples': 27299904, 'steps': 142186, 'loss/train': 1.3133835792541504} 11/07/2021 17:15:45 - INFO - __main__ - Step 142188: {'lr': 3.4302542264064985e-06, 'samples': 27300096, 'steps': 142187, 'loss/train': 1.343654990196228} 11/07/2021 17:15:46 - INFO - __main__ - Step 142189: {'lr': 3.4293782066501245e-06, 'samples': 27300288, 'steps': 142188, 'loss/train': 1.700828194618225} 11/07/2021 17:15:46 - INFO - __main__ - Step 142190: {'lr': 3.4285022979946844e-06, 'samples': 27300480, 'steps': 142189, 'loss/train': 1.4984757900238037} 11/07/2021 17:15:46 - INFO - __main__ - Step 142191: {'lr': 3.4276265004405673e-06, 'samples': 27300672, 'steps': 142190, 'loss/train': 1.0621552467346191} 11/07/2021 17:15:47 - INFO - __main__ - Step 142192: {'lr': 3.426750813988161e-06, 'samples': 27300864, 'steps': 142191, 'loss/train': 0.1290620118379593} 11/07/2021 17:15:48 - INFO - __main__ - Step 142193: {'lr': 3.4258752386378267e-06, 'samples': 27301056, 'steps': 142192, 'loss/train': 1.4037457704544067} 11/07/2021 17:15:48 - INFO - __main__ - Step 142194: {'lr': 3.4249997743900083e-06, 'samples': 27301248, 'steps': 142193, 'loss/train': 0.10115376114845276} 11/07/2021 17:15:49 - INFO - __main__ - Step 142195: {'lr': 3.424124421245095e-06, 'samples': 27301440, 'steps': 142194, 'loss/train': 0.9679355621337891} 11/07/2021 17:15:49 - INFO - __main__ - Step 142196: {'lr': 3.423249179203447e-06, 'samples': 27301632, 'steps': 142195, 'loss/train': 0.8834850788116455} 11/07/2021 17:15:49 - INFO - __main__ - Step 142197: {'lr': 3.422374048265481e-06, 'samples': 27301824, 'steps': 142196, 'loss/train': 0.8785211443901062} 11/07/2021 17:15:50 - INFO - __main__ - Step 142198: {'lr': 3.421499028431585e-06, 'samples': 27302016, 'steps': 142197, 'loss/train': 1.7299091815948486} 11/07/2021 17:15:51 - INFO - __main__ - Step 142199: {'lr': 3.4206241197021758e-06, 'samples': 27302208, 'steps': 142198, 'loss/train': 1.4820231199264526} 11/07/2021 17:15:51 - INFO - __main__ - Step 142200: {'lr': 3.4197493220775866e-06, 'samples': 27302400, 'steps': 142199, 'loss/train': 1.210845708847046} 11/07/2021 17:15:51 - INFO - __main__ - Step 142201: {'lr': 3.4188746355582887e-06, 'samples': 27302592, 'steps': 142200, 'loss/train': 1.1848098039627075} 11/07/2021 17:15:52 - INFO - __main__ - Step 142202: {'lr': 3.4180000601446435e-06, 'samples': 27302784, 'steps': 142201, 'loss/train': 1.5241788625717163} 11/07/2021 17:15:52 - INFO - __main__ - Step 142203: {'lr': 3.4171255958370118e-06, 'samples': 27302976, 'steps': 142202, 'loss/train': 0.9392438530921936} 11/07/2021 17:15:53 - INFO - __main__ - Step 142204: {'lr': 3.4162512426358373e-06, 'samples': 27303168, 'steps': 142203, 'loss/train': 1.5438916683197021} 11/07/2021 17:15:54 - INFO - __main__ - Step 142205: {'lr': 3.4153770005415085e-06, 'samples': 27303360, 'steps': 142204, 'loss/train': 1.1984400749206543} 11/07/2021 17:15:54 - INFO - __main__ - Step 142206: {'lr': 3.4145028695543867e-06, 'samples': 27303552, 'steps': 142205, 'loss/train': 1.3586044311523438} 11/07/2021 17:15:54 - INFO - __main__ - Step 142207: {'lr': 3.413628849674888e-06, 'samples': 27303744, 'steps': 142206, 'loss/train': 1.2961498498916626} 11/07/2021 17:15:55 - INFO - __main__ - Step 142208: {'lr': 3.412754940903401e-06, 'samples': 27303936, 'steps': 142207, 'loss/train': 1.4283833503723145} 11/07/2021 17:15:56 - INFO - __main__ - Step 142209: {'lr': 3.411881143240314e-06, 'samples': 27304128, 'steps': 142208, 'loss/train': 1.6009880304336548} 11/07/2021 17:15:56 - INFO - __main__ - Step 142210: {'lr': 3.4110074566860717e-06, 'samples': 27304320, 'steps': 142209, 'loss/train': 0.9709374308586121} 11/07/2021 17:15:56 - INFO - __main__ - Step 142211: {'lr': 3.410133881240979e-06, 'samples': 27304512, 'steps': 142210, 'loss/train': 1.35488760471344} 11/07/2021 17:15:57 - INFO - __main__ - Step 142212: {'lr': 3.4092604169054796e-06, 'samples': 27304704, 'steps': 142211, 'loss/train': 0.5781316161155701} 11/07/2021 17:15:57 - INFO - __main__ - Step 142213: {'lr': 3.408387063679991e-06, 'samples': 27304896, 'steps': 142212, 'loss/train': 1.2755959033966064} 11/07/2021 17:15:58 - INFO - __main__ - Step 142214: {'lr': 3.407513821564845e-06, 'samples': 27305088, 'steps': 142213, 'loss/train': 1.5675184726715088} 11/07/2021 17:15:59 - INFO - __main__ - Step 142215: {'lr': 3.4066406905604865e-06, 'samples': 27305280, 'steps': 142214, 'loss/train': 1.6685030460357666} 11/07/2021 17:15:59 - INFO - __main__ - Step 142216: {'lr': 3.405767670667276e-06, 'samples': 27305472, 'steps': 142215, 'loss/train': 0.8307016491889954} 11/07/2021 17:15:59 - INFO - __main__ - Step 142217: {'lr': 3.4048947618856294e-06, 'samples': 27305664, 'steps': 142216, 'loss/train': 1.4316691160202026} 11/07/2021 17:16:00 - INFO - __main__ - Step 142218: {'lr': 3.4040219642159366e-06, 'samples': 27305856, 'steps': 142217, 'loss/train': 1.1468226909637451} 11/07/2021 17:16:01 - INFO - __main__ - Step 142219: {'lr': 3.4031492776585846e-06, 'samples': 27306048, 'steps': 142218, 'loss/train': 1.441573143005371} 11/07/2021 17:16:01 - INFO - __main__ - Step 142220: {'lr': 3.4022767022139635e-06, 'samples': 27306240, 'steps': 142219, 'loss/train': 1.1367640495300293} 11/07/2021 17:16:01 - INFO - __main__ - Step 142221: {'lr': 3.4014042378824607e-06, 'samples': 27306432, 'steps': 142220, 'loss/train': 0.7095997929573059} 11/07/2021 17:16:02 - INFO - __main__ - Step 142222: {'lr': 3.4005318846644926e-06, 'samples': 27306624, 'steps': 142221, 'loss/train': 1.7495464086532593} 11/07/2021 17:16:02 - INFO - __main__ - Step 142223: {'lr': 3.399659642560449e-06, 'samples': 27306816, 'steps': 142222, 'loss/train': 1.723788857460022} 11/07/2021 17:16:03 - INFO - __main__ - Step 142224: {'lr': 3.398787511570717e-06, 'samples': 27307008, 'steps': 142223, 'loss/train': 1.2829675674438477} 11/07/2021 17:16:04 - INFO - __main__ - Step 142225: {'lr': 3.3979154916956856e-06, 'samples': 27307200, 'steps': 142224, 'loss/train': 1.1542023420333862} 11/07/2021 17:16:04 - INFO - __main__ - Step 142226: {'lr': 3.397043582935716e-06, 'samples': 27307392, 'steps': 142225, 'loss/train': 1.3392479419708252} 11/07/2021 17:16:04 - INFO - __main__ - Step 142227: {'lr': 3.396171785291252e-06, 'samples': 27307584, 'steps': 142226, 'loss/train': 1.285902976989746} 11/07/2021 17:16:05 - INFO - __main__ - Step 142228: {'lr': 3.3953000987626825e-06, 'samples': 27307776, 'steps': 142227, 'loss/train': 0.9747876524925232} 11/07/2021 17:16:06 - INFO - __main__ - Step 142229: {'lr': 3.394428523350368e-06, 'samples': 27307968, 'steps': 142228, 'loss/train': 0.9675542712211609} 11/07/2021 17:16:06 - INFO - __main__ - Step 142230: {'lr': 3.393557059054725e-06, 'samples': 27308160, 'steps': 142229, 'loss/train': 1.2755892276763916} 11/07/2021 17:16:06 - INFO - __main__ - Step 142231: {'lr': 3.392685705876142e-06, 'samples': 27308352, 'steps': 142230, 'loss/train': 0.4590553343296051} 11/07/2021 17:16:07 - INFO - __main__ - Step 142232: {'lr': 3.3918144638150074e-06, 'samples': 27308544, 'steps': 142231, 'loss/train': 0.9482552409172058} 11/07/2021 17:16:07 - INFO - __main__ - Step 142233: {'lr': 3.39094333287171e-06, 'samples': 27308736, 'steps': 142232, 'loss/train': 1.2987756729125977} 11/07/2021 17:16:07 - INFO - __main__ - Step 142234: {'lr': 3.3900723130466383e-06, 'samples': 27308928, 'steps': 142233, 'loss/train': 1.286138892173767} 11/07/2021 17:16:08 - INFO - __main__ - Step 142235: {'lr': 3.3892014043402088e-06, 'samples': 27309120, 'steps': 142234, 'loss/train': 1.508961796760559} 11/07/2021 17:16:09 - INFO - __main__ - Step 142236: {'lr': 3.3883306067528095e-06, 'samples': 27309312, 'steps': 142235, 'loss/train': 1.6366584300994873} 11/07/2021 17:16:09 - INFO - __main__ - Step 142237: {'lr': 3.38745992028483e-06, 'samples': 27309504, 'steps': 142236, 'loss/train': 1.1024245023727417} 11/07/2021 17:16:09 - INFO - __main__ - Step 142238: {'lr': 3.38658934493663e-06, 'samples': 27309696, 'steps': 142237, 'loss/train': 1.639046549797058} 11/07/2021 17:16:10 - INFO - __main__ - Step 142239: {'lr': 3.385718880708627e-06, 'samples': 27309888, 'steps': 142238, 'loss/train': 1.2339314222335815} 11/07/2021 17:16:11 - INFO - __main__ - Step 142240: {'lr': 3.3848485276012364e-06, 'samples': 27310080, 'steps': 142239, 'loss/train': 1.1957165002822876} 11/07/2021 17:16:11 - INFO - __main__ - Step 142241: {'lr': 3.3839782856147918e-06, 'samples': 27310272, 'steps': 142240, 'loss/train': 1.0376710891723633} 11/07/2021 17:16:12 - INFO - __main__ - Step 142242: {'lr': 3.383108154749737e-06, 'samples': 27310464, 'steps': 142241, 'loss/train': 1.3971501588821411} 11/07/2021 17:16:12 - INFO - __main__ - Step 142243: {'lr': 3.3822381350064603e-06, 'samples': 27310656, 'steps': 142242, 'loss/train': 1.4120947122573853} 11/07/2021 17:16:12 - INFO - __main__ - Step 142244: {'lr': 3.3813682263853505e-06, 'samples': 27310848, 'steps': 142243, 'loss/train': 1.437700629234314} 11/07/2021 17:16:13 - INFO - __main__ - Step 142245: {'lr': 3.3804984288867693e-06, 'samples': 27311040, 'steps': 142244, 'loss/train': 1.72135591506958} 11/07/2021 17:16:14 - INFO - __main__ - Step 142246: {'lr': 3.3796287425111315e-06, 'samples': 27311232, 'steps': 142245, 'loss/train': 1.6936745643615723} 11/07/2021 17:16:14 - INFO - __main__ - Step 142247: {'lr': 3.378759167258827e-06, 'samples': 27311424, 'steps': 142246, 'loss/train': 0.9008484482765198} 11/07/2021 17:16:14 - INFO - __main__ - Step 142248: {'lr': 3.3778897031302435e-06, 'samples': 27311616, 'steps': 142247, 'loss/train': 1.3145415782928467} 11/07/2021 17:16:15 - INFO - __main__ - Step 142249: {'lr': 3.377020350125798e-06, 'samples': 27311808, 'steps': 142248, 'loss/train': 0.8356263637542725} 11/07/2021 17:16:16 - INFO - __main__ - Step 142250: {'lr': 3.3761511082458505e-06, 'samples': 27312000, 'steps': 142249, 'loss/train': 1.326837420463562} 11/07/2021 17:16:16 - INFO - __main__ - Step 142251: {'lr': 3.375281977490818e-06, 'samples': 27312192, 'steps': 142250, 'loss/train': 1.1994742155075073} 11/07/2021 17:16:17 - INFO - __main__ - Step 142252: {'lr': 3.3744129578610616e-06, 'samples': 27312384, 'steps': 142251, 'loss/train': 1.2442294359207153} 11/07/2021 17:16:17 - INFO - __main__ - Step 142253: {'lr': 3.373544049356997e-06, 'samples': 27312576, 'steps': 142252, 'loss/train': 1.2161318063735962} 11/07/2021 17:16:17 - INFO - __main__ - Step 142254: {'lr': 3.372675251978985e-06, 'samples': 27312768, 'steps': 142253, 'loss/train': 1.4227179288864136} 11/07/2021 17:16:18 - INFO - __main__ - Step 142255: {'lr': 3.37180656572747e-06, 'samples': 27312960, 'steps': 142254, 'loss/train': 0.9993703365325928} 11/07/2021 17:16:19 - INFO - __main__ - Step 142256: {'lr': 3.370937990602785e-06, 'samples': 27313152, 'steps': 142255, 'loss/train': 1.3942207098007202} 11/07/2021 17:16:19 - INFO - __main__ - Step 142257: {'lr': 3.370069526605374e-06, 'samples': 27313344, 'steps': 142256, 'loss/train': 1.6532338857650757} 11/07/2021 17:16:19 - INFO - __main__ - Step 142258: {'lr': 3.36920117373557e-06, 'samples': 27313536, 'steps': 142257, 'loss/train': 1.3103506565093994} 11/07/2021 17:16:20 - INFO - __main__ - Step 142259: {'lr': 3.368332931993845e-06, 'samples': 27313728, 'steps': 142258, 'loss/train': 0.9448655247688293} 11/07/2021 17:16:21 - INFO - __main__ - Step 142260: {'lr': 3.367464801380504e-06, 'samples': 27313920, 'steps': 142259, 'loss/train': 0.8420737981796265} 11/07/2021 17:16:21 - INFO - __main__ - Step 142261: {'lr': 3.366596781895992e-06, 'samples': 27314112, 'steps': 142260, 'loss/train': 1.1233811378479004} 11/07/2021 17:16:21 - INFO - __main__ - Step 142262: {'lr': 3.3657288735406965e-06, 'samples': 27314304, 'steps': 142261, 'loss/train': 0.9495002627372742} 11/07/2021 17:16:22 - INFO - __main__ - Step 142263: {'lr': 3.364861076314979e-06, 'samples': 27314496, 'steps': 142262, 'loss/train': 0.9303818345069885} 11/07/2021 17:16:22 - INFO - __main__ - Step 142264: {'lr': 3.363993390219283e-06, 'samples': 27314688, 'steps': 142263, 'loss/train': 1.6089740991592407} 11/07/2021 17:16:22 - INFO - __main__ - Step 142265: {'lr': 3.3631258152539424e-06, 'samples': 27314880, 'steps': 142264, 'loss/train': 1.259738564491272} 11/07/2021 17:16:24 - INFO - __main__ - Step 142266: {'lr': 3.3622583514193726e-06, 'samples': 27315072, 'steps': 142265, 'loss/train': 0.8906605839729309} 11/07/2021 17:16:24 - INFO - __main__ - Step 142267: {'lr': 3.361390998715963e-06, 'samples': 27315264, 'steps': 142266, 'loss/train': 1.1800874471664429} 11/07/2021 17:16:24 - INFO - __main__ - Step 142268: {'lr': 3.360523757144102e-06, 'samples': 27315456, 'steps': 142267, 'loss/train': 1.0974711179733276} 11/07/2021 17:16:25 - INFO - __main__ - Step 142269: {'lr': 3.3596566267041772e-06, 'samples': 27315648, 'steps': 142268, 'loss/train': 1.2214434146881104} 11/07/2021 17:16:25 - INFO - __main__ - Step 142270: {'lr': 3.3587896073965783e-06, 'samples': 27315840, 'steps': 142269, 'loss/train': 1.061408519744873} 11/07/2021 17:16:26 - INFO - __main__ - Step 142271: {'lr': 3.3579226992217214e-06, 'samples': 27316032, 'steps': 142270, 'loss/train': 1.0809261798858643} 11/07/2021 17:16:26 - INFO - __main__ - Step 142272: {'lr': 3.3570559021799673e-06, 'samples': 27316224, 'steps': 142271, 'loss/train': 1.2847613096237183} 11/07/2021 17:16:27 - INFO - __main__ - Step 142273: {'lr': 3.3561892162717324e-06, 'samples': 27316416, 'steps': 142272, 'loss/train': 1.174536108970642} 11/07/2021 17:16:27 - INFO - __main__ - Step 142274: {'lr': 3.355322641497377e-06, 'samples': 27316608, 'steps': 142273, 'loss/train': 1.0634902715682983} 11/07/2021 17:16:27 - INFO - __main__ - Step 142275: {'lr': 3.354456177857318e-06, 'samples': 27316800, 'steps': 142274, 'loss/train': 0.9159936904907227} 11/07/2021 17:16:28 - INFO - __main__ - Step 142276: {'lr': 3.3535898253519437e-06, 'samples': 27316992, 'steps': 142275, 'loss/train': 1.3776271343231201} 11/07/2021 17:16:29 - INFO - __main__ - Step 142277: {'lr': 3.3527235839816428e-06, 'samples': 27317184, 'steps': 142276, 'loss/train': 0.07705087959766388} 11/07/2021 17:16:29 - INFO - __main__ - Step 142278: {'lr': 3.351857453746776e-06, 'samples': 27317376, 'steps': 142277, 'loss/train': 1.3407419919967651} 11/07/2021 17:16:29 - INFO - __main__ - Step 142279: {'lr': 3.35099143464776e-06, 'samples': 27317568, 'steps': 142278, 'loss/train': 1.1415417194366455} 11/07/2021 17:16:30 - INFO - __main__ - Step 142280: {'lr': 3.350125526684983e-06, 'samples': 27317760, 'steps': 142279, 'loss/train': 0.43367934226989746} 11/07/2021 17:16:31 - INFO - __main__ - Step 142281: {'lr': 3.3492597298588336e-06, 'samples': 27317952, 'steps': 142280, 'loss/train': 1.2067805528640747} 11/07/2021 17:16:31 - INFO - __main__ - Step 142282: {'lr': 3.3483940441697e-06, 'samples': 27318144, 'steps': 142281, 'loss/train': 1.3821496963500977} 11/07/2021 17:16:32 - INFO - __main__ - Step 142283: {'lr': 3.347528469617972e-06, 'samples': 27318336, 'steps': 142282, 'loss/train': 0.982093334197998} 11/07/2021 17:16:32 - INFO - __main__ - Step 142284: {'lr': 3.346663006204037e-06, 'samples': 27318528, 'steps': 142283, 'loss/train': 0.9529561400413513} 11/07/2021 17:16:32 - INFO - __main__ - Step 142285: {'lr': 3.3457976539282842e-06, 'samples': 27318720, 'steps': 142284, 'loss/train': 1.4606950283050537} 11/07/2021 17:16:33 - INFO - __main__ - Step 142286: {'lr': 3.3449324127911294e-06, 'samples': 27318912, 'steps': 142285, 'loss/train': 1.2842644453048706} 11/07/2021 17:16:34 - INFO - __main__ - Step 142287: {'lr': 3.3440672827929342e-06, 'samples': 27319104, 'steps': 142286, 'loss/train': 1.386486291885376} 11/07/2021 17:16:34 - INFO - __main__ - Step 142288: {'lr': 3.3432022639340866e-06, 'samples': 27319296, 'steps': 142287, 'loss/train': 1.50653076171875} 11/07/2021 17:16:34 - INFO - __main__ - Step 142289: {'lr': 3.342337356215003e-06, 'samples': 27319488, 'steps': 142288, 'loss/train': 1.4631564617156982} 11/07/2021 17:16:35 - INFO - __main__ - Step 142290: {'lr': 3.341472559636044e-06, 'samples': 27319680, 'steps': 142289, 'loss/train': 1.072510004043579} 11/07/2021 17:16:35 - INFO - __main__ - Step 142291: {'lr': 3.3406078741976266e-06, 'samples': 27319872, 'steps': 142290, 'loss/train': 1.233959674835205} 11/07/2021 17:16:36 - INFO - __main__ - Step 142292: {'lr': 3.339743299900111e-06, 'samples': 27320064, 'steps': 142291, 'loss/train': 0.9627750515937805} 11/07/2021 17:16:37 - INFO - __main__ - Step 142293: {'lr': 3.3388788367439137e-06, 'samples': 27320256, 'steps': 142292, 'loss/train': 1.0064640045166016} 11/07/2021 17:16:37 - INFO - __main__ - Step 142294: {'lr': 3.338014484729396e-06, 'samples': 27320448, 'steps': 142293, 'loss/train': 0.9990656971931458} 11/07/2021 17:16:37 - INFO - __main__ - Step 142295: {'lr': 3.3371502438569733e-06, 'samples': 27320640, 'steps': 142294, 'loss/train': 1.165158748626709} 11/07/2021 17:16:38 - INFO - __main__ - Step 142296: {'lr': 3.3362861141270075e-06, 'samples': 27320832, 'steps': 142295, 'loss/train': 1.0139660835266113} 11/07/2021 17:16:39 - INFO - __main__ - Step 142297: {'lr': 3.335422095539914e-06, 'samples': 27321024, 'steps': 142296, 'loss/train': 1.2315502166748047} 11/07/2021 17:16:39 - INFO - __main__ - Step 142298: {'lr': 3.3345581880960817e-06, 'samples': 27321216, 'steps': 142297, 'loss/train': 1.2867149114608765} 11/07/2021 17:16:39 - INFO - __main__ - Step 142299: {'lr': 3.3336943917958718e-06, 'samples': 27321408, 'steps': 142298, 'loss/train': 1.2615681886672974} 11/07/2021 17:16:40 - INFO - __main__ - Step 142300: {'lr': 3.332830706639728e-06, 'samples': 27321600, 'steps': 142299, 'loss/train': 1.2554991245269775} 11/07/2021 17:16:40 - INFO - __main__ - Step 142301: {'lr': 3.3319671326279833e-06, 'samples': 27321792, 'steps': 142300, 'loss/train': 1.4634500741958618} 11/07/2021 17:16:41 - INFO - __main__ - Step 142302: {'lr': 3.3311036697610263e-06, 'samples': 27321984, 'steps': 142301, 'loss/train': 1.745918869972229} 11/07/2021 17:16:42 - INFO - __main__ - Step 142303: {'lr': 3.3302403180393013e-06, 'samples': 27322176, 'steps': 142302, 'loss/train': 1.4227778911590576} 11/07/2021 17:16:42 - INFO - __main__ - Step 142304: {'lr': 3.3293770774631695e-06, 'samples': 27322368, 'steps': 142303, 'loss/train': 1.956539511680603} 11/07/2021 17:16:42 - INFO - __main__ - Step 142305: {'lr': 3.328513948032991e-06, 'samples': 27322560, 'steps': 142304, 'loss/train': 1.3523354530334473} 11/07/2021 17:16:43 - INFO - __main__ - Step 142306: {'lr': 3.327650929749182e-06, 'samples': 27322752, 'steps': 142305, 'loss/train': 0.9977192878723145} 11/07/2021 17:16:44 - INFO - __main__ - Step 142307: {'lr': 3.3267880226121317e-06, 'samples': 27322944, 'steps': 142306, 'loss/train': 1.1694025993347168} 11/07/2021 17:16:44 - INFO - __main__ - Step 142308: {'lr': 3.3259252266222008e-06, 'samples': 27323136, 'steps': 142307, 'loss/train': 0.3538591265678406} 11/07/2021 17:16:44 - INFO - __main__ - Step 142309: {'lr': 3.325062541779833e-06, 'samples': 27323328, 'steps': 142308, 'loss/train': 1.1875255107879639} 11/07/2021 17:16:45 - INFO - __main__ - Step 142310: {'lr': 3.324199968085362e-06, 'samples': 27323520, 'steps': 142309, 'loss/train': 1.1621774435043335} 11/07/2021 17:16:45 - INFO - __main__ - Step 142311: {'lr': 3.3233375055392313e-06, 'samples': 27323712, 'steps': 142310, 'loss/train': 1.1741605997085571} 11/07/2021 17:16:46 - INFO - __main__ - Step 142312: {'lr': 3.322475154141774e-06, 'samples': 27323904, 'steps': 142311, 'loss/train': 1.0077779293060303} 11/07/2021 17:16:47 - INFO - __main__ - Step 142313: {'lr': 3.321612913893407e-06, 'samples': 27324096, 'steps': 142312, 'loss/train': 1.3227694034576416} 11/07/2021 17:16:47 - INFO - __main__ - Step 142314: {'lr': 3.3207507847945184e-06, 'samples': 27324288, 'steps': 142313, 'loss/train': 0.8182446360588074} 11/07/2021 17:16:47 - INFO - __main__ - Step 142315: {'lr': 3.319888766845469e-06, 'samples': 27324480, 'steps': 142314, 'loss/train': 1.395835041999817} 11/07/2021 17:16:48 - INFO - __main__ - Step 142316: {'lr': 3.319026860046703e-06, 'samples': 27324672, 'steps': 142315, 'loss/train': 1.5365829467773438} 11/07/2021 17:16:48 - INFO - __main__ - Step 142317: {'lr': 3.3181650643985537e-06, 'samples': 27324864, 'steps': 142316, 'loss/train': 1.3092409372329712} 11/07/2021 17:16:49 - INFO - __main__ - Step 142318: {'lr': 3.317303379901465e-06, 'samples': 27325056, 'steps': 142317, 'loss/train': 1.3501927852630615} 11/07/2021 17:16:50 - INFO - __main__ - Step 142319: {'lr': 3.31644180655577e-06, 'samples': 27325248, 'steps': 142318, 'loss/train': 1.319704532623291} 11/07/2021 17:16:50 - INFO - __main__ - Step 142320: {'lr': 3.315580344361885e-06, 'samples': 27325440, 'steps': 142319, 'loss/train': 0.9988812804222107} 11/07/2021 17:16:50 - INFO - __main__ - Step 142321: {'lr': 3.3147189933201983e-06, 'samples': 27325632, 'steps': 142320, 'loss/train': 1.222884178161621} 11/07/2021 17:16:51 - INFO - __main__ - Step 142322: {'lr': 3.313857753431071e-06, 'samples': 27325824, 'steps': 142321, 'loss/train': 1.3828495740890503} 11/07/2021 17:16:52 - INFO - __main__ - Step 142323: {'lr': 3.3129966246949193e-06, 'samples': 27326016, 'steps': 142322, 'loss/train': 1.2959647178649902} 11/07/2021 17:16:52 - INFO - __main__ - Step 142324: {'lr': 3.312135607112132e-06, 'samples': 27326208, 'steps': 142323, 'loss/train': 1.1117775440216064} 11/07/2021 17:16:53 - INFO - __main__ - Step 142325: {'lr': 3.311274700683098e-06, 'samples': 27326400, 'steps': 142324, 'loss/train': 1.1528584957122803} 11/07/2021 17:16:53 - INFO - __main__ - Step 142326: {'lr': 3.310413905408177e-06, 'samples': 27326592, 'steps': 142325, 'loss/train': 1.0901200771331787} 11/07/2021 17:16:53 - INFO - __main__ - Step 142327: {'lr': 3.3095532212877867e-06, 'samples': 27326784, 'steps': 142326, 'loss/train': 1.3063195943832397} 11/07/2021 17:16:55 - INFO - __main__ - Step 142328: {'lr': 3.3086926483223144e-06, 'samples': 27326976, 'steps': 142327, 'loss/train': 1.8494473695755005} 11/07/2021 17:16:55 - INFO - __main__ - Step 142329: {'lr': 3.307832186512122e-06, 'samples': 27327168, 'steps': 142328, 'loss/train': 1.1360887289047241} 11/07/2021 17:16:56 - INFO - __main__ - Step 142330: {'lr': 3.306971835857625e-06, 'samples': 27327360, 'steps': 142329, 'loss/train': 1.3262205123901367} 11/07/2021 17:16:56 - INFO - __main__ - Step 142331: {'lr': 3.3061115963592124e-06, 'samples': 27327552, 'steps': 142330, 'loss/train': 1.0024341344833374} 11/07/2021 17:16:56 - INFO - __main__ - Step 142332: {'lr': 3.3052514680172452e-06, 'samples': 27327744, 'steps': 142331, 'loss/train': 1.0026309490203857} 11/07/2021 17:16:57 - INFO - __main__ - Step 142333: {'lr': 3.3043914508321393e-06, 'samples': 27327936, 'steps': 142332, 'loss/train': 1.6092455387115479} 11/07/2021 17:16:57 - INFO - __main__ - Step 142334: {'lr': 3.303531544804256e-06, 'samples': 27328128, 'steps': 142333, 'loss/train': 1.6379519701004028} 11/07/2021 17:16:58 - INFO - __main__ - Step 142335: {'lr': 3.302671749933983e-06, 'samples': 27328320, 'steps': 142334, 'loss/train': 0.4936974048614502} 11/07/2021 17:16:58 - INFO - __main__ - Step 142336: {'lr': 3.301812066221738e-06, 'samples': 27328512, 'steps': 142335, 'loss/train': 1.3700919151306152} 11/07/2021 17:16:59 - INFO - __main__ - Step 142337: {'lr': 3.3009524936678526e-06, 'samples': 27328704, 'steps': 142336, 'loss/train': 0.9301857352256775} 11/07/2021 17:16:59 - INFO - __main__ - Step 142338: {'lr': 3.3000930322727998e-06, 'samples': 27328896, 'steps': 142337, 'loss/train': 1.5356707572937012} 11/07/2021 17:17:00 - INFO - __main__ - Step 142339: {'lr': 3.299233682036884e-06, 'samples': 27329088, 'steps': 142338, 'loss/train': 1.4071487188339233} 11/07/2021 17:17:01 - INFO - __main__ - Step 142340: {'lr': 3.29837444296055e-06, 'samples': 27329280, 'steps': 142339, 'loss/train': 0.7641662955284119} 11/07/2021 17:17:01 - INFO - __main__ - Step 142341: {'lr': 3.297515315044131e-06, 'samples': 27329472, 'steps': 142340, 'loss/train': 0.9266943335533142} 11/07/2021 17:17:01 - INFO - __main__ - Step 142342: {'lr': 3.2966562982880977e-06, 'samples': 27329664, 'steps': 142341, 'loss/train': 1.5280603170394897} 11/07/2021 17:17:02 - INFO - __main__ - Step 142343: {'lr': 3.2957973926927287e-06, 'samples': 27329856, 'steps': 142342, 'loss/train': 1.0783246755599976} 11/07/2021 17:17:02 - INFO - __main__ - Step 142344: {'lr': 3.2949385982584954e-06, 'samples': 27330048, 'steps': 142343, 'loss/train': 0.9807920455932617} 11/07/2021 17:17:03 - INFO - __main__ - Step 142345: {'lr': 3.2940799149857593e-06, 'samples': 27330240, 'steps': 142344, 'loss/train': 0.5123445987701416} 11/07/2021 17:17:03 - INFO - __main__ - Step 142346: {'lr': 3.29322134287488e-06, 'samples': 27330432, 'steps': 142345, 'loss/train': 1.3627313375473022} 11/07/2021 17:17:04 - INFO - __main__ - Step 142347: {'lr': 3.292362881926303e-06, 'samples': 27330624, 'steps': 142346, 'loss/train': 1.2995859384536743} 11/07/2021 17:17:04 - INFO - __main__ - Step 142348: {'lr': 3.2915045321403327e-06, 'samples': 27330816, 'steps': 142347, 'loss/train': 0.6919886469841003} 11/07/2021 17:17:04 - INFO - __main__ - Step 142349: {'lr': 3.290646293517441e-06, 'samples': 27331008, 'steps': 142348, 'loss/train': 0.9166935682296753} 11/07/2021 17:17:06 - INFO - __main__ - Step 142350: {'lr': 3.289788166057961e-06, 'samples': 27331200, 'steps': 142349, 'loss/train': 1.5081006288528442} 11/07/2021 17:17:06 - INFO - __main__ - Step 142351: {'lr': 3.2889301497623093e-06, 'samples': 27331392, 'steps': 142350, 'loss/train': 1.7253361940383911} 11/07/2021 17:17:06 - INFO - __main__ - Step 142352: {'lr': 3.2880722446308464e-06, 'samples': 27331584, 'steps': 142351, 'loss/train': 1.0699281692504883} 11/07/2021 17:17:07 - INFO - __main__ - Step 142353: {'lr': 3.287214450663989e-06, 'samples': 27331776, 'steps': 142352, 'loss/train': 1.0292881727218628} 11/07/2021 17:17:07 - INFO - __main__ - Step 142354: {'lr': 3.2863567678620974e-06, 'samples': 27331968, 'steps': 142353, 'loss/train': 0.5154806971549988} 11/07/2021 17:17:08 - INFO - __main__ - Step 142355: {'lr': 3.285499196225561e-06, 'samples': 27332160, 'steps': 142354, 'loss/train': 1.399717092514038} 11/07/2021 17:17:08 - INFO - __main__ - Step 142356: {'lr': 3.2846417357547675e-06, 'samples': 27332352, 'steps': 142355, 'loss/train': 1.1414990425109863} 11/07/2021 17:17:09 - INFO - __main__ - Step 142357: {'lr': 3.2837843864501062e-06, 'samples': 27332544, 'steps': 142356, 'loss/train': 0.9948118925094604} 11/07/2021 17:17:09 - INFO - __main__ - Step 142358: {'lr': 3.282927148311965e-06, 'samples': 27332736, 'steps': 142357, 'loss/train': 1.2205554246902466} 11/07/2021 17:17:09 - INFO - __main__ - Step 142359: {'lr': 3.282070021340733e-06, 'samples': 27332928, 'steps': 142358, 'loss/train': 1.2036041021347046} 11/07/2021 17:17:10 - INFO - __main__ - Step 142360: {'lr': 3.2812130055367983e-06, 'samples': 27333120, 'steps': 142359, 'loss/train': 1.1724449396133423} 11/07/2021 17:17:11 - INFO - __main__ - Step 142361: {'lr': 3.28035610090055e-06, 'samples': 27333312, 'steps': 142360, 'loss/train': 1.537607192993164} 11/07/2021 17:17:11 - INFO - __main__ - Step 142362: {'lr': 3.2794993074323487e-06, 'samples': 27333504, 'steps': 142361, 'loss/train': 1.1338279247283936} 11/07/2021 17:17:11 - INFO - __main__ - Step 142363: {'lr': 3.278642625132583e-06, 'samples': 27333696, 'steps': 142362, 'loss/train': 1.4713197946548462} 11/07/2021 17:17:12 - INFO - __main__ - Step 142364: {'lr': 3.277786054001697e-06, 'samples': 27333888, 'steps': 142363, 'loss/train': 1.4999390840530396} 11/07/2021 17:17:13 - INFO - __main__ - Step 142365: {'lr': 3.2769295940400235e-06, 'samples': 27334080, 'steps': 142364, 'loss/train': 1.1889407634735107} 11/07/2021 17:17:13 - INFO - __main__ - Step 142366: {'lr': 3.2760732452479512e-06, 'samples': 27334272, 'steps': 142365, 'loss/train': 1.2565701007843018} 11/07/2021 17:17:14 - INFO - __main__ - Step 142367: {'lr': 3.2752170076258413e-06, 'samples': 27334464, 'steps': 142366, 'loss/train': 1.8143794536590576} 11/07/2021 17:17:14 - INFO - __main__ - Step 142368: {'lr': 3.2743608811741375e-06, 'samples': 27334656, 'steps': 142367, 'loss/train': 1.1317704916000366} 11/07/2021 17:17:14 - INFO - __main__ - Step 142369: {'lr': 3.273504865893201e-06, 'samples': 27334848, 'steps': 142368, 'loss/train': 1.07118558883667} 11/07/2021 17:17:15 - INFO - __main__ - Step 142370: {'lr': 3.2726489617834198e-06, 'samples': 27335040, 'steps': 142369, 'loss/train': 1.4778803586959839} 11/07/2021 17:17:16 - INFO - __main__ - Step 142371: {'lr': 3.271793168845183e-06, 'samples': 27335232, 'steps': 142370, 'loss/train': 1.35250723361969} 11/07/2021 17:17:16 - INFO - __main__ - Step 142372: {'lr': 3.270937487078851e-06, 'samples': 27335424, 'steps': 142371, 'loss/train': 1.3293884992599487} 11/07/2021 17:17:16 - INFO - __main__ - Step 142373: {'lr': 3.2700819164848407e-06, 'samples': 27335616, 'steps': 142372, 'loss/train': 0.9656414985656738} 11/07/2021 17:17:17 - INFO - __main__ - Step 142374: {'lr': 3.2692264570635123e-06, 'samples': 27335808, 'steps': 142373, 'loss/train': 1.3847072124481201} 11/07/2021 17:17:17 - INFO - __main__ - Step 142375: {'lr': 3.2683711088152825e-06, 'samples': 27336000, 'steps': 142374, 'loss/train': 1.2354097366333008} 11/07/2021 17:17:18 - INFO - __main__ - Step 142376: {'lr': 3.267515871740484e-06, 'samples': 27336192, 'steps': 142375, 'loss/train': 1.7422024011611938} 11/07/2021 17:17:19 - INFO - __main__ - Step 142377: {'lr': 3.2666607458395615e-06, 'samples': 27336384, 'steps': 142376, 'loss/train': 1.5098503828048706} 11/07/2021 17:17:19 - INFO - __main__ - Step 142378: {'lr': 3.2658057311128755e-06, 'samples': 27336576, 'steps': 142377, 'loss/train': 0.9723089337348938} 11/07/2021 17:17:19 - INFO - __main__ - Step 142379: {'lr': 3.2649508275607863e-06, 'samples': 27336768, 'steps': 142378, 'loss/train': 1.3069185018539429} 11/07/2021 17:17:20 - INFO - __main__ - Step 142380: {'lr': 3.264096035183711e-06, 'samples': 27336960, 'steps': 142379, 'loss/train': 1.2305200099945068} 11/07/2021 17:17:20 - INFO - __main__ - Step 142381: {'lr': 3.263241353982038e-06, 'samples': 27337152, 'steps': 142380, 'loss/train': 1.737297534942627} 11/07/2021 17:17:22 - INFO - __main__ - Step 142382: {'lr': 3.262386783956128e-06, 'samples': 27337344, 'steps': 142381, 'loss/train': 1.161015272140503} 11/07/2021 17:17:22 - INFO - __main__ - Step 142383: {'lr': 3.261532325106398e-06, 'samples': 27337536, 'steps': 142382, 'loss/train': 0.9551671147346497} 11/07/2021 17:17:22 - INFO - __main__ - Step 142384: {'lr': 3.2606779774332074e-06, 'samples': 27337728, 'steps': 142383, 'loss/train': 1.1695767641067505} 11/07/2021 17:17:23 - INFO - __main__ - Step 142385: {'lr': 3.2598237409369458e-06, 'samples': 27337920, 'steps': 142384, 'loss/train': 1.4188250303268433} 11/07/2021 17:17:23 - INFO - __main__ - Step 142386: {'lr': 3.2589696156180016e-06, 'samples': 27338112, 'steps': 142385, 'loss/train': 1.5538002252578735} 11/07/2021 17:17:23 - INFO - __main__ - Step 142387: {'lr': 3.258115601476763e-06, 'samples': 27338304, 'steps': 142386, 'loss/train': 1.3409656286239624} 11/07/2021 17:17:24 - INFO - __main__ - Step 142388: {'lr': 3.2572616985135915e-06, 'samples': 27338496, 'steps': 142387, 'loss/train': 1.3962128162384033} 11/07/2021 17:17:25 - INFO - __main__ - Step 142389: {'lr': 3.256407906728903e-06, 'samples': 27338688, 'steps': 142388, 'loss/train': 1.3264943361282349} 11/07/2021 17:17:26 - INFO - __main__ - Step 142390: {'lr': 3.2555542261230586e-06, 'samples': 27338880, 'steps': 142389, 'loss/train': 1.5528860092163086} 11/07/2021 17:17:26 - INFO - __main__ - Step 142391: {'lr': 3.2547006566964743e-06, 'samples': 27339072, 'steps': 142390, 'loss/train': 1.06322181224823} 11/07/2021 17:17:26 - INFO - __main__ - Step 142392: {'lr': 3.253847198449511e-06, 'samples': 27339264, 'steps': 142391, 'loss/train': 0.15509934723377228} 11/07/2021 17:17:27 - INFO - __main__ - Step 142393: {'lr': 3.2529938513825297e-06, 'samples': 27339456, 'steps': 142392, 'loss/train': 1.5305795669555664} 11/07/2021 17:17:28 - INFO - __main__ - Step 142394: {'lr': 3.2521406154959744e-06, 'samples': 27339648, 'steps': 142393, 'loss/train': 1.6648693084716797} 11/07/2021 17:17:28 - INFO - __main__ - Step 142395: {'lr': 3.251287490790178e-06, 'samples': 27339840, 'steps': 142394, 'loss/train': 1.2516003847122192} 11/07/2021 17:17:28 - INFO - __main__ - Step 142396: {'lr': 3.250434477265557e-06, 'samples': 27340032, 'steps': 142395, 'loss/train': 1.2607885599136353} 11/07/2021 17:17:29 - INFO - __main__ - Step 142397: {'lr': 3.2495815749224723e-06, 'samples': 27340224, 'steps': 142396, 'loss/train': 0.9026877284049988} 11/07/2021 17:17:29 - INFO - __main__ - Step 142398: {'lr': 3.24872878376134e-06, 'samples': 27340416, 'steps': 142397, 'loss/train': 1.5387576818466187} 11/07/2021 17:17:30 - INFO - __main__ - Step 142399: {'lr': 3.2478761037824934e-06, 'samples': 27340608, 'steps': 142398, 'loss/train': 1.0817911624908447} 11/07/2021 17:17:30 - INFO - __main__ - Step 142400: {'lr': 3.2470235349863487e-06, 'samples': 27340800, 'steps': 142399, 'loss/train': 1.2745590209960938} 11/07/2021 17:17:31 - INFO - __main__ - Step 142401: {'lr': 3.2461710773732946e-06, 'samples': 27340992, 'steps': 142400, 'loss/train': 1.1376041173934937} 11/07/2021 17:17:31 - INFO - __main__ - Step 142402: {'lr': 3.2453187309437193e-06, 'samples': 27341184, 'steps': 142401, 'loss/train': 1.2512987852096558} 11/07/2021 17:17:32 - INFO - __main__ - Step 142403: {'lr': 3.244466495697984e-06, 'samples': 27341376, 'steps': 142402, 'loss/train': 1.2576797008514404} 11/07/2021 17:17:33 - INFO - __main__ - Step 142404: {'lr': 3.2436143716364774e-06, 'samples': 27341568, 'steps': 142403, 'loss/train': 1.1972270011901855} 11/07/2021 17:17:33 - INFO - __main__ - Step 142405: {'lr': 3.2427623587596155e-06, 'samples': 27341760, 'steps': 142404, 'loss/train': 1.3393498659133911} 11/07/2021 17:17:33 - INFO - __main__ - Step 142406: {'lr': 3.2419104570677317e-06, 'samples': 27341952, 'steps': 142405, 'loss/train': 1.07639479637146} 11/07/2021 17:17:34 - INFO - __main__ - Step 142407: {'lr': 3.241058666561242e-06, 'samples': 27342144, 'steps': 142406, 'loss/train': 1.29060697555542} 11/07/2021 17:17:34 - INFO - __main__ - Step 142408: {'lr': 3.240206987240535e-06, 'samples': 27342336, 'steps': 142407, 'loss/train': 1.2247222661972046} 11/07/2021 17:17:35 - INFO - __main__ - Step 142409: {'lr': 3.2393554191059715e-06, 'samples': 27342528, 'steps': 142408, 'loss/train': 1.487721562385559} 11/07/2021 17:17:35 - INFO - __main__ - Step 142410: {'lr': 3.2385039621579405e-06, 'samples': 27342720, 'steps': 142409, 'loss/train': 1.0506937503814697} 11/07/2021 17:17:36 - INFO - __main__ - Step 142411: {'lr': 3.2376526163968303e-06, 'samples': 27342912, 'steps': 142410, 'loss/train': 1.475386142730713} 11/07/2021 17:17:36 - INFO - __main__ - Step 142412: {'lr': 3.2368013818230567e-06, 'samples': 27343104, 'steps': 142411, 'loss/train': 1.1243236064910889} 11/07/2021 17:17:36 - INFO - __main__ - Step 142413: {'lr': 3.2359502584369536e-06, 'samples': 27343296, 'steps': 142412, 'loss/train': 0.29179397225379944} 11/07/2021 17:17:37 - INFO - __main__ - Step 142414: {'lr': 3.235099246238937e-06, 'samples': 27343488, 'steps': 142413, 'loss/train': 1.7744777202606201} 11/07/2021 17:17:38 - INFO - __main__ - Step 142415: {'lr': 3.2342483452293403e-06, 'samples': 27343680, 'steps': 142414, 'loss/train': 1.0820519924163818} 11/07/2021 17:17:38 - INFO - __main__ - Step 142416: {'lr': 3.233397555408607e-06, 'samples': 27343872, 'steps': 142415, 'loss/train': 1.1429486274719238} 11/07/2021 17:17:39 - INFO - __main__ - Step 142417: {'lr': 3.2325468767770983e-06, 'samples': 27344064, 'steps': 142416, 'loss/train': 0.780495285987854} 11/07/2021 17:17:39 - INFO - __main__ - Step 142418: {'lr': 3.2316963093352027e-06, 'samples': 27344256, 'steps': 142417, 'loss/train': 0.8412118554115295} 11/07/2021 17:17:39 - INFO - __main__ - Step 142419: {'lr': 3.2308458530832805e-06, 'samples': 27344448, 'steps': 142418, 'loss/train': 1.4155222177505493} 11/07/2021 17:17:40 - INFO - __main__ - Step 142420: {'lr': 3.229995508021749e-06, 'samples': 27344640, 'steps': 142419, 'loss/train': 0.8228874206542969} 11/07/2021 17:17:41 - INFO - __main__ - Step 142421: {'lr': 3.2291452741509407e-06, 'samples': 27344832, 'steps': 142420, 'loss/train': 0.598537802696228} 11/07/2021 17:17:41 - INFO - __main__ - Step 142422: {'lr': 3.2282951514713e-06, 'samples': 27345024, 'steps': 142421, 'loss/train': 0.9895862936973572} 11/07/2021 17:17:41 - INFO - __main__ - Step 142423: {'lr': 3.2274451399831873e-06, 'samples': 27345216, 'steps': 142422, 'loss/train': 1.5264198780059814} 11/07/2021 17:17:42 - INFO - __main__ - Step 142424: {'lr': 3.2265952396869636e-06, 'samples': 27345408, 'steps': 142423, 'loss/train': 1.1111304759979248} 11/07/2021 17:17:43 - INFO - __main__ - Step 142425: {'lr': 3.225745450583045e-06, 'samples': 27345600, 'steps': 142424, 'loss/train': 1.2020710706710815} 11/07/2021 17:17:43 - INFO - __main__ - Step 142426: {'lr': 3.224895772671793e-06, 'samples': 27345792, 'steps': 142425, 'loss/train': 1.483397364616394} 11/07/2021 17:17:43 - INFO - __main__ - Step 142427: {'lr': 3.224046205953568e-06, 'samples': 27345984, 'steps': 142426, 'loss/train': 0.9271025061607361} 11/07/2021 17:17:44 - INFO - __main__ - Step 142428: {'lr': 3.223196750428814e-06, 'samples': 27346176, 'steps': 142427, 'loss/train': 1.3391939401626587} 11/07/2021 17:17:44 - INFO - __main__ - Step 142429: {'lr': 3.2223474060978643e-06, 'samples': 27346368, 'steps': 142428, 'loss/train': 1.3638198375701904} 11/07/2021 17:17:45 - INFO - __main__ - Step 142430: {'lr': 3.2214981729611072e-06, 'samples': 27346560, 'steps': 142429, 'loss/train': 1.3106685876846313} 11/07/2021 17:17:46 - INFO - __main__ - Step 142431: {'lr': 3.220649051018931e-06, 'samples': 27346752, 'steps': 142430, 'loss/train': 0.815936803817749} 11/07/2021 17:17:46 - INFO - __main__ - Step 142432: {'lr': 3.219800040271753e-06, 'samples': 27346944, 'steps': 142431, 'loss/train': 1.4891825914382935} 11/07/2021 17:17:46 - INFO - __main__ - Step 142433: {'lr': 3.218951140719906e-06, 'samples': 27347136, 'steps': 142432, 'loss/train': 1.1678917407989502} 11/07/2021 17:17:47 - INFO - __main__ - Step 142434: {'lr': 3.2181023523637776e-06, 'samples': 27347328, 'steps': 142433, 'loss/train': 1.5108286142349243} 11/07/2021 17:17:48 - INFO - __main__ - Step 142435: {'lr': 3.217253675203785e-06, 'samples': 27347520, 'steps': 142434, 'loss/train': 1.0343519449234009} 11/07/2021 17:17:48 - INFO - __main__ - Step 142436: {'lr': 3.216405109240289e-06, 'samples': 27347712, 'steps': 142435, 'loss/train': 1.4099013805389404} 11/07/2021 17:17:48 - INFO - __main__ - Step 142437: {'lr': 3.215556654473678e-06, 'samples': 27347904, 'steps': 142436, 'loss/train': 1.1378577947616577} 11/07/2021 17:17:49 - INFO - __main__ - Step 142438: {'lr': 3.214708310904313e-06, 'samples': 27348096, 'steps': 142437, 'loss/train': 1.4269695281982422} 11/07/2021 17:17:49 - INFO - __main__ - Step 142439: {'lr': 3.21386007853261e-06, 'samples': 27348288, 'steps': 142438, 'loss/train': 1.2293931245803833} 11/07/2021 17:17:50 - INFO - __main__ - Step 142440: {'lr': 3.21301195735893e-06, 'samples': 27348480, 'steps': 142439, 'loss/train': 0.9899607300758362} 11/07/2021 17:17:51 - INFO - __main__ - Step 142441: {'lr': 3.212163947383634e-06, 'samples': 27348672, 'steps': 142440, 'loss/train': 1.3790082931518555} 11/07/2021 17:17:51 - INFO - __main__ - Step 142442: {'lr': 3.2113160486071658e-06, 'samples': 27348864, 'steps': 142441, 'loss/train': 1.3158856630325317} 11/07/2021 17:17:51 - INFO - __main__ - Step 142443: {'lr': 3.2104682610298308e-06, 'samples': 27349056, 'steps': 142442, 'loss/train': 1.7400109767913818} 11/07/2021 17:17:52 - INFO - __main__ - Step 142444: {'lr': 3.2096205846520734e-06, 'samples': 27349248, 'steps': 142443, 'loss/train': 1.3344727754592896} 11/07/2021 17:17:53 - INFO - __main__ - Step 142445: {'lr': 3.208773019474254e-06, 'samples': 27349440, 'steps': 142444, 'loss/train': 1.490903377532959} 11/07/2021 17:17:53 - INFO - __main__ - Step 142446: {'lr': 3.207925565496761e-06, 'samples': 27349632, 'steps': 142445, 'loss/train': 1.291731357574463} 11/07/2021 17:17:53 - INFO - __main__ - Step 142447: {'lr': 3.2070782227199557e-06, 'samples': 27349824, 'steps': 142446, 'loss/train': 1.3441264629364014} 11/07/2021 17:17:54 - INFO - __main__ - Step 142448: {'lr': 3.2062309911442266e-06, 'samples': 27350016, 'steps': 142447, 'loss/train': 1.4622310400009155} 11/07/2021 17:17:54 - INFO - __main__ - Step 142449: {'lr': 3.2053838707699622e-06, 'samples': 27350208, 'steps': 142448, 'loss/train': 1.094963788986206} 11/07/2021 17:17:55 - INFO - __main__ - Step 142450: {'lr': 3.204536861597551e-06, 'samples': 27350400, 'steps': 142449, 'loss/train': 3.1421561241149902} 11/07/2021 17:17:56 - INFO - __main__ - Step 142451: {'lr': 3.203689963627354e-06, 'samples': 27350592, 'steps': 142450, 'loss/train': 1.475818395614624} 11/07/2021 17:17:56 - INFO - __main__ - Step 142452: {'lr': 3.2028431768598155e-06, 'samples': 27350784, 'steps': 142451, 'loss/train': 1.8720250129699707} 11/07/2021 17:17:56 - INFO - __main__ - Step 142453: {'lr': 3.2019965012952125e-06, 'samples': 27350976, 'steps': 142452, 'loss/train': 1.3255010843276978} 11/07/2021 17:17:57 - INFO - __main__ - Step 142454: {'lr': 3.201149936933989e-06, 'samples': 27351168, 'steps': 142453, 'loss/train': 1.3137831687927246} 11/07/2021 17:17:58 - INFO - __main__ - Step 142455: {'lr': 3.2003034837765345e-06, 'samples': 27351360, 'steps': 142454, 'loss/train': 1.470068335533142} 11/07/2021 17:17:58 - INFO - __main__ - Step 142456: {'lr': 3.1994571418232088e-06, 'samples': 27351552, 'steps': 142455, 'loss/train': 1.1424893140792847} 11/07/2021 17:17:58 - INFO - __main__ - Step 142457: {'lr': 3.198610911074401e-06, 'samples': 27351744, 'steps': 142456, 'loss/train': 1.153344988822937} 11/07/2021 17:17:59 - INFO - __main__ - Step 142458: {'lr': 3.1977647915304997e-06, 'samples': 27351936, 'steps': 142457, 'loss/train': 0.45102331042289734} 11/07/2021 17:17:59 - INFO - __main__ - Step 142459: {'lr': 3.1969187831918656e-06, 'samples': 27352128, 'steps': 142458, 'loss/train': 1.147691249847412} 11/07/2021 17:17:59 - INFO - __main__ - Step 142460: {'lr': 3.1960728860588873e-06, 'samples': 27352320, 'steps': 142459, 'loss/train': 1.288500189781189} 11/07/2021 17:18:02 - INFO - __main__ - Step 142461: {'lr': 3.1952271001319533e-06, 'samples': 27352512, 'steps': 142460, 'loss/train': 1.5509809255599976} 11/07/2021 17:18:02 - INFO - __main__ - Step 142462: {'lr': 3.1943814254114523e-06, 'samples': 27352704, 'steps': 142461, 'loss/train': 1.827945351600647} 11/07/2021 17:18:02 - INFO - __main__ - Step 142463: {'lr': 3.193535861897745e-06, 'samples': 27352896, 'steps': 142462, 'loss/train': 1.0645029544830322} 11/07/2021 17:18:03 - INFO - __main__ - Step 142464: {'lr': 3.1926904095912203e-06, 'samples': 27353088, 'steps': 142463, 'loss/train': 1.0187681913375854} 11/07/2021 17:18:03 - INFO - __main__ - Step 142465: {'lr': 3.191845068492266e-06, 'samples': 27353280, 'steps': 142464, 'loss/train': 1.4207849502563477} 11/07/2021 17:18:03 - INFO - __main__ - Step 142466: {'lr': 3.190999838601272e-06, 'samples': 27353472, 'steps': 142465, 'loss/train': 0.9665714502334595} 11/07/2021 17:18:04 - INFO - __main__ - Step 142467: {'lr': 3.1901547199185697e-06, 'samples': 27353664, 'steps': 142466, 'loss/train': 1.745705485343933} 11/07/2021 17:18:04 - INFO - __main__ - Step 142468: {'lr': 3.189309712444605e-06, 'samples': 27353856, 'steps': 142467, 'loss/train': 1.702601432800293} 11/07/2021 17:18:05 - INFO - __main__ - Step 142469: {'lr': 3.1884648161797094e-06, 'samples': 27354048, 'steps': 142468, 'loss/train': 1.1011940240859985} 11/07/2021 17:18:06 - INFO - __main__ - Step 142470: {'lr': 3.1876200311242997e-06, 'samples': 27354240, 'steps': 142469, 'loss/train': 1.3055700063705444} 11/07/2021 17:18:06 - INFO - __main__ - Step 142471: {'lr': 3.1867753572787374e-06, 'samples': 27354432, 'steps': 142470, 'loss/train': 1.5200042724609375} 11/07/2021 17:18:06 - INFO - __main__ - Step 142472: {'lr': 3.1859307946433825e-06, 'samples': 27354624, 'steps': 142471, 'loss/train': 0.06391355395317078} 11/07/2021 17:18:07 - INFO - __main__ - Step 142473: {'lr': 3.1850863432186793e-06, 'samples': 27354816, 'steps': 142472, 'loss/train': 1.4094040393829346} 11/07/2021 17:18:07 - INFO - __main__ - Step 142474: {'lr': 3.1842420030049335e-06, 'samples': 27355008, 'steps': 142473, 'loss/train': 1.4078844785690308} 11/07/2021 17:18:08 - INFO - __main__ - Step 142475: {'lr': 3.183397774002561e-06, 'samples': 27355200, 'steps': 142474, 'loss/train': 0.29308852553367615} 11/07/2021 17:18:09 - INFO - __main__ - Step 142476: {'lr': 3.1825536562119508e-06, 'samples': 27355392, 'steps': 142475, 'loss/train': 1.1850051879882812} 11/07/2021 17:18:09 - INFO - __main__ - Step 142477: {'lr': 3.1817096496334906e-06, 'samples': 27355584, 'steps': 142476, 'loss/train': 1.2656363248825073} 11/07/2021 17:18:09 - INFO - __main__ - Step 142478: {'lr': 3.1808657542675146e-06, 'samples': 27355776, 'steps': 142477, 'loss/train': 0.8735725283622742} 11/07/2021 17:18:10 - INFO - __main__ - Step 142479: {'lr': 3.1800219701144662e-06, 'samples': 27355968, 'steps': 142478, 'loss/train': 1.4174009561538696} 11/07/2021 17:18:11 - INFO - __main__ - Step 142480: {'lr': 3.179178297174651e-06, 'samples': 27356160, 'steps': 142479, 'loss/train': 1.1115869283676147} 11/07/2021 17:18:11 - INFO - __main__ - Step 142481: {'lr': 3.1783347354485124e-06, 'samples': 27356352, 'steps': 142480, 'loss/train': 1.263999581336975} 11/07/2021 17:18:11 - INFO - __main__ - Step 142482: {'lr': 3.1774912849364125e-06, 'samples': 27356544, 'steps': 142481, 'loss/train': 1.2776148319244385} 11/07/2021 17:18:12 - INFO - __main__ - Step 142483: {'lr': 3.176647945638711e-06, 'samples': 27356736, 'steps': 142482, 'loss/train': 1.4344313144683838} 11/07/2021 17:18:12 - INFO - __main__ - Step 142484: {'lr': 3.175804717555797e-06, 'samples': 27356928, 'steps': 142483, 'loss/train': 1.6721035242080688} 11/07/2021 17:18:13 - INFO - __main__ - Step 142485: {'lr': 3.174961600688059e-06, 'samples': 27357120, 'steps': 142484, 'loss/train': 1.0319185256958008} 11/07/2021 17:18:14 - INFO - __main__ - Step 142486: {'lr': 3.1741185950358854e-06, 'samples': 27357312, 'steps': 142485, 'loss/train': 1.9227850437164307} 11/07/2021 17:18:14 - INFO - __main__ - Step 142487: {'lr': 3.173275700599637e-06, 'samples': 27357504, 'steps': 142486, 'loss/train': 1.6038118600845337} 11/07/2021 17:18:14 - INFO - __main__ - Step 142488: {'lr': 3.1724329173797306e-06, 'samples': 27357696, 'steps': 142487, 'loss/train': 1.5817275047302246} 11/07/2021 17:18:15 - INFO - __main__ - Step 142489: {'lr': 3.171590245376471e-06, 'samples': 27357888, 'steps': 142488, 'loss/train': 1.3421268463134766} 11/07/2021 17:18:15 - INFO - __main__ - Step 142490: {'lr': 3.1707476845903025e-06, 'samples': 27358080, 'steps': 142489, 'loss/train': 1.3224331140518188} 11/07/2021 17:18:16 - INFO - __main__ - Step 142491: {'lr': 3.169905235021614e-06, 'samples': 27358272, 'steps': 142490, 'loss/train': 1.1405991315841675} 11/07/2021 17:18:16 - INFO - __main__ - Step 142492: {'lr': 3.1690628966707103e-06, 'samples': 27358464, 'steps': 142491, 'loss/train': 1.4275044202804565} 11/07/2021 17:18:17 - INFO - __main__ - Step 142493: {'lr': 3.168220669538063e-06, 'samples': 27358656, 'steps': 142492, 'loss/train': 1.2237814664840698} 11/07/2021 17:18:17 - INFO - __main__ - Step 142494: {'lr': 3.167378553624006e-06, 'samples': 27358848, 'steps': 142493, 'loss/train': 1.3679369688034058} 11/07/2021 17:18:17 - INFO - __main__ - Step 142495: {'lr': 3.166536548928872e-06, 'samples': 27359040, 'steps': 142494, 'loss/train': 1.4712145328521729} 11/07/2021 17:18:19 - INFO - __main__ - Step 142496: {'lr': 3.1656946554531328e-06, 'samples': 27359232, 'steps': 142495, 'loss/train': 1.3232122659683228} 11/07/2021 17:18:19 - INFO - __main__ - Step 142497: {'lr': 3.164852873197094e-06, 'samples': 27359424, 'steps': 142496, 'loss/train': 1.2089613676071167} 11/07/2021 17:18:20 - INFO - __main__ - Step 142498: {'lr': 3.164011202161171e-06, 'samples': 27359616, 'steps': 142497, 'loss/train': 1.1021581888198853} 11/07/2021 17:18:20 - INFO - __main__ - Step 142499: {'lr': 3.1631696423457536e-06, 'samples': 27359808, 'steps': 142498, 'loss/train': 0.7762266993522644} 11/07/2021 17:18:21 - INFO - __main__ - Step 142500: {'lr': 3.1623281937511737e-06, 'samples': 27360000, 'steps': 142499, 'loss/train': 1.3777306079864502} 11/07/2021 17:18:21 - INFO - __main__ - Step 142501: {'lr': 3.1614868563778486e-06, 'samples': 27360192, 'steps': 142500, 'loss/train': 1.4735877513885498} 11/07/2021 17:18:21 - INFO - __main__ - Step 142502: {'lr': 3.1606456302261667e-06, 'samples': 27360384, 'steps': 142501, 'loss/train': 0.06121412664651871} 11/07/2021 17:18:22 - INFO - __main__ - Step 142503: {'lr': 3.1598045152964605e-06, 'samples': 27360576, 'steps': 142502, 'loss/train': 1.3499679565429688} 11/07/2021 17:18:23 - INFO - __main__ - Step 142504: {'lr': 3.1589635115891746e-06, 'samples': 27360768, 'steps': 142503, 'loss/train': 1.0454481840133667} 11/07/2021 17:18:23 - INFO - __main__ - Step 142505: {'lr': 3.1581226191046144e-06, 'samples': 27360960, 'steps': 142504, 'loss/train': 0.999911367893219} 11/07/2021 17:18:23 - INFO - __main__ - Step 142506: {'lr': 3.1572818378432233e-06, 'samples': 27361152, 'steps': 142505, 'loss/train': 1.5785415172576904} 11/07/2021 17:18:24 - INFO - __main__ - Step 142507: {'lr': 3.156441167805335e-06, 'samples': 27361344, 'steps': 142506, 'loss/train': 0.9011591672897339} 11/07/2021 17:18:25 - INFO - __main__ - Step 142508: {'lr': 3.1556006089913657e-06, 'samples': 27361536, 'steps': 142507, 'loss/train': 1.722226858139038} 11/07/2021 17:18:25 - INFO - __main__ - Step 142509: {'lr': 3.1547601614016487e-06, 'samples': 27361728, 'steps': 142508, 'loss/train': 1.1217628717422485} 11/07/2021 17:18:26 - INFO - __main__ - Step 142510: {'lr': 3.1539198250365997e-06, 'samples': 27361920, 'steps': 142509, 'loss/train': 1.5711307525634766} 11/07/2021 17:18:26 - INFO - __main__ - Step 142511: {'lr': 3.15307959989658e-06, 'samples': 27362112, 'steps': 142510, 'loss/train': 0.6908512115478516} 11/07/2021 17:18:26 - INFO - __main__ - Step 142512: {'lr': 3.152239485981978e-06, 'samples': 27362304, 'steps': 142511, 'loss/train': 1.3294649124145508} 11/07/2021 17:18:27 - INFO - __main__ - Step 142513: {'lr': 3.1513994832931547e-06, 'samples': 27362496, 'steps': 142512, 'loss/train': 0.9304221272468567} 11/07/2021 17:18:28 - INFO - __main__ - Step 142514: {'lr': 3.1505595918305264e-06, 'samples': 27362688, 'steps': 142513, 'loss/train': 0.45528221130371094} 11/07/2021 17:18:28 - INFO - __main__ - Step 142515: {'lr': 3.1497198115944258e-06, 'samples': 27362880, 'steps': 142514, 'loss/train': 1.3756153583526611} 11/07/2021 17:18:28 - INFO - __main__ - Step 142516: {'lr': 3.148880142585242e-06, 'samples': 27363072, 'steps': 142515, 'loss/train': 0.9787878394126892} 11/07/2021 17:18:29 - INFO - __main__ - Step 142517: {'lr': 3.148040584803391e-06, 'samples': 27363264, 'steps': 142516, 'loss/train': 1.379019021987915} 11/07/2021 17:18:30 - INFO - __main__ - Step 142518: {'lr': 3.1472011382492062e-06, 'samples': 27363456, 'steps': 142517, 'loss/train': 1.3108179569244385} 11/07/2021 17:18:30 - INFO - __main__ - Step 142519: {'lr': 3.1463618029231035e-06, 'samples': 27363648, 'steps': 142518, 'loss/train': 0.11753109097480774} 11/07/2021 17:18:31 - INFO - __main__ - Step 142520: {'lr': 3.145522578825444e-06, 'samples': 27363840, 'steps': 142519, 'loss/train': 1.6188803911209106} 11/07/2021 17:18:31 - INFO - __main__ - Step 142521: {'lr': 3.144683465956588e-06, 'samples': 27364032, 'steps': 142520, 'loss/train': 1.162319540977478} 11/07/2021 17:18:31 - INFO - __main__ - Step 142522: {'lr': 3.1438444643169252e-06, 'samples': 27364224, 'steps': 142521, 'loss/train': 1.2653898000717163} 11/07/2021 17:18:32 - INFO - __main__ - Step 142523: {'lr': 3.1430055739068432e-06, 'samples': 27364416, 'steps': 142522, 'loss/train': 1.4063749313354492} 11/07/2021 17:18:33 - INFO - __main__ - Step 142524: {'lr': 3.142166794726703e-06, 'samples': 27364608, 'steps': 142523, 'loss/train': 0.3001512289047241} 11/07/2021 17:18:33 - INFO - __main__ - Step 142525: {'lr': 3.1413281267768936e-06, 'samples': 27364800, 'steps': 142524, 'loss/train': 1.444319725036621} 11/07/2021 17:18:33 - INFO - __main__ - Step 142526: {'lr': 3.1404895700578027e-06, 'samples': 27364992, 'steps': 142525, 'loss/train': 1.3266955614089966} 11/07/2021 17:18:34 - INFO - __main__ - Step 142527: {'lr': 3.1396511245697922e-06, 'samples': 27365184, 'steps': 142526, 'loss/train': 1.3331518173217773} 11/07/2021 17:18:34 - INFO - __main__ - Step 142528: {'lr': 3.13881279031325e-06, 'samples': 27365376, 'steps': 142527, 'loss/train': 1.2657686471939087} 11/07/2021 17:18:35 - INFO - __main__ - Step 142529: {'lr': 3.1379745672885376e-06, 'samples': 27365568, 'steps': 142528, 'loss/train': 1.3308120965957642} 11/07/2021 17:18:36 - INFO - __main__ - Step 142530: {'lr': 3.13713645549607e-06, 'samples': 27365760, 'steps': 142529, 'loss/train': 1.3651703596115112} 11/07/2021 17:18:36 - INFO - __main__ - Step 142531: {'lr': 3.1362984549361815e-06, 'samples': 27365952, 'steps': 142530, 'loss/train': 1.5313152074813843} 11/07/2021 17:18:36 - INFO - __main__ - Step 142532: {'lr': 3.1354605656092605e-06, 'samples': 27366144, 'steps': 142531, 'loss/train': 0.7775653600692749} 11/07/2021 17:18:37 - INFO - __main__ - Step 142533: {'lr': 3.134622787515723e-06, 'samples': 27366336, 'steps': 142532, 'loss/train': 0.6359217166900635} 11/07/2021 17:18:38 - INFO - __main__ - Step 142534: {'lr': 3.1337851206558743e-06, 'samples': 27366528, 'steps': 142533, 'loss/train': 1.643383502960205} 11/07/2021 17:18:38 - INFO - __main__ - Step 142535: {'lr': 3.1329475650301588e-06, 'samples': 27366720, 'steps': 142534, 'loss/train': 1.3452998399734497} 11/07/2021 17:18:38 - INFO - __main__ - Step 142536: {'lr': 3.1321101206389092e-06, 'samples': 27366912, 'steps': 142535, 'loss/train': 1.5170024633407593} 11/07/2021 17:18:39 - INFO - __main__ - Step 142537: {'lr': 3.131272787482542e-06, 'samples': 27367104, 'steps': 142536, 'loss/train': 1.4904383420944214} 11/07/2021 17:18:39 - INFO - __main__ - Step 142538: {'lr': 3.1304355655613904e-06, 'samples': 27367296, 'steps': 142537, 'loss/train': 1.4572583436965942} 11/07/2021 17:18:40 - INFO - __main__ - Step 142539: {'lr': 3.1295984548758704e-06, 'samples': 27367488, 'steps': 142538, 'loss/train': 1.5907227993011475} 11/07/2021 17:18:41 - INFO - __main__ - Step 142540: {'lr': 3.128761455426343e-06, 'samples': 27367680, 'steps': 142539, 'loss/train': 0.7885992527008057} 11/07/2021 17:18:41 - INFO - __main__ - Step 142541: {'lr': 3.1279245672131974e-06, 'samples': 27367872, 'steps': 142540, 'loss/train': 1.3180168867111206} 11/07/2021 17:18:41 - INFO - __main__ - Step 142542: {'lr': 3.127087790236793e-06, 'samples': 27368064, 'steps': 142541, 'loss/train': 1.5299092531204224} 11/07/2021 17:18:42 - INFO - __main__ - Step 142543: {'lr': 3.1262511244974923e-06, 'samples': 27368256, 'steps': 142542, 'loss/train': 1.3106626272201538} 11/07/2021 17:18:43 - INFO - __main__ - Step 142544: {'lr': 3.1254145699957105e-06, 'samples': 27368448, 'steps': 142543, 'loss/train': 2.0150370597839355} 11/07/2021 17:18:43 - INFO - __main__ - Step 142545: {'lr': 3.1245781267318087e-06, 'samples': 27368640, 'steps': 142544, 'loss/train': 0.1184912919998169} 11/07/2021 17:18:43 - INFO - __main__ - Step 142546: {'lr': 3.1237417947061752e-06, 'samples': 27368832, 'steps': 142545, 'loss/train': 1.3343242406845093} 11/07/2021 17:18:44 - INFO - __main__ - Step 142547: {'lr': 3.122905573919144e-06, 'samples': 27369024, 'steps': 142546, 'loss/train': 1.6273276805877686} 11/07/2021 17:18:44 - INFO - __main__ - Step 142548: {'lr': 3.122069464371158e-06, 'samples': 27369216, 'steps': 142547, 'loss/train': 1.2695543766021729} 11/07/2021 17:18:45 - INFO - __main__ - Step 142549: {'lr': 3.121233466062523e-06, 'samples': 27369408, 'steps': 142548, 'loss/train': 1.1733713150024414} 11/07/2021 17:18:46 - INFO - __main__ - Step 142550: {'lr': 3.1203975789936557e-06, 'samples': 27369600, 'steps': 142549, 'loss/train': 1.008500337600708} 11/07/2021 17:18:46 - INFO - __main__ - Step 142551: {'lr': 3.119561803164944e-06, 'samples': 27369792, 'steps': 142550, 'loss/train': 1.881112813949585} 11/07/2021 17:18:46 - INFO - __main__ - Step 142552: {'lr': 3.1187261385767494e-06, 'samples': 27369984, 'steps': 142551, 'loss/train': 1.4677269458770752} 11/07/2021 17:18:47 - INFO - __main__ - Step 142553: {'lr': 3.1178905852294327e-06, 'samples': 27370176, 'steps': 142552, 'loss/train': 1.1815931797027588} 11/07/2021 17:18:48 - INFO - __main__ - Step 142554: {'lr': 3.117055143123382e-06, 'samples': 27370368, 'steps': 142553, 'loss/train': 1.3687974214553833} 11/07/2021 17:18:48 - INFO - __main__ - Step 142555: {'lr': 3.1162198122589856e-06, 'samples': 27370560, 'steps': 142554, 'loss/train': 1.4654176235198975} 11/07/2021 17:18:48 - INFO - __main__ - Step 142556: {'lr': 3.1153845926366053e-06, 'samples': 27370752, 'steps': 142555, 'loss/train': 1.7109801769256592} 11/07/2021 17:18:49 - INFO - __main__ - Step 142557: {'lr': 3.114549484256629e-06, 'samples': 27370944, 'steps': 142556, 'loss/train': 1.0382205247879028} 11/07/2021 17:18:49 - INFO - __main__ - Step 142558: {'lr': 3.1137144871194177e-06, 'samples': 27371136, 'steps': 142557, 'loss/train': 1.0915354490280151} 11/07/2021 17:18:50 - INFO - __main__ - Step 142559: {'lr': 3.1128796012253877e-06, 'samples': 27371328, 'steps': 142558, 'loss/train': 1.4425885677337646} 11/07/2021 17:18:51 - INFO - __main__ - Step 142560: {'lr': 3.1120448265748726e-06, 'samples': 27371520, 'steps': 142559, 'loss/train': 1.0998611450195312} 11/07/2021 17:18:51 - INFO - __main__ - Step 142561: {'lr': 3.1112101631682323e-06, 'samples': 27371712, 'steps': 142560, 'loss/train': 1.1826902627944946} 11/07/2021 17:18:51 - INFO - __main__ - Step 142562: {'lr': 3.1103756110058835e-06, 'samples': 27371904, 'steps': 142561, 'loss/train': 1.067132830619812} 11/07/2021 17:18:52 - INFO - __main__ - Step 142563: {'lr': 3.1095411700882148e-06, 'samples': 27372096, 'steps': 142562, 'loss/train': 1.0367441177368164} 11/07/2021 17:18:53 - INFO - __main__ - Step 142564: {'lr': 3.1087068404155593e-06, 'samples': 27372288, 'steps': 142563, 'loss/train': 1.2527261972427368} 11/07/2021 17:18:53 - INFO - __main__ - Step 142565: {'lr': 3.1078726219883058e-06, 'samples': 27372480, 'steps': 142564, 'loss/train': 1.4539318084716797} 11/07/2021 17:18:53 - INFO - __main__ - Step 142566: {'lr': 3.1070385148068148e-06, 'samples': 27372672, 'steps': 142565, 'loss/train': 1.6257964372634888} 11/07/2021 17:18:54 - INFO - __main__ - Step 142567: {'lr': 3.1062045188715305e-06, 'samples': 27372864, 'steps': 142566, 'loss/train': 1.3246915340423584} 11/07/2021 17:18:54 - INFO - __main__ - Step 142568: {'lr': 3.1053706341827304e-06, 'samples': 27373056, 'steps': 142567, 'loss/train': 1.0535070896148682} 11/07/2021 17:18:54 - INFO - __main__ - Step 142569: {'lr': 3.1045368607408866e-06, 'samples': 27373248, 'steps': 142568, 'loss/train': 1.12577486038208} 11/07/2021 17:18:55 - INFO - __main__ - Step 142570: {'lr': 3.1037031985463036e-06, 'samples': 27373440, 'steps': 142569, 'loss/train': 1.9971948862075806} 11/07/2021 17:18:56 - INFO - __main__ - Step 142571: {'lr': 3.102869647599371e-06, 'samples': 27373632, 'steps': 142570, 'loss/train': 1.7210301160812378} 11/07/2021 17:18:56 - INFO - __main__ - Step 142572: {'lr': 3.1020362079005047e-06, 'samples': 27373824, 'steps': 142571, 'loss/train': 0.3862866759300232} 11/07/2021 17:18:56 - INFO - __main__ - Step 142573: {'lr': 3.101202879450038e-06, 'samples': 27374016, 'steps': 142572, 'loss/train': 1.3227049112319946} 11/07/2021 17:18:57 - INFO - __main__ - Step 142574: {'lr': 3.1003696622483592e-06, 'samples': 27374208, 'steps': 142573, 'loss/train': 0.8725963830947876} 11/07/2021 17:18:58 - INFO - __main__ - Step 142575: {'lr': 3.0995365562958565e-06, 'samples': 27374400, 'steps': 142574, 'loss/train': 1.3934166431427002} 11/07/2021 17:18:58 - INFO - __main__ - Step 142576: {'lr': 3.098703561592864e-06, 'samples': 27374592, 'steps': 142575, 'loss/train': 1.2744837999343872} 11/07/2021 17:18:59 - INFO - __main__ - Step 142577: {'lr': 3.097870678139797e-06, 'samples': 27374784, 'steps': 142576, 'loss/train': 1.3112339973449707} 11/07/2021 17:18:59 - INFO - __main__ - Step 142578: {'lr': 3.097037905937017e-06, 'samples': 27374976, 'steps': 142577, 'loss/train': 1.0254294872283936} 11/07/2021 17:18:59 - INFO - __main__ - Step 142579: {'lr': 3.096205244984912e-06, 'samples': 27375168, 'steps': 142578, 'loss/train': 1.2635935544967651} 11/07/2021 17:19:00 - INFO - __main__ - Step 142580: {'lr': 3.0953726952838435e-06, 'samples': 27375360, 'steps': 142579, 'loss/train': 0.8027219772338867} 11/07/2021 17:19:01 - INFO - __main__ - Step 142581: {'lr': 3.0945402568341997e-06, 'samples': 27375552, 'steps': 142580, 'loss/train': 1.3120821714401245} 11/07/2021 17:19:01 - INFO - __main__ - Step 142582: {'lr': 3.093707929636341e-06, 'samples': 27375744, 'steps': 142581, 'loss/train': 1.4099901914596558} 11/07/2021 17:19:01 - INFO - __main__ - Step 142583: {'lr': 3.0928757136906295e-06, 'samples': 27375936, 'steps': 142582, 'loss/train': 1.5805730819702148} 11/07/2021 17:19:02 - INFO - __main__ - Step 142584: {'lr': 3.09204360899748e-06, 'samples': 27376128, 'steps': 142583, 'loss/train': 0.9686486721038818} 11/07/2021 17:19:03 - INFO - __main__ - Step 142585: {'lr': 3.0912116155572266e-06, 'samples': 27376320, 'steps': 142584, 'loss/train': 1.206552505493164} 11/07/2021 17:19:03 - INFO - __main__ - Step 142586: {'lr': 3.0903797333702853e-06, 'samples': 27376512, 'steps': 142585, 'loss/train': 1.230075478553772} 11/07/2021 17:19:04 - INFO - __main__ - Step 142587: {'lr': 3.0895479624370173e-06, 'samples': 27376704, 'steps': 142586, 'loss/train': 1.5119130611419678} 11/07/2021 17:19:04 - INFO - __main__ - Step 142588: {'lr': 3.0887163027577546e-06, 'samples': 27376896, 'steps': 142587, 'loss/train': 1.3346433639526367} 11/07/2021 17:19:04 - INFO - __main__ - Step 142589: {'lr': 3.0878847543329145e-06, 'samples': 27377088, 'steps': 142588, 'loss/train': 1.480877161026001} 11/07/2021 17:19:06 - INFO - __main__ - Step 142590: {'lr': 3.0870533171628857e-06, 'samples': 27377280, 'steps': 142589, 'loss/train': 1.47978937625885} 11/07/2021 17:19:06 - INFO - __main__ - Step 142591: {'lr': 3.0862219912480007e-06, 'samples': 27377472, 'steps': 142590, 'loss/train': 5.711002826690674} 11/07/2021 17:19:07 - INFO - __main__ - Step 142592: {'lr': 3.085390776588648e-06, 'samples': 27377664, 'steps': 142591, 'loss/train': 1.221822738647461} 11/07/2021 17:19:07 - INFO - __main__ - Step 142593: {'lr': 3.0845596731852167e-06, 'samples': 27377856, 'steps': 142592, 'loss/train': 0.9305167198181152} 11/07/2021 17:19:07 - INFO - __main__ - Step 142594: {'lr': 3.0837286810380673e-06, 'samples': 27378048, 'steps': 142593, 'loss/train': 0.9343197345733643} 11/07/2021 17:19:08 - INFO - __main__ - Step 142595: {'lr': 3.082897800147588e-06, 'samples': 27378240, 'steps': 142594, 'loss/train': 1.4251257181167603} 11/07/2021 17:19:08 - INFO - __main__ - Step 142596: {'lr': 3.0820670305141407e-06, 'samples': 27378432, 'steps': 142595, 'loss/train': 1.3451330661773682} 11/07/2021 17:19:09 - INFO - __main__ - Step 142597: {'lr': 3.081236372138113e-06, 'samples': 27378624, 'steps': 142596, 'loss/train': 1.0587990283966064} 11/07/2021 17:19:10 - INFO - __main__ - Step 142598: {'lr': 3.0804058250198663e-06, 'samples': 27378816, 'steps': 142597, 'loss/train': 1.1530965566635132} 11/07/2021 17:19:10 - INFO - __main__ - Step 142599: {'lr': 3.079575389159761e-06, 'samples': 27379008, 'steps': 142598, 'loss/train': 0.27688124775886536} 11/07/2021 17:19:10 - INFO - __main__ - Step 142600: {'lr': 3.0787450645582136e-06, 'samples': 27379200, 'steps': 142599, 'loss/train': 1.197205901145935} 11/07/2021 17:19:11 - INFO - __main__ - Step 142601: {'lr': 3.077914851215585e-06, 'samples': 27379392, 'steps': 142600, 'loss/train': 1.3525413274765015} 11/07/2021 17:19:12 - INFO - __main__ - Step 142602: {'lr': 3.0770847491322085e-06, 'samples': 27379584, 'steps': 142601, 'loss/train': 1.5079551935195923} 11/07/2021 17:19:12 - INFO - __main__ - Step 142603: {'lr': 3.0762547583085e-06, 'samples': 27379776, 'steps': 142602, 'loss/train': 1.4612998962402344} 11/07/2021 17:19:13 - INFO - __main__ - Step 142604: {'lr': 3.0754248787447923e-06, 'samples': 27379968, 'steps': 142603, 'loss/train': 0.8452417254447937} 11/07/2021 17:19:13 - INFO - __main__ - Step 142605: {'lr': 3.0745951104415303e-06, 'samples': 27380160, 'steps': 142604, 'loss/train': 1.1769516468048096} 11/07/2021 17:19:13 - INFO - __main__ - Step 142606: {'lr': 3.073765453399019e-06, 'samples': 27380352, 'steps': 142605, 'loss/train': 1.530021071434021} 11/07/2021 17:19:14 - INFO - __main__ - Step 142607: {'lr': 3.0729359076176464e-06, 'samples': 27380544, 'steps': 142606, 'loss/train': 1.039892554283142} 11/07/2021 17:19:15 - INFO - __main__ - Step 142608: {'lr': 3.07210647309783e-06, 'samples': 27380736, 'steps': 142607, 'loss/train': 1.5301549434661865} 11/07/2021 17:19:15 - INFO - __main__ - Step 142609: {'lr': 3.0712771498399015e-06, 'samples': 27380928, 'steps': 142608, 'loss/train': 1.5144034624099731} 11/07/2021 17:19:15 - INFO - __main__ - Step 142610: {'lr': 3.070447937844251e-06, 'samples': 27381120, 'steps': 142609, 'loss/train': 1.4809495210647583} 11/07/2021 17:19:16 - INFO - __main__ - Step 142611: {'lr': 3.0696188371112377e-06, 'samples': 27381312, 'steps': 142610, 'loss/train': 1.1267673969268799} 11/07/2021 17:19:17 - INFO - __main__ - Step 142612: {'lr': 3.0687898476412512e-06, 'samples': 27381504, 'steps': 142611, 'loss/train': 1.3984860181808472} 11/07/2021 17:19:17 - INFO - __main__ - Step 142613: {'lr': 3.0679609694346523e-06, 'samples': 27381696, 'steps': 142612, 'loss/train': 1.0651841163635254} 11/07/2021 17:19:17 - INFO - __main__ - Step 142614: {'lr': 3.0671322024918292e-06, 'samples': 27381888, 'steps': 142613, 'loss/train': 1.3773828744888306} 11/07/2021 17:19:18 - INFO - __main__ - Step 142615: {'lr': 3.066303546813115e-06, 'samples': 27382080, 'steps': 142614, 'loss/train': 1.3153598308563232} 11/07/2021 17:19:18 - INFO - __main__ - Step 142616: {'lr': 3.065475002398954e-06, 'samples': 27382272, 'steps': 142615, 'loss/train': 1.5327500104904175} 11/07/2021 17:19:19 - INFO - __main__ - Step 142617: {'lr': 3.0646465692496516e-06, 'samples': 27382464, 'steps': 142616, 'loss/train': 1.769315481185913} 11/07/2021 17:19:20 - INFO - __main__ - Step 142618: {'lr': 3.063818247365624e-06, 'samples': 27382656, 'steps': 142617, 'loss/train': 0.8464761972427368} 11/07/2021 17:19:20 - INFO - __main__ - Step 142619: {'lr': 3.062990036747204e-06, 'samples': 27382848, 'steps': 142618, 'loss/train': 1.8633434772491455} 11/07/2021 17:19:20 - INFO - __main__ - Step 142620: {'lr': 3.0621619373948084e-06, 'samples': 27383040, 'steps': 142619, 'loss/train': 1.2848182916641235} 11/07/2021 17:19:21 - INFO - __main__ - Step 142621: {'lr': 3.0613339493087978e-06, 'samples': 27383232, 'steps': 142620, 'loss/train': 0.04250134155154228} 11/07/2021 17:19:21 - INFO - __main__ - Step 142622: {'lr': 3.0605060724895607e-06, 'samples': 27383424, 'steps': 142621, 'loss/train': 1.1493725776672363} 11/07/2021 17:19:22 - INFO - __main__ - Step 142623: {'lr': 3.0596783069374025e-06, 'samples': 27383616, 'steps': 142622, 'loss/train': 1.499244213104248} 11/07/2021 17:19:23 - INFO - __main__ - Step 142624: {'lr': 3.058850652652767e-06, 'samples': 27383808, 'steps': 142623, 'loss/train': 0.22031089663505554} 11/07/2021 17:19:23 - INFO - __main__ - Step 142625: {'lr': 3.0580231096360155e-06, 'samples': 27384000, 'steps': 142624, 'loss/train': 1.4656020402908325} 11/07/2021 17:19:23 - INFO - __main__ - Step 142626: {'lr': 3.057195677887481e-06, 'samples': 27384192, 'steps': 142625, 'loss/train': 0.7727726697921753} 11/07/2021 17:19:24 - INFO - __main__ - Step 142627: {'lr': 3.0563683574075795e-06, 'samples': 27384384, 'steps': 142626, 'loss/train': 1.4618364572525024} 11/07/2021 17:19:25 - INFO - __main__ - Step 142628: {'lr': 3.0555411481966446e-06, 'samples': 27384576, 'steps': 142627, 'loss/train': 1.1295945644378662} 11/07/2021 17:19:25 - INFO - __main__ - Step 142629: {'lr': 3.0547140502550917e-06, 'samples': 27384768, 'steps': 142628, 'loss/train': 1.5966551303863525} 11/07/2021 17:19:26 - INFO - __main__ - Step 142630: {'lr': 3.053887063583283e-06, 'samples': 27384960, 'steps': 142629, 'loss/train': 1.378865361213684} 11/07/2021 17:19:26 - INFO - __main__ - Step 142631: {'lr': 3.0530601881815776e-06, 'samples': 27385152, 'steps': 142630, 'loss/train': 1.0430774688720703} 11/07/2021 17:19:26 - INFO - __main__ - Step 142632: {'lr': 3.052233424050338e-06, 'samples': 27385344, 'steps': 142631, 'loss/train': 2.615814208984375} 11/07/2021 17:19:27 - INFO - __main__ - Step 142633: {'lr': 3.0514067711899796e-06, 'samples': 27385536, 'steps': 142632, 'loss/train': 1.154584527015686} 11/07/2021 17:19:28 - INFO - __main__ - Step 142634: {'lr': 3.0505802296008354e-06, 'samples': 27385728, 'steps': 142633, 'loss/train': 1.702627182006836} 11/07/2021 17:19:28 - INFO - __main__ - Step 142635: {'lr': 3.049753799283267e-06, 'samples': 27385920, 'steps': 142634, 'loss/train': 1.5329926013946533} 11/07/2021 17:19:28 - INFO - __main__ - Step 142636: {'lr': 3.04892748023769e-06, 'samples': 27386112, 'steps': 142635, 'loss/train': 1.311407208442688} 11/07/2021 17:19:29 - INFO - __main__ - Step 142637: {'lr': 3.0481012724644375e-06, 'samples': 27386304, 'steps': 142636, 'loss/train': 1.1345586776733398} 11/07/2021 17:19:30 - INFO - __main__ - Step 142638: {'lr': 3.0472751759639263e-06, 'samples': 27386496, 'steps': 142637, 'loss/train': 1.6330845355987549} 11/07/2021 17:19:30 - INFO - __main__ - Step 142639: {'lr': 3.046449190736461e-06, 'samples': 27386688, 'steps': 142638, 'loss/train': 1.0360115766525269} 11/07/2021 17:19:30 - INFO - __main__ - Step 142640: {'lr': 3.0456233167824865e-06, 'samples': 27386880, 'steps': 142639, 'loss/train': 1.5400511026382446} 11/07/2021 17:19:31 - INFO - __main__ - Step 142641: {'lr': 3.0447975541023354e-06, 'samples': 27387072, 'steps': 142640, 'loss/train': 1.1977568864822388} 11/07/2021 17:19:31 - INFO - __main__ - Step 142642: {'lr': 3.0439719026963965e-06, 'samples': 27387264, 'steps': 142641, 'loss/train': 0.8626246452331543} 11/07/2021 17:19:32 - INFO - __main__ - Step 142643: {'lr': 3.0431463625650302e-06, 'samples': 27387456, 'steps': 142642, 'loss/train': 1.1033419370651245} 11/07/2021 17:19:33 - INFO - __main__ - Step 142644: {'lr': 3.0423209337086253e-06, 'samples': 27387648, 'steps': 142643, 'loss/train': 1.3740513324737549} 11/07/2021 17:19:33 - INFO - __main__ - Step 142645: {'lr': 3.0414956161275155e-06, 'samples': 27387840, 'steps': 142644, 'loss/train': 1.5652527809143066} 11/07/2021 17:19:33 - INFO - __main__ - Step 142646: {'lr': 3.040670409822116e-06, 'samples': 27388032, 'steps': 142645, 'loss/train': 1.0200918912887573} 11/07/2021 17:19:34 - INFO - __main__ - Step 142647: {'lr': 3.0398453147927605e-06, 'samples': 27388224, 'steps': 142646, 'loss/train': 1.5008466243743896} 11/07/2021 17:19:34 - INFO - __main__ - Step 142648: {'lr': 3.039020331039838e-06, 'samples': 27388416, 'steps': 142647, 'loss/train': 1.4785969257354736} 11/07/2021 17:19:35 - INFO - __main__ - Step 142649: {'lr': 3.0381954585637362e-06, 'samples': 27388608, 'steps': 142648, 'loss/train': 0.9918891191482544} 11/07/2021 17:19:35 - INFO - __main__ - Step 142650: {'lr': 3.0373706973647885e-06, 'samples': 27388800, 'steps': 142649, 'loss/train': 1.5908679962158203} 11/07/2021 17:19:36 - INFO - __main__ - Step 142651: {'lr': 3.0365460474434115e-06, 'samples': 27388992, 'steps': 142650, 'loss/train': 0.780810534954071} 11/07/2021 17:19:36 - INFO - __main__ - Step 142652: {'lr': 3.0357215087999655e-06, 'samples': 27389184, 'steps': 142651, 'loss/train': 1.1874009370803833} 11/07/2021 17:19:36 - INFO - __main__ - Step 142653: {'lr': 3.034897081434812e-06, 'samples': 27389376, 'steps': 142652, 'loss/train': 1.3902112245559692} 11/07/2021 17:19:38 - INFO - __main__ - Step 142654: {'lr': 3.0340727653483115e-06, 'samples': 27389568, 'steps': 142653, 'loss/train': 1.4254907369613647} 11/07/2021 17:19:38 - INFO - __main__ - Step 142655: {'lr': 3.033248560540852e-06, 'samples': 27389760, 'steps': 142654, 'loss/train': 1.276884913444519} 11/07/2021 17:19:38 - INFO - __main__ - Step 142656: {'lr': 3.0324244670127956e-06, 'samples': 27389952, 'steps': 142655, 'loss/train': 0.6056091785430908} 11/07/2021 17:19:39 - INFO - __main__ - Step 142657: {'lr': 3.031600484764502e-06, 'samples': 27390144, 'steps': 142656, 'loss/train': 1.7228708267211914} 11/07/2021 17:19:39 - INFO - __main__ - Step 142658: {'lr': 3.0307766137963876e-06, 'samples': 27390336, 'steps': 142657, 'loss/train': 1.0816153287887573} 11/07/2021 17:19:40 - INFO - __main__ - Step 142659: {'lr': 3.0299528541087862e-06, 'samples': 27390528, 'steps': 142658, 'loss/train': 0.9532233476638794} 11/07/2021 17:19:40 - INFO - __main__ - Step 142660: {'lr': 3.029129205702058e-06, 'samples': 27390720, 'steps': 142659, 'loss/train': 0.99903404712677} 11/07/2021 17:19:41 - INFO - __main__ - Step 142661: {'lr': 3.02830566857662e-06, 'samples': 27390912, 'steps': 142660, 'loss/train': 1.2624479532241821} 11/07/2021 17:19:41 - INFO - __main__ - Step 142662: {'lr': 3.027482242732804e-06, 'samples': 27391104, 'steps': 142661, 'loss/train': 1.446356177330017} 11/07/2021 17:19:41 - INFO - __main__ - Step 142663: {'lr': 3.026658928170972e-06, 'samples': 27391296, 'steps': 142662, 'loss/train': 1.4766006469726562} 11/07/2021 17:19:44 - INFO - __main__ - Step 142664: {'lr': 3.02583572489154e-06, 'samples': 27391488, 'steps': 142663, 'loss/train': 1.1622620820999146} 11/07/2021 17:19:44 - INFO - __main__ - Step 142665: {'lr': 3.025012632894841e-06, 'samples': 27391680, 'steps': 142664, 'loss/train': 1.0224591493606567} 11/07/2021 17:19:44 - INFO - __main__ - Step 142666: {'lr': 3.0241896521812917e-06, 'samples': 27391872, 'steps': 142665, 'loss/train': 1.7771992683410645} 11/07/2021 17:19:45 - INFO - __main__ - Step 142667: {'lr': 3.0233667827512248e-06, 'samples': 27392064, 'steps': 142666, 'loss/train': 1.5228151082992554} 11/07/2021 17:19:45 - INFO - __main__ - Step 142668: {'lr': 3.022544024605001e-06, 'samples': 27392256, 'steps': 142667, 'loss/train': 1.703397512435913} 11/07/2021 17:19:45 - INFO - __main__ - Step 142669: {'lr': 3.0217213777430086e-06, 'samples': 27392448, 'steps': 142668, 'loss/train': 1.7076797485351562} 11/07/2021 17:19:46 - INFO - __main__ - Step 142670: {'lr': 3.0208988421656093e-06, 'samples': 27392640, 'steps': 142669, 'loss/train': 1.6402387619018555} 11/07/2021 17:19:47 - INFO - __main__ - Step 142671: {'lr': 3.020076417873191e-06, 'samples': 27392832, 'steps': 142670, 'loss/train': 1.07144033908844} 11/07/2021 17:19:47 - INFO - __main__ - Step 142672: {'lr': 3.019254104866115e-06, 'samples': 27393024, 'steps': 142671, 'loss/train': 1.9912523031234741} 11/07/2021 17:19:48 - INFO - __main__ - Step 142673: {'lr': 3.0184319031447693e-06, 'samples': 27393216, 'steps': 142672, 'loss/train': 1.4875524044036865} 11/07/2021 17:19:48 - INFO - __main__ - Step 142674: {'lr': 3.0176098127094876e-06, 'samples': 27393408, 'steps': 142673, 'loss/train': 0.21918943524360657} 11/07/2021 17:19:48 - INFO - __main__ - Step 142675: {'lr': 3.016787833560658e-06, 'samples': 27393600, 'steps': 142674, 'loss/train': 1.5125675201416016} 11/07/2021 17:19:49 - INFO - __main__ - Step 142676: {'lr': 3.0159659656986693e-06, 'samples': 27393792, 'steps': 142675, 'loss/train': 1.0704829692840576} 11/07/2021 17:19:50 - INFO - __main__ - Step 142677: {'lr': 3.0151442091238548e-06, 'samples': 27393984, 'steps': 142676, 'loss/train': 1.3381116390228271} 11/07/2021 17:19:50 - INFO - __main__ - Step 142678: {'lr': 3.0143225638366025e-06, 'samples': 27394176, 'steps': 142677, 'loss/train': 1.2066494226455688} 11/07/2021 17:19:50 - INFO - __main__ - Step 142679: {'lr': 3.0135010298373012e-06, 'samples': 27394368, 'steps': 142678, 'loss/train': 2.0655648708343506} 11/07/2021 17:19:51 - INFO - __main__ - Step 142680: {'lr': 3.012679607126312e-06, 'samples': 27394560, 'steps': 142679, 'loss/train': 1.131143569946289} 11/07/2021 17:19:51 - INFO - __main__ - Step 142681: {'lr': 3.0118582957039953e-06, 'samples': 27394752, 'steps': 142680, 'loss/train': 1.4762357473373413} 11/07/2021 17:19:52 - INFO - __main__ - Step 142682: {'lr': 3.011037095570712e-06, 'samples': 27394944, 'steps': 142681, 'loss/train': 1.2797380685806274} 11/07/2021 17:19:53 - INFO - __main__ - Step 142683: {'lr': 3.0102160067268515e-06, 'samples': 27395136, 'steps': 142682, 'loss/train': 1.4866403341293335} 11/07/2021 17:19:53 - INFO - __main__ - Step 142684: {'lr': 3.0093950291727736e-06, 'samples': 27395328, 'steps': 142683, 'loss/train': 1.0861642360687256} 11/07/2021 17:19:53 - INFO - __main__ - Step 142685: {'lr': 3.008574162908839e-06, 'samples': 27395520, 'steps': 142684, 'loss/train': 1.5134974718093872} 11/07/2021 17:19:54 - INFO - __main__ - Step 142686: {'lr': 3.0077534079354372e-06, 'samples': 27395712, 'steps': 142685, 'loss/train': 1.4337553977966309} 11/07/2021 17:19:55 - INFO - __main__ - Step 142687: {'lr': 3.006932764252929e-06, 'samples': 27395904, 'steps': 142686, 'loss/train': 1.525759220123291} 11/07/2021 17:19:55 - INFO - __main__ - Step 142688: {'lr': 3.0061122318617018e-06, 'samples': 27396096, 'steps': 142687, 'loss/train': 1.5248439311981201} 11/07/2021 17:19:55 - INFO - __main__ - Step 142689: {'lr': 3.0052918107620895e-06, 'samples': 27396288, 'steps': 142688, 'loss/train': 1.0670356750488281} 11/07/2021 17:19:56 - INFO - __main__ - Step 142690: {'lr': 3.004471500954481e-06, 'samples': 27396480, 'steps': 142689, 'loss/train': 1.0036613941192627} 11/07/2021 17:19:56 - INFO - __main__ - Step 142691: {'lr': 3.0036513024392644e-06, 'samples': 27396672, 'steps': 142690, 'loss/train': 1.3295998573303223} 11/07/2021 17:19:57 - INFO - __main__ - Step 142692: {'lr': 3.0028312152167724e-06, 'samples': 27396864, 'steps': 142691, 'loss/train': 1.3597900867462158} 11/07/2021 17:19:58 - INFO - __main__ - Step 142693: {'lr': 3.002011239287422e-06, 'samples': 27397056, 'steps': 142692, 'loss/train': 1.351122260093689} 11/07/2021 17:19:58 - INFO - __main__ - Step 142694: {'lr': 3.0011913746515462e-06, 'samples': 27397248, 'steps': 142693, 'loss/train': 1.7160513401031494} 11/07/2021 17:19:58 - INFO - __main__ - Step 142695: {'lr': 3.0003716213095054e-06, 'samples': 27397440, 'steps': 142694, 'loss/train': 1.4845733642578125} 11/07/2021 17:19:59 - INFO - __main__ - Step 142696: {'lr': 2.9995519792616887e-06, 'samples': 27397632, 'steps': 142695, 'loss/train': 1.2327029705047607} 11/07/2021 17:19:59 - INFO - __main__ - Step 142697: {'lr': 2.9987324485084567e-06, 'samples': 27397824, 'steps': 142696, 'loss/train': 1.502932071685791} 11/07/2021 17:20:00 - INFO - __main__ - Step 142698: {'lr': 2.9979130290501976e-06, 'samples': 27398016, 'steps': 142697, 'loss/train': 0.31430545449256897} 11/07/2021 17:20:01 - INFO - __main__ - Step 142699: {'lr': 2.9970937208872727e-06, 'samples': 27398208, 'steps': 142698, 'loss/train': 1.0926518440246582} 11/07/2021 17:20:01 - INFO - __main__ - Step 142700: {'lr': 2.9962745240200153e-06, 'samples': 27398400, 'steps': 142699, 'loss/train': 1.4600447416305542} 11/07/2021 17:20:01 - INFO - __main__ - Step 142701: {'lr': 2.995455438448841e-06, 'samples': 27398592, 'steps': 142700, 'loss/train': 1.1115174293518066} 11/07/2021 17:20:02 - INFO - __main__ - Step 142702: {'lr': 2.994636464174111e-06, 'samples': 27398784, 'steps': 142701, 'loss/train': 1.1093072891235352} 11/07/2021 17:20:02 - INFO - __main__ - Step 142703: {'lr': 2.9938176011961858e-06, 'samples': 27398976, 'steps': 142702, 'loss/train': 1.5019415616989136} 11/07/2021 17:20:03 - INFO - __main__ - Step 142704: {'lr': 2.992998849515427e-06, 'samples': 27399168, 'steps': 142703, 'loss/train': 1.4684945344924927} 11/07/2021 17:20:03 - INFO - __main__ - Step 142705: {'lr': 2.9921802091322227e-06, 'samples': 27399360, 'steps': 142704, 'loss/train': 1.2595089673995972} 11/07/2021 17:20:04 - INFO - __main__ - Step 142706: {'lr': 2.9913616800469055e-06, 'samples': 27399552, 'steps': 142705, 'loss/train': 1.4225172996520996} 11/07/2021 17:20:04 - INFO - __main__ - Step 142707: {'lr': 2.9905432622598926e-06, 'samples': 27399744, 'steps': 142706, 'loss/train': 1.6209896802902222} 11/07/2021 17:20:04 - INFO - __main__ - Step 142708: {'lr': 2.9897249557715445e-06, 'samples': 27399936, 'steps': 142707, 'loss/train': 1.4753212928771973} 11/07/2021 17:20:05 - INFO - __main__ - Step 142709: {'lr': 2.988906760582194e-06, 'samples': 27400128, 'steps': 142708, 'loss/train': 1.3708561658859253} 11/07/2021 17:20:06 - INFO - __main__ - Step 142710: {'lr': 2.98808867669223e-06, 'samples': 27400320, 'steps': 142709, 'loss/train': 1.3984601497650146} 11/07/2021 17:20:06 - INFO - __main__ - Step 142711: {'lr': 2.987270704102013e-06, 'samples': 27400512, 'steps': 142710, 'loss/train': 0.7493411898612976} 11/07/2021 17:20:07 - INFO - __main__ - Step 142712: {'lr': 2.986452842811932e-06, 'samples': 27400704, 'steps': 142711, 'loss/train': 0.976443350315094} 11/07/2021 17:20:07 - INFO - __main__ - Step 142713: {'lr': 2.9856350928223475e-06, 'samples': 27400896, 'steps': 142712, 'loss/train': 1.135141134262085} 11/07/2021 17:20:08 - INFO - __main__ - Step 142714: {'lr': 2.9848174541336205e-06, 'samples': 27401088, 'steps': 142713, 'loss/train': 1.2990870475769043} 11/07/2021 17:20:08 - INFO - __main__ - Step 142715: {'lr': 2.9839999267461116e-06, 'samples': 27401280, 'steps': 142714, 'loss/train': 1.0563328266143799} 11/07/2021 17:20:09 - INFO - __main__ - Step 142716: {'lr': 2.9831825106602096e-06, 'samples': 27401472, 'steps': 142715, 'loss/train': 1.1813008785247803} 11/07/2021 17:20:09 - INFO - __main__ - Step 142717: {'lr': 2.982365205876275e-06, 'samples': 27401664, 'steps': 142716, 'loss/train': 1.7491663694381714} 11/07/2021 17:20:09 - INFO - __main__ - Step 142718: {'lr': 2.9815480123946693e-06, 'samples': 27401856, 'steps': 142717, 'loss/train': 1.2920207977294922} 11/07/2021 17:20:10 - INFO - __main__ - Step 142719: {'lr': 2.9807309302157802e-06, 'samples': 27402048, 'steps': 142718, 'loss/train': 1.3592816591262817} 11/07/2021 17:20:11 - INFO - __main__ - Step 142720: {'lr': 2.9799139593399414e-06, 'samples': 27402240, 'steps': 142719, 'loss/train': 1.4800416231155396} 11/07/2021 17:20:11 - INFO - __main__ - Step 142721: {'lr': 2.979097099767569e-06, 'samples': 27402432, 'steps': 142720, 'loss/train': 1.690476655960083} 11/07/2021 17:20:11 - INFO - __main__ - Step 142722: {'lr': 2.978280351498969e-06, 'samples': 27402624, 'steps': 142721, 'loss/train': 1.4818531274795532} 11/07/2021 17:20:12 - INFO - __main__ - Step 142723: {'lr': 2.977463714534584e-06, 'samples': 27402816, 'steps': 142722, 'loss/train': 1.2698354721069336} 11/07/2021 17:20:12 - INFO - __main__ - Step 142724: {'lr': 2.9766471888747204e-06, 'samples': 27403008, 'steps': 142723, 'loss/train': 1.3307499885559082} 11/07/2021 17:20:13 - INFO - __main__ - Step 142725: {'lr': 2.9758307745197665e-06, 'samples': 27403200, 'steps': 142724, 'loss/train': 1.7143150568008423} 11/07/2021 17:20:13 - INFO - __main__ - Step 142726: {'lr': 2.9750144714700834e-06, 'samples': 27403392, 'steps': 142725, 'loss/train': 0.9909108877182007} 11/07/2021 17:20:14 - INFO - __main__ - Step 142727: {'lr': 2.9741982797260593e-06, 'samples': 27403584, 'steps': 142726, 'loss/train': 1.110968828201294} 11/07/2021 17:20:14 - INFO - __main__ - Step 142728: {'lr': 2.9733821992880273e-06, 'samples': 27403776, 'steps': 142727, 'loss/train': 1.534027099609375} 11/07/2021 17:20:15 - INFO - __main__ - Step 142729: {'lr': 2.9725662301564036e-06, 'samples': 27403968, 'steps': 142728, 'loss/train': 1.5407456159591675} 11/07/2021 17:20:16 - INFO - __main__ - Step 142730: {'lr': 2.971750372331522e-06, 'samples': 27404160, 'steps': 142729, 'loss/train': 1.2982149124145508} 11/07/2021 17:20:16 - INFO - __main__ - Step 142731: {'lr': 2.9709346258137703e-06, 'samples': 27404352, 'steps': 142730, 'loss/train': 1.2768454551696777} 11/07/2021 17:20:16 - INFO - __main__ - Step 142732: {'lr': 2.970118990603482e-06, 'samples': 27404544, 'steps': 142731, 'loss/train': 0.7823470234870911} 11/07/2021 17:20:17 - INFO - __main__ - Step 142733: {'lr': 2.9693034667010453e-06, 'samples': 27404736, 'steps': 142732, 'loss/train': 0.9098199605941772} 11/07/2021 17:20:17 - INFO - __main__ - Step 142734: {'lr': 2.9684880541068495e-06, 'samples': 27404928, 'steps': 142733, 'loss/train': 1.6362733840942383} 11/07/2021 17:20:18 - INFO - __main__ - Step 142735: {'lr': 2.967672752821254e-06, 'samples': 27405120, 'steps': 142734, 'loss/train': 1.1618125438690186} 11/07/2021 17:20:18 - INFO - __main__ - Step 142736: {'lr': 2.966857562844566e-06, 'samples': 27405312, 'steps': 142735, 'loss/train': 1.4415359497070312} 11/07/2021 17:20:19 - INFO - __main__ - Step 142737: {'lr': 2.966042484177228e-06, 'samples': 27405504, 'steps': 142736, 'loss/train': 1.3126853704452515} 11/07/2021 17:20:19 - INFO - __main__ - Step 142738: {'lr': 2.965227516819574e-06, 'samples': 27405696, 'steps': 142737, 'loss/train': 1.307429313659668} 11/07/2021 17:20:19 - INFO - __main__ - Step 142739: {'lr': 2.9644126607719923e-06, 'samples': 27405888, 'steps': 142738, 'loss/train': 0.8385583758354187} 11/07/2021 17:20:20 - INFO - __main__ - Step 142740: {'lr': 2.963597916034816e-06, 'samples': 27406080, 'steps': 142739, 'loss/train': 1.3308643102645874} 11/07/2021 17:20:21 - INFO - __main__ - Step 142741: {'lr': 2.9627832826084335e-06, 'samples': 27406272, 'steps': 142740, 'loss/train': 0.940527081489563} 11/07/2021 17:20:21 - INFO - __main__ - Step 142742: {'lr': 2.9619687604932056e-06, 'samples': 27406464, 'steps': 142741, 'loss/train': 1.3998314142227173} 11/07/2021 17:20:22 - INFO - __main__ - Step 142743: {'lr': 2.9611543496894933e-06, 'samples': 27406656, 'steps': 142742, 'loss/train': 1.5638453960418701} 11/07/2021 17:20:22 - INFO - __main__ - Step 142744: {'lr': 2.9603400501976853e-06, 'samples': 27406848, 'steps': 142743, 'loss/train': 1.1290799379348755} 11/07/2021 17:20:22 - INFO - __main__ - Step 142745: {'lr': 2.959525862018142e-06, 'samples': 27407040, 'steps': 142744, 'loss/train': 1.9760222434997559} 11/07/2021 17:20:23 - INFO - __main__ - Step 142746: {'lr': 2.9587117851512246e-06, 'samples': 27407232, 'steps': 142745, 'loss/train': 1.3736493587493896} 11/07/2021 17:20:24 - INFO - __main__ - Step 142747: {'lr': 2.9578978195972937e-06, 'samples': 27407424, 'steps': 142746, 'loss/train': 1.5670655965805054} 11/07/2021 17:20:24 - INFO - __main__ - Step 142748: {'lr': 2.9570839653567383e-06, 'samples': 27407616, 'steps': 142747, 'loss/train': 1.4298518896102905} 11/07/2021 17:20:24 - INFO - __main__ - Step 142749: {'lr': 2.956270222429891e-06, 'samples': 27407808, 'steps': 142748, 'loss/train': 1.3382210731506348} 11/07/2021 17:20:25 - INFO - __main__ - Step 142750: {'lr': 2.9554565908171405e-06, 'samples': 27408000, 'steps': 142749, 'loss/train': 0.7888666987419128} 11/07/2021 17:20:26 - INFO - __main__ - Step 142751: {'lr': 2.9546430705188477e-06, 'samples': 27408192, 'steps': 142750, 'loss/train': 1.6662039756774902} 11/07/2021 17:20:26 - INFO - __main__ - Step 142752: {'lr': 2.9538296615353734e-06, 'samples': 27408384, 'steps': 142751, 'loss/train': 1.484194278717041} 11/07/2021 17:20:27 - INFO - __main__ - Step 142753: {'lr': 2.953016363867078e-06, 'samples': 27408576, 'steps': 142752, 'loss/train': 1.7166675329208374} 11/07/2021 17:20:27 - INFO - __main__ - Step 142754: {'lr': 2.952203177514379e-06, 'samples': 27408768, 'steps': 142753, 'loss/train': 1.4169700145721436} 11/07/2021 17:20:27 - INFO - __main__ - Step 142755: {'lr': 2.95139010247758e-06, 'samples': 27408960, 'steps': 142754, 'loss/train': 1.3392515182495117} 11/07/2021 17:20:28 - INFO - __main__ - Step 142756: {'lr': 2.9505771387570713e-06, 'samples': 27409152, 'steps': 142755, 'loss/train': 1.3174561262130737} 11/07/2021 17:20:29 - INFO - __main__ - Step 142757: {'lr': 2.9497642863532402e-06, 'samples': 27409344, 'steps': 142756, 'loss/train': 1.3563657999038696} 11/07/2021 17:20:29 - INFO - __main__ - Step 142758: {'lr': 2.9489515452664206e-06, 'samples': 27409536, 'steps': 142757, 'loss/train': 0.9167950749397278} 11/07/2021 17:20:29 - INFO - __main__ - Step 142759: {'lr': 2.948138915496973e-06, 'samples': 27409728, 'steps': 142758, 'loss/train': 1.0965131521224976} 11/07/2021 17:20:30 - INFO - __main__ - Step 142760: {'lr': 2.9473263970453134e-06, 'samples': 27409920, 'steps': 142759, 'loss/train': 1.4970378875732422} 11/07/2021 17:20:30 - INFO - __main__ - Step 142761: {'lr': 2.9465139899117754e-06, 'samples': 27410112, 'steps': 142760, 'loss/train': 1.5671629905700684} 11/07/2021 17:20:31 - INFO - __main__ - Step 142762: {'lr': 2.9457016940967198e-06, 'samples': 27410304, 'steps': 142761, 'loss/train': 1.3769270181655884} 11/07/2021 17:20:32 - INFO - __main__ - Step 142763: {'lr': 2.944889509600507e-06, 'samples': 27410496, 'steps': 142762, 'loss/train': 1.2894291877746582} 11/07/2021 17:20:32 - INFO - __main__ - Step 142764: {'lr': 2.9440774364235256e-06, 'samples': 27410688, 'steps': 142763, 'loss/train': 0.5357548594474792} 11/07/2021 17:20:32 - INFO - __main__ - Step 142765: {'lr': 2.943265474566109e-06, 'samples': 27410880, 'steps': 142764, 'loss/train': 0.89985191822052} 11/07/2021 17:20:33 - INFO - __main__ - Step 142766: {'lr': 2.942453624028674e-06, 'samples': 27411072, 'steps': 142765, 'loss/train': 1.2631983757019043} 11/07/2021 17:20:34 - INFO - __main__ - Step 142767: {'lr': 2.9416418848115243e-06, 'samples': 27411264, 'steps': 142766, 'loss/train': 1.2868099212646484} 11/07/2021 17:20:34 - INFO - __main__ - Step 142768: {'lr': 2.940830256915078e-06, 'samples': 27411456, 'steps': 142767, 'loss/train': 1.2281314134597778} 11/07/2021 17:20:35 - INFO - __main__ - Step 142769: {'lr': 2.940018740339695e-06, 'samples': 27411648, 'steps': 142768, 'loss/train': 1.2064054012298584} 11/07/2021 17:20:35 - INFO - __main__ - Step 142770: {'lr': 2.9392073350857085e-06, 'samples': 27411840, 'steps': 142769, 'loss/train': 1.4081318378448486} 11/07/2021 17:20:35 - INFO - __main__ - Step 142771: {'lr': 2.938396041153507e-06, 'samples': 27412032, 'steps': 142770, 'loss/train': 1.3313020467758179} 11/07/2021 17:20:36 - INFO - __main__ - Step 142772: {'lr': 2.9375848585434516e-06, 'samples': 27412224, 'steps': 142771, 'loss/train': 1.219391942024231} 11/07/2021 17:20:37 - INFO - __main__ - Step 142773: {'lr': 2.936773787255903e-06, 'samples': 27412416, 'steps': 142772, 'loss/train': 0.7591325640678406} 11/07/2021 17:20:37 - INFO - __main__ - Step 142774: {'lr': 2.9359628272912496e-06, 'samples': 27412608, 'steps': 142773, 'loss/train': 1.665669560432434} 11/07/2021 17:20:37 - INFO - __main__ - Step 142775: {'lr': 2.935151978649825e-06, 'samples': 27412800, 'steps': 142774, 'loss/train': 0.9897587299346924} 11/07/2021 17:20:38 - INFO - __main__ - Step 142776: {'lr': 2.934341241332017e-06, 'samples': 27412992, 'steps': 142775, 'loss/train': 1.7162292003631592} 11/07/2021 17:20:39 - INFO - __main__ - Step 142777: {'lr': 2.933530615338187e-06, 'samples': 27413184, 'steps': 142776, 'loss/train': 1.1195393800735474} 11/07/2021 17:20:39 - INFO - __main__ - Step 142778: {'lr': 2.9327201006686677e-06, 'samples': 27413376, 'steps': 142777, 'loss/train': 1.2578682899475098} 11/07/2021 17:20:40 - INFO - __main__ - Step 142779: {'lr': 2.9319096973238755e-06, 'samples': 27413568, 'steps': 142778, 'loss/train': 1.2494211196899414} 11/07/2021 17:20:40 - INFO - __main__ - Step 142780: {'lr': 2.931099405304144e-06, 'samples': 27413760, 'steps': 142779, 'loss/train': 1.0127644538879395} 11/07/2021 17:20:40 - INFO - __main__ - Step 142781: {'lr': 2.930289224609861e-06, 'samples': 27413952, 'steps': 142780, 'loss/train': 1.7569684982299805} 11/07/2021 17:20:41 - INFO - __main__ - Step 142782: {'lr': 2.9294791552413603e-06, 'samples': 27414144, 'steps': 142781, 'loss/train': 1.3678544759750366} 11/07/2021 17:20:42 - INFO - __main__ - Step 142783: {'lr': 2.92866919719903e-06, 'samples': 27414336, 'steps': 142782, 'loss/train': 1.3944780826568604} 11/07/2021 17:20:42 - INFO - __main__ - Step 142784: {'lr': 2.9278593504832307e-06, 'samples': 27414528, 'steps': 142783, 'loss/train': 1.1823865175247192} 11/07/2021 17:20:42 - INFO - __main__ - Step 142785: {'lr': 2.927049615094296e-06, 'samples': 27414720, 'steps': 142784, 'loss/train': 1.4091078042984009} 11/07/2021 17:20:43 - INFO - __main__ - Step 142786: {'lr': 2.92623999103267e-06, 'samples': 27414912, 'steps': 142785, 'loss/train': 1.659454584121704} 11/07/2021 17:20:44 - INFO - __main__ - Step 142787: {'lr': 2.9254304782986297e-06, 'samples': 27415104, 'steps': 142786, 'loss/train': 0.05545838922262192} 11/07/2021 17:20:44 - INFO - __main__ - Step 142788: {'lr': 2.9246210768926196e-06, 'samples': 27415296, 'steps': 142787, 'loss/train': 1.1475138664245605} 11/07/2021 17:20:44 - INFO - __main__ - Step 142789: {'lr': 2.9238117868149173e-06, 'samples': 27415488, 'steps': 142788, 'loss/train': 1.1864008903503418} 11/07/2021 17:20:45 - INFO - __main__ - Step 142790: {'lr': 2.9230026080659664e-06, 'samples': 27415680, 'steps': 142789, 'loss/train': 1.4679384231567383} 11/07/2021 17:20:45 - INFO - __main__ - Step 142791: {'lr': 2.922193540646073e-06, 'samples': 27415872, 'steps': 142790, 'loss/train': 1.2949202060699463} 11/07/2021 17:20:46 - INFO - __main__ - Step 142792: {'lr': 2.9213845845556253e-06, 'samples': 27416064, 'steps': 142791, 'loss/train': 1.7547407150268555} 11/07/2021 17:20:47 - INFO - __main__ - Step 142793: {'lr': 2.920575739795012e-06, 'samples': 27416256, 'steps': 142792, 'loss/train': 1.212540626525879} 11/07/2021 17:20:47 - INFO - __main__ - Step 142794: {'lr': 2.9197670063645655e-06, 'samples': 27416448, 'steps': 142793, 'loss/train': 0.8861299157142639} 11/07/2021 17:20:47 - INFO - __main__ - Step 142795: {'lr': 2.918958384264647e-06, 'samples': 27416640, 'steps': 142794, 'loss/train': 1.324187159538269} 11/07/2021 17:20:48 - INFO - __main__ - Step 142796: {'lr': 2.9181498734956456e-06, 'samples': 27416832, 'steps': 142795, 'loss/train': 1.8040510416030884} 11/07/2021 17:20:48 - INFO - __main__ - Step 142797: {'lr': 2.917341474057894e-06, 'samples': 27417024, 'steps': 142796, 'loss/train': 1.3105329275131226} 11/07/2021 17:20:49 - INFO - __main__ - Step 142798: {'lr': 2.916533185951781e-06, 'samples': 27417216, 'steps': 142797, 'loss/train': 0.8786587715148926} 11/07/2021 17:20:49 - INFO - __main__ - Step 142799: {'lr': 2.9157250091776942e-06, 'samples': 27417408, 'steps': 142798, 'loss/train': 1.441166639328003} 11/07/2021 17:20:50 - INFO - __main__ - Step 142800: {'lr': 2.9149169437359403e-06, 'samples': 27417600, 'steps': 142799, 'loss/train': 2.059474468231201} 11/07/2021 17:20:50 - INFO - __main__ - Step 142801: {'lr': 2.9141089896269345e-06, 'samples': 27417792, 'steps': 142800, 'loss/train': 1.2109090089797974} 11/07/2021 17:20:51 - INFO - __main__ - Step 142802: {'lr': 2.913301146851011e-06, 'samples': 27417984, 'steps': 142801, 'loss/train': 1.7583805322647095} 11/07/2021 17:20:52 - INFO - __main__ - Step 142803: {'lr': 2.912493415408529e-06, 'samples': 27418176, 'steps': 142802, 'loss/train': 1.4098402261734009} 11/07/2021 17:20:52 - INFO - __main__ - Step 142804: {'lr': 2.9116857952998787e-06, 'samples': 27418368, 'steps': 142803, 'loss/train': 0.9906109571456909} 11/07/2021 17:20:53 - INFO - __main__ - Step 142805: {'lr': 2.9108782865253923e-06, 'samples': 27418560, 'steps': 142804, 'loss/train': 1.3543751239776611} 11/07/2021 17:20:53 - INFO - __main__ - Step 142806: {'lr': 2.910070889085459e-06, 'samples': 27418752, 'steps': 142805, 'loss/train': 1.1276921033859253} 11/07/2021 17:20:53 - INFO - __main__ - Step 142807: {'lr': 2.9092636029804387e-06, 'samples': 27418944, 'steps': 142806, 'loss/train': 1.3625314235687256} 11/07/2021 17:20:54 - INFO - __main__ - Step 142808: {'lr': 2.9084564282106928e-06, 'samples': 27419136, 'steps': 142807, 'loss/train': 1.1686850786209106} 11/07/2021 17:20:55 - INFO - __main__ - Step 142809: {'lr': 2.907649364776582e-06, 'samples': 27419328, 'steps': 142808, 'loss/train': 0.06257408857345581} 11/07/2021 17:20:55 - INFO - __main__ - Step 142810: {'lr': 2.9068424126784674e-06, 'samples': 27419520, 'steps': 142809, 'loss/train': 1.3693370819091797} 11/07/2021 17:20:56 - INFO - __main__ - Step 142811: {'lr': 2.9060355719167376e-06, 'samples': 27419712, 'steps': 142810, 'loss/train': 0.6780030131340027} 11/07/2021 17:20:56 - INFO - __main__ - Step 142812: {'lr': 2.905228842491697e-06, 'samples': 27419904, 'steps': 142811, 'loss/train': 1.3414078950881958} 11/07/2021 17:20:56 - INFO - __main__ - Step 142813: {'lr': 2.904422224403763e-06, 'samples': 27420096, 'steps': 142812, 'loss/train': 0.5912939310073853} 11/07/2021 17:20:57 - INFO - __main__ - Step 142814: {'lr': 2.9036157176532964e-06, 'samples': 27420288, 'steps': 142813, 'loss/train': 1.838625192642212} 11/07/2021 17:20:58 - INFO - __main__ - Step 142815: {'lr': 2.902809322240657e-06, 'samples': 27420480, 'steps': 142814, 'loss/train': 1.12453031539917} 11/07/2021 17:20:58 - INFO - __main__ - Step 142816: {'lr': 2.9020030381661787e-06, 'samples': 27420672, 'steps': 142815, 'loss/train': 1.2682915925979614} 11/07/2021 17:20:58 - INFO - __main__ - Step 142817: {'lr': 2.90119686543025e-06, 'samples': 27420864, 'steps': 142816, 'loss/train': 1.6467764377593994} 11/07/2021 17:20:59 - INFO - __main__ - Step 142818: {'lr': 2.9003908040332315e-06, 'samples': 27421056, 'steps': 142817, 'loss/train': 1.381334900856018} 11/07/2021 17:21:00 - INFO - __main__ - Step 142819: {'lr': 2.8995848539754844e-06, 'samples': 27421248, 'steps': 142818, 'loss/train': 1.4634006023406982} 11/07/2021 17:21:00 - INFO - __main__ - Step 142820: {'lr': 2.898779015257341e-06, 'samples': 27421440, 'steps': 142819, 'loss/train': 1.2805863618850708} 11/07/2021 17:21:01 - INFO - __main__ - Step 142821: {'lr': 2.8979732878792463e-06, 'samples': 27421632, 'steps': 142820, 'loss/train': 1.1887351274490356} 11/07/2021 17:21:01 - INFO - __main__ - Step 142822: {'lr': 2.8971676718414774e-06, 'samples': 27421824, 'steps': 142821, 'loss/train': 1.1057206392288208} 11/07/2021 17:21:01 - INFO - __main__ - Step 142823: {'lr': 2.896362167144423e-06, 'samples': 27422016, 'steps': 142822, 'loss/train': 0.9304496645927429} 11/07/2021 17:21:02 - INFO - __main__ - Step 142824: {'lr': 2.8955567737884713e-06, 'samples': 27422208, 'steps': 142823, 'loss/train': 1.4954866170883179} 11/07/2021 17:21:03 - INFO - __main__ - Step 142825: {'lr': 2.8947514917739837e-06, 'samples': 27422400, 'steps': 142824, 'loss/train': 1.4187983274459839} 11/07/2021 17:21:03 - INFO - __main__ - Step 142826: {'lr': 2.893946321101293e-06, 'samples': 27422592, 'steps': 142825, 'loss/train': 1.2917474508285522} 11/07/2021 17:21:03 - INFO - __main__ - Step 142827: {'lr': 2.8931412617707597e-06, 'samples': 27422784, 'steps': 142826, 'loss/train': 1.2206577062606812} 11/07/2021 17:21:04 - INFO - __main__ - Step 142828: {'lr': 2.892336313782801e-06, 'samples': 27422976, 'steps': 142827, 'loss/train': 1.561567783355713} 11/07/2021 17:21:04 - INFO - __main__ - Step 142829: {'lr': 2.891531477137721e-06, 'samples': 27423168, 'steps': 142828, 'loss/train': 1.096915364265442} 11/07/2021 17:21:05 - INFO - __main__ - Step 142830: {'lr': 2.890726751835909e-06, 'samples': 27423360, 'steps': 142829, 'loss/train': 1.507744550704956} 11/07/2021 17:21:05 - INFO - __main__ - Step 142831: {'lr': 2.8899221378777262e-06, 'samples': 27423552, 'steps': 142830, 'loss/train': 0.7412917017936707} 11/07/2021 17:21:06 - INFO - __main__ - Step 142832: {'lr': 2.8891176352635053e-06, 'samples': 27423744, 'steps': 142831, 'loss/train': 1.4190975427627563} 11/07/2021 17:21:06 - INFO - __main__ - Step 142833: {'lr': 2.888313243993662e-06, 'samples': 27423936, 'steps': 142832, 'loss/train': 0.8896267414093018} 11/07/2021 17:21:07 - INFO - __main__ - Step 142834: {'lr': 2.8875089640685303e-06, 'samples': 27424128, 'steps': 142833, 'loss/train': 1.3542925119400024} 11/07/2021 17:21:08 - INFO - __main__ - Step 142835: {'lr': 2.8867047954884707e-06, 'samples': 27424320, 'steps': 142834, 'loss/train': 1.5680233240127563} 11/07/2021 17:21:08 - INFO - __main__ - Step 142836: {'lr': 2.8859007382538436e-06, 'samples': 27424512, 'steps': 142835, 'loss/train': 1.4893299341201782} 11/07/2021 17:21:08 - INFO - __main__ - Step 142837: {'lr': 2.8850967923650106e-06, 'samples': 27424704, 'steps': 142836, 'loss/train': 1.496116042137146} 11/07/2021 17:21:09 - INFO - __main__ - Step 142838: {'lr': 2.884292957822332e-06, 'samples': 27424896, 'steps': 142837, 'loss/train': 0.7986182570457458} 11/07/2021 17:21:09 - INFO - __main__ - Step 142839: {'lr': 2.8834892346261963e-06, 'samples': 27425088, 'steps': 142838, 'loss/train': 1.2299946546554565} 11/07/2021 17:21:10 - INFO - __main__ - Step 142840: {'lr': 2.8826856227769648e-06, 'samples': 27425280, 'steps': 142839, 'loss/train': 1.1848043203353882} 11/07/2021 17:21:11 - INFO - __main__ - Step 142841: {'lr': 2.8818821222749427e-06, 'samples': 27425472, 'steps': 142840, 'loss/train': 1.1991182565689087} 11/07/2021 17:21:11 - INFO - __main__ - Step 142842: {'lr': 2.8810787331205735e-06, 'samples': 27425664, 'steps': 142841, 'loss/train': 1.6479625701904297} 11/07/2021 17:21:11 - INFO - __main__ - Step 142843: {'lr': 2.880275455314135e-06, 'samples': 27425856, 'steps': 142842, 'loss/train': 1.6086479425430298} 11/07/2021 17:21:12 - INFO - __main__ - Step 142844: {'lr': 2.879472288856072e-06, 'samples': 27426048, 'steps': 142843, 'loss/train': 1.2091946601867676} 11/07/2021 17:21:12 - INFO - __main__ - Step 142845: {'lr': 2.8786692337466614e-06, 'samples': 27426240, 'steps': 142844, 'loss/train': 1.3728653192520142} 11/07/2021 17:21:13 - INFO - __main__ - Step 142846: {'lr': 2.8778662899863474e-06, 'samples': 27426432, 'steps': 142845, 'loss/train': 1.2340171337127686} 11/07/2021 17:21:13 - INFO - __main__ - Step 142847: {'lr': 2.877063457575435e-06, 'samples': 27426624, 'steps': 142846, 'loss/train': 1.3693797588348389} 11/07/2021 17:21:14 - INFO - __main__ - Step 142848: {'lr': 2.8762607365142856e-06, 'samples': 27426816, 'steps': 142847, 'loss/train': 1.3611384630203247} 11/07/2021 17:21:14 - INFO - __main__ - Step 142849: {'lr': 2.8754581268033152e-06, 'samples': 27427008, 'steps': 142848, 'loss/train': 1.1242774724960327} 11/07/2021 17:21:15 - INFO - __main__ - Step 142850: {'lr': 2.8746556284428294e-06, 'samples': 27427200, 'steps': 142849, 'loss/train': 1.3912497758865356} 11/07/2021 17:21:16 - INFO - __main__ - Step 142851: {'lr': 2.8738532414332165e-06, 'samples': 27427392, 'steps': 142850, 'loss/train': 1.2870172262191772} 11/07/2021 17:21:16 - INFO - __main__ - Step 142852: {'lr': 2.8730509657748373e-06, 'samples': 27427584, 'steps': 142851, 'loss/train': 1.1380553245544434} 11/07/2021 17:21:17 - INFO - __main__ - Step 142853: {'lr': 2.8722488014680248e-06, 'samples': 27427776, 'steps': 142852, 'loss/train': 1.1945942640304565} 11/07/2021 17:21:17 - INFO - __main__ - Step 142854: {'lr': 2.8714467485131956e-06, 'samples': 27427968, 'steps': 142853, 'loss/train': 1.276349663734436} 11/07/2021 17:21:17 - INFO - __main__ - Step 142855: {'lr': 2.870644806910655e-06, 'samples': 27428160, 'steps': 142854, 'loss/train': 1.6725391149520874} 11/07/2021 17:21:18 - INFO - __main__ - Step 142856: {'lr': 2.869842976660819e-06, 'samples': 27428352, 'steps': 142855, 'loss/train': 1.3539562225341797} 11/07/2021 17:21:19 - INFO - __main__ - Step 142857: {'lr': 2.8690412577639937e-06, 'samples': 27428544, 'steps': 142856, 'loss/train': 0.3754728138446808} 11/07/2021 17:21:19 - INFO - __main__ - Step 142858: {'lr': 2.8682396502205665e-06, 'samples': 27428736, 'steps': 142857, 'loss/train': 1.6578141450881958} 11/07/2021 17:21:20 - INFO - __main__ - Step 142859: {'lr': 2.8674381540308993e-06, 'samples': 27428928, 'steps': 142858, 'loss/train': 1.1916509866714478} 11/07/2021 17:21:20 - INFO - __main__ - Step 142860: {'lr': 2.866636769195352e-06, 'samples': 27429120, 'steps': 142859, 'loss/train': 1.352810025215149} 11/07/2021 17:21:20 - INFO - __main__ - Step 142861: {'lr': 2.8658354957142587e-06, 'samples': 27429312, 'steps': 142860, 'loss/train': 1.4112039804458618} 11/07/2021 17:21:21 - INFO - __main__ - Step 142862: {'lr': 2.865034333588035e-06, 'samples': 27429504, 'steps': 142861, 'loss/train': 1.7961862087249756} 11/07/2021 17:21:22 - INFO - __main__ - Step 142863: {'lr': 2.8642332828170135e-06, 'samples': 27429696, 'steps': 142862, 'loss/train': 1.5344184637069702} 11/07/2021 17:21:22 - INFO - __main__ - Step 142864: {'lr': 2.8634323434015564e-06, 'samples': 27429888, 'steps': 142863, 'loss/train': 0.9132359623908997} 11/07/2021 17:21:22 - INFO - __main__ - Step 142865: {'lr': 2.8626315153420234e-06, 'samples': 27430080, 'steps': 142864, 'loss/train': 1.6020326614379883} 11/07/2021 17:21:23 - INFO - __main__ - Step 142866: {'lr': 2.8618307986387484e-06, 'samples': 27430272, 'steps': 142865, 'loss/train': 1.5972285270690918} 11/07/2021 17:21:23 - INFO - __main__ - Step 142867: {'lr': 2.8610301932921467e-06, 'samples': 27430464, 'steps': 142866, 'loss/train': 1.4212422370910645} 11/07/2021 17:21:24 - INFO - __main__ - Step 142868: {'lr': 2.860229699302552e-06, 'samples': 27430656, 'steps': 142867, 'loss/train': 1.2411434650421143} 11/07/2021 17:21:24 - INFO - __main__ - Step 142869: {'lr': 2.8594293166703255e-06, 'samples': 27430848, 'steps': 142868, 'loss/train': 1.2226983308792114} 11/07/2021 17:21:25 - INFO - __main__ - Step 142870: {'lr': 2.8586290453957997e-06, 'samples': 27431040, 'steps': 142869, 'loss/train': 1.685990333557129} 11/07/2021 17:21:25 - INFO - __main__ - Step 142871: {'lr': 2.857828885479391e-06, 'samples': 27431232, 'steps': 142870, 'loss/train': 1.4163646697998047} 11/07/2021 17:21:25 - INFO - __main__ - Step 142872: {'lr': 2.857028836921405e-06, 'samples': 27431424, 'steps': 142871, 'loss/train': 1.6351070404052734} 11/07/2021 17:21:27 - INFO - __main__ - Step 142873: {'lr': 2.8562288997222576e-06, 'samples': 27431616, 'steps': 142872, 'loss/train': 1.3207017183303833} 11/07/2021 17:21:27 - INFO - __main__ - Step 142874: {'lr': 2.855429073882254e-06, 'samples': 27431808, 'steps': 142873, 'loss/train': 1.3267157077789307} 11/07/2021 17:21:27 - INFO - __main__ - Step 142875: {'lr': 2.8546293594017838e-06, 'samples': 27432000, 'steps': 142874, 'loss/train': 1.0231554508209229} 11/07/2021 17:21:28 - INFO - __main__ - Step 142876: {'lr': 2.8538297562812345e-06, 'samples': 27432192, 'steps': 142875, 'loss/train': 1.153712272644043} 11/07/2021 17:21:28 - INFO - __main__ - Step 142877: {'lr': 2.853030264520912e-06, 'samples': 27432384, 'steps': 142876, 'loss/train': 1.350154161453247} 11/07/2021 17:21:28 - INFO - __main__ - Step 142878: {'lr': 2.8522308841211763e-06, 'samples': 27432576, 'steps': 142877, 'loss/train': 5.646336555480957} 11/07/2021 17:21:29 - INFO - __main__ - Step 142879: {'lr': 2.851431615082445e-06, 'samples': 27432768, 'steps': 142878, 'loss/train': 0.4118913412094116} 11/07/2021 17:21:30 - INFO - __main__ - Step 142880: {'lr': 2.85063245740505e-06, 'samples': 27432960, 'steps': 142879, 'loss/train': 1.9407110214233398} 11/07/2021 17:21:30 - INFO - __main__ - Step 142881: {'lr': 2.8498334110893255e-06, 'samples': 27433152, 'steps': 142880, 'loss/train': 1.9438185691833496} 11/07/2021 17:21:30 - INFO - __main__ - Step 142882: {'lr': 2.849034476135659e-06, 'samples': 27433344, 'steps': 142881, 'loss/train': 1.0409126281738281} 11/07/2021 17:21:31 - INFO - __main__ - Step 142883: {'lr': 2.848235652544412e-06, 'samples': 27433536, 'steps': 142882, 'loss/train': 1.6053273677825928} 11/07/2021 17:21:32 - INFO - __main__ - Step 142884: {'lr': 2.8474369403159172e-06, 'samples': 27433728, 'steps': 142883, 'loss/train': 1.0487253665924072} 11/07/2021 17:21:32 - INFO - __main__ - Step 142885: {'lr': 2.8466383394505633e-06, 'samples': 27433920, 'steps': 142884, 'loss/train': 1.5630648136138916} 11/07/2021 17:21:33 - INFO - __main__ - Step 142886: {'lr': 2.8458398499487116e-06, 'samples': 27434112, 'steps': 142885, 'loss/train': 1.0668286085128784} 11/07/2021 17:21:33 - INFO - __main__ - Step 142887: {'lr': 2.8450414718106944e-06, 'samples': 27434304, 'steps': 142886, 'loss/train': 1.1034655570983887} 11/07/2021 17:21:33 - INFO - __main__ - Step 142888: {'lr': 2.8442432050369003e-06, 'samples': 27434496, 'steps': 142887, 'loss/train': 1.0089707374572754} 11/07/2021 17:21:35 - INFO - __main__ - Step 142889: {'lr': 2.8434450496276633e-06, 'samples': 27434688, 'steps': 142888, 'loss/train': 1.1280131340026855} 11/07/2021 17:21:35 - INFO - __main__ - Step 142890: {'lr': 2.842647005583371e-06, 'samples': 27434880, 'steps': 142889, 'loss/train': 0.8140457272529602} 11/07/2021 17:21:36 - INFO - __main__ - Step 142891: {'lr': 2.8418490729043565e-06, 'samples': 27435072, 'steps': 142890, 'loss/train': 0.5961179137229919} 11/07/2021 17:21:36 - INFO - __main__ - Step 142892: {'lr': 2.8410512515910093e-06, 'samples': 27435264, 'steps': 142891, 'loss/train': 1.4438197612762451} 11/07/2021 17:21:36 - INFO - __main__ - Step 142893: {'lr': 2.8402535416436613e-06, 'samples': 27435456, 'steps': 142892, 'loss/train': 0.1609542965888977} 11/07/2021 17:21:37 - INFO - __main__ - Step 142894: {'lr': 2.839455943062674e-06, 'samples': 27435648, 'steps': 142893, 'loss/train': 1.4300940036773682} 11/07/2021 17:21:38 - INFO - __main__ - Step 142895: {'lr': 2.8386584558484087e-06, 'samples': 27435840, 'steps': 142894, 'loss/train': 1.3719584941864014} 11/07/2021 17:21:38 - INFO - __main__ - Step 142896: {'lr': 2.837861080001225e-06, 'samples': 27436032, 'steps': 142895, 'loss/train': 1.2355762720108032} 11/07/2021 17:21:38 - INFO - __main__ - Step 142897: {'lr': 2.8370638155215123e-06, 'samples': 27436224, 'steps': 142896, 'loss/train': 0.7491282224655151} 11/07/2021 17:21:39 - INFO - __main__ - Step 142898: {'lr': 2.836266662409576e-06, 'samples': 27436416, 'steps': 142897, 'loss/train': 1.4975444078445435} 11/07/2021 17:21:39 - INFO - __main__ - Step 142899: {'lr': 2.8354696206658315e-06, 'samples': 27436608, 'steps': 142898, 'loss/train': 1.3221184015274048} 11/07/2021 17:21:40 - INFO - __main__ - Step 142900: {'lr': 2.8346726902905852e-06, 'samples': 27436800, 'steps': 142899, 'loss/train': 1.0884240865707397} 11/07/2021 17:21:41 - INFO - __main__ - Step 142901: {'lr': 2.8338758712842527e-06, 'samples': 27436992, 'steps': 142900, 'loss/train': 1.299908995628357} 11/07/2021 17:21:41 - INFO - __main__ - Step 142902: {'lr': 2.83307916364714e-06, 'samples': 27437184, 'steps': 142901, 'loss/train': 1.358563780784607} 11/07/2021 17:21:41 - INFO - __main__ - Step 142903: {'lr': 2.832282567379635e-06, 'samples': 27437376, 'steps': 142902, 'loss/train': 1.5615791082382202} 11/07/2021 17:21:42 - INFO - __main__ - Step 142904: {'lr': 2.831486082482071e-06, 'samples': 27437568, 'steps': 142903, 'loss/train': 1.2352572679519653} 11/07/2021 17:21:43 - INFO - __main__ - Step 142905: {'lr': 2.8306897089548367e-06, 'samples': 27437760, 'steps': 142904, 'loss/train': 1.619136929512024} 11/07/2021 17:21:43 - INFO - __main__ - Step 142906: {'lr': 2.829893446798293e-06, 'samples': 27437952, 'steps': 142905, 'loss/train': 1.0453078746795654} 11/07/2021 17:21:43 - INFO - __main__ - Step 142907: {'lr': 2.8290972960127725e-06, 'samples': 27438144, 'steps': 142906, 'loss/train': 1.7395302057266235} 11/07/2021 17:21:44 - INFO - __main__ - Step 142908: {'lr': 2.8283012565986367e-06, 'samples': 27438336, 'steps': 142907, 'loss/train': 0.8194828033447266} 11/07/2021 17:21:44 - INFO - __main__ - Step 142909: {'lr': 2.8275053285562735e-06, 'samples': 27438528, 'steps': 142908, 'loss/train': 1.3123962879180908} 11/07/2021 17:21:45 - INFO - __main__ - Step 142910: {'lr': 2.8267095118860166e-06, 'samples': 27438720, 'steps': 142909, 'loss/train': 0.6379693746566772} 11/07/2021 17:21:45 - INFO - __main__ - Step 142911: {'lr': 2.825913806588226e-06, 'samples': 27438912, 'steps': 142910, 'loss/train': 1.0947414636611938} 11/07/2021 17:21:46 - INFO - __main__ - Step 142912: {'lr': 2.8251182126632914e-06, 'samples': 27439104, 'steps': 142911, 'loss/train': 1.4806771278381348} 11/07/2021 17:21:46 - INFO - __main__ - Step 142913: {'lr': 2.8243227301115173e-06, 'samples': 27439296, 'steps': 142912, 'loss/train': 1.486698865890503} 11/07/2021 17:21:46 - INFO - __main__ - Step 142914: {'lr': 2.8235273589332923e-06, 'samples': 27439488, 'steps': 142913, 'loss/train': 1.6284360885620117} 11/07/2021 17:21:48 - INFO - __main__ - Step 142915: {'lr': 2.8227320991289775e-06, 'samples': 27439680, 'steps': 142914, 'loss/train': 1.3946738243103027} 11/07/2021 17:21:48 - INFO - __main__ - Step 142916: {'lr': 2.8219369506989055e-06, 'samples': 27439872, 'steps': 142915, 'loss/train': 1.5278676748275757} 11/07/2021 17:21:48 - INFO - __main__ - Step 142917: {'lr': 2.821141913643466e-06, 'samples': 27440064, 'steps': 142916, 'loss/train': 1.400227665901184} 11/07/2021 17:21:49 - INFO - __main__ - Step 142918: {'lr': 2.8203469879630182e-06, 'samples': 27440256, 'steps': 142917, 'loss/train': 1.394483208656311} 11/07/2021 17:21:49 - INFO - __main__ - Step 142919: {'lr': 2.8195521736578965e-06, 'samples': 27440448, 'steps': 142918, 'loss/train': 0.9752068519592285} 11/07/2021 17:21:49 - INFO - __main__ - Step 142920: {'lr': 2.818757470728461e-06, 'samples': 27440640, 'steps': 142919, 'loss/train': 1.5108150243759155} 11/07/2021 17:21:50 - INFO - __main__ - Step 142921: {'lr': 2.8179628791751013e-06, 'samples': 27440832, 'steps': 142920, 'loss/train': 1.4914063215255737} 11/07/2021 17:21:51 - INFO - __main__ - Step 142922: {'lr': 2.817168398998121e-06, 'samples': 27441024, 'steps': 142921, 'loss/train': 1.2689934968948364} 11/07/2021 17:21:51 - INFO - __main__ - Step 142923: {'lr': 2.816374030197966e-06, 'samples': 27441216, 'steps': 142922, 'loss/train': 0.41859015822410583} 11/07/2021 17:21:51 - INFO - __main__ - Step 142924: {'lr': 2.8155797727748845e-06, 'samples': 27441408, 'steps': 142923, 'loss/train': 1.4011626243591309} 11/07/2021 17:21:52 - INFO - __main__ - Step 142925: {'lr': 2.8147856267293215e-06, 'samples': 27441600, 'steps': 142924, 'loss/train': 0.8568814992904663} 11/07/2021 17:21:53 - INFO - __main__ - Step 142926: {'lr': 2.8139915920615823e-06, 'samples': 27441792, 'steps': 142925, 'loss/train': 1.2629170417785645} 11/07/2021 17:21:53 - INFO - __main__ - Step 142927: {'lr': 2.813197668772055e-06, 'samples': 27441984, 'steps': 142926, 'loss/train': 1.1697335243225098} 11/07/2021 17:21:53 - INFO - __main__ - Step 142928: {'lr': 2.8124038568610733e-06, 'samples': 27442176, 'steps': 142927, 'loss/train': 1.5993320941925049} 11/07/2021 17:21:54 - INFO - __main__ - Step 142929: {'lr': 2.811610156329025e-06, 'samples': 27442368, 'steps': 142928, 'loss/train': 1.3494765758514404} 11/07/2021 17:21:54 - INFO - __main__ - Step 142930: {'lr': 2.810816567176244e-06, 'samples': 27442560, 'steps': 142929, 'loss/train': 1.232390284538269} 11/07/2021 17:21:55 - INFO - __main__ - Step 142931: {'lr': 2.810023089403091e-06, 'samples': 27442752, 'steps': 142930, 'loss/train': 1.194886565208435} 11/07/2021 17:21:56 - INFO - __main__ - Step 142932: {'lr': 2.809229723009926e-06, 'samples': 27442944, 'steps': 142931, 'loss/train': 1.5891361236572266} 11/07/2021 17:21:56 - INFO - __main__ - Step 142933: {'lr': 2.808436467997111e-06, 'samples': 27443136, 'steps': 142932, 'loss/train': 0.9370372295379639} 11/07/2021 17:21:57 - INFO - __main__ - Step 142934: {'lr': 2.8076433243650056e-06, 'samples': 27443328, 'steps': 142933, 'loss/train': 1.7718220949172974} 11/07/2021 17:21:57 - INFO - __main__ - Step 142935: {'lr': 2.806850292113944e-06, 'samples': 27443520, 'steps': 142934, 'loss/train': 1.0769226551055908} 11/07/2021 17:21:58 - INFO - __main__ - Step 142936: {'lr': 2.8060573712443416e-06, 'samples': 27443712, 'steps': 142935, 'loss/train': 1.9163398742675781} 11/07/2021 17:21:58 - INFO - __main__ - Step 142937: {'lr': 2.8052645617564764e-06, 'samples': 27443904, 'steps': 142936, 'loss/train': 1.3055411577224731} 11/07/2021 17:21:59 - INFO - __main__ - Step 142938: {'lr': 2.804471863650765e-06, 'samples': 27444096, 'steps': 142937, 'loss/train': 1.846790075302124} 11/07/2021 17:21:59 - INFO - __main__ - Step 142939: {'lr': 2.8036792769275122e-06, 'samples': 27444288, 'steps': 142938, 'loss/train': 1.5796806812286377} 11/07/2021 17:21:59 - INFO - __main__ - Step 142940: {'lr': 2.8028868015871346e-06, 'samples': 27444480, 'steps': 142939, 'loss/train': 1.123342514038086} 11/07/2021 17:22:00 - INFO - __main__ - Step 142941: {'lr': 2.802094437629965e-06, 'samples': 27444672, 'steps': 142940, 'loss/train': 1.3907015323638916} 11/07/2021 17:22:01 - INFO - __main__ - Step 142942: {'lr': 2.801302185056337e-06, 'samples': 27444864, 'steps': 142941, 'loss/train': 1.3927664756774902} 11/07/2021 17:22:01 - INFO - __main__ - Step 142943: {'lr': 2.8005100438666386e-06, 'samples': 27445056, 'steps': 142942, 'loss/train': 1.0324208736419678} 11/07/2021 17:22:01 - INFO - __main__ - Step 142944: {'lr': 2.799718014061231e-06, 'samples': 27445248, 'steps': 142943, 'loss/train': 1.1692510843276978} 11/07/2021 17:22:02 - INFO - __main__ - Step 142945: {'lr': 2.7989260956404195e-06, 'samples': 27445440, 'steps': 142944, 'loss/train': 1.5126339197158813} 11/07/2021 17:22:03 - INFO - __main__ - Step 142946: {'lr': 2.79813428860462e-06, 'samples': 27445632, 'steps': 142945, 'loss/train': 1.62186598777771} 11/07/2021 17:22:03 - INFO - __main__ - Step 142947: {'lr': 2.7973425929541663e-06, 'samples': 27445824, 'steps': 142946, 'loss/train': 1.817043423652649} 11/07/2021 17:22:04 - INFO - __main__ - Step 142948: {'lr': 2.7965510086894185e-06, 'samples': 27446016, 'steps': 142947, 'loss/train': 1.4185470342636108} 11/07/2021 17:22:04 - INFO - __main__ - Step 142949: {'lr': 2.795759535810738e-06, 'samples': 27446208, 'steps': 142948, 'loss/train': 0.9681703448295593} 11/07/2021 17:22:04 - INFO - __main__ - Step 142950: {'lr': 2.7949681743184576e-06, 'samples': 27446400, 'steps': 142949, 'loss/train': 1.1520438194274902} 11/07/2021 17:22:05 - INFO - __main__ - Step 142951: {'lr': 2.7941769242129657e-06, 'samples': 27446592, 'steps': 142950, 'loss/train': 1.49210524559021} 11/07/2021 17:22:06 - INFO - __main__ - Step 142952: {'lr': 2.7933857854945955e-06, 'samples': 27446784, 'steps': 142951, 'loss/train': 1.4908143281936646} 11/07/2021 17:22:06 - INFO - __main__ - Step 142953: {'lr': 2.7925947581637077e-06, 'samples': 27446976, 'steps': 142952, 'loss/train': 1.1694282293319702} 11/07/2021 17:22:06 - INFO - __main__ - Step 142954: {'lr': 2.791803842220664e-06, 'samples': 27447168, 'steps': 142953, 'loss/train': 1.2838634252548218} 11/07/2021 17:22:07 - INFO - __main__ - Step 142955: {'lr': 2.791013037665796e-06, 'samples': 27447360, 'steps': 142954, 'loss/train': 1.275468111038208} 11/07/2021 17:22:09 - INFO - __main__ - Step 142956: {'lr': 2.7902223444995213e-06, 'samples': 27447552, 'steps': 142955, 'loss/train': 1.2145779132843018} 11/07/2021 17:22:09 - INFO - __main__ - Step 142957: {'lr': 2.7894317627221165e-06, 'samples': 27447744, 'steps': 142956, 'loss/train': 1.5571949481964111} 11/07/2021 17:22:10 - INFO - __main__ - Step 142958: {'lr': 2.7886412923340263e-06, 'samples': 27447936, 'steps': 142957, 'loss/train': 1.3433213233947754} 11/07/2021 17:22:10 - INFO - __main__ - Step 142959: {'lr': 2.7878509333355286e-06, 'samples': 27448128, 'steps': 142958, 'loss/train': 1.1073615550994873} 11/07/2021 17:22:11 - INFO - __main__ - Step 142960: {'lr': 2.787060685727011e-06, 'samples': 27448320, 'steps': 142959, 'loss/train': 1.1782252788543701} 11/07/2021 17:22:11 - INFO - __main__ - Step 142961: {'lr': 2.786270549508835e-06, 'samples': 27448512, 'steps': 142960, 'loss/train': 1.1953516006469727} 11/07/2021 17:22:11 - INFO - __main__ - Step 142962: {'lr': 2.785480524681361e-06, 'samples': 27448704, 'steps': 142961, 'loss/train': 1.1564003229141235} 11/07/2021 17:22:12 - INFO - __main__ - Step 142963: {'lr': 2.7846906112449223e-06, 'samples': 27448896, 'steps': 142962, 'loss/train': 1.1321059465408325} 11/07/2021 17:22:13 - INFO - __main__ - Step 142964: {'lr': 2.7839008091999074e-06, 'samples': 27449088, 'steps': 142963, 'loss/train': 1.272851586341858} 11/07/2021 17:22:13 - INFO - __main__ - Step 142965: {'lr': 2.7831111185466217e-06, 'samples': 27449280, 'steps': 142964, 'loss/train': 1.7308156490325928} 11/07/2021 17:22:14 - INFO - __main__ - Step 142966: {'lr': 2.7823215392854818e-06, 'samples': 27449472, 'steps': 142965, 'loss/train': 1.4892915487289429} 11/07/2021 17:22:14 - INFO - __main__ - Step 142967: {'lr': 2.7815320714167922e-06, 'samples': 27449664, 'steps': 142966, 'loss/train': 1.1653794050216675} 11/07/2021 17:22:14 - INFO - __main__ - Step 142968: {'lr': 2.7807427149409147e-06, 'samples': 27449856, 'steps': 142967, 'loss/train': 0.7788036465644836} 11/07/2021 17:22:15 - INFO - __main__ - Step 142969: {'lr': 2.7799534698582372e-06, 'samples': 27450048, 'steps': 142968, 'loss/train': 0.871694028377533} 11/07/2021 17:22:16 - INFO - __main__ - Step 142970: {'lr': 2.7791643361690934e-06, 'samples': 27450240, 'steps': 142969, 'loss/train': 0.91520756483078} 11/07/2021 17:22:16 - INFO - __main__ - Step 142971: {'lr': 2.778375313873871e-06, 'samples': 27450432, 'steps': 142970, 'loss/train': 1.079116940498352} 11/07/2021 17:22:16 - INFO - __main__ - Step 142972: {'lr': 2.777586402972876e-06, 'samples': 27450624, 'steps': 142971, 'loss/train': 1.6629037857055664} 11/07/2021 17:22:17 - INFO - __main__ - Step 142973: {'lr': 2.776797603466469e-06, 'samples': 27450816, 'steps': 142972, 'loss/train': 0.6568059921264648} 11/07/2021 17:22:17 - INFO - __main__ - Step 142974: {'lr': 2.776008915355038e-06, 'samples': 27451008, 'steps': 142973, 'loss/train': 1.03036630153656} 11/07/2021 17:22:18 - INFO - __main__ - Step 142975: {'lr': 2.7752203386389174e-06, 'samples': 27451200, 'steps': 142974, 'loss/train': 0.9164997935295105} 11/07/2021 17:22:18 - INFO - __main__ - Step 142976: {'lr': 2.774431873318467e-06, 'samples': 27451392, 'steps': 142975, 'loss/train': 0.3988361656665802} 11/07/2021 17:22:19 - INFO - __main__ - Step 142977: {'lr': 2.7736435193940757e-06, 'samples': 27451584, 'steps': 142976, 'loss/train': 0.8477259874343872} 11/07/2021 17:22:19 - INFO - __main__ - Step 142978: {'lr': 2.7728552768660486e-06, 'samples': 27451776, 'steps': 142977, 'loss/train': 1.2777717113494873} 11/07/2021 17:22:20 - INFO - __main__ - Step 142979: {'lr': 2.7720671457347467e-06, 'samples': 27451968, 'steps': 142978, 'loss/train': 1.4087542295455933} 11/07/2021 17:22:21 - INFO - __main__ - Step 142980: {'lr': 2.771279126000531e-06, 'samples': 27452160, 'steps': 142979, 'loss/train': 1.2984448671340942} 11/07/2021 17:22:21 - INFO - __main__ - Step 142981: {'lr': 2.770491217663762e-06, 'samples': 27452352, 'steps': 142980, 'loss/train': 1.5400904417037964} 11/07/2021 17:22:21 - INFO - __main__ - Step 142982: {'lr': 2.7697034207248006e-06, 'samples': 27452544, 'steps': 142981, 'loss/train': 1.5090469121932983} 11/07/2021 17:22:22 - INFO - __main__ - Step 142983: {'lr': 2.7689157351840076e-06, 'samples': 27452736, 'steps': 142982, 'loss/train': 1.6259065866470337} 11/07/2021 17:22:22 - INFO - __main__ - Step 142984: {'lr': 2.7681281610417166e-06, 'samples': 27452928, 'steps': 142983, 'loss/train': 0.36431989073753357} 11/07/2021 17:22:23 - INFO - __main__ - Step 142985: {'lr': 2.7673406982982875e-06, 'samples': 27453120, 'steps': 142984, 'loss/train': 1.3667259216308594} 11/07/2021 17:22:24 - INFO - __main__ - Step 142986: {'lr': 2.7665533469540817e-06, 'samples': 27453312, 'steps': 142985, 'loss/train': 1.0699574947357178} 11/07/2021 17:22:24 - INFO - __main__ - Step 142987: {'lr': 2.765766107009432e-06, 'samples': 27453504, 'steps': 142986, 'loss/train': 1.0879861116409302} 11/07/2021 17:22:24 - INFO - __main__ - Step 142988: {'lr': 2.764978978464755e-06, 'samples': 27453696, 'steps': 142987, 'loss/train': 1.184856653213501} 11/07/2021 17:22:25 - INFO - __main__ - Step 142989: {'lr': 2.764191961320328e-06, 'samples': 27453888, 'steps': 142988, 'loss/train': 0.42998409271240234} 11/07/2021 17:22:25 - INFO - __main__ - Step 142990: {'lr': 2.7634050555765676e-06, 'samples': 27454080, 'steps': 142989, 'loss/train': 1.2712138891220093} 11/07/2021 17:22:26 - INFO - __main__ - Step 142991: {'lr': 2.762618261233807e-06, 'samples': 27454272, 'steps': 142990, 'loss/train': 0.5287706255912781} 11/07/2021 17:22:26 - INFO - __main__ - Step 142992: {'lr': 2.761831578292379e-06, 'samples': 27454464, 'steps': 142991, 'loss/train': 1.2597018480300903} 11/07/2021 17:22:27 - INFO - __main__ - Step 142993: {'lr': 2.7610450067526437e-06, 'samples': 27454656, 'steps': 142992, 'loss/train': 1.5638455152511597} 11/07/2021 17:22:27 - INFO - __main__ - Step 142994: {'lr': 2.760258546614963e-06, 'samples': 27454848, 'steps': 142993, 'loss/train': 0.8884726762771606} 11/07/2021 17:22:27 - INFO - __main__ - Step 142995: {'lr': 2.7594721978797253e-06, 'samples': 27455040, 'steps': 142994, 'loss/train': 1.5512877702713013} 11/07/2021 17:22:29 - INFO - __main__ - Step 142996: {'lr': 2.758685960547236e-06, 'samples': 27455232, 'steps': 142995, 'loss/train': 0.36357295513153076} 11/07/2021 17:22:29 - INFO - __main__ - Step 142997: {'lr': 2.7578998346178554e-06, 'samples': 27455424, 'steps': 142996, 'loss/train': 1.551049828529358} 11/07/2021 17:22:30 - INFO - __main__ - Step 142998: {'lr': 2.757113820091972e-06, 'samples': 27455616, 'steps': 142997, 'loss/train': 1.3418210744857788} 11/07/2021 17:22:30 - INFO - __main__ - Step 142999: {'lr': 2.756327916969892e-06, 'samples': 27455808, 'steps': 142998, 'loss/train': 0.7390632033348083} 11/07/2021 17:22:30 - INFO - __main__ - Step 143000: {'lr': 2.7555421252520308e-06, 'samples': 27456000, 'steps': 142999, 'loss/train': 1.276936650276184} 11/07/2021 17:22:31 - INFO - __main__ - Step 143001: {'lr': 2.7547564449386665e-06, 'samples': 27456192, 'steps': 143000, 'loss/train': 1.212971806526184} 11/07/2021 17:22:32 - INFO - __main__ - Step 143002: {'lr': 2.753970876030215e-06, 'samples': 27456384, 'steps': 143001, 'loss/train': 1.4647706747055054} 11/07/2021 17:22:32 - INFO - __main__ - Step 143003: {'lr': 2.753185418527038e-06, 'samples': 27456576, 'steps': 143002, 'loss/train': 1.4160984754562378} 11/07/2021 17:22:32 - INFO - __main__ - Step 143004: {'lr': 2.7524000724294397e-06, 'samples': 27456768, 'steps': 143003, 'loss/train': 0.9117769598960876} 11/07/2021 17:22:33 - INFO - __main__ - Step 143005: {'lr': 2.7516148377377813e-06, 'samples': 27456960, 'steps': 143004, 'loss/train': 0.8112437129020691} 11/07/2021 17:22:34 - INFO - __main__ - Step 143006: {'lr': 2.750829714452424e-06, 'samples': 27457152, 'steps': 143005, 'loss/train': 1.0613508224487305} 11/07/2021 17:22:34 - INFO - __main__ - Step 143007: {'lr': 2.750044702573756e-06, 'samples': 27457344, 'steps': 143006, 'loss/train': 1.1610771417617798} 11/07/2021 17:22:35 - INFO - __main__ - Step 143008: {'lr': 2.7492598021020833e-06, 'samples': 27457536, 'steps': 143007, 'loss/train': 1.4875606298446655} 11/07/2021 17:22:35 - INFO - __main__ - Step 143009: {'lr': 2.7484750130377657e-06, 'samples': 27457728, 'steps': 143008, 'loss/train': 1.033136248588562} 11/07/2021 17:22:35 - INFO - __main__ - Step 143010: {'lr': 2.747690335381192e-06, 'samples': 27457920, 'steps': 143009, 'loss/train': 1.4555330276489258} 11/07/2021 17:22:36 - INFO - __main__ - Step 143011: {'lr': 2.746905769132696e-06, 'samples': 27458112, 'steps': 143010, 'loss/train': 0.9776278138160706} 11/07/2021 17:22:37 - INFO - __main__ - Step 143012: {'lr': 2.74612131429261e-06, 'samples': 27458304, 'steps': 143011, 'loss/train': 0.45572009682655334} 11/07/2021 17:22:37 - INFO - __main__ - Step 143013: {'lr': 2.745336970861323e-06, 'samples': 27458496, 'steps': 143012, 'loss/train': 1.3607875108718872} 11/07/2021 17:22:37 - INFO - __main__ - Step 143014: {'lr': 2.7445527388391676e-06, 'samples': 27458688, 'steps': 143013, 'loss/train': 1.177678108215332} 11/07/2021 17:22:38 - INFO - __main__ - Step 143015: {'lr': 2.7437686182265053e-06, 'samples': 27458880, 'steps': 143014, 'loss/train': 1.2639116048812866} 11/07/2021 17:22:38 - INFO - __main__ - Step 143016: {'lr': 2.742984609023669e-06, 'samples': 27459072, 'steps': 143015, 'loss/train': 1.6228299140930176} 11/07/2021 17:22:39 - INFO - __main__ - Step 143017: {'lr': 2.7422007112310464e-06, 'samples': 27459264, 'steps': 143016, 'loss/train': 1.3126012086868286} 11/07/2021 17:22:39 - INFO - __main__ - Step 143018: {'lr': 2.741416924848972e-06, 'samples': 27459456, 'steps': 143017, 'loss/train': 1.4488118886947632} 11/07/2021 17:22:40 - INFO - __main__ - Step 143019: {'lr': 2.7406332498777776e-06, 'samples': 27459648, 'steps': 143018, 'loss/train': 1.834118127822876} 11/07/2021 17:22:40 - INFO - __main__ - Step 143020: {'lr': 2.73984968631788e-06, 'samples': 27459840, 'steps': 143019, 'loss/train': 1.6300790309906006} 11/07/2021 17:22:41 - INFO - __main__ - Step 143021: {'lr': 2.7390662341695572e-06, 'samples': 27460032, 'steps': 143020, 'loss/train': 1.4791598320007324} 11/07/2021 17:22:42 - INFO - __main__ - Step 143022: {'lr': 2.738282893433197e-06, 'samples': 27460224, 'steps': 143021, 'loss/train': 1.4194021224975586} 11/07/2021 17:22:42 - INFO - __main__ - Step 143023: {'lr': 2.737499664109161e-06, 'samples': 27460416, 'steps': 143022, 'loss/train': 1.408225417137146} 11/07/2021 17:22:42 - INFO - __main__ - Step 143024: {'lr': 2.736716546197782e-06, 'samples': 27460608, 'steps': 143023, 'loss/train': 1.2020989656448364} 11/07/2021 17:22:43 - INFO - __main__ - Step 143025: {'lr': 2.7359335396994202e-06, 'samples': 27460800, 'steps': 143024, 'loss/train': 1.5098410844802856} 11/07/2021 17:22:43 - INFO - __main__ - Step 143026: {'lr': 2.735150644614437e-06, 'samples': 27460992, 'steps': 143025, 'loss/train': 0.8981700539588928} 11/07/2021 17:22:44 - INFO - __main__ - Step 143027: {'lr': 2.734367860943193e-06, 'samples': 27461184, 'steps': 143026, 'loss/train': 1.1561353206634521} 11/07/2021 17:22:44 - INFO - __main__ - Step 143028: {'lr': 2.7335851886860217e-06, 'samples': 27461376, 'steps': 143027, 'loss/train': 0.5728533267974854} 11/07/2021 17:22:45 - INFO - __main__ - Step 143029: {'lr': 2.7328026278432563e-06, 'samples': 27461568, 'steps': 143028, 'loss/train': 1.1353039741516113} 11/07/2021 17:22:45 - INFO - __main__ - Step 143030: {'lr': 2.732020178415312e-06, 'samples': 27461760, 'steps': 143029, 'loss/train': 1.440919041633606} 11/07/2021 17:22:45 - INFO - __main__ - Step 143031: {'lr': 2.731237840402495e-06, 'samples': 27461952, 'steps': 143030, 'loss/train': 1.4274486303329468} 11/07/2021 17:22:46 - INFO - __main__ - Step 143032: {'lr': 2.730455613805166e-06, 'samples': 27462144, 'steps': 143031, 'loss/train': 1.2469139099121094} 11/07/2021 17:22:47 - INFO - __main__ - Step 143033: {'lr': 2.729673498623658e-06, 'samples': 27462336, 'steps': 143032, 'loss/train': 0.9654675722122192} 11/07/2021 17:22:47 - INFO - __main__ - Step 143034: {'lr': 2.72889149485836e-06, 'samples': 27462528, 'steps': 143033, 'loss/train': 1.5775526762008667} 11/07/2021 17:22:48 - INFO - __main__ - Step 143035: {'lr': 2.7281096025096043e-06, 'samples': 27462720, 'steps': 143034, 'loss/train': 1.3328845500946045} 11/07/2021 17:22:48 - INFO - __main__ - Step 143036: {'lr': 2.727327821577752e-06, 'samples': 27462912, 'steps': 143035, 'loss/train': 1.8134987354278564} 11/07/2021 17:22:48 - INFO - __main__ - Step 143037: {'lr': 2.7265461520631363e-06, 'samples': 27463104, 'steps': 143036, 'loss/train': 1.7330570220947266} 11/07/2021 17:22:49 - INFO - __main__ - Step 143038: {'lr': 2.725764593966118e-06, 'samples': 27463296, 'steps': 143037, 'loss/train': 1.2185720205307007} 11/07/2021 17:22:50 - INFO - __main__ - Step 143039: {'lr': 2.7249831472870854e-06, 'samples': 27463488, 'steps': 143038, 'loss/train': 1.4960339069366455} 11/07/2021 17:22:50 - INFO - __main__ - Step 143040: {'lr': 2.7242018120263447e-06, 'samples': 27463680, 'steps': 143039, 'loss/train': 1.3425564765930176} 11/07/2021 17:22:50 - INFO - __main__ - Step 143041: {'lr': 2.7234205881842557e-06, 'samples': 27463872, 'steps': 143040, 'loss/train': 1.7973281145095825} 11/07/2021 17:22:51 - INFO - __main__ - Step 143042: {'lr': 2.7226394757611795e-06, 'samples': 27464064, 'steps': 143041, 'loss/train': 1.6627137660980225} 11/07/2021 17:22:52 - INFO - __main__ - Step 143043: {'lr': 2.721858474757477e-06, 'samples': 27464256, 'steps': 143042, 'loss/train': 0.9955282807350159} 11/07/2021 17:22:52 - INFO - __main__ - Step 143044: {'lr': 2.7210775851734817e-06, 'samples': 27464448, 'steps': 143043, 'loss/train': 0.3506867587566376} 11/07/2021 17:22:52 - INFO - __main__ - Step 143045: {'lr': 2.7202968070095537e-06, 'samples': 27464640, 'steps': 143044, 'loss/train': 1.3382525444030762} 11/07/2021 17:22:53 - INFO - __main__ - Step 143046: {'lr': 2.719516140266054e-06, 'samples': 27464832, 'steps': 143045, 'loss/train': 1.313065767288208} 11/07/2021 17:22:53 - INFO - __main__ - Step 143047: {'lr': 2.718735584943316e-06, 'samples': 27465024, 'steps': 143046, 'loss/train': 1.6586798429489136} 11/07/2021 17:22:54 - INFO - __main__ - Step 143048: {'lr': 2.7179551410416726e-06, 'samples': 27465216, 'steps': 143047, 'loss/train': 1.4252163171768188} 11/07/2021 17:22:54 - INFO - __main__ - Step 143049: {'lr': 2.71717480856154e-06, 'samples': 27465408, 'steps': 143048, 'loss/train': 1.4882739782333374} 11/07/2021 17:22:55 - INFO - __main__ - Step 143050: {'lr': 2.716394587503224e-06, 'samples': 27465600, 'steps': 143049, 'loss/train': 1.4048820734024048} 11/07/2021 17:22:55 - INFO - __main__ - Step 143051: {'lr': 2.7156144778670845e-06, 'samples': 27465792, 'steps': 143050, 'loss/train': 0.9041110277175903} 11/07/2021 17:22:56 - INFO - __main__ - Step 143052: {'lr': 2.714834479653455e-06, 'samples': 27465984, 'steps': 143051, 'loss/train': 1.4687048196792603} 11/07/2021 17:22:57 - INFO - __main__ - Step 143053: {'lr': 2.7140545928627247e-06, 'samples': 27466176, 'steps': 143052, 'loss/train': 1.5950167179107666} 11/07/2021 17:22:57 - INFO - __main__ - Step 143054: {'lr': 2.713274817495226e-06, 'samples': 27466368, 'steps': 143053, 'loss/train': 1.1268664598464966} 11/07/2021 17:22:57 - INFO - __main__ - Step 143055: {'lr': 2.712495153551292e-06, 'samples': 27466560, 'steps': 143054, 'loss/train': 1.159876823425293} 11/07/2021 17:22:58 - INFO - __main__ - Step 143056: {'lr': 2.7117156010313114e-06, 'samples': 27466752, 'steps': 143055, 'loss/train': 1.3428436517715454} 11/07/2021 17:22:58 - INFO - __main__ - Step 143057: {'lr': 2.7109361599356175e-06, 'samples': 27466944, 'steps': 143056, 'loss/train': 1.1984025239944458} 11/07/2021 17:22:59 - INFO - __main__ - Step 143058: {'lr': 2.7101568302645705e-06, 'samples': 27467136, 'steps': 143057, 'loss/train': 1.0545554161071777} 11/07/2021 17:22:59 - INFO - __main__ - Step 143059: {'lr': 2.7093776120184767e-06, 'samples': 27467328, 'steps': 143058, 'loss/train': 1.5312081575393677} 11/07/2021 17:23:00 - INFO - __main__ - Step 143060: {'lr': 2.7085985051977515e-06, 'samples': 27467520, 'steps': 143059, 'loss/train': 1.3659064769744873} 11/07/2021 17:23:00 - INFO - __main__ - Step 143061: {'lr': 2.7078195098027e-06, 'samples': 27467712, 'steps': 143060, 'loss/train': 1.2124254703521729} 11/07/2021 17:23:00 - INFO - __main__ - Step 143062: {'lr': 2.7070406258336845e-06, 'samples': 27467904, 'steps': 143061, 'loss/train': 1.5361019372940063} 11/07/2021 17:23:02 - INFO - __main__ - Step 143063: {'lr': 2.7062618532910645e-06, 'samples': 27468096, 'steps': 143062, 'loss/train': 1.4484156370162964} 11/07/2021 17:23:02 - INFO - __main__ - Step 143064: {'lr': 2.7054831921751734e-06, 'samples': 27468288, 'steps': 143063, 'loss/train': 1.5987403392791748} 11/07/2021 17:23:02 - INFO - __main__ - Step 143065: {'lr': 2.7047046424863996e-06, 'samples': 27468480, 'steps': 143064, 'loss/train': 1.0876595973968506} 11/07/2021 17:23:03 - INFO - __main__ - Step 143066: {'lr': 2.703926204225077e-06, 'samples': 27468672, 'steps': 143065, 'loss/train': 1.223467469215393} 11/07/2021 17:23:03 - INFO - __main__ - Step 143067: {'lr': 2.7031478773915096e-06, 'samples': 27468864, 'steps': 143066, 'loss/train': 1.3290103673934937} 11/07/2021 17:23:03 - INFO - __main__ - Step 143068: {'lr': 2.702369661986115e-06, 'samples': 27469056, 'steps': 143067, 'loss/train': 1.4475557804107666} 11/07/2021 17:23:04 - INFO - __main__ - Step 143069: {'lr': 2.7015915580092252e-06, 'samples': 27469248, 'steps': 143068, 'loss/train': 1.241707682609558} 11/07/2021 17:23:05 - INFO - __main__ - Step 143070: {'lr': 2.7008135654611743e-06, 'samples': 27469440, 'steps': 143069, 'loss/train': 1.5480444431304932} 11/07/2021 17:23:05 - INFO - __main__ - Step 143071: {'lr': 2.7000356843423223e-06, 'samples': 27469632, 'steps': 143070, 'loss/train': 1.1606264114379883} 11/07/2021 17:23:05 - INFO - __main__ - Step 143072: {'lr': 2.699257914653003e-06, 'samples': 27469824, 'steps': 143071, 'loss/train': 1.7214723825454712} 11/07/2021 17:23:06 - INFO - __main__ - Step 143073: {'lr': 2.698480256393604e-06, 'samples': 27470016, 'steps': 143072, 'loss/train': 1.3510913848876953} 11/07/2021 17:23:07 - INFO - __main__ - Step 143074: {'lr': 2.6977027095644314e-06, 'samples': 27470208, 'steps': 143073, 'loss/train': 1.4633816480636597} 11/07/2021 17:23:07 - INFO - __main__ - Step 143075: {'lr': 2.6969252741658736e-06, 'samples': 27470400, 'steps': 143074, 'loss/train': 1.1574069261550903} 11/07/2021 17:23:08 - INFO - __main__ - Step 143076: {'lr': 2.6961479501982633e-06, 'samples': 27470592, 'steps': 143075, 'loss/train': 1.256029725074768} 11/07/2021 17:23:08 - INFO - __main__ - Step 143077: {'lr': 2.695370737661934e-06, 'samples': 27470784, 'steps': 143076, 'loss/train': 1.6414713859558105} 11/07/2021 17:23:08 - INFO - __main__ - Step 143078: {'lr': 2.694593636557274e-06, 'samples': 27470976, 'steps': 143077, 'loss/train': 1.2407888174057007} 11/07/2021 17:23:10 - INFO - __main__ - Step 143079: {'lr': 2.693816646884617e-06, 'samples': 27471168, 'steps': 143078, 'loss/train': 1.1332640647888184} 11/07/2021 17:23:10 - INFO - __main__ - Step 143080: {'lr': 2.6930397686442953e-06, 'samples': 27471360, 'steps': 143079, 'loss/train': 0.6441757678985596} 11/07/2021 17:23:10 - INFO - __main__ - Step 143081: {'lr': 2.69226300183667e-06, 'samples': 27471552, 'steps': 143080, 'loss/train': 1.0551024675369263} 11/07/2021 17:23:11 - INFO - __main__ - Step 143082: {'lr': 2.691486346462102e-06, 'samples': 27471744, 'steps': 143081, 'loss/train': 1.6093335151672363} 11/07/2021 17:23:11 - INFO - __main__ - Step 143083: {'lr': 2.690709802520952e-06, 'samples': 27471936, 'steps': 143082, 'loss/train': 2.4156768321990967} 11/07/2021 17:23:11 - INFO - __main__ - Step 143084: {'lr': 2.689933370013553e-06, 'samples': 27472128, 'steps': 143083, 'loss/train': 1.7427610158920288} 11/07/2021 17:23:12 - INFO - __main__ - Step 143085: {'lr': 2.6891570489402384e-06, 'samples': 27472320, 'steps': 143084, 'loss/train': 1.2116117477416992} 11/07/2021 17:23:13 - INFO - __main__ - Step 143086: {'lr': 2.688380839301369e-06, 'samples': 27472512, 'steps': 143085, 'loss/train': 1.2392189502716064} 11/07/2021 17:23:13 - INFO - __main__ - Step 143087: {'lr': 2.6876047410973047e-06, 'samples': 27472704, 'steps': 143086, 'loss/train': 1.3524010181427002} 11/07/2021 17:23:13 - INFO - __main__ - Step 143088: {'lr': 2.68682875432838e-06, 'samples': 27472896, 'steps': 143087, 'loss/train': 1.711409091949463} 11/07/2021 17:23:14 - INFO - __main__ - Step 143089: {'lr': 2.6860528789949545e-06, 'samples': 27473088, 'steps': 143088, 'loss/train': 1.4184207916259766} 11/07/2021 17:23:15 - INFO - __main__ - Step 143090: {'lr': 2.6852771150973898e-06, 'samples': 27473280, 'steps': 143089, 'loss/train': 1.2983297109603882} 11/07/2021 17:23:15 - INFO - __main__ - Step 143091: {'lr': 2.6845014626360187e-06, 'samples': 27473472, 'steps': 143090, 'loss/train': 2.1967554092407227} 11/07/2021 17:23:16 - INFO - __main__ - Step 143092: {'lr': 2.683725921611202e-06, 'samples': 27473664, 'steps': 143091, 'loss/train': 0.8432749509811401} 11/07/2021 17:23:16 - INFO - __main__ - Step 143093: {'lr': 2.6829504920232726e-06, 'samples': 27473856, 'steps': 143092, 'loss/train': 0.6840240955352783} 11/07/2021 17:23:16 - INFO - __main__ - Step 143094: {'lr': 2.6821751738725917e-06, 'samples': 27474048, 'steps': 143093, 'loss/train': 0.7669702768325806} 11/07/2021 17:23:17 - INFO - __main__ - Step 143095: {'lr': 2.6813999671594923e-06, 'samples': 27474240, 'steps': 143094, 'loss/train': 1.1773277521133423} 11/07/2021 17:23:18 - INFO - __main__ - Step 143096: {'lr': 2.680624871884363e-06, 'samples': 27474432, 'steps': 143095, 'loss/train': 1.710423469543457} 11/07/2021 17:23:18 - INFO - __main__ - Step 143097: {'lr': 2.6798498880475087e-06, 'samples': 27474624, 'steps': 143096, 'loss/train': 1.2881007194519043} 11/07/2021 17:23:18 - INFO - __main__ - Step 143098: {'lr': 2.679075015649318e-06, 'samples': 27474816, 'steps': 143097, 'loss/train': 1.0188666582107544} 11/07/2021 17:23:19 - INFO - __main__ - Step 143099: {'lr': 2.678300254690097e-06, 'samples': 27475008, 'steps': 143098, 'loss/train': 1.2862331867218018} 11/07/2021 17:23:19 - INFO - __main__ - Step 143100: {'lr': 2.6775256051702335e-06, 'samples': 27475200, 'steps': 143099, 'loss/train': 1.1611888408660889} 11/07/2021 17:23:20 - INFO - __main__ - Step 143101: {'lr': 2.6767510670900608e-06, 'samples': 27475392, 'steps': 143100, 'loss/train': 1.2973246574401855} 11/07/2021 17:23:20 - INFO - __main__ - Step 143102: {'lr': 2.675976640449912e-06, 'samples': 27475584, 'steps': 143101, 'loss/train': 1.3910353183746338} 11/07/2021 17:23:21 - INFO - __main__ - Step 143103: {'lr': 2.675202325250148e-06, 'samples': 27475776, 'steps': 143102, 'loss/train': 1.2937572002410889} 11/07/2021 17:23:21 - INFO - __main__ - Step 143104: {'lr': 2.674428121491157e-06, 'samples': 27475968, 'steps': 143103, 'loss/train': 1.1739343404769897} 11/07/2021 17:23:21 - INFO - __main__ - Step 143105: {'lr': 2.673654029173217e-06, 'samples': 27476160, 'steps': 143104, 'loss/train': 1.1875733137130737} 11/07/2021 17:23:23 - INFO - __main__ - Step 143106: {'lr': 2.6728800482967164e-06, 'samples': 27476352, 'steps': 143105, 'loss/train': 1.3095659017562866} 11/07/2021 17:23:23 - INFO - __main__ - Step 143107: {'lr': 2.672106178862016e-06, 'samples': 27476544, 'steps': 143106, 'loss/train': 1.7458430528640747} 11/07/2021 17:23:23 - INFO - __main__ - Step 143108: {'lr': 2.6713324208694213e-06, 'samples': 27476736, 'steps': 143107, 'loss/train': 1.406135082244873} 11/07/2021 17:23:24 - INFO - __main__ - Step 143109: {'lr': 2.6705587743193484e-06, 'samples': 27476928, 'steps': 143108, 'loss/train': 0.759296178817749} 11/07/2021 17:23:24 - INFO - __main__ - Step 143110: {'lr': 2.6697852392120748e-06, 'samples': 27477120, 'steps': 143109, 'loss/train': 1.1002326011657715} 11/07/2021 17:23:25 - INFO - __main__ - Step 143111: {'lr': 2.669011815547989e-06, 'samples': 27477312, 'steps': 143110, 'loss/train': 1.2084230184555054} 11/07/2021 17:23:26 - INFO - __main__ - Step 143112: {'lr': 2.668238503327425e-06, 'samples': 27477504, 'steps': 143111, 'loss/train': 1.293761968612671} 11/07/2021 17:23:26 - INFO - __main__ - Step 143113: {'lr': 2.667465302550742e-06, 'samples': 27477696, 'steps': 143112, 'loss/train': 1.5453377962112427} 11/07/2021 17:23:26 - INFO - __main__ - Step 143114: {'lr': 2.6666922132182747e-06, 'samples': 27477888, 'steps': 143113, 'loss/train': 0.8372640013694763} 11/07/2021 17:23:27 - INFO - __main__ - Step 143115: {'lr': 2.665919235330383e-06, 'samples': 27478080, 'steps': 143114, 'loss/train': 0.4351027309894562} 11/07/2021 17:23:28 - INFO - __main__ - Step 143116: {'lr': 2.6651463688874277e-06, 'samples': 27478272, 'steps': 143115, 'loss/train': 0.5879501104354858} 11/07/2021 17:23:28 - INFO - __main__ - Step 143117: {'lr': 2.6643736138897147e-06, 'samples': 27478464, 'steps': 143116, 'loss/train': 1.1391710042953491} 11/07/2021 17:23:28 - INFO - __main__ - Step 143118: {'lr': 2.6636009703376317e-06, 'samples': 27478656, 'steps': 143117, 'loss/train': 1.3765875101089478} 11/07/2021 17:23:29 - INFO - __main__ - Step 143119: {'lr': 2.6628284382315125e-06, 'samples': 27478848, 'steps': 143118, 'loss/train': 1.5804588794708252} 11/07/2021 17:23:29 - INFO - __main__ - Step 143120: {'lr': 2.662056017571718e-06, 'samples': 27479040, 'steps': 143119, 'loss/train': 0.6671305298805237} 11/07/2021 17:23:30 - INFO - __main__ - Step 143121: {'lr': 2.661283708358553e-06, 'samples': 27479232, 'steps': 143120, 'loss/train': 1.2462958097457886} 11/07/2021 17:23:31 - INFO - __main__ - Step 143122: {'lr': 2.6605115105924337e-06, 'samples': 27479424, 'steps': 143121, 'loss/train': 1.3328262567520142} 11/07/2021 17:23:31 - INFO - __main__ - Step 143123: {'lr': 2.6597394242736382e-06, 'samples': 27479616, 'steps': 143122, 'loss/train': 1.7177364826202393} 11/07/2021 17:23:31 - INFO - __main__ - Step 143124: {'lr': 2.6589674494025552e-06, 'samples': 27479808, 'steps': 143123, 'loss/train': 1.2154134511947632} 11/07/2021 17:23:32 - INFO - __main__ - Step 143125: {'lr': 2.6581955859795447e-06, 'samples': 27480000, 'steps': 143124, 'loss/train': 1.0550144910812378} 11/07/2021 17:23:32 - INFO - __main__ - Step 143126: {'lr': 2.6574238340049408e-06, 'samples': 27480192, 'steps': 143125, 'loss/train': 1.0440977811813354} 11/07/2021 17:23:33 - INFO - __main__ - Step 143127: {'lr': 2.6566521934790476e-06, 'samples': 27480384, 'steps': 143126, 'loss/train': 1.7067373991012573} 11/07/2021 17:23:34 - INFO - __main__ - Step 143128: {'lr': 2.6558806644022826e-06, 'samples': 27480576, 'steps': 143127, 'loss/train': 1.1656684875488281} 11/07/2021 17:23:34 - INFO - __main__ - Step 143129: {'lr': 2.6551092467749505e-06, 'samples': 27480768, 'steps': 143128, 'loss/train': 1.4551081657409668} 11/07/2021 17:23:34 - INFO - __main__ - Step 143130: {'lr': 2.6543379405973845e-06, 'samples': 27480960, 'steps': 143129, 'loss/train': 1.5629268884658813} 11/07/2021 17:23:35 - INFO - __main__ - Step 143131: {'lr': 2.653566745869973e-06, 'samples': 27481152, 'steps': 143130, 'loss/train': 1.6467348337173462} 11/07/2021 17:23:36 - INFO - __main__ - Step 143132: {'lr': 2.652795662593077e-06, 'samples': 27481344, 'steps': 143131, 'loss/train': 1.1271283626556396} 11/07/2021 17:23:36 - INFO - __main__ - Step 143133: {'lr': 2.652024690766974e-06, 'samples': 27481536, 'steps': 143132, 'loss/train': 1.6933650970458984} 11/07/2021 17:23:36 - INFO - __main__ - Step 143134: {'lr': 2.651253830392081e-06, 'samples': 27481728, 'steps': 143133, 'loss/train': 0.6577365398406982} 11/07/2021 17:23:37 - INFO - __main__ - Step 143135: {'lr': 2.650483081468702e-06, 'samples': 27481920, 'steps': 143134, 'loss/train': 1.374346375465393} 11/07/2021 17:23:37 - INFO - __main__ - Step 143136: {'lr': 2.6497124439971985e-06, 'samples': 27482112, 'steps': 143135, 'loss/train': 1.351727843284607} 11/07/2021 17:23:38 - INFO - __main__ - Step 143137: {'lr': 2.648941917977904e-06, 'samples': 27482304, 'steps': 143136, 'loss/train': 1.3556404113769531} 11/07/2021 17:23:39 - INFO - __main__ - Step 143138: {'lr': 2.6481715034112065e-06, 'samples': 27482496, 'steps': 143137, 'loss/train': 1.0563867092132568} 11/07/2021 17:23:39 - INFO - __main__ - Step 143139: {'lr': 2.6474012002974114e-06, 'samples': 27482688, 'steps': 143138, 'loss/train': 1.4042150974273682} 11/07/2021 17:23:39 - INFO - __main__ - Step 143140: {'lr': 2.6466310086368794e-06, 'samples': 27482880, 'steps': 143139, 'loss/train': 1.4214788675308228} 11/07/2021 17:23:40 - INFO - __main__ - Step 143141: {'lr': 2.645860928429972e-06, 'samples': 27483072, 'steps': 143140, 'loss/train': 1.1934157609939575} 11/07/2021 17:23:41 - INFO - __main__ - Step 143142: {'lr': 2.6450909596770214e-06, 'samples': 27483264, 'steps': 143141, 'loss/train': 0.9404038190841675} 11/07/2021 17:23:41 - INFO - __main__ - Step 143143: {'lr': 2.644321102378361e-06, 'samples': 27483456, 'steps': 143142, 'loss/train': 1.707438588142395} 11/07/2021 17:23:41 - INFO - __main__ - Step 143144: {'lr': 2.6435513565343517e-06, 'samples': 27483648, 'steps': 143143, 'loss/train': 1.2225449085235596} 11/07/2021 17:23:42 - INFO - __main__ - Step 143145: {'lr': 2.642781722145354e-06, 'samples': 27483840, 'steps': 143144, 'loss/train': 1.4331691265106201} 11/07/2021 17:23:42 - INFO - __main__ - Step 143146: {'lr': 2.642012199211702e-06, 'samples': 27484032, 'steps': 143145, 'loss/train': 1.1517276763916016} 11/07/2021 17:23:42 - INFO - __main__ - Step 143147: {'lr': 2.6412427877337276e-06, 'samples': 27484224, 'steps': 143146, 'loss/train': 1.4363815784454346} 11/07/2021 17:23:43 - INFO - __main__ - Step 143148: {'lr': 2.64047348771182e-06, 'samples': 27484416, 'steps': 143147, 'loss/train': 1.0556743144989014} 11/07/2021 17:23:44 - INFO - __main__ - Step 143149: {'lr': 2.639704299146284e-06, 'samples': 27484608, 'steps': 143148, 'loss/train': 1.2357772588729858} 11/07/2021 17:23:44 - INFO - __main__ - Step 143150: {'lr': 2.6389352220374527e-06, 'samples': 27484800, 'steps': 143149, 'loss/train': 1.5788525342941284} 11/07/2021 17:23:44 - INFO - __main__ - Step 143151: {'lr': 2.6381662563857435e-06, 'samples': 27484992, 'steps': 143150, 'loss/train': 0.41297590732574463} 11/07/2021 17:23:45 - INFO - __main__ - Step 143152: {'lr': 2.6373974021914325e-06, 'samples': 27485184, 'steps': 143151, 'loss/train': 1.1271954774856567} 11/07/2021 17:23:46 - INFO - __main__ - Step 143153: {'lr': 2.6366286594549094e-06, 'samples': 27485376, 'steps': 143152, 'loss/train': 1.4436837434768677} 11/07/2021 17:23:46 - INFO - __main__ - Step 143154: {'lr': 2.6358600281764788e-06, 'samples': 27485568, 'steps': 143153, 'loss/train': 1.5539597272872925} 11/07/2021 17:23:46 - INFO - __main__ - Step 143155: {'lr': 2.63509150835653e-06, 'samples': 27485760, 'steps': 143154, 'loss/train': 1.1597707271575928} 11/07/2021 17:23:47 - INFO - __main__ - Step 143156: {'lr': 2.634323099995395e-06, 'samples': 27485952, 'steps': 143155, 'loss/train': 1.6924432516098022} 11/07/2021 17:23:47 - INFO - __main__ - Step 143157: {'lr': 2.6335548030934075e-06, 'samples': 27486144, 'steps': 143156, 'loss/train': 1.117418885231018} 11/07/2021 17:23:48 - INFO - __main__ - Step 143158: {'lr': 2.6327866176509284e-06, 'samples': 27486336, 'steps': 143157, 'loss/train': 1.4673759937286377} 11/07/2021 17:23:49 - INFO - __main__ - Step 143159: {'lr': 2.6320185436683187e-06, 'samples': 27486528, 'steps': 143158, 'loss/train': 1.0351814031600952} 11/07/2021 17:23:49 - INFO - __main__ - Step 143160: {'lr': 2.631250581145883e-06, 'samples': 27486720, 'steps': 143159, 'loss/train': 1.3520689010620117} 11/07/2021 17:23:49 - INFO - __main__ - Step 143161: {'lr': 2.6304827300839828e-06, 'samples': 27486912, 'steps': 143160, 'loss/train': 1.131056308746338} 11/07/2021 17:23:50 - INFO - __main__ - Step 143162: {'lr': 2.6297149904829786e-06, 'samples': 27487104, 'steps': 143161, 'loss/train': 1.3687102794647217} 11/07/2021 17:23:51 - INFO - __main__ - Step 143163: {'lr': 2.6289473623432038e-06, 'samples': 27487296, 'steps': 143162, 'loss/train': 1.264581561088562} 11/07/2021 17:23:51 - INFO - __main__ - Step 143164: {'lr': 2.6281798456650184e-06, 'samples': 27487488, 'steps': 143163, 'loss/train': 1.1428395509719849} 11/07/2021 17:23:51 - INFO - __main__ - Step 143165: {'lr': 2.6274124404487287e-06, 'samples': 27487680, 'steps': 143164, 'loss/train': 1.0253162384033203} 11/07/2021 17:23:52 - INFO - __main__ - Step 143166: {'lr': 2.6266451466947505e-06, 'samples': 27487872, 'steps': 143165, 'loss/train': 1.3080533742904663} 11/07/2021 17:23:52 - INFO - __main__ - Step 143167: {'lr': 2.6258779644033616e-06, 'samples': 27488064, 'steps': 143166, 'loss/train': 1.4700219631195068} 11/07/2021 17:23:53 - INFO - __main__ - Step 143168: {'lr': 2.6251108935749224e-06, 'samples': 27488256, 'steps': 143167, 'loss/train': 1.481208086013794} 11/07/2021 17:23:54 - INFO - __main__ - Step 143169: {'lr': 2.624343934209822e-06, 'samples': 27488448, 'steps': 143168, 'loss/train': 1.316493272781372} 11/07/2021 17:23:54 - INFO - __main__ - Step 143170: {'lr': 2.623577086308393e-06, 'samples': 27488640, 'steps': 143169, 'loss/train': 1.4834753274917603} 11/07/2021 17:23:54 - INFO - __main__ - Step 143171: {'lr': 2.622810349870913e-06, 'samples': 27488832, 'steps': 143170, 'loss/train': 1.3714802265167236} 11/07/2021 17:23:55 - INFO - __main__ - Step 143172: {'lr': 2.6220437248977993e-06, 'samples': 27489024, 'steps': 143171, 'loss/train': 2.6441173553466797} 11/07/2021 17:23:55 - INFO - __main__ - Step 143173: {'lr': 2.6212772113893834e-06, 'samples': 27489216, 'steps': 143172, 'loss/train': 1.3320523500442505} 11/07/2021 17:23:56 - INFO - __main__ - Step 143174: {'lr': 2.6205108093459997e-06, 'samples': 27489408, 'steps': 143173, 'loss/train': 1.55935537815094} 11/07/2021 17:23:56 - INFO - __main__ - Step 143175: {'lr': 2.6197445187679802e-06, 'samples': 27489600, 'steps': 143174, 'loss/train': 1.488198161125183} 11/07/2021 17:23:57 - INFO - __main__ - Step 143176: {'lr': 2.6189783396556866e-06, 'samples': 27489792, 'steps': 143175, 'loss/train': 1.3712966442108154} 11/07/2021 17:23:57 - INFO - __main__ - Step 143177: {'lr': 2.6182122720094794e-06, 'samples': 27489984, 'steps': 143176, 'loss/train': 1.459452509880066} 11/07/2021 17:23:57 - INFO - __main__ - Step 143178: {'lr': 2.6174463158296913e-06, 'samples': 27490176, 'steps': 143177, 'loss/train': 1.4745593070983887} 11/07/2021 17:23:58 - INFO - __main__ - Step 143179: {'lr': 2.6166804711166558e-06, 'samples': 27490368, 'steps': 143178, 'loss/train': 1.4299626350402832} 11/07/2021 17:23:59 - INFO - __main__ - Step 143180: {'lr': 2.615914737870706e-06, 'samples': 27490560, 'steps': 143179, 'loss/train': 1.0029151439666748} 11/07/2021 17:23:59 - INFO - __main__ - Step 143181: {'lr': 2.615149116092258e-06, 'samples': 27490752, 'steps': 143180, 'loss/train': 1.3210632801055908} 11/07/2021 17:23:59 - INFO - __main__ - Step 143182: {'lr': 2.6143836057815616e-06, 'samples': 27490944, 'steps': 143181, 'loss/train': 1.4009207487106323} 11/07/2021 17:24:00 - INFO - __main__ - Step 143183: {'lr': 2.6136182069390335e-06, 'samples': 27491136, 'steps': 143182, 'loss/train': 1.6450167894363403} 11/07/2021 17:24:01 - INFO - __main__ - Step 143184: {'lr': 2.6128529195649786e-06, 'samples': 27491328, 'steps': 143183, 'loss/train': 1.7909328937530518} 11/07/2021 17:24:01 - INFO - __main__ - Step 143185: {'lr': 2.612087743659758e-06, 'samples': 27491520, 'steps': 143184, 'loss/train': 1.4661803245544434} 11/07/2021 17:24:02 - INFO - __main__ - Step 143186: {'lr': 2.6113226792237045e-06, 'samples': 27491712, 'steps': 143185, 'loss/train': 0.9661522507667542} 11/07/2021 17:24:02 - INFO - __main__ - Step 143187: {'lr': 2.610557726257179e-06, 'samples': 27491904, 'steps': 143186, 'loss/train': 1.2652573585510254} 11/07/2021 17:24:02 - INFO - __main__ - Step 143188: {'lr': 2.6097928847605145e-06, 'samples': 27492096, 'steps': 143187, 'loss/train': 0.9811277389526367} 11/07/2021 17:24:03 - INFO - __main__ - Step 143189: {'lr': 2.609028154734072e-06, 'samples': 27492288, 'steps': 143188, 'loss/train': 1.3476749658584595} 11/07/2021 17:24:04 - INFO - __main__ - Step 143190: {'lr': 2.6082635361781848e-06, 'samples': 27492480, 'steps': 143189, 'loss/train': 0.5885257124900818} 11/07/2021 17:24:04 - INFO - __main__ - Step 143191: {'lr': 2.6074990290931857e-06, 'samples': 27492672, 'steps': 143190, 'loss/train': 1.3824622631072998} 11/07/2021 17:24:04 - INFO - __main__ - Step 143192: {'lr': 2.606734633479435e-06, 'samples': 27492864, 'steps': 143191, 'loss/train': 1.0515965223312378} 11/07/2021 17:24:05 - INFO - __main__ - Step 143193: {'lr': 2.6059703493372665e-06, 'samples': 27493056, 'steps': 143192, 'loss/train': 1.292260766029358} 11/07/2021 17:24:06 - INFO - __main__ - Step 143194: {'lr': 2.605206176667041e-06, 'samples': 27493248, 'steps': 143193, 'loss/train': 1.8182318210601807} 11/07/2021 17:24:06 - INFO - __main__ - Step 143195: {'lr': 2.604442115469091e-06, 'samples': 27493440, 'steps': 143194, 'loss/train': 1.0464165210723877} 11/07/2021 17:24:07 - INFO - __main__ - Step 143196: {'lr': 2.6036781657437505e-06, 'samples': 27493632, 'steps': 143195, 'loss/train': 1.6284898519515991} 11/07/2021 17:24:07 - INFO - __main__ - Step 143197: {'lr': 2.602914327491379e-06, 'samples': 27493824, 'steps': 143196, 'loss/train': 1.349876880645752} 11/07/2021 17:24:07 - INFO - __main__ - Step 143198: {'lr': 2.602150600712311e-06, 'samples': 27494016, 'steps': 143197, 'loss/train': 1.3478682041168213} 11/07/2021 17:24:08 - INFO - __main__ - Step 143199: {'lr': 2.601386985406906e-06, 'samples': 27494208, 'steps': 143198, 'loss/train': 1.6645318269729614} 11/07/2021 17:24:09 - INFO - __main__ - Step 143200: {'lr': 2.600623481575498e-06, 'samples': 27494400, 'steps': 143199, 'loss/train': 1.3354997634887695} 11/07/2021 17:24:09 - INFO - __main__ - Step 143201: {'lr': 2.59986008921842e-06, 'samples': 27494592, 'steps': 143200, 'loss/train': 0.06504669785499573} 11/07/2021 17:24:10 - INFO - __main__ - Step 143202: {'lr': 2.599096808336032e-06, 'samples': 27494784, 'steps': 143201, 'loss/train': 1.2529464960098267} 11/07/2021 17:24:10 - INFO - __main__ - Step 143203: {'lr': 2.5983336389286683e-06, 'samples': 27494976, 'steps': 143202, 'loss/train': 1.2604998350143433} 11/07/2021 17:24:10 - INFO - __main__ - Step 143204: {'lr': 2.5975705809966888e-06, 'samples': 27495168, 'steps': 143203, 'loss/train': 1.3823374509811401} 11/07/2021 17:24:11 - INFO - __main__ - Step 143205: {'lr': 2.5968076345404547e-06, 'samples': 27495360, 'steps': 143204, 'loss/train': 1.0156254768371582} 11/07/2021 17:24:12 - INFO - __main__ - Step 143206: {'lr': 2.596044799560243e-06, 'samples': 27495552, 'steps': 143205, 'loss/train': 1.2082103490829468} 11/07/2021 17:24:12 - INFO - __main__ - Step 143207: {'lr': 2.5952820760564435e-06, 'samples': 27495744, 'steps': 143206, 'loss/train': 1.271050214767456} 11/07/2021 17:24:12 - INFO - __main__ - Step 143208: {'lr': 2.594519464029388e-06, 'samples': 27495936, 'steps': 143207, 'loss/train': 1.4881253242492676} 11/07/2021 17:24:13 - INFO - __main__ - Step 143209: {'lr': 2.593756963479438e-06, 'samples': 27496128, 'steps': 143208, 'loss/train': 0.8646864295005798} 11/07/2021 17:24:14 - INFO - __main__ - Step 143210: {'lr': 2.5929945744068985e-06, 'samples': 27496320, 'steps': 143209, 'loss/train': 1.1699085235595703} 11/07/2021 17:24:14 - INFO - __main__ - Step 143211: {'lr': 2.5922322968121583e-06, 'samples': 27496512, 'steps': 143210, 'loss/train': 1.2318823337554932} 11/07/2021 17:24:14 - INFO - __main__ - Step 143212: {'lr': 2.59147013069555e-06, 'samples': 27496704, 'steps': 143211, 'loss/train': 1.1817187070846558} 11/07/2021 17:24:15 - INFO - __main__ - Step 143213: {'lr': 2.59070807605738e-06, 'samples': 27496896, 'steps': 143212, 'loss/train': 1.135282278060913} 11/07/2021 17:24:15 - INFO - __main__ - Step 143214: {'lr': 2.589946132898036e-06, 'samples': 27497088, 'steps': 143213, 'loss/train': 1.173911213874817} 11/07/2021 17:24:16 - INFO - __main__ - Step 143215: {'lr': 2.589184301217823e-06, 'samples': 27497280, 'steps': 143214, 'loss/train': 0.9936124086380005} 11/07/2021 17:24:17 - INFO - __main__ - Step 143216: {'lr': 2.58842258101713e-06, 'samples': 27497472, 'steps': 143215, 'loss/train': 1.0989500284194946} 11/07/2021 17:24:17 - INFO - __main__ - Step 143217: {'lr': 2.587660972296263e-06, 'samples': 27497664, 'steps': 143216, 'loss/train': 1.8307557106018066} 11/07/2021 17:24:17 - INFO - __main__ - Step 143218: {'lr': 2.5868994750556095e-06, 'samples': 27497856, 'steps': 143217, 'loss/train': 1.4821462631225586} 11/07/2021 17:24:18 - INFO - __main__ - Step 143219: {'lr': 2.5861380892954477e-06, 'samples': 27498048, 'steps': 143218, 'loss/train': 1.3692153692245483} 11/07/2021 17:24:19 - INFO - __main__ - Step 143220: {'lr': 2.585376815016166e-06, 'samples': 27498240, 'steps': 143219, 'loss/train': 1.5826128721237183} 11/07/2021 17:24:19 - INFO - __main__ - Step 143221: {'lr': 2.5846156522180977e-06, 'samples': 27498432, 'steps': 143220, 'loss/train': 0.9323861002922058} 11/07/2021 17:24:20 - INFO - __main__ - Step 143222: {'lr': 2.583854600901575e-06, 'samples': 27498624, 'steps': 143221, 'loss/train': 1.7371087074279785} 11/07/2021 17:24:20 - INFO - __main__ - Step 143223: {'lr': 2.5830936610669597e-06, 'samples': 27498816, 'steps': 143222, 'loss/train': 1.73583984375} 11/07/2021 17:24:20 - INFO - __main__ - Step 143224: {'lr': 2.5823328327145844e-06, 'samples': 27499008, 'steps': 143223, 'loss/train': 0.9727363586425781} 11/07/2021 17:24:21 - INFO - __main__ - Step 143225: {'lr': 2.5815721158447825e-06, 'samples': 27499200, 'steps': 143224, 'loss/train': 1.3958663940429688} 11/07/2021 17:24:22 - INFO - __main__ - Step 143226: {'lr': 2.5808115104579144e-06, 'samples': 27499392, 'steps': 143225, 'loss/train': 1.130064606666565} 11/07/2021 17:24:22 - INFO - __main__ - Step 143227: {'lr': 2.5800510165542855e-06, 'samples': 27499584, 'steps': 143226, 'loss/train': 1.4648053646087646} 11/07/2021 17:24:23 - INFO - __main__ - Step 143228: {'lr': 2.5792906341343125e-06, 'samples': 27499776, 'steps': 143227, 'loss/train': 1.3460942506790161} 11/07/2021 17:24:23 - INFO - __main__ - Step 143229: {'lr': 2.5785303631982727e-06, 'samples': 27499968, 'steps': 143228, 'loss/train': 0.9702025651931763} 11/07/2021 17:24:23 - INFO - __main__ - Step 143230: {'lr': 2.5777702037465267e-06, 'samples': 27500160, 'steps': 143229, 'loss/train': 0.6493167281150818} 11/07/2021 17:24:24 - INFO - __main__ - Step 143231: {'lr': 2.5770101557794077e-06, 'samples': 27500352, 'steps': 143230, 'loss/train': 1.4902944564819336} 11/07/2021 17:24:25 - INFO - __main__ - Step 143232: {'lr': 2.576250219297305e-06, 'samples': 27500544, 'steps': 143231, 'loss/train': 1.6640734672546387} 11/07/2021 17:24:25 - INFO - __main__ - Step 143233: {'lr': 2.575490394300495e-06, 'samples': 27500736, 'steps': 143232, 'loss/train': 0.9478687644004822} 11/07/2021 17:24:25 - INFO - __main__ - Step 143234: {'lr': 2.5747306807893665e-06, 'samples': 27500928, 'steps': 143233, 'loss/train': 0.5690730810165405} 11/07/2021 17:24:26 - INFO - __main__ - Step 143235: {'lr': 2.5739710787642534e-06, 'samples': 27501120, 'steps': 143234, 'loss/train': 1.334101915359497} 11/07/2021 17:24:26 - INFO - __main__ - Step 143236: {'lr': 2.57321158822546e-06, 'samples': 27501312, 'steps': 143235, 'loss/train': 0.9289902448654175} 11/07/2021 17:24:27 - INFO - __main__ - Step 143237: {'lr': 2.572452209173404e-06, 'samples': 27501504, 'steps': 143236, 'loss/train': 1.6957112550735474} 11/07/2021 17:24:27 - INFO - __main__ - Step 143238: {'lr': 2.5716929416083336e-06, 'samples': 27501696, 'steps': 143237, 'loss/train': 1.1813822984695435} 11/07/2021 17:24:28 - INFO - __main__ - Step 143239: {'lr': 2.570933785530666e-06, 'samples': 27501888, 'steps': 143238, 'loss/train': 0.9631576538085938} 11/07/2021 17:24:28 - INFO - __main__ - Step 143240: {'lr': 2.570174740940734e-06, 'samples': 27502080, 'steps': 143239, 'loss/train': 1.092149257659912} 11/07/2021 17:24:28 - INFO - __main__ - Step 143241: {'lr': 2.569415807838843e-06, 'samples': 27502272, 'steps': 143240, 'loss/train': 1.3520045280456543} 11/07/2021 17:24:30 - INFO - __main__ - Step 143242: {'lr': 2.568656986225354e-06, 'samples': 27502464, 'steps': 143241, 'loss/train': 1.4381605386734009} 11/07/2021 17:24:30 - INFO - __main__ - Step 143243: {'lr': 2.5678982761005997e-06, 'samples': 27502656, 'steps': 143242, 'loss/train': 0.9299049377441406} 11/07/2021 17:24:30 - INFO - __main__ - Step 143244: {'lr': 2.5671396774649412e-06, 'samples': 27502848, 'steps': 143243, 'loss/train': 1.4711495637893677} 11/07/2021 17:24:31 - INFO - __main__ - Step 143245: {'lr': 2.5663811903187117e-06, 'samples': 27503040, 'steps': 143244, 'loss/train': 1.6916216611862183} 11/07/2021 17:24:31 - INFO - __main__ - Step 143246: {'lr': 2.5656228146622718e-06, 'samples': 27503232, 'steps': 143245, 'loss/train': 0.6606189012527466} 11/07/2021 17:24:32 - INFO - __main__ - Step 143247: {'lr': 2.5648645504959265e-06, 'samples': 27503424, 'steps': 143246, 'loss/train': 1.7405369281768799} 11/07/2021 17:24:32 - INFO - __main__ - Step 143248: {'lr': 2.5641063978200375e-06, 'samples': 27503616, 'steps': 143247, 'loss/train': 1.3672902584075928} 11/07/2021 17:24:33 - INFO - __main__ - Step 143249: {'lr': 2.563348356634965e-06, 'samples': 27503808, 'steps': 143248, 'loss/train': 1.2652555704116821} 11/07/2021 17:24:33 - INFO - __main__ - Step 143250: {'lr': 2.5625904269409863e-06, 'samples': 27504000, 'steps': 143249, 'loss/train': 1.1815279722213745} 11/07/2021 17:24:33 - INFO - __main__ - Step 143251: {'lr': 2.561832608738518e-06, 'samples': 27504192, 'steps': 143250, 'loss/train': 1.1998982429504395} 11/07/2021 17:24:35 - INFO - __main__ - Step 143252: {'lr': 2.561074902027866e-06, 'samples': 27504384, 'steps': 143251, 'loss/train': 1.9677497148513794} 11/07/2021 17:24:35 - INFO - __main__ - Step 143253: {'lr': 2.560317306809362e-06, 'samples': 27504576, 'steps': 143252, 'loss/train': 1.3763078451156616} 11/07/2021 17:24:35 - INFO - __main__ - Step 143254: {'lr': 2.5595598230833684e-06, 'samples': 27504768, 'steps': 143253, 'loss/train': 1.1224350929260254} 11/07/2021 17:24:36 - INFO - __main__ - Step 143255: {'lr': 2.558802450850217e-06, 'samples': 27504960, 'steps': 143254, 'loss/train': 1.446175456047058} 11/07/2021 17:24:36 - INFO - __main__ - Step 143256: {'lr': 2.55804519011027e-06, 'samples': 27505152, 'steps': 143255, 'loss/train': 1.1849713325500488} 11/07/2021 17:24:37 - INFO - __main__ - Step 143257: {'lr': 2.557288040863831e-06, 'samples': 27505344, 'steps': 143256, 'loss/train': 1.0031678676605225} 11/07/2021 17:24:37 - INFO - __main__ - Step 143258: {'lr': 2.556531003111262e-06, 'samples': 27505536, 'steps': 143257, 'loss/train': 1.3088963031768799} 11/07/2021 17:24:38 - INFO - __main__ - Step 143259: {'lr': 2.555774076852896e-06, 'samples': 27505728, 'steps': 143258, 'loss/train': 1.7246683835983276} 11/07/2021 17:24:38 - INFO - __main__ - Step 143260: {'lr': 2.5550172620890933e-06, 'samples': 27505920, 'steps': 143259, 'loss/train': 1.17474365234375} 11/07/2021 17:24:38 - INFO - __main__ - Step 143261: {'lr': 2.5542605588201597e-06, 'samples': 27506112, 'steps': 143260, 'loss/train': 1.0406807661056519} 11/07/2021 17:24:39 - INFO - __main__ - Step 143262: {'lr': 2.5535039670464833e-06, 'samples': 27506304, 'steps': 143261, 'loss/train': 1.4601143598556519} 11/07/2021 17:24:40 - INFO - __main__ - Step 143263: {'lr': 2.5527474867683697e-06, 'samples': 27506496, 'steps': 143262, 'loss/train': 1.169148564338684} 11/07/2021 17:24:40 - INFO - __main__ - Step 143264: {'lr': 2.5519911179861523e-06, 'samples': 27506688, 'steps': 143263, 'loss/train': 1.4166866540908813} 11/07/2021 17:24:40 - INFO - __main__ - Step 143265: {'lr': 2.551234860700219e-06, 'samples': 27506880, 'steps': 143264, 'loss/train': 1.630475640296936} 11/07/2021 17:24:41 - INFO - __main__ - Step 143266: {'lr': 2.5504787149108756e-06, 'samples': 27507072, 'steps': 143265, 'loss/train': 1.1973953247070312} 11/07/2021 17:24:41 - INFO - __main__ - Step 143267: {'lr': 2.5497226806184548e-06, 'samples': 27507264, 'steps': 143266, 'loss/train': 1.6762111186981201} 11/07/2021 17:24:42 - INFO - __main__ - Step 143268: {'lr': 2.548966757823318e-06, 'samples': 27507456, 'steps': 143267, 'loss/train': 1.285744309425354} 11/07/2021 17:24:42 - INFO - __main__ - Step 143269: {'lr': 2.5482109465257975e-06, 'samples': 27507648, 'steps': 143268, 'loss/train': 1.2869946956634521} 11/07/2021 17:24:43 - INFO - __main__ - Step 143270: {'lr': 2.547455246726227e-06, 'samples': 27507840, 'steps': 143269, 'loss/train': 0.5929316282272339} 11/07/2021 17:24:43 - INFO - __main__ - Step 143271: {'lr': 2.546699658424939e-06, 'samples': 27508032, 'steps': 143270, 'loss/train': 1.0836069583892822} 11/07/2021 17:24:44 - INFO - __main__ - Step 143272: {'lr': 2.5459441816223504e-06, 'samples': 27508224, 'steps': 143271, 'loss/train': 1.5260038375854492} 11/07/2021 17:24:44 - INFO - __main__ - Step 143273: {'lr': 2.5451888163186833e-06, 'samples': 27508416, 'steps': 143272, 'loss/train': 1.674448013305664} 11/07/2021 17:24:45 - INFO - __main__ - Step 143274: {'lr': 2.5444335625143533e-06, 'samples': 27508608, 'steps': 143273, 'loss/train': 1.2082223892211914} 11/07/2021 17:24:45 - INFO - __main__ - Step 143275: {'lr': 2.543678420209694e-06, 'samples': 27508800, 'steps': 143274, 'loss/train': 1.3800853490829468} 11/07/2021 17:24:46 - INFO - __main__ - Step 143276: {'lr': 2.5429233894050108e-06, 'samples': 27508992, 'steps': 143275, 'loss/train': 0.9185590744018555} 11/07/2021 17:24:46 - INFO - __main__ - Step 143277: {'lr': 2.542168470100692e-06, 'samples': 27509184, 'steps': 143276, 'loss/train': 1.6537666320800781} 11/07/2021 17:24:47 - INFO - __main__ - Step 143278: {'lr': 2.541413662297043e-06, 'samples': 27509376, 'steps': 143277, 'loss/train': 1.1287097930908203} 11/07/2021 17:24:47 - INFO - __main__ - Step 143279: {'lr': 2.5406589659944245e-06, 'samples': 27509568, 'steps': 143278, 'loss/train': 0.9637224078178406} 11/07/2021 17:24:48 - INFO - __main__ - Step 143280: {'lr': 2.5399043811931423e-06, 'samples': 27509760, 'steps': 143279, 'loss/train': 1.2630906105041504} 11/07/2021 17:24:48 - INFO - __main__ - Step 143281: {'lr': 2.5391499078935845e-06, 'samples': 27509952, 'steps': 143280, 'loss/train': 1.5649242401123047} 11/07/2021 17:24:48 - INFO - __main__ - Step 143282: {'lr': 2.5383955460960564e-06, 'samples': 27510144, 'steps': 143281, 'loss/train': 1.1703214645385742} 11/07/2021 17:24:49 - INFO - __main__ - Step 143283: {'lr': 2.537641295800891e-06, 'samples': 27510336, 'steps': 143282, 'loss/train': 1.4794450998306274} 11/07/2021 17:24:50 - INFO - __main__ - Step 143284: {'lr': 2.5368871570084772e-06, 'samples': 27510528, 'steps': 143283, 'loss/train': 0.3629586398601532} 11/07/2021 17:24:50 - INFO - __main__ - Step 143285: {'lr': 2.5361331297191203e-06, 'samples': 27510720, 'steps': 143284, 'loss/train': 1.0768625736236572} 11/07/2021 17:24:50 - INFO - __main__ - Step 143286: {'lr': 2.535379213933153e-06, 'samples': 27510912, 'steps': 143285, 'loss/train': 1.4653077125549316} 11/07/2021 17:24:51 - INFO - __main__ - Step 143287: {'lr': 2.5346254096509368e-06, 'samples': 27511104, 'steps': 143286, 'loss/train': 1.1513532400131226} 11/07/2021 17:24:52 - INFO - __main__ - Step 143288: {'lr': 2.5338717168727767e-06, 'samples': 27511296, 'steps': 143287, 'loss/train': 1.239648699760437} 11/07/2021 17:24:52 - INFO - __main__ - Step 143289: {'lr': 2.533118135599061e-06, 'samples': 27511488, 'steps': 143288, 'loss/train': 1.0048454999923706} 11/07/2021 17:24:52 - INFO - __main__ - Step 143290: {'lr': 2.5323646658300945e-06, 'samples': 27511680, 'steps': 143289, 'loss/train': 0.8918119668960571} 11/07/2021 17:24:53 - INFO - __main__ - Step 143291: {'lr': 2.5316113075662115e-06, 'samples': 27511872, 'steps': 143290, 'loss/train': 1.3815226554870605} 11/07/2021 17:24:53 - INFO - __main__ - Step 143292: {'lr': 2.5308580608077726e-06, 'samples': 27512064, 'steps': 143291, 'loss/train': 1.306061863899231} 11/07/2021 17:24:54 - INFO - __main__ - Step 143293: {'lr': 2.53010492555511e-06, 'samples': 27512256, 'steps': 143292, 'loss/train': 0.9925764799118042} 11/07/2021 17:24:55 - INFO - __main__ - Step 143294: {'lr': 2.5293519018085575e-06, 'samples': 27512448, 'steps': 143293, 'loss/train': 1.2038071155548096} 11/07/2021 17:24:55 - INFO - __main__ - Step 143295: {'lr': 2.528598989568476e-06, 'samples': 27512640, 'steps': 143294, 'loss/train': 1.0071666240692139} 11/07/2021 17:24:55 - INFO - __main__ - Step 143296: {'lr': 2.527846188835198e-06, 'samples': 27512832, 'steps': 143295, 'loss/train': 1.112127661705017} 11/07/2021 17:24:56 - INFO - __main__ - Step 143297: {'lr': 2.5270934996090287e-06, 'samples': 27513024, 'steps': 143296, 'loss/train': 0.8538229465484619} 11/07/2021 17:24:56 - INFO - __main__ - Step 143298: {'lr': 2.52634092189033e-06, 'samples': 27513216, 'steps': 143297, 'loss/train': 1.5310124158859253} 11/07/2021 17:24:57 - INFO - __main__ - Step 143299: {'lr': 2.5255884556794896e-06, 'samples': 27513408, 'steps': 143298, 'loss/train': 1.4411041736602783} 11/07/2021 17:24:58 - INFO - __main__ - Step 143300: {'lr': 2.5248361009767573e-06, 'samples': 27513600, 'steps': 143299, 'loss/train': 2.3893492221832275} 11/07/2021 17:24:58 - INFO - __main__ - Step 143301: {'lr': 2.524083857782522e-06, 'samples': 27513792, 'steps': 143300, 'loss/train': 1.268620491027832} 11/07/2021 17:24:58 - INFO - __main__ - Step 143302: {'lr': 2.5233317260971167e-06, 'samples': 27513984, 'steps': 143301, 'loss/train': 0.9851171970367432} 11/07/2021 17:24:59 - INFO - __main__ - Step 143303: {'lr': 2.5225797059208745e-06, 'samples': 27514176, 'steps': 143302, 'loss/train': 1.3827793598175049} 11/07/2021 17:25:00 - INFO - __main__ - Step 143304: {'lr': 2.5218277972541557e-06, 'samples': 27514368, 'steps': 143303, 'loss/train': 1.3709235191345215} 11/07/2021 17:25:00 - INFO - __main__ - Step 143305: {'lr': 2.5210760000972666e-06, 'samples': 27514560, 'steps': 143304, 'loss/train': 1.5839638710021973} 11/07/2021 17:25:00 - INFO - __main__ - Step 143306: {'lr': 2.520324314450567e-06, 'samples': 27514752, 'steps': 143305, 'loss/train': 1.490950345993042} 11/07/2021 17:25:01 - INFO - __main__ - Step 143307: {'lr': 2.519572740314391e-06, 'samples': 27514944, 'steps': 143306, 'loss/train': 0.867405891418457} 11/07/2021 17:25:01 - INFO - __main__ - Step 143308: {'lr': 2.5188212776890705e-06, 'samples': 27515136, 'steps': 143307, 'loss/train': 1.5394266843795776} 11/07/2021 17:25:02 - INFO - __main__ - Step 143309: {'lr': 2.5180699265749673e-06, 'samples': 27515328, 'steps': 143308, 'loss/train': 0.8926482796669006} 11/07/2021 17:25:03 - INFO - __main__ - Step 143310: {'lr': 2.5173186869723865e-06, 'samples': 27515520, 'steps': 143309, 'loss/train': 1.4086191654205322} 11/07/2021 17:25:03 - INFO - __main__ - Step 143311: {'lr': 2.5165675588816885e-06, 'samples': 27515712, 'steps': 143310, 'loss/train': 1.3291757106781006} 11/07/2021 17:25:03 - INFO - __main__ - Step 143312: {'lr': 2.5158165423032067e-06, 'samples': 27515904, 'steps': 143311, 'loss/train': 1.4295976161956787} 11/07/2021 17:25:04 - INFO - __main__ - Step 143313: {'lr': 2.5150656372373014e-06, 'samples': 27516096, 'steps': 143312, 'loss/train': 0.9118059873580933} 11/07/2021 17:25:05 - INFO - __main__ - Step 143314: {'lr': 2.514314843684251e-06, 'samples': 27516288, 'steps': 143313, 'loss/train': 1.7586244344711304} 11/07/2021 17:25:05 - INFO - __main__ - Step 143315: {'lr': 2.5135641616444437e-06, 'samples': 27516480, 'steps': 143314, 'loss/train': 1.3358138799667358} 11/07/2021 17:25:05 - INFO - __main__ - Step 143316: {'lr': 2.5128135911182125e-06, 'samples': 27516672, 'steps': 143315, 'loss/train': 1.2418087720870972} 11/07/2021 17:25:06 - INFO - __main__ - Step 143317: {'lr': 2.5120631321058907e-06, 'samples': 27516864, 'steps': 143316, 'loss/train': 1.2012109756469727} 11/07/2021 17:25:06 - INFO - __main__ - Step 143318: {'lr': 2.511312784607811e-06, 'samples': 27517056, 'steps': 143317, 'loss/train': 0.8652546405792236} 11/07/2021 17:25:06 - INFO - __main__ - Step 143319: {'lr': 2.5105625486243067e-06, 'samples': 27517248, 'steps': 143318, 'loss/train': 0.07869917899370193} 11/07/2021 17:25:08 - INFO - __main__ - Step 143320: {'lr': 2.5098124241557387e-06, 'samples': 27517440, 'steps': 143319, 'loss/train': 1.0101189613342285} 11/07/2021 17:25:08 - INFO - __main__ - Step 143321: {'lr': 2.509062411202412e-06, 'samples': 27517632, 'steps': 143320, 'loss/train': 1.2956068515777588} 11/07/2021 17:25:08 - INFO - __main__ - Step 143322: {'lr': 2.5083125097646875e-06, 'samples': 27517824, 'steps': 143321, 'loss/train': 1.093543291091919} 11/07/2021 17:25:09 - INFO - __main__ - Step 143323: {'lr': 2.5075627198428985e-06, 'samples': 27518016, 'steps': 143322, 'loss/train': 1.318969964981079} 11/07/2021 17:25:09 - INFO - __main__ - Step 143324: {'lr': 2.506813041437378e-06, 'samples': 27518208, 'steps': 143323, 'loss/train': 1.1337623596191406} 11/07/2021 17:25:10 - INFO - __main__ - Step 143325: {'lr': 2.5060634745484867e-06, 'samples': 27518400, 'steps': 143324, 'loss/train': 1.5147203207015991} 11/07/2021 17:25:10 - INFO - __main__ - Step 143326: {'lr': 2.50531401917653e-06, 'samples': 27518592, 'steps': 143325, 'loss/train': 4.1014628410339355} 11/07/2021 17:25:11 - INFO - __main__ - Step 143327: {'lr': 2.5045646753218687e-06, 'samples': 27518784, 'steps': 143326, 'loss/train': 0.9942875504493713} 11/07/2021 17:25:11 - INFO - __main__ - Step 143328: {'lr': 2.503815442984836e-06, 'samples': 27518976, 'steps': 143327, 'loss/train': 1.4323598146438599} 11/07/2021 17:25:12 - INFO - __main__ - Step 143329: {'lr': 2.5030663221657646e-06, 'samples': 27519168, 'steps': 143328, 'loss/train': 1.4835432767868042} 11/07/2021 17:25:12 - INFO - __main__ - Step 143330: {'lr': 2.5023173128649603e-06, 'samples': 27519360, 'steps': 143329, 'loss/train': 1.4749822616577148} 11/07/2021 17:25:13 - INFO - __main__ - Step 143331: {'lr': 2.5015684150828112e-06, 'samples': 27519552, 'steps': 143330, 'loss/train': 1.4246472120285034} 11/07/2021 17:25:13 - INFO - __main__ - Step 143332: {'lr': 2.5008196288196504e-06, 'samples': 27519744, 'steps': 143331, 'loss/train': 1.534843921661377} 11/07/2021 17:25:14 - INFO - __main__ - Step 143333: {'lr': 2.500070954075784e-06, 'samples': 27519936, 'steps': 143332, 'loss/train': 1.2412059307098389} 11/07/2021 17:25:14 - INFO - __main__ - Step 143334: {'lr': 2.499322390851572e-06, 'samples': 27520128, 'steps': 143333, 'loss/train': 1.5615357160568237} 11/07/2021 17:25:14 - INFO - __main__ - Step 143335: {'lr': 2.498573939147347e-06, 'samples': 27520320, 'steps': 143334, 'loss/train': 1.5613257884979248} 11/07/2021 17:25:15 - INFO - __main__ - Step 143336: {'lr': 2.4978255989634436e-06, 'samples': 27520512, 'steps': 143335, 'loss/train': 1.2794294357299805} 11/07/2021 17:25:16 - INFO - __main__ - Step 143337: {'lr': 2.4970773703002216e-06, 'samples': 27520704, 'steps': 143336, 'loss/train': 1.4160057306289673} 11/07/2021 17:25:16 - INFO - __main__ - Step 143338: {'lr': 2.4963292531579584e-06, 'samples': 27520896, 'steps': 143337, 'loss/train': 1.2971636056900024} 11/07/2021 17:25:16 - INFO - __main__ - Step 143339: {'lr': 2.495581247537071e-06, 'samples': 27521088, 'steps': 143338, 'loss/train': 1.212328553199768} 11/07/2021 17:25:17 - INFO - __main__ - Step 143340: {'lr': 2.4948333534378364e-06, 'samples': 27521280, 'steps': 143339, 'loss/train': 1.1324458122253418} 11/07/2021 17:25:18 - INFO - __main__ - Step 143341: {'lr': 2.494085570860616e-06, 'samples': 27521472, 'steps': 143340, 'loss/train': 1.0032962560653687} 11/07/2021 17:25:18 - INFO - __main__ - Step 143342: {'lr': 2.493337899805742e-06, 'samples': 27521664, 'steps': 143341, 'loss/train': 1.6153002977371216} 11/07/2021 17:25:18 - INFO - __main__ - Step 143343: {'lr': 2.492590340273548e-06, 'samples': 27521856, 'steps': 143342, 'loss/train': 1.2798871994018555} 11/07/2021 17:25:19 - INFO - __main__ - Step 143344: {'lr': 2.4918428922643676e-06, 'samples': 27522048, 'steps': 143343, 'loss/train': 1.4385101795196533} 11/07/2021 17:25:19 - INFO - __main__ - Step 143345: {'lr': 2.4910955557785332e-06, 'samples': 27522240, 'steps': 143344, 'loss/train': 1.9193357229232788} 11/07/2021 17:25:20 - INFO - __main__ - Step 143346: {'lr': 2.4903483308164054e-06, 'samples': 27522432, 'steps': 143345, 'loss/train': 1.5697529315948486} 11/07/2021 17:25:21 - INFO - __main__ - Step 143347: {'lr': 2.48960121737829e-06, 'samples': 27522624, 'steps': 143346, 'loss/train': 1.1515952348709106} 11/07/2021 17:25:21 - INFO - __main__ - Step 143348: {'lr': 2.4888542154645756e-06, 'samples': 27522816, 'steps': 143347, 'loss/train': 1.3250839710235596} 11/07/2021 17:25:21 - INFO - __main__ - Step 143349: {'lr': 2.4881073250755394e-06, 'samples': 27523008, 'steps': 143348, 'loss/train': 0.5534660220146179} 11/07/2021 17:25:22 - INFO - __main__ - Step 143350: {'lr': 2.487360546211542e-06, 'samples': 27523200, 'steps': 143349, 'loss/train': 1.0795588493347168} 11/07/2021 17:25:22 - INFO - __main__ - Step 143351: {'lr': 2.4866138788729176e-06, 'samples': 27523392, 'steps': 143350, 'loss/train': 1.3386191129684448} 11/07/2021 17:25:23 - INFO - __main__ - Step 143352: {'lr': 2.4858673230600258e-06, 'samples': 27523584, 'steps': 143351, 'loss/train': 1.569571614265442} 11/07/2021 17:25:24 - INFO - __main__ - Step 143353: {'lr': 2.4851208787732003e-06, 'samples': 27523776, 'steps': 143352, 'loss/train': 0.9973859786987305} 11/07/2021 17:25:24 - INFO - __main__ - Step 143354: {'lr': 2.4843745460127187e-06, 'samples': 27523968, 'steps': 143353, 'loss/train': 1.1891422271728516} 11/07/2021 17:25:24 - INFO - __main__ - Step 143355: {'lr': 2.4836283247789693e-06, 'samples': 27524160, 'steps': 143354, 'loss/train': 1.4171384572982788} 11/07/2021 17:25:25 - INFO - __main__ - Step 143356: {'lr': 2.4828822150722853e-06, 'samples': 27524352, 'steps': 143355, 'loss/train': 1.0675896406173706} 11/07/2021 17:25:26 - INFO - __main__ - Step 143357: {'lr': 2.4821362168929718e-06, 'samples': 27524544, 'steps': 143356, 'loss/train': 1.6267876625061035} 11/07/2021 17:25:26 - INFO - __main__ - Step 143358: {'lr': 2.4813903302414175e-06, 'samples': 27524736, 'steps': 143357, 'loss/train': 1.1240663528442383} 11/07/2021 17:25:26 - INFO - __main__ - Step 143359: {'lr': 2.480644555117928e-06, 'samples': 27524928, 'steps': 143358, 'loss/train': 1.2759196758270264} 11/07/2021 17:25:27 - INFO - __main__ - Step 143360: {'lr': 2.4798988915228083e-06, 'samples': 27525120, 'steps': 143359, 'loss/train': 1.2704404592514038} 11/07/2021 17:25:27 - INFO - __main__ - Step 143361: {'lr': 2.4791533394564467e-06, 'samples': 27525312, 'steps': 143360, 'loss/train': 1.4275022745132446} 11/07/2021 17:25:28 - INFO - __main__ - Step 143362: {'lr': 2.478407898919177e-06, 'samples': 27525504, 'steps': 143361, 'loss/train': 1.778362512588501} 11/07/2021 17:25:28 - INFO - __main__ - Step 143363: {'lr': 2.477662569911304e-06, 'samples': 27525696, 'steps': 143362, 'loss/train': 1.4232757091522217} 11/07/2021 17:25:29 - INFO - __main__ - Step 143364: {'lr': 2.476917352433161e-06, 'samples': 27525888, 'steps': 143363, 'loss/train': 1.1557283401489258} 11/07/2021 17:25:29 - INFO - __main__ - Step 143365: {'lr': 2.476172246485109e-06, 'samples': 27526080, 'steps': 143364, 'loss/train': 1.0152884721755981} 11/07/2021 17:25:30 - INFO - __main__ - Step 143366: {'lr': 2.4754272520674804e-06, 'samples': 27526272, 'steps': 143365, 'loss/train': 1.4113589525222778} 11/07/2021 17:25:31 - INFO - __main__ - Step 143367: {'lr': 2.4746823691806362e-06, 'samples': 27526464, 'steps': 143366, 'loss/train': 1.5284326076507568} 11/07/2021 17:25:31 - INFO - __main__ - Step 143368: {'lr': 2.4739375978248268e-06, 'samples': 27526656, 'steps': 143367, 'loss/train': 1.4639450311660767} 11/07/2021 17:25:31 - INFO - __main__ - Step 143369: {'lr': 2.473192938000468e-06, 'samples': 27526848, 'steps': 143368, 'loss/train': 1.38168466091156} 11/07/2021 17:25:32 - INFO - __main__ - Step 143370: {'lr': 2.4724483897078653e-06, 'samples': 27527040, 'steps': 143369, 'loss/train': 0.9012822508811951} 11/07/2021 17:25:32 - INFO - __main__ - Step 143371: {'lr': 2.4717039529473516e-06, 'samples': 27527232, 'steps': 143370, 'loss/train': 1.5906472206115723} 11/07/2021 17:25:33 - INFO - __main__ - Step 143372: {'lr': 2.4709596277192605e-06, 'samples': 27527424, 'steps': 143371, 'loss/train': 1.3006929159164429} 11/07/2021 17:25:33 - INFO - __main__ - Step 143373: {'lr': 2.470215414023952e-06, 'samples': 27527616, 'steps': 143372, 'loss/train': 1.0429545640945435} 11/07/2021 17:25:34 - INFO - __main__ - Step 143374: {'lr': 2.469471311861732e-06, 'samples': 27527808, 'steps': 143373, 'loss/train': 0.8359832167625427} 11/07/2021 17:25:34 - INFO - __main__ - Step 143375: {'lr': 2.468727321232961e-06, 'samples': 27528000, 'steps': 143374, 'loss/train': 1.1661381721496582} 11/07/2021 17:25:34 - INFO - __main__ - Step 143376: {'lr': 2.467983442137972e-06, 'samples': 27528192, 'steps': 143375, 'loss/train': 1.0920851230621338} 11/07/2021 17:25:36 - INFO - __main__ - Step 143377: {'lr': 2.467239674577071e-06, 'samples': 27528384, 'steps': 143376, 'loss/train': 1.2415399551391602} 11/07/2021 17:25:36 - INFO - __main__ - Step 143378: {'lr': 2.466496018550618e-06, 'samples': 27528576, 'steps': 143377, 'loss/train': 1.4441180229187012} 11/07/2021 17:25:36 - INFO - __main__ - Step 143379: {'lr': 2.4657524740589187e-06, 'samples': 27528768, 'steps': 143378, 'loss/train': 1.9400511980056763} 11/07/2021 17:25:37 - INFO - __main__ - Step 143380: {'lr': 2.4650090411023895e-06, 'samples': 27528960, 'steps': 143379, 'loss/train': 1.2694542407989502} 11/07/2021 17:25:37 - INFO - __main__ - Step 143381: {'lr': 2.464265719681252e-06, 'samples': 27529152, 'steps': 143380, 'loss/train': 1.4788669347763062} 11/07/2021 17:25:37 - INFO - __main__ - Step 143382: {'lr': 2.4635225097959234e-06, 'samples': 27529344, 'steps': 143381, 'loss/train': 1.231265664100647} 11/07/2021 17:25:38 - INFO - __main__ - Step 143383: {'lr': 2.4627794114467086e-06, 'samples': 27529536, 'steps': 143382, 'loss/train': 1.3653523921966553} 11/07/2021 17:25:39 - INFO - __main__ - Step 143384: {'lr': 2.46203642463394e-06, 'samples': 27529728, 'steps': 143383, 'loss/train': 1.1251736879348755} 11/07/2021 17:25:39 - INFO - __main__ - Step 143385: {'lr': 2.4612935493579513e-06, 'samples': 27529920, 'steps': 143384, 'loss/train': 1.4411828517913818} 11/07/2021 17:25:39 - INFO - __main__ - Step 143386: {'lr': 2.4605507856191035e-06, 'samples': 27530112, 'steps': 143385, 'loss/train': 1.7890355587005615} 11/07/2021 17:25:40 - INFO - __main__ - Step 143387: {'lr': 2.4598081334177015e-06, 'samples': 27530304, 'steps': 143386, 'loss/train': 0.9914727210998535} 11/07/2021 17:25:41 - INFO - __main__ - Step 143388: {'lr': 2.4590655927540783e-06, 'samples': 27530496, 'steps': 143387, 'loss/train': 1.3314306735992432} 11/07/2021 17:25:41 - INFO - __main__ - Step 143389: {'lr': 2.458323163628595e-06, 'samples': 27530688, 'steps': 143388, 'loss/train': 0.7348185777664185} 11/07/2021 17:25:41 - INFO - __main__ - Step 143390: {'lr': 2.4575808460415573e-06, 'samples': 27530880, 'steps': 143389, 'loss/train': 1.309460997581482} 11/07/2021 17:25:42 - INFO - __main__ - Step 143391: {'lr': 2.4568386399933253e-06, 'samples': 27531072, 'steps': 143390, 'loss/train': 1.388185977935791} 11/07/2021 17:25:42 - INFO - __main__ - Step 143392: {'lr': 2.456096545484232e-06, 'samples': 27531264, 'steps': 143391, 'loss/train': 1.0726583003997803} 11/07/2021 17:25:43 - INFO - __main__ - Step 143393: {'lr': 2.4553545625145835e-06, 'samples': 27531456, 'steps': 143392, 'loss/train': 1.5145584344863892} 11/07/2021 17:25:43 - INFO - __main__ - Step 143394: {'lr': 2.45461269108474e-06, 'samples': 27531648, 'steps': 143393, 'loss/train': 1.569389820098877} 11/07/2021 17:25:44 - INFO - __main__ - Step 143395: {'lr': 2.453870931195035e-06, 'samples': 27531840, 'steps': 143394, 'loss/train': 1.1403942108154297} 11/07/2021 17:25:44 - INFO - __main__ - Step 143396: {'lr': 2.453129282845801e-06, 'samples': 27532032, 'steps': 143395, 'loss/train': 1.516230821609497} 11/07/2021 17:25:45 - INFO - __main__ - Step 143397: {'lr': 2.4523877460373434e-06, 'samples': 27532224, 'steps': 143396, 'loss/train': 0.6695624589920044} 11/07/2021 17:25:46 - INFO - __main__ - Step 143398: {'lr': 2.4516463207700235e-06, 'samples': 27532416, 'steps': 143397, 'loss/train': 1.323542833328247} 11/07/2021 17:25:46 - INFO - __main__ - Step 143399: {'lr': 2.4509050070442017e-06, 'samples': 27532608, 'steps': 143398, 'loss/train': 0.9058836102485657} 11/07/2021 17:25:46 - INFO - __main__ - Step 143400: {'lr': 2.450163804860156e-06, 'samples': 27532800, 'steps': 143399, 'loss/train': 1.7190394401550293} 11/07/2021 17:25:47 - INFO - __main__ - Step 143401: {'lr': 2.4494227142182467e-06, 'samples': 27532992, 'steps': 143400, 'loss/train': 1.4401962757110596} 11/07/2021 17:25:47 - INFO - __main__ - Step 143402: {'lr': 2.4486817351188073e-06, 'samples': 27533184, 'steps': 143401, 'loss/train': 1.8958163261413574} 11/07/2021 17:25:48 - INFO - __main__ - Step 143403: {'lr': 2.4479408675621707e-06, 'samples': 27533376, 'steps': 143402, 'loss/train': 0.9155100584030151} 11/07/2021 17:25:49 - INFO - __main__ - Step 143404: {'lr': 2.4472001115486695e-06, 'samples': 27533568, 'steps': 143403, 'loss/train': 1.330793023109436} 11/07/2021 17:25:49 - INFO - __main__ - Step 143405: {'lr': 2.4464594670786655e-06, 'samples': 27533760, 'steps': 143404, 'loss/train': 1.3640568256378174} 11/07/2021 17:25:49 - INFO - __main__ - Step 143406: {'lr': 2.4457189341524634e-06, 'samples': 27533952, 'steps': 143405, 'loss/train': 1.0526080131530762} 11/07/2021 17:25:50 - INFO - __main__ - Step 143407: {'lr': 2.4449785127703683e-06, 'samples': 27534144, 'steps': 143406, 'loss/train': 1.1680355072021484} 11/07/2021 17:25:50 - INFO - __main__ - Step 143408: {'lr': 2.4442382029327693e-06, 'samples': 27534336, 'steps': 143407, 'loss/train': 1.4470627307891846} 11/07/2021 17:25:51 - INFO - __main__ - Step 143409: {'lr': 2.4434980046399715e-06, 'samples': 27534528, 'steps': 143408, 'loss/train': 0.27786895632743835} 11/07/2021 17:25:51 - INFO - __main__ - Step 143410: {'lr': 2.4427579178923076e-06, 'samples': 27534720, 'steps': 143409, 'loss/train': 1.1104590892791748} 11/07/2021 17:25:52 - INFO - __main__ - Step 143411: {'lr': 2.442017942690139e-06, 'samples': 27534912, 'steps': 143410, 'loss/train': 0.8072749376296997} 11/07/2021 17:25:52 - INFO - __main__ - Step 143412: {'lr': 2.441278079033743e-06, 'samples': 27535104, 'steps': 143411, 'loss/train': 1.9653041362762451} 11/07/2021 17:25:53 - INFO - __main__ - Step 143413: {'lr': 2.4405383269235082e-06, 'samples': 27535296, 'steps': 143412, 'loss/train': 3.5715181827545166} 11/07/2021 17:25:54 - INFO - __main__ - Step 143414: {'lr': 2.43979868635974e-06, 'samples': 27535488, 'steps': 143413, 'loss/train': 1.194839596748352} 11/07/2021 17:25:54 - INFO - __main__ - Step 143415: {'lr': 2.439059157342799e-06, 'samples': 27535680, 'steps': 143414, 'loss/train': 0.4355219006538391} 11/07/2021 17:25:54 - INFO - __main__ - Step 143416: {'lr': 2.438319739872963e-06, 'samples': 27535872, 'steps': 143415, 'loss/train': 0.7322697639465332} 11/07/2021 17:25:55 - INFO - __main__ - Step 143417: {'lr': 2.437580433950648e-06, 'samples': 27536064, 'steps': 143416, 'loss/train': 0.8149152994155884} 11/07/2021 17:25:55 - INFO - __main__ - Step 143418: {'lr': 2.4368412395761043e-06, 'samples': 27536256, 'steps': 143417, 'loss/train': 1.23506760597229} 11/07/2021 17:25:56 - INFO - __main__ - Step 143419: {'lr': 2.436102156749692e-06, 'samples': 27536448, 'steps': 143418, 'loss/train': 1.4192733764648438} 11/07/2021 17:25:56 - INFO - __main__ - Step 143420: {'lr': 2.435363185471773e-06, 'samples': 27536640, 'steps': 143419, 'loss/train': 1.5172559022903442} 11/07/2021 17:25:57 - INFO - __main__ - Step 143421: {'lr': 2.4346243257426514e-06, 'samples': 27536832, 'steps': 143420, 'loss/train': 1.3231146335601807} 11/07/2021 17:25:57 - INFO - __main__ - Step 143422: {'lr': 2.4338855775626613e-06, 'samples': 27537024, 'steps': 143421, 'loss/train': 1.0870182514190674} 11/07/2021 17:25:57 - INFO - __main__ - Step 143423: {'lr': 2.433146940932135e-06, 'samples': 27537216, 'steps': 143422, 'loss/train': 1.4507839679718018} 11/07/2021 17:25:59 - INFO - __main__ - Step 143424: {'lr': 2.4324084158514336e-06, 'samples': 27537408, 'steps': 143423, 'loss/train': 1.492809772491455} 11/07/2021 17:25:59 - INFO - __main__ - Step 143425: {'lr': 2.4316700023208626e-06, 'samples': 27537600, 'steps': 143424, 'loss/train': 1.497240424156189} 11/07/2021 17:25:59 - INFO - __main__ - Step 143426: {'lr': 2.430931700340755e-06, 'samples': 27537792, 'steps': 143425, 'loss/train': 0.9874420166015625} 11/07/2021 17:26:00 - INFO - __main__ - Step 143427: {'lr': 2.4301935099114436e-06, 'samples': 27537984, 'steps': 143426, 'loss/train': 0.773286759853363} 11/07/2021 17:26:00 - INFO - __main__ - Step 143428: {'lr': 2.4294554310332897e-06, 'samples': 27538176, 'steps': 143427, 'loss/train': 1.503762125968933} 11/07/2021 17:26:00 - INFO - __main__ - Step 143429: {'lr': 2.428717463706598e-06, 'samples': 27538368, 'steps': 143428, 'loss/train': 1.3768776655197144} 11/07/2021 17:26:02 - INFO - __main__ - Step 143430: {'lr': 2.4279796079317016e-06, 'samples': 27538560, 'steps': 143429, 'loss/train': 1.5660024881362915} 11/07/2021 17:26:02 - INFO - __main__ - Step 143431: {'lr': 2.4272418637089346e-06, 'samples': 27538752, 'steps': 143430, 'loss/train': 1.512614369392395} 11/07/2021 17:26:02 - INFO - __main__ - Step 143432: {'lr': 2.4265042310386287e-06, 'samples': 27538944, 'steps': 143431, 'loss/train': 1.3198224306106567} 11/07/2021 17:26:03 - INFO - __main__ - Step 143433: {'lr': 2.425766709921118e-06, 'samples': 27539136, 'steps': 143432, 'loss/train': 1.4073357582092285} 11/07/2021 17:26:03 - INFO - __main__ - Step 143434: {'lr': 2.4250293003567348e-06, 'samples': 27539328, 'steps': 143433, 'loss/train': 1.1101545095443726} 11/07/2021 17:26:04 - INFO - __main__ - Step 143435: {'lr': 2.4242920023458124e-06, 'samples': 27539520, 'steps': 143434, 'loss/train': 1.4336317777633667} 11/07/2021 17:26:04 - INFO - __main__ - Step 143436: {'lr': 2.4235548158886846e-06, 'samples': 27539712, 'steps': 143435, 'loss/train': 1.5283132791519165} 11/07/2021 17:26:05 - INFO - __main__ - Step 143437: {'lr': 2.4228177409856832e-06, 'samples': 27539904, 'steps': 143436, 'loss/train': 0.8355331420898438} 11/07/2021 17:26:05 - INFO - __main__ - Step 143438: {'lr': 2.42208077763717e-06, 'samples': 27540096, 'steps': 143437, 'loss/train': 2.426781177520752} 11/07/2021 17:26:05 - INFO - __main__ - Step 143439: {'lr': 2.421343925843422e-06, 'samples': 27540288, 'steps': 143438, 'loss/train': 1.1147079467773438} 11/07/2021 17:26:07 - INFO - __main__ - Step 143440: {'lr': 2.4206071856048005e-06, 'samples': 27540480, 'steps': 143439, 'loss/train': 1.0464386940002441} 11/07/2021 17:26:07 - INFO - __main__ - Step 143441: {'lr': 2.4198705569216106e-06, 'samples': 27540672, 'steps': 143440, 'loss/train': 1.0661344528198242} 11/07/2021 17:26:07 - INFO - __main__ - Step 143442: {'lr': 2.4191340397942405e-06, 'samples': 27540864, 'steps': 143441, 'loss/train': 1.4684613943099976} 11/07/2021 17:26:08 - INFO - __main__ - Step 143443: {'lr': 2.4183976342229684e-06, 'samples': 27541056, 'steps': 143442, 'loss/train': 0.918493926525116} 11/07/2021 17:26:08 - INFO - __main__ - Step 143444: {'lr': 2.417661340208155e-06, 'samples': 27541248, 'steps': 143443, 'loss/train': 1.188372015953064} 11/07/2021 17:26:08 - INFO - __main__ - Step 143445: {'lr': 2.416925157750105e-06, 'samples': 27541440, 'steps': 143444, 'loss/train': 1.2941895723342896} 11/07/2021 17:26:09 - INFO - __main__ - Step 143446: {'lr': 2.41618908684918e-06, 'samples': 27541632, 'steps': 143445, 'loss/train': 1.8440147638320923} 11/07/2021 17:26:10 - INFO - __main__ - Step 143447: {'lr': 2.4154531275056845e-06, 'samples': 27541824, 'steps': 143446, 'loss/train': 0.8560178279876709} 11/07/2021 17:26:10 - INFO - __main__ - Step 143448: {'lr': 2.41471727971998e-06, 'samples': 27542016, 'steps': 143447, 'loss/train': 1.4165338277816772} 11/07/2021 17:26:10 - INFO - __main__ - Step 143449: {'lr': 2.4139815434923995e-06, 'samples': 27542208, 'steps': 143448, 'loss/train': 1.1710128784179688} 11/07/2021 17:26:11 - INFO - __main__ - Step 143450: {'lr': 2.413245918823248e-06, 'samples': 27542400, 'steps': 143449, 'loss/train': 1.389465093612671} 11/07/2021 17:26:12 - INFO - __main__ - Step 143451: {'lr': 2.412510405712859e-06, 'samples': 27542592, 'steps': 143450, 'loss/train': 1.4416978359222412} 11/07/2021 17:26:12 - INFO - __main__ - Step 143452: {'lr': 2.4117750041615926e-06, 'samples': 27542784, 'steps': 143451, 'loss/train': 1.3624526262283325} 11/07/2021 17:26:12 - INFO - __main__ - Step 143453: {'lr': 2.411039714169727e-06, 'samples': 27542976, 'steps': 143452, 'loss/train': 1.3766335248947144} 11/07/2021 17:26:13 - INFO - __main__ - Step 143454: {'lr': 2.4103045357376506e-06, 'samples': 27543168, 'steps': 143453, 'loss/train': 1.236724853515625} 11/07/2021 17:26:13 - INFO - __main__ - Step 143455: {'lr': 2.409569468865669e-06, 'samples': 27543360, 'steps': 143454, 'loss/train': 1.0847409963607788} 11/07/2021 17:26:14 - INFO - __main__ - Step 143456: {'lr': 2.408834513554087e-06, 'samples': 27543552, 'steps': 143455, 'loss/train': 1.3042994737625122} 11/07/2021 17:26:15 - INFO - __main__ - Step 143457: {'lr': 2.4080996698032933e-06, 'samples': 27543744, 'steps': 143456, 'loss/train': 1.1595255136489868} 11/07/2021 17:26:15 - INFO - __main__ - Step 143458: {'lr': 2.407364937613593e-06, 'samples': 27543936, 'steps': 143457, 'loss/train': 0.9570783972740173} 11/07/2021 17:26:15 - INFO - __main__ - Step 143459: {'lr': 2.4066303169852923e-06, 'samples': 27544128, 'steps': 143458, 'loss/train': 1.2545068264007568} 11/07/2021 17:26:16 - INFO - __main__ - Step 143460: {'lr': 2.405895807918751e-06, 'samples': 27544320, 'steps': 143459, 'loss/train': 1.1629730463027954} 11/07/2021 17:26:17 - INFO - __main__ - Step 143461: {'lr': 2.4051614104143027e-06, 'samples': 27544512, 'steps': 143460, 'loss/train': 1.1854991912841797} 11/07/2021 17:26:17 - INFO - __main__ - Step 143462: {'lr': 2.4044271244722526e-06, 'samples': 27544704, 'steps': 143461, 'loss/train': 0.8225865960121155} 11/07/2021 17:26:17 - INFO - __main__ - Step 143463: {'lr': 2.4036929500929614e-06, 'samples': 27544896, 'steps': 143462, 'loss/train': 1.2874540090560913} 11/07/2021 17:26:18 - INFO - __main__ - Step 143464: {'lr': 2.402958887276735e-06, 'samples': 27545088, 'steps': 143463, 'loss/train': 1.257209300994873} 11/07/2021 17:26:18 - INFO - __main__ - Step 143465: {'lr': 2.4022249360239057e-06, 'samples': 27545280, 'steps': 143464, 'loss/train': 1.4528214931488037} 11/07/2021 17:26:19 - INFO - __main__ - Step 143466: {'lr': 2.4014910963348348e-06, 'samples': 27545472, 'steps': 143465, 'loss/train': 1.475683569908142} 11/07/2021 17:26:20 - INFO - __main__ - Step 143467: {'lr': 2.4007573682098273e-06, 'samples': 27545664, 'steps': 143466, 'loss/train': 0.8643388152122498} 11/07/2021 17:26:20 - INFO - __main__ - Step 143468: {'lr': 2.400023751649216e-06, 'samples': 27545856, 'steps': 143467, 'loss/train': 2.0056583881378174} 11/07/2021 17:26:20 - INFO - __main__ - Step 143469: {'lr': 2.3992902466533072e-06, 'samples': 27546048, 'steps': 143468, 'loss/train': 1.1982083320617676} 11/07/2021 17:26:21 - INFO - __main__ - Step 143470: {'lr': 2.3985568532224887e-06, 'samples': 27546240, 'steps': 143469, 'loss/train': 1.0957967042922974} 11/07/2021 17:26:21 - INFO - __main__ - Step 143471: {'lr': 2.3978235713570386e-06, 'samples': 27546432, 'steps': 143470, 'loss/train': 1.4713869094848633} 11/07/2021 17:26:22 - INFO - __main__ - Step 143472: {'lr': 2.3970904010573167e-06, 'samples': 27546624, 'steps': 143471, 'loss/train': 0.9035183191299438} 11/07/2021 17:26:23 - INFO - __main__ - Step 143473: {'lr': 2.3963573423236573e-06, 'samples': 27546816, 'steps': 143472, 'loss/train': 1.2976353168487549} 11/07/2021 17:26:23 - INFO - __main__ - Step 143474: {'lr': 2.395624395156393e-06, 'samples': 27547008, 'steps': 143473, 'loss/train': 1.116848349571228} 11/07/2021 17:26:23 - INFO - __main__ - Step 143475: {'lr': 2.3948915595558007e-06, 'samples': 27547200, 'steps': 143474, 'loss/train': 0.9172769784927368} 11/07/2021 17:26:24 - INFO - __main__ - Step 143476: {'lr': 2.3941588355222697e-06, 'samples': 27547392, 'steps': 143475, 'loss/train': 1.0474307537078857} 11/07/2021 17:26:24 - INFO - __main__ - Step 143477: {'lr': 2.3934262230561055e-06, 'samples': 27547584, 'steps': 143476, 'loss/train': 0.8872216939926147} 11/07/2021 17:26:25 - INFO - __main__ - Step 143478: {'lr': 2.3926937221576407e-06, 'samples': 27547776, 'steps': 143477, 'loss/train': 1.22083580493927} 11/07/2021 17:26:26 - INFO - __main__ - Step 143479: {'lr': 2.391961332827208e-06, 'samples': 27547968, 'steps': 143478, 'loss/train': 1.3928948640823364} 11/07/2021 17:26:26 - INFO - __main__ - Step 143480: {'lr': 2.3912290550651416e-06, 'samples': 27548160, 'steps': 143479, 'loss/train': 1.3080470561981201} 11/07/2021 17:26:26 - INFO - __main__ - Step 143481: {'lr': 2.390496888871774e-06, 'samples': 27548352, 'steps': 143480, 'loss/train': 1.174222469329834} 11/07/2021 17:26:27 - INFO - __main__ - Step 143482: {'lr': 2.38976483424741e-06, 'samples': 27548544, 'steps': 143481, 'loss/train': 0.2801666855812073} 11/07/2021 17:26:28 - INFO - __main__ - Step 143483: {'lr': 2.389032891192411e-06, 'samples': 27548736, 'steps': 143482, 'loss/train': 1.4650925397872925} 11/07/2021 17:26:28 - INFO - __main__ - Step 143484: {'lr': 2.388301059707082e-06, 'samples': 27548928, 'steps': 143483, 'loss/train': 1.0140718221664429} 11/07/2021 17:26:28 - INFO - __main__ - Step 143485: {'lr': 2.3875693397917565e-06, 'samples': 27549120, 'steps': 143484, 'loss/train': 1.5541887283325195} 11/07/2021 17:26:29 - INFO - __main__ - Step 143486: {'lr': 2.386837731446767e-06, 'samples': 27549312, 'steps': 143485, 'loss/train': 1.275999665260315} 11/07/2021 17:26:29 - INFO - __main__ - Step 143487: {'lr': 2.386106234672475e-06, 'samples': 27549504, 'steps': 143486, 'loss/train': 1.60480797290802} 11/07/2021 17:26:30 - INFO - __main__ - Step 143488: {'lr': 2.3853748494691853e-06, 'samples': 27549696, 'steps': 143487, 'loss/train': 1.63968026638031} 11/07/2021 17:26:30 - INFO - __main__ - Step 143489: {'lr': 2.3846435758372033e-06, 'samples': 27549888, 'steps': 143488, 'loss/train': 1.2260711193084717} 11/07/2021 17:26:31 - INFO - __main__ - Step 143490: {'lr': 2.38391241377689e-06, 'samples': 27550080, 'steps': 143489, 'loss/train': 1.6768150329589844} 11/07/2021 17:26:31 - INFO - __main__ - Step 143491: {'lr': 2.38318136328855e-06, 'samples': 27550272, 'steps': 143490, 'loss/train': 1.2926322221755981} 11/07/2021 17:26:31 - INFO - __main__ - Step 143492: {'lr': 2.3824504243725452e-06, 'samples': 27550464, 'steps': 143491, 'loss/train': 1.4011118412017822} 11/07/2021 17:26:32 - INFO - __main__ - Step 143493: {'lr': 2.3817195970291805e-06, 'samples': 27550656, 'steps': 143492, 'loss/train': 0.9904667139053345} 11/07/2021 17:26:33 - INFO - __main__ - Step 143494: {'lr': 2.380988881258789e-06, 'samples': 27550848, 'steps': 143493, 'loss/train': 1.2630739212036133} 11/07/2021 17:26:33 - INFO - __main__ - Step 143495: {'lr': 2.380258277061703e-06, 'samples': 27551040, 'steps': 143494, 'loss/train': 1.1179853677749634} 11/07/2021 17:26:33 - INFO - __main__ - Step 143496: {'lr': 2.3795277844382568e-06, 'samples': 27551232, 'steps': 143495, 'loss/train': 1.4608296155929565} 11/07/2021 17:26:34 - INFO - __main__ - Step 143497: {'lr': 2.3787974033887826e-06, 'samples': 27551424, 'steps': 143496, 'loss/train': 1.272691011428833} 11/07/2021 17:26:35 - INFO - __main__ - Step 143498: {'lr': 2.3780671339135863e-06, 'samples': 27551616, 'steps': 143497, 'loss/train': 1.4394997358322144} 11/07/2021 17:26:35 - INFO - __main__ - Step 143499: {'lr': 2.377336976013028e-06, 'samples': 27551808, 'steps': 143498, 'loss/train': 1.0396417379379272} 11/07/2021 17:26:36 - INFO - __main__ - Step 143500: {'lr': 2.376606929687386e-06, 'samples': 27552000, 'steps': 143499, 'loss/train': 1.4138375520706177} 11/07/2021 17:26:36 - INFO - __main__ - Step 143501: {'lr': 2.375876994937076e-06, 'samples': 27552192, 'steps': 143500, 'loss/train': 1.412373661994934} 11/07/2021 17:26:36 - INFO - __main__ - Step 143502: {'lr': 2.3751471717623483e-06, 'samples': 27552384, 'steps': 143501, 'loss/train': 1.3956652879714966} 11/07/2021 17:26:37 - INFO - __main__ - Step 143503: {'lr': 2.374417460163536e-06, 'samples': 27552576, 'steps': 143502, 'loss/train': 1.4737720489501953} 11/07/2021 17:26:38 - INFO - __main__ - Step 143504: {'lr': 2.3736878601410274e-06, 'samples': 27552768, 'steps': 143503, 'loss/train': 0.8243356347084045} 11/07/2021 17:26:38 - INFO - __main__ - Step 143505: {'lr': 2.3729583716950996e-06, 'samples': 27552960, 'steps': 143504, 'loss/train': 1.3458197116851807} 11/07/2021 17:26:38 - INFO - __main__ - Step 143506: {'lr': 2.3722289948260865e-06, 'samples': 27553152, 'steps': 143505, 'loss/train': 1.0397858619689941} 11/07/2021 17:26:39 - INFO - __main__ - Step 143507: {'lr': 2.371499729534321e-06, 'samples': 27553344, 'steps': 143506, 'loss/train': 1.183642864227295} 11/07/2021 17:26:39 - INFO - __main__ - Step 143508: {'lr': 2.3707705758201357e-06, 'samples': 27553536, 'steps': 143507, 'loss/train': 1.281440019607544} 11/07/2021 17:26:40 - INFO - __main__ - Step 143509: {'lr': 2.3700415336838922e-06, 'samples': 27553728, 'steps': 143508, 'loss/train': 1.1934354305267334} 11/07/2021 17:26:41 - INFO - __main__ - Step 143510: {'lr': 2.369312603125867e-06, 'samples': 27553920, 'steps': 143509, 'loss/train': 1.4524047374725342} 11/07/2021 17:26:41 - INFO - __main__ - Step 143511: {'lr': 2.368583784146394e-06, 'samples': 27554112, 'steps': 143510, 'loss/train': 1.0665603876113892} 11/07/2021 17:26:41 - INFO - __main__ - Step 143512: {'lr': 2.367855076745834e-06, 'samples': 27554304, 'steps': 143511, 'loss/train': 1.3445039987564087} 11/07/2021 17:26:42 - INFO - __main__ - Step 143513: {'lr': 2.367126480924492e-06, 'samples': 27554496, 'steps': 143512, 'loss/train': 1.3767234086990356} 11/07/2021 17:26:43 - INFO - __main__ - Step 143514: {'lr': 2.3663979966827287e-06, 'samples': 27554688, 'steps': 143513, 'loss/train': 1.381736397743225} 11/07/2021 17:26:43 - INFO - __main__ - Step 143515: {'lr': 2.3656696240207942e-06, 'samples': 27554880, 'steps': 143514, 'loss/train': 0.9778257608413696} 11/07/2021 17:26:43 - INFO - __main__ - Step 143516: {'lr': 2.364941362939105e-06, 'samples': 27555072, 'steps': 143515, 'loss/train': 1.2304530143737793} 11/07/2021 17:26:44 - INFO - __main__ - Step 143517: {'lr': 2.3642132134379378e-06, 'samples': 27555264, 'steps': 143516, 'loss/train': 0.4315597116947174} 11/07/2021 17:26:44 - INFO - __main__ - Step 143518: {'lr': 2.363485175517627e-06, 'samples': 27555456, 'steps': 143517, 'loss/train': 1.5322359800338745} 11/07/2021 17:26:45 - INFO - __main__ - Step 143519: {'lr': 2.362757249178532e-06, 'samples': 27555648, 'steps': 143518, 'loss/train': 1.3526045083999634} 11/07/2021 17:26:45 - INFO - __main__ - Step 143520: {'lr': 2.3620294344209316e-06, 'samples': 27555840, 'steps': 143519, 'loss/train': 1.6499755382537842} 11/07/2021 17:26:46 - INFO - __main__ - Step 143521: {'lr': 2.3613017312451857e-06, 'samples': 27556032, 'steps': 143520, 'loss/train': 0.709223747253418} 11/07/2021 17:26:46 - INFO - __main__ - Step 143522: {'lr': 2.3605741396516277e-06, 'samples': 27556224, 'steps': 143521, 'loss/train': 1.0394747257232666} 11/07/2021 17:26:47 - INFO - __main__ - Step 143523: {'lr': 2.3598466596405634e-06, 'samples': 27556416, 'steps': 143522, 'loss/train': 1.4083514213562012} 11/07/2021 17:26:48 - INFO - __main__ - Step 143524: {'lr': 2.3591192912123526e-06, 'samples': 27556608, 'steps': 143523, 'loss/train': 1.2255454063415527} 11/07/2021 17:26:48 - INFO - __main__ - Step 143525: {'lr': 2.3583920343672738e-06, 'samples': 27556800, 'steps': 143524, 'loss/train': 1.1782346963882446} 11/07/2021 17:26:48 - INFO - __main__ - Step 143526: {'lr': 2.357664889105687e-06, 'samples': 27556992, 'steps': 143525, 'loss/train': 1.1621592044830322} 11/07/2021 17:26:49 - INFO - __main__ - Step 143527: {'lr': 2.3569378554279266e-06, 'samples': 27557184, 'steps': 143526, 'loss/train': 1.5191614627838135} 11/07/2021 17:26:49 - INFO - __main__ - Step 143528: {'lr': 2.356210933334324e-06, 'samples': 27557376, 'steps': 143527, 'loss/train': 1.5479025840759277} 11/07/2021 17:26:50 - INFO - __main__ - Step 143529: {'lr': 2.355484122825158e-06, 'samples': 27557568, 'steps': 143528, 'loss/train': 1.2500823736190796} 11/07/2021 17:26:50 - INFO - __main__ - Step 143530: {'lr': 2.3547574239007883e-06, 'samples': 27557760, 'steps': 143529, 'loss/train': 1.36100435256958} 11/07/2021 17:26:51 - INFO - __main__ - Step 143531: {'lr': 2.354030836561577e-06, 'samples': 27557952, 'steps': 143530, 'loss/train': 1.3633588552474976} 11/07/2021 17:26:51 - INFO - __main__ - Step 143532: {'lr': 2.353304360807773e-06, 'samples': 27558144, 'steps': 143531, 'loss/train': 1.346187710762024} 11/07/2021 17:26:52 - INFO - __main__ - Step 143533: {'lr': 2.352577996639793e-06, 'samples': 27558336, 'steps': 143532, 'loss/train': 1.0778224468231201} 11/07/2021 17:26:52 - INFO - __main__ - Step 143534: {'lr': 2.351851744057887e-06, 'samples': 27558528, 'steps': 143533, 'loss/train': 1.4655019044876099} 11/07/2021 17:26:53 - INFO - __main__ - Step 143535: {'lr': 2.351125603062443e-06, 'samples': 27558720, 'steps': 143534, 'loss/train': 1.6244542598724365} 11/07/2021 17:26:53 - INFO - __main__ - Step 143536: {'lr': 2.3503995736537388e-06, 'samples': 27558912, 'steps': 143535, 'loss/train': 1.191293716430664} 11/07/2021 17:26:54 - INFO - __main__ - Step 143537: {'lr': 2.3496736558321353e-06, 'samples': 27559104, 'steps': 143536, 'loss/train': 1.2185765504837036} 11/07/2021 17:26:54 - INFO - __main__ - Step 143538: {'lr': 2.348947849597938e-06, 'samples': 27559296, 'steps': 143537, 'loss/train': 1.0668820142745972} 11/07/2021 17:26:54 - INFO - __main__ - Step 143539: {'lr': 2.3482221549514792e-06, 'samples': 27559488, 'steps': 143538, 'loss/train': 1.8549315929412842} 11/07/2021 17:26:55 - INFO - __main__ - Step 143540: {'lr': 2.347496571893093e-06, 'samples': 27559680, 'steps': 143539, 'loss/train': 1.5814812183380127} 11/07/2021 17:26:56 - INFO - __main__ - Step 143541: {'lr': 2.346771100423112e-06, 'samples': 27559872, 'steps': 143540, 'loss/train': 0.9131342768669128} 11/07/2021 17:26:56 - INFO - __main__ - Step 143542: {'lr': 2.346045740541869e-06, 'samples': 27560064, 'steps': 143541, 'loss/train': 1.4503973722457886} 11/07/2021 17:26:56 - INFO - __main__ - Step 143543: {'lr': 2.345320492249642e-06, 'samples': 27560256, 'steps': 143542, 'loss/train': 0.9634972810745239} 11/07/2021 17:26:57 - INFO - __main__ - Step 143544: {'lr': 2.3445953555468192e-06, 'samples': 27560448, 'steps': 143543, 'loss/train': 1.6240438222885132} 11/07/2021 17:26:58 - INFO - __main__ - Step 143545: {'lr': 2.343870330433678e-06, 'samples': 27560640, 'steps': 143544, 'loss/train': 1.311843752861023} 11/07/2021 17:26:58 - INFO - __main__ - Step 143546: {'lr': 2.3431454169105804e-06, 'samples': 27560832, 'steps': 143545, 'loss/train': 1.1942956447601318} 11/07/2021 17:26:58 - INFO - __main__ - Step 143547: {'lr': 2.3424206149778303e-06, 'samples': 27561024, 'steps': 143546, 'loss/train': 1.4577767848968506} 11/07/2021 17:26:59 - INFO - __main__ - Step 143548: {'lr': 2.3416959246357893e-06, 'samples': 27561216, 'steps': 143547, 'loss/train': 1.4075525999069214} 11/07/2021 17:26:59 - INFO - __main__ - Step 143549: {'lr': 2.3409713458847346e-06, 'samples': 27561408, 'steps': 143548, 'loss/train': 0.964880645275116} 11/07/2021 17:27:00 - INFO - __main__ - Step 143550: {'lr': 2.3402468787250277e-06, 'samples': 27561600, 'steps': 143549, 'loss/train': 1.102425456047058} 11/07/2021 17:27:01 - INFO - __main__ - Step 143551: {'lr': 2.339522523156973e-06, 'samples': 27561792, 'steps': 143550, 'loss/train': 1.3504371643066406} 11/07/2021 17:27:01 - INFO - __main__ - Step 143552: {'lr': 2.3387982791809035e-06, 'samples': 27561984, 'steps': 143551, 'loss/train': 1.2002681493759155} 11/07/2021 17:27:01 - INFO - __main__ - Step 143553: {'lr': 2.3380741467971534e-06, 'samples': 27562176, 'steps': 143552, 'loss/train': 1.1587821245193481} 11/07/2021 17:27:02 - INFO - __main__ - Step 143554: {'lr': 2.337350126006055e-06, 'samples': 27562368, 'steps': 143553, 'loss/train': 1.217521071434021} 11/07/2021 17:27:03 - INFO - __main__ - Step 143555: {'lr': 2.336626216807941e-06, 'samples': 27562560, 'steps': 143554, 'loss/train': 1.2056561708450317} 11/07/2021 17:27:03 - INFO - __main__ - Step 143556: {'lr': 2.3359024192030896e-06, 'samples': 27562752, 'steps': 143555, 'loss/train': 1.5392478704452515} 11/07/2021 17:27:03 - INFO - __main__ - Step 143557: {'lr': 2.3351787331918893e-06, 'samples': 27562944, 'steps': 143556, 'loss/train': 1.5448877811431885} 11/07/2021 17:27:04 - INFO - __main__ - Step 143558: {'lr': 2.3344551587746176e-06, 'samples': 27563136, 'steps': 143557, 'loss/train': 0.85478675365448} 11/07/2021 17:27:04 - INFO - __main__ - Step 143559: {'lr': 2.3337316959516073e-06, 'samples': 27563328, 'steps': 143558, 'loss/train': 1.5820794105529785} 11/07/2021 17:27:05 - INFO - __main__ - Step 143560: {'lr': 2.3330083447232197e-06, 'samples': 27563520, 'steps': 143559, 'loss/train': 1.5718005895614624} 11/07/2021 17:27:05 - INFO - __main__ - Step 143561: {'lr': 2.332285105089732e-06, 'samples': 27563712, 'steps': 143560, 'loss/train': 1.5304882526397705} 11/07/2021 17:27:06 - INFO - __main__ - Step 143562: {'lr': 2.331561977051505e-06, 'samples': 27563904, 'steps': 143561, 'loss/train': 1.3595619201660156} 11/07/2021 17:27:06 - INFO - __main__ - Step 143563: {'lr': 2.330838960608872e-06, 'samples': 27564096, 'steps': 143562, 'loss/train': 1.2735806703567505} 11/07/2021 17:27:07 - INFO - __main__ - Step 143564: {'lr': 2.3301160557621105e-06, 'samples': 27564288, 'steps': 143563, 'loss/train': 1.2082021236419678} 11/07/2021 17:27:08 - INFO - __main__ - Step 143565: {'lr': 2.3293932625116088e-06, 'samples': 27564480, 'steps': 143564, 'loss/train': 1.2956814765930176} 11/07/2021 17:27:08 - INFO - __main__ - Step 143566: {'lr': 2.328670580857645e-06, 'samples': 27564672, 'steps': 143565, 'loss/train': 1.1216938495635986} 11/07/2021 17:27:08 - INFO - __main__ - Step 143567: {'lr': 2.3279480108005512e-06, 'samples': 27564864, 'steps': 143566, 'loss/train': 1.4890376329421997} 11/07/2021 17:27:09 - INFO - __main__ - Step 143568: {'lr': 2.3272255523406892e-06, 'samples': 27565056, 'steps': 143567, 'loss/train': 1.4118263721466064} 11/07/2021 17:27:09 - INFO - __main__ - Step 143569: {'lr': 2.326503205478336e-06, 'samples': 27565248, 'steps': 143568, 'loss/train': 1.43120276927948} 11/07/2021 17:27:09 - INFO - __main__ - Step 143570: {'lr': 2.3257809702138255e-06, 'samples': 27565440, 'steps': 143569, 'loss/train': 0.6323961019515991} 11/07/2021 17:27:10 - INFO - __main__ - Step 143571: {'lr': 2.3250588465475174e-06, 'samples': 27565632, 'steps': 143570, 'loss/train': 1.0281513929367065} 11/07/2021 17:27:11 - INFO - __main__ - Step 143572: {'lr': 2.324336834479718e-06, 'samples': 27565824, 'steps': 143571, 'loss/train': 1.3558995723724365} 11/07/2021 17:27:11 - INFO - __main__ - Step 143573: {'lr': 2.3236149340107317e-06, 'samples': 27566016, 'steps': 143572, 'loss/train': 1.101060390472412} 11/07/2021 17:27:11 - INFO - __main__ - Step 143574: {'lr': 2.32289314514092e-06, 'samples': 27566208, 'steps': 143573, 'loss/train': 1.1898454427719116} 11/07/2021 17:27:12 - INFO - __main__ - Step 143575: {'lr': 2.3221714678705876e-06, 'samples': 27566400, 'steps': 143574, 'loss/train': 1.2583914995193481} 11/07/2021 17:27:13 - INFO - __main__ - Step 143576: {'lr': 2.321449902200068e-06, 'samples': 27566592, 'steps': 143575, 'loss/train': 1.423499584197998} 11/07/2021 17:27:13 - INFO - __main__ - Step 143577: {'lr': 2.3207284481296663e-06, 'samples': 27566784, 'steps': 143576, 'loss/train': 1.5413734912872314} 11/07/2021 17:27:14 - INFO - __main__ - Step 143578: {'lr': 2.320007105659716e-06, 'samples': 27566976, 'steps': 143577, 'loss/train': 1.2316594123840332} 11/07/2021 17:27:14 - INFO - __main__ - Step 143579: {'lr': 2.3192858747905775e-06, 'samples': 27567168, 'steps': 143578, 'loss/train': 0.7474720478057861} 11/07/2021 17:27:14 - INFO - __main__ - Step 143580: {'lr': 2.3185647555225286e-06, 'samples': 27567360, 'steps': 143579, 'loss/train': 1.376933217048645} 11/07/2021 17:27:15 - INFO - __main__ - Step 143581: {'lr': 2.3178437478559023e-06, 'samples': 27567552, 'steps': 143580, 'loss/train': 1.3464343547821045} 11/07/2021 17:27:16 - INFO - __main__ - Step 143582: {'lr': 2.317122851791087e-06, 'samples': 27567744, 'steps': 143581, 'loss/train': 1.2655330896377563} 11/07/2021 17:27:16 - INFO - __main__ - Step 143583: {'lr': 2.316402067328305e-06, 'samples': 27567936, 'steps': 143582, 'loss/train': 1.1672935485839844} 11/07/2021 17:27:16 - INFO - __main__ - Step 143584: {'lr': 2.3156813944679444e-06, 'samples': 27568128, 'steps': 143583, 'loss/train': 1.3409390449523926} 11/07/2021 17:27:17 - INFO - __main__ - Step 143585: {'lr': 2.314960833210311e-06, 'samples': 27568320, 'steps': 143584, 'loss/train': 1.6783136129379272} 11/07/2021 17:27:18 - INFO - __main__ - Step 143586: {'lr': 2.3142403835557102e-06, 'samples': 27568512, 'steps': 143585, 'loss/train': 1.572360873222351} 11/07/2021 17:27:18 - INFO - __main__ - Step 143587: {'lr': 2.3135200455045302e-06, 'samples': 27568704, 'steps': 143586, 'loss/train': 0.6478919982910156} 11/07/2021 17:27:18 - INFO - __main__ - Step 143588: {'lr': 2.312799819057049e-06, 'samples': 27568896, 'steps': 143587, 'loss/train': 0.906624436378479} 11/07/2021 17:27:19 - INFO - __main__ - Step 143589: {'lr': 2.3120797042135712e-06, 'samples': 27569088, 'steps': 143588, 'loss/train': 1.3512591123580933} 11/07/2021 17:27:19 - INFO - __main__ - Step 143590: {'lr': 2.311359700974458e-06, 'samples': 27569280, 'steps': 143589, 'loss/train': 1.1302587985992432} 11/07/2021 17:27:20 - INFO - __main__ - Step 143591: {'lr': 2.310639809340043e-06, 'samples': 27569472, 'steps': 143590, 'loss/train': 0.9816364049911499} 11/07/2021 17:27:20 - INFO - __main__ - Step 143592: {'lr': 2.3099200293106305e-06, 'samples': 27569664, 'steps': 143591, 'loss/train': 1.2779732942581177} 11/07/2021 17:27:21 - INFO - __main__ - Step 143593: {'lr': 2.3092003608865266e-06, 'samples': 27569856, 'steps': 143592, 'loss/train': 1.2763878107070923} 11/07/2021 17:27:21 - INFO - __main__ - Step 143594: {'lr': 2.3084808040680915e-06, 'samples': 27570048, 'steps': 143593, 'loss/train': 1.361155390739441} 11/07/2021 17:27:21 - INFO - __main__ - Step 143595: {'lr': 2.307761358855631e-06, 'samples': 27570240, 'steps': 143594, 'loss/train': 0.7823206186294556} 11/07/2021 17:27:23 - INFO - __main__ - Step 143596: {'lr': 2.3070420252494783e-06, 'samples': 27570432, 'steps': 143595, 'loss/train': 1.5095136165618896} 11/07/2021 17:27:23 - INFO - __main__ - Step 143597: {'lr': 2.3063228032499383e-06, 'samples': 27570624, 'steps': 143596, 'loss/train': 2.1591296195983887} 11/07/2021 17:27:23 - INFO - __main__ - Step 143598: {'lr': 2.305603692857344e-06, 'samples': 27570816, 'steps': 143597, 'loss/train': 1.7236390113830566} 11/07/2021 17:27:24 - INFO - __main__ - Step 143599: {'lr': 2.304884694072029e-06, 'samples': 27571008, 'steps': 143598, 'loss/train': 1.5429234504699707} 11/07/2021 17:27:24 - INFO - __main__ - Step 143600: {'lr': 2.3041658068942984e-06, 'samples': 27571200, 'steps': 143599, 'loss/train': 1.2971009016036987} 11/07/2021 17:27:25 - INFO - __main__ - Step 143601: {'lr': 2.303447031324485e-06, 'samples': 27571392, 'steps': 143600, 'loss/train': 0.7222891449928284} 11/07/2021 17:27:25 - INFO - __main__ - Step 143602: {'lr': 2.3027283673629495e-06, 'samples': 27571584, 'steps': 143601, 'loss/train': 0.9348170757293701} 11/07/2021 17:27:26 - INFO - __main__ - Step 143603: {'lr': 2.3020098150099423e-06, 'samples': 27571776, 'steps': 143602, 'loss/train': 1.1043943166732788} 11/07/2021 17:27:26 - INFO - __main__ - Step 143604: {'lr': 2.3012913742658516e-06, 'samples': 27571968, 'steps': 143603, 'loss/train': 1.220024585723877} 11/07/2021 17:27:26 - INFO - __main__ - Step 143605: {'lr': 2.3005730451309826e-06, 'samples': 27572160, 'steps': 143604, 'loss/train': 1.6004266738891602} 11/07/2021 17:27:28 - INFO - __main__ - Step 143606: {'lr': 2.2998548276056408e-06, 'samples': 27572352, 'steps': 143605, 'loss/train': 1.4434415102005005} 11/07/2021 17:27:28 - INFO - __main__ - Step 143607: {'lr': 2.2991367216901593e-06, 'samples': 27572544, 'steps': 143606, 'loss/train': 1.306391954421997} 11/07/2021 17:27:28 - INFO - __main__ - Step 143608: {'lr': 2.298418727384871e-06, 'samples': 27572736, 'steps': 143607, 'loss/train': 1.196334719657898} 11/07/2021 17:27:29 - INFO - __main__ - Step 143609: {'lr': 2.2977008446901092e-06, 'samples': 27572928, 'steps': 143608, 'loss/train': 1.047247052192688} 11/07/2021 17:27:29 - INFO - __main__ - Step 143610: {'lr': 2.2969830736061513e-06, 'samples': 27573120, 'steps': 143609, 'loss/train': 0.6242823004722595} 11/07/2021 17:27:29 - INFO - __main__ - Step 143611: {'lr': 2.296265414133358e-06, 'samples': 27573312, 'steps': 143610, 'loss/train': 1.6742141246795654} 11/07/2021 17:27:31 - INFO - __main__ - Step 143612: {'lr': 2.295547866272063e-06, 'samples': 27573504, 'steps': 143611, 'loss/train': 1.4261702299118042} 11/07/2021 17:27:31 - INFO - __main__ - Step 143613: {'lr': 2.294830430022543e-06, 'samples': 27573696, 'steps': 143612, 'loss/train': 1.4577475786209106} 11/07/2021 17:27:32 - INFO - __main__ - Step 143614: {'lr': 2.294113105385159e-06, 'samples': 27573888, 'steps': 143613, 'loss/train': 1.363168478012085} 11/07/2021 17:27:32 - INFO - __main__ - Step 143615: {'lr': 2.293395892360245e-06, 'samples': 27574080, 'steps': 143614, 'loss/train': 1.1239553689956665} 11/07/2021 17:27:32 - INFO - __main__ - Step 143616: {'lr': 2.2926787909480772e-06, 'samples': 27574272, 'steps': 143615, 'loss/train': 0.6334245800971985} 11/07/2021 17:27:33 - INFO - __main__ - Step 143617: {'lr': 2.2919618011490173e-06, 'samples': 27574464, 'steps': 143616, 'loss/train': 1.3224639892578125} 11/07/2021 17:27:34 - INFO - __main__ - Step 143618: {'lr': 2.291244922963398e-06, 'samples': 27574656, 'steps': 143617, 'loss/train': 1.5569140911102295} 11/07/2021 17:27:34 - INFO - __main__ - Step 143619: {'lr': 2.290528156391497e-06, 'samples': 27574848, 'steps': 143618, 'loss/train': 0.4155651032924652} 11/07/2021 17:27:34 - INFO - __main__ - Step 143620: {'lr': 2.289811501433675e-06, 'samples': 27575040, 'steps': 143619, 'loss/train': 1.1610758304595947} 11/07/2021 17:27:35 - INFO - __main__ - Step 143621: {'lr': 2.28909495809021e-06, 'samples': 27575232, 'steps': 143620, 'loss/train': 1.6470471620559692} 11/07/2021 17:27:36 - INFO - __main__ - Step 143622: {'lr': 2.2883785263615177e-06, 'samples': 27575424, 'steps': 143621, 'loss/train': 1.481719732284546} 11/07/2021 17:27:36 - INFO - __main__ - Step 143623: {'lr': 2.2876622062478203e-06, 'samples': 27575616, 'steps': 143622, 'loss/train': 1.359272837638855} 11/07/2021 17:27:36 - INFO - __main__ - Step 143624: {'lr': 2.286945997749479e-06, 'samples': 27575808, 'steps': 143623, 'loss/train': 1.4918527603149414} 11/07/2021 17:27:37 - INFO - __main__ - Step 143625: {'lr': 2.2862299008667987e-06, 'samples': 27576000, 'steps': 143624, 'loss/train': 1.2247796058654785} 11/07/2021 17:27:37 - INFO - __main__ - Step 143626: {'lr': 2.285513915600168e-06, 'samples': 27576192, 'steps': 143625, 'loss/train': 1.178728461265564} 11/07/2021 17:27:38 - INFO - __main__ - Step 143627: {'lr': 2.284798041949837e-06, 'samples': 27576384, 'steps': 143626, 'loss/train': 1.356951355934143} 11/07/2021 17:27:39 - INFO - __main__ - Step 143628: {'lr': 2.2840822799161386e-06, 'samples': 27576576, 'steps': 143627, 'loss/train': 1.631362795829773} 11/07/2021 17:27:39 - INFO - __main__ - Step 143629: {'lr': 2.283366629499434e-06, 'samples': 27576768, 'steps': 143628, 'loss/train': 0.7443016171455383} 11/07/2021 17:27:39 - INFO - __main__ - Step 143630: {'lr': 2.2826510907e-06, 'samples': 27576960, 'steps': 143629, 'loss/train': 1.7982605695724487} 11/07/2021 17:27:40 - INFO - __main__ - Step 143631: {'lr': 2.2819356635181974e-06, 'samples': 27577152, 'steps': 143630, 'loss/train': 1.3064308166503906} 11/07/2021 17:27:41 - INFO - __main__ - Step 143632: {'lr': 2.2812203479543326e-06, 'samples': 27577344, 'steps': 143631, 'loss/train': 1.3262537717819214} 11/07/2021 17:27:41 - INFO - __main__ - Step 143633: {'lr': 2.28050514400871e-06, 'samples': 27577536, 'steps': 143632, 'loss/train': 1.8146427869796753} 11/07/2021 17:27:42 - INFO - __main__ - Step 143634: {'lr': 2.2797900516816906e-06, 'samples': 27577728, 'steps': 143633, 'loss/train': 0.964857816696167} 11/07/2021 17:27:42 - INFO - __main__ - Step 143635: {'lr': 2.2790750709735796e-06, 'samples': 27577920, 'steps': 143634, 'loss/train': 1.2281301021575928} 11/07/2021 17:27:42 - INFO - __main__ - Step 143636: {'lr': 2.2783602018846827e-06, 'samples': 27578112, 'steps': 143635, 'loss/train': 1.3725353479385376} 11/07/2021 17:27:43 - INFO - __main__ - Step 143637: {'lr': 2.2776454444153326e-06, 'samples': 27578304, 'steps': 143636, 'loss/train': 1.2559661865234375} 11/07/2021 17:27:44 - INFO - __main__ - Step 143638: {'lr': 2.2769307985658628e-06, 'samples': 27578496, 'steps': 143637, 'loss/train': 1.1304433345794678} 11/07/2021 17:27:44 - INFO - __main__ - Step 143639: {'lr': 2.2762162643365504e-06, 'samples': 27578688, 'steps': 143638, 'loss/train': 1.0028135776519775} 11/07/2021 17:27:44 - INFO - __main__ - Step 143640: {'lr': 2.275501841727784e-06, 'samples': 27578880, 'steps': 143639, 'loss/train': 1.3516980409622192} 11/07/2021 17:27:45 - INFO - __main__ - Step 143641: {'lr': 2.2747875307398414e-06, 'samples': 27579072, 'steps': 143640, 'loss/train': 1.4904459714889526} 11/07/2021 17:27:45 - INFO - __main__ - Step 143642: {'lr': 2.274073331373083e-06, 'samples': 27579264, 'steps': 143641, 'loss/train': 4.100195407867432} 11/07/2021 17:27:46 - INFO - __main__ - Step 143643: {'lr': 2.273359243627787e-06, 'samples': 27579456, 'steps': 143642, 'loss/train': 1.1884009838104248} 11/07/2021 17:27:47 - INFO - __main__ - Step 143644: {'lr': 2.272645267504286e-06, 'samples': 27579648, 'steps': 143643, 'loss/train': 0.9266228675842285} 11/07/2021 17:27:47 - INFO - __main__ - Step 143645: {'lr': 2.2719314030029137e-06, 'samples': 27579840, 'steps': 143644, 'loss/train': 0.1645362377166748} 11/07/2021 17:27:47 - INFO - __main__ - Step 143646: {'lr': 2.2712176501239745e-06, 'samples': 27580032, 'steps': 143645, 'loss/train': 1.3720991611480713} 11/07/2021 17:27:48 - INFO - __main__ - Step 143647: {'lr': 2.2705040088678296e-06, 'samples': 27580224, 'steps': 143646, 'loss/train': 0.7983099222183228} 11/07/2021 17:27:48 - INFO - __main__ - Step 143648: {'lr': 2.2697904792347566e-06, 'samples': 27580416, 'steps': 143647, 'loss/train': 2.874821424484253} 11/07/2021 17:27:49 - INFO - __main__ - Step 143649: {'lr': 2.269077061225089e-06, 'samples': 27580608, 'steps': 143648, 'loss/train': 1.5210825204849243} 11/07/2021 17:27:49 - INFO - __main__ - Step 143650: {'lr': 2.268363754839159e-06, 'samples': 27580800, 'steps': 143649, 'loss/train': 1.3788881301879883} 11/07/2021 17:27:50 - INFO - __main__ - Step 143651: {'lr': 2.2676505600772724e-06, 'samples': 27580992, 'steps': 143650, 'loss/train': 1.2966467142105103} 11/07/2021 17:27:50 - INFO - __main__ - Step 143652: {'lr': 2.26693747693979e-06, 'samples': 27581184, 'steps': 143651, 'loss/train': 0.9701062440872192} 11/07/2021 17:27:50 - INFO - __main__ - Step 143653: {'lr': 2.2662245054269616e-06, 'samples': 27581376, 'steps': 143652, 'loss/train': 1.597795009613037} 11/07/2021 17:27:51 - INFO - __main__ - Step 143654: {'lr': 2.2655116455391754e-06, 'samples': 27581568, 'steps': 143653, 'loss/train': 1.6066462993621826} 11/07/2021 17:27:52 - INFO - __main__ - Step 143655: {'lr': 2.2647988972767096e-06, 'samples': 27581760, 'steps': 143654, 'loss/train': 1.5554914474487305} 11/07/2021 17:27:52 - INFO - __main__ - Step 143656: {'lr': 2.2640862606399247e-06, 'samples': 27581952, 'steps': 143655, 'loss/train': 0.9884195327758789} 11/07/2021 17:27:53 - INFO - __main__ - Step 143657: {'lr': 2.2633737356290985e-06, 'samples': 27582144, 'steps': 143656, 'loss/train': 1.6086980104446411} 11/07/2021 17:27:53 - INFO - __main__ - Step 143658: {'lr': 2.2626613222445914e-06, 'samples': 27582336, 'steps': 143657, 'loss/train': 1.1246695518493652} 11/07/2021 17:27:53 - INFO - __main__ - Step 143659: {'lr': 2.2619490204866812e-06, 'samples': 27582528, 'steps': 143658, 'loss/train': 1.4298789501190186} 11/07/2021 17:27:54 - INFO - __main__ - Step 143660: {'lr': 2.2612368303557285e-06, 'samples': 27582720, 'steps': 143659, 'loss/train': 1.1257137060165405} 11/07/2021 17:27:55 - INFO - __main__ - Step 143661: {'lr': 2.260524751852039e-06, 'samples': 27582912, 'steps': 143660, 'loss/train': 1.4850510358810425} 11/07/2021 17:27:55 - INFO - __main__ - Step 143662: {'lr': 2.259812784975945e-06, 'samples': 27583104, 'steps': 143661, 'loss/train': 0.7808092832565308} 11/07/2021 17:27:55 - INFO - __main__ - Step 143663: {'lr': 2.2591009297277533e-06, 'samples': 27583296, 'steps': 143662, 'loss/train': 1.257574200630188} 11/07/2021 17:27:56 - INFO - __main__ - Step 143664: {'lr': 2.2583891861077953e-06, 'samples': 27583488, 'steps': 143663, 'loss/train': 1.5269057750701904} 11/07/2021 17:27:57 - INFO - __main__ - Step 143665: {'lr': 2.25767755411635e-06, 'samples': 27583680, 'steps': 143664, 'loss/train': 1.5703405141830444} 11/07/2021 17:27:57 - INFO - __main__ - Step 143666: {'lr': 2.2569660337538043e-06, 'samples': 27583872, 'steps': 143665, 'loss/train': 1.0987778902053833} 11/07/2021 17:27:57 - INFO - __main__ - Step 143667: {'lr': 2.2562546250204376e-06, 'samples': 27584064, 'steps': 143666, 'loss/train': 1.3858431577682495} 11/07/2021 17:27:58 - INFO - __main__ - Step 143668: {'lr': 2.2555433279165815e-06, 'samples': 27584256, 'steps': 143667, 'loss/train': 1.5604699850082397} 11/07/2021 17:27:58 - INFO - __main__ - Step 143669: {'lr': 2.254832142442542e-06, 'samples': 27584448, 'steps': 143668, 'loss/train': 1.1833924055099487} 11/07/2021 17:27:59 - INFO - __main__ - Step 143670: {'lr': 2.2541210685986523e-06, 'samples': 27584640, 'steps': 143669, 'loss/train': 1.6745834350585938} 11/07/2021 17:28:00 - INFO - __main__ - Step 143671: {'lr': 2.2534101063852453e-06, 'samples': 27584832, 'steps': 143670, 'loss/train': 1.273209571838379} 11/07/2021 17:28:00 - INFO - __main__ - Step 143672: {'lr': 2.252699255802626e-06, 'samples': 27585024, 'steps': 143671, 'loss/train': 1.2944961786270142} 11/07/2021 17:28:00 - INFO - __main__ - Step 143673: {'lr': 2.251988516851128e-06, 'samples': 27585216, 'steps': 143672, 'loss/train': 1.545836091041565} 11/07/2021 17:28:01 - INFO - __main__ - Step 143674: {'lr': 2.251277889531056e-06, 'samples': 27585408, 'steps': 143673, 'loss/train': 1.3455939292907715} 11/07/2021 17:28:02 - INFO - __main__ - Step 143675: {'lr': 2.2505673738427434e-06, 'samples': 27585600, 'steps': 143674, 'loss/train': 1.5798895359039307} 11/07/2021 17:28:02 - INFO - __main__ - Step 143676: {'lr': 2.249856969786468e-06, 'samples': 27585792, 'steps': 143675, 'loss/train': 1.620708703994751} 11/07/2021 17:28:02 - INFO - __main__ - Step 143677: {'lr': 2.2491466773626178e-06, 'samples': 27585984, 'steps': 143676, 'loss/train': 1.272871971130371} 11/07/2021 17:28:03 - INFO - __main__ - Step 143678: {'lr': 2.2484364965714433e-06, 'samples': 27586176, 'steps': 143677, 'loss/train': 1.5388952493667603} 11/07/2021 17:28:03 - INFO - __main__ - Step 143679: {'lr': 2.2477264274133325e-06, 'samples': 27586368, 'steps': 143678, 'loss/train': 1.1556183099746704} 11/07/2021 17:28:04 - INFO - __main__ - Step 143680: {'lr': 2.247016469888563e-06, 'samples': 27586560, 'steps': 143679, 'loss/train': 1.3602551221847534} 11/07/2021 17:28:05 - INFO - __main__ - Step 143681: {'lr': 2.2463066239974685e-06, 'samples': 27586752, 'steps': 143680, 'loss/train': 1.4102693796157837} 11/07/2021 17:28:05 - INFO - __main__ - Step 143682: {'lr': 2.2455968897403536e-06, 'samples': 27586944, 'steps': 143681, 'loss/train': 1.3127330541610718} 11/07/2021 17:28:05 - INFO - __main__ - Step 143683: {'lr': 2.244887267117551e-06, 'samples': 27587136, 'steps': 143682, 'loss/train': 1.4579027891159058} 11/07/2021 17:28:06 - INFO - __main__ - Step 143684: {'lr': 2.244177756129395e-06, 'samples': 27587328, 'steps': 143683, 'loss/train': 0.7951673865318298} 11/07/2021 17:28:06 - INFO - __main__ - Step 143685: {'lr': 2.2434683567761627e-06, 'samples': 27587520, 'steps': 143684, 'loss/train': 1.4527827501296997} 11/07/2021 17:28:07 - INFO - __main__ - Step 143686: {'lr': 2.2427590690582424e-06, 'samples': 27587712, 'steps': 143685, 'loss/train': 1.251995325088501} 11/07/2021 17:28:07 - INFO - __main__ - Step 143687: {'lr': 2.2420498929758836e-06, 'samples': 27587904, 'steps': 143686, 'loss/train': 1.0687206983566284} 11/07/2021 17:28:08 - INFO - __main__ - Step 143688: {'lr': 2.241340828529448e-06, 'samples': 27588096, 'steps': 143687, 'loss/train': 1.0293121337890625} 11/07/2021 17:28:08 - INFO - __main__ - Step 143689: {'lr': 2.240631875719212e-06, 'samples': 27588288, 'steps': 143688, 'loss/train': 2.1509077548980713} 11/07/2021 17:28:08 - INFO - __main__ - Step 143690: {'lr': 2.2399230345455378e-06, 'samples': 27588480, 'steps': 143689, 'loss/train': 1.5379642248153687} 11/07/2021 17:28:09 - INFO - __main__ - Step 143691: {'lr': 2.2392143050087577e-06, 'samples': 27588672, 'steps': 143690, 'loss/train': 0.39193102717399597} 11/07/2021 17:28:10 - INFO - __main__ - Step 143692: {'lr': 2.2385056871091214e-06, 'samples': 27588864, 'steps': 143691, 'loss/train': 1.2971075773239136} 11/07/2021 17:28:10 - INFO - __main__ - Step 143693: {'lr': 2.2377971808470176e-06, 'samples': 27589056, 'steps': 143692, 'loss/train': 1.4666997194290161} 11/07/2021 17:28:11 - INFO - __main__ - Step 143694: {'lr': 2.237088786222724e-06, 'samples': 27589248, 'steps': 143693, 'loss/train': 1.3732889890670776} 11/07/2021 17:28:11 - INFO - __main__ - Step 143695: {'lr': 2.2363805032366013e-06, 'samples': 27589440, 'steps': 143694, 'loss/train': 1.470781683921814} 11/07/2021 17:28:12 - INFO - __main__ - Step 143696: {'lr': 2.2356723318889273e-06, 'samples': 27589632, 'steps': 143695, 'loss/train': 1.0029128789901733} 11/07/2021 17:28:12 - INFO - __main__ - Step 143697: {'lr': 2.2349642721800345e-06, 'samples': 27589824, 'steps': 143696, 'loss/train': 1.4237486124038696} 11/07/2021 17:28:13 - INFO - __main__ - Step 143698: {'lr': 2.2342563241102565e-06, 'samples': 27590016, 'steps': 143697, 'loss/train': 1.4188940525054932} 11/07/2021 17:28:13 - INFO - __main__ - Step 143699: {'lr': 2.2335484876798707e-06, 'samples': 27590208, 'steps': 143698, 'loss/train': 1.0876991748809814} 11/07/2021 17:28:13 - INFO - __main__ - Step 143700: {'lr': 2.232840762889238e-06, 'samples': 27590400, 'steps': 143699, 'loss/train': 1.4770079851150513} 11/07/2021 17:28:14 - INFO - __main__ - Step 143701: {'lr': 2.232133149738663e-06, 'samples': 27590592, 'steps': 143700, 'loss/train': 1.3598779439926147} 11/07/2021 17:28:15 - INFO - __main__ - Step 143702: {'lr': 2.23142564822848e-06, 'samples': 27590784, 'steps': 143701, 'loss/train': 0.989029586315155} 11/07/2021 17:28:15 - INFO - __main__ - Step 143703: {'lr': 2.2307182583589934e-06, 'samples': 27590976, 'steps': 143702, 'loss/train': 1.021953821182251} 11/07/2021 17:28:15 - INFO - __main__ - Step 143704: {'lr': 2.230010980130509e-06, 'samples': 27591168, 'steps': 143703, 'loss/train': 1.1161218881607056} 11/07/2021 17:28:16 - INFO - __main__ - Step 143705: {'lr': 2.2293038135433595e-06, 'samples': 27591360, 'steps': 143704, 'loss/train': 1.4028226137161255} 11/07/2021 17:28:17 - INFO - __main__ - Step 143706: {'lr': 2.2285967585978507e-06, 'samples': 27591552, 'steps': 143705, 'loss/train': 1.3525655269622803} 11/07/2021 17:28:17 - INFO - __main__ - Step 143707: {'lr': 2.227889815294315e-06, 'samples': 27591744, 'steps': 143706, 'loss/train': 1.4647653102874756} 11/07/2021 17:28:17 - INFO - __main__ - Step 143708: {'lr': 2.2271829836331138e-06, 'samples': 27591936, 'steps': 143707, 'loss/train': 1.3697528839111328} 11/07/2021 17:28:18 - INFO - __main__ - Step 143709: {'lr': 2.2264762636144688e-06, 'samples': 27592128, 'steps': 143708, 'loss/train': 0.8993080854415894} 11/07/2021 17:28:18 - INFO - __main__ - Step 143710: {'lr': 2.2257696552387685e-06, 'samples': 27592320, 'steps': 143709, 'loss/train': 1.4785116910934448} 11/07/2021 17:28:19 - INFO - __main__ - Step 143711: {'lr': 2.2250631585063187e-06, 'samples': 27592512, 'steps': 143710, 'loss/train': 1.4631372690200806} 11/07/2021 17:28:20 - INFO - __main__ - Step 143712: {'lr': 2.2243567734174242e-06, 'samples': 27592704, 'steps': 143711, 'loss/train': 1.4659173488616943} 11/07/2021 17:28:20 - INFO - __main__ - Step 143713: {'lr': 2.2236504999723905e-06, 'samples': 27592896, 'steps': 143712, 'loss/train': 1.4606760740280151} 11/07/2021 17:28:20 - INFO - __main__ - Step 143714: {'lr': 2.2229443381715784e-06, 'samples': 27593088, 'steps': 143713, 'loss/train': 0.9786295294761658} 11/07/2021 17:28:21 - INFO - __main__ - Step 143715: {'lr': 2.2222382880152937e-06, 'samples': 27593280, 'steps': 143714, 'loss/train': 1.8921105861663818} 11/07/2021 17:28:21 - INFO - __main__ - Step 143716: {'lr': 2.2215323495038408e-06, 'samples': 27593472, 'steps': 143715, 'loss/train': 1.6419577598571777} 11/07/2021 17:28:22 - INFO - __main__ - Step 143717: {'lr': 2.2208265226375255e-06, 'samples': 27593664, 'steps': 143716, 'loss/train': 1.418225646018982} 11/07/2021 17:28:22 - INFO - __main__ - Step 143718: {'lr': 2.2201208074167088e-06, 'samples': 27593856, 'steps': 143717, 'loss/train': 0.22073253989219666} 11/07/2021 17:28:23 - INFO - __main__ - Step 143719: {'lr': 2.2194152038416683e-06, 'samples': 27594048, 'steps': 143718, 'loss/train': 1.6741437911987305} 11/07/2021 17:28:23 - INFO - __main__ - Step 143720: {'lr': 2.2187097119127362e-06, 'samples': 27594240, 'steps': 143719, 'loss/train': 1.7907893657684326} 11/07/2021 17:28:23 - INFO - __main__ - Step 143721: {'lr': 2.218004331630219e-06, 'samples': 27594432, 'steps': 143720, 'loss/train': 1.7606618404388428} 11/07/2021 17:28:24 - INFO - __main__ - Step 143722: {'lr': 2.217299062994449e-06, 'samples': 27594624, 'steps': 143721, 'loss/train': 1.4713780879974365} 11/07/2021 17:28:25 - INFO - __main__ - Step 143723: {'lr': 2.216593906005759e-06, 'samples': 27594816, 'steps': 143722, 'loss/train': 1.0537538528442383} 11/07/2021 17:28:25 - INFO - __main__ - Step 143724: {'lr': 2.215888860664428e-06, 'samples': 27595008, 'steps': 143723, 'loss/train': 1.7844791412353516} 11/07/2021 17:28:25 - INFO - __main__ - Step 143725: {'lr': 2.2151839269707873e-06, 'samples': 27595200, 'steps': 143724, 'loss/train': 1.0214717388153076} 11/07/2021 17:28:26 - INFO - __main__ - Step 143726: {'lr': 2.214479104925171e-06, 'samples': 27595392, 'steps': 143725, 'loss/train': 1.300819993019104} 11/07/2021 17:28:27 - INFO - __main__ - Step 143727: {'lr': 2.213774394527912e-06, 'samples': 27595584, 'steps': 143726, 'loss/train': 1.5623061656951904} 11/07/2021 17:28:27 - INFO - __main__ - Step 143728: {'lr': 2.21306979577926e-06, 'samples': 27595776, 'steps': 143727, 'loss/train': 1.4553505182266235} 11/07/2021 17:28:28 - INFO - __main__ - Step 143729: {'lr': 2.2123653086796035e-06, 'samples': 27595968, 'steps': 143728, 'loss/train': 1.2263402938842773} 11/07/2021 17:28:28 - INFO - __main__ - Step 143730: {'lr': 2.211660933229248e-06, 'samples': 27596160, 'steps': 143729, 'loss/train': 1.0232937335968018} 11/07/2021 17:28:28 - INFO - __main__ - Step 143731: {'lr': 2.210956669428471e-06, 'samples': 27596352, 'steps': 143730, 'loss/train': 1.2193893194198608} 11/07/2021 17:28:29 - INFO - __main__ - Step 143732: {'lr': 2.2102525172776056e-06, 'samples': 27596544, 'steps': 143731, 'loss/train': 1.6859910488128662} 11/07/2021 17:28:30 - INFO - __main__ - Step 143733: {'lr': 2.209548476776985e-06, 'samples': 27596736, 'steps': 143732, 'loss/train': 1.9939000606536865} 11/07/2021 17:28:30 - INFO - __main__ - Step 143734: {'lr': 2.2088445479269135e-06, 'samples': 27596928, 'steps': 143733, 'loss/train': 1.2198052406311035} 11/07/2021 17:28:30 - INFO - __main__ - Step 143735: {'lr': 2.2081407307277256e-06, 'samples': 27597120, 'steps': 143734, 'loss/train': 1.1121519804000854} 11/07/2021 17:28:31 - INFO - __main__ - Step 143736: {'lr': 2.2074370251796982e-06, 'samples': 27597312, 'steps': 143735, 'loss/train': 1.2137092351913452} 11/07/2021 17:28:31 - INFO - __main__ - Step 143737: {'lr': 2.206733431283192e-06, 'samples': 27597504, 'steps': 143736, 'loss/train': 1.4546215534210205} 11/07/2021 17:28:32 - INFO - __main__ - Step 143738: {'lr': 2.2060299490385127e-06, 'samples': 27597696, 'steps': 143737, 'loss/train': 1.2613651752471924} 11/07/2021 17:28:33 - INFO - __main__ - Step 143739: {'lr': 2.205326578445993e-06, 'samples': 27597888, 'steps': 143738, 'loss/train': 0.8926458358764648} 11/07/2021 17:28:33 - INFO - __main__ - Step 143740: {'lr': 2.204623319505883e-06, 'samples': 27598080, 'steps': 143739, 'loss/train': 1.1993772983551025} 11/07/2021 17:28:33 - INFO - __main__ - Step 143741: {'lr': 2.203920172218571e-06, 'samples': 27598272, 'steps': 143740, 'loss/train': 1.2517330646514893} 11/07/2021 17:28:34 - INFO - __main__ - Step 143742: {'lr': 2.2032171365843624e-06, 'samples': 27598464, 'steps': 143741, 'loss/train': 1.6898125410079956} 11/07/2021 17:28:35 - INFO - __main__ - Step 143743: {'lr': 2.2025142126035626e-06, 'samples': 27598656, 'steps': 143742, 'loss/train': 1.0784319639205933} 11/07/2021 17:28:35 - INFO - __main__ - Step 143744: {'lr': 2.2018114002764488e-06, 'samples': 27598848, 'steps': 143743, 'loss/train': 1.3587290048599243} 11/07/2021 17:28:35 - INFO - __main__ - Step 143745: {'lr': 2.20110869960341e-06, 'samples': 27599040, 'steps': 143744, 'loss/train': 1.3538066148757935} 11/07/2021 17:28:36 - INFO - __main__ - Step 143746: {'lr': 2.200406110584724e-06, 'samples': 27599232, 'steps': 143745, 'loss/train': 1.2875051498413086} 11/07/2021 17:28:36 - INFO - __main__ - Step 143747: {'lr': 2.1997036332206955e-06, 'samples': 27599424, 'steps': 143746, 'loss/train': 1.4653348922729492} 11/07/2021 17:28:37 - INFO - __main__ - Step 143748: {'lr': 2.199001267511658e-06, 'samples': 27599616, 'steps': 143747, 'loss/train': 1.136195182800293} 11/07/2021 17:28:37 - INFO - __main__ - Step 143749: {'lr': 2.198299013457916e-06, 'samples': 27599808, 'steps': 143748, 'loss/train': 0.7147278189659119} 11/07/2021 17:28:38 - INFO - __main__ - Step 143750: {'lr': 2.1975968710598316e-06, 'samples': 27600000, 'steps': 143749, 'loss/train': 1.4728524684906006} 11/07/2021 17:28:38 - INFO - __main__ - Step 143751: {'lr': 2.1968948403176535e-06, 'samples': 27600192, 'steps': 143750, 'loss/train': 1.363353967666626} 11/07/2021 17:28:38 - INFO - __main__ - Step 143752: {'lr': 2.1961929212317434e-06, 'samples': 27600384, 'steps': 143751, 'loss/train': 1.6657122373580933} 11/07/2021 17:28:39 - INFO - __main__ - Step 143753: {'lr': 2.195491113802406e-06, 'samples': 27600576, 'steps': 143752, 'loss/train': 1.356592059135437} 11/07/2021 17:28:40 - INFO - __main__ - Step 143754: {'lr': 2.1947894180299465e-06, 'samples': 27600768, 'steps': 143753, 'loss/train': 1.3019737005233765} 11/07/2021 17:28:40 - INFO - __main__ - Step 143755: {'lr': 2.1940878339146987e-06, 'samples': 27600960, 'steps': 143754, 'loss/train': 1.1029951572418213} 11/07/2021 17:28:41 - INFO - __main__ - Step 143756: {'lr': 2.193386361456995e-06, 'samples': 27601152, 'steps': 143755, 'loss/train': 0.7051100134849548} 11/07/2021 17:28:41 - INFO - __main__ - Step 143757: {'lr': 2.192685000657113e-06, 'samples': 27601344, 'steps': 143756, 'loss/train': 1.5408519506454468} 11/07/2021 17:28:41 - INFO - __main__ - Step 143758: {'lr': 2.1919837515153585e-06, 'samples': 27601536, 'steps': 143757, 'loss/train': 1.3333823680877686} 11/07/2021 17:28:42 - INFO - __main__ - Step 143759: {'lr': 2.191282614032092e-06, 'samples': 27601728, 'steps': 143758, 'loss/train': 1.5403170585632324} 11/07/2021 17:28:43 - INFO - __main__ - Step 143760: {'lr': 2.1905815882076187e-06, 'samples': 27601920, 'steps': 143759, 'loss/train': 1.1045464277267456} 11/07/2021 17:28:43 - INFO - __main__ - Step 143761: {'lr': 2.189880674042216e-06, 'samples': 27602112, 'steps': 143760, 'loss/train': 1.2028839588165283} 11/07/2021 17:28:43 - INFO - __main__ - Step 143762: {'lr': 2.1891798715362456e-06, 'samples': 27602304, 'steps': 143761, 'loss/train': 1.28948175907135} 11/07/2021 17:28:44 - INFO - __main__ - Step 143763: {'lr': 2.188479180690012e-06, 'samples': 27602496, 'steps': 143762, 'loss/train': 1.218320369720459} 11/07/2021 17:28:45 - INFO - __main__ - Step 143764: {'lr': 2.1877786015038205e-06, 'samples': 27602688, 'steps': 143763, 'loss/train': 1.3478083610534668} 11/07/2021 17:28:45 - INFO - __main__ - Step 143765: {'lr': 2.1870781339780045e-06, 'samples': 27602880, 'steps': 143764, 'loss/train': 1.5222223997116089} 11/07/2021 17:28:46 - INFO - __main__ - Step 143766: {'lr': 2.1863777781128413e-06, 'samples': 27603072, 'steps': 143765, 'loss/train': 1.1973903179168701} 11/07/2021 17:28:46 - INFO - __main__ - Step 143767: {'lr': 2.185677533908692e-06, 'samples': 27603264, 'steps': 143766, 'loss/train': 1.4150339365005493} 11/07/2021 17:28:46 - INFO - __main__ - Step 143768: {'lr': 2.1849774013658343e-06, 'samples': 27603456, 'steps': 143767, 'loss/train': 1.597379446029663} 11/07/2021 17:28:47 - INFO - __main__ - Step 143769: {'lr': 2.1842773804846283e-06, 'samples': 27603648, 'steps': 143768, 'loss/train': 0.8560854196548462} 11/07/2021 17:28:48 - INFO - __main__ - Step 143770: {'lr': 2.1835774712653524e-06, 'samples': 27603840, 'steps': 143769, 'loss/train': 0.8451992273330688} 11/07/2021 17:28:48 - INFO - __main__ - Step 143771: {'lr': 2.182877673708339e-06, 'samples': 27604032, 'steps': 143770, 'loss/train': 1.5516632795333862} 11/07/2021 17:28:48 - INFO - __main__ - Step 143772: {'lr': 2.1821779878138936e-06, 'samples': 27604224, 'steps': 143771, 'loss/train': 1.3742719888687134} 11/07/2021 17:28:49 - INFO - __main__ - Step 143773: {'lr': 2.1814784135823217e-06, 'samples': 27604416, 'steps': 143772, 'loss/train': 1.2336177825927734} 11/07/2021 17:28:50 - INFO - __main__ - Step 143774: {'lr': 2.1807789510139565e-06, 'samples': 27604608, 'steps': 143773, 'loss/train': 1.476953148841858} 11/07/2021 17:28:50 - INFO - __main__ - Step 143775: {'lr': 2.1800796001091027e-06, 'samples': 27604800, 'steps': 143774, 'loss/train': 1.28074049949646} 11/07/2021 17:28:50 - INFO - __main__ - Step 143776: {'lr': 2.179380360868094e-06, 'samples': 27604992, 'steps': 143775, 'loss/train': 1.5117433071136475} 11/07/2021 17:28:51 - INFO - __main__ - Step 143777: {'lr': 2.178681233291208e-06, 'samples': 27605184, 'steps': 143776, 'loss/train': 0.8814643025398254} 11/07/2021 17:28:51 - INFO - __main__ - Step 143778: {'lr': 2.1779822173788045e-06, 'samples': 27605376, 'steps': 143777, 'loss/train': 1.370948314666748} 11/07/2021 17:28:52 - INFO - __main__ - Step 143779: {'lr': 2.17728331313119e-06, 'samples': 27605568, 'steps': 143778, 'loss/train': 1.0383776426315308} 11/07/2021 17:28:53 - INFO - __main__ - Step 143780: {'lr': 2.1765845205486412e-06, 'samples': 27605760, 'steps': 143779, 'loss/train': 1.34644615650177} 11/07/2021 17:28:53 - INFO - __main__ - Step 143781: {'lr': 2.1758858396315196e-06, 'samples': 27605952, 'steps': 143780, 'loss/train': 4.422877788543701} 11/07/2021 17:28:53 - INFO - __main__ - Step 143782: {'lr': 2.1751872703801024e-06, 'samples': 27606144, 'steps': 143781, 'loss/train': 3.065248727798462} 11/07/2021 17:28:54 - INFO - __main__ - Step 143783: {'lr': 2.1744888127947504e-06, 'samples': 27606336, 'steps': 143782, 'loss/train': 1.2539993524551392} 11/07/2021 17:28:54 - INFO - __main__ - Step 143784: {'lr': 2.1737904668757137e-06, 'samples': 27606528, 'steps': 143783, 'loss/train': 1.4655258655548096} 11/07/2021 17:28:55 - INFO - __main__ - Step 143785: {'lr': 2.1730922326233804e-06, 'samples': 27606720, 'steps': 143784, 'loss/train': 1.1317181587219238} 11/07/2021 17:28:55 - INFO - __main__ - Step 143786: {'lr': 2.1723941100380006e-06, 'samples': 27606912, 'steps': 143785, 'loss/train': 1.379563331604004} 11/07/2021 17:28:56 - INFO - __main__ - Step 143787: {'lr': 2.1716960991199075e-06, 'samples': 27607104, 'steps': 143786, 'loss/train': 0.7629773020744324} 11/07/2021 17:28:56 - INFO - __main__ - Step 143788: {'lr': 2.170998199869434e-06, 'samples': 27607296, 'steps': 143787, 'loss/train': 1.619066834449768} 11/07/2021 17:28:56 - INFO - __main__ - Step 143789: {'lr': 2.1703004122868854e-06, 'samples': 27607488, 'steps': 143788, 'loss/train': 1.3474465608596802} 11/07/2021 17:28:57 - INFO - __main__ - Step 143790: {'lr': 2.1696027363725947e-06, 'samples': 27607680, 'steps': 143789, 'loss/train': 1.4059470891952515} 11/07/2021 17:28:58 - INFO - __main__ - Step 143791: {'lr': 2.168905172126839e-06, 'samples': 27607872, 'steps': 143790, 'loss/train': 0.3294861614704132} 11/07/2021 17:28:58 - INFO - __main__ - Step 143792: {'lr': 2.1682077195499527e-06, 'samples': 27608064, 'steps': 143791, 'loss/train': 1.3915473222732544} 11/07/2021 17:28:58 - INFO - __main__ - Step 143793: {'lr': 2.16751037864224e-06, 'samples': 27608256, 'steps': 143792, 'loss/train': 1.8135857582092285} 11/07/2021 17:28:59 - INFO - __main__ - Step 143794: {'lr': 2.1668131494040346e-06, 'samples': 27608448, 'steps': 143793, 'loss/train': 1.4058923721313477} 11/07/2021 17:29:00 - INFO - __main__ - Step 143795: {'lr': 2.1661160318356134e-06, 'samples': 27608640, 'steps': 143794, 'loss/train': 1.6164100170135498} 11/07/2021 17:29:00 - INFO - __main__ - Step 143796: {'lr': 2.1654190259373376e-06, 'samples': 27608832, 'steps': 143795, 'loss/train': 2.135049819946289} 11/07/2021 17:29:01 - INFO - __main__ - Step 143797: {'lr': 2.164722131709512e-06, 'samples': 27609024, 'steps': 143796, 'loss/train': 1.1993237733840942} 11/07/2021 17:29:01 - INFO - __main__ - Step 143798: {'lr': 2.164025349152443e-06, 'samples': 27609216, 'steps': 143797, 'loss/train': 1.342998743057251} 11/07/2021 17:29:01 - INFO - __main__ - Step 143799: {'lr': 2.1633286782664073e-06, 'samples': 27609408, 'steps': 143798, 'loss/train': 1.6759594678878784} 11/07/2021 17:29:02 - INFO - __main__ - Step 143800: {'lr': 2.162632119051766e-06, 'samples': 27609600, 'steps': 143799, 'loss/train': 1.423592209815979} 11/07/2021 17:29:03 - INFO - __main__ - Step 143801: {'lr': 2.1619356715088245e-06, 'samples': 27609792, 'steps': 143800, 'loss/train': 1.8896476030349731} 11/07/2021 17:29:03 - INFO - __main__ - Step 143802: {'lr': 2.161239335637888e-06, 'samples': 27609984, 'steps': 143801, 'loss/train': 1.1931360960006714} 11/07/2021 17:29:03 - INFO - __main__ - Step 143803: {'lr': 2.1605431114392617e-06, 'samples': 27610176, 'steps': 143802, 'loss/train': 1.6185953617095947} 11/07/2021 17:29:04 - INFO - __main__ - Step 143804: {'lr': 2.1598469989132786e-06, 'samples': 27610368, 'steps': 143803, 'loss/train': 1.4316991567611694} 11/07/2021 17:29:04 - INFO - __main__ - Step 143805: {'lr': 2.1591509980602443e-06, 'samples': 27610560, 'steps': 143804, 'loss/train': 1.2776877880096436} 11/07/2021 17:29:06 - INFO - __main__ - Step 143806: {'lr': 2.158455108880464e-06, 'samples': 27610752, 'steps': 143805, 'loss/train': 1.4019153118133545} 11/07/2021 17:29:06 - INFO - __main__ - Step 143807: {'lr': 2.1577593313742707e-06, 'samples': 27610944, 'steps': 143806, 'loss/train': 0.2820802330970764} 11/07/2021 17:29:06 - INFO - __main__ - Step 143808: {'lr': 2.1570636655419417e-06, 'samples': 27611136, 'steps': 143807, 'loss/train': 1.300653338432312} 11/07/2021 17:29:07 - INFO - __main__ - Step 143809: {'lr': 2.1563681113838383e-06, 'samples': 27611328, 'steps': 143808, 'loss/train': 1.3844507932662964} 11/07/2021 17:29:07 - INFO - __main__ - Step 143810: {'lr': 2.155672668900266e-06, 'samples': 27611520, 'steps': 143809, 'loss/train': 1.089347243309021} 11/07/2021 17:29:08 - INFO - __main__ - Step 143811: {'lr': 2.1549773380915014e-06, 'samples': 27611712, 'steps': 143810, 'loss/train': 0.985270082950592} 11/07/2021 17:29:08 - INFO - __main__ - Step 143812: {'lr': 2.1542821189578786e-06, 'samples': 27611904, 'steps': 143811, 'loss/train': 2.0123047828674316} 11/07/2021 17:29:09 - INFO - __main__ - Step 143813: {'lr': 2.1535870114997304e-06, 'samples': 27612096, 'steps': 143812, 'loss/train': 1.5186446905136108} 11/07/2021 17:29:09 - INFO - __main__ - Step 143814: {'lr': 2.1528920157173337e-06, 'samples': 27612288, 'steps': 143813, 'loss/train': 1.910151481628418} 11/07/2021 17:29:09 - INFO - __main__ - Step 143815: {'lr': 2.1521971316110222e-06, 'samples': 27612480, 'steps': 143814, 'loss/train': 1.0630042552947998} 11/07/2021 17:29:10 - INFO - __main__ - Step 143816: {'lr': 2.151502359181101e-06, 'samples': 27612672, 'steps': 143815, 'loss/train': 1.3438211679458618} 11/07/2021 17:29:11 - INFO - __main__ - Step 143817: {'lr': 2.1508076984279037e-06, 'samples': 27612864, 'steps': 143816, 'loss/train': 1.3001017570495605} 11/07/2021 17:29:11 - INFO - __main__ - Step 143818: {'lr': 2.150113149351707e-06, 'samples': 27613056, 'steps': 143817, 'loss/train': 1.3551530838012695} 11/07/2021 17:29:11 - INFO - __main__ - Step 143819: {'lr': 2.1494187119528442e-06, 'samples': 27613248, 'steps': 143818, 'loss/train': 1.5058642625808716} 11/07/2021 17:29:12 - INFO - __main__ - Step 143820: {'lr': 2.1487243862316487e-06, 'samples': 27613440, 'steps': 143819, 'loss/train': 1.341639518737793} 11/07/2021 17:29:13 - INFO - __main__ - Step 143821: {'lr': 2.1480301721883977e-06, 'samples': 27613632, 'steps': 143820, 'loss/train': 1.5593205690383911} 11/07/2021 17:29:13 - INFO - __main__ - Step 143822: {'lr': 2.1473360698234245e-06, 'samples': 27613824, 'steps': 143821, 'loss/train': 1.1816143989562988} 11/07/2021 17:29:13 - INFO - __main__ - Step 143823: {'lr': 2.1466420791370624e-06, 'samples': 27614016, 'steps': 143822, 'loss/train': 1.5535166263580322} 11/07/2021 17:29:14 - INFO - __main__ - Step 143824: {'lr': 2.1459482001295884e-06, 'samples': 27614208, 'steps': 143823, 'loss/train': 1.0715453624725342} 11/07/2021 17:29:14 - INFO - __main__ - Step 143825: {'lr': 2.145254432801308e-06, 'samples': 27614400, 'steps': 143824, 'loss/train': 1.5298060178756714} 11/07/2021 17:29:15 - INFO - __main__ - Step 143826: {'lr': 2.1445607771525545e-06, 'samples': 27614592, 'steps': 143825, 'loss/train': 0.938098132610321} 11/07/2021 17:29:16 - INFO - __main__ - Step 143827: {'lr': 2.143867233183633e-06, 'samples': 27614784, 'steps': 143826, 'loss/train': 1.0687474012374878} 11/07/2021 17:29:16 - INFO - __main__ - Step 143828: {'lr': 2.1431738008948767e-06, 'samples': 27614976, 'steps': 143827, 'loss/train': 1.472769021987915} 11/07/2021 17:29:16 - INFO - __main__ - Step 143829: {'lr': 2.142480480286563e-06, 'samples': 27615168, 'steps': 143828, 'loss/train': 1.3410875797271729} 11/07/2021 17:29:17 - INFO - __main__ - Step 143830: {'lr': 2.1417872713590247e-06, 'samples': 27615360, 'steps': 143829, 'loss/train': 1.4186443090438843} 11/07/2021 17:29:18 - INFO - __main__ - Step 143831: {'lr': 2.1410941741125956e-06, 'samples': 27615552, 'steps': 143830, 'loss/train': 1.7835108041763306} 11/07/2021 17:29:18 - INFO - __main__ - Step 143832: {'lr': 2.140401188547525e-06, 'samples': 27615744, 'steps': 143831, 'loss/train': 1.3110086917877197} 11/07/2021 17:29:18 - INFO - __main__ - Step 143833: {'lr': 2.1397083146642016e-06, 'samples': 27615936, 'steps': 143832, 'loss/train': 1.0351229906082153} 11/07/2021 17:29:19 - INFO - __main__ - Step 143834: {'lr': 2.139015552462875e-06, 'samples': 27616128, 'steps': 143833, 'loss/train': 1.253993034362793} 11/07/2021 17:29:19 - INFO - __main__ - Step 143835: {'lr': 2.1383229019439067e-06, 'samples': 27616320, 'steps': 143834, 'loss/train': 1.6174966096878052} 11/07/2021 17:29:19 - INFO - __main__ - Step 143836: {'lr': 2.1376303631075734e-06, 'samples': 27616512, 'steps': 143835, 'loss/train': 0.8045604825019836} 11/07/2021 17:29:20 - INFO - __main__ - Step 143837: {'lr': 2.1369379359542083e-06, 'samples': 27616704, 'steps': 143836, 'loss/train': 1.5673353672027588} 11/07/2021 17:29:21 - INFO - __main__ - Step 143838: {'lr': 2.1362456204841173e-06, 'samples': 27616896, 'steps': 143837, 'loss/train': 1.39918053150177} 11/07/2021 17:29:21 - INFO - __main__ - Step 143839: {'lr': 2.1355534166976053e-06, 'samples': 27617088, 'steps': 143838, 'loss/train': 1.297722339630127} 11/07/2021 17:29:22 - INFO - __main__ - Step 143840: {'lr': 2.1348613245949775e-06, 'samples': 27617280, 'steps': 143839, 'loss/train': 1.2397489547729492} 11/07/2021 17:29:22 - INFO - __main__ - Step 143841: {'lr': 2.13416934417654e-06, 'samples': 27617472, 'steps': 143840, 'loss/train': 1.3607033491134644} 11/07/2021 17:29:23 - INFO - __main__ - Step 143842: {'lr': 2.1334774754426523e-06, 'samples': 27617664, 'steps': 143841, 'loss/train': 1.616019368171692} 11/07/2021 17:29:23 - INFO - __main__ - Step 143843: {'lr': 2.1327857183935927e-06, 'samples': 27617856, 'steps': 143842, 'loss/train': 1.7682024240493774} 11/07/2021 17:29:24 - INFO - __main__ - Step 143844: {'lr': 2.132094073029639e-06, 'samples': 27618048, 'steps': 143843, 'loss/train': 1.255635380744934} 11/07/2021 17:29:24 - INFO - __main__ - Step 143845: {'lr': 2.1314025393511795e-06, 'samples': 27618240, 'steps': 143844, 'loss/train': 1.7805712223052979} 11/07/2021 17:29:24 - INFO - __main__ - Step 143846: {'lr': 2.1307111173584637e-06, 'samples': 27618432, 'steps': 143845, 'loss/train': 1.6561335325241089} 11/07/2021 17:29:26 - INFO - __main__ - Step 143847: {'lr': 2.130019807051825e-06, 'samples': 27618624, 'steps': 143846, 'loss/train': 1.4484447240829468} 11/07/2021 17:29:26 - INFO - __main__ - Step 143848: {'lr': 2.1293286084315964e-06, 'samples': 27618816, 'steps': 143847, 'loss/train': 1.191259741783142} 11/07/2021 17:29:26 - INFO - __main__ - Step 143849: {'lr': 2.1286375214980557e-06, 'samples': 27619008, 'steps': 143848, 'loss/train': 1.4513235092163086} 11/07/2021 17:29:27 - INFO - __main__ - Step 143850: {'lr': 2.127946546251508e-06, 'samples': 27619200, 'steps': 143849, 'loss/train': 0.992472231388092} 11/07/2021 17:29:27 - INFO - __main__ - Step 143851: {'lr': 2.127255682692314e-06, 'samples': 27619392, 'steps': 143850, 'loss/train': 1.269993543624878} 11/07/2021 17:29:28 - INFO - __main__ - Step 143852: {'lr': 2.1265649308207514e-06, 'samples': 27619584, 'steps': 143851, 'loss/train': 0.9882006645202637} 11/07/2021 17:29:28 - INFO - __main__ - Step 143853: {'lr': 2.1258742906370975e-06, 'samples': 27619776, 'steps': 143852, 'loss/train': 1.5117383003234863} 11/07/2021 17:29:29 - INFO - __main__ - Step 143854: {'lr': 2.1251837621417414e-06, 'samples': 27619968, 'steps': 143853, 'loss/train': 1.5149247646331787} 11/07/2021 17:29:29 - INFO - __main__ - Step 143855: {'lr': 2.1244933453349325e-06, 'samples': 27620160, 'steps': 143854, 'loss/train': 0.8830081820487976} 11/07/2021 17:29:30 - INFO - __main__ - Step 143856: {'lr': 2.123803040217004e-06, 'samples': 27620352, 'steps': 143855, 'loss/train': 1.0790534019470215} 11/07/2021 17:29:31 - INFO - __main__ - Step 143857: {'lr': 2.123112846788261e-06, 'samples': 27620544, 'steps': 143856, 'loss/train': 1.3405629396438599} 11/07/2021 17:29:31 - INFO - __main__ - Step 143858: {'lr': 2.122422765049009e-06, 'samples': 27620736, 'steps': 143857, 'loss/train': 1.2645604610443115} 11/07/2021 17:29:31 - INFO - __main__ - Step 143859: {'lr': 2.121732794999581e-06, 'samples': 27620928, 'steps': 143858, 'loss/train': 1.6305679082870483} 11/07/2021 17:29:32 - INFO - __main__ - Step 143860: {'lr': 2.121042936640283e-06, 'samples': 27621120, 'steps': 143859, 'loss/train': 1.2031502723693848} 11/07/2021 17:29:32 - INFO - __main__ - Step 143861: {'lr': 2.1203531899713913e-06, 'samples': 27621312, 'steps': 143860, 'loss/train': 1.2609291076660156} 11/07/2021 17:29:33 - INFO - __main__ - Step 143862: {'lr': 2.1196635549932676e-06, 'samples': 27621504, 'steps': 143861, 'loss/train': 0.8105145692825317} 11/07/2021 17:29:34 - INFO - __main__ - Step 143863: {'lr': 2.118974031706189e-06, 'samples': 27621696, 'steps': 143862, 'loss/train': 1.2683337926864624} 11/07/2021 17:29:34 - INFO - __main__ - Step 143864: {'lr': 2.118284620110489e-06, 'samples': 27621888, 'steps': 143863, 'loss/train': 1.1249163150787354} 11/07/2021 17:29:34 - INFO - __main__ - Step 143865: {'lr': 2.1175953202064726e-06, 'samples': 27622080, 'steps': 143864, 'loss/train': 1.1754233837127686} 11/07/2021 17:29:35 - INFO - __main__ - Step 143866: {'lr': 2.116906131994417e-06, 'samples': 27622272, 'steps': 143865, 'loss/train': 1.332663893699646} 11/07/2021 17:29:35 - INFO - __main__ - Step 143867: {'lr': 2.1162170554746564e-06, 'samples': 27622464, 'steps': 143866, 'loss/train': 0.912757933139801} 11/07/2021 17:29:36 - INFO - __main__ - Step 143868: {'lr': 2.1155280906475228e-06, 'samples': 27622656, 'steps': 143867, 'loss/train': 1.3626374006271362} 11/07/2021 17:29:36 - INFO - __main__ - Step 143869: {'lr': 2.114839237513294e-06, 'samples': 27622848, 'steps': 143868, 'loss/train': 1.0733284950256348} 11/07/2021 17:29:37 - INFO - __main__ - Step 143870: {'lr': 2.1141504960723036e-06, 'samples': 27623040, 'steps': 143869, 'loss/train': 1.3494399785995483} 11/07/2021 17:29:37 - INFO - __main__ - Step 143871: {'lr': 2.1134618663248284e-06, 'samples': 27623232, 'steps': 143870, 'loss/train': 0.7977849841117859} 11/07/2021 17:29:37 - INFO - __main__ - Step 143872: {'lr': 2.1127733482712295e-06, 'samples': 27623424, 'steps': 143871, 'loss/train': 1.3686227798461914} 11/07/2021 17:29:38 - INFO - __main__ - Step 143873: {'lr': 2.1120849419117848e-06, 'samples': 27623616, 'steps': 143872, 'loss/train': 1.1184415817260742} 11/07/2021 17:29:39 - INFO - __main__ - Step 143874: {'lr': 2.111396647246799e-06, 'samples': 27623808, 'steps': 143873, 'loss/train': 1.3375985622406006} 11/07/2021 17:29:39 - INFO - __main__ - Step 143875: {'lr': 2.110708464276606e-06, 'samples': 27624000, 'steps': 143874, 'loss/train': 1.6790543794631958} 11/07/2021 17:29:39 - INFO - __main__ - Step 143876: {'lr': 2.1100203930014826e-06, 'samples': 27624192, 'steps': 143875, 'loss/train': 1.3335516452789307} 11/07/2021 17:29:40 - INFO - __main__ - Step 143877: {'lr': 2.10933243342179e-06, 'samples': 27624384, 'steps': 143876, 'loss/train': 1.385259985923767} 11/07/2021 17:29:41 - INFO - __main__ - Step 143878: {'lr': 2.1086445855377777e-06, 'samples': 27624576, 'steps': 143877, 'loss/train': 1.1380664110183716} 11/07/2021 17:29:41 - INFO - __main__ - Step 143879: {'lr': 2.107956849349807e-06, 'samples': 27624768, 'steps': 143878, 'loss/train': 1.5209201574325562} 11/07/2021 17:29:42 - INFO - __main__ - Step 143880: {'lr': 2.107269224858155e-06, 'samples': 27624960, 'steps': 143879, 'loss/train': 1.3107167482376099} 11/07/2021 17:29:42 - INFO - __main__ - Step 143881: {'lr': 2.106581712063127e-06, 'samples': 27625152, 'steps': 143880, 'loss/train': 0.6334788799285889} 11/07/2021 17:29:42 - INFO - __main__ - Step 143882: {'lr': 2.1058943109650563e-06, 'samples': 27625344, 'steps': 143881, 'loss/train': 0.8734022378921509} 11/07/2021 17:29:43 - INFO - __main__ - Step 143883: {'lr': 2.105207021564276e-06, 'samples': 27625536, 'steps': 143882, 'loss/train': 1.5320667028427124} 11/07/2021 17:29:44 - INFO - __main__ - Step 143884: {'lr': 2.1045198438610357e-06, 'samples': 27625728, 'steps': 143883, 'loss/train': 0.9972303509712219} 11/07/2021 17:29:44 - INFO - __main__ - Step 143885: {'lr': 2.1038327778556687e-06, 'samples': 27625920, 'steps': 143884, 'loss/train': 0.9379029870033264} 11/07/2021 17:29:44 - INFO - __main__ - Step 143886: {'lr': 2.1031458235484802e-06, 'samples': 27626112, 'steps': 143885, 'loss/train': 1.0700280666351318} 11/07/2021 17:29:45 - INFO - __main__ - Step 143887: {'lr': 2.102458980939831e-06, 'samples': 27626304, 'steps': 143886, 'loss/train': 0.7858112454414368} 11/07/2021 17:29:46 - INFO - __main__ - Step 143888: {'lr': 2.101772250029943e-06, 'samples': 27626496, 'steps': 143887, 'loss/train': 1.2742360830307007} 11/07/2021 17:29:46 - INFO - __main__ - Step 143889: {'lr': 2.101085630819205e-06, 'samples': 27626688, 'steps': 143888, 'loss/train': 1.2796332836151123} 11/07/2021 17:29:47 - INFO - __main__ - Step 143890: {'lr': 2.1003991233078665e-06, 'samples': 27626880, 'steps': 143889, 'loss/train': 0.6742353439331055} 11/07/2021 17:29:47 - INFO - __main__ - Step 143891: {'lr': 2.099712727496289e-06, 'samples': 27627072, 'steps': 143890, 'loss/train': 0.8630927205085754} 11/07/2021 17:29:47 - INFO - __main__ - Step 143892: {'lr': 2.0990264433847495e-06, 'samples': 27627264, 'steps': 143891, 'loss/train': 1.5895013809204102} 11/07/2021 17:29:48 - INFO - __main__ - Step 143893: {'lr': 2.0983402709735535e-06, 'samples': 27627456, 'steps': 143892, 'loss/train': 0.4222421944141388} 11/07/2021 17:29:49 - INFO - __main__ - Step 143894: {'lr': 2.097654210263006e-06, 'samples': 27627648, 'steps': 143893, 'loss/train': 1.4158179759979248} 11/07/2021 17:29:49 - INFO - __main__ - Step 143895: {'lr': 2.0969682612534678e-06, 'samples': 27627840, 'steps': 143894, 'loss/train': 1.002400517463684} 11/07/2021 17:29:49 - INFO - __main__ - Step 143896: {'lr': 2.0962824239451893e-06, 'samples': 27628032, 'steps': 143895, 'loss/train': 1.6517266035079956} 11/07/2021 17:29:50 - INFO - __main__ - Step 143897: {'lr': 2.0955966983384756e-06, 'samples': 27628224, 'steps': 143896, 'loss/train': 1.3736584186553955} 11/07/2021 17:29:51 - INFO - __main__ - Step 143898: {'lr': 2.0949110844336872e-06, 'samples': 27628416, 'steps': 143897, 'loss/train': 1.103350281715393} 11/07/2021 17:29:51 - INFO - __main__ - Step 143899: {'lr': 2.094225582231102e-06, 'samples': 27628608, 'steps': 143898, 'loss/train': 1.7522133588790894} 11/07/2021 17:29:52 - INFO - __main__ - Step 143900: {'lr': 2.0935401917310527e-06, 'samples': 27628800, 'steps': 143899, 'loss/train': 1.5557652711868286} 11/07/2021 17:29:52 - INFO - __main__ - Step 143901: {'lr': 2.0928549129338172e-06, 'samples': 27628992, 'steps': 143900, 'loss/train': 1.4031250476837158} 11/07/2021 17:29:52 - INFO - __main__ - Step 143902: {'lr': 2.0921697458397005e-06, 'samples': 27629184, 'steps': 143901, 'loss/train': 1.3829395771026611} 11/07/2021 17:29:53 - INFO - __main__ - Step 143903: {'lr': 2.091484690449036e-06, 'samples': 27629376, 'steps': 143902, 'loss/train': 0.8356289863586426} 11/07/2021 17:29:54 - INFO - __main__ - Step 143904: {'lr': 2.0907997467621286e-06, 'samples': 27629568, 'steps': 143903, 'loss/train': 1.8072535991668701} 11/07/2021 17:29:54 - INFO - __main__ - Step 143905: {'lr': 2.090114914779284e-06, 'samples': 27629760, 'steps': 143904, 'loss/train': 1.52945077419281} 11/07/2021 17:29:55 - INFO - __main__ - Step 143906: {'lr': 2.0894301945008077e-06, 'samples': 27629952, 'steps': 143905, 'loss/train': 1.4707252979278564} 11/07/2021 17:29:55 - INFO - __main__ - Step 143907: {'lr': 2.0887455859269764e-06, 'samples': 27630144, 'steps': 143906, 'loss/train': 1.3416599035263062} 11/07/2021 17:29:55 - INFO - __main__ - Step 143908: {'lr': 2.08806108905818e-06, 'samples': 27630336, 'steps': 143907, 'loss/train': 0.9525964856147766} 11/07/2021 17:29:56 - INFO - __main__ - Step 143909: {'lr': 2.087376703894639e-06, 'samples': 27630528, 'steps': 143908, 'loss/train': 1.7369403839111328} 11/07/2021 17:29:57 - INFO - __main__ - Step 143910: {'lr': 2.086692430436715e-06, 'samples': 27630720, 'steps': 143909, 'loss/train': 1.4494478702545166} 11/07/2021 17:29:57 - INFO - __main__ - Step 143911: {'lr': 2.086008268684714e-06, 'samples': 27630912, 'steps': 143910, 'loss/train': 1.4767653942108154} 11/07/2021 17:29:57 - INFO - __main__ - Step 143912: {'lr': 2.085324218638912e-06, 'samples': 27631104, 'steps': 143911, 'loss/train': 1.3711400032043457} 11/07/2021 17:29:58 - INFO - __main__ - Step 143913: {'lr': 2.0846402802996433e-06, 'samples': 27631296, 'steps': 143912, 'loss/train': 1.3170627355575562} 11/07/2021 17:29:58 - INFO - __main__ - Step 143914: {'lr': 2.0839564536672127e-06, 'samples': 27631488, 'steps': 143913, 'loss/train': 0.744035542011261} 11/07/2021 17:30:00 - INFO - __main__ - Step 143915: {'lr': 2.083272738741926e-06, 'samples': 27631680, 'steps': 143914, 'loss/train': 1.2521412372589111} 11/07/2021 17:30:00 - INFO - __main__ - Step 143916: {'lr': 2.082589135524088e-06, 'samples': 27631872, 'steps': 143915, 'loss/train': 0.6087607145309448} 11/07/2021 17:30:00 - INFO - __main__ - Step 143917: {'lr': 2.0819056440140036e-06, 'samples': 27632064, 'steps': 143916, 'loss/train': 1.03335702419281} 11/07/2021 17:30:01 - INFO - __main__ - Step 143918: {'lr': 2.0812222642120072e-06, 'samples': 27632256, 'steps': 143917, 'loss/train': 1.4242922067642212} 11/07/2021 17:30:01 - INFO - __main__ - Step 143919: {'lr': 2.0805389961183752e-06, 'samples': 27632448, 'steps': 143918, 'loss/train': 1.078730821609497} 11/07/2021 17:30:02 - INFO - __main__ - Step 143920: {'lr': 2.079855839733441e-06, 'samples': 27632640, 'steps': 143919, 'loss/train': 1.430338978767395} 11/07/2021 17:30:02 - INFO - __main__ - Step 143921: {'lr': 2.0791727950574822e-06, 'samples': 27632832, 'steps': 143920, 'loss/train': 1.255515694618225} 11/07/2021 17:30:03 - INFO - __main__ - Step 143922: {'lr': 2.0784898620908044e-06, 'samples': 27633024, 'steps': 143921, 'loss/train': 0.7660807967185974} 11/07/2021 17:30:03 - INFO - __main__ - Step 143923: {'lr': 2.077807040833768e-06, 'samples': 27633216, 'steps': 143922, 'loss/train': 0.8315672278404236} 11/07/2021 17:30:03 - INFO - __main__ - Step 143924: {'lr': 2.0771243312866227e-06, 'samples': 27633408, 'steps': 143923, 'loss/train': 1.4449892044067383} 11/07/2021 17:30:04 - INFO - __main__ - Step 143925: {'lr': 2.0764417334497023e-06, 'samples': 27633600, 'steps': 143924, 'loss/train': 1.064063549041748} 11/07/2021 17:30:05 - INFO - __main__ - Step 143926: {'lr': 2.075759247323311e-06, 'samples': 27633792, 'steps': 143925, 'loss/train': 1.4741661548614502} 11/07/2021 17:30:05 - INFO - __main__ - Step 143927: {'lr': 2.075076872907783e-06, 'samples': 27633984, 'steps': 143926, 'loss/train': 0.5536365509033203} 11/07/2021 17:30:05 - INFO - __main__ - Step 143928: {'lr': 2.0743946102033672e-06, 'samples': 27634176, 'steps': 143927, 'loss/train': 1.066349744796753} 11/07/2021 17:30:06 - INFO - __main__ - Step 143929: {'lr': 2.073712459210425e-06, 'samples': 27634368, 'steps': 143928, 'loss/train': 0.9712258577346802} 11/07/2021 17:30:07 - INFO - __main__ - Step 143930: {'lr': 2.073030419929234e-06, 'samples': 27634560, 'steps': 143929, 'loss/train': 1.0954082012176514} 11/07/2021 17:30:07 - INFO - __main__ - Step 143931: {'lr': 2.072348492360099e-06, 'samples': 27634752, 'steps': 143930, 'loss/train': 0.7675206065177917} 11/07/2021 17:30:08 - INFO - __main__ - Step 143932: {'lr': 2.071666676503353e-06, 'samples': 27634944, 'steps': 143931, 'loss/train': 0.9963115453720093} 11/07/2021 17:30:08 - INFO - __main__ - Step 143933: {'lr': 2.0709849723593023e-06, 'samples': 27635136, 'steps': 143932, 'loss/train': 1.2118223905563354} 11/07/2021 17:30:08 - INFO - __main__ - Step 143934: {'lr': 2.0703033799281956e-06, 'samples': 27635328, 'steps': 143933, 'loss/train': 1.1845813989639282} 11/07/2021 17:30:09 - INFO - __main__ - Step 143935: {'lr': 2.069621899210422e-06, 'samples': 27635520, 'steps': 143934, 'loss/train': 1.7382621765136719} 11/07/2021 17:30:10 - INFO - __main__ - Step 143936: {'lr': 2.0689405302062314e-06, 'samples': 27635712, 'steps': 143935, 'loss/train': 1.334816336631775} 11/07/2021 17:30:10 - INFO - __main__ - Step 143937: {'lr': 2.0682592729159567e-06, 'samples': 27635904, 'steps': 143936, 'loss/train': 1.2122118473052979} 11/07/2021 17:30:11 - INFO - __main__ - Step 143938: {'lr': 2.067578127339903e-06, 'samples': 27636096, 'steps': 143937, 'loss/train': 1.64082932472229} 11/07/2021 17:30:11 - INFO - __main__ - Step 143939: {'lr': 2.066897093478376e-06, 'samples': 27636288, 'steps': 143938, 'loss/train': 0.9843822121620178} 11/07/2021 17:30:11 - INFO - __main__ - Step 143940: {'lr': 2.066216171331653e-06, 'samples': 27636480, 'steps': 143939, 'loss/train': 1.439577579498291} 11/07/2021 17:30:12 - INFO - __main__ - Step 143941: {'lr': 2.065535360900095e-06, 'samples': 27636672, 'steps': 143940, 'loss/train': 1.28179132938385} 11/07/2021 17:30:13 - INFO - __main__ - Step 143942: {'lr': 2.0648546621839794e-06, 'samples': 27636864, 'steps': 143941, 'loss/train': 1.3946806192398071} 11/07/2021 17:30:13 - INFO - __main__ - Step 143943: {'lr': 2.0641740751836115e-06, 'samples': 27637056, 'steps': 143942, 'loss/train': 1.3509552478790283} 11/07/2021 17:30:13 - INFO - __main__ - Step 143944: {'lr': 2.0634935998992966e-06, 'samples': 27637248, 'steps': 143943, 'loss/train': 1.7032158374786377} 11/07/2021 17:30:14 - INFO - __main__ - Step 143945: {'lr': 2.06281323633134e-06, 'samples': 27637440, 'steps': 143944, 'loss/train': 1.2934151887893677} 11/07/2021 17:30:15 - INFO - __main__ - Step 143946: {'lr': 2.0621329844800753e-06, 'samples': 27637632, 'steps': 143945, 'loss/train': 1.6175363063812256} 11/07/2021 17:30:15 - INFO - __main__ - Step 143947: {'lr': 2.0614528443457514e-06, 'samples': 27637824, 'steps': 143946, 'loss/train': 1.006910800933838} 11/07/2021 17:30:15 - INFO - __main__ - Step 143948: {'lr': 2.0607728159287297e-06, 'samples': 27638016, 'steps': 143947, 'loss/train': 1.2309329509735107} 11/07/2021 17:30:16 - INFO - __main__ - Step 143949: {'lr': 2.0600928992293157e-06, 'samples': 27638208, 'steps': 143948, 'loss/train': 2.1595256328582764} 11/07/2021 17:30:16 - INFO - __main__ - Step 143950: {'lr': 2.0594130942477863e-06, 'samples': 27638400, 'steps': 143949, 'loss/train': 1.4540621042251587} 11/07/2021 17:30:17 - INFO - __main__ - Step 143951: {'lr': 2.058733400984447e-06, 'samples': 27638592, 'steps': 143950, 'loss/train': 1.1806049346923828} 11/07/2021 17:30:18 - INFO - __main__ - Step 143952: {'lr': 2.058053819439604e-06, 'samples': 27638784, 'steps': 143951, 'loss/train': 1.2315785884857178} 11/07/2021 17:30:18 - INFO - __main__ - Step 143953: {'lr': 2.0573743496136165e-06, 'samples': 27638976, 'steps': 143952, 'loss/train': 1.1854177713394165} 11/07/2021 17:30:18 - INFO - __main__ - Step 143954: {'lr': 2.056694991506708e-06, 'samples': 27639168, 'steps': 143953, 'loss/train': 1.052890658378601} 11/07/2021 17:30:19 - INFO - __main__ - Step 143955: {'lr': 2.056015745119266e-06, 'samples': 27639360, 'steps': 143954, 'loss/train': 1.341360330581665} 11/07/2021 17:30:20 - INFO - __main__ - Step 143956: {'lr': 2.0553366104515415e-06, 'samples': 27639552, 'steps': 143955, 'loss/train': 1.376175880432129} 11/07/2021 17:30:20 - INFO - __main__ - Step 143957: {'lr': 2.0546575875038664e-06, 'samples': 27639744, 'steps': 143956, 'loss/train': 0.6870977282524109} 11/07/2021 17:30:20 - INFO - __main__ - Step 143958: {'lr': 2.053978676276519e-06, 'samples': 27639936, 'steps': 143957, 'loss/train': 1.331527829170227} 11/07/2021 17:30:21 - INFO - __main__ - Step 143959: {'lr': 2.0532998767698317e-06, 'samples': 27640128, 'steps': 143958, 'loss/train': 1.0482659339904785} 11/07/2021 17:30:21 - INFO - __main__ - Step 143960: {'lr': 2.0526211889840827e-06, 'samples': 27640320, 'steps': 143959, 'loss/train': 1.5719660520553589} 11/07/2021 17:30:22 - INFO - __main__ - Step 143961: {'lr': 2.0519426129196328e-06, 'samples': 27640512, 'steps': 143960, 'loss/train': 1.4594060182571411} 11/07/2021 17:30:22 - INFO - __main__ - Step 143962: {'lr': 2.0512641485767313e-06, 'samples': 27640704, 'steps': 143961, 'loss/train': 1.3816603422164917} 11/07/2021 17:30:23 - INFO - __main__ - Step 143963: {'lr': 2.050585795955684e-06, 'samples': 27640896, 'steps': 143962, 'loss/train': 0.9993127584457397} 11/07/2021 17:30:23 - INFO - __main__ - Step 143964: {'lr': 2.049907555056851e-06, 'samples': 27641088, 'steps': 143963, 'loss/train': 1.2019859552383423} 11/07/2021 17:30:23 - INFO - __main__ - Step 143965: {'lr': 2.0492294258804833e-06, 'samples': 27641280, 'steps': 143964, 'loss/train': 1.0634472370147705} 11/07/2021 17:30:24 - INFO - __main__ - Step 143966: {'lr': 2.048551408426913e-06, 'samples': 27641472, 'steps': 143965, 'loss/train': 0.8453394770622253} 11/07/2021 17:30:25 - INFO - __main__ - Step 143967: {'lr': 2.047873502696446e-06, 'samples': 27641664, 'steps': 143966, 'loss/train': 0.9291074275970459} 11/07/2021 17:30:25 - INFO - __main__ - Step 143968: {'lr': 2.0471957086893867e-06, 'samples': 27641856, 'steps': 143967, 'loss/train': 1.4363412857055664} 11/07/2021 17:30:25 - INFO - __main__ - Step 143969: {'lr': 2.0465180264060136e-06, 'samples': 27642048, 'steps': 143968, 'loss/train': 1.141063928604126} 11/07/2021 17:30:26 - INFO - __main__ - Step 143970: {'lr': 2.0458404558466593e-06, 'samples': 27642240, 'steps': 143969, 'loss/train': 1.467926025390625} 11/07/2021 17:30:26 - INFO - __main__ - Step 143971: {'lr': 2.045162997011629e-06, 'samples': 27642432, 'steps': 143970, 'loss/train': 1.427030324935913} 11/07/2021 17:30:27 - INFO - __main__ - Step 143972: {'lr': 2.044485649901201e-06, 'samples': 27642624, 'steps': 143971, 'loss/train': 1.0776954889297485} 11/07/2021 17:30:28 - INFO - __main__ - Step 143973: {'lr': 2.0438084145157355e-06, 'samples': 27642816, 'steps': 143972, 'loss/train': 0.9345765709877014} 11/07/2021 17:30:28 - INFO - __main__ - Step 143974: {'lr': 2.0431312908554823e-06, 'samples': 27643008, 'steps': 143973, 'loss/train': 1.334956169128418} 11/07/2021 17:30:28 - INFO - __main__ - Step 143975: {'lr': 2.0424542789207747e-06, 'samples': 27643200, 'steps': 143974, 'loss/train': 1.4628407955169678} 11/07/2021 17:30:29 - INFO - __main__ - Step 143976: {'lr': 2.0417773787119176e-06, 'samples': 27643392, 'steps': 143975, 'loss/train': 1.4940941333770752} 11/07/2021 17:30:30 - INFO - __main__ - Step 143977: {'lr': 2.041100590229189e-06, 'samples': 27643584, 'steps': 143976, 'loss/train': 1.2342098951339722} 11/07/2021 17:30:30 - INFO - __main__ - Step 143978: {'lr': 2.040423913472922e-06, 'samples': 27643776, 'steps': 143977, 'loss/train': 1.1798903942108154} 11/07/2021 17:30:30 - INFO - __main__ - Step 143979: {'lr': 2.0397473484434213e-06, 'samples': 27643968, 'steps': 143978, 'loss/train': 1.5701402425765991} 11/07/2021 17:30:31 - INFO - __main__ - Step 143980: {'lr': 2.039070895140993e-06, 'samples': 27644160, 'steps': 143979, 'loss/train': 1.791366457939148} 11/07/2021 17:30:31 - INFO - __main__ - Step 143981: {'lr': 2.0383945535659144e-06, 'samples': 27644352, 'steps': 143980, 'loss/train': 1.1508678197860718} 11/07/2021 17:30:32 - INFO - __main__ - Step 143982: {'lr': 2.0377183237185182e-06, 'samples': 27644544, 'steps': 143981, 'loss/train': 1.0436879396438599} 11/07/2021 17:30:33 - INFO - __main__ - Step 143983: {'lr': 2.037042205599082e-06, 'samples': 27644736, 'steps': 143982, 'loss/train': 0.9294248819351196} 11/07/2021 17:30:33 - INFO - __main__ - Step 143984: {'lr': 2.03636619920794e-06, 'samples': 27644928, 'steps': 143983, 'loss/train': 1.137954831123352} 11/07/2021 17:30:33 - INFO - __main__ - Step 143985: {'lr': 2.0356903045453954e-06, 'samples': 27645120, 'steps': 143984, 'loss/train': 1.1720072031021118} 11/07/2021 17:30:34 - INFO - __main__ - Step 143986: {'lr': 2.0350145216117277e-06, 'samples': 27645312, 'steps': 143985, 'loss/train': 1.3132455348968506} 11/07/2021 17:30:35 - INFO - __main__ - Step 143987: {'lr': 2.034338850407297e-06, 'samples': 27645504, 'steps': 143986, 'loss/train': 1.5026861429214478} 11/07/2021 17:30:35 - INFO - __main__ - Step 143988: {'lr': 2.033663290932325e-06, 'samples': 27645696, 'steps': 143987, 'loss/train': 1.16925847530365} 11/07/2021 17:30:35 - INFO - __main__ - Step 143989: {'lr': 2.032987843187173e-06, 'samples': 27645888, 'steps': 143988, 'loss/train': 1.659225344657898} 11/07/2021 17:30:36 - INFO - __main__ - Step 143990: {'lr': 2.032312507172118e-06, 'samples': 27646080, 'steps': 143989, 'loss/train': 1.2443664073944092} 11/07/2021 17:30:36 - INFO - __main__ - Step 143991: {'lr': 2.031637282887494e-06, 'samples': 27646272, 'steps': 143990, 'loss/train': 1.488345980644226} 11/07/2021 17:30:36 - INFO - __main__ - Step 143992: {'lr': 2.03096217033355e-06, 'samples': 27646464, 'steps': 143991, 'loss/train': 2.1136393547058105} 11/07/2021 17:30:38 - INFO - __main__ - Step 143993: {'lr': 2.030287169510675e-06, 'samples': 27646656, 'steps': 143992, 'loss/train': 0.5876056551933289} 11/07/2021 17:30:38 - INFO - __main__ - Step 143994: {'lr': 2.029612280419091e-06, 'samples': 27646848, 'steps': 143993, 'loss/train': 1.4592864513397217} 11/07/2021 17:30:38 - INFO - __main__ - Step 143995: {'lr': 2.0289375030591584e-06, 'samples': 27647040, 'steps': 143994, 'loss/train': 2.2686679363250732} 11/07/2021 17:30:39 - INFO - __main__ - Step 143996: {'lr': 2.0282628374311553e-06, 'samples': 27647232, 'steps': 143995, 'loss/train': 1.3348833322525024} 11/07/2021 17:30:39 - INFO - __main__ - Step 143997: {'lr': 2.0275882835353865e-06, 'samples': 27647424, 'steps': 143996, 'loss/train': 1.3622808456420898} 11/07/2021 17:30:40 - INFO - __main__ - Step 143998: {'lr': 2.0269138413721856e-06, 'samples': 27647616, 'steps': 143997, 'loss/train': 1.4115270376205444} 11/07/2021 17:30:41 - INFO - __main__ - Step 143999: {'lr': 2.026239510941802e-06, 'samples': 27647808, 'steps': 143998, 'loss/train': 0.3270270526409149} 11/07/2021 17:30:41 - INFO - __main__ - Step 144000: {'lr': 2.025565292244569e-06, 'samples': 27648000, 'steps': 143999, 'loss/train': 0.6588061451911926} 11/07/2021 17:30:41 - INFO - __main__ - Step 144001: {'lr': 2.024891185280792e-06, 'samples': 27648192, 'steps': 144000, 'loss/train': 1.0441641807556152} 11/07/2021 17:30:42 - INFO - __main__ - Step 144002: {'lr': 2.0242171900507757e-06, 'samples': 27648384, 'steps': 144001, 'loss/train': 1.3947148323059082} 11/07/2021 17:30:43 - INFO - __main__ - Step 144003: {'lr': 2.023543306554826e-06, 'samples': 27648576, 'steps': 144002, 'loss/train': 1.8080986738204956} 11/07/2021 17:30:43 - INFO - __main__ - Step 144004: {'lr': 2.02286953479322e-06, 'samples': 27648768, 'steps': 144003, 'loss/train': 1.0805810689926147} 11/07/2021 17:30:43 - INFO - __main__ - Step 144005: {'lr': 2.0221958747662916e-06, 'samples': 27648960, 'steps': 144004, 'loss/train': 0.5811890363693237} 11/07/2021 17:30:44 - INFO - __main__ - Step 144006: {'lr': 2.0215223264743456e-06, 'samples': 27649152, 'steps': 144005, 'loss/train': 1.3480310440063477} 11/07/2021 17:30:44 - INFO - __main__ - Step 144007: {'lr': 2.020848889917687e-06, 'samples': 27649344, 'steps': 144006, 'loss/train': 2.3393394947052} 11/07/2021 17:30:45 - INFO - __main__ - Step 144008: {'lr': 2.0201755650965934e-06, 'samples': 27649536, 'steps': 144007, 'loss/train': 1.217755675315857} 11/07/2021 17:30:46 - INFO - __main__ - Step 144009: {'lr': 2.019502352011371e-06, 'samples': 27649728, 'steps': 144008, 'loss/train': 0.4078093469142914} 11/07/2021 17:30:46 - INFO - __main__ - Step 144010: {'lr': 2.018829250662352e-06, 'samples': 27649920, 'steps': 144009, 'loss/train': 1.8352482318878174} 11/07/2021 17:30:46 - INFO - __main__ - Step 144011: {'lr': 2.018156261049814e-06, 'samples': 27650112, 'steps': 144010, 'loss/train': 1.4551242589950562} 11/07/2021 17:30:47 - INFO - __main__ - Step 144012: {'lr': 2.0174833831740624e-06, 'samples': 27650304, 'steps': 144011, 'loss/train': 1.2375253438949585} 11/07/2021 17:30:47 - INFO - __main__ - Step 144013: {'lr': 2.0168106170354028e-06, 'samples': 27650496, 'steps': 144012, 'loss/train': 1.3940938711166382} 11/07/2021 17:30:48 - INFO - __main__ - Step 144014: {'lr': 2.016137962634168e-06, 'samples': 27650688, 'steps': 144013, 'loss/train': 1.5705368518829346} 11/07/2021 17:30:49 - INFO - __main__ - Step 144015: {'lr': 2.015465419970608e-06, 'samples': 27650880, 'steps': 144014, 'loss/train': 0.8508609533309937} 11/07/2021 17:30:49 - INFO - __main__ - Step 144016: {'lr': 2.0147929890450554e-06, 'samples': 27651072, 'steps': 144015, 'loss/train': 1.2127169370651245} 11/07/2021 17:30:49 - INFO - __main__ - Step 144017: {'lr': 2.0141206698578163e-06, 'samples': 27651264, 'steps': 144016, 'loss/train': 1.3433767557144165} 11/07/2021 17:30:50 - INFO - __main__ - Step 144018: {'lr': 2.013448462409195e-06, 'samples': 27651456, 'steps': 144017, 'loss/train': 1.3973287343978882} 11/07/2021 17:30:50 - INFO - __main__ - Step 144019: {'lr': 2.01277636669947e-06, 'samples': 27651648, 'steps': 144018, 'loss/train': 1.9367703199386597} 11/07/2021 17:30:51 - INFO - __main__ - Step 144020: {'lr': 2.012104382728974e-06, 'samples': 27651840, 'steps': 144019, 'loss/train': 1.2357045412063599} 11/07/2021 17:30:52 - INFO - __main__ - Step 144021: {'lr': 2.0114325104980125e-06, 'samples': 27652032, 'steps': 144020, 'loss/train': 1.3026009798049927} 11/07/2021 17:30:52 - INFO - __main__ - Step 144022: {'lr': 2.0107607500068347e-06, 'samples': 27652224, 'steps': 144021, 'loss/train': 1.3414174318313599} 11/07/2021 17:30:52 - INFO - __main__ - Step 144023: {'lr': 2.010089101255802e-06, 'samples': 27652416, 'steps': 144022, 'loss/train': 0.8017565608024597} 11/07/2021 17:30:53 - INFO - __main__ - Step 144024: {'lr': 2.009417564245192e-06, 'samples': 27652608, 'steps': 144023, 'loss/train': 0.9537937641143799} 11/07/2021 17:30:54 - INFO - __main__ - Step 144025: {'lr': 2.008746138975337e-06, 'samples': 27652800, 'steps': 144024, 'loss/train': 1.3077946901321411} 11/07/2021 17:30:54 - INFO - __main__ - Step 144026: {'lr': 2.0080748254464875e-06, 'samples': 27652992, 'steps': 144025, 'loss/train': 1.6741113662719727} 11/07/2021 17:30:54 - INFO - __main__ - Step 144027: {'lr': 2.0074036236589765e-06, 'samples': 27653184, 'steps': 144026, 'loss/train': 1.3646440505981445} 11/07/2021 17:30:55 - INFO - __main__ - Step 144028: {'lr': 2.006732533613109e-06, 'samples': 27653376, 'steps': 144027, 'loss/train': 1.3869882822036743} 11/07/2021 17:30:55 - INFO - __main__ - Step 144029: {'lr': 2.006061555309191e-06, 'samples': 27653568, 'steps': 144028, 'loss/train': 0.7535853385925293} 11/07/2021 17:30:56 - INFO - __main__ - Step 144030: {'lr': 2.0053906887474994e-06, 'samples': 27653760, 'steps': 144029, 'loss/train': 1.3534514904022217} 11/07/2021 17:30:56 - INFO - __main__ - Step 144031: {'lr': 2.0047199339283395e-06, 'samples': 27653952, 'steps': 144030, 'loss/train': 1.3297175168991089} 11/07/2021 17:30:57 - INFO - __main__ - Step 144032: {'lr': 2.0040492908520445e-06, 'samples': 27654144, 'steps': 144031, 'loss/train': 1.5873323678970337} 11/07/2021 17:30:57 - INFO - __main__ - Step 144033: {'lr': 2.0033787595188923e-06, 'samples': 27654336, 'steps': 144032, 'loss/train': 2.016050338745117} 11/07/2021 17:30:57 - INFO - __main__ - Step 144034: {'lr': 2.0027083399291877e-06, 'samples': 27654528, 'steps': 144033, 'loss/train': 1.1170731782913208} 11/07/2021 17:30:58 - INFO - __main__ - Step 144035: {'lr': 2.0020380320832366e-06, 'samples': 27654720, 'steps': 144034, 'loss/train': 1.5649534463882446} 11/07/2021 17:30:59 - INFO - __main__ - Step 144036: {'lr': 2.0013678359813435e-06, 'samples': 27654912, 'steps': 144035, 'loss/train': 1.182050108909607} 11/07/2021 17:30:59 - INFO - __main__ - Step 144037: {'lr': 2.0006977516238145e-06, 'samples': 27655104, 'steps': 144036, 'loss/train': 1.1412463188171387} 11/07/2021 17:31:00 - INFO - __main__ - Step 144038: {'lr': 2.0000277790109267e-06, 'samples': 27655296, 'steps': 144037, 'loss/train': 1.845732569694519} 11/07/2021 17:31:00 - INFO - __main__ - Step 144039: {'lr': 1.999357918143041e-06, 'samples': 27655488, 'steps': 144038, 'loss/train': 0.6528957486152649} 11/07/2021 17:31:00 - INFO - __main__ - Step 144040: {'lr': 1.9986881690203794e-06, 'samples': 27655680, 'steps': 144039, 'loss/train': 1.6353628635406494} 11/07/2021 17:31:01 - INFO - __main__ - Step 144041: {'lr': 1.998018531643275e-06, 'samples': 27655872, 'steps': 144040, 'loss/train': 1.3819903135299683} 11/07/2021 17:31:02 - INFO - __main__ - Step 144042: {'lr': 1.9973490060120613e-06, 'samples': 27656064, 'steps': 144041, 'loss/train': 0.8429622650146484} 11/07/2021 17:31:02 - INFO - __main__ - Step 144043: {'lr': 1.9966795921269876e-06, 'samples': 27656256, 'steps': 144042, 'loss/train': 1.549175500869751} 11/07/2021 17:31:02 - INFO - __main__ - Step 144044: {'lr': 1.9960102899884146e-06, 'samples': 27656448, 'steps': 144043, 'loss/train': 1.2211040258407593} 11/07/2021 17:31:03 - INFO - __main__ - Step 144045: {'lr': 1.9953410995965927e-06, 'samples': 27656640, 'steps': 144044, 'loss/train': 1.3297033309936523} 11/07/2021 17:31:04 - INFO - __main__ - Step 144046: {'lr': 1.9946720209518544e-06, 'samples': 27656832, 'steps': 144045, 'loss/train': 1.2864445447921753} 11/07/2021 17:31:04 - INFO - __main__ - Step 144047: {'lr': 1.9940030540544775e-06, 'samples': 27657024, 'steps': 144046, 'loss/train': 0.939106822013855} 11/07/2021 17:31:05 - INFO - __main__ - Step 144048: {'lr': 1.993334198904795e-06, 'samples': 27657216, 'steps': 144047, 'loss/train': 0.9323468804359436} 11/07/2021 17:31:05 - INFO - __main__ - Step 144049: {'lr': 1.9926654555030564e-06, 'samples': 27657408, 'steps': 144048, 'loss/train': 0.8780410289764404} 11/07/2021 17:31:05 - INFO - __main__ - Step 144050: {'lr': 1.9919968238496233e-06, 'samples': 27657600, 'steps': 144049, 'loss/train': 0.9114017486572266} 11/07/2021 17:31:06 - INFO - __main__ - Step 144051: {'lr': 1.9913283039447727e-06, 'samples': 27657792, 'steps': 144050, 'loss/train': 1.2506053447723389} 11/07/2021 17:31:07 - INFO - __main__ - Step 144052: {'lr': 1.9906598957887822e-06, 'samples': 27657984, 'steps': 144051, 'loss/train': 1.4801340103149414} 11/07/2021 17:31:07 - INFO - __main__ - Step 144053: {'lr': 1.989991599381985e-06, 'samples': 27658176, 'steps': 144052, 'loss/train': 1.4136838912963867} 11/07/2021 17:31:07 - INFO - __main__ - Step 144054: {'lr': 1.9893234147246864e-06, 'samples': 27658368, 'steps': 144053, 'loss/train': 1.4559657573699951} 11/07/2021 17:31:08 - INFO - __main__ - Step 144055: {'lr': 1.988655341817136e-06, 'samples': 27658560, 'steps': 144054, 'loss/train': 1.4581196308135986} 11/07/2021 17:31:08 - INFO - __main__ - Step 144056: {'lr': 1.987987380659695e-06, 'samples': 27658752, 'steps': 144055, 'loss/train': 1.4150804281234741} 11/07/2021 17:31:09 - INFO - __main__ - Step 144057: {'lr': 1.9873195312526405e-06, 'samples': 27658944, 'steps': 144056, 'loss/train': 1.1383553743362427} 11/07/2021 17:31:10 - INFO - __main__ - Step 144058: {'lr': 1.98665179359625e-06, 'samples': 27659136, 'steps': 144057, 'loss/train': 1.2170668840408325} 11/07/2021 17:31:10 - INFO - __main__ - Step 144059: {'lr': 1.9859841676908574e-06, 'samples': 27659328, 'steps': 144058, 'loss/train': 0.9782248735427856} 11/07/2021 17:31:10 - INFO - __main__ - Step 144060: {'lr': 1.9853166535367673e-06, 'samples': 27659520, 'steps': 144059, 'loss/train': 1.29999577999115} 11/07/2021 17:31:11 - INFO - __main__ - Step 144061: {'lr': 1.9846492511342573e-06, 'samples': 27659712, 'steps': 144060, 'loss/train': 1.6666345596313477} 11/07/2021 17:31:12 - INFO - __main__ - Step 144062: {'lr': 1.9839819604836327e-06, 'samples': 27659904, 'steps': 144061, 'loss/train': 1.2836588621139526} 11/07/2021 17:31:12 - INFO - __main__ - Step 144063: {'lr': 1.9833147815851993e-06, 'samples': 27660096, 'steps': 144062, 'loss/train': 1.1511892080307007} 11/07/2021 17:31:12 - INFO - __main__ - Step 144064: {'lr': 1.982647714439262e-06, 'samples': 27660288, 'steps': 144063, 'loss/train': 1.6474019289016724} 11/07/2021 17:31:13 - INFO - __main__ - Step 144065: {'lr': 1.981980759046126e-06, 'samples': 27660480, 'steps': 144064, 'loss/train': 1.1307698488235474} 11/07/2021 17:31:13 - INFO - __main__ - Step 144066: {'lr': 1.981313915406069e-06, 'samples': 27660672, 'steps': 144065, 'loss/train': 1.4344267845153809} 11/07/2021 17:31:13 - INFO - __main__ - Step 144067: {'lr': 1.9806471835194238e-06, 'samples': 27660864, 'steps': 144066, 'loss/train': 1.5741868019104004} 11/07/2021 17:31:14 - INFO - __main__ - Step 144068: {'lr': 1.979980563386441e-06, 'samples': 27661056, 'steps': 144067, 'loss/train': 1.182215690612793} 11/07/2021 17:31:15 - INFO - __main__ - Step 144069: {'lr': 1.9793140550074807e-06, 'samples': 27661248, 'steps': 144068, 'loss/train': 1.260598063468933} 11/07/2021 17:31:15 - INFO - __main__ - Step 144070: {'lr': 1.9786476583827927e-06, 'samples': 27661440, 'steps': 144069, 'loss/train': 1.3587583303451538} 11/07/2021 17:31:16 - INFO - __main__ - Step 144071: {'lr': 1.9779813735127108e-06, 'samples': 27661632, 'steps': 144070, 'loss/train': 1.1863861083984375} 11/07/2021 17:31:16 - INFO - __main__ - Step 144072: {'lr': 1.9773152003975115e-06, 'samples': 27661824, 'steps': 144071, 'loss/train': 1.143251657485962} 11/07/2021 17:31:17 - INFO - __main__ - Step 144073: {'lr': 1.9766491390375285e-06, 'samples': 27662016, 'steps': 144072, 'loss/train': 1.3125383853912354} 11/07/2021 17:31:17 - INFO - __main__ - Step 144074: {'lr': 1.9759831894330114e-06, 'samples': 27662208, 'steps': 144073, 'loss/train': 1.4793472290039062} 11/07/2021 17:31:18 - INFO - __main__ - Step 144075: {'lr': 1.975317351584294e-06, 'samples': 27662400, 'steps': 144074, 'loss/train': 0.09537263959646225} 11/07/2021 17:31:18 - INFO - __main__ - Step 144076: {'lr': 1.9746516254916803e-06, 'samples': 27662592, 'steps': 144075, 'loss/train': 1.4677149057388306} 11/07/2021 17:31:18 - INFO - __main__ - Step 144077: {'lr': 1.9739860111554765e-06, 'samples': 27662784, 'steps': 144076, 'loss/train': 1.0359628200531006} 11/07/2021 17:31:19 - INFO - __main__ - Step 144078: {'lr': 1.973320508575932e-06, 'samples': 27662976, 'steps': 144077, 'loss/train': 1.4546786546707153} 11/07/2021 17:31:20 - INFO - __main__ - Step 144079: {'lr': 1.9726551177534357e-06, 'samples': 27663168, 'steps': 144078, 'loss/train': 0.8222969174385071} 11/07/2021 17:31:20 - INFO - __main__ - Step 144080: {'lr': 1.9719898386881818e-06, 'samples': 27663360, 'steps': 144079, 'loss/train': 1.8140610456466675} 11/07/2021 17:31:20 - INFO - __main__ - Step 144081: {'lr': 1.9713246713805587e-06, 'samples': 27663552, 'steps': 144080, 'loss/train': 1.1493362188339233} 11/07/2021 17:31:21 - INFO - __main__ - Step 144082: {'lr': 1.9706596158308167e-06, 'samples': 27663744, 'steps': 144081, 'loss/train': 1.4944416284561157} 11/07/2021 17:31:22 - INFO - __main__ - Step 144083: {'lr': 1.9699946720392603e-06, 'samples': 27663936, 'steps': 144082, 'loss/train': 1.5365313291549683} 11/07/2021 17:31:22 - INFO - __main__ - Step 144084: {'lr': 1.9693298400061954e-06, 'samples': 27664128, 'steps': 144083, 'loss/train': 0.543809175491333} 11/07/2021 17:31:23 - INFO - __main__ - Step 144085: {'lr': 1.968665119731927e-06, 'samples': 27664320, 'steps': 144084, 'loss/train': 1.1996757984161377} 11/07/2021 17:31:23 - INFO - __main__ - Step 144086: {'lr': 1.968000511216733e-06, 'samples': 27664512, 'steps': 144085, 'loss/train': 1.0994149446487427} 11/07/2021 17:31:23 - INFO - __main__ - Step 144087: {'lr': 1.9673360144609466e-06, 'samples': 27664704, 'steps': 144086, 'loss/train': 1.8572328090667725} 11/07/2021 17:31:24 - INFO - __main__ - Step 144088: {'lr': 1.9666716294648725e-06, 'samples': 27664896, 'steps': 144087, 'loss/train': 0.9232559204101562} 11/07/2021 17:31:25 - INFO - __main__ - Step 144089: {'lr': 1.9660073562287606e-06, 'samples': 27665088, 'steps': 144088, 'loss/train': 1.2423841953277588} 11/07/2021 17:31:25 - INFO - __main__ - Step 144090: {'lr': 1.9653431947529444e-06, 'samples': 27665280, 'steps': 144089, 'loss/train': 1.1183890104293823} 11/07/2021 17:31:25 - INFO - __main__ - Step 144091: {'lr': 1.964679145037729e-06, 'samples': 27665472, 'steps': 144090, 'loss/train': 1.4549620151519775} 11/07/2021 17:31:26 - INFO - __main__ - Step 144092: {'lr': 1.9640152070833916e-06, 'samples': 27665664, 'steps': 144091, 'loss/train': 1.341456413269043} 11/07/2021 17:31:27 - INFO - __main__ - Step 144093: {'lr': 1.963351380890238e-06, 'samples': 27665856, 'steps': 144092, 'loss/train': 1.0778857469558716} 11/07/2021 17:31:27 - INFO - __main__ - Step 144094: {'lr': 1.9626876664585737e-06, 'samples': 27666048, 'steps': 144093, 'loss/train': 1.1603716611862183} 11/07/2021 17:31:28 - INFO - __main__ - Step 144095: {'lr': 1.962024063788703e-06, 'samples': 27666240, 'steps': 144094, 'loss/train': 1.623063087463379} 11/07/2021 17:31:28 - INFO - __main__ - Step 144096: {'lr': 1.9613605728809046e-06, 'samples': 27666432, 'steps': 144095, 'loss/train': 1.3636845350265503} 11/07/2021 17:31:28 - INFO - __main__ - Step 144097: {'lr': 1.9606971937355113e-06, 'samples': 27666624, 'steps': 144096, 'loss/train': 0.548560619354248} 11/07/2021 17:31:29 - INFO - __main__ - Step 144098: {'lr': 1.960033926352772e-06, 'samples': 27666816, 'steps': 144097, 'loss/train': 1.2857036590576172} 11/07/2021 17:31:30 - INFO - __main__ - Step 144099: {'lr': 1.959370770733021e-06, 'samples': 27667008, 'steps': 144098, 'loss/train': 1.3719419240951538} 11/07/2021 17:31:30 - INFO - __main__ - Step 144100: {'lr': 1.958707726876563e-06, 'samples': 27667200, 'steps': 144099, 'loss/train': 1.4368934631347656} 11/07/2021 17:31:31 - INFO - __main__ - Step 144101: {'lr': 1.9580447947836755e-06, 'samples': 27667392, 'steps': 144100, 'loss/train': 1.5308221578598022} 11/07/2021 17:31:31 - INFO - __main__ - Step 144102: {'lr': 1.957381974454664e-06, 'samples': 27667584, 'steps': 144101, 'loss/train': 1.3177191019058228} 11/07/2021 17:31:31 - INFO - __main__ - Step 144103: {'lr': 1.9567192658898337e-06, 'samples': 27667776, 'steps': 144102, 'loss/train': 1.567558765411377} 11/07/2021 17:31:32 - INFO - __main__ - Step 144104: {'lr': 1.956056669089462e-06, 'samples': 27667968, 'steps': 144103, 'loss/train': 2.1863839626312256} 11/07/2021 17:31:33 - INFO - __main__ - Step 144105: {'lr': 1.9553941840538823e-06, 'samples': 27668160, 'steps': 144104, 'loss/train': 0.8408399820327759} 11/07/2021 17:31:33 - INFO - __main__ - Step 144106: {'lr': 1.9547318107834e-06, 'samples': 27668352, 'steps': 144105, 'loss/train': 1.1215788125991821} 11/07/2021 17:31:33 - INFO - __main__ - Step 144107: {'lr': 1.954069549278237e-06, 'samples': 27668544, 'steps': 144106, 'loss/train': 0.8097317814826965} 11/07/2021 17:31:34 - INFO - __main__ - Step 144108: {'lr': 1.953407399538781e-06, 'samples': 27668736, 'steps': 144107, 'loss/train': 1.2738503217697144} 11/07/2021 17:31:35 - INFO - __main__ - Step 144109: {'lr': 1.9527453615652557e-06, 'samples': 27668928, 'steps': 144108, 'loss/train': 1.1042288541793823} 11/07/2021 17:31:35 - INFO - __main__ - Step 144110: {'lr': 1.952083435358021e-06, 'samples': 27669120, 'steps': 144109, 'loss/train': 1.056481122970581} 11/07/2021 17:31:35 - INFO - __main__ - Step 144111: {'lr': 1.951421620917354e-06, 'samples': 27669312, 'steps': 144110, 'loss/train': 1.3962931632995605} 11/07/2021 17:31:36 - INFO - __main__ - Step 144112: {'lr': 1.950759918243533e-06, 'samples': 27669504, 'steps': 144111, 'loss/train': 1.3946664333343506} 11/07/2021 17:31:36 - INFO - __main__ - Step 144113: {'lr': 1.9500983273368635e-06, 'samples': 27669696, 'steps': 144112, 'loss/train': 1.1874381303787231} 11/07/2021 17:31:37 - INFO - __main__ - Step 144114: {'lr': 1.9494368481976778e-06, 'samples': 27669888, 'steps': 144113, 'loss/train': 1.2809299230575562} 11/07/2021 17:31:38 - INFO - __main__ - Step 144115: {'lr': 1.948775480826226e-06, 'samples': 27670080, 'steps': 144114, 'loss/train': 1.3894106149673462} 11/07/2021 17:31:38 - INFO - __main__ - Step 144116: {'lr': 1.948114225222841e-06, 'samples': 27670272, 'steps': 144115, 'loss/train': 1.3210065364837646} 11/07/2021 17:31:38 - INFO - __main__ - Step 144117: {'lr': 1.9474530813878013e-06, 'samples': 27670464, 'steps': 144116, 'loss/train': 0.46088120341300964} 11/07/2021 17:31:39 - INFO - __main__ - Step 144118: {'lr': 1.946792049321411e-06, 'samples': 27670656, 'steps': 144117, 'loss/train': 1.2918208837509155} 11/07/2021 17:31:39 - INFO - __main__ - Step 144119: {'lr': 1.9461311290239757e-06, 'samples': 27670848, 'steps': 144118, 'loss/train': 1.606904149055481} 11/07/2021 17:31:40 - INFO - __main__ - Step 144120: {'lr': 1.945470320495801e-06, 'samples': 27671040, 'steps': 144119, 'loss/train': 0.9705484509468079} 11/07/2021 17:31:40 - INFO - __main__ - Step 144121: {'lr': 1.944809623737137e-06, 'samples': 27671232, 'steps': 144120, 'loss/train': 1.1970950365066528} 11/07/2021 17:31:41 - INFO - __main__ - Step 144122: {'lr': 1.9441490387483442e-06, 'samples': 27671424, 'steps': 144121, 'loss/train': 0.859944760799408} 11/07/2021 17:31:41 - INFO - __main__ - Step 144123: {'lr': 1.9434885655296718e-06, 'samples': 27671616, 'steps': 144122, 'loss/train': 1.3502222299575806} 11/07/2021 17:31:41 - INFO - __main__ - Step 144124: {'lr': 1.9428282040814262e-06, 'samples': 27671808, 'steps': 144123, 'loss/train': 0.6077592372894287} 11/07/2021 17:31:42 - INFO - __main__ - Step 144125: {'lr': 1.9421679544039397e-06, 'samples': 27672000, 'steps': 144124, 'loss/train': 0.6966733932495117} 11/07/2021 17:31:43 - INFO - __main__ - Step 144126: {'lr': 1.941507816497462e-06, 'samples': 27672192, 'steps': 144125, 'loss/train': 1.6184146404266357} 11/07/2021 17:31:43 - INFO - __main__ - Step 144127: {'lr': 1.940847790362327e-06, 'samples': 27672384, 'steps': 144126, 'loss/train': 0.586027204990387} 11/07/2021 17:31:44 - INFO - __main__ - Step 144128: {'lr': 1.9401878759988113e-06, 'samples': 27672576, 'steps': 144127, 'loss/train': 1.3462728261947632} 11/07/2021 17:31:44 - INFO - __main__ - Step 144129: {'lr': 1.9395280734072207e-06, 'samples': 27672768, 'steps': 144128, 'loss/train': 1.0627954006195068} 11/07/2021 17:31:45 - INFO - __main__ - Step 144130: {'lr': 1.9388683825878604e-06, 'samples': 27672960, 'steps': 144129, 'loss/train': 1.4034380912780762} 11/07/2021 17:31:45 - INFO - __main__ - Step 144131: {'lr': 1.938208803541008e-06, 'samples': 27673152, 'steps': 144130, 'loss/train': 0.8902574777603149} 11/07/2021 17:31:46 - INFO - __main__ - Step 144132: {'lr': 1.9375493362669694e-06, 'samples': 27673344, 'steps': 144131, 'loss/train': 1.335784912109375} 11/07/2021 17:31:46 - INFO - __main__ - Step 144133: {'lr': 1.936889980766049e-06, 'samples': 27673536, 'steps': 144132, 'loss/train': 1.389816164970398} 11/07/2021 17:31:46 - INFO - __main__ - Step 144134: {'lr': 1.9362307370385525e-06, 'samples': 27673728, 'steps': 144133, 'loss/train': 1.327539086341858} 11/07/2021 17:31:47 - INFO - __main__ - Step 144135: {'lr': 1.9355716050847295e-06, 'samples': 27673920, 'steps': 144134, 'loss/train': 1.4385849237442017} 11/07/2021 17:31:48 - INFO - __main__ - Step 144136: {'lr': 1.934912584904941e-06, 'samples': 27674112, 'steps': 144135, 'loss/train': 1.3362517356872559} 11/07/2021 17:31:48 - INFO - __main__ - Step 144137: {'lr': 1.934253676499437e-06, 'samples': 27674304, 'steps': 144136, 'loss/train': 1.3180328607559204} 11/07/2021 17:31:49 - INFO - __main__ - Step 144138: {'lr': 1.933594879868522e-06, 'samples': 27674496, 'steps': 144137, 'loss/train': 1.612847089767456} 11/07/2021 17:31:49 - INFO - __main__ - Step 144139: {'lr': 1.9329361950125025e-06, 'samples': 27674688, 'steps': 144138, 'loss/train': 1.146270990371704} 11/07/2021 17:31:50 - INFO - __main__ - Step 144140: {'lr': 1.932277621931683e-06, 'samples': 27674880, 'steps': 144139, 'loss/train': 1.1475903987884521} 11/07/2021 17:31:50 - INFO - __main__ - Step 144141: {'lr': 1.931619160626341e-06, 'samples': 27675072, 'steps': 144140, 'loss/train': 1.434301733970642} 11/07/2021 17:31:51 - INFO - __main__ - Step 144142: {'lr': 1.9309608110968104e-06, 'samples': 27675264, 'steps': 144141, 'loss/train': 1.8027033805847168} 11/07/2021 17:31:51 - INFO - __main__ - Step 144143: {'lr': 1.93030257334334e-06, 'samples': 27675456, 'steps': 144142, 'loss/train': 1.1384532451629639} 11/07/2021 17:31:51 - INFO - __main__ - Step 144144: {'lr': 1.9296444473662356e-06, 'samples': 27675648, 'steps': 144143, 'loss/train': 0.9990177750587463} 11/07/2021 17:31:52 - INFO - __main__ - Step 144145: {'lr': 1.9289864331658303e-06, 'samples': 27675840, 'steps': 144144, 'loss/train': 1.4380022287368774} 11/07/2021 17:31:53 - INFO - __main__ - Step 144146: {'lr': 1.9283285307424013e-06, 'samples': 27676032, 'steps': 144145, 'loss/train': 1.2600034475326538} 11/07/2021 17:31:53 - INFO - __main__ - Step 144147: {'lr': 1.9276707400962267e-06, 'samples': 27676224, 'steps': 144146, 'loss/train': 1.4672572612762451} 11/07/2021 17:31:53 - INFO - __main__ - Step 144148: {'lr': 1.9270130612276115e-06, 'samples': 27676416, 'steps': 144147, 'loss/train': 1.511178970336914} 11/07/2021 17:31:54 - INFO - __main__ - Step 144149: {'lr': 1.926355494136861e-06, 'samples': 27676608, 'steps': 144148, 'loss/train': 1.4101839065551758} 11/07/2021 17:31:54 - INFO - __main__ - Step 144150: {'lr': 1.9256980388242528e-06, 'samples': 27676800, 'steps': 144149, 'loss/train': 1.2275145053863525} 11/07/2021 17:31:55 - INFO - __main__ - Step 144151: {'lr': 1.92504069529012e-06, 'samples': 27676992, 'steps': 144150, 'loss/train': 1.3383018970489502} 11/07/2021 17:31:56 - INFO - __main__ - Step 144152: {'lr': 1.9243834635347124e-06, 'samples': 27677184, 'steps': 144151, 'loss/train': 0.9491932392120361} 11/07/2021 17:31:56 - INFO - __main__ - Step 144153: {'lr': 1.923726343558363e-06, 'samples': 27677376, 'steps': 144152, 'loss/train': 1.3622533082962036} 11/07/2021 17:31:57 - INFO - __main__ - Step 144154: {'lr': 1.9230693353613494e-06, 'samples': 27677568, 'steps': 144153, 'loss/train': 1.4292558431625366} 11/07/2021 17:31:57 - INFO - __main__ - Step 144155: {'lr': 1.9224124389439767e-06, 'samples': 27677760, 'steps': 144154, 'loss/train': 2.2511627674102783} 11/07/2021 17:31:58 - INFO - __main__ - Step 144156: {'lr': 1.9217556543065508e-06, 'samples': 27677952, 'steps': 144155, 'loss/train': 1.2927778959274292} 11/07/2021 17:31:58 - INFO - __main__ - Step 144157: {'lr': 1.9210989814493206e-06, 'samples': 27678144, 'steps': 144156, 'loss/train': 1.381818413734436} 11/07/2021 17:31:59 - INFO - __main__ - Step 144158: {'lr': 1.920442420372648e-06, 'samples': 27678336, 'steps': 144157, 'loss/train': 1.0683321952819824} 11/07/2021 17:31:59 - INFO - __main__ - Step 144159: {'lr': 1.9197859710767817e-06, 'samples': 27678528, 'steps': 144158, 'loss/train': 1.1136220693588257} 11/07/2021 17:31:59 - INFO - __main__ - Step 144160: {'lr': 1.9191296335620277e-06, 'samples': 27678720, 'steps': 144159, 'loss/train': 1.6278458833694458} 11/07/2021 17:32:01 - INFO - __main__ - Step 144161: {'lr': 1.9184734078286914e-06, 'samples': 27678912, 'steps': 144160, 'loss/train': 1.6260735988616943} 11/07/2021 17:32:01 - INFO - __main__ - Step 144162: {'lr': 1.9178172938770777e-06, 'samples': 27679104, 'steps': 144161, 'loss/train': 1.1647343635559082} 11/07/2021 17:32:01 - INFO - __main__ - Step 144163: {'lr': 1.9171612917074368e-06, 'samples': 27679296, 'steps': 144162, 'loss/train': 1.200454592704773} 11/07/2021 17:32:02 - INFO - __main__ - Step 144164: {'lr': 1.916505401320101e-06, 'samples': 27679488, 'steps': 144163, 'loss/train': 1.2434659004211426} 11/07/2021 17:32:02 - INFO - __main__ - Step 144165: {'lr': 1.9158496227153767e-06, 'samples': 27679680, 'steps': 144164, 'loss/train': 1.3151397705078125} 11/07/2021 17:32:02 - INFO - __main__ - Step 144166: {'lr': 1.9151939558935407e-06, 'samples': 27679872, 'steps': 144165, 'loss/train': 1.1884194612503052} 11/07/2021 17:32:03 - INFO - __main__ - Step 144167: {'lr': 1.9145384008548706e-06, 'samples': 27680064, 'steps': 144166, 'loss/train': 1.2966421842575073} 11/07/2021 17:32:04 - INFO - __main__ - Step 144168: {'lr': 1.9138829575996997e-06, 'samples': 27680256, 'steps': 144167, 'loss/train': 1.1278555393218994} 11/07/2021 17:32:04 - INFO - __main__ - Step 144169: {'lr': 1.9132276261283054e-06, 'samples': 27680448, 'steps': 144168, 'loss/train': 1.6518298387527466} 11/07/2021 17:32:04 - INFO - __main__ - Step 144170: {'lr': 1.9125724064409655e-06, 'samples': 27680640, 'steps': 144169, 'loss/train': 0.7687417268753052} 11/07/2021 17:32:05 - INFO - __main__ - Step 144171: {'lr': 1.911917298538013e-06, 'samples': 27680832, 'steps': 144170, 'loss/train': 1.856581449508667} 11/07/2021 17:32:05 - INFO - __main__ - Step 144172: {'lr': 1.9112623024196973e-06, 'samples': 27681024, 'steps': 144171, 'loss/train': 1.2064249515533447} 11/07/2021 17:32:06 - INFO - __main__ - Step 144173: {'lr': 1.9106074180863796e-06, 'samples': 27681216, 'steps': 144172, 'loss/train': 1.0612895488739014} 11/07/2021 17:32:07 - INFO - __main__ - Step 144174: {'lr': 1.9099526455382822e-06, 'samples': 27681408, 'steps': 144173, 'loss/train': 1.2930049896240234} 11/07/2021 17:32:07 - INFO - __main__ - Step 144175: {'lr': 1.9092979847757373e-06, 'samples': 27681600, 'steps': 144174, 'loss/train': 1.0729724168777466} 11/07/2021 17:32:07 - INFO - __main__ - Step 144176: {'lr': 1.9086434357990235e-06, 'samples': 27681792, 'steps': 144175, 'loss/train': 0.9932098984718323} 11/07/2021 17:32:08 - INFO - __main__ - Step 144177: {'lr': 1.9079889986084453e-06, 'samples': 27681984, 'steps': 144176, 'loss/train': 1.0782755613327026} 11/07/2021 17:32:09 - INFO - __main__ - Step 144178: {'lr': 1.9073346732043361e-06, 'samples': 27682176, 'steps': 144177, 'loss/train': 1.4646881818771362} 11/07/2021 17:32:09 - INFO - __main__ - Step 144179: {'lr': 1.906680459586918e-06, 'samples': 27682368, 'steps': 144178, 'loss/train': 1.337709903717041} 11/07/2021 17:32:10 - INFO - __main__ - Step 144180: {'lr': 1.906026357756524e-06, 'samples': 27682560, 'steps': 144179, 'loss/train': 0.07029043138027191} 11/07/2021 17:32:10 - INFO - __main__ - Step 144181: {'lr': 1.9053723677134593e-06, 'samples': 27682752, 'steps': 144180, 'loss/train': 1.9519360065460205} 11/07/2021 17:32:10 - INFO - __main__ - Step 144182: {'lr': 1.9047184894580017e-06, 'samples': 27682944, 'steps': 144181, 'loss/train': 1.2304840087890625} 11/07/2021 17:32:12 - INFO - __main__ - Step 144183: {'lr': 1.9040647229904562e-06, 'samples': 27683136, 'steps': 144182, 'loss/train': 1.0584392547607422} 11/07/2021 17:32:12 - INFO - __main__ - Step 144184: {'lr': 1.9034110683111006e-06, 'samples': 27683328, 'steps': 144183, 'loss/train': 1.5150648355484009} 11/07/2021 17:32:12 - INFO - __main__ - Step 144185: {'lr': 1.9027575254202401e-06, 'samples': 27683520, 'steps': 144184, 'loss/train': 0.9743837714195251} 11/07/2021 17:32:13 - INFO - __main__ - Step 144186: {'lr': 1.9021040943181521e-06, 'samples': 27683712, 'steps': 144185, 'loss/train': 1.1706994771957397} 11/07/2021 17:32:13 - INFO - __main__ - Step 144187: {'lr': 1.901450775005198e-06, 'samples': 27683904, 'steps': 144186, 'loss/train': 0.29517897963523865} 11/07/2021 17:32:14 - INFO - __main__ - Step 144188: {'lr': 1.9007975674815713e-06, 'samples': 27684096, 'steps': 144187, 'loss/train': 1.5496525764465332} 11/07/2021 17:32:15 - INFO - __main__ - Step 144189: {'lr': 1.9001444717476335e-06, 'samples': 27684288, 'steps': 144188, 'loss/train': 1.4100197553634644} 11/07/2021 17:32:15 - INFO - __main__ - Step 144190: {'lr': 1.8994914878036618e-06, 'samples': 27684480, 'steps': 144189, 'loss/train': 1.4605528116226196} 11/07/2021 17:32:15 - INFO - __main__ - Step 144191: {'lr': 1.8988386156499616e-06, 'samples': 27684672, 'steps': 144190, 'loss/train': 1.2637293338775635} 11/07/2021 17:32:16 - INFO - __main__ - Step 144192: {'lr': 1.8981858552868104e-06, 'samples': 27684864, 'steps': 144191, 'loss/train': 0.8580414652824402} 11/07/2021 17:32:17 - INFO - __main__ - Step 144193: {'lr': 1.8975332067145134e-06, 'samples': 27685056, 'steps': 144192, 'loss/train': 1.2319691181182861} 11/07/2021 17:32:17 - INFO - __main__ - Step 144194: {'lr': 1.8968806699333484e-06, 'samples': 27685248, 'steps': 144193, 'loss/train': 1.2589216232299805} 11/07/2021 17:32:17 - INFO - __main__ - Step 144195: {'lr': 1.8962282449436209e-06, 'samples': 27685440, 'steps': 144194, 'loss/train': 1.2836509943008423} 11/07/2021 17:32:18 - INFO - __main__ - Step 144196: {'lr': 1.8955759317456079e-06, 'samples': 27685632, 'steps': 144195, 'loss/train': 1.559923529624939} 11/07/2021 17:32:18 - INFO - __main__ - Step 144197: {'lr': 1.8949237303396428e-06, 'samples': 27685824, 'steps': 144196, 'loss/train': 1.71435546875} 11/07/2021 17:32:19 - INFO - __main__ - Step 144198: {'lr': 1.894271640726003e-06, 'samples': 27686016, 'steps': 144197, 'loss/train': 0.7932050228118896} 11/07/2021 17:32:20 - INFO - __main__ - Step 144199: {'lr': 1.8936196629049663e-06, 'samples': 27686208, 'steps': 144198, 'loss/train': 5.655589580535889} 11/07/2021 17:32:20 - INFO - __main__ - Step 144200: {'lr': 1.8929677968768377e-06, 'samples': 27686400, 'steps': 144199, 'loss/train': 0.9229701161384583} 11/07/2021 17:32:20 - INFO - __main__ - Step 144201: {'lr': 1.892316042641923e-06, 'samples': 27686592, 'steps': 144200, 'loss/train': 1.7844678163528442} 11/07/2021 17:32:21 - INFO - __main__ - Step 144202: {'lr': 1.8916644002004712e-06, 'samples': 27686784, 'steps': 144201, 'loss/train': 0.6073979735374451} 11/07/2021 17:32:21 - INFO - __main__ - Step 144203: {'lr': 1.8910128695528162e-06, 'samples': 27686976, 'steps': 144202, 'loss/train': 1.0281171798706055} 11/07/2021 17:32:22 - INFO - __main__ - Step 144204: {'lr': 1.8903614506992628e-06, 'samples': 27687168, 'steps': 144203, 'loss/train': 1.4211711883544922} 11/07/2021 17:32:23 - INFO - __main__ - Step 144205: {'lr': 1.889710143640061e-06, 'samples': 27687360, 'steps': 144204, 'loss/train': 1.6768646240234375} 11/07/2021 17:32:23 - INFO - __main__ - Step 144206: {'lr': 1.889058948375544e-06, 'samples': 27687552, 'steps': 144205, 'loss/train': 1.3090821504592896} 11/07/2021 17:32:23 - INFO - __main__ - Step 144207: {'lr': 1.888407864905961e-06, 'samples': 27687744, 'steps': 144206, 'loss/train': 1.4703946113586426} 11/07/2021 17:32:24 - INFO - __main__ - Step 144208: {'lr': 1.8877568932316736e-06, 'samples': 27687936, 'steps': 144207, 'loss/train': 0.948702871799469} 11/07/2021 17:32:25 - INFO - __main__ - Step 144209: {'lr': 1.8871060333529032e-06, 'samples': 27688128, 'steps': 144208, 'loss/train': 1.1216926574707031} 11/07/2021 17:32:25 - INFO - __main__ - Step 144210: {'lr': 1.8864552852699835e-06, 'samples': 27688320, 'steps': 144209, 'loss/train': 1.2323590517044067} 11/07/2021 17:32:25 - INFO - __main__ - Step 144211: {'lr': 1.8858046489831915e-06, 'samples': 27688512, 'steps': 144210, 'loss/train': 1.321726679801941} 11/07/2021 17:32:26 - INFO - __main__ - Step 144212: {'lr': 1.8851541244928328e-06, 'samples': 27688704, 'steps': 144211, 'loss/train': 1.5834769010543823} 11/07/2021 17:32:26 - INFO - __main__ - Step 144213: {'lr': 1.8845037117992126e-06, 'samples': 27688896, 'steps': 144212, 'loss/train': 0.9273300170898438} 11/07/2021 17:32:27 - INFO - __main__ - Step 144214: {'lr': 1.8838534109025806e-06, 'samples': 27689088, 'steps': 144213, 'loss/train': 1.249049186706543} 11/07/2021 17:32:28 - INFO - __main__ - Step 144215: {'lr': 1.88320322180327e-06, 'samples': 27689280, 'steps': 144214, 'loss/train': 1.1065329313278198} 11/07/2021 17:32:28 - INFO - __main__ - Step 144216: {'lr': 1.8825531445015588e-06, 'samples': 27689472, 'steps': 144215, 'loss/train': 0.8382784128189087} 11/07/2021 17:32:28 - INFO - __main__ - Step 144217: {'lr': 1.8819031789977237e-06, 'samples': 27689664, 'steps': 144216, 'loss/train': 1.2966338396072388} 11/07/2021 17:32:29 - INFO - __main__ - Step 144218: {'lr': 1.8812533252920983e-06, 'samples': 27689856, 'steps': 144217, 'loss/train': 0.7612354159355164} 11/07/2021 17:32:30 - INFO - __main__ - Step 144219: {'lr': 1.8806035833849322e-06, 'samples': 27690048, 'steps': 144218, 'loss/train': 0.937942385673523} 11/07/2021 17:32:30 - INFO - __main__ - Step 144220: {'lr': 1.8799539532765585e-06, 'samples': 27690240, 'steps': 144219, 'loss/train': 1.3822743892669678} 11/07/2021 17:32:31 - INFO - __main__ - Step 144221: {'lr': 1.8793044349672273e-06, 'samples': 27690432, 'steps': 144220, 'loss/train': 1.2295126914978027} 11/07/2021 17:32:31 - INFO - __main__ - Step 144222: {'lr': 1.8786550284572712e-06, 'samples': 27690624, 'steps': 144221, 'loss/train': 1.2567269802093506} 11/07/2021 17:32:32 - INFO - __main__ - Step 144223: {'lr': 1.878005733746968e-06, 'samples': 27690816, 'steps': 144222, 'loss/train': 1.631124496459961} 11/07/2021 17:32:32 - INFO - __main__ - Step 144224: {'lr': 1.8773565508365953e-06, 'samples': 27691008, 'steps': 144223, 'loss/train': 1.5813044309616089} 11/07/2021 17:32:33 - INFO - __main__ - Step 144225: {'lr': 1.8767074797264306e-06, 'samples': 27691200, 'steps': 144224, 'loss/train': 1.2420566082000732} 11/07/2021 17:32:33 - INFO - __main__ - Step 144226: {'lr': 1.8760585204168345e-06, 'samples': 27691392, 'steps': 144225, 'loss/train': 1.2759220600128174} 11/07/2021 17:32:34 - INFO - __main__ - Step 144227: {'lr': 1.8754096729080295e-06, 'samples': 27691584, 'steps': 144226, 'loss/train': 0.8839136362075806} 11/07/2021 17:32:34 - INFO - __main__ - Step 144228: {'lr': 1.8747609372003482e-06, 'samples': 27691776, 'steps': 144227, 'loss/train': 0.8778852224349976} 11/07/2021 17:32:34 - INFO - __main__ - Step 144229: {'lr': 1.8741123132940685e-06, 'samples': 27691968, 'steps': 144228, 'loss/train': 1.127188801765442} 11/07/2021 17:32:35 - INFO - __main__ - Step 144230: {'lr': 1.8734638011894955e-06, 'samples': 27692160, 'steps': 144229, 'loss/train': 1.4516292810440063} 11/07/2021 17:32:36 - INFO - __main__ - Step 144231: {'lr': 1.8728154008868791e-06, 'samples': 27692352, 'steps': 144230, 'loss/train': 1.6712236404418945} 11/07/2021 17:32:36 - INFO - __main__ - Step 144232: {'lr': 1.8721671123865803e-06, 'samples': 27692544, 'steps': 144231, 'loss/train': 1.2728092670440674} 11/07/2021 17:32:36 - INFO - __main__ - Step 144233: {'lr': 1.8715189356888207e-06, 'samples': 27692736, 'steps': 144232, 'loss/train': 1.5479998588562012} 11/07/2021 17:32:37 - INFO - __main__ - Step 144234: {'lr': 1.8708708707939614e-06, 'samples': 27692928, 'steps': 144233, 'loss/train': 1.417454719543457} 11/07/2021 17:32:38 - INFO - __main__ - Step 144235: {'lr': 1.8702229177022523e-06, 'samples': 27693120, 'steps': 144234, 'loss/train': 1.3222920894622803} 11/07/2021 17:32:38 - INFO - __main__ - Step 144236: {'lr': 1.8695750764139707e-06, 'samples': 27693312, 'steps': 144235, 'loss/train': 1.5662468671798706} 11/07/2021 17:32:38 - INFO - __main__ - Step 144237: {'lr': 1.8689273469294498e-06, 'samples': 27693504, 'steps': 144236, 'loss/train': 0.8370186686515808} 11/07/2021 17:32:39 - INFO - __main__ - Step 144238: {'lr': 1.8682797292489396e-06, 'samples': 27693696, 'steps': 144237, 'loss/train': 1.5249193906784058} 11/07/2021 17:32:39 - INFO - __main__ - Step 144239: {'lr': 1.8676322233727727e-06, 'samples': 27693888, 'steps': 144238, 'loss/train': 1.022531509399414} 11/07/2021 17:32:40 - INFO - __main__ - Step 144240: {'lr': 1.8669848293011992e-06, 'samples': 27694080, 'steps': 144239, 'loss/train': 1.1275891065597534} 11/07/2021 17:32:41 - INFO - __main__ - Step 144241: {'lr': 1.8663375470345523e-06, 'samples': 27694272, 'steps': 144240, 'loss/train': 1.5174036026000977} 11/07/2021 17:32:41 - INFO - __main__ - Step 144242: {'lr': 1.8656903765731093e-06, 'samples': 27694464, 'steps': 144241, 'loss/train': 1.416550636291504} 11/07/2021 17:32:41 - INFO - __main__ - Step 144243: {'lr': 1.8650433179171478e-06, 'samples': 27694656, 'steps': 144242, 'loss/train': 1.3511954545974731} 11/07/2021 17:32:42 - INFO - __main__ - Step 144244: {'lr': 1.8643963710669732e-06, 'samples': 27694848, 'steps': 144243, 'loss/train': 0.7743146419525146} 11/07/2021 17:32:42 - INFO - __main__ - Step 144245: {'lr': 1.8637495360228906e-06, 'samples': 27695040, 'steps': 144244, 'loss/train': 0.9774690866470337} 11/07/2021 17:32:43 - INFO - __main__ - Step 144246: {'lr': 1.8631028127851502e-06, 'samples': 27695232, 'steps': 144245, 'loss/train': 1.3275688886642456} 11/07/2021 17:32:43 - INFO - __main__ - Step 144247: {'lr': 1.8624562013540568e-06, 'samples': 27695424, 'steps': 144246, 'loss/train': 1.2925418615341187} 11/07/2021 17:32:44 - INFO - __main__ - Step 144248: {'lr': 1.8618097017299163e-06, 'samples': 27695616, 'steps': 144247, 'loss/train': 1.359363079071045} 11/07/2021 17:32:44 - INFO - __main__ - Step 144249: {'lr': 1.8611633139130334e-06, 'samples': 27695808, 'steps': 144248, 'loss/train': 1.0731548070907593} 11/07/2021 17:32:44 - INFO - __main__ - Step 144250: {'lr': 1.8605170379036585e-06, 'samples': 27696000, 'steps': 144249, 'loss/train': 1.2695887088775635} 11/07/2021 17:32:46 - INFO - __main__ - Step 144251: {'lr': 1.8598708737021241e-06, 'samples': 27696192, 'steps': 144250, 'loss/train': 0.986497163772583} 11/07/2021 17:32:46 - INFO - __main__ - Step 144252: {'lr': 1.8592248213087082e-06, 'samples': 27696384, 'steps': 144251, 'loss/train': 0.11909870058298111} 11/07/2021 17:32:46 - INFO - __main__ - Step 144253: {'lr': 1.8585788807236881e-06, 'samples': 27696576, 'steps': 144252, 'loss/train': 1.1261765956878662} 11/07/2021 17:32:47 - INFO - __main__ - Step 144254: {'lr': 1.8579330519473415e-06, 'samples': 27696768, 'steps': 144253, 'loss/train': 1.640479564666748} 11/07/2021 17:32:47 - INFO - __main__ - Step 144255: {'lr': 1.8572873349800012e-06, 'samples': 27696960, 'steps': 144254, 'loss/train': 1.7700443267822266} 11/07/2021 17:32:48 - INFO - __main__ - Step 144256: {'lr': 1.8566417298219451e-06, 'samples': 27697152, 'steps': 144255, 'loss/train': 1.394751787185669} 11/07/2021 17:32:49 - INFO - __main__ - Step 144257: {'lr': 1.8559962364734505e-06, 'samples': 27697344, 'steps': 144256, 'loss/train': 1.3319859504699707} 11/07/2021 17:32:49 - INFO - __main__ - Step 144258: {'lr': 1.8553508549348231e-06, 'samples': 27697536, 'steps': 144257, 'loss/train': 1.0845638513565063} 11/07/2021 17:32:49 - INFO - __main__ - Step 144259: {'lr': 1.85470558520634e-06, 'samples': 27697728, 'steps': 144258, 'loss/train': 1.4465361833572388} 11/07/2021 17:32:50 - INFO - __main__ - Step 144260: {'lr': 1.8540604272882789e-06, 'samples': 27697920, 'steps': 144259, 'loss/train': 1.7070385217666626} 11/07/2021 17:32:51 - INFO - __main__ - Step 144261: {'lr': 1.853415381180973e-06, 'samples': 27698112, 'steps': 144260, 'loss/train': 0.4533693492412567} 11/07/2021 17:32:51 - INFO - __main__ - Step 144262: {'lr': 1.8527704468846717e-06, 'samples': 27698304, 'steps': 144261, 'loss/train': 0.8726610541343689} 11/07/2021 17:32:51 - INFO - __main__ - Step 144263: {'lr': 1.852125624399681e-06, 'samples': 27698496, 'steps': 144262, 'loss/train': 1.0741199254989624} 11/07/2021 17:32:52 - INFO - __main__ - Step 144264: {'lr': 1.8514809137263056e-06, 'samples': 27698688, 'steps': 144263, 'loss/train': 1.3194926977157593} 11/07/2021 17:32:52 - INFO - __main__ - Step 144265: {'lr': 1.8508363148648233e-06, 'samples': 27698880, 'steps': 144264, 'loss/train': 1.373850703239441} 11/07/2021 17:32:53 - INFO - __main__ - Step 144266: {'lr': 1.8501918278155394e-06, 'samples': 27699072, 'steps': 144265, 'loss/train': 1.3115967512130737} 11/07/2021 17:32:53 - INFO - __main__ - Step 144267: {'lr': 1.8495474525787037e-06, 'samples': 27699264, 'steps': 144266, 'loss/train': 1.267182469367981} 11/07/2021 17:32:54 - INFO - __main__ - Step 144268: {'lr': 1.8489031891546492e-06, 'samples': 27699456, 'steps': 144267, 'loss/train': 0.8442133069038391} 11/07/2021 17:32:54 - INFO - __main__ - Step 144269: {'lr': 1.8482590375436536e-06, 'samples': 27699648, 'steps': 144268, 'loss/train': 0.7892988324165344} 11/07/2021 17:32:54 - INFO - __main__ - Step 144270: {'lr': 1.8476149977459944e-06, 'samples': 27699840, 'steps': 144269, 'loss/train': 1.7786891460418701} 11/07/2021 17:32:56 - INFO - __main__ - Step 144271: {'lr': 1.8469710697619492e-06, 'samples': 27700032, 'steps': 144270, 'loss/train': 1.436526894569397} 11/07/2021 17:32:56 - INFO - __main__ - Step 144272: {'lr': 1.8463272535918508e-06, 'samples': 27700224, 'steps': 144271, 'loss/train': 1.6616820096969604} 11/07/2021 17:32:56 - INFO - __main__ - Step 144273: {'lr': 1.845683549235977e-06, 'samples': 27700416, 'steps': 144272, 'loss/train': 1.5083422660827637} 11/07/2021 17:32:57 - INFO - __main__ - Step 144274: {'lr': 1.8450399566946051e-06, 'samples': 27700608, 'steps': 144273, 'loss/train': 1.380321979522705} 11/07/2021 17:32:57 - INFO - __main__ - Step 144275: {'lr': 1.8443964759680133e-06, 'samples': 27700800, 'steps': 144274, 'loss/train': 0.7761214375495911} 11/07/2021 17:32:57 - INFO - __main__ - Step 144276: {'lr': 1.843753107056506e-06, 'samples': 27700992, 'steps': 144275, 'loss/train': 1.1702499389648438} 11/07/2021 17:32:58 - INFO - __main__ - Step 144277: {'lr': 1.8431098499603893e-06, 'samples': 27701184, 'steps': 144276, 'loss/train': 1.4231586456298828} 11/07/2021 17:32:59 - INFO - __main__ - Step 144278: {'lr': 1.8424667046799403e-06, 'samples': 27701376, 'steps': 144277, 'loss/train': 1.1455525159835815} 11/07/2021 17:32:59 - INFO - __main__ - Step 144279: {'lr': 1.8418236712154368e-06, 'samples': 27701568, 'steps': 144278, 'loss/train': 1.4600285291671753} 11/07/2021 17:33:00 - INFO - __main__ - Step 144280: {'lr': 1.841180749567184e-06, 'samples': 27701760, 'steps': 144279, 'loss/train': 0.7044113874435425} 11/07/2021 17:33:00 - INFO - __main__ - Step 144281: {'lr': 1.8405379397354593e-06, 'samples': 27701952, 'steps': 144280, 'loss/train': 1.3655062913894653} 11/07/2021 17:33:01 - INFO - __main__ - Step 144282: {'lr': 1.8398952417205683e-06, 'samples': 27702144, 'steps': 144281, 'loss/train': 1.7297991514205933} 11/07/2021 17:33:01 - INFO - __main__ - Step 144283: {'lr': 1.8392526555227883e-06, 'samples': 27702336, 'steps': 144282, 'loss/train': 1.3971352577209473} 11/07/2021 17:33:02 - INFO - __main__ - Step 144284: {'lr': 1.8386101811423973e-06, 'samples': 27702528, 'steps': 144283, 'loss/train': 0.744369626045227} 11/07/2021 17:33:02 - INFO - __main__ - Step 144285: {'lr': 1.8379678185797277e-06, 'samples': 27702720, 'steps': 144284, 'loss/train': 1.3275206089019775} 11/07/2021 17:33:02 - INFO - __main__ - Step 144286: {'lr': 1.837325567835002e-06, 'samples': 27702912, 'steps': 144285, 'loss/train': 0.9929385185241699} 11/07/2021 17:33:03 - INFO - __main__ - Step 144287: {'lr': 1.836683428908581e-06, 'samples': 27703104, 'steps': 144286, 'loss/train': 1.3480780124664307} 11/07/2021 17:33:04 - INFO - __main__ - Step 144288: {'lr': 1.8360414018007142e-06, 'samples': 27703296, 'steps': 144287, 'loss/train': 1.0479836463928223} 11/07/2021 17:33:04 - INFO - __main__ - Step 144289: {'lr': 1.8353994865116796e-06, 'samples': 27703488, 'steps': 144288, 'loss/train': 1.2365336418151855} 11/07/2021 17:33:05 - INFO - __main__ - Step 144290: {'lr': 1.83475768304181e-06, 'samples': 27703680, 'steps': 144289, 'loss/train': 0.7748189568519592} 11/07/2021 17:33:05 - INFO - __main__ - Step 144291: {'lr': 1.8341159913913553e-06, 'samples': 27703872, 'steps': 144290, 'loss/train': 1.5015466213226318} 11/07/2021 17:33:05 - INFO - __main__ - Step 144292: {'lr': 1.8334744115606205e-06, 'samples': 27704064, 'steps': 144291, 'loss/train': 0.33347734808921814} 11/07/2021 17:33:07 - INFO - __main__ - Step 144293: {'lr': 1.8328329435498836e-06, 'samples': 27704256, 'steps': 144292, 'loss/train': 1.2245502471923828} 11/07/2021 17:33:07 - INFO - __main__ - Step 144294: {'lr': 1.8321915873594497e-06, 'samples': 27704448, 'steps': 144293, 'loss/train': 1.3802136182785034} 11/07/2021 17:33:07 - INFO - __main__ - Step 144295: {'lr': 1.831550342989624e-06, 'samples': 27704640, 'steps': 144294, 'loss/train': 0.5892258882522583} 11/07/2021 17:33:08 - INFO - __main__ - Step 144296: {'lr': 1.830909210440629e-06, 'samples': 27704832, 'steps': 144295, 'loss/train': 1.1924480199813843} 11/07/2021 17:33:08 - INFO - __main__ - Step 144297: {'lr': 1.8302681897128248e-06, 'samples': 27705024, 'steps': 144296, 'loss/train': 1.9819186925888062} 11/07/2021 17:33:09 - INFO - __main__ - Step 144298: {'lr': 1.8296272808064619e-06, 'samples': 27705216, 'steps': 144297, 'loss/train': 1.4285093545913696} 11/07/2021 17:33:09 - INFO - __main__ - Step 144299: {'lr': 1.828986483721845e-06, 'samples': 27705408, 'steps': 144298, 'loss/train': 1.1055116653442383} 11/07/2021 17:33:10 - INFO - __main__ - Step 144300: {'lr': 1.8283457984592522e-06, 'samples': 27705600, 'steps': 144299, 'loss/train': 1.0904589891433716} 11/07/2021 17:33:10 - INFO - __main__ - Step 144301: {'lr': 1.8277052250189885e-06, 'samples': 27705792, 'steps': 144300, 'loss/train': 0.9730716347694397} 11/07/2021 17:33:10 - INFO - __main__ - Step 144302: {'lr': 1.8270647634013316e-06, 'samples': 27705984, 'steps': 144301, 'loss/train': 1.148414134979248} 11/07/2021 17:33:12 - INFO - __main__ - Step 144303: {'lr': 1.826424413606559e-06, 'samples': 27706176, 'steps': 144302, 'loss/train': 1.0664066076278687} 11/07/2021 17:33:12 - INFO - __main__ - Step 144304: {'lr': 1.8257841756349757e-06, 'samples': 27706368, 'steps': 144303, 'loss/train': 0.8999864459037781} 11/07/2021 17:33:12 - INFO - __main__ - Step 144305: {'lr': 1.8251440494868598e-06, 'samples': 27706560, 'steps': 144304, 'loss/train': 1.2139960527420044} 11/07/2021 17:33:13 - INFO - __main__ - Step 144306: {'lr': 1.8245040351624886e-06, 'samples': 27706752, 'steps': 144305, 'loss/train': 1.5663032531738281} 11/07/2021 17:33:13 - INFO - __main__ - Step 144307: {'lr': 1.8238641326621953e-06, 'samples': 27706944, 'steps': 144306, 'loss/train': 0.5723943710327148} 11/07/2021 17:33:14 - INFO - __main__ - Step 144308: {'lr': 1.8232243419862293e-06, 'samples': 27707136, 'steps': 144307, 'loss/train': 0.5271216630935669} 11/07/2021 17:33:14 - INFO - __main__ - Step 144309: {'lr': 1.8225846631348964e-06, 'samples': 27707328, 'steps': 144308, 'loss/train': 1.7717173099517822} 11/07/2021 17:33:15 - INFO - __main__ - Step 144310: {'lr': 1.8219450961084738e-06, 'samples': 27707520, 'steps': 144309, 'loss/train': 1.4132353067398071} 11/07/2021 17:33:15 - INFO - __main__ - Step 144311: {'lr': 1.8213056409072394e-06, 'samples': 27707712, 'steps': 144310, 'loss/train': 1.190677285194397} 11/07/2021 17:33:15 - INFO - __main__ - Step 144312: {'lr': 1.820666297531498e-06, 'samples': 27707904, 'steps': 144311, 'loss/train': 1.4879482984542847} 11/07/2021 17:33:17 - INFO - __main__ - Step 144313: {'lr': 1.8200270659815555e-06, 'samples': 27708096, 'steps': 144312, 'loss/train': 0.9855076670646667} 11/07/2021 17:33:17 - INFO - __main__ - Step 144314: {'lr': 1.8193879462576613e-06, 'samples': 27708288, 'steps': 144313, 'loss/train': 1.7112841606140137} 11/07/2021 17:33:17 - INFO - __main__ - Step 144315: {'lr': 1.8187489383601208e-06, 'samples': 27708480, 'steps': 144314, 'loss/train': 0.7522433996200562} 11/07/2021 17:33:18 - INFO - __main__ - Step 144316: {'lr': 1.8181100422892116e-06, 'samples': 27708672, 'steps': 144315, 'loss/train': 1.4263274669647217} 11/07/2021 17:33:18 - INFO - __main__ - Step 144317: {'lr': 1.817471258045239e-06, 'samples': 27708864, 'steps': 144316, 'loss/train': 0.42763155698776245} 11/07/2021 17:33:19 - INFO - __main__ - Step 144318: {'lr': 1.8168325856285085e-06, 'samples': 27709056, 'steps': 144317, 'loss/train': 1.2293851375579834} 11/07/2021 17:33:19 - INFO - __main__ - Step 144319: {'lr': 1.8161940250392694e-06, 'samples': 27709248, 'steps': 144318, 'loss/train': 1.6798110008239746} 11/07/2021 17:33:20 - INFO - __main__ - Step 144320: {'lr': 1.8155555762777997e-06, 'samples': 27709440, 'steps': 144319, 'loss/train': 0.6434555053710938} 11/07/2021 17:33:20 - INFO - __main__ - Step 144321: {'lr': 1.81491723934446e-06, 'samples': 27709632, 'steps': 144320, 'loss/train': 1.6843920946121216} 11/07/2021 17:33:20 - INFO - __main__ - Step 144322: {'lr': 1.814279014239445e-06, 'samples': 27709824, 'steps': 144321, 'loss/train': 1.3573282957077026} 11/07/2021 17:33:21 - INFO - __main__ - Step 144323: {'lr': 1.813640900963115e-06, 'samples': 27710016, 'steps': 144322, 'loss/train': 0.9695777893066406} 11/07/2021 17:33:22 - INFO - __main__ - Step 144324: {'lr': 1.81300289951572e-06, 'samples': 27710208, 'steps': 144323, 'loss/train': 1.1015409231185913} 11/07/2021 17:33:22 - INFO - __main__ - Step 144325: {'lr': 1.8123650098975375e-06, 'samples': 27710400, 'steps': 144324, 'loss/train': 1.246030569076538} 11/07/2021 17:33:23 - INFO - __main__ - Step 144326: {'lr': 1.811727232108873e-06, 'samples': 27710592, 'steps': 144325, 'loss/train': 1.2168244123458862} 11/07/2021 17:33:23 - INFO - __main__ - Step 144327: {'lr': 1.8110895661500315e-06, 'samples': 27710784, 'steps': 144326, 'loss/train': 1.2994719743728638} 11/07/2021 17:33:23 - INFO - __main__ - Step 144328: {'lr': 1.8104520120212909e-06, 'samples': 27710976, 'steps': 144327, 'loss/train': 0.9530442357063293} 11/07/2021 17:33:24 - INFO - __main__ - Step 144329: {'lr': 1.8098145697229285e-06, 'samples': 27711168, 'steps': 144328, 'loss/train': 1.2583144903182983} 11/07/2021 17:33:25 - INFO - __main__ - Step 144330: {'lr': 1.809177239255222e-06, 'samples': 27711360, 'steps': 144329, 'loss/train': 1.6766154766082764} 11/07/2021 17:33:25 - INFO - __main__ - Step 144331: {'lr': 1.8085400206184766e-06, 'samples': 27711552, 'steps': 144330, 'loss/train': 1.683923363685608} 11/07/2021 17:33:25 - INFO - __main__ - Step 144332: {'lr': 1.80790291381297e-06, 'samples': 27711744, 'steps': 144331, 'loss/train': 0.5264694690704346} 11/07/2021 17:33:26 - INFO - __main__ - Step 144333: {'lr': 1.8072659188389794e-06, 'samples': 27711936, 'steps': 144332, 'loss/train': 1.2336779832839966} 11/07/2021 17:33:27 - INFO - __main__ - Step 144334: {'lr': 1.8066290356968108e-06, 'samples': 27712128, 'steps': 144333, 'loss/train': 1.4523073434829712} 11/07/2021 17:33:27 - INFO - __main__ - Step 144335: {'lr': 1.8059922643867688e-06, 'samples': 27712320, 'steps': 144334, 'loss/train': 1.197759985923767} 11/07/2021 17:33:27 - INFO - __main__ - Step 144336: {'lr': 1.8053556049091035e-06, 'samples': 27712512, 'steps': 144335, 'loss/train': 1.4728553295135498} 11/07/2021 17:33:28 - INFO - __main__ - Step 144337: {'lr': 1.8047190572641204e-06, 'samples': 27712704, 'steps': 144336, 'loss/train': 1.6356709003448486} 11/07/2021 17:33:28 - INFO - __main__ - Step 144338: {'lr': 1.8040826214520966e-06, 'samples': 27712896, 'steps': 144337, 'loss/train': 1.2268263101577759} 11/07/2021 17:33:28 - INFO - __main__ - Step 144339: {'lr': 1.80344629747331e-06, 'samples': 27713088, 'steps': 144338, 'loss/train': 1.137484073638916} 11/07/2021 17:33:30 - INFO - __main__ - Step 144340: {'lr': 1.802810085328066e-06, 'samples': 27713280, 'steps': 144339, 'loss/train': 1.147347092628479} 11/07/2021 17:33:30 - INFO - __main__ - Step 144341: {'lr': 1.8021739850166697e-06, 'samples': 27713472, 'steps': 144340, 'loss/train': 1.2701374292373657} 11/07/2021 17:33:30 - INFO - __main__ - Step 144342: {'lr': 1.801537996539343e-06, 'samples': 27713664, 'steps': 144341, 'loss/train': 1.2606127262115479} 11/07/2021 17:33:31 - INFO - __main__ - Step 144343: {'lr': 1.800902119896447e-06, 'samples': 27713856, 'steps': 144342, 'loss/train': 1.1005154848098755} 11/07/2021 17:33:31 - INFO - __main__ - Step 144344: {'lr': 1.8002663550882036e-06, 'samples': 27714048, 'steps': 144343, 'loss/train': 0.9150955677032471} 11/07/2021 17:33:32 - INFO - __main__ - Step 144345: {'lr': 1.7996307021149738e-06, 'samples': 27714240, 'steps': 144344, 'loss/train': 1.2439489364624023} 11/07/2021 17:33:32 - INFO - __main__ - Step 144346: {'lr': 1.7989951609769518e-06, 'samples': 27714432, 'steps': 144345, 'loss/train': 1.0422879457473755} 11/07/2021 17:33:33 - INFO - __main__ - Step 144347: {'lr': 1.7983597316744982e-06, 'samples': 27714624, 'steps': 144346, 'loss/train': 1.293168067932129} 11/07/2021 17:33:33 - INFO - __main__ - Step 144348: {'lr': 1.7977244142078907e-06, 'samples': 27714816, 'steps': 144347, 'loss/train': 1.3061795234680176} 11/07/2021 17:33:33 - INFO - __main__ - Step 144349: {'lr': 1.7970892085773793e-06, 'samples': 27715008, 'steps': 144348, 'loss/train': 1.594761610031128} 11/07/2021 17:33:34 - INFO - __main__ - Step 144350: {'lr': 1.7964541147832692e-06, 'samples': 27715200, 'steps': 144349, 'loss/train': 1.094400405883789} 11/07/2021 17:33:35 - INFO - __main__ - Step 144351: {'lr': 1.7958191328258656e-06, 'samples': 27715392, 'steps': 144350, 'loss/train': 1.205104947090149} 11/07/2021 17:33:35 - INFO - __main__ - Step 144352: {'lr': 1.7951842627053905e-06, 'samples': 27715584, 'steps': 144351, 'loss/train': 1.0944722890853882} 11/07/2021 17:33:36 - INFO - __main__ - Step 144353: {'lr': 1.7945495044222048e-06, 'samples': 27715776, 'steps': 144352, 'loss/train': 1.6856660842895508} 11/07/2021 17:33:36 - INFO - __main__ - Step 144354: {'lr': 1.7939148579765863e-06, 'samples': 27715968, 'steps': 144353, 'loss/train': 1.2742011547088623} 11/07/2021 17:33:37 - INFO - __main__ - Step 144355: {'lr': 1.7932803233687568e-06, 'samples': 27716160, 'steps': 144354, 'loss/train': 1.0151851177215576} 11/07/2021 17:33:37 - INFO - __main__ - Step 144356: {'lr': 1.792645900599077e-06, 'samples': 27716352, 'steps': 144355, 'loss/train': 1.2440943717956543} 11/07/2021 17:33:38 - INFO - __main__ - Step 144357: {'lr': 1.792011589667797e-06, 'samples': 27716544, 'steps': 144356, 'loss/train': 1.3286703824996948} 11/07/2021 17:33:38 - INFO - __main__ - Step 144358: {'lr': 1.791377390575194e-06, 'samples': 27716736, 'steps': 144357, 'loss/train': 1.646584153175354} 11/07/2021 17:33:38 - INFO - __main__ - Step 144359: {'lr': 1.7907433033215736e-06, 'samples': 27716928, 'steps': 144358, 'loss/train': 1.0439954996109009} 11/07/2021 17:33:39 - INFO - __main__ - Step 144360: {'lr': 1.7901093279071857e-06, 'samples': 27717120, 'steps': 144359, 'loss/train': 1.5174726247787476} 11/07/2021 17:33:40 - INFO - __main__ - Step 144361: {'lr': 1.7894754643323907e-06, 'samples': 27717312, 'steps': 144360, 'loss/train': 1.3139697313308716} 11/07/2021 17:33:40 - INFO - __main__ - Step 144362: {'lr': 1.7888417125974111e-06, 'samples': 27717504, 'steps': 144361, 'loss/train': 0.4078291654586792} 11/07/2021 17:33:40 - INFO - __main__ - Step 144363: {'lr': 1.7882080727025517e-06, 'samples': 27717696, 'steps': 144362, 'loss/train': 1.4745346307754517} 11/07/2021 17:33:41 - INFO - __main__ - Step 144364: {'lr': 1.7875745446480906e-06, 'samples': 27717888, 'steps': 144363, 'loss/train': 1.1284173727035522} 11/07/2021 17:33:41 - INFO - __main__ - Step 144365: {'lr': 1.786941128434305e-06, 'samples': 27718080, 'steps': 144364, 'loss/train': 0.9642359614372253} 11/07/2021 17:33:42 - INFO - __main__ - Step 144366: {'lr': 1.7863078240615005e-06, 'samples': 27718272, 'steps': 144365, 'loss/train': 0.9819007515907288} 11/07/2021 17:33:42 - INFO - __main__ - Step 144367: {'lr': 1.7856746315299543e-06, 'samples': 27718464, 'steps': 144366, 'loss/train': 1.1908652782440186} 11/07/2021 17:33:43 - INFO - __main__ - Step 144368: {'lr': 1.785041550839972e-06, 'samples': 27718656, 'steps': 144367, 'loss/train': 1.1152318716049194} 11/07/2021 17:33:43 - INFO - __main__ - Step 144369: {'lr': 1.7844085819918033e-06, 'samples': 27718848, 'steps': 144368, 'loss/train': 1.3695214986801147} 11/07/2021 17:33:44 - INFO - __main__ - Step 144370: {'lr': 1.7837757249857534e-06, 'samples': 27719040, 'steps': 144369, 'loss/train': 1.4674146175384521} 11/07/2021 17:33:45 - INFO - __main__ - Step 144371: {'lr': 1.7831429798221e-06, 'samples': 27719232, 'steps': 144370, 'loss/train': 1.0509088039398193} 11/07/2021 17:33:45 - INFO - __main__ - Step 144372: {'lr': 1.7825103465011482e-06, 'samples': 27719424, 'steps': 144371, 'loss/train': 0.6552433967590332} 11/07/2021 17:33:45 - INFO - __main__ - Step 144373: {'lr': 1.7818778250231483e-06, 'samples': 27719616, 'steps': 144372, 'loss/train': 1.485298752784729} 11/07/2021 17:33:46 - INFO - __main__ - Step 144374: {'lr': 1.781245415388405e-06, 'samples': 27719808, 'steps': 144373, 'loss/train': 1.6240652799606323} 11/07/2021 17:33:46 - INFO - __main__ - Step 144375: {'lr': 1.780613117597224e-06, 'samples': 27720000, 'steps': 144374, 'loss/train': 1.2868095636367798} 11/07/2021 17:33:47 - INFO - __main__ - Step 144376: {'lr': 1.779980931649855e-06, 'samples': 27720192, 'steps': 144375, 'loss/train': 0.9295515418052673} 11/07/2021 17:33:47 - INFO - __main__ - Step 144377: {'lr': 1.7793488575466032e-06, 'samples': 27720384, 'steps': 144376, 'loss/train': 1.28999662399292} 11/07/2021 17:33:48 - INFO - __main__ - Step 144378: {'lr': 1.7787168952877187e-06, 'samples': 27720576, 'steps': 144377, 'loss/train': 0.6634238958358765} 11/07/2021 17:33:48 - INFO - __main__ - Step 144379: {'lr': 1.7780850448735342e-06, 'samples': 27720768, 'steps': 144378, 'loss/train': 1.2624056339263916} 11/07/2021 17:33:49 - INFO - __main__ - Step 144380: {'lr': 1.7774533063043274e-06, 'samples': 27720960, 'steps': 144379, 'loss/train': 1.3109513521194458} 11/07/2021 17:33:50 - INFO - __main__ - Step 144381: {'lr': 1.7768216795803483e-06, 'samples': 27721152, 'steps': 144380, 'loss/train': 1.3072993755340576} 11/07/2021 17:33:50 - INFO - __main__ - Step 144382: {'lr': 1.7761901647019018e-06, 'samples': 27721344, 'steps': 144381, 'loss/train': 1.355739712715149} 11/07/2021 17:33:50 - INFO - __main__ - Step 144383: {'lr': 1.7755587616692937e-06, 'samples': 27721536, 'steps': 144382, 'loss/train': 1.0817201137542725} 11/07/2021 17:33:51 - INFO - __main__ - Step 144384: {'lr': 1.7749274704827733e-06, 'samples': 27721728, 'steps': 144383, 'loss/train': 1.5313693284988403} 11/07/2021 17:33:51 - INFO - __main__ - Step 144385: {'lr': 1.7742962911426464e-06, 'samples': 27721920, 'steps': 144384, 'loss/train': 1.0994309186935425} 11/07/2021 17:33:51 - INFO - __main__ - Step 144386: {'lr': 1.7736652236491625e-06, 'samples': 27722112, 'steps': 144385, 'loss/train': 1.040442705154419} 11/07/2021 17:33:52 - INFO - __main__ - Step 144387: {'lr': 1.7730342680026824e-06, 'samples': 27722304, 'steps': 144386, 'loss/train': 1.4294991493225098} 11/07/2021 17:33:53 - INFO - __main__ - Step 144388: {'lr': 1.7724034242034282e-06, 'samples': 27722496, 'steps': 144387, 'loss/train': 0.638450026512146} 11/07/2021 17:33:53 - INFO - __main__ - Step 144389: {'lr': 1.7717726922516774e-06, 'samples': 27722688, 'steps': 144388, 'loss/train': 0.9945452213287354} 11/07/2021 17:33:53 - INFO - __main__ - Step 144390: {'lr': 1.7711420721477634e-06, 'samples': 27722880, 'steps': 144389, 'loss/train': 1.5385304689407349} 11/07/2021 17:33:54 - INFO - __main__ - Step 144391: {'lr': 1.7705115638919356e-06, 'samples': 27723072, 'steps': 144390, 'loss/train': 1.4695987701416016} 11/07/2021 17:33:55 - INFO - __main__ - Step 144392: {'lr': 1.7698811674844717e-06, 'samples': 27723264, 'steps': 144391, 'loss/train': 1.3315461874008179} 11/07/2021 17:33:55 - INFO - __main__ - Step 144393: {'lr': 1.769250882925677e-06, 'samples': 27723456, 'steps': 144392, 'loss/train': 1.4453760385513306} 11/07/2021 17:33:56 - INFO - __main__ - Step 144394: {'lr': 1.7686207102158014e-06, 'samples': 27723648, 'steps': 144393, 'loss/train': 1.2953879833221436} 11/07/2021 17:33:56 - INFO - __main__ - Step 144395: {'lr': 1.7679906493551778e-06, 'samples': 27723840, 'steps': 144394, 'loss/train': 1.3345324993133545} 11/07/2021 17:33:56 - INFO - __main__ - Step 144396: {'lr': 1.767360700344084e-06, 'samples': 27724032, 'steps': 144395, 'loss/train': 1.2735872268676758} 11/07/2021 17:33:57 - INFO - __main__ - Step 144397: {'lr': 1.7667308631827694e-06, 'samples': 27724224, 'steps': 144396, 'loss/train': 0.9924308657646179} 11/07/2021 17:33:58 - INFO - __main__ - Step 144398: {'lr': 1.7661011378715396e-06, 'samples': 27724416, 'steps': 144397, 'loss/train': 1.3014286756515503} 11/07/2021 17:33:58 - INFO - __main__ - Step 144399: {'lr': 1.7654715244106722e-06, 'samples': 27724608, 'steps': 144398, 'loss/train': 1.1209919452667236} 11/07/2021 17:33:58 - INFO - __main__ - Step 144400: {'lr': 1.7648420228004446e-06, 'samples': 27724800, 'steps': 144399, 'loss/train': 1.6849546432495117} 11/07/2021 17:33:59 - INFO - __main__ - Step 144401: {'lr': 1.7642126330411624e-06, 'samples': 27724992, 'steps': 144400, 'loss/train': 1.3016493320465088} 11/07/2021 17:34:00 - INFO - __main__ - Step 144402: {'lr': 1.7635833551331026e-06, 'samples': 27725184, 'steps': 144401, 'loss/train': 1.3315558433532715} 11/07/2021 17:34:00 - INFO - __main__ - Step 144403: {'lr': 1.7629541890765155e-06, 'samples': 27725376, 'steps': 144402, 'loss/train': 0.9988191723823547} 11/07/2021 17:34:00 - INFO - __main__ - Step 144404: {'lr': 1.7623251348717339e-06, 'samples': 27725568, 'steps': 144403, 'loss/train': 1.7755441665649414} 11/07/2021 17:34:01 - INFO - __main__ - Step 144405: {'lr': 1.7616961925190077e-06, 'samples': 27725760, 'steps': 144404, 'loss/train': 1.3470062017440796} 11/07/2021 17:34:01 - INFO - __main__ - Step 144406: {'lr': 1.7610673620186145e-06, 'samples': 27725952, 'steps': 144405, 'loss/train': 1.607301115989685} 11/07/2021 17:34:02 - INFO - __main__ - Step 144407: {'lr': 1.7604386433708874e-06, 'samples': 27726144, 'steps': 144406, 'loss/train': 1.2659512758255005} 11/07/2021 17:34:03 - INFO - __main__ - Step 144408: {'lr': 1.7598100365760483e-06, 'samples': 27726336, 'steps': 144407, 'loss/train': 1.039455771446228} 11/07/2021 17:34:03 - INFO - __main__ - Step 144409: {'lr': 1.7591815416344303e-06, 'samples': 27726528, 'steps': 144408, 'loss/train': 0.8478962779045105} 11/07/2021 17:34:03 - INFO - __main__ - Step 144410: {'lr': 1.7585531585462832e-06, 'samples': 27726720, 'steps': 144409, 'loss/train': 1.0441629886627197} 11/07/2021 17:34:04 - INFO - __main__ - Step 144411: {'lr': 1.7579248873118846e-06, 'samples': 27726912, 'steps': 144410, 'loss/train': 1.0350323915481567} 11/07/2021 17:34:05 - INFO - __main__ - Step 144412: {'lr': 1.75729672793154e-06, 'samples': 27727104, 'steps': 144411, 'loss/train': 1.6916885375976562} 11/07/2021 17:34:05 - INFO - __main__ - Step 144413: {'lr': 1.7566686804055542e-06, 'samples': 27727296, 'steps': 144412, 'loss/train': 1.506980299949646} 11/07/2021 17:34:05 - INFO - __main__ - Step 144414: {'lr': 1.7560407447341497e-06, 'samples': 27727488, 'steps': 144413, 'loss/train': 1.1103178262710571} 11/07/2021 17:34:06 - INFO - __main__ - Step 144415: {'lr': 1.7554129209176872e-06, 'samples': 27727680, 'steps': 144414, 'loss/train': 1.3405203819274902} 11/07/2021 17:34:06 - INFO - __main__ - Step 144416: {'lr': 1.754785208956361e-06, 'samples': 27727872, 'steps': 144415, 'loss/train': 1.7920451164245605} 11/07/2021 17:34:06 - INFO - __main__ - Step 144417: {'lr': 1.7541576088505319e-06, 'samples': 27728064, 'steps': 144416, 'loss/train': 1.4587346315383911} 11/07/2021 17:34:07 - INFO - __main__ - Step 144418: {'lr': 1.7535301206004217e-06, 'samples': 27728256, 'steps': 144417, 'loss/train': 1.8815211057662964} 11/07/2021 17:34:08 - INFO - __main__ - Step 144419: {'lr': 1.7529027442063361e-06, 'samples': 27728448, 'steps': 144418, 'loss/train': 1.6996748447418213} 11/07/2021 17:34:08 - INFO - __main__ - Step 144420: {'lr': 1.7522754796685803e-06, 'samples': 27728640, 'steps': 144419, 'loss/train': 1.1084364652633667} 11/07/2021 17:34:09 - INFO - __main__ - Step 144421: {'lr': 1.7516483269874317e-06, 'samples': 27728832, 'steps': 144420, 'loss/train': 1.4566465616226196} 11/07/2021 17:34:09 - INFO - __main__ - Step 144422: {'lr': 1.7510212861631402e-06, 'samples': 27729024, 'steps': 144421, 'loss/train': 0.9175388813018799} 11/07/2021 17:34:10 - INFO - __main__ - Step 144423: {'lr': 1.7503943571959835e-06, 'samples': 27729216, 'steps': 144422, 'loss/train': 1.2470701932907104} 11/07/2021 17:34:11 - INFO - __main__ - Step 144424: {'lr': 1.7497675400862944e-06, 'samples': 27729408, 'steps': 144423, 'loss/train': 0.20426762104034424} 11/07/2021 17:34:11 - INFO - __main__ - Step 144425: {'lr': 1.7491408348343508e-06, 'samples': 27729600, 'steps': 144424, 'loss/train': 1.2090965509414673} 11/07/2021 17:34:11 - INFO - __main__ - Step 144426: {'lr': 1.7485142414403743e-06, 'samples': 27729792, 'steps': 144425, 'loss/train': 1.076468825340271} 11/07/2021 17:34:12 - INFO - __main__ - Step 144427: {'lr': 1.7478877599046983e-06, 'samples': 27729984, 'steps': 144426, 'loss/train': 1.1428085565567017} 11/07/2021 17:34:13 - INFO - __main__ - Step 144428: {'lr': 1.7472613902276002e-06, 'samples': 27730176, 'steps': 144427, 'loss/train': 1.314380168914795} 11/07/2021 17:34:13 - INFO - __main__ - Step 144429: {'lr': 1.7466351324093855e-06, 'samples': 27730368, 'steps': 144428, 'loss/train': 1.2913132905960083} 11/07/2021 17:34:14 - INFO - __main__ - Step 144430: {'lr': 1.746008986450276e-06, 'samples': 27730560, 'steps': 144429, 'loss/train': 0.9422720670700073} 11/07/2021 17:34:14 - INFO - __main__ - Step 144431: {'lr': 1.745382952350577e-06, 'samples': 27730752, 'steps': 144430, 'loss/train': 1.0935313701629639} 11/07/2021 17:34:14 - INFO - __main__ - Step 144432: {'lr': 1.7447570301105664e-06, 'samples': 27730944, 'steps': 144431, 'loss/train': 1.1459639072418213} 11/07/2021 17:34:15 - INFO - __main__ - Step 144433: {'lr': 1.7441312197305492e-06, 'samples': 27731136, 'steps': 144432, 'loss/train': 1.5383145809173584} 11/07/2021 17:34:16 - INFO - __main__ - Step 144434: {'lr': 1.7435055212108031e-06, 'samples': 27731328, 'steps': 144433, 'loss/train': 1.0182985067367554} 11/07/2021 17:34:16 - INFO - __main__ - Step 144435: {'lr': 1.7428799345516056e-06, 'samples': 27731520, 'steps': 144434, 'loss/train': 1.5165678262710571} 11/07/2021 17:34:17 - INFO - __main__ - Step 144436: {'lr': 1.7422544597532341e-06, 'samples': 27731712, 'steps': 144435, 'loss/train': 1.5128165483474731} 11/07/2021 17:34:17 - INFO - __main__ - Step 144437: {'lr': 1.7416290968159664e-06, 'samples': 27731904, 'steps': 144436, 'loss/train': 1.3736892938613892} 11/07/2021 17:34:17 - INFO - __main__ - Step 144438: {'lr': 1.74100384574008e-06, 'samples': 27732096, 'steps': 144437, 'loss/train': 1.6785941123962402} 11/07/2021 17:34:18 - INFO - __main__ - Step 144439: {'lr': 1.7403787065258803e-06, 'samples': 27732288, 'steps': 144438, 'loss/train': 1.3816334009170532} 11/07/2021 17:34:19 - INFO - __main__ - Step 144440: {'lr': 1.7397536791736446e-06, 'samples': 27732480, 'steps': 144439, 'loss/train': 1.1945816278457642} 11/07/2021 17:34:19 - INFO - __main__ - Step 144441: {'lr': 1.7391287636836228e-06, 'samples': 27732672, 'steps': 144440, 'loss/train': 1.0787607431411743} 11/07/2021 17:34:19 - INFO - __main__ - Step 144442: {'lr': 1.738503960056148e-06, 'samples': 27732864, 'steps': 144441, 'loss/train': 1.2795087099075317} 11/07/2021 17:34:20 - INFO - __main__ - Step 144443: {'lr': 1.7378792682914423e-06, 'samples': 27733056, 'steps': 144442, 'loss/train': 1.4546555280685425} 11/07/2021 17:34:21 - INFO - __main__ - Step 144444: {'lr': 1.737254688389839e-06, 'samples': 27733248, 'steps': 144443, 'loss/train': 1.4648665189743042} 11/07/2021 17:34:21 - INFO - __main__ - Step 144445: {'lr': 1.736630220351587e-06, 'samples': 27733440, 'steps': 144444, 'loss/train': 1.0405415296554565} 11/07/2021 17:34:21 - INFO - __main__ - Step 144446: {'lr': 1.736005864176965e-06, 'samples': 27733632, 'steps': 144445, 'loss/train': 0.8132141828536987} 11/07/2021 17:34:22 - INFO - __main__ - Step 144447: {'lr': 1.7353816198662776e-06, 'samples': 27733824, 'steps': 144446, 'loss/train': 1.1151208877563477} 11/07/2021 17:34:22 - INFO - __main__ - Step 144448: {'lr': 1.7347574874198024e-06, 'samples': 27734016, 'steps': 144447, 'loss/train': 1.1585453748703003} 11/07/2021 17:34:23 - INFO - __main__ - Step 144449: {'lr': 1.7341334668378173e-06, 'samples': 27734208, 'steps': 144448, 'loss/train': 1.2743897438049316} 11/07/2021 17:34:24 - INFO - __main__ - Step 144450: {'lr': 1.7335095581205994e-06, 'samples': 27734400, 'steps': 144449, 'loss/train': 1.6015633344650269} 11/07/2021 17:34:24 - INFO - __main__ - Step 144451: {'lr': 1.7328857612684267e-06, 'samples': 27734592, 'steps': 144450, 'loss/train': 1.4569146633148193} 11/07/2021 17:34:24 - INFO - __main__ - Step 144452: {'lr': 1.7322620762815766e-06, 'samples': 27734784, 'steps': 144451, 'loss/train': 1.2988146543502808} 11/07/2021 17:34:25 - INFO - __main__ - Step 144453: {'lr': 1.7316385031603542e-06, 'samples': 27734976, 'steps': 144452, 'loss/train': 1.0914591550827026} 11/07/2021 17:34:26 - INFO - __main__ - Step 144454: {'lr': 1.7310150419050097e-06, 'samples': 27735168, 'steps': 144453, 'loss/train': 1.1648880243301392} 11/07/2021 17:34:26 - INFO - __main__ - Step 144455: {'lr': 1.730391692515848e-06, 'samples': 27735360, 'steps': 144454, 'loss/train': 1.269775629043579} 11/07/2021 17:34:26 - INFO - __main__ - Step 144456: {'lr': 1.729768454993147e-06, 'samples': 27735552, 'steps': 144455, 'loss/train': 0.8077573776245117} 11/07/2021 17:34:27 - INFO - __main__ - Step 144457: {'lr': 1.729145329337184e-06, 'samples': 27735744, 'steps': 144456, 'loss/train': 0.822123646736145} 11/07/2021 17:34:27 - INFO - __main__ - Step 144458: {'lr': 1.7285223155482088e-06, 'samples': 27735936, 'steps': 144457, 'loss/train': 1.3020097017288208} 11/07/2021 17:34:28 - INFO - __main__ - Step 144459: {'lr': 1.7278994136265546e-06, 'samples': 27736128, 'steps': 144458, 'loss/train': 0.03568603843450546} 11/07/2021 17:34:29 - INFO - __main__ - Step 144460: {'lr': 1.7272766235724712e-06, 'samples': 27736320, 'steps': 144459, 'loss/train': 1.4295611381530762} 11/07/2021 17:34:29 - INFO - __main__ - Step 144461: {'lr': 1.7266539453862363e-06, 'samples': 27736512, 'steps': 144460, 'loss/train': 1.3528251647949219} 11/07/2021 17:34:29 - INFO - __main__ - Step 144462: {'lr': 1.7260313790681547e-06, 'samples': 27736704, 'steps': 144461, 'loss/train': 1.2935353517532349} 11/07/2021 17:34:30 - INFO - __main__ - Step 144463: {'lr': 1.7254089246185045e-06, 'samples': 27736896, 'steps': 144462, 'loss/train': 1.2366613149642944} 11/07/2021 17:34:31 - INFO - __main__ - Step 144464: {'lr': 1.7247865820375352e-06, 'samples': 27737088, 'steps': 144463, 'loss/train': 1.370854139328003} 11/07/2021 17:34:31 - INFO - __main__ - Step 144465: {'lr': 1.7241643513255246e-06, 'samples': 27737280, 'steps': 144464, 'loss/train': 1.3849133253097534} 11/07/2021 17:34:31 - INFO - __main__ - Step 144466: {'lr': 1.7235422324828054e-06, 'samples': 27737472, 'steps': 144465, 'loss/train': 1.394189476966858} 11/07/2021 17:34:32 - INFO - __main__ - Step 144467: {'lr': 1.7229202255096276e-06, 'samples': 27737664, 'steps': 144466, 'loss/train': 1.5156474113464355} 11/07/2021 17:34:32 - INFO - __main__ - Step 144468: {'lr': 1.7222983304062411e-06, 'samples': 27737856, 'steps': 144467, 'loss/train': 1.020707130432129} 11/07/2021 17:34:33 - INFO - __main__ - Step 144469: {'lr': 1.7216765471730066e-06, 'samples': 27738048, 'steps': 144468, 'loss/train': 1.2467131614685059} 11/07/2021 17:34:34 - INFO - __main__ - Step 144470: {'lr': 1.7210548758101186e-06, 'samples': 27738240, 'steps': 144469, 'loss/train': 1.4023860692977905} 11/07/2021 17:34:34 - INFO - __main__ - Step 144471: {'lr': 1.720433316317882e-06, 'samples': 27738432, 'steps': 144470, 'loss/train': 0.9961729049682617} 11/07/2021 17:34:34 - INFO - __main__ - Step 144472: {'lr': 1.7198118686966025e-06, 'samples': 27738624, 'steps': 144471, 'loss/train': 1.1408097743988037} 11/07/2021 17:34:35 - INFO - __main__ - Step 144473: {'lr': 1.7191905329465574e-06, 'samples': 27738816, 'steps': 144472, 'loss/train': 1.282628059387207} 11/07/2021 17:34:35 - INFO - __main__ - Step 144474: {'lr': 1.7185693090679965e-06, 'samples': 27739008, 'steps': 144473, 'loss/train': 1.6296511888504028} 11/07/2021 17:34:36 - INFO - __main__ - Step 144475: {'lr': 1.7179481970612254e-06, 'samples': 27739200, 'steps': 144474, 'loss/train': 1.1017996072769165} 11/07/2021 17:34:37 - INFO - __main__ - Step 144476: {'lr': 1.7173271969264936e-06, 'samples': 27739392, 'steps': 144475, 'loss/train': 1.3702925443649292} 11/07/2021 17:34:37 - INFO - __main__ - Step 144477: {'lr': 1.7167063086641344e-06, 'samples': 27739584, 'steps': 144476, 'loss/train': 0.433072566986084} 11/07/2021 17:34:37 - INFO - __main__ - Step 144478: {'lr': 1.7160855322743696e-06, 'samples': 27739776, 'steps': 144477, 'loss/train': 1.324779987335205} 11/07/2021 17:34:38 - INFO - __main__ - Step 144479: {'lr': 1.7154648677575323e-06, 'samples': 27739968, 'steps': 144478, 'loss/train': 1.03327214717865} 11/07/2021 17:34:39 - INFO - __main__ - Step 144480: {'lr': 1.7148443151138448e-06, 'samples': 27740160, 'steps': 144479, 'loss/train': 0.9807614684104919} 11/07/2021 17:34:39 - INFO - __main__ - Step 144481: {'lr': 1.71422387434364e-06, 'samples': 27740352, 'steps': 144480, 'loss/train': 1.115209698677063} 11/07/2021 17:34:39 - INFO - __main__ - Step 144482: {'lr': 1.7136035454471676e-06, 'samples': 27740544, 'steps': 144481, 'loss/train': 0.6052078008651733} 11/07/2021 17:34:40 - INFO - __main__ - Step 144483: {'lr': 1.712983328424733e-06, 'samples': 27740736, 'steps': 144482, 'loss/train': 1.0461840629577637} 11/07/2021 17:34:40 - INFO - __main__ - Step 144484: {'lr': 1.7123632232765584e-06, 'samples': 27740928, 'steps': 144483, 'loss/train': 1.167881727218628} 11/07/2021 17:34:41 - INFO - __main__ - Step 144485: {'lr': 1.7117432300030044e-06, 'samples': 27741120, 'steps': 144484, 'loss/train': 1.1338306665420532} 11/07/2021 17:34:41 - INFO - __main__ - Step 144486: {'lr': 1.7111233486042655e-06, 'samples': 27741312, 'steps': 144485, 'loss/train': 1.413141131401062} 11/07/2021 17:34:42 - INFO - __main__ - Step 144487: {'lr': 1.7105035790807023e-06, 'samples': 27741504, 'steps': 144486, 'loss/train': 0.8555063605308533} 11/07/2021 17:34:42 - INFO - __main__ - Step 144488: {'lr': 1.709883921432509e-06, 'samples': 27741696, 'steps': 144487, 'loss/train': 1.6155216693878174} 11/07/2021 17:34:42 - INFO - __main__ - Step 144489: {'lr': 1.7092643756600468e-06, 'samples': 27741888, 'steps': 144488, 'loss/train': 1.3964402675628662} 11/07/2021 17:34:44 - INFO - __main__ - Step 144490: {'lr': 1.7086449417635374e-06, 'samples': 27742080, 'steps': 144489, 'loss/train': 1.4077544212341309} 11/07/2021 17:34:44 - INFO - __main__ - Step 144491: {'lr': 1.7080256197433143e-06, 'samples': 27742272, 'steps': 144490, 'loss/train': 1.212348461151123} 11/07/2021 17:34:45 - INFO - __main__ - Step 144492: {'lr': 1.707406409599599e-06, 'samples': 27742464, 'steps': 144491, 'loss/train': 1.2385438680648804} 11/07/2021 17:34:45 - INFO - __main__ - Step 144493: {'lr': 1.7067873113326971e-06, 'samples': 27742656, 'steps': 144492, 'loss/train': 1.329586386680603} 11/07/2021 17:34:45 - INFO - __main__ - Step 144494: {'lr': 1.706168324942886e-06, 'samples': 27742848, 'steps': 144493, 'loss/train': 1.7059078216552734} 11/07/2021 17:34:46 - INFO - __main__ - Step 144495: {'lr': 1.7055494504304435e-06, 'samples': 27743040, 'steps': 144494, 'loss/train': 1.6874414682388306} 11/07/2021 17:34:46 - INFO - __main__ - Step 144496: {'lr': 1.7049306877956473e-06, 'samples': 27743232, 'steps': 144495, 'loss/train': 1.4016375541687012} 11/07/2021 17:34:47 - INFO - __main__ - Step 144497: {'lr': 1.7043120370387743e-06, 'samples': 27743424, 'steps': 144496, 'loss/train': 1.229004979133606} 11/07/2021 17:34:48 - INFO - __main__ - Step 144498: {'lr': 1.7036934981601303e-06, 'samples': 27743616, 'steps': 144497, 'loss/train': 0.5036287903785706} 11/07/2021 17:34:48 - INFO - __main__ - Step 144499: {'lr': 1.7030750711599373e-06, 'samples': 27743808, 'steps': 144498, 'loss/train': 0.9494361877441406} 11/07/2021 17:34:48 - INFO - __main__ - Step 144500: {'lr': 1.7024567560385284e-06, 'samples': 27744000, 'steps': 144499, 'loss/train': 1.167291283607483} 11/07/2021 17:34:49 - INFO - __main__ - Step 144501: {'lr': 1.7018385527961532e-06, 'samples': 27744192, 'steps': 144500, 'loss/train': 1.2276824712753296} 11/07/2021 17:34:50 - INFO - __main__ - Step 144502: {'lr': 1.7012204614330895e-06, 'samples': 27744384, 'steps': 144501, 'loss/train': 1.3249316215515137} 11/07/2021 17:34:50 - INFO - __main__ - Step 144503: {'lr': 1.7006024819496701e-06, 'samples': 27744576, 'steps': 144502, 'loss/train': 1.5107722282409668} 11/07/2021 17:34:50 - INFO - __main__ - Step 144504: {'lr': 1.6999846143460895e-06, 'samples': 27744768, 'steps': 144503, 'loss/train': 1.397539496421814} 11/07/2021 17:34:51 - INFO - __main__ - Step 144505: {'lr': 1.699366858622653e-06, 'samples': 27744960, 'steps': 144504, 'loss/train': 1.1717932224273682} 11/07/2021 17:34:51 - INFO - __main__ - Step 144506: {'lr': 1.6987492147796656e-06, 'samples': 27745152, 'steps': 144505, 'loss/train': 1.373023271560669} 11/07/2021 17:34:52 - INFO - __main__ - Step 144507: {'lr': 1.6981316828173775e-06, 'samples': 27745344, 'steps': 144506, 'loss/train': 1.3672740459442139} 11/07/2021 17:34:52 - INFO - __main__ - Step 144508: {'lr': 1.697514262736094e-06, 'samples': 27745536, 'steps': 144507, 'loss/train': 0.7498220205307007} 11/07/2021 17:34:53 - INFO - __main__ - Step 144509: {'lr': 1.6968969545360924e-06, 'samples': 27745728, 'steps': 144508, 'loss/train': 0.791182816028595} 11/07/2021 17:34:53 - INFO - __main__ - Step 144510: {'lr': 1.6962797582176227e-06, 'samples': 27745920, 'steps': 144509, 'loss/train': 1.0314580202102661} 11/07/2021 17:34:53 - INFO - __main__ - Step 144511: {'lr': 1.6956626737809622e-06, 'samples': 27746112, 'steps': 144510, 'loss/train': 0.9339500069618225} 11/07/2021 17:34:55 - INFO - __main__ - Step 144512: {'lr': 1.6950457012264165e-06, 'samples': 27746304, 'steps': 144511, 'loss/train': 1.5179146528244019} 11/07/2021 17:34:55 - INFO - __main__ - Step 144513: {'lr': 1.694428840554263e-06, 'samples': 27746496, 'steps': 144512, 'loss/train': 1.4129788875579834} 11/07/2021 17:34:55 - INFO - __main__ - Step 144514: {'lr': 1.6938120917647792e-06, 'samples': 27746688, 'steps': 144513, 'loss/train': 1.1723084449768066} 11/07/2021 17:34:56 - INFO - __main__ - Step 144515: {'lr': 1.6931954548582152e-06, 'samples': 27746880, 'steps': 144514, 'loss/train': 1.099509358406067} 11/07/2021 17:34:56 - INFO - __main__ - Step 144516: {'lr': 1.6925789298348482e-06, 'samples': 27747072, 'steps': 144515, 'loss/train': 1.1216051578521729} 11/07/2021 17:34:57 - INFO - __main__ - Step 144517: {'lr': 1.6919625166949836e-06, 'samples': 27747264, 'steps': 144516, 'loss/train': 1.3120372295379639} 11/07/2021 17:34:57 - INFO - __main__ - Step 144518: {'lr': 1.6913462154388993e-06, 'samples': 27747456, 'steps': 144517, 'loss/train': 0.9355857968330383} 11/07/2021 17:34:58 - INFO - __main__ - Step 144519: {'lr': 1.6907300260668445e-06, 'samples': 27747648, 'steps': 144518, 'loss/train': 1.5303391218185425} 11/07/2021 17:34:58 - INFO - __main__ - Step 144520: {'lr': 1.690113948579125e-06, 'samples': 27747840, 'steps': 144519, 'loss/train': 1.1698945760726929} 11/07/2021 17:34:58 - INFO - __main__ - Step 144521: {'lr': 1.6894979829760182e-06, 'samples': 27748032, 'steps': 144520, 'loss/train': 1.5752527713775635} 11/07/2021 17:34:59 - INFO - __main__ - Step 144522: {'lr': 1.6888821292578016e-06, 'samples': 27748224, 'steps': 144521, 'loss/train': 1.7740789651870728} 11/07/2021 17:35:00 - INFO - __main__ - Step 144523: {'lr': 1.6882663874247251e-06, 'samples': 27748416, 'steps': 144522, 'loss/train': 0.8808435201644897} 11/07/2021 17:35:00 - INFO - __main__ - Step 144524: {'lr': 1.687650757477066e-06, 'samples': 27748608, 'steps': 144523, 'loss/train': 1.4957537651062012} 11/07/2021 17:35:00 - INFO - __main__ - Step 144525: {'lr': 1.6870352394151579e-06, 'samples': 27748800, 'steps': 144524, 'loss/train': 1.453264832496643} 11/07/2021 17:35:01 - INFO - __main__ - Step 144526: {'lr': 1.6864198332392221e-06, 'samples': 27748992, 'steps': 144525, 'loss/train': 1.0042009353637695} 11/07/2021 17:35:01 - INFO - __main__ - Step 144527: {'lr': 1.6858045389495646e-06, 'samples': 27749184, 'steps': 144526, 'loss/train': 1.0817161798477173} 11/07/2021 17:35:02 - INFO - __main__ - Step 144528: {'lr': 1.6851893565464348e-06, 'samples': 27749376, 'steps': 144527, 'loss/train': 1.8483713865280151} 11/07/2021 17:35:03 - INFO - __main__ - Step 144529: {'lr': 1.6845742860301383e-06, 'samples': 27749568, 'steps': 144528, 'loss/train': 1.3556956052780151} 11/07/2021 17:35:03 - INFO - __main__ - Step 144530: {'lr': 1.6839593274009247e-06, 'samples': 27749760, 'steps': 144529, 'loss/train': 0.528870701789856} 11/07/2021 17:35:03 - INFO - __main__ - Step 144531: {'lr': 1.6833444806590992e-06, 'samples': 27749952, 'steps': 144530, 'loss/train': 1.2765672206878662} 11/07/2021 17:35:04 - INFO - __main__ - Step 144532: {'lr': 1.6827297458049117e-06, 'samples': 27750144, 'steps': 144531, 'loss/train': 1.3865286111831665} 11/07/2021 17:35:05 - INFO - __main__ - Step 144533: {'lr': 1.6821151228386678e-06, 'samples': 27750336, 'steps': 144532, 'loss/train': 1.4200594425201416} 11/07/2021 17:35:05 - INFO - __main__ - Step 144534: {'lr': 1.6815006117606446e-06, 'samples': 27750528, 'steps': 144533, 'loss/train': 1.061115026473999} 11/07/2021 17:35:05 - INFO - __main__ - Step 144535: {'lr': 1.680886212571092e-06, 'samples': 27750720, 'steps': 144534, 'loss/train': 1.436996340751648} 11/07/2021 17:35:06 - INFO - __main__ - Step 144536: {'lr': 1.6802719252703159e-06, 'samples': 27750912, 'steps': 144535, 'loss/train': 1.1846156120300293} 11/07/2021 17:35:06 - INFO - __main__ - Step 144537: {'lr': 1.6796577498585375e-06, 'samples': 27751104, 'steps': 144536, 'loss/train': 1.2522165775299072} 11/07/2021 17:35:07 - INFO - __main__ - Step 144538: {'lr': 1.6790436863361181e-06, 'samples': 27751296, 'steps': 144537, 'loss/train': 1.91649329662323} 11/07/2021 17:35:07 - INFO - __main__ - Step 144539: {'lr': 1.6784297347032518e-06, 'samples': 27751488, 'steps': 144538, 'loss/train': 1.205330491065979} 11/07/2021 17:35:08 - INFO - __main__ - Step 144540: {'lr': 1.6778158949602718e-06, 'samples': 27751680, 'steps': 144539, 'loss/train': 1.6796979904174805} 11/07/2021 17:35:08 - INFO - __main__ - Step 144541: {'lr': 1.6772021671074556e-06, 'samples': 27751872, 'steps': 144540, 'loss/train': 1.2918957471847534} 11/07/2021 17:35:08 - INFO - __main__ - Step 144542: {'lr': 1.6765885511450252e-06, 'samples': 27752064, 'steps': 144541, 'loss/train': 1.2746111154556274} 11/07/2021 17:35:10 - INFO - __main__ - Step 144543: {'lr': 1.6759750470733138e-06, 'samples': 27752256, 'steps': 144542, 'loss/train': 1.3741494417190552} 11/07/2021 17:35:10 - INFO - __main__ - Step 144544: {'lr': 1.675361654892571e-06, 'samples': 27752448, 'steps': 144543, 'loss/train': 1.737381100654602} 11/07/2021 17:35:10 - INFO - __main__ - Step 144545: {'lr': 1.6747483746030745e-06, 'samples': 27752640, 'steps': 144544, 'loss/train': 1.0129793882369995} 11/07/2021 17:35:11 - INFO - __main__ - Step 144546: {'lr': 1.6741352062051018e-06, 'samples': 27752832, 'steps': 144545, 'loss/train': 0.7068833708763123} 11/07/2021 17:35:11 - INFO - __main__ - Step 144547: {'lr': 1.6735221496989306e-06, 'samples': 27753024, 'steps': 144546, 'loss/train': 1.2218117713928223} 11/07/2021 17:35:12 - INFO - __main__ - Step 144548: {'lr': 1.6729092050848383e-06, 'samples': 27753216, 'steps': 144547, 'loss/train': 1.2760088443756104} 11/07/2021 17:35:12 - INFO - __main__ - Step 144549: {'lr': 1.6722963723631023e-06, 'samples': 27753408, 'steps': 144548, 'loss/train': 1.3291332721710205} 11/07/2021 17:35:13 - INFO - __main__ - Step 144550: {'lr': 1.6716836515340283e-06, 'samples': 27753600, 'steps': 144549, 'loss/train': 1.2458043098449707} 11/07/2021 17:35:13 - INFO - __main__ - Step 144551: {'lr': 1.6710710425978381e-06, 'samples': 27753792, 'steps': 144550, 'loss/train': 0.9519199728965759} 11/07/2021 17:35:13 - INFO - __main__ - Step 144552: {'lr': 1.670458545554837e-06, 'samples': 27753984, 'steps': 144551, 'loss/train': 0.536497175693512} 11/07/2021 17:35:14 - INFO - __main__ - Step 144553: {'lr': 1.669846160405275e-06, 'samples': 27754176, 'steps': 144552, 'loss/train': 1.2905309200286865} 11/07/2021 17:35:15 - INFO - __main__ - Step 144554: {'lr': 1.6692338871494573e-06, 'samples': 27754368, 'steps': 144553, 'loss/train': 1.432187557220459} 11/07/2021 17:35:15 - INFO - __main__ - Step 144555: {'lr': 1.6686217257876612e-06, 'samples': 27754560, 'steps': 144554, 'loss/train': 1.2737054824829102} 11/07/2021 17:35:15 - INFO - __main__ - Step 144556: {'lr': 1.6680096763201369e-06, 'samples': 27754752, 'steps': 144555, 'loss/train': 1.447285771369934} 11/07/2021 17:35:16 - INFO - __main__ - Step 144557: {'lr': 1.6673977387471895e-06, 'samples': 27754944, 'steps': 144556, 'loss/train': 1.3842418193817139} 11/07/2021 17:35:17 - INFO - __main__ - Step 144558: {'lr': 1.6667859130690689e-06, 'samples': 27755136, 'steps': 144557, 'loss/train': 1.559706211090088} 11/07/2021 17:35:17 - INFO - __main__ - Step 144559: {'lr': 1.6661741992860802e-06, 'samples': 27755328, 'steps': 144558, 'loss/train': 1.3848116397857666} 11/07/2021 17:35:18 - INFO - __main__ - Step 144560: {'lr': 1.6655625973984735e-06, 'samples': 27755520, 'steps': 144559, 'loss/train': 1.4311323165893555} 11/07/2021 17:35:18 - INFO - __main__ - Step 144561: {'lr': 1.664951107406526e-06, 'samples': 27755712, 'steps': 144560, 'loss/train': 1.4539859294891357} 11/07/2021 17:35:18 - INFO - __main__ - Step 144562: {'lr': 1.6643397293105156e-06, 'samples': 27755904, 'steps': 144561, 'loss/train': 1.265012264251709} 11/07/2021 17:35:19 - INFO - __main__ - Step 144563: {'lr': 1.6637284631107475e-06, 'samples': 27756096, 'steps': 144562, 'loss/train': 2.1354000568389893} 11/07/2021 17:35:20 - INFO - __main__ - Step 144564: {'lr': 1.6631173088074436e-06, 'samples': 27756288, 'steps': 144563, 'loss/train': 0.6458365321159363} 11/07/2021 17:35:20 - INFO - __main__ - Step 144565: {'lr': 1.6625062664009094e-06, 'samples': 27756480, 'steps': 144564, 'loss/train': 0.8141136765480042} 11/07/2021 17:35:20 - INFO - __main__ - Step 144566: {'lr': 1.6618953358914224e-06, 'samples': 27756672, 'steps': 144565, 'loss/train': 0.822769045829773} 11/07/2021 17:35:21 - INFO - __main__ - Step 144567: {'lr': 1.6612845172792601e-06, 'samples': 27756864, 'steps': 144566, 'loss/train': 1.3849366903305054} 11/07/2021 17:35:21 - INFO - __main__ - Step 144568: {'lr': 1.6606738105646723e-06, 'samples': 27757056, 'steps': 144567, 'loss/train': 1.1076709032058716} 11/07/2021 17:35:22 - INFO - __main__ - Step 144569: {'lr': 1.6600632157479922e-06, 'samples': 27757248, 'steps': 144568, 'loss/train': 1.033257007598877} 11/07/2021 17:35:22 - INFO - __main__ - Step 144570: {'lr': 1.659452732829414e-06, 'samples': 27757440, 'steps': 144569, 'loss/train': 1.2520273923873901} 11/07/2021 17:35:23 - INFO - __main__ - Step 144571: {'lr': 1.6588423618092707e-06, 'samples': 27757632, 'steps': 144570, 'loss/train': 0.204332634806633} 11/07/2021 17:35:23 - INFO - __main__ - Step 144572: {'lr': 1.6582321026878122e-06, 'samples': 27757824, 'steps': 144571, 'loss/train': 1.4674463272094727} 11/07/2021 17:35:24 - INFO - __main__ - Step 144573: {'lr': 1.6576219554653437e-06, 'samples': 27758016, 'steps': 144572, 'loss/train': 0.929963231086731} 11/07/2021 17:35:25 - INFO - __main__ - Step 144574: {'lr': 1.6570119201420875e-06, 'samples': 27758208, 'steps': 144573, 'loss/train': 1.6558092832565308} 11/07/2021 17:35:25 - INFO - __main__ - Step 144575: {'lr': 1.6564019967183762e-06, 'samples': 27758400, 'steps': 144574, 'loss/train': 1.3504446744918823} 11/07/2021 17:35:25 - INFO - __main__ - Step 144576: {'lr': 1.6557921851944601e-06, 'samples': 27758592, 'steps': 144575, 'loss/train': 0.703905463218689} 11/07/2021 17:35:26 - INFO - __main__ - Step 144577: {'lr': 1.6551824855705889e-06, 'samples': 27758784, 'steps': 144576, 'loss/train': 0.8027372360229492} 11/07/2021 17:35:26 - INFO - __main__ - Step 144578: {'lr': 1.6545728978470953e-06, 'samples': 27758976, 'steps': 144577, 'loss/train': 1.6126888990402222} 11/07/2021 17:35:27 - INFO - __main__ - Step 144579: {'lr': 1.653963422024174e-06, 'samples': 27759168, 'steps': 144578, 'loss/train': 1.4140580892562866} 11/07/2021 17:35:27 - INFO - __main__ - Step 144580: {'lr': 1.6533540581021855e-06, 'samples': 27759360, 'steps': 144579, 'loss/train': 1.6156010627746582} 11/07/2021 17:35:28 - INFO - __main__ - Step 144581: {'lr': 1.6527448060813245e-06, 'samples': 27759552, 'steps': 144580, 'loss/train': 1.5537550449371338} 11/07/2021 17:35:28 - INFO - __main__ - Step 144582: {'lr': 1.6521356659619236e-06, 'samples': 27759744, 'steps': 144581, 'loss/train': 1.2634685039520264} 11/07/2021 17:35:28 - INFO - __main__ - Step 144583: {'lr': 1.6515266377442327e-06, 'samples': 27759936, 'steps': 144582, 'loss/train': 1.1741671562194824} 11/07/2021 17:35:29 - INFO - __main__ - Step 144584: {'lr': 1.6509177214285575e-06, 'samples': 27760128, 'steps': 144583, 'loss/train': 0.8197630643844604} 11/07/2021 17:35:30 - INFO - __main__ - Step 144585: {'lr': 1.6503089170151197e-06, 'samples': 27760320, 'steps': 144584, 'loss/train': 1.7001073360443115} 11/07/2021 17:35:30 - INFO - __main__ - Step 144586: {'lr': 1.6497002245042248e-06, 'samples': 27760512, 'steps': 144585, 'loss/train': 1.2831321954727173} 11/07/2021 17:35:30 - INFO - __main__ - Step 144587: {'lr': 1.6490916438961501e-06, 'samples': 27760704, 'steps': 144586, 'loss/train': 1.4192904233932495} 11/07/2021 17:35:31 - INFO - __main__ - Step 144588: {'lr': 1.6484831751911455e-06, 'samples': 27760896, 'steps': 144587, 'loss/train': 1.4841790199279785} 11/07/2021 17:35:32 - INFO - __main__ - Step 144589: {'lr': 1.6478748183895164e-06, 'samples': 27761088, 'steps': 144588, 'loss/train': 0.4001653492450714} 11/07/2021 17:35:32 - INFO - __main__ - Step 144590: {'lr': 1.6472665734915405e-06, 'samples': 27761280, 'steps': 144589, 'loss/train': 1.2893743515014648} 11/07/2021 17:35:33 - INFO - __main__ - Step 144591: {'lr': 1.6466584404974394e-06, 'samples': 27761472, 'steps': 144590, 'loss/train': 1.3282771110534668} 11/07/2021 17:35:33 - INFO - __main__ - Step 144592: {'lr': 1.6460504194075466e-06, 'samples': 27761664, 'steps': 144591, 'loss/train': 1.7612751722335815} 11/07/2021 17:35:33 - INFO - __main__ - Step 144593: {'lr': 1.6454425102220838e-06, 'samples': 27761856, 'steps': 144592, 'loss/train': 1.6606510877609253} 11/07/2021 17:35:35 - INFO - __main__ - Step 144594: {'lr': 1.6448347129413844e-06, 'samples': 27762048, 'steps': 144593, 'loss/train': 1.5843446254730225} 11/07/2021 17:35:35 - INFO - __main__ - Step 144595: {'lr': 1.6442270275656702e-06, 'samples': 27762240, 'steps': 144594, 'loss/train': 1.3984034061431885} 11/07/2021 17:35:35 - INFO - __main__ - Step 144596: {'lr': 1.6436194540952464e-06, 'samples': 27762432, 'steps': 144595, 'loss/train': 1.3842461109161377} 11/07/2021 17:35:36 - INFO - __main__ - Step 144597: {'lr': 1.6430119925303632e-06, 'samples': 27762624, 'steps': 144596, 'loss/train': 1.1530628204345703} 11/07/2021 17:35:36 - INFO - __main__ - Step 144598: {'lr': 1.642404642871298e-06, 'samples': 27762816, 'steps': 144597, 'loss/train': 0.8383429646492004} 11/07/2021 17:35:36 - INFO - __main__ - Step 144599: {'lr': 1.641797405118356e-06, 'samples': 27763008, 'steps': 144598, 'loss/train': 1.6325538158416748} 11/07/2021 17:35:37 - INFO - __main__ - Step 144600: {'lr': 1.641190279271787e-06, 'samples': 27763200, 'steps': 144599, 'loss/train': 1.7206830978393555} 11/07/2021 17:35:38 - INFO - __main__ - Step 144601: {'lr': 1.6405832653318408e-06, 'samples': 27763392, 'steps': 144600, 'loss/train': 1.2369635105133057} 11/07/2021 17:35:38 - INFO - __main__ - Step 144602: {'lr': 1.639976363298823e-06, 'samples': 27763584, 'steps': 144601, 'loss/train': 1.1752957105636597} 11/07/2021 17:35:38 - INFO - __main__ - Step 144603: {'lr': 1.6393695731730384e-06, 'samples': 27763776, 'steps': 144602, 'loss/train': 1.296825885772705} 11/07/2021 17:35:39 - INFO - __main__ - Step 144604: {'lr': 1.6387628949546817e-06, 'samples': 27763968, 'steps': 144603, 'loss/train': 1.1024034023284912} 11/07/2021 17:35:39 - INFO - __main__ - Step 144605: {'lr': 1.638156328644086e-06, 'samples': 27764160, 'steps': 144604, 'loss/train': 0.6292383074760437} 11/07/2021 17:35:40 - INFO - __main__ - Step 144606: {'lr': 1.6375498742414729e-06, 'samples': 27764352, 'steps': 144605, 'loss/train': 1.2797125577926636} 11/07/2021 17:35:41 - INFO - __main__ - Step 144607: {'lr': 1.636943531747176e-06, 'samples': 27764544, 'steps': 144606, 'loss/train': 1.1672638654708862} 11/07/2021 17:35:41 - INFO - __main__ - Step 144608: {'lr': 1.636337301161417e-06, 'samples': 27764736, 'steps': 144607, 'loss/train': 1.3785194158554077} 11/07/2021 17:35:41 - INFO - __main__ - Step 144609: {'lr': 1.635731182484529e-06, 'samples': 27764928, 'steps': 144608, 'loss/train': 1.3432189226150513} 11/07/2021 17:35:42 - INFO - __main__ - Step 144610: {'lr': 1.6351251757167063e-06, 'samples': 27765120, 'steps': 144609, 'loss/train': 1.3157944679260254} 11/07/2021 17:35:43 - INFO - __main__ - Step 144611: {'lr': 1.634519280858282e-06, 'samples': 27765312, 'steps': 144610, 'loss/train': 1.5518354177474976} 11/07/2021 17:35:43 - INFO - __main__ - Step 144612: {'lr': 1.6339134979095062e-06, 'samples': 27765504, 'steps': 144611, 'loss/train': 1.089913010597229} 11/07/2021 17:35:43 - INFO - __main__ - Step 144613: {'lr': 1.6333078268706835e-06, 'samples': 27765696, 'steps': 144612, 'loss/train': 1.4767794609069824} 11/07/2021 17:35:44 - INFO - __main__ - Step 144614: {'lr': 1.6327022677420368e-06, 'samples': 27765888, 'steps': 144613, 'loss/train': 1.4055736064910889} 11/07/2021 17:35:44 - INFO - __main__ - Step 144615: {'lr': 1.632096820523843e-06, 'samples': 27766080, 'steps': 144614, 'loss/train': 1.1002541780471802} 11/07/2021 17:35:45 - INFO - __main__ - Step 144616: {'lr': 1.6314914852164352e-06, 'samples': 27766272, 'steps': 144615, 'loss/train': 1.4534847736358643} 11/07/2021 17:35:45 - INFO - __main__ - Step 144617: {'lr': 1.630886261820036e-06, 'samples': 27766464, 'steps': 144616, 'loss/train': 1.4007188081741333} 11/07/2021 17:35:46 - INFO - __main__ - Step 144618: {'lr': 1.6302811503348947e-06, 'samples': 27766656, 'steps': 144617, 'loss/train': 1.0365709066390991} 11/07/2021 17:35:46 - INFO - __main__ - Step 144619: {'lr': 1.6296761507613445e-06, 'samples': 27766848, 'steps': 144618, 'loss/train': 1.256648302078247} 11/07/2021 17:35:46 - INFO - __main__ - Step 144620: {'lr': 1.6290712630996073e-06, 'samples': 27767040, 'steps': 144619, 'loss/train': 0.9742711782455444} 11/07/2021 17:35:48 - INFO - __main__ - Step 144621: {'lr': 1.6284664873500165e-06, 'samples': 27767232, 'steps': 144620, 'loss/train': 2.2403626441955566} 11/07/2021 17:35:48 - INFO - __main__ - Step 144622: {'lr': 1.6278618235127662e-06, 'samples': 27767424, 'steps': 144621, 'loss/train': 1.262987494468689} 11/07/2021 17:35:48 - INFO - __main__ - Step 144623: {'lr': 1.6272572715881894e-06, 'samples': 27767616, 'steps': 144622, 'loss/train': 1.5108649730682373} 11/07/2021 17:35:49 - INFO - __main__ - Step 144624: {'lr': 1.626652831576536e-06, 'samples': 27767808, 'steps': 144623, 'loss/train': 1.3962541818618774} 11/07/2021 17:35:49 - INFO - __main__ - Step 144625: {'lr': 1.626048503478056e-06, 'samples': 27768000, 'steps': 144624, 'loss/train': 1.6170339584350586} 11/07/2021 17:35:50 - INFO - __main__ - Step 144626: {'lr': 1.6254442872930818e-06, 'samples': 27768192, 'steps': 144625, 'loss/train': 1.392716646194458} 11/07/2021 17:35:50 - INFO - __main__ - Step 144627: {'lr': 1.6248401830218362e-06, 'samples': 27768384, 'steps': 144626, 'loss/train': 1.6591612100601196} 11/07/2021 17:35:51 - INFO - __main__ - Step 144628: {'lr': 1.6242361906645963e-06, 'samples': 27768576, 'steps': 144627, 'loss/train': 1.3443149328231812} 11/07/2021 17:35:51 - INFO - __main__ - Step 144629: {'lr': 1.6236323102216398e-06, 'samples': 27768768, 'steps': 144628, 'loss/train': 0.3423522114753723} 11/07/2021 17:35:51 - INFO - __main__ - Step 144630: {'lr': 1.6230285416932722e-06, 'samples': 27768960, 'steps': 144629, 'loss/train': 1.077086329460144} 11/07/2021 17:35:52 - INFO - __main__ - Step 144631: {'lr': 1.6224248850797152e-06, 'samples': 27769152, 'steps': 144630, 'loss/train': 1.5102320909500122} 11/07/2021 17:35:53 - INFO - __main__ - Step 144632: {'lr': 1.6218213403812466e-06, 'samples': 27769344, 'steps': 144631, 'loss/train': 1.2262723445892334} 11/07/2021 17:35:53 - INFO - __main__ - Step 144633: {'lr': 1.6212179075981714e-06, 'samples': 27769536, 'steps': 144632, 'loss/train': 1.305319905281067} 11/07/2021 17:35:54 - INFO - __main__ - Step 144634: {'lr': 1.6206145867307397e-06, 'samples': 27769728, 'steps': 144633, 'loss/train': 1.2348554134368896} 11/07/2021 17:35:54 - INFO - __main__ - Step 144635: {'lr': 1.620011377779229e-06, 'samples': 27769920, 'steps': 144634, 'loss/train': 2.022523880004883} 11/07/2021 17:35:54 - INFO - __main__ - Step 144636: {'lr': 1.619408280743917e-06, 'samples': 27770112, 'steps': 144635, 'loss/train': 0.9902563095092773} 11/07/2021 17:35:55 - INFO - __main__ - Step 144637: {'lr': 1.6188052956250533e-06, 'samples': 27770304, 'steps': 144636, 'loss/train': 0.9573137164115906} 11/07/2021 17:35:56 - INFO - __main__ - Step 144638: {'lr': 1.6182024224229152e-06, 'samples': 27770496, 'steps': 144637, 'loss/train': 1.4134429693222046} 11/07/2021 17:35:56 - INFO - __main__ - Step 144639: {'lr': 1.6175996611377808e-06, 'samples': 27770688, 'steps': 144638, 'loss/train': 1.3956024646759033} 11/07/2021 17:35:57 - INFO - __main__ - Step 144640: {'lr': 1.6169970117699273e-06, 'samples': 27770880, 'steps': 144639, 'loss/train': 1.4464465379714966} 11/07/2021 17:35:57 - INFO - __main__ - Step 144641: {'lr': 1.6163944743196323e-06, 'samples': 27771072, 'steps': 144640, 'loss/train': 0.6227352023124695} 11/07/2021 17:35:58 - INFO - __main__ - Step 144642: {'lr': 1.6157920487871458e-06, 'samples': 27771264, 'steps': 144641, 'loss/train': 0.7815136909484863} 11/07/2021 17:35:58 - INFO - __main__ - Step 144643: {'lr': 1.6151897351727728e-06, 'samples': 27771456, 'steps': 144642, 'loss/train': 1.1894972324371338} 11/07/2021 17:35:59 - INFO - __main__ - Step 144644: {'lr': 1.6145875334767635e-06, 'samples': 27771648, 'steps': 144643, 'loss/train': 0.8612644672393799} 11/07/2021 17:35:59 - INFO - __main__ - Step 144645: {'lr': 1.6139854436993673e-06, 'samples': 27771840, 'steps': 144644, 'loss/train': 2.028494119644165} 11/07/2021 17:35:59 - INFO - __main__ - Step 144646: {'lr': 1.6133834658408898e-06, 'samples': 27772032, 'steps': 144645, 'loss/train': 1.4808909893035889} 11/07/2021 17:36:00 - INFO - __main__ - Step 144647: {'lr': 1.6127815999015805e-06, 'samples': 27772224, 'steps': 144646, 'loss/train': 1.1017677783966064} 11/07/2021 17:36:01 - INFO - __main__ - Step 144648: {'lr': 1.612179845881717e-06, 'samples': 27772416, 'steps': 144647, 'loss/train': 1.3218389749526978} 11/07/2021 17:36:01 - INFO - __main__ - Step 144649: {'lr': 1.6115782037815497e-06, 'samples': 27772608, 'steps': 144648, 'loss/train': 1.291529893875122} 11/07/2021 17:36:01 - INFO - __main__ - Step 144650: {'lr': 1.6109766736014109e-06, 'samples': 27772800, 'steps': 144649, 'loss/train': 1.1937143802642822} 11/07/2021 17:36:02 - INFO - __main__ - Step 144651: {'lr': 1.610375255341523e-06, 'samples': 27772992, 'steps': 144650, 'loss/train': 1.1915258169174194} 11/07/2021 17:36:03 - INFO - __main__ - Step 144652: {'lr': 1.6097739490021634e-06, 'samples': 27773184, 'steps': 144651, 'loss/train': 1.4446500539779663} 11/07/2021 17:36:03 - INFO - __main__ - Step 144653: {'lr': 1.60917275458361e-06, 'samples': 27773376, 'steps': 144652, 'loss/train': 1.1599059104919434} 11/07/2021 17:36:04 - INFO - __main__ - Step 144654: {'lr': 1.608571672086112e-06, 'samples': 27773568, 'steps': 144653, 'loss/train': 1.308449149131775} 11/07/2021 17:36:04 - INFO - __main__ - Step 144655: {'lr': 1.6079707015099755e-06, 'samples': 27773760, 'steps': 144654, 'loss/train': 1.680395483970642} 11/07/2021 17:36:04 - INFO - __main__ - Step 144656: {'lr': 1.6073698428554494e-06, 'samples': 27773952, 'steps': 144655, 'loss/train': 1.2079622745513916} 11/07/2021 17:36:05 - INFO - __main__ - Step 144657: {'lr': 1.6067690961228398e-06, 'samples': 27774144, 'steps': 144656, 'loss/train': 0.44552478194236755} 11/07/2021 17:36:06 - INFO - __main__ - Step 144658: {'lr': 1.6061684613123407e-06, 'samples': 27774336, 'steps': 144657, 'loss/train': 1.2262290716171265} 11/07/2021 17:36:06 - INFO - __main__ - Step 144659: {'lr': 1.605567938424285e-06, 'samples': 27774528, 'steps': 144658, 'loss/train': 1.3328685760498047} 11/07/2021 17:36:07 - INFO - __main__ - Step 144660: {'lr': 1.6049675274589503e-06, 'samples': 27774720, 'steps': 144659, 'loss/train': 1.850322961807251} 11/07/2021 17:36:07 - INFO - __main__ - Step 144661: {'lr': 1.604367228416559e-06, 'samples': 27774912, 'steps': 144660, 'loss/train': 1.2541768550872803} 11/07/2021 17:36:07 - INFO - __main__ - Step 144662: {'lr': 1.603767041297416e-06, 'samples': 27775104, 'steps': 144661, 'loss/train': 1.4882150888442993} 11/07/2021 17:36:08 - INFO - __main__ - Step 144663: {'lr': 1.6031669661017712e-06, 'samples': 27775296, 'steps': 144662, 'loss/train': 1.4029241800308228} 11/07/2021 17:36:09 - INFO - __main__ - Step 144664: {'lr': 1.60256700282993e-06, 'samples': 27775488, 'steps': 144663, 'loss/train': 1.711363434791565} 11/07/2021 17:36:09 - INFO - __main__ - Step 144665: {'lr': 1.6019671514821145e-06, 'samples': 27775680, 'steps': 144664, 'loss/train': 0.7234158515930176} 11/07/2021 17:36:09 - INFO - __main__ - Step 144666: {'lr': 1.6013674120586297e-06, 'samples': 27775872, 'steps': 144665, 'loss/train': 1.5506134033203125} 11/07/2021 17:36:10 - INFO - __main__ - Step 144667: {'lr': 1.6007677845597257e-06, 'samples': 27776064, 'steps': 144666, 'loss/train': 1.3498376607894897} 11/07/2021 17:36:11 - INFO - __main__ - Step 144668: {'lr': 1.6001682689857077e-06, 'samples': 27776256, 'steps': 144667, 'loss/train': 1.447580337524414} 11/07/2021 17:36:11 - INFO - __main__ - Step 144669: {'lr': 1.5995688653367978e-06, 'samples': 27776448, 'steps': 144668, 'loss/train': 1.1549358367919922} 11/07/2021 17:36:11 - INFO - __main__ - Step 144670: {'lr': 1.5989695736133013e-06, 'samples': 27776640, 'steps': 144669, 'loss/train': 1.2904380559921265} 11/07/2021 17:36:12 - INFO - __main__ - Step 144671: {'lr': 1.5983703938154681e-06, 'samples': 27776832, 'steps': 144670, 'loss/train': 1.3954812288284302} 11/07/2021 17:36:12 - INFO - __main__ - Step 144672: {'lr': 1.5977713259435755e-06, 'samples': 27777024, 'steps': 144671, 'loss/train': 1.7592782974243164} 11/07/2021 17:36:13 - INFO - __main__ - Step 144673: {'lr': 1.5971723699979013e-06, 'samples': 27777216, 'steps': 144672, 'loss/train': 1.4678281545639038} 11/07/2021 17:36:14 - INFO - __main__ - Step 144674: {'lr': 1.596573525978695e-06, 'samples': 27777408, 'steps': 144673, 'loss/train': 1.247337818145752} 11/07/2021 17:36:14 - INFO - __main__ - Step 144675: {'lr': 1.5959747938862624e-06, 'samples': 27777600, 'steps': 144674, 'loss/train': 1.187995195388794} 11/07/2021 17:36:14 - INFO - __main__ - Step 144676: {'lr': 1.595376173720825e-06, 'samples': 27777792, 'steps': 144675, 'loss/train': 0.8832045793533325} 11/07/2021 17:36:15 - INFO - __main__ - Step 144677: {'lr': 1.5947776654826884e-06, 'samples': 27777984, 'steps': 144676, 'loss/train': 1.3441226482391357} 11/07/2021 17:36:16 - INFO - __main__ - Step 144678: {'lr': 1.5941792691721302e-06, 'samples': 27778176, 'steps': 144677, 'loss/train': 1.2387032508850098} 11/07/2021 17:36:16 - INFO - __main__ - Step 144679: {'lr': 1.5935809847893724e-06, 'samples': 27778368, 'steps': 144678, 'loss/train': 1.4040789604187012} 11/07/2021 17:36:16 - INFO - __main__ - Step 144680: {'lr': 1.59298281233472e-06, 'samples': 27778560, 'steps': 144679, 'loss/train': 1.0703458786010742} 11/07/2021 17:36:17 - INFO - __main__ - Step 144681: {'lr': 1.592384751808451e-06, 'samples': 27778752, 'steps': 144680, 'loss/train': 1.0308982133865356} 11/07/2021 17:36:17 - INFO - __main__ - Step 144682: {'lr': 1.591786803210815e-06, 'samples': 27778944, 'steps': 144681, 'loss/train': 1.7761955261230469} 11/07/2021 17:36:18 - INFO - __main__ - Step 144683: {'lr': 1.5911889665420898e-06, 'samples': 27779136, 'steps': 144682, 'loss/train': 1.5883690118789673} 11/07/2021 17:36:18 - INFO - __main__ - Step 144684: {'lr': 1.5905912418025524e-06, 'samples': 27779328, 'steps': 144683, 'loss/train': 1.4848452806472778} 11/07/2021 17:36:19 - INFO - __main__ - Step 144685: {'lr': 1.5899936289924255e-06, 'samples': 27779520, 'steps': 144684, 'loss/train': 1.752434253692627} 11/07/2021 17:36:19 - INFO - __main__ - Step 144686: {'lr': 1.5893961281120416e-06, 'samples': 27779712, 'steps': 144685, 'loss/train': 0.9833414554595947} 11/07/2021 17:36:19 - INFO - __main__ - Step 144687: {'lr': 1.5887987391616231e-06, 'samples': 27779904, 'steps': 144686, 'loss/train': 1.1546728610992432} 11/07/2021 17:36:20 - INFO - __main__ - Step 144688: {'lr': 1.5882014621414754e-06, 'samples': 27780096, 'steps': 144687, 'loss/train': 1.2459001541137695} 11/07/2021 17:36:21 - INFO - __main__ - Step 144689: {'lr': 1.5876042970518478e-06, 'samples': 27780288, 'steps': 144688, 'loss/train': 1.5565544366836548} 11/07/2021 17:36:21 - INFO - __main__ - Step 144690: {'lr': 1.5870072438930183e-06, 'samples': 27780480, 'steps': 144689, 'loss/train': 1.534447193145752} 11/07/2021 17:36:21 - INFO - __main__ - Step 144691: {'lr': 1.5864103026652367e-06, 'samples': 27780672, 'steps': 144690, 'loss/train': 1.025070071220398} 11/07/2021 17:36:22 - INFO - __main__ - Step 144692: {'lr': 1.5858134733687801e-06, 'samples': 27780864, 'steps': 144691, 'loss/train': 1.399423599243164} 11/07/2021 17:36:22 - INFO - __main__ - Step 144693: {'lr': 1.5852167560039265e-06, 'samples': 27781056, 'steps': 144692, 'loss/train': 1.3525760173797607} 11/07/2021 17:36:24 - INFO - __main__ - Step 144694: {'lr': 1.5846201505709534e-06, 'samples': 27781248, 'steps': 144693, 'loss/train': 1.475429892539978} 11/07/2021 17:36:24 - INFO - __main__ - Step 144695: {'lr': 1.5840236570701106e-06, 'samples': 27781440, 'steps': 144694, 'loss/train': 1.4099594354629517} 11/07/2021 17:36:25 - INFO - __main__ - Step 144696: {'lr': 1.5834272755016755e-06, 'samples': 27781632, 'steps': 144695, 'loss/train': 1.573475956916809} 11/07/2021 17:36:25 - INFO - __main__ - Step 144697: {'lr': 1.5828310058659256e-06, 'samples': 27781824, 'steps': 144696, 'loss/train': 1.7446943521499634} 11/07/2021 17:36:25 - INFO - __main__ - Step 144698: {'lr': 1.582234848163111e-06, 'samples': 27782016, 'steps': 144697, 'loss/train': 1.7298592329025269} 11/07/2021 17:36:26 - INFO - __main__ - Step 144699: {'lr': 1.581638802393509e-06, 'samples': 27782208, 'steps': 144698, 'loss/train': 1.167361855506897} 11/07/2021 17:36:26 - INFO - __main__ - Step 144700: {'lr': 1.5810428685573698e-06, 'samples': 27782400, 'steps': 144699, 'loss/train': 1.263481855392456} 11/07/2021 17:36:27 - INFO - __main__ - Step 144701: {'lr': 1.580447046654998e-06, 'samples': 27782592, 'steps': 144700, 'loss/train': 0.970771074295044} 11/07/2021 17:36:28 - INFO - __main__ - Step 144702: {'lr': 1.579851336686644e-06, 'samples': 27782784, 'steps': 144701, 'loss/train': 1.0952749252319336} 11/07/2021 17:36:28 - INFO - __main__ - Step 144703: {'lr': 1.5792557386525574e-06, 'samples': 27782976, 'steps': 144702, 'loss/train': 1.2828574180603027} 11/07/2021 17:36:28 - INFO - __main__ - Step 144704: {'lr': 1.5786602525530435e-06, 'samples': 27783168, 'steps': 144703, 'loss/train': 1.547931432723999} 11/07/2021 17:36:29 - INFO - __main__ - Step 144705: {'lr': 1.5780648783883522e-06, 'samples': 27783360, 'steps': 144704, 'loss/train': 1.163017988204956} 11/07/2021 17:36:30 - INFO - __main__ - Step 144706: {'lr': 1.5774696161587332e-06, 'samples': 27783552, 'steps': 144705, 'loss/train': 1.6538006067276} 11/07/2021 17:36:30 - INFO - __main__ - Step 144707: {'lr': 1.5768744658644919e-06, 'samples': 27783744, 'steps': 144706, 'loss/train': 0.7353101968765259} 11/07/2021 17:36:30 - INFO - __main__ - Step 144708: {'lr': 1.57627942750585e-06, 'samples': 27783936, 'steps': 144707, 'loss/train': 1.262270212173462} 11/07/2021 17:36:31 - INFO - __main__ - Step 144709: {'lr': 1.5756845010831412e-06, 'samples': 27784128, 'steps': 144708, 'loss/train': 1.3565864562988281} 11/07/2021 17:36:31 - INFO - __main__ - Step 144710: {'lr': 1.5750896865965592e-06, 'samples': 27784320, 'steps': 144709, 'loss/train': 1.4453831911087036} 11/07/2021 17:36:32 - INFO - __main__ - Step 144711: {'lr': 1.5744949840464372e-06, 'samples': 27784512, 'steps': 144710, 'loss/train': 1.5844504833221436} 11/07/2021 17:36:32 - INFO - __main__ - Step 144712: {'lr': 1.5739003934329977e-06, 'samples': 27784704, 'steps': 144711, 'loss/train': 0.9417652487754822} 11/07/2021 17:36:33 - INFO - __main__ - Step 144713: {'lr': 1.5733059147565454e-06, 'samples': 27784896, 'steps': 144712, 'loss/train': 1.5657387971878052} 11/07/2021 17:36:33 - INFO - __main__ - Step 144714: {'lr': 1.5727115480173027e-06, 'samples': 27785088, 'steps': 144713, 'loss/train': 1.2517973184585571} 11/07/2021 17:36:33 - INFO - __main__ - Step 144715: {'lr': 1.5721172932155746e-06, 'samples': 27785280, 'steps': 144714, 'loss/train': 1.4316082000732422} 11/07/2021 17:36:34 - INFO - __main__ - Step 144716: {'lr': 1.5715231503516114e-06, 'samples': 27785472, 'steps': 144715, 'loss/train': 1.285827398300171} 11/07/2021 17:36:35 - INFO - __main__ - Step 144717: {'lr': 1.5709291194256903e-06, 'samples': 27785664, 'steps': 144716, 'loss/train': 0.8966829180717468} 11/07/2021 17:36:35 - INFO - __main__ - Step 144718: {'lr': 1.5703352004380889e-06, 'samples': 27785856, 'steps': 144717, 'loss/train': 1.5291223526000977} 11/07/2021 17:36:36 - INFO - __main__ - Step 144719: {'lr': 1.5697413933890292e-06, 'samples': 27786048, 'steps': 144718, 'loss/train': 0.8487987518310547} 11/07/2021 17:36:36 - INFO - __main__ - Step 144720: {'lr': 1.5691476982788445e-06, 'samples': 27786240, 'steps': 144719, 'loss/train': 1.1374070644378662} 11/07/2021 17:36:36 - INFO - __main__ - Step 144721: {'lr': 1.5685541151077566e-06, 'samples': 27786432, 'steps': 144720, 'loss/train': 1.1224409341812134} 11/07/2021 17:36:37 - INFO - __main__ - Step 144722: {'lr': 1.5679606438760152e-06, 'samples': 27786624, 'steps': 144721, 'loss/train': 1.3039427995681763} 11/07/2021 17:36:38 - INFO - __main__ - Step 144723: {'lr': 1.5673672845839538e-06, 'samples': 27786816, 'steps': 144722, 'loss/train': 1.0434141159057617} 11/07/2021 17:36:38 - INFO - __main__ - Step 144724: {'lr': 1.566774037231794e-06, 'samples': 27787008, 'steps': 144723, 'loss/train': 0.984775960445404} 11/07/2021 17:36:38 - INFO - __main__ - Step 144725: {'lr': 1.5661809018198138e-06, 'samples': 27787200, 'steps': 144724, 'loss/train': 0.929021418094635} 11/07/2021 17:36:39 - INFO - __main__ - Step 144726: {'lr': 1.5655878783482902e-06, 'samples': 27787392, 'steps': 144725, 'loss/train': 1.3845584392547607} 11/07/2021 17:36:40 - INFO - __main__ - Step 144727: {'lr': 1.5649949668174457e-06, 'samples': 27787584, 'steps': 144726, 'loss/train': 1.4083763360977173} 11/07/2021 17:36:40 - INFO - __main__ - Step 144728: {'lr': 1.5644021672276131e-06, 'samples': 27787776, 'steps': 144727, 'loss/train': 1.3123738765716553} 11/07/2021 17:36:40 - INFO - __main__ - Step 144729: {'lr': 1.5638094795790147e-06, 'samples': 27787968, 'steps': 144728, 'loss/train': 1.3687888383865356} 11/07/2021 17:36:41 - INFO - __main__ - Step 144730: {'lr': 1.563216903871928e-06, 'samples': 27788160, 'steps': 144729, 'loss/train': 1.2309081554412842} 11/07/2021 17:36:41 - INFO - __main__ - Step 144731: {'lr': 1.5626244401066302e-06, 'samples': 27788352, 'steps': 144730, 'loss/train': 1.5637454986572266} 11/07/2021 17:36:42 - INFO - __main__ - Step 144732: {'lr': 1.5620320882833717e-06, 'samples': 27788544, 'steps': 144731, 'loss/train': 1.4285379648208618} 11/07/2021 17:36:43 - INFO - __main__ - Step 144733: {'lr': 1.5614398484024295e-06, 'samples': 27788736, 'steps': 144732, 'loss/train': 1.4963130950927734} 11/07/2021 17:36:43 - INFO - __main__ - Step 144734: {'lr': 1.5608477204640536e-06, 'samples': 27788928, 'steps': 144733, 'loss/train': 1.3604798316955566} 11/07/2021 17:36:43 - INFO - __main__ - Step 144735: {'lr': 1.5602557044685496e-06, 'samples': 27789120, 'steps': 144734, 'loss/train': 0.9455098509788513} 11/07/2021 17:36:44 - INFO - __main__ - Step 144736: {'lr': 1.559663800416139e-06, 'samples': 27789312, 'steps': 144735, 'loss/train': 1.3775185346603394} 11/07/2021 17:36:45 - INFO - __main__ - Step 144737: {'lr': 1.5590720083071275e-06, 'samples': 27789504, 'steps': 144736, 'loss/train': 1.2081339359283447} 11/07/2021 17:36:45 - INFO - __main__ - Step 144738: {'lr': 1.5584803281417647e-06, 'samples': 27789696, 'steps': 144737, 'loss/train': 0.8698846697807312} 11/07/2021 17:36:45 - INFO - __main__ - Step 144739: {'lr': 1.5578887599203283e-06, 'samples': 27789888, 'steps': 144738, 'loss/train': 1.1396561861038208} 11/07/2021 17:36:46 - INFO - __main__ - Step 144740: {'lr': 1.5572973036430405e-06, 'samples': 27790080, 'steps': 144739, 'loss/train': 1.6690819263458252} 11/07/2021 17:36:46 - INFO - __main__ - Step 144741: {'lr': 1.556705959310234e-06, 'samples': 27790272, 'steps': 144740, 'loss/train': 0.712466835975647} 11/07/2021 17:36:47 - INFO - __main__ - Step 144742: {'lr': 1.556114726922131e-06, 'samples': 27790464, 'steps': 144741, 'loss/train': 1.2382726669311523} 11/07/2021 17:36:48 - INFO - __main__ - Step 144743: {'lr': 1.5555236064789813e-06, 'samples': 27790656, 'steps': 144742, 'loss/train': 0.49991902709007263} 11/07/2021 17:36:48 - INFO - __main__ - Step 144744: {'lr': 1.554932597981118e-06, 'samples': 27790848, 'steps': 144743, 'loss/train': 1.1309148073196411} 11/07/2021 17:36:48 - INFO - __main__ - Step 144745: {'lr': 1.5543417014287353e-06, 'samples': 27791040, 'steps': 144744, 'loss/train': 1.449164628982544} 11/07/2021 17:36:49 - INFO - __main__ - Step 144746: {'lr': 1.5537509168221665e-06, 'samples': 27791232, 'steps': 144745, 'loss/train': 1.4627407789230347} 11/07/2021 17:36:49 - INFO - __main__ - Step 144747: {'lr': 1.5531602441616332e-06, 'samples': 27791424, 'steps': 144746, 'loss/train': 1.553440809249878} 11/07/2021 17:36:50 - INFO - __main__ - Step 144748: {'lr': 1.5525696834473857e-06, 'samples': 27791616, 'steps': 144747, 'loss/train': 1.0249749422073364} 11/07/2021 17:36:51 - INFO - __main__ - Step 144749: {'lr': 1.5519792346797568e-06, 'samples': 27791808, 'steps': 144748, 'loss/train': 0.9174311757087708} 11/07/2021 17:36:51 - INFO - __main__ - Step 144750: {'lr': 1.5513888978589407e-06, 'samples': 27792000, 'steps': 144749, 'loss/train': 1.1165814399719238} 11/07/2021 17:36:51 - INFO - __main__ - Step 144751: {'lr': 1.550798672985243e-06, 'samples': 27792192, 'steps': 144750, 'loss/train': 1.3927375078201294} 11/07/2021 17:36:52 - INFO - __main__ - Step 144752: {'lr': 1.5502085600589411e-06, 'samples': 27792384, 'steps': 144751, 'loss/train': 1.0888761281967163} 11/07/2021 17:36:53 - INFO - __main__ - Step 144753: {'lr': 1.549618559080257e-06, 'samples': 27792576, 'steps': 144752, 'loss/train': 1.053794264793396} 11/07/2021 17:36:53 - INFO - __main__ - Step 144754: {'lr': 1.549028670049496e-06, 'samples': 27792768, 'steps': 144753, 'loss/train': 1.688409686088562} 11/07/2021 17:36:54 - INFO - __main__ - Step 144755: {'lr': 1.548438892966908e-06, 'samples': 27792960, 'steps': 144754, 'loss/train': 1.3005834817886353} 11/07/2021 17:36:54 - INFO - __main__ - Step 144756: {'lr': 1.5478492278327426e-06, 'samples': 27793152, 'steps': 144755, 'loss/train': 1.0752352476119995} 11/07/2021 17:36:54 - INFO - __main__ - Step 144757: {'lr': 1.5472596746473056e-06, 'samples': 27793344, 'steps': 144756, 'loss/train': 1.1847106218338013} 11/07/2021 17:36:55 - INFO - __main__ - Step 144758: {'lr': 1.5466702334108185e-06, 'samples': 27793536, 'steps': 144757, 'loss/train': 1.8166155815124512} 11/07/2021 17:36:56 - INFO - __main__ - Step 144759: {'lr': 1.5460809041235591e-06, 'samples': 27793728, 'steps': 144758, 'loss/train': 1.367729902267456} 11/07/2021 17:36:56 - INFO - __main__ - Step 144760: {'lr': 1.5454916867858327e-06, 'samples': 27793920, 'steps': 144759, 'loss/train': 1.3267054557800293} 11/07/2021 17:36:56 - INFO - __main__ - Step 144761: {'lr': 1.5449025813978613e-06, 'samples': 27794112, 'steps': 144760, 'loss/train': 1.4485586881637573} 11/07/2021 17:36:57 - INFO - __main__ - Step 144762: {'lr': 1.5443135879599224e-06, 'samples': 27794304, 'steps': 144761, 'loss/train': 1.441062092781067} 11/07/2021 17:36:58 - INFO - __main__ - Step 144763: {'lr': 1.5437247064722937e-06, 'samples': 27794496, 'steps': 144762, 'loss/train': 1.6274707317352295} 11/07/2021 17:36:58 - INFO - __main__ - Step 144764: {'lr': 1.543135936935225e-06, 'samples': 27794688, 'steps': 144763, 'loss/train': 1.8721972703933716} 11/07/2021 17:36:58 - INFO - __main__ - Step 144765: {'lr': 1.5425472793489659e-06, 'samples': 27794880, 'steps': 144764, 'loss/train': 1.0872728824615479} 11/07/2021 17:36:59 - INFO - __main__ - Step 144766: {'lr': 1.541958733713822e-06, 'samples': 27795072, 'steps': 144765, 'loss/train': 1.2658292055130005} 11/07/2021 17:36:59 - INFO - __main__ - Step 144767: {'lr': 1.541370300030015e-06, 'samples': 27795264, 'steps': 144766, 'loss/train': 1.0268018245697021} 11/07/2021 17:37:00 - INFO - __main__ - Step 144768: {'lr': 1.5407819782978504e-06, 'samples': 27795456, 'steps': 144767, 'loss/train': 1.3832242488861084} 11/07/2021 17:37:01 - INFO - __main__ - Step 144769: {'lr': 1.5401937685175781e-06, 'samples': 27795648, 'steps': 144768, 'loss/train': 1.1626518964767456} 11/07/2021 17:37:01 - INFO - __main__ - Step 144770: {'lr': 1.5396056706894478e-06, 'samples': 27795840, 'steps': 144769, 'loss/train': 0.8935350179672241} 11/07/2021 17:37:01 - INFO - __main__ - Step 144771: {'lr': 1.5390176848137371e-06, 'samples': 27796032, 'steps': 144770, 'loss/train': 0.11231900006532669} 11/07/2021 17:37:02 - INFO - __main__ - Step 144772: {'lr': 1.5384298108907236e-06, 'samples': 27796224, 'steps': 144771, 'loss/train': 1.1068696975708008} 11/07/2021 17:37:02 - INFO - __main__ - Step 144773: {'lr': 1.5378420489206568e-06, 'samples': 27796416, 'steps': 144772, 'loss/train': 1.3958535194396973} 11/07/2021 17:37:03 - INFO - __main__ - Step 144774: {'lr': 1.5372543989037867e-06, 'samples': 27796608, 'steps': 144773, 'loss/train': 1.766416072845459} 11/07/2021 17:37:04 - INFO - __main__ - Step 144775: {'lr': 1.5366668608404188e-06, 'samples': 27796800, 'steps': 144774, 'loss/train': 1.479247808456421} 11/07/2021 17:37:04 - INFO - __main__ - Step 144776: {'lr': 1.5360794347307749e-06, 'samples': 27796992, 'steps': 144775, 'loss/train': 0.8172323703765869} 11/07/2021 17:37:04 - INFO - __main__ - Step 144777: {'lr': 1.5354921205751603e-06, 'samples': 27797184, 'steps': 144776, 'loss/train': 1.2661073207855225} 11/07/2021 17:37:05 - INFO - __main__ - Step 144778: {'lr': 1.5349049183737973e-06, 'samples': 27797376, 'steps': 144777, 'loss/train': 0.979292094707489} 11/07/2021 17:37:06 - INFO - __main__ - Step 144779: {'lr': 1.534317828126991e-06, 'samples': 27797568, 'steps': 144778, 'loss/train': 1.5840998888015747} 11/07/2021 17:37:06 - INFO - __main__ - Step 144780: {'lr': 1.5337308498349911e-06, 'samples': 27797760, 'steps': 144779, 'loss/train': 0.9357437491416931} 11/07/2021 17:37:07 - INFO - __main__ - Step 144781: {'lr': 1.5331439834980477e-06, 'samples': 27797952, 'steps': 144780, 'loss/train': 1.2189303636550903} 11/07/2021 17:37:07 - INFO - __main__ - Step 144782: {'lr': 1.5325572291164102e-06, 'samples': 27798144, 'steps': 144781, 'loss/train': 1.218016266822815} 11/07/2021 17:37:07 - INFO - __main__ - Step 144783: {'lr': 1.531970586690412e-06, 'samples': 27798336, 'steps': 144782, 'loss/train': 1.1500407457351685} 11/07/2021 17:37:08 - INFO - __main__ - Step 144784: {'lr': 1.5313840562202475e-06, 'samples': 27798528, 'steps': 144783, 'loss/train': 0.7002424001693726} 11/07/2021 17:37:09 - INFO - __main__ - Step 144785: {'lr': 1.5307976377062216e-06, 'samples': 27798720, 'steps': 144784, 'loss/train': 1.5804965496063232} 11/07/2021 17:37:09 - INFO - __main__ - Step 144786: {'lr': 1.5302113311485843e-06, 'samples': 27798912, 'steps': 144785, 'loss/train': 1.1741080284118652} 11/07/2021 17:37:10 - INFO - __main__ - Step 144787: {'lr': 1.5296251365475855e-06, 'samples': 27799104, 'steps': 144786, 'loss/train': 1.3177590370178223} 11/07/2021 17:37:10 - INFO - __main__ - Step 144788: {'lr': 1.5290390539035027e-06, 'samples': 27799296, 'steps': 144787, 'loss/train': 1.4283291101455688} 11/07/2021 17:37:11 - INFO - __main__ - Step 144789: {'lr': 1.5284530832166132e-06, 'samples': 27799488, 'steps': 144788, 'loss/train': 1.2823405265808105} 11/07/2021 17:37:11 - INFO - __main__ - Step 144790: {'lr': 1.5278672244871671e-06, 'samples': 27799680, 'steps': 144789, 'loss/train': 1.5670313835144043} 11/07/2021 17:37:12 - INFO - __main__ - Step 144791: {'lr': 1.5272814777154142e-06, 'samples': 27799872, 'steps': 144790, 'loss/train': 1.0483356714248657} 11/07/2021 17:37:12 - INFO - __main__ - Step 144792: {'lr': 1.5266958429016597e-06, 'samples': 27800064, 'steps': 144791, 'loss/train': 1.0342257022857666} 11/07/2021 17:37:12 - INFO - __main__ - Step 144793: {'lr': 1.5261103200461257e-06, 'samples': 27800256, 'steps': 144792, 'loss/train': 0.9836145043373108} 11/07/2021 17:37:13 - INFO - __main__ - Step 144794: {'lr': 1.5255249091490897e-06, 'samples': 27800448, 'steps': 144793, 'loss/train': 1.203210711479187} 11/07/2021 17:37:14 - INFO - __main__ - Step 144795: {'lr': 1.5249396102108294e-06, 'samples': 27800640, 'steps': 144794, 'loss/train': 1.5502421855926514} 11/07/2021 17:37:14 - INFO - __main__ - Step 144796: {'lr': 1.5243544232315942e-06, 'samples': 27800832, 'steps': 144795, 'loss/train': 1.351829171180725} 11/07/2021 17:37:14 - INFO - __main__ - Step 144797: {'lr': 1.5237693482116345e-06, 'samples': 27801024, 'steps': 144796, 'loss/train': 1.3001924753189087} 11/07/2021 17:37:15 - INFO - __main__ - Step 144798: {'lr': 1.523184385151255e-06, 'samples': 27801216, 'steps': 144797, 'loss/train': 0.8090242743492126} 11/07/2021 17:37:16 - INFO - __main__ - Step 144799: {'lr': 1.5225995340506782e-06, 'samples': 27801408, 'steps': 144798, 'loss/train': 1.0677552223205566} 11/07/2021 17:37:16 - INFO - __main__ - Step 144800: {'lr': 1.5220147949101815e-06, 'samples': 27801600, 'steps': 144799, 'loss/train': 1.269116759300232} 11/07/2021 17:37:16 - INFO - __main__ - Step 144801: {'lr': 1.5214301677300425e-06, 'samples': 27801792, 'steps': 144800, 'loss/train': 1.379325270652771} 11/07/2021 17:37:17 - INFO - __main__ - Step 144802: {'lr': 1.5208456525105107e-06, 'samples': 27801984, 'steps': 144801, 'loss/train': 1.2056647539138794} 11/07/2021 17:37:17 - INFO - __main__ - Step 144803: {'lr': 1.5202612492518365e-06, 'samples': 27802176, 'steps': 144802, 'loss/train': 1.2964812517166138} 11/07/2021 17:37:18 - INFO - __main__ - Step 144804: {'lr': 1.5196769579542967e-06, 'samples': 27802368, 'steps': 144803, 'loss/train': 1.2074202299118042} 11/07/2021 17:37:19 - INFO - __main__ - Step 144805: {'lr': 1.5190927786181973e-06, 'samples': 27802560, 'steps': 144804, 'loss/train': 0.99290931224823} 11/07/2021 17:37:19 - INFO - __main__ - Step 144806: {'lr': 1.5185087112437323e-06, 'samples': 27802752, 'steps': 144805, 'loss/train': 1.6287145614624023} 11/07/2021 17:37:19 - INFO - __main__ - Step 144807: {'lr': 1.5179247558311793e-06, 'samples': 27802944, 'steps': 144806, 'loss/train': 1.2887200117111206} 11/07/2021 17:37:20 - INFO - __main__ - Step 144808: {'lr': 1.5173409123808434e-06, 'samples': 27803136, 'steps': 144807, 'loss/train': 1.2971835136413574} 11/07/2021 17:37:20 - INFO - __main__ - Step 144809: {'lr': 1.516757180892947e-06, 'samples': 27803328, 'steps': 144808, 'loss/train': 1.1245214939117432} 11/07/2021 17:37:21 - INFO - __main__ - Step 144810: {'lr': 1.5161735613677397e-06, 'samples': 27803520, 'steps': 144809, 'loss/train': 1.43474280834198} 11/07/2021 17:37:21 - INFO - __main__ - Step 144811: {'lr': 1.5155900538055546e-06, 'samples': 27803712, 'steps': 144810, 'loss/train': 1.0912368297576904} 11/07/2021 17:37:22 - INFO - __main__ - Step 144812: {'lr': 1.5150066582065857e-06, 'samples': 27803904, 'steps': 144811, 'loss/train': 1.370383858680725} 11/07/2021 17:37:22 - INFO - __main__ - Step 144813: {'lr': 1.514423374571111e-06, 'samples': 27804096, 'steps': 144812, 'loss/train': 1.1196380853652954} 11/07/2021 17:37:23 - INFO - __main__ - Step 144814: {'lr': 1.5138402028994081e-06, 'samples': 27804288, 'steps': 144813, 'loss/train': 1.2223680019378662} 11/07/2021 17:37:24 - INFO - __main__ - Step 144815: {'lr': 1.5132571431917541e-06, 'samples': 27804480, 'steps': 144814, 'loss/train': 1.4374204874038696} 11/07/2021 17:37:24 - INFO - __main__ - Step 144816: {'lr': 1.5126741954483714e-06, 'samples': 27804672, 'steps': 144815, 'loss/train': 1.4596760272979736} 11/07/2021 17:37:24 - INFO - __main__ - Step 144817: {'lr': 1.5120913596695651e-06, 'samples': 27804864, 'steps': 144816, 'loss/train': 1.6027685403823853} 11/07/2021 17:37:25 - INFO - __main__ - Step 144818: {'lr': 1.5115086358555574e-06, 'samples': 27805056, 'steps': 144817, 'loss/train': 1.1195443868637085} 11/07/2021 17:37:25 - INFO - __main__ - Step 144819: {'lr': 1.5109260240066536e-06, 'samples': 27805248, 'steps': 144818, 'loss/train': 1.0139758586883545} 11/07/2021 17:37:26 - INFO - __main__ - Step 144820: {'lr': 1.5103435241230757e-06, 'samples': 27805440, 'steps': 144819, 'loss/train': 1.2631117105484009} 11/07/2021 17:37:26 - INFO - __main__ - Step 144821: {'lr': 1.5097611362051012e-06, 'samples': 27805632, 'steps': 144820, 'loss/train': 1.111641526222229} 11/07/2021 17:37:27 - INFO - __main__ - Step 144822: {'lr': 1.50917886025298e-06, 'samples': 27805824, 'steps': 144821, 'loss/train': 1.8492307662963867} 11/07/2021 17:37:27 - INFO - __main__ - Step 144823: {'lr': 1.5085966962669896e-06, 'samples': 27806016, 'steps': 144822, 'loss/train': 1.1042039394378662} 11/07/2021 17:37:27 - INFO - __main__ - Step 144824: {'lr': 1.5080146442474075e-06, 'samples': 27806208, 'steps': 144823, 'loss/train': 1.5040192604064941} 11/07/2021 17:37:28 - INFO - __main__ - Step 144825: {'lr': 1.5074327041944834e-06, 'samples': 27806400, 'steps': 144824, 'loss/train': 1.3811402320861816} 11/07/2021 17:37:29 - INFO - __main__ - Step 144826: {'lr': 1.5068508761084677e-06, 'samples': 27806592, 'steps': 144825, 'loss/train': 1.5033557415008545} 11/07/2021 17:37:29 - INFO - __main__ - Step 144827: {'lr': 1.5062691599896372e-06, 'samples': 27806784, 'steps': 144826, 'loss/train': 1.1196365356445312} 11/07/2021 17:37:30 - INFO - __main__ - Step 144828: {'lr': 1.5056875558382144e-06, 'samples': 27806976, 'steps': 144827, 'loss/train': 0.9846555590629578} 11/07/2021 17:37:30 - INFO - __main__ - Step 144829: {'lr': 1.505106063654532e-06, 'samples': 27807168, 'steps': 144828, 'loss/train': 1.4964113235473633} 11/07/2021 17:37:30 - INFO - __main__ - Step 144830: {'lr': 1.5045246834388126e-06, 'samples': 27807360, 'steps': 144829, 'loss/train': 1.1871747970581055} 11/07/2021 17:37:31 - INFO - __main__ - Step 144831: {'lr': 1.5039434151913055e-06, 'samples': 27807552, 'steps': 144830, 'loss/train': 1.3926936388015747} 11/07/2021 17:37:32 - INFO - __main__ - Step 144832: {'lr': 1.5033622589122885e-06, 'samples': 27807744, 'steps': 144831, 'loss/train': 1.2548812627792358} 11/07/2021 17:37:32 - INFO - __main__ - Step 144833: {'lr': 1.5027812146020391e-06, 'samples': 27807936, 'steps': 144832, 'loss/train': 1.50688636302948} 11/07/2021 17:37:32 - INFO - __main__ - Step 144834: {'lr': 1.5022002822607794e-06, 'samples': 27808128, 'steps': 144833, 'loss/train': 1.3733985424041748} 11/07/2021 17:37:33 - INFO - __main__ - Step 144835: {'lr': 1.5016194618888147e-06, 'samples': 27808320, 'steps': 144834, 'loss/train': 1.3625465631484985} 11/07/2021 17:37:34 - INFO - __main__ - Step 144836: {'lr': 1.5010387534863667e-06, 'samples': 27808512, 'steps': 144835, 'loss/train': 1.3354222774505615} 11/07/2021 17:37:34 - INFO - __main__ - Step 144837: {'lr': 1.5004581570537136e-06, 'samples': 27808704, 'steps': 144836, 'loss/train': 0.942339301109314} 11/07/2021 17:37:35 - INFO - __main__ - Step 144838: {'lr': 1.4998776725911324e-06, 'samples': 27808896, 'steps': 144837, 'loss/train': 1.3981728553771973} 11/07/2021 17:37:35 - INFO - __main__ - Step 144839: {'lr': 1.499297300098873e-06, 'samples': 27809088, 'steps': 144838, 'loss/train': 1.4141736030578613} 11/07/2021 17:37:35 - INFO - __main__ - Step 144840: {'lr': 1.4987170395771854e-06, 'samples': 27809280, 'steps': 144839, 'loss/train': 1.404546856880188} 11/07/2021 17:37:36 - INFO - __main__ - Step 144841: {'lr': 1.4981368910263472e-06, 'samples': 27809472, 'steps': 144840, 'loss/train': 1.566931128501892} 11/07/2021 17:37:37 - INFO - __main__ - Step 144842: {'lr': 1.497556854446608e-06, 'samples': 27809664, 'steps': 144841, 'loss/train': 1.6104395389556885} 11/07/2021 17:37:37 - INFO - __main__ - Step 144843: {'lr': 1.4969769298382453e-06, 'samples': 27809856, 'steps': 144842, 'loss/train': 1.018880009651184} 11/07/2021 17:37:37 - INFO - __main__ - Step 144844: {'lr': 1.4963971172014812e-06, 'samples': 27810048, 'steps': 144843, 'loss/train': 0.5878109931945801} 11/07/2021 17:37:38 - INFO - __main__ - Step 144845: {'lr': 1.495817416536649e-06, 'samples': 27810240, 'steps': 144844, 'loss/train': 1.1463451385498047} 11/07/2021 17:37:39 - INFO - __main__ - Step 144846: {'lr': 1.4952378278439428e-06, 'samples': 27810432, 'steps': 144845, 'loss/train': 0.7502509951591492} 11/07/2021 17:37:39 - INFO - __main__ - Step 144847: {'lr': 1.4946583511236677e-06, 'samples': 27810624, 'steps': 144846, 'loss/train': 1.1452573537826538} 11/07/2021 17:37:40 - INFO - __main__ - Step 144848: {'lr': 1.4940789863760462e-06, 'samples': 27810816, 'steps': 144847, 'loss/train': 0.9922999143600464} 11/07/2021 17:37:40 - INFO - __main__ - Step 144849: {'lr': 1.4934997336013557e-06, 'samples': 27811008, 'steps': 144848, 'loss/train': 1.2410914897918701} 11/07/2021 17:37:40 - INFO - __main__ - Step 144850: {'lr': 1.4929205927998457e-06, 'samples': 27811200, 'steps': 144849, 'loss/train': 1.390437126159668} 11/07/2021 17:37:41 - INFO - __main__ - Step 144851: {'lr': 1.4923415639718219e-06, 'samples': 27811392, 'steps': 144850, 'loss/train': 1.3562637567520142} 11/07/2021 17:37:42 - INFO - __main__ - Step 144852: {'lr': 1.4917626471175061e-06, 'samples': 27811584, 'steps': 144851, 'loss/train': 1.787393569946289} 11/07/2021 17:37:42 - INFO - __main__ - Step 144853: {'lr': 1.491183842237148e-06, 'samples': 27811776, 'steps': 144852, 'loss/train': 1.491951584815979} 11/07/2021 17:37:43 - INFO - __main__ - Step 144854: {'lr': 1.4906051493310534e-06, 'samples': 27811968, 'steps': 144853, 'loss/train': 1.5181020498275757} 11/07/2021 17:37:43 - INFO - __main__ - Step 144855: {'lr': 1.490026568399444e-06, 'samples': 27812160, 'steps': 144854, 'loss/train': 0.07258595526218414} 11/07/2021 17:37:43 - INFO - __main__ - Step 144856: {'lr': 1.4894480994425973e-06, 'samples': 27812352, 'steps': 144855, 'loss/train': 0.8802388906478882} 11/07/2021 17:37:45 - INFO - __main__ - Step 144857: {'lr': 1.4888697424607632e-06, 'samples': 27812544, 'steps': 144856, 'loss/train': 1.0366710424423218} 11/07/2021 17:37:45 - INFO - __main__ - Step 144858: {'lr': 1.4882914974542195e-06, 'samples': 27812736, 'steps': 144857, 'loss/train': 1.2968538999557495} 11/07/2021 17:37:45 - INFO - __main__ - Step 144859: {'lr': 1.4877133644232433e-06, 'samples': 27812928, 'steps': 144858, 'loss/train': 1.2134218215942383} 11/07/2021 17:37:46 - INFO - __main__ - Step 144860: {'lr': 1.487135343368029e-06, 'samples': 27813120, 'steps': 144859, 'loss/train': 1.2057926654815674} 11/07/2021 17:37:46 - INFO - __main__ - Step 144861: {'lr': 1.4865574342888821e-06, 'samples': 27813312, 'steps': 144860, 'loss/train': 0.7128580808639526} 11/07/2021 17:37:46 - INFO - __main__ - Step 144862: {'lr': 1.4859796371860522e-06, 'samples': 27813504, 'steps': 144861, 'loss/train': 0.8495539426803589} 11/07/2021 17:37:48 - INFO - __main__ - Step 144863: {'lr': 1.4854019520598171e-06, 'samples': 27813696, 'steps': 144862, 'loss/train': 1.9376503229141235} 11/07/2021 17:37:48 - INFO - __main__ - Step 144864: {'lr': 1.4848243789104265e-06, 'samples': 27813888, 'steps': 144863, 'loss/train': 1.0341898202896118} 11/07/2021 17:37:49 - INFO - __main__ - Step 144865: {'lr': 1.4842469177381578e-06, 'samples': 27814080, 'steps': 144864, 'loss/train': 1.945284128189087} 11/07/2021 17:37:49 - INFO - __main__ - Step 144866: {'lr': 1.4836695685432056e-06, 'samples': 27814272, 'steps': 144865, 'loss/train': 1.163822054862976} 11/07/2021 17:37:49 - INFO - __main__ - Step 144867: {'lr': 1.4830923313259026e-06, 'samples': 27814464, 'steps': 144866, 'loss/train': 1.0784944295883179} 11/07/2021 17:37:50 - INFO - __main__ - Step 144868: {'lr': 1.4825152060864989e-06, 'samples': 27814656, 'steps': 144867, 'loss/train': 2.439152240753174} 11/07/2021 17:37:51 - INFO - __main__ - Step 144869: {'lr': 1.4819381928252163e-06, 'samples': 27814848, 'steps': 144868, 'loss/train': 1.0104684829711914} 11/07/2021 17:37:51 - INFO - __main__ - Step 144870: {'lr': 1.4813612915423325e-06, 'samples': 27815040, 'steps': 144869, 'loss/train': 1.4622013568878174} 11/07/2021 17:37:51 - INFO - __main__ - Step 144871: {'lr': 1.480784502238125e-06, 'samples': 27815232, 'steps': 144870, 'loss/train': 1.6489933729171753} 11/07/2021 17:37:52 - INFO - __main__ - Step 144872: {'lr': 1.4802078249128714e-06, 'samples': 27815424, 'steps': 144871, 'loss/train': 0.13792888820171356} 11/07/2021 17:37:53 - INFO - __main__ - Step 144873: {'lr': 1.479631259566766e-06, 'samples': 27815616, 'steps': 144872, 'loss/train': 1.067356824874878} 11/07/2021 17:37:53 - INFO - __main__ - Step 144874: {'lr': 1.4790548062001142e-06, 'samples': 27815808, 'steps': 144873, 'loss/train': 1.4002563953399658} 11/07/2021 17:37:53 - INFO - __main__ - Step 144875: {'lr': 1.4784784648131378e-06, 'samples': 27816000, 'steps': 144874, 'loss/train': 1.2686740159988403} 11/07/2021 17:37:54 - INFO - __main__ - Step 144876: {'lr': 1.47790223540617e-06, 'samples': 27816192, 'steps': 144875, 'loss/train': 1.1569918394088745} 11/07/2021 17:37:54 - INFO - __main__ - Step 144877: {'lr': 1.4773261179793774e-06, 'samples': 27816384, 'steps': 144876, 'loss/train': 1.0824882984161377} 11/07/2021 17:37:55 - INFO - __main__ - Step 144878: {'lr': 1.476750112533093e-06, 'samples': 27816576, 'steps': 144877, 'loss/train': 1.426871418952942} 11/07/2021 17:37:56 - INFO - __main__ - Step 144879: {'lr': 1.4761742190675665e-06, 'samples': 27816768, 'steps': 144878, 'loss/train': 1.514944076538086} 11/07/2021 17:37:56 - INFO - __main__ - Step 144880: {'lr': 1.47559843758302e-06, 'samples': 27816960, 'steps': 144879, 'loss/train': 1.6326013803482056} 11/07/2021 17:37:56 - INFO - __main__ - Step 144881: {'lr': 1.4750227680797313e-06, 'samples': 27817152, 'steps': 144880, 'loss/train': 1.2910583019256592} 11/07/2021 17:37:57 - INFO - __main__ - Step 144882: {'lr': 1.4744472105579499e-06, 'samples': 27817344, 'steps': 144881, 'loss/train': 1.0661895275115967} 11/07/2021 17:37:58 - INFO - __main__ - Step 144883: {'lr': 1.4738717650179812e-06, 'samples': 27817536, 'steps': 144882, 'loss/train': 1.3935545682907104} 11/07/2021 17:37:58 - INFO - __main__ - Step 144884: {'lr': 1.4732964314600194e-06, 'samples': 27817728, 'steps': 144883, 'loss/train': 1.4302165508270264} 11/07/2021 17:37:58 - INFO - __main__ - Step 144885: {'lr': 1.47272120988437e-06, 'samples': 27817920, 'steps': 144884, 'loss/train': 1.1047155857086182} 11/07/2021 17:37:59 - INFO - __main__ - Step 144886: {'lr': 1.4721461002912828e-06, 'samples': 27818112, 'steps': 144885, 'loss/train': 1.0844935178756714} 11/07/2021 17:37:59 - INFO - __main__ - Step 144887: {'lr': 1.4715711026810075e-06, 'samples': 27818304, 'steps': 144886, 'loss/train': 1.6515679359436035} 11/07/2021 17:37:59 - INFO - __main__ - Step 144888: {'lr': 1.4709962170538217e-06, 'samples': 27818496, 'steps': 144887, 'loss/train': 1.4302823543548584} 11/07/2021 17:38:00 - INFO - __main__ - Step 144889: {'lr': 1.4704214434099471e-06, 'samples': 27818688, 'steps': 144888, 'loss/train': 1.4879295825958252} 11/07/2021 17:38:01 - INFO - __main__ - Step 144890: {'lr': 1.469846781749662e-06, 'samples': 27818880, 'steps': 144889, 'loss/train': 1.0137028694152832} 11/07/2021 17:38:01 - INFO - __main__ - Step 144891: {'lr': 1.4692722320732433e-06, 'samples': 27819072, 'steps': 144890, 'loss/train': 1.337985873222351} 11/07/2021 17:38:02 - INFO - __main__ - Step 144892: {'lr': 1.468697794380941e-06, 'samples': 27819264, 'steps': 144891, 'loss/train': 1.3503011465072632} 11/07/2021 17:38:02 - INFO - __main__ - Step 144893: {'lr': 1.4681234686729772e-06, 'samples': 27819456, 'steps': 144892, 'loss/train': 0.9435763359069824} 11/07/2021 17:38:03 - INFO - __main__ - Step 144894: {'lr': 1.4675492549496572e-06, 'samples': 27819648, 'steps': 144893, 'loss/train': 1.4687964916229248} 11/07/2021 17:38:03 - INFO - __main__ - Step 144895: {'lr': 1.4669751532112308e-06, 'samples': 27819840, 'steps': 144894, 'loss/train': 1.2899669408798218} 11/07/2021 17:38:04 - INFO - __main__ - Step 144896: {'lr': 1.4664011634579477e-06, 'samples': 27820032, 'steps': 144895, 'loss/train': 0.9499567151069641} 11/07/2021 17:38:04 - INFO - __main__ - Step 144897: {'lr': 1.4658272856900856e-06, 'samples': 27820224, 'steps': 144896, 'loss/train': 0.8283629417419434} 11/07/2021 17:38:04 - INFO - __main__ - Step 144898: {'lr': 1.4652535199078665e-06, 'samples': 27820416, 'steps': 144897, 'loss/train': 1.2499545812606812} 11/07/2021 17:38:05 - INFO - __main__ - Step 144899: {'lr': 1.4646798661115957e-06, 'samples': 27820608, 'steps': 144898, 'loss/train': 1.2295458316802979} 11/07/2021 17:38:06 - INFO - __main__ - Step 144900: {'lr': 1.4641063243014674e-06, 'samples': 27820800, 'steps': 144899, 'loss/train': 1.741917371749878} 11/07/2021 17:38:06 - INFO - __main__ - Step 144901: {'lr': 1.4635328944778148e-06, 'samples': 27820992, 'steps': 144900, 'loss/train': 1.4972541332244873} 11/07/2021 17:38:06 - INFO - __main__ - Step 144902: {'lr': 1.4629595766408322e-06, 'samples': 27821184, 'steps': 144901, 'loss/train': 1.1349883079528809} 11/07/2021 17:38:07 - INFO - __main__ - Step 144903: {'lr': 1.462386370790797e-06, 'samples': 27821376, 'steps': 144902, 'loss/train': 1.3608113527297974} 11/07/2021 17:38:08 - INFO - __main__ - Step 144904: {'lr': 1.4618132769279869e-06, 'samples': 27821568, 'steps': 144903, 'loss/train': 1.1468075513839722} 11/07/2021 17:38:08 - INFO - __main__ - Step 144905: {'lr': 1.4612402950526516e-06, 'samples': 27821760, 'steps': 144904, 'loss/train': 1.6877273321151733} 11/07/2021 17:38:08 - INFO - __main__ - Step 144906: {'lr': 1.4606674251650687e-06, 'samples': 27821952, 'steps': 144905, 'loss/train': 1.179608941078186} 11/07/2021 17:38:09 - INFO - __main__ - Step 144907: {'lr': 1.4600946672654324e-06, 'samples': 27822144, 'steps': 144906, 'loss/train': 0.40580078959465027} 11/07/2021 17:38:09 - INFO - __main__ - Step 144908: {'lr': 1.459522021354076e-06, 'samples': 27822336, 'steps': 144907, 'loss/train': 1.1700626611709595} 11/07/2021 17:38:10 - INFO - __main__ - Step 144909: {'lr': 1.4589494874311937e-06, 'samples': 27822528, 'steps': 144908, 'loss/train': 0.7284704446792603} 11/07/2021 17:38:11 - INFO - __main__ - Step 144910: {'lr': 1.4583770654970906e-06, 'samples': 27822720, 'steps': 144909, 'loss/train': 1.1256592273712158} 11/07/2021 17:38:11 - INFO - __main__ - Step 144911: {'lr': 1.457804755552017e-06, 'samples': 27822912, 'steps': 144910, 'loss/train': 1.2710247039794922} 11/07/2021 17:38:11 - INFO - __main__ - Step 144912: {'lr': 1.4572325575961941e-06, 'samples': 27823104, 'steps': 144911, 'loss/train': 1.1854557991027832} 11/07/2021 17:38:12 - INFO - __main__ - Step 144913: {'lr': 1.456660471629928e-06, 'samples': 27823296, 'steps': 144912, 'loss/train': 1.3444725275039673} 11/07/2021 17:38:13 - INFO - __main__ - Step 144914: {'lr': 1.4560884976534684e-06, 'samples': 27823488, 'steps': 144913, 'loss/train': 1.4780759811401367} 11/07/2021 17:38:13 - INFO - __main__ - Step 144915: {'lr': 1.455516635667037e-06, 'samples': 27823680, 'steps': 144914, 'loss/train': 1.5935131311416626} 11/07/2021 17:38:13 - INFO - __main__ - Step 144916: {'lr': 1.4549448856709114e-06, 'samples': 27823872, 'steps': 144915, 'loss/train': 1.201167106628418} 11/07/2021 17:38:14 - INFO - __main__ - Step 144917: {'lr': 1.4543732476653692e-06, 'samples': 27824064, 'steps': 144916, 'loss/train': 1.43563973903656} 11/07/2021 17:38:14 - INFO - __main__ - Step 144918: {'lr': 1.4538017216506326e-06, 'samples': 27824256, 'steps': 144917, 'loss/train': 0.8890660405158997} 11/07/2021 17:38:15 - INFO - __main__ - Step 144919: {'lr': 1.4532303076269793e-06, 'samples': 27824448, 'steps': 144918, 'loss/train': 1.419437289237976} 11/07/2021 17:38:15 - INFO - __main__ - Step 144920: {'lr': 1.4526590055946865e-06, 'samples': 27824640, 'steps': 144919, 'loss/train': 1.356225609779358} 11/07/2021 17:38:16 - INFO - __main__ - Step 144921: {'lr': 1.4520878155539763e-06, 'samples': 27824832, 'steps': 144920, 'loss/train': 1.438254475593567} 11/07/2021 17:38:16 - INFO - __main__ - Step 144922: {'lr': 1.4515167375051264e-06, 'samples': 27825024, 'steps': 144921, 'loss/train': 0.9450387954711914} 11/07/2021 17:38:17 - INFO - __main__ - Step 144923: {'lr': 1.4509457714483865e-06, 'samples': 27825216, 'steps': 144922, 'loss/train': 0.902652382850647} 11/07/2021 17:38:17 - INFO - __main__ - Step 144924: {'lr': 1.4503749173840065e-06, 'samples': 27825408, 'steps': 144923, 'loss/train': 1.405591607093811} 11/07/2021 17:38:18 - INFO - __main__ - Step 144925: {'lr': 1.449804175312236e-06, 'samples': 27825600, 'steps': 144924, 'loss/train': 1.328940987586975} 11/07/2021 17:38:18 - INFO - __main__ - Step 144926: {'lr': 1.4492335452333805e-06, 'samples': 27825792, 'steps': 144925, 'loss/train': 1.5045866966247559} 11/07/2021 17:38:19 - INFO - __main__ - Step 144927: {'lr': 1.448663027147662e-06, 'samples': 27825984, 'steps': 144926, 'loss/train': 0.8699185252189636} 11/07/2021 17:38:19 - INFO - __main__ - Step 144928: {'lr': 1.4480926210553303e-06, 'samples': 27826176, 'steps': 144927, 'loss/train': 0.4550004303455353} 11/07/2021 17:38:19 - INFO - __main__ - Step 144929: {'lr': 1.447522326956663e-06, 'samples': 27826368, 'steps': 144928, 'loss/train': 1.3839906454086304} 11/07/2021 17:38:20 - INFO - __main__ - Step 144930: {'lr': 1.446952144851882e-06, 'samples': 27826560, 'steps': 144929, 'loss/train': 1.8051905632019043} 11/07/2021 17:38:21 - INFO - __main__ - Step 144931: {'lr': 1.4463820747412925e-06, 'samples': 27826752, 'steps': 144930, 'loss/train': 0.9466748237609863} 11/07/2021 17:38:21 - INFO - __main__ - Step 144932: {'lr': 1.445812116625117e-06, 'samples': 27826944, 'steps': 144931, 'loss/train': 1.2927101850509644} 11/07/2021 17:38:21 - INFO - __main__ - Step 144933: {'lr': 1.4452422705036328e-06, 'samples': 27827136, 'steps': 144932, 'loss/train': 1.3972604274749756} 11/07/2021 17:38:22 - INFO - __main__ - Step 144934: {'lr': 1.4446725363770618e-06, 'samples': 27827328, 'steps': 144933, 'loss/train': 1.702505111694336} 11/07/2021 17:38:23 - INFO - __main__ - Step 144935: {'lr': 1.4441029142457097e-06, 'samples': 27827520, 'steps': 144934, 'loss/train': 1.35957932472229} 11/07/2021 17:38:23 - INFO - __main__ - Step 144936: {'lr': 1.443533404109798e-06, 'samples': 27827712, 'steps': 144935, 'loss/train': 0.9495635032653809} 11/07/2021 17:38:24 - INFO - __main__ - Step 144937: {'lr': 1.4429640059696047e-06, 'samples': 27827904, 'steps': 144936, 'loss/train': 1.3514665365219116} 11/07/2021 17:38:24 - INFO - __main__ - Step 144938: {'lr': 1.4423947198253796e-06, 'samples': 27828096, 'steps': 144937, 'loss/train': 1.2136224508285522} 11/07/2021 17:38:24 - INFO - __main__ - Step 144939: {'lr': 1.4418255456773722e-06, 'samples': 27828288, 'steps': 144938, 'loss/train': 1.2943528890609741} 11/07/2021 17:38:25 - INFO - __main__ - Step 144940: {'lr': 1.4412564835258602e-06, 'samples': 27828480, 'steps': 144939, 'loss/train': 1.0701813697814941} 11/07/2021 17:38:26 - INFO - __main__ - Step 144941: {'lr': 1.440687533371038e-06, 'samples': 27828672, 'steps': 144940, 'loss/train': 0.7969015836715698} 11/07/2021 17:38:26 - INFO - __main__ - Step 144942: {'lr': 1.4401186952132384e-06, 'samples': 27828864, 'steps': 144941, 'loss/train': 1.4592596292495728} 11/07/2021 17:38:26 - INFO - __main__ - Step 144943: {'lr': 1.4395499690526836e-06, 'samples': 27829056, 'steps': 144942, 'loss/train': 0.159165158867836} 11/07/2021 17:38:27 - INFO - __main__ - Step 144944: {'lr': 1.4389813548896235e-06, 'samples': 27829248, 'steps': 144943, 'loss/train': 1.5141853094100952} 11/07/2021 17:38:27 - INFO - __main__ - Step 144945: {'lr': 1.4384128527243357e-06, 'samples': 27829440, 'steps': 144944, 'loss/train': 0.5225747227668762} 11/07/2021 17:38:28 - INFO - __main__ - Step 144946: {'lr': 1.4378444625570418e-06, 'samples': 27829632, 'steps': 144945, 'loss/train': 1.0826929807662964} 11/07/2021 17:38:28 - INFO - __main__ - Step 144947: {'lr': 1.43727618438802e-06, 'samples': 27829824, 'steps': 144946, 'loss/train': 1.4486490488052368} 11/07/2021 17:38:29 - INFO - __main__ - Step 144948: {'lr': 1.4367080182175473e-06, 'samples': 27830016, 'steps': 144947, 'loss/train': 1.2100830078125} 11/07/2021 17:38:29 - INFO - __main__ - Step 144949: {'lr': 1.436139964045846e-06, 'samples': 27830208, 'steps': 144948, 'loss/train': 1.4321043491363525} 11/07/2021 17:38:30 - INFO - __main__ - Step 144950: {'lr': 1.4355720218731939e-06, 'samples': 27830400, 'steps': 144949, 'loss/train': 1.0677618980407715} 11/07/2021 17:38:31 - INFO - __main__ - Step 144951: {'lr': 1.4350041916998124e-06, 'samples': 27830592, 'steps': 144950, 'loss/train': 1.1550980806350708} 11/07/2021 17:38:31 - INFO - __main__ - Step 144952: {'lr': 1.4344364735260073e-06, 'samples': 27830784, 'steps': 144951, 'loss/train': 1.4541722536087036} 11/07/2021 17:38:31 - INFO - __main__ - Step 144953: {'lr': 1.4338688673520284e-06, 'samples': 27830976, 'steps': 144952, 'loss/train': 1.023738145828247} 11/07/2021 17:38:32 - INFO - __main__ - Step 144954: {'lr': 1.4333013731780697e-06, 'samples': 27831168, 'steps': 144953, 'loss/train': 1.0710880756378174} 11/07/2021 17:38:32 - INFO - __main__ - Step 144955: {'lr': 1.4327339910044368e-06, 'samples': 27831360, 'steps': 144954, 'loss/train': 1.1397544145584106} 11/07/2021 17:38:33 - INFO - __main__ - Step 144956: {'lr': 1.432166720831407e-06, 'samples': 27831552, 'steps': 144955, 'loss/train': 1.6939417123794556} 11/07/2021 17:38:33 - INFO - __main__ - Step 144957: {'lr': 1.4315995626591749e-06, 'samples': 27831744, 'steps': 144956, 'loss/train': 1.6670914888381958} 11/07/2021 17:38:34 - INFO - __main__ - Step 144958: {'lr': 1.4310325164880455e-06, 'samples': 27831936, 'steps': 144957, 'loss/train': 1.6026822328567505} 11/07/2021 17:38:34 - INFO - __main__ - Step 144959: {'lr': 1.4304655823182688e-06, 'samples': 27832128, 'steps': 144958, 'loss/train': 1.6557073593139648} 11/07/2021 17:38:34 - INFO - __main__ - Step 144960: {'lr': 1.4298987601500667e-06, 'samples': 27832320, 'steps': 144959, 'loss/train': 1.2472805976867676} 11/07/2021 17:38:35 - INFO - __main__ - Step 144961: {'lr': 1.429332049983717e-06, 'samples': 27832512, 'steps': 144960, 'loss/train': 1.2083840370178223} 11/07/2021 17:38:36 - INFO - __main__ - Step 144962: {'lr': 1.4287654518194693e-06, 'samples': 27832704, 'steps': 144961, 'loss/train': 1.412050485610962} 11/07/2021 17:38:36 - INFO - __main__ - Step 144963: {'lr': 1.4281989656576011e-06, 'samples': 27832896, 'steps': 144962, 'loss/train': 1.1288398504257202} 11/07/2021 17:38:36 - INFO - __main__ - Step 144964: {'lr': 1.4276325914983623e-06, 'samples': 27833088, 'steps': 144963, 'loss/train': 0.912104606628418} 11/07/2021 17:38:37 - INFO - __main__ - Step 144965: {'lr': 1.427066329341975e-06, 'samples': 27833280, 'steps': 144964, 'loss/train': 1.1328181028366089} 11/07/2021 17:38:38 - INFO - __main__ - Step 144966: {'lr': 1.4265001791887168e-06, 'samples': 27833472, 'steps': 144965, 'loss/train': 1.6264561414718628} 11/07/2021 17:38:38 - INFO - __main__ - Step 144967: {'lr': 1.4259341410388649e-06, 'samples': 27833664, 'steps': 144966, 'loss/train': 1.6161425113677979} 11/07/2021 17:38:39 - INFO - __main__ - Step 144968: {'lr': 1.425368214892614e-06, 'samples': 27833856, 'steps': 144967, 'loss/train': 1.5114161968231201} 11/07/2021 17:38:39 - INFO - __main__ - Step 144969: {'lr': 1.424802400750269e-06, 'samples': 27834048, 'steps': 144968, 'loss/train': 1.5342596769332886} 11/07/2021 17:38:39 - INFO - __main__ - Step 144970: {'lr': 1.4242366986120802e-06, 'samples': 27834240, 'steps': 144969, 'loss/train': 1.4361796379089355} 11/07/2021 17:38:40 - INFO - __main__ - Step 144971: {'lr': 1.423671108478297e-06, 'samples': 27834432, 'steps': 144970, 'loss/train': 1.480186939239502} 11/07/2021 17:38:41 - INFO - __main__ - Step 144972: {'lr': 1.4231056303491697e-06, 'samples': 27834624, 'steps': 144971, 'loss/train': 1.5146870613098145} 11/07/2021 17:38:41 - INFO - __main__ - Step 144973: {'lr': 1.4225402642249475e-06, 'samples': 27834816, 'steps': 144972, 'loss/train': 1.0474408864974976} 11/07/2021 17:38:41 - INFO - __main__ - Step 144974: {'lr': 1.4219750101059082e-06, 'samples': 27835008, 'steps': 144973, 'loss/train': 1.7546578645706177} 11/07/2021 17:38:42 - INFO - __main__ - Step 144975: {'lr': 1.4214098679922739e-06, 'samples': 27835200, 'steps': 144974, 'loss/train': 1.4158785343170166} 11/07/2021 17:38:43 - INFO - __main__ - Step 144976: {'lr': 1.420844837884322e-06, 'samples': 27835392, 'steps': 144975, 'loss/train': 1.5294811725616455} 11/07/2021 17:38:44 - INFO - __main__ - Step 144977: {'lr': 1.4202799197823024e-06, 'samples': 27835584, 'steps': 144976, 'loss/train': 1.238556981086731} 11/07/2021 17:38:44 - INFO - __main__ - Step 144978: {'lr': 1.419715113686465e-06, 'samples': 27835776, 'steps': 144977, 'loss/train': 1.5461928844451904} 11/07/2021 17:38:44 - INFO - __main__ - Step 144979: {'lr': 1.4191504195970872e-06, 'samples': 27835968, 'steps': 144978, 'loss/train': 2.163895845413208} 11/07/2021 17:38:45 - INFO - __main__ - Step 144980: {'lr': 1.4185858375143913e-06, 'samples': 27836160, 'steps': 144979, 'loss/train': 1.058974266052246} 11/07/2021 17:38:45 - INFO - __main__ - Step 144981: {'lr': 1.4180213674386544e-06, 'samples': 27836352, 'steps': 144980, 'loss/train': 1.3761848211288452} 11/07/2021 17:38:46 - INFO - __main__ - Step 144982: {'lr': 1.417457009370099e-06, 'samples': 27836544, 'steps': 144981, 'loss/train': 1.416474461555481} 11/07/2021 17:38:46 - INFO - __main__ - Step 144983: {'lr': 1.4168927633090023e-06, 'samples': 27836736, 'steps': 144982, 'loss/train': 1.4289168119430542} 11/07/2021 17:38:47 - INFO - __main__ - Step 144984: {'lr': 1.416328629255642e-06, 'samples': 27836928, 'steps': 144983, 'loss/train': 1.6394973993301392} 11/07/2021 17:38:47 - INFO - __main__ - Step 144985: {'lr': 1.4157646072102405e-06, 'samples': 27837120, 'steps': 144984, 'loss/train': 1.2631803750991821} 11/07/2021 17:38:47 - INFO - __main__ - Step 144986: {'lr': 1.4152006971730468e-06, 'samples': 27837312, 'steps': 144985, 'loss/train': 1.0935821533203125} 11/07/2021 17:38:48 - INFO - __main__ - Step 144987: {'lr': 1.4146368991443392e-06, 'samples': 27837504, 'steps': 144986, 'loss/train': 1.3249605894088745} 11/07/2021 17:38:49 - INFO - __main__ - Step 144988: {'lr': 1.4140732131243394e-06, 'samples': 27837696, 'steps': 144987, 'loss/train': 1.3658188581466675} 11/07/2021 17:38:49 - INFO - __main__ - Step 144989: {'lr': 1.413509639113353e-06, 'samples': 27837888, 'steps': 144988, 'loss/train': 2.53151273727417} 11/07/2021 17:38:50 - INFO - __main__ - Step 144990: {'lr': 1.4129461771115737e-06, 'samples': 27838080, 'steps': 144989, 'loss/train': 1.3736212253570557} 11/07/2021 17:38:50 - INFO - __main__ - Step 144991: {'lr': 1.4123828271193072e-06, 'samples': 27838272, 'steps': 144990, 'loss/train': 1.33355712890625} 11/07/2021 17:38:50 - INFO - __main__ - Step 144992: {'lr': 1.4118195891367757e-06, 'samples': 27838464, 'steps': 144991, 'loss/train': 1.312008261680603} 11/07/2021 17:38:51 - INFO - __main__ - Step 144993: {'lr': 1.4112564631642565e-06, 'samples': 27838656, 'steps': 144992, 'loss/train': 1.6407016515731812} 11/07/2021 17:38:52 - INFO - __main__ - Step 144994: {'lr': 1.4106934492019718e-06, 'samples': 27838848, 'steps': 144993, 'loss/train': 1.166068434715271} 11/07/2021 17:38:52 - INFO - __main__ - Step 144995: {'lr': 1.4101305472501991e-06, 'samples': 27839040, 'steps': 144994, 'loss/train': 0.5567113161087036} 11/07/2021 17:38:52 - INFO - __main__ - Step 144996: {'lr': 1.409567757309188e-06, 'samples': 27839232, 'steps': 144995, 'loss/train': 0.4522007703781128} 11/07/2021 17:38:53 - INFO - __main__ - Step 144997: {'lr': 1.4090050793791886e-06, 'samples': 27839424, 'steps': 144996, 'loss/train': 1.547468662261963} 11/07/2021 17:38:54 - INFO - __main__ - Step 144998: {'lr': 1.4084425134604506e-06, 'samples': 27839616, 'steps': 144997, 'loss/train': 1.3237037658691406} 11/07/2021 17:38:54 - INFO - __main__ - Step 144999: {'lr': 1.4078800595532238e-06, 'samples': 27839808, 'steps': 144998, 'loss/train': 1.131464958190918} 11/07/2021 17:38:55 - INFO - __main__ - Step 145000: {'lr': 1.4073177176577855e-06, 'samples': 27840000, 'steps': 144999, 'loss/train': 1.338047981262207} 11/07/2021 17:38:55 - INFO - __main__ - Step 145001: {'lr': 1.406755487774386e-06, 'samples': 27840192, 'steps': 145000, 'loss/train': 1.6784484386444092} 11/07/2021 17:38:55 - INFO - __main__ - Step 145002: {'lr': 1.4061933699032469e-06, 'samples': 27840384, 'steps': 145001, 'loss/train': 0.17990128695964813} 11/07/2021 17:38:56 - INFO - __main__ - Step 145003: {'lr': 1.405631364044646e-06, 'samples': 27840576, 'steps': 145002, 'loss/train': 1.0644632577896118} 11/07/2021 17:38:57 - INFO - __main__ - Step 145004: {'lr': 1.405069470198833e-06, 'samples': 27840768, 'steps': 145003, 'loss/train': 0.37926673889160156} 11/07/2021 17:38:57 - INFO - __main__ - Step 145005: {'lr': 1.404507688366058e-06, 'samples': 27840960, 'steps': 145004, 'loss/train': 1.3580048084259033} 11/07/2021 17:38:57 - INFO - __main__ - Step 145006: {'lr': 1.4039460185465703e-06, 'samples': 27841152, 'steps': 145005, 'loss/train': 1.4766703844070435} 11/07/2021 17:38:58 - INFO - __main__ - Step 145007: {'lr': 1.4033844607406477e-06, 'samples': 27841344, 'steps': 145006, 'loss/train': 1.2215298414230347} 11/07/2021 17:38:58 - INFO - __main__ - Step 145008: {'lr': 1.4028230149484844e-06, 'samples': 27841536, 'steps': 145007, 'loss/train': 0.6648860573768616} 11/07/2021 17:38:59 - INFO - __main__ - Step 145009: {'lr': 1.4022616811704136e-06, 'samples': 27841728, 'steps': 145008, 'loss/train': 0.3929162621498108} 11/07/2021 17:39:00 - INFO - __main__ - Step 145010: {'lr': 1.4017004594066295e-06, 'samples': 27841920, 'steps': 145009, 'loss/train': 2.042980432510376} 11/07/2021 17:39:00 - INFO - __main__ - Step 145011: {'lr': 1.40113934965741e-06, 'samples': 27842112, 'steps': 145010, 'loss/train': 1.0613913536071777} 11/07/2021 17:39:00 - INFO - __main__ - Step 145012: {'lr': 1.4005783519229763e-06, 'samples': 27842304, 'steps': 145011, 'loss/train': 1.3725347518920898} 11/07/2021 17:39:01 - INFO - __main__ - Step 145013: {'lr': 1.4000174662036347e-06, 'samples': 27842496, 'steps': 145012, 'loss/train': 1.3297656774520874} 11/07/2021 17:39:02 - INFO - __main__ - Step 145014: {'lr': 1.3994566924996066e-06, 'samples': 27842688, 'steps': 145013, 'loss/train': 1.2915257215499878} 11/07/2021 17:39:02 - INFO - __main__ - Step 145015: {'lr': 1.3988960308111421e-06, 'samples': 27842880, 'steps': 145014, 'loss/train': 1.1856812238693237} 11/07/2021 17:39:02 - INFO - __main__ - Step 145016: {'lr': 1.3983354811384908e-06, 'samples': 27843072, 'steps': 145015, 'loss/train': 1.0743417739868164} 11/07/2021 17:39:03 - INFO - __main__ - Step 145017: {'lr': 1.3977750434819026e-06, 'samples': 27843264, 'steps': 145016, 'loss/train': 1.8707596063613892} 11/07/2021 17:39:03 - INFO - __main__ - Step 145018: {'lr': 1.3972147178416827e-06, 'samples': 27843456, 'steps': 145017, 'loss/train': 1.434679627418518} 11/07/2021 17:39:04 - INFO - __main__ - Step 145019: {'lr': 1.3966545042180257e-06, 'samples': 27843648, 'steps': 145018, 'loss/train': 0.8880484104156494} 11/07/2021 17:39:05 - INFO - __main__ - Step 145020: {'lr': 1.396094402611181e-06, 'samples': 27843840, 'steps': 145019, 'loss/train': 1.710268497467041} 11/07/2021 17:39:05 - INFO - __main__ - Step 145021: {'lr': 1.3955344130214266e-06, 'samples': 27844032, 'steps': 145020, 'loss/train': 1.3270577192306519} 11/07/2021 17:39:05 - INFO - __main__ - Step 145022: {'lr': 1.3949745354490118e-06, 'samples': 27844224, 'steps': 145021, 'loss/train': 1.2489066123962402} 11/07/2021 17:39:06 - INFO - __main__ - Step 145023: {'lr': 1.3944147698941867e-06, 'samples': 27844416, 'steps': 145022, 'loss/train': 1.3984390497207642} 11/07/2021 17:39:07 - INFO - __main__ - Step 145024: {'lr': 1.3938551163572011e-06, 'samples': 27844608, 'steps': 145023, 'loss/train': 1.5254946947097778} 11/07/2021 17:39:07 - INFO - __main__ - Step 145025: {'lr': 1.3932955748383048e-06, 'samples': 27844800, 'steps': 145024, 'loss/train': 1.4194577932357788} 11/07/2021 17:39:07 - INFO - __main__ - Step 145026: {'lr': 1.3927361453377474e-06, 'samples': 27844992, 'steps': 145025, 'loss/train': 1.4390617609024048} 11/07/2021 17:39:08 - INFO - __main__ - Step 145027: {'lr': 1.3921768278558066e-06, 'samples': 27845184, 'steps': 145026, 'loss/train': 1.5238277912139893} 11/07/2021 17:39:08 - INFO - __main__ - Step 145028: {'lr': 1.3916176223927047e-06, 'samples': 27845376, 'steps': 145027, 'loss/train': 0.3621349334716797} 11/07/2021 17:39:08 - INFO - __main__ - Step 145029: {'lr': 1.391058528948691e-06, 'samples': 27845568, 'steps': 145028, 'loss/train': 1.3067518472671509} 11/07/2021 17:39:09 - INFO - __main__ - Step 145030: {'lr': 1.3904995475240712e-06, 'samples': 27845760, 'steps': 145029, 'loss/train': 1.3953750133514404} 11/07/2021 17:39:10 - INFO - __main__ - Step 145031: {'lr': 1.3899406781190115e-06, 'samples': 27845952, 'steps': 145030, 'loss/train': 1.3006278276443481} 11/07/2021 17:39:10 - INFO - __main__ - Step 145032: {'lr': 1.3893819207338177e-06, 'samples': 27846144, 'steps': 145031, 'loss/train': 1.2174583673477173} 11/07/2021 17:39:10 - INFO - __main__ - Step 145033: {'lr': 1.388823275368739e-06, 'samples': 27846336, 'steps': 145032, 'loss/train': 1.1731802225112915} 11/07/2021 17:39:11 - INFO - __main__ - Step 145034: {'lr': 1.3882647420240258e-06, 'samples': 27846528, 'steps': 145033, 'loss/train': 1.7314815521240234} 11/07/2021 17:39:12 - INFO - __main__ - Step 145035: {'lr': 1.3877063206999274e-06, 'samples': 27846720, 'steps': 145034, 'loss/train': 1.4660348892211914} 11/07/2021 17:39:12 - INFO - __main__ - Step 145036: {'lr': 1.3871480113966662e-06, 'samples': 27846912, 'steps': 145035, 'loss/train': 1.2508368492126465} 11/07/2021 17:39:13 - INFO - __main__ - Step 145037: {'lr': 1.3865898141145471e-06, 'samples': 27847104, 'steps': 145036, 'loss/train': 1.486353874206543} 11/07/2021 17:39:13 - INFO - __main__ - Step 145038: {'lr': 1.3860317288537927e-06, 'samples': 27847296, 'steps': 145037, 'loss/train': 1.4008498191833496} 11/07/2021 17:39:13 - INFO - __main__ - Step 145039: {'lr': 1.3854737556146246e-06, 'samples': 27847488, 'steps': 145038, 'loss/train': 1.5563849210739136} 11/07/2021 17:39:14 - INFO - __main__ - Step 145040: {'lr': 1.3849158943973484e-06, 'samples': 27847680, 'steps': 145039, 'loss/train': 1.3563231229782104} 11/07/2021 17:39:15 - INFO - __main__ - Step 145041: {'lr': 1.3843581452022136e-06, 'samples': 27847872, 'steps': 145040, 'loss/train': 1.5627248287200928} 11/07/2021 17:39:15 - INFO - __main__ - Step 145042: {'lr': 1.3838005080294146e-06, 'samples': 27848064, 'steps': 145041, 'loss/train': 1.5140509605407715} 11/07/2021 17:39:15 - INFO - __main__ - Step 145043: {'lr': 1.383242982879257e-06, 'samples': 27848256, 'steps': 145042, 'loss/train': 1.1899621486663818} 11/07/2021 17:39:16 - INFO - __main__ - Step 145044: {'lr': 1.3826855697519626e-06, 'samples': 27848448, 'steps': 145043, 'loss/train': 1.3406903743743896} 11/07/2021 17:39:16 - INFO - __main__ - Step 145045: {'lr': 1.3821282686478088e-06, 'samples': 27848640, 'steps': 145044, 'loss/train': 0.8759263157844543} 11/07/2021 17:39:17 - INFO - __main__ - Step 145046: {'lr': 1.3815710795670177e-06, 'samples': 27848832, 'steps': 145045, 'loss/train': 0.8738054037094116} 11/07/2021 17:39:17 - INFO - __main__ - Step 145047: {'lr': 1.3810140025098672e-06, 'samples': 27849024, 'steps': 145046, 'loss/train': 1.48989999294281} 11/07/2021 17:39:18 - INFO - __main__ - Step 145048: {'lr': 1.3804570374765791e-06, 'samples': 27849216, 'steps': 145047, 'loss/train': 1.2482892274856567} 11/07/2021 17:39:18 - INFO - __main__ - Step 145049: {'lr': 1.3799001844674308e-06, 'samples': 27849408, 'steps': 145048, 'loss/train': 1.2305889129638672} 11/07/2021 17:39:19 - INFO - __main__ - Step 145050: {'lr': 1.3793434434826724e-06, 'samples': 27849600, 'steps': 145049, 'loss/train': 1.1651171445846558} 11/07/2021 17:39:20 - INFO - __main__ - Step 145051: {'lr': 1.3787868145225535e-06, 'samples': 27849792, 'steps': 145050, 'loss/train': 1.316470742225647} 11/07/2021 17:39:20 - INFO - __main__ - Step 145052: {'lr': 1.3782302975872963e-06, 'samples': 27849984, 'steps': 145051, 'loss/train': 1.7195396423339844} 11/07/2021 17:39:20 - INFO - __main__ - Step 145053: {'lr': 1.3776738926771782e-06, 'samples': 27850176, 'steps': 145052, 'loss/train': 1.2477627992630005} 11/07/2021 17:39:21 - INFO - __main__ - Step 145054: {'lr': 1.3771175997924213e-06, 'samples': 27850368, 'steps': 145053, 'loss/train': 1.2829804420471191} 11/07/2021 17:39:21 - INFO - __main__ - Step 145055: {'lr': 1.3765614189333309e-06, 'samples': 27850560, 'steps': 145054, 'loss/train': 1.5661156177520752} 11/07/2021 17:39:22 - INFO - __main__ - Step 145056: {'lr': 1.3760053501001013e-06, 'samples': 27850752, 'steps': 145055, 'loss/train': 0.9458659887313843} 11/07/2021 17:39:22 - INFO - __main__ - Step 145057: {'lr': 1.3754493932930102e-06, 'samples': 27850944, 'steps': 145056, 'loss/train': 1.4977420568466187} 11/07/2021 17:39:23 - INFO - __main__ - Step 145058: {'lr': 1.3748935485123072e-06, 'samples': 27851136, 'steps': 145057, 'loss/train': 1.0570883750915527} 11/07/2021 17:39:23 - INFO - __main__ - Step 145059: {'lr': 1.3743378157582698e-06, 'samples': 27851328, 'steps': 145058, 'loss/train': 1.3790255784988403} 11/07/2021 17:39:23 - INFO - __main__ - Step 145060: {'lr': 1.3737821950310924e-06, 'samples': 27851520, 'steps': 145059, 'loss/train': 1.4606574773788452} 11/07/2021 17:39:25 - INFO - __main__ - Step 145061: {'lr': 1.3732266863310527e-06, 'samples': 27851712, 'steps': 145060, 'loss/train': 1.3656803369522095} 11/07/2021 17:39:25 - INFO - __main__ - Step 145062: {'lr': 1.3726712896584003e-06, 'samples': 27851904, 'steps': 145061, 'loss/train': 1.419749140739441} 11/07/2021 17:39:25 - INFO - __main__ - Step 145063: {'lr': 1.3721160050133851e-06, 'samples': 27852096, 'steps': 145062, 'loss/train': 1.471571922302246} 11/07/2021 17:39:26 - INFO - __main__ - Step 145064: {'lr': 1.371560832396257e-06, 'samples': 27852288, 'steps': 145063, 'loss/train': 0.8668680787086487} 11/07/2021 17:39:26 - INFO - __main__ - Step 145065: {'lr': 1.3710057718072933e-06, 'samples': 27852480, 'steps': 145064, 'loss/train': 1.109971046447754} 11/07/2021 17:39:26 - INFO - __main__ - Step 145066: {'lr': 1.3704508232466882e-06, 'samples': 27852672, 'steps': 145065, 'loss/train': 1.1051236391067505} 11/07/2021 17:39:27 - INFO - __main__ - Step 145067: {'lr': 1.3698959867147199e-06, 'samples': 27852864, 'steps': 145066, 'loss/train': 1.2072311639785767} 11/07/2021 17:39:28 - INFO - __main__ - Step 145068: {'lr': 1.3693412622116652e-06, 'samples': 27853056, 'steps': 145067, 'loss/train': 1.3269398212432861} 11/07/2021 17:39:28 - INFO - __main__ - Step 145069: {'lr': 1.3687866497377188e-06, 'samples': 27853248, 'steps': 145068, 'loss/train': 1.2923483848571777} 11/07/2021 17:39:28 - INFO - __main__ - Step 145070: {'lr': 1.368232149293186e-06, 'samples': 27853440, 'steps': 145069, 'loss/train': 0.6991701722145081} 11/07/2021 17:39:29 - INFO - __main__ - Step 145071: {'lr': 1.367677760878261e-06, 'samples': 27853632, 'steps': 145070, 'loss/train': 1.475708246231079} 11/07/2021 17:39:30 - INFO - __main__ - Step 145072: {'lr': 1.367123484493249e-06, 'samples': 27853824, 'steps': 145071, 'loss/train': 1.7462055683135986} 11/07/2021 17:39:30 - INFO - __main__ - Step 145073: {'lr': 1.3665693201383722e-06, 'samples': 27854016, 'steps': 145072, 'loss/train': 1.821246862411499} 11/07/2021 17:39:31 - INFO - __main__ - Step 145074: {'lr': 1.3660152678138804e-06, 'samples': 27854208, 'steps': 145073, 'loss/train': 1.3379629850387573} 11/07/2021 17:39:31 - INFO - __main__ - Step 145075: {'lr': 1.3654613275200235e-06, 'samples': 27854400, 'steps': 145074, 'loss/train': 0.928056001663208} 11/07/2021 17:39:31 - INFO - __main__ - Step 145076: {'lr': 1.364907499257051e-06, 'samples': 27854592, 'steps': 145075, 'loss/train': 0.9051296710968018} 11/07/2021 17:39:32 - INFO - __main__ - Step 145077: {'lr': 1.3643537830252128e-06, 'samples': 27854784, 'steps': 145076, 'loss/train': 1.438576102256775} 11/07/2021 17:39:33 - INFO - __main__ - Step 145078: {'lr': 1.363800178824759e-06, 'samples': 27854976, 'steps': 145077, 'loss/train': 1.309330940246582} 11/07/2021 17:39:33 - INFO - __main__ - Step 145079: {'lr': 1.363246686655939e-06, 'samples': 27855168, 'steps': 145078, 'loss/train': 1.3207581043243408} 11/07/2021 17:39:33 - INFO - __main__ - Step 145080: {'lr': 1.3626933065190306e-06, 'samples': 27855360, 'steps': 145079, 'loss/train': 1.8337905406951904} 11/07/2021 17:39:34 - INFO - __main__ - Step 145081: {'lr': 1.362140038414228e-06, 'samples': 27855552, 'steps': 145080, 'loss/train': 1.4697898626327515} 11/07/2021 17:39:35 - INFO - __main__ - Step 145082: {'lr': 1.3615868823418088e-06, 'samples': 27855744, 'steps': 145081, 'loss/train': 1.1728650331497192} 11/07/2021 17:39:35 - INFO - __main__ - Step 145083: {'lr': 1.3610338383020226e-06, 'samples': 27855936, 'steps': 145082, 'loss/train': 1.080130934715271} 11/07/2021 17:39:35 - INFO - __main__ - Step 145084: {'lr': 1.3604809062951195e-06, 'samples': 27856128, 'steps': 145083, 'loss/train': 0.1480327546596527} 11/07/2021 17:39:36 - INFO - __main__ - Step 145085: {'lr': 1.3599280863213492e-06, 'samples': 27856320, 'steps': 145084, 'loss/train': 0.851112425327301} 11/07/2021 17:39:36 - INFO - __main__ - Step 145086: {'lr': 1.3593753783809614e-06, 'samples': 27856512, 'steps': 145085, 'loss/train': 1.3697439432144165} 11/07/2021 17:39:37 - INFO - __main__ - Step 145087: {'lr': 1.3588227824742062e-06, 'samples': 27856704, 'steps': 145086, 'loss/train': 1.6326889991760254} 11/07/2021 17:39:37 - INFO - __main__ - Step 145088: {'lr': 1.3582702986013329e-06, 'samples': 27856896, 'steps': 145087, 'loss/train': 1.5674073696136475} 11/07/2021 17:39:38 - INFO - __main__ - Step 145089: {'lr': 1.3577179267625638e-06, 'samples': 27857088, 'steps': 145088, 'loss/train': 1.1800845861434937} 11/07/2021 17:39:38 - INFO - __main__ - Step 145090: {'lr': 1.3571656669582044e-06, 'samples': 27857280, 'steps': 145089, 'loss/train': 0.9624501466751099} 11/07/2021 17:39:39 - INFO - __main__ - Step 145091: {'lr': 1.3566135191884488e-06, 'samples': 27857472, 'steps': 145090, 'loss/train': 1.3762223720550537} 11/07/2021 17:39:39 - INFO - __main__ - Step 145092: {'lr': 1.3560614834535467e-06, 'samples': 27857664, 'steps': 145091, 'loss/train': 0.5806673765182495} 11/07/2021 17:39:40 - INFO - __main__ - Step 145093: {'lr': 1.3555095597538037e-06, 'samples': 27857856, 'steps': 145092, 'loss/train': 0.9128943681716919} 11/07/2021 17:39:40 - INFO - __main__ - Step 145094: {'lr': 1.3549577480893859e-06, 'samples': 27858048, 'steps': 145093, 'loss/train': 1.3273499011993408} 11/07/2021 17:39:41 - INFO - __main__ - Step 145095: {'lr': 1.354406048460627e-06, 'samples': 27858240, 'steps': 145094, 'loss/train': 1.3705127239227295} 11/07/2021 17:39:41 - INFO - __main__ - Step 145096: {'lr': 1.3538544608677205e-06, 'samples': 27858432, 'steps': 145095, 'loss/train': 1.7059718370437622} 11/07/2021 17:39:41 - INFO - __main__ - Step 145097: {'lr': 1.3533029853109447e-06, 'samples': 27858624, 'steps': 145096, 'loss/train': 1.7376413345336914} 11/07/2021 17:39:42 - INFO - __main__ - Step 145098: {'lr': 1.3527516217905212e-06, 'samples': 27858816, 'steps': 145097, 'loss/train': 1.4655959606170654} 11/07/2021 17:39:43 - INFO - __main__ - Step 145099: {'lr': 1.3522003703066998e-06, 'samples': 27859008, 'steps': 145098, 'loss/train': 1.986587405204773} 11/07/2021 17:39:43 - INFO - __main__ - Step 145100: {'lr': 1.3516492308597583e-06, 'samples': 27859200, 'steps': 145099, 'loss/train': 0.7562567591667175} 11/07/2021 17:39:43 - INFO - __main__ - Step 145101: {'lr': 1.3510982034499187e-06, 'samples': 27859392, 'steps': 145100, 'loss/train': 1.5862832069396973} 11/07/2021 17:39:44 - INFO - __main__ - Step 145102: {'lr': 1.3505472880774305e-06, 'samples': 27859584, 'steps': 145101, 'loss/train': 1.148643970489502} 11/07/2021 17:39:45 - INFO - __main__ - Step 145103: {'lr': 1.3499964847425717e-06, 'samples': 27859776, 'steps': 145102, 'loss/train': 1.4563846588134766} 11/07/2021 17:39:45 - INFO - __main__ - Step 145104: {'lr': 1.349445793445564e-06, 'samples': 27859968, 'steps': 145103, 'loss/train': 1.5573158264160156} 11/07/2021 17:39:45 - INFO - __main__ - Step 145105: {'lr': 1.3488952141866294e-06, 'samples': 27860160, 'steps': 145104, 'loss/train': 0.9226667881011963} 11/07/2021 17:39:46 - INFO - __main__ - Step 145106: {'lr': 1.3483447469660737e-06, 'samples': 27860352, 'steps': 145105, 'loss/train': 0.8505119681358337} 11/07/2021 17:39:46 - INFO - __main__ - Step 145107: {'lr': 1.3477943917840907e-06, 'samples': 27860544, 'steps': 145106, 'loss/train': 1.1496927738189697} 11/07/2021 17:39:47 - INFO - __main__ - Step 145108: {'lr': 1.3472441486409859e-06, 'samples': 27860736, 'steps': 145107, 'loss/train': 1.4916126728057861} 11/07/2021 17:39:48 - INFO - __main__ - Step 145109: {'lr': 1.3466940175369536e-06, 'samples': 27860928, 'steps': 145108, 'loss/train': 1.3834364414215088} 11/07/2021 17:39:48 - INFO - __main__ - Step 145110: {'lr': 1.3461439984722711e-06, 'samples': 27861120, 'steps': 145109, 'loss/train': 1.4073920249938965} 11/07/2021 17:39:48 - INFO - __main__ - Step 145111: {'lr': 1.3455940914471886e-06, 'samples': 27861312, 'steps': 145110, 'loss/train': 1.0594466924667358} 11/07/2021 17:39:49 - INFO - __main__ - Step 145112: {'lr': 1.3450442964619281e-06, 'samples': 27861504, 'steps': 145111, 'loss/train': 1.1459993124008179} 11/07/2021 17:39:49 - INFO - __main__ - Step 145113: {'lr': 1.3444946135167668e-06, 'samples': 27861696, 'steps': 145112, 'loss/train': 0.17492026090621948} 11/07/2021 17:39:50 - INFO - __main__ - Step 145114: {'lr': 1.343945042611955e-06, 'samples': 27861888, 'steps': 145113, 'loss/train': 1.3882951736450195} 11/07/2021 17:39:51 - INFO - __main__ - Step 145115: {'lr': 1.3433955837476862e-06, 'samples': 27862080, 'steps': 145114, 'loss/train': 0.8905945420265198} 11/07/2021 17:39:51 - INFO - __main__ - Step 145116: {'lr': 1.3428462369242666e-06, 'samples': 27862272, 'steps': 145115, 'loss/train': 0.41007137298583984} 11/07/2021 17:39:51 - INFO - __main__ - Step 145117: {'lr': 1.3422970021419178e-06, 'samples': 27862464, 'steps': 145116, 'loss/train': 1.510027289390564} 11/07/2021 17:39:52 - INFO - __main__ - Step 145118: {'lr': 1.3417478794008896e-06, 'samples': 27862656, 'steps': 145117, 'loss/train': 1.433909296989441} 11/07/2021 17:39:53 - INFO - __main__ - Step 145119: {'lr': 1.3411988687014598e-06, 'samples': 27862848, 'steps': 145118, 'loss/train': 1.7607797384262085} 11/07/2021 17:39:53 - INFO - __main__ - Step 145120: {'lr': 1.3406499700438224e-06, 'samples': 27863040, 'steps': 145119, 'loss/train': 0.9657756090164185} 11/07/2021 17:39:53 - INFO - __main__ - Step 145121: {'lr': 1.340101183428255e-06, 'samples': 27863232, 'steps': 145120, 'loss/train': 1.0196270942687988} 11/07/2021 17:39:54 - INFO - __main__ - Step 145122: {'lr': 1.3395525088550075e-06, 'samples': 27863424, 'steps': 145121, 'loss/train': 1.393068790435791} 11/07/2021 17:39:54 - INFO - __main__ - Step 145123: {'lr': 1.339003946324302e-06, 'samples': 27863616, 'steps': 145122, 'loss/train': 1.4044004678726196} 11/07/2021 17:39:55 - INFO - __main__ - Step 145124: {'lr': 1.3384554958364158e-06, 'samples': 27863808, 'steps': 145123, 'loss/train': 1.5535876750946045} 11/07/2021 17:39:55 - INFO - __main__ - Step 145125: {'lr': 1.3379071573915992e-06, 'samples': 27864000, 'steps': 145124, 'loss/train': 2.0268192291259766} 11/07/2021 17:39:56 - INFO - __main__ - Step 145126: {'lr': 1.3373589309900458e-06, 'samples': 27864192, 'steps': 145125, 'loss/train': 1.1492140293121338} 11/07/2021 17:39:56 - INFO - __main__ - Step 145127: {'lr': 1.3368108166320891e-06, 'samples': 27864384, 'steps': 145126, 'loss/train': 0.49436846375465393} 11/07/2021 17:39:57 - INFO - __main__ - Step 145128: {'lr': 1.3362628143178957e-06, 'samples': 27864576, 'steps': 145127, 'loss/train': 0.9319328665733337} 11/07/2021 17:39:58 - INFO - __main__ - Step 145129: {'lr': 1.3357149240477707e-06, 'samples': 27864768, 'steps': 145128, 'loss/train': 1.1770374774932861} 11/07/2021 17:39:58 - INFO - __main__ - Step 145130: {'lr': 1.3351671458219083e-06, 'samples': 27864960, 'steps': 145129, 'loss/train': 0.44320470094680786} 11/07/2021 17:39:59 - INFO - __main__ - Step 145131: {'lr': 1.3346194796406141e-06, 'samples': 27865152, 'steps': 145130, 'loss/train': 1.422669768333435} 11/07/2021 17:39:59 - INFO - __main__ - Step 145132: {'lr': 1.3340719255040822e-06, 'samples': 27865344, 'steps': 145131, 'loss/train': 1.7038716077804565} 11/07/2021 17:39:59 - INFO - __main__ - Step 145133: {'lr': 1.3335244834125626e-06, 'samples': 27865536, 'steps': 145132, 'loss/train': 1.4136717319488525} 11/07/2021 17:40:00 - INFO - __main__ - Step 145134: {'lr': 1.3329771533663604e-06, 'samples': 27865728, 'steps': 145133, 'loss/train': 1.021605134010315} 11/07/2021 17:40:00 - INFO - __main__ - Step 145135: {'lr': 1.332429935365642e-06, 'samples': 27865920, 'steps': 145134, 'loss/train': 2.098557949066162} 11/07/2021 17:40:01 - INFO - __main__ - Step 145136: {'lr': 1.3318828294107133e-06, 'samples': 27866112, 'steps': 145135, 'loss/train': 2.6165480613708496} 11/07/2021 17:40:02 - INFO - __main__ - Step 145137: {'lr': 1.3313358355017958e-06, 'samples': 27866304, 'steps': 145136, 'loss/train': 0.8960002064704895} 11/07/2021 17:40:02 - INFO - __main__ - Step 145138: {'lr': 1.3307889536391394e-06, 'samples': 27866496, 'steps': 145137, 'loss/train': 1.0154207944869995} 11/07/2021 17:40:03 - INFO - __main__ - Step 145139: {'lr': 1.3302421838230217e-06, 'samples': 27866688, 'steps': 145138, 'loss/train': 1.3355449438095093} 11/07/2021 17:40:03 - INFO - __main__ - Step 145140: {'lr': 1.329695526053637e-06, 'samples': 27866880, 'steps': 145139, 'loss/train': 1.4687494039535522} 11/07/2021 17:40:04 - INFO - __main__ - Step 145141: {'lr': 1.3291489803312629e-06, 'samples': 27867072, 'steps': 145140, 'loss/train': 1.478419542312622} 11/07/2021 17:40:04 - INFO - __main__ - Step 145142: {'lr': 1.3286025466561212e-06, 'samples': 27867264, 'steps': 145141, 'loss/train': 0.9571791887283325} 11/07/2021 17:40:05 - INFO - __main__ - Step 145143: {'lr': 1.3280562250284622e-06, 'samples': 27867456, 'steps': 145142, 'loss/train': 0.9765370488166809} 11/07/2021 17:40:05 - INFO - __main__ - Step 145144: {'lr': 1.327510015448563e-06, 'samples': 27867648, 'steps': 145143, 'loss/train': 0.6345440149307251} 11/07/2021 17:40:05 - INFO - __main__ - Step 145145: {'lr': 1.326963917916646e-06, 'samples': 27867840, 'steps': 145144, 'loss/train': 1.1780109405517578} 11/07/2021 17:40:06 - INFO - __main__ - Step 145146: {'lr': 1.3264179324329884e-06, 'samples': 27868032, 'steps': 145145, 'loss/train': 1.28000807762146} 11/07/2021 17:40:07 - INFO - __main__ - Step 145147: {'lr': 1.3258720589977846e-06, 'samples': 27868224, 'steps': 145146, 'loss/train': 1.002426266670227} 11/07/2021 17:40:07 - INFO - __main__ - Step 145148: {'lr': 1.3253262976112844e-06, 'samples': 27868416, 'steps': 145147, 'loss/train': 1.9408082962036133} 11/07/2021 17:40:08 - INFO - __main__ - Step 145149: {'lr': 1.3247806482737933e-06, 'samples': 27868608, 'steps': 145148, 'loss/train': 1.1997315883636475} 11/07/2021 17:40:08 - INFO - __main__ - Step 145150: {'lr': 1.3242351109855055e-06, 'samples': 27868800, 'steps': 145149, 'loss/train': 1.1896940469741821} 11/07/2021 17:40:08 - INFO - __main__ - Step 145151: {'lr': 1.3236896857466706e-06, 'samples': 27868992, 'steps': 145150, 'loss/train': 1.4182416200637817} 11/07/2021 17:40:09 - INFO - __main__ - Step 145152: {'lr': 1.3231443725575388e-06, 'samples': 27869184, 'steps': 145151, 'loss/train': 1.9611247777938843} 11/07/2021 17:40:10 - INFO - __main__ - Step 145153: {'lr': 1.3225991714183872e-06, 'samples': 27869376, 'steps': 145152, 'loss/train': 0.8633104562759399} 11/07/2021 17:40:10 - INFO - __main__ - Step 145154: {'lr': 1.3220540823294104e-06, 'samples': 27869568, 'steps': 145153, 'loss/train': 2.0389294624328613} 11/07/2021 17:40:10 - INFO - __main__ - Step 145155: {'lr': 1.3215091052909133e-06, 'samples': 27869760, 'steps': 145154, 'loss/train': 1.3644630908966064} 11/07/2021 17:40:11 - INFO - __main__ - Step 145156: {'lr': 1.3209642403030631e-06, 'samples': 27869952, 'steps': 145155, 'loss/train': 1.1467756032943726} 11/07/2021 17:40:12 - INFO - __main__ - Step 145157: {'lr': 1.3204194873661924e-06, 'samples': 27870144, 'steps': 145156, 'loss/train': 1.1538420915603638} 11/07/2021 17:40:12 - INFO - __main__ - Step 145158: {'lr': 1.3198748464804677e-06, 'samples': 27870336, 'steps': 145157, 'loss/train': 1.2816052436828613} 11/07/2021 17:40:12 - INFO - __main__ - Step 145159: {'lr': 1.3193303176461946e-06, 'samples': 27870528, 'steps': 145158, 'loss/train': 1.1717393398284912} 11/07/2021 17:40:13 - INFO - __main__ - Step 145160: {'lr': 1.318785900863567e-06, 'samples': 27870720, 'steps': 145159, 'loss/train': 0.8748659491539001} 11/07/2021 17:40:13 - INFO - __main__ - Step 145161: {'lr': 1.318241596132863e-06, 'samples': 27870912, 'steps': 145160, 'loss/train': 1.4414914846420288} 11/07/2021 17:40:14 - INFO - __main__ - Step 145162: {'lr': 1.317697403454332e-06, 'samples': 27871104, 'steps': 145161, 'loss/train': 1.078112006187439} 11/07/2021 17:40:15 - INFO - __main__ - Step 145163: {'lr': 1.3171533228282239e-06, 'samples': 27871296, 'steps': 145162, 'loss/train': 1.0628923177719116} 11/07/2021 17:40:15 - INFO - __main__ - Step 145164: {'lr': 1.316609354254733e-06, 'samples': 27871488, 'steps': 145163, 'loss/train': 1.2505525350570679} 11/07/2021 17:40:15 - INFO - __main__ - Step 145165: {'lr': 1.3160654977341646e-06, 'samples': 27871680, 'steps': 145164, 'loss/train': 1.772456169128418} 11/07/2021 17:40:16 - INFO - __main__ - Step 145166: {'lr': 1.3155217532667408e-06, 'samples': 27871872, 'steps': 145165, 'loss/train': 1.6719263792037964} 11/07/2021 17:40:16 - INFO - __main__ - Step 145167: {'lr': 1.3149781208526834e-06, 'samples': 27872064, 'steps': 145166, 'loss/train': 0.9828547239303589} 11/07/2021 17:40:17 - INFO - __main__ - Step 145168: {'lr': 1.314434600492298e-06, 'samples': 27872256, 'steps': 145167, 'loss/train': 0.8873456716537476} 11/07/2021 17:40:17 - INFO - __main__ - Step 145169: {'lr': 1.3138911921857787e-06, 'samples': 27872448, 'steps': 145168, 'loss/train': 1.2625967264175415} 11/07/2021 17:40:18 - INFO - __main__ - Step 145170: {'lr': 1.3133478959333757e-06, 'samples': 27872640, 'steps': 145169, 'loss/train': 1.0928325653076172} 11/07/2021 17:40:18 - INFO - __main__ - Step 145171: {'lr': 1.3128047117353104e-06, 'samples': 27872832, 'steps': 145170, 'loss/train': 1.322023630142212} 11/07/2021 17:40:18 - INFO - __main__ - Step 145172: {'lr': 1.3122616395918884e-06, 'samples': 27873024, 'steps': 145171, 'loss/train': 1.0145297050476074} 11/07/2021 17:40:19 - INFO - __main__ - Step 145173: {'lr': 1.311718679503332e-06, 'samples': 27873216, 'steps': 145172, 'loss/train': 1.1479713916778564} 11/07/2021 17:40:20 - INFO - __main__ - Step 145174: {'lr': 1.3111758314698629e-06, 'samples': 27873408, 'steps': 145173, 'loss/train': 0.846750020980835} 11/07/2021 17:40:20 - INFO - __main__ - Step 145175: {'lr': 1.310633095491731e-06, 'samples': 27873600, 'steps': 145174, 'loss/train': 1.3531657457351685} 11/07/2021 17:40:20 - INFO - __main__ - Step 145176: {'lr': 1.3100904715692142e-06, 'samples': 27873792, 'steps': 145175, 'loss/train': 1.2298520803451538} 11/07/2021 17:40:21 - INFO - __main__ - Step 145177: {'lr': 1.309547959702534e-06, 'samples': 27873984, 'steps': 145176, 'loss/train': 1.187883973121643} 11/07/2021 17:40:22 - INFO - __main__ - Step 145178: {'lr': 1.3090055598919126e-06, 'samples': 27874176, 'steps': 145177, 'loss/train': 1.6105247735977173} 11/07/2021 17:40:22 - INFO - __main__ - Step 145179: {'lr': 1.3084632721376276e-06, 'samples': 27874368, 'steps': 145178, 'loss/train': 1.517195224761963} 11/07/2021 17:40:23 - INFO - __main__ - Step 145180: {'lr': 1.3079210964399014e-06, 'samples': 27874560, 'steps': 145179, 'loss/train': 1.2575311660766602} 11/07/2021 17:40:23 - INFO - __main__ - Step 145181: {'lr': 1.3073790327989832e-06, 'samples': 27874752, 'steps': 145180, 'loss/train': 1.3083131313323975} 11/07/2021 17:40:23 - INFO - __main__ - Step 145182: {'lr': 1.3068370812151509e-06, 'samples': 27874944, 'steps': 145181, 'loss/train': 1.2638742923736572} 11/07/2021 17:40:24 - INFO - __main__ - Step 145183: {'lr': 1.3062952416885987e-06, 'samples': 27875136, 'steps': 145182, 'loss/train': 1.614729404449463} 11/07/2021 17:40:25 - INFO - __main__ - Step 145184: {'lr': 1.3057535142196043e-06, 'samples': 27875328, 'steps': 145183, 'loss/train': 1.215593934059143} 11/07/2021 17:40:25 - INFO - __main__ - Step 145185: {'lr': 1.3052118988083894e-06, 'samples': 27875520, 'steps': 145184, 'loss/train': 1.2855894565582275} 11/07/2021 17:40:25 - INFO - __main__ - Step 145186: {'lr': 1.304670395455232e-06, 'samples': 27875712, 'steps': 145185, 'loss/train': 0.9356456995010376} 11/07/2021 17:40:26 - INFO - __main__ - Step 145187: {'lr': 1.304129004160326e-06, 'samples': 27875904, 'steps': 145186, 'loss/train': 0.8405578136444092} 11/07/2021 17:40:27 - INFO - __main__ - Step 145188: {'lr': 1.303587724923949e-06, 'samples': 27876096, 'steps': 145187, 'loss/train': 1.2374430894851685} 11/07/2021 17:40:27 - INFO - __main__ - Step 145189: {'lr': 1.3030465577463236e-06, 'samples': 27876288, 'steps': 145188, 'loss/train': 1.3655767440795898} 11/07/2021 17:40:28 - INFO - __main__ - Step 145190: {'lr': 1.3025055026277267e-06, 'samples': 27876480, 'steps': 145189, 'loss/train': 0.9079682230949402} 11/07/2021 17:40:28 - INFO - __main__ - Step 145191: {'lr': 1.3019645595683804e-06, 'samples': 27876672, 'steps': 145190, 'loss/train': 1.2524298429489136} 11/07/2021 17:40:28 - INFO - __main__ - Step 145192: {'lr': 1.301423728568535e-06, 'samples': 27876864, 'steps': 145191, 'loss/train': 1.2251533269882202} 11/07/2021 17:40:29 - INFO - __main__ - Step 145193: {'lr': 1.3008830096284118e-06, 'samples': 27877056, 'steps': 145192, 'loss/train': 1.4844951629638672} 11/07/2021 17:40:30 - INFO - __main__ - Step 145194: {'lr': 1.3003424027482892e-06, 'samples': 27877248, 'steps': 145193, 'loss/train': 1.4365359544754028} 11/07/2021 17:40:30 - INFO - __main__ - Step 145195: {'lr': 1.2998019079284162e-06, 'samples': 27877440, 'steps': 145194, 'loss/train': 1.8679221868515015} 11/07/2021 17:40:30 - INFO - __main__ - Step 145196: {'lr': 1.2992615251689876e-06, 'samples': 27877632, 'steps': 145195, 'loss/train': 1.686550498008728} 11/07/2021 17:40:31 - INFO - __main__ - Step 145197: {'lr': 1.298721254470281e-06, 'samples': 27877824, 'steps': 145196, 'loss/train': 0.7123776078224182} 11/07/2021 17:40:32 - INFO - __main__ - Step 145198: {'lr': 1.298181095832518e-06, 'samples': 27878016, 'steps': 145197, 'loss/train': 1.1021728515625} 11/07/2021 17:40:32 - INFO - __main__ - Step 145199: {'lr': 1.2976410492559765e-06, 'samples': 27878208, 'steps': 145198, 'loss/train': 1.3629131317138672} 11/07/2021 17:40:32 - INFO - __main__ - Step 145200: {'lr': 1.2971011147408506e-06, 'samples': 27878400, 'steps': 145199, 'loss/train': 1.1248990297317505} 11/07/2021 17:40:33 - INFO - __main__ - Step 145201: {'lr': 1.296561292287446e-06, 'samples': 27878592, 'steps': 145200, 'loss/train': 1.2082818746566772} 11/07/2021 17:40:33 - INFO - __main__ - Step 145202: {'lr': 1.2960215818959565e-06, 'samples': 27878784, 'steps': 145201, 'loss/train': 1.5748707056045532} 11/07/2021 17:40:34 - INFO - __main__ - Step 145203: {'lr': 1.2954819835666321e-06, 'samples': 27878976, 'steps': 145202, 'loss/train': 1.4672774076461792} 11/07/2021 17:40:35 - INFO - __main__ - Step 145204: {'lr': 1.2949424972997503e-06, 'samples': 27879168, 'steps': 145203, 'loss/train': 1.4450006484985352} 11/07/2021 17:40:35 - INFO - __main__ - Step 145205: {'lr': 1.2944031230955056e-06, 'samples': 27879360, 'steps': 145204, 'loss/train': 1.214569330215454} 11/07/2021 17:40:35 - INFO - __main__ - Step 145206: {'lr': 1.2938638609542031e-06, 'samples': 27879552, 'steps': 145205, 'loss/train': 1.259667158126831} 11/07/2021 17:40:36 - INFO - __main__ - Step 145207: {'lr': 1.2933247108760093e-06, 'samples': 27879744, 'steps': 145206, 'loss/train': 1.2979246377944946} 11/07/2021 17:40:36 - INFO - __main__ - Step 145208: {'lr': 1.2927856728612297e-06, 'samples': 27879936, 'steps': 145207, 'loss/train': 1.1457457542419434} 11/07/2021 17:40:37 - INFO - __main__ - Step 145209: {'lr': 1.2922467469100863e-06, 'samples': 27880128, 'steps': 145208, 'loss/train': 1.0053060054779053} 11/07/2021 17:40:37 - INFO - __main__ - Step 145210: {'lr': 1.291707933022801e-06, 'samples': 27880320, 'steps': 145209, 'loss/train': 1.111890196800232} 11/07/2021 17:40:38 - INFO - __main__ - Step 145211: {'lr': 1.2911692311996238e-06, 'samples': 27880512, 'steps': 145210, 'loss/train': 5.674117565155029} 11/07/2021 17:40:38 - INFO - __main__ - Step 145212: {'lr': 1.290630641440832e-06, 'samples': 27880704, 'steps': 145211, 'loss/train': 1.1292753219604492} 11/07/2021 17:40:38 - INFO - __main__ - Step 145213: {'lr': 1.2900921637466201e-06, 'samples': 27880896, 'steps': 145212, 'loss/train': 1.4145894050598145} 11/07/2021 17:40:39 - INFO - __main__ - Step 145214: {'lr': 1.2895537981172933e-06, 'samples': 27881088, 'steps': 145213, 'loss/train': 1.2562322616577148} 11/07/2021 17:40:40 - INFO - __main__ - Step 145215: {'lr': 1.2890155445530182e-06, 'samples': 27881280, 'steps': 145214, 'loss/train': 1.446103811264038} 11/07/2021 17:40:40 - INFO - __main__ - Step 145216: {'lr': 1.2884774030541003e-06, 'samples': 27881472, 'steps': 145215, 'loss/train': 0.06384401023387909} 11/07/2021 17:40:41 - INFO - __main__ - Step 145217: {'lr': 1.2879393736207335e-06, 'samples': 27881664, 'steps': 145216, 'loss/train': 0.8625550866127014} 11/07/2021 17:40:41 - INFO - __main__ - Step 145218: {'lr': 1.2874014562531954e-06, 'samples': 27881856, 'steps': 145217, 'loss/train': 1.3481478691101074} 11/07/2021 17:40:41 - INFO - __main__ - Step 145219: {'lr': 1.2868636509517084e-06, 'samples': 27882048, 'steps': 145218, 'loss/train': 1.5019879341125488} 11/07/2021 17:40:43 - INFO - __main__ - Step 145220: {'lr': 1.2863259577165497e-06, 'samples': 27882240, 'steps': 145219, 'loss/train': 0.08850276470184326} 11/07/2021 17:40:43 - INFO - __main__ - Step 145221: {'lr': 1.2857883765479139e-06, 'samples': 27882432, 'steps': 145220, 'loss/train': 1.2833473682403564} 11/07/2021 17:40:43 - INFO - __main__ - Step 145222: {'lr': 1.2852509074460784e-06, 'samples': 27882624, 'steps': 145221, 'loss/train': 1.0362510681152344} 11/07/2021 17:40:44 - INFO - __main__ - Step 145223: {'lr': 1.2847135504112372e-06, 'samples': 27882816, 'steps': 145222, 'loss/train': 1.3086967468261719} 11/07/2021 17:40:44 - INFO - __main__ - Step 145224: {'lr': 1.284176305443696e-06, 'samples': 27883008, 'steps': 145223, 'loss/train': 1.3361119031906128} 11/07/2021 17:40:45 - INFO - __main__ - Step 145225: {'lr': 1.2836391725436491e-06, 'samples': 27883200, 'steps': 145224, 'loss/train': 1.0783854722976685} 11/07/2021 17:40:45 - INFO - __main__ - Step 145226: {'lr': 1.283102151711374e-06, 'samples': 27883392, 'steps': 145225, 'loss/train': 1.019194483757019} 11/07/2021 17:40:46 - INFO - __main__ - Step 145227: {'lr': 1.2825652429470924e-06, 'samples': 27883584, 'steps': 145226, 'loss/train': 1.3828619718551636} 11/07/2021 17:40:46 - INFO - __main__ - Step 145228: {'lr': 1.2820284462510267e-06, 'samples': 27883776, 'steps': 145227, 'loss/train': 0.7741737961769104} 11/07/2021 17:40:46 - INFO - __main__ - Step 145229: {'lr': 1.2814917616234546e-06, 'samples': 27883968, 'steps': 145228, 'loss/train': 1.2276002168655396} 11/07/2021 17:40:47 - INFO - __main__ - Step 145230: {'lr': 1.2809551890646253e-06, 'samples': 27884160, 'steps': 145229, 'loss/train': 1.2752697467803955} 11/07/2021 17:40:48 - INFO - __main__ - Step 145231: {'lr': 1.2804187285747337e-06, 'samples': 27884352, 'steps': 145230, 'loss/train': 1.4157214164733887} 11/07/2021 17:40:48 - INFO - __main__ - Step 145232: {'lr': 1.2798823801540571e-06, 'samples': 27884544, 'steps': 145231, 'loss/train': 0.16410750150680542} 11/07/2021 17:40:49 - INFO - __main__ - Step 145233: {'lr': 1.2793461438028176e-06, 'samples': 27884736, 'steps': 145232, 'loss/train': 1.45164954662323} 11/07/2021 17:40:49 - INFO - __main__ - Step 145234: {'lr': 1.2788100195212649e-06, 'samples': 27884928, 'steps': 145233, 'loss/train': 1.1175587177276611} 11/07/2021 17:40:49 - INFO - __main__ - Step 145235: {'lr': 1.2782740073096767e-06, 'samples': 27885120, 'steps': 145234, 'loss/train': 1.5743415355682373} 11/07/2021 17:40:50 - INFO - __main__ - Step 145236: {'lr': 1.2777381071682192e-06, 'samples': 27885312, 'steps': 145235, 'loss/train': 0.8562171459197998} 11/07/2021 17:40:51 - INFO - __main__ - Step 145237: {'lr': 1.2772023190971982e-06, 'samples': 27885504, 'steps': 145236, 'loss/train': 1.1136960983276367} 11/07/2021 17:40:51 - INFO - __main__ - Step 145238: {'lr': 1.2766666430968354e-06, 'samples': 27885696, 'steps': 145237, 'loss/train': 0.6727160215377808} 11/07/2021 17:40:51 - INFO - __main__ - Step 145239: {'lr': 1.276131079167353e-06, 'samples': 27885888, 'steps': 145238, 'loss/train': 1.5251662731170654} 11/07/2021 17:40:52 - INFO - __main__ - Step 145240: {'lr': 1.2755956273090008e-06, 'samples': 27886080, 'steps': 145239, 'loss/train': 1.2760251760482788} 11/07/2021 17:40:53 - INFO - __main__ - Step 145241: {'lr': 1.2750602875220562e-06, 'samples': 27886272, 'steps': 145240, 'loss/train': 1.6625055074691772} 11/07/2021 17:40:53 - INFO - __main__ - Step 145242: {'lr': 1.2745250598067137e-06, 'samples': 27886464, 'steps': 145241, 'loss/train': 0.9667705297470093} 11/07/2021 17:40:53 - INFO - __main__ - Step 145243: {'lr': 1.273989944163223e-06, 'samples': 27886656, 'steps': 145242, 'loss/train': 1.1150532960891724} 11/07/2021 17:40:54 - INFO - __main__ - Step 145244: {'lr': 1.2734549405918617e-06, 'samples': 27886848, 'steps': 145243, 'loss/train': 1.0971322059631348} 11/07/2021 17:40:54 - INFO - __main__ - Step 145245: {'lr': 1.272920049092824e-06, 'samples': 27887040, 'steps': 145244, 'loss/train': 1.3811190128326416} 11/07/2021 17:40:55 - INFO - __main__ - Step 145246: {'lr': 1.2723852696663597e-06, 'samples': 27887232, 'steps': 145245, 'loss/train': 1.4341508150100708} 11/07/2021 17:40:55 - INFO - __main__ - Step 145247: {'lr': 1.2718506023127464e-06, 'samples': 27887424, 'steps': 145246, 'loss/train': 1.034528136253357} 11/07/2021 17:40:56 - INFO - __main__ - Step 145248: {'lr': 1.2713160470321782e-06, 'samples': 27887616, 'steps': 145247, 'loss/train': 1.2851853370666504} 11/07/2021 17:40:56 - INFO - __main__ - Step 145249: {'lr': 1.270781603824933e-06, 'samples': 27887808, 'steps': 145248, 'loss/train': 1.4963144063949585} 11/07/2021 17:40:56 - INFO - __main__ - Step 145250: {'lr': 1.2702472726912328e-06, 'samples': 27888000, 'steps': 145249, 'loss/train': 0.5302770733833313} 11/07/2021 17:40:58 - INFO - __main__ - Step 145251: {'lr': 1.2697130536312995e-06, 'samples': 27888192, 'steps': 145250, 'loss/train': 0.7844222187995911} 11/07/2021 17:40:58 - INFO - __main__ - Step 145252: {'lr': 1.2691789466454107e-06, 'samples': 27888384, 'steps': 145251, 'loss/train': 1.5301649570465088} 11/07/2021 17:40:58 - INFO - __main__ - Step 145253: {'lr': 1.2686449517337884e-06, 'samples': 27888576, 'steps': 145252, 'loss/train': 0.5611286163330078} 11/07/2021 17:40:59 - INFO - __main__ - Step 145254: {'lr': 1.2681110688966823e-06, 'samples': 27888768, 'steps': 145253, 'loss/train': 1.4038711786270142} 11/07/2021 17:40:59 - INFO - __main__ - Step 145255: {'lr': 1.2675772981343426e-06, 'samples': 27888960, 'steps': 145254, 'loss/train': 1.3342217206954956} 11/07/2021 17:41:00 - INFO - __main__ - Step 145256: {'lr': 1.267043639446963e-06, 'samples': 27889152, 'steps': 145255, 'loss/train': 1.0995293855667114} 11/07/2021 17:41:01 - INFO - __main__ - Step 145257: {'lr': 1.2665100928348217e-06, 'samples': 27889344, 'steps': 145256, 'loss/train': 1.7439121007919312} 11/07/2021 17:41:01 - INFO - __main__ - Step 145258: {'lr': 1.265976658298168e-06, 'samples': 27889536, 'steps': 145257, 'loss/train': 1.1516107320785522} 11/07/2021 17:41:01 - INFO - __main__ - Step 145259: {'lr': 1.265443335837224e-06, 'samples': 27889728, 'steps': 145258, 'loss/train': 1.5767492055892944} 11/07/2021 17:41:02 - INFO - __main__ - Step 145260: {'lr': 1.26491012545224e-06, 'samples': 27889920, 'steps': 145259, 'loss/train': 1.6079399585723877} 11/07/2021 17:41:02 - INFO - __main__ - Step 145261: {'lr': 1.2643770271434373e-06, 'samples': 27890112, 'steps': 145260, 'loss/train': 1.1526596546173096} 11/07/2021 17:41:03 - INFO - __main__ - Step 145262: {'lr': 1.2638440409110663e-06, 'samples': 27890304, 'steps': 145261, 'loss/train': 0.9517669081687927} 11/07/2021 17:41:03 - INFO - __main__ - Step 145263: {'lr': 1.2633111667553765e-06, 'samples': 27890496, 'steps': 145262, 'loss/train': 0.9304308295249939} 11/07/2021 17:41:04 - INFO - __main__ - Step 145264: {'lr': 1.26277840467659e-06, 'samples': 27890688, 'steps': 145263, 'loss/train': 1.2332956790924072} 11/07/2021 17:41:04 - INFO - __main__ - Step 145265: {'lr': 1.2622457546749566e-06, 'samples': 27890880, 'steps': 145264, 'loss/train': 1.2100954055786133} 11/07/2021 17:41:04 - INFO - __main__ - Step 145266: {'lr': 1.2617132167507262e-06, 'samples': 27891072, 'steps': 145265, 'loss/train': 1.4450910091400146} 11/07/2021 17:41:05 - INFO - __main__ - Step 145267: {'lr': 1.2611807909041207e-06, 'samples': 27891264, 'steps': 145266, 'loss/train': 1.1098823547363281} 11/07/2021 17:41:06 - INFO - __main__ - Step 145268: {'lr': 1.2606484771353898e-06, 'samples': 27891456, 'steps': 145267, 'loss/train': 0.9794409275054932} 11/07/2021 17:41:07 - INFO - __main__ - Step 145269: {'lr': 1.2601162754447836e-06, 'samples': 27891648, 'steps': 145268, 'loss/train': 1.3066824674606323} 11/07/2021 17:41:07 - INFO - __main__ - Step 145270: {'lr': 1.259584185832524e-06, 'samples': 27891840, 'steps': 145269, 'loss/train': 2.087599039077759} 11/07/2021 17:41:07 - INFO - __main__ - Step 145271: {'lr': 1.259052208298861e-06, 'samples': 27892032, 'steps': 145270, 'loss/train': 0.1713971346616745} 11/07/2021 17:41:08 - INFO - __main__ - Step 145272: {'lr': 1.2585203428440162e-06, 'samples': 27892224, 'steps': 145271, 'loss/train': 1.2129393815994263} 11/07/2021 17:41:09 - INFO - __main__ - Step 145273: {'lr': 1.2579885894682674e-06, 'samples': 27892416, 'steps': 145272, 'loss/train': 1.4253994226455688} 11/07/2021 17:41:09 - INFO - __main__ - Step 145274: {'lr': 1.257456948171809e-06, 'samples': 27892608, 'steps': 145273, 'loss/train': 1.3090764284133911} 11/07/2021 17:41:09 - INFO - __main__ - Step 145275: {'lr': 1.2569254189549183e-06, 'samples': 27892800, 'steps': 145274, 'loss/train': 1.270697832107544} 11/07/2021 17:41:10 - INFO - __main__ - Step 145276: {'lr': 1.2563940018178176e-06, 'samples': 27892992, 'steps': 145275, 'loss/train': 1.3292726278305054} 11/07/2021 17:41:10 - INFO - __main__ - Step 145277: {'lr': 1.2558626967607566e-06, 'samples': 27893184, 'steps': 145276, 'loss/train': 1.0706257820129395} 11/07/2021 17:41:11 - INFO - __main__ - Step 145278: {'lr': 1.2553315037839297e-06, 'samples': 27893376, 'steps': 145277, 'loss/train': 1.4696342945098877} 11/07/2021 17:41:12 - INFO - __main__ - Step 145279: {'lr': 1.254800422887642e-06, 'samples': 27893568, 'steps': 145278, 'loss/train': 1.7911829948425293} 11/07/2021 17:41:12 - INFO - __main__ - Step 145280: {'lr': 1.2542694540720877e-06, 'samples': 27893760, 'steps': 145279, 'loss/train': 1.0922298431396484} 11/07/2021 17:41:12 - INFO - __main__ - Step 145281: {'lr': 1.2537385973375448e-06, 'samples': 27893952, 'steps': 145280, 'loss/train': 0.9227055907249451} 11/07/2021 17:41:13 - INFO - __main__ - Step 145282: {'lr': 1.2532078526842072e-06, 'samples': 27894144, 'steps': 145281, 'loss/train': 1.1806390285491943} 11/07/2021 17:41:13 - INFO - __main__ - Step 145283: {'lr': 1.2526772201123249e-06, 'samples': 27894336, 'steps': 145282, 'loss/train': 0.9604284763336182} 11/07/2021 17:41:14 - INFO - __main__ - Step 145284: {'lr': 1.2521466996221753e-06, 'samples': 27894528, 'steps': 145283, 'loss/train': 1.053778052330017} 11/07/2021 17:41:14 - INFO - __main__ - Step 145285: {'lr': 1.2516162912139528e-06, 'samples': 27894720, 'steps': 145284, 'loss/train': 1.4486693143844604} 11/07/2021 17:41:15 - INFO - __main__ - Step 145286: {'lr': 1.2510859948879071e-06, 'samples': 27894912, 'steps': 145285, 'loss/train': 1.472267746925354} 11/07/2021 17:41:15 - INFO - __main__ - Step 145287: {'lr': 1.250555810644316e-06, 'samples': 27895104, 'steps': 145286, 'loss/train': 0.801484227180481} 11/07/2021 17:41:15 - INFO - __main__ - Step 145288: {'lr': 1.2500257384833736e-06, 'samples': 27895296, 'steps': 145287, 'loss/train': 2.0055429935455322} 11/07/2021 17:41:16 - INFO - __main__ - Step 145289: {'lr': 1.2494957784053019e-06, 'samples': 27895488, 'steps': 145288, 'loss/train': 1.7339589595794678} 11/07/2021 17:41:17 - INFO - __main__ - Step 145290: {'lr': 1.2489659304104062e-06, 'samples': 27895680, 'steps': 145289, 'loss/train': 1.5258675813674927} 11/07/2021 17:41:17 - INFO - __main__ - Step 145291: {'lr': 1.248436194498853e-06, 'samples': 27895872, 'steps': 145290, 'loss/train': 1.1634961366653442} 11/07/2021 17:41:18 - INFO - __main__ - Step 145292: {'lr': 1.247906570670948e-06, 'samples': 27896064, 'steps': 145291, 'loss/train': 1.4879379272460938} 11/07/2021 17:41:18 - INFO - __main__ - Step 145293: {'lr': 1.2473770589268852e-06, 'samples': 27896256, 'steps': 145292, 'loss/train': 1.5855281352996826} 11/07/2021 17:41:19 - INFO - __main__ - Step 145294: {'lr': 1.2468476592669143e-06, 'samples': 27896448, 'steps': 145293, 'loss/train': 1.0026819705963135} 11/07/2021 17:41:19 - INFO - __main__ - Step 145295: {'lr': 1.246318371691285e-06, 'samples': 27896640, 'steps': 145294, 'loss/train': 1.5075080394744873} 11/07/2021 17:41:20 - INFO - __main__ - Step 145296: {'lr': 1.2457891962001922e-06, 'samples': 27896832, 'steps': 145295, 'loss/train': 1.9347546100616455} 11/07/2021 17:41:20 - INFO - __main__ - Step 145297: {'lr': 1.2452601327939405e-06, 'samples': 27897024, 'steps': 145296, 'loss/train': 1.2894377708435059} 11/07/2021 17:41:20 - INFO - __main__ - Step 145298: {'lr': 1.2447311814727524e-06, 'samples': 27897216, 'steps': 145297, 'loss/train': 1.3721506595611572} 11/07/2021 17:41:21 - INFO - __main__ - Step 145299: {'lr': 1.244202342236822e-06, 'samples': 27897408, 'steps': 145298, 'loss/train': 1.1255440711975098} 11/07/2021 17:41:22 - INFO - __main__ - Step 145300: {'lr': 1.243673615086427e-06, 'samples': 27897600, 'steps': 145299, 'loss/train': 1.5656384229660034} 11/07/2021 17:41:23 - INFO - __main__ - Step 145301: {'lr': 1.2431450000217615e-06, 'samples': 27897792, 'steps': 145300, 'loss/train': 1.2591196298599243} 11/07/2021 17:41:23 - INFO - __main__ - Step 145302: {'lr': 1.242616497043131e-06, 'samples': 27897984, 'steps': 145301, 'loss/train': 1.4074093103408813} 11/07/2021 17:41:23 - INFO - __main__ - Step 145303: {'lr': 1.2420881061507295e-06, 'samples': 27898176, 'steps': 145302, 'loss/train': 1.4235408306121826} 11/07/2021 17:41:24 - INFO - __main__ - Step 145304: {'lr': 1.241559827344807e-06, 'samples': 27898368, 'steps': 145303, 'loss/train': 1.7290922403335571} 11/07/2021 17:41:24 - INFO - __main__ - Step 145305: {'lr': 1.2410316606255856e-06, 'samples': 27898560, 'steps': 145304, 'loss/train': 1.6935927867889404} 11/07/2021 17:41:25 - INFO - __main__ - Step 145306: {'lr': 1.2405036059933427e-06, 'samples': 27898752, 'steps': 145305, 'loss/train': 1.4142496585845947} 11/07/2021 17:41:26 - INFO - __main__ - Step 145307: {'lr': 1.2399756634482728e-06, 'samples': 27898944, 'steps': 145306, 'loss/train': 0.05938304588198662} 11/07/2021 17:41:26 - INFO - __main__ - Step 145308: {'lr': 1.2394478329906256e-06, 'samples': 27899136, 'steps': 145307, 'loss/train': 1.3062971830368042} 11/07/2021 17:41:26 - INFO - __main__ - Step 145309: {'lr': 1.2389201146206784e-06, 'samples': 27899328, 'steps': 145308, 'loss/train': 1.210412621498108} 11/07/2021 17:41:27 - INFO - __main__ - Step 145310: {'lr': 1.2383925083385982e-06, 'samples': 27899520, 'steps': 145309, 'loss/train': 1.3751522302627563} 11/07/2021 17:41:28 - INFO - __main__ - Step 145311: {'lr': 1.2378650141446624e-06, 'samples': 27899712, 'steps': 145310, 'loss/train': 0.29888635873794556} 11/07/2021 17:41:28 - INFO - __main__ - Step 145312: {'lr': 1.2373376320391206e-06, 'samples': 27899904, 'steps': 145311, 'loss/train': 1.2129756212234497} 11/07/2021 17:41:29 - INFO - __main__ - Step 145313: {'lr': 1.2368103620221949e-06, 'samples': 27900096, 'steps': 145312, 'loss/train': 1.828493595123291} 11/07/2021 17:41:29 - INFO - __main__ - Step 145314: {'lr': 1.2362832040941075e-06, 'samples': 27900288, 'steps': 145313, 'loss/train': 1.7078617811203003} 11/07/2021 17:41:29 - INFO - __main__ - Step 145315: {'lr': 1.2357561582551357e-06, 'samples': 27900480, 'steps': 145314, 'loss/train': 1.4231146574020386} 11/07/2021 17:41:30 - INFO - __main__ - Step 145316: {'lr': 1.235229224505474e-06, 'samples': 27900672, 'steps': 145315, 'loss/train': 1.4967882633209229} 11/07/2021 17:41:31 - INFO - __main__ - Step 145317: {'lr': 1.2347024028454001e-06, 'samples': 27900864, 'steps': 145316, 'loss/train': 0.6132462024688721} 11/07/2021 17:41:31 - INFO - __main__ - Step 145318: {'lr': 1.234175693275108e-06, 'samples': 27901056, 'steps': 145317, 'loss/train': 1.742503046989441} 11/07/2021 17:41:31 - INFO - __main__ - Step 145319: {'lr': 1.2336490957948477e-06, 'samples': 27901248, 'steps': 145318, 'loss/train': 1.3163930177688599} 11/07/2021 17:41:32 - INFO - __main__ - Step 145320: {'lr': 1.2331226104048965e-06, 'samples': 27901440, 'steps': 145319, 'loss/train': 1.5365992784500122} 11/07/2021 17:41:32 - INFO - __main__ - Step 145321: {'lr': 1.2325962371054766e-06, 'samples': 27901632, 'steps': 145320, 'loss/train': 1.6506925821304321} 11/07/2021 17:41:33 - INFO - __main__ - Step 145322: {'lr': 1.2320699758967824e-06, 'samples': 27901824, 'steps': 145321, 'loss/train': 1.4472436904907227} 11/07/2021 17:41:33 - INFO - __main__ - Step 145323: {'lr': 1.2315438267790636e-06, 'samples': 27902016, 'steps': 145322, 'loss/train': 0.897429883480072} 11/07/2021 17:41:34 - INFO - __main__ - Step 145324: {'lr': 1.2310177897525977e-06, 'samples': 27902208, 'steps': 145323, 'loss/train': 0.9987709522247314} 11/07/2021 17:41:34 - INFO - __main__ - Step 145325: {'lr': 1.230491864817579e-06, 'samples': 27902400, 'steps': 145324, 'loss/train': 1.1735568046569824} 11/07/2021 17:41:34 - INFO - __main__ - Step 145326: {'lr': 1.229966051974285e-06, 'samples': 27902592, 'steps': 145325, 'loss/train': 1.487215280532837} 11/07/2021 17:41:36 - INFO - __main__ - Step 145327: {'lr': 1.2294403512229103e-06, 'samples': 27902784, 'steps': 145326, 'loss/train': 1.008117914199829} 11/07/2021 17:41:36 - INFO - __main__ - Step 145328: {'lr': 1.2289147625637042e-06, 'samples': 27902976, 'steps': 145327, 'loss/train': 1.4882464408874512} 11/07/2021 17:41:37 - INFO - __main__ - Step 145329: {'lr': 1.228389285996917e-06, 'samples': 27903168, 'steps': 145328, 'loss/train': 0.04788906127214432} 11/07/2021 17:41:37 - INFO - __main__ - Step 145330: {'lr': 1.2278639215227983e-06, 'samples': 27903360, 'steps': 145329, 'loss/train': 1.262801170349121} 11/07/2021 17:41:37 - INFO - __main__ - Step 145331: {'lr': 1.22733866914157e-06, 'samples': 27903552, 'steps': 145330, 'loss/train': 1.1441296339035034} 11/07/2021 17:41:39 - INFO - __main__ - Step 145332: {'lr': 1.2268135288534266e-06, 'samples': 27903744, 'steps': 145331, 'loss/train': 0.9780707955360413} 11/07/2021 17:41:39 - INFO - __main__ - Step 145333: {'lr': 1.2262885006586732e-06, 'samples': 27903936, 'steps': 145332, 'loss/train': 1.234470248222351} 11/07/2021 17:41:39 - INFO - __main__ - Step 145334: {'lr': 1.2257635845575044e-06, 'samples': 27904128, 'steps': 145333, 'loss/train': 0.6267085671424866} 11/07/2021 17:41:40 - INFO - __main__ - Step 145335: {'lr': 1.2252387805501697e-06, 'samples': 27904320, 'steps': 145334, 'loss/train': 1.5739713907241821} 11/07/2021 17:41:40 - INFO - __main__ - Step 145336: {'lr': 1.2247140886368912e-06, 'samples': 27904512, 'steps': 145335, 'loss/train': 1.2421603202819824} 11/07/2021 17:41:41 - INFO - __main__ - Step 145337: {'lr': 1.2241895088179189e-06, 'samples': 27904704, 'steps': 145336, 'loss/train': 1.2417150735855103} 11/07/2021 17:41:42 - INFO - __main__ - Step 145338: {'lr': 1.2236650410935024e-06, 'samples': 27904896, 'steps': 145337, 'loss/train': 0.8075993061065674} 11/07/2021 17:41:42 - INFO - __main__ - Step 145339: {'lr': 1.223140685463864e-06, 'samples': 27905088, 'steps': 145338, 'loss/train': 1.3094178438186646} 11/07/2021 17:41:42 - INFO - __main__ - Step 145340: {'lr': 1.2226164419292252e-06, 'samples': 27905280, 'steps': 145339, 'loss/train': 1.498841404914856} 11/07/2021 17:41:43 - INFO - __main__ - Step 145341: {'lr': 1.2220923104898364e-06, 'samples': 27905472, 'steps': 145340, 'loss/train': 1.2872779369354248} 11/07/2021 17:41:44 - INFO - __main__ - Step 145342: {'lr': 1.2215682911459469e-06, 'samples': 27905664, 'steps': 145341, 'loss/train': 1.1037729978561401} 11/07/2021 17:41:44 - INFO - __main__ - Step 145343: {'lr': 1.2210443838977791e-06, 'samples': 27905856, 'steps': 145342, 'loss/train': 1.7800425291061401} 11/07/2021 17:41:45 - INFO - __main__ - Step 145344: {'lr': 1.220520588745555e-06, 'samples': 27906048, 'steps': 145343, 'loss/train': 1.166983962059021} 11/07/2021 17:41:45 - INFO - __main__ - Step 145345: {'lr': 1.2199969056895522e-06, 'samples': 27906240, 'steps': 145344, 'loss/train': 1.8348995447158813} 11/07/2021 17:41:45 - INFO - __main__ - Step 145346: {'lr': 1.2194733347299647e-06, 'samples': 27906432, 'steps': 145345, 'loss/train': 1.8099709749221802} 11/07/2021 17:41:46 - INFO - __main__ - Step 145347: {'lr': 1.2189498758670425e-06, 'samples': 27906624, 'steps': 145346, 'loss/train': 1.3824801445007324} 11/07/2021 17:41:46 - INFO - __main__ - Step 145348: {'lr': 1.2184265291010077e-06, 'samples': 27906816, 'steps': 145347, 'loss/train': 0.7890352010726929} 11/07/2021 17:41:47 - INFO - __main__ - Step 145349: {'lr': 1.2179032944321377e-06, 'samples': 27907008, 'steps': 145348, 'loss/train': 0.8851152658462524} 11/07/2021 17:41:48 - INFO - __main__ - Step 145350: {'lr': 1.217380171860627e-06, 'samples': 27907200, 'steps': 145349, 'loss/train': 1.4655922651290894} 11/07/2021 17:41:48 - INFO - __main__ - Step 145351: {'lr': 1.2168571613867253e-06, 'samples': 27907392, 'steps': 145350, 'loss/train': 1.4754093885421753} 11/07/2021 17:41:48 - INFO - __main__ - Step 145352: {'lr': 1.2163342630106821e-06, 'samples': 27907584, 'steps': 145351, 'loss/train': 1.822379231452942} 11/07/2021 17:41:49 - INFO - __main__ - Step 145353: {'lr': 1.2158114767326922e-06, 'samples': 27907776, 'steps': 145352, 'loss/train': 0.7935240268707275} 11/07/2021 17:41:50 - INFO - __main__ - Step 145354: {'lr': 1.215288802553033e-06, 'samples': 27907968, 'steps': 145353, 'loss/train': 1.0834885835647583} 11/07/2021 17:41:50 - INFO - __main__ - Step 145355: {'lr': 1.2147662404719262e-06, 'samples': 27908160, 'steps': 145354, 'loss/train': 1.2737663984298706} 11/07/2021 17:41:50 - INFO - __main__ - Step 145356: {'lr': 1.2142437904896219e-06, 'samples': 27908352, 'steps': 145355, 'loss/train': 1.5267813205718994} 11/07/2021 17:41:51 - INFO - __main__ - Step 145357: {'lr': 1.2137214526063422e-06, 'samples': 27908544, 'steps': 145356, 'loss/train': 1.474877119064331} 11/07/2021 17:41:51 - INFO - __main__ - Step 145358: {'lr': 1.2131992268222814e-06, 'samples': 27908736, 'steps': 145357, 'loss/train': 1.506928563117981} 11/07/2021 17:41:52 - INFO - __main__ - Step 145359: {'lr': 1.2126771131377444e-06, 'samples': 27908928, 'steps': 145358, 'loss/train': 0.7187352180480957} 11/07/2021 17:41:52 - INFO - __main__ - Step 145360: {'lr': 1.212155111552926e-06, 'samples': 27909120, 'steps': 145359, 'loss/train': 1.5477735996246338} 11/07/2021 17:41:53 - INFO - __main__ - Step 145361: {'lr': 1.2116332220680758e-06, 'samples': 27909312, 'steps': 145360, 'loss/train': 1.4590113162994385} 11/07/2021 17:41:53 - INFO - __main__ - Step 145362: {'lr': 1.2111114446834437e-06, 'samples': 27909504, 'steps': 145361, 'loss/train': 1.2348850965499878} 11/07/2021 17:41:53 - INFO - __main__ - Step 145363: {'lr': 1.2105897793992238e-06, 'samples': 27909696, 'steps': 145362, 'loss/train': 1.1899102926254272} 11/07/2021 17:41:55 - INFO - __main__ - Step 145364: {'lr': 1.210068226215666e-06, 'samples': 27909888, 'steps': 145363, 'loss/train': 1.0658762454986572} 11/07/2021 17:41:55 - INFO - __main__ - Step 145365: {'lr': 1.20954678513302e-06, 'samples': 27910080, 'steps': 145364, 'loss/train': 1.3417233228683472} 11/07/2021 17:41:55 - INFO - __main__ - Step 145366: {'lr': 1.209025456151508e-06, 'samples': 27910272, 'steps': 145365, 'loss/train': 1.2973798513412476} 11/07/2021 17:41:56 - INFO - __main__ - Step 145367: {'lr': 1.2085042392713796e-06, 'samples': 27910464, 'steps': 145366, 'loss/train': 1.477564811706543} 11/07/2021 17:41:56 - INFO - __main__ - Step 145368: {'lr': 1.207983134492857e-06, 'samples': 27910656, 'steps': 145367, 'loss/train': 1.25092351436615} 11/07/2021 17:41:57 - INFO - __main__ - Step 145369: {'lr': 1.2074621418161902e-06, 'samples': 27910848, 'steps': 145368, 'loss/train': 1.2735743522644043} 11/07/2021 17:41:57 - INFO - __main__ - Step 145370: {'lr': 1.2069412612416008e-06, 'samples': 27911040, 'steps': 145369, 'loss/train': 1.164362907409668} 11/07/2021 17:41:58 - INFO - __main__ - Step 145371: {'lr': 1.2064204927693111e-06, 'samples': 27911232, 'steps': 145370, 'loss/train': 1.4182391166687012} 11/07/2021 17:41:58 - INFO - __main__ - Step 145372: {'lr': 1.205899836399571e-06, 'samples': 27911424, 'steps': 145371, 'loss/train': 0.9892387986183167} 11/07/2021 17:41:58 - INFO - __main__ - Step 145373: {'lr': 1.2053792921326024e-06, 'samples': 27911616, 'steps': 145372, 'loss/train': 1.3372763395309448} 11/07/2021 17:41:59 - INFO - __main__ - Step 145374: {'lr': 1.2048588599686828e-06, 'samples': 27911808, 'steps': 145373, 'loss/train': 1.6171029806137085} 11/07/2021 17:42:00 - INFO - __main__ - Step 145375: {'lr': 1.2043385399079787e-06, 'samples': 27912000, 'steps': 145374, 'loss/train': 1.2293812036514282} 11/07/2021 17:42:00 - INFO - __main__ - Step 145376: {'lr': 1.2038183319507957e-06, 'samples': 27912192, 'steps': 145375, 'loss/train': 1.4989486932754517} 11/07/2021 17:42:01 - INFO - __main__ - Step 145377: {'lr': 1.2032982360972999e-06, 'samples': 27912384, 'steps': 145376, 'loss/train': 1.1592433452606201} 11/07/2021 17:42:01 - INFO - __main__ - Step 145378: {'lr': 1.2027782523477693e-06, 'samples': 27912576, 'steps': 145377, 'loss/train': 1.02040696144104} 11/07/2021 17:42:02 - INFO - __main__ - Step 145379: {'lr': 1.2022583807024257e-06, 'samples': 27912768, 'steps': 145378, 'loss/train': 2.1063356399536133} 11/07/2021 17:42:02 - INFO - __main__ - Step 145380: {'lr': 1.201738621161519e-06, 'samples': 27912960, 'steps': 145379, 'loss/train': 1.3168609142303467} 11/07/2021 17:42:03 - INFO - __main__ - Step 145381: {'lr': 1.201218973725271e-06, 'samples': 27913152, 'steps': 145380, 'loss/train': 1.4119033813476562} 11/07/2021 17:42:03 - INFO - __main__ - Step 145382: {'lr': 1.200699438393904e-06, 'samples': 27913344, 'steps': 145381, 'loss/train': 0.9836835265159607} 11/07/2021 17:42:04 - INFO - __main__ - Step 145383: {'lr': 1.2001800151676678e-06, 'samples': 27913536, 'steps': 145382, 'loss/train': 1.434599757194519} 11/07/2021 17:42:04 - INFO - __main__ - Step 145384: {'lr': 1.1996607040467845e-06, 'samples': 27913728, 'steps': 145383, 'loss/train': 1.597650408744812} 11/07/2021 17:42:05 - INFO - __main__ - Step 145385: {'lr': 1.1991415050315035e-06, 'samples': 27913920, 'steps': 145384, 'loss/train': 1.1202714443206787} 11/07/2021 17:42:05 - INFO - __main__ - Step 145386: {'lr': 1.1986224181220473e-06, 'samples': 27914112, 'steps': 145385, 'loss/train': 1.0700337886810303} 11/07/2021 17:42:06 - INFO - __main__ - Step 145387: {'lr': 1.1981034433186378e-06, 'samples': 27914304, 'steps': 145386, 'loss/train': 1.3758447170257568} 11/07/2021 17:42:06 - INFO - __main__ - Step 145388: {'lr': 1.1975845806215246e-06, 'samples': 27914496, 'steps': 145387, 'loss/train': 1.3769936561584473} 11/07/2021 17:42:06 - INFO - __main__ - Step 145389: {'lr': 1.1970658300309579e-06, 'samples': 27914688, 'steps': 145388, 'loss/train': 0.9669426679611206} 11/07/2021 17:42:07 - INFO - __main__ - Step 145390: {'lr': 1.1965471915471592e-06, 'samples': 27914880, 'steps': 145389, 'loss/train': 0.97984778881073} 11/07/2021 17:42:08 - INFO - __main__ - Step 145391: {'lr': 1.1960286651703512e-06, 'samples': 27915072, 'steps': 145390, 'loss/train': 1.3576589822769165} 11/07/2021 17:42:08 - INFO - __main__ - Step 145392: {'lr': 1.1955102509007553e-06, 'samples': 27915264, 'steps': 145391, 'loss/train': 1.4873980283737183} 11/07/2021 17:42:08 - INFO - __main__ - Step 145393: {'lr': 1.1949919487386218e-06, 'samples': 27915456, 'steps': 145392, 'loss/train': 1.3658456802368164} 11/07/2021 17:42:09 - INFO - __main__ - Step 145394: {'lr': 1.1944737586842002e-06, 'samples': 27915648, 'steps': 145393, 'loss/train': 1.0570194721221924} 11/07/2021 17:42:10 - INFO - __main__ - Step 145395: {'lr': 1.1939556807377128e-06, 'samples': 27915840, 'steps': 145394, 'loss/train': 1.1156591176986694} 11/07/2021 17:42:10 - INFO - __main__ - Step 145396: {'lr': 1.1934377148993814e-06, 'samples': 27916032, 'steps': 145395, 'loss/train': 0.9494497776031494} 11/07/2021 17:42:11 - INFO - __main__ - Step 145397: {'lr': 1.1929198611694837e-06, 'samples': 27916224, 'steps': 145396, 'loss/train': 1.291961669921875} 11/07/2021 17:42:11 - INFO - __main__ - Step 145398: {'lr': 1.1924021195481582e-06, 'samples': 27916416, 'steps': 145397, 'loss/train': 0.7255545854568481} 11/07/2021 17:42:11 - INFO - __main__ - Step 145399: {'lr': 1.1918844900357385e-06, 'samples': 27916608, 'steps': 145398, 'loss/train': 1.6380748748779297} 11/07/2021 17:42:12 - INFO - __main__ - Step 145400: {'lr': 1.1913669726323905e-06, 'samples': 27916800, 'steps': 145399, 'loss/train': 0.9432445168495178} 11/07/2021 17:42:13 - INFO - __main__ - Step 145401: {'lr': 1.1908495673383924e-06, 'samples': 27916992, 'steps': 145400, 'loss/train': 1.2126247882843018} 11/07/2021 17:42:13 - INFO - __main__ - Step 145402: {'lr': 1.190332274153938e-06, 'samples': 27917184, 'steps': 145401, 'loss/train': 1.0922192335128784} 11/07/2021 17:42:14 - INFO - __main__ - Step 145403: {'lr': 1.189815093079305e-06, 'samples': 27917376, 'steps': 145402, 'loss/train': 0.7858760952949524} 11/07/2021 17:42:14 - INFO - __main__ - Step 145404: {'lr': 1.1892980241146879e-06, 'samples': 27917568, 'steps': 145403, 'loss/train': 1.3606051206588745} 11/07/2021 17:42:14 - INFO - __main__ - Step 145405: {'lr': 1.188781067260336e-06, 'samples': 27917760, 'steps': 145404, 'loss/train': 1.6354156732559204} 11/07/2021 17:42:15 - INFO - __main__ - Step 145406: {'lr': 1.188264222516472e-06, 'samples': 27917952, 'steps': 145405, 'loss/train': 0.8982000946998596} 11/07/2021 17:42:16 - INFO - __main__ - Step 145407: {'lr': 1.1877474898833451e-06, 'samples': 27918144, 'steps': 145406, 'loss/train': 1.2728685140609741} 11/07/2021 17:42:16 - INFO - __main__ - Step 145408: {'lr': 1.1872308693611777e-06, 'samples': 27918336, 'steps': 145407, 'loss/train': 1.0100582838058472} 11/07/2021 17:42:16 - INFO - __main__ - Step 145409: {'lr': 1.1867143609502196e-06, 'samples': 27918528, 'steps': 145408, 'loss/train': 1.4437230825424194} 11/07/2021 17:42:17 - INFO - __main__ - Step 145410: {'lr': 1.1861979646506649e-06, 'samples': 27918720, 'steps': 145409, 'loss/train': 0.9263346195220947} 11/07/2021 17:42:18 - INFO - __main__ - Step 145411: {'lr': 1.1856816804627912e-06, 'samples': 27918912, 'steps': 145410, 'loss/train': 1.2188702821731567} 11/07/2021 17:42:18 - INFO - __main__ - Step 145412: {'lr': 1.1851655083867929e-06, 'samples': 27919104, 'steps': 145411, 'loss/train': 1.0000147819519043} 11/07/2021 17:42:19 - INFO - __main__ - Step 145413: {'lr': 1.1846494484229198e-06, 'samples': 27919296, 'steps': 145412, 'loss/train': 1.544281005859375} 11/07/2021 17:42:19 - INFO - __main__ - Step 145414: {'lr': 1.1841335005714215e-06, 'samples': 27919488, 'steps': 145413, 'loss/train': 1.6445646286010742} 11/07/2021 17:42:19 - INFO - __main__ - Step 145415: {'lr': 1.1836176648324925e-06, 'samples': 27919680, 'steps': 145414, 'loss/train': 1.094948410987854} 11/07/2021 17:42:21 - INFO - __main__ - Step 145416: {'lr': 1.1831019412063826e-06, 'samples': 27919872, 'steps': 145415, 'loss/train': 0.9842973351478577} 11/07/2021 17:42:21 - INFO - __main__ - Step 145417: {'lr': 1.1825863296933415e-06, 'samples': 27920064, 'steps': 145416, 'loss/train': 1.3090813159942627} 11/07/2021 17:42:22 - INFO - __main__ - Step 145418: {'lr': 1.1820708302935913e-06, 'samples': 27920256, 'steps': 145417, 'loss/train': 0.6906574368476868} 11/07/2021 17:42:22 - INFO - __main__ - Step 145419: {'lr': 1.1815554430073538e-06, 'samples': 27920448, 'steps': 145418, 'loss/train': 1.3592917919158936} 11/07/2021 17:42:22 - INFO - __main__ - Step 145420: {'lr': 1.1810401678348792e-06, 'samples': 27920640, 'steps': 145419, 'loss/train': 0.9487009644508362} 11/07/2021 17:42:23 - INFO - __main__ - Step 145421: {'lr': 1.1805250047763616e-06, 'samples': 27920832, 'steps': 145420, 'loss/train': 1.4673197269439697} 11/07/2021 17:42:24 - INFO - __main__ - Step 145422: {'lr': 1.1800099538320785e-06, 'samples': 27921024, 'steps': 145421, 'loss/train': 0.14383815228939056} 11/07/2021 17:42:24 - INFO - __main__ - Step 145423: {'lr': 1.1794950150022522e-06, 'samples': 27921216, 'steps': 145422, 'loss/train': 1.427112102508545} 11/07/2021 17:42:24 - INFO - __main__ - Step 145424: {'lr': 1.1789801882871043e-06, 'samples': 27921408, 'steps': 145423, 'loss/train': 1.2626875638961792} 11/07/2021 17:42:25 - INFO - __main__ - Step 145425: {'lr': 1.1784654736868571e-06, 'samples': 27921600, 'steps': 145424, 'loss/train': 1.6063764095306396} 11/07/2021 17:42:25 - INFO - __main__ - Step 145426: {'lr': 1.1779508712017607e-06, 'samples': 27921792, 'steps': 145425, 'loss/train': 1.2662957906723022} 11/07/2021 17:42:25 - INFO - __main__ - Step 145427: {'lr': 1.1774363808320087e-06, 'samples': 27921984, 'steps': 145426, 'loss/train': 0.9808775782585144} 11/07/2021 17:42:27 - INFO - __main__ - Step 145428: {'lr': 1.176922002577907e-06, 'samples': 27922176, 'steps': 145427, 'loss/train': 1.039857268333435} 11/07/2021 17:42:27 - INFO - __main__ - Step 145429: {'lr': 1.176407736439622e-06, 'samples': 27922368, 'steps': 145428, 'loss/train': 1.533537745475769} 11/07/2021 17:42:28 - INFO - __main__ - Step 145430: {'lr': 1.1758935824174033e-06, 'samples': 27922560, 'steps': 145429, 'loss/train': 1.2654428482055664} 11/07/2021 17:42:28 - INFO - __main__ - Step 145431: {'lr': 1.1753795405115009e-06, 'samples': 27922752, 'steps': 145430, 'loss/train': 1.2945735454559326} 11/07/2021 17:42:28 - INFO - __main__ - Step 145432: {'lr': 1.1748656107221366e-06, 'samples': 27922944, 'steps': 145431, 'loss/train': 0.6851101517677307} 11/07/2021 17:42:29 - INFO - __main__ - Step 145433: {'lr': 1.1743517930495329e-06, 'samples': 27923136, 'steps': 145432, 'loss/train': 1.7617459297180176} 11/07/2021 17:42:30 - INFO - __main__ - Step 145434: {'lr': 1.173838087493939e-06, 'samples': 27923328, 'steps': 145433, 'loss/train': 1.4424678087234497} 11/07/2021 17:42:30 - INFO - __main__ - Step 145435: {'lr': 1.1733244940555499e-06, 'samples': 27923520, 'steps': 145434, 'loss/train': 1.4715917110443115} 11/07/2021 17:42:31 - INFO - __main__ - Step 145436: {'lr': 1.1728110127346424e-06, 'samples': 27923712, 'steps': 145435, 'loss/train': 1.127629041671753} 11/07/2021 17:42:31 - INFO - __main__ - Step 145437: {'lr': 1.1722976435314115e-06, 'samples': 27923904, 'steps': 145436, 'loss/train': 1.3698415756225586} 11/07/2021 17:42:32 - INFO - __main__ - Step 145438: {'lr': 1.1717843864461065e-06, 'samples': 27924096, 'steps': 145437, 'loss/train': 0.5848445296287537} 11/07/2021 17:42:32 - INFO - __main__ - Step 145439: {'lr': 1.1712712414789496e-06, 'samples': 27924288, 'steps': 145438, 'loss/train': 0.6499578952789307} 11/07/2021 17:42:33 - INFO - __main__ - Step 145440: {'lr': 1.1707582086301905e-06, 'samples': 27924480, 'steps': 145439, 'loss/train': 0.6814635992050171} 11/07/2021 17:42:33 - INFO - __main__ - Step 145441: {'lr': 1.1702452879000514e-06, 'samples': 27924672, 'steps': 145440, 'loss/train': 1.0741801261901855} 11/07/2021 17:42:33 - INFO - __main__ - Step 145442: {'lr': 1.1697324792887544e-06, 'samples': 27924864, 'steps': 145441, 'loss/train': 1.381041169166565} 11/07/2021 17:42:34 - INFO - __main__ - Step 145443: {'lr': 1.1692197827965211e-06, 'samples': 27925056, 'steps': 145442, 'loss/train': 1.175632357597351} 11/07/2021 17:42:35 - INFO - __main__ - Step 145444: {'lr': 1.1687071984236298e-06, 'samples': 27925248, 'steps': 145443, 'loss/train': 0.9982313513755798} 11/07/2021 17:42:35 - INFO - __main__ - Step 145445: {'lr': 1.1681947261702464e-06, 'samples': 27925440, 'steps': 145444, 'loss/train': 1.2591893672943115} 11/07/2021 17:42:35 - INFO - __main__ - Step 145446: {'lr': 1.1676823660366486e-06, 'samples': 27925632, 'steps': 145445, 'loss/train': 1.2882307767868042} 11/07/2021 17:42:36 - INFO - __main__ - Step 145447: {'lr': 1.1671701180230588e-06, 'samples': 27925824, 'steps': 145446, 'loss/train': 1.1137256622314453} 11/07/2021 17:42:37 - INFO - __main__ - Step 145448: {'lr': 1.1666579821296986e-06, 'samples': 27926016, 'steps': 145447, 'loss/train': 1.3505507707595825} 11/07/2021 17:42:37 - INFO - __main__ - Step 145449: {'lr': 1.1661459583567902e-06, 'samples': 27926208, 'steps': 145448, 'loss/train': 1.3185441493988037} 11/07/2021 17:42:38 - INFO - __main__ - Step 145450: {'lr': 1.165634046704611e-06, 'samples': 27926400, 'steps': 145449, 'loss/train': 1.1856845617294312} 11/07/2021 17:42:38 - INFO - __main__ - Step 145451: {'lr': 1.1651222471733281e-06, 'samples': 27926592, 'steps': 145450, 'loss/train': 0.2168867290019989} 11/07/2021 17:42:38 - INFO - __main__ - Step 145452: {'lr': 1.1646105597631906e-06, 'samples': 27926784, 'steps': 145451, 'loss/train': 1.2289131879806519} 11/07/2021 17:42:39 - INFO - __main__ - Step 145453: {'lr': 1.1640989844744764e-06, 'samples': 27926976, 'steps': 145452, 'loss/train': 1.3154773712158203} 11/07/2021 17:42:40 - INFO - __main__ - Step 145454: {'lr': 1.1635875213073522e-06, 'samples': 27927168, 'steps': 145453, 'loss/train': 1.35237455368042} 11/07/2021 17:42:40 - INFO - __main__ - Step 145455: {'lr': 1.1630761702620952e-06, 'samples': 27927360, 'steps': 145454, 'loss/train': 1.3085386753082275} 11/07/2021 17:42:40 - INFO - __main__ - Step 145456: {'lr': 1.1625649313388998e-06, 'samples': 27927552, 'steps': 145455, 'loss/train': 0.9046837091445923} 11/07/2021 17:42:41 - INFO - __main__ - Step 145457: {'lr': 1.1620538045380158e-06, 'samples': 27927744, 'steps': 145456, 'loss/train': 1.2697229385375977} 11/07/2021 17:42:41 - INFO - __main__ - Step 145458: {'lr': 1.1615427898596653e-06, 'samples': 27927936, 'steps': 145457, 'loss/train': 1.3662793636322021} 11/07/2021 17:42:43 - INFO - __main__ - Step 145459: {'lr': 1.161031887304098e-06, 'samples': 27928128, 'steps': 145458, 'loss/train': 0.9056562185287476} 11/07/2021 17:42:43 - INFO - __main__ - Step 145460: {'lr': 1.1605210968715086e-06, 'samples': 27928320, 'steps': 145459, 'loss/train': 2.1238505840301514} 11/07/2021 17:42:43 - INFO - __main__ - Step 145461: {'lr': 1.160010418562174e-06, 'samples': 27928512, 'steps': 145460, 'loss/train': 0.07803796976804733} 11/07/2021 17:42:44 - INFO - __main__ - Step 145462: {'lr': 1.159499852376289e-06, 'samples': 27928704, 'steps': 145461, 'loss/train': 1.2194249629974365} 11/07/2021 17:42:44 - INFO - __main__ - Step 145463: {'lr': 1.1589893983140753e-06, 'samples': 27928896, 'steps': 145462, 'loss/train': 1.2368477582931519} 11/07/2021 17:42:45 - INFO - __main__ - Step 145464: {'lr': 1.1584790563758108e-06, 'samples': 27929088, 'steps': 145463, 'loss/train': 1.0649654865264893} 11/07/2021 17:42:46 - INFO - __main__ - Step 145465: {'lr': 1.1579688265616895e-06, 'samples': 27929280, 'steps': 145464, 'loss/train': 1.2930651903152466} 11/07/2021 17:42:46 - INFO - __main__ - Step 145466: {'lr': 1.1574587088719335e-06, 'samples': 27929472, 'steps': 145465, 'loss/train': 0.7195792198181152} 11/07/2021 17:42:46 - INFO - __main__ - Step 145467: {'lr': 1.1569487033067926e-06, 'samples': 27929664, 'steps': 145466, 'loss/train': 1.1378157138824463} 11/07/2021 17:42:47 - INFO - __main__ - Step 145468: {'lr': 1.156438809866489e-06, 'samples': 27929856, 'steps': 145467, 'loss/train': 1.0256199836730957} 11/07/2021 17:42:47 - INFO - __main__ - Step 145469: {'lr': 1.1559290285512724e-06, 'samples': 27930048, 'steps': 145468, 'loss/train': 1.2105519771575928} 11/07/2021 17:42:48 - INFO - __main__ - Step 145470: {'lr': 1.155419359361337e-06, 'samples': 27930240, 'steps': 145469, 'loss/train': 1.831721305847168} 11/07/2021 17:42:49 - INFO - __main__ - Step 145471: {'lr': 1.1549098022969328e-06, 'samples': 27930432, 'steps': 145470, 'loss/train': 1.3197599649429321} 11/07/2021 17:42:49 - INFO - __main__ - Step 145472: {'lr': 1.1544003573582818e-06, 'samples': 27930624, 'steps': 145471, 'loss/train': 0.879317581653595} 11/07/2021 17:42:49 - INFO - __main__ - Step 145473: {'lr': 1.1538910245456058e-06, 'samples': 27930816, 'steps': 145472, 'loss/train': 1.45632803440094} 11/07/2021 17:42:50 - INFO - __main__ - Step 145474: {'lr': 1.1533818038591826e-06, 'samples': 27931008, 'steps': 145473, 'loss/train': 1.403232216835022} 11/07/2021 17:42:51 - INFO - __main__ - Step 145475: {'lr': 1.1528726952991786e-06, 'samples': 27931200, 'steps': 145474, 'loss/train': 1.3661525249481201} 11/07/2021 17:42:51 - INFO - __main__ - Step 145476: {'lr': 1.1523636988658436e-06, 'samples': 27931392, 'steps': 145475, 'loss/train': 1.3075259923934937} 11/07/2021 17:42:51 - INFO - __main__ - Step 145477: {'lr': 1.1518548145594554e-06, 'samples': 27931584, 'steps': 145476, 'loss/train': 1.257891297340393} 11/07/2021 17:42:52 - INFO - __main__ - Step 145478: {'lr': 1.1513460423801524e-06, 'samples': 27931776, 'steps': 145477, 'loss/train': 1.9069355726242065} 11/07/2021 17:42:52 - INFO - __main__ - Step 145479: {'lr': 1.1508373823282402e-06, 'samples': 27931968, 'steps': 145478, 'loss/train': 2.1792171001434326} 11/07/2021 17:42:53 - INFO - __main__ - Step 145480: {'lr': 1.1503288344039132e-06, 'samples': 27932160, 'steps': 145479, 'loss/train': 1.1388148069381714} 11/07/2021 17:42:53 - INFO - __main__ - Step 145481: {'lr': 1.1498203986074207e-06, 'samples': 27932352, 'steps': 145480, 'loss/train': 1.0929112434387207} 11/07/2021 17:42:54 - INFO - __main__ - Step 145482: {'lr': 1.1493120749389573e-06, 'samples': 27932544, 'steps': 145481, 'loss/train': 1.231534719467163} 11/07/2021 17:42:54 - INFO - __main__ - Step 145483: {'lr': 1.148803863398773e-06, 'samples': 27932736, 'steps': 145482, 'loss/train': 1.3627145290374756} 11/07/2021 17:42:54 - INFO - __main__ - Step 145484: {'lr': 1.1482957639871172e-06, 'samples': 27932928, 'steps': 145483, 'loss/train': 0.7329543232917786} 11/07/2021 17:42:56 - INFO - __main__ - Step 145485: {'lr': 1.1477877767041844e-06, 'samples': 27933120, 'steps': 145484, 'loss/train': 1.1410921812057495} 11/07/2021 17:42:56 - INFO - __main__ - Step 145486: {'lr': 1.1472799015502244e-06, 'samples': 27933312, 'steps': 145485, 'loss/train': 1.4133707284927368} 11/07/2021 17:42:56 - INFO - __main__ - Step 145487: {'lr': 1.1467721385254593e-06, 'samples': 27933504, 'steps': 145486, 'loss/train': 1.1764601469039917} 11/07/2021 17:42:57 - INFO - __main__ - Step 145488: {'lr': 1.1462644876301387e-06, 'samples': 27933696, 'steps': 145487, 'loss/train': 1.233707070350647} 11/07/2021 17:42:57 - INFO - __main__ - Step 145489: {'lr': 1.1457569488644293e-06, 'samples': 27933888, 'steps': 145488, 'loss/train': 1.6701347827911377} 11/07/2021 17:42:59 - INFO - __main__ - Step 145490: {'lr': 1.1452495222286363e-06, 'samples': 27934080, 'steps': 145489, 'loss/train': 1.2799235582351685} 11/07/2021 17:42:59 - INFO - __main__ - Step 145491: {'lr': 1.1447422077229542e-06, 'samples': 27934272, 'steps': 145490, 'loss/train': 1.4607748985290527} 11/07/2021 17:42:59 - INFO - __main__ - Step 145492: {'lr': 1.1442350053476048e-06, 'samples': 27934464, 'steps': 145491, 'loss/train': 1.5181701183319092} 11/07/2021 17:43:00 - INFO - __main__ - Step 145493: {'lr': 1.1437279151028102e-06, 'samples': 27934656, 'steps': 145492, 'loss/train': 1.3381149768829346} 11/07/2021 17:43:00 - INFO - __main__ - Step 145494: {'lr': 1.1432209369888202e-06, 'samples': 27934848, 'steps': 145493, 'loss/train': 1.2724404335021973} 11/07/2021 17:43:01 - INFO - __main__ - Step 145495: {'lr': 1.142714071005857e-06, 'samples': 27935040, 'steps': 145494, 'loss/train': 2.697531223297119} 11/07/2021 17:43:01 - INFO - __main__ - Step 145496: {'lr': 1.1422073171541424e-06, 'samples': 27935232, 'steps': 145495, 'loss/train': 2.6746623516082764} 11/07/2021 17:43:02 - INFO - __main__ - Step 145497: {'lr': 1.1417006754338988e-06, 'samples': 27935424, 'steps': 145496, 'loss/train': 2.597987413406372} 11/07/2021 17:43:02 - INFO - __main__ - Step 145498: {'lr': 1.1411941458453755e-06, 'samples': 27935616, 'steps': 145497, 'loss/train': 1.5607088804244995} 11/07/2021 17:43:03 - INFO - __main__ - Step 145499: {'lr': 1.140687728388795e-06, 'samples': 27935808, 'steps': 145498, 'loss/train': 1.3636622428894043} 11/07/2021 17:43:03 - INFO - __main__ - Step 145500: {'lr': 1.1401814230643791e-06, 'samples': 27936000, 'steps': 145499, 'loss/train': 0.8455096483230591} 11/07/2021 17:43:03 - INFO - __main__ - Step 145501: {'lr': 1.13967522987235e-06, 'samples': 27936192, 'steps': 145500, 'loss/train': 1.3607752323150635} 11/07/2021 17:43:04 - INFO - __main__ - Step 145502: {'lr': 1.1391691488129296e-06, 'samples': 27936384, 'steps': 145501, 'loss/train': 1.2344887256622314} 11/07/2021 17:43:05 - INFO - __main__ - Step 145503: {'lr': 1.1386631798863956e-06, 'samples': 27936576, 'steps': 145502, 'loss/train': 1.4279123544692993} 11/07/2021 17:43:05 - INFO - __main__ - Step 145504: {'lr': 1.1381573230929143e-06, 'samples': 27936768, 'steps': 145503, 'loss/train': 1.8211140632629395} 11/07/2021 17:43:05 - INFO - __main__ - Step 145505: {'lr': 1.1376515784327634e-06, 'samples': 27936960, 'steps': 145504, 'loss/train': 0.7652351260185242} 11/07/2021 17:43:06 - INFO - __main__ - Step 145506: {'lr': 1.1371459459061095e-06, 'samples': 27937152, 'steps': 145505, 'loss/train': 0.9281927347183228} 11/07/2021 17:43:06 - INFO - __main__ - Step 145507: {'lr': 1.1366404255132578e-06, 'samples': 27937344, 'steps': 145506, 'loss/train': 1.366552710533142} 11/07/2021 17:43:07 - INFO - __main__ - Step 145508: {'lr': 1.1361350172543749e-06, 'samples': 27937536, 'steps': 145507, 'loss/train': 1.7083568572998047} 11/07/2021 17:43:08 - INFO - __main__ - Step 145509: {'lr': 1.1356297211297107e-06, 'samples': 27937728, 'steps': 145508, 'loss/train': 1.2783153057098389} 11/07/2021 17:43:08 - INFO - __main__ - Step 145510: {'lr': 1.135124537139487e-06, 'samples': 27937920, 'steps': 145509, 'loss/train': 1.2781167030334473} 11/07/2021 17:43:08 - INFO - __main__ - Step 145511: {'lr': 1.134619465283926e-06, 'samples': 27938112, 'steps': 145510, 'loss/train': 1.285136342048645} 11/07/2021 17:43:09 - INFO - __main__ - Step 145512: {'lr': 1.1341145055632774e-06, 'samples': 27938304, 'steps': 145511, 'loss/train': 1.0621705055236816} 11/07/2021 17:43:10 - INFO - __main__ - Step 145513: {'lr': 1.1336096579777632e-06, 'samples': 27938496, 'steps': 145512, 'loss/train': 1.3189045190811157} 11/07/2021 17:43:10 - INFO - __main__ - Step 145514: {'lr': 1.1331049225276059e-06, 'samples': 27938688, 'steps': 145513, 'loss/train': 1.7031008005142212} 11/07/2021 17:43:11 - INFO - __main__ - Step 145515: {'lr': 1.132600299213027e-06, 'samples': 27938880, 'steps': 145514, 'loss/train': 1.3552682399749756} 11/07/2021 17:43:11 - INFO - __main__ - Step 145516: {'lr': 1.1320957880342486e-06, 'samples': 27939072, 'steps': 145515, 'loss/train': 1.4516410827636719} 11/07/2021 17:43:11 - INFO - __main__ - Step 145517: {'lr': 1.1315913889915209e-06, 'samples': 27939264, 'steps': 145516, 'loss/train': 1.2246510982513428} 11/07/2021 17:43:12 - INFO - __main__ - Step 145518: {'lr': 1.1310871020850377e-06, 'samples': 27939456, 'steps': 145517, 'loss/train': 1.4227283000946045} 11/07/2021 17:43:13 - INFO - __main__ - Step 145519: {'lr': 1.130582927315077e-06, 'samples': 27939648, 'steps': 145518, 'loss/train': 1.0046292543411255} 11/07/2021 17:43:13 - INFO - __main__ - Step 145520: {'lr': 1.130078864681805e-06, 'samples': 27939840, 'steps': 145519, 'loss/train': 1.4280121326446533} 11/07/2021 17:43:13 - INFO - __main__ - Step 145521: {'lr': 1.1295749141854994e-06, 'samples': 27940032, 'steps': 145520, 'loss/train': 1.1189539432525635} 11/07/2021 17:43:14 - INFO - __main__ - Step 145522: {'lr': 1.1290710758263545e-06, 'samples': 27940224, 'steps': 145521, 'loss/train': 1.6191506385803223} 11/07/2021 17:43:14 - INFO - __main__ - Step 145523: {'lr': 1.12856734960462e-06, 'samples': 27940416, 'steps': 145522, 'loss/train': 1.2774651050567627} 11/07/2021 17:43:15 - INFO - __main__ - Step 145524: {'lr': 1.1280637355205182e-06, 'samples': 27940608, 'steps': 145523, 'loss/train': 1.8109381198883057} 11/07/2021 17:43:16 - INFO - __main__ - Step 145525: {'lr': 1.1275602335742429e-06, 'samples': 27940800, 'steps': 145524, 'loss/train': 1.155096173286438} 11/07/2021 17:43:16 - INFO - __main__ - Step 145526: {'lr': 1.1270568437660723e-06, 'samples': 27940992, 'steps': 145525, 'loss/train': 1.4329456090927124} 11/07/2021 17:43:16 - INFO - __main__ - Step 145527: {'lr': 1.1265535660962001e-06, 'samples': 27941184, 'steps': 145526, 'loss/train': 0.6878204941749573} 11/07/2021 17:43:17 - INFO - __main__ - Step 145528: {'lr': 1.1260504005648765e-06, 'samples': 27941376, 'steps': 145527, 'loss/train': 1.3952391147613525} 11/07/2021 17:43:17 - INFO - __main__ - Step 145529: {'lr': 1.1255473471722954e-06, 'samples': 27941568, 'steps': 145528, 'loss/train': 1.476293921470642} 11/07/2021 17:43:18 - INFO - __main__ - Step 145530: {'lr': 1.125044405918707e-06, 'samples': 27941760, 'steps': 145529, 'loss/train': 1.1743186712265015} 11/07/2021 17:43:18 - INFO - __main__ - Step 145531: {'lr': 1.124541576804361e-06, 'samples': 27941952, 'steps': 145530, 'loss/train': 1.334825038909912} 11/07/2021 17:43:19 - INFO - __main__ - Step 145532: {'lr': 1.1240388598294238e-06, 'samples': 27942144, 'steps': 145531, 'loss/train': 1.3498343229293823} 11/07/2021 17:43:19 - INFO - __main__ - Step 145533: {'lr': 1.1235362549941453e-06, 'samples': 27942336, 'steps': 145532, 'loss/train': 1.3067481517791748} 11/07/2021 17:43:20 - INFO - __main__ - Step 145534: {'lr': 1.1230337622987752e-06, 'samples': 27942528, 'steps': 145533, 'loss/train': 1.4319334030151367} 11/07/2021 17:43:21 - INFO - __main__ - Step 145535: {'lr': 1.1225313817435355e-06, 'samples': 27942720, 'steps': 145534, 'loss/train': 0.9244388341903687} 11/07/2021 17:43:21 - INFO - __main__ - Step 145536: {'lr': 1.1220291133286486e-06, 'samples': 27942912, 'steps': 145535, 'loss/train': 1.3831466436386108} 11/07/2021 17:43:21 - INFO - __main__ - Step 145537: {'lr': 1.1215269570543086e-06, 'samples': 27943104, 'steps': 145536, 'loss/train': 1.1643470525741577} 11/07/2021 17:43:22 - INFO - __main__ - Step 145538: {'lr': 1.1210249129207929e-06, 'samples': 27943296, 'steps': 145537, 'loss/train': 1.5462567806243896} 11/07/2021 17:43:22 - INFO - __main__ - Step 145539: {'lr': 1.1205229809282958e-06, 'samples': 27943488, 'steps': 145538, 'loss/train': 1.7047762870788574} 11/07/2021 17:43:23 - INFO - __main__ - Step 145540: {'lr': 1.1200211610770394e-06, 'samples': 27943680, 'steps': 145539, 'loss/train': 1.8239277601242065} 11/07/2021 17:43:23 - INFO - __main__ - Step 145541: {'lr': 1.1195194533672737e-06, 'samples': 27943872, 'steps': 145540, 'loss/train': 1.0432528257369995} 11/07/2021 17:43:24 - INFO - __main__ - Step 145542: {'lr': 1.1190178577991928e-06, 'samples': 27944064, 'steps': 145541, 'loss/train': 1.2766821384429932} 11/07/2021 17:43:24 - INFO - __main__ - Step 145543: {'lr': 1.1185163743730465e-06, 'samples': 27944256, 'steps': 145542, 'loss/train': 1.9138495922088623} 11/07/2021 17:43:25 - INFO - __main__ - Step 145544: {'lr': 1.1180150030890846e-06, 'samples': 27944448, 'steps': 145543, 'loss/train': 1.4252631664276123} 11/07/2021 17:43:26 - INFO - __main__ - Step 145545: {'lr': 1.1175137439475013e-06, 'samples': 27944640, 'steps': 145544, 'loss/train': 1.183377981185913} 11/07/2021 17:43:26 - INFO - __main__ - Step 145546: {'lr': 1.1170125969484912e-06, 'samples': 27944832, 'steps': 145545, 'loss/train': 0.6745740175247192} 11/07/2021 17:43:26 - INFO - __main__ - Step 145547: {'lr': 1.1165115620923317e-06, 'samples': 27945024, 'steps': 145546, 'loss/train': 0.955558717250824} 11/07/2021 17:43:27 - INFO - __main__ - Step 145548: {'lr': 1.1160106393792445e-06, 'samples': 27945216, 'steps': 145547, 'loss/train': 0.9329259395599365} 11/07/2021 17:43:27 - INFO - __main__ - Step 145549: {'lr': 1.1155098288094245e-06, 'samples': 27945408, 'steps': 145548, 'loss/train': 1.0372445583343506} 11/07/2021 17:43:28 - INFO - __main__ - Step 145550: {'lr': 1.115009130383121e-06, 'samples': 27945600, 'steps': 145549, 'loss/train': 1.1050143241882324} 11/07/2021 17:43:28 - INFO - __main__ - Step 145551: {'lr': 1.1145085441005565e-06, 'samples': 27945792, 'steps': 145550, 'loss/train': 0.9686985015869141} 11/07/2021 17:43:29 - INFO - __main__ - Step 145552: {'lr': 1.1140080699619527e-06, 'samples': 27945984, 'steps': 145551, 'loss/train': 1.7278746366500854} 11/07/2021 17:43:29 - INFO - __main__ - Step 145553: {'lr': 1.1135077079675316e-06, 'samples': 27946176, 'steps': 145552, 'loss/train': 1.2912876605987549} 11/07/2021 17:43:29 - INFO - __main__ - Step 145554: {'lr': 1.1130074581175431e-06, 'samples': 27946368, 'steps': 145553, 'loss/train': 0.5835559368133545} 11/07/2021 17:43:30 - INFO - __main__ - Step 145555: {'lr': 1.1125073204121538e-06, 'samples': 27946560, 'steps': 145554, 'loss/train': 0.9252521395683289} 11/07/2021 17:43:31 - INFO - __main__ - Step 145556: {'lr': 1.112007294851669e-06, 'samples': 27946752, 'steps': 145555, 'loss/train': 1.1910866498947144} 11/07/2021 17:43:31 - INFO - __main__ - Step 145557: {'lr': 1.1115073814362553e-06, 'samples': 27946944, 'steps': 145556, 'loss/train': 1.705257773399353} 11/07/2021 17:43:32 - INFO - __main__ - Step 145558: {'lr': 1.1110075801661622e-06, 'samples': 27947136, 'steps': 145557, 'loss/train': 1.2580007314682007} 11/07/2021 17:43:32 - INFO - __main__ - Step 145559: {'lr': 1.1105078910416121e-06, 'samples': 27947328, 'steps': 145558, 'loss/train': 1.805027961730957} 11/07/2021 17:43:32 - INFO - __main__ - Step 145560: {'lr': 1.1100083140627993e-06, 'samples': 27947520, 'steps': 145559, 'loss/train': 1.575459599494934} 11/07/2021 17:43:33 - INFO - __main__ - Step 145561: {'lr': 1.109508849230001e-06, 'samples': 27947712, 'steps': 145560, 'loss/train': 1.3363149166107178} 11/07/2021 17:43:34 - INFO - __main__ - Step 145562: {'lr': 1.1090094965434117e-06, 'samples': 27947904, 'steps': 145561, 'loss/train': 0.9571558833122253} 11/07/2021 17:43:34 - INFO - __main__ - Step 145563: {'lr': 1.1085102560032534e-06, 'samples': 27948096, 'steps': 145562, 'loss/train': 1.1188064813613892} 11/07/2021 17:43:34 - INFO - __main__ - Step 145564: {'lr': 1.1080111276097759e-06, 'samples': 27948288, 'steps': 145563, 'loss/train': 1.4272319078445435} 11/07/2021 17:43:35 - INFO - __main__ - Step 145565: {'lr': 1.1075121113631736e-06, 'samples': 27948480, 'steps': 145564, 'loss/train': 1.2583075761795044} 11/07/2021 17:43:36 - INFO - __main__ - Step 145566: {'lr': 1.1070132072636963e-06, 'samples': 27948672, 'steps': 145565, 'loss/train': 1.3470865488052368} 11/07/2021 17:43:36 - INFO - __main__ - Step 145567: {'lr': 1.1065144153115658e-06, 'samples': 27948864, 'steps': 145566, 'loss/train': 1.3442740440368652} 11/07/2021 17:43:36 - INFO - __main__ - Step 145568: {'lr': 1.1060157355069766e-06, 'samples': 27949056, 'steps': 145567, 'loss/train': 1.1753647327423096} 11/07/2021 17:43:37 - INFO - __main__ - Step 145569: {'lr': 1.1055171678501786e-06, 'samples': 27949248, 'steps': 145568, 'loss/train': 1.2919385433197021} 11/07/2021 17:43:37 - INFO - __main__ - Step 145570: {'lr': 1.1050187123413936e-06, 'samples': 27949440, 'steps': 145569, 'loss/train': 1.4884817600250244} 11/07/2021 17:43:38 - INFO - __main__ - Step 145571: {'lr': 1.1045203689808713e-06, 'samples': 27949632, 'steps': 145570, 'loss/train': 1.5325969457626343} 11/07/2021 17:43:39 - INFO - __main__ - Step 145572: {'lr': 1.1040221377687786e-06, 'samples': 27949824, 'steps': 145571, 'loss/train': 0.6696421504020691} 11/07/2021 17:43:39 - INFO - __main__ - Step 145573: {'lr': 1.103524018705393e-06, 'samples': 27950016, 'steps': 145572, 'loss/train': 1.2430448532104492} 11/07/2021 17:43:39 - INFO - __main__ - Step 145574: {'lr': 1.1030260117909086e-06, 'samples': 27950208, 'steps': 145573, 'loss/train': 1.1508110761642456} 11/07/2021 17:43:40 - INFO - __main__ - Step 145575: {'lr': 1.1025281170255752e-06, 'samples': 27950400, 'steps': 145574, 'loss/train': 0.7392030954360962} 11/07/2021 17:43:41 - INFO - __main__ - Step 145576: {'lr': 1.1020303344095871e-06, 'samples': 27950592, 'steps': 145575, 'loss/train': 1.1835911273956299} 11/07/2021 17:43:41 - INFO - __main__ - Step 145577: {'lr': 1.1015326639431944e-06, 'samples': 27950784, 'steps': 145576, 'loss/train': 1.6028685569763184} 11/07/2021 17:43:41 - INFO - __main__ - Step 145578: {'lr': 1.1010351056266188e-06, 'samples': 27950976, 'steps': 145577, 'loss/train': 1.0968390703201294} 11/07/2021 17:43:42 - INFO - __main__ - Step 145579: {'lr': 1.1005376594600547e-06, 'samples': 27951168, 'steps': 145578, 'loss/train': 1.2614563703536987} 11/07/2021 17:43:42 - INFO - __main__ - Step 145580: {'lr': 1.1000403254437518e-06, 'samples': 27951360, 'steps': 145579, 'loss/train': 1.3163247108459473} 11/07/2021 17:43:43 - INFO - __main__ - Step 145581: {'lr': 1.0995431035779325e-06, 'samples': 27951552, 'steps': 145580, 'loss/train': 1.358678936958313} 11/07/2021 17:43:44 - INFO - __main__ - Step 145582: {'lr': 1.0990459938628183e-06, 'samples': 27951744, 'steps': 145581, 'loss/train': 1.6790388822555542} 11/07/2021 17:43:44 - INFO - __main__ - Step 145583: {'lr': 1.0985489962986316e-06, 'samples': 27951936, 'steps': 145582, 'loss/train': 1.0993688106536865} 11/07/2021 17:43:44 - INFO - __main__ - Step 145584: {'lr': 1.0980521108855946e-06, 'samples': 27952128, 'steps': 145583, 'loss/train': 1.2829217910766602} 11/07/2021 17:43:45 - INFO - __main__ - Step 145585: {'lr': 1.0975553376239566e-06, 'samples': 27952320, 'steps': 145584, 'loss/train': 0.7557047009468079} 11/07/2021 17:43:46 - INFO - __main__ - Step 145586: {'lr': 1.0970586765138846e-06, 'samples': 27952512, 'steps': 145585, 'loss/train': 0.7455961108207703} 11/07/2021 17:43:46 - INFO - __main__ - Step 145587: {'lr': 1.096562127555656e-06, 'samples': 27952704, 'steps': 145586, 'loss/train': 1.4360759258270264} 11/07/2021 17:43:46 - INFO - __main__ - Step 145588: {'lr': 1.0960656907494927e-06, 'samples': 27952896, 'steps': 145587, 'loss/train': 1.4733531475067139} 11/07/2021 17:43:47 - INFO - __main__ - Step 145589: {'lr': 1.095569366095589e-06, 'samples': 27953088, 'steps': 145588, 'loss/train': 0.6992353200912476} 11/07/2021 17:43:47 - INFO - __main__ - Step 145590: {'lr': 1.0950731535941672e-06, 'samples': 27953280, 'steps': 145589, 'loss/train': 0.5939700603485107} 11/07/2021 17:43:48 - INFO - __main__ - Step 145591: {'lr': 1.0945770532454769e-06, 'samples': 27953472, 'steps': 145590, 'loss/train': 0.831588864326477} 11/07/2021 17:43:48 - INFO - __main__ - Step 145592: {'lr': 1.0940810650497402e-06, 'samples': 27953664, 'steps': 145591, 'loss/train': 1.3539997339248657} 11/07/2021 17:43:49 - INFO - __main__ - Step 145593: {'lr': 1.0935851890071512e-06, 'samples': 27953856, 'steps': 145592, 'loss/train': 1.0588886737823486} 11/07/2021 17:43:49 - INFO - __main__ - Step 145594: {'lr': 1.0930894251179324e-06, 'samples': 27954048, 'steps': 145593, 'loss/train': 1.2417114973068237} 11/07/2021 17:43:50 - INFO - __main__ - Step 145595: {'lr': 1.092593773382361e-06, 'samples': 27954240, 'steps': 145594, 'loss/train': 1.5207303762435913} 11/07/2021 17:43:50 - INFO - __main__ - Step 145596: {'lr': 1.0920982338006036e-06, 'samples': 27954432, 'steps': 145595, 'loss/train': 1.5652661323547363} 11/07/2021 17:43:51 - INFO - __main__ - Step 145597: {'lr': 1.0916028063729377e-06, 'samples': 27954624, 'steps': 145596, 'loss/train': 1.3593225479125977} 11/07/2021 17:43:51 - INFO - __main__ - Step 145598: {'lr': 1.09110749109953e-06, 'samples': 27954816, 'steps': 145597, 'loss/train': 1.237810492515564} 11/07/2021 17:43:52 - INFO - __main__ - Step 145599: {'lr': 1.0906122879806301e-06, 'samples': 27955008, 'steps': 145598, 'loss/train': 1.104689121246338} 11/07/2021 17:43:52 - INFO - __main__ - Step 145600: {'lr': 1.0901171970164604e-06, 'samples': 27955200, 'steps': 145599, 'loss/train': 1.8409667015075684} 11/07/2021 17:43:52 - INFO - __main__ - Step 145601: {'lr': 1.0896222182072423e-06, 'samples': 27955392, 'steps': 145600, 'loss/train': 1.086053490638733} 11/07/2021 17:43:54 - INFO - __main__ - Step 145602: {'lr': 1.0891273515531986e-06, 'samples': 27955584, 'steps': 145601, 'loss/train': 1.3957936763763428} 11/07/2021 17:43:54 - INFO - __main__ - Step 145603: {'lr': 1.0886325970545786e-06, 'samples': 27955776, 'steps': 145602, 'loss/train': 1.3061085939407349} 11/07/2021 17:43:54 - INFO - __main__ - Step 145604: {'lr': 1.0881379547115488e-06, 'samples': 27955968, 'steps': 145603, 'loss/train': 1.1266206502914429} 11/07/2021 17:43:55 - INFO - __main__ - Step 145605: {'lr': 1.0876434245243593e-06, 'samples': 27956160, 'steps': 145604, 'loss/train': 1.238461971282959} 11/07/2021 17:43:55 - INFO - __main__ - Step 145606: {'lr': 1.08714900649326e-06, 'samples': 27956352, 'steps': 145605, 'loss/train': 1.3566787242889404} 11/07/2021 17:43:55 - INFO - __main__ - Step 145607: {'lr': 1.0866547006184447e-06, 'samples': 27956544, 'steps': 145606, 'loss/train': 2.016876459121704} 11/07/2021 17:43:56 - INFO - __main__ - Step 145608: {'lr': 1.0861605069001357e-06, 'samples': 27956736, 'steps': 145607, 'loss/train': 1.0625745058059692} 11/07/2021 17:43:57 - INFO - __main__ - Step 145609: {'lr': 1.0856664253385552e-06, 'samples': 27956928, 'steps': 145608, 'loss/train': 1.5351145267486572} 11/07/2021 17:43:57 - INFO - __main__ - Step 145610: {'lr': 1.085172455933925e-06, 'samples': 27957120, 'steps': 145609, 'loss/train': 1.0765299797058105} 11/07/2021 17:43:57 - INFO - __main__ - Step 145611: {'lr': 1.0846785986864948e-06, 'samples': 27957312, 'steps': 145610, 'loss/train': 1.6443709135055542} 11/07/2021 17:43:58 - INFO - __main__ - Step 145612: {'lr': 1.084184853596487e-06, 'samples': 27957504, 'steps': 145611, 'loss/train': 1.637351632118225} 11/07/2021 17:43:59 - INFO - __main__ - Step 145613: {'lr': 1.0836912206640681e-06, 'samples': 27957696, 'steps': 145612, 'loss/train': 1.835349202156067} 11/07/2021 17:43:59 - INFO - __main__ - Step 145614: {'lr': 1.0831976998895155e-06, 'samples': 27957888, 'steps': 145613, 'loss/train': 1.16742742061615} 11/07/2021 17:44:00 - INFO - __main__ - Step 145615: {'lr': 1.0827042912730233e-06, 'samples': 27958080, 'steps': 145614, 'loss/train': 1.2772047519683838} 11/07/2021 17:44:00 - INFO - __main__ - Step 145616: {'lr': 1.082210994814814e-06, 'samples': 27958272, 'steps': 145615, 'loss/train': 0.672545850276947} 11/07/2021 17:44:00 - INFO - __main__ - Step 145617: {'lr': 1.081717810515137e-06, 'samples': 27958464, 'steps': 145616, 'loss/train': 1.6016629934310913} 11/07/2021 17:44:01 - INFO - __main__ - Step 145618: {'lr': 1.0812247383741868e-06, 'samples': 27958656, 'steps': 145617, 'loss/train': 0.979871392250061} 11/07/2021 17:44:02 - INFO - __main__ - Step 145619: {'lr': 1.0807317783922133e-06, 'samples': 27958848, 'steps': 145618, 'loss/train': 0.6258878707885742} 11/07/2021 17:44:02 - INFO - __main__ - Step 145620: {'lr': 1.0802389305694105e-06, 'samples': 27959040, 'steps': 145619, 'loss/train': 1.0306614637374878} 11/07/2021 17:44:02 - INFO - __main__ - Step 145621: {'lr': 1.0797461949060005e-06, 'samples': 27959232, 'steps': 145620, 'loss/train': 1.5137171745300293} 11/07/2021 17:44:03 - INFO - __main__ - Step 145622: {'lr': 1.0792535714022333e-06, 'samples': 27959424, 'steps': 145621, 'loss/train': 1.609895944595337} 11/07/2021 17:44:03 - INFO - __main__ - Step 145623: {'lr': 1.0787610600583031e-06, 'samples': 27959616, 'steps': 145622, 'loss/train': 1.338051438331604} 11/07/2021 17:44:04 - INFO - __main__ - Step 145624: {'lr': 1.0782686608744319e-06, 'samples': 27959808, 'steps': 145623, 'loss/train': 1.1929539442062378} 11/07/2021 17:44:05 - INFO - __main__ - Step 145625: {'lr': 1.0777763738508694e-06, 'samples': 27960000, 'steps': 145624, 'loss/train': 1.3787306547164917} 11/07/2021 17:44:05 - INFO - __main__ - Step 145626: {'lr': 1.0772841989878101e-06, 'samples': 27960192, 'steps': 145625, 'loss/train': 1.1026136875152588} 11/07/2021 17:44:05 - INFO - __main__ - Step 145627: {'lr': 1.0767921362855037e-06, 'samples': 27960384, 'steps': 145626, 'loss/train': 1.824899435043335} 11/07/2021 17:44:06 - INFO - __main__ - Step 145628: {'lr': 1.0763001857441446e-06, 'samples': 27960576, 'steps': 145627, 'loss/train': 1.1156774759292603} 11/07/2021 17:44:06 - INFO - __main__ - Step 145629: {'lr': 1.0758083473639546e-06, 'samples': 27960768, 'steps': 145628, 'loss/train': 1.2068381309509277} 11/07/2021 17:44:07 - INFO - __main__ - Step 145630: {'lr': 1.075316621145156e-06, 'samples': 27960960, 'steps': 145629, 'loss/train': 1.3547945022583008} 11/07/2021 17:44:08 - INFO - __main__ - Step 145631: {'lr': 1.0748250070879983e-06, 'samples': 27961152, 'steps': 145630, 'loss/train': 1.1949623823165894} 11/07/2021 17:44:08 - INFO - __main__ - Step 145632: {'lr': 1.0743335051926762e-06, 'samples': 27961344, 'steps': 145631, 'loss/train': 0.7423311471939087} 11/07/2021 17:44:08 - INFO - __main__ - Step 145633: {'lr': 1.0738421154594114e-06, 'samples': 27961536, 'steps': 145632, 'loss/train': 1.176004409790039} 11/07/2021 17:44:09 - INFO - __main__ - Step 145634: {'lr': 1.073350837888426e-06, 'samples': 27961728, 'steps': 145633, 'loss/train': 1.532923936843872} 11/07/2021 17:44:10 - INFO - __main__ - Step 145635: {'lr': 1.07285967247997e-06, 'samples': 27961920, 'steps': 145634, 'loss/train': 1.4954252243041992} 11/07/2021 17:44:10 - INFO - __main__ - Step 145636: {'lr': 1.0723686192342375e-06, 'samples': 27962112, 'steps': 145635, 'loss/train': 1.3533401489257812} 11/07/2021 17:44:11 - INFO - __main__ - Step 145637: {'lr': 1.0718776781514505e-06, 'samples': 27962304, 'steps': 145636, 'loss/train': 1.218415379524231} 11/07/2021 17:44:11 - INFO - __main__ - Step 145638: {'lr': 1.0713868492318313e-06, 'samples': 27962496, 'steps': 145637, 'loss/train': 1.2165690660476685} 11/07/2021 17:44:11 - INFO - __main__ - Step 145639: {'lr': 1.0708961324756017e-06, 'samples': 27962688, 'steps': 145638, 'loss/train': 1.4523539543151855} 11/07/2021 17:44:12 - INFO - __main__ - Step 145640: {'lr': 1.0704055278829838e-06, 'samples': 27962880, 'steps': 145639, 'loss/train': 1.2072798013687134} 11/07/2021 17:44:13 - INFO - __main__ - Step 145641: {'lr': 1.0699150354541997e-06, 'samples': 27963072, 'steps': 145640, 'loss/train': 1.228212594985962} 11/07/2021 17:44:13 - INFO - __main__ - Step 145642: {'lr': 1.0694246551894993e-06, 'samples': 27963264, 'steps': 145641, 'loss/train': 1.5067956447601318} 11/07/2021 17:44:13 - INFO - __main__ - Step 145643: {'lr': 1.068934387089049e-06, 'samples': 27963456, 'steps': 145642, 'loss/train': 1.222229242324829} 11/07/2021 17:44:14 - INFO - __main__ - Step 145644: {'lr': 1.0684442311530985e-06, 'samples': 27963648, 'steps': 145643, 'loss/train': 0.9618166089057922} 11/07/2021 17:44:14 - INFO - __main__ - Step 145645: {'lr': 1.0679541873818422e-06, 'samples': 27963840, 'steps': 145644, 'loss/train': 0.625504195690155} 11/07/2021 17:44:15 - INFO - __main__ - Step 145646: {'lr': 1.0674642557755575e-06, 'samples': 27964032, 'steps': 145645, 'loss/train': 1.2295104265213013} 11/07/2021 17:44:16 - INFO - __main__ - Step 145647: {'lr': 1.0669744363344113e-06, 'samples': 27964224, 'steps': 145646, 'loss/train': 0.7814059853553772} 11/07/2021 17:44:16 - INFO - __main__ - Step 145648: {'lr': 1.0664847290586533e-06, 'samples': 27964416, 'steps': 145647, 'loss/train': 1.3547660112380981} 11/07/2021 17:44:16 - INFO - __main__ - Step 145649: {'lr': 1.0659951339485052e-06, 'samples': 27964608, 'steps': 145648, 'loss/train': 1.4415327310562134} 11/07/2021 17:44:17 - INFO - __main__ - Step 145650: {'lr': 1.0655056510041895e-06, 'samples': 27964800, 'steps': 145649, 'loss/train': 1.0175033807754517} 11/07/2021 17:44:18 - INFO - __main__ - Step 145651: {'lr': 1.0650162802258723e-06, 'samples': 27964992, 'steps': 145650, 'loss/train': 1.0875202417373657} 11/07/2021 17:44:18 - INFO - __main__ - Step 145652: {'lr': 1.0645270216138592e-06, 'samples': 27965184, 'steps': 145651, 'loss/train': 0.45223891735076904} 11/07/2021 17:44:18 - INFO - __main__ - Step 145653: {'lr': 1.0640378751683166e-06, 'samples': 27965376, 'steps': 145652, 'loss/train': 1.4636552333831787} 11/07/2021 17:44:19 - INFO - __main__ - Step 145654: {'lr': 1.0635488408894667e-06, 'samples': 27965568, 'steps': 145653, 'loss/train': 1.5213189125061035} 11/07/2021 17:44:19 - INFO - __main__ - Step 145655: {'lr': 1.0630599187775592e-06, 'samples': 27965760, 'steps': 145654, 'loss/train': 1.2429084777832031} 11/07/2021 17:44:19 - INFO - __main__ - Step 145656: {'lr': 1.0625711088327882e-06, 'samples': 27965952, 'steps': 145655, 'loss/train': 2.4299094676971436} 11/07/2021 17:44:20 - INFO - __main__ - Step 145657: {'lr': 1.0620824110553763e-06, 'samples': 27966144, 'steps': 145656, 'loss/train': 1.129231333732605} 11/07/2021 17:44:21 - INFO - __main__ - Step 145658: {'lr': 1.061593825445545e-06, 'samples': 27966336, 'steps': 145657, 'loss/train': 1.3209465742111206} 11/07/2021 17:44:21 - INFO - __main__ - Step 145659: {'lr': 1.0611053520035163e-06, 'samples': 27966528, 'steps': 145658, 'loss/train': 1.4516563415527344} 11/07/2021 17:44:22 - INFO - __main__ - Step 145660: {'lr': 1.0606169907295127e-06, 'samples': 27966720, 'steps': 145659, 'loss/train': 0.9269917607307434} 11/07/2021 17:44:22 - INFO - __main__ - Step 145661: {'lr': 1.060128741623756e-06, 'samples': 27966912, 'steps': 145660, 'loss/train': 1.601680040359497} 11/07/2021 17:44:23 - INFO - __main__ - Step 145662: {'lr': 1.0596406046864682e-06, 'samples': 27967104, 'steps': 145661, 'loss/train': 1.0012867450714111} 11/07/2021 17:44:23 - INFO - __main__ - Step 145663: {'lr': 1.0591525799178714e-06, 'samples': 27967296, 'steps': 145662, 'loss/train': 1.0432889461517334} 11/07/2021 17:44:24 - INFO - __main__ - Step 145664: {'lr': 1.05866466731816e-06, 'samples': 27967488, 'steps': 145663, 'loss/train': 1.4667242765426636} 11/07/2021 17:44:24 - INFO - __main__ - Step 145665: {'lr': 1.0581768668875836e-06, 'samples': 27967680, 'steps': 145664, 'loss/train': 1.2459033727645874} 11/07/2021 17:44:24 - INFO - __main__ - Step 145666: {'lr': 1.0576891786263643e-06, 'samples': 27967872, 'steps': 145665, 'loss/train': 1.4261444807052612} 11/07/2021 17:44:25 - INFO - __main__ - Step 145667: {'lr': 1.0572016025346964e-06, 'samples': 27968064, 'steps': 145666, 'loss/train': 1.3120558261871338} 11/07/2021 17:44:26 - INFO - __main__ - Step 145668: {'lr': 1.0567141386128298e-06, 'samples': 27968256, 'steps': 145667, 'loss/train': 0.5719354748725891} 11/07/2021 17:44:26 - INFO - __main__ - Step 145669: {'lr': 1.056226786860931e-06, 'samples': 27968448, 'steps': 145668, 'loss/train': 1.3021278381347656} 11/07/2021 17:44:26 - INFO - __main__ - Step 145670: {'lr': 1.0557395472792775e-06, 'samples': 27968640, 'steps': 145669, 'loss/train': 1.2584242820739746} 11/07/2021 17:44:27 - INFO - __main__ - Step 145671: {'lr': 1.0552524198680635e-06, 'samples': 27968832, 'steps': 145670, 'loss/train': 1.3615853786468506} 11/07/2021 17:44:28 - INFO - __main__ - Step 145672: {'lr': 1.0547654046275114e-06, 'samples': 27969024, 'steps': 145671, 'loss/train': 1.882974624633789} 11/07/2021 17:44:28 - INFO - __main__ - Step 145673: {'lr': 1.0542785015578426e-06, 'samples': 27969216, 'steps': 145672, 'loss/train': 1.5951393842697144} 11/07/2021 17:44:29 - INFO - __main__ - Step 145674: {'lr': 1.053791710659252e-06, 'samples': 27969408, 'steps': 145673, 'loss/train': 0.7174002528190613} 11/07/2021 17:44:29 - INFO - __main__ - Step 145675: {'lr': 1.053305031932017e-06, 'samples': 27969600, 'steps': 145674, 'loss/train': 1.511507511138916} 11/07/2021 17:44:29 - INFO - __main__ - Step 145676: {'lr': 1.052818465376304e-06, 'samples': 27969792, 'steps': 145675, 'loss/train': 1.144109845161438} 11/07/2021 17:44:30 - INFO - __main__ - Step 145677: {'lr': 1.052332010992335e-06, 'samples': 27969984, 'steps': 145676, 'loss/train': 1.6131120920181274} 11/07/2021 17:44:31 - INFO - __main__ - Step 145678: {'lr': 1.0518456687803601e-06, 'samples': 27970176, 'steps': 145677, 'loss/train': 0.6337679624557495} 11/07/2021 17:44:31 - INFO - __main__ - Step 145679: {'lr': 1.0513594387406012e-06, 'samples': 27970368, 'steps': 145678, 'loss/train': 0.911051332950592} 11/07/2021 17:44:31 - INFO - __main__ - Step 145680: {'lr': 1.0508733208732246e-06, 'samples': 27970560, 'steps': 145679, 'loss/train': 1.6329013109207153} 11/07/2021 17:44:32 - INFO - __main__ - Step 145681: {'lr': 1.0503873151785081e-06, 'samples': 27970752, 'steps': 145680, 'loss/train': 0.5317294597625732} 11/07/2021 17:44:32 - INFO - __main__ - Step 145682: {'lr': 1.0499014216566183e-06, 'samples': 27970944, 'steps': 145681, 'loss/train': 0.9178489446640015} 11/07/2021 17:44:33 - INFO - __main__ - Step 145683: {'lr': 1.0494156403078048e-06, 'samples': 27971136, 'steps': 145682, 'loss/train': 1.6222569942474365} 11/07/2021 17:44:33 - INFO - __main__ - Step 145684: {'lr': 1.0489299711323176e-06, 'samples': 27971328, 'steps': 145683, 'loss/train': 1.8418606519699097} 11/07/2021 17:44:34 - INFO - __main__ - Step 145685: {'lr': 1.0484444141302952e-06, 'samples': 27971520, 'steps': 145684, 'loss/train': 1.3266459703445435} 11/07/2021 17:44:34 - INFO - __main__ - Step 145686: {'lr': 1.047958969302043e-06, 'samples': 27971712, 'steps': 145685, 'loss/train': 1.9132157564163208} 11/07/2021 17:44:34 - INFO - __main__ - Step 145687: {'lr': 1.0474736366477e-06, 'samples': 27971904, 'steps': 145686, 'loss/train': 1.1955326795578003} 11/07/2021 17:44:35 - INFO - __main__ - Step 145688: {'lr': 1.0469884161675435e-06, 'samples': 27972096, 'steps': 145687, 'loss/train': 1.2809104919433594} 11/07/2021 17:44:36 - INFO - __main__ - Step 145689: {'lr': 1.0465033078617958e-06, 'samples': 27972288, 'steps': 145688, 'loss/train': 1.2789684534072876} 11/07/2021 17:44:36 - INFO - __main__ - Step 145690: {'lr': 1.0460183117306232e-06, 'samples': 27972480, 'steps': 145689, 'loss/train': 1.0175952911376953} 11/07/2021 17:44:37 - INFO - __main__ - Step 145691: {'lr': 1.0455334277742756e-06, 'samples': 27972672, 'steps': 145690, 'loss/train': 1.4875950813293457} 11/07/2021 17:44:37 - INFO - __main__ - Step 145692: {'lr': 1.045048655992975e-06, 'samples': 27972864, 'steps': 145691, 'loss/train': 1.2514004707336426} 11/07/2021 17:44:38 - INFO - __main__ - Step 145693: {'lr': 1.0445639963869435e-06, 'samples': 27973056, 'steps': 145692, 'loss/train': 1.125286340713501} 11/07/2021 17:44:38 - INFO - __main__ - Step 145694: {'lr': 1.0440794489563754e-06, 'samples': 27973248, 'steps': 145693, 'loss/train': 1.140405535697937} 11/07/2021 17:44:39 - INFO - __main__ - Step 145695: {'lr': 1.0435950137014926e-06, 'samples': 27973440, 'steps': 145694, 'loss/train': 0.9590394496917725} 11/07/2021 17:44:39 - INFO - __main__ - Step 145696: {'lr': 1.0431106906225451e-06, 'samples': 27973632, 'steps': 145695, 'loss/train': 1.200311541557312} 11/07/2021 17:44:39 - INFO - __main__ - Step 145697: {'lr': 1.042626479719727e-06, 'samples': 27973824, 'steps': 145696, 'loss/train': 1.647140383720398} 11/07/2021 17:44:40 - INFO - __main__ - Step 145698: {'lr': 1.0421423809932606e-06, 'samples': 27974016, 'steps': 145697, 'loss/train': 1.482435703277588} 11/07/2021 17:44:41 - INFO - __main__ - Step 145699: {'lr': 1.04165839444334e-06, 'samples': 27974208, 'steps': 145698, 'loss/train': 1.3826751708984375} 11/07/2021 17:44:41 - INFO - __main__ - Step 145700: {'lr': 1.0411745200702427e-06, 'samples': 27974400, 'steps': 145699, 'loss/train': 1.414873719215393} 11/07/2021 17:44:41 - INFO - __main__ - Step 145701: {'lr': 1.0406907578741353e-06, 'samples': 27974592, 'steps': 145700, 'loss/train': 1.5218409299850464} 11/07/2021 17:44:42 - INFO - __main__ - Step 145702: {'lr': 1.04020710785524e-06, 'samples': 27974784, 'steps': 145701, 'loss/train': 1.6573201417922974} 11/07/2021 17:44:43 - INFO - __main__ - Step 145703: {'lr': 1.0397235700138063e-06, 'samples': 27974976, 'steps': 145702, 'loss/train': 1.2622920274734497} 11/07/2021 17:44:43 - INFO - __main__ - Step 145704: {'lr': 1.039240144350001e-06, 'samples': 27975168, 'steps': 145703, 'loss/train': 1.4982894659042358} 11/07/2021 17:44:44 - INFO - __main__ - Step 145705: {'lr': 1.0387568308641015e-06, 'samples': 27975360, 'steps': 145704, 'loss/train': 0.519539475440979} 11/07/2021 17:44:44 - INFO - __main__ - Step 145706: {'lr': 1.0382736295563023e-06, 'samples': 27975552, 'steps': 145705, 'loss/train': 0.637066125869751} 11/07/2021 17:44:45 - INFO - __main__ - Step 145707: {'lr': 1.0377905404267973e-06, 'samples': 27975744, 'steps': 145706, 'loss/train': 1.3836839199066162} 11/07/2021 17:44:45 - INFO - __main__ - Step 145708: {'lr': 1.037307563475809e-06, 'samples': 27975936, 'steps': 145707, 'loss/train': 1.6791424751281738} 11/07/2021 17:44:46 - INFO - __main__ - Step 145709: {'lr': 1.0368246987035868e-06, 'samples': 27976128, 'steps': 145708, 'loss/train': 2.0122463703155518} 11/07/2021 17:44:46 - INFO - __main__ - Step 145710: {'lr': 1.0363419461103252e-06, 'samples': 27976320, 'steps': 145709, 'loss/train': 1.1001944541931152} 11/07/2021 17:44:47 - INFO - __main__ - Step 145711: {'lr': 1.0358593056962462e-06, 'samples': 27976512, 'steps': 145710, 'loss/train': 1.3404724597930908} 11/07/2021 17:44:47 - INFO - __main__ - Step 145712: {'lr': 1.035376777461572e-06, 'samples': 27976704, 'steps': 145711, 'loss/train': 1.8545491695404053} 11/07/2021 17:44:47 - INFO - __main__ - Step 145713: {'lr': 1.0348943614064964e-06, 'samples': 27976896, 'steps': 145712, 'loss/train': 1.2420072555541992} 11/07/2021 17:44:48 - INFO - __main__ - Step 145714: {'lr': 1.0344120575312699e-06, 'samples': 27977088, 'steps': 145713, 'loss/train': 1.407476544380188} 11/07/2021 17:44:49 - INFO - __main__ - Step 145715: {'lr': 1.033929865836114e-06, 'samples': 27977280, 'steps': 145714, 'loss/train': 1.3468072414398193} 11/07/2021 17:44:49 - INFO - __main__ - Step 145716: {'lr': 1.0334477863211956e-06, 'samples': 27977472, 'steps': 145715, 'loss/train': 1.1149770021438599} 11/07/2021 17:44:49 - INFO - __main__ - Step 145717: {'lr': 1.0329658189867641e-06, 'samples': 27977664, 'steps': 145716, 'loss/train': 1.3503273725509644} 11/07/2021 17:44:50 - INFO - __main__ - Step 145718: {'lr': 1.0324839638330696e-06, 'samples': 27977856, 'steps': 145717, 'loss/train': 1.334141731262207} 11/07/2021 17:44:50 - INFO - __main__ - Step 145719: {'lr': 1.0320022208602787e-06, 'samples': 27978048, 'steps': 145718, 'loss/train': 1.1402225494384766} 11/07/2021 17:44:51 - INFO - __main__ - Step 145720: {'lr': 1.0315205900686132e-06, 'samples': 27978240, 'steps': 145719, 'loss/train': 0.047840289771556854} 11/07/2021 17:44:52 - INFO - __main__ - Step 145721: {'lr': 1.0310390714583229e-06, 'samples': 27978432, 'steps': 145720, 'loss/train': 0.34266233444213867} 11/07/2021 17:44:52 - INFO - __main__ - Step 145722: {'lr': 1.0305576650296023e-06, 'samples': 27978624, 'steps': 145721, 'loss/train': 0.9244818687438965} 11/07/2021 17:44:52 - INFO - __main__ - Step 145723: {'lr': 1.0300763707826455e-06, 'samples': 27978816, 'steps': 145722, 'loss/train': 1.7274466753005981} 11/07/2021 17:44:53 - INFO - __main__ - Step 145724: {'lr': 1.0295951887177302e-06, 'samples': 27979008, 'steps': 145723, 'loss/train': 1.739437460899353} 11/07/2021 17:44:54 - INFO - __main__ - Step 145725: {'lr': 1.0291141188349951e-06, 'samples': 27979200, 'steps': 145724, 'loss/train': 1.2001070976257324} 11/07/2021 17:44:54 - INFO - __main__ - Step 145726: {'lr': 1.0286331611347455e-06, 'samples': 27979392, 'steps': 145725, 'loss/train': 1.6891608238220215} 11/07/2021 17:44:54 - INFO - __main__ - Step 145727: {'lr': 1.0281523156171201e-06, 'samples': 27979584, 'steps': 145726, 'loss/train': 1.5054519176483154} 11/07/2021 17:44:55 - INFO - __main__ - Step 145728: {'lr': 1.027671582282369e-06, 'samples': 27979776, 'steps': 145727, 'loss/train': 1.4001048803329468} 11/07/2021 17:44:55 - INFO - __main__ - Step 145729: {'lr': 1.027190961130714e-06, 'samples': 27979968, 'steps': 145728, 'loss/train': 1.290323257446289} 11/07/2021 17:44:55 - INFO - __main__ - Step 145730: {'lr': 1.0267104521623771e-06, 'samples': 27980160, 'steps': 145729, 'loss/train': 1.127691626548767} 11/07/2021 17:44:56 - INFO - __main__ - Step 145731: {'lr': 1.0262300553775527e-06, 'samples': 27980352, 'steps': 145730, 'loss/train': 1.3927325010299683} 11/07/2021 17:44:57 - INFO - __main__ - Step 145732: {'lr': 1.025749770776463e-06, 'samples': 27980544, 'steps': 145731, 'loss/train': 1.331573247909546} 11/07/2021 17:44:57 - INFO - __main__ - Step 145733: {'lr': 1.0252695983593296e-06, 'samples': 27980736, 'steps': 145732, 'loss/train': 1.2125014066696167} 11/07/2021 17:44:58 - INFO - __main__ - Step 145734: {'lr': 1.024789538126375e-06, 'samples': 27980928, 'steps': 145733, 'loss/train': 1.2974214553833008} 11/07/2021 17:44:58 - INFO - __main__ - Step 145735: {'lr': 1.0243095900777931e-06, 'samples': 27981120, 'steps': 145734, 'loss/train': 1.5882257223129272} 11/07/2021 17:44:59 - INFO - __main__ - Step 145736: {'lr': 1.023829754213834e-06, 'samples': 27981312, 'steps': 145735, 'loss/train': 0.9296568632125854} 11/07/2021 17:44:59 - INFO - __main__ - Step 145737: {'lr': 1.0233500305346921e-06, 'samples': 27981504, 'steps': 145736, 'loss/train': 1.0846943855285645} 11/07/2021 17:45:00 - INFO - __main__ - Step 145738: {'lr': 1.0228704190405891e-06, 'samples': 27981696, 'steps': 145737, 'loss/train': 0.8656812310218811} 11/07/2021 17:45:00 - INFO - __main__ - Step 145739: {'lr': 1.0223909197317193e-06, 'samples': 27981888, 'steps': 145738, 'loss/train': 1.3889044523239136} 11/07/2021 17:45:00 - INFO - __main__ - Step 145740: {'lr': 1.0219115326083327e-06, 'samples': 27982080, 'steps': 145739, 'loss/train': 1.2404841184616089} 11/07/2021 17:45:01 - INFO - __main__ - Step 145741: {'lr': 1.0214322576706236e-06, 'samples': 27982272, 'steps': 145740, 'loss/train': 1.5534420013427734} 11/07/2021 17:45:02 - INFO - __main__ - Step 145742: {'lr': 1.0209530949188138e-06, 'samples': 27982464, 'steps': 145741, 'loss/train': 1.514046549797058} 11/07/2021 17:45:02 - INFO - __main__ - Step 145743: {'lr': 1.0204740443531258e-06, 'samples': 27982656, 'steps': 145742, 'loss/train': 1.4647191762924194} 11/07/2021 17:45:02 - INFO - __main__ - Step 145744: {'lr': 1.0199951059737812e-06, 'samples': 27982848, 'steps': 145743, 'loss/train': 1.1697046756744385} 11/07/2021 17:45:03 - INFO - __main__ - Step 145745: {'lr': 1.0195162797809743e-06, 'samples': 27983040, 'steps': 145744, 'loss/train': 1.5589147806167603} 11/07/2021 17:45:04 - INFO - __main__ - Step 145746: {'lr': 1.0190375657749273e-06, 'samples': 27983232, 'steps': 145745, 'loss/train': 1.2084797620773315} 11/07/2021 17:45:04 - INFO - __main__ - Step 145747: {'lr': 1.0185589639558623e-06, 'samples': 27983424, 'steps': 145746, 'loss/train': 1.4326448440551758} 11/07/2021 17:45:05 - INFO - __main__ - Step 145748: {'lr': 1.0180804743240013e-06, 'samples': 27983616, 'steps': 145747, 'loss/train': 1.5010207891464233} 11/07/2021 17:45:05 - INFO - __main__ - Step 145749: {'lr': 1.0176020968795385e-06, 'samples': 27983808, 'steps': 145748, 'loss/train': 1.5034703016281128} 11/07/2021 17:45:05 - INFO - __main__ - Step 145750: {'lr': 1.0171238316227237e-06, 'samples': 27984000, 'steps': 145749, 'loss/train': 1.371708869934082} 11/07/2021 17:45:06 - INFO - __main__ - Step 145751: {'lr': 1.0166456785537237e-06, 'samples': 27984192, 'steps': 145750, 'loss/train': 1.4731967449188232} 11/07/2021 17:45:07 - INFO - __main__ - Step 145752: {'lr': 1.016167637672788e-06, 'samples': 27984384, 'steps': 145751, 'loss/train': 0.9773077964782715} 11/07/2021 17:45:07 - INFO - __main__ - Step 145753: {'lr': 1.0156897089801387e-06, 'samples': 27984576, 'steps': 145752, 'loss/train': 1.3415987491607666} 11/07/2021 17:45:07 - INFO - __main__ - Step 145754: {'lr': 1.01521189247597e-06, 'samples': 27984768, 'steps': 145753, 'loss/train': 1.2928897142410278} 11/07/2021 17:45:08 - INFO - __main__ - Step 145755: {'lr': 1.0147341881605044e-06, 'samples': 27984960, 'steps': 145754, 'loss/train': 1.4420785903930664} 11/07/2021 17:45:08 - INFO - __main__ - Step 145756: {'lr': 1.0142565960339356e-06, 'samples': 27985152, 'steps': 145755, 'loss/train': 1.637543797492981} 11/07/2021 17:45:09 - INFO - __main__ - Step 145757: {'lr': 1.0137791160965414e-06, 'samples': 27985344, 'steps': 145756, 'loss/train': 1.0449274778366089} 11/07/2021 17:45:09 - INFO - __main__ - Step 145758: {'lr': 1.0133017483484608e-06, 'samples': 27985536, 'steps': 145757, 'loss/train': 1.181870698928833} 11/07/2021 17:45:10 - INFO - __main__ - Step 145759: {'lr': 1.012824492789971e-06, 'samples': 27985728, 'steps': 145758, 'loss/train': 1.6446475982666016} 11/07/2021 17:45:10 - INFO - __main__ - Step 145760: {'lr': 1.0123473494212388e-06, 'samples': 27985920, 'steps': 145759, 'loss/train': 1.18601393699646} 11/07/2021 17:45:10 - INFO - __main__ - Step 145761: {'lr': 1.0118703182425137e-06, 'samples': 27986112, 'steps': 145760, 'loss/train': 1.3520694971084595} 11/07/2021 17:45:11 - INFO - __main__ - Step 145762: {'lr': 1.0113933992539903e-06, 'samples': 27986304, 'steps': 145761, 'loss/train': 1.2730399370193481} 11/07/2021 17:45:12 - INFO - __main__ - Step 145763: {'lr': 1.0109165924559182e-06, 'samples': 27986496, 'steps': 145762, 'loss/train': 1.4366275072097778} 11/07/2021 17:45:12 - INFO - __main__ - Step 145764: {'lr': 1.010439897848464e-06, 'samples': 27986688, 'steps': 145763, 'loss/train': 1.0775067806243896} 11/07/2021 17:45:13 - INFO - __main__ - Step 145765: {'lr': 1.0099633154318499e-06, 'samples': 27986880, 'steps': 145764, 'loss/train': 1.1972029209136963} 11/07/2021 17:45:13 - INFO - __main__ - Step 145766: {'lr': 1.0094868452063254e-06, 'samples': 27987072, 'steps': 145765, 'loss/train': 0.7823657989501953} 11/07/2021 17:45:14 - INFO - __main__ - Step 145767: {'lr': 1.0090104871720574e-06, 'samples': 27987264, 'steps': 145766, 'loss/train': 1.4176777601242065} 11/07/2021 17:45:14 - INFO - __main__ - Step 145768: {'lr': 1.008534241329323e-06, 'samples': 27987456, 'steps': 145767, 'loss/train': 1.3030574321746826} 11/07/2021 17:45:15 - INFO - __main__ - Step 145769: {'lr': 1.0080581076782613e-06, 'samples': 27987648, 'steps': 145768, 'loss/train': 0.69999760389328} 11/07/2021 17:45:15 - INFO - __main__ - Step 145770: {'lr': 1.0075820862191498e-06, 'samples': 27987840, 'steps': 145769, 'loss/train': 1.4635660648345947} 11/07/2021 17:45:15 - INFO - __main__ - Step 145771: {'lr': 1.0071061769521828e-06, 'samples': 27988032, 'steps': 145770, 'loss/train': 1.3037687540054321} 11/07/2021 17:45:16 - INFO - __main__ - Step 145772: {'lr': 1.0066303798775545e-06, 'samples': 27988224, 'steps': 145771, 'loss/train': 0.8290954232215881} 11/07/2021 17:45:17 - INFO - __main__ - Step 145773: {'lr': 1.0061546949955146e-06, 'samples': 27988416, 'steps': 145772, 'loss/train': 0.8950677514076233} 11/07/2021 17:45:17 - INFO - __main__ - Step 145774: {'lr': 1.0056791223062577e-06, 'samples': 27988608, 'steps': 145773, 'loss/train': 1.5861632823944092} 11/07/2021 17:45:17 - INFO - __main__ - Step 145775: {'lr': 1.0052036618100057e-06, 'samples': 27988800, 'steps': 145774, 'loss/train': 1.8199472427368164} 11/07/2021 17:45:18 - INFO - __main__ - Step 145776: {'lr': 1.004728313506953e-06, 'samples': 27988992, 'steps': 145775, 'loss/train': 0.8569278120994568} 11/07/2021 17:45:18 - INFO - __main__ - Step 145777: {'lr': 1.0042530773973213e-06, 'samples': 27989184, 'steps': 145776, 'loss/train': 1.320884346961975} 11/07/2021 17:45:19 - INFO - __main__ - Step 145778: {'lr': 1.003777953481333e-06, 'samples': 27989376, 'steps': 145777, 'loss/train': 1.7421239614486694} 11/07/2021 17:45:20 - INFO - __main__ - Step 145779: {'lr': 1.0033029417592098e-06, 'samples': 27989568, 'steps': 145778, 'loss/train': 1.293189525604248} 11/07/2021 17:45:20 - INFO - __main__ - Step 145780: {'lr': 1.0028280422311464e-06, 'samples': 27989760, 'steps': 145779, 'loss/train': 1.5487878322601318} 11/07/2021 17:45:20 - INFO - __main__ - Step 145781: {'lr': 1.0023532548973924e-06, 'samples': 27989952, 'steps': 145780, 'loss/train': 0.9937403798103333} 11/07/2021 17:45:21 - INFO - __main__ - Step 145782: {'lr': 1.0018785797581142e-06, 'samples': 27990144, 'steps': 145781, 'loss/train': 0.7347866892814636} 11/07/2021 17:45:22 - INFO - __main__ - Step 145783: {'lr': 1.001404016813534e-06, 'samples': 27990336, 'steps': 145782, 'loss/train': 1.844003438949585} 11/07/2021 17:45:22 - INFO - __main__ - Step 145784: {'lr': 1.0009295660639018e-06, 'samples': 27990528, 'steps': 145783, 'loss/train': 0.9775509238243103} 11/07/2021 17:45:23 - INFO - __main__ - Step 145785: {'lr': 1.0004552275094114e-06, 'samples': 27990720, 'steps': 145784, 'loss/train': 0.9313710927963257} 11/07/2021 17:45:23 - INFO - __main__ - Step 145786: {'lr': 9.999810011502575e-07, 'samples': 27990912, 'steps': 145785, 'loss/train': 1.3141206502914429} 11/07/2021 17:45:23 - INFO - __main__ - Step 145787: {'lr': 9.995068869866897e-07, 'samples': 27991104, 'steps': 145786, 'loss/train': 2.088149309158325} 11/07/2021 17:45:24 - INFO - __main__ - Step 145788: {'lr': 9.990328850188745e-07, 'samples': 27991296, 'steps': 145787, 'loss/train': 0.8796727061271667} 11/07/2021 17:45:25 - INFO - __main__ - Step 145789: {'lr': 9.98558995247062e-07, 'samples': 27991488, 'steps': 145788, 'loss/train': 1.7075698375701904} 11/07/2021 17:45:25 - INFO - __main__ - Step 145790: {'lr': 9.980852176714738e-07, 'samples': 27991680, 'steps': 145789, 'loss/train': 1.3584988117218018} 11/07/2021 17:45:25 - INFO - __main__ - Step 145791: {'lr': 9.97611552292277e-07, 'samples': 27991872, 'steps': 145790, 'loss/train': 0.8102580904960632} 11/07/2021 17:45:26 - INFO - __main__ - Step 145792: {'lr': 9.971379991097484e-07, 'samples': 27992064, 'steps': 145791, 'loss/train': 1.2228782176971436} 11/07/2021 17:45:26 - INFO - __main__ - Step 145793: {'lr': 9.966645581240275e-07, 'samples': 27992256, 'steps': 145792, 'loss/train': 1.3593907356262207} 11/07/2021 17:45:27 - INFO - __main__ - Step 145794: {'lr': 9.961912293353913e-07, 'samples': 27992448, 'steps': 145793, 'loss/train': 1.1709434986114502} 11/07/2021 17:45:27 - INFO - __main__ - Step 145795: {'lr': 9.957180127440345e-07, 'samples': 27992640, 'steps': 145794, 'loss/train': 0.8650078773498535} 11/07/2021 17:45:28 - INFO - __main__ - Step 145796: {'lr': 9.952449083501514e-07, 'samples': 27992832, 'steps': 145795, 'loss/train': 0.863847553730011} 11/07/2021 17:45:28 - INFO - __main__ - Step 145797: {'lr': 9.947719161539637e-07, 'samples': 27993024, 'steps': 145796, 'loss/train': 0.8052687048912048} 11/07/2021 17:45:28 - INFO - __main__ - Step 145798: {'lr': 9.942990361556936e-07, 'samples': 27993216, 'steps': 145797, 'loss/train': 0.776978611946106} 11/07/2021 17:45:30 - INFO - __main__ - Step 145799: {'lr': 9.938262683555632e-07, 'samples': 27993408, 'steps': 145798, 'loss/train': 0.9824508428573608} 11/07/2021 17:45:30 - INFO - __main__ - Step 145800: {'lr': 9.933536127537667e-07, 'samples': 27993600, 'steps': 145799, 'loss/train': 1.1129740476608276} 11/07/2021 17:45:30 - INFO - __main__ - Step 145801: {'lr': 9.928810693505263e-07, 'samples': 27993792, 'steps': 145800, 'loss/train': 1.434852123260498} 11/07/2021 17:45:31 - INFO - __main__ - Step 145802: {'lr': 9.924086381460361e-07, 'samples': 27993984, 'steps': 145801, 'loss/train': 1.450958013534546} 11/07/2021 17:45:31 - INFO - __main__ - Step 145803: {'lr': 9.919363191405462e-07, 'samples': 27994176, 'steps': 145802, 'loss/train': 0.13306105136871338} 11/07/2021 17:45:32 - INFO - __main__ - Step 145804: {'lr': 9.914641123342227e-07, 'samples': 27994368, 'steps': 145803, 'loss/train': 1.520709753036499} 11/07/2021 17:45:32 - INFO - __main__ - Step 145805: {'lr': 9.909920177273157e-07, 'samples': 27994560, 'steps': 145804, 'loss/train': 1.1337590217590332} 11/07/2021 17:45:33 - INFO - __main__ - Step 145806: {'lr': 9.905200353200194e-07, 'samples': 27994752, 'steps': 145805, 'loss/train': 1.635334849357605} 11/07/2021 17:45:33 - INFO - __main__ - Step 145807: {'lr': 9.900481651125558e-07, 'samples': 27994944, 'steps': 145806, 'loss/train': 1.1588773727416992} 11/07/2021 17:45:33 - INFO - __main__ - Step 145808: {'lr': 9.895764071051472e-07, 'samples': 27995136, 'steps': 145807, 'loss/train': 1.088740587234497} 11/07/2021 17:45:34 - INFO - __main__ - Step 145809: {'lr': 9.891047612979875e-07, 'samples': 27995328, 'steps': 145808, 'loss/train': 1.0193943977355957} 11/07/2021 17:45:35 - INFO - __main__ - Step 145810: {'lr': 9.886332276912713e-07, 'samples': 27995520, 'steps': 145809, 'loss/train': 1.8607650995254517} 11/07/2021 17:45:35 - INFO - __main__ - Step 145811: {'lr': 9.881618062852482e-07, 'samples': 27995712, 'steps': 145810, 'loss/train': 1.477144479751587} 11/07/2021 17:45:35 - INFO - __main__ - Step 145812: {'lr': 9.876904970801404e-07, 'samples': 27995904, 'steps': 145811, 'loss/train': 1.2363823652267456} 11/07/2021 17:45:36 - INFO - __main__ - Step 145813: {'lr': 9.872193000761144e-07, 'samples': 27996096, 'steps': 145812, 'loss/train': 1.1819515228271484} 11/07/2021 17:45:36 - INFO - __main__ - Step 145814: {'lr': 9.867482152734198e-07, 'samples': 27996288, 'steps': 145813, 'loss/train': 1.1628457307815552} 11/07/2021 17:45:37 - INFO - __main__ - Step 145815: {'lr': 9.862772426722234e-07, 'samples': 27996480, 'steps': 145814, 'loss/train': 1.047685146331787} 11/07/2021 17:45:38 - INFO - __main__ - Step 145816: {'lr': 9.858063822728024e-07, 'samples': 27996672, 'steps': 145815, 'loss/train': 1.2799115180969238} 11/07/2021 17:45:38 - INFO - __main__ - Step 145817: {'lr': 9.853356340752961e-07, 'samples': 27996864, 'steps': 145816, 'loss/train': 1.3545663356781006} 11/07/2021 17:45:38 - INFO - __main__ - Step 145818: {'lr': 9.848649980799817e-07, 'samples': 27997056, 'steps': 145817, 'loss/train': 1.5264832973480225} 11/07/2021 17:45:39 - INFO - __main__ - Step 145819: {'lr': 9.843944742870537e-07, 'samples': 27997248, 'steps': 145818, 'loss/train': 1.524864912033081} 11/07/2021 17:45:40 - INFO - __main__ - Step 145820: {'lr': 9.83924062696706e-07, 'samples': 27997440, 'steps': 145819, 'loss/train': 1.6026685237884521} 11/07/2021 17:45:40 - INFO - __main__ - Step 145821: {'lr': 9.834537633091334e-07, 'samples': 27997632, 'steps': 145820, 'loss/train': 1.2276878356933594} 11/07/2021 17:45:40 - INFO - __main__ - Step 145822: {'lr': 9.82983576124613e-07, 'samples': 27997824, 'steps': 145821, 'loss/train': 1.1409517526626587} 11/07/2021 17:45:41 - INFO - __main__ - Step 145823: {'lr': 9.82513501143284e-07, 'samples': 27998016, 'steps': 145822, 'loss/train': 1.3766002655029297} 11/07/2021 17:45:41 - INFO - __main__ - Step 145824: {'lr': 9.820435383654236e-07, 'samples': 27998208, 'steps': 145823, 'loss/train': 0.9794116616249084} 11/07/2021 17:45:42 - INFO - __main__ - Step 145825: {'lr': 9.815736877911984e-07, 'samples': 27998400, 'steps': 145824, 'loss/train': 0.21719098091125488} 11/07/2021 17:45:43 - INFO - __main__ - Step 145826: {'lr': 9.811039494208308e-07, 'samples': 27998592, 'steps': 145825, 'loss/train': 1.4979978799819946} 11/07/2021 17:45:43 - INFO - __main__ - Step 145827: {'lr': 9.806343232545424e-07, 'samples': 27998784, 'steps': 145826, 'loss/train': 1.4465315341949463} 11/07/2021 17:45:43 - INFO - __main__ - Step 145828: {'lr': 9.801648092925276e-07, 'samples': 27998976, 'steps': 145827, 'loss/train': 1.4377808570861816} 11/07/2021 17:45:44 - INFO - __main__ - Step 145829: {'lr': 9.796954075350083e-07, 'samples': 27999168, 'steps': 145828, 'loss/train': 1.4699898958206177} 11/07/2021 17:45:44 - INFO - __main__ - Step 145830: {'lr': 9.792261179821792e-07, 'samples': 27999360, 'steps': 145829, 'loss/train': 1.5880742073059082} 11/07/2021 17:45:45 - INFO - __main__ - Step 145831: {'lr': 9.7875694063429e-07, 'samples': 27999552, 'steps': 145830, 'loss/train': 1.4756295680999756} 11/07/2021 17:45:45 - INFO - __main__ - Step 145832: {'lr': 9.782878754915347e-07, 'samples': 27999744, 'steps': 145831, 'loss/train': 1.1139434576034546} 11/07/2021 17:45:46 - INFO - __main__ - Step 145833: {'lr': 9.778189225541078e-07, 'samples': 27999936, 'steps': 145832, 'loss/train': 2.0581629276275635} 11/07/2021 17:45:46 - INFO - __main__ - Step 145834: {'lr': 9.773500818222314e-07, 'samples': 28000128, 'steps': 145833, 'loss/train': 1.1920194625854492} 11/07/2021 17:45:46 - INFO - __main__ - Step 145835: {'lr': 9.768813532961273e-07, 'samples': 28000320, 'steps': 145834, 'loss/train': 1.1529914140701294} 11/07/2021 17:45:47 - INFO - __main__ - Step 145836: {'lr': 9.764127369760178e-07, 'samples': 28000512, 'steps': 145835, 'loss/train': 1.0786434412002563} 11/07/2021 17:45:48 - INFO - __main__ - Step 145837: {'lr': 9.759442328620693e-07, 'samples': 28000704, 'steps': 145836, 'loss/train': 1.6430569887161255} 11/07/2021 17:45:48 - INFO - __main__ - Step 145838: {'lr': 9.75475840954504e-07, 'samples': 28000896, 'steps': 145837, 'loss/train': 0.4284057915210724} 11/07/2021 17:45:48 - INFO - __main__ - Step 145839: {'lr': 9.750075612535714e-07, 'samples': 28001088, 'steps': 145838, 'loss/train': 1.2277498245239258} 11/07/2021 17:45:49 - INFO - __main__ - Step 145840: {'lr': 9.74539393759466e-07, 'samples': 28001280, 'steps': 145839, 'loss/train': 1.3863898515701294} 11/07/2021 17:45:50 - INFO - __main__ - Step 145841: {'lr': 9.740713384723542e-07, 'samples': 28001472, 'steps': 145840, 'loss/train': 1.011620044708252} 11/07/2021 17:45:50 - INFO - __main__ - Step 145842: {'lr': 9.736033953925138e-07, 'samples': 28001664, 'steps': 145841, 'loss/train': 1.2684284448623657} 11/07/2021 17:45:51 - INFO - __main__ - Step 145843: {'lr': 9.731355645201112e-07, 'samples': 28001856, 'steps': 145842, 'loss/train': 1.0065596103668213} 11/07/2021 17:45:51 - INFO - __main__ - Step 145844: {'lr': 9.726678458553683e-07, 'samples': 28002048, 'steps': 145843, 'loss/train': 1.3496592044830322} 11/07/2021 17:45:51 - INFO - __main__ - Step 145845: {'lr': 9.722002393985075e-07, 'samples': 28002240, 'steps': 145844, 'loss/train': 1.1556463241577148} 11/07/2021 17:45:52 - INFO - __main__ - Step 145846: {'lr': 9.717327451497226e-07, 'samples': 28002432, 'steps': 145845, 'loss/train': 1.419642686843872} 11/07/2021 17:45:53 - INFO - __main__ - Step 145847: {'lr': 9.71265363109236e-07, 'samples': 28002624, 'steps': 145846, 'loss/train': 1.2301521301269531} 11/07/2021 17:45:53 - INFO - __main__ - Step 145848: {'lr': 9.707980932772697e-07, 'samples': 28002816, 'steps': 145847, 'loss/train': 1.8117536306381226} 11/07/2021 17:45:53 - INFO - __main__ - Step 145849: {'lr': 9.703309356539903e-07, 'samples': 28003008, 'steps': 145848, 'loss/train': 1.1435497999191284} 11/07/2021 17:45:54 - INFO - __main__ - Step 145850: {'lr': 9.698638902396473e-07, 'samples': 28003200, 'steps': 145849, 'loss/train': 1.473118543624878} 11/07/2021 17:45:55 - INFO - __main__ - Step 145851: {'lr': 9.693969570344629e-07, 'samples': 28003392, 'steps': 145850, 'loss/train': 1.8046475648880005} 11/07/2021 17:45:55 - INFO - __main__ - Step 145852: {'lr': 9.689301360386037e-07, 'samples': 28003584, 'steps': 145851, 'loss/train': 1.5328630208969116} 11/07/2021 17:45:56 - INFO - __main__ - Step 145853: {'lr': 9.684634272522919e-07, 'samples': 28003776, 'steps': 145852, 'loss/train': 0.5533215999603271} 11/07/2021 17:45:56 - INFO - __main__ - Step 145854: {'lr': 9.679968306757769e-07, 'samples': 28003968, 'steps': 145853, 'loss/train': 1.0004786252975464} 11/07/2021 17:45:56 - INFO - __main__ - Step 145855: {'lr': 9.675303463092255e-07, 'samples': 28004160, 'steps': 145854, 'loss/train': 1.3237323760986328} 11/07/2021 17:45:57 - INFO - __main__ - Step 145856: {'lr': 9.670639741528598e-07, 'samples': 28004352, 'steps': 145855, 'loss/train': 1.1622910499572754} 11/07/2021 17:45:58 - INFO - __main__ - Step 145857: {'lr': 9.665977142068738e-07, 'samples': 28004544, 'steps': 145856, 'loss/train': 1.0496562719345093} 11/07/2021 17:45:58 - INFO - __main__ - Step 145858: {'lr': 9.661315664715453e-07, 'samples': 28004736, 'steps': 145857, 'loss/train': 1.0342235565185547} 11/07/2021 17:45:58 - INFO - __main__ - Step 145859: {'lr': 9.656655309469852e-07, 'samples': 28004928, 'steps': 145858, 'loss/train': 1.4074420928955078} 11/07/2021 17:45:59 - INFO - __main__ - Step 145860: {'lr': 9.651996076334712e-07, 'samples': 28005120, 'steps': 145859, 'loss/train': 0.9237319231033325} 11/07/2021 17:45:59 - INFO - __main__ - Step 145861: {'lr': 9.647337965311975e-07, 'samples': 28005312, 'steps': 145860, 'loss/train': 1.397188663482666} 11/07/2021 17:46:00 - INFO - __main__ - Step 145862: {'lr': 9.642680976403862e-07, 'samples': 28005504, 'steps': 145861, 'loss/train': 0.1511479914188385} 11/07/2021 17:46:00 - INFO - __main__ - Step 145863: {'lr': 9.638025109612037e-07, 'samples': 28005696, 'steps': 145862, 'loss/train': 1.3988254070281982} 11/07/2021 17:46:01 - INFO - __main__ - Step 145864: {'lr': 9.633370364938999e-07, 'samples': 28005888, 'steps': 145863, 'loss/train': 1.0489661693572998} 11/07/2021 17:46:01 - INFO - __main__ - Step 145865: {'lr': 9.628716742386967e-07, 'samples': 28006080, 'steps': 145864, 'loss/train': 1.055505633354187} 11/07/2021 17:46:01 - INFO - __main__ - Step 145866: {'lr': 9.624064241957609e-07, 'samples': 28006272, 'steps': 145865, 'loss/train': 1.15151846408844} 11/07/2021 17:46:02 - INFO - __main__ - Step 145867: {'lr': 9.619412863653144e-07, 'samples': 28006464, 'steps': 145866, 'loss/train': 1.6166572570800781} 11/07/2021 17:46:03 - INFO - __main__ - Step 145868: {'lr': 9.61476260747579e-07, 'samples': 28006656, 'steps': 145867, 'loss/train': 1.495600700378418} 11/07/2021 17:46:03 - INFO - __main__ - Step 145869: {'lr': 9.610113473427773e-07, 'samples': 28006848, 'steps': 145868, 'loss/train': 0.728148877620697} 11/07/2021 17:46:04 - INFO - __main__ - Step 145870: {'lr': 9.605465461510753e-07, 'samples': 28007040, 'steps': 145869, 'loss/train': 1.737595558166504} 11/07/2021 17:46:04 - INFO - __main__ - Step 145871: {'lr': 9.600818571727232e-07, 'samples': 28007232, 'steps': 145870, 'loss/train': 1.312833547592163} 11/07/2021 17:46:05 - INFO - __main__ - Step 145872: {'lr': 9.59617280407915e-07, 'samples': 28007424, 'steps': 145871, 'loss/train': 1.171597957611084} 11/07/2021 17:46:05 - INFO - __main__ - Step 145873: {'lr': 9.591528158568453e-07, 'samples': 28007616, 'steps': 145872, 'loss/train': 1.038730263710022} 11/07/2021 17:46:06 - INFO - __main__ - Step 145874: {'lr': 9.586884635197636e-07, 'samples': 28007808, 'steps': 145873, 'loss/train': 0.9767788052558899} 11/07/2021 17:46:06 - INFO - __main__ - Step 145875: {'lr': 9.582242233968363e-07, 'samples': 28008000, 'steps': 145874, 'loss/train': 1.1170986890792847} 11/07/2021 17:46:06 - INFO - __main__ - Step 145876: {'lr': 9.577600954882858e-07, 'samples': 28008192, 'steps': 145875, 'loss/train': 1.234187364578247} 11/07/2021 17:46:08 - INFO - __main__ - Step 145877: {'lr': 9.572960797943342e-07, 'samples': 28008384, 'steps': 145876, 'loss/train': 1.360708475112915} 11/07/2021 17:46:08 - INFO - __main__ - Step 145878: {'lr': 9.568321763151756e-07, 'samples': 28008576, 'steps': 145877, 'loss/train': 1.54502534866333} 11/07/2021 17:46:08 - INFO - __main__ - Step 145879: {'lr': 9.563683850510319e-07, 'samples': 28008768, 'steps': 145878, 'loss/train': 1.2881145477294922} 11/07/2021 17:46:09 - INFO - __main__ - Step 145880: {'lr': 9.559047060021254e-07, 'samples': 28008960, 'steps': 145879, 'loss/train': 1.0277420282363892} 11/07/2021 17:46:09 - INFO - __main__ - Step 145881: {'lr': 9.554411391686225e-07, 'samples': 28009152, 'steps': 145880, 'loss/train': 2.086571455001831} 11/07/2021 17:46:09 - INFO - __main__ - Step 145882: {'lr': 9.549776845507452e-07, 'samples': 28009344, 'steps': 145881, 'loss/train': 0.8277300596237183} 11/07/2021 17:46:10 - INFO - __main__ - Step 145883: {'lr': 9.545143421487435e-07, 'samples': 28009536, 'steps': 145882, 'loss/train': 1.0267701148986816} 11/07/2021 17:46:11 - INFO - __main__ - Step 145884: {'lr': 9.54051111962756e-07, 'samples': 28009728, 'steps': 145883, 'loss/train': 0.8919546604156494} 11/07/2021 17:46:11 - INFO - __main__ - Step 145885: {'lr': 9.535879939930603e-07, 'samples': 28009920, 'steps': 145884, 'loss/train': 1.2646106481552124} 11/07/2021 17:46:11 - INFO - __main__ - Step 145886: {'lr': 9.531249882398229e-07, 'samples': 28010112, 'steps': 145885, 'loss/train': 0.9693202972412109} 11/07/2021 17:46:12 - INFO - __main__ - Step 145887: {'lr': 9.526620947032661e-07, 'samples': 28010304, 'steps': 145886, 'loss/train': 1.4547330141067505} 11/07/2021 17:46:13 - INFO - __main__ - Step 145888: {'lr': 9.521993133835838e-07, 'samples': 28010496, 'steps': 145887, 'loss/train': 1.0894157886505127} 11/07/2021 17:46:13 - INFO - __main__ - Step 145889: {'lr': 9.51736644281026e-07, 'samples': 28010688, 'steps': 145888, 'loss/train': 1.5443388223648071} 11/07/2021 17:46:14 - INFO - __main__ - Step 145890: {'lr': 9.512740873957592e-07, 'samples': 28010880, 'steps': 145889, 'loss/train': 1.2550756931304932} 11/07/2021 17:46:14 - INFO - __main__ - Step 145891: {'lr': 9.508116427279779e-07, 'samples': 28011072, 'steps': 145890, 'loss/train': 1.5105429887771606} 11/07/2021 17:46:14 - INFO - __main__ - Step 145892: {'lr': 9.503493102779592e-07, 'samples': 28011264, 'steps': 145891, 'loss/train': 0.9545254111289978} 11/07/2021 17:46:15 - INFO - __main__ - Step 145893: {'lr': 9.498870900458422e-07, 'samples': 28011456, 'steps': 145892, 'loss/train': 1.2506790161132812} 11/07/2021 17:46:16 - INFO - __main__ - Step 145894: {'lr': 9.494249820318768e-07, 'samples': 28011648, 'steps': 145893, 'loss/train': 1.2408266067504883} 11/07/2021 17:46:16 - INFO - __main__ - Step 145895: {'lr': 9.48962986236257e-07, 'samples': 28011840, 'steps': 145894, 'loss/train': 1.2041102647781372} 11/07/2021 17:46:16 - INFO - __main__ - Step 145896: {'lr': 9.48501102659205e-07, 'samples': 28012032, 'steps': 145895, 'loss/train': 1.2472002506256104} 11/07/2021 17:46:17 - INFO - __main__ - Step 145897: {'lr': 9.480393313008873e-07, 'samples': 28012224, 'steps': 145896, 'loss/train': 1.3736045360565186} 11/07/2021 17:46:18 - INFO - __main__ - Step 145898: {'lr': 9.475776721615537e-07, 'samples': 28012416, 'steps': 145897, 'loss/train': 1.2344627380371094} 11/07/2021 17:46:18 - INFO - __main__ - Step 145899: {'lr': 9.471161252413984e-07, 'samples': 28012608, 'steps': 145898, 'loss/train': 1.4923572540283203} 11/07/2021 17:46:19 - INFO - __main__ - Step 145900: {'lr': 9.46654690540616e-07, 'samples': 28012800, 'steps': 145899, 'loss/train': 2.2263002395629883} 11/07/2021 17:46:19 - INFO - __main__ - Step 145901: {'lr': 9.461933680594559e-07, 'samples': 28012992, 'steps': 145900, 'loss/train': 1.165945053100586} 11/07/2021 17:46:19 - INFO - __main__ - Step 145902: {'lr': 9.457321577980849e-07, 'samples': 28013184, 'steps': 145901, 'loss/train': 1.1422423124313354} 11/07/2021 17:46:20 - INFO - __main__ - Step 145903: {'lr': 9.452710597566971e-07, 'samples': 28013376, 'steps': 145902, 'loss/train': 1.2574113607406616} 11/07/2021 17:46:21 - INFO - __main__ - Step 145904: {'lr': 9.448100739355703e-07, 'samples': 28013568, 'steps': 145903, 'loss/train': 0.763078510761261} 11/07/2021 17:46:21 - INFO - __main__ - Step 145905: {'lr': 9.443492003348431e-07, 'samples': 28013760, 'steps': 145904, 'loss/train': 1.1532588005065918} 11/07/2021 17:46:21 - INFO - __main__ - Step 145906: {'lr': 9.438884389547375e-07, 'samples': 28013952, 'steps': 145905, 'loss/train': 1.259904146194458} 11/07/2021 17:46:22 - INFO - __main__ - Step 145907: {'lr': 9.434277897955034e-07, 'samples': 28014144, 'steps': 145906, 'loss/train': 1.542797565460205} 11/07/2021 17:46:22 - INFO - __main__ - Step 145908: {'lr': 9.429672528573075e-07, 'samples': 28014336, 'steps': 145907, 'loss/train': 1.097790241241455} 11/07/2021 17:46:23 - INFO - __main__ - Step 145909: {'lr': 9.425068281403714e-07, 'samples': 28014528, 'steps': 145908, 'loss/train': 1.6482113599777222} 11/07/2021 17:46:23 - INFO - __main__ - Step 145910: {'lr': 9.420465156448898e-07, 'samples': 28014720, 'steps': 145909, 'loss/train': 0.7816482186317444} 11/07/2021 17:46:24 - INFO - __main__ - Step 145911: {'lr': 9.415863153710846e-07, 'samples': 28014912, 'steps': 145910, 'loss/train': 1.108435869216919} 11/07/2021 17:46:24 - INFO - __main__ - Step 145912: {'lr': 9.411262273191501e-07, 'samples': 28015104, 'steps': 145911, 'loss/train': 1.4148972034454346} 11/07/2021 17:46:25 - INFO - __main__ - Step 145913: {'lr': 9.406662514893083e-07, 'samples': 28015296, 'steps': 145912, 'loss/train': 1.2508389949798584} 11/07/2021 17:46:26 - INFO - __main__ - Step 145914: {'lr': 9.402063878817535e-07, 'samples': 28015488, 'steps': 145913, 'loss/train': 1.3909621238708496} 11/07/2021 17:46:26 - INFO - __main__ - Step 145915: {'lr': 9.397466364966801e-07, 'samples': 28015680, 'steps': 145914, 'loss/train': 1.3819754123687744} 11/07/2021 17:46:26 - INFO - __main__ - Step 145916: {'lr': 9.392869973343377e-07, 'samples': 28015872, 'steps': 145915, 'loss/train': 1.4083266258239746} 11/07/2021 17:46:27 - INFO - __main__ - Step 145917: {'lr': 9.388274703949207e-07, 'samples': 28016064, 'steps': 145916, 'loss/train': 1.1223492622375488} 11/07/2021 17:46:27 - INFO - __main__ - Step 145918: {'lr': 9.383680556785956e-07, 'samples': 28016256, 'steps': 145917, 'loss/train': 1.4372084140777588} 11/07/2021 17:46:28 - INFO - __main__ - Step 145919: {'lr': 9.379087531856123e-07, 'samples': 28016448, 'steps': 145918, 'loss/train': 1.0286674499511719} 11/07/2021 17:46:28 - INFO - __main__ - Step 145920: {'lr': 9.37449562916165e-07, 'samples': 28016640, 'steps': 145919, 'loss/train': 1.1489309072494507} 11/07/2021 17:46:29 - INFO - __main__ - Step 145921: {'lr': 9.369904848704757e-07, 'samples': 28016832, 'steps': 145920, 'loss/train': 1.1278035640716553} 11/07/2021 17:46:29 - INFO - __main__ - Step 145922: {'lr': 9.365315190487111e-07, 'samples': 28017024, 'steps': 145921, 'loss/train': 1.2220097780227661} 11/07/2021 17:46:29 - INFO - __main__ - Step 145923: {'lr': 9.360726654510932e-07, 'samples': 28017216, 'steps': 145922, 'loss/train': 1.2192316055297852} 11/07/2021 17:46:30 - INFO - __main__ - Step 145924: {'lr': 9.356139240778716e-07, 'samples': 28017408, 'steps': 145923, 'loss/train': 1.4019192457199097} 11/07/2021 17:46:31 - INFO - __main__ - Step 145925: {'lr': 9.351552949291853e-07, 'samples': 28017600, 'steps': 145924, 'loss/train': 1.3997470140457153} 11/07/2021 17:46:31 - INFO - __main__ - Step 145926: {'lr': 9.34696778005284e-07, 'samples': 28017792, 'steps': 145925, 'loss/train': 1.1619271039962769} 11/07/2021 17:46:32 - INFO - __main__ - Step 145927: {'lr': 9.342383733063898e-07, 'samples': 28017984, 'steps': 145926, 'loss/train': 1.0977632999420166} 11/07/2021 17:46:32 - INFO - __main__ - Step 145928: {'lr': 9.337800808326691e-07, 'samples': 28018176, 'steps': 145927, 'loss/train': 1.3644579648971558} 11/07/2021 17:46:32 - INFO - __main__ - Step 145929: {'lr': 9.333219005843163e-07, 'samples': 28018368, 'steps': 145928, 'loss/train': 1.1463991403579712} 11/07/2021 17:46:33 - INFO - __main__ - Step 145930: {'lr': 9.32863832561609e-07, 'samples': 28018560, 'steps': 145929, 'loss/train': 1.5163594484329224} 11/07/2021 17:46:34 - INFO - __main__ - Step 145931: {'lr': 9.324058767646859e-07, 'samples': 28018752, 'steps': 145930, 'loss/train': 1.1715819835662842} 11/07/2021 17:46:34 - INFO - __main__ - Step 145932: {'lr': 9.31948033193769e-07, 'samples': 28018944, 'steps': 145931, 'loss/train': 0.12585680186748505} 11/07/2021 17:46:34 - INFO - __main__ - Step 145933: {'lr': 9.314903018490806e-07, 'samples': 28019136, 'steps': 145932, 'loss/train': 1.3436044454574585} 11/07/2021 17:46:35 - INFO - __main__ - Step 145934: {'lr': 9.310326827308424e-07, 'samples': 28019328, 'steps': 145933, 'loss/train': 1.192139983177185} 11/07/2021 17:46:36 - INFO - __main__ - Step 145935: {'lr': 9.305751758392212e-07, 'samples': 28019520, 'steps': 145934, 'loss/train': 1.7445451021194458} 11/07/2021 17:46:36 - INFO - __main__ - Step 145936: {'lr': 9.301177811744388e-07, 'samples': 28019712, 'steps': 145935, 'loss/train': 0.30207982659339905} 11/07/2021 17:46:37 - INFO - __main__ - Step 145937: {'lr': 9.296604987366897e-07, 'samples': 28019904, 'steps': 145936, 'loss/train': 1.184849500656128} 11/07/2021 17:46:37 - INFO - __main__ - Step 145938: {'lr': 9.292033285262236e-07, 'samples': 28020096, 'steps': 145937, 'loss/train': 1.3870861530303955} 11/07/2021 17:46:37 - INFO - __main__ - Step 145939: {'lr': 9.287462705431793e-07, 'samples': 28020288, 'steps': 145938, 'loss/train': 1.4288748502731323} 11/07/2021 17:46:38 - INFO - __main__ - Step 145940: {'lr': 9.282893247878066e-07, 'samples': 28020480, 'steps': 145939, 'loss/train': 1.5095399618148804} 11/07/2021 17:46:39 - INFO - __main__ - Step 145941: {'lr': 9.278324912603276e-07, 'samples': 28020672, 'steps': 145940, 'loss/train': 1.3052536249160767} 11/07/2021 17:46:39 - INFO - __main__ - Step 145942: {'lr': 9.273757699609087e-07, 'samples': 28020864, 'steps': 145941, 'loss/train': 0.5918026566505432} 11/07/2021 17:46:40 - INFO - __main__ - Step 145943: {'lr': 9.26919160889772e-07, 'samples': 28021056, 'steps': 145942, 'loss/train': 1.5329313278198242} 11/07/2021 17:46:40 - INFO - __main__ - Step 145944: {'lr': 9.264626640471119e-07, 'samples': 28021248, 'steps': 145943, 'loss/train': 0.0418735034763813} 11/07/2021 17:46:41 - INFO - __main__ - Step 145945: {'lr': 9.260062794331503e-07, 'samples': 28021440, 'steps': 145944, 'loss/train': 0.7019053101539612} 11/07/2021 17:46:41 - INFO - __main__ - Step 145946: {'lr': 9.255500070480816e-07, 'samples': 28021632, 'steps': 145945, 'loss/train': 0.3601733148097992} 11/07/2021 17:46:42 - INFO - __main__ - Step 145947: {'lr': 9.250938468921278e-07, 'samples': 28021824, 'steps': 145946, 'loss/train': 1.0334677696228027} 11/07/2021 17:46:42 - INFO - __main__ - Step 145948: {'lr': 9.246377989654831e-07, 'samples': 28022016, 'steps': 145947, 'loss/train': 1.3748165369033813} 11/07/2021 17:46:42 - INFO - __main__ - Step 145949: {'lr': 9.241818632683419e-07, 'samples': 28022208, 'steps': 145948, 'loss/train': 1.430880069732666} 11/07/2021 17:46:43 - INFO - __main__ - Step 145950: {'lr': 9.237260398009261e-07, 'samples': 28022400, 'steps': 145949, 'loss/train': 1.1842796802520752} 11/07/2021 17:46:44 - INFO - __main__ - Step 145951: {'lr': 9.23270328563458e-07, 'samples': 28022592, 'steps': 145950, 'loss/train': 1.0988311767578125} 11/07/2021 17:46:44 - INFO - __main__ - Step 145952: {'lr': 9.228147295561041e-07, 'samples': 28022784, 'steps': 145951, 'loss/train': 1.3491131067276} 11/07/2021 17:46:44 - INFO - __main__ - Step 145953: {'lr': 9.223592427790584e-07, 'samples': 28022976, 'steps': 145952, 'loss/train': 1.5982975959777832} 11/07/2021 17:46:45 - INFO - __main__ - Step 145954: {'lr': 9.219038682325986e-07, 'samples': 28023168, 'steps': 145953, 'loss/train': 1.0837693214416504} 11/07/2021 17:46:45 - INFO - __main__ - Step 145955: {'lr': 9.214486059168636e-07, 'samples': 28023360, 'steps': 145954, 'loss/train': 0.6234476566314697} 11/07/2021 17:46:46 - INFO - __main__ - Step 145956: {'lr': 9.209934558320754e-07, 'samples': 28023552, 'steps': 145955, 'loss/train': 1.3971128463745117} 11/07/2021 17:46:47 - INFO - __main__ - Step 145957: {'lr': 9.20538417978456e-07, 'samples': 28023744, 'steps': 145956, 'loss/train': 1.4113115072250366} 11/07/2021 17:46:47 - INFO - __main__ - Step 145958: {'lr': 9.200834923561719e-07, 'samples': 28023936, 'steps': 145957, 'loss/train': 1.436173677444458} 11/07/2021 17:46:47 - INFO - __main__ - Step 145959: {'lr': 9.196286789655006e-07, 'samples': 28024128, 'steps': 145958, 'loss/train': 1.183017611503601} 11/07/2021 17:46:48 - INFO - __main__ - Step 145960: {'lr': 9.191739778065533e-07, 'samples': 28024320, 'steps': 145959, 'loss/train': 1.2696794271469116} 11/07/2021 17:46:49 - INFO - __main__ - Step 145961: {'lr': 9.187193888796353e-07, 'samples': 28024512, 'steps': 145960, 'loss/train': 1.5319311618804932} 11/07/2021 17:46:49 - INFO - __main__ - Step 145962: {'lr': 9.182649121848574e-07, 'samples': 28024704, 'steps': 145961, 'loss/train': 1.5535132884979248} 11/07/2021 17:46:49 - INFO - __main__ - Step 145963: {'lr': 9.178105477224696e-07, 'samples': 28024896, 'steps': 145962, 'loss/train': 1.734606385231018} 11/07/2021 17:46:50 - INFO - __main__ - Step 145964: {'lr': 9.173562954926939e-07, 'samples': 28025088, 'steps': 145963, 'loss/train': 1.453546404838562} 11/07/2021 17:46:50 - INFO - __main__ - Step 145965: {'lr': 9.169021554956969e-07, 'samples': 28025280, 'steps': 145964, 'loss/train': 1.1698590517044067} 11/07/2021 17:46:51 - INFO - __main__ - Step 145966: {'lr': 9.164481277317005e-07, 'samples': 28025472, 'steps': 145965, 'loss/train': 1.3857585191726685} 11/07/2021 17:46:52 - INFO - __main__ - Step 145967: {'lr': 9.159942122009269e-07, 'samples': 28025664, 'steps': 145966, 'loss/train': 1.054118275642395} 11/07/2021 17:46:52 - INFO - __main__ - Step 145968: {'lr': 9.155404089035424e-07, 'samples': 28025856, 'steps': 145967, 'loss/train': 1.1013846397399902} 11/07/2021 17:46:52 - INFO - __main__ - Step 145969: {'lr': 9.150867178397692e-07, 'samples': 28026048, 'steps': 145968, 'loss/train': 1.5357646942138672} 11/07/2021 17:46:53 - INFO - __main__ - Step 145970: {'lr': 9.146331390098294e-07, 'samples': 28026240, 'steps': 145969, 'loss/train': 2.2632479667663574} 11/07/2021 17:46:54 - INFO - __main__ - Step 145971: {'lr': 9.141796724138895e-07, 'samples': 28026432, 'steps': 145970, 'loss/train': 1.3644055128097534} 11/07/2021 17:46:54 - INFO - __main__ - Step 145972: {'lr': 9.137263180521993e-07, 'samples': 28026624, 'steps': 145971, 'loss/train': 1.4245940446853638} 11/07/2021 17:46:54 - INFO - __main__ - Step 145973: {'lr': 9.132730759249252e-07, 'samples': 28026816, 'steps': 145972, 'loss/train': 0.9996790885925293} 11/07/2021 17:46:55 - INFO - __main__ - Step 145974: {'lr': 9.128199460323172e-07, 'samples': 28027008, 'steps': 145973, 'loss/train': 1.3220419883728027} 11/07/2021 17:46:55 - INFO - __main__ - Step 145975: {'lr': 9.12366928374514e-07, 'samples': 28027200, 'steps': 145974, 'loss/train': 1.3595726490020752} 11/07/2021 17:46:55 - INFO - __main__ - Step 145976: {'lr': 9.119140229517653e-07, 'samples': 28027392, 'steps': 145975, 'loss/train': 0.9241524338722229} 11/07/2021 17:46:56 - INFO - __main__ - Step 145977: {'lr': 9.114612297642655e-07, 'samples': 28027584, 'steps': 145976, 'loss/train': 1.487976312637329} 11/07/2021 17:46:57 - INFO - __main__ - Step 145978: {'lr': 9.110085488122367e-07, 'samples': 28027776, 'steps': 145977, 'loss/train': 1.7449592351913452} 11/07/2021 17:46:57 - INFO - __main__ - Step 145979: {'lr': 9.105559800958452e-07, 'samples': 28027968, 'steps': 145978, 'loss/train': 1.2129937410354614} 11/07/2021 17:46:58 - INFO - __main__ - Step 145980: {'lr': 9.101035236153133e-07, 'samples': 28028160, 'steps': 145979, 'loss/train': 0.988226056098938} 11/07/2021 17:46:58 - INFO - __main__ - Step 145981: {'lr': 9.09651179370835e-07, 'samples': 28028352, 'steps': 145980, 'loss/train': 1.4221954345703125} 11/07/2021 17:46:59 - INFO - __main__ - Step 145982: {'lr': 9.091989473626327e-07, 'samples': 28028544, 'steps': 145981, 'loss/train': 1.3716951608657837} 11/07/2021 17:46:59 - INFO - __main__ - Step 145983: {'lr': 9.087468275909006e-07, 'samples': 28028736, 'steps': 145982, 'loss/train': 1.1748418807983398} 11/07/2021 17:47:00 - INFO - __main__ - Step 145984: {'lr': 9.082948200558606e-07, 'samples': 28028928, 'steps': 145983, 'loss/train': 0.5945744514465332} 11/07/2021 17:47:00 - INFO - __main__ - Step 145985: {'lr': 9.078429247576792e-07, 'samples': 28029120, 'steps': 145984, 'loss/train': 1.3048456907272339} 11/07/2021 17:47:00 - INFO - __main__ - Step 145986: {'lr': 9.073911416965785e-07, 'samples': 28029312, 'steps': 145985, 'loss/train': 1.3314777612686157} 11/07/2021 17:47:02 - INFO - __main__ - Step 145987: {'lr': 9.069394708727807e-07, 'samples': 28029504, 'steps': 145986, 'loss/train': 1.0832916498184204} 11/07/2021 17:47:02 - INFO - __main__ - Step 145988: {'lr': 9.0648791228648e-07, 'samples': 28029696, 'steps': 145987, 'loss/train': 1.2493090629577637} 11/07/2021 17:47:02 - INFO - __main__ - Step 145989: {'lr': 9.060364659378428e-07, 'samples': 28029888, 'steps': 145988, 'loss/train': 1.1026993989944458} 11/07/2021 17:47:03 - INFO - __main__ - Step 145990: {'lr': 9.055851318271191e-07, 'samples': 28030080, 'steps': 145989, 'loss/train': 1.4339118003845215} 11/07/2021 17:47:03 - INFO - __main__ - Step 145991: {'lr': 9.051339099544753e-07, 'samples': 28030272, 'steps': 145990, 'loss/train': 0.5870440602302551} 11/07/2021 17:47:04 - INFO - __main__ - Step 145992: {'lr': 9.046828003201613e-07, 'samples': 28030464, 'steps': 145991, 'loss/train': 1.0392589569091797} 11/07/2021 17:47:04 - INFO - __main__ - Step 145993: {'lr': 9.042318029243435e-07, 'samples': 28030656, 'steps': 145992, 'loss/train': 1.2655837535858154} 11/07/2021 17:47:05 - INFO - __main__ - Step 145994: {'lr': 9.037809177672162e-07, 'samples': 28030848, 'steps': 145993, 'loss/train': 1.0790865421295166} 11/07/2021 17:47:05 - INFO - __main__ - Step 145995: {'lr': 9.033301448490294e-07, 'samples': 28031040, 'steps': 145994, 'loss/train': 1.3316866159439087} 11/07/2021 17:47:06 - INFO - __main__ - Step 145996: {'lr': 9.028794841699495e-07, 'samples': 28031232, 'steps': 145995, 'loss/train': 1.4657611846923828} 11/07/2021 17:47:06 - INFO - __main__ - Step 145997: {'lr': 9.024289357301707e-07, 'samples': 28031424, 'steps': 145996, 'loss/train': 0.6397385001182556} 11/07/2021 17:47:07 - INFO - __main__ - Step 145998: {'lr': 9.019784995299429e-07, 'samples': 28031616, 'steps': 145997, 'loss/train': 1.6463907957077026} 11/07/2021 17:47:07 - INFO - __main__ - Step 145999: {'lr': 9.015281755694327e-07, 'samples': 28031808, 'steps': 145998, 'loss/train': 1.2570230960845947} 11/07/2021 17:47:08 - INFO - __main__ - Step 146000: {'lr': 9.010779638488342e-07, 'samples': 28032000, 'steps': 145999, 'loss/train': 1.336675763130188} 11/07/2021 17:47:08 - INFO - __main__ - Step 146001: {'lr': 9.006278643683696e-07, 'samples': 28032192, 'steps': 146000, 'loss/train': 1.6635974645614624} 11/07/2021 17:47:08 - INFO - __main__ - Step 146002: {'lr': 9.001778771282609e-07, 'samples': 28032384, 'steps': 146001, 'loss/train': 1.435117244720459} 11/07/2021 17:47:09 - INFO - __main__ - Step 146003: {'lr': 8.997280021286747e-07, 'samples': 28032576, 'steps': 146002, 'loss/train': 0.7880780100822449} 11/07/2021 17:47:10 - INFO - __main__ - Step 146004: {'lr': 8.992782393698051e-07, 'samples': 28032768, 'steps': 146003, 'loss/train': 1.140568494796753} 11/07/2021 17:47:10 - INFO - __main__ - Step 146005: {'lr': 8.988285888519021e-07, 'samples': 28032960, 'steps': 146004, 'loss/train': 1.4581319093704224} 11/07/2021 17:47:10 - INFO - __main__ - Step 146006: {'lr': 8.983790505751322e-07, 'samples': 28033152, 'steps': 146005, 'loss/train': 1.1360266208648682} 11/07/2021 17:47:11 - INFO - __main__ - Step 146007: {'lr': 8.979296245397172e-07, 'samples': 28033344, 'steps': 146006, 'loss/train': 1.0702860355377197} 11/07/2021 17:47:12 - INFO - __main__ - Step 146008: {'lr': 8.974803107458518e-07, 'samples': 28033536, 'steps': 146007, 'loss/train': 1.5466974973678589} 11/07/2021 17:47:12 - INFO - __main__ - Step 146009: {'lr': 8.9703110919373e-07, 'samples': 28033728, 'steps': 146008, 'loss/train': 1.1853716373443604} 11/07/2021 17:47:13 - INFO - __main__ - Step 146010: {'lr': 8.965820198835461e-07, 'samples': 28033920, 'steps': 146009, 'loss/train': 1.634477972984314} 11/07/2021 17:47:13 - INFO - __main__ - Step 146011: {'lr': 8.961330428155501e-07, 'samples': 28034112, 'steps': 146010, 'loss/train': 1.037127137184143} 11/07/2021 17:47:13 - INFO - __main__ - Step 146012: {'lr': 8.956841779899083e-07, 'samples': 28034304, 'steps': 146011, 'loss/train': 1.3267018795013428} 11/07/2021 17:47:15 - INFO - __main__ - Step 146013: {'lr': 8.952354254068151e-07, 'samples': 28034496, 'steps': 146012, 'loss/train': 1.1543676853179932} 11/07/2021 17:47:15 - INFO - __main__ - Step 146014: {'lr': 8.947867850664926e-07, 'samples': 28034688, 'steps': 146013, 'loss/train': 1.3724730014801025} 11/07/2021 17:47:15 - INFO - __main__ - Step 146015: {'lr': 8.94338256969135e-07, 'samples': 28034880, 'steps': 146014, 'loss/train': 1.194100022315979} 11/07/2021 17:47:16 - INFO - __main__ - Step 146016: {'lr': 8.938898411149366e-07, 'samples': 28035072, 'steps': 146015, 'loss/train': 1.2080556154251099} 11/07/2021 17:47:16 - INFO - __main__ - Step 146017: {'lr': 8.934415375041194e-07, 'samples': 28035264, 'steps': 146016, 'loss/train': 0.04119187220931053} 11/07/2021 17:47:17 - INFO - __main__ - Step 146018: {'lr': 8.9299334613685e-07, 'samples': 28035456, 'steps': 146017, 'loss/train': 0.5720486640930176} 11/07/2021 17:47:17 - INFO - __main__ - Step 146019: {'lr': 8.925452670133783e-07, 'samples': 28035648, 'steps': 146018, 'loss/train': 1.5377449989318848} 11/07/2021 17:47:18 - INFO - __main__ - Step 146020: {'lr': 8.920973001338705e-07, 'samples': 28035840, 'steps': 146019, 'loss/train': 1.381713628768921} 11/07/2021 17:47:18 - INFO - __main__ - Step 146021: {'lr': 8.916494454985491e-07, 'samples': 28036032, 'steps': 146020, 'loss/train': 1.1304699182510376} 11/07/2021 17:47:18 - INFO - __main__ - Step 146022: {'lr': 8.912017031075804e-07, 'samples': 28036224, 'steps': 146021, 'loss/train': 1.3618531227111816} 11/07/2021 17:47:19 - INFO - __main__ - Step 146023: {'lr': 8.90754072961214e-07, 'samples': 28036416, 'steps': 146022, 'loss/train': 1.12021005153656} 11/07/2021 17:47:20 - INFO - __main__ - Step 146024: {'lr': 8.903065550596445e-07, 'samples': 28036608, 'steps': 146023, 'loss/train': 1.1232773065567017} 11/07/2021 17:47:20 - INFO - __main__ - Step 146025: {'lr': 8.898591494030384e-07, 'samples': 28036800, 'steps': 146024, 'loss/train': 1.099450945854187} 11/07/2021 17:47:20 - INFO - __main__ - Step 146026: {'lr': 8.894118559916176e-07, 'samples': 28036992, 'steps': 146025, 'loss/train': 0.8909407258033752} 11/07/2021 17:47:21 - INFO - __main__ - Step 146027: {'lr': 8.889646748255764e-07, 'samples': 28037184, 'steps': 146026, 'loss/train': 1.0792341232299805} 11/07/2021 17:47:21 - INFO - __main__ - Step 146028: {'lr': 8.885176059051369e-07, 'samples': 28037376, 'steps': 146027, 'loss/train': 1.5723646879196167} 11/07/2021 17:47:23 - INFO - __main__ - Step 146029: {'lr': 8.880706492304935e-07, 'samples': 28037568, 'steps': 146028, 'loss/train': 1.6326180696487427} 11/07/2021 17:47:23 - INFO - __main__ - Step 146030: {'lr': 8.876238048018404e-07, 'samples': 28037760, 'steps': 146029, 'loss/train': 2.267573595046997} 11/07/2021 17:47:23 - INFO - __main__ - Step 146031: {'lr': 8.871770726193718e-07, 'samples': 28037952, 'steps': 146030, 'loss/train': 1.1838380098342896} 11/07/2021 17:47:24 - INFO - __main__ - Step 146032: {'lr': 8.86730452683282e-07, 'samples': 28038144, 'steps': 146031, 'loss/train': 1.8547897338867188} 11/07/2021 17:47:24 - INFO - __main__ - Step 146033: {'lr': 8.862839449938209e-07, 'samples': 28038336, 'steps': 146032, 'loss/train': 0.9834957122802734} 11/07/2021 17:47:25 - INFO - __main__ - Step 146034: {'lr': 8.858375495511273e-07, 'samples': 28038528, 'steps': 146033, 'loss/train': 1.3107391595840454} 11/07/2021 17:47:25 - INFO - __main__ - Step 146035: {'lr': 8.853912663554508e-07, 'samples': 28038720, 'steps': 146034, 'loss/train': 1.1498305797576904} 11/07/2021 17:47:26 - INFO - __main__ - Step 146036: {'lr': 8.849450954069582e-07, 'samples': 28038912, 'steps': 146035, 'loss/train': 1.139827013015747} 11/07/2021 17:47:26 - INFO - __main__ - Step 146037: {'lr': 8.844990367058714e-07, 'samples': 28039104, 'steps': 146036, 'loss/train': 0.6289168000221252} 11/07/2021 17:47:26 - INFO - __main__ - Step 146038: {'lr': 8.840530902523847e-07, 'samples': 28039296, 'steps': 146037, 'loss/train': 0.7377802133560181} 11/07/2021 17:47:27 - INFO - __main__ - Step 146039: {'lr': 8.836072560466923e-07, 'samples': 28039488, 'steps': 146038, 'loss/train': 1.5315619707107544} 11/07/2021 17:47:28 - INFO - __main__ - Step 146040: {'lr': 8.831615340890165e-07, 'samples': 28039680, 'steps': 146039, 'loss/train': 1.2897496223449707} 11/07/2021 17:47:28 - INFO - __main__ - Step 146041: {'lr': 8.827159243795513e-07, 'samples': 28039872, 'steps': 146040, 'loss/train': 1.3708865642547607} 11/07/2021 17:47:28 - INFO - __main__ - Step 146042: {'lr': 8.822704269184633e-07, 'samples': 28040064, 'steps': 146041, 'loss/train': 0.8614375591278076} 11/07/2021 17:47:29 - INFO - __main__ - Step 146043: {'lr': 8.818250417060026e-07, 'samples': 28040256, 'steps': 146042, 'loss/train': 1.5979540348052979} 11/07/2021 17:47:30 - INFO - __main__ - Step 146044: {'lr': 8.813797687423353e-07, 'samples': 28040448, 'steps': 146043, 'loss/train': 1.4214774370193481} 11/07/2021 17:47:30 - INFO - __main__ - Step 146045: {'lr': 8.809346080276837e-07, 'samples': 28040640, 'steps': 146044, 'loss/train': 1.652829647064209} 11/07/2021 17:47:31 - INFO - __main__ - Step 146046: {'lr': 8.80489559562242e-07, 'samples': 28040832, 'steps': 146045, 'loss/train': 1.185032844543457} 11/07/2021 17:47:31 - INFO - __main__ - Step 146047: {'lr': 8.800446233461768e-07, 'samples': 28041024, 'steps': 146046, 'loss/train': 1.8707256317138672} 11/07/2021 17:47:31 - INFO - __main__ - Step 146048: {'lr': 8.795997993797378e-07, 'samples': 28041216, 'steps': 146047, 'loss/train': 1.158681035041809} 11/07/2021 17:47:32 - INFO - __main__ - Step 146049: {'lr': 8.791550876631193e-07, 'samples': 28041408, 'steps': 146048, 'loss/train': 1.0169546604156494} 11/07/2021 17:47:33 - INFO - __main__ - Step 146050: {'lr': 8.78710488196488e-07, 'samples': 28041600, 'steps': 146049, 'loss/train': 0.969200074672699} 11/07/2021 17:47:33 - INFO - __main__ - Step 146051: {'lr': 8.782660009800936e-07, 'samples': 28041792, 'steps': 146050, 'loss/train': 0.04623624309897423} 11/07/2021 17:47:34 - INFO - __main__ - Step 146052: {'lr': 8.778216260140748e-07, 'samples': 28041984, 'steps': 146051, 'loss/train': 1.4509986639022827} 11/07/2021 17:47:34 - INFO - __main__ - Step 146053: {'lr': 8.773773632987092e-07, 'samples': 28042176, 'steps': 146052, 'loss/train': 1.2653396129608154} 11/07/2021 17:47:34 - INFO - __main__ - Step 146054: {'lr': 8.769332128341079e-07, 'samples': 28042368, 'steps': 146053, 'loss/train': 0.9687672853469849} 11/07/2021 17:47:36 - INFO - __main__ - Step 146055: {'lr': 8.764891746205483e-07, 'samples': 28042560, 'steps': 146054, 'loss/train': 0.9788981676101685} 11/07/2021 17:47:36 - INFO - __main__ - Step 146056: {'lr': 8.760452486581971e-07, 'samples': 28042752, 'steps': 146055, 'loss/train': 1.3810118436813354} 11/07/2021 17:47:36 - INFO - __main__ - Step 146057: {'lr': 8.756014349472208e-07, 'samples': 28042944, 'steps': 146056, 'loss/train': 1.6646783351898193} 11/07/2021 17:47:37 - INFO - __main__ - Step 146058: {'lr': 8.751577334878969e-07, 'samples': 28043136, 'steps': 146057, 'loss/train': 1.563788890838623} 11/07/2021 17:47:37 - INFO - __main__ - Step 146059: {'lr': 8.747141442803641e-07, 'samples': 28043328, 'steps': 146058, 'loss/train': 0.8290058374404907} 11/07/2021 17:47:38 - INFO - __main__ - Step 146060: {'lr': 8.742706673248168e-07, 'samples': 28043520, 'steps': 146059, 'loss/train': 0.9548178911209106} 11/07/2021 17:47:39 - INFO - __main__ - Step 146061: {'lr': 8.738273026215049e-07, 'samples': 28043712, 'steps': 146060, 'loss/train': 1.0405367612838745} 11/07/2021 17:47:39 - INFO - __main__ - Step 146062: {'lr': 8.733840501705948e-07, 'samples': 28043904, 'steps': 146061, 'loss/train': 1.1449998617172241} 11/07/2021 17:47:39 - INFO - __main__ - Step 146063: {'lr': 8.729409099723085e-07, 'samples': 28044096, 'steps': 146062, 'loss/train': 1.0467091798782349} 11/07/2021 17:47:40 - INFO - __main__ - Step 146064: {'lr': 8.724978820268126e-07, 'samples': 28044288, 'steps': 146063, 'loss/train': 1.228217601776123} 11/07/2021 17:47:40 - INFO - __main__ - Step 146065: {'lr': 8.720549663343291e-07, 'samples': 28044480, 'steps': 146064, 'loss/train': 1.3415944576263428} 11/07/2021 17:47:41 - INFO - __main__ - Step 146066: {'lr': 8.716121628950802e-07, 'samples': 28044672, 'steps': 146065, 'loss/train': 1.3155137300491333} 11/07/2021 17:47:41 - INFO - __main__ - Step 146067: {'lr': 8.711694717092045e-07, 'samples': 28044864, 'steps': 146066, 'loss/train': 1.2990649938583374} 11/07/2021 17:47:42 - INFO - __main__ - Step 146068: {'lr': 8.707268927769518e-07, 'samples': 28045056, 'steps': 146067, 'loss/train': 1.2460633516311646} 11/07/2021 17:47:42 - INFO - __main__ - Step 146069: {'lr': 8.702844260984888e-07, 'samples': 28045248, 'steps': 146068, 'loss/train': 1.7156391143798828} 11/07/2021 17:47:42 - INFO - __main__ - Step 146070: {'lr': 8.698420716740652e-07, 'samples': 28045440, 'steps': 146069, 'loss/train': 0.9976329803466797} 11/07/2021 17:47:43 - INFO - __main__ - Step 146071: {'lr': 8.693998295038196e-07, 'samples': 28045632, 'steps': 146070, 'loss/train': 0.983167827129364} 11/07/2021 17:47:44 - INFO - __main__ - Step 146072: {'lr': 8.689576995879744e-07, 'samples': 28045824, 'steps': 146071, 'loss/train': 1.4264981746673584} 11/07/2021 17:47:44 - INFO - __main__ - Step 146073: {'lr': 8.685156819267515e-07, 'samples': 28046016, 'steps': 146072, 'loss/train': 1.074087142944336} 11/07/2021 17:47:45 - INFO - __main__ - Step 146074: {'lr': 8.680737765203173e-07, 'samples': 28046208, 'steps': 146073, 'loss/train': 1.5289256572723389} 11/07/2021 17:47:45 - INFO - __main__ - Step 146075: {'lr': 8.676319833688662e-07, 'samples': 28046400, 'steps': 146074, 'loss/train': 1.426896095275879} 11/07/2021 17:47:46 - INFO - __main__ - Step 146076: {'lr': 8.67190302472648e-07, 'samples': 28046592, 'steps': 146075, 'loss/train': 1.5843642950057983} 11/07/2021 17:47:46 - INFO - __main__ - Step 146077: {'lr': 8.667487338318292e-07, 'samples': 28046784, 'steps': 146076, 'loss/train': 1.4553309679031372} 11/07/2021 17:47:47 - INFO - __main__ - Step 146078: {'lr': 8.663072774465763e-07, 'samples': 28046976, 'steps': 146077, 'loss/train': 1.1749879121780396} 11/07/2021 17:47:47 - INFO - __main__ - Step 146079: {'lr': 8.658659333171392e-07, 'samples': 28047168, 'steps': 146078, 'loss/train': 1.4424527883529663} 11/07/2021 17:47:47 - INFO - __main__ - Step 146080: {'lr': 8.654247014437122e-07, 'samples': 28047360, 'steps': 146079, 'loss/train': 1.7045912742614746} 11/07/2021 17:47:49 - INFO - __main__ - Step 146081: {'lr': 8.649835818264617e-07, 'samples': 28047552, 'steps': 146080, 'loss/train': 0.9899308681488037} 11/07/2021 17:47:49 - INFO - __main__ - Step 146082: {'lr': 8.645425744656376e-07, 'samples': 28047744, 'steps': 146081, 'loss/train': 0.41157591342926025} 11/07/2021 17:47:49 - INFO - __main__ - Step 146083: {'lr': 8.641016793613787e-07, 'samples': 28047936, 'steps': 146082, 'loss/train': 1.6112011671066284} 11/07/2021 17:47:50 - INFO - __main__ - Step 146084: {'lr': 8.63660896513907e-07, 'samples': 28048128, 'steps': 146083, 'loss/train': 1.3124635219573975} 11/07/2021 17:47:50 - INFO - __main__ - Step 146085: {'lr': 8.632202259234445e-07, 'samples': 28048320, 'steps': 146084, 'loss/train': 1.1916723251342773} 11/07/2021 17:47:51 - INFO - __main__ - Step 146086: {'lr': 8.627796675901578e-07, 'samples': 28048512, 'steps': 146085, 'loss/train': 1.5914884805679321} 11/07/2021 17:47:51 - INFO - __main__ - Step 146087: {'lr': 8.623392215142689e-07, 'samples': 28048704, 'steps': 146086, 'loss/train': 0.9933158159255981} 11/07/2021 17:47:52 - INFO - __main__ - Step 146088: {'lr': 8.618988876959443e-07, 'samples': 28048896, 'steps': 146087, 'loss/train': 1.3981722593307495} 11/07/2021 17:47:52 - INFO - __main__ - Step 146089: {'lr': 8.614586661354063e-07, 'samples': 28049088, 'steps': 146088, 'loss/train': 1.4902045726776123} 11/07/2021 17:47:52 - INFO - __main__ - Step 146090: {'lr': 8.610185568328766e-07, 'samples': 28049280, 'steps': 146089, 'loss/train': 1.3340492248535156} 11/07/2021 17:47:53 - INFO - __main__ - Step 146091: {'lr': 8.605785597884941e-07, 'samples': 28049472, 'steps': 146090, 'loss/train': 1.4350316524505615} 11/07/2021 17:47:54 - INFO - __main__ - Step 146092: {'lr': 8.601386750025086e-07, 'samples': 28049664, 'steps': 146091, 'loss/train': 1.2872267961502075} 11/07/2021 17:47:54 - INFO - __main__ - Step 146093: {'lr': 8.596989024751145e-07, 'samples': 28049856, 'steps': 146092, 'loss/train': 0.9650023579597473} 11/07/2021 17:47:54 - INFO - __main__ - Step 146094: {'lr': 8.592592422064782e-07, 'samples': 28050048, 'steps': 146093, 'loss/train': 1.1662579774856567} 11/07/2021 17:47:55 - INFO - __main__ - Step 146095: {'lr': 8.588196941968218e-07, 'samples': 28050240, 'steps': 146094, 'loss/train': 1.6032323837280273} 11/07/2021 17:47:55 - INFO - __main__ - Step 146096: {'lr': 8.583802584463118e-07, 'samples': 28050432, 'steps': 146095, 'loss/train': 1.6747527122497559} 11/07/2021 17:47:57 - INFO - __main__ - Step 146097: {'lr': 8.579409349551981e-07, 'samples': 28050624, 'steps': 146096, 'loss/train': 1.3553427457809448} 11/07/2021 17:47:57 - INFO - __main__ - Step 146098: {'lr': 8.575017237236471e-07, 'samples': 28050816, 'steps': 146097, 'loss/train': 1.2410881519317627} 11/07/2021 17:47:58 - INFO - __main__ - Step 146099: {'lr': 8.570626247518532e-07, 'samples': 28051008, 'steps': 146098, 'loss/train': 1.4773873090744019} 11/07/2021 17:47:58 - INFO - __main__ - Step 146100: {'lr': 8.566236380400383e-07, 'samples': 28051200, 'steps': 146099, 'loss/train': 1.5652661323547363} 11/07/2021 17:47:58 - INFO - __main__ - Step 146101: {'lr': 8.56184763588369e-07, 'samples': 28051392, 'steps': 146100, 'loss/train': 1.129091739654541} 11/07/2021 17:47:59 - INFO - __main__ - Step 146102: {'lr': 8.557460013970675e-07, 'samples': 28051584, 'steps': 146101, 'loss/train': 1.3300389051437378} 11/07/2021 17:48:00 - INFO - __main__ - Step 146103: {'lr': 8.553073514663279e-07, 'samples': 28051776, 'steps': 146102, 'loss/train': 1.0700647830963135} 11/07/2021 17:48:00 - INFO - __main__ - Step 146104: {'lr': 8.548688137963168e-07, 'samples': 28051968, 'steps': 146103, 'loss/train': 1.3755673170089722} 11/07/2021 17:48:00 - INFO - __main__ - Step 146105: {'lr': 8.544303883872839e-07, 'samples': 28052160, 'steps': 146104, 'loss/train': 1.227815866470337} 11/07/2021 17:48:01 - INFO - __main__ - Step 146106: {'lr': 8.539920752393959e-07, 'samples': 28052352, 'steps': 146105, 'loss/train': 1.292376160621643} 11/07/2021 17:48:02 - INFO - __main__ - Step 146107: {'lr': 8.535538743528471e-07, 'samples': 28052544, 'steps': 146106, 'loss/train': 1.9040038585662842} 11/07/2021 17:48:02 - INFO - __main__ - Step 146108: {'lr': 8.531157857278315e-07, 'samples': 28052736, 'steps': 146107, 'loss/train': 1.6686776876449585} 11/07/2021 17:48:02 - INFO - __main__ - Step 146109: {'lr': 8.526778093645715e-07, 'samples': 28052928, 'steps': 146108, 'loss/train': 1.416430115699768} 11/07/2021 17:48:03 - INFO - __main__ - Step 146110: {'lr': 8.522399452632613e-07, 'samples': 28053120, 'steps': 146109, 'loss/train': 1.2596211433410645} 11/07/2021 17:48:03 - INFO - __main__ - Step 146111: {'lr': 8.518021934240672e-07, 'samples': 28053312, 'steps': 146110, 'loss/train': 1.1110641956329346} 11/07/2021 17:48:04 - INFO - __main__ - Step 146112: {'lr': 8.513645538472114e-07, 'samples': 28053504, 'steps': 146111, 'loss/train': 1.3678261041641235} 11/07/2021 17:48:05 - INFO - __main__ - Step 146113: {'lr': 8.509270265328883e-07, 'samples': 28053696, 'steps': 146112, 'loss/train': 1.5073487758636475} 11/07/2021 17:48:05 - INFO - __main__ - Step 146114: {'lr': 8.504896114812921e-07, 'samples': 28053888, 'steps': 146113, 'loss/train': 1.3653737306594849} 11/07/2021 17:48:05 - INFO - __main__ - Step 146115: {'lr': 8.500523086926171e-07, 'samples': 28054080, 'steps': 146114, 'loss/train': 1.1976443529129028} 11/07/2021 17:48:06 - INFO - __main__ - Step 146116: {'lr': 8.496151181670853e-07, 'samples': 28054272, 'steps': 146115, 'loss/train': 1.1704806089401245} 11/07/2021 17:48:06 - INFO - __main__ - Step 146117: {'lr': 8.491780399048354e-07, 'samples': 28054464, 'steps': 146116, 'loss/train': 0.11997266113758087} 11/07/2021 17:48:07 - INFO - __main__ - Step 146118: {'lr': 8.487410739061175e-07, 'samples': 28054656, 'steps': 146117, 'loss/train': 1.5561943054199219} 11/07/2021 17:48:07 - INFO - __main__ - Step 146119: {'lr': 8.483042201711255e-07, 'samples': 28054848, 'steps': 146118, 'loss/train': 1.4874227046966553} 11/07/2021 17:48:08 - INFO - __main__ - Step 146120: {'lr': 8.478674787000262e-07, 'samples': 28055040, 'steps': 146119, 'loss/train': 1.4678807258605957} 11/07/2021 17:48:08 - INFO - __main__ - Step 146121: {'lr': 8.474308494930416e-07, 'samples': 28055232, 'steps': 146120, 'loss/train': 1.300833821296692} 11/07/2021 17:48:08 - INFO - __main__ - Step 146122: {'lr': 8.46994332550366e-07, 'samples': 28055424, 'steps': 146121, 'loss/train': 1.1230340003967285} 11/07/2021 17:48:11 - INFO - __main__ - Step 146123: {'lr': 8.465579278721658e-07, 'samples': 28055616, 'steps': 146122, 'loss/train': 1.0119537115097046} 11/07/2021 17:48:11 - INFO - __main__ - Step 146124: {'lr': 8.46121635458691e-07, 'samples': 28055808, 'steps': 146123, 'loss/train': 1.1122852563858032} 11/07/2021 17:48:12 - INFO - __main__ - Step 146125: {'lr': 8.456854553101078e-07, 'samples': 28056000, 'steps': 146124, 'loss/train': 1.701937198638916} 11/07/2021 17:48:12 - INFO - __main__ - Step 146126: {'lr': 8.45249387426611e-07, 'samples': 28056192, 'steps': 146125, 'loss/train': 1.728767991065979} 11/07/2021 17:48:13 - INFO - __main__ - Step 146127: {'lr': 8.448134318083667e-07, 'samples': 28056384, 'steps': 146126, 'loss/train': 1.3219915628433228} 11/07/2021 17:48:13 - INFO - __main__ - Step 146128: {'lr': 8.443775884556526e-07, 'samples': 28056576, 'steps': 146127, 'loss/train': 1.3685802221298218} 11/07/2021 17:48:13 - INFO - __main__ - Step 146129: {'lr': 8.439418573685798e-07, 'samples': 28056768, 'steps': 146128, 'loss/train': 1.1186357736587524} 11/07/2021 17:48:14 - INFO - __main__ - Step 146130: {'lr': 8.435062385473979e-07, 'samples': 28056960, 'steps': 146129, 'loss/train': 1.1072486639022827} 11/07/2021 17:48:15 - INFO - __main__ - Step 146131: {'lr': 8.430707319923015e-07, 'samples': 28057152, 'steps': 146130, 'loss/train': 1.2580652236938477} 11/07/2021 17:48:15 - INFO - __main__ - Step 146132: {'lr': 8.426353377034568e-07, 'samples': 28057344, 'steps': 146131, 'loss/train': 1.423966884613037} 11/07/2021 17:48:15 - INFO - __main__ - Step 146133: {'lr': 8.422000556810583e-07, 'samples': 28057536, 'steps': 146132, 'loss/train': 1.0091112852096558} 11/07/2021 17:48:16 - INFO - __main__ - Step 146134: {'lr': 8.417648859253557e-07, 'samples': 28057728, 'steps': 146133, 'loss/train': 1.2805814743041992} 11/07/2021 17:48:16 - INFO - __main__ - Step 146135: {'lr': 8.413298284364878e-07, 'samples': 28057920, 'steps': 146134, 'loss/train': 1.6253974437713623} 11/07/2021 17:48:17 - INFO - __main__ - Step 146136: {'lr': 8.408948832146767e-07, 'samples': 28058112, 'steps': 146135, 'loss/train': 1.1697907447814941} 11/07/2021 17:48:18 - INFO - __main__ - Step 146137: {'lr': 8.404600502601167e-07, 'samples': 28058304, 'steps': 146136, 'loss/train': 1.0895452499389648} 11/07/2021 17:48:18 - INFO - __main__ - Step 146138: {'lr': 8.40025329573002e-07, 'samples': 28058496, 'steps': 146137, 'loss/train': 1.283890962600708} 11/07/2021 17:48:18 - INFO - __main__ - Step 146139: {'lr': 8.395907211534992e-07, 'samples': 28058688, 'steps': 146138, 'loss/train': 1.11154043674469} 11/07/2021 17:48:19 - INFO - __main__ - Step 146140: {'lr': 8.39156225001858e-07, 'samples': 28058880, 'steps': 146139, 'loss/train': 1.3193602561950684} 11/07/2021 17:48:20 - INFO - __main__ - Step 146141: {'lr': 8.387218411182451e-07, 'samples': 28059072, 'steps': 146140, 'loss/train': 1.0169931650161743} 11/07/2021 17:48:20 - INFO - __main__ - Step 146142: {'lr': 8.382875695028547e-07, 'samples': 28059264, 'steps': 146141, 'loss/train': 1.3808709383010864} 11/07/2021 17:48:20 - INFO - __main__ - Step 146143: {'lr': 8.378534101559087e-07, 'samples': 28059456, 'steps': 146142, 'loss/train': 1.0731556415557861} 11/07/2021 17:48:21 - INFO - __main__ - Step 146144: {'lr': 8.374193630775461e-07, 'samples': 28059648, 'steps': 146143, 'loss/train': 1.0215989351272583} 11/07/2021 17:48:21 - INFO - __main__ - Step 146145: {'lr': 8.369854282680168e-07, 'samples': 28059840, 'steps': 146144, 'loss/train': 1.5297971963882446} 11/07/2021 17:48:22 - INFO - __main__ - Step 146146: {'lr': 8.365516057274869e-07, 'samples': 28060032, 'steps': 146145, 'loss/train': 1.2015223503112793} 11/07/2021 17:48:22 - INFO - __main__ - Step 146147: {'lr': 8.361178954561787e-07, 'samples': 28060224, 'steps': 146146, 'loss/train': 1.586052417755127} 11/07/2021 17:48:23 - INFO - __main__ - Step 146148: {'lr': 8.356842974542589e-07, 'samples': 28060416, 'steps': 146147, 'loss/train': 1.3362776041030884} 11/07/2021 17:48:23 - INFO - __main__ - Step 146149: {'lr': 8.352508117219493e-07, 'samples': 28060608, 'steps': 146148, 'loss/train': 1.3044862747192383} 11/07/2021 17:48:23 - INFO - __main__ - Step 146150: {'lr': 8.348174382594165e-07, 'samples': 28060800, 'steps': 146149, 'loss/train': 1.3940708637237549} 11/07/2021 17:48:24 - INFO - __main__ - Step 146151: {'lr': 8.343841770668826e-07, 'samples': 28060992, 'steps': 146150, 'loss/train': 1.428458333015442} 11/07/2021 17:48:25 - INFO - __main__ - Step 146152: {'lr': 8.33951028144514e-07, 'samples': 28061184, 'steps': 146151, 'loss/train': 1.160857081413269} 11/07/2021 17:48:25 - INFO - __main__ - Step 146153: {'lr': 8.335179914925328e-07, 'samples': 28061376, 'steps': 146152, 'loss/train': 1.0268534421920776} 11/07/2021 17:48:26 - INFO - __main__ - Step 146154: {'lr': 8.330850671111334e-07, 'samples': 28061568, 'steps': 146153, 'loss/train': 1.3890278339385986} 11/07/2021 17:48:26 - INFO - __main__ - Step 146155: {'lr': 8.326522550004823e-07, 'samples': 28061760, 'steps': 146154, 'loss/train': 1.0997530221939087} 11/07/2021 17:48:26 - INFO - __main__ - Step 146156: {'lr': 8.322195551608014e-07, 'samples': 28061952, 'steps': 146155, 'loss/train': 1.4079591035842896} 11/07/2021 17:48:27 - INFO - __main__ - Step 146157: {'lr': 8.317869675922574e-07, 'samples': 28062144, 'steps': 146156, 'loss/train': 1.1962308883666992} 11/07/2021 17:48:28 - INFO - __main__ - Step 146158: {'lr': 8.313544922951e-07, 'samples': 28062336, 'steps': 146157, 'loss/train': 1.5005710124969482} 11/07/2021 17:48:28 - INFO - __main__ - Step 146159: {'lr': 8.309221292694679e-07, 'samples': 28062528, 'steps': 146158, 'loss/train': 1.3813120126724243} 11/07/2021 17:48:28 - INFO - __main__ - Step 146160: {'lr': 8.304898785155834e-07, 'samples': 28062720, 'steps': 146159, 'loss/train': 1.153929591178894} 11/07/2021 17:48:29 - INFO - __main__ - Step 146161: {'lr': 8.300577400336407e-07, 'samples': 28062912, 'steps': 146160, 'loss/train': 0.5279120802879333} 11/07/2021 17:48:31 - INFO - __main__ - Step 146162: {'lr': 8.296257138238061e-07, 'samples': 28063104, 'steps': 146161, 'loss/train': 1.6052240133285522} 11/07/2021 17:48:31 - INFO - __main__ - Step 146163: {'lr': 8.291937998863297e-07, 'samples': 28063296, 'steps': 146162, 'loss/train': 0.8884298205375671} 11/07/2021 17:48:31 - INFO - __main__ - Step 146164: {'lr': 8.287619982213502e-07, 'samples': 28063488, 'steps': 146163, 'loss/train': 0.48862653970718384} 11/07/2021 17:48:32 - INFO - __main__ - Step 146165: {'lr': 8.283303088290894e-07, 'samples': 28063680, 'steps': 146164, 'loss/train': 0.4579610824584961} 11/07/2021 17:48:32 - INFO - __main__ - Step 146166: {'lr': 8.278987317097419e-07, 'samples': 28063872, 'steps': 146165, 'loss/train': 1.1385736465454102} 11/07/2021 17:48:32 - INFO - __main__ - Step 146167: {'lr': 8.274672668635019e-07, 'samples': 28064064, 'steps': 146166, 'loss/train': 0.7880924344062805} 11/07/2021 17:48:33 - INFO - __main__ - Step 146168: {'lr': 8.270359142905637e-07, 'samples': 28064256, 'steps': 146167, 'loss/train': 0.12456241250038147} 11/07/2021 17:48:34 - INFO - __main__ - Step 146169: {'lr': 8.266046739910937e-07, 'samples': 28064448, 'steps': 146168, 'loss/train': 1.542737603187561} 11/07/2021 17:48:34 - INFO - __main__ - Step 146170: {'lr': 8.261735459653418e-07, 'samples': 28064640, 'steps': 146169, 'loss/train': 1.0695867538452148} 11/07/2021 17:48:35 - INFO - __main__ - Step 146171: {'lr': 8.257425302134469e-07, 'samples': 28064832, 'steps': 146170, 'loss/train': 1.4005818367004395} 11/07/2021 17:48:35 - INFO - __main__ - Step 146172: {'lr': 8.253116267356308e-07, 'samples': 28065024, 'steps': 146171, 'loss/train': 1.1127383708953857} 11/07/2021 17:48:36 - INFO - __main__ - Step 146173: {'lr': 8.248808355320881e-07, 'samples': 28065216, 'steps': 146172, 'loss/train': 1.473366618156433} 11/07/2021 17:48:37 - INFO - __main__ - Step 146174: {'lr': 8.244501566030127e-07, 'samples': 28065408, 'steps': 146173, 'loss/train': 0.9140765070915222} 11/07/2021 17:48:37 - INFO - __main__ - Step 146175: {'lr': 8.240195899485992e-07, 'samples': 28065600, 'steps': 146174, 'loss/train': 0.6088756918907166} 11/07/2021 17:48:37 - INFO - __main__ - Step 146176: {'lr': 8.235891355690417e-07, 'samples': 28065792, 'steps': 146175, 'loss/train': 1.344448447227478} 11/07/2021 17:48:38 - INFO - __main__ - Step 146177: {'lr': 8.231587934645069e-07, 'samples': 28065984, 'steps': 146176, 'loss/train': 1.2841796875} 11/07/2021 17:48:38 - INFO - __main__ - Step 146178: {'lr': 8.227285636352444e-07, 'samples': 28066176, 'steps': 146177, 'loss/train': 1.5079598426818848} 11/07/2021 17:48:39 - INFO - __main__ - Step 146179: {'lr': 8.222984460813931e-07, 'samples': 28066368, 'steps': 146178, 'loss/train': 1.3014472723007202} 11/07/2021 17:48:39 - INFO - __main__ - Step 146180: {'lr': 8.218684408031752e-07, 'samples': 28066560, 'steps': 146179, 'loss/train': 0.8796275854110718} 11/07/2021 17:48:40 - INFO - __main__ - Step 146181: {'lr': 8.214385478007568e-07, 'samples': 28066752, 'steps': 146180, 'loss/train': 1.245387077331543} 11/07/2021 17:48:40 - INFO - __main__ - Step 146182: {'lr': 8.210087670743882e-07, 'samples': 28066944, 'steps': 146181, 'loss/train': 1.1539889574050903} 11/07/2021 17:48:40 - INFO - __main__ - Step 146183: {'lr': 8.205790986242079e-07, 'samples': 28067136, 'steps': 146182, 'loss/train': 1.31263267993927} 11/07/2021 17:48:42 - INFO - __main__ - Step 146184: {'lr': 8.20149542450438e-07, 'samples': 28067328, 'steps': 146183, 'loss/train': 0.5538541674613953} 11/07/2021 17:48:42 - INFO - __main__ - Step 146185: {'lr': 8.197200985532449e-07, 'samples': 28067520, 'steps': 146184, 'loss/train': 1.0101505517959595} 11/07/2021 17:48:42 - INFO - __main__ - Step 146186: {'lr': 8.192907669328508e-07, 'samples': 28067712, 'steps': 146185, 'loss/train': 0.5125925540924072} 11/07/2021 17:48:43 - INFO - __main__ - Step 146187: {'lr': 8.188615475894501e-07, 'samples': 28067904, 'steps': 146186, 'loss/train': 0.6455321311950684} 11/07/2021 17:48:43 - INFO - __main__ - Step 146188: {'lr': 8.184324405232091e-07, 'samples': 28068096, 'steps': 146187, 'loss/train': 0.6812224984169006} 11/07/2021 17:48:44 - INFO - __main__ - Step 146189: {'lr': 8.1800344573435e-07, 'samples': 28068288, 'steps': 146188, 'loss/train': 1.1073861122131348} 11/07/2021 17:48:44 - INFO - __main__ - Step 146190: {'lr': 8.175745632230669e-07, 'samples': 28068480, 'steps': 146189, 'loss/train': 1.2901228666305542} 11/07/2021 17:48:45 - INFO - __main__ - Step 146191: {'lr': 8.171457929895265e-07, 'samples': 28068672, 'steps': 146190, 'loss/train': 1.1870687007904053} 11/07/2021 17:48:45 - INFO - __main__ - Step 146192: {'lr': 8.167171350339231e-07, 'samples': 28068864, 'steps': 146191, 'loss/train': 0.4175911247730255} 11/07/2021 17:48:45 - INFO - __main__ - Step 146193: {'lr': 8.162885893564786e-07, 'samples': 28069056, 'steps': 146192, 'loss/train': 2.034694194793701} 11/07/2021 17:48:47 - INFO - __main__ - Step 146194: {'lr': 8.158601559573597e-07, 'samples': 28069248, 'steps': 146193, 'loss/train': 2.206284523010254} 11/07/2021 17:48:47 - INFO - __main__ - Step 146195: {'lr': 8.154318348367607e-07, 'samples': 28069440, 'steps': 146194, 'loss/train': 1.0953269004821777} 11/07/2021 17:48:47 - INFO - __main__ - Step 146196: {'lr': 8.150036259949035e-07, 'samples': 28069632, 'steps': 146195, 'loss/train': 1.023226022720337} 11/07/2021 17:48:48 - INFO - __main__ - Step 146197: {'lr': 8.145755294319268e-07, 'samples': 28069824, 'steps': 146196, 'loss/train': 1.4429851770401} 11/07/2021 17:48:48 - INFO - __main__ - Step 146198: {'lr': 8.141475451480807e-07, 'samples': 28070016, 'steps': 146197, 'loss/train': 1.393538475036621} 11/07/2021 17:48:49 - INFO - __main__ - Step 146199: {'lr': 8.137196731435315e-07, 'samples': 28070208, 'steps': 146198, 'loss/train': 1.1573388576507568} 11/07/2021 17:48:49 - INFO - __main__ - Step 146200: {'lr': 8.132919134184736e-07, 'samples': 28070400, 'steps': 146199, 'loss/train': 1.4586222171783447} 11/07/2021 17:48:50 - INFO - __main__ - Step 146201: {'lr': 8.128642659731289e-07, 'samples': 28070592, 'steps': 146200, 'loss/train': 1.3844693899154663} 11/07/2021 17:48:50 - INFO - __main__ - Step 146202: {'lr': 8.124367308076364e-07, 'samples': 28070784, 'steps': 146201, 'loss/train': 1.2135237455368042} 11/07/2021 17:48:50 - INFO - __main__ - Step 146203: {'lr': 8.120093079222179e-07, 'samples': 28070976, 'steps': 146202, 'loss/train': 1.3195639848709106} 11/07/2021 17:48:51 - INFO - __main__ - Step 146204: {'lr': 8.11581997317068e-07, 'samples': 28071168, 'steps': 146203, 'loss/train': 0.7662909030914307} 11/07/2021 17:48:52 - INFO - __main__ - Step 146205: {'lr': 8.111547989923529e-07, 'samples': 28071360, 'steps': 146204, 'loss/train': 1.0988775491714478} 11/07/2021 17:48:52 - INFO - __main__ - Step 146206: {'lr': 8.107277129482949e-07, 'samples': 28071552, 'steps': 146205, 'loss/train': 1.1015329360961914} 11/07/2021 17:48:53 - INFO - __main__ - Step 146207: {'lr': 8.103007391850881e-07, 'samples': 28071744, 'steps': 146206, 'loss/train': 0.8071165680885315} 11/07/2021 17:48:53 - INFO - __main__ - Step 146208: {'lr': 8.098738777028991e-07, 'samples': 28071936, 'steps': 146207, 'loss/train': 1.3576951026916504} 11/07/2021 17:48:53 - INFO - __main__ - Step 146209: {'lr': 8.0944712850195e-07, 'samples': 28072128, 'steps': 146208, 'loss/train': 1.3611024618148804} 11/07/2021 17:48:54 - INFO - __main__ - Step 146210: {'lr': 8.090204915824351e-07, 'samples': 28072320, 'steps': 146209, 'loss/train': 1.1552904844284058} 11/07/2021 17:48:55 - INFO - __main__ - Step 146211: {'lr': 8.085939669444931e-07, 'samples': 28072512, 'steps': 146210, 'loss/train': 1.0702375173568726} 11/07/2021 17:48:55 - INFO - __main__ - Step 146212: {'lr': 8.081675545883737e-07, 'samples': 28072704, 'steps': 146211, 'loss/train': 1.4810364246368408} 11/07/2021 17:48:55 - INFO - __main__ - Step 146213: {'lr': 8.077412545142437e-07, 'samples': 28072896, 'steps': 146212, 'loss/train': 1.2620291709899902} 11/07/2021 17:48:56 - INFO - __main__ - Step 146214: {'lr': 8.073150667222973e-07, 'samples': 28073088, 'steps': 146213, 'loss/train': 1.021478295326233} 11/07/2021 17:48:57 - INFO - __main__ - Step 146215: {'lr': 8.068889912127287e-07, 'samples': 28073280, 'steps': 146214, 'loss/train': 1.7756315469741821} 11/07/2021 17:48:57 - INFO - __main__ - Step 146216: {'lr': 8.064630279857598e-07, 'samples': 28073472, 'steps': 146215, 'loss/train': 1.5720345973968506} 11/07/2021 17:48:57 - INFO - __main__ - Step 146217: {'lr': 8.060371770415298e-07, 'samples': 28073664, 'steps': 146216, 'loss/train': 1.4867898225784302} 11/07/2021 17:48:58 - INFO - __main__ - Step 146218: {'lr': 8.056114383802605e-07, 'samples': 28073856, 'steps': 146217, 'loss/train': 1.3464670181274414} 11/07/2021 17:48:58 - INFO - __main__ - Step 146219: {'lr': 8.051858120021182e-07, 'samples': 28074048, 'steps': 146218, 'loss/train': 1.502508282661438} 11/07/2021 17:48:59 - INFO - __main__ - Step 146220: {'lr': 8.047602979073254e-07, 'samples': 28074240, 'steps': 146219, 'loss/train': 1.5718637704849243} 11/07/2021 17:49:00 - INFO - __main__ - Step 146221: {'lr': 8.043348960960761e-07, 'samples': 28074432, 'steps': 146220, 'loss/train': 0.742470920085907} 11/07/2021 17:49:00 - INFO - __main__ - Step 146222: {'lr': 8.039096065685369e-07, 'samples': 28074624, 'steps': 146221, 'loss/train': 1.4819139242172241} 11/07/2021 17:49:00 - INFO - __main__ - Step 146223: {'lr': 8.034844293249022e-07, 'samples': 28074816, 'steps': 146222, 'loss/train': 1.215273380279541} 11/07/2021 17:49:01 - INFO - __main__ - Step 146224: {'lr': 8.03059364365366e-07, 'samples': 28075008, 'steps': 146223, 'loss/train': 1.4375594854354858} 11/07/2021 17:49:02 - INFO - __main__ - Step 146225: {'lr': 8.026344116901507e-07, 'samples': 28075200, 'steps': 146224, 'loss/train': 0.9577522873878479} 11/07/2021 17:49:02 - INFO - __main__ - Step 146226: {'lr': 8.022095712994227e-07, 'samples': 28075392, 'steps': 146225, 'loss/train': 1.1241328716278076} 11/07/2021 17:49:02 - INFO - __main__ - Step 146227: {'lr': 8.017848431933484e-07, 'samples': 28075584, 'steps': 146226, 'loss/train': 1.3667362928390503} 11/07/2021 17:49:03 - INFO - __main__ - Step 146228: {'lr': 8.013602273721499e-07, 'samples': 28075776, 'steps': 146227, 'loss/train': 0.8684447407722473} 11/07/2021 17:49:03 - INFO - __main__ - Step 146229: {'lr': 8.009357238360215e-07, 'samples': 28075968, 'steps': 146228, 'loss/train': 1.381655216217041} 11/07/2021 17:49:03 - INFO - __main__ - Step 146230: {'lr': 8.005113325851576e-07, 'samples': 28076160, 'steps': 146229, 'loss/train': 1.7695392370224} 11/07/2021 17:49:04 - INFO - __main__ - Step 146231: {'lr': 8.000870536197247e-07, 'samples': 28076352, 'steps': 146230, 'loss/train': 0.679676353931427} 11/07/2021 17:49:05 - INFO - __main__ - Step 146232: {'lr': 7.996628869399447e-07, 'samples': 28076544, 'steps': 146231, 'loss/train': 0.8156420588493347} 11/07/2021 17:49:05 - INFO - __main__ - Step 146233: {'lr': 7.992388325459565e-07, 'samples': 28076736, 'steps': 146232, 'loss/train': 0.9512227773666382} 11/07/2021 17:49:05 - INFO - __main__ - Step 146234: {'lr': 7.988148904380099e-07, 'samples': 28076928, 'steps': 146233, 'loss/train': 2.6162986755371094} 11/07/2021 17:49:06 - INFO - __main__ - Step 146235: {'lr': 7.983910606162714e-07, 'samples': 28077120, 'steps': 146234, 'loss/train': 1.3183177709579468} 11/07/2021 17:49:07 - INFO - __main__ - Step 146236: {'lr': 7.979673430809353e-07, 'samples': 28077312, 'steps': 146235, 'loss/train': 1.1566606760025024} 11/07/2021 17:49:07 - INFO - __main__ - Step 146237: {'lr': 7.975437378321682e-07, 'samples': 28077504, 'steps': 146236, 'loss/train': 1.295674443244934} 11/07/2021 17:49:08 - INFO - __main__ - Step 146238: {'lr': 7.971202448702198e-07, 'samples': 28077696, 'steps': 146237, 'loss/train': 1.1318455934524536} 11/07/2021 17:49:08 - INFO - __main__ - Step 146239: {'lr': 7.966968641952011e-07, 'samples': 28077888, 'steps': 146238, 'loss/train': 1.2765089273452759} 11/07/2021 17:49:08 - INFO - __main__ - Step 146240: {'lr': 7.962735958073619e-07, 'samples': 28078080, 'steps': 146239, 'loss/train': 1.0906087160110474} 11/07/2021 17:49:09 - INFO - __main__ - Step 146241: {'lr': 7.958504397068966e-07, 'samples': 28078272, 'steps': 146240, 'loss/train': 1.5637212991714478} 11/07/2021 17:49:10 - INFO - __main__ - Step 146242: {'lr': 7.95427395893944e-07, 'samples': 28078464, 'steps': 146241, 'loss/train': 0.9519189596176147} 11/07/2021 17:49:10 - INFO - __main__ - Step 146243: {'lr': 7.950044643687537e-07, 'samples': 28078656, 'steps': 146242, 'loss/train': 1.340530276298523} 11/07/2021 17:49:10 - INFO - __main__ - Step 146244: {'lr': 7.945816451314647e-07, 'samples': 28078848, 'steps': 146243, 'loss/train': 1.309266209602356} 11/07/2021 17:49:11 - INFO - __main__ - Step 146245: {'lr': 7.941589381823267e-07, 'samples': 28079040, 'steps': 146244, 'loss/train': 1.175291657447815} 11/07/2021 17:49:12 - INFO - __main__ - Step 146246: {'lr': 7.937363435214507e-07, 'samples': 28079232, 'steps': 146245, 'loss/train': 1.1611764430999756} 11/07/2021 17:49:13 - INFO - __main__ - Step 146247: {'lr': 7.933138611491142e-07, 'samples': 28079424, 'steps': 146246, 'loss/train': 1.068123698234558} 11/07/2021 17:49:13 - INFO - __main__ - Step 146248: {'lr': 7.928914910654283e-07, 'samples': 28079616, 'steps': 146247, 'loss/train': 1.5375711917877197} 11/07/2021 17:49:13 - INFO - __main__ - Step 146249: {'lr': 7.924692332706429e-07, 'samples': 28079808, 'steps': 146248, 'loss/train': 1.5764347314834595} 11/07/2021 17:49:14 - INFO - __main__ - Step 146250: {'lr': 7.920470877649243e-07, 'samples': 28080000, 'steps': 146249, 'loss/train': 0.9937587976455688} 11/07/2021 17:49:14 - INFO - __main__ - Step 146251: {'lr': 7.916250545484393e-07, 'samples': 28080192, 'steps': 146250, 'loss/train': 1.4647958278656006} 11/07/2021 17:49:15 - INFO - __main__ - Step 146252: {'lr': 7.912031336214376e-07, 'samples': 28080384, 'steps': 146251, 'loss/train': 1.1688179969787598} 11/07/2021 17:49:15 - INFO - __main__ - Step 146253: {'lr': 7.907813249840301e-07, 'samples': 28080576, 'steps': 146252, 'loss/train': 1.0107672214508057} 11/07/2021 17:49:16 - INFO - __main__ - Step 146254: {'lr': 7.903596286364945e-07, 'samples': 28080768, 'steps': 146253, 'loss/train': 1.0395987033843994} 11/07/2021 17:49:16 - INFO - __main__ - Step 146255: {'lr': 7.899380445789695e-07, 'samples': 28080960, 'steps': 146254, 'loss/train': 2.498117446899414} 11/07/2021 17:49:17 - INFO - __main__ - Step 146256: {'lr': 7.895165728116216e-07, 'samples': 28081152, 'steps': 146255, 'loss/train': 1.3852776288986206} 11/07/2021 17:49:18 - INFO - __main__ - Step 146257: {'lr': 7.890952133347007e-07, 'samples': 28081344, 'steps': 146256, 'loss/train': 1.2644307613372803} 11/07/2021 17:49:18 - INFO - __main__ - Step 146258: {'lr': 7.886739661483732e-07, 'samples': 28081536, 'steps': 146257, 'loss/train': 1.8415477275848389} 11/07/2021 17:49:18 - INFO - __main__ - Step 146259: {'lr': 7.882528312528059e-07, 'samples': 28081728, 'steps': 146258, 'loss/train': 1.2038705348968506} 11/07/2021 17:49:19 - INFO - __main__ - Step 146260: {'lr': 7.878318086482205e-07, 'samples': 28081920, 'steps': 146259, 'loss/train': 0.9842473268508911} 11/07/2021 17:49:19 - INFO - __main__ - Step 146261: {'lr': 7.874108983347839e-07, 'samples': 28082112, 'steps': 146260, 'loss/train': 0.6788673400878906} 11/07/2021 17:49:20 - INFO - __main__ - Step 146262: {'lr': 7.869901003126901e-07, 'samples': 28082304, 'steps': 146261, 'loss/train': 1.3252798318862915} 11/07/2021 17:49:20 - INFO - __main__ - Step 146263: {'lr': 7.865694145821334e-07, 'samples': 28082496, 'steps': 146262, 'loss/train': 1.2407538890838623} 11/07/2021 17:49:21 - INFO - __main__ - Step 146264: {'lr': 7.861488411433082e-07, 'samples': 28082688, 'steps': 146263, 'loss/train': 1.5611414909362793} 11/07/2021 17:49:21 - INFO - __main__ - Step 146265: {'lr': 7.857283799964088e-07, 'samples': 28082880, 'steps': 146264, 'loss/train': 1.011276364326477} 11/07/2021 17:49:22 - INFO - __main__ - Step 146266: {'lr': 7.853080311416016e-07, 'samples': 28083072, 'steps': 146265, 'loss/train': 1.20695960521698} 11/07/2021 17:49:23 - INFO - __main__ - Step 146267: {'lr': 7.848877945790811e-07, 'samples': 28083264, 'steps': 146266, 'loss/train': 1.1615220308303833} 11/07/2021 17:49:23 - INFO - __main__ - Step 146268: {'lr': 7.844676703090692e-07, 'samples': 28083456, 'steps': 146267, 'loss/train': 1.011146903038025} 11/07/2021 17:49:23 - INFO - __main__ - Step 146269: {'lr': 7.840476583317047e-07, 'samples': 28083648, 'steps': 146268, 'loss/train': 1.3923108577728271} 11/07/2021 17:49:24 - INFO - __main__ - Step 146270: {'lr': 7.836277586472096e-07, 'samples': 28083840, 'steps': 146269, 'loss/train': 1.4394451379776} 11/07/2021 17:49:24 - INFO - __main__ - Step 146271: {'lr': 7.832079712557782e-07, 'samples': 28084032, 'steps': 146270, 'loss/train': 0.31882336735725403} 11/07/2021 17:49:25 - INFO - __main__ - Step 146272: {'lr': 7.827882961575772e-07, 'samples': 28084224, 'steps': 146271, 'loss/train': 1.4740898609161377} 11/07/2021 17:49:25 - INFO - __main__ - Step 146273: {'lr': 7.823687333528007e-07, 'samples': 28084416, 'steps': 146272, 'loss/train': 1.5732758045196533} 11/07/2021 17:49:26 - INFO - __main__ - Step 146274: {'lr': 7.819492828416707e-07, 'samples': 28084608, 'steps': 146273, 'loss/train': 1.247402548789978} 11/07/2021 17:49:26 - INFO - __main__ - Step 146275: {'lr': 7.815299446243262e-07, 'samples': 28084800, 'steps': 146274, 'loss/train': 1.3597267866134644} 11/07/2021 17:49:26 - INFO - __main__ - Step 146276: {'lr': 7.811107187009892e-07, 'samples': 28084992, 'steps': 146275, 'loss/train': 1.5185801982879639} 11/07/2021 17:49:27 - INFO - __main__ - Step 146277: {'lr': 7.80691605071826e-07, 'samples': 28085184, 'steps': 146276, 'loss/train': 1.1283595561981201} 11/07/2021 17:49:28 - INFO - __main__ - Step 146278: {'lr': 7.802726037370311e-07, 'samples': 28085376, 'steps': 146277, 'loss/train': 0.9155973196029663} 11/07/2021 17:49:28 - INFO - __main__ - Step 146279: {'lr': 7.798537146967988e-07, 'samples': 28085568, 'steps': 146278, 'loss/train': 1.669621467590332} 11/07/2021 17:49:28 - INFO - __main__ - Step 146280: {'lr': 7.79434937951351e-07, 'samples': 28085760, 'steps': 146279, 'loss/train': 1.048779010772705} 11/07/2021 17:49:29 - INFO - __main__ - Step 146281: {'lr': 7.790162735008266e-07, 'samples': 28085952, 'steps': 146280, 'loss/train': 1.2264103889465332} 11/07/2021 17:49:30 - INFO - __main__ - Step 146282: {'lr': 7.785977213454199e-07, 'samples': 28086144, 'steps': 146281, 'loss/train': 1.150195837020874} 11/07/2021 17:49:30 - INFO - __main__ - Step 146283: {'lr': 7.78179281485325e-07, 'samples': 28086336, 'steps': 146282, 'loss/train': 1.722350001335144} 11/07/2021 17:49:31 - INFO - __main__ - Step 146284: {'lr': 7.777609539207642e-07, 'samples': 28086528, 'steps': 146283, 'loss/train': 0.846246063709259} 11/07/2021 17:49:31 - INFO - __main__ - Step 146285: {'lr': 7.773427386519039e-07, 'samples': 28086720, 'steps': 146284, 'loss/train': 1.460792064666748} 11/07/2021 17:49:31 - INFO - __main__ - Step 146286: {'lr': 7.769246356789106e-07, 'samples': 28086912, 'steps': 146285, 'loss/train': 0.9544568061828613} 11/07/2021 17:49:32 - INFO - __main__ - Step 146287: {'lr': 7.765066450019786e-07, 'samples': 28087104, 'steps': 146286, 'loss/train': 1.479791283607483} 11/07/2021 17:49:33 - INFO - __main__ - Step 146288: {'lr': 7.760887666213024e-07, 'samples': 28087296, 'steps': 146287, 'loss/train': 1.3298529386520386} 11/07/2021 17:49:33 - INFO - __main__ - Step 146289: {'lr': 7.756710005371037e-07, 'samples': 28087488, 'steps': 146288, 'loss/train': 1.385347604751587} 11/07/2021 17:49:33 - INFO - __main__ - Step 146290: {'lr': 7.752533467495215e-07, 'samples': 28087680, 'steps': 146289, 'loss/train': 0.9652655124664307} 11/07/2021 17:49:34 - INFO - __main__ - Step 146291: {'lr': 7.748358052587778e-07, 'samples': 28087872, 'steps': 146290, 'loss/train': 1.515605092048645} 11/07/2021 17:49:34 - INFO - __main__ - Step 146292: {'lr': 7.744183760650392e-07, 'samples': 28088064, 'steps': 146291, 'loss/train': 1.1315547227859497} 11/07/2021 17:49:35 - INFO - __main__ - Step 146293: {'lr': 7.740010591684998e-07, 'samples': 28088256, 'steps': 146292, 'loss/train': 1.651389479637146} 11/07/2021 17:49:35 - INFO - __main__ - Step 146294: {'lr': 7.735838545693541e-07, 'samples': 28088448, 'steps': 146293, 'loss/train': 1.4250774383544922} 11/07/2021 17:49:36 - INFO - __main__ - Step 146295: {'lr': 7.731667622677685e-07, 'samples': 28088640, 'steps': 146294, 'loss/train': 1.2942962646484375} 11/07/2021 17:49:36 - INFO - __main__ - Step 146296: {'lr': 7.727497822639651e-07, 'samples': 28088832, 'steps': 146295, 'loss/train': 1.4715083837509155} 11/07/2021 17:49:36 - INFO - __main__ - Step 146297: {'lr': 7.723329145581381e-07, 'samples': 28089024, 'steps': 146296, 'loss/train': 1.4154210090637207} 11/07/2021 17:49:38 - INFO - __main__ - Step 146298: {'lr': 7.719161591504264e-07, 'samples': 28089216, 'steps': 146297, 'loss/train': 1.7818310260772705} 11/07/2021 17:49:38 - INFO - __main__ - Step 146299: {'lr': 7.714995160410243e-07, 'samples': 28089408, 'steps': 146298, 'loss/train': 1.299609899520874} 11/07/2021 17:49:38 - INFO - __main__ - Step 146300: {'lr': 7.710829852301815e-07, 'samples': 28089600, 'steps': 146299, 'loss/train': 1.9199920892715454} 11/07/2021 17:49:39 - INFO - __main__ - Step 146301: {'lr': 7.70666566718009e-07, 'samples': 28089792, 'steps': 146300, 'loss/train': 1.381871223449707} 11/07/2021 17:49:39 - INFO - __main__ - Step 146302: {'lr': 7.70250260504729e-07, 'samples': 28089984, 'steps': 146301, 'loss/train': 1.3233280181884766} 11/07/2021 17:49:40 - INFO - __main__ - Step 146303: {'lr': 7.698340665905356e-07, 'samples': 28090176, 'steps': 146302, 'loss/train': 1.575265645980835} 11/07/2021 17:49:41 - INFO - __main__ - Step 146304: {'lr': 7.694179849756234e-07, 'samples': 28090368, 'steps': 146303, 'loss/train': 1.9406756162643433} 11/07/2021 17:49:41 - INFO - __main__ - Step 146305: {'lr': 7.690020156601585e-07, 'samples': 28090560, 'steps': 146304, 'loss/train': 0.8618423342704773} 11/07/2021 17:49:41 - INFO - __main__ - Step 146306: {'lr': 7.685861586443354e-07, 'samples': 28090752, 'steps': 146305, 'loss/train': 0.7301026582717896} 11/07/2021 17:49:42 - INFO - __main__ - Step 146307: {'lr': 7.681704139283207e-07, 'samples': 28090944, 'steps': 146306, 'loss/train': 0.8738893866539001} 11/07/2021 17:49:43 - INFO - __main__ - Step 146308: {'lr': 7.677547815123365e-07, 'samples': 28091136, 'steps': 146307, 'loss/train': 1.234847903251648} 11/07/2021 17:49:43 - INFO - __main__ - Step 146309: {'lr': 7.673392613965769e-07, 'samples': 28091328, 'steps': 146308, 'loss/train': 1.5027810335159302} 11/07/2021 17:49:43 - INFO - __main__ - Step 146310: {'lr': 7.669238535811807e-07, 'samples': 28091520, 'steps': 146309, 'loss/train': 1.6792471408843994} 11/07/2021 17:49:44 - INFO - __main__ - Step 146311: {'lr': 7.6650855806637e-07, 'samples': 28091712, 'steps': 146310, 'loss/train': 1.0935189723968506} 11/07/2021 17:49:44 - INFO - __main__ - Step 146312: {'lr': 7.660933748523391e-07, 'samples': 28091904, 'steps': 146311, 'loss/train': 0.7320420742034912} 11/07/2021 17:49:45 - INFO - __main__ - Step 146313: {'lr': 7.656783039392545e-07, 'samples': 28092096, 'steps': 146312, 'loss/train': 1.0473196506500244} 11/07/2021 17:49:45 - INFO - __main__ - Step 146314: {'lr': 7.652633453273106e-07, 'samples': 28092288, 'steps': 146313, 'loss/train': 1.3159314393997192} 11/07/2021 17:49:46 - INFO - __main__ - Step 146315: {'lr': 7.648484990166738e-07, 'samples': 28092480, 'steps': 146314, 'loss/train': 1.1946218013763428} 11/07/2021 17:49:46 - INFO - __main__ - Step 146316: {'lr': 7.644337650075661e-07, 'samples': 28092672, 'steps': 146315, 'loss/train': 1.1815518140792847} 11/07/2021 17:49:46 - INFO - __main__ - Step 146317: {'lr': 7.640191433001542e-07, 'samples': 28092864, 'steps': 146316, 'loss/train': 0.5148669481277466} 11/07/2021 17:49:48 - INFO - __main__ - Step 146318: {'lr': 7.636046338946323e-07, 'samples': 28093056, 'steps': 146317, 'loss/train': 1.3933995962142944} 11/07/2021 17:49:48 - INFO - __main__ - Step 146319: {'lr': 7.63190236791167e-07, 'samples': 28093248, 'steps': 146318, 'loss/train': 1.4506783485412598} 11/07/2021 17:49:48 - INFO - __main__ - Step 146320: {'lr': 7.627759519899802e-07, 'samples': 28093440, 'steps': 146319, 'loss/train': 1.1456092596054077} 11/07/2021 17:49:49 - INFO - __main__ - Step 146321: {'lr': 7.623617794912385e-07, 'samples': 28093632, 'steps': 146320, 'loss/train': 1.9451569318771362} 11/07/2021 17:49:49 - INFO - __main__ - Step 146322: {'lr': 7.619477192951363e-07, 'samples': 28093824, 'steps': 146321, 'loss/train': 1.3273487091064453} 11/07/2021 17:49:49 - INFO - __main__ - Step 146323: {'lr': 7.6153377140184e-07, 'samples': 28094016, 'steps': 146322, 'loss/train': 0.8799511194229126} 11/07/2021 17:49:50 - INFO - __main__ - Step 146324: {'lr': 7.611199358115717e-07, 'samples': 28094208, 'steps': 146323, 'loss/train': 1.0704243183135986} 11/07/2021 17:49:51 - INFO - __main__ - Step 146325: {'lr': 7.607062125244702e-07, 'samples': 28094400, 'steps': 146324, 'loss/train': 1.4501169919967651} 11/07/2021 17:49:51 - INFO - __main__ - Step 146326: {'lr': 7.602926015407852e-07, 'samples': 28094592, 'steps': 146325, 'loss/train': 0.37418070435523987} 11/07/2021 17:49:51 - INFO - __main__ - Step 146327: {'lr': 7.598791028606277e-07, 'samples': 28094784, 'steps': 146326, 'loss/train': 1.4106554985046387} 11/07/2021 17:49:52 - INFO - __main__ - Step 146328: {'lr': 7.594657164842478e-07, 'samples': 28094976, 'steps': 146327, 'loss/train': 1.286666750907898} 11/07/2021 17:49:53 - INFO - __main__ - Step 146329: {'lr': 7.590524424117839e-07, 'samples': 28095168, 'steps': 146328, 'loss/train': 0.9019376635551453} 11/07/2021 17:49:53 - INFO - __main__ - Step 146330: {'lr': 7.586392806434583e-07, 'samples': 28095360, 'steps': 146329, 'loss/train': 0.8391901850700378} 11/07/2021 17:49:54 - INFO - __main__ - Step 146331: {'lr': 7.582262311794374e-07, 'samples': 28095552, 'steps': 146330, 'loss/train': 1.3558986186981201} 11/07/2021 17:49:54 - INFO - __main__ - Step 146332: {'lr': 7.578132940199156e-07, 'samples': 28095744, 'steps': 146331, 'loss/train': 0.5169556140899658} 11/07/2021 17:49:54 - INFO - __main__ - Step 146333: {'lr': 7.574004691650871e-07, 'samples': 28095936, 'steps': 146332, 'loss/train': 1.5845921039581299} 11/07/2021 17:49:55 - INFO - __main__ - Step 146334: {'lr': 7.569877566151185e-07, 'samples': 28096128, 'steps': 146333, 'loss/train': 1.186406135559082} 11/07/2021 17:49:56 - INFO - __main__ - Step 146335: {'lr': 7.565751563702039e-07, 'samples': 28096320, 'steps': 146334, 'loss/train': 1.022395133972168} 11/07/2021 17:49:56 - INFO - __main__ - Step 146336: {'lr': 7.561626684305378e-07, 'samples': 28096512, 'steps': 146335, 'loss/train': 1.604751706123352} 11/07/2021 17:49:56 - INFO - __main__ - Step 146337: {'lr': 7.557502927963144e-07, 'samples': 28096704, 'steps': 146336, 'loss/train': 0.16903524100780487} 11/07/2021 17:49:57 - INFO - __main__ - Step 146338: {'lr': 7.553380294676726e-07, 'samples': 28096896, 'steps': 146337, 'loss/train': 1.4538809061050415} 11/07/2021 17:49:58 - INFO - __main__ - Step 146339: {'lr': 7.54925878444862e-07, 'samples': 28097088, 'steps': 146338, 'loss/train': 0.6800181865692139} 11/07/2021 17:49:58 - INFO - __main__ - Step 146340: {'lr': 7.545138397280216e-07, 'samples': 28097280, 'steps': 146339, 'loss/train': 1.5344270467758179} 11/07/2021 17:49:59 - INFO - __main__ - Step 146341: {'lr': 7.541019133173454e-07, 'samples': 28097472, 'steps': 146340, 'loss/train': 1.1105118989944458} 11/07/2021 17:49:59 - INFO - __main__ - Step 146342: {'lr': 7.536900992130003e-07, 'samples': 28097664, 'steps': 146341, 'loss/train': 1.1636055707931519} 11/07/2021 17:50:00 - INFO - __main__ - Step 146343: {'lr': 7.532783974152357e-07, 'samples': 28097856, 'steps': 146342, 'loss/train': 1.5636111497879028} 11/07/2021 17:50:00 - INFO - __main__ - Step 146344: {'lr': 7.528668079241907e-07, 'samples': 28098048, 'steps': 146343, 'loss/train': 1.1135250329971313} 11/07/2021 17:50:01 - INFO - __main__ - Step 146345: {'lr': 7.524553307400317e-07, 'samples': 28098240, 'steps': 146344, 'loss/train': 0.6748407483100891} 11/07/2021 17:50:01 - INFO - __main__ - Step 146346: {'lr': 7.520439658630085e-07, 'samples': 28098432, 'steps': 146345, 'loss/train': 1.3975688219070435} 11/07/2021 17:50:02 - INFO - __main__ - Step 146347: {'lr': 7.516327132932321e-07, 'samples': 28098624, 'steps': 146346, 'loss/train': 1.4656533002853394} 11/07/2021 17:50:02 - INFO - __main__ - Step 146348: {'lr': 7.512215730309524e-07, 'samples': 28098816, 'steps': 146347, 'loss/train': 1.3312050104141235} 11/07/2021 17:50:02 - INFO - __main__ - Step 146349: {'lr': 7.50810545076308e-07, 'samples': 28099008, 'steps': 146348, 'loss/train': 0.38511696457862854} 11/07/2021 17:50:03 - INFO - __main__ - Step 146350: {'lr': 7.503996294294934e-07, 'samples': 28099200, 'steps': 146349, 'loss/train': 1.6326675415039062} 11/07/2021 17:50:04 - INFO - __main__ - Step 146351: {'lr': 7.499888260907306e-07, 'samples': 28099392, 'steps': 146350, 'loss/train': 1.1032212972640991} 11/07/2021 17:50:04 - INFO - __main__ - Step 146352: {'lr': 7.495781350601583e-07, 'samples': 28099584, 'steps': 146351, 'loss/train': 1.1762453317642212} 11/07/2021 17:50:04 - INFO - __main__ - Step 146353: {'lr': 7.491675563379984e-07, 'samples': 28099776, 'steps': 146352, 'loss/train': 1.3303440809249878} 11/07/2021 17:50:05 - INFO - __main__ - Step 146354: {'lr': 7.4875708992439e-07, 'samples': 28099968, 'steps': 146353, 'loss/train': 1.686939001083374} 11/07/2021 17:50:05 - INFO - __main__ - Step 146355: {'lr': 7.48346735819555e-07, 'samples': 28100160, 'steps': 146354, 'loss/train': 1.3172868490219116} 11/07/2021 17:50:06 - INFO - __main__ - Step 146356: {'lr': 7.479364940236877e-07, 'samples': 28100352, 'steps': 146355, 'loss/train': 1.0679187774658203} 11/07/2021 17:50:07 - INFO - __main__ - Step 146357: {'lr': 7.475263645369268e-07, 'samples': 28100544, 'steps': 146356, 'loss/train': 1.1368072032928467} 11/07/2021 17:50:07 - INFO - __main__ - Step 146358: {'lr': 7.471163473594945e-07, 'samples': 28100736, 'steps': 146357, 'loss/train': 1.2893859148025513} 11/07/2021 17:50:07 - INFO - __main__ - Step 146359: {'lr': 7.467064424915848e-07, 'samples': 28100928, 'steps': 146358, 'loss/train': 1.4684889316558838} 11/07/2021 17:50:08 - INFO - __main__ - Step 146360: {'lr': 7.462966499333368e-07, 'samples': 28101120, 'steps': 146359, 'loss/train': 1.4216258525848389} 11/07/2021 17:50:09 - INFO - __main__ - Step 146361: {'lr': 7.458869696849723e-07, 'samples': 28101312, 'steps': 146360, 'loss/train': 1.4054880142211914} 11/07/2021 17:50:09 - INFO - __main__ - Step 146362: {'lr': 7.454774017466581e-07, 'samples': 28101504, 'steps': 146361, 'loss/train': 1.6005665063858032} 11/07/2021 17:50:09 - INFO - __main__ - Step 146363: {'lr': 7.45067946118616e-07, 'samples': 28101696, 'steps': 146362, 'loss/train': 0.6893692016601562} 11/07/2021 17:50:10 - INFO - __main__ - Step 146364: {'lr': 7.446586028009572e-07, 'samples': 28101888, 'steps': 146363, 'loss/train': 1.4915428161621094} 11/07/2021 17:50:10 - INFO - __main__ - Step 146365: {'lr': 7.442493717939313e-07, 'samples': 28102080, 'steps': 146364, 'loss/train': 1.3182346820831299} 11/07/2021 17:50:11 - INFO - __main__ - Step 146366: {'lr': 7.438402530977051e-07, 'samples': 28102272, 'steps': 146365, 'loss/train': 1.3561021089553833} 11/07/2021 17:50:12 - INFO - __main__ - Step 146367: {'lr': 7.434312467124449e-07, 'samples': 28102464, 'steps': 146366, 'loss/train': 1.5879696607589722} 11/07/2021 17:50:12 - INFO - __main__ - Step 146368: {'lr': 7.430223526383451e-07, 'samples': 28102656, 'steps': 146367, 'loss/train': 1.7012312412261963} 11/07/2021 17:50:12 - INFO - __main__ - Step 146369: {'lr': 7.426135708755999e-07, 'samples': 28102848, 'steps': 146368, 'loss/train': 1.0680766105651855} 11/07/2021 17:50:13 - INFO - __main__ - Step 146370: {'lr': 7.42204901424376e-07, 'samples': 28103040, 'steps': 146369, 'loss/train': 1.7127176523208618} 11/07/2021 17:50:13 - INFO - __main__ - Step 146371: {'lr': 7.417963442848952e-07, 'samples': 28103232, 'steps': 146370, 'loss/train': 1.4380724430084229} 11/07/2021 17:50:14 - INFO - __main__ - Step 146372: {'lr': 7.413878994572964e-07, 'samples': 28103424, 'steps': 146371, 'loss/train': 1.262094497680664} 11/07/2021 17:50:14 - INFO - __main__ - Step 146373: {'lr': 7.409795669418018e-07, 'samples': 28103616, 'steps': 146372, 'loss/train': 1.172942042350769} 11/07/2021 17:50:15 - INFO - __main__ - Step 146374: {'lr': 7.4057134673855e-07, 'samples': 28103808, 'steps': 146373, 'loss/train': 1.3709763288497925} 11/07/2021 17:50:15 - INFO - __main__ - Step 146375: {'lr': 7.401632388477631e-07, 'samples': 28104000, 'steps': 146374, 'loss/train': 1.011926293373108} 11/07/2021 17:50:16 - INFO - __main__ - Step 146376: {'lr': 7.397552432696076e-07, 'samples': 28104192, 'steps': 146375, 'loss/train': 0.9390880465507507} 11/07/2021 17:50:17 - INFO - __main__ - Step 146377: {'lr': 7.393473600042777e-07, 'samples': 28104384, 'steps': 146376, 'loss/train': 1.1720232963562012} 11/07/2021 17:50:17 - INFO - __main__ - Step 146378: {'lr': 7.389395890519401e-07, 'samples': 28104576, 'steps': 146377, 'loss/train': 1.5307409763336182} 11/07/2021 17:50:17 - INFO - __main__ - Step 146379: {'lr': 7.385319304127891e-07, 'samples': 28104768, 'steps': 146378, 'loss/train': 1.5535035133361816} 11/07/2021 17:50:18 - INFO - __main__ - Step 146380: {'lr': 7.381243840870189e-07, 'samples': 28104960, 'steps': 146379, 'loss/train': 1.6285094022750854} 11/07/2021 17:50:18 - INFO - __main__ - Step 146381: {'lr': 7.37716950074796e-07, 'samples': 28105152, 'steps': 146380, 'loss/train': 0.6278860569000244} 11/07/2021 17:50:19 - INFO - __main__ - Step 146382: {'lr': 7.373096283763147e-07, 'samples': 28105344, 'steps': 146381, 'loss/train': 1.1294670104980469} 11/07/2021 17:50:19 - INFO - __main__ - Step 146383: {'lr': 7.369024189917695e-07, 'samples': 28105536, 'steps': 146382, 'loss/train': 1.180692195892334} 11/07/2021 17:50:20 - INFO - __main__ - Step 146384: {'lr': 7.364953219213267e-07, 'samples': 28105728, 'steps': 146383, 'loss/train': 1.2467471361160278} 11/07/2021 17:50:20 - INFO - __main__ - Step 146385: {'lr': 7.360883371651528e-07, 'samples': 28105920, 'steps': 146384, 'loss/train': 1.37661874294281} 11/07/2021 17:50:20 - INFO - __main__ - Step 146386: {'lr': 7.3568146472347e-07, 'samples': 28106112, 'steps': 146385, 'loss/train': 1.478738784790039} 11/07/2021 17:50:21 - INFO - __main__ - Step 146387: {'lr': 7.35274704596417e-07, 'samples': 28106304, 'steps': 146386, 'loss/train': 1.2289963960647583} 11/07/2021 17:50:22 - INFO - __main__ - Step 146388: {'lr': 7.348680567842158e-07, 'samples': 28106496, 'steps': 146387, 'loss/train': 1.0449150800704956} 11/07/2021 17:50:22 - INFO - __main__ - Step 146389: {'lr': 7.344615212870609e-07, 'samples': 28106688, 'steps': 146388, 'loss/train': 0.9647455215454102} 11/07/2021 17:50:22 - INFO - __main__ - Step 146390: {'lr': 7.340550981050909e-07, 'samples': 28106880, 'steps': 146389, 'loss/train': 1.4194668531417847} 11/07/2021 17:50:23 - INFO - __main__ - Step 146391: {'lr': 7.336487872385e-07, 'samples': 28107072, 'steps': 146390, 'loss/train': 0.8990105986595154} 11/07/2021 17:50:23 - INFO - __main__ - Step 146392: {'lr': 7.332425886874827e-07, 'samples': 28107264, 'steps': 146391, 'loss/train': 1.1709671020507812} 11/07/2021 17:50:24 - INFO - __main__ - Step 146393: {'lr': 7.328365024522332e-07, 'samples': 28107456, 'steps': 146392, 'loss/train': 1.1447697877883911} 11/07/2021 17:50:25 - INFO - __main__ - Step 146394: {'lr': 7.32430528532918e-07, 'samples': 28107648, 'steps': 146393, 'loss/train': 1.1730620861053467} 11/07/2021 17:50:25 - INFO - __main__ - Step 146395: {'lr': 7.320246669297315e-07, 'samples': 28107840, 'steps': 146394, 'loss/train': 1.3383015394210815} 11/07/2021 17:50:25 - INFO - __main__ - Step 146396: {'lr': 7.3161891764284e-07, 'samples': 28108032, 'steps': 146395, 'loss/train': 1.0057172775268555} 11/07/2021 17:50:26 - INFO - __main__ - Step 146397: {'lr': 7.312132806724103e-07, 'samples': 28108224, 'steps': 146396, 'loss/train': 1.39475417137146} 11/07/2021 17:50:27 - INFO - __main__ - Step 146398: {'lr': 7.308077560186921e-07, 'samples': 28108416, 'steps': 146397, 'loss/train': 1.5170661211013794} 11/07/2021 17:50:27 - INFO - __main__ - Step 146399: {'lr': 7.304023436817964e-07, 'samples': 28108608, 'steps': 146398, 'loss/train': 1.375942349433899} 11/07/2021 17:50:28 - INFO - __main__ - Step 146400: {'lr': 7.299970436619452e-07, 'samples': 28108800, 'steps': 146399, 'loss/train': 1.323119044303894} 11/07/2021 17:50:28 - INFO - __main__ - Step 146401: {'lr': 7.295918559593051e-07, 'samples': 28108992, 'steps': 146400, 'loss/train': 0.9771812558174133} 11/07/2021 17:50:28 - INFO - __main__ - Step 146402: {'lr': 7.291867805740704e-07, 'samples': 28109184, 'steps': 146401, 'loss/train': 1.3244755268096924} 11/07/2021 17:50:29 - INFO - __main__ - Step 146403: {'lr': 7.287818175064076e-07, 'samples': 28109376, 'steps': 146402, 'loss/train': 1.3388029336929321} 11/07/2021 17:50:30 - INFO - __main__ - Step 146404: {'lr': 7.28376966756511e-07, 'samples': 28109568, 'steps': 146403, 'loss/train': 1.3059757947921753} 11/07/2021 17:50:30 - INFO - __main__ - Step 146405: {'lr': 7.279722283245749e-07, 'samples': 28109760, 'steps': 146404, 'loss/train': 1.8430719375610352} 11/07/2021 17:50:30 - INFO - __main__ - Step 146406: {'lr': 7.275676022107658e-07, 'samples': 28109952, 'steps': 146405, 'loss/train': 1.4322786331176758} 11/07/2021 17:50:31 - INFO - __main__ - Step 146407: {'lr': 7.271630884152502e-07, 'samples': 28110144, 'steps': 146406, 'loss/train': 1.394721508026123} 11/07/2021 17:50:32 - INFO - __main__ - Step 146408: {'lr': 7.267586869382503e-07, 'samples': 28110336, 'steps': 146407, 'loss/train': 1.6451655626296997} 11/07/2021 17:50:32 - INFO - __main__ - Step 146409: {'lr': 7.263543977799047e-07, 'samples': 28110528, 'steps': 146408, 'loss/train': 1.2769372463226318} 11/07/2021 17:50:33 - INFO - __main__ - Step 146410: {'lr': 7.259502209404357e-07, 'samples': 28110720, 'steps': 146409, 'loss/train': 0.6780509352684021} 11/07/2021 17:50:33 - INFO - __main__ - Step 146411: {'lr': 7.255461564200095e-07, 'samples': 28110912, 'steps': 146410, 'loss/train': 1.1614313125610352} 11/07/2021 17:50:33 - INFO - __main__ - Step 146412: {'lr': 7.251422042187927e-07, 'samples': 28111104, 'steps': 146411, 'loss/train': 0.9939290881156921} 11/07/2021 17:50:34 - INFO - __main__ - Step 146413: {'lr': 7.247383643369798e-07, 'samples': 28111296, 'steps': 146412, 'loss/train': 1.8252004384994507} 11/07/2021 17:50:35 - INFO - __main__ - Step 146414: {'lr': 7.24334636774765e-07, 'samples': 28111488, 'steps': 146413, 'loss/train': 1.4006531238555908} 11/07/2021 17:50:35 - INFO - __main__ - Step 146415: {'lr': 7.239310215323147e-07, 'samples': 28111680, 'steps': 146414, 'loss/train': 1.2762254476547241} 11/07/2021 17:50:35 - INFO - __main__ - Step 146416: {'lr': 7.235275186097956e-07, 'samples': 28111872, 'steps': 146415, 'loss/train': 1.16228187084198} 11/07/2021 17:50:36 - INFO - __main__ - Step 146417: {'lr': 7.231241280074296e-07, 'samples': 28112064, 'steps': 146416, 'loss/train': 1.2696398496627808} 11/07/2021 17:50:37 - INFO - __main__ - Step 146418: {'lr': 7.227208497253835e-07, 'samples': 28112256, 'steps': 146417, 'loss/train': 0.1803615540266037} 11/07/2021 17:50:37 - INFO - __main__ - Step 146419: {'lr': 7.223176837638234e-07, 'samples': 28112448, 'steps': 146418, 'loss/train': 0.463271826505661} 11/07/2021 17:50:38 - INFO - __main__ - Step 146420: {'lr': 7.21914630122944e-07, 'samples': 28112640, 'steps': 146419, 'loss/train': 1.0304136276245117} 11/07/2021 17:50:38 - INFO - __main__ - Step 146421: {'lr': 7.215116888029117e-07, 'samples': 28112832, 'steps': 146420, 'loss/train': 1.3080418109893799} 11/07/2021 17:50:38 - INFO - __main__ - Step 146422: {'lr': 7.211088598039206e-07, 'samples': 28113024, 'steps': 146421, 'loss/train': 1.8488786220550537} 11/07/2021 17:50:39 - INFO - __main__ - Step 146423: {'lr': 7.207061431261652e-07, 'samples': 28113216, 'steps': 146422, 'loss/train': 1.1531428098678589} 11/07/2021 17:50:40 - INFO - __main__ - Step 146424: {'lr': 7.203035387697843e-07, 'samples': 28113408, 'steps': 146423, 'loss/train': 1.2381659746170044} 11/07/2021 17:50:40 - INFO - __main__ - Step 146425: {'lr': 7.199010467349998e-07, 'samples': 28113600, 'steps': 146424, 'loss/train': 1.357361078262329} 11/07/2021 17:50:41 - INFO - __main__ - Step 146426: {'lr': 7.19498667022006e-07, 'samples': 28113792, 'steps': 146425, 'loss/train': 1.1710014343261719} 11/07/2021 17:50:41 - INFO - __main__ - Step 146427: {'lr': 7.190963996309419e-07, 'samples': 28113984, 'steps': 146426, 'loss/train': 1.2681288719177246} 11/07/2021 17:50:42 - INFO - __main__ - Step 146428: {'lr': 7.186942445620015e-07, 'samples': 28114176, 'steps': 146427, 'loss/train': 1.1391468048095703} 11/07/2021 17:50:42 - INFO - __main__ - Step 146429: {'lr': 7.182922018153793e-07, 'samples': 28114368, 'steps': 146428, 'loss/train': 1.3392398357391357} 11/07/2021 17:50:43 - INFO - __main__ - Step 146430: {'lr': 7.178902713912417e-07, 'samples': 28114560, 'steps': 146429, 'loss/train': 1.6107426881790161} 11/07/2021 17:50:43 - INFO - __main__ - Step 146431: {'lr': 7.174884532897829e-07, 'samples': 28114752, 'steps': 146430, 'loss/train': 1.3884466886520386} 11/07/2021 17:50:43 - INFO - __main__ - Step 146432: {'lr': 7.170867475111697e-07, 'samples': 28114944, 'steps': 146431, 'loss/train': 1.7815546989440918} 11/07/2021 17:50:44 - INFO - __main__ - Step 146433: {'lr': 7.166851540555963e-07, 'samples': 28115136, 'steps': 146432, 'loss/train': 1.7199666500091553} 11/07/2021 17:50:45 - INFO - __main__ - Step 146434: {'lr': 7.162836729232292e-07, 'samples': 28115328, 'steps': 146433, 'loss/train': 1.3481920957565308} 11/07/2021 17:50:45 - INFO - __main__ - Step 146435: {'lr': 7.158823041142626e-07, 'samples': 28115520, 'steps': 146434, 'loss/train': 0.7098181247711182} 11/07/2021 17:50:45 - INFO - __main__ - Step 146436: {'lr': 7.154810476288909e-07, 'samples': 28115712, 'steps': 146435, 'loss/train': 1.5180772542953491} 11/07/2021 17:50:46 - INFO - __main__ - Step 146437: {'lr': 7.150799034672528e-07, 'samples': 28115904, 'steps': 146436, 'loss/train': 0.9247349500656128} 11/07/2021 17:50:46 - INFO - __main__ - Step 146438: {'lr': 7.146788716295705e-07, 'samples': 28116096, 'steps': 146437, 'loss/train': 1.3907843828201294} 11/07/2021 17:50:47 - INFO - __main__ - Step 146439: {'lr': 7.142779521159826e-07, 'samples': 28116288, 'steps': 146438, 'loss/train': 1.5526260137557983} 11/07/2021 17:50:48 - INFO - __main__ - Step 146440: {'lr': 7.138771449267112e-07, 'samples': 28116480, 'steps': 146439, 'loss/train': 1.4084274768829346} 11/07/2021 17:50:48 - INFO - __main__ - Step 146441: {'lr': 7.134764500619228e-07, 'samples': 28116672, 'steps': 146440, 'loss/train': 1.4026856422424316} 11/07/2021 17:50:48 - INFO - __main__ - Step 146442: {'lr': 7.130758675217841e-07, 'samples': 28116864, 'steps': 146441, 'loss/train': 0.40443000197410583} 11/07/2021 17:50:49 - INFO - __main__ - Step 146443: {'lr': 7.126753973064892e-07, 'samples': 28117056, 'steps': 146442, 'loss/train': 2.2470545768737793} 11/07/2021 17:50:50 - INFO - __main__ - Step 146444: {'lr': 7.122750394162325e-07, 'samples': 28117248, 'steps': 146443, 'loss/train': 1.2672393321990967} 11/07/2021 17:50:50 - INFO - __main__ - Step 146445: {'lr': 7.118747938511528e-07, 'samples': 28117440, 'steps': 146444, 'loss/train': 1.3587265014648438} 11/07/2021 17:50:50 - INFO - __main__ - Step 146446: {'lr': 7.114746606114719e-07, 'samples': 28117632, 'steps': 146445, 'loss/train': 1.418546438217163} 11/07/2021 17:50:51 - INFO - __main__ - Step 146447: {'lr': 7.110746396973567e-07, 'samples': 28117824, 'steps': 146446, 'loss/train': 1.2188115119934082} 11/07/2021 17:50:51 - INFO - __main__ - Step 146448: {'lr': 7.106747311089734e-07, 'samples': 28118016, 'steps': 146447, 'loss/train': 1.4459643363952637} 11/07/2021 17:50:52 - INFO - __main__ - Step 146449: {'lr': 7.102749348465165e-07, 'samples': 28118208, 'steps': 146448, 'loss/train': 0.9352024793624878} 11/07/2021 17:50:53 - INFO - __main__ - Step 146450: {'lr': 7.098752509101803e-07, 'samples': 28118400, 'steps': 146449, 'loss/train': 1.4145394563674927} 11/07/2021 17:50:53 - INFO - __main__ - Step 146451: {'lr': 7.094756793001034e-07, 'samples': 28118592, 'steps': 146450, 'loss/train': 1.2645459175109863} 11/07/2021 17:50:53 - INFO - __main__ - Step 146452: {'lr': 7.09076220016508e-07, 'samples': 28118784, 'steps': 146451, 'loss/train': 1.216365098953247} 11/07/2021 17:50:54 - INFO - __main__ - Step 146453: {'lr': 7.086768730595328e-07, 'samples': 28118976, 'steps': 146452, 'loss/train': 1.7961347103118896} 11/07/2021 17:50:55 - INFO - __main__ - Step 146454: {'lr': 7.082776384293998e-07, 'samples': 28119168, 'steps': 146453, 'loss/train': 1.6625115871429443} 11/07/2021 17:50:55 - INFO - __main__ - Step 146455: {'lr': 7.07878516126248e-07, 'samples': 28119360, 'steps': 146454, 'loss/train': 1.223948359489441} 11/07/2021 17:50:56 - INFO - __main__ - Step 146456: {'lr': 7.074795061502992e-07, 'samples': 28119552, 'steps': 146455, 'loss/train': 1.2673313617706299} 11/07/2021 17:50:56 - INFO - __main__ - Step 146457: {'lr': 7.070806085017201e-07, 'samples': 28119744, 'steps': 146456, 'loss/train': 1.258002758026123} 11/07/2021 17:50:56 - INFO - __main__ - Step 146458: {'lr': 7.066818231806771e-07, 'samples': 28119936, 'steps': 146457, 'loss/train': 0.9005180597305298} 11/07/2021 17:50:57 - INFO - __main__ - Step 146459: {'lr': 7.062831501873368e-07, 'samples': 28120128, 'steps': 146458, 'loss/train': 1.1366057395935059} 11/07/2021 17:50:58 - INFO - __main__ - Step 146460: {'lr': 7.058845895219213e-07, 'samples': 28120320, 'steps': 146459, 'loss/train': 1.896572232246399} 11/07/2021 17:50:58 - INFO - __main__ - Step 146461: {'lr': 7.054861411845969e-07, 'samples': 28120512, 'steps': 146460, 'loss/train': 1.048567295074463} 11/07/2021 17:50:59 - INFO - __main__ - Step 146462: {'lr': 7.050878051755027e-07, 'samples': 28120704, 'steps': 146461, 'loss/train': 0.3427963852882385} 11/07/2021 17:50:59 - INFO - __main__ - Step 146463: {'lr': 7.046895814948606e-07, 'samples': 28120896, 'steps': 146462, 'loss/train': 1.4177519083023071} 11/07/2021 17:50:59 - INFO - __main__ - Step 146464: {'lr': 7.042914701428371e-07, 'samples': 28121088, 'steps': 146463, 'loss/train': 1.2987785339355469} 11/07/2021 17:51:01 - INFO - __main__ - Step 146465: {'lr': 7.038934711196265e-07, 'samples': 28121280, 'steps': 146464, 'loss/train': 1.1585484743118286} 11/07/2021 17:51:01 - INFO - __main__ - Step 146466: {'lr': 7.034955844253954e-07, 'samples': 28121472, 'steps': 146465, 'loss/train': 1.4314810037612915} 11/07/2021 17:51:01 - INFO - __main__ - Step 146467: {'lr': 7.030978100603102e-07, 'samples': 28121664, 'steps': 146466, 'loss/train': 1.3389493227005005} 11/07/2021 17:51:02 - INFO - __main__ - Step 146468: {'lr': 7.027001480245654e-07, 'samples': 28121856, 'steps': 146467, 'loss/train': 1.0565518140792847} 11/07/2021 17:51:02 - INFO - __main__ - Step 146469: {'lr': 7.023025983183273e-07, 'samples': 28122048, 'steps': 146468, 'loss/train': 1.3792251348495483} 11/07/2021 17:51:03 - INFO - __main__ - Step 146470: {'lr': 7.019051609417904e-07, 'samples': 28122240, 'steps': 146469, 'loss/train': 1.0230275392532349} 11/07/2021 17:51:03 - INFO - __main__ - Step 146471: {'lr': 7.01507835895121e-07, 'samples': 28122432, 'steps': 146470, 'loss/train': 1.4162850379943848} 11/07/2021 17:51:04 - INFO - __main__ - Step 146472: {'lr': 7.011106231785414e-07, 'samples': 28122624, 'steps': 146471, 'loss/train': 0.8872507214546204} 11/07/2021 17:51:04 - INFO - __main__ - Step 146473: {'lr': 7.007135227921624e-07, 'samples': 28122816, 'steps': 146472, 'loss/train': 1.3156135082244873} 11/07/2021 17:51:04 - INFO - __main__ - Step 146474: {'lr': 7.003165347362061e-07, 'samples': 28123008, 'steps': 146473, 'loss/train': 1.5330390930175781} 11/07/2021 17:51:05 - INFO - __main__ - Step 146475: {'lr': 6.999196590108392e-07, 'samples': 28123200, 'steps': 146474, 'loss/train': 1.2322379350662231} 11/07/2021 17:51:06 - INFO - __main__ - Step 146476: {'lr': 6.995228956162281e-07, 'samples': 28123392, 'steps': 146475, 'loss/train': 0.6117074489593506} 11/07/2021 17:51:06 - INFO - __main__ - Step 146477: {'lr': 6.99126244552567e-07, 'samples': 28123584, 'steps': 146476, 'loss/train': 1.1660898923873901} 11/07/2021 17:51:06 - INFO - __main__ - Step 146478: {'lr': 6.987297058200503e-07, 'samples': 28123776, 'steps': 146477, 'loss/train': 1.8311138153076172} 11/07/2021 17:51:07 - INFO - __main__ - Step 146479: {'lr': 6.983332794188169e-07, 'samples': 28123968, 'steps': 146478, 'loss/train': 1.5197443962097168} 11/07/2021 17:51:08 - INFO - __main__ - Step 146480: {'lr': 6.979369653490886e-07, 'samples': 28124160, 'steps': 146479, 'loss/train': 1.2623257637023926} 11/07/2021 17:51:08 - INFO - __main__ - Step 146481: {'lr': 6.975407636110043e-07, 'samples': 28124352, 'steps': 146480, 'loss/train': 1.6677881479263306} 11/07/2021 17:51:09 - INFO - __main__ - Step 146482: {'lr': 6.97144674204786e-07, 'samples': 28124544, 'steps': 146481, 'loss/train': 1.5056467056274414} 11/07/2021 17:51:09 - INFO - __main__ - Step 146483: {'lr': 6.967486971305725e-07, 'samples': 28124736, 'steps': 146482, 'loss/train': 1.2752050161361694} 11/07/2021 17:51:09 - INFO - __main__ - Step 146484: {'lr': 6.963528323885304e-07, 'samples': 28124928, 'steps': 146483, 'loss/train': 1.3645501136779785} 11/07/2021 17:51:10 - INFO - __main__ - Step 146485: {'lr': 6.959570799789095e-07, 'samples': 28125120, 'steps': 146484, 'loss/train': 1.329735279083252} 11/07/2021 17:51:11 - INFO - __main__ - Step 146486: {'lr': 6.955614399018207e-07, 'samples': 28125312, 'steps': 146485, 'loss/train': 1.47622811794281} 11/07/2021 17:51:11 - INFO - __main__ - Step 146487: {'lr': 6.95165912157486e-07, 'samples': 28125504, 'steps': 146486, 'loss/train': 1.4646002054214478} 11/07/2021 17:51:11 - INFO - __main__ - Step 146488: {'lr': 6.947704967460444e-07, 'samples': 28125696, 'steps': 146487, 'loss/train': 1.462740421295166} 11/07/2021 17:51:12 - INFO - __main__ - Step 146489: {'lr': 6.943751936676901e-07, 'samples': 28125888, 'steps': 146488, 'loss/train': 0.5325272083282471} 11/07/2021 17:51:12 - INFO - __main__ - Step 146490: {'lr': 6.939800029225896e-07, 'samples': 28126080, 'steps': 146489, 'loss/train': 1.3243519067764282} 11/07/2021 17:51:13 - INFO - __main__ - Step 146491: {'lr': 6.935849245109649e-07, 'samples': 28126272, 'steps': 146490, 'loss/train': 1.3137344121932983} 11/07/2021 17:51:14 - INFO - __main__ - Step 146492: {'lr': 6.931899584329548e-07, 'samples': 28126464, 'steps': 146491, 'loss/train': 1.376267433166504} 11/07/2021 17:51:14 - INFO - __main__ - Step 146493: {'lr': 6.92795104688726e-07, 'samples': 28126656, 'steps': 146492, 'loss/train': 1.0459641218185425} 11/07/2021 17:51:14 - INFO - __main__ - Step 146494: {'lr': 6.924003632785003e-07, 'samples': 28126848, 'steps': 146493, 'loss/train': 1.4095381498336792} 11/07/2021 17:51:15 - INFO - __main__ - Step 146495: {'lr': 6.920057342024167e-07, 'samples': 28127040, 'steps': 146494, 'loss/train': 1.225692868232727} 11/07/2021 17:51:16 - INFO - __main__ - Step 146496: {'lr': 6.916112174606692e-07, 'samples': 28127232, 'steps': 146495, 'loss/train': 1.7367875576019287} 11/07/2021 17:51:16 - INFO - __main__ - Step 146497: {'lr': 6.912168130534524e-07, 'samples': 28127424, 'steps': 146496, 'loss/train': 0.8313879370689392} 11/07/2021 17:51:16 - INFO - __main__ - Step 146498: {'lr': 6.908225209809049e-07, 'samples': 28127616, 'steps': 146497, 'loss/train': 1.4379372596740723} 11/07/2021 17:51:17 - INFO - __main__ - Step 146499: {'lr': 6.904283412432488e-07, 'samples': 28127808, 'steps': 146498, 'loss/train': 1.2868155241012573} 11/07/2021 17:51:17 - INFO - __main__ - Step 146500: {'lr': 6.900342738406229e-07, 'samples': 28128000, 'steps': 146499, 'loss/train': 1.1224861145019531} 11/07/2021 17:51:18 - INFO - __main__ - Step 146501: {'lr': 6.896403187732214e-07, 'samples': 28128192, 'steps': 146500, 'loss/train': 1.5902893543243408} 11/07/2021 17:51:18 - INFO - __main__ - Step 146502: {'lr': 6.892464760412387e-07, 'samples': 28128384, 'steps': 146501, 'loss/train': 0.9494122862815857} 11/07/2021 17:51:19 - INFO - __main__ - Step 146503: {'lr': 6.888527456448134e-07, 'samples': 28128576, 'steps': 146502, 'loss/train': 1.3156263828277588} 11/07/2021 17:51:19 - INFO - __main__ - Step 146504: {'lr': 6.884591275841401e-07, 'samples': 28128768, 'steps': 146503, 'loss/train': 1.2371079921722412} 11/07/2021 17:51:19 - INFO - __main__ - Step 146505: {'lr': 6.880656218594128e-07, 'samples': 28128960, 'steps': 146504, 'loss/train': 1.4990451335906982} 11/07/2021 17:51:21 - INFO - __main__ - Step 146506: {'lr': 6.876722284707981e-07, 'samples': 28129152, 'steps': 146505, 'loss/train': 1.4897229671478271} 11/07/2021 17:51:21 - INFO - __main__ - Step 146507: {'lr': 6.872789474184627e-07, 'samples': 28129344, 'steps': 146506, 'loss/train': 1.44149649143219} 11/07/2021 17:51:21 - INFO - __main__ - Step 146508: {'lr': 6.868857787026006e-07, 'samples': 28129536, 'steps': 146507, 'loss/train': 0.8583541512489319} 11/07/2021 17:51:22 - INFO - __main__ - Step 146509: {'lr': 6.864927223233785e-07, 'samples': 28129728, 'steps': 146508, 'loss/train': 1.3180333375930786} 11/07/2021 17:51:22 - INFO - __main__ - Step 146510: {'lr': 6.860997782809631e-07, 'samples': 28129920, 'steps': 146509, 'loss/train': 1.0351635217666626} 11/07/2021 17:51:22 - INFO - __main__ - Step 146511: {'lr': 6.857069465755484e-07, 'samples': 28130112, 'steps': 146510, 'loss/train': 1.4182780981063843} 11/07/2021 17:51:24 - INFO - __main__ - Step 146512: {'lr': 6.85314227207301e-07, 'samples': 28130304, 'steps': 146511, 'loss/train': 1.5081437826156616} 11/07/2021 17:51:24 - INFO - __main__ - Step 146513: {'lr': 6.84921620176443e-07, 'samples': 28130496, 'steps': 146512, 'loss/train': 1.114930510520935} 11/07/2021 17:51:24 - INFO - __main__ - Step 146514: {'lr': 6.845291254830854e-07, 'samples': 28130688, 'steps': 146513, 'loss/train': 1.2187343835830688} 11/07/2021 17:51:25 - INFO - __main__ - Step 146515: {'lr': 6.841367431274226e-07, 'samples': 28130880, 'steps': 146514, 'loss/train': 1.790961503982544} 11/07/2021 17:51:25 - INFO - __main__ - Step 146516: {'lr': 6.837444731096487e-07, 'samples': 28131072, 'steps': 146515, 'loss/train': 1.283650279045105} 11/07/2021 17:51:26 - INFO - __main__ - Step 146517: {'lr': 6.833523154299303e-07, 'samples': 28131264, 'steps': 146516, 'loss/train': 0.855334997177124} 11/07/2021 17:51:26 - INFO - __main__ - Step 146518: {'lr': 6.829602700884341e-07, 'samples': 28131456, 'steps': 146517, 'loss/train': 0.3411705195903778} 11/07/2021 17:51:27 - INFO - __main__ - Step 146519: {'lr': 6.825683370853819e-07, 'samples': 28131648, 'steps': 146518, 'loss/train': 1.4660303592681885} 11/07/2021 17:51:27 - INFO - __main__ - Step 146520: {'lr': 6.821765164208849e-07, 'samples': 28131840, 'steps': 146519, 'loss/train': 0.968801736831665} 11/07/2021 17:51:28 - INFO - __main__ - Step 146521: {'lr': 6.817848080951649e-07, 'samples': 28132032, 'steps': 146520, 'loss/train': 1.3674505949020386} 11/07/2021 17:51:29 - INFO - __main__ - Step 146522: {'lr': 6.813932121083888e-07, 'samples': 28132224, 'steps': 146521, 'loss/train': 1.3052538633346558} 11/07/2021 17:51:29 - INFO - __main__ - Step 146523: {'lr': 6.810017284607229e-07, 'samples': 28132416, 'steps': 146522, 'loss/train': 1.1021018028259277} 11/07/2021 17:51:29 - INFO - __main__ - Step 146524: {'lr': 6.806103571523614e-07, 'samples': 28132608, 'steps': 146523, 'loss/train': 1.3480638265609741} 11/07/2021 17:51:30 - INFO - __main__ - Step 146525: {'lr': 6.802190981834433e-07, 'samples': 28132800, 'steps': 146524, 'loss/train': 1.2706499099731445} 11/07/2021 17:51:30 - INFO - __main__ - Step 146526: {'lr': 6.798279515541906e-07, 'samples': 28132992, 'steps': 146525, 'loss/train': 1.0089459419250488} 11/07/2021 17:51:31 - INFO - __main__ - Step 146527: {'lr': 6.794369172647697e-07, 'samples': 28133184, 'steps': 146526, 'loss/train': 1.5209262371063232} 11/07/2021 17:51:31 - INFO - __main__ - Step 146528: {'lr': 6.790459953153471e-07, 'samples': 28133376, 'steps': 146527, 'loss/train': 1.8403894901275635} 11/07/2021 17:51:32 - INFO - __main__ - Step 146529: {'lr': 6.786551857060896e-07, 'samples': 28133568, 'steps': 146528, 'loss/train': 1.0925511121749878} 11/07/2021 17:51:32 - INFO - __main__ - Step 146530: {'lr': 6.782644884371636e-07, 'samples': 28133760, 'steps': 146529, 'loss/train': 1.322888970375061} 11/07/2021 17:51:33 - INFO - __main__ - Step 146531: {'lr': 6.778739035087911e-07, 'samples': 28133952, 'steps': 146530, 'loss/train': 1.3540366888046265} 11/07/2021 17:51:34 - INFO - __main__ - Step 146532: {'lr': 6.774834309211109e-07, 'samples': 28134144, 'steps': 146531, 'loss/train': 1.4583511352539062} 11/07/2021 17:51:34 - INFO - __main__ - Step 146533: {'lr': 6.770930706743172e-07, 'samples': 28134336, 'steps': 146532, 'loss/train': 0.9683578014373779} 11/07/2021 17:51:34 - INFO - __main__ - Step 146534: {'lr': 6.767028227685767e-07, 'samples': 28134528, 'steps': 146533, 'loss/train': 1.3723891973495483} 11/07/2021 17:51:35 - INFO - __main__ - Step 146535: {'lr': 6.763126872040559e-07, 'samples': 28134720, 'steps': 146534, 'loss/train': 1.2846029996871948} 11/07/2021 17:51:35 - INFO - __main__ - Step 146536: {'lr': 6.759226639809491e-07, 'samples': 28134912, 'steps': 146535, 'loss/train': 1.1615066528320312} 11/07/2021 17:51:35 - INFO - __main__ - Step 146537: {'lr': 6.755327530994227e-07, 'samples': 28135104, 'steps': 146536, 'loss/train': 1.0858955383300781} 11/07/2021 17:51:36 - INFO - __main__ - Step 146538: {'lr': 6.751429545596432e-07, 'samples': 28135296, 'steps': 146537, 'loss/train': 1.5958383083343506} 11/07/2021 17:51:37 - INFO - __main__ - Step 146539: {'lr': 6.74753268361833e-07, 'samples': 28135488, 'steps': 146538, 'loss/train': 1.4243868589401245} 11/07/2021 17:51:37 - INFO - __main__ - Step 146540: {'lr': 6.743636945061027e-07, 'samples': 28135680, 'steps': 146539, 'loss/train': 1.1847074031829834} 11/07/2021 17:51:37 - INFO - __main__ - Step 146541: {'lr': 6.739742329926468e-07, 'samples': 28135872, 'steps': 146540, 'loss/train': 1.3956842422485352} 11/07/2021 17:51:38 - INFO - __main__ - Step 146542: {'lr': 6.735848838216597e-07, 'samples': 28136064, 'steps': 146541, 'loss/train': 1.2863969802856445} 11/07/2021 17:51:39 - INFO - __main__ - Step 146543: {'lr': 6.731956469933077e-07, 'samples': 28136256, 'steps': 146542, 'loss/train': 1.301048994064331} 11/07/2021 17:51:39 - INFO - __main__ - Step 146544: {'lr': 6.728065225077851e-07, 'samples': 28136448, 'steps': 146543, 'loss/train': 1.3059461116790771} 11/07/2021 17:51:39 - INFO - __main__ - Step 146545: {'lr': 6.724175103652308e-07, 'samples': 28136640, 'steps': 146544, 'loss/train': 1.2146117687225342} 11/07/2021 17:51:40 - INFO - __main__ - Step 146546: {'lr': 6.720286105658391e-07, 'samples': 28136832, 'steps': 146545, 'loss/train': 1.135565996170044} 11/07/2021 17:51:40 - INFO - __main__ - Step 146547: {'lr': 6.716398231098042e-07, 'samples': 28137024, 'steps': 146546, 'loss/train': 1.1799895763397217} 11/07/2021 17:51:41 - INFO - __main__ - Step 146548: {'lr': 6.712511479972372e-07, 'samples': 28137216, 'steps': 146547, 'loss/train': 1.5935441255569458} 11/07/2021 17:51:42 - INFO - __main__ - Step 146549: {'lr': 6.708625852283878e-07, 'samples': 28137408, 'steps': 146548, 'loss/train': 0.8106997013092041} 11/07/2021 17:51:42 - INFO - __main__ - Step 146550: {'lr': 6.704741348033949e-07, 'samples': 28137600, 'steps': 146549, 'loss/train': 1.2537275552749634} 11/07/2021 17:51:42 - INFO - __main__ - Step 146551: {'lr': 6.700857967224527e-07, 'samples': 28137792, 'steps': 146550, 'loss/train': 1.0568063259124756} 11/07/2021 17:51:43 - INFO - __main__ - Step 146552: {'lr': 6.696975709857001e-07, 'samples': 28137984, 'steps': 146551, 'loss/train': 1.2722949981689453} 11/07/2021 17:51:44 - INFO - __main__ - Step 146553: {'lr': 6.69309457593359e-07, 'samples': 28138176, 'steps': 146552, 'loss/train': 1.3945226669311523} 11/07/2021 17:51:44 - INFO - __main__ - Step 146554: {'lr': 6.689214565455682e-07, 'samples': 28138368, 'steps': 146553, 'loss/train': 1.6980239152908325} 11/07/2021 17:51:44 - INFO - __main__ - Step 146555: {'lr': 6.685335678424942e-07, 'samples': 28138560, 'steps': 146554, 'loss/train': 1.6436307430267334} 11/07/2021 17:51:45 - INFO - __main__ - Step 146556: {'lr': 6.681457914843592e-07, 'samples': 28138752, 'steps': 146555, 'loss/train': 2.004333019256592} 11/07/2021 17:51:45 - INFO - __main__ - Step 146557: {'lr': 6.677581274713019e-07, 'samples': 28138944, 'steps': 146556, 'loss/train': 1.6032017469406128} 11/07/2021 17:51:46 - INFO - __main__ - Step 146558: {'lr': 6.673705758034887e-07, 'samples': 28139136, 'steps': 146557, 'loss/train': 1.5520216226577759} 11/07/2021 17:51:47 - INFO - __main__ - Step 146559: {'lr': 6.669831364811419e-07, 'samples': 28139328, 'steps': 146558, 'loss/train': 1.3620461225509644} 11/07/2021 17:51:47 - INFO - __main__ - Step 146560: {'lr': 6.665958095043723e-07, 'samples': 28139520, 'steps': 146559, 'loss/train': 1.493082880973816} 11/07/2021 17:51:48 - INFO - __main__ - Step 146561: {'lr': 6.66208594873402e-07, 'samples': 28139712, 'steps': 146560, 'loss/train': 1.1489208936691284} 11/07/2021 17:51:48 - INFO - __main__ - Step 146562: {'lr': 6.658214925883977e-07, 'samples': 28139904, 'steps': 146561, 'loss/train': 0.6422065496444702} 11/07/2021 17:51:49 - INFO - __main__ - Step 146563: {'lr': 6.654345026495256e-07, 'samples': 28140096, 'steps': 146562, 'loss/train': 0.9447275400161743} 11/07/2021 17:51:49 - INFO - __main__ - Step 146564: {'lr': 6.650476250569526e-07, 'samples': 28140288, 'steps': 146563, 'loss/train': 1.3714622259140015} 11/07/2021 17:51:50 - INFO - __main__ - Step 146565: {'lr': 6.646608598108728e-07, 'samples': 28140480, 'steps': 146564, 'loss/train': 1.368924856185913} 11/07/2021 17:51:50 - INFO - __main__ - Step 146566: {'lr': 6.642742069114249e-07, 'samples': 28140672, 'steps': 146565, 'loss/train': 1.475317358970642} 11/07/2021 17:51:50 - INFO - __main__ - Step 146567: {'lr': 6.638876663588312e-07, 'samples': 28140864, 'steps': 146566, 'loss/train': 1.1492230892181396} 11/07/2021 17:51:51 - INFO - __main__ - Step 146568: {'lr': 6.635012381532302e-07, 'samples': 28141056, 'steps': 146567, 'loss/train': 1.4673337936401367} 11/07/2021 17:51:52 - INFO - __main__ - Step 146569: {'lr': 6.631149222948163e-07, 'samples': 28141248, 'steps': 146568, 'loss/train': 1.418496012687683} 11/07/2021 17:51:52 - INFO - __main__ - Step 146570: {'lr': 6.627287187837561e-07, 'samples': 28141440, 'steps': 146569, 'loss/train': 1.0401324033737183} 11/07/2021 17:51:52 - INFO - __main__ - Step 146571: {'lr': 6.62342627620216e-07, 'samples': 28141632, 'steps': 146570, 'loss/train': 2.0251779556274414} 11/07/2021 17:51:53 - INFO - __main__ - Step 146572: {'lr': 6.619566488043904e-07, 'samples': 28141824, 'steps': 146571, 'loss/train': 1.3106220960617065} 11/07/2021 17:51:53 - INFO - __main__ - Step 146573: {'lr': 6.615707823364181e-07, 'samples': 28142016, 'steps': 146572, 'loss/train': 0.9469976425170898} 11/07/2021 17:51:54 - INFO - __main__ - Step 146574: {'lr': 6.61185028216521e-07, 'samples': 28142208, 'steps': 146573, 'loss/train': 1.1710474491119385} 11/07/2021 17:51:55 - INFO - __main__ - Step 146575: {'lr': 6.607993864448103e-07, 'samples': 28142400, 'steps': 146574, 'loss/train': 1.197298526763916} 11/07/2021 17:51:55 - INFO - __main__ - Step 146576: {'lr': 6.604138570215356e-07, 'samples': 28142592, 'steps': 146575, 'loss/train': 0.5455052256584167} 11/07/2021 17:51:55 - INFO - __main__ - Step 146577: {'lr': 6.600284399468082e-07, 'samples': 28142784, 'steps': 146576, 'loss/train': 1.3579634428024292} 11/07/2021 17:51:56 - INFO - __main__ - Step 146578: {'lr': 6.596431352208221e-07, 'samples': 28142976, 'steps': 146577, 'loss/train': 1.5168520212173462} 11/07/2021 17:51:57 - INFO - __main__ - Step 146579: {'lr': 6.59257942843744e-07, 'samples': 28143168, 'steps': 146578, 'loss/train': 1.1780648231506348} 11/07/2021 17:51:57 - INFO - __main__ - Step 146580: {'lr': 6.588728628157958e-07, 'samples': 28143360, 'steps': 146579, 'loss/train': 1.1524202823638916} 11/07/2021 17:51:57 - INFO - __main__ - Step 146581: {'lr': 6.584878951370887e-07, 'samples': 28143552, 'steps': 146580, 'loss/train': 1.1315969228744507} 11/07/2021 17:51:58 - INFO - __main__ - Step 146582: {'lr': 6.581030398078169e-07, 'samples': 28143744, 'steps': 146581, 'loss/train': 1.2582571506500244} 11/07/2021 17:51:58 - INFO - __main__ - Step 146583: {'lr': 6.57718296828147e-07, 'samples': 28143936, 'steps': 146582, 'loss/train': 5.470465183258057} 11/07/2021 17:51:59 - INFO - __main__ - Step 146584: {'lr': 6.573336661982731e-07, 'samples': 28144128, 'steps': 146583, 'loss/train': 1.1473802328109741} 11/07/2021 17:52:00 - INFO - __main__ - Step 146585: {'lr': 6.56949147918362e-07, 'samples': 28144320, 'steps': 146584, 'loss/train': 1.0327425003051758} 11/07/2021 17:52:00 - INFO - __main__ - Step 146586: {'lr': 6.565647419885801e-07, 'samples': 28144512, 'steps': 146585, 'loss/train': 1.269823670387268} 11/07/2021 17:52:00 - INFO - __main__ - Step 146587: {'lr': 6.561804484090938e-07, 'samples': 28144704, 'steps': 146586, 'loss/train': 1.1371999979019165} 11/07/2021 17:52:01 - INFO - __main__ - Step 146588: {'lr': 6.557962671800977e-07, 'samples': 28144896, 'steps': 146587, 'loss/train': 0.8673091530799866} 11/07/2021 17:52:01 - INFO - __main__ - Step 146589: {'lr': 6.554121983017303e-07, 'samples': 28145088, 'steps': 146588, 'loss/train': 2.1240274906158447} 11/07/2021 17:52:02 - INFO - __main__ - Step 146590: {'lr': 6.550282417742137e-07, 'samples': 28145280, 'steps': 146589, 'loss/train': 1.8866430521011353} 11/07/2021 17:52:03 - INFO - __main__ - Step 146591: {'lr': 6.54644397597659e-07, 'samples': 28145472, 'steps': 146590, 'loss/train': 1.3931753635406494} 11/07/2021 17:52:03 - INFO - __main__ - Step 146592: {'lr': 6.54260665772316e-07, 'samples': 28145664, 'steps': 146591, 'loss/train': 1.1622400283813477} 11/07/2021 17:52:03 - INFO - __main__ - Step 146593: {'lr': 6.538770462982957e-07, 'samples': 28145856, 'steps': 146592, 'loss/train': 1.2656060457229614} 11/07/2021 17:52:04 - INFO - __main__ - Step 146594: {'lr': 6.534935391757923e-07, 'samples': 28146048, 'steps': 146593, 'loss/train': 1.5127053260803223} 11/07/2021 17:52:05 - INFO - __main__ - Step 146595: {'lr': 6.531101444049725e-07, 'samples': 28146240, 'steps': 146594, 'loss/train': 1.4575421810150146} 11/07/2021 17:52:05 - INFO - __main__ - Step 146596: {'lr': 6.527268619860027e-07, 'samples': 28146432, 'steps': 146595, 'loss/train': 1.2874410152435303} 11/07/2021 17:52:05 - INFO - __main__ - Step 146597: {'lr': 6.523436919190773e-07, 'samples': 28146624, 'steps': 146596, 'loss/train': 1.2975621223449707} 11/07/2021 17:52:06 - INFO - __main__ - Step 146598: {'lr': 6.519606342043627e-07, 'samples': 28146816, 'steps': 146597, 'loss/train': 1.2696541547775269} 11/07/2021 17:52:06 - INFO - __main__ - Step 146599: {'lr': 6.515776888420255e-07, 'samples': 28147008, 'steps': 146598, 'loss/train': 1.5339339971542358} 11/07/2021 17:52:07 - INFO - __main__ - Step 146600: {'lr': 6.5119485583226e-07, 'samples': 28147200, 'steps': 146599, 'loss/train': 1.3647359609603882} 11/07/2021 17:52:07 - INFO - __main__ - Step 146601: {'lr': 6.508121351751773e-07, 'samples': 28147392, 'steps': 146600, 'loss/train': 0.9430774450302124} 11/07/2021 17:52:08 - INFO - __main__ - Step 146602: {'lr': 6.50429526871027e-07, 'samples': 28147584, 'steps': 146601, 'loss/train': 1.0907543897628784} 11/07/2021 17:52:08 - INFO - __main__ - Step 146603: {'lr': 6.500470309199202e-07, 'samples': 28147776, 'steps': 146602, 'loss/train': 1.1378040313720703} 11/07/2021 17:52:09 - INFO - __main__ - Step 146604: {'lr': 6.496646473220513e-07, 'samples': 28147968, 'steps': 146603, 'loss/train': 1.3124141693115234} 11/07/2021 17:52:09 - INFO - __main__ - Step 146605: {'lr': 6.492823760776146e-07, 'samples': 28148160, 'steps': 146604, 'loss/train': 1.0519068241119385} 11/07/2021 17:52:10 - INFO - __main__ - Step 146606: {'lr': 6.489002171867764e-07, 'samples': 28148352, 'steps': 146605, 'loss/train': 0.7333872318267822} 11/07/2021 17:52:10 - INFO - __main__ - Step 146607: {'lr': 6.485181706496756e-07, 'samples': 28148544, 'steps': 146606, 'loss/train': 1.5612338781356812} 11/07/2021 17:52:11 - INFO - __main__ - Step 146608: {'lr': 6.481362364665066e-07, 'samples': 28148736, 'steps': 146607, 'loss/train': 1.1409403085708618} 11/07/2021 17:52:11 - INFO - __main__ - Step 146609: {'lr': 6.477544146374359e-07, 'samples': 28148928, 'steps': 146608, 'loss/train': 1.2756296396255493} 11/07/2021 17:52:12 - INFO - __main__ - Step 146610: {'lr': 6.473727051626299e-07, 'samples': 28149120, 'steps': 146609, 'loss/train': 0.7898438572883606} 11/07/2021 17:52:12 - INFO - __main__ - Step 146611: {'lr': 6.46991108042283e-07, 'samples': 28149312, 'steps': 146610, 'loss/train': 0.9127933382987976} 11/07/2021 17:52:13 - INFO - __main__ - Step 146612: {'lr': 6.466096232765617e-07, 'samples': 28149504, 'steps': 146611, 'loss/train': 1.4556015729904175} 11/07/2021 17:52:13 - INFO - __main__ - Step 146613: {'lr': 6.462282508656326e-07, 'samples': 28149696, 'steps': 146612, 'loss/train': 0.9973006844520569} 11/07/2021 17:52:13 - INFO - __main__ - Step 146614: {'lr': 6.458469908096342e-07, 'samples': 28149888, 'steps': 146613, 'loss/train': 1.4963124990463257} 11/07/2021 17:52:14 - INFO - __main__ - Step 146615: {'lr': 6.454658431088167e-07, 'samples': 28150080, 'steps': 146614, 'loss/train': 1.5503125190734863} 11/07/2021 17:52:15 - INFO - __main__ - Step 146616: {'lr': 6.450848077632632e-07, 'samples': 28150272, 'steps': 146615, 'loss/train': 1.3057829141616821} 11/07/2021 17:52:15 - INFO - __main__ - Step 146617: {'lr': 6.447038847731957e-07, 'samples': 28150464, 'steps': 146616, 'loss/train': 1.656814455986023} 11/07/2021 17:52:15 - INFO - __main__ - Step 146618: {'lr': 6.443230741387807e-07, 'samples': 28150656, 'steps': 146617, 'loss/train': 1.4013062715530396} 11/07/2021 17:52:16 - INFO - __main__ - Step 146619: {'lr': 6.439423758601848e-07, 'samples': 28150848, 'steps': 146618, 'loss/train': 1.4090378284454346} 11/07/2021 17:52:16 - INFO - __main__ - Step 146620: {'lr': 6.435617899376023e-07, 'samples': 28151040, 'steps': 146619, 'loss/train': 1.5942587852478027} 11/07/2021 17:52:16 - INFO - __main__ - Dataset epoch: 2 11/07/2021 17:52:17 - INFO - __main__ - Step 146621: {'lr': 6.43181316371172e-07, 'samples': 28151232, 'steps': 146620, 'loss/train': 1.1616575717926025} 11/07/2021 17:52:18 - INFO - __main__ - Step 146622: {'lr': 6.428009551610603e-07, 'samples': 28151424, 'steps': 146621, 'loss/train': 0.9959945678710938} 11/07/2021 17:52:18 - INFO - __main__ - Step 146623: {'lr': 6.424207063074617e-07, 'samples': 28151616, 'steps': 146622, 'loss/train': 1.2618372440338135} 11/07/2021 17:52:18 - INFO - __main__ - Step 146624: {'lr': 6.420405698105425e-07, 'samples': 28151808, 'steps': 146623, 'loss/train': 1.3685191869735718} 11/07/2021 17:52:19 - INFO - __main__ - Step 146625: {'lr': 6.416605456704694e-07, 'samples': 28152000, 'steps': 146624, 'loss/train': 0.8241365551948547} 11/07/2021 17:52:20 - INFO - __main__ - Step 146626: {'lr': 6.412806338874366e-07, 'samples': 28152192, 'steps': 146625, 'loss/train': 1.1720131635665894} 11/07/2021 17:52:20 - INFO - __main__ - Step 146627: {'lr': 6.409008344615553e-07, 'samples': 28152384, 'steps': 146626, 'loss/train': 1.3841174840927124} 11/07/2021 17:52:20 - INFO - __main__ - Step 146628: {'lr': 6.40521147393075e-07, 'samples': 28152576, 'steps': 146627, 'loss/train': 1.4267905950546265} 11/07/2021 17:52:21 - INFO - __main__ - Step 146629: {'lr': 6.401415726821069e-07, 'samples': 28152768, 'steps': 146628, 'loss/train': 0.540405809879303} 11/07/2021 17:52:21 - INFO - __main__ - Step 146630: {'lr': 6.397621103288454e-07, 'samples': 28152960, 'steps': 146629, 'loss/train': 1.2570724487304688} 11/07/2021 17:52:22 - INFO - __main__ - Step 146631: {'lr': 6.393827603334568e-07, 'samples': 28153152, 'steps': 146630, 'loss/train': 1.17487633228302} 11/07/2021 17:52:23 - INFO - __main__ - Step 146632: {'lr': 6.390035226961355e-07, 'samples': 28153344, 'steps': 146631, 'loss/train': 1.2142524719238281} 11/07/2021 17:52:23 - INFO - __main__ - Step 146633: {'lr': 6.386243974170203e-07, 'samples': 28153536, 'steps': 146632, 'loss/train': 1.030953049659729} 11/07/2021 17:52:23 - INFO - __main__ - Step 146634: {'lr': 6.382453844962776e-07, 'samples': 28153728, 'steps': 146633, 'loss/train': 1.743503451347351} 11/07/2021 17:52:24 - INFO - __main__ - Step 146635: {'lr': 6.378664839341019e-07, 'samples': 28153920, 'steps': 146634, 'loss/train': 1.3079156875610352} 11/07/2021 17:52:25 - INFO - __main__ - Step 146636: {'lr': 6.374876957306597e-07, 'samples': 28154112, 'steps': 146635, 'loss/train': 0.8975310325622559} 11/07/2021 17:52:25 - INFO - __main__ - Step 146637: {'lr': 6.371090198861174e-07, 'samples': 28154304, 'steps': 146636, 'loss/train': 1.1207590103149414} 11/07/2021 17:52:25 - INFO - __main__ - Step 146638: {'lr': 6.367304564006415e-07, 'samples': 28154496, 'steps': 146637, 'loss/train': 1.4453999996185303} 11/07/2021 17:52:26 - INFO - __main__ - Step 146639: {'lr': 6.363520052743988e-07, 'samples': 28154688, 'steps': 146638, 'loss/train': 0.43349766731262207} 11/07/2021 17:52:26 - INFO - __main__ - Step 146640: {'lr': 6.359736665075832e-07, 'samples': 28154880, 'steps': 146639, 'loss/train': 1.272512435913086} 11/07/2021 17:52:27 - INFO - __main__ - Step 146641: {'lr': 6.355954401003339e-07, 'samples': 28155072, 'steps': 146640, 'loss/train': 1.0988118648529053} 11/07/2021 17:52:28 - INFO - __main__ - Step 146642: {'lr': 6.352173260528727e-07, 'samples': 28155264, 'steps': 146641, 'loss/train': 1.6329765319824219} 11/07/2021 17:52:28 - INFO - __main__ - Step 146643: {'lr': 6.348393243653106e-07, 'samples': 28155456, 'steps': 146642, 'loss/train': 1.41365647315979} 11/07/2021 17:52:28 - INFO - __main__ - Step 146644: {'lr': 6.344614350378419e-07, 'samples': 28155648, 'steps': 146643, 'loss/train': 1.1121189594268799} 11/07/2021 17:52:29 - INFO - __main__ - Step 146645: {'lr': 6.340836580706332e-07, 'samples': 28155840, 'steps': 146644, 'loss/train': 1.3186314105987549} 11/07/2021 17:52:29 - INFO - __main__ - Step 146646: {'lr': 6.337059934638511e-07, 'samples': 28156032, 'steps': 146645, 'loss/train': 1.141436219215393} 11/07/2021 17:52:30 - INFO - __main__ - Step 146647: {'lr': 6.333284412176899e-07, 'samples': 28156224, 'steps': 146646, 'loss/train': 1.133500337600708} 11/07/2021 17:52:30 - INFO - __main__ - Step 146648: {'lr': 6.329510013322881e-07, 'samples': 28156416, 'steps': 146647, 'loss/train': 1.743545413017273} 11/07/2021 17:52:31 - INFO - __main__ - Step 146649: {'lr': 6.325736738078403e-07, 'samples': 28156608, 'steps': 146648, 'loss/train': 1.4641761779785156} 11/07/2021 17:52:31 - INFO - __main__ - Step 146650: {'lr': 6.321964586445128e-07, 'samples': 28156800, 'steps': 146649, 'loss/train': 1.2153500318527222} 11/07/2021 17:52:31 - INFO - __main__ - Step 146651: {'lr': 6.318193558424722e-07, 'samples': 28156992, 'steps': 146650, 'loss/train': 0.9886782765388489} 11/07/2021 17:52:33 - INFO - __main__ - Step 146652: {'lr': 6.314423654018575e-07, 'samples': 28157184, 'steps': 146651, 'loss/train': 0.888131320476532} 11/07/2021 17:52:33 - INFO - __main__ - Step 146653: {'lr': 6.310654873228905e-07, 'samples': 28157376, 'steps': 146652, 'loss/train': 1.0522269010543823} 11/07/2021 17:52:33 - INFO - __main__ - Step 146654: {'lr': 6.306887216057377e-07, 'samples': 28157568, 'steps': 146653, 'loss/train': 1.4973573684692383} 11/07/2021 17:52:34 - INFO - __main__ - Step 146655: {'lr': 6.303120682505104e-07, 'samples': 28157760, 'steps': 146654, 'loss/train': 0.6226727366447449} 11/07/2021 17:52:34 - INFO - __main__ - Step 146656: {'lr': 6.299355272574303e-07, 'samples': 28157952, 'steps': 146655, 'loss/train': 1.4488803148269653} 11/07/2021 17:52:35 - INFO - __main__ - Step 146657: {'lr': 6.295590986266641e-07, 'samples': 28158144, 'steps': 146656, 'loss/train': 1.5868573188781738} 11/07/2021 17:52:36 - INFO - __main__ - Step 146658: {'lr': 6.291827823583507e-07, 'samples': 28158336, 'steps': 146657, 'loss/train': 1.3764348030090332} 11/07/2021 17:52:36 - INFO - __main__ - Step 146659: {'lr': 6.288065784526842e-07, 'samples': 28158528, 'steps': 146658, 'loss/train': 1.3219412565231323} 11/07/2021 17:52:36 - INFO - __main__ - Step 146660: {'lr': 6.284304869098312e-07, 'samples': 28158720, 'steps': 146659, 'loss/train': 1.302454948425293} 11/07/2021 17:52:37 - INFO - __main__ - Step 146661: {'lr': 6.280545077299582e-07, 'samples': 28158912, 'steps': 146660, 'loss/train': 0.083397775888443} 11/07/2021 17:52:38 - INFO - __main__ - Step 146662: {'lr': 6.276786409132318e-07, 'samples': 28159104, 'steps': 146661, 'loss/train': 1.3191319704055786} 11/07/2021 17:52:38 - INFO - __main__ - Step 146663: {'lr': 6.273028864598462e-07, 'samples': 28159296, 'steps': 146662, 'loss/train': 1.5110759735107422} 11/07/2021 17:52:39 - INFO - __main__ - Step 146664: {'lr': 6.269272443699403e-07, 'samples': 28159488, 'steps': 146663, 'loss/train': 0.9677852988243103} 11/07/2021 17:52:39 - INFO - __main__ - Step 146665: {'lr': 6.265517146436806e-07, 'samples': 28159680, 'steps': 146664, 'loss/train': 1.6516239643096924} 11/07/2021 17:52:39 - INFO - __main__ - Step 146666: {'lr': 6.261762972812612e-07, 'samples': 28159872, 'steps': 146665, 'loss/train': 1.4757194519042969} 11/07/2021 17:52:40 - INFO - __main__ - Step 146667: {'lr': 6.25800992282849e-07, 'samples': 28160064, 'steps': 146666, 'loss/train': 1.5312283039093018} 11/07/2021 17:52:41 - INFO - __main__ - Step 146668: {'lr': 6.254257996485824e-07, 'samples': 28160256, 'steps': 146667, 'loss/train': 1.230118751525879} 11/07/2021 17:52:41 - INFO - __main__ - Step 146669: {'lr': 6.250507193786558e-07, 'samples': 28160448, 'steps': 146668, 'loss/train': 1.495611310005188} 11/07/2021 17:52:41 - INFO - __main__ - Step 146670: {'lr': 6.24675751473236e-07, 'samples': 28160640, 'steps': 146669, 'loss/train': 1.0049742460250854} 11/07/2021 17:52:42 - INFO - __main__ - Step 146671: {'lr': 6.243008959324892e-07, 'samples': 28160832, 'steps': 146670, 'loss/train': 1.734047770500183} 11/07/2021 17:52:43 - INFO - __main__ - Step 146672: {'lr': 6.239261527565821e-07, 'samples': 28161024, 'steps': 146671, 'loss/train': 1.1058306694030762} 11/07/2021 17:52:43 - INFO - __main__ - Step 146673: {'lr': 6.235515219456811e-07, 'samples': 28161216, 'steps': 146672, 'loss/train': 0.9162260890007019} 11/07/2021 17:52:44 - INFO - __main__ - Step 146674: {'lr': 6.231770034999529e-07, 'samples': 28161408, 'steps': 146673, 'loss/train': 1.0127451419830322} 11/07/2021 17:52:44 - INFO - __main__ - Step 146675: {'lr': 6.228025974195916e-07, 'samples': 28161600, 'steps': 146674, 'loss/train': 0.6057507395744324} 11/07/2021 17:52:44 - INFO - __main__ - Step 146676: {'lr': 6.224283037047362e-07, 'samples': 28161792, 'steps': 146675, 'loss/train': 1.2551430463790894} 11/07/2021 17:52:45 - INFO - __main__ - Step 146677: {'lr': 6.220541223555809e-07, 'samples': 28161984, 'steps': 146676, 'loss/train': 1.3683966398239136} 11/07/2021 17:52:46 - INFO - __main__ - Step 146678: {'lr': 6.216800533722644e-07, 'samples': 28162176, 'steps': 146677, 'loss/train': 1.1841667890548706} 11/07/2021 17:52:46 - INFO - __main__ - Step 146679: {'lr': 6.213060967549811e-07, 'samples': 28162368, 'steps': 146678, 'loss/train': 1.56492280960083} 11/07/2021 17:52:46 - INFO - __main__ - Step 146680: {'lr': 6.209322525038697e-07, 'samples': 28162560, 'steps': 146679, 'loss/train': 1.325766682624817} 11/07/2021 17:52:47 - INFO - __main__ - Step 146681: {'lr': 6.205585206191522e-07, 'samples': 28162752, 'steps': 146680, 'loss/train': 0.8882434368133545} 11/07/2021 17:52:48 - INFO - __main__ - Step 146682: {'lr': 6.201849011009397e-07, 'samples': 28162944, 'steps': 146681, 'loss/train': 1.4790310859680176} 11/07/2021 17:52:48 - INFO - __main__ - Step 146683: {'lr': 6.198113939494265e-07, 'samples': 28163136, 'steps': 146682, 'loss/train': 1.3781639337539673} 11/07/2021 17:52:49 - INFO - __main__ - Step 146684: {'lr': 6.194379991647792e-07, 'samples': 28163328, 'steps': 146683, 'loss/train': 1.2845796346664429} 11/07/2021 17:52:49 - INFO - __main__ - Step 146685: {'lr': 6.190647167471641e-07, 'samples': 28163520, 'steps': 146684, 'loss/train': 1.3962215185165405} 11/07/2021 17:52:49 - INFO - __main__ - Step 146686: {'lr': 6.18691546696748e-07, 'samples': 28163712, 'steps': 146685, 'loss/train': 1.273434042930603} 11/07/2021 17:52:50 - INFO - __main__ - Step 146687: {'lr': 6.183184890136973e-07, 'samples': 28163904, 'steps': 146686, 'loss/train': 1.0163475275039673} 11/07/2021 17:52:51 - INFO - __main__ - Step 146688: {'lr': 6.179455436982062e-07, 'samples': 28164096, 'steps': 146687, 'loss/train': 0.03425811976194382} 11/07/2021 17:52:51 - INFO - __main__ - Step 146689: {'lr': 6.175727107504136e-07, 'samples': 28164288, 'steps': 146688, 'loss/train': 1.215532660484314} 11/07/2021 17:52:52 - INFO - __main__ - Step 146690: {'lr': 6.171999901704861e-07, 'samples': 28164480, 'steps': 146689, 'loss/train': 1.293936014175415} 11/07/2021 17:52:52 - INFO - __main__ - Step 146691: {'lr': 6.168273819585901e-07, 'samples': 28164672, 'steps': 146690, 'loss/train': 0.5122979879379272} 11/07/2021 17:52:52 - INFO - __main__ - Step 146692: {'lr': 6.164548861149199e-07, 'samples': 28164864, 'steps': 146691, 'loss/train': 1.1032264232635498} 11/07/2021 17:52:54 - INFO - __main__ - Step 146693: {'lr': 6.160825026396144e-07, 'samples': 28165056, 'steps': 146692, 'loss/train': 1.6796717643737793} 11/07/2021 17:52:54 - INFO - __main__ - Step 146694: {'lr': 6.157102315328677e-07, 'samples': 28165248, 'steps': 146693, 'loss/train': 0.5389129519462585} 11/07/2021 17:52:54 - INFO - __main__ - Step 146695: {'lr': 6.153380727948188e-07, 'samples': 28165440, 'steps': 146694, 'loss/train': 1.4851346015930176} 11/07/2021 17:52:55 - INFO - __main__ - Step 146696: {'lr': 6.149660264256618e-07, 'samples': 28165632, 'steps': 146695, 'loss/train': 0.9136058688163757} 11/07/2021 17:52:55 - INFO - __main__ - Step 146697: {'lr': 6.145940924255356e-07, 'samples': 28165824, 'steps': 146696, 'loss/train': 1.2302452325820923} 11/07/2021 17:52:55 - INFO - __main__ - Step 146698: {'lr': 6.142222707946343e-07, 'samples': 28166016, 'steps': 146697, 'loss/train': 0.9895080924034119} 11/07/2021 17:52:56 - INFO - __main__ - Step 146699: {'lr': 6.138505615331246e-07, 'samples': 28166208, 'steps': 146698, 'loss/train': 0.9224690198898315} 11/07/2021 17:52:57 - INFO - __main__ - Step 146700: {'lr': 6.134789646411732e-07, 'samples': 28166400, 'steps': 146699, 'loss/train': 1.5684858560562134} 11/07/2021 17:52:57 - INFO - __main__ - Step 146701: {'lr': 6.131074801189185e-07, 'samples': 28166592, 'steps': 146700, 'loss/train': 1.3240717649459839} 11/07/2021 17:52:57 - INFO - __main__ - Step 146702: {'lr': 6.12736107966555e-07, 'samples': 28166784, 'steps': 146701, 'loss/train': 1.5806128978729248} 11/07/2021 17:52:58 - INFO - __main__ - Step 146703: {'lr': 6.123648481842492e-07, 'samples': 28166976, 'steps': 146702, 'loss/train': 0.8701185584068298} 11/07/2021 17:52:59 - INFO - __main__ - Step 146704: {'lr': 6.119937007721677e-07, 'samples': 28167168, 'steps': 146703, 'loss/train': 1.4015125036239624} 11/07/2021 17:52:59 - INFO - __main__ - Step 146705: {'lr': 6.116226657304768e-07, 'samples': 28167360, 'steps': 146704, 'loss/train': 1.3421006202697754} 11/07/2021 17:53:00 - INFO - __main__ - Step 146706: {'lr': 6.112517430593156e-07, 'samples': 28167552, 'steps': 146705, 'loss/train': 1.5122921466827393} 11/07/2021 17:53:00 - INFO - __main__ - Step 146707: {'lr': 6.10880932758906e-07, 'samples': 28167744, 'steps': 146706, 'loss/train': 1.697771430015564} 11/07/2021 17:53:00 - INFO - __main__ - Step 146708: {'lr': 6.10510234829359e-07, 'samples': 28167936, 'steps': 146707, 'loss/train': 1.1211116313934326} 11/07/2021 17:53:01 - INFO - __main__ - Step 146709: {'lr': 6.101396492708966e-07, 'samples': 28168128, 'steps': 146708, 'loss/train': 1.4078854322433472} 11/07/2021 17:53:02 - INFO - __main__ - Step 146710: {'lr': 6.0976917608363e-07, 'samples': 28168320, 'steps': 146709, 'loss/train': 0.990811288356781} 11/07/2021 17:53:02 - INFO - __main__ - Step 146711: {'lr': 6.093988152677532e-07, 'samples': 28168512, 'steps': 146710, 'loss/train': 1.4789716005325317} 11/07/2021 17:53:02 - INFO - __main__ - Step 146712: {'lr': 6.090285668234607e-07, 'samples': 28168704, 'steps': 146711, 'loss/train': 0.7869669795036316} 11/07/2021 17:53:03 - INFO - __main__ - Step 146713: {'lr': 6.086584307508635e-07, 'samples': 28168896, 'steps': 146712, 'loss/train': 1.3763788938522339} 11/07/2021 17:53:04 - INFO - __main__ - Step 146714: {'lr': 6.082884070501838e-07, 'samples': 28169088, 'steps': 146713, 'loss/train': 1.5508655309677124} 11/07/2021 17:53:04 - INFO - __main__ - Step 146715: {'lr': 6.079184957215322e-07, 'samples': 28169280, 'steps': 146714, 'loss/train': 1.039650797843933} 11/07/2021 17:53:05 - INFO - __main__ - Step 146716: {'lr': 6.075486967651312e-07, 'samples': 28169472, 'steps': 146715, 'loss/train': 1.6546306610107422} 11/07/2021 17:53:05 - INFO - __main__ - Step 146717: {'lr': 6.071790101810915e-07, 'samples': 28169664, 'steps': 146716, 'loss/train': 1.4948949813842773} 11/07/2021 17:53:05 - INFO - __main__ - Step 146718: {'lr': 6.068094359696352e-07, 'samples': 28169856, 'steps': 146717, 'loss/train': 1.436495065689087} 11/07/2021 17:53:06 - INFO - __main__ - Step 146719: {'lr': 6.064399741308735e-07, 'samples': 28170048, 'steps': 146718, 'loss/train': 0.9702956080436707} 11/07/2021 17:53:07 - INFO - __main__ - Step 146720: {'lr': 6.060706246650283e-07, 'samples': 28170240, 'steps': 146719, 'loss/train': 1.7641892433166504} 11/07/2021 17:53:07 - INFO - __main__ - Step 146721: {'lr': 6.057013875722106e-07, 'samples': 28170432, 'steps': 146720, 'loss/train': 0.6784911155700684} 11/07/2021 17:53:07 - INFO - __main__ - Step 146722: {'lr': 6.053322628526425e-07, 'samples': 28170624, 'steps': 146721, 'loss/train': 0.46807003021240234} 11/07/2021 17:53:08 - INFO - __main__ - Step 146723: {'lr': 6.049632505064628e-07, 'samples': 28170816, 'steps': 146722, 'loss/train': 1.5101531744003296} 11/07/2021 17:53:09 - INFO - __main__ - Step 146724: {'lr': 6.045943505338103e-07, 'samples': 28171008, 'steps': 146723, 'loss/train': 1.2141295671463013} 11/07/2021 17:53:09 - INFO - __main__ - Step 146725: {'lr': 6.04225562934907e-07, 'samples': 28171200, 'steps': 146724, 'loss/train': 1.1893861293792725} 11/07/2021 17:53:10 - INFO - __main__ - Step 146726: {'lr': 6.038568877098916e-07, 'samples': 28171392, 'steps': 146725, 'loss/train': 0.04426191747188568} 11/07/2021 17:53:10 - INFO - __main__ - Step 146727: {'lr': 6.03488324858903e-07, 'samples': 28171584, 'steps': 146726, 'loss/train': 1.5803025960922241} 11/07/2021 17:53:10 - INFO - __main__ - Step 146728: {'lr': 6.031198743821631e-07, 'samples': 28171776, 'steps': 146727, 'loss/train': 1.0880522727966309} 11/07/2021 17:53:12 - INFO - __main__ - Step 146729: {'lr': 6.027515362798108e-07, 'samples': 28171968, 'steps': 146728, 'loss/train': 1.0487697124481201} 11/07/2021 17:53:12 - INFO - __main__ - Step 146730: {'lr': 6.02383310551985e-07, 'samples': 28172160, 'steps': 146729, 'loss/train': 1.4280568361282349} 11/07/2021 17:53:12 - INFO - __main__ - Step 146731: {'lr': 6.020151971989075e-07, 'samples': 28172352, 'steps': 146730, 'loss/train': 1.1476856470108032} 11/07/2021 17:53:13 - INFO - __main__ - Step 146732: {'lr': 6.016471962206893e-07, 'samples': 28172544, 'steps': 146731, 'loss/train': 1.3462024927139282} 11/07/2021 17:53:13 - INFO - __main__ - Step 146733: {'lr': 6.012793076175249e-07, 'samples': 28172736, 'steps': 146732, 'loss/train': 1.7358193397521973} 11/07/2021 17:53:14 - INFO - __main__ - Step 146734: {'lr': 6.009115313895808e-07, 'samples': 28172928, 'steps': 146733, 'loss/train': 1.4437335729599} 11/07/2021 17:53:14 - INFO - __main__ - Step 146735: {'lr': 6.005438675369956e-07, 'samples': 28173120, 'steps': 146734, 'loss/train': 1.2057336568832397} 11/07/2021 17:53:15 - INFO - __main__ - Step 146736: {'lr': 6.001763160599916e-07, 'samples': 28173312, 'steps': 146735, 'loss/train': 1.5259877443313599} 11/07/2021 17:53:15 - INFO - __main__ - Step 146737: {'lr': 5.998088769586796e-07, 'samples': 28173504, 'steps': 146736, 'loss/train': 0.13445588946342468} 11/07/2021 17:53:15 - INFO - __main__ - Step 146738: {'lr': 5.99441550233254e-07, 'samples': 28173696, 'steps': 146737, 'loss/train': 1.1075413227081299} 11/07/2021 17:53:16 - INFO - __main__ - Step 146739: {'lr': 5.990743358838536e-07, 'samples': 28173888, 'steps': 146738, 'loss/train': 1.6275689601898193} 11/07/2021 17:53:17 - INFO - __main__ - Step 146740: {'lr': 5.987072339106725e-07, 'samples': 28174080, 'steps': 146739, 'loss/train': 0.8556650280952454} 11/07/2021 17:53:17 - INFO - __main__ - Step 146741: {'lr': 5.983402443138774e-07, 'samples': 28174272, 'steps': 146740, 'loss/train': 1.2850843667984009} 11/07/2021 17:53:18 - INFO - __main__ - Step 146742: {'lr': 5.979733670936072e-07, 'samples': 28174464, 'steps': 146741, 'loss/train': 0.8707236647605896} 11/07/2021 17:53:18 - INFO - __main__ - Step 146743: {'lr': 5.976066022500559e-07, 'samples': 28174656, 'steps': 146742, 'loss/train': 1.1972914934158325} 11/07/2021 17:53:18 - INFO - __main__ - Step 146744: {'lr': 5.972399497833625e-07, 'samples': 28174848, 'steps': 146743, 'loss/train': 1.3683645725250244} 11/07/2021 17:53:20 - INFO - __main__ - Step 146745: {'lr': 5.968734096936935e-07, 'samples': 28175040, 'steps': 146744, 'loss/train': 1.518330693244934} 11/07/2021 17:53:20 - INFO - __main__ - Step 146746: {'lr': 5.965069819812429e-07, 'samples': 28175232, 'steps': 146745, 'loss/train': 1.620033621788025} 11/07/2021 17:53:20 - INFO - __main__ - Step 146747: {'lr': 5.9614066664615e-07, 'samples': 28175424, 'steps': 146746, 'loss/train': 1.5442560911178589} 11/07/2021 17:53:21 - INFO - __main__ - Step 146748: {'lr': 5.957744636886087e-07, 'samples': 28175616, 'steps': 146747, 'loss/train': 1.5737494230270386} 11/07/2021 17:53:21 - INFO - __main__ - Step 146749: {'lr': 5.954083731087301e-07, 'samples': 28175808, 'steps': 146748, 'loss/train': 0.08200675249099731} 11/07/2021 17:53:22 - INFO - __main__ - Step 146750: {'lr': 5.950423949067362e-07, 'samples': 28176000, 'steps': 146749, 'loss/train': 1.4428436756134033} 11/07/2021 17:53:23 - INFO - __main__ - Step 146751: {'lr': 5.946765290827383e-07, 'samples': 28176192, 'steps': 146750, 'loss/train': 1.4286121129989624} 11/07/2021 17:53:23 - INFO - __main__ - Step 146752: {'lr': 5.943107756369582e-07, 'samples': 28176384, 'steps': 146751, 'loss/train': 0.5607372522354126} 11/07/2021 17:53:23 - INFO - __main__ - Step 146753: {'lr': 5.939451345695345e-07, 'samples': 28176576, 'steps': 146752, 'loss/train': 1.1196048259735107} 11/07/2021 17:53:24 - INFO - __main__ - Step 146754: {'lr': 5.935796058806064e-07, 'samples': 28176768, 'steps': 146753, 'loss/train': 1.244171380996704} 11/07/2021 17:53:25 - INFO - __main__ - Step 146755: {'lr': 5.932141895703957e-07, 'samples': 28176960, 'steps': 146754, 'loss/train': 1.2110061645507812} 11/07/2021 17:53:25 - INFO - __main__ - Step 146756: {'lr': 5.928488856390136e-07, 'samples': 28177152, 'steps': 146755, 'loss/train': 1.3624240159988403} 11/07/2021 17:53:25 - INFO - __main__ - Step 146757: {'lr': 5.92483694086654e-07, 'samples': 28177344, 'steps': 146756, 'loss/train': 0.9539843797683716} 11/07/2021 17:53:26 - INFO - __main__ - Step 146758: {'lr': 5.921186149134561e-07, 'samples': 28177536, 'steps': 146757, 'loss/train': 1.4150679111480713} 11/07/2021 17:53:26 - INFO - __main__ - Step 146759: {'lr': 5.91753648119614e-07, 'samples': 28177728, 'steps': 146758, 'loss/train': 1.2230783700942993} 11/07/2021 17:53:27 - INFO - __main__ - Step 146760: {'lr': 5.913887937052664e-07, 'samples': 28177920, 'steps': 146759, 'loss/train': 1.1233742237091064} 11/07/2021 17:53:28 - INFO - __main__ - Step 146761: {'lr': 5.910240516706078e-07, 'samples': 28178112, 'steps': 146760, 'loss/train': 1.3428471088409424} 11/07/2021 17:53:28 - INFO - __main__ - Step 146762: {'lr': 5.906594220157768e-07, 'samples': 28178304, 'steps': 146761, 'loss/train': 1.3098169565200806} 11/07/2021 17:53:28 - INFO - __main__ - Step 146763: {'lr': 5.902949047409401e-07, 'samples': 28178496, 'steps': 146762, 'loss/train': 1.3008440732955933} 11/07/2021 17:53:29 - INFO - __main__ - Step 146764: {'lr': 5.899304998462917e-07, 'samples': 28178688, 'steps': 146763, 'loss/train': 1.4143919944763184} 11/07/2021 17:53:30 - INFO - __main__ - Step 146765: {'lr': 5.89566207331943e-07, 'samples': 28178880, 'steps': 146764, 'loss/train': 1.4721200466156006} 11/07/2021 17:53:30 - INFO - __main__ - Step 146766: {'lr': 5.89202027198088e-07, 'samples': 28179072, 'steps': 146765, 'loss/train': 1.216088891029358} 11/07/2021 17:53:30 - INFO - __main__ - Step 146767: {'lr': 5.888379594449211e-07, 'samples': 28179264, 'steps': 146766, 'loss/train': 3.292877674102783} 11/07/2021 17:53:31 - INFO - __main__ - Step 146768: {'lr': 5.884740040725533e-07, 'samples': 28179456, 'steps': 146767, 'loss/train': 1.055941104888916} 11/07/2021 17:53:31 - INFO - __main__ - Step 146769: {'lr': 5.881101610811789e-07, 'samples': 28179648, 'steps': 146768, 'loss/train': 0.921539306640625} 11/07/2021 17:53:32 - INFO - __main__ - Step 146770: {'lr': 5.877464304709367e-07, 'samples': 28179840, 'steps': 146769, 'loss/train': 1.0209541320800781} 11/07/2021 17:53:33 - INFO - __main__ - Step 146771: {'lr': 5.87382812242021e-07, 'samples': 28180032, 'steps': 146770, 'loss/train': 1.3011921644210815} 11/07/2021 17:53:33 - INFO - __main__ - Step 146772: {'lr': 5.870193063945706e-07, 'samples': 28180224, 'steps': 146771, 'loss/train': 1.165719747543335} 11/07/2021 17:53:33 - INFO - __main__ - Step 146773: {'lr': 5.866559129287797e-07, 'samples': 28180416, 'steps': 146772, 'loss/train': 1.3283231258392334} 11/07/2021 17:53:34 - INFO - __main__ - Step 146774: {'lr': 5.862926318447593e-07, 'samples': 28180608, 'steps': 146773, 'loss/train': 1.1759963035583496} 11/07/2021 17:53:35 - INFO - __main__ - Step 146775: {'lr': 5.859294631427314e-07, 'samples': 28180800, 'steps': 146774, 'loss/train': 1.4857419729232788} 11/07/2021 17:53:35 - INFO - __main__ - Step 146776: {'lr': 5.85566406822835e-07, 'samples': 28180992, 'steps': 146775, 'loss/train': 1.3131957054138184} 11/07/2021 17:53:35 - INFO - __main__ - Step 146777: {'lr': 5.852034628852366e-07, 'samples': 28181184, 'steps': 146776, 'loss/train': 1.0588924884796143} 11/07/2021 17:53:36 - INFO - __main__ - Step 146778: {'lr': 5.848406313301025e-07, 'samples': 28181376, 'steps': 146777, 'loss/train': 1.1422573328018188} 11/07/2021 17:53:36 - INFO - __main__ - Step 146779: {'lr': 5.844779121575716e-07, 'samples': 28181568, 'steps': 146778, 'loss/train': 1.368438482284546} 11/07/2021 17:53:37 - INFO - __main__ - Step 146780: {'lr': 5.841153053678383e-07, 'samples': 28181760, 'steps': 146779, 'loss/train': 1.4310704469680786} 11/07/2021 17:53:38 - INFO - __main__ - Step 146781: {'lr': 5.837528109610413e-07, 'samples': 28181952, 'steps': 146780, 'loss/train': 1.1296262741088867} 11/07/2021 17:53:38 - INFO - __main__ - Step 146782: {'lr': 5.833904289373748e-07, 'samples': 28182144, 'steps': 146781, 'loss/train': 1.3865306377410889} 11/07/2021 17:53:38 - INFO - __main__ - Step 146783: {'lr': 5.830281592969777e-07, 'samples': 28182336, 'steps': 146782, 'loss/train': 1.6733745336532593} 11/07/2021 17:53:39 - INFO - __main__ - Step 146784: {'lr': 5.826660020400166e-07, 'samples': 28182528, 'steps': 146783, 'loss/train': 1.4572044610977173} 11/07/2021 17:53:39 - INFO - __main__ - Step 146785: {'lr': 5.823039571666577e-07, 'samples': 28182720, 'steps': 146784, 'loss/train': 1.2333837747573853} 11/07/2021 17:53:40 - INFO - __main__ - Step 146786: {'lr': 5.819420246770402e-07, 'samples': 28182912, 'steps': 146785, 'loss/train': 1.3683466911315918} 11/07/2021 17:53:40 - INFO - __main__ - Step 146787: {'lr': 5.815802045713858e-07, 'samples': 28183104, 'steps': 146786, 'loss/train': 1.2556873559951782} 11/07/2021 17:53:41 - INFO - __main__ - Step 146788: {'lr': 5.812184968498057e-07, 'samples': 28183296, 'steps': 146787, 'loss/train': 1.1307921409606934} 11/07/2021 17:53:41 - INFO - __main__ - Step 146789: {'lr': 5.808569015124943e-07, 'samples': 28183488, 'steps': 146788, 'loss/train': 1.382706642150879} 11/07/2021 17:53:41 - INFO - __main__ - Step 146790: {'lr': 5.804954185595901e-07, 'samples': 28183680, 'steps': 146789, 'loss/train': 0.987138569355011} 11/07/2021 17:53:42 - INFO - __main__ - Step 146791: {'lr': 5.801340479912598e-07, 'samples': 28183872, 'steps': 146790, 'loss/train': 1.3093807697296143} 11/07/2021 17:53:43 - INFO - __main__ - Step 146792: {'lr': 5.797727898076699e-07, 'samples': 28184064, 'steps': 146791, 'loss/train': 1.4384899139404297} 11/07/2021 17:53:43 - INFO - __main__ - Step 146793: {'lr': 5.794116440089869e-07, 'samples': 28184256, 'steps': 146792, 'loss/train': 1.314670205116272} 11/07/2021 17:53:44 - INFO - __main__ - Step 146794: {'lr': 5.790506105953774e-07, 'samples': 28184448, 'steps': 146793, 'loss/train': 1.1912442445755005} 11/07/2021 17:53:44 - INFO - __main__ - Step 146795: {'lr': 5.786896895670079e-07, 'samples': 28184640, 'steps': 146794, 'loss/train': 1.4755327701568604} 11/07/2021 17:53:45 - INFO - __main__ - Step 146796: {'lr': 5.783288809240451e-07, 'samples': 28184832, 'steps': 146795, 'loss/train': 1.0296097993850708} 11/07/2021 17:53:45 - INFO - __main__ - Step 146797: {'lr': 5.779681846665996e-07, 'samples': 28185024, 'steps': 146796, 'loss/train': 1.1586246490478516} 11/07/2021 17:53:46 - INFO - __main__ - Step 146798: {'lr': 5.776076007948939e-07, 'samples': 28185216, 'steps': 146797, 'loss/train': 1.2200140953063965} 11/07/2021 17:53:46 - INFO - __main__ - Step 146799: {'lr': 5.772471293090665e-07, 'samples': 28185408, 'steps': 146798, 'loss/train': 1.466619849205017} 11/07/2021 17:53:46 - INFO - __main__ - Step 146800: {'lr': 5.76886770209284e-07, 'samples': 28185600, 'steps': 146799, 'loss/train': 1.2065421342849731} 11/07/2021 17:53:48 - INFO - __main__ - Step 146801: {'lr': 5.765265234957129e-07, 'samples': 28185792, 'steps': 146800, 'loss/train': 1.2560113668441772} 11/07/2021 17:53:48 - INFO - __main__ - Step 146802: {'lr': 5.76166389168492e-07, 'samples': 28185984, 'steps': 146801, 'loss/train': 0.9893499612808228} 11/07/2021 17:53:48 - INFO - __main__ - Step 146803: {'lr': 5.758063672278157e-07, 'samples': 28186176, 'steps': 146802, 'loss/train': 0.42414283752441406} 11/07/2021 17:53:49 - INFO - __main__ - Step 146804: {'lr': 5.754464576738505e-07, 'samples': 28186368, 'steps': 146803, 'loss/train': 1.0783412456512451} 11/07/2021 17:53:49 - INFO - __main__ - Step 146805: {'lr': 5.750866605067073e-07, 'samples': 28186560, 'steps': 146804, 'loss/train': 1.1648298501968384} 11/07/2021 17:53:50 - INFO - __main__ - Step 146806: {'lr': 5.747269757265805e-07, 'samples': 28186752, 'steps': 146805, 'loss/train': 1.2213908433914185} 11/07/2021 17:53:50 - INFO - __main__ - Step 146807: {'lr': 5.743674033336644e-07, 'samples': 28186944, 'steps': 146806, 'loss/train': 2.695523262023926} 11/07/2021 17:53:51 - INFO - __main__ - Step 146808: {'lr': 5.740079433280698e-07, 'samples': 28187136, 'steps': 146807, 'loss/train': 5.573707103729248} 11/07/2021 17:53:51 - INFO - __main__ - Step 146809: {'lr': 5.736485957099913e-07, 'samples': 28187328, 'steps': 146808, 'loss/train': 1.0898240804672241} 11/07/2021 17:53:52 - INFO - __main__ - Step 146810: {'lr': 5.732893604795675e-07, 'samples': 28187520, 'steps': 146809, 'loss/train': 1.680394172668457} 11/07/2021 17:53:52 - INFO - __main__ - Step 146811: {'lr': 5.729302376369649e-07, 'samples': 28187712, 'steps': 146810, 'loss/train': 1.1100621223449707} 11/07/2021 17:53:53 - INFO - __main__ - Step 146812: {'lr': 5.725712271823503e-07, 'samples': 28187904, 'steps': 146811, 'loss/train': 0.777320146560669} 11/07/2021 17:53:53 - INFO - __main__ - Step 146813: {'lr': 5.722123291158898e-07, 'samples': 28188096, 'steps': 146812, 'loss/train': 1.0581674575805664} 11/07/2021 17:53:54 - INFO - __main__ - Step 146814: {'lr': 5.718535434377503e-07, 'samples': 28188288, 'steps': 146813, 'loss/train': 1.5094060897827148} 11/07/2021 17:53:54 - INFO - __main__ - Step 146815: {'lr': 5.714948701480704e-07, 'samples': 28188480, 'steps': 146814, 'loss/train': 1.1023740768432617} 11/07/2021 17:53:54 - INFO - __main__ - Step 146816: {'lr': 5.711363092470167e-07, 'samples': 28188672, 'steps': 146815, 'loss/train': 1.1732531785964966} 11/07/2021 17:53:55 - INFO - __main__ - Step 146817: {'lr': 5.707778607347836e-07, 'samples': 28188864, 'steps': 146816, 'loss/train': 0.910969078540802} 11/07/2021 17:53:56 - INFO - __main__ - Step 146818: {'lr': 5.704195246115096e-07, 'samples': 28189056, 'steps': 146817, 'loss/train': 1.24733567237854} 11/07/2021 17:53:56 - INFO - __main__ - Step 146819: {'lr': 5.700613008773336e-07, 'samples': 28189248, 'steps': 146818, 'loss/train': 1.2323929071426392} 11/07/2021 17:53:56 - INFO - __main__ - Step 146820: {'lr': 5.697031895324778e-07, 'samples': 28189440, 'steps': 146819, 'loss/train': 0.49412181973457336} 11/07/2021 17:53:57 - INFO - __main__ - Step 146821: {'lr': 5.693451905770252e-07, 'samples': 28189632, 'steps': 146820, 'loss/train': 1.2214395999908447} 11/07/2021 17:53:58 - INFO - __main__ - Step 146822: {'lr': 5.689873040111982e-07, 'samples': 28189824, 'steps': 146821, 'loss/train': 1.3618996143341064} 11/07/2021 17:53:58 - INFO - __main__ - Step 146823: {'lr': 5.686295298351351e-07, 'samples': 28190016, 'steps': 146822, 'loss/train': 1.2849645614624023} 11/07/2021 17:53:59 - INFO - __main__ - Step 146824: {'lr': 5.682718680490029e-07, 'samples': 28190208, 'steps': 146823, 'loss/train': 1.2700403928756714} 11/07/2021 17:53:59 - INFO - __main__ - Step 146825: {'lr': 5.679143186529401e-07, 'samples': 28190400, 'steps': 146824, 'loss/train': 1.14402174949646} 11/07/2021 17:53:59 - INFO - __main__ - Step 146826: {'lr': 5.675568816471411e-07, 'samples': 28190592, 'steps': 146825, 'loss/train': 0.5142641067504883} 11/07/2021 17:54:00 - INFO - __main__ - Step 146827: {'lr': 5.671995570317445e-07, 'samples': 28190784, 'steps': 146826, 'loss/train': 1.2652642726898193} 11/07/2021 17:54:01 - INFO - __main__ - Step 146828: {'lr': 5.668423448069171e-07, 'samples': 28190976, 'steps': 146827, 'loss/train': 1.3843352794647217} 11/07/2021 17:54:01 - INFO - __main__ - Step 146829: {'lr': 5.664852449728253e-07, 'samples': 28191168, 'steps': 146828, 'loss/train': 1.4941140413284302} 11/07/2021 17:54:01 - INFO - __main__ - Step 146830: {'lr': 5.661282575296079e-07, 'samples': 28191360, 'steps': 146829, 'loss/train': 1.413188099861145} 11/07/2021 17:54:02 - INFO - __main__ - Step 146831: {'lr': 5.657713824774869e-07, 'samples': 28191552, 'steps': 146830, 'loss/train': 0.8062659502029419} 11/07/2021 17:54:03 - INFO - __main__ - Step 146832: {'lr': 5.654146198165455e-07, 'samples': 28191744, 'steps': 146831, 'loss/train': 1.2935272455215454} 11/07/2021 17:54:03 - INFO - __main__ - Step 146833: {'lr': 5.650579695469782e-07, 'samples': 28191936, 'steps': 146832, 'loss/train': 1.295685052871704} 11/07/2021 17:54:03 - INFO - __main__ - Step 146834: {'lr': 5.647014316689513e-07, 'samples': 28192128, 'steps': 146833, 'loss/train': 1.075175404548645} 11/07/2021 17:54:04 - INFO - __main__ - Step 146835: {'lr': 5.643450061826594e-07, 'samples': 28192320, 'steps': 146834, 'loss/train': 1.7419426441192627} 11/07/2021 17:54:04 - INFO - __main__ - Step 146836: {'lr': 5.639886930881854e-07, 'samples': 28192512, 'steps': 146835, 'loss/train': 1.156470775604248} 11/07/2021 17:54:05 - INFO - __main__ - Step 146837: {'lr': 5.636324923857239e-07, 'samples': 28192704, 'steps': 146836, 'loss/train': 0.8868529796600342} 11/07/2021 17:54:05 - INFO - __main__ - Step 146838: {'lr': 5.63276404075469e-07, 'samples': 28192896, 'steps': 146837, 'loss/train': 1.362951397895813} 11/07/2021 17:54:06 - INFO - __main__ - Step 146839: {'lr': 5.629204281575317e-07, 'samples': 28193088, 'steps': 146838, 'loss/train': 1.210608959197998} 11/07/2021 17:54:06 - INFO - __main__ - Step 146840: {'lr': 5.625645646321065e-07, 'samples': 28193280, 'steps': 146839, 'loss/train': 1.3519688844680786} 11/07/2021 17:54:07 - INFO - __main__ - Step 146841: {'lr': 5.622088134993319e-07, 'samples': 28193472, 'steps': 146840, 'loss/train': 1.281372308731079} 11/07/2021 17:54:07 - INFO - __main__ - Step 146842: {'lr': 5.618531747593747e-07, 'samples': 28193664, 'steps': 146841, 'loss/train': 1.2072089910507202} 11/07/2021 17:54:08 - INFO - __main__ - Step 146843: {'lr': 5.614976484124013e-07, 'samples': 28193856, 'steps': 146842, 'loss/train': 1.24773371219635} 11/07/2021 17:54:08 - INFO - __main__ - Step 146844: {'lr': 5.611422344585504e-07, 'samples': 28194048, 'steps': 146843, 'loss/train': 1.470170021057129} 11/07/2021 17:54:09 - INFO - __main__ - Step 146845: {'lr': 5.607869328980164e-07, 'samples': 28194240, 'steps': 146844, 'loss/train': 1.2867478132247925} 11/07/2021 17:54:09 - INFO - __main__ - Step 146846: {'lr': 5.604317437309381e-07, 'samples': 28194432, 'steps': 146845, 'loss/train': 1.5668070316314697} 11/07/2021 17:54:09 - INFO - __main__ - Step 146847: {'lr': 5.600766669574819e-07, 'samples': 28194624, 'steps': 146846, 'loss/train': 0.9317459464073181} 11/07/2021 17:54:10 - INFO - __main__ - Step 146848: {'lr': 5.597217025778145e-07, 'samples': 28194816, 'steps': 146847, 'loss/train': 1.0356998443603516} 11/07/2021 17:54:11 - INFO - __main__ - Step 146849: {'lr': 5.593668505921023e-07, 'samples': 28195008, 'steps': 146848, 'loss/train': 1.0791699886322021} 11/07/2021 17:54:11 - INFO - __main__ - Step 146850: {'lr': 5.590121110004565e-07, 'samples': 28195200, 'steps': 146849, 'loss/train': 1.270632028579712} 11/07/2021 17:54:11 - INFO - __main__ - Step 146851: {'lr': 5.586574838030989e-07, 'samples': 28195392, 'steps': 146850, 'loss/train': 1.425949215888977} 11/07/2021 17:54:12 - INFO - __main__ - Step 146852: {'lr': 5.583029690001407e-07, 'samples': 28195584, 'steps': 146851, 'loss/train': 1.4238815307617188} 11/07/2021 17:54:13 - INFO - __main__ - Step 146853: {'lr': 5.579485665917483e-07, 'samples': 28195776, 'steps': 146852, 'loss/train': 1.6891003847122192} 11/07/2021 17:54:13 - INFO - __main__ - Step 146854: {'lr': 5.575942765781161e-07, 'samples': 28195968, 'steps': 146853, 'loss/train': 1.4824167490005493} 11/07/2021 17:54:14 - INFO - __main__ - Step 146855: {'lr': 5.572400989593829e-07, 'samples': 28196160, 'steps': 146854, 'loss/train': 1.2157167196273804} 11/07/2021 17:54:14 - INFO - __main__ - Step 146856: {'lr': 5.568860337357151e-07, 'samples': 28196352, 'steps': 146855, 'loss/train': 1.4287636280059814} 11/07/2021 17:54:14 - INFO - __main__ - Step 146857: {'lr': 5.565320809072516e-07, 'samples': 28196544, 'steps': 146856, 'loss/train': 1.4742841720581055} 11/07/2021 17:54:15 - INFO - __main__ - Step 146858: {'lr': 5.561782404741866e-07, 'samples': 28196736, 'steps': 146857, 'loss/train': 1.2483267784118652} 11/07/2021 17:54:16 - INFO - __main__ - Step 146859: {'lr': 5.558245124366312e-07, 'samples': 28196928, 'steps': 146858, 'loss/train': 1.2895567417144775} 11/07/2021 17:54:16 - INFO - __main__ - Step 146860: {'lr': 5.554708967947797e-07, 'samples': 28197120, 'steps': 146859, 'loss/train': 1.401288628578186} 11/07/2021 17:54:16 - INFO - __main__ - Step 146861: {'lr': 5.551173935487986e-07, 'samples': 28197312, 'steps': 146860, 'loss/train': 1.9497485160827637} 11/07/2021 17:54:17 - INFO - __main__ - Step 146862: {'lr': 5.547640026988266e-07, 'samples': 28197504, 'steps': 146861, 'loss/train': 1.3475432395935059} 11/07/2021 17:54:18 - INFO - __main__ - Step 146863: {'lr': 5.544107242450302e-07, 'samples': 28197696, 'steps': 146862, 'loss/train': 0.912577748298645} 11/07/2021 17:54:18 - INFO - __main__ - Step 146864: {'lr': 5.540575581875485e-07, 'samples': 28197888, 'steps': 146863, 'loss/train': 1.5259106159210205} 11/07/2021 17:54:19 - INFO - __main__ - Step 146865: {'lr': 5.537045045265754e-07, 'samples': 28198080, 'steps': 146864, 'loss/train': 1.3905768394470215} 11/07/2021 17:54:19 - INFO - __main__ - Step 146866: {'lr': 5.533515632622499e-07, 'samples': 28198272, 'steps': 146865, 'loss/train': 1.7337368726730347} 11/07/2021 17:54:20 - INFO - __main__ - Step 146867: {'lr': 5.529987343947384e-07, 'samples': 28198464, 'steps': 146866, 'loss/train': 1.1251068115234375} 11/07/2021 17:54:20 - INFO - __main__ - Step 146868: {'lr': 5.526460179241799e-07, 'samples': 28198656, 'steps': 146867, 'loss/train': 1.2106064558029175} 11/07/2021 17:54:21 - INFO - __main__ - Step 146869: {'lr': 5.522934138507685e-07, 'samples': 28198848, 'steps': 146868, 'loss/train': 1.466456413269043} 11/07/2021 17:54:21 - INFO - __main__ - Step 146870: {'lr': 5.51940922174643e-07, 'samples': 28199040, 'steps': 146869, 'loss/train': 1.2047966718673706} 11/07/2021 17:54:22 - INFO - __main__ - Step 146871: {'lr': 5.515885428959422e-07, 'samples': 28199232, 'steps': 146870, 'loss/train': 1.4824823141098022} 11/07/2021 17:54:22 - INFO - __main__ - Step 146872: {'lr': 5.512362760148603e-07, 'samples': 28199424, 'steps': 146871, 'loss/train': 1.5363768339157104} 11/07/2021 17:54:22 - INFO - __main__ - Step 146873: {'lr': 5.508841215315641e-07, 'samples': 28199616, 'steps': 146872, 'loss/train': 1.0199365615844727} 11/07/2021 17:54:24 - INFO - __main__ - Step 146874: {'lr': 5.505320794461643e-07, 'samples': 28199808, 'steps': 146873, 'loss/train': 1.1554770469665527} 11/07/2021 17:54:24 - INFO - __main__ - Step 146875: {'lr': 5.501801497588555e-07, 'samples': 28200000, 'steps': 146874, 'loss/train': 1.9794623851776123} 11/07/2021 17:54:24 - INFO - __main__ - Step 146876: {'lr': 5.498283324697761e-07, 'samples': 28200192, 'steps': 146875, 'loss/train': 4.725739002227783} 11/07/2021 17:54:25 - INFO - __main__ - Step 146877: {'lr': 5.494766275791208e-07, 'samples': 28200384, 'steps': 146876, 'loss/train': 1.216283917427063} 11/07/2021 17:54:25 - INFO - __main__ - Step 146878: {'lr': 5.491250350870003e-07, 'samples': 28200576, 'steps': 146877, 'loss/train': 0.8560175895690918} 11/07/2021 17:54:26 - INFO - __main__ - Step 146879: {'lr': 5.487735549935813e-07, 'samples': 28200768, 'steps': 146878, 'loss/train': 1.4615312814712524} 11/07/2021 17:54:26 - INFO - __main__ - Step 146880: {'lr': 5.48422187299058e-07, 'samples': 28200960, 'steps': 146879, 'loss/train': 0.9792540073394775} 11/07/2021 17:54:27 - INFO - __main__ - Step 146881: {'lr': 5.480709320035693e-07, 'samples': 28201152, 'steps': 146880, 'loss/train': 1.6916835308074951} 11/07/2021 17:54:27 - INFO - __main__ - Step 146882: {'lr': 5.477197891072538e-07, 'samples': 28201344, 'steps': 146881, 'loss/train': 1.515516996383667} 11/07/2021 17:54:28 - INFO - __main__ - Step 146883: {'lr': 5.47368758610306e-07, 'samples': 28201536, 'steps': 146882, 'loss/train': 0.9215167164802551} 11/07/2021 17:54:28 - INFO - __main__ - Step 146884: {'lr': 5.470178405128368e-07, 'samples': 28201728, 'steps': 146883, 'loss/train': 1.4321978092193604} 11/07/2021 17:54:29 - INFO - __main__ - Step 146885: {'lr': 5.466670348150682e-07, 'samples': 28201920, 'steps': 146884, 'loss/train': 1.2871018648147583} 11/07/2021 17:54:29 - INFO - __main__ - Step 146886: {'lr': 5.463163415170835e-07, 'samples': 28202112, 'steps': 146885, 'loss/train': 1.4114329814910889} 11/07/2021 17:54:30 - INFO - __main__ - Step 146887: {'lr': 5.459657606191049e-07, 'samples': 28202304, 'steps': 146886, 'loss/train': 1.2528204917907715} 11/07/2021 17:54:30 - INFO - __main__ - Step 146888: {'lr': 5.456152921212709e-07, 'samples': 28202496, 'steps': 146887, 'loss/train': 1.519328236579895} 11/07/2021 17:54:30 - INFO - __main__ - Step 146889: {'lr': 5.452649360237205e-07, 'samples': 28202688, 'steps': 146888, 'loss/train': 0.7979539632797241} 11/07/2021 17:54:31 - INFO - __main__ - Step 146890: {'lr': 5.449146923266201e-07, 'samples': 28202880, 'steps': 146889, 'loss/train': 1.2188726663589478} 11/07/2021 17:54:32 - INFO - __main__ - Step 146891: {'lr': 5.445645610301364e-07, 'samples': 28203072, 'steps': 146890, 'loss/train': 1.2581528425216675} 11/07/2021 17:54:32 - INFO - __main__ - Step 146892: {'lr': 5.442145421344358e-07, 'samples': 28203264, 'steps': 146891, 'loss/train': 0.9964261054992676} 11/07/2021 17:54:32 - INFO - __main__ - Step 146893: {'lr': 5.438646356396293e-07, 'samples': 28203456, 'steps': 146892, 'loss/train': 1.1602802276611328} 11/07/2021 17:54:33 - INFO - __main__ - Step 146894: {'lr': 5.43514841545939e-07, 'samples': 28203648, 'steps': 146893, 'loss/train': 0.9495696425437927} 11/07/2021 17:54:34 - INFO - __main__ - Step 146895: {'lr': 5.431651598534759e-07, 'samples': 28203840, 'steps': 146894, 'loss/train': 1.089673399925232} 11/07/2021 17:54:34 - INFO - __main__ - Step 146896: {'lr': 5.428155905624344e-07, 'samples': 28204032, 'steps': 146895, 'loss/train': 1.258792757987976} 11/07/2021 17:54:35 - INFO - __main__ - Step 146897: {'lr': 5.424661336729253e-07, 'samples': 28204224, 'steps': 146896, 'loss/train': 1.4645522832870483} 11/07/2021 17:54:35 - INFO - __main__ - Step 146898: {'lr': 5.421167891851431e-07, 'samples': 28204416, 'steps': 146897, 'loss/train': 1.5376505851745605} 11/07/2021 17:54:35 - INFO - __main__ - Step 146899: {'lr': 5.417675570992264e-07, 'samples': 28204608, 'steps': 146898, 'loss/train': 1.1480213403701782} 11/07/2021 17:54:36 - INFO - __main__ - Step 146900: {'lr': 5.41418437415342e-07, 'samples': 28204800, 'steps': 146899, 'loss/train': 1.282261610031128} 11/07/2021 17:54:37 - INFO - __main__ - Step 146901: {'lr': 5.41069430133656e-07, 'samples': 28204992, 'steps': 146900, 'loss/train': 1.3443889617919922} 11/07/2021 17:54:37 - INFO - __main__ - Step 146902: {'lr': 5.407205352543077e-07, 'samples': 28205184, 'steps': 146901, 'loss/train': 1.0687371492385864} 11/07/2021 17:54:37 - INFO - __main__ - Step 146903: {'lr': 5.403717527774632e-07, 'samples': 28205376, 'steps': 146902, 'loss/train': 1.5261595249176025} 11/07/2021 17:54:38 - INFO - __main__ - Step 146904: {'lr': 5.400230827032615e-07, 'samples': 28205568, 'steps': 146903, 'loss/train': 1.1179457902908325} 11/07/2021 17:54:39 - INFO - __main__ - Step 146905: {'lr': 5.396745250318968e-07, 'samples': 28205760, 'steps': 146904, 'loss/train': 1.2012492418289185} 11/07/2021 17:54:39 - INFO - __main__ - Step 146906: {'lr': 5.39326079763508e-07, 'samples': 28205952, 'steps': 146905, 'loss/train': 0.6399219036102295} 11/07/2021 17:54:39 - INFO - __main__ - Step 146907: {'lr': 5.389777468982338e-07, 'samples': 28206144, 'steps': 146906, 'loss/train': 1.5378706455230713} 11/07/2021 17:54:40 - INFO - __main__ - Step 146908: {'lr': 5.386295264362407e-07, 'samples': 28206336, 'steps': 146907, 'loss/train': 1.0783356428146362} 11/07/2021 17:54:40 - INFO - __main__ - Step 146909: {'lr': 5.382814183777229e-07, 'samples': 28206528, 'steps': 146908, 'loss/train': 1.4766967296600342} 11/07/2021 17:54:40 - INFO - __main__ - Step 146910: {'lr': 5.379334227227639e-07, 'samples': 28206720, 'steps': 146909, 'loss/train': 1.1394184827804565} 11/07/2021 17:54:41 - INFO - __main__ - Step 146911: {'lr': 5.375855394716133e-07, 'samples': 28206912, 'steps': 146910, 'loss/train': 1.6134228706359863} 11/07/2021 17:54:42 - INFO - __main__ - Step 146912: {'lr': 5.372377686243546e-07, 'samples': 28207104, 'steps': 146911, 'loss/train': 1.0940643548965454} 11/07/2021 17:54:42 - INFO - __main__ - Step 146913: {'lr': 5.368901101811541e-07, 'samples': 28207296, 'steps': 146912, 'loss/train': 0.827338695526123} 11/07/2021 17:54:42 - INFO - __main__ - Step 146914: {'lr': 5.365425641421784e-07, 'samples': 28207488, 'steps': 146913, 'loss/train': 0.9580337405204773} 11/07/2021 17:54:43 - INFO - __main__ - Step 146915: {'lr': 5.361951305075941e-07, 'samples': 28207680, 'steps': 146914, 'loss/train': 1.090048909187317} 11/07/2021 17:54:44 - INFO - __main__ - Step 146916: {'lr': 5.358478092775676e-07, 'samples': 28207872, 'steps': 146915, 'loss/train': 1.279392123222351} 11/07/2021 17:54:44 - INFO - __main__ - Step 146917: {'lr': 5.355006004522101e-07, 'samples': 28208064, 'steps': 146916, 'loss/train': 1.384578824043274} 11/07/2021 17:54:45 - INFO - __main__ - Step 146918: {'lr': 5.351535040317435e-07, 'samples': 28208256, 'steps': 146917, 'loss/train': 0.2848527729511261} 11/07/2021 17:54:45 - INFO - __main__ - Step 146919: {'lr': 5.348065200162511e-07, 'samples': 28208448, 'steps': 146918, 'loss/train': 1.3035752773284912} 11/07/2021 17:54:45 - INFO - __main__ - Step 146920: {'lr': 5.344596484059549e-07, 'samples': 28208640, 'steps': 146919, 'loss/train': 1.054231882095337} 11/07/2021 17:54:46 - INFO - __main__ - Step 146921: {'lr': 5.341128892009661e-07, 'samples': 28208832, 'steps': 146920, 'loss/train': 1.317069172859192} 11/07/2021 17:54:47 - INFO - __main__ - Step 146922: {'lr': 5.337662424014511e-07, 'samples': 28209024, 'steps': 146921, 'loss/train': 0.22448651492595673} 11/07/2021 17:54:47 - INFO - __main__ - Step 146923: {'lr': 5.334197080075765e-07, 'samples': 28209216, 'steps': 146922, 'loss/train': 1.1565955877304077} 11/07/2021 17:54:47 - INFO - __main__ - Step 146924: {'lr': 5.330732860195087e-07, 'samples': 28209408, 'steps': 146923, 'loss/train': 0.8049302697181702} 11/07/2021 17:54:48 - INFO - __main__ - Step 146925: {'lr': 5.327269764373588e-07, 'samples': 28209600, 'steps': 146924, 'loss/train': 1.2180676460266113} 11/07/2021 17:54:49 - INFO - __main__ - Step 146926: {'lr': 5.323807792613211e-07, 'samples': 28209792, 'steps': 146925, 'loss/train': 1.2100555896759033} 11/07/2021 17:54:49 - INFO - __main__ - Step 146927: {'lr': 5.320346944915621e-07, 'samples': 28209984, 'steps': 146926, 'loss/train': 1.5688482522964478} 11/07/2021 17:54:50 - INFO - __main__ - Step 146928: {'lr': 5.316887221282208e-07, 'samples': 28210176, 'steps': 146927, 'loss/train': 1.285355567932129} 11/07/2021 17:54:50 - INFO - __main__ - Step 146929: {'lr': 5.313428621714356e-07, 'samples': 28210368, 'steps': 146928, 'loss/train': 1.5772253274917603} 11/07/2021 17:54:50 - INFO - __main__ - Step 146930: {'lr': 5.309971146213732e-07, 'samples': 28210560, 'steps': 146929, 'loss/train': 1.199657678604126} 11/07/2021 17:54:51 - INFO - __main__ - Step 146931: {'lr': 5.306514794782002e-07, 'samples': 28210752, 'steps': 146930, 'loss/train': 0.7951894998550415} 11/07/2021 17:54:52 - INFO - __main__ - Step 146932: {'lr': 5.303059567420554e-07, 'samples': 28210944, 'steps': 146931, 'loss/train': 1.1606521606445312} 11/07/2021 17:54:52 - INFO - __main__ - Step 146933: {'lr': 5.299605464131329e-07, 'samples': 28211136, 'steps': 146932, 'loss/train': 1.312366008758545} 11/07/2021 17:54:52 - INFO - __main__ - Step 146934: {'lr': 5.296152484915439e-07, 'samples': 28211328, 'steps': 146933, 'loss/train': 0.9871907830238342} 11/07/2021 17:54:53 - INFO - __main__ - Step 146935: {'lr': 5.292700629774549e-07, 'samples': 28211520, 'steps': 146934, 'loss/train': 1.521774411201477} 11/07/2021 17:54:53 - INFO - __main__ - Step 146936: {'lr': 5.289249898710324e-07, 'samples': 28211712, 'steps': 146935, 'loss/train': 1.6031862497329712} 11/07/2021 17:54:54 - INFO - __main__ - Step 146937: {'lr': 5.28580029172443e-07, 'samples': 28211904, 'steps': 146936, 'loss/train': 1.4648900032043457} 11/07/2021 17:54:54 - INFO - __main__ - Step 146938: {'lr': 5.282351808817975e-07, 'samples': 28212096, 'steps': 146937, 'loss/train': 0.9306327700614929} 11/07/2021 17:54:55 - INFO - __main__ - Step 146939: {'lr': 5.278904449992905e-07, 'samples': 28212288, 'steps': 146938, 'loss/train': 1.0911020040512085} 11/07/2021 17:54:55 - INFO - __main__ - Step 146940: {'lr': 5.275458215250606e-07, 'samples': 28212480, 'steps': 146939, 'loss/train': 1.2147325277328491} 11/07/2021 17:54:56 - INFO - __main__ - Step 146941: {'lr': 5.272013104592743e-07, 'samples': 28212672, 'steps': 146940, 'loss/train': 1.3477442264556885} 11/07/2021 17:54:57 - INFO - __main__ - Step 146942: {'lr': 5.268569118020982e-07, 'samples': 28212864, 'steps': 146941, 'loss/train': 0.8836772441864014} 11/07/2021 17:54:57 - INFO - __main__ - Step 146943: {'lr': 5.265126255536434e-07, 'samples': 28213056, 'steps': 146942, 'loss/train': 1.1393611431121826} 11/07/2021 17:54:57 - INFO - __main__ - Step 146944: {'lr': 5.261684517141318e-07, 'samples': 28213248, 'steps': 146943, 'loss/train': 1.5884079933166504} 11/07/2021 17:54:58 - INFO - __main__ - Step 146945: {'lr': 5.258243902836468e-07, 'samples': 28213440, 'steps': 146944, 'loss/train': 1.139747977256775} 11/07/2021 17:54:58 - INFO - __main__ - Step 146946: {'lr': 5.254804412623826e-07, 'samples': 28213632, 'steps': 146945, 'loss/train': 0.457065612077713} 11/07/2021 17:54:59 - INFO - __main__ - Step 146947: {'lr': 5.25136604650478e-07, 'samples': 28213824, 'steps': 146946, 'loss/train': 1.6736493110656738} 11/07/2021 17:54:59 - INFO - __main__ - Step 146948: {'lr': 5.247928804480994e-07, 'samples': 28214016, 'steps': 146947, 'loss/train': 1.3991676568984985} 11/07/2021 17:55:00 - INFO - __main__ - Step 146949: {'lr': 5.244492686554137e-07, 'samples': 28214208, 'steps': 146948, 'loss/train': 1.0704271793365479} 11/07/2021 17:55:00 - INFO - __main__ - Step 146950: {'lr': 5.241057692725593e-07, 'samples': 28214400, 'steps': 146949, 'loss/train': 0.9385663866996765} 11/07/2021 17:55:01 - INFO - __main__ - Step 146951: {'lr': 5.237623822996751e-07, 'samples': 28214592, 'steps': 146950, 'loss/train': 1.1804230213165283} 11/07/2021 17:55:02 - INFO - __main__ - Step 146952: {'lr': 5.234191077369554e-07, 'samples': 28214784, 'steps': 146951, 'loss/train': 1.216329574584961} 11/07/2021 17:55:02 - INFO - __main__ - Step 146953: {'lr': 5.230759455845113e-07, 'samples': 28214976, 'steps': 146952, 'loss/train': 1.4804236888885498} 11/07/2021 17:55:02 - INFO - __main__ - Step 146954: {'lr': 5.22732895842537e-07, 'samples': 28215168, 'steps': 146953, 'loss/train': 1.683626413345337} 11/07/2021 17:55:03 - INFO - __main__ - Step 146955: {'lr': 5.223899585111713e-07, 'samples': 28215360, 'steps': 146954, 'loss/train': 1.3696026802062988} 11/07/2021 17:55:03 - INFO - __main__ - Step 146956: {'lr': 5.22047133590553e-07, 'samples': 28215552, 'steps': 146955, 'loss/train': 1.587267279624939} 11/07/2021 17:55:04 - INFO - __main__ - Step 146957: {'lr': 5.217044210808764e-07, 'samples': 28215744, 'steps': 146956, 'loss/train': 1.621290683746338} 11/07/2021 17:55:04 - INFO - __main__ - Step 146958: {'lr': 5.213618209822524e-07, 'samples': 28215936, 'steps': 146957, 'loss/train': 1.380253553390503} 11/07/2021 17:55:05 - INFO - __main__ - Step 146959: {'lr': 5.210193332948476e-07, 'samples': 28216128, 'steps': 146958, 'loss/train': 0.6504949331283569} 11/07/2021 17:55:05 - INFO - __main__ - Step 146960: {'lr': 5.206769580188286e-07, 'samples': 28216320, 'steps': 146959, 'loss/train': 1.8461871147155762} 11/07/2021 17:55:05 - INFO - __main__ - Step 146961: {'lr': 5.203346951543342e-07, 'samples': 28216512, 'steps': 146960, 'loss/train': 1.3598436117172241} 11/07/2021 17:55:07 - INFO - __main__ - Step 146962: {'lr': 5.199925447015307e-07, 'samples': 28216704, 'steps': 146961, 'loss/train': 1.3128970861434937} 11/07/2021 17:55:07 - INFO - __main__ - Step 146963: {'lr': 5.19650506660585e-07, 'samples': 28216896, 'steps': 146962, 'loss/train': 1.1556332111358643} 11/07/2021 17:55:07 - INFO - __main__ - Step 146964: {'lr': 5.193085810316078e-07, 'samples': 28217088, 'steps': 146963, 'loss/train': 1.392492413520813} 11/07/2021 17:55:08 - INFO - __main__ - Step 146965: {'lr': 5.189667678147936e-07, 'samples': 28217280, 'steps': 146964, 'loss/train': 0.9358727335929871} 11/07/2021 17:55:08 - INFO - __main__ - Step 146966: {'lr': 5.18625067010281e-07, 'samples': 28217472, 'steps': 146965, 'loss/train': 1.2333550453186035} 11/07/2021 17:55:09 - INFO - __main__ - Step 146967: {'lr': 5.182834786182366e-07, 'samples': 28217664, 'steps': 146966, 'loss/train': 1.41159987449646} 11/07/2021 17:55:10 - INFO - __main__ - Step 146968: {'lr': 5.179420026387993e-07, 'samples': 28217856, 'steps': 146967, 'loss/train': 1.1573940515518188} 11/07/2021 17:55:10 - INFO - __main__ - Step 146969: {'lr': 5.176006390721355e-07, 'samples': 28218048, 'steps': 146968, 'loss/train': 1.3055312633514404} 11/07/2021 17:55:10 - INFO - __main__ - Step 146970: {'lr': 5.17259387918384e-07, 'samples': 28218240, 'steps': 146969, 'loss/train': 1.7697861194610596} 11/07/2021 17:55:11 - INFO - __main__ - Step 146971: {'lr': 5.169182491777113e-07, 'samples': 28218432, 'steps': 146970, 'loss/train': 1.262890100479126} 11/07/2021 17:55:12 - INFO - __main__ - Step 146972: {'lr': 5.165772228502563e-07, 'samples': 28218624, 'steps': 146971, 'loss/train': 1.1605818271636963} 11/07/2021 17:55:12 - INFO - __main__ - Step 146973: {'lr': 5.162363089361577e-07, 'samples': 28218816, 'steps': 146972, 'loss/train': 0.978467583656311} 11/07/2021 17:55:12 - INFO - __main__ - Step 146974: {'lr': 5.158955074356375e-07, 'samples': 28219008, 'steps': 146973, 'loss/train': 1.2679568529129028} 11/07/2021 17:55:13 - INFO - __main__ - Step 146975: {'lr': 5.15554818348779e-07, 'samples': 28219200, 'steps': 146974, 'loss/train': 1.599779486656189} 11/07/2021 17:55:13 - INFO - __main__ - Step 146976: {'lr': 5.152142416757766e-07, 'samples': 28219392, 'steps': 146975, 'loss/train': 1.5289872884750366} 11/07/2021 17:55:13 - INFO - __main__ - Step 146977: {'lr': 5.148737774167412e-07, 'samples': 28219584, 'steps': 146976, 'loss/train': 1.3124738931655884} 11/07/2021 17:55:14 - INFO - __main__ - Step 146978: {'lr': 5.145334255718948e-07, 'samples': 28219776, 'steps': 146977, 'loss/train': 1.8127230405807495} 11/07/2021 17:55:15 - INFO - __main__ - Step 146979: {'lr': 5.141931861413207e-07, 'samples': 28219968, 'steps': 146978, 'loss/train': 0.9317293167114258} 11/07/2021 17:55:15 - INFO - __main__ - Step 146980: {'lr': 5.138530591252133e-07, 'samples': 28220160, 'steps': 146979, 'loss/train': 1.026314377784729} 11/07/2021 17:55:15 - INFO - __main__ - Step 146981: {'lr': 5.135130445237113e-07, 'samples': 28220352, 'steps': 146980, 'loss/train': 1.0226856470108032} 11/07/2021 17:55:16 - INFO - __main__ - Step 146982: {'lr': 5.131731423369534e-07, 'samples': 28220544, 'steps': 146981, 'loss/train': 1.1721185445785522} 11/07/2021 17:55:17 - INFO - __main__ - Step 146983: {'lr': 5.128333525651341e-07, 'samples': 28220736, 'steps': 146982, 'loss/train': 1.0568633079528809} 11/07/2021 17:55:17 - INFO - __main__ - Step 146984: {'lr': 5.124936752083642e-07, 'samples': 28220928, 'steps': 146983, 'loss/train': 1.3103200197219849} 11/07/2021 17:55:18 - INFO - __main__ - Step 146985: {'lr': 5.121541102668382e-07, 'samples': 28221120, 'steps': 146984, 'loss/train': 1.2934582233428955} 11/07/2021 17:55:18 - INFO - __main__ - Step 146986: {'lr': 5.118146577406668e-07, 'samples': 28221312, 'steps': 146985, 'loss/train': 1.2876310348510742} 11/07/2021 17:55:18 - INFO - __main__ - Step 146987: {'lr': 5.114753176300169e-07, 'samples': 28221504, 'steps': 146986, 'loss/train': 1.4493091106414795} 11/07/2021 17:55:19 - INFO - __main__ - Step 146988: {'lr': 5.111360899350548e-07, 'samples': 28221696, 'steps': 146987, 'loss/train': 1.0999068021774292} 11/07/2021 17:55:20 - INFO - __main__ - Step 146989: {'lr': 5.107969746559471e-07, 'samples': 28221888, 'steps': 146988, 'loss/train': 1.3607807159423828} 11/07/2021 17:55:20 - INFO - __main__ - Step 146990: {'lr': 5.104579717928049e-07, 'samples': 28222080, 'steps': 146989, 'loss/train': 0.7048705816268921} 11/07/2021 17:55:20 - INFO - __main__ - Step 146991: {'lr': 5.101190813457945e-07, 'samples': 28222272, 'steps': 146990, 'loss/train': 0.6327205896377563} 11/07/2021 17:55:21 - INFO - __main__ - Step 146992: {'lr': 5.097803033150827e-07, 'samples': 28222464, 'steps': 146991, 'loss/train': 1.0540413856506348} 11/07/2021 17:55:22 - INFO - __main__ - Step 146993: {'lr': 5.094416377008082e-07, 'samples': 28222656, 'steps': 146992, 'loss/train': 1.6593713760375977} 11/07/2021 17:55:22 - INFO - __main__ - Step 146994: {'lr': 5.091030845031097e-07, 'samples': 28222848, 'steps': 146993, 'loss/train': 0.9229803681373596} 11/07/2021 17:55:22 - INFO - __main__ - Step 146995: {'lr': 5.087646437221815e-07, 'samples': 28223040, 'steps': 146994, 'loss/train': 1.1546940803527832} 11/07/2021 17:55:23 - INFO - __main__ - Step 146996: {'lr': 5.084263153581625e-07, 'samples': 28223232, 'steps': 146995, 'loss/train': 1.2601268291473389} 11/07/2021 17:55:23 - INFO - __main__ - Step 146997: {'lr': 5.080880994111914e-07, 'samples': 28223424, 'steps': 146996, 'loss/train': 1.3377821445465088} 11/07/2021 17:55:24 - INFO - __main__ - Step 146998: {'lr': 5.077499958814347e-07, 'samples': 28223616, 'steps': 146997, 'loss/train': 1.477994680404663} 11/07/2021 17:55:25 - INFO - __main__ - Step 146999: {'lr': 5.074120047690312e-07, 'samples': 28223808, 'steps': 146998, 'loss/train': 1.5007028579711914} 11/07/2021 17:55:25 - INFO - __main__ - Step 147000: {'lr': 5.070741260741197e-07, 'samples': 28224000, 'steps': 146999, 'loss/train': 1.3725451231002808} 11/07/2021 17:55:25 - INFO - __main__ - Step 147001: {'lr': 5.067363597968666e-07, 'samples': 28224192, 'steps': 147000, 'loss/train': 0.6772065758705139} 11/07/2021 17:55:26 - INFO - __main__ - Step 147002: {'lr': 5.063987059374664e-07, 'samples': 28224384, 'steps': 147001, 'loss/train': 1.3547298908233643} 11/07/2021 17:55:26 - INFO - __main__ - Step 147003: {'lr': 5.060611644960022e-07, 'samples': 28224576, 'steps': 147002, 'loss/train': 1.3746819496154785} 11/07/2021 17:55:27 - INFO - __main__ - Step 147004: {'lr': 5.057237354726685e-07, 'samples': 28224768, 'steps': 147003, 'loss/train': 1.401484727859497} 11/07/2021 17:55:27 - INFO - __main__ - Step 147005: {'lr': 5.053864188676038e-07, 'samples': 28224960, 'steps': 147004, 'loss/train': 0.8717550039291382} 11/07/2021 17:55:28 - INFO - __main__ - Step 147006: {'lr': 5.050492146809471e-07, 'samples': 28225152, 'steps': 147005, 'loss/train': 1.6884628534317017} 11/07/2021 17:55:28 - INFO - __main__ - Step 147007: {'lr': 5.047121229128926e-07, 'samples': 28225344, 'steps': 147006, 'loss/train': 1.2523167133331299} 11/07/2021 17:55:28 - INFO - __main__ - Step 147008: {'lr': 5.043751435635513e-07, 'samples': 28225536, 'steps': 147007, 'loss/train': 1.2347675561904907} 11/07/2021 17:55:30 - INFO - __main__ - Step 147009: {'lr': 5.040382766330898e-07, 'samples': 28225728, 'steps': 147008, 'loss/train': 0.9639045596122742} 11/07/2021 17:55:30 - INFO - __main__ - Step 147010: {'lr': 5.037015221216468e-07, 'samples': 28225920, 'steps': 147009, 'loss/train': 1.4070466756820679} 11/07/2021 17:55:30 - INFO - __main__ - Step 147011: {'lr': 5.033648800293889e-07, 'samples': 28226112, 'steps': 147010, 'loss/train': 1.2179322242736816} 11/07/2021 17:55:31 - INFO - __main__ - Step 147012: {'lr': 5.030283503564825e-07, 'samples': 28226304, 'steps': 147011, 'loss/train': 1.369434118270874} 11/07/2021 17:55:31 - INFO - __main__ - Step 147013: {'lr': 5.026919331030388e-07, 'samples': 28226496, 'steps': 147012, 'loss/train': 1.220733642578125} 11/07/2021 17:55:32 - INFO - __main__ - Step 147014: {'lr': 5.023556282692521e-07, 'samples': 28226688, 'steps': 147013, 'loss/train': 1.5939210653305054} 11/07/2021 17:55:32 - INFO - __main__ - Step 147015: {'lr': 5.02019435855261e-07, 'samples': 28226880, 'steps': 147014, 'loss/train': 1.1041672229766846} 11/07/2021 17:55:33 - INFO - __main__ - Step 147016: {'lr': 5.016833558611767e-07, 'samples': 28227072, 'steps': 147015, 'loss/train': 1.0131853818893433} 11/07/2021 17:55:33 - INFO - __main__ - Step 147017: {'lr': 5.013473882872211e-07, 'samples': 28227264, 'steps': 147016, 'loss/train': 1.5551544427871704} 11/07/2021 17:55:33 - INFO - __main__ - Step 147018: {'lr': 5.010115331334774e-07, 'samples': 28227456, 'steps': 147017, 'loss/train': 1.5259180068969727} 11/07/2021 17:55:35 - INFO - __main__ - Step 147019: {'lr': 5.006757904001403e-07, 'samples': 28227648, 'steps': 147018, 'loss/train': 0.8583418726921082} 11/07/2021 17:55:35 - INFO - __main__ - Step 147020: {'lr': 5.00340160087348e-07, 'samples': 28227840, 'steps': 147019, 'loss/train': 1.2710398435592651} 11/07/2021 17:55:35 - INFO - __main__ - Step 147021: {'lr': 5.000046421952398e-07, 'samples': 28228032, 'steps': 147020, 'loss/train': 1.2135646343231201} 11/07/2021 17:55:36 - INFO - __main__ - Step 147022: {'lr': 4.996692367240096e-07, 'samples': 28228224, 'steps': 147021, 'loss/train': 1.273104190826416} 11/07/2021 17:55:36 - INFO - __main__ - Step 147023: {'lr': 4.993339436737687e-07, 'samples': 28228416, 'steps': 147022, 'loss/train': 1.6380504369735718} 11/07/2021 17:55:36 - INFO - __main__ - Step 147024: {'lr': 4.989987630446558e-07, 'samples': 28228608, 'steps': 147023, 'loss/train': 1.1947110891342163} 11/07/2021 17:55:37 - INFO - __main__ - Step 147025: {'lr': 4.986636948368651e-07, 'samples': 28228800, 'steps': 147024, 'loss/train': 1.3049498796463013} 11/07/2021 17:55:38 - INFO - __main__ - Step 147026: {'lr': 4.983287390505353e-07, 'samples': 28228992, 'steps': 147025, 'loss/train': 1.277927279472351} 11/07/2021 17:55:38 - INFO - __main__ - Step 147027: {'lr': 4.979938956857777e-07, 'samples': 28229184, 'steps': 147026, 'loss/train': 0.6481015682220459} 11/07/2021 17:55:39 - INFO - __main__ - Step 147028: {'lr': 4.976591647427864e-07, 'samples': 28229376, 'steps': 147027, 'loss/train': 1.486572027206421} 11/07/2021 17:55:39 - INFO - __main__ - Step 147029: {'lr': 4.973245462217002e-07, 'samples': 28229568, 'steps': 147028, 'loss/train': 1.3097813129425049} 11/07/2021 17:55:40 - INFO - __main__ - Step 147030: {'lr': 4.969900401226857e-07, 'samples': 28229760, 'steps': 147029, 'loss/train': 1.5089244842529297} 11/07/2021 17:55:40 - INFO - __main__ - Step 147031: {'lr': 4.966556464458538e-07, 'samples': 28229952, 'steps': 147030, 'loss/train': 1.0965406894683838} 11/07/2021 17:55:41 - INFO - __main__ - Step 147032: {'lr': 4.96321365191399e-07, 'samples': 28230144, 'steps': 147031, 'loss/train': 1.3293746709823608} 11/07/2021 17:55:41 - INFO - __main__ - Step 147033: {'lr': 4.95987196359432e-07, 'samples': 28230336, 'steps': 147032, 'loss/train': 1.1369673013687134} 11/07/2021 17:55:41 - INFO - __main__ - Step 147034: {'lr': 4.956531399501474e-07, 'samples': 28230528, 'steps': 147033, 'loss/train': 0.7999866008758545} 11/07/2021 17:55:42 - INFO - __main__ - Step 147035: {'lr': 4.95319195963656e-07, 'samples': 28230720, 'steps': 147034, 'loss/train': 1.436902403831482} 11/07/2021 17:55:43 - INFO - __main__ - Step 147036: {'lr': 4.949853644001246e-07, 'samples': 28230912, 'steps': 147035, 'loss/train': 0.9406614899635315} 11/07/2021 17:55:43 - INFO - __main__ - Step 147037: {'lr': 4.946516452596917e-07, 'samples': 28231104, 'steps': 147036, 'loss/train': 0.7199698090553284} 11/07/2021 17:55:44 - INFO - __main__ - Step 147038: {'lr': 4.943180385425238e-07, 'samples': 28231296, 'steps': 147037, 'loss/train': 1.2632631063461304} 11/07/2021 17:55:44 - INFO - __main__ - Step 147039: {'lr': 4.939845442487878e-07, 'samples': 28231488, 'steps': 147038, 'loss/train': 1.314500331878662} 11/07/2021 17:55:45 - INFO - __main__ - Step 147040: {'lr': 4.936511623785944e-07, 'samples': 28231680, 'steps': 147039, 'loss/train': 1.198068380355835} 11/07/2021 17:55:45 - INFO - __main__ - Step 147041: {'lr': 4.933178929321103e-07, 'samples': 28231872, 'steps': 147040, 'loss/train': 1.1376382112503052} 11/07/2021 17:55:46 - INFO - __main__ - Step 147042: {'lr': 4.929847359095019e-07, 'samples': 28232064, 'steps': 147041, 'loss/train': 1.0430419445037842} 11/07/2021 17:55:46 - INFO - __main__ - Step 147043: {'lr': 4.926516913108803e-07, 'samples': 28232256, 'steps': 147042, 'loss/train': 1.3785791397094727} 11/07/2021 17:55:46 - INFO - __main__ - Step 147044: {'lr': 4.923187591364398e-07, 'samples': 28232448, 'steps': 147043, 'loss/train': 0.981029748916626} 11/07/2021 17:55:47 - INFO - __main__ - Step 147045: {'lr': 4.919859393862913e-07, 'samples': 28232640, 'steps': 147044, 'loss/train': 0.8945174217224121} 11/07/2021 17:55:48 - INFO - __main__ - Step 147046: {'lr': 4.916532320606293e-07, 'samples': 28232832, 'steps': 147045, 'loss/train': 1.1634024381637573} 11/07/2021 17:55:48 - INFO - __main__ - Step 147047: {'lr': 4.913206371595647e-07, 'samples': 28233024, 'steps': 147046, 'loss/train': 1.1734188795089722} 11/07/2021 17:55:48 - INFO - __main__ - Step 147048: {'lr': 4.90988154683264e-07, 'samples': 28233216, 'steps': 147047, 'loss/train': 1.2044168710708618} 11/07/2021 17:55:49 - INFO - __main__ - Step 147049: {'lr': 4.90655784631866e-07, 'samples': 28233408, 'steps': 147048, 'loss/train': 1.5158109664916992} 11/07/2021 17:55:50 - INFO - __main__ - Step 147050: {'lr': 4.903235270055373e-07, 'samples': 28233600, 'steps': 147049, 'loss/train': 1.27064049243927} 11/07/2021 17:55:50 - INFO - __main__ - Step 147051: {'lr': 4.899913818044166e-07, 'samples': 28233792, 'steps': 147050, 'loss/train': 1.3449691534042358} 11/07/2021 17:55:51 - INFO - __main__ - Step 147052: {'lr': 4.896593490286427e-07, 'samples': 28233984, 'steps': 147051, 'loss/train': 1.0276514291763306} 11/07/2021 17:55:51 - INFO - __main__ - Step 147053: {'lr': 4.893274286783822e-07, 'samples': 28234176, 'steps': 147052, 'loss/train': 1.6817474365234375} 11/07/2021 17:55:51 - INFO - __main__ - Step 147054: {'lr': 4.889956207538015e-07, 'samples': 28234368, 'steps': 147053, 'loss/train': 1.8046503067016602} 11/07/2021 17:55:52 - INFO - __main__ - Step 147055: {'lr': 4.886639252550118e-07, 'samples': 28234560, 'steps': 147054, 'loss/train': 1.3895044326782227} 11/07/2021 17:55:53 - INFO - __main__ - Step 147056: {'lr': 4.883323421821795e-07, 'samples': 28234752, 'steps': 147055, 'loss/train': 1.037140130996704} 11/07/2021 17:55:53 - INFO - __main__ - Step 147057: {'lr': 4.880008715354434e-07, 'samples': 28234944, 'steps': 147056, 'loss/train': 1.5332194566726685} 11/07/2021 17:55:53 - INFO - __main__ - Step 147058: {'lr': 4.876695133149977e-07, 'samples': 28235136, 'steps': 147057, 'loss/train': 0.8832322359085083} 11/07/2021 17:55:54 - INFO - __main__ - Step 147059: {'lr': 4.873382675209259e-07, 'samples': 28235328, 'steps': 147058, 'loss/train': 1.477001667022705} 11/07/2021 17:55:54 - INFO - __main__ - Step 147060: {'lr': 4.870071341534221e-07, 'samples': 28235520, 'steps': 147059, 'loss/train': 1.705022931098938} 11/07/2021 17:55:55 - INFO - __main__ - Step 147061: {'lr': 4.866761132126252e-07, 'samples': 28235712, 'steps': 147060, 'loss/train': 1.2863739728927612} 11/07/2021 17:55:55 - INFO - __main__ - Step 147062: {'lr': 4.863452046986738e-07, 'samples': 28235904, 'steps': 147061, 'loss/train': 1.3225805759429932} 11/07/2021 17:55:56 - INFO - __main__ - Step 147063: {'lr': 4.860144086117347e-07, 'samples': 28236096, 'steps': 147062, 'loss/train': 1.37235689163208} 11/07/2021 17:55:56 - INFO - __main__ - Step 147064: {'lr': 4.856837249519463e-07, 'samples': 28236288, 'steps': 147063, 'loss/train': 1.278684139251709} 11/07/2021 17:55:56 - INFO - __main__ - Step 147065: {'lr': 4.853531537194478e-07, 'samples': 28236480, 'steps': 147064, 'loss/train': 1.3900705575942993} 11/07/2021 17:55:58 - INFO - __main__ - Step 147066: {'lr': 4.850226949144054e-07, 'samples': 28236672, 'steps': 147065, 'loss/train': 1.6469759941101074} 11/07/2021 17:55:58 - INFO - __main__ - Step 147067: {'lr': 4.846923485369858e-07, 'samples': 28236864, 'steps': 147066, 'loss/train': 1.317456603050232} 11/07/2021 17:55:59 - INFO - __main__ - Step 147068: {'lr': 4.843621145872723e-07, 'samples': 28237056, 'steps': 147067, 'loss/train': 1.6626261472702026} 11/07/2021 17:55:59 - INFO - __main__ - Step 147069: {'lr': 4.840319930654868e-07, 'samples': 28237248, 'steps': 147068, 'loss/train': 0.4159528613090515} 11/07/2021 17:55:59 - INFO - __main__ - Step 147070: {'lr': 4.837019839717127e-07, 'samples': 28237440, 'steps': 147069, 'loss/train': 0.860072672367096} 11/07/2021 17:56:00 - INFO - __main__ - Step 147071: {'lr': 4.83372087306172e-07, 'samples': 28237632, 'steps': 147070, 'loss/train': 1.2871623039245605} 11/07/2021 17:56:01 - INFO - __main__ - Step 147072: {'lr': 4.83042303068948e-07, 'samples': 28237824, 'steps': 147071, 'loss/train': 1.4564749002456665} 11/07/2021 17:56:01 - INFO - __main__ - Step 147073: {'lr': 4.827126312602071e-07, 'samples': 28238016, 'steps': 147072, 'loss/train': 1.0952948331832886} 11/07/2021 17:56:01 - INFO - __main__ - Step 147074: {'lr': 4.823830718801159e-07, 'samples': 28238208, 'steps': 147073, 'loss/train': 2.1415069103240967} 11/07/2021 17:56:02 - INFO - __main__ - Step 147075: {'lr': 4.820536249288133e-07, 'samples': 28238400, 'steps': 147074, 'loss/train': 1.2159453630447388} 11/07/2021 17:56:03 - INFO - __main__ - Step 147076: {'lr': 4.817242904064656e-07, 'samples': 28238592, 'steps': 147075, 'loss/train': 1.3473719358444214} 11/07/2021 17:56:03 - INFO - __main__ - Step 147077: {'lr': 4.81395068313184e-07, 'samples': 28238784, 'steps': 147076, 'loss/train': 0.5466746687889099} 11/07/2021 17:56:03 - INFO - __main__ - Step 147078: {'lr': 4.81065958649135e-07, 'samples': 28238976, 'steps': 147077, 'loss/train': 1.2014515399932861} 11/07/2021 17:56:04 - INFO - __main__ - Step 147079: {'lr': 4.807369614144852e-07, 'samples': 28239168, 'steps': 147078, 'loss/train': 1.7807214260101318} 11/07/2021 17:56:04 - INFO - __main__ - Step 147080: {'lr': 4.804080766093455e-07, 'samples': 28239360, 'steps': 147079, 'loss/train': 1.112200379371643} 11/07/2021 17:56:05 - INFO - __main__ - Step 147081: {'lr': 4.800793042338825e-07, 'samples': 28239552, 'steps': 147080, 'loss/train': 1.0914912223815918} 11/07/2021 17:56:05 - INFO - __main__ - Step 147082: {'lr': 4.797506442882626e-07, 'samples': 28239744, 'steps': 147081, 'loss/train': 1.84392249584198} 11/07/2021 17:56:06 - INFO - __main__ - Step 147083: {'lr': 4.794220967725971e-07, 'samples': 28239936, 'steps': 147082, 'loss/train': 1.3939682245254517} 11/07/2021 17:56:06 - INFO - __main__ - Step 147084: {'lr': 4.790936616870522e-07, 'samples': 28240128, 'steps': 147083, 'loss/train': 1.3055325746536255} 11/07/2021 17:56:06 - INFO - __main__ - Step 147085: {'lr': 4.787653390317948e-07, 'samples': 28240320, 'steps': 147084, 'loss/train': 1.1298093795776367} 11/07/2021 17:56:07 - INFO - __main__ - Step 147086: {'lr': 4.784371288069356e-07, 'samples': 28240512, 'steps': 147085, 'loss/train': 1.1290618181228638} 11/07/2021 17:56:08 - INFO - __main__ - Step 147087: {'lr': 4.78109031012669e-07, 'samples': 28240704, 'steps': 147086, 'loss/train': 1.4222662448883057} 11/07/2021 17:56:08 - INFO - __main__ - Step 147088: {'lr': 4.777810456491061e-07, 'samples': 28240896, 'steps': 147087, 'loss/train': 1.2600992918014526} 11/07/2021 17:56:08 - INFO - __main__ - Step 147089: {'lr': 4.774531727163856e-07, 'samples': 28241088, 'steps': 147088, 'loss/train': 0.7236965298652649} 11/07/2021 17:56:09 - INFO - __main__ - Step 147090: {'lr': 4.771254122147017e-07, 'samples': 28241280, 'steps': 147089, 'loss/train': 1.4127435684204102} 11/07/2021 17:56:09 - INFO - __main__ - Step 147091: {'lr': 4.7679776414416567e-07, 'samples': 28241472, 'steps': 147090, 'loss/train': 1.0558485984802246} 11/07/2021 17:56:10 - INFO - __main__ - Step 147092: {'lr': 4.7647022850491603e-07, 'samples': 28241664, 'steps': 147091, 'loss/train': 0.8764669895172119} 11/07/2021 17:56:11 - INFO - __main__ - Step 147093: {'lr': 4.7614280529714724e-07, 'samples': 28241856, 'steps': 147092, 'loss/train': 1.433984398841858} 11/07/2021 17:56:11 - INFO - __main__ - Step 147094: {'lr': 4.758154945209425e-07, 'samples': 28242048, 'steps': 147093, 'loss/train': 1.6129179000854492} 11/07/2021 17:56:11 - INFO - __main__ - Step 147095: {'lr': 4.754882961765239e-07, 'samples': 28242240, 'steps': 147094, 'loss/train': 1.3285629749298096} 11/07/2021 17:56:12 - INFO - __main__ - Step 147096: {'lr': 4.751612102639746e-07, 'samples': 28242432, 'steps': 147095, 'loss/train': 1.6812760829925537} 11/07/2021 17:56:13 - INFO - __main__ - Step 147097: {'lr': 4.7483423678346126e-07, 'samples': 28242624, 'steps': 147096, 'loss/train': 1.2041913270950317} 11/07/2021 17:56:13 - INFO - __main__ - Step 147098: {'lr': 4.7450737573515036e-07, 'samples': 28242816, 'steps': 147097, 'loss/train': 1.3429912328720093} 11/07/2021 17:56:13 - INFO - __main__ - Step 147099: {'lr': 4.7418062711918063e-07, 'samples': 28243008, 'steps': 147098, 'loss/train': 1.3420264720916748} 11/07/2021 17:56:14 - INFO - __main__ - Step 147100: {'lr': 4.738539909356909e-07, 'samples': 28243200, 'steps': 147099, 'loss/train': 1.6712294816970825} 11/07/2021 17:56:14 - INFO - __main__ - Step 147101: {'lr': 4.7352746718482e-07, 'samples': 28243392, 'steps': 147100, 'loss/train': 1.2770971059799194} 11/07/2021 17:56:15 - INFO - __main__ - Step 147102: {'lr': 4.732010558667621e-07, 'samples': 28243584, 'steps': 147101, 'loss/train': 1.2921339273452759} 11/07/2021 17:56:16 - INFO - __main__ - Step 147103: {'lr': 4.7287475698160056e-07, 'samples': 28243776, 'steps': 147102, 'loss/train': 1.355638027191162} 11/07/2021 17:56:16 - INFO - __main__ - Step 147104: {'lr': 4.725485705295296e-07, 'samples': 28243968, 'steps': 147103, 'loss/train': 1.4678258895874023} 11/07/2021 17:56:16 - INFO - __main__ - Step 147105: {'lr': 4.722224965106603e-07, 'samples': 28244160, 'steps': 147104, 'loss/train': 1.3022072315216064} 11/07/2021 17:56:17 - INFO - __main__ - Step 147106: {'lr': 4.7189653492515915e-07, 'samples': 28244352, 'steps': 147105, 'loss/train': 1.6713663339614868} 11/07/2021 17:56:18 - INFO - __main__ - Step 147107: {'lr': 4.71570685773165e-07, 'samples': 28244544, 'steps': 147106, 'loss/train': 1.1450453996658325} 11/07/2021 17:56:18 - INFO - __main__ - Step 147108: {'lr': 4.7124494905484425e-07, 'samples': 28244736, 'steps': 147107, 'loss/train': 1.3176201581954956} 11/07/2021 17:56:18 - INFO - __main__ - Step 147109: {'lr': 4.709193247703081e-07, 'samples': 28244928, 'steps': 147108, 'loss/train': 1.0418318510055542} 11/07/2021 17:56:19 - INFO - __main__ - Step 147110: {'lr': 4.7059381291975066e-07, 'samples': 28245120, 'steps': 147109, 'loss/train': 0.9979953765869141} 11/07/2021 17:56:19 - INFO - __main__ - Step 147111: {'lr': 4.702684135032831e-07, 'samples': 28245312, 'steps': 147110, 'loss/train': 1.2993390560150146} 11/07/2021 17:56:20 - INFO - __main__ - Step 147112: {'lr': 4.6994312652107185e-07, 'samples': 28245504, 'steps': 147111, 'loss/train': 1.2434377670288086} 11/07/2021 17:56:20 - INFO - __main__ - Step 147113: {'lr': 4.6961795197322796e-07, 'samples': 28245696, 'steps': 147112, 'loss/train': 1.5488122701644897} 11/07/2021 17:56:21 - INFO - __main__ - Step 147114: {'lr': 4.692928898599458e-07, 'samples': 28245888, 'steps': 147113, 'loss/train': 1.0032837390899658} 11/07/2021 17:56:21 - INFO - __main__ - Step 147115: {'lr': 4.6896794018136404e-07, 'samples': 28246080, 'steps': 147114, 'loss/train': 1.421139121055603} 11/07/2021 17:56:21 - INFO - __main__ - Step 147116: {'lr': 4.686431029375937e-07, 'samples': 28246272, 'steps': 147115, 'loss/train': 0.9388652443885803} 11/07/2021 17:56:22 - INFO - __main__ - Step 147117: {'lr': 4.683183781288014e-07, 'samples': 28246464, 'steps': 147116, 'loss/train': 0.973349392414093} 11/07/2021 17:56:23 - INFO - __main__ - Step 147118: {'lr': 4.6799376575512585e-07, 'samples': 28246656, 'steps': 147117, 'loss/train': 1.2590595483779907} 11/07/2021 17:56:23 - INFO - __main__ - Step 147119: {'lr': 4.676692658167336e-07, 'samples': 28246848, 'steps': 147118, 'loss/train': 0.9521170258522034} 11/07/2021 17:56:24 - INFO - __main__ - Step 147120: {'lr': 4.6734487831376346e-07, 'samples': 28247040, 'steps': 147119, 'loss/train': 1.2424429655075073} 11/07/2021 17:56:24 - INFO - __main__ - Step 147121: {'lr': 4.670206032463542e-07, 'samples': 28247232, 'steps': 147120, 'loss/train': 1.3059124946594238} 11/07/2021 17:56:24 - INFO - __main__ - Step 147122: {'lr': 4.6669644061464455e-07, 'samples': 28247424, 'steps': 147121, 'loss/train': 1.1094040870666504} 11/07/2021 17:56:25 - INFO - __main__ - Step 147123: {'lr': 4.663723904188011e-07, 'samples': 28247616, 'steps': 147122, 'loss/train': 1.384464979171753} 11/07/2021 17:56:26 - INFO - __main__ - Step 147124: {'lr': 4.660484526589626e-07, 'samples': 28247808, 'steps': 147123, 'loss/train': 1.3203239440917969} 11/07/2021 17:56:26 - INFO - __main__ - Step 147125: {'lr': 4.657246273352678e-07, 'samples': 28248000, 'steps': 147124, 'loss/train': 1.2569608688354492} 11/07/2021 17:56:26 - INFO - __main__ - Step 147126: {'lr': 4.654009144478555e-07, 'samples': 28248192, 'steps': 147125, 'loss/train': 1.163071870803833} 11/07/2021 17:56:27 - INFO - __main__ - Step 147127: {'lr': 4.6507731399692e-07, 'samples': 28248384, 'steps': 147126, 'loss/train': 2.0266411304473877} 11/07/2021 17:56:28 - INFO - __main__ - Step 147128: {'lr': 4.647538259825168e-07, 'samples': 28248576, 'steps': 147127, 'loss/train': 0.858270525932312} 11/07/2021 17:56:28 - INFO - __main__ - Step 147129: {'lr': 4.6443045040489575e-07, 'samples': 28248768, 'steps': 147128, 'loss/train': 1.1841111183166504} 11/07/2021 17:56:28 - INFO - __main__ - Step 147130: {'lr': 4.641071872641123e-07, 'samples': 28248960, 'steps': 147129, 'loss/train': 1.3650197982788086} 11/07/2021 17:56:29 - INFO - __main__ - Step 147131: {'lr': 4.637840365603885e-07, 'samples': 28249152, 'steps': 147130, 'loss/train': 1.359209418296814} 11/07/2021 17:56:29 - INFO - __main__ - Step 147132: {'lr': 4.6346099829380763e-07, 'samples': 28249344, 'steps': 147131, 'loss/train': 1.2862249612808228} 11/07/2021 17:56:31 - INFO - __main__ - Step 147133: {'lr': 4.631380724645362e-07, 'samples': 28249536, 'steps': 147132, 'loss/train': 1.27340829372406} 11/07/2021 17:56:31 - INFO - __main__ - Step 147134: {'lr': 4.628152590727408e-07, 'samples': 28249728, 'steps': 147133, 'loss/train': 0.06728442013263702} 11/07/2021 17:56:31 - INFO - __main__ - Step 147135: {'lr': 4.6249255811853244e-07, 'samples': 28249920, 'steps': 147134, 'loss/train': 0.8958089351654053} 11/07/2021 17:56:32 - INFO - __main__ - Step 147136: {'lr': 4.6216996960210533e-07, 'samples': 28250112, 'steps': 147135, 'loss/train': 1.6424862146377563} 11/07/2021 17:56:32 - INFO - __main__ - Step 147137: {'lr': 4.6184749352354284e-07, 'samples': 28250304, 'steps': 147136, 'loss/train': 1.6624094247817993} 11/07/2021 17:56:33 - INFO - __main__ - Step 147138: {'lr': 4.615251298830392e-07, 'samples': 28250496, 'steps': 147137, 'loss/train': 0.05761392414569855} 11/07/2021 17:56:33 - INFO - __main__ - Step 147139: {'lr': 4.612028786807054e-07, 'samples': 28250688, 'steps': 147138, 'loss/train': 1.4270838499069214} 11/07/2021 17:56:34 - INFO - __main__ - Step 147140: {'lr': 4.608807399167081e-07, 'samples': 28250880, 'steps': 147139, 'loss/train': 0.49043914675712585} 11/07/2021 17:56:34 - INFO - __main__ - Step 147141: {'lr': 4.6055871359118594e-07, 'samples': 28251072, 'steps': 147140, 'loss/train': 1.4060261249542236} 11/07/2021 17:56:34 - INFO - __main__ - Step 147142: {'lr': 4.6023679970430555e-07, 'samples': 28251264, 'steps': 147141, 'loss/train': 1.2293187379837036} 11/07/2021 17:56:36 - INFO - __main__ - Step 147143: {'lr': 4.599149982561779e-07, 'samples': 28251456, 'steps': 147142, 'loss/train': 1.2325314283370972} 11/07/2021 17:56:36 - INFO - __main__ - Step 147144: {'lr': 4.5959330924694175e-07, 'samples': 28251648, 'steps': 147143, 'loss/train': 1.2181060314178467} 11/07/2021 17:56:36 - INFO - __main__ - Step 147145: {'lr': 4.5927173267679144e-07, 'samples': 28251840, 'steps': 147144, 'loss/train': 1.5399407148361206} 11/07/2021 17:56:37 - INFO - __main__ - Step 147146: {'lr': 4.5895026854583797e-07, 'samples': 28252032, 'steps': 147145, 'loss/train': 1.3027565479278564} 11/07/2021 17:56:37 - INFO - __main__ - Step 147147: {'lr': 4.5862891685422014e-07, 'samples': 28252224, 'steps': 147146, 'loss/train': 1.477320909500122} 11/07/2021 17:56:37 - INFO - __main__ - Step 147148: {'lr': 4.5830767760210445e-07, 'samples': 28252416, 'steps': 147147, 'loss/train': 0.9687497019767761} 11/07/2021 17:56:38 - INFO - __main__ - Step 147149: {'lr': 4.5798655078960196e-07, 'samples': 28252608, 'steps': 147148, 'loss/train': 1.3755792379379272} 11/07/2021 17:56:39 - INFO - __main__ - Step 147150: {'lr': 4.5766553641690686e-07, 'samples': 28252800, 'steps': 147149, 'loss/train': 1.1855854988098145} 11/07/2021 17:56:39 - INFO - __main__ - Step 147151: {'lr': 4.5734463448413033e-07, 'samples': 28252992, 'steps': 147150, 'loss/train': 1.0958423614501953} 11/07/2021 17:56:39 - INFO - __main__ - Step 147152: {'lr': 4.5702384499141106e-07, 'samples': 28253184, 'steps': 147151, 'loss/train': 1.7517775297164917} 11/07/2021 17:56:40 - INFO - __main__ - Step 147153: {'lr': 4.5670316793891554e-07, 'samples': 28253376, 'steps': 147152, 'loss/train': 1.4204877614974976} 11/07/2021 17:56:41 - INFO - __main__ - Step 147154: {'lr': 4.563826033267826e-07, 'samples': 28253568, 'steps': 147153, 'loss/train': 1.4243890047073364} 11/07/2021 17:56:41 - INFO - __main__ - Step 147155: {'lr': 4.56062151155151e-07, 'samples': 28253760, 'steps': 147154, 'loss/train': 1.6674566268920898} 11/07/2021 17:56:42 - INFO - __main__ - Step 147156: {'lr': 4.557418114241596e-07, 'samples': 28253952, 'steps': 147155, 'loss/train': 1.0218199491500854} 11/07/2021 17:56:42 - INFO - __main__ - Step 147157: {'lr': 4.5542158413397484e-07, 'samples': 28254144, 'steps': 147156, 'loss/train': 1.489708423614502} 11/07/2021 17:56:42 - INFO - __main__ - Step 147158: {'lr': 4.5510146928470776e-07, 'samples': 28254336, 'steps': 147157, 'loss/train': 1.1774628162384033} 11/07/2021 17:56:43 - INFO - __main__ - Step 147159: {'lr': 4.5478146687655265e-07, 'samples': 28254528, 'steps': 147158, 'loss/train': 1.0970178842544556} 11/07/2021 17:56:44 - INFO - __main__ - Step 147160: {'lr': 4.544615769095928e-07, 'samples': 28254720, 'steps': 147159, 'loss/train': 1.171746850013733} 11/07/2021 17:56:44 - INFO - __main__ - Step 147161: {'lr': 4.541417993839947e-07, 'samples': 28254912, 'steps': 147160, 'loss/train': 1.3760852813720703} 11/07/2021 17:56:44 - INFO - __main__ - Step 147162: {'lr': 4.538221342999249e-07, 'samples': 28255104, 'steps': 147161, 'loss/train': 1.1528587341308594} 11/07/2021 17:56:45 - INFO - __main__ - Step 147163: {'lr': 4.535025816575222e-07, 'samples': 28255296, 'steps': 147162, 'loss/train': 1.6741724014282227} 11/07/2021 17:56:45 - INFO - __main__ - Step 147164: {'lr': 4.5318314145689767e-07, 'samples': 28255488, 'steps': 147163, 'loss/train': 1.5174938440322876} 11/07/2021 17:56:46 - INFO - __main__ - Step 147165: {'lr': 4.528638136982455e-07, 'samples': 28255680, 'steps': 147164, 'loss/train': 1.7800672054290771} 11/07/2021 17:56:47 - INFO - __main__ - Step 147166: {'lr': 4.525445983816767e-07, 'samples': 28255872, 'steps': 147165, 'loss/train': 1.2141470909118652} 11/07/2021 17:56:47 - INFO - __main__ - Step 147167: {'lr': 4.5222549550733016e-07, 'samples': 28256064, 'steps': 147166, 'loss/train': 1.535996675491333} 11/07/2021 17:56:47 - INFO - __main__ - Step 147168: {'lr': 4.519065050753446e-07, 'samples': 28256256, 'steps': 147167, 'loss/train': 1.307002067565918} 11/07/2021 17:56:48 - INFO - __main__ - Step 147169: {'lr': 4.515876270859143e-07, 'samples': 28256448, 'steps': 147168, 'loss/train': 1.0762087106704712} 11/07/2021 17:56:49 - INFO - __main__ - Step 147170: {'lr': 4.512688615391225e-07, 'samples': 28256640, 'steps': 147169, 'loss/train': 0.582775354385376} 11/07/2021 17:56:49 - INFO - __main__ - Step 147171: {'lr': 4.5095020843513577e-07, 'samples': 28256832, 'steps': 147170, 'loss/train': 1.2573195695877075} 11/07/2021 17:56:49 - INFO - __main__ - Step 147172: {'lr': 4.506316677741207e-07, 'samples': 28257024, 'steps': 147171, 'loss/train': 1.8677821159362793} 11/07/2021 17:56:50 - INFO - __main__ - Step 147173: {'lr': 4.503132395561882e-07, 'samples': 28257216, 'steps': 147172, 'loss/train': 1.5721454620361328} 11/07/2021 17:56:50 - INFO - __main__ - Step 147174: {'lr': 4.499949237814771e-07, 'samples': 28257408, 'steps': 147173, 'loss/train': 1.3981659412384033} 11/07/2021 17:56:51 - INFO - __main__ - Step 147175: {'lr': 4.496767204501817e-07, 'samples': 28257600, 'steps': 147174, 'loss/train': 0.9692528247833252} 11/07/2021 17:56:51 - INFO - __main__ - Step 147176: {'lr': 4.4935862956238525e-07, 'samples': 28257792, 'steps': 147175, 'loss/train': 1.4144182205200195} 11/07/2021 17:56:52 - INFO - __main__ - Step 147177: {'lr': 4.490406511182543e-07, 'samples': 28257984, 'steps': 147176, 'loss/train': 1.4807506799697876} 11/07/2021 17:56:52 - INFO - __main__ - Step 147178: {'lr': 4.487227851179554e-07, 'samples': 28258176, 'steps': 147177, 'loss/train': 1.3311965465545654} 11/07/2021 17:56:53 - INFO - __main__ - Step 147179: {'lr': 4.484050315615995e-07, 'samples': 28258368, 'steps': 147178, 'loss/train': 0.9465776681900024} 11/07/2021 17:56:53 - INFO - __main__ - Step 147180: {'lr': 4.480873904493532e-07, 'samples': 28258560, 'steps': 147179, 'loss/train': 1.6175988912582397} 11/07/2021 17:56:54 - INFO - __main__ - Step 147181: {'lr': 4.477698617813275e-07, 'samples': 28258752, 'steps': 147180, 'loss/train': 1.2721517086029053} 11/07/2021 17:56:54 - INFO - __main__ - Step 147182: {'lr': 4.474524455577167e-07, 'samples': 28258944, 'steps': 147181, 'loss/train': 1.2450016736984253} 11/07/2021 17:56:55 - INFO - __main__ - Step 147183: {'lr': 4.4713514177860404e-07, 'samples': 28259136, 'steps': 147182, 'loss/train': 1.249375343322754} 11/07/2021 17:56:55 - INFO - __main__ - Step 147184: {'lr': 4.468179504441838e-07, 'samples': 28259328, 'steps': 147183, 'loss/train': 0.9369739294052124} 11/07/2021 17:56:56 - INFO - __main__ - Step 147185: {'lr': 4.4650087155453936e-07, 'samples': 28259520, 'steps': 147184, 'loss/train': 1.1806901693344116} 11/07/2021 17:56:56 - INFO - __main__ - Step 147186: {'lr': 4.461839051098926e-07, 'samples': 28259712, 'steps': 147185, 'loss/train': 1.0458732843399048} 11/07/2021 17:56:57 - INFO - __main__ - Step 147187: {'lr': 4.458670511103269e-07, 'samples': 28259904, 'steps': 147186, 'loss/train': 1.4996390342712402} 11/07/2021 17:56:57 - INFO - __main__ - Step 147188: {'lr': 4.4555030955598096e-07, 'samples': 28260096, 'steps': 147187, 'loss/train': 1.1148353815078735} 11/07/2021 17:56:57 - INFO - __main__ - Step 147189: {'lr': 4.4523368044704915e-07, 'samples': 28260288, 'steps': 147188, 'loss/train': 1.3488986492156982} 11/07/2021 17:56:58 - INFO - __main__ - Step 147190: {'lr': 4.4491716378364245e-07, 'samples': 28260480, 'steps': 147189, 'loss/train': 1.157741904258728} 11/07/2021 17:56:59 - INFO - __main__ - Step 147191: {'lr': 4.4460075956589964e-07, 'samples': 28260672, 'steps': 147190, 'loss/train': 1.162227988243103} 11/07/2021 17:56:59 - INFO - __main__ - Step 147192: {'lr': 4.4428446779395946e-07, 'samples': 28260864, 'steps': 147191, 'loss/train': 1.4594026803970337} 11/07/2021 17:57:00 - INFO - __main__ - Step 147193: {'lr': 4.439682884679885e-07, 'samples': 28261056, 'steps': 147192, 'loss/train': 1.205827236175537} 11/07/2021 17:57:00 - INFO - __main__ - Step 147194: {'lr': 4.4365222158812556e-07, 'samples': 28261248, 'steps': 147193, 'loss/train': 1.3581422567367554} 11/07/2021 17:57:00 - INFO - __main__ - Step 147195: {'lr': 4.433362671544816e-07, 'samples': 28261440, 'steps': 147194, 'loss/train': 1.302354097366333} 11/07/2021 17:57:02 - INFO - __main__ - Step 147196: {'lr': 4.4302042516722316e-07, 'samples': 28261632, 'steps': 147195, 'loss/train': 0.9196863174438477} 11/07/2021 17:57:02 - INFO - __main__ - Step 147197: {'lr': 4.4270469562648907e-07, 'samples': 28261824, 'steps': 147196, 'loss/train': 0.21886643767356873} 11/07/2021 17:57:02 - INFO - __main__ - Step 147198: {'lr': 4.4238907853241804e-07, 'samples': 28262016, 'steps': 147197, 'loss/train': 0.7764143347740173} 11/07/2021 17:57:03 - INFO - __main__ - Step 147199: {'lr': 4.4207357388514893e-07, 'samples': 28262208, 'steps': 147198, 'loss/train': 1.1158151626586914} 11/07/2021 17:57:03 - INFO - __main__ - Step 147200: {'lr': 4.4175818168484813e-07, 'samples': 28262400, 'steps': 147199, 'loss/train': 1.3012551069259644} 11/07/2021 17:57:04 - INFO - __main__ - Step 147201: {'lr': 4.414429019316268e-07, 'samples': 28262592, 'steps': 147200, 'loss/train': 1.4405328035354614} 11/07/2021 17:57:04 - INFO - __main__ - Step 147202: {'lr': 4.411277346256515e-07, 'samples': 28262784, 'steps': 147201, 'loss/train': 1.4171152114868164} 11/07/2021 17:57:05 - INFO - __main__ - Step 147203: {'lr': 4.408126797670331e-07, 'samples': 28262976, 'steps': 147202, 'loss/train': 1.1396054029464722} 11/07/2021 17:57:05 - INFO - __main__ - Step 147204: {'lr': 4.4049773735596597e-07, 'samples': 28263168, 'steps': 147203, 'loss/train': 1.0296128988265991} 11/07/2021 17:57:05 - INFO - __main__ - Step 147205: {'lr': 4.401829073925334e-07, 'samples': 28263360, 'steps': 147204, 'loss/train': 1.593719482421875} 11/07/2021 17:57:07 - INFO - __main__ - Step 147206: {'lr': 4.398681898769019e-07, 'samples': 28263552, 'steps': 147205, 'loss/train': 0.6801626682281494} 11/07/2021 17:57:07 - INFO - __main__ - Step 147207: {'lr': 4.3955358480923804e-07, 'samples': 28263744, 'steps': 147206, 'loss/train': 1.013532042503357} 11/07/2021 17:57:07 - INFO - __main__ - Step 147208: {'lr': 4.39239092189625e-07, 'samples': 28263936, 'steps': 147207, 'loss/train': 1.3907338380813599} 11/07/2021 17:57:08 - INFO - __main__ - Step 147209: {'lr': 4.3892471201828486e-07, 'samples': 28264128, 'steps': 147208, 'loss/train': 0.04860461875796318} 11/07/2021 17:57:08 - INFO - __main__ - Step 147210: {'lr': 4.386104442952732e-07, 'samples': 28264320, 'steps': 147209, 'loss/train': 1.3019236326217651} 11/07/2021 17:57:09 - INFO - __main__ - Step 147211: {'lr': 4.382962890207842e-07, 'samples': 28264512, 'steps': 147210, 'loss/train': 1.3678992986679077} 11/07/2021 17:57:10 - INFO - __main__ - Step 147212: {'lr': 4.379822461949845e-07, 'samples': 28264704, 'steps': 147211, 'loss/train': 1.4193267822265625} 11/07/2021 17:57:10 - INFO - __main__ - Step 147213: {'lr': 4.3766831581792953e-07, 'samples': 28264896, 'steps': 147212, 'loss/train': 1.497988224029541} 11/07/2021 17:57:10 - INFO - __main__ - Step 147214: {'lr': 4.373544978898414e-07, 'samples': 28265088, 'steps': 147213, 'loss/train': 1.2783881425857544} 11/07/2021 17:57:11 - INFO - __main__ - Step 147215: {'lr': 4.370407924108033e-07, 'samples': 28265280, 'steps': 147214, 'loss/train': 0.9682691097259521} 11/07/2021 17:57:12 - INFO - __main__ - Step 147216: {'lr': 4.367271993810096e-07, 'samples': 28265472, 'steps': 147215, 'loss/train': 1.4313650131225586} 11/07/2021 17:57:12 - INFO - __main__ - Step 147217: {'lr': 4.3641371880057125e-07, 'samples': 28265664, 'steps': 147216, 'loss/train': 1.2091985940933228} 11/07/2021 17:57:12 - INFO - __main__ - Step 147218: {'lr': 4.361003506696271e-07, 'samples': 28265856, 'steps': 147217, 'loss/train': 1.4251518249511719} 11/07/2021 17:57:13 - INFO - __main__ - Step 147219: {'lr': 4.357870949883158e-07, 'samples': 28266048, 'steps': 147218, 'loss/train': 1.3335086107254028} 11/07/2021 17:57:13 - INFO - __main__ - Step 147220: {'lr': 4.3547395175680404e-07, 'samples': 28266240, 'steps': 147219, 'loss/train': 1.806418538093567} 11/07/2021 17:57:14 - INFO - __main__ - Step 147221: {'lr': 4.351609209752028e-07, 'samples': 28266432, 'steps': 147220, 'loss/train': 1.0846614837646484} 11/07/2021 17:57:14 - INFO - __main__ - Step 147222: {'lr': 4.348480026436785e-07, 'samples': 28266624, 'steps': 147221, 'loss/train': 1.2259047031402588} 11/07/2021 17:57:15 - INFO - __main__ - Step 147223: {'lr': 4.345351967623423e-07, 'samples': 28266816, 'steps': 147222, 'loss/train': 1.3101993799209595} 11/07/2021 17:57:15 - INFO - __main__ - Step 147224: {'lr': 4.342225033313607e-07, 'samples': 28267008, 'steps': 147223, 'loss/train': 1.4500553607940674} 11/07/2021 17:57:15 - INFO - __main__ - Step 147225: {'lr': 4.339099223509002e-07, 'samples': 28267200, 'steps': 147224, 'loss/train': 1.1292617321014404} 11/07/2021 17:57:16 - INFO - __main__ - Step 147226: {'lr': 4.335974538210441e-07, 'samples': 28267392, 'steps': 147225, 'loss/train': 1.8125379085540771} 11/07/2021 17:57:17 - INFO - __main__ - Step 147227: {'lr': 4.332850977419589e-07, 'samples': 28267584, 'steps': 147226, 'loss/train': 1.469495415687561} 11/07/2021 17:57:17 - INFO - __main__ - Step 147228: {'lr': 4.3297285411375565e-07, 'samples': 28267776, 'steps': 147227, 'loss/train': 1.1680316925048828} 11/07/2021 17:57:18 - INFO - __main__ - Step 147229: {'lr': 4.326607229366564e-07, 'samples': 28267968, 'steps': 147228, 'loss/train': 0.049965422600507736} 11/07/2021 17:57:18 - INFO - __main__ - Step 147230: {'lr': 4.323487042107166e-07, 'samples': 28268160, 'steps': 147229, 'loss/train': 1.1549115180969238} 11/07/2021 17:57:18 - INFO - __main__ - Step 147231: {'lr': 4.3203679793610283e-07, 'samples': 28268352, 'steps': 147230, 'loss/train': 1.4255669116973877} 11/07/2021 17:57:20 - INFO - __main__ - Step 147232: {'lr': 4.3172500411298165e-07, 'samples': 28268544, 'steps': 147231, 'loss/train': 0.7464693188667297} 11/07/2021 17:57:20 - INFO - __main__ - Step 147233: {'lr': 4.3141332274146406e-07, 'samples': 28268736, 'steps': 147232, 'loss/train': 1.9138201475143433} 11/07/2021 17:57:20 - INFO - __main__ - Step 147234: {'lr': 4.3110175382171656e-07, 'samples': 28268928, 'steps': 147233, 'loss/train': 1.3462313413619995} 11/07/2021 17:57:21 - INFO - __main__ - Step 147235: {'lr': 4.307902973538502e-07, 'samples': 28269120, 'steps': 147234, 'loss/train': 1.10246741771698} 11/07/2021 17:57:21 - INFO - __main__ - Step 147236: {'lr': 4.304789533380038e-07, 'samples': 28269312, 'steps': 147235, 'loss/train': 1.5310009717941284} 11/07/2021 17:57:21 - INFO - __main__ - Step 147237: {'lr': 4.3016772177434385e-07, 'samples': 28269504, 'steps': 147236, 'loss/train': 1.2058185338974} 11/07/2021 17:57:22 - INFO - __main__ - Step 147238: {'lr': 4.298566026630091e-07, 'samples': 28269696, 'steps': 147237, 'loss/train': 1.5313949584960938} 11/07/2021 17:57:23 - INFO - __main__ - Step 147239: {'lr': 4.2954559600413833e-07, 'samples': 28269888, 'steps': 147238, 'loss/train': 1.272863745689392} 11/07/2021 17:57:23 - INFO - __main__ - Step 147240: {'lr': 4.292347017978426e-07, 'samples': 28270080, 'steps': 147239, 'loss/train': 1.2527936697006226} 11/07/2021 17:57:23 - INFO - __main__ - Step 147241: {'lr': 4.289239200442885e-07, 'samples': 28270272, 'steps': 147240, 'loss/train': 1.5940614938735962} 11/07/2021 17:57:24 - INFO - __main__ - Step 147242: {'lr': 4.286132507436147e-07, 'samples': 28270464, 'steps': 147241, 'loss/train': 1.4557807445526123} 11/07/2021 17:57:25 - INFO - __main__ - Step 147243: {'lr': 4.2830269389595997e-07, 'samples': 28270656, 'steps': 147242, 'loss/train': 1.2530990839004517} 11/07/2021 17:57:25 - INFO - __main__ - Step 147244: {'lr': 4.2799224950143547e-07, 'samples': 28270848, 'steps': 147243, 'loss/train': 0.8125149607658386} 11/07/2021 17:57:26 - INFO - __main__ - Step 147245: {'lr': 4.276819175602353e-07, 'samples': 28271040, 'steps': 147244, 'loss/train': 0.8619740009307861} 11/07/2021 17:57:26 - INFO - __main__ - Step 147246: {'lr': 4.2737169807244293e-07, 'samples': 28271232, 'steps': 147245, 'loss/train': 0.8144082427024841} 11/07/2021 17:57:26 - INFO - __main__ - Step 147247: {'lr': 4.270615910382525e-07, 'samples': 28271424, 'steps': 147246, 'loss/train': 1.0579358339309692} 11/07/2021 17:57:27 - INFO - __main__ - Step 147248: {'lr': 4.2675159645777504e-07, 'samples': 28271616, 'steps': 147247, 'loss/train': 1.1651606559753418} 11/07/2021 17:57:28 - INFO - __main__ - Step 147249: {'lr': 4.264417143311494e-07, 'samples': 28271808, 'steps': 147248, 'loss/train': 1.4318729639053345} 11/07/2021 17:57:28 - INFO - __main__ - Step 147250: {'lr': 4.2613194465851436e-07, 'samples': 28272000, 'steps': 147249, 'loss/train': 1.1877186298370361} 11/07/2021 17:57:28 - INFO - __main__ - Step 147251: {'lr': 4.258222874400086e-07, 'samples': 28272192, 'steps': 147250, 'loss/train': 3.0743892192840576} 11/07/2021 17:57:29 - INFO - __main__ - Step 147252: {'lr': 4.25512742675771e-07, 'samples': 28272384, 'steps': 147251, 'loss/train': 1.1604254245758057} 11/07/2021 17:57:30 - INFO - __main__ - Step 147253: {'lr': 4.2520331036596806e-07, 'samples': 28272576, 'steps': 147252, 'loss/train': 0.11383511871099472} 11/07/2021 17:57:30 - INFO - __main__ - Step 147254: {'lr': 4.24893990510683e-07, 'samples': 28272768, 'steps': 147253, 'loss/train': 1.1547608375549316} 11/07/2021 17:57:31 - INFO - __main__ - Step 147255: {'lr': 4.2458478311011015e-07, 'samples': 28272960, 'steps': 147254, 'loss/train': 0.9386613368988037} 11/07/2021 17:57:31 - INFO - __main__ - Step 147256: {'lr': 4.242756881643883e-07, 'samples': 28273152, 'steps': 147255, 'loss/train': 1.1215252876281738} 11/07/2021 17:57:31 - INFO - __main__ - Step 147257: {'lr': 4.2396670567360074e-07, 'samples': 28273344, 'steps': 147256, 'loss/train': 1.1749415397644043} 11/07/2021 17:57:32 - INFO - __main__ - Step 147258: {'lr': 4.236578356379417e-07, 'samples': 28273536, 'steps': 147257, 'loss/train': 1.4195513725280762} 11/07/2021 17:57:33 - INFO - __main__ - Step 147259: {'lr': 4.233490780575222e-07, 'samples': 28273728, 'steps': 147258, 'loss/train': 1.1183141469955444} 11/07/2021 17:57:33 - INFO - __main__ - Step 147260: {'lr': 4.2304043293250883e-07, 'samples': 28273920, 'steps': 147259, 'loss/train': 1.1632869243621826} 11/07/2021 17:57:33 - INFO - __main__ - Step 147261: {'lr': 4.2273190026301256e-07, 'samples': 28274112, 'steps': 147260, 'loss/train': 0.5887569785118103} 11/07/2021 17:57:34 - INFO - __main__ - Step 147262: {'lr': 4.224234800491722e-07, 'samples': 28274304, 'steps': 147261, 'loss/train': 1.386408805847168} 11/07/2021 17:57:34 - INFO - __main__ - Step 147263: {'lr': 4.2211517229115427e-07, 'samples': 28274496, 'steps': 147262, 'loss/train': 1.283119797706604} 11/07/2021 17:57:35 - INFO - __main__ - Step 147264: {'lr': 4.2180697698906976e-07, 'samples': 28274688, 'steps': 147263, 'loss/train': 0.6568517088890076} 11/07/2021 17:57:36 - INFO - __main__ - Step 147265: {'lr': 4.2149889414305753e-07, 'samples': 28274880, 'steps': 147264, 'loss/train': 1.3559651374816895} 11/07/2021 17:57:36 - INFO - __main__ - Step 147266: {'lr': 4.2119092375328407e-07, 'samples': 28275072, 'steps': 147265, 'loss/train': 0.9482184648513794} 11/07/2021 17:57:36 - INFO - __main__ - Step 147267: {'lr': 4.2088306581986034e-07, 'samples': 28275264, 'steps': 147266, 'loss/train': 0.7447375059127808} 11/07/2021 17:57:37 - INFO - __main__ - Step 147268: {'lr': 4.2057532034292525e-07, 'samples': 28275456, 'steps': 147267, 'loss/train': 1.423430323600769} 11/07/2021 17:57:38 - INFO - __main__ - Step 147269: {'lr': 4.202676873226452e-07, 'samples': 28275648, 'steps': 147268, 'loss/train': 2.7683680057525635} 11/07/2021 17:57:38 - INFO - __main__ - Step 147270: {'lr': 4.199601667591313e-07, 'samples': 28275840, 'steps': 147269, 'loss/train': 1.6044645309448242} 11/07/2021 17:57:38 - INFO - __main__ - Step 147271: {'lr': 4.1965275865255003e-07, 'samples': 28276032, 'steps': 147270, 'loss/train': 1.114070177078247} 11/07/2021 17:57:39 - INFO - __main__ - Step 147272: {'lr': 4.193454630030125e-07, 'samples': 28276224, 'steps': 147271, 'loss/train': 0.9925128817558289} 11/07/2021 17:57:39 - INFO - __main__ - Step 147273: {'lr': 4.1903827981065736e-07, 'samples': 28276416, 'steps': 147272, 'loss/train': 1.4201996326446533} 11/07/2021 17:57:40 - INFO - __main__ - Step 147274: {'lr': 4.187312090756512e-07, 'samples': 28276608, 'steps': 147273, 'loss/train': 1.1710331439971924} 11/07/2021 17:57:40 - INFO - __main__ - Step 147275: {'lr': 4.1842425079810506e-07, 'samples': 28276800, 'steps': 147274, 'loss/train': 1.4137388467788696} 11/07/2021 17:57:41 - INFO - __main__ - Step 147276: {'lr': 4.1811740497815777e-07, 'samples': 28276992, 'steps': 147275, 'loss/train': 1.1889567375183105} 11/07/2021 17:57:41 - INFO - __main__ - Step 147277: {'lr': 4.1781067161594797e-07, 'samples': 28277184, 'steps': 147276, 'loss/train': 1.2510809898376465} 11/07/2021 17:57:41 - INFO - __main__ - Step 147278: {'lr': 4.175040507116423e-07, 'samples': 28277376, 'steps': 147277, 'loss/train': 1.243035078048706} 11/07/2021 17:57:42 - INFO - __main__ - Step 147279: {'lr': 4.171975422653518e-07, 'samples': 28277568, 'steps': 147278, 'loss/train': 1.331429123878479} 11/07/2021 17:57:43 - INFO - __main__ - Step 147280: {'lr': 4.1689114627721514e-07, 'samples': 28277760, 'steps': 147279, 'loss/train': 1.2797867059707642} 11/07/2021 17:57:43 - INFO - __main__ - Step 147281: {'lr': 4.1658486274737115e-07, 'samples': 28277952, 'steps': 147280, 'loss/train': 1.096589207649231} 11/07/2021 17:57:43 - INFO - __main__ - Step 147282: {'lr': 4.162786916759587e-07, 'samples': 28278144, 'steps': 147281, 'loss/train': 1.3183817863464355} 11/07/2021 17:57:44 - INFO - __main__ - Step 147283: {'lr': 4.1597263306314414e-07, 'samples': 28278336, 'steps': 147282, 'loss/train': 1.8498485088348389} 11/07/2021 17:57:45 - INFO - __main__ - Step 147284: {'lr': 4.1566668690903863e-07, 'samples': 28278528, 'steps': 147283, 'loss/train': 1.4056215286254883} 11/07/2021 17:57:45 - INFO - __main__ - Step 147285: {'lr': 4.1536085321375316e-07, 'samples': 28278720, 'steps': 147284, 'loss/train': 1.1275925636291504} 11/07/2021 17:57:46 - INFO - __main__ - Step 147286: {'lr': 4.1505513197748204e-07, 'samples': 28278912, 'steps': 147285, 'loss/train': 1.4445428848266602} 11/07/2021 17:57:46 - INFO - __main__ - Step 147287: {'lr': 4.147495232003362e-07, 'samples': 28279104, 'steps': 147286, 'loss/train': 0.7900604009628296} 11/07/2021 17:57:46 - INFO - __main__ - Step 147288: {'lr': 4.144440268824268e-07, 'samples': 28279296, 'steps': 147287, 'loss/train': 1.2823150157928467} 11/07/2021 17:57:47 - INFO - __main__ - Step 147289: {'lr': 4.14138643023948e-07, 'samples': 28279488, 'steps': 147288, 'loss/train': 1.0099585056304932} 11/07/2021 17:57:48 - INFO - __main__ - Step 147290: {'lr': 4.1383337162498315e-07, 'samples': 28279680, 'steps': 147289, 'loss/train': 1.4819892644882202} 11/07/2021 17:57:48 - INFO - __main__ - Step 147291: {'lr': 4.1352821268569874e-07, 'samples': 28279872, 'steps': 147290, 'loss/train': 1.0360876321792603} 11/07/2021 17:57:48 - INFO - __main__ - Step 147292: {'lr': 4.1322316620623356e-07, 'samples': 28280064, 'steps': 147291, 'loss/train': 1.3049287796020508} 11/07/2021 17:57:49 - INFO - __main__ - Step 147293: {'lr': 4.129182321867264e-07, 'samples': 28280256, 'steps': 147292, 'loss/train': 1.008519172668457} 11/07/2021 17:57:50 - INFO - __main__ - Step 147294: {'lr': 4.126134106272883e-07, 'samples': 28280448, 'steps': 147293, 'loss/train': 1.4920508861541748} 11/07/2021 17:57:50 - INFO - __main__ - Step 147295: {'lr': 4.123087015280857e-07, 'samples': 28280640, 'steps': 147294, 'loss/train': 0.7433518171310425} 11/07/2021 17:57:51 - INFO - __main__ - Step 147296: {'lr': 4.1200410488925753e-07, 'samples': 28280832, 'steps': 147295, 'loss/train': 1.334334135055542} 11/07/2021 17:57:51 - INFO - __main__ - Step 147297: {'lr': 4.116996207109147e-07, 'samples': 28281024, 'steps': 147296, 'loss/train': 1.4348912239074707} 11/07/2021 17:57:51 - INFO - __main__ - Step 147298: {'lr': 4.11395248993196e-07, 'samples': 28281216, 'steps': 147297, 'loss/train': 1.052781343460083} 11/07/2021 17:57:52 - INFO - __main__ - Step 147299: {'lr': 4.1109098973626804e-07, 'samples': 28281408, 'steps': 147298, 'loss/train': 1.3469798564910889} 11/07/2021 17:57:53 - INFO - __main__ - Step 147300: {'lr': 4.107868429402417e-07, 'samples': 28281600, 'steps': 147299, 'loss/train': 0.885260820388794} 11/07/2021 17:57:53 - INFO - __main__ - Step 147301: {'lr': 4.1048280860528363e-07, 'samples': 28281792, 'steps': 147300, 'loss/train': 1.1132062673568726} 11/07/2021 17:57:53 - INFO - __main__ - Step 147302: {'lr': 4.101788867315048e-07, 'samples': 28281984, 'steps': 147301, 'loss/train': 1.2816517353057861} 11/07/2021 17:57:54 - INFO - __main__ - Step 147303: {'lr': 4.0987507731904406e-07, 'samples': 28282176, 'steps': 147302, 'loss/train': 1.5372240543365479} 11/07/2021 17:57:55 - INFO - __main__ - Step 147304: {'lr': 4.095713803680123e-07, 'samples': 28282368, 'steps': 147303, 'loss/train': 1.2701653242111206} 11/07/2021 17:57:55 - INFO - __main__ - Step 147305: {'lr': 4.09267795878604e-07, 'samples': 28282560, 'steps': 147304, 'loss/train': 1.098244547843933} 11/07/2021 17:57:55 - INFO - __main__ - Step 147306: {'lr': 4.0896432385093e-07, 'samples': 28282752, 'steps': 147305, 'loss/train': 0.9326373934745789} 11/07/2021 17:57:56 - INFO - __main__ - Step 147307: {'lr': 4.086609642851291e-07, 'samples': 28282944, 'steps': 147306, 'loss/train': 1.1995899677276611} 11/07/2021 17:57:56 - INFO - __main__ - Step 147308: {'lr': 4.083577171813402e-07, 'samples': 28283136, 'steps': 147307, 'loss/train': 1.5522445440292358} 11/07/2021 17:57:57 - INFO - __main__ - Step 147309: {'lr': 4.080545825396742e-07, 'samples': 28283328, 'steps': 147308, 'loss/train': 0.9009713530540466} 11/07/2021 17:57:57 - INFO - __main__ - Step 147310: {'lr': 4.077515603602977e-07, 'samples': 28283520, 'steps': 147309, 'loss/train': 0.6175951361656189} 11/07/2021 17:57:58 - INFO - __main__ - Step 147311: {'lr': 4.074486506433217e-07, 'samples': 28283712, 'steps': 147310, 'loss/train': 1.2170215845108032} 11/07/2021 17:57:58 - INFO - __main__ - Step 147312: {'lr': 4.071458533889127e-07, 'samples': 28283904, 'steps': 147311, 'loss/train': 0.7178853750228882} 11/07/2021 17:57:59 - INFO - __main__ - Step 147313: {'lr': 4.0684316859718185e-07, 'samples': 28284096, 'steps': 147312, 'loss/train': 0.4127618968486786} 11/07/2021 17:57:59 - INFO - __main__ - Step 147314: {'lr': 4.0654059626829555e-07, 'samples': 28284288, 'steps': 147313, 'loss/train': 1.2576231956481934} 11/07/2021 17:58:00 - INFO - __main__ - Step 147315: {'lr': 4.062381364023371e-07, 'samples': 28284480, 'steps': 147314, 'loss/train': 1.0186593532562256} 11/07/2021 17:58:01 - INFO - __main__ - Step 147316: {'lr': 4.059357889995008e-07, 'samples': 28284672, 'steps': 147315, 'loss/train': 1.442527174949646} 11/07/2021 17:58:01 - INFO - __main__ - Step 147317: {'lr': 4.0563355405989766e-07, 'samples': 28284864, 'steps': 147316, 'loss/train': 1.2040868997573853} 11/07/2021 17:58:01 - INFO - __main__ - Step 147318: {'lr': 4.0533143158366647e-07, 'samples': 28285056, 'steps': 147317, 'loss/train': 0.3149068355560303} 11/07/2021 17:58:02 - INFO - __main__ - Step 147319: {'lr': 4.0502942157094604e-07, 'samples': 28285248, 'steps': 147318, 'loss/train': 0.8131262063980103} 11/07/2021 17:58:03 - INFO - __main__ - Step 147320: {'lr': 4.047275240218473e-07, 'samples': 28285440, 'steps': 147319, 'loss/train': 1.3820276260375977} 11/07/2021 17:58:03 - INFO - __main__ - Step 147321: {'lr': 4.044257389365369e-07, 'samples': 28285632, 'steps': 147320, 'loss/train': 1.0746140480041504} 11/07/2021 17:58:04 - INFO - __main__ - Step 147322: {'lr': 4.041240663151535e-07, 'samples': 28285824, 'steps': 147321, 'loss/train': 1.3363059759140015} 11/07/2021 17:58:04 - INFO - __main__ - Step 147323: {'lr': 4.0382250615780823e-07, 'samples': 28286016, 'steps': 147322, 'loss/train': 1.1089190244674683} 11/07/2021 17:58:04 - INFO - __main__ - Step 147324: {'lr': 4.0352105846463985e-07, 'samples': 28286208, 'steps': 147323, 'loss/train': 1.074535608291626} 11/07/2021 17:58:06 - INFO - __main__ - Step 147325: {'lr': 4.032197232358148e-07, 'samples': 28286400, 'steps': 147324, 'loss/train': 1.5640813112258911} 11/07/2021 17:58:06 - INFO - __main__ - Step 147326: {'lr': 4.029185004714164e-07, 'samples': 28286592, 'steps': 147325, 'loss/train': 1.2201787233352661} 11/07/2021 17:58:07 - INFO - __main__ - Step 147327: {'lr': 4.02617390171639e-07, 'samples': 28286784, 'steps': 147326, 'loss/train': 1.4307149648666382} 11/07/2021 17:58:07 - INFO - __main__ - Step 147328: {'lr': 4.0231639233659354e-07, 'samples': 28286976, 'steps': 147327, 'loss/train': 1.731792688369751} 11/07/2021 17:58:07 - INFO - __main__ - Step 147329: {'lr': 4.020155069663911e-07, 'samples': 28287168, 'steps': 147328, 'loss/train': 1.6871947050094604} 11/07/2021 17:58:08 - INFO - __main__ - Step 147330: {'lr': 4.0171473406119817e-07, 'samples': 28287360, 'steps': 147329, 'loss/train': 1.6657898426055908} 11/07/2021 17:58:08 - INFO - __main__ - Step 147331: {'lr': 4.014140736211258e-07, 'samples': 28287552, 'steps': 147330, 'loss/train': 1.3483388423919678} 11/07/2021 17:58:09 - INFO - __main__ - Step 147332: {'lr': 4.011135256463405e-07, 'samples': 28287744, 'steps': 147331, 'loss/train': 0.5322777628898621} 11/07/2021 17:58:10 - INFO - __main__ - Step 147333: {'lr': 4.008130901369811e-07, 'samples': 28287936, 'steps': 147332, 'loss/train': 1.2714864015579224} 11/07/2021 17:58:10 - INFO - __main__ - Step 147334: {'lr': 4.0051276709315855e-07, 'samples': 28288128, 'steps': 147333, 'loss/train': 1.1317789554595947} 11/07/2021 17:58:10 - INFO - __main__ - Step 147335: {'lr': 4.0021255651498387e-07, 'samples': 28288320, 'steps': 147334, 'loss/train': 1.0245035886764526} 11/07/2021 17:58:11 - INFO - __main__ - Step 147336: {'lr': 3.9991245840265147e-07, 'samples': 28288512, 'steps': 147335, 'loss/train': 1.463494896888733} 11/07/2021 17:58:12 - INFO - __main__ - Step 147337: {'lr': 3.9961247275624445e-07, 'samples': 28288704, 'steps': 147336, 'loss/train': 1.0668882131576538} 11/07/2021 17:58:12 - INFO - __main__ - Step 147338: {'lr': 3.9931259957592946e-07, 'samples': 28288896, 'steps': 147337, 'loss/train': 0.926088809967041} 11/07/2021 17:58:12 - INFO - __main__ - Step 147339: {'lr': 3.990128388618175e-07, 'samples': 28289088, 'steps': 147338, 'loss/train': 1.4250178337097168} 11/07/2021 17:58:13 - INFO - __main__ - Step 147340: {'lr': 3.987131906140751e-07, 'samples': 28289280, 'steps': 147339, 'loss/train': 1.1962558031082153} 11/07/2021 17:58:13 - INFO - __main__ - Step 147341: {'lr': 3.9841365483284096e-07, 'samples': 28289472, 'steps': 147340, 'loss/train': 1.0231572389602661} 11/07/2021 17:58:14 - INFO - __main__ - Step 147342: {'lr': 3.9811423151819846e-07, 'samples': 28289664, 'steps': 147341, 'loss/train': 1.0165395736694336} 11/07/2021 17:58:15 - INFO - __main__ - Step 147343: {'lr': 3.9781492067031413e-07, 'samples': 28289856, 'steps': 147342, 'loss/train': 1.1679595708847046} 11/07/2021 17:58:15 - INFO - __main__ - Step 147344: {'lr': 3.975157222893266e-07, 'samples': 28290048, 'steps': 147343, 'loss/train': 1.626948356628418} 11/07/2021 17:58:15 - INFO - __main__ - Step 147345: {'lr': 3.9721663637537486e-07, 'samples': 28290240, 'steps': 147344, 'loss/train': 1.1306854486465454} 11/07/2021 17:58:16 - INFO - __main__ - Step 147346: {'lr': 3.9691766292859753e-07, 'samples': 28290432, 'steps': 147345, 'loss/train': 1.249173641204834} 11/07/2021 17:58:17 - INFO - __main__ - Step 147347: {'lr': 3.966188019491057e-07, 'samples': 28290624, 'steps': 147346, 'loss/train': 0.8226308226585388} 11/07/2021 17:58:17 - INFO - __main__ - Step 147348: {'lr': 3.9632005343703816e-07, 'samples': 28290816, 'steps': 147347, 'loss/train': 1.1741011142730713} 11/07/2021 17:58:18 - INFO - __main__ - Step 147349: {'lr': 3.9602141739256136e-07, 'samples': 28291008, 'steps': 147348, 'loss/train': 1.4828258752822876} 11/07/2021 17:58:18 - INFO - __main__ - Step 147350: {'lr': 3.9572289381575865e-07, 'samples': 28291200, 'steps': 147349, 'loss/train': 1.2425472736358643} 11/07/2021 17:58:18 - INFO - __main__ - Step 147351: {'lr': 3.954244827067965e-07, 'samples': 28291392, 'steps': 147350, 'loss/train': 1.4518084526062012} 11/07/2021 17:58:19 - INFO - __main__ - Step 147352: {'lr': 3.951261840658138e-07, 'samples': 28291584, 'steps': 147351, 'loss/train': 1.9742943048477173} 11/07/2021 17:58:20 - INFO - __main__ - Step 147353: {'lr': 3.9482799789292147e-07, 'samples': 28291776, 'steps': 147352, 'loss/train': 1.1825953722000122} 11/07/2021 17:58:20 - INFO - __main__ - Step 147354: {'lr': 3.94529924188286e-07, 'samples': 28291968, 'steps': 147353, 'loss/train': 1.6246528625488281} 11/07/2021 17:58:20 - INFO - __main__ - Step 147355: {'lr': 3.942319629520186e-07, 'samples': 28292160, 'steps': 147354, 'loss/train': 1.5849599838256836} 11/07/2021 17:58:21 - INFO - __main__ - Step 147356: {'lr': 3.9393411418425786e-07, 'samples': 28292352, 'steps': 147355, 'loss/train': 0.9376751184463501} 11/07/2021 17:58:21 - INFO - __main__ - Step 147357: {'lr': 3.9363637788511487e-07, 'samples': 28292544, 'steps': 147356, 'loss/train': 1.3757119178771973} 11/07/2021 17:58:22 - INFO - __main__ - Step 147358: {'lr': 3.933387540547839e-07, 'samples': 28292736, 'steps': 147357, 'loss/train': 1.3994148969650269} 11/07/2021 17:58:23 - INFO - __main__ - Step 147359: {'lr': 3.930412426933483e-07, 'samples': 28292928, 'steps': 147358, 'loss/train': 1.2257760763168335} 11/07/2021 17:58:23 - INFO - __main__ - Step 147360: {'lr': 3.9274384380094676e-07, 'samples': 28293120, 'steps': 147359, 'loss/train': 1.0066938400268555} 11/07/2021 17:58:23 - INFO - __main__ - Step 147361: {'lr': 3.9244655737774583e-07, 'samples': 28293312, 'steps': 147360, 'loss/train': 1.2138792276382446} 11/07/2021 17:58:24 - INFO - __main__ - Step 147362: {'lr': 3.921493834238288e-07, 'samples': 28293504, 'steps': 147361, 'loss/train': 0.6717219352722168} 11/07/2021 17:58:25 - INFO - __main__ - Step 147363: {'lr': 3.9185232193936214e-07, 'samples': 28293696, 'steps': 147362, 'loss/train': 1.0411624908447266} 11/07/2021 17:58:25 - INFO - __main__ - Step 147364: {'lr': 3.9155537292448473e-07, 'samples': 28293888, 'steps': 147363, 'loss/train': 1.3568195104599} 11/07/2021 17:58:25 - INFO - __main__ - Step 147365: {'lr': 3.912585363793353e-07, 'samples': 28294080, 'steps': 147364, 'loss/train': 1.122069001197815} 11/07/2021 17:58:26 - INFO - __main__ - Step 147366: {'lr': 3.9096181230402486e-07, 'samples': 28294272, 'steps': 147365, 'loss/train': 1.109170913696289} 11/07/2021 17:58:26 - INFO - __main__ - Step 147367: {'lr': 3.9066520069869217e-07, 'samples': 28294464, 'steps': 147366, 'loss/train': 1.1404656171798706} 11/07/2021 17:58:27 - INFO - __main__ - Step 147368: {'lr': 3.903687015634483e-07, 'samples': 28294656, 'steps': 147367, 'loss/train': 1.4265810251235962} 11/07/2021 17:58:27 - INFO - __main__ - Step 147369: {'lr': 3.900723148984875e-07, 'samples': 28294848, 'steps': 147368, 'loss/train': 1.1860474348068237} 11/07/2021 17:58:28 - INFO - __main__ - Step 147370: {'lr': 3.8977604070389303e-07, 'samples': 28295040, 'steps': 147369, 'loss/train': 0.824448823928833} 11/07/2021 17:58:28 - INFO - __main__ - Step 147371: {'lr': 3.8947987897980377e-07, 'samples': 28295232, 'steps': 147370, 'loss/train': 1.0294963121414185} 11/07/2021 17:58:28 - INFO - __main__ - Step 147372: {'lr': 3.891838297263861e-07, 'samples': 28295424, 'steps': 147371, 'loss/train': 0.8882268071174622} 11/07/2021 17:58:30 - INFO - __main__ - Step 147373: {'lr': 3.8888789294375115e-07, 'samples': 28295616, 'steps': 147372, 'loss/train': 1.5994216203689575} 11/07/2021 17:58:30 - INFO - __main__ - Step 147374: {'lr': 3.885920686320099e-07, 'samples': 28295808, 'steps': 147373, 'loss/train': 1.4526385068893433} 11/07/2021 17:58:30 - INFO - __main__ - Step 147375: {'lr': 3.882963567913289e-07, 'samples': 28296000, 'steps': 147374, 'loss/train': 1.1114506721496582} 11/07/2021 17:58:31 - INFO - __main__ - Step 147376: {'lr': 3.8800075742184695e-07, 'samples': 28296192, 'steps': 147375, 'loss/train': 1.0333302021026611} 11/07/2021 17:58:31 - INFO - __main__ - Step 147377: {'lr': 3.877052705236472e-07, 'samples': 28296384, 'steps': 147376, 'loss/train': 0.9902777075767517} 11/07/2021 17:58:32 - INFO - __main__ - Step 147378: {'lr': 3.874098960969241e-07, 'samples': 28296576, 'steps': 147377, 'loss/train': 1.3450571298599243} 11/07/2021 17:58:32 - INFO - __main__ - Step 147379: {'lr': 3.8711463414176087e-07, 'samples': 28296768, 'steps': 147378, 'loss/train': 1.5434422492980957} 11/07/2021 17:58:33 - INFO - __main__ - Step 147380: {'lr': 3.8681948465832396e-07, 'samples': 28296960, 'steps': 147379, 'loss/train': 1.3923137187957764} 11/07/2021 17:58:33 - INFO - __main__ - Step 147381: {'lr': 3.8652444764672446e-07, 'samples': 28297152, 'steps': 147380, 'loss/train': 1.1048294305801392} 11/07/2021 17:58:33 - INFO - __main__ - Step 147382: {'lr': 3.862295231071011e-07, 'samples': 28297344, 'steps': 147381, 'loss/train': 1.1106390953063965} 11/07/2021 17:58:35 - INFO - __main__ - Step 147383: {'lr': 3.859347110396205e-07, 'samples': 28297536, 'steps': 147382, 'loss/train': 1.1966850757598877} 11/07/2021 17:58:35 - INFO - __main__ - Step 147384: {'lr': 3.856400114443659e-07, 'samples': 28297728, 'steps': 147383, 'loss/train': 1.0863401889801025} 11/07/2021 17:58:35 - INFO - __main__ - Step 147385: {'lr': 3.85345424321476e-07, 'samples': 28297920, 'steps': 147384, 'loss/train': 1.0872552394866943} 11/07/2021 17:58:36 - INFO - __main__ - Step 147386: {'lr': 3.850509496711174e-07, 'samples': 28298112, 'steps': 147385, 'loss/train': 0.7891728281974792} 11/07/2021 17:58:36 - INFO - __main__ - Step 147387: {'lr': 3.8475658749340116e-07, 'samples': 28298304, 'steps': 147386, 'loss/train': 1.2820509672164917} 11/07/2021 17:58:37 - INFO - __main__ - Step 147388: {'lr': 3.8446233778846596e-07, 'samples': 28298496, 'steps': 147387, 'loss/train': 0.9710532426834106} 11/07/2021 17:58:37 - INFO - __main__ - Step 147389: {'lr': 3.841682005564229e-07, 'samples': 28298688, 'steps': 147388, 'loss/train': 0.6533268094062805} 11/07/2021 17:58:38 - INFO - __main__ - Step 147390: {'lr': 3.8387417579743844e-07, 'samples': 28298880, 'steps': 147389, 'loss/train': 0.9934215545654297} 11/07/2021 17:58:38 - INFO - __main__ - Step 147391: {'lr': 3.835802635116237e-07, 'samples': 28299072, 'steps': 147390, 'loss/train': 1.1838154792785645} 11/07/2021 17:58:39 - INFO - __main__ - Step 147392: {'lr': 3.8328646369911735e-07, 'samples': 28299264, 'steps': 147391, 'loss/train': 1.2227534055709839} 11/07/2021 17:58:39 - INFO - __main__ - Step 147393: {'lr': 3.829927763600305e-07, 'samples': 28299456, 'steps': 147392, 'loss/train': 0.9666306376457214} 11/07/2021 17:58:40 - INFO - __main__ - Step 147394: {'lr': 3.826992014945296e-07, 'samples': 28299648, 'steps': 147393, 'loss/train': 1.0827728509902954} 11/07/2021 17:58:40 - INFO - __main__ - Step 147395: {'lr': 3.8240573910275356e-07, 'samples': 28299840, 'steps': 147394, 'loss/train': 1.4551172256469727} 11/07/2021 17:58:41 - INFO - __main__ - Step 147396: {'lr': 3.8211238918478554e-07, 'samples': 28300032, 'steps': 147395, 'loss/train': 1.3771206140518188} 11/07/2021 17:58:41 - INFO - __main__ - Step 147397: {'lr': 3.8181915174079204e-07, 'samples': 28300224, 'steps': 147396, 'loss/train': 0.8638104796409607} 11/07/2021 17:58:41 - INFO - __main__ - Step 147398: {'lr': 3.81526026770912e-07, 'samples': 28300416, 'steps': 147397, 'loss/train': 1.6034748554229736} 11/07/2021 17:58:42 - INFO - __main__ - Step 147399: {'lr': 3.8123301427525623e-07, 'samples': 28300608, 'steps': 147398, 'loss/train': 0.7367762327194214} 11/07/2021 17:58:43 - INFO - __main__ - Step 147400: {'lr': 3.809401142539637e-07, 'samples': 28300800, 'steps': 147399, 'loss/train': 1.2891789674758911} 11/07/2021 17:58:43 - INFO - __main__ - Step 147401: {'lr': 3.8064732670717304e-07, 'samples': 28300992, 'steps': 147400, 'loss/train': 1.4230722188949585} 11/07/2021 17:58:43 - INFO - __main__ - Step 147402: {'lr': 3.8035465163499537e-07, 'samples': 28301184, 'steps': 147401, 'loss/train': 1.3631864786148071} 11/07/2021 17:58:44 - INFO - __main__ - Step 147403: {'lr': 3.800620890375972e-07, 'samples': 28301376, 'steps': 147402, 'loss/train': 1.339385747909546} 11/07/2021 17:58:45 - INFO - __main__ - Step 147404: {'lr': 3.7976963891508953e-07, 'samples': 28301568, 'steps': 147403, 'loss/train': 1.2226908206939697} 11/07/2021 17:58:45 - INFO - __main__ - Step 147405: {'lr': 3.794773012675834e-07, 'samples': 28301760, 'steps': 147404, 'loss/train': 1.4542431831359863} 11/07/2021 17:58:45 - INFO - __main__ - Step 147406: {'lr': 3.7918507609524534e-07, 'samples': 28301952, 'steps': 147405, 'loss/train': 0.9435673356056213} 11/07/2021 17:58:46 - INFO - __main__ - Step 147407: {'lr': 3.788929633982141e-07, 'samples': 28302144, 'steps': 147406, 'loss/train': 1.3085505962371826} 11/07/2021 17:58:46 - INFO - __main__ - Step 147408: {'lr': 3.78600963176573e-07, 'samples': 28302336, 'steps': 147407, 'loss/train': 1.0544641017913818} 11/07/2021 17:58:47 - INFO - __main__ - Step 147409: {'lr': 3.7830907543048854e-07, 'samples': 28302528, 'steps': 147408, 'loss/train': 1.4312697649002075} 11/07/2021 17:58:48 - INFO - __main__ - Step 147410: {'lr': 3.780173001600995e-07, 'samples': 28302720, 'steps': 147409, 'loss/train': 1.143829107284546} 11/07/2021 17:58:48 - INFO - __main__ - Step 147411: {'lr': 3.777256373655169e-07, 'samples': 28302912, 'steps': 147410, 'loss/train': 1.1288037300109863} 11/07/2021 17:58:48 - INFO - __main__ - Step 147412: {'lr': 3.7743408704687953e-07, 'samples': 28303104, 'steps': 147411, 'loss/train': 1.3053563833236694} 11/07/2021 17:58:49 - INFO - __main__ - Step 147413: {'lr': 3.771426492043262e-07, 'samples': 28303296, 'steps': 147412, 'loss/train': 1.45244562625885} 11/07/2021 17:58:50 - INFO - __main__ - Step 147414: {'lr': 3.768513238379678e-07, 'samples': 28303488, 'steps': 147413, 'loss/train': 1.3781095743179321} 11/07/2021 17:58:50 - INFO - __main__ - Step 147415: {'lr': 3.7656011094794327e-07, 'samples': 28303680, 'steps': 147414, 'loss/train': 1.2075306177139282} 11/07/2021 17:58:50 - INFO - __main__ - Step 147416: {'lr': 3.762690105343913e-07, 'samples': 28303872, 'steps': 147415, 'loss/train': 1.2857071161270142} 11/07/2021 17:58:51 - INFO - __main__ - Step 147417: {'lr': 3.7597802259745075e-07, 'samples': 28304064, 'steps': 147416, 'loss/train': 1.3845674991607666} 11/07/2021 17:58:51 - INFO - __main__ - Step 147418: {'lr': 3.756871471372325e-07, 'samples': 28304256, 'steps': 147417, 'loss/train': 1.3278506994247437} 11/07/2021 17:58:52 - INFO - __main__ - Step 147419: {'lr': 3.753963841539032e-07, 'samples': 28304448, 'steps': 147418, 'loss/train': 1.5115771293640137} 11/07/2021 17:58:53 - INFO - __main__ - Step 147420: {'lr': 3.7510573364754606e-07, 'samples': 28304640, 'steps': 147419, 'loss/train': 1.3215324878692627} 11/07/2021 17:58:53 - INFO - __main__ - Step 147421: {'lr': 3.7481519561832765e-07, 'samples': 28304832, 'steps': 147420, 'loss/train': 1.4287058115005493} 11/07/2021 17:58:53 - INFO - __main__ - Step 147422: {'lr': 3.7452477006633124e-07, 'samples': 28305024, 'steps': 147421, 'loss/train': 1.3713163137435913} 11/07/2021 17:58:54 - INFO - __main__ - Step 147423: {'lr': 3.7423445699175107e-07, 'samples': 28305216, 'steps': 147422, 'loss/train': 1.2688161134719849} 11/07/2021 17:58:54 - INFO - __main__ - Step 147424: {'lr': 3.739442563946982e-07, 'samples': 28305408, 'steps': 147423, 'loss/train': 1.198217749595642} 11/07/2021 17:58:56 - INFO - __main__ - Step 147425: {'lr': 3.7365416827528364e-07, 'samples': 28305600, 'steps': 147424, 'loss/train': 1.1799134016036987} 11/07/2021 17:58:56 - INFO - __main__ - Step 147426: {'lr': 3.733641926336462e-07, 'samples': 28305792, 'steps': 147425, 'loss/train': 1.6403871774673462} 11/07/2021 17:58:56 - INFO - __main__ - Step 147427: {'lr': 3.7307432946989685e-07, 'samples': 28305984, 'steps': 147426, 'loss/train': 1.704954981803894} 11/07/2021 17:58:57 - INFO - __main__ - Step 147428: {'lr': 3.7278457878422987e-07, 'samples': 28306176, 'steps': 147427, 'loss/train': 1.7092797756195068} 11/07/2021 17:58:57 - INFO - __main__ - Step 147429: {'lr': 3.7249494057670084e-07, 'samples': 28306368, 'steps': 147428, 'loss/train': 0.13262306153774261} 11/07/2021 17:58:57 - INFO - __main__ - Step 147430: {'lr': 3.7220541484747627e-07, 'samples': 28306560, 'steps': 147429, 'loss/train': 1.1583088636398315} 11/07/2021 17:58:59 - INFO - __main__ - Step 147431: {'lr': 3.719160015966949e-07, 'samples': 28306752, 'steps': 147430, 'loss/train': 1.4703571796417236} 11/07/2021 17:58:59 - INFO - __main__ - Step 147432: {'lr': 3.7162670082449556e-07, 'samples': 28306944, 'steps': 147431, 'loss/train': 1.0771225690841675} 11/07/2021 17:59:00 - INFO - __main__ - Step 147433: {'lr': 3.713375125309615e-07, 'samples': 28307136, 'steps': 147432, 'loss/train': 1.8862223625183105} 11/07/2021 17:59:00 - INFO - __main__ - Step 147434: {'lr': 3.710484367162592e-07, 'samples': 28307328, 'steps': 147433, 'loss/train': 1.402703046798706} 11/07/2021 17:59:00 - INFO - __main__ - Step 147435: {'lr': 3.7075947338049976e-07, 'samples': 28307520, 'steps': 147434, 'loss/train': 0.6992180347442627} 11/07/2021 17:59:01 - INFO - __main__ - Step 147436: {'lr': 3.704706225238497e-07, 'samples': 28307712, 'steps': 147435, 'loss/train': 1.2722135782241821} 11/07/2021 17:59:02 - INFO - __main__ - Step 147437: {'lr': 3.701818841463922e-07, 'samples': 28307904, 'steps': 147436, 'loss/train': 1.960469126701355} 11/07/2021 17:59:02 - INFO - __main__ - Step 147438: {'lr': 3.698932582482939e-07, 'samples': 28308096, 'steps': 147437, 'loss/train': 1.3502511978149414} 11/07/2021 17:59:02 - INFO - __main__ - Step 147439: {'lr': 3.69604744829638e-07, 'samples': 28308288, 'steps': 147438, 'loss/train': 0.9740794897079468} 11/07/2021 17:59:03 - INFO - __main__ - Step 147440: {'lr': 3.693163438906189e-07, 'samples': 28308480, 'steps': 147439, 'loss/train': 1.372098684310913} 11/07/2021 17:59:03 - INFO - __main__ - Step 147441: {'lr': 3.6902805543131966e-07, 'samples': 28308672, 'steps': 147440, 'loss/train': 1.4279335737228394} 11/07/2021 17:59:04 - INFO - __main__ - Step 147442: {'lr': 3.6873987945187926e-07, 'samples': 28308864, 'steps': 147441, 'loss/train': 1.3649640083312988} 11/07/2021 17:59:04 - INFO - __main__ - Step 147443: {'lr': 3.6845181595246416e-07, 'samples': 28309056, 'steps': 147442, 'loss/train': 1.49618661403656} 11/07/2021 17:59:05 - INFO - __main__ - Step 147444: {'lr': 3.681638649331298e-07, 'samples': 28309248, 'steps': 147443, 'loss/train': 0.8299186825752258} 11/07/2021 17:59:05 - INFO - __main__ - Step 147445: {'lr': 3.678760263940706e-07, 'samples': 28309440, 'steps': 147444, 'loss/train': 1.187844157218933} 11/07/2021 17:59:06 - INFO - __main__ - Step 147446: {'lr': 3.6758830033539746e-07, 'samples': 28309632, 'steps': 147445, 'loss/train': 0.8729736804962158} 11/07/2021 17:59:06 - INFO - __main__ - Step 147447: {'lr': 3.6730068675722153e-07, 'samples': 28309824, 'steps': 147446, 'loss/train': 1.085397720336914} 11/07/2021 17:59:07 - INFO - __main__ - Step 147448: {'lr': 3.670131856597092e-07, 'samples': 28310016, 'steps': 147447, 'loss/train': 1.0850006341934204} 11/07/2021 17:59:07 - INFO - __main__ - Step 147449: {'lr': 3.6672579704297157e-07, 'samples': 28310208, 'steps': 147448, 'loss/train': 1.3300055265426636} 11/07/2021 17:59:08 - INFO - __main__ - Step 147450: {'lr': 3.664385209071197e-07, 'samples': 28310400, 'steps': 147449, 'loss/train': 1.1453102827072144} 11/07/2021 17:59:08 - INFO - __main__ - Step 147451: {'lr': 3.6615135725229234e-07, 'samples': 28310592, 'steps': 147450, 'loss/train': 1.4062467813491821} 11/07/2021 17:59:09 - INFO - __main__ - Step 147452: {'lr': 3.658643060786282e-07, 'samples': 28310784, 'steps': 147451, 'loss/train': 0.9236039519309998} 11/07/2021 17:59:09 - INFO - __main__ - Step 147453: {'lr': 3.6557736738626613e-07, 'samples': 28310976, 'steps': 147452, 'loss/train': 1.4721955060958862} 11/07/2021 17:59:10 - INFO - __main__ - Step 147454: {'lr': 3.6529054117531715e-07, 'samples': 28311168, 'steps': 147453, 'loss/train': 1.3436362743377686} 11/07/2021 17:59:10 - INFO - __main__ - Step 147455: {'lr': 3.6500382744589224e-07, 'samples': 28311360, 'steps': 147454, 'loss/train': 1.574283480644226} 11/07/2021 17:59:10 - INFO - __main__ - Step 147456: {'lr': 3.647172261981857e-07, 'samples': 28311552, 'steps': 147455, 'loss/train': 1.6011779308319092} 11/07/2021 17:59:12 - INFO - __main__ - Step 147457: {'lr': 3.644307374322531e-07, 'samples': 28311744, 'steps': 147456, 'loss/train': 1.4605882167816162} 11/07/2021 17:59:12 - INFO - __main__ - Step 147458: {'lr': 3.641443611482609e-07, 'samples': 28311936, 'steps': 147457, 'loss/train': 0.31097105145454407} 11/07/2021 17:59:12 - INFO - __main__ - Step 147459: {'lr': 3.638580973463479e-07, 'samples': 28312128, 'steps': 147458, 'loss/train': 1.3706406354904175} 11/07/2021 17:59:13 - INFO - __main__ - Step 147460: {'lr': 3.6357194602662514e-07, 'samples': 28312320, 'steps': 147459, 'loss/train': 0.217850923538208} 11/07/2021 17:59:13 - INFO - __main__ - Step 147461: {'lr': 3.632859071892314e-07, 'samples': 28312512, 'steps': 147460, 'loss/train': 1.2130670547485352} 11/07/2021 17:59:14 - INFO - __main__ - Step 147462: {'lr': 3.629999808342777e-07, 'samples': 28312704, 'steps': 147461, 'loss/train': 1.0295026302337646} 11/07/2021 17:59:15 - INFO - __main__ - Step 147463: {'lr': 3.6271416696190276e-07, 'samples': 28312896, 'steps': 147462, 'loss/train': 1.3462470769882202} 11/07/2021 17:59:15 - INFO - __main__ - Step 147464: {'lr': 3.6242846557221765e-07, 'samples': 28313088, 'steps': 147463, 'loss/train': 0.8068687915802002} 11/07/2021 17:59:15 - INFO - __main__ - Step 147465: {'lr': 3.621428766654167e-07, 'samples': 28313280, 'steps': 147464, 'loss/train': 1.433380126953125} 11/07/2021 17:59:16 - INFO - __main__ - Step 147466: {'lr': 3.6185740024155535e-07, 'samples': 28313472, 'steps': 147465, 'loss/train': 1.037937879562378} 11/07/2021 17:59:16 - INFO - __main__ - Step 147467: {'lr': 3.615720363007724e-07, 'samples': 28313664, 'steps': 147466, 'loss/train': 1.1487256288528442} 11/07/2021 17:59:17 - INFO - __main__ - Step 147468: {'lr': 3.612867848432344e-07, 'samples': 28313856, 'steps': 147467, 'loss/train': 1.1753249168395996} 11/07/2021 17:59:18 - INFO - __main__ - Step 147469: {'lr': 3.6100164586905236e-07, 'samples': 28314048, 'steps': 147468, 'loss/train': 0.8893922567367554} 11/07/2021 17:59:18 - INFO - __main__ - Step 147470: {'lr': 3.607166193783373e-07, 'samples': 28314240, 'steps': 147469, 'loss/train': 1.1073110103607178} 11/07/2021 17:59:18 - INFO - __main__ - Step 147471: {'lr': 3.6043170537122807e-07, 'samples': 28314432, 'steps': 147470, 'loss/train': 1.9718769788742065} 11/07/2021 17:59:19 - INFO - __main__ - Step 147472: {'lr': 3.601469038478633e-07, 'samples': 28314624, 'steps': 147471, 'loss/train': 1.2497750520706177} 11/07/2021 17:59:20 - INFO - __main__ - Step 147473: {'lr': 3.5986221480838186e-07, 'samples': 28314816, 'steps': 147472, 'loss/train': 1.0152186155319214} 11/07/2021 17:59:20 - INFO - __main__ - Step 147474: {'lr': 3.59577638252867e-07, 'samples': 28315008, 'steps': 147473, 'loss/train': 1.1046732664108276} 11/07/2021 17:59:20 - INFO - __main__ - Step 147475: {'lr': 3.5929317418148534e-07, 'samples': 28315200, 'steps': 147474, 'loss/train': 1.1067121028900146} 11/07/2021 17:59:21 - INFO - __main__ - Step 147476: {'lr': 3.5900882259434777e-07, 'samples': 28315392, 'steps': 147475, 'loss/train': 1.262515902519226} 11/07/2021 17:59:21 - INFO - __main__ - Step 147477: {'lr': 3.5872458349162086e-07, 'samples': 28315584, 'steps': 147476, 'loss/train': 1.621567726135254} 11/07/2021 17:59:22 - INFO - __main__ - Step 147478: {'lr': 3.5844045687336015e-07, 'samples': 28315776, 'steps': 147477, 'loss/train': 1.3342819213867188} 11/07/2021 17:59:23 - INFO - __main__ - Step 147479: {'lr': 3.581564427397599e-07, 'samples': 28315968, 'steps': 147478, 'loss/train': 1.3531484603881836} 11/07/2021 17:59:23 - INFO - __main__ - Step 147480: {'lr': 3.578725410909034e-07, 'samples': 28316160, 'steps': 147479, 'loss/train': 1.4772089719772339} 11/07/2021 17:59:23 - INFO - __main__ - Step 147481: {'lr': 3.5758875192695716e-07, 'samples': 28316352, 'steps': 147480, 'loss/train': 1.1288831233978271} 11/07/2021 17:59:24 - INFO - __main__ - Step 147482: {'lr': 3.5730507524800446e-07, 'samples': 28316544, 'steps': 147481, 'loss/train': 1.1142823696136475} 11/07/2021 17:59:25 - INFO - __main__ - Step 147483: {'lr': 3.570215110542119e-07, 'samples': 28316736, 'steps': 147482, 'loss/train': 1.5244795083999634} 11/07/2021 17:59:25 - INFO - __main__ - Step 147484: {'lr': 3.5673805934569035e-07, 'samples': 28316928, 'steps': 147483, 'loss/train': 2.1232197284698486} 11/07/2021 17:59:25 - INFO - __main__ - Step 147485: {'lr': 3.5645472012257876e-07, 'samples': 28317120, 'steps': 147484, 'loss/train': 1.0922740697860718} 11/07/2021 17:59:26 - INFO - __main__ - Step 147486: {'lr': 3.56171493384988e-07, 'samples': 28317312, 'steps': 147485, 'loss/train': 1.1991249322891235} 11/07/2021 17:59:26 - INFO - __main__ - Step 147487: {'lr': 3.5588837913305695e-07, 'samples': 28317504, 'steps': 147486, 'loss/train': 1.0493371486663818} 11/07/2021 17:59:27 - INFO - __main__ - Step 147488: {'lr': 3.5560537736689656e-07, 'samples': 28317696, 'steps': 147487, 'loss/train': 1.3047112226486206} 11/07/2021 17:59:27 - INFO - __main__ - Step 147489: {'lr': 3.5532248808667345e-07, 'samples': 28317888, 'steps': 147488, 'loss/train': 1.5092411041259766} 11/07/2021 17:59:28 - INFO - __main__ - Step 147490: {'lr': 3.5503971129247083e-07, 'samples': 28318080, 'steps': 147489, 'loss/train': 1.3918830156326294} 11/07/2021 17:59:28 - INFO - __main__ - Step 147491: {'lr': 3.547570469844275e-07, 'samples': 28318272, 'steps': 147490, 'loss/train': 1.40947687625885} 11/07/2021 17:59:28 - INFO - __main__ - Step 147492: {'lr': 3.544744951626822e-07, 'samples': 28318464, 'steps': 147491, 'loss/train': 1.0067059993743896} 11/07/2021 17:59:30 - INFO - __main__ - Step 147493: {'lr': 3.541920558273737e-07, 'samples': 28318656, 'steps': 147492, 'loss/train': 0.6299952864646912} 11/07/2021 17:59:30 - INFO - __main__ - Step 147494: {'lr': 3.5390972897861305e-07, 'samples': 28318848, 'steps': 147493, 'loss/train': 1.3290951251983643} 11/07/2021 17:59:30 - INFO - __main__ - Step 147495: {'lr': 3.536275146165391e-07, 'samples': 28319040, 'steps': 147494, 'loss/train': 1.0608692169189453} 11/07/2021 17:59:31 - INFO - __main__ - Step 147496: {'lr': 3.53345412741235e-07, 'samples': 28319232, 'steps': 147495, 'loss/train': 1.422754168510437} 11/07/2021 17:59:31 - INFO - __main__ - Step 147497: {'lr': 3.530634233528951e-07, 'samples': 28319424, 'steps': 147496, 'loss/train': 1.1338615417480469} 11/07/2021 17:59:32 - INFO - __main__ - Step 147498: {'lr': 3.5278154645157487e-07, 'samples': 28319616, 'steps': 147497, 'loss/train': 1.2601567506790161} 11/07/2021 17:59:32 - INFO - __main__ - Step 147499: {'lr': 3.524997820374687e-07, 'samples': 28319808, 'steps': 147498, 'loss/train': 1.1589308977127075} 11/07/2021 17:59:33 - INFO - __main__ - Step 147500: {'lr': 3.522181301106597e-07, 'samples': 28320000, 'steps': 147499, 'loss/train': 1.5714513063430786} 11/07/2021 17:59:33 - INFO - __main__ - Step 147501: {'lr': 3.5193659067131456e-07, 'samples': 28320192, 'steps': 147500, 'loss/train': 1.2804608345031738} 11/07/2021 17:59:33 - INFO - __main__ - Step 147502: {'lr': 3.5165516371951647e-07, 'samples': 28320384, 'steps': 147501, 'loss/train': 0.7818114757537842} 11/07/2021 17:59:34 - INFO - __main__ - Step 147503: {'lr': 3.5137384925540414e-07, 'samples': 28320576, 'steps': 147502, 'loss/train': 1.0868186950683594} 11/07/2021 17:59:35 - INFO - __main__ - Step 147504: {'lr': 3.510926472791165e-07, 'samples': 28320768, 'steps': 147503, 'loss/train': 1.046654462814331} 11/07/2021 17:59:35 - INFO - __main__ - Step 147505: {'lr': 3.508115577907645e-07, 'samples': 28320960, 'steps': 147504, 'loss/train': 1.2949997186660767} 11/07/2021 17:59:35 - INFO - __main__ - Step 147506: {'lr': 3.505305807904868e-07, 'samples': 28321152, 'steps': 147505, 'loss/train': 1.0613142251968384} 11/07/2021 17:59:36 - INFO - __main__ - Step 147507: {'lr': 3.502497162784224e-07, 'samples': 28321344, 'steps': 147506, 'loss/train': 1.6525901556015015} 11/07/2021 17:59:36 - INFO - __main__ - Step 147508: {'lr': 3.4996896425468216e-07, 'samples': 28321536, 'steps': 147507, 'loss/train': 1.7395681142807007} 11/07/2021 17:59:37 - INFO - __main__ - Step 147509: {'lr': 3.4968832471937715e-07, 'samples': 28321728, 'steps': 147508, 'loss/train': 1.7386701107025146} 11/07/2021 17:59:38 - INFO - __main__ - Step 147510: {'lr': 3.494077976726462e-07, 'samples': 28321920, 'steps': 147509, 'loss/train': 1.218544602394104} 11/07/2021 17:59:38 - INFO - __main__ - Step 147511: {'lr': 3.4912738311465574e-07, 'samples': 28322112, 'steps': 147510, 'loss/train': 1.4083847999572754} 11/07/2021 17:59:38 - INFO - __main__ - Step 147512: {'lr': 3.4884708104546137e-07, 'samples': 28322304, 'steps': 147511, 'loss/train': 1.0240473747253418} 11/07/2021 17:59:39 - INFO - __main__ - Step 147513: {'lr': 3.4856689146522957e-07, 'samples': 28322496, 'steps': 147512, 'loss/train': 1.2528364658355713} 11/07/2021 17:59:40 - INFO - __main__ - Step 147514: {'lr': 3.4828681437409913e-07, 'samples': 28322688, 'steps': 147513, 'loss/train': 1.5545426607131958} 11/07/2021 17:59:40 - INFO - __main__ - Step 147515: {'lr': 3.4800684977215334e-07, 'samples': 28322880, 'steps': 147514, 'loss/train': 1.6527454853057861} 11/07/2021 17:59:40 - INFO - __main__ - Step 147516: {'lr': 3.4772699765955874e-07, 'samples': 28323072, 'steps': 147515, 'loss/train': 1.2251867055892944} 11/07/2021 17:59:41 - INFO - __main__ - Step 147517: {'lr': 3.474472580364263e-07, 'samples': 28323264, 'steps': 147516, 'loss/train': 1.4320582151412964} 11/07/2021 17:59:41 - INFO - __main__ - Step 147518: {'lr': 3.4716763090286707e-07, 'samples': 28323456, 'steps': 147517, 'loss/train': 0.10862717777490616} 11/07/2021 17:59:42 - INFO - __main__ - Step 147519: {'lr': 3.468881162590476e-07, 'samples': 28323648, 'steps': 147518, 'loss/train': 0.5313294529914856} 11/07/2021 17:59:43 - INFO - __main__ - Step 147520: {'lr': 3.4660871410505114e-07, 'samples': 28323840, 'steps': 147519, 'loss/train': 1.6986315250396729} 11/07/2021 17:59:43 - INFO - __main__ - Step 147521: {'lr': 3.4632942444101647e-07, 'samples': 28324032, 'steps': 147520, 'loss/train': 1.0351128578186035} 11/07/2021 17:59:43 - INFO - __main__ - Step 147522: {'lr': 3.4605024726708235e-07, 'samples': 28324224, 'steps': 147521, 'loss/train': 1.0959014892578125} 11/07/2021 17:59:44 - INFO - __main__ - Step 147523: {'lr': 3.457711825833598e-07, 'samples': 28324416, 'steps': 147522, 'loss/train': 1.715294361114502} 11/07/2021 17:59:45 - INFO - __main__ - Step 147524: {'lr': 3.454922303899877e-07, 'samples': 28324608, 'steps': 147523, 'loss/train': 1.4011616706848145} 11/07/2021 17:59:45 - INFO - __main__ - Step 147525: {'lr': 3.4521339068707694e-07, 'samples': 28324800, 'steps': 147524, 'loss/train': 0.5914286375045776} 11/07/2021 17:59:45 - INFO - __main__ - Step 147526: {'lr': 3.4493466347476634e-07, 'samples': 28324992, 'steps': 147525, 'loss/train': 1.3586838245391846} 11/07/2021 17:59:46 - INFO - __main__ - Step 147527: {'lr': 3.446560487531669e-07, 'samples': 28325184, 'steps': 147526, 'loss/train': 1.2317562103271484} 11/07/2021 17:59:46 - INFO - __main__ - Step 147528: {'lr': 3.443775465224175e-07, 'samples': 28325376, 'steps': 147527, 'loss/train': 1.3422805070877075} 11/07/2021 17:59:47 - INFO - __main__ - Step 147529: {'lr': 3.440991567826568e-07, 'samples': 28325568, 'steps': 147528, 'loss/train': 0.9515489339828491} 11/07/2021 17:59:47 - INFO - __main__ - Step 147530: {'lr': 3.438208795339681e-07, 'samples': 28325760, 'steps': 147529, 'loss/train': 1.4300553798675537} 11/07/2021 17:59:48 - INFO - __main__ - Step 147531: {'lr': 3.43542714776518e-07, 'samples': 28325952, 'steps': 147530, 'loss/train': 1.514756202697754} 11/07/2021 17:59:48 - INFO - __main__ - Step 147532: {'lr': 3.432646625103897e-07, 'samples': 28326144, 'steps': 147531, 'loss/train': 1.3938957452774048} 11/07/2021 17:59:48 - INFO - __main__ - Step 147533: {'lr': 3.4298672273577744e-07, 'samples': 28326336, 'steps': 147532, 'loss/train': 1.4323700666427612} 11/07/2021 17:59:50 - INFO - __main__ - Step 147534: {'lr': 3.427088954527369e-07, 'samples': 28326528, 'steps': 147533, 'loss/train': 0.8027241826057434} 11/07/2021 17:59:50 - INFO - __main__ - Step 147535: {'lr': 3.4243118066140665e-07, 'samples': 28326720, 'steps': 147534, 'loss/train': 1.231466293334961} 11/07/2021 17:59:50 - INFO - __main__ - Step 147536: {'lr': 3.4215357836195336e-07, 'samples': 28326912, 'steps': 147535, 'loss/train': 1.0073132514953613} 11/07/2021 17:59:51 - INFO - __main__ - Step 147537: {'lr': 3.4187608855443255e-07, 'samples': 28327104, 'steps': 147536, 'loss/train': 1.47052001953125} 11/07/2021 17:59:51 - INFO - __main__ - Step 147538: {'lr': 3.415987112390384e-07, 'samples': 28327296, 'steps': 147537, 'loss/train': 0.6738470792770386} 11/07/2021 17:59:52 - INFO - __main__ - Step 147539: {'lr': 3.4132144641588205e-07, 'samples': 28327488, 'steps': 147538, 'loss/train': 1.1317230463027954} 11/07/2021 17:59:52 - INFO - __main__ - Step 147540: {'lr': 3.4104429408504666e-07, 'samples': 28327680, 'steps': 147539, 'loss/train': 1.289869785308838} 11/07/2021 17:59:53 - INFO - __main__ - Step 147541: {'lr': 3.4076725424669887e-07, 'samples': 28327872, 'steps': 147540, 'loss/train': 1.0611441135406494} 11/07/2021 17:59:53 - INFO - __main__ - Step 147542: {'lr': 3.4049032690092184e-07, 'samples': 28328064, 'steps': 147541, 'loss/train': 1.184960961341858} 11/07/2021 17:59:53 - INFO - __main__ - Step 147543: {'lr': 3.4021351204790997e-07, 'samples': 28328256, 'steps': 147542, 'loss/train': 0.6691480278968811} 11/07/2021 17:59:54 - INFO - __main__ - Step 147544: {'lr': 3.399368096877187e-07, 'samples': 28328448, 'steps': 147543, 'loss/train': 1.5037559270858765} 11/07/2021 17:59:55 - INFO - __main__ - Step 147545: {'lr': 3.396602198204868e-07, 'samples': 28328640, 'steps': 147544, 'loss/train': 1.3305447101593018} 11/07/2021 17:59:55 - INFO - __main__ - Step 147546: {'lr': 3.3938374244638084e-07, 'samples': 28328832, 'steps': 147545, 'loss/train': 1.1836538314819336} 11/07/2021 17:59:55 - INFO - __main__ - Step 147547: {'lr': 3.3910737756548405e-07, 'samples': 28329024, 'steps': 147546, 'loss/train': 0.9948487877845764} 11/07/2021 17:59:56 - INFO - __main__ - Step 147548: {'lr': 3.3883112517793523e-07, 'samples': 28329216, 'steps': 147547, 'loss/train': 1.2816345691680908} 11/07/2021 17:59:56 - INFO - __main__ - Step 147549: {'lr': 3.3855498528384545e-07, 'samples': 28329408, 'steps': 147548, 'loss/train': 1.3956201076507568} 11/07/2021 17:59:57 - INFO - __main__ - Step 147550: {'lr': 3.3827895788338115e-07, 'samples': 28329600, 'steps': 147549, 'loss/train': 1.4431883096694946} 11/07/2021 17:59:58 - INFO - __main__ - Step 147551: {'lr': 3.380030429765979e-07, 'samples': 28329792, 'steps': 147550, 'loss/train': 1.4316130876541138} 11/07/2021 17:59:58 - INFO - __main__ - Step 147552: {'lr': 3.3772724056369e-07, 'samples': 28329984, 'steps': 147551, 'loss/train': 0.8663014769554138} 11/07/2021 17:59:58 - INFO - __main__ - Step 147553: {'lr': 3.374515506447129e-07, 'samples': 28330176, 'steps': 147552, 'loss/train': 0.553237795829773} 11/07/2021 17:59:59 - INFO - __main__ - Step 147554: {'lr': 3.371759732198609e-07, 'samples': 28330368, 'steps': 147553, 'loss/train': 1.4678360223770142} 11/07/2021 18:00:00 - INFO - __main__ - Step 147555: {'lr': 3.369005082892174e-07, 'samples': 28330560, 'steps': 147554, 'loss/train': 1.1811977624893188} 11/07/2021 18:00:00 - INFO - __main__ - Step 147556: {'lr': 3.366251558528932e-07, 'samples': 28330752, 'steps': 147555, 'loss/train': 1.0840904712677002} 11/07/2021 18:00:00 - INFO - __main__ - Step 147557: {'lr': 3.36349915911055e-07, 'samples': 28330944, 'steps': 147556, 'loss/train': 1.3485217094421387} 11/07/2021 18:00:01 - INFO - __main__ - Step 147558: {'lr': 3.3607478846381377e-07, 'samples': 28331136, 'steps': 147557, 'loss/train': 1.2077330350875854} 11/07/2021 18:00:01 - INFO - __main__ - Step 147559: {'lr': 3.357997735112528e-07, 'samples': 28331328, 'steps': 147558, 'loss/train': 1.2289880514144897} 11/07/2021 18:00:01 - INFO - __main__ - Step 147560: {'lr': 3.355248710535386e-07, 'samples': 28331520, 'steps': 147559, 'loss/train': 1.4481618404388428} 11/07/2021 18:00:03 - INFO - __main__ - Step 147561: {'lr': 3.352500810908099e-07, 'samples': 28331712, 'steps': 147560, 'loss/train': 0.4186849296092987} 11/07/2021 18:00:03 - INFO - __main__ - Step 147562: {'lr': 3.3497540362315007e-07, 'samples': 28331904, 'steps': 147561, 'loss/train': 1.1937180757522583} 11/07/2021 18:00:03 - INFO - __main__ - Step 147563: {'lr': 3.3470083865069777e-07, 'samples': 28332096, 'steps': 147562, 'loss/train': 1.199292778968811} 11/07/2021 18:00:04 - INFO - __main__ - Step 147564: {'lr': 3.344263861735641e-07, 'samples': 28332288, 'steps': 147563, 'loss/train': 0.6165428757667542} 11/07/2021 18:00:04 - INFO - __main__ - Step 147565: {'lr': 3.3415204619188787e-07, 'samples': 28332480, 'steps': 147564, 'loss/train': 1.134924054145813} 11/07/2021 18:00:05 - INFO - __main__ - Step 147566: {'lr': 3.338778187058078e-07, 'samples': 28332672, 'steps': 147565, 'loss/train': 1.142945647239685} 11/07/2021 18:00:05 - INFO - __main__ - Step 147567: {'lr': 3.3360370371540716e-07, 'samples': 28332864, 'steps': 147566, 'loss/train': 1.2045303583145142} 11/07/2021 18:00:06 - INFO - __main__ - Step 147568: {'lr': 3.3332970122085247e-07, 'samples': 28333056, 'steps': 147567, 'loss/train': 0.8091486096382141} 11/07/2021 18:00:06 - INFO - __main__ - Step 147569: {'lr': 3.330558112222271e-07, 'samples': 28333248, 'steps': 147568, 'loss/train': 0.785569965839386} 11/07/2021 18:00:06 - INFO - __main__ - Step 147570: {'lr': 3.327820337196974e-07, 'samples': 28333440, 'steps': 147569, 'loss/train': 0.9832907319068909} 11/07/2021 18:00:07 - INFO - __main__ - Step 147571: {'lr': 3.3250836871334676e-07, 'samples': 28333632, 'steps': 147570, 'loss/train': 0.5622508525848389} 11/07/2021 18:00:08 - INFO - __main__ - Step 147572: {'lr': 3.3223481620331396e-07, 'samples': 28333824, 'steps': 147571, 'loss/train': 1.0096608400344849} 11/07/2021 18:00:08 - INFO - __main__ - Step 147573: {'lr': 3.3196137618973774e-07, 'samples': 28334016, 'steps': 147572, 'loss/train': 1.260302186012268} 11/07/2021 18:00:09 - INFO - __main__ - Step 147574: {'lr': 3.3168804867270143e-07, 'samples': 28334208, 'steps': 147573, 'loss/train': 1.3128719329833984} 11/07/2021 18:00:09 - INFO - __main__ - Step 147575: {'lr': 3.3141483365237147e-07, 'samples': 28334400, 'steps': 147574, 'loss/train': 1.6532232761383057} 11/07/2021 18:00:10 - INFO - __main__ - Step 147576: {'lr': 3.3114173112885893e-07, 'samples': 28334592, 'steps': 147575, 'loss/train': 1.0727100372314453} 11/07/2021 18:00:10 - INFO - __main__ - Step 147577: {'lr': 3.308687411022748e-07, 'samples': 28334784, 'steps': 147576, 'loss/train': 1.227901816368103} 11/07/2021 18:00:11 - INFO - __main__ - Step 147578: {'lr': 3.3059586357275795e-07, 'samples': 28334976, 'steps': 147577, 'loss/train': 1.5146360397338867} 11/07/2021 18:00:11 - INFO - __main__ - Step 147579: {'lr': 3.3032309854039153e-07, 'samples': 28335168, 'steps': 147578, 'loss/train': 1.1697593927383423} 11/07/2021 18:00:11 - INFO - __main__ - Step 147580: {'lr': 3.300504460053699e-07, 'samples': 28335360, 'steps': 147579, 'loss/train': 1.2929476499557495} 11/07/2021 18:00:12 - INFO - __main__ - Step 147581: {'lr': 3.297779059677486e-07, 'samples': 28335552, 'steps': 147580, 'loss/train': 1.710075855255127} 11/07/2021 18:00:13 - INFO - __main__ - Step 147582: {'lr': 3.295054784276941e-07, 'samples': 28335744, 'steps': 147581, 'loss/train': 1.0728759765625} 11/07/2021 18:00:13 - INFO - __main__ - Step 147583: {'lr': 3.2923316338528964e-07, 'samples': 28335936, 'steps': 147582, 'loss/train': 1.2015302181243896} 11/07/2021 18:00:14 - INFO - __main__ - Step 147584: {'lr': 3.2896096084070184e-07, 'samples': 28336128, 'steps': 147583, 'loss/train': 0.9413813352584839} 11/07/2021 18:00:14 - INFO - __main__ - Step 147585: {'lr': 3.2868887079401386e-07, 'samples': 28336320, 'steps': 147584, 'loss/train': 1.0952593088150024} 11/07/2021 18:00:14 - INFO - __main__ - Step 147586: {'lr': 3.284168932453646e-07, 'samples': 28336512, 'steps': 147585, 'loss/train': 1.6473511457443237} 11/07/2021 18:00:15 - INFO - __main__ - Step 147587: {'lr': 3.281450281948928e-07, 'samples': 28336704, 'steps': 147586, 'loss/train': 1.0325499773025513} 11/07/2021 18:00:16 - INFO - __main__ - Step 147588: {'lr': 3.278732756427094e-07, 'samples': 28336896, 'steps': 147587, 'loss/train': 0.8758202195167542} 11/07/2021 18:00:16 - INFO - __main__ - Step 147589: {'lr': 3.2760163558892554e-07, 'samples': 28337088, 'steps': 147588, 'loss/train': 1.3281567096710205} 11/07/2021 18:00:17 - INFO - __main__ - Step 147590: {'lr': 3.2733010803365217e-07, 'samples': 28337280, 'steps': 147589, 'loss/train': 1.2991634607315063} 11/07/2021 18:00:17 - INFO - __main__ - Step 147591: {'lr': 3.270586929770558e-07, 'samples': 28337472, 'steps': 147590, 'loss/train': 1.3502075672149658} 11/07/2021 18:00:18 - INFO - __main__ - Step 147592: {'lr': 3.267873904192198e-07, 'samples': 28337664, 'steps': 147591, 'loss/train': 1.0114811658859253} 11/07/2021 18:00:18 - INFO - __main__ - Step 147593: {'lr': 3.2651620036028284e-07, 'samples': 28337856, 'steps': 147592, 'loss/train': 1.2789673805236816} 11/07/2021 18:00:19 - INFO - __main__ - Step 147594: {'lr': 3.2624512280038376e-07, 'samples': 28338048, 'steps': 147593, 'loss/train': 1.3783442974090576} 11/07/2021 18:00:19 - INFO - __main__ - Step 147595: {'lr': 3.259741577396058e-07, 'samples': 28338240, 'steps': 147594, 'loss/train': 1.1775381565093994} 11/07/2021 18:00:19 - INFO - __main__ - Step 147596: {'lr': 3.2570330517811555e-07, 'samples': 28338432, 'steps': 147595, 'loss/train': 1.357991099357605} 11/07/2021 18:00:20 - INFO - __main__ - Step 147597: {'lr': 3.254325651159684e-07, 'samples': 28338624, 'steps': 147596, 'loss/train': 1.0083348751068115} 11/07/2021 18:00:21 - INFO - __main__ - Step 147598: {'lr': 3.2516193755335876e-07, 'samples': 28338816, 'steps': 147597, 'loss/train': 1.4146573543548584} 11/07/2021 18:00:21 - INFO - __main__ - Step 147599: {'lr': 3.2489142249036984e-07, 'samples': 28339008, 'steps': 147598, 'loss/train': 1.152767539024353} 11/07/2021 18:00:21 - INFO - __main__ - Step 147600: {'lr': 3.2462101992714045e-07, 'samples': 28339200, 'steps': 147599, 'loss/train': 1.2935291528701782} 11/07/2021 18:00:22 - INFO - __main__ - Step 147601: {'lr': 3.2435072986378155e-07, 'samples': 28339392, 'steps': 147600, 'loss/train': 1.1619845628738403} 11/07/2021 18:00:23 - INFO - __main__ - Step 147602: {'lr': 3.2408055230043196e-07, 'samples': 28339584, 'steps': 147601, 'loss/train': 1.4177582263946533} 11/07/2021 18:00:23 - INFO - __main__ - Step 147603: {'lr': 3.23810487237175e-07, 'samples': 28339776, 'steps': 147602, 'loss/train': 1.0519689321517944} 11/07/2021 18:00:23 - INFO - __main__ - Step 147604: {'lr': 3.2354053467414937e-07, 'samples': 28339968, 'steps': 147603, 'loss/train': 1.4809194803237915} 11/07/2021 18:00:24 - INFO - __main__ - Step 147605: {'lr': 3.2327069461152156e-07, 'samples': 28340160, 'steps': 147604, 'loss/train': 0.7598989009857178} 11/07/2021 18:00:24 - INFO - __main__ - Step 147606: {'lr': 3.230009670493472e-07, 'samples': 28340352, 'steps': 147605, 'loss/train': 1.347164273262024} 11/07/2021 18:00:25 - INFO - __main__ - Step 147607: {'lr': 3.2273135198776505e-07, 'samples': 28340544, 'steps': 147606, 'loss/train': 1.2735909223556519} 11/07/2021 18:00:25 - INFO - __main__ - Step 147608: {'lr': 3.224618494269138e-07, 'samples': 28340736, 'steps': 147607, 'loss/train': 0.9229835867881775} 11/07/2021 18:00:26 - INFO - __main__ - Step 147609: {'lr': 3.221924593669323e-07, 'samples': 28340928, 'steps': 147608, 'loss/train': 1.5886262655258179} 11/07/2021 18:00:26 - INFO - __main__ - Step 147610: {'lr': 3.219231818079038e-07, 'samples': 28341120, 'steps': 147609, 'loss/train': 1.5267784595489502} 11/07/2021 18:00:26 - INFO - __main__ - Step 147611: {'lr': 3.216540167499671e-07, 'samples': 28341312, 'steps': 147610, 'loss/train': 1.222668170928955} 11/07/2021 18:00:28 - INFO - __main__ - Step 147612: {'lr': 3.2138496419323314e-07, 'samples': 28341504, 'steps': 147611, 'loss/train': 1.2000938653945923} 11/07/2021 18:00:28 - INFO - __main__ - Step 147613: {'lr': 3.211160241378408e-07, 'samples': 28341696, 'steps': 147612, 'loss/train': 1.270831823348999} 11/07/2021 18:00:28 - INFO - __main__ - Step 147614: {'lr': 3.2084719658387327e-07, 'samples': 28341888, 'steps': 147613, 'loss/train': 1.1710014343261719} 11/07/2021 18:00:29 - INFO - __main__ - Step 147615: {'lr': 3.205784815315249e-07, 'samples': 28342080, 'steps': 147614, 'loss/train': 1.2364749908447266} 11/07/2021 18:00:29 - INFO - __main__ - Step 147616: {'lr': 3.203098789808234e-07, 'samples': 28342272, 'steps': 147615, 'loss/train': 1.2560279369354248} 11/07/2021 18:00:30 - INFO - __main__ - Step 147617: {'lr': 3.200413889319631e-07, 'samples': 28342464, 'steps': 147616, 'loss/train': 1.5371145009994507} 11/07/2021 18:00:30 - INFO - __main__ - Step 147618: {'lr': 3.197730113850272e-07, 'samples': 28342656, 'steps': 147617, 'loss/train': 0.8281817436218262} 11/07/2021 18:00:31 - INFO - __main__ - Step 147619: {'lr': 3.1950474634018233e-07, 'samples': 28342848, 'steps': 147618, 'loss/train': 1.10695481300354} 11/07/2021 18:00:31 - INFO - __main__ - Step 147620: {'lr': 3.192365937974839e-07, 'samples': 28343040, 'steps': 147619, 'loss/train': 1.0921530723571777} 11/07/2021 18:00:31 - INFO - __main__ - Step 147621: {'lr': 3.1896855375709853e-07, 'samples': 28343232, 'steps': 147620, 'loss/train': 0.7014150023460388} 11/07/2021 18:00:33 - INFO - __main__ - Step 147622: {'lr': 3.1870062621910943e-07, 'samples': 28343424, 'steps': 147621, 'loss/train': 1.3416333198547363} 11/07/2021 18:00:33 - INFO - __main__ - Step 147623: {'lr': 3.184328111836832e-07, 'samples': 28343616, 'steps': 147622, 'loss/train': 1.3812938928604126} 11/07/2021 18:00:33 - INFO - __main__ - Step 147624: {'lr': 3.18165108650903e-07, 'samples': 28343808, 'steps': 147623, 'loss/train': 1.3919742107391357} 11/07/2021 18:00:34 - INFO - __main__ - Step 147625: {'lr': 3.1789751862090766e-07, 'samples': 28344000, 'steps': 147624, 'loss/train': 1.1271966695785522} 11/07/2021 18:00:34 - INFO - __main__ - Step 147626: {'lr': 3.1763004109383596e-07, 'samples': 28344192, 'steps': 147625, 'loss/train': 1.0762463808059692} 11/07/2021 18:00:35 - INFO - __main__ - Step 147627: {'lr': 3.173626760697712e-07, 'samples': 28344384, 'steps': 147626, 'loss/train': 1.8707542419433594} 11/07/2021 18:00:36 - INFO - __main__ - Step 147628: {'lr': 3.170954235488521e-07, 'samples': 28344576, 'steps': 147627, 'loss/train': 1.1744763851165771} 11/07/2021 18:00:36 - INFO - __main__ - Step 147629: {'lr': 3.1682828353118974e-07, 'samples': 28344768, 'steps': 147628, 'loss/train': 1.7030946016311646} 11/07/2021 18:00:36 - INFO - __main__ - Step 147630: {'lr': 3.165612560169229e-07, 'samples': 28344960, 'steps': 147629, 'loss/train': 0.9833199381828308} 11/07/2021 18:00:37 - INFO - __main__ - Step 147631: {'lr': 3.162943410061625e-07, 'samples': 28345152, 'steps': 147630, 'loss/train': 1.328928828239441} 11/07/2021 18:00:37 - INFO - __main__ - Step 147632: {'lr': 3.160275384990197e-07, 'samples': 28345344, 'steps': 147631, 'loss/train': 1.2041722536087036} 11/07/2021 18:00:38 - INFO - __main__ - Step 147633: {'lr': 3.1576084849563315e-07, 'samples': 28345536, 'steps': 147632, 'loss/train': 1.7172735929489136} 11/07/2021 18:00:38 - INFO - __main__ - Step 147634: {'lr': 3.154942709960862e-07, 'samples': 28345728, 'steps': 147633, 'loss/train': 1.6604828834533691} 11/07/2021 18:00:39 - INFO - __main__ - Step 147635: {'lr': 3.1522780600054536e-07, 'samples': 28345920, 'steps': 147634, 'loss/train': 0.7285738587379456} 11/07/2021 18:00:39 - INFO - __main__ - Step 147636: {'lr': 3.1496145350912163e-07, 'samples': 28346112, 'steps': 147635, 'loss/train': 1.3446049690246582} 11/07/2021 18:00:39 - INFO - __main__ - Step 147637: {'lr': 3.146952135218983e-07, 'samples': 28346304, 'steps': 147636, 'loss/train': 1.0796400308609009} 11/07/2021 18:00:40 - INFO - __main__ - Step 147638: {'lr': 3.144290860390142e-07, 'samples': 28346496, 'steps': 147637, 'loss/train': 1.4124306440353394} 11/07/2021 18:00:41 - INFO - __main__ - Step 147639: {'lr': 3.1416307106060804e-07, 'samples': 28346688, 'steps': 147638, 'loss/train': 1.3030225038528442} 11/07/2021 18:00:41 - INFO - __main__ - Step 147640: {'lr': 3.1389716858679083e-07, 'samples': 28346880, 'steps': 147639, 'loss/train': 1.2500730752944946} 11/07/2021 18:00:42 - INFO - __main__ - Step 147641: {'lr': 3.1363137861770143e-07, 'samples': 28347072, 'steps': 147640, 'loss/train': 0.8127176761627197} 11/07/2021 18:00:42 - INFO - __main__ - Step 147642: {'lr': 3.1336570115339526e-07, 'samples': 28347264, 'steps': 147641, 'loss/train': 1.3631011247634888} 11/07/2021 18:00:42 - INFO - __main__ - Step 147643: {'lr': 3.131001361940666e-07, 'samples': 28347456, 'steps': 147642, 'loss/train': 1.0330497026443481} 11/07/2021 18:00:43 - INFO - __main__ - Step 147644: {'lr': 3.1283468373977107e-07, 'samples': 28347648, 'steps': 147643, 'loss/train': 0.4590761661529541} 11/07/2021 18:00:44 - INFO - __main__ - Step 147645: {'lr': 3.1256934379067513e-07, 'samples': 28347840, 'steps': 147644, 'loss/train': 1.1714727878570557} 11/07/2021 18:00:44 - INFO - __main__ - Step 147646: {'lr': 3.123041163468898e-07, 'samples': 28348032, 'steps': 147645, 'loss/train': 1.6515055894851685} 11/07/2021 18:00:44 - INFO - __main__ - Step 147647: {'lr': 3.1203900140852617e-07, 'samples': 28348224, 'steps': 147646, 'loss/train': 1.366958737373352} 11/07/2021 18:00:45 - INFO - __main__ - Step 147648: {'lr': 3.117739989756951e-07, 'samples': 28348416, 'steps': 147647, 'loss/train': 0.8467486500740051} 11/07/2021 18:00:46 - INFO - __main__ - Step 147649: {'lr': 3.115091090485356e-07, 'samples': 28348608, 'steps': 147648, 'loss/train': 1.0853301286697388} 11/07/2021 18:00:46 - INFO - __main__ - Step 147650: {'lr': 3.112443316271307e-07, 'samples': 28348800, 'steps': 147649, 'loss/train': 1.204175591468811} 11/07/2021 18:00:46 - INFO - __main__ - Step 147651: {'lr': 3.109796667116471e-07, 'samples': 28348992, 'steps': 147650, 'loss/train': 1.6516844034194946} 11/07/2021 18:00:47 - INFO - __main__ - Step 147652: {'lr': 3.107151143021958e-07, 'samples': 28349184, 'steps': 147651, 'loss/train': 1.4520893096923828} 11/07/2021 18:00:47 - INFO - __main__ - Step 147653: {'lr': 3.1045067439885997e-07, 'samples': 28349376, 'steps': 147652, 'loss/train': 0.8208987712860107} 11/07/2021 18:00:48 - INFO - __main__ - Step 147654: {'lr': 3.101863470018063e-07, 'samples': 28349568, 'steps': 147653, 'loss/train': 1.3333097696304321} 11/07/2021 18:00:49 - INFO - __main__ - Step 147655: {'lr': 3.099221321111179e-07, 'samples': 28349760, 'steps': 147654, 'loss/train': 0.6512184143066406} 11/07/2021 18:00:49 - INFO - __main__ - Step 147656: {'lr': 3.096580297269058e-07, 'samples': 28349952, 'steps': 147655, 'loss/train': 1.530150294303894} 11/07/2021 18:00:49 - INFO - __main__ - Step 147657: {'lr': 3.093940398493367e-07, 'samples': 28350144, 'steps': 147656, 'loss/train': 0.9486633539199829} 11/07/2021 18:00:50 - INFO - __main__ - Step 147658: {'lr': 3.091301624784937e-07, 'samples': 28350336, 'steps': 147657, 'loss/train': 1.3152755498886108} 11/07/2021 18:00:51 - INFO - __main__ - Step 147659: {'lr': 3.088663976144879e-07, 'samples': 28350528, 'steps': 147658, 'loss/train': 1.207562804222107} 11/07/2021 18:00:51 - INFO - __main__ - Step 147660: {'lr': 3.086027452574858e-07, 'samples': 28350720, 'steps': 147659, 'loss/train': 0.8985258936882019} 11/07/2021 18:00:51 - INFO - __main__ - Step 147661: {'lr': 3.083392054075429e-07, 'samples': 28350912, 'steps': 147660, 'loss/train': 1.1279833316802979} 11/07/2021 18:00:52 - INFO - __main__ - Step 147662: {'lr': 3.080757780648258e-07, 'samples': 28351104, 'steps': 147661, 'loss/train': 1.2911559343338013} 11/07/2021 18:00:52 - INFO - __main__ - Step 147663: {'lr': 3.0781246322944544e-07, 'samples': 28351296, 'steps': 147662, 'loss/train': 1.1703321933746338} 11/07/2021 18:00:53 - INFO - __main__ - Step 147664: {'lr': 3.0754926090148514e-07, 'samples': 28351488, 'steps': 147663, 'loss/train': 1.0917326211929321} 11/07/2021 18:00:53 - INFO - __main__ - Step 147665: {'lr': 3.072861710811115e-07, 'samples': 28351680, 'steps': 147664, 'loss/train': 1.1125149726867676} 11/07/2021 18:00:54 - INFO - __main__ - Step 147666: {'lr': 3.070231937684076e-07, 'samples': 28351872, 'steps': 147665, 'loss/train': 1.3049036264419556} 11/07/2021 18:00:54 - INFO - __main__ - Step 147667: {'lr': 3.0676032896351237e-07, 'samples': 28352064, 'steps': 147666, 'loss/train': 1.2819325923919678} 11/07/2021 18:00:54 - INFO - __main__ - Step 147668: {'lr': 3.064975766665368e-07, 'samples': 28352256, 'steps': 147667, 'loss/train': 1.5669556856155396} 11/07/2021 18:00:55 - INFO - __main__ - Step 147669: {'lr': 3.062349368776196e-07, 'samples': 28352448, 'steps': 147668, 'loss/train': 1.0389649868011475} 11/07/2021 18:00:56 - INFO - __main__ - Step 147670: {'lr': 3.059724095968441e-07, 'samples': 28352640, 'steps': 147669, 'loss/train': 1.2414309978485107} 11/07/2021 18:00:56 - INFO - __main__ - Step 147671: {'lr': 3.057099948243214e-07, 'samples': 28352832, 'steps': 147670, 'loss/train': 1.3372249603271484} 11/07/2021 18:00:57 - INFO - __main__ - Step 147672: {'lr': 3.0544769256021787e-07, 'samples': 28353024, 'steps': 147671, 'loss/train': 1.286085605621338} 11/07/2021 18:00:57 - INFO - __main__ - Step 147673: {'lr': 3.0518550280461686e-07, 'samples': 28353216, 'steps': 147672, 'loss/train': 1.3760560750961304} 11/07/2021 18:00:57 - INFO - __main__ - Step 147674: {'lr': 3.0492342555765715e-07, 'samples': 28353408, 'steps': 147673, 'loss/train': 1.5242061614990234} 11/07/2021 18:00:58 - INFO - __main__ - Step 147675: {'lr': 3.046614608194498e-07, 'samples': 28353600, 'steps': 147674, 'loss/train': 1.2308974266052246} 11/07/2021 18:00:59 - INFO - __main__ - Step 147676: {'lr': 3.0439960859010573e-07, 'samples': 28353792, 'steps': 147675, 'loss/train': 0.9598559141159058} 11/07/2021 18:00:59 - INFO - __main__ - Step 147677: {'lr': 3.0413786886973604e-07, 'samples': 28353984, 'steps': 147676, 'loss/train': 1.097133755683899} 11/07/2021 18:00:59 - INFO - __main__ - Step 147678: {'lr': 3.038762416584795e-07, 'samples': 28354176, 'steps': 147677, 'loss/train': 0.925425112247467} 11/07/2021 18:01:00 - INFO - __main__ - Step 147679: {'lr': 3.0361472695641936e-07, 'samples': 28354368, 'steps': 147678, 'loss/train': 1.4061965942382812} 11/07/2021 18:01:01 - INFO - __main__ - Step 147680: {'lr': 3.0335332476372214e-07, 'samples': 28354560, 'steps': 147679, 'loss/train': 0.9739943742752075} 11/07/2021 18:01:01 - INFO - __main__ - Step 147681: {'lr': 3.030920350804711e-07, 'samples': 28354752, 'steps': 147680, 'loss/train': 0.6337581872940063} 11/07/2021 18:01:02 - INFO - __main__ - Step 147682: {'lr': 3.028308579068051e-07, 'samples': 28354944, 'steps': 147681, 'loss/train': 1.8284255266189575} 11/07/2021 18:01:02 - INFO - __main__ - Step 147683: {'lr': 3.0256979324283506e-07, 'samples': 28355136, 'steps': 147682, 'loss/train': 1.419312834739685} 11/07/2021 18:01:02 - INFO - __main__ - Step 147684: {'lr': 3.023088410886443e-07, 'samples': 28355328, 'steps': 147683, 'loss/train': 1.293844223022461} 11/07/2021 18:01:03 - INFO - __main__ - Step 147685: {'lr': 3.0204800144439937e-07, 'samples': 28355520, 'steps': 147684, 'loss/train': 0.9804245829582214} 11/07/2021 18:01:04 - INFO - __main__ - Step 147686: {'lr': 3.017872743101835e-07, 'samples': 28355712, 'steps': 147685, 'loss/train': 0.9723736643791199} 11/07/2021 18:01:04 - INFO - __main__ - Step 147687: {'lr': 3.015266596861632e-07, 'samples': 28355904, 'steps': 147686, 'loss/train': 1.1628098487854004} 11/07/2021 18:01:04 - INFO - __main__ - Step 147688: {'lr': 3.012661575723941e-07, 'samples': 28356096, 'steps': 147687, 'loss/train': 1.379357933998108} 11/07/2021 18:01:05 - INFO - __main__ - Step 147689: {'lr': 3.010057679690148e-07, 'samples': 28356288, 'steps': 147688, 'loss/train': 1.3181573152542114} 11/07/2021 18:01:06 - INFO - __main__ - Step 147690: {'lr': 3.007454908761642e-07, 'samples': 28356480, 'steps': 147689, 'loss/train': 1.406026840209961} 11/07/2021 18:01:06 - INFO - __main__ - Step 147691: {'lr': 3.004853262939533e-07, 'samples': 28356672, 'steps': 147690, 'loss/train': 1.2246906757354736} 11/07/2021 18:01:06 - INFO - __main__ - Step 147692: {'lr': 3.0022527422246536e-07, 'samples': 28356864, 'steps': 147691, 'loss/train': 1.3280937671661377} 11/07/2021 18:01:07 - INFO - __main__ - Step 147693: {'lr': 2.999653346618669e-07, 'samples': 28357056, 'steps': 147692, 'loss/train': 1.1237373352050781} 11/07/2021 18:01:07 - INFO - __main__ - Step 147694: {'lr': 2.997055076122412e-07, 'samples': 28357248, 'steps': 147693, 'loss/train': 1.3686281442642212} 11/07/2021 18:01:07 - INFO - __main__ - Step 147695: {'lr': 2.994457930737271e-07, 'samples': 28357440, 'steps': 147694, 'loss/train': 1.0069546699523926} 11/07/2021 18:01:09 - INFO - __main__ - Step 147696: {'lr': 2.9918619104640774e-07, 'samples': 28357632, 'steps': 147695, 'loss/train': 1.428963541984558} 11/07/2021 18:01:09 - INFO - __main__ - Step 147697: {'lr': 2.98926701530422e-07, 'samples': 28357824, 'steps': 147696, 'loss/train': 1.2936785221099854} 11/07/2021 18:01:09 - INFO - __main__ - Step 147698: {'lr': 2.986673245259086e-07, 'samples': 28358016, 'steps': 147697, 'loss/train': 1.1352934837341309} 11/07/2021 18:01:10 - INFO - __main__ - Step 147699: {'lr': 2.9840806003295084e-07, 'samples': 28358208, 'steps': 147698, 'loss/train': 1.2908576726913452} 11/07/2021 18:01:10 - INFO - __main__ - Step 147700: {'lr': 2.981489080516875e-07, 'samples': 28358400, 'steps': 147699, 'loss/train': 0.8331197500228882} 11/07/2021 18:01:11 - INFO - __main__ - Step 147701: {'lr': 2.9788986858220177e-07, 'samples': 28358592, 'steps': 147700, 'loss/train': 0.8286173939704895} 11/07/2021 18:01:11 - INFO - __main__ - Step 147702: {'lr': 2.976309416246603e-07, 'samples': 28358784, 'steps': 147701, 'loss/train': 1.4014267921447754} 11/07/2021 18:01:12 - INFO - __main__ - Step 147703: {'lr': 2.9737212717911857e-07, 'samples': 28358976, 'steps': 147702, 'loss/train': 1.116593837738037} 11/07/2021 18:01:12 - INFO - __main__ - Step 147704: {'lr': 2.9711342524577077e-07, 'samples': 28359168, 'steps': 147703, 'loss/train': 1.1457059383392334} 11/07/2021 18:01:12 - INFO - __main__ - Step 147705: {'lr': 2.968548358246448e-07, 'samples': 28359360, 'steps': 147704, 'loss/train': 0.5023813247680664} 11/07/2021 18:01:14 - INFO - __main__ - Step 147706: {'lr': 2.9659635891593486e-07, 'samples': 28359552, 'steps': 147705, 'loss/train': 1.2110503911972046} 11/07/2021 18:01:14 - INFO - __main__ - Step 147707: {'lr': 2.963379945197242e-07, 'samples': 28359744, 'steps': 147706, 'loss/train': 1.665042519569397} 11/07/2021 18:01:14 - INFO - __main__ - Step 147708: {'lr': 2.960797426361239e-07, 'samples': 28359936, 'steps': 147707, 'loss/train': 0.9496939778327942} 11/07/2021 18:01:15 - INFO - __main__ - Step 147709: {'lr': 2.95821603265245e-07, 'samples': 28360128, 'steps': 147708, 'loss/train': 1.2213754653930664} 11/07/2021 18:01:15 - INFO - __main__ - Step 147710: {'lr': 2.955635764072262e-07, 'samples': 28360320, 'steps': 147709, 'loss/train': 1.3437886238098145} 11/07/2021 18:01:16 - INFO - __main__ - Step 147711: {'lr': 2.9530566206217857e-07, 'samples': 28360512, 'steps': 147710, 'loss/train': 1.1438242197036743} 11/07/2021 18:01:16 - INFO - __main__ - Step 147712: {'lr': 2.9504786023021313e-07, 'samples': 28360704, 'steps': 147711, 'loss/train': 1.3605036735534668} 11/07/2021 18:01:17 - INFO - __main__ - Step 147713: {'lr': 2.947901709114409e-07, 'samples': 28360896, 'steps': 147712, 'loss/train': 1.3650323152542114} 11/07/2021 18:01:17 - INFO - __main__ - Step 147714: {'lr': 2.945325941059729e-07, 'samples': 28361088, 'steps': 147713, 'loss/train': 1.488633155822754} 11/07/2021 18:01:17 - INFO - __main__ - Step 147715: {'lr': 2.9427512981394786e-07, 'samples': 28361280, 'steps': 147714, 'loss/train': 1.7100245952606201} 11/07/2021 18:01:19 - INFO - __main__ - Step 147716: {'lr': 2.9401777803547694e-07, 'samples': 28361472, 'steps': 147715, 'loss/train': 1.5993655920028687} 11/07/2021 18:01:19 - INFO - __main__ - Step 147717: {'lr': 2.93760538770671e-07, 'samples': 28361664, 'steps': 147716, 'loss/train': 0.7605358362197876} 11/07/2021 18:01:19 - INFO - __main__ - Step 147718: {'lr': 2.935034120196134e-07, 'samples': 28361856, 'steps': 147717, 'loss/train': 0.9931905269622803} 11/07/2021 18:01:20 - INFO - __main__ - Step 147719: {'lr': 2.9324639778247063e-07, 'samples': 28362048, 'steps': 147718, 'loss/train': 1.3697919845581055} 11/07/2021 18:01:20 - INFO - __main__ - Step 147720: {'lr': 2.9298949605935377e-07, 'samples': 28362240, 'steps': 147719, 'loss/train': 0.04476190730929375} 11/07/2021 18:01:21 - INFO - __main__ - Step 147721: {'lr': 2.9273270685034603e-07, 'samples': 28362432, 'steps': 147720, 'loss/train': 1.2629536390304565} 11/07/2021 18:01:22 - INFO - __main__ - Step 147722: {'lr': 2.924760301555585e-07, 'samples': 28362624, 'steps': 147721, 'loss/train': 0.5210383534431458} 11/07/2021 18:01:22 - INFO - __main__ - Step 147723: {'lr': 2.922194659751576e-07, 'samples': 28362816, 'steps': 147722, 'loss/train': 1.2940579652786255} 11/07/2021 18:01:22 - INFO - __main__ - Step 147724: {'lr': 2.919630143092267e-07, 'samples': 28363008, 'steps': 147723, 'loss/train': 1.350419044494629} 11/07/2021 18:01:23 - INFO - __main__ - Step 147725: {'lr': 2.91706675157849e-07, 'samples': 28363200, 'steps': 147724, 'loss/train': 1.3511896133422852} 11/07/2021 18:01:23 - INFO - __main__ - Step 147726: {'lr': 2.9145044852121883e-07, 'samples': 28363392, 'steps': 147725, 'loss/train': 5.666473865509033} 11/07/2021 18:01:24 - INFO - __main__ - Step 147727: {'lr': 2.911943343993917e-07, 'samples': 28363584, 'steps': 147726, 'loss/train': 1.3212401866912842} 11/07/2021 18:01:24 - INFO - __main__ - Step 147728: {'lr': 2.909383327925064e-07, 'samples': 28363776, 'steps': 147727, 'loss/train': 1.2866032123565674} 11/07/2021 18:01:25 - INFO - __main__ - Step 147729: {'lr': 2.906824437006461e-07, 'samples': 28363968, 'steps': 147728, 'loss/train': 0.5031548142433167} 11/07/2021 18:01:25 - INFO - __main__ - Step 147730: {'lr': 2.904266671239775e-07, 'samples': 28364160, 'steps': 147729, 'loss/train': 1.3557158708572388} 11/07/2021 18:01:26 - INFO - __main__ - Step 147731: {'lr': 2.901710030625837e-07, 'samples': 28364352, 'steps': 147730, 'loss/train': 1.5741502046585083} 11/07/2021 18:01:27 - INFO - __main__ - Step 147732: {'lr': 2.899154515165758e-07, 'samples': 28364544, 'steps': 147731, 'loss/train': 1.3364272117614746} 11/07/2021 18:01:27 - INFO - __main__ - Step 147733: {'lr': 2.896600124860926e-07, 'samples': 28364736, 'steps': 147732, 'loss/train': 1.3668155670166016} 11/07/2021 18:01:27 - INFO - __main__ - Step 147734: {'lr': 2.894046859712174e-07, 'samples': 28364928, 'steps': 147733, 'loss/train': 0.22175930440425873} 11/07/2021 18:01:28 - INFO - __main__ - Step 147735: {'lr': 2.8914947197208884e-07, 'samples': 28365120, 'steps': 147734, 'loss/train': 1.3744174242019653} 11/07/2021 18:01:28 - INFO - __main__ - Step 147736: {'lr': 2.8889437048881807e-07, 'samples': 28365312, 'steps': 147735, 'loss/train': 1.3095027208328247} 11/07/2021 18:01:29 - INFO - __main__ - Step 147737: {'lr': 2.886393815215438e-07, 'samples': 28365504, 'steps': 147736, 'loss/train': 1.5437047481536865} 11/07/2021 18:01:29 - INFO - __main__ - Step 147738: {'lr': 2.8838450507032155e-07, 'samples': 28365696, 'steps': 147737, 'loss/train': 1.134136438369751} 11/07/2021 18:01:30 - INFO - __main__ - Step 147739: {'lr': 2.881297411353179e-07, 'samples': 28365888, 'steps': 147738, 'loss/train': 1.5525790452957153} 11/07/2021 18:01:30 - INFO - __main__ - Step 147740: {'lr': 2.87875089716616e-07, 'samples': 28366080, 'steps': 147739, 'loss/train': 1.1406408548355103} 11/07/2021 18:01:30 - INFO - __main__ - Step 147741: {'lr': 2.876205508143548e-07, 'samples': 28366272, 'steps': 147740, 'loss/train': 1.3497350215911865} 11/07/2021 18:01:31 - INFO - __main__ - Step 147742: {'lr': 2.873661244286452e-07, 'samples': 28366464, 'steps': 147741, 'loss/train': 1.7972146272659302} 11/07/2021 18:01:32 - INFO - __main__ - Step 147743: {'lr': 2.871118105595705e-07, 'samples': 28366656, 'steps': 147742, 'loss/train': 1.1621803045272827} 11/07/2021 18:01:32 - INFO - __main__ - Step 147744: {'lr': 2.868576092072972e-07, 'samples': 28366848, 'steps': 147743, 'loss/train': 1.2053005695343018} 11/07/2021 18:01:32 - INFO - __main__ - Step 147745: {'lr': 2.8660352037188086e-07, 'samples': 28367040, 'steps': 147744, 'loss/train': 1.1611953973770142} 11/07/2021 18:01:33 - INFO - __main__ - Step 147746: {'lr': 2.86349544053488e-07, 'samples': 28367232, 'steps': 147745, 'loss/train': 0.9200687408447266} 11/07/2021 18:01:34 - INFO - __main__ - Step 147747: {'lr': 2.8609568025222963e-07, 'samples': 28367424, 'steps': 147746, 'loss/train': 1.5181939601898193} 11/07/2021 18:01:34 - INFO - __main__ - Step 147748: {'lr': 2.858419289681613e-07, 'samples': 28367616, 'steps': 147747, 'loss/train': 1.246403694152832} 11/07/2021 18:01:35 - INFO - __main__ - Step 147749: {'lr': 2.8558829020147726e-07, 'samples': 28367808, 'steps': 147748, 'loss/train': 1.2150968313217163} 11/07/2021 18:01:35 - INFO - __main__ - Step 147750: {'lr': 2.85334763952233e-07, 'samples': 28368000, 'steps': 147749, 'loss/train': 1.105513095855713} 11/07/2021 18:01:35 - INFO - __main__ - Step 147751: {'lr': 2.8508135022056736e-07, 'samples': 28368192, 'steps': 147750, 'loss/train': 0.9346885085105896} 11/07/2021 18:01:36 - INFO - __main__ - Step 147752: {'lr': 2.848280490066191e-07, 'samples': 28368384, 'steps': 147751, 'loss/train': 1.12730073928833} 11/07/2021 18:01:37 - INFO - __main__ - Step 147753: {'lr': 2.845748603104437e-07, 'samples': 28368576, 'steps': 147752, 'loss/train': 0.6080348491668701} 11/07/2021 18:01:37 - INFO - __main__ - Step 147754: {'lr': 2.843217841321799e-07, 'samples': 28368768, 'steps': 147753, 'loss/train': 1.5655750036239624} 11/07/2021 18:01:37 - INFO - __main__ - Step 147755: {'lr': 2.840688204719666e-07, 'samples': 28368960, 'steps': 147754, 'loss/train': 1.1104148626327515} 11/07/2021 18:01:38 - INFO - __main__ - Step 147756: {'lr': 2.8381596932988693e-07, 'samples': 28369152, 'steps': 147755, 'loss/train': 1.1543339490890503} 11/07/2021 18:01:39 - INFO - __main__ - Step 147757: {'lr': 2.8356323070605204e-07, 'samples': 28369344, 'steps': 147756, 'loss/train': 1.0750880241394043} 11/07/2021 18:01:39 - INFO - __main__ - Step 147758: {'lr': 2.833106046006284e-07, 'samples': 28369536, 'steps': 147757, 'loss/train': 1.3402118682861328} 11/07/2021 18:01:39 - INFO - __main__ - Step 147759: {'lr': 2.8305809101364375e-07, 'samples': 28369728, 'steps': 147758, 'loss/train': 1.7326353788375854} 11/07/2021 18:01:40 - INFO - __main__ - Step 147760: {'lr': 2.8280568994529243e-07, 'samples': 28369920, 'steps': 147759, 'loss/train': 1.3960940837860107} 11/07/2021 18:01:40 - INFO - __main__ - Step 147761: {'lr': 2.8255340139565767e-07, 'samples': 28370112, 'steps': 147760, 'loss/train': 1.1514995098114014} 11/07/2021 18:01:41 - INFO - __main__ - Step 147762: {'lr': 2.8230122536485047e-07, 'samples': 28370304, 'steps': 147761, 'loss/train': 0.9152685403823853} 11/07/2021 18:01:42 - INFO - __main__ - Step 147763: {'lr': 2.820491618529819e-07, 'samples': 28370496, 'steps': 147762, 'loss/train': 1.5299278497695923} 11/07/2021 18:01:42 - INFO - __main__ - Step 147764: {'lr': 2.8179721086016295e-07, 'samples': 28370688, 'steps': 147763, 'loss/train': 1.308122992515564} 11/07/2021 18:01:42 - INFO - __main__ - Step 147765: {'lr': 2.815453723865047e-07, 'samples': 28370880, 'steps': 147764, 'loss/train': 1.2836915254592896} 11/07/2021 18:01:43 - INFO - __main__ - Step 147766: {'lr': 2.812936464321458e-07, 'samples': 28371072, 'steps': 147765, 'loss/train': 0.15166670083999634} 11/07/2021 18:01:43 - INFO - __main__ - Step 147767: {'lr': 2.810420329971697e-07, 'samples': 28371264, 'steps': 147766, 'loss/train': 1.4579064846038818} 11/07/2021 18:01:44 - INFO - __main__ - Step 147768: {'lr': 2.80790532081715e-07, 'samples': 28371456, 'steps': 147767, 'loss/train': 1.2273569107055664} 11/07/2021 18:01:44 - INFO - __main__ - Step 147769: {'lr': 2.805391436858651e-07, 'samples': 28371648, 'steps': 147768, 'loss/train': 1.3670974969863892} 11/07/2021 18:01:45 - INFO - __main__ - Step 147770: {'lr': 2.802878678097587e-07, 'samples': 28371840, 'steps': 147769, 'loss/train': 1.425985336303711} 11/07/2021 18:01:45 - INFO - __main__ - Step 147771: {'lr': 2.800367044535068e-07, 'samples': 28372032, 'steps': 147770, 'loss/train': 1.2091176509857178} 11/07/2021 18:01:45 - INFO - __main__ - Step 147772: {'lr': 2.797856536172205e-07, 'samples': 28372224, 'steps': 147771, 'loss/train': 1.0450410842895508} 11/07/2021 18:01:47 - INFO - __main__ - Step 147773: {'lr': 2.7953471530098307e-07, 'samples': 28372416, 'steps': 147772, 'loss/train': 1.6061495542526245} 11/07/2021 18:01:47 - INFO - __main__ - Step 147774: {'lr': 2.79283889504961e-07, 'samples': 28372608, 'steps': 147773, 'loss/train': 0.5934788584709167} 11/07/2021 18:01:47 - INFO - __main__ - Step 147775: {'lr': 2.790331762292375e-07, 'samples': 28372800, 'steps': 147774, 'loss/train': 1.2873365879058838} 11/07/2021 18:01:48 - INFO - __main__ - Step 147776: {'lr': 2.787825754739237e-07, 'samples': 28372992, 'steps': 147775, 'loss/train': 0.867586076259613} 11/07/2021 18:01:48 - INFO - __main__ - Step 147777: {'lr': 2.785320872391306e-07, 'samples': 28373184, 'steps': 147776, 'loss/train': 0.9780843257904053} 11/07/2021 18:01:49 - INFO - __main__ - Step 147778: {'lr': 2.7828171152499694e-07, 'samples': 28373376, 'steps': 147777, 'loss/train': 1.4754854440689087} 11/07/2021 18:01:49 - INFO - __main__ - Step 147779: {'lr': 2.780314483315782e-07, 'samples': 28373568, 'steps': 147778, 'loss/train': 1.5670199394226074} 11/07/2021 18:01:50 - INFO - __main__ - Step 147780: {'lr': 2.777812976590688e-07, 'samples': 28373760, 'steps': 147779, 'loss/train': 0.8709926605224609} 11/07/2021 18:01:50 - INFO - __main__ - Step 147781: {'lr': 2.7753125950752413e-07, 'samples': 28373952, 'steps': 147780, 'loss/train': 1.083533763885498} 11/07/2021 18:01:50 - INFO - __main__ - Step 147782: {'lr': 2.772813338770552e-07, 'samples': 28374144, 'steps': 147781, 'loss/train': 1.179606318473816} 11/07/2021 18:01:51 - INFO - __main__ - Step 147783: {'lr': 2.770315207678009e-07, 'samples': 28374336, 'steps': 147782, 'loss/train': 1.2026954889297485} 11/07/2021 18:01:52 - INFO - __main__ - Step 147784: {'lr': 2.767818201798444e-07, 'samples': 28374528, 'steps': 147783, 'loss/train': 0.9401568174362183} 11/07/2021 18:01:52 - INFO - __main__ - Step 147785: {'lr': 2.7653223211335233e-07, 'samples': 28374720, 'steps': 147784, 'loss/train': 0.84019935131073} 11/07/2021 18:01:52 - INFO - __main__ - Step 147786: {'lr': 2.762827565683801e-07, 'samples': 28374912, 'steps': 147785, 'loss/train': 1.723322868347168} 11/07/2021 18:01:53 - INFO - __main__ - Step 147787: {'lr': 2.7603339354506654e-07, 'samples': 28375104, 'steps': 147786, 'loss/train': 0.7085483074188232} 11/07/2021 18:01:54 - INFO - __main__ - Step 147788: {'lr': 2.7578414304349487e-07, 'samples': 28375296, 'steps': 147787, 'loss/train': 1.2061829566955566} 11/07/2021 18:01:54 - INFO - __main__ - Step 147789: {'lr': 2.755350050638317e-07, 'samples': 28375488, 'steps': 147788, 'loss/train': 1.1004798412322998} 11/07/2021 18:01:55 - INFO - __main__ - Step 147790: {'lr': 2.752859796061602e-07, 'samples': 28375680, 'steps': 147789, 'loss/train': 1.225233793258667} 11/07/2021 18:01:55 - INFO - __main__ - Step 147791: {'lr': 2.750370666705637e-07, 'samples': 28375872, 'steps': 147790, 'loss/train': 4.566514015197754} 11/07/2021 18:01:55 - INFO - __main__ - Step 147792: {'lr': 2.7478826625720875e-07, 'samples': 28376064, 'steps': 147791, 'loss/train': 1.1717783212661743} 11/07/2021 18:01:56 - INFO - __main__ - Step 147793: {'lr': 2.745395783661786e-07, 'samples': 28376256, 'steps': 147792, 'loss/train': 1.3243250846862793} 11/07/2021 18:01:57 - INFO - __main__ - Step 147794: {'lr': 2.7429100299758425e-07, 'samples': 28376448, 'steps': 147793, 'loss/train': 0.8137656450271606} 11/07/2021 18:01:57 - INFO - __main__ - Step 147795: {'lr': 2.740425401515367e-07, 'samples': 28376640, 'steps': 147794, 'loss/train': 0.9936792850494385} 11/07/2021 18:01:57 - INFO - __main__ - Step 147796: {'lr': 2.737941898281471e-07, 'samples': 28376832, 'steps': 147795, 'loss/train': 1.3626378774642944} 11/07/2021 18:01:58 - INFO - __main__ - Step 147797: {'lr': 2.735459520275263e-07, 'samples': 28377024, 'steps': 147796, 'loss/train': 1.2579351663589478} 11/07/2021 18:01:58 - INFO - __main__ - Step 147798: {'lr': 2.732978267498132e-07, 'samples': 28377216, 'steps': 147797, 'loss/train': 0.9616263508796692} 11/07/2021 18:01:59 - INFO - __main__ - Step 147799: {'lr': 2.73049813995091e-07, 'samples': 28377408, 'steps': 147798, 'loss/train': 1.3601897954940796} 11/07/2021 18:01:59 - INFO - __main__ - Step 147800: {'lr': 2.728019137634707e-07, 'samples': 28377600, 'steps': 147799, 'loss/train': 1.124029517173767} 11/07/2021 18:02:00 - INFO - __main__ - Step 147801: {'lr': 2.7255412605506347e-07, 'samples': 28377792, 'steps': 147800, 'loss/train': 1.8269883394241333} 11/07/2021 18:02:00 - INFO - __main__ - Step 147802: {'lr': 2.7230645087000794e-07, 'samples': 28377984, 'steps': 147801, 'loss/train': 1.3863798379898071} 11/07/2021 18:02:00 - INFO - __main__ - Step 147803: {'lr': 2.7205888820841516e-07, 'samples': 28378176, 'steps': 147802, 'loss/train': 1.2036731243133545} 11/07/2021 18:02:02 - INFO - __main__ - Step 147804: {'lr': 2.718114380703407e-07, 'samples': 28378368, 'steps': 147803, 'loss/train': 1.2997841835021973} 11/07/2021 18:02:02 - INFO - __main__ - Step 147805: {'lr': 2.7156410045595103e-07, 'samples': 28378560, 'steps': 147804, 'loss/train': 1.4766504764556885} 11/07/2021 18:02:02 - INFO - __main__ - Step 147806: {'lr': 2.713168753653572e-07, 'samples': 28378752, 'steps': 147805, 'loss/train': 1.7000387907028198} 11/07/2021 18:02:03 - INFO - __main__ - Step 147807: {'lr': 2.7106976279861473e-07, 'samples': 28378944, 'steps': 147806, 'loss/train': 1.4214086532592773} 11/07/2021 18:02:03 - INFO - __main__ - Step 147808: {'lr': 2.708227627559179e-07, 'samples': 28379136, 'steps': 147807, 'loss/train': 1.5717908143997192} 11/07/2021 18:02:04 - INFO - __main__ - Step 147809: {'lr': 2.705758752372944e-07, 'samples': 28379328, 'steps': 147808, 'loss/train': 1.1265336275100708} 11/07/2021 18:02:04 - INFO - __main__ - Step 147810: {'lr': 2.7032910024293866e-07, 'samples': 28379520, 'steps': 147809, 'loss/train': 1.372201681137085} 11/07/2021 18:02:05 - INFO - __main__ - Step 147811: {'lr': 2.7008243777287833e-07, 'samples': 28379712, 'steps': 147810, 'loss/train': 1.426541805267334} 11/07/2021 18:02:05 - INFO - __main__ - Step 147812: {'lr': 2.698358878273077e-07, 'samples': 28379904, 'steps': 147811, 'loss/train': 1.3839139938354492} 11/07/2021 18:02:05 - INFO - __main__ - Step 147813: {'lr': 2.695894504062546e-07, 'samples': 28380096, 'steps': 147812, 'loss/train': 1.171671748161316} 11/07/2021 18:02:06 - INFO - __main__ - Step 147814: {'lr': 2.6934312550988547e-07, 'samples': 28380288, 'steps': 147813, 'loss/train': 1.2265619039535522} 11/07/2021 18:02:07 - INFO - __main__ - Step 147815: {'lr': 2.690969131383114e-07, 'samples': 28380480, 'steps': 147814, 'loss/train': 1.2078237533569336} 11/07/2021 18:02:07 - INFO - __main__ - Step 147816: {'lr': 2.6885081329161567e-07, 'samples': 28380672, 'steps': 147815, 'loss/train': 1.143945574760437} 11/07/2021 18:02:07 - INFO - __main__ - Step 147817: {'lr': 2.6860482596993694e-07, 'samples': 28380864, 'steps': 147816, 'loss/train': 1.108141303062439} 11/07/2021 18:02:08 - INFO - __main__ - Step 147818: {'lr': 2.6835895117335864e-07, 'samples': 28381056, 'steps': 147817, 'loss/train': 1.584938645362854} 11/07/2021 18:02:09 - INFO - __main__ - Step 147819: {'lr': 2.681131889019917e-07, 'samples': 28381248, 'steps': 147818, 'loss/train': 1.0997531414031982} 11/07/2021 18:02:09 - INFO - __main__ - Step 147820: {'lr': 2.678675391559748e-07, 'samples': 28381440, 'steps': 147819, 'loss/train': 0.702685534954071} 11/07/2021 18:02:09 - INFO - __main__ - Step 147821: {'lr': 2.676220019353914e-07, 'samples': 28381632, 'steps': 147820, 'loss/train': 1.0484561920166016} 11/07/2021 18:02:10 - INFO - __main__ - Step 147822: {'lr': 2.6737657724038025e-07, 'samples': 28381824, 'steps': 147821, 'loss/train': 1.0816516876220703} 11/07/2021 18:02:10 - INFO - __main__ - Step 147823: {'lr': 2.671312650710245e-07, 'samples': 28382016, 'steps': 147822, 'loss/train': 1.2560522556304932} 11/07/2021 18:02:11 - INFO - __main__ - Step 147824: {'lr': 2.668860654274352e-07, 'samples': 28382208, 'steps': 147823, 'loss/train': 1.3750470876693726} 11/07/2021 18:02:12 - INFO - __main__ - Step 147825: {'lr': 2.666409783097512e-07, 'samples': 28382400, 'steps': 147824, 'loss/train': 0.982012927532196} 11/07/2021 18:02:12 - INFO - __main__ - Step 147826: {'lr': 2.663960037180557e-07, 'samples': 28382592, 'steps': 147825, 'loss/train': 1.0448951721191406} 11/07/2021 18:02:12 - INFO - __main__ - Step 147827: {'lr': 2.661511416524875e-07, 'samples': 28382784, 'steps': 147826, 'loss/train': 2.6686112880706787} 11/07/2021 18:02:13 - INFO - __main__ - Step 147828: {'lr': 2.6590639211312995e-07, 'samples': 28382976, 'steps': 147827, 'loss/train': 1.9864027500152588} 11/07/2021 18:02:13 - INFO - __main__ - Step 147829: {'lr': 2.6566175510009396e-07, 'samples': 28383168, 'steps': 147828, 'loss/train': 1.0636754035949707} 11/07/2021 18:02:14 - INFO - __main__ - Step 147830: {'lr': 2.654172306134905e-07, 'samples': 28383360, 'steps': 147829, 'loss/train': 1.5112438201904297} 11/07/2021 18:02:14 - INFO - __main__ - Step 147831: {'lr': 2.6517281865345854e-07, 'samples': 28383552, 'steps': 147830, 'loss/train': 1.1857049465179443} 11/07/2021 18:02:15 - INFO - __main__ - Step 147832: {'lr': 2.6492851922005345e-07, 'samples': 28383744, 'steps': 147831, 'loss/train': 1.8215402364730835} 11/07/2021 18:02:15 - INFO - __main__ - Step 147833: {'lr': 2.646843323134418e-07, 'samples': 28383936, 'steps': 147832, 'loss/train': 0.9213279485702515} 11/07/2021 18:02:15 - INFO - __main__ - Step 147834: {'lr': 2.6444025793370685e-07, 'samples': 28384128, 'steps': 147833, 'loss/train': 0.9789549708366394} 11/07/2021 18:02:16 - INFO - __main__ - Step 147835: {'lr': 2.6419629608095963e-07, 'samples': 28384320, 'steps': 147834, 'loss/train': 1.2694826126098633} 11/07/2021 18:02:17 - INFO - __main__ - Step 147836: {'lr': 2.6395244675531115e-07, 'samples': 28384512, 'steps': 147835, 'loss/train': 1.2747503519058228} 11/07/2021 18:02:17 - INFO - __main__ - Step 147837: {'lr': 2.6370870995687247e-07, 'samples': 28384704, 'steps': 147836, 'loss/train': 1.4444173574447632} 11/07/2021 18:02:17 - INFO - __main__ - Step 147838: {'lr': 2.6346508568575457e-07, 'samples': 28384896, 'steps': 147837, 'loss/train': 1.4709118604660034} 11/07/2021 18:02:18 - INFO - __main__ - Step 147839: {'lr': 2.632215739420685e-07, 'samples': 28385088, 'steps': 147838, 'loss/train': 1.8320307731628418} 11/07/2021 18:02:19 - INFO - __main__ - Step 147840: {'lr': 2.629781747258975e-07, 'samples': 28385280, 'steps': 147839, 'loss/train': 1.431105375289917} 11/07/2021 18:02:19 - INFO - __main__ - Step 147841: {'lr': 2.6273488803740807e-07, 'samples': 28385472, 'steps': 147840, 'loss/train': 1.5697712898254395} 11/07/2021 18:02:20 - INFO - __main__ - Step 147842: {'lr': 2.6249171387665584e-07, 'samples': 28385664, 'steps': 147841, 'loss/train': 1.412402868270874} 11/07/2021 18:02:20 - INFO - __main__ - Step 147843: {'lr': 2.6224865224377946e-07, 'samples': 28385856, 'steps': 147842, 'loss/train': 1.2890408039093018} 11/07/2021 18:02:20 - INFO - __main__ - Step 147844: {'lr': 2.6200570313889004e-07, 'samples': 28386048, 'steps': 147843, 'loss/train': 1.3872003555297852} 11/07/2021 18:02:21 - INFO - __main__ - Step 147845: {'lr': 2.6176286656207084e-07, 'samples': 28386240, 'steps': 147844, 'loss/train': 1.1298432350158691} 11/07/2021 18:02:22 - INFO - __main__ - Step 147846: {'lr': 2.615201425134606e-07, 'samples': 28386432, 'steps': 147845, 'loss/train': 0.8744367957115173} 11/07/2021 18:02:22 - INFO - __main__ - Step 147847: {'lr': 2.6127753099314254e-07, 'samples': 28386624, 'steps': 147846, 'loss/train': 1.154037594795227} 11/07/2021 18:02:22 - INFO - __main__ - Step 147848: {'lr': 2.610350320012278e-07, 'samples': 28386816, 'steps': 147847, 'loss/train': 1.3824001550674438} 11/07/2021 18:02:23 - INFO - __main__ - Step 147849: {'lr': 2.607926455378551e-07, 'samples': 28387008, 'steps': 147848, 'loss/train': 1.459792137145996} 11/07/2021 18:02:24 - INFO - __main__ - Step 147850: {'lr': 2.6055037160313543e-07, 'samples': 28387200, 'steps': 147849, 'loss/train': 0.8140342235565186} 11/07/2021 18:02:24 - INFO - __main__ - Step 147851: {'lr': 2.6030821019712436e-07, 'samples': 28387392, 'steps': 147850, 'loss/train': 1.9937658309936523} 11/07/2021 18:02:24 - INFO - __main__ - Step 147852: {'lr': 2.6006616131998837e-07, 'samples': 28387584, 'steps': 147851, 'loss/train': 1.283080816268921} 11/07/2021 18:02:25 - INFO - __main__ - Step 147853: {'lr': 2.5982422497178304e-07, 'samples': 28387776, 'steps': 147852, 'loss/train': 1.3074219226837158} 11/07/2021 18:02:25 - INFO - __main__ - Step 147854: {'lr': 2.5958240115267486e-07, 'samples': 28387968, 'steps': 147853, 'loss/train': 1.0148130655288696} 11/07/2021 18:02:26 - INFO - __main__ - Step 147855: {'lr': 2.593406898627193e-07, 'samples': 28388160, 'steps': 147854, 'loss/train': 1.3407162427902222} 11/07/2021 18:02:27 - INFO - __main__ - Step 147856: {'lr': 2.5909909110208295e-07, 'samples': 28388352, 'steps': 147855, 'loss/train': 0.8258092999458313} 11/07/2021 18:02:27 - INFO - __main__ - Step 147857: {'lr': 2.588576048708213e-07, 'samples': 28388544, 'steps': 147856, 'loss/train': 1.2532862424850464} 11/07/2021 18:02:27 - INFO - __main__ - Step 147858: {'lr': 2.586162311690732e-07, 'samples': 28388736, 'steps': 147857, 'loss/train': 1.391814112663269} 11/07/2021 18:02:28 - INFO - __main__ - Step 147859: {'lr': 2.5837496999694955e-07, 'samples': 28388928, 'steps': 147858, 'loss/train': 1.2224854230880737} 11/07/2021 18:02:29 - INFO - __main__ - Step 147860: {'lr': 2.5813382135453367e-07, 'samples': 28389120, 'steps': 147859, 'loss/train': 1.2123167514801025} 11/07/2021 18:02:29 - INFO - __main__ - Step 147861: {'lr': 2.578927852419366e-07, 'samples': 28389312, 'steps': 147860, 'loss/train': 1.5985028743743896} 11/07/2021 18:02:29 - INFO - __main__ - Step 147862: {'lr': 2.576518616592971e-07, 'samples': 28389504, 'steps': 147861, 'loss/train': 1.0602279901504517} 11/07/2021 18:02:30 - INFO - __main__ - Step 147863: {'lr': 2.5741105060669844e-07, 'samples': 28389696, 'steps': 147862, 'loss/train': 1.3097550868988037} 11/07/2021 18:02:30 - INFO - __main__ - Step 147864: {'lr': 2.5717035208425164e-07, 'samples': 28389888, 'steps': 147863, 'loss/train': 1.3880681991577148} 11/07/2021 18:02:31 - INFO - __main__ - Step 147865: {'lr': 2.569297660920955e-07, 'samples': 28390080, 'steps': 147864, 'loss/train': 0.6684610247612} 11/07/2021 18:02:32 - INFO - __main__ - Step 147866: {'lr': 2.566892926302855e-07, 'samples': 28390272, 'steps': 147865, 'loss/train': 0.9858125448226929} 11/07/2021 18:02:32 - INFO - __main__ - Step 147867: {'lr': 2.5644893169896045e-07, 'samples': 28390464, 'steps': 147866, 'loss/train': 1.7548794746398926} 11/07/2021 18:02:32 - INFO - __main__ - Step 147868: {'lr': 2.5620868329823133e-07, 'samples': 28390656, 'steps': 147867, 'loss/train': 1.2752217054367065} 11/07/2021 18:02:33 - INFO - __main__ - Step 147869: {'lr': 2.559685474282092e-07, 'samples': 28390848, 'steps': 147868, 'loss/train': 0.869289219379425} 11/07/2021 18:02:33 - INFO - __main__ - Step 147870: {'lr': 2.55728524089005e-07, 'samples': 28391040, 'steps': 147869, 'loss/train': 0.8966471552848816} 11/07/2021 18:02:34 - INFO - __main__ - Step 147871: {'lr': 2.554886132807022e-07, 'samples': 28391232, 'steps': 147870, 'loss/train': 1.2795475721359253} 11/07/2021 18:02:34 - INFO - __main__ - Step 147872: {'lr': 2.552488150034116e-07, 'samples': 28391424, 'steps': 147871, 'loss/train': 1.151741623878479} 11/07/2021 18:02:35 - INFO - __main__ - Step 147873: {'lr': 2.550091292572443e-07, 'samples': 28391616, 'steps': 147872, 'loss/train': 0.8337490558624268} 11/07/2021 18:02:35 - INFO - __main__ - Step 147874: {'lr': 2.547695560423391e-07, 'samples': 28391808, 'steps': 147873, 'loss/train': 0.7099546790122986} 11/07/2021 18:02:35 - INFO - __main__ - Step 147875: {'lr': 2.5453009535877924e-07, 'samples': 28392000, 'steps': 147874, 'loss/train': 1.2156668901443481} 11/07/2021 18:02:36 - INFO - __main__ - Step 147876: {'lr': 2.5429074720664803e-07, 'samples': 28392192, 'steps': 147875, 'loss/train': 1.1677402257919312} 11/07/2021 18:02:37 - INFO - __main__ - Step 147877: {'lr': 2.5405151158611197e-07, 'samples': 28392384, 'steps': 147876, 'loss/train': 1.229691982269287} 11/07/2021 18:02:37 - INFO - __main__ - Step 147878: {'lr': 2.5381238849722655e-07, 'samples': 28392576, 'steps': 147877, 'loss/train': 1.466813564300537} 11/07/2021 18:02:37 - INFO - __main__ - Step 147879: {'lr': 2.535733779401306e-07, 'samples': 28392768, 'steps': 147878, 'loss/train': 1.3177179098129272} 11/07/2021 18:02:38 - INFO - __main__ - Step 147880: {'lr': 2.5333447991490734e-07, 'samples': 28392960, 'steps': 147879, 'loss/train': 0.8160572052001953} 11/07/2021 18:02:38 - INFO - __main__ - Step 147881: {'lr': 2.5309569442169557e-07, 'samples': 28393152, 'steps': 147880, 'loss/train': 1.0724204778671265} 11/07/2021 18:02:39 - INFO - __main__ - Step 147882: {'lr': 2.5285702146057855e-07, 'samples': 28393344, 'steps': 147881, 'loss/train': 1.073898434638977} 11/07/2021 18:02:40 - INFO - __main__ - Step 147883: {'lr': 2.526184610316673e-07, 'samples': 28393536, 'steps': 147882, 'loss/train': 1.457025170326233} 11/07/2021 18:02:40 - INFO - __main__ - Step 147884: {'lr': 2.523800131350729e-07, 'samples': 28393728, 'steps': 147883, 'loss/train': 1.358131766319275} 11/07/2021 18:02:40 - INFO - __main__ - Step 147885: {'lr': 2.5214167777090626e-07, 'samples': 28393920, 'steps': 147884, 'loss/train': 1.586800456047058} 11/07/2021 18:02:41 - INFO - __main__ - Step 147886: {'lr': 2.5190345493927844e-07, 'samples': 28394112, 'steps': 147885, 'loss/train': 1.0258543491363525} 11/07/2021 18:02:42 - INFO - __main__ - Step 147887: {'lr': 2.516653446402728e-07, 'samples': 28394304, 'steps': 147886, 'loss/train': 0.9702314138412476} 11/07/2021 18:02:42 - INFO - __main__ - Step 147888: {'lr': 2.514273468740003e-07, 'samples': 28394496, 'steps': 147887, 'loss/train': 1.2223914861679077} 11/07/2021 18:02:43 - INFO - __main__ - Step 147889: {'lr': 2.5118946164059965e-07, 'samples': 28394688, 'steps': 147888, 'loss/train': 1.1897010803222656} 11/07/2021 18:02:43 - INFO - __main__ - Step 147890: {'lr': 2.5095168894018194e-07, 'samples': 28394880, 'steps': 147889, 'loss/train': 1.5826411247253418} 11/07/2021 18:02:43 - INFO - __main__ - Step 147891: {'lr': 2.507140287728027e-07, 'samples': 28395072, 'steps': 147890, 'loss/train': 1.0553126335144043} 11/07/2021 18:02:45 - INFO - __main__ - Step 147892: {'lr': 2.504764811386007e-07, 'samples': 28395264, 'steps': 147891, 'loss/train': 0.9719062447547913} 11/07/2021 18:02:45 - INFO - __main__ - Step 147893: {'lr': 2.5023904603768685e-07, 'samples': 28395456, 'steps': 147892, 'loss/train': 1.4159449338912964} 11/07/2021 18:02:46 - INFO - __main__ - Step 147894: {'lr': 2.500017234701446e-07, 'samples': 28395648, 'steps': 147893, 'loss/train': 0.40735945105552673} 11/07/2021 18:02:46 - INFO - __main__ - Step 147895: {'lr': 2.4976451343611264e-07, 'samples': 28395840, 'steps': 147894, 'loss/train': 0.9764859676361084} 11/07/2021 18:02:46 - INFO - __main__ - Step 147896: {'lr': 2.495274159356742e-07, 'samples': 28396032, 'steps': 147895, 'loss/train': 2.814838409423828} 11/07/2021 18:02:47 - INFO - __main__ - Step 147897: {'lr': 2.492904309689681e-07, 'samples': 28396224, 'steps': 147896, 'loss/train': 1.5587165355682373} 11/07/2021 18:02:48 - INFO - __main__ - Step 147898: {'lr': 2.4905355853604984e-07, 'samples': 28396416, 'steps': 147897, 'loss/train': 1.008582592010498} 11/07/2021 18:02:48 - INFO - __main__ - Step 147899: {'lr': 2.488167986370582e-07, 'samples': 28396608, 'steps': 147898, 'loss/train': 1.5132777690887451} 11/07/2021 18:02:48 - INFO - __main__ - Step 147900: {'lr': 2.485801512721042e-07, 'samples': 28396800, 'steps': 147899, 'loss/train': 0.5887227654457092} 11/07/2021 18:02:49 - INFO - __main__ - Step 147901: {'lr': 2.4834361644129887e-07, 'samples': 28396992, 'steps': 147900, 'loss/train': 1.4360982179641724} 11/07/2021 18:02:50 - INFO - __main__ - Step 147902: {'lr': 2.4810719414469774e-07, 'samples': 28397184, 'steps': 147901, 'loss/train': 1.4369398355484009} 11/07/2021 18:02:50 - INFO - __main__ - Step 147903: {'lr': 2.478708843824673e-07, 'samples': 28397376, 'steps': 147902, 'loss/train': 1.1639107465744019} 11/07/2021 18:02:50 - INFO - __main__ - Step 147904: {'lr': 2.4763468715471857e-07, 'samples': 28397568, 'steps': 147903, 'loss/train': 0.9228408336639404} 11/07/2021 18:02:51 - INFO - __main__ - Step 147905: {'lr': 2.4739860246150713e-07, 'samples': 28397760, 'steps': 147904, 'loss/train': 1.1811991930007935} 11/07/2021 18:02:51 - INFO - __main__ - Step 147906: {'lr': 2.4716263030294394e-07, 'samples': 28397952, 'steps': 147905, 'loss/train': 1.3804047107696533} 11/07/2021 18:02:52 - INFO - __main__ - Step 147907: {'lr': 2.469267706791956e-07, 'samples': 28398144, 'steps': 147906, 'loss/train': 1.4615585803985596} 11/07/2021 18:02:52 - INFO - __main__ - Step 147908: {'lr': 2.4669102359028974e-07, 'samples': 28398336, 'steps': 147907, 'loss/train': 1.2734887599945068} 11/07/2021 18:02:53 - INFO - __main__ - Step 147909: {'lr': 2.46455389036393e-07, 'samples': 28398528, 'steps': 147908, 'loss/train': 1.1323671340942383} 11/07/2021 18:02:53 - INFO - __main__ - Step 147910: {'lr': 2.4621986701758857e-07, 'samples': 28398720, 'steps': 147909, 'loss/train': 1.262632131576538} 11/07/2021 18:02:54 - INFO - __main__ - Step 147911: {'lr': 2.459844575339876e-07, 'samples': 28398912, 'steps': 147910, 'loss/train': 1.476236343383789} 11/07/2021 18:02:55 - INFO - __main__ - Step 147912: {'lr': 2.457491605856732e-07, 'samples': 28399104, 'steps': 147911, 'loss/train': 1.3343708515167236} 11/07/2021 18:02:55 - INFO - __main__ - Step 147913: {'lr': 2.4551397617278424e-07, 'samples': 28399296, 'steps': 147912, 'loss/train': 0.9646472334861755} 11/07/2021 18:02:55 - INFO - __main__ - Step 147914: {'lr': 2.45278904295404e-07, 'samples': 28399488, 'steps': 147913, 'loss/train': 1.3396464586257935} 11/07/2021 18:02:56 - INFO - __main__ - Step 147915: {'lr': 2.450439449536712e-07, 'samples': 28399680, 'steps': 147914, 'loss/train': 1.3009071350097656} 11/07/2021 18:02:56 - INFO - __main__ - Step 147916: {'lr': 2.4480909814764143e-07, 'samples': 28399872, 'steps': 147915, 'loss/train': 1.041251540184021} 11/07/2021 18:02:57 - INFO - __main__ - Step 147917: {'lr': 2.4457436387745336e-07, 'samples': 28400064, 'steps': 147916, 'loss/train': 1.4495089054107666} 11/07/2021 18:02:57 - INFO - __main__ - Step 147918: {'lr': 2.4433974214321807e-07, 'samples': 28400256, 'steps': 147917, 'loss/train': 1.6569817066192627} 11/07/2021 18:02:58 - INFO - __main__ - Step 147919: {'lr': 2.441052329450188e-07, 'samples': 28400448, 'steps': 147918, 'loss/train': 1.3354871273040771} 11/07/2021 18:02:58 - INFO - __main__ - Step 147920: {'lr': 2.438708362829667e-07, 'samples': 28400640, 'steps': 147919, 'loss/train': 1.3192414045333862} 11/07/2021 18:02:58 - INFO - __main__ - Step 147921: {'lr': 2.4363655215717264e-07, 'samples': 28400832, 'steps': 147920, 'loss/train': 1.5850863456726074} 11/07/2021 18:02:59 - INFO - __main__ - Step 147922: {'lr': 2.434023805677477e-07, 'samples': 28401024, 'steps': 147921, 'loss/train': 1.2254160642623901} 11/07/2021 18:03:00 - INFO - __main__ - Step 147923: {'lr': 2.431683215148028e-07, 'samples': 28401216, 'steps': 147922, 'loss/train': 1.1206088066101074} 11/07/2021 18:03:00 - INFO - __main__ - Step 147924: {'lr': 2.429343749984214e-07, 'samples': 28401408, 'steps': 147923, 'loss/train': 0.89356529712677} 11/07/2021 18:03:00 - INFO - __main__ - Step 147925: {'lr': 2.427005410187144e-07, 'samples': 28401600, 'steps': 147924, 'loss/train': 1.2702008485794067} 11/07/2021 18:03:01 - INFO - __main__ - Step 147926: {'lr': 2.4246681957579286e-07, 'samples': 28401792, 'steps': 147925, 'loss/train': 1.3545547723770142} 11/07/2021 18:03:01 - INFO - __main__ - Step 147927: {'lr': 2.4223321066976776e-07, 'samples': 28401984, 'steps': 147926, 'loss/train': 0.6519181132316589} 11/07/2021 18:03:02 - INFO - __main__ - Step 147928: {'lr': 2.419997143007502e-07, 'samples': 28402176, 'steps': 147927, 'loss/train': 1.4667078256607056} 11/07/2021 18:03:02 - INFO - __main__ - Step 147929: {'lr': 2.4176633046882337e-07, 'samples': 28402368, 'steps': 147928, 'loss/train': 1.9461321830749512} 11/07/2021 18:03:03 - INFO - __main__ - Step 147930: {'lr': 2.415330591740983e-07, 'samples': 28402560, 'steps': 147929, 'loss/train': 1.2190929651260376} 11/07/2021 18:03:03 - INFO - __main__ - Step 147931: {'lr': 2.4129990041668603e-07, 'samples': 28402752, 'steps': 147930, 'loss/train': 1.2874237298965454} 11/07/2021 18:03:04 - INFO - __main__ - Step 147932: {'lr': 2.410668541966976e-07, 'samples': 28402944, 'steps': 147931, 'loss/train': 1.5215998888015747} 11/07/2021 18:03:05 - INFO - __main__ - Step 147933: {'lr': 2.408339205142163e-07, 'samples': 28403136, 'steps': 147932, 'loss/train': 1.4743821620941162} 11/07/2021 18:03:05 - INFO - __main__ - Step 147934: {'lr': 2.4060109936935304e-07, 'samples': 28403328, 'steps': 147933, 'loss/train': 1.4101120233535767} 11/07/2021 18:03:05 - INFO - __main__ - Step 147935: {'lr': 2.403683907622467e-07, 'samples': 28403520, 'steps': 147934, 'loss/train': 1.5588396787643433} 11/07/2021 18:03:06 - INFO - __main__ - Step 147936: {'lr': 2.4013579469295275e-07, 'samples': 28403712, 'steps': 147935, 'loss/train': 1.1103729009628296} 11/07/2021 18:03:06 - INFO - __main__ - Step 147937: {'lr': 2.3990331116161004e-07, 'samples': 28403904, 'steps': 147936, 'loss/train': 1.2473293542861938} 11/07/2021 18:03:07 - INFO - __main__ - Step 147938: {'lr': 2.396709401683295e-07, 'samples': 28404096, 'steps': 147937, 'loss/train': 1.2854819297790527} 11/07/2021 18:03:07 - INFO - __main__ - Step 147939: {'lr': 2.394386817131944e-07, 'samples': 28404288, 'steps': 147938, 'loss/train': 1.4116970300674438} 11/07/2021 18:03:08 - INFO - __main__ - Step 147940: {'lr': 2.3920653579628806e-07, 'samples': 28404480, 'steps': 147939, 'loss/train': 1.4278554916381836} 11/07/2021 18:03:08 - INFO - __main__ - Step 147941: {'lr': 2.38974502417777e-07, 'samples': 28404672, 'steps': 147940, 'loss/train': 1.7514386177062988} 11/07/2021 18:03:08 - INFO - __main__ - Step 147942: {'lr': 2.3874258157768894e-07, 'samples': 28404864, 'steps': 147941, 'loss/train': 1.3146151304244995} 11/07/2021 18:03:09 - INFO - __main__ - Step 147943: {'lr': 2.385107732761904e-07, 'samples': 28405056, 'steps': 147942, 'loss/train': 1.6986876726150513} 11/07/2021 18:03:10 - INFO - __main__ - Step 147944: {'lr': 2.3827907751336475e-07, 'samples': 28405248, 'steps': 147943, 'loss/train': 1.1298106908798218} 11/07/2021 18:03:10 - INFO - __main__ - Step 147945: {'lr': 2.3804749428932293e-07, 'samples': 28405440, 'steps': 147944, 'loss/train': 1.538727879524231} 11/07/2021 18:03:10 - INFO - __main__ - Step 147946: {'lr': 2.3781602360414823e-07, 'samples': 28405632, 'steps': 147945, 'loss/train': 1.124498724937439} 11/07/2021 18:03:11 - INFO - __main__ - Step 147947: {'lr': 2.375846654579794e-07, 'samples': 28405824, 'steps': 147946, 'loss/train': 1.3094271421432495} 11/07/2021 18:03:12 - INFO - __main__ - Step 147948: {'lr': 2.3735341985089976e-07, 'samples': 28406016, 'steps': 147947, 'loss/train': 1.4128119945526123} 11/07/2021 18:03:12 - INFO - __main__ - Step 147949: {'lr': 2.3712228678299252e-07, 'samples': 28406208, 'steps': 147948, 'loss/train': 0.6811928153038025} 11/07/2021 18:03:13 - INFO - __main__ - Step 147950: {'lr': 2.368912662543965e-07, 'samples': 28406400, 'steps': 147949, 'loss/train': 1.3520740270614624} 11/07/2021 18:03:13 - INFO - __main__ - Step 147951: {'lr': 2.366603582652227e-07, 'samples': 28406592, 'steps': 147950, 'loss/train': 1.6064952611923218} 11/07/2021 18:03:13 - INFO - __main__ - Step 147952: {'lr': 2.3642956281552664e-07, 'samples': 28406784, 'steps': 147951, 'loss/train': 0.9749755263328552} 11/07/2021 18:03:14 - INFO - __main__ - Step 147953: {'lr': 2.3619887990544709e-07, 'samples': 28406976, 'steps': 147952, 'loss/train': 1.0889390707015991} 11/07/2021 18:03:15 - INFO - __main__ - Step 147954: {'lr': 2.3596830953509506e-07, 'samples': 28407168, 'steps': 147953, 'loss/train': 1.2594108581542969} 11/07/2021 18:03:15 - INFO - __main__ - Step 147955: {'lr': 2.3573785170455386e-07, 'samples': 28407360, 'steps': 147954, 'loss/train': 1.2167173624038696} 11/07/2021 18:03:15 - INFO - __main__ - Step 147956: {'lr': 2.3550750641393447e-07, 'samples': 28407552, 'steps': 147955, 'loss/train': 1.2197058200836182} 11/07/2021 18:03:16 - INFO - __main__ - Step 147957: {'lr': 2.352772736633202e-07, 'samples': 28407744, 'steps': 147956, 'loss/train': 1.4016828536987305} 11/07/2021 18:03:16 - INFO - __main__ - Step 147958: {'lr': 2.3504715345284978e-07, 'samples': 28407936, 'steps': 147957, 'loss/train': 1.273316740989685} 11/07/2021 18:03:17 - INFO - __main__ - Step 147959: {'lr': 2.3481714578263425e-07, 'samples': 28408128, 'steps': 147958, 'loss/train': 1.5290100574493408} 11/07/2021 18:03:18 - INFO - __main__ - Step 147960: {'lr': 2.345872506527291e-07, 'samples': 28408320, 'steps': 147959, 'loss/train': 1.524251937866211} 11/07/2021 18:03:18 - INFO - __main__ - Step 147961: {'lr': 2.343574680632732e-07, 'samples': 28408512, 'steps': 147960, 'loss/train': 2.0034987926483154} 11/07/2021 18:03:18 - INFO - __main__ - Step 147962: {'lr': 2.3412779801437746e-07, 'samples': 28408704, 'steps': 147961, 'loss/train': 1.3897302150726318} 11/07/2021 18:03:19 - INFO - __main__ - Step 147963: {'lr': 2.3389824050612518e-07, 'samples': 28408896, 'steps': 147962, 'loss/train': 1.366581678390503} 11/07/2021 18:03:20 - INFO - __main__ - Step 147964: {'lr': 2.3366879553859966e-07, 'samples': 28409088, 'steps': 147963, 'loss/train': 0.8154717087745667} 11/07/2021 18:03:20 - INFO - __main__ - Step 147965: {'lr': 2.334394631119674e-07, 'samples': 28409280, 'steps': 147964, 'loss/train': 0.6821139454841614} 11/07/2021 18:03:20 - INFO - __main__ - Step 147966: {'lr': 2.3321024322625618e-07, 'samples': 28409472, 'steps': 147965, 'loss/train': 1.177587628364563} 11/07/2021 18:03:21 - INFO - __main__ - Step 147967: {'lr': 2.329811358816325e-07, 'samples': 28409664, 'steps': 147966, 'loss/train': 1.232828140258789} 11/07/2021 18:03:21 - INFO - __main__ - Step 147968: {'lr': 2.327521410781519e-07, 'samples': 28409856, 'steps': 147967, 'loss/train': 1.7823749780654907} 11/07/2021 18:03:22 - INFO - __main__ - Step 147969: {'lr': 2.3252325881595314e-07, 'samples': 28410048, 'steps': 147968, 'loss/train': 1.5428204536437988} 11/07/2021 18:03:23 - INFO - __main__ - Step 147970: {'lr': 2.3229448909511953e-07, 'samples': 28410240, 'steps': 147969, 'loss/train': 1.4758810997009277} 11/07/2021 18:03:23 - INFO - __main__ - Step 147971: {'lr': 2.3206583191576203e-07, 'samples': 28410432, 'steps': 147970, 'loss/train': 1.0912162065505981} 11/07/2021 18:03:23 - INFO - __main__ - Step 147972: {'lr': 2.3183728727799168e-07, 'samples': 28410624, 'steps': 147971, 'loss/train': 1.713012933731079} 11/07/2021 18:03:24 - INFO - __main__ - Step 147973: {'lr': 2.316088551818918e-07, 'samples': 28410816, 'steps': 147972, 'loss/train': 1.2260446548461914} 11/07/2021 18:03:25 - INFO - __main__ - Step 147974: {'lr': 2.3138053562757333e-07, 'samples': 28411008, 'steps': 147973, 'loss/train': 0.04548587277531624} 11/07/2021 18:03:25 - INFO - __main__ - Step 147975: {'lr': 2.3115232861514736e-07, 'samples': 28411200, 'steps': 147974, 'loss/train': 0.8905074000358582} 11/07/2021 18:03:25 - INFO - __main__ - Step 147976: {'lr': 2.309242341446971e-07, 'samples': 28411392, 'steps': 147975, 'loss/train': 1.617053747177124} 11/07/2021 18:03:26 - INFO - __main__ - Step 147977: {'lr': 2.3069625221633362e-07, 'samples': 28411584, 'steps': 147976, 'loss/train': 1.1038331985473633} 11/07/2021 18:03:26 - INFO - __main__ - Step 147978: {'lr': 2.3046838283019566e-07, 'samples': 28411776, 'steps': 147977, 'loss/train': 1.4754481315612793} 11/07/2021 18:03:27 - INFO - __main__ - Step 147979: {'lr': 2.3024062598631102e-07, 'samples': 28411968, 'steps': 147978, 'loss/train': 1.4699397087097168} 11/07/2021 18:03:28 - INFO - __main__ - Step 147980: {'lr': 2.300129816848462e-07, 'samples': 28412160, 'steps': 147979, 'loss/train': 1.4383704662322998} 11/07/2021 18:03:28 - INFO - __main__ - Step 147981: {'lr': 2.297854499259122e-07, 'samples': 28412352, 'steps': 147980, 'loss/train': 1.2281211614608765} 11/07/2021 18:03:28 - INFO - __main__ - Step 147982: {'lr': 2.2955803070953684e-07, 'samples': 28412544, 'steps': 147981, 'loss/train': 1.6620219945907593} 11/07/2021 18:03:29 - INFO - __main__ - Step 147983: {'lr': 2.2933072403588662e-07, 'samples': 28412736, 'steps': 147982, 'loss/train': 0.8755142092704773} 11/07/2021 18:03:30 - INFO - __main__ - Step 147984: {'lr': 2.291035299050448e-07, 'samples': 28412928, 'steps': 147983, 'loss/train': 1.4994627237319946} 11/07/2021 18:03:30 - INFO - __main__ - Step 147985: {'lr': 2.288764483171224e-07, 'samples': 28413120, 'steps': 147984, 'loss/train': 1.147247314453125} 11/07/2021 18:03:30 - INFO - __main__ - Step 147986: {'lr': 2.2864947927223045e-07, 'samples': 28413312, 'steps': 147985, 'loss/train': 1.1093393564224243} 11/07/2021 18:03:31 - INFO - __main__ - Step 147987: {'lr': 2.2842262277042447e-07, 'samples': 28413504, 'steps': 147986, 'loss/train': 1.355451226234436} 11/07/2021 18:03:31 - INFO - __main__ - Step 147988: {'lr': 2.2819587881184322e-07, 'samples': 28413696, 'steps': 147987, 'loss/train': 0.9866561889648438} 11/07/2021 18:03:32 - INFO - __main__ - Step 147989: {'lr': 2.2796924739659776e-07, 'samples': 28413888, 'steps': 147988, 'loss/train': 1.3198435306549072} 11/07/2021 18:03:33 - INFO - __main__ - Step 147990: {'lr': 2.2774272852474352e-07, 'samples': 28414080, 'steps': 147989, 'loss/train': 0.5691193342208862} 11/07/2021 18:03:33 - INFO - __main__ - Step 147991: {'lr': 2.2751632219644712e-07, 'samples': 28414272, 'steps': 147990, 'loss/train': 0.8301459550857544} 11/07/2021 18:03:33 - INFO - __main__ - Step 147992: {'lr': 2.2729002841176404e-07, 'samples': 28414464, 'steps': 147991, 'loss/train': 1.633750557899475} 11/07/2021 18:03:34 - INFO - __main__ - Step 147993: {'lr': 2.270638471708053e-07, 'samples': 28414656, 'steps': 147992, 'loss/train': 1.2327730655670166} 11/07/2021 18:03:35 - INFO - __main__ - Step 147994: {'lr': 2.2683777847368192e-07, 'samples': 28414848, 'steps': 147993, 'loss/train': 1.0261403322219849} 11/07/2021 18:03:35 - INFO - __main__ - Step 147995: {'lr': 2.2661182232047717e-07, 'samples': 28415040, 'steps': 147994, 'loss/train': 1.2899060249328613} 11/07/2021 18:03:35 - INFO - __main__ - Step 147996: {'lr': 2.2638597871132982e-07, 'samples': 28415232, 'steps': 147995, 'loss/train': 0.9845693707466125} 11/07/2021 18:03:36 - INFO - __main__ - Step 147997: {'lr': 2.2616024764632315e-07, 'samples': 28415424, 'steps': 147996, 'loss/train': 1.4520940780639648} 11/07/2021 18:03:36 - INFO - __main__ - Step 147998: {'lr': 2.2593462912554042e-07, 'samples': 28415616, 'steps': 147997, 'loss/train': 1.6522212028503418} 11/07/2021 18:03:36 - INFO - __main__ - Step 147999: {'lr': 2.2570912314909264e-07, 'samples': 28415808, 'steps': 147998, 'loss/train': 1.7327865362167358} 11/07/2021 18:03:37 - INFO - __main__ - Step 148000: {'lr': 2.2548372971709085e-07, 'samples': 28416000, 'steps': 147999, 'loss/train': 1.535750389099121} 11/07/2021 18:03:38 - INFO - __main__ - Step 148001: {'lr': 2.252584488296461e-07, 'samples': 28416192, 'steps': 148000, 'loss/train': 1.1368823051452637} 11/07/2021 18:03:38 - INFO - __main__ - Step 148002: {'lr': 2.2503328048681383e-07, 'samples': 28416384, 'steps': 148001, 'loss/train': 1.1756608486175537} 11/07/2021 18:03:38 - INFO - __main__ - Step 148003: {'lr': 2.248082246887606e-07, 'samples': 28416576, 'steps': 148002, 'loss/train': 0.5647555589675903} 11/07/2021 18:03:39 - INFO - __main__ - Step 148004: {'lr': 2.2458328143554197e-07, 'samples': 28416768, 'steps': 148003, 'loss/train': 1.5182775259017944} 11/07/2021 18:03:40 - INFO - __main__ - Step 148005: {'lr': 2.2435845072726891e-07, 'samples': 28416960, 'steps': 148004, 'loss/train': 1.7089651823043823} 11/07/2021 18:03:40 - INFO - __main__ - Step 148006: {'lr': 2.241337325640247e-07, 'samples': 28417152, 'steps': 148005, 'loss/train': 1.0472534894943237} 11/07/2021 18:03:41 - INFO - __main__ - Step 148007: {'lr': 2.2390912694597588e-07, 'samples': 28417344, 'steps': 148006, 'loss/train': 1.2831801176071167} 11/07/2021 18:03:41 - INFO - __main__ - Step 148008: {'lr': 2.236846338731502e-07, 'samples': 28417536, 'steps': 148007, 'loss/train': 0.9709368944168091} 11/07/2021 18:03:41 - INFO - __main__ - Step 148009: {'lr': 2.2346025334568644e-07, 'samples': 28417728, 'steps': 148008, 'loss/train': 1.0157403945922852} 11/07/2021 18:03:42 - INFO - __main__ - Step 148010: {'lr': 2.2323598536366785e-07, 'samples': 28417920, 'steps': 148009, 'loss/train': 1.6103477478027344} 11/07/2021 18:03:43 - INFO - __main__ - Step 148011: {'lr': 2.230118299272055e-07, 'samples': 28418112, 'steps': 148010, 'loss/train': 1.2667224407196045} 11/07/2021 18:03:43 - INFO - __main__ - Step 148012: {'lr': 2.2278778703641034e-07, 'samples': 28418304, 'steps': 148011, 'loss/train': 1.1441353559494019} 11/07/2021 18:03:43 - INFO - __main__ - Step 148013: {'lr': 2.225638566913657e-07, 'samples': 28418496, 'steps': 148012, 'loss/train': 1.1129719018936157} 11/07/2021 18:03:44 - INFO - __main__ - Step 148014: {'lr': 2.2234003889218258e-07, 'samples': 28418688, 'steps': 148013, 'loss/train': 1.105084776878357} 11/07/2021 18:03:45 - INFO - __main__ - Step 148015: {'lr': 2.22116333638972e-07, 'samples': 28418880, 'steps': 148014, 'loss/train': 1.152771234512329} 11/07/2021 18:03:45 - INFO - __main__ - Step 148016: {'lr': 2.2189274093178947e-07, 'samples': 28419072, 'steps': 148015, 'loss/train': 1.2534476518630981} 11/07/2021 18:03:46 - INFO - __main__ - Step 148017: {'lr': 2.216692607708015e-07, 'samples': 28419264, 'steps': 148016, 'loss/train': 1.296269178390503} 11/07/2021 18:03:46 - INFO - __main__ - Step 148018: {'lr': 2.2144589315606367e-07, 'samples': 28419456, 'steps': 148017, 'loss/train': 1.392519235610962} 11/07/2021 18:03:46 - INFO - __main__ - Step 148019: {'lr': 2.2122263808768695e-07, 'samples': 28419648, 'steps': 148018, 'loss/train': 1.4248086214065552} 11/07/2021 18:03:47 - INFO - __main__ - Step 148020: {'lr': 2.2099949556575462e-07, 'samples': 28419840, 'steps': 148019, 'loss/train': 1.3140544891357422} 11/07/2021 18:03:48 - INFO - __main__ - Step 148021: {'lr': 2.2077646559040543e-07, 'samples': 28420032, 'steps': 148020, 'loss/train': 1.5089579820632935} 11/07/2021 18:03:48 - INFO - __main__ - Step 148022: {'lr': 2.2055354816172268e-07, 'samples': 28420224, 'steps': 148021, 'loss/train': 1.6028841733932495} 11/07/2021 18:03:48 - INFO - __main__ - Step 148023: {'lr': 2.2033074327978963e-07, 'samples': 28420416, 'steps': 148022, 'loss/train': 1.2045029401779175} 11/07/2021 18:03:49 - INFO - __main__ - Step 148024: {'lr': 2.201080509447173e-07, 'samples': 28420608, 'steps': 148023, 'loss/train': 1.1876047849655151} 11/07/2021 18:03:50 - INFO - __main__ - Step 148025: {'lr': 2.1988547115664448e-07, 'samples': 28420800, 'steps': 148024, 'loss/train': 1.4843393564224243} 11/07/2021 18:03:50 - INFO - __main__ - Step 148026: {'lr': 2.196630039155989e-07, 'samples': 28420992, 'steps': 148025, 'loss/train': 1.6297005414962769} 11/07/2021 18:03:50 - INFO - __main__ - Step 148027: {'lr': 2.1944064922174712e-07, 'samples': 28421184, 'steps': 148026, 'loss/train': 1.313222050666809} 11/07/2021 18:03:51 - INFO - __main__ - Step 148028: {'lr': 2.1921840707514463e-07, 'samples': 28421376, 'steps': 148027, 'loss/train': 0.9199932217597961} 11/07/2021 18:03:51 - INFO - __main__ - Step 148029: {'lr': 2.1899627747590245e-07, 'samples': 28421568, 'steps': 148028, 'loss/train': 1.6064234972000122} 11/07/2021 18:03:52 - INFO - __main__ - Step 148030: {'lr': 2.1877426042413162e-07, 'samples': 28421760, 'steps': 148029, 'loss/train': 1.2749385833740234} 11/07/2021 18:03:53 - INFO - __main__ - Step 148031: {'lr': 2.1855235591994316e-07, 'samples': 28421952, 'steps': 148030, 'loss/train': 1.2447260618209839} 11/07/2021 18:03:53 - INFO - __main__ - Step 148032: {'lr': 2.1833056396339256e-07, 'samples': 28422144, 'steps': 148031, 'loss/train': 1.5759466886520386} 11/07/2021 18:03:53 - INFO - __main__ - Step 148033: {'lr': 2.181088845546464e-07, 'samples': 28422336, 'steps': 148032, 'loss/train': 1.4622344970703125} 11/07/2021 18:03:54 - INFO - __main__ - Step 148034: {'lr': 2.1788731769373238e-07, 'samples': 28422528, 'steps': 148033, 'loss/train': 1.7632861137390137} 11/07/2021 18:03:55 - INFO - __main__ - Step 148035: {'lr': 2.1766586338078932e-07, 'samples': 28422720, 'steps': 148034, 'loss/train': 1.3583619594573975} 11/07/2021 18:03:55 - INFO - __main__ - Step 148036: {'lr': 2.174445216159282e-07, 'samples': 28422912, 'steps': 148035, 'loss/train': 1.0317561626434326} 11/07/2021 18:03:55 - INFO - __main__ - Step 148037: {'lr': 2.1722329239920458e-07, 'samples': 28423104, 'steps': 148036, 'loss/train': 1.2645591497421265} 11/07/2021 18:03:56 - INFO - __main__ - Step 148038: {'lr': 2.1700217573075721e-07, 'samples': 28423296, 'steps': 148037, 'loss/train': 1.2303986549377441} 11/07/2021 18:03:56 - INFO - __main__ - Step 148039: {'lr': 2.167811716106971e-07, 'samples': 28423488, 'steps': 148038, 'loss/train': 1.1450040340423584} 11/07/2021 18:03:57 - INFO - __main__ - Step 148040: {'lr': 2.1656028003907978e-07, 'samples': 28423680, 'steps': 148039, 'loss/train': 1.2222694158554077} 11/07/2021 18:03:57 - INFO - __main__ - Step 148041: {'lr': 2.163395010160163e-07, 'samples': 28423872, 'steps': 148040, 'loss/train': 1.2777161598205566} 11/07/2021 18:03:58 - INFO - __main__ - Step 148042: {'lr': 2.161188345416454e-07, 'samples': 28424064, 'steps': 148041, 'loss/train': 1.3899860382080078} 11/07/2021 18:03:58 - INFO - __main__ - Step 148043: {'lr': 2.158982806159948e-07, 'samples': 28424256, 'steps': 148042, 'loss/train': 0.9601913094520569} 11/07/2021 18:03:58 - INFO - __main__ - Step 148044: {'lr': 2.1567783923923112e-07, 'samples': 28424448, 'steps': 148043, 'loss/train': 1.2196323871612549} 11/07/2021 18:04:00 - INFO - __main__ - Step 148045: {'lr': 2.1545751041143756e-07, 'samples': 28424640, 'steps': 148044, 'loss/train': 1.0269569158554077} 11/07/2021 18:04:00 - INFO - __main__ - Step 148046: {'lr': 2.1523729413269744e-07, 'samples': 28424832, 'steps': 148045, 'loss/train': 1.1405283212661743} 11/07/2021 18:04:00 - INFO - __main__ - Step 148047: {'lr': 2.1501719040312172e-07, 'samples': 28425024, 'steps': 148046, 'loss/train': 0.9618735909461975} 11/07/2021 18:04:01 - INFO - __main__ - Step 148048: {'lr': 2.1479719922279372e-07, 'samples': 28425216, 'steps': 148047, 'loss/train': 1.2061188220977783} 11/07/2021 18:04:01 - INFO - __main__ - Step 148049: {'lr': 2.1457732059182443e-07, 'samples': 28425408, 'steps': 148048, 'loss/train': 1.432700514793396} 11/07/2021 18:04:01 - INFO - __main__ - Step 148050: {'lr': 2.143575545103249e-07, 'samples': 28425600, 'steps': 148049, 'loss/train': 1.477178692817688} 11/07/2021 18:04:02 - INFO - __main__ - Step 148051: {'lr': 2.1413790097837837e-07, 'samples': 28425792, 'steps': 148050, 'loss/train': 1.215686321258545} 11/07/2021 18:04:03 - INFO - __main__ - Step 148052: {'lr': 2.1391835999606813e-07, 'samples': 28425984, 'steps': 148051, 'loss/train': 1.1609278917312622} 11/07/2021 18:04:03 - INFO - __main__ - Step 148053: {'lr': 2.1369893156353293e-07, 'samples': 28426176, 'steps': 148052, 'loss/train': 1.1292368173599243} 11/07/2021 18:04:03 - INFO - __main__ - Step 148054: {'lr': 2.134796156808283e-07, 'samples': 28426368, 'steps': 148053, 'loss/train': 1.5742629766464233} 11/07/2021 18:04:04 - INFO - __main__ - Step 148055: {'lr': 2.13260412348093e-07, 'samples': 28426560, 'steps': 148054, 'loss/train': 0.9936357140541077} 11/07/2021 18:04:05 - INFO - __main__ - Step 148056: {'lr': 2.1304132156541033e-07, 'samples': 28426752, 'steps': 148055, 'loss/train': 1.1946930885314941} 11/07/2021 18:04:05 - INFO - __main__ - Step 148057: {'lr': 2.1282234333286354e-07, 'samples': 28426944, 'steps': 148056, 'loss/train': 0.9251441955566406} 11/07/2021 18:04:06 - INFO - __main__ - Step 148058: {'lr': 2.1260347765056365e-07, 'samples': 28427136, 'steps': 148057, 'loss/train': 1.6146011352539062} 11/07/2021 18:04:06 - INFO - __main__ - Step 148059: {'lr': 2.123847245186217e-07, 'samples': 28427328, 'steps': 148058, 'loss/train': 1.3282915353775024} 11/07/2021 18:04:06 - INFO - __main__ - Step 148060: {'lr': 2.121660839371209e-07, 'samples': 28427520, 'steps': 148059, 'loss/train': 1.3839848041534424} 11/07/2021 18:04:07 - INFO - __main__ - Step 148061: {'lr': 2.1194755590617232e-07, 'samples': 28427712, 'steps': 148060, 'loss/train': 1.3653841018676758} 11/07/2021 18:04:08 - INFO - __main__ - Step 148062: {'lr': 2.1172914042585922e-07, 'samples': 28427904, 'steps': 148061, 'loss/train': 1.1319141387939453} 11/07/2021 18:04:08 - INFO - __main__ - Step 148063: {'lr': 2.1151083749629263e-07, 'samples': 28428096, 'steps': 148062, 'loss/train': 1.3151001930236816} 11/07/2021 18:04:08 - INFO - __main__ - Step 148064: {'lr': 2.112926471175558e-07, 'samples': 28428288, 'steps': 148063, 'loss/train': 0.5920829772949219} 11/07/2021 18:04:09 - INFO - __main__ - Step 148065: {'lr': 2.1107456928975975e-07, 'samples': 28428480, 'steps': 148064, 'loss/train': 1.2755368947982788} 11/07/2021 18:04:10 - INFO - __main__ - Step 148066: {'lr': 2.1085660401298778e-07, 'samples': 28428672, 'steps': 148065, 'loss/train': 1.4338997602462769} 11/07/2021 18:04:10 - INFO - __main__ - Step 148067: {'lr': 2.1063875128735088e-07, 'samples': 28428864, 'steps': 148066, 'loss/train': 0.9668135643005371} 11/07/2021 18:04:10 - INFO - __main__ - Step 148068: {'lr': 2.104210111129601e-07, 'samples': 28429056, 'steps': 148067, 'loss/train': 1.3781659603118896} 11/07/2021 18:04:11 - INFO - __main__ - Step 148069: {'lr': 2.1020338348989864e-07, 'samples': 28429248, 'steps': 148068, 'loss/train': 1.3938019275665283} 11/07/2021 18:04:11 - INFO - __main__ - Step 148070: {'lr': 2.0998586841824984e-07, 'samples': 28429440, 'steps': 148069, 'loss/train': 1.1954152584075928} 11/07/2021 18:04:12 - INFO - __main__ - Step 148071: {'lr': 2.097684658981247e-07, 'samples': 28429632, 'steps': 148070, 'loss/train': 1.2620763778686523} 11/07/2021 18:04:12 - INFO - __main__ - Step 148072: {'lr': 2.0955117592963424e-07, 'samples': 28429824, 'steps': 148071, 'loss/train': 1.1862753629684448} 11/07/2021 18:04:13 - INFO - __main__ - Step 148073: {'lr': 2.0933399851286173e-07, 'samples': 28430016, 'steps': 148072, 'loss/train': 1.336216688156128} 11/07/2021 18:04:13 - INFO - __main__ - Step 148074: {'lr': 2.0911693364791816e-07, 'samples': 28430208, 'steps': 148073, 'loss/train': 1.3960098028182983} 11/07/2021 18:04:13 - INFO - __main__ - Step 148075: {'lr': 2.0889998133488685e-07, 'samples': 28430400, 'steps': 148074, 'loss/train': 1.364005208015442} 11/07/2021 18:04:14 - INFO - __main__ - Step 148076: {'lr': 2.086831415738788e-07, 'samples': 28430592, 'steps': 148075, 'loss/train': 1.1307411193847656} 11/07/2021 18:04:15 - INFO - __main__ - Step 148077: {'lr': 2.0846641436497726e-07, 'samples': 28430784, 'steps': 148076, 'loss/train': 1.5485037565231323} 11/07/2021 18:04:15 - INFO - __main__ - Step 148078: {'lr': 2.082497997082655e-07, 'samples': 28430976, 'steps': 148077, 'loss/train': 0.9227186441421509} 11/07/2021 18:04:16 - INFO - __main__ - Step 148079: {'lr': 2.0803329760388233e-07, 'samples': 28431168, 'steps': 148078, 'loss/train': 1.2027108669281006} 11/07/2021 18:04:16 - INFO - __main__ - Step 148080: {'lr': 2.07816908051911e-07, 'samples': 28431360, 'steps': 148079, 'loss/train': 1.6497728824615479} 11/07/2021 18:04:16 - INFO - __main__ - Step 148081: {'lr': 2.0760063105243477e-07, 'samples': 28431552, 'steps': 148080, 'loss/train': 1.2633410692214966} 11/07/2021 18:04:17 - INFO - __main__ - Step 148082: {'lr': 2.0738446660556465e-07, 'samples': 28431744, 'steps': 148081, 'loss/train': 0.645044207572937} 11/07/2021 18:04:18 - INFO - __main__ - Step 148083: {'lr': 2.0716841471138392e-07, 'samples': 28431936, 'steps': 148082, 'loss/train': 1.6652600765228271} 11/07/2021 18:04:18 - INFO - __main__ - Step 148084: {'lr': 2.0695247537000362e-07, 'samples': 28432128, 'steps': 148083, 'loss/train': 1.4790928363800049} 11/07/2021 18:04:18 - INFO - __main__ - Step 148085: {'lr': 2.06736648581507e-07, 'samples': 28432320, 'steps': 148084, 'loss/train': 1.9081549644470215} 11/07/2021 18:04:19 - INFO - __main__ - Step 148086: {'lr': 2.0652093434600504e-07, 'samples': 28432512, 'steps': 148085, 'loss/train': 0.6841640472412109} 11/07/2021 18:04:20 - INFO - __main__ - Step 148087: {'lr': 2.0630533266360884e-07, 'samples': 28432704, 'steps': 148086, 'loss/train': 0.7965198755264282} 11/07/2021 18:04:20 - INFO - __main__ - Step 148088: {'lr': 2.0608984353437387e-07, 'samples': 28432896, 'steps': 148087, 'loss/train': 1.2170244455337524} 11/07/2021 18:04:20 - INFO - __main__ - Step 148089: {'lr': 2.0587446695841117e-07, 'samples': 28433088, 'steps': 148088, 'loss/train': 1.6510097980499268} 11/07/2021 18:04:21 - INFO - __main__ - Step 148090: {'lr': 2.0565920293585949e-07, 'samples': 28433280, 'steps': 148089, 'loss/train': 1.1800007820129395} 11/07/2021 18:04:21 - INFO - __main__ - Step 148091: {'lr': 2.054440514667466e-07, 'samples': 28433472, 'steps': 148090, 'loss/train': 1.6025381088256836} 11/07/2021 18:04:22 - INFO - __main__ - Step 148092: {'lr': 2.0522901255123904e-07, 'samples': 28433664, 'steps': 148091, 'loss/train': 1.354076623916626} 11/07/2021 18:04:23 - INFO - __main__ - Step 148093: {'lr': 2.050140861893923e-07, 'samples': 28433856, 'steps': 148092, 'loss/train': 1.3308465480804443} 11/07/2021 18:04:23 - INFO - __main__ - Step 148094: {'lr': 2.047992723812897e-07, 'samples': 28434048, 'steps': 148093, 'loss/train': 1.3094145059585571} 11/07/2021 18:04:23 - INFO - __main__ - Step 148095: {'lr': 2.0458457112706996e-07, 'samples': 28434240, 'steps': 148094, 'loss/train': 0.42140892148017883} 11/07/2021 18:04:24 - INFO - __main__ - Step 148096: {'lr': 2.0436998242681637e-07, 'samples': 28434432, 'steps': 148095, 'loss/train': 1.4085416793823242} 11/07/2021 18:04:25 - INFO - __main__ - Step 148097: {'lr': 2.041555062806122e-07, 'samples': 28434624, 'steps': 148096, 'loss/train': 1.0532768964767456} 11/07/2021 18:04:25 - INFO - __main__ - Step 148098: {'lr': 2.039411426885407e-07, 'samples': 28434816, 'steps': 148097, 'loss/train': 1.121937870979309} 11/07/2021 18:04:25 - INFO - __main__ - Step 148099: {'lr': 2.0372689165074064e-07, 'samples': 28435008, 'steps': 148098, 'loss/train': 1.3452718257904053} 11/07/2021 18:04:26 - INFO - __main__ - Step 148100: {'lr': 2.0351275316729534e-07, 'samples': 28435200, 'steps': 148099, 'loss/train': 0.9462639093399048} 11/07/2021 18:04:26 - INFO - __main__ - Step 148101: {'lr': 2.03298727238288e-07, 'samples': 28435392, 'steps': 148100, 'loss/train': 1.3590714931488037} 11/07/2021 18:04:28 - INFO - __main__ - Step 148102: {'lr': 2.0308481386380195e-07, 'samples': 28435584, 'steps': 148101, 'loss/train': 0.46590423583984375} 11/07/2021 18:04:28 - INFO - __main__ - Step 148103: {'lr': 2.0287101304394818e-07, 'samples': 28435776, 'steps': 148102, 'loss/train': 1.1223753690719604} 11/07/2021 18:04:28 - INFO - __main__ - Step 148104: {'lr': 2.0265732477883769e-07, 'samples': 28435968, 'steps': 148103, 'loss/train': 1.092431902885437} 11/07/2021 18:04:29 - INFO - __main__ - Step 148105: {'lr': 2.024437490685538e-07, 'samples': 28436160, 'steps': 148104, 'loss/train': 0.9801602363586426} 11/07/2021 18:04:29 - INFO - __main__ - Step 148106: {'lr': 2.0223028591320746e-07, 'samples': 28436352, 'steps': 148105, 'loss/train': 1.5696450471878052} 11/07/2021 18:04:29 - INFO - __main__ - Step 148107: {'lr': 2.02016935312882e-07, 'samples': 28436544, 'steps': 148106, 'loss/train': 0.1315273642539978} 11/07/2021 18:04:31 - INFO - __main__ - Step 148108: {'lr': 2.0180369726766068e-07, 'samples': 28436736, 'steps': 148107, 'loss/train': 1.047091007232666} 11/07/2021 18:04:31 - INFO - __main__ - Step 148109: {'lr': 2.0159057177762675e-07, 'samples': 28436928, 'steps': 148108, 'loss/train': 0.5568646788597107} 11/07/2021 18:04:32 - INFO - __main__ - Step 148110: {'lr': 2.0137755884294673e-07, 'samples': 28437120, 'steps': 148109, 'loss/train': 1.3025315999984741} 11/07/2021 18:04:32 - INFO - __main__ - Step 148111: {'lr': 2.011646584636484e-07, 'samples': 28437312, 'steps': 148110, 'loss/train': 1.2210217714309692} 11/07/2021 18:04:33 - INFO - __main__ - Step 148112: {'lr': 2.0095187063984278e-07, 'samples': 28437504, 'steps': 148111, 'loss/train': 1.5334408283233643} 11/07/2021 18:04:33 - INFO - __main__ - Step 148113: {'lr': 2.0073919537166864e-07, 'samples': 28437696, 'steps': 148112, 'loss/train': 1.103118658065796} 11/07/2021 18:04:34 - INFO - __main__ - Step 148114: {'lr': 2.0052663265915373e-07, 'samples': 28437888, 'steps': 148113, 'loss/train': 1.3115646839141846} 11/07/2021 18:04:34 - INFO - __main__ - Step 148115: {'lr': 2.0031418250243684e-07, 'samples': 28438080, 'steps': 148114, 'loss/train': 0.6635581254959106} 11/07/2021 18:04:34 - INFO - __main__ - Step 148116: {'lr': 2.001018449016012e-07, 'samples': 28438272, 'steps': 148115, 'loss/train': 0.6617177128791809} 11/07/2021 18:04:35 - INFO - __main__ - Step 148117: {'lr': 1.998896198567579e-07, 'samples': 28438464, 'steps': 148116, 'loss/train': 1.0775645971298218} 11/07/2021 18:04:36 - INFO - __main__ - Step 148118: {'lr': 1.9967750736796242e-07, 'samples': 28438656, 'steps': 148117, 'loss/train': 1.5047162771224976} 11/07/2021 18:04:37 - INFO - __main__ - Step 148119: {'lr': 1.9946550743535353e-07, 'samples': 28438848, 'steps': 148118, 'loss/train': 1.7188796997070312} 11/07/2021 18:04:37 - INFO - __main__ - Step 148120: {'lr': 1.992536200590145e-07, 'samples': 28439040, 'steps': 148119, 'loss/train': 1.6615643501281738} 11/07/2021 18:04:37 - INFO - __main__ - Step 148121: {'lr': 1.9904184523902858e-07, 'samples': 28439232, 'steps': 148120, 'loss/train': 1.1207631826400757} 11/07/2021 18:04:38 - INFO - __main__ - Step 148122: {'lr': 1.9883018297550681e-07, 'samples': 28439424, 'steps': 148121, 'loss/train': 0.8102813959121704} 11/07/2021 18:04:38 - INFO - __main__ - Step 148123: {'lr': 1.9861863326853248e-07, 'samples': 28439616, 'steps': 148122, 'loss/train': 0.6138293743133545} 11/07/2021 18:04:39 - INFO - __main__ - Step 148124: {'lr': 1.9840719611821656e-07, 'samples': 28439808, 'steps': 148123, 'loss/train': 1.4996364116668701} 11/07/2021 18:04:39 - INFO - __main__ - Step 148125: {'lr': 1.9819587152464235e-07, 'samples': 28440000, 'steps': 148124, 'loss/train': 1.2853280305862427} 11/07/2021 18:04:40 - INFO - __main__ - Step 148126: {'lr': 1.9798465948789314e-07, 'samples': 28440192, 'steps': 148125, 'loss/train': 1.3793953657150269} 11/07/2021 18:04:40 - INFO - __main__ - Step 148127: {'lr': 1.977735600080799e-07, 'samples': 28440384, 'steps': 148126, 'loss/train': 1.0826349258422852} 11/07/2021 18:04:41 - INFO - __main__ - Step 148128: {'lr': 1.9756257308531368e-07, 'samples': 28440576, 'steps': 148127, 'loss/train': 1.340358018875122} 11/07/2021 18:04:42 - INFO - __main__ - Step 148129: {'lr': 1.9735169871965e-07, 'samples': 28440768, 'steps': 148128, 'loss/train': 1.5683205127716064} 11/07/2021 18:04:42 - INFO - __main__ - Step 148130: {'lr': 1.9714093691122757e-07, 'samples': 28440960, 'steps': 148129, 'loss/train': 1.5222479104995728} 11/07/2021 18:04:42 - INFO - __main__ - Step 148131: {'lr': 1.9693028766010203e-07, 'samples': 28441152, 'steps': 148130, 'loss/train': 1.283203363418579} 11/07/2021 18:04:43 - INFO - __main__ - Step 148132: {'lr': 1.9671975096638428e-07, 'samples': 28441344, 'steps': 148131, 'loss/train': 1.258415937423706} 11/07/2021 18:04:43 - INFO - __main__ - Step 148133: {'lr': 1.9650932683015766e-07, 'samples': 28441536, 'steps': 148132, 'loss/train': 1.2760705947875977} 11/07/2021 18:04:43 - INFO - __main__ - Step 148134: {'lr': 1.962990152515609e-07, 'samples': 28441728, 'steps': 148133, 'loss/train': 0.9356074333190918} 11/07/2021 18:04:44 - INFO - __main__ - Step 148135: {'lr': 1.960888162306218e-07, 'samples': 28441920, 'steps': 148134, 'loss/train': 0.5120120644569397} 11/07/2021 18:04:45 - INFO - __main__ - Step 148136: {'lr': 1.9587872976750688e-07, 'samples': 28442112, 'steps': 148135, 'loss/train': 1.4051299095153809} 11/07/2021 18:04:45 - INFO - __main__ - Step 148137: {'lr': 1.9566875586224387e-07, 'samples': 28442304, 'steps': 148136, 'loss/train': 1.2486814260482788} 11/07/2021 18:04:46 - INFO - __main__ - Step 148138: {'lr': 1.9545889451497156e-07, 'samples': 28442496, 'steps': 148137, 'loss/train': 1.0329376459121704} 11/07/2021 18:04:46 - INFO - __main__ - Step 148139: {'lr': 1.9524914572577325e-07, 'samples': 28442688, 'steps': 148138, 'loss/train': 0.8983304500579834} 11/07/2021 18:04:48 - INFO - __main__ - Step 148140: {'lr': 1.9503950949473214e-07, 'samples': 28442880, 'steps': 148139, 'loss/train': 1.4256261587142944} 11/07/2021 18:04:48 - INFO - __main__ - Step 148141: {'lr': 1.9482998582195932e-07, 'samples': 28443072, 'steps': 148140, 'loss/train': 1.6244271993637085} 11/07/2021 18:04:49 - INFO - __main__ - Step 148142: {'lr': 1.9462057470753803e-07, 'samples': 28443264, 'steps': 148141, 'loss/train': 0.9909226298332214} 11/07/2021 18:04:49 - INFO - __main__ - Step 148143: {'lr': 1.944112761515515e-07, 'samples': 28443456, 'steps': 148142, 'loss/train': 0.9983658194541931} 11/07/2021 18:04:49 - INFO - __main__ - Step 148144: {'lr': 1.942020901541386e-07, 'samples': 28443648, 'steps': 148143, 'loss/train': 0.18644289672374725} 11/07/2021 18:04:50 - INFO - __main__ - Step 148145: {'lr': 1.9399301671535475e-07, 'samples': 28443840, 'steps': 148144, 'loss/train': 0.17297394573688507} 11/07/2021 18:04:50 - INFO - __main__ - Step 148146: {'lr': 1.9378405583528325e-07, 'samples': 28444032, 'steps': 148145, 'loss/train': 0.12352611869573593} 11/07/2021 18:04:51 - INFO - __main__ - Step 148147: {'lr': 1.935752075140629e-07, 'samples': 28444224, 'steps': 148146, 'loss/train': 1.4275259971618652} 11/07/2021 18:04:52 - INFO - __main__ - Step 148148: {'lr': 1.9336647175174915e-07, 'samples': 28444416, 'steps': 148147, 'loss/train': 0.9639179110527039} 11/07/2021 18:04:52 - INFO - __main__ - Step 148149: {'lr': 1.9315784854845308e-07, 'samples': 28444608, 'steps': 148148, 'loss/train': 0.47268205881118774} 11/07/2021 18:04:52 - INFO - __main__ - Step 148150: {'lr': 1.9294933790425796e-07, 'samples': 28444800, 'steps': 148149, 'loss/train': 1.3218934535980225} 11/07/2021 18:04:53 - INFO - __main__ - Step 148151: {'lr': 1.9274093981927476e-07, 'samples': 28444992, 'steps': 148150, 'loss/train': 1.8972126245498657} 11/07/2021 18:04:54 - INFO - __main__ - Step 148152: {'lr': 1.925326542935868e-07, 'samples': 28445184, 'steps': 148151, 'loss/train': 0.9252461791038513} 11/07/2021 18:04:54 - INFO - __main__ - Step 148153: {'lr': 1.9232448132727732e-07, 'samples': 28445376, 'steps': 148152, 'loss/train': 1.4595692157745361} 11/07/2021 18:04:55 - INFO - __main__ - Step 148154: {'lr': 1.9211642092045732e-07, 'samples': 28445568, 'steps': 148153, 'loss/train': 1.299556016921997} 11/07/2021 18:04:55 - INFO - __main__ - Step 148155: {'lr': 1.9190847307321014e-07, 'samples': 28445760, 'steps': 148154, 'loss/train': 1.4582955837249756} 11/07/2021 18:04:55 - INFO - __main__ - Step 148156: {'lr': 1.9170063778564673e-07, 'samples': 28445952, 'steps': 148155, 'loss/train': 1.311500072479248} 11/07/2021 18:04:56 - INFO - __main__ - Step 148157: {'lr': 1.9149291505785038e-07, 'samples': 28446144, 'steps': 148156, 'loss/train': 1.0690381526947021} 11/07/2021 18:04:57 - INFO - __main__ - Step 148158: {'lr': 1.9128530488990435e-07, 'samples': 28446336, 'steps': 148157, 'loss/train': 0.8652459979057312} 11/07/2021 18:04:57 - INFO - __main__ - Step 148159: {'lr': 1.910778072819197e-07, 'samples': 28446528, 'steps': 148158, 'loss/train': 1.3805500268936157} 11/07/2021 18:04:57 - INFO - __main__ - Step 148160: {'lr': 1.908704222339519e-07, 'samples': 28446720, 'steps': 148159, 'loss/train': 1.0556116104125977} 11/07/2021 18:04:58 - INFO - __main__ - Step 148161: {'lr': 1.9066314974613973e-07, 'samples': 28446912, 'steps': 148160, 'loss/train': 1.1561968326568604} 11/07/2021 18:04:59 - INFO - __main__ - Step 148162: {'lr': 1.9045598981856648e-07, 'samples': 28447104, 'steps': 148161, 'loss/train': 0.8609744310379028} 11/07/2021 18:04:59 - INFO - __main__ - Step 148163: {'lr': 1.902489424513154e-07, 'samples': 28447296, 'steps': 148162, 'loss/train': 0.9391859769821167} 11/07/2021 18:04:59 - INFO - __main__ - Step 148164: {'lr': 1.9004200764449752e-07, 'samples': 28447488, 'steps': 148163, 'loss/train': 1.431484341621399} 11/07/2021 18:05:00 - INFO - __main__ - Step 148165: {'lr': 1.8983518539816837e-07, 'samples': 28447680, 'steps': 148164, 'loss/train': 1.2954437732696533} 11/07/2021 18:05:00 - INFO - __main__ - Step 148166: {'lr': 1.8962847571246666e-07, 'samples': 28447872, 'steps': 148165, 'loss/train': 0.993301272392273} 11/07/2021 18:05:00 - INFO - __main__ - Step 148167: {'lr': 1.8942187858744797e-07, 'samples': 28448064, 'steps': 148166, 'loss/train': 1.2506828308105469} 11/07/2021 18:05:02 - INFO - __main__ - Step 148168: {'lr': 1.892153940232233e-07, 'samples': 28448256, 'steps': 148167, 'loss/train': 0.7283466458320618} 11/07/2021 18:05:02 - INFO - __main__ - Step 148169: {'lr': 1.8900902201987591e-07, 'samples': 28448448, 'steps': 148168, 'loss/train': 1.430280327796936} 11/07/2021 18:05:02 - INFO - __main__ - Step 148170: {'lr': 1.8880276257751684e-07, 'samples': 28448640, 'steps': 148169, 'loss/train': 1.4746208190917969} 11/07/2021 18:05:03 - INFO - __main__ - Step 148171: {'lr': 1.8859661569622933e-07, 'samples': 28448832, 'steps': 148170, 'loss/train': 1.4992244243621826} 11/07/2021 18:05:03 - INFO - __main__ - Step 148172: {'lr': 1.8839058137612442e-07, 'samples': 28449024, 'steps': 148171, 'loss/train': 1.5026280879974365} 11/07/2021 18:05:04 - INFO - __main__ - Step 148173: {'lr': 1.8818465961722986e-07, 'samples': 28449216, 'steps': 148172, 'loss/train': 1.4126156568527222} 11/07/2021 18:05:04 - INFO - __main__ - Step 148174: {'lr': 1.879788504197122e-07, 'samples': 28449408, 'steps': 148173, 'loss/train': 1.0700175762176514} 11/07/2021 18:05:05 - INFO - __main__ - Step 148175: {'lr': 1.8777315378362692e-07, 'samples': 28449600, 'steps': 148174, 'loss/train': 0.8013189435005188} 11/07/2021 18:05:05 - INFO - __main__ - Step 148176: {'lr': 1.8756756970908506e-07, 'samples': 28449792, 'steps': 148175, 'loss/train': 1.2981444597244263} 11/07/2021 18:05:05 - INFO - __main__ - Step 148177: {'lr': 1.8736209819616988e-07, 'samples': 28449984, 'steps': 148176, 'loss/train': 1.3177393674850464} 11/07/2021 18:05:07 - INFO - __main__ - Step 148178: {'lr': 1.8715673924499244e-07, 'samples': 28450176, 'steps': 148177, 'loss/train': 1.0945156812667847} 11/07/2021 18:05:07 - INFO - __main__ - Step 148179: {'lr': 1.869514928556082e-07, 'samples': 28450368, 'steps': 148178, 'loss/train': 0.5807943344116211} 11/07/2021 18:05:07 - INFO - __main__ - Step 148180: {'lr': 1.867463590281282e-07, 'samples': 28450560, 'steps': 148179, 'loss/train': 0.863661527633667} 11/07/2021 18:05:08 - INFO - __main__ - Step 148181: {'lr': 1.865413377626357e-07, 'samples': 28450752, 'steps': 148180, 'loss/train': 1.2019137144088745} 11/07/2021 18:05:08 - INFO - __main__ - Step 148182: {'lr': 1.8633642905924175e-07, 'samples': 28450944, 'steps': 148181, 'loss/train': 1.6723214387893677} 11/07/2021 18:05:09 - INFO - __main__ - Step 148183: {'lr': 1.8613163291802959e-07, 'samples': 28451136, 'steps': 148182, 'loss/train': 1.524882435798645} 11/07/2021 18:05:09 - INFO - __main__ - Step 148184: {'lr': 1.859269493390825e-07, 'samples': 28451328, 'steps': 148183, 'loss/train': 1.240620732307434} 11/07/2021 18:05:10 - INFO - __main__ - Step 148185: {'lr': 1.857223783225115e-07, 'samples': 28451520, 'steps': 148184, 'loss/train': 1.470814824104309} 11/07/2021 18:05:10 - INFO - __main__ - Step 148186: {'lr': 1.8551791986839983e-07, 'samples': 28451712, 'steps': 148185, 'loss/train': 1.2556569576263428} 11/07/2021 18:05:10 - INFO - __main__ - Step 148187: {'lr': 1.8531357397685856e-07, 'samples': 28451904, 'steps': 148186, 'loss/train': 1.3668208122253418} 11/07/2021 18:05:11 - INFO - __main__ - Step 148188: {'lr': 1.8510934064791542e-07, 'samples': 28452096, 'steps': 148187, 'loss/train': 1.42155122756958} 11/07/2021 18:05:12 - INFO - __main__ - Step 148189: {'lr': 1.8490521988173692e-07, 'samples': 28452288, 'steps': 148188, 'loss/train': 1.4590375423431396} 11/07/2021 18:05:12 - INFO - __main__ - Step 148190: {'lr': 1.847012116783786e-07, 'samples': 28452480, 'steps': 148189, 'loss/train': 0.951363742351532} 11/07/2021 18:05:12 - INFO - __main__ - Step 148191: {'lr': 1.844973160379515e-07, 'samples': 28452672, 'steps': 148190, 'loss/train': 1.3481011390686035} 11/07/2021 18:05:13 - INFO - __main__ - Step 148192: {'lr': 1.842935329605111e-07, 'samples': 28452864, 'steps': 148191, 'loss/train': 1.1261017322540283} 11/07/2021 18:05:13 - INFO - __main__ - Step 148193: {'lr': 1.8408986244619618e-07, 'samples': 28453056, 'steps': 148192, 'loss/train': 1.7298940420150757} 11/07/2021 18:05:14 - INFO - __main__ - Step 148194: {'lr': 1.8388630449506228e-07, 'samples': 28453248, 'steps': 148193, 'loss/train': 1.3309251070022583} 11/07/2021 18:05:15 - INFO - __main__ - Step 148195: {'lr': 1.8368285910722038e-07, 'samples': 28453440, 'steps': 148194, 'loss/train': 1.3488069772720337} 11/07/2021 18:05:15 - INFO - __main__ - Step 148196: {'lr': 1.8347952628275376e-07, 'samples': 28453632, 'steps': 148195, 'loss/train': 1.5118635892868042} 11/07/2021 18:05:16 - INFO - __main__ - Step 148197: {'lr': 1.8327630602174572e-07, 'samples': 28453824, 'steps': 148196, 'loss/train': 1.5282717943191528} 11/07/2021 18:05:16 - INFO - __main__ - Step 148198: {'lr': 1.8307319832430725e-07, 'samples': 28454016, 'steps': 148197, 'loss/train': 1.9905565977096558} 11/07/2021 18:05:16 - INFO - __main__ - Step 148199: {'lr': 1.828702031905216e-07, 'samples': 28454208, 'steps': 148198, 'loss/train': 1.2195147275924683} 11/07/2021 18:05:17 - INFO - __main__ - Step 148200: {'lr': 1.8266732062049984e-07, 'samples': 28454400, 'steps': 148199, 'loss/train': 1.4180444478988647} 11/07/2021 18:05:18 - INFO - __main__ - Step 148201: {'lr': 1.8246455061429746e-07, 'samples': 28454592, 'steps': 148200, 'loss/train': 1.273054599761963} 11/07/2021 18:05:18 - INFO - __main__ - Step 148202: {'lr': 1.822618931719977e-07, 'samples': 28454784, 'steps': 148201, 'loss/train': 1.3804723024368286} 11/07/2021 18:05:18 - INFO - __main__ - Step 148203: {'lr': 1.8205934829373937e-07, 'samples': 28454976, 'steps': 148202, 'loss/train': 1.5672129392623901} 11/07/2021 18:05:19 - INFO - __main__ - Step 148204: {'lr': 1.8185691597957798e-07, 'samples': 28455168, 'steps': 148203, 'loss/train': 1.2978724241256714} 11/07/2021 18:05:20 - INFO - __main__ - Step 148205: {'lr': 1.8165459622962456e-07, 'samples': 28455360, 'steps': 148204, 'loss/train': 0.9428514242172241} 11/07/2021 18:05:20 - INFO - __main__ - Step 148206: {'lr': 1.8145238904399008e-07, 'samples': 28455552, 'steps': 148205, 'loss/train': 1.3642498254776} 11/07/2021 18:05:20 - INFO - __main__ - Step 148207: {'lr': 1.8125029442270236e-07, 'samples': 28455744, 'steps': 148206, 'loss/train': 0.9380931258201599} 11/07/2021 18:05:21 - INFO - __main__ - Step 148208: {'lr': 1.8104831236590014e-07, 'samples': 28455936, 'steps': 148207, 'loss/train': 1.0009839534759521} 11/07/2021 18:05:21 - INFO - __main__ - Step 148209: {'lr': 1.808464428736667e-07, 'samples': 28456128, 'steps': 148208, 'loss/train': 1.2042245864868164} 11/07/2021 18:05:22 - INFO - __main__ - Step 148210: {'lr': 1.806446859460853e-07, 'samples': 28456320, 'steps': 148209, 'loss/train': 1.3067010641098022} 11/07/2021 18:05:23 - INFO - __main__ - Step 148211: {'lr': 1.804430415832392e-07, 'samples': 28456512, 'steps': 148210, 'loss/train': 1.1645288467407227} 11/07/2021 18:05:23 - INFO - __main__ - Step 148212: {'lr': 1.8024150978523946e-07, 'samples': 28456704, 'steps': 148211, 'loss/train': 1.3340163230895996} 11/07/2021 18:05:23 - INFO - __main__ - Step 148213: {'lr': 1.800400905521693e-07, 'samples': 28456896, 'steps': 148212, 'loss/train': 1.2568265199661255} 11/07/2021 18:05:24 - INFO - __main__ - Step 148214: {'lr': 1.79838783884112e-07, 'samples': 28457088, 'steps': 148213, 'loss/train': 1.284621000289917} 11/07/2021 18:05:25 - INFO - __main__ - Step 148215: {'lr': 1.796375897811786e-07, 'samples': 28457280, 'steps': 148214, 'loss/train': 1.322159767150879} 11/07/2021 18:05:25 - INFO - __main__ - Step 148216: {'lr': 1.794365082434246e-07, 'samples': 28457472, 'steps': 148215, 'loss/train': 1.488923192024231} 11/07/2021 18:05:25 - INFO - __main__ - Step 148217: {'lr': 1.7923553927096102e-07, 'samples': 28457664, 'steps': 148216, 'loss/train': 1.363006591796875} 11/07/2021 18:05:26 - INFO - __main__ - Step 148218: {'lr': 1.790346828638989e-07, 'samples': 28457856, 'steps': 148217, 'loss/train': 1.1888113021850586} 11/07/2021 18:05:26 - INFO - __main__ - Step 148219: {'lr': 1.788339390222937e-07, 'samples': 28458048, 'steps': 148218, 'loss/train': 1.4537924528121948} 11/07/2021 18:05:27 - INFO - __main__ - Step 148220: {'lr': 1.7863330774625652e-07, 'samples': 28458240, 'steps': 148219, 'loss/train': 1.2255045175552368} 11/07/2021 18:05:27 - INFO - __main__ - Step 148221: {'lr': 1.784327890358983e-07, 'samples': 28458432, 'steps': 148220, 'loss/train': 1.1726536750793457} 11/07/2021 18:05:28 - INFO - __main__ - Step 148222: {'lr': 1.782323828912469e-07, 'samples': 28458624, 'steps': 148221, 'loss/train': 1.4860334396362305} 11/07/2021 18:05:28 - INFO - __main__ - Step 148223: {'lr': 1.7803208931244096e-07, 'samples': 28458816, 'steps': 148222, 'loss/train': 1.376452088356018} 11/07/2021 18:05:28 - INFO - __main__ - Step 148224: {'lr': 1.7783190829956385e-07, 'samples': 28459008, 'steps': 148223, 'loss/train': 1.1037113666534424} 11/07/2021 18:05:29 - INFO - __main__ - Step 148225: {'lr': 1.7763183985269881e-07, 'samples': 28459200, 'steps': 148224, 'loss/train': 1.4388577938079834} 11/07/2021 18:05:30 - INFO - __main__ - Step 148226: {'lr': 1.7743188397192912e-07, 'samples': 28459392, 'steps': 148225, 'loss/train': 1.056257724761963} 11/07/2021 18:05:30 - INFO - __main__ - Step 148227: {'lr': 1.7723204065736575e-07, 'samples': 28459584, 'steps': 148226, 'loss/train': 0.960342288017273} 11/07/2021 18:05:30 - INFO - __main__ - Step 148228: {'lr': 1.7703230990906427e-07, 'samples': 28459776, 'steps': 148227, 'loss/train': 2.2034521102905273} 11/07/2021 18:05:31 - INFO - __main__ - Step 148229: {'lr': 1.7683269172716344e-07, 'samples': 28459968, 'steps': 148228, 'loss/train': 1.332168459892273} 11/07/2021 18:05:32 - INFO - __main__ - Step 148230: {'lr': 1.7663318611171875e-07, 'samples': 28460160, 'steps': 148229, 'loss/train': 1.0341451168060303} 11/07/2021 18:05:32 - INFO - __main__ - Step 148231: {'lr': 1.764337930628135e-07, 'samples': 28460352, 'steps': 148230, 'loss/train': 1.0391020774841309} 11/07/2021 18:05:33 - INFO - __main__ - Step 148232: {'lr': 1.7623451258055868e-07, 'samples': 28460544, 'steps': 148231, 'loss/train': 1.323881983757019} 11/07/2021 18:05:33 - INFO - __main__ - Step 148233: {'lr': 1.7603534466503757e-07, 'samples': 28460736, 'steps': 148232, 'loss/train': 1.1954606771469116} 11/07/2021 18:05:33 - INFO - __main__ - Step 148234: {'lr': 1.758362893163612e-07, 'samples': 28460928, 'steps': 148233, 'loss/train': 1.0260155200958252} 11/07/2021 18:05:34 - INFO - __main__ - Step 148235: {'lr': 1.7563734653458508e-07, 'samples': 28461120, 'steps': 148234, 'loss/train': 1.499428629875183} 11/07/2021 18:05:35 - INFO - __main__ - Step 148236: {'lr': 1.7543851631979245e-07, 'samples': 28461312, 'steps': 148235, 'loss/train': 1.3625644445419312} 11/07/2021 18:05:35 - INFO - __main__ - Step 148237: {'lr': 1.7523979867212213e-07, 'samples': 28461504, 'steps': 148236, 'loss/train': 0.5349995493888855} 11/07/2021 18:05:35 - INFO - __main__ - Step 148238: {'lr': 1.7504119359160187e-07, 'samples': 28461696, 'steps': 148237, 'loss/train': 1.9153895378112793} 11/07/2021 18:05:36 - INFO - __main__ - Step 148239: {'lr': 1.7484270107837043e-07, 'samples': 28461888, 'steps': 148238, 'loss/train': 1.461181879043579} 11/07/2021 18:05:36 - INFO - __main__ - Step 148240: {'lr': 1.7464432113251106e-07, 'samples': 28462080, 'steps': 148239, 'loss/train': 1.828113079071045} 11/07/2021 18:05:37 - INFO - __main__ - Step 148241: {'lr': 1.744460537540793e-07, 'samples': 28462272, 'steps': 148240, 'loss/train': 1.2768561840057373} 11/07/2021 18:05:37 - INFO - __main__ - Step 148242: {'lr': 1.7424789894321392e-07, 'samples': 28462464, 'steps': 148241, 'loss/train': 1.1899627447128296} 11/07/2021 18:05:38 - INFO - __main__ - Step 148243: {'lr': 1.7404985669997043e-07, 'samples': 28462656, 'steps': 148242, 'loss/train': 0.7265334129333496} 11/07/2021 18:05:38 - INFO - __main__ - Step 148244: {'lr': 1.738519270244321e-07, 'samples': 28462848, 'steps': 148243, 'loss/train': 1.549454689025879} 11/07/2021 18:05:38 - INFO - __main__ - Step 148245: {'lr': 1.7365410991670993e-07, 'samples': 28463040, 'steps': 148244, 'loss/train': 1.590097188949585} 11/07/2021 18:05:40 - INFO - __main__ - Step 148246: {'lr': 1.7345640537685947e-07, 'samples': 28463232, 'steps': 148245, 'loss/train': 0.762240469455719} 11/07/2021 18:05:40 - INFO - __main__ - Step 148247: {'lr': 1.7325881340501948e-07, 'samples': 28463424, 'steps': 148246, 'loss/train': 2.6371686458587646} 11/07/2021 18:05:40 - INFO - __main__ - Step 148248: {'lr': 1.7306133400124547e-07, 'samples': 28463616, 'steps': 148247, 'loss/train': 1.3524956703186035} 11/07/2021 18:05:41 - INFO - __main__ - Step 148249: {'lr': 1.7286396716564845e-07, 'samples': 28463808, 'steps': 148248, 'loss/train': 1.6633731126785278} 11/07/2021 18:05:41 - INFO - __main__ - Step 148250: {'lr': 1.726667128983117e-07, 'samples': 28464000, 'steps': 148249, 'loss/train': 1.1500866413116455} 11/07/2021 18:05:42 - INFO - __main__ - Step 148251: {'lr': 1.7246957119929075e-07, 'samples': 28464192, 'steps': 148250, 'loss/train': 2.0120201110839844} 11/07/2021 18:05:42 - INFO - __main__ - Step 148252: {'lr': 1.7227254206869657e-07, 'samples': 28464384, 'steps': 148251, 'loss/train': 1.2328054904937744} 11/07/2021 18:05:43 - INFO - __main__ - Step 148253: {'lr': 1.7207562550664024e-07, 'samples': 28464576, 'steps': 148252, 'loss/train': 1.3470700979232788} 11/07/2021 18:05:43 - INFO - __main__ - Step 148254: {'lr': 1.7187882151317725e-07, 'samples': 28464768, 'steps': 148253, 'loss/train': 0.435088187456131} 11/07/2021 18:05:43 - INFO - __main__ - Step 148255: {'lr': 1.716821300884186e-07, 'samples': 28464960, 'steps': 148254, 'loss/train': 1.0304553508758545} 11/07/2021 18:05:44 - INFO - __main__ - Step 148256: {'lr': 1.714855512324476e-07, 'samples': 28465152, 'steps': 148255, 'loss/train': 0.9011018872261047} 11/07/2021 18:05:45 - INFO - __main__ - Step 148257: {'lr': 1.7128908494534746e-07, 'samples': 28465344, 'steps': 148256, 'loss/train': 1.363210916519165} 11/07/2021 18:05:45 - INFO - __main__ - Step 148258: {'lr': 1.7109273122720149e-07, 'samples': 28465536, 'steps': 148257, 'loss/train': 1.0587197542190552} 11/07/2021 18:05:46 - INFO - __main__ - Step 148259: {'lr': 1.7089649007812069e-07, 'samples': 28465728, 'steps': 148258, 'loss/train': 1.1098898649215698} 11/07/2021 18:05:46 - INFO - __main__ - Step 148260: {'lr': 1.707003614981606e-07, 'samples': 28465920, 'steps': 148259, 'loss/train': 0.5996752381324768} 11/07/2021 18:05:47 - INFO - __main__ - Step 148261: {'lr': 1.7050434548745995e-07, 'samples': 28466112, 'steps': 148260, 'loss/train': 1.272194504737854} 11/07/2021 18:05:47 - INFO - __main__ - Step 148262: {'lr': 1.7030844204604657e-07, 'samples': 28466304, 'steps': 148261, 'loss/train': 1.1918264627456665} 11/07/2021 18:05:48 - INFO - __main__ - Step 148263: {'lr': 1.701126511740314e-07, 'samples': 28466496, 'steps': 148262, 'loss/train': 1.7284308671951294} 11/07/2021 18:05:48 - INFO - __main__ - Step 148264: {'lr': 1.6991697287152552e-07, 'samples': 28466688, 'steps': 148263, 'loss/train': 1.5107617378234863} 11/07/2021 18:05:48 - INFO - __main__ - Step 148265: {'lr': 1.6972140713861216e-07, 'samples': 28466880, 'steps': 148264, 'loss/train': 1.1748056411743164} 11/07/2021 18:05:49 - INFO - __main__ - Step 148266: {'lr': 1.6952595397534687e-07, 'samples': 28467072, 'steps': 148265, 'loss/train': 1.5076267719268799} 11/07/2021 18:05:50 - INFO - __main__ - Step 148267: {'lr': 1.6933061338184065e-07, 'samples': 28467264, 'steps': 148266, 'loss/train': 1.3504738807678223} 11/07/2021 18:05:50 - INFO - __main__ - Step 148268: {'lr': 1.6913538535817673e-07, 'samples': 28467456, 'steps': 148267, 'loss/train': 1.231734037399292} 11/07/2021 18:05:51 - INFO - __main__ - Step 148269: {'lr': 1.6894026990443846e-07, 'samples': 28467648, 'steps': 148268, 'loss/train': 1.7246867418289185} 11/07/2021 18:05:51 - INFO - __main__ - Step 148270: {'lr': 1.6874526702073678e-07, 'samples': 28467840, 'steps': 148269, 'loss/train': 1.6253089904785156} 11/07/2021 18:05:51 - INFO - __main__ - Step 148271: {'lr': 1.6855037670712724e-07, 'samples': 28468032, 'steps': 148270, 'loss/train': 1.2343086004257202} 11/07/2021 18:05:52 - INFO - __main__ - Step 148272: {'lr': 1.6835559896374863e-07, 'samples': 28468224, 'steps': 148271, 'loss/train': 1.4721927642822266} 11/07/2021 18:05:53 - INFO - __main__ - Step 148273: {'lr': 1.681609337906287e-07, 'samples': 28468416, 'steps': 148272, 'loss/train': 0.9942216873168945} 11/07/2021 18:05:53 - INFO - __main__ - Step 148274: {'lr': 1.6796638118787843e-07, 'samples': 28468608, 'steps': 148273, 'loss/train': 1.4742481708526611} 11/07/2021 18:05:54 - INFO - __main__ - Step 148275: {'lr': 1.6777194115558113e-07, 'samples': 28468800, 'steps': 148274, 'loss/train': 1.4267370700836182} 11/07/2021 18:05:54 - INFO - __main__ - Step 148276: {'lr': 1.6757761369384783e-07, 'samples': 28468992, 'steps': 148275, 'loss/train': 1.5367859601974487} 11/07/2021 18:05:54 - INFO - __main__ - Step 148277: {'lr': 1.6738339880273402e-07, 'samples': 28469184, 'steps': 148276, 'loss/train': 1.2833346128463745} 11/07/2021 18:05:55 - INFO - __main__ - Step 148278: {'lr': 1.671892964823507e-07, 'samples': 28469376, 'steps': 148277, 'loss/train': 1.1788246631622314} 11/07/2021 18:05:56 - INFO - __main__ - Step 148279: {'lr': 1.669953067327812e-07, 'samples': 28469568, 'steps': 148278, 'loss/train': 1.3162884712219238} 11/07/2021 18:05:56 - INFO - __main__ - Step 148280: {'lr': 1.6680142955408094e-07, 'samples': 28469760, 'steps': 148279, 'loss/train': 1.3066245317459106} 11/07/2021 18:05:56 - INFO - __main__ - Step 148281: {'lr': 1.666076649463888e-07, 'samples': 28469952, 'steps': 148280, 'loss/train': 1.2619932889938354} 11/07/2021 18:05:57 - INFO - __main__ - Step 148282: {'lr': 1.6641401290976022e-07, 'samples': 28470144, 'steps': 148281, 'loss/train': 1.1721876859664917} 11/07/2021 18:05:57 - INFO - __main__ - Step 148283: {'lr': 1.6622047344430625e-07, 'samples': 28470336, 'steps': 148282, 'loss/train': 1.2282265424728394} 11/07/2021 18:05:58 - INFO - __main__ - Step 148284: {'lr': 1.6602704655008238e-07, 'samples': 28470528, 'steps': 148283, 'loss/train': 1.2420766353607178} 11/07/2021 18:05:59 - INFO - __main__ - Step 148285: {'lr': 1.6583373222719966e-07, 'samples': 28470720, 'steps': 148284, 'loss/train': 1.3651478290557861} 11/07/2021 18:05:59 - INFO - __main__ - Step 148286: {'lr': 1.6564053047574136e-07, 'samples': 28470912, 'steps': 148285, 'loss/train': 1.3269519805908203} 11/07/2021 18:05:59 - INFO - __main__ - Step 148287: {'lr': 1.6544744129576294e-07, 'samples': 28471104, 'steps': 148286, 'loss/train': 1.6245640516281128} 11/07/2021 18:06:00 - INFO - __main__ - Step 148288: {'lr': 1.6525446468740323e-07, 'samples': 28471296, 'steps': 148287, 'loss/train': 0.2442389279603958} 11/07/2021 18:06:01 - INFO - __main__ - Step 148289: {'lr': 1.6506160065071774e-07, 'samples': 28471488, 'steps': 148288, 'loss/train': 0.23461437225341797} 11/07/2021 18:06:01 - INFO - __main__ - Step 148290: {'lr': 1.6486884918581746e-07, 'samples': 28471680, 'steps': 148289, 'loss/train': 1.354588270187378} 11/07/2021 18:06:02 - INFO - __main__ - Step 148291: {'lr': 1.646762102927579e-07, 'samples': 28471872, 'steps': 148290, 'loss/train': 0.3809545040130615} 11/07/2021 18:06:02 - INFO - __main__ - Step 148292: {'lr': 1.6448368397162238e-07, 'samples': 28472064, 'steps': 148291, 'loss/train': 1.1561568975448608} 11/07/2021 18:06:02 - INFO - __main__ - Step 148293: {'lr': 1.6429127022252188e-07, 'samples': 28472256, 'steps': 148292, 'loss/train': 1.261466383934021} 11/07/2021 18:06:03 - INFO - __main__ - Step 148294: {'lr': 1.6409896904556742e-07, 'samples': 28472448, 'steps': 148293, 'loss/train': 1.3847531080245972} 11/07/2021 18:06:04 - INFO - __main__ - Step 148295: {'lr': 1.6390678044078677e-07, 'samples': 28472640, 'steps': 148294, 'loss/train': 1.1031907796859741} 11/07/2021 18:06:04 - INFO - __main__ - Step 148296: {'lr': 1.6371470440829094e-07, 'samples': 28472832, 'steps': 148295, 'loss/train': 1.3096567392349243} 11/07/2021 18:06:04 - INFO - __main__ - Step 148297: {'lr': 1.6352274094819098e-07, 'samples': 28473024, 'steps': 148296, 'loss/train': 1.1957077980041504} 11/07/2021 18:06:05 - INFO - __main__ - Step 148298: {'lr': 1.6333089006054236e-07, 'samples': 28473216, 'steps': 148297, 'loss/train': 1.2342702150344849} 11/07/2021 18:06:05 - INFO - __main__ - Step 148299: {'lr': 1.6313915174542836e-07, 'samples': 28473408, 'steps': 148298, 'loss/train': 0.7174201607704163} 11/07/2021 18:06:06 - INFO - __main__ - Step 148300: {'lr': 1.6294752600296002e-07, 'samples': 28473600, 'steps': 148299, 'loss/train': 1.5213099718093872} 11/07/2021 18:06:07 - INFO - __main__ - Step 148301: {'lr': 1.6275601283322061e-07, 'samples': 28473792, 'steps': 148300, 'loss/train': 1.0631299018859863} 11/07/2021 18:06:07 - INFO - __main__ - Step 148302: {'lr': 1.6256461223629336e-07, 'samples': 28473984, 'steps': 148301, 'loss/train': 0.7772263884544373} 11/07/2021 18:06:07 - INFO - __main__ - Step 148303: {'lr': 1.6237332421223382e-07, 'samples': 28474176, 'steps': 148302, 'loss/train': 1.201957106590271} 11/07/2021 18:06:08 - INFO - __main__ - Step 148304: {'lr': 1.6218214876118076e-07, 'samples': 28474368, 'steps': 148303, 'loss/train': 0.9052629470825195} 11/07/2021 18:06:09 - INFO - __main__ - Step 148305: {'lr': 1.6199108588316192e-07, 'samples': 28474560, 'steps': 148304, 'loss/train': 1.2669758796691895} 11/07/2021 18:06:09 - INFO - __main__ - Step 148306: {'lr': 1.6180013557831608e-07, 'samples': 28474752, 'steps': 148305, 'loss/train': 1.0246626138687134} 11/07/2021 18:06:09 - INFO - __main__ - Step 148307: {'lr': 1.6160929784669876e-07, 'samples': 28474944, 'steps': 148306, 'loss/train': 0.8609293103218079} 11/07/2021 18:06:10 - INFO - __main__ - Step 148308: {'lr': 1.6141857268842098e-07, 'samples': 28475136, 'steps': 148307, 'loss/train': 1.0905523300170898} 11/07/2021 18:06:10 - INFO - __main__ - Step 148309: {'lr': 1.6122796010353824e-07, 'samples': 28475328, 'steps': 148308, 'loss/train': 0.9944153428077698} 11/07/2021 18:06:11 - INFO - __main__ - Step 148310: {'lr': 1.6103746009216157e-07, 'samples': 28475520, 'steps': 148309, 'loss/train': 1.3202219009399414} 11/07/2021 18:06:11 - INFO - __main__ - Step 148311: {'lr': 1.608470726543465e-07, 'samples': 28475712, 'steps': 148310, 'loss/train': 1.4074443578720093} 11/07/2021 18:06:12 - INFO - __main__ - Step 148312: {'lr': 1.606567977902318e-07, 'samples': 28475904, 'steps': 148311, 'loss/train': 0.9133514761924744} 11/07/2021 18:06:12 - INFO - __main__ - Step 148313: {'lr': 1.6046663549984518e-07, 'samples': 28476096, 'steps': 148312, 'loss/train': 1.4123486280441284} 11/07/2021 18:06:12 - INFO - __main__ - Step 148314: {'lr': 1.6027658578329774e-07, 'samples': 28476288, 'steps': 148313, 'loss/train': 1.434384822845459} 11/07/2021 18:06:14 - INFO - __main__ - Step 148315: {'lr': 1.6008664864067268e-07, 'samples': 28476480, 'steps': 148314, 'loss/train': 1.1399271488189697} 11/07/2021 18:06:14 - INFO - __main__ - Step 148316: {'lr': 1.598968240720533e-07, 'samples': 28476672, 'steps': 148315, 'loss/train': 1.1186988353729248} 11/07/2021 18:06:14 - INFO - __main__ - Step 148317: {'lr': 1.5970711207755063e-07, 'samples': 28476864, 'steps': 148316, 'loss/train': 1.2713923454284668} 11/07/2021 18:06:15 - INFO - __main__ - Step 148318: {'lr': 1.5951751265722013e-07, 'samples': 28477056, 'steps': 148317, 'loss/train': 1.6695586442947388} 11/07/2021 18:06:15 - INFO - __main__ - Step 148319: {'lr': 1.5932802581114513e-07, 'samples': 28477248, 'steps': 148318, 'loss/train': 1.0403273105621338} 11/07/2021 18:06:16 - INFO - __main__ - Step 148320: {'lr': 1.5913865153943662e-07, 'samples': 28477440, 'steps': 148319, 'loss/train': 1.5005518198013306} 11/07/2021 18:06:16 - INFO - __main__ - Step 148321: {'lr': 1.5894938984215013e-07, 'samples': 28477632, 'steps': 148320, 'loss/train': 1.3443975448608398} 11/07/2021 18:06:17 - INFO - __main__ - Step 148322: {'lr': 1.5876024071939665e-07, 'samples': 28477824, 'steps': 148321, 'loss/train': 0.10853268951177597} 11/07/2021 18:06:17 - INFO - __main__ - Step 148323: {'lr': 1.5857120417123173e-07, 'samples': 28478016, 'steps': 148322, 'loss/train': 1.3008339405059814} 11/07/2021 18:06:17 - INFO - __main__ - Step 148324: {'lr': 1.583822801977941e-07, 'samples': 28478208, 'steps': 148323, 'loss/train': 1.2951796054840088} 11/07/2021 18:06:18 - INFO - __main__ - Step 148325: {'lr': 1.5819346879911155e-07, 'samples': 28478400, 'steps': 148324, 'loss/train': 1.437968134880066} 11/07/2021 18:06:19 - INFO - __main__ - Step 148326: {'lr': 1.5800476997529512e-07, 'samples': 28478592, 'steps': 148325, 'loss/train': 1.07534658908844} 11/07/2021 18:06:19 - INFO - __main__ - Step 148327: {'lr': 1.5781618372642802e-07, 'samples': 28478784, 'steps': 148326, 'loss/train': 1.3189072608947754} 11/07/2021 18:06:20 - INFO - __main__ - Step 148328: {'lr': 1.5762771005259357e-07, 'samples': 28478976, 'steps': 148327, 'loss/train': 1.3908275365829468} 11/07/2021 18:06:20 - INFO - __main__ - Step 148329: {'lr': 1.57439348953875e-07, 'samples': 28479168, 'steps': 148328, 'loss/train': 0.6981621384620667} 11/07/2021 18:06:20 - INFO - __main__ - Step 148330: {'lr': 1.5725110043035563e-07, 'samples': 28479360, 'steps': 148329, 'loss/train': 1.1531052589416504} 11/07/2021 18:06:21 - INFO - __main__ - Step 148331: {'lr': 1.5706296448211864e-07, 'samples': 28479552, 'steps': 148330, 'loss/train': 1.5171027183532715} 11/07/2021 18:06:22 - INFO - __main__ - Step 148332: {'lr': 1.5687494110924738e-07, 'samples': 28479744, 'steps': 148331, 'loss/train': 1.4847818613052368} 11/07/2021 18:06:22 - INFO - __main__ - Step 148333: {'lr': 1.5668703031185282e-07, 'samples': 28479936, 'steps': 148332, 'loss/train': 1.5765693187713623} 11/07/2021 18:06:22 - INFO - __main__ - Step 148334: {'lr': 1.564992320899905e-07, 'samples': 28480128, 'steps': 148333, 'loss/train': 1.2736127376556396} 11/07/2021 18:06:23 - INFO - __main__ - Step 148335: {'lr': 1.5631154644377144e-07, 'samples': 28480320, 'steps': 148334, 'loss/train': 0.1351608783006668} 11/07/2021 18:06:24 - INFO - __main__ - Step 148336: {'lr': 1.5612397337325114e-07, 'samples': 28480512, 'steps': 148335, 'loss/train': 1.429985761642456} 11/07/2021 18:06:24 - INFO - __main__ - Step 148337: {'lr': 1.5593651287851285e-07, 'samples': 28480704, 'steps': 148336, 'loss/train': 1.5086407661437988} 11/07/2021 18:06:24 - INFO - __main__ - Step 148338: {'lr': 1.5574916495966762e-07, 'samples': 28480896, 'steps': 148337, 'loss/train': 1.2190356254577637} 11/07/2021 18:06:25 - INFO - __main__ - Step 148339: {'lr': 1.555619296167987e-07, 'samples': 28481088, 'steps': 148338, 'loss/train': 1.2668399810791016} 11/07/2021 18:06:25 - INFO - __main__ - Step 148340: {'lr': 1.5537480684996163e-07, 'samples': 28481280, 'steps': 148339, 'loss/train': 1.371404767036438} 11/07/2021 18:06:26 - INFO - __main__ - Step 148341: {'lr': 1.5518779665923966e-07, 'samples': 28481472, 'steps': 148340, 'loss/train': 1.1067572832107544} 11/07/2021 18:06:27 - INFO - __main__ - Step 148342: {'lr': 1.5500089904477154e-07, 'samples': 28481664, 'steps': 148341, 'loss/train': 1.4733842611312866} 11/07/2021 18:06:27 - INFO - __main__ - Step 148343: {'lr': 1.5481411400658506e-07, 'samples': 28481856, 'steps': 148342, 'loss/train': 1.7694662809371948} 11/07/2021 18:06:27 - INFO - __main__ - Step 148344: {'lr': 1.5462744154479125e-07, 'samples': 28482048, 'steps': 148343, 'loss/train': 1.344449758529663} 11/07/2021 18:06:28 - INFO - __main__ - Step 148345: {'lr': 1.5444088165944558e-07, 'samples': 28482240, 'steps': 148344, 'loss/train': 0.9944664835929871} 11/07/2021 18:06:29 - INFO - __main__ - Step 148346: {'lr': 1.5425443435068687e-07, 'samples': 28482432, 'steps': 148345, 'loss/train': 0.9755681157112122} 11/07/2021 18:06:29 - INFO - __main__ - Step 148347: {'lr': 1.5406809961854283e-07, 'samples': 28482624, 'steps': 148346, 'loss/train': 1.3248862028121948} 11/07/2021 18:06:29 - INFO - __main__ - Step 148348: {'lr': 1.5388187746312453e-07, 'samples': 28482816, 'steps': 148347, 'loss/train': 1.3905203342437744} 11/07/2021 18:06:30 - INFO - __main__ - Step 148349: {'lr': 1.5369576788451522e-07, 'samples': 28483008, 'steps': 148348, 'loss/train': 1.5393307209014893} 11/07/2021 18:06:30 - INFO - __main__ - Step 148350: {'lr': 1.5350977088279816e-07, 'samples': 28483200, 'steps': 148349, 'loss/train': 0.9182986617088318} 11/07/2021 18:06:30 - INFO - __main__ - Step 148351: {'lr': 1.533238864580566e-07, 'samples': 28483392, 'steps': 148350, 'loss/train': 1.0418822765350342} 11/07/2021 18:06:31 - INFO - __main__ - Step 148352: {'lr': 1.531381146103461e-07, 'samples': 28483584, 'steps': 148351, 'loss/train': 0.7479719519615173} 11/07/2021 18:06:32 - INFO - __main__ - Step 148353: {'lr': 1.529524553398054e-07, 'samples': 28483776, 'steps': 148352, 'loss/train': 1.2612488269805908} 11/07/2021 18:06:32 - INFO - __main__ - Step 148354: {'lr': 1.5276690864649e-07, 'samples': 28483968, 'steps': 148353, 'loss/train': 1.1341058015823364} 11/07/2021 18:06:32 - INFO - __main__ - Step 148355: {'lr': 1.5258147453045545e-07, 'samples': 28484160, 'steps': 148354, 'loss/train': 1.4250106811523438} 11/07/2021 18:06:33 - INFO - __main__ - Step 148356: {'lr': 1.5239615299184051e-07, 'samples': 28484352, 'steps': 148355, 'loss/train': 1.381839632987976} 11/07/2021 18:06:34 - INFO - __main__ - Step 148357: {'lr': 1.5221094403067294e-07, 'samples': 28484544, 'steps': 148356, 'loss/train': 1.3033251762390137} 11/07/2021 18:06:34 - INFO - __main__ - Step 148358: {'lr': 1.520258476470915e-07, 'samples': 28484736, 'steps': 148357, 'loss/train': 1.3362584114074707} 11/07/2021 18:06:35 - INFO - __main__ - Step 148359: {'lr': 1.5184086384112394e-07, 'samples': 28484928, 'steps': 148358, 'loss/train': 1.4843794107437134} 11/07/2021 18:06:35 - INFO - __main__ - Step 148360: {'lr': 1.5165599261290909e-07, 'samples': 28485120, 'steps': 148359, 'loss/train': 1.6768420934677124} 11/07/2021 18:06:35 - INFO - __main__ - Step 148361: {'lr': 1.514712339625024e-07, 'samples': 28485312, 'steps': 148360, 'loss/train': 1.4061801433563232} 11/07/2021 18:06:36 - INFO - __main__ - Step 148362: {'lr': 1.5128658788995942e-07, 'samples': 28485504, 'steps': 148361, 'loss/train': 1.2870137691497803} 11/07/2021 18:06:37 - INFO - __main__ - Step 148363: {'lr': 1.5110205439541892e-07, 'samples': 28485696, 'steps': 148362, 'loss/train': 1.2237542867660522} 11/07/2021 18:06:37 - INFO - __main__ - Step 148364: {'lr': 1.5091763347890864e-07, 'samples': 28485888, 'steps': 148363, 'loss/train': 1.4002546072006226} 11/07/2021 18:06:37 - INFO - __main__ - Step 148365: {'lr': 1.5073332514056736e-07, 'samples': 28486080, 'steps': 148364, 'loss/train': 1.3909801244735718} 11/07/2021 18:06:38 - INFO - __main__ - Step 148366: {'lr': 1.5054912938042288e-07, 'samples': 28486272, 'steps': 148365, 'loss/train': 1.1603304147720337} 11/07/2021 18:06:39 - INFO - __main__ - Step 148367: {'lr': 1.5036504619861392e-07, 'samples': 28486464, 'steps': 148366, 'loss/train': 1.2951220273971558} 11/07/2021 18:06:39 - INFO - __main__ - Step 148368: {'lr': 1.5018107559519602e-07, 'samples': 28486656, 'steps': 148367, 'loss/train': 1.401140570640564} 11/07/2021 18:06:39 - INFO - __main__ - Step 148369: {'lr': 1.4999721757022465e-07, 'samples': 28486848, 'steps': 148368, 'loss/train': 0.7946009039878845} 11/07/2021 18:06:40 - INFO - __main__ - Step 148370: {'lr': 1.4981347212383866e-07, 'samples': 28487040, 'steps': 148369, 'loss/train': 1.2885488271713257} 11/07/2021 18:06:40 - INFO - __main__ - Step 148371: {'lr': 1.4962983925606576e-07, 'samples': 28487232, 'steps': 148370, 'loss/train': 0.8507276773452759} 11/07/2021 18:06:41 - INFO - __main__ - Step 148372: {'lr': 1.4944631896701698e-07, 'samples': 28487424, 'steps': 148371, 'loss/train': 1.097436785697937} 11/07/2021 18:06:42 - INFO - __main__ - Step 148373: {'lr': 1.492629112567756e-07, 'samples': 28487616, 'steps': 148372, 'loss/train': 1.395557165145874} 11/07/2021 18:06:42 - INFO - __main__ - Step 148374: {'lr': 1.4907961612542487e-07, 'samples': 28487808, 'steps': 148373, 'loss/train': 1.4806286096572876} 11/07/2021 18:06:42 - INFO - __main__ - Step 148375: {'lr': 1.4889643357304804e-07, 'samples': 28488000, 'steps': 148374, 'loss/train': 0.6225084066390991} 11/07/2021 18:06:43 - INFO - __main__ - Step 148376: {'lr': 1.4871336359970066e-07, 'samples': 28488192, 'steps': 148375, 'loss/train': 1.2986441850662231} 11/07/2021 18:06:43 - INFO - __main__ - Step 148377: {'lr': 1.485304062055215e-07, 'samples': 28488384, 'steps': 148376, 'loss/train': 0.9151935577392578} 11/07/2021 18:06:44 - INFO - __main__ - Step 148378: {'lr': 1.483475613905383e-07, 'samples': 28488576, 'steps': 148377, 'loss/train': 1.299578070640564} 11/07/2021 18:06:44 - INFO - __main__ - Step 148379: {'lr': 1.4816482915486207e-07, 'samples': 28488768, 'steps': 148378, 'loss/train': 1.0737836360931396} 11/07/2021 18:06:45 - INFO - __main__ - Step 148380: {'lr': 1.4798220949854834e-07, 'samples': 28488960, 'steps': 148379, 'loss/train': 1.3567720651626587} 11/07/2021 18:06:45 - INFO - __main__ - Step 148381: {'lr': 1.477997024217359e-07, 'samples': 28489152, 'steps': 148380, 'loss/train': 0.8703852891921997} 11/07/2021 18:06:45 - INFO - __main__ - Step 148382: {'lr': 1.4761730792442473e-07, 'samples': 28489344, 'steps': 148381, 'loss/train': 0.38681313395500183} 11/07/2021 18:06:46 - INFO - __main__ - Step 148383: {'lr': 1.4743502600678138e-07, 'samples': 28489536, 'steps': 148382, 'loss/train': 1.1453473567962646} 11/07/2021 18:06:47 - INFO - __main__ - Step 148384: {'lr': 1.4725285666883358e-07, 'samples': 28489728, 'steps': 148383, 'loss/train': 0.9547638297080994} 11/07/2021 18:06:47 - INFO - __main__ - Step 148385: {'lr': 1.4707079991066462e-07, 'samples': 28489920, 'steps': 148384, 'loss/train': 1.5922048091888428} 11/07/2021 18:06:48 - INFO - __main__ - Step 148386: {'lr': 1.4688885573238552e-07, 'samples': 28490112, 'steps': 148385, 'loss/train': 1.2476855516433716} 11/07/2021 18:06:48 - INFO - __main__ - Step 148387: {'lr': 1.467070241340518e-07, 'samples': 28490304, 'steps': 148386, 'loss/train': 1.251231074333191} 11/07/2021 18:06:49 - INFO - __main__ - Step 148388: {'lr': 1.4652530511577444e-07, 'samples': 28490496, 'steps': 148387, 'loss/train': 1.196874737739563} 11/07/2021 18:06:49 - INFO - __main__ - Step 148389: {'lr': 1.4634369867760899e-07, 'samples': 28490688, 'steps': 148388, 'loss/train': 1.2213976383209229} 11/07/2021 18:06:50 - INFO - __main__ - Step 148390: {'lr': 1.4616220481963872e-07, 'samples': 28490880, 'steps': 148389, 'loss/train': 1.568894863128662} 11/07/2021 18:06:50 - INFO - __main__ - Step 148391: {'lr': 1.4598082354194686e-07, 'samples': 28491072, 'steps': 148390, 'loss/train': 1.6984329223632812} 11/07/2021 18:06:50 - INFO - __main__ - Step 148392: {'lr': 1.457995548446167e-07, 'samples': 28491264, 'steps': 148391, 'loss/train': 1.403191089630127} 11/07/2021 18:06:51 - INFO - __main__ - Step 148393: {'lr': 1.456183987277593e-07, 'samples': 28491456, 'steps': 148392, 'loss/train': 1.1474263668060303} 11/07/2021 18:06:52 - INFO - __main__ - Step 148394: {'lr': 1.4543735519140234e-07, 'samples': 28491648, 'steps': 148393, 'loss/train': 1.4231863021850586} 11/07/2021 18:06:52 - INFO - __main__ - Step 148395: {'lr': 1.4525642423568463e-07, 'samples': 28491840, 'steps': 148394, 'loss/train': 1.0308725833892822} 11/07/2021 18:06:52 - INFO - __main__ - Step 148396: {'lr': 1.450756058606617e-07, 'samples': 28492032, 'steps': 148395, 'loss/train': 1.1351863145828247} 11/07/2021 18:06:53 - INFO - __main__ - Step 148397: {'lr': 1.4489490006638905e-07, 'samples': 28492224, 'steps': 148396, 'loss/train': 1.366349458694458} 11/07/2021 18:06:54 - INFO - __main__ - Step 148398: {'lr': 1.447143068529777e-07, 'samples': 28492416, 'steps': 148397, 'loss/train': 1.293398380279541} 11/07/2021 18:06:54 - INFO - __main__ - Step 148399: {'lr': 1.4453382622048317e-07, 'samples': 28492608, 'steps': 148398, 'loss/train': 1.3359131813049316} 11/07/2021 18:06:54 - INFO - __main__ - Step 148400: {'lr': 1.443534581690442e-07, 'samples': 28492800, 'steps': 148399, 'loss/train': 1.2383098602294922} 11/07/2021 18:06:55 - INFO - __main__ - Step 148401: {'lr': 1.4417320269868862e-07, 'samples': 28492992, 'steps': 148400, 'loss/train': 1.1175259351730347} 11/07/2021 18:06:55 - INFO - __main__ - Step 148402: {'lr': 1.439930598094996e-07, 'samples': 28493184, 'steps': 148401, 'loss/train': 1.206420660018921} 11/07/2021 18:06:56 - INFO - __main__ - Step 148403: {'lr': 1.4381302950158826e-07, 'samples': 28493376, 'steps': 148402, 'loss/train': 1.418001651763916} 11/07/2021 18:06:57 - INFO - __main__ - Step 148404: {'lr': 1.4363311177501003e-07, 'samples': 28493568, 'steps': 148403, 'loss/train': 1.2718584537506104} 11/07/2021 18:06:57 - INFO - __main__ - Step 148405: {'lr': 1.4345330662984823e-07, 'samples': 28493760, 'steps': 148404, 'loss/train': 1.0619010925292969} 11/07/2021 18:06:57 - INFO - __main__ - Step 148406: {'lr': 1.4327361406621385e-07, 'samples': 28493952, 'steps': 148405, 'loss/train': 1.5251606702804565} 11/07/2021 18:06:58 - INFO - __main__ - Step 148407: {'lr': 1.4309403408416243e-07, 'samples': 28494144, 'steps': 148406, 'loss/train': 1.3282533884048462} 11/07/2021 18:06:59 - INFO - __main__ - Step 148408: {'lr': 1.4291456668374947e-07, 'samples': 28494336, 'steps': 148407, 'loss/train': 1.601609706878662} 11/07/2021 18:06:59 - INFO - __main__ - Step 148409: {'lr': 1.4273521186511372e-07, 'samples': 28494528, 'steps': 148408, 'loss/train': 0.5580316185951233} 11/07/2021 18:06:59 - INFO - __main__ - Step 148410: {'lr': 1.42555969628283e-07, 'samples': 28494720, 'steps': 148409, 'loss/train': 1.3133717775344849} 11/07/2021 18:07:00 - INFO - __main__ - Step 148411: {'lr': 1.4237683997336826e-07, 'samples': 28494912, 'steps': 148410, 'loss/train': 1.4665977954864502} 11/07/2021 18:07:00 - INFO - __main__ - Step 148412: {'lr': 1.421978229004528e-07, 'samples': 28495104, 'steps': 148411, 'loss/train': 1.281636118888855} 11/07/2021 18:07:01 - INFO - __main__ - Step 148413: {'lr': 1.4201891840961988e-07, 'samples': 28495296, 'steps': 148412, 'loss/train': 1.1487386226654053} 11/07/2021 18:07:02 - INFO - __main__ - Step 148414: {'lr': 1.4184012650089729e-07, 'samples': 28495488, 'steps': 148413, 'loss/train': 1.577560305595398} 11/07/2021 18:07:02 - INFO - __main__ - Step 148415: {'lr': 1.4166144717442374e-07, 'samples': 28495680, 'steps': 148414, 'loss/train': 0.8100746870040894} 11/07/2021 18:07:02 - INFO - __main__ - Step 148416: {'lr': 1.4148288043028256e-07, 'samples': 28495872, 'steps': 148415, 'loss/train': 1.19974946975708} 11/07/2021 18:07:03 - INFO - __main__ - Step 148417: {'lr': 1.413044262685015e-07, 'samples': 28496064, 'steps': 148416, 'loss/train': 1.1861692667007446} 11/07/2021 18:07:04 - INFO - __main__ - Step 148418: {'lr': 1.4112608468921928e-07, 'samples': 28496256, 'steps': 148417, 'loss/train': 1.427635908126831} 11/07/2021 18:07:04 - INFO - __main__ - Step 148419: {'lr': 1.4094785569249147e-07, 'samples': 28496448, 'steps': 148418, 'loss/train': 0.910209059715271} 11/07/2021 18:07:04 - INFO - __main__ - Step 148420: {'lr': 1.4076973927837356e-07, 'samples': 28496640, 'steps': 148419, 'loss/train': 1.1567341089248657} 11/07/2021 18:07:05 - INFO - __main__ - Step 148421: {'lr': 1.4059173544697657e-07, 'samples': 28496832, 'steps': 148420, 'loss/train': 0.8852680921554565} 11/07/2021 18:07:05 - INFO - __main__ - Step 148422: {'lr': 1.4041384419838376e-07, 'samples': 28497024, 'steps': 148421, 'loss/train': 1.4905778169631958} 11/07/2021 18:07:05 - INFO - __main__ - Step 148423: {'lr': 1.4023606553265067e-07, 'samples': 28497216, 'steps': 148422, 'loss/train': 0.573447048664093} 11/07/2021 18:07:06 - INFO - __main__ - Step 148424: {'lr': 1.400583994498883e-07, 'samples': 28497408, 'steps': 148423, 'loss/train': 1.3400262594223022} 11/07/2021 18:07:07 - INFO - __main__ - Step 148425: {'lr': 1.3988084595015217e-07, 'samples': 28497600, 'steps': 148424, 'loss/train': 1.3240045309066772} 11/07/2021 18:07:07 - INFO - __main__ - Step 148426: {'lr': 1.3970340503352551e-07, 'samples': 28497792, 'steps': 148425, 'loss/train': 1.466976284980774} 11/07/2021 18:07:08 - INFO - __main__ - Step 148427: {'lr': 1.3952607670009164e-07, 'samples': 28497984, 'steps': 148426, 'loss/train': 1.904985785484314} 11/07/2021 18:07:08 - INFO - __main__ - Step 148428: {'lr': 1.3934886094993383e-07, 'samples': 28498176, 'steps': 148427, 'loss/train': 1.1627930402755737} 11/07/2021 18:07:09 - INFO - __main__ - Step 148429: {'lr': 1.3917175778313529e-07, 'samples': 28498368, 'steps': 148428, 'loss/train': 1.1014258861541748} 11/07/2021 18:07:09 - INFO - __main__ - Step 148430: {'lr': 1.3899476719977932e-07, 'samples': 28498560, 'steps': 148429, 'loss/train': 1.4130967855453491} 11/07/2021 18:07:10 - INFO - __main__ - Step 148431: {'lr': 1.3881788919992144e-07, 'samples': 28498752, 'steps': 148430, 'loss/train': 0.9033591151237488} 11/07/2021 18:07:10 - INFO - __main__ - Step 148432: {'lr': 1.3864112378367266e-07, 'samples': 28498944, 'steps': 148431, 'loss/train': 1.191962480545044} 11/07/2021 18:07:10 - INFO - __main__ - Step 148433: {'lr': 1.3846447095106074e-07, 'samples': 28499136, 'steps': 148432, 'loss/train': 0.8909156918525696} 11/07/2021 18:07:11 - INFO - __main__ - Step 148434: {'lr': 1.3828793070222444e-07, 'samples': 28499328, 'steps': 148433, 'loss/train': 1.055014729499817} 11/07/2021 18:07:12 - INFO - __main__ - Step 148435: {'lr': 1.3811150303724707e-07, 'samples': 28499520, 'steps': 148434, 'loss/train': 0.9852409362792969} 11/07/2021 18:07:12 - INFO - __main__ - Step 148436: {'lr': 1.3793518795615635e-07, 'samples': 28499712, 'steps': 148435, 'loss/train': 1.380128264427185} 11/07/2021 18:07:12 - INFO - __main__ - Step 148437: {'lr': 1.3775898545903554e-07, 'samples': 28499904, 'steps': 148436, 'loss/train': 1.0902682542800903} 11/07/2021 18:07:13 - INFO - __main__ - Step 148438: {'lr': 1.3758289554599568e-07, 'samples': 28500096, 'steps': 148437, 'loss/train': 1.358510971069336} 11/07/2021 18:07:14 - INFO - __main__ - Step 148439: {'lr': 1.3740691821712004e-07, 'samples': 28500288, 'steps': 148438, 'loss/train': 0.89048171043396} 11/07/2021 18:07:14 - INFO - __main__ - Step 148440: {'lr': 1.3723105347246413e-07, 'samples': 28500480, 'steps': 148439, 'loss/train': 1.525715947151184} 11/07/2021 18:07:15 - INFO - __main__ - Step 148441: {'lr': 1.3705530131213896e-07, 'samples': 28500672, 'steps': 148440, 'loss/train': 1.3174302577972412} 11/07/2021 18:07:15 - INFO - __main__ - Step 148442: {'lr': 1.3687966173617228e-07, 'samples': 28500864, 'steps': 148441, 'loss/train': 1.280573844909668} 11/07/2021 18:07:15 - INFO - __main__ - Step 148443: {'lr': 1.3670413474467514e-07, 'samples': 28501056, 'steps': 148442, 'loss/train': 1.431447148323059} 11/07/2021 18:07:16 - INFO - __main__ - Step 148444: {'lr': 1.3652872033773078e-07, 'samples': 28501248, 'steps': 148443, 'loss/train': 1.350101351737976} 11/07/2021 18:07:17 - INFO - __main__ - Step 148445: {'lr': 1.363534185154225e-07, 'samples': 28501440, 'steps': 148444, 'loss/train': 0.7838642001152039} 11/07/2021 18:07:17 - INFO - __main__ - Step 148446: {'lr': 1.3617822927780576e-07, 'samples': 28501632, 'steps': 148445, 'loss/train': 1.600533127784729} 11/07/2021 18:07:18 - INFO - __main__ - Step 148447: {'lr': 1.3600315262496388e-07, 'samples': 28501824, 'steps': 148446, 'loss/train': 1.4942224025726318} 11/07/2021 18:07:18 - INFO - __main__ - Step 148448: {'lr': 1.358281885569801e-07, 'samples': 28502016, 'steps': 148447, 'loss/train': 1.4219666719436646} 11/07/2021 18:07:18 - INFO - __main__ - Step 148449: {'lr': 1.3565333707393767e-07, 'samples': 28502208, 'steps': 148448, 'loss/train': 1.4798576831817627} 11/07/2021 18:07:19 - INFO - __main__ - Step 148450: {'lr': 1.3547859817594766e-07, 'samples': 28502400, 'steps': 148449, 'loss/train': 1.7105071544647217} 11/07/2021 18:07:20 - INFO - __main__ - Step 148451: {'lr': 1.3530397186301003e-07, 'samples': 28502592, 'steps': 148450, 'loss/train': 1.1657778024673462} 11/07/2021 18:07:20 - INFO - __main__ - Step 148452: {'lr': 1.3512945813526355e-07, 'samples': 28502784, 'steps': 148451, 'loss/train': 1.5584828853607178} 11/07/2021 18:07:20 - INFO - __main__ - Step 148453: {'lr': 1.3495505699279154e-07, 'samples': 28502976, 'steps': 148452, 'loss/train': 1.6019643545150757} 11/07/2021 18:07:21 - INFO - __main__ - Step 148454: {'lr': 1.3478076843564945e-07, 'samples': 28503168, 'steps': 148453, 'loss/train': 1.4748653173446655} 11/07/2021 18:07:21 - INFO - __main__ - Step 148455: {'lr': 1.3460659246389285e-07, 'samples': 28503360, 'steps': 148454, 'loss/train': 0.1023520976305008} 11/07/2021 18:07:22 - INFO - __main__ - Step 148456: {'lr': 1.3443252907766046e-07, 'samples': 28503552, 'steps': 148455, 'loss/train': 1.210274577140808} 11/07/2021 18:07:23 - INFO - __main__ - Step 148457: {'lr': 1.3425857827698008e-07, 'samples': 28503744, 'steps': 148456, 'loss/train': 1.6937216520309448} 11/07/2021 18:07:23 - INFO - __main__ - Step 148458: {'lr': 1.3408474006193495e-07, 'samples': 28503936, 'steps': 148457, 'loss/train': 1.2552629709243774} 11/07/2021 18:07:23 - INFO - __main__ - Step 148459: {'lr': 1.339110144326361e-07, 'samples': 28504128, 'steps': 148458, 'loss/train': 0.9815352559089661} 11/07/2021 18:07:24 - INFO - __main__ - Step 148460: {'lr': 1.3373740138911127e-07, 'samples': 28504320, 'steps': 148459, 'loss/train': 1.3428620100021362} 11/07/2021 18:07:24 - INFO - __main__ - Step 148461: {'lr': 1.3356390093149928e-07, 'samples': 28504512, 'steps': 148460, 'loss/train': 1.7491270303726196} 11/07/2021 18:07:25 - INFO - __main__ - Step 148462: {'lr': 1.3339051305985562e-07, 'samples': 28504704, 'steps': 148461, 'loss/train': 1.5860474109649658} 11/07/2021 18:07:26 - INFO - __main__ - Step 148463: {'lr': 1.3321723777423577e-07, 'samples': 28504896, 'steps': 148462, 'loss/train': 1.4144258499145508} 11/07/2021 18:07:26 - INFO - __main__ - Step 148464: {'lr': 1.3304407507472304e-07, 'samples': 28505088, 'steps': 148463, 'loss/train': 1.670419692993164} 11/07/2021 18:07:26 - INFO - __main__ - Step 148465: {'lr': 1.328710249614007e-07, 'samples': 28505280, 'steps': 148464, 'loss/train': 1.492993950843811} 11/07/2021 18:07:27 - INFO - __main__ - Step 148466: {'lr': 1.326980874343797e-07, 'samples': 28505472, 'steps': 148465, 'loss/train': 1.6017826795578003} 11/07/2021 18:07:28 - INFO - __main__ - Step 148467: {'lr': 1.3252526249368791e-07, 'samples': 28505664, 'steps': 148466, 'loss/train': 1.5187909603118896} 11/07/2021 18:07:28 - INFO - __main__ - Step 148468: {'lr': 1.323525501394085e-07, 'samples': 28505856, 'steps': 148467, 'loss/train': 1.222072958946228} 11/07/2021 18:07:28 - INFO - __main__ - Step 148469: {'lr': 1.321799503716803e-07, 'samples': 28506048, 'steps': 148468, 'loss/train': 1.4922527074813843} 11/07/2021 18:07:29 - INFO - __main__ - Step 148470: {'lr': 1.3200746319050328e-07, 'samples': 28506240, 'steps': 148469, 'loss/train': 1.0321356058120728} 11/07/2021 18:07:29 - INFO - __main__ - Step 148471: {'lr': 1.3183508859598847e-07, 'samples': 28506432, 'steps': 148470, 'loss/train': 1.4601811170578003} 11/07/2021 18:07:30 - INFO - __main__ - Step 148472: {'lr': 1.3166282658821915e-07, 'samples': 28506624, 'steps': 148471, 'loss/train': 1.3934690952301025} 11/07/2021 18:07:30 - INFO - __main__ - Step 148473: {'lr': 1.314906771672786e-07, 'samples': 28506816, 'steps': 148472, 'loss/train': 1.4370558261871338} 11/07/2021 18:07:31 - INFO - __main__ - Step 148474: {'lr': 1.3131864033322226e-07, 'samples': 28507008, 'steps': 148473, 'loss/train': 0.46805402636528015} 11/07/2021 18:07:31 - INFO - __main__ - Step 148475: {'lr': 1.311467160861335e-07, 'samples': 28507200, 'steps': 148474, 'loss/train': 0.7179288864135742} 11/07/2021 18:07:31 - INFO - __main__ - Step 148476: {'lr': 1.309749044260955e-07, 'samples': 28507392, 'steps': 148475, 'loss/train': 1.4606930017471313} 11/07/2021 18:07:33 - INFO - __main__ - Step 148477: {'lr': 1.3080320535319158e-07, 'samples': 28507584, 'steps': 148476, 'loss/train': 1.2620975971221924} 11/07/2021 18:07:33 - INFO - __main__ - Step 148478: {'lr': 1.3063161886747722e-07, 'samples': 28507776, 'steps': 148477, 'loss/train': 1.179592490196228} 11/07/2021 18:07:33 - INFO - __main__ - Step 148479: {'lr': 1.3046014496906343e-07, 'samples': 28507968, 'steps': 148478, 'loss/train': 1.4095900058746338} 11/07/2021 18:07:34 - INFO - __main__ - Step 148480: {'lr': 1.3028878365800578e-07, 'samples': 28508160, 'steps': 148479, 'loss/train': 1.3433752059936523} 11/07/2021 18:07:34 - INFO - __main__ - Step 148481: {'lr': 1.3011753493438749e-07, 'samples': 28508352, 'steps': 148480, 'loss/train': 1.649875283241272} 11/07/2021 18:07:35 - INFO - __main__ - Step 148482: {'lr': 1.2994639879826408e-07, 'samples': 28508544, 'steps': 148481, 'loss/train': 1.113702654838562} 11/07/2021 18:07:35 - INFO - __main__ - Step 148483: {'lr': 1.297753752497466e-07, 'samples': 28508736, 'steps': 148482, 'loss/train': 1.6541929244995117} 11/07/2021 18:07:36 - INFO - __main__ - Step 148484: {'lr': 1.296044642888905e-07, 'samples': 28508928, 'steps': 148483, 'loss/train': 1.4731061458587646} 11/07/2021 18:07:36 - INFO - __main__ - Step 148485: {'lr': 1.294336659157791e-07, 'samples': 28509120, 'steps': 148484, 'loss/train': 1.0865559577941895} 11/07/2021 18:07:36 - INFO - __main__ - Step 148486: {'lr': 1.2926298013049563e-07, 'samples': 28509312, 'steps': 148485, 'loss/train': 1.1540107727050781} 11/07/2021 18:07:37 - INFO - __main__ - Step 148487: {'lr': 1.2909240693309564e-07, 'samples': 28509504, 'steps': 148486, 'loss/train': 0.8292993903160095} 11/07/2021 18:07:38 - INFO - __main__ - Step 148488: {'lr': 1.2892194632369013e-07, 'samples': 28509696, 'steps': 148487, 'loss/train': 1.5453298091888428} 11/07/2021 18:07:38 - INFO - __main__ - Step 148489: {'lr': 1.2875159830230686e-07, 'samples': 28509888, 'steps': 148488, 'loss/train': 1.1736855506896973} 11/07/2021 18:07:38 - INFO - __main__ - Step 148490: {'lr': 1.2858136286908462e-07, 'samples': 28510080, 'steps': 148489, 'loss/train': 1.5296549797058105} 11/07/2021 18:07:39 - INFO - __main__ - Step 148491: {'lr': 1.2841124002405112e-07, 'samples': 28510272, 'steps': 148490, 'loss/train': 1.3165984153747559} 11/07/2021 18:07:39 - INFO - __main__ - Step 148492: {'lr': 1.2824122976731746e-07, 'samples': 28510464, 'steps': 148491, 'loss/train': 1.6108677387237549} 11/07/2021 18:07:40 - INFO - __main__ - Step 148493: {'lr': 1.2807133209891132e-07, 'samples': 28510656, 'steps': 148492, 'loss/train': 1.143012523651123} 11/07/2021 18:07:41 - INFO - __main__ - Step 148494: {'lr': 1.279015470189715e-07, 'samples': 28510848, 'steps': 148493, 'loss/train': 1.351554274559021} 11/07/2021 18:07:41 - INFO - __main__ - Step 148495: {'lr': 1.2773187452752578e-07, 'samples': 28511040, 'steps': 148494, 'loss/train': 1.1529512405395508} 11/07/2021 18:07:41 - INFO - __main__ - Step 148496: {'lr': 1.275623146246574e-07, 'samples': 28511232, 'steps': 148495, 'loss/train': 1.281100869178772} 11/07/2021 18:07:42 - INFO - __main__ - Step 148497: {'lr': 1.273928673104774e-07, 'samples': 28511424, 'steps': 148496, 'loss/train': 1.4027537107467651} 11/07/2021 18:07:43 - INFO - __main__ - Step 148498: {'lr': 1.2722353258504126e-07, 'samples': 28511616, 'steps': 148497, 'loss/train': 1.3719385862350464} 11/07/2021 18:07:44 - INFO - __main__ - Step 148499: {'lr': 1.2705431044840453e-07, 'samples': 28511808, 'steps': 148498, 'loss/train': 1.448298454284668} 11/07/2021 18:07:44 - INFO - __main__ - Step 148500: {'lr': 1.2688520090067824e-07, 'samples': 28512000, 'steps': 148499, 'loss/train': 1.1357320547103882} 11/07/2021 18:07:44 - INFO - __main__ - Step 148501: {'lr': 1.267162039418901e-07, 'samples': 28512192, 'steps': 148500, 'loss/train': 1.598413109779358} 11/07/2021 18:07:45 - INFO - __main__ - Step 148502: {'lr': 1.265473195721789e-07, 'samples': 28512384, 'steps': 148501, 'loss/train': 0.8990229368209839} 11/07/2021 18:07:45 - INFO - __main__ - Step 148503: {'lr': 1.2637854779157243e-07, 'samples': 28512576, 'steps': 148502, 'loss/train': 1.2149626016616821} 11/07/2021 18:07:46 - INFO - __main__ - Step 148504: {'lr': 1.2620988860018167e-07, 'samples': 28512768, 'steps': 148503, 'loss/train': 1.1535927057266235} 11/07/2021 18:07:46 - INFO - __main__ - Step 148505: {'lr': 1.2604134199806215e-07, 'samples': 28512960, 'steps': 148504, 'loss/train': 1.1797243356704712} 11/07/2021 18:07:47 - INFO - __main__ - Step 148506: {'lr': 1.258729079852694e-07, 'samples': 28513152, 'steps': 148505, 'loss/train': 0.8961988687515259} 11/07/2021 18:07:47 - INFO - __main__ - Step 148507: {'lr': 1.257045865619144e-07, 'samples': 28513344, 'steps': 148506, 'loss/train': 1.5483105182647705} 11/07/2021 18:07:47 - INFO - __main__ - Step 148508: {'lr': 1.2553637772808047e-07, 'samples': 28513536, 'steps': 148507, 'loss/train': 1.2043377161026} 11/07/2021 18:07:49 - INFO - __main__ - Step 148509: {'lr': 1.253682814837953e-07, 'samples': 28513728, 'steps': 148508, 'loss/train': 1.4074839353561401} 11/07/2021 18:07:49 - INFO - __main__ - Step 148510: {'lr': 1.2520029782919772e-07, 'samples': 28513920, 'steps': 148509, 'loss/train': 1.1859972476959229} 11/07/2021 18:07:49 - INFO - __main__ - Step 148511: {'lr': 1.2503242676428772e-07, 'samples': 28514112, 'steps': 148510, 'loss/train': 0.8164668083190918} 11/07/2021 18:07:50 - INFO - __main__ - Step 148512: {'lr': 1.2486466828920406e-07, 'samples': 28514304, 'steps': 148511, 'loss/train': 1.420708179473877} 11/07/2021 18:07:50 - INFO - __main__ - Step 148513: {'lr': 1.2469702240400226e-07, 'samples': 28514496, 'steps': 148512, 'loss/train': 0.9519252777099609} 11/07/2021 18:07:51 - INFO - __main__ - Step 148514: {'lr': 1.245294891087656e-07, 'samples': 28514688, 'steps': 148513, 'loss/train': 0.3149808943271637} 11/07/2021 18:07:51 - INFO - __main__ - Step 148515: {'lr': 1.2436206840354957e-07, 'samples': 28514880, 'steps': 148514, 'loss/train': 1.3453176021575928} 11/07/2021 18:07:52 - INFO - __main__ - Step 148516: {'lr': 1.2419476028843747e-07, 'samples': 28515072, 'steps': 148515, 'loss/train': 1.114938735961914} 11/07/2021 18:07:52 - INFO - __main__ - Step 148517: {'lr': 1.2402756476351252e-07, 'samples': 28515264, 'steps': 148516, 'loss/train': 1.2533326148986816} 11/07/2021 18:07:52 - INFO - __main__ - Step 148518: {'lr': 1.2386048182883026e-07, 'samples': 28515456, 'steps': 148517, 'loss/train': 1.0768243074417114} 11/07/2021 18:07:54 - INFO - __main__ - Step 148519: {'lr': 1.236935114845017e-07, 'samples': 28515648, 'steps': 148518, 'loss/train': 1.3754608631134033} 11/07/2021 18:07:54 - INFO - __main__ - Step 148520: {'lr': 1.2352665373055462e-07, 'samples': 28515840, 'steps': 148519, 'loss/train': 1.1405597925186157} 11/07/2021 18:07:54 - INFO - __main__ - Step 148521: {'lr': 1.233599085671e-07, 'samples': 28516032, 'steps': 148520, 'loss/train': 1.7404135465621948} 11/07/2021 18:07:55 - INFO - __main__ - Step 148522: {'lr': 1.2319327599422115e-07, 'samples': 28516224, 'steps': 148521, 'loss/train': 1.4521968364715576} 11/07/2021 18:07:55 - INFO - __main__ - Step 148523: {'lr': 1.2302675601197355e-07, 'samples': 28516416, 'steps': 148522, 'loss/train': 1.1342201232910156} 11/07/2021 18:07:56 - INFO - __main__ - Step 148524: {'lr': 1.2286034862041274e-07, 'samples': 28516608, 'steps': 148523, 'loss/train': 1.2148979902267456} 11/07/2021 18:07:56 - INFO - __main__ - Step 148525: {'lr': 1.226940538196497e-07, 'samples': 28516800, 'steps': 148524, 'loss/train': 1.51048743724823} 11/07/2021 18:07:57 - INFO - __main__ - Step 148526: {'lr': 1.2252787160973998e-07, 'samples': 28516992, 'steps': 148525, 'loss/train': 1.069901943206787} 11/07/2021 18:07:57 - INFO - __main__ - Step 148527: {'lr': 1.2236180199076686e-07, 'samples': 28517184, 'steps': 148526, 'loss/train': 0.8851209878921509} 11/07/2021 18:07:57 - INFO - __main__ - Step 148528: {'lr': 1.2219584496281354e-07, 'samples': 28517376, 'steps': 148527, 'loss/train': 1.2418166399002075} 11/07/2021 18:07:58 - INFO - __main__ - Step 148529: {'lr': 1.2203000052590784e-07, 'samples': 28517568, 'steps': 148528, 'loss/train': 0.8187915682792664} 11/07/2021 18:07:59 - INFO - __main__ - Step 148530: {'lr': 1.218642686801885e-07, 'samples': 28517760, 'steps': 148529, 'loss/train': 1.319635272026062} 11/07/2021 18:07:59 - INFO - __main__ - Step 148531: {'lr': 1.2169864942571106e-07, 'samples': 28517952, 'steps': 148530, 'loss/train': 1.494956612586975} 11/07/2021 18:07:59 - INFO - __main__ - Step 148532: {'lr': 1.2153314276250326e-07, 'samples': 28518144, 'steps': 148531, 'loss/train': 1.117923617362976} 11/07/2021 18:08:00 - INFO - __main__ - Step 148533: {'lr': 1.2136774869070388e-07, 'samples': 28518336, 'steps': 148532, 'loss/train': 1.1571967601776123} 11/07/2021 18:08:00 - INFO - __main__ - Step 148534: {'lr': 1.2120246721036842e-07, 'samples': 28518528, 'steps': 148533, 'loss/train': 1.4529039859771729} 11/07/2021 18:08:01 - INFO - __main__ - Step 148535: {'lr': 1.2103729832155242e-07, 'samples': 28518720, 'steps': 148534, 'loss/train': 1.1973729133605957} 11/07/2021 18:08:02 - INFO - __main__ - Step 148536: {'lr': 1.208722420243391e-07, 'samples': 28518912, 'steps': 148535, 'loss/train': 1.1115353107452393} 11/07/2021 18:08:02 - INFO - __main__ - Step 148537: {'lr': 1.2070729831878402e-07, 'samples': 28519104, 'steps': 148536, 'loss/train': 1.3766309022903442} 11/07/2021 18:08:02 - INFO - __main__ - Step 148538: {'lr': 1.2054246720499817e-07, 'samples': 28519296, 'steps': 148537, 'loss/train': 1.213259220123291} 11/07/2021 18:08:03 - INFO - __main__ - Step 148539: {'lr': 1.2037774868306485e-07, 'samples': 28519488, 'steps': 148538, 'loss/train': 1.5785672664642334} 11/07/2021 18:08:04 - INFO - __main__ - Step 148540: {'lr': 1.2021314275301177e-07, 'samples': 28519680, 'steps': 148539, 'loss/train': 0.9094685912132263} 11/07/2021 18:08:04 - INFO - __main__ - Step 148541: {'lr': 1.2004864941492223e-07, 'samples': 28519872, 'steps': 148540, 'loss/train': 1.1603310108184814} 11/07/2021 18:08:04 - INFO - __main__ - Step 148542: {'lr': 1.1988426866890722e-07, 'samples': 28520064, 'steps': 148541, 'loss/train': 1.3824061155319214} 11/07/2021 18:08:05 - INFO - __main__ - Step 148543: {'lr': 1.1972000051499454e-07, 'samples': 28520256, 'steps': 148542, 'loss/train': 1.4898360967636108} 11/07/2021 18:08:05 - INFO - __main__ - Step 148544: {'lr': 1.195558449532952e-07, 'samples': 28520448, 'steps': 148543, 'loss/train': 2.62955904006958} 11/07/2021 18:08:06 - INFO - __main__ - Step 148545: {'lr': 1.1939180198386467e-07, 'samples': 28520640, 'steps': 148544, 'loss/train': 1.2592006921768188} 11/07/2021 18:08:07 - INFO - __main__ - Step 148546: {'lr': 1.192278716067863e-07, 'samples': 28520832, 'steps': 148545, 'loss/train': 1.3177821636199951} 11/07/2021 18:08:07 - INFO - __main__ - Step 148547: {'lr': 1.190640538221155e-07, 'samples': 28521024, 'steps': 148546, 'loss/train': 1.2738564014434814} 11/07/2021 18:08:07 - INFO - __main__ - Step 148548: {'lr': 1.189003486299356e-07, 'samples': 28521216, 'steps': 148547, 'loss/train': 1.5293558835983276} 11/07/2021 18:08:08 - INFO - __main__ - Step 148549: {'lr': 1.1873675603032986e-07, 'samples': 28521408, 'steps': 148548, 'loss/train': 1.2475651502609253} 11/07/2021 18:08:09 - INFO - __main__ - Step 148550: {'lr': 1.1857327602338153e-07, 'samples': 28521600, 'steps': 148549, 'loss/train': 1.1661510467529297} 11/07/2021 18:08:09 - INFO - __main__ - Step 148551: {'lr': 1.1840990860911838e-07, 'samples': 28521792, 'steps': 148550, 'loss/train': 1.09610116481781} 11/07/2021 18:08:09 - INFO - __main__ - Step 148552: {'lr': 1.1824665378765142e-07, 'samples': 28521984, 'steps': 148551, 'loss/train': 0.5124954581260681} 11/07/2021 18:08:10 - INFO - __main__ - Step 148553: {'lr': 1.1808351155906394e-07, 'samples': 28522176, 'steps': 148552, 'loss/train': 1.4656389951705933} 11/07/2021 18:08:10 - INFO - __main__ - Step 148554: {'lr': 1.1792048192341143e-07, 'samples': 28522368, 'steps': 148553, 'loss/train': 1.397231936454773} 11/07/2021 18:08:11 - INFO - __main__ - Step 148555: {'lr': 1.177575648807494e-07, 'samples': 28522560, 'steps': 148554, 'loss/train': 0.8214816451072693} 11/07/2021 18:08:11 - INFO - __main__ - Step 148556: {'lr': 1.1759476043118889e-07, 'samples': 28522752, 'steps': 148555, 'loss/train': 1.1717798709869385} 11/07/2021 18:08:12 - INFO - __main__ - Step 148557: {'lr': 1.1743206857478539e-07, 'samples': 28522944, 'steps': 148556, 'loss/train': 1.4944416284561157} 11/07/2021 18:08:12 - INFO - __main__ - Step 148558: {'lr': 1.1726948931159443e-07, 'samples': 28523136, 'steps': 148557, 'loss/train': 1.3399872779846191} 11/07/2021 18:08:12 - INFO - __main__ - Step 148559: {'lr': 1.1710702264169926e-07, 'samples': 28523328, 'steps': 148558, 'loss/train': 1.1607950925827026} 11/07/2021 18:08:13 - INFO - __main__ - Step 148560: {'lr': 1.169446685652109e-07, 'samples': 28523520, 'steps': 148559, 'loss/train': 0.9456471800804138} 11/07/2021 18:08:14 - INFO - __main__ - Step 148561: {'lr': 1.1678242708212938e-07, 'samples': 28523712, 'steps': 148560, 'loss/train': 1.272789478302002} 11/07/2021 18:08:14 - INFO - __main__ - Step 148562: {'lr': 1.1662029819259346e-07, 'samples': 28523904, 'steps': 148561, 'loss/train': 1.4405561685562134} 11/07/2021 18:08:15 - INFO - __main__ - Step 148563: {'lr': 1.1645828189665863e-07, 'samples': 28524096, 'steps': 148562, 'loss/train': 0.6801477670669556} 11/07/2021 18:08:15 - INFO - __main__ - Step 148564: {'lr': 1.1629637819438044e-07, 'samples': 28524288, 'steps': 148563, 'loss/train': 0.9589516520500183} 11/07/2021 18:08:15 - INFO - __main__ - Step 148565: {'lr': 1.1613458708586988e-07, 'samples': 28524480, 'steps': 148564, 'loss/train': 1.214746356010437} 11/07/2021 18:08:16 - INFO - __main__ - Step 148566: {'lr': 1.1597290857112697e-07, 'samples': 28524672, 'steps': 148565, 'loss/train': 1.2666499614715576} 11/07/2021 18:08:17 - INFO - __main__ - Step 148567: {'lr': 1.1581134265031823e-07, 'samples': 28524864, 'steps': 148566, 'loss/train': 0.8123763203620911} 11/07/2021 18:08:17 - INFO - __main__ - Step 148568: {'lr': 1.1564988932344367e-07, 'samples': 28525056, 'steps': 148567, 'loss/train': 1.4525878429412842} 11/07/2021 18:08:17 - INFO - __main__ - Step 148569: {'lr': 1.1548854859058655e-07, 'samples': 28525248, 'steps': 148568, 'loss/train': 0.9858706593513489} 11/07/2021 18:08:18 - INFO - __main__ - Step 148570: {'lr': 1.153273204518579e-07, 'samples': 28525440, 'steps': 148569, 'loss/train': 1.1195130348205566} 11/07/2021 18:08:19 - INFO - __main__ - Step 148571: {'lr': 1.1516620490731322e-07, 'samples': 28525632, 'steps': 148570, 'loss/train': 1.137447476387024} 11/07/2021 18:08:19 - INFO - __main__ - Step 148572: {'lr': 1.1500520195700803e-07, 'samples': 28525824, 'steps': 148571, 'loss/train': 1.7596502304077148} 11/07/2021 18:08:19 - INFO - __main__ - Step 148573: {'lr': 1.1484431160102559e-07, 'samples': 28526016, 'steps': 148572, 'loss/train': 1.2305337190628052} 11/07/2021 18:08:20 - INFO - __main__ - Step 148574: {'lr': 1.1468353383942143e-07, 'samples': 28526208, 'steps': 148573, 'loss/train': 0.9968480467796326} 11/07/2021 18:08:20 - INFO - __main__ - Step 148575: {'lr': 1.1452286867230655e-07, 'samples': 28526400, 'steps': 148574, 'loss/train': 1.3368875980377197} 11/07/2021 18:08:21 - INFO - __main__ - Step 148576: {'lr': 1.143623160997087e-07, 'samples': 28526592, 'steps': 148575, 'loss/train': 0.9995763897895813} 11/07/2021 18:08:22 - INFO - __main__ - Step 148577: {'lr': 1.1420187612173893e-07, 'samples': 28526784, 'steps': 148576, 'loss/train': 1.6526533365249634} 11/07/2021 18:08:22 - INFO - __main__ - Step 148578: {'lr': 1.140415487384805e-07, 'samples': 28526976, 'steps': 148577, 'loss/train': 1.1233147382736206} 11/07/2021 18:08:22 - INFO - __main__ - Step 148579: {'lr': 1.1388133394993338e-07, 'samples': 28527168, 'steps': 148578, 'loss/train': 1.4764021635055542} 11/07/2021 18:08:23 - INFO - __main__ - Step 148580: {'lr': 1.1372123175623638e-07, 'samples': 28527360, 'steps': 148579, 'loss/train': 1.3735514879226685} 11/07/2021 18:08:24 - INFO - __main__ - Step 148581: {'lr': 1.1356124215744501e-07, 'samples': 28527552, 'steps': 148580, 'loss/train': 1.234092116355896} 11/07/2021 18:08:24 - INFO - __main__ - Step 148582: {'lr': 1.1340136515361477e-07, 'samples': 28527744, 'steps': 148581, 'loss/train': 1.0935696363449097} 11/07/2021 18:08:24 - INFO - __main__ - Step 148583: {'lr': 1.1324160074482893e-07, 'samples': 28527936, 'steps': 148582, 'loss/train': 1.4191967248916626} 11/07/2021 18:08:25 - INFO - __main__ - Step 148584: {'lr': 1.1308194893117074e-07, 'samples': 28528128, 'steps': 148583, 'loss/train': 1.3741341829299927} 11/07/2021 18:08:25 - INFO - __main__ - Step 148585: {'lr': 1.1292240971269574e-07, 'samples': 28528320, 'steps': 148584, 'loss/train': 1.3224709033966064} 11/07/2021 18:08:25 - INFO - __main__ - Step 148586: {'lr': 1.1276298308948718e-07, 'samples': 28528512, 'steps': 148585, 'loss/train': 1.2873430252075195} 11/07/2021 18:08:27 - INFO - __main__ - Step 148587: {'lr': 1.1260366906162833e-07, 'samples': 28528704, 'steps': 148586, 'loss/train': 1.1826460361480713} 11/07/2021 18:08:27 - INFO - __main__ - Step 148588: {'lr': 1.1244446762914695e-07, 'samples': 28528896, 'steps': 148587, 'loss/train': 0.48382213711738586} 11/07/2021 18:08:27 - INFO - __main__ - Step 148589: {'lr': 1.1228537879215406e-07, 'samples': 28529088, 'steps': 148588, 'loss/train': 0.4423404932022095} 11/07/2021 18:08:28 - INFO - __main__ - Step 148590: {'lr': 1.1212640255070516e-07, 'samples': 28529280, 'steps': 148589, 'loss/train': 1.2780572175979614} 11/07/2021 18:08:28 - INFO - __main__ - Step 148591: {'lr': 1.1196753890488354e-07, 'samples': 28529472, 'steps': 148590, 'loss/train': 1.5861258506774902} 11/07/2021 18:08:29 - INFO - __main__ - Step 148592: {'lr': 1.1180878785474469e-07, 'samples': 28529664, 'steps': 148591, 'loss/train': 1.1711955070495605} 11/07/2021 18:08:29 - INFO - __main__ - Step 148593: {'lr': 1.1165014940037188e-07, 'samples': 28529856, 'steps': 148592, 'loss/train': 1.1856344938278198} 11/07/2021 18:08:30 - INFO - __main__ - Step 148594: {'lr': 1.1149162354182062e-07, 'samples': 28530048, 'steps': 148593, 'loss/train': 0.6057782173156738} 11/07/2021 18:08:30 - INFO - __main__ - Step 148595: {'lr': 1.113332102791742e-07, 'samples': 28530240, 'steps': 148594, 'loss/train': 1.0494444370269775} 11/07/2021 18:08:30 - INFO - __main__ - Step 148596: {'lr': 1.1117490961251586e-07, 'samples': 28530432, 'steps': 148595, 'loss/train': 1.3866816759109497} 11/07/2021 18:08:31 - INFO - __main__ - Step 148597: {'lr': 1.1101672154192888e-07, 'samples': 28530624, 'steps': 148596, 'loss/train': 1.368009090423584} 11/07/2021 18:08:32 - INFO - __main__ - Step 148598: {'lr': 1.10858646067441e-07, 'samples': 28530816, 'steps': 148597, 'loss/train': 1.2012618780136108} 11/07/2021 18:08:32 - INFO - __main__ - Step 148599: {'lr': 1.107006831891355e-07, 'samples': 28531008, 'steps': 148598, 'loss/train': 1.5306271314620972} 11/07/2021 18:08:32 - INFO - __main__ - Step 148600: {'lr': 1.1054283290709566e-07, 'samples': 28531200, 'steps': 148599, 'loss/train': 1.5202064514160156} 11/07/2021 18:08:33 - INFO - __main__ - Step 148601: {'lr': 1.1038509522140472e-07, 'samples': 28531392, 'steps': 148600, 'loss/train': 1.9349381923675537} 11/07/2021 18:08:34 - INFO - __main__ - Step 148602: {'lr': 1.1022747013209044e-07, 'samples': 28531584, 'steps': 148601, 'loss/train': 1.6711236238479614} 11/07/2021 18:08:34 - INFO - __main__ - Step 148603: {'lr': 1.1006995763929161e-07, 'samples': 28531776, 'steps': 148602, 'loss/train': 0.827761709690094} 11/07/2021 18:08:35 - INFO - __main__ - Step 148604: {'lr': 1.0991255774300823e-07, 'samples': 28531968, 'steps': 148603, 'loss/train': 1.1697746515274048} 11/07/2021 18:08:35 - INFO - __main__ - Step 148605: {'lr': 1.0975527044335132e-07, 'samples': 28532160, 'steps': 148604, 'loss/train': 1.1778842210769653} 11/07/2021 18:08:35 - INFO - __main__ - Step 148606: {'lr': 1.0959809574037639e-07, 'samples': 28532352, 'steps': 148605, 'loss/train': 1.1209073066711426} 11/07/2021 18:08:36 - INFO - __main__ - Step 148607: {'lr': 1.0944103363416669e-07, 'samples': 28532544, 'steps': 148606, 'loss/train': 0.959929883480072} 11/07/2021 18:08:37 - INFO - __main__ - Step 148608: {'lr': 1.0928408412477775e-07, 'samples': 28532736, 'steps': 148607, 'loss/train': 1.5622096061706543} 11/07/2021 18:08:37 - INFO - __main__ - Step 148609: {'lr': 1.0912724721232059e-07, 'samples': 28532928, 'steps': 148608, 'loss/train': 1.6608952283859253} 11/07/2021 18:08:37 - INFO - __main__ - Step 148610: {'lr': 1.0897052289679521e-07, 'samples': 28533120, 'steps': 148609, 'loss/train': 1.4142839908599854} 11/07/2021 18:08:38 - INFO - __main__ - Step 148611: {'lr': 1.0881391117834038e-07, 'samples': 28533312, 'steps': 148610, 'loss/train': 1.6710766553878784} 11/07/2021 18:08:38 - INFO - __main__ - Step 148612: {'lr': 1.0865741205698387e-07, 'samples': 28533504, 'steps': 148611, 'loss/train': 1.2110103368759155} 11/07/2021 18:08:39 - INFO - __main__ - Step 148613: {'lr': 1.0850102553280893e-07, 'samples': 28533696, 'steps': 148612, 'loss/train': 1.7291446924209595} 11/07/2021 18:08:39 - INFO - __main__ - Step 148614: {'lr': 1.0834475160589885e-07, 'samples': 28533888, 'steps': 148613, 'loss/train': 0.6181808114051819} 11/07/2021 18:08:40 - INFO - __main__ - Step 148615: {'lr': 1.0818859027628137e-07, 'samples': 28534080, 'steps': 148614, 'loss/train': 1.0146716833114624} 11/07/2021 18:08:40 - INFO - __main__ - Step 148616: {'lr': 1.0803254154409525e-07, 'samples': 28534272, 'steps': 148615, 'loss/train': 1.538388729095459} 11/07/2021 18:08:41 - INFO - __main__ - Step 148617: {'lr': 1.0787660540936827e-07, 'samples': 28534464, 'steps': 148616, 'loss/train': 1.0809426307678223} 11/07/2021 18:08:42 - INFO - __main__ - Step 148618: {'lr': 1.0772078187215595e-07, 'samples': 28534656, 'steps': 148617, 'loss/train': 1.518444299697876} 11/07/2021 18:08:42 - INFO - __main__ - Step 148619: {'lr': 1.0756507093256929e-07, 'samples': 28534848, 'steps': 148618, 'loss/train': 1.5011500120162964} 11/07/2021 18:08:42 - INFO - __main__ - Step 148620: {'lr': 1.0740947259063605e-07, 'samples': 28535040, 'steps': 148619, 'loss/train': 1.9181159734725952} 11/07/2021 18:08:43 - INFO - __main__ - Step 148621: {'lr': 1.0725398684646725e-07, 'samples': 28535232, 'steps': 148620, 'loss/train': 1.285168170928955} 11/07/2021 18:08:43 - INFO - __main__ - Step 148622: {'lr': 1.0709861370009066e-07, 'samples': 28535424, 'steps': 148621, 'loss/train': 1.3397330045700073} 11/07/2021 18:08:44 - INFO - __main__ - Step 148623: {'lr': 1.069433531516173e-07, 'samples': 28535616, 'steps': 148622, 'loss/train': 1.853712558746338} 11/07/2021 18:08:44 - INFO - __main__ - Step 148624: {'lr': 1.0678820520110266e-07, 'samples': 28535808, 'steps': 148623, 'loss/train': 1.098406434059143} 11/07/2021 18:08:45 - INFO - __main__ - Step 148625: {'lr': 1.0663316984860228e-07, 'samples': 28536000, 'steps': 148624, 'loss/train': 1.4315521717071533} 11/07/2021 18:08:45 - INFO - __main__ - Step 148626: {'lr': 1.064782470941994e-07, 'samples': 28536192, 'steps': 148625, 'loss/train': 1.6266535520553589} 11/07/2021 18:08:45 - INFO - __main__ - Step 148627: {'lr': 1.063234369379773e-07, 'samples': 28536384, 'steps': 148626, 'loss/train': 1.639100193977356} 11/07/2021 18:08:46 - INFO - __main__ - Step 148628: {'lr': 1.0616873937996374e-07, 'samples': 28536576, 'steps': 148627, 'loss/train': 1.7021422386169434} 11/07/2021 18:08:47 - INFO - __main__ - Step 148629: {'lr': 1.0601415442026973e-07, 'samples': 28536768, 'steps': 148628, 'loss/train': 1.3766512870788574} 11/07/2021 18:08:47 - INFO - __main__ - Step 148630: {'lr': 1.058596820589508e-07, 'samples': 28536960, 'steps': 148629, 'loss/train': 1.4839367866516113} 11/07/2021 18:08:47 - INFO - __main__ - Step 148631: {'lr': 1.0570532229606244e-07, 'samples': 28537152, 'steps': 148630, 'loss/train': 0.7557640671730042} 11/07/2021 18:08:48 - INFO - __main__ - Step 148632: {'lr': 1.0555107513171569e-07, 'samples': 28537344, 'steps': 148631, 'loss/train': 0.9234901070594788} 11/07/2021 18:08:49 - INFO - __main__ - Step 148633: {'lr': 1.053969405659383e-07, 'samples': 28537536, 'steps': 148632, 'loss/train': 1.2479267120361328} 11/07/2021 18:08:49 - INFO - __main__ - Step 148634: {'lr': 1.0524291859878577e-07, 'samples': 28537728, 'steps': 148633, 'loss/train': 1.4233278036117554} 11/07/2021 18:08:50 - INFO - __main__ - Step 148635: {'lr': 1.0508900923039688e-07, 'samples': 28537920, 'steps': 148634, 'loss/train': 1.282604455947876} 11/07/2021 18:08:50 - INFO - __main__ - Step 148636: {'lr': 1.0493521246077165e-07, 'samples': 28538112, 'steps': 148635, 'loss/train': 1.2962336540222168} 11/07/2021 18:08:50 - INFO - __main__ - Step 148637: {'lr': 1.0478152829002108e-07, 'samples': 28538304, 'steps': 148636, 'loss/train': 1.216082215309143} 11/07/2021 18:08:51 - INFO - __main__ - Step 148638: {'lr': 1.0462795671817294e-07, 'samples': 28538496, 'steps': 148637, 'loss/train': 1.5654425621032715} 11/07/2021 18:08:52 - INFO - __main__ - Step 148639: {'lr': 1.0447449774533824e-07, 'samples': 28538688, 'steps': 148638, 'loss/train': 1.3802545070648193} 11/07/2021 18:08:52 - INFO - __main__ - Step 148640: {'lr': 1.043211513715725e-07, 'samples': 28538880, 'steps': 148639, 'loss/train': 1.347458839416504} 11/07/2021 18:08:52 - INFO - __main__ - Step 148641: {'lr': 1.0416791759695898e-07, 'samples': 28539072, 'steps': 148640, 'loss/train': 1.1422866582870483} 11/07/2021 18:08:53 - INFO - __main__ - Step 148642: {'lr': 1.0401479642152545e-07, 'samples': 28539264, 'steps': 148641, 'loss/train': 1.7181527614593506} 11/07/2021 18:08:53 - INFO - __main__ - Step 148643: {'lr': 1.0386178784538292e-07, 'samples': 28539456, 'steps': 148642, 'loss/train': 1.2784976959228516} 11/07/2021 18:08:54 - INFO - __main__ - Step 148644: {'lr': 1.037088918685869e-07, 'samples': 28539648, 'steps': 148643, 'loss/train': 0.839471161365509} 11/07/2021 18:08:55 - INFO - __main__ - Step 148645: {'lr': 1.035561084911929e-07, 'samples': 28539840, 'steps': 148644, 'loss/train': 0.7418854236602783} 11/07/2021 18:08:55 - INFO - __main__ - Step 148646: {'lr': 1.034034377132842e-07, 'samples': 28540032, 'steps': 148645, 'loss/train': 1.287124514579773} 11/07/2021 18:08:55 - INFO - __main__ - Step 148647: {'lr': 1.0325087953494406e-07, 'samples': 28540224, 'steps': 148646, 'loss/train': 1.1078057289123535} 11/07/2021 18:08:56 - INFO - __main__ - Step 148648: {'lr': 1.0309843395620022e-07, 'samples': 28540416, 'steps': 148647, 'loss/train': 1.393808126449585} 11/07/2021 18:08:57 - INFO - __main__ - Step 148649: {'lr': 1.0294610097713597e-07, 'samples': 28540608, 'steps': 148648, 'loss/train': 1.8491020202636719} 11/07/2021 18:08:57 - INFO - __main__ - Step 148650: {'lr': 1.0279388059783457e-07, 'samples': 28540800, 'steps': 148649, 'loss/train': 1.1006096601486206} 11/07/2021 18:08:57 - INFO - __main__ - Step 148651: {'lr': 1.0264177281837927e-07, 'samples': 28540992, 'steps': 148650, 'loss/train': 1.2933969497680664} 11/07/2021 18:08:58 - INFO - __main__ - Step 148652: {'lr': 1.0248977763879785e-07, 'samples': 28541184, 'steps': 148651, 'loss/train': 0.7135953903198242} 11/07/2021 18:08:58 - INFO - __main__ - Step 148653: {'lr': 1.0233789505917357e-07, 'samples': 28541376, 'steps': 148652, 'loss/train': 1.5436292886734009} 11/07/2021 18:08:58 - INFO - __main__ - Step 148654: {'lr': 1.021861250795897e-07, 'samples': 28541568, 'steps': 148653, 'loss/train': 1.3051457405090332} 11/07/2021 18:09:00 - INFO - __main__ - Step 148655: {'lr': 1.0203446770012947e-07, 'samples': 28541760, 'steps': 148654, 'loss/train': 1.364410400390625} 11/07/2021 18:09:00 - INFO - __main__ - Step 148656: {'lr': 1.0188292292079293e-07, 'samples': 28541952, 'steps': 148655, 'loss/train': 1.1736356019973755} 11/07/2021 18:09:00 - INFO - __main__ - Step 148657: {'lr': 1.0173149074171883e-07, 'samples': 28542144, 'steps': 148656, 'loss/train': 1.016705870628357} 11/07/2021 18:09:01 - INFO - __main__ - Step 148658: {'lr': 1.0158017116293494e-07, 'samples': 28542336, 'steps': 148657, 'loss/train': 1.1537283658981323} 11/07/2021 18:09:01 - INFO - __main__ - Step 148659: {'lr': 1.0142896418452452e-07, 'samples': 28542528, 'steps': 148658, 'loss/train': 1.2374564409255981} 11/07/2021 18:09:02 - INFO - __main__ - Step 148660: {'lr': 1.0127786980657084e-07, 'samples': 28542720, 'steps': 148659, 'loss/train': 0.7925559878349304} 11/07/2021 18:09:02 - INFO - __main__ - Step 148661: {'lr': 1.0112688802910164e-07, 'samples': 28542912, 'steps': 148660, 'loss/train': 0.9407751560211182} 11/07/2021 18:09:03 - INFO - __main__ - Step 148662: {'lr': 1.0097601885222796e-07, 'samples': 28543104, 'steps': 148661, 'loss/train': 1.2705198526382446} 11/07/2021 18:09:03 - INFO - __main__ - Step 148663: {'lr': 1.0082526227597755e-07, 'samples': 28543296, 'steps': 148662, 'loss/train': 1.6956591606140137} 11/07/2021 18:09:03 - INFO - __main__ - Step 148664: {'lr': 1.0067461830046143e-07, 'samples': 28543488, 'steps': 148663, 'loss/train': 1.3573200702667236} 11/07/2021 18:09:05 - INFO - __main__ - Step 148665: {'lr': 1.0052408692570735e-07, 'samples': 28543680, 'steps': 148664, 'loss/train': 1.4509341716766357} 11/07/2021 18:09:05 - INFO - __main__ - Step 148666: {'lr': 1.003736681517986e-07, 'samples': 28543872, 'steps': 148665, 'loss/train': 0.8071626424789429} 11/07/2021 18:09:05 - INFO - __main__ - Step 148667: {'lr': 1.0022336197881843e-07, 'samples': 28544064, 'steps': 148666, 'loss/train': 1.7270479202270508} 11/07/2021 18:09:06 - INFO - __main__ - Step 148668: {'lr': 1.0007316840682234e-07, 'samples': 28544256, 'steps': 148667, 'loss/train': 1.4858282804489136} 11/07/2021 18:09:06 - INFO - __main__ - Step 148669: {'lr': 9.992308743586587e-08, 'samples': 28544448, 'steps': 148668, 'loss/train': 1.5087798833847046} 11/07/2021 18:09:07 - INFO - __main__ - Step 148670: {'lr': 9.977311906603226e-08, 'samples': 28544640, 'steps': 148669, 'loss/train': 1.2777522802352905} 11/07/2021 18:09:08 - INFO - __main__ - Step 148671: {'lr': 9.962326329737703e-08, 'samples': 28544832, 'steps': 148670, 'loss/train': 0.8495522737503052} 11/07/2021 18:09:08 - INFO - __main__ - Step 148672: {'lr': 9.947352012998345e-08, 'samples': 28545024, 'steps': 148671, 'loss/train': 1.877095341682434} 11/07/2021 18:09:08 - INFO - __main__ - Step 148673: {'lr': 9.93238895639348e-08, 'samples': 28545216, 'steps': 148672, 'loss/train': 1.7168272733688354} 11/07/2021 18:09:09 - INFO - __main__ - Step 148674: {'lr': 9.917437159923104e-08, 'samples': 28545408, 'steps': 148673, 'loss/train': 1.2487454414367676} 11/07/2021 18:09:09 - INFO - __main__ - Step 148675: {'lr': 9.902496623601098e-08, 'samples': 28545600, 'steps': 148674, 'loss/train': 1.2965996265411377} 11/07/2021 18:09:10 - INFO - __main__ - Step 148676: {'lr': 9.887567347430237e-08, 'samples': 28545792, 'steps': 148675, 'loss/train': 1.647507905960083} 11/07/2021 18:09:10 - INFO - __main__ - Step 148677: {'lr': 9.872649331418848e-08, 'samples': 28545984, 'steps': 148676, 'loss/train': 0.9038680791854858} 11/07/2021 18:09:11 - INFO - __main__ - Step 148678: {'lr': 9.857742575575256e-08, 'samples': 28546176, 'steps': 148677, 'loss/train': 1.396834373474121} 11/07/2021 18:09:11 - INFO - __main__ - Step 148679: {'lr': 9.84284707990224e-08, 'samples': 28546368, 'steps': 148678, 'loss/train': 0.8088366389274597} 11/07/2021 18:09:11 - INFO - __main__ - Step 148680: {'lr': 9.827962844408122e-08, 'samples': 28546560, 'steps': 148679, 'loss/train': 1.2253459692001343} 11/07/2021 18:09:13 - INFO - __main__ - Step 148681: {'lr': 9.813089869098458e-08, 'samples': 28546752, 'steps': 148680, 'loss/train': 0.7875246405601501} 11/07/2021 18:09:13 - INFO - __main__ - Step 148682: {'lr': 9.798228153984345e-08, 'samples': 28546944, 'steps': 148681, 'loss/train': 1.3862019777297974} 11/07/2021 18:09:13 - INFO - __main__ - Step 148683: {'lr': 9.783377699068563e-08, 'samples': 28547136, 'steps': 148682, 'loss/train': 1.0932046175003052} 11/07/2021 18:09:14 - INFO - __main__ - Step 148684: {'lr': 9.768538504356661e-08, 'samples': 28547328, 'steps': 148683, 'loss/train': 1.9089263677597046} 11/07/2021 18:09:14 - INFO - __main__ - Step 148685: {'lr': 9.753710569859742e-08, 'samples': 28547520, 'steps': 148684, 'loss/train': 1.457859992980957} 11/07/2021 18:09:14 - INFO - __main__ - Step 148686: {'lr': 9.738893895580581e-08, 'samples': 28547712, 'steps': 148685, 'loss/train': 1.3916475772857666} 11/07/2021 18:09:15 - INFO - __main__ - Step 148687: {'lr': 9.72408848153028e-08, 'samples': 28547904, 'steps': 148686, 'loss/train': 1.7868729829788208} 11/07/2021 18:09:16 - INFO - __main__ - Step 148688: {'lr': 9.709294327708839e-08, 'samples': 28548096, 'steps': 148687, 'loss/train': 1.154142141342163} 11/07/2021 18:09:16 - INFO - __main__ - Step 148689: {'lr': 9.694511434130138e-08, 'samples': 28548288, 'steps': 148688, 'loss/train': 0.8704668283462524} 11/07/2021 18:09:16 - INFO - __main__ - Step 148690: {'lr': 9.67973980079695e-08, 'samples': 28548480, 'steps': 148689, 'loss/train': 2.6177494525909424} 11/07/2021 18:09:17 - INFO - __main__ - Step 148691: {'lr': 9.664979427714826e-08, 'samples': 28548672, 'steps': 148690, 'loss/train': 1.1301921606063843} 11/07/2021 18:09:18 - INFO - __main__ - Step 148692: {'lr': 9.650230314892094e-08, 'samples': 28548864, 'steps': 148691, 'loss/train': 0.9130756855010986} 11/07/2021 18:09:18 - INFO - __main__ - Step 148693: {'lr': 9.635492462337081e-08, 'samples': 28549056, 'steps': 148692, 'loss/train': 1.6495143175125122} 11/07/2021 18:09:19 - INFO - __main__ - Step 148694: {'lr': 9.620765870052562e-08, 'samples': 28549248, 'steps': 148693, 'loss/train': 0.03116004355251789} 11/07/2021 18:09:19 - INFO - __main__ - Step 148695: {'lr': 9.606050538049638e-08, 'samples': 28549440, 'steps': 148694, 'loss/train': 1.6256725788116455} 11/07/2021 18:09:19 - INFO - __main__ - Step 148696: {'lr': 9.591346466331086e-08, 'samples': 28549632, 'steps': 148695, 'loss/train': 1.305224895477295} 11/07/2021 18:09:20 - INFO - __main__ - Step 148697: {'lr': 9.576653654905231e-08, 'samples': 28549824, 'steps': 148696, 'loss/train': 1.2747931480407715} 11/07/2021 18:09:21 - INFO - __main__ - Step 148698: {'lr': 9.561972103777628e-08, 'samples': 28550016, 'steps': 148697, 'loss/train': 1.1246854066848755} 11/07/2021 18:09:21 - INFO - __main__ - Step 148699: {'lr': 9.547301812959375e-08, 'samples': 28550208, 'steps': 148698, 'loss/train': 1.4515855312347412} 11/07/2021 18:09:21 - INFO - __main__ - Step 148700: {'lr': 9.532642782450473e-08, 'samples': 28550400, 'steps': 148699, 'loss/train': 0.9226098656654358} 11/07/2021 18:09:22 - INFO - __main__ - Step 148701: {'lr': 9.517995012259251e-08, 'samples': 28550592, 'steps': 148700, 'loss/train': 1.1755893230438232} 11/07/2021 18:09:23 - INFO - __main__ - Step 148702: {'lr': 9.503358502396809e-08, 'samples': 28550784, 'steps': 148701, 'loss/train': 0.9961158633232117} 11/07/2021 18:09:23 - INFO - __main__ - Step 148703: {'lr': 9.488733252863147e-08, 'samples': 28550976, 'steps': 148702, 'loss/train': 1.4012550115585327} 11/07/2021 18:09:23 - INFO - __main__ - Step 148704: {'lr': 9.474119263672143e-08, 'samples': 28551168, 'steps': 148703, 'loss/train': 0.9467273950576782} 11/07/2021 18:09:24 - INFO - __main__ - Step 148705: {'lr': 9.459516534823798e-08, 'samples': 28551360, 'steps': 148704, 'loss/train': 1.3714977502822876} 11/07/2021 18:09:24 - INFO - __main__ - Step 148706: {'lr': 9.444925066329213e-08, 'samples': 28551552, 'steps': 148705, 'loss/train': 1.270004153251648} 11/07/2021 18:09:24 - INFO - __main__ - Step 148707: {'lr': 9.430344858191164e-08, 'samples': 28551744, 'steps': 148706, 'loss/train': 1.2933728694915771} 11/07/2021 18:09:26 - INFO - __main__ - Step 148708: {'lr': 9.415775910417979e-08, 'samples': 28551936, 'steps': 148707, 'loss/train': 1.3136276006698608} 11/07/2021 18:09:26 - INFO - __main__ - Step 148709: {'lr': 9.401218223017982e-08, 'samples': 28552128, 'steps': 148708, 'loss/train': 1.105061650276184} 11/07/2021 18:09:26 - INFO - __main__ - Step 148710: {'lr': 9.386671795996726e-08, 'samples': 28552320, 'steps': 148709, 'loss/train': 1.496767282485962} 11/07/2021 18:09:27 - INFO - __main__ - Step 148711: {'lr': 9.372136629356987e-08, 'samples': 28552512, 'steps': 148710, 'loss/train': 1.6084855794906616} 11/07/2021 18:09:27 - INFO - __main__ - Step 148712: {'lr': 9.35761272311264e-08, 'samples': 28552704, 'steps': 148711, 'loss/train': 1.5827462673187256} 11/07/2021 18:09:28 - INFO - __main__ - Step 148713: {'lr': 9.343100077263689e-08, 'samples': 28552896, 'steps': 148712, 'loss/train': 1.130031704902649} 11/07/2021 18:09:28 - INFO - __main__ - Step 148714: {'lr': 9.328598691818457e-08, 'samples': 28553088, 'steps': 148713, 'loss/train': 1.2897100448608398} 11/07/2021 18:09:29 - INFO - __main__ - Step 148715: {'lr': 9.314108566785273e-08, 'samples': 28553280, 'steps': 148714, 'loss/train': 0.9187741875648499} 11/07/2021 18:09:29 - INFO - __main__ - Step 148716: {'lr': 9.299629702169687e-08, 'samples': 28553472, 'steps': 148715, 'loss/train': 2.095726251602173} 11/07/2021 18:09:29 - INFO - __main__ - Step 148717: {'lr': 9.285162097977251e-08, 'samples': 28553664, 'steps': 148716, 'loss/train': 0.7596377730369568} 11/07/2021 18:09:30 - INFO - __main__ - Step 148718: {'lr': 9.270705754216291e-08, 'samples': 28553856, 'steps': 148717, 'loss/train': 1.4525368213653564} 11/07/2021 18:09:31 - INFO - __main__ - Step 148719: {'lr': 9.256260670892358e-08, 'samples': 28554048, 'steps': 148718, 'loss/train': 1.2777726650238037} 11/07/2021 18:09:31 - INFO - __main__ - Step 148720: {'lr': 9.241826848011003e-08, 'samples': 28554240, 'steps': 148719, 'loss/train': 1.3240569829940796} 11/07/2021 18:09:31 - INFO - __main__ - Step 148721: {'lr': 9.227404285580554e-08, 'samples': 28554432, 'steps': 148720, 'loss/train': 0.9187221527099609} 11/07/2021 18:09:32 - INFO - __main__ - Step 148722: {'lr': 9.21299298360656e-08, 'samples': 28554624, 'steps': 148721, 'loss/train': 1.016813039779663} 11/07/2021 18:09:33 - INFO - __main__ - Step 148723: {'lr': 9.198592942094575e-08, 'samples': 28554816, 'steps': 148722, 'loss/train': 1.187378168106079} 11/07/2021 18:09:33 - INFO - __main__ - Step 148724: {'lr': 9.184204161052922e-08, 'samples': 28555008, 'steps': 148723, 'loss/train': 1.0547802448272705} 11/07/2021 18:09:34 - INFO - __main__ - Step 148725: {'lr': 9.169826640487156e-08, 'samples': 28555200, 'steps': 148724, 'loss/train': 1.2787086963653564} 11/07/2021 18:09:34 - INFO - __main__ - Step 148726: {'lr': 9.155460380402824e-08, 'samples': 28555392, 'steps': 148725, 'loss/train': 1.1425954103469849} 11/07/2021 18:09:34 - INFO - __main__ - Step 148727: {'lr': 9.141105380808256e-08, 'samples': 28555584, 'steps': 148726, 'loss/train': 1.204940915107727} 11/07/2021 18:09:35 - INFO - __main__ - Step 148728: {'lr': 9.126761641709003e-08, 'samples': 28555776, 'steps': 148727, 'loss/train': 1.3579375743865967} 11/07/2021 18:09:36 - INFO - __main__ - Step 148729: {'lr': 9.112429163110614e-08, 'samples': 28555968, 'steps': 148728, 'loss/train': 0.18388572335243225} 11/07/2021 18:09:36 - INFO - __main__ - Step 148730: {'lr': 9.098107945021417e-08, 'samples': 28556160, 'steps': 148729, 'loss/train': 1.2909557819366455} 11/07/2021 18:09:36 - INFO - __main__ - Step 148731: {'lr': 9.083797987446963e-08, 'samples': 28556352, 'steps': 148730, 'loss/train': 0.4674265682697296} 11/07/2021 18:09:37 - INFO - __main__ - Step 148732: {'lr': 9.069499290395577e-08, 'samples': 28556544, 'steps': 148731, 'loss/train': 1.5438406467437744} 11/07/2021 18:09:37 - INFO - __main__ - Step 148733: {'lr': 9.055211853870038e-08, 'samples': 28556736, 'steps': 148732, 'loss/train': 1.28790283203125} 11/07/2021 18:09:38 - INFO - __main__ - Step 148734: {'lr': 9.04093567787867e-08, 'samples': 28556928, 'steps': 148733, 'loss/train': 1.68722665309906} 11/07/2021 18:09:39 - INFO - __main__ - Step 148735: {'lr': 9.026670762427025e-08, 'samples': 28557120, 'steps': 148734, 'loss/train': 0.8466659188270569} 11/07/2021 18:09:39 - INFO - __main__ - Step 148736: {'lr': 9.012417107523429e-08, 'samples': 28557312, 'steps': 148735, 'loss/train': 1.353578805923462} 11/07/2021 18:09:39 - INFO - __main__ - Step 148737: {'lr': 8.998174713173435e-08, 'samples': 28557504, 'steps': 148736, 'loss/train': 0.8240156769752502} 11/07/2021 18:09:40 - INFO - __main__ - Step 148738: {'lr': 8.983943579382592e-08, 'samples': 28557696, 'steps': 148737, 'loss/train': 1.0952091217041016} 11/07/2021 18:09:41 - INFO - __main__ - Step 148739: {'lr': 8.969723706156452e-08, 'samples': 28557888, 'steps': 148738, 'loss/train': 1.2295701503753662} 11/07/2021 18:09:41 - INFO - __main__ - Step 148740: {'lr': 8.955515093506118e-08, 'samples': 28558080, 'steps': 148739, 'loss/train': 1.4253273010253906} 11/07/2021 18:09:41 - INFO - __main__ - Step 148741: {'lr': 8.941317741431587e-08, 'samples': 28558272, 'steps': 148740, 'loss/train': 1.908067226409912} 11/07/2021 18:09:42 - INFO - __main__ - Step 148742: {'lr': 8.927131649943965e-08, 'samples': 28558464, 'steps': 148741, 'loss/train': 1.5220364332199097} 11/07/2021 18:09:42 - INFO - __main__ - Step 148743: {'lr': 8.912956819048801e-08, 'samples': 28558656, 'steps': 148742, 'loss/train': 1.2546864748001099} 11/07/2021 18:09:43 - INFO - __main__ - Step 148744: {'lr': 8.898793248751646e-08, 'samples': 28558848, 'steps': 148743, 'loss/train': 1.3339457511901855} 11/07/2021 18:09:43 - INFO - __main__ - Step 148745: {'lr': 8.884640939058053e-08, 'samples': 28559040, 'steps': 148744, 'loss/train': 1.3026037216186523} 11/07/2021 18:09:44 - INFO - __main__ - Step 148746: {'lr': 8.870499889976347e-08, 'samples': 28559232, 'steps': 148745, 'loss/train': 1.5181083679199219} 11/07/2021 18:09:44 - INFO - __main__ - Step 148747: {'lr': 8.85637010151208e-08, 'samples': 28559424, 'steps': 148746, 'loss/train': 1.0929919481277466} 11/07/2021 18:09:44 - INFO - __main__ - Step 148748: {'lr': 8.842251573670802e-08, 'samples': 28559616, 'steps': 148747, 'loss/train': 1.085196614265442} 11/07/2021 18:09:46 - INFO - __main__ - Step 148749: {'lr': 8.828144306460839e-08, 'samples': 28559808, 'steps': 148748, 'loss/train': 1.1787524223327637} 11/07/2021 18:09:46 - INFO - __main__ - Step 148750: {'lr': 8.814048299884969e-08, 'samples': 28560000, 'steps': 148749, 'loss/train': 0.5369827151298523} 11/07/2021 18:09:46 - INFO - __main__ - Step 148751: {'lr': 8.799963553954293e-08, 'samples': 28560192, 'steps': 148750, 'loss/train': 1.5322593450546265} 11/07/2021 18:09:47 - INFO - __main__ - Step 148752: {'lr': 8.785890068671587e-08, 'samples': 28560384, 'steps': 148751, 'loss/train': 1.409613847732544} 11/07/2021 18:09:47 - INFO - __main__ - Step 148753: {'lr': 8.771827844042401e-08, 'samples': 28560576, 'steps': 148752, 'loss/train': 1.415947675704956} 11/07/2021 18:09:48 - INFO - __main__ - Step 148754: {'lr': 8.75777688007784e-08, 'samples': 28560768, 'steps': 148753, 'loss/train': 1.0911470651626587} 11/07/2021 18:09:48 - INFO - __main__ - Step 148755: {'lr': 8.743737176780675e-08, 'samples': 28560960, 'steps': 148754, 'loss/train': 1.506801724433899} 11/07/2021 18:09:49 - INFO - __main__ - Step 148756: {'lr': 8.729708734156461e-08, 'samples': 28561152, 'steps': 148755, 'loss/train': 1.4262328147888184} 11/07/2021 18:09:49 - INFO - __main__ - Step 148757: {'lr': 8.715691552216298e-08, 'samples': 28561344, 'steps': 148756, 'loss/train': 1.3713430166244507} 11/07/2021 18:09:49 - INFO - __main__ - Step 148758: {'lr': 8.701685630960188e-08, 'samples': 28561536, 'steps': 148757, 'loss/train': 1.4467893838882446} 11/07/2021 18:09:50 - INFO - __main__ - Step 148759: {'lr': 8.687690970399231e-08, 'samples': 28561728, 'steps': 148758, 'loss/train': 1.7057439088821411} 11/07/2021 18:09:51 - INFO - __main__ - Step 148760: {'lr': 8.673707570536204e-08, 'samples': 28561920, 'steps': 148759, 'loss/train': 0.9925828576087952} 11/07/2021 18:09:52 - INFO - __main__ - Step 148761: {'lr': 8.659735431379435e-08, 'samples': 28562112, 'steps': 148760, 'loss/train': 1.349719524383545} 11/07/2021 18:09:52 - INFO - __main__ - Step 148762: {'lr': 8.645774552937246e-08, 'samples': 28562304, 'steps': 148761, 'loss/train': 0.8789302110671997} 11/07/2021 18:09:52 - INFO - __main__ - Step 148763: {'lr': 8.631824935212418e-08, 'samples': 28562496, 'steps': 148762, 'loss/train': 1.3208413124084473} 11/07/2021 18:09:53 - INFO - __main__ - Step 148764: {'lr': 8.6178865782105e-08, 'samples': 28562688, 'steps': 148763, 'loss/train': 1.408480167388916} 11/07/2021 18:09:53 - INFO - __main__ - Step 148765: {'lr': 8.603959481942591e-08, 'samples': 28562880, 'steps': 148764, 'loss/train': 0.34206968545913696} 11/07/2021 18:09:54 - INFO - __main__ - Step 148766: {'lr': 8.590043646408696e-08, 'samples': 28563072, 'steps': 148765, 'loss/train': 1.104207158088684} 11/07/2021 18:09:54 - INFO - __main__ - Step 148767: {'lr': 8.57613907162269e-08, 'samples': 28563264, 'steps': 148766, 'loss/train': 1.037894606590271} 11/07/2021 18:09:55 - INFO - __main__ - Step 148768: {'lr': 8.562245757584574e-08, 'samples': 28563456, 'steps': 148767, 'loss/train': 1.3610221147537231} 11/07/2021 18:09:55 - INFO - __main__ - Step 148769: {'lr': 8.548363704302675e-08, 'samples': 28563648, 'steps': 148768, 'loss/train': 0.6129887700080872} 11/07/2021 18:09:55 - INFO - __main__ - Step 148770: {'lr': 8.534492911782543e-08, 'samples': 28563840, 'steps': 148769, 'loss/train': 1.2829968929290771} 11/07/2021 18:09:56 - INFO - __main__ - Step 148771: {'lr': 8.520633380032505e-08, 'samples': 28564032, 'steps': 148770, 'loss/train': 1.588966727256775} 11/07/2021 18:09:57 - INFO - __main__ - Step 148772: {'lr': 8.506785109055337e-08, 'samples': 28564224, 'steps': 148771, 'loss/train': 1.0194514989852905} 11/07/2021 18:09:57 - INFO - __main__ - Step 148773: {'lr': 8.492948098859366e-08, 'samples': 28564416, 'steps': 148772, 'loss/train': 0.8492699265480042} 11/07/2021 18:09:57 - INFO - __main__ - Step 148774: {'lr': 8.479122349452917e-08, 'samples': 28564608, 'steps': 148773, 'loss/train': 1.0546016693115234} 11/07/2021 18:09:58 - INFO - __main__ - Step 148775: {'lr': 8.465307860838766e-08, 'samples': 28564800, 'steps': 148774, 'loss/train': 1.2888312339782715} 11/07/2021 18:09:59 - INFO - __main__ - Step 148776: {'lr': 8.451504633025242e-08, 'samples': 28564992, 'steps': 148775, 'loss/train': 1.1694552898406982} 11/07/2021 18:09:59 - INFO - __main__ - Step 148777: {'lr': 8.437712666017894e-08, 'samples': 28565184, 'steps': 148776, 'loss/train': 1.3636283874511719} 11/07/2021 18:10:00 - INFO - __main__ - Step 148778: {'lr': 8.423931959822273e-08, 'samples': 28565376, 'steps': 148777, 'loss/train': 1.1762447357177734} 11/07/2021 18:10:00 - INFO - __main__ - Step 148779: {'lr': 8.410162514446706e-08, 'samples': 28565568, 'steps': 148778, 'loss/train': 1.600640058517456} 11/07/2021 18:10:00 - INFO - __main__ - Step 148780: {'lr': 8.396404329893969e-08, 'samples': 28565760, 'steps': 148779, 'loss/train': 1.214613676071167} 11/07/2021 18:10:01 - INFO - __main__ - Step 148781: {'lr': 8.382657406175164e-08, 'samples': 28565952, 'steps': 148780, 'loss/train': 1.448639988899231} 11/07/2021 18:10:02 - INFO - __main__ - Step 148782: {'lr': 8.368921743290292e-08, 'samples': 28566144, 'steps': 148781, 'loss/train': 1.497905969619751} 11/07/2021 18:10:02 - INFO - __main__ - Step 148783: {'lr': 8.355197341250453e-08, 'samples': 28566336, 'steps': 148782, 'loss/train': 1.1480594873428345} 11/07/2021 18:10:02 - INFO - __main__ - Step 148784: {'lr': 8.341484200058425e-08, 'samples': 28566528, 'steps': 148783, 'loss/train': 0.8295915722846985} 11/07/2021 18:10:03 - INFO - __main__ - Step 148785: {'lr': 8.327782319722533e-08, 'samples': 28566720, 'steps': 148784, 'loss/train': 1.2760202884674072} 11/07/2021 18:10:04 - INFO - __main__ - Step 148786: {'lr': 8.314091700248327e-08, 'samples': 28566912, 'steps': 148785, 'loss/train': 0.9244252443313599} 11/07/2021 18:10:04 - INFO - __main__ - Step 148787: {'lr': 8.300412341644137e-08, 'samples': 28567104, 'steps': 148786, 'loss/train': 1.7964880466461182} 11/07/2021 18:10:05 - INFO - __main__ - Step 148788: {'lr': 8.286744243912736e-08, 'samples': 28567296, 'steps': 148787, 'loss/train': 1.389471173286438} 11/07/2021 18:10:05 - INFO - __main__ - Step 148789: {'lr': 8.273087407062452e-08, 'samples': 28567488, 'steps': 148788, 'loss/train': 0.8442835211753845} 11/07/2021 18:10:05 - INFO - __main__ - Step 148790: {'lr': 8.259441831096059e-08, 'samples': 28567680, 'steps': 148789, 'loss/train': 1.49545419216156} 11/07/2021 18:10:06 - INFO - __main__ - Step 148791: {'lr': 8.245807516024662e-08, 'samples': 28567872, 'steps': 148790, 'loss/train': 1.8343287706375122} 11/07/2021 18:10:07 - INFO - __main__ - Step 148792: {'lr': 8.232184461853808e-08, 'samples': 28568064, 'steps': 148791, 'loss/train': 1.6662951707839966} 11/07/2021 18:10:07 - INFO - __main__ - Step 148793: {'lr': 8.218572668583501e-08, 'samples': 28568256, 'steps': 148792, 'loss/train': 1.3281669616699219} 11/07/2021 18:10:07 - INFO - __main__ - Step 148794: {'lr': 8.204972136227618e-08, 'samples': 28568448, 'steps': 148793, 'loss/train': 0.8156861662864685} 11/07/2021 18:10:08 - INFO - __main__ - Step 148795: {'lr': 8.191382864788932e-08, 'samples': 28568640, 'steps': 148794, 'loss/train': 0.9032496809959412} 11/07/2021 18:10:08 - INFO - __main__ - Step 148796: {'lr': 8.177804854270221e-08, 'samples': 28568832, 'steps': 148795, 'loss/train': 0.9691367149353027} 11/07/2021 18:10:09 - INFO - __main__ - Step 148797: {'lr': 8.164238104685362e-08, 'samples': 28569024, 'steps': 148796, 'loss/train': 1.6455273628234863} 11/07/2021 18:10:09 - INFO - __main__ - Step 148798: {'lr': 8.150682616031579e-08, 'samples': 28569216, 'steps': 148797, 'loss/train': 1.5512011051177979} 11/07/2021 18:10:10 - INFO - __main__ - Step 148799: {'lr': 8.13713838832275e-08, 'samples': 28569408, 'steps': 148798, 'loss/train': 1.3243300914764404} 11/07/2021 18:10:10 - INFO - __main__ - Step 148800: {'lr': 8.123605421561652e-08, 'samples': 28569600, 'steps': 148799, 'loss/train': 1.5344358682632446} 11/07/2021 18:10:11 - INFO - __main__ - Step 148801: {'lr': 8.11008371575106e-08, 'samples': 28569792, 'steps': 148800, 'loss/train': 1.0854352712631226} 11/07/2021 18:10:12 - INFO - __main__ - Step 148802: {'lr': 8.09657327090485e-08, 'samples': 28569984, 'steps': 148801, 'loss/train': 1.2436658143997192} 11/07/2021 18:10:12 - INFO - __main__ - Step 148803: {'lr': 8.083074087023023e-08, 'samples': 28570176, 'steps': 148802, 'loss/train': 1.1916404962539673} 11/07/2021 18:10:12 - INFO - __main__ - Step 148804: {'lr': 8.069586164111132e-08, 'samples': 28570368, 'steps': 148803, 'loss/train': 1.526277780532837} 11/07/2021 18:10:13 - INFO - __main__ - Step 148805: {'lr': 8.056109502180275e-08, 'samples': 28570560, 'steps': 148804, 'loss/train': 1.1592347621917725} 11/07/2021 18:10:13 - INFO - __main__ - Step 148806: {'lr': 8.042644101233232e-08, 'samples': 28570752, 'steps': 148805, 'loss/train': 0.6720967292785645} 11/07/2021 18:10:14 - INFO - __main__ - Step 148807: {'lr': 8.029189961275551e-08, 'samples': 28570944, 'steps': 148806, 'loss/train': 1.5000536441802979} 11/07/2021 18:10:15 - INFO - __main__ - Step 148808: {'lr': 8.015747082312786e-08, 'samples': 28571136, 'steps': 148807, 'loss/train': 1.0577431917190552} 11/07/2021 18:10:15 - INFO - __main__ - Step 148809: {'lr': 8.002315464356036e-08, 'samples': 28571328, 'steps': 148808, 'loss/train': 1.7181596755981445} 11/07/2021 18:10:15 - INFO - __main__ - Step 148810: {'lr': 7.988895107405302e-08, 'samples': 28571520, 'steps': 148809, 'loss/train': 1.461071491241455} 11/07/2021 18:10:16 - INFO - __main__ - Step 148811: {'lr': 7.975486011468913e-08, 'samples': 28571712, 'steps': 148810, 'loss/train': 1.7342860698699951} 11/07/2021 18:10:16 - INFO - __main__ - Step 148812: {'lr': 7.962088176555194e-08, 'samples': 28571904, 'steps': 148811, 'loss/train': 0.8028384447097778} 11/07/2021 18:10:17 - INFO - __main__ - Step 148813: {'lr': 7.948701602666918e-08, 'samples': 28572096, 'steps': 148812, 'loss/train': 1.3174967765808105} 11/07/2021 18:10:17 - INFO - __main__ - Step 148814: {'lr': 7.935326289812417e-08, 'samples': 28572288, 'steps': 148813, 'loss/train': 1.201812505722046} 11/07/2021 18:10:18 - INFO - __main__ - Step 148815: {'lr': 7.921962237994462e-08, 'samples': 28572480, 'steps': 148814, 'loss/train': 1.2191417217254639} 11/07/2021 18:10:18 - INFO - __main__ - Step 148816: {'lr': 7.908609447221382e-08, 'samples': 28572672, 'steps': 148815, 'loss/train': 1.6809436082839966} 11/07/2021 18:10:18 - INFO - __main__ - Step 148817: {'lr': 7.895267917501503e-08, 'samples': 28572864, 'steps': 148816, 'loss/train': 1.2024269104003906} 11/07/2021 18:10:20 - INFO - __main__ - Step 148818: {'lr': 7.881937648834824e-08, 'samples': 28573056, 'steps': 148817, 'loss/train': 0.9127306938171387} 11/07/2021 18:10:20 - INFO - __main__ - Step 148819: {'lr': 7.868618641235226e-08, 'samples': 28573248, 'steps': 148818, 'loss/train': 1.5461058616638184} 11/07/2021 18:10:20 - INFO - __main__ - Step 148820: {'lr': 7.85531089469993e-08, 'samples': 28573440, 'steps': 148819, 'loss/train': 1.6116881370544434} 11/07/2021 18:10:21 - INFO - __main__ - Step 148821: {'lr': 7.842014409242815e-08, 'samples': 28573632, 'steps': 148820, 'loss/train': 1.4676973819732666} 11/07/2021 18:10:21 - INFO - __main__ - Step 148822: {'lr': 7.82872918486388e-08, 'samples': 28573824, 'steps': 148821, 'loss/train': 1.201392412185669} 11/07/2021 18:10:21 - INFO - __main__ - Step 148823: {'lr': 7.81545522157423e-08, 'samples': 28574016, 'steps': 148822, 'loss/train': 0.9769932627677917} 11/07/2021 18:10:22 - INFO - __main__ - Step 148824: {'lr': 7.802192519376638e-08, 'samples': 28574208, 'steps': 148823, 'loss/train': 1.7027205228805542} 11/07/2021 18:10:23 - INFO - __main__ - Step 148825: {'lr': 7.788941078276657e-08, 'samples': 28574400, 'steps': 148824, 'loss/train': 0.8631606698036194} 11/07/2021 18:10:23 - INFO - __main__ - Step 148826: {'lr': 7.775700898279835e-08, 'samples': 28574592, 'steps': 148825, 'loss/train': 1.1846579313278198} 11/07/2021 18:10:23 - INFO - __main__ - Step 148827: {'lr': 7.762471979397279e-08, 'samples': 28574784, 'steps': 148826, 'loss/train': 0.8880283832550049} 11/07/2021 18:10:24 - INFO - __main__ - Step 148828: {'lr': 7.749254321628985e-08, 'samples': 28574976, 'steps': 148827, 'loss/train': 0.5495407581329346} 11/07/2021 18:10:25 - INFO - __main__ - Step 148829: {'lr': 7.736047924983281e-08, 'samples': 28575168, 'steps': 148828, 'loss/train': 1.1411412954330444} 11/07/2021 18:10:25 - INFO - __main__ - Step 148830: {'lr': 7.722852789465718e-08, 'samples': 28575360, 'steps': 148829, 'loss/train': 1.6370164155960083} 11/07/2021 18:10:26 - INFO - __main__ - Step 148831: {'lr': 7.709668915084622e-08, 'samples': 28575552, 'steps': 148830, 'loss/train': 1.3307387828826904} 11/07/2021 18:10:26 - INFO - __main__ - Step 148832: {'lr': 7.69649630184277e-08, 'samples': 28575744, 'steps': 148831, 'loss/train': 0.9629783034324646} 11/07/2021 18:10:26 - INFO - __main__ - Step 148833: {'lr': 7.683334949745712e-08, 'samples': 28575936, 'steps': 148832, 'loss/train': 1.401442289352417} 11/07/2021 18:10:27 - INFO - __main__ - Step 148834: {'lr': 7.670184858804552e-08, 'samples': 28576128, 'steps': 148833, 'loss/train': 1.3765817880630493} 11/07/2021 18:10:28 - INFO - __main__ - Step 148835: {'lr': 7.657046029019288e-08, 'samples': 28576320, 'steps': 148834, 'loss/train': 1.455342411994934} 11/07/2021 18:10:28 - INFO - __main__ - Step 148836: {'lr': 7.643918460398247e-08, 'samples': 28576512, 'steps': 148835, 'loss/train': 1.270567774772644} 11/07/2021 18:10:28 - INFO - __main__ - Step 148837: {'lr': 7.630802152946981e-08, 'samples': 28576704, 'steps': 148836, 'loss/train': 1.0758569240570068} 11/07/2021 18:10:29 - INFO - __main__ - Step 148838: {'lr': 7.61769710667104e-08, 'samples': 28576896, 'steps': 148837, 'loss/train': 1.0134087800979614} 11/07/2021 18:10:30 - INFO - __main__ - Step 148839: {'lr': 7.604603321575976e-08, 'samples': 28577088, 'steps': 148838, 'loss/train': 1.4976134300231934} 11/07/2021 18:10:30 - INFO - __main__ - Step 148840: {'lr': 7.591520797670116e-08, 'samples': 28577280, 'steps': 148839, 'loss/train': 0.9178491234779358} 11/07/2021 18:10:30 - INFO - __main__ - Step 148841: {'lr': 7.578449534959009e-08, 'samples': 28577472, 'steps': 148840, 'loss/train': 1.034023642539978} 11/07/2021 18:10:31 - INFO - __main__ - Step 148842: {'lr': 7.565389533445432e-08, 'samples': 28577664, 'steps': 148841, 'loss/train': 1.323042631149292} 11/07/2021 18:10:31 - INFO - __main__ - Step 148843: {'lr': 7.552340793140488e-08, 'samples': 28577856, 'steps': 148842, 'loss/train': 1.4962459802627563} 11/07/2021 18:10:31 - INFO - __main__ - Step 148844: {'lr': 7.539303314044177e-08, 'samples': 28578048, 'steps': 148843, 'loss/train': 1.2557777166366577} 11/07/2021 18:10:33 - INFO - __main__ - Step 148845: {'lr': 7.526277096164825e-08, 'samples': 28578240, 'steps': 148844, 'loss/train': 0.722313404083252} 11/07/2021 18:10:33 - INFO - __main__ - Step 148846: {'lr': 7.513262139507982e-08, 'samples': 28578432, 'steps': 148845, 'loss/train': 1.4839789867401123} 11/07/2021 18:10:33 - INFO - __main__ - Step 148847: {'lr': 7.500258444081976e-08, 'samples': 28578624, 'steps': 148846, 'loss/train': 1.0135327577590942} 11/07/2021 18:10:34 - INFO - __main__ - Step 148848: {'lr': 7.487266009889582e-08, 'samples': 28578816, 'steps': 148847, 'loss/train': 1.6581391096115112} 11/07/2021 18:10:34 - INFO - __main__ - Step 148849: {'lr': 7.474284836936351e-08, 'samples': 28579008, 'steps': 148848, 'loss/train': 1.2437288761138916} 11/07/2021 18:10:35 - INFO - __main__ - Step 148850: {'lr': 7.461314925233387e-08, 'samples': 28579200, 'steps': 148849, 'loss/train': 1.3827418088912964} 11/07/2021 18:10:35 - INFO - __main__ - Step 148851: {'lr': 7.448356274777912e-08, 'samples': 28579392, 'steps': 148850, 'loss/train': 1.2322770357131958} 11/07/2021 18:10:36 - INFO - __main__ - Step 148852: {'lr': 7.435408885583806e-08, 'samples': 28579584, 'steps': 148851, 'loss/train': 1.1153931617736816} 11/07/2021 18:10:36 - INFO - __main__ - Step 148853: {'lr': 7.422472757653842e-08, 'samples': 28579776, 'steps': 148852, 'loss/train': 1.276049017906189} 11/07/2021 18:10:36 - INFO - __main__ - Step 148854: {'lr': 7.409547890993573e-08, 'samples': 28579968, 'steps': 148853, 'loss/train': 1.4747029542922974} 11/07/2021 18:10:38 - INFO - __main__ - Step 148855: {'lr': 7.396634285605775e-08, 'samples': 28580160, 'steps': 148854, 'loss/train': 1.2452996969223022} 11/07/2021 18:10:38 - INFO - __main__ - Step 148856: {'lr': 7.383731941501549e-08, 'samples': 28580352, 'steps': 148855, 'loss/train': 0.5957202911376953} 11/07/2021 18:10:38 - INFO - __main__ - Step 148857: {'lr': 7.370840858686445e-08, 'samples': 28580544, 'steps': 148856, 'loss/train': 1.1492209434509277} 11/07/2021 18:10:39 - INFO - __main__ - Step 148858: {'lr': 7.357961037160466e-08, 'samples': 28580736, 'steps': 148857, 'loss/train': 1.4706668853759766} 11/07/2021 18:10:39 - INFO - __main__ - Step 148859: {'lr': 7.345092476937488e-08, 'samples': 28580928, 'steps': 148858, 'loss/train': 1.9867645502090454} 11/07/2021 18:10:40 - INFO - __main__ - Step 148860: {'lr': 7.332235178014735e-08, 'samples': 28581120, 'steps': 148859, 'loss/train': 1.0144908428192139} 11/07/2021 18:10:40 - INFO - __main__ - Step 148861: {'lr': 7.319389140406086e-08, 'samples': 28581312, 'steps': 148860, 'loss/train': 1.0621168613433838} 11/07/2021 18:10:41 - INFO - __main__ - Step 148862: {'lr': 7.30655436411154e-08, 'samples': 28581504, 'steps': 148861, 'loss/train': 1.2095894813537598} 11/07/2021 18:10:41 - INFO - __main__ - Step 148863: {'lr': 7.293730849139425e-08, 'samples': 28581696, 'steps': 148862, 'loss/train': 1.1423004865646362} 11/07/2021 18:10:41 - INFO - __main__ - Step 148864: {'lr': 7.28091859549529e-08, 'samples': 28581888, 'steps': 148863, 'loss/train': 1.2689237594604492} 11/07/2021 18:10:42 - INFO - __main__ - Step 148865: {'lr': 7.268117603187463e-08, 'samples': 28582080, 'steps': 148864, 'loss/train': 1.160699725151062} 11/07/2021 18:10:43 - INFO - __main__ - Step 148866: {'lr': 7.255327872215945e-08, 'samples': 28582272, 'steps': 148865, 'loss/train': 1.4826539754867554} 11/07/2021 18:10:43 - INFO - __main__ - Step 148867: {'lr': 7.24254940258906e-08, 'samples': 28582464, 'steps': 148866, 'loss/train': 1.0628329515457153} 11/07/2021 18:10:43 - INFO - __main__ - Step 148868: {'lr': 7.229782194315138e-08, 'samples': 28582656, 'steps': 148867, 'loss/train': 1.1238059997558594} 11/07/2021 18:10:44 - INFO - __main__ - Step 148869: {'lr': 7.217026247396952e-08, 'samples': 28582848, 'steps': 148868, 'loss/train': 1.2660002708435059} 11/07/2021 18:10:44 - INFO - __main__ - Step 148870: {'lr': 7.204281561840054e-08, 'samples': 28583040, 'steps': 148869, 'loss/train': 1.5128798484802246} 11/07/2021 18:10:45 - INFO - __main__ - Step 148871: {'lr': 7.191548137649994e-08, 'samples': 28583232, 'steps': 148870, 'loss/train': 1.1859411001205444} 11/07/2021 18:10:46 - INFO - __main__ - Step 148872: {'lr': 7.178825974837878e-08, 'samples': 28583424, 'steps': 148871, 'loss/train': 1.6615225076675415} 11/07/2021 18:10:46 - INFO - __main__ - Step 148873: {'lr': 7.166115073400925e-08, 'samples': 28583616, 'steps': 148872, 'loss/train': 2.008755922317505} 11/07/2021 18:10:46 - INFO - __main__ - Step 148874: {'lr': 7.153415433353017e-08, 'samples': 28583808, 'steps': 148873, 'loss/train': 0.7911688685417175} 11/07/2021 18:10:47 - INFO - __main__ - Step 148875: {'lr': 7.140727054694152e-08, 'samples': 28584000, 'steps': 148874, 'loss/train': 1.3390235900878906} 11/07/2021 18:10:48 - INFO - __main__ - Step 148876: {'lr': 7.128049937432657e-08, 'samples': 28584192, 'steps': 148875, 'loss/train': 0.764175534248352} 11/07/2021 18:10:48 - INFO - __main__ - Step 148877: {'lr': 7.115384081574083e-08, 'samples': 28584384, 'steps': 148876, 'loss/train': 1.1886417865753174} 11/07/2021 18:10:48 - INFO - __main__ - Step 148878: {'lr': 7.102729487121207e-08, 'samples': 28584576, 'steps': 148877, 'loss/train': 1.2597310543060303} 11/07/2021 18:10:49 - INFO - __main__ - Step 148879: {'lr': 7.09008615408513e-08, 'samples': 28584768, 'steps': 148878, 'loss/train': 0.991337239742279} 11/07/2021 18:10:49 - INFO - __main__ - Step 148880: {'lr': 7.077454082468627e-08, 'samples': 28584960, 'steps': 148879, 'loss/train': 1.2030435800552368} 11/07/2021 18:10:50 - INFO - __main__ - Step 148881: {'lr': 7.064833272274473e-08, 'samples': 28585152, 'steps': 148880, 'loss/train': 1.9987186193466187} 11/07/2021 18:10:51 - INFO - __main__ - Step 148882: {'lr': 7.052223723513774e-08, 'samples': 28585344, 'steps': 148881, 'loss/train': 1.300833821296692} 11/07/2021 18:10:51 - INFO - __main__ - Step 148883: {'lr': 7.039625436189301e-08, 'samples': 28585536, 'steps': 148882, 'loss/train': 1.3196601867675781} 11/07/2021 18:10:51 - INFO - __main__ - Step 148884: {'lr': 7.027038410306608e-08, 'samples': 28585728, 'steps': 148883, 'loss/train': 1.0403189659118652} 11/07/2021 18:10:52 - INFO - __main__ - Step 148885: {'lr': 7.014462645871244e-08, 'samples': 28585920, 'steps': 148884, 'loss/train': 1.565974473953247} 11/07/2021 18:10:52 - INFO - __main__ - Step 148886: {'lr': 7.001898142888763e-08, 'samples': 28586112, 'steps': 148885, 'loss/train': 1.48067045211792} 11/07/2021 18:10:53 - INFO - __main__ - Step 148887: {'lr': 6.989344901367489e-08, 'samples': 28586304, 'steps': 148886, 'loss/train': 1.1139601469039917} 11/07/2021 18:10:53 - INFO - __main__ - Step 148888: {'lr': 6.9768029213102e-08, 'samples': 28586496, 'steps': 148887, 'loss/train': 1.1763653755187988} 11/07/2021 18:10:54 - INFO - __main__ - Step 148889: {'lr': 6.96427220272522e-08, 'samples': 28586688, 'steps': 148888, 'loss/train': 1.5959471464157104} 11/07/2021 18:10:54 - INFO - __main__ - Step 148890: {'lr': 6.951752745615325e-08, 'samples': 28586880, 'steps': 148889, 'loss/train': 1.048215627670288} 11/07/2021 18:10:55 - INFO - __main__ - Step 148891: {'lr': 6.939244549986068e-08, 'samples': 28587072, 'steps': 148890, 'loss/train': 1.600671648979187} 11/07/2021 18:10:56 - INFO - __main__ - Step 148892: {'lr': 6.926747615845775e-08, 'samples': 28587264, 'steps': 148891, 'loss/train': 0.5623737573623657} 11/07/2021 18:10:56 - INFO - __main__ - Step 148893: {'lr': 6.91426194319722e-08, 'samples': 28587456, 'steps': 148892, 'loss/train': 1.6087892055511475} 11/07/2021 18:10:56 - INFO - __main__ - Step 148894: {'lr': 6.901787532048732e-08, 'samples': 28587648, 'steps': 148893, 'loss/train': 1.097593903541565} 11/07/2021 18:10:57 - INFO - __main__ - Step 148895: {'lr': 6.88932438240586e-08, 'samples': 28587840, 'steps': 148894, 'loss/train': 1.4173717498779297} 11/07/2021 18:10:57 - INFO - __main__ - Step 148896: {'lr': 6.87687249427138e-08, 'samples': 28588032, 'steps': 148895, 'loss/train': 1.2110246419906616} 11/07/2021 18:10:58 - INFO - __main__ - Step 148897: {'lr': 6.864431867650844e-08, 'samples': 28588224, 'steps': 148896, 'loss/train': 1.3648629188537598} 11/07/2021 18:10:58 - INFO - __main__ - Step 148898: {'lr': 6.852002502555355e-08, 'samples': 28588416, 'steps': 148897, 'loss/train': 1.2289466857910156} 11/07/2021 18:10:59 - INFO - __main__ - Step 148899: {'lr': 6.839584398982135e-08, 'samples': 28588608, 'steps': 148898, 'loss/train': 1.0917314291000366} 11/07/2021 18:10:59 - INFO - __main__ - Step 148900: {'lr': 6.827177556945064e-08, 'samples': 28588800, 'steps': 148899, 'loss/train': 1.229591727256775} 11/07/2021 18:10:59 - INFO - __main__ - Step 148901: {'lr': 6.81478197644414e-08, 'samples': 28588992, 'steps': 148900, 'loss/train': 1.38480544090271} 11/07/2021 18:11:00 - INFO - __main__ - Step 148902: {'lr': 6.802397657487691e-08, 'samples': 28589184, 'steps': 148901, 'loss/train': 1.274815559387207} 11/07/2021 18:11:01 - INFO - __main__ - Step 148903: {'lr': 6.790024600081269e-08, 'samples': 28589376, 'steps': 148902, 'loss/train': 1.8279787302017212} 11/07/2021 18:11:01 - INFO - __main__ - Step 148904: {'lr': 6.777662804227647e-08, 'samples': 28589568, 'steps': 148903, 'loss/train': 1.0150734186172485} 11/07/2021 18:11:02 - INFO - __main__ - Step 148905: {'lr': 6.765312269935153e-08, 'samples': 28589760, 'steps': 148904, 'loss/train': 0.39016297459602356} 11/07/2021 18:11:02 - INFO - __main__ - Step 148906: {'lr': 6.752972997209338e-08, 'samples': 28589952, 'steps': 148905, 'loss/train': 1.382609486579895} 11/07/2021 18:11:02 - INFO - __main__ - Step 148907: {'lr': 6.74064498605298e-08, 'samples': 28590144, 'steps': 148906, 'loss/train': 0.7677545547485352} 11/07/2021 18:11:03 - INFO - __main__ - Step 148908: {'lr': 6.728328236474402e-08, 'samples': 28590336, 'steps': 148907, 'loss/train': 1.7500196695327759} 11/07/2021 18:11:04 - INFO - __main__ - Step 148909: {'lr': 6.716022748479156e-08, 'samples': 28590528, 'steps': 148908, 'loss/train': 0.9221975803375244} 11/07/2021 18:11:04 - INFO - __main__ - Step 148910: {'lr': 6.703728522072794e-08, 'samples': 28590720, 'steps': 148909, 'loss/train': 1.2653045654296875} 11/07/2021 18:11:04 - INFO - __main__ - Step 148911: {'lr': 6.691445557258091e-08, 'samples': 28590912, 'steps': 148910, 'loss/train': 1.4177390336990356} 11/07/2021 18:11:05 - INFO - __main__ - Step 148912: {'lr': 6.679173854043374e-08, 'samples': 28591104, 'steps': 148911, 'loss/train': 1.1072721481323242} 11/07/2021 18:11:06 - INFO - __main__ - Step 148913: {'lr': 6.666913412434194e-08, 'samples': 28591296, 'steps': 148912, 'loss/train': 1.525580644607544} 11/07/2021 18:11:06 - INFO - __main__ - Step 148914: {'lr': 6.654664232433328e-08, 'samples': 28591488, 'steps': 148913, 'loss/train': 1.4854227304458618} 11/07/2021 18:11:07 - INFO - __main__ - Step 148915: {'lr': 6.642426314049099e-08, 'samples': 28591680, 'steps': 148914, 'loss/train': 0.43815988302230835} 11/07/2021 18:11:07 - INFO - __main__ - Step 148916: {'lr': 6.630199657287061e-08, 'samples': 28591872, 'steps': 148915, 'loss/train': 1.7018208503723145} 11/07/2021 18:11:07 - INFO - __main__ - Step 148917: {'lr': 6.61798426214999e-08, 'samples': 28592064, 'steps': 148916, 'loss/train': 1.4129382371902466} 11/07/2021 18:11:08 - INFO - __main__ - Step 148918: {'lr': 6.605780128646211e-08, 'samples': 28592256, 'steps': 148917, 'loss/train': 1.1498359441757202} 11/07/2021 18:11:09 - INFO - __main__ - Step 148919: {'lr': 6.593587256781275e-08, 'samples': 28592448, 'steps': 148918, 'loss/train': 0.7643460631370544} 11/07/2021 18:11:09 - INFO - __main__ - Step 148920: {'lr': 6.581405646557959e-08, 'samples': 28592640, 'steps': 148919, 'loss/train': 1.3016918897628784} 11/07/2021 18:11:09 - INFO - __main__ - Step 148921: {'lr': 6.569235297984588e-08, 'samples': 28592832, 'steps': 148920, 'loss/train': 1.4554566144943237} 11/07/2021 18:11:10 - INFO - __main__ - Step 148922: {'lr': 6.557076211063939e-08, 'samples': 28593024, 'steps': 148921, 'loss/train': 1.224259376525879} 11/07/2021 18:11:11 - INFO - __main__ - Step 148923: {'lr': 6.544928385804338e-08, 'samples': 28593216, 'steps': 148922, 'loss/train': 1.288068413734436} 11/07/2021 18:11:11 - INFO - __main__ - Step 148924: {'lr': 6.532791822208561e-08, 'samples': 28593408, 'steps': 148923, 'loss/train': 1.3829190731048584} 11/07/2021 18:11:12 - INFO - __main__ - Step 148925: {'lr': 6.520666520284934e-08, 'samples': 28593600, 'steps': 148924, 'loss/train': 1.0840615034103394} 11/07/2021 18:11:12 - INFO - __main__ - Step 148926: {'lr': 6.508552480036234e-08, 'samples': 28593792, 'steps': 148925, 'loss/train': 1.733709454536438} 11/07/2021 18:11:12 - INFO - __main__ - Step 148927: {'lr': 6.49644970146801e-08, 'samples': 28593984, 'steps': 148926, 'loss/train': 1.3999916315078735} 11/07/2021 18:11:13 - INFO - __main__ - Step 148928: {'lr': 6.48435818458859e-08, 'samples': 28594176, 'steps': 148927, 'loss/train': 1.5697646141052246} 11/07/2021 18:11:14 - INFO - __main__ - Step 148929: {'lr': 6.472277929403525e-08, 'samples': 28594368, 'steps': 148928, 'loss/train': 1.1768028736114502} 11/07/2021 18:11:14 - INFO - __main__ - Step 148930: {'lr': 6.460208935912814e-08, 'samples': 28594560, 'steps': 148929, 'loss/train': 1.1393177509307861} 11/07/2021 18:11:15 - INFO - __main__ - Step 148931: {'lr': 6.448151204127561e-08, 'samples': 28594752, 'steps': 148930, 'loss/train': 1.2101991176605225} 11/07/2021 18:11:15 - INFO - __main__ - Step 148932: {'lr': 6.436104734050541e-08, 'samples': 28594944, 'steps': 148931, 'loss/train': 1.1893961429595947} 11/07/2021 18:11:15 - INFO - __main__ - Step 148933: {'lr': 6.424069525687304e-08, 'samples': 28595136, 'steps': 148932, 'loss/train': 1.255910038948059} 11/07/2021 18:11:16 - INFO - __main__ - Step 148934: {'lr': 6.412045579043402e-08, 'samples': 28595328, 'steps': 148933, 'loss/train': 1.3165311813354492} 11/07/2021 18:11:17 - INFO - __main__ - Step 148935: {'lr': 6.400032894127161e-08, 'samples': 28595520, 'steps': 148934, 'loss/train': 1.4301024675369263} 11/07/2021 18:11:17 - INFO - __main__ - Step 148936: {'lr': 6.388031470938582e-08, 'samples': 28595712, 'steps': 148935, 'loss/train': 1.2448903322219849} 11/07/2021 18:11:17 - INFO - __main__ - Step 148937: {'lr': 6.376041309485992e-08, 'samples': 28595904, 'steps': 148936, 'loss/train': 1.5362969636917114} 11/07/2021 18:11:18 - INFO - __main__ - Step 148938: {'lr': 6.364062409777716e-08, 'samples': 28596096, 'steps': 148937, 'loss/train': 0.7168666124343872} 11/07/2021 18:11:19 - INFO - __main__ - Step 148939: {'lr': 6.352094771813754e-08, 'samples': 28596288, 'steps': 148938, 'loss/train': 1.4821161031723022} 11/07/2021 18:11:19 - INFO - __main__ - Step 148940: {'lr': 6.340138395599659e-08, 'samples': 28596480, 'steps': 148939, 'loss/train': 0.9118808507919312} 11/07/2021 18:11:19 - INFO - __main__ - Step 148941: {'lr': 6.328193281146532e-08, 'samples': 28596672, 'steps': 148940, 'loss/train': 0.471790075302124} 11/07/2021 18:11:20 - INFO - __main__ - Step 148942: {'lr': 6.316259428454374e-08, 'samples': 28596864, 'steps': 148941, 'loss/train': 0.06286223977804184} 11/07/2021 18:11:20 - INFO - __main__ - Step 148943: {'lr': 6.30433683753151e-08, 'samples': 28597056, 'steps': 148942, 'loss/train': 1.9863921403884888} 11/07/2021 18:11:21 - INFO - __main__ - Step 148944: {'lr': 6.292425508383493e-08, 'samples': 28597248, 'steps': 148943, 'loss/train': 1.3684126138687134} 11/07/2021 18:11:22 - INFO - __main__ - Step 148945: {'lr': 6.280525441010321e-08, 'samples': 28597440, 'steps': 148944, 'loss/train': 1.3668323755264282} 11/07/2021 18:11:22 - INFO - __main__ - Step 148946: {'lr': 6.268636635425873e-08, 'samples': 28597632, 'steps': 148945, 'loss/train': 1.1451725959777832} 11/07/2021 18:11:22 - INFO - __main__ - Step 148947: {'lr': 6.256759091627372e-08, 'samples': 28597824, 'steps': 148946, 'loss/train': 1.6736739873886108} 11/07/2021 18:11:23 - INFO - __main__ - Step 148948: {'lr': 6.244892809625924e-08, 'samples': 28598016, 'steps': 148947, 'loss/train': 0.9356997013092041} 11/07/2021 18:11:24 - INFO - __main__ - Step 148949: {'lr': 6.233037789424301e-08, 'samples': 28598208, 'steps': 148948, 'loss/train': 1.2902429103851318} 11/07/2021 18:11:24 - INFO - __main__ - Step 148950: {'lr': 6.22119403103083e-08, 'samples': 28598400, 'steps': 148949, 'loss/train': 1.561801552772522} 11/07/2021 18:11:25 - INFO - __main__ - Step 148951: {'lr': 6.209361534445513e-08, 'samples': 28598592, 'steps': 148950, 'loss/train': 1.1616103649139404} 11/07/2021 18:11:25 - INFO - __main__ - Step 148952: {'lr': 6.197540299676673e-08, 'samples': 28598784, 'steps': 148951, 'loss/train': 0.07259564846754074} 11/07/2021 18:11:25 - INFO - __main__ - Step 148953: {'lr': 6.185730326729866e-08, 'samples': 28598976, 'steps': 148952, 'loss/train': 1.3893027305603027} 11/07/2021 18:11:27 - INFO - __main__ - Step 148954: {'lr': 6.17393161561064e-08, 'samples': 28599168, 'steps': 148953, 'loss/train': 1.3521664142608643} 11/07/2021 18:11:27 - INFO - __main__ - Step 148955: {'lr': 6.162144166324546e-08, 'samples': 28599360, 'steps': 148954, 'loss/train': 1.4170563220977783} 11/07/2021 18:11:27 - INFO - __main__ - Step 148956: {'lr': 6.15036797887436e-08, 'samples': 28599552, 'steps': 148955, 'loss/train': 0.6574588418006897} 11/07/2021 18:11:28 - INFO - __main__ - Step 148957: {'lr': 6.13860305326841e-08, 'samples': 28599744, 'steps': 148956, 'loss/train': 1.4744830131530762} 11/07/2021 18:11:28 - INFO - __main__ - Step 148958: {'lr': 6.12684938950947e-08, 'samples': 28599936, 'steps': 148957, 'loss/train': 1.2218338251113892} 11/07/2021 18:11:29 - INFO - __main__ - Step 148959: {'lr': 6.115106987605868e-08, 'samples': 28600128, 'steps': 148958, 'loss/train': 0.8410100936889648} 11/07/2021 18:11:29 - INFO - __main__ - Step 148960: {'lr': 6.103375847560377e-08, 'samples': 28600320, 'steps': 148959, 'loss/train': 1.225640058517456} 11/07/2021 18:11:30 - INFO - __main__ - Step 148961: {'lr': 6.091655969378551e-08, 'samples': 28600512, 'steps': 148960, 'loss/train': 1.501157283782959} 11/07/2021 18:11:30 - INFO - __main__ - Step 148962: {'lr': 6.07994735306594e-08, 'samples': 28600704, 'steps': 148961, 'loss/train': 1.447305679321289} 11/07/2021 18:11:30 - INFO - __main__ - Step 148963: {'lr': 6.068249998628095e-08, 'samples': 28600896, 'steps': 148962, 'loss/train': 1.817878246307373} 11/07/2021 18:11:31 - INFO - __main__ - Step 148964: {'lr': 6.056563906070566e-08, 'samples': 28601088, 'steps': 148963, 'loss/train': 1.455615520477295} 11/07/2021 18:11:32 - INFO - __main__ - Step 148965: {'lr': 6.044889075398908e-08, 'samples': 28601280, 'steps': 148964, 'loss/train': 1.1250340938568115} 11/07/2021 18:11:32 - INFO - __main__ - Step 148966: {'lr': 6.033225506618666e-08, 'samples': 28601472, 'steps': 148965, 'loss/train': 1.5701473951339722} 11/07/2021 18:11:33 - INFO - __main__ - Step 148967: {'lr': 6.021573199732622e-08, 'samples': 28601664, 'steps': 148966, 'loss/train': 0.10964847356081009} 11/07/2021 18:11:33 - INFO - __main__ - Step 148968: {'lr': 6.009932154746323e-08, 'samples': 28601856, 'steps': 148967, 'loss/train': 0.9126046299934387} 11/07/2021 18:11:33 - INFO - __main__ - Step 148969: {'lr': 5.998302371668096e-08, 'samples': 28602048, 'steps': 148968, 'loss/train': 1.3481453657150269} 11/07/2021 18:11:35 - INFO - __main__ - Step 148970: {'lr': 5.986683850500718e-08, 'samples': 28602240, 'steps': 148969, 'loss/train': 1.362854242324829} 11/07/2021 18:11:35 - INFO - __main__ - Step 148971: {'lr': 5.97507659124974e-08, 'samples': 28602432, 'steps': 148970, 'loss/train': 1.3963425159454346} 11/07/2021 18:11:35 - INFO - __main__ - Step 148972: {'lr': 5.963480593920711e-08, 'samples': 28602624, 'steps': 148971, 'loss/train': 1.0867542028427124} 11/07/2021 18:11:36 - INFO - __main__ - Step 148973: {'lr': 5.9518958585191854e-08, 'samples': 28602816, 'steps': 148972, 'loss/train': 1.3823879957199097} 11/07/2021 18:11:36 - INFO - __main__ - Step 148974: {'lr': 5.9403223850507113e-08, 'samples': 28603008, 'steps': 148973, 'loss/train': 1.0517966747283936} 11/07/2021 18:11:37 - INFO - __main__ - Step 148975: {'lr': 5.928760173518066e-08, 'samples': 28603200, 'steps': 148974, 'loss/train': 0.4591025412082672} 11/07/2021 18:11:37 - INFO - __main__ - Step 148976: {'lr': 5.917209223929576e-08, 'samples': 28603392, 'steps': 148975, 'loss/train': 1.2039976119995117} 11/07/2021 18:11:38 - INFO - __main__ - Step 148977: {'lr': 5.905669536290792e-08, 'samples': 28603584, 'steps': 148976, 'loss/train': 0.8324646949768066} 11/07/2021 18:11:38 - INFO - __main__ - Step 148978: {'lr': 5.894141110601714e-08, 'samples': 28603776, 'steps': 148977, 'loss/train': 1.3801413774490356} 11/07/2021 18:11:38 - INFO - __main__ - Step 148979: {'lr': 5.882623946873444e-08, 'samples': 28603968, 'steps': 148978, 'loss/train': 1.4257560968399048} 11/07/2021 18:11:39 - INFO - __main__ - Step 148980: {'lr': 5.871118045108759e-08, 'samples': 28604160, 'steps': 148979, 'loss/train': 0.047377053648233414} 11/07/2021 18:11:40 - INFO - __main__ - Step 148981: {'lr': 5.8596234053104326e-08, 'samples': 28604352, 'steps': 148980, 'loss/train': 1.1561381816864014} 11/07/2021 18:11:40 - INFO - __main__ - Step 148982: {'lr': 5.848140027489568e-08, 'samples': 28604544, 'steps': 148981, 'loss/train': 1.3419705629348755} 11/07/2021 18:11:41 - INFO - __main__ - Step 148983: {'lr': 5.836667911646165e-08, 'samples': 28604736, 'steps': 148982, 'loss/train': 0.9529098868370056} 11/07/2021 18:11:41 - INFO - __main__ - Step 148984: {'lr': 5.8252070577857755e-08, 'samples': 28604928, 'steps': 148983, 'loss/train': 0.9941876530647278} 11/07/2021 18:11:41 - INFO - __main__ - Step 148985: {'lr': 5.8137574659167244e-08, 'samples': 28605120, 'steps': 148984, 'loss/train': 1.00044584274292} 11/07/2021 18:11:42 - INFO - __main__ - Step 148986: {'lr': 5.802319136041789e-08, 'samples': 28605312, 'steps': 148985, 'loss/train': 1.039579153060913} 11/07/2021 18:11:43 - INFO - __main__ - Step 148987: {'lr': 5.79089206816652e-08, 'samples': 28605504, 'steps': 148986, 'loss/train': 1.7663019895553589} 11/07/2021 18:11:43 - INFO - __main__ - Step 148988: {'lr': 5.779476262296468e-08, 'samples': 28605696, 'steps': 148987, 'loss/train': 1.442034363746643} 11/07/2021 18:11:43 - INFO - __main__ - Step 148989: {'lr': 5.7680717184344086e-08, 'samples': 28605888, 'steps': 148988, 'loss/train': 1.1602768898010254} 11/07/2021 18:11:44 - INFO - __main__ - Step 148990: {'lr': 5.756678436591445e-08, 'samples': 28606080, 'steps': 148989, 'loss/train': 1.0798661708831787} 11/07/2021 18:11:45 - INFO - __main__ - Step 148991: {'lr': 5.7452964167648004e-08, 'samples': 28606272, 'steps': 148990, 'loss/train': 1.4707624912261963} 11/07/2021 18:11:45 - INFO - __main__ - Step 148992: {'lr': 5.7339256589655776e-08, 'samples': 28606464, 'steps': 148991, 'loss/train': 1.403314471244812} 11/07/2021 18:11:46 - INFO - __main__ - Step 148993: {'lr': 5.722566163199328e-08, 'samples': 28606656, 'steps': 148992, 'loss/train': 1.7361503839492798} 11/07/2021 18:11:46 - INFO - __main__ - Step 148994: {'lr': 5.7112179294660504e-08, 'samples': 28606848, 'steps': 148993, 'loss/train': 1.0971252918243408} 11/07/2021 18:11:46 - INFO - __main__ - Step 148995: {'lr': 5.699880957774073e-08, 'samples': 28607040, 'steps': 148994, 'loss/train': 0.5387585759162903} 11/07/2021 18:11:47 - INFO - __main__ - Step 148996: {'lr': 5.688555248126171e-08, 'samples': 28607232, 'steps': 148995, 'loss/train': 1.320007562637329} 11/07/2021 18:11:48 - INFO - __main__ - Step 148997: {'lr': 5.677240800533445e-08, 'samples': 28607424, 'steps': 148996, 'loss/train': 0.9897400736808777} 11/07/2021 18:11:48 - INFO - __main__ - Step 148998: {'lr': 5.6659376149931216e-08, 'samples': 28607616, 'steps': 148997, 'loss/train': 1.6642168760299683} 11/07/2021 18:11:48 - INFO - __main__ - Step 148999: {'lr': 5.6546456915163025e-08, 'samples': 28607808, 'steps': 148998, 'loss/train': 1.2807344198226929} 11/07/2021 18:11:49 - INFO - __main__ - Step 149000: {'lr': 5.643365030105763e-08, 'samples': 28608000, 'steps': 148999, 'loss/train': 1.4914767742156982} 11/07/2021 18:11:50 - INFO - __main__ - Step 149001: {'lr': 5.632095630764278e-08, 'samples': 28608192, 'steps': 149000, 'loss/train': 1.2268697023391724} 11/07/2021 18:11:50 - INFO - __main__ - Step 149002: {'lr': 5.620837493500175e-08, 'samples': 28608384, 'steps': 149001, 'loss/train': 1.1181508302688599} 11/07/2021 18:11:50 - INFO - __main__ - Step 149003: {'lr': 5.609590618319005e-08, 'samples': 28608576, 'steps': 149002, 'loss/train': 2.1207528114318848} 11/07/2021 18:11:51 - INFO - __main__ - Step 149004: {'lr': 5.598355005223543e-08, 'samples': 28608768, 'steps': 149003, 'loss/train': 0.9825608730316162} 11/07/2021 18:11:51 - INFO - __main__ - Step 149005: {'lr': 5.587130654222117e-08, 'samples': 28608960, 'steps': 149004, 'loss/train': 0.6713739633560181} 11/07/2021 18:11:52 - INFO - __main__ - Step 149006: {'lr': 5.5759175653147254e-08, 'samples': 28609152, 'steps': 149005, 'loss/train': 1.4746838808059692} 11/07/2021 18:11:53 - INFO - __main__ - Step 149007: {'lr': 5.564715738509696e-08, 'samples': 28609344, 'steps': 149006, 'loss/train': 1.17788827419281} 11/07/2021 18:11:53 - INFO - __main__ - Step 149008: {'lr': 5.5535251738098036e-08, 'samples': 28609536, 'steps': 149007, 'loss/train': 0.9920785427093506} 11/07/2021 18:11:53 - INFO - __main__ - Step 149009: {'lr': 5.542345871226151e-08, 'samples': 28609728, 'steps': 149008, 'loss/train': 1.2018955945968628} 11/07/2021 18:11:54 - INFO - __main__ - Step 149010: {'lr': 5.531177830755962e-08, 'samples': 28609920, 'steps': 149009, 'loss/train': 1.3261535167694092} 11/07/2021 18:11:55 - INFO - __main__ - Step 149011: {'lr': 5.520021052407564e-08, 'samples': 28610112, 'steps': 149010, 'loss/train': 0.6255766749382019} 11/07/2021 18:11:55 - INFO - __main__ - Step 149012: {'lr': 5.508875536189284e-08, 'samples': 28610304, 'steps': 149011, 'loss/train': 1.4740216732025146} 11/07/2021 18:11:55 - INFO - __main__ - Step 149013: {'lr': 5.497741282101121e-08, 'samples': 28610496, 'steps': 149012, 'loss/train': 1.091124415397644} 11/07/2021 18:11:56 - INFO - __main__ - Step 149014: {'lr': 5.486618290148626e-08, 'samples': 28610688, 'steps': 149013, 'loss/train': 0.6571959853172302} 11/07/2021 18:11:56 - INFO - __main__ - Step 149015: {'lr': 5.4755065603401265e-08, 'samples': 28610880, 'steps': 149014, 'loss/train': 1.1754558086395264} 11/07/2021 18:11:56 - INFO - __main__ - Step 149016: {'lr': 5.464406092678398e-08, 'samples': 28611072, 'steps': 149015, 'loss/train': 1.3294414281845093} 11/07/2021 18:11:58 - INFO - __main__ - Step 149017: {'lr': 5.4533168871662154e-08, 'samples': 28611264, 'steps': 149016, 'loss/train': 1.5009132623672485} 11/07/2021 18:11:58 - INFO - __main__ - Step 149018: {'lr': 5.4422389438146815e-08, 'samples': 28611456, 'steps': 149017, 'loss/train': 1.4946922063827515} 11/07/2021 18:11:58 - INFO - __main__ - Step 149019: {'lr': 5.4311722626237955e-08, 'samples': 28611648, 'steps': 149018, 'loss/train': 1.4787367582321167} 11/07/2021 18:11:59 - INFO - __main__ - Step 149020: {'lr': 5.42011684359911e-08, 'samples': 28611840, 'steps': 149019, 'loss/train': 1.4116652011871338} 11/07/2021 18:11:59 - INFO - __main__ - Step 149021: {'lr': 5.409072686746175e-08, 'samples': 28612032, 'steps': 149020, 'loss/train': 1.2375762462615967} 11/07/2021 18:12:00 - INFO - __main__ - Step 149022: {'lr': 5.398039792073317e-08, 'samples': 28612224, 'steps': 149021, 'loss/train': 0.7496612071990967} 11/07/2021 18:12:00 - INFO - __main__ - Step 149023: {'lr': 5.387018159577761e-08, 'samples': 28612416, 'steps': 149022, 'loss/train': 1.1677589416503906} 11/07/2021 18:12:01 - INFO - __main__ - Step 149024: {'lr': 5.376007789273385e-08, 'samples': 28612608, 'steps': 149023, 'loss/train': 1.3096530437469482} 11/07/2021 18:12:01 - INFO - __main__ - Step 149025: {'lr': 5.365008681157413e-08, 'samples': 28612800, 'steps': 149024, 'loss/train': 1.4519041776657104} 11/07/2021 18:12:01 - INFO - __main__ - Step 149026: {'lr': 5.354020835240947e-08, 'samples': 28612992, 'steps': 149025, 'loss/train': 1.2927242517471313} 11/07/2021 18:12:02 - INFO - __main__ - Step 149027: {'lr': 5.343044251526763e-08, 'samples': 28613184, 'steps': 149026, 'loss/train': 1.358084797859192} 11/07/2021 18:12:03 - INFO - __main__ - Step 149028: {'lr': 5.332078930017637e-08, 'samples': 28613376, 'steps': 149027, 'loss/train': 1.387991189956665} 11/07/2021 18:12:03 - INFO - __main__ - Step 149029: {'lr': 5.321124870719118e-08, 'samples': 28613568, 'steps': 149028, 'loss/train': 0.9710397720336914} 11/07/2021 18:12:03 - INFO - __main__ - Step 149030: {'lr': 5.3101820736395353e-08, 'samples': 28613760, 'steps': 149029, 'loss/train': 1.1965105533599854} 11/07/2021 18:12:04 - INFO - __main__ - Step 149031: {'lr': 5.299250538778888e-08, 'samples': 28613952, 'steps': 149030, 'loss/train': 1.5171293020248413} 11/07/2021 18:12:05 - INFO - __main__ - Step 149032: {'lr': 5.288330266148278e-08, 'samples': 28614144, 'steps': 149031, 'loss/train': 0.11208520084619522} 11/07/2021 18:12:05 - INFO - __main__ - Step 149033: {'lr': 5.27742125574493e-08, 'samples': 28614336, 'steps': 149032, 'loss/train': 1.2015550136566162} 11/07/2021 18:12:06 - INFO - __main__ - Step 149034: {'lr': 5.266523507579946e-08, 'samples': 28614528, 'steps': 149033, 'loss/train': 1.2928496599197388} 11/07/2021 18:12:06 - INFO - __main__ - Step 149035: {'lr': 5.255637021656101e-08, 'samples': 28614720, 'steps': 149034, 'loss/train': 1.3804495334625244} 11/07/2021 18:12:06 - INFO - __main__ - Step 149036: {'lr': 5.2447617979789474e-08, 'samples': 28614912, 'steps': 149035, 'loss/train': 1.3037114143371582} 11/07/2021 18:12:07 - INFO - __main__ - Step 149037: {'lr': 5.233897836554036e-08, 'samples': 28615104, 'steps': 149036, 'loss/train': 1.678751826286316} 11/07/2021 18:12:08 - INFO - __main__ - Step 149038: {'lr': 5.223045137381366e-08, 'samples': 28615296, 'steps': 149037, 'loss/train': 1.2384881973266602} 11/07/2021 18:12:08 - INFO - __main__ - Step 149039: {'lr': 5.21220370047204e-08, 'samples': 28615488, 'steps': 149038, 'loss/train': 1.3714649677276611} 11/07/2021 18:12:08 - INFO - __main__ - Step 149040: {'lr': 5.201373525828834e-08, 'samples': 28615680, 'steps': 149039, 'loss/train': 1.2006441354751587} 11/07/2021 18:12:09 - INFO - __main__ - Step 149041: {'lr': 5.190554613454524e-08, 'samples': 28615872, 'steps': 149040, 'loss/train': 1.298755168914795} 11/07/2021 18:12:10 - INFO - __main__ - Step 149042: {'lr': 5.179746963357434e-08, 'samples': 28616064, 'steps': 149041, 'loss/train': 1.934680461883545} 11/07/2021 18:12:10 - INFO - __main__ - Step 149043: {'lr': 5.168950575537568e-08, 'samples': 28616256, 'steps': 149042, 'loss/train': 0.8421028256416321} 11/07/2021 18:12:11 - INFO - __main__ - Step 149044: {'lr': 5.158165450006025e-08, 'samples': 28616448, 'steps': 149043, 'loss/train': 1.733660340309143} 11/07/2021 18:12:11 - INFO - __main__ - Step 149045: {'lr': 5.147391586762806e-08, 'samples': 28616640, 'steps': 149044, 'loss/train': 2.8752388954162598} 11/07/2021 18:12:11 - INFO - __main__ - Step 149046: {'lr': 5.1366289858162385e-08, 'samples': 28616832, 'steps': 149045, 'loss/train': 1.2755908966064453} 11/07/2021 18:12:12 - INFO - __main__ - Step 149047: {'lr': 5.1258776471663213e-08, 'samples': 28617024, 'steps': 149046, 'loss/train': 0.8443887829780579} 11/07/2021 18:12:13 - INFO - __main__ - Step 149048: {'lr': 5.115137570824158e-08, 'samples': 28617216, 'steps': 149047, 'loss/train': 1.5477452278137207} 11/07/2021 18:12:13 - INFO - __main__ - Step 149049: {'lr': 5.104408756789747e-08, 'samples': 28617408, 'steps': 149048, 'loss/train': 0.7768949270248413} 11/07/2021 18:12:14 - INFO - __main__ - Step 149050: {'lr': 5.093691205071416e-08, 'samples': 28617600, 'steps': 149049, 'loss/train': 1.5531673431396484} 11/07/2021 18:12:14 - INFO - __main__ - Step 149051: {'lr': 5.0829849156691646e-08, 'samples': 28617792, 'steps': 149050, 'loss/train': 1.2289959192276} 11/07/2021 18:12:14 - INFO - __main__ - Step 149052: {'lr': 5.0722898885940947e-08, 'samples': 28617984, 'steps': 149051, 'loss/train': 2.4510064125061035} 11/07/2021 18:12:15 - INFO - __main__ - Step 149053: {'lr': 5.061606123846207e-08, 'samples': 28618176, 'steps': 149052, 'loss/train': 1.153768539428711} 11/07/2021 18:12:16 - INFO - __main__ - Step 149054: {'lr': 5.050933621431053e-08, 'samples': 28618368, 'steps': 149053, 'loss/train': 1.4278360605239868} 11/07/2021 18:12:16 - INFO - __main__ - Step 149055: {'lr': 5.0402723813541826e-08, 'samples': 28618560, 'steps': 149054, 'loss/train': 1.1186773777008057} 11/07/2021 18:12:16 - INFO - __main__ - Step 149056: {'lr': 5.029622403621148e-08, 'samples': 28618752, 'steps': 149055, 'loss/train': 1.8733577728271484} 11/07/2021 18:12:17 - INFO - __main__ - Step 149057: {'lr': 5.0189836882374995e-08, 'samples': 28618944, 'steps': 149056, 'loss/train': 1.191259503364563} 11/07/2021 18:12:17 - INFO - __main__ - Step 149058: {'lr': 5.008356235203237e-08, 'samples': 28619136, 'steps': 149057, 'loss/train': 1.4027611017227173} 11/07/2021 18:12:18 - INFO - __main__ - Step 149059: {'lr': 4.9977400445294644e-08, 'samples': 28619328, 'steps': 149058, 'loss/train': 1.3655023574829102} 11/07/2021 18:12:19 - INFO - __main__ - Step 149060: {'lr': 4.987135116216179e-08, 'samples': 28619520, 'steps': 149059, 'loss/train': 1.0438342094421387} 11/07/2021 18:12:19 - INFO - __main__ - Step 149061: {'lr': 4.97654145027171e-08, 'samples': 28619712, 'steps': 149060, 'loss/train': 1.2859196662902832} 11/07/2021 18:12:19 - INFO - __main__ - Step 149062: {'lr': 4.965959046698831e-08, 'samples': 28619904, 'steps': 149061, 'loss/train': 2.015444755554199} 11/07/2021 18:12:20 - INFO - __main__ - Step 149063: {'lr': 4.955387905503095e-08, 'samples': 28620096, 'steps': 149062, 'loss/train': 0.9508357048034668} 11/07/2021 18:12:21 - INFO - __main__ - Step 149064: {'lr': 4.9448280266872755e-08, 'samples': 28620288, 'steps': 149063, 'loss/train': 1.6514402627944946} 11/07/2021 18:12:21 - INFO - __main__ - Step 149065: {'lr': 4.934279410256925e-08, 'samples': 28620480, 'steps': 149064, 'loss/train': 1.7270375490188599} 11/07/2021 18:12:21 - INFO - __main__ - Step 149066: {'lr': 4.923742056220371e-08, 'samples': 28620672, 'steps': 149065, 'loss/train': 1.1273810863494873} 11/07/2021 18:12:22 - INFO - __main__ - Step 149067: {'lr': 4.9132159645776106e-08, 'samples': 28620864, 'steps': 149066, 'loss/train': 0.9060978889465332} 11/07/2021 18:12:22 - INFO - __main__ - Step 149068: {'lr': 4.902701135334198e-08, 'samples': 28621056, 'steps': 149067, 'loss/train': 1.9325759410858154} 11/07/2021 18:12:23 - INFO - __main__ - Step 149069: {'lr': 4.8921975684984574e-08, 'samples': 28621248, 'steps': 149068, 'loss/train': 1.5933809280395508} 11/07/2021 18:12:24 - INFO - __main__ - Step 149070: {'lr': 4.881705264070391e-08, 'samples': 28621440, 'steps': 149069, 'loss/train': 1.1230521202087402} 11/07/2021 18:12:24 - INFO - __main__ - Step 149071: {'lr': 4.871224222058324e-08, 'samples': 28621632, 'steps': 149070, 'loss/train': 1.1721570491790771} 11/07/2021 18:12:24 - INFO - __main__ - Step 149072: {'lr': 4.860754442465032e-08, 'samples': 28621824, 'steps': 149071, 'loss/train': 1.4769306182861328} 11/07/2021 18:12:25 - INFO - __main__ - Step 149073: {'lr': 4.8502959252960663e-08, 'samples': 28622016, 'steps': 149072, 'loss/train': 0.8292689919471741} 11/07/2021 18:12:26 - INFO - __main__ - Step 149074: {'lr': 4.839848670554203e-08, 'samples': 28622208, 'steps': 149073, 'loss/train': 1.5288103818893433} 11/07/2021 18:12:26 - INFO - __main__ - Step 149075: {'lr': 4.829412678247769e-08, 'samples': 28622400, 'steps': 149074, 'loss/train': 1.2703739404678345} 11/07/2021 18:12:26 - INFO - __main__ - Step 149076: {'lr': 4.818987948379538e-08, 'samples': 28622592, 'steps': 149075, 'loss/train': 1.1406266689300537} 11/07/2021 18:12:27 - INFO - __main__ - Step 149077: {'lr': 4.808574480952288e-08, 'samples': 28622784, 'steps': 149076, 'loss/train': 1.2768232822418213} 11/07/2021 18:12:27 - INFO - __main__ - Step 149078: {'lr': 4.798172275974344e-08, 'samples': 28622976, 'steps': 149077, 'loss/train': 1.3954501152038574} 11/07/2021 18:12:27 - INFO - __main__ - Step 149079: {'lr': 4.7877813334484824e-08, 'samples': 28623168, 'steps': 149078, 'loss/train': 1.4293450117111206} 11/07/2021 18:12:29 - INFO - __main__ - Step 149080: {'lr': 4.777401653380253e-08, 'samples': 28623360, 'steps': 149079, 'loss/train': 1.7694995403289795} 11/07/2021 18:12:29 - INFO - __main__ - Step 149081: {'lr': 4.767033235772433e-08, 'samples': 28623552, 'steps': 149080, 'loss/train': 1.2760121822357178} 11/07/2021 18:12:29 - INFO - __main__ - Step 149082: {'lr': 4.756676080630573e-08, 'samples': 28623744, 'steps': 149081, 'loss/train': 1.611955165863037} 11/07/2021 18:12:30 - INFO - __main__ - Step 149083: {'lr': 4.746330187960224e-08, 'samples': 28623936, 'steps': 149082, 'loss/train': 1.165128469467163} 11/07/2021 18:12:30 - INFO - __main__ - Step 149084: {'lr': 4.7359955577641613e-08, 'samples': 28624128, 'steps': 149083, 'loss/train': 1.4605215787887573} 11/07/2021 18:12:31 - INFO - __main__ - Step 149085: {'lr': 4.7256721900507115e-08, 'samples': 28624320, 'steps': 149084, 'loss/train': 1.223571538925171} 11/07/2021 18:12:32 - INFO - __main__ - Step 149086: {'lr': 4.7153600848198754e-08, 'samples': 28624512, 'steps': 149085, 'loss/train': 0.9998951554298401} 11/07/2021 18:12:32 - INFO - __main__ - Step 149087: {'lr': 4.7050592420799785e-08, 'samples': 28624704, 'steps': 149086, 'loss/train': 0.7170268297195435} 11/07/2021 18:12:32 - INFO - __main__ - Step 149088: {'lr': 4.6947696618337976e-08, 'samples': 28624896, 'steps': 149087, 'loss/train': 0.7242974042892456} 11/07/2021 18:12:33 - INFO - __main__ - Step 149089: {'lr': 4.684491344086883e-08, 'samples': 28625088, 'steps': 149088, 'loss/train': 1.5814898014068604} 11/07/2021 18:12:34 - INFO - __main__ - Step 149090: {'lr': 4.674224288844786e-08, 'samples': 28625280, 'steps': 149089, 'loss/train': 1.6041746139526367} 11/07/2021 18:12:34 - INFO - __main__ - Step 149091: {'lr': 4.6639684961075066e-08, 'samples': 28625472, 'steps': 149090, 'loss/train': 1.6649287939071655} 11/07/2021 18:12:35 - INFO - __main__ - Step 149092: {'lr': 4.6537239658861476e-08, 'samples': 28625664, 'steps': 149091, 'loss/train': 1.083066463470459} 11/07/2021 18:12:35 - INFO - __main__ - Step 149093: {'lr': 4.643490698180708e-08, 'samples': 28625856, 'steps': 149092, 'loss/train': 1.3534513711929321} 11/07/2021 18:12:35 - INFO - __main__ - Step 149094: {'lr': 4.6332686929967394e-08, 'samples': 28626048, 'steps': 149093, 'loss/train': 1.2089653015136719} 11/07/2021 18:12:36 - INFO - __main__ - Step 149095: {'lr': 4.623057950339793e-08, 'samples': 28626240, 'steps': 149094, 'loss/train': 2.0327980518341064} 11/07/2021 18:12:37 - INFO - __main__ - Step 149096: {'lr': 4.61285847021542e-08, 'samples': 28626432, 'steps': 149095, 'loss/train': 1.287854790687561} 11/07/2021 18:12:37 - INFO - __main__ - Step 149097: {'lr': 4.6026702526236196e-08, 'samples': 28626624, 'steps': 149096, 'loss/train': 1.1439334154129028} 11/07/2021 18:12:37 - INFO - __main__ - Step 149098: {'lr': 4.592493297575495e-08, 'samples': 28626816, 'steps': 149097, 'loss/train': 1.6686582565307617} 11/07/2021 18:12:38 - INFO - __main__ - Step 149099: {'lr': 4.582327605068271e-08, 'samples': 28627008, 'steps': 149098, 'loss/train': 1.1829010248184204} 11/07/2021 18:12:38 - INFO - __main__ - Step 149100: {'lr': 4.572173175113048e-08, 'samples': 28627200, 'steps': 149099, 'loss/train': 1.0389093160629272} 11/07/2021 18:12:39 - INFO - __main__ - Step 149101: {'lr': 4.562030007712603e-08, 'samples': 28627392, 'steps': 149100, 'loss/train': 0.7928924560546875} 11/07/2021 18:12:40 - INFO - __main__ - Step 149102: {'lr': 4.5518981028697115e-08, 'samples': 28627584, 'steps': 149101, 'loss/train': 1.1229628324508667} 11/07/2021 18:12:40 - INFO - __main__ - Step 149103: {'lr': 4.541777460589924e-08, 'samples': 28627776, 'steps': 149102, 'loss/train': 1.27433180809021} 11/07/2021 18:12:40 - INFO - __main__ - Step 149104: {'lr': 4.5316680808787926e-08, 'samples': 28627968, 'steps': 149103, 'loss/train': 1.4228005409240723} 11/07/2021 18:12:41 - INFO - __main__ - Step 149105: {'lr': 4.5215699637390916e-08, 'samples': 28628160, 'steps': 149104, 'loss/train': 1.262030005455017} 11/07/2021 18:12:42 - INFO - __main__ - Step 149106: {'lr': 4.5114831091791485e-08, 'samples': 28628352, 'steps': 149105, 'loss/train': 1.727910041809082} 11/07/2021 18:12:42 - INFO - __main__ - Step 149107: {'lr': 4.501407517196188e-08, 'samples': 28628544, 'steps': 149106, 'loss/train': 1.773747444152832} 11/07/2021 18:12:42 - INFO - __main__ - Step 149108: {'lr': 4.491343187801311e-08, 'samples': 28628736, 'steps': 149107, 'loss/train': 0.8177186250686646} 11/07/2021 18:12:43 - INFO - __main__ - Step 149109: {'lr': 4.4812901209972947e-08, 'samples': 28628928, 'steps': 149108, 'loss/train': 1.3416056632995605} 11/07/2021 18:12:43 - INFO - __main__ - Step 149110: {'lr': 4.4712483167869134e-08, 'samples': 28629120, 'steps': 149109, 'loss/train': 0.7726855278015137} 11/07/2021 18:12:44 - INFO - __main__ - Step 149111: {'lr': 4.461217775178494e-08, 'samples': 28629312, 'steps': 149110, 'loss/train': 1.4291445016860962} 11/07/2021 18:12:45 - INFO - __main__ - Step 149112: {'lr': 4.451198496172038e-08, 'samples': 28629504, 'steps': 149111, 'loss/train': 1.1800236701965332} 11/07/2021 18:12:45 - INFO - __main__ - Step 149113: {'lr': 4.4411904797758695e-08, 'samples': 28629696, 'steps': 149112, 'loss/train': 1.3635114431381226} 11/07/2021 18:12:45 - INFO - __main__ - Step 149114: {'lr': 4.431193725989991e-08, 'samples': 28629888, 'steps': 149113, 'loss/train': 1.5899966955184937} 11/07/2021 18:12:46 - INFO - __main__ - Step 149115: {'lr': 4.421208234822727e-08, 'samples': 28630080, 'steps': 149114, 'loss/train': 1.5470985174179077} 11/07/2021 18:12:47 - INFO - __main__ - Step 149116: {'lr': 4.41123400627963e-08, 'samples': 28630272, 'steps': 149115, 'loss/train': 1.0797107219696045} 11/07/2021 18:12:47 - INFO - __main__ - Step 149117: {'lr': 4.401271040360699e-08, 'samples': 28630464, 'steps': 149116, 'loss/train': 1.3120291233062744} 11/07/2021 18:12:47 - INFO - __main__ - Step 149118: {'lr': 4.391319337074262e-08, 'samples': 28630656, 'steps': 149117, 'loss/train': 1.273555040359497} 11/07/2021 18:12:48 - INFO - __main__ - Step 149119: {'lr': 4.381378896423094e-08, 'samples': 28630848, 'steps': 149118, 'loss/train': 1.0656378269195557} 11/07/2021 18:12:48 - INFO - __main__ - Step 149120: {'lr': 4.371449718412745e-08, 'samples': 28631040, 'steps': 149119, 'loss/train': 1.468342661857605} 11/07/2021 18:12:49 - INFO - __main__ - Step 149121: {'lr': 4.3615318030459926e-08, 'samples': 28631232, 'steps': 149120, 'loss/train': 0.8238980174064636} 11/07/2021 18:12:49 - INFO - __main__ - Step 149122: {'lr': 4.3516251503283864e-08, 'samples': 28631424, 'steps': 149121, 'loss/train': 1.3574289083480835} 11/07/2021 18:12:50 - INFO - __main__ - Step 149123: {'lr': 4.3417297602627026e-08, 'samples': 28631616, 'steps': 149122, 'loss/train': 1.0860657691955566} 11/07/2021 18:12:50 - INFO - __main__ - Step 149124: {'lr': 4.3318456328572674e-08, 'samples': 28631808, 'steps': 149123, 'loss/train': 1.2808433771133423} 11/07/2021 18:12:51 - INFO - __main__ - Step 149125: {'lr': 4.321972768114857e-08, 'samples': 28632000, 'steps': 149124, 'loss/train': 0.910753607749939} 11/07/2021 18:12:52 - INFO - __main__ - Step 149126: {'lr': 4.3121111660354706e-08, 'samples': 28632192, 'steps': 149125, 'loss/train': 0.707284688949585} 11/07/2021 18:12:52 - INFO - __main__ - Step 149127: {'lr': 4.302260826630211e-08, 'samples': 28632384, 'steps': 149126, 'loss/train': 1.1848032474517822} 11/07/2021 18:12:52 - INFO - __main__ - Step 149128: {'lr': 4.292421749899078e-08, 'samples': 28632576, 'steps': 149127, 'loss/train': 1.0610544681549072} 11/07/2021 18:12:53 - INFO - __main__ - Step 149129: {'lr': 4.282593935850399e-08, 'samples': 28632768, 'steps': 149128, 'loss/train': 1.7681163549423218} 11/07/2021 18:12:53 - INFO - __main__ - Step 149130: {'lr': 4.272777384484172e-08, 'samples': 28632960, 'steps': 149129, 'loss/train': 0.9083795547485352} 11/07/2021 18:12:54 - INFO - __main__ - Step 149131: {'lr': 4.262972095808726e-08, 'samples': 28633152, 'steps': 149130, 'loss/train': 0.4492214620113373} 11/07/2021 18:12:55 - INFO - __main__ - Step 149132: {'lr': 4.2531780698240595e-08, 'samples': 28633344, 'steps': 149131, 'loss/train': 1.080485224723816} 11/07/2021 18:12:55 - INFO - __main__ - Step 149133: {'lr': 4.2433953065385e-08, 'samples': 28633536, 'steps': 149132, 'loss/train': 0.988431990146637} 11/07/2021 18:12:55 - INFO - __main__ - Step 149134: {'lr': 4.233623805957598e-08, 'samples': 28633728, 'steps': 149133, 'loss/train': 1.3621623516082764} 11/07/2021 18:12:56 - INFO - __main__ - Step 149135: {'lr': 4.223863568081354e-08, 'samples': 28633920, 'steps': 149134, 'loss/train': 1.0247046947479248} 11/07/2021 18:12:57 - INFO - __main__ - Step 149136: {'lr': 4.214114592915319e-08, 'samples': 28634112, 'steps': 149135, 'loss/train': 0.49857208132743835} 11/07/2021 18:12:57 - INFO - __main__ - Step 149137: {'lr': 4.204376880465044e-08, 'samples': 28634304, 'steps': 149136, 'loss/train': 1.5472310781478882} 11/07/2021 18:12:57 - INFO - __main__ - Step 149138: {'lr': 4.1946504307333046e-08, 'samples': 28634496, 'steps': 149137, 'loss/train': 1.2357048988342285} 11/07/2021 18:12:58 - INFO - __main__ - Step 149139: {'lr': 4.184935243728427e-08, 'samples': 28634688, 'steps': 149138, 'loss/train': 1.4658515453338623} 11/07/2021 18:12:58 - INFO - __main__ - Step 149140: {'lr': 4.175231319450412e-08, 'samples': 28634880, 'steps': 149139, 'loss/train': 1.4609284400939941} 11/07/2021 18:12:59 - INFO - __main__ - Step 149141: {'lr': 4.16553865790481e-08, 'samples': 28635072, 'steps': 149140, 'loss/train': 1.4990490674972534} 11/07/2021 18:12:59 - INFO - __main__ - Step 149142: {'lr': 4.155857259099949e-08, 'samples': 28635264, 'steps': 149141, 'loss/train': 1.4692107439041138} 11/07/2021 18:13:00 - INFO - __main__ - Step 149143: {'lr': 4.146187123033052e-08, 'samples': 28635456, 'steps': 149142, 'loss/train': 1.054302453994751} 11/07/2021 18:13:00 - INFO - __main__ - Step 149144: {'lr': 4.1365282497124455e-08, 'samples': 28635648, 'steps': 149143, 'loss/train': 1.364483118057251} 11/07/2021 18:13:01 - INFO - __main__ - Step 149145: {'lr': 4.1268806391436816e-08, 'samples': 28635840, 'steps': 149144, 'loss/train': 0.23782849311828613} 11/07/2021 18:13:01 - INFO - __main__ - Step 149146: {'lr': 4.1172442913295364e-08, 'samples': 28636032, 'steps': 149145, 'loss/train': 1.0576425790786743} 11/07/2021 18:13:02 - INFO - __main__ - Step 149147: {'lr': 4.107619206272784e-08, 'samples': 28636224, 'steps': 149146, 'loss/train': 1.5174603462219238} 11/07/2021 18:13:02 - INFO - __main__ - Step 149148: {'lr': 4.098005383981751e-08, 'samples': 28636416, 'steps': 149147, 'loss/train': 0.977217435836792} 11/07/2021 18:13:03 - INFO - __main__ - Step 149149: {'lr': 4.088402824459214e-08, 'samples': 28636608, 'steps': 149148, 'loss/train': 1.4593647718429565} 11/07/2021 18:13:03 - INFO - __main__ - Step 149150: {'lr': 4.0788115277051727e-08, 'samples': 28636800, 'steps': 149149, 'loss/train': 0.20889408886432648} 11/07/2021 18:13:03 - INFO - __main__ - Step 149151: {'lr': 4.0692314937307296e-08, 'samples': 28636992, 'steps': 149150, 'loss/train': 1.3006798028945923} 11/07/2021 18:13:04 - INFO - __main__ - Step 149152: {'lr': 4.0596627225331086e-08, 'samples': 28637184, 'steps': 149151, 'loss/train': 2.2284977436065674} 11/07/2021 18:13:05 - INFO - __main__ - Step 149153: {'lr': 4.050105214123412e-08, 'samples': 28637376, 'steps': 149152, 'loss/train': 1.5539970397949219} 11/07/2021 18:13:05 - INFO - __main__ - Step 149154: {'lr': 4.040558968504415e-08, 'samples': 28637568, 'steps': 149153, 'loss/train': 0.5152166485786438} 11/07/2021 18:13:06 - INFO - __main__ - Step 149155: {'lr': 4.031023985676119e-08, 'samples': 28637760, 'steps': 149154, 'loss/train': 1.3472771644592285} 11/07/2021 18:13:06 - INFO - __main__ - Step 149156: {'lr': 4.0215002656468494e-08, 'samples': 28637952, 'steps': 149155, 'loss/train': 1.1028002500534058} 11/07/2021 18:13:07 - INFO - __main__ - Step 149157: {'lr': 4.0119878084193816e-08, 'samples': 28638144, 'steps': 149156, 'loss/train': 1.508039116859436} 11/07/2021 18:13:07 - INFO - __main__ - Step 149158: {'lr': 4.0024866139992675e-08, 'samples': 28638336, 'steps': 149157, 'loss/train': 1.0820603370666504} 11/07/2021 18:13:08 - INFO - __main__ - Step 149159: {'lr': 3.9929966823892824e-08, 'samples': 28638528, 'steps': 149158, 'loss/train': 1.3402241468429565} 11/07/2021 18:13:08 - INFO - __main__ - Step 149160: {'lr': 3.983518013594978e-08, 'samples': 28638720, 'steps': 149159, 'loss/train': 1.4445828199386597} 11/07/2021 18:13:08 - INFO - __main__ - Step 149161: {'lr': 3.974050607619129e-08, 'samples': 28638912, 'steps': 149160, 'loss/train': 1.2659348249435425} 11/07/2021 18:13:09 - INFO - __main__ - Step 149162: {'lr': 3.964594464467286e-08, 'samples': 28639104, 'steps': 149161, 'loss/train': 1.2372959852218628} 11/07/2021 18:13:10 - INFO - __main__ - Step 149163: {'lr': 3.955149584142226e-08, 'samples': 28639296, 'steps': 149162, 'loss/train': 1.3389532566070557} 11/07/2021 18:13:10 - INFO - __main__ - Step 149164: {'lr': 3.945715966649499e-08, 'samples': 28639488, 'steps': 149163, 'loss/train': 1.4361069202423096} 11/07/2021 18:13:10 - INFO - __main__ - Step 149165: {'lr': 3.936293611994657e-08, 'samples': 28639680, 'steps': 149164, 'loss/train': 1.259766697883606} 11/07/2021 18:13:11 - INFO - __main__ - Step 149166: {'lr': 3.9268825201804746e-08, 'samples': 28639872, 'steps': 149165, 'loss/train': 1.4282909631729126} 11/07/2021 18:13:11 - INFO - __main__ - Step 149167: {'lr': 3.917482691209728e-08, 'samples': 28640064, 'steps': 149166, 'loss/train': 1.4173554182052612} 11/07/2021 18:13:12 - INFO - __main__ - Step 149168: {'lr': 3.9080941250879686e-08, 'samples': 28640256, 'steps': 149167, 'loss/train': 1.475623369216919} 11/07/2021 18:13:13 - INFO - __main__ - Step 149169: {'lr': 3.898716821820747e-08, 'samples': 28640448, 'steps': 149168, 'loss/train': 1.7172783613204956} 11/07/2021 18:13:13 - INFO - __main__ - Step 149170: {'lr': 3.889350781410839e-08, 'samples': 28640640, 'steps': 149169, 'loss/train': 1.2472301721572876} 11/07/2021 18:13:13 - INFO - __main__ - Step 149171: {'lr': 3.8799960038610194e-08, 'samples': 28640832, 'steps': 149170, 'loss/train': 1.3052051067352295} 11/07/2021 18:13:14 - INFO - __main__ - Step 149172: {'lr': 3.8706524891796155e-08, 'samples': 28641024, 'steps': 149171, 'loss/train': 1.6343764066696167} 11/07/2021 18:13:15 - INFO - __main__ - Step 149173: {'lr': 3.8613202373666276e-08, 'samples': 28641216, 'steps': 149172, 'loss/train': 1.6411360502243042} 11/07/2021 18:13:15 - INFO - __main__ - Step 149174: {'lr': 3.851999248427607e-08, 'samples': 28641408, 'steps': 149173, 'loss/train': 1.2787046432495117} 11/07/2021 18:13:15 - INFO - __main__ - Step 149175: {'lr': 3.842689522368103e-08, 'samples': 28641600, 'steps': 149174, 'loss/train': 0.9292144775390625} 11/07/2021 18:13:16 - INFO - __main__ - Step 149176: {'lr': 3.8333910591908935e-08, 'samples': 28641792, 'steps': 149175, 'loss/train': 1.6841505765914917} 11/07/2021 18:13:16 - INFO - __main__ - Step 149177: {'lr': 3.824103858901529e-08, 'samples': 28641984, 'steps': 149176, 'loss/train': 1.6098246574401855} 11/07/2021 18:13:17 - INFO - __main__ - Step 149178: {'lr': 3.8148279215027835e-08, 'samples': 28642176, 'steps': 149177, 'loss/train': 1.6919506788253784} 11/07/2021 18:13:17 - INFO - __main__ - Step 149179: {'lr': 3.80556324700021e-08, 'samples': 28642368, 'steps': 149178, 'loss/train': 1.3550647497177124} 11/07/2021 18:13:18 - INFO - __main__ - Step 149180: {'lr': 3.796309835396583e-08, 'samples': 28642560, 'steps': 149179, 'loss/train': 1.5531914234161377} 11/07/2021 18:13:18 - INFO - __main__ - Step 149181: {'lr': 3.787067686694678e-08, 'samples': 28642752, 'steps': 149180, 'loss/train': 0.8927062749862671} 11/07/2021 18:13:18 - INFO - __main__ - Step 149182: {'lr': 3.777836800902823e-08, 'samples': 28642944, 'steps': 149181, 'loss/train': 1.4952350854873657} 11/07/2021 18:13:19 - INFO - __main__ - Step 149183: {'lr': 3.768617178023792e-08, 'samples': 28643136, 'steps': 149182, 'loss/train': 1.1951849460601807} 11/07/2021 18:13:20 - INFO - __main__ - Step 149184: {'lr': 3.759408818057586e-08, 'samples': 28643328, 'steps': 149183, 'loss/train': 1.157591700553894} 11/07/2021 18:13:20 - INFO - __main__ - Step 149185: {'lr': 3.7502117210153065e-08, 'samples': 28643520, 'steps': 149184, 'loss/train': 1.6176035404205322} 11/07/2021 18:13:21 - INFO - __main__ - Step 149186: {'lr': 3.7410258868941784e-08, 'samples': 28643712, 'steps': 149185, 'loss/train': 1.14633047580719} 11/07/2021 18:13:21 - INFO - __main__ - Step 149187: {'lr': 3.7318513157053036e-08, 'samples': 28643904, 'steps': 149186, 'loss/train': 1.15725576877594} 11/07/2021 18:13:22 - INFO - __main__ - Step 149188: {'lr': 3.722688007445907e-08, 'samples': 28644096, 'steps': 149187, 'loss/train': 1.2089614868164062} 11/07/2021 18:13:22 - INFO - __main__ - Step 149189: {'lr': 3.7135359621243146e-08, 'samples': 28644288, 'steps': 149188, 'loss/train': 1.0767865180969238} 11/07/2021 18:13:23 - INFO - __main__ - Step 149190: {'lr': 3.7043951797433026e-08, 'samples': 28644480, 'steps': 149189, 'loss/train': 1.3378938436508179} 11/07/2021 18:13:23 - INFO - __main__ - Step 149191: {'lr': 3.6952656603084225e-08, 'samples': 28644672, 'steps': 149190, 'loss/train': 1.5647996664047241} 11/07/2021 18:13:23 - INFO - __main__ - Step 149192: {'lr': 3.6861474038224486e-08, 'samples': 28644864, 'steps': 149191, 'loss/train': 1.4275761842727661} 11/07/2021 18:13:25 - INFO - __main__ - Step 149193: {'lr': 3.677040410290933e-08, 'samples': 28645056, 'steps': 149192, 'loss/train': 1.0022231340408325} 11/07/2021 18:13:25 - INFO - __main__ - Step 149194: {'lr': 3.6679446797138746e-08, 'samples': 28645248, 'steps': 149193, 'loss/train': 1.2323939800262451} 11/07/2021 18:13:25 - INFO - __main__ - Step 149195: {'lr': 3.658860212099602e-08, 'samples': 28645440, 'steps': 149194, 'loss/train': 1.2191424369812012} 11/07/2021 18:13:26 - INFO - __main__ - Step 149196: {'lr': 3.649787007453664e-08, 'samples': 28645632, 'steps': 149195, 'loss/train': 1.7939742803573608} 11/07/2021 18:13:26 - INFO - __main__ - Step 149197: {'lr': 3.640725065773287e-08, 'samples': 28645824, 'steps': 149196, 'loss/train': 0.6662933826446533} 11/07/2021 18:13:26 - INFO - __main__ - Step 149198: {'lr': 3.631674387069572e-08, 'samples': 28646016, 'steps': 149197, 'loss/train': 0.9732807278633118} 11/07/2021 18:13:27 - INFO - __main__ - Step 149199: {'lr': 3.62263497134252e-08, 'samples': 28646208, 'steps': 149198, 'loss/train': 0.3500913381576538} 11/07/2021 18:13:28 - INFO - __main__ - Step 149200: {'lr': 3.613606818597681e-08, 'samples': 28646400, 'steps': 149199, 'loss/train': 1.5321025848388672} 11/07/2021 18:13:28 - INFO - __main__ - Step 149201: {'lr': 3.604589928837832e-08, 'samples': 28646592, 'steps': 149200, 'loss/train': 1.062096118927002} 11/07/2021 18:13:28 - INFO - __main__ - Step 149202: {'lr': 3.595584302068522e-08, 'samples': 28646784, 'steps': 149201, 'loss/train': 1.3482815027236938} 11/07/2021 18:13:29 - INFO - __main__ - Step 149203: {'lr': 3.5865899382953034e-08, 'samples': 28646976, 'steps': 149202, 'loss/train': 0.8103650212287903} 11/07/2021 18:13:30 - INFO - __main__ - Step 149204: {'lr': 3.577606837518177e-08, 'samples': 28647168, 'steps': 149203, 'loss/train': 1.5168330669403076} 11/07/2021 18:13:30 - INFO - __main__ - Step 149205: {'lr': 3.5686349997426924e-08, 'samples': 28647360, 'steps': 149204, 'loss/train': 1.1291526556015015} 11/07/2021 18:13:30 - INFO - __main__ - Step 149206: {'lr': 3.559674424974402e-08, 'samples': 28647552, 'steps': 149205, 'loss/train': 0.9200286269187927} 11/07/2021 18:13:31 - INFO - __main__ - Step 149207: {'lr': 3.55072511321608e-08, 'samples': 28647744, 'steps': 149206, 'loss/train': 1.3746739625930786} 11/07/2021 18:13:31 - INFO - __main__ - Step 149208: {'lr': 3.5417870644732784e-08, 'samples': 28647936, 'steps': 149207, 'loss/train': 1.2857511043548584} 11/07/2021 18:13:32 - INFO - __main__ - Step 149209: {'lr': 3.5328602787487726e-08, 'samples': 28648128, 'steps': 149208, 'loss/train': 1.4185298681259155} 11/07/2021 18:13:32 - INFO - __main__ - Step 149210: {'lr': 3.523944756045339e-08, 'samples': 28648320, 'steps': 149209, 'loss/train': 1.282027006149292} 11/07/2021 18:13:33 - INFO - __main__ - Step 149211: {'lr': 3.515040496368527e-08, 'samples': 28648512, 'steps': 149210, 'loss/train': 1.459620475769043} 11/07/2021 18:13:33 - INFO - __main__ - Step 149212: {'lr': 3.506147499721113e-08, 'samples': 28648704, 'steps': 149211, 'loss/train': 1.6539149284362793} 11/07/2021 18:13:34 - INFO - __main__ - Step 149213: {'lr': 3.4972657661114236e-08, 'samples': 28648896, 'steps': 149212, 'loss/train': 1.1493757963180542} 11/07/2021 18:13:35 - INFO - __main__ - Step 149214: {'lr': 3.488395295536684e-08, 'samples': 28649088, 'steps': 149213, 'loss/train': 0.579005777835846} 11/07/2021 18:13:35 - INFO - __main__ - Step 149215: {'lr': 3.4795360880052195e-08, 'samples': 28649280, 'steps': 149214, 'loss/train': 1.2537206411361694} 11/07/2021 18:13:35 - INFO - __main__ - Step 149216: {'lr': 3.4706881435225823e-08, 'samples': 28649472, 'steps': 149215, 'loss/train': 1.6352647542953491} 11/07/2021 18:13:36 - INFO - __main__ - Step 149217: {'lr': 3.461851462088772e-08, 'samples': 28649664, 'steps': 149216, 'loss/train': 1.7926814556121826} 11/07/2021 18:13:36 - INFO - __main__ - Step 149218: {'lr': 3.4530260437093395e-08, 'samples': 28649856, 'steps': 149217, 'loss/train': 1.4993352890014648} 11/07/2021 18:13:36 - INFO - __main__ - Step 149219: {'lr': 3.444211888387061e-08, 'samples': 28650048, 'steps': 149218, 'loss/train': 1.7740142345428467} 11/07/2021 18:13:38 - INFO - __main__ - Step 149220: {'lr': 3.435408996127487e-08, 'samples': 28650240, 'steps': 149219, 'loss/train': 0.5856743454933167} 11/07/2021 18:13:38 - INFO - __main__ - Step 149221: {'lr': 3.426617366936169e-08, 'samples': 28650432, 'steps': 149220, 'loss/train': 1.188943862915039} 11/07/2021 18:13:38 - INFO - __main__ - Step 149222: {'lr': 3.417837000813107e-08, 'samples': 28650624, 'steps': 149221, 'loss/train': 1.3758894205093384} 11/07/2021 18:13:39 - INFO - __main__ - Step 149223: {'lr': 3.409067897763851e-08, 'samples': 28650816, 'steps': 149222, 'loss/train': 1.2434715032577515} 11/07/2021 18:13:39 - INFO - __main__ - Step 149224: {'lr': 3.4003100577939536e-08, 'samples': 28651008, 'steps': 149223, 'loss/train': 1.2560176849365234} 11/07/2021 18:13:40 - INFO - __main__ - Step 149225: {'lr': 3.39156348090619e-08, 'samples': 28651200, 'steps': 149224, 'loss/train': 1.4880198240280151} 11/07/2021 18:13:40 - INFO - __main__ - Step 149226: {'lr': 3.3828281671033355e-08, 'samples': 28651392, 'steps': 149225, 'loss/train': 1.5351670980453491} 11/07/2021 18:13:41 - INFO - __main__ - Step 149227: {'lr': 3.374104116390941e-08, 'samples': 28651584, 'steps': 149226, 'loss/train': 1.3620963096618652} 11/07/2021 18:13:41 - INFO - __main__ - Step 149228: {'lr': 3.365391328774558e-08, 'samples': 28651776, 'steps': 149227, 'loss/train': 1.063909649848938} 11/07/2021 18:13:41 - INFO - __main__ - Step 149229: {'lr': 3.356689804254187e-08, 'samples': 28651968, 'steps': 149228, 'loss/train': 1.4226418733596802} 11/07/2021 18:13:42 - INFO - __main__ - Step 149230: {'lr': 3.347999542835378e-08, 'samples': 28652160, 'steps': 149229, 'loss/train': 1.0988030433654785} 11/07/2021 18:13:43 - INFO - __main__ - Step 149231: {'lr': 3.3393205445209076e-08, 'samples': 28652352, 'steps': 149230, 'loss/train': 1.4645708799362183} 11/07/2021 18:13:43 - INFO - __main__ - Step 149232: {'lr': 3.330652809319101e-08, 'samples': 28652544, 'steps': 149231, 'loss/train': 0.8686509728431702} 11/07/2021 18:13:43 - INFO - __main__ - Step 149233: {'lr': 3.321996337227184e-08, 'samples': 28652736, 'steps': 149232, 'loss/train': 1.5196502208709717} 11/07/2021 18:13:44 - INFO - __main__ - Step 149234: {'lr': 3.313351128256259e-08, 'samples': 28652928, 'steps': 149233, 'loss/train': 1.5583770275115967} 11/07/2021 18:13:45 - INFO - __main__ - Step 149235: {'lr': 3.3047171824035496e-08, 'samples': 28653120, 'steps': 149234, 'loss/train': 1.6795140504837036} 11/07/2021 18:13:45 - INFO - __main__ - Step 149236: {'lr': 3.2960944996801576e-08, 'samples': 28653312, 'steps': 149235, 'loss/train': 1.0138683319091797} 11/07/2021 18:13:46 - INFO - __main__ - Step 149237: {'lr': 3.287483080080533e-08, 'samples': 28653504, 'steps': 149236, 'loss/train': 1.286368727684021} 11/07/2021 18:13:46 - INFO - __main__ - Step 149238: {'lr': 3.2788829236185534e-08, 'samples': 28653696, 'steps': 149237, 'loss/train': 1.1761510372161865} 11/07/2021 18:13:46 - INFO - __main__ - Step 149239: {'lr': 3.270294030291443e-08, 'samples': 28653888, 'steps': 149238, 'loss/train': 1.5316901206970215} 11/07/2021 18:13:47 - INFO - __main__ - Step 149240: {'lr': 3.261716400104753e-08, 'samples': 28654080, 'steps': 149239, 'loss/train': 1.2010847330093384} 11/07/2021 18:13:48 - INFO - __main__ - Step 149241: {'lr': 3.2531500330612586e-08, 'samples': 28654272, 'steps': 149240, 'loss/train': 1.4344087839126587} 11/07/2021 18:13:48 - INFO - __main__ - Step 149242: {'lr': 3.244594929169287e-08, 'samples': 28654464, 'steps': 149241, 'loss/train': 1.210158348083496} 11/07/2021 18:13:49 - INFO - __main__ - Step 149243: {'lr': 3.236051088426062e-08, 'samples': 28654656, 'steps': 149242, 'loss/train': 1.5868446826934814} 11/07/2021 18:13:49 - INFO - __main__ - Step 149244: {'lr': 3.227518510842686e-08, 'samples': 28654848, 'steps': 149243, 'loss/train': 0.9963176250457764} 11/07/2021 18:13:49 - INFO - __main__ - Step 149245: {'lr': 3.2189971964163846e-08, 'samples': 28655040, 'steps': 149244, 'loss/train': 1.4224897623062134} 11/07/2021 18:13:50 - INFO - __main__ - Step 149246: {'lr': 3.210487145155483e-08, 'samples': 28655232, 'steps': 149245, 'loss/train': 1.464375615119934} 11/07/2021 18:13:51 - INFO - __main__ - Step 149247: {'lr': 3.201988357062757e-08, 'samples': 28655424, 'steps': 149246, 'loss/train': 1.3471455574035645} 11/07/2021 18:13:51 - INFO - __main__ - Step 149248: {'lr': 3.193500832138208e-08, 'samples': 28655616, 'steps': 149247, 'loss/train': 1.685253620147705} 11/07/2021 18:13:51 - INFO - __main__ - Step 149249: {'lr': 3.185024570392936e-08, 'samples': 28655808, 'steps': 149248, 'loss/train': 1.3976025581359863} 11/07/2021 18:13:52 - INFO - __main__ - Step 149250: {'lr': 3.176559571824167e-08, 'samples': 28656000, 'steps': 149249, 'loss/train': 1.3652896881103516} 11/07/2021 18:13:53 - INFO - __main__ - Step 149251: {'lr': 3.168105836440227e-08, 'samples': 28656192, 'steps': 149250, 'loss/train': 1.2969226837158203} 11/07/2021 18:13:53 - INFO - __main__ - Step 149252: {'lr': 3.159663364241117e-08, 'samples': 28656384, 'steps': 149251, 'loss/train': 1.3956645727157593} 11/07/2021 18:13:53 - INFO - __main__ - Step 149253: {'lr': 3.1512321552323865e-08, 'samples': 28656576, 'steps': 149252, 'loss/train': 1.3572702407836914} 11/07/2021 18:13:54 - INFO - __main__ - Step 149254: {'lr': 3.1428122094195875e-08, 'samples': 28656768, 'steps': 149253, 'loss/train': 1.1293936967849731} 11/07/2021 18:13:54 - INFO - __main__ - Step 149255: {'lr': 3.134403526805496e-08, 'samples': 28656960, 'steps': 149254, 'loss/train': 1.5305206775665283} 11/07/2021 18:13:55 - INFO - __main__ - Step 149256: {'lr': 3.1260061073901116e-08, 'samples': 28657152, 'steps': 149255, 'loss/train': 0.7814787030220032} 11/07/2021 18:13:56 - INFO - __main__ - Step 149257: {'lr': 3.117619951184536e-08, 'samples': 28657344, 'steps': 149256, 'loss/train': 1.4945553541183472} 11/07/2021 18:13:56 - INFO - __main__ - Step 149258: {'lr': 3.109245058185994e-08, 'samples': 28657536, 'steps': 149257, 'loss/train': 1.4554269313812256} 11/07/2021 18:13:56 - INFO - __main__ - Step 149259: {'lr': 3.100881428400038e-08, 'samples': 28657728, 'steps': 149258, 'loss/train': 1.1719932556152344} 11/07/2021 18:13:57 - INFO - __main__ - Step 149260: {'lr': 3.092529061832217e-08, 'samples': 28657920, 'steps': 149259, 'loss/train': 1.735446810722351} 11/07/2021 18:13:57 - INFO - __main__ - Step 149261: {'lr': 3.084187958485307e-08, 'samples': 28658112, 'steps': 149260, 'loss/train': 1.5891817808151245} 11/07/2021 18:13:58 - INFO - __main__ - Step 149262: {'lr': 3.075858118362085e-08, 'samples': 28658304, 'steps': 149261, 'loss/train': 0.6442076563835144} 11/07/2021 18:13:58 - INFO - __main__ - Step 149263: {'lr': 3.0675395414681005e-08, 'samples': 28658496, 'steps': 149262, 'loss/train': 1.181536078453064} 11/07/2021 18:13:59 - INFO - __main__ - Step 149264: {'lr': 3.05923222780613e-08, 'samples': 28658688, 'steps': 149263, 'loss/train': 1.3116836547851562} 11/07/2021 18:13:59 - INFO - __main__ - Step 149265: {'lr': 3.050936177378949e-08, 'samples': 28658880, 'steps': 149264, 'loss/train': 1.3028775453567505} 11/07/2021 18:13:59 - INFO - __main__ - Step 149266: {'lr': 3.0426513901921085e-08, 'samples': 28659072, 'steps': 149265, 'loss/train': 1.141516089439392} 11/07/2021 18:14:00 - INFO - __main__ - Step 149267: {'lr': 3.0343778662483836e-08, 'samples': 28659264, 'steps': 149266, 'loss/train': 1.5869016647338867} 11/07/2021 18:14:01 - INFO - __main__ - Step 149268: {'lr': 3.02611560555055e-08, 'samples': 28659456, 'steps': 149267, 'loss/train': 2.3302762508392334} 11/07/2021 18:14:01 - INFO - __main__ - Step 149269: {'lr': 3.017864608106935e-08, 'samples': 28659648, 'steps': 149268, 'loss/train': 1.4570399522781372} 11/07/2021 18:14:02 - INFO - __main__ - Step 149270: {'lr': 3.0096248739147625e-08, 'samples': 28659840, 'steps': 149269, 'loss/train': 1.2675601243972778} 11/07/2021 18:14:02 - INFO - __main__ - Step 149271: {'lr': 3.0013964029795835e-08, 'samples': 28660032, 'steps': 149270, 'loss/train': 1.3969788551330566} 11/07/2021 18:14:03 - INFO - __main__ - Step 149272: {'lr': 2.993179195309725e-08, 'samples': 28660224, 'steps': 149271, 'loss/train': 1.4358092546463013} 11/07/2021 18:14:03 - INFO - __main__ - Step 149273: {'lr': 2.984973250902412e-08, 'samples': 28660416, 'steps': 149272, 'loss/train': 1.6857722997665405} 11/07/2021 18:14:04 - INFO - __main__ - Step 149274: {'lr': 2.97677856976597e-08, 'samples': 28660608, 'steps': 149273, 'loss/train': 1.0394572019577026} 11/07/2021 18:14:04 - INFO - __main__ - Step 149275: {'lr': 2.968595151903175e-08, 'samples': 28660800, 'steps': 149274, 'loss/train': 1.0537910461425781} 11/07/2021 18:14:04 - INFO - __main__ - Step 149276: {'lr': 2.9604229973140273e-08, 'samples': 28660992, 'steps': 149275, 'loss/train': 1.5451314449310303} 11/07/2021 18:14:05 - INFO - __main__ - Step 149277: {'lr': 2.9522621060068533e-08, 'samples': 28661184, 'steps': 149276, 'loss/train': 1.3427538871765137} 11/07/2021 18:14:06 - INFO - __main__ - Step 149278: {'lr': 2.9441124779844287e-08, 'samples': 28661376, 'steps': 149277, 'loss/train': 1.3729074001312256} 11/07/2021 18:14:06 - INFO - __main__ - Step 149279: {'lr': 2.935974113249529e-08, 'samples': 28661568, 'steps': 149278, 'loss/train': 1.042055606842041} 11/07/2021 18:14:07 - INFO - __main__ - Step 149280: {'lr': 2.9278470118049295e-08, 'samples': 28661760, 'steps': 149279, 'loss/train': 1.5501749515533447} 11/07/2021 18:14:07 - INFO - __main__ - Step 149281: {'lr': 2.9197311736561815e-08, 'samples': 28661952, 'steps': 149280, 'loss/train': 1.2427699565887451} 11/07/2021 18:14:07 - INFO - __main__ - Step 149282: {'lr': 2.9116265988060607e-08, 'samples': 28662144, 'steps': 149281, 'loss/train': 1.1348974704742432} 11/07/2021 18:14:08 - INFO - __main__ - Step 149283: {'lr': 2.9035332872573427e-08, 'samples': 28662336, 'steps': 149282, 'loss/train': 1.25687837600708} 11/07/2021 18:14:09 - INFO - __main__ - Step 149284: {'lr': 2.8954512390155783e-08, 'samples': 28662528, 'steps': 149283, 'loss/train': 1.2382670640945435} 11/07/2021 18:14:09 - INFO - __main__ - Step 149285: {'lr': 2.8873804540835435e-08, 'samples': 28662720, 'steps': 149284, 'loss/train': 1.6526788473129272} 11/07/2021 18:14:09 - INFO - __main__ - Step 149286: {'lr': 2.8793209324640136e-08, 'samples': 28662912, 'steps': 149285, 'loss/train': 1.5541012287139893} 11/07/2021 18:14:10 - INFO - __main__ - Step 149287: {'lr': 2.871272674159764e-08, 'samples': 28663104, 'steps': 149286, 'loss/train': 0.9925393462181091} 11/07/2021 18:14:11 - INFO - __main__ - Step 149288: {'lr': 2.8632356791791214e-08, 'samples': 28663296, 'steps': 149287, 'loss/train': 1.0901999473571777} 11/07/2021 18:14:11 - INFO - __main__ - Step 149289: {'lr': 2.8552099475193105e-08, 'samples': 28663488, 'steps': 149288, 'loss/train': 1.4122563600540161} 11/07/2021 18:14:12 - INFO - __main__ - Step 149290: {'lr': 2.8471954791914333e-08, 'samples': 28663680, 'steps': 149289, 'loss/train': 1.4359803199768066} 11/07/2021 18:14:12 - INFO - __main__ - Step 149291: {'lr': 2.8391922741927145e-08, 'samples': 28663872, 'steps': 149290, 'loss/train': 1.2655744552612305} 11/07/2021 18:14:12 - INFO - __main__ - Step 149292: {'lr': 2.831200332528705e-08, 'samples': 28664064, 'steps': 149291, 'loss/train': 1.2373827695846558} 11/07/2021 18:14:13 - INFO - __main__ - Step 149293: {'lr': 2.8232196542021803e-08, 'samples': 28664256, 'steps': 149292, 'loss/train': 1.0464951992034912} 11/07/2021 18:14:14 - INFO - __main__ - Step 149294: {'lr': 2.815250239218692e-08, 'samples': 28664448, 'steps': 149293, 'loss/train': 1.133966088294983} 11/07/2021 18:14:14 - INFO - __main__ - Step 149295: {'lr': 2.8072920875810147e-08, 'samples': 28664640, 'steps': 149294, 'loss/train': 0.7288067936897278} 11/07/2021 18:14:14 - INFO - __main__ - Step 149296: {'lr': 2.799345199291925e-08, 'samples': 28664832, 'steps': 149295, 'loss/train': 1.077060580253601} 11/07/2021 18:14:15 - INFO - __main__ - Step 149297: {'lr': 2.7914095743569734e-08, 'samples': 28665024, 'steps': 149296, 'loss/train': 1.2939577102661133} 11/07/2021 18:14:16 - INFO - __main__ - Step 149298: {'lr': 2.7834852127789356e-08, 'samples': 28665216, 'steps': 149297, 'loss/train': 1.2059882879257202} 11/07/2021 18:14:16 - INFO - __main__ - Step 149299: {'lr': 2.775572114560587e-08, 'samples': 28665408, 'steps': 149298, 'loss/train': 1.2229108810424805} 11/07/2021 18:14:16 - INFO - __main__ - Step 149300: {'lr': 2.7676702797047036e-08, 'samples': 28665600, 'steps': 149299, 'loss/train': 1.0368962287902832} 11/07/2021 18:14:17 - INFO - __main__ - Step 149301: {'lr': 2.7597797082168365e-08, 'samples': 28665792, 'steps': 149300, 'loss/train': 1.7098400592803955} 11/07/2021 18:14:17 - INFO - __main__ - Step 149302: {'lr': 2.751900400096985e-08, 'samples': 28665984, 'steps': 149301, 'loss/train': 1.1068367958068848} 11/07/2021 18:14:18 - INFO - __main__ - Step 149303: {'lr': 2.7440323553562517e-08, 'samples': 28666176, 'steps': 149302, 'loss/train': 1.2229390144348145} 11/07/2021 18:14:19 - INFO - __main__ - Step 149304: {'lr': 2.7361755739890858e-08, 'samples': 28666368, 'steps': 149303, 'loss/train': 1.5172635316848755} 11/07/2021 18:14:19 - INFO - __main__ - Step 149305: {'lr': 2.7283300560065894e-08, 'samples': 28666560, 'steps': 149304, 'loss/train': 1.3920927047729492} 11/07/2021 18:14:19 - INFO - __main__ - Step 149306: {'lr': 2.7204958014059868e-08, 'samples': 28666752, 'steps': 149305, 'loss/train': 0.4414733052253723} 11/07/2021 18:14:20 - INFO - __main__ - Step 149307: {'lr': 2.7126728101956044e-08, 'samples': 28666944, 'steps': 149306, 'loss/train': 1.185680627822876} 11/07/2021 18:14:20 - INFO - __main__ - Step 149308: {'lr': 2.7048610823782182e-08, 'samples': 28667136, 'steps': 149307, 'loss/train': 1.0865931510925293} 11/07/2021 18:14:21 - INFO - __main__ - Step 149309: {'lr': 2.6970606179538282e-08, 'samples': 28667328, 'steps': 149308, 'loss/train': 0.17766904830932617} 11/07/2021 18:14:22 - INFO - __main__ - Step 149310: {'lr': 2.689271416927985e-08, 'samples': 28667520, 'steps': 149309, 'loss/train': 1.439182162284851} 11/07/2021 18:14:22 - INFO - __main__ - Step 149311: {'lr': 2.6814934793062408e-08, 'samples': 28667712, 'steps': 149310, 'loss/train': 1.3539329767227173} 11/07/2021 18:14:22 - INFO - __main__ - Step 149312: {'lr': 2.67372680509137e-08, 'samples': 28667904, 'steps': 149311, 'loss/train': 1.3286848068237305} 11/07/2021 18:14:23 - INFO - __main__ - Step 149313: {'lr': 2.665971394283373e-08, 'samples': 28668096, 'steps': 149312, 'loss/train': 1.197540521621704} 11/07/2021 18:14:24 - INFO - __main__ - Step 149314: {'lr': 2.6582272468905767e-08, 'samples': 28668288, 'steps': 149313, 'loss/train': 1.8303619623184204} 11/07/2021 18:14:24 - INFO - __main__ - Step 149315: {'lr': 2.6504943629129807e-08, 'samples': 28668480, 'steps': 149314, 'loss/train': 0.6851144433021545} 11/07/2021 18:14:24 - INFO - __main__ - Step 149316: {'lr': 2.6427727423561366e-08, 'samples': 28668672, 'steps': 149315, 'loss/train': 1.173101544380188} 11/07/2021 18:14:25 - INFO - __main__ - Step 149317: {'lr': 2.6350623852228195e-08, 'samples': 28668864, 'steps': 149316, 'loss/train': 0.8327307105064392} 11/07/2021 18:14:25 - INFO - __main__ - Step 149318: {'lr': 2.6273632915158053e-08, 'samples': 28669056, 'steps': 149317, 'loss/train': 0.9893395900726318} 11/07/2021 18:14:26 - INFO - __main__ - Step 149319: {'lr': 2.619675461240645e-08, 'samples': 28669248, 'steps': 149318, 'loss/train': 1.3035341501235962} 11/07/2021 18:14:27 - INFO - __main__ - Step 149320: {'lr': 2.6119988943973384e-08, 'samples': 28669440, 'steps': 149319, 'loss/train': 1.1815766096115112} 11/07/2021 18:14:27 - INFO - __main__ - Step 149321: {'lr': 2.6043335909942122e-08, 'samples': 28669632, 'steps': 149320, 'loss/train': 1.3375296592712402} 11/07/2021 18:14:27 - INFO - __main__ - Step 149322: {'lr': 2.5966795510284912e-08, 'samples': 28669824, 'steps': 149321, 'loss/train': 0.1231415867805481} 11/07/2021 18:14:28 - INFO - __main__ - Step 149323: {'lr': 2.5890367745085018e-08, 'samples': 28670016, 'steps': 149322, 'loss/train': 1.1821995973587036} 11/07/2021 18:14:28 - INFO - __main__ - Step 149324: {'lr': 2.5814052614370197e-08, 'samples': 28670208, 'steps': 149323, 'loss/train': 1.3125114440917969} 11/07/2021 18:14:30 - INFO - __main__ - Step 149325: {'lr': 2.573785011814045e-08, 'samples': 28670400, 'steps': 149324, 'loss/train': 1.2085219621658325} 11/07/2021 18:14:30 - INFO - __main__ - Step 149326: {'lr': 2.566176025647904e-08, 'samples': 28670592, 'steps': 149325, 'loss/train': 1.2720409631729126} 11/07/2021 18:14:30 - INFO - __main__ - Step 149327: {'lr': 2.5585783029385967e-08, 'samples': 28670784, 'steps': 149326, 'loss/train': 1.0646849870681763} 11/07/2021 18:14:31 - INFO - __main__ - Step 149328: {'lr': 2.5509918436916745e-08, 'samples': 28670976, 'steps': 149327, 'loss/train': 1.7433141469955444} 11/07/2021 18:14:31 - INFO - __main__ - Step 149329: {'lr': 2.5434166479071374e-08, 'samples': 28671168, 'steps': 149328, 'loss/train': 0.5211770534515381} 11/07/2021 18:14:31 - INFO - __main__ - Step 149330: {'lr': 2.535852715593312e-08, 'samples': 28671360, 'steps': 149329, 'loss/train': 0.15642684698104858} 11/07/2021 18:14:32 - INFO - __main__ - Step 149331: {'lr': 2.5283000467501984e-08, 'samples': 28671552, 'steps': 149330, 'loss/train': 1.4101848602294922} 11/07/2021 18:14:33 - INFO - __main__ - Step 149332: {'lr': 2.520758641383347e-08, 'samples': 28671744, 'steps': 149331, 'loss/train': 1.0757824182510376} 11/07/2021 18:14:33 - INFO - __main__ - Step 149333: {'lr': 2.513228499492759e-08, 'samples': 28671936, 'steps': 149332, 'loss/train': 1.5120267868041992} 11/07/2021 18:14:33 - INFO - __main__ - Step 149334: {'lr': 2.5057096210839846e-08, 'samples': 28672128, 'steps': 149333, 'loss/train': 1.6958558559417725} 11/07/2021 18:14:34 - INFO - __main__ - Step 149335: {'lr': 2.4982020061625754e-08, 'samples': 28672320, 'steps': 149334, 'loss/train': 1.4713501930236816} 11/07/2021 18:14:35 - INFO - __main__ - Step 149336: {'lr': 2.4907056547285312e-08, 'samples': 28672512, 'steps': 149335, 'loss/train': 1.337040901184082} 11/07/2021 18:14:35 - INFO - __main__ - Step 149337: {'lr': 2.4832205667846273e-08, 'samples': 28672704, 'steps': 149336, 'loss/train': 1.2508430480957031} 11/07/2021 18:14:35 - INFO - __main__ - Step 149338: {'lr': 2.475746742339191e-08, 'samples': 28672896, 'steps': 149337, 'loss/train': 1.8884038925170898} 11/07/2021 18:14:36 - INFO - __main__ - Step 149339: {'lr': 2.468284181392222e-08, 'samples': 28673088, 'steps': 149338, 'loss/train': 1.7555692195892334} 11/07/2021 18:14:36 - INFO - __main__ - Step 149340: {'lr': 2.46083288394372e-08, 'samples': 28673280, 'steps': 149339, 'loss/train': 1.451332688331604} 11/07/2021 18:14:37 - INFO - __main__ - Step 149341: {'lr': 2.4533928500047876e-08, 'samples': 28673472, 'steps': 149340, 'loss/train': 1.1936603784561157} 11/07/2021 18:14:38 - INFO - __main__ - Step 149342: {'lr': 2.445964079572649e-08, 'samples': 28673664, 'steps': 149341, 'loss/train': 1.3391413688659668} 11/07/2021 18:14:38 - INFO - __main__ - Step 149343: {'lr': 2.4385465726528557e-08, 'samples': 28673856, 'steps': 149342, 'loss/train': 1.2682265043258667} 11/07/2021 18:14:38 - INFO - __main__ - Step 149344: {'lr': 2.4311403292481828e-08, 'samples': 28674048, 'steps': 149343, 'loss/train': 1.3341050148010254} 11/07/2021 18:14:39 - INFO - __main__ - Step 149345: {'lr': 2.4237453493641815e-08, 'samples': 28674240, 'steps': 149344, 'loss/train': 1.2851461172103882} 11/07/2021 18:14:39 - INFO - __main__ - Step 149346: {'lr': 2.416361633000852e-08, 'samples': 28674432, 'steps': 149345, 'loss/train': 1.2835049629211426} 11/07/2021 18:14:40 - INFO - __main__ - Step 149347: {'lr': 2.4089891801609698e-08, 'samples': 28674624, 'steps': 149346, 'loss/train': 1.6310832500457764} 11/07/2021 18:14:40 - INFO - __main__ - Step 149348: {'lr': 2.4016279908528616e-08, 'samples': 28674816, 'steps': 149347, 'loss/train': 1.2894247770309448} 11/07/2021 18:14:41 - INFO - __main__ - Step 149349: {'lr': 2.3942780650765273e-08, 'samples': 28675008, 'steps': 149348, 'loss/train': 1.5162914991378784} 11/07/2021 18:14:41 - INFO - __main__ - Step 149350: {'lr': 2.3869394028347423e-08, 'samples': 28675200, 'steps': 149349, 'loss/train': 0.9192835092544556} 11/07/2021 18:14:42 - INFO - __main__ - Step 149351: {'lr': 2.3796120041302827e-08, 'samples': 28675392, 'steps': 149350, 'loss/train': 0.9047369360923767} 11/07/2021 18:14:43 - INFO - __main__ - Step 149352: {'lr': 2.3722958689686992e-08, 'samples': 28675584, 'steps': 149351, 'loss/train': 1.4718486070632935} 11/07/2021 18:14:43 - INFO - __main__ - Step 149353: {'lr': 2.3649909973555427e-08, 'samples': 28675776, 'steps': 149352, 'loss/train': 1.4088304042816162} 11/07/2021 18:14:43 - INFO - __main__ - Step 149354: {'lr': 2.3576973892880384e-08, 'samples': 28675968, 'steps': 149353, 'loss/train': 1.1587257385253906} 11/07/2021 18:14:44 - INFO - __main__ - Step 149355: {'lr': 2.3504150447717364e-08, 'samples': 28676160, 'steps': 149354, 'loss/train': 1.216868281364441} 11/07/2021 18:14:44 - INFO - __main__ - Step 149356: {'lr': 2.3431439638121887e-08, 'samples': 28676352, 'steps': 149355, 'loss/train': 1.6836755275726318} 11/07/2021 18:14:45 - INFO - __main__ - Step 149357: {'lr': 2.335884146409395e-08, 'samples': 28676544, 'steps': 149356, 'loss/train': 1.2092705965042114} 11/07/2021 18:14:45 - INFO - __main__ - Step 149358: {'lr': 2.3286355925716817e-08, 'samples': 28676736, 'steps': 149357, 'loss/train': 1.0983936786651611} 11/07/2021 18:14:46 - INFO - __main__ - Step 149359: {'lr': 2.3213983022962735e-08, 'samples': 28676928, 'steps': 149358, 'loss/train': 0.6936861872673035} 11/07/2021 18:14:46 - INFO - __main__ - Step 149360: {'lr': 2.3141722755887216e-08, 'samples': 28677120, 'steps': 149359, 'loss/train': 1.4910807609558105} 11/07/2021 18:14:46 - INFO - __main__ - Step 149361: {'lr': 2.3069575124545773e-08, 'samples': 28677312, 'steps': 149360, 'loss/train': 1.2438743114471436} 11/07/2021 18:14:47 - INFO - __main__ - Step 149362: {'lr': 2.29975401289384e-08, 'samples': 28677504, 'steps': 149361, 'loss/train': 0.616104245185852} 11/07/2021 18:14:48 - INFO - __main__ - Step 149363: {'lr': 2.292561776912061e-08, 'samples': 28677696, 'steps': 149362, 'loss/train': 1.0686075687408447} 11/07/2021 18:14:48 - INFO - __main__ - Step 149364: {'lr': 2.2853808045092407e-08, 'samples': 28677888, 'steps': 149363, 'loss/train': 1.3403069972991943} 11/07/2021 18:14:49 - INFO - __main__ - Step 149365: {'lr': 2.278211095693705e-08, 'samples': 28678080, 'steps': 149364, 'loss/train': 0.8335750699043274} 11/07/2021 18:14:49 - INFO - __main__ - Step 149366: {'lr': 2.271052650465455e-08, 'samples': 28678272, 'steps': 149365, 'loss/train': 1.2782496213912964} 11/07/2021 18:14:50 - INFO - __main__ - Step 149367: {'lr': 2.2639054688272654e-08, 'samples': 28678464, 'steps': 149366, 'loss/train': 0.8180974721908569} 11/07/2021 18:14:50 - INFO - __main__ - Step 149368: {'lr': 2.2567695507819118e-08, 'samples': 28678656, 'steps': 149367, 'loss/train': 1.0056005716323853} 11/07/2021 18:14:51 - INFO - __main__ - Step 149369: {'lr': 2.2496448963377213e-08, 'samples': 28678848, 'steps': 149368, 'loss/train': 1.475245475769043} 11/07/2021 18:14:51 - INFO - __main__ - Step 149370: {'lr': 2.2425315054891426e-08, 'samples': 28679040, 'steps': 149369, 'loss/train': 1.436560034751892} 11/07/2021 18:14:51 - INFO - __main__ - Step 149371: {'lr': 2.2354293782472778e-08, 'samples': 28679232, 'steps': 149370, 'loss/train': 1.4405531883239746} 11/07/2021 18:14:52 - INFO - __main__ - Step 149372: {'lr': 2.2283385146121272e-08, 'samples': 28679424, 'steps': 149371, 'loss/train': 1.2804837226867676} 11/07/2021 18:14:53 - INFO - __main__ - Step 149373: {'lr': 2.2212589145892415e-08, 'samples': 28679616, 'steps': 149372, 'loss/train': 0.7442443370819092} 11/07/2021 18:14:53 - INFO - __main__ - Step 149374: {'lr': 2.2141905781758454e-08, 'samples': 28679808, 'steps': 149373, 'loss/train': 1.3251935243606567} 11/07/2021 18:14:54 - INFO - __main__ - Step 149375: {'lr': 2.2071335053802653e-08, 'samples': 28680000, 'steps': 149374, 'loss/train': 2.2345387935638428} 11/07/2021 18:14:54 - INFO - __main__ - Step 149376: {'lr': 2.2000876962052775e-08, 'samples': 28680192, 'steps': 149375, 'loss/train': 1.0284119844436646} 11/07/2021 18:14:54 - INFO - __main__ - Step 149377: {'lr': 2.1930531506536567e-08, 'samples': 28680384, 'steps': 149376, 'loss/train': 0.49243199825286865} 11/07/2021 18:14:56 - INFO - __main__ - Step 149378: {'lr': 2.1860298687281787e-08, 'samples': 28680576, 'steps': 149377, 'loss/train': 1.1497191190719604} 11/07/2021 18:14:56 - INFO - __main__ - Step 149379: {'lr': 2.179017850428844e-08, 'samples': 28680768, 'steps': 149378, 'loss/train': 1.4657210111618042} 11/07/2021 18:14:57 - INFO - __main__ - Step 149380: {'lr': 2.1720170957639783e-08, 'samples': 28680960, 'steps': 149379, 'loss/train': 1.5764524936676025} 11/07/2021 18:14:57 - INFO - __main__ - Step 149381: {'lr': 2.165027604736358e-08, 'samples': 28681152, 'steps': 149380, 'loss/train': 1.0623489618301392} 11/07/2021 18:14:57 - INFO - __main__ - Step 149382: {'lr': 2.158049377345983e-08, 'samples': 28681344, 'steps': 149381, 'loss/train': 0.6117708086967468} 11/07/2021 18:14:58 - INFO - __main__ - Step 149383: {'lr': 2.1510824135956286e-08, 'samples': 28681536, 'steps': 149382, 'loss/train': 0.6348945498466492} 11/07/2021 18:14:59 - INFO - __main__ - Step 149384: {'lr': 2.1441267134936215e-08, 'samples': 28681728, 'steps': 149383, 'loss/train': 1.0551788806915283} 11/07/2021 18:14:59 - INFO - __main__ - Step 149385: {'lr': 2.1371822770371864e-08, 'samples': 28681920, 'steps': 149384, 'loss/train': 1.4446430206298828} 11/07/2021 18:14:59 - INFO - __main__ - Step 149386: {'lr': 2.1302491042318738e-08, 'samples': 28682112, 'steps': 149385, 'loss/train': 1.3862327337265015} 11/07/2021 18:15:00 - INFO - __main__ - Step 149387: {'lr': 2.1233271950832354e-08, 'samples': 28682304, 'steps': 149386, 'loss/train': 1.0061641931533813} 11/07/2021 18:15:00 - INFO - __main__ - Step 149388: {'lr': 2.1164165495912714e-08, 'samples': 28682496, 'steps': 149387, 'loss/train': 1.240248441696167} 11/07/2021 18:15:01 - INFO - __main__ - Step 149389: {'lr': 2.1095171677587566e-08, 'samples': 28682688, 'steps': 149388, 'loss/train': 1.3916548490524292} 11/07/2021 18:15:02 - INFO - __main__ - Step 149390: {'lr': 2.1026290495912425e-08, 'samples': 28682880, 'steps': 149389, 'loss/train': 1.0253703594207764} 11/07/2021 18:15:02 - INFO - __main__ - Step 149391: {'lr': 2.0957521950887294e-08, 'samples': 28683072, 'steps': 149390, 'loss/train': 1.047667145729065} 11/07/2021 18:15:02 - INFO - __main__ - Step 149392: {'lr': 2.0888866042567677e-08, 'samples': 28683264, 'steps': 149391, 'loss/train': 1.2965153455734253} 11/07/2021 18:15:03 - INFO - __main__ - Step 149393: {'lr': 2.0820322770981337e-08, 'samples': 28683456, 'steps': 149392, 'loss/train': 1.1905814409255981} 11/07/2021 18:15:04 - INFO - __main__ - Step 149394: {'lr': 2.0751892136156025e-08, 'samples': 28683648, 'steps': 149393, 'loss/train': 1.4192931652069092} 11/07/2021 18:15:04 - INFO - __main__ - Step 149395: {'lr': 2.0683574138147254e-08, 'samples': 28683840, 'steps': 149394, 'loss/train': 1.1486576795578003} 11/07/2021 18:15:04 - INFO - __main__ - Step 149396: {'lr': 2.0615368776927267e-08, 'samples': 28684032, 'steps': 149395, 'loss/train': 0.8354367017745972} 11/07/2021 18:15:05 - INFO - __main__ - Step 149397: {'lr': 2.0547276052579333e-08, 'samples': 28684224, 'steps': 149396, 'loss/train': 1.8221162557601929} 11/07/2021 18:15:05 - INFO - __main__ - Step 149398: {'lr': 2.047929596510345e-08, 'samples': 28684416, 'steps': 149397, 'loss/train': 1.071940541267395} 11/07/2021 18:15:06 - INFO - __main__ - Step 149399: {'lr': 2.0411428514527375e-08, 'samples': 28684608, 'steps': 149398, 'loss/train': 1.4983816146850586} 11/07/2021 18:15:06 - INFO - __main__ - Step 149400: {'lr': 2.0343673700934374e-08, 'samples': 28684800, 'steps': 149399, 'loss/train': 1.2387815713882446} 11/07/2021 18:15:07 - INFO - __main__ - Step 149401: {'lr': 2.027603152429669e-08, 'samples': 28684992, 'steps': 149400, 'loss/train': 1.1612764596939087} 11/07/2021 18:15:07 - INFO - __main__ - Step 149402: {'lr': 2.0208501984669834e-08, 'samples': 28685184, 'steps': 149401, 'loss/train': 1.1539918184280396} 11/07/2021 18:15:07 - INFO - __main__ - Step 149403: {'lr': 2.014108508205381e-08, 'samples': 28685376, 'steps': 149402, 'loss/train': 0.7788569331169128} 11/07/2021 18:15:08 - INFO - __main__ - Step 149404: {'lr': 2.0073780816531884e-08, 'samples': 28685568, 'steps': 149403, 'loss/train': 0.6607428193092346} 11/07/2021 18:15:09 - INFO - __main__ - Step 149405: {'lr': 2.000658918810405e-08, 'samples': 28685760, 'steps': 149404, 'loss/train': 1.1391328573226929} 11/07/2021 18:15:09 - INFO - __main__ - Step 149406: {'lr': 1.993951019679807e-08, 'samples': 28685952, 'steps': 149405, 'loss/train': 1.0088876485824585} 11/07/2021 18:15:09 - INFO - __main__ - Step 149407: {'lr': 1.9872543842669456e-08, 'samples': 28686144, 'steps': 149406, 'loss/train': 1.6292476654052734} 11/07/2021 18:15:10 - INFO - __main__ - Step 149408: {'lr': 1.98056901257182e-08, 'samples': 28686336, 'steps': 149407, 'loss/train': 1.1072522401809692} 11/07/2021 18:15:10 - INFO - __main__ - Step 149409: {'lr': 1.9738949045972064e-08, 'samples': 28686528, 'steps': 149408, 'loss/train': 1.4982783794403076} 11/07/2021 18:15:11 - INFO - __main__ - Step 149410: {'lr': 1.967232060348656e-08, 'samples': 28686720, 'steps': 149409, 'loss/train': 1.699302077293396} 11/07/2021 18:15:12 - INFO - __main__ - Step 149411: {'lr': 1.9605804798261684e-08, 'samples': 28686912, 'steps': 149410, 'loss/train': 1.169663667678833} 11/07/2021 18:15:12 - INFO - __main__ - Step 149412: {'lr': 1.953940163035295e-08, 'samples': 28687104, 'steps': 149411, 'loss/train': 1.3963414430618286} 11/07/2021 18:15:12 - INFO - __main__ - Step 149413: {'lr': 1.947311109978811e-08, 'samples': 28687296, 'steps': 149412, 'loss/train': 1.2453014850616455} 11/07/2021 18:15:13 - INFO - __main__ - Step 149414: {'lr': 1.9406933206594924e-08, 'samples': 28687488, 'steps': 149413, 'loss/train': 1.0775364637374878} 11/07/2021 18:15:14 - INFO - __main__ - Step 149415: {'lr': 1.9340867950801145e-08, 'samples': 28687680, 'steps': 149414, 'loss/train': 0.522011935710907} 11/07/2021 18:15:15 - INFO - __main__ - Step 149416: {'lr': 1.927491533243453e-08, 'samples': 28687872, 'steps': 149415, 'loss/train': 1.2062002420425415} 11/07/2021 18:15:15 - INFO - __main__ - Step 149417: {'lr': 1.920907535152283e-08, 'samples': 28688064, 'steps': 149416, 'loss/train': 1.0559226274490356} 11/07/2021 18:15:15 - INFO - __main__ - Step 149418: {'lr': 1.9143348008093807e-08, 'samples': 28688256, 'steps': 149417, 'loss/train': 1.3831039667129517} 11/07/2021 18:15:16 - INFO - __main__ - Step 149419: {'lr': 1.9077733302175217e-08, 'samples': 28688448, 'steps': 149418, 'loss/train': 2.1154446601867676} 11/07/2021 18:15:17 - INFO - __main__ - Step 149420: {'lr': 1.9012231233822564e-08, 'samples': 28688640, 'steps': 149419, 'loss/train': 0.3814800977706909} 11/07/2021 18:15:17 - INFO - __main__ - Step 149421: {'lr': 1.8946841803035852e-08, 'samples': 28688832, 'steps': 149420, 'loss/train': 1.422516942024231} 11/07/2021 18:15:17 - INFO - __main__ - Step 149422: {'lr': 1.8881565009842837e-08, 'samples': 28689024, 'steps': 149421, 'loss/train': 1.0511678457260132} 11/07/2021 18:15:18 - INFO - __main__ - Step 149423: {'lr': 1.881640085429903e-08, 'samples': 28689216, 'steps': 149422, 'loss/train': 0.9255850911140442} 11/07/2021 18:15:18 - INFO - __main__ - Step 149424: {'lr': 1.8751349336404435e-08, 'samples': 28689408, 'steps': 149423, 'loss/train': 0.7896489500999451} 11/07/2021 18:15:19 - INFO - __main__ - Step 149425: {'lr': 1.8686410456214554e-08, 'samples': 28689600, 'steps': 149424, 'loss/train': 0.9834445714950562} 11/07/2021 18:15:19 - INFO - __main__ - Step 149426: {'lr': 1.8621584213757148e-08, 'samples': 28689792, 'steps': 149425, 'loss/train': 0.8916162252426147} 11/07/2021 18:15:20 - INFO - __main__ - Step 149427: {'lr': 1.8556870609032217e-08, 'samples': 28689984, 'steps': 149426, 'loss/train': 1.7639259099960327} 11/07/2021 18:15:20 - INFO - __main__ - Step 149428: {'lr': 1.8492269642095272e-08, 'samples': 28690176, 'steps': 149427, 'loss/train': 1.1908878087997437} 11/07/2021 18:15:20 - INFO - __main__ - Step 149429: {'lr': 1.842778131297407e-08, 'samples': 28690368, 'steps': 149428, 'loss/train': 1.1854989528656006} 11/07/2021 18:15:22 - INFO - __main__ - Step 149430: {'lr': 1.8363405621668606e-08, 'samples': 28690560, 'steps': 149429, 'loss/train': 0.34175923466682434} 11/07/2021 18:15:22 - INFO - __main__ - Step 149431: {'lr': 1.8299142568262152e-08, 'samples': 28690752, 'steps': 149430, 'loss/train': 1.4570778608322144} 11/07/2021 18:15:22 - INFO - __main__ - Step 149432: {'lr': 1.8234992152726947e-08, 'samples': 28690944, 'steps': 149431, 'loss/train': 1.2538655996322632} 11/07/2021 18:15:23 - INFO - __main__ - Step 149433: {'lr': 1.817095437511851e-08, 'samples': 28691136, 'steps': 149432, 'loss/train': 1.9547779560089111} 11/07/2021 18:15:23 - INFO - __main__ - Step 149434: {'lr': 1.8107029235492345e-08, 'samples': 28691328, 'steps': 149433, 'loss/train': 0.5797330141067505} 11/07/2021 18:15:23 - INFO - __main__ - Step 149435: {'lr': 1.8043216733820698e-08, 'samples': 28691520, 'steps': 149434, 'loss/train': 1.097873330116272} 11/07/2021 18:15:24 - INFO - __main__ - Step 149436: {'lr': 1.7979516870186842e-08, 'samples': 28691712, 'steps': 149435, 'loss/train': 1.408117651939392} 11/07/2021 18:15:25 - INFO - __main__ - Step 149437: {'lr': 1.7915929644563013e-08, 'samples': 28691904, 'steps': 149436, 'loss/train': 1.5834600925445557} 11/07/2021 18:15:25 - INFO - __main__ - Step 149438: {'lr': 1.7852455057032478e-08, 'samples': 28692096, 'steps': 149437, 'loss/train': 1.5269670486450195} 11/07/2021 18:15:26 - INFO - __main__ - Step 149439: {'lr': 1.7789093107595245e-08, 'samples': 28692288, 'steps': 149438, 'loss/train': 1.4511748552322388} 11/07/2021 18:15:26 - INFO - __main__ - Step 149440: {'lr': 1.7725843796279063e-08, 'samples': 28692480, 'steps': 149439, 'loss/train': 0.9917435050010681} 11/07/2021 18:15:27 - INFO - __main__ - Step 149441: {'lr': 1.7662707123139443e-08, 'samples': 28692672, 'steps': 149440, 'loss/train': 1.3431395292282104} 11/07/2021 18:15:27 - INFO - __main__ - Step 149442: {'lr': 1.759968308814863e-08, 'samples': 28692864, 'steps': 149441, 'loss/train': 0.7997185587882996} 11/07/2021 18:15:28 - INFO - __main__ - Step 149443: {'lr': 1.7536771691389895e-08, 'samples': 28693056, 'steps': 149442, 'loss/train': 1.973283290863037} 11/07/2021 18:15:28 - INFO - __main__ - Step 149444: {'lr': 1.7473972932890992e-08, 'samples': 28693248, 'steps': 149443, 'loss/train': 1.3096083402633667} 11/07/2021 18:15:28 - INFO - __main__ - Step 149445: {'lr': 1.741128681262416e-08, 'samples': 28693440, 'steps': 149444, 'loss/train': 1.005128264427185} 11/07/2021 18:15:29 - INFO - __main__ - Step 149446: {'lr': 1.7348713330672673e-08, 'samples': 28693632, 'steps': 149445, 'loss/train': 1.2646328210830688} 11/07/2021 18:15:30 - INFO - __main__ - Step 149447: {'lr': 1.7286252487036524e-08, 'samples': 28693824, 'steps': 149446, 'loss/train': 1.974453330039978} 11/07/2021 18:15:30 - INFO - __main__ - Step 149448: {'lr': 1.722390428177123e-08, 'samples': 28694016, 'steps': 149447, 'loss/train': 1.6332027912139893} 11/07/2021 18:15:30 - INFO - __main__ - Step 149449: {'lr': 1.7161668714876787e-08, 'samples': 28694208, 'steps': 149448, 'loss/train': 1.4719164371490479} 11/07/2021 18:15:31 - INFO - __main__ - Step 149450: {'lr': 1.7099545786380956e-08, 'samples': 28694400, 'steps': 149449, 'loss/train': 1.3734157085418701} 11/07/2021 18:15:32 - INFO - __main__ - Step 149451: {'lr': 1.7037535496339242e-08, 'samples': 28694592, 'steps': 149450, 'loss/train': 1.0026359558105469} 11/07/2021 18:15:32 - INFO - __main__ - Step 149452: {'lr': 1.6975637844751645e-08, 'samples': 28694784, 'steps': 149451, 'loss/train': 0.9742178916931152} 11/07/2021 18:15:33 - INFO - __main__ - Step 149453: {'lr': 1.6913852831673683e-08, 'samples': 28694976, 'steps': 149452, 'loss/train': 1.5606513023376465} 11/07/2021 18:15:33 - INFO - __main__ - Step 149454: {'lr': 1.685218045710535e-08, 'samples': 28695168, 'steps': 149453, 'loss/train': 1.5073233842849731} 11/07/2021 18:15:33 - INFO - __main__ - Step 149455: {'lr': 1.6790620721074403e-08, 'samples': 28695360, 'steps': 149454, 'loss/train': 1.1330640316009521} 11/07/2021 18:15:34 - INFO - __main__ - Step 149456: {'lr': 1.672917362363635e-08, 'samples': 28695552, 'steps': 149455, 'loss/train': 0.5355495810508728} 11/07/2021 18:15:35 - INFO - __main__ - Step 149457: {'lr': 1.6667839164818956e-08, 'samples': 28695744, 'steps': 149456, 'loss/train': 1.2874423265457153} 11/07/2021 18:15:35 - INFO - __main__ - Step 149458: {'lr': 1.6606617344594455e-08, 'samples': 28695936, 'steps': 149457, 'loss/train': 1.3028169870376587} 11/07/2021 18:15:35 - INFO - __main__ - Step 149459: {'lr': 1.654550816304612e-08, 'samples': 28696128, 'steps': 149458, 'loss/train': 1.2377294301986694} 11/07/2021 18:15:36 - INFO - __main__ - Step 149460: {'lr': 1.6484511620201704e-08, 'samples': 28696320, 'steps': 149459, 'loss/train': 1.2412636280059814} 11/07/2021 18:15:37 - INFO - __main__ - Step 149461: {'lr': 1.6423627716061208e-08, 'samples': 28696512, 'steps': 149460, 'loss/train': 1.2151832580566406} 11/07/2021 18:15:37 - INFO - __main__ - Step 149462: {'lr': 1.6362856450652385e-08, 'samples': 28696704, 'steps': 149461, 'loss/train': 1.2535442113876343} 11/07/2021 18:15:37 - INFO - __main__ - Step 149463: {'lr': 1.630219782403075e-08, 'samples': 28696896, 'steps': 149462, 'loss/train': 1.2952518463134766} 11/07/2021 18:15:38 - INFO - __main__ - Step 149464: {'lr': 1.6241651836168547e-08, 'samples': 28697088, 'steps': 149463, 'loss/train': 1.647325038909912} 11/07/2021 18:15:38 - INFO - __main__ - Step 149465: {'lr': 1.6181218487176798e-08, 'samples': 28697280, 'steps': 149464, 'loss/train': 0.9107719659805298} 11/07/2021 18:15:39 - INFO - __main__ - Step 149466: {'lr': 1.6120897776999988e-08, 'samples': 28697472, 'steps': 149465, 'loss/train': 1.3904743194580078} 11/07/2021 18:15:40 - INFO - __main__ - Step 149467: {'lr': 1.6060689705721386e-08, 'samples': 28697664, 'steps': 149466, 'loss/train': 1.4929002523422241} 11/07/2021 18:15:40 - INFO - __main__ - Step 149468: {'lr': 1.6000594273340995e-08, 'samples': 28697856, 'steps': 149467, 'loss/train': 1.3108062744140625} 11/07/2021 18:15:40 - INFO - __main__ - Step 149469: {'lr': 1.5940611479914325e-08, 'samples': 28698048, 'steps': 149468, 'loss/train': 1.565960168838501} 11/07/2021 18:15:41 - INFO - __main__ - Step 149470: {'lr': 1.5880741325413616e-08, 'samples': 28698240, 'steps': 149469, 'loss/train': 1.0549726486206055} 11/07/2021 18:15:42 - INFO - __main__ - Step 149471: {'lr': 1.5820983809922142e-08, 'samples': 28698432, 'steps': 149470, 'loss/train': 0.8386491537094116} 11/07/2021 18:15:42 - INFO - __main__ - Step 149472: {'lr': 1.5761338933439896e-08, 'samples': 28698624, 'steps': 149471, 'loss/train': 1.0560073852539062} 11/07/2021 18:15:43 - INFO - __main__ - Step 149473: {'lr': 1.5701806695994636e-08, 'samples': 28698816, 'steps': 149472, 'loss/train': 1.0006754398345947} 11/07/2021 18:15:43 - INFO - __main__ - Step 149474: {'lr': 1.5642387097614118e-08, 'samples': 28699008, 'steps': 149473, 'loss/train': 0.9414656758308411} 11/07/2021 18:15:43 - INFO - __main__ - Step 149475: {'lr': 1.55830801383261e-08, 'samples': 28699200, 'steps': 149474, 'loss/train': 1.119301199913025} 11/07/2021 18:15:44 - INFO - __main__ - Step 149476: {'lr': 1.552388581818609e-08, 'samples': 28699392, 'steps': 149475, 'loss/train': 2.019035577774048} 11/07/2021 18:15:45 - INFO - __main__ - Step 149477: {'lr': 1.5464804137166332e-08, 'samples': 28699584, 'steps': 149476, 'loss/train': 1.6784188747406006} 11/07/2021 18:15:45 - INFO - __main__ - Step 149478: {'lr': 1.540583509532234e-08, 'samples': 28699776, 'steps': 149477, 'loss/train': 1.331795334815979} 11/07/2021 18:15:45 - INFO - __main__ - Step 149479: {'lr': 1.5346978692681867e-08, 'samples': 28699968, 'steps': 149478, 'loss/train': 1.1214573383331299} 11/07/2021 18:15:46 - INFO - __main__ - Step 149480: {'lr': 1.528823492927267e-08, 'samples': 28700160, 'steps': 149479, 'loss/train': 0.9517438411712646} 11/07/2021 18:15:46 - INFO - __main__ - Step 149481: {'lr': 1.5229603805122505e-08, 'samples': 28700352, 'steps': 149480, 'loss/train': 1.2870272397994995} 11/07/2021 18:15:47 - INFO - __main__ - Step 149482: {'lr': 1.5171085320231372e-08, 'samples': 28700544, 'steps': 149481, 'loss/train': 1.2911159992218018} 11/07/2021 18:15:48 - INFO - __main__ - Step 149483: {'lr': 1.511267947465478e-08, 'samples': 28700736, 'steps': 149482, 'loss/train': 1.5959208011627197} 11/07/2021 18:15:48 - INFO - __main__ - Step 149484: {'lr': 1.5054386268420484e-08, 'samples': 28700928, 'steps': 149483, 'loss/train': 0.9504448771476746} 11/07/2021 18:15:48 - INFO - __main__ - Step 149485: {'lr': 1.4996205701528486e-08, 'samples': 28701120, 'steps': 149484, 'loss/train': 1.0972017049789429} 11/07/2021 18:15:49 - INFO - __main__ - Step 149486: {'lr': 1.49381377740343e-08, 'samples': 28701312, 'steps': 149485, 'loss/train': 1.229589819908142} 11/07/2021 18:15:50 - INFO - __main__ - Step 149487: {'lr': 1.4880182485965677e-08, 'samples': 28701504, 'steps': 149486, 'loss/train': 1.9950724840164185} 11/07/2021 18:15:50 - INFO - __main__ - Step 149488: {'lr': 1.4822339837322619e-08, 'samples': 28701696, 'steps': 149487, 'loss/train': 1.3627216815948486} 11/07/2021 18:15:50 - INFO - __main__ - Step 149489: {'lr': 1.476460982813288e-08, 'samples': 28701888, 'steps': 149488, 'loss/train': 1.0954017639160156} 11/07/2021 18:15:51 - INFO - __main__ - Step 149490: {'lr': 1.4706992458424218e-08, 'samples': 28702080, 'steps': 149489, 'loss/train': 0.8421230316162109} 11/07/2021 18:15:51 - INFO - __main__ - Step 149491: {'lr': 1.4649487728252141e-08, 'samples': 28702272, 'steps': 149490, 'loss/train': 1.0456537008285522} 11/07/2021 18:15:52 - INFO - __main__ - Step 149492: {'lr': 1.4592095637616654e-08, 'samples': 28702464, 'steps': 149491, 'loss/train': 1.0072755813598633} 11/07/2021 18:15:53 - INFO - __main__ - Step 149493: {'lr': 1.4534816186545507e-08, 'samples': 28702656, 'steps': 149492, 'loss/train': 0.9158637523651123} 11/07/2021 18:15:53 - INFO - __main__ - Step 149494: {'lr': 1.447764937506646e-08, 'samples': 28702848, 'steps': 149493, 'loss/train': 1.3677308559417725} 11/07/2021 18:15:53 - INFO - __main__ - Step 149495: {'lr': 1.4420595203207264e-08, 'samples': 28703040, 'steps': 149494, 'loss/train': 1.3662739992141724} 11/07/2021 18:15:54 - INFO - __main__ - Step 149496: {'lr': 1.436365367099568e-08, 'samples': 28703232, 'steps': 149495, 'loss/train': 1.399070382118225} 11/07/2021 18:15:54 - INFO - __main__ - Step 149497: {'lr': 1.430682477845946e-08, 'samples': 28703424, 'steps': 149496, 'loss/train': 1.3897676467895508} 11/07/2021 18:15:55 - INFO - __main__ - Step 149498: {'lr': 1.425010852562636e-08, 'samples': 28703616, 'steps': 149497, 'loss/train': 1.5613632202148438} 11/07/2021 18:15:55 - INFO - __main__ - Step 149499: {'lr': 1.419350491249638e-08, 'samples': 28703808, 'steps': 149498, 'loss/train': 1.772628903388977} 11/07/2021 18:15:56 - INFO - __main__ - Step 149500: {'lr': 1.4137013939125031e-08, 'samples': 28704000, 'steps': 149499, 'loss/train': 1.4782435894012451} 11/07/2021 18:15:56 - INFO - __main__ - Step 149501: {'lr': 1.4080635605512315e-08, 'samples': 28704192, 'steps': 149500, 'loss/train': 1.5424058437347412} 11/07/2021 18:15:56 - INFO - __main__ - Step 149502: {'lr': 1.4024369911713742e-08, 'samples': 28704384, 'steps': 149501, 'loss/train': 1.1032819747924805} 11/07/2021 18:15:57 - INFO - __main__ - Step 149503: {'lr': 1.396821685772931e-08, 'samples': 28704576, 'steps': 149502, 'loss/train': 1.5067238807678223} 11/07/2021 18:15:58 - INFO - __main__ - Step 149504: {'lr': 1.3912176443586778e-08, 'samples': 28704768, 'steps': 149503, 'loss/train': 1.6233878135681152} 11/07/2021 18:15:58 - INFO - __main__ - Step 149505: {'lr': 1.3856248669313897e-08, 'samples': 28704960, 'steps': 149504, 'loss/train': 1.185035228729248} 11/07/2021 18:15:58 - INFO - __main__ - Step 149506: {'lr': 1.3800433534966184e-08, 'samples': 28705152, 'steps': 149505, 'loss/train': 1.3108773231506348} 11/07/2021 18:15:59 - INFO - __main__ - Step 149507: {'lr': 1.3744731040515879e-08, 'samples': 28705344, 'steps': 149506, 'loss/train': 1.2887166738510132} 11/07/2021 18:16:00 - INFO - __main__ - Step 149508: {'lr': 1.3689141186018495e-08, 'samples': 28705536, 'steps': 149507, 'loss/train': 0.9514561295509338} 11/07/2021 18:16:00 - INFO - __main__ - Step 149509: {'lr': 1.3633663971501787e-08, 'samples': 28705728, 'steps': 149508, 'loss/train': 1.525678277015686} 11/07/2021 18:16:01 - INFO - __main__ - Step 149510: {'lr': 1.357829939699351e-08, 'samples': 28705920, 'steps': 149509, 'loss/train': 0.7609017491340637} 11/07/2021 18:16:01 - INFO - __main__ - Step 149511: {'lr': 1.3523047462493666e-08, 'samples': 28706112, 'steps': 149510, 'loss/train': 1.4277353286743164} 11/07/2021 18:16:01 - INFO - __main__ - Step 149512: {'lr': 1.3467908168057763e-08, 'samples': 28706304, 'steps': 149511, 'loss/train': 1.3994579315185547} 11/07/2021 18:16:02 - INFO - __main__ - Step 149513: {'lr': 1.3412881513685803e-08, 'samples': 28706496, 'steps': 149512, 'loss/train': 1.2438465356826782} 11/07/2021 18:16:03 - INFO - __main__ - Step 149514: {'lr': 1.3357967499405544e-08, 'samples': 28706688, 'steps': 149513, 'loss/train': 1.2731131315231323} 11/07/2021 18:16:03 - INFO - __main__ - Step 149515: {'lr': 1.3303166125244737e-08, 'samples': 28706880, 'steps': 149514, 'loss/train': 1.089555025100708} 11/07/2021 18:16:03 - INFO - __main__ - Step 149516: {'lr': 1.3248477391258895e-08, 'samples': 28707072, 'steps': 149515, 'loss/train': 1.4058796167373657} 11/07/2021 18:16:04 - INFO - __main__ - Step 149517: {'lr': 1.3193901297420262e-08, 'samples': 28707264, 'steps': 149516, 'loss/train': 0.9533635973930359} 11/07/2021 18:16:05 - INFO - __main__ - Step 149518: {'lr': 1.3139437843784352e-08, 'samples': 28707456, 'steps': 149517, 'loss/train': 1.3068647384643555} 11/07/2021 18:16:05 - INFO - __main__ - Step 149519: {'lr': 1.3085087030378917e-08, 'samples': 28707648, 'steps': 149518, 'loss/train': 1.3383584022521973} 11/07/2021 18:16:05 - INFO - __main__ - Step 149520: {'lr': 1.3030848857231714e-08, 'samples': 28707840, 'steps': 149519, 'loss/train': 1.1614617109298706} 11/07/2021 18:16:06 - INFO - __main__ - Step 149521: {'lr': 1.2976723324342743e-08, 'samples': 28708032, 'steps': 149520, 'loss/train': 1.3232786655426025} 11/07/2021 18:16:06 - INFO - __main__ - Step 149522: {'lr': 1.2922710431739759e-08, 'samples': 28708224, 'steps': 149521, 'loss/train': 1.6012667417526245} 11/07/2021 18:16:07 - INFO - __main__ - Step 149523: {'lr': 1.2868810179450519e-08, 'samples': 28708416, 'steps': 149522, 'loss/train': 1.3463163375854492} 11/07/2021 18:16:07 - INFO - __main__ - Step 149524: {'lr': 1.2815022567530532e-08, 'samples': 28708608, 'steps': 149523, 'loss/train': 1.4388784170150757} 11/07/2021 18:16:08 - INFO - __main__ - Step 149525: {'lr': 1.2761347595952044e-08, 'samples': 28708800, 'steps': 149524, 'loss/train': 1.3739995956420898} 11/07/2021 18:16:08 - INFO - __main__ - Step 149526: {'lr': 1.2707785264798321e-08, 'samples': 28708992, 'steps': 149525, 'loss/train': 1.5303318500518799} 11/07/2021 18:16:08 - INFO - __main__ - Step 149527: {'lr': 1.2654335574041608e-08, 'samples': 28709184, 'steps': 149526, 'loss/train': 1.586531162261963} 11/07/2021 18:16:09 - INFO - __main__ - Step 149528: {'lr': 1.2600998523709661e-08, 'samples': 28709376, 'steps': 149527, 'loss/train': 0.6732990145683289} 11/07/2021 18:16:10 - INFO - __main__ - Step 149529: {'lr': 1.254777411385799e-08, 'samples': 28709568, 'steps': 149528, 'loss/train': 1.2159851789474487} 11/07/2021 18:16:10 - INFO - __main__ - Step 149530: {'lr': 1.2494662344486596e-08, 'samples': 28709760, 'steps': 149529, 'loss/train': 0.7269560098648071} 11/07/2021 18:16:11 - INFO - __main__ - Step 149531: {'lr': 1.2441663215650988e-08, 'samples': 28709952, 'steps': 149530, 'loss/train': 0.9992548227310181} 11/07/2021 18:16:11 - INFO - __main__ - Step 149532: {'lr': 1.2388776727323414e-08, 'samples': 28710144, 'steps': 149531, 'loss/train': 1.0698994398117065} 11/07/2021 18:16:13 - INFO - __main__ - Step 149533: {'lr': 1.2336002879587138e-08, 'samples': 28710336, 'steps': 149532, 'loss/train': 1.344040870666504} 11/07/2021 18:16:13 - INFO - __main__ - Step 149534: {'lr': 1.2283341672414405e-08, 'samples': 28710528, 'steps': 149533, 'loss/train': 1.2128442525863647} 11/07/2021 18:16:13 - INFO - __main__ - Step 149535: {'lr': 1.223079310583297e-08, 'samples': 28710720, 'steps': 149534, 'loss/train': 1.3266353607177734} 11/07/2021 18:16:14 - INFO - __main__ - Step 149536: {'lr': 1.2178357179898348e-08, 'samples': 28710912, 'steps': 149535, 'loss/train': 1.1491121053695679} 11/07/2021 18:16:14 - INFO - __main__ - Step 149537: {'lr': 1.212603389463829e-08, 'samples': 28711104, 'steps': 149536, 'loss/train': 1.2861131429672241} 11/07/2021 18:16:15 - INFO - __main__ - Step 149538: {'lr': 1.207382325002504e-08, 'samples': 28711296, 'steps': 149537, 'loss/train': 0.5636351704597473} 11/07/2021 18:16:16 - INFO - __main__ - Step 149539: {'lr': 1.2021725246141868e-08, 'samples': 28711488, 'steps': 149538, 'loss/train': 0.5075281858444214} 11/07/2021 18:16:16 - INFO - __main__ - Step 149540: {'lr': 1.1969739882961018e-08, 'samples': 28711680, 'steps': 149539, 'loss/train': 1.0369837284088135} 11/07/2021 18:16:16 - INFO - __main__ - Step 149541: {'lr': 1.1917867160538e-08, 'samples': 28711872, 'steps': 149540, 'loss/train': 1.2319903373718262} 11/07/2021 18:16:17 - INFO - __main__ - Step 149542: {'lr': 1.1866107078900568e-08, 'samples': 28712064, 'steps': 149541, 'loss/train': 1.3959885835647583} 11/07/2021 18:16:17 - INFO - __main__ - Step 149543: {'lr': 1.1814459638048725e-08, 'samples': 28712256, 'steps': 149542, 'loss/train': 1.3788310289382935} 11/07/2021 18:16:17 - INFO - __main__ - Step 149544: {'lr': 1.1762924838010225e-08, 'samples': 28712448, 'steps': 149543, 'loss/train': 1.304234504699707} 11/07/2021 18:16:18 - INFO - __main__ - Step 149545: {'lr': 1.1711502678812824e-08, 'samples': 28712640, 'steps': 149544, 'loss/train': 1.1999956369400024} 11/07/2021 18:16:19 - INFO - __main__ - Step 149546: {'lr': 1.1660193160484279e-08, 'samples': 28712832, 'steps': 149545, 'loss/train': 1.4382038116455078} 11/07/2021 18:16:19 - INFO - __main__ - Step 149547: {'lr': 1.1608996283052342e-08, 'samples': 28713024, 'steps': 149546, 'loss/train': 1.1959612369537354} 11/07/2021 18:16:19 - INFO - __main__ - Step 149548: {'lr': 1.1557912046517016e-08, 'samples': 28713216, 'steps': 149547, 'loss/train': 1.4081352949142456} 11/07/2021 18:16:20 - INFO - __main__ - Step 149549: {'lr': 1.1506940450906056e-08, 'samples': 28713408, 'steps': 149548, 'loss/train': 1.4781557321548462} 11/07/2021 18:16:21 - INFO - __main__ - Step 149550: {'lr': 1.1456081496274972e-08, 'samples': 28713600, 'steps': 149549, 'loss/train': 1.4223341941833496} 11/07/2021 18:16:21 - INFO - __main__ - Step 149551: {'lr': 1.1405335182623765e-08, 'samples': 28713792, 'steps': 149550, 'loss/train': 1.2179815769195557} 11/07/2021 18:16:22 - INFO - __main__ - Step 149552: {'lr': 1.1354701509980193e-08, 'samples': 28713984, 'steps': 149551, 'loss/train': 0.9747281670570374} 11/07/2021 18:16:22 - INFO - __main__ - Step 149553: {'lr': 1.130418047834425e-08, 'samples': 28714176, 'steps': 149552, 'loss/train': 1.3008416891098022} 11/07/2021 18:16:22 - INFO - __main__ - Step 149554: {'lr': 1.1253772087771453e-08, 'samples': 28714368, 'steps': 149553, 'loss/train': 1.3302803039550781} 11/07/2021 18:16:23 - INFO - __main__ - Step 149555: {'lr': 1.1203476338261798e-08, 'samples': 28714560, 'steps': 149554, 'loss/train': 1.3171701431274414} 11/07/2021 18:16:24 - INFO - __main__ - Step 149556: {'lr': 1.1153293229843042e-08, 'samples': 28714752, 'steps': 149555, 'loss/train': 1.5912730693817139} 11/07/2021 18:16:24 - INFO - __main__ - Step 149557: {'lr': 1.1103222762542941e-08, 'samples': 28714944, 'steps': 149556, 'loss/train': 1.2180944681167603} 11/07/2021 18:16:24 - INFO - __main__ - Step 149558: {'lr': 1.1053264936389252e-08, 'samples': 28715136, 'steps': 149557, 'loss/train': 1.0089982748031616} 11/07/2021 18:16:25 - INFO - __main__ - Step 149559: {'lr': 1.1003419751409727e-08, 'samples': 28715328, 'steps': 149558, 'loss/train': 1.4393681287765503} 11/07/2021 18:16:26 - INFO - __main__ - Step 149560: {'lr': 1.0953687207576613e-08, 'samples': 28715520, 'steps': 149559, 'loss/train': 1.3799487352371216} 11/07/2021 18:16:26 - INFO - __main__ - Step 149561: {'lr': 1.0904067304973175e-08, 'samples': 28715712, 'steps': 149560, 'loss/train': 1.1342215538024902} 11/07/2021 18:16:26 - INFO - __main__ - Step 149562: {'lr': 1.0854560043599415e-08, 'samples': 28715904, 'steps': 149561, 'loss/train': 1.1788820028305054} 11/07/2021 18:16:27 - INFO - __main__ - Step 149563: {'lr': 1.0805165423483088e-08, 'samples': 28716096, 'steps': 149562, 'loss/train': 1.2670412063598633} 11/07/2021 18:16:27 - INFO - __main__ - Step 149564: {'lr': 1.0755883444624193e-08, 'samples': 28716288, 'steps': 149563, 'loss/train': 1.4320118427276611} 11/07/2021 18:16:28 - INFO - __main__ - Step 149565: {'lr': 1.0706714107078242e-08, 'samples': 28716480, 'steps': 149564, 'loss/train': 1.195788860321045} 11/07/2021 18:16:29 - INFO - __main__ - Step 149566: {'lr': 1.0657657410845235e-08, 'samples': 28716672, 'steps': 149565, 'loss/train': 1.0650006532669067} 11/07/2021 18:16:29 - INFO - __main__ - Step 149567: {'lr': 1.0608713355952926e-08, 'samples': 28716864, 'steps': 149566, 'loss/train': 1.7282160520553589} 11/07/2021 18:16:29 - INFO - __main__ - Step 149568: {'lr': 1.0559881942401317e-08, 'samples': 28717056, 'steps': 149567, 'loss/train': 1.588313341140747} 11/07/2021 18:16:30 - INFO - __main__ - Step 149569: {'lr': 1.0511163170273675e-08, 'samples': 28717248, 'steps': 149568, 'loss/train': 0.9648146629333496} 11/07/2021 18:16:30 - INFO - __main__ - Step 149570: {'lr': 1.0462557039514486e-08, 'samples': 28717440, 'steps': 149569, 'loss/train': 1.290989875793457} 11/07/2021 18:16:31 - INFO - __main__ - Step 149571: {'lr': 1.041406355020702e-08, 'samples': 28717632, 'steps': 149570, 'loss/train': 0.05347156897187233} 11/07/2021 18:16:31 - INFO - __main__ - Step 149572: {'lr': 1.0365682702351276e-08, 'samples': 28717824, 'steps': 149571, 'loss/train': 0.8436179757118225} 11/07/2021 18:16:32 - INFO - __main__ - Step 149573: {'lr': 1.0317414495947252e-08, 'samples': 28718016, 'steps': 149572, 'loss/train': 1.380077838897705} 11/07/2021 18:16:32 - INFO - __main__ - Step 149574: {'lr': 1.0269258931050463e-08, 'samples': 28718208, 'steps': 149573, 'loss/train': 1.352677822113037} 11/07/2021 18:16:32 - INFO - __main__ - Step 149575: {'lr': 1.0221216007660905e-08, 'samples': 28718400, 'steps': 149574, 'loss/train': 1.2373552322387695} 11/07/2021 18:16:34 - INFO - __main__ - Step 149576: {'lr': 1.0173285725806336e-08, 'samples': 28718592, 'steps': 149575, 'loss/train': 1.1963571310043335} 11/07/2021 18:16:34 - INFO - __main__ - Step 149577: {'lr': 1.012546808551451e-08, 'samples': 28718784, 'steps': 149576, 'loss/train': 1.2437480688095093} 11/07/2021 18:16:34 - INFO - __main__ - Step 149578: {'lr': 1.0077763086813186e-08, 'samples': 28718976, 'steps': 149577, 'loss/train': 1.5397889614105225} 11/07/2021 18:16:35 - INFO - __main__ - Step 149579: {'lr': 1.003017072970236e-08, 'samples': 28719168, 'steps': 149578, 'loss/train': 1.1656216382980347} 11/07/2021 18:16:35 - INFO - __main__ - Step 149580: {'lr': 9.982691014209788e-09, 'samples': 28719360, 'steps': 149579, 'loss/train': 0.08100341260433197} 11/07/2021 18:16:36 - INFO - __main__ - Step 149581: {'lr': 9.935323940363228e-09, 'samples': 28719552, 'steps': 149580, 'loss/train': 1.2069756984710693} 11/07/2021 18:16:36 - INFO - __main__ - Step 149582: {'lr': 9.888069508190434e-09, 'samples': 28719744, 'steps': 149581, 'loss/train': 1.2130473852157593} 11/07/2021 18:16:37 - INFO - __main__ - Step 149583: {'lr': 9.840927717691405e-09, 'samples': 28719936, 'steps': 149582, 'loss/train': 1.225708246231079} 11/07/2021 18:16:37 - INFO - __main__ - Step 149584: {'lr': 9.793898568921655e-09, 'samples': 28720128, 'steps': 149583, 'loss/train': 0.6572380065917969} 11/07/2021 18:16:37 - INFO - __main__ - Step 149585: {'lr': 9.746982061881182e-09, 'samples': 28720320, 'steps': 149584, 'loss/train': 1.3533339500427246} 11/07/2021 18:16:38 - INFO - __main__ - Step 149586: {'lr': 9.700178196569986e-09, 'samples': 28720512, 'steps': 149585, 'loss/train': 0.8649435639381409} 11/07/2021 18:16:39 - INFO - __main__ - Step 149587: {'lr': 9.653486973043579e-09, 'samples': 28720704, 'steps': 149586, 'loss/train': 1.581446886062622} 11/07/2021 18:16:39 - INFO - __main__ - Step 149588: {'lr': 9.606908391301961e-09, 'samples': 28720896, 'steps': 149587, 'loss/train': 1.17477285861969} 11/07/2021 18:16:39 - INFO - __main__ - Step 149589: {'lr': 9.560442451372885e-09, 'samples': 28721088, 'steps': 149588, 'loss/train': 0.8279416561126709} 11/07/2021 18:16:40 - INFO - __main__ - Step 149590: {'lr': 9.514089153284112e-09, 'samples': 28721280, 'steps': 149589, 'loss/train': 1.4926813840866089} 11/07/2021 18:16:41 - INFO - __main__ - Step 149591: {'lr': 9.467848497035637e-09, 'samples': 28721472, 'steps': 149590, 'loss/train': 0.6362777352333069} 11/07/2021 18:16:41 - INFO - __main__ - Step 149592: {'lr': 9.421720482682972e-09, 'samples': 28721664, 'steps': 149591, 'loss/train': 1.2783138751983643} 11/07/2021 18:16:41 - INFO - __main__ - Step 149593: {'lr': 9.375705110226119e-09, 'samples': 28721856, 'steps': 149592, 'loss/train': 1.2138943672180176} 11/07/2021 18:16:42 - INFO - __main__ - Step 149594: {'lr': 9.329802379692831e-09, 'samples': 28722048, 'steps': 149593, 'loss/train': 1.4981257915496826} 11/07/2021 18:16:42 - INFO - __main__ - Step 149595: {'lr': 9.28401229108311e-09, 'samples': 28722240, 'steps': 149594, 'loss/train': 1.3091192245483398} 11/07/2021 18:16:43 - INFO - __main__ - Step 149596: {'lr': 9.238334844424711e-09, 'samples': 28722432, 'steps': 149595, 'loss/train': 1.0513490438461304} 11/07/2021 18:16:44 - INFO - __main__ - Step 149597: {'lr': 9.192770039773146e-09, 'samples': 28722624, 'steps': 149596, 'loss/train': 1.273945689201355} 11/07/2021 18:16:44 - INFO - __main__ - Step 149598: {'lr': 9.147317877100659e-09, 'samples': 28722816, 'steps': 149597, 'loss/train': 0.8912418484687805} 11/07/2021 18:16:44 - INFO - __main__ - Step 149599: {'lr': 9.101978356462759e-09, 'samples': 28723008, 'steps': 149598, 'loss/train': 1.223225474357605} 11/07/2021 18:16:45 - INFO - __main__ - Step 149600: {'lr': 9.056751477859449e-09, 'samples': 28723200, 'steps': 149599, 'loss/train': 1.472423791885376} 11/07/2021 18:16:45 - INFO - __main__ - Step 149601: {'lr': 9.011637241318482e-09, 'samples': 28723392, 'steps': 149600, 'loss/train': 1.6478341817855835} 11/07/2021 18:16:46 - INFO - __main__ - Step 149602: {'lr': 8.96663564683986e-09, 'samples': 28723584, 'steps': 149601, 'loss/train': 1.2978566884994507} 11/07/2021 18:16:47 - INFO - __main__ - Step 149603: {'lr': 8.921746694479093e-09, 'samples': 28723776, 'steps': 149602, 'loss/train': 1.2303348779678345} 11/07/2021 18:16:47 - INFO - __main__ - Step 149604: {'lr': 8.876970384236182e-09, 'samples': 28723968, 'steps': 149603, 'loss/train': 1.3677043914794922} 11/07/2021 18:16:47 - INFO - __main__ - Step 149605: {'lr': 8.832306716166637e-09, 'samples': 28724160, 'steps': 149604, 'loss/train': 0.8971133828163147} 11/07/2021 18:16:48 - INFO - __main__ - Step 149606: {'lr': 8.787755690214949e-09, 'samples': 28724352, 'steps': 149605, 'loss/train': 0.804112434387207} 11/07/2021 18:16:49 - INFO - __main__ - Step 149607: {'lr': 8.743317306464383e-09, 'samples': 28724544, 'steps': 149606, 'loss/train': 0.8530346751213074} 11/07/2021 18:16:49 - INFO - __main__ - Step 149608: {'lr': 8.698991564914937e-09, 'samples': 28724736, 'steps': 149607, 'loss/train': 0.9470862150192261} 11/07/2021 18:16:49 - INFO - __main__ - Step 149609: {'lr': 8.654778465594371e-09, 'samples': 28724928, 'steps': 149608, 'loss/train': 1.278947353363037} 11/07/2021 18:16:50 - INFO - __main__ - Step 149610: {'lr': 8.610678008530437e-09, 'samples': 28725120, 'steps': 149609, 'loss/train': 0.6303962469100952} 11/07/2021 18:16:50 - INFO - __main__ - Step 149611: {'lr': 8.566690193695382e-09, 'samples': 28725312, 'steps': 149610, 'loss/train': 1.647204041481018} 11/07/2021 18:16:51 - INFO - __main__ - Step 149612: {'lr': 8.52281502117247e-09, 'samples': 28725504, 'steps': 149611, 'loss/train': 1.0595687627792358} 11/07/2021 18:16:51 - INFO - __main__ - Step 149613: {'lr': 8.479052490933948e-09, 'samples': 28725696, 'steps': 149612, 'loss/train': 1.000163197517395} 11/07/2021 18:16:52 - INFO - __main__ - Step 149614: {'lr': 8.43540260300757e-09, 'samples': 28725888, 'steps': 149613, 'loss/train': 0.8345909714698792} 11/07/2021 18:16:52 - INFO - __main__ - Step 149615: {'lr': 8.391865357448847e-09, 'samples': 28726080, 'steps': 149614, 'loss/train': 1.3184478282928467} 11/07/2021 18:16:52 - INFO - __main__ - Step 149616: {'lr': 8.348440754230025e-09, 'samples': 28726272, 'steps': 149615, 'loss/train': 1.0031498670578003} 11/07/2021 18:16:54 - INFO - __main__ - Step 149617: {'lr': 8.305128793406613e-09, 'samples': 28726464, 'steps': 149616, 'loss/train': 1.4376862049102783} 11/07/2021 18:16:54 - INFO - __main__ - Step 149618: {'lr': 8.261929474978614e-09, 'samples': 28726656, 'steps': 149617, 'loss/train': 1.3995612859725952} 11/07/2021 18:16:54 - INFO - __main__ - Step 149619: {'lr': 8.218842798946025e-09, 'samples': 28726848, 'steps': 149618, 'loss/train': 0.6092562079429626} 11/07/2021 18:16:55 - INFO - __main__ - Step 149620: {'lr': 8.175868765392114e-09, 'samples': 28727040, 'steps': 149619, 'loss/train': 1.0879653692245483} 11/07/2021 18:16:55 - INFO - __main__ - Step 149621: {'lr': 8.13300737426137e-09, 'samples': 28727232, 'steps': 149620, 'loss/train': 1.2589353322982788} 11/07/2021 18:16:56 - INFO - __main__ - Step 149622: {'lr': 8.09025862563706e-09, 'samples': 28727424, 'steps': 149621, 'loss/train': 1.5155117511749268} 11/07/2021 18:16:56 - INFO - __main__ - Step 149623: {'lr': 8.047622519491427e-09, 'samples': 28727616, 'steps': 149622, 'loss/train': 1.241424798965454} 11/07/2021 18:16:57 - INFO - __main__ - Step 149624: {'lr': 8.005099055879982e-09, 'samples': 28727808, 'steps': 149623, 'loss/train': 1.739328145980835} 11/07/2021 18:16:57 - INFO - __main__ - Step 149625: {'lr': 7.962688234802729e-09, 'samples': 28728000, 'steps': 149624, 'loss/train': 1.2762154340744019} 11/07/2021 18:16:57 - INFO - __main__ - Step 149626: {'lr': 7.920390056259663e-09, 'samples': 28728192, 'steps': 149625, 'loss/train': 0.9113873839378357} 11/07/2021 18:16:59 - INFO - __main__ - Step 149627: {'lr': 7.878204520306298e-09, 'samples': 28728384, 'steps': 149626, 'loss/train': 1.3405689001083374} 11/07/2021 18:16:59 - INFO - __main__ - Step 149628: {'lr': 7.836131626942633e-09, 'samples': 28728576, 'steps': 149627, 'loss/train': 1.065288782119751} 11/07/2021 18:16:59 - INFO - __main__ - Step 149629: {'lr': 7.794171376168669e-09, 'samples': 28728768, 'steps': 149628, 'loss/train': 1.5076345205307007} 11/07/2021 18:17:00 - INFO - __main__ - Step 149630: {'lr': 7.752323768039914e-09, 'samples': 28728960, 'steps': 149629, 'loss/train': 0.0524417944252491} 11/07/2021 18:17:00 - INFO - __main__ - Step 149631: {'lr': 7.710588802584129e-09, 'samples': 28729152, 'steps': 149630, 'loss/train': 0.927898645401001} 11/07/2021 18:17:01 - INFO - __main__ - Step 149632: {'lr': 7.668966479773553e-09, 'samples': 28729344, 'steps': 149631, 'loss/train': 1.1434675455093384} 11/07/2021 18:17:01 - INFO - __main__ - Step 149633: {'lr': 7.627456799635946e-09, 'samples': 28729536, 'steps': 149632, 'loss/train': 1.2607589960098267} 11/07/2021 18:17:02 - INFO - __main__ - Step 149634: {'lr': 7.586059762226816e-09, 'samples': 28729728, 'steps': 149633, 'loss/train': 1.7539290189743042} 11/07/2021 18:17:02 - INFO - __main__ - Step 149635: {'lr': 7.544775367546165e-09, 'samples': 28729920, 'steps': 149634, 'loss/train': 0.9896730780601501} 11/07/2021 18:17:02 - INFO - __main__ - Step 149636: {'lr': 7.50360361559399e-09, 'samples': 28730112, 'steps': 149635, 'loss/train': 1.478050708770752} 11/07/2021 18:17:04 - INFO - __main__ - Step 149637: {'lr': 7.462544506398051e-09, 'samples': 28730304, 'steps': 149636, 'loss/train': 1.7018589973449707} 11/07/2021 18:17:04 - INFO - __main__ - Step 149638: {'lr': 7.4215980399861e-09, 'samples': 28730496, 'steps': 149637, 'loss/train': 1.5418555736541748} 11/07/2021 18:17:04 - INFO - __main__ - Step 149639: {'lr': 7.380764216385893e-09, 'samples': 28730688, 'steps': 149638, 'loss/train': 1.3855127096176147} 11/07/2021 18:17:05 - INFO - __main__ - Step 149640: {'lr': 7.340043035597433e-09, 'samples': 28730880, 'steps': 149639, 'loss/train': 1.0484819412231445} 11/07/2021 18:17:05 - INFO - __main__ - Step 149641: {'lr': 7.299434497620716e-09, 'samples': 28731072, 'steps': 149640, 'loss/train': 0.7034434676170349} 11/07/2021 18:17:05 - INFO - __main__ - Step 149642: {'lr': 7.258938602511256e-09, 'samples': 28731264, 'steps': 149641, 'loss/train': 1.196815013885498} 11/07/2021 18:17:06 - INFO - __main__ - Step 149643: {'lr': 7.218555350296807e-09, 'samples': 28731456, 'steps': 149642, 'loss/train': 1.5084500312805176} 11/07/2021 18:17:07 - INFO - __main__ - Step 149644: {'lr': 7.178284740949615e-09, 'samples': 28731648, 'steps': 149643, 'loss/train': 1.1650806665420532} 11/07/2021 18:17:07 - INFO - __main__ - Step 149645: {'lr': 7.1381267744974334e-09, 'samples': 28731840, 'steps': 149644, 'loss/train': 1.4232795238494873} 11/07/2021 18:17:07 - INFO - __main__ - Step 149646: {'lr': 7.0980814509957745e-09, 'samples': 28732032, 'steps': 149645, 'loss/train': 1.4907102584838867} 11/07/2021 18:17:08 - INFO - __main__ - Step 149647: {'lr': 7.058148770444639e-09, 'samples': 28732224, 'steps': 149646, 'loss/train': 0.4295653998851776} 11/07/2021 18:17:09 - INFO - __main__ - Step 149648: {'lr': 7.01832873281627e-09, 'samples': 28732416, 'steps': 149647, 'loss/train': 1.5130114555358887} 11/07/2021 18:17:10 - INFO - __main__ - Step 149649: {'lr': 6.978621338193936e-09, 'samples': 28732608, 'steps': 149648, 'loss/train': 1.2809165716171265} 11/07/2021 18:17:10 - INFO - __main__ - Step 149650: {'lr': 6.939026586577635e-09, 'samples': 28732800, 'steps': 149649, 'loss/train': 1.4479130506515503} 11/07/2021 18:17:10 - INFO - __main__ - Step 149651: {'lr': 6.899544477967368e-09, 'samples': 28732992, 'steps': 149650, 'loss/train': 0.033663857728242874} 11/07/2021 18:17:11 - INFO - __main__ - Step 149652: {'lr': 6.86017501239089e-09, 'samples': 28733184, 'steps': 149651, 'loss/train': 0.07221072167158127} 11/07/2021 18:17:12 - INFO - __main__ - Step 149653: {'lr': 6.820918189875958e-09, 'samples': 28733376, 'steps': 149652, 'loss/train': 1.4050413370132446} 11/07/2021 18:17:12 - INFO - __main__ - Step 149654: {'lr': 6.781774010394815e-09, 'samples': 28733568, 'steps': 149653, 'loss/train': 1.525353193283081} 11/07/2021 18:17:12 - INFO - __main__ - Step 149655: {'lr': 6.742742474030728e-09, 'samples': 28733760, 'steps': 149654, 'loss/train': 1.4970887899398804} 11/07/2021 18:17:13 - INFO - __main__ - Step 149656: {'lr': 6.703823580755941e-09, 'samples': 28733952, 'steps': 149655, 'loss/train': 1.2057753801345825} 11/07/2021 18:17:13 - INFO - __main__ - Step 149657: {'lr': 6.665017330625966e-09, 'samples': 28734144, 'steps': 149656, 'loss/train': 1.0507755279541016} 11/07/2021 18:17:14 - INFO - __main__ - Step 149658: {'lr': 6.626323723613048e-09, 'samples': 28734336, 'steps': 149657, 'loss/train': 1.318480134010315} 11/07/2021 18:17:15 - INFO - __main__ - Step 149659: {'lr': 6.587742759772697e-09, 'samples': 28734528, 'steps': 149658, 'loss/train': 1.7186449766159058} 11/07/2021 18:17:15 - INFO - __main__ - Step 149660: {'lr': 6.549274439104913e-09, 'samples': 28734720, 'steps': 149659, 'loss/train': 1.4651103019714355} 11/07/2021 18:17:15 - INFO - __main__ - Step 149661: {'lr': 6.510918761609697e-09, 'samples': 28734912, 'steps': 149660, 'loss/train': 0.7859552502632141} 11/07/2021 18:17:16 - INFO - __main__ - Step 149662: {'lr': 6.472675727342559e-09, 'samples': 28735104, 'steps': 149661, 'loss/train': 1.2085551023483276} 11/07/2021 18:17:17 - INFO - __main__ - Step 149663: {'lr': 6.4345453362757436e-09, 'samples': 28735296, 'steps': 149662, 'loss/train': 1.4512754678726196} 11/07/2021 18:17:17 - INFO - __main__ - Step 149664: {'lr': 6.396527588464762e-09, 'samples': 28735488, 'steps': 149663, 'loss/train': 0.8713024854660034} 11/07/2021 18:17:18 - INFO - __main__ - Step 149665: {'lr': 6.358622483937371e-09, 'samples': 28735680, 'steps': 149664, 'loss/train': 1.869659423828125} 11/07/2021 18:17:18 - INFO - __main__ - Step 149666: {'lr': 6.320830022665813e-09, 'samples': 28735872, 'steps': 149665, 'loss/train': 1.0021696090698242} 11/07/2021 18:17:18 - INFO - __main__ - Step 149667: {'lr': 6.283150204677845e-09, 'samples': 28736064, 'steps': 149666, 'loss/train': 0.7552422285079956} 11/07/2021 18:17:19 - INFO - __main__ - Step 149668: {'lr': 6.2455830300289785e-09, 'samples': 28736256, 'steps': 149667, 'loss/train': 4.229720115661621} 11/07/2021 18:17:20 - INFO - __main__ - Step 149669: {'lr': 6.208128498691456e-09, 'samples': 28736448, 'steps': 149668, 'loss/train': 1.5917692184448242} 11/07/2021 18:17:20 - INFO - __main__ - Step 149670: {'lr': 6.170786610693036e-09, 'samples': 28736640, 'steps': 149669, 'loss/train': 1.3897953033447266} 11/07/2021 18:17:20 - INFO - __main__ - Step 149671: {'lr': 6.133557366061471e-09, 'samples': 28736832, 'steps': 149670, 'loss/train': 0.35693666338920593} 11/07/2021 18:17:21 - INFO - __main__ - Step 149672: {'lr': 6.096440764824518e-09, 'samples': 28737024, 'steps': 149671, 'loss/train': 1.678890585899353} 11/07/2021 18:17:21 - INFO - __main__ - Step 149673: {'lr': 6.059436806954421e-09, 'samples': 28737216, 'steps': 149672, 'loss/train': 0.5342602133750916} 11/07/2021 18:17:22 - INFO - __main__ - Step 149674: {'lr': 6.022545492506693e-09, 'samples': 28737408, 'steps': 149673, 'loss/train': 0.46581873297691345} 11/07/2021 18:17:22 - INFO - __main__ - Step 149675: {'lr': 5.985766821509086e-09, 'samples': 28737600, 'steps': 149674, 'loss/train': 1.3271907567977905} 11/07/2021 18:17:23 - INFO - __main__ - Step 149676: {'lr': 5.9491007939338485e-09, 'samples': 28737792, 'steps': 149675, 'loss/train': 0.9945909380912781} 11/07/2021 18:17:23 - INFO - __main__ - Step 149677: {'lr': 5.912547409808733e-09, 'samples': 28737984, 'steps': 149676, 'loss/train': 1.6837669610977173} 11/07/2021 18:17:24 - INFO - __main__ - Step 149678: {'lr': 5.876106669189252e-09, 'samples': 28738176, 'steps': 149677, 'loss/train': 1.6294516324996948} 11/07/2021 18:17:25 - INFO - __main__ - Step 149679: {'lr': 5.839778572047649e-09, 'samples': 28738368, 'steps': 149678, 'loss/train': 1.3504117727279663} 11/07/2021 18:17:25 - INFO - __main__ - Step 149680: {'lr': 5.8035631184394365e-09, 'samples': 28738560, 'steps': 149679, 'loss/train': 0.9437045454978943} 11/07/2021 18:17:26 - INFO - __main__ - Step 149681: {'lr': 5.767460308336858e-09, 'samples': 28738752, 'steps': 149680, 'loss/train': 1.096010446548462} 11/07/2021 18:17:26 - INFO - __main__ - Step 149682: {'lr': 5.73147014176767e-09, 'samples': 28738944, 'steps': 149681, 'loss/train': 1.7319916486740112} 11/07/2021 18:17:26 - INFO - __main__ - Step 149683: {'lr': 5.695592618787382e-09, 'samples': 28739136, 'steps': 149682, 'loss/train': 0.7543754577636719} 11/07/2021 18:17:27 - INFO - __main__ - Step 149684: {'lr': 5.65982773936824e-09, 'samples': 28739328, 'steps': 149683, 'loss/train': 1.2795615196228027} 11/07/2021 18:17:28 - INFO - __main__ - Step 149685: {'lr': 5.624175503537998e-09, 'samples': 28739520, 'steps': 149684, 'loss/train': 1.2617255449295044} 11/07/2021 18:17:28 - INFO - __main__ - Step 149686: {'lr': 5.588635911324414e-09, 'samples': 28739712, 'steps': 149685, 'loss/train': 0.9866201281547546} 11/07/2021 18:17:28 - INFO - __main__ - Step 149687: {'lr': 5.553208962727485e-09, 'samples': 28739904, 'steps': 149686, 'loss/train': 1.1304094791412354} 11/07/2021 18:17:29 - INFO - __main__ - Step 149688: {'lr': 5.517894657774969e-09, 'samples': 28740096, 'steps': 149687, 'loss/train': 1.5798141956329346} 11/07/2021 18:17:29 - INFO - __main__ - Step 149689: {'lr': 5.482692996466865e-09, 'samples': 28740288, 'steps': 149688, 'loss/train': 1.0971020460128784} 11/07/2021 18:17:31 - INFO - __main__ - Step 149690: {'lr': 5.447603978858684e-09, 'samples': 28740480, 'steps': 149689, 'loss/train': 1.2568718194961548} 11/07/2021 18:17:31 - INFO - __main__ - Step 149691: {'lr': 5.412627604922671e-09, 'samples': 28740672, 'steps': 149690, 'loss/train': 1.4389891624450684} 11/07/2021 18:17:31 - INFO - __main__ - Step 149692: {'lr': 5.377763874686581e-09, 'samples': 28740864, 'steps': 149691, 'loss/train': 0.8946272134780884} 11/07/2021 18:17:32 - INFO - __main__ - Step 149693: {'lr': 5.343012788150415e-09, 'samples': 28741056, 'steps': 149692, 'loss/train': 1.4664844274520874} 11/07/2021 18:17:32 - INFO - __main__ - Step 149694: {'lr': 5.308374345369682e-09, 'samples': 28741248, 'steps': 149693, 'loss/train': 1.6667892932891846} 11/07/2021 18:17:32 - INFO - __main__ - Step 149695: {'lr': 5.273848546344384e-09, 'samples': 28741440, 'steps': 149694, 'loss/train': 0.22614547610282898} 11/07/2021 18:17:33 - INFO - __main__ - Step 149696: {'lr': 5.239435391074521e-09, 'samples': 28741632, 'steps': 149695, 'loss/train': 0.09965775907039642} 11/07/2021 18:17:34 - INFO - __main__ - Step 149697: {'lr': 5.205134879615603e-09, 'samples': 28741824, 'steps': 149696, 'loss/train': 1.4811573028564453} 11/07/2021 18:17:34 - INFO - __main__ - Step 149698: {'lr': 5.17094701191212e-09, 'samples': 28742016, 'steps': 149697, 'loss/train': 1.3521482944488525} 11/07/2021 18:17:34 - INFO - __main__ - Step 149699: {'lr': 5.136871788047337e-09, 'samples': 28742208, 'steps': 149698, 'loss/train': 0.9860050082206726} 11/07/2021 18:17:35 - INFO - __main__ - Step 149700: {'lr': 5.1029092079935e-09, 'samples': 28742400, 'steps': 149699, 'loss/train': 1.370118260383606} 11/07/2021 18:17:37 - INFO - __main__ - Step 149701: {'lr': 5.06905927180612e-09, 'samples': 28742592, 'steps': 149700, 'loss/train': 1.4447212219238281} 11/07/2021 18:17:37 - INFO - __main__ - Step 149702: {'lr': 5.035321979457441e-09, 'samples': 28742784, 'steps': 149701, 'loss/train': 1.1349350214004517} 11/07/2021 18:17:37 - INFO - __main__ - Step 149703: {'lr': 5.001697330975219e-09, 'samples': 28742976, 'steps': 149702, 'loss/train': 2.538864850997925} 11/07/2021 18:17:38 - INFO - __main__ - Step 149704: {'lr': 4.968185326387209e-09, 'samples': 28743168, 'steps': 149703, 'loss/train': 1.1823065280914307} 11/07/2021 18:17:38 - INFO - __main__ - Step 149705: {'lr': 4.934785965721167e-09, 'samples': 28743360, 'steps': 149704, 'loss/train': 0.8721557259559631} 11/07/2021 18:17:38 - INFO - __main__ - Step 149706: {'lr': 4.901499248949337e-09, 'samples': 28743552, 'steps': 149705, 'loss/train': 1.043634295463562} 11/07/2021 18:17:39 - INFO - __main__ - Step 149707: {'lr': 4.8683251761272304e-09, 'samples': 28743744, 'steps': 149706, 'loss/train': 1.0019924640655518} 11/07/2021 18:17:40 - INFO - __main__ - Step 149708: {'lr': 4.835263747254848e-09, 'samples': 28743936, 'steps': 149707, 'loss/train': 1.3575055599212646} 11/07/2021 18:17:40 - INFO - __main__ - Step 149709: {'lr': 4.8023149623321885e-09, 'samples': 28744128, 'steps': 149708, 'loss/train': 2.0927791595458984} 11/07/2021 18:17:41 - INFO - __main__ - Step 149710: {'lr': 4.769478821414763e-09, 'samples': 28744320, 'steps': 149709, 'loss/train': 1.3322035074234009} 11/07/2021 18:17:41 - INFO - __main__ - Step 149711: {'lr': 4.736755324447062e-09, 'samples': 28744512, 'steps': 149710, 'loss/train': 0.6235494613647461} 11/07/2021 18:17:41 - INFO - __main__ - Step 149712: {'lr': 4.7041444715123506e-09, 'samples': 28744704, 'steps': 149711, 'loss/train': 1.1399173736572266} 11/07/2021 18:17:42 - INFO - __main__ - Step 149713: {'lr': 4.671646262610629e-09, 'samples': 28744896, 'steps': 149712, 'loss/train': 1.4903628826141357} 11/07/2021 18:17:43 - INFO - __main__ - Step 149714: {'lr': 4.6392606977418984e-09, 'samples': 28745088, 'steps': 149713, 'loss/train': 1.0629602670669556} 11/07/2021 18:17:43 - INFO - __main__ - Step 149715: {'lr': 4.606987776906158e-09, 'samples': 28745280, 'steps': 149714, 'loss/train': 1.4195711612701416} 11/07/2021 18:17:43 - INFO - __main__ - Step 149716: {'lr': 4.574827500158918e-09, 'samples': 28745472, 'steps': 149715, 'loss/train': 1.2281864881515503} 11/07/2021 18:17:44 - INFO - __main__ - Step 149717: {'lr': 4.542779867472424e-09, 'samples': 28745664, 'steps': 149716, 'loss/train': 1.2481846809387207} 11/07/2021 18:17:45 - INFO - __main__ - Step 149718: {'lr': 4.510844878902187e-09, 'samples': 28745856, 'steps': 149717, 'loss/train': 1.5640488862991333} 11/07/2021 18:17:45 - INFO - __main__ - Step 149719: {'lr': 4.4790225344204515e-09, 'samples': 28746048, 'steps': 149718, 'loss/train': 1.28342604637146} 11/07/2021 18:17:45 - INFO - __main__ - Step 149720: {'lr': 4.447312834082729e-09, 'samples': 28746240, 'steps': 149719, 'loss/train': 1.48613703250885} 11/07/2021 18:17:46 - INFO - __main__ - Step 149721: {'lr': 4.415715777861262e-09, 'samples': 28746432, 'steps': 149720, 'loss/train': 1.2117501497268677} 11/07/2021 18:17:46 - INFO - __main__ - Step 149722: {'lr': 4.384231365811564e-09, 'samples': 28746624, 'steps': 149721, 'loss/train': 1.1120613813400269} 11/07/2021 18:17:47 - INFO - __main__ - Step 149723: {'lr': 4.352859597933634e-09, 'samples': 28746816, 'steps': 149722, 'loss/train': 1.292288064956665} 11/07/2021 18:17:48 - INFO - __main__ - Step 149724: {'lr': 4.321600474227471e-09, 'samples': 28747008, 'steps': 149723, 'loss/train': 1.3508330583572388} 11/07/2021 18:17:48 - INFO - __main__ - Step 149725: {'lr': 4.290453994720833e-09, 'samples': 28747200, 'steps': 149724, 'loss/train': 0.8333852291107178} 11/07/2021 18:17:48 - INFO - __main__ - Step 149726: {'lr': 4.2594201594137184e-09, 'samples': 28747392, 'steps': 149725, 'loss/train': 1.3866337537765503} 11/07/2021 18:17:49 - INFO - __main__ - Step 149727: {'lr': 4.228498968333883e-09, 'samples': 28747584, 'steps': 149726, 'loss/train': 0.8147683143615723} 11/07/2021 18:17:49 - INFO - __main__ - Step 149728: {'lr': 4.197690421481326e-09, 'samples': 28747776, 'steps': 149727, 'loss/train': 1.9766745567321777} 11/07/2021 18:17:50 - INFO - __main__ - Step 149729: {'lr': 4.166994518883805e-09, 'samples': 28747968, 'steps': 149728, 'loss/train': 1.283099889755249} 11/07/2021 18:17:50 - INFO - __main__ - Step 149730: {'lr': 4.136411260569073e-09, 'samples': 28748160, 'steps': 149729, 'loss/train': 1.1607484817504883} 11/07/2021 18:17:51 - INFO - __main__ - Step 149731: {'lr': 4.105940646509376e-09, 'samples': 28748352, 'steps': 149730, 'loss/train': 1.0143285989761353} 11/07/2021 18:17:51 - INFO - __main__ - Step 149732: {'lr': 4.0755826767602256e-09, 'samples': 28748544, 'steps': 149731, 'loss/train': 1.3939422369003296} 11/07/2021 18:17:52 - INFO - __main__ - Step 149733: {'lr': 4.045337351293865e-09, 'samples': 28748736, 'steps': 149732, 'loss/train': 1.0039983987808228} 11/07/2021 18:17:53 - INFO - __main__ - Step 149734: {'lr': 4.015204670165806e-09, 'samples': 28748928, 'steps': 149733, 'loss/train': 1.232069492340088} 11/07/2021 18:17:53 - INFO - __main__ - Step 149735: {'lr': 3.9851846333760486e-09, 'samples': 28749120, 'steps': 149734, 'loss/train': 1.3546732664108276} 11/07/2021 18:17:53 - INFO - __main__ - Step 149736: {'lr': 3.955277240924593e-09, 'samples': 28749312, 'steps': 149735, 'loss/train': 0.28573691844940186} 11/07/2021 18:17:54 - INFO - __main__ - Step 149737: {'lr': 3.9254824928114385e-09, 'samples': 28749504, 'steps': 149736, 'loss/train': 1.3015015125274658} 11/07/2021 18:17:54 - INFO - __main__ - Step 149738: {'lr': 3.8958003890920965e-09, 'samples': 28749696, 'steps': 149737, 'loss/train': 1.1907141208648682} 11/07/2021 18:17:55 - INFO - __main__ - Step 149739: {'lr': 3.866230929766568e-09, 'samples': 28749888, 'steps': 149738, 'loss/train': 1.299757480621338} 11/07/2021 18:17:55 - INFO - __main__ - Step 149740: {'lr': 3.836774114834851e-09, 'samples': 28750080, 'steps': 149739, 'loss/train': 0.887730062007904} 11/07/2021 18:17:56 - INFO - __main__ - Step 149741: {'lr': 3.8074299443247026e-09, 'samples': 28750272, 'steps': 149740, 'loss/train': 1.3010749816894531} 11/07/2021 18:17:56 - INFO - __main__ - Step 149742: {'lr': 3.7781984182361226e-09, 'samples': 28750464, 'steps': 149741, 'loss/train': 1.2102910280227661} 11/07/2021 18:17:57 - INFO - __main__ - Step 149743: {'lr': 3.749079536569111e-09, 'samples': 28750656, 'steps': 149742, 'loss/train': 1.200911521911621} 11/07/2021 18:17:57 - INFO - __main__ - Step 149744: {'lr': 3.720073299379179e-09, 'samples': 28750848, 'steps': 149743, 'loss/train': 0.42761966586112976} 11/07/2021 18:17:58 - INFO - __main__ - Step 149745: {'lr': 3.69117970663857e-09, 'samples': 28751040, 'steps': 149744, 'loss/train': 1.423858404159546} 11/07/2021 18:17:58 - INFO - __main__ - Step 149746: {'lr': 3.662398758402796e-09, 'samples': 28751232, 'steps': 149745, 'loss/train': 1.29458749294281} 11/07/2021 18:17:59 - INFO - __main__ - Step 149747: {'lr': 3.633730454644102e-09, 'samples': 28751424, 'steps': 149746, 'loss/train': 0.9988315105438232} 11/07/2021 18:17:59 - INFO - __main__ - Step 149748: {'lr': 3.605174795362487e-09, 'samples': 28751616, 'steps': 149747, 'loss/train': 1.3640886545181274} 11/07/2021 18:17:59 - INFO - __main__ - Step 149749: {'lr': 3.576731780641218e-09, 'samples': 28751808, 'steps': 149748, 'loss/train': 1.3854159116744995} 11/07/2021 18:18:00 - INFO - __main__ - Step 149750: {'lr': 3.5484014104247843e-09, 'samples': 28752000, 'steps': 149749, 'loss/train': 1.3430687189102173} 11/07/2021 18:18:01 - INFO - __main__ - Step 149751: {'lr': 3.5201836847686962e-09, 'samples': 28752192, 'steps': 149750, 'loss/train': 1.7439637184143066} 11/07/2021 18:18:01 - INFO - __main__ - Step 149752: {'lr': 3.492078603645199e-09, 'samples': 28752384, 'steps': 149751, 'loss/train': 1.260573148727417} 11/07/2021 18:18:01 - INFO - __main__ - Step 149753: {'lr': 3.4640861671098035e-09, 'samples': 28752576, 'steps': 149752, 'loss/train': 1.5855780839920044} 11/07/2021 18:18:02 - INFO - __main__ - Step 149754: {'lr': 3.4362063751625093e-09, 'samples': 28752768, 'steps': 149753, 'loss/train': 1.2497882843017578} 11/07/2021 18:18:03 - INFO - __main__ - Step 149755: {'lr': 3.408439227803317e-09, 'samples': 28752960, 'steps': 149754, 'loss/train': 1.1860493421554565} 11/07/2021 18:18:04 - INFO - __main__ - Step 149756: {'lr': 3.3807847250322264e-09, 'samples': 28753152, 'steps': 149755, 'loss/train': 1.825169563293457} 11/07/2021 18:18:04 - INFO - __main__ - Step 149757: {'lr': 3.353242866904749e-09, 'samples': 28753344, 'steps': 149756, 'loss/train': 0.8128453493118286} 11/07/2021 18:18:04 - INFO - __main__ - Step 149758: {'lr': 3.3258136533931284e-09, 'samples': 28753536, 'steps': 149757, 'loss/train': 1.6739147901535034} 11/07/2021 18:18:05 - INFO - __main__ - Step 149759: {'lr': 3.2984970845251204e-09, 'samples': 28753728, 'steps': 149758, 'loss/train': 0.6347839832305908} 11/07/2021 18:18:05 - INFO - __main__ - Step 149760: {'lr': 3.271293160328481e-09, 'samples': 28753920, 'steps': 149759, 'loss/train': 1.1412756443023682} 11/07/2021 18:18:06 - INFO - __main__ - Step 149761: {'lr': 3.2442018807754547e-09, 'samples': 28754112, 'steps': 149760, 'loss/train': 1.4368616342544556} 11/07/2021 18:18:06 - INFO - __main__ - Step 149762: {'lr': 3.217223245921552e-09, 'samples': 28754304, 'steps': 149761, 'loss/train': 1.6387925148010254} 11/07/2021 18:18:07 - INFO - __main__ - Step 149763: {'lr': 3.190357255739018e-09, 'samples': 28754496, 'steps': 149762, 'loss/train': 3.6937129497528076} 11/07/2021 18:18:07 - INFO - __main__ - Step 149764: {'lr': 3.1636039102833635e-09, 'samples': 28754688, 'steps': 149763, 'loss/train': 1.0073249340057373} 11/07/2021 18:18:07 - INFO - __main__ - Step 149765: {'lr': 3.1369632095545884e-09, 'samples': 28754880, 'steps': 149764, 'loss/train': 1.3824491500854492} 11/07/2021 18:18:08 - INFO - __main__ - Step 149766: {'lr': 3.1104351535249375e-09, 'samples': 28755072, 'steps': 149765, 'loss/train': 1.3158482313156128} 11/07/2021 18:18:09 - INFO - __main__ - Step 149767: {'lr': 3.0840197422499217e-09, 'samples': 28755264, 'steps': 149766, 'loss/train': 1.0514910221099854} 11/07/2021 18:18:09 - INFO - __main__ - Step 149768: {'lr': 3.057716975729541e-09, 'samples': 28755456, 'steps': 149767, 'loss/train': 1.2120946645736694} 11/07/2021 18:18:09 - INFO - __main__ - Step 149769: {'lr': 3.0315268539637953e-09, 'samples': 28755648, 'steps': 149768, 'loss/train': 1.6297430992126465} 11/07/2021 18:18:10 - INFO - __main__ - Step 149770: {'lr': 3.0054493769804404e-09, 'samples': 28755840, 'steps': 149769, 'loss/train': 0.8822697401046753} 11/07/2021 18:18:11 - INFO - __main__ - Step 149771: {'lr': 2.979484544779476e-09, 'samples': 28756032, 'steps': 149770, 'loss/train': 1.032267451286316} 11/07/2021 18:18:11 - INFO - __main__ - Step 149772: {'lr': 2.9536323573886583e-09, 'samples': 28756224, 'steps': 149771, 'loss/train': 0.8047825694084167} 11/07/2021 18:18:12 - INFO - __main__ - Step 149773: {'lr': 2.9278928148079863e-09, 'samples': 28756416, 'steps': 149772, 'loss/train': 1.1838338375091553} 11/07/2021 18:18:12 - INFO - __main__ - Step 149774: {'lr': 2.9022659170652167e-09, 'samples': 28756608, 'steps': 149773, 'loss/train': 1.4155726432800293} 11/07/2021 18:18:12 - INFO - __main__ - Step 149775: {'lr': 2.876751664132593e-09, 'samples': 28756800, 'steps': 149774, 'loss/train': 1.0724422931671143} 11/07/2021 18:18:13 - INFO - __main__ - Step 149776: {'lr': 2.8513500560378714e-09, 'samples': 28756992, 'steps': 149775, 'loss/train': 1.3036813735961914} 11/07/2021 18:18:14 - INFO - __main__ - Step 149777: {'lr': 2.8260610928365625e-09, 'samples': 28757184, 'steps': 149776, 'loss/train': 1.6269872188568115} 11/07/2021 18:18:14 - INFO - __main__ - Step 149778: {'lr': 2.8008847744731557e-09, 'samples': 28757376, 'steps': 149777, 'loss/train': 1.2035993337631226} 11/07/2021 18:18:14 - INFO - __main__ - Step 149779: {'lr': 2.775821101003162e-09, 'samples': 28757568, 'steps': 149778, 'loss/train': 1.2467998266220093} 11/07/2021 18:18:15 - INFO - __main__ - Step 149780: {'lr': 2.750870072426581e-09, 'samples': 28757760, 'steps': 149779, 'loss/train': 1.8716505765914917} 11/07/2021 18:18:16 - INFO - __main__ - Step 149781: {'lr': 2.7260316887434132e-09, 'samples': 28757952, 'steps': 149780, 'loss/train': 1.4696166515350342} 11/07/2021 18:18:16 - INFO - __main__ - Step 149782: {'lr': 2.701305949981414e-09, 'samples': 28758144, 'steps': 149781, 'loss/train': 1.3061957359313965} 11/07/2021 18:18:16 - INFO - __main__ - Step 149783: {'lr': 2.6766928561405834e-09, 'samples': 28758336, 'steps': 149782, 'loss/train': 1.50459623336792} 11/07/2021 18:18:17 - INFO - __main__ - Step 149784: {'lr': 2.652192407248677e-09, 'samples': 28758528, 'steps': 149783, 'loss/train': 1.2514196634292603} 11/07/2021 18:18:17 - INFO - __main__ - Step 149785: {'lr': 2.627804603277939e-09, 'samples': 28758720, 'steps': 149784, 'loss/train': 1.1083861589431763} 11/07/2021 18:18:18 - INFO - __main__ - Step 149786: {'lr': 2.6035294442838807e-09, 'samples': 28758912, 'steps': 149785, 'loss/train': 0.8828412890434265} 11/07/2021 18:18:18 - INFO - __main__ - Step 149787: {'lr': 2.5793669302665024e-09, 'samples': 28759104, 'steps': 149786, 'loss/train': 1.728058099746704} 11/07/2021 18:18:19 - INFO - __main__ - Step 149788: {'lr': 2.5553170611980482e-09, 'samples': 28759296, 'steps': 149787, 'loss/train': 1.3672858476638794} 11/07/2021 18:18:19 - INFO - __main__ - Step 149789: {'lr': 2.531379837134029e-09, 'samples': 28759488, 'steps': 149788, 'loss/train': 1.292082667350769} 11/07/2021 18:18:20 - INFO - __main__ - Step 149790: {'lr': 2.507555258102201e-09, 'samples': 28759680, 'steps': 149789, 'loss/train': 1.4516167640686035} 11/07/2021 18:18:21 - INFO - __main__ - Step 149791: {'lr': 2.4838433240470526e-09, 'samples': 28759872, 'steps': 149790, 'loss/train': 1.4094643592834473} 11/07/2021 18:18:21 - INFO - __main__ - Step 149792: {'lr': 2.460244035024095e-09, 'samples': 28760064, 'steps': 149791, 'loss/train': 1.8208872079849243} 11/07/2021 18:18:21 - INFO - __main__ - Step 149793: {'lr': 2.4367573910333284e-09, 'samples': 28760256, 'steps': 149792, 'loss/train': 1.4069204330444336} 11/07/2021 18:18:22 - INFO - __main__ - Step 149794: {'lr': 2.4133833921025084e-09, 'samples': 28760448, 'steps': 149793, 'loss/train': 1.5574758052825928} 11/07/2021 18:18:22 - INFO - __main__ - Step 149795: {'lr': 2.390122038203879e-09, 'samples': 28760640, 'steps': 149794, 'loss/train': 0.9270344376564026} 11/07/2021 18:18:22 - INFO - __main__ - Step 149796: {'lr': 2.366973329392952e-09, 'samples': 28760832, 'steps': 149795, 'loss/train': 1.4447619915008545} 11/07/2021 18:18:23 - INFO - __main__ - Step 149797: {'lr': 2.343937265641971e-09, 'samples': 28761024, 'steps': 149796, 'loss/train': 1.162721872329712} 11/07/2021 18:18:24 - INFO - __main__ - Step 149798: {'lr': 2.3210138469786922e-09, 'samples': 28761216, 'steps': 149797, 'loss/train': 1.1240590810775757} 11/07/2021 18:18:24 - INFO - __main__ - Step 149799: {'lr': 2.2982030734031158e-09, 'samples': 28761408, 'steps': 149798, 'loss/train': 1.4518661499023438} 11/07/2021 18:18:24 - INFO - __main__ - Step 149800: {'lr': 2.275504944970752e-09, 'samples': 28761600, 'steps': 149799, 'loss/train': 1.01889169216156} 11/07/2021 18:18:25 - INFO - __main__ - Step 149801: {'lr': 2.2529194616260905e-09, 'samples': 28761792, 'steps': 149800, 'loss/train': 1.3678960800170898} 11/07/2021 18:18:26 - INFO - __main__ - Step 149802: {'lr': 2.230446623396887e-09, 'samples': 28761984, 'steps': 149801, 'loss/train': 1.3329813480377197} 11/07/2021 18:18:26 - INFO - __main__ - Step 149803: {'lr': 2.2080864303386515e-09, 'samples': 28762176, 'steps': 149802, 'loss/train': 1.5713214874267578} 11/07/2021 18:18:26 - INFO - __main__ - Step 149804: {'lr': 2.185838882395874e-09, 'samples': 28762368, 'steps': 149803, 'loss/train': 1.894429087638855} 11/07/2021 18:18:27 - INFO - __main__ - Step 149805: {'lr': 2.1637039796240653e-09, 'samples': 28762560, 'steps': 149804, 'loss/train': 0.7670736908912659} 11/07/2021 18:18:27 - INFO - __main__ - Step 149806: {'lr': 2.1416817220232253e-09, 'samples': 28762752, 'steps': 149805, 'loss/train': 1.5983365774154663} 11/07/2021 18:18:28 - INFO - __main__ - Step 149807: {'lr': 2.1197721095933544e-09, 'samples': 28762944, 'steps': 149806, 'loss/train': 1.6265714168548584} 11/07/2021 18:18:29 - INFO - __main__ - Step 149808: {'lr': 2.0979751423622075e-09, 'samples': 28763136, 'steps': 149807, 'loss/train': 1.3274095058441162} 11/07/2021 18:18:29 - INFO - __main__ - Step 149809: {'lr': 2.076290820329785e-09, 'samples': 28763328, 'steps': 149808, 'loss/train': 1.6564136743545532} 11/07/2021 18:18:29 - INFO - __main__ - Step 149810: {'lr': 2.054719143468331e-09, 'samples': 28763520, 'steps': 149809, 'loss/train': 1.312738060951233} 11/07/2021 18:18:30 - INFO - __main__ - Step 149811: {'lr': 2.033260111861113e-09, 'samples': 28763712, 'steps': 149810, 'loss/train': 1.7272670269012451} 11/07/2021 18:18:30 - INFO - __main__ - Step 149812: {'lr': 2.0119137254803743e-09, 'samples': 28763904, 'steps': 149811, 'loss/train': 1.5267047882080078} 11/07/2021 18:18:31 - INFO - __main__ - Step 149813: {'lr': 1.9906799842983604e-09, 'samples': 28764096, 'steps': 149812, 'loss/train': 1.1640733480453491} 11/07/2021 18:18:31 - INFO - __main__ - Step 149814: {'lr': 1.9695588883983374e-09, 'samples': 28764288, 'steps': 149813, 'loss/train': 1.146134376525879} 11/07/2021 18:18:32 - INFO - __main__ - Step 149815: {'lr': 1.9485504377525496e-09, 'samples': 28764480, 'steps': 149814, 'loss/train': 1.3027888536453247} 11/07/2021 18:18:32 - INFO - __main__ - Step 149816: {'lr': 1.927654632360998e-09, 'samples': 28764672, 'steps': 149815, 'loss/train': 1.662630319595337} 11/07/2021 18:18:33 - INFO - __main__ - Step 149817: {'lr': 1.9068714722236813e-09, 'samples': 28764864, 'steps': 149816, 'loss/train': 1.1720130443572998} 11/07/2021 18:18:34 - INFO - __main__ - Step 149818: {'lr': 1.8862009573961112e-09, 'samples': 28765056, 'steps': 149817, 'loss/train': 1.0914853811264038} 11/07/2021 18:18:34 - INFO - __main__ - Step 149819: {'lr': 1.865643087850533e-09, 'samples': 28765248, 'steps': 149818, 'loss/train': 1.8596140146255493} 11/07/2021 18:18:34 - INFO - __main__ - Step 149820: {'lr': 1.8451978636147005e-09, 'samples': 28765440, 'steps': 149819, 'loss/train': 0.808184802532196} 11/07/2021 18:18:35 - INFO - __main__ - Step 149821: {'lr': 1.8248652846608594e-09, 'samples': 28765632, 'steps': 149820, 'loss/train': 0.04778521880507469} 11/07/2021 18:18:35 - INFO - __main__ - Step 149822: {'lr': 1.804645351072276e-09, 'samples': 28765824, 'steps': 149821, 'loss/train': 1.0288599729537964} 11/07/2021 18:18:36 - INFO - __main__ - Step 149823: {'lr': 1.7845380627934393e-09, 'samples': 28766016, 'steps': 149822, 'loss/train': 0.5045995712280273} 11/07/2021 18:18:36 - INFO - __main__ - Step 149824: {'lr': 1.7645434198243494e-09, 'samples': 28766208, 'steps': 149823, 'loss/train': 1.4029489755630493} 11/07/2021 18:18:37 - INFO - __main__ - Step 149825: {'lr': 1.7446614222482725e-09, 'samples': 28766400, 'steps': 149824, 'loss/train': 0.842025101184845} 11/07/2021 18:18:37 - INFO - __main__ - Step 149826: {'lr': 1.7248920700096982e-09, 'samples': 28766592, 'steps': 149825, 'loss/train': 1.1815978288650513} 11/07/2021 18:18:37 - INFO - __main__ - Step 149827: {'lr': 1.705235363108626e-09, 'samples': 28766784, 'steps': 149826, 'loss/train': 1.0432758331298828} 11/07/2021 18:18:39 - INFO - __main__ - Step 149828: {'lr': 1.6856913016283226e-09, 'samples': 28766976, 'steps': 149827, 'loss/train': 0.9582653045654297} 11/07/2021 18:18:39 - INFO - __main__ - Step 149829: {'lr': 1.6662598855132771e-09, 'samples': 28767168, 'steps': 149828, 'loss/train': 1.4618878364562988} 11/07/2021 18:18:39 - INFO - __main__ - Step 149830: {'lr': 1.6469411147634894e-09, 'samples': 28767360, 'steps': 149829, 'loss/train': 2.1695175170898438} 11/07/2021 18:18:40 - INFO - __main__ - Step 149831: {'lr': 1.6277349894344707e-09, 'samples': 28767552, 'steps': 149830, 'loss/train': 2.1862733364105225} 11/07/2021 18:18:40 - INFO - __main__ - Step 149832: {'lr': 1.6086415095262208e-09, 'samples': 28767744, 'steps': 149831, 'loss/train': 1.2695311307907104} 11/07/2021 18:18:41 - INFO - __main__ - Step 149833: {'lr': 1.5896606750109844e-09, 'samples': 28767936, 'steps': 149832, 'loss/train': 1.2898818254470825} 11/07/2021 18:18:42 - INFO - __main__ - Step 149834: {'lr': 1.5707924859442723e-09, 'samples': 28768128, 'steps': 149833, 'loss/train': 0.6001967191696167} 11/07/2021 18:18:42 - INFO - __main__ - Step 149835: {'lr': 1.5520369422983294e-09, 'samples': 28768320, 'steps': 149834, 'loss/train': 1.194414734840393} 11/07/2021 18:18:42 - INFO - __main__ - Step 149836: {'lr': 1.5333940441009109e-09, 'samples': 28768512, 'steps': 149835, 'loss/train': 1.3147026300430298} 11/07/2021 18:18:43 - INFO - __main__ - Step 149837: {'lr': 1.514863791352017e-09, 'samples': 28768704, 'steps': 149836, 'loss/train': 0.6851193904876709} 11/07/2021 18:18:44 - INFO - __main__ - Step 149838: {'lr': 1.496446184079403e-09, 'samples': 28768896, 'steps': 149837, 'loss/train': 1.3300666809082031} 11/07/2021 18:18:44 - INFO - __main__ - Step 149839: {'lr': 1.4781412222553136e-09, 'samples': 28769088, 'steps': 149838, 'loss/train': 1.4478543996810913} 11/07/2021 18:18:44 - INFO - __main__ - Step 149840: {'lr': 1.4599489059075045e-09, 'samples': 28769280, 'steps': 149839, 'loss/train': 1.166578769683838} 11/07/2021 18:18:45 - INFO - __main__ - Step 149841: {'lr': 1.4418692350637309e-09, 'samples': 28769472, 'steps': 149840, 'loss/train': 1.0202692747116089} 11/07/2021 18:18:45 - INFO - __main__ - Step 149842: {'lr': 1.4239022096962373e-09, 'samples': 28769664, 'steps': 149841, 'loss/train': 1.5703895092010498} 11/07/2021 18:18:46 - INFO - __main__ - Step 149843: {'lr': 1.4060478298327794e-09, 'samples': 28769856, 'steps': 149842, 'loss/train': 1.3922572135925293} 11/07/2021 18:18:47 - INFO - __main__ - Step 149844: {'lr': 1.3883060954733572e-09, 'samples': 28770048, 'steps': 149843, 'loss/train': 1.318890929222107} 11/07/2021 18:18:47 - INFO - __main__ - Step 149845: {'lr': 1.3706770066179707e-09, 'samples': 28770240, 'steps': 149844, 'loss/train': 1.1160167455673218} 11/07/2021 18:18:47 - INFO - __main__ - Step 149846: {'lr': 1.3531605633221312e-09, 'samples': 28770432, 'steps': 149845, 'loss/train': 1.086296796798706} 11/07/2021 18:18:48 - INFO - __main__ - Step 149847: {'lr': 1.3357567655303271e-09, 'samples': 28770624, 'steps': 149846, 'loss/train': 0.43890175223350525} 11/07/2021 18:18:48 - INFO - __main__ - Step 149848: {'lr': 1.31846561329807e-09, 'samples': 28770816, 'steps': 149847, 'loss/train': 1.351232886314392} 11/07/2021 18:18:49 - INFO - __main__ - Step 149849: {'lr': 1.301287106597604e-09, 'samples': 28771008, 'steps': 149848, 'loss/train': 1.476118803024292} 11/07/2021 18:18:49 - INFO - __main__ - Step 149850: {'lr': 1.2842212454566848e-09, 'samples': 28771200, 'steps': 149849, 'loss/train': 1.2895876169204712} 11/07/2021 18:18:50 - INFO - __main__ - Step 149851: {'lr': 1.2672680298753125e-09, 'samples': 28771392, 'steps': 149850, 'loss/train': 1.0479518175125122} 11/07/2021 18:18:50 - INFO - __main__ - Step 149852: {'lr': 1.2504274598812426e-09, 'samples': 28771584, 'steps': 149851, 'loss/train': 1.3307799100875854} 11/07/2021 18:18:50 - INFO - __main__ - Step 149853: {'lr': 1.2336995354467196e-09, 'samples': 28771776, 'steps': 149852, 'loss/train': 1.1900957822799683} 11/07/2021 18:18:51 - INFO - __main__ - Step 149854: {'lr': 1.2170842566272544e-09, 'samples': 28771968, 'steps': 149853, 'loss/train': 0.7333908677101135} 11/07/2021 18:18:52 - INFO - __main__ - Step 149855: {'lr': 1.200581623367336e-09, 'samples': 28772160, 'steps': 149854, 'loss/train': 1.3482240438461304} 11/07/2021 18:18:52 - INFO - __main__ - Step 149856: {'lr': 1.1841916357224758e-09, 'samples': 28772352, 'steps': 149855, 'loss/train': 1.1148765087127686} 11/07/2021 18:18:53 - INFO - __main__ - Step 149857: {'lr': 1.1679142936926735e-09, 'samples': 28772544, 'steps': 149856, 'loss/train': 1.597974181175232} 11/07/2021 18:18:53 - INFO - __main__ - Step 149858: {'lr': 1.151749597277929e-09, 'samples': 28772736, 'steps': 149857, 'loss/train': 1.2361242771148682} 11/07/2021 18:18:54 - INFO - __main__ - Step 149859: {'lr': 1.1356975464782427e-09, 'samples': 28772928, 'steps': 149858, 'loss/train': 1.2721539735794067} 11/07/2021 18:18:54 - INFO - __main__ - Step 149860: {'lr': 1.1197581413213698e-09, 'samples': 28773120, 'steps': 149859, 'loss/train': 1.073218822479248} 11/07/2021 18:18:55 - INFO - __main__ - Step 149861: {'lr': 1.1039313818073105e-09, 'samples': 28773312, 'steps': 149860, 'loss/train': 1.0816869735717773} 11/07/2021 18:18:55 - INFO - __main__ - Step 149862: {'lr': 1.088217267908309e-09, 'samples': 28773504, 'steps': 149861, 'loss/train': 1.202623963356018} 11/07/2021 18:18:55 - INFO - __main__ - Step 149863: {'lr': 1.0726157996798768e-09, 'samples': 28773696, 'steps': 149862, 'loss/train': 1.4477200508117676} 11/07/2021 18:18:56 - INFO - __main__ - Step 149864: {'lr': 1.0571269771220137e-09, 'samples': 28773888, 'steps': 149863, 'loss/train': 1.2748726606369019} 11/07/2021 18:18:57 - INFO - __main__ - Step 149865: {'lr': 1.0417508002069642e-09, 'samples': 28774080, 'steps': 149864, 'loss/train': 1.381597638130188} 11/07/2021 18:18:57 - INFO - __main__ - Step 149866: {'lr': 1.0264872689902394e-09, 'samples': 28774272, 'steps': 149865, 'loss/train': 0.5057306885719299} 11/07/2021 18:18:57 - INFO - __main__ - Step 149867: {'lr': 1.011336383416328e-09, 'samples': 28774464, 'steps': 149866, 'loss/train': 1.147086262702942} 11/07/2021 18:18:58 - INFO - __main__ - Step 149868: {'lr': 9.96298143568497e-10, 'samples': 28774656, 'steps': 149867, 'loss/train': 0.8758807182312012} 11/07/2021 18:18:59 - INFO - __main__ - Step 149869: {'lr': 9.81372549391235e-10, 'samples': 28774848, 'steps': 149868, 'loss/train': 1.0896469354629517} 11/07/2021 18:18:59 - INFO - __main__ - Step 149870: {'lr': 9.665596009122978e-10, 'samples': 28775040, 'steps': 149869, 'loss/train': 1.3298932313919067} 11/07/2021 18:18:59 - INFO - __main__ - Step 149871: {'lr': 9.518592981594409e-10, 'samples': 28775232, 'steps': 149870, 'loss/train': 1.2771745920181274} 11/07/2021 18:19:00 - INFO - __main__ - Step 149872: {'lr': 9.372716411049087e-10, 'samples': 28775424, 'steps': 149871, 'loss/train': 1.169695258140564} 11/07/2021 18:19:00 - INFO - __main__ - Step 149873: {'lr': 9.227966297764568e-10, 'samples': 28775616, 'steps': 149872, 'loss/train': 1.4689629077911377} 11/07/2021 18:19:01 - INFO - __main__ - Step 149874: {'lr': 9.08434264174085e-10, 'samples': 28775808, 'steps': 149873, 'loss/train': 1.1177003383636475} 11/07/2021 18:19:02 - INFO - __main__ - Step 149875: {'lr': 8.941845442977936e-10, 'samples': 28776000, 'steps': 149874, 'loss/train': 1.3052916526794434} 11/07/2021 18:19:02 - INFO - __main__ - Step 149876: {'lr': 8.800474701475824e-10, 'samples': 28776192, 'steps': 149875, 'loss/train': 0.46474963426589966} 11/07/2021 18:19:02 - INFO - __main__ - Step 149877: {'lr': 8.660230417789627e-10, 'samples': 28776384, 'steps': 149876, 'loss/train': 0.988707959651947} 11/07/2021 18:19:03 - INFO - __main__ - Step 149878: {'lr': 8.521112591364233e-10, 'samples': 28776576, 'steps': 149877, 'loss/train': 1.1279821395874023} 11/07/2021 18:19:04 - INFO - __main__ - Step 149879: {'lr': 8.383121222477197e-10, 'samples': 28776768, 'steps': 149878, 'loss/train': 0.8121131658554077} 11/07/2021 18:19:04 - INFO - __main__ - Step 149880: {'lr': 8.246256311406076e-10, 'samples': 28776960, 'steps': 149879, 'loss/train': 1.4123514890670776} 11/07/2021 18:19:04 - INFO - __main__ - Step 149881: {'lr': 8.110517858150867e-10, 'samples': 28777152, 'steps': 149880, 'loss/train': 1.4103877544403076} 11/07/2021 18:19:05 - INFO - __main__ - Step 149882: {'lr': 7.975905862434018e-10, 'samples': 28777344, 'steps': 149881, 'loss/train': 1.303497076034546} 11/07/2021 18:19:05 - INFO - __main__ - Step 149883: {'lr': 7.842420324533084e-10, 'samples': 28777536, 'steps': 149882, 'loss/train': 1.2018780708312988} 11/07/2021 18:19:06 - INFO - __main__ - Step 149884: {'lr': 7.710061244725619e-10, 'samples': 28777728, 'steps': 149883, 'loss/train': 0.815712571144104} 11/07/2021 18:19:07 - INFO - __main__ - Step 149885: {'lr': 7.578828622734069e-10, 'samples': 28777920, 'steps': 149884, 'loss/train': 1.3020349740982056} 11/07/2021 18:19:07 - INFO - __main__ - Step 149886: {'lr': 7.448722458558433e-10, 'samples': 28778112, 'steps': 149885, 'loss/train': 1.1151938438415527} 11/07/2021 18:19:07 - INFO - __main__ - Step 149887: {'lr': 7.319742752753822e-10, 'samples': 28778304, 'steps': 149886, 'loss/train': 1.9748057126998901} 11/07/2021 18:19:08 - INFO - __main__ - Step 149888: {'lr': 7.191889505042682e-10, 'samples': 28778496, 'steps': 149887, 'loss/train': 1.451596975326538} 11/07/2021 18:19:09 - INFO - __main__ - Step 149889: {'lr': 7.065162715425011e-10, 'samples': 28778688, 'steps': 149888, 'loss/train': 1.4988290071487427} 11/07/2021 18:19:09 - INFO - __main__ - Step 149890: {'lr': 6.939562383900811e-10, 'samples': 28778880, 'steps': 149889, 'loss/train': 2.156938314437866} 11/07/2021 18:19:10 - INFO - __main__ - Step 149891: {'lr': 6.815088511025192e-10, 'samples': 28779072, 'steps': 149890, 'loss/train': 1.2833770513534546} 11/07/2021 18:19:10 - INFO - __main__ - Step 149892: {'lr': 6.691741096243043e-10, 'samples': 28779264, 'steps': 149891, 'loss/train': 1.4535033702850342} 11/07/2021 18:19:11 - INFO - __main__ - Step 149893: {'lr': 6.569520139831919e-10, 'samples': 28779456, 'steps': 149892, 'loss/train': 1.2047566175460815} 11/07/2021 18:19:11 - INFO - __main__ - Step 149894: {'lr': 6.448425642069378e-10, 'samples': 28779648, 'steps': 149893, 'loss/train': 1.3084232807159424} 11/07/2021 18:19:11 - INFO - __main__ - Step 149895: {'lr': 6.328457602677862e-10, 'samples': 28779840, 'steps': 149894, 'loss/train': 1.4425626993179321} 11/07/2021 18:19:12 - INFO - __main__ - Step 149896: {'lr': 6.209616021934927e-10, 'samples': 28780032, 'steps': 149895, 'loss/train': 1.4988442659378052} 11/07/2021 18:19:13 - INFO - __main__ - Step 149897: {'lr': 6.091900899840575e-10, 'samples': 28780224, 'steps': 149896, 'loss/train': 1.2663451433181763} 11/07/2021 18:19:13 - INFO - __main__ - Step 149898: {'lr': 5.975312236394803e-10, 'samples': 28780416, 'steps': 149897, 'loss/train': 1.4070396423339844} 11/07/2021 18:19:13 - INFO - __main__ - Step 149899: {'lr': 5.859850031597613e-10, 'samples': 28780608, 'steps': 149898, 'loss/train': 1.2804877758026123} 11/07/2021 18:19:14 - INFO - __main__ - Step 149900: {'lr': 5.745514285726561e-10, 'samples': 28780800, 'steps': 149899, 'loss/train': 1.6253581047058105} 11/07/2021 18:19:14 - INFO - __main__ - Step 149901: {'lr': 5.63230499850409e-10, 'samples': 28780992, 'steps': 149900, 'loss/train': 1.3286832571029663} 11/07/2021 18:19:15 - INFO - __main__ - Step 149902: {'lr': 5.520222170485312e-10, 'samples': 28781184, 'steps': 149901, 'loss/train': 0.9773249626159668} 11/07/2021 18:19:15 - INFO - __main__ - Step 149903: {'lr': 5.409265801115116e-10, 'samples': 28781376, 'steps': 149902, 'loss/train': 1.3544098138809204} 11/07/2021 18:19:16 - INFO - __main__ - Step 149904: {'lr': 5.299435890671056e-10, 'samples': 28781568, 'steps': 149903, 'loss/train': 1.0398155450820923} 11/07/2021 18:19:16 - INFO - __main__ - Step 149905: {'lr': 5.19073243943069e-10, 'samples': 28781760, 'steps': 149904, 'loss/train': 1.4319241046905518} 11/07/2021 18:19:16 - INFO - __main__ - Step 149906: {'lr': 5.083155447394017e-10, 'samples': 28781952, 'steps': 149905, 'loss/train': 1.0280025005340576} 11/07/2021 18:19:18 - INFO - __main__ - Step 149907: {'lr': 4.976704914283481e-10, 'samples': 28782144, 'steps': 149906, 'loss/train': 1.3123282194137573} 11/07/2021 18:19:18 - INFO - __main__ - Step 149908: {'lr': 4.871380840376637e-10, 'samples': 28782336, 'steps': 149907, 'loss/train': 1.1383570432662964} 11/07/2021 18:19:18 - INFO - __main__ - Step 149909: {'lr': 4.767183225673488e-10, 'samples': 28782528, 'steps': 149908, 'loss/train': 1.309288740158081} 11/07/2021 18:19:19 - INFO - __main__ - Step 149910: {'lr': 4.664112070451587e-10, 'samples': 28782720, 'steps': 149909, 'loss/train': 1.2481701374053955} 11/07/2021 18:19:19 - INFO - __main__ - Step 149911: {'lr': 4.562167374433379e-10, 'samples': 28782912, 'steps': 149910, 'loss/train': 1.3195956945419312} 11/07/2021 18:19:20 - INFO - __main__ - Step 149912: {'lr': 4.461349137618864e-10, 'samples': 28783104, 'steps': 149911, 'loss/train': 1.228426218032837} 11/07/2021 18:19:20 - INFO - __main__ - Step 149913: {'lr': 4.361657360285598e-10, 'samples': 28783296, 'steps': 149912, 'loss/train': 1.5523929595947266} 11/07/2021 18:19:21 - INFO - __main__ - Step 149914: {'lr': 4.2630920427111365e-10, 'samples': 28783488, 'steps': 149913, 'loss/train': 1.0380946397781372} 11/07/2021 18:19:21 - INFO - __main__ - Step 149915: {'lr': 4.165653184340368e-10, 'samples': 28783680, 'steps': 149914, 'loss/train': 1.205806851387024} 11/07/2021 18:19:21 - INFO - __main__ - Step 149916: {'lr': 4.069340785450848e-10, 'samples': 28783872, 'steps': 149915, 'loss/train': 1.3984349966049194} 11/07/2021 18:19:22 - INFO - __main__ - Step 149917: {'lr': 3.9741548463201325e-10, 'samples': 28784064, 'steps': 149916, 'loss/train': 0.3723243474960327} 11/07/2021 18:19:23 - INFO - __main__ - Step 149918: {'lr': 3.880095366670666e-10, 'samples': 28784256, 'steps': 149917, 'loss/train': 1.0860626697540283} 11/07/2021 18:19:23 - INFO - __main__ - Step 149919: {'lr': 3.7871623470575603e-10, 'samples': 28784448, 'steps': 149918, 'loss/train': 1.0640099048614502} 11/07/2021 18:19:23 - INFO - __main__ - Step 149920: {'lr': 3.695355786648147e-10, 'samples': 28784640, 'steps': 149919, 'loss/train': 1.0588749647140503} 11/07/2021 18:19:24 - INFO - __main__ - Step 149921: {'lr': 3.604675686275094e-10, 'samples': 28784832, 'steps': 149920, 'loss/train': 0.9744957089424133} 11/07/2021 18:19:25 - INFO - __main__ - Step 149922: {'lr': 3.515122045660846e-10, 'samples': 28785024, 'steps': 149921, 'loss/train': 0.7054277658462524} 11/07/2021 18:19:25 - INFO - __main__ - Step 149923: {'lr': 3.4266948650829576e-10, 'samples': 28785216, 'steps': 149922, 'loss/train': 0.8958981037139893} 11/07/2021 18:19:26 - INFO - __main__ - Step 149924: {'lr': 3.3393941442638743e-10, 'samples': 28785408, 'steps': 149923, 'loss/train': 1.462368130683899} 11/07/2021 18:19:26 - INFO - __main__ - Step 149925: {'lr': 3.253219883203595e-10, 'samples': 28785600, 'steps': 149924, 'loss/train': 0.9910423159599304} 11/07/2021 18:19:26 - INFO - __main__ - Step 149926: {'lr': 3.1681720821796766e-10, 'samples': 28785792, 'steps': 149925, 'loss/train': 0.475909948348999} 11/07/2021 18:19:27 - INFO - __main__ - Step 149927: {'lr': 3.0842507411921185e-10, 'samples': 28785984, 'steps': 149926, 'loss/train': 0.9773409962654114} 11/07/2021 18:19:28 - INFO - __main__ - Step 149928: {'lr': 3.00145586024092e-10, 'samples': 28786176, 'steps': 149927, 'loss/train': 1.027771234512329} 11/07/2021 18:19:28 - INFO - __main__ - Step 149929: {'lr': 2.9197874393260826e-10, 'samples': 28786368, 'steps': 149928, 'loss/train': 1.0203280448913574} 11/07/2021 18:19:28 - INFO - __main__ - Step 149930: {'lr': 2.8392454784476053e-10, 'samples': 28786560, 'steps': 149929, 'loss/train': 1.2449990510940552} 11/07/2021 18:19:29 - INFO - __main__ - Step 149931: {'lr': 2.7598299778830437e-10, 'samples': 28786752, 'steps': 149930, 'loss/train': 1.5051031112670898} 11/07/2021 18:19:30 - INFO - __main__ - Step 149932: {'lr': 2.681540937354843e-10, 'samples': 28786944, 'steps': 149931, 'loss/train': 1.1587164402008057} 11/07/2021 18:19:30 - INFO - __main__ - Step 149933: {'lr': 2.6043783571405576e-10, 'samples': 28787136, 'steps': 149932, 'loss/train': 0.6971631050109863} 11/07/2021 18:19:30 - INFO - __main__ - Step 149934: {'lr': 2.528342236962633e-10, 'samples': 28787328, 'steps': 149933, 'loss/train': 1.905118465423584} 11/07/2021 18:19:31 - INFO - __main__ - Step 149935: {'lr': 2.45343257737618e-10, 'samples': 28787520, 'steps': 149934, 'loss/train': 0.563270092010498} 11/07/2021 18:19:31 - INFO - __main__ - Step 149936: {'lr': 2.3796493778260873e-10, 'samples': 28787712, 'steps': 149935, 'loss/train': 0.7659665942192078} 11/07/2021 18:19:32 - INFO - __main__ - Step 149937: {'lr': 2.3069926388674667e-10, 'samples': 28787904, 'steps': 149936, 'loss/train': 1.7680420875549316} 11/07/2021 18:19:33 - INFO - __main__ - Step 149938: {'lr': 2.235462360222762e-10, 'samples': 28788096, 'steps': 149937, 'loss/train': 0.9041074514389038} 11/07/2021 18:19:33 - INFO - __main__ - Step 149939: {'lr': 2.1650585418919733e-10, 'samples': 28788288, 'steps': 149938, 'loss/train': 1.2944459915161133} 11/07/2021 18:19:33 - INFO - __main__ - Step 149940: {'lr': 2.0957811841526563e-10, 'samples': 28788480, 'steps': 149939, 'loss/train': 1.3153711557388306} 11/07/2021 18:19:34 - INFO - __main__ - Step 149941: {'lr': 2.0276302867272557e-10, 'samples': 28788672, 'steps': 149940, 'loss/train': 1.400766372680664} 11/07/2021 18:19:34 - INFO - __main__ - Step 149942: {'lr': 1.9606058498933267e-10, 'samples': 28788864, 'steps': 149941, 'loss/train': 1.5655077695846558} 11/07/2021 18:19:35 - INFO - __main__ - Step 149943: {'lr': 1.8947078736508693e-10, 'samples': 28789056, 'steps': 149942, 'loss/train': 1.1063365936279297} 11/07/2021 18:19:35 - INFO - __main__ - Step 149944: {'lr': 1.829936357999884e-10, 'samples': 28789248, 'steps': 149943, 'loss/train': 1.3283336162567139} 11/07/2021 18:19:36 - INFO - __main__ - Step 149945: {'lr': 1.7662913029403706e-10, 'samples': 28789440, 'steps': 149944, 'loss/train': 0.23578238487243652} 11/07/2021 18:19:36 - INFO - __main__ - Step 149946: {'lr': 1.7037727081947728e-10, 'samples': 28789632, 'steps': 149945, 'loss/train': 0.8468477725982666} 11/07/2021 18:19:36 - INFO - __main__ - Step 149947: {'lr': 1.6423805745957586e-10, 'samples': 28789824, 'steps': 149946, 'loss/train': 1.1620464324951172} 11/07/2021 18:19:37 - INFO - __main__ - Step 149948: {'lr': 1.5821149013106605e-10, 'samples': 28790016, 'steps': 149947, 'loss/train': 0.44574683904647827} 11/07/2021 18:19:38 - INFO - __main__ - Step 149949: {'lr': 1.5229756888945901e-10, 'samples': 28790208, 'steps': 149948, 'loss/train': 1.3174282312393188} 11/07/2021 18:19:38 - INFO - __main__ - Step 149950: {'lr': 1.4649629370699914e-10, 'samples': 28790400, 'steps': 149949, 'loss/train': 1.2953237295150757} 11/07/2021 18:19:38 - INFO - __main__ - Step 149951: {'lr': 1.40807664611442e-10, 'samples': 28790592, 'steps': 149950, 'loss/train': 1.3526259660720825} 11/07/2021 18:19:39 - INFO - __main__ - Step 149952: {'lr': 1.3523168160278765e-10, 'samples': 28790784, 'steps': 149951, 'loss/train': 1.6790889501571655} 11/07/2021 18:19:40 - INFO - __main__ - Step 149953: {'lr': 1.2976834468103606e-10, 'samples': 28790976, 'steps': 149952, 'loss/train': 1.2103965282440186} 11/07/2021 18:19:40 - INFO - __main__ - Step 149954: {'lr': 1.2441765381843163e-10, 'samples': 28791168, 'steps': 149953, 'loss/train': 1.5458208322525024} 11/07/2021 18:19:41 - INFO - __main__ - Step 149955: {'lr': 1.1917960904272995e-10, 'samples': 28791360, 'steps': 149954, 'loss/train': 1.4826616048812866} 11/07/2021 18:19:41 - INFO - __main__ - Step 149956: {'lr': 1.1405421038168662e-10, 'samples': 28791552, 'steps': 149955, 'loss/train': 2.3440375328063965} 11/07/2021 18:19:41 - INFO - __main__ - Step 149957: {'lr': 1.0904145777979047e-10, 'samples': 28791744, 'steps': 149956, 'loss/train': 1.0425513982772827} 11/07/2021 18:19:42 - INFO - __main__ - Step 149958: {'lr': 1.0414135129255265e-10, 'samples': 28791936, 'steps': 149957, 'loss/train': 0.9350542426109314} 11/07/2021 18:19:43 - INFO - __main__ - Step 149959: {'lr': 9.935389089221758e-11, 'samples': 28792128, 'steps': 149958, 'loss/train': 1.3979809284210205} 11/07/2021 18:19:43 - INFO - __main__ - Step 149960: {'lr': 9.467907660654085e-11, 'samples': 28792320, 'steps': 149959, 'loss/train': 1.7046674489974976} 11/07/2021 18:19:43 - INFO - __main__ - Step 149961: {'lr': 9.011690840776687e-11, 'samples': 28792512, 'steps': 149960, 'loss/train': 0.7871621251106262} 11/07/2021 18:19:44 - INFO - __main__ - Step 149962: {'lr': 8.566738629589565e-11, 'samples': 28792704, 'steps': 149961, 'loss/train': 1.1058440208435059} 11/07/2021 18:19:45 - INFO - __main__ - Step 149963: {'lr': 8.133051029868277e-11, 'samples': 28792896, 'steps': 149962, 'loss/train': 1.1972365379333496} 11/07/2021 18:19:45 - INFO - __main__ - Step 149964: {'lr': 7.710628041612822e-11, 'samples': 28793088, 'steps': 149963, 'loss/train': 1.5489463806152344} 11/07/2021 18:19:46 - INFO - __main__ - Step 149965: {'lr': 7.2994696648232e-11, 'samples': 28793280, 'steps': 149964, 'loss/train': 1.0489290952682495} 11/07/2021 18:19:46 - INFO - __main__ - Step 149966: {'lr': 6.899575896723852e-11, 'samples': 28793472, 'steps': 149965, 'loss/train': 1.4233659505844116} 11/07/2021 18:19:46 - INFO - __main__ - Step 149967: {'lr': 6.51094674009034e-11, 'samples': 28793664, 'steps': 149966, 'loss/train': 1.4321084022521973} 11/07/2021 18:19:47 - INFO - __main__ - Step 149968: {'lr': 6.13358219492266e-11, 'samples': 28793856, 'steps': 149967, 'loss/train': 1.2422442436218262} 11/07/2021 18:19:48 - INFO - __main__ - Step 149969: {'lr': 5.767482261220813e-11, 'samples': 28794048, 'steps': 149968, 'loss/train': 1.2096813917160034} 11/07/2021 18:19:48 - INFO - __main__ - Step 149970: {'lr': 5.4126469417603575e-11, 'samples': 28794240, 'steps': 149969, 'loss/train': 0.5047084093093872} 11/07/2021 18:19:48 - INFO - __main__ - Step 149971: {'lr': 5.069076230990177e-11, 'samples': 28794432, 'steps': 149970, 'loss/train': 1.349865198135376} 11/07/2021 18:19:49 - INFO - __main__ - Step 149972: {'lr': 4.7367701316858304e-11, 'samples': 28794624, 'steps': 149971, 'loss/train': 1.4956614971160889} 11/07/2021 18:19:49 - INFO - __main__ - Step 149973: {'lr': 4.415728646622874e-11, 'samples': 28794816, 'steps': 149972, 'loss/train': 1.3009979724884033} 11/07/2021 18:19:50 - INFO - __main__ - Step 149974: {'lr': 4.105951773025751e-11, 'samples': 28795008, 'steps': 149973, 'loss/train': 0.39447641372680664} 11/07/2021 18:19:51 - INFO - __main__ - Step 149975: {'lr': 3.807439510894461e-11, 'samples': 28795200, 'steps': 149974, 'loss/train': 1.2742643356323242} 11/07/2021 18:19:51 - INFO - __main__ - Step 149976: {'lr': 3.5201918630045624e-11, 'samples': 28795392, 'steps': 149975, 'loss/train': 1.301152229309082} 11/07/2021 18:19:51 - INFO - __main__ - Step 149977: {'lr': 3.244208826580497e-11, 'samples': 28795584, 'steps': 149976, 'loss/train': 1.3312891721725464} 11/07/2021 18:19:52 - INFO - __main__ - Step 149978: {'lr': 2.979490404397822e-11, 'samples': 28795776, 'steps': 149977, 'loss/train': 1.3207296133041382} 11/07/2021 18:19:53 - INFO - __main__ - Step 149979: {'lr': 2.7260365936809805e-11, 'samples': 28795968, 'steps': 149978, 'loss/train': 0.836780309677124} 11/07/2021 18:19:54 - INFO - __main__ - Step 149980: {'lr': 2.48384739720553e-11, 'samples': 28796160, 'steps': 149979, 'loss/train': 1.2203502655029297} 11/07/2021 18:19:54 - INFO - __main__ - Step 149981: {'lr': 2.252922812195912e-11, 'samples': 28796352, 'steps': 149980, 'loss/train': 0.07532329857349396} 11/07/2021 18:19:54 - INFO - __main__ - Step 149982: {'lr': 2.0332628414276854e-11, 'samples': 28796544, 'steps': 149981, 'loss/train': 1.255293607711792} 11/07/2021 18:19:55 - INFO - __main__ - Step 149983: {'lr': 1.824867482125292e-11, 'samples': 28796736, 'steps': 149982, 'loss/train': 1.1742411851882935} 11/07/2021 18:19:56 - INFO - __main__ - Step 149984: {'lr': 1.627736737064289e-11, 'samples': 28796928, 'steps': 149983, 'loss/train': 1.4772884845733643} 11/07/2021 18:19:56 - INFO - __main__ - Step 149985: {'lr': 1.4418706062446774e-11, 'samples': 28797120, 'steps': 149984, 'loss/train': 1.1910622119903564} 11/07/2021 18:19:56 - INFO - __main__ - Step 149986: {'lr': 1.2672690896664563e-11, 'samples': 28797312, 'steps': 149985, 'loss/train': 1.0476105213165283} 11/07/2021 18:19:57 - INFO - __main__ - Step 149987: {'lr': 1.103932187329626e-11, 'samples': 28797504, 'steps': 149986, 'loss/train': 1.1371670961380005} 11/07/2021 18:19:57 - INFO - __main__ - Step 149988: {'lr': 9.518598964586289e-12, 'samples': 28797696, 'steps': 149987, 'loss/train': 1.425508975982666} 11/07/2021 18:19:57 - INFO - __main__ - Step 149989: {'lr': 8.110522198290227e-12, 'samples': 28797888, 'steps': 149988, 'loss/train': 1.0216255187988281} 11/07/2021 18:19:58 - INFO - __main__ - Step 149990: {'lr': 6.815091574408072e-12, 'samples': 28798080, 'steps': 149989, 'loss/train': 0.9487575888633728} 11/07/2021 18:19:59 - INFO - __main__ - Step 149991: {'lr': 5.632307092939826e-12, 'samples': 28798272, 'steps': 149990, 'loss/train': 0.786053478717804} 11/07/2021 18:19:59 - INFO - __main__ - Step 149992: {'lr': 4.562168753885487e-12, 'samples': 28798464, 'steps': 149991, 'loss/train': 1.469866394996643} 11/07/2021 18:19:59 - INFO - __main__ - Step 149993: {'lr': 3.604676557245057e-12, 'samples': 28798656, 'steps': 149992, 'loss/train': 1.065239429473877} 11/07/2021 18:20:00 - INFO - __main__ - Step 149994: {'lr': 2.759830475262959e-12, 'samples': 28798848, 'steps': 149993, 'loss/train': 1.2347151041030884} 11/07/2021 18:20:01 - INFO - __main__ - Step 149995: {'lr': 2.0276305634503443e-12, 'samples': 28799040, 'steps': 149994, 'loss/train': 1.1358357667922974} 11/07/2021 18:20:01 - INFO - __main__ - Step 149996: {'lr': 1.4080767662960624e-12, 'samples': 28799232, 'steps': 149995, 'loss/train': 0.9665434956550598} 11/07/2021 18:20:02 - INFO - __main__ - Step 149997: {'lr': 9.011691393112642e-13, 'samples': 28799424, 'steps': 149996, 'loss/train': 5.6942620277404785} 11/07/2021 18:20:02 - INFO - __main__ - Step 149998: {'lr': 5.069076269847983e-13, 'samples': 28799616, 'steps': 149997, 'loss/train': 1.498703122138977} 11/07/2021 18:20:02 - INFO - __main__ - Step 149999: {'lr': 2.2529228482781605e-13, 'samples': 28799808, 'steps': 149998, 'loss/train': 0.8272415399551392} 11/07/2021 18:20:03 - INFO - __main__ - Step 150000: {'lr': 5.63230573291662e-14, 'samples': 28800000, 'steps': 149999, 'loss/train': 1.469771146774292} 11/07/2021 18:20:04 - INFO - __main__ - Evaluating and saving model checkpoint 11/07/2021 18:23:20 - INFO - __main__ - Step 150000: {'loss/eval': 1.2280396223068237, 'perplexity': 3.414529323577881} 11/07/2021 18:23:37 - WARNING - huggingface_hub.repository - Several commits (10) will be pushed upstream. 11/07/2021 18:23:37 - WARNING - huggingface_hub.repository - The progress bars may be unreliable. 11/07/2021 18:24:03 - WARNING - huggingface_hub.repository - To https://huggingface.co/lvwerra/codeparrot-small 5724e17..190a13b proud-haze-135 -> proud-haze-135 11/07/2021 18:24:04 - INFO - __main__ - Evaluating and saving model after training 11/07/2021 18:27:20 - INFO - __main__ - Step 150000: {'loss/eval': 1.2280396223068237, 'perplexity': 3.414529323577881}